BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (131 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_A4WEW3 UPF0102 protein Ent638_3585 n=124 Tax=Enterobact... 161 5e-39 UniRef50_D1P1S8 Putative choloylglycine hydrolase n=5 Tax=Entero... 153 2e-36 UniRef50_C4ZD68 Putative uncharacterized protein n=1 Tax=Eubacte... 151 5e-36 UniRef50_A8SP33 Putative uncharacterized protein n=2 Tax=Clostri... 150 1e-35 UniRef50_A5KJ12 Putative uncharacterized protein n=2 Tax=Clostri... 149 2e-35 UniRef50_A9KLL5 UPF0102 protein Cphy_2398 n=3 Tax=Clostridiales ... 149 3e-35 UniRef50_UPI0001973CC8 hypothetical protein ClM62_15129 n=1 Tax=... 147 8e-35 UniRef50_C1SJ30 Putative uncharacterized protein n=1 Tax=Denitro... 147 8e-35 UniRef50_B5YIA9 UPF0102 protein THEYE_A1950 n=1 Tax=Thermodesulf... 147 1e-34 UniRef50_Q0AWX4 UPF0102 protein Swol_1475 n=1 Tax=Syntrophomonas... 146 2e-34 UniRef50_C4Z0D1 Putative endonuclease n=13 Tax=Clostridiales Rep... 146 3e-34 UniRef50_C2KW07 Putative uncharacterized protein n=1 Tax=Oribact... 146 3e-34 UniRef50_A8MHC8 UPF0102 protein Clos_1471 n=8 Tax=Clostridiaceae... 146 3e-34 UniRef50_C4V2D8 Endonuclease n=1 Tax=Selenomonas flueggei ATCC 4... 145 4e-34 UniRef50_A6FEY4 Putative uncharacterized protein n=1 Tax=Moritel... 145 4e-34 UniRef50_B0P3K1 Putative uncharacterized protein n=2 Tax=Clostri... 145 4e-34 UniRef50_Q5ZR89 UPF0102 protein lpg2994 n=4 Tax=Legionella RepID... 145 4e-34 UniRef50_C7R9K3 Putative uncharacterized protein n=1 Tax=Kangiel... 145 5e-34 UniRef50_Q1JX98 Putative uncharacterized protein n=1 Tax=Desulfu... 144 6e-34 UniRef50_C4L9F6 Putative uncharacterized protein n=1 Tax=Tolumon... 144 8e-34 UniRef50_A0KPY4 UPF0102 protein AHA_3896 n=4 Tax=Proteobacteria ... 144 1e-33 UniRef50_Q6AJE4 UPF0102 protein DP2807 n=1 Tax=Desulfotalea psyc... 144 1e-33 UniRef50_B0U003 UPF0102 protein Fphi_0415 n=15 Tax=Francisella R... 143 2e-33 UniRef50_Q47VU1 UPF0102 protein CPS_4433 n=1 Tax=Colwellia psych... 142 4e-33 UniRef50_C0QTY9 UPF0102 protein PERMA_0362 n=2 Tax=Hydrogenother... 142 4e-33 UniRef50_D1RD77 Putative uncharacterized protein n=1 Tax=Legione... 141 6e-33 UniRef50_D1VVL2 Putative uncharacterized protein n=1 Tax=Peptoni... 141 8e-33 UniRef50_A5F986 UPF0102 protein VC0395_A0112/VC395_0597 n=28 Tax... 141 8e-33 UniRef50_Q3JE65 UPF0102 protein Noc_0355 n=2 Tax=Nitrosococcus o... 141 8e-33 UniRef50_B2V8B3 UPF0102 protein SYO3AOP1_0546 n=2 Tax=Sulfurihyd... 140 8e-33 UniRef50_B8J1T1 Putative uncharacterized protein n=1 Tax=Desulfo... 140 9e-33 UniRef50_Q7N090 UPF0102 protein plu4003 n=2 Tax=Enterobacteriace... 140 9e-33 UniRef50_B8CW28 Putative uncharacterized protein n=1 Tax=Halothe... 140 1e-32 UniRef50_A9NAA4 UPF0102 protein COXBURSA331_A1934 n=6 Tax=Coxiel... 140 1e-32 UniRef50_D1KBC1 Putative uncharacterized protein n=1 Tax=uncultu... 140 1e-32 UniRef50_B7GSU4 UPF0102 protein Blon_1698 n=12 Tax=Bifidobacteri... 140 1e-32 UniRef50_C0N677 Putative uncharacterized protein n=1 Tax=Methylo... 140 2e-32 UniRef50_A1SU47 UPF0102 protein Ping_1176 n=2 Tax=Psychromonas R... 140 2e-32 UniRef50_C8X4T0 Putative uncharacterized protein n=1 Tax=Desulfo... 139 2e-32 UniRef50_B6ELI6 UPF0102 protein VSAL_I2655 n=7 Tax=Vibrionaceae ... 139 2e-32 UniRef50_C5BS52 Putative uncharacterized protein n=1 Tax=Teredin... 139 2e-32 UniRef50_A3XHA2 Putative uncharacterized protein n=1 Tax=Leeuwen... 139 3e-32 UniRef50_Q8R5S3 UPF0102 protein TTE1452 n=9 Tax=Thermoanaerobact... 139 3e-32 UniRef50_A0YER5 Putative uncharacterized protein n=1 Tax=marine ... 137 7e-32 UniRef50_B9MQX5 UPF0102 protein Athe_0977 n=1 Tax=Anaerocellum t... 137 7e-32 UniRef50_Q0VS15 UPF0102 protein ABO_0585 n=2 Tax=Alcanivorax Rep... 137 7e-32 UniRef50_Q1Q244 Putative uncharacterized protein n=1 Tax=Candida... 137 8e-32 UniRef50_Q67PD3 UPF0102 protein STH1475 n=1 Tax=Symbiobacterium ... 137 8e-32 UniRef50_C4GB01 Putative uncharacterized protein n=1 Tax=Shuttle... 137 8e-32 UniRef50_A4BQJ8 Putative uncharacterized protein n=1 Tax=Nitroco... 137 9e-32 UniRef50_B0G5Y9 Putative uncharacterized protein n=4 Tax=Clostri... 137 1e-31 UniRef50_Q2S9Y0 UPF0102 protein HCH_05895 n=1 Tax=Hahella chejue... 137 1e-31 UniRef50_A1AN88 UPF0102 protein Ppro_1186 n=11 Tax=Deltaproteoba... 137 1e-31 UniRef50_C0GQX2 Putative uncharacterized protein n=1 Tax=Desulfo... 137 1e-31 UniRef50_C6A8H5 Putative uncharacterized protein n=4 Tax=Bifidob... 137 1e-31 UniRef50_B5YFD1 UPF0102 protein DICTH_1420 n=2 Tax=Dictyoglomus ... 137 1e-31 UniRef50_C7RDQ1 Putative uncharacterized protein n=1 Tax=Anaeroc... 137 1e-31 UniRef50_B3EJJ5 UPF0102 protein Cphamn1_0017 n=1 Tax=Chlorobium ... 136 2e-31 UniRef50_A5N821 UPF0102 protein CKL_1410 n=16 Tax=Clostridium Re... 136 2e-31 UniRef50_Q7MNW2 UPF0102 protein VV0603 n=80 Tax=Vibrionales RepI... 136 2e-31 UniRef50_A1ZF36 Putative uncharacterized protein n=1 Tax=Microsc... 136 2e-31 UniRef50_D1CBL5 Putative uncharacterized protein n=1 Tax=Thermob... 136 2e-31 UniRef50_C9KJR5 Endonuclease n=2 Tax=Veillonellaceae RepID=C9KJR... 136 2e-31 UniRef50_A8SLV5 Putative uncharacterized protein n=1 Tax=Parvimo... 136 3e-31 UniRef50_B4RXI2 Sigma-54 factor n=2 Tax=Alteromonas macleodii Re... 135 3e-31 UniRef50_A3WNE6 Predicted endonuclease n=1 Tax=Idiomarina baltic... 135 3e-31 UniRef50_B7RSM7 Putative uncharacterized protein n=1 Tax=marine ... 135 3e-31 UniRef50_A6VB97 UPF0102 protein PSPA7_4996 n=17 Tax=Pseudomonada... 135 4e-31 UniRef50_C6WYK5 Putative uncharacterized protein n=1 Tax=Methylo... 135 4e-31 UniRef50_Q7P0B3 UPF0102 protein CV_0654 n=1 Tax=Chromobacterium ... 135 5e-31 UniRef50_B2KC57 Putative uncharacterized protein n=1 Tax=Elusimi... 135 5e-31 UniRef50_Q1GYY7 UPF0102 protein Mfla_2283 n=1 Tax=Methylobacillu... 134 6e-31 UniRef50_A8G183 UPF0102 protein Ssed_4252 n=16 Tax=Shewanella Re... 134 7e-31 UniRef50_Q0TPP8 UPF0102 protein CPF_1959 n=9 Tax=Clostridium per... 134 9e-31 UniRef50_Q31EY6 UPF0102 protein Tcr_1695 n=1 Tax=Thiomicrospira ... 134 9e-31 UniRef50_D0MIM6 Putative uncharacterized protein n=1 Tax=Rhodoth... 134 1e-30 UniRef50_B8DRI1 Putative uncharacterized protein n=2 Tax=Desulfo... 134 1e-30 UniRef50_A4J649 UPF0102 protein Dred_2035 n=1 Tax=Desulfotomacul... 134 1e-30 UniRef50_D0KYK3 Putative uncharacterized protein n=1 Tax=Halothi... 133 2e-30 UniRef50_A8PP71 Putative uncharacterized protein n=1 Tax=Rickett... 133 2e-30 UniRef50_C0YUE8 Possible endonuclease n=2 Tax=Flavobacteriaceae ... 133 2e-30 UniRef50_C8W5C0 Putative uncharacterized protein n=1 Tax=Desulfo... 132 3e-30 UniRef50_UPI0000510419 hypothetical protein BlinB_18076 n=1 Tax=... 132 3e-30 UniRef50_C6BVQ7 Putative uncharacterized protein n=1 Tax=Desulfo... 132 3e-30 UniRef50_Q2BGY7 Putative uncharacterized protein n=1 Tax=Neptuni... 132 3e-30 UniRef50_B2A2P1 UPF0102 protein Nther_1376 n=1 Tax=Natranaerobiu... 132 3e-30 UniRef50_D0I4Y5 Endonuclease n=1 Tax=Grimontia hollisae CIP 1018... 132 4e-30 UniRef50_Q1MRU7 UPF0102 protein LI0223 n=1 Tax=Lawsonia intracel... 132 4e-30 UniRef50_A0LV62 UPF0102 protein Acel_1550 n=8 Tax=Actinomycetale... 132 4e-30 UniRef50_A5EVA6 UPF0102 protein DNO_0639 n=2 Tax=Cardiobacteriac... 132 4e-30 UniRef50_A4FME3 UPF0102 protein SACE_6045 n=2 Tax=Actinomycetale... 132 4e-30 UniRef50_C2LNN6 Possible endonuclease n=3 Tax=Proteus RepID=C2LN... 132 5e-30 UniRef50_C9LW74 Endonuclease n=1 Tax=Selenomonas sputigena ATCC ... 132 5e-30 UniRef50_C3W9T3 Endonuclease n=4 Tax=Fusobacterium RepID=C3W9T3_... 131 6e-30 UniRef50_Q3IG11 UPF0102 protein PSHAa2523 n=3 Tax=Alteromonadale... 131 6e-30 UniRef50_A1SB01 UPF0102 protein Sama_3355 n=5 Tax=Shewanella Rep... 131 7e-30 UniRef50_A3M3I7 UPF0102 protein A1S_1049 n=12 Tax=Acinetobacter ... 131 7e-30 UniRef50_C8PPR7 HD domain protein n=1 Tax=Treponema vincentii AT... 131 8e-30 UniRef50_A1KWG5 UPF0102 protein NMC2069 n=28 Tax=Neisseriaceae R... 131 8e-30 UniRef50_B3JMU0 Putative uncharacterized protein n=6 Tax=Bactero... 130 1e-29 UniRef50_A6TRS2 UPF0102 protein Amet_2739 n=1 Tax=Alkaliphilus m... 130 1e-29 UniRef50_C4FFG3 Putative uncharacterized protein n=1 Tax=Bifidob... 130 2e-29 UniRef50_Q1NMK2 Putative uncharacterized protein n=1 Tax=delta p... 130 2e-29 UniRef50_C7HUU3 Endonuclease n=4 Tax=Anaerococcus RepID=C7HUU3_9... 130 2e-29 UniRef50_Q1MYA7 Putative uncharacterized protein n=1 Tax=Bermane... 129 2e-29 UniRef50_D1U5W3 Putative uncharacterized protein n=1 Tax=Desulfo... 129 2e-29 UniRef50_C6D2Y6 Putative uncharacterized protein n=1 Tax=Paeniba... 129 2e-29 UniRef50_B3QZF2 UPF0102 protein Ctha_1382 n=1 Tax=Chloroherpeton... 129 2e-29 UniRef50_C7N589 Predicted endonuclease related to Holliday junct... 129 3e-29 UniRef50_A8ZV12 UPF0102 protein Dole_2298 n=2 Tax=Desulfobactera... 129 3e-29 UniRef50_B3PLA2 Putative uncharacterized protein n=1 Tax=Cellvib... 129 3e-29 UniRef50_C4F8U2 Putative uncharacterized protein n=2 Tax=Collins... 129 3e-29 UniRef50_C0GHM5 Putative uncharacterized protein n=1 Tax=Dethiob... 129 3e-29 UniRef50_A5D1I2 UPF0102 protein PTH_1707 n=1 Tax=Pelotomaculum t... 129 3e-29 UniRef50_Q0AFH8 UPF0102 protein Neut_1662 n=2 Tax=Proteobacteria... 129 4e-29 UniRef50_A5Z6D1 Putative uncharacterized protein n=1 Tax=Eubacte... 129 4e-29 UniRef50_A6L1J0 UPF0102 protein BVU_1879 n=26 Tax=Bacteroidales ... 129 4e-29 UniRef50_D1NRZ9 Putative endonuclease n=1 Tax=Bifidobacterium ga... 129 4e-29 UniRef50_Q2YCL8 UPF0102 protein Nmul_A0195 n=3 Tax=Nitrosomonada... 129 4e-29 UniRef50_C7IKV6 Putative uncharacterized protein n=1 Tax=Clostri... 128 4e-29 UniRef50_D2SDZ8 Putative uncharacterized protein n=1 Tax=Geoderm... 128 5e-29 UniRef50_C4FZ58 Putative uncharacterized protein n=1 Tax=Abiotro... 128 5e-29 UniRef50_C2D6J2 Putative uncharacterized protein n=1 Tax=Atopobi... 128 6e-29 UniRef50_A6LSN5 UPF0102 protein Cbei_1183 n=5 Tax=Clostridium Re... 128 6e-29 UniRef50_A3N211 UPF0102 protein APL_1363 n=33 Tax=Pasteurellacea... 128 7e-29 UniRef50_C4K712 Putative uncharacterized protein n=1 Tax=Candida... 127 9e-29 UniRef50_A5FR87 UPF0102 protein DehaBAV1_0707 n=5 Tax=Dehalococc... 127 1e-28 UniRef50_B8HR21 Putative uncharacterized protein n=1 Tax=Cyanoth... 127 1e-28 UniRef50_B8KR63 Putative uncharacterized protein n=1 Tax=gamma p... 127 1e-28 UniRef50_C6IVU3 Putative uncharacterized protein n=1 Tax=Paeniba... 127 1e-28 UniRef50_UPI0001C37581 hypothetical protein RflaF_17327 n=1 Tax=... 127 1e-28 UniRef50_D2R2U4 Putative uncharacterized protein n=1 Tax=Pirellu... 127 1e-28 UniRef50_C9LM05 Endonuclease n=1 Tax=Dialister invisus DSM 15470... 127 1e-28 UniRef50_Q2S1J6 UPF0102 protein SRU_1822 n=1 Tax=Salinibacter ru... 127 1e-28 UniRef50_A4SC34 UPF0102 protein Cvib_0014 n=9 Tax=Chlorobiaceae ... 127 1e-28 UniRef50_B3ES88 Putative uncharacterized protein n=1 Tax=Candida... 127 1e-28 UniRef50_C8WAJ1 Putative uncharacterized protein n=1 Tax=Atopobi... 127 1e-28 UniRef50_C2G0R5 Possible endonuclease n=2 Tax=Sphingobacterium s... 127 1e-28 UniRef50_B0VJC5 Putative uncharacterized protein n=1 Tax=Candida... 126 2e-28 UniRef50_C2HKC4 Possible endonuclease n=2 Tax=Finegoldia magna R... 126 2e-28 UniRef50_C6XZ20 Putative uncharacterized protein n=2 Tax=Pedobac... 126 2e-28 UniRef50_C9MR57 Putative endonuclease n=2 Tax=Prevotella RepID=C... 126 2e-28 UniRef50_C8WGY4 Putative uncharacterized protein n=2 Tax=Eggerth... 126 2e-28 UniRef50_C7MNC2 Predicted endonuclease related to Holliday junct... 126 2e-28 UniRef50_Q15PJ2 UPF0102 protein Patl_3694 n=1 Tax=Pseudoalteromo... 126 2e-28 UniRef50_A3HX52 Putative uncharacterized protein n=1 Tax=Algorip... 126 2e-28 UniRef50_D1PKZ1 Putative choloylglycine hydrolase n=1 Tax=Subdol... 126 3e-28 UniRef50_Q3A2F1 UPF0102 protein Pcar_2217 n=2 Tax=Deltaproteobac... 126 3e-28 UniRef50_D0LAN4 Putative uncharacterized protein n=1 Tax=Gordoni... 126 3e-28 UniRef50_A1R7F9 UPF0102 protein AAur_2443 n=3 Tax=Micrococcaceae... 126 3e-28 UniRef50_C9R878 Putative uncharacterized protein n=1 Tax=Ammonif... 125 3e-28 UniRef50_Q8R616 UPF0102 protein FN1370 n=9 Tax=Fusobacterium Rep... 125 4e-28 UniRef50_C0ZFM4 Putative uncharacterized protein n=1 Tax=Breviba... 125 4e-28 UniRef50_C4XKL5 Putative uncharacterized protein n=1 Tax=Desulfo... 125 5e-28 UniRef50_C9MX50 Endonuclease n=2 Tax=Leptotrichia RepID=C9MX50_9... 125 6e-28 UniRef50_C1ZN06 Putative uncharacterized protein n=1 Tax=Plancto... 124 7e-28 UniRef50_A1K3T3 UPF0102 protein azo0871 n=1 Tax=Azoarcus sp. BH7... 124 7e-28 UniRef50_Q1QVF6 UPF0102 protein Csal_2201 n=1 Tax=Chromohalobact... 124 7e-28 UniRef50_A4AH12 Putative uncharacterized protein n=1 Tax=marine ... 124 7e-28 UniRef50_Q1LHS4 UPF0102 protein Rmet_3430 n=2 Tax=Betaproteobact... 124 8e-28 UniRef50_Q5R0L0 UPF0102 protein IL0423 n=1 Tax=Idiomarina loihie... 124 9e-28 UniRef50_B0TH88 Putative uncharacterized protein n=1 Tax=Helioba... 124 1e-27 UniRef50_Q1YQG9 Putative uncharacterized protein n=1 Tax=gamma p... 124 1e-27 UniRef50_B0C8B9 UPF0102 protein AM1_3954 n=1 Tax=Acaryochloris m... 124 1e-27 UniRef50_D2RIH7 Putative uncharacterized protein n=1 Tax=Acidami... 124 1e-27 UniRef50_UPI00016929A4 hypothetical protein Plarl_14719 n=1 Tax=... 124 1e-27 UniRef50_Q60CC4 UPF0102 protein MCA0184 n=1 Tax=Methylococcus ca... 124 1e-27 UniRef50_C1AG13 UPF0102 protein JTY_2914 n=20 Tax=Mycobacterium ... 124 1e-27 UniRef50_Q2JJU2 UPF0102 protein CYB_2119 n=3 Tax=Synechococcus R... 124 1e-27 UniRef50_B9ZKW9 Putative uncharacterized protein n=1 Tax=Thioalk... 123 1e-27 UniRef50_A4YJR8 UPF0102 protein BRADO0179 n=14 Tax=Rhizobiales R... 123 2e-27 UniRef50_C6VV91 Putative uncharacterized protein n=2 Tax=Flexiba... 123 2e-27 UniRef50_C8PW53 Putative uncharacterized protein n=1 Tax=Enhydro... 123 2e-27 UniRef50_B7K4B3 Putative uncharacterized protein n=4 Tax=Cyanoba... 123 2e-27 UniRef50_C7LP67 Putative uncharacterized protein n=1 Tax=Desulfo... 123 2e-27 UniRef50_A8UQV9 Putative uncharacterized protein n=1 Tax=Hydroge... 123 2e-27 UniRef50_A7NKS5 UPF0102 protein Rcas_2007 n=2 Tax=Roseiflexus Re... 123 2e-27 UniRef50_D0GLC9 Putative uncharacterized protein n=1 Tax=Leptotr... 123 2e-27 UniRef50_A3DDG4 UPF0102 protein Cthe_0758 n=6 Tax=Clostridia Rep... 122 3e-27 UniRef50_A5WCR1 UPF0102 protein PsycPRwf_0497 n=1 Tax=Psychrobac... 122 3e-27 UniRef50_A5FKL6 UPF0102 protein Fjoh_1217 n=17 Tax=Bacteroidetes... 122 3e-27 UniRef50_A3YDY3 Putative uncharacterized protein n=1 Tax=Marinom... 122 3e-27 UniRef50_C0VVC1 Endonuclease n=2 Tax=Corynebacterium glucuronoly... 122 3e-27 UniRef50_D2MIZ7 Putative uncharacterized protein n=1 Tax=Candida... 122 3e-27 UniRef50_B8G6B1 UPF0102 protein Cagg_0930 n=3 Tax=Chloroflexus R... 122 3e-27 UniRef50_C0EXX9 Putative uncharacterized protein n=1 Tax=Eubacte... 122 3e-27 UniRef50_D1SBF6 Putative uncharacterized protein n=1 Tax=Micromo... 122 4e-27 UniRef50_Q8XUC6 UPF0102 protein RSc3265 n=6 Tax=Proteobacteria R... 122 4e-27 UniRef50_A1VIW8 UPF0102 protein Pnap_0271 n=10 Tax=Burkholderial... 122 4e-27 UniRef50_Q11XW1 UPF0102 protein CHU_0465 n=1 Tax=Cytophaga hutch... 122 4e-27 UniRef50_C7MB82 Putative uncharacterized protein n=1 Tax=Brachyb... 122 4e-27 UniRef50_Q4FQF2 UPF0102 protein Psyc_1908 n=2 Tax=Psychrobacter ... 121 5e-27 UniRef50_D1W8G8 Putative uncharacterized protein n=1 Tax=Prevote... 121 6e-27 UniRef50_D2RAN4 Putative uncharacterized protein n=1 Tax=Gardner... 121 6e-27 UniRef50_C2KRF5 Possible endonuclease n=2 Tax=Mobiluncus mulieri... 121 6e-27 UniRef50_C9RJM0 Putative uncharacterized protein n=1 Tax=Fibroba... 121 6e-27 UniRef50_A1U3H0 UPF0102 protein Maqu_2464 n=3 Tax=Marinobacter R... 121 6e-27 UniRef50_Q313K2 UPF0102 protein Dde_1093 n=1 Tax=Desulfovibrio d... 121 6e-27 UniRef50_A6SUE7 UPF0102 protein mma_0204 n=4 Tax=Betaproteobacte... 121 6e-27 UniRef50_A6VXY8 UPF0102 protein Mmwyl1_2395 n=1 Tax=Marinomonas ... 121 8e-27 UniRef50_A0Z7D7 Putative uncharacterized protein n=1 Tax=marine ... 120 1e-26 UniRef50_A6GLM3 Putative uncharacterized protein n=1 Tax=Limnoba... 120 1e-26 UniRef50_Q2KU88 UPF0102 protein BAV3162 n=1 Tax=Bordetella avium... 120 1e-26 UniRef50_C7R327 Putative uncharacterized protein n=1 Tax=Jonesia... 120 1e-26 UniRef50_A0QVA9 UPF0102 protein MSMEG_2508 n=6 Tax=Corynebacteri... 120 1e-26 UniRef50_Q1D6H9 UPF0102 protein MXAN_3551 n=2 Tax=Cystobacterine... 120 1e-26 UniRef50_C2BVF9 Possible endonuclease n=1 Tax=Mobiluncus curtisi... 120 2e-26 UniRef50_C9LKQ6 Putative uncharacterized protein n=1 Tax=Prevote... 120 2e-26 UniRef50_A0YUK1 Putative uncharacterized protein n=2 Tax=Cyanoba... 120 2e-26 UniRef50_D1ANU1 Putative uncharacterized protein n=1 Tax=Sebalde... 120 2e-26 UniRef50_B0MPN2 Putative uncharacterized protein n=1 Tax=Eubacte... 119 2e-26 UniRef50_Q146Q2 UPF0102 protein Bxeno_A0149 n=9 Tax=Burkholderia... 119 2e-26 UniRef50_Q6A7T5 UPF0102 protein PPA1431 n=3 Tax=Propionibacteriu... 119 3e-26 UniRef50_Q8DI54 UPF0102 protein tll1737 n=1 Tax=Thermosynechococ... 119 3e-26 UniRef50_C9PT16 Endonuclease n=5 Tax=Prevotella RepID=C9PT16_9BACT 119 4e-26 UniRef50_B2S4F0 UPF0102 protein TPASS_0913 n=3 Tax=Treponema Rep... 119 4e-26 UniRef50_Q025A4 UPF0102 protein Acid_2433 n=1 Tax=Candidatus Sol... 118 5e-26 UniRef50_Q24UC6 UPF0102 protein DSY2577 n=2 Tax=Desulfitobacteri... 118 5e-26 UniRef50_Q3AC88 UPF0102 protein CHY_1414 n=1 Tax=Carboxydothermu... 118 7e-26 UniRef50_B1VG84 Putative uncharacterized protein n=1 Tax=Coryneb... 117 7e-26 UniRef50_Q6FD45 UPF0102 protein ACIAD1132 n=4 Tax=Acinetobacter ... 117 8e-26 UniRef50_C2BNU0 Endonuclease n=2 Tax=Corynebacterium RepID=C2BNU... 117 1e-25 UniRef50_A4A5E8 Putative uncharacterized protein n=2 Tax=unclass... 117 1e-25 UniRef50_C2CWR1 Endonuclease n=1 Tax=Gardnerella vaginalis ATCC ... 117 1e-25 UniRef50_Q6AEA8 UPF0102 protein Lxx14785 n=1 Tax=Leifsonia xyli ... 117 2e-25 UniRef50_B1XJM9 Putative uncharacterized protein n=1 Tax=Synecho... 117 2e-25 UniRef50_A6LF20 UPF0102 protein BDI_2565 n=4 Tax=Bacteroidales R... 116 2e-25 UniRef50_A0LCU4 UPF0102 protein Mmc1_3298 n=1 Tax=Magnetococcus ... 116 2e-25 UniRef50_B1WNM5 Putative uncharacterized protein n=3 Tax=Chrooco... 116 2e-25 UniRef50_Q55761 UPF0102 protein sll0189 n=1 Tax=Synechocystis sp... 116 2e-25 UniRef50_UPI00017886BD protein of unknown function UPF0102 n=1 T... 116 2e-25 UniRef50_A9B5H2 UPF0102 protein Haur_0145 n=1 Tax=Herpetosiphon ... 116 3e-25 UniRef50_UPI0001BC5BE0 endonuclease n=3 Tax=Fusobacterium RepID=... 115 3e-25 UniRef50_D1BMP5 Putative uncharacterized protein n=3 Tax=Veillon... 115 3e-25 UniRef50_A6NUN6 Putative uncharacterized protein n=1 Tax=Bactero... 115 4e-25 UniRef50_B0MUK7 Putative uncharacterized protein n=1 Tax=Alistip... 115 4e-25 UniRef50_D1BJ87 Predicted endonuclease related to Holliday junct... 115 4e-25 UniRef50_B4WKR8 Putative uncharacterized protein n=1 Tax=Synecho... 115 4e-25 UniRef50_C0WCB9 Endonuclease n=1 Tax=Acidaminococcus sp. D21 Rep... 115 5e-25 UniRef50_C0WKI9 Endonuclease n=3 Tax=Actinomycetales RepID=C0WKI... 115 6e-25 UniRef50_A1W341 UPF0102 protein Ajs_0414 n=11 Tax=Betaproteobact... 114 8e-25 UniRef50_A3NEP2 UPF0102 protein BURPS668_3819 n=83 Tax=Proteobac... 114 1e-24 UniRef50_A9HJH4 UPF0102 protein GDI1964/Gdia_0189 n=1 Tax=Glucon... 114 1e-24 UniRef50_C0BHK9 Putative uncharacterized protein n=1 Tax=Flavoba... 114 1e-24 UniRef50_C0E2B0 Putative uncharacterized protein n=2 Tax=Coryneb... 114 1e-24 UniRef50_A1VFE8 UPF0102 protein Dvul_2148 n=3 Tax=Desulfovibrio ... 113 1e-24 UniRef50_UPI000197ABB4 hypothetical protein GHTCC_11038 n=2 Tax=... 113 1e-24 UniRef50_B9CKG2 Putative uncharacterized protein n=1 Tax=Atopobi... 113 2e-24 UniRef50_D1AZ70 Putative uncharacterized protein n=1 Tax=Sulfuro... 113 2e-24 UniRef50_B4VZG7 Putative uncharacterized protein n=1 Tax=Microco... 113 2e-24 UniRef50_B8HA88 UPF0102 protein Achl_2213 n=1 Tax=Arthrobacter c... 113 2e-24 UniRef50_A4A171 Putative uncharacterized protein n=2 Tax=Plancto... 113 2e-24 UniRef50_C7PDZ7 Putative uncharacterized protein n=1 Tax=Chitino... 112 2e-24 UniRef50_C7GYG3 Putative choloylglycine hydrolase n=1 Tax=Eubact... 112 2e-24 UniRef50_Q2NZA5 UPF0102 protein XOO3617 n=18 Tax=Xanthomonadacea... 112 3e-24 UniRef50_A7HN69 UPF0102 protein Fnod_1509 n=1 Tax=Fervidobacteri... 112 3e-24 UniRef50_C1TKL9 Predicted endonuclease related to Holliday junct... 112 3e-24 UniRef50_A1SLR5 UPF0102 protein Noca_3248 n=11 Tax=Actinomycetal... 112 4e-24 UniRef50_A5IKG8 UPF0102 protein Tpet_0671 n=6 Tax=Thermotogaceae... 112 4e-24 UniRef50_C0QVG4 UPF0102 protein BHWA1_02005 n=2 Tax=Brachyspira ... 112 4e-24 UniRef50_D2ATE9 Putative uncharacterized protein n=1 Tax=Strepto... 112 5e-24 UniRef50_A0Q0X6 UPF0102 protein NT01CX_2205 n=3 Tax=Clostridium ... 112 5e-24 UniRef50_C6WEV0 Putative uncharacterized protein n=1 Tax=Actinos... 111 6e-24 UniRef50_Q47S60 UPF0102 protein Tfu_0669 n=2 Tax=Nocardiopsaceae... 111 8e-24 UniRef50_Q4JV13 UPF0102 protein jk1180 n=2 Tax=Corynebacterium j... 111 8e-24 UniRef50_C0BPF0 Putative uncharacterized protein n=1 Tax=Flavoba... 110 1e-23 UniRef50_Q1IJG5 UPF0102 protein Acid345_3985 n=1 Tax=Candidatus ... 110 1e-23 UniRef50_A6GIE5 Putative uncharacterized protein n=1 Tax=Plesioc... 110 1e-23 UniRef50_Q118B0 UPF0102 protein Tery_0733 n=1 Tax=Trichodesmium ... 110 2e-23 UniRef50_C5CAH6 Holliday junction resolvase-like endonuclease n=... 110 2e-23 UniRef50_C8NTW9 Choloylglycine hydrolase n=1 Tax=Corynebacterium... 110 2e-23 UniRef50_C4LJB4 Putative uncharacterized protein n=1 Tax=Coryneb... 109 3e-23 UniRef50_C1A4D5 Putative uncharacterized protein n=1 Tax=Gemmati... 109 3e-23 UniRef50_A0LMM6 Putative uncharacterized protein n=1 Tax=Syntrop... 109 3e-23 UniRef50_C4KCT6 Putative uncharacterized protein n=3 Tax=Betapro... 109 3e-23 UniRef50_B6R5V9 Putative uncharacterized protein n=1 Tax=Pseudov... 109 3e-23 UniRef50_A8F4Z9 UPF0102 protein Tlet_0667 n=1 Tax=Thermotoga let... 109 3e-23 UniRef50_A8HYK3 UPF0102 protein AZC_4471 n=3 Tax=Rhizobiales Rep... 109 4e-23 UniRef50_B2IUS6 Putative uncharacterized protein n=4 Tax=Nostoca... 109 4e-23 UniRef50_A6DUI2 Endonuclease n=1 Tax=Lentisphaera araneosa HTCC2... 108 7e-23 UniRef50_A4CH47 Putative uncharacterized protein n=1 Tax=Robigin... 108 7e-23 UniRef50_B4U6T0 Putative uncharacterized protein n=1 Tax=Hydroge... 107 8e-23 UniRef50_B6BHT8 Putative uncharacterized protein n=1 Tax=Campylo... 107 8e-23 UniRef50_C5CGT1 UPF0102 protein Kole_1919 n=1 Tax=Kosmotoga olea... 107 8e-23 UniRef50_C7H7V1 Endonuclease n=2 Tax=Faecalibacterium prausnitzi... 107 1e-22 UniRef50_Q0F072 Putative uncharacterized protein n=1 Tax=Maripro... 107 1e-22 UniRef50_D2NTN5 Predicted endonuclease distantly related to arch... 107 1e-22 UniRef50_B2UP21 Putative uncharacterized protein n=2 Tax=Verruco... 107 2e-22 UniRef50_B7K7K1 Putative uncharacterized protein n=1 Tax=Cyanoth... 107 2e-22 UniRef50_Q5FF38 UPF0102 protein ERGA_CDS_00540 n=5 Tax=canis gro... 107 2e-22 UniRef50_Q2IJ48 UPF0102 protein Adeh_1910 n=4 Tax=Anaeromyxobact... 106 2e-22 UniRef50_D2LER1 Putative uncharacterized protein n=1 Tax=Rhodomi... 106 2e-22 UniRef50_B2RLS5 UPF0102 protein PGN_1801 n=2 Tax=Porphyromonas g... 106 3e-22 UniRef50_C7QA44 Putative uncharacterized protein n=1 Tax=Catenul... 106 3e-22 UniRef50_Q83I01 UPF0102 protein TW312 n=2 Tax=Tropheryma whipple... 106 3e-22 UniRef50_A0NXE9 Putative uncharacterized protein n=2 Tax=Labrenz... 105 3e-22 UniRef50_Q7UM23 UPF0102 protein RB9115 n=1 Tax=Rhodopirellula ba... 105 3e-22 UniRef50_C2ANQ3 Predicted endonuclease related to Holliday junct... 105 4e-22 UniRef50_Q3M3N9 UPF0102 protein Ava_4800 n=2 Tax=Nostocaceae Rep... 105 4e-22 UniRef50_A1HN64 Putative uncharacterized protein n=1 Tax=Thermos... 105 4e-22 UniRef50_C6XIG7 Putative uncharacterized protein n=1 Tax=Hirschi... 105 5e-22 UniRef50_Q0BTH9 UPF0102 protein GbCGDNIH1_0975 n=1 Tax=Granuliba... 105 5e-22 UniRef50_C3XDT2 Putative uncharacterized protein n=1 Tax=Helicob... 105 6e-22 UniRef50_A3VPC2 Putative uncharacterized protein n=1 Tax=Parvula... 105 6e-22 UniRef50_UPI0001C31AB5 protein of unknown function UPF0102 n=1 T... 104 7e-22 UniRef50_D1Y396 Putative uncharacterized protein n=1 Tax=Pyramid... 104 7e-22 UniRef50_D1N5L9 Putative uncharacterized protein n=1 Tax=Victiva... 104 8e-22 UniRef50_C1F7M4 Putative uncharacterized protein n=1 Tax=Acidoba... 104 9e-22 UniRef50_A4X4J0 UPF0102 protein Strop_1320 n=2 Tax=Micromonospor... 104 1e-21 UniRef50_D0WR56 Putative endonuclease n=1 Tax=Actinomyces sp. or... 104 1e-21 UniRef50_Q0A6J0 UPF0102 protein Mlg_2205 n=2 Tax=Ectothiorhodosp... 104 1e-21 UniRef50_UPI00019790C0 hypothetical protein HcinC1_06745 n=2 Tax... 104 1e-21 UniRef50_B9L042 Putative uncharacterized protein n=2 Tax=Thermom... 104 1e-21 UniRef50_C7NGY4 Predicted endonuclease related to Holliday junct... 103 2e-21 UniRef50_A4ECJ4 Putative uncharacterized protein n=1 Tax=Collins... 103 2e-21 UniRef50_Q5SLC1 UPF0102 protein TTHA0372 n=5 Tax=Thermaceae RepI... 102 3e-21 UniRef50_A7HZ51 UPF0102 protein Plav_3586 n=1 Tax=Parvibaculum l... 102 3e-21 UniRef50_B5JM98 Putative uncharacterized protein n=1 Tax=Verruco... 102 3e-21 UniRef50_D2L5G2 Putative uncharacterized protein n=1 Tax=Desulfo... 102 4e-21 UniRef50_Q6NGK0 UPF0102 protein DIP1513 n=1 Tax=Corynebacterium ... 102 4e-21 UniRef50_D1B6E0 Putative uncharacterized protein n=1 Tax=Therman... 102 5e-21 UniRef50_A8YJ85 Similar to Y189_SYNY3 UPF0102 protein sll0189 n=... 101 6e-21 UniRef50_B6JM54 UPF0102 protein HPP12_0830 n=15 Tax=Epsilonprote... 101 6e-21 UniRef50_Q2RJT6 UPF0102 protein Moth_0988 n=2 Tax=Clostridia Rep... 101 7e-21 UniRef50_A3JND8 Putative uncharacterized protein n=1 Tax=Rhodoba... 101 7e-21 UniRef50_C3JBE2 Putative uncharacterized protein n=2 Tax=Bacteri... 100 1e-20 UniRef50_C3PH56 Putative uncharacterized protein n=1 Tax=Coryneb... 100 1e-20 UniRef50_C3XN83 Putative uncharacterized protein n=3 Tax=Helicob... 100 1e-20 UniRef50_A6FT82 PII uridylyl-transferase n=5 Tax=Rhodobacterales... 100 2e-20 UniRef50_C4DPS8 Predicted endonuclease related to Holliday junct... 99 4e-20 UniRef50_Q2KDE4 UPF0102 protein RHE_CH00320 n=4 Tax=Rhizobiales ... 99 4e-20 UniRef50_B0P7L1 Putative uncharacterized protein n=1 Tax=Anaerot... 98 6e-20 UniRef50_B2GFY9 Putative uncharacterized protein n=1 Tax=Kocuria... 98 9e-20 UniRef50_D1NDV3 Fimbrial usher protein (Fragment) n=1 Tax=Haemop... 98 1e-19 UniRef50_C7JH91 Putative uncharacterized protein n=8 Tax=Acetoba... 97 1e-19 UniRef50_O66457 UPF0102 protein aq_041 n=1 Tax=Aquifex aeolicus ... 97 1e-19 UniRef50_A6Q6T2 UPF0102 protein SUN_0231 n=3 Tax=Epsilonproteoba... 97 1e-19 UniRef50_C5BWW3 UPF0102 protein Bcav_2532 n=21 Tax=Actinomycetal... 97 1e-19 UniRef50_A3K994 Putative uncharacterized protein n=7 Tax=Rhodoba... 97 2e-19 UniRef50_Q0C451 UPF0102 protein HNE_0764 n=1 Tax=Hyphomonas nept... 97 2e-19 UniRef50_A3UIK6 Putative uncharacterized protein n=1 Tax=Oceanic... 96 3e-19 UniRef50_A9BFT1 UPF0102 protein Pmob_0702 n=1 Tax=Petrotoga mobi... 96 3e-19 UniRef50_Q7U7D4 UPF0102 protein SYNW1051 n=3 Tax=Synechococcus R... 96 3e-19 UniRef50_A4EVA8 Putative uncharacterized protein n=1 Tax=Roseoba... 96 4e-19 UniRef50_Q31RH5 UPF0102 protein Synpcc7942_0312 n=2 Tax=Synechoc... 96 4e-19 UniRef50_A9I0M2 UPF0102 protein Bpet0439 n=14 Tax=Proteobacteria... 96 4e-19 UniRef50_Q04SX0 UPF0102 protein LBJ_1427 n=4 Tax=Leptospira RepI... 96 4e-19 UniRef50_B2IFF3 Putative uncharacterized protein n=1 Tax=Beijeri... 96 4e-19 UniRef50_C9M9E0 Putative uncharacterized protein n=1 Tax=Jonquet... 96 5e-19 UniRef50_A1B931 Putative uncharacterized protein n=1 Tax=Paracoc... 96 5e-19 UniRef50_B5Y8F8 Putative uncharacterized protein n=1 Tax=Coproth... 95 6e-19 UniRef50_C0XSA5 Endonuclease n=1 Tax=Corynebacterium lipophilofl... 95 7e-19 UniRef50_A9IXC8 UPF0102 protein BT_1882 n=8 Tax=Rhizobiales RepI... 94 9e-19 UniRef50_A4QF37 UPF0102 protein cgR_1859 n=4 Tax=Corynebacterium... 94 1e-18 UniRef50_B3T5F3 Putative uncharacterized protein family UPF0102 ... 94 1e-18 UniRef50_Q2VYL8 UPF0102 protein amb4503 n=5 Tax=Alphaproteobacte... 94 1e-18 UniRef50_B0T377 UPF0102 protein Caul_0175 n=3 Tax=Caulobacterace... 94 2e-18 UniRef50_A8U078 Predicted endonuclease n=1 Tax=alpha proteobacte... 93 3e-18 UniRef50_Q16B02 UPF0102 protein RD1_1191 n=1 Tax=Roseobacter den... 92 3e-18 UniRef50_UPI00016C4BC8 hypothetical protein GobsU_17186 n=1 Tax=... 92 4e-18 UniRef50_Q0AK98 UPF0102 protein Mmar10_3014 n=1 Tax=Maricaulis m... 92 6e-18 UniRef50_B2S8H0 UPF0102 protein BAbS19_I01690 n=50 Tax=Rhizobial... 92 6e-18 UniRef50_A8ERF6 UPF0102 protein Abu_0255 n=2 Tax=Campylobacteral... 91 1e-17 UniRef50_A3TTV9 Putative uncharacterized protein n=1 Tax=Oceanic... 90 2e-17 UniRef50_Q0FQ74 Putative uncharacterized protein n=1 Tax=Roseova... 90 2e-17 UniRef50_A7ZB75 UPF0102 protein Ccon26_01140 n=23 Tax=Epsilonpro... 90 2e-17 UniRef50_C6QFU1 Putative uncharacterized protein n=1 Tax=Hyphomi... 89 3e-17 UniRef50_C1CZ90 UPF0102 protein Deide_03080 n=3 Tax=Deinococcus ... 89 4e-17 UniRef50_B4CV56 Putative uncharacterized protein n=1 Tax=Chthoni... 89 5e-17 UniRef50_B1ZZB1 Putative uncharacterized protein n=2 Tax=Opituta... 89 6e-17 UniRef50_A7BDE4 Putative uncharacterized protein n=1 Tax=Actinom... 89 6e-17 UniRef50_Q7V7V8 UPF0102 protein PMT_0624 n=2 Tax=Prochlorococcus... 88 7e-17 UniRef50_B3CRA6 Putative uncharacterized protein n=2 Tax=Orienti... 87 1e-16 UniRef50_C0W187 Putative uncharacterized protein n=1 Tax=Actinom... 87 1e-16 UniRef50_A5V3S4 UPF0102 protein Swit_0572 n=4 Tax=Sphingomonadac... 87 1e-16 UniRef50_C2M936 Putative uncharacterized protein n=1 Tax=Porphyr... 87 2e-16 UniRef50_A4WPR4 UPF0102 protein Rsph17025_0472 n=7 Tax=Rhodobact... 86 3e-16 UniRef50_Q2GDU7 Putative uncharacterized protein n=1 Tax=Neorick... 85 6e-16 UniRef50_Q28TZ5 Putative uncharacterized protein n=1 Tax=Jannasc... 85 6e-16 UniRef50_Q7NEX4 UPF0102 protein gll3754 n=1 Tax=Gloeobacter viol... 84 9e-16 UniRef50_B8GXN3 UPF0102 protein CCNA_00142 n=4 Tax=Caulobacterac... 84 1e-15 UniRef50_Q3J5H3 UPF0102 protein RHOS4_03930 n=7 Tax=Rhodobactera... 84 1e-15 UniRef50_C6V4V5 Putative uncharacterized protein n=1 Tax=Neorick... 84 1e-15 UniRef50_C8WWC7 Putative uncharacterized protein n=1 Tax=Alicycl... 84 1e-15 UniRef50_B9KH25 Putative uncharacterized protein n=3 Tax=Anaplas... 84 2e-15 UniRef50_C4YXH4 Protein Mlr4633 n=1 Tax=Rickettsia endosymbiont ... 84 2e-15 UniRef50_A5GTL0 Restriction endonuclease-like n=1 Tax=Synechococ... 83 3e-15 UniRef50_B0S8P6 Endonuclease n=2 Tax=Leptospira biflexa serovar ... 82 5e-15 UniRef50_A5G0S4 UPF0102 protein Acry_2261 n=1 Tax=Acidiphilium c... 82 5e-15 UniRef50_A2VU88 Putative uncharacterized protein n=1 Tax=Burkhol... 82 8e-15 UniRef50_Q1GJI4 UPF0102 protein TM1040_0449 n=9 Tax=Rhodobactera... 81 1e-14 UniRef50_C7N7P3 Predicted endonuclease related to Holliday junct... 81 1e-14 UniRef50_A4E8G7 Putative uncharacterized protein n=1 Tax=Collins... 81 1e-14 UniRef50_A8LJ68 UPF0102 protein Dshi_2830 n=1 Tax=Dinoroseobacte... 81 1e-14 UniRef50_A5KGE0 Putative uncharacterized protein n=1 Tax=Campylo... 80 2e-14 UniRef50_A5GLH9 Restriction endonuclease-like n=5 Tax=Chroococca... 79 3e-14 UniRef50_B1M445 UPF0102 protein Mrad2831_2938 n=10 Tax=Alphaprot... 78 8e-14 UniRef50_B3DVH2 Predicted endonuclease n=2 Tax=Verrucomicrobia R... 77 1e-13 UniRef50_A9GEX7 UPF0102 protein sce2912 n=1 Tax=Sorangium cellul... 77 2e-13 UniRef50_C0W7A2 Putative uncharacterized protein (Fragment) n=1 ... 75 7e-13 UniRef50_B6IVS9 Putative uncharacterized protein n=1 Tax=Rhodosp... 75 9e-13 UniRef50_UPI000190D97F hypothetical protein SentesTyp_00923 n=1 ... 74 1e-12 UniRef50_C8WHA2 Putative uncharacterized protein n=1 Tax=Eggerth... 74 2e-12 UniRef50_Q0I9S1 Uncharacterised protein family protein n=1 Tax=S... 73 2e-12 UniRef50_B9XLY1 Putative uncharacterized protein n=1 Tax=bacteri... 73 2e-12 UniRef50_D1ATK0 Putative uncharacterized protein n=1 Tax=Anaplas... 73 3e-12 UniRef50_Q1GWI7 UPF0102 protein Sala_0262 n=5 Tax=Sphingomonadal... 70 3e-11 UniRef50_C0BCN9 Putative uncharacterized protein n=1 Tax=Coproco... 69 4e-11 UniRef50_C8W847 Putative uncharacterized protein n=1 Tax=Atopobi... 66 3e-10 UniRef50_Q4E9U1 Endonuclease (Fragment) n=5 Tax=Wolbachia RepID=... 62 4e-09 UniRef50_Q73VP8 Putative uncharacterized protein n=1 Tax=Mycobac... 60 2e-08 UniRef50_Q3AKE1 Uncharacterised protein family UPF0102 n=2 Tax=C... 60 3e-08 UniRef50_C7N801 Predicted endonuclease related to Holliday junct... 58 7e-08 UniRef50_Q8W6V7 Putative uncharacterized protein n=1 Tax=Synecho... 56 4e-07 UniRef50_A6DBD4 Putative uncharacterized protein (Fragment) n=1 ... 55 5e-07 UniRef50_Q8TW03 Predicted endonuclease of the RecB family n=1 Ta... 54 2e-06 UniRef50_A8A9D3 Putative uncharacterized protein n=1 Tax=Ignicoc... 54 2e-06 UniRef50_A4GJ57 Putative uncharacterized protein n=1 Tax=uncultu... 54 2e-06 UniRef50_UPI0001699F06 hypothetical protein Epers_29808 n=1 Tax=... 54 2e-06 UniRef50_C5U625 Putative uncharacterized protein n=1 Tax=Methano... 53 4e-06 UniRef50_O33024 UPF0102 protein ML1607 n=2 Tax=Mycobacterium lep... 52 7e-06 UniRef50_Q5GSW9 RecB family endonuclease n=1 Tax=Wolbachia endos... 49 4e-05 UniRef50_Q3BT99 Putative uncharacterized protein n=2 Tax=Xanthom... 45 6e-04 UniRef50_Q5NWY8 Putative uncharacterized protein n=1 Tax=Aromato... 45 7e-04 UniRef50_Q9Y9F5 Putative uncharacterized protein n=1 Tax=Aeropyr... 45 0.001 UniRef50_C1E903 Predicted protein n=1 Tax=Micromonas sp. RCC299 ... 44 0.002 UniRef50_D1TJS5 Putative uncharacterized protein n=1 Tax=Burkhol... 42 0.007 UniRef50_Q6MLA1 Putative uncharacterized protein n=1 Tax=Bdellov... 42 0.007 UniRef50_B7KKS4 Putative uncharacterized protein n=1 Tax=Cyanoth... 42 0.008 UniRef50_A2BIV6 Endonuclease of RecB family n=1 Tax=Hyperthermus... 41 0.015 UniRef50_A3DLW4 Endonuclease (RecB family)-like protein n=1 Tax=... 40 0.018 UniRef50_B5IHF1 ATPase n=1 Tax=Aciduliprofundum boonei T469 RepI... 40 0.022 UniRef50_D0CIJ2 Putative uncharacterized protein n=2 Tax=Bacteri... 40 0.030 UniRef50_A7VFU4 Putative uncharacterized protein n=6 Tax=Bacteri... 39 0.036 UniRef50_C4F8E7 Putative uncharacterized protein n=1 Tax=Collins... 39 0.054 UniRef50_Q05TI5 Putative uncharacterized protein n=1 Tax=Synecho... 39 0.064 UniRef50_A8MBK5 Putative uncharacterized protein n=1 Tax=Caldivi... 38 0.081 UniRef50_A3CY54 Restriction endonuclease n=1 Tax=Methanoculleus ... 38 0.083 UniRef50_Q7VH71 Putative uncharacterized protein n=1 Tax=Helicob... 38 0.090 >UniRef50_A4WEW3 UPF0102 protein Ent638_3585 n=124 Tax=Enterobacteriaceae RepID=Y3585_ENT38 Length = 131 Score = 161 bits (409), Expect = 5e-39, Method: Composition-based stats. Identities = 93/131 (70%), Positives = 115/131 (87%) Query: 1 MATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTT 60 MA +P + P +L+ KQTGDAWE +ARRWLEGKGLRFIAANV+ RGGEIDLIM++G+ Sbjct: 1 MAQIPAGADRPGKLSRKQTGDAWELKARRWLEGKGLRFIAANVHGRGGEIDLIMKDGQVI 60 Query: 61 IFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEV 120 +F+EVR+R+S+ +GGAAASVT +KQHKLLQTA LWLARHNGSFDTVDCRFDVVAFTGN++ Sbjct: 61 VFIEVRFRQSSRFGGAAASVTLAKQHKLLQTAHLWLARHNGSFDTVDCRFDVVAFTGNDI 120 Query: 121 EWIKDAFNDHS 131 EW+K+AF + + Sbjct: 121 EWLKNAFGEDA 131 >UniRef50_D1P1S8 Putative choloylglycine hydrolase n=5 Tax=Enterobacteriaceae RepID=D1P1S8_9ENTR Length = 128 Score = 153 bits (388), Expect = 2e-36, Method: Composition-based stats. Identities = 56/110 (50%), Positives = 76/110 (69%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAA 78 TG +E QA +L+ +GL IA NV R GEIDLIMR+G +FVEVR+R+++ YG A Sbjct: 12 TGRHYENQALAYLQQQGLTLIARNVRCRMGEIDLIMRDGTVLVFVEVRFRKNSDYGNALL 71 Query: 79 SVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAFN 128 SV K+ K+L TA+ WLA+ SF+T CRFD+ A TG + EW+++AFN Sbjct: 72 SVNWHKRRKILATAQYWLAQRQQSFETTPCRFDIYAITGKQFEWVQNAFN 121 >UniRef50_C4ZD68 Putative uncharacterized protein n=1 Tax=Eubacterium rectale ATCC 33656 RepID=C4ZD68_EUBR3 Length = 122 Score = 151 bits (383), Expect = 5e-36, Method: Composition-based stats. Identities = 43/115 (37%), Positives = 61/115 (53%), Gaps = 1/115 (0%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + + G +E A L G +A N R GEID+I ++G T F EV+YRR Sbjct: 7 MNKRSVGSIYEQLAAEQLINMGYSVLACNYRNRFGEIDIIAKDGDTICFCEVKYRRDNGC 66 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAFN 128 G A +V SKQ K++ AR +L +H + CRFDV+A +EV +K+AF Sbjct: 67 GRALEAVGYSKQKKIISVARYYLMKHGLD-EWTPCRFDVIAVDDDEVTVLKNAFE 120 >UniRef50_A8SP33 Putative uncharacterized protein n=2 Tax=Clostridiales RepID=A8SP33_9FIRM Length = 127 Score = 150 bits (380), Expect = 1e-35, Method: Composition-based stats. Identities = 35/121 (28%), Positives = 66/121 (54%), Gaps = 1/121 (0%) Query: 8 SGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRY 67 S + +++TG +E A +L+ G +A N GEID++ + +FVEV++ Sbjct: 4 SAVLNEYNSRRTGSEYETAACDYLKNCGYDILARNYRVSAGEIDIVAQSDGYIVFVEVKF 63 Query: 68 RRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAF 127 R + G A+ +V KQ ++ + A +L ++ + V RFDV+ +GNE+ I++A+ Sbjct: 64 RSNTHMGAASEAVDHRKQKRISKAALYFLKQYGYGVE-VPVRFDVITVSGNEITHIENAY 122 Query: 128 N 128 + Sbjct: 123 D 123 >UniRef50_A5KJ12 Putative uncharacterized protein n=2 Tax=Clostridiales RepID=A5KJ12_9FIRM Length = 120 Score = 149 bits (378), Expect = 2e-35, Method: Composition-based stats. Identities = 38/115 (33%), Positives = 57/115 (49%), Gaps = 2/115 (1%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + + G +E A +LE G I N R GEID+I ++G +F EV+YR Sbjct: 6 KQNNRSVGAVYEQAAGYYLEQNGYELIEYNYRCRDGEIDIIAKDGDCYVFCEVKYRSGRQ 65 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAF 127 G +V + KQ K+ + A + +H + CRFDV+ G E+ IK+AF Sbjct: 66 AGNPLEAVDQRKQKKIFRCALYYTVQHGI--EDAQCRFDVIGVEGTEITHIKNAF 118 >UniRef50_A9KLL5 UPF0102 protein Cphy_2398 n=3 Tax=Clostridiales RepID=Y2398_CLOPH Length = 118 Score = 149 bits (376), Expect = 3e-35, Method: Composition-based stats. Identities = 39/117 (33%), Positives = 63/117 (53%), Gaps = 1/117 (0%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + K G E +A +L +G + +A N R GEID++ RE +FVEV+YR + Sbjct: 1 MNKKVEGLTKETEAANYLSEQGYQILARNYRCRLGEIDIVARENGYLVFVEVKYRTNVEK 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAFNDH 130 G ++T KQ ++ TA+ +L + ++ CRFDVV E+ IK+AF+ + Sbjct: 61 GFPEEAITIQKQRRITNTAKYYLLVNRLP-ESTPCRFDVVVMLKEEIRLIKNAFDAY 116 >UniRef50_UPI0001973CC8 hypothetical protein ClM62_15129 n=1 Tax=Clostridium sp. M62/1 RepID=UPI0001973CC8 Length = 126 Score = 147 bits (373), Expect = 8e-35, Method: Composition-based stats. Identities = 46/127 (36%), Positives = 66/127 (51%), Gaps = 1/127 (0%) Query: 1 MATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTT 60 M R G + +T+ G +E A +LE +G + N R GEID+I REG T Sbjct: 1 MQEEKRRKGPAGRKSTRARGARYEDLAAAFLEKQGYVILEKNFFCRTGEIDIIAREGDTL 60 Query: 61 IFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEV 120 +FVEV+YR+ G A +V KQ K+ + A +L + CRFDVVA G++ Sbjct: 61 VFVEVKYRKDLAAGDPAEAVNERKQEKIRKAAAFYLYARGLPPEQ-PCRFDVVAILGSDF 119 Query: 121 EWIKDAF 127 ++DAF Sbjct: 120 RLLRDAF 126 >UniRef50_C1SJ30 Putative uncharacterized protein n=1 Tax=Denitrovibrio acetiphilus DSM 12809 RepID=C1SJ30_9BACT Length = 111 Score = 147 bits (373), Expect = 8e-35, Method: Composition-based stats. Identities = 36/110 (32%), Positives = 59/110 (53%), Gaps = 4/110 (3%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAA 78 G E +A +LE +G + N + GEID+I + IFVEV+ R + +G Sbjct: 5 FGKKGEKKAACFLEKQGYAIVEMNYRCKFGEIDIIAEKNGVLIFVEVKTRSTDKFGLGYE 64 Query: 79 SVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAFN 128 SVT SKQ KL +TA+ ++ + + +FDV++ G+ + I +AF+ Sbjct: 65 SVTLSKQQKLFKTAQHYMVENG----EMPAQFDVISIDGDTLTHIPNAFS 110 >UniRef50_B5YIA9 UPF0102 protein THEYE_A1950 n=1 Tax=Thermodesulfovibrio yellowstonii DSM 11347 RepID=Y1950_THEYD Length = 112 Score = 147 bits (371), Expect = 1e-34, Method: Composition-based stats. Identities = 34/114 (29%), Positives = 57/114 (50%), Gaps = 3/114 (2%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + + G E A +L KG + + N GEID+I ++G + +EV+ R S + Sbjct: 1 MARIELGKEGEKLAIDYLLTKGYKILEKNFRTPFGEIDIIAKDGNFIVIIEVKRRLSDKF 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAF 127 G SV +KQ KL + A +++ + RFDV+A ++E I++AF Sbjct: 61 GKPELSVNYTKQQKLKKLALYYISMLKKEY---PVRFDVIAINDKKIEHIENAF 111 >UniRef50_Q0AWX4 UPF0102 protein Swol_1475 n=1 Tax=Syntrophomonas wolfei subsp. wolfei str. Goettingen RepID=Y1475_SYNWW Length = 115 Score = 146 bits (369), Expect = 2e-34, Method: Composition-based stats. Identities = 35/116 (30%), Positives = 56/116 (48%), Gaps = 6/116 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 ++ G E A ++L KG + + N + R GE+DL+ + +FVEV+ RRS +G Sbjct: 2 NRELGLWGEELAAQYLRKKGYKILERNFHTRYGELDLVCEKDDNIVFVEVKTRRSTRFGS 61 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN----EVEWIKDAF 127 +VT K L + A L+L F + FDVV+ ++ I +AF Sbjct: 62 PEEAVTPRKIGNLKKAAILYLKSTPRFF--PEISFDVVSILVEDGKSKINHIINAF 115 >UniRef50_C4Z0D1 Putative endonuclease n=13 Tax=Clostridiales RepID=C4Z0D1_EUBE2 Length = 115 Score = 146 bits (368), Expect = 3e-34, Method: Composition-based stats. Identities = 38/113 (33%), Positives = 57/113 (50%), Gaps = 1/113 (0%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 + TG E A R+L G + N + GEID+I ++ +FVEV+YR + YG Sbjct: 4 NKRATGADKEQLAARYLVDNGYTVLERNFRNKTGEIDIIAKKDNYIVFVEVKYRSNNKYG 63 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAF 127 A +V KQ + + A+ ++ S + CRFDV+ G V IK+AF Sbjct: 64 YAVEAVNYRKQQIIRRVAQFYITTRYKSC-DIPCRFDVIGIDGETVTHIKNAF 115 >UniRef50_C2KW07 Putative uncharacterized protein n=1 Tax=Oribacterium sinus F0268 RepID=C2KW07_9FIRM Length = 188 Score = 146 bits (368), Expect = 3e-34, Method: Composition-based stats. Identities = 36/115 (31%), Positives = 60/115 (52%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 ++ G +E +A +LE KG + N E+DL+ ++G F+EV+ R+ Sbjct: 74 NISNTLKGKVFEDRAVAFLEEKGYEILERNSRFHHLEMDLVAKDGEMLCFIEVKGRKEHS 133 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAF 127 Y +V R KQ +L A +L +H+ S CRFDVV+ G +++ I++AF Sbjct: 134 YLSGVYAVDRGKQRRLRTWATAYLCKHSYSLTETACRFDVVSIEGEKIQLIQNAF 188 >UniRef50_A8MHC8 UPF0102 protein Clos_1471 n=8 Tax=Clostridiaceae RepID=Y1471_ALKOO Length = 117 Score = 146 bits (368), Expect = 3e-34, Method: Composition-based stats. Identities = 34/116 (29%), Positives = 57/116 (49%), Gaps = 4/116 (3%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 K+ G E A +L+ KG R + N R GEID+I T +FVEV+ R S +G Sbjct: 2 NKKIGAIGEQLAVHYLKNKGYRILDCNYRTRLGEIDIIAILNDTIVFVEVKTRSSGAFGT 61 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF----TGNEVEWIKDAF 127 + +V KQ + + ++ +L + D + RFDV+ ++ +++AF Sbjct: 62 PSEAVNYKKQMTIRRVSQQYLLSNRIGEDDWNLRFDVIEVQLIEKKYKINHMENAF 117 >UniRef50_C4V2D8 Endonuclease n=1 Tax=Selenomonas flueggei ATCC 43531 RepID=C4V2D8_9FIRM Length = 118 Score = 145 bits (367), Expect = 4e-34, Method: Composition-based stats. Identities = 44/119 (36%), Positives = 62/119 (52%), Gaps = 6/119 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 ++ K GD EA A R+L +G R +A + GEIDLI ++ T +FVEV+ RRS Sbjct: 1 MSNKVLGDRGEACAARYLGAQGYRILAQKYRTKTGEIDLIAKDHDTLVFVEVKTRRSVRC 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG----NEVEWIKDAFN 128 G A +V KQ +++QTA L+L D CRFD+V + +K AF Sbjct: 61 GLPAEAVNYRKQRRIIQTAMLYLCEK--QMDQTPCRFDIVEVYAAGSEWRIHHLKGAFE 117 >UniRef50_A6FEY4 Putative uncharacterized protein n=1 Tax=Moritella sp. PE36 RepID=A6FEY4_9GAMM Length = 135 Score = 145 bits (367), Expect = 4e-34, Method: Composition-based stats. Identities = 47/134 (35%), Positives = 75/134 (55%), Gaps = 13/134 (9%) Query: 7 RSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGR-------- 58 R + + ++ G+ +E A +L+ +GL +A N R GEIDLI + G Sbjct: 2 RKATLNRKQPRKRGEYFEGIAAEFLQRQGLIILARNFACRQGEIDLICQHGASCDIKSST 61 Query: 59 ---TTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF 115 T +FVEV+YR+ YGGA +++ +KQ KL TA+ ++ RH + + CRFDV+A Sbjct: 62 TLPTLVFVEVKYRQYTHYGGAISAIPVAKQRKLRYTAQYYMVRHGINENYTPCRFDVIAI 121 Query: 116 TG--NEVEWIKDAF 127 G + ++WI +AF Sbjct: 122 EGCSDNIQWITNAF 135 >UniRef50_B0P3K1 Putative uncharacterized protein n=2 Tax=Clostridiales RepID=B0P3K1_9CLOT Length = 115 Score = 145 bits (367), Expect = 4e-34, Method: Composition-based stats. Identities = 41/115 (35%), Positives = 68/115 (59%), Gaps = 1/115 (0%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + ++TG EA A +L+ +G + N + GEID+I +E +T +FVEV+YR+ Sbjct: 2 KKNNRETGAKAEAIACWFLKQQGYDVLEQNFYTKVGEIDIIAKEDQTLVFVEVKYRKDDK 61 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAF 127 G A +V + KQ K+ ++A ++L +++ SF+ RFDVV G ++ IK AF Sbjct: 62 KGYPAQAVDQRKQQKIRKSAMIYLKKNHLSFEQ-PIRFDVVEILGKKIRVIKHAF 115 >UniRef50_Q5ZR89 UPF0102 protein lpg2994 n=4 Tax=Legionella RepID=Y2994_LEGPH Length = 118 Score = 145 bits (366), Expect = 4e-34, Method: Composition-based stats. Identities = 43/116 (37%), Positives = 68/116 (58%), Gaps = 3/116 (2%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 T++ G E A +L+ GL + N + R GEIDLIMREG +FVEVR R + +GG Sbjct: 2 TQEKGKFAEQLALNYLKENGLALVMQNYHCRLGEIDLIMREGSYLVFVEVRSRSNMNFGG 61 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG--NEVEWIKDAFND 129 AS+T K+ K+++ ++ ++ D RFDV++ G N++ W+K+AF+ Sbjct: 62 GLASITYEKKQKIIKATSHYMIKYRIQ-DKFPIRFDVISIDGKSNKITWLKNAFDA 116 >UniRef50_C7R9K3 Putative uncharacterized protein n=1 Tax=Kangiella koreensis DSM 16069 RepID=C7R9K3_KANKD Length = 122 Score = 145 bits (366), Expect = 5e-34, Method: Composition-based stats. Identities = 50/122 (40%), Positives = 72/122 (59%), Gaps = 5/122 (4%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 ++T+Q GD E A +L+ +GL + N N R GEIDLIM + +FVEVR+R + Y Sbjct: 1 MSTRQRGDHVELFAESYLKKQGLTLVEKNFNSRFGEIDLIMLDKSALVFVEVRFRANTSY 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE----VEWIKDAFND 129 G A +V KQ K+++TA+L+L + DCRFDVV+ T + +EW K+AF Sbjct: 61 GSGAETVNFRKQQKIIKTAQLYLQANK-KMQQRDCRFDVVSVTLSAQEPLIEWHKNAFQA 119 Query: 130 HS 131 S Sbjct: 120 PS 121 >UniRef50_Q1JX98 Putative uncharacterized protein n=1 Tax=Desulfuromonas acetoxidans DSM 684 RepID=Q1JX98_DESAC Length = 121 Score = 144 bits (365), Expect = 6e-34, Method: Composition-based stats. Identities = 43/122 (35%), Positives = 60/122 (49%), Gaps = 6/122 (4%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 G E QA +L + R + N GEIDLI+R G+T FVEV+ R+S Sbjct: 2 TQQRLTLGRWGEQQAADYLRRRLYRIVTCNYRCHYGEIDLIVRRGKTLAFVEVKTRKSRC 61 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN----EVEWIKDAFN 128 YG +VT KQ +++ TA+ +L S T RFDV+A + ++ I DAF Sbjct: 62 YGTPQEAVTPRKQQQIIATAQHYLTTQQPSTQT--VRFDVIAINVDGDKTQINHIVDAFE 119 Query: 129 DH 130 H Sbjct: 120 LH 121 >UniRef50_C4L9F6 Putative uncharacterized protein n=1 Tax=Tolumonas auensis DSM 9187 RepID=C4L9F6_TOLAT Length = 124 Score = 144 bits (364), Expect = 8e-34, Method: Composition-based stats. Identities = 52/113 (46%), Positives = 74/113 (65%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 + G +E QAR +LE +GL F+ AN + R GE+DLIMRE T +F+EVR+R S YG Sbjct: 12 NRRSKGQHYEQQARCFLEQQGLLFVCANYHCRQGELDLIMRERDTLVFIEVRFRASRDYG 71 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAF 127 GA +SVT +KQHK+ TAR +L + + CRFD+V++ + W+K+AF Sbjct: 72 GALSSVTPAKQHKIRHTARYYLMSQHINEAHQACRFDIVSYDDGQCSWLKNAF 124 >UniRef50_A0KPY4 UPF0102 protein AHA_3896 n=4 Tax=Proteobacteria RepID=Y3896_AERHH Length = 130 Score = 144 bits (363), Expect = 1e-33, Method: Composition-based stats. Identities = 56/109 (51%), Positives = 76/109 (69%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAA 78 G +E A RWL+ +GL+ + N RGGEIDLIMR+G T +FVEVRYR +GGAAA Sbjct: 22 KGQHFEQLAERWLQARGLQPVTRNYRCRGGEIDLIMRQGETLVFVEVRYRSQTSHGGAAA 81 Query: 79 SVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAF 127 SVTR KQHK++ AR + +H + + CRFDV+AF G++ +WI++AF Sbjct: 82 SVTRCKQHKIVLAARHYFKQHAINEASQACRFDVIAFEGDQPDWIQNAF 130 >UniRef50_Q6AJE4 UPF0102 protein DP2807 n=1 Tax=Desulfotalea psychrophila RepID=Y2807_DESPS Length = 128 Score = 144 bits (363), Expect = 1e-33, Method: Composition-based stats. Identities = 39/118 (33%), Positives = 63/118 (53%), Gaps = 7/118 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 K+ G E A R+L+ +G + N ++ GEID+I +EG +FVEV+ R ++ +G Sbjct: 5 RKKKGAEGEYLACRFLKKQGYVILQKNYRKKYGEIDIIAQEGGDLVFVEVKTRSNSDWGS 64 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-----VEWIKDAFN 128 A+VT+ KQ K+++ A+ +LA RFDV+ +E E I +AF Sbjct: 65 PVAAVTKQKQRKIIRVAQTYLAE--TELFDEAIRFDVIGIILDENSPPIFELIHNAFE 120 >UniRef50_B0U003 UPF0102 protein Fphi_0415 n=15 Tax=Francisella RepID=Y420_FRAP2 Length = 117 Score = 143 bits (361), Expect = 2e-33, Method: Composition-based stats. Identities = 41/115 (35%), Positives = 66/115 (57%), Gaps = 2/115 (1%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVN-ERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + T + G+ E QA ++L K L+ +A N GEID+I + T +F+EV+YR Sbjct: 1 MKTIEIGNKAEEQASKFLRTKNLQILAQNFKAFPYGEIDIIALDQNTLVFIEVKYRSKTK 60 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAF 127 + A +T SKQ KL+ A ++L + F+ +CRFD++A ++ WIK+AF Sbjct: 61 FAKAEEMLTYSKQQKLINAANIFLQENP-KFENYECRFDLIAINKEDINWIKNAF 114 >UniRef50_Q47VU1 UPF0102 protein CPS_4433 n=1 Tax=Colwellia psychrerythraea 34H RepID=Y4433_COLP3 Length = 129 Score = 142 bits (359), Expect = 4e-33, Method: Composition-based stats. Identities = 49/124 (39%), Positives = 76/124 (61%), Gaps = 4/124 (3%) Query: 8 SGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRY 67 S + ++ G E+ A+++L +GLRFI N + R GEIDLIM +G T +FVEV+Y Sbjct: 6 KTSAKNTSSTDKGQVTESYAQQYLSKQGLRFIERNFHSRQGEIDLIMLDGDTYVFVEVKY 65 Query: 68 RRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN----EVEWI 123 R+S +GGA A+++ SKQ+K+ +L ++ + CR DVVA G+ +V W+ Sbjct: 66 RKSKGFGGAIAAISASKQNKVKHCITFYLHQNGLNEYNTPCRVDVVALEGDITQPQVTWL 125 Query: 124 KDAF 127 K+AF Sbjct: 126 KNAF 129 >UniRef50_C0QTY9 UPF0102 protein PERMA_0362 n=2 Tax=Hydrogenothermaceae RepID=Y362_PERMH Length = 116 Score = 142 bits (358), Expect = 4e-33, Method: Composition-based stats. Identities = 35/112 (31%), Positives = 56/112 (50%), Gaps = 2/112 (1%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAA 78 G E +A +L G R + N R GEID+I + T + VEVR + S YG Sbjct: 6 KGKEGEDKAVEYLRNSGYRILERNFRSRFGEIDIIAEDNGTIVIVEVRSKGSTGYGYPEE 65 Query: 79 SVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAFNDH 130 S+ K K+++TA+ +L + + RFD+++ N + IK+AF+ Sbjct: 66 SIDHKKVRKIIKTAQFYLLKRDIK--GKQVRFDIISIVNNNIFHIKNAFDLD 115 >UniRef50_D1RD77 Putative uncharacterized protein n=1 Tax=Legionella longbeachae D-4968 RepID=D1RD77_LEGLO Length = 119 Score = 141 bits (356), Expect = 6e-33, Method: Composition-based stats. Identities = 45/119 (37%), Positives = 68/119 (57%), Gaps = 3/119 (2%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + T++ G E +A L +GL+ + N R GEIDLIM + +F+EVR R S + Sbjct: 1 MRTQEKGRVAEEKALAHLTKQGLKLVMKNYRCRFGEIDLIMYDKDYLVFIEVRSRVSNQF 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN--EVEWIKDAFNDH 130 GG +SVT +K+ K+L+TA ++ H ++ RFDVV+ G+ + WIKDAF Sbjct: 61 GGGISSVTHTKRQKILKTASCFILEHQ-KYNQFGLRFDVVSIDGDAASISWIKDAFGAD 118 >UniRef50_D1VVL2 Putative uncharacterized protein n=1 Tax=Peptoniphilus lacrimalis 315-B RepID=D1VVL2_9FIRM Length = 117 Score = 141 bits (356), Expect = 8e-33, Method: Composition-based stats. Identities = 35/116 (30%), Positives = 60/116 (51%), Gaps = 4/116 (3%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + K+ G+ E+ A +L+ K + N G EID+I ++G +FVEV+ RR+ + Sbjct: 1 MNNKELGNFGESLATDFLQKKNYIILDRNYRALGTEIDIIAKDGEELVFVEVKTRRNHKF 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT--GNEVEWIKDAF 127 G A +VT K ++QTA +++ +H RFDV+ + I++AF Sbjct: 61 GEAYEAVTEFKMRNIIQTANVYIYKH--ELYNTQVRFDVIEVYINEKRINHIENAF 114 >UniRef50_A5F986 UPF0102 protein VC0395_A0112/VC395_0597 n=28 Tax=Vibrio RepID=Y1312_VIBC3 Length = 122 Score = 141 bits (356), Expect = 8e-33, Method: Composition-based stats. Identities = 46/117 (39%), Positives = 74/117 (63%), Gaps = 2/117 (1%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 ++ G+ +E A +L +GL + NVN R GE+DLIMR+G T +FVEVRYR + +G Sbjct: 5 NSRHQGNHYEQMAADYLRRQGLTLVTQNVNYRFGELDLIMRDGNTLVFVEVRYRNNTQHG 64 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF--TGNEVEWIKDAFND 129 AA +VTR+K+ +L++ A W+ + + + D RFDV+A G ++W+K+A + Sbjct: 65 HAAETVTRTKRARLIKAANCWMLANKMNSHSADFRFDVIAIHQQGQHIDWLKNAITE 121 >UniRef50_Q3JE65 UPF0102 protein Noc_0355 n=2 Tax=Nitrosococcus oceani RepID=Y355_NITOC Length = 124 Score = 141 bits (356), Expect = 8e-33, Method: Composition-based stats. Identities = 45/121 (37%), Positives = 67/121 (55%), Gaps = 5/121 (4%) Query: 12 RQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSA 71 + T + G+ E A +L+ +GLR N + R GEIDLIM + + +F+EVRYRR Sbjct: 2 KPATHRDKGEQAEQLACHYLQARGLRLTQRNYHCRLGEIDLIMEDRESLVFIEVRYRRKG 61 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT----GNEVEWIKDAF 127 +G A S+T +KQ +L+ A+ +L R CRFDVV T + + W++DAF Sbjct: 62 RFGDAIDSITPAKQARLIAAAQHYLQRTG-GAQNKPCRFDVVGITSEKGADNIMWLRDAF 120 Query: 128 N 128 Sbjct: 121 R 121 >UniRef50_B2V8B3 UPF0102 protein SYO3AOP1_0546 n=2 Tax=Sulfurihydrogenibium RepID=Y546_SULSY Length = 115 Score = 140 bits (355), Expect = 8e-33, Method: Composition-based stats. Identities = 38/115 (33%), Positives = 64/115 (55%), Gaps = 2/115 (1%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + Q G +E +A R+LE G + + N + GEID+I +FVEV+ R + + Sbjct: 1 MDKTQKGKFFEDKAVRYLESIGYKVLHKNYRSKYGEIDIIAETDNVIVFVEVKGRFTENF 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAFN 128 G S+T+ K K+++TA ++ +N D RFDVVA GN++ +++AF+ Sbjct: 61 GSGEESITKKKIDKIVKTALQFIEENN--LQGKDFRFDVVALKGNQIFHLENAFS 113 >UniRef50_B8J1T1 Putative uncharacterized protein n=1 Tax=Desulfovibrio desulfuricans subsp. desulfuricans str. ATCC 27774 RepID=B8J1T1_DESDA Length = 160 Score = 140 bits (355), Expect = 9e-33, Method: Composition-based stats. Identities = 44/133 (33%), Positives = 61/133 (45%), Gaps = 7/133 (5%) Query: 4 VPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFV 63 R + + G A E A L G G +A N + E+D++ +G T +FV Sbjct: 19 KSVRPATAPAAAHLRLGSAGEDAAAELLTGAGCTLLARNWRQARLELDMVCLDGDTIVFV 78 Query: 64 EVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN----E 119 EV+ R S YGG A +V SKQ L + AR WLA H CRFDV+ N Sbjct: 79 EVKTRSSERYGGPAYAVGLSKQRVLCRAARAWLAAH--EAWDKPCRFDVICVLRNGDTLH 136 Query: 120 VEWIKDAFN-DHS 131 +E + AF+ + Sbjct: 137 LEHFRHAFDCPPA 149 >UniRef50_Q7N090 UPF0102 protein plu4003 n=2 Tax=Enterobacteriaceae RepID=Y4003_PHOLL Length = 126 Score = 140 bits (355), Expect = 9e-33, Method: Composition-based stats. Identities = 59/119 (49%), Positives = 86/119 (72%) Query: 12 RQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSA 71 ++LT+ G +EAQA+ +L+ +GL FIAANV GGEIDLIM++ +T +F+EVR+R+S Sbjct: 4 KKLTSYLLGRNYEAQAKLFLQKQGLSFIAANVKVHGGEIDLIMKDKQTWVFIEVRFRKSG 63 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAFNDH 130 YG A A++TRSK+ KLL A +WL + F+T CRFD+ A TG + EW+++AFN + Sbjct: 64 QYGDALATITRSKRKKLLHAAAVWLFQRGECFETSSCRFDICAITGQQFEWLQNAFNQN 122 >UniRef50_B8CW28 Putative uncharacterized protein n=1 Tax=Halothermothrix orenii H 168 RepID=B8CW28_HALOH Length = 116 Score = 140 bits (355), Expect = 1e-32, Method: Composition-based stats. Identities = 40/118 (33%), Positives = 62/118 (52%), Gaps = 6/118 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + ++ GD E +A R+L+ KG + I N GEID+I + +FVEV+ RRS Y Sbjct: 1 MQNRELGDWGEKKAVRYLKSKGYQVIKTNYRCLIGEIDIIAIDNNFLVFVEVKTRRSIAY 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE----VEWIKDAF 127 G A +V KQ K+ + AR +L + + RFDV++ ++ IK+AF Sbjct: 61 GVPACAVNFDKQKKIRKVARHYLKSN--MINKYQIRFDVISIIVKNNRGFLKHIKNAF 116 >UniRef50_A9NAA4 UPF0102 protein COXBURSA331_A1934 n=6 Tax=Coxiella burnetii RepID=Y1934_COXBR Length = 120 Score = 140 bits (354), Expect = 1e-32, Method: Composition-based stats. Identities = 46/114 (40%), Positives = 68/114 (59%), Gaps = 2/114 (1%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 T++ G E A R+L+ +GL FI N + GEIDLIM + +F+EVRYRR + + Sbjct: 5 TQKIGFNAEKTACRYLQKQGLSFITKNFRYKQGEIDLIMSDQSMLVFIEVRYRRFSDFIH 64 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-VEWIKDAFN 128 A+VT KQ +L++TA +L +H D + CRFD+V T + + WIK+A Sbjct: 65 PVATVTPLKQRRLIKTALHYLQKHRL-LDKISCRFDIVGITADRQITWIKNAIE 117 >UniRef50_D1KBC1 Putative uncharacterized protein n=1 Tax=uncultured SUP05 cluster bacterium RepID=D1KBC1_9GAMM Length = 123 Score = 140 bits (354), Expect = 1e-32, Method: Composition-based stats. Identities = 46/120 (38%), Positives = 70/120 (58%), Gaps = 7/120 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE--GRTTIFVEVRYRRSALY 73 ++ G+ E A +L GL I N + GEID+IM + +T +FVEVRYR++ + Sbjct: 5 KRKVGNQAEDIALEYLSTHGLELIEQNYLTKMGEIDIIMLDKSEQTLVFVEVRYRQNTYF 64 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN----EVEWIKDAFND 129 G AA +V ++KQ KL++TA+ +L +H + CRFDVV + ++ WIKDAF Sbjct: 65 GSAADTVDQNKQAKLVRTAQYYLQQH-SKYQEFICRFDVVGVESDLKYPKINWIKDAFGA 123 >UniRef50_B7GSU4 UPF0102 protein Blon_1698 n=12 Tax=Bifidobacterium RepID=Y1698_BIFLI Length = 124 Score = 140 bits (354), Expect = 1e-32, Method: Composition-based stats. Identities = 45/122 (36%), Positives = 59/122 (48%), Gaps = 5/122 (4%) Query: 11 PRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGR-TTIFVEVRYRR 69 R LT KQ G E A WLE G ++ N + R GE+D++M T +FVEV+ RR Sbjct: 3 DRNLTPKQFGALGEQYAAAWLEEHGWTTLSRNWHTRYGELDIVMLNPEYTVVFVEVKSRR 62 Query: 70 SALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE----VEWIKD 125 S YG ++T +KQH L + A WL RFDVV V I++ Sbjct: 63 SMHYGYPQEAITPAKQHNLRKAACDWLLDRRNRVPHTAVRFDVVTIVLRVGRPLVHHIEN 122 Query: 126 AF 127 AF Sbjct: 123 AF 124 >UniRef50_C0N677 Putative uncharacterized protein n=1 Tax=Methylophaga thiooxidans DMS010 RepID=C0N677_9GAMM Length = 120 Score = 140 bits (353), Expect = 2e-32, Method: Composition-based stats. Identities = 48/119 (40%), Positives = 73/119 (61%), Gaps = 6/119 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + ++ G E Q + L+ +G+R I N RGGEIDLIM++ T +F+EVRYR+SA + Sbjct: 1 MFAREKGQQIEKQVAKHLQKQGMRLITRNYQCRGGEIDLIMQDRETLVFIEVRYRQSARF 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF----TGNEVEWIKDAFN 128 G A SV ++KQ +++ TA +L + + CRFDVVA TG + +W+K+AF Sbjct: 61 GSALESVNKTKQSRIIHTAEHYLQQSRDGYQ--ACRFDVVAVSPAKTGYQFDWVKNAFQ 117 >UniRef50_A1SU47 UPF0102 protein Ping_1176 n=2 Tax=Psychromonas RepID=Y1176_PSYIN Length = 122 Score = 140 bits (353), Expect = 2e-32, Method: Composition-based stats. Identities = 46/118 (38%), Positives = 67/118 (56%), Gaps = 5/118 (4%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + G E QA +L+ +GLR I N R GEIDLIM + T +F+EVRYR+++ + Sbjct: 8 QASNSKGVLAEKQALSYLQEQGLRLICQNYYCRFGEIDLIMIDQDTLVFIEVRYRKNSDF 67 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE--VEWIKDAFND 129 GG AS+ +SKQ K++ TA+ +L D CRFD +A WI++AF + Sbjct: 68 GGPFASINKSKQRKIITTAKHYL---RTLEDEPFCRFDAIAIDSKSTTPAWIQNAFQE 122 >UniRef50_C8X4T0 Putative uncharacterized protein n=1 Tax=Desulfohalobium retbaense DSM 5692 RepID=C8X4T0_DESRD Length = 134 Score = 139 bits (352), Expect = 2e-32, Method: Composition-based stats. Identities = 42/121 (34%), Positives = 65/121 (53%), Gaps = 8/121 (6%) Query: 14 LTTKQT--GDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSA 71 ++ + G E AR +LE G +A N GGE+DL+ R GR IFVEV+ R S+ Sbjct: 1 MSARHLKTGRDGEEAARAYLESCGYVIVARNWRGGGGELDLVCRLGREIIFVEVKTRASS 60 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF----TGNEVEWIKDAF 127 ++T +KQ +L++ A +L+R+ CRFDV++ +GN+VE +AF Sbjct: 61 GRTLPIQALTPAKQQRLIRAASAYLSRNR--LWETPCRFDVISVFSGPSGNQVEHCTNAF 118 Query: 128 N 128 Sbjct: 119 E 119 >UniRef50_B6ELI6 UPF0102 protein VSAL_I2655 n=7 Tax=Vibrionaceae RepID=Y2655_ALISL Length = 123 Score = 139 bits (352), Expect = 2e-32, Method: Composition-based stats. Identities = 49/118 (41%), Positives = 70/118 (59%), Gaps = 2/118 (1%) Query: 11 PRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRS 70 ++ + G+ +E A+R+LE L FI N + GE+DLIMR+ + +FVEV+YR S Sbjct: 2 EKKPNKRIKGEYYELMAKRYLETHQLTFIERNFYSKTGELDLIMRDRDSFVFVEVKYRAS 61 Query: 71 ALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF--TGNEVEWIKDA 126 + YG A VT KQ KL +TA WL ++ S + RFDVVA G ++ WIK+A Sbjct: 62 SNYGSAQEMVTWQKQRKLQRTALFWLMKNGLSVEHTSFRFDVVAIHSQGQDINWIKNA 119 >UniRef50_C5BS52 Putative uncharacterized protein n=1 Tax=Teredinibacter turnerae T7901 RepID=C5BS52_TERTT Length = 129 Score = 139 bits (352), Expect = 2e-32, Method: Composition-based stats. Identities = 54/126 (42%), Positives = 79/126 (62%), Gaps = 2/126 (1%) Query: 3 TVPTRSGSPRQLTTKQT-GDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTI 61 P R+ + +Q T ++ GD E A+++L +GL +A N R GEIDLIM+ T + Sbjct: 4 PNPFRTPTGKQPTARRKTGDLAEDAAQQYLISQGLTPVARNYRSRFGEIDLIMQHASTLV 63 Query: 62 FVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVE 121 FVEVRYR ++ YG +AA+VT SKQ+K+ QTA+ ++ S + RFDVV +G + + Sbjct: 64 FVEVRYRANSRYGSSAATVTASKQNKIRQTAQQFIIDKKLS-ANLALRFDVVGMSGTQTQ 122 Query: 122 WIKDAF 127 WIK AF Sbjct: 123 WIKGAF 128 >UniRef50_A3XHA2 Putative uncharacterized protein n=1 Tax=Leeuwenhoekiella blandensis MED217 RepID=A3XHA2_9FLAO Length = 118 Score = 139 bits (351), Expect = 3e-32, Method: Composition-based stats. Identities = 31/118 (26%), Positives = 54/118 (45%), Gaps = 7/118 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + + G E A +LE KG + N E+D+I + + VEV+ R S + Sbjct: 1 MNHNELGKWGEEYAANYLEKKGYELLERNWFFNKAELDIIALKNNQLVVVEVKTRNSDFF 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF----TGNEVEWIKDAF 127 G VT +K L++ ++ ++ ++ RFDV+A T ++E +DAF Sbjct: 61 GDPQDFVTPAKIKLLVKATNEYIISNDL---DLEVRFDVIAVLKNKTQEQLEHFEDAF 115 >UniRef50_Q8R5S3 UPF0102 protein TTE1452 n=9 Tax=Thermoanaerobacterales RepID=Y1452_THETN Length = 122 Score = 139 bits (350), Expect = 3e-32, Method: Composition-based stats. Identities = 36/123 (29%), Positives = 61/123 (49%), Gaps = 9/123 (7%) Query: 12 RQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSA 71 +++ K G E A ++L KG + + N + GEIDLI +FVEV+ R S Sbjct: 2 KKVNKKTVGSVGEKIAAQYLSKKGYKILEKNFKCKIGEIDLIALYKNQIVFVEVKTRTSV 61 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF-------TGNEVEWIK 124 +G + +V KQ K+++ A++++A +F RFD++ T +V I Sbjct: 62 NFGLPSEAVDFHKQQKIVKIAQVYIAS--SNFKQYQPRFDIIEVYLNPEKLTLEKVNHIL 119 Query: 125 DAF 127 +AF Sbjct: 120 NAF 122 >UniRef50_A0YER5 Putative uncharacterized protein n=1 Tax=marine gamma proteobacterium HTCC2143 RepID=A0YER5_9GAMM Length = 128 Score = 137 bits (347), Expect = 7e-32, Method: Composition-based stats. Identities = 46/118 (38%), Positives = 66/118 (55%), Gaps = 6/118 (5%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 G E +A WL+ +GL+ +A N + GEID+IM +G+ +FVEVRYR+SA +G Sbjct: 10 KNTNFGAYVEEKAYHWLQQQGLKSVALNYRCKTGEIDIIMLDGQQLVFVEVRYRKSASFG 69 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-----EVEWIKDAF 127 SV R KQ K+ + A +L F+ + CRFDV+A + W+KDAF Sbjct: 70 DGLESVDRRKQQKIQKAAAHFLTDRP-GFNHLPCRFDVIAAKPSSDSSLHWNWVKDAF 126 >UniRef50_B9MQX5 UPF0102 protein Athe_0977 n=1 Tax=Anaerocellum thermophilum DSM 6725 RepID=Y977_ANATD Length = 119 Score = 137 bits (347), Expect = 7e-32, Method: Composition-based stats. Identities = 42/120 (35%), Positives = 59/120 (49%), Gaps = 7/120 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + KQ G E A +L G + N R GEID+I +E +T +FVEV+ R+S + Sbjct: 1 MNLKQVGRFGENLAVDFLIKHGYEILRTNFRCRLGEIDIIAKEDKTIVFVEVKTRKSLKF 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF------TGNEVEWIKDAF 127 G + SV KQ + + A ++A H S D RFDVV ++ IKDAF Sbjct: 61 GLPSESVNFKKQLHIKKVAEYFIAYH-LSQDKYLYRFDVVEIFIDGKNNVTKINLIKDAF 119 >UniRef50_Q0VS15 UPF0102 protein ABO_0585 n=2 Tax=Alcanivorax RepID=Y585_ALCBS Length = 125 Score = 137 bits (347), Expect = 7e-32, Method: Composition-based stats. Identities = 48/119 (40%), Positives = 69/119 (57%), Gaps = 6/119 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + K TG E +A +WL G+GL + N + R GEIDLI+ + T +F EVR+R+ Y Sbjct: 5 RSKKNTGRDAEKRAAKWLTGQGLSIVERNFHCRQGEIDLILLDQETLVFTEVRWRKHQSY 64 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-----VEWIKDAF 127 GGA ASV + KQ +L+ A+ +LARH CRFDV+ + +WI++AF Sbjct: 65 GGALASVDQHKQRRLINAAQHFLARHPEH-HHRPCRFDVLGMEPDSQQAVLYQWIQNAF 122 >UniRef50_Q1Q244 Putative uncharacterized protein n=1 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q244_9BACT Length = 149 Score = 137 bits (347), Expect = 8e-32, Method: Composition-based stats. Identities = 37/127 (29%), Positives = 63/127 (49%), Gaps = 8/127 (6%) Query: 7 RSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVR 66 + S Q K G E A ++L+ KG + + N + GEID+I + + +FVEV+ Sbjct: 20 KKTSDVQPHKKALGKKGEVVAAKFLKKKGYKILQRNYRRKTGEIDIICYDRGSIVFVEVK 79 Query: 67 YRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF------TGNEV 120 R S YG +VT +K+ ++++ A ++A + +D RFDVV+ + Sbjct: 80 TRGSDSYGPPELAVTEAKKKQIIKMASRYIAEKKV--EGIDLRFDVVSVFYPPAKKHPAI 137 Query: 121 EWIKDAF 127 K+AF Sbjct: 138 TLYKNAF 144 >UniRef50_Q67PD3 UPF0102 protein STH1475 n=1 Tax=Symbiobacterium thermophilum RepID=Y1475_SYMTH Length = 118 Score = 137 bits (347), Expect = 8e-32, Method: Composition-based stats. Identities = 46/119 (38%), Positives = 64/119 (53%), Gaps = 7/119 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 +++ G+A E A +L G R IA NV R GEIDLI ++G +FVEV+ RR YG Sbjct: 2 SRRVGEAGEQAAAEFLTASGYRIIARNVRFRSGEIDLIAQDGGVLVFVEVKTRRGRRYGT 61 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-----EVEWIKDAFND 129 +VT +KQ +L + A L+LAR + CRFDVV I++AF+ Sbjct: 62 PGEAVTAAKQRRLARLASLYLARLGS--EPPPCRFDVVEVEPGPDGRLRCRLIQNAFHA 118 >UniRef50_C4GB01 Putative uncharacterized protein n=1 Tax=Shuttleworthia satelles DSM 14600 RepID=C4GB01_9FIRM Length = 116 Score = 137 bits (347), Expect = 8e-32, Method: Composition-based stats. Identities = 41/115 (35%), Positives = 61/115 (53%), Gaps = 1/115 (0%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + ++ G +E +A +L G+GL + N + R GEIDL+ REG +FVEV+YRRS Sbjct: 3 SVNRRKEGSFYERRAGDYLTGQGLTLVEFNFSCRLGEIDLVAREGTCLVFVEVKYRRSRR 62 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAF 127 +G +V SK + + A + H T RFD+VA G E+ + AF Sbjct: 63 FGLPEEAVGPSKMRTIRKVAGYYCLTHGI-CQTTPVRFDLVAIEGEEIRHYRGAF 116 >UniRef50_A4BQJ8 Putative uncharacterized protein n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BQJ8_9GAMM Length = 119 Score = 137 bits (347), Expect = 9e-32, Method: Composition-based stats. Identities = 49/118 (41%), Positives = 67/118 (56%), Gaps = 4/118 (3%) Query: 12 RQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSA 71 R + G EA+A +L+ +GLR + N + R GEIDLIM + +FVEVR R + Sbjct: 3 RGPNPRTLGKQAEARALEFLQRRGLRCLQRNFHTRLGEIDLIMEDTGEVVFVEVRQRATK 62 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG-NEVEWIKDAFN 128 +GGA SVT K+ +L+ AR +L H CRFDV+A G +EWI+DAF Sbjct: 63 RFGGALESVTPVKRQRLIAAARYYLLTHAP---NAACRFDVIAIDGQGSIEWIRDAFQ 117 >UniRef50_B0G5Y9 Putative uncharacterized protein n=4 Tax=Clostridiales RepID=B0G5Y9_9FIRM Length = 122 Score = 137 bits (346), Expect = 1e-31, Method: Composition-based stats. Identities = 45/119 (37%), Positives = 64/119 (53%), Gaps = 6/119 (5%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + ++TG +E +A +LE G + + N R GEIDLI R+G +FVEV+YR + + Sbjct: 3 KKNNRRTGTGYERKAGAYLESLGYKIVTYNYRCRLGEIDLIARDGEYLVFVEVKYRTTGV 62 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG----NEVEWIKDAF 127 G A +V KQ + + A +L + D V CRFDVVA G E+ KDAF Sbjct: 63 SGYPAEAVDARKQQTIAKCAMHFLMKQGN--DDVPCRFDVVAIAGAEGQEEITLYKDAF 119 >UniRef50_Q2S9Y0 UPF0102 protein HCH_05895 n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Y5895_HAHCH Length = 124 Score = 137 bits (346), Expect = 1e-31, Method: Composition-based stats. Identities = 42/123 (34%), Positives = 67/123 (54%), Gaps = 5/123 (4%) Query: 9 GSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYR 68 R + + G A E+QA ++ +G + N +GGEIDLI R G +F+EVR+R Sbjct: 2 PFKRLIKSIDIGRAAESQAEKFARAQGFTIVERNFRCKGGEIDLIARHGEHLVFIEVRHR 61 Query: 69 RSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVV---AFTGNEVEWIKD 125 S +G AA S+T+ KQ +++ A ++L + + CRFDV+ + +WI D Sbjct: 62 SSDKFGSAAESITQKKQQRIILAANIYLQKKG--LTNMPCRFDVIVGNLKSNTGFQWIPD 119 Query: 126 AFN 128 AF+ Sbjct: 120 AFS 122 >UniRef50_A1AN88 UPF0102 protein Ppro_1186 n=11 Tax=Deltaproteobacteria RepID=Y1186_PELPD Length = 140 Score = 137 bits (346), Expect = 1e-31, Method: Composition-based stats. Identities = 39/128 (30%), Positives = 60/128 (46%), Gaps = 9/128 (7%) Query: 8 SGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRT--TIFVEV 65 S + + TG E A +L +G R + N +GGE+D++ R +FVEV Sbjct: 12 PSSTARPDNRNTGSRGEEIATSFLGQQGYRILERNFRCKGGELDIVARAPGERSLVFVEV 71 Query: 66 RYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-----V 120 + RR YG +VT KQ ++ + A WL+R+ RFDV+A + + Sbjct: 72 KTRRDRSYGPPQLAVTPFKQRQISKAALTWLSRN--HLHDSQARFDVIAILLEDGGRHSI 129 Query: 121 EWIKDAFN 128 E I +AF Sbjct: 130 EHIVNAFE 137 >UniRef50_C0GQX2 Putative uncharacterized protein n=1 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GQX2_9DELT Length = 132 Score = 137 bits (346), Expect = 1e-31, Method: Composition-based stats. Identities = 40/119 (33%), Positives = 55/119 (46%), Gaps = 5/119 (4%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 G E AR +L G R N RGGE+DL+ G +FVEV+ R Sbjct: 2 SAHNLNLGRYGEEVARDYLTENGYRIKERNWRARGGELDLVCTCGDCIVFVEVKTRAEEG 61 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE---VEWIKDAFN 128 G S+ +Q KLL+TA L+L+RHN + RFD + T +E I++A Sbjct: 62 MGHPLESLGFKQQKKLLRTAGLYLSRHNM--WSSQSRFDFICVTVGREVQIEHIQNAIE 118 >UniRef50_C6A8H5 Putative uncharacterized protein n=4 Tax=Bifidobacterium animalis subsp. lactis RepID=C6A8H5_BIFLB Length = 158 Score = 137 bits (346), Expect = 1e-31, Method: Composition-based stats. Identities = 41/119 (34%), Positives = 57/119 (47%), Gaps = 5/119 (4%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIM-REGRTTIFVEVRYRRSAL 72 LT KQ G E R WL +A N + R GEID+I T +FVEV+ RRS Sbjct: 40 LTAKQIGSLGERLCRAWLIEHHWHVLACNWHCRFGEIDIIALTSHSTIVFVEVKTRRSTS 99 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE----VEWIKDAF 127 G +V +KQ ++ + A WL H + + RFDV+A T + + +AF Sbjct: 100 CGIPEEAVHAAKQMRVRRAAICWLGEHGSTIRHIGVRFDVIAVTVTPTDVFIHHVPEAF 158 >UniRef50_B5YFD1 UPF0102 protein DICTH_1420 n=2 Tax=Dictyoglomus RepID=Y1420_DICT6 Length = 118 Score = 137 bits (345), Expect = 1e-31, Method: Composition-based stats. Identities = 32/120 (26%), Positives = 61/120 (50%), Gaps = 8/120 (6%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + K+ G E +L +G + N GE+D+I ++G IF+EV+ RR+ + Sbjct: 1 MNNKEIGKLGEDFTIDFLNKRGFIILERNYKVPLGEVDIIAQKGDLLIFIEVKTRRNLDF 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE------VEWIKDAF 127 G A +V R+KQ ++ + A L+++ F RFD+++ ++ E++ +AF Sbjct: 61 GIPAEAVDRTKQTRIKKIAELYISTKKPKFKK--IRFDIMSIILSKSGKILDWEYLINAF 118 >UniRef50_C7RDQ1 Putative uncharacterized protein n=1 Tax=Anaerococcus prevotii DSM 20548 RepID=C7RDQ1_ANAPD Length = 115 Score = 137 bits (345), Expect = 1e-31, Method: Composition-based stats. Identities = 36/114 (31%), Positives = 62/114 (54%), Gaps = 4/114 (3%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 K+ GD E +L+ K +A N + GEID++ + +FVEV+ R++A + Sbjct: 4 KKEFGDYGENLVEGYLKDKSYEILARNYRKPFGEIDIVAKLSDMIVFVEVKTRKNANFAS 63 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF--TGNEVEWIKDAF 127 A +VT SKQ K++Q ++ +L +N + + RFDV E+ +I++AF Sbjct: 64 PAEAVTPSKQRKVIQASQAFLIENNMT--DMLMRFDVAEVIADKGEINYIENAF 115 >UniRef50_B3EJJ5 UPF0102 protein Cphamn1_0017 n=1 Tax=Chlorobium phaeobacteroides BS1 RepID=Y017_CHLPB Length = 126 Score = 136 bits (344), Expect = 2e-31, Method: Composition-based stats. Identities = 39/122 (31%), Positives = 58/122 (47%), Gaps = 12/122 (9%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 G E A +L KG R + N R EID+I + RT F+EV+ R SA G Sbjct: 5 PHDLGRQGEHTAVTFLIEKGYRILQRNYRHRRNEIDIIALDRRTLCFIEVKTRSSASKGH 64 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN----------EVEWIKD 125 +VT KQ ++++ A +L+ + DCRFDV+A + ++E I + Sbjct: 65 PLEAVTPEKQKEIIRAATAYLSAYPSP--EPDCRFDVIAIIAHDFTNGRIREFKLEHITN 122 Query: 126 AF 127 AF Sbjct: 123 AF 124 >UniRef50_A5N821 UPF0102 protein CKL_1410 n=16 Tax=Clostridium RepID=Y1410_CLOK5 Length = 122 Score = 136 bits (344), Expect = 2e-31, Method: Composition-based stats. Identities = 35/119 (29%), Positives = 54/119 (45%), Gaps = 8/119 (6%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 K G E A+ +L G + N + GEID+I ++G F+EV+ R LYG Sbjct: 5 NKDIGSLGEDIAKNYLNQIGYTVLERNFRCKVGEIDIIGKDGDYICFIEVKSRYGKLYGN 64 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF------TGNEVEWIKDAFN 128 SV K+ K+ + A +++ R + RFDV+ ++ IKDAF Sbjct: 65 PCESVNYPKRLKIYKAANIYMLRKK--LFKFNFRFDVIEIIFNTYNDVPSIKLIKDAFQ 121 >UniRef50_Q7MNW2 UPF0102 protein VV0603 n=80 Tax=Vibrionales RepID=Y603_VIBVY Length = 122 Score = 136 bits (344), Expect = 2e-31, Method: Composition-based stats. Identities = 49/114 (42%), Positives = 76/114 (66%), Gaps = 2/114 (1%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 + + G+ +E+ A+ +L+ +GLRFI AN + GEIDLI +E +T +FVEV+YR+++ YG Sbjct: 5 SRRAIGNQYESLAKEYLQRQGLRFIEANFTTKVGEIDLIFKEAQTIVFVEVKYRKNSCYG 64 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF--TGNEVEWIKDA 126 AA V +K +KL++TA LWL +H + RFDVVA G+++ WI +A Sbjct: 65 DAAEMVNPAKANKLIKTAYLWLNKHGYNACNTAMRFDVVAIHSNGHDINWIANA 118 >UniRef50_A1ZF36 Putative uncharacterized protein n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZF36_9SPHI Length = 119 Score = 136 bits (344), Expect = 2e-31, Method: Composition-based stats. Identities = 34/116 (29%), Positives = 60/116 (51%), Gaps = 8/116 (6%) Query: 17 KQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGA 76 ++ G E A +++ KG + N + GEID+I + G +FVEV+ R S +G Sbjct: 6 QKKGKYGENLAAAFMQNKGYTLLERNYRYKRGEIDIIAQTGDVLVFVEVKLRSSDNFGLP 65 Query: 77 AASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT-----GNEVEWIKDAF 127 SV+ ++Q+ ++QTA ++ + D RFD+VA ++ + +DAF Sbjct: 66 EESVSENQQNLIIQTAEQYIEEIDWE---SDIRFDIVAIELKSHQSPQITYFEDAF 118 >UniRef50_D1CBL5 Putative uncharacterized protein n=1 Tax=Thermobaculum terrenum ATCC BAA-798 RepID=D1CBL5_THET1 Length = 125 Score = 136 bits (343), Expect = 2e-31, Method: Composition-based stats. Identities = 35/120 (29%), Positives = 51/120 (42%), Gaps = 6/120 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 +K G E A +L KG + IA N R GEID+I ++ +FVEV+ R S G Sbjct: 2 SKSLGRIGEDYACNFLLSKGYKLIARNWRCRQGEIDIIFQDKDEIVFVEVKTRSSLSLGT 61 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE------VEWIKDAFND 129 S+ K +LL A++W+ RFD V T + I++ Sbjct: 62 PEESIDMHKARQLLTLAKIWIFECYDGEKDPPVRFDAVTVTISRSGRVIDSNHIQNCIMP 121 >UniRef50_C9KJR5 Endonuclease n=2 Tax=Veillonellaceae RepID=C9KJR5_9FIRM Length = 132 Score = 136 bits (343), Expect = 2e-31, Method: Composition-based stats. Identities = 40/124 (32%), Positives = 61/124 (49%), Gaps = 6/124 (4%) Query: 9 GSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYR 68 + R + T G E A +LE G +A N GEID++ +GR FVEV+ R Sbjct: 10 NAERLMDTTTIGRQGEEAAAVFLERAGYEILARNFRTPRGEIDIVASKGRMLAFVEVKTR 69 Query: 69 RSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF----TGNEVEWIK 124 R+ +G AA+V KQ K++Q+A +L + + CRFDV+ ++E + Sbjct: 70 RTQRFGRPAAAVDYRKQQKIIQSAHWFLRQR--HLEGCLCRFDVIEIYRAGERWQIEHLP 127 Query: 125 DAFN 128 AF Sbjct: 128 GAFE 131 >UniRef50_A8SLV5 Putative uncharacterized protein n=1 Tax=Parvimonas micra ATCC 33270 RepID=A8SLV5_9FIRM Length = 114 Score = 136 bits (343), Expect = 3e-31, Method: Composition-based stats. Identities = 31/116 (26%), Positives = 56/116 (48%), Gaps = 4/116 (3%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + K G+ E A ++L KG + I N + GEID+I ++ +F+EV+ R++ + Sbjct: 1 MKAKDIGNLGEDMAVKFLLEKGYQIIERNFLKPFGEIDIIAKDKDFLVFIEVKARKNVNF 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF--TGNEVEWIKDAF 127 G V K K+ A++++ N RFDV+ ++ I++AF Sbjct: 61 GFPREFVNGIKIKKIQDVAQIYMMEKNLFGAK--IRFDVIEIIFDNYKITHIENAF 114 >UniRef50_B4RXI2 Sigma-54 factor n=2 Tax=Alteromonas macleodii RepID=B4RXI2_ALTMD Length = 113 Score = 135 bits (342), Expect = 3e-31, Method: Composition-based stats. Identities = 34/109 (31%), Positives = 60/109 (55%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 +K G+A E +A +L +GL N + R GE+D++M++G T + +EV+YR+ +G Sbjct: 2 SKLQGNAAEDKACEYLLQQGLTLRCRNYHTRRGELDIVMQDGNTIVCIEVKYRKQNRFGS 61 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIK 124 A VT K ++ +L + + + R DV+A G+ +EW+K Sbjct: 62 AVEFVTAKKLQRIQAAFGFYLLDNGLNPASTPLRIDVIAIDGDNLEWLK 110 >UniRef50_A3WNE6 Predicted endonuclease n=1 Tax=Idiomarina baltica OS145 RepID=A3WNE6_9GAMM Length = 116 Score = 135 bits (342), Expect = 3e-31, Method: Composition-based stats. Identities = 35/116 (30%), Positives = 63/116 (54%), Gaps = 2/116 (1%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 ++T++ G E+QA R+L +GL + N GEID+I R+ T +FVEV+ R+++ + Sbjct: 1 MSTRKRGLEGESQASRYLRQQGLVIVQHNFRVPCGEIDIICRDSDTWVFVEVKRRQNSDF 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE--VEWIKDAF 127 +T + ++ + A+ +L + RFD++ ++ VEW KDAF Sbjct: 61 ASILEQITTRQCQRIRRAAQYFLVEQTVNEYLAKMRFDIITINDSQVTVEWYKDAF 116 >UniRef50_B7RSM7 Putative uncharacterized protein n=1 Tax=marine gamma proteobacterium HTCC2148 RepID=B7RSM7_9GAMM Length = 124 Score = 135 bits (342), Expect = 3e-31, Method: Composition-based stats. Identities = 47/121 (38%), Positives = 69/121 (57%), Gaps = 7/121 (5%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 ++ KQ GD +E +A +L +G+ + N R GEIDLI R+ +F+EVR RR+ Sbjct: 3 GISMKQIGDEYERRAAHFLSQQGVEVLICNYRCRCGEIDLIARQNDYLVFIEVRARRNPR 62 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG------NEVEWIKDA 126 + AAASV KQ +LL+TA+ +L RH + CRFDV+ F + +WI+ A Sbjct: 63 FATAAASVDYRKQQRLLRTAQFFLQRH-TKLANLPCRFDVITFEPRQSTANDSPQWIRGA 121 Query: 127 F 127 F Sbjct: 122 F 122 >UniRef50_A6VB97 UPF0102 protein PSPA7_4996 n=17 Tax=Pseudomonadaceae RepID=Y4996_PSEA7 Length = 125 Score = 135 bits (341), Expect = 4e-31, Method: Composition-based stats. Identities = 41/123 (33%), Positives = 64/123 (52%), Gaps = 7/123 (5%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + ++ G E A L +GL + N R GE+DL+M +G T +FVEVR RR Sbjct: 4 RANSRDKGRQAEEMACAHLLRQGLATLGKNWTCRRGELDLVMLDGDTVVFVEVRSRRHRA 63 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT------GNEVEWIKDA 126 +GGA S+ K+ +L+ +A L+L + + CRFDVV ++WI++A Sbjct: 64 WGGALESIDARKRQRLILSAELFLQQ-EARWAKRPCRFDVVTVDTSDGQSPPRLDWIQNA 122 Query: 127 FND 129 F+ Sbjct: 123 FDA 125 >UniRef50_C6WYK5 Putative uncharacterized protein n=1 Tax=Methylotenera mobilis JLW8 RepID=C6WYK5_METML Length = 119 Score = 135 bits (341), Expect = 4e-31, Method: Composition-based stats. Identities = 49/111 (44%), Positives = 61/111 (54%), Gaps = 7/111 (6%) Query: 20 GDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAAS 79 G E A +L+ GL I N GEIDLIMR+G+T +FVEVR R + +GGA S Sbjct: 13 GQLAEQIAATFLQNNGLTVIEKNFRSAYGEIDLIMRDGKTLVFVEVRLRSNTKFGGAGMS 72 Query: 80 VTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVV---AFTGNEVEWIKDAF 127 + SKQ KL +TA +L + CRFD + A VEWIKDAF Sbjct: 73 INASKQQKLTRTAERYLQING----DSACRFDAILMHALDITTVEWIKDAF 119 >UniRef50_Q7P0B3 UPF0102 protein CV_0654 n=1 Tax=Chromobacterium violaceum RepID=Y654_CHRVO Length = 112 Score = 135 bits (340), Expect = 5e-31, Method: Composition-based stats. Identities = 51/111 (45%), Positives = 71/111 (63%), Gaps = 4/111 (3%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 Q G E +A LE +GL+ +A N + RGGEIDLIMR+G +FVEVR+R + +GG Sbjct: 1 MNQAGRDAEDRALALLEKRGLKLVARNWHCRGGEIDLIMRDGDALVFVEVRHRGGSRFGG 60 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFD-VVAFTGNEVEWIKD 125 AA S+T +KQ KLL A ++L+ HN CRFD VV+ G+ +W+K+ Sbjct: 61 AADSITAAKQRKLLLAAEVYLSSHNI---DSPCRFDAVVSVGGDAPQWLKN 108 >UniRef50_B2KC57 Putative uncharacterized protein n=1 Tax=Elusimicrobium minutum Pei191 RepID=B2KC57_ELUMP Length = 122 Score = 135 bits (340), Expect = 5e-31, Method: Composition-based stats. Identities = 39/117 (33%), Positives = 64/117 (54%), Gaps = 7/117 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGR-----TTIFVEVRYRRS 70 + G E A +L+ G + IA N + GE+D+I +G T +F+EV+ R Sbjct: 1 MNKLGVESENAAANFLKKNGYKIIARNYAVQTGEVDIIASQGGLLKQKTLVFIEVKGRAY 60 Query: 71 ALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAF 127 YGG A+VT++KQ+K++ A +++ + FD RFDVV ++E I++AF Sbjct: 61 KAYGGPLAAVTKAKQNKIISAATIYVKENFPKFD--SIRFDVVTVVDGKIEHIENAF 115 >UniRef50_Q1GYY7 UPF0102 protein Mfla_2283 n=1 Tax=Methylobacillus flagellatus KT RepID=Y2283_METFK Length = 113 Score = 134 bits (339), Expect = 6e-31, Method: Composition-based stats. Identities = 54/117 (46%), Positives = 72/117 (61%), Gaps = 7/117 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 KQ GD EA A R+L +GL IA N R GEIDL+M++G T +FVEVR R A +GG Sbjct: 1 MKQLGDDAEALAERYLIKQGLVVIARNYRCRFGEIDLVMKQGATIVFVEVRMRSHATFGG 60 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF---TGNEVEWIKDAFND 129 AAAS+ +K+ KL+ TA +L RH + CRFD + + +EWI+DAF+ Sbjct: 61 AAASIHAAKRQKLILTAEHFLQRHG----SAPCRFDAILLSKRDADGIEWIQDAFSA 113 >UniRef50_A8G183 UPF0102 protein Ssed_4252 n=16 Tax=Shewanella RepID=Y4252_SHESH Length = 117 Score = 134 bits (339), Expect = 7e-31, Method: Composition-based stats. Identities = 40/110 (36%), Positives = 65/110 (59%), Gaps = 3/110 (2%) Query: 18 QTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAA 77 + G A E A +L +GL FI NV + GEIDL+M+ G+ IFVEV+YR + YGGA Sbjct: 11 EHGQAGENLAMNYLLEQGLTFIERNVRFKFGEIDLVMKNGKEWIFVEVKYRSKSQYGGAI 70 Query: 78 ASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAF 127 +++ + +L + A ++ +N CRFD++A +++W+ +AF Sbjct: 71 NALSSGQIKRLRRAAEHYMQLNNI---DAICRFDLIAVDAGQIQWLPNAF 117 >UniRef50_Q0TPP8 UPF0102 protein CPF_1959 n=9 Tax=Clostridium perfringens RepID=Y1959_CLOP1 Length = 122 Score = 134 bits (338), Expect = 9e-31, Method: Composition-based stats. Identities = 36/119 (30%), Positives = 57/119 (47%), Gaps = 8/119 (6%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 K G E + ++L+ +G + N N GEID+I + F+EV+ R S +G Sbjct: 5 NKSIGFYGEDLSAKFLKKEGYSILEKNFNCSSGEIDIIAIKDEIISFIEVKSRFSNSFGN 64 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN------EVEWIKDAFN 128 SVT SKQ +++ A+ +L H RFDV+ + E+ ++KDAF Sbjct: 65 PKESVTCSKQGRIINAAKYYL--HVKKLYNYYIRFDVIEVNFHIDSSKYELNFLKDAFR 121 >UniRef50_Q31EY6 UPF0102 protein Tcr_1695 n=1 Tax=Thiomicrospira crunogena XCL-2 RepID=Y1695_THICR Length = 120 Score = 134 bits (338), Expect = 9e-31, Method: Composition-based stats. Identities = 46/115 (40%), Positives = 72/115 (62%), Gaps = 4/115 (3%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE-GRTTIFVEVRYRRSALYG 74 +++ G E QA WL+ + + +A N +GGEIDLI + T IF EV+YR+S+ +G Sbjct: 5 SQKIGQQKEQQAAVWLKTQAITIVAQNFRCKGGEIDLIGLDTDDTLIFFEVKYRQSSTFG 64 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN--EVEWIKDAF 127 A+ SVT KQ +L+Q A+ +L +H ++ + RFDV+ F N + EW++DAF Sbjct: 65 TASESVTPQKQQRLIQCAQNFLQKHP-NYQACNMRFDVLFFEDNQTQPEWLQDAF 118 >UniRef50_D0MIM6 Putative uncharacterized protein n=1 Tax=Rhodothermus marinus DSM 4252 RepID=D0MIM6_RHOM4 Length = 127 Score = 134 bits (337), Expect = 1e-30, Method: Composition-based stats. Identities = 41/126 (32%), Positives = 58/126 (46%), Gaps = 14/126 (11%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIM-------REGRTTIFVEVR 66 + T+ G E A +LE +G R +A EIDL+ +G +FVEV+ Sbjct: 1 MDTRTIGTRGEDLAAAYLEQQGYRILARQYRFERAEIDLVCFEPAPRPEDGGEIVFVEVK 60 Query: 67 YRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT-----GNEVE 121 RR +G +VT KQ L++ AR +L H CRFDV+A E+E Sbjct: 61 TRRGLGFGRPEEAVTPEKQRHLIRAARAYLYEH--HLQRARCRFDVIAIVLHDDRPPEIE 118 Query: 122 WIKDAF 127 +DAF Sbjct: 119 HFRDAF 124 >UniRef50_B8DRI1 Putative uncharacterized protein n=2 Tax=Desulfovibrio RepID=B8DRI1_DESVM Length = 146 Score = 134 bits (337), Expect = 1e-30, Method: Composition-based stats. Identities = 39/135 (28%), Positives = 64/135 (47%), Gaps = 9/135 (6%) Query: 4 VPTRSGSPRQLTTK---QTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTT 60 P R+ P + G E A R L +GLR +A N G E+D++ + T Sbjct: 2 TPPRAAPPTTASATGNAAIGARGEEAAARLLAQRGLRVLARNWRHGGLELDIVCDDRGTL 61 Query: 61 IFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-- 118 +FVEV+ R ++ ++T +K+ KL++ AR +LA H+ CRFD+V + Sbjct: 62 VFVEVKTRAASGPARPDEALTTAKRGKLVRAARQYLAAHD--CWDKPCRFDLVCVVHDGA 119 Query: 119 --EVEWIKDAFNDHS 131 +E AF+ + Sbjct: 120 TLTLEHYPHAFDLTA 134 >UniRef50_A4J649 UPF0102 protein Dred_2035 n=1 Tax=Desulfotomaculum reducens MI-1 RepID=Y2035_DESRM Length = 122 Score = 134 bits (337), Expect = 1e-30, Method: Composition-based stats. Identities = 35/121 (28%), Positives = 58/121 (47%), Gaps = 8/121 (6%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREG-RTTIFVEVRYRRSA 71 + K G+ E +A ++++ G + N + GE+D+I + +F+EVR R Sbjct: 2 SIQRKALGNKGEEEACKYIQNLGYNIMERNYRCKIGELDIIAWDPVGMLVFLEVRSRSGR 61 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN------EVEWIKD 125 +G SV KQ+KL A+ +L F + CRFDV+ N E++ IK+ Sbjct: 62 AFGVPEESVNYRKQNKLRMLAQQFLLTK-SEFAKISCRFDVIGVYFNKEGSVQEIKHIKN 120 Query: 126 A 126 A Sbjct: 121 A 121 >UniRef50_D0KYK3 Putative uncharacterized protein n=1 Tax=Halothiobacillus neapolitanus c2 RepID=D0KYK3_HALNC Length = 165 Score = 133 bits (336), Expect = 2e-30, Method: Composition-based stats. Identities = 50/141 (35%), Positives = 69/141 (48%), Gaps = 13/141 (9%) Query: 1 MATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTT 60 M + + TT G E A +L +GL+ I NV GEIDLIM++G T Sbjct: 25 MRGTDAELPNAKAQTTLARGHRAETMAAEYLSRQGLKLIDRNVRAGRGEIDLIMQDGATL 84 Query: 61 IFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-- 118 +FVEVR R++ + AA SV+ +K+ K+++TA L + CRFDVVA Sbjct: 85 VFVEVRARKAGAWVSAAESVSPAKRKKIIETAERLLNEKPV-WRKSPCRFDVVAIGLPSE 143 Query: 119 ----------EVEWIKDAFND 129 EV WI+DAF Sbjct: 144 SSSEPAAKQAEVNWIQDAFQA 164 >UniRef50_A8PP71 Putative uncharacterized protein n=1 Tax=Rickettsiella grylli RepID=A8PP71_9COXI Length = 130 Score = 133 bits (336), Expect = 2e-30, Method: Composition-based stats. Identities = 42/127 (33%), Positives = 68/127 (53%), Gaps = 14/127 (11%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + T++ G E+ + +L + L+ I N GEIDLIM++ +F+EVRYR+S + Sbjct: 1 MNTQKLGHHIESLVQDYLRRQKLKRITRNFRCCFGEIDLIMKDKNVLVFIEVRYRQSLQF 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG-------------NEV 120 G + S+ KQ+K+++ A +L+ S + + CRFDVV +V Sbjct: 61 GNSLESIHAMKQNKIMKAAEYYLSSQRLS-EKIACRFDVVGVKPITQKLLAVSKLDSAQV 119 Query: 121 EWIKDAF 127 EWIK+AF Sbjct: 120 EWIKNAF 126 >UniRef50_C0YUE8 Possible endonuclease n=2 Tax=Flavobacteriaceae RepID=C0YUE8_9FLAO Length = 125 Score = 133 bits (335), Expect = 2e-30, Method: Composition-based stats. Identities = 29/121 (23%), Positives = 53/121 (43%), Gaps = 8/121 (6%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 G E A +L+ G + + N + EID+I + I VEV+ R + + Sbjct: 4 ANHNDFGKMAEDLAVEYLKKCGYKILVRNFRFQKAEIDVIAEKDNQIIVVEVKARSTDAF 63 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-----VEWIKDAFN 128 +VT++K ++ A ++ N + RFD+++ +E +E I+DAF Sbjct: 64 MLPQEAVTKTKIKSIVSAANHYMEEFNK---DNEVRFDIISVLPDENKNLIIEHIEDAFE 120 Query: 129 D 129 Sbjct: 121 A 121 >UniRef50_C8W5C0 Putative uncharacterized protein n=1 Tax=Desulfotomaculum acetoxidans DSM 771 RepID=C8W5C0_DESAS Length = 119 Score = 132 bits (334), Expect = 3e-30, Method: Composition-based stats. Identities = 45/121 (37%), Positives = 62/121 (51%), Gaps = 9/121 (7%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + + G E+ A R+L KG I N R GEID+I RE T+FVEVR R + Sbjct: 2 TVKKQLLGRLGESVAARYLYSKGFIIIHQNFRCRLGEIDIIAREKGVTVFVEVRSRCGSS 61 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE------VEWIKDA 126 YG SV KQ KL + A+ ++AR+ + D RFDVVA + +E ++A Sbjct: 62 YGLPQESVVIKKQVKLRKLAQYYIARYALTGD---FRFDVVAVMFEQDNSIKLIEHFRNA 118 Query: 127 F 127 F Sbjct: 119 F 119 >UniRef50_UPI0000510419 hypothetical protein BlinB_18076 n=1 Tax=Brevibacterium linens BL2 RepID=UPI0000510419 Length = 132 Score = 132 bits (333), Expect = 3e-30, Method: Composition-based stats. Identities = 34/117 (29%), Positives = 55/117 (47%), Gaps = 2/117 (1%) Query: 2 ATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTI 61 T RS + L + G E A +L+ +G+ I N GEID+I ++G T + Sbjct: 3 PTTGRRSATTSGLRQRALGQTGEDLAADFLQRQGMVIIERNFRCPRGEIDIIAKDGDTIV 62 Query: 62 FVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN 118 FVEV+ RR+ G +VT +K K+ + +WL++ F R D + + Sbjct: 63 FVEVKTRRTLAQGSPLEAVTAAKLRKIRTLSGIWLSQQKDFFA--SIRIDALGIVMD 117 >UniRef50_C6BVQ7 Putative uncharacterized protein n=1 Tax=Desulfovibrio salexigens DSM 2638 RepID=C6BVQ7_DESAD Length = 134 Score = 132 bits (333), Expect = 3e-30, Method: Composition-based stats. Identities = 35/121 (28%), Positives = 57/121 (47%), Gaps = 8/121 (6%) Query: 14 LTTKQT--GDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSA 71 ++ + G A E A +LE +G N + E+D+I + IFVEV+ R Sbjct: 1 MSPRHLDFGQAGEDYAACFLENRGYFLRQRNWRWKQWELDIICEKDDELIFVEVKTRAGR 60 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF----TGNEVEWIKDAF 127 +VT +K+ KL++ A +L+ + CRFD+V TG E I++AF Sbjct: 61 SAQSGIEAVTPAKRKKLVKAATRYLSAFD--LWERPCRFDLVIVNDDGTGFRAEHIENAF 118 Query: 128 N 128 + Sbjct: 119 D 119 >UniRef50_Q2BGY7 Putative uncharacterized protein n=1 Tax=Neptuniibacter caesariensis RepID=Q2BGY7_9GAMM Length = 119 Score = 132 bits (333), Expect = 3e-30, Method: Composition-based stats. Identities = 50/119 (42%), Positives = 72/119 (60%), Gaps = 5/119 (4%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + ++ G E A +L + LR I N N R GEIDLIM++G + +F+EVR R A + Sbjct: 1 MDRRKRGKDAEQHALVYLSKQKLRLIEQNFNCRFGEIDLIMQDGESIVFIEVRLRTHAEF 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT----GNEVEWIKDAFN 128 GGAAASVT +KQ K+++TA+L+L++ +CRFDV+A+ W KDAF Sbjct: 61 GGAAASVTTTKQRKIIKTAQLYLSKRP-RLQNKNCRFDVIAYEYDAAPTHPLWYKDAFR 118 >UniRef50_B2A2P1 UPF0102 protein Nther_1376 n=1 Tax=Natranaerobius thermophilus JW/NM-WN-LF RepID=Y1376_NATTJ Length = 119 Score = 132 bits (333), Expect = 3e-30, Method: Composition-based stats. Identities = 44/120 (36%), Positives = 58/120 (48%), Gaps = 7/120 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNE-RGGEIDLIMREGRTTIFVEVRYRRSAL 72 + K G E AR +L KG + I N R GEIDLI IFVEV+ R S L Sbjct: 1 MNNKSKGRTAEKIARIFLLSKGYQIIFQNYRFSRLGEIDLICCFDNILIFVEVKSRSSLL 60 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-----EVEWIKDAF 127 +G +V KQ +L + A ++L N F RFDV+A N E+ ++DAF Sbjct: 61 WGQPEEAVGYEKQGQLKKLANIFLYEFN-EFTEYQIRFDVIAILNNNKVKCEISHLRDAF 119 >UniRef50_D0I4Y5 Endonuclease n=1 Tax=Grimontia hollisae CIP 101886 RepID=D0I4Y5_VIBHO Length = 122 Score = 132 bits (333), Expect = 4e-30, Method: Composition-based stats. Identities = 57/114 (50%), Positives = 76/114 (66%), Gaps = 2/114 (1%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 L KQTGD +E QA R+LE +GL + N +GGE+DLIMRE +FVEV+YR+ A Y Sbjct: 4 LNRKQTGDHYENQACRFLERQGLTTLDKNARFKGGELDLIMREKSCIVFVEVKYRKQASY 63 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG--NEVEWIKD 125 GGAAA+++R KQ ++L+ A LW+A+ S + RFD V F G N V WIK+ Sbjct: 64 GGAAATISRQKQQRMLKAAYLWMAKKGLSATHTEFRFDAVTFEGSVNSVNWIKN 117 >UniRef50_Q1MRU7 UPF0102 protein LI0223 n=1 Tax=Lawsonia intracellularis PHE/MN1-00 RepID=Y223_LAWIP Length = 132 Score = 132 bits (332), Expect = 4e-30, Method: Composition-based stats. Identities = 33/119 (27%), Positives = 61/119 (51%), Gaps = 6/119 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + + + G E+ A +L KG+ + N + EIDL+ ++ +T +FVEVR R++ Sbjct: 1 MKSCEIGQQGESAAALFLYNKGMSILERNWRKGRFEIDLVCQDIKTLVFVEVRTRKAKGM 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN----EVEWIKDAFN 128 ++T SK+ ++ +A+L+L ++ CRFDV+ E+E K+ F Sbjct: 61 LLPEQTLTISKRCNIIHSAQLYLMDKKD--WSMPCRFDVICIISKKTTLELEHYKNVFE 117 >UniRef50_A0LV62 UPF0102 protein Acel_1550 n=8 Tax=Actinomycetales RepID=Y1550_ACIC1 Length = 132 Score = 132 bits (332), Expect = 4e-30, Method: Composition-based stats. Identities = 40/126 (31%), Positives = 58/126 (46%), Gaps = 13/126 (10%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 + G E A + L+ G+ +A N R GE+D++ R+G T + EV+ RR +G Sbjct: 7 AREALGRFGEELAAQHLQTLGMTILARNWRCRSGELDIVARDGYTLVVCEVKTRRGVGFG 66 Query: 75 GAAASVTRSKQHKLLQTARLWLARH--------NGSFDTVDCRFDVVAF-----TGNEVE 121 SVT K +L Q A WL H G+ RFDVVA G +E Sbjct: 67 EPLESVTPRKAARLRQLAVAWLTEHAATRVDTTEGTHGYTAVRFDVVAILHRKEDGPTIE 126 Query: 122 WIKDAF 127 +++ AF Sbjct: 127 YVRGAF 132 >UniRef50_A5EVA6 UPF0102 protein DNO_0639 n=2 Tax=Cardiobacteriaceae RepID=Y639_DICNV Length = 126 Score = 132 bits (332), Expect = 4e-30, Method: Composition-based stats. Identities = 46/119 (38%), Positives = 67/119 (56%), Gaps = 6/119 (5%) Query: 11 PRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRS 70 +++TTK+ G E A +L GL +A NV R GEIDLI ++ R +FVEVR RR+ Sbjct: 7 NKKMTTKKRGQYGELLAADYLTAHGLNIVAKNVYSRYGEIDLIAQDDRVLVFVEVRLRRA 66 Query: 71 ALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF----TGNEVEWIKD 125 AA S+T K + Q+A+ +L ++ DCRFD V T +E+EW+K+ Sbjct: 67 QALVSAAESITPEKLRRCYQSAQDYLQKNYAV--PPDCRFDAVLITQYQTHHEIEWLKN 123 >UniRef50_A4FME3 UPF0102 protein SACE_6045 n=2 Tax=Actinomycetales RepID=Y6045_SACEN Length = 133 Score = 132 bits (332), Expect = 4e-30, Method: Composition-based stats. Identities = 37/118 (31%), Positives = 53/118 (44%), Gaps = 7/118 (5%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 G E A R+LE G+ + N GE+D++ +G T IF EV+ R YG Sbjct: 18 RRHALGVEGERLAARFLEEHGITVLERNWRCDRGELDIVATDGETVIFCEVKARSGVDYG 77 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-----EVEWIKDAF 127 +V+ K L AR WL+ N + T RFDVV+ +E ++ AF Sbjct: 78 APLNAVSPHKVRHLRALARTWLSERNLTGCTA--RFDVVSVLWPPGRPARIEHLEGAF 133 >UniRef50_C2LNN6 Possible endonuclease n=3 Tax=Proteus RepID=C2LNN6_PROMI Length = 125 Score = 132 bits (332), Expect = 5e-30, Method: Composition-based stats. Identities = 52/115 (45%), Positives = 73/115 (63%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 +T G +E +A +L +GL+ I NV GEIDLIM+ RT IFVEVR+RRSA +G Sbjct: 6 STYLVGQYYERKALNYLRQQGLKLIERNVRYPCGEIDLIMQGNRTWIFVEVRFRRSAQFG 65 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAFND 129 A +SVT SK+ +L A WLA+ S +TV+CRFD+ AF ++ W+K+ + Sbjct: 66 DAISSVTYSKRRRLWYAANCWLAQRQQSIETVNCRFDICAFDQRQLIWLKNILDH 120 >UniRef50_C9LW74 Endonuclease n=1 Tax=Selenomonas sputigena ATCC 35185 RepID=C9LW74_9FIRM Length = 121 Score = 132 bits (332), Expect = 5e-30, Method: Composition-based stats. Identities = 37/119 (31%), Positives = 60/119 (50%), Gaps = 6/119 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 +TTK GD E A ++LE +G R + + GEID+I + +F+EV+ RR + Sbjct: 1 MTTKSFGDRGEDLAAQYLEKRGCRILERQFRAKTGEIDIIAEDRGALLFIEVKTRRPTRF 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT----GNEVEWIKDAFN 128 G A +V +KQ ++ +TA L++ + CRFDV+ V ++AF Sbjct: 61 GAPAQAVGYTKQRRIFRTALLYMQKRAIGERF--CRFDVLEVLVMGGSYTVNHYENAFE 117 >UniRef50_C3W9T3 Endonuclease n=4 Tax=Fusobacterium RepID=C3W9T3_FUSMR Length = 120 Score = 131 bits (331), Expect = 6e-30, Method: Composition-based stats. Identities = 32/111 (28%), Positives = 59/111 (53%), Gaps = 2/111 (1%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 ++ GD +E +A + L +G + + N + GEID+I + T +FVEV+YR++ YG Sbjct: 3 NNREIGDKYEEKAVKLLISRGYKILERNYRVKAGEIDIIAKFEDTIVFVEVKYRKTLKYG 62 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 +V K ++ A+++L + RFD ++F G ++ W K+ Sbjct: 63 YGLEAVDYRKIRRIYNAAKVYLTLNKKLSSK--IRFDCISFLGEKISWTKN 111 >UniRef50_Q3IG11 UPF0102 protein PSHAa2523 n=3 Tax=Alteromonadales RepID=Y2523_PSEHT Length = 123 Score = 131 bits (331), Expect = 6e-30, Method: Composition-based stats. Identities = 39/117 (33%), Positives = 66/117 (56%), Gaps = 5/117 (4%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 +++ G +E QA+++L +GL I N GE+D+IM++G T +FVEV++R++ Sbjct: 9 QNSREKGQYYELQAQKYLVSQGLTAIERNYYCPFGELDVIMKDGNTLVFVEVKFRKNHAR 68 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN---EVEWIKDAF 127 GGA +++ KQ +L ++ +LA N R D VA TG + W+K+ F Sbjct: 69 GGANYALSIQKQARLKRSIYHYLAAKN--LTNQPLRIDYVAITGEPSMHINWLKNVF 123 >UniRef50_A1SB01 UPF0102 protein Sama_3355 n=5 Tax=Shewanella RepID=Y3355_SHEAM Length = 108 Score = 131 bits (330), Expect = 7e-30, Method: Composition-based stats. Identities = 47/108 (43%), Positives = 66/108 (61%), Gaps = 3/108 (2%) Query: 20 GDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAAS 79 G E +A + L GLR A NV GEIDL+MREGR +FVEV++R +G A + Sbjct: 4 GQLAEDRAMKHLCAHGLRLEARNVRYPFGEIDLVMREGRVYVFVEVKFRTPKGFGDAVQA 63 Query: 80 VTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAF 127 ++ ++Q +L + A +L H CRFD+VA TG+++EWIKDAF Sbjct: 64 LSAAQQQRLRRAATHYLQCHRI---DAPCRFDMVAITGDKLEWIKDAF 108 >UniRef50_A3M3I7 UPF0102 protein A1S_1049 n=12 Tax=Acinetobacter RepID=Y1049_ACIBT Length = 133 Score = 131 bits (330), Expect = 7e-30, Method: Composition-based stats. Identities = 44/128 (34%), Positives = 68/128 (53%), Gaps = 17/128 (13%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 +Q G E A + L+ + ++A+N + R GE+DLI++ G IFVEV+ R YG Sbjct: 4 AQQLGQWAEQTALKLLKEQNYEWVASNYHSRRGEVDLIVKRGNELIFVEVKARGQGNYGQ 63 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE---------------- 119 A VT SKQ K+++TA +L R+ S+ CRFDV+ F + Sbjct: 64 ACEMVTLSKQKKIIKTAMRFLQRYP-SYQDFYCRFDVICFDFPQKIAKTVQQDFSKFHYD 122 Query: 120 VEWIKDAF 127 ++WI++AF Sbjct: 123 LQWIENAF 130 >UniRef50_C8PPR7 HD domain protein n=1 Tax=Treponema vincentii ATCC 35580 RepID=C8PPR7_9SPIO Length = 462 Score = 131 bits (330), Expect = 8e-30, Method: Composition-based stats. Identities = 39/126 (30%), Positives = 63/126 (50%), Gaps = 8/126 (6%) Query: 12 RQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSA 71 +++T ++ G A EA A +WLE G IA N R GEID+I + T IF EV+ Sbjct: 337 KKMTEERLGPAGEAFAAKWLERNGYSVIARNWRTRTGEIDIIAEKNETLIFFEVKTLPHT 396 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-------EVEWIK 124 + V KQ ++ +TA+ +L H ++ + RFDV+ + E ++ Sbjct: 397 AFTDLDIIVGNRKQERICKTAKYFLLTHR-KYNKMHIRFDVLVLPFDPRTTEEAEPVHLE 455 Query: 125 DAFNDH 130 +AF D+ Sbjct: 456 NAFEDY 461 >UniRef50_A1KWG5 UPF0102 protein NMC2069 n=28 Tax=Neisseriaceae RepID=Y2069_NEIMF Length = 115 Score = 131 bits (330), Expect = 8e-30, Method: Composition-based stats. Identities = 40/111 (36%), Positives = 64/111 (57%), Gaps = 3/111 (2%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 + G+A E A +L+ +G +A N + GEIDLI++ G +FVEV+YR++ +GG Sbjct: 4 NHKQGEAGEDAALAFLQSQGCTLLARNWHCAYGEIDLIVKNGGMILFVEVKYRKNRQFGG 63 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-VEWIKD 125 AA S++ SK KL ++ +L ++ V CR D V G+ EWI++ Sbjct: 64 AAYSISPSKLLKLQRSVEYYLQQNR--LTNVPCRLDAVLIEGSRPPEWIQN 112 >UniRef50_B3JMU0 Putative uncharacterized protein n=6 Tax=Bacteroidales RepID=B3JMU0_9BACE Length = 129 Score = 130 bits (329), Expect = 1e-29, Method: Composition-based stats. Identities = 29/126 (23%), Positives = 54/126 (42%), Gaps = 7/126 (5%) Query: 9 GSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYR 68 + + G E A +L KG N + E+D++ ++ I VEV+ R Sbjct: 5 AKNKMAKHNELGKEGENAAAEYLMSKGYSIRHRNWHSGKRELDIVAQKDGELIVVEVKTR 64 Query: 69 RSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE----VEWIK 124 R+ +G ++T K ++ + ++ R + RFD++ TG E +E I+ Sbjct: 65 RNEEFGKPEEAITDRKIRNIIISTDTYIKRFEI---DLPVRFDIITVTGTEPPFHIEHIQ 121 Query: 125 DAFNDH 130 +AF Sbjct: 122 EAFLPP 127 >UniRef50_A6TRS2 UPF0102 protein Amet_2739 n=1 Tax=Alkaliphilus metalliredigens QYMF RepID=Y2739_ALKMQ Length = 114 Score = 130 bits (329), Expect = 1e-29, Method: Composition-based stats. Identities = 40/115 (34%), Positives = 57/115 (49%), Gaps = 5/115 (4%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 +K G+ E ++LE KG R I N + GEID+I +G FVEV+ RRS YG Sbjct: 2 SKSLGELGERIIGQYLEKKGYRLIETNYRTKLGEIDIIAYKGTIIAFVEVKTRRSQSYGM 61 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN---EVEWIKDAF 127 +V KQ +L + A ++AR D RFDV ++ +I +AF Sbjct: 62 PCEAVNWQKQQRLHRVASHYIARKG--LINYDFRFDVAEVIIGKEKKIHYINNAF 114 >UniRef50_C4FFG3 Putative uncharacterized protein n=1 Tax=Bifidobacterium angulatum DSM 20098 RepID=C4FFG3_9BIFI Length = 151 Score = 130 bits (327), Expect = 2e-29, Method: Composition-based stats. Identities = 46/132 (34%), Positives = 63/132 (47%), Gaps = 6/132 (4%) Query: 1 MATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRT- 59 M T+ R ++++Q G EA A WLEG +A N + R GE+D+I Sbjct: 21 METIEARLAGN-AVSSRQVGALGEAYAAAWLEGFDWLVLARNWHCRYGELDIIALSPERR 79 Query: 60 TIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE 119 IFVEV+ RR +G +VT SKQ L + A WL R+ RFDVV + ++ Sbjct: 80 IIFVEVKTRRGVRFGTPQEAVTPSKQTNLRRAALQWLERNGHLLRHNGMRFDVVTVSVHD 139 Query: 120 ----VEWIKDAF 127 V I AF Sbjct: 140 GQVAVHRIPGAF 151 >UniRef50_Q1NMK2 Putative uncharacterized protein n=1 Tax=delta proteobacterium MLMS-1 RepID=Q1NMK2_9DELT Length = 120 Score = 130 bits (327), Expect = 2e-29, Method: Composition-based stats. Identities = 44/120 (36%), Positives = 61/120 (50%), Gaps = 6/120 (5%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + G E AR WLEG G R + AN GE+DL+ EG +FVEV+ RR Sbjct: 2 TRQRQGLGRRGEQLARDWLEGAGYRILEANCRTSSGELDLVAEEGGELVFVEVKSRRGDA 61 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE----VEWIKDAFN 128 +G +V R KQ ++++ AR +L+R RFDVVA T +E +K+AF Sbjct: 62 FGSPLEAVDRRKQARIIRCAREYLSRRRSH--GRPARFDVVAVTFTGGKPAIEVVKNAFE 119 >UniRef50_C7HUU3 Endonuclease n=4 Tax=Anaerococcus RepID=C7HUU3_9FIRM Length = 118 Score = 130 bits (327), Expect = 2e-29, Method: Composition-based stats. Identities = 33/117 (28%), Positives = 56/117 (47%), Gaps = 4/117 (3%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 + GD E A ++LE KG + + N + GEID+I + FVEV+ R++ + Sbjct: 4 KKRTIGDFGEEIALKYLEKKGYQILDRNFLKYYGEIDIIAIKNDILTFVEVKTRKNDEFK 63 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF--TGNEVEWIKDAFND 129 A+ V KQ ++ +TA+ ++ + FDV + +IK+AF D Sbjct: 64 PASLDVDYYKQERIKKTAQAYIMEKD--LGEFLISFDVCEVYLENKTIHYIKNAFGD 118 >UniRef50_Q1MYA7 Putative uncharacterized protein n=1 Tax=Bermanella marisrubri RepID=Q1MYA7_9GAMM Length = 124 Score = 129 bits (326), Expect = 2e-29, Method: Composition-based stats. Identities = 55/122 (45%), Positives = 76/122 (62%), Gaps = 3/122 (2%) Query: 8 SGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRY 67 S S + +TG +EAQA+++L +GL FI NVN + GE+DLIM+ + +FVEVRY Sbjct: 3 SNSDHSKSKIETGSFYEAQAKQFLVNQGLIFIEQNVNFKTGELDLIMKHNKHLVFVEVRY 62 Query: 68 RRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG--NEVEWIKD 125 R S YGGA S+T SKQ ++ + A+ +L +H CR DVVAF G + WIK+ Sbjct: 63 RSSQDYGGAVTSITASKQARVRRAAQTYLQKH-FGNRPPPCRIDVVAFEGANTKAIWIKN 121 Query: 126 AF 127 AF Sbjct: 122 AF 123 >UniRef50_D1U5W3 Putative uncharacterized protein n=1 Tax=Desulfovibrio aespoeensis Aspo-2 RepID=D1U5W3_9DELT Length = 130 Score = 129 bits (326), Expect = 2e-29, Method: Composition-based stats. Identities = 38/122 (31%), Positives = 63/122 (51%), Gaps = 6/122 (4%) Query: 11 PRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRS 70 ++ K GD E A R+LE +G+R + N R E+DL+ R+G T +FVEV+ R + Sbjct: 5 DKRTPAKWRGDLGEDAAARYLESRGMRVLDRNWRYRQWELDLVCRDGDTLVFVEVKTRVA 64 Query: 71 ALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN----EVEWIKDA 126 A + R+K+ +L++ A +L+ CRFD+ A +VE ++A Sbjct: 65 GSMSAPADGLGRAKRARLVKAAARYLSAKG--LWDEPCRFDLAAVVDTGVSMDVEHTENA 122 Query: 127 FN 128 F+ Sbjct: 123 FD 124 >UniRef50_C6D2Y6 Putative uncharacterized protein n=1 Tax=Paenibacillus sp. JDR-2 RepID=C6D2Y6_PAESJ Length = 122 Score = 129 bits (326), Expect = 2e-29, Method: Composition-based stats. Identities = 47/120 (39%), Positives = 63/120 (52%), Gaps = 9/120 (7%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRR-SALY 73 +QTG A E A R+LE +G I N R GEID+I T +FVEVR RR + Sbjct: 5 RRRQTGLAGETAACRYLEKEGYNVIERNWRCRSGEIDIIATIDHTLVFVEVRTRRTGGRF 64 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE------VEWIKDAF 127 G AA SV R KQ ++ A+++L ++ RFDV+A T + V+ IK AF Sbjct: 65 GTAAESVDRRKQQQVALVAQVYLRMRQLTY--PPMRFDVIAVTMDRNDSISEVKHIKAAF 122 >UniRef50_B3QZF2 UPF0102 protein Ctha_1382 n=1 Tax=Chloroherpeton thalassium ATCC 35110 RepID=Y1382_CHLT3 Length = 129 Score = 129 bits (326), Expect = 2e-29, Method: Composition-based stats. Identities = 37/116 (31%), Positives = 58/116 (50%), Gaps = 4/116 (3%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 T G E A +L+ G + + N EIDLI ++ FVEV+ R + YG Sbjct: 3 TNVAFGKKGEDMASAFLKKCGYQILRRNYRSGNNEIDLITKKDNIVAFVEVKTRHNLNYG 62 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAFNDH 130 A +VT SKQ +L++ A+ ++ + RFDVVA +E + ++AFN+ Sbjct: 63 HPAEAVTLSKQKELIKAAQNFINDNPSQGVDY--RFDVVAIILDESK--RNAFNEP 114 >UniRef50_C7N589 Predicted endonuclease related to Holliday junction resolvase n=2 Tax=Slackia RepID=C7N589_SLAHD Length = 167 Score = 129 bits (325), Expect = 3e-29, Method: Composition-based stats. Identities = 34/124 (27%), Positives = 57/124 (45%), Gaps = 8/124 (6%) Query: 12 RQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE-GRTTIFVEVRYRRS 70 R++ K+ G EA A L+ KG + N GE D+I + T +FVEV+ RR Sbjct: 44 REMDPKELGRRGEACACMLLDYKGYEILERNWKCPAGEADIIAIDENGTLVFVEVKTRRG 103 Query: 71 ALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-----VEWIKD 125 G ++ R+K+ + + A +L+++ + RFD +A + V I + Sbjct: 104 VENGLPEEAIGRAKRARYEKIAAYYLSQY--TGPDTALRFDTIALLVMDNYRALVRHIVN 161 Query: 126 AFND 129 AF Sbjct: 162 AFGQ 165 >UniRef50_A8ZV12 UPF0102 protein Dole_2298 n=2 Tax=Desulfobacteraceae RepID=Y2298_DESOH Length = 123 Score = 129 bits (325), Expect = 3e-29, Method: Composition-based stats. Identities = 40/117 (34%), Positives = 61/117 (52%), Gaps = 4/117 (3%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 +Q G E A R+L+ +G + N GEID+I ++ T FVEV+ RR+ YG Sbjct: 5 RQQYGRQGEQAAERFLKKEGYTIVCRNYRTPVGEIDIIAKDKTTLAFVEVKARRTESYGS 64 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE--VEWIKDAFNDH 130 S+T+ KQ K+ + A +L + RFDVV G + VE I++AF+ + Sbjct: 65 PRLSITKDKQRKITRAALWYLKDTGQAGARA--RFDVVIVQGRDNSVELIRNAFDAN 119 >UniRef50_B3PLA2 Putative uncharacterized protein n=1 Tax=Cellvibrio japonicus Ueda107 RepID=B3PLA2_CELJU Length = 126 Score = 129 bits (325), Expect = 3e-29, Method: Composition-based stats. Identities = 45/113 (39%), Positives = 65/113 (57%), Gaps = 5/113 (4%) Query: 20 GDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAAS 79 G EA+A+ +LE +GL N + GEIDLIM EG T +FVEVR R + + A S Sbjct: 13 GARAEARAQAYLEQQGLTTWMKNYRCKTGEIDLIMCEGDTLVFVEVRLRTNRFFSSAVES 72 Query: 80 VTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN----EVEWIKDAFN 128 +T +K+ K+++TA+ +L D CRFD++A + EWI+DAF Sbjct: 73 ITPAKRQKMIRTAQRFLQERGL-VDKHACRFDIIALDAKGQHAKPEWIRDAFG 124 >UniRef50_C4F8U2 Putative uncharacterized protein n=2 Tax=Collinsella RepID=C4F8U2_9ACTN Length = 158 Score = 129 bits (325), Expect = 3e-29, Method: Composition-based stats. Identities = 35/126 (27%), Positives = 53/126 (42%), Gaps = 13/126 (10%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIM--REGRTTIFVEVRYRRS 70 ++ K G E A R+LE +G I N GE DL+ ++ + VEV+ RRS Sbjct: 32 GMSNKLLGSLGEELAARYLEQRGYDIIDRNYRCPEGEADLVAYDQDDDGVVLVEVKTRRS 91 Query: 71 ALY---GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG-----NEVEW 122 +VT KQ + + A + A H + RFDV+ T E+ Sbjct: 92 RSERGGAYPEEAVTPEKQRRYRRIALCYAADH---YPVPSIRFDVIGVTLRPANIGEIRH 148 Query: 123 IKDAFN 128 + AF+ Sbjct: 149 LCGAFD 154 >UniRef50_C0GHM5 Putative uncharacterized protein n=1 Tax=Dethiobacter alkaliphilus AHT 1 RepID=C0GHM5_9FIRM Length = 112 Score = 129 bits (324), Expect = 3e-29, Method: Composition-based stats. Identities = 38/114 (33%), Positives = 50/114 (43%), Gaps = 4/114 (3%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 K G E A L G +A N GEID++ + G +FVEV+ RRS+ G Sbjct: 1 MKTLGQKGEELAVDHLRRAGYLILARNWRCERGEIDIVAKAGNILVFVEVKTRRSSRLGT 60 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG--NEVEWIKDAF 127 +V KQ KL A ++ + RFDV A N V IK+AF Sbjct: 61 PQEAVDFRKQEKLRHLAYRFINATGITAAEY--RFDVAAVNAKNNTVTIIKNAF 112 >UniRef50_A5D1I2 UPF0102 protein PTH_1707 n=1 Tax=Pelotomaculum thermopropionicum SI RepID=Y1707_PELTS Length = 120 Score = 129 bits (324), Expect = 3e-29, Method: Composition-based stats. Identities = 41/119 (34%), Positives = 59/119 (49%), Gaps = 8/119 (6%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 K G E A R+LE KG R ++ N R GE+DL++ +G +FVEVR R YG Sbjct: 4 ARKLLGRMGEEAAARYLEKKGCRILSRNHCCRLGELDLVVSDGDVLVFVEVRARTGEEYG 63 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN------EVEWIKDAF 127 A S+T K+ +L A +L + CRFDV+A + +E ++AF Sbjct: 64 LAQESITGRKKSRLRLLAWQYLKEKGKTGSM--CRFDVIAVLFDREGRVKRLEHFENAF 120 >UniRef50_Q0AFH8 UPF0102 protein Neut_1662 n=2 Tax=Proteobacteria RepID=Y1662_NITEC Length = 116 Score = 129 bits (324), Expect = 4e-29, Method: Composition-based stats. Identities = 53/116 (45%), Positives = 76/116 (65%), Gaps = 4/116 (3%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 +TK G E QA +L+ + L + N R GEIDLIM++G T +FVEVR R + L+G Sbjct: 3 STKNKGSDAEQQATIFLQQQQLTLLEKNYRCRFGEIDLIMQDGDTVVFVEVRMRVNQLFG 62 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-VEWIKDAFND 129 GAAAS+T +KQ KL + AR +LAR + + CRFD + +GN +EWI++AF++ Sbjct: 63 GAAASITPAKQLKLTRAARHYLARCD---EDFPCRFDAILISGNREIEWIQNAFDE 115 >UniRef50_A5Z6D1 Putative uncharacterized protein n=1 Tax=Eubacterium ventriosum ATCC 27560 RepID=A5Z6D1_9FIRM Length = 134 Score = 129 bits (324), Expect = 4e-29, Method: Composition-based stats. Identities = 31/115 (26%), Positives = 54/115 (46%), Gaps = 2/115 (1%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 L + G +E +L G + N + GEID+I ++ F+EV++R S Y Sbjct: 20 LNKRGRGSFYEDVCVEYLIKNGFDILHRNYRCKLGEIDIIAKKDDIIRFIEVKFRGSDSY 79 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAFN 128 G A +V KQ ++++ A +L + + V C FDV+ NE + + + Sbjct: 80 GSALEAVDFRKQRRIMRAASWFLNEYG--LNDVQCSFDVMTVENNEARYYFNCYG 132 >UniRef50_A6L1J0 UPF0102 protein BVU_1879 n=26 Tax=Bacteroidales RepID=Y1879_BACV8 Length = 121 Score = 129 bits (324), Expect = 4e-29, Method: Composition-based stats. Identities = 26/119 (21%), Positives = 53/119 (44%), Gaps = 7/119 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 + G E +A +L KG N + E+D++ I +EV+ R++ +G Sbjct: 4 HNEFGKEGEEEAAAYLIDKGYSIRHRNWHCGKKELDIVAEYRNELIVIEVKTRKNTRFGN 63 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE----VEWIKDAFNDH 130 +VT K +++ + +L + + + RFD++ G + +E I++AF Sbjct: 64 PEDAVTDKKIRRIIASTDAYLRKFSV---DLPVRFDIITLVGEKTPFTIEHIEEAFYPP 119 >UniRef50_D1NRZ9 Putative endonuclease n=1 Tax=Bifidobacterium gallicum DSM 20093 RepID=D1NRZ9_9BIFI Length = 144 Score = 129 bits (324), Expect = 4e-29, Method: Composition-based stats. Identities = 35/120 (29%), Positives = 54/120 (45%), Gaps = 5/120 (4%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREG-RTTIFVEVRYRRSA 71 +L+ + G EA + WL G R + N + R GE+DLI FVE++ RR Sbjct: 25 RLSARDLGSWGEAASACWLRTHGWRIVGHNWHCRYGELDLIALSATDELAFVEIKTRRGC 84 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN----EVEWIKDAF 127 +G +V +KQ L + A LW+ + + V RFDV+ + + AF Sbjct: 85 QFGTPIEAVGVTKQTNLRRAAMLWMLEADHHINHVGIRFDVIGVLVHAGRIRFTHVPHAF 144 >UniRef50_Q2YCL8 UPF0102 protein Nmul_A0195 n=3 Tax=Nitrosomonadaceae RepID=Y195_NITMU Length = 119 Score = 129 bits (324), Expect = 4e-29, Method: Composition-based stats. Identities = 49/118 (41%), Positives = 69/118 (58%), Gaps = 6/118 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 +T + G+ E A +L G L + N R GEIDLIMR+G T +FVEVR R + + Sbjct: 1 MTLRLKGNQAERYAEAFLAGHRLVLVQRNYRCRFGEIDLIMRDGETLVFVEVRMRTNRNF 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE---VEWIKDAFN 128 G A +S+T SKQ K+++ AR +L CRFD V +GNE +EWI++AF+ Sbjct: 61 GDAGSSITLSKQRKVVRAARHYLLSLRTEPC---CRFDAVLLSGNEGRDIEWIRNAFD 115 >UniRef50_C7IKV6 Putative uncharacterized protein n=1 Tax=Clostridium papyrosolvens DSM 2782 RepID=C7IKV6_9CLOT Length = 126 Score = 128 bits (323), Expect = 4e-29, Method: Composition-based stats. Identities = 36/125 (28%), Positives = 59/125 (47%), Gaps = 13/125 (10%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERG-GEIDLIMREGRTTIFVEVRYRRSAL 72 ++ G E +A +L+ G + N GEID+I + F+EV+ RR++ Sbjct: 4 ANKREIGAVGEREAAEFLQRNGYTILKINYRVGRLGEIDIIANDNEYICFIEVKTRRTST 63 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE----------VEW 122 +G +VT++KQ K+ Q A ++L N + RFDV+ N+ + Sbjct: 64 FGSPGEAVTKTKQQKIRQIAAIYLT--NTRKMDSNVRFDVIEILMNKSMESVNSIKSINL 121 Query: 123 IKDAF 127 IKDAF Sbjct: 122 IKDAF 126 >UniRef50_D2SDZ8 Putative uncharacterized protein n=1 Tax=Geodermatophilus obscurus DSM 43160 RepID=D2SDZ8_9ACTO Length = 139 Score = 128 bits (323), Expect = 5e-29, Method: Composition-based stats. Identities = 44/135 (32%), Positives = 62/135 (45%), Gaps = 10/135 (7%) Query: 1 MATVPTRSGSP---RQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREG 57 + P RSG R TT G E A +L GLR + N R GE+D++ R+G Sbjct: 6 LRDRPDRSGPTTVGRVRTTSDLGAHGERIAAAYLTDSGLRVLDRNWRCRDGELDIVARDG 65 Query: 58 RTTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT- 116 +F EV+ RR+ +G +V KQ +L A+ WLA H+ + RFDVV Sbjct: 66 DALVFCEVKTRRAVGFGHPVEAVGHVKQRRLRVLAQRWLAAHDERA--PELRFDVVGVLV 123 Query: 117 ----GNEVEWIKDAF 127 V ++ AF Sbjct: 124 RVDRPALVTHLRAAF 138 >UniRef50_C4FZ58 Putative uncharacterized protein n=1 Tax=Abiotrophia defectiva ATCC 49176 RepID=C4FZ58_ABIDE Length = 113 Score = 128 bits (323), Expect = 5e-29, Method: Composition-based stats. Identities = 36/112 (32%), Positives = 58/112 (51%), Gaps = 3/112 (2%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE-GRTTIFVEVRYRRSALYGGAA 77 G E +A +L KG + + N + GEID+I + VEV+YR S +G Sbjct: 1 MGKEKEEKAAAYLISKGYKILEKNYLRKTGEIDIIAKSADGYLTAVEVKYRSSDRFGSPF 60 Query: 78 ASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-EVEWIKDAFN 128 ++VT KQ K+ +T +++ HN S + RFDV+ G+ +E + +AF Sbjct: 61 SAVTYIKQRKICKTLLFYMSEHNISP-DIKSRFDVIGIYGDGRLEHLVNAFE 111 >UniRef50_C2D6J2 Putative uncharacterized protein n=1 Tax=Atopobium vaginae DSM 15829 RepID=C2D6J2_9ACTN Length = 176 Score = 128 bits (322), Expect = 6e-29, Method: Composition-based stats. Identities = 34/130 (26%), Positives = 57/130 (43%), Gaps = 10/130 (7%) Query: 6 TRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEV 65 R + L +K+ G E A +LE + + N GE+D+I +G T+FVEV Sbjct: 44 PRKSAVNTLNSKELGALGENLACCFLERQDFEILDRNWKCADGEVDIIASKGDETVFVEV 103 Query: 66 RYRRSALYGG--AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF-----TGN 118 + R G +V + KQ + + AR + + + RFDV+A + Sbjct: 104 KTRLQNKSGELFPEIAVDKQKQSRYIALARSYNTAYPMCEN---IRFDVIALAILDDSHA 160 Query: 119 EVEWIKDAFN 128 ++ I+ AF Sbjct: 161 QLRHIQSAFE 170 >UniRef50_A6LSN5 UPF0102 protein Cbei_1183 n=5 Tax=Clostridium RepID=Y1183_CLOB8 Length = 123 Score = 128 bits (322), Expect = 6e-29, Method: Composition-based stats. Identities = 35/121 (28%), Positives = 56/121 (46%), Gaps = 8/121 (6%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 K G E A+++LE + N GEID+I ++ I VEV+ R + YG Sbjct: 5 NKDIGSFSEDLAKKYLEKNDYSILDCNFKNFLGEIDIICKKNTLLIIVEVKSRYNNNYGL 64 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN------EVEWIKDAFND 129 SV SKQ +++ A ++ + ++ RFDV+ N ++ IKDAF Sbjct: 65 PRESVNFSKQRSIIKVANSYI--NYKRLPNINVRFDVIEVYLNLESTNFKINHIKDAFRL 122 Query: 130 H 130 + Sbjct: 123 N 123 >UniRef50_A3N211 UPF0102 protein APL_1363 n=33 Tax=Pasteurellaceae RepID=Y1363_ACTP2 Length = 123 Score = 128 bits (322), Expect = 7e-29, Method: Composition-based stats. Identities = 58/116 (50%), Positives = 75/116 (64%), Gaps = 2/116 (1%) Query: 12 RQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSA 71 + LT + G +E +AR +LE GL+FIAAN + GE+DLIMR+G T +FVEVR R+S Sbjct: 5 KTLTKRSQGANFEQKAREFLERNGLKFIAANQQFKCGELDLIMRQGDTFVFVEVRQRKSN 64 Query: 72 LYGGAAASVTRSKQHKLLQTARLWL-ARHNGSFDTVDCRFDVVAFTGNEVE-WIKD 125 +G A S+ KQ K L A +WL RH S DT +CRFDVVAF GN+ WI + Sbjct: 65 RFGSAVESIDYRKQQKWLDAANMWLFTRHKQSLDTANCRFDVVAFEGNDPPLWIPN 120 >UniRef50_C4K712 Putative uncharacterized protein n=1 Tax=Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum) RepID=C4K712_HAMD5 Length = 118 Score = 127 bits (321), Expect = 9e-29, Method: Composition-based stats. Identities = 52/118 (44%), Positives = 75/118 (63%), Gaps = 1/118 (0%) Query: 11 PRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRS 70 + L+ ++ G +E ARR+LE GL F +NV R EIDLIMR+ +T +FVEVR++R+ Sbjct: 2 DKTLSRREIGFRYEMIARRYLEKAGLVFKESNVTLRSAEIDLIMRDQKTWVFVEVRFKRN 61 Query: 71 ALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAFN 128 + +G AA S+ KQ +L A +WL++ F RFDV A TGN+ EW ++AFN Sbjct: 62 SFFGSAADSINNKKQKRLRDAAAIWLSKRGSHF-NTSYRFDVFAITGNQFEWFQNAFN 118 >UniRef50_A5FR87 UPF0102 protein DehaBAV1_0707 n=5 Tax=Dehalococcoides RepID=Y707_DEHSB Length = 121 Score = 127 bits (320), Expect = 1e-28, Method: Composition-based stats. Identities = 38/119 (31%), Positives = 60/119 (50%), Gaps = 6/119 (5%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 K+TG+ E A +L+G G I N GEID++ ++G +F+EVR +R YG Sbjct: 4 NRKETGEFGEKLAAEYLKGMGYSIIQTNCRLPEGEIDIVGQDGEYLVFIEVRTKRRLGYG 63 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT----GNEVEWIKDAFND 129 A SVT K+ L+ +A ++ +H + CR D V+ +E IK+A + Sbjct: 64 LPAESVTPRKKAHLMASAESYIQKHR--LEHFPCRIDFVSVDLSQPEPRLELIKNALGE 120 >UniRef50_B8HR21 Putative uncharacterized protein n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HR21_CYAP4 Length = 164 Score = 127 bits (320), Expect = 1e-28, Method: Composition-based stats. Identities = 34/107 (31%), Positives = 56/107 (52%), Gaps = 2/107 (1%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG- 74 ++Q GD E WL +G + + N GEIDLI+++ FVEV+ R + Sbjct: 2 SRQVGDVGEMLVAHWLTAQGWQIVQRNWQCCWGEIDLILQQDEWLAFVEVKTRSRGNWDQ 61 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVE 121 ++T +KQ KL +TA L+L+ + +F + CRFD+ + +V Sbjct: 62 DGLLAITPTKQRKLWKTATLFLSEYP-NFADLSCRFDLALVSYVKVH 107 >UniRef50_B8KR63 Putative uncharacterized protein n=1 Tax=gamma proteobacterium NOR51-B RepID=B8KR63_9GAMM Length = 128 Score = 127 bits (320), Expect = 1e-28, Method: Composition-based stats. Identities = 43/121 (35%), Positives = 69/121 (57%), Gaps = 6/121 (4%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 + G+ WE +A +L G GL I N GEIDLI + +FVEVR R+ + +G Sbjct: 1 MRSEGNQWEIKAASFLRGHGLTIIVQNFTCPFGEIDLIGDDQGVIVFVEVRKRKRSRFGN 60 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF-----TGNEVEWIKDAFNDH 130 AA+SV R+KQ K++++A +L +H CRFDV+A+ + +W++ AF+ + Sbjct: 61 AASSVGRAKQKKIIRSAAFYLQQHGA-MADTHCRFDVIAYDVGADDPDTPKWLRSAFSAN 119 Query: 131 S 131 + Sbjct: 120 A 120 >UniRef50_C6IVU3 Putative uncharacterized protein n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6IVU3_9BACL Length = 130 Score = 127 bits (320), Expect = 1e-28, Method: Composition-based stats. Identities = 39/123 (31%), Positives = 55/123 (44%), Gaps = 9/123 (7%) Query: 12 RQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRS- 70 R K+ G E A L +G + N R GE+D+I R+ + VEVR R Sbjct: 10 RGDGRKERGRKAEQAACEHLISQGYTILERNWRCRSGELDIIARKRDVLVNVEVRSRSQQ 69 Query: 71 -ALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-----VEWIK 124 A +G A SV K ++ TA ++L H + RFDV+A T +E I+ Sbjct: 70 AAAFGTPAESVNARKIKQVRDTAAVYL--HRTGQSDANLRFDVIAVTFGRGDNIALEHIQ 127 Query: 125 DAF 127 AF Sbjct: 128 AAF 130 >UniRef50_UPI0001C37581 hypothetical protein RflaF_17327 n=1 Tax=Ruminococcus flavefaciens FD-1 RepID=UPI0001C37581 Length = 125 Score = 127 bits (320), Expect = 1e-28, Method: Composition-based stats. Identities = 35/120 (29%), Positives = 58/120 (48%), Gaps = 8/120 (6%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 +T +TG E +L G G R IA N +GGEID+I G FVEV+ R+ Sbjct: 1 MTKSETGKLGEESVCSYLLGMGYRIIARNYRIKGGEIDIIAENGDYIAFVEVKSRKPDSL 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-----EVEWIKDAFN 128 +V++ KQ +++TA + +H + RFDV + ++++ +AF+ Sbjct: 61 VSGFEAVSKRKQGLIIKTAADYCLKHPNVWQP---RFDVASVIIENGRVLSIDYVTNAFD 117 >UniRef50_D2R2U4 Putative uncharacterized protein n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R2U4_9PLAN Length = 150 Score = 127 bits (319), Expect = 1e-28, Method: Composition-based stats. Identities = 37/122 (30%), Positives = 57/122 (46%), Gaps = 8/122 (6%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 L K G E A +L +G +A + GEIDL+ +GRT +FVEV+ R + + Sbjct: 23 LQPKSLGRRGEDAAALFLRARGYWIVARSYRTSLGEIDLVAVDGRTIVFVEVKTRVRSDH 82 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE------VEWIKDAF 127 G +V KQ +L + A +L RH+ RFD+V+ +E ++ AF Sbjct: 83 GQPFDAVHPDKQRRLTRLAAAYLKRHD--LTRYASRFDIVSILWPGGRKQPLIEHLQHAF 140 Query: 128 ND 129 Sbjct: 141 EA 142 >UniRef50_C9LM05 Endonuclease n=1 Tax=Dialister invisus DSM 15470 RepID=C9LM05_9FIRM Length = 117 Score = 127 bits (319), Expect = 1e-28, Method: Composition-based stats. Identities = 40/119 (33%), Positives = 58/119 (48%), Gaps = 7/119 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + G E +A +LE KG+ + N + GEIDLIM++G +F+EV+ RRS LY Sbjct: 1 MGNTAFGRMGEDRACLYLEEKGMTLVTRNFRCKHGEIDLIMKDGSVFVFIEVKTRRSRLY 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF-----TGNEVEWIKDAF 127 G +VT KQ + TA ++L V RFDVV + ++AF Sbjct: 61 GEPIEAVTVYKQRHIRYTAEVFLLAR--HLHDVRIRFDVVEVMMAPGRAVRLRHTRNAF 117 >UniRef50_Q2S1J6 UPF0102 protein SRU_1822 n=1 Tax=Salinibacter ruber DSM 13855 RepID=Y1822_SALRD Length = 122 Score = 127 bits (319), Expect = 1e-28, Method: Composition-based stats. Identities = 40/120 (33%), Positives = 55/120 (45%), Gaps = 8/120 (6%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMR--EGRTTIFVEVRYRRSA 71 TT GD E A L+G G +A N E+DL+ R + +FVEV+ R Sbjct: 2 ATTNDIGDRGEEIAAAHLDGAGYEILARNYRHSRNEVDLVCRETDAGEYVFVEVKTRSGT 61 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT----GNEVEWIKDAF 127 +G AS+T K+ L AR +L H + RFDVVA EV+ ++AF Sbjct: 62 GFGAPEASITAKKRAALQHAARGYLHEHGA--EGAPARFDVVAVMLTGGPPEVQHYENAF 119 >UniRef50_A4SC34 UPF0102 protein Cvib_0014 n=9 Tax=Chlorobiaceae RepID=Y014_PROVI Length = 131 Score = 127 bits (319), Expect = 1e-28, Method: Composition-based stats. Identities = 45/122 (36%), Positives = 56/122 (45%), Gaps = 12/122 (9%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAA 78 G E A +LE KG R + N EID+I +G T F+EV+ R SA G A Sbjct: 9 LGREGERIAAGFLEKKGYRIVQRNFRFHRNEIDIIAMDGETVCFIEVKTRSSATKGEPAE 68 Query: 79 SVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN----------EVEWIKDAFN 128 +VT KQ ++ + A WLA S DCRFDVV VE DAF+ Sbjct: 69 AVTPGKQREIARAAEAWLAF--SSEGEPDCRFDVVGIIAEPLSGGRFRARSVELFADAFH 126 Query: 129 DH 130 D Sbjct: 127 DP 128 >UniRef50_B3ES88 Putative uncharacterized protein n=1 Tax=Candidatus Amoebophilus asiaticus 5a2 RepID=B3ES88_AMOA5 Length = 122 Score = 127 bits (319), Expect = 1e-28, Method: Composition-based stats. Identities = 33/120 (27%), Positives = 59/120 (49%), Gaps = 7/120 (5%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + + Q G +E A +L+ KGL + N + EID+I ++ F+EV+ R SA Sbjct: 6 EASPHQLGKKYEDLATSYLQQKGLMIMVRNYRYKKAEIDIIAQKDACLYFIEVKARTSAK 65 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE----VEWIKDAFN 128 +G A V KQ + A ++ +++ RFD++A + +E+ +DAF+ Sbjct: 66 FGYPEAFVNTYKQQLIKAAAENYILQNDW---NSSIRFDIIAILDQKGCINLEYFEDAFS 122 >UniRef50_C8WAJ1 Putative uncharacterized protein n=1 Tax=Atopobium parvulum DSM 20469 RepID=C8WAJ1_ATOPD Length = 172 Score = 127 bits (319), Expect = 1e-28, Method: Composition-based stats. Identities = 33/127 (25%), Positives = 58/127 (45%), Gaps = 11/127 (8%) Query: 10 SPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRR 69 +++++Q G+ E A ++L +G + I N + GE+D++ ++G + VEV+ RR Sbjct: 45 PLEEMSSRQIGEKGEEIAAKYLIKRGYKIIQTNWTCQIGEVDIVAQDGDNVVLVEVKTRR 104 Query: 70 ---SALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG-----NEVE 121 +V R+KQ K A ++ A H RFDVVA + Sbjct: 105 VLNKDDSIMPELAVNRAKQEKYRTLALMYAALHP---ALTSIRFDVVAINLVAPSTASLR 161 Query: 122 WIKDAFN 128 + AF+ Sbjct: 162 HLIGAFS 168 >UniRef50_C2G0R5 Possible endonuclease n=2 Tax=Sphingobacterium spiritivorum RepID=C2G0R5_9SPHI Length = 118 Score = 127 bits (319), Expect = 1e-28, Method: Composition-based stats. Identities = 36/113 (31%), Positives = 52/113 (46%), Gaps = 6/113 (5%) Query: 18 QTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAA 77 + G E A L G + +A N E+D++ +G +FVEV+ R S +G A Sbjct: 6 EQGKKGEQMALSHLTALGYQILALNWRTGKLEVDILAYDGDILVFVEVKTRSSNAHGEPA 65 Query: 78 ASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE---VEWIKDAF 127 V KQ KL++ AR + D RFD+V+ E + IKDAF Sbjct: 66 DFVDIQKQRKLIRAARACIEERGHQGD---IRFDIVSVYLGEPAYIHLIKDAF 115 >UniRef50_B0VJC5 Putative uncharacterized protein n=1 Tax=Candidatus Cloacamonas acidaminovorans RepID=B0VJC5_9BACT Length = 128 Score = 126 bits (318), Expect = 2e-28, Method: Composition-based stats. Identities = 34/124 (27%), Positives = 60/124 (48%), Gaps = 7/124 (5%) Query: 12 RQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSA 71 ++ + + E A R+L G N ++ GEID+I+ + + +F EV+ R S Sbjct: 2 KKYSLQDFSHIGEDLAARYLVSNGYTITCRNYRKKYGEIDIIVEKDQHLVFCEVKTRTSH 61 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF------TGNEVEWIKD 125 A AS+ SKQ K+ +TA+L++ + F RFDV+ E++ ++ Sbjct: 62 SIEWALASIGFSKQRKISRTAQLYINENP-QFAKHIFRFDVLLVFYYENTDTFEIKHFEN 120 Query: 126 AFND 129 AF+ Sbjct: 121 AFDA 124 >UniRef50_C2HKC4 Possible endonuclease n=2 Tax=Finegoldia magna RepID=C2HKC4_PEPMA Length = 115 Score = 126 bits (318), Expect = 2e-28, Method: Composition-based stats. Identities = 31/113 (27%), Positives = 53/113 (46%), Gaps = 4/113 (3%) Query: 17 KQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGA 76 K G E A +L KG I N +ER GE+D++ + VEV+ R +G Sbjct: 5 KNRGKFAEDYACEYLIEKGYEIIDRNYSERIGELDIVCTYENYLVIVEVKARTDDKFGAP 64 Query: 77 AASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN--EVEWIKDAF 127 + VT KQ ++ +T +++ +++ RFDV+ + ++ DAF Sbjct: 65 SDFVTLGKQDRIRKTTEIYIDKND--LYDYQPRFDVIEIYLDNFKLNHYIDAF 115 >UniRef50_C6XZ20 Putative uncharacterized protein n=2 Tax=Pedobacter RepID=C6XZ20_PEDHD Length = 120 Score = 126 bits (317), Expect = 2e-28, Method: Composition-based stats. Identities = 33/119 (27%), Positives = 51/119 (42%), Gaps = 8/119 (6%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 T G E A +LE G R + N E+D+I + IFVEV+ R S Y Sbjct: 2 ATHNDLGWRGEQIAVEYLENLGYRILNRNWKCARAEVDVIADQEGKLIFVEVKTRSSTDY 61 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF-----TGNEVEWIKDAF 127 G V+ K+ +L + ++ N + RFD++A ++ I+DAF Sbjct: 62 GQPEEFVSYKKERQLEFASSAYIEMRNHQGE---IRFDIIAIVFENKDIYKINHIEDAF 117 >UniRef50_C9MR57 Putative endonuclease n=2 Tax=Prevotella RepID=C9MR57_9BACT Length = 121 Score = 126 bits (317), Expect = 2e-28, Method: Composition-based stats. Identities = 31/119 (26%), Positives = 54/119 (45%), Gaps = 8/119 (6%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 G E A +L+ +G + N + ++D++ + + VEV+ R+ + Sbjct: 4 HNDIGKWGEEVAANYLQQQGYTILHRNWMYQHRDLDIVAMDAGALVIVEVKTRKDERFVN 63 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG-----NEVEWIKDAFND 129 A A+VT K L A ++ R+N S + RFD++ G +EV +KDAF Sbjct: 64 ADAAVTPQKVRSLSLAANAYVKRYNISLE---IRFDIITIVGCPDDKHEVRHVKDAFLP 119 >UniRef50_C8WGY4 Putative uncharacterized protein n=2 Tax=Eggerthella lenta DSM 2243 RepID=C8WGY4_EGGLE Length = 173 Score = 126 bits (317), Expect = 2e-28, Method: Composition-based stats. Identities = 34/119 (28%), Positives = 57/119 (47%), Gaps = 7/119 (5%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 + G E A R+L+ +G + N GE D+I R+G + +FVEV+ R S G Sbjct: 55 RNAELGRRGEDAAARFLDRRGYEIVERNWTCAAGEADIIARDGDSVVFVEVKTRSSCDCG 114 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFD---VVAFTGNE--VEWIKDAFN 128 A +V +K+ + + A L+L + V RFD +VA + + + +AF+ Sbjct: 115 MPAEAVDEAKRDRYERIAALFLQGFDVV--DVPVRFDIVSIVAISPDRAMIRHHINAFS 171 >UniRef50_C7MNC2 Predicted endonuclease related to Holliday junction resolvase n=1 Tax=Cryptobacterium curtum DSM 15641 RepID=C7MNC2_CRYCD Length = 186 Score = 126 bits (317), Expect = 2e-28, Method: Composition-based stats. Identities = 25/111 (22%), Positives = 49/111 (44%), Gaps = 1/111 (0%) Query: 8 SGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRY 67 + + + ++ G E A +L +G + N + GE D+I + + F+EV+ Sbjct: 57 APTNKSQNNRELGRRGEDAAAAFLTRRGYEIVERNWMCQAGEADIIAQGEGSIHFIEVKT 116 Query: 68 RRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN 118 R SA G + +V K+ + + A +L N + + FDV++ Sbjct: 117 RSSAARGFPSEAVDAKKRSRYERIAECYLRSCN-NLPEMRVTFDVISILAT 166 >UniRef50_Q15PJ2 UPF0102 protein Patl_3694 n=1 Tax=Pseudoalteromonas atlantica T6c RepID=Y3694_PSEA6 Length = 114 Score = 126 bits (317), Expect = 2e-28, Method: Composition-based stats. Identities = 32/111 (28%), Positives = 54/111 (48%), Gaps = 5/111 (4%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAA 78 G EAQA +L+ +GL + N R GEID+IMR+ + +FVEV+YR +G A Sbjct: 2 KGAQGEAQALAYLKQQGLTLVTQNYRCRSGEIDIIMRDHQELVFVEVKYRSGQQFGSAVE 61 Query: 79 SVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT-----GNEVEWIK 124 K+ K + ++ + + + R D+V +++ W+K Sbjct: 62 FFHPHKRRKFESAIQHYMLDNKLNPSLIAHRIDIVGIDVLSNNNDKISWLK 112 >UniRef50_A3HX52 Putative uncharacterized protein n=1 Tax=Algoriphagus sp. PR1 RepID=A3HX52_9SPHI Length = 118 Score = 126 bits (317), Expect = 2e-28, Method: Composition-based stats. Identities = 32/117 (27%), Positives = 53/117 (45%), Gaps = 8/117 (6%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 ++G E A +WL KG + + N EIDLI+ + +FVEV++R +G Sbjct: 4 HNRSGQLAEEMAAQWLISKGYQLLEKNYRHGYAEIDLILTHKKLLVFVEVKFRSGTGFGY 63 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-----VEWIKDAF 127 A V +K+ +++ A ++ N D RFD+V + +DAF Sbjct: 64 AEEFVDYTKRKLIIKAADHYIHEKNWK---SDIRFDIVGVYRDRTGAINYRHFEDAF 117 >UniRef50_D1PKZ1 Putative choloylglycine hydrolase n=1 Tax=Subdoligranulum variabile DSM 15176 RepID=D1PKZ1_9FIRM Length = 117 Score = 126 bits (317), Expect = 3e-28, Method: Composition-based stats. Identities = 38/117 (32%), Positives = 54/117 (46%), Gaps = 6/117 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 ++ G EA A ++ +G + N R GEIDLI+ + +F EV+ R + Sbjct: 2 SRNIGQKGEAIAAQYYRQRGYLVLGHNYRTRMGEIDLILYKEDLIVFAEVKTRTGRMLAT 61 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT----GNEVEWIKDAFN 128 A +V KQ +L A +L N F + RFDVV T G +V I DAF Sbjct: 62 PAEAVDLHKQQRLRLAAERYLQ--NSPFSEANVRFDVVEVTPAAKGWQVHCIMDAFQ 116 >UniRef50_Q3A2F1 UPF0102 protein Pcar_2217 n=2 Tax=Deltaproteobacteria RepID=Y2217_PELCD Length = 123 Score = 126 bits (317), Expect = 3e-28, Method: Composition-based stats. Identities = 37/117 (31%), Positives = 58/117 (49%), Gaps = 6/117 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 G E A +L +G++ + N+ GE+D++ R R IFVEV+ RR +G Sbjct: 5 RLSLGRWGEDIAAGYLRRQGMKILDRNIRTPVGELDIVARHKRMLIFVEVKTRRGISHGY 64 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF----TGNEVEWIKDAFN 128 +V +KQ ++L+ A+ +LA D + RFDV+A EVE AF+ Sbjct: 65 PQEAVGAAKQRQILRAAQWYLAERR--LDRLQPRFDVIAVRRRGDEAEVEHFPGAFD 119 >UniRef50_D0LAN4 Putative uncharacterized protein n=1 Tax=Gordonia bronchialis DSM 43247 RepID=D0LAN4_GORB4 Length = 137 Score = 126 bits (317), Expect = 3e-28, Method: Composition-based stats. Identities = 37/138 (26%), Positives = 61/138 (44%), Gaps = 13/138 (9%) Query: 1 MATVPTRSGSPRQLTTKQ-TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRT 59 M PR+ ++ G E A ++ +G R + N R GE+DLI +GR Sbjct: 1 MTAHSAAEPGPRRADRRRHIGHLGEDIAAEFVTNRGWRVLHRNWRNRYGELDLIAADGRV 60 Query: 60 TIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN- 118 + VEV+ R S +Y +VT +K ++ + R+WL+ NGS+ RFDV++ + Sbjct: 61 LVVVEVKTRASLMYSDPLEAVTPAKLSRMRKLTRMWLSEQNGSWS--QIRFDVISVQLDP 118 Query: 119 ---------EVEWIKDAF 127 + F Sbjct: 119 HHPDDRASARIRHHLGVF 136 >UniRef50_A1R7F9 UPF0102 protein AAur_2443 n=3 Tax=Micrococcaceae RepID=Y2443_ARTAT Length = 121 Score = 126 bits (317), Expect = 3e-28, Method: Composition-based stats. Identities = 33/118 (27%), Positives = 57/118 (48%), Gaps = 8/118 (6%) Query: 14 LTTKQT-GDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + K G + EA A +LE +G+R + N GEID++ +G T + EV+ R+S Sbjct: 4 MRAKDLLGRSGEALAADFLENQGMRIVDRNWRCPDGEIDIVAIDGDTLVVAEVKTRKSLD 63 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-----EVEWIKD 125 YG +V +K +L + + W +H + R DVV+ N ++E ++ Sbjct: 64 YGHPFEAVDAAKLARLHRLSSSWCRQHQLNAPRR--RIDVVSVIDNGVVEPQLEHLRG 119 >UniRef50_C9R878 Putative uncharacterized protein n=1 Tax=Ammonifex degensii KC4 RepID=C9R878_AMMDK Length = 114 Score = 125 bits (316), Expect = 3e-28, Method: Composition-based stats. Identities = 42/114 (36%), Positives = 55/114 (48%), Gaps = 10/114 (8%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAA 78 G E A +L G I N R GEIDLI REG T +FVEVR R + +G Sbjct: 2 RGKRAEEVAAVYLRKAGWEIIERNYRCRWGEIDLIAREGETIVFVEVRSRSNLAFGLPEE 61 Query: 79 SVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-------EVEWIKD 125 S+ R KQ KL + AR +LAR CRFDV+A + + +++ Sbjct: 62 SIGRRKQEKLRKVARYFLARLGREL---PCRFDVIAVAWDAATGEIKSLRHLRN 112 >UniRef50_Q8R616 UPF0102 protein FN1370 n=9 Tax=Fusobacterium RepID=Y1370_FUSNN Length = 119 Score = 125 bits (316), Expect = 4e-28, Method: Composition-based stats. Identities = 31/112 (27%), Positives = 63/112 (56%), Gaps = 2/112 (1%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + T++ G+ +E ++ L + + + N + GEID+I + + IF+EV+YR++ + Sbjct: 1 MNTREIGNEYEDKSVEILVKEDYKILERNYQNKFGEIDIIAEKNKEIIFIEVKYRKTNKF 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 G +V R K K+L+ A ++ + RFD +++ G+E++WIK+ Sbjct: 61 GYGYEAVDRRKIMKILKLANYYIQSKK--YQDYKIRFDCMSYLGDELDWIKN 110 >UniRef50_C0ZFM4 Putative uncharacterized protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0ZFM4_BREBN Length = 125 Score = 125 bits (315), Expect = 4e-28, Method: Composition-based stats. Identities = 39/124 (31%), Positives = 57/124 (45%), Gaps = 13/124 (10%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 + G E A +L KG R + NV + GE+DLI +G+ +F+EVR RRS +G Sbjct: 4 RRRLLGQRGEQLAEGYLVNKGFRIVERNVRTKRGEMDLIALDGKCLVFIEVRTRRSQSFG 63 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-----------VEWI 123 A S+T K+ KL + A +L + RFDV+A +E Sbjct: 64 TAGESITWKKKQKLRELALEYLQKSAQPI--PSFRFDVIAIYTGASTQGEDFMKPVIEHY 121 Query: 124 KDAF 127 + AF Sbjct: 122 ESAF 125 >UniRef50_C4XKL5 Putative uncharacterized protein n=1 Tax=Desulfovibrio magneticus RS-1 RepID=C4XKL5_DESMR Length = 133 Score = 125 bits (314), Expect = 5e-28, Method: Composition-based stats. Identities = 38/120 (31%), Positives = 59/120 (49%), Gaps = 6/120 (5%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + G EA A L KG + N RGGE+DL+ R+G T +FVEV+ R + Sbjct: 2 TAKHLEFGREGEAAAEAHLIAKGFAVVTRNYRARGGEVDLVCRDGDTVVFVEVKARGEGM 61 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE----VEWIKDAFN 128 G +VT +K+ ++++ A +L+ + + CRFDVVA + DAF+ Sbjct: 62 RGRPEEAVTPAKRRRIVRAAAQFLSERD--WWDRPCRFDVVAVESRSGHLTASHVADAFS 119 >UniRef50_C9MX50 Endonuclease n=2 Tax=Leptotrichia RepID=C9MX50_9FUSO Length = 121 Score = 125 bits (314), Expect = 6e-28, Method: Composition-based stats. Identities = 40/123 (32%), Positives = 72/123 (58%), Gaps = 7/123 (5%) Query: 9 GSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIM--REGRTTIFVEVR 66 G + ++ G +E A+ +L +GL F+ +N R GEIDLI ++ +T +FVEV+ Sbjct: 2 GQEYSMNKREIGFKYENVAKEYLILQGLTFVESNFYTRFGEIDLIFFEKKSQTLVFVEVK 61 Query: 67 YRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT--GNEVEWIK 124 YR++ +G A VT KQ+K+L +++++L + + R+D+V + +EW+K Sbjct: 62 YRKNDFFGSAIEMVTEEKQNKILASSQIYLLKK---EWDKNVRYDIVGVSRGSGSIEWLK 118 Query: 125 DAF 127 +AF Sbjct: 119 NAF 121 >UniRef50_C1ZN06 Putative uncharacterized protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZN06_PLALI Length = 181 Score = 124 bits (313), Expect = 7e-28, Method: Composition-based stats. Identities = 40/131 (30%), Positives = 61/131 (46%), Gaps = 11/131 (8%) Query: 3 TVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIF 62 P S R L G+ EA+A ++L+ G + +A N+ R GEIDL+ EG T +F Sbjct: 52 RSPHSPSSHRTLN---IGEQGEARAEKYLKELGYQILARNLRTRLGEIDLLALEGETIVF 108 Query: 63 VEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN---- 118 +EV+ R+S G ++ KQ +L + A L + R DV+ TG Sbjct: 109 IEVKTRKSDARGRPEEAIHPRKQKQLSRVAMALLKSKG--WLHRQSRIDVITITGEPESP 166 Query: 119 --EVEWIKDAF 127 E+ + AF Sbjct: 167 DCELRHYRHAF 177 >UniRef50_A1K3T3 UPF0102 protein azo0871 n=1 Tax=Azoarcus sp. BH72 RepID=Y871_AZOSB Length = 137 Score = 124 bits (313), Expect = 7e-28, Method: Composition-based stats. Identities = 43/118 (36%), Positives = 63/118 (53%), Gaps = 3/118 (2%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 + G E +A L +G+R +A N + RGGE+DL+ G +FVEVR R + +GG Sbjct: 20 MQARGREGEERAAAHLAAQGVRILARNRHCRGGELDLVGLHGDMLVFVEVRMRANPRFGG 79 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE---VEWIKDAFNDH 130 AAAS+T K+ +++ A+ WLA CRFDVV G W++ AF+ Sbjct: 80 AAASITAEKRRRVILAAQWWLAGEGRRHAHRPCRFDVVLLEGPATTPPTWLQAAFDAD 137 >UniRef50_Q1QVF6 UPF0102 protein Csal_2201 n=1 Tax=Chromohalobacter salexigens DSM 3043 RepID=Y2201_CHRSD Length = 123 Score = 124 bits (313), Expect = 7e-28, Method: Composition-based stats. Identities = 53/124 (42%), Positives = 72/124 (58%), Gaps = 7/124 (5%) Query: 10 SPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRR 69 S +++ G E +A WL GLR + AN + R GEIDLIMR+G T +F+EVR+RR Sbjct: 2 SNSNNDSRRRGLEMERRAADWLASHGLRLVDANQHARRGEIDLIMRDGDTLVFIEVRHRR 61 Query: 70 SALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN----EVEWIKD 125 A +G +VT +KQ +L+ AR +L R+ S CRFDVV TG EWI+ Sbjct: 62 DARHGHPFETVTAAKQRRLIGAARFYLHRNGLSCA---CRFDVVGVTGTPPHLSFEWIRS 118 Query: 126 AFND 129 AF+ Sbjct: 119 AFDA 122 >UniRef50_A4AH12 Putative uncharacterized protein n=1 Tax=marine actinobacterium PHSC20C1 RepID=A4AH12_9ACTN Length = 118 Score = 124 bits (313), Expect = 7e-28, Method: Composition-based stats. Identities = 36/117 (30%), Positives = 53/117 (45%), Gaps = 8/117 (6%) Query: 14 LTTKQ-TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + K G E A L GL + N GE+D++ R+ +FVEV+ R S L Sbjct: 1 MAAKDVLGARGEELATDHLISAGLEILDRNWRCSQGELDIVARDQDDVVFVEVKTRSSVL 60 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-----EVEWIK 124 +G S+T +K +L + A +W H GS T R D +A E+E +K Sbjct: 61 FGHPFESITATKVARLRRLAAVWCDAHPGSGAT--VRIDAIAVIVPSRGAVEIEHLK 115 >UniRef50_Q1LHS4 UPF0102 protein Rmet_3430 n=2 Tax=Betaproteobacteria RepID=Y3430_RALME Length = 134 Score = 124 bits (312), Expect = 8e-28, Method: Composition-based stats. Identities = 50/126 (39%), Positives = 73/126 (57%), Gaps = 8/126 (6%) Query: 4 VPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE-GRTTIF 62 +P R S +T + G E +A +L+ +GL + N +GGEIDLIMR T +F Sbjct: 1 MPARPAS----STTRQGALAEDRALAYLQRQGLVAVERNYRCKGGEIDLIMRAADDTLVF 56 Query: 63 VEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEW 122 VEVR R +GGAAAS+T +KQ ++L+ A +LA + CR DVVA +EW Sbjct: 57 VEVRKRGGRGFGGAAASITLTKQRRVLRAASHYLATLD---RLPPCRVDVVALDPGRLEW 113 Query: 123 IKDAFN 128 +++AF+ Sbjct: 114 LRNAFD 119 >UniRef50_Q5R0L0 UPF0102 protein IL0423 n=1 Tax=Idiomarina loihiensis RepID=Y423_IDILO Length = 116 Score = 124 bits (312), Expect = 9e-28, Method: Composition-based stats. Identities = 36/111 (32%), Positives = 55/111 (49%), Gaps = 2/111 (1%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAA 78 TG E + +L+ L I N GGE+D+I R+G +F EV++R + Sbjct: 6 TGKRAELLSAEFLKKNNLTIICKNYRIDGGEVDIIARDGHYWVFCEVKFRDDESFAAVIE 65 Query: 79 SVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG--NEVEWIKDAF 127 + + ++ TAR +L +N T RFDV+A G ++EW KDAF Sbjct: 66 QIQPQQCRRIRYTARHYLLSNNIDEHTAAIRFDVIAIVGQPTKIEWFKDAF 116 >UniRef50_B0TH88 Putative uncharacterized protein n=1 Tax=Heliobacterium modesticaldum Ice1 RepID=B0TH88_HELMI Length = 119 Score = 124 bits (312), Expect = 1e-27, Method: Composition-based stats. Identities = 36/121 (29%), Positives = 55/121 (45%), Gaps = 7/121 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + G E +A + L G G I N GEIDLI+RE +FVEVR R S + Sbjct: 1 MNRVLLGRWGEERALQHLLGLGWSLICQNYRTPRGEIDLILRESNWIVFVEVRTRSSERF 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT-----GNEVEWIKDAFN 128 G +V K+ +L+ TA +L + G RFD+++ +++ I+ F Sbjct: 61 GRGEETVDYRKRRRLMATAGHFLGTYQGPPGDP--RFDLISILRLDSGEEQLQHIRGMFT 118 Query: 129 D 129 Sbjct: 119 P 119 >UniRef50_Q1YQG9 Putative uncharacterized protein n=1 Tax=gamma proteobacterium HTCC2207 RepID=Q1YQG9_9GAMM Length = 133 Score = 124 bits (311), Expect = 1e-27, Method: Composition-based stats. Identities = 40/125 (32%), Positives = 61/125 (48%), Gaps = 11/125 (8%) Query: 11 PRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRS 70 P + +G E A +L+ KGL + N R GEIDLIMR+ +FVEVR+R + Sbjct: 7 PNNKKERLSGAEAEQLALDFLQAKGLELVVKNFRTRRGEIDLIMRDNAVLVFVEVRFRSN 66 Query: 71 ALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG----------NEV 120 +G A S+T K +L A+ ++ R + + V RFD VA + + Sbjct: 67 LNFGTAEESITAQKCQRLSSAAQAYMQREGLT-ERVSGRFDAVAISPAKPHRQSSGMYSI 125 Query: 121 EWIKD 125 WI++ Sbjct: 126 NWIQN 130 >UniRef50_B0C8B9 UPF0102 protein AM1_3954 n=1 Tax=Acaryochloris marina MBIC11017 RepID=Y3954_ACAM1 Length = 172 Score = 124 bits (311), Expect = 1e-27, Method: Composition-based stats. Identities = 35/124 (28%), Positives = 52/124 (41%), Gaps = 11/124 (8%) Query: 2 ATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMR------ 55 P RQ Q G+ E +WL + + + R GEID+I R Sbjct: 1 MPSPAAPRPNRQSRNLQVGEWGEQLVCQWLTQQQWHILDRRWHCRWGEIDIIARSNPPLP 60 Query: 56 ---EGRTTIFVEVRYRRSALYG-GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFD 111 FVEV+ RR+ + ++T KQ KL +TA+L+L +H + C+FD Sbjct: 61 GQDSNTRLAFVEVKTRRAQNWDADGLLAITPQKQQKLWKTAQLYLKKHP-ELAELFCQFD 119 Query: 112 VVAF 115 V Sbjct: 120 VALV 123 >UniRef50_D2RIH7 Putative uncharacterized protein n=1 Tax=Acidaminococcus fermentans DSM 20731 RepID=D2RIH7_ACIFE Length = 118 Score = 124 bits (311), Expect = 1e-27, Method: Composition-based stats. Identities = 37/116 (31%), Positives = 56/116 (48%), Gaps = 6/116 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 ++ G+ E A R+LE +G + N GEID+I R FVEV+ R S +G Sbjct: 5 RRRFGNWGEDAAVRYLETRGYEILDRNYRSSWGEIDIIARYRGVLAFVEVKTRHSLKFGR 64 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT----GNEVEWIKDAF 127 AA+VTR KQ +L +TA +L + RFD++ + +K+ F Sbjct: 65 PAAAVTREKQIRLRKTAWCYLRENQVF--RYRSRFDIIEILDLYGKISLNHLKNCF 118 >UniRef50_UPI00016929A4 hypothetical protein Plarl_14719 n=1 Tax=Paenibacillus larvae subsp. larvae BRL-230010 RepID=UPI00016929A4 Length = 134 Score = 124 bits (311), Expect = 1e-27, Method: Composition-based stats. Identities = 45/135 (33%), Positives = 66/135 (48%), Gaps = 9/135 (6%) Query: 1 MATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLI-MREGRT 59 M + + + + K G E A R+L+ KG + ++ N R GE+D+I + E R Sbjct: 1 MNSDEFQRETKKLDGRKALGKRGEEIAVRYLKEKGFQILSQNWRCRTGEVDIILLEEPRC 60 Query: 60 TIFVEVRYRR-SALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG- 117 IF EVR RR + +G AA S+ R KQ ++ QTA ++ H F+ RFDVV Sbjct: 61 LIFTEVRSRRVTGKFGSAAESINRRKQQQIRQTALYYVYVHPP-FNRYTIRFDVVTVEFF 119 Query: 118 -----NEVEWIKDAF 127 + IK AF Sbjct: 120 PEKEDPVIHHIKAAF 134 >UniRef50_Q60CC4 UPF0102 protein MCA0184 n=1 Tax=Methylococcus capsulatus RepID=Y184_METCA Length = 123 Score = 124 bits (311), Expect = 1e-27, Method: Composition-based stats. Identities = 54/126 (42%), Positives = 68/126 (53%), Gaps = 11/126 (8%) Query: 8 SGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRY 67 SGS R LT G E+ +L +GLR I N R GEIDL+M EG T +FVEVRY Sbjct: 4 SGSHRPLT----GPQAESWTAEYLTARGLRLIERNYRCRLGEIDLVMAEGATLVFVEVRY 59 Query: 68 RRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN----EVEWI 123 R YGGA ASV R K +LL TA+ ++ H + R DVVA + + EWI Sbjct: 60 RSGKRYGGALASVDRHKCRRLLATAQHYMVEHRVTGA---VRLDVVAVSPGAAGPQAEWI 116 Query: 124 KDAFND 129 ++A Sbjct: 117 RNAIEA 122 >UniRef50_C1AG13 UPF0102 protein JTY_2914 n=20 Tax=Mycobacterium RepID=Y2914_MYCBT Length = 128 Score = 124 bits (311), Expect = 1e-27, Method: Composition-based stats. Identities = 40/126 (31%), Positives = 58/126 (46%), Gaps = 12/126 (9%) Query: 10 SPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREG--RTTIFVEVRY 67 + + +T Q G EA A +L GLR + N R GE+D+I + RT +FVEV+ Sbjct: 3 TLKTMTRVQLGAMGEALAVDYLTSMGLRILNRNWRCRYGELDVIACDAATRTVVFVEVKT 62 Query: 68 RRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN--------E 119 R YGG A +VT K +L + A LWLA + R DV+ E Sbjct: 63 RTGDGYGGLAHAVTERKVRRLRRLAGLWLADQEERWA--AVRIDVIGVRVGPKNSGRTPE 120 Query: 120 VEWIKD 125 + ++ Sbjct: 121 LTHLQG 126 >UniRef50_Q2JJU2 UPF0102 protein CYB_2119 n=3 Tax=Synechococcus RepID=Y2119_SYNJB Length = 133 Score = 124 bits (311), Expect = 1e-27, Method: Composition-based stats. Identities = 35/130 (26%), Positives = 61/130 (46%), Gaps = 11/130 (8%) Query: 9 GSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYR 68 PR+ + + TG+ E R++L +G + +A GE+DL+ + IFVEV+ R Sbjct: 3 PLPRRASLQNTGNVGEGWVRQYLCQQGWQILAQRWRCPWGELDLVAHKADVLIFVEVKTR 62 Query: 69 RSALYG-GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-------- 119 + G +V KQ +L++ A+ +L++H + CRFDV Sbjct: 63 SPGSWDRGGLLAVGIPKQRRLIRAAQAFLSQHP-HLSELSCRFDVALIERRASREGVSYA 121 Query: 120 -VEWIKDAFN 128 V+++ AF Sbjct: 122 LVDYLPAAFE 131 >UniRef50_B9ZKW9 Putative uncharacterized protein n=1 Tax=Thioalkalivibrio sp. K90mix RepID=B9ZKW9_9GAMM Length = 118 Score = 123 bits (310), Expect = 1e-27, Method: Composition-based stats. Identities = 43/110 (39%), Positives = 60/110 (54%), Gaps = 4/110 (3%) Query: 20 GDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAAS 79 G A E +A +L G+GL +A NV GE+DL+ REG T + VEVR R +GG A S Sbjct: 8 GQAAEDRAAHYLTGQGLILVARNVRRPWGELDLVAREGDTLVLVEVRKRSHRNFGGGAES 67 Query: 80 VTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-VEWIKDAFN 128 + K+ +LL+ A +L RFDVV G++ +EW+ DA Sbjct: 68 IDAGKRRRLLRAAEGYLQETRWQG---PVRFDVVLLDGDDTIEWLPDAIQ 114 >UniRef50_A4YJR8 UPF0102 protein BRADO0179 n=14 Tax=Rhizobiales RepID=Y179_BRASO Length = 141 Score = 123 bits (310), Expect = 2e-27, Method: Composition-based stats. Identities = 41/129 (31%), Positives = 63/129 (48%), Gaps = 4/129 (3%) Query: 2 ATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTI 61 A P ++ SP ++ +TG + EA+A L KG R +A GEIDLI R+ Sbjct: 14 APKPAKTASPERVAAFRTGLSAEARAAALLIAKGYRILAKRFRTPHGEIDLIARKRGLVA 73 Query: 62 FVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEV- 120 FVEV+ R A AA +VT +Q +++ A+ WL H ++ RFD + + Sbjct: 74 FVEVKAR--ASLDDAAYAVTPRQQQRIIDAAQAWLMAHPDH-AELELRFDAILVAPRSLP 130 Query: 121 EWIKDAFND 129 + AF+ Sbjct: 131 RHLMAAFDA 139 >UniRef50_C6VV91 Putative uncharacterized protein n=2 Tax=Flexibacteraceae RepID=C6VV91_DYAFD Length = 119 Score = 123 bits (310), Expect = 2e-27, Method: Composition-based stats. Identities = 31/118 (26%), Positives = 53/118 (44%), Gaps = 8/118 (6%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 G E A +L KG + IA N EIDLI + IF+EV+ R +G Sbjct: 4 ANDLGRWGETTAASFLAEKGFKIIARNYRNWQSEIDLIAAKDDMLIFIEVKTRTGMAFGM 63 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT-----GNEVEWIKDAFN 128 V +K +++ A ++ + ++ D RFD+++ ++ I+DAF+ Sbjct: 64 PEEFVNVTKARLIMRAAEQYI--FDVDWEN-DVRFDIISILVLPDGSTDIRHIEDAFS 118 >UniRef50_C8PW53 Putative uncharacterized protein n=1 Tax=Enhydrobacter aerosaccus SK60 RepID=C8PW53_9GAMM Length = 134 Score = 123 bits (309), Expect = 2e-27, Method: Composition-based stats. Identities = 46/127 (36%), Positives = 71/127 (55%), Gaps = 15/127 (11%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNER-GGEIDLIMREGR----TTIFVEVRYRRS 70 ++ GD +E A+ +LE +GL F A N + + GE+DL+M E + VEVR R++ Sbjct: 7 KQRQGDYYETLAKHYLEAQGLTFFAKNWHYKNLGELDLVMLEPTQKIPCLVIVEVRQRKA 66 Query: 71 ALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG---------NEVE 121 + +G + S+T +KQ K+++T +L H FD D RFDVV++ G Sbjct: 67 SQFGTSLDSITPAKQRKIVKTTAAFLQAHP-QFDNFDIRFDVVSYEGAATAGQAVMPTPT 125 Query: 122 WIKDAFN 128 WIKDAF+ Sbjct: 126 WIKDAFS 132 >UniRef50_B7K4B3 Putative uncharacterized protein n=4 Tax=Cyanobacteria RepID=B7K4B3_CYAP8 Length = 144 Score = 123 bits (309), Expect = 2e-27, Method: Composition-based stats. Identities = 33/104 (31%), Positives = 44/104 (42%), Gaps = 4/104 (3%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRT--TIFVEVRYRRSALYG-G 75 G E RWL+ +G + GGEIDLI T FVEV+ R + Sbjct: 4 IGQLGENLVARWLQSQGWTILQQRWRCPGGEIDLIAHSQGTNLITFVEVKTRSRGNWDAD 63 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE 119 ++T KQ KL Q+A +LA + CRFDV + Sbjct: 64 GLLAITPQKQVKLTQSAAYFLAEYP-HLADFPCRFDVALVNYKK 106 >UniRef50_C7LP67 Putative uncharacterized protein n=1 Tax=Desulfomicrobium baculatum DSM 4028 RepID=C7LP67_DESBD Length = 134 Score = 123 bits (309), Expect = 2e-27, Method: Composition-based stats. Identities = 37/118 (31%), Positives = 58/118 (49%), Gaps = 8/118 (6%) Query: 14 LTTKQT--GDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSA 71 + + G A E A +L KG+R + N GEIDLI + T +FVEV+ R A Sbjct: 1 MAARHLITGQAGEELAAAFLVEKGMRIVERNFRCASGEIDLICEDAGTIVFVEVKTRSGA 60 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG----NEVEWIKD 125 + G ++ +K+ +L++ L+L+RH + CRFD+V VE +D Sbjct: 61 VRGEPGEAIGPAKKKRLIKAGALYLSRHRA--WSRPCRFDLVGILFLHGETVVEHWED 116 >UniRef50_A8UQV9 Putative uncharacterized protein n=1 Tax=Hydrogenivirga sp. 128-5-R1-1 RepID=A8UQV9_9AQUI Length = 111 Score = 123 bits (309), Expect = 2e-27, Method: Composition-based stats. Identities = 27/108 (25%), Positives = 50/108 (46%), Gaps = 3/108 (2%) Query: 18 QTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAA 77 + G +E +A +L+ KG +A N + R GEID++ R+G +FVEV+ + G A Sbjct: 2 RRGSEYEERACLYLQDKGYSIVARNYHCRSGEIDIVARQGGELVFVEVKGGKDTSLGHPA 61 Query: 78 ASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 K +++ A ++ R D+V + +E ++ Sbjct: 62 ERFNPRKLDRIIACAFRFMEEMGLEE---PFRVDLVVVLEDRIEHYEN 106 >UniRef50_A7NKS5 UPF0102 protein Rcas_2007 n=2 Tax=Roseiflexus RepID=Y2007_ROSCS Length = 124 Score = 123 bits (309), Expect = 2e-27, Method: Composition-based stats. Identities = 39/121 (32%), Positives = 56/121 (46%), Gaps = 7/121 (5%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 + GD E A R+L +G +A GEID++ R +FVEVR RR G Sbjct: 4 RRTRLGDWGETMAARFLARRGYEVLARKWRCAAGEIDIVARHDGDLVFVEVRTRRGRDPG 63 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN------EVEWIKDAFN 128 AA S+T +K+ +L+ A +LA H+ R DVVA + +E I A Sbjct: 64 MAAESITNAKRARLMALADAFLAAHDLP-SNTPWRIDVVAISVGLRAQEVSIEHIPYAVE 122 Query: 129 D 129 + Sbjct: 123 E 123 >UniRef50_D0GLC9 Putative uncharacterized protein n=1 Tax=Leptotrichia goodfellowii F0264 RepID=D0GLC9_9FUSO Length = 119 Score = 123 bits (309), Expect = 2e-27, Method: Composition-based stats. Identities = 42/121 (34%), Positives = 74/121 (61%), Gaps = 9/121 (7%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREG--RTTIFVEVRYRRS 70 + + ++ G +E A+ +LE + L FI +N + GEIDLI E T IFVEV+YR++ Sbjct: 2 RKSKREVGFEYEEIAKDYLEERKLLFIESNYYTKYGEIDLIFLEKSSETLIFVEVKYRKN 61 Query: 71 ALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE----VEWIKDA 126 +YG A +V + KQ K++Q+++++++++ R+DV+ GN+ + WIK+A Sbjct: 62 NIYGEAVEAVDKRKQEKIIQSSQIYISKNKWK---NSVRYDVIGIIGNKLKNDINWIKNA 118 Query: 127 F 127 F Sbjct: 119 F 119 >UniRef50_A3DDG4 UPF0102 protein Cthe_0758 n=6 Tax=Clostridia RepID=Y758_CLOTH Length = 130 Score = 122 bits (308), Expect = 3e-27, Method: Composition-based stats. Identities = 37/126 (29%), Positives = 57/126 (45%), Gaps = 12/126 (9%) Query: 12 RQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERG-GEIDLIMREGRTTIFVEVRYRRS 70 + + G EA A ++L+ + N R GEID+I RE FVEV+ R S Sbjct: 7 NKNNKRAAGSIGEAAAVQFLKENNYEILETNFRYRRLGEIDIISREKDYICFVEVKARSS 66 Query: 71 ALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF---------TGNEVE 121 YG +V KQ + + A+++L ++ + + RFDVV E+ Sbjct: 67 LGYGYPREAVNIRKQENIRRLAQIYLCKNRIN--DLKVRFDVVEVYMEKKGDDIEVKEIS 124 Query: 122 WIKDAF 127 IK+AF Sbjct: 125 LIKNAF 130 >UniRef50_A5WCR1 UPF0102 protein PsycPRwf_0497 n=1 Tax=Psychrobacter sp. PRwf-1 RepID=Y497_PSYWF Length = 142 Score = 122 bits (308), Expect = 3e-27, Method: Composition-based stats. Identities = 55/146 (37%), Positives = 78/146 (53%), Gaps = 20/146 (13%) Query: 1 MATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERG-GEIDLIMREGR- 58 M P SP+ ++ G +E A +L+ +GLR IA N + GEIDL++ E Sbjct: 1 MPANPELLISPK----QRQGGGYEQLAADFLQQQGLRLIARNWQQPKVGEIDLVLIEHGR 56 Query: 59 ---TTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF 115 +F EVR R+ YG A AS+TRSKQ KL++TAR +L RH+ + +CRFDVV F Sbjct: 57 SWNVLVFAEVRKRKLLGYGDALASITRSKQKKLIKTARYFL-RHHPEYADFECRFDVVGF 115 Query: 116 TGN----------EVEWIKDAFNDHS 131 T + EW++ AF + Sbjct: 116 TERTGRSGQGEPLQSEWLQGAFLAPA 141 >UniRef50_A5FKL6 UPF0102 protein Fjoh_1217 n=17 Tax=Bacteroidetes RepID=Y1217_FLAJ1 Length = 123 Score = 122 bits (308), Expect = 3e-27, Method: Composition-based stats. Identities = 28/117 (23%), Positives = 51/117 (43%), Gaps = 5/117 (4%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 + G E A LE + + + N + E+D++ ++ + VEV+ R S +G Sbjct: 4 HNELGKLGEDLAAEHLEKENYKILERNWVYKNAEVDILAQKENILVVVEVKTRSSLDFGS 63 Query: 76 AAASVTRSKQHKLLQTARLWL-ARHNGSFDTVDCRFDVVAFTGNE----VEWIKDAF 127 V K L++ ++ R + ++ RFD+VA N +E + DAF Sbjct: 64 PQDFVKPKKIQLLIKAVNAYINYREKDFEEDINVRFDIVAIHKNGESFAIEHLTDAF 120 >UniRef50_A3YDY3 Putative uncharacterized protein n=1 Tax=Marinomonas sp. MED121 RepID=A3YDY3_9GAMM Length = 130 Score = 122 bits (307), Expect = 3e-27, Method: Composition-based stats. Identities = 38/114 (33%), Positives = 54/114 (47%), Gaps = 6/114 (5%) Query: 18 QTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAA 77 G E A+ +L + L I N + GEIDLI + +FVEVRYR+ G AA Sbjct: 19 NKGQLAEEAAKVFLLSQKLSMIEQNFICKLGEIDLICLDNGVIVFVEVRYRQDNSRGSAA 78 Query: 78 ASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVA----FTGNEVEWIKDAF 127 S+ KQ K+++ A+ WL ++ RFD V + W+K AF Sbjct: 79 QSIHLGKQKKVIKAAQYWLLINHK--QDTPIRFDAVLFDQVIDNEHLTWLKSAF 130 >UniRef50_C0VVC1 Endonuclease n=2 Tax=Corynebacterium glucuronolyticum RepID=C0VVC1_9CORY Length = 115 Score = 122 bits (307), Expect = 3e-27, Method: Composition-based stats. Identities = 44/114 (38%), Positives = 62/114 (54%), Gaps = 7/114 (6%) Query: 14 LTTKQT--GDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSA 71 + +K G E ARR+ + +G F+AANV GEIDLIM+ G TT+FVEV+ R ++ Sbjct: 1 MNSKNLLLGRRGETIARRYYQDRGYGFVAANVRYTCGEIDLIMQHGDTTVFVEVKTRTNS 60 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 GGA +VT +K ++ + A WL RFDVV GNE+ + Sbjct: 61 AMGGA-EAVTPAKLRRVQRAAMTWLEGKP----YRPIRFDVVEIIGNEITCFEG 109 >UniRef50_D2MIZ7 Putative uncharacterized protein n=1 Tax=Candidatus Poribacteria sp. WGA-A3 RepID=D2MIZ7_9BACT Length = 128 Score = 122 bits (307), Expect = 3e-27, Method: Composition-based stats. Identities = 48/121 (39%), Positives = 68/121 (56%), Gaps = 7/121 (5%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 LT + G EA A R L KG R + NV GE+D++ R G T IFVEV+ RR+ Sbjct: 2 SLTRQLLGKEAEAAAERLLRQKGYRILDRNVRIGRGELDIVARVGETVIFVEVKARRTDR 61 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-----VEWIKDAF 127 YGG A +VT K+ +L+Q A +LARH + CRFDV+ + + +E +++AF Sbjct: 62 YGGVAHAVTARKERQLIQLAARYLARHR--LERQPCRFDVLLYDAGDPGSPSLEHVENAF 119 Query: 128 N 128 Sbjct: 120 E 120 >UniRef50_B8G6B1 UPF0102 protein Cagg_0930 n=3 Tax=Chloroflexus RepID=Y930_CHLAD Length = 123 Score = 122 bits (307), Expect = 3e-27, Method: Composition-based stats. Identities = 39/101 (38%), Positives = 53/101 (52%), Gaps = 4/101 (3%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 +Q GD E A +LE G IA N R GEID++ R+G +FVEVR RR Sbjct: 5 KRQLGDRGEQVAAVYLERCGYTIIARNWRCRNGEIDMVARDGDYLVFVEVRTRRDE---Y 61 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT 116 A S+ K+ +L+ A +LA H+ +T R DV+A T Sbjct: 62 ALESLLMHKRQRLVTLAYHYLAEHDVP-ETTPWRIDVIALT 101 >UniRef50_C0EXX9 Putative uncharacterized protein n=1 Tax=Eubacterium hallii DSM 3353 RepID=C0EXX9_9FIRM Length = 117 Score = 122 bits (307), Expect = 3e-27, Method: Composition-based stats. Identities = 37/114 (32%), Positives = 55/114 (48%), Gaps = 2/114 (1%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 K +G A E A +LEGKG+R + N GEID+I E + VEV+ R G Sbjct: 2 RKNSGGAAEEAAVLFLEGKGIRILERNFRSYHGEIDIIALEQEMILVVEVKMRSYGDCGT 61 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-EVEWIKDAFN 128 AA +V KQ ++ T + + + + RFDV+ + WI++AF Sbjct: 62 AAEAVDFRKQKRICYTFNYYRMQRRLA-ENTAVRFDVIEVDKDFRCHWIQNAFE 114 >UniRef50_D1SBF6 Putative uncharacterized protein n=1 Tax=Micromonospora aurantiaca ATCC 27029 RepID=D1SBF6_9ACTO Length = 150 Score = 122 bits (307), Expect = 4e-27, Method: Composition-based stats. Identities = 44/136 (32%), Positives = 59/136 (43%), Gaps = 12/136 (8%) Query: 2 ATVPTRSGSPRQL-----TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE 56 + P S R L + G E A R L GLR +A N GEID+I E Sbjct: 17 SAPPATVASGRILAGMTNRNRAVGAYGERCALRHLIETGLRPVARNWRCPEGEIDIIAWE 76 Query: 57 GRTTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT 116 G EV+ RRS +G A +V R+K +L A WLA + + RFDV++ Sbjct: 77 GPVLAICEVKTRRSEQFGSPAEAVVRAKARRLRGLAARWLAETGTTAA--EVRFDVLSVR 134 Query: 117 -----GNEVEWIKDAF 127 VE ++ AF Sbjct: 135 LPLTGPARVEHLRGAF 150 >UniRef50_Q8XUC6 UPF0102 protein RSc3265 n=6 Tax=Proteobacteria RepID=Y3265_RALSO Length = 130 Score = 122 bits (306), Expect = 4e-27, Method: Composition-based stats. Identities = 49/108 (45%), Positives = 68/108 (62%), Gaps = 6/108 (5%) Query: 25 AQARRWLEGKGLRFIAANVNERGGEIDLIMRE-GRTTIFVEVR---YRRSALYGGAAASV 80 +A R+L+ +GL IA N + GEIDL+MR+ T +FVEVR R + +GGAAASV Sbjct: 22 DRALRYLQARGLSVIARNYRCKTGEIDLVMRDVAGTLVFVEVRARVARSAQRFGGAAASV 81 Query: 81 TRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAFN 128 T +KQ +L+ A +LA H + CRFDV+A G +EW++DAF Sbjct: 82 TPAKQRRLIAAAEDFLAGHP--GEVPACRFDVIAIDGTRIEWMRDAFG 127 >UniRef50_A1VIW8 UPF0102 protein Pnap_0271 n=10 Tax=Burkholderiales RepID=Y271_POLNA Length = 153 Score = 122 bits (306), Expect = 4e-27, Method: Composition-based stats. Identities = 57/135 (42%), Positives = 76/135 (56%), Gaps = 16/135 (11%) Query: 8 SGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERG---GEIDLIMR-EGRTTIFV 63 + P+Q+TTK GDA E+ AR +L G GLR+I +N G GEIDL+MR T +FV Sbjct: 21 AALPKQVTTKSRGDAAESAARAYLVGAGLRWIESNYRTPGRGGGEIDLVMRVPDGTLVFV 80 Query: 64 EVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG------ 117 EVR R SA +GGA AS++ KQ +++ AR +L R CRFDVV G Sbjct: 81 EVRQRSSASHGGAGASISAVKQRRIIFAARHYLMRF---ASLPPCRFDVVLVHGALSGGE 137 Query: 118 ---NEVEWIKDAFND 129 +EW+ AF+ Sbjct: 138 SPQATIEWLPAAFDA 152 >UniRef50_Q11XW1 UPF0102 protein CHU_0465 n=1 Tax=Cytophaga hutchinsonii ATCC 33406 RepID=Y465_CYTH3 Length = 113 Score = 122 bits (306), Expect = 4e-27, Method: Composition-based stats. Identities = 40/113 (35%), Positives = 56/113 (49%), Gaps = 4/113 (3%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + Q G A E +A +LE +G + I N+ GEIDLI F+EV+YR+ Y Sbjct: 1 MEHIQKGIAGEQKACAFLEQQGYKIIEKNLRIGKGEIDLIAVHNNCMCFIEVKYRKHNRY 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEV-EWIKD 125 G VT+ K K+ +TA ++ N RFDVVA TG E+ + D Sbjct: 61 GFPEEFVTQKKLLKIQETAEAYIYTVNWQGR---IRFDVVAITGEELPVHLMD 110 >UniRef50_C7MB82 Putative uncharacterized protein n=1 Tax=Brachybacterium faecium DSM 4810 RepID=C7MB82_BRAFD Length = 153 Score = 122 bits (306), Expect = 4e-27, Method: Composition-based stats. Identities = 38/120 (31%), Positives = 58/120 (48%), Gaps = 7/120 (5%) Query: 10 SPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRR 69 R +TT Q G A E A L +G + + N+ R GE+D++ + T +FVEV+ RR Sbjct: 33 RVRDMTTAQLGRAGEELAASHLSAQGWQIVERNLRLRQGELDIVALDHATLVFVEVKTRR 92 Query: 70 SALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-----EVEWIK 124 S + G A+VT K +L + A +L + D R DVVA +E ++ Sbjct: 93 SFVTGVPQAAVTPDKLRRLRRLAGEYLMERSTP--HRDVRIDVVAVHAQLDGTFSIEHLE 150 >UniRef50_Q4FQF2 UPF0102 protein Psyc_1908 n=2 Tax=Psychrobacter RepID=Y1908_PSYA2 Length = 173 Score = 121 bits (305), Expect = 5e-27, Method: Composition-based stats. Identities = 47/154 (30%), Positives = 73/154 (47%), Gaps = 34/154 (22%) Query: 3 TVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANV-NERGGEIDLIMREGR--- 58 P SP+ ++ G +E A +L+ +GL IA N + GE+DL+M E Sbjct: 20 DKPLMLTSPK----QRQGGYFEQLACEFLQEQGLILIAKNWQRPKVGELDLVMLEKGQAW 75 Query: 59 -TTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVA--- 114 T +F+EVR R + +G AA SVT KQ K+++ AR +L +H + +CRFDV+A Sbjct: 76 STLVFIEVRQRNRSHFGDAALSVTAGKQRKIIKVARYFLHQHQ-KYSDYECRFDVIAYNT 134 Query: 115 ---------------------FTGNEVEWIKDAF 127 ++ EW++ AF Sbjct: 135 SNNKNSENETDIRLDNQLNQPLEKDQPEWLQGAF 168 >UniRef50_D1W8G8 Putative uncharacterized protein n=1 Tax=Prevotella buccalis ATCC 35310 RepID=D1W8G8_9BACT Length = 134 Score = 121 bits (305), Expect = 6e-27, Method: Composition-based stats. Identities = 33/123 (26%), Positives = 53/123 (43%), Gaps = 10/123 (8%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMR--EGRTTIFVEVRYRRSA 71 T + G E A +L +G EID+I +G T +FVEV+ RRS Sbjct: 12 ATHNKFGKWGEDTAVDYLHKQGYTIRERGWRHGKFEIDIIALSPDGITCVFVEVKTRRSD 71 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-----EVEWIKDA 126 + +V K L A +++ + + RFDV++ G+ ++E +DA Sbjct: 72 EVALPSDAVDEKKMRNLGIAADVYVKMFDIQEE---LRFDVISIVGSTAENMQIEHFEDA 128 Query: 127 FND 129 FN Sbjct: 129 FNP 131 >UniRef50_D2RAN4 Putative uncharacterized protein n=1 Tax=Gardnerella vaginalis 409-05 RepID=D2RAN4_GARVA Length = 182 Score = 121 bits (305), Expect = 6e-27, Method: Composition-based stats. Identities = 35/105 (33%), Positives = 47/105 (44%), Gaps = 1/105 (0%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGR-TTIFVEVRYRRSA 71 L +K+ G E A L KG I N + R GE+DL+M +FVEV+ RRS Sbjct: 43 SLESKELGKLGETYATLRLIQKGWHVIDQNWHCRNGELDLVMITPEQKLVFVEVKTRRSV 102 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT 116 G ++T+ K+ KL T WL RFD V+ Sbjct: 103 RCGTPLEAITQEKRSKLRTTGMKWLEEFGSDIPHYRIRFDAVSIL 147 >UniRef50_C2KRF5 Possible endonuclease n=2 Tax=Mobiluncus mulieris RepID=C2KRF5_9ACTO Length = 173 Score = 121 bits (305), Expect = 6e-27, Method: Composition-based stats. Identities = 29/120 (24%), Positives = 53/120 (44%), Gaps = 3/120 (2%) Query: 2 ATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGR-TT 60 A P ++ + ++ G A E A +L+ +G + + N R GE+D++ Sbjct: 44 ALKPPKAPR-KNPHNRELGLAGEELAVEFLQTQGYQVLDRNWRCRAGEVDIVALSPDSVL 102 Query: 61 IFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEV 120 FVEV+ R + +G A ++T +K ++ W H F D D+V+ + V Sbjct: 103 AFVEVKTRSTRRHGTPAEAITYAKLTRMRCVMGAWFRVHEAPFHH-DVSLDLVSVEWDGV 161 >UniRef50_C9RJM0 Putative uncharacterized protein n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RJM0_FIBSS Length = 138 Score = 121 bits (305), Expect = 6e-27, Method: Composition-based stats. Identities = 38/120 (31%), Positives = 57/120 (47%), Gaps = 7/120 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 + G+ E QA +L +G + + N GGE+D++ R+ T +FVEV+ + G Sbjct: 11 NRAKGNFIETQAVAFLMREGYQVVTRNYAYHGGELDIVARDNGTLVFVEVKSVWNNQEGN 70 Query: 76 AAASVTRSKQHKLLQTARLWLARHN---GSFDTVDCRFDVVAFTGN----EVEWIKDAFN 128 AA V KQ K+ QTA +LA CRFDV++ + IK+AF Sbjct: 71 PAARVNALKQKKIWQTACHFLATQKTIAPKGFDTPCRFDVLSARAYQEPLQFAHIKNAFE 130 >UniRef50_A1U3H0 UPF0102 protein Maqu_2464 n=3 Tax=Marinobacter RepID=Y2464_MARAV Length = 123 Score = 121 bits (305), Expect = 6e-27, Method: Composition-based stats. Identities = 42/120 (35%), Positives = 62/120 (51%), Gaps = 7/120 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 +++ G +E A R+LE KG+R I NV+ RGGEIDLI + +F EVR+R Sbjct: 6 SRKLGQHYEGVAARYLESKGIRIIERNVHNRGGEIDLIGMDAEALVFFEVRFRADGALVD 65 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-----EVEWIKDAFNDH 130 +SV+ KQ +L++ A +L RH R DV+ T ++WIK+A Sbjct: 66 PISSVSAVKQQRLVRAASFYLHRHG--LWDRVSRIDVIGITPGHSSKYRIQWIKNAIQAD 123 >UniRef50_Q313K2 UPF0102 protein Dde_1093 n=1 Tax=Desulfovibrio desulfuricans subsp. desulfuricans str. G20 RepID=Y1093_DESDG Length = 202 Score = 121 bits (305), Expect = 6e-27, Method: Composition-based stats. Identities = 29/111 (26%), Positives = 52/111 (46%), Gaps = 2/111 (1%) Query: 6 TRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEV 65 R + G E A +L G+R +A N E+D+I ++ T +F EV Sbjct: 3 ARRAAGSCPAHIAAGRLGEEAACAYLAASGMRILARNWRAGHLELDIIAQDNGTIVFAEV 62 Query: 66 RYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT 116 + R + ++T +K+ +L++ A +WL+ ++ CRFD+V T Sbjct: 63 KTRAARGLESPHEALTPAKRSRLVRAAGMWLSSNDM--WDRPCRFDLVCVT 111 >UniRef50_A6SUE7 UPF0102 protein mma_0204 n=4 Tax=Betaproteobacteria RepID=Y204_JANMA Length = 123 Score = 121 bits (305), Expect = 6e-27, Method: Composition-based stats. Identities = 50/123 (40%), Positives = 75/123 (60%), Gaps = 4/123 (3%) Query: 8 SGSPRQLTTKQT-GDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVR 66 R T KQ G A E QA +L+ +GL+ + N +GGEIDL+M++G+ +FVEVR Sbjct: 4 PAFLRPRTAKQLAGQAGEDQALIYLQQQGLQLLERNFRCKGGEIDLLMQDGKALVFVEVR 63 Query: 67 YRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDA 126 R +GGAAAS+ +KQ +L+ A+++L R++ CRFDV+AF E+ W+K+A Sbjct: 64 MRSEKKFGGAAASIGTAKQKRLIIAAQIYLQRYSMP---PPCRFDVIAFDDKEMTWLKNA 120 Query: 127 FND 129 Sbjct: 121 IEA 123 >UniRef50_A6VXY8 UPF0102 protein Mmwyl1_2395 n=1 Tax=Marinomonas sp. MWYL1 RepID=Y2395_MARMS Length = 127 Score = 121 bits (304), Expect = 8e-27, Method: Composition-based stats. Identities = 46/128 (35%), Positives = 64/128 (50%), Gaps = 6/128 (4%) Query: 4 VPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFV 63 P S R+ K GD E A +L +GLRF+ N R GEIDLI + T +FV Sbjct: 2 RPVTSFLNRKKAPKNNGDKAEQAAEAFLRKQGLRFVERNFFCRIGEIDLIFLDQNTYVFV 61 Query: 64 EVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVA----FTGNE 119 EVR+R + +G AA S+ +SK K+ +A LWL ++N RFD + Sbjct: 62 EVRFRANNTHGNAAESLGQSKLKKVRNSAALWLQKNNKV--NNSSRFDAILFDEKIDSQH 119 Query: 120 VEWIKDAF 127 + W+K F Sbjct: 120 LTWLKAVF 127 >UniRef50_A0Z7D7 Putative uncharacterized protein n=1 Tax=marine gamma proteobacterium HTCC2080 RepID=A0Z7D7_9GAMM Length = 123 Score = 120 bits (303), Expect = 1e-26, Method: Composition-based stats. Identities = 46/118 (38%), Positives = 63/118 (53%), Gaps = 5/118 (4%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 QTG E A+ +L +GLR +A NV R GEID+IM +G T +FVEVR R Sbjct: 5 NTQTGKDAEDYAQNFLITQGLRTVARNVCCRYGEIDIIMEQGITVVFVEVRLRAQKGLQT 64 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT----GNEVEWIKDAFND 129 A+SV+ KQ +L++TA L + R RFDV+A+ WI+ AF+ Sbjct: 65 GASSVSYRKQQRLIKTASLVIQRMP-ELQGRPVRFDVIAYDTLQKNRVPHWIQQAFDA 121 >UniRef50_A6GLM3 Putative uncharacterized protein n=1 Tax=Limnobacter sp. MED105 RepID=A6GLM3_9BURK Length = 167 Score = 120 bits (303), Expect = 1e-26, Method: Composition-based stats. Identities = 44/128 (34%), Positives = 63/128 (49%), Gaps = 2/128 (1%) Query: 2 ATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTI 61 V ++ +L G E QA + LE GL + N R GEIDLIM G T + Sbjct: 38 PNVAEKAPPVHKLALLAEGQLAETQALQLLEKHGLILVTRNHRCRCGEIDLIMASGNTAV 97 Query: 62 FVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT-GNEV 120 VEVR R + +G A S++ KQ ++ + A+LW + RFDVVA G E Sbjct: 98 IVEVRLRNNKRHGSALESISSHKQARVSRCAKLWWVQQGQR-KFTHLRFDVVALENGTEP 156 Query: 121 EWIKDAFN 128 W+++A+ Sbjct: 157 RWVQNAWQ 164 >UniRef50_Q2KU88 UPF0102 protein BAV3162 n=1 Tax=Bordetella avium 197N RepID=Y3162_BORA1 Length = 145 Score = 120 bits (303), Expect = 1e-26, Method: Composition-based stats. Identities = 49/125 (39%), Positives = 66/125 (52%), Gaps = 2/125 (1%) Query: 7 RSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVR 66 R G R G EAQ R L +GLR +A N R GE+DLIM +G + VEVR Sbjct: 9 RRGLIRPDPRHAQGKRAEAQGLRLLRAQGLRLLARNARNRHGELDLIMLDGEVLVVVEVR 68 Query: 67 YRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDA 126 +R + +GGAAAS+ +KQ +L + A WLA RFDV+AF + W++ A Sbjct: 69 WRSGSAFGGAAASIGPAKQARLARAAACWLA--GSEHAGRRLRFDVLAFEAGQARWLRGA 126 Query: 127 FNDHS 131 F + Sbjct: 127 FEPPA 131 >UniRef50_C7R327 Putative uncharacterized protein n=1 Tax=Jonesia denitrificans DSM 20603 RepID=C7R327_JONDD Length = 129 Score = 120 bits (302), Expect = 1e-26, Method: Composition-based stats. Identities = 34/125 (27%), Positives = 54/125 (43%), Gaps = 16/125 (12%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNER---GGEIDLIMREGRTTIFVEVRYRRSA 71 T G E A +WL+ +G + N GEID+I R+G T + VEV+ R + Sbjct: 4 RTYTLGQTGETYAAQWLQKRGYAILERNWRAAYPMRGEIDIIARDGATLVIVEVKTRTTQ 63 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG-----------NEV 120 G + +VT K +L + A WL + + RFDV++ + Sbjct: 64 HCGHPSEAVTPRKLTQLRRLAAAWLT--HAGVRPRELRFDVISVLAPSNRYATPTNEWHI 121 Query: 121 EWIKD 125 + +KD Sbjct: 122 DHLKD 126 >UniRef50_A0QVA9 UPF0102 protein MSMEG_2508 n=6 Tax=Corynebacterineae RepID=Y2508_MYCS2 Length = 124 Score = 120 bits (302), Expect = 1e-26, Method: Composition-based stats. Identities = 34/109 (31%), Positives = 52/109 (47%), Gaps = 4/109 (3%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREG--RTTIFVEVRYRRS 70 LT + G E A L GL+ +A N R GE+D+I + T +FVEV+ R Sbjct: 5 SLTRAELGALGEEVAVEHLAALGLKTLARNWRCRYGELDIIAEDAATGTVVFVEVKTRSG 64 Query: 71 ALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE 119 +GG A +VT K ++ + A +WLA + + + R DV+ Sbjct: 65 DGFGGLAEAVTPQKVRRIRRLAAIWLAAQDAHWAVL--RIDVIGVRVGR 111 >UniRef50_Q1D6H9 UPF0102 protein MXAN_3551 n=2 Tax=Cystobacterineae RepID=Y3551_MYXXD Length = 126 Score = 120 bits (302), Expect = 1e-26, Method: Composition-based stats. Identities = 38/125 (30%), Positives = 61/125 (48%), Gaps = 6/125 (4%) Query: 9 GSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYR 68 ++ G+A E A R+LE +G R N R GE+D++ FVEVR R Sbjct: 2 RRAAPAERREYGNAGEEAAVRFLEAQGWRVRDRNWTCRFGELDVVAERDDLVCFVEVRMR 61 Query: 69 RSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN----EVEWIK 124 +A +G + SV+ +KQ ++++ A +L H+ RFDV++ G V+ I Sbjct: 62 STATWGDPSHSVSFAKQRRVVKAALRYLFAHDLRGRMF--RFDVISVVGRGERATVDHIP 119 Query: 125 DAFND 129 AF+ Sbjct: 120 GAFDA 124 >UniRef50_C2BVF9 Possible endonuclease n=1 Tax=Mobiluncus curtisii ATCC 43063 RepID=C2BVF9_9ACTO Length = 178 Score = 120 bits (301), Expect = 2e-26, Method: Composition-based stats. Identities = 27/124 (21%), Positives = 49/124 (39%), Gaps = 6/124 (4%) Query: 8 SGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREG-RTTIFVEVR 66 S L +Q G A E A L+ G + N R GE+D++ FVEV+ Sbjct: 53 SPPRNNLHNRQLGMAGEEVAAESLKAAGYVIVDRNWRCRAGEVDIVALSPEGVLGFVEVK 112 Query: 67 YRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-----VE 121 R + +G ++T K ++ + WLA+ + D+ + + V+ Sbjct: 113 TRSNHRHGLPIEAITMKKLARMRRVMGAWLAQRDIVPVHRAVSLDLCSVDWDGHGEPVVK 172 Query: 122 WIKD 125 ++ Sbjct: 173 HLQG 176 >UniRef50_C9LKQ6 Putative uncharacterized protein n=1 Tax=Prevotella tannerae ATCC 51259 RepID=C9LKQ6_9BACT Length = 131 Score = 120 bits (301), Expect = 2e-26, Method: Composition-based stats. Identities = 32/120 (26%), Positives = 50/120 (41%), Gaps = 7/120 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 Q G E A +L R + N E+D+I +FVEV+ R Sbjct: 4 HNQLGALGEEVAAHYLSQLEYRLLERNWRTGHLEVDIIADYYGEIVFVEVKTRSYEAEYT 63 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE----VEWIKDAFNDHS 131 A +V R+K+ L++ A ++ H+ CRFD++ G E V DA++ S Sbjct: 64 ALEAVDRTKKKHLVRAAHDYMHLHHL---DAACRFDIITVVGREAPFQVTHYIDAYSPKS 120 >UniRef50_A0YUK1 Putative uncharacterized protein n=2 Tax=Cyanobacteria RepID=A0YUK1_9CYAN Length = 176 Score = 120 bits (301), Expect = 2e-26, Method: Composition-based stats. Identities = 35/113 (30%), Positives = 52/113 (46%), Gaps = 12/113 (10%) Query: 18 QTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE----------GRTTIFVEVRY 67 + G E + WL+ +G + R GEIDLI RE T IFVEV+ Sbjct: 16 KIGTLGEQLVQAWLKQQGWEILFHQYRCRWGEIDLIAREVKDPKVQSKLDSTVIFVEVKT 75 Query: 68 RRSALYG-GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE 119 R + ++T SKQ KL+++A+++L+ H CRFDV + Sbjct: 76 RSKRNWDSDGLLAITPSKQTKLIKSAQIFLSDHP-ELADSPCRFDVALVRCDR 127 >UniRef50_D1ANU1 Putative uncharacterized protein n=1 Tax=Sebaldella termitidis ATCC 33386 RepID=D1ANU1_SEBTE Length = 111 Score = 120 bits (301), Expect = 2e-26, Method: Composition-based stats. Identities = 38/112 (33%), Positives = 67/112 (59%), Gaps = 3/112 (2%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + ++ G +E A+ +L GL ++ +N GEIDLI ++ IFVEV+YR+++ Y Sbjct: 1 MNKREKGFKYENAAKDFLINNGLEYVRSNYYSEYGEIDLIFKDRDFLIFVEVKYRKNSDY 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 G A SVT++K K++ + +++ N + CR+D+VA G E+ W+K+ Sbjct: 61 GFAEESVTQAKLKKIINASLNYISEVNWNEG---CRYDLVAINGEEIIWVKN 109 >UniRef50_B0MPN2 Putative uncharacterized protein n=1 Tax=Eubacterium siraeum DSM 15702 RepID=B0MPN2_9FIRM Length = 132 Score = 119 bits (300), Expect = 2e-26, Method: Composition-based stats. Identities = 30/118 (25%), Positives = 51/118 (43%), Gaps = 11/118 (9%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAA 78 G E +L G I N + GEID++ +G +FVEV+ RR Sbjct: 10 KGKLGEDFTADYLIKNGYDIITRNYRKPCGEIDIVASKGDILVFVEVKTRRYRSLVSGVE 69 Query: 79 SVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN--------EVEWIKDAFN 128 +V K+ +++ TA +LA + + R+D+ T + + + KDAF+ Sbjct: 70 AVGYKKKGRIIATADCFLAEYG---EEKQIRYDIAEVTVSTGDAVRVIDFRYFKDAFD 124 >UniRef50_Q146Q2 UPF0102 protein Bxeno_A0149 n=9 Tax=Burkholderiaceae RepID=Y149_BURXL Length = 140 Score = 119 bits (300), Expect = 2e-26, Method: Composition-based stats. Identities = 52/117 (44%), Positives = 74/117 (63%), Gaps = 2/117 (1%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE-GRTTIFVEVRYRRSALYG 74 +K G A+EA+A+ +L+ + LRF+A NV RGGEIDL+MRE +FVEVR R YG Sbjct: 22 SKLVGAAFEARAQEFLQRQRLRFVARNVACRGGEIDLVMRERDGALVFVEVRARAQRRYG 81 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVD-CRFDVVAFTGNEVEWIKDAFNDH 130 GAAAS+ KQ ++++ A+ +LA + CRFDV+AF + W++DAF Sbjct: 82 GAAASIGWRKQQRIVRAAQHYLATRSSQLRDQPACRFDVIAFEAGRLVWLRDAFRAD 138 >UniRef50_Q6A7T5 UPF0102 protein PPA1431 n=3 Tax=Propionibacterium acnes RepID=Y1431_PROAC Length = 140 Score = 119 bits (299), Expect = 3e-26, Method: Composition-based stats. Identities = 35/124 (28%), Positives = 51/124 (41%), Gaps = 11/124 (8%) Query: 1 MATVP--TRSGSPRQLTTK-------QTGDAWEAQARRWLEGKGLRFIAANVNERGGEID 51 M P +PR + G E A +++E G IA N GEID Sbjct: 2 MTPKPLSAELTTPRGAALRGRRGCRPAFGAWGEDLAAQYVESLGWTIIARNWTCDVGEID 61 Query: 52 LIMREGRTTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFD 111 LI R+ +T +F+EV+ R +G S+T +K KL + A WL + R D Sbjct: 62 LIARDDQTVVFIEVKARSGTGFGDPLESITTAKVRKLHELALAWLVNQDDGVH--SVRID 119 Query: 112 VVAF 115 + Sbjct: 120 AIGV 123 >UniRef50_Q8DI54 UPF0102 protein tll1737 n=1 Tax=Thermosynechococcus elongatus BP-1 RepID=Y1737_THEEB Length = 124 Score = 119 bits (299), Expect = 3e-26, Method: Composition-based stats. Identities = 33/117 (28%), Positives = 55/117 (47%), Gaps = 8/117 (6%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE-GRTTIFVEVRYRRSALYG 74 + GD EA WL+ + + +A N + GE+D+I + G +FVEV+ R S + Sbjct: 1 MRHVGDRGEAVVAAWLQTQQCQILAQNWSCPWGELDIIACDPGGVVLFVEVKTRGSYNWD 60 Query: 75 -GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAFNDH 130 +++ SKQ KL+ A+ +L + CRFDV + A++ H Sbjct: 61 RDGLDAISPSKQRKLILAAQAFLESQP-QWQEHPCRFDVALV-----RHQRGAYHLH 111 >UniRef50_C9PT16 Endonuclease n=5 Tax=Prevotella RepID=C9PT16_9BACT Length = 124 Score = 119 bits (298), Expect = 4e-26, Method: Composition-based stats. Identities = 34/123 (27%), Positives = 49/123 (39%), Gaps = 10/123 (8%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGR--TTIFVEVRYRRSA 71 G E A L+ +G + + +IDL+ T +FVEV+ R S Sbjct: 2 AKHNDLGKWGEDFAAEHLQKQGYVIRDRDWHCGKRDIDLVAITADMATVVFVEVKTRTSN 61 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-----EVEWIKDA 126 A +V R K L A ++ + N RFDVV G ++E I+DA Sbjct: 62 EVSEPADAVNRQKIRNLGIAANNYIKQFNVVEQ---VRFDVVTIVGTSRENAQLEHIEDA 118 Query: 127 FND 129 FN Sbjct: 119 FNP 121 >UniRef50_B2S4F0 UPF0102 protein TPASS_0913 n=3 Tax=Treponema RepID=Y913_TREPS Length = 126 Score = 119 bits (298), Expect = 4e-26, Method: Composition-based stats. Identities = 38/122 (31%), Positives = 58/122 (47%), Gaps = 8/122 (6%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 K G EA A RWL +G I N GEID+I ++ T +FVEV+ R Y Sbjct: 4 HNKLLGAFGEAYAARWLATRGYIIITRNWRRATGEIDIIAQQDDTIVFVEVKTLRCTSYA 63 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-------EVEWIKDAF 127 A V + KQ ++ +TA+ +LA ++ + RFDV+ + ++ + AF Sbjct: 64 DLAIIVGKRKQKRICETAKHFLASAR-EYNHMCARFDVIVLRSDPFRRQDVDIVHLPHAF 122 Query: 128 ND 129 D Sbjct: 123 ED 124 >UniRef50_Q025A4 UPF0102 protein Acid_2433 n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Y2433_SOLUE Length = 132 Score = 118 bits (297), Expect = 5e-26, Method: Composition-based stats. Identities = 35/113 (30%), Positives = 57/113 (50%), Gaps = 7/113 (6%) Query: 20 GDAWEAQARRWLEGKGLRFIAANVNE--RGGEIDLIMREGRTTIFVEVRYRRSALYGGAA 77 G E A R+L +G +A N GEIDL++ +G FVEV+ R S +G Sbjct: 22 GRIGEDLAHRYLRSQGCTVVARNYRTLAGTGEIDLVVWDGGRLAFVEVKTRSSTDFGPPE 81 Query: 78 ASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT---GNEVEWIKDAF 127 ++V K+ +L AR ++ R + + RFD+V+ ++EW++ AF Sbjct: 82 SAVDAEKRDRLRTAARDYVRRADVDWK--AVRFDIVSVILQASPKIEWLRGAF 132 >UniRef50_Q24UC6 UPF0102 protein DSY2577 n=2 Tax=Desulfitobacterium hafniense RepID=Y2577_DESHY Length = 121 Score = 118 bits (297), Expect = 5e-26, Method: Composition-based stats. Identities = 33/118 (27%), Positives = 54/118 (45%), Gaps = 8/118 (6%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 + G E A + + GL + N GE+D+I REG T IF+EVR R + G Sbjct: 4 HRQALGRYGEELAVKHIRQAGLTVLECNYRCPLGEMDIIAREGETIIFIEVRTRSTGSRG 63 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG-------NEVEWIKD 125 S+T K+ +L + A +L ++ + RFD++A ++ WI+ Sbjct: 64 WGEESITAKKRERLYRIATHYL-KYRNYKEWPSLRFDLIAIRCQDQEGKQPDIIWIRG 120 >UniRef50_Q3AC88 UPF0102 protein CHY_1414 n=1 Tax=Carboxydothermus hydrogenoformans Z-2901 RepID=Y1414_CARHZ Length = 118 Score = 118 bits (296), Expect = 7e-26, Method: Composition-based stats. Identities = 28/119 (23%), Positives = 57/119 (47%), Gaps = 6/119 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + ++ G WE A ++L KG + + N RGGEID++ ++G +F+EVR+R + Sbjct: 1 MNRRELGQKWEELAEQYLRKKGYKILTRNYQIRGGEIDIVAQDGEFLVFIEVRFRSDISF 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE----VEWIKDAFN 128 G + +V K+ L + ++++ H + R D + + V ++ + Sbjct: 61 GTPSETVNEKKKASLKKAIKVYI--HENFLYHLQPRVDFIGIEQKDNRFFVNHYQNVLD 117 >UniRef50_B1VG84 Putative uncharacterized protein n=1 Tax=Corynebacterium urealyticum DSM 7109 RepID=B1VG84_CORU7 Length = 154 Score = 117 bits (295), Expect = 7e-26, Method: Composition-based stats. Identities = 33/114 (28%), Positives = 52/114 (45%), Gaps = 3/114 (2%) Query: 9 GSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREG-RTTIFVEVRY 67 G +T G E A +L +G R + N + R E+D+I + +FVEV+Y Sbjct: 36 GQNSADSTVGVGRQGENLAGEYLVNQGWRIVERNWHCRFAELDIIALDPAGEMVFVEVKY 95 Query: 68 RRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVE 121 R+ ++G +VT++K ++ A WLA D RFDV+ V Sbjct: 96 RKDTVHGTGVEAVTQTKLRRMRLAAGKWLAEQQRGVDV--VRFDVIDVGPGGVR 147 >UniRef50_Q6FD45 UPF0102 protein ACIAD1132 n=4 Tax=Acinetobacter RepID=Y1132_ACIAD Length = 140 Score = 117 bits (295), Expect = 8e-26, Method: Composition-based stats. Identities = 39/130 (30%), Positives = 63/130 (48%), Gaps = 17/130 (13%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + G E QA L+ G + + N + R GEIDLI+ + IFVEV+ R Y Sbjct: 10 IHAHHLGKWAENQALNILQANGFKLVIRNFHSRVGEIDLIVAKADELIFVEVKARTLGSY 69 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT-GNEV------------ 120 A + S+Q K+++TA+ +L R+ + CRFDV+ F +++ Sbjct: 70 AAANEVLLVSQQRKIIKTAQYFLNRYP-DYQQFYCRFDVICFDFPHKIAKTVQQDFSKLR 128 Query: 121 ---EWIKDAF 127 +WI++AF Sbjct: 129 YDQQWIENAF 138 >UniRef50_C2BNU0 Endonuclease n=2 Tax=Corynebacterium RepID=C2BNU0_9CORY Length = 142 Score = 117 bits (294), Expect = 1e-25, Method: Composition-based stats. Identities = 46/124 (37%), Positives = 62/124 (50%), Gaps = 10/124 (8%) Query: 7 RSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREG-RTTIFVEV 65 R + + + G EA A R+L +G IAANV+ R GEIDLI RE T +FVEV Sbjct: 18 RLATKKPRHKQVLGKRGEAFAARYLHERGAEIIAANVSYRVGEIDLIAREPNGTIVFVEV 77 Query: 66 RYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE----VE 121 + R ++ YG A +VT K +L + A WL S + RFDV+A +E Sbjct: 78 KTRANSNYGVA-EAVTPQKLARLRKAAAQWLDGKPLS----EVRFDVIALVAQGQGFVLE 132 Query: 122 WIKD 125 K Sbjct: 133 HFKG 136 >UniRef50_A4A5E8 Putative uncharacterized protein n=2 Tax=unclassified Gammaproteobacteria RepID=A4A5E8_9GAMM Length = 118 Score = 117 bits (293), Expect = 1e-25, Method: Composition-based stats. Identities = 45/114 (39%), Positives = 65/114 (57%), Gaps = 7/114 (6%) Query: 20 GDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAAS 79 GD +EA++ L+ GLR + + GEID+I + +FVEVR RR +GGAAAS Sbjct: 4 GDDFEARSAALLKSYGLRILDTQYRCKAGEIDIIACDEHHLLFVEVRARRHRSHGGAAAS 63 Query: 80 VTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN------EVEWIKDAF 127 V R+KQ ++ + A +L RH + + CRFDV+A+ E WI+ AF Sbjct: 64 VNRAKQCRIARCAAYFLNRHP-QWCHLPCRFDVIAWEPGCAGQSFEARWIQAAF 116 >UniRef50_C2CWR1 Endonuclease n=1 Tax=Gardnerella vaginalis ATCC 14019 RepID=C2CWR1_GARVA Length = 153 Score = 117 bits (293), Expect = 1e-25, Method: Composition-based stats. Identities = 36/118 (30%), Positives = 53/118 (44%), Gaps = 7/118 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREG-RTTIFVEVRYRRSALYG 74 K G+ E A L K + N + R GE+D++M + +F+EV+ RRS +G Sbjct: 37 NKTIGNLGEEYASLKLILKNWILLDRNWHSRFGELDVVMMDPFGRIVFIEVKTRRSVRFG 96 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-----EVEWIKDAF 127 +VT K K + WL HN F RFDVV+ + ++ I AF Sbjct: 97 TPLEAVTNEKCLKTHKAGFKWLDEHNF-FKHRKIRFDVVSILISKDKNIQLRHILGAF 153 >UniRef50_Q6AEA8 UPF0102 protein Lxx14785 n=1 Tax=Leifsonia xyli subsp. xyli RepID=Y1478_LEIXX Length = 118 Score = 117 bits (293), Expect = 2e-25, Method: Composition-based stats. Identities = 32/113 (28%), Positives = 46/113 (40%), Gaps = 7/113 (6%) Query: 18 QTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAA 77 + G E+ A WLE G I N R GEID+I R G T+FVEV+ R + YG Sbjct: 6 ELGRRGESVAAHWLEAHGYVLIGRNWRIRSGEIDIIARTGNITVFVEVKTRATTHYGHPL 65 Query: 78 ASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT-----GNEVEWIKD 125 ++T K +L + W + R D + E+ + Sbjct: 66 EAITPEKAARLRRLTAEWCRTYGPLPG--ALRVDAIGVLNAWSANPEIHHLPG 116 >UniRef50_B1XJM9 Putative uncharacterized protein n=1 Tax=Synechococcus sp. PCC 7002 RepID=B1XJM9_SYNP2 Length = 140 Score = 117 bits (293), Expect = 2e-25, Method: Composition-based stats. Identities = 37/144 (25%), Positives = 65/144 (45%), Gaps = 18/144 (12%) Query: 1 MATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRT- 59 MA P SP +L G+ E ++ L+ + + +A R GE+DL+ +T Sbjct: 1 MADTP----SPEKLAALAVGEQGELFVQQHLKSQDWQIVATRWRCRWGELDLVAFHAQTK 56 Query: 60 -TIFVEVRYRRSALYGG-AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG 117 FVEV+ R+ ++T SKQ K ++ A +L++ ++T CRFDV T Sbjct: 57 ILAFVEVKTRQQHSLDYQGLLAITPSKQRKTIRAAMQFLSKFP-QYETYGCRFDVALVTY 115 Query: 118 NE----------VEWIKDAFNDHS 131 ++ +++ AF + Sbjct: 116 SKTATFPQGFRLATYLEGAFEADA 139 >UniRef50_A6LF20 UPF0102 protein BDI_2565 n=4 Tax=Bacteroidales RepID=Y2565_PARD8 Length = 121 Score = 116 bits (292), Expect = 2e-25, Method: Composition-based stats. Identities = 28/121 (23%), Positives = 48/121 (39%), Gaps = 7/121 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 G E++AR +L G + N + E+D+I + I VEV+ R Sbjct: 2 ARQNDMGREGESEARAYLVKHGYNVLHTNWHWHHYELDIIAVKEDELIVVEVKTRSEDFL 61 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE----VEWIKDAFND 129 +V K +++ A ++ N + RFD+V E ++ I+DAF Sbjct: 62 LSPEDAVDTKKIRRIVAAADAYVRYFNI---DLPVRFDIVTLIKKETGFLIDHIEDAFYA 118 Query: 130 H 130 Sbjct: 119 P 119 >UniRef50_A0LCU4 UPF0102 protein Mmc1_3298 n=1 Tax=Magnetococcus sp. MC-1 RepID=Y3298_MAGSM Length = 124 Score = 116 bits (292), Expect = 2e-25, Method: Composition-based stats. Identities = 37/119 (31%), Positives = 56/119 (47%), Gaps = 5/119 (4%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 LT K G+ E A + ++ KG + N R GE+D+I G +F EV+ R+ A+ Sbjct: 4 LTPKSFGEQAEDFACKMMKKKGYHILQRNARSRYGELDIIALHGEVVVFCEVKARQGAVS 63 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF----TGNEVEWIKDAFN 128 G A ++ KQ +L + A W + + CRFD V G E ++DAF Sbjct: 64 GSAGEAIDGRKQRQLGRLAEAWRLANPA-WMAAPCRFDAVLVAREAQGWHAEIVQDAFQ 121 >UniRef50_B1WNM5 Putative uncharacterized protein n=3 Tax=Chroococcales RepID=B1WNM5_CYAA5 Length = 153 Score = 116 bits (292), Expect = 2e-25, Method: Composition-based stats. Identities = 32/100 (32%), Positives = 45/100 (45%), Gaps = 4/100 (4%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREG--RTTIFVEVRYRRSALYG-G 75 G E +WL + + + GEID+I + T IFVEV+ R+S + Sbjct: 15 IGKIGEQFVAQWLISQSWQILHERWRSPWGEIDIIAQHHHSNTIIFVEVKTRKSKNWDQS 74 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF 115 +VT KQ K+ QTA +L + F CRFDV Sbjct: 75 GILAVTPQKQAKITQTASYFLGEYP-QFSNFICRFDVALV 113 >UniRef50_Q55761 UPF0102 protein sll0189 n=1 Tax=Synechocystis sp. PCC 6803 RepID=Y189_SYNY3 Length = 150 Score = 116 bits (292), Expect = 2e-25, Method: Composition-based stats. Identities = 34/101 (33%), Positives = 46/101 (45%), Gaps = 4/101 (3%) Query: 18 QTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRT--TIFVEVRYRRSALYG- 74 G A E+ WLE +G + + GEIDLI T FVEV+ R + Sbjct: 3 DLGQAGESLVAAWLEQQGGKILQQRWRSPWGEIDLITHFPDTKIIAFVEVKTRSGGNWDQ 62 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF 115 G +V KQ K+ QTA +LA + +CRFDV+ Sbjct: 63 GGLLAVNARKQEKIWQTANHFLASQP-QWSDWNCRFDVMIV 102 >UniRef50_UPI00017886BD protein of unknown function UPF0102 n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI00017886BD Length = 127 Score = 116 bits (291), Expect = 2e-25, Method: Composition-based stats. Identities = 38/128 (29%), Positives = 61/128 (47%), Gaps = 9/128 (7%) Query: 7 RSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVR 66 S S ++ KQ G A E A L KG R + N R GE+D++ G T + +EVR Sbjct: 2 TSPSGKKDNRKQKGAAAEELAAAALIQKGYRILDRNWRCRFGELDIVAETGETLVVIEVR 61 Query: 67 YRRS-ALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF------TGNE 119 R +G + SV K ++ TA+ ++ H + RFDV++ T + Sbjct: 62 SRSGTTRFGTPSESVNARKVMQVRNTAQQYV--HQKRYYERTIRFDVISVMLREDMTADS 119 Query: 120 VEWIKDAF 127 ++ I++AF Sbjct: 120 MDHIENAF 127 >UniRef50_A9B5H2 UPF0102 protein Haur_0145 n=1 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=Y145_HERA2 Length = 124 Score = 116 bits (291), Expect = 3e-25, Method: Composition-based stats. Identities = 35/116 (30%), Positives = 56/116 (48%), Gaps = 6/116 (5%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 K G E A +L+ G + IA+ + R GEIDLI + T + +EVR RR +G Sbjct: 4 DRKALGRWGEQYAAEYLQQLGYQLIASGWHCRWGEIDLIAYDQATLVIIEVRTRRGTAHG 63 Query: 75 GAAASVTRSKQHKLLQTARLWLARHN--GSFDTVDCRFDVVAFTG----NEVEWIK 124 AA S+T K+ +L + + +L + + D R D +A T ++E + Sbjct: 64 SAAESLTLKKRQRLARLLQAYLQALDAAQTPWLGDYRIDAIAITLSRGQPQLEHFQ 119 >UniRef50_UPI0001BC5BE0 endonuclease n=3 Tax=Fusobacterium RepID=UPI0001BC5BE0 Length = 123 Score = 115 bits (290), Expect = 3e-25, Method: Composition-based stats. Identities = 31/117 (26%), Positives = 53/117 (45%), Gaps = 3/117 (2%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 +Q G+ +E +A L + + N GEID+I + +F+EV+YR++ + Sbjct: 2 QNNRQKGNEYEERAVNILRENQYQILERNFRIFQGEIDIIAEKDGVLVFIEVKYRKNRNF 61 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD-AFND 129 G +V K K+ + A + + R DV+ F G+ W KD A+ D Sbjct: 62 GYGKEAVDSRKLGKIFRVAEYYKTYCGKQYQK--MRIDVIHFLGDTYFWEKDVAWGD 116 >UniRef50_D1BMP5 Putative uncharacterized protein n=3 Tax=Veillonella RepID=D1BMP5_VEIPT Length = 132 Score = 115 bits (290), Expect = 3e-25, Method: Composition-based stats. Identities = 36/133 (27%), Positives = 61/133 (45%), Gaps = 7/133 (5%) Query: 1 MATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTT 60 M T+ T +L +K+ G E A ++E GL + N R GEID+I + Sbjct: 1 MKTISTGKAFN-ELDSKELGKWGERVATNYIEKIGLTVVDTNYRTRLGEIDIIAKRDLVY 59 Query: 61 IFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLAR-HNGSFDTVDCRFDVVAFTGNE 119 F+E++ RR +G A +VT+ KQ + + A L+L + + FDV+ +E Sbjct: 60 HFIEIKARRGMQHGLAREAVTKKKQKHIKRAAMLFLYDLNQKKRRWKEISFDVIEVYLHE 119 Query: 120 -----VEWIKDAF 127 + ++ F Sbjct: 120 DFQSSIHYLPQCF 132 >UniRef50_A6NUN6 Putative uncharacterized protein n=1 Tax=Bacteroides capillosus ATCC 29799 RepID=A6NUN6_9BACE Length = 119 Score = 115 bits (289), Expect = 4e-25, Method: Composition-based stats. Identities = 36/122 (29%), Positives = 57/122 (46%), Gaps = 11/122 (9%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + T G E+ L +G R +A+ R GEIDLI +G +FVEV+ R+S + Sbjct: 1 MNTSLLGRWGESLVAEELRRRGCRVVASGYRTRFGEIDLIAEDGPYLLFVEVKLRKSDRF 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG--------NEVEWIKD 125 A V R KQ ++ TA ++LA++ RFDV + ++++ Sbjct: 61 APGRAFVDRGKQERIRTTAEIYLAQNPTERQP---RFDVAEVYAPQGTATAHPRIVYLEN 117 Query: 126 AF 127 AF Sbjct: 118 AF 119 >UniRef50_B0MUK7 Putative uncharacterized protein n=1 Tax=Alistipes putredinis DSM 17216 RepID=B0MUK7_9BACT Length = 121 Score = 115 bits (289), Expect = 4e-25, Method: Composition-based stats. Identities = 32/121 (26%), Positives = 54/121 (44%), Gaps = 8/121 (6%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 TT+ TG E A RWL G + N + E+D++ T F+EV+ RR Sbjct: 3 TTQHTGRLGEETAARWLLDHGFTLLHRNWRQGHYELDIVAARKGTLHFIEVKTRRRDGLT 62 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT-----GNEVEWIKDAFND 129 ++ K+ L++ A +L + + + +FD++A EV +I+DA Sbjct: 63 PPEQALDSHKRRALVRAANAYLTENPFAGE---VQFDLIAVETAPAGTPEVRYIEDAIEL 119 Query: 130 H 130 H Sbjct: 120 H 120 >UniRef50_D1BJ87 Predicted endonuclease related to Holliday junction resolvase n=1 Tax=Sanguibacter keddieii DSM 10542 RepID=D1BJ87_SANKS Length = 127 Score = 115 bits (289), Expect = 4e-25, Method: Composition-based stats. Identities = 33/121 (27%), Positives = 48/121 (39%), Gaps = 8/121 (6%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + G E R LEG G + N GGE+DL+ +GR + +EV+ R Sbjct: 5 RTDRAAVGRYGEELVARMLEGAGWVVVDRNWRGTGGELDLVALDGRELVVIEVKTRTGLG 64 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGS---FDTVDCRFDVVAFT-----GNEVEWIK 124 YG + +VT K +L + A WLA R DVV +V+ + Sbjct: 65 YGHPSEAVTPRKLARLRRLAGEWLAGRAAETVPERPTSVRVDVVGVLLEKGRPPQVDHLV 124 Query: 125 D 125 Sbjct: 125 G 125 >UniRef50_B4WKR8 Putative uncharacterized protein n=1 Tax=Synechococcus sp. PCC 7335 RepID=B4WKR8_9SYNE Length = 144 Score = 115 bits (289), Expect = 4e-25, Method: Composition-based stats. Identities = 34/111 (30%), Positives = 51/111 (45%), Gaps = 10/111 (9%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMR--------EGRTTIFVEV 65 L + G+ E +WL K + + + R GEIDLI + + T F+EV Sbjct: 2 LNPQDLGNYGEQLVCQWLTQKNCQILQRQWHSRFGEIDLIAKGISGQGSLKAETLAFIEV 61 Query: 66 RYRRSALYG-GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF 115 + R + ++TRSKQ K+ TAR +L RH + CRFD+ Sbjct: 62 KTRSKGNWDADGLLALTRSKQQKIRMTARYFLVRHP-HLSELPCRFDLALV 111 >UniRef50_C0WCB9 Endonuclease n=1 Tax=Acidaminococcus sp. D21 RepID=C0WCB9_9FIRM Length = 117 Score = 115 bits (289), Expect = 5e-25, Method: Composition-based stats. Identities = 34/114 (29%), Positives = 55/114 (48%), Gaps = 6/114 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 + G E A +L +G A N + GE+DL+ R+G +FVEV+ RR+ LYG Sbjct: 4 RTRFGRWGERAAAAYLRHQGYIIEAQNYSSSHGELDLVARKGHLLVFVEVKSRRTDLYGR 63 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT----GNEVEWIKD 125 +VT K ++ +TA +L H D R+DV+ ++ +K+ Sbjct: 64 PRDAVTEEKAARIRETAYEYLQDHKRPGDR--IRYDVIEIMMLFGHFQLNHLKN 115 >UniRef50_C0WKI9 Endonuclease n=3 Tax=Actinomycetales RepID=C0WKI9_9CORY Length = 132 Score = 115 bits (288), Expect = 6e-25, Method: Composition-based stats. Identities = 42/120 (35%), Positives = 59/120 (49%), Gaps = 10/120 (8%) Query: 9 GSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE-GRTTIFVEVRY 67 + ++ G E+ A +L +G IAANV+ R GEIDLI RE T +FVEV+ Sbjct: 10 ATKHPRHRQELGKRGESFAAGYLRERGSDIIAANVSYRVGEIDLIAREPDGTIVFVEVKT 69 Query: 68 RRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF----TGNEVEWI 123 R +A +G A +VT K ++ + A WL RFDV+A G E+E Sbjct: 70 RSTASFGTA-EAVTPHKLARMRRAAVQWLDGKPL----ATVRFDVIALVVNGEGFELEHF 124 >UniRef50_A1W341 UPF0102 protein Ajs_0414 n=11 Tax=Betaproteobacteria RepID=Y414_ACISJ Length = 132 Score = 114 bits (287), Expect = 8e-25, Method: Composition-based stats. Identities = 47/125 (37%), Positives = 67/125 (53%), Gaps = 7/125 (5%) Query: 9 GSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERG---GEIDLIMRE-GRTTIFVE 64 GS TT+ G A E +A L GL + N G GEIDLI+RE T +FVE Sbjct: 10 GSAPARTTRAAGQAGEDRALAHLTAAGLALVERNYRTPGRGGGEIDLILRERDGTLVFVE 69 Query: 65 VRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIK 124 VR R ++ YGGA S+ +KQ +++ A+ +L R CRFD V G+ ++W++ Sbjct: 70 VRSRGASAYGGAGGSIGVAKQRRIVFAAQHYLLRWP---APPPCRFDAVLIEGDRLQWLR 126 Query: 125 DAFND 129 AF+ Sbjct: 127 GAFDA 131 >UniRef50_A3NEP2 UPF0102 protein BURPS668_3819 n=83 Tax=Proteobacteria RepID=Y3819_BURP6 Length = 144 Score = 114 bits (286), Expect = 1e-24, Method: Composition-based stats. Identities = 59/131 (45%), Positives = 79/131 (60%), Gaps = 5/131 (3%) Query: 2 ATVPTRSGSPRQL-TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE-GRT 59 R PR+ + + G A+E +A+R+LE GL +A NV RGGEIDL+MRE T Sbjct: 14 PEAAPRDNFPREAGSKRGIGAAFETRAQRFLERAGLALVARNVTVRGGEIDLVMRERDGT 73 Query: 60 TIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE 119 +FVEVR R ++ YGGAAAS+ K+ +LL A + AR G+ CRFDVVAF G Sbjct: 74 LVFVEVRARANSRYGGAAASIGVRKRMRLLLAAHAFWARTGGANA---CRFDVVAFEGGR 130 Query: 120 VEWIKDAFNDH 130 + W++DAF Sbjct: 131 LVWLRDAFRAD 141 >UniRef50_A9HJH4 UPF0102 protein GDI1964/Gdia_0189 n=1 Tax=Gluconacetobacter diazotrophicus PAl 5 RepID=Y1964_GLUDA Length = 127 Score = 114 bits (286), Expect = 1e-24, Method: Composition-based stats. Identities = 43/129 (33%), Positives = 60/129 (46%), Gaps = 5/129 (3%) Query: 1 MATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTT 60 M PTR R + Q G E A WL+ G + R GEIDL+ + G Sbjct: 1 MMAEPTRR-RVRGAASYQRGLQAEQVAGAWLQEHGWTILMHRARTRWGEIDLVAQRGAMI 59 Query: 61 IFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT-GNE 119 +F EV+ R Y AA S+ R++ +L+ A WL + + + RFDV+ T G+ Sbjct: 60 VFCEVKCRPH--YTTAAESLGRAQMRRLMNAA-AWLCAAHPGWIYDEMRFDVLLVTAGDA 116 Query: 120 VEWIKDAFN 128 V I DAF Sbjct: 117 VHHIADAFR 125 >UniRef50_C0BHK9 Putative uncharacterized protein n=1 Tax=Flavobacteria bacterium MS024-2A RepID=C0BHK9_9BACT Length = 120 Score = 114 bits (285), Expect = 1e-24, Method: Composition-based stats. Identities = 30/119 (25%), Positives = 52/119 (43%), Gaps = 8/119 (6%) Query: 14 LTTKQT-GDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + G E +A +L KG + N EIDL+M++ + +EV+ R + Sbjct: 1 MAQHNLFGQEAEQKALSFLCNKGYVLLEKNYRFGKAEIDLLMKDKDLLVCIEVKARSTDF 60 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT----GNEVEWIKDAF 127 +G + +T K L+ +L HN + RFDV+++T + I+ AF Sbjct: 61 FGTPESFITSKKIKLLVGAVNHYLEYHNL---DYEVRFDVLSYTIKNKKWICKHIESAF 116 >UniRef50_C0E2B0 Putative uncharacterized protein n=2 Tax=Corynebacterium matruchotii RepID=C0E2B0_9CORY Length = 156 Score = 114 bits (285), Expect = 1e-24, Method: Composition-based stats. Identities = 35/110 (31%), Positives = 56/110 (50%), Gaps = 6/110 (5%) Query: 12 RQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGR---TTIFVEVRYR 68 R + G+ EA A ++ +G R +A NV GE+D+I R T +F+EV+ R Sbjct: 32 RSHLAHRVGELGEATAAQFYRDEGYRILARNVRYPVGELDVIARAPDPSGTIVFIEVKTR 91 Query: 69 RSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN 118 + +G A +VT K H++ + A WL + + + RFDVVA + Sbjct: 92 TTLDFGIA-EAVTPRKLHRMHRAAYRWLTERHVPWS--EVRFDVVAIYLD 138 >UniRef50_A1VFE8 UPF0102 protein Dvul_2148 n=3 Tax=Desulfovibrio vulgaris RepID=Y2148_DESVV Length = 134 Score = 113 bits (284), Expect = 1e-24, Method: Composition-based stats. Identities = 38/118 (32%), Positives = 54/118 (45%), Gaps = 6/118 (5%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 TG E +A L+ G R IA N G E+D+I G T +FVEV+ R + Sbjct: 4 ARHATGQHGEDEAAALLQRTGHRIIARNWRHGGLELDIICETGDTIVFVEVKTRAAHGLT 63 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN----EVEWIKDAFN 128 ++T K+H+L++ AR CRFD+V T +E I DAF+ Sbjct: 64 SPTDALTHQKRHRLIRAARA--WLAAADAWDRACRFDLVCVTQRGATCTLEHITDAFD 119 >UniRef50_UPI000197ABB4 hypothetical protein GHTCC_11038 n=2 Tax=Proteobacteria RepID=UPI000197ABB4 Length = 122 Score = 113 bits (284), Expect = 1e-24, Method: Composition-based stats. Identities = 41/119 (34%), Positives = 63/119 (52%) Query: 7 RSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVR 66 +S G A E + + +L +GL F N + GEIDL+M++ T +FVEV+ Sbjct: 3 KSAVNNTQNAYHRGLAVEQKVKAYLIAQGLVFKDENFRAKCGEIDLVMKDQDTWVFVEVK 62 Query: 67 YRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 YR +G AA +T SK+ KL +T +++A+H + +D R D+ A GN W K Sbjct: 63 YRARPTHGSAADMLTSSKRDKLTKTMYVYMAKHYLNPSIIDHRIDLFAVDGNRARWHKH 121 >UniRef50_B9CKG2 Putative uncharacterized protein n=1 Tax=Atopobium rimae ATCC 49626 RepID=B9CKG2_9ACTN Length = 118 Score = 113 bits (283), Expect = 2e-24, Method: Composition-based stats. Identities = 33/116 (28%), Positives = 48/116 (41%), Gaps = 12/116 (10%) Query: 22 AWEAQARRWLEGKGLRFIAANVNE-RGGEIDLIMREGRTTIFVEVRYRRSALYGG---AA 77 E A +L +G + I N GGE D+I ++G + VEV+ RR Sbjct: 1 MGEQLAADYLAERGYKIIQRNWRCKGGGEADIIAQDGDVYVMVEVKTRRMLQVDANLMPE 60 Query: 78 ASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT-----GNEVEWIKDAFN 128 +VT KQ + A L+LA H RFDV+A + + AF+ Sbjct: 61 LAVTAQKQRMYRKMALLYLAFHG---QVSMIRFDVIAINLVAEHNASLRHLIGAFS 113 >UniRef50_D1AZ70 Putative uncharacterized protein n=1 Tax=Sulfurospirillum deleyianum DSM 6946 RepID=D1AZ70_SULD5 Length = 108 Score = 113 bits (283), Expect = 2e-24, Method: Composition-based stats. Identities = 30/107 (28%), Positives = 49/107 (45%), Gaps = 6/107 (5%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAA 78 G E +A +LE +G +A N + + GEID+I + F EV+Y + Sbjct: 5 IGKEAETKASAYLEKEGYTILARNFHSKFGEIDIIALKEDILHFCEVKY---SQKYDPLL 61 Query: 79 SVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 +T SK K++ T + H S+ D ++ G E+E IK+ Sbjct: 62 RITPSKMKKIITTIHYYFLTHPSSYCYQ---IDAISIKGEEIEIIKN 105 >UniRef50_B4VZG7 Putative uncharacterized protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VZG7_9CYAN Length = 177 Score = 113 bits (283), Expect = 2e-24, Method: Composition-based stats. Identities = 33/135 (24%), Positives = 48/135 (35%), Gaps = 31/135 (22%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRT---------------- 59 T G E WL+ +G + + R GEIDLI Sbjct: 2 TNAKGQLGEQLVATWLQAQGWTILHHRWHCRWGEIDLIAYRDGEVKENPVSNRDINPQFH 61 Query: 60 -------------TIFVEVRYRRSALYG-GAAASVTRSKQHKLLQTARLWLARHNGSFDT 105 FVEV+ R + ++T SKQ K+ +TA L+LA + Sbjct: 62 LPTPLPANAESPILGFVEVKTRSRGNWDADGQLAITSSKQAKIWRTAELFLAENP-DLSD 120 Query: 106 VDCRFDVVAFTGNEV 120 + CRFDV + + Sbjct: 121 LPCRFDVALVRYHRI 135 >UniRef50_B8HA88 UPF0102 protein Achl_2213 n=1 Tax=Arthrobacter chlorophenolicus A6 RepID=Y2213_ARTCA Length = 132 Score = 113 bits (283), Expect = 2e-24, Method: Composition-based stats. Identities = 31/118 (26%), Positives = 46/118 (38%), Gaps = 8/118 (6%) Query: 14 LTTKQT-GDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + K G E A +LE G+ + N GEID++ +G + EV+ RRS Sbjct: 1 MKAKDLLGRHGEDLAVGYLETLGMLIVERNWRCSEGEIDVVALDGDALVIAEVKTRRSLD 60 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-----VEWIKD 125 YG +V K +L + W R DV+A + VE +K Sbjct: 61 YGHPFEAVGPDKLARLHRLGAAWCRDRELRMPLR--RVDVIAVVDDGGGSPVVEHLKG 116 >UniRef50_A4A171 Putative uncharacterized protein n=2 Tax=Planctomycetaceae RepID=A4A171_9PLAN Length = 137 Score = 113 bits (283), Expect = 2e-24, Method: Composition-based stats. Identities = 37/106 (34%), Positives = 52/106 (49%), Gaps = 8/106 (7%) Query: 30 WLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAASVTRSKQHKLL 89 +L G IA + + GEID+I +GRT +FVEV+ R S+ + +V KQ KL Sbjct: 26 FLRQLGYVIIARSDRSKLGEIDIIAVDGRTVVFVEVKTRSSSDAAHPSEAVDTHKQAKLT 85 Query: 90 QTARLWLARHNGSFDTVDCRFDVVAFTG------NEVEWIKDAFND 129 + A +L RHN RFDV+A T +E +AF Sbjct: 86 RLAISYLRRHNLLECKA--RFDVIAITWPAAAQTPTIEHFLNAFEP 129 >UniRef50_C7PDZ7 Putative uncharacterized protein n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PDZ7_CHIPD Length = 118 Score = 112 bits (282), Expect = 2e-24, Method: Composition-based stats. Identities = 34/119 (28%), Positives = 46/119 (38%), Gaps = 7/119 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + G E A L IA N R EID+I +F EV+ S LY Sbjct: 2 ASHIALGKKGELIACGHLRLHHYEIIAVNWRHRRREIDIIASRDGCLVFFEVKTLASDLY 61 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG-----NEVEWIKDAF 127 G VT +K+ + A ++ R RFDV+A T E+ +DAF Sbjct: 62 GWPEKHVTAAKRRNIQAVASAYMDRMKQLPKV--IRFDVIAITFQPDGTYELVHFEDAF 118 >UniRef50_C7GYG3 Putative choloylglycine hydrolase n=1 Tax=Eubacterium saphenum ATCC 49989 RepID=C7GYG3_9FIRM Length = 109 Score = 112 bits (282), Expect = 2e-24, Method: Composition-based stats. Identities = 31/114 (27%), Positives = 57/114 (50%), Gaps = 5/114 (4%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + K+ G E +L+ + + N R EID+I +G T F+EV+ R S + Sbjct: 1 MENKKIGSLGEEMTCSYLKDRQFVVLEQNYRNRYAEIDVIALKGDTVHFIEVKTRCSEVA 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAF 127 G A+ +V SKQ+++ + A ++LA ++ + F VV +++ + AF Sbjct: 61 GRASEAVPVSKQNRIRRLAEIYLADND--LCDKNVEFHVVTIDLHDINY---AF 109 >UniRef50_Q2NZA5 UPF0102 protein XOO3617 n=18 Tax=Xanthomonadaceae RepID=Y3617_XANOM Length = 122 Score = 112 bits (282), Expect = 3e-24, Method: Composition-based stats. Identities = 54/120 (45%), Positives = 71/120 (59%), Gaps = 3/120 (2%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 +Q G EA AR LE GLR + N N RGGE+DL+MR+G++ +FVEVRYRR Sbjct: 2 PAARQQRGAGVEAAARALLEQAGLRLVVGNANYRGGELDLVMRDGQSLVFVEVRYRRDDR 61 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVV--AFTGNEVEWIKDAFNDH 130 +GG AASV K+ KL+ A+L+L H + CRFDVV + + WI+DAF Sbjct: 62 FGGGAASVDWRKRRKLVLAAQLFLGAHPA-LAALPCRFDVVDASGEPPVLHWIRDAFRAD 120 >UniRef50_A7HN69 UPF0102 protein Fnod_1509 n=1 Tax=Fervidobacterium nodosum Rt17-B1 RepID=Y1509_FERNB Length = 133 Score = 112 bits (281), Expect = 3e-24, Method: Composition-based stats. Identities = 24/110 (21%), Positives = 49/110 (44%), Gaps = 1/110 (0%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 K+ E A ++L+ KG + + N GEID+I + IFVEV+ + Sbjct: 19 NKKEWQIAEELAVKYLKEKGYKILEKNFKTPYGEIDIIANKKDIIIFVEVKSGKGIR-IQ 77 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 + V K K++++A +L + + + + DV+ ++ ++ Sbjct: 78 PSERVDDKKYLKIVKSAEFYLEFYLKNKNYKISQIDVIEIINGNIKHYEN 127 >UniRef50_C1TKL9 Predicted endonuclease related to Holliday junction resolvase n=1 Tax=Dethiosulfovibrio peptidovorans DSM 11002 RepID=C1TKL9_9BACT Length = 123 Score = 112 bits (281), Expect = 3e-24, Method: Composition-based stats. Identities = 29/114 (25%), Positives = 51/114 (44%), Gaps = 3/114 (2%) Query: 18 QTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAA 77 + G E A R+L G + NV R GE+D++ R+G T + VEVR+R + + Sbjct: 7 EKGKRGEDLACRYLRNLGWTVLERNVRFRRGELDIVARDGDTLVIVEVRFRTTGIIMSPE 66 Query: 78 ASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAFNDHS 131 SV K +L+ ++ + + R D++A T + + D + Sbjct: 67 DSVGPRKLRRLVIAGAAYVEKTGWNGF---WRIDLIALTERKGRLFLNHCRDIT 117 >UniRef50_A1SLR5 UPF0102 protein Noca_3248 n=11 Tax=Actinomycetales RepID=Y3248_NOCSJ Length = 124 Score = 112 bits (281), Expect = 4e-24, Method: Composition-based stats. Identities = 31/104 (29%), Positives = 45/104 (43%), Gaps = 2/104 (1%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 + G E A R L G+G+ + N GEIDL++R+G + EV+ R S YG Sbjct: 10 KQALGAYGETLAARHLVGQGMVLLERNWRCEAGEIDLVLRDGDVLVVCEVKTRSSLRYGT 69 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE 119 +VT K +L + A W+ D R D+V Sbjct: 70 PHEAVTDIKVARLRRLASRWVQDRGV--AVRDIRIDLVGIVRPR 111 >UniRef50_A5IKG8 UPF0102 protein Tpet_0671 n=6 Tax=Thermotogaceae RepID=Y671_THEP1 Length = 108 Score = 112 bits (280), Expect = 4e-24, Method: Composition-based stats. Identities = 27/107 (25%), Positives = 48/107 (44%), Gaps = 5/107 (4%) Query: 21 DAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAASV 80 E A ++L+ KG + + N + GEID++ R+GR +FVEV+ + + Sbjct: 5 KEAEELACKFLKKKGYKILERNYRTKYGEIDIVARDGREIVFVEVK--SGSGKVDPLERI 62 Query: 81 TRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAF 127 K L QTAR ++ ++ R D V T ++ + + Sbjct: 63 DLKKVRNLEQTARFYMIQNKLKG---PARVDFVRVTPEGIDHFEGIW 106 >UniRef50_C0QVG4 UPF0102 protein BHWA1_02005 n=2 Tax=Brachyspira RepID=Y2005_BRAHW Length = 121 Score = 112 bits (280), Expect = 4e-24, Method: Composition-based stats. Identities = 41/120 (34%), Positives = 59/120 (49%), Gaps = 7/120 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNER--GGEIDLIMREGRTTIFVEVRYRRSA 71 K G+ E A +LE G I N + GEIDL+M +G +F+EV+YRR Sbjct: 2 ANKKIIGNLGEDIALEYLEKLGYTLIERNFKGKKTRGEIDLVMTKGVVIVFIEVKYRRQG 61 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT----GNEVEWIKDAF 127 +G AA S++ K+ KL +TA +L SF+ C F V E+ +I+D F Sbjct: 62 SFGYAACSISDRKKKKLYETAEEYLIEKGLSFNQK-CSFGAVLIDDTHYNREISFIEDIF 120 >UniRef50_D2ATE9 Putative uncharacterized protein n=1 Tax=Streptosporangium roseum DSM 43021 RepID=D2ATE9_STRRD Length = 117 Score = 112 bits (280), Expect = 5e-24, Method: Composition-based stats. Identities = 33/99 (33%), Positives = 47/99 (47%), Gaps = 2/99 (2%) Query: 18 QTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAA 77 + G E A +L G++ + N GEID++ REGR + VEV+ R +G A Sbjct: 6 ELGRHGEQVAVDYLLAHGMQILDRNWRCPDGEIDVVAREGRALVVVEVKTRSGRTHGTAF 65 Query: 78 ASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT 116 +VT K +L + WLA FD R DV+A Sbjct: 66 EAVTVVKLARLRRLTGRWLAERRERFD--SVRIDVIALE 102 >UniRef50_A0Q0X6 UPF0102 protein NT01CX_2205 n=3 Tax=Clostridium RepID=Y2205_CLONN Length = 120 Score = 112 bits (280), Expect = 5e-24, Method: Composition-based stats. Identities = 31/118 (26%), Positives = 53/118 (44%), Gaps = 8/118 (6%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 K G E + +L KG + + N R GEID+I F EV+ R + +G Sbjct: 5 NKPIGSYGEHISENFLVSKGHKILTKNFRCRSGEIDIISSHNNYICFTEVKTRYNYSFGI 64 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE------VEWIKDAF 127 SVT +K K+ TA+ ++ + + F+V+ N+ + +I++AF Sbjct: 65 PCESVTITKIKKIRNTAKFYIYINKLFKNNFK--FNVIEIILNKYSNDYSINFIENAF 120 >UniRef50_C6WEV0 Putative uncharacterized protein n=1 Tax=Actinosynnema mirum DSM 43827 RepID=C6WEV0_ACTMD Length = 117 Score = 111 bits (279), Expect = 6e-24, Method: Composition-based stats. Identities = 27/116 (23%), Positives = 48/116 (41%), Gaps = 7/116 (6%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 + G E+ A R+LE +GL +A N GE+D++ +G + EV+ R + G Sbjct: 3 ASHVLGRLGESVACRYLERQGLVVLARNWRCASGELDVVATDGVRLVVCEVKCRSGSGRG 62 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-----EVEWIKD 125 + T + ++ +TA W H V R D+V + ++ Sbjct: 63 DPLEAATPEQLDRVRRTAYRWRREHR--LSGVGVRVDLVGLEWPPGGPVRLRHVRG 116 >UniRef50_Q47S60 UPF0102 protein Tfu_0669 n=2 Tax=Nocardiopsaceae RepID=Y669_THEFY Length = 124 Score = 111 bits (278), Expect = 8e-24, Method: Composition-based stats. Identities = 32/110 (29%), Positives = 48/110 (43%), Gaps = 3/110 (2%) Query: 7 RSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVR 66 R+ + + G E A R+L G+R + N R GEID++ R+ RT + VEV+ Sbjct: 2 RTRARLADQRRTLGQRGEELAARYLTRHGMRVLQRNWRCRDGEIDILARQDRTLVVVEVK 61 Query: 67 YRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT 116 R +G +V +K+ +L W H S R DVV Sbjct: 62 TRAGRRFGTPLEAVDETKRARLRALGYRWARDHGCS---ARIRVDVVGIL 108 >UniRef50_Q4JV13 UPF0102 protein jk1180 n=2 Tax=Corynebacterium jeikeium RepID=Y1180_CORJK Length = 131 Score = 111 bits (278), Expect = 8e-24, Method: Composition-based stats. Identities = 33/118 (27%), Positives = 51/118 (43%), Gaps = 3/118 (2%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREG-RTTIFVEVRYRRSA 71 + G+ E A +L G + N R GE+DL+ R F+EV+YR SA Sbjct: 14 AANRRAVGNLGEDLAAEYLHRAGYEVLDRNFYTRYGELDLVARTPEDDLAFIEVKYRTSA 73 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTV-DCRFDVVAFTGNEV-EWIKDAF 127 GG A+V K ++ A LWL ++ R DV+ + V E ++ + Sbjct: 74 SDGGGVAAVGPRKLRRIRTLAGLWLEQNREGVQFSGGLRVDVIDVGPDGVREHVEGVW 131 >UniRef50_C0BPF0 Putative uncharacterized protein n=1 Tax=Flavobacteria bacterium MS024-3C RepID=C0BPF0_9BACT Length = 119 Score = 110 bits (277), Expect = 1e-23, Method: Composition-based stats. Identities = 35/116 (30%), Positives = 50/116 (43%), Gaps = 7/116 (6%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 G E A+ +L KG A N R EID+I I VEV+ R + Sbjct: 4 HNDLGAKGERIAQEYLISKGYEIRAVNYRHRKAEIDIIALHENFLIVVEVKTRTAPTIVP 63 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT----GNEVEWIKDAF 127 V SK + L++ A ++ RH + RFD+V T ++E +KDAF Sbjct: 64 LIQLVPPSKINHLIRAANYYMNRHKV---HKEARFDIVYITMKAHSYDLEHLKDAF 116 >UniRef50_Q1IJG5 UPF0102 protein Acid345_3985 n=1 Tax=Candidatus Koribacter versatilis Ellin345 RepID=Y3985_ACIBL Length = 143 Score = 110 bits (276), Expect = 1e-23, Method: Composition-based stats. Identities = 32/125 (25%), Positives = 53/125 (42%), Gaps = 9/125 (7%) Query: 9 GSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERG--GEIDLIMREGRTTIFVEVR 66 P + +TG E A +L G +A N E+D+I G F+EV+ Sbjct: 17 PEPDEPEHLKTGRRGEELAYFFLRKHGYTIVARNFRTPWHKSELDIIGWNGGILCFIEVK 76 Query: 67 YRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE----VEW 122 R + A A+V +K++ L + AR +L + + RFD+V + + Sbjct: 77 TRTTRDIATAEAAVDDTKRNDLRRVARHYLRQ---CAENTPTRFDIVTVYLDRPKPEITI 133 Query: 123 IKDAF 127 +K AF Sbjct: 134 LKSAF 138 >UniRef50_A6GIE5 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6GIE5_9DELT Length = 125 Score = 110 bits (276), Expect = 1e-23, Method: Composition-based stats. Identities = 44/118 (37%), Positives = 60/118 (50%), Gaps = 6/118 (5%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGR-----TTIFVEVRYRR 69 T+ G A E A R LE GL +A NV G E+DL+ E T +FVEVR R Sbjct: 9 HTRGRGLAAEQLAARQLERAGLTILARNVELSGAEVDLVASERDREGTPTIVFVEVRSRA 68 Query: 70 SALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAF 127 G A +V KQ ++ + A +L R + ++ V RFDV+A G W++DAF Sbjct: 69 DDRRGHPAQTVDARKQARVRRAATAYLVREDL-WERVAVRFDVIAIVGERATWLRDAF 125 >UniRef50_Q118B0 UPF0102 protein Tery_0733 n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Y733_TRIEI Length = 180 Score = 110 bits (276), Expect = 2e-23, Method: Composition-based stats. Identities = 30/117 (25%), Positives = 47/117 (40%), Gaps = 14/117 (11%) Query: 18 QTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRT------------TIFVEV 65 TG E +WL +G + + GE+D++ + + FVEV Sbjct: 17 DTGILGEELVAKWLNLEGWQILHRRWQCPWGELDIVATKTTSSLRDSSNYKFPILAFVEV 76 Query: 66 RYRRSALYG-GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVE 121 + R + +VT SKQ KL +TA ++L+ CRFDV N + Sbjct: 77 KTRSRGNWDQDGLLAVTESKQAKLWKTAEIFLSDRP-ELVDYSCRFDVALVRCNYIR 132 >UniRef50_C5CAH6 Holliday junction resolvase-like endonuclease n=1 Tax=Micrococcus luteus NCTC 2665 RepID=C5CAH6_MICLC Length = 139 Score = 110 bits (275), Expect = 2e-23, Method: Composition-based stats. Identities = 30/118 (25%), Positives = 42/118 (35%), Gaps = 2/118 (1%) Query: 2 ATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTI 61 A P + R T G E A RWL +G N GE+D++ + Sbjct: 10 AQPPGERRASRAHTA--LGRFGEDAAARWLAERGYVIADRNWRGEAGELDIVAHHAGWWV 67 Query: 62 FVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE 119 VEV+ R +G S+ R K +L + W+ H R D VA Sbjct: 68 GVEVKTRSGLAFGDPFESIDRRKLTRLHRLTAAWVRAHAADRRGTPWRVDAVAVLVPR 125 >UniRef50_C8NTW9 Choloylglycine hydrolase n=1 Tax=Corynebacterium genitalium ATCC 33030 RepID=C8NTW9_9CORY Length = 125 Score = 110 bits (275), Expect = 2e-23, Method: Composition-based stats. Identities = 35/101 (34%), Positives = 45/101 (44%), Gaps = 6/101 (5%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMR-EGRTTIFVEVRYRRSALYGGAA 77 G E + E G +A N R GEID+I TT+FVEV+ RR +GGA Sbjct: 13 LGALGETEVATRYEQAGYIIVARNYRCRDGEIDIIAMATDGTTVFVEVKTRRGTCFGGA- 71 Query: 78 ASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN 118 SVT K ++ + A WL RFDVV + Sbjct: 72 ESVTARKLARMRKAAVHWLRDKPFR----QVRFDVVEVLFD 108 >UniRef50_C4LJB4 Putative uncharacterized protein n=1 Tax=Corynebacterium kroppenstedtii DSM 44385 RepID=C4LJB4_CORK4 Length = 154 Score = 109 bits (274), Expect = 3e-23, Method: Composition-based stats. Identities = 31/108 (28%), Positives = 47/108 (43%), Gaps = 8/108 (7%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMR-------EGRTTIFVEVRY 67 +T+ G E +A WL +G + N RGGE+D++ VEV+ Sbjct: 26 STRLLGRWGEDRAAEWLVRQGFVIVDRNWRFRGGELDIVATLNTDARNSPAVCAVVEVKT 85 Query: 68 RRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF 115 RR+ +GG ++TR KQ L + WLA H R D++ Sbjct: 86 RRTQFFGGGVEAITRKKQQTLRRGMSQWLAAHPDVHPQF-IRIDLIDI 132 >UniRef50_C1A4D5 Putative uncharacterized protein n=1 Tax=Gemmatimonas aurantiaca T-27 RepID=C1A4D5_GEMAT Length = 121 Score = 109 bits (274), Expect = 3e-23, Method: Composition-based stats. Identities = 35/120 (29%), Positives = 60/120 (50%), Gaps = 6/120 (5%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 ++ G E A RWL +G + +A +IDLIM+ + FVEV+ RR Sbjct: 2 TRARQELGLLGERIAARWLIREGWQLVAHRFRHGHRDIDLIMQREQEVAFVEVKARRGEA 61 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN----EVEWIKDAFN 128 +G +V K+ +L+++A++W+ RH + ++ RFDV+ + V I+ AF Sbjct: 62 FGSPVEAVHARKRRELVRSAKVWVDRHGT--EGLEYRFDVLGILIDGQNVRVRHIEGAFQ 119 >UniRef50_A0LMM6 Putative uncharacterized protein n=1 Tax=Syntrophobacter fumaroxidans MPOB RepID=A0LMM6_SYNFM Length = 95 Score = 109 bits (273), Expect = 3e-23, Method: Composition-based stats. Identities = 37/94 (39%), Positives = 51/94 (54%), Gaps = 6/94 (6%) Query: 40 AANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARH 99 N GEIDLI+R+G+T +FVEV+ R + +G SV+ +KQ +L + A +L Sbjct: 2 ERNFRCAAGEIDLIVRDGKTLVFVEVKSRCGSRFGLPQESVSIAKQRRLTRLALWYLREK 61 Query: 100 NGSFDTVDCRFDVVAFTG----NEVEWIKDAFND 129 F+ RFDVVA T EV WI +AF Sbjct: 62 R--FEGHPARFDVVAVTWSGGKPEVTWIVNAFEA 93 >UniRef50_C4KCT6 Putative uncharacterized protein n=3 Tax=Betaproteobacteria RepID=C4KCT6_THASP Length = 150 Score = 109 bits (273), Expect = 3e-23, Method: Composition-based stats. Identities = 42/98 (42%), Positives = 56/98 (57%), Gaps = 3/98 (3%) Query: 35 GLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARL 94 GLR IA NV RGGE+DL+ + +FVEVR RR+ +GGAA S+T +KQ ++L A+ Sbjct: 52 GLRVIARNVRCRGGEVDLVCLDRSHVVFVEVRLRRNNRFGGAAESITAAKQRRVLIAAQW 111 Query: 95 WLARHNGSFDTVDCRFDVV---AFTGNEVEWIKDAFND 129 WL F CRFD V A + W+ AF+ Sbjct: 112 WLGGAGRRFRDAACRFDAVLLDALDPARIIWLPGAFDA 149 >UniRef50_B6R5V9 Putative uncharacterized protein n=1 Tax=Pseudovibrio sp. JE062 RepID=B6R5V9_9RHOB Length = 135 Score = 109 bits (273), Expect = 3e-23, Method: Composition-based stats. Identities = 33/129 (25%), Positives = 55/129 (42%), Gaps = 4/129 (3%) Query: 2 ATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTI 61 T+S ++ + G E +A L G + + + GEIDLI + +T + Sbjct: 7 TPRATKSNLLKKQAAYRKGLQAELKAEMLLRQAGWQILERRYKTKQGEIDLIAEQDQTIV 66 Query: 62 FVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF-TGNEV 120 FVEV+ RR G ++T+ Q ++ AR W++ H+ RFD V E Sbjct: 67 FVEVKARRGVDDG--LYAITQRSQRRIANAAREWVSHHHEVVGKT-LRFDAVILPKHGEA 123 Query: 121 EWIKDAFND 129 + + F Sbjct: 124 QHFPNLFEA 132 >UniRef50_A8F4Z9 UPF0102 protein Tlet_0667 n=1 Tax=Thermotoga lettingae TMO RepID=Y667_THELT Length = 107 Score = 109 bits (273), Expect = 3e-23, Method: Composition-based stats. Identities = 30/105 (28%), Positives = 44/105 (41%), Gaps = 4/105 (3%) Query: 21 DAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAASV 80 E +A R+L KG + +A N R GEID+I R +FVEV+ + V Sbjct: 4 KEAEEKASRYLRHKGFKILARNYRTRFGEIDIIARYRGYLVFVEVK--SGNSFFLPRTRV 61 Query: 81 TRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 K + A ++ SF R DV+ T +E +D Sbjct: 62 DLQKIRHIQLAANDYIMNTKDSFKGY--RIDVIEVTEKGIEHFED 104 >UniRef50_A8HYK3 UPF0102 protein AZC_4471 n=3 Tax=Rhizobiales RepID=Y4471_AZOC5 Length = 131 Score = 109 bits (272), Expect = 4e-23, Method: Composition-based stats. Identities = 35/128 (27%), Positives = 55/128 (42%), Gaps = 4/128 (3%) Query: 2 ATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTI 61 P R+ G A E +A LE + R +A + GE+DL+ R + Sbjct: 4 PPPPDTPARRRKQAAHARGLAAEDRAAAVLEAQSFRILARRLRTSAGELDLVARRDDLLV 63 Query: 62 FVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF-TGNEV 120 F EV+ RRS AA S+ ++ +++ A L+LA H + RFD + Sbjct: 64 FCEVKLRRS--LAEAAESLQLRQRRRIIAAAELFLADHP-ELAPLAMRFDAILLGRDGGA 120 Query: 121 EWIKDAFN 128 E ++ AF Sbjct: 121 EHLEGAFE 128 >UniRef50_B2IUS6 Putative uncharacterized protein n=4 Tax=Nostocaceae RepID=B2IUS6_NOSP7 Length = 180 Score = 109 bits (272), Expect = 4e-23, Method: Composition-based stats. Identities = 31/130 (23%), Positives = 48/130 (36%), Gaps = 28/130 (21%) Query: 18 QTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGR------------------- 58 G E +WL+ G + R GEID+I + Sbjct: 11 DIGHLGEDLVAQWLQSTGWIILHRRFASRWGEIDIIAQHDGQTGEKLLTQHSLRAKRPAT 70 Query: 59 -------TTIFVEVRYRRSALYG-GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRF 110 FVEV+ R S + G +++T KQ K+ +TA ++LA++ CRF Sbjct: 71 ANSTQHSLLAFVEVKTRSSGSWDAGGRSAITPQKQAKISRTAGIFLAQYPEK-ADYSCRF 129 Query: 111 DVVAFTGNEV 120 DV + Sbjct: 130 DVAIVYCQRI 139 >UniRef50_A6DUI2 Endonuclease n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DUI2_9BACT Length = 127 Score = 108 bits (270), Expect = 7e-23, Method: Composition-based stats. Identities = 36/127 (28%), Positives = 58/127 (45%), Gaps = 11/127 (8%) Query: 12 RQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERG-GEIDLIMREGRTTIFVEVRYRRS 70 ++ +TG EA A+R + G + N + GEID+I R+G T FVEV+ R Sbjct: 3 KKAKHLKTGRKGEAMAQRQMRRCGYEILRKNYSLEHIGEIDIIARDGGTLCFVEVKTRHQ 62 Query: 71 A--LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEW---IK- 124 A ++ K+ ++ + A+ +L +H S RFD+V + W IK Sbjct: 63 NKTEDTSPAQAIDSKKRQRIAKCAKYYLKKH--SLTQCSFRFDIVEVILGKFFWQHQIKI 120 Query: 125 --DAFND 129 AF + Sbjct: 121 RTHAFGE 127 >UniRef50_A4CH47 Putative uncharacterized protein n=1 Tax=Robiginitalea biformata HTCC2501 RepID=A4CH47_9FLAO Length = 128 Score = 108 bits (270), Expect = 7e-23, Method: Composition-based stats. Identities = 32/117 (27%), Positives = 53/117 (45%), Gaps = 7/117 (5%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 TT G E A R+L G R + N R EID++ + +EV+ R A Y Sbjct: 11 TTCDIGREGEDYAVRYLLASGYRILCRNYRYRRAEIDVLAFREGVLVVIEVKTRTRAFYE 70 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF----TGNEVEWIKDAF 127 + S+ RSK +L++ A ++ + + RFD++ G + ++DAF Sbjct: 71 ALSRSIPRSKIARLVRAADHYVRSNGLR---AEVRFDIIQVIRLREGYRLVHLEDAF 124 >UniRef50_B4U6T0 Putative uncharacterized protein n=1 Tax=Hydrogenobaculum sp. Y04AAS1 RepID=B4U6T0_HYDS0 Length = 110 Score = 107 bits (269), Expect = 8e-23, Method: Composition-based stats. Identities = 22/107 (20%), Positives = 43/107 (40%), Gaps = 2/107 (1%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAA 78 G +E A WL KG + + N + GEID+I + I EV+ + YG Sbjct: 2 KGKEFEDMAFSWLLEKGYKVLKRNHRCKRGEIDIIATKENKLIAFEVKGNNTDTYGLPEE 61 Query: 79 SVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 + R K ++ + + D + D + +++ +++ Sbjct: 62 RIDRLKLERIRLCLTEYALSNGIDLDNIQI--DAIFIYKDQIRHLEN 106 >UniRef50_B6BHT8 Putative uncharacterized protein n=1 Tax=Campylobacterales bacterium GD 1 RepID=B6BHT8_9PROT Length = 109 Score = 107 bits (269), Expect = 8e-23, Method: Composition-based stats. Identities = 27/110 (24%), Positives = 53/110 (48%), Gaps = 5/110 (4%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 ++ GD E +A ++L G + N R GEID+I + FVEV+ + Sbjct: 2 SRAKGDLAEDRACKFLYENGFMLVDRNFYSRFGEIDIIATKDEVLHFVEVK--SGLDFES 59 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 A ++T K +L++T +++ ++ + V +D + T VE +++ Sbjct: 60 AIQNITPKKLSRLIRTGNVYMKKNKLDVNFV---YDAIVVTPKTVEIVEN 106 >UniRef50_C5CGT1 UPF0102 protein Kole_1919 n=1 Tax=Kosmotoga olearia TBF 19.5.1 RepID=Y1919_KOSOT Length = 115 Score = 107 bits (269), Expect = 8e-23, Method: Composition-based stats. Identities = 29/109 (26%), Positives = 54/109 (49%), Gaps = 5/109 (4%) Query: 18 QTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAA 77 + G +E +A ++L+ +G + +A NV GE+D++ R+G+T +FVEV+ Sbjct: 7 KKGKEFEERASKFLKKQGYKILARNVRYSFGELDIVARKGKTLVFVEVKG--GNPDFPPR 64 Query: 78 ASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT-GNEVEWIKD 125 V R+K +L A ++ + F+ R DV+ E+ +K Sbjct: 65 MRVDRAKLRRLELAAYKYIKDFSPKFEES--RLDVIEVLSNGEINHLKG 111 >UniRef50_C7H7V1 Endonuclease n=2 Tax=Faecalibacterium prausnitzii RepID=C7H7V1_9FIRM Length = 120 Score = 107 bits (269), Expect = 1e-22, Method: Composition-based stats. Identities = 42/120 (35%), Positives = 58/120 (48%), Gaps = 8/120 (6%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE-GRTTIFVEVRYRRSAL 72 + +TG EA A R+ + +G +A N R GEIDLI+RE T + EV+ R Sbjct: 1 MDRAETGRTGEAVAARYYQKQGCELVAHNYRTRMGEIDLILREPDGTLVLCEVKTRSPDP 60 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG-----NEVEWIKDAF 127 AA+VT +KQ +L++TA +L + RFDV T V IK AF Sbjct: 61 LAAPAAAVTPAKQRRLIRTAEYYLQ--HTGQSDEPVRFDVAEVTPLDSGRWMVHIIKGAF 118 >UniRef50_Q0F072 Putative uncharacterized protein n=1 Tax=Mariprofundus ferrooxydans PV-1 RepID=Q0F072_9PROT Length = 120 Score = 107 bits (268), Expect = 1e-22, Method: Composition-based stats. Identities = 33/121 (27%), Positives = 59/121 (48%), Gaps = 13/121 (10%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 ++T+ G+ E++A R+L+ G R + N GE+D++ G +FVEV+ Sbjct: 1 MSTRD-GNIGESEASRYLQHHGYRILDRNARLGRGELDIVALSGEIVVFVEVKA--HHNR 57 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN---------EVEWIK 124 A +V K +L A++WLA H + ++ CRFD++ T +E ++ Sbjct: 58 ESALLAVHEDKCARLKSAAQMWLALHP-RYASLQCRFDLIIITPRVGLTAWLGSCIEHME 116 Query: 125 D 125 D Sbjct: 117 D 117 >UniRef50_D2NTN5 Predicted endonuclease distantly related to archaeal Holliday junction resolvase n=2 Tax=Rothia mucilaginosa RepID=D2NTN5_9MICC Length = 151 Score = 107 bits (267), Expect = 1e-22, Method: Composition-based stats. Identities = 42/142 (29%), Positives = 53/142 (37%), Gaps = 20/142 (14%) Query: 2 ATVPTRSGSPRQL------TTKQTGDAWEAQARRWLEGKGLRFIAANVNER--------G 47 A TRS R L G E R LE G R + N Sbjct: 8 ARAATRSAPNRPLLRRTSPRAHSVGRWGEELTARILETNGYRILERNWRPPAGLEHEQIR 67 Query: 48 GEIDLIMREG-RTTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTV 106 GE+DLI + +FVEV+ R S +G AS+ R K + A LW R + D Sbjct: 68 GELDLIAIDPEDELVFVEVKTRSSEDFGHPFASIDRDKARRTRSLAILWC-RLRENLDFP 126 Query: 107 DCRFDVVAFTGN----EVEWIK 124 R D +A TG E +K Sbjct: 127 RFRIDAIAVTGTCETFTFEHLK 148 >UniRef50_B2UP21 Putative uncharacterized protein n=2 Tax=Verrucomicrobiaceae RepID=B2UP21_AKKM8 Length = 118 Score = 107 bits (267), Expect = 2e-22, Method: Composition-based stats. Identities = 36/115 (31%), Positives = 52/115 (45%), Gaps = 9/115 (7%) Query: 21 DAWEAQARRWLEGKGLRFIAANVN-ERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAAS 79 E A +L +G + N RGGE+D++ REG +FVEV+ R YGGA + Sbjct: 1 MYGELAAASFLRAEGCVILRRNWRPVRGGELDIVCREGECLVFVEVKTRTGNGYGGARRA 60 Query: 80 VTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT-----GNEVEWIKDAFND 129 V K+ + + A WL + V+ R+DVV E I+ AF Sbjct: 61 VNARKRALIRRGAAEWL---RLLPEPVNSRYDVVEVLYREGMPPEFRHIRGAFGA 112 >UniRef50_B7K7K1 Putative uncharacterized protein n=1 Tax=Cyanothece sp. PCC 7424 RepID=B7K7K1_CYAP7 Length = 148 Score = 107 bits (267), Expect = 2e-22, Method: Composition-based stats. Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 4/100 (4%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE--GRTTIFVEVRYRRSALYG-G 75 G+ E WL+ + + R GEID+I + + F+EV+ R S + Sbjct: 4 IGELGEKLVSEWLKTQEWSILQHRWRCRWGEIDIISQSTTDHSLAFIEVKTRNSRNWDSD 63 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF 115 ++ KQ KL+++A L+L + S CRFDV Sbjct: 64 GLLAINEKKQIKLIKSASLFLGEYP-SLALFPCRFDVALV 102 >UniRef50_Q5FF38 UPF0102 protein ERGA_CDS_00540 n=5 Tax=canis group RepID=Y054_EHRRG Length = 127 Score = 107 bits (267), Expect = 2e-22, Method: Composition-based stats. Identities = 27/120 (22%), Positives = 50/120 (41%), Gaps = 6/120 (5%) Query: 12 RQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSA 71 ++L G E +L+ K I GEID+I + + +F+EV+ Sbjct: 8 KRLAYNTLGYLGEVLIILFLKCKLYHIIKHRYRCPLGEIDIIAHKNKQLVFIEVKTSLFN 67 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF-TGNEVEWIKDAFNDH 130 +T +Q +L++A+ ++A H F RFD+ F + I +A+ + Sbjct: 68 KNIP----ITYKQQKSILKSAKYFIAFHR-KFANYSIRFDLYFFSLSTGLTHIPNAWQEP 122 >UniRef50_Q2IJ48 UPF0102 protein Adeh_1910 n=4 Tax=Anaeromyxobacter RepID=Y1910_ANADE Length = 134 Score = 106 bits (266), Expect = 2e-22, Method: Composition-based stats. Identities = 42/117 (35%), Positives = 54/117 (46%), Gaps = 6/117 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 + G EA A WL +G R + N R GE+DL+ R+G +FVEVR R S GG Sbjct: 12 RQALGREGEALAAAWLAERGFRILDRNHRTRRGEVDLVCRDGEVLVFVEVRSRTSGAQGG 71 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN----EVEWIKDAFN 128 +V K +++ A W H G RFDVVA T VE AF+ Sbjct: 72 PEETVGPLKGRRVVAAATDWALGHGGLEQ--AIRFDVVAVTFGDGEPRVEHFPAAFD 126 >UniRef50_D2LER1 Putative uncharacterized protein n=1 Tax=Rhodomicrobium vannielii ATCC 17100 RepID=D2LER1_RHOVA Length = 124 Score = 106 bits (265), Expect = 2e-22, Method: Composition-based stats. Identities = 36/123 (29%), Positives = 58/123 (47%), Gaps = 4/123 (3%) Query: 6 TRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEV 65 S + + + G A E +A+ LE K R +A +GGEIDL+ + G FVEV Sbjct: 3 AASRARNAPNSYKIGVAAETRAKLLLEAKSYRILAERYKTKGGEIDLVAQRGDHLAFVEV 62 Query: 66 RYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-VEWIK 124 + RR+ AA +V +Q ++ A ++L H FDV+ + + + I+ Sbjct: 63 KCRRTQE--EAAYAVLPRQQARIATAAEVFLGEH-AGLSHESASFDVILVSPTQGLSHIE 119 Query: 125 DAF 127 AF Sbjct: 120 QAF 122 >UniRef50_B2RLS5 UPF0102 protein PGN_1801 n=2 Tax=Porphyromonas gingivalis RepID=Y1801_PORG3 Length = 135 Score = 106 bits (265), Expect = 3e-22, Method: Composition-based stats. Identities = 24/120 (20%), Positives = 46/120 (38%), Gaps = 9/120 (7%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 G E A + L +G + A N E+D++ R + VEV+ R Sbjct: 2 ADHNDRGRQGEEIALKHLRQQGYQIEALNWQSGRRELDIVASTSRELVVVEVKTRTEGFL 61 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT------GNEVEWIKDAF 127 +V K+ + ++A ++ + + RFDV++ +E ++AF Sbjct: 62 LAPEEAVDARKRRLISESAHHYVRMYAI---DLPVRFDVISVVLSADGSCKRIEHRENAF 118 >UniRef50_C7QA44 Putative uncharacterized protein n=1 Tax=Catenulispora acidiphila DSM 44928 RepID=C7QA44_CATAD Length = 157 Score = 106 bits (265), Expect = 3e-22, Method: Composition-based stats. Identities = 29/138 (21%), Positives = 49/138 (35%), Gaps = 19/138 (13%) Query: 6 TRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNER----GGEIDLIMREGRTTI 61 L+ Q G E A +L G + N R GE+D+I + Sbjct: 18 PSQADHSTLSPGQLGREGEDLAAAYLTACGYHVLDRNWRWRGPDVRGELDIIALASDLLV 77 Query: 62 FVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVD--------CRFDVV 113 +EV+ RR+A ++T +K+ +L + WLA H R DV+ Sbjct: 78 TIEVKTRRAATGARPFDAITEAKRARLWKLTNRWLAEHRLDPAVRHHLPRGIRGIRLDVI 137 Query: 114 AF-------TGNEVEWIK 124 T ++ ++ Sbjct: 138 GLIYPTDGHTEPTIDHLQ 155 >UniRef50_Q83I01 UPF0102 protein TW312 n=2 Tax=Tropheryma whipplei RepID=Y312_TROW8 Length = 120 Score = 106 bits (265), Expect = 3e-22, Method: Composition-based stats. Identities = 26/117 (22%), Positives = 46/117 (39%), Gaps = 4/117 (3%) Query: 12 RQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSA 71 ++ G E +A +L G + N R GE+D+I R+ + VEV+ + Sbjct: 3 HDVSKYALGRIAEDKACNYLSVNGYIVLDRNWYCRFGELDIIARKNGVIVAVEVKGGKRN 62 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG---NEVEWIKD 125 ++T K KL + WL + + +D R D V+ T ++ Sbjct: 63 A-DYPICNITVKKLSKLTFLLKAWLHENKLNEFCIDLRIDAVSVTFIPELQIRHFVG 118 >UniRef50_A0NXE9 Putative uncharacterized protein n=2 Tax=Labrenzia RepID=A0NXE9_9RHOB Length = 134 Score = 105 bits (264), Expect = 3e-22, Method: Composition-based stats. Identities = 38/135 (28%), Positives = 63/135 (46%), Gaps = 9/135 (6%) Query: 1 MATVPTRSG-----SPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMR 55 MA P SG + R+ G + E A +L G R + + GEIDLI + Sbjct: 1 MARAPGGSGKLPAETDRRRRAHALGLSAETLAAWYLRLTGWRILKRRYKTKAGEIDLIAK 60 Query: 56 EGRTTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF 115 +T F+EV+ R+S A +VT + Q ++ + A++++A H RFD++ Sbjct: 61 RRKTVAFIEVKARKSRQ--AALEAVTPASQKRITRAAKIFVAEHP-KAGFYTLRFDIIVV 117 Query: 116 TGNEV-EWIKDAFND 129 + E I +AF+ Sbjct: 118 RPRALPERIVNAFHA 132 >UniRef50_Q7UM23 UPF0102 protein RB9115 n=1 Tax=Rhodopirellula baltica RepID=Y9115_RHOBA Length = 167 Score = 105 bits (264), Expect = 3e-22, Method: Composition-based stats. Identities = 42/123 (34%), Positives = 57/123 (46%), Gaps = 11/123 (8%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIM--REGRTTIFVEVRYRRSALY 73 Q G E A + L KGL IA + ++R GEIDLI + R +FVEV+ + Sbjct: 39 NAQLGRRGEQAAAQLLRRKGLNVIAESESDRAGEIDLIALRKRPRLIVFVEVKTLSTTRP 98 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF-------TGNEVEWIKDA 126 G A V +KQ ++ + A +L R + CRFDVVA VE + A Sbjct: 99 GHPADRVDENKQARITRAALRYLKRKKLL--GITCRFDVVAVWWPRDEPRPTRVEHYESA 156 Query: 127 FND 129 FN Sbjct: 157 FNA 159 >UniRef50_C2ANQ3 Predicted endonuclease related to Holliday junction resolvase n=1 Tax=Tsukamurella paurometabola DSM 20162 RepID=C2ANQ3_TSUPA Length = 122 Score = 105 bits (263), Expect = 4e-22, Method: Composition-based stats. Identities = 28/107 (26%), Positives = 40/107 (37%), Gaps = 7/107 (6%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNER----GGEIDLIMRE-GRTTIFVEVRYR 68 + + G A E +L G+G R + N GE+D+I + VEV+ R Sbjct: 1 MGNNEVGRAGEDLVCEYLTGRGWRVLDRNWRFSGSGLRGELDVIAQSADGVLAVVEVKTR 60 Query: 69 RSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF 115 YG +VT K +L WLA R DV + Sbjct: 61 SGTAYGSGFEAVTPRKVAQLRALTARWLAE--SENAYRRVRIDVASV 105 >UniRef50_Q3M3N9 UPF0102 protein Ava_4800 n=2 Tax=Nostocaceae RepID=Y4800_ANAVT Length = 151 Score = 105 bits (263), Expect = 4e-22, Method: Composition-based stats. Identities = 30/113 (26%), Positives = 53/113 (46%), Gaps = 8/113 (7%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE-----GRTTIFVEVRYR 68 ++ + E +WL+ G + + R GEID+I + FVEV+ R Sbjct: 1 MSHLNIANLGEDFVAQWLQSTGWMILNRQFSCRWGEIDIIAQHTRNNQESILAFVEVKTR 60 Query: 69 RSALYGG-AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEV 120 + ++T KQ K+ +TAR++LA++ + + CRFDV A ++ Sbjct: 61 SPGNWDDGGRGAITLKKQAKIERTARIFLAKYPDKAEYI-CRFDV-AIVSYQI 111 >UniRef50_A1HN64 Putative uncharacterized protein n=1 Tax=Thermosinus carboxydivorans Nor1 RepID=A1HN64_9FIRM Length = 78 Score = 105 bits (263), Expect = 4e-22, Method: Composition-based stats. Identities = 25/70 (35%), Positives = 34/70 (48%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAA 78 G E A +L G + + N R GEID++ T +FVEV+ R S +G A Sbjct: 2 MGKMGENAAADYLARNGYKILMRNYRCRIGEIDIVAERQGTIVFVEVKTRSSEKFGFPAE 61 Query: 79 SVTRSKQHKL 88 +V KQ KL Sbjct: 62 AVNYRKQQKL 71 >UniRef50_C6XIG7 Putative uncharacterized protein n=1 Tax=Hirschia baltica ATCC 49814 RepID=C6XIG7_HIRBI Length = 138 Score = 105 bits (263), Expect = 5e-22, Method: Composition-based stats. Identities = 34/119 (28%), Positives = 59/119 (49%), Gaps = 4/119 (3%) Query: 12 RQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSA 71 ++ ++ G E A WL KG + + V R GEIDLI +GR F+EV+ R++ Sbjct: 23 KRRKHEKRGRNAEWLASIWLRLKGYKILQKRVRMRTGEIDLIATKGRVIAFIEVKARKTI 82 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEV-EWIKDAFND 129 G SV + ++ +TA +W+A+ F + D R+D+V ++ +K + Sbjct: 83 NIG--LQSVPETSWRRISKTAEIWMAKK-TKFKSHDWRYDLVVVCPWKIPSHLKAFWRP 138 >UniRef50_Q0BTH9 UPF0102 protein GbCGDNIH1_0975 n=1 Tax=Granulibacter bethesdensis CGDNIH1 RepID=Y975_GRABC Length = 158 Score = 105 bits (262), Expect = 5e-22, Method: Composition-based stats. Identities = 33/120 (27%), Positives = 52/120 (43%), Gaps = 4/120 (3%) Query: 12 RQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSA 71 R + G E A + LE G + + + GEID++ VEV+YR + Sbjct: 37 RGGKASRDGLEAERIAAQALEADGWQILGRRLRTSAGEIDILAEMDGLLAIVEVKYRPTL 96 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT-GNEVEWIKDAFNDH 130 AA ++ ++ +L+ A LA+H + T RFDV+ +V I DAF Sbjct: 97 S--EAAHALGPRQRKRLIAAASYVLAQHP-EYGTEGVRFDVIVVDMAGQVRRITDAFRLD 153 >UniRef50_C3XDT2 Putative uncharacterized protein n=1 Tax=Helicobacter bilis ATCC 43879 RepID=C3XDT2_9HELI Length = 112 Score = 105 bits (262), Expect = 6e-22, Method: Composition-based stats. Identities = 30/110 (27%), Positives = 52/110 (47%), Gaps = 6/110 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 +Q G +E A +L G FI N + R GEIDLIM++ F+EV+ S+ Sbjct: 4 MRQKGRYYEQVALEYLISLGFEFIEQNFHSRYGEIDLIMKKDSILHFIEVK---SSHCIN 60 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 A++T K +L +T ++L + D V+ + + +I++ Sbjct: 61 PLANITPKKLERLTKTIHVFLDQRQIVSHFC---IDAVSIYKDNITFIEN 107 >UniRef50_A3VPC2 Putative uncharacterized protein n=1 Tax=Parvularcula bermudensis HTCC2503 RepID=A3VPC2_9PROT Length = 146 Score = 105 bits (262), Expect = 6e-22, Method: Composition-based stats. Identities = 35/131 (26%), Positives = 54/131 (41%), Gaps = 3/131 (2%) Query: 2 ATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTI 61 A + S L ++ G E +A +L+ KG V GEIDLI+ +G T Sbjct: 8 ARQSAKRQSAEYLAAERLGRRAERRAALFLQLKGYAIRDRRVRTPRGEIDLIVTKGSTLA 67 Query: 62 FVEVRYRRSAL-YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEV 120 F+EV+ R SA A V ++ + +W R S RFD++ Sbjct: 68 FIEVKARTSADALQDPATLVPPQNWARIAAASAIW--RARASLMPKIVRFDLILVRRGIP 125 Query: 121 EWIKDAFNDHS 131 +KDA+ + Sbjct: 126 CHVKDAYRPDA 136 >UniRef50_UPI0001C31AB5 protein of unknown function UPF0102 n=1 Tax=Conexibacter woesei DSM 14684 RepID=UPI0001C31AB5 Length = 122 Score = 104 bits (261), Expect = 7e-22, Method: Composition-based stats. Identities = 28/118 (23%), Positives = 46/118 (38%), Gaps = 7/118 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 G E A LE +G + N R GE+D++ + +F EV+ RR Sbjct: 6 RHHLGRIGENLAVEHLERRGFVVLDRNYRTRWGELDVVACDDERIVFCEVKTRRLGSSA- 64 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN------EVEWIKDAF 127 + ++ +L + A WL + RFD V T + +E ++ AF Sbjct: 65 PLEGLREPQRRRLRRMAVSWLQAKPRRTYVPELRFDAVGVTIDATGQLVALEHLEGAF 122 >UniRef50_D1Y396 Putative uncharacterized protein n=1 Tax=Pyramidobacter piscolens W5455 RepID=D1Y396_9BACT Length = 164 Score = 104 bits (261), Expect = 7e-22, Method: Composition-based stats. Identities = 34/108 (31%), Positives = 51/108 (47%), Gaps = 3/108 (2%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 G E A +L+ +GL+ + NV ER E+DL+ EG+T +FVEVR RR Sbjct: 43 RAAIGRWAEELAAGFLQAQGLKILERNVRERFSELDLVALEGKTLVFVEVRCRRKNPVMS 102 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWI 123 A ++ K +LL+ A L+ R + R D+V+ W Sbjct: 103 AQDTIGPLKWRRLLRGAELYTLRRQWRGE---WRLDLVSVDVGHERWH 147 >UniRef50_D1N5L9 Putative uncharacterized protein n=1 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1N5L9_9BACT Length = 132 Score = 104 bits (261), Expect = 8e-22, Method: Composition-based stats. Identities = 29/118 (24%), Positives = 51/118 (43%), Gaps = 7/118 (5%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + G EA A R LE KG +A N + GE+D++ R+G + +FVEV+ Sbjct: 5 RAAHLALGRRGEAAACRLLEAKGFDILARNWRVKAGELDIVARDGASVVFVEVKTLHRKG 64 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG-----NEVEWIKD 125 + +++ ++ + A+L+L + RFD+V E+ D Sbjct: 65 FFRPLDNLSAHQKKRNFHAAQLYLRM--IGGTGLPVRFDLVEVVASRWRLREIRHHHD 120 >UniRef50_C1F7M4 Putative uncharacterized protein n=1 Tax=Acidobacterium capsulatum ATCC 51196 RepID=C1F7M4_ACIC5 Length = 157 Score = 104 bits (260), Expect = 9e-22, Method: Composition-based stats. Identities = 34/128 (26%), Positives = 50/128 (39%), Gaps = 9/128 (7%) Query: 8 SGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERG--GEIDLIMREGRTTIFVEV 65 + SP + TG E A +L G +A G++DLI EG +EV Sbjct: 27 AASPEEPAHLTTGRRGELAAYGFLRRNGYTIVARGWRSHICPGDLDLIAWEGEHLCVIEV 86 Query: 66 RYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-----EV 120 + R + A A+V K+ L AR +L RFDVV+ + E Sbjct: 87 KARTTRDVATAEAAVDHQKRRTLRMLARRYLRLAGIPQSAA--RFDVVSVYFDSGHAPEF 144 Query: 121 EWIKDAFN 128 ++AF Sbjct: 145 TLYRNAFG 152 >UniRef50_A4X4J0 UPF0102 protein Strop_1320 n=2 Tax=Micromonosporaceae RepID=Y1320_SALTO Length = 121 Score = 104 bits (260), Expect = 1e-21, Method: Composition-based stats. Identities = 40/119 (33%), Positives = 54/119 (45%), Gaps = 6/119 (5%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + G E A R L GLR +A N GEID+I +G EV+ RR+ Sbjct: 5 SRHNQSVGAYGERCALRHLITAGLRPVARNWRCPHGEIDIIAWDGPVLAICEVKTRRTDT 64 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT----GNEVEWIKDAF 127 +G A+VT +K +L A WLA D + RFDV++ VE +K AF Sbjct: 65 FGTPTAAVTGTKARRLRLLAARWLAETGTRAD--EVRFDVLSIRLTGGPPHVEHLKGAF 121 >UniRef50_D0WR56 Putative endonuclease n=1 Tax=Actinomyces sp. oral taxon 848 str. F0332 RepID=D0WR56_9ACTO Length = 138 Score = 104 bits (260), Expect = 1e-21, Method: Composition-based stats. Identities = 32/122 (26%), Positives = 57/122 (46%), Gaps = 9/122 (7%) Query: 11 PRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLI--MREGRTTIFVEVRYR 68 P L ++ G E A R+L+ G +A N R GE+DL+ + R + VEV+ R Sbjct: 17 PEPLGNQELGKWGEELAARYLQAYGYVVLARNWRRRAGELDLVTACPQRRAVVAVEVKTR 76 Query: 69 RSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-----VEWI 123 + + G+ +++R+K +L + +WL + V D+VA T + + Sbjct: 77 NAEVSVGSVEAISRAKLARLRKLTGMWLQETGTRCERVCL--DLVAITVENDGSWLIRHL 134 Query: 124 KD 125 +D Sbjct: 135 RD 136 >UniRef50_Q0A6J0 UPF0102 protein Mlg_2205 n=2 Tax=Ectothiorhodospiraceae RepID=Y2205_ALHEH Length = 126 Score = 104 bits (259), Expect = 1e-21, Method: Composition-based stats. Identities = 52/124 (41%), Positives = 67/124 (54%), Gaps = 1/124 (0%) Query: 7 RSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVR 66 R +TG+ E +A L G+GL + N R GEIDLIMR+G +FVEVR Sbjct: 3 RRRPSAPAPHLETGNRGERRALEHLTGQGLELLECNFRCRAGEIDLIMRDGEVVVFVEVR 62 Query: 67 YRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDA 126 R YGGA AS+T +KQ +L + A WL RH + CRFDVV F G +W++ A Sbjct: 63 VRTHPGYGGALASITPAKQRRLARAAARWLQRHRLT-QRAVCRFDVVTFDGERPQWLRHA 121 Query: 127 FNDH 130 F Sbjct: 122 FTAP 125 >UniRef50_UPI00019790C0 hypothetical protein HcinC1_06745 n=2 Tax=Helicobacter cinaedi CCUG 18818 RepID=UPI00019790C0 Length = 116 Score = 104 bits (259), Expect = 1e-21, Method: Composition-based stats. Identities = 28/112 (25%), Positives = 49/112 (43%), Gaps = 5/112 (4%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRR--SALY 73 ++ G E +A +L G I N R GEID+I + F+EV+ + Sbjct: 4 SRAKGKEAEDKACAFLRENGFEIIERNFFARYGEIDIIAQRDGILHFIEVKSASVGAKSG 63 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 ++T SK KL+ T +L+ N + + D + G +E+I++ Sbjct: 64 FEPIYNITPSKIEKLISTIGFYLSTQNLTQEYC---LDALIIKGGHIEFIEN 112 >UniRef50_B9L042 Putative uncharacterized protein n=2 Tax=Thermomicrobia (class) RepID=B9L042_THERP Length = 125 Score = 104 bits (259), Expect = 1e-21, Method: Composition-based stats. Identities = 36/106 (33%), Positives = 54/106 (50%), Gaps = 1/106 (0%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + + G+A E A RWLE G +A N R GE+D++ +G + VEV+ RR A Sbjct: 1 MGRQCLGEAGERAAARWLEEAGWHVLARNWRCRQGELDIVALDGDVLVAVEVKVRRDAGN 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE 119 A +VT K +LL +LA H + + CR D++A T + Sbjct: 61 EPAEWAVTPRKGRRLLAALSAFLAAHPEHQERL-CRVDLIAVTVDR 105 >UniRef50_C7NGY4 Predicted endonuclease related to Holliday junction resolvase n=1 Tax=Kytococcus sedentarius DSM 20547 RepID=C7NGY4_KYTSD Length = 120 Score = 103 bits (258), Expect = 2e-21, Method: Composition-based stats. Identities = 30/116 (25%), Positives = 49/116 (42%), Gaps = 7/116 (6%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 + G E A RW + +G R + N R GEIDL++ G + EV+ R + +G Sbjct: 5 RRTLGRRGEDIAARWWQERGARVLERNWRHRLGEIDLVVTSGPRLVVCEVKTRSTVAFGQ 64 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT-----GNEVEWIKDA 126 V ++ +L + WL H G + + R DV+ V+ + A Sbjct: 65 PVEMVALPQRRRLRRLTAAWLQEHPGRWA--EVRIDVIGVLLPPGGPATVQHVPGA 118 >UniRef50_A4ECJ4 Putative uncharacterized protein n=1 Tax=Collinsella aerofaciens ATCC 25986 RepID=A4ECJ4_9ACTN Length = 161 Score = 103 bits (257), Expect = 2e-21, Method: Composition-based stats. Identities = 26/128 (20%), Positives = 44/128 (34%), Gaps = 14/128 (10%) Query: 12 RQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEI--DLIMREGRTTIFVEVRYRR 69 + L+ ++ G E + +G + GE L+ + EV+ RR Sbjct: 33 KGLSPRELGMLGELITIDYFNERGYTLLEQGYRCTEGEADLVLLDELDDVVVMAEVKTRR 92 Query: 70 ----SALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-----EV 120 +V KQ + + A +L H + RFD V T E+ Sbjct: 93 VALDCNTRVFPEEAVDAQKQRRYRRIASCYLMEH---YPLKAIRFDAVGVTIRGGHIAEI 149 Query: 121 EWIKDAFN 128 E +AF+ Sbjct: 150 EHQYNAFD 157 >UniRef50_Q5SLC1 UPF0102 protein TTHA0372 n=5 Tax=Thermaceae RepID=Y372_THET8 Length = 112 Score = 102 bits (256), Expect = 3e-21, Method: Composition-based stats. Identities = 36/109 (33%), Positives = 52/109 (47%), Gaps = 9/109 (8%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAA 78 G E +A R+L GKG R + N GE+DL M + + VEV+ R SA +G Sbjct: 5 RGRWAEEEALRFLLGKGYRLLWRNRRTPFGEVDLFMEKDGVYVVVEVKQRASARFGAPLE 64 Query: 79 SVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN----EVEWI 123 ++T K +LLQ+AR L R D + R + V G +E + Sbjct: 65 AITPGKVRRLLQSARFLLGR-----DDLPVRLEAVLVHGTPKDFRLEHL 108 >UniRef50_A7HZ51 UPF0102 protein Plav_3586 n=1 Tax=Parvibaculum lavamentivorans DS-1 RepID=Y3586_PARL1 Length = 133 Score = 102 bits (255), Expect = 3e-21, Method: Composition-based stats. Identities = 41/130 (31%), Positives = 57/130 (43%), Gaps = 5/130 (3%) Query: 2 ATVPTRSGSPR-QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTT 60 A R G+P L + G E A L KG R +A + GEIDL++R GR Sbjct: 7 APRAARKGNPATGLAAYRLGLRAETLAVLLLRLKGYRVVARRLKTPAGEIDLVVRRGRAL 66 Query: 61 IFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEV 120 VEV+ R AA ++ +Q +L + A L R+ F +D RFDVV Sbjct: 67 AVVEVKARGEGD--AAAEALLPRQQRRLERAAAHLLGRYP-HFADLDLRFDVVLIVPRRW 123 Query: 121 -EWIKDAFND 129 + DA+ Sbjct: 124 PRHLADAWRP 133 >UniRef50_B5JM98 Putative uncharacterized protein n=1 Tax=Verrucomicrobiae bacterium DG1235 RepID=B5JM98_9BACT Length = 141 Score = 102 bits (255), Expect = 3e-21, Method: Composition-based stats. Identities = 33/113 (29%), Positives = 48/113 (42%), Gaps = 5/113 (4%) Query: 7 RSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVR 66 R+ P G E +A + L+ KG + +A N EIDLI G+ +FVEVR Sbjct: 11 RASEPESAA---IGRRGEREAEKLLKRKGYQILARNWRSGRDEIDLICLHGKAVVFVEVR 67 Query: 67 YRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE 119 R+ S+ R K+ L + R + + RFDVV +E Sbjct: 68 TRKVGALVSGYDSIDRRKREALRRVCRSYFGMMKPK--PITLRFDVVEIEHDE 118 >UniRef50_D2L5G2 Putative uncharacterized protein n=1 Tax=Desulfovibrio sp. FW1012B RepID=D2L5G2_9DELT Length = 134 Score = 102 bits (255), Expect = 4e-21, Method: Composition-based stats. Identities = 39/115 (33%), Positives = 57/115 (49%), Gaps = 7/115 (6%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRR-SALYGGAA 77 G EA A L G G R N RGGE+DLI +G T +FVEV+ R +L Sbjct: 8 LGREGEAVAEALLVGAGFRVEVRNYRTRGGEVDLICLDGDTVVFVEVKARGPGSLLDRPE 67 Query: 78 ASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN----EVEWIKDAFN 128 +VT +K+ ++ + A +L+ ++ CRFDVVA + + + DAF Sbjct: 68 EAVTPAKRGRIARAAAAFLSER--AWWDRPCRFDVVAVSVHGGRRTATHLPDAFG 120 >UniRef50_Q6NGK0 UPF0102 protein DIP1513 n=1 Tax=Corynebacterium diphtheriae RepID=Y1513_CORDI Length = 122 Score = 102 bits (255), Expect = 4e-21, Method: Composition-based stats. Identities = 32/106 (30%), Positives = 44/106 (41%), Gaps = 6/106 (5%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREG-RTTIFVEVRYRRSALY 73 E + +G A NV+ GEID+I +F+EV+ R S Sbjct: 5 HNHYLAVLGEDFVAQQYANEGYDITARNVSFSVGEIDIIATSPQGEVVFIEVKTRSS-SL 63 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE 119 AA +VT +K K+ + A WL D RFDVVA +E Sbjct: 64 MDAAEAVTPTKMRKIHRAASKWLQGKPF----ADIRFDVVAVHVDE 105 >UniRef50_D1B6E0 Putative uncharacterized protein n=1 Tax=Thermanaerovibrio acidaminovorans DSM 6589 RepID=D1B6E0_THEAS Length = 118 Score = 102 bits (254), Expect = 5e-21, Method: Composition-based stats. Identities = 35/117 (29%), Positives = 49/117 (41%), Gaps = 8/117 (6%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + G E A R L G R + NV GEID++ +G +FVEVR R Sbjct: 2 EARNLARGALGEEMAVRHLIRMGWRILGRNVRYPFGEIDIVAHDGTELVFVEVRLR-GPG 60 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT----GNEVEWIKD 125 AA +V +K KL++ R + R D++A G VE I+D Sbjct: 61 PQRAAETVGPAKLRKLIRACRAFAESRG---YDGPFRIDLLAIDQGPCGYRVELIRD 114 >UniRef50_A8YJ85 Similar to Y189_SYNY3 UPF0102 protein sll0189 n=2 Tax=Microcystis aeruginosa RepID=A8YJ85_MICAE Length = 139 Score = 101 bits (253), Expect = 6e-21, Method: Composition-based stats. Identities = 25/104 (24%), Positives = 49/104 (47%), Gaps = 4/104 (3%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIM--REGRTTIFVEVRYRRSALYG-G 75 G+ E WL+ + + GGEIDLI+ + FVEV+ R + + G Sbjct: 4 VGELGENLVADWLQLQQWHILQRRWRSGGGEIDLIVLSKSQAILAFVEVKTRSAGNWDLG 63 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE 119 ++ K+ ++ + A+++L+ + + + CRFDV + + Sbjct: 64 GKLAIDDRKKGRIYEAAQIFLSFYP-QWSDLTCRFDVALVSCQK 106 >UniRef50_B6JM54 UPF0102 protein HPP12_0830 n=15 Tax=Epsilonproteobacteria RepID=Y830_HELP2 Length = 114 Score = 101 bits (253), Expect = 6e-21, Method: Composition-based stats. Identities = 24/111 (21%), Positives = 51/111 (45%), Gaps = 6/111 (5%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 ++ G E +A +L+ G + N + GEID+I + F+EV+ + Sbjct: 7 KHREKGLKAEEEACGFLKSLGFEMVERNFFSQFGEIDIIALKKGVLHFIEVKSGENF--- 63 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 ++T SK K+++T R +L++ + + D D + + E +++ Sbjct: 64 DPIYAITPSKLKKMIKTIRCYLSQKDPNSDFC---IDALIVKNGKFELLEN 111 >UniRef50_Q2RJT6 UPF0102 protein Moth_0988 n=2 Tax=Clostridia RepID=Y988_MOOTA Length = 120 Score = 101 bits (253), Expect = 7e-21, Method: Composition-based stats. Identities = 39/121 (32%), Positives = 54/121 (44%), Gaps = 8/121 (6%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 +T ++ G EA A L G R + N GEID++ +G +F+EVR R S Sbjct: 2 TMTRRRRGQIGEAAAAALLADSGYRILERNYRCPLGEIDIVAAQGEEIVFIEVRTRSSQT 61 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE------VEWIKDA 126 +G SV K+ +L + A + CRFDVVA + VE IK A Sbjct: 62 FGTPQESVDGRKRLRLRRLAAY--YLGSRGLAGRSCRFDVVAVWLDRQERVAGVEVIKGA 119 Query: 127 F 127 F Sbjct: 120 F 120 >UniRef50_A3JND8 Putative uncharacterized protein n=1 Tax=Rhodobacterales bacterium HTCC2150 RepID=A3JND8_9RHOB Length = 117 Score = 101 bits (252), Expect = 7e-21, Method: Composition-based stats. Identities = 28/117 (23%), Positives = 52/117 (44%), Gaps = 4/117 (3%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 T+ G + E R + +G + GEID+I RE IF+EV+ +S Sbjct: 3 GKTSYLAGLSAEEAVERHCKRRGKTILHRRWRGSVGEIDIIAREQDQVIFIEVK--KSKS 60 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG-NEVEWIKDAFN 128 + A + ++ ++Q ++ T +LA + RFDV +++ I++A Sbjct: 61 FYDAISHLSVAQQQRIYATGSEYLA-NEELGQNTPVRFDVALVDSMGQIKVIENAIG 116 >UniRef50_C3JBE2 Putative uncharacterized protein n=2 Tax=Bacteria RepID=C3JBE2_9PORP Length = 127 Score = 100 bits (250), Expect = 1e-20, Method: Composition-based stats. Identities = 28/123 (22%), Positives = 49/123 (39%), Gaps = 10/123 (8%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNE-RGGEIDLIMREGRTTIFVEVRYRRSAL 72 G E A R+LE + + N + E+D+ + R I +EV+ R Sbjct: 2 AQHNDLGVLGERAAYRYLEQLKYKILDTNWSIDGKKEVDIFATDERELIVIEVKTRNEDY 61 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN------EVEWIKDA 126 ++VTR KQ ++ ++ + RFDV+ + ++E+ KDA Sbjct: 62 SVSPLSAVTRRKQANIISLTNAYIRLKGITL---PIRFDVLTAVFHPFDQSFDIEYYKDA 118 Query: 127 FND 129 F Sbjct: 119 FRA 121 >UniRef50_C3PH56 Putative uncharacterized protein n=1 Tax=Corynebacterium aurimucosum ATCC 700975 RepID=C3PH56_CORA7 Length = 145 Score = 100 bits (250), Expect = 1e-20, Method: Composition-based stats. Identities = 35/102 (34%), Positives = 52/102 (50%), Gaps = 3/102 (2%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE-GRTTIFVEVRYRRSALYGGAA 77 G A E A + +G + IAANV+ R GE+DL++RE T +F EV+ R + +G A Sbjct: 24 LGKAGEKFAADFYRARGAQVIAANVHYRVGELDLVVRESDGTIVFCEVKTRATRNFGVA- 82 Query: 78 ASVTRSKQHKLLQTARLWLARHNGSFDTVD-CRFDVVAFTGN 118 +VT K +L + A WL+ + RFDV+ Sbjct: 83 EAVTPRKLKRLRKAAAQWLSTARSENQALSKVRFDVLGLVAT 124 >UniRef50_C3XN83 Putative uncharacterized protein n=3 Tax=Helicobacter RepID=C3XN83_9HELI Length = 122 Score = 100 bits (250), Expect = 1e-20, Method: Composition-based stats. Identities = 26/92 (28%), Positives = 44/92 (47%), Gaps = 3/92 (3%) Query: 12 RQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSA 71 T Q G E A +LE +G A N N R GEID+I ++ FVEV+ Sbjct: 5 NTANTTQKGKEAEDFACAFLENEGYSIEARNFNTRFGEIDIIAKKDGILHFVEVK--SGI 62 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSF 103 + ++T +K K+++T ++L ++ + Sbjct: 63 GF-EPIYNITPTKVQKIIKTIEIYLKEYHLNL 93 >UniRef50_A6FT82 PII uridylyl-transferase n=5 Tax=Rhodobacterales RepID=A6FT82_9RHOB Length = 175 Score = 99.8 bits (248), Expect = 2e-20, Method: Composition-based stats. Identities = 32/129 (24%), Positives = 53/129 (41%), Gaps = 4/129 (3%) Query: 2 ATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTI 61 A + R L + G A E + +G+ + RGGEIDLI+R+G + Sbjct: 49 AGPAKTARRDRGLRSWLAGAAAEKIVALAYDKRGIDLLETRWRGRGGEIDLILRDGSEIV 108 Query: 62 FVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG-NEV 120 F EV+ RS A + ++ ++ A +L R RFD+ G Sbjct: 109 FCEVKAARSTQ--EAIQRLRPAQMRRIHAAASEYLGRVP-EGQLAQVRFDLAVVDGTGRA 165 Query: 121 EWIKDAFND 129 + +++AF Sbjct: 166 DILENAFGH 174 >UniRef50_C4DPS8 Predicted endonuclease related to Holliday junction resolvase n=1 Tax=Stackebrandtia nassauensis DSM 44728 RepID=C4DPS8_9ACTO Length = 207 Score = 99.0 bits (246), Expect = 4e-20, Method: Composition-based stats. Identities = 25/98 (25%), Positives = 41/98 (41%) Query: 11 PRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRS 70 P + G E A L G+R + N GE+D+I E T+F EV+ RRS Sbjct: 2 PHDRRHLRLGCFGENLAVAHLRRDGMRVLQRNWRCEHGELDIIAIERGVTVFCEVKTRRS 61 Query: 71 ALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDC 108 +G ++ +K ++ + A W R+ + Sbjct: 62 LRFGTPMQAIDEAKALRIRRLAASWHRRYRDKPPWAEW 99 >UniRef50_Q2KDE4 UPF0102 protein RHE_CH00320 n=4 Tax=Rhizobiales RepID=Y320_RHIEC Length = 122 Score = 99.0 bits (246), Expect = 4e-20, Method: Composition-based stats. Identities = 36/116 (31%), Positives = 54/116 (46%), Gaps = 4/116 (3%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + + G E A +L KG R +A R GEID++ R+G TIFVEV+ R Sbjct: 10 KRKALRRGRMSEYVAAAFLMLKGYRILALRHRTRLGEIDIVARKGDLTIFVEVKARHGE- 68 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEV-EWIKDAF 127 A +V+ + Q ++ + LWLAR R+D++A + DAF Sbjct: 69 -AAAIDAVSVAAQKRIRAASDLWLARQADQARLSQ-RYDIIAVMPGRLPRHFPDAF 122 >UniRef50_B0P7L1 Putative uncharacterized protein n=1 Tax=Anaerotruncus colihominis DSM 17241 RepID=B0P7L1_9FIRM Length = 132 Score = 98.2 bits (244), Expect = 6e-20, Method: Composition-based stats. Identities = 34/124 (27%), Positives = 52/124 (41%), Gaps = 9/124 (7%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 ++ + G A EA A LE +G R + N EIDLI + G FVEV+ R Sbjct: 1 MSARTYGAAGEAFAASALEAEGYRILERNWRSGRSEIDLIAQRGDIIAFVEVKTRGEHAL 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNG-SFDTVDCRFDVVAFTGN--------EVEWIK 124 AA VTR+++ ++ A +L + V RFDV + Sbjct: 61 AAPAAFVTRAQRRRIALAAVEYLRARGIYNTGAVQPRFDVFEIVTGGPDGACVTRFSHLV 120 Query: 125 DAFN 128 +A++ Sbjct: 121 NAYD 124 >UniRef50_B2GFY9 Putative uncharacterized protein n=1 Tax=Kocuria rhizophila DC2201 RepID=B2GFY9_KOCRD Length = 144 Score = 97.9 bits (243), Expect = 9e-20, Method: Composition-based stats. Identities = 31/130 (23%), Positives = 42/130 (32%), Gaps = 14/130 (10%) Query: 3 TVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVN------ERGGEIDLIMRE 56 + + +P T G A E L G N GE+D++ Sbjct: 10 SRSSAVPTPDAPTALDVGRAGEDLIADLLARSGWSVRDRNWRPAPGPGRPRGELDIVAER 69 Query: 57 GRTTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT 116 G EV+ R A +G +V K +L AR W H DVVA Sbjct: 70 GGVVTVFEVKTRSGADFGHPCEAVGAEKLRRLHVLARAWAREHRDPRVPT---VDVVAV- 125 Query: 117 GNEVEWIKDA 126 W +DA Sbjct: 126 ----HWPRDA 131 >UniRef50_D1NDV3 Fimbrial usher protein (Fragment) n=1 Tax=Haemophilus influenzae HK1212 RepID=D1NDV3_HAEIN Length = 323 Score = 97.9 bits (243), Expect = 1e-19, Method: Composition-based stats. Identities = 42/80 (52%), Positives = 51/80 (63%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 +Q G ++E QAR +LE KGL FIAAN N + GE+DLIM + T +FVEVR R + YG Sbjct: 167 KRQQGASFEHQARLFLESKGLTFIAANQNFKCGELDLIMNDKETIVFVEVRQRSHSAYGS 226 Query: 76 AAASVTRSKQHKLLQTARLW 95 A SV KQ K L A LW Sbjct: 227 AIESVDWRKQQKWLDAANLW 246 >UniRef50_C7JH91 Putative uncharacterized protein n=8 Tax=Acetobacter pasteurianus RepID=C7JH91_ACEP3 Length = 149 Score = 97.5 bits (242), Expect = 1e-19, Method: Composition-based stats. Identities = 33/115 (28%), Positives = 50/115 (43%), Gaps = 4/115 (3%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 G A E QA WLE G + GEID++ + FVEV+ RRS Sbjct: 37 AYTQGVAAEQQACNWLEQDGWTVLLRRARTHRGEIDIVASKAVVLCFVEVKKRRS--IEE 94 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF-TGNEVEWIKDAFND 129 A S+ ++Q +L + A L +H + + RFD+ F +E ++D Sbjct: 95 ALVSLQPAQQRRLFRAAECLLQKHPY-WQYEEMRFDLFVFDDAGRMERLEDVIRQ 148 >UniRef50_O66457 UPF0102 protein aq_041 n=1 Tax=Aquifex aeolicus RepID=Y041_AQUAE Length = 103 Score = 97.5 bits (242), Expect = 1e-19, Method: Composition-based stats. Identities = 29/107 (27%), Positives = 45/107 (42%), Gaps = 10/107 (9%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAA 78 G +E A R+L+ KG + + N+ GEID++ + VEV+ + A Sbjct: 2 KGREYEDLAARYLKSKGYQILGRNLRSPYGEIDILAEFEGRKVIVEVKGSETFF---PAE 58 Query: 79 SVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 VT K K+++TA L S + VV +V KD Sbjct: 59 KVTPHKLSKIIRTAYEVLGEEPFSIE-------VVVVYRGKVYHYKD 98 >UniRef50_A6Q6T2 UPF0102 protein SUN_0231 n=3 Tax=Epsilonproteobacteria RepID=Y231_SULNB Length = 113 Score = 97.1 bits (241), Expect = 1e-19, Method: Composition-based stats. Identities = 31/111 (27%), Positives = 49/111 (44%), Gaps = 6/111 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERG-GEIDLIMREGRTTIFVEVRYRRSALYG 74 K GD E A +LE +G I N R GEID+I ++ F+EV+ Sbjct: 5 PKIFGDKSEDLATLFLEQEGFIVIERNYFARKLGEIDIIAQKDEVLHFIEVK--SGKADF 62 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 +VT K K++ +A ++ V D + G+EVE+I++ Sbjct: 63 DPVYNVTPDKLRKVINSAHYYMKSKKI---DVSFSVDALIIRGDEVEFIEN 110 >UniRef50_C5BWW3 UPF0102 protein Bcav_2532 n=21 Tax=Actinomycetales RepID=Y2532_BEUC1 Length = 118 Score = 97.1 bits (241), Expect = 1e-19, Method: Composition-based stats. Identities = 42/118 (35%), Positives = 58/118 (49%), Gaps = 8/118 (6%) Query: 14 LTTKQ-TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + K G E A RWLE +GL + N GE+DL+ R+G T +FVEV+ R S Sbjct: 1 MRAKDAIGAYGERVAGRWLEAEGLEVVERNWRCPDGELDLVARDGETLVFVEVKTRSSLA 60 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT-----GNEVEWIKD 125 +G +VTR K +L + A WLA H+ + R DVVA VE ++ Sbjct: 61 FGHPGEAVTRLKLARLRRLAARWLAEHDAHA--REVRIDVVAVLRTRAGAARVEHLRG 116 >UniRef50_A3K994 Putative uncharacterized protein n=7 Tax=Rhodobacterales RepID=A3K994_9RHOB Length = 159 Score = 97.1 bits (241), Expect = 2e-19, Method: Composition-based stats. Identities = 31/131 (23%), Positives = 59/131 (45%), Gaps = 7/131 (5%) Query: 3 TVPTRSGSPRQLTTK---QTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRT 59 P + ++ + + G + E + + E +G +GGEIDLI+R+G Sbjct: 31 PEPDAARRAKKDAGRIGYEAGASAELRVAQDYERRGFPLARRRWRGQGGEIDLIVRDGDG 90 Query: 60 TIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG-N 118 IFVEV+ +S + AA ++R + ++++ A +L D RFD+ Sbjct: 91 LIFVEVK--KSRSFRHAAERLSRRQMNRIISAAEEFLGTQPLG-SLTDVRFDLAMVDVYG 147 Query: 119 EVEWIKDAFND 129 ++ I++A Sbjct: 148 QIRVIENAIGH 158 >UniRef50_Q0C451 UPF0102 protein HNE_0764 n=1 Tax=Hyphomonas neptunium ATCC 15444 RepID=Y764_HYPNA Length = 122 Score = 96.7 bits (240), Expect = 2e-19, Method: Composition-based stats. Identities = 39/121 (32%), Positives = 59/121 (48%), Gaps = 4/121 (3%) Query: 10 SPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRR 69 ++ + G E +A WL KG +AA V GGEIDLI R+GR FVEV+ R Sbjct: 2 PAKRQIAEARGRQAERRAALWLRLKGCSVLAARVKLPGGEIDLIARKGRLIAFVEVKAR- 60 Query: 70 SALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVE-WIKDAFN 128 A A +V+ H++ + A +W+ H F R+D++A + +KDA+ Sbjct: 61 -ARRDDALGAVSVQSWHRIARAAEVWMG-HRPKFAGYGWRYDLIALAPGSLPYHLKDAWR 118 Query: 129 D 129 Sbjct: 119 P 119 >UniRef50_A3UIK6 Putative uncharacterized protein n=1 Tax=Oceanicaulis alexandrii HTCC2633 RepID=A3UIK6_9RHOB Length = 128 Score = 96.3 bits (239), Expect = 3e-19, Method: Composition-based stats. Identities = 35/112 (31%), Positives = 56/112 (50%), Gaps = 4/112 (3%) Query: 18 QTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAA 77 + G EA + WL KG R + GE+DL+ R G F+EV++R + A Sbjct: 10 RRGRRTEAISALWLRLKGWRILDERARTGVGELDLVARRGGVLAFIEVKHRPTVD--AAR 67 Query: 78 ASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEV-EWIKDAFN 128 ++T +Q +L++ A LW +RH D + RFDV+ + I+ AF+ Sbjct: 68 LAITPRQQMRLIRAASLWRSRH-AGIDHLQPRFDVMLWPAQGWPRHIQGAFS 118 >UniRef50_A9BFT1 UPF0102 protein Pmob_0702 n=1 Tax=Petrotoga mobilis SJ95 RepID=Y702_PETMO Length = 112 Score = 96.3 bits (239), Expect = 3e-19, Method: Composition-based stats. Identities = 28/107 (26%), Positives = 52/107 (48%), Gaps = 4/107 (3%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + TK G +E +A + + R IA N + R GEID+I + + +EV+ + + Sbjct: 1 MNTK--GKVYEDKAVSFFLNRDYRIIARNFSYRHGEIDIIALKNKILHLIEVKGGKET-F 57 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEV 120 G A V K K+++ ++A H + + + DV++ T + V Sbjct: 58 GDPAFRVNSRKLKKIMKVGNYFIATHP-KLEFDEIQIDVISVTNDGV 103 >UniRef50_Q7U7D4 UPF0102 protein SYNW1051 n=3 Tax=Synechococcus RepID=Y1051_SYNPX Length = 134 Score = 96.3 bits (239), Expect = 3e-19, Method: Composition-based stats. Identities = 25/94 (26%), Positives = 45/94 (47%), Gaps = 2/94 (2%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + + G E + L+ +G + + N + R GE+DL++ + + VEV+ RRS Sbjct: 17 PMKMQPPGAQAETRVSSLLQRQGWQLLDRNWSCRWGELDLVLHKNEQLLVVEVKKRRSLA 76 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTV 106 +G SV +K+ +L + W A H D + Sbjct: 77 WGP--WSVDPTKRRRLGRAISCWRAEHPIQTDWL 108 >UniRef50_A4EVA8 Putative uncharacterized protein n=1 Tax=Roseobacter sp. SK209-2-6 RepID=A4EVA8_9RHOB Length = 136 Score = 95.9 bits (238), Expect = 4e-19, Method: Composition-based stats. Identities = 35/120 (29%), Positives = 60/120 (50%), Gaps = 4/120 (3%) Query: 7 RSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVR 66 + G R L + +G + E QA R E G + GEIDLI+R+G T +F EV+ Sbjct: 15 KRGQNRGLRSHLSGLSAEHQAARAYEALGFEVVEERWRGEAGEIDLILRQGATWVFAEVK 74 Query: 67 YRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG-NEVEWIKD 125 +S + AA +++ + ++ Q+A L+L R + + R D V +VE +++ Sbjct: 75 --KSTDFETAATRISQKQVQRIRQSATLYLDRFP-NEQVEEVRLDAVLIDAEGQVEILEN 131 >UniRef50_Q31RH5 UPF0102 protein Synpcc7942_0312 n=2 Tax=Synechococcus elongatus RepID=Y312_SYNE7 Length = 142 Score = 95.9 bits (238), Expect = 4e-19, Method: Composition-based stats. Identities = 29/96 (30%), Positives = 43/96 (44%), Gaps = 2/96 (2%) Query: 21 DAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG-GAAAS 79 A EA W + L +A + R GE+DL+ +E F+EV+ RR + + Sbjct: 10 RAGEALVAAWCRDRRLEVLAERWHCRWGELDLVTQEDSALRFIEVKTRRQTGWDQSGLLA 69 Query: 80 VTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF 115 + +KQ L + A +LA V CRFDV Sbjct: 70 IGPAKQRCLSRAAACYLASLGNQAA-VACRFDVALV 104 >UniRef50_A9I0M2 UPF0102 protein Bpet0439 n=14 Tax=Proteobacteria RepID=Y439_BORPD Length = 162 Score = 95.5 bits (237), Expect = 4e-19, Method: Composition-based stats. Identities = 56/115 (48%), Positives = 72/115 (62%), Gaps = 3/115 (2%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 T++TG A E QA R L G GL +A N++ R GEIDL+MR+G T + VEVR R + YGG Sbjct: 46 TQRTGTAHEDQALRLLAGAGLVPLARNLHCRAGEIDLVMRDGATLVLVEVRARANPRYGG 105 Query: 76 AAASVTRSKQHKLLQTARLW---LARHNGSFDTVDCRFDVVAFTGNEVEWIKDAF 127 AAASV R+K+ +LL+ A L LAR + RFDVVAF +W+ AF Sbjct: 106 AAASVGRAKRARLLRCAALLLPDLARRHWGGRIPPVRFDVVAFEAGRADWLPAAF 160 >UniRef50_Q04SX0 UPF0102 protein LBJ_1427 n=4 Tax=Leptospira RepID=Y1427_LEPBJ Length = 116 Score = 95.5 bits (237), Expect = 4e-19, Method: Composition-based stats. Identities = 25/113 (22%), Positives = 45/113 (39%), Gaps = 2/113 (1%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 K GD E+ A +L G + N EID+I + F EV++ + Sbjct: 5 KKIKGDEGESIASDFLISIGHEILKRNYRFLYCEIDIISIKEEVLYFSEVKFWKEFESFD 64 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-VEWIKDAF 127 + +KQ ++ + A +L+ + S F +V+ + E+ D F Sbjct: 65 PRFTFNFAKQTRMRKAASGFLSEN-LSLQNHFVSFCLVSINEKKGCEYYPDLF 116 >UniRef50_B2IFF3 Putative uncharacterized protein n=1 Tax=Beijerinckia indica subsp. indica ATCC 9039 RepID=B2IFF3_BEII9 Length = 125 Score = 95.5 bits (237), Expect = 4e-19, Method: Composition-based stats. Identities = 32/121 (26%), Positives = 50/121 (41%), Gaps = 4/121 (3%) Query: 7 RSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVR 66 +S + + G E+ A WL + R + +GGEID++ G T F+EV+ Sbjct: 2 KSRKEARRRAHRFGLWAESLAILWLRMRFYRILDRRFFVKGGEIDIVAHRGDTIAFIEVK 61 Query: 67 YRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEV-EWIKD 125 R + A ++ K+ +L AR WLA H + V R D + I Sbjct: 62 ARPT--LDEALLAIDAVKRRRLSLAARYWLAAHPWAASHV-LRGDALCIAPWCWPRHIPA 118 Query: 126 A 126 A Sbjct: 119 A 119 >UniRef50_C9M9E0 Putative uncharacterized protein n=1 Tax=Jonquetella anthropi E3_33 E1 RepID=C9M9E0_9BACT Length = 155 Score = 95.5 bits (237), Expect = 5e-19, Method: Composition-based stats. Identities = 27/117 (23%), Positives = 41/117 (35%), Gaps = 7/117 (5%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + G E A L +G NV E+DLI VEVR R+ + Sbjct: 32 AQDSLAVGRWAEDLAADLLAEEGYSVCGRNVRVGPCELDLIGFIDGCLTAVEVRCRQKSR 91 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG----NEVEWIKD 125 +V K + L++ R + ++ + R D+ A T W KD Sbjct: 92 LQSPEETVGPRKWNALVRGIRGYASQTGWNG---PMRIDLFAVTVCGRRWSARWYKD 145 >UniRef50_A1B931 Putative uncharacterized protein n=1 Tax=Paracoccus denitrificans PD1222 RepID=A1B931_PARDP Length = 137 Score = 95.5 bits (237), Expect = 5e-19, Method: Composition-based stats. Identities = 31/114 (27%), Positives = 44/114 (38%), Gaps = 4/114 (3%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 G E A R +G +A RGGEIDLI+ FVEV+ +S + Sbjct: 25 AYSAGRLAEESAAREYRRRGYEVMAERWRGRGGEIDLILCRDDEYTFVEVK--KSRFHDR 82 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT-GNEVEWIKDAFN 128 AA + + ++ A + R RFD VE I++AF Sbjct: 83 AAERIGARQIARICNAALEYCGRLPAGL-LTAMRFDAALVDQFGRVEIIENAFG 135 >UniRef50_B5Y8F8 Putative uncharacterized protein n=1 Tax=Coprothermobacter proteolyticus DSM 5265 RepID=B5Y8F8_COPPD Length = 115 Score = 95.2 bits (236), Expect = 6e-19, Method: Composition-based stats. Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 9/102 (8%) Query: 24 EAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAASVTRS 83 E + +L + R + NV GEID++ +GRT +FVEVRYR++ AA +V Sbjct: 7 EDRVASFLVSQKYRILDQNVVFPTGEIDIVALKGRTLVFVEVRYRKNF---DAAETVDSR 63 Query: 84 KQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 K +++Q A L+ + R DV A W K Sbjct: 64 KLERIMQCAYLY------TGGEQSYRIDVFACGPQGCHWYKG 99 >UniRef50_C0XSA5 Endonuclease n=1 Tax=Corynebacterium lipophiloflavum DSM 44291 RepID=C0XSA5_9CORY Length = 124 Score = 94.8 bits (235), Expect = 7e-19, Method: Composition-based stats. Identities = 35/122 (28%), Positives = 48/122 (39%), Gaps = 10/122 (8%) Query: 9 GSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREG-RTTIFVEVRY 67 A E A G + V + GEIDLI+RE T +FVEV+ Sbjct: 2 AQSNYAENHALALAGEKLAASTYSEMGYAIVGTRVRTKVGEIDLIVREETGTVVFVEVKT 61 Query: 68 RRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN----EVEWI 123 RR +G AA +VT K + + A WLA + RFDV ++ Sbjct: 62 RRGRGFG-AAETVTAKKLRTMRRCAAEWLAGN----AYAPVRFDVAEVIVTGETMDIRLF 116 Query: 124 KD 125 +D Sbjct: 117 ED 118 >UniRef50_A9IXC8 UPF0102 protein BT_1882 n=8 Tax=Rhizobiales RepID=Y1882_BART1 Length = 130 Score = 94.4 bits (234), Expect = 9e-19, Method: Composition-based stats. Identities = 36/120 (30%), Positives = 53/120 (44%), Gaps = 7/120 (5%) Query: 11 PRQLTTKQ--TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYR 68 P++ K G E A WL KG + GEIDLI R G + VEV+ R Sbjct: 10 PKKQRQKSFYRGVRAEKLAAWWLRFKGFHIAEMRFKTKCGEIDLIARRGNLVLIVEVKAR 69 Query: 69 RSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEV-EWIKDAF 127 + A +V+R + ++ A +WLAR + + RFD++A + I AF Sbjct: 70 ST--LLEAMEAVSRMNEKRIEAAADIWLARQK-DYALLSVRFDLIAILPWRWPKHIP-AF 125 >UniRef50_A4QF37 UPF0102 protein cgR_1859 n=4 Tax=Corynebacterium RepID=Y1859_CORGB Length = 122 Score = 94.4 bits (234), Expect = 1e-18, Method: Composition-based stats. Identities = 31/107 (28%), Positives = 46/107 (42%), Gaps = 6/107 (5%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMR-EGRTTIFVEVRYRRSA 71 + + G E A + + NV GE+DLI+R +FVEV+ RR + Sbjct: 2 KTQKQYLGAFGEDVALQQYLDDQATLLDRNVRYSCGELDLIVRLASGVVVFVEVKTRRGS 61 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN 118 + AA +V K ++ + A LWL RFDVVA + Sbjct: 62 AFDSAA-AVNNQKMLRMRRAAALWLEGKP----YTPIRFDVVAIVLD 103 >UniRef50_B3T5F3 Putative uncharacterized protein family UPF0102 n=1 Tax=uncultured marine microorganism HF4000_ANIW141I9 RepID=B3T5F3_9ZZZZ Length = 172 Score = 94.4 bits (234), Expect = 1e-18, Method: Composition-based stats. Identities = 26/117 (22%), Positives = 40/117 (34%), Gaps = 8/117 (6%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNE-RGGEIDLIMREGRTTIFVEVRYRRSAL 72 ++ G E A KG + A N GEID+I + +F EV+ Sbjct: 55 QKKRKIGQWGERLAALEYYRKGYKVHALNYYCAPFGEIDIIAEKENELVFAEVKTAAGKT 114 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE----VEWIKD 125 GG V K +L ++ + D R DV A + ++ K Sbjct: 115 LGGVEGQVDEVKLQRLSNAIDKYIMDNEIQND---IRLDVFAIILGKNGPALKHFKG 168 >UniRef50_Q2VYL8 UPF0102 protein amb4503 n=5 Tax=Alphaproteobacteria RepID=Y4503_MAGSA Length = 129 Score = 94.0 bits (233), Expect = 1e-18, Method: Composition-based stats. Identities = 32/133 (24%), Positives = 51/133 (38%), Gaps = 11/133 (8%) Query: 1 MATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERG----GEIDLIMRE 56 M + P ++ G E A WL KG +A + GE+DL+ R Sbjct: 1 MNSAPPSRA---HQAAQRRGKVAEGLAALWLRLKGYGILAKGLKSGRGSGAGEVDLVARR 57 Query: 57 GRTTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT 116 G FVEV+ R + A S+T ++ ++ + A + RFD+V Sbjct: 58 GDLVAFVEVKSRAT--LDQAIESLTPFQRQRIERAAAA-FLARRPELASCGVRFDMVLVA 114 Query: 117 GNEV-EWIKDAFN 128 + I DA+ Sbjct: 115 PWRLPRHIPDAWR 127 >UniRef50_B0T377 UPF0102 protein Caul_0175 n=3 Tax=Caulobacteraceae RepID=Y175_CAUSK Length = 139 Score = 93.6 bits (232), Expect = 2e-18, Method: Composition-based stats. Identities = 29/129 (22%), Positives = 48/129 (37%), Gaps = 6/129 (4%) Query: 2 ATVPTRSGS--PRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRT 59 P R R + +G E A WL KG R + + GEIDL+ + Sbjct: 6 PLRPERQAQKQARGAAARLSGRRAEVLAALWLMAKGYRILGFRLATPLGEIDLLAQRRGV 65 Query: 60 TIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN- 118 VEV+ R A +VT ++ +L + +A + R D++A Sbjct: 66 LAVVEVKSR--TSLEAALEAVTYEQRSRLRRAGAH-IAANRAGLRDAVVRLDLIALAPGR 122 Query: 119 EVEWIKDAF 127 + +A+ Sbjct: 123 RPRHLLNAW 131 >UniRef50_A8U078 Predicted endonuclease n=1 Tax=alpha proteobacterium BAL199 RepID=A8U078_9PROT Length = 151 Score = 92.8 bits (230), Expect = 3e-18, Method: Composition-based stats. Identities = 33/120 (27%), Positives = 57/120 (47%), Gaps = 4/120 (3%) Query: 11 PRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRS 70 R+ + ++ G E A WL +G R +A V GE+DL++R G + VEV+ R + Sbjct: 33 ERRRSAERRGLRAEWLAALWLMLRGYRVLARRVRTPAGEVDLVVRRGSVVVAVEVKARAT 92 Query: 71 ALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEV-EWIKDAFND 129 A SV+ ++H++ +LAR ++ RFD+VA + + D + Sbjct: 93 --LDAALDSVSSRQRHRVALGLESFLARRP-ELAGLNRRFDLVAVQPWRLPVHLADVWRP 149 >UniRef50_Q16B02 UPF0102 protein RD1_1191 n=1 Tax=Roseobacter denitrificans OCh 114 RepID=Y1191_ROSDO Length = 129 Score = 92.5 bits (229), Expect = 3e-18, Method: Composition-based stats. Identities = 32/131 (24%), Positives = 59/131 (45%), Gaps = 5/131 (3%) Query: 1 MATVPTRSGSPRQLT-TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRT 59 M +P + G + EA R E G F A + GEIDL++R+ Sbjct: 1 MTQMPQTNARVHAGRMAYHAGLSAEASVIREYESHGYVFEAQRWRGQVGEIDLVLRKSGL 60 Query: 60 TIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT-GN 118 +FVEV+ +S + AA ++ +++ ++ T ++A+ D RFDV Sbjct: 61 VVFVEVK--KSKSFERAALRISPTQKRRIFATGEEFVAQEPQGL-LTDMRFDVALVDAAG 117 Query: 119 EVEWIKDAFND 129 V+ +++A ++ Sbjct: 118 AVQILENALSE 128 >UniRef50_UPI00016C4BC8 hypothetical protein GobsU_17186 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4BC8 Length = 155 Score = 92.5 bits (229), Expect = 4e-18, Method: Composition-based stats. Identities = 41/122 (33%), Positives = 56/122 (45%), Gaps = 10/122 (8%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 + G E A +L G R +AANVN+R GE+DL+ +G T + VEVR SA Sbjct: 26 KRWFGRRSERAAANYLRGLRYRLLAANVNDRDGELDLLAIDGETLVIVEVRSTSSARPDA 85 Query: 76 A---AASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE------VEWIKDA 126 AASV KQ K+ + +L R + R+DV+ E V I+ A Sbjct: 86 IEQTAASVDLRKQRKITEATSRFLGRRRL-LGRIAVRYDVLVIAWPEHAREPAVRHIRHA 144 Query: 127 FN 128 F Sbjct: 145 FE 146 >UniRef50_Q0AK98 UPF0102 protein Mmar10_3014 n=1 Tax=Maricaulis maris MCS10 RepID=Y3014_MARMM Length = 127 Score = 91.7 bits (227), Expect = 6e-18, Method: Composition-based stats. Identities = 29/116 (25%), Positives = 50/116 (43%), Gaps = 4/116 (3%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + + G E A WL KG R + GEIDL+ R G +F+EV+ R + Sbjct: 5 RRQAEARGRWAEWLAMAWLVAKGYRLLDHRARTAAGEIDLVARRGEYLVFIEVKARATR- 63 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEV-EWIKDAF 127 A S+ ++ ++ + A +W A S + R+D+V + + A+ Sbjct: 64 -AEALDSIGPRQRGRITRAASIWRAP-RSSLHHLHLRYDLVLVVPGRWPQHRRAAW 117 >UniRef50_B2S8H0 UPF0102 protein BAbS19_I01690 n=50 Tax=Rhizobiales RepID=Y1690_BRUA1 Length = 126 Score = 91.7 bits (227), Expect = 6e-18, Method: Composition-based stats. Identities = 37/109 (33%), Positives = 47/109 (43%), Gaps = 3/109 (2%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAA 78 G + E A L KG R +A R GEIDLI R G + VEV+ R A + A Sbjct: 14 RGHSAERLAAFALMLKGFRIVARRYRTRLGEIDLIARRGDLVLIVEVKAR--ASFEAAQF 71 Query: 79 SVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAF 127 +VT ++ A LWL R + RFD+VA AF Sbjct: 72 AVTPQAMRRIEAAADLWLQRQTDR-ARLSLRFDMVAVLPRRWPKHVPAF 119 >UniRef50_A8ERF6 UPF0102 protein Abu_0255 n=2 Tax=Campylobacterales RepID=Y255_ARCB4 Length = 110 Score = 90.9 bits (225), Expect = 1e-17, Method: Composition-based stats. Identities = 25/111 (22%), Positives = 55/111 (49%), Gaps = 6/111 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAAN-VNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 +K+ GD E +A +LE + N ++ GEID+I + + F+EV+ + Y Sbjct: 2 SKEKGDIAEKKAISFLEKSNFEIVEKNFYAKKLGEIDIIAQRNKIYHFIEVK--SANDYE 59 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 A ++T K K+ ++ ++ ++N + DV+ +++E +++ Sbjct: 60 TAINNITSQKLSKIKRSVDFYIQKNNLNISYS---IDVIIVVDDKIELLEN 107 >UniRef50_A3TTV9 Putative uncharacterized protein n=1 Tax=Oceanicola batsensis HTCC2597 RepID=A3TTV9_9RHOB Length = 140 Score = 90.2 bits (223), Expect = 2e-17, Method: Composition-based stats. Identities = 37/129 (28%), Positives = 62/129 (48%), Gaps = 5/129 (3%) Query: 1 MATVPTRSGSPRQLT-TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRT 59 +A +P + R+ +G A E+ R E +G R + GGEIDLI+ E Sbjct: 11 VAAIPVPAARRRRGEIAHLSGLAAESAVERTYEARGARVLHRRWRGPGGEIDLILAEPDR 70 Query: 60 TIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE 119 +FVEV+ ++A +G AA V ++ ++ ++A ++ D R DV G Sbjct: 71 VVFVEVK--KAATHGAAAERVRPAQVQRIARSAMAFVDTLP-GGALTDIRLDVALVDGGG 127 Query: 120 -VEWIKDAF 127 VE +++AF Sbjct: 128 AVELLENAF 136 >UniRef50_Q0FQ74 Putative uncharacterized protein n=1 Tax=Roseovarius sp. HTCC2601 RepID=Q0FQ74_9RHOB Length = 153 Score = 90.2 bits (223), Expect = 2e-17, Method: Composition-based stats. Identities = 31/124 (25%), Positives = 52/124 (41%), Gaps = 4/124 (3%) Query: 7 RSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVR 66 R R + G + EAQ + +G R R GEIDLI+ +G IFVEV+ Sbjct: 33 RRRVERGALGHRAGLSAEAQVAQDYRRRGYRVAGQRWRGRSGEIDLILHDGDGLIFVEVK 92 Query: 67 YRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-VEWIKD 125 +S + A ++ + +LL+ + A ++ RFDV + +++ Sbjct: 93 --KSRSFDHAMQHLSSRQIARLLRAGEEF-AGTQPRGSLIEMRFDVALMNEQGMIRIVEN 149 Query: 126 AFND 129 A Sbjct: 150 ALGP 153 >UniRef50_A7ZB75 UPF0102 protein Ccon26_01140 n=23 Tax=Epsilonproteobacteria RepID=Y114_CAMC1 Length = 113 Score = 89.8 bits (222), Expect = 2e-17, Method: Composition-based stats. Identities = 24/114 (21%), Positives = 48/114 (42%), Gaps = 6/114 (5%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMR-EGRTTIFVEVRYRRSA 71 L G + E +A +L G + N + + GEID+I + F+EV+ Sbjct: 2 GLKEYLFGKSSEDRACEFLRKLGFVILERNFHSKFGEIDIIALSSDKILHFIEVKATSGG 61 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 A + ++K K+L+T ++ ++ + D D++ E I++ Sbjct: 62 Y--EAEYRLNKAKYMKILKTINFYMMKNEPNRDYQ---LDLLVVKNENFELIEN 110 >UniRef50_C6QFU1 Putative uncharacterized protein n=1 Tax=Hyphomicrobium denitrificans ATCC 51888 RepID=C6QFU1_9RHIZ Length = 129 Score = 89.4 bits (221), Expect = 3e-17, Method: Composition-based stats. Identities = 32/115 (27%), Positives = 50/115 (43%), Gaps = 3/115 (2%) Query: 6 TRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEV 65 R S + ++G E G R + GEIDLI +GR FVEV Sbjct: 10 ARPLSDIRRRRYRSGLNAEMVVAAVYMALGHRILGRRFKTPVGEIDLIAIKGRRVAFVEV 69 Query: 66 RYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEV 120 + R S+ A ++T + + ++ + A LWLAR+ + + D FD+V Sbjct: 70 KRRASSE--EAEDAITLTMRRRVRRAADLWLARNP-QYQSHDVGFDLVFVLPWRF 121 >UniRef50_C1CZ90 UPF0102 protein Deide_03080 n=3 Tax=Deinococcus RepID=Y3080_DEIDV Length = 114 Score = 89.0 bits (220), Expect = 4e-17, Method: Composition-based stats. Identities = 37/90 (41%), Positives = 50/90 (55%), Gaps = 2/90 (2%) Query: 30 WLEGKGLRFIAANVNERGGEIDLIMREG-RTTIFVEVRYRRSALYGGAAASVTRSKQHKL 88 L+G G + N RGGEIDL+ RE T +F EVR RR+ +G AA SVT K + Sbjct: 13 HLQGLGRELLQRNYRMRGGEIDLVTREPCGTLVFTEVRQRRTRRHGSAAESVTSRKLALM 72 Query: 89 LQTARLWLARHNGSFDTVDCRFDVVAFTGN 118 + A+ +L R +G D + CR +VV G Sbjct: 73 HRAAQSYLIREHGR-DDLPCRLEVVTIDGP 101 >UniRef50_B4CV56 Putative uncharacterized protein n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4CV56_9BACT Length = 89 Score = 88.6 bits (219), Expect = 5e-17, Method: Composition-based stats. Identities = 32/85 (37%), Positives = 44/85 (51%), Gaps = 5/85 (5%) Query: 50 IDLIMREGRTTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCR 109 +D++ R+ T +FVEV+ RRS +G A SVTR KQ + + A WL + D + R Sbjct: 1 MDIVCRDHDTLVFVEVKTRRSLTFGSPAESVTREKQKLIARGALAWLDLLG-NPDNILFR 59 Query: 110 FDVVAFTGNE----VEWIKDAFNDH 130 FD+V E IKDAF Sbjct: 60 FDIVEIIFEEDVPTFHIIKDAFKLP 84 >UniRef50_B1ZZB1 Putative uncharacterized protein n=2 Tax=Opitutaceae RepID=B1ZZB1_OPITP Length = 141 Score = 88.6 bits (219), Expect = 6e-17, Method: Composition-based stats. Identities = 30/118 (25%), Positives = 48/118 (40%), Gaps = 12/118 (10%) Query: 18 QTGDAWEAQARRWLEG-KGLRFIAANVNERGG---EIDLIMREGRTTIFVEVRYRRSALY 73 G A E A WL+ +G R +A N E+DL+ R+ +FVEV+ R + Sbjct: 16 DAGAAGERLAAAWLQRERGFRVVARNWRNPRDRREELDLVCRDREVLVFVEVKSRAANAL 75 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN------EVEWIKD 125 +V + K+ L + + +LAR + R DVV V ++ Sbjct: 76 VPGYYAVDKRKKRVLGRAIKAYLAR--LTAKPATFRLDVVEIAEGGGDAEPTVRHFEN 131 >UniRef50_A7BDE4 Putative uncharacterized protein n=1 Tax=Actinomyces odontolyticus ATCC 17982 RepID=A7BDE4_9ACTO Length = 146 Score = 88.6 bits (219), Expect = 6e-17, Method: Composition-based stats. Identities = 38/112 (33%), Positives = 48/112 (42%), Gaps = 8/112 (7%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVN-ERGGEIDLIMREG-----RTTIFVEVR 66 + + G A E AR LE +GLR + N R GE+D+I R+ T+ VEVR Sbjct: 6 RPDRRAIGAAGEYTARLALEEEGLRLLDTNWRDGRRGELDIIARDETDPSRSWTVIVEVR 65 Query: 67 YRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN 118 R G A ASV K +L W H R DVVA T + Sbjct: 66 TRVGRRKGSALASVDHRKVARLRALTGAWCRAHGHLASR--VRIDVVAITVD 115 >UniRef50_Q7V7V8 UPF0102 protein PMT_0624 n=2 Tax=Prochlorococcus marinus RepID=Y624_PROMM Length = 126 Score = 88.2 bits (218), Expect = 7e-17, Method: Composition-based stats. Identities = 23/93 (24%), Positives = 45/93 (48%), Gaps = 1/93 (1%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + + G E + R L+ +G + ++ + R GE+DL++ + + + VEV+ RRS Sbjct: 2 MDSTGMGCWGEERVLRLLQKRGWQLVSQRWSCRYGELDLVVEKQQRVLVVEVKSRRSRGL 61 Query: 74 GG-AAASVTRSKQHKLLQTARLWLARHNGSFDT 105 + + KQ +L++ WLA H + Sbjct: 62 DHWGLCAFNKGKQLRLMRAIGCWLATHPYFAEH 94 >UniRef50_B3CRA6 Putative uncharacterized protein n=2 Tax=Orientia tsutsugamushi RepID=B3CRA6_ORITI Length = 112 Score = 87.5 bits (216), Expect = 1e-16, Method: Composition-based stats. Identities = 28/115 (24%), Positives = 51/115 (44%), Gaps = 5/115 (4%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 +++ G E + FIA + GEID+I +G+ +F+EV+ RRS Sbjct: 2 ISSYNLGVLAEWLIIARYSVRLYSFIAHRMRNSAGEIDIICTKGQVIVFIEVKARRSNFD 61 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVE-WIKDAF 127 + K+ ++A L+L +N + D RFD+ + I++A+ Sbjct: 62 NTIC---NYQQITKIRKSAELYLY-YNRQYSNFDVRFDLAIVRPMQWPLIIENAW 112 >UniRef50_C0W187 Putative uncharacterized protein n=1 Tax=Actinomyces coleocanis DSM 15436 RepID=C0W187_9ACTO Length = 117 Score = 87.5 bits (216), Expect = 1e-16, Method: Composition-based stats. Identities = 31/108 (28%), Positives = 47/108 (43%), Gaps = 6/108 (5%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 ++ G E A +LE G + + NV +G EID+I E +FVEVR R + +G Sbjct: 4 NKQRVGKLGEDLAAEYLESLGWKILERNVTYKGAEIDIIALEDDVVVFVEVRTRTTDDWG 63 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEW 122 A S+T K L WL + + R D+V ++ Sbjct: 64 SALESLTPKKLASLRSGVVRWLLNQD---EYCKARIDMVTV---KLNH 105 >UniRef50_A5V3S4 UPF0102 protein Swit_0572 n=4 Tax=Sphingomonadaceae RepID=Y572_SPHWW Length = 118 Score = 87.5 bits (216), Expect = 1e-16, Method: Composition-based stats. Identities = 33/119 (27%), Positives = 52/119 (43%), Gaps = 5/119 (4%) Query: 11 PRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRS 70 R+ ++ G E A WL G R + + V R GE+DLI R GRT FVEV+ R Sbjct: 2 NRRAAAERQGRTGERIAAWWLRLHGWRIVGSRVKTRRGEVDLIARRGRTLAFVEVKTRGD 61 Query: 71 ALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-EVEWIKDAFN 128 A G A ++ + ++ A L R+ + R DV+ + + ++ Sbjct: 62 A--AGLATAIDEYRLRRVAAAAEALLPRYGVGVEN--VRIDVMLVRPWRRPVHLTNVWH 116 >UniRef50_C2M936 Putative uncharacterized protein n=1 Tax=Porphyromonas uenonis 60-3 RepID=C2M936_9PORP Length = 136 Score = 87.1 bits (215), Expect = 2e-16, Method: Composition-based stats. Identities = 25/99 (25%), Positives = 46/99 (46%), Gaps = 3/99 (3%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 + G A E A+R+L + +R + N + EID+I +GR + VEV+ R Sbjct: 4 ANELGAAGERAAQRYLLSRHIRLLEINWRDPLCEIDIIASDGRHLLIVEVKSRMEYTATS 63 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVA 114 +V ++K H+++ + R + R+DV+ Sbjct: 64 PLDAVDQAKAHQMMLGGMRYAQRMRINL---PIRYDVIE 99 >UniRef50_A4WPR4 UPF0102 protein Rsph17025_0472 n=7 Tax=Rhodobacterales RepID=Y472_RHOS5 Length = 117 Score = 86.3 bits (213), Expect = 3e-16, Method: Composition-based stats. Identities = 34/113 (30%), Positives = 50/113 (44%), Gaps = 4/113 (3%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 + + G E R E A GEIDLI R+G IF+EV+ +S + Sbjct: 6 SHRAGFVAEEAVARIYERADRPVTARRWRGAAGEIDLIARDGAEVIFIEVK--KSKSHAA 63 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG-NEVEWIKDAF 127 AAA ++R + ++ A +LA RFDV G +E I++AF Sbjct: 64 AAARLSRRQMERIYGAASEFLAGEPLG-QLTASRFDVALVDGMGRIEIIENAF 115 >UniRef50_Q2GDU7 Putative uncharacterized protein n=1 Tax=Neorickettsia sennetsu str. Miyayama RepID=Q2GDU7_NEOSM Length = 116 Score = 85.1 bits (210), Expect = 6e-16, Method: Composition-based stats. Identities = 21/105 (20%), Positives = 37/105 (35%), Gaps = 3/105 (2%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAA 78 G E L KG + E+DL+ + +FVEV++R S Sbjct: 13 VGRLAEMIVALHLSIKGYMLLCRRYRNPHCELDLVCIKHGVLLFVEVKFRSSLQ--VLET 70 Query: 79 SVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWI 123 V S+ K+ + + + + F V T + ++ I Sbjct: 71 MVDYSRMEKMYPASESFCSEFQLYYCLERV-FKVFLVTPSVIQVI 114 >UniRef50_Q28TZ5 Putative uncharacterized protein n=1 Tax=Jannaschia sp. CCS1 RepID=Q28TZ5_JANSC Length = 100 Score = 85.1 bits (210), Expect = 6e-16, Method: Composition-based stats. Identities = 29/100 (29%), Positives = 48/100 (48%), Gaps = 5/100 (5%) Query: 28 RRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAASVTRSKQHK 87 R +L G R ++ GEIDL+M + IFVEV+ R+ + AA +++ + + Sbjct: 2 RAYL-DHGHRLVSRRWRGPAGEIDLVMEKDGEVIFVEVKASRT--HARAAEALSNRQIAR 58 Query: 88 LLQTARLWLARHNGSFDTVDCRFDVVAFTG-NEVEWIKDA 126 LL++A L RFDV G +++ I +A Sbjct: 59 LLRSAEHCLGSFPKGLA-TPMRFDVALVDGQGQLDVIVNA 97 >UniRef50_Q7NEX4 UPF0102 protein gll3754 n=1 Tax=Gloeobacter violaceus RepID=Y3754_GLOVI Length = 126 Score = 84.4 bits (208), Expect = 9e-16, Method: Composition-based stats. Identities = 38/121 (31%), Positives = 50/121 (41%), Gaps = 8/121 (6%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 + E L +G +A RGGEIDL++R G FVEV+ R + Sbjct: 3 RRHRFALQAEIWVADHLAAQGGLVLARRWRCRGGEIDLVVRLGGVLCFVEVKARGGNSWD 62 Query: 75 GAA-ASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE------VEWIKDAF 127 A +V KQ +LL A L+LA H CRFDV + V +I AF Sbjct: 63 SAGWEAVGAVKQRRLLLAAALFLAAHP-ELARSVCRFDVALVGRDPGGGVRLVAYIAGAF 121 Query: 128 N 128 Sbjct: 122 E 122 >UniRef50_B8GXN3 UPF0102 protein CCNA_00142 n=4 Tax=Caulobacteraceae RepID=Y142_CAUCN Length = 125 Score = 84.4 bits (208), Expect = 1e-15, Method: Composition-based stats. Identities = 31/126 (24%), Positives = 51/126 (40%), Gaps = 4/126 (3%) Query: 4 VPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFV 63 + R ++ G E A WL KG R + + GEIDL+ + G+ V Sbjct: 1 MAAGVRQSRGTAARKVGRRAEVIAALWLMAKGYRILGFRLATPLGEIDLLAQRGKVLAVV 60 Query: 64 EVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-VEW 122 EV+ R A +V +++ +L + A LA H + R D++A Sbjct: 61 EVKQR--TTIEDALDAVKPTQRERLRRAATH-LAAHRAGLRDLLVRLDLIAMAPGRPPRH 117 Query: 123 IKDAFN 128 + DA+ Sbjct: 118 LPDAWG 123 >UniRef50_Q3J5H3 UPF0102 protein RHOS4_03930 n=7 Tax=Rhodobacteraceae RepID=Y393_RHOS4 Length = 117 Score = 84.4 bits (208), Expect = 1e-15, Method: Composition-based stats. Identities = 35/109 (32%), Positives = 48/109 (44%), Gaps = 4/109 (3%) Query: 20 GDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAAS 79 G E R + G A GEIDLI REG IF+EV+ +S + AAA Sbjct: 10 GQTAEEAVARIYDRSGRPVAARRWRGVSGEIDLIAREGAEVIFIEVK--KSTSHAAAAAR 67 Query: 80 VTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG-NEVEWIKDAF 127 ++R + ++ A +LA RFDV VE I++AF Sbjct: 68 LSRRQMDRIYGAASEFLAGEP-RGQLTASRFDVALVDALGRVEIIENAF 115 >UniRef50_C6V4V5 Putative uncharacterized protein n=1 Tax=Neorickettsia risticii str. Illinois RepID=C6V4V5_NEORI Length = 117 Score = 84.0 bits (207), Expect = 1e-15, Method: Composition-based stats. Identities = 19/105 (18%), Positives = 37/105 (35%), Gaps = 3/105 (2%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAA 78 G E L KG + E+DL+ + +F EV++R S Sbjct: 14 VGRLAEMIVALHLSIKGYMLLCRRYRNPHCELDLVCIKSGVLLFAEVKFRSSLQ--AVET 71 Query: 79 SVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWI 123 +V S+ ++ + + + + F V T + ++ I Sbjct: 72 TVDYSRMERMYPASESFCSEFQLYYYLERI-FKVFLITPSVIQVI 115 >UniRef50_C8WWC7 Putative uncharacterized protein n=1 Tax=Alicyclobacillus acidocaldarius subsp. acidocaldarius DSM 446 RepID=C8WWC7_ALIAD Length = 120 Score = 84.0 bits (207), Expect = 1e-15, Method: Composition-based stats. Identities = 22/78 (28%), Positives = 38/78 (48%), Gaps = 1/78 (1%) Query: 13 QLTTKQTGDAWEAQARRWLEGK-GLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSA 71 ++ ++ G E+ ++LE G R I N R GE+DLI + VEV+ R S Sbjct: 2 RVNRRELGTLGESFVGQYLERCLGWRVIEKNWRTRFGELDLIAENEDELVAVEVKTRTSP 61 Query: 72 LYGGAAASVTRSKQHKLL 89 + G ++ ++ KL+ Sbjct: 62 IDGDPIYALRPAQIPKLV 79 >UniRef50_B9KH25 Putative uncharacterized protein n=3 Tax=Anaplasma marginale RepID=B9KH25_ANAMF Length = 126 Score = 83.6 bits (206), Expect = 2e-15, Method: Composition-based stats. Identities = 31/130 (23%), Positives = 58/130 (44%), Gaps = 10/130 (7%) Query: 1 MATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTT 60 M T +R+ R L G A E + + + + GEIDLI++ GR Sbjct: 1 MCTSKSRASKVRSL----VGYAGELVVLLLRKARLHKVLHHRYRSPLGEIDLIVQNGREL 56 Query: 61 IFVEVRYRRSALY-GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE 119 F+EV+ ++ + VT+ ++ +++TA+ +L+R+ F F+V F+ Sbjct: 57 HFIEVKTSMTSRFHEVP---VTKKQRRSVVRTAQYFLSRNP-QFSEHQISFEVYCFSPKS 112 Query: 120 -VEWIKDAFN 128 V +A+ Sbjct: 113 GVTRFVNAWQ 122 >UniRef50_C4YXH4 Protein Mlr4633 n=1 Tax=Rickettsia endosymbiont of Ixodes scapularis RepID=C4YXH4_9RICK Length = 111 Score = 83.6 bits (206), Expect = 2e-15, Method: Composition-based stats. Identities = 21/93 (22%), Positives = 46/93 (49%), Gaps = 5/93 (5%) Query: 36 LRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLW 95 + + GEID+I + +F+EV+ R S + V+ ++Q ++ + A ++ Sbjct: 23 YQILHHRKRYYVGEIDIIALCNKEIVFIEVKARSSKIDDR---FVSFNQQRRITRAAEMF 79 Query: 96 LARHNGSFDTVDCRFDVVAFTGNEVE-WIKDAF 127 L+ + + + RFD+V ++ IK+A+ Sbjct: 80 LSSN-SKYRNYNIRFDLVIIRSYKLPIIIKNAW 111 >UniRef50_A5GTL0 Restriction endonuclease-like n=1 Tax=Synechococcus sp. RCC307 RepID=A5GTL0_SYNR3 Length = 121 Score = 82.8 bits (204), Expect = 3e-15, Method: Composition-based stats. Identities = 28/107 (26%), Positives = 44/107 (41%), Gaps = 3/107 (2%) Query: 20 GDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG-AAA 78 G EAQ L + R + N R GE+DL++ + + + VEV+ RR Sbjct: 11 GAEAEAQVAVLLCRRHWRLLDCNWCCRWGELDLVLAKPQRLLLVEVKARRRWGLDHGGLL 70 Query: 79 SVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG-NEVEWIK 124 + K+ +L + R WLA H S+ + G +V W Sbjct: 71 ACGPRKRCRLARALRCWLAAHP-SYAFHSIEAHLALVDGEGQVRWFP 116 >UniRef50_B0S8P6 Endonuclease n=2 Tax=Leptospira biflexa serovar Patoc RepID=B0S8P6_LEPBA Length = 114 Score = 82.1 bits (202), Expect = 5e-15, Method: Composition-based stats. Identities = 19/106 (17%), Positives = 43/106 (40%), Gaps = 1/106 (0%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + G E A +L+ + +N ++ GEID+I + T EV+ Sbjct: 1 MKKGTIGKKGEEFASFYLQSLEHTILFSNYRKKIGEIDIISIKNDTLHCSEVKTWNERFG 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE 119 + +K+ ++ + L+L + +F + F+++ T + Sbjct: 61 FHPKECLHATKRARMRKV-YLYLLQEIPAFYHLTPSFNLIHITEKK 105 >UniRef50_A5G0S4 UPF0102 protein Acry_2261 n=1 Tax=Acidiphilium cryptum JF-5 RepID=Y2261_ACICJ Length = 128 Score = 82.1 bits (202), Expect = 5e-15, Method: Composition-based stats. Identities = 31/112 (27%), Positives = 53/112 (47%), Gaps = 3/112 (2%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAA 78 G E + W +G +A + GE+DL++ + T +FVEV+ R + A Sbjct: 18 RGRDAERRVAGWYAAQGFVVLAQRLRTAAGELDLVVADRTTLVFVEVKARNALR--SAIE 75 Query: 79 SVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAFNDH 130 SV ++ +L+ A + + + + RFDVV G++V I+DAF Sbjct: 76 SVAPRQRRRLVAAAAI-VLAGQPDWGRAETRFDVVLLVGDDVHAIRDAFRAD 126 >UniRef50_A2VU88 Putative uncharacterized protein n=1 Tax=Burkholderia cenocepacia PC184 RepID=A2VU88_9BURK Length = 132 Score = 81.7 bits (201), Expect = 8e-15, Method: Composition-based stats. Identities = 48/127 (37%), Positives = 62/127 (48%), Gaps = 30/127 (23%) Query: 3 TVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE-GRTTI 61 P+ +K G A+E +AR++LE GL F+AANV RGGE+DL+MRE + Sbjct: 31 RPPSGDNFSGAARSKPVGAAFEQRARQFLERHGLGFVAANVTMRGGELDLVMREPDGMLV 90 Query: 62 FVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVE 121 FVEVR RRS +G AA CRFDVVAF + Sbjct: 91 FVEVRARRSTRHGAGAA-----------------------------CRFDVVAFEAGRLA 121 Query: 122 WIKDAFN 128 W++DAF Sbjct: 122 WLRDAFR 128 >UniRef50_Q1GJI4 UPF0102 protein TM1040_0449 n=9 Tax=Rhodobacteraceae RepID=Y449_SILST Length = 124 Score = 81.3 bits (200), Expect = 1e-14, Method: Composition-based stats. Identities = 32/101 (31%), Positives = 49/101 (48%), Gaps = 5/101 (4%) Query: 30 WLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAASVTRSKQHKLL 89 +L +GL + + GEIDLI+R+G T IF EV+ S AAA + ++ ++ Sbjct: 27 YL-ARGLTLVKSRWRGPHGEIDLILRDGETVIFAEVK--SSTTRDKAAARIKPAQMQRVF 83 Query: 90 QTARLWLARHNGSFDTVDCRFDVVAFTG-NEVEWIKDAFND 129 +A +L R DVV G EVE I++A+ Sbjct: 84 NSAGAFLEGEPLG-QLTPARLDVVLVWGAGEVEIIENAYGH 123 >UniRef50_C7N7P3 Predicted endonuclease related to Holliday junction resolvase n=2 Tax=Coriobacteriaceae RepID=C7N7P3_SLAHD Length = 117 Score = 80.9 bits (199), Expect = 1e-14, Method: Composition-based stats. Identities = 19/115 (16%), Positives = 37/115 (32%), Gaps = 11/115 (9%) Query: 21 DAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE-GRTTIFVEVRYRRSALYGGAAAS 79 + RR+LE KG + +D I + +F++ +A G + Sbjct: 6 QRAKQGVRRYLELKGYEILEDGWCHGRDSVDFIATDEDDALVFIDCEVSENAGEGIPEEA 65 Query: 80 VTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-----VEWIKDAFND 129 R ++ A +LA + R+D+V + +A Sbjct: 66 PDRKAFERI---AAAYLAE--ADLSNTEVRYDIVGVLILGESRALIRHHINAITP 115 >UniRef50_A4E8G7 Putative uncharacterized protein n=1 Tax=Collinsella aerofaciens ATCC 25986 RepID=A4E8G7_9ACTN Length = 220 Score = 80.5 bits (198), Expect = 1e-14, Method: Composition-based stats. Identities = 30/136 (22%), Positives = 47/136 (34%), Gaps = 14/136 (10%) Query: 3 TVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERG--GEIDLIMRE-GRT 59 T R+ P++ + R +LE KG + G IDL+ + T Sbjct: 85 TPQGRASEPKEQDMNDMKEKAMGAVRAFLERKGYEIVDEAWQGPEGIGGIDLVAVDEDGT 144 Query: 60 TIFVEVRYRRSALYGGAAASVTRSKQHKLLQT-ARLWLARHNGSFDTVDCRFDVVA---F 115 +FV+ R G A + L + A WLA + + RFD VA Sbjct: 145 LVFVDATVRIGTD-GFPEA----HRARGLREALAARWLAGNGDDYADTPVRFDEVAMMVV 199 Query: 116 TGNE--VEWIKDAFND 129 N + + F + Sbjct: 200 KENRALLRHHINCFGE 215 >UniRef50_A8LJ68 UPF0102 protein Dshi_2830 n=1 Tax=Dinoroseobacter shibae DFL 12 RepID=Y2830_DINSH Length = 134 Score = 80.5 bits (198), Expect = 1e-14, Method: Composition-based stats. Identities = 34/121 (28%), Positives = 55/121 (45%), Gaps = 4/121 (3%) Query: 6 TRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEV 65 R+ R +G A EA+ R +G +A GGE+DLI+R G +FVEV Sbjct: 14 ARARQARGTRAMLSGAAAEARVERAYRDRGCDVLATRWRGSGGEVDLIVRRGDLLVFVEV 73 Query: 66 RYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG-NEVEWIK 124 + SA Y A S++ ++ ++ TA +L R ++ RFD+ G + Sbjct: 74 K--SSASYTRAIESLSLAQLTRIQNTALEFLDRSP-DLAGLEMRFDLAVVEGSGRFRVLA 130 Query: 125 D 125 + Sbjct: 131 N 131 >UniRef50_A5KGE0 Putative uncharacterized protein n=1 Tax=Campylobacter jejuni subsp. jejuni CG8486 RepID=A5KGE0_CAMJE Length = 84 Score = 80.1 bits (197), Expect = 2e-14, Method: Composition-based stats. Identities = 17/68 (25%), Positives = 34/68 (50%), Gaps = 2/68 (2%) Query: 20 GDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAAS 79 G E +A ++L+ +G + N + + GEID+I ++ F+EV++ ++ Sbjct: 9 GILGEDKACKFLKKQGFEILKRNFHSKFGEIDIIAKKDEILHFIEVKFTQNDYEVS--ER 66 Query: 80 VTRSKQHK 87 + R K K Sbjct: 67 LDRKKLRK 74 >UniRef50_A5GLH9 Restriction endonuclease-like n=5 Tax=Chroococcales RepID=A5GLH9_SYNPW Length = 142 Score = 79.4 bits (195), Expect = 3e-14, Method: Composition-based stats. Identities = 33/109 (30%), Positives = 51/109 (46%), Gaps = 8/109 (7%) Query: 11 PRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRT----TIFVEVR 66 +LTT TG EA+A R L+G+G + + R GEIDL++ + + VEV+ Sbjct: 12 NAKLTTATTGLWAEAKALRLLQGRGWTLLEKRWSCRYGEIDLLLCKANAPVPRLLAVEVK 71 Query: 67 -YRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVA 114 RR G A+ K+ +L T W+A + C+ +VV Sbjct: 72 GRRRCGPDGWGLAAFDARKRQRLALTLNYWIALNP---RHACCQLEVVL 117 >UniRef50_B1M445 UPF0102 protein Mrad2831_2938 n=10 Tax=Alphaproteobacteria RepID=Y2938_METRJ Length = 129 Score = 78.2 bits (192), Expect = 8e-14, Method: Composition-based stats. Identities = 31/107 (28%), Positives = 47/107 (43%), Gaps = 3/107 (2%) Query: 9 GSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYR 68 G+ R+ + G E A L KG I V+ GGEIDL++R T +FVEV+ R Sbjct: 8 GADRRRAAYRFGHRAEWLALAALMLKGYWPIGRRVSVAGGEIDLVVRRWNTVVFVEVKAR 67 Query: 69 RSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF 115 A ++ +K+ + + R W+ R+ R D V Sbjct: 68 AKRD--DAREAIDGAKRRRFSRAVRAWIGRNAW-CAGATFRADAVFV 111 >UniRef50_B3DVH2 Predicted endonuclease n=2 Tax=Verrucomicrobia RepID=B3DVH2_METI4 Length = 78 Score = 77.4 bits (190), Expect = 1e-13, Method: Composition-based stats. Identities = 22/75 (29%), Positives = 31/75 (41%), Gaps = 7/75 (9%) Query: 59 TTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF--- 115 +FVEV+ R S YG +V K+ L+ A +L V RFDVV Sbjct: 1 MLVFVEVKTRSSIQYGFPYEAVDAQKKRNLIAAAHAYLKLLKNPV--VAYRFDVVEVLFF 58 Query: 116 --TGNEVEWIKDAFN 128 T ++ +AF Sbjct: 59 KGTRPKITHYPNAFG 73 >UniRef50_A9GEX7 UPF0102 protein sce2912 n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=Y2912_SORC5 Length = 144 Score = 76.7 bits (188), Expect = 2e-13, Method: Composition-based stats. Identities = 33/126 (26%), Positives = 51/126 (40%), Gaps = 6/126 (4%) Query: 8 SGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRY 67 +G+P + G E L +G+ +A N EID++ R+G +EVR Sbjct: 19 AGAPAADARRALGARAEDAVVAHLAAQGVEIVARNARVGRLEIDVVARDGPVIAIIEVRT 78 Query: 68 RRSALYGGAAASVTRSKQHKLLQTARL-WLARHNGSFDTVDCRFDVVAFTG-----NEVE 121 R + Y A S+ K+ ++ + W A + RFD + T VE Sbjct: 79 RGAGSYVRALDSIDARKRARVRRAGERLWRATFSRVRGVERMRFDAASVTFLPSGEATVE 138 Query: 122 WIKDAF 127 IK AF Sbjct: 139 IIKAAF 144 >UniRef50_C0W7A2 Putative uncharacterized protein (Fragment) n=1 Tax=Actinomyces urogenitalis DSM 15434 RepID=C0W7A2_9ACTO Length = 125 Score = 74.7 bits (183), Expect = 7e-13, Method: Composition-based stats. Identities = 21/79 (26%), Positives = 29/79 (36%), Gaps = 11/79 (13%) Query: 3 TVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNER-----GGEIDLIMREG 57 PT +QTG E A +L G + N R GEID++ E Sbjct: 44 QEPTGPQPFPSGDRRQTGRRGEDLAAAYLTDLGWTVLERNWRPRGLAGLRGEIDIVASEP 103 Query: 58 R------TTIFVEVRYRRS 70 T + VEV+ R + Sbjct: 104 SASAGRPTLVVVEVKTRST 122 >UniRef50_B6IVS9 Putative uncharacterized protein n=1 Tax=Rhodospirillum centenum SW RepID=B6IVS9_RHOCS Length = 126 Score = 74.7 bits (183), Expect = 9e-13, Method: Composition-based stats. Identities = 28/123 (22%), Positives = 49/123 (39%), Gaps = 5/123 (4%) Query: 4 VPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFV 63 P RS + + T ++ G E R L KG R +A + GE+D++ V Sbjct: 2 APVRSRTDYR-TAERLGRRAEWLCRLALLLKGYRILATRLRTPAGEVDILAERRGLLAVV 60 Query: 64 EVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEV-EW 122 EV+ R A A+VT + ++ + A+ A R+D++ Sbjct: 61 EVKARPG--LEAARAAVTEADWRRIARAAQG-YAAARPRLAGHAIRYDLMVVLPGRWPVH 117 Query: 123 IKD 125 ++D Sbjct: 118 LED 120 >UniRef50_UPI000190D97F hypothetical protein SentesTyp_00923 n=1 Tax=Salmonella enterica subsp. enterica serovar Typhi str. E98-2068 RepID=UPI000190D97F Length = 82 Score = 74.4 bits (182), Expect = 1e-12, Method: Composition-based stats. Identities = 48/55 (87%), Positives = 51/55 (92%) Query: 32 EGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAASVTRSKQH 86 E KGLRFIAANV ERGGEIDLIMR+G+TT+FVEVRYRRS LYGGAAASVTRSKQ Sbjct: 2 ESKGLRFIAANVRERGGEIDLIMRDGKTTVFVEVRYRRSGLYGGAAASVTRSKQQ 56 >UniRef50_C8WHA2 Putative uncharacterized protein n=1 Tax=Eggerthella lenta DSM 2243 RepID=C8WHA2_EGGLE Length = 123 Score = 73.6 bits (180), Expect = 2e-12, Method: Composition-based stats. Identities = 27/114 (23%), Positives = 50/114 (43%), Gaps = 13/114 (11%) Query: 25 AQARRWLEGKGLRFIAANVNER--GGEIDLIMREG--RTTIFVEVRYRRSALYGGAAASV 80 A R+LE +G +A G IDL+ R+ +FV+V R ++ G Sbjct: 13 EAAARFLEVRGYETLATGWKSPETRGTIDLVARDPESDDLVFVDVSARPNSGAGFGD--- 69 Query: 81 TRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVA--FTGNE---VEWIKDAFND 129 R+ + + A WL ++ + ++V RFD ++ G + + +AF + Sbjct: 70 GRNDRETMELLAVSWLVENDFA-ESVGVRFDKISMIVVGEDRALLRHHINAFGE 122 >UniRef50_Q0I9S1 Uncharacterised protein family protein n=1 Tax=Synechococcus sp. CC9311 RepID=Q0I9S1_SYNS3 Length = 144 Score = 73.2 bits (179), Expect = 2e-12, Method: Composition-based stats. Identities = 27/95 (28%), Positives = 42/95 (44%), Gaps = 5/95 (5%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREG----RTTIFVEVRYR-R 69 ++ G E + L G R + N + R GEIDL+ + + VEV+ R R Sbjct: 14 NSQALGAQAELYVKEVLLRHGWRLLEHNWSCRYGEIDLLFTKQSFPASRILVVEVKARHR 73 Query: 70 SALYGGAAASVTRSKQHKLLQTARLWLARHNGSFD 104 S L G A+ ++K+ L +T W A + S Sbjct: 74 SGLDGWGVAAFHQAKRRCLARTVECWRAANAWSEA 108 >UniRef50_B9XLY1 Putative uncharacterized protein n=1 Tax=bacterium Ellin514 RepID=B9XLY1_9BACT Length = 101 Score = 73.2 bits (179), Expect = 2e-12, Method: Composition-based stats. Identities = 24/59 (40%), Positives = 34/59 (57%) Query: 18 QTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGA 76 Q G+ E A+++L +GL+F AN GEIDLI R+G +FVEV+ R S + Sbjct: 21 QHGELGERAAKKYLRKQGLKFFTANFKSDRGEIDLIFRDGDGLVFVEVKTRSSVDWNLP 79 >UniRef50_D1ATK0 Putative uncharacterized protein n=1 Tax=Anaplasma centrale str. Israel RepID=D1ATK0_ANACI Length = 151 Score = 72.8 bits (178), Expect = 3e-12, Method: Composition-based stats. Identities = 21/89 (23%), Positives = 43/89 (48%), Gaps = 4/89 (4%) Query: 40 AANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARH 99 GEIDLI++ GR F+EV+ ++ + VT ++ +++TA+ +L+RH Sbjct: 66 HHRYRSPLGEIDLIVQNGRELYFIEVKTSMTSRFREVP--VTGKQRRSIVRTAQYFLSRH 123 Query: 100 NGSFDTVDCRFDVVAFTGNE-VEWIKDAF 127 ++ F+V + + +A+ Sbjct: 124 PQFYEH-QISFEVYCISPRSGITRFVNAW 151 >UniRef50_Q1GWI7 UPF0102 protein Sala_0262 n=5 Tax=Sphingomonadales RepID=Y262_SPHAL Length = 116 Score = 69.7 bits (170), Expect = 3e-11, Method: Composition-based stats. Identities = 22/101 (21%), Positives = 40/101 (39%), Gaps = 5/101 (4%) Query: 30 WLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAASVTRSKQHKLL 89 WL G R + + GE+DL+ R GRT F+EV++R ++ + ++ Sbjct: 20 WLRLHGWRIVGQRLRVPVGEVDLVARRGRTVAFIEVKWR--DRAADLDLAIDPYRLRRVA 77 Query: 90 QTARLWLARHNGSFDTVDCRFDVVAFTGNEV-EWIKDAFND 129 A + R +D R DV+ + + + Sbjct: 78 AAAEMLAPRFARPYDD--IRIDVMLLAPRRLPRHLVHVWQP 116 >UniRef50_C0BCN9 Putative uncharacterized protein n=1 Tax=Coprococcus comes ATCC 27758 RepID=C0BCN9_9FIRM Length = 70 Score = 69.4 bits (169), Expect = 4e-11, Method: Composition-based stats. Identities = 18/63 (28%), Positives = 34/63 (53%), Gaps = 2/63 (3%) Query: 66 RYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 ++R++ A +V+ KQ +L A +L ++ + + CRFDV+ GN++ K+ Sbjct: 2 KFRKTGGLSAALEAVSVPKQMRLSGAAVYYLMKNGCT--EIPCRFDVIGIAGNKISLRKN 59 Query: 126 AFN 128 AF Sbjct: 60 AFE 62 >UniRef50_C8W847 Putative uncharacterized protein n=1 Tax=Atopobium parvulum DSM 20469 RepID=C8W847_ATOPD Length = 117 Score = 66.3 bits (161), Expect = 3e-10, Method: Composition-based stats. Identities = 23/111 (20%), Positives = 43/111 (38%), Gaps = 11/111 (9%) Query: 24 EAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAASVTRS 83 EA + L+ + + + N ID I+ + IFV+ + + Sbjct: 11 EAISAN-LKHRDIEVLEKNWAHGSDGIDFIVMDDEELIFVDTAT-KCGGFDVPREEPD-- 66 Query: 84 KQHKLLQTARLWLARHNGSFDTVDCRFDVVA--FTGNE---VEWIKDAFND 129 Q + + A +LA + R+D+V+ TG+E + K+ ND Sbjct: 67 -QERFERIAAAYLAE-SEVEGLASIRYDIVSLLVTGSEKALLRHHKNVLND 115 >UniRef50_Q4E9U1 Endonuclease (Fragment) n=5 Tax=Wolbachia RepID=Q4E9U1_9RICK Length = 107 Score = 62.4 bits (151), Expect = 4e-09, Method: Composition-based stats. Identities = 16/77 (20%), Positives = 36/77 (46%), Gaps = 5/77 (6%) Query: 36 LRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLW 95 I + GEIDLI+ + + IF+EV+ ++ + ++ +++ + Sbjct: 26 YNVIKRRYRCKFGEIDLIVSKKKELIFIEVKTSLLGKEIP----ISHLQCQSIINSSKYF 81 Query: 96 LARHNGSFDTVDCRFDV 112 L+++ SF R+D+ Sbjct: 82 LSKN-LSFLDYSVRYDL 97 >UniRef50_Q73VP8 Putative uncharacterized protein n=1 Tax=Mycobacterium avium subsp. paratuberculosis RepID=Q73VP8_MYCPA Length = 98 Score = 60.5 bits (146), Expect = 2e-08, Method: Composition-based stats. Identities = 15/39 (38%), Positives = 19/39 (48%) Query: 18 QTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE 56 Q G EA A L GLR + N R GE+D+I + Sbjct: 42 QLGAMGEALAVDHLTRMGLRVLHRNWRCRYGELDIIACD 80 >UniRef50_Q3AKE1 Uncharacterised protein family UPF0102 n=2 Tax=Chroococcales RepID=Q3AKE1_SYNSC Length = 114 Score = 59.7 bits (144), Expect = 3e-08, Method: Composition-based stats. Identities = 18/73 (24%), Positives = 34/73 (46%), Gaps = 1/73 (1%) Query: 35 GLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG-GAAASVTRSKQHKLLQTAR 93 G R + N + R GE+DL++ + VEV+ RR + + +K+ ++ + Sbjct: 22 GWRLLDRNWHCRWGELDLVLERQLLLLVVEVKGRRMGHHDRHGLDAFHSAKRRRMARAIS 81 Query: 94 LWLARHNGSFDTV 106 W A H S + + Sbjct: 82 CWRAVHPASAEQL 94 >UniRef50_C7N801 Predicted endonuclease related to Holliday junction resolvase n=1 Tax=Slackia heliotrinireducens DSM 20476 RepID=C7N801_SLAHD Length = 130 Score = 58.2 bits (140), Expect = 7e-08, Method: Composition-based stats. Identities = 21/90 (23%), Positives = 36/90 (40%), Gaps = 5/90 (5%) Query: 25 AQARRWLEG-KGLRFIAANVNERGGEIDLIMREG--RTTIFVEVRYRRSALYGGAAASVT 81 A AR +L+ KG + + + ID I + +FVE+R R Y Sbjct: 17 ALARVFLQREKGFAILKDDFSRGLDSIDFIALDDTQTVIVFVEMRLRHENSYIDKNPRSD 76 Query: 82 RSKQHKLLQTARLWLARHNGSFDTVDCRFD 111 K + A +L+++ F ++ R D Sbjct: 77 FDK-RRFEHLALAFLSKYPCLF-NLEIRAD 104 >UniRef50_Q8W6V7 Putative uncharacterized protein n=1 Tax=Synechococcus phage P60 RepID=Q8W6V7_9CAUD Length = 99 Score = 55.9 bits (134), Expect = 4e-07, Method: Composition-based stats. Identities = 29/113 (25%), Positives = 43/113 (38%), Gaps = 21/113 (18%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 ++T + G + E A L G NV G ID+++ +G T ++V+ Sbjct: 2 ISTHKRGASAELLACAALVDAGFEVF-RNV-TPDGPIDIVVWDGETFYPIDVKRASHY-- 57 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDA 126 +KQ K A L L N + CR D +E WI DA Sbjct: 58 --------VNKQGK----ATLKLPAKNNEHALILCRTD-----KDEWVWINDA 93 >UniRef50_A6DBD4 Putative uncharacterized protein (Fragment) n=1 Tax=Caminibacter mediatlanticus TB-2 RepID=A6DBD4_9PROT Length = 43 Score = 55.5 bits (133), Expect = 5e-07, Method: Composition-based stats. Identities = 14/43 (32%), Positives = 24/43 (55%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE 56 + +K G+ E +A+ +L K R + N +GGEID+I + Sbjct: 1 MNSKSKGNIAEKKAKEYLLNKKFRIVETNFYCKGGEIDIIAYK 43 >UniRef50_Q8TW03 Predicted endonuclease of the RecB family n=1 Tax=Methanopyrus kandleri RepID=Q8TW03_METKA Length = 258 Score = 53.9 bits (129), Expect = 2e-06, Method: Composition-based stats. Identities = 13/55 (23%), Positives = 21/55 (38%), Gaps = 5/55 (9%) Query: 18 QTGDAWEAQARRWLEGKGLRFIAANVNERG-----GEIDLIMREGRTTIFVEVRY 67 + G + E A L +G +A N EID++ + VEV+ Sbjct: 3 RRGKSAEEIAASILRKEGFEVVARNYRVELEDELVAEIDIVAEKDGERYAVEVKA 57 >UniRef50_A8A9D3 Putative uncharacterized protein n=1 Tax=Ignicoccus hospitalis KIN4/I RepID=A8A9D3_IGNH4 Length = 201 Score = 53.6 bits (128), Expect = 2e-06, Method: Composition-based stats. Identities = 31/115 (26%), Positives = 44/115 (38%), Gaps = 12/115 (10%) Query: 11 PRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGG----EIDLIMREGRTTIFVEVR 66 + + A+E+ R LE G + NV R G E D+I +G I VE + Sbjct: 60 EHEAASYLNWKAFESYVARALEEAGFETL-KNVRVRAGDKLAEFDVIGYDGDKVIVVECK 118 Query: 67 YRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVE 121 R SA A V + K+ + A WLA+ V VV G + Sbjct: 119 -RWSAFRRSALLKVAEEHKAKVERAA-YWLAKLGKRALPV-----VVTLRGTPIR 166 >UniRef50_A4GJ57 Putative uncharacterized protein n=1 Tax=uncultured marine Nitrospinaceae bacterium RepID=A4GJ57_9DELT Length = 64 Score = 53.6 bits (128), Expect = 2e-06, Method: Composition-based stats. Identities = 15/61 (24%), Positives = 30/61 (49%), Gaps = 7/61 (11%) Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-----VEWIKDAF 127 +G ++T +KQ K++Q + +L + RFDVV T + +E ++++F Sbjct: 5 FGHQFDALTPTKQKKIIQITQSFLVQKRIP--DKSMRFDVVVLTLDRPDSCKIELLENSF 62 Query: 128 N 128 Sbjct: 63 Q 63 >UniRef50_UPI0001699F06 hypothetical protein Epers_29808 n=1 Tax=Endoriftia persephone 'Hot96_1+Hot96_2' RepID=UPI0001699F06 Length = 60 Score = 53.6 bits (128), Expect = 2e-06, Method: Composition-based stats. Identities = 18/40 (45%), Positives = 30/40 (75%) Query: 54 MREGRTTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTAR 93 M++G + +FVEVRYR+S +G AA S+T +K+ KL+ ++ Sbjct: 1 MQDGNSLVFVEVRYRKSDNFGSAAESITAAKRAKLIAASQ 40 >UniRef50_C5U625 Putative uncharacterized protein n=1 Tax=Methanocaldococcus infernus ME RepID=C5U625_9EURY Length = 108 Score = 52.8 bits (126), Expect = 4e-06, Method: Composition-based stats. Identities = 17/57 (29%), Positives = 26/57 (45%), Gaps = 5/57 (8%) Query: 18 QTGDAWEAQARRWLEGKGLRFIAANV-----NERGGEIDLIMREGRTTIFVEVRYRR 69 + G E +A +L+ KG + I NV + E D+I + G VEV+ R Sbjct: 2 RKGKKKEGRAANYLKEKGYKIIGRNVIKRINQHKKAEYDIIAKRGNYKYAVEVKSGR 58 >UniRef50_O33024 UPF0102 protein ML1607 n=2 Tax=Mycobacterium leprae RepID=Y1607_MYCLE Length = 96 Score = 51.6 bits (123), Expect = 7e-06, Method: Composition-based stats. Identities = 15/47 (31%), Positives = 20/47 (42%) Query: 10 SPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE 56 + + +T Q E A L GLR + N R GE D+I E Sbjct: 3 THKAMTRVQLEAMGEVFAVDNLTRMGLRGLHCNWRCRYGECDVIASE 49 >UniRef50_Q5GSW9 RecB family endonuclease n=1 Tax=Wolbachia endosymbiont strain TRS of Brugia malayi RepID=Q5GSW9_WOLTR Length = 114 Score = 48.9 bits (116), Expect = 4e-05, Method: Composition-based stats. Identities = 14/70 (20%), Positives = 32/70 (45%), Gaps = 5/70 (7%) Query: 43 VNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGS 102 + EIDLI+ + + IF+EV+ ++ + ++ +++ +L+ S Sbjct: 33 YCCKFSEIDLIVSKKKELIFIEVKASLLGEDIL----ISYLQYQSIVNSSKYFLSEK-LS 87 Query: 103 FDTVDCRFDV 112 F R+D+ Sbjct: 88 FLDYPIRYDL 97 >UniRef50_Q3BT99 Putative uncharacterized protein n=2 Tax=Xanthomonas RepID=Q3BT99_XANC5 Length = 708 Score = 45.1 bits (106), Expect = 6e-04, Method: Composition-based stats. Identities = 23/89 (25%), Positives = 34/89 (38%), Gaps = 8/89 (8%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLR-FIAANVNERGGEIDLIMRE-GRTTIFVEVRYRRS 70 +LTT+Q GD EA L +G +A G IDL+ R G F E++ Sbjct: 432 RLTTRQLGDIGEAIQTHELVKQGYSDIVAIKNRSGHG-IDLVGRNPGGELEFFEIKTSAK 490 Query: 71 ----ALYGGAAASVTRSKQHKLLQTARLW 95 A +G V + + + W Sbjct: 491 GMAPAQHGDPEQFV-AKRLERAIDAKGHW 518 >UniRef50_Q5NWY8 Putative uncharacterized protein n=1 Tax=Aromatoleum aromaticum EbN1 RepID=Q5NWY8_AZOSE Length = 196 Score = 45.1 bits (106), Expect = 7e-04, Method: Composition-based stats. Identities = 27/133 (20%), Positives = 49/133 (36%), Gaps = 20/133 (15%) Query: 9 GSPRQLTTKQTGDAWE----AQARRWLEGKGLRF-IAANVNERGGEIDLIMREGRTTIFV 63 R+ +Q G E A+ LE G + ++ RGG+IDLI + +++ V Sbjct: 57 NQHRRAQVRQHGQHVEAKCGQLAKTALESDGYTVALGQRLH-RGGDIDLIATKDGSSVVV 115 Query: 64 EVRY--------RRSALYGGAAASVTRSKQHKL-LQTARLWLARHNGSFDTVDCRFDV-- 112 E++ R A V +Q + + A +WL + + + + Sbjct: 116 ELKSFRYWGARGRDDWREKKAIEQV-LRQQDTIAAKAAVIWLPMASPTLWQLLWGYSFGG 174 Query: 113 --VAFTGNEVEWI 123 VA V + Sbjct: 175 RGVAVVRGGVRHL 187 >UniRef50_Q9Y9F5 Putative uncharacterized protein n=1 Tax=Aeropyrum pernix RepID=Q9Y9F5_AERPE Length = 213 Score = 44.7 bits (105), Expect = 0.001, Method: Composition-based stats. Identities = 15/61 (24%), Positives = 25/61 (40%), Gaps = 9/61 (14%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNER-------GGEIDLIMREGRTTIFVEVR 66 + ++ E A R LE G R + + GEID++ +G + VEV+ Sbjct: 1 MAGRRAWRNSEEIAARILEKSGFRVLD--FHVPIEDGGVEVGEIDIVAEKGGSRYSVEVK 58 Query: 67 Y 67 Sbjct: 59 A 59 >UniRef50_C1E903 Predicted protein n=1 Tax=Micromonas sp. RCC299 RepID=C1E903_9CHLO Length = 682 Score = 43.9 bits (103), Expect = 0.002, Method: Composition-based stats. Identities = 17/66 (25%), Positives = 24/66 (36%), Gaps = 11/66 (16%) Query: 15 TTKQTGDAWEAQARRWLEGK--GLRFIAANVNERGGE----IDLIMR--EGRTTIFVEVR 66 ++ G EA R+L + G E D+ MR +FVEV+ Sbjct: 562 DNRRVGRWGEALVYRYLLQRHPGWTVT---WVNEHAESKSFYDVKMRNVRDGRIVFVEVK 618 Query: 67 YRRSAL 72 RSA Sbjct: 619 TTRSAD 624 >UniRef50_D1TJS5 Putative uncharacterized protein n=1 Tax=Burkholderia sp. CCGE1002 RepID=D1TJS5_9BURK Length = 341 Score = 42.0 bits (98), Expect = 0.007, Method: Composition-based stats. Identities = 11/48 (22%), Positives = 19/48 (39%), Gaps = 2/48 (4%) Query: 21 DAWEAQARRWLEGKGLRFIAANVNE--RGGEIDLIMREGRTTIFVEVR 66 +E+ L G + N+ R E+D+ R+ VEV+ Sbjct: 8 QQFESIVAELLVKLGFEKVERNIAHPARRAEVDITFRKKSELAVVEVK 55 >UniRef50_Q6MLA1 Putative uncharacterized protein n=1 Tax=Bdellovibrio bacteriovorus RepID=Q6MLA1_BDEBA Length = 97 Score = 41.6 bits (97), Expect = 0.007, Method: Composition-based stats. Identities = 23/101 (22%), Positives = 45/101 (44%), Gaps = 21/101 (20%) Query: 28 RRWLEGKGLRFIAANVNERGGEIDLIMREGR-TTIFVEVRYRRSALYGGAAASVTRSKQH 86 ++ + K + V E+DL+ + R T + VEV+ + + +T+ ++ Sbjct: 2 IKYYQLKCCHLLGQRVKTPFAEVDLLFKTPRQTLLMVEVKTTNLSDFQP--FRITKKQKA 59 Query: 87 KLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAF 127 +L++ A L+LA R+DV+ E+ W AF Sbjct: 60 RLVR-AMLFLAA----------RWDVLV----EIHW---AF 82 >UniRef50_B7KKS4 Putative uncharacterized protein n=1 Tax=Cyanothece sp. PCC 7424 RepID=B7KKS4_CYAP7 Length = 678 Score = 41.6 bits (97), Expect = 0.008, Method: Composition-based stats. Identities = 14/54 (25%), Positives = 24/54 (44%), Gaps = 3/54 (5%) Query: 20 GDAWEAQARRWLEGKGLRFIAANVNE-RGGEIDLIMREGRTTIFVEVRYRRSAL 72 G+ E +A ++ + G +NV+ DL + IFVEV+ S+ Sbjct: 517 GNFGEDKAIQFYQALGYEV--SNVSNQPQKGYDLECIKDGQEIFVEVKTISSSN 568 >UniRef50_A2BIV6 Endonuclease of RecB family n=1 Tax=Hyperthermus butylicus DSM 5456 RepID=A2BIV6_HYPBU Length = 235 Score = 40.8 bits (95), Expect = 0.015, Method: Composition-based stats. Identities = 17/67 (25%), Positives = 27/67 (40%), Gaps = 6/67 (8%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIA--ANVNERG---GEIDLIMR-EGRTTIFVEVRY 67 ++ + A E A R LE +G + V G E+D + R VEV+ Sbjct: 1 MSGMKRWHASERIAFRLLEEQGYEILEVHKRVRIEGVEVAEVDAVARGPDGELYAVEVKA 60 Query: 68 RRSALYG 74 R ++G Sbjct: 61 GRLDVHG 67 >UniRef50_A3DLW4 Endonuclease (RecB family)-like protein n=1 Tax=Staphylothermus marinus F1 RepID=A3DLW4_STAMF Length = 236 Score = 40.5 bits (94), Expect = 0.018, Method: Composition-based stats. Identities = 15/61 (24%), Positives = 28/61 (45%), Gaps = 6/61 (9%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNER-----GGEIDLIMREG-RTTIFVEVR 66 L+ K+ + E A ++LE +G + I +V + EID I+ + VE++ Sbjct: 2 SLSAKRKWRSSEEIALQFLEQQGFKIIDKHVKVKIEGVEVSEIDAIVEDEKGEKYAVEIK 61 Query: 67 Y 67 Sbjct: 62 A 62 >UniRef50_B5IHF1 ATPase n=1 Tax=Aciduliprofundum boonei T469 RepID=B5IHF1_9EURY Length = 390 Score = 40.1 bits (93), Expect = 0.022, Method: Composition-based stats. Identities = 15/95 (15%), Positives = 34/95 (35%), Gaps = 21/95 (22%) Query: 11 PRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNER-GGEIDLIMREGRTTIFVEVRYRR 69 PR++ G +E L GL + E+D I+ + +EV+ Sbjct: 283 PREMD----GLLFENYVLSELIKMGLEP--RYWRTKSKAEVDFIVERDGKIVPIEVKL-- 334 Query: 70 SALYGGAAASVTRSKQHKLLQTARLWLARHNGSFD 104 ++K K+ ++ R ++ ++ + Sbjct: 335 ------------QAKPEKVEKSMRAFIEKYEPEYA 357 >UniRef50_D0CIJ2 Putative uncharacterized protein n=2 Tax=Bacteria RepID=D0CIJ2_9SYNE Length = 46 Score = 39.7 bits (92), Expect = 0.030, Method: Composition-based stats. Identities = 8/19 (42%), Positives = 12/19 (63%) Query: 35 GLRFIAANVNERGGEIDLI 53 G R + N + R GE+DL+ Sbjct: 22 GWRLLDRNWHCRWGELDLV 40 >UniRef50_A7VFU4 Putative uncharacterized protein n=6 Tax=Bacteria RepID=A7VFU4_9CLOT Length = 416 Score = 39.3 bits (91), Expect = 0.036, Method: Composition-based stats. Identities = 10/63 (15%), Positives = 22/63 (34%), Gaps = 4/63 (6%) Query: 5 PTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRF-IAANVNERGGEIDLIMREGRTTIFV 63 P + G +E L +G + + E+D + ++ T I++ Sbjct: 304 PAFRYAVNGTRNMDFGRVYENIVYLELRRRGYEVYVGKLYKK---EVDFVAKKRDTLIYI 360 Query: 64 EVR 66 +V Sbjct: 361 QVS 363 >UniRef50_C4F8E7 Putative uncharacterized protein n=1 Tax=Collinsella intestinalis DSM 13280 RepID=C4F8E7_9ACTN Length = 115 Score = 38.9 bits (90), Expect = 0.054, Method: Composition-based stats. Identities = 19/97 (19%), Positives = 33/97 (34%), Gaps = 20/97 (20%) Query: 23 WEAQARRWLEGKGLRFIA-ANVNERGGEIDLIMREG---RTTIFVEVRYRRSALYGGAAA 78 E A+ +L K L+ + G+ D I + + VE +R Sbjct: 6 GELIAKEFLLSKDLKSVDMTGYECDEGKADAICIDESGCHVLVNVETHRKRG-------- 57 Query: 79 SVTRSKQ----HKLLQTARLWLARHNGSFDTVDCRFD 111 V KQ ++ + +LA H + R+D Sbjct: 58 -VEEPKQVYNVKRMRRVLMCYLADHP---EVKAARYD 90 >UniRef50_Q05TI5 Putative uncharacterized protein n=1 Tax=Synechococcus sp. RS9916 RepID=Q05TI5_9SYNE Length = 89 Score = 38.5 bits (89), Expect = 0.064, Method: Composition-based stats. Identities = 10/45 (22%), Positives = 16/45 (35%), Gaps = 1/45 (2%) Query: 58 RTTIFVEVRYRR-SALYGGAAASVTRSKQHKLLQTARLWLARHNG 101 + VEV+ RR G A+ K +L + W + Sbjct: 10 GRLLVVEVKARRRCGRDGWGVAACNAGKLQRLARAMACWRMANPW 54 >UniRef50_A8MBK5 Putative uncharacterized protein n=1 Tax=Caldivirga maquilingensis IC-167 RepID=A8MBK5_CALMQ Length = 211 Score = 38.2 bits (88), Expect = 0.081, Method: Composition-based stats. Identities = 14/54 (25%), Positives = 20/54 (37%), Gaps = 6/54 (11%) Query: 20 GDAWEAQARRWLEGKGLRFIAANVNERG-----GEIDLIMREG-RTTIFVEVRY 67 G +E L G R + V GE+DLI+ + VEV+ Sbjct: 4 GVRFEDYVAELLSRLGFRVMDRRVKVTSNGVEVGEVDLIVEDECGNKYSVEVKS 57 >UniRef50_A3CY54 Restriction endonuclease n=1 Tax=Methanoculleus marisnigri JR1 RepID=A3CY54_METMJ Length = 276 Score = 38.2 bits (88), Expect = 0.083, Method: Composition-based stats. Identities = 21/120 (17%), Positives = 32/120 (26%), Gaps = 18/120 (15%) Query: 2 ATVPTRSGSPRQLT-TKQTGDA-----WEAQARRWLEGKGLRFIAANVN----ERGGEID 51 R + + + G +E R L G R E+D Sbjct: 60 RAKAVRPAAGHRTNLRRALGLLRSKPDFEEFVRVLLREHGYRV-ETGCVLAGLCGEHEVD 118 Query: 52 LIMREGRTTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFD 111 I TIFVEV++ S VT + ++ + L D Sbjct: 119 AIAERDGATIFVEVKHHASHH------RVTGLDEGRIARAIIEDLQE-GFRAGRCTVSID 171 >UniRef50_Q7VH71 Putative uncharacterized protein n=1 Tax=Helicobacter hepaticus RepID=Q7VH71_HELHP Length = 154 Score = 38.2 bits (88), Expect = 0.090, Method: Composition-based stats. Identities = 21/116 (18%), Positives = 39/116 (33%), Gaps = 19/116 (16%) Query: 1 MATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRF----IAANVNERGGEIDLIMRE 56 M S KQ GD +E Q R + +G + + ++G ID+I + Sbjct: 19 MYAYRENKDSNNARHNKQKGDKYELQIVRHYKQQGYKVYPKGLKEGRRDKG--IDIIAYK 76 Query: 57 GRTTIFVEVRYRRSALYGGAAASVTRSKQHKLLQT---ARLWLARHNGSFDTVDCR 109 G+ + ++ + + KQ L +L ++ F R Sbjct: 77 GKEALLIQCKNWERSQV----------KQEHLRIFLGDCTAYLEQNQKIFAKRSVR 122 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_A4WEW3 UPF0102 protein Ent638_3585 n=124 Tax=Enterobact... 164 6e-40 UniRef50_D1P1S8 Putative choloylglycine hydrolase n=5 Tax=Entero... 158 4e-38 UniRef50_C4ZD68 Putative uncharacterized protein n=1 Tax=Eubacte... 157 1e-37 UniRef50_A5KJ12 Putative uncharacterized protein n=2 Tax=Clostri... 154 7e-37 UniRef50_A8SP33 Putative uncharacterized protein n=2 Tax=Clostri... 154 7e-37 UniRef50_A9KLL5 UPF0102 protein Cphy_2398 n=3 Tax=Clostridiales ... 154 1e-36 UniRef50_C1SJ30 Putative uncharacterized protein n=1 Tax=Denitro... 152 4e-36 UniRef50_UPI0001973CC8 hypothetical protein ClM62_15129 n=1 Tax=... 152 4e-36 UniRef50_Q0AWX4 UPF0102 protein Swol_1475 n=1 Tax=Syntrophomonas... 152 4e-36 UniRef50_B5YIA9 UPF0102 protein THEYE_A1950 n=1 Tax=Thermodesulf... 152 4e-36 UniRef50_A8MHC8 UPF0102 protein Clos_1471 n=8 Tax=Clostridiaceae... 151 6e-36 UniRef50_C7R9K3 Putative uncharacterized protein n=1 Tax=Kangiel... 151 7e-36 UniRef50_Q1JX98 Putative uncharacterized protein n=1 Tax=Desulfu... 150 9e-36 UniRef50_B0P3K1 Putative uncharacterized protein n=2 Tax=Clostri... 150 1e-35 UniRef50_C4Z0D1 Putative endonuclease n=13 Tax=Clostridiales Rep... 150 1e-35 UniRef50_C4V2D8 Endonuclease n=1 Tax=Selenomonas flueggei ATCC 4... 150 2e-35 UniRef50_A6FEY4 Putative uncharacterized protein n=1 Tax=Moritel... 150 2e-35 UniRef50_Q5ZR89 UPF0102 protein lpg2994 n=4 Tax=Legionella RepID... 149 2e-35 UniRef50_C4L9F6 Putative uncharacterized protein n=1 Tax=Tolumon... 149 2e-35 UniRef50_A0KPY4 UPF0102 protein AHA_3896 n=4 Tax=Proteobacteria ... 149 3e-35 UniRef50_D1RD77 Putative uncharacterized protein n=1 Tax=Legione... 148 4e-35 UniRef50_C0QTY9 UPF0102 protein PERMA_0362 n=2 Tax=Hydrogenother... 148 5e-35 UniRef50_Q3JE65 UPF0102 protein Noc_0355 n=2 Tax=Nitrosococcus o... 148 5e-35 UniRef50_Q6AJE4 UPF0102 protein DP2807 n=1 Tax=Desulfotalea psyc... 148 5e-35 UniRef50_B0U003 UPF0102 protein Fphi_0415 n=15 Tax=Francisella R... 147 9e-35 UniRef50_D1KBC1 Putative uncharacterized protein n=1 Tax=uncultu... 147 1e-34 UniRef50_C2KW07 Putative uncharacterized protein n=1 Tax=Oribact... 147 1e-34 UniRef50_Q47VU1 UPF0102 protein CPS_4433 n=1 Tax=Colwellia psych... 147 2e-34 UniRef50_C8X4T0 Putative uncharacterized protein n=1 Tax=Desulfo... 146 2e-34 UniRef50_B2V8B3 UPF0102 protein SYO3AOP1_0546 n=2 Tax=Sulfurihyd... 146 2e-34 UniRef50_B8J1T1 Putative uncharacterized protein n=1 Tax=Desulfo... 145 3e-34 UniRef50_C5BS52 Putative uncharacterized protein n=1 Tax=Teredin... 145 4e-34 UniRef50_A9NAA4 UPF0102 protein COXBURSA331_A1934 n=6 Tax=Coxiel... 145 5e-34 UniRef50_D1VVL2 Putative uncharacterized protein n=1 Tax=Peptoni... 145 5e-34 UniRef50_Q7N090 UPF0102 protein plu4003 n=2 Tax=Enterobacteriace... 145 5e-34 UniRef50_C0N677 Putative uncharacterized protein n=1 Tax=Methylo... 144 7e-34 UniRef50_B6ELI6 UPF0102 protein VSAL_I2655 n=7 Tax=Vibrionaceae ... 144 7e-34 UniRef50_A5F986 UPF0102 protein VC0395_A0112/VC395_0597 n=28 Tax... 144 8e-34 UniRef50_B8CW28 Putative uncharacterized protein n=1 Tax=Halothe... 144 8e-34 UniRef50_A3XHA2 Putative uncharacterized protein n=1 Tax=Leeuwen... 144 8e-34 UniRef50_A0YER5 Putative uncharacterized protein n=1 Tax=marine ... 144 8e-34 UniRef50_C0GQX2 Putative uncharacterized protein n=1 Tax=Desulfo... 144 8e-34 UniRef50_A1SU47 UPF0102 protein Ping_1176 n=2 Tax=Psychromonas R... 144 9e-34 UniRef50_A1AN88 UPF0102 protein Ppro_1186 n=11 Tax=Deltaproteoba... 144 1e-33 UniRef50_Q67PD3 UPF0102 protein STH1475 n=1 Tax=Symbiobacterium ... 144 1e-33 UniRef50_B7GSU4 UPF0102 protein Blon_1698 n=12 Tax=Bifidobacteri... 143 2e-33 UniRef50_Q2S9Y0 UPF0102 protein HCH_05895 n=1 Tax=Hahella chejue... 143 2e-33 UniRef50_Q0VS15 UPF0102 protein ABO_0585 n=2 Tax=Alcanivorax Rep... 142 2e-33 UniRef50_C7RDQ1 Putative uncharacterized protein n=1 Tax=Anaeroc... 142 2e-33 UniRef50_A4BQJ8 Putative uncharacterized protein n=1 Tax=Nitroco... 142 3e-33 UniRef50_Q1Q244 Putative uncharacterized protein n=1 Tax=Candida... 142 3e-33 UniRef50_B9MQX5 UPF0102 protein Athe_0977 n=1 Tax=Anaerocellum t... 142 3e-33 UniRef50_C4GB01 Putative uncharacterized protein n=1 Tax=Shuttle... 142 3e-33 UniRef50_B0G5Y9 Putative uncharacterized protein n=4 Tax=Clostri... 142 4e-33 UniRef50_B3EJJ5 UPF0102 protein Cphamn1_0017 n=1 Tax=Chlorobium ... 142 4e-33 UniRef50_C6A8H5 Putative uncharacterized protein n=4 Tax=Bifidob... 141 5e-33 UniRef50_Q7MNW2 UPF0102 protein VV0603 n=80 Tax=Vibrionales RepI... 141 6e-33 UniRef50_Q8R5S3 UPF0102 protein TTE1452 n=9 Tax=Thermoanaerobact... 141 6e-33 UniRef50_D1CBL5 Putative uncharacterized protein n=1 Tax=Thermob... 141 7e-33 UniRef50_B7RSM7 Putative uncharacterized protein n=1 Tax=marine ... 141 7e-33 UniRef50_A3WNE6 Predicted endonuclease n=1 Tax=Idiomarina baltic... 140 9e-33 UniRef50_A1ZF36 Putative uncharacterized protein n=1 Tax=Microsc... 140 9e-33 UniRef50_A6VB97 UPF0102 protein PSPA7_4996 n=17 Tax=Pseudomonada... 140 9e-33 UniRef50_C6WYK5 Putative uncharacterized protein n=1 Tax=Methylo... 140 9e-33 UniRef50_D0KYK3 Putative uncharacterized protein n=1 Tax=Halothi... 140 1e-32 UniRef50_A8SLV5 Putative uncharacterized protein n=1 Tax=Parvimo... 140 1e-32 UniRef50_C9KJR5 Endonuclease n=2 Tax=Veillonellaceae RepID=C9KJR... 140 1e-32 UniRef50_Q1GYY7 UPF0102 protein Mfla_2283 n=1 Tax=Methylobacillu... 140 1e-32 UniRef50_A5N821 UPF0102 protein CKL_1410 n=16 Tax=Clostridium Re... 140 2e-32 UniRef50_C0YUE8 Possible endonuclease n=2 Tax=Flavobacteriaceae ... 140 2e-32 UniRef50_Q7P0B3 UPF0102 protein CV_0654 n=1 Tax=Chromobacterium ... 139 2e-32 UniRef50_B4RXI2 Sigma-54 factor n=2 Tax=Alteromonas macleodii Re... 139 2e-32 UniRef50_B8DRI1 Putative uncharacterized protein n=2 Tax=Desulfo... 139 2e-32 UniRef50_D0MIM6 Putative uncharacterized protein n=1 Tax=Rhodoth... 139 3e-32 UniRef50_A8G183 UPF0102 protein Ssed_4252 n=16 Tax=Shewanella Re... 139 3e-32 UniRef50_B2KC57 Putative uncharacterized protein n=1 Tax=Elusimi... 139 4e-32 UniRef50_B5YFD1 UPF0102 protein DICTH_1420 n=2 Tax=Dictyoglomus ... 139 4e-32 UniRef50_C8W5C0 Putative uncharacterized protein n=1 Tax=Desulfo... 138 5e-32 UniRef50_A8PP71 Putative uncharacterized protein n=1 Tax=Rickett... 138 7e-32 UniRef50_Q0TPP8 UPF0102 protein CPF_1959 n=9 Tax=Clostridium per... 138 7e-32 UniRef50_A3M3I7 UPF0102 protein A1S_1049 n=12 Tax=Acinetobacter ... 137 8e-32 UniRef50_C6BVQ7 Putative uncharacterized protein n=1 Tax=Desulfo... 137 9e-32 UniRef50_UPI0000510419 hypothetical protein BlinB_18076 n=1 Tax=... 137 1e-31 UniRef50_A4J649 UPF0102 protein Dred_2035 n=1 Tax=Desulfotomacul... 137 1e-31 UniRef50_C9LW74 Endonuclease n=1 Tax=Selenomonas sputigena ATCC ... 137 1e-31 UniRef50_A5EVA6 UPF0102 protein DNO_0639 n=2 Tax=Cardiobacteriac... 137 2e-31 UniRef50_Q31EY6 UPF0102 protein Tcr_1695 n=1 Tax=Thiomicrospira ... 137 2e-31 UniRef50_Q2BGY7 Putative uncharacterized protein n=1 Tax=Neptuni... 136 2e-31 UniRef50_B2A2P1 UPF0102 protein Nther_1376 n=1 Tax=Natranaerobiu... 136 2e-31 UniRef50_A1KWG5 UPF0102 protein NMC2069 n=28 Tax=Neisseriaceae R... 136 2e-31 UniRef50_B3JMU0 Putative uncharacterized protein n=6 Tax=Bactero... 136 2e-31 UniRef50_A0LV62 UPF0102 protein Acel_1550 n=8 Tax=Actinomycetale... 136 2e-31 UniRef50_C8PPR7 HD domain protein n=1 Tax=Treponema vincentii AT... 136 2e-31 UniRef50_B0VJC5 Putative uncharacterized protein n=1 Tax=Candida... 135 3e-31 UniRef50_A6TRS2 UPF0102 protein Amet_2739 n=1 Tax=Alkaliphilus m... 135 3e-31 UniRef50_C3W9T3 Endonuclease n=4 Tax=Fusobacterium RepID=C3W9T3_... 135 3e-31 UniRef50_A1SB01 UPF0102 protein Sama_3355 n=5 Tax=Shewanella Rep... 135 3e-31 UniRef50_A4FME3 UPF0102 protein SACE_6045 n=2 Tax=Actinomycetale... 135 4e-31 UniRef50_C2LNN6 Possible endonuclease n=3 Tax=Proteus RepID=C2LN... 135 4e-31 UniRef50_C6D2Y6 Putative uncharacterized protein n=1 Tax=Paeniba... 135 4e-31 UniRef50_D0I4Y5 Endonuclease n=1 Tax=Grimontia hollisae CIP 1018... 135 5e-31 UniRef50_A8ZV12 UPF0102 protein Dole_2298 n=2 Tax=Desulfobactera... 135 5e-31 UniRef50_Q3IG11 UPF0102 protein PSHAa2523 n=3 Tax=Alteromonadale... 135 5e-31 UniRef50_Q1MRU7 UPF0102 protein LI0223 n=1 Tax=Lawsonia intracel... 135 6e-31 UniRef50_C7HUU3 Endonuclease n=4 Tax=Anaerococcus RepID=C7HUU3_9... 134 6e-31 UniRef50_Q1NMK2 Putative uncharacterized protein n=1 Tax=delta p... 134 7e-31 UniRef50_C4FFG3 Putative uncharacterized protein n=1 Tax=Bifidob... 134 7e-31 UniRef50_C7N589 Predicted endonuclease related to Holliday junct... 134 8e-31 UniRef50_D1U5W3 Putative uncharacterized protein n=1 Tax=Desulfo... 134 9e-31 UniRef50_C7IKV6 Putative uncharacterized protein n=1 Tax=Clostri... 134 1e-30 UniRef50_A6L1J0 UPF0102 protein BVU_1879 n=26 Tax=Bacteroidales ... 134 1e-30 UniRef50_B3QZF2 UPF0102 protein Ctha_1382 n=1 Tax=Chloroherpeton... 134 1e-30 UniRef50_C0GHM5 Putative uncharacterized protein n=1 Tax=Dethiob... 134 1e-30 UniRef50_A5Z6D1 Putative uncharacterized protein n=1 Tax=Eubacte... 134 1e-30 UniRef50_B3PLA2 Putative uncharacterized protein n=1 Tax=Cellvib... 133 1e-30 UniRef50_Q1MYA7 Putative uncharacterized protein n=1 Tax=Bermane... 133 1e-30 UniRef50_B8KR63 Putative uncharacterized protein n=1 Tax=gamma p... 133 2e-30 UniRef50_A5D1I2 UPF0102 protein PTH_1707 n=1 Tax=Pelotomaculum t... 133 2e-30 UniRef50_C4F8U2 Putative uncharacterized protein n=2 Tax=Collins... 133 2e-30 UniRef50_Q2YCL8 UPF0102 protein Nmul_A0195 n=3 Tax=Nitrosomonada... 133 2e-30 UniRef50_B8HR21 Putative uncharacterized protein n=1 Tax=Cyanoth... 133 2e-30 UniRef50_D1NRZ9 Putative endonuclease n=1 Tax=Bifidobacterium ga... 132 2e-30 UniRef50_C4FZ58 Putative uncharacterized protein n=1 Tax=Abiotro... 132 2e-30 UniRef50_D2R2U4 Putative uncharacterized protein n=1 Tax=Pirellu... 132 2e-30 UniRef50_A5FR87 UPF0102 protein DehaBAV1_0707 n=5 Tax=Dehalococc... 132 3e-30 UniRef50_A3HX52 Putative uncharacterized protein n=1 Tax=Algorip... 132 3e-30 UniRef50_C9LM05 Endonuclease n=1 Tax=Dialister invisus DSM 15470... 132 3e-30 UniRef50_C4K712 Putative uncharacterized protein n=1 Tax=Candida... 132 3e-30 UniRef50_B3ES88 Putative uncharacterized protein n=1 Tax=Candida... 132 3e-30 UniRef50_A6LSN5 UPF0102 protein Cbei_1183 n=5 Tax=Clostridium Re... 132 3e-30 UniRef50_Q2S1J6 UPF0102 protein SRU_1822 n=1 Tax=Salinibacter ru... 132 4e-30 UniRef50_C7MNC2 Predicted endonuclease related to Holliday junct... 132 4e-30 UniRef50_Q0AFH8 UPF0102 protein Neut_1662 n=2 Tax=Proteobacteria... 132 4e-30 UniRef50_B0C8B9 UPF0102 protein AM1_3954 n=1 Tax=Acaryochloris m... 132 5e-30 UniRef50_D2SDZ8 Putative uncharacterized protein n=1 Tax=Geoderm... 131 5e-30 UniRef50_C6IVU3 Putative uncharacterized protein n=1 Tax=Paeniba... 131 5e-30 UniRef50_C9MX50 Endonuclease n=2 Tax=Leptotrichia RepID=C9MX50_9... 131 7e-30 UniRef50_C8WAJ1 Putative uncharacterized protein n=1 Tax=Atopobi... 131 8e-30 UniRef50_C9R878 Putative uncharacterized protein n=1 Tax=Ammonif... 131 8e-30 UniRef50_UPI00016929A4 hypothetical protein Plarl_14719 n=1 Tax=... 131 8e-30 UniRef50_A1K3T3 UPF0102 protein azo0871 n=1 Tax=Azoarcus sp. BH7... 131 8e-30 UniRef50_UPI0001C37581 hypothetical protein RflaF_17327 n=1 Tax=... 131 9e-30 UniRef50_A4SC34 UPF0102 protein Cvib_0014 n=9 Tax=Chlorobiaceae ... 131 9e-30 UniRef50_C4XKL5 Putative uncharacterized protein n=1 Tax=Desulfo... 130 9e-30 UniRef50_D0LAN4 Putative uncharacterized protein n=1 Tax=Gordoni... 130 9e-30 UniRef50_C2G0R5 Possible endonuclease n=2 Tax=Sphingobacterium s... 130 9e-30 UniRef50_Q15PJ2 UPF0102 protein Patl_3694 n=1 Tax=Pseudoalteromo... 130 1e-29 UniRef50_D1PKZ1 Putative choloylglycine hydrolase n=1 Tax=Subdol... 130 1e-29 UniRef50_C6XZ20 Putative uncharacterized protein n=2 Tax=Pedobac... 130 1e-29 UniRef50_C2D6J2 Putative uncharacterized protein n=1 Tax=Atopobi... 130 1e-29 UniRef50_C9MR57 Putative endonuclease n=2 Tax=Prevotella RepID=C... 130 1e-29 UniRef50_C8WGY4 Putative uncharacterized protein n=2 Tax=Eggerth... 130 1e-29 UniRef50_A3N211 UPF0102 protein APL_1363 n=33 Tax=Pasteurellacea... 130 2e-29 UniRef50_D2RIH7 Putative uncharacterized protein n=1 Tax=Acidami... 130 2e-29 UniRef50_Q1QVF6 UPF0102 protein Csal_2201 n=1 Tax=Chromohalobact... 130 2e-29 UniRef50_C1ZN06 Putative uncharacterized protein n=1 Tax=Plancto... 130 2e-29 UniRef50_C0ZFM4 Putative uncharacterized protein n=1 Tax=Breviba... 130 2e-29 UniRef50_C2HKC4 Possible endonuclease n=2 Tax=Finegoldia magna R... 129 2e-29 UniRef50_Q8R616 UPF0102 protein FN1370 n=9 Tax=Fusobacterium Rep... 129 2e-29 UniRef50_A1R7F9 UPF0102 protein AAur_2443 n=3 Tax=Micrococcaceae... 129 2e-29 UniRef50_Q1LHS4 UPF0102 protein Rmet_3430 n=2 Tax=Betaproteobact... 129 2e-29 UniRef50_A1U3H0 UPF0102 protein Maqu_2464 n=3 Tax=Marinobacter R... 129 3e-29 UniRef50_B7K4B3 Putative uncharacterized protein n=4 Tax=Cyanoba... 129 3e-29 UniRef50_Q3A2F1 UPF0102 protein Pcar_2217 n=2 Tax=Deltaproteobac... 129 3e-29 UniRef50_Q2JJU2 UPF0102 protein CYB_2119 n=3 Tax=Synechococcus R... 129 3e-29 UniRef50_C7LP67 Putative uncharacterized protein n=1 Tax=Desulfo... 129 4e-29 UniRef50_B0TH88 Putative uncharacterized protein n=1 Tax=Helioba... 128 4e-29 UniRef50_Q60CC4 UPF0102 protein MCA0184 n=1 Tax=Methylococcus ca... 128 5e-29 UniRef50_Q313K2 UPF0102 protein Dde_1093 n=1 Tax=Desulfovibrio d... 128 5e-29 UniRef50_A4YJR8 UPF0102 protein BRADO0179 n=14 Tax=Rhizobiales R... 128 5e-29 UniRef50_Q1YQG9 Putative uncharacterized protein n=1 Tax=gamma p... 128 5e-29 UniRef50_A8UQV9 Putative uncharacterized protein n=1 Tax=Hydroge... 128 6e-29 UniRef50_A5WCR1 UPF0102 protein PsycPRwf_0497 n=1 Tax=Psychrobac... 128 7e-29 UniRef50_C6VV91 Putative uncharacterized protein n=2 Tax=Flexiba... 128 7e-29 UniRef50_A3DDG4 UPF0102 protein Cthe_0758 n=6 Tax=Clostridia Rep... 128 7e-29 UniRef50_D1SBF6 Putative uncharacterized protein n=1 Tax=Micromo... 127 8e-29 UniRef50_C1AG13 UPF0102 protein JTY_2914 n=20 Tax=Mycobacterium ... 127 8e-29 UniRef50_A1VIW8 UPF0102 protein Pnap_0271 n=10 Tax=Burkholderial... 127 9e-29 UniRef50_Q11XW1 UPF0102 protein CHU_0465 n=1 Tax=Cytophaga hutch... 127 1e-28 UniRef50_C0VVC1 Endonuclease n=2 Tax=Corynebacterium glucuronoly... 127 1e-28 UniRef50_B9ZKW9 Putative uncharacterized protein n=1 Tax=Thioalk... 127 1e-28 UniRef50_A7NKS5 UPF0102 protein Rcas_2007 n=2 Tax=Roseiflexus Re... 127 1e-28 UniRef50_A5FKL6 UPF0102 protein Fjoh_1217 n=17 Tax=Bacteroidetes... 127 1e-28 UniRef50_Q5R0L0 UPF0102 protein IL0423 n=1 Tax=Idiomarina loihie... 127 1e-28 UniRef50_D0GLC9 Putative uncharacterized protein n=1 Tax=Leptotr... 127 1e-28 UniRef50_C0EXX9 Putative uncharacterized protein n=1 Tax=Eubacte... 127 2e-28 UniRef50_A0Z7D7 Putative uncharacterized protein n=1 Tax=marine ... 127 2e-28 UniRef50_Q146Q2 UPF0102 protein Bxeno_A0149 n=9 Tax=Burkholderia... 127 2e-28 UniRef50_A6GLM3 Putative uncharacterized protein n=1 Tax=Limnoba... 127 2e-28 UniRef50_A3YDY3 Putative uncharacterized protein n=1 Tax=Marinom... 127 2e-28 UniRef50_D1W8G8 Putative uncharacterized protein n=1 Tax=Prevote... 126 2e-28 UniRef50_A4AH12 Putative uncharacterized protein n=1 Tax=marine ... 126 2e-28 UniRef50_Q1D6H9 UPF0102 protein MXAN_3551 n=2 Tax=Cystobacterine... 126 2e-28 UniRef50_Q8XUC6 UPF0102 protein RSc3265 n=6 Tax=Proteobacteria R... 126 2e-28 UniRef50_A0YUK1 Putative uncharacterized protein n=2 Tax=Cyanoba... 126 2e-28 UniRef50_C2KRF5 Possible endonuclease n=2 Tax=Mobiluncus mulieri... 126 2e-28 UniRef50_A6SUE7 UPF0102 protein mma_0204 n=4 Tax=Betaproteobacte... 126 3e-28 UniRef50_C9RJM0 Putative uncharacterized protein n=1 Tax=Fibroba... 126 3e-28 UniRef50_C8PW53 Putative uncharacterized protein n=1 Tax=Enhydro... 126 3e-28 UniRef50_Q4FQF2 UPF0102 protein Psyc_1908 n=2 Tax=Psychrobacter ... 125 3e-28 UniRef50_A6VXY8 UPF0102 protein Mmwyl1_2395 n=1 Tax=Marinomonas ... 125 4e-28 UniRef50_D2RAN4 Putative uncharacterized protein n=1 Tax=Gardner... 125 4e-28 UniRef50_D2MIZ7 Putative uncharacterized protein n=1 Tax=Candida... 125 4e-28 UniRef50_B8G6B1 UPF0102 protein Cagg_0930 n=3 Tax=Chloroflexus R... 125 4e-28 UniRef50_C7R327 Putative uncharacterized protein n=1 Tax=Jonesia... 125 5e-28 UniRef50_Q2KU88 UPF0102 protein BAV3162 n=1 Tax=Bordetella avium... 125 5e-28 UniRef50_C9LKQ6 Putative uncharacterized protein n=1 Tax=Prevote... 125 5e-28 UniRef50_C2BVF9 Possible endonuclease n=1 Tax=Mobiluncus curtisi... 124 6e-28 UniRef50_C7MB82 Putative uncharacterized protein n=1 Tax=Brachyb... 124 7e-28 UniRef50_B1XJM9 Putative uncharacterized protein n=1 Tax=Synecho... 124 8e-28 UniRef50_Q8DI54 UPF0102 protein tll1737 n=1 Tax=Thermosynechococ... 124 8e-28 UniRef50_B0MPN2 Putative uncharacterized protein n=1 Tax=Eubacte... 124 1e-27 UniRef50_Q6A7T5 UPF0102 protein PPA1431 n=3 Tax=Propionibacteriu... 124 1e-27 UniRef50_C9PT16 Endonuclease n=5 Tax=Prevotella RepID=C9PT16_9BACT 124 1e-27 UniRef50_B1VG84 Putative uncharacterized protein n=1 Tax=Coryneb... 124 1e-27 UniRef50_A0QVA9 UPF0102 protein MSMEG_2508 n=6 Tax=Corynebacteri... 124 1e-27 UniRef50_Q025A4 UPF0102 protein Acid_2433 n=1 Tax=Candidatus Sol... 123 2e-27 UniRef50_D1ANU1 Putative uncharacterized protein n=1 Tax=Sebalde... 123 2e-27 UniRef50_C2BNU0 Endonuclease n=2 Tax=Corynebacterium RepID=C2BNU... 123 2e-27 UniRef50_Q6FD45 UPF0102 protein ACIAD1132 n=4 Tax=Acinetobacter ... 122 2e-27 UniRef50_B1WNM5 Putative uncharacterized protein n=3 Tax=Chrooco... 122 3e-27 UniRef50_Q24UC6 UPF0102 protein DSY2577 n=2 Tax=Desulfitobacteri... 122 3e-27 UniRef50_B2S4F0 UPF0102 protein TPASS_0913 n=3 Tax=Treponema Rep... 122 3e-27 UniRef50_A6NUN6 Putative uncharacterized protein n=1 Tax=Bactero... 122 3e-27 UniRef50_Q6AEA8 UPF0102 protein Lxx14785 n=1 Tax=Leifsonia xyli ... 122 4e-27 UniRef50_Q55761 UPF0102 protein sll0189 n=1 Tax=Synechocystis sp... 122 4e-27 UniRef50_A6LF20 UPF0102 protein BDI_2565 n=4 Tax=Bacteroidales R... 122 4e-27 UniRef50_Q3AC88 UPF0102 protein CHY_1414 n=1 Tax=Carboxydothermu... 122 4e-27 UniRef50_A3NEP2 UPF0102 protein BURPS668_3819 n=83 Tax=Proteobac... 122 5e-27 UniRef50_B4VZG7 Putative uncharacterized protein n=1 Tax=Microco... 122 5e-27 UniRef50_D1BMP5 Putative uncharacterized protein n=3 Tax=Veillon... 121 6e-27 UniRef50_A4A5E8 Putative uncharacterized protein n=2 Tax=unclass... 121 8e-27 UniRef50_A9B5H2 UPF0102 protein Haur_0145 n=1 Tax=Herpetosiphon ... 121 8e-27 UniRef50_UPI00017886BD protein of unknown function UPF0102 n=1 T... 120 1e-26 UniRef50_C2CWR1 Endonuclease n=1 Tax=Gardnerella vaginalis ATCC ... 120 1e-26 UniRef50_A1W341 UPF0102 protein Ajs_0414 n=11 Tax=Betaproteobact... 120 1e-26 UniRef50_UPI0001BC5BE0 endonuclease n=3 Tax=Fusobacterium RepID=... 120 1e-26 UniRef50_B4WKR8 Putative uncharacterized protein n=1 Tax=Synecho... 120 1e-26 UniRef50_D1BJ87 Predicted endonuclease related to Holliday junct... 120 1e-26 UniRef50_B0MUK7 Putative uncharacterized protein n=1 Tax=Alistip... 120 2e-26 UniRef50_C0WKI9 Endonuclease n=3 Tax=Actinomycetales RepID=C0WKI... 120 2e-26 UniRef50_C0BHK9 Putative uncharacterized protein n=1 Tax=Flavoba... 119 3e-26 UniRef50_A0LCU4 UPF0102 protein Mmc1_3298 n=1 Tax=Magnetococcus ... 119 3e-26 UniRef50_Q2NZA5 UPF0102 protein XOO3617 n=18 Tax=Xanthomonadacea... 119 3e-26 UniRef50_C0WCB9 Endonuclease n=1 Tax=Acidaminococcus sp. D21 Rep... 119 4e-26 UniRef50_A9HJH4 UPF0102 protein GDI1964/Gdia_0189 n=1 Tax=Glucon... 119 4e-26 UniRef50_D1AZ70 Putative uncharacterized protein n=1 Tax=Sulfuro... 118 6e-26 UniRef50_UPI000197ABB4 hypothetical protein GHTCC_11038 n=2 Tax=... 118 6e-26 UniRef50_A5IKG8 UPF0102 protein Tpet_0671 n=6 Tax=Thermotogaceae... 118 7e-26 UniRef50_C7PDZ7 Putative uncharacterized protein n=1 Tax=Chitino... 117 8e-26 UniRef50_Q1IJG5 UPF0102 protein Acid345_3985 n=1 Tax=Candidatus ... 117 8e-26 UniRef50_C7GYG3 Putative choloylglycine hydrolase n=1 Tax=Eubact... 117 9e-26 UniRef50_B9CKG2 Putative uncharacterized protein n=1 Tax=Atopobi... 117 9e-26 UniRef50_Q4JV13 UPF0102 protein jk1180 n=2 Tax=Corynebacterium j... 117 1e-25 UniRef50_A4A171 Putative uncharacterized protein n=2 Tax=Plancto... 117 1e-25 UniRef50_A1VFE8 UPF0102 protein Dvul_2148 n=3 Tax=Desulfovibrio ... 117 1e-25 UniRef50_A7HN69 UPF0102 protein Fnod_1509 n=1 Tax=Fervidobacteri... 117 2e-25 UniRef50_C0E2B0 Putative uncharacterized protein n=2 Tax=Coryneb... 116 2e-25 UniRef50_C1TKL9 Predicted endonuclease related to Holliday junct... 116 2e-25 UniRef50_C0QVG4 UPF0102 protein BHWA1_02005 n=2 Tax=Brachyspira ... 116 2e-25 UniRef50_A1SLR5 UPF0102 protein Noca_3248 n=11 Tax=Actinomycetal... 116 2e-25 UniRef50_A6DUI2 Endonuclease n=1 Tax=Lentisphaera araneosa HTCC2... 116 2e-25 UniRef50_A0Q0X6 UPF0102 protein NT01CX_2205 n=3 Tax=Clostridium ... 116 2e-25 UniRef50_C8NTW9 Choloylglycine hydrolase n=1 Tax=Corynebacterium... 116 3e-25 UniRef50_C6WEV0 Putative uncharacterized protein n=1 Tax=Actinos... 115 3e-25 UniRef50_Q47S60 UPF0102 protein Tfu_0669 n=2 Tax=Nocardiopsaceae... 115 3e-25 UniRef50_B8HA88 UPF0102 protein Achl_2213 n=1 Tax=Arthrobacter c... 115 3e-25 UniRef50_B2IUS6 Putative uncharacterized protein n=4 Tax=Nostoca... 115 4e-25 UniRef50_C0BPF0 Putative uncharacterized protein n=1 Tax=Flavoba... 115 5e-25 UniRef50_B6R5V9 Putative uncharacterized protein n=1 Tax=Pseudov... 115 5e-25 UniRef50_Q118B0 UPF0102 protein Tery_0733 n=1 Tax=Trichodesmium ... 115 5e-25 UniRef50_C5CAH6 Holliday junction resolvase-like endonuclease n=... 115 5e-25 UniRef50_A0LMM6 Putative uncharacterized protein n=1 Tax=Syntrop... 115 5e-25 UniRef50_C7H7V1 Endonuclease n=2 Tax=Faecalibacterium prausnitzi... 115 6e-25 UniRef50_D2ATE9 Putative uncharacterized protein n=1 Tax=Strepto... 114 8e-25 UniRef50_A8F4Z9 UPF0102 protein Tlet_0667 n=1 Tax=Thermotoga let... 114 8e-25 UniRef50_B7K7K1 Putative uncharacterized protein n=1 Tax=Cyanoth... 113 1e-24 UniRef50_Q5FF38 UPF0102 protein ERGA_CDS_00540 n=5 Tax=canis gro... 113 1e-24 UniRef50_C1A4D5 Putative uncharacterized protein n=1 Tax=Gemmati... 113 2e-24 UniRef50_A8HYK3 UPF0102 protein AZC_4471 n=3 Tax=Rhizobiales Rep... 113 2e-24 UniRef50_A0NXE9 Putative uncharacterized protein n=2 Tax=Labrenz... 113 2e-24 UniRef50_C4LJB4 Putative uncharacterized protein n=1 Tax=Coryneb... 113 2e-24 UniRef50_B2RLS5 UPF0102 protein PGN_1801 n=2 Tax=Porphyromonas g... 113 2e-24 UniRef50_A6GIE5 Putative uncharacterized protein n=1 Tax=Plesioc... 113 2e-24 UniRef50_C4KCT6 Putative uncharacterized protein n=3 Tax=Betapro... 113 2e-24 UniRef50_C5CGT1 UPF0102 protein Kole_1919 n=1 Tax=Kosmotoga olea... 112 3e-24 UniRef50_B2UP21 Putative uncharacterized protein n=2 Tax=Verruco... 112 3e-24 UniRef50_C6XIG7 Putative uncharacterized protein n=1 Tax=Hirschi... 112 3e-24 UniRef50_A4CH47 Putative uncharacterized protein n=1 Tax=Robigin... 112 4e-24 UniRef50_Q3M3N9 UPF0102 protein Ava_4800 n=2 Tax=Nostocaceae Rep... 112 4e-24 UniRef50_B4U6T0 Putative uncharacterized protein n=1 Tax=Hydroge... 112 4e-24 UniRef50_Q7UM23 UPF0102 protein RB9115 n=1 Tax=Rhodopirellula ba... 112 5e-24 UniRef50_Q0A6J0 UPF0102 protein Mlg_2205 n=2 Tax=Ectothiorhodosp... 112 5e-24 UniRef50_Q0BTH9 UPF0102 protein GbCGDNIH1_0975 n=1 Tax=Granuliba... 112 5e-24 UniRef50_D1N5L9 Putative uncharacterized protein n=1 Tax=Victiva... 111 6e-24 UniRef50_B6BHT8 Putative uncharacterized protein n=1 Tax=Campylo... 111 7e-24 UniRef50_D2NTN5 Predicted endonuclease distantly related to arch... 111 7e-24 UniRef50_D2LER1 Putative uncharacterized protein n=1 Tax=Rhodomi... 111 8e-24 UniRef50_Q83I01 UPF0102 protein TW312 n=2 Tax=Tropheryma whipple... 110 1e-23 UniRef50_Q0F072 Putative uncharacterized protein n=1 Tax=Maripro... 110 1e-23 UniRef50_A3VPC2 Putative uncharacterized protein n=1 Tax=Parvula... 110 1e-23 UniRef50_Q2IJ48 UPF0102 protein Adeh_1910 n=4 Tax=Anaeromyxobact... 110 1e-23 UniRef50_C7QA44 Putative uncharacterized protein n=1 Tax=Catenul... 110 2e-23 UniRef50_A1HN64 Putative uncharacterized protein n=1 Tax=Thermos... 110 2e-23 UniRef50_C2ANQ3 Predicted endonuclease related to Holliday junct... 110 2e-23 UniRef50_A4X4J0 UPF0102 protein Strop_1320 n=2 Tax=Micromonospor... 110 2e-23 UniRef50_C3XDT2 Putative uncharacterized protein n=1 Tax=Helicob... 109 3e-23 UniRef50_UPI0001C31AB5 protein of unknown function UPF0102 n=1 T... 109 3e-23 UniRef50_UPI00019790C0 hypothetical protein HcinC1_06745 n=2 Tax... 109 3e-23 UniRef50_D1Y396 Putative uncharacterized protein n=1 Tax=Pyramid... 108 4e-23 UniRef50_C1F7M4 Putative uncharacterized protein n=1 Tax=Acidoba... 108 4e-23 UniRef50_Q6NGK0 UPF0102 protein DIP1513 n=1 Tax=Corynebacterium ... 108 5e-23 UniRef50_A7HZ51 UPF0102 protein Plav_3586 n=1 Tax=Parvibaculum l... 108 5e-23 UniRef50_B5JM98 Putative uncharacterized protein n=1 Tax=Verruco... 108 6e-23 UniRef50_D0WR56 Putative endonuclease n=1 Tax=Actinomyces sp. or... 107 8e-23 UniRef50_A8YJ85 Similar to Y189_SYNY3 UPF0102 protein sll0189 n=... 107 9e-23 UniRef50_B6JM54 UPF0102 protein HPP12_0830 n=15 Tax=Epsilonprote... 107 9e-23 UniRef50_D2L5G2 Putative uncharacterized protein n=1 Tax=Desulfo... 107 9e-23 UniRef50_D1B6E0 Putative uncharacterized protein n=1 Tax=Therman... 107 1e-22 UniRef50_B9L042 Putative uncharacterized protein n=2 Tax=Thermom... 107 1e-22 UniRef50_C7NGY4 Predicted endonuclease related to Holliday junct... 107 1e-22 UniRef50_C3JBE2 Putative uncharacterized protein n=2 Tax=Bacteri... 107 1e-22 UniRef50_A3JND8 Putative uncharacterized protein n=1 Tax=Rhodoba... 107 2e-22 UniRef50_A6FT82 PII uridylyl-transferase n=5 Tax=Rhodobacterales... 106 2e-22 UniRef50_C3XN83 Putative uncharacterized protein n=3 Tax=Helicob... 105 3e-22 UniRef50_Q5SLC1 UPF0102 protein TTHA0372 n=5 Tax=Thermaceae RepI... 105 4e-22 UniRef50_Q2RJT6 UPF0102 protein Moth_0988 n=2 Tax=Clostridia Rep... 105 5e-22 UniRef50_C3PH56 Putative uncharacterized protein n=1 Tax=Coryneb... 104 8e-22 UniRef50_C4DPS8 Predicted endonuclease related to Holliday junct... 104 9e-22 UniRef50_O66457 UPF0102 protein aq_041 n=1 Tax=Aquifex aeolicus ... 104 1e-21 UniRef50_Q2KDE4 UPF0102 protein RHE_CH00320 n=4 Tax=Rhizobiales ... 103 2e-21 UniRef50_A9BFT1 UPF0102 protein Pmob_0702 n=1 Tax=Petrotoga mobi... 103 2e-21 UniRef50_B2GFY9 Putative uncharacterized protein n=1 Tax=Kocuria... 103 2e-21 UniRef50_B2IFF3 Putative uncharacterized protein n=1 Tax=Beijeri... 102 4e-21 UniRef50_B0P7L1 Putative uncharacterized protein n=1 Tax=Anaerot... 102 4e-21 UniRef50_C0XSA5 Endonuclease n=1 Tax=Corynebacterium lipophilofl... 102 5e-21 UniRef50_C7JH91 Putative uncharacterized protein n=8 Tax=Acetoba... 102 5e-21 UniRef50_A3K994 Putative uncharacterized protein n=7 Tax=Rhodoba... 101 7e-21 UniRef50_A4EVA8 Putative uncharacterized protein n=1 Tax=Roseoba... 101 7e-21 UniRef50_A3UIK6 Putative uncharacterized protein n=1 Tax=Oceanic... 101 8e-21 UniRef50_A1B931 Putative uncharacterized protein n=1 Tax=Paracoc... 101 8e-21 UniRef50_Q0C451 UPF0102 protein HNE_0764 n=1 Tax=Hyphomonas nept... 101 8e-21 UniRef50_A6Q6T2 UPF0102 protein SUN_0231 n=3 Tax=Epsilonproteoba... 101 9e-21 UniRef50_Q04SX0 UPF0102 protein LBJ_1427 n=4 Tax=Leptospira RepI... 100 1e-20 UniRef50_A9IXC8 UPF0102 protein BT_1882 n=8 Tax=Rhizobiales RepI... 100 1e-20 UniRef50_C9M9E0 Putative uncharacterized protein n=1 Tax=Jonquet... 100 1e-20 UniRef50_B3T5F3 Putative uncharacterized protein family UPF0102 ... 100 1e-20 UniRef50_Q31RH5 UPF0102 protein Synpcc7942_0312 n=2 Tax=Synechoc... 100 1e-20 UniRef50_Q7U7D4 UPF0102 protein SYNW1051 n=3 Tax=Synechococcus R... 100 1e-20 UniRef50_C5BWW3 UPF0102 protein Bcav_2532 n=21 Tax=Actinomycetal... 100 2e-20 UniRef50_D1NDV3 Fimbrial usher protein (Fragment) n=1 Tax=Haemop... 100 2e-20 UniRef50_A4QF37 UPF0102 protein cgR_1859 n=4 Tax=Corynebacterium... 100 2e-20 UniRef50_B5Y8F8 Putative uncharacterized protein n=1 Tax=Coproth... 100 2e-20 UniRef50_A4ECJ4 Putative uncharacterized protein n=1 Tax=Collins... 100 3e-20 UniRef50_A9I0M2 UPF0102 protein Bpet0439 n=14 Tax=Proteobacteria... 100 3e-20 UniRef50_B0T377 UPF0102 protein Caul_0175 n=3 Tax=Caulobacterace... 99 3e-20 UniRef50_B2S8H0 UPF0102 protein BAbS19_I01690 n=50 Tax=Rhizobial... 99 4e-20 UniRef50_Q0AK98 UPF0102 protein Mmar10_3014 n=1 Tax=Maricaulis m... 99 6e-20 UniRef50_A8U078 Predicted endonuclease n=1 Tax=alpha proteobacte... 98 9e-20 UniRef50_Q16B02 UPF0102 protein RD1_1191 n=1 Tax=Roseobacter den... 97 1e-19 UniRef50_Q2VYL8 UPF0102 protein amb4503 n=5 Tax=Alphaproteobacte... 97 2e-19 UniRef50_Q0FQ74 Putative uncharacterized protein n=1 Tax=Roseova... 96 3e-19 UniRef50_UPI00016C4BC8 hypothetical protein GobsU_17186 n=1 Tax=... 96 5e-19 UniRef50_C6QFU1 Putative uncharacterized protein n=1 Tax=Hyphomi... 95 5e-19 UniRef50_A3TTV9 Putative uncharacterized protein n=1 Tax=Oceanic... 95 6e-19 UniRef50_B3CRA6 Putative uncharacterized protein n=2 Tax=Orienti... 94 1e-18 UniRef50_A7ZB75 UPF0102 protein Ccon26_01140 n=23 Tax=Epsilonpro... 94 1e-18 UniRef50_A8ERF6 UPF0102 protein Abu_0255 n=2 Tax=Campylobacteral... 94 2e-18 UniRef50_B1ZZB1 Putative uncharacterized protein n=2 Tax=Opituta... 93 2e-18 UniRef50_A5V3S4 UPF0102 protein Swit_0572 n=4 Tax=Sphingomonadac... 93 2e-18 UniRef50_Q7V7V8 UPF0102 protein PMT_0624 n=2 Tax=Prochlorococcus... 93 3e-18 UniRef50_C1CZ90 UPF0102 protein Deide_03080 n=3 Tax=Deinococcus ... 93 3e-18 UniRef50_C2M936 Putative uncharacterized protein n=1 Tax=Porphyr... 93 3e-18 UniRef50_C0W187 Putative uncharacterized protein n=1 Tax=Actinom... 92 4e-18 UniRef50_A7BDE4 Putative uncharacterized protein n=1 Tax=Actinom... 92 5e-18 UniRef50_A4WPR4 UPF0102 protein Rsph17025_0472 n=7 Tax=Rhodobact... 91 9e-18 UniRef50_A4E8G7 Putative uncharacterized protein n=1 Tax=Collins... 91 1e-17 UniRef50_B8GXN3 UPF0102 protein CCNA_00142 n=4 Tax=Caulobacterac... 90 1e-17 UniRef50_B9KH25 Putative uncharacterized protein n=3 Tax=Anaplas... 90 1e-17 UniRef50_Q28TZ5 Putative uncharacterized protein n=1 Tax=Jannasc... 90 2e-17 UniRef50_B4CV56 Putative uncharacterized protein n=1 Tax=Chthoni... 90 2e-17 UniRef50_Q7NEX4 UPF0102 protein gll3754 n=1 Tax=Gloeobacter viol... 90 2e-17 UniRef50_A5GTL0 Restriction endonuclease-like n=1 Tax=Synechococ... 90 3e-17 UniRef50_Q2GDU7 Putative uncharacterized protein n=1 Tax=Neorick... 90 3e-17 UniRef50_Q3J5H3 UPF0102 protein RHOS4_03930 n=7 Tax=Rhodobactera... 89 4e-17 UniRef50_C6V4V5 Putative uncharacterized protein n=1 Tax=Neorick... 89 5e-17 UniRef50_B0S8P6 Endonuclease n=2 Tax=Leptospira biflexa serovar ... 89 6e-17 UniRef50_C4YXH4 Protein Mlr4633 n=1 Tax=Rickettsia endosymbiont ... 88 8e-17 UniRef50_C8WWC7 Putative uncharacterized protein n=1 Tax=Alicycl... 88 9e-17 UniRef50_A2VU88 Putative uncharacterized protein n=1 Tax=Burkhol... 88 9e-17 UniRef50_Q1GJI4 UPF0102 protein TM1040_0449 n=9 Tax=Rhodobactera... 87 2e-16 UniRef50_A5G0S4 UPF0102 protein Acry_2261 n=1 Tax=Acidiphilium c... 87 2e-16 UniRef50_C7N7P3 Predicted endonuclease related to Holliday junct... 87 2e-16 UniRef50_A5KGE0 Putative uncharacterized protein n=1 Tax=Campylo... 85 5e-16 UniRef50_A8LJ68 UPF0102 protein Dshi_2830 n=1 Tax=Dinoroseobacte... 84 1e-15 UniRef50_A5GLH9 Restriction endonuclease-like n=5 Tax=Chroococca... 84 1e-15 UniRef50_B1M445 UPF0102 protein Mrad2831_2938 n=10 Tax=Alphaprot... 82 4e-15 UniRef50_D1ATK0 Putative uncharacterized protein n=1 Tax=Anaplas... 80 1e-14 UniRef50_B6IVS9 Putative uncharacterized protein n=1 Tax=Rhodosp... 80 2e-14 UniRef50_A9GEX7 UPF0102 protein sce2912 n=1 Tax=Sorangium cellul... 80 3e-14 UniRef50_B3DVH2 Predicted endonuclease n=2 Tax=Verrucomicrobia R... 79 5e-14 UniRef50_C8WHA2 Putative uncharacterized protein n=1 Tax=Eggerth... 79 6e-14 UniRef50_UPI000190D97F hypothetical protein SentesTyp_00923 n=1 ... 79 6e-14 UniRef50_C0W7A2 Putative uncharacterized protein (Fragment) n=1 ... 78 7e-14 UniRef50_Q0I9S1 Uncharacterised protein family protein n=1 Tax=S... 77 1e-13 UniRef50_B9XLY1 Putative uncharacterized protein n=1 Tax=bacteri... 77 2e-13 UniRef50_Q1GWI7 UPF0102 protein Sala_0262 n=5 Tax=Sphingomonadal... 73 3e-12 UniRef50_C0BCN9 Putative uncharacterized protein n=1 Tax=Coproco... 70 2e-11 UniRef50_Q4E9U1 Endonuclease (Fragment) n=5 Tax=Wolbachia RepID=... 69 3e-11 UniRef50_C8W847 Putative uncharacterized protein n=1 Tax=Atopobi... 69 5e-11 UniRef50_Q73VP8 Putative uncharacterized protein n=1 Tax=Mycobac... 64 1e-09 UniRef50_A8A9D3 Putative uncharacterized protein n=1 Tax=Ignicoc... 62 4e-09 UniRef50_C7N801 Predicted endonuclease related to Holliday junct... 62 6e-09 UniRef50_Q3AKE1 Uncharacterised protein family UPF0102 n=2 Tax=C... 61 1e-08 UniRef50_Q8W6V7 Putative uncharacterized protein n=1 Tax=Synecho... 60 2e-08 UniRef50_A6DBD4 Putative uncharacterized protein (Fragment) n=1 ... 59 5e-08 UniRef50_Q8TW03 Predicted endonuclease of the RecB family n=1 Ta... 58 9e-08 UniRef50_C5U625 Putative uncharacterized protein n=1 Tax=Methano... 55 5e-07 UniRef50_O33024 UPF0102 protein ML1607 n=2 Tax=Mycobacterium lep... 54 1e-06 UniRef50_A4GJ57 Putative uncharacterized protein n=1 Tax=uncultu... 54 1e-06 UniRef50_UPI0001699F06 hypothetical protein Epers_29808 n=1 Tax=... 54 2e-06 UniRef50_Q5GSW9 RecB family endonuclease n=1 Tax=Wolbachia endos... 54 2e-06 UniRef50_Q5NWY8 Putative uncharacterized protein n=1 Tax=Aromato... 54 2e-06 UniRef50_Q3BT99 Putative uncharacterized protein n=2 Tax=Xanthom... 52 6e-06 UniRef50_C1E903 Predicted protein n=1 Tax=Micromonas sp. RCC299 ... 50 2e-05 UniRef50_Q9Y9F5 Putative uncharacterized protein n=1 Tax=Aeropyr... 48 1e-04 Sequences not found previously or not previously below threshold: UniRef50_Q6MLA1 Putative uncharacterized protein n=1 Tax=Bdellov... 45 8e-04 UniRef50_D0CIJ2 Putative uncharacterized protein n=2 Tax=Bacteri... 44 0.002 UniRef50_B5IHF1 ATPase n=1 Tax=Aciduliprofundum boonei T469 RepI... 43 0.003 UniRef50_Q7VH71 Putative uncharacterized protein n=1 Tax=Helicob... 43 0.003 UniRef50_D1TJS5 Putative uncharacterized protein n=1 Tax=Burkhol... 43 0.004 UniRef50_B7KKS4 Putative uncharacterized protein n=1 Tax=Cyanoth... 42 0.006 UniRef50_A3DLW4 Endonuclease (RecB family)-like protein n=1 Tax=... 42 0.007 UniRef50_A2BIV6 Endonuclease of RecB family n=1 Tax=Hyperthermus... 42 0.007 UniRef50_B4S2M3 Putative transmembrane protein n=1 Tax=Alteromon... 42 0.008 UniRef50_A7VFU4 Putative uncharacterized protein n=6 Tax=Bacteri... 41 0.010 UniRef50_D2LQ48 DUF234 DEXX-box ATPase n=3 Tax=Aciduliprofundum ... 41 0.012 UniRef50_A8MBK5 Putative uncharacterized protein n=1 Tax=Caldivi... 41 0.013 UniRef50_Q2FT66 Putative uncharacterized protein n=1 Tax=Methano... 41 0.013 UniRef50_B1L6B5 Putative uncharacterized protein n=1 Tax=Candida... 40 0.016 UniRef50_A9KZ92 Putative uncharacterized protein n=2 Tax=Shewane... 40 0.019 UniRef50_C4F8E7 Putative uncharacterized protein n=1 Tax=Collins... 40 0.019 UniRef50_Q50I46 Putative holliday junction resolvase n=1 Tax=Aci... 40 0.020 UniRef50_A3CY54 Restriction endonuclease n=1 Tax=Methanoculleus ... 40 0.025 UniRef50_C5A3A3 Prokaryotic ATPase, AAA superfamily n=1 Tax=Ther... 40 0.029 UniRef50_A1RWI2 Putative uncharacterized protein n=1 Tax=Thermof... 40 0.029 UniRef50_Q3BXS5 Putative uncharacterized protein n=1 Tax=Xanthom... 40 0.030 UniRef50_C1MSX7 Predicted protein n=1 Tax=Micromonas pusilla CCM... 39 0.035 UniRef50_C8SAI5 Restriction endonuclease n=1 Tax=Ferroglobus pla... 39 0.035 UniRef50_A7H236 Putative uncharacterized protein n=1 Tax=Campylo... 39 0.037 UniRef50_Q05TI5 Putative uncharacterized protein n=1 Tax=Synecho... 39 0.045 UniRef50_B0AC80 Putative uncharacterized protein n=1 Tax=Clostri... 38 0.069 UniRef50_C7RS72 WD-40 repeat protein n=3 Tax=Bacteria RepID=C7RS... 38 0.083 UniRef50_B8D4V8 Predicted transcriptional regulator n=1 Tax=Desu... 38 0.090 UniRef50_B8D4V0 Endonuclease (RecB family)-like protein n=1 Tax=... 38 0.093 >UniRef50_A4WEW3 UPF0102 protein Ent638_3585 n=124 Tax=Enterobacteriaceae RepID=Y3585_ENT38 Length = 131 Score = 164 bits (417), Expect = 6e-40, Method: Composition-based stats. Identities = 93/131 (70%), Positives = 115/131 (87%) Query: 1 MATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTT 60 MA +P + P +L+ KQTGDAWE +ARRWLEGKGLRFIAANV+ RGGEIDLIM++G+ Sbjct: 1 MAQIPAGADRPGKLSRKQTGDAWELKARRWLEGKGLRFIAANVHGRGGEIDLIMKDGQVI 60 Query: 61 IFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEV 120 +F+EVR+R+S+ +GGAAASVT +KQHKLLQTA LWLARHNGSFDTVDCRFDVVAFTGN++ Sbjct: 61 VFIEVRFRQSSRFGGAAASVTLAKQHKLLQTAHLWLARHNGSFDTVDCRFDVVAFTGNDI 120 Query: 121 EWIKDAFNDHS 131 EW+K+AF + + Sbjct: 121 EWLKNAFGEDA 131 >UniRef50_D1P1S8 Putative choloylglycine hydrolase n=5 Tax=Enterobacteriaceae RepID=D1P1S8_9ENTR Length = 128 Score = 158 bits (401), Expect = 4e-38, Method: Composition-based stats. Identities = 56/110 (50%), Positives = 76/110 (69%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAA 78 TG +E QA +L+ +GL IA NV R GEIDLIMR+G +FVEVR+R+++ YG A Sbjct: 12 TGRHYENQALAYLQQQGLTLIARNVRCRMGEIDLIMRDGTVLVFVEVRFRKNSDYGNALL 71 Query: 79 SVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAFN 128 SV K+ K+L TA+ WLA+ SF+T CRFD+ A TG + EW+++AFN Sbjct: 72 SVNWHKRRKILATAQYWLAQRQQSFETTPCRFDIYAITGKQFEWVQNAFN 121 >UniRef50_C4ZD68 Putative uncharacterized protein n=1 Tax=Eubacterium rectale ATCC 33656 RepID=C4ZD68_EUBR3 Length = 122 Score = 157 bits (397), Expect = 1e-37, Method: Composition-based stats. Identities = 43/115 (37%), Positives = 60/115 (52%), Gaps = 1/115 (0%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + + G +E A L G +A N R GEID+I ++G T F EV+YRR Sbjct: 7 MNKRSVGSIYEQLAAEQLINMGYSVLACNYRNRFGEIDIIAKDGDTICFCEVKYRRDNGC 66 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAFN 128 G A +V SKQ K++ AR +L +H CRFDV+A +EV +K+AF Sbjct: 67 GRALEAVGYSKQKKIISVARYYLMKHGLDEW-TPCRFDVIAVDDDEVTVLKNAFE 120 >UniRef50_A5KJ12 Putative uncharacterized protein n=2 Tax=Clostridiales RepID=A5KJ12_9FIRM Length = 120 Score = 154 bits (391), Expect = 7e-37, Method: Composition-based stats. Identities = 38/118 (32%), Positives = 57/118 (48%), Gaps = 2/118 (1%) Query: 10 SPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRR 69 + + G +E A +LE G I N R GEID+I ++G +F EV+YR Sbjct: 3 RNVKQNNRSVGAVYEQAAGYYLEQNGYELIEYNYRCRDGEIDIIAKDGDCYVFCEVKYRS 62 Query: 70 SALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAF 127 G +V + KQ K+ + A + +H + CRFDV+ G E+ IK+AF Sbjct: 63 GRQAGNPLEAVDQRKQKKIFRCALYYTVQHGI--EDAQCRFDVIGVEGTEITHIKNAF 118 >UniRef50_A8SP33 Putative uncharacterized protein n=2 Tax=Clostridiales RepID=A8SP33_9FIRM Length = 127 Score = 154 bits (390), Expect = 7e-37, Method: Composition-based stats. Identities = 35/121 (28%), Positives = 66/121 (54%), Gaps = 1/121 (0%) Query: 8 SGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRY 67 S + +++TG +E A +L+ G +A N GEID++ + +FVEV++ Sbjct: 4 SAVLNEYNSRRTGSEYETAACDYLKNCGYDILARNYRVSAGEIDIVAQSDGYIVFVEVKF 63 Query: 68 RRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAF 127 R + G A+ +V KQ ++ + A +L ++ + V RFDV+ +GNE+ I++A+ Sbjct: 64 RSNTHMGAASEAVDHRKQKRISKAALYFLKQYGYGVE-VPVRFDVITVSGNEITHIENAY 122 Query: 128 N 128 + Sbjct: 123 D 123 >UniRef50_A9KLL5 UPF0102 protein Cphy_2398 n=3 Tax=Clostridiales RepID=Y2398_CLOPH Length = 118 Score = 154 bits (389), Expect = 1e-36, Method: Composition-based stats. Identities = 39/117 (33%), Positives = 63/117 (53%), Gaps = 1/117 (0%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + K G E +A +L +G + +A N R GEID++ RE +FVEV+YR + Sbjct: 1 MNKKVEGLTKETEAANYLSEQGYQILARNYRCRLGEIDIVARENGYLVFVEVKYRTNVEK 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAFNDH 130 G ++T KQ ++ TA+ +L + ++ CRFDVV E+ IK+AF+ + Sbjct: 61 GFPEEAITIQKQRRITNTAKYYLLVNRLP-ESTPCRFDVVVMLKEEIRLIKNAFDAY 116 >UniRef50_C1SJ30 Putative uncharacterized protein n=1 Tax=Denitrovibrio acetiphilus DSM 12809 RepID=C1SJ30_9BACT Length = 111 Score = 152 bits (384), Expect = 4e-36, Method: Composition-based stats. Identities = 36/113 (31%), Positives = 59/113 (52%), Gaps = 4/113 (3%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 G E +A +LE +G + N + GEID+I + IFVEV+ R + +G Sbjct: 2 RLLFGKKGEKKAACFLEKQGYAIVEMNYRCKFGEIDIIAEKNGVLIFVEVKTRSTDKFGL 61 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAFN 128 SVT SKQ KL +TA+ ++ + + +FDV++ G+ + I +AF+ Sbjct: 62 GYESVTLSKQQKLFKTAQHYMVENG----EMPAQFDVISIDGDTLTHIPNAFS 110 >UniRef50_UPI0001973CC8 hypothetical protein ClM62_15129 n=1 Tax=Clostridium sp. M62/1 RepID=UPI0001973CC8 Length = 126 Score = 152 bits (384), Expect = 4e-36, Method: Composition-based stats. Identities = 46/127 (36%), Positives = 66/127 (51%), Gaps = 1/127 (0%) Query: 1 MATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTT 60 M R G + +T+ G +E A +LE +G + N R GEID+I REG T Sbjct: 1 MQEEKRRKGPAGRKSTRARGARYEDLAAAFLEKQGYVILEKNFFCRTGEIDIIAREGDTL 60 Query: 61 IFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEV 120 +FVEV+YR+ G A +V KQ K+ + A +L + CRFDVVA G++ Sbjct: 61 VFVEVKYRKDLAAGDPAEAVNERKQEKIRKAAAFYLYARGLPPEQ-PCRFDVVAILGSDF 119 Query: 121 EWIKDAF 127 ++DAF Sbjct: 120 RLLRDAF 126 >UniRef50_Q0AWX4 UPF0102 protein Swol_1475 n=1 Tax=Syntrophomonas wolfei subsp. wolfei str. Goettingen RepID=Y1475_SYNWW Length = 115 Score = 152 bits (384), Expect = 4e-36, Method: Composition-based stats. Identities = 35/116 (30%), Positives = 56/116 (48%), Gaps = 6/116 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 ++ G E A ++L KG + + N + R GE+DL+ + +FVEV+ RRS +G Sbjct: 2 NRELGLWGEELAAQYLRKKGYKILERNFHTRYGELDLVCEKDDNIVFVEVKTRRSTRFGS 61 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN----EVEWIKDAF 127 +VT K L + A L+L F + FDVV+ ++ I +AF Sbjct: 62 PEEAVTPRKIGNLKKAAILYLKSTPRFF--PEISFDVVSILVEDGKSKINHIINAF 115 >UniRef50_B5YIA9 UPF0102 protein THEYE_A1950 n=1 Tax=Thermodesulfovibrio yellowstonii DSM 11347 RepID=Y1950_THEYD Length = 112 Score = 152 bits (384), Expect = 4e-36, Method: Composition-based stats. Identities = 34/114 (29%), Positives = 57/114 (50%), Gaps = 3/114 (2%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + + G E A +L KG + + N GEID+I ++G + +EV+ R S + Sbjct: 1 MARIELGKEGEKLAIDYLLTKGYKILEKNFRTPFGEIDIIAKDGNFIVIIEVKRRLSDKF 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAF 127 G SV +KQ KL + A +++ + RFDV+A ++E I++AF Sbjct: 61 GKPELSVNYTKQQKLKKLALYYISMLKKEY---PVRFDVIAINDKKIEHIENAF 111 >UniRef50_A8MHC8 UPF0102 protein Clos_1471 n=8 Tax=Clostridiaceae RepID=Y1471_ALKOO Length = 117 Score = 151 bits (383), Expect = 6e-36, Method: Composition-based stats. Identities = 34/116 (29%), Positives = 57/116 (49%), Gaps = 4/116 (3%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 K+ G E A +L+ KG R + N R GEID+I T +FVEV+ R S +G Sbjct: 2 NKKIGAIGEQLAVHYLKNKGYRILDCNYRTRLGEIDIIAILNDTIVFVEVKTRSSGAFGT 61 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF----TGNEVEWIKDAF 127 + +V KQ + + ++ +L + D + RFDV+ ++ +++AF Sbjct: 62 PSEAVNYKKQMTIRRVSQQYLLSNRIGEDDWNLRFDVIEVQLIEKKYKINHMENAF 117 >UniRef50_C7R9K3 Putative uncharacterized protein n=1 Tax=Kangiella koreensis DSM 16069 RepID=C7R9K3_KANKD Length = 122 Score = 151 bits (382), Expect = 7e-36, Method: Composition-based stats. Identities = 50/122 (40%), Positives = 72/122 (59%), Gaps = 5/122 (4%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 ++T+Q GD E A +L+ +GL + N N R GEIDLIM + +FVEVR+R + Y Sbjct: 1 MSTRQRGDHVELFAESYLKKQGLTLVEKNFNSRFGEIDLIMLDKSALVFVEVRFRANTSY 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE----VEWIKDAFND 129 G A +V KQ K+++TA+L+L + DCRFDVV+ T + +EW K+AF Sbjct: 61 GSGAETVNFRKQQKIIKTAQLYLQANK-KMQQRDCRFDVVSVTLSAQEPLIEWHKNAFQA 119 Query: 130 HS 131 S Sbjct: 120 PS 121 >UniRef50_Q1JX98 Putative uncharacterized protein n=1 Tax=Desulfuromonas acetoxidans DSM 684 RepID=Q1JX98_DESAC Length = 121 Score = 150 bits (381), Expect = 9e-36, Method: Composition-based stats. Identities = 42/122 (34%), Positives = 59/122 (48%), Gaps = 6/122 (4%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 G E QA +L + R + N GEIDLI+R G+T FVEV+ R+S Sbjct: 2 TQQRLTLGRWGEQQAADYLRRRLYRIVTCNYRCHYGEIDLIVRRGKTLAFVEVKTRKSRC 61 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN----EVEWIKDAFN 128 YG +VT KQ +++ TA+ +L S RFDV+A + ++ I DAF Sbjct: 62 YGTPQEAVTPRKQQQIIATAQHYLTTQQPSTQ--TVRFDVIAINVDGDKTQINHIVDAFE 119 Query: 129 DH 130 H Sbjct: 120 LH 121 >UniRef50_B0P3K1 Putative uncharacterized protein n=2 Tax=Clostridiales RepID=B0P3K1_9CLOT Length = 115 Score = 150 bits (380), Expect = 1e-35, Method: Composition-based stats. Identities = 41/115 (35%), Positives = 68/115 (59%), Gaps = 1/115 (0%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + ++TG EA A +L+ +G + N + GEID+I +E +T +FVEV+YR+ Sbjct: 2 KKNNRETGAKAEAIACWFLKQQGYDVLEQNFYTKVGEIDIIAKEDQTLVFVEVKYRKDDK 61 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAF 127 G A +V + KQ K+ ++A ++L +++ SF+ RFDVV G ++ IK AF Sbjct: 62 KGYPAQAVDQRKQQKIRKSAMIYLKKNHLSFEQ-PIRFDVVEILGKKIRVIKHAF 115 >UniRef50_C4Z0D1 Putative endonuclease n=13 Tax=Clostridiales RepID=C4Z0D1_EUBE2 Length = 115 Score = 150 bits (379), Expect = 1e-35, Method: Composition-based stats. Identities = 38/113 (33%), Positives = 57/113 (50%), Gaps = 1/113 (0%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 + TG E A R+L G + N + GEID+I ++ +FVEV+YR + YG Sbjct: 4 NKRATGADKEQLAARYLVDNGYTVLERNFRNKTGEIDIIAKKDNYIVFVEVKYRSNNKYG 63 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAF 127 A +V KQ + + A+ ++ S + CRFDV+ G V IK+AF Sbjct: 64 YAVEAVNYRKQQIIRRVAQFYITTRYKS-CDIPCRFDVIGIDGETVTHIKNAF 115 >UniRef50_C4V2D8 Endonuclease n=1 Tax=Selenomonas flueggei ATCC 43531 RepID=C4V2D8_9FIRM Length = 118 Score = 150 bits (379), Expect = 2e-35, Method: Composition-based stats. Identities = 44/119 (36%), Positives = 62/119 (52%), Gaps = 6/119 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 ++ K GD EA A R+L +G R +A + GEIDLI ++ T +FVEV+ RRS Sbjct: 1 MSNKVLGDRGEACAARYLGAQGYRILAQKYRTKTGEIDLIAKDHDTLVFVEVKTRRSVRC 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG----NEVEWIKDAFN 128 G A +V KQ +++QTA L+L D CRFD+V + +K AF Sbjct: 61 GLPAEAVNYRKQRRIIQTAMLYLCEKQM--DQTPCRFDIVEVYAAGSEWRIHHLKGAFE 117 >UniRef50_A6FEY4 Putative uncharacterized protein n=1 Tax=Moritella sp. PE36 RepID=A6FEY4_9GAMM Length = 135 Score = 150 bits (379), Expect = 2e-35, Method: Composition-based stats. Identities = 47/134 (35%), Positives = 75/134 (55%), Gaps = 13/134 (9%) Query: 7 RSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGR-------- 58 R + + ++ G+ +E A +L+ +GL +A N R GEIDLI + G Sbjct: 2 RKATLNRKQPRKRGEYFEGIAAEFLQRQGLIILARNFACRQGEIDLICQHGASCDIKSST 61 Query: 59 ---TTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF 115 T +FVEV+YR+ YGGA +++ +KQ KL TA+ ++ RH + + CRFDV+A Sbjct: 62 TLPTLVFVEVKYRQYTHYGGAISAIPVAKQRKLRYTAQYYMVRHGINENYTPCRFDVIAI 121 Query: 116 TG--NEVEWIKDAF 127 G + ++WI +AF Sbjct: 122 EGCSDNIQWITNAF 135 >UniRef50_Q5ZR89 UPF0102 protein lpg2994 n=4 Tax=Legionella RepID=Y2994_LEGPH Length = 118 Score = 149 bits (378), Expect = 2e-35, Method: Composition-based stats. Identities = 43/116 (37%), Positives = 68/116 (58%), Gaps = 3/116 (2%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 T++ G E A +L+ GL + N + R GEIDLIMREG +FVEVR R + +GG Sbjct: 2 TQEKGKFAEQLALNYLKENGLALVMQNYHCRLGEIDLIMREGSYLVFVEVRSRSNMNFGG 61 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG--NEVEWIKDAFND 129 AS+T K+ K+++ ++ ++ D RFDV++ G N++ W+K+AF+ Sbjct: 62 GLASITYEKKQKIIKATSHYMIKYRIQ-DKFPIRFDVISIDGKSNKITWLKNAFDA 116 >UniRef50_C4L9F6 Putative uncharacterized protein n=1 Tax=Tolumonas auensis DSM 9187 RepID=C4L9F6_TOLAT Length = 124 Score = 149 bits (378), Expect = 2e-35, Method: Composition-based stats. Identities = 52/113 (46%), Positives = 74/113 (65%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 + G +E QAR +LE +GL F+ AN + R GE+DLIMRE T +F+EVR+R S YG Sbjct: 12 NRRSKGQHYEQQARCFLEQQGLLFVCANYHCRQGELDLIMRERDTLVFIEVRFRASRDYG 71 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAF 127 GA +SVT +KQHK+ TAR +L + + CRFD+V++ + W+K+AF Sbjct: 72 GALSSVTPAKQHKIRHTARYYLMSQHINEAHQACRFDIVSYDDGQCSWLKNAF 124 >UniRef50_A0KPY4 UPF0102 protein AHA_3896 n=4 Tax=Proteobacteria RepID=Y3896_AERHH Length = 130 Score = 149 bits (376), Expect = 3e-35, Method: Composition-based stats. Identities = 56/109 (51%), Positives = 76/109 (69%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAA 78 G +E A RWL+ +GL+ + N RGGEIDLIMR+G T +FVEVRYR +GGAAA Sbjct: 22 KGQHFEQLAERWLQARGLQPVTRNYRCRGGEIDLIMRQGETLVFVEVRYRSQTSHGGAAA 81 Query: 79 SVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAF 127 SVTR KQHK++ AR + +H + + CRFDV+AF G++ +WI++AF Sbjct: 82 SVTRCKQHKIVLAARHYFKQHAINEASQACRFDVIAFEGDQPDWIQNAF 130 >UniRef50_D1RD77 Putative uncharacterized protein n=1 Tax=Legionella longbeachae D-4968 RepID=D1RD77_LEGLO Length = 119 Score = 148 bits (375), Expect = 4e-35, Method: Composition-based stats. Identities = 45/119 (37%), Positives = 68/119 (57%), Gaps = 3/119 (2%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + T++ G E +A L +GL+ + N R GEIDLIM + +F+EVR R S + Sbjct: 1 MRTQEKGRVAEEKALAHLTKQGLKLVMKNYRCRFGEIDLIMYDKDYLVFIEVRSRVSNQF 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN--EVEWIKDAFNDH 130 GG +SVT +K+ K+L+TA ++ H ++ RFDVV+ G+ + WIKDAF Sbjct: 61 GGGISSVTHTKRQKILKTASCFILEHQ-KYNQFGLRFDVVSIDGDAASISWIKDAFGAD 118 >UniRef50_C0QTY9 UPF0102 protein PERMA_0362 n=2 Tax=Hydrogenothermaceae RepID=Y362_PERMH Length = 116 Score = 148 bits (375), Expect = 5e-35, Method: Composition-based stats. Identities = 35/112 (31%), Positives = 56/112 (50%), Gaps = 2/112 (1%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAA 78 G E +A +L G R + N R GEID+I + T + VEVR + S YG Sbjct: 6 KGKEGEDKAVEYLRNSGYRILERNFRSRFGEIDIIAEDNGTIVIVEVRSKGSTGYGYPEE 65 Query: 79 SVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAFNDH 130 S+ K K+++TA+ +L + + RFD+++ N + IK+AF+ Sbjct: 66 SIDHKKVRKIIKTAQFYLLKRDIKG--KQVRFDIISIVNNNIFHIKNAFDLD 115 >UniRef50_Q3JE65 UPF0102 protein Noc_0355 n=2 Tax=Nitrosococcus oceani RepID=Y355_NITOC Length = 124 Score = 148 bits (375), Expect = 5e-35, Method: Composition-based stats. Identities = 45/121 (37%), Positives = 67/121 (55%), Gaps = 5/121 (4%) Query: 12 RQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSA 71 + T + G+ E A +L+ +GLR N + R GEIDLIM + + +F+EVRYRR Sbjct: 2 KPATHRDKGEQAEQLACHYLQARGLRLTQRNYHCRLGEIDLIMEDRESLVFIEVRYRRKG 61 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT----GNEVEWIKDAF 127 +G A S+T +KQ +L+ A+ +L R CRFDVV T + + W++DAF Sbjct: 62 RFGDAIDSITPAKQARLIAAAQHYLQRTG-GAQNKPCRFDVVGITSEKGADNIMWLRDAF 120 Query: 128 N 128 Sbjct: 121 R 121 >UniRef50_Q6AJE4 UPF0102 protein DP2807 n=1 Tax=Desulfotalea psychrophila RepID=Y2807_DESPS Length = 128 Score = 148 bits (374), Expect = 5e-35, Method: Composition-based stats. Identities = 39/118 (33%), Positives = 63/118 (53%), Gaps = 7/118 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 K+ G E A R+L+ +G + N ++ GEID+I +EG +FVEV+ R ++ +G Sbjct: 5 RKKKGAEGEYLACRFLKKQGYVILQKNYRKKYGEIDIIAQEGGDLVFVEVKTRSNSDWGS 64 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-----VEWIKDAFN 128 A+VT+ KQ K+++ A+ +LA RFDV+ +E E I +AF Sbjct: 65 PVAAVTKQKQRKIIRVAQTYLAE--TELFDEAIRFDVIGIILDENSPPIFELIHNAFE 120 >UniRef50_B0U003 UPF0102 protein Fphi_0415 n=15 Tax=Francisella RepID=Y420_FRAP2 Length = 117 Score = 147 bits (372), Expect = 9e-35, Method: Composition-based stats. Identities = 41/115 (35%), Positives = 66/115 (57%), Gaps = 2/115 (1%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVN-ERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + T + G+ E QA ++L K L+ +A N GEID+I + T +F+EV+YR Sbjct: 1 MKTIEIGNKAEEQASKFLRTKNLQILAQNFKAFPYGEIDIIALDQNTLVFIEVKYRSKTK 60 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAF 127 + A +T SKQ KL+ A ++L + F+ +CRFD++A ++ WIK+AF Sbjct: 61 FAKAEEMLTYSKQQKLINAANIFLQENP-KFENYECRFDLIAINKEDINWIKNAF 114 >UniRef50_D1KBC1 Putative uncharacterized protein n=1 Tax=uncultured SUP05 cluster bacterium RepID=D1KBC1_9GAMM Length = 123 Score = 147 bits (371), Expect = 1e-34, Method: Composition-based stats. Identities = 46/120 (38%), Positives = 70/120 (58%), Gaps = 7/120 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE--GRTTIFVEVRYRRSALY 73 ++ G+ E A +L GL I N + GEID+IM + +T +FVEVRYR++ + Sbjct: 5 KRKVGNQAEDIALEYLSTHGLELIEQNYLTKMGEIDIIMLDKSEQTLVFVEVRYRQNTYF 64 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN----EVEWIKDAFND 129 G AA +V ++KQ KL++TA+ +L +H + CRFDVV + ++ WIKDAF Sbjct: 65 GSAADTVDQNKQAKLVRTAQYYLQQH-SKYQEFICRFDVVGVESDLKYPKINWIKDAFGA 123 >UniRef50_C2KW07 Putative uncharacterized protein n=1 Tax=Oribacterium sinus F0268 RepID=C2KW07_9FIRM Length = 188 Score = 147 bits (371), Expect = 1e-34, Method: Composition-based stats. Identities = 36/115 (31%), Positives = 60/115 (52%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 ++ G +E +A +LE KG + N E+DL+ ++G F+EV+ R+ Sbjct: 74 NISNTLKGKVFEDRAVAFLEEKGYEILERNSRFHHLEMDLVAKDGEMLCFIEVKGRKEHS 133 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAF 127 Y +V R KQ +L A +L +H+ S CRFDVV+ G +++ I++AF Sbjct: 134 YLSGVYAVDRGKQRRLRTWATAYLCKHSYSLTETACRFDVVSIEGEKIQLIQNAF 188 >UniRef50_Q47VU1 UPF0102 protein CPS_4433 n=1 Tax=Colwellia psychrerythraea 34H RepID=Y4433_COLP3 Length = 129 Score = 147 bits (371), Expect = 2e-34, Method: Composition-based stats. Identities = 49/124 (39%), Positives = 76/124 (61%), Gaps = 4/124 (3%) Query: 8 SGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRY 67 S + ++ G E+ A+++L +GLRFI N + R GEIDLIM +G T +FVEV+Y Sbjct: 6 KTSAKNTSSTDKGQVTESYAQQYLSKQGLRFIERNFHSRQGEIDLIMLDGDTYVFVEVKY 65 Query: 68 RRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN----EVEWI 123 R+S +GGA A+++ SKQ+K+ +L ++ + CR DVVA G+ +V W+ Sbjct: 66 RKSKGFGGAIAAISASKQNKVKHCITFYLHQNGLNEYNTPCRVDVVALEGDITQPQVTWL 125 Query: 124 KDAF 127 K+AF Sbjct: 126 KNAF 129 >UniRef50_C8X4T0 Putative uncharacterized protein n=1 Tax=Desulfohalobium retbaense DSM 5692 RepID=C8X4T0_DESRD Length = 134 Score = 146 bits (370), Expect = 2e-34, Method: Composition-based stats. Identities = 43/120 (35%), Positives = 64/120 (53%), Gaps = 6/120 (5%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 +TG E AR +LE G +A N GGE+DL+ R GR IFVEV+ R S+ Sbjct: 2 SARHLKTGRDGEEAARAYLESCGYVIVARNWRGGGGELDLVCRLGREIIFVEVKTRASSG 61 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF----TGNEVEWIKDAFN 128 ++T +KQ +L++ A +L+R+ CRFDV++ +GN+VE +AF Sbjct: 62 RTLPIQALTPAKQQRLIRAASAYLSRNR--LWETPCRFDVISVFSGPSGNQVEHCTNAFE 119 >UniRef50_B2V8B3 UPF0102 protein SYO3AOP1_0546 n=2 Tax=Sulfurihydrogenibium RepID=Y546_SULSY Length = 115 Score = 146 bits (370), Expect = 2e-34, Method: Composition-based stats. Identities = 37/115 (32%), Positives = 63/115 (54%), Gaps = 2/115 (1%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + Q G +E +A R+LE G + + N + GEID+I +FVEV+ R + + Sbjct: 1 MDKTQKGKFFEDKAVRYLESIGYKVLHKNYRSKYGEIDIIAETDNVIVFVEVKGRFTENF 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAFN 128 G S+T+ K K+++TA ++ +N RFDVVA GN++ +++AF+ Sbjct: 61 GSGEESITKKKIDKIVKTALQFIEENNLQGKDF--RFDVVALKGNQIFHLENAFS 113 >UniRef50_B8J1T1 Putative uncharacterized protein n=1 Tax=Desulfovibrio desulfuricans subsp. desulfuricans str. ATCC 27774 RepID=B8J1T1_DESDA Length = 160 Score = 145 bits (368), Expect = 3e-34, Method: Composition-based stats. Identities = 44/133 (33%), Positives = 61/133 (45%), Gaps = 7/133 (5%) Query: 4 VPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFV 63 R + + G A E A L G G +A N + E+D++ +G T +FV Sbjct: 19 KSVRPATAPAAAHLRLGSAGEDAAAELLTGAGCTLLARNWRQARLELDMVCLDGDTIVFV 78 Query: 64 EVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN----E 119 EV+ R S YGG A +V SKQ L + AR WLA H CRFDV+ N Sbjct: 79 EVKTRSSERYGGPAYAVGLSKQRVLCRAARAWLAAH--EAWDKPCRFDVICVLRNGDTLH 136 Query: 120 VEWIKDAFN-DHS 131 +E + AF+ + Sbjct: 137 LEHFRHAFDCPPA 149 >UniRef50_C5BS52 Putative uncharacterized protein n=1 Tax=Teredinibacter turnerae T7901 RepID=C5BS52_TERTT Length = 129 Score = 145 bits (367), Expect = 4e-34, Method: Composition-based stats. Identities = 54/126 (42%), Positives = 79/126 (62%), Gaps = 2/126 (1%) Query: 3 TVPTRSGSPRQLTTKQT-GDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTI 61 P R+ + +Q T ++ GD E A+++L +GL +A N R GEIDLIM+ T + Sbjct: 4 PNPFRTPTGKQPTARRKTGDLAEDAAQQYLISQGLTPVARNYRSRFGEIDLIMQHASTLV 63 Query: 62 FVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVE 121 FVEVRYR ++ YG +AA+VT SKQ+K+ QTA+ ++ S + RFDVV +G + + Sbjct: 64 FVEVRYRANSRYGSSAATVTASKQNKIRQTAQQFIIDKKLS-ANLALRFDVVGMSGTQTQ 122 Query: 122 WIKDAF 127 WIK AF Sbjct: 123 WIKGAF 128 >UniRef50_A9NAA4 UPF0102 protein COXBURSA331_A1934 n=6 Tax=Coxiella burnetii RepID=Y1934_COXBR Length = 120 Score = 145 bits (366), Expect = 5e-34, Method: Composition-based stats. Identities = 46/114 (40%), Positives = 68/114 (59%), Gaps = 2/114 (1%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 T++ G E A R+L+ +GL FI N + GEIDLIM + +F+EVRYRR + + Sbjct: 5 TQKIGFNAEKTACRYLQKQGLSFITKNFRYKQGEIDLIMSDQSMLVFIEVRYRRFSDFIH 64 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-VEWIKDAFN 128 A+VT KQ +L++TA +L +H D + CRFD+V T + + WIK+A Sbjct: 65 PVATVTPLKQRRLIKTALHYLQKHRL-LDKISCRFDIVGITADRQITWIKNAIE 117 >UniRef50_D1VVL2 Putative uncharacterized protein n=1 Tax=Peptoniphilus lacrimalis 315-B RepID=D1VVL2_9FIRM Length = 117 Score = 145 bits (366), Expect = 5e-34, Method: Composition-based stats. Identities = 36/116 (31%), Positives = 61/116 (52%), Gaps = 4/116 (3%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + K+ G+ E+ A +L+ K + N G EID+I ++G +FVEV+ RR+ + Sbjct: 1 MNNKELGNFGESLATDFLQKKNYIILDRNYRALGTEIDIIAKDGEELVFVEVKTRRNHKF 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN--EVEWIKDAF 127 G A +VT K ++QTA +++ +H RFDV+ N + I++AF Sbjct: 61 GEAYEAVTEFKMRNIIQTANVYIYKH--ELYNTQVRFDVIEVYINEKRINHIENAF 114 >UniRef50_Q7N090 UPF0102 protein plu4003 n=2 Tax=Enterobacteriaceae RepID=Y4003_PHOLL Length = 126 Score = 145 bits (366), Expect = 5e-34, Method: Composition-based stats. Identities = 59/119 (49%), Positives = 86/119 (72%) Query: 12 RQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSA 71 ++LT+ G +EAQA+ +L+ +GL FIAANV GGEIDLIM++ +T +F+EVR+R+S Sbjct: 4 KKLTSYLLGRNYEAQAKLFLQKQGLSFIAANVKVHGGEIDLIMKDKQTWVFIEVRFRKSG 63 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAFNDH 130 YG A A++TRSK+ KLL A +WL + F+T CRFD+ A TG + EW+++AFN + Sbjct: 64 QYGDALATITRSKRKKLLHAAAVWLFQRGECFETSSCRFDICAITGQQFEWLQNAFNQN 122 >UniRef50_C0N677 Putative uncharacterized protein n=1 Tax=Methylophaga thiooxidans DMS010 RepID=C0N677_9GAMM Length = 120 Score = 144 bits (365), Expect = 7e-34, Method: Composition-based stats. Identities = 48/119 (40%), Positives = 73/119 (61%), Gaps = 6/119 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + ++ G E Q + L+ +G+R I N RGGEIDLIM++ T +F+EVRYR+SA + Sbjct: 1 MFAREKGQQIEKQVAKHLQKQGMRLITRNYQCRGGEIDLIMQDRETLVFIEVRYRQSARF 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF----TGNEVEWIKDAFN 128 G A SV ++KQ +++ TA +L + + CRFDVVA TG + +W+K+AF Sbjct: 61 GSALESVNKTKQSRIIHTAEHYLQQSRDGYQ--ACRFDVVAVSPAKTGYQFDWVKNAFQ 117 >UniRef50_B6ELI6 UPF0102 protein VSAL_I2655 n=7 Tax=Vibrionaceae RepID=Y2655_ALISL Length = 123 Score = 144 bits (365), Expect = 7e-34, Method: Composition-based stats. Identities = 49/119 (41%), Positives = 70/119 (58%), Gaps = 2/119 (1%) Query: 11 PRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRS 70 ++ + G+ +E A+R+LE L FI N + GE+DLIMR+ + +FVEV+YR S Sbjct: 2 EKKPNKRIKGEYYELMAKRYLETHQLTFIERNFYSKTGELDLIMRDRDSFVFVEVKYRAS 61 Query: 71 ALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT--GNEVEWIKDAF 127 + YG A VT KQ KL +TA WL ++ S + RFDVVA G ++ WIK+A Sbjct: 62 SNYGSAQEMVTWQKQRKLQRTALFWLMKNGLSVEHTSFRFDVVAIHSQGQDINWIKNAI 120 >UniRef50_A5F986 UPF0102 protein VC0395_A0112/VC395_0597 n=28 Tax=Vibrio RepID=Y1312_VIBC3 Length = 122 Score = 144 bits (365), Expect = 8e-34, Method: Composition-based stats. Identities = 46/117 (39%), Positives = 74/117 (63%), Gaps = 2/117 (1%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 ++ G+ +E A +L +GL + NVN R GE+DLIMR+G T +FVEVRYR + +G Sbjct: 5 NSRHQGNHYEQMAADYLRRQGLTLVTQNVNYRFGELDLIMRDGNTLVFVEVRYRNNTQHG 64 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT--GNEVEWIKDAFND 129 AA +VTR+K+ +L++ A W+ + + + D RFDV+A G ++W+K+A + Sbjct: 65 HAAETVTRTKRARLIKAANCWMLANKMNSHSADFRFDVIAIHQQGQHIDWLKNAITE 121 >UniRef50_B8CW28 Putative uncharacterized protein n=1 Tax=Halothermothrix orenii H 168 RepID=B8CW28_HALOH Length = 116 Score = 144 bits (364), Expect = 8e-34, Method: Composition-based stats. Identities = 40/118 (33%), Positives = 62/118 (52%), Gaps = 6/118 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + ++ GD E +A R+L+ KG + I N GEID+I + +FVEV+ RRS Y Sbjct: 1 MQNRELGDWGEKKAVRYLKSKGYQVIKTNYRCLIGEIDIIAIDNNFLVFVEVKTRRSIAY 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE----VEWIKDAF 127 G A +V KQ K+ + AR +L + + RFDV++ ++ IK+AF Sbjct: 61 GVPACAVNFDKQKKIRKVARHYLKSN--MINKYQIRFDVISIIVKNNRGFLKHIKNAF 116 >UniRef50_A3XHA2 Putative uncharacterized protein n=1 Tax=Leeuwenhoekiella blandensis MED217 RepID=A3XHA2_9FLAO Length = 118 Score = 144 bits (364), Expect = 8e-34, Method: Composition-based stats. Identities = 31/118 (26%), Positives = 54/118 (45%), Gaps = 7/118 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + + G E A +LE KG + N E+D+I + + VEV+ R S + Sbjct: 1 MNHNELGKWGEEYAANYLEKKGYELLERNWFFNKAELDIIALKNNQLVVVEVKTRNSDFF 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF----TGNEVEWIKDAF 127 G VT +K L++ ++ ++ ++ RFDV+A T ++E +DAF Sbjct: 61 GDPQDFVTPAKIKLLVKATNEYIISNDL---DLEVRFDVIAVLKNKTQEQLEHFEDAF 115 >UniRef50_A0YER5 Putative uncharacterized protein n=1 Tax=marine gamma proteobacterium HTCC2143 RepID=A0YER5_9GAMM Length = 128 Score = 144 bits (364), Expect = 8e-34, Method: Composition-based stats. Identities = 46/118 (38%), Positives = 66/118 (55%), Gaps = 6/118 (5%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 G E +A WL+ +GL+ +A N + GEID+IM +G+ +FVEVRYR+SA +G Sbjct: 10 KNTNFGAYVEEKAYHWLQQQGLKSVALNYRCKTGEIDIIMLDGQQLVFVEVRYRKSASFG 69 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-----EVEWIKDAF 127 SV R KQ K+ + A +L F+ + CRFDV+A + W+KDAF Sbjct: 70 DGLESVDRRKQQKIQKAAAHFLTDRP-GFNHLPCRFDVIAAKPSSDSSLHWNWVKDAF 126 >UniRef50_C0GQX2 Putative uncharacterized protein n=1 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GQX2_9DELT Length = 132 Score = 144 bits (364), Expect = 8e-34, Method: Composition-based stats. Identities = 40/119 (33%), Positives = 55/119 (46%), Gaps = 5/119 (4%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 G E AR +L G R N RGGE+DL+ G +FVEV+ R Sbjct: 2 SAHNLNLGRYGEEVARDYLTENGYRIKERNWRARGGELDLVCTCGDCIVFVEVKTRAEEG 61 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE---VEWIKDAFN 128 G S+ +Q KLL+TA L+L+RHN + RFD + T +E I++A Sbjct: 62 MGHPLESLGFKQQKKLLRTAGLYLSRHNM--WSSQSRFDFICVTVGREVQIEHIQNAIE 118 >UniRef50_A1SU47 UPF0102 protein Ping_1176 n=2 Tax=Psychromonas RepID=Y1176_PSYIN Length = 122 Score = 144 bits (364), Expect = 9e-34, Method: Composition-based stats. Identities = 46/118 (38%), Positives = 67/118 (56%), Gaps = 5/118 (4%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + G E QA +L+ +GLR I N R GEIDLIM + T +F+EVRYR+++ + Sbjct: 8 QASNSKGVLAEKQALSYLQEQGLRLICQNYYCRFGEIDLIMIDQDTLVFIEVRYRKNSDF 67 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT--GNEVEWIKDAFND 129 GG AS+ +SKQ K++ TA+ +L D CRFD +A WI++AF + Sbjct: 68 GGPFASINKSKQRKIITTAKHYL---RTLEDEPFCRFDAIAIDSKSTTPAWIQNAFQE 122 >UniRef50_A1AN88 UPF0102 protein Ppro_1186 n=11 Tax=Deltaproteobacteria RepID=Y1186_PELPD Length = 140 Score = 144 bits (363), Expect = 1e-33, Method: Composition-based stats. Identities = 39/132 (29%), Positives = 60/132 (45%), Gaps = 9/132 (6%) Query: 4 VPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRT--TI 61 S + + TG E A +L +G R + N +GGE+D++ R + Sbjct: 8 RQESPSSTARPDNRNTGSRGEEIATSFLGQQGYRILERNFRCKGGELDIVARAPGERSLV 67 Query: 62 FVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-- 119 FVEV+ RR YG +VT KQ ++ + A WL+R+ RFDV+A + Sbjct: 68 FVEVKTRRDRSYGPPQLAVTPFKQRQISKAALTWLSRN--HLHDSQARFDVIAILLEDGG 125 Query: 120 ---VEWIKDAFN 128 +E I +AF Sbjct: 126 RHSIEHIVNAFE 137 >UniRef50_Q67PD3 UPF0102 protein STH1475 n=1 Tax=Symbiobacterium thermophilum RepID=Y1475_SYMTH Length = 118 Score = 144 bits (363), Expect = 1e-33, Method: Composition-based stats. Identities = 46/119 (38%), Positives = 64/119 (53%), Gaps = 7/119 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 +++ G+A E A +L G R IA NV R GEIDLI ++G +FVEV+ RR YG Sbjct: 2 SRRVGEAGEQAAAEFLTASGYRIIARNVRFRSGEIDLIAQDGGVLVFVEVKTRRGRRYGT 61 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-----EVEWIKDAFND 129 +VT +KQ +L + A L+LAR + CRFDVV I++AF+ Sbjct: 62 PGEAVTAAKQRRLARLASLYLARLGS--EPPPCRFDVVEVEPGPDGRLRCRLIQNAFHA 118 >UniRef50_B7GSU4 UPF0102 protein Blon_1698 n=12 Tax=Bifidobacterium RepID=Y1698_BIFLI Length = 124 Score = 143 bits (362), Expect = 2e-33, Method: Composition-based stats. Identities = 45/122 (36%), Positives = 59/122 (48%), Gaps = 5/122 (4%) Query: 11 PRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGR-TTIFVEVRYRR 69 R LT KQ G E A WLE G ++ N + R GE+D++M T +FVEV+ RR Sbjct: 3 DRNLTPKQFGALGEQYAAAWLEEHGWTTLSRNWHTRYGELDIVMLNPEYTVVFVEVKSRR 62 Query: 70 SALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE----VEWIKD 125 S YG ++T +KQH L + A WL RFDVV V I++ Sbjct: 63 SMHYGYPQEAITPAKQHNLRKAACDWLLDRRNRVPHTAVRFDVVTIVLRVGRPLVHHIEN 122 Query: 126 AF 127 AF Sbjct: 123 AF 124 >UniRef50_Q2S9Y0 UPF0102 protein HCH_05895 n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Y5895_HAHCH Length = 124 Score = 143 bits (361), Expect = 2e-33, Method: Composition-based stats. Identities = 42/123 (34%), Positives = 68/123 (55%), Gaps = 5/123 (4%) Query: 9 GSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYR 68 R + + G A E+QA ++ +G + N +GGEIDLI R G +F+EVR+R Sbjct: 2 PFKRLIKSIDIGRAAESQAEKFARAQGFTIVERNFRCKGGEIDLIARHGEHLVFIEVRHR 61 Query: 69 RSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVV---AFTGNEVEWIKD 125 S +G AA S+T+ KQ +++ A ++L + + + CRFDV+ + +WI D Sbjct: 62 SSDKFGSAAESITQKKQQRIILAANIYLQKKGLT--NMPCRFDVIVGNLKSNTGFQWIPD 119 Query: 126 AFN 128 AF+ Sbjct: 120 AFS 122 >UniRef50_Q0VS15 UPF0102 protein ABO_0585 n=2 Tax=Alcanivorax RepID=Y585_ALCBS Length = 125 Score = 142 bits (360), Expect = 2e-33, Method: Composition-based stats. Identities = 48/119 (40%), Positives = 69/119 (57%), Gaps = 6/119 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + K TG E +A +WL G+GL + N + R GEIDLI+ + T +F EVR+R+ Y Sbjct: 5 RSKKNTGRDAEKRAAKWLTGQGLSIVERNFHCRQGEIDLILLDQETLVFTEVRWRKHQSY 64 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-----VEWIKDAF 127 GGA ASV + KQ +L+ A+ +LARH CRFDV+ + +WI++AF Sbjct: 65 GGALASVDQHKQRRLINAAQHFLARHP-EHHHRPCRFDVLGMEPDSQQAVLYQWIQNAF 122 >UniRef50_C7RDQ1 Putative uncharacterized protein n=1 Tax=Anaerococcus prevotii DSM 20548 RepID=C7RDQ1_ANAPD Length = 115 Score = 142 bits (360), Expect = 2e-33, Method: Composition-based stats. Identities = 36/114 (31%), Positives = 62/114 (54%), Gaps = 4/114 (3%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 K+ GD E +L+ K +A N + GEID++ + +FVEV+ R++A + Sbjct: 4 KKEFGDYGENLVEGYLKDKSYEILARNYRKPFGEIDIVAKLSDMIVFVEVKTRKNANFAS 63 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF--TGNEVEWIKDAF 127 A +VT SKQ K++Q ++ +L +N + + RFDV E+ +I++AF Sbjct: 64 PAEAVTPSKQRKVIQASQAFLIENNMT--DMLMRFDVAEVIADKGEINYIENAF 115 >UniRef50_A4BQJ8 Putative uncharacterized protein n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BQJ8_9GAMM Length = 119 Score = 142 bits (360), Expect = 3e-33, Method: Composition-based stats. Identities = 49/118 (41%), Positives = 67/118 (56%), Gaps = 4/118 (3%) Query: 12 RQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSA 71 R + G EA+A +L+ +GLR + N + R GEIDLIM + +FVEVR R + Sbjct: 3 RGPNPRTLGKQAEARALEFLQRRGLRCLQRNFHTRLGEIDLIMEDTGEVVFVEVRQRATK 62 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG-NEVEWIKDAFN 128 +GGA SVT K+ +L+ AR +L H CRFDV+A G +EWI+DAF Sbjct: 63 RFGGALESVTPVKRQRLIAAARYYLLTHAP---NAACRFDVIAIDGQGSIEWIRDAFQ 117 >UniRef50_Q1Q244 Putative uncharacterized protein n=1 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q244_9BACT Length = 149 Score = 142 bits (360), Expect = 3e-33, Method: Composition-based stats. Identities = 37/127 (29%), Positives = 63/127 (49%), Gaps = 8/127 (6%) Query: 7 RSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVR 66 + S Q K G E A ++L+ KG + + N + GEID+I + + +FVEV+ Sbjct: 20 KKTSDVQPHKKALGKKGEVVAAKFLKKKGYKILQRNYRRKTGEIDIICYDRGSIVFVEVK 79 Query: 67 YRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF------TGNEV 120 R S YG +VT +K+ ++++ A ++A + +D RFDVV+ + Sbjct: 80 TRGSDSYGPPELAVTEAKKKQIIKMASRYIAEKKV--EGIDLRFDVVSVFYPPAKKHPAI 137 Query: 121 EWIKDAF 127 K+AF Sbjct: 138 TLYKNAF 144 >UniRef50_B9MQX5 UPF0102 protein Athe_0977 n=1 Tax=Anaerocellum thermophilum DSM 6725 RepID=Y977_ANATD Length = 119 Score = 142 bits (359), Expect = 3e-33, Method: Composition-based stats. Identities = 42/120 (35%), Positives = 59/120 (49%), Gaps = 7/120 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + KQ G E A +L G + N R GEID+I +E +T +FVEV+ R+S + Sbjct: 1 MNLKQVGRFGENLAVDFLIKHGYEILRTNFRCRLGEIDIIAKEDKTIVFVEVKTRKSLKF 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF------TGNEVEWIKDAF 127 G + SV KQ + + A ++A H S D RFDVV ++ IKDAF Sbjct: 61 GLPSESVNFKKQLHIKKVAEYFIAYH-LSQDKYLYRFDVVEIFIDGKNNVTKINLIKDAF 119 >UniRef50_C4GB01 Putative uncharacterized protein n=1 Tax=Shuttleworthia satelles DSM 14600 RepID=C4GB01_9FIRM Length = 116 Score = 142 bits (359), Expect = 3e-33, Method: Composition-based stats. Identities = 41/113 (36%), Positives = 60/113 (53%), Gaps = 1/113 (0%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 ++ G +E +A +L G+GL + N + R GEIDL+ REG +FVEV+YRRS +G Sbjct: 5 NRRKEGSFYERRAGDYLTGQGLTLVEFNFSCRLGEIDLVAREGTCLVFVEVKYRRSRRFG 64 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAF 127 +V SK + + A + H T RFD+VA G E+ + AF Sbjct: 65 LPEEAVGPSKMRTIRKVAGYYCLTHGI-CQTTPVRFDLVAIEGEEIRHYRGAF 116 >UniRef50_B0G5Y9 Putative uncharacterized protein n=4 Tax=Clostridiales RepID=B0G5Y9_9FIRM Length = 122 Score = 142 bits (359), Expect = 4e-33, Method: Composition-based stats. Identities = 45/119 (37%), Positives = 64/119 (53%), Gaps = 6/119 (5%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + ++TG +E +A +LE G + + N R GEIDLI R+G +FVEV+YR + + Sbjct: 3 KKNNRRTGTGYERKAGAYLESLGYKIVTYNYRCRLGEIDLIARDGEYLVFVEVKYRTTGV 62 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG----NEVEWIKDAF 127 G A +V KQ + + A +L + D V CRFDVVA G E+ KDAF Sbjct: 63 SGYPAEAVDARKQQTIAKCAMHFLMKQGN--DDVPCRFDVVAIAGAEGQEEITLYKDAF 119 >UniRef50_B3EJJ5 UPF0102 protein Cphamn1_0017 n=1 Tax=Chlorobium phaeobacteroides BS1 RepID=Y017_CHLPB Length = 126 Score = 142 bits (358), Expect = 4e-33, Method: Composition-based stats. Identities = 38/122 (31%), Positives = 57/122 (46%), Gaps = 12/122 (9%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 G E A +L KG R + N R EID+I + RT F+EV+ R SA G Sbjct: 5 PHDLGRQGEHTAVTFLIEKGYRILQRNYRHRRNEIDIIALDRRTLCFIEVKTRSSASKGH 64 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN----------EVEWIKD 125 +VT KQ ++++ A +L+ + CRFDV+A + ++E I + Sbjct: 65 PLEAVTPEKQKEIIRAATAYLSAYPSPEPD--CRFDVIAIIAHDFTNGRIREFKLEHITN 122 Query: 126 AF 127 AF Sbjct: 123 AF 124 >UniRef50_C6A8H5 Putative uncharacterized protein n=4 Tax=Bifidobacterium animalis subsp. lactis RepID=C6A8H5_BIFLB Length = 158 Score = 141 bits (357), Expect = 5e-33, Method: Composition-based stats. Identities = 41/119 (34%), Positives = 57/119 (47%), Gaps = 5/119 (4%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIM-REGRTTIFVEVRYRRSAL 72 LT KQ G E R WL +A N + R GEID+I T +FVEV+ RRS Sbjct: 40 LTAKQIGSLGERLCRAWLIEHHWHVLACNWHCRFGEIDIIALTSHSTIVFVEVKTRRSTS 99 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE----VEWIKDAF 127 G +V +KQ ++ + A WL H + + RFDV+A T + + +AF Sbjct: 100 CGIPEEAVHAAKQMRVRRAAICWLGEHGSTIRHIGVRFDVIAVTVTPTDVFIHHVPEAF 158 >UniRef50_Q7MNW2 UPF0102 protein VV0603 n=80 Tax=Vibrionales RepID=Y603_VIBVY Length = 122 Score = 141 bits (357), Expect = 6e-33, Method: Composition-based stats. Identities = 49/115 (42%), Positives = 76/115 (66%), Gaps = 2/115 (1%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 + + G+ +E+ A+ +L+ +GLRFI AN + GEIDLI +E +T +FVEV+YR+++ YG Sbjct: 5 SRRAIGNQYESLAKEYLQRQGLRFIEANFTTKVGEIDLIFKEAQTIVFVEVKYRKNSCYG 64 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF--TGNEVEWIKDAF 127 AA V +K +KL++TA LWL +H + RFDVVA G+++ WI +A Sbjct: 65 DAAEMVNPAKANKLIKTAYLWLNKHGYNACNTAMRFDVVAIHSNGHDINWIANAI 119 >UniRef50_Q8R5S3 UPF0102 protein TTE1452 n=9 Tax=Thermoanaerobacterales RepID=Y1452_THETN Length = 122 Score = 141 bits (357), Expect = 6e-33, Method: Composition-based stats. Identities = 36/123 (29%), Positives = 61/123 (49%), Gaps = 9/123 (7%) Query: 12 RQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSA 71 +++ K G E A ++L KG + + N + GEIDLI +FVEV+ R S Sbjct: 2 KKVNKKTVGSVGEKIAAQYLSKKGYKILEKNFKCKIGEIDLIALYKNQIVFVEVKTRTSV 61 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF-------TGNEVEWIK 124 +G + +V KQ K+++ A++++A +F RFD++ T +V I Sbjct: 62 NFGLPSEAVDFHKQQKIVKIAQVYIAS--SNFKQYQPRFDIIEVYLNPEKLTLEKVNHIL 119 Query: 125 DAF 127 +AF Sbjct: 120 NAF 122 >UniRef50_D1CBL5 Putative uncharacterized protein n=1 Tax=Thermobaculum terrenum ATCC BAA-798 RepID=D1CBL5_THET1 Length = 125 Score = 141 bits (356), Expect = 7e-33, Method: Composition-based stats. Identities = 36/120 (30%), Positives = 51/120 (42%), Gaps = 6/120 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 +K G E A +L KG + IA N R GEID+I ++ +FVEV+ R S G Sbjct: 2 SKSLGRIGEDYACNFLLSKGYKLIARNWRCRQGEIDIIFQDKDEIVFVEVKTRSSLSLGT 61 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT---GNEV---EWIKDAFND 129 S+ K +LL A++W+ RFD V T V I++ Sbjct: 62 PEESIDMHKARQLLTLAKIWIFECYDGEKDPPVRFDAVTVTISRSGRVIDSNHIQNCIMP 121 >UniRef50_B7RSM7 Putative uncharacterized protein n=1 Tax=marine gamma proteobacterium HTCC2148 RepID=B7RSM7_9GAMM Length = 124 Score = 141 bits (356), Expect = 7e-33, Method: Composition-based stats. Identities = 47/121 (38%), Positives = 69/121 (57%), Gaps = 7/121 (5%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 ++ KQ GD +E +A +L +G+ + N R GEIDLI R+ +F+EVR RR+ Sbjct: 3 GISMKQIGDEYERRAAHFLSQQGVEVLICNYRCRCGEIDLIARQNDYLVFIEVRARRNPR 62 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE------VEWIKDA 126 + AAASV KQ +LL+TA+ +L RH + CRFDV+ F + +WI+ A Sbjct: 63 FATAAASVDYRKQQRLLRTAQFFLQRH-TKLANLPCRFDVITFEPRQSTANDSPQWIRGA 121 Query: 127 F 127 F Sbjct: 122 F 122 >UniRef50_A3WNE6 Predicted endonuclease n=1 Tax=Idiomarina baltica OS145 RepID=A3WNE6_9GAMM Length = 116 Score = 140 bits (355), Expect = 9e-33, Method: Composition-based stats. Identities = 35/116 (30%), Positives = 63/116 (54%), Gaps = 2/116 (1%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 ++T++ G E+QA R+L +GL + N GEID+I R+ T +FVEV+ R+++ + Sbjct: 1 MSTRKRGLEGESQASRYLRQQGLVIVQHNFRVPCGEIDIICRDSDTWVFVEVKRRQNSDF 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE--VEWIKDAF 127 +T + ++ + A+ +L + RFD++ ++ VEW KDAF Sbjct: 61 ASILEQITTRQCQRIRRAAQYFLVEQTVNEYLAKMRFDIITINDSQVTVEWYKDAF 116 >UniRef50_A1ZF36 Putative uncharacterized protein n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZF36_9SPHI Length = 119 Score = 140 bits (355), Expect = 9e-33, Method: Composition-based stats. Identities = 34/116 (29%), Positives = 60/116 (51%), Gaps = 8/116 (6%) Query: 17 KQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGA 76 ++ G E A +++ KG + N + GEID+I + G +FVEV+ R S +G Sbjct: 6 QKKGKYGENLAAAFMQNKGYTLLERNYRYKRGEIDIIAQTGDVLVFVEVKLRSSDNFGLP 65 Query: 77 AASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT-----GNEVEWIKDAF 127 SV+ ++Q+ ++QTA ++ + D RFD+VA ++ + +DAF Sbjct: 66 EESVSENQQNLIIQTAEQYIEEIDWE---SDIRFDIVAIELKSHQSPQITYFEDAF 118 >UniRef50_A6VB97 UPF0102 protein PSPA7_4996 n=17 Tax=Pseudomonadaceae RepID=Y4996_PSEA7 Length = 125 Score = 140 bits (355), Expect = 9e-33, Method: Composition-based stats. Identities = 41/123 (33%), Positives = 64/123 (52%), Gaps = 7/123 (5%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + ++ G E A L +GL + N R GE+DL+M +G T +FVEVR RR Sbjct: 4 RANSRDKGRQAEEMACAHLLRQGLATLGKNWTCRRGELDLVMLDGDTVVFVEVRSRRHRA 63 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT------GNEVEWIKDA 126 +GGA S+ K+ +L+ +A L+L + + CRFDVV ++WI++A Sbjct: 64 WGGALESIDARKRQRLILSAELFLQQ-EARWAKRPCRFDVVTVDTSDGQSPPRLDWIQNA 122 Query: 127 FND 129 F+ Sbjct: 123 FDA 125 >UniRef50_C6WYK5 Putative uncharacterized protein n=1 Tax=Methylotenera mobilis JLW8 RepID=C6WYK5_METML Length = 119 Score = 140 bits (355), Expect = 9e-33, Method: Composition-based stats. Identities = 49/117 (41%), Positives = 61/117 (52%), Gaps = 7/117 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 G E A +L+ GL I N GEIDLIMR+G+T +FVEVR R + + Sbjct: 7 AKNITEGQLAEQIAATFLQNNGLTVIEKNFRSAYGEIDLIMRDGKTLVFVEVRLRSNTKF 66 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVV---AFTGNEVEWIKDAF 127 GGA S+ SKQ KL +TA +L + CRFD + A VEWIKDAF Sbjct: 67 GGAGMSINASKQQKLTRTAERYLQING----DSACRFDAILMHALDITTVEWIKDAF 119 >UniRef50_D0KYK3 Putative uncharacterized protein n=1 Tax=Halothiobacillus neapolitanus c2 RepID=D0KYK3_HALNC Length = 165 Score = 140 bits (355), Expect = 1e-32, Method: Composition-based stats. Identities = 50/141 (35%), Positives = 69/141 (48%), Gaps = 13/141 (9%) Query: 1 MATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTT 60 M + + TT G E A +L +GL+ I NV GEIDLIM++G T Sbjct: 25 MRGTDAELPNAKAQTTLARGHRAETMAAEYLSRQGLKLIDRNVRAGRGEIDLIMQDGATL 84 Query: 61 IFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-- 118 +FVEVR R++ + AA SV+ +K+ K+++TA L + CRFDVVA Sbjct: 85 VFVEVRARKAGAWVSAAESVSPAKRKKIIETAERLLNEKPV-WRKSPCRFDVVAIGLPSE 143 Query: 119 ----------EVEWIKDAFND 129 EV WI+DAF Sbjct: 144 SSSEPAAKQAEVNWIQDAFQA 164 >UniRef50_A8SLV5 Putative uncharacterized protein n=1 Tax=Parvimonas micra ATCC 33270 RepID=A8SLV5_9FIRM Length = 114 Score = 140 bits (355), Expect = 1e-32, Method: Composition-based stats. Identities = 31/116 (26%), Positives = 57/116 (49%), Gaps = 4/116 (3%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + K G+ E A ++L KG + I N + GEID+I ++ +F+EV+ R++ + Sbjct: 1 MKAKDIGNLGEDMAVKFLLEKGYQIIERNFLKPFGEIDIIAKDKDFLVFIEVKARKNVNF 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN--EVEWIKDAF 127 G V K K+ A++++ N RFDV+ + ++ I++AF Sbjct: 61 GFPREFVNGIKIKKIQDVAQIYMMEKNLFGAK--IRFDVIEIIFDNYKITHIENAF 114 >UniRef50_C9KJR5 Endonuclease n=2 Tax=Veillonellaceae RepID=C9KJR5_9FIRM Length = 132 Score = 140 bits (354), Expect = 1e-32, Method: Composition-based stats. Identities = 40/124 (32%), Positives = 61/124 (49%), Gaps = 6/124 (4%) Query: 9 GSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYR 68 + R + T G E A +LE G +A N GEID++ +GR FVEV+ R Sbjct: 10 NAERLMDTTTIGRQGEEAAAVFLERAGYEILARNFRTPRGEIDIVASKGRMLAFVEVKTR 69 Query: 69 RSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF----TGNEVEWIK 124 R+ +G AA+V KQ K++Q+A +L + + CRFDV+ ++E + Sbjct: 70 RTQRFGRPAAAVDYRKQQKIIQSAHWFLRQRHLEGCL--CRFDVIEIYRAGERWQIEHLP 127 Query: 125 DAFN 128 AF Sbjct: 128 GAFE 131 >UniRef50_Q1GYY7 UPF0102 protein Mfla_2283 n=1 Tax=Methylobacillus flagellatus KT RepID=Y2283_METFK Length = 113 Score = 140 bits (353), Expect = 1e-32, Method: Composition-based stats. Identities = 54/117 (46%), Positives = 71/117 (60%), Gaps = 7/117 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 KQ GD EA A R+L +GL IA N R GEIDL+M++G T +FVEVR R A +GG Sbjct: 1 MKQLGDDAEALAERYLIKQGLVVIARNYRCRFGEIDLVMKQGATIVFVEVRMRSHATFGG 60 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF---TGNEVEWIKDAFND 129 AAAS+ +K+ KL+ TA +L RH CRFD + + +EWI+DAF+ Sbjct: 61 AAASIHAAKRQKLILTAEHFLQRHGS----APCRFDAILLSKRDADGIEWIQDAFSA 113 >UniRef50_A5N821 UPF0102 protein CKL_1410 n=16 Tax=Clostridium RepID=Y1410_CLOK5 Length = 122 Score = 140 bits (353), Expect = 2e-32, Method: Composition-based stats. Identities = 35/119 (29%), Positives = 54/119 (45%), Gaps = 8/119 (6%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 K G E A+ +L G + N + GEID+I ++G F+EV+ R LYG Sbjct: 5 NKDIGSLGEDIAKNYLNQIGYTVLERNFRCKVGEIDIIGKDGDYICFIEVKSRYGKLYGN 64 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF------TGNEVEWIKDAFN 128 SV K+ K+ + A +++ R + RFDV+ ++ IKDAF Sbjct: 65 PCESVNYPKRLKIYKAANIYMLRKK--LFKFNFRFDVIEIIFNTYNDVPSIKLIKDAFQ 121 >UniRef50_C0YUE8 Possible endonuclease n=2 Tax=Flavobacteriaceae RepID=C0YUE8_9FLAO Length = 125 Score = 140 bits (353), Expect = 2e-32, Method: Composition-based stats. Identities = 29/121 (23%), Positives = 53/121 (43%), Gaps = 8/121 (6%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 G E A +L+ G + + N + EID+I + I VEV+ R + + Sbjct: 4 ANHNDFGKMAEDLAVEYLKKCGYKILVRNFRFQKAEIDVIAEKDNQIIVVEVKARSTDAF 63 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-----VEWIKDAFN 128 +VT++K ++ A ++ N + RFD+++ +E +E I+DAF Sbjct: 64 MLPQEAVTKTKIKSIVSAANHYMEEFNK---DNEVRFDIISVLPDENKNLIIEHIEDAFE 120 Query: 129 D 129 Sbjct: 121 A 121 >UniRef50_Q7P0B3 UPF0102 protein CV_0654 n=1 Tax=Chromobacterium violaceum RepID=Y654_CHRVO Length = 112 Score = 139 bits (352), Expect = 2e-32, Method: Composition-based stats. Identities = 51/111 (45%), Positives = 71/111 (63%), Gaps = 4/111 (3%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 Q G E +A LE +GL+ +A N + RGGEIDLIMR+G +FVEVR+R + +GG Sbjct: 1 MNQAGRDAEDRALALLEKRGLKLVARNWHCRGGEIDLIMRDGDALVFVEVRHRGGSRFGG 60 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFD-VVAFTGNEVEWIKD 125 AA S+T +KQ KLL A ++L+ HN CRFD VV+ G+ +W+K+ Sbjct: 61 AADSITAAKQRKLLLAAEVYLSSHNI---DSPCRFDAVVSVGGDAPQWLKN 108 >UniRef50_B4RXI2 Sigma-54 factor n=2 Tax=Alteromonas macleodii RepID=B4RXI2_ALTMD Length = 113 Score = 139 bits (352), Expect = 2e-32, Method: Composition-based stats. Identities = 34/109 (31%), Positives = 60/109 (55%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 +K G+A E +A +L +GL N + R GE+D++M++G T + +EV+YR+ +G Sbjct: 2 SKLQGNAAEDKACEYLLQQGLTLRCRNYHTRRGELDIVMQDGNTIVCIEVKYRKQNRFGS 61 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIK 124 A VT K ++ +L + + + R DV+A G+ +EW+K Sbjct: 62 AVEFVTAKKLQRIQAAFGFYLLDNGLNPASTPLRIDVIAIDGDNLEWLK 110 >UniRef50_B8DRI1 Putative uncharacterized protein n=2 Tax=Desulfovibrio RepID=B8DRI1_DESVM Length = 146 Score = 139 bits (352), Expect = 2e-32, Method: Composition-based stats. Identities = 39/135 (28%), Positives = 64/135 (47%), Gaps = 9/135 (6%) Query: 4 VPTRSGSPRQLT---TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTT 60 P R+ P + G E A R L +GLR +A N G E+D++ + T Sbjct: 2 TPPRAAPPTTASATGNAAIGARGEEAAARLLAQRGLRVLARNWRHGGLELDIVCDDRGTL 61 Query: 61 IFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE- 119 +FVEV+ R ++ ++T +K+ KL++ AR +LA H+ CRFD+V + Sbjct: 62 VFVEVKTRAASGPARPDEALTTAKRGKLVRAARQYLAAHDC--WDKPCRFDLVCVVHDGA 119 Query: 120 ---VEWIKDAFNDHS 131 +E AF+ + Sbjct: 120 TLTLEHYPHAFDLTA 134 >UniRef50_D0MIM6 Putative uncharacterized protein n=1 Tax=Rhodothermus marinus DSM 4252 RepID=D0MIM6_RHOM4 Length = 127 Score = 139 bits (351), Expect = 3e-32, Method: Composition-based stats. Identities = 41/126 (32%), Positives = 58/126 (46%), Gaps = 14/126 (11%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIM-------REGRTTIFVEVR 66 + T+ G E A +LE +G R +A EIDL+ +G +FVEV+ Sbjct: 1 MDTRTIGTRGEDLAAAYLEQQGYRILARQYRFERAEIDLVCFEPAPRPEDGGEIVFVEVK 60 Query: 67 YRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT-----GNEVE 121 RR +G +VT KQ L++ AR +L H CRFDV+A E+E Sbjct: 61 TRRGLGFGRPEEAVTPEKQRHLIRAARAYLYEH--HLQRARCRFDVIAIVLHDDRPPEIE 118 Query: 122 WIKDAF 127 +DAF Sbjct: 119 HFRDAF 124 >UniRef50_A8G183 UPF0102 protein Ssed_4252 n=16 Tax=Shewanella RepID=Y4252_SHESH Length = 117 Score = 139 bits (351), Expect = 3e-32, Method: Composition-based stats. Identities = 40/110 (36%), Positives = 65/110 (59%), Gaps = 3/110 (2%) Query: 18 QTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAA 77 + G A E A +L +GL FI NV + GEIDL+M+ G+ IFVEV+YR + YGGA Sbjct: 11 EHGQAGENLAMNYLLEQGLTFIERNVRFKFGEIDLVMKNGKEWIFVEVKYRSKSQYGGAI 70 Query: 78 ASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAF 127 +++ + +L + A ++ +N CRFD++A +++W+ +AF Sbjct: 71 NALSSGQIKRLRRAAEHYMQLNNI---DAICRFDLIAVDAGQIQWLPNAF 117 >UniRef50_B2KC57 Putative uncharacterized protein n=1 Tax=Elusimicrobium minutum Pei191 RepID=B2KC57_ELUMP Length = 122 Score = 139 bits (350), Expect = 4e-32, Method: Composition-based stats. Identities = 39/117 (33%), Positives = 64/117 (54%), Gaps = 7/117 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGR-----TTIFVEVRYRRS 70 + G E A +L+ G + IA N + GE+D+I +G T +F+EV+ R Sbjct: 1 MNKLGVESENAAANFLKKNGYKIIARNYAVQTGEVDIIASQGGLLKQKTLVFIEVKGRAY 60 Query: 71 ALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAF 127 YGG A+VT++KQ+K++ A +++ + FD RFDVV ++E I++AF Sbjct: 61 KAYGGPLAAVTKAKQNKIISAATIYVKENFPKFD--SIRFDVVTVVDGKIEHIENAF 115 >UniRef50_B5YFD1 UPF0102 protein DICTH_1420 n=2 Tax=Dictyoglomus RepID=Y1420_DICT6 Length = 118 Score = 139 bits (350), Expect = 4e-32, Method: Composition-based stats. Identities = 32/120 (26%), Positives = 61/120 (50%), Gaps = 8/120 (6%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + K+ G E +L +G + N GE+D+I ++G IF+EV+ RR+ + Sbjct: 1 MNNKEIGKLGEDFTIDFLNKRGFIILERNYKVPLGEVDIIAQKGDLLIFIEVKTRRNLDF 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE------VEWIKDAF 127 G A +V R+KQ ++ + A L+++ F RFD+++ ++ E++ +AF Sbjct: 61 GIPAEAVDRTKQTRIKKIAELYISTKKPKFKK--IRFDIMSIILSKSGKILDWEYLINAF 118 >UniRef50_C8W5C0 Putative uncharacterized protein n=1 Tax=Desulfotomaculum acetoxidans DSM 771 RepID=C8W5C0_DESAS Length = 119 Score = 138 bits (349), Expect = 5e-32, Method: Composition-based stats. Identities = 45/119 (37%), Positives = 61/119 (51%), Gaps = 9/119 (7%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 + G E+ A R+L KG I N R GEID+I RE T+FVEVR R + YG Sbjct: 4 KKQLLGRLGESVAARYLYSKGFIIIHQNFRCRLGEIDIIAREKGVTVFVEVRSRCGSSYG 63 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE------VEWIKDAF 127 SV KQ KL + A+ ++AR+ + D RFDVVA + +E ++AF Sbjct: 64 LPQESVVIKKQVKLRKLAQYYIARYALTG---DFRFDVVAVMFEQDNSIKLIEHFRNAF 119 >UniRef50_A8PP71 Putative uncharacterized protein n=1 Tax=Rickettsiella grylli RepID=A8PP71_9COXI Length = 130 Score = 138 bits (348), Expect = 7e-32, Method: Composition-based stats. Identities = 42/127 (33%), Positives = 68/127 (53%), Gaps = 14/127 (11%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + T++ G E+ + +L + L+ I N GEIDLIM++ +F+EVRYR+S + Sbjct: 1 MNTQKLGHHIESLVQDYLRRQKLKRITRNFRCCFGEIDLIMKDKNVLVFIEVRYRQSLQF 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG-------------NEV 120 G + S+ KQ+K+++ A +L+ S + + CRFDVV +V Sbjct: 61 GNSLESIHAMKQNKIMKAAEYYLSSQRLS-EKIACRFDVVGVKPITQKLLAVSKLDSAQV 119 Query: 121 EWIKDAF 127 EWIK+AF Sbjct: 120 EWIKNAF 126 >UniRef50_Q0TPP8 UPF0102 protein CPF_1959 n=9 Tax=Clostridium perfringens RepID=Y1959_CLOP1 Length = 122 Score = 138 bits (348), Expect = 7e-32, Method: Composition-based stats. Identities = 36/119 (30%), Positives = 57/119 (47%), Gaps = 8/119 (6%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 K G E + ++L+ +G + N N GEID+I + F+EV+ R S +G Sbjct: 5 NKSIGFYGEDLSAKFLKKEGYSILEKNFNCSSGEIDIIAIKDEIISFIEVKSRFSNSFGN 64 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN------EVEWIKDAFN 128 SVT SKQ +++ A+ +L H RFDV+ + E+ ++KDAF Sbjct: 65 PKESVTCSKQGRIINAAKYYL--HVKKLYNYYIRFDVIEVNFHIDSSKYELNFLKDAFR 121 >UniRef50_A3M3I7 UPF0102 protein A1S_1049 n=12 Tax=Acinetobacter RepID=Y1049_ACIBT Length = 133 Score = 137 bits (347), Expect = 8e-32, Method: Composition-based stats. Identities = 44/131 (33%), Positives = 68/131 (51%), Gaps = 17/131 (12%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 +Q G E A + L+ + ++A+N + R GE+DLI++ G IFVEV+ R YG Sbjct: 4 AQQLGQWAEQTALKLLKEQNYEWVASNYHSRRGEVDLIVKRGNELIFVEVKARGQGNYGQ 63 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE---------------- 119 A VT SKQ K+++TA +L R+ S+ CRFDV+ F + Sbjct: 64 ACEMVTLSKQKKIIKTAMRFLQRYP-SYQDFYCRFDVICFDFPQKIAKTVQQDFSKFHYD 122 Query: 120 VEWIKDAFNDH 130 ++WI++AF Sbjct: 123 LQWIENAFTLD 133 >UniRef50_C6BVQ7 Putative uncharacterized protein n=1 Tax=Desulfovibrio salexigens DSM 2638 RepID=C6BVQ7_DESAD Length = 134 Score = 137 bits (346), Expect = 9e-32, Method: Composition-based stats. Identities = 35/120 (29%), Positives = 54/120 (45%), Gaps = 6/120 (5%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 G A E A +LE +G N + E+D+I + IFVEV+ R Sbjct: 2 SPRHLDFGQAGEDYAACFLENRGYFLRQRNWRWKQWELDIICEKDDELIFVEVKTRAGRS 61 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF----TGNEVEWIKDAFN 128 +VT +K+ KL++ A +L+ + CRFD+V TG E I++AF+ Sbjct: 62 AQSGIEAVTPAKRKKLVKAATRYLSAFD--LWERPCRFDLVIVNDDGTGFRAEHIENAFD 119 >UniRef50_UPI0000510419 hypothetical protein BlinB_18076 n=1 Tax=Brevibacterium linens BL2 RepID=UPI0000510419 Length = 132 Score = 137 bits (346), Expect = 1e-31, Method: Composition-based stats. Identities = 34/117 (29%), Positives = 55/117 (47%), Gaps = 2/117 (1%) Query: 2 ATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTI 61 T RS + L + G E A +L+ +G+ I N GEID+I ++G T + Sbjct: 3 PTTGRRSATTSGLRQRALGQTGEDLAADFLQRQGMVIIERNFRCPRGEIDIIAKDGDTIV 62 Query: 62 FVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN 118 FVEV+ RR+ G +VT +K K+ + +WL++ F R D + + Sbjct: 63 FVEVKTRRTLAQGSPLEAVTAAKLRKIRTLSGIWLSQQKDFFA--SIRIDALGIVMD 117 >UniRef50_A4J649 UPF0102 protein Dred_2035 n=1 Tax=Desulfotomaculum reducens MI-1 RepID=Y2035_DESRM Length = 122 Score = 137 bits (346), Expect = 1e-31, Method: Composition-based stats. Identities = 35/121 (28%), Positives = 58/121 (47%), Gaps = 8/121 (6%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREG-RTTIFVEVRYRRSA 71 + K G+ E +A ++++ G + N + GE+D+I + +F+EVR R Sbjct: 2 SIQRKALGNKGEEEACKYIQNLGYNIMERNYRCKIGELDIIAWDPVGMLVFLEVRSRSGR 61 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN------EVEWIKD 125 +G SV KQ+KL A+ +L F + CRFDV+ N E++ IK+ Sbjct: 62 AFGVPEESVNYRKQNKLRMLAQQFLLTK-SEFAKISCRFDVIGVYFNKEGSVQEIKHIKN 120 Query: 126 A 126 A Sbjct: 121 A 121 >UniRef50_C9LW74 Endonuclease n=1 Tax=Selenomonas sputigena ATCC 35185 RepID=C9LW74_9FIRM Length = 121 Score = 137 bits (346), Expect = 1e-31, Method: Composition-based stats. Identities = 37/119 (31%), Positives = 60/119 (50%), Gaps = 6/119 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 +TTK GD E A ++LE +G R + + GEID+I + +F+EV+ RR + Sbjct: 1 MTTKSFGDRGEDLAAQYLEKRGCRILERQFRAKTGEIDIIAEDRGALLFIEVKTRRPTRF 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG----NEVEWIKDAFN 128 G A +V +KQ ++ +TA L++ + CRFDV+ V ++AF Sbjct: 61 GAPAQAVGYTKQRRIFRTALLYMQKRAIGERF--CRFDVLEVLVMGGSYTVNHYENAFE 117 >UniRef50_A5EVA6 UPF0102 protein DNO_0639 n=2 Tax=Cardiobacteriaceae RepID=Y639_DICNV Length = 126 Score = 137 bits (345), Expect = 2e-31, Method: Composition-based stats. Identities = 45/119 (37%), Positives = 66/119 (55%), Gaps = 6/119 (5%) Query: 11 PRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRS 70 +++TTK+ G E A +L GL +A NV R GEIDLI ++ R +FVEVR RR+ Sbjct: 7 NKKMTTKKRGQYGELLAADYLTAHGLNIVAKNVYSRYGEIDLIAQDDRVLVFVEVRLRRA 66 Query: 71 ALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF----TGNEVEWIKD 125 AA S+T K + Q+A+ +L ++ CRFD V T +E+EW+K+ Sbjct: 67 QALVSAAESITPEKLRRCYQSAQDYLQKNYAVPPD--CRFDAVLITQYQTHHEIEWLKN 123 >UniRef50_Q31EY6 UPF0102 protein Tcr_1695 n=1 Tax=Thiomicrospira crunogena XCL-2 RepID=Y1695_THICR Length = 120 Score = 137 bits (345), Expect = 2e-31, Method: Composition-based stats. Identities = 46/115 (40%), Positives = 72/115 (62%), Gaps = 4/115 (3%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE-GRTTIFVEVRYRRSALYG 74 +++ G E QA WL+ + + +A N +GGEIDLI + T IF EV+YR+S+ +G Sbjct: 5 SQKIGQQKEQQAAVWLKTQAITIVAQNFRCKGGEIDLIGLDTDDTLIFFEVKYRQSSTFG 64 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN--EVEWIKDAF 127 A+ SVT KQ +L+Q A+ +L +H ++ + RFDV+ F N + EW++DAF Sbjct: 65 TASESVTPQKQQRLIQCAQNFLQKHP-NYQACNMRFDVLFFEDNQTQPEWLQDAF 118 >UniRef50_Q2BGY7 Putative uncharacterized protein n=1 Tax=Neptuniibacter caesariensis RepID=Q2BGY7_9GAMM Length = 119 Score = 136 bits (344), Expect = 2e-31, Method: Composition-based stats. Identities = 50/119 (42%), Positives = 72/119 (60%), Gaps = 5/119 (4%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + ++ G E A +L + LR I N N R GEIDLIM++G + +F+EVR R A + Sbjct: 1 MDRRKRGKDAEQHALVYLSKQKLRLIEQNFNCRFGEIDLIMQDGESIVFIEVRLRTHAEF 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT----GNEVEWIKDAFN 128 GGAAASVT +KQ K+++TA+L+L++ +CRFDV+A+ W KDAF Sbjct: 61 GGAAASVTTTKQRKIIKTAQLYLSKRP-RLQNKNCRFDVIAYEYDAAPTHPLWYKDAFR 118 >UniRef50_B2A2P1 UPF0102 protein Nther_1376 n=1 Tax=Natranaerobius thermophilus JW/NM-WN-LF RepID=Y1376_NATTJ Length = 119 Score = 136 bits (344), Expect = 2e-31, Method: Composition-based stats. Identities = 44/120 (36%), Positives = 58/120 (48%), Gaps = 7/120 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVN-ERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + K G E AR +L KG + I N R GEIDLI IFVEV+ R S L Sbjct: 1 MNNKSKGRTAEKIARIFLLSKGYQIIFQNYRFSRLGEIDLICCFDNILIFVEVKSRSSLL 60 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-----EVEWIKDAF 127 +G +V KQ +L + A ++L N F RFDV+A N E+ ++DAF Sbjct: 61 WGQPEEAVGYEKQGQLKKLANIFLYEFN-EFTEYQIRFDVIAILNNNKVKCEISHLRDAF 119 >UniRef50_A1KWG5 UPF0102 protein NMC2069 n=28 Tax=Neisseriaceae RepID=Y2069_NEIMF Length = 115 Score = 136 bits (344), Expect = 2e-31, Method: Composition-based stats. Identities = 40/111 (36%), Positives = 65/111 (58%), Gaps = 3/111 (2%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 + G+A E A +L+ +G +A N + GEIDLI++ G +FVEV+YR++ +GG Sbjct: 4 NHKQGEAGEDAALAFLQSQGCTLLARNWHCAYGEIDLIVKNGGMILFVEVKYRKNRQFGG 63 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-VEWIKD 125 AA S++ SK KL ++ +L ++ + V CR D V G+ EWI++ Sbjct: 64 AAYSISPSKLLKLQRSVEYYLQQNRLT--NVPCRLDAVLIEGSRPPEWIQN 112 >UniRef50_B3JMU0 Putative uncharacterized protein n=6 Tax=Bacteroidales RepID=B3JMU0_9BACE Length = 129 Score = 136 bits (343), Expect = 2e-31, Method: Composition-based stats. Identities = 29/126 (23%), Positives = 54/126 (42%), Gaps = 7/126 (5%) Query: 9 GSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYR 68 + + G E A +L KG N + E+D++ ++ I VEV+ R Sbjct: 5 AKNKMAKHNELGKEGENAAAEYLMSKGYSIRHRNWHSGKRELDIVAQKDGELIVVEVKTR 64 Query: 69 RSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE----VEWIK 124 R+ +G ++T K ++ + ++ R + RFD++ TG E +E I+ Sbjct: 65 RNEEFGKPEEAITDRKIRNIIISTDTYIKRFEI---DLPVRFDIITVTGTEPPFHIEHIQ 121 Query: 125 DAFNDH 130 +AF Sbjct: 122 EAFLPP 127 >UniRef50_A0LV62 UPF0102 protein Acel_1550 n=8 Tax=Actinomycetales RepID=Y1550_ACIC1 Length = 132 Score = 136 bits (343), Expect = 2e-31, Method: Composition-based stats. Identities = 39/126 (30%), Positives = 57/126 (45%), Gaps = 13/126 (10%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 + G E A + L+ G+ +A N R GE+D++ R+G T + EV+ RR +G Sbjct: 7 AREALGRFGEELAAQHLQTLGMTILARNWRCRSGELDIVARDGYTLVVCEVKTRRGVGFG 66 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNG--------SFDTVDCRFDVVAF-----TGNEVE 121 SVT K +L Q A WL H + RFDVVA G +E Sbjct: 67 EPLESVTPRKAARLRQLAVAWLTEHAATRVDTTEGTHGYTAVRFDVVAILHRKEDGPTIE 126 Query: 122 WIKDAF 127 +++ AF Sbjct: 127 YVRGAF 132 >UniRef50_C8PPR7 HD domain protein n=1 Tax=Treponema vincentii ATCC 35580 RepID=C8PPR7_9SPIO Length = 462 Score = 136 bits (343), Expect = 2e-31, Method: Composition-based stats. Identities = 39/126 (30%), Positives = 63/126 (50%), Gaps = 8/126 (6%) Query: 12 RQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSA 71 +++T ++ G A EA A +WLE G IA N R GEID+I + T IF EV+ Sbjct: 337 KKMTEERLGPAGEAFAAKWLERNGYSVIARNWRTRTGEIDIIAEKNETLIFFEVKTLPHT 396 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-------EVEWIK 124 + V KQ ++ +TA+ +L H ++ + RFDV+ + E ++ Sbjct: 397 AFTDLDIIVGNRKQERICKTAKYFLLTHR-KYNKMHIRFDVLVLPFDPRTTEEAEPVHLE 455 Query: 125 DAFNDH 130 +AF D+ Sbjct: 456 NAFEDY 461 >UniRef50_B0VJC5 Putative uncharacterized protein n=1 Tax=Candidatus Cloacamonas acidaminovorans RepID=B0VJC5_9BACT Length = 128 Score = 135 bits (342), Expect = 3e-31, Method: Composition-based stats. Identities = 34/124 (27%), Positives = 60/124 (48%), Gaps = 7/124 (5%) Query: 12 RQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSA 71 ++ + + E A R+L G N ++ GEID+I+ + + +F EV+ R S Sbjct: 2 KKYSLQDFSHIGEDLAARYLVSNGYTITCRNYRKKYGEIDIIVEKDQHLVFCEVKTRTSH 61 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF------TGNEVEWIKD 125 A AS+ SKQ K+ +TA+L++ + F RFDV+ E++ ++ Sbjct: 62 SIEWALASIGFSKQRKISRTAQLYINENP-QFAKHIFRFDVLLVFYYENTDTFEIKHFEN 120 Query: 126 AFND 129 AF+ Sbjct: 121 AFDA 124 >UniRef50_A6TRS2 UPF0102 protein Amet_2739 n=1 Tax=Alkaliphilus metalliredigens QYMF RepID=Y2739_ALKMQ Length = 114 Score = 135 bits (342), Expect = 3e-31, Method: Composition-based stats. Identities = 40/115 (34%), Positives = 57/115 (49%), Gaps = 5/115 (4%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 +K G+ E ++LE KG R I N + GEID+I +G FVEV+ RRS YG Sbjct: 2 SKSLGELGERIIGQYLEKKGYRLIETNYRTKLGEIDIIAYKGTIIAFVEVKTRRSQSYGM 61 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN---EVEWIKDAF 127 +V KQ +L + A ++AR D RFDV ++ +I +AF Sbjct: 62 PCEAVNWQKQQRLHRVASHYIARKG--LINYDFRFDVAEVIIGKEKKIHYINNAF 114 >UniRef50_C3W9T3 Endonuclease n=4 Tax=Fusobacterium RepID=C3W9T3_FUSMR Length = 120 Score = 135 bits (342), Expect = 3e-31, Method: Composition-based stats. Identities = 32/111 (28%), Positives = 59/111 (53%), Gaps = 2/111 (1%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 ++ GD +E +A + L +G + + N + GEID+I + T +FVEV+YR++ YG Sbjct: 3 NNREIGDKYEEKAVKLLISRGYKILERNYRVKAGEIDIIAKFEDTIVFVEVKYRKTLKYG 62 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 +V K ++ A+++L + RFD ++F G ++ W K+ Sbjct: 63 YGLEAVDYRKIRRIYNAAKVYLTLNKKLSSK--IRFDCISFLGEKISWTKN 111 >UniRef50_A1SB01 UPF0102 protein Sama_3355 n=5 Tax=Shewanella RepID=Y3355_SHEAM Length = 108 Score = 135 bits (342), Expect = 3e-31, Method: Composition-based stats. Identities = 47/108 (43%), Positives = 66/108 (61%), Gaps = 3/108 (2%) Query: 20 GDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAAS 79 G E +A + L GLR A NV GEIDL+MREGR +FVEV++R +G A + Sbjct: 4 GQLAEDRAMKHLCAHGLRLEARNVRYPFGEIDLVMREGRVYVFVEVKFRTPKGFGDAVQA 63 Query: 80 VTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAF 127 ++ ++Q +L + A +L H CRFD+VA TG+++EWIKDAF Sbjct: 64 LSAAQQQRLRRAATHYLQCHRI---DAPCRFDMVAITGDKLEWIKDAF 108 >UniRef50_A4FME3 UPF0102 protein SACE_6045 n=2 Tax=Actinomycetales RepID=Y6045_SACEN Length = 133 Score = 135 bits (341), Expect = 4e-31, Method: Composition-based stats. Identities = 37/118 (31%), Positives = 53/118 (44%), Gaps = 7/118 (5%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 G E A R+LE G+ + N GE+D++ +G T IF EV+ R YG Sbjct: 18 RRHALGVEGERLAARFLEEHGITVLERNWRCDRGELDIVATDGETVIFCEVKARSGVDYG 77 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-----EVEWIKDAF 127 +V+ K L AR WL+ N + T RFDVV+ +E ++ AF Sbjct: 78 APLNAVSPHKVRHLRALARTWLSERNLTGCTA--RFDVVSVLWPPGRPARIEHLEGAF 133 >UniRef50_C2LNN6 Possible endonuclease n=3 Tax=Proteus RepID=C2LNN6_PROMI Length = 125 Score = 135 bits (341), Expect = 4e-31, Method: Composition-based stats. Identities = 52/115 (45%), Positives = 73/115 (63%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 +T G +E +A +L +GL+ I NV GEIDLIM+ RT IFVEVR+RRSA +G Sbjct: 6 STYLVGQYYERKALNYLRQQGLKLIERNVRYPCGEIDLIMQGNRTWIFVEVRFRRSAQFG 65 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAFND 129 A +SVT SK+ +L A WLA+ S +TV+CRFD+ AF ++ W+K+ + Sbjct: 66 DAISSVTYSKRRRLWYAANCWLAQRQQSIETVNCRFDICAFDQRQLIWLKNILDH 120 >UniRef50_C6D2Y6 Putative uncharacterized protein n=1 Tax=Paenibacillus sp. JDR-2 RepID=C6D2Y6_PAESJ Length = 122 Score = 135 bits (341), Expect = 4e-31, Method: Composition-based stats. Identities = 47/120 (39%), Positives = 63/120 (52%), Gaps = 9/120 (7%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRR-SALY 73 +QTG A E A R+LE +G I N R GEID+I T +FVEVR RR + Sbjct: 5 RRRQTGLAGETAACRYLEKEGYNVIERNWRCRSGEIDIIATIDHTLVFVEVRTRRTGGRF 64 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE------VEWIKDAF 127 G AA SV R KQ ++ A+++L ++ RFDV+A T + V+ IK AF Sbjct: 65 GTAAESVDRRKQQQVALVAQVYLRMRQLTY--PPMRFDVIAVTMDRNDSISEVKHIKAAF 122 >UniRef50_D0I4Y5 Endonuclease n=1 Tax=Grimontia hollisae CIP 101886 RepID=D0I4Y5_VIBHO Length = 122 Score = 135 bits (341), Expect = 5e-31, Method: Composition-based stats. Identities = 57/114 (50%), Positives = 76/114 (66%), Gaps = 2/114 (1%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 L KQTGD +E QA R+LE +GL + N +GGE+DLIMRE +FVEV+YR+ A Y Sbjct: 4 LNRKQTGDHYENQACRFLERQGLTTLDKNARFKGGELDLIMREKSCIVFVEVKYRKQASY 63 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG--NEVEWIKD 125 GGAAA+++R KQ ++L+ A LW+A+ S + RFD V F G N V WIK+ Sbjct: 64 GGAAATISRQKQQRMLKAAYLWMAKKGLSATHTEFRFDAVTFEGSVNSVNWIKN 117 >UniRef50_A8ZV12 UPF0102 protein Dole_2298 n=2 Tax=Desulfobacteraceae RepID=Y2298_DESOH Length = 123 Score = 135 bits (340), Expect = 5e-31, Method: Composition-based stats. Identities = 40/118 (33%), Positives = 61/118 (51%), Gaps = 4/118 (3%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 +Q G E A R+L+ +G + N GEID+I ++ T FVEV+ RR+ YG Sbjct: 4 QRQQYGRQGEQAAERFLKKEGYTIVCRNYRTPVGEIDIIAKDKTTLAFVEVKARRTESYG 63 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE--VEWIKDAFNDH 130 S+T+ KQ K+ + A +L + RFDVV G + VE I++AF+ + Sbjct: 64 SPRLSITKDKQRKITRAALWYLKDTGQAGARA--RFDVVIVQGRDNSVELIRNAFDAN 119 >UniRef50_Q3IG11 UPF0102 protein PSHAa2523 n=3 Tax=Alteromonadales RepID=Y2523_PSEHT Length = 123 Score = 135 bits (340), Expect = 5e-31, Method: Composition-based stats. Identities = 39/117 (33%), Positives = 67/117 (57%), Gaps = 5/117 (4%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 +++ G +E QA+++L +GL I N GE+D+IM++G T +FVEV++R++ Sbjct: 9 QNSREKGQYYELQAQKYLVSQGLTAIERNYYCPFGELDVIMKDGNTLVFVEVKFRKNHAR 68 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN---EVEWIKDAF 127 GGA +++ KQ +L ++ +LA N + R D VA TG + W+K+ F Sbjct: 69 GGANYALSIQKQARLKRSIYHYLAAKNLT--NQPLRIDYVAITGEPSMHINWLKNVF 123 >UniRef50_Q1MRU7 UPF0102 protein LI0223 n=1 Tax=Lawsonia intracellularis PHE/MN1-00 RepID=Y223_LAWIP Length = 132 Score = 135 bits (340), Expect = 6e-31, Method: Composition-based stats. Identities = 33/119 (27%), Positives = 61/119 (51%), Gaps = 6/119 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + + + G E+ A +L KG+ + N + EIDL+ ++ +T +FVEVR R++ Sbjct: 1 MKSCEIGQQGESAAALFLYNKGMSILERNWRKGRFEIDLVCQDIKTLVFVEVRTRKAKGM 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN----EVEWIKDAFN 128 ++T SK+ ++ +A+L+L ++ CRFDV+ E+E K+ F Sbjct: 61 LLPEQTLTISKRCNIIHSAQLYLMDKKD--WSMPCRFDVICIISKKTTLELEHYKNVFE 117 >UniRef50_C7HUU3 Endonuclease n=4 Tax=Anaerococcus RepID=C7HUU3_9FIRM Length = 118 Score = 134 bits (339), Expect = 6e-31, Method: Composition-based stats. Identities = 33/117 (28%), Positives = 56/117 (47%), Gaps = 4/117 (3%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 + GD E A ++LE KG + + N + GEID+I + FVEV+ R++ + Sbjct: 4 KKRTIGDFGEEIALKYLEKKGYQILDRNFLKYYGEIDIIAIKNDILTFVEVKTRKNDEFK 63 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF--TGNEVEWIKDAFND 129 A+ V KQ ++ +TA+ ++ + FDV + +IK+AF D Sbjct: 64 PASLDVDYYKQERIKKTAQAYIMEKDLGE--FLISFDVCEVYLENKTIHYIKNAFGD 118 >UniRef50_Q1NMK2 Putative uncharacterized protein n=1 Tax=delta proteobacterium MLMS-1 RepID=Q1NMK2_9DELT Length = 120 Score = 134 bits (339), Expect = 7e-31, Method: Composition-based stats. Identities = 44/120 (36%), Positives = 61/120 (50%), Gaps = 6/120 (5%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + G E AR WLEG G R + AN GE+DL+ EG +FVEV+ RR Sbjct: 2 TRQRQGLGRRGEQLARDWLEGAGYRILEANCRTSSGELDLVAEEGGELVFVEVKSRRGDA 61 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE----VEWIKDAFN 128 +G +V R KQ ++++ AR +L+R RFDVVA T +E +K+AF Sbjct: 62 FGSPLEAVDRRKQARIIRCAREYLSRRRSHG--RPARFDVVAVTFTGGKPAIEVVKNAFE 119 >UniRef50_C4FFG3 Putative uncharacterized protein n=1 Tax=Bifidobacterium angulatum DSM 20098 RepID=C4FFG3_9BIFI Length = 151 Score = 134 bits (339), Expect = 7e-31, Method: Composition-based stats. Identities = 46/132 (34%), Positives = 63/132 (47%), Gaps = 6/132 (4%) Query: 1 MATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRT- 59 M T+ R ++++Q G EA A WLEG +A N + R GE+D+I Sbjct: 21 METIEARLAGN-AVSSRQVGALGEAYAAAWLEGFDWLVLARNWHCRYGELDIIALSPERR 79 Query: 60 TIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE 119 IFVEV+ RR +G +VT SKQ L + A WL R+ RFDVV + ++ Sbjct: 80 IIFVEVKTRRGVRFGTPQEAVTPSKQTNLRRAALQWLERNGHLLRHNGMRFDVVTVSVHD 139 Query: 120 ----VEWIKDAF 127 V I AF Sbjct: 140 GQVAVHRIPGAF 151 >UniRef50_C7N589 Predicted endonuclease related to Holliday junction resolvase n=2 Tax=Slackia RepID=C7N589_SLAHD Length = 167 Score = 134 bits (339), Expect = 8e-31, Method: Composition-based stats. Identities = 34/127 (26%), Positives = 57/127 (44%), Gaps = 8/127 (6%) Query: 9 GSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE-GRTTIFVEVRY 67 R++ K+ G EA A L+ KG + N GE D+I + T +FVEV+ Sbjct: 41 RLSREMDPKELGRRGEACACMLLDYKGYEILERNWKCPAGEADIIAIDENGTLVFVEVKT 100 Query: 68 RRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-----VEW 122 RR G ++ R+K+ + + A +L+++ + RFD +A + V Sbjct: 101 RRGVENGLPEEAIGRAKRARYEKIAAYYLSQY--TGPDTALRFDTIALLVMDNYRALVRH 158 Query: 123 IKDAFND 129 I +AF Sbjct: 159 IVNAFGQ 165 >UniRef50_D1U5W3 Putative uncharacterized protein n=1 Tax=Desulfovibrio aespoeensis Aspo-2 RepID=D1U5W3_9DELT Length = 130 Score = 134 bits (338), Expect = 9e-31, Method: Composition-based stats. Identities = 38/122 (31%), Positives = 63/122 (51%), Gaps = 6/122 (4%) Query: 11 PRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRS 70 ++ K GD E A R+LE +G+R + N R E+DL+ R+G T +FVEV+ R + Sbjct: 5 DKRTPAKWRGDLGEDAAARYLESRGMRVLDRNWRYRQWELDLVCRDGDTLVFVEVKTRVA 64 Query: 71 ALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN----EVEWIKDA 126 A + R+K+ +L++ A +L+ CRFD+ A +VE ++A Sbjct: 65 GSMSAPADGLGRAKRARLVKAAARYLSAKG--LWDEPCRFDLAAVVDTGVSMDVEHTENA 122 Query: 127 FN 128 F+ Sbjct: 123 FD 124 >UniRef50_C7IKV6 Putative uncharacterized protein n=1 Tax=Clostridium papyrosolvens DSM 2782 RepID=C7IKV6_9CLOT Length = 126 Score = 134 bits (338), Expect = 1e-30, Method: Composition-based stats. Identities = 37/125 (29%), Positives = 60/125 (48%), Gaps = 13/125 (10%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNE-RGGEIDLIMREGRTTIFVEVRYRRSAL 72 ++ G E +A +L+ G + N R GEID+I + F+EV+ RR++ Sbjct: 4 ANKREIGAVGEREAAEFLQRNGYTILKINYRVGRLGEIDIIANDNEYICFIEVKTRRTST 63 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE----------VEW 122 +G +VT++KQ K+ Q A ++L N + RFDV+ N+ + Sbjct: 64 FGSPGEAVTKTKQQKIRQIAAIYLT--NTRKMDSNVRFDVIEILMNKSMESVNSIKSINL 121 Query: 123 IKDAF 127 IKDAF Sbjct: 122 IKDAF 126 >UniRef50_A6L1J0 UPF0102 protein BVU_1879 n=26 Tax=Bacteroidales RepID=Y1879_BACV8 Length = 121 Score = 134 bits (338), Expect = 1e-30, Method: Composition-based stats. Identities = 26/121 (21%), Positives = 53/121 (43%), Gaps = 7/121 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + G E +A +L KG N + E+D++ I +EV+ R++ + Sbjct: 2 AEHNEFGKEGEEEAAAYLIDKGYSIRHRNWHCGKKELDIVAEYRNELIVIEVKTRKNTRF 61 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE----VEWIKDAFND 129 G +VT K +++ + +L + + + RFD++ G + +E I++AF Sbjct: 62 GNPEDAVTDKKIRRIIASTDAYLRKFSV---DLPVRFDIITLVGEKTPFTIEHIEEAFYP 118 Query: 130 H 130 Sbjct: 119 P 119 >UniRef50_B3QZF2 UPF0102 protein Ctha_1382 n=1 Tax=Chloroherpeton thalassium ATCC 35110 RepID=Y1382_CHLT3 Length = 129 Score = 134 bits (337), Expect = 1e-30, Method: Composition-based stats. Identities = 37/116 (31%), Positives = 58/116 (50%), Gaps = 4/116 (3%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 T G E A +L+ G + + N EIDLI ++ FVEV+ R + YG Sbjct: 3 TNVAFGKKGEDMASAFLKKCGYQILRRNYRSGNNEIDLITKKDNIVAFVEVKTRHNLNYG 62 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAFNDH 130 A +VT SKQ +L++ A+ ++ + RFDVVA +E + ++AFN+ Sbjct: 63 HPAEAVTLSKQKELIKAAQNFINDNPSQGVDY--RFDVVAIILDESK--RNAFNEP 114 >UniRef50_C0GHM5 Putative uncharacterized protein n=1 Tax=Dethiobacter alkaliphilus AHT 1 RepID=C0GHM5_9FIRM Length = 112 Score = 134 bits (337), Expect = 1e-30, Method: Composition-based stats. Identities = 38/114 (33%), Positives = 50/114 (43%), Gaps = 4/114 (3%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 K G E A L G +A N GEID++ + G +FVEV+ RRS+ G Sbjct: 1 MKTLGQKGEELAVDHLRRAGYLILARNWRCERGEIDIVAKAGNILVFVEVKTRRSSRLGT 60 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT--GNEVEWIKDAF 127 +V KQ KL A ++ + RFDV A N V IK+AF Sbjct: 61 PQEAVDFRKQEKLRHLAYRFINATGITAAEY--RFDVAAVNAKNNTVTIIKNAF 112 >UniRef50_A5Z6D1 Putative uncharacterized protein n=1 Tax=Eubacterium ventriosum ATCC 27560 RepID=A5Z6D1_9FIRM Length = 134 Score = 134 bits (337), Expect = 1e-30, Method: Composition-based stats. Identities = 31/115 (26%), Positives = 54/115 (46%), Gaps = 2/115 (1%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 L + G +E +L G + N + GEID+I ++ F+EV++R S Y Sbjct: 20 LNKRGRGSFYEDVCVEYLIKNGFDILHRNYRCKLGEIDIIAKKDDIIRFIEVKFRGSDSY 79 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAFN 128 G A +V KQ ++++ A +L + + V C FDV+ NE + + + Sbjct: 80 GSALEAVDFRKQRRIMRAASWFLNEYGLN--DVQCSFDVMTVENNEARYYFNCYG 132 >UniRef50_B3PLA2 Putative uncharacterized protein n=1 Tax=Cellvibrio japonicus Ueda107 RepID=B3PLA2_CELJU Length = 126 Score = 133 bits (336), Expect = 1e-30, Method: Composition-based stats. Identities = 45/113 (39%), Positives = 65/113 (57%), Gaps = 5/113 (4%) Query: 20 GDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAAS 79 G EA+A+ +LE +GL N + GEIDLIM EG T +FVEVR R + + A S Sbjct: 13 GARAEARAQAYLEQQGLTTWMKNYRCKTGEIDLIMCEGDTLVFVEVRLRTNRFFSSAVES 72 Query: 80 VTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN----EVEWIKDAFN 128 +T +K+ K+++TA+ +L D CRFD++A + EWI+DAF Sbjct: 73 ITPAKRQKMIRTAQRFLQERGL-VDKHACRFDIIALDAKGQHAKPEWIRDAFG 124 >UniRef50_Q1MYA7 Putative uncharacterized protein n=1 Tax=Bermanella marisrubri RepID=Q1MYA7_9GAMM Length = 124 Score = 133 bits (336), Expect = 1e-30, Method: Composition-based stats. Identities = 55/122 (45%), Positives = 76/122 (62%), Gaps = 3/122 (2%) Query: 8 SGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRY 67 S S + +TG +EAQA+++L +GL FI NVN + GE+DLIM+ + +FVEVRY Sbjct: 3 SNSDHSKSKIETGSFYEAQAKQFLVNQGLIFIEQNVNFKTGELDLIMKHNKHLVFVEVRY 62 Query: 68 RRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG--NEVEWIKD 125 R S YGGA S+T SKQ ++ + A+ +L +H CR DVVAF G + WIK+ Sbjct: 63 RSSQDYGGAVTSITASKQARVRRAAQTYLQKH-FGNRPPPCRIDVVAFEGANTKAIWIKN 121 Query: 126 AF 127 AF Sbjct: 122 AF 123 >UniRef50_B8KR63 Putative uncharacterized protein n=1 Tax=gamma proteobacterium NOR51-B RepID=B8KR63_9GAMM Length = 128 Score = 133 bits (336), Expect = 2e-30, Method: Composition-based stats. Identities = 43/121 (35%), Positives = 69/121 (57%), Gaps = 6/121 (4%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 + G+ WE +A +L G GL I N GEIDLI + +FVEVR R+ + +G Sbjct: 1 MRSEGNQWEIKAASFLRGHGLTIIVQNFTCPFGEIDLIGDDQGVIVFVEVRKRKRSRFGN 60 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF-----TGNEVEWIKDAFNDH 130 AA+SV R+KQ K++++A +L +H CRFDV+A+ + +W++ AF+ + Sbjct: 61 AASSVGRAKQKKIIRSAAFYLQQHGA-MADTHCRFDVIAYDVGADDPDTPKWLRSAFSAN 119 Query: 131 S 131 + Sbjct: 120 A 120 >UniRef50_A5D1I2 UPF0102 protein PTH_1707 n=1 Tax=Pelotomaculum thermopropionicum SI RepID=Y1707_PELTS Length = 120 Score = 133 bits (336), Expect = 2e-30, Method: Composition-based stats. Identities = 41/119 (34%), Positives = 59/119 (49%), Gaps = 8/119 (6%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 K G E A R+LE KG R ++ N R GE+DL++ +G +FVEVR R YG Sbjct: 4 ARKLLGRMGEEAAARYLEKKGCRILSRNHCCRLGELDLVVSDGDVLVFVEVRARTGEEYG 63 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN------EVEWIKDAF 127 A S+T K+ +L A +L + CRFDV+A + +E ++AF Sbjct: 64 LAQESITGRKKSRLRLLAWQYLKEKGKTGSM--CRFDVIAVLFDREGRVKRLEHFENAF 120 >UniRef50_C4F8U2 Putative uncharacterized protein n=2 Tax=Collinsella RepID=C4F8U2_9ACTN Length = 158 Score = 133 bits (336), Expect = 2e-30, Method: Composition-based stats. Identities = 35/126 (27%), Positives = 53/126 (42%), Gaps = 13/126 (10%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIM--REGRTTIFVEVRYRRS 70 ++ K G E A R+LE +G I N GE DL+ ++ + VEV+ RRS Sbjct: 32 GMSNKLLGSLGEELAARYLEQRGYDIIDRNYRCPEGEADLVAYDQDDDGVVLVEVKTRRS 91 Query: 71 ALY---GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG-----NEVEW 122 +VT KQ + + A + A H + RFDV+ T E+ Sbjct: 92 RSERGGAYPEEAVTPEKQRRYRRIALCYAADH---YPVPSIRFDVIGVTLRPANIGEIRH 148 Query: 123 IKDAFN 128 + AF+ Sbjct: 149 LCGAFD 154 >UniRef50_Q2YCL8 UPF0102 protein Nmul_A0195 n=3 Tax=Nitrosomonadaceae RepID=Y195_NITMU Length = 119 Score = 133 bits (336), Expect = 2e-30, Method: Composition-based stats. Identities = 49/118 (41%), Positives = 69/118 (58%), Gaps = 6/118 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 +T + G+ E A +L G L + N R GEIDLIMR+G T +FVEVR R + + Sbjct: 1 MTLRLKGNQAERYAEAFLAGHRLVLVQRNYRCRFGEIDLIMRDGETLVFVEVRMRTNRNF 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE---VEWIKDAFN 128 G A +S+T SKQ K+++ AR +L CRFD V +GNE +EWI++AF+ Sbjct: 61 GDAGSSITLSKQRKVVRAARHYLLSLRTEPC---CRFDAVLLSGNEGRDIEWIRNAFD 115 >UniRef50_B8HR21 Putative uncharacterized protein n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HR21_CYAP4 Length = 164 Score = 133 bits (335), Expect = 2e-30, Method: Composition-based stats. Identities = 34/107 (31%), Positives = 56/107 (52%), Gaps = 2/107 (1%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG- 74 ++Q GD E WL +G + + N GEIDLI+++ FVEV+ R + Sbjct: 2 SRQVGDVGEMLVAHWLTAQGWQIVQRNWQCCWGEIDLILQQDEWLAFVEVKTRSRGNWDQ 61 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVE 121 ++T +KQ KL +TA L+L+ + +F + CRFD+ + +V Sbjct: 62 DGLLAITPTKQRKLWKTATLFLSEYP-NFADLSCRFDLALVSYVKVH 107 >UniRef50_D1NRZ9 Putative endonuclease n=1 Tax=Bifidobacterium gallicum DSM 20093 RepID=D1NRZ9_9BIFI Length = 144 Score = 132 bits (334), Expect = 2e-30, Method: Composition-based stats. Identities = 35/120 (29%), Positives = 54/120 (45%), Gaps = 5/120 (4%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREG-RTTIFVEVRYRRSA 71 +L+ + G EA + WL G R + N + R GE+DLI FVE++ RR Sbjct: 25 RLSARDLGSWGEAASACWLRTHGWRIVGHNWHCRYGELDLIALSATDELAFVEIKTRRGC 84 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN----EVEWIKDAF 127 +G +V +KQ L + A LW+ + + V RFDV+ + + AF Sbjct: 85 QFGTPIEAVGVTKQTNLRRAAMLWMLEADHHINHVGIRFDVIGVLVHAGRIRFTHVPHAF 144 >UniRef50_C4FZ58 Putative uncharacterized protein n=1 Tax=Abiotrophia defectiva ATCC 49176 RepID=C4FZ58_ABIDE Length = 113 Score = 132 bits (334), Expect = 2e-30, Method: Composition-based stats. Identities = 36/112 (32%), Positives = 58/112 (51%), Gaps = 3/112 (2%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE-GRTTIFVEVRYRRSALYGGAA 77 G E +A +L KG + + N + GEID+I + VEV+YR S +G Sbjct: 1 MGKEKEEKAAAYLISKGYKILEKNYLRKTGEIDIIAKSADGYLTAVEVKYRSSDRFGSPF 60 Query: 78 ASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-EVEWIKDAFN 128 ++VT KQ K+ +T +++ HN S + RFDV+ G+ +E + +AF Sbjct: 61 SAVTYIKQRKICKTLLFYMSEHNISP-DIKSRFDVIGIYGDGRLEHLVNAFE 111 >UniRef50_D2R2U4 Putative uncharacterized protein n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R2U4_9PLAN Length = 150 Score = 132 bits (334), Expect = 2e-30, Method: Composition-based stats. Identities = 37/122 (30%), Positives = 58/122 (47%), Gaps = 8/122 (6%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 L K G E A +L +G +A + GEIDL+ +GRT +FVEV+ R + + Sbjct: 23 LQPKSLGRRGEDAAALFLRARGYWIVARSYRTSLGEIDLVAVDGRTIVFVEVKTRVRSDH 82 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE------VEWIKDAF 127 G +V KQ +L + A +L RH+ + RFD+V+ +E ++ AF Sbjct: 83 GQPFDAVHPDKQRRLTRLAAAYLKRHDLT--RYASRFDIVSILWPGGRKQPLIEHLQHAF 140 Query: 128 ND 129 Sbjct: 141 EA 142 >UniRef50_A5FR87 UPF0102 protein DehaBAV1_0707 n=5 Tax=Dehalococcoides RepID=Y707_DEHSB Length = 121 Score = 132 bits (334), Expect = 3e-30, Method: Composition-based stats. Identities = 38/119 (31%), Positives = 60/119 (50%), Gaps = 6/119 (5%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 K+TG+ E A +L+G G I N GEID++ ++G +F+EVR +R YG Sbjct: 4 NRKETGEFGEKLAAEYLKGMGYSIIQTNCRLPEGEIDIVGQDGEYLVFIEVRTKRRLGYG 63 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT----GNEVEWIKDAFND 129 A SVT K+ L+ +A ++ +H + CR D V+ +E IK+A + Sbjct: 64 LPAESVTPRKKAHLMASAESYIQKHR--LEHFPCRIDFVSVDLSQPEPRLELIKNALGE 120 >UniRef50_A3HX52 Putative uncharacterized protein n=1 Tax=Algoriphagus sp. PR1 RepID=A3HX52_9SPHI Length = 118 Score = 132 bits (333), Expect = 3e-30, Method: Composition-based stats. Identities = 32/119 (26%), Positives = 53/119 (44%), Gaps = 8/119 (6%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 ++G E A +WL KG + + N EIDLI+ + +FVEV++R + Sbjct: 2 AEHNRSGQLAEEMAAQWLISKGYQLLEKNYRHGYAEIDLILTHKKLLVFVEVKFRSGTGF 61 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-----VEWIKDAF 127 G A V +K+ +++ A ++ N D RFD+V + +DAF Sbjct: 62 GYAEEFVDYTKRKLIIKAADHYIHEKNWK---SDIRFDIVGVYRDRTGAINYRHFEDAF 117 >UniRef50_C9LM05 Endonuclease n=1 Tax=Dialister invisus DSM 15470 RepID=C9LM05_9FIRM Length = 117 Score = 132 bits (333), Expect = 3e-30, Method: Composition-based stats. Identities = 40/119 (33%), Positives = 58/119 (48%), Gaps = 7/119 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + G E +A +LE KG+ + N + GEIDLIM++G +F+EV+ RRS LY Sbjct: 1 MGNTAFGRMGEDRACLYLEEKGMTLVTRNFRCKHGEIDLIMKDGSVFVFIEVKTRRSRLY 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF-----TGNEVEWIKDAF 127 G +VT KQ + TA ++L V RFDVV + ++AF Sbjct: 61 GEPIEAVTVYKQRHIRYTAEVFLLAR--HLHDVRIRFDVVEVMMAPGRAVRLRHTRNAF 117 >UniRef50_C4K712 Putative uncharacterized protein n=1 Tax=Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum) RepID=C4K712_HAMD5 Length = 118 Score = 132 bits (333), Expect = 3e-30, Method: Composition-based stats. Identities = 52/118 (44%), Positives = 75/118 (63%), Gaps = 1/118 (0%) Query: 11 PRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRS 70 + L+ ++ G +E ARR+LE GL F +NV R EIDLIMR+ +T +FVEVR++R+ Sbjct: 2 DKTLSRREIGFRYEMIARRYLEKAGLVFKESNVTLRSAEIDLIMRDQKTWVFVEVRFKRN 61 Query: 71 ALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAFN 128 + +G AA S+ KQ +L A +WL++ F RFDV A TGN+ EW ++AFN Sbjct: 62 SFFGSAADSINNKKQKRLRDAAAIWLSKRGSHF-NTSYRFDVFAITGNQFEWFQNAFN 118 >UniRef50_B3ES88 Putative uncharacterized protein n=1 Tax=Candidatus Amoebophilus asiaticus 5a2 RepID=B3ES88_AMOA5 Length = 122 Score = 132 bits (333), Expect = 3e-30, Method: Composition-based stats. Identities = 33/120 (27%), Positives = 59/120 (49%), Gaps = 7/120 (5%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + + Q G +E A +L+ KGL + N + EID+I ++ F+EV+ R SA Sbjct: 6 EASPHQLGKKYEDLATSYLQQKGLMIMVRNYRYKKAEIDIIAQKDACLYFIEVKARTSAK 65 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE----VEWIKDAFN 128 +G A V KQ + A ++ +++ RFD++A + +E+ +DAF+ Sbjct: 66 FGYPEAFVNTYKQQLIKAAAENYILQND---WNSSIRFDIIAILDQKGCINLEYFEDAFS 122 >UniRef50_A6LSN5 UPF0102 protein Cbei_1183 n=5 Tax=Clostridium RepID=Y1183_CLOB8 Length = 123 Score = 132 bits (333), Expect = 3e-30, Method: Composition-based stats. Identities = 35/121 (28%), Positives = 56/121 (46%), Gaps = 8/121 (6%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 K G E A+++LE + N GEID+I ++ I VEV+ R + YG Sbjct: 5 NKDIGSFSEDLAKKYLEKNDYSILDCNFKNFLGEIDIICKKNTLLIIVEVKSRYNNNYGL 64 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN------EVEWIKDAFND 129 SV SKQ +++ A ++ + ++ RFDV+ N ++ IKDAF Sbjct: 65 PRESVNFSKQRSIIKVANSYI--NYKRLPNINVRFDVIEVYLNLESTNFKINHIKDAFRL 122 Query: 130 H 130 + Sbjct: 123 N 123 >UniRef50_Q2S1J6 UPF0102 protein SRU_1822 n=1 Tax=Salinibacter ruber DSM 13855 RepID=Y1822_SALRD Length = 122 Score = 132 bits (332), Expect = 4e-30, Method: Composition-based stats. Identities = 40/123 (32%), Positives = 54/123 (43%), Gaps = 8/123 (6%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMR--EGRTTIFVEVRYRRSA 71 TT GD E A L+G G +A N E+DL+ R + +FVEV+ R Sbjct: 2 ATTNDIGDRGEEIAAAHLDGAGYEILARNYRHSRNEVDLVCRETDAGEYVFVEVKTRSGT 61 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT----GNEVEWIKDAF 127 +G AS+T K+ L AR +L H RFDVVA EV+ ++AF Sbjct: 62 GFGAPEASITAKKRAALQHAARGYLHEHGAEGA--PARFDVVAVMLTGGPPEVQHYENAF 119 Query: 128 NDH 130 Sbjct: 120 WAD 122 >UniRef50_C7MNC2 Predicted endonuclease related to Holliday junction resolvase n=1 Tax=Cryptobacterium curtum DSM 15641 RepID=C7MNC2_CRYCD Length = 186 Score = 132 bits (332), Expect = 4e-30, Method: Composition-based stats. Identities = 25/111 (22%), Positives = 49/111 (44%), Gaps = 1/111 (0%) Query: 8 SGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRY 67 + + + ++ G E A +L +G + N + GE D+I + + F+EV+ Sbjct: 57 APTNKSQNNRELGRRGEDAAAAFLTRRGYEIVERNWMCQAGEADIIAQGEGSIHFIEVKT 116 Query: 68 RRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN 118 R SA G + +V K+ + + A +L N + + FDV++ Sbjct: 117 RSSAARGFPSEAVDAKKRSRYERIAECYLRSCN-NLPEMRVTFDVISILAT 166 >UniRef50_Q0AFH8 UPF0102 protein Neut_1662 n=2 Tax=Proteobacteria RepID=Y1662_NITEC Length = 116 Score = 132 bits (332), Expect = 4e-30, Method: Composition-based stats. Identities = 53/116 (45%), Positives = 76/116 (65%), Gaps = 4/116 (3%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 +TK G E QA +L+ + L + N R GEIDLIM++G T +FVEVR R + L+G Sbjct: 3 STKNKGSDAEQQATIFLQQQQLTLLEKNYRCRFGEIDLIMQDGDTVVFVEVRMRVNQLFG 62 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-VEWIKDAFND 129 GAAAS+T +KQ KL + AR +LAR + + CRFD + +GN +EWI++AF++ Sbjct: 63 GAAASITPAKQLKLTRAARHYLARCD---EDFPCRFDAILISGNREIEWIQNAFDE 115 >UniRef50_B0C8B9 UPF0102 protein AM1_3954 n=1 Tax=Acaryochloris marina MBIC11017 RepID=Y3954_ACAM1 Length = 172 Score = 132 bits (332), Expect = 5e-30, Method: Composition-based stats. Identities = 35/124 (28%), Positives = 52/124 (41%), Gaps = 11/124 (8%) Query: 2 ATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMR------ 55 P RQ Q G+ E +WL + + + R GEID+I R Sbjct: 1 MPSPAAPRPNRQSRNLQVGEWGEQLVCQWLTQQQWHILDRRWHCRWGEIDIIARSNPPLP 60 Query: 56 ---EGRTTIFVEVRYRRSALYG-GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFD 111 FVEV+ RR+ + ++T KQ KL +TA+L+L +H + C+FD Sbjct: 61 GQDSNTRLAFVEVKTRRAQNWDADGLLAITPQKQQKLWKTAQLYLKKHP-ELAELFCQFD 119 Query: 112 VVAF 115 V Sbjct: 120 VALV 123 >UniRef50_D2SDZ8 Putative uncharacterized protein n=1 Tax=Geodermatophilus obscurus DSM 43160 RepID=D2SDZ8_9ACTO Length = 139 Score = 131 bits (331), Expect = 5e-30, Method: Composition-based stats. Identities = 44/135 (32%), Positives = 62/135 (45%), Gaps = 10/135 (7%) Query: 1 MATVPTRSGSP---RQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREG 57 + P RSG R TT G E A +L GLR + N R GE+D++ R+G Sbjct: 6 LRDRPDRSGPTTVGRVRTTSDLGAHGERIAAAYLTDSGLRVLDRNWRCRDGELDIVARDG 65 Query: 58 RTTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG 117 +F EV+ RR+ +G +V KQ +L A+ WLA H+ + RFDVV Sbjct: 66 DALVFCEVKTRRAVGFGHPVEAVGHVKQRRLRVLAQRWLAAHDERA--PELRFDVVGVLV 123 Query: 118 NE-----VEWIKDAF 127 V ++ AF Sbjct: 124 RVDRPALVTHLRAAF 138 >UniRef50_C6IVU3 Putative uncharacterized protein n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6IVU3_9BACL Length = 130 Score = 131 bits (331), Expect = 5e-30, Method: Composition-based stats. Identities = 39/123 (31%), Positives = 55/123 (44%), Gaps = 9/123 (7%) Query: 12 RQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRS- 70 R K+ G E A L +G + N R GE+D+I R+ + VEVR R Sbjct: 10 RGDGRKERGRKAEQAACEHLISQGYTILERNWRCRSGELDIIARKRDVLVNVEVRSRSQQ 69 Query: 71 -ALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-----VEWIK 124 A +G A SV K ++ TA ++L H + RFDV+A T +E I+ Sbjct: 70 AAAFGTPAESVNARKIKQVRDTAAVYL--HRTGQSDANLRFDVIAVTFGRGDNIALEHIQ 127 Query: 125 DAF 127 AF Sbjct: 128 AAF 130 >UniRef50_C9MX50 Endonuclease n=2 Tax=Leptotrichia RepID=C9MX50_9FUSO Length = 121 Score = 131 bits (331), Expect = 7e-30, Method: Composition-based stats. Identities = 40/123 (32%), Positives = 72/123 (58%), Gaps = 7/123 (5%) Query: 9 GSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIM--REGRTTIFVEVR 66 G + ++ G +E A+ +L +GL F+ +N R GEIDLI ++ +T +FVEV+ Sbjct: 2 GQEYSMNKREIGFKYENVAKEYLILQGLTFVESNFYTRFGEIDLIFFEKKSQTLVFVEVK 61 Query: 67 YRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT--GNEVEWIK 124 YR++ +G A VT KQ+K+L +++++L + + R+D+V + +EW+K Sbjct: 62 YRKNDFFGSAIEMVTEEKQNKILASSQIYLLKK---EWDKNVRYDIVGVSRGSGSIEWLK 118 Query: 125 DAF 127 +AF Sbjct: 119 NAF 121 >UniRef50_C8WAJ1 Putative uncharacterized protein n=1 Tax=Atopobium parvulum DSM 20469 RepID=C8WAJ1_ATOPD Length = 172 Score = 131 bits (330), Expect = 8e-30, Method: Composition-based stats. Identities = 33/134 (24%), Positives = 58/134 (43%), Gaps = 11/134 (8%) Query: 3 TVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIF 62 +++++Q G+ E A ++L +G + I N + GE+D++ ++G + Sbjct: 38 DAQESRIPLEEMSSRQIGEKGEEIAAKYLIKRGYKIIQTNWTCQIGEVDIVAQDGDNVVL 97 Query: 63 VEVRYRR---SALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG-- 117 VEV+ RR +V R+KQ K A ++ A H RFDVVA Sbjct: 98 VEVKTRRVLNKDDSIMPELAVNRAKQEKYRTLALMYAALHP---ALTSIRFDVVAINLVA 154 Query: 118 ---NEVEWIKDAFN 128 + + AF+ Sbjct: 155 PSTASLRHLIGAFS 168 >UniRef50_C9R878 Putative uncharacterized protein n=1 Tax=Ammonifex degensii KC4 RepID=C9R878_AMMDK Length = 114 Score = 131 bits (330), Expect = 8e-30, Method: Composition-based stats. Identities = 42/114 (36%), Positives = 55/114 (48%), Gaps = 10/114 (8%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAA 78 G E A +L G I N R GEIDLI REG T +FVEVR R + +G Sbjct: 2 RGKRAEEVAAVYLRKAGWEIIERNYRCRWGEIDLIAREGETIVFVEVRSRSNLAFGLPEE 61 Query: 79 SVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-------EVEWIKD 125 S+ R KQ KL + AR +LAR CRFDV+A + + +++ Sbjct: 62 SIGRRKQEKLRKVARYFLARLGREL---PCRFDVIAVAWDAATGEIKSLRHLRN 112 >UniRef50_UPI00016929A4 hypothetical protein Plarl_14719 n=1 Tax=Paenibacillus larvae subsp. larvae BRL-230010 RepID=UPI00016929A4 Length = 134 Score = 131 bits (330), Expect = 8e-30, Method: Composition-based stats. Identities = 45/135 (33%), Positives = 66/135 (48%), Gaps = 9/135 (6%) Query: 1 MATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLI-MREGRT 59 M + + + + K G E A R+L+ KG + ++ N R GE+D+I + E R Sbjct: 1 MNSDEFQRETKKLDGRKALGKRGEEIAVRYLKEKGFQILSQNWRCRTGEVDIILLEEPRC 60 Query: 60 TIFVEVRYRR-SALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG- 117 IF EVR RR + +G AA S+ R KQ ++ QTA ++ H F+ RFDVV Sbjct: 61 LIFTEVRSRRVTGKFGSAAESINRRKQQQIRQTALYYVYVHPP-FNRYTIRFDVVTVEFF 119 Query: 118 -----NEVEWIKDAF 127 + IK AF Sbjct: 120 PEKEDPVIHHIKAAF 134 >UniRef50_A1K3T3 UPF0102 protein azo0871 n=1 Tax=Azoarcus sp. BH72 RepID=Y871_AZOSB Length = 137 Score = 131 bits (330), Expect = 8e-30, Method: Composition-based stats. Identities = 43/118 (36%), Positives = 63/118 (53%), Gaps = 3/118 (2%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 + G E +A L +G+R +A N + RGGE+DL+ G +FVEVR R + +GG Sbjct: 20 MQARGREGEERAAAHLAAQGVRILARNRHCRGGELDLVGLHGDMLVFVEVRMRANPRFGG 79 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE---VEWIKDAFNDH 130 AAAS+T K+ +++ A+ WLA CRFDVV G W++ AF+ Sbjct: 80 AAASITAEKRRRVILAAQWWLAGEGRRHAHRPCRFDVVLLEGPATTPPTWLQAAFDAD 137 >UniRef50_UPI0001C37581 hypothetical protein RflaF_17327 n=1 Tax=Ruminococcus flavefaciens FD-1 RepID=UPI0001C37581 Length = 125 Score = 131 bits (330), Expect = 9e-30, Method: Composition-based stats. Identities = 36/120 (30%), Positives = 58/120 (48%), Gaps = 8/120 (6%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 +T +TG E +L G G R IA N +GGEID+I G FVEV+ R+ Sbjct: 1 MTKSETGKLGEESVCSYLLGMGYRIIARNYRIKGGEIDIIAENGDYIAFVEVKSRKPDSL 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDV--VAFTGNE---VEWIKDAFN 128 +V++ KQ +++TA + +H + RFDV V ++++ +AF+ Sbjct: 61 VSGFEAVSKRKQGLIIKTAADYCLKHPNVWQP---RFDVASVIIENGRVLSIDYVTNAFD 117 >UniRef50_A4SC34 UPF0102 protein Cvib_0014 n=9 Tax=Chlorobiaceae RepID=Y014_PROVI Length = 131 Score = 131 bits (330), Expect = 9e-30, Method: Composition-based stats. Identities = 46/130 (35%), Positives = 59/130 (45%), Gaps = 15/130 (11%) Query: 14 LTTKQ---TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRS 70 + + Q G E A +LE KG R + N EID+I +G T F+EV+ R S Sbjct: 1 MNSNQPWLLGREGERIAAGFLEKKGYRIVQRNFRFHRNEIDIIAMDGETVCFIEVKTRSS 60 Query: 71 ALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF----------TGNEV 120 A G A +VT KQ ++ + A WLA S DCRFDVV V Sbjct: 61 ATKGEPAEAVTPGKQREIARAAEAWLAF--SSEGEPDCRFDVVGIIAEPLSGGRFRARSV 118 Query: 121 EWIKDAFNDH 130 E DAF+D Sbjct: 119 ELFADAFHDP 128 >UniRef50_C4XKL5 Putative uncharacterized protein n=1 Tax=Desulfovibrio magneticus RS-1 RepID=C4XKL5_DESMR Length = 133 Score = 130 bits (329), Expect = 9e-30, Method: Composition-based stats. Identities = 38/120 (31%), Positives = 59/120 (49%), Gaps = 6/120 (5%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + G EA A L KG + N RGGE+DL+ R+G T +FVEV+ R + Sbjct: 2 TAKHLEFGREGEAAAEAHLIAKGFAVVTRNYRARGGEVDLVCRDGDTVVFVEVKARGEGM 61 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN----EVEWIKDAFN 128 G +VT +K+ ++++ A +L+ + + CRFDVVA + DAF+ Sbjct: 62 RGRPEEAVTPAKRRRIVRAAAQFLSERD--WWDRPCRFDVVAVESRSGHLTASHVADAFS 119 >UniRef50_D0LAN4 Putative uncharacterized protein n=1 Tax=Gordonia bronchialis DSM 43247 RepID=D0LAN4_GORB4 Length = 137 Score = 130 bits (329), Expect = 9e-30, Method: Composition-based stats. Identities = 37/138 (26%), Positives = 61/138 (44%), Gaps = 13/138 (9%) Query: 1 MATVPTRSGSPRQLTTKQ-TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRT 59 M PR+ ++ G E A ++ +G R + N R GE+DLI +GR Sbjct: 1 MTAHSAAEPGPRRADRRRHIGHLGEDIAAEFVTNRGWRVLHRNWRNRYGELDLIAADGRV 60 Query: 60 TIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN- 118 + VEV+ R S +Y +VT +K ++ + R+WL+ NGS+ RFDV++ + Sbjct: 61 LVVVEVKTRASLMYSDPLEAVTPAKLSRMRKLTRMWLSEQNGSWS--QIRFDVISVQLDP 118 Query: 119 ---------EVEWIKDAF 127 + F Sbjct: 119 HHPDDRASARIRHHLGVF 136 >UniRef50_C2G0R5 Possible endonuclease n=2 Tax=Sphingobacterium spiritivorum RepID=C2G0R5_9SPHI Length = 118 Score = 130 bits (329), Expect = 9e-30, Method: Composition-based stats. Identities = 36/117 (30%), Positives = 52/117 (44%), Gaps = 6/117 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + G E A L G + +A N E+D++ +G +FVEV+ R S + Sbjct: 2 AQHLEQGKKGEQMALSHLTALGYQILALNWRTGKLEVDILAYDGDILVFVEVKTRSSNAH 61 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE---VEWIKDAF 127 G A V KQ KL++ AR + D RFD+V+ E + IKDAF Sbjct: 62 GEPADFVDIQKQRKLIRAARACIEERGHQG---DIRFDIVSVYLGEPAYIHLIKDAF 115 >UniRef50_Q15PJ2 UPF0102 protein Patl_3694 n=1 Tax=Pseudoalteromonas atlantica T6c RepID=Y3694_PSEA6 Length = 114 Score = 130 bits (329), Expect = 1e-29, Method: Composition-based stats. Identities = 32/111 (28%), Positives = 54/111 (48%), Gaps = 5/111 (4%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAA 78 G EAQA +L+ +GL + N R GEID+IMR+ + +FVEV+YR +G A Sbjct: 2 KGAQGEAQALAYLKQQGLTLVTQNYRCRSGEIDIIMRDHQELVFVEVKYRSGQQFGSAVE 61 Query: 79 SVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT-----GNEVEWIK 124 K+ K + ++ + + + R D+V +++ W+K Sbjct: 62 FFHPHKRRKFESAIQHYMLDNKLNPSLIAHRIDIVGIDVLSNNNDKISWLK 112 >UniRef50_D1PKZ1 Putative choloylglycine hydrolase n=1 Tax=Subdoligranulum variabile DSM 15176 RepID=D1PKZ1_9FIRM Length = 117 Score = 130 bits (329), Expect = 1e-29, Method: Composition-based stats. Identities = 37/117 (31%), Positives = 53/117 (45%), Gaps = 6/117 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 ++ G EA A ++ +G + N R GEIDLI+ + +F EV+ R + Sbjct: 2 SRNIGQKGEAIAAQYYRQRGYLVLGHNYRTRMGEIDLILYKEDLIVFAEVKTRTGRMLAT 61 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG----NEVEWIKDAFN 128 A +V KQ +L A +L N F + RFDVV T +V I DAF Sbjct: 62 PAEAVDLHKQQRLRLAAERYLQ--NSPFSEANVRFDVVEVTPAAKGWQVHCIMDAFQ 116 >UniRef50_C6XZ20 Putative uncharacterized protein n=2 Tax=Pedobacter RepID=C6XZ20_PEDHD Length = 120 Score = 130 bits (329), Expect = 1e-29, Method: Composition-based stats. Identities = 33/119 (27%), Positives = 51/119 (42%), Gaps = 8/119 (6%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 T G E A +LE G R + N E+D+I + IFVEV+ R S Y Sbjct: 2 ATHNDLGWRGEQIAVEYLENLGYRILNRNWKCARAEVDVIADQEGKLIFVEVKTRSSTDY 61 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF-----TGNEVEWIKDAF 127 G V+ K+ +L + ++ N + RFD++A ++ I+DAF Sbjct: 62 GQPEEFVSYKKERQLEFASSAYIEMRNHQGE---IRFDIIAIVFENKDIYKINHIEDAF 117 >UniRef50_C2D6J2 Putative uncharacterized protein n=1 Tax=Atopobium vaginae DSM 15829 RepID=C2D6J2_9ACTN Length = 176 Score = 130 bits (329), Expect = 1e-29, Method: Composition-based stats. Identities = 34/130 (26%), Positives = 57/130 (43%), Gaps = 10/130 (7%) Query: 6 TRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEV 65 R + L +K+ G E A +LE + + N GE+D+I +G T+FVEV Sbjct: 44 PRKSAVNTLNSKELGALGENLACCFLERQDFEILDRNWKCADGEVDIIASKGDETVFVEV 103 Query: 66 RYRRSALYGG--AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF-----TGN 118 + R G +V + KQ + + AR + + + RFDV+A + Sbjct: 104 KTRLQNKSGELFPEIAVDKQKQSRYIALARSYNTAYPMCE---NIRFDVIALAILDDSHA 160 Query: 119 EVEWIKDAFN 128 ++ I+ AF Sbjct: 161 QLRHIQSAFE 170 >UniRef50_C9MR57 Putative endonuclease n=2 Tax=Prevotella RepID=C9MR57_9BACT Length = 121 Score = 130 bits (328), Expect = 1e-29, Method: Composition-based stats. Identities = 31/119 (26%), Positives = 54/119 (45%), Gaps = 8/119 (6%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 G E A +L+ +G + N + ++D++ + + VEV+ R+ + Sbjct: 4 HNDIGKWGEEVAANYLQQQGYTILHRNWMYQHRDLDIVAMDAGALVIVEVKTRKDERFVN 63 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG-----NEVEWIKDAFND 129 A A+VT K L A ++ R+N S + RFD++ G +EV +KDAF Sbjct: 64 ADAAVTPQKVRSLSLAANAYVKRYNISLE---IRFDIITIVGCPDDKHEVRHVKDAFLP 119 >UniRef50_C8WGY4 Putative uncharacterized protein n=2 Tax=Eggerthella lenta DSM 2243 RepID=C8WGY4_EGGLE Length = 173 Score = 130 bits (328), Expect = 1e-29, Method: Composition-based stats. Identities = 34/119 (28%), Positives = 57/119 (47%), Gaps = 7/119 (5%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 + G E A R+L+ +G + N GE D+I R+G + +FVEV+ R S G Sbjct: 55 RNAELGRRGEDAAARFLDRRGYEIVERNWTCAAGEADIIARDGDSVVFVEVKTRSSCDCG 114 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFD---VVAFTGNE--VEWIKDAFN 128 A +V +K+ + + A L+L + V RFD +VA + + + +AF+ Sbjct: 115 MPAEAVDEAKRDRYERIAALFLQGFDVV--DVPVRFDIVSIVAISPDRAMIRHHINAFS 171 >UniRef50_A3N211 UPF0102 protein APL_1363 n=33 Tax=Pasteurellaceae RepID=Y1363_ACTP2 Length = 123 Score = 130 bits (327), Expect = 2e-29, Method: Composition-based stats. Identities = 58/116 (50%), Positives = 75/116 (64%), Gaps = 2/116 (1%) Query: 12 RQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSA 71 + LT + G +E +AR +LE GL+FIAAN + GE+DLIMR+G T +FVEVR R+S Sbjct: 5 KTLTKRSQGANFEQKAREFLERNGLKFIAANQQFKCGELDLIMRQGDTFVFVEVRQRKSN 64 Query: 72 LYGGAAASVTRSKQHKLLQTARLWL-ARHNGSFDTVDCRFDVVAFTGNEVE-WIKD 125 +G A S+ KQ K L A +WL RH S DT +CRFDVVAF GN+ WI + Sbjct: 65 RFGSAVESIDYRKQQKWLDAANMWLFTRHKQSLDTANCRFDVVAFEGNDPPLWIPN 120 >UniRef50_D2RIH7 Putative uncharacterized protein n=1 Tax=Acidaminococcus fermentans DSM 20731 RepID=D2RIH7_ACIFE Length = 118 Score = 130 bits (327), Expect = 2e-29, Method: Composition-based stats. Identities = 37/117 (31%), Positives = 56/117 (47%), Gaps = 6/117 (5%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 ++ G+ E A R+LE +G + N GEID+I R FVEV+ R S +G Sbjct: 4 QRRRFGNWGEDAAVRYLETRGYEILDRNYRSSWGEIDIIARYRGVLAFVEVKTRHSLKFG 63 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT----GNEVEWIKDAF 127 AA+VTR KQ +L +TA +L + RFD++ + +K+ F Sbjct: 64 RPAAAVTREKQIRLRKTAWCYLRENQVF--RYRSRFDIIEILDLYGKISLNHLKNCF 118 >UniRef50_Q1QVF6 UPF0102 protein Csal_2201 n=1 Tax=Chromohalobacter salexigens DSM 3043 RepID=Y2201_CHRSD Length = 123 Score = 130 bits (327), Expect = 2e-29, Method: Composition-based stats. Identities = 53/124 (42%), Positives = 72/124 (58%), Gaps = 7/124 (5%) Query: 10 SPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRR 69 S +++ G E +A WL GLR + AN + R GEIDLIMR+G T +F+EVR+RR Sbjct: 2 SNSNNDSRRRGLEMERRAADWLASHGLRLVDANQHARRGEIDLIMRDGDTLVFIEVRHRR 61 Query: 70 SALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN----EVEWIKD 125 A +G +VT +KQ +L+ AR +L R+ S CRFDVV TG EWI+ Sbjct: 62 DARHGHPFETVTAAKQRRLIGAARFYLHRNGLSCA---CRFDVVGVTGTPPHLSFEWIRS 118 Query: 126 AFND 129 AF+ Sbjct: 119 AFDA 122 >UniRef50_C1ZN06 Putative uncharacterized protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZN06_PLALI Length = 181 Score = 130 bits (327), Expect = 2e-29, Method: Composition-based stats. Identities = 40/131 (30%), Positives = 61/131 (46%), Gaps = 11/131 (8%) Query: 3 TVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIF 62 P S R L G+ EA+A ++L+ G + +A N+ R GEIDL+ EG T +F Sbjct: 52 RSPHSPSSHRTLN---IGEQGEARAEKYLKELGYQILARNLRTRLGEIDLLALEGETIVF 108 Query: 63 VEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN---- 118 +EV+ R+S G ++ KQ +L + A L + R DV+ TG Sbjct: 109 IEVKTRKSDARGRPEEAIHPRKQKQLSRVAMALLKSKG--WLHRQSRIDVITITGEPESP 166 Query: 119 --EVEWIKDAF 127 E+ + AF Sbjct: 167 DCELRHYRHAF 177 >UniRef50_C0ZFM4 Putative uncharacterized protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0ZFM4_BREBN Length = 125 Score = 130 bits (327), Expect = 2e-29, Method: Composition-based stats. Identities = 39/124 (31%), Positives = 57/124 (45%), Gaps = 13/124 (10%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 + G E A +L KG R + NV + GE+DLI +G+ +F+EVR RRS +G Sbjct: 4 RRRLLGQRGEQLAEGYLVNKGFRIVERNVRTKRGEMDLIALDGKCLVFIEVRTRRSQSFG 63 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT-----------GNEVEWI 123 A S+T K+ KL + A +L + RFDV+A +E Sbjct: 64 TAGESITWKKKQKLRELALEYLQKSAQPI--PSFRFDVIAIYTGASTQGEDFMKPVIEHY 121 Query: 124 KDAF 127 + AF Sbjct: 122 ESAF 125 >UniRef50_C2HKC4 Possible endonuclease n=2 Tax=Finegoldia magna RepID=C2HKC4_PEPMA Length = 115 Score = 129 bits (326), Expect = 2e-29, Method: Composition-based stats. Identities = 31/113 (27%), Positives = 53/113 (46%), Gaps = 4/113 (3%) Query: 17 KQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGA 76 K G E A +L KG I N +ER GE+D++ + VEV+ R +G Sbjct: 5 KNRGKFAEDYACEYLIEKGYEIIDRNYSERIGELDIVCTYENYLVIVEVKARTDDKFGAP 64 Query: 77 AASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN--EVEWIKDAF 127 + VT KQ ++ +T +++ +++ RFDV+ + ++ DAF Sbjct: 65 SDFVTLGKQDRIRKTTEIYIDKND--LYDYQPRFDVIEIYLDNFKLNHYIDAF 115 >UniRef50_Q8R616 UPF0102 protein FN1370 n=9 Tax=Fusobacterium RepID=Y1370_FUSNN Length = 119 Score = 129 bits (326), Expect = 2e-29, Method: Composition-based stats. Identities = 31/112 (27%), Positives = 63/112 (56%), Gaps = 2/112 (1%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + T++ G+ +E ++ L + + + N + GEID+I + + IF+EV+YR++ + Sbjct: 1 MNTREIGNEYEDKSVEILVKEDYKILERNYQNKFGEIDIIAEKNKEIIFIEVKYRKTNKF 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 G +V R K K+L+ A ++ + RFD +++ G+E++WIK+ Sbjct: 61 GYGYEAVDRRKIMKILKLANYYIQSKK--YQDYKIRFDCMSYLGDELDWIKN 110 >UniRef50_A1R7F9 UPF0102 protein AAur_2443 n=3 Tax=Micrococcaceae RepID=Y2443_ARTAT Length = 121 Score = 129 bits (326), Expect = 2e-29, Method: Composition-based stats. Identities = 32/118 (27%), Positives = 56/118 (47%), Gaps = 8/118 (6%) Query: 14 LTTKQT-GDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + K G + EA A +LE +G+R + N GEID++ +G T + EV+ R+S Sbjct: 4 MRAKDLLGRSGEALAADFLENQGMRIVDRNWRCPDGEIDIVAIDGDTLVVAEVKTRKSLD 63 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVA-----FTGNEVEWIKD 125 YG +V +K +L + + W +H + R DVV+ ++E ++ Sbjct: 64 YGHPFEAVDAAKLARLHRLSSSWCRQHQLNAPRR--RIDVVSVIDNGVVEPQLEHLRG 119 >UniRef50_Q1LHS4 UPF0102 protein Rmet_3430 n=2 Tax=Betaproteobacteria RepID=Y3430_RALME Length = 134 Score = 129 bits (326), Expect = 2e-29, Method: Composition-based stats. Identities = 50/129 (38%), Positives = 74/129 (57%), Gaps = 8/129 (6%) Query: 4 VPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMR-EGRTTIF 62 +P R S +T + G E +A +L+ +GL + N +GGEIDLIMR T +F Sbjct: 1 MPARPAS----STTRQGALAEDRALAYLQRQGLVAVERNYRCKGGEIDLIMRAADDTLVF 56 Query: 63 VEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEW 122 VEVR R +GGAAAS+T +KQ ++L+ A +LA + CR DVVA +EW Sbjct: 57 VEVRKRGGRGFGGAAASITLTKQRRVLRAASHYLATLD---RLPPCRVDVVALDPGRLEW 113 Query: 123 IKDAFNDHS 131 +++AF+ + Sbjct: 114 LRNAFDLGA 122 >UniRef50_A1U3H0 UPF0102 protein Maqu_2464 n=3 Tax=Marinobacter RepID=Y2464_MARAV Length = 123 Score = 129 bits (325), Expect = 3e-29, Method: Composition-based stats. Identities = 42/120 (35%), Positives = 62/120 (51%), Gaps = 7/120 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 +++ G +E A R+LE KG+R I NV+ RGGEIDLI + +F EVR+R Sbjct: 6 SRKLGQHYEGVAARYLESKGIRIIERNVHNRGGEIDLIGMDAEALVFFEVRFRADGALVD 65 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-----EVEWIKDAFNDH 130 +SV+ KQ +L++ A +L RH R DV+ T ++WIK+A Sbjct: 66 PISSVSAVKQQRLVRAASFYLHRHG--LWDRVSRIDVIGITPGHSSKYRIQWIKNAIQAD 123 >UniRef50_B7K4B3 Putative uncharacterized protein n=4 Tax=Cyanobacteria RepID=B7K4B3_CYAP8 Length = 144 Score = 129 bits (325), Expect = 3e-29, Method: Composition-based stats. Identities = 33/103 (32%), Positives = 43/103 (41%), Gaps = 4/103 (3%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRT--TIFVEVRYRRSALY 73 G E RWL+ +G + GGEIDLI T FVEV+ R + Sbjct: 1 MTSIGQLGENLVARWLQSQGWTILQQRWRCPGGEIDLIAHSQGTNLITFVEVKTRSRGNW 60 Query: 74 G-GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF 115 ++T KQ KL Q+A +LA + CRFDV Sbjct: 61 DADGLLAITPQKQVKLTQSAAYFLAEYP-HLADFPCRFDVALV 102 >UniRef50_Q3A2F1 UPF0102 protein Pcar_2217 n=2 Tax=Deltaproteobacteria RepID=Y2217_PELCD Length = 123 Score = 129 bits (325), Expect = 3e-29, Method: Composition-based stats. Identities = 37/120 (30%), Positives = 58/120 (48%), Gaps = 6/120 (5%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 G E A +L +G++ + N+ GE+D++ R R IFVEV+ RR +G Sbjct: 4 QRLSLGRWGEDIAAGYLRRQGMKILDRNIRTPVGELDIVARHKRMLIFVEVKTRRGISHG 63 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF----TGNEVEWIKDAFNDH 130 +V +KQ ++L+ A+ +LA D + RFDV+A EVE AF+ Sbjct: 64 YPQEAVGAAKQRQILRAAQWYLAERR--LDRLQPRFDVIAVRRRGDEAEVEHFPGAFDVD 121 >UniRef50_Q2JJU2 UPF0102 protein CYB_2119 n=3 Tax=Synechococcus RepID=Y2119_SYNJB Length = 133 Score = 129 bits (324), Expect = 3e-29, Method: Composition-based stats. Identities = 35/130 (26%), Positives = 61/130 (46%), Gaps = 11/130 (8%) Query: 9 GSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYR 68 PR+ + + TG+ E R++L +G + +A GE+DL+ + IFVEV+ R Sbjct: 3 PLPRRASLQNTGNVGEGWVRQYLCQQGWQILAQRWRCPWGELDLVAHKADVLIFVEVKTR 62 Query: 69 RSALYG-GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-------- 119 + G +V KQ +L++ A+ +L++H + CRFDV Sbjct: 63 SPGSWDRGGLLAVGIPKQRRLIRAAQAFLSQHP-HLSELSCRFDVALIERRASREGVSYA 121 Query: 120 -VEWIKDAFN 128 V+++ AF Sbjct: 122 LVDYLPAAFE 131 >UniRef50_C7LP67 Putative uncharacterized protein n=1 Tax=Desulfomicrobium baculatum DSM 4028 RepID=C7LP67_DESBD Length = 134 Score = 129 bits (324), Expect = 4e-29, Method: Composition-based stats. Identities = 38/120 (31%), Positives = 58/120 (48%), Gaps = 6/120 (5%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 TG A E A +L KG+R + N GEIDLI + T +FVEV+ R A+ Sbjct: 2 AARHLITGQAGEELAAAFLVEKGMRIVERNFRCASGEIDLICEDAGTIVFVEVKTRSGAV 61 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG----NEVEWIKDAFN 128 G ++ +K+ +L++ L+L+RH + CRFD+V VE +D + Sbjct: 62 RGEPGEAIGPAKKKRLIKAGALYLSRHRA--WSRPCRFDLVGILFLHGETVVEHWEDIID 119 >UniRef50_B0TH88 Putative uncharacterized protein n=1 Tax=Heliobacterium modesticaldum Ice1 RepID=B0TH88_HELMI Length = 119 Score = 128 bits (323), Expect = 4e-29, Method: Composition-based stats. Identities = 36/121 (29%), Positives = 55/121 (45%), Gaps = 7/121 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + G E +A + L G G I N GEIDLI+RE +FVEVR R S + Sbjct: 1 MNRVLLGRWGEERALQHLLGLGWSLICQNYRTPRGEIDLILRESNWIVFVEVRTRSSERF 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT-----GNEVEWIKDAFN 128 G +V K+ +L+ TA +L + G RFD+++ +++ I+ F Sbjct: 61 GRGEETVDYRKRRRLMATAGHFLGTYQGPPGDP--RFDLISILRLDSGEEQLQHIRGMFT 118 Query: 129 D 129 Sbjct: 119 P 119 >UniRef50_Q60CC4 UPF0102 protein MCA0184 n=1 Tax=Methylococcus capsulatus RepID=Y184_METCA Length = 123 Score = 128 bits (323), Expect = 5e-29, Method: Composition-based stats. Identities = 54/126 (42%), Positives = 68/126 (53%), Gaps = 11/126 (8%) Query: 8 SGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRY 67 SGS R LT G E+ +L +GLR I N R GEIDL+M EG T +FVEVRY Sbjct: 4 SGSHRPLT----GPQAESWTAEYLTARGLRLIERNYRCRLGEIDLVMAEGATLVFVEVRY 59 Query: 68 RRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN----EVEWI 123 R YGGA ASV R K +LL TA+ ++ H + R DVVA + + EWI Sbjct: 60 RSGKRYGGALASVDRHKCRRLLATAQHYMVEHRVTGA---VRLDVVAVSPGAAGPQAEWI 116 Query: 124 KDAFND 129 ++A Sbjct: 117 RNAIEA 122 >UniRef50_Q313K2 UPF0102 protein Dde_1093 n=1 Tax=Desulfovibrio desulfuricans subsp. desulfuricans str. G20 RepID=Y1093_DESDG Length = 202 Score = 128 bits (323), Expect = 5e-29, Method: Composition-based stats. Identities = 29/111 (26%), Positives = 52/111 (46%), Gaps = 2/111 (1%) Query: 6 TRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEV 65 R + G E A +L G+R +A N E+D+I ++ T +F EV Sbjct: 3 ARRAAGSCPAHIAAGRLGEEAACAYLAASGMRILARNWRAGHLELDIIAQDNGTIVFAEV 62 Query: 66 RYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT 116 + R + ++T +K+ +L++ A +WL+ ++ CRFD+V T Sbjct: 63 KTRAARGLESPHEALTPAKRSRLVRAAGMWLSSNDM--WDRPCRFDLVCVT 111 >UniRef50_A4YJR8 UPF0102 protein BRADO0179 n=14 Tax=Rhizobiales RepID=Y179_BRASO Length = 141 Score = 128 bits (323), Expect = 5e-29, Method: Composition-based stats. Identities = 41/129 (31%), Positives = 63/129 (48%), Gaps = 4/129 (3%) Query: 2 ATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTI 61 A P ++ SP ++ +TG + EA+A L KG R +A GEIDLI R+ Sbjct: 14 APKPAKTASPERVAAFRTGLSAEARAAALLIAKGYRILAKRFRTPHGEIDLIARKRGLVA 73 Query: 62 FVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEV- 120 FVEV+ R A AA +VT +Q +++ A+ WL H ++ RFD + + Sbjct: 74 FVEVKAR--ASLDDAAYAVTPRQQQRIIDAAQAWLMAHP-DHAELELRFDAILVAPRSLP 130 Query: 121 EWIKDAFND 129 + AF+ Sbjct: 131 RHLMAAFDA 139 >UniRef50_Q1YQG9 Putative uncharacterized protein n=1 Tax=gamma proteobacterium HTCC2207 RepID=Q1YQG9_9GAMM Length = 133 Score = 128 bits (323), Expect = 5e-29, Method: Composition-based stats. Identities = 40/125 (32%), Positives = 62/125 (49%), Gaps = 11/125 (8%) Query: 11 PRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRS 70 P + +G E A +L+ KGL + N R GEIDLIMR+ +FVEVR+R + Sbjct: 7 PNNKKERLSGAEAEQLALDFLQAKGLELVVKNFRTRRGEIDLIMRDNAVLVFVEVRFRSN 66 Query: 71 ALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE----------V 120 +G A S+T K +L A+ ++ R + + V RFD VA + + + Sbjct: 67 LNFGTAEESITAQKCQRLSSAAQAYMQREGLT-ERVSGRFDAVAISPAKPHRQSSGMYSI 125 Query: 121 EWIKD 125 WI++ Sbjct: 126 NWIQN 130 >UniRef50_A8UQV9 Putative uncharacterized protein n=1 Tax=Hydrogenivirga sp. 128-5-R1-1 RepID=A8UQV9_9AQUI Length = 111 Score = 128 bits (323), Expect = 6e-29, Method: Composition-based stats. Identities = 27/108 (25%), Positives = 50/108 (46%), Gaps = 3/108 (2%) Query: 18 QTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAA 77 + G +E +A +L+ KG +A N + R GEID++ R+G +FVEV+ + G A Sbjct: 2 RRGSEYEERACLYLQDKGYSIVARNYHCRSGEIDIVARQGGELVFVEVKGGKDTSLGHPA 61 Query: 78 ASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 K +++ A ++ R D+V + +E ++ Sbjct: 62 ERFNPRKLDRIIACAFRFMEEMGLEE---PFRVDLVVVLEDRIEHYEN 106 >UniRef50_A5WCR1 UPF0102 protein PsycPRwf_0497 n=1 Tax=Psychrobacter sp. PRwf-1 RepID=Y497_PSYWF Length = 142 Score = 128 bits (322), Expect = 7e-29, Method: Composition-based stats. Identities = 55/146 (37%), Positives = 78/146 (53%), Gaps = 20/146 (13%) Query: 1 MATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERG-GEIDLIMREGR- 58 M P SP+ ++ G +E A +L+ +GLR IA N + GEIDL++ E Sbjct: 1 MPANPELLISPK----QRQGGGYEQLAADFLQQQGLRLIARNWQQPKVGEIDLVLIEHGR 56 Query: 59 ---TTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF 115 +F EVR R+ YG A AS+TRSKQ KL++TAR +L RH+ + +CRFDVV F Sbjct: 57 SWNVLVFAEVRKRKLLGYGDALASITRSKQKKLIKTARYFL-RHHPEYADFECRFDVVGF 115 Query: 116 TGN----------EVEWIKDAFNDHS 131 T + EW++ AF + Sbjct: 116 TERTGRSGQGEPLQSEWLQGAFLAPA 141 >UniRef50_C6VV91 Putative uncharacterized protein n=2 Tax=Flexibacteraceae RepID=C6VV91_DYAFD Length = 119 Score = 128 bits (322), Expect = 7e-29, Method: Composition-based stats. Identities = 31/119 (26%), Positives = 53/119 (44%), Gaps = 8/119 (6%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 G E A +L KG + IA N EIDLI + IF+EV+ R +G Sbjct: 3 QANDLGRWGETTAASFLAEKGFKIIARNYRNWQSEIDLIAAKDDMLIFIEVKTRTGMAFG 62 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG-----NEVEWIKDAFN 128 V +K +++ A ++ + ++ D RFD+++ ++ I+DAF+ Sbjct: 63 MPEEFVNVTKARLIMRAAEQYI--FDVDWEN-DVRFDIISILVLPDGSTDIRHIEDAFS 118 >UniRef50_A3DDG4 UPF0102 protein Cthe_0758 n=6 Tax=Clostridia RepID=Y758_CLOTH Length = 130 Score = 128 bits (322), Expect = 7e-29, Method: Composition-based stats. Identities = 37/126 (29%), Positives = 57/126 (45%), Gaps = 12/126 (9%) Query: 12 RQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERG-GEIDLIMREGRTTIFVEVRYRRS 70 + + G EA A ++L+ + N R GEID+I RE FVEV+ R S Sbjct: 7 NKNNKRAAGSIGEAAAVQFLKENNYEILETNFRYRRLGEIDIISREKDYICFVEVKARSS 66 Query: 71 ALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF---------TGNEVE 121 YG +V KQ + + A+++L ++ + + RFDVV E+ Sbjct: 67 LGYGYPREAVNIRKQENIRRLAQIYLCKNRIN--DLKVRFDVVEVYMEKKGDDIEVKEIS 124 Query: 122 WIKDAF 127 IK+AF Sbjct: 125 LIKNAF 130 >UniRef50_D1SBF6 Putative uncharacterized protein n=1 Tax=Micromonospora aurantiaca ATCC 27029 RepID=D1SBF6_9ACTO Length = 150 Score = 127 bits (321), Expect = 8e-29, Method: Composition-based stats. Identities = 44/136 (32%), Positives = 59/136 (43%), Gaps = 12/136 (8%) Query: 2 ATVPTRSGSPRQL-----TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE 56 + P S R L + G E A R L GLR +A N GEID+I E Sbjct: 17 SAPPATVASGRILAGMTNRNRAVGAYGERCALRHLIETGLRPVARNWRCPEGEIDIIAWE 76 Query: 57 GRTTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT 116 G EV+ RRS +G A +V R+K +L A WLA + + RFDV++ Sbjct: 77 GPVLAICEVKTRRSEQFGSPAEAVVRAKARRLRGLAARWLAETGTTAA--EVRFDVLSVR 134 Query: 117 -----GNEVEWIKDAF 127 VE ++ AF Sbjct: 135 LPLTGPARVEHLRGAF 150 >UniRef50_C1AG13 UPF0102 protein JTY_2914 n=20 Tax=Mycobacterium RepID=Y2914_MYCBT Length = 128 Score = 127 bits (321), Expect = 8e-29, Method: Composition-based stats. Identities = 40/126 (31%), Positives = 58/126 (46%), Gaps = 12/126 (9%) Query: 10 SPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREG--RTTIFVEVRY 67 + + +T Q G EA A +L GLR + N R GE+D+I + RT +FVEV+ Sbjct: 3 TLKTMTRVQLGAMGEALAVDYLTSMGLRILNRNWRCRYGELDVIACDAATRTVVFVEVKT 62 Query: 68 RRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN--------E 119 R YGG A +VT K +L + A LWLA + R DV+ E Sbjct: 63 RTGDGYGGLAHAVTERKVRRLRRLAGLWLADQEERWA--AVRIDVIGVRVGPKNSGRTPE 120 Query: 120 VEWIKD 125 + ++ Sbjct: 121 LTHLQG 126 >UniRef50_A1VIW8 UPF0102 protein Pnap_0271 n=10 Tax=Burkholderiales RepID=Y271_POLNA Length = 153 Score = 127 bits (321), Expect = 9e-29, Method: Composition-based stats. Identities = 58/141 (41%), Positives = 78/141 (55%), Gaps = 16/141 (11%) Query: 2 ATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERG---GEIDLIMR-EG 57 +T + P+Q+TTK GDA E+ AR +L G GLR+I +N G GEIDL+MR Sbjct: 15 STAGGAAALPKQVTTKSRGDAAESAARAYLVGAGLRWIESNYRTPGRGGGEIDLVMRVPD 74 Query: 58 RTTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG 117 T +FVEVR R SA +GGA AS++ KQ +++ AR +L R CRFDVV G Sbjct: 75 GTLVFVEVRQRSSASHGGAGASISAVKQRRIIFAARHYLMRF---ASLPPCRFDVVLVHG 131 Query: 118 ---------NEVEWIKDAFND 129 +EW+ AF+ Sbjct: 132 ALSGGESPQATIEWLPAAFDA 152 >UniRef50_Q11XW1 UPF0102 protein CHU_0465 n=1 Tax=Cytophaga hutchinsonii ATCC 33406 RepID=Y465_CYTH3 Length = 113 Score = 127 bits (320), Expect = 1e-28, Method: Composition-based stats. Identities = 40/113 (35%), Positives = 56/113 (49%), Gaps = 4/113 (3%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + Q G A E +A +LE +G + I N+ GEIDLI F+EV+YR+ Y Sbjct: 1 MEHIQKGIAGEQKACAFLEQQGYKIIEKNLRIGKGEIDLIAVHNNCMCFIEVKYRKHNRY 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEV-EWIKD 125 G VT+ K K+ +TA ++ N RFDVVA TG E+ + D Sbjct: 61 GFPEEFVTQKKLLKIQETAEAYIYTVNWQGR---IRFDVVAITGEELPVHLMD 110 >UniRef50_C0VVC1 Endonuclease n=2 Tax=Corynebacterium glucuronolyticum RepID=C0VVC1_9CORY Length = 115 Score = 127 bits (320), Expect = 1e-28, Method: Composition-based stats. Identities = 43/111 (38%), Positives = 59/111 (53%), Gaps = 5/111 (4%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 G E ARR+ + +G F+AANV GEIDLIM+ G TT+FVEV+ R ++ G Sbjct: 4 KNLLLGRRGETIARRYYQDRGYGFVAANVRYTCGEIDLIMQHGDTTVFVEVKTRTNSAMG 63 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 GA +VT +K ++ + A WL RFDVV GNE+ + Sbjct: 64 GA-EAVTPAKLRRVQRAAMTWLEGKP----YRPIRFDVVEIIGNEITCFEG 109 >UniRef50_B9ZKW9 Putative uncharacterized protein n=1 Tax=Thioalkalivibrio sp. K90mix RepID=B9ZKW9_9GAMM Length = 118 Score = 127 bits (320), Expect = 1e-28, Method: Composition-based stats. Identities = 43/112 (38%), Positives = 60/112 (53%), Gaps = 4/112 (3%) Query: 20 GDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAAS 79 G A E +A +L G+GL +A NV GE+DL+ REG T + VEVR R +GG A S Sbjct: 8 GQAAEDRAAHYLTGQGLILVARNVRRPWGELDLVAREGDTLVLVEVRKRSHRNFGGGAES 67 Query: 80 VTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-VEWIKDAFNDH 130 + K+ +LL+ A +L RFDVV G++ +EW+ DA Sbjct: 68 IDAGKRRRLLRAAEGYLQETRWQG---PVRFDVVLLDGDDTIEWLPDAIQGD 116 >UniRef50_A7NKS5 UPF0102 protein Rcas_2007 n=2 Tax=Roseiflexus RepID=Y2007_ROSCS Length = 124 Score = 127 bits (320), Expect = 1e-28, Method: Composition-based stats. Identities = 39/121 (32%), Positives = 56/121 (46%), Gaps = 7/121 (5%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 + GD E A R+L +G +A GEID++ R +FVEVR RR G Sbjct: 4 RRTRLGDWGETMAARFLARRGYEVLARKWRCAAGEIDIVARHDGDLVFVEVRTRRGRDPG 63 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN------EVEWIKDAFN 128 AA S+T +K+ +L+ A +LA H+ R DVVA + +E I A Sbjct: 64 MAAESITNAKRARLMALADAFLAAHDLP-SNTPWRIDVVAISVGLRAQEVSIEHIPYAVE 122 Query: 129 D 129 + Sbjct: 123 E 123 >UniRef50_A5FKL6 UPF0102 protein Fjoh_1217 n=17 Tax=Bacteroidetes RepID=Y1217_FLAJ1 Length = 123 Score = 127 bits (319), Expect = 1e-28, Method: Composition-based stats. Identities = 28/119 (23%), Positives = 51/119 (42%), Gaps = 5/119 (4%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + G E A LE + + + N + E+D++ ++ + VEV+ R S + Sbjct: 2 AEHNELGKLGEDLAAEHLEKENYKILERNWVYKNAEVDILAQKENILVVVEVKTRSSLDF 61 Query: 74 GGAAASVTRSKQHKLLQTARLWL-ARHNGSFDTVDCRFDVVAFTGNE----VEWIKDAF 127 G V K L++ ++ R + ++ RFD+VA N +E + DAF Sbjct: 62 GSPQDFVKPKKIQLLIKAVNAYINYREKDFEEDINVRFDIVAIHKNGESFAIEHLTDAF 120 >UniRef50_Q5R0L0 UPF0102 protein IL0423 n=1 Tax=Idiomarina loihiensis RepID=Y423_IDILO Length = 116 Score = 127 bits (319), Expect = 1e-28, Method: Composition-based stats. Identities = 35/111 (31%), Positives = 54/111 (48%), Gaps = 2/111 (1%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAA 78 TG E + +L+ L I N GGE+D+I R+G +F EV++R + Sbjct: 6 TGKRAELLSAEFLKKNNLTIICKNYRIDGGEVDIIARDGHYWVFCEVKFRDDESFAAVIE 65 Query: 79 SVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT--GNEVEWIKDAF 127 + + ++ TAR +L +N T RFDV+A ++EW KDAF Sbjct: 66 QIQPQQCRRIRYTARHYLLSNNIDEHTAAIRFDVIAIVGQPTKIEWFKDAF 116 >UniRef50_D0GLC9 Putative uncharacterized protein n=1 Tax=Leptotrichia goodfellowii F0264 RepID=D0GLC9_9FUSO Length = 119 Score = 127 bits (319), Expect = 1e-28, Method: Composition-based stats. Identities = 42/121 (34%), Positives = 74/121 (61%), Gaps = 9/121 (7%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREG--RTTIFVEVRYRRS 70 + + ++ G +E A+ +LE + L FI +N + GEIDLI E T IFVEV+YR++ Sbjct: 2 RKSKREVGFEYEEIAKDYLEERKLLFIESNYYTKYGEIDLIFLEKSSETLIFVEVKYRKN 61 Query: 71 ALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE----VEWIKDA 126 +YG A +V + KQ K++Q+++++++++ R+DV+ GN+ + WIK+A Sbjct: 62 NIYGEAVEAVDKRKQEKIIQSSQIYISKNKWK---NSVRYDVIGIIGNKLKNDINWIKNA 118 Query: 127 F 127 F Sbjct: 119 F 119 >UniRef50_C0EXX9 Putative uncharacterized protein n=1 Tax=Eubacterium hallii DSM 3353 RepID=C0EXX9_9FIRM Length = 117 Score = 127 bits (319), Expect = 2e-28, Method: Composition-based stats. Identities = 37/114 (32%), Positives = 54/114 (47%), Gaps = 2/114 (1%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 K +G A E A +LEGKG+R + N GEID+I E + VEV+ R G Sbjct: 2 RKNSGGAAEEAAVLFLEGKGIRILERNFRSYHGEIDIIALEQEMILVVEVKMRSYGDCGT 61 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-EVEWIKDAFN 128 AA +V KQ ++ T + + + RFDV+ + WI++AF Sbjct: 62 AAEAVDFRKQKRICYTFNYYRMQRRL-AENTAVRFDVIEVDKDFRCHWIQNAFE 114 >UniRef50_A0Z7D7 Putative uncharacterized protein n=1 Tax=marine gamma proteobacterium HTCC2080 RepID=A0Z7D7_9GAMM Length = 123 Score = 127 bits (319), Expect = 2e-28, Method: Composition-based stats. Identities = 46/118 (38%), Positives = 63/118 (53%), Gaps = 5/118 (4%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 QTG E A+ +L +GLR +A NV R GEID+IM +G T +FVEVR R Sbjct: 5 NTQTGKDAEDYAQNFLITQGLRTVARNVCCRYGEIDIIMEQGITVVFVEVRLRAQKGLQT 64 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT----GNEVEWIKDAFND 129 A+SV+ KQ +L++TA L + R RFDV+A+ WI+ AF+ Sbjct: 65 GASSVSYRKQQRLIKTASLVIQRMP-ELQGRPVRFDVIAYDTLQKNRVPHWIQQAFDA 121 >UniRef50_Q146Q2 UPF0102 protein Bxeno_A0149 n=9 Tax=Burkholderiaceae RepID=Y149_BURXL Length = 140 Score = 127 bits (319), Expect = 2e-28, Method: Composition-based stats. Identities = 52/117 (44%), Positives = 74/117 (63%), Gaps = 2/117 (1%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE-GRTTIFVEVRYRRSALYG 74 +K G A+EA+A+ +L+ + LRF+A NV RGGEIDL+MRE +FVEVR R YG Sbjct: 22 SKLVGAAFEARAQEFLQRQRLRFVARNVACRGGEIDLVMRERDGALVFVEVRARAQRRYG 81 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVD-CRFDVVAFTGNEVEWIKDAFNDH 130 GAAAS+ KQ ++++ A+ +LA + CRFDV+AF + W++DAF Sbjct: 82 GAAASIGWRKQQRIVRAAQHYLATRSSQLRDQPACRFDVIAFEAGRLVWLRDAFRAD 138 >UniRef50_A6GLM3 Putative uncharacterized protein n=1 Tax=Limnobacter sp. MED105 RepID=A6GLM3_9BURK Length = 167 Score = 127 bits (319), Expect = 2e-28, Method: Composition-based stats. Identities = 44/128 (34%), Positives = 63/128 (49%), Gaps = 2/128 (1%) Query: 2 ATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTI 61 V ++ +L G E QA + LE GL + N R GEIDLIM G T + Sbjct: 38 PNVAEKAPPVHKLALLAEGQLAETQALQLLEKHGLILVTRNHRCRCGEIDLIMASGNTAV 97 Query: 62 FVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT-GNEV 120 VEVR R + +G A S++ KQ ++ + A+LW + RFDVVA G E Sbjct: 98 IVEVRLRNNKRHGSALESISSHKQARVSRCAKLWWVQQGQR-KFTHLRFDVVALENGTEP 156 Query: 121 EWIKDAFN 128 W+++A+ Sbjct: 157 RWVQNAWQ 164 >UniRef50_A3YDY3 Putative uncharacterized protein n=1 Tax=Marinomonas sp. MED121 RepID=A3YDY3_9GAMM Length = 130 Score = 127 bits (319), Expect = 2e-28, Method: Composition-based stats. Identities = 38/114 (33%), Positives = 54/114 (47%), Gaps = 6/114 (5%) Query: 18 QTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAA 77 G E A+ +L + L I N + GEIDLI + +FVEVRYR+ G AA Sbjct: 19 NKGQLAEEAAKVFLLSQKLSMIEQNFICKLGEIDLICLDNGVIVFVEVRYRQDNSRGSAA 78 Query: 78 ASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVA----FTGNEVEWIKDAF 127 S+ KQ K+++ A+ WL ++ RFD V + W+K AF Sbjct: 79 QSIHLGKQKKVIKAAQYWLLINHK--QDTPIRFDAVLFDQVIDNEHLTWLKSAF 130 >UniRef50_D1W8G8 Putative uncharacterized protein n=1 Tax=Prevotella buccalis ATCC 35310 RepID=D1W8G8_9BACT Length = 134 Score = 126 bits (318), Expect = 2e-28, Method: Composition-based stats. Identities = 32/123 (26%), Positives = 51/123 (41%), Gaps = 10/123 (8%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGR--TTIFVEVRYRRSA 71 T + G E A +L +G EID+I T +FVEV+ RRS Sbjct: 12 ATHNKFGKWGEDTAVDYLHKQGYTIRERGWRHGKFEIDIIALSPDGITCVFVEVKTRRSD 71 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-----EVEWIKDA 126 + +V K L A +++ + + RFDV++ G+ ++E +DA Sbjct: 72 EVALPSDAVDEKKMRNLGIAADVYVKMFDIQEE---LRFDVISIVGSTAENMQIEHFEDA 128 Query: 127 FND 129 FN Sbjct: 129 FNP 131 >UniRef50_A4AH12 Putative uncharacterized protein n=1 Tax=marine actinobacterium PHSC20C1 RepID=A4AH12_9ACTN Length = 118 Score = 126 bits (318), Expect = 2e-28, Method: Composition-based stats. Identities = 35/117 (29%), Positives = 52/117 (44%), Gaps = 8/117 (6%) Query: 14 LTTKQ-TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + K G E A L GL + N GE+D++ R+ +FVEV+ R S L Sbjct: 1 MAAKDVLGARGEELATDHLISAGLEILDRNWRCSQGELDIVARDQDDVVFVEVKTRSSVL 60 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-----EVEWIK 124 +G S+T +K +L + A +W H GS R D +A E+E +K Sbjct: 61 FGHPFESITATKVARLRRLAAVWCDAHPGSGA--TVRIDAIAVIVPSRGAVEIEHLK 115 >UniRef50_Q1D6H9 UPF0102 protein MXAN_3551 n=2 Tax=Cystobacterineae RepID=Y3551_MYXXD Length = 126 Score = 126 bits (318), Expect = 2e-28, Method: Composition-based stats. Identities = 38/125 (30%), Positives = 61/125 (48%), Gaps = 6/125 (4%) Query: 9 GSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYR 68 ++ G+A E A R+LE +G R N R GE+D++ FVEVR R Sbjct: 2 RRAAPAERREYGNAGEEAAVRFLEAQGWRVRDRNWTCRFGELDVVAERDDLVCFVEVRMR 61 Query: 69 RSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE----VEWIK 124 +A +G + SV+ +KQ ++++ A +L H+ RFDV++ G V+ I Sbjct: 62 STATWGDPSHSVSFAKQRRVVKAALRYLFAHDLRG--RMFRFDVISVVGRGERATVDHIP 119 Query: 125 DAFND 129 AF+ Sbjct: 120 GAFDA 124 >UniRef50_Q8XUC6 UPF0102 protein RSc3265 n=6 Tax=Proteobacteria RepID=Y3265_RALSO Length = 130 Score = 126 bits (318), Expect = 2e-28, Method: Composition-based stats. Identities = 49/108 (45%), Positives = 68/108 (62%), Gaps = 6/108 (5%) Query: 25 AQARRWLEGKGLRFIAANVNERGGEIDLIMRE-GRTTIFVEVR---YRRSALYGGAAASV 80 +A R+L+ +GL IA N + GEIDL+MR+ T +FVEVR R + +GGAAASV Sbjct: 22 DRALRYLQARGLSVIARNYRCKTGEIDLVMRDVAGTLVFVEVRARVARSAQRFGGAAASV 81 Query: 81 TRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAFN 128 T +KQ +L+ A +LA H + CRFDV+A G +EW++DAF Sbjct: 82 TPAKQRRLIAAAEDFLAGHP--GEVPACRFDVIAIDGTRIEWMRDAFG 127 >UniRef50_A0YUK1 Putative uncharacterized protein n=2 Tax=Cyanobacteria RepID=A0YUK1_9CYAN Length = 176 Score = 126 bits (318), Expect = 2e-28, Method: Composition-based stats. Identities = 35/113 (30%), Positives = 52/113 (46%), Gaps = 12/113 (10%) Query: 18 QTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE----------GRTTIFVEVRY 67 + G E + WL+ +G + R GEIDLI RE T IFVEV+ Sbjct: 16 KIGTLGEQLVQAWLKQQGWEILFHQYRCRWGEIDLIAREVKDPKVQSKLDSTVIFVEVKT 75 Query: 68 RRSALYG-GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE 119 R + ++T SKQ KL+++A+++L+ H CRFDV + Sbjct: 76 RSKRNWDSDGLLAITPSKQTKLIKSAQIFLSDHP-ELADSPCRFDVALVRCDR 127 >UniRef50_C2KRF5 Possible endonuclease n=2 Tax=Mobiluncus mulieris RepID=C2KRF5_9ACTO Length = 173 Score = 126 bits (317), Expect = 2e-28, Method: Composition-based stats. Identities = 29/120 (24%), Positives = 53/120 (44%), Gaps = 3/120 (2%) Query: 2 ATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGR-TT 60 A P ++ + ++ G A E A +L+ +G + + N R GE+D++ Sbjct: 44 ALKPPKAPR-KNPHNRELGLAGEELAVEFLQTQGYQVLDRNWRCRAGEVDIVALSPDSVL 102 Query: 61 IFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEV 120 FVEV+ R + +G A ++T +K ++ W H F D D+V+ + V Sbjct: 103 AFVEVKTRSTRRHGTPAEAITYAKLTRMRCVMGAWFRVHEAPF-HHDVSLDLVSVEWDGV 161 >UniRef50_A6SUE7 UPF0102 protein mma_0204 n=4 Tax=Betaproteobacteria RepID=Y204_JANMA Length = 123 Score = 126 bits (317), Expect = 3e-28, Method: Composition-based stats. Identities = 50/123 (40%), Positives = 75/123 (60%), Gaps = 4/123 (3%) Query: 8 SGSPRQLTTKQT-GDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVR 66 R T KQ G A E QA +L+ +GL+ + N +GGEIDL+M++G+ +FVEVR Sbjct: 4 PAFLRPRTAKQLAGQAGEDQALIYLQQQGLQLLERNFRCKGGEIDLLMQDGKALVFVEVR 63 Query: 67 YRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDA 126 R +GGAAAS+ +KQ +L+ A+++L R++ CRFDV+AF E+ W+K+A Sbjct: 64 MRSEKKFGGAAASIGTAKQKRLIIAAQIYLQRYSMP---PPCRFDVIAFDDKEMTWLKNA 120 Query: 127 FND 129 Sbjct: 121 IEA 123 >UniRef50_C9RJM0 Putative uncharacterized protein n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RJM0_FIBSS Length = 138 Score = 126 bits (317), Expect = 3e-28, Method: Composition-based stats. Identities = 38/120 (31%), Positives = 57/120 (47%), Gaps = 7/120 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 + G+ E QA +L +G + + N GGE+D++ R+ T +FVEV+ + G Sbjct: 11 NRAKGNFIETQAVAFLMREGYQVVTRNYAYHGGELDIVARDNGTLVFVEVKSVWNNQEGN 70 Query: 76 AAASVTRSKQHKLLQTARLWLARHN---GSFDTVDCRFDVVAF----TGNEVEWIKDAFN 128 AA V KQ K+ QTA +LA CRFDV++ + IK+AF Sbjct: 71 PAARVNALKQKKIWQTACHFLATQKTIAPKGFDTPCRFDVLSARAYQEPLQFAHIKNAFE 130 >UniRef50_C8PW53 Putative uncharacterized protein n=1 Tax=Enhydrobacter aerosaccus SK60 RepID=C8PW53_9GAMM Length = 134 Score = 126 bits (317), Expect = 3e-28, Method: Composition-based stats. Identities = 46/130 (35%), Positives = 71/130 (54%), Gaps = 15/130 (11%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNER-GGEIDLIMREGR----TTIFVEVRY 67 ++ GD +E A+ +LE +GL F A N + + GE+DL+M E + VEVR Sbjct: 4 TAPKQRQGDYYETLAKHYLEAQGLTFFAKNWHYKNLGELDLVMLEPTQKIPCLVIVEVRQ 63 Query: 68 RRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG---------N 118 R+++ +G + S+T +KQ K+++T +L H FD D RFDVV++ G Sbjct: 64 RKASQFGTSLDSITPAKQRKIVKTTAAFLQAHP-QFDNFDIRFDVVSYEGAATAGQAVMP 122 Query: 119 EVEWIKDAFN 128 WIKDAF+ Sbjct: 123 TPTWIKDAFS 132 >UniRef50_Q4FQF2 UPF0102 protein Psyc_1908 n=2 Tax=Psychrobacter RepID=Y1908_PSYA2 Length = 173 Score = 125 bits (316), Expect = 3e-28, Method: Composition-based stats. Identities = 44/145 (30%), Positives = 70/145 (48%), Gaps = 30/145 (20%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANV-NERGGEIDLIMREGR----TTIFVEVRYRRS 70 ++ G +E A +L+ +GL IA N + GE+DL+M E T +F+EVR R Sbjct: 29 KQRQGGYFEQLACEFLQEQGLILIAKNWQRPKVGELDLVMLEKGQAWSTLVFIEVRQRNR 88 Query: 71 ALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVA---------------- 114 + +G AA SVT KQ K+++ AR +L +H + +CRFDV+A Sbjct: 89 SHFGDAALSVTAGKQRKIIKVARYFLHQHQ-KYSDYECRFDVIAYNTSNNKNSENETDIR 147 Query: 115 --------FTGNEVEWIKDAFNDHS 131 ++ EW++ AF + Sbjct: 148 LDNQLNQPLEKDQPEWLQGAFIASA 172 >UniRef50_A6VXY8 UPF0102 protein Mmwyl1_2395 n=1 Tax=Marinomonas sp. MWYL1 RepID=Y2395_MARMS Length = 127 Score = 125 bits (316), Expect = 4e-28, Method: Composition-based stats. Identities = 46/128 (35%), Positives = 64/128 (50%), Gaps = 6/128 (4%) Query: 4 VPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFV 63 P S R+ K GD E A +L +GLRF+ N R GEIDLI + T +FV Sbjct: 2 RPVTSFLNRKKAPKNNGDKAEQAAEAFLRKQGLRFVERNFFCRIGEIDLIFLDQNTYVFV 61 Query: 64 EVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVA----FTGNE 119 EVR+R + +G AA S+ +SK K+ +A LWL ++N RFD + Sbjct: 62 EVRFRANNTHGNAAESLGQSKLKKVRNSAALWLQKNNKV--NNSSRFDAILFDEKIDSQH 119 Query: 120 VEWIKDAF 127 + W+K F Sbjct: 120 LTWLKAVF 127 >UniRef50_D2RAN4 Putative uncharacterized protein n=1 Tax=Gardnerella vaginalis 409-05 RepID=D2RAN4_GARVA Length = 182 Score = 125 bits (315), Expect = 4e-28, Method: Composition-based stats. Identities = 35/108 (32%), Positives = 47/108 (43%), Gaps = 1/108 (0%) Query: 10 SPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRT-TIFVEVRYR 68 L +K+ G E A L KG I N + R GE+DL+M +FVEV+ R Sbjct: 40 QDDSLESKELGKLGETYATLRLIQKGWHVIDQNWHCRNGELDLVMITPEQKLVFVEVKTR 99 Query: 69 RSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT 116 RS G ++T+ K+ KL T WL RFD V+ Sbjct: 100 RSVRCGTPLEAITQEKRSKLRTTGMKWLEEFGSDIPHYRIRFDAVSIL 147 >UniRef50_D2MIZ7 Putative uncharacterized protein n=1 Tax=Candidatus Poribacteria sp. WGA-A3 RepID=D2MIZ7_9BACT Length = 128 Score = 125 bits (315), Expect = 4e-28, Method: Composition-based stats. Identities = 48/121 (39%), Positives = 68/121 (56%), Gaps = 7/121 (5%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 LT + G EA A R L KG R + NV GE+D++ R G T IFVEV+ RR+ Sbjct: 2 SLTRQLLGKEAEAAAERLLRQKGYRILDRNVRIGRGELDIVARVGETVIFVEVKARRTDR 61 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-----VEWIKDAF 127 YGG A +VT K+ +L+Q A +LARH + CRFDV+ + + +E +++AF Sbjct: 62 YGGVAHAVTARKERQLIQLAARYLARHR--LERQPCRFDVLLYDAGDPGSPSLEHVENAF 119 Query: 128 N 128 Sbjct: 120 E 120 >UniRef50_B8G6B1 UPF0102 protein Cagg_0930 n=3 Tax=Chloroflexus RepID=Y930_CHLAD Length = 123 Score = 125 bits (315), Expect = 4e-28, Method: Composition-based stats. Identities = 41/119 (34%), Positives = 59/119 (49%), Gaps = 9/119 (7%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 +Q GD E A +LE G IA N R GEID++ R+G +FVEVR RR Sbjct: 5 KRQLGDRGEQVAAVYLERCGYTIIARNWRCRNGEIDMVARDGDYLVFVEVRTRRDE---Y 61 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVA--FTGNEV---EWIKDAFND 129 A S+ K+ +L+ A +LA H+ +T R DV+A GN + + + A + Sbjct: 62 ALESLLMHKRQRLVTLAYHYLAEHDVP-ETTPWRIDVIALTVVGNRLVVTDHVMAAIGE 119 >UniRef50_C7R327 Putative uncharacterized protein n=1 Tax=Jonesia denitrificans DSM 20603 RepID=C7R327_JONDD Length = 129 Score = 125 bits (314), Expect = 5e-28, Method: Composition-based stats. Identities = 34/125 (27%), Positives = 54/125 (43%), Gaps = 16/125 (12%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNER---GGEIDLIMREGRTTIFVEVRYRRSA 71 T G E A +WL+ +G + N GEID+I R+G T + VEV+ R + Sbjct: 4 RTYTLGQTGETYAAQWLQKRGYAILERNWRAAYPMRGEIDIIARDGATLVIVEVKTRTTQ 63 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG-----------NEV 120 G + +VT K +L + A WL + + RFDV++ + Sbjct: 64 HCGHPSEAVTPRKLTQLRRLAAAWLT--HAGVRPRELRFDVISVLAPSNRYATPTNEWHI 121 Query: 121 EWIKD 125 + +KD Sbjct: 122 DHLKD 126 >UniRef50_Q2KU88 UPF0102 protein BAV3162 n=1 Tax=Bordetella avium 197N RepID=Y3162_BORA1 Length = 145 Score = 125 bits (314), Expect = 5e-28, Method: Composition-based stats. Identities = 49/125 (39%), Positives = 66/125 (52%), Gaps = 2/125 (1%) Query: 7 RSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVR 66 R G R G EAQ R L +GLR +A N R GE+DLIM +G + VEVR Sbjct: 9 RRGLIRPDPRHAQGKRAEAQGLRLLRAQGLRLLARNARNRHGELDLIMLDGEVLVVVEVR 68 Query: 67 YRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDA 126 +R + +GGAAAS+ +KQ +L + A WLA RFDV+AF + W++ A Sbjct: 69 WRSGSAFGGAAASIGPAKQARLARAAACWLA--GSEHAGRRLRFDVLAFEAGQARWLRGA 126 Query: 127 FNDHS 131 F + Sbjct: 127 FEPPA 131 >UniRef50_C9LKQ6 Putative uncharacterized protein n=1 Tax=Prevotella tannerae ATCC 51259 RepID=C9LKQ6_9BACT Length = 131 Score = 125 bits (314), Expect = 5e-28, Method: Composition-based stats. Identities = 31/120 (25%), Positives = 50/120 (41%), Gaps = 7/120 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 Q G E A +L R + N E+D+I +FVEV+ R Sbjct: 4 HNQLGALGEEVAAHYLSQLEYRLLERNWRTGHLEVDIIADYYGEIVFVEVKTRSYEAEYT 63 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN----EVEWIKDAFNDHS 131 A +V R+K+ L++ A ++ H+ CRFD++ G +V DA++ S Sbjct: 64 ALEAVDRTKKKHLVRAAHDYMHLHHL---DAACRFDIITVVGREAPFQVTHYIDAYSPKS 120 >UniRef50_C2BVF9 Possible endonuclease n=1 Tax=Mobiluncus curtisii ATCC 43063 RepID=C2BVF9_9ACTO Length = 178 Score = 124 bits (313), Expect = 6e-28, Method: Composition-based stats. Identities = 27/124 (21%), Positives = 49/124 (39%), Gaps = 6/124 (4%) Query: 8 SGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREG-RTTIFVEVR 66 S L +Q G A E A L+ G + N R GE+D++ FVEV+ Sbjct: 53 SPPRNNLHNRQLGMAGEEVAAESLKAAGYVIVDRNWRCRAGEVDIVALSPEGVLGFVEVK 112 Query: 67 YRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-----VE 121 R + +G ++T K ++ + WLA+ + D+ + + V+ Sbjct: 113 TRSNHRHGLPIEAITMKKLARMRRVMGAWLAQRDIVPVHRAVSLDLCSVDWDGHGEPVVK 172 Query: 122 WIKD 125 ++ Sbjct: 173 HLQG 176 >UniRef50_C7MB82 Putative uncharacterized protein n=1 Tax=Brachybacterium faecium DSM 4810 RepID=C7MB82_BRAFD Length = 153 Score = 124 bits (313), Expect = 7e-28, Method: Composition-based stats. Identities = 38/120 (31%), Positives = 58/120 (48%), Gaps = 7/120 (5%) Query: 10 SPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRR 69 R +TT Q G A E A L +G + + N+ R GE+D++ + T +FVEV+ RR Sbjct: 33 RVRDMTTAQLGRAGEELAASHLSAQGWQIVERNLRLRQGELDIVALDHATLVFVEVKTRR 92 Query: 70 SALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-----EVEWIK 124 S + G A+VT K +L + A +L + D R DVVA +E ++ Sbjct: 93 SFVTGVPQAAVTPDKLRRLRRLAGEYLMERSTP--HRDVRIDVVAVHAQLDGTFSIEHLE 150 >UniRef50_B1XJM9 Putative uncharacterized protein n=1 Tax=Synechococcus sp. PCC 7002 RepID=B1XJM9_SYNP2 Length = 140 Score = 124 bits (313), Expect = 8e-28, Method: Composition-based stats. Identities = 37/144 (25%), Positives = 65/144 (45%), Gaps = 18/144 (12%) Query: 1 MATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRT- 59 MA P SP +L G+ E ++ L+ + + +A R GE+DL+ +T Sbjct: 1 MADTP----SPEKLAALAVGEQGELFVQQHLKSQDWQIVATRWRCRWGELDLVAFHAQTK 56 Query: 60 -TIFVEVRYRRSALYGG-AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG 117 FVEV+ R+ ++T SKQ K ++ A +L++ ++T CRFDV T Sbjct: 57 ILAFVEVKTRQQHSLDYQGLLAITPSKQRKTIRAAMQFLSKFP-QYETYGCRFDVALVTY 115 Query: 118 NE----------VEWIKDAFNDHS 131 ++ +++ AF + Sbjct: 116 SKTATFPQGFRLATYLEGAFEADA 139 >UniRef50_Q8DI54 UPF0102 protein tll1737 n=1 Tax=Thermosynechococcus elongatus BP-1 RepID=Y1737_THEEB Length = 124 Score = 124 bits (312), Expect = 8e-28, Method: Composition-based stats. Identities = 33/117 (28%), Positives = 55/117 (47%), Gaps = 8/117 (6%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE-GRTTIFVEVRYRRSALYG 74 + GD EA WL+ + + +A N + GE+D+I + G +FVEV+ R S + Sbjct: 1 MRHVGDRGEAVVAAWLQTQQCQILAQNWSCPWGELDIIACDPGGVVLFVEVKTRGSYNWD 60 Query: 75 -GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAFNDH 130 +++ SKQ KL+ A+ +L + CRFDV + A++ H Sbjct: 61 RDGLDAISPSKQRKLILAAQAFLESQP-QWQEHPCRFDVALV-----RHQRGAYHLH 111 >UniRef50_B0MPN2 Putative uncharacterized protein n=1 Tax=Eubacterium siraeum DSM 15702 RepID=B0MPN2_9FIRM Length = 132 Score = 124 bits (312), Expect = 1e-27, Method: Composition-based stats. Identities = 30/118 (25%), Positives = 51/118 (43%), Gaps = 11/118 (9%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAA 78 G E +L G I N + GEID++ +G +FVEV+ RR Sbjct: 10 KGKLGEDFTADYLIKNGYDIITRNYRKPCGEIDIVASKGDILVFVEVKTRRYRSLVSGVE 69 Query: 79 SVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN--------EVEWIKDAFN 128 +V K+ +++ TA +LA + + R+D+ T + + + KDAF+ Sbjct: 70 AVGYKKKGRIIATADCFLAEYG---EEKQIRYDIAEVTVSTGDAVRVIDFRYFKDAFD 124 >UniRef50_Q6A7T5 UPF0102 protein PPA1431 n=3 Tax=Propionibacterium acnes RepID=Y1431_PROAC Length = 140 Score = 124 bits (312), Expect = 1e-27, Method: Composition-based stats. Identities = 36/139 (25%), Positives = 54/139 (38%), Gaps = 16/139 (11%) Query: 1 MATVP--TRSGSPRQLTTK-------QTGDAWEAQARRWLEGKGLRFIAANVNERGGEID 51 M P +PR + G E A +++E G IA N GEID Sbjct: 2 MTPKPLSAELTTPRGAALRGRRGCRPAFGAWGEDLAAQYVESLGWTIIARNWTCDVGEID 61 Query: 52 LIMREGRTTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFD 111 LI R+ +T +F+EV+ R +G S+T +K KL + A WL + R D Sbjct: 62 LIARDDQTVVFIEVKARSGTGFGDPLESITTAKVRKLHELALAWLVNQDDGV--HSVRID 119 Query: 112 VVAFTG-----NEVEWIKD 125 + V ++ Sbjct: 120 AIGVMVRPGAEPTVTHVRG 138 >UniRef50_C9PT16 Endonuclease n=5 Tax=Prevotella RepID=C9PT16_9BACT Length = 124 Score = 124 bits (311), Expect = 1e-27, Method: Composition-based stats. Identities = 34/123 (27%), Positives = 49/123 (39%), Gaps = 10/123 (8%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGR--TTIFVEVRYRRSA 71 G E A L+ +G + + +IDL+ T +FVEV+ R S Sbjct: 2 AKHNDLGKWGEDFAAEHLQKQGYVIRDRDWHCGKRDIDLVAITADMATVVFVEVKTRTSN 61 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-----EVEWIKDA 126 A +V R K L A ++ + N RFDVV G ++E I+DA Sbjct: 62 EVSEPADAVNRQKIRNLGIAANNYIKQFNVVEQ---VRFDVVTIVGTSRENAQLEHIEDA 118 Query: 127 FND 129 FN Sbjct: 119 FNP 121 >UniRef50_B1VG84 Putative uncharacterized protein n=1 Tax=Corynebacterium urealyticum DSM 7109 RepID=B1VG84_CORU7 Length = 154 Score = 124 bits (311), Expect = 1e-27, Method: Composition-based stats. Identities = 33/114 (28%), Positives = 52/114 (45%), Gaps = 3/114 (2%) Query: 9 GSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREG-RTTIFVEVRY 67 G +T G E A +L +G R + N + R E+D+I + +FVEV+Y Sbjct: 36 GQNSADSTVGVGRQGENLAGEYLVNQGWRIVERNWHCRFAELDIIALDPAGEMVFVEVKY 95 Query: 68 RRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVE 121 R+ ++G +VT++K ++ A WLA D RFDV+ V Sbjct: 96 RKDTVHGTGVEAVTQTKLRRMRLAAGKWLAEQQRGVDV--VRFDVIDVGPGGVR 147 >UniRef50_A0QVA9 UPF0102 protein MSMEG_2508 n=6 Tax=Corynebacterineae RepID=Y2508_MYCS2 Length = 124 Score = 124 bits (311), Expect = 1e-27, Method: Composition-based stats. Identities = 34/109 (31%), Positives = 52/109 (47%), Gaps = 4/109 (3%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREG--RTTIFVEVRYRRS 70 LT + G E A L GL+ +A N R GE+D+I + T +FVEV+ R Sbjct: 5 SLTRAELGALGEEVAVEHLAALGLKTLARNWRCRYGELDIIAEDAATGTVVFVEVKTRSG 64 Query: 71 ALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE 119 +GG A +VT K ++ + A +WLA + + + R DV+ Sbjct: 65 DGFGGLAEAVTPQKVRRIRRLAAIWLAAQDAHWAVL--RIDVIGVRVGR 111 >UniRef50_Q025A4 UPF0102 protein Acid_2433 n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Y2433_SOLUE Length = 132 Score = 123 bits (310), Expect = 2e-27, Method: Composition-based stats. Identities = 35/113 (30%), Positives = 57/113 (50%), Gaps = 7/113 (6%) Query: 20 GDAWEAQARRWLEGKGLRFIAANVNE--RGGEIDLIMREGRTTIFVEVRYRRSALYGGAA 77 G E A R+L +G +A N GEIDL++ +G FVEV+ R S +G Sbjct: 22 GRIGEDLAHRYLRSQGCTVVARNYRTLAGTGEIDLVVWDGGRLAFVEVKTRSSTDFGPPE 81 Query: 78 ASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT---GNEVEWIKDAF 127 ++V K+ +L AR ++ R + + RFD+V+ ++EW++ AF Sbjct: 82 SAVDAEKRDRLRTAARDYVRRADVDW--KAVRFDIVSVILQASPKIEWLRGAF 132 >UniRef50_D1ANU1 Putative uncharacterized protein n=1 Tax=Sebaldella termitidis ATCC 33386 RepID=D1ANU1_SEBTE Length = 111 Score = 123 bits (310), Expect = 2e-27, Method: Composition-based stats. Identities = 38/112 (33%), Positives = 67/112 (59%), Gaps = 3/112 (2%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + ++ G +E A+ +L GL ++ +N GEIDLI ++ IFVEV+YR+++ Y Sbjct: 1 MNKREKGFKYENAAKDFLINNGLEYVRSNYYSEYGEIDLIFKDRDFLIFVEVKYRKNSDY 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 G A SVT++K K++ + +++ N + CR+D+VA G E+ W+K+ Sbjct: 61 GFAEESVTQAKLKKIINASLNYISEVNWNEG---CRYDLVAINGEEIIWVKN 109 >UniRef50_C2BNU0 Endonuclease n=2 Tax=Corynebacterium RepID=C2BNU0_9CORY Length = 142 Score = 123 bits (310), Expect = 2e-27, Method: Composition-based stats. Identities = 46/124 (37%), Positives = 62/124 (50%), Gaps = 10/124 (8%) Query: 7 RSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREG-RTTIFVEV 65 R + + + G EA A R+L +G IAANV+ R GEIDLI RE T +FVEV Sbjct: 18 RLATKKPRHKQVLGKRGEAFAARYLHERGAEIIAANVSYRVGEIDLIAREPNGTIVFVEV 77 Query: 66 RYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE----VE 121 + R ++ YG A +VT K +L + A WL S + RFDV+A +E Sbjct: 78 KTRANSNYGVA-EAVTPQKLARLRKAAAQWLDGKPLS----EVRFDVIALVAQGQGFVLE 132 Query: 122 WIKD 125 K Sbjct: 133 HFKG 136 >UniRef50_Q6FD45 UPF0102 protein ACIAD1132 n=4 Tax=Acinetobacter RepID=Y1132_ACIAD Length = 140 Score = 122 bits (308), Expect = 2e-27, Method: Composition-based stats. Identities = 39/130 (30%), Positives = 63/130 (48%), Gaps = 17/130 (13%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + G E QA L+ G + + N + R GEIDLI+ + IFVEV+ R Y Sbjct: 10 IHAHHLGKWAENQALNILQANGFKLVIRNFHSRVGEIDLIVAKADELIFVEVKARTLGSY 69 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT-GNEV------------ 120 A + S+Q K+++TA+ +L R+ + CRFDV+ F +++ Sbjct: 70 AAANEVLLVSQQRKIIKTAQYFLNRYP-DYQQFYCRFDVICFDFPHKIAKTVQQDFSKLR 128 Query: 121 ---EWIKDAF 127 +WI++AF Sbjct: 129 YDQQWIENAF 138 >UniRef50_B1WNM5 Putative uncharacterized protein n=3 Tax=Chroococcales RepID=B1WNM5_CYAA5 Length = 153 Score = 122 bits (308), Expect = 3e-27, Method: Composition-based stats. Identities = 32/103 (31%), Positives = 45/103 (43%), Gaps = 4/103 (3%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREG--RTTIFVEVRYRRSALY 73 G E +WL + + + GEID+I + T IFVEV+ R+S + Sbjct: 12 MTSIGKIGEQFVAQWLISQSWQILHERWRSPWGEIDIIAQHHHSNTIIFVEVKTRKSKNW 71 Query: 74 G-GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF 115 +VT KQ K+ QTA +L + F CRFDV Sbjct: 72 DQSGILAVTPQKQAKITQTASYFLGEYP-QFSNFICRFDVALV 113 >UniRef50_Q24UC6 UPF0102 protein DSY2577 n=2 Tax=Desulfitobacterium hafniense RepID=Y2577_DESHY Length = 121 Score = 122 bits (308), Expect = 3e-27, Method: Composition-based stats. Identities = 33/118 (27%), Positives = 54/118 (45%), Gaps = 8/118 (6%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 + G E A + + GL + N GE+D+I REG T IF+EVR R + G Sbjct: 4 HRQALGRYGEELAVKHIRQAGLTVLECNYRCPLGEMDIIAREGETIIFIEVRTRSTGSRG 63 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG-------NEVEWIKD 125 S+T K+ +L + A +L ++ + RFD++A ++ WI+ Sbjct: 64 WGEESITAKKRERLYRIATHYL-KYRNYKEWPSLRFDLIAIRCQDQEGKQPDIIWIRG 120 >UniRef50_B2S4F0 UPF0102 protein TPASS_0913 n=3 Tax=Treponema RepID=Y913_TREPS Length = 126 Score = 122 bits (308), Expect = 3e-27, Method: Composition-based stats. Identities = 38/122 (31%), Positives = 58/122 (47%), Gaps = 8/122 (6%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 K G EA A RWL +G I N GEID+I ++ T +FVEV+ R Y Sbjct: 4 HNKLLGAFGEAYAARWLATRGYIIITRNWRRATGEIDIIAQQDDTIVFVEVKTLRCTSYA 63 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-------EVEWIKDAF 127 A V + KQ ++ +TA+ +LA ++ + RFDV+ + ++ + AF Sbjct: 64 DLAIIVGKRKQKRICETAKHFLASAR-EYNHMCARFDVIVLRSDPFRRQDVDIVHLPHAF 122 Query: 128 ND 129 D Sbjct: 123 ED 124 >UniRef50_A6NUN6 Putative uncharacterized protein n=1 Tax=Bacteroides capillosus ATCC 29799 RepID=A6NUN6_9BACE Length = 119 Score = 122 bits (307), Expect = 3e-27, Method: Composition-based stats. Identities = 36/122 (29%), Positives = 57/122 (46%), Gaps = 11/122 (9%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + T G E+ L +G R +A+ R GEIDLI +G +FVEV+ R+S + Sbjct: 1 MNTSLLGRWGESLVAEELRRRGCRVVASGYRTRFGEIDLIAEDGPYLLFVEVKLRKSDRF 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG--------NEVEWIKD 125 A V R KQ ++ TA ++LA++ RFDV + ++++ Sbjct: 61 APGRAFVDRGKQERIRTTAEIYLAQNPTERQP---RFDVAEVYAPQGTATAHPRIVYLEN 117 Query: 126 AF 127 AF Sbjct: 118 AF 119 >UniRef50_Q6AEA8 UPF0102 protein Lxx14785 n=1 Tax=Leifsonia xyli subsp. xyli RepID=Y1478_LEIXX Length = 118 Score = 122 bits (307), Expect = 4e-27, Method: Composition-based stats. Identities = 32/117 (27%), Positives = 46/117 (39%), Gaps = 7/117 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + G E+ A WLE G I N R GEID+I R G T+FVEV+ R + Y Sbjct: 2 AKKDELGRRGESVAAHWLEAHGYVLIGRNWRIRSGEIDIIARTGNITVFVEVKTRATTHY 61 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT-----GNEVEWIKD 125 G ++T K +L + W + R D + E+ + Sbjct: 62 GHPLEAITPEKAARLRRLTAEWCRTYGPLPG--ALRVDAIGVLNAWSANPEIHHLPG 116 >UniRef50_Q55761 UPF0102 protein sll0189 n=1 Tax=Synechocystis sp. PCC 6803 RepID=Y189_SYNY3 Length = 150 Score = 122 bits (306), Expect = 4e-27, Method: Composition-based stats. Identities = 34/103 (33%), Positives = 46/103 (44%), Gaps = 4/103 (3%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRT--TIFVEVRYRRSALY 73 G A E+ WLE +G + + GEIDLI T FVEV+ R + Sbjct: 1 MTDLGQAGESLVAAWLEQQGGKILQQRWRSPWGEIDLITHFPDTKIIAFVEVKTRSGGNW 60 Query: 74 G-GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF 115 G +V KQ K+ QTA +LA + +CRFDV+ Sbjct: 61 DQGGLLAVNARKQEKIWQTANHFLASQP-QWSDWNCRFDVMIV 102 >UniRef50_A6LF20 UPF0102 protein BDI_2565 n=4 Tax=Bacteroidales RepID=Y2565_PARD8 Length = 121 Score = 122 bits (306), Expect = 4e-27, Method: Composition-based stats. Identities = 28/121 (23%), Positives = 48/121 (39%), Gaps = 7/121 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 G E++AR +L G + N + E+D+I + I VEV+ R Sbjct: 2 ARQNDMGREGESEARAYLVKHGYNVLHTNWHWHHYELDIIAVKEDELIVVEVKTRSEDFL 61 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE----VEWIKDAFND 129 +V K +++ A ++ N + RFD+V E ++ I+DAF Sbjct: 62 LSPEDAVDTKKIRRIVAAADAYVRYFNI---DLPVRFDIVTLIKKETGFLIDHIEDAFYA 118 Query: 130 H 130 Sbjct: 119 P 119 >UniRef50_Q3AC88 UPF0102 protein CHY_1414 n=1 Tax=Carboxydothermus hydrogenoformans Z-2901 RepID=Y1414_CARHZ Length = 118 Score = 122 bits (306), Expect = 4e-27, Method: Composition-based stats. Identities = 27/119 (22%), Positives = 57/119 (47%), Gaps = 6/119 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + ++ G WE A ++L KG + + N RGGEID++ ++G +F+EVR+R + Sbjct: 1 MNRRELGQKWEELAEQYLRKKGYKILTRNYQIRGGEIDIVAQDGEFLVFIEVRFRSDISF 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE----VEWIKDAFN 128 G + +V K+ L + ++++ + + R D + + V ++ + Sbjct: 61 GTPSETVNEKKKASLKKAIKVYIHEN--FLYHLQPRVDFIGIEQKDNRFFVNHYQNVLD 117 >UniRef50_A3NEP2 UPF0102 protein BURPS668_3819 n=83 Tax=Proteobacteria RepID=Y3819_BURP6 Length = 144 Score = 122 bits (306), Expect = 5e-27, Method: Composition-based stats. Identities = 59/131 (45%), Positives = 79/131 (60%), Gaps = 5/131 (3%) Query: 2 ATVPTRSGSPRQL-TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE-GRT 59 R PR+ + + G A+E +A+R+LE GL +A NV RGGEIDL+MRE T Sbjct: 14 PEAAPRDNFPREAGSKRGIGAAFETRAQRFLERAGLALVARNVTVRGGEIDLVMRERDGT 73 Query: 60 TIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE 119 +FVEVR R ++ YGGAAAS+ K+ +LL A + AR G+ CRFDVVAF G Sbjct: 74 LVFVEVRARANSRYGGAAASIGVRKRMRLLLAAHAFWARTGGANA---CRFDVVAFEGGR 130 Query: 120 VEWIKDAFNDH 130 + W++DAF Sbjct: 131 LVWLRDAFRAD 141 >UniRef50_B4VZG7 Putative uncharacterized protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VZG7_9CYAN Length = 177 Score = 122 bits (306), Expect = 5e-27, Method: Composition-based stats. Identities = 33/135 (24%), Positives = 48/135 (35%), Gaps = 31/135 (22%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRT---------------- 59 T G E WL+ +G + + R GEIDLI Sbjct: 2 TNAKGQLGEQLVATWLQAQGWTILHHRWHCRWGEIDLIAYRDGEVKENPVSNRDINPQFH 61 Query: 60 -------------TIFVEVRYRRSALYG-GAAASVTRSKQHKLLQTARLWLARHNGSFDT 105 FVEV+ R + ++T SKQ K+ +TA L+LA + Sbjct: 62 LPTPLPANAESPILGFVEVKTRSRGNWDADGQLAITSSKQAKIWRTAELFLAENP-DLSD 120 Query: 106 VDCRFDVVAFTGNEV 120 + CRFDV + + Sbjct: 121 LPCRFDVALVRYHRI 135 >UniRef50_D1BMP5 Putative uncharacterized protein n=3 Tax=Veillonella RepID=D1BMP5_VEIPT Length = 132 Score = 121 bits (305), Expect = 6e-27, Method: Composition-based stats. Identities = 36/133 (27%), Positives = 61/133 (45%), Gaps = 7/133 (5%) Query: 1 MATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTT 60 M T+ T +L +K+ G E A ++E GL + N R GEID+I + Sbjct: 1 MKTISTGKAFN-ELDSKELGKWGERVATNYIEKIGLTVVDTNYRTRLGEIDIIAKRDLVY 59 Query: 61 IFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLAR-HNGSFDTVDCRFDVVAFTGNE 119 F+E++ RR +G A +VT+ KQ + + A L+L + + FDV+ +E Sbjct: 60 HFIEIKARRGMQHGLAREAVTKKKQKHIKRAAMLFLYDLNQKKRRWKEISFDVIEVYLHE 119 Query: 120 -----VEWIKDAF 127 + ++ F Sbjct: 120 DFQSSIHYLPQCF 132 >UniRef50_A4A5E8 Putative uncharacterized protein n=2 Tax=unclassified Gammaproteobacteria RepID=A4A5E8_9GAMM Length = 118 Score = 121 bits (304), Expect = 8e-27, Method: Composition-based stats. Identities = 45/114 (39%), Positives = 65/114 (57%), Gaps = 7/114 (6%) Query: 20 GDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAAS 79 GD +EA++ L+ GLR + + GEID+I + +FVEVR RR +GGAAAS Sbjct: 4 GDDFEARSAALLKSYGLRILDTQYRCKAGEIDIIACDEHHLLFVEVRARRHRSHGGAAAS 63 Query: 80 VTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN------EVEWIKDAF 127 V R+KQ ++ + A +L RH + + CRFDV+A+ E WI+ AF Sbjct: 64 VNRAKQCRIARCAAYFLNRHP-QWCHLPCRFDVIAWEPGCAGQSFEARWIQAAF 116 >UniRef50_A9B5H2 UPF0102 protein Haur_0145 n=1 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=Y145_HERA2 Length = 124 Score = 121 bits (304), Expect = 8e-27, Method: Composition-based stats. Identities = 35/116 (30%), Positives = 56/116 (48%), Gaps = 6/116 (5%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 K G E A +L+ G + IA+ + R GEIDLI + T + +EVR RR +G Sbjct: 4 DRKALGRWGEQYAAEYLQQLGYQLIASGWHCRWGEIDLIAYDQATLVIIEVRTRRGTAHG 63 Query: 75 GAAASVTRSKQHKLLQTARLWLARHN--GSFDTVDCRFDVVAFTG----NEVEWIK 124 AA S+T K+ +L + + +L + + D R D +A T ++E + Sbjct: 64 SAAESLTLKKRQRLARLLQAYLQALDAAQTPWLGDYRIDAIAITLSRGQPQLEHFQ 119 >UniRef50_UPI00017886BD protein of unknown function UPF0102 n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI00017886BD Length = 127 Score = 120 bits (303), Expect = 1e-26, Method: Composition-based stats. Identities = 38/128 (29%), Positives = 61/128 (47%), Gaps = 9/128 (7%) Query: 7 RSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVR 66 S S ++ KQ G A E A L KG R + N R GE+D++ G T + +EVR Sbjct: 2 TSPSGKKDNRKQKGAAAEELAAAALIQKGYRILDRNWRCRFGELDIVAETGETLVVIEVR 61 Query: 67 YRRS-ALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF------TGNE 119 R +G + SV K ++ TA+ ++ H + RFDV++ T + Sbjct: 62 SRSGTTRFGTPSESVNARKVMQVRNTAQQYV--HQKRYYERTIRFDVISVMLREDMTADS 119 Query: 120 VEWIKDAF 127 ++ I++AF Sbjct: 120 MDHIENAF 127 >UniRef50_C2CWR1 Endonuclease n=1 Tax=Gardnerella vaginalis ATCC 14019 RepID=C2CWR1_GARVA Length = 153 Score = 120 bits (303), Expect = 1e-26, Method: Composition-based stats. Identities = 36/118 (30%), Positives = 53/118 (44%), Gaps = 7/118 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREG-RTTIFVEVRYRRSALYG 74 K G+ E A L K + N + R GE+D++M + +F+EV+ RRS +G Sbjct: 37 NKTIGNLGEEYASLKLILKNWILLDRNWHSRFGELDVVMMDPFGRIVFIEVKTRRSVRFG 96 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-----EVEWIKDAF 127 +VT K K + WL HN F RFDVV+ + ++ I AF Sbjct: 97 TPLEAVTNEKCLKTHKAGFKWLDEHNF-FKHRKIRFDVVSILISKDKNIQLRHILGAF 153 >UniRef50_A1W341 UPF0102 protein Ajs_0414 n=11 Tax=Betaproteobacteria RepID=Y414_ACISJ Length = 132 Score = 120 bits (302), Expect = 1e-26, Method: Composition-based stats. Identities = 47/125 (37%), Positives = 67/125 (53%), Gaps = 7/125 (5%) Query: 9 GSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERG---GEIDLIMRE-GRTTIFVE 64 GS TT+ G A E +A L GL + N G GEIDLI+RE T +FVE Sbjct: 10 GSAPARTTRAAGQAGEDRALAHLTAAGLALVERNYRTPGRGGGEIDLILRERDGTLVFVE 69 Query: 65 VRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIK 124 VR R ++ YGGA S+ +KQ +++ A+ +L R CRFD V G+ ++W++ Sbjct: 70 VRSRGASAYGGAGGSIGVAKQRRIVFAAQHYLLRWP---APPPCRFDAVLIEGDRLQWLR 126 Query: 125 DAFND 129 AF+ Sbjct: 127 GAFDA 131 >UniRef50_UPI0001BC5BE0 endonuclease n=3 Tax=Fusobacterium RepID=UPI0001BC5BE0 Length = 123 Score = 120 bits (302), Expect = 1e-26, Method: Composition-based stats. Identities = 31/117 (26%), Positives = 53/117 (45%), Gaps = 3/117 (2%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 +Q G+ +E +A L + + N GEID+I + +F+EV+YR++ + Sbjct: 2 QNNRQKGNEYEERAVNILRENQYQILERNFRIFQGEIDIIAEKDGVLVFIEVKYRKNRNF 61 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD-AFND 129 G +V K K+ + A + + R DV+ F G+ W KD A+ D Sbjct: 62 GYGKEAVDSRKLGKIFRVAEYYKTYCGKQYQK--MRIDVIHFLGDTYFWEKDVAWGD 116 >UniRef50_B4WKR8 Putative uncharacterized protein n=1 Tax=Synechococcus sp. PCC 7335 RepID=B4WKR8_9SYNE Length = 144 Score = 120 bits (302), Expect = 1e-26, Method: Composition-based stats. Identities = 34/111 (30%), Positives = 51/111 (45%), Gaps = 10/111 (9%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMR--------EGRTTIFVEV 65 L + G+ E +WL K + + + R GEIDLI + + T F+EV Sbjct: 2 LNPQDLGNYGEQLVCQWLTQKNCQILQRQWHSRFGEIDLIAKGISGQGSLKAETLAFIEV 61 Query: 66 RYRRSALYG-GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF 115 + R + ++TRSKQ K+ TAR +L RH + CRFD+ Sbjct: 62 KTRSKGNWDADGLLALTRSKQQKIRMTARYFLVRHP-HLSELPCRFDLALV 111 >UniRef50_D1BJ87 Predicted endonuclease related to Holliday junction resolvase n=1 Tax=Sanguibacter keddieii DSM 10542 RepID=D1BJ87_SANKS Length = 127 Score = 120 bits (302), Expect = 1e-26, Method: Composition-based stats. Identities = 33/121 (27%), Positives = 48/121 (39%), Gaps = 8/121 (6%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + G E R LEG G + N GGE+DL+ +GR + +EV+ R Sbjct: 5 RTDRAAVGRYGEELVARMLEGAGWVVVDRNWRGTGGELDLVALDGRELVVIEVKTRTGLG 64 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGS---FDTVDCRFDVVAFT-----GNEVEWIK 124 YG + +VT K +L + A WLA R DVV +V+ + Sbjct: 65 YGHPSEAVTPRKLARLRRLAGEWLAGRAAETVPERPTSVRVDVVGVLLEKGRPPQVDHLV 124 Query: 125 D 125 Sbjct: 125 G 125 >UniRef50_B0MUK7 Putative uncharacterized protein n=1 Tax=Alistipes putredinis DSM 17216 RepID=B0MUK7_9BACT Length = 121 Score = 120 bits (302), Expect = 2e-26, Method: Composition-based stats. Identities = 32/121 (26%), Positives = 54/121 (44%), Gaps = 8/121 (6%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 TT+ TG E A RWL G + N + E+D++ T F+EV+ RR Sbjct: 3 TTQHTGRLGEETAARWLLDHGFTLLHRNWRQGHYELDIVAARKGTLHFIEVKTRRRDGLT 62 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT-----GNEVEWIKDAFND 129 ++ K+ L++ A +L + + + +FD++A EV +I+DA Sbjct: 63 PPEQALDSHKRRALVRAANAYLTENPFAGE---VQFDLIAVETAPAGTPEVRYIEDAIEL 119 Query: 130 H 130 H Sbjct: 120 H 120 >UniRef50_C0WKI9 Endonuclease n=3 Tax=Actinomycetales RepID=C0WKI9_9CORY Length = 132 Score = 120 bits (301), Expect = 2e-26, Method: Composition-based stats. Identities = 42/120 (35%), Positives = 59/120 (49%), Gaps = 10/120 (8%) Query: 9 GSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE-GRTTIFVEVRY 67 + ++ G E+ A +L +G IAANV+ R GEIDLI RE T +FVEV+ Sbjct: 10 ATKHPRHRQELGKRGESFAAGYLRERGSDIIAANVSYRVGEIDLIAREPDGTIVFVEVKT 69 Query: 68 RRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF----TGNEVEWI 123 R +A +G A +VT K ++ + A WL RFDV+A G E+E Sbjct: 70 RSTASFGTA-EAVTPHKLARMRRAAVQWLDGKPL----ATVRFDVIALVVNGEGFELEHF 124 >UniRef50_C0BHK9 Putative uncharacterized protein n=1 Tax=Flavobacteria bacterium MS024-2A RepID=C0BHK9_9BACT Length = 120 Score = 119 bits (299), Expect = 3e-26, Method: Composition-based stats. Identities = 29/118 (24%), Positives = 50/118 (42%), Gaps = 7/118 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 G E +A +L KG + N EIDL+M++ + +EV+ R + + Sbjct: 2 AQHNLFGQEAEQKALSFLCNKGYVLLEKNYRFGKAEIDLLMKDKDLLVCIEVKARSTDFF 61 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVA--FTGNEV--EWIKDAF 127 G + +T K L+ +L HN + RFDV++ + + I+ AF Sbjct: 62 GTPESFITSKKIKLLVGAVNHYLEYHNL---DYEVRFDVLSYTIKNKKWICKHIESAF 116 >UniRef50_A0LCU4 UPF0102 protein Mmc1_3298 n=1 Tax=Magnetococcus sp. MC-1 RepID=Y3298_MAGSM Length = 124 Score = 119 bits (299), Expect = 3e-26, Method: Composition-based stats. Identities = 37/119 (31%), Positives = 56/119 (47%), Gaps = 5/119 (4%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 LT K G+ E A + ++ KG + N R GE+D+I G +F EV+ R+ A+ Sbjct: 4 LTPKSFGEQAEDFACKMMKKKGYHILQRNARSRYGELDIIALHGEVVVFCEVKARQGAVS 63 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF----TGNEVEWIKDAFN 128 G A ++ KQ +L + A W + + CRFD V G E ++DAF Sbjct: 64 GSAGEAIDGRKQRQLGRLAEAWRLANPA-WMAAPCRFDAVLVAREAQGWHAEIVQDAFQ 121 >UniRef50_Q2NZA5 UPF0102 protein XOO3617 n=18 Tax=Xanthomonadaceae RepID=Y3617_XANOM Length = 122 Score = 119 bits (299), Expect = 3e-26, Method: Composition-based stats. Identities = 54/120 (45%), Positives = 71/120 (59%), Gaps = 3/120 (2%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 +Q G EA AR LE GLR + N N RGGE+DL+MR+G++ +FVEVRYRR Sbjct: 2 PAARQQRGAGVEAAARALLEQAGLRLVVGNANYRGGELDLVMRDGQSLVFVEVRYRRDDR 61 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVV--AFTGNEVEWIKDAFNDH 130 +GG AASV K+ KL+ A+L+L H + CRFDVV + + WI+DAF Sbjct: 62 FGGGAASVDWRKRRKLVLAAQLFLGAHPA-LAALPCRFDVVDASGEPPVLHWIRDAFRAD 120 >UniRef50_C0WCB9 Endonuclease n=1 Tax=Acidaminococcus sp. D21 RepID=C0WCB9_9FIRM Length = 117 Score = 119 bits (298), Expect = 4e-26, Method: Composition-based stats. Identities = 34/114 (29%), Positives = 55/114 (48%), Gaps = 6/114 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 + G E A +L +G A N + GE+DL+ R+G +FVEV+ RR+ LYG Sbjct: 4 RTRFGRWGERAAAAYLRHQGYIIEAQNYSSSHGELDLVARKGHLLVFVEVKSRRTDLYGR 63 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT----GNEVEWIKD 125 +VT K ++ +TA +L H D R+DV+ ++ +K+ Sbjct: 64 PRDAVTEEKAARIRETAYEYLQDHKRPGDR--IRYDVIEIMMLFGHFQLNHLKN 115 >UniRef50_A9HJH4 UPF0102 protein GDI1964/Gdia_0189 n=1 Tax=Gluconacetobacter diazotrophicus PAl 5 RepID=Y1964_GLUDA Length = 127 Score = 119 bits (298), Expect = 4e-26, Method: Composition-based stats. Identities = 43/129 (33%), Positives = 60/129 (46%), Gaps = 5/129 (3%) Query: 1 MATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTT 60 M PTR R + Q G E A WL+ G + R GEIDL+ + G Sbjct: 1 MMAEPTRRR-VRGAASYQRGLQAEQVAGAWLQEHGWTILMHRARTRWGEIDLVAQRGAMI 59 Query: 61 IFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT-GNE 119 +F EV+ R Y AA S+ R++ +L+ A WL + + + RFDV+ T G+ Sbjct: 60 VFCEVKCRPH--YTTAAESLGRAQMRRLMNAA-AWLCAAHPGWIYDEMRFDVLLVTAGDA 116 Query: 120 VEWIKDAFN 128 V I DAF Sbjct: 117 VHHIADAFR 125 >UniRef50_D1AZ70 Putative uncharacterized protein n=1 Tax=Sulfurospirillum deleyianum DSM 6946 RepID=D1AZ70_SULD5 Length = 108 Score = 118 bits (296), Expect = 6e-26, Method: Composition-based stats. Identities = 30/107 (28%), Positives = 49/107 (45%), Gaps = 6/107 (5%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAA 78 G E +A +LE +G +A N + + GEID+I + F EV+Y + Sbjct: 5 IGKEAETKASAYLEKEGYTILARNFHSKFGEIDIIALKEDILHFCEVKY---SQKYDPLL 61 Query: 79 SVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 +T SK K++ T + H S+ D ++ G E+E IK+ Sbjct: 62 RITPSKMKKIITTIHYYFLTHPSSYCYQ---IDAISIKGEEIEIIKN 105 >UniRef50_UPI000197ABB4 hypothetical protein GHTCC_11038 n=2 Tax=Proteobacteria RepID=UPI000197ABB4 Length = 122 Score = 118 bits (296), Expect = 6e-26, Method: Composition-based stats. Identities = 41/119 (34%), Positives = 63/119 (52%) Query: 7 RSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVR 66 +S G A E + + +L +GL F N + GEIDL+M++ T +FVEV+ Sbjct: 3 KSAVNNTQNAYHRGLAVEQKVKAYLIAQGLVFKDENFRAKCGEIDLVMKDQDTWVFVEVK 62 Query: 67 YRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 YR +G AA +T SK+ KL +T +++A+H + +D R D+ A GN W K Sbjct: 63 YRARPTHGSAADMLTSSKRDKLTKTMYVYMAKHYLNPSIIDHRIDLFAVDGNRARWHKH 121 >UniRef50_A5IKG8 UPF0102 protein Tpet_0671 n=6 Tax=Thermotogaceae RepID=Y671_THEP1 Length = 108 Score = 118 bits (296), Expect = 7e-26, Method: Composition-based stats. Identities = 27/108 (25%), Positives = 48/108 (44%), Gaps = 5/108 (4%) Query: 20 GDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAAS 79 E A ++L+ KG + + N + GEID++ R+GR +FVEV+ + Sbjct: 4 WKEAEELACKFLKKKGYKILERNYRTKYGEIDIVARDGREIVFVEVK--SGSGKVDPLER 61 Query: 80 VTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAF 127 + K L QTAR ++ ++ R D V T ++ + + Sbjct: 62 IDLKKVRNLEQTARFYMIQNKLKG---PARVDFVRVTPEGIDHFEGIW 106 >UniRef50_C7PDZ7 Putative uncharacterized protein n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PDZ7_CHIPD Length = 118 Score = 117 bits (295), Expect = 8e-26, Method: Composition-based stats. Identities = 34/119 (28%), Positives = 46/119 (38%), Gaps = 7/119 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + G E A L IA N R EID+I +F EV+ S LY Sbjct: 2 ASHIALGKKGELIACGHLRLHHYEIIAVNWRHRRREIDIIASRDGCLVFFEVKTLASDLY 61 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG-----NEVEWIKDAF 127 G VT +K+ + A ++ R RFDV+A T E+ +DAF Sbjct: 62 GWPEKHVTAAKRRNIQAVASAYMDRMKQLPKV--IRFDVIAITFQPDGTYELVHFEDAF 118 >UniRef50_Q1IJG5 UPF0102 protein Acid345_3985 n=1 Tax=Candidatus Koribacter versatilis Ellin345 RepID=Y3985_ACIBL Length = 143 Score = 117 bits (295), Expect = 8e-26, Method: Composition-based stats. Identities = 32/125 (25%), Positives = 53/125 (42%), Gaps = 9/125 (7%) Query: 9 GSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERG--GEIDLIMREGRTTIFVEVR 66 P + +TG E A +L G +A N E+D+I G F+EV+ Sbjct: 17 PEPDEPEHLKTGRRGEELAYFFLRKHGYTIVARNFRTPWHKSELDIIGWNGGILCFIEVK 76 Query: 67 YRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE----VEW 122 R + A A+V +K++ L + AR +L + + RFD+V + + Sbjct: 77 TRTTRDIATAEAAVDDTKRNDLRRVARHYLRQ---CAENTPTRFDIVTVYLDRPKPEITI 133 Query: 123 IKDAF 127 +K AF Sbjct: 134 LKSAF 138 >UniRef50_C7GYG3 Putative choloylglycine hydrolase n=1 Tax=Eubacterium saphenum ATCC 49989 RepID=C7GYG3_9FIRM Length = 109 Score = 117 bits (295), Expect = 9e-26, Method: Composition-based stats. Identities = 31/114 (27%), Positives = 57/114 (50%), Gaps = 5/114 (4%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + K+ G E +L+ + + N R EID+I +G T F+EV+ R S + Sbjct: 1 MENKKIGSLGEEMTCSYLKDRQFVVLEQNYRNRYAEIDVIALKGDTVHFIEVKTRCSEVA 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAF 127 G A+ +V SKQ+++ + A ++LA ++ + F VV +++ + AF Sbjct: 61 GRASEAVPVSKQNRIRRLAEIYLADND--LCDKNVEFHVVTIDLHDINY---AF 109 >UniRef50_B9CKG2 Putative uncharacterized protein n=1 Tax=Atopobium rimae ATCC 49626 RepID=B9CKG2_9ACTN Length = 118 Score = 117 bits (295), Expect = 9e-26, Method: Composition-based stats. Identities = 33/116 (28%), Positives = 48/116 (41%), Gaps = 12/116 (10%) Query: 22 AWEAQARRWLEGKGLRFIAANVNE-RGGEIDLIMREGRTTIFVEVRYRRSALYGG---AA 77 E A +L +G + I N GGE D+I ++G + VEV+ RR Sbjct: 1 MGEQLAADYLAERGYKIIQRNWRCKGGGEADIIAQDGDVYVMVEVKTRRMLQVDANLMPE 60 Query: 78 ASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT-----GNEVEWIKDAFN 128 +VT KQ + A L+LA H RFDV+A + + AF+ Sbjct: 61 LAVTAQKQRMYRKMALLYLAFHG---QVSMIRFDVIAINLVAEHNASLRHLIGAFS 113 >UniRef50_Q4JV13 UPF0102 protein jk1180 n=2 Tax=Corynebacterium jeikeium RepID=Y1180_CORJK Length = 131 Score = 117 bits (294), Expect = 1e-25, Method: Composition-based stats. Identities = 34/131 (25%), Positives = 52/131 (39%), Gaps = 4/131 (3%) Query: 1 MATVPTRS-GSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREG-R 58 M + G+ E A +L G + N R GE+DL+ R Sbjct: 1 MPDHQEGEGRRVTAANRRAVGNLGEDLAAEYLHRAGYEVLDRNFYTRYGELDLVARTPED 60 Query: 59 TTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTV-DCRFDVVAFTG 117 F+EV+YR SA GG A+V K ++ A LWL ++ R DV+ Sbjct: 61 DLAFIEVKYRTSASDGGGVAAVGPRKLRRIRTLAGLWLEQNREGVQFSGGLRVDVIDVGP 120 Query: 118 NEV-EWIKDAF 127 + V E ++ + Sbjct: 121 DGVREHVEGVW 131 >UniRef50_A4A171 Putative uncharacterized protein n=2 Tax=Planctomycetaceae RepID=A4A171_9PLAN Length = 137 Score = 117 bits (294), Expect = 1e-25, Method: Composition-based stats. Identities = 37/106 (34%), Positives = 52/106 (49%), Gaps = 8/106 (7%) Query: 30 WLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAASVTRSKQHKLL 89 +L G IA + + GEID+I +GRT +FVEV+ R S+ + +V KQ KL Sbjct: 26 FLRQLGYVIIARSDRSKLGEIDIIAVDGRTVVFVEVKTRSSSDAAHPSEAVDTHKQAKLT 85 Query: 90 QTARLWLARHNGSFDTVDCRFDVVAFTG------NEVEWIKDAFND 129 + A +L RHN RFDV+A T +E +AF Sbjct: 86 RLAISYLRRHNLLECKA--RFDVIAITWPAAAQTPTIEHFLNAFEP 129 >UniRef50_A1VFE8 UPF0102 protein Dvul_2148 n=3 Tax=Desulfovibrio vulgaris RepID=Y2148_DESVV Length = 134 Score = 117 bits (293), Expect = 1e-25, Method: Composition-based stats. Identities = 38/118 (32%), Positives = 54/118 (45%), Gaps = 6/118 (5%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 TG E +A L+ G R IA N G E+D+I G T +FVEV+ R + Sbjct: 4 ARHATGQHGEDEAAALLQRTGHRIIARNWRHGGLELDIICETGDTIVFVEVKTRAAHGLT 63 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE----VEWIKDAFN 128 ++T K+H+L++ AR CRFD+V T +E I DAF+ Sbjct: 64 SPTDALTHQKRHRLIRAARA--WLAAADAWDRACRFDLVCVTQRGATCTLEHITDAFD 119 >UniRef50_A7HN69 UPF0102 protein Fnod_1509 n=1 Tax=Fervidobacterium nodosum Rt17-B1 RepID=Y1509_FERNB Length = 133 Score = 117 bits (293), Expect = 2e-25, Method: Composition-based stats. Identities = 24/110 (21%), Positives = 49/110 (44%), Gaps = 1/110 (0%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 K+ E A ++L+ KG + + N GEID+I + IFVEV+ + Sbjct: 19 NKKEWQIAEELAVKYLKEKGYKILEKNFKTPYGEIDIIANKKDIIIFVEVKSGKGIR-IQ 77 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 + V K K++++A +L + + + + DV+ ++ ++ Sbjct: 78 PSERVDDKKYLKIVKSAEFYLEFYLKNKNYKISQIDVIEIINGNIKHYEN 127 >UniRef50_C0E2B0 Putative uncharacterized protein n=2 Tax=Corynebacterium matruchotii RepID=C0E2B0_9CORY Length = 156 Score = 116 bits (292), Expect = 2e-25, Method: Composition-based stats. Identities = 34/106 (32%), Positives = 55/106 (51%), Gaps = 6/106 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGR---TTIFVEVRYRRSAL 72 + G+ EA A ++ +G R +A NV GE+D+I R T +F+EV+ R + Sbjct: 36 AHRVGELGEATAAQFYRDEGYRILARNVRYPVGELDVIARAPDPSGTIVFIEVKTRTTLD 95 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN 118 +G A +VT K H++ + A WL + + + RFDVVA + Sbjct: 96 FGIA-EAVTPRKLHRMHRAAYRWLTERHVPWS--EVRFDVVAIYLD 138 >UniRef50_C1TKL9 Predicted endonuclease related to Holliday junction resolvase n=1 Tax=Dethiosulfovibrio peptidovorans DSM 11002 RepID=C1TKL9_9BACT Length = 123 Score = 116 bits (292), Expect = 2e-25, Method: Composition-based stats. Identities = 28/113 (24%), Positives = 49/113 (43%), Gaps = 3/113 (2%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + G E A R+L G + NV R GE+D++ R+G T + VEVR+R + + Sbjct: 2 TAPHLEKGKRGEDLACRYLRNLGWTVLERNVRFRRGELDIVARDGDTLVIVEVRFRTTGI 61 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 SV K +L+ ++ + + R D++A T + + Sbjct: 62 IMSPEDSVGPRKLRRLVIAGAAYVEKTGWNGF---WRIDLIALTERKGRLFLN 111 >UniRef50_C0QVG4 UPF0102 protein BHWA1_02005 n=2 Tax=Brachyspira RepID=Y2005_BRAHW Length = 121 Score = 116 bits (292), Expect = 2e-25, Method: Composition-based stats. Identities = 41/120 (34%), Positives = 58/120 (48%), Gaps = 7/120 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNER--GGEIDLIMREGRTTIFVEVRYRRSA 71 K G+ E A +LE G I N + GEIDL+M +G +F+EV+YRR Sbjct: 2 ANKKIIGNLGEDIALEYLEKLGYTLIERNFKGKKTRGEIDLVMTKGVVIVFIEVKYRRQG 61 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT----GNEVEWIKDAF 127 +G AA S++ K+ KL +TA +L SF C F V E+ +I+D F Sbjct: 62 SFGYAACSISDRKKKKLYETAEEYLIEKGLSF-NQKCSFGAVLIDDTHYNREISFIEDIF 120 >UniRef50_A1SLR5 UPF0102 protein Noca_3248 n=11 Tax=Actinomycetales RepID=Y3248_NOCSJ Length = 124 Score = 116 bits (292), Expect = 2e-25, Method: Composition-based stats. Identities = 32/111 (28%), Positives = 46/111 (41%), Gaps = 2/111 (1%) Query: 9 GSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYR 68 S + G E A R L G+G+ + N GEIDL++R+G + EV+ R Sbjct: 3 SSAAAAIKQALGAYGETLAARHLVGQGMVLLERNWRCEAGEIDLVLRDGDVLVVCEVKTR 62 Query: 69 RSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE 119 S YG +VT K +L + A W+ D R D+V Sbjct: 63 SSLRYGTPHEAVTDIKVARLRRLASRWVQDRGV--AVRDIRIDLVGIVRPR 111 >UniRef50_A6DUI2 Endonuclease n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DUI2_9BACT Length = 127 Score = 116 bits (292), Expect = 2e-25, Method: Composition-based stats. Identities = 36/127 (28%), Positives = 58/127 (45%), Gaps = 11/127 (8%) Query: 12 RQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERG-GEIDLIMREGRTTIFVEVRYRRS 70 ++ +TG EA A+R + G + N + GEID+I R+G T FVEV+ R Sbjct: 3 KKAKHLKTGRKGEAMAQRQMRRCGYEILRKNYSLEHIGEIDIIARDGGTLCFVEVKTRHQ 62 Query: 71 AL--YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEW---IK- 124 A ++ K+ ++ + A+ +L +H S RFD+V + W IK Sbjct: 63 NKTEDTSPAQAIDSKKRQRIAKCAKYYLKKH--SLTQCSFRFDIVEVILGKFFWQHQIKI 120 Query: 125 --DAFND 129 AF + Sbjct: 121 RTHAFGE 127 >UniRef50_A0Q0X6 UPF0102 protein NT01CX_2205 n=3 Tax=Clostridium RepID=Y2205_CLONN Length = 120 Score = 116 bits (291), Expect = 2e-25, Method: Composition-based stats. Identities = 31/118 (26%), Positives = 53/118 (44%), Gaps = 8/118 (6%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 K G E + +L KG + + N R GEID+I F EV+ R + +G Sbjct: 5 NKPIGSYGEHISENFLVSKGHKILTKNFRCRSGEIDIISSHNNYICFTEVKTRYNYSFGI 64 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE------VEWIKDAF 127 SVT +K K+ TA+ ++ + + F+V+ N+ + +I++AF Sbjct: 65 PCESVTITKIKKIRNTAKFYIYINKLFKNNFK--FNVIEIILNKYSNDYSINFIENAF 120 >UniRef50_C8NTW9 Choloylglycine hydrolase n=1 Tax=Corynebacterium genitalium ATCC 33030 RepID=C8NTW9_9CORY Length = 125 Score = 116 bits (291), Expect = 3e-25, Method: Composition-based stats. Identities = 35/107 (32%), Positives = 45/107 (42%), Gaps = 6/107 (5%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMR-EGRTTIFVEVRYRRSA 71 G E + E G +A N R GEID+I TT+FVEV+ RR Sbjct: 7 PRDQYMLGALGETEVATRYEQAGYIIVARNYRCRDGEIDIIAMATDGTTVFVEVKTRRGT 66 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN 118 +GGA SVT K ++ + A WL RFDVV + Sbjct: 67 CFGGA-ESVTARKLARMRKAAVHWLRDKPFR----QVRFDVVEVLFD 108 >UniRef50_C6WEV0 Putative uncharacterized protein n=1 Tax=Actinosynnema mirum DSM 43827 RepID=C6WEV0_ACTMD Length = 117 Score = 115 bits (290), Expect = 3e-25, Method: Composition-based stats. Identities = 28/116 (24%), Positives = 49/116 (42%), Gaps = 7/116 (6%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 + G E+ A R+LE +GL +A N GE+D++ +G + EV+ R + G Sbjct: 3 ASHVLGRLGESVACRYLERQGLVVLARNWRCASGELDVVATDGVRLVVCEVKCRSGSGRG 62 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-----EVEWIKD 125 + T + ++ +TA W H S V R D+V + ++ Sbjct: 63 DPLEAATPEQLDRVRRTAYRWRREHRLSG--VGVRVDLVGLEWPPGGPVRLRHVRG 116 >UniRef50_Q47S60 UPF0102 protein Tfu_0669 n=2 Tax=Nocardiopsaceae RepID=Y669_THEFY Length = 124 Score = 115 bits (290), Expect = 3e-25, Method: Composition-based stats. Identities = 32/110 (29%), Positives = 48/110 (43%), Gaps = 3/110 (2%) Query: 7 RSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVR 66 R+ + + G E A R+L G+R + N R GEID++ R+ RT + VEV+ Sbjct: 2 RTRARLADQRRTLGQRGEELAARYLTRHGMRVLQRNWRCRDGEIDILARQDRTLVVVEVK 61 Query: 67 YRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT 116 R +G +V +K+ +L W H S R DVV Sbjct: 62 TRAGRRFGTPLEAVDETKRARLRALGYRWARDHGCS---ARIRVDVVGIL 108 >UniRef50_B8HA88 UPF0102 protein Achl_2213 n=1 Tax=Arthrobacter chlorophenolicus A6 RepID=Y2213_ARTCA Length = 132 Score = 115 bits (290), Expect = 3e-25, Method: Composition-based stats. Identities = 31/126 (24%), Positives = 48/126 (38%), Gaps = 10/126 (7%) Query: 14 LTTKQT-GDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + K G E A +LE G+ + N GEID++ +G + EV+ RRS Sbjct: 1 MKAKDLLGRHGEDLAVGYLETLGMLIVERNWRCSEGEIDVVALDGDALVIAEVKTRRSLD 60 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-----VEWIKDA- 126 YG +V K +L + W R DV+A + VE +K Sbjct: 61 YGHPFEAVGPDKLARLHRLGAAWCRDRELRMPLR--RVDVIAVVDDGGGSPVVEHLKGVA 118 Query: 127 -FNDHS 131 + + Sbjct: 119 EWRSDA 124 >UniRef50_B2IUS6 Putative uncharacterized protein n=4 Tax=Nostocaceae RepID=B2IUS6_NOSP7 Length = 180 Score = 115 bits (290), Expect = 4e-25, Method: Composition-based stats. Identities = 31/130 (23%), Positives = 48/130 (36%), Gaps = 28/130 (21%) Query: 18 QTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGR------------------- 58 G E +WL+ G + R GEID+I + Sbjct: 11 DIGHLGEDLVAQWLQSTGWIILHRRFASRWGEIDIIAQHDGQTGEKLLTQHSLRAKRPAT 70 Query: 59 -------TTIFVEVRYRRSALYG-GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRF 110 FVEV+ R S + G +++T KQ K+ +TA ++LA++ CRF Sbjct: 71 ANSTQHSLLAFVEVKTRSSGSWDAGGRSAITPQKQAKISRTAGIFLAQYPEK-ADYSCRF 129 Query: 111 DVVAFTGNEV 120 DV + Sbjct: 130 DVAIVYCQRI 139 >UniRef50_C0BPF0 Putative uncharacterized protein n=1 Tax=Flavobacteria bacterium MS024-3C RepID=C0BPF0_9BACT Length = 119 Score = 115 bits (289), Expect = 5e-25, Method: Composition-based stats. Identities = 35/116 (30%), Positives = 50/116 (43%), Gaps = 7/116 (6%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 G E A+ +L KG A N R EID+I I VEV+ R + Sbjct: 4 HNDLGAKGERIAQEYLISKGYEIRAVNYRHRKAEIDIIALHENFLIVVEVKTRTAPTIVP 63 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT----GNEVEWIKDAF 127 V SK + L++ A ++ RH + RFD+V T ++E +KDAF Sbjct: 64 LIQLVPPSKINHLIRAANYYMNRHKV---HKEARFDIVYITMKAHSYDLEHLKDAF 116 >UniRef50_B6R5V9 Putative uncharacterized protein n=1 Tax=Pseudovibrio sp. JE062 RepID=B6R5V9_9RHOB Length = 135 Score = 115 bits (289), Expect = 5e-25, Method: Composition-based stats. Identities = 33/129 (25%), Positives = 56/129 (43%), Gaps = 4/129 (3%) Query: 2 ATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTI 61 T+S ++ + G E +A L G + + + GEIDLI + +T + Sbjct: 7 TPRATKSNLLKKQAAYRKGLQAELKAEMLLRQAGWQILERRYKTKQGEIDLIAEQDQTIV 66 Query: 62 FVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFD-VVAFTGNEV 120 FVEV+ RR G ++T+ Q ++ AR W++ H+ RFD V+ E Sbjct: 67 FVEVKARRGVDDG--LYAITQRSQRRIANAAREWVSHHHEVVGKT-LRFDAVILPKHGEA 123 Query: 121 EWIKDAFND 129 + + F Sbjct: 124 QHFPNLFEA 132 >UniRef50_Q118B0 UPF0102 protein Tery_0733 n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Y733_TRIEI Length = 180 Score = 115 bits (289), Expect = 5e-25, Method: Composition-based stats. Identities = 30/117 (25%), Positives = 47/117 (40%), Gaps = 14/117 (11%) Query: 18 QTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRT------------TIFVEV 65 TG E +WL +G + + GE+D++ + + FVEV Sbjct: 17 DTGILGEELVAKWLNLEGWQILHRRWQCPWGELDIVATKTTSSLRDSSNYKFPILAFVEV 76 Query: 66 RYRRSALYG-GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVE 121 + R + +VT SKQ KL +TA ++L+ CRFDV N + Sbjct: 77 KTRSRGNWDQDGLLAVTESKQAKLWKTAEIFLSDRP-ELVDYSCRFDVALVRCNYIR 132 >UniRef50_C5CAH6 Holliday junction resolvase-like endonuclease n=1 Tax=Micrococcus luteus NCTC 2665 RepID=C5CAH6_MICLC Length = 139 Score = 115 bits (288), Expect = 5e-25, Method: Composition-based stats. Identities = 29/118 (24%), Positives = 41/118 (34%), Gaps = 2/118 (1%) Query: 2 ATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTI 61 A P + R G E A RWL +G N GE+D++ + Sbjct: 10 AQPPGERRASR--AHTALGRFGEDAAARWLAERGYVIADRNWRGEAGELDIVAHHAGWWV 67 Query: 62 FVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE 119 VEV+ R +G S+ R K +L + W+ H R D VA Sbjct: 68 GVEVKTRSGLAFGDPFESIDRRKLTRLHRLTAAWVRAHAADRRGTPWRVDAVAVLVPR 125 >UniRef50_A0LMM6 Putative uncharacterized protein n=1 Tax=Syntrophobacter fumaroxidans MPOB RepID=A0LMM6_SYNFM Length = 95 Score = 115 bits (288), Expect = 5e-25, Method: Composition-based stats. Identities = 37/94 (39%), Positives = 51/94 (54%), Gaps = 6/94 (6%) Query: 40 AANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARH 99 N GEIDLI+R+G+T +FVEV+ R + +G SV+ +KQ +L + A +L Sbjct: 2 ERNFRCAAGEIDLIVRDGKTLVFVEVKSRCGSRFGLPQESVSIAKQRRLTRLALWYLREK 61 Query: 100 NGSFDTVDCRFDVVAFTG----NEVEWIKDAFND 129 F+ RFDVVA T EV WI +AF Sbjct: 62 R--FEGHPARFDVVAVTWSGGKPEVTWIVNAFEA 93 >UniRef50_C7H7V1 Endonuclease n=2 Tax=Faecalibacterium prausnitzii RepID=C7H7V1_9FIRM Length = 120 Score = 115 bits (288), Expect = 6e-25, Method: Composition-based stats. Identities = 42/122 (34%), Positives = 58/122 (47%), Gaps = 8/122 (6%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE-GRTTIFVEVRYRRSAL 72 + +TG EA A R+ + +G +A N R GEIDLI+RE T + EV+ R Sbjct: 1 MDRAETGRTGEAVAARYYQKQGCELVAHNYRTRMGEIDLILREPDGTLVLCEVKTRSPDP 60 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG-----NEVEWIKDAF 127 AA+VT +KQ +L++TA +L + RFDV T V IK AF Sbjct: 61 LAAPAAAVTPAKQRRLIRTAEYYLQ--HTGQSDEPVRFDVAEVTPLDSGRWMVHIIKGAF 118 Query: 128 ND 129 Sbjct: 119 TA 120 >UniRef50_D2ATE9 Putative uncharacterized protein n=1 Tax=Streptosporangium roseum DSM 43021 RepID=D2ATE9_STRRD Length = 117 Score = 114 bits (287), Expect = 8e-25, Method: Composition-based stats. Identities = 33/103 (32%), Positives = 47/103 (45%), Gaps = 2/103 (1%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + G E A +L G++ + N GEID++ REGR + VEV+ R + Sbjct: 2 AAKDELGRHGEQVAVDYLLAHGMQILDRNWRCPDGEIDVVAREGRALVVVEVKTRSGRTH 61 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT 116 G A +VT K +L + WLA FD R DV+A Sbjct: 62 GTAFEAVTVVKLARLRRLTGRWLAERRERFD--SVRIDVIALE 102 >UniRef50_A8F4Z9 UPF0102 protein Tlet_0667 n=1 Tax=Thermotoga lettingae TMO RepID=Y667_THELT Length = 107 Score = 114 bits (287), Expect = 8e-25, Method: Composition-based stats. Identities = 30/106 (28%), Positives = 44/106 (41%), Gaps = 4/106 (3%) Query: 20 GDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAAS 79 E +A R+L KG + +A N R GEID+I R +FVEV+ + Sbjct: 3 WKEAEEKASRYLRHKGFKILARNYRTRFGEIDIIARYRGYLVFVEVK--SGNSFFLPRTR 60 Query: 80 VTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 V K + A ++ SF R DV+ T +E +D Sbjct: 61 VDLQKIRHIQLAANDYIMNTKDSFKGY--RIDVIEVTEKGIEHFED 104 >UniRef50_B7K7K1 Putative uncharacterized protein n=1 Tax=Cyanothece sp. PCC 7424 RepID=B7K7K1_CYAP7 Length = 148 Score = 113 bits (285), Expect = 1e-24, Method: Composition-based stats. Identities = 28/103 (27%), Positives = 45/103 (43%), Gaps = 4/103 (3%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE--GRTTIFVEVRYRRSALY 73 G+ E WL+ + + R GEID+I + + F+EV+ R S + Sbjct: 1 MTTIGELGEKLVSEWLKTQEWSILQHRWRCRWGEIDIISQSTTDHSLAFIEVKTRNSRNW 60 Query: 74 G-GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF 115 ++ KQ KL+++A L+L + S CRFDV Sbjct: 61 DSDGLLAINEKKQIKLIKSASLFLGEYP-SLALFPCRFDVALV 102 >UniRef50_Q5FF38 UPF0102 protein ERGA_CDS_00540 n=5 Tax=canis group RepID=Y054_EHRRG Length = 127 Score = 113 bits (285), Expect = 1e-24, Method: Composition-based stats. Identities = 27/120 (22%), Positives = 50/120 (41%), Gaps = 6/120 (5%) Query: 12 RQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSA 71 ++L G E +L+ K I GEID+I + + +F+EV+ Sbjct: 8 KRLAYNTLGYLGEVLIILFLKCKLYHIIKHRYRCPLGEIDIIAHKNKQLVFIEVKTSLFN 67 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF-TGNEVEWIKDAFNDH 130 +T +Q +L++A+ ++A H F RFD+ F + I +A+ + Sbjct: 68 KNIP----ITYKQQKSILKSAKYFIAFHR-KFANYSIRFDLYFFSLSTGLTHIPNAWQEP 122 >UniRef50_C1A4D5 Putative uncharacterized protein n=1 Tax=Gemmatimonas aurantiaca T-27 RepID=C1A4D5_GEMAT Length = 121 Score = 113 bits (284), Expect = 2e-24, Method: Composition-based stats. Identities = 35/120 (29%), Positives = 60/120 (50%), Gaps = 6/120 (5%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 ++ G E A RWL +G + +A +IDLIM+ + FVEV+ RR Sbjct: 2 TRARQELGLLGERIAARWLIREGWQLVAHRFRHGHRDIDLIMQREQEVAFVEVKARRGEA 61 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN----EVEWIKDAFN 128 +G +V K+ +L+++A++W+ RH + ++ RFDV+ + V I+ AF Sbjct: 62 FGSPVEAVHARKRRELVRSAKVWVDRHGT--EGLEYRFDVLGILIDGQNVRVRHIEGAFQ 119 >UniRef50_A8HYK3 UPF0102 protein AZC_4471 n=3 Tax=Rhizobiales RepID=Y4471_AZOC5 Length = 131 Score = 113 bits (284), Expect = 2e-24, Method: Composition-based stats. Identities = 35/128 (27%), Positives = 55/128 (42%), Gaps = 4/128 (3%) Query: 2 ATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTI 61 P R+ G A E +A LE + R +A + GE+DL+ R + Sbjct: 4 PPPPDTPARRRKQAAHARGLAAEDRAAAVLEAQSFRILARRLRTSAGELDLVARRDDLLV 63 Query: 62 FVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF-TGNEV 120 F EV+ RRS AA S+ ++ +++ A L+LA H + RFD + Sbjct: 64 FCEVKLRRS--LAEAAESLQLRQRRRIIAAAELFLADHP-ELAPLAMRFDAILLGRDGGA 120 Query: 121 EWIKDAFN 128 E ++ AF Sbjct: 121 EHLEGAFE 128 >UniRef50_A0NXE9 Putative uncharacterized protein n=2 Tax=Labrenzia RepID=A0NXE9_9RHOB Length = 134 Score = 113 bits (284), Expect = 2e-24, Method: Composition-based stats. Identities = 38/135 (28%), Positives = 63/135 (46%), Gaps = 9/135 (6%) Query: 1 MATVPTRSG-----SPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMR 55 MA P SG + R+ G + E A +L G R + + GEIDLI + Sbjct: 1 MARAPGGSGKLPAETDRRRRAHALGLSAETLAAWYLRLTGWRILKRRYKTKAGEIDLIAK 60 Query: 56 EGRTTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF 115 +T F+EV+ R+S A +VT + Q ++ + A++++A H RFD++ Sbjct: 61 RRKTVAFIEVKARKSRQ--AALEAVTPASQKRITRAAKIFVAEHP-KAGFYTLRFDIIVV 117 Query: 116 TGNEV-EWIKDAFND 129 + E I +AF+ Sbjct: 118 RPRALPERIVNAFHA 132 >UniRef50_C4LJB4 Putative uncharacterized protein n=1 Tax=Corynebacterium kroppenstedtii DSM 44385 RepID=C4LJB4_CORK4 Length = 154 Score = 113 bits (284), Expect = 2e-24, Method: Composition-based stats. Identities = 31/108 (28%), Positives = 47/108 (43%), Gaps = 8/108 (7%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMR-------EGRTTIFVEVRY 67 +T+ G E +A WL +G + N RGGE+D++ VEV+ Sbjct: 26 STRLLGRWGEDRAAEWLVRQGFVIVDRNWRFRGGELDIVATLNTDARNSPAVCAVVEVKT 85 Query: 68 RRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF 115 RR+ +GG ++TR KQ L + WLA H R D++ Sbjct: 86 RRTQFFGGGVEAITRKKQQTLRRGMSQWLAAHPDVHPQF-IRIDLIDI 132 >UniRef50_B2RLS5 UPF0102 protein PGN_1801 n=2 Tax=Porphyromonas gingivalis RepID=Y1801_PORG3 Length = 135 Score = 113 bits (283), Expect = 2e-24, Method: Composition-based stats. Identities = 24/120 (20%), Positives = 46/120 (38%), Gaps = 9/120 (7%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 G E A + L +G + A N E+D++ R + VEV+ R Sbjct: 2 ADHNDRGRQGEEIALKHLRQQGYQIEALNWQSGRRELDIVASTSRELVVVEVKTRTEGFL 61 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG------NEVEWIKDAF 127 +V K+ + ++A ++ + + RFDV++ +E ++AF Sbjct: 62 LAPEEAVDARKRRLISESAHHYVRMYAI---DLPVRFDVISVVLSADGSCKRIEHRENAF 118 >UniRef50_A6GIE5 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6GIE5_9DELT Length = 125 Score = 113 bits (283), Expect = 2e-24, Method: Composition-based stats. Identities = 44/125 (35%), Positives = 61/125 (48%), Gaps = 6/125 (4%) Query: 8 SGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGR-----TTIF 62 + T+ G A E A R LE GL +A NV G E+DL+ E T +F Sbjct: 2 ASDEPSTHTRGRGLAAEQLAARQLERAGLTILARNVELSGAEVDLVASERDREGTPTIVF 61 Query: 63 VEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEW 122 VEVR R G A +V KQ ++ + A +L R + ++ V RFDV+A G W Sbjct: 62 VEVRSRADDRRGHPAQTVDARKQARVRRAATAYLVREDL-WERVAVRFDVIAIVGERATW 120 Query: 123 IKDAF 127 ++DAF Sbjct: 121 LRDAF 125 >UniRef50_C4KCT6 Putative uncharacterized protein n=3 Tax=Betaproteobacteria RepID=C4KCT6_THASP Length = 150 Score = 113 bits (283), Expect = 2e-24, Method: Composition-based stats. Identities = 42/98 (42%), Positives = 56/98 (57%), Gaps = 3/98 (3%) Query: 35 GLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARL 94 GLR IA NV RGGE+DL+ + +FVEVR RR+ +GGAA S+T +KQ ++L A+ Sbjct: 52 GLRVIARNVRCRGGEVDLVCLDRSHVVFVEVRLRRNNRFGGAAESITAAKQRRVLIAAQW 111 Query: 95 WLARHNGSFDTVDCRFDVV---AFTGNEVEWIKDAFND 129 WL F CRFD V A + W+ AF+ Sbjct: 112 WLGGAGRRFRDAACRFDAVLLDALDPARIIWLPGAFDA 149 >UniRef50_C5CGT1 UPF0102 protein Kole_1919 n=1 Tax=Kosmotoga olearia TBF 19.5.1 RepID=Y1919_KOSOT Length = 115 Score = 112 bits (282), Expect = 3e-24, Method: Composition-based stats. Identities = 29/111 (26%), Positives = 55/111 (49%), Gaps = 5/111 (4%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 + + G +E +A ++L+ +G + +A NV GE+D++ R+G+T +FVEV+ Sbjct: 5 SLKKGKEFEERASKFLKKQGYKILARNVRYSFGELDIVARKGKTLVFVEVK--GGNPDFP 62 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT-GNEVEWIKD 125 V R+K +L A ++ + F+ R DV+ E+ +K Sbjct: 63 PRMRVDRAKLRRLELAAYKYIKDFSPKFEES--RLDVIEVLSNGEINHLKG 111 >UniRef50_B2UP21 Putative uncharacterized protein n=2 Tax=Verrucomicrobiaceae RepID=B2UP21_AKKM8 Length = 118 Score = 112 bits (282), Expect = 3e-24, Method: Composition-based stats. Identities = 36/115 (31%), Positives = 52/115 (45%), Gaps = 9/115 (7%) Query: 21 DAWEAQARRWLEGKGLRFIAANVN-ERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAAS 79 E A +L +G + N RGGE+D++ REG +FVEV+ R YGGA + Sbjct: 1 MYGELAAASFLRAEGCVILRRNWRPVRGGELDIVCREGECLVFVEVKTRTGNGYGGARRA 60 Query: 80 VTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT-----GNEVEWIKDAFND 129 V K+ + + A WL + V+ R+DVV E I+ AF Sbjct: 61 VNARKRALIRRGAAEWL---RLLPEPVNSRYDVVEVLYREGMPPEFRHIRGAFGA 112 >UniRef50_C6XIG7 Putative uncharacterized protein n=1 Tax=Hirschia baltica ATCC 49814 RepID=C6XIG7_HIRBI Length = 138 Score = 112 bits (282), Expect = 3e-24, Method: Composition-based stats. Identities = 34/119 (28%), Positives = 59/119 (49%), Gaps = 4/119 (3%) Query: 12 RQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSA 71 ++ ++ G E A WL KG + + V R GEIDLI +GR F+EV+ R++ Sbjct: 23 KRRKHEKRGRNAEWLASIWLRLKGYKILQKRVRMRTGEIDLIATKGRVIAFIEVKARKTI 82 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEV-EWIKDAFND 129 G SV + ++ +TA +W+A+ F + D R+D+V ++ +K + Sbjct: 83 NIG--LQSVPETSWRRISKTAEIWMAKK-TKFKSHDWRYDLVVVCPWKIPSHLKAFWRP 138 >UniRef50_A4CH47 Putative uncharacterized protein n=1 Tax=Robiginitalea biformata HTCC2501 RepID=A4CH47_9FLAO Length = 128 Score = 112 bits (281), Expect = 4e-24, Method: Composition-based stats. Identities = 32/117 (27%), Positives = 53/117 (45%), Gaps = 7/117 (5%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 TT G E A R+L G R + N R EID++ + +EV+ R A Y Sbjct: 11 TTCDIGREGEDYAVRYLLASGYRILCRNYRYRRAEIDVLAFREGVLVVIEVKTRTRAFYE 70 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF----TGNEVEWIKDAF 127 + S+ RSK +L++ A ++ + + RFD++ G + ++DAF Sbjct: 71 ALSRSIPRSKIARLVRAADHYVRSNGLR---AEVRFDIIQVIRLREGYRLVHLEDAF 124 >UniRef50_Q3M3N9 UPF0102 protein Ava_4800 n=2 Tax=Nostocaceae RepID=Y4800_ANAVT Length = 151 Score = 112 bits (281), Expect = 4e-24, Method: Composition-based stats. Identities = 29/108 (26%), Positives = 50/108 (46%), Gaps = 7/108 (6%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE-----GRTTIFVEVRYR 68 ++ + E +WL+ G + + R GEID+I + FVEV+ R Sbjct: 1 MSHLNIANLGEDFVAQWLQSTGWMILNRQFSCRWGEIDIIAQHTRNNQESILAFVEVKTR 60 Query: 69 RSALYGG-AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF 115 + ++T KQ K+ +TAR++LA++ + + CRFDV Sbjct: 61 SPGNWDDGGRGAITLKKQAKIERTARIFLAKYPDKAEYI-CRFDVAIV 107 >UniRef50_B4U6T0 Putative uncharacterized protein n=1 Tax=Hydrogenobaculum sp. Y04AAS1 RepID=B4U6T0_HYDS0 Length = 110 Score = 112 bits (281), Expect = 4e-24, Method: Composition-based stats. Identities = 22/107 (20%), Positives = 43/107 (40%), Gaps = 2/107 (1%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAA 78 G +E A WL KG + + N + GEID+I + I EV+ + YG Sbjct: 2 KGKEFEDMAFSWLLEKGYKVLKRNHRCKRGEIDIIATKENKLIAFEVKGNNTDTYGLPEE 61 Query: 79 SVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 + R K ++ + + D + D + +++ +++ Sbjct: 62 RIDRLKLERIRLCLTEYALSNGIDLDNIQI--DAIFIYKDQIRHLEN 106 >UniRef50_Q7UM23 UPF0102 protein RB9115 n=1 Tax=Rhodopirellula baltica RepID=Y9115_RHOBA Length = 167 Score = 112 bits (280), Expect = 5e-24, Method: Composition-based stats. Identities = 42/123 (34%), Positives = 57/123 (46%), Gaps = 11/123 (8%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIM--REGRTTIFVEVRYRRSALY 73 Q G E A + L KGL IA + ++R GEIDLI + R +FVEV+ + Sbjct: 39 NAQLGRRGEQAAAQLLRRKGLNVIAESESDRAGEIDLIALRKRPRLIVFVEVKTLSTTRP 98 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF-------TGNEVEWIKDA 126 G A V +KQ ++ + A +L R + CRFDVVA VE + A Sbjct: 99 GHPADRVDENKQARITRAALRYLKRKKLLG--ITCRFDVVAVWWPRDEPRPTRVEHYESA 156 Query: 127 FND 129 FN Sbjct: 157 FNA 159 >UniRef50_Q0A6J0 UPF0102 protein Mlg_2205 n=2 Tax=Ectothiorhodospiraceae RepID=Y2205_ALHEH Length = 126 Score = 112 bits (280), Expect = 5e-24, Method: Composition-based stats. Identities = 52/124 (41%), Positives = 67/124 (54%), Gaps = 1/124 (0%) Query: 7 RSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVR 66 R +TG+ E +A L G+GL + N R GEIDLIMR+G +FVEVR Sbjct: 3 RRRPSAPAPHLETGNRGERRALEHLTGQGLELLECNFRCRAGEIDLIMRDGEVVVFVEVR 62 Query: 67 YRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDA 126 R YGGA AS+T +KQ +L + A WL RH + CRFDVV F G +W++ A Sbjct: 63 VRTHPGYGGALASITPAKQRRLARAAARWLQRHRLT-QRAVCRFDVVTFDGERPQWLRHA 121 Query: 127 FNDH 130 F Sbjct: 122 FTAP 125 >UniRef50_Q0BTH9 UPF0102 protein GbCGDNIH1_0975 n=1 Tax=Granulibacter bethesdensis CGDNIH1 RepID=Y975_GRABC Length = 158 Score = 112 bits (280), Expect = 5e-24, Method: Composition-based stats. Identities = 33/120 (27%), Positives = 52/120 (43%), Gaps = 4/120 (3%) Query: 12 RQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSA 71 R + G E A + LE G + + + GEID++ VEV+YR + Sbjct: 37 RGGKASRDGLEAERIAAQALEADGWQILGRRLRTSAGEIDILAEMDGLLAIVEVKYRPTL 96 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT-GNEVEWIKDAFNDH 130 AA ++ ++ +L+ A LA+H + T RFDV+ +V I DAF Sbjct: 97 S--EAAHALGPRQRKRLIAAASYVLAQHP-EYGTEGVRFDVIVVDMAGQVRRITDAFRLD 153 >UniRef50_D1N5L9 Putative uncharacterized protein n=1 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1N5L9_9BACT Length = 132 Score = 111 bits (279), Expect = 6e-24, Method: Composition-based stats. Identities = 27/109 (24%), Positives = 49/109 (44%), Gaps = 2/109 (1%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + G EA A R LE KG +A N + GE+D++ R+G + +FVEV+ Sbjct: 5 RAAHLALGRRGEAAACRLLEAKGFDILARNWRVKAGELDIVARDGASVVFVEVKTLHRKG 64 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVE 121 + +++ ++ + A+L+L + RFD+V + Sbjct: 65 FFRPLDNLSAHQKKRNFHAAQLYLRM--IGGTGLPVRFDLVEVVASRWR 111 >UniRef50_B6BHT8 Putative uncharacterized protein n=1 Tax=Campylobacterales bacterium GD 1 RepID=B6BHT8_9PROT Length = 109 Score = 111 bits (279), Expect = 7e-24, Method: Composition-based stats. Identities = 27/110 (24%), Positives = 53/110 (48%), Gaps = 5/110 (4%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 ++ GD E +A ++L G + N R GEID+I + FVEV+ + Sbjct: 2 SRAKGDLAEDRACKFLYENGFMLVDRNFYSRFGEIDIIATKDEVLHFVEVK--SGLDFES 59 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 A ++T K +L++T +++ ++ + V +D + T VE +++ Sbjct: 60 AIQNITPKKLSRLIRTGNVYMKKNKLDVNFV---YDAIVVTPKTVEIVEN 106 >UniRef50_D2NTN5 Predicted endonuclease distantly related to archaeal Holliday junction resolvase n=2 Tax=Rothia mucilaginosa RepID=D2NTN5_9MICC Length = 151 Score = 111 bits (278), Expect = 7e-24, Method: Composition-based stats. Identities = 42/142 (29%), Positives = 53/142 (37%), Gaps = 20/142 (14%) Query: 2 ATVPTRSGSPRQL------TTKQTGDAWEAQARRWLEGKGLRFIAANVNER--------G 47 A TRS R L G E R LE G R + N Sbjct: 8 ARAATRSAPNRPLLRRTSPRAHSVGRWGEELTARILETNGYRILERNWRPPAGLEHEQIR 67 Query: 48 GEIDLIMREG-RTTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTV 106 GE+DLI + +FVEV+ R S +G AS+ R K + A LW R + D Sbjct: 68 GELDLIAIDPEDELVFVEVKTRSSEDFGHPFASIDRDKARRTRSLAILWC-RLRENLDFP 126 Query: 107 DCRFDVVAFTGN----EVEWIK 124 R D +A TG E +K Sbjct: 127 RFRIDAIAVTGTCETFTFEHLK 148 >UniRef50_D2LER1 Putative uncharacterized protein n=1 Tax=Rhodomicrobium vannielii ATCC 17100 RepID=D2LER1_RHOVA Length = 124 Score = 111 bits (278), Expect = 8e-24, Method: Composition-based stats. Identities = 36/123 (29%), Positives = 58/123 (47%), Gaps = 4/123 (3%) Query: 6 TRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEV 65 S + + + G A E +A+ LE K R +A +GGEIDL+ + G FVEV Sbjct: 3 AASRARNAPNSYKIGVAAETRAKLLLEAKSYRILAERYKTKGGEIDLVAQRGDHLAFVEV 62 Query: 66 RYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-VEWIK 124 + RR+ AA +V +Q ++ A ++L H FDV+ + + + I+ Sbjct: 63 KCRRTQE--EAAYAVLPRQQARIATAAEVFLGEH-AGLSHESASFDVILVSPTQGLSHIE 119 Query: 125 DAF 127 AF Sbjct: 120 QAF 122 >UniRef50_Q83I01 UPF0102 protein TW312 n=2 Tax=Tropheryma whipplei RepID=Y312_TROW8 Length = 120 Score = 110 bits (277), Expect = 1e-23, Method: Composition-based stats. Identities = 26/117 (22%), Positives = 46/117 (39%), Gaps = 4/117 (3%) Query: 12 RQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSA 71 ++ G E +A +L G + N R GE+D+I R+ + VEV+ + Sbjct: 3 HDVSKYALGRIAEDKACNYLSVNGYIVLDRNWYCRFGELDIIARKNGVIVAVEVKGGKRN 62 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG---NEVEWIKD 125 ++T K KL + WL + + +D R D V+ T ++ Sbjct: 63 A-DYPICNITVKKLSKLTFLLKAWLHENKLNEFCIDLRIDAVSVTFIPELQIRHFVG 118 >UniRef50_Q0F072 Putative uncharacterized protein n=1 Tax=Mariprofundus ferrooxydans PV-1 RepID=Q0F072_9PROT Length = 120 Score = 110 bits (277), Expect = 1e-23, Method: Composition-based stats. Identities = 33/124 (26%), Positives = 59/124 (47%), Gaps = 13/124 (10%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 ++T+ G+ E++A R+L+ G R + N GE+D++ G +FVEV+ Sbjct: 1 MSTRD-GNIGESEASRYLQHHGYRILDRNARLGRGELDIVALSGEIVVFVEVKA--HHNR 57 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN---------EVEWIK 124 A +V K +L A++WLA H + ++ CRFD++ T +E ++ Sbjct: 58 ESALLAVHEDKCARLKSAAQMWLALHP-RYASLQCRFDLIIITPRVGLTAWLGSCIEHME 116 Query: 125 DAFN 128 D Sbjct: 117 DIIR 120 >UniRef50_A3VPC2 Putative uncharacterized protein n=1 Tax=Parvularcula bermudensis HTCC2503 RepID=A3VPC2_9PROT Length = 146 Score = 110 bits (277), Expect = 1e-23, Method: Composition-based stats. Identities = 36/131 (27%), Positives = 55/131 (41%), Gaps = 3/131 (2%) Query: 2 ATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTI 61 A + S L ++ G E +A +L+ KG V GEIDLI+ +G T Sbjct: 8 ARQSAKRQSAEYLAAERLGRRAERRAALFLQLKGYAIRDRRVRTPRGEIDLIVTKGSTLA 67 Query: 62 FVEVRYRRSAL-YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEV 120 F+EV+ R SA A V ++ + +W AR S RFD++ Sbjct: 68 FIEVKARTSADALQDPATLVPPQNWARIAAASAIWRAR--ASLMPKIVRFDLILVRRGIP 125 Query: 121 EWIKDAFNDHS 131 +KDA+ + Sbjct: 126 CHVKDAYRPDA 136 >UniRef50_Q2IJ48 UPF0102 protein Adeh_1910 n=4 Tax=Anaeromyxobacter RepID=Y1910_ANADE Length = 134 Score = 110 bits (277), Expect = 1e-23, Method: Composition-based stats. Identities = 42/119 (35%), Positives = 54/119 (45%), Gaps = 6/119 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 + G EA A WL +G R + N R GE+DL+ R+G +FVEVR R S GG Sbjct: 12 RQALGREGEALAAAWLAERGFRILDRNHRTRRGEVDLVCRDGEVLVFVEVRSRTSGAQGG 71 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN----EVEWIKDAFNDH 130 +V K +++ A W H G RFDVVA T VE AF+ Sbjct: 72 PEETVGPLKGRRVVAAATDWALGHGGLEQ--AIRFDVVAVTFGDGEPRVEHFPAAFDGD 128 >UniRef50_C7QA44 Putative uncharacterized protein n=1 Tax=Catenulispora acidiphila DSM 44928 RepID=C7QA44_CATAD Length = 157 Score = 110 bits (275), Expect = 2e-23, Method: Composition-based stats. Identities = 30/142 (21%), Positives = 50/142 (35%), Gaps = 19/142 (13%) Query: 2 ATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNER----GGEIDLIMREG 57 T L+ Q G E A +L G + N R GE+D+I Sbjct: 14 PTDFPSQADHSTLSPGQLGREGEDLAAAYLTACGYHVLDRNWRWRGPDVRGELDIIALAS 73 Query: 58 RTTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVD--------CR 109 + +EV+ RR+A ++T +K+ +L + WLA H R Sbjct: 74 DLLVTIEVKTRRAATGARPFDAITEAKRARLWKLTNRWLAEHRLDPAVRHHLPRGIRGIR 133 Query: 110 FDVVAF-------TGNEVEWIK 124 DV+ T ++ ++ Sbjct: 134 LDVIGLIYPTDGHTEPTIDHLQ 155 >UniRef50_A1HN64 Putative uncharacterized protein n=1 Tax=Thermosinus carboxydivorans Nor1 RepID=A1HN64_9FIRM Length = 78 Score = 110 bits (275), Expect = 2e-23, Method: Composition-based stats. Identities = 25/70 (35%), Positives = 34/70 (48%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAA 78 G E A +L G + + N R GEID++ T +FVEV+ R S +G A Sbjct: 2 MGKMGENAAADYLARNGYKILMRNYRCRIGEIDIVAERQGTIVFVEVKTRSSEKFGFPAE 61 Query: 79 SVTRSKQHKL 88 +V KQ KL Sbjct: 62 AVNYRKQQKL 71 >UniRef50_C2ANQ3 Predicted endonuclease related to Holliday junction resolvase n=1 Tax=Tsukamurella paurometabola DSM 20162 RepID=C2ANQ3_TSUPA Length = 122 Score = 110 bits (275), Expect = 2e-23, Method: Composition-based stats. Identities = 28/107 (26%), Positives = 40/107 (37%), Gaps = 7/107 (6%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNER----GGEIDLIMRE-GRTTIFVEVRYR 68 + + G A E +L G+G R + N GE+D+I + VEV+ R Sbjct: 1 MGNNEVGRAGEDLVCEYLTGRGWRVLDRNWRFSGSGLRGELDVIAQSADGVLAVVEVKTR 60 Query: 69 RSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF 115 YG +VT K +L WLA R DV + Sbjct: 61 SGTAYGSGFEAVTPRKVAQLRALTARWLAE--SENAYRRVRIDVASV 105 >UniRef50_A4X4J0 UPF0102 protein Strop_1320 n=2 Tax=Micromonosporaceae RepID=Y1320_SALTO Length = 121 Score = 110 bits (275), Expect = 2e-23, Method: Composition-based stats. Identities = 40/119 (33%), Positives = 54/119 (45%), Gaps = 6/119 (5%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + G E A R L GLR +A N GEID+I +G EV+ RR+ Sbjct: 5 SRHNQSVGAYGERCALRHLITAGLRPVARNWRCPHGEIDIIAWDGPVLAICEVKTRRTDT 64 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT----GNEVEWIKDAF 127 +G A+VT +K +L A WLA D + RFDV++ VE +K AF Sbjct: 65 FGTPTAAVTGTKARRLRLLAARWLAETGTRAD--EVRFDVLSIRLTGGPPHVEHLKGAF 121 >UniRef50_C3XDT2 Putative uncharacterized protein n=1 Tax=Helicobacter bilis ATCC 43879 RepID=C3XDT2_9HELI Length = 112 Score = 109 bits (274), Expect = 3e-23, Method: Composition-based stats. Identities = 30/110 (27%), Positives = 52/110 (47%), Gaps = 6/110 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 +Q G +E A +L G FI N + R GEIDLIM++ F+EV+ S+ Sbjct: 4 MRQKGRYYEQVALEYLISLGFEFIEQNFHSRYGEIDLIMKKDSILHFIEVK---SSHCIN 60 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 A++T K +L +T ++L + D V+ + + +I++ Sbjct: 61 PLANITPKKLERLTKTIHVFLDQRQIVSHFC---IDAVSIYKDNITFIEN 107 >UniRef50_UPI0001C31AB5 protein of unknown function UPF0102 n=1 Tax=Conexibacter woesei DSM 14684 RepID=UPI0001C31AB5 Length = 122 Score = 109 bits (273), Expect = 3e-23, Method: Composition-based stats. Identities = 28/118 (23%), Positives = 46/118 (38%), Gaps = 7/118 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 G E A LE +G + N R GE+D++ + +F EV+ RR Sbjct: 6 RHHLGRIGENLAVEHLERRGFVVLDRNYRTRWGELDVVACDDERIVFCEVKTRRLGSSA- 64 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN------EVEWIKDAF 127 + ++ +L + A WL + RFD V T + +E ++ AF Sbjct: 65 PLEGLREPQRRRLRRMAVSWLQAKPRRTYVPELRFDAVGVTIDATGQLVALEHLEGAF 122 >UniRef50_UPI00019790C0 hypothetical protein HcinC1_06745 n=2 Tax=Helicobacter cinaedi CCUG 18818 RepID=UPI00019790C0 Length = 116 Score = 109 bits (273), Expect = 3e-23, Method: Composition-based stats. Identities = 28/112 (25%), Positives = 49/112 (43%), Gaps = 5/112 (4%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRR--SALY 73 ++ G E +A +L G I N R GEID+I + F+EV+ + Sbjct: 4 SRAKGKEAEDKACAFLRENGFEIIERNFFARYGEIDIIAQRDGILHFIEVKSASVGAKSG 63 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 ++T SK KL+ T +L+ N + + D + G +E+I++ Sbjct: 64 FEPIYNITPSKIEKLISTIGFYLSTQNLTQEYC---LDALIIKGGHIEFIEN 112 >UniRef50_D1Y396 Putative uncharacterized protein n=1 Tax=Pyramidobacter piscolens W5455 RepID=D1Y396_9BACT Length = 164 Score = 108 bits (272), Expect = 4e-23, Method: Composition-based stats. Identities = 34/120 (28%), Positives = 51/120 (42%), Gaps = 3/120 (2%) Query: 4 VPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFV 63 G E A +L+ +GL+ + NV ER E+DL+ EG+T +FV Sbjct: 31 TQAERAFFLAKERAAIGRWAEELAAGFLQAQGLKILERNVRERFSELDLVALEGKTLVFV 90 Query: 64 EVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWI 123 EVR RR A ++ K +LL+ A L+ R + R D+V+ W Sbjct: 91 EVRCRRKNPVMSAQDTIGPLKWRRLLRGAELYTLRRQWRGE---WRLDLVSVDVGHERWH 147 >UniRef50_C1F7M4 Putative uncharacterized protein n=1 Tax=Acidobacterium capsulatum ATCC 51196 RepID=C1F7M4_ACIC5 Length = 157 Score = 108 bits (272), Expect = 4e-23, Method: Composition-based stats. Identities = 34/128 (26%), Positives = 50/128 (39%), Gaps = 9/128 (7%) Query: 8 SGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERG--GEIDLIMREGRTTIFVEV 65 + SP + TG E A +L G +A G++DLI EG +EV Sbjct: 27 AASPEEPAHLTTGRRGELAAYGFLRRNGYTIVARGWRSHICPGDLDLIAWEGEHLCVIEV 86 Query: 66 RYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-----EV 120 + R + A A+V K+ L AR +L RFDVV+ + E Sbjct: 87 KARTTRDVATAEAAVDHQKRRTLRMLARRYLRLAGIP--QSAARFDVVSVYFDSGHAPEF 144 Query: 121 EWIKDAFN 128 ++AF Sbjct: 145 TLYRNAFG 152 >UniRef50_Q6NGK0 UPF0102 protein DIP1513 n=1 Tax=Corynebacterium diphtheriae RepID=Y1513_CORDI Length = 122 Score = 108 bits (271), Expect = 5e-23, Method: Composition-based stats. Identities = 32/106 (30%), Positives = 45/106 (42%), Gaps = 6/106 (5%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREG-RTTIFVEVRYRRSALY 73 E + +G A NV+ GEID+I +F+EV+ R S+ Sbjct: 5 HNHYLAVLGEDFVAQQYANEGYDITARNVSFSVGEIDIIATSPQGEVVFIEVKTRSSS-L 63 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE 119 AA +VT +K K+ + A WL D RFDVVA +E Sbjct: 64 MDAAEAVTPTKMRKIHRAASKWLQGKPF----ADIRFDVVAVHVDE 105 >UniRef50_A7HZ51 UPF0102 protein Plav_3586 n=1 Tax=Parvibaculum lavamentivorans DS-1 RepID=Y3586_PARL1 Length = 133 Score = 108 bits (271), Expect = 5e-23, Method: Composition-based stats. Identities = 41/130 (31%), Positives = 57/130 (43%), Gaps = 5/130 (3%) Query: 2 ATVPTRSGSPR-QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTT 60 A R G+P L + G E A L KG R +A + GEIDL++R GR Sbjct: 7 APRAARKGNPATGLAAYRLGLRAETLAVLLLRLKGYRVVARRLKTPAGEIDLVVRRGRAL 66 Query: 61 IFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEV 120 VEV+ R AA ++ +Q +L + A L R+ F +D RFDVV Sbjct: 67 AVVEVKARGEGD--AAAEALLPRQQRRLERAAAHLLGRYP-HFADLDLRFDVVLIVPRRW 123 Query: 121 -EWIKDAFND 129 + DA+ Sbjct: 124 PRHLADAWRP 133 >UniRef50_B5JM98 Putative uncharacterized protein n=1 Tax=Verrucomicrobiae bacterium DG1235 RepID=B5JM98_9BACT Length = 141 Score = 108 bits (271), Expect = 6e-23, Method: Composition-based stats. Identities = 33/114 (28%), Positives = 48/114 (42%), Gaps = 5/114 (4%) Query: 6 TRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEV 65 R+ P G E +A + L+ KG + +A N EIDLI G+ +FVEV Sbjct: 10 GRASEPESAA---IGRRGEREAEKLLKRKGYQILARNWRSGRDEIDLICLHGKAVVFVEV 66 Query: 66 RYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE 119 R R+ S+ R K+ L + R + + RFDVV +E Sbjct: 67 RTRKVGALVSGYDSIDRRKREALRRVCRSYFGMMKPK--PITLRFDVVEIEHDE 118 >UniRef50_D0WR56 Putative endonuclease n=1 Tax=Actinomyces sp. oral taxon 848 str. F0332 RepID=D0WR56_9ACTO Length = 138 Score = 107 bits (269), Expect = 8e-23, Method: Composition-based stats. Identities = 32/122 (26%), Positives = 57/122 (46%), Gaps = 9/122 (7%) Query: 11 PRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLI--MREGRTTIFVEVRYR 68 P L ++ G E A R+L+ G +A N R GE+DL+ + R + VEV+ R Sbjct: 17 PEPLGNQELGKWGEELAARYLQAYGYVVLARNWRRRAGELDLVTACPQRRAVVAVEVKTR 76 Query: 69 RSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-----VEWI 123 + + G+ +++R+K +L + +WL + V D+VA T + + Sbjct: 77 NAEVSVGSVEAISRAKLARLRKLTGMWLQETGTRCERVCL--DLVAITVENDGSWLIRHL 134 Query: 124 KD 125 +D Sbjct: 135 RD 136 >UniRef50_A8YJ85 Similar to Y189_SYNY3 UPF0102 protein sll0189 n=2 Tax=Microcystis aeruginosa RepID=A8YJ85_MICAE Length = 139 Score = 107 bits (269), Expect = 9e-23, Method: Composition-based stats. Identities = 25/103 (24%), Positives = 47/103 (45%), Gaps = 4/103 (3%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIM--REGRTTIFVEVRYRRSALY 73 G+ E WL+ + + GGEIDLI+ + FVEV+ R + + Sbjct: 1 MTTVGELGENLVADWLQLQQWHILQRRWRSGGGEIDLIVLSKSQAILAFVEVKTRSAGNW 60 Query: 74 G-GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF 115 G ++ K+ ++ + A+++L+ + + + CRFDV Sbjct: 61 DLGGKLAIDDRKKGRIYEAAQIFLSFYP-QWSDLTCRFDVALV 102 >UniRef50_B6JM54 UPF0102 protein HPP12_0830 n=15 Tax=Epsilonproteobacteria RepID=Y830_HELP2 Length = 114 Score = 107 bits (269), Expect = 9e-23, Method: Composition-based stats. Identities = 24/111 (21%), Positives = 51/111 (45%), Gaps = 6/111 (5%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 ++ G E +A +L+ G + N + GEID+I + F+EV+ + Sbjct: 7 KHREKGLKAEEEACGFLKSLGFEMVERNFFSQFGEIDIIALKKGVLHFIEVKSGEN---F 63 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 ++T SK K+++T R +L++ + + D D + + E +++ Sbjct: 64 DPIYAITPSKLKKMIKTIRCYLSQKDPNSDFC---IDALIVKNGKFELLEN 111 >UniRef50_D2L5G2 Putative uncharacterized protein n=1 Tax=Desulfovibrio sp. FW1012B RepID=D2L5G2_9DELT Length = 134 Score = 107 bits (269), Expect = 9e-23, Method: Composition-based stats. Identities = 39/118 (33%), Positives = 57/118 (48%), Gaps = 7/118 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYR-RSALYG 74 G EA A L G G R N RGGE+DLI +G T +FVEV+ R +L Sbjct: 5 HLLLGREGEAVAEALLVGAGFRVEVRNYRTRGGEVDLICLDGDTVVFVEVKARGPGSLLD 64 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE----VEWIKDAFN 128 +VT +K+ ++ + A +L+ ++ CRFDVVA + + + DAF Sbjct: 65 RPEEAVTPAKRGRIARAAAAFLSER--AWWDRPCRFDVVAVSVHGGRRTATHLPDAFG 120 >UniRef50_D1B6E0 Putative uncharacterized protein n=1 Tax=Thermanaerovibrio acidaminovorans DSM 6589 RepID=D1B6E0_THEAS Length = 118 Score = 107 bits (268), Expect = 1e-22, Method: Composition-based stats. Identities = 35/117 (29%), Positives = 49/117 (41%), Gaps = 8/117 (6%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + G E A R L G R + NV GEID++ +G +FVEVR R Sbjct: 2 EARNLARGALGEEMAVRHLIRMGWRILGRNVRYPFGEIDIVAHDGTELVFVEVRLR-GPG 60 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT----GNEVEWIKD 125 AA +V +K KL++ R + R D++A G VE I+D Sbjct: 61 PQRAAETVGPAKLRKLIRACRAFAESRG---YDGPFRIDLLAIDQGPCGYRVELIRD 114 >UniRef50_B9L042 Putative uncharacterized protein n=2 Tax=Thermomicrobia (class) RepID=B9L042_THERP Length = 125 Score = 107 bits (268), Expect = 1e-22, Method: Composition-based stats. Identities = 36/106 (33%), Positives = 54/106 (50%), Gaps = 1/106 (0%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + + G+A E A RWLE G +A N R GE+D++ +G + VEV+ RR A Sbjct: 1 MGRQCLGEAGERAAARWLEEAGWHVLARNWRCRQGELDIVALDGDVLVAVEVKVRRDAGN 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE 119 A +VT K +LL +LA H + + CR D++A T + Sbjct: 61 EPAEWAVTPRKGRRLLAALSAFLAAHPEHQERL-CRVDLIAVTVDR 105 >UniRef50_C7NGY4 Predicted endonuclease related to Holliday junction resolvase n=1 Tax=Kytococcus sedentarius DSM 20547 RepID=C7NGY4_KYTSD Length = 120 Score = 107 bits (268), Expect = 1e-22, Method: Composition-based stats. Identities = 30/119 (25%), Positives = 49/119 (41%), Gaps = 7/119 (5%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + G E A RW + +G R + N R GEIDL++ G + EV+ R + Sbjct: 2 TRERRTLGRRGEDIAARWWQERGARVLERNWRHRLGEIDLVVTSGPRLVVCEVKTRSTVA 61 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT-----GNEVEWIKDA 126 +G V ++ +L + WL H G + + R DV+ V+ + A Sbjct: 62 FGQPVEMVALPQRRRLRRLTAAWLQEHPGRWA--EVRIDVIGVLLPPGGPATVQHVPGA 118 >UniRef50_C3JBE2 Putative uncharacterized protein n=2 Tax=Bacteria RepID=C3JBE2_9PORP Length = 127 Score = 107 bits (267), Expect = 1e-22, Method: Composition-based stats. Identities = 28/123 (22%), Positives = 49/123 (39%), Gaps = 10/123 (8%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNE-RGGEIDLIMREGRTTIFVEVRYRRSAL 72 G E A R+LE + + N + E+D+ + R I +EV+ R Sbjct: 2 AQHNDLGVLGERAAYRYLEQLKYKILDTNWSIDGKKEVDIFATDERELIVIEVKTRNEDY 61 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN------EVEWIKDA 126 ++VTR KQ ++ ++ + RFDV+ + ++E+ KDA Sbjct: 62 SVSPLSAVTRRKQANIISLTNAYIRLKGITL---PIRFDVLTAVFHPFDQSFDIEYYKDA 118 Query: 127 FND 129 F Sbjct: 119 FRA 121 >UniRef50_A3JND8 Putative uncharacterized protein n=1 Tax=Rhodobacterales bacterium HTCC2150 RepID=A3JND8_9RHOB Length = 117 Score = 107 bits (267), Expect = 2e-22, Method: Composition-based stats. Identities = 28/117 (23%), Positives = 52/117 (44%), Gaps = 4/117 (3%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 T+ G + E R + +G + GEID+I RE IF+EV+ +S Sbjct: 3 GKTSYLAGLSAEEAVERHCKRRGKTILHRRWRGSVGEIDIIAREQDQVIFIEVK--KSKS 60 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG-NEVEWIKDAFN 128 + A + ++ ++Q ++ T +LA + RFDV +++ I++A Sbjct: 61 FYDAISHLSVAQQQRIYATGSEYLA-NEELGQNTPVRFDVALVDSMGQIKVIENAIG 116 >UniRef50_A6FT82 PII uridylyl-transferase n=5 Tax=Rhodobacterales RepID=A6FT82_9RHOB Length = 175 Score = 106 bits (265), Expect = 2e-22, Method: Composition-based stats. Identities = 32/129 (24%), Positives = 53/129 (41%), Gaps = 4/129 (3%) Query: 2 ATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTI 61 A + R L + G A E + +G+ + RGGEIDLI+R+G + Sbjct: 49 AGPAKTARRDRGLRSWLAGAAAEKIVALAYDKRGIDLLETRWRGRGGEIDLILRDGSEIV 108 Query: 62 FVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG-NEV 120 F EV+ RS A + ++ ++ A +L R RFD+ G Sbjct: 109 FCEVKAARSTQ--EAIQRLRPAQMRRIHAAASEYLGRVP-EGQLAQVRFDLAVVDGTGRA 165 Query: 121 EWIKDAFND 129 + +++AF Sbjct: 166 DILENAFGH 174 >UniRef50_C3XN83 Putative uncharacterized protein n=3 Tax=Helicobacter RepID=C3XN83_9HELI Length = 122 Score = 105 bits (264), Expect = 3e-22, Method: Composition-based stats. Identities = 26/95 (27%), Positives = 43/95 (45%), Gaps = 3/95 (3%) Query: 12 RQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSA 71 T Q G E A +LE +G A N N R GEID+I ++ FVEV+ Sbjct: 5 NTANTTQKGKEAEDFACAFLENEGYSIEARNFNTRFGEIDIIAKKDGILHFVEVKSGIG- 63 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTV 106 ++T +K K+++T ++L ++ + Sbjct: 64 --FEPIYNITPTKVQKIIKTIEIYLKEYHLNLPYC 96 >UniRef50_Q5SLC1 UPF0102 protein TTHA0372 n=5 Tax=Thermaceae RepID=Y372_THET8 Length = 112 Score = 105 bits (263), Expect = 4e-22, Method: Composition-based stats. Identities = 36/109 (33%), Positives = 52/109 (47%), Gaps = 9/109 (8%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAA 78 G E +A R+L GKG R + N GE+DL M + + VEV+ R SA +G Sbjct: 5 RGRWAEEEALRFLLGKGYRLLWRNRRTPFGEVDLFMEKDGVYVVVEVKQRASARFGAPLE 64 Query: 79 SVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN----EVEWI 123 ++T K +LLQ+AR L R D + R + V G +E + Sbjct: 65 AITPGKVRRLLQSARFLLGR-----DDLPVRLEAVLVHGTPKDFRLEHL 108 >UniRef50_Q2RJT6 UPF0102 protein Moth_0988 n=2 Tax=Clostridia RepID=Y988_MOOTA Length = 120 Score = 105 bits (263), Expect = 5e-22, Method: Composition-based stats. Identities = 39/121 (32%), Positives = 54/121 (44%), Gaps = 8/121 (6%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 +T ++ G EA A L G R + N GEID++ +G +F+EVR R S Sbjct: 2 TMTRRRRGQIGEAAAAALLADSGYRILERNYRCPLGEIDIVAAQGEEIVFIEVRTRSSQT 61 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE------VEWIKDA 126 +G SV K+ +L + A + CRFDVVA + VE IK A Sbjct: 62 FGTPQESVDGRKRLRLRRLAAY--YLGSRGLAGRSCRFDVVAVWLDRQERVAGVEVIKGA 119 Query: 127 F 127 F Sbjct: 120 F 120 >UniRef50_C3PH56 Putative uncharacterized protein n=1 Tax=Corynebacterium aurimucosum ATCC 700975 RepID=C3PH56_CORA7 Length = 145 Score = 104 bits (261), Expect = 8e-22, Method: Composition-based stats. Identities = 35/103 (33%), Positives = 53/103 (51%), Gaps = 3/103 (2%) Query: 18 QTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE-GRTTIFVEVRYRRSALYGGA 76 G A E A + +G + IAANV+ R GE+DL++RE T +F EV+ R + +G A Sbjct: 23 ALGKAGEKFAADFYRARGAQVIAANVHYRVGELDLVVRESDGTIVFCEVKTRATRNFGVA 82 Query: 77 AASVTRSKQHKLLQTARLWLA-RHNGSFDTVDCRFDVVAFTGN 118 +VT K +L + A WL+ + + RFDV+ Sbjct: 83 -EAVTPRKLKRLRKAAAQWLSTARSENQALSKVRFDVLGLVAT 124 >UniRef50_C4DPS8 Predicted endonuclease related to Holliday junction resolvase n=1 Tax=Stackebrandtia nassauensis DSM 44728 RepID=C4DPS8_9ACTO Length = 207 Score = 104 bits (260), Expect = 9e-22, Method: Composition-based stats. Identities = 25/98 (25%), Positives = 41/98 (41%) Query: 11 PRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRS 70 P + G E A L G+R + N GE+D+I E T+F EV+ RRS Sbjct: 2 PHDRRHLRLGCFGENLAVAHLRRDGMRVLQRNWRCEHGELDIIAIERGVTVFCEVKTRRS 61 Query: 71 ALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDC 108 +G ++ +K ++ + A W R+ + Sbjct: 62 LRFGTPMQAIDEAKALRIRRLAASWHRRYRDKPPWAEW 99 >UniRef50_O66457 UPF0102 protein aq_041 n=1 Tax=Aquifex aeolicus RepID=Y041_AQUAE Length = 103 Score = 104 bits (260), Expect = 1e-21, Method: Composition-based stats. Identities = 29/107 (27%), Positives = 45/107 (42%), Gaps = 10/107 (9%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAA 78 G +E A R+L+ KG + + N+ GEID++ + VEV+ + A Sbjct: 2 KGREYEDLAARYLKSKGYQILGRNLRSPYGEIDILAEFEGRKVIVEVKGSETFF---PAE 58 Query: 79 SVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 VT K K+++TA L S + VV +V KD Sbjct: 59 KVTPHKLSKIIRTAYEVLGEEPFSIE-------VVVVYRGKVYHYKD 98 >UniRef50_Q2KDE4 UPF0102 protein RHE_CH00320 n=4 Tax=Rhizobiales RepID=Y320_RHIEC Length = 122 Score = 103 bits (258), Expect = 2e-21, Method: Composition-based stats. Identities = 36/116 (31%), Positives = 54/116 (46%), Gaps = 4/116 (3%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + + G E A +L KG R +A R GEID++ R+G TIFVEV+ R Sbjct: 10 KRKALRRGRMSEYVAAAFLMLKGYRILALRHRTRLGEIDIVARKGDLTIFVEVKARH--G 67 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEV-EWIKDAF 127 A +V+ + Q ++ + LWLAR R+D++A + DAF Sbjct: 68 EAAAIDAVSVAAQKRIRAASDLWLARQADQARLSQ-RYDIIAVMPGRLPRHFPDAF 122 >UniRef50_A9BFT1 UPF0102 protein Pmob_0702 n=1 Tax=Petrotoga mobilis SJ95 RepID=Y702_PETMO Length = 112 Score = 103 bits (257), Expect = 2e-21, Method: Composition-based stats. Identities = 26/105 (24%), Positives = 49/105 (46%), Gaps = 2/105 (1%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 G +E +A + + R IA N + R GEID+I + + +EV+ + +G Sbjct: 1 MNTKGKVYEDKAVSFFLNRDYRIIARNFSYRHGEIDIIALKNKILHLIEVKGGKET-FGD 59 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEV 120 A V K K+++ ++A H + + + DV++ T + V Sbjct: 60 PAFRVNSRKLKKIMKVGNYFIATHP-KLEFDEIQIDVISVTNDGV 103 >UniRef50_B2GFY9 Putative uncharacterized protein n=1 Tax=Kocuria rhizophila DC2201 RepID=B2GFY9_KOCRD Length = 144 Score = 103 bits (257), Expect = 2e-21, Method: Composition-based stats. Identities = 28/123 (22%), Positives = 38/123 (30%), Gaps = 9/123 (7%) Query: 3 TVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVN------ERGGEIDLIMRE 56 + + +P T G A E L G N GE+D++ Sbjct: 10 SRSSAVPTPDAPTALDVGRAGEDLIADLLARSGWSVRDRNWRPAPGPGRPRGELDIVAER 69 Query: 57 GRTTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT 116 G EV+ R A +G +V K +L AR W H DVVA Sbjct: 70 GGVVTVFEVKTRSGADFGHPCEAVGAEKLRRLHVLARAWAREHRDPRVPT---VDVVAVH 126 Query: 117 GNE 119 Sbjct: 127 WPR 129 >UniRef50_B2IFF3 Putative uncharacterized protein n=1 Tax=Beijerinckia indica subsp. indica ATCC 9039 RepID=B2IFF3_BEII9 Length = 125 Score = 102 bits (255), Expect = 4e-21, Method: Composition-based stats. Identities = 32/121 (26%), Positives = 50/121 (41%), Gaps = 4/121 (3%) Query: 7 RSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVR 66 +S + + G E+ A WL + R + +GGEID++ G T F+EV+ Sbjct: 2 KSRKEARRRAHRFGLWAESLAILWLRMRFYRILDRRFFVKGGEIDIVAHRGDTIAFIEVK 61 Query: 67 YRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEV-EWIKD 125 R + A ++ K+ +L AR WLA H + V R D + I Sbjct: 62 ARPT--LDEALLAIDAVKRRRLSLAARYWLAAHPWAASHV-LRGDALCIAPWCWPRHIPA 118 Query: 126 A 126 A Sbjct: 119 A 119 >UniRef50_B0P7L1 Putative uncharacterized protein n=1 Tax=Anaerotruncus colihominis DSM 17241 RepID=B0P7L1_9FIRM Length = 132 Score = 102 bits (255), Expect = 4e-21, Method: Composition-based stats. Identities = 34/124 (27%), Positives = 52/124 (41%), Gaps = 9/124 (7%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 ++ + G A EA A LE +G R + N EIDLI + G FVEV+ R Sbjct: 1 MSARTYGAAGEAFAASALEAEGYRILERNWRSGRSEIDLIAQRGDIIAFVEVKTRGEHAL 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNG-SFDTVDCRFDVVAFTGN--------EVEWIK 124 AA VTR+++ ++ A +L + V RFDV + Sbjct: 61 AAPAAFVTRAQRRRIALAAVEYLRARGIYNTGAVQPRFDVFEIVTGGPDGACVTRFSHLV 120 Query: 125 DAFN 128 +A++ Sbjct: 121 NAYD 124 >UniRef50_C0XSA5 Endonuclease n=1 Tax=Corynebacterium lipophiloflavum DSM 44291 RepID=C0XSA5_9CORY Length = 124 Score = 102 bits (254), Expect = 5e-21, Method: Composition-based stats. Identities = 35/122 (28%), Positives = 48/122 (39%), Gaps = 10/122 (8%) Query: 9 GSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREG-RTTIFVEVRY 67 A E A G + V + GEIDLI+RE T +FVEV+ Sbjct: 2 AQSNYAENHALALAGEKLAASTYSEMGYAIVGTRVRTKVGEIDLIVREETGTVVFVEVKT 61 Query: 68 RRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN----EVEWI 123 RR +G AA +VT K + + A WLA + RFDV ++ Sbjct: 62 RRGRGFG-AAETVTAKKLRTMRRCAAEWLAGN----AYAPVRFDVAEVIVTGETMDIRLF 116 Query: 124 KD 125 +D Sbjct: 117 ED 118 >UniRef50_C7JH91 Putative uncharacterized protein n=8 Tax=Acetobacter pasteurianus RepID=C7JH91_ACEP3 Length = 149 Score = 102 bits (254), Expect = 5e-21, Method: Composition-based stats. Identities = 33/115 (28%), Positives = 50/115 (43%), Gaps = 4/115 (3%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 G A E QA WLE G + GEID++ + FVEV+ RRS Sbjct: 37 AYTQGVAAEQQACNWLEQDGWTVLLRRARTHRGEIDIVASKAVVLCFVEVKKRRS--IEE 94 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF-TGNEVEWIKDAFND 129 A S+ ++Q +L + A L +H + + RFD+ F +E ++D Sbjct: 95 ALVSLQPAQQRRLFRAAECLLQKHPY-WQYEEMRFDLFVFDDAGRMERLEDVIRQ 148 >UniRef50_A3K994 Putative uncharacterized protein n=7 Tax=Rhodobacterales RepID=A3K994_9RHOB Length = 159 Score = 101 bits (253), Expect = 7e-21, Method: Composition-based stats. Identities = 31/131 (23%), Positives = 59/131 (45%), Gaps = 7/131 (5%) Query: 3 TVPTRSGSPRQLTTK---QTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRT 59 P + ++ + + G + E + + E +G +GGEIDLI+R+G Sbjct: 31 PEPDAARRAKKDAGRIGYEAGASAELRVAQDYERRGFPLARRRWRGQGGEIDLIVRDGDG 90 Query: 60 TIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG-N 118 IFVEV+ +S + AA ++R + ++++ A +L D RFD+ Sbjct: 91 LIFVEVK--KSRSFRHAAERLSRRQMNRIISAAEEFLGTQPL-GSLTDVRFDLAMVDVYG 147 Query: 119 EVEWIKDAFND 129 ++ I++A Sbjct: 148 QIRVIENAIGH 158 >UniRef50_A4EVA8 Putative uncharacterized protein n=1 Tax=Roseobacter sp. SK209-2-6 RepID=A4EVA8_9RHOB Length = 136 Score = 101 bits (253), Expect = 7e-21, Method: Composition-based stats. Identities = 35/120 (29%), Positives = 60/120 (50%), Gaps = 4/120 (3%) Query: 7 RSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVR 66 + G R L + +G + E QA R E G + GEIDLI+R+G T +F EV+ Sbjct: 15 KRGQNRGLRSHLSGLSAEHQAARAYEALGFEVVEERWRGEAGEIDLILRQGATWVFAEVK 74 Query: 67 YRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG-NEVEWIKD 125 +S + AA +++ + ++ Q+A L+L R + + R D V +VE +++ Sbjct: 75 --KSTDFETAATRISQKQVQRIRQSATLYLDRFP-NEQVEEVRLDAVLIDAEGQVEILEN 131 >UniRef50_A3UIK6 Putative uncharacterized protein n=1 Tax=Oceanicaulis alexandrii HTCC2633 RepID=A3UIK6_9RHOB Length = 128 Score = 101 bits (252), Expect = 8e-21, Method: Composition-based stats. Identities = 35/115 (30%), Positives = 57/115 (49%), Gaps = 4/115 (3%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 ++ G EA + WL KG R + GE+DL+ R G F+EV++R + Sbjct: 7 QAERRGRRTEAISALWLRLKGWRILDERARTGVGELDLVARRGGVLAFIEVKHRPTVD-- 64 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEV-EWIKDAFN 128 A ++T +Q +L++ A LW +RH D + RFDV+ + I+ AF+ Sbjct: 65 AARLAITPRQQMRLIRAASLWRSRH-AGIDHLQPRFDVMLWPAQGWPRHIQGAFS 118 >UniRef50_A1B931 Putative uncharacterized protein n=1 Tax=Paracoccus denitrificans PD1222 RepID=A1B931_PARDP Length = 137 Score = 101 bits (252), Expect = 8e-21, Method: Composition-based stats. Identities = 31/114 (27%), Positives = 44/114 (38%), Gaps = 4/114 (3%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 G E A R +G +A RGGEIDLI+ FVEV+ +S + Sbjct: 25 AYSAGRLAEESAAREYRRRGYEVMAERWRGRGGEIDLILCRDDEYTFVEVK--KSRFHDR 82 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT-GNEVEWIKDAFN 128 AA + + ++ A + R RFD VE I++AF Sbjct: 83 AAERIGARQIARICNAALEYCGRLPAGL-LTAMRFDAALVDQFGRVEIIENAFG 135 >UniRef50_Q0C451 UPF0102 protein HNE_0764 n=1 Tax=Hyphomonas neptunium ATCC 15444 RepID=Y764_HYPNA Length = 122 Score = 101 bits (252), Expect = 8e-21, Method: Composition-based stats. Identities = 39/121 (32%), Positives = 59/121 (48%), Gaps = 4/121 (3%) Query: 10 SPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRR 69 ++ + G E +A WL KG +AA V GGEIDLI R+GR FVEV+ R Sbjct: 2 PAKRQIAEARGRQAERRAALWLRLKGCSVLAARVKLPGGEIDLIARKGRLIAFVEVKAR- 60 Query: 70 SALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVE-WIKDAFN 128 A A +V+ H++ + A +W+ H F R+D++A + +KDA+ Sbjct: 61 -ARRDDALGAVSVQSWHRIARAAEVWMG-HRPKFAGYGWRYDLIALAPGSLPYHLKDAWR 118 Query: 129 D 129 Sbjct: 119 P 119 >UniRef50_A6Q6T2 UPF0102 protein SUN_0231 n=3 Tax=Epsilonproteobacteria RepID=Y231_SULNB Length = 113 Score = 101 bits (252), Expect = 9e-21, Method: Composition-based stats. Identities = 31/111 (27%), Positives = 49/111 (44%), Gaps = 6/111 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERG-GEIDLIMREGRTTIFVEVRYRRSALYG 74 K GD E A +LE +G I N R GEID+I ++ F+EV+ Sbjct: 5 PKIFGDKSEDLATLFLEQEGFIVIERNYFARKLGEIDIIAQKDEVLHFIEVK--SGKADF 62 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 +VT K K++ +A ++ V D + G+EVE+I++ Sbjct: 63 DPVYNVTPDKLRKVINSAHYYMKSKKI---DVSFSVDALIIRGDEVEFIEN 110 >UniRef50_Q04SX0 UPF0102 protein LBJ_1427 n=4 Tax=Leptospira RepID=Y1427_LEPBJ Length = 116 Score = 100 bits (251), Expect = 1e-20, Method: Composition-based stats. Identities = 25/113 (22%), Positives = 45/113 (39%), Gaps = 2/113 (1%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 K GD E+ A +L G + N EID+I + F EV++ + Sbjct: 5 KKIKGDEGESIASDFLISIGHEILKRNYRFLYCEIDIISIKEEVLYFSEVKFWKEFESFD 64 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-VEWIKDAF 127 + +KQ ++ + A +L+ + S F +V+ + E+ D F Sbjct: 65 PRFTFNFAKQTRMRKAASGFLSEN-LSLQNHFVSFCLVSINEKKGCEYYPDLF 116 >UniRef50_A9IXC8 UPF0102 protein BT_1882 n=8 Tax=Rhizobiales RepID=Y1882_BART1 Length = 130 Score = 100 bits (251), Expect = 1e-20, Method: Composition-based stats. Identities = 33/123 (26%), Positives = 51/123 (41%), Gaps = 4/123 (3%) Query: 9 GSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYR 68 ++ + G E A WL KG + GEIDLI R G + VEV+ R Sbjct: 10 PKKQRQKSFYRGVRAEKLAAWWLRFKGFHIAEMRFKTKCGEIDLIARRGNLVLIVEVKAR 69 Query: 69 RSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEV-EWIKDAF 127 + A +V+R + ++ A +WLAR + + RFD++A + I F Sbjct: 70 ST--LLEAMEAVSRMNEKRIEAAADIWLARQK-DYALLSVRFDLIAILPWRWPKHIPAFF 126 Query: 128 NDH 130 Sbjct: 127 TSD 129 >UniRef50_C9M9E0 Putative uncharacterized protein n=1 Tax=Jonquetella anthropi E3_33 E1 RepID=C9M9E0_9BACT Length = 155 Score = 100 bits (250), Expect = 1e-20, Method: Composition-based stats. Identities = 27/117 (23%), Positives = 41/117 (35%), Gaps = 7/117 (5%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + G E A L +G NV E+DLI VEVR R+ + Sbjct: 32 AQDSLAVGRWAEDLAADLLAEEGYSVCGRNVRVGPCELDLIGFIDGCLTAVEVRCRQKSR 91 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG----NEVEWIKD 125 +V K + L++ R + ++ + R D+ A T W KD Sbjct: 92 LQSPEETVGPRKWNALVRGIRGYASQTGWNG---PMRIDLFAVTVCGRRWSARWYKD 145 >UniRef50_B3T5F3 Putative uncharacterized protein family UPF0102 n=1 Tax=uncultured marine microorganism HF4000_ANIW141I9 RepID=B3T5F3_9ZZZZ Length = 172 Score = 100 bits (250), Expect = 1e-20, Method: Composition-based stats. Identities = 26/117 (22%), Positives = 40/117 (34%), Gaps = 8/117 (6%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNE-RGGEIDLIMREGRTTIFVEVRYRRSAL 72 ++ G E A KG + A N GEID+I + +F EV+ Sbjct: 55 QKKRKIGQWGERLAALEYYRKGYKVHALNYYCAPFGEIDIIAEKENELVFAEVKTAAGKT 114 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE----VEWIKD 125 GG V K +L ++ + D R DV A + ++ K Sbjct: 115 LGGVEGQVDEVKLQRLSNAIDKYIMDNEIQ---NDIRLDVFAIILGKNGPALKHFKG 168 >UniRef50_Q31RH5 UPF0102 protein Synpcc7942_0312 n=2 Tax=Synechococcus elongatus RepID=Y312_SYNE7 Length = 142 Score = 100 bits (250), Expect = 1e-20, Method: Composition-based stats. Identities = 29/97 (29%), Positives = 43/97 (44%), Gaps = 2/97 (2%) Query: 21 DAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG-GAAAS 79 A EA W + L +A + R GE+DL+ +E F+EV+ RR + + Sbjct: 10 RAGEALVAAWCRDRRLEVLAERWHCRWGELDLVTQEDSALRFIEVKTRRQTGWDQSGLLA 69 Query: 80 VTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT 116 + +KQ L + A +LA V CRFDV Sbjct: 70 IGPAKQRCLSRAAACYLASLGNQAA-VACRFDVALVR 105 >UniRef50_Q7U7D4 UPF0102 protein SYNW1051 n=3 Tax=Synechococcus RepID=Y1051_SYNPX Length = 134 Score = 100 bits (250), Expect = 1e-20, Method: Composition-based stats. Identities = 25/94 (26%), Positives = 45/94 (47%), Gaps = 2/94 (2%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + + G E + L+ +G + + N + R GE+DL++ + + VEV+ RRS Sbjct: 17 PMKMQPPGAQAETRVSSLLQRQGWQLLDRNWSCRWGELDLVLHKNEQLLVVEVKKRRSLA 76 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTV 106 +G SV +K+ +L + W A H D + Sbjct: 77 WGP--WSVDPTKRRRLGRAISCWRAEHPIQTDWL 108 >UniRef50_C5BWW3 UPF0102 protein Bcav_2532 n=21 Tax=Actinomycetales RepID=Y2532_BEUC1 Length = 118 Score = 100 bits (249), Expect = 2e-20, Method: Composition-based stats. Identities = 41/113 (36%), Positives = 56/113 (49%), Gaps = 7/113 (6%) Query: 18 QTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAA 77 G E A RWLE +GL + N GE+DL+ R+G T +FVEV+ R S +G Sbjct: 6 AIGAYGERVAGRWLEAEGLEVVERNWRCPDGELDLVARDGETLVFVEVKTRSSLAFGHPG 65 Query: 78 ASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-----EVEWIKD 125 +VTR K +L + A WLA H+ + R DVVA VE ++ Sbjct: 66 EAVTRLKLARLRRLAARWLAEHDAHA--REVRIDVVAVLRTRAGAARVEHLRG 116 >UniRef50_D1NDV3 Fimbrial usher protein (Fragment) n=1 Tax=Haemophilus influenzae HK1212 RepID=D1NDV3_HAEIN Length = 323 Score = 100 bits (249), Expect = 2e-20, Method: Composition-based stats. Identities = 42/80 (52%), Positives = 51/80 (63%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 +Q G ++E QAR +LE KGL FIAAN N + GE+DLIM + T +FVEVR R + YG Sbjct: 167 KRQQGASFEHQARLFLESKGLTFIAANQNFKCGELDLIMNDKETIVFVEVRQRSHSAYGS 226 Query: 76 AAASVTRSKQHKLLQTARLW 95 A SV KQ K L A LW Sbjct: 227 AIESVDWRKQQKWLDAANLW 246 >UniRef50_A4QF37 UPF0102 protein cgR_1859 n=4 Tax=Corynebacterium RepID=Y1859_CORGB Length = 122 Score = 99.7 bits (248), Expect = 2e-20, Method: Composition-based stats. Identities = 31/107 (28%), Positives = 46/107 (42%), Gaps = 6/107 (5%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREG-RTTIFVEVRYRRSA 71 + + G E A + + NV GE+DLI+R +FVEV+ RR + Sbjct: 2 KTQKQYLGAFGEDVALQQYLDDQATLLDRNVRYSCGELDLIVRLASGVVVFVEVKTRRGS 61 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN 118 + AA +V K ++ + A LWL RFDVVA + Sbjct: 62 AFDSAA-AVNNQKMLRMRRAAALWLEGKP----YTPIRFDVVAIVLD 103 >UniRef50_B5Y8F8 Putative uncharacterized protein n=1 Tax=Coprothermobacter proteolyticus DSM 5265 RepID=B5Y8F8_COPPD Length = 115 Score = 99.7 bits (248), Expect = 2e-20, Method: Composition-based stats. Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 9/102 (8%) Query: 24 EAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAASVTRS 83 E + +L + R + NV GEID++ +GRT +FVEVRYR++ AA +V Sbjct: 7 EDRVASFLVSQKYRILDQNVVFPTGEIDIVALKGRTLVFVEVRYRKN---FDAAETVDSR 63 Query: 84 KQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 K +++Q A L+ + R DV A W K Sbjct: 64 KLERIMQCAYLY------TGGEQSYRIDVFACGPQGCHWYKG 99 >UniRef50_A4ECJ4 Putative uncharacterized protein n=1 Tax=Collinsella aerofaciens ATCC 25986 RepID=A4ECJ4_9ACTN Length = 161 Score = 99.7 bits (248), Expect = 3e-20, Method: Composition-based stats. Identities = 26/128 (20%), Positives = 44/128 (34%), Gaps = 14/128 (10%) Query: 12 RQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEI--DLIMREGRTTIFVEVRYRR 69 + L+ ++ G E + +G + GE L+ + EV+ RR Sbjct: 33 KGLSPRELGMLGELITIDYFNERGYTLLEQGYRCTEGEADLVLLDELDDVVVMAEVKTRR 92 Query: 70 ----SALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-----EV 120 +V KQ + + A +L H + RFD V T E+ Sbjct: 93 VALDCNTRVFPEEAVDAQKQRRYRRIASCYLMEH---YPLKAIRFDAVGVTIRGGHIAEI 149 Query: 121 EWIKDAFN 128 E +AF+ Sbjct: 150 EHQYNAFD 157 >UniRef50_A9I0M2 UPF0102 protein Bpet0439 n=14 Tax=Proteobacteria RepID=Y439_BORPD Length = 162 Score = 99.7 bits (248), Expect = 3e-20, Method: Composition-based stats. Identities = 56/115 (48%), Positives = 72/115 (62%), Gaps = 3/115 (2%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 T++TG A E QA R L G GL +A N++ R GEIDL+MR+G T + VEVR R + YGG Sbjct: 46 TQRTGTAHEDQALRLLAGAGLVPLARNLHCRAGEIDLVMRDGATLVLVEVRARANPRYGG 105 Query: 76 AAASVTRSKQHKLLQTARLW---LARHNGSFDTVDCRFDVVAFTGNEVEWIKDAF 127 AAASV R+K+ +LL+ A L LAR + RFDVVAF +W+ AF Sbjct: 106 AAASVGRAKRARLLRCAALLLPDLARRHWGGRIPPVRFDVVAFEAGRADWLPAAF 160 >UniRef50_B0T377 UPF0102 protein Caul_0175 n=3 Tax=Caulobacteraceae RepID=Y175_CAUSK Length = 139 Score = 99.4 bits (247), Expect = 3e-20, Method: Composition-based stats. Identities = 29/129 (22%), Positives = 48/129 (37%), Gaps = 6/129 (4%) Query: 2 ATVPTRSGS--PRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRT 59 P R R + +G E A WL KG R + + GEIDL+ + Sbjct: 6 PLRPERQAQKQARGAAARLSGRRAEVLAALWLMAKGYRILGFRLATPLGEIDLLAQRRGV 65 Query: 60 TIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN- 118 VEV+ R A +VT ++ +L + +A + R D++A Sbjct: 66 LAVVEVKSR--TSLEAALEAVTYEQRSRLRRAGAH-IAANRAGLRDAVVRLDLIALAPGR 122 Query: 119 EVEWIKDAF 127 + +A+ Sbjct: 123 RPRHLLNAW 131 >UniRef50_B2S8H0 UPF0102 protein BAbS19_I01690 n=50 Tax=Rhizobiales RepID=Y1690_BRUA1 Length = 126 Score = 99.0 bits (246), Expect = 4e-20, Method: Composition-based stats. Identities = 37/109 (33%), Positives = 47/109 (43%), Gaps = 3/109 (2%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAA 78 G + E A L KG R +A R GEIDLI R G + VEV+ R A + A Sbjct: 14 RGHSAERLAAFALMLKGFRIVARRYRTRLGEIDLIARRGDLVLIVEVKAR--ASFEAAQF 71 Query: 79 SVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAF 127 +VT ++ A LWL R + RFD+VA AF Sbjct: 72 AVTPQAMRRIEAAADLWLQRQ-TDRARLSLRFDMVAVLPRRWPKHVPAF 119 >UniRef50_Q0AK98 UPF0102 protein Mmar10_3014 n=1 Tax=Maricaulis maris MCS10 RepID=Y3014_MARMM Length = 127 Score = 98.6 bits (245), Expect = 6e-20, Method: Composition-based stats. Identities = 29/119 (24%), Positives = 51/119 (42%), Gaps = 4/119 (3%) Query: 10 SPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRR 69 + + + G E A WL KG R + GEIDL+ R G +F+EV+ R Sbjct: 2 TRARRQAEARGRWAEWLAMAWLVAKGYRLLDHRARTAAGEIDLVARRGEYLVFIEVKARA 61 Query: 70 SALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEV-EWIKDAF 127 + A S+ ++ ++ + A +W A S + R+D+V + + A+ Sbjct: 62 TR--AEALDSIGPRQRGRITRAASIWRAP-RSSLHHLHLRYDLVLVVPGRWPQHRRAAW 117 >UniRef50_A8U078 Predicted endonuclease n=1 Tax=alpha proteobacterium BAL199 RepID=A8U078_9PROT Length = 151 Score = 97.8 bits (243), Expect = 9e-20, Method: Composition-based stats. Identities = 33/128 (25%), Positives = 58/128 (45%), Gaps = 4/128 (3%) Query: 3 TVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIF 62 + R+ + ++ G E A WL +G R +A V GE+DL++R G + Sbjct: 25 RPNGSAQLERRRSAERRGLRAEWLAALWLMLRGYRVLARRVRTPAGEVDLVVRRGSVVVA 84 Query: 63 VEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEV-E 121 VEV+ R + A SV+ ++H++ +LAR ++ RFD+VA + Sbjct: 85 VEVKARAT--LDAALDSVSSRQRHRVALGLESFLARRP-ELAGLNRRFDLVAVQPWRLPV 141 Query: 122 WIKDAFND 129 + D + Sbjct: 142 HLADVWRP 149 >UniRef50_Q16B02 UPF0102 protein RD1_1191 n=1 Tax=Roseobacter denitrificans OCh 114 RepID=Y1191_ROSDO Length = 129 Score = 97.0 bits (241), Expect = 1e-19, Method: Composition-based stats. Identities = 32/131 (24%), Positives = 59/131 (45%), Gaps = 5/131 (3%) Query: 1 MATVPTRSGSPRQLTT-KQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRT 59 M +P + G + EA R E G F A + GEIDL++R+ Sbjct: 1 MTQMPQTNARVHAGRMAYHAGLSAEASVIREYESHGYVFEAQRWRGQVGEIDLVLRKSGL 60 Query: 60 TIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT-GN 118 +FVEV+ +S + AA ++ +++ ++ T ++A+ D RFDV Sbjct: 61 VVFVEVK--KSKSFERAALRISPTQKRRIFATGEEFVAQEPQGL-LTDMRFDVALVDAAG 117 Query: 119 EVEWIKDAFND 129 V+ +++A ++ Sbjct: 118 AVQILENALSE 128 >UniRef50_Q2VYL8 UPF0102 protein amb4503 n=5 Tax=Alphaproteobacteria RepID=Y4503_MAGSA Length = 129 Score = 96.7 bits (240), Expect = 2e-19, Method: Composition-based stats. Identities = 32/135 (23%), Positives = 51/135 (37%), Gaps = 11/135 (8%) Query: 1 MATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERG----GEIDLIMRE 56 M + P ++ G E A WL KG +A + GE+DL+ R Sbjct: 1 MNSAPPSRA---HQAAQRRGKVAEGLAALWLRLKGYGILAKGLKSGRGSGAGEVDLVARR 57 Query: 57 GRTTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT 116 G FVEV+ R + A S+T ++ ++ + A + RFD+V Sbjct: 58 GDLVAFVEVKSRAT--LDQAIESLTPFQRQRIERAAAA-FLARRPELASCGVRFDMVLVA 114 Query: 117 GNEV-EWIKDAFNDH 130 + I DA+ Sbjct: 115 PWRLPRHIPDAWRID 129 >UniRef50_Q0FQ74 Putative uncharacterized protein n=1 Tax=Roseovarius sp. HTCC2601 RepID=Q0FQ74_9RHOB Length = 153 Score = 96.3 bits (239), Expect = 3e-19, Method: Composition-based stats. Identities = 31/124 (25%), Positives = 52/124 (41%), Gaps = 4/124 (3%) Query: 7 RSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVR 66 R R + G + EAQ + +G R R GEIDLI+ +G IFVEV+ Sbjct: 33 RRRVERGALGHRAGLSAEAQVAQDYRRRGYRVAGQRWRGRSGEIDLILHDGDGLIFVEVK 92 Query: 67 YRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-VEWIKD 125 +S + A ++ + +LL+ + A ++ RFDV + +++ Sbjct: 93 --KSRSFDHAMQHLSSRQIARLLRAGEEF-AGTQPRGSLIEMRFDVALMNEQGMIRIVEN 149 Query: 126 AFND 129 A Sbjct: 150 ALGP 153 >UniRef50_UPI00016C4BC8 hypothetical protein GobsU_17186 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4BC8 Length = 155 Score = 95.5 bits (237), Expect = 5e-19, Method: Composition-based stats. Identities = 41/122 (33%), Positives = 56/122 (45%), Gaps = 10/122 (8%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 + G E A +L G R +AANVN+R GE+DL+ +G T + VEVR SA Sbjct: 26 KRWFGRRSERAAANYLRGLRYRLLAANVNDRDGELDLLAIDGETLVIVEVRSTSSARPDA 85 Query: 76 A---AASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE------VEWIKDA 126 AASV KQ K+ + +L R + R+DV+ E V I+ A Sbjct: 86 IEQTAASVDLRKQRKITEATSRFLGRRRL-LGRIAVRYDVLVIAWPEHAREPAVRHIRHA 144 Query: 127 FN 128 F Sbjct: 145 FE 146 >UniRef50_C6QFU1 Putative uncharacterized protein n=1 Tax=Hyphomicrobium denitrificans ATCC 51888 RepID=C6QFU1_9RHIZ Length = 129 Score = 95.1 bits (236), Expect = 5e-19, Method: Composition-based stats. Identities = 32/119 (26%), Positives = 50/119 (42%), Gaps = 3/119 (2%) Query: 2 ATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTI 61 R S + ++G E G R + GEIDLI +GR Sbjct: 6 NDDQARPLSDIRRRRYRSGLNAEMVVAAVYMALGHRILGRRFKTPVGEIDLIAIKGRRVA 65 Query: 62 FVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEV 120 FVEV+ R S+ A ++T + + ++ + A LWLAR+ + + D FD+V Sbjct: 66 FVEVKRRASSE--EAEDAITLTMRRRVRRAADLWLARNP-QYQSHDVGFDLVFVLPWRF 121 >UniRef50_A3TTV9 Putative uncharacterized protein n=1 Tax=Oceanicola batsensis HTCC2597 RepID=A3TTV9_9RHOB Length = 140 Score = 95.1 bits (236), Expect = 6e-19, Method: Composition-based stats. Identities = 35/123 (28%), Positives = 57/123 (46%), Gaps = 4/123 (3%) Query: 6 TRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEV 65 + R +G A E+ R E +G R + GGEIDLI+ E +FVEV Sbjct: 17 PAARRRRGEIAHLSGLAAESAVERTYEARGARVLHRRWRGPGGEIDLILAEPDRVVFVEV 76 Query: 66 RYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-VEWIK 124 + ++A +G AA V ++ ++ ++A ++ D R DV G VE ++ Sbjct: 77 K--KAATHGAAAERVRPAQVQRIARSAMAFVDTLP-GGALTDIRLDVALVDGGGAVELLE 133 Query: 125 DAF 127 +AF Sbjct: 134 NAF 136 >UniRef50_B3CRA6 Putative uncharacterized protein n=2 Tax=Orientia tsutsugamushi RepID=B3CRA6_ORITI Length = 112 Score = 94.4 bits (234), Expect = 1e-18, Method: Composition-based stats. Identities = 28/115 (24%), Positives = 51/115 (44%), Gaps = 5/115 (4%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 +++ G E + FIA + GEID+I +G+ +F+EV+ RRS Sbjct: 2 ISSYNLGVLAEWLIIARYSVRLYSFIAHRMRNSAGEIDIICTKGQVIVFIEVKARRSNFD 61 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVE-WIKDAF 127 + K+ ++A L+L +N + D RFD+ + I++A+ Sbjct: 62 NTIC---NYQQITKIRKSAELYLY-YNRQYSNFDVRFDLAIVRPMQWPLIIENAW 112 >UniRef50_A7ZB75 UPF0102 protein Ccon26_01140 n=23 Tax=Epsilonproteobacteria RepID=Y114_CAMC1 Length = 113 Score = 94.0 bits (233), Expect = 1e-18, Method: Composition-based stats. Identities = 24/114 (21%), Positives = 48/114 (42%), Gaps = 6/114 (5%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMR-EGRTTIFVEVRYRRSA 71 L G + E +A +L G + N + + GEID+I + F+EV+ Sbjct: 2 GLKEYLFGKSSEDRACEFLRKLGFVILERNFHSKFGEIDIIALSSDKILHFIEVKATSGG 61 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 A + ++K K+L+T ++ ++ + D D++ E I++ Sbjct: 62 Y--EAEYRLNKAKYMKILKTINFYMMKNEPNRDYQ---LDLLVVKNENFELIEN 110 >UniRef50_A8ERF6 UPF0102 protein Abu_0255 n=2 Tax=Campylobacterales RepID=Y255_ARCB4 Length = 110 Score = 93.6 bits (232), Expect = 2e-18, Method: Composition-based stats. Identities = 25/111 (22%), Positives = 55/111 (49%), Gaps = 6/111 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAAN-VNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 +K+ GD E +A +LE + N ++ GEID+I + + F+EV+ + Y Sbjct: 2 SKEKGDIAEKKAISFLEKSNFEIVEKNFYAKKLGEIDIIAQRNKIYHFIEVK--SANDYE 59 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 A ++T K K+ ++ ++ ++N + DV+ +++E +++ Sbjct: 60 TAINNITSQKLSKIKRSVDFYIQKNNLNISYS---IDVIIVVDDKIELLEN 107 >UniRef50_B1ZZB1 Putative uncharacterized protein n=2 Tax=Opitutaceae RepID=B1ZZB1_OPITP Length = 141 Score = 93.2 bits (231), Expect = 2e-18, Method: Composition-based stats. Identities = 30/118 (25%), Positives = 48/118 (40%), Gaps = 12/118 (10%) Query: 18 QTGDAWEAQARRWLEG-KGLRFIAANVNERGG---EIDLIMREGRTTIFVEVRYRRSALY 73 G A E A WL+ +G R +A N E+DL+ R+ +FVEV+ R + Sbjct: 16 DAGAAGERLAAAWLQRERGFRVVARNWRNPRDRREELDLVCRDREVLVFVEVKSRAANAL 75 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE------VEWIKD 125 +V + K+ L + + +LAR + R DVV V ++ Sbjct: 76 VPGYYAVDKRKKRVLGRAIKAYLAR--LTAKPATFRLDVVEIAEGGGDAEPTVRHFEN 131 >UniRef50_A5V3S4 UPF0102 protein Swit_0572 n=4 Tax=Sphingomonadaceae RepID=Y572_SPHWW Length = 118 Score = 93.2 bits (231), Expect = 2e-18, Method: Composition-based stats. Identities = 32/119 (26%), Positives = 51/119 (42%), Gaps = 5/119 (4%) Query: 11 PRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRS 70 R+ ++ G E A WL G R + + V R GE+DLI R GRT FVEV+ R Sbjct: 2 NRRAAAERQGRTGERIAAWWLRLHGWRIVGSRVKTRRGEVDLIARRGRTLAFVEVKTR-- 59 Query: 71 ALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-EVEWIKDAFN 128 G A ++ + ++ A L R+ + R DV+ + + ++ Sbjct: 60 GDAAGLATAIDEYRLRRVAAAAEALLPRYGVGVEN--VRIDVMLVRPWRRPVHLTNVWH 116 >UniRef50_Q7V7V8 UPF0102 protein PMT_0624 n=2 Tax=Prochlorococcus marinus RepID=Y624_PROMM Length = 126 Score = 93.2 bits (231), Expect = 3e-18, Method: Composition-based stats. Identities = 23/93 (24%), Positives = 45/93 (48%), Gaps = 1/93 (1%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + + G E + R L+ +G + ++ + R GE+DL++ + + + VEV+ RRS Sbjct: 2 MDSTGMGCWGEERVLRLLQKRGWQLVSQRWSCRYGELDLVVEKQQRVLVVEVKSRRSRGL 61 Query: 74 GG-AAASVTRSKQHKLLQTARLWLARHNGSFDT 105 + + KQ +L++ WLA H + Sbjct: 62 DHWGLCAFNKGKQLRLMRAIGCWLATHPYFAEH 94 >UniRef50_C1CZ90 UPF0102 protein Deide_03080 n=3 Tax=Deinococcus RepID=Y3080_DEIDV Length = 114 Score = 93.2 bits (231), Expect = 3e-18, Method: Composition-based stats. Identities = 37/90 (41%), Positives = 50/90 (55%), Gaps = 2/90 (2%) Query: 30 WLEGKGLRFIAANVNERGGEIDLIMREG-RTTIFVEVRYRRSALYGGAAASVTRSKQHKL 88 L+G G + N RGGEIDL+ RE T +F EVR RR+ +G AA SVT K + Sbjct: 13 HLQGLGRELLQRNYRMRGGEIDLVTREPCGTLVFTEVRQRRTRRHGSAAESVTSRKLALM 72 Query: 89 LQTARLWLARHNGSFDTVDCRFDVVAFTGN 118 + A+ +L R +G D + CR +VV G Sbjct: 73 HRAAQSYLIREHGR-DDLPCRLEVVTIDGP 101 >UniRef50_C2M936 Putative uncharacterized protein n=1 Tax=Porphyromonas uenonis 60-3 RepID=C2M936_9PORP Length = 136 Score = 92.8 bits (230), Expect = 3e-18, Method: Composition-based stats. Identities = 29/120 (24%), Positives = 54/120 (45%), Gaps = 9/120 (7%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 + G A E A+R+L + +R + N + EID+I +GR + VEV+ R Sbjct: 4 ANELGAAGERAAQRYLLSRHIRLLEINWRDPLCEIDIIASDGRHLLIVEVKSRMEYTATS 63 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVV------AFTGNEVEWIKDAFND 129 +V ++K H+++ + R + R+DV+ A E+++ K F+ Sbjct: 64 PLDAVDQAKAHQMMLGGMRYAQRMRINL---PIRYDVIEALYCPASDLFEIKYHKGYFSA 120 >UniRef50_C0W187 Putative uncharacterized protein n=1 Tax=Actinomyces coleocanis DSM 15436 RepID=C0W187_9ACTO Length = 117 Score = 92.4 bits (229), Expect = 4e-18, Method: Composition-based stats. Identities = 31/108 (28%), Positives = 47/108 (43%), Gaps = 6/108 (5%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 ++ G E A +LE G + + NV +G EID+I E +FVEVR R + +G Sbjct: 4 NKQRVGKLGEDLAAEYLESLGWKILERNVTYKGAEIDIIALEDDVVVFVEVRTRTTDDWG 63 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEW 122 A S+T K L WL + + R D+V ++ Sbjct: 64 SALESLTPKKLASLRSGVVRWLLNQD---EYCKARIDMVTV---KLNH 105 >UniRef50_A7BDE4 Putative uncharacterized protein n=1 Tax=Actinomyces odontolyticus ATCC 17982 RepID=A7BDE4_9ACTO Length = 146 Score = 92.0 bits (228), Expect = 5e-18, Method: Composition-based stats. Identities = 38/112 (33%), Positives = 48/112 (42%), Gaps = 8/112 (7%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVN-ERGGEIDLIMREG-----RTTIFVEVR 66 + + G A E AR LE +GLR + N R GE+D+I R+ T+ VEVR Sbjct: 6 RPDRRAIGAAGEYTARLALEEEGLRLLDTNWRDGRRGELDIIARDETDPSRSWTVIVEVR 65 Query: 67 YRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN 118 R G A ASV K +L W H R DVVA T + Sbjct: 66 TRVGRRKGSALASVDHRKVARLRALTGAWCRAHGHLASR--VRIDVVAITVD 115 >UniRef50_A4WPR4 UPF0102 protein Rsph17025_0472 n=7 Tax=Rhodobacterales RepID=Y472_RHOS5 Length = 117 Score = 91.3 bits (226), Expect = 9e-18, Method: Composition-based stats. Identities = 34/115 (29%), Positives = 50/115 (43%), Gaps = 4/115 (3%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 + + G E R E A GEIDLI R+G IF+EV+ +S + Sbjct: 6 SHRAGFVAEEAVARIYERADRPVTARRWRGAAGEIDLIARDGAEVIFIEVK--KSKSHAA 63 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG-NEVEWIKDAFND 129 AAA ++R + ++ A +LA RFDV G +E I++AF Sbjct: 64 AAARLSRRQMERIYGAASEFLAGEPL-GQLTASRFDVALVDGMGRIEIIENAFAA 117 >UniRef50_A4E8G7 Putative uncharacterized protein n=1 Tax=Collinsella aerofaciens ATCC 25986 RepID=A4E8G7_9ACTN Length = 220 Score = 90.9 bits (225), Expect = 1e-17, Method: Composition-based stats. Identities = 30/136 (22%), Positives = 47/136 (34%), Gaps = 14/136 (10%) Query: 3 TVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERG--GEIDLIMRE-GRT 59 T R+ P++ + R +LE KG + G IDL+ + T Sbjct: 85 TPQGRASEPKEQDMNDMKEKAMGAVRAFLERKGYEIVDEAWQGPEGIGGIDLVAVDEDGT 144 Query: 60 TIFVEVRYRRSALYGGAAASVTRSKQHKLLQT-ARLWLARHNGSFDTVDCRFDVVA---F 115 +FV+ R G A + L + A WLA + + RFD VA Sbjct: 145 LVFVDATVRIGTD-GFPEA----HRARGLREALAARWLAGNGDDYADTPVRFDEVAMMVV 199 Query: 116 TGNE--VEWIKDAFND 129 N + + F + Sbjct: 200 KENRALLRHHINCFGE 215 >UniRef50_B8GXN3 UPF0102 protein CCNA_00142 n=4 Tax=Caulobacteraceae RepID=Y142_CAUCN Length = 125 Score = 90.5 bits (224), Expect = 1e-17, Method: Composition-based stats. Identities = 31/126 (24%), Positives = 51/126 (40%), Gaps = 4/126 (3%) Query: 4 VPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFV 63 + R ++ G E A WL KG R + + GEIDL+ + G+ V Sbjct: 1 MAAGVRQSRGTAARKVGRRAEVIAALWLMAKGYRILGFRLATPLGEIDLLAQRGKVLAVV 60 Query: 64 EVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-VEW 122 EV+ R A +V +++ +L + A LA H + R D++A Sbjct: 61 EVKQR--TTIEDALDAVKPTQRERLRRAATH-LAAHRAGLRDLLVRLDLIAMAPGRPPRH 117 Query: 123 IKDAFN 128 + DA+ Sbjct: 118 LPDAWG 123 >UniRef50_B9KH25 Putative uncharacterized protein n=3 Tax=Anaplasma marginale RepID=B9KH25_ANAMF Length = 126 Score = 90.5 bits (224), Expect = 1e-17, Method: Composition-based stats. Identities = 31/129 (24%), Positives = 58/129 (44%), Gaps = 8/129 (6%) Query: 1 MATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTT 60 M T +R+ R L G A E + + + + GEIDLI++ GR Sbjct: 1 MCTSKSRASKVRSL----VGYAGELVVLLLRKARLHKVLHHRYRSPLGEIDLIVQNGREL 56 Query: 61 IFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-E 119 F+EV+ ++ + VT+ ++ +++TA+ +L+R+ F F+V F+ Sbjct: 57 HFIEVKTSMTSRFHEVP--VTKKQRRSVVRTAQYFLSRNP-QFSEHQISFEVYCFSPKSG 113 Query: 120 VEWIKDAFN 128 V +A+ Sbjct: 114 VTRFVNAWQ 122 >UniRef50_Q28TZ5 Putative uncharacterized protein n=1 Tax=Jannaschia sp. CCS1 RepID=Q28TZ5_JANSC Length = 100 Score = 90.1 bits (223), Expect = 2e-17, Method: Composition-based stats. Identities = 28/99 (28%), Positives = 46/99 (46%), Gaps = 4/99 (4%) Query: 29 RWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAASVTRSKQHKL 88 R G R ++ GEIDL+M + IFVEV+ R+ + AA +++ + +L Sbjct: 2 RAYLDHGHRLVSRRWRGPAGEIDLVMEKDGEVIFVEVKASRT--HARAAEALSNRQIARL 59 Query: 89 LQTARLWLARHNGSFDTVDCRFDVVAFTG-NEVEWIKDA 126 L++A L RFDV G +++ I +A Sbjct: 60 LRSAEHCLGSFPKGLA-TPMRFDVALVDGQGQLDVIVNA 97 >UniRef50_B4CV56 Putative uncharacterized protein n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4CV56_9BACT Length = 89 Score = 90.1 bits (223), Expect = 2e-17, Method: Composition-based stats. Identities = 32/85 (37%), Positives = 44/85 (51%), Gaps = 5/85 (5%) Query: 50 IDLIMREGRTTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCR 109 +D++ R+ T +FVEV+ RRS +G A SVTR KQ + + A WL + D + R Sbjct: 1 MDIVCRDHDTLVFVEVKTRRSLTFGSPAESVTREKQKLIARGALAWLDLLG-NPDNILFR 59 Query: 110 FDVVAFTGNE----VEWIKDAFNDH 130 FD+V E IKDAF Sbjct: 60 FDIVEIIFEEDVPTFHIIKDAFKLP 84 >UniRef50_Q7NEX4 UPF0102 protein gll3754 n=1 Tax=Gloeobacter violaceus RepID=Y3754_GLOVI Length = 126 Score = 90.1 bits (223), Expect = 2e-17, Method: Composition-based stats. Identities = 38/121 (31%), Positives = 50/121 (41%), Gaps = 8/121 (6%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 + E L +G +A RGGEIDL++R G FVEV+ R + Sbjct: 3 RRHRFALQAEIWVADHLAAQGGLVLARRWRCRGGEIDLVVRLGGVLCFVEVKARGGNSWD 62 Query: 75 GAA-ASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE------VEWIKDAF 127 A +V KQ +LL A L+LA H CRFDV + V +I AF Sbjct: 63 SAGWEAVGAVKQRRLLLAAALFLAAHP-ELARSVCRFDVALVGRDPGGGVRLVAYIAGAF 121 Query: 128 N 128 Sbjct: 122 E 122 >UniRef50_A5GTL0 Restriction endonuclease-like n=1 Tax=Synechococcus sp. RCC307 RepID=A5GTL0_SYNR3 Length = 121 Score = 89.7 bits (222), Expect = 3e-17, Method: Composition-based stats. Identities = 28/107 (26%), Positives = 44/107 (41%), Gaps = 3/107 (2%) Query: 20 GDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG-AAA 78 G EAQ L + R + N R GE+DL++ + + + VEV+ RR Sbjct: 11 GAEAEAQVAVLLCRRHWRLLDCNWCCRWGELDLVLAKPQRLLLVEVKARRRWGLDHGGLL 70 Query: 79 SVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG-NEVEWIK 124 + K+ +L + R WLA H S+ + G +V W Sbjct: 71 ACGPRKRCRLARALRCWLAAHP-SYAFHSIEAHLALVDGEGQVRWFP 116 >UniRef50_Q2GDU7 Putative uncharacterized protein n=1 Tax=Neorickettsia sennetsu str. Miyayama RepID=Q2GDU7_NEOSM Length = 116 Score = 89.7 bits (222), Expect = 3e-17, Method: Composition-based stats. Identities = 21/105 (20%), Positives = 37/105 (35%), Gaps = 3/105 (2%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAA 78 G E L KG + E+DL+ + +FVEV++R S Sbjct: 13 VGRLAEMIVALHLSIKGYMLLCRRYRNPHCELDLVCIKHGVLLFVEVKFRSSLQ--VLET 70 Query: 79 SVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWI 123 V S+ K+ + + + + F V T + ++ I Sbjct: 71 MVDYSRMEKMYPASESFCSEFQLYYCLERV-FKVFLVTPSVIQVI 114 >UniRef50_Q3J5H3 UPF0102 protein RHOS4_03930 n=7 Tax=Rhodobacteraceae RepID=Y393_RHOS4 Length = 117 Score = 89.0 bits (220), Expect = 4e-17, Method: Composition-based stats. Identities = 35/111 (31%), Positives = 48/111 (43%), Gaps = 4/111 (3%) Query: 20 GDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAAS 79 G E R + G A GEIDLI REG IF+EV+ +S + AAA Sbjct: 10 GQTAEEAVARIYDRSGRPVAARRWRGVSGEIDLIAREGAEVIFIEVK--KSTSHAAAAAR 67 Query: 80 VTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG-NEVEWIKDAFND 129 ++R + ++ A +LA RFDV VE I++AF Sbjct: 68 LSRRQMDRIYGAASEFLAGEP-RGQLTASRFDVALVDALGRVEIIENAFAA 117 >UniRef50_C6V4V5 Putative uncharacterized protein n=1 Tax=Neorickettsia risticii str. Illinois RepID=C6V4V5_NEORI Length = 117 Score = 88.6 bits (219), Expect = 5e-17, Method: Composition-based stats. Identities = 19/105 (18%), Positives = 37/105 (35%), Gaps = 3/105 (2%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAA 78 G E L KG + E+DL+ + +F EV++R S Sbjct: 14 VGRLAEMIVALHLSIKGYMLLCRRYRNPHCELDLVCIKSGVLLFAEVKFRSSLQ--AVET 71 Query: 79 SVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWI 123 +V S+ ++ + + + + F V T + ++ I Sbjct: 72 TVDYSRMERMYPASESFCSEFQLYYYLERI-FKVFLITPSVIQVI 115 >UniRef50_B0S8P6 Endonuclease n=2 Tax=Leptospira biflexa serovar Patoc RepID=B0S8P6_LEPBA Length = 114 Score = 88.6 bits (219), Expect = 6e-17, Method: Composition-based stats. Identities = 19/106 (17%), Positives = 43/106 (40%), Gaps = 1/106 (0%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + G E A +L+ + +N ++ GEID+I + T EV+ Sbjct: 1 MKKGTIGKKGEEFASFYLQSLEHTILFSNYRKKIGEIDIISIKNDTLHCSEVKTWNERFG 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE 119 + +K+ ++ + L+L + +F + F+++ T + Sbjct: 61 FHPKECLHATKRARMRKV-YLYLLQEIPAFYHLTPSFNLIHITEKK 105 >UniRef50_C4YXH4 Protein Mlr4633 n=1 Tax=Rickettsia endosymbiont of Ixodes scapularis RepID=C4YXH4_9RICK Length = 111 Score = 88.2 bits (218), Expect = 8e-17, Method: Composition-based stats. Identities = 21/93 (22%), Positives = 46/93 (49%), Gaps = 5/93 (5%) Query: 36 LRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLW 95 + + GEID+I + +F+EV+ R S + V+ ++Q ++ + A ++ Sbjct: 23 YQILHHRKRYYVGEIDIIALCNKEIVFIEVKARSSKIDDR---FVSFNQQRRITRAAEMF 79 Query: 96 LARHNGSFDTVDCRFDVVAFTGNEVE-WIKDAF 127 L+ + + + RFD+V ++ IK+A+ Sbjct: 80 LSSN-SKYRNYNIRFDLVIIRSYKLPIIIKNAW 111 >UniRef50_C8WWC7 Putative uncharacterized protein n=1 Tax=Alicyclobacillus acidocaldarius subsp. acidocaldarius DSM 446 RepID=C8WWC7_ALIAD Length = 120 Score = 87.8 bits (217), Expect = 9e-17, Method: Composition-based stats. Identities = 22/78 (28%), Positives = 38/78 (48%), Gaps = 1/78 (1%) Query: 13 QLTTKQTGDAWEAQARRWLEGK-GLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSA 71 ++ ++ G E+ ++LE G R I N R GE+DLI + VEV+ R S Sbjct: 2 RVNRRELGTLGESFVGQYLERCLGWRVIEKNWRTRFGELDLIAENEDELVAVEVKTRTSP 61 Query: 72 LYGGAAASVTRSKQHKLL 89 + G ++ ++ KL+ Sbjct: 62 IDGDPIYALRPAQIPKLV 79 >UniRef50_A2VU88 Putative uncharacterized protein n=1 Tax=Burkholderia cenocepacia PC184 RepID=A2VU88_9BURK Length = 132 Score = 87.8 bits (217), Expect = 9e-17, Method: Composition-based stats. Identities = 48/129 (37%), Positives = 62/129 (48%), Gaps = 30/129 (23%) Query: 3 TVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE-GRTTI 61 P+ +K G A+E +AR++LE GL F+AANV RGGE+DL+MRE + Sbjct: 31 RPPSGDNFSGAARSKPVGAAFEQRARQFLERHGLGFVAANVTMRGGELDLVMREPDGMLV 90 Query: 62 FVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVE 121 FVEVR RRS +G AA CRFDVVAF + Sbjct: 91 FVEVRARRSTRHGAGAA-----------------------------CRFDVVAFEAGRLA 121 Query: 122 WIKDAFNDH 130 W++DAF Sbjct: 122 WLRDAFRTD 130 >UniRef50_Q1GJI4 UPF0102 protein TM1040_0449 n=9 Tax=Rhodobacteraceae RepID=Y449_SILST Length = 124 Score = 86.6 bits (214), Expect = 2e-16, Method: Composition-based stats. Identities = 31/100 (31%), Positives = 47/100 (47%), Gaps = 4/100 (4%) Query: 31 LEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAASVTRSKQHKLLQ 90 +GL + + GEIDLI+R+G T IF EV+ S AAA + ++ ++ Sbjct: 27 YLARGLTLVKSRWRGPHGEIDLILRDGETVIFAEVK--SSTTRDKAAARIKPAQMQRVFN 84 Query: 91 TARLWLARHNGSFDTVDCRFDVVAFTG-NEVEWIKDAFND 129 +A +L R DVV G EVE I++A+ Sbjct: 85 SAGAFLEGEPL-GQLTPARLDVVLVWGAGEVEIIENAYGH 123 >UniRef50_A5G0S4 UPF0102 protein Acry_2261 n=1 Tax=Acidiphilium cryptum JF-5 RepID=Y2261_ACICJ Length = 128 Score = 86.6 bits (214), Expect = 2e-16, Method: Composition-based stats. Identities = 33/113 (29%), Positives = 54/113 (47%), Gaps = 3/113 (2%) Query: 18 QTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAA 77 G E + W +G +A + GE+DL++ + T +FVEV+ R + A Sbjct: 17 HRGRDAERRVAGWYAAQGFVVLAQRLRTAAGELDLVVADRTTLVFVEVKARNALR--SAI 74 Query: 78 ASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAFNDH 130 SV ++ +L+ A + LA + + RFDVV G++V I+DAF Sbjct: 75 ESVAPRQRRRLVAAAAIVLAGQP-DWGRAETRFDVVLLVGDDVHAIRDAFRAD 126 >UniRef50_C7N7P3 Predicted endonuclease related to Holliday junction resolvase n=2 Tax=Coriobacteriaceae RepID=C7N7P3_SLAHD Length = 117 Score = 86.6 bits (214), Expect = 2e-16, Method: Composition-based stats. Identities = 19/115 (16%), Positives = 37/115 (32%), Gaps = 11/115 (9%) Query: 21 DAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE-GRTTIFVEVRYRRSALYGGAAAS 79 + RR+LE KG + +D I + +F++ +A G + Sbjct: 6 QRAKQGVRRYLELKGYEILEDGWCHGRDSVDFIATDEDDALVFIDCEVSENAGEGIPEEA 65 Query: 80 VTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-----VEWIKDAFND 129 R ++ A +LA + R+D+V + +A Sbjct: 66 PDRKAFERI---AAAYLAE--ADLSNTEVRYDIVGVLILGESRALIRHHINAITP 115 >UniRef50_A5KGE0 Putative uncharacterized protein n=1 Tax=Campylobacter jejuni subsp. jejuni CG8486 RepID=A5KGE0_CAMJE Length = 84 Score = 85.5 bits (211), Expect = 5e-16, Method: Composition-based stats. Identities = 17/68 (25%), Positives = 34/68 (50%), Gaps = 2/68 (2%) Query: 20 GDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAAS 79 G E +A ++L+ +G + N + + GEID+I ++ F+EV++ ++ Sbjct: 9 GILGEDKACKFLKKQGFEILKRNFHSKFGEIDIIAKKDEILHFIEVKFTQNDYEVS--ER 66 Query: 80 VTRSKQHK 87 + R K K Sbjct: 67 LDRKKLRK 74 >UniRef50_A8LJ68 UPF0102 protein Dshi_2830 n=1 Tax=Dinoroseobacter shibae DFL 12 RepID=Y2830_DINSH Length = 134 Score = 84.3 bits (208), Expect = 1e-15, Method: Composition-based stats. Identities = 34/121 (28%), Positives = 55/121 (45%), Gaps = 4/121 (3%) Query: 6 TRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEV 65 R+ R +G A EA+ R +G +A GGE+DLI+R G +FVEV Sbjct: 14 ARARQARGTRAMLSGAAAEARVERAYRDRGCDVLATRWRGSGGEVDLIVRRGDLLVFVEV 73 Query: 66 RYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG-NEVEWIK 124 + SA Y A S++ ++ ++ TA +L R ++ RFD+ G + Sbjct: 74 K--SSASYTRAIESLSLAQLTRIQNTALEFLDRSP-DLAGLEMRFDLAVVEGSGRFRVLA 130 Query: 125 D 125 + Sbjct: 131 N 131 >UniRef50_A5GLH9 Restriction endonuclease-like n=5 Tax=Chroococcales RepID=A5GLH9_SYNPW Length = 142 Score = 84.3 bits (208), Expect = 1e-15, Method: Composition-based stats. Identities = 33/109 (30%), Positives = 51/109 (46%), Gaps = 8/109 (7%) Query: 11 PRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRT----TIFVEVR 66 +LTT TG EA+A R L+G+G + + R GEIDL++ + + VEV+ Sbjct: 12 NAKLTTATTGLWAEAKALRLLQGRGWTLLEKRWSCRYGEIDLLLCKANAPVPRLLAVEVK 71 Query: 67 -YRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVA 114 RR G A+ K+ +L T W+A + C+ +VV Sbjct: 72 GRRRCGPDGWGLAAFDARKRQRLALTLNYWIALNP---RHACCQLEVVL 117 >UniRef50_B1M445 UPF0102 protein Mrad2831_2938 n=10 Tax=Alphaproteobacteria RepID=Y2938_METRJ Length = 129 Score = 82.4 bits (203), Expect = 4e-15, Method: Composition-based stats. Identities = 32/112 (28%), Positives = 48/112 (42%), Gaps = 3/112 (2%) Query: 9 GSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYR 68 G+ R+ + G E A L KG I V+ GGEIDL++R T +FVEV+ R Sbjct: 8 GADRRRAAYRFGHRAEWLALAALMLKGYWPIGRRVSVAGGEIDLVVRRWNTVVFVEVKAR 67 Query: 69 RSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEV 120 A A ++ +K+ + + R W+ R+ R D V Sbjct: 68 --AKRDDAREAIDGAKRRRFSRAVRAWIGRNAW-CAGATFRADAVFVGHWAW 116 >UniRef50_D1ATK0 Putative uncharacterized protein n=1 Tax=Anaplasma centrale str. Israel RepID=D1ATK0_ANACI Length = 151 Score = 80.5 bits (198), Expect = 1e-14, Method: Composition-based stats. Identities = 21/89 (23%), Positives = 43/89 (48%), Gaps = 4/89 (4%) Query: 40 AANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARH 99 GEIDLI++ GR F+EV+ ++ + VT ++ +++TA+ +L+RH Sbjct: 66 HHRYRSPLGEIDLIVQNGRELYFIEVKTSMTSRFREVP--VTGKQRRSIVRTAQYFLSRH 123 Query: 100 NGSFDTVDCRFDVVAFTGN-EVEWIKDAF 127 ++ F+V + + +A+ Sbjct: 124 PQFYEH-QISFEVYCISPRSGITRFVNAW 151 >UniRef50_B6IVS9 Putative uncharacterized protein n=1 Tax=Rhodospirillum centenum SW RepID=B6IVS9_RHOCS Length = 126 Score = 80.1 bits (197), Expect = 2e-14, Method: Composition-based stats. Identities = 28/123 (22%), Positives = 49/123 (39%), Gaps = 5/123 (4%) Query: 4 VPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFV 63 P RS + + T ++ G E R L KG R +A + GE+D++ V Sbjct: 2 APVRSRTDYR-TAERLGRRAEWLCRLALLLKGYRILATRLRTPAGEVDILAERRGLLAVV 60 Query: 64 EVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEV-EW 122 EV+ R A A+VT + ++ + A+ A R+D++ Sbjct: 61 EVKARP--GLEAARAAVTEADWRRIARAAQG-YAAARPRLAGHAIRYDLMVVLPGRWPVH 117 Query: 123 IKD 125 ++D Sbjct: 118 LED 120 >UniRef50_A9GEX7 UPF0102 protein sce2912 n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=Y2912_SORC5 Length = 144 Score = 79.7 bits (196), Expect = 3e-14, Method: Composition-based stats. Identities = 33/128 (25%), Positives = 51/128 (39%), Gaps = 6/128 (4%) Query: 6 TRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEV 65 +G+P + G E L +G+ +A N EID++ R+G +EV Sbjct: 17 GSAGAPAADARRALGARAEDAVVAHLAAQGVEIVARNARVGRLEIDVVARDGPVIAIIEV 76 Query: 66 RYRRSALYGGAAASVTRSKQHKLLQTARL-WLARHNGSFDTVDCRFDVVAFTG-----NE 119 R R + Y A S+ K+ ++ + W A + RFD + T Sbjct: 77 RTRGAGSYVRALDSIDARKRARVRRAGERLWRATFSRVRGVERMRFDAASVTFLPSGEAT 136 Query: 120 VEWIKDAF 127 VE IK AF Sbjct: 137 VEIIKAAF 144 >UniRef50_B3DVH2 Predicted endonuclease n=2 Tax=Verrucomicrobia RepID=B3DVH2_METI4 Length = 78 Score = 78.9 bits (194), Expect = 5e-14, Method: Composition-based stats. Identities = 22/75 (29%), Positives = 31/75 (41%), Gaps = 7/75 (9%) Query: 59 TTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF--- 115 +FVEV+ R S YG +V K+ L+ A +L V RFDVV Sbjct: 1 MLVFVEVKTRSSIQYGFPYEAVDAQKKRNLIAAAHAYLKL--LKNPVVAYRFDVVEVLFF 58 Query: 116 --TGNEVEWIKDAFN 128 T ++ +AF Sbjct: 59 KGTRPKITHYPNAFG 73 >UniRef50_C8WHA2 Putative uncharacterized protein n=1 Tax=Eggerthella lenta DSM 2243 RepID=C8WHA2_EGGLE Length = 123 Score = 78.6 bits (193), Expect = 6e-14, Method: Composition-based stats. Identities = 26/114 (22%), Positives = 48/114 (42%), Gaps = 13/114 (11%) Query: 25 AQARRWLEGKGLRFIAANVNER--GGEIDLIMREG--RTTIFVEVRYRRSALYGGAAASV 80 A R+LE +G +A G IDL+ R+ +FV+V R ++ G Sbjct: 13 EAAARFLEVRGYETLATGWKSPETRGTIDLVARDPESDDLVFVDVSARPNSGAGFGD--- 69 Query: 81 TRSKQHKLLQTARLWLARHNGSFDTVDCRFD---VVAFTGNE--VEWIKDAFND 129 R+ + + A WL ++ ++V RFD ++ + + +AF + Sbjct: 70 GRNDRETMELLAVSWLVENDF-AESVGVRFDKISMIVVGEDRALLRHHINAFGE 122 >UniRef50_UPI000190D97F hypothetical protein SentesTyp_00923 n=1 Tax=Salmonella enterica subsp. enterica serovar Typhi str. E98-2068 RepID=UPI000190D97F Length = 82 Score = 78.6 bits (193), Expect = 6e-14, Method: Composition-based stats. Identities = 48/55 (87%), Positives = 51/55 (92%) Query: 32 EGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAASVTRSKQH 86 E KGLRFIAANV ERGGEIDLIMR+G+TT+FVEVRYRRS LYGGAAASVTRSKQ Sbjct: 2 ESKGLRFIAANVRERGGEIDLIMRDGKTTVFVEVRYRRSGLYGGAAASVTRSKQQ 56 >UniRef50_C0W7A2 Putative uncharacterized protein (Fragment) n=1 Tax=Actinomyces urogenitalis DSM 15434 RepID=C0W7A2_9ACTO Length = 125 Score = 78.2 bits (192), Expect = 7e-14, Method: Composition-based stats. Identities = 21/79 (26%), Positives = 29/79 (36%), Gaps = 11/79 (13%) Query: 3 TVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNER-----GGEIDLIMREG 57 PT +QTG E A +L G + N R GEID++ E Sbjct: 44 QEPTGPQPFPSGDRRQTGRRGEDLAAAYLTDLGWTVLERNWRPRGLAGLRGEIDIVASEP 103 Query: 58 R------TTIFVEVRYRRS 70 T + VEV+ R + Sbjct: 104 SASAGRPTLVVVEVKTRST 122 >UniRef50_Q0I9S1 Uncharacterised protein family protein n=1 Tax=Synechococcus sp. CC9311 RepID=Q0I9S1_SYNS3 Length = 144 Score = 77.4 bits (190), Expect = 1e-13, Method: Composition-based stats. Identities = 24/95 (25%), Positives = 40/95 (42%), Gaps = 5/95 (5%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREG----RTTIFVEVRYRRS 70 ++ G E + L G R + N + R GEIDL+ + + VEV+ R Sbjct: 14 NSQALGAQAELYVKEVLLRHGWRLLEHNWSCRYGEIDLLFTKQSFPASRILVVEVKARHR 73 Query: 71 AL-YGGAAASVTRSKQHKLLQTARLWLARHNGSFD 104 + G A+ ++K+ L +T W A + S Sbjct: 74 SGLDGWGVAAFHQAKRRCLARTVECWRAANAWSEA 108 >UniRef50_B9XLY1 Putative uncharacterized protein n=1 Tax=bacterium Ellin514 RepID=B9XLY1_9BACT Length = 101 Score = 77.0 bits (189), Expect = 2e-13, Method: Composition-based stats. Identities = 24/61 (39%), Positives = 34/61 (55%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 Q G+ E A+++L +GL+F AN GEIDLI R+G +FVEV+ R S + Sbjct: 19 HLQHGELGERAAKKYLRKQGLKFFTANFKSDRGEIDLIFRDGDGLVFVEVKTRSSVDWNL 78 Query: 76 A 76 Sbjct: 79 P 79 >UniRef50_Q1GWI7 UPF0102 protein Sala_0262 n=5 Tax=Sphingomonadales RepID=Y262_SPHAL Length = 116 Score = 72.8 bits (178), Expect = 3e-12, Method: Composition-based stats. Identities = 22/101 (21%), Positives = 40/101 (39%), Gaps = 5/101 (4%) Query: 30 WLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAASVTRSKQHKLL 89 WL G R + + GE+DL+ R GRT F+EV++R ++ + ++ Sbjct: 20 WLRLHGWRIVGQRLRVPVGEVDLVARRGRTVAFIEVKWR--DRAADLDLAIDPYRLRRVA 77 Query: 90 QTARLWLARHNGSFDTVDCRFDVVAFTGNEV-EWIKDAFND 129 A + R +D R DV+ + + + Sbjct: 78 AAAEMLAPRFARPYDD--IRIDVMLLAPRRLPRHLVHVWQP 116 >UniRef50_C0BCN9 Putative uncharacterized protein n=1 Tax=Coprococcus comes ATCC 27758 RepID=C0BCN9_9FIRM Length = 70 Score = 70.1 bits (171), Expect = 2e-11, Method: Composition-based stats. Identities = 18/63 (28%), Positives = 34/63 (53%), Gaps = 2/63 (3%) Query: 66 RYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 ++R++ A +V+ KQ +L A +L ++ + + CRFDV+ GN++ K+ Sbjct: 2 KFRKTGGLSAALEAVSVPKQMRLSGAAVYYLMKNGCTE--IPCRFDVIGIAGNKISLRKN 59 Query: 126 AFN 128 AF Sbjct: 60 AFE 62 >UniRef50_Q4E9U1 Endonuclease (Fragment) n=5 Tax=Wolbachia RepID=Q4E9U1_9RICK Length = 107 Score = 69.3 bits (169), Expect = 3e-11, Method: Composition-based stats. Identities = 16/77 (20%), Positives = 36/77 (46%), Gaps = 5/77 (6%) Query: 36 LRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLW 95 I + GEIDLI+ + + IF+EV+ ++ + ++ +++ + Sbjct: 26 YNVIKRRYRCKFGEIDLIVSKKKELIFIEVKTSLLGKEIP----ISHLQCQSIINSSKYF 81 Query: 96 LARHNGSFDTVDCRFDV 112 L+++ SF R+D+ Sbjct: 82 LSKN-LSFLDYSVRYDL 97 >UniRef50_C8W847 Putative uncharacterized protein n=1 Tax=Atopobium parvulum DSM 20469 RepID=C8W847_ATOPD Length = 117 Score = 68.9 bits (168), Expect = 5e-11, Method: Composition-based stats. Identities = 23/111 (20%), Positives = 43/111 (38%), Gaps = 11/111 (9%) Query: 24 EAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAASVTRS 83 EA + L+ + + + N ID I+ + IFV+ + + Sbjct: 11 EAISAN-LKHRDIEVLEKNWAHGSDGIDFIVMDDEELIFVDTAT-KCGGFDVPREEPD-- 66 Query: 84 KQHKLLQTARLWLARHNGSFDTVDCRFDVVA--FTGNE---VEWIKDAFND 129 Q + + A +LA + R+D+V+ TG+E + K+ ND Sbjct: 67 -QERFERIAAAYLAE-SEVEGLASIRYDIVSLLVTGSEKALLRHHKNVLND 115 >UniRef50_Q73VP8 Putative uncharacterized protein n=1 Tax=Mycobacterium avium subsp. paratuberculosis RepID=Q73VP8_MYCPA Length = 98 Score = 64.3 bits (156), Expect = 1e-09, Method: Composition-based stats. Identities = 15/41 (36%), Positives = 19/41 (46%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE 56 Q G EA A L GLR + N R GE+D+I + Sbjct: 40 RIQLGAMGEALAVDHLTRMGLRVLHRNWRCRYGELDIIACD 80 >UniRef50_A8A9D3 Putative uncharacterized protein n=1 Tax=Ignicoccus hospitalis KIN4/I RepID=A8A9D3_IGNH4 Length = 201 Score = 62.4 bits (151), Expect = 4e-09, Method: Composition-based stats. Identities = 31/115 (26%), Positives = 44/115 (38%), Gaps = 12/115 (10%) Query: 11 PRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGG----EIDLIMREGRTTIFVEVR 66 + + A+E+ R LE G + NV R G E D+I +G I VE + Sbjct: 60 EHEAASYLNWKAFESYVARALEEAGFETL-KNVRVRAGDKLAEFDVIGYDGDKVIVVECK 118 Query: 67 YRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVE 121 R SA A V + K+ + A WLA+ V VV G + Sbjct: 119 -RWSAFRRSALLKVAEEHKAKVERAA-YWLAKLGKRALPV-----VVTLRGTPIR 166 >UniRef50_C7N801 Predicted endonuclease related to Holliday junction resolvase n=1 Tax=Slackia heliotrinireducens DSM 20476 RepID=C7N801_SLAHD Length = 130 Score = 62.0 bits (150), Expect = 6e-09, Method: Composition-based stats. Identities = 21/90 (23%), Positives = 36/90 (40%), Gaps = 5/90 (5%) Query: 25 AQARRWLEG-KGLRFIAANVNERGGEIDLIMREG--RTTIFVEVRYRRSALYGGAAASVT 81 A AR +L+ KG + + + ID I + +FVE+R R Y Sbjct: 17 ALARVFLQREKGFAILKDDFSRGLDSIDFIALDDTQTVIVFVEMRLRHENSYIDKNPRSD 76 Query: 82 RSKQHKLLQTARLWLARHNGSFDTVDCRFD 111 K + A +L+++ F ++ R D Sbjct: 77 FDK-RRFEHLALAFLSKYPCLF-NLEIRAD 104 >UniRef50_Q3AKE1 Uncharacterised protein family UPF0102 n=2 Tax=Chroococcales RepID=Q3AKE1_SYNSC Length = 114 Score = 61.2 bits (148), Expect = 1e-08, Method: Composition-based stats. Identities = 18/73 (24%), Positives = 34/73 (46%), Gaps = 1/73 (1%) Query: 35 GLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG-GAAASVTRSKQHKLLQTAR 93 G R + N + R GE+DL++ + VEV+ RR + + +K+ ++ + Sbjct: 22 GWRLLDRNWHCRWGELDLVLERQLLLLVVEVKGRRMGHHDRHGLDAFHSAKRRRMARAIS 81 Query: 94 LWLARHNGSFDTV 106 W A H S + + Sbjct: 82 CWRAVHPASAEQL 94 >UniRef50_Q8W6V7 Putative uncharacterized protein n=1 Tax=Synechococcus phage P60 RepID=Q8W6V7_9CAUD Length = 99 Score = 60.1 bits (145), Expect = 2e-08, Method: Composition-based stats. Identities = 29/113 (25%), Positives = 43/113 (38%), Gaps = 21/113 (18%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 ++T + G + E A L G NV G ID+++ +G T ++V+ Sbjct: 2 ISTHKRGASAELLACAALVDAGFEVF-RNV-TPDGPIDIVVWDGETFYPIDVKRASHY-- 57 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDA 126 +KQ K A L L N + CR D +E WI DA Sbjct: 58 --------VNKQGK----ATLKLPAKNNEHALILCRTD-----KDEWVWINDA 93 >UniRef50_A6DBD4 Putative uncharacterized protein (Fragment) n=1 Tax=Caminibacter mediatlanticus TB-2 RepID=A6DBD4_9PROT Length = 43 Score = 58.9 bits (142), Expect = 5e-08, Method: Composition-based stats. Identities = 14/43 (32%), Positives = 24/43 (55%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE 56 + +K G+ E +A+ +L K R + N +GGEID+I + Sbjct: 1 MNSKSKGNIAEKKAKEYLLNKKFRIVETNFYCKGGEIDIIAYK 43 >UniRef50_Q8TW03 Predicted endonuclease of the RecB family n=1 Tax=Methanopyrus kandleri RepID=Q8TW03_METKA Length = 258 Score = 58.1 bits (140), Expect = 9e-08, Method: Composition-based stats. Identities = 13/57 (22%), Positives = 21/57 (36%), Gaps = 5/57 (8%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERG-----GEIDLIMREGRTTIFVEVRY 67 + G + E A L +G +A N EID++ + VEV+ Sbjct: 1 MLRRGKSAEEIAASILRKEGFEVVARNYRVELEDELVAEIDIVAEKDGERYAVEVKA 57 >UniRef50_C5U625 Putative uncharacterized protein n=1 Tax=Methanocaldococcus infernus ME RepID=C5U625_9EURY Length = 108 Score = 55.4 bits (133), Expect = 5e-07, Method: Composition-based stats. Identities = 17/57 (29%), Positives = 26/57 (45%), Gaps = 5/57 (8%) Query: 18 QTGDAWEAQARRWLEGKGLRFIAANV-----NERGGEIDLIMREGRTTIFVEVRYRR 69 + G E +A +L+ KG + I NV + E D+I + G VEV+ R Sbjct: 2 RKGKKKEGRAANYLKEKGYKIIGRNVIKRINQHKKAEYDIIAKRGNYKYAVEVKSGR 58 >UniRef50_O33024 UPF0102 protein ML1607 n=2 Tax=Mycobacterium leprae RepID=Y1607_MYCLE Length = 96 Score = 54.3 bits (130), Expect = 1e-06, Method: Composition-based stats. Identities = 15/47 (31%), Positives = 20/47 (42%) Query: 10 SPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE 56 + + +T Q E A L GLR + N R GE D+I E Sbjct: 3 THKAMTRVQLEAMGEVFAVDNLTRMGLRGLHCNWRCRYGECDVIASE 49 >UniRef50_A4GJ57 Putative uncharacterized protein n=1 Tax=uncultured marine Nitrospinaceae bacterium RepID=A4GJ57_9DELT Length = 64 Score = 54.3 bits (130), Expect = 1e-06, Method: Composition-based stats. Identities = 15/61 (24%), Positives = 30/61 (49%), Gaps = 7/61 (11%) Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-----VEWIKDAF 127 +G ++T +KQ K++Q + +L + RFDVV T + +E ++++F Sbjct: 5 FGHQFDALTPTKQKKIIQITQSFLVQKRIP--DKSMRFDVVVLTLDRPDSCKIELLENSF 62 Query: 128 N 128 Sbjct: 63 Q 63 >UniRef50_UPI0001699F06 hypothetical protein Epers_29808 n=1 Tax=Endoriftia persephone 'Hot96_1+Hot96_2' RepID=UPI0001699F06 Length = 60 Score = 53.9 bits (129), Expect = 2e-06, Method: Composition-based stats. Identities = 18/40 (45%), Positives = 30/40 (75%) Query: 54 MREGRTTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTAR 93 M++G + +FVEVRYR+S +G AA S+T +K+ KL+ ++ Sbjct: 1 MQDGNSLVFVEVRYRKSDNFGSAAESITAAKRAKLIAASQ 40 >UniRef50_Q5GSW9 RecB family endonuclease n=1 Tax=Wolbachia endosymbiont strain TRS of Brugia malayi RepID=Q5GSW9_WOLTR Length = 114 Score = 53.9 bits (129), Expect = 2e-06, Method: Composition-based stats. Identities = 14/70 (20%), Positives = 32/70 (45%), Gaps = 5/70 (7%) Query: 43 VNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGS 102 + EIDLI+ + + IF+EV+ ++ + ++ +++ +L+ S Sbjct: 33 YCCKFSEIDLIVSKKKELIFIEVKASLLGEDIL----ISYLQYQSIVNSSKYFLSEK-LS 87 Query: 103 FDTVDCRFDV 112 F R+D+ Sbjct: 88 FLDYPIRYDL 97 >UniRef50_Q5NWY8 Putative uncharacterized protein n=1 Tax=Aromatoleum aromaticum EbN1 RepID=Q5NWY8_AZOSE Length = 196 Score = 53.5 bits (128), Expect = 2e-06, Method: Composition-based stats. Identities = 27/133 (20%), Positives = 49/133 (36%), Gaps = 20/133 (15%) Query: 9 GSPRQLTTKQTGDAWE----AQARRWLEGKGLRF-IAANVNERGGEIDLIMREGRTTIFV 63 R+ +Q G E A+ LE G + ++ RGG+IDLI + +++ V Sbjct: 57 NQHRRAQVRQHGQHVEAKCGQLAKTALESDGYTVALGQRLH-RGGDIDLIATKDGSSVVV 115 Query: 64 EVRY--------RRSALYGGAAASVTRSKQHKL-LQTARLWLARHNGSFDTVDCRFDV-- 112 E++ R A V +Q + + A +WL + + + + Sbjct: 116 ELKSFRYWGARGRDDWREKKAIEQV-LRQQDTIAAKAAVIWLPMASPTLWQLLWGYSFGG 174 Query: 113 --VAFTGNEVEWI 123 VA V + Sbjct: 175 RGVAVVRGGVRHL 187 >UniRef50_Q3BT99 Putative uncharacterized protein n=2 Tax=Xanthomonas RepID=Q3BT99_XANC5 Length = 708 Score = 52.0 bits (124), Expect = 6e-06, Method: Composition-based stats. Identities = 22/89 (24%), Positives = 33/89 (37%), Gaps = 8/89 (8%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLR-FIAANVNERGGEIDLIMRE-GRTTIFVEVRYRRS 70 +LTT+Q GD EA L +G +A G IDL+ R G F E++ Sbjct: 432 RLTTRQLGDIGEAIQTHELVKQGYSDIVAIKNRSGHG-IDLVGRNPGGELEFFEIKTSAK 490 Query: 71 A----LYGGAAASVTRSKQHKLLQTARLW 95 +G V + + + W Sbjct: 491 GMAPAQHGDPEQFV-AKRLERAIDAKGHW 518 >UniRef50_C1E903 Predicted protein n=1 Tax=Micromonas sp. RCC299 RepID=C1E903_9CHLO Length = 682 Score = 50.4 bits (120), Expect = 2e-05, Method: Composition-based stats. Identities = 17/66 (25%), Positives = 24/66 (36%), Gaps = 11/66 (16%) Query: 15 TTKQTGDAWEAQARRWLEGK--GLRFIAANVNERGGE----IDLIMR--EGRTTIFVEVR 66 ++ G EA R+L + G E D+ MR +FVEV+ Sbjct: 562 DNRRVGRWGEALVYRYLLQRHPGWTVT---WVNEHAESKSFYDVKMRNVRDGRIVFVEVK 618 Query: 67 YRRSAL 72 RSA Sbjct: 619 TTRSAD 624 >UniRef50_Q9Y9F5 Putative uncharacterized protein n=1 Tax=Aeropyrum pernix RepID=Q9Y9F5_AERPE Length = 213 Score = 47.7 bits (113), Expect = 1e-04, Method: Composition-based stats. Identities = 15/61 (24%), Positives = 25/61 (40%), Gaps = 9/61 (14%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNER-------GGEIDLIMREGRTTIFVEVR 66 + ++ E A R LE G R + + GEID++ +G + VEV+ Sbjct: 1 MAGRRAWRNSEEIAARILEKSGFRVLD--FHVPIEDGGVEVGEIDIVAEKGGSRYSVEVK 58 Query: 67 Y 67 Sbjct: 59 A 59 >UniRef50_Q6MLA1 Putative uncharacterized protein n=1 Tax=Bdellovibrio bacteriovorus RepID=Q6MLA1_BDEBA Length = 97 Score = 44.7 bits (105), Expect = 8e-04, Method: Composition-based stats. Identities = 23/100 (23%), Positives = 45/100 (45%), Gaps = 21/100 (21%) Query: 29 RWLEGKGLRFIAANVNERGGEIDLIMREGR-TTIFVEVRYRRSALYGGAAASVTRSKQHK 87 ++ + K + V E+DL+ + R T + VEV+ + + +T+ ++ + Sbjct: 3 KYYQLKCCHLLGQRVKTPFAEVDLLFKTPRQTLLMVEVKTTNLSDFQP--FRITKKQKAR 60 Query: 88 LLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAF 127 L++ A L+LA R+DV+ E+ W AF Sbjct: 61 LVR-AMLFLAA----------RWDVLV----EIHW---AF 82 >UniRef50_D0CIJ2 Putative uncharacterized protein n=2 Tax=Bacteria RepID=D0CIJ2_9SYNE Length = 46 Score = 43.5 bits (102), Expect = 0.002, Method: Composition-based stats. Identities = 8/19 (42%), Positives = 12/19 (63%) Query: 35 GLRFIAANVNERGGEIDLI 53 G R + N + R GE+DL+ Sbjct: 22 GWRLLDRNWHCRWGELDLV 40 >UniRef50_B5IHF1 ATPase n=1 Tax=Aciduliprofundum boonei T469 RepID=B5IHF1_9EURY Length = 390 Score = 43.1 bits (101), Expect = 0.003, Method: Composition-based stats. Identities = 15/95 (15%), Positives = 34/95 (35%), Gaps = 21/95 (22%) Query: 11 PRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNER-GGEIDLIMREGRTTIFVEVRYRR 69 PR++ G +E L GL + E+D I+ + +EV+ Sbjct: 283 PREMD----GLLFENYVLSELIKMGLEP--RYWRTKSKAEVDFIVERDGKIVPIEVKL-- 334 Query: 70 SALYGGAAASVTRSKQHKLLQTARLWLARHNGSFD 104 ++K K+ ++ R ++ ++ + Sbjct: 335 ------------QAKPEKVEKSMRAFIEKYEPEYA 357 >UniRef50_Q7VH71 Putative uncharacterized protein n=1 Tax=Helicobacter hepaticus RepID=Q7VH71_HELHP Length = 154 Score = 42.7 bits (100), Expect = 0.003, Method: Composition-based stats. Identities = 20/107 (18%), Positives = 38/107 (35%), Gaps = 19/107 (17%) Query: 10 SPRQLTTKQTGDAWEAQARRWLEGKGLRF----IAANVNERGGEIDLIMREGRTTIFVEV 65 S KQ GD +E Q R + +G + + ++G ID+I +G+ + ++ Sbjct: 28 SNNARHNKQKGDKYELQIVRHYKQQGYKVYPKGLKEGRRDKG--IDIIAYKGKEALLIQC 85 Query: 66 RYRRSALYGGAAASVTRSKQHKLLQT---ARLWLARHNGSFDTVDCR 109 + + KQ L +L ++ F R Sbjct: 86 KNWERSQV----------KQEHLRIFLGDCTAYLEQNQKIFAKRSVR 122 >UniRef50_D1TJS5 Putative uncharacterized protein n=1 Tax=Burkholderia sp. CCGE1002 RepID=D1TJS5_9BURK Length = 341 Score = 42.7 bits (100), Expect = 0.004, Method: Composition-based stats. Identities = 11/48 (22%), Positives = 19/48 (39%), Gaps = 2/48 (4%) Query: 21 DAWEAQARRWLEGKGLRFIAANVNE--RGGEIDLIMREGRTTIFVEVR 66 +E+ L G + N+ R E+D+ R+ VEV+ Sbjct: 8 QQFESIVAELLVKLGFEKVERNIAHPARRAEVDITFRKKSELAVVEVK 55 >UniRef50_B7KKS4 Putative uncharacterized protein n=1 Tax=Cyanothece sp. PCC 7424 RepID=B7KKS4_CYAP7 Length = 678 Score = 42.0 bits (98), Expect = 0.006, Method: Composition-based stats. Identities = 14/54 (25%), Positives = 24/54 (44%), Gaps = 3/54 (5%) Query: 20 GDAWEAQARRWLEGKGLRFIAANVNE-RGGEIDLIMREGRTTIFVEVRYRRSAL 72 G+ E +A ++ + G +NV+ DL + IFVEV+ S+ Sbjct: 517 GNFGEDKAIQFYQALGYEV--SNVSNQPQKGYDLECIKDGQEIFVEVKTISSSN 568 >UniRef50_A3DLW4 Endonuclease (RecB family)-like protein n=1 Tax=Staphylothermus marinus F1 RepID=A3DLW4_STAMF Length = 236 Score = 42.0 bits (98), Expect = 0.007, Method: Composition-based stats. Identities = 16/61 (26%), Positives = 29/61 (47%), Gaps = 6/61 (9%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNER-GG----EIDLIMREG-RTTIFVEVR 66 L+ K+ + E A ++LE +G + I +V + G EID I+ + VE++ Sbjct: 2 SLSAKRKWRSSEEIALQFLEQQGFKIIDKHVKVKIEGVEVSEIDAIVEDEKGEKYAVEIK 61 Query: 67 Y 67 Sbjct: 62 A 62 >UniRef50_A2BIV6 Endonuclease of RecB family n=1 Tax=Hyperthermus butylicus DSM 5456 RepID=A2BIV6_HYPBU Length = 235 Score = 41.6 bits (97), Expect = 0.007, Method: Composition-based stats. Identities = 17/67 (25%), Positives = 27/67 (40%), Gaps = 6/67 (8%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIA--ANVNERG---GEIDLIMR-EGRTTIFVEVRY 67 ++ + A E A R LE +G + V G E+D + R VEV+ Sbjct: 1 MSGMKRWHASERIAFRLLEEQGYEILEVHKRVRIEGVEVAEVDAVARGPDGELYAVEVKA 60 Query: 68 RRSALYG 74 R ++G Sbjct: 61 GRLDVHG 67 >UniRef50_B4S2M3 Putative transmembrane protein n=1 Tax=Alteromonas macleodii 'Deep ecotype' RepID=B4S2M3_ALTMD Length = 252 Score = 41.6 bits (97), Expect = 0.008, Method: Composition-based stats. Identities = 17/113 (15%), Positives = 36/113 (31%), Gaps = 15/113 (13%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG- 74 + +E +G R G IDL +R+G + V+ + ++ G Sbjct: 91 RQLHWRNFEELVAEAYRRQGYRVTEGGF-GADGGIDLELRKGDERVIVQCKQWKAQKVGV 149 Query: 75 -------GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEV 120 G + +K ++ + + + VV G+E+ Sbjct: 150 SVVREMFGVLTASNANKV--IIICSGKFTQQAIDFASDKP----VVLIDGDEL 196 >UniRef50_A7VFU4 Putative uncharacterized protein n=6 Tax=Bacteria RepID=A7VFU4_9CLOT Length = 416 Score = 41.2 bits (96), Expect = 0.010, Method: Composition-based stats. Identities = 11/82 (13%), Positives = 25/82 (30%), Gaps = 9/82 (10%) Query: 5 PTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRF-IAANVNERGGEIDLIMREGRTTIFV 63 P + G +E L +G + + E+D + ++ T I++ Sbjct: 304 PAFRYAVNGTRNMDFGRVYENIVYLELRRRGYEVYVGKLYKK---EVDFVAKKRDTLIYI 360 Query: 64 EVRYRRSA-----LYGGAAASV 80 +V S ++ Sbjct: 361 QVSDNISDETTFEREYSPLLAI 382 >UniRef50_D2LQ48 DUF234 DEXX-box ATPase n=3 Tax=Aciduliprofundum boonei T469 RepID=D2LQ48_9EURY Length = 462 Score = 40.8 bits (95), Expect = 0.012, Method: Composition-based stats. Identities = 12/56 (21%), Positives = 22/56 (39%), Gaps = 3/56 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAAN--VNERGGEIDLIMREGRTTIFVEVRYRR 69 G +E + + + + N RG EID++ +FVE ++R Sbjct: 342 NAHMGRIFEKIVAEIIAEQ-FKPLKMGSWWNRRGDEIDIVAELENEVLFVECKWRN 396 >UniRef50_A8MBK5 Putative uncharacterized protein n=1 Tax=Caldivirga maquilingensis IC-167 RepID=A8MBK5_CALMQ Length = 211 Score = 40.8 bits (95), Expect = 0.013, Method: Composition-based stats. Identities = 14/56 (25%), Positives = 21/56 (37%), Gaps = 6/56 (10%) Query: 20 GDAWEAQARRWLEGKGLRFIAANVNERG-----GEIDLIMREG-RTTIFVEVRYRR 69 G +E L G R + V GE+DLI+ + VEV+ + Sbjct: 4 GVRFEDYVAELLSRLGFRVMDRRVKVTSNGVEVGEVDLIVEDECGNKYSVEVKSGK 59 >UniRef50_Q2FT66 Putative uncharacterized protein n=1 Tax=Methanospirillum hungatei JF-1 RepID=Q2FT66_METHJ Length = 423 Score = 40.8 bits (95), Expect = 0.013, Method: Composition-based stats. Identities = 13/53 (24%), Positives = 22/53 (41%), Gaps = 5/53 (9%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNER-GGEIDLIMREGRTTIFVEVRY 67 ++ G ++E Q L +G R GE D I R + + ++V Y Sbjct: 309 SQDIGKSFENQVYIELIRRGYEV----WYFRDKGECDFIARRPGSMLAIQVSY 357 >UniRef50_B1L6B5 Putative uncharacterized protein n=1 Tax=Candidatus Korarchaeum cryptofilum OPF8 RepID=B1L6B5_KORCO Length = 226 Score = 40.4 bits (94), Expect = 0.016, Method: Composition-based stats. Identities = 21/105 (20%), Positives = 37/105 (35%), Gaps = 17/105 (16%) Query: 10 SPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNE---RGGEIDLIMREGRTTIFVEVR 66 SP + T+ + +E R L+ G+ I + R EID++ + ++ + Sbjct: 57 SPEKATSLLSWKDFETFCMRGLQIHGMEAI-RGLRFKNDRRYEIDVLGIGEGLILLIDCK 115 Query: 67 YRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFD 111 + G SV R K A+ + RFD Sbjct: 116 MWSTKGRSGKIESVARDHLRK----AKAF---------DEAIRFD 147 >UniRef50_A9KZ92 Putative uncharacterized protein n=2 Tax=Shewanella baltica RepID=A9KZ92_SHEB9 Length = 500 Score = 40.4 bits (94), Expect = 0.019, Method: Composition-based stats. Identities = 10/63 (15%), Positives = 19/63 (30%), Gaps = 5/63 (7%) Query: 7 RSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGG-----EIDLIMREGRTTI 61 G ++ G +E + G + N R E D++ + Sbjct: 332 TLGKVEGAASRLRGALFEYVVAEAMRASGYNGVEINKFCRNASGVQKEADVVCFNNKDVC 391 Query: 62 FVE 64 F+E Sbjct: 392 FIE 394 >UniRef50_C4F8E7 Putative uncharacterized protein n=1 Tax=Collinsella intestinalis DSM 13280 RepID=C4F8E7_9ACTN Length = 115 Score = 40.4 bits (94), Expect = 0.019, Method: Composition-based stats. Identities = 19/97 (19%), Positives = 32/97 (32%), Gaps = 16/97 (16%) Query: 21 DAWEAQARRWLEGKGLRFIA-ANVNERGGEIDLIMREG-RTTIFVEVRYRRSALYGGAAA 78 D E A+ +L K L+ + G+ D I + + V V R Sbjct: 4 DIGELIAKEFLLSKDLKSVDMTGYECDEGKADAICIDESGCHVLVNVETHRKRGVEEP-- 61 Query: 79 SVTRSKQ----HKLLQTARLWLARHNGSFDTVDCRFD 111 KQ ++ + +LA H + R+D Sbjct: 62 -----KQVYNVKRMRRVLMCYLADHP---EVKAARYD 90 >UniRef50_Q50I46 Putative holliday junction resolvase n=1 Tax=Acidianus rod-shaped virus 1 RepID=Q50I46_9VIRU Length = 115 Score = 40.4 bits (94), Expect = 0.020, Method: Composition-based stats. Identities = 15/80 (18%), Positives = 27/80 (33%), Gaps = 15/80 (18%) Query: 17 KQTGDAWEAQARRWLEGKGLRFIAANVNERGGEI------DLIMREGRTTIFVEVRYRRS 70 +G +E QA WL+ G + I D+I + +EV+ + Sbjct: 6 HNSGRYFEYQAMEWLQSHGYQTI----RIPASAAGKQPLPDIIATKNSVVYAIEVKSTSN 61 Query: 71 ALYGGAAASVTRSKQHKLLQ 90 V + + KL + Sbjct: 62 R-----LVRVDKFQIDKLYR 76 >UniRef50_A3CY54 Restriction endonuclease n=1 Tax=Methanoculleus marisnigri JR1 RepID=A3CY54_METMJ Length = 276 Score = 40.0 bits (93), Expect = 0.025, Method: Composition-based stats. Identities = 21/120 (17%), Positives = 32/120 (26%), Gaps = 18/120 (15%) Query: 2 ATVPTRSGSPRQLT-TKQTGDA-----WEAQARRWLEGKGLRFIAANVN----ERGGEID 51 R + + + G +E R L G R E+D Sbjct: 60 RAKAVRPAAGHRTNLRRALGLLRSKPDFEEFVRVLLREHGYRV-ETGCVLAGLCGEHEVD 118 Query: 52 LIMREGRTTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFD 111 I TIFVEV++ S VT + ++ + L D Sbjct: 119 AIAERDGATIFVEVKHHASHH------RVTGLDEGRIARAIIEDLQE-GFRAGRCTVSID 171 >UniRef50_C5A3A3 Prokaryotic ATPase, AAA superfamily n=1 Tax=Thermococcus gammatolerans EJ3 RepID=C5A3A3_THEGJ Length = 468 Score = 39.7 bits (92), Expect = 0.029, Method: Composition-based stats. Identities = 16/82 (19%), Positives = 29/82 (35%), Gaps = 21/82 (25%) Query: 19 TGDAWEAQARRWLEGKG--------LRFIAANVNERGGEIDLIMREGRT--TIFVEVRYR 68 G +E AR +L + EID++ + + +FVEV++ Sbjct: 334 LGRPFEEIAREFLIEANRKNLLPFRFTKLGRWWRRGE-EIDIVALDEGSKKALFVEVKWS 392 Query: 69 RSALYGGAAASVTRSKQHKLLQ 90 +T K K+L+ Sbjct: 393 D----------LTAGKARKVLR 404 >UniRef50_A1RWI2 Putative uncharacterized protein n=1 Tax=Thermofilum pendens Hrk 5 RepID=A1RWI2_THEPD Length = 203 Score = 39.7 bits (92), Expect = 0.029, Method: Composition-based stats. Identities = 15/56 (26%), Positives = 22/56 (39%), Gaps = 5/56 (8%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERG-----GEIDLIMREGRTTIFVEVRYRR 69 G EA AR LE +G + E+D++ R+ VEV+ R Sbjct: 3 RGVWVEALARLILELEGFSVEQTRLRLERNGVSLAEVDILARKNGEAYAVEVKSGR 58 >UniRef50_Q3BXS5 Putative uncharacterized protein n=1 Tax=Xanthomonas campestris pv. vesicatoria str. 85-10 RepID=Q3BXS5_XANC5 Length = 382 Score = 39.7 bits (92), Expect = 0.030, Method: Composition-based stats. Identities = 16/80 (20%), Positives = 28/80 (35%), Gaps = 11/80 (13%) Query: 18 QTGDAWEAQARRWLEGKGLR-FIAANVNERGGEIDLIMR-EGRTTIFVEVRYR----RSA 71 + GD EA L KG +A + G ID++ + + EV+ Sbjct: 79 EIGDIGEAFVSHDLAKKGYTDLVAIQDKQGHG-IDVVGKNQEGKWESFEVKASVQGTARR 137 Query: 72 LYGGAAASVTRSKQHKLLQT 91 +G +T +L + Sbjct: 138 QFGNPEEFIT----DRLRKA 153 >UniRef50_C1MSX7 Predicted protein n=1 Tax=Micromonas pusilla CCMP1545 RepID=C1MSX7_9CHLO Length = 648 Score = 39.3 bits (91), Expect = 0.035, Method: Composition-based stats. Identities = 14/66 (21%), Positives = 23/66 (34%), Gaps = 11/66 (16%) Query: 15 TTKQTGDAWEAQARRWLEGK--GLRFIAANVNERGGE----IDLIMR--EGRTTIFVEVR 66 + G E+ +L + G R E D+ + +G TIFVEV+ Sbjct: 526 DNRAVGRWGESLVYHYLLSRHVGWRVT---WMNEEKETKSFYDIKLESADGAETIFVEVK 582 Query: 67 YRRSAL 72 + Sbjct: 583 TTKFGD 588 >UniRef50_C8SAI5 Restriction endonuclease n=1 Tax=Ferroglobus placidus DSM 10642 RepID=C8SAI5_FERPL Length = 261 Score = 39.3 bits (91), Expect = 0.035, Method: Composition-based stats. Identities = 13/52 (25%), Positives = 20/52 (38%), Gaps = 5/52 (9%) Query: 20 GDAWEAQARRWLEGKGLRFIAANV----NERGGEIDLIMREGRTTIFVEVRY 67 G +E R LE G NV E+D+I R+ +E ++ Sbjct: 85 GFNFEKFVARVLEEWGYST-ETNVTMKGRCVMQEVDVIARKDEEVYMIECKF 135 >UniRef50_A7H236 Putative uncharacterized protein n=1 Tax=Campylobacter jejuni subsp. doylei 269.97 RepID=A7H236_CAMJD Length = 161 Score = 39.3 bits (91), Expect = 0.037, Method: Composition-based stats. Identities = 21/115 (18%), Positives = 42/115 (36%), Gaps = 20/115 (17%) Query: 4 VPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAA--NVNERGGEIDLIMREGRTTI 61 T++ K+ GD +E Q + + +G + N ++ IDLI + + Sbjct: 25 PKTKNNQFNYFKNKKKGDLYEIQIGKMYQKQGYKVYFKGINEKKKDAGIDLIAYKDNEVL 84 Query: 62 FVEVRYRRSALYGGAAASVTRSKQHKLLQ-----TARLWLARHNGSFDTVDC-RF 110 ++ + +++ KQ L TA +L + F + RF Sbjct: 85 LIQCKNWQNSQI----------KQEHLRIFLGDCTA--YLEKEKHKFKNKEIKRF 127 >UniRef50_Q05TI5 Putative uncharacterized protein n=1 Tax=Synechococcus sp. RS9916 RepID=Q05TI5_9SYNE Length = 89 Score = 39.3 bits (91), Expect = 0.045, Method: Composition-based stats. Identities = 10/47 (21%), Positives = 17/47 (36%), Gaps = 1/47 (2%) Query: 57 GRTTIFVEVRYRR-SALYGGAAASVTRSKQHKLLQTARLWLARHNGS 102 + VEV+ RR G A+ K +L + W + + Sbjct: 9 EGRLLVVEVKARRRCGRDGWGVAACNAGKLQRLARAMACWRMANPWT 55 >UniRef50_B0AC80 Putative uncharacterized protein n=1 Tax=Clostridium bartlettii DSM 16795 RepID=B0AC80_9CLOT Length = 336 Score = 38.5 bits (89), Expect = 0.069, Method: Composition-based stats. Identities = 15/83 (18%), Positives = 30/83 (36%), Gaps = 8/83 (9%) Query: 12 RQLTTKQTGDAW-EAQARRWLEGKGLR--FIAANVNERGGEIDLIMREGRTT----IFVE 64 + + QTG E ++ +G +A N + + D++ + + IFV+ Sbjct: 190 KGKSNLQTGGIGLENLVCEIMQCEGYESKILAKNKFQGKADADILAIKEDSFMSKKIFVQ 249 Query: 65 VRYRRSALYGGAAASV-TRSKQH 86 V++ V KQ Sbjct: 250 VKHHNGESGSYGIQQVIDVLKQK 272 >UniRef50_C7RS72 WD-40 repeat protein n=3 Tax=Bacteria RepID=C7RS72_9PROT Length = 1737 Score = 38.1 bits (88), Expect = 0.083, Method: Composition-based stats. Identities = 21/87 (24%), Positives = 32/87 (36%), Gaps = 16/87 (18%) Query: 2 ATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGL-----RFIAANVNERGGEIDLIMRE 56 A P G + + +Q G +E + L+ G + + N EIDLI + Sbjct: 298 AARPDVYGPGERGSAQQRGKTFEEKVAYLLQLLGYSVEQKQLLDGN------EIDLIASK 351 Query: 57 GR-----TTIFVEVRYRRSALYGGAAA 78 TT VE + A+ AA Sbjct: 352 RGDFGEVTTYLVECKAYTGAVPKEAAE 378 >UniRef50_B8D4V8 Predicted transcriptional regulator n=1 Tax=Desulfurococcus kamchatkensis 1221n RepID=B8D4V8_DESK1 Length = 296 Score = 38.1 bits (88), Expect = 0.090, Method: Composition-based stats. Identities = 15/52 (28%), Positives = 21/52 (40%), Gaps = 1/52 (1%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVR 66 + TGD E R L G R + ID+I G T +F++V Sbjct: 3 RNRGTGDIAEE-VSRTLRKAGFRVEFLSYPTSARSIDIIACRGDTRVFIKVS 53 >UniRef50_B8D4V0 Endonuclease (RecB family)-like protein n=1 Tax=Desulfurococcus kamchatkensis 1221n RepID=B8D4V0_DESK1 Length = 233 Score = 38.1 bits (88), Expect = 0.093, Method: Composition-based stats. Identities = 14/63 (22%), Positives = 25/63 (39%), Gaps = 6/63 (9%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNER-----GGEIDLIMRE-GRTTIFVEVR 66 +++ + + E A +LE +G R + GE+D I G VE++ Sbjct: 2 SISSSRKWRSSELIALEYLEKQGFRIEETRKKIKIEGVEIGEVDAIAISPGGEKYAVEIK 61 Query: 67 YRR 69 R Sbjct: 62 AGR 64 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_A4WEW3 UPF0102 protein Ent638_3585 n=124 Tax=Enterobact... 164 8e-40 UniRef50_D1P1S8 Putative choloylglycine hydrolase n=5 Tax=Entero... 159 4e-38 UniRef50_C4ZD68 Putative uncharacterized protein n=1 Tax=Eubacte... 157 1e-37 UniRef50_A5KJ12 Putative uncharacterized protein n=2 Tax=Clostri... 154 7e-37 UniRef50_A8SP33 Putative uncharacterized protein n=2 Tax=Clostri... 154 1e-36 UniRef50_A9KLL5 UPF0102 protein Cphy_2398 n=3 Tax=Clostridiales ... 154 1e-36 UniRef50_C1SJ30 Putative uncharacterized protein n=1 Tax=Denitro... 152 3e-36 UniRef50_B5YIA9 UPF0102 protein THEYE_A1950 n=1 Tax=Thermodesulf... 152 4e-36 UniRef50_Q0AWX4 UPF0102 protein Swol_1475 n=1 Tax=Syntrophomonas... 152 4e-36 UniRef50_UPI0001973CC8 hypothetical protein ClM62_15129 n=1 Tax=... 151 6e-36 UniRef50_A8MHC8 UPF0102 protein Clos_1471 n=8 Tax=Clostridiaceae... 151 6e-36 UniRef50_Q1JX98 Putative uncharacterized protein n=1 Tax=Desulfu... 151 7e-36 UniRef50_B0P3K1 Putative uncharacterized protein n=2 Tax=Clostri... 150 1e-35 UniRef50_C7R9K3 Putative uncharacterized protein n=1 Tax=Kangiel... 150 1e-35 UniRef50_A6FEY4 Putative uncharacterized protein n=1 Tax=Moritel... 150 1e-35 UniRef50_C4Z0D1 Putative endonuclease n=13 Tax=Clostridiales Rep... 150 1e-35 UniRef50_C4V2D8 Endonuclease n=1 Tax=Selenomonas flueggei ATCC 4... 150 2e-35 UniRef50_C4L9F6 Putative uncharacterized protein n=1 Tax=Tolumon... 149 2e-35 UniRef50_C0QTY9 UPF0102 protein PERMA_0362 n=2 Tax=Hydrogenother... 149 2e-35 UniRef50_Q5ZR89 UPF0102 protein lpg2994 n=4 Tax=Legionella RepID... 149 2e-35 UniRef50_A0KPY4 UPF0102 protein AHA_3896 n=4 Tax=Proteobacteria ... 149 3e-35 UniRef50_Q3JE65 UPF0102 protein Noc_0355 n=2 Tax=Nitrosococcus o... 148 4e-35 UniRef50_Q6AJE4 UPF0102 protein DP2807 n=1 Tax=Desulfotalea psyc... 148 5e-35 UniRef50_D1RD77 Putative uncharacterized protein n=1 Tax=Legione... 148 6e-35 UniRef50_B0U003 UPF0102 protein Fphi_0415 n=15 Tax=Francisella R... 147 7e-35 UniRef50_B2V8B3 UPF0102 protein SYO3AOP1_0546 n=2 Tax=Sulfurihyd... 147 9e-35 UniRef50_C8X4T0 Putative uncharacterized protein n=1 Tax=Desulfo... 147 9e-35 UniRef50_Q47VU1 UPF0102 protein CPS_4433 n=1 Tax=Colwellia psych... 147 1e-34 UniRef50_D1KBC1 Putative uncharacterized protein n=1 Tax=uncultu... 147 1e-34 UniRef50_C2KW07 Putative uncharacterized protein n=1 Tax=Oribact... 146 2e-34 UniRef50_B8J1T1 Putative uncharacterized protein n=1 Tax=Desulfo... 146 2e-34 UniRef50_A9NAA4 UPF0102 protein COXBURSA331_A1934 n=6 Tax=Coxiel... 145 4e-34 UniRef50_C0GQX2 Putative uncharacterized protein n=1 Tax=Desulfo... 145 4e-34 UniRef50_A1AN88 UPF0102 protein Ppro_1186 n=11 Tax=Deltaproteoba... 145 4e-34 UniRef50_D1VVL2 Putative uncharacterized protein n=1 Tax=Peptoni... 145 4e-34 UniRef50_A5F986 UPF0102 protein VC0395_A0112/VC395_0597 n=28 Tax... 145 6e-34 UniRef50_B6ELI6 UPF0102 protein VSAL_I2655 n=7 Tax=Vibrionaceae ... 145 6e-34 UniRef50_A0YER5 Putative uncharacterized protein n=1 Tax=marine ... 144 6e-34 UniRef50_B8CW28 Putative uncharacterized protein n=1 Tax=Halothe... 144 6e-34 UniRef50_C5BS52 Putative uncharacterized protein n=1 Tax=Teredin... 144 6e-34 UniRef50_A3XHA2 Putative uncharacterized protein n=1 Tax=Leeuwen... 144 7e-34 UniRef50_Q7N090 UPF0102 protein plu4003 n=2 Tax=Enterobacteriace... 144 8e-34 UniRef50_A1SU47 UPF0102 protein Ping_1176 n=2 Tax=Psychromonas R... 144 9e-34 UniRef50_Q2S9Y0 UPF0102 protein HCH_05895 n=1 Tax=Hahella chejue... 144 1e-33 UniRef50_C0N677 Putative uncharacterized protein n=1 Tax=Methylo... 144 1e-33 UniRef50_B7GSU4 UPF0102 protein Blon_1698 n=12 Tax=Bifidobacteri... 143 1e-33 UniRef50_Q67PD3 UPF0102 protein STH1475 n=1 Tax=Symbiobacterium ... 143 1e-33 UniRef50_C7RDQ1 Putative uncharacterized protein n=1 Tax=Anaeroc... 143 2e-33 UniRef50_Q1Q244 Putative uncharacterized protein n=1 Tax=Candida... 143 2e-33 UniRef50_Q0VS15 UPF0102 protein ABO_0585 n=2 Tax=Alcanivorax Rep... 142 3e-33 UniRef50_B3EJJ5 UPF0102 protein Cphamn1_0017 n=1 Tax=Chlorobium ... 142 3e-33 UniRef50_B9MQX5 UPF0102 protein Athe_0977 n=1 Tax=Anaerocellum t... 142 3e-33 UniRef50_A4BQJ8 Putative uncharacterized protein n=1 Tax=Nitroco... 142 4e-33 UniRef50_Q7MNW2 UPF0102 protein VV0603 n=80 Tax=Vibrionales RepI... 142 4e-33 UniRef50_B0G5Y9 Putative uncharacterized protein n=4 Tax=Clostri... 142 4e-33 UniRef50_C4GB01 Putative uncharacterized protein n=1 Tax=Shuttle... 142 5e-33 UniRef50_C6A8H5 Putative uncharacterized protein n=4 Tax=Bifidob... 141 5e-33 UniRef50_B7RSM7 Putative uncharacterized protein n=1 Tax=marine ... 141 6e-33 UniRef50_D0KYK3 Putative uncharacterized protein n=1 Tax=Halothi... 141 7e-33 UniRef50_Q8R5S3 UPF0102 protein TTE1452 n=9 Tax=Thermoanaerobact... 141 7e-33 UniRef50_A1ZF36 Putative uncharacterized protein n=1 Tax=Microsc... 141 7e-33 UniRef50_C6WYK5 Putative uncharacterized protein n=1 Tax=Methylo... 141 7e-33 UniRef50_D1CBL5 Putative uncharacterized protein n=1 Tax=Thermob... 141 8e-33 UniRef50_C9KJR5 Endonuclease n=2 Tax=Veillonellaceae RepID=C9KJR... 141 8e-33 UniRef50_A6VB97 UPF0102 protein PSPA7_4996 n=17 Tax=Pseudomonada... 141 8e-33 UniRef50_A8SLV5 Putative uncharacterized protein n=1 Tax=Parvimo... 140 8e-33 UniRef50_A3WNE6 Predicted endonuclease n=1 Tax=Idiomarina baltic... 140 8e-33 UniRef50_B8DRI1 Putative uncharacterized protein n=2 Tax=Desulfo... 140 1e-32 UniRef50_C0YUE8 Possible endonuclease n=2 Tax=Flavobacteriaceae ... 140 1e-32 UniRef50_Q7P0B3 UPF0102 protein CV_0654 n=1 Tax=Chromobacterium ... 140 1e-32 UniRef50_Q1GYY7 UPF0102 protein Mfla_2283 n=1 Tax=Methylobacillu... 140 1e-32 UniRef50_B4RXI2 Sigma-54 factor n=2 Tax=Alteromonas macleodii Re... 140 2e-32 UniRef50_A5N821 UPF0102 protein CKL_1410 n=16 Tax=Clostridium Re... 139 2e-32 UniRef50_D0MIM6 Putative uncharacterized protein n=1 Tax=Rhodoth... 139 2e-32 UniRef50_B2KC57 Putative uncharacterized protein n=1 Tax=Elusimi... 139 3e-32 UniRef50_A8G183 UPF0102 protein Ssed_4252 n=16 Tax=Shewanella Re... 139 3e-32 UniRef50_C8W5C0 Putative uncharacterized protein n=1 Tax=Desulfo... 139 3e-32 UniRef50_B5YFD1 UPF0102 protein DICTH_1420 n=2 Tax=Dictyoglomus ... 139 3e-32 UniRef50_A3M3I7 UPF0102 protein A1S_1049 n=12 Tax=Acinetobacter ... 139 3e-32 UniRef50_C6BVQ7 Putative uncharacterized protein n=1 Tax=Desulfo... 138 6e-32 UniRef50_A8PP71 Putative uncharacterized protein n=1 Tax=Rickett... 138 6e-32 UniRef50_A4J649 UPF0102 protein Dred_2035 n=1 Tax=Desulfotomacul... 137 8e-32 UniRef50_Q0TPP8 UPF0102 protein CPF_1959 n=9 Tax=Clostridium per... 137 8e-32 UniRef50_UPI0000510419 hypothetical protein BlinB_18076 n=1 Tax=... 137 1e-31 UniRef50_A5EVA6 UPF0102 protein DNO_0639 n=2 Tax=Cardiobacteriac... 137 1e-31 UniRef50_C9LW74 Endonuclease n=1 Tax=Selenomonas sputigena ATCC ... 137 1e-31 UniRef50_Q31EY6 UPF0102 protein Tcr_1695 n=1 Tax=Thiomicrospira ... 137 1e-31 UniRef50_A1KWG5 UPF0102 protein NMC2069 n=28 Tax=Neisseriaceae R... 137 1e-31 UniRef50_A0LV62 UPF0102 protein Acel_1550 n=8 Tax=Actinomycetale... 137 2e-31 UniRef50_Q2BGY7 Putative uncharacterized protein n=1 Tax=Neptuni... 136 2e-31 UniRef50_B2A2P1 UPF0102 protein Nther_1376 n=1 Tax=Natranaerobiu... 136 2e-31 UniRef50_B3JMU0 Putative uncharacterized protein n=6 Tax=Bactero... 136 2e-31 UniRef50_B0VJC5 Putative uncharacterized protein n=1 Tax=Candida... 136 2e-31 UniRef50_A4FME3 UPF0102 protein SACE_6045 n=2 Tax=Actinomycetale... 136 2e-31 UniRef50_A1SB01 UPF0102 protein Sama_3355 n=5 Tax=Shewanella Rep... 135 3e-31 UniRef50_C2LNN6 Possible endonuclease n=3 Tax=Proteus RepID=C2LN... 135 3e-31 UniRef50_A6TRS2 UPF0102 protein Amet_2739 n=1 Tax=Alkaliphilus m... 135 3e-31 UniRef50_C6D2Y6 Putative uncharacterized protein n=1 Tax=Paeniba... 135 3e-31 UniRef50_C8PPR7 HD domain protein n=1 Tax=Treponema vincentii AT... 135 3e-31 UniRef50_C3W9T3 Endonuclease n=4 Tax=Fusobacterium RepID=C3W9T3_... 135 3e-31 UniRef50_Q1NMK2 Putative uncharacterized protein n=1 Tax=delta p... 135 4e-31 UniRef50_A8ZV12 UPF0102 protein Dole_2298 n=2 Tax=Desulfobactera... 135 5e-31 UniRef50_Q3IG11 UPF0102 protein PSHAa2523 n=3 Tax=Alteromonadale... 135 5e-31 UniRef50_C7HUU3 Endonuclease n=4 Tax=Anaerococcus RepID=C7HUU3_9... 135 5e-31 UniRef50_C7N589 Predicted endonuclease related to Holliday junct... 135 6e-31 UniRef50_D0I4Y5 Endonuclease n=1 Tax=Grimontia hollisae CIP 1018... 134 6e-31 UniRef50_C7IKV6 Putative uncharacterized protein n=1 Tax=Clostri... 134 6e-31 UniRef50_Q1MRU7 UPF0102 protein LI0223 n=1 Tax=Lawsonia intracel... 134 6e-31 UniRef50_Q2S1J6 UPF0102 protein SRU_1822 n=1 Tax=Salinibacter ru... 134 7e-31 UniRef50_C0GHM5 Putative uncharacterized protein n=1 Tax=Dethiob... 134 8e-31 UniRef50_C4FFG3 Putative uncharacterized protein n=1 Tax=Bifidob... 134 8e-31 UniRef50_A6L1J0 UPF0102 protein BVU_1879 n=26 Tax=Bacteroidales ... 134 8e-31 UniRef50_B3QZF2 UPF0102 protein Ctha_1382 n=1 Tax=Chloroherpeton... 134 1e-30 UniRef50_D1U5W3 Putative uncharacterized protein n=1 Tax=Desulfo... 134 1e-30 UniRef50_A5Z6D1 Putative uncharacterized protein n=1 Tax=Eubacte... 134 1e-30 UniRef50_A5D1I2 UPF0102 protein PTH_1707 n=1 Tax=Pelotomaculum t... 133 1e-30 UniRef50_Q1MYA7 Putative uncharacterized protein n=1 Tax=Bermane... 133 1e-30 UniRef50_B3PLA2 Putative uncharacterized protein n=1 Tax=Cellvib... 133 1e-30 UniRef50_B8KR63 Putative uncharacterized protein n=1 Tax=gamma p... 133 1e-30 UniRef50_C4FZ58 Putative uncharacterized protein n=1 Tax=Abiotro... 133 2e-30 UniRef50_Q2YCL8 UPF0102 protein Nmul_A0195 n=3 Tax=Nitrosomonada... 133 2e-30 UniRef50_B8HR21 Putative uncharacterized protein n=1 Tax=Cyanoth... 133 2e-30 UniRef50_D2R2U4 Putative uncharacterized protein n=1 Tax=Pirellu... 133 2e-30 UniRef50_C4F8U2 Putative uncharacterized protein n=2 Tax=Collins... 133 2e-30 UniRef50_A3HX52 Putative uncharacterized protein n=1 Tax=Algorip... 133 2e-30 UniRef50_C8WAJ1 Putative uncharacterized protein n=1 Tax=Atopobi... 133 2e-30 UniRef50_C9LM05 Endonuclease n=1 Tax=Dialister invisus DSM 15470... 132 3e-30 UniRef50_D1NRZ9 Putative endonuclease n=1 Tax=Bifidobacterium ga... 132 3e-30 UniRef50_A5FR87 UPF0102 protein DehaBAV1_0707 n=5 Tax=Dehalococc... 132 3e-30 UniRef50_B3ES88 Putative uncharacterized protein n=1 Tax=Candida... 132 3e-30 UniRef50_A6LSN5 UPF0102 protein Cbei_1183 n=5 Tax=Clostridium Re... 132 4e-30 UniRef50_UPI00016929A4 hypothetical protein Plarl_14719 n=1 Tax=... 132 4e-30 UniRef50_B0C8B9 UPF0102 protein AM1_3954 n=1 Tax=Acaryochloris m... 132 4e-30 UniRef50_C7MNC2 Predicted endonuclease related to Holliday junct... 132 4e-30 UniRef50_C4K712 Putative uncharacterized protein n=1 Tax=Candida... 132 4e-30 UniRef50_A1K3T3 UPF0102 protein azo0871 n=1 Tax=Azoarcus sp. BH7... 132 4e-30 UniRef50_C4XKL5 Putative uncharacterized protein n=1 Tax=Desulfo... 132 5e-30 UniRef50_C2G0R5 Possible endonuclease n=2 Tax=Sphingobacterium s... 132 5e-30 UniRef50_C6XZ20 Putative uncharacterized protein n=2 Tax=Pedobac... 131 5e-30 UniRef50_Q0AFH8 UPF0102 protein Neut_1662 n=2 Tax=Proteobacteria... 131 5e-30 UniRef50_D0LAN4 Putative uncharacterized protein n=1 Tax=Gordoni... 131 6e-30 UniRef50_C6IVU3 Putative uncharacterized protein n=1 Tax=Paeniba... 131 6e-30 UniRef50_D2SDZ8 Putative uncharacterized protein n=1 Tax=Geoderm... 131 6e-30 UniRef50_Q15PJ2 UPF0102 protein Patl_3694 n=1 Tax=Pseudoalteromo... 131 8e-30 UniRef50_C2D6J2 Putative uncharacterized protein n=1 Tax=Atopobi... 131 8e-30 UniRef50_C9MX50 Endonuclease n=2 Tax=Leptotrichia RepID=C9MX50_9... 131 8e-30 UniRef50_UPI0001C37581 hypothetical protein RflaF_17327 n=1 Tax=... 131 8e-30 UniRef50_C9R878 Putative uncharacterized protein n=1 Tax=Ammonif... 131 8e-30 UniRef50_C7LP67 Putative uncharacterized protein n=1 Tax=Desulfo... 130 1e-29 UniRef50_D1PKZ1 Putative choloylglycine hydrolase n=1 Tax=Subdol... 130 1e-29 UniRef50_Q1LHS4 UPF0102 protein Rmet_3430 n=2 Tax=Betaproteobact... 130 1e-29 UniRef50_C9MR57 Putative endonuclease n=2 Tax=Prevotella RepID=C... 130 1e-29 UniRef50_D2RIH7 Putative uncharacterized protein n=1 Tax=Acidami... 130 1e-29 UniRef50_C8WGY4 Putative uncharacterized protein n=2 Tax=Eggerth... 130 1e-29 UniRef50_C1ZN06 Putative uncharacterized protein n=1 Tax=Plancto... 130 2e-29 UniRef50_Q3A2F1 UPF0102 protein Pcar_2217 n=2 Tax=Deltaproteobac... 130 2e-29 UniRef50_A4SC34 UPF0102 protein Cvib_0014 n=9 Tax=Chlorobiaceae ... 130 2e-29 UniRef50_A1U3H0 UPF0102 protein Maqu_2464 n=3 Tax=Marinobacter R... 130 2e-29 UniRef50_B7K4B3 Putative uncharacterized protein n=4 Tax=Cyanoba... 130 2e-29 UniRef50_C0ZFM4 Putative uncharacterized protein n=1 Tax=Breviba... 130 2e-29 UniRef50_C2HKC4 Possible endonuclease n=2 Tax=Finegoldia magna R... 130 2e-29 UniRef50_A1R7F9 UPF0102 protein AAur_2443 n=3 Tax=Micrococcaceae... 129 2e-29 UniRef50_Q8R616 UPF0102 protein FN1370 n=9 Tax=Fusobacterium Rep... 129 2e-29 UniRef50_Q1QVF6 UPF0102 protein Csal_2201 n=1 Tax=Chromohalobact... 129 3e-29 UniRef50_A3N211 UPF0102 protein APL_1363 n=33 Tax=Pasteurellacea... 129 3e-29 UniRef50_Q2JJU2 UPF0102 protein CYB_2119 n=3 Tax=Synechococcus R... 129 3e-29 UniRef50_A8UQV9 Putative uncharacterized protein n=1 Tax=Hydroge... 129 4e-29 UniRef50_Q313K2 UPF0102 protein Dde_1093 n=1 Tax=Desulfovibrio d... 129 4e-29 UniRef50_Q60CC4 UPF0102 protein MCA0184 n=1 Tax=Methylococcus ca... 129 4e-29 UniRef50_Q1YQG9 Putative uncharacterized protein n=1 Tax=gamma p... 129 4e-29 UniRef50_B0TH88 Putative uncharacterized protein n=1 Tax=Helioba... 128 5e-29 UniRef50_A5FKL6 UPF0102 protein Fjoh_1217 n=17 Tax=Bacteroidetes... 128 5e-29 UniRef50_C0VVC1 Endonuclease n=2 Tax=Corynebacterium glucuronoly... 128 5e-29 UniRef50_A1VIW8 UPF0102 protein Pnap_0271 n=10 Tax=Burkholderial... 128 6e-29 UniRef50_A5WCR1 UPF0102 protein PsycPRwf_0497 n=1 Tax=Psychrobac... 128 6e-29 UniRef50_A3DDG4 UPF0102 protein Cthe_0758 n=6 Tax=Clostridia Rep... 128 6e-29 UniRef50_B9ZKW9 Putative uncharacterized protein n=1 Tax=Thioalk... 128 6e-29 UniRef50_Q11XW1 UPF0102 protein CHU_0465 n=1 Tax=Cytophaga hutch... 128 7e-29 UniRef50_C6VV91 Putative uncharacterized protein n=2 Tax=Flexiba... 128 7e-29 UniRef50_D1SBF6 Putative uncharacterized protein n=1 Tax=Micromo... 128 7e-29 UniRef50_C1AG13 UPF0102 protein JTY_2914 n=20 Tax=Mycobacterium ... 127 7e-29 UniRef50_A6GLM3 Putative uncharacterized protein n=1 Tax=Limnoba... 127 8e-29 UniRef50_A4YJR8 UPF0102 protein BRADO0179 n=14 Tax=Rhizobiales R... 127 8e-29 UniRef50_D0GLC9 Putative uncharacterized protein n=1 Tax=Leptotr... 127 1e-28 UniRef50_Q5R0L0 UPF0102 protein IL0423 n=1 Tax=Idiomarina loihie... 127 1e-28 UniRef50_Q1D6H9 UPF0102 protein MXAN_3551 n=2 Tax=Cystobacterine... 127 1e-28 UniRef50_Q8XUC6 UPF0102 protein RSc3265 n=6 Tax=Proteobacteria R... 127 1e-28 UniRef50_A7NKS5 UPF0102 protein Rcas_2007 n=2 Tax=Roseiflexus Re... 127 1e-28 UniRef50_A4AH12 Putative uncharacterized protein n=1 Tax=marine ... 127 1e-28 UniRef50_Q4FQF2 UPF0102 protein Psyc_1908 n=2 Tax=Psychrobacter ... 127 1e-28 UniRef50_A3YDY3 Putative uncharacterized protein n=1 Tax=Marinom... 127 1e-28 UniRef50_Q146Q2 UPF0102 protein Bxeno_A0149 n=9 Tax=Burkholderia... 127 1e-28 UniRef50_D1W8G8 Putative uncharacterized protein n=1 Tax=Prevote... 127 2e-28 UniRef50_C0EXX9 Putative uncharacterized protein n=1 Tax=Eubacte... 126 2e-28 UniRef50_A0Z7D7 Putative uncharacterized protein n=1 Tax=marine ... 126 2e-28 UniRef50_C9RJM0 Putative uncharacterized protein n=1 Tax=Fibroba... 126 2e-28 UniRef50_C8PW53 Putative uncharacterized protein n=1 Tax=Enhydro... 126 2e-28 UniRef50_D2RAN4 Putative uncharacterized protein n=1 Tax=Gardner... 126 2e-28 UniRef50_A0YUK1 Putative uncharacterized protein n=2 Tax=Cyanoba... 126 2e-28 UniRef50_A6SUE7 UPF0102 protein mma_0204 n=4 Tax=Betaproteobacte... 126 3e-28 UniRef50_B8G6B1 UPF0102 protein Cagg_0930 n=3 Tax=Chloroflexus R... 126 3e-28 UniRef50_C2KRF5 Possible endonuclease n=2 Tax=Mobiluncus mulieri... 126 3e-28 UniRef50_Q8DI54 UPF0102 protein tll1737 n=1 Tax=Thermosynechococ... 125 4e-28 UniRef50_C2BVF9 Possible endonuclease n=1 Tax=Mobiluncus curtisi... 125 4e-28 UniRef50_D2MIZ7 Putative uncharacterized protein n=1 Tax=Candida... 125 4e-28 UniRef50_B1XJM9 Putative uncharacterized protein n=1 Tax=Synecho... 125 4e-28 UniRef50_C7R327 Putative uncharacterized protein n=1 Tax=Jonesia... 125 5e-28 UniRef50_C9LKQ6 Putative uncharacterized protein n=1 Tax=Prevote... 125 5e-28 UniRef50_Q2KU88 UPF0102 protein BAV3162 n=1 Tax=Bordetella avium... 125 6e-28 UniRef50_A6VXY8 UPF0102 protein Mmwyl1_2395 n=1 Tax=Marinomonas ... 125 6e-28 UniRef50_Q6A7T5 UPF0102 protein PPA1431 n=3 Tax=Propionibacteriu... 124 9e-28 UniRef50_C9PT16 Endonuclease n=5 Tax=Prevotella RepID=C9PT16_9BACT 124 9e-28 UniRef50_C7MB82 Putative uncharacterized protein n=1 Tax=Brachyb... 124 1e-27 UniRef50_B1VG84 Putative uncharacterized protein n=1 Tax=Coryneb... 124 1e-27 UniRef50_B0MPN2 Putative uncharacterized protein n=1 Tax=Eubacte... 124 1e-27 UniRef50_A0QVA9 UPF0102 protein MSMEG_2508 n=6 Tax=Corynebacteri... 124 1e-27 UniRef50_C2BNU0 Endonuclease n=2 Tax=Corynebacterium RepID=C2BNU... 124 1e-27 UniRef50_Q025A4 UPF0102 protein Acid_2433 n=1 Tax=Candidatus Sol... 123 2e-27 UniRef50_B1WNM5 Putative uncharacterized protein n=3 Tax=Chrooco... 123 2e-27 UniRef50_D1ANU1 Putative uncharacterized protein n=1 Tax=Sebalde... 123 2e-27 UniRef50_Q6AEA8 UPF0102 protein Lxx14785 n=1 Tax=Leifsonia xyli ... 123 2e-27 UniRef50_A6NUN6 Putative uncharacterized protein n=1 Tax=Bactero... 123 2e-27 UniRef50_Q6FD45 UPF0102 protein ACIAD1132 n=4 Tax=Acinetobacter ... 122 2e-27 UniRef50_Q55761 UPF0102 protein sll0189 n=1 Tax=Synechocystis sp... 122 2e-27 UniRef50_B4VZG7 Putative uncharacterized protein n=1 Tax=Microco... 122 3e-27 UniRef50_Q24UC6 UPF0102 protein DSY2577 n=2 Tax=Desulfitobacteri... 122 3e-27 UniRef50_B2S4F0 UPF0102 protein TPASS_0913 n=3 Tax=Treponema Rep... 122 3e-27 UniRef50_D1BMP5 Putative uncharacterized protein n=3 Tax=Veillon... 122 5e-27 UniRef50_A3NEP2 UPF0102 protein BURPS668_3819 n=83 Tax=Proteobac... 122 5e-27 UniRef50_A6LF20 UPF0102 protein BDI_2565 n=4 Tax=Bacteroidales R... 122 5e-27 UniRef50_Q3AC88 UPF0102 protein CHY_1414 n=1 Tax=Carboxydothermu... 121 5e-27 UniRef50_A4A5E8 Putative uncharacterized protein n=2 Tax=unclass... 121 7e-27 UniRef50_A9B5H2 UPF0102 protein Haur_0145 n=1 Tax=Herpetosiphon ... 121 7e-27 UniRef50_C2CWR1 Endonuclease n=1 Tax=Gardnerella vaginalis ATCC ... 121 8e-27 UniRef50_D1BJ87 Predicted endonuclease related to Holliday junct... 121 9e-27 UniRef50_C0BHK9 Putative uncharacterized protein n=1 Tax=Flavoba... 120 1e-26 UniRef50_A1W341 UPF0102 protein Ajs_0414 n=11 Tax=Betaproteobact... 120 1e-26 UniRef50_C0WKI9 Endonuclease n=3 Tax=Actinomycetales RepID=C0WKI... 120 1e-26 UniRef50_Q4JV13 UPF0102 protein jk1180 n=2 Tax=Corynebacterium j... 120 1e-26 UniRef50_B0MUK7 Putative uncharacterized protein n=1 Tax=Alistip... 120 2e-26 UniRef50_B4WKR8 Putative uncharacterized protein n=1 Tax=Synecho... 120 2e-26 UniRef50_UPI00017886BD protein of unknown function UPF0102 n=1 T... 120 2e-26 UniRef50_UPI0001BC5BE0 endonuclease n=3 Tax=Fusobacterium RepID=... 119 2e-26 UniRef50_C0WCB9 Endonuclease n=1 Tax=Acidaminococcus sp. D21 Rep... 119 2e-26 UniRef50_Q2NZA5 UPF0102 protein XOO3617 n=18 Tax=Xanthomonadacea... 119 3e-26 UniRef50_A5IKG8 UPF0102 protein Tpet_0671 n=6 Tax=Thermotogaceae... 119 3e-26 UniRef50_Q1IJG5 UPF0102 protein Acid345_3985 n=1 Tax=Candidatus ... 119 4e-26 UniRef50_A0LCU4 UPF0102 protein Mmc1_3298 n=1 Tax=Magnetococcus ... 119 4e-26 UniRef50_A9HJH4 UPF0102 protein GDI1964/Gdia_0189 n=1 Tax=Glucon... 118 4e-26 UniRef50_D1AZ70 Putative uncharacterized protein n=1 Tax=Sulfuro... 118 6e-26 UniRef50_UPI000197ABB4 hypothetical protein GHTCC_11038 n=2 Tax=... 118 7e-26 UniRef50_C7PDZ7 Putative uncharacterized protein n=1 Tax=Chitino... 118 7e-26 UniRef50_C7GYG3 Putative choloylglycine hydrolase n=1 Tax=Eubact... 118 7e-26 UniRef50_A4A171 Putative uncharacterized protein n=2 Tax=Plancto... 117 9e-26 UniRef50_B9CKG2 Putative uncharacterized protein n=1 Tax=Atopobi... 117 1e-25 UniRef50_C1TKL9 Predicted endonuclease related to Holliday junct... 117 1e-25 UniRef50_C0QVG4 UPF0102 protein BHWA1_02005 n=2 Tax=Brachyspira ... 117 1e-25 UniRef50_B8HA88 UPF0102 protein Achl_2213 n=1 Tax=Arthrobacter c... 117 1e-25 UniRef50_A1SLR5 UPF0102 protein Noca_3248 n=11 Tax=Actinomycetal... 117 1e-25 UniRef50_A7HN69 UPF0102 protein Fnod_1509 n=1 Tax=Fervidobacteri... 117 1e-25 UniRef50_C0E2B0 Putative uncharacterized protein n=2 Tax=Coryneb... 117 2e-25 UniRef50_A6DUI2 Endonuclease n=1 Tax=Lentisphaera araneosa HTCC2... 117 2e-25 UniRef50_A1VFE8 UPF0102 protein Dvul_2148 n=3 Tax=Desulfovibrio ... 117 2e-25 UniRef50_C6WEV0 Putative uncharacterized protein n=1 Tax=Actinos... 116 2e-25 UniRef50_C8NTW9 Choloylglycine hydrolase n=1 Tax=Corynebacterium... 116 2e-25 UniRef50_Q47S60 UPF0102 protein Tfu_0669 n=2 Tax=Nocardiopsaceae... 116 2e-25 UniRef50_A0Q0X6 UPF0102 protein NT01CX_2205 n=3 Tax=Clostridium ... 116 3e-25 UniRef50_C7H7V1 Endonuclease n=2 Tax=Faecalibacterium prausnitzi... 115 3e-25 UniRef50_B2IUS6 Putative uncharacterized protein n=4 Tax=Nostoca... 115 3e-25 UniRef50_C5CAH6 Holliday junction resolvase-like endonuclease n=... 115 3e-25 UniRef50_A0LMM6 Putative uncharacterized protein n=1 Tax=Syntrop... 115 4e-25 UniRef50_B6R5V9 Putative uncharacterized protein n=1 Tax=Pseudov... 115 4e-25 UniRef50_Q118B0 UPF0102 protein Tery_0733 n=1 Tax=Trichodesmium ... 115 4e-25 UniRef50_C0BPF0 Putative uncharacterized protein n=1 Tax=Flavoba... 115 4e-25 UniRef50_A8F4Z9 UPF0102 protein Tlet_0667 n=1 Tax=Thermotoga let... 114 7e-25 UniRef50_D2ATE9 Putative uncharacterized protein n=1 Tax=Strepto... 114 7e-25 UniRef50_B7K7K1 Putative uncharacterized protein n=1 Tax=Cyanoth... 114 8e-25 UniRef50_C6XIG7 Putative uncharacterized protein n=1 Tax=Hirschi... 114 1e-24 UniRef50_A0NXE9 Putative uncharacterized protein n=2 Tax=Labrenz... 114 1e-24 UniRef50_Q5FF38 UPF0102 protein ERGA_CDS_00540 n=5 Tax=canis gro... 114 1e-24 UniRef50_C4LJB4 Putative uncharacterized protein n=1 Tax=Coryneb... 113 2e-24 UniRef50_B2RLS5 UPF0102 protein PGN_1801 n=2 Tax=Porphyromonas g... 113 2e-24 UniRef50_C1A4D5 Putative uncharacterized protein n=1 Tax=Gemmati... 113 2e-24 UniRef50_C5CGT1 UPF0102 protein Kole_1919 n=1 Tax=Kosmotoga olea... 113 2e-24 UniRef50_A6GIE5 Putative uncharacterized protein n=1 Tax=Plesioc... 113 2e-24 UniRef50_Q0A6J0 UPF0102 protein Mlg_2205 n=2 Tax=Ectothiorhodosp... 113 2e-24 UniRef50_A8HYK3 UPF0102 protein AZC_4471 n=3 Tax=Rhizobiales Rep... 113 2e-24 UniRef50_B2UP21 Putative uncharacterized protein n=2 Tax=Verruco... 113 2e-24 UniRef50_C4KCT6 Putative uncharacterized protein n=3 Tax=Betapro... 112 2e-24 UniRef50_Q3M3N9 UPF0102 protein Ava_4800 n=2 Tax=Nostocaceae Rep... 112 3e-24 UniRef50_C7QA44 Putative uncharacterized protein n=1 Tax=Catenul... 112 3e-24 UniRef50_Q7UM23 UPF0102 protein RB9115 n=1 Tax=Rhodopirellula ba... 112 3e-24 UniRef50_D1N5L9 Putative uncharacterized protein n=1 Tax=Victiva... 112 3e-24 UniRef50_Q0BTH9 UPF0102 protein GbCGDNIH1_0975 n=1 Tax=Granuliba... 112 3e-24 UniRef50_A4CH47 Putative uncharacterized protein n=1 Tax=Robigin... 112 4e-24 UniRef50_B4U6T0 Putative uncharacterized protein n=1 Tax=Hydroge... 112 4e-24 UniRef50_D2LER1 Putative uncharacterized protein n=1 Tax=Rhodomi... 112 5e-24 UniRef50_B6BHT8 Putative uncharacterized protein n=1 Tax=Campylo... 112 5e-24 UniRef50_Q83I01 UPF0102 protein TW312 n=2 Tax=Tropheryma whipple... 111 6e-24 UniRef50_Q2IJ48 UPF0102 protein Adeh_1910 n=4 Tax=Anaeromyxobact... 111 7e-24 UniRef50_Q0F072 Putative uncharacterized protein n=1 Tax=Maripro... 110 1e-23 UniRef50_A3VPC2 Putative uncharacterized protein n=1 Tax=Parvula... 110 1e-23 UniRef50_D2NTN5 Predicted endonuclease distantly related to arch... 110 1e-23 UniRef50_D1Y396 Putative uncharacterized protein n=1 Tax=Pyramid... 110 1e-23 UniRef50_A4X4J0 UPF0102 protein Strop_1320 n=2 Tax=Micromonospor... 110 2e-23 UniRef50_A1HN64 Putative uncharacterized protein n=1 Tax=Thermos... 110 2e-23 UniRef50_C2ANQ3 Predicted endonuclease related to Holliday junct... 110 2e-23 UniRef50_UPI00019790C0 hypothetical protein HcinC1_06745 n=2 Tax... 109 2e-23 UniRef50_C3XDT2 Putative uncharacterized protein n=1 Tax=Helicob... 109 2e-23 UniRef50_C1F7M4 Putative uncharacterized protein n=1 Tax=Acidoba... 109 3e-23 UniRef50_Q6NGK0 UPF0102 protein DIP1513 n=1 Tax=Corynebacterium ... 109 3e-23 UniRef50_UPI0001C31AB5 protein of unknown function UPF0102 n=1 T... 108 4e-23 UniRef50_A8YJ85 Similar to Y189_SYNY3 UPF0102 protein sll0189 n=... 108 4e-23 UniRef50_B5JM98 Putative uncharacterized protein n=1 Tax=Verruco... 108 4e-23 UniRef50_A7HZ51 UPF0102 protein Plav_3586 n=1 Tax=Parvibaculum l... 108 5e-23 UniRef50_D2L5G2 Putative uncharacterized protein n=1 Tax=Desulfo... 108 5e-23 UniRef50_B6JM54 UPF0102 protein HPP12_0830 n=15 Tax=Epsilonprote... 108 7e-23 UniRef50_C7NGY4 Predicted endonuclease related to Holliday junct... 107 8e-23 UniRef50_D1B6E0 Putative uncharacterized protein n=1 Tax=Therman... 107 9e-23 UniRef50_C3JBE2 Putative uncharacterized protein n=2 Tax=Bacteri... 107 1e-22 UniRef50_A3JND8 Putative uncharacterized protein n=1 Tax=Rhodoba... 107 1e-22 UniRef50_D0WR56 Putative endonuclease n=1 Tax=Actinomyces sp. or... 107 1e-22 UniRef50_C3XN83 Putative uncharacterized protein n=3 Tax=Helicob... 107 1e-22 UniRef50_B9L042 Putative uncharacterized protein n=2 Tax=Thermom... 106 2e-22 UniRef50_A6FT82 PII uridylyl-transferase n=5 Tax=Rhodobacterales... 106 2e-22 UniRef50_Q5SLC1 UPF0102 protein TTHA0372 n=5 Tax=Thermaceae RepI... 105 3e-22 UniRef50_C3PH56 Putative uncharacterized protein n=1 Tax=Coryneb... 105 6e-22 UniRef50_Q2RJT6 UPF0102 protein Moth_0988 n=2 Tax=Clostridia Rep... 105 6e-22 UniRef50_C4DPS8 Predicted endonuclease related to Holliday junct... 104 7e-22 UniRef50_O66457 UPF0102 protein aq_041 n=1 Tax=Aquifex aeolicus ... 104 7e-22 UniRef50_A9BFT1 UPF0102 protein Pmob_0702 n=1 Tax=Petrotoga mobi... 104 8e-22 UniRef50_Q2KDE4 UPF0102 protein RHE_CH00320 n=4 Tax=Rhizobiales ... 104 9e-22 UniRef50_A9IXC8 UPF0102 protein BT_1882 n=8 Tax=Rhizobiales RepI... 104 1e-21 UniRef50_B2IFF3 Putative uncharacterized protein n=1 Tax=Beijeri... 103 2e-21 UniRef50_B2GFY9 Putative uncharacterized protein n=1 Tax=Kocuria... 103 2e-21 UniRef50_C0XSA5 Endonuclease n=1 Tax=Corynebacterium lipophilofl... 102 3e-21 UniRef50_C7JH91 Putative uncharacterized protein n=8 Tax=Acetoba... 102 3e-21 UniRef50_A3K994 Putative uncharacterized protein n=7 Tax=Rhodoba... 102 3e-21 UniRef50_A3UIK6 Putative uncharacterized protein n=1 Tax=Oceanic... 102 5e-21 UniRef50_B0P7L1 Putative uncharacterized protein n=1 Tax=Anaerot... 102 5e-21 UniRef50_A4EVA8 Putative uncharacterized protein n=1 Tax=Roseoba... 101 7e-21 UniRef50_Q7U7D4 UPF0102 protein SYNW1051 n=3 Tax=Synechococcus R... 101 8e-21 UniRef50_A1B931 Putative uncharacterized protein n=1 Tax=Paracoc... 101 8e-21 UniRef50_B3T5F3 Putative uncharacterized protein family UPF0102 ... 101 8e-21 UniRef50_A6Q6T2 UPF0102 protein SUN_0231 n=3 Tax=Epsilonproteoba... 101 8e-21 UniRef50_C9M9E0 Putative uncharacterized protein n=1 Tax=Jonquet... 100 1e-20 UniRef50_Q04SX0 UPF0102 protein LBJ_1427 n=4 Tax=Leptospira RepI... 100 1e-20 UniRef50_Q0C451 UPF0102 protein HNE_0764 n=1 Tax=Hyphomonas nept... 100 1e-20 UniRef50_Q31RH5 UPF0102 protein Synpcc7942_0312 n=2 Tax=Synechoc... 100 1e-20 UniRef50_A4QF37 UPF0102 protein cgR_1859 n=4 Tax=Corynebacterium... 100 1e-20 UniRef50_B5Y8F8 Putative uncharacterized protein n=1 Tax=Coproth... 100 2e-20 UniRef50_C5BWW3 UPF0102 protein Bcav_2532 n=21 Tax=Actinomycetal... 100 2e-20 UniRef50_D1NDV3 Fimbrial usher protein (Fragment) n=1 Tax=Haemop... 100 2e-20 UniRef50_A9I0M2 UPF0102 protein Bpet0439 n=14 Tax=Proteobacteria... 100 2e-20 UniRef50_B2S8H0 UPF0102 protein BAbS19_I01690 n=50 Tax=Rhizobial... 100 3e-20 UniRef50_B0T377 UPF0102 protein Caul_0175 n=3 Tax=Caulobacterace... 99 3e-20 UniRef50_A4ECJ4 Putative uncharacterized protein n=1 Tax=Collins... 99 3e-20 UniRef50_A8U078 Predicted endonuclease n=1 Tax=alpha proteobacte... 99 3e-20 UniRef50_Q0AK98 UPF0102 protein Mmar10_3014 n=1 Tax=Maricaulis m... 99 4e-20 UniRef50_Q2VYL8 UPF0102 protein amb4503 n=5 Tax=Alphaproteobacte... 98 7e-20 UniRef50_C6QFU1 Putative uncharacterized protein n=1 Tax=Hyphomi... 97 1e-19 UniRef50_Q16B02 UPF0102 protein RD1_1191 n=1 Tax=Roseobacter den... 97 2e-19 UniRef50_Q0FQ74 Putative uncharacterized protein n=1 Tax=Roseova... 97 2e-19 UniRef50_A3TTV9 Putative uncharacterized protein n=1 Tax=Oceanic... 96 3e-19 UniRef50_UPI00016C4BC8 hypothetical protein GobsU_17186 n=1 Tax=... 96 4e-19 UniRef50_B9KH25 Putative uncharacterized protein n=3 Tax=Anaplas... 96 5e-19 UniRef50_A7ZB75 UPF0102 protein Ccon26_01140 n=23 Tax=Epsilonpro... 95 7e-19 UniRef50_C2M936 Putative uncharacterized protein n=1 Tax=Porphyr... 95 7e-19 UniRef50_B3CRA6 Putative uncharacterized protein n=2 Tax=Orienti... 94 1e-18 UniRef50_B1ZZB1 Putative uncharacterized protein n=2 Tax=Opituta... 94 1e-18 UniRef50_A5V3S4 UPF0102 protein Swit_0572 n=4 Tax=Sphingomonadac... 94 2e-18 UniRef50_Q7V7V8 UPF0102 protein PMT_0624 n=2 Tax=Prochlorococcus... 94 2e-18 UniRef50_A8ERF6 UPF0102 protein Abu_0255 n=2 Tax=Campylobacteral... 93 2e-18 UniRef50_C1CZ90 UPF0102 protein Deide_03080 n=3 Tax=Deinococcus ... 93 2e-18 UniRef50_C0W187 Putative uncharacterized protein n=1 Tax=Actinom... 93 3e-18 UniRef50_A4E8G7 Putative uncharacterized protein n=1 Tax=Collins... 92 4e-18 UniRef50_A7BDE4 Putative uncharacterized protein n=1 Tax=Actinom... 92 5e-18 UniRef50_A4WPR4 UPF0102 protein Rsph17025_0472 n=7 Tax=Rhodobact... 92 6e-18 UniRef50_B8GXN3 UPF0102 protein CCNA_00142 n=4 Tax=Caulobacterac... 91 1e-17 UniRef50_Q28TZ5 Putative uncharacterized protein n=1 Tax=Jannasc... 90 2e-17 UniRef50_Q2GDU7 Putative uncharacterized protein n=1 Tax=Neorick... 90 2e-17 UniRef50_A5GTL0 Restriction endonuclease-like n=1 Tax=Synechococ... 90 2e-17 UniRef50_Q7NEX4 UPF0102 protein gll3754 n=1 Tax=Gloeobacter viol... 90 2e-17 UniRef50_B4CV56 Putative uncharacterized protein n=1 Tax=Chthoni... 90 2e-17 UniRef50_Q3J5H3 UPF0102 protein RHOS4_03930 n=7 Tax=Rhodobactera... 89 3e-17 UniRef50_C6V4V5 Putative uncharacterized protein n=1 Tax=Neorick... 89 4e-17 UniRef50_A2VU88 Putative uncharacterized protein n=1 Tax=Burkhol... 89 6e-17 UniRef50_B0S8P6 Endonuclease n=2 Tax=Leptospira biflexa serovar ... 89 6e-17 UniRef50_C4YXH4 Protein Mlr4633 n=1 Tax=Rickettsia endosymbiont ... 88 7e-17 UniRef50_C8WWC7 Putative uncharacterized protein n=1 Tax=Alicycl... 88 1e-16 UniRef50_C7N7P3 Predicted endonuclease related to Holliday junct... 87 2e-16 UniRef50_Q1GJI4 UPF0102 protein TM1040_0449 n=9 Tax=Rhodobactera... 87 2e-16 UniRef50_A5G0S4 UPF0102 protein Acry_2261 n=1 Tax=Acidiphilium c... 86 2e-16 UniRef50_A5KGE0 Putative uncharacterized protein n=1 Tax=Campylo... 85 5e-16 UniRef50_A5GLH9 Restriction endonuclease-like n=5 Tax=Chroococca... 85 7e-16 UniRef50_A8LJ68 UPF0102 protein Dshi_2830 n=1 Tax=Dinoroseobacte... 85 9e-16 UniRef50_D1ATK0 Putative uncharacterized protein n=1 Tax=Anaplas... 83 4e-15 UniRef50_B1M445 UPF0102 protein Mrad2831_2938 n=10 Tax=Alphaprot... 82 4e-15 UniRef50_A9GEX7 UPF0102 protein sce2912 n=1 Tax=Sorangium cellul... 80 2e-14 UniRef50_B6IVS9 Putative uncharacterized protein n=1 Tax=Rhodosp... 80 2e-14 UniRef50_C8WHA2 Putative uncharacterized protein n=1 Tax=Eggerth... 79 3e-14 UniRef50_B3DVH2 Predicted endonuclease n=2 Tax=Verrucomicrobia R... 79 3e-14 UniRef50_C0W7A2 Putative uncharacterized protein (Fragment) n=1 ... 79 5e-14 UniRef50_UPI000190D97F hypothetical protein SentesTyp_00923 n=1 ... 79 7e-14 UniRef50_Q0I9S1 Uncharacterised protein family protein n=1 Tax=S... 78 9e-14 UniRef50_B9XLY1 Putative uncharacterized protein n=1 Tax=bacteri... 77 1e-13 UniRef50_Q1GWI7 UPF0102 protein Sala_0262 n=5 Tax=Sphingomonadal... 73 3e-12 UniRef50_C0BCN9 Putative uncharacterized protein n=1 Tax=Coproco... 70 2e-11 UniRef50_Q4E9U1 Endonuclease (Fragment) n=5 Tax=Wolbachia RepID=... 70 2e-11 UniRef50_C8W847 Putative uncharacterized protein n=1 Tax=Atopobi... 69 5e-11 UniRef50_Q73VP8 Putative uncharacterized protein n=1 Tax=Mycobac... 64 1e-09 UniRef50_A8A9D3 Putative uncharacterized protein n=1 Tax=Ignicoc... 63 2e-09 UniRef50_C7N801 Predicted endonuclease related to Holliday junct... 62 5e-09 UniRef50_Q3AKE1 Uncharacterised protein family UPF0102 n=2 Tax=C... 62 8e-09 UniRef50_Q8W6V7 Putative uncharacterized protein n=1 Tax=Synecho... 60 3e-08 UniRef50_Q8TW03 Predicted endonuclease of the RecB family n=1 Ta... 59 6e-08 UniRef50_A6DBD4 Putative uncharacterized protein (Fragment) n=1 ... 59 6e-08 UniRef50_C5U625 Putative uncharacterized protein n=1 Tax=Methano... 56 3e-07 UniRef50_Q5GSW9 RecB family endonuclease n=1 Tax=Wolbachia endos... 54 1e-06 UniRef50_A4GJ57 Putative uncharacterized protein n=1 Tax=uncultu... 54 1e-06 UniRef50_UPI0001699F06 hypothetical protein Epers_29808 n=1 Tax=... 54 1e-06 UniRef50_O33024 UPF0102 protein ML1607 n=2 Tax=Mycobacterium lep... 54 2e-06 UniRef50_Q5NWY8 Putative uncharacterized protein n=1 Tax=Aromato... 54 2e-06 UniRef50_Q3BT99 Putative uncharacterized protein n=2 Tax=Xanthom... 52 8e-06 UniRef50_C1E903 Predicted protein n=1 Tax=Micromonas sp. RCC299 ... 51 1e-05 UniRef50_Q6MLA1 Putative uncharacterized protein n=1 Tax=Bdellov... 51 1e-05 UniRef50_Q9Y9F5 Putative uncharacterized protein n=1 Tax=Aeropyr... 48 1e-04 Sequences not found previously or not previously below threshold: UniRef50_D0CIJ2 Putative uncharacterized protein n=2 Tax=Bacteri... 44 0.002 UniRef50_Q7VH71 Putative uncharacterized protein n=1 Tax=Helicob... 43 0.003 UniRef50_B5IHF1 ATPase n=1 Tax=Aciduliprofundum boonei T469 RepI... 43 0.003 UniRef50_D1TJS5 Putative uncharacterized protein n=1 Tax=Burkhol... 43 0.004 UniRef50_B7KKS4 Putative uncharacterized protein n=1 Tax=Cyanoth... 42 0.005 UniRef50_A2BIV6 Endonuclease of RecB family n=1 Tax=Hyperthermus... 42 0.006 UniRef50_A3DLW4 Endonuclease (RecB family)-like protein n=1 Tax=... 42 0.006 UniRef50_B4S2M3 Putative transmembrane protein n=1 Tax=Alteromon... 42 0.006 UniRef50_A7VFU4 Putative uncharacterized protein n=6 Tax=Bacteri... 42 0.008 UniRef50_A8MBK5 Putative uncharacterized protein n=1 Tax=Caldivi... 41 0.011 UniRef50_B1L6B5 Putative uncharacterized protein n=1 Tax=Candida... 41 0.011 UniRef50_D2LQ48 DUF234 DEXX-box ATPase n=3 Tax=Aciduliprofundum ... 41 0.012 UniRef50_Q2FT66 Putative uncharacterized protein n=1 Tax=Methano... 41 0.013 UniRef50_C4F8E7 Putative uncharacterized protein n=1 Tax=Collins... 41 0.014 UniRef50_A9KZ92 Putative uncharacterized protein n=2 Tax=Shewane... 40 0.017 UniRef50_Q50I46 Putative holliday junction resolvase n=1 Tax=Aci... 40 0.018 UniRef50_A1RWI2 Putative uncharacterized protein n=1 Tax=Thermof... 40 0.024 UniRef50_C5A3A3 Prokaryotic ATPase, AAA superfamily n=1 Tax=Ther... 40 0.024 UniRef50_Q3BXS5 Putative uncharacterized protein n=1 Tax=Xanthom... 40 0.028 UniRef50_C8SAI5 Restriction endonuclease n=1 Tax=Ferroglobus pla... 40 0.032 UniRef50_C1MSX7 Predicted protein n=1 Tax=Micromonas pusilla CCM... 40 0.033 UniRef50_A7H236 Putative uncharacterized protein n=1 Tax=Campylo... 40 0.033 UniRef50_C6JN47 Helicase n=7 Tax=Bacteria RepID=C6JN47_FUSVA 40 0.034 UniRef50_B0MYF3 Putative uncharacterized protein n=1 Tax=Alistip... 39 0.037 UniRef50_Q05TI5 Putative uncharacterized protein n=1 Tax=Synecho... 39 0.038 UniRef50_UPI0001699F07 LppC putative lipoprotein n=1 Tax=Endorif... 39 0.046 UniRef50_A3CY54 Restriction endonuclease n=1 Tax=Methanoculleus ... 39 0.050 UniRef50_D2Q690 Putative uncharacterized protein n=6 Tax=Bacteri... 39 0.065 UniRef50_B8D4V0 Endonuclease (RecB family)-like protein n=1 Tax=... 39 0.068 UniRef50_C3XFJ6 Putative uncharacterized protein n=1 Tax=Helicob... 38 0.080 UniRef50_B8D4V8 Predicted transcriptional regulator n=1 Tax=Desu... 38 0.086 UniRef50_B7AVX2 Putative uncharacterized protein n=1 Tax=Bactero... 38 0.090 UniRef50_A8MAQ2 Putative uncharacterized protein n=1 Tax=Caldivi... 38 0.090 CONVERGED! >UniRef50_A4WEW3 UPF0102 protein Ent638_3585 n=124 Tax=Enterobacteriaceae RepID=Y3585_ENT38 Length = 131 Score = 164 bits (416), Expect = 8e-40, Method: Composition-based stats. Identities = 93/131 (70%), Positives = 115/131 (87%) Query: 1 MATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTT 60 MA +P + P +L+ KQTGDAWE +ARRWLEGKGLRFIAANV+ RGGEIDLIM++G+ Sbjct: 1 MAQIPAGADRPGKLSRKQTGDAWELKARRWLEGKGLRFIAANVHGRGGEIDLIMKDGQVI 60 Query: 61 IFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEV 120 +F+EVR+R+S+ +GGAAASVT +KQHKLLQTA LWLARHNGSFDTVDCRFDVVAFTGN++ Sbjct: 61 VFIEVRFRQSSRFGGAAASVTLAKQHKLLQTAHLWLARHNGSFDTVDCRFDVVAFTGNDI 120 Query: 121 EWIKDAFNDHS 131 EW+K+AF + + Sbjct: 121 EWLKNAFGEDA 131 >UniRef50_D1P1S8 Putative choloylglycine hydrolase n=5 Tax=Enterobacteriaceae RepID=D1P1S8_9ENTR Length = 128 Score = 159 bits (402), Expect = 4e-38, Method: Composition-based stats. Identities = 56/110 (50%), Positives = 76/110 (69%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAA 78 TG +E QA +L+ +GL IA NV R GEIDLIMR+G +FVEVR+R+++ YG A Sbjct: 12 TGRHYENQALAYLQQQGLTLIARNVRCRMGEIDLIMRDGTVLVFVEVRFRKNSDYGNALL 71 Query: 79 SVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAFN 128 SV K+ K+L TA+ WLA+ SF+T CRFD+ A TG + EW+++AFN Sbjct: 72 SVNWHKRRKILATAQYWLAQRQQSFETTPCRFDIYAITGKQFEWVQNAFN 121 >UniRef50_C4ZD68 Putative uncharacterized protein n=1 Tax=Eubacterium rectale ATCC 33656 RepID=C4ZD68_EUBR3 Length = 122 Score = 157 bits (397), Expect = 1e-37, Method: Composition-based stats. Identities = 43/115 (37%), Positives = 60/115 (52%), Gaps = 1/115 (0%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + + G +E A L G +A N R GEID+I ++G T F EV+YRR Sbjct: 7 MNKRSVGSIYEQLAAEQLINMGYSVLACNYRNRFGEIDIIAKDGDTICFCEVKYRRDNGC 66 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAFN 128 G A +V SKQ K++ AR +L +H CRFDV+A +EV +K+AF Sbjct: 67 GRALEAVGYSKQKKIISVARYYLMKHGLDEW-TPCRFDVIAVDDDEVTVLKNAFE 120 >UniRef50_A5KJ12 Putative uncharacterized protein n=2 Tax=Clostridiales RepID=A5KJ12_9FIRM Length = 120 Score = 154 bits (391), Expect = 7e-37, Method: Composition-based stats. Identities = 38/118 (32%), Positives = 57/118 (48%), Gaps = 2/118 (1%) Query: 10 SPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRR 69 + + G +E A +LE G I N R GEID+I ++G +F EV+YR Sbjct: 3 RNVKQNNRSVGAVYEQAAGYYLEQNGYELIEYNYRCRDGEIDIIAKDGDCYVFCEVKYRS 62 Query: 70 SALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAF 127 G +V + KQ K+ + A + +H + CRFDV+ G E+ IK+AF Sbjct: 63 GRQAGNPLEAVDQRKQKKIFRCALYYTVQHGI--EDAQCRFDVIGVEGTEITHIKNAF 118 >UniRef50_A8SP33 Putative uncharacterized protein n=2 Tax=Clostridiales RepID=A8SP33_9FIRM Length = 127 Score = 154 bits (389), Expect = 1e-36, Method: Composition-based stats. Identities = 35/121 (28%), Positives = 66/121 (54%), Gaps = 1/121 (0%) Query: 8 SGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRY 67 S + +++TG +E A +L+ G +A N GEID++ + +FVEV++ Sbjct: 4 SAVLNEYNSRRTGSEYETAACDYLKNCGYDILARNYRVSAGEIDIVAQSDGYIVFVEVKF 63 Query: 68 RRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAF 127 R + G A+ +V KQ ++ + A +L ++ + V RFDV+ +GNE+ I++A+ Sbjct: 64 RSNTHMGAASEAVDHRKQKRISKAALYFLKQYGYGVE-VPVRFDVITVSGNEITHIENAY 122 Query: 128 N 128 + Sbjct: 123 D 123 >UniRef50_A9KLL5 UPF0102 protein Cphy_2398 n=3 Tax=Clostridiales RepID=Y2398_CLOPH Length = 118 Score = 154 bits (389), Expect = 1e-36, Method: Composition-based stats. Identities = 39/117 (33%), Positives = 61/117 (52%), Gaps = 1/117 (0%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + K G E +A +L +G + +A N R GEID++ RE +FVEV+YR + Sbjct: 1 MNKKVEGLTKETEAANYLSEQGYQILARNYRCRLGEIDIVARENGYLVFVEVKYRTNVEK 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAFNDH 130 G ++T KQ ++ TA+ +L + CRFDVV E+ IK+AF+ + Sbjct: 61 GFPEEAITIQKQRRITNTAKYYLLVNRLPES-TPCRFDVVVMLKEEIRLIKNAFDAY 116 >UniRef50_C1SJ30 Putative uncharacterized protein n=1 Tax=Denitrovibrio acetiphilus DSM 12809 RepID=C1SJ30_9BACT Length = 111 Score = 152 bits (385), Expect = 3e-36, Method: Composition-based stats. Identities = 36/113 (31%), Positives = 59/113 (52%), Gaps = 4/113 (3%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 G E +A +LE +G + N + GEID+I + IFVEV+ R + +G Sbjct: 2 RLLFGKKGEKKAACFLEKQGYAIVEMNYRCKFGEIDIIAEKNGVLIFVEVKTRSTDKFGL 61 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAFN 128 SVT SKQ KL +TA+ ++ + + +FDV++ G+ + I +AF+ Sbjct: 62 GYESVTLSKQQKLFKTAQHYMVENG----EMPAQFDVISIDGDTLTHIPNAFS 110 >UniRef50_B5YIA9 UPF0102 protein THEYE_A1950 n=1 Tax=Thermodesulfovibrio yellowstonii DSM 11347 RepID=Y1950_THEYD Length = 112 Score = 152 bits (385), Expect = 4e-36, Method: Composition-based stats. Identities = 34/114 (29%), Positives = 57/114 (50%), Gaps = 3/114 (2%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + + G E A +L KG + + N GEID+I ++G + +EV+ R S + Sbjct: 1 MARIELGKEGEKLAIDYLLTKGYKILEKNFRTPFGEIDIIAKDGNFIVIIEVKRRLSDKF 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAF 127 G SV +KQ KL + A +++ + RFDV+A ++E I++AF Sbjct: 61 GKPELSVNYTKQQKLKKLALYYISMLKKEY---PVRFDVIAINDKKIEHIENAF 111 >UniRef50_Q0AWX4 UPF0102 protein Swol_1475 n=1 Tax=Syntrophomonas wolfei subsp. wolfei str. Goettingen RepID=Y1475_SYNWW Length = 115 Score = 152 bits (384), Expect = 4e-36, Method: Composition-based stats. Identities = 35/116 (30%), Positives = 56/116 (48%), Gaps = 6/116 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 ++ G E A ++L KG + + N + R GE+DL+ + +FVEV+ RRS +G Sbjct: 2 NRELGLWGEELAAQYLRKKGYKILERNFHTRYGELDLVCEKDDNIVFVEVKTRRSTRFGS 61 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN----EVEWIKDAF 127 +VT K L + A L+L F + FDVV+ ++ I +AF Sbjct: 62 PEEAVTPRKIGNLKKAAILYLKSTPRFF--PEISFDVVSILVEDGKSKINHIINAF 115 >UniRef50_UPI0001973CC8 hypothetical protein ClM62_15129 n=1 Tax=Clostridium sp. M62/1 RepID=UPI0001973CC8 Length = 126 Score = 151 bits (383), Expect = 6e-36, Method: Composition-based stats. Identities = 46/127 (36%), Positives = 66/127 (51%), Gaps = 1/127 (0%) Query: 1 MATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTT 60 M R G + +T+ G +E A +LE +G + N R GEID+I REG T Sbjct: 1 MQEEKRRKGPAGRKSTRARGARYEDLAAAFLEKQGYVILEKNFFCRTGEIDIIAREGDTL 60 Query: 61 IFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEV 120 +FVEV+YR+ G A +V KQ K+ + A +L + CRFDVVA G++ Sbjct: 61 VFVEVKYRKDLAAGDPAEAVNERKQEKIRKAAAFYLYARGLPPEQ-PCRFDVVAILGSDF 119 Query: 121 EWIKDAF 127 ++DAF Sbjct: 120 RLLRDAF 126 >UniRef50_A8MHC8 UPF0102 protein Clos_1471 n=8 Tax=Clostridiaceae RepID=Y1471_ALKOO Length = 117 Score = 151 bits (383), Expect = 6e-36, Method: Composition-based stats. Identities = 34/116 (29%), Positives = 57/116 (49%), Gaps = 4/116 (3%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 K+ G E A +L+ KG R + N R GEID+I T +FVEV+ R S +G Sbjct: 2 NKKIGAIGEQLAVHYLKNKGYRILDCNYRTRLGEIDIIAILNDTIVFVEVKTRSSGAFGT 61 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF----TGNEVEWIKDAF 127 + +V KQ + + ++ +L + D + RFDV+ ++ +++AF Sbjct: 62 PSEAVNYKKQMTIRRVSQQYLLSNRIGEDDWNLRFDVIEVQLIEKKYKINHMENAF 117 >UniRef50_Q1JX98 Putative uncharacterized protein n=1 Tax=Desulfuromonas acetoxidans DSM 684 RepID=Q1JX98_DESAC Length = 121 Score = 151 bits (382), Expect = 7e-36, Method: Composition-based stats. Identities = 42/122 (34%), Positives = 59/122 (48%), Gaps = 6/122 (4%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 G E QA +L + R + N GEIDLI+R G+T FVEV+ R+S Sbjct: 2 TQQRLTLGRWGEQQAADYLRRRLYRIVTCNYRCHYGEIDLIVRRGKTLAFVEVKTRKSRC 61 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN----EVEWIKDAFN 128 YG +VT KQ +++ TA+ +L S RFDV+A + ++ I DAF Sbjct: 62 YGTPQEAVTPRKQQQIIATAQHYLTTQQPSTQ--TVRFDVIAINVDGDKTQINHIVDAFE 119 Query: 129 DH 130 H Sbjct: 120 LH 121 >UniRef50_B0P3K1 Putative uncharacterized protein n=2 Tax=Clostridiales RepID=B0P3K1_9CLOT Length = 115 Score = 150 bits (380), Expect = 1e-35, Method: Composition-based stats. Identities = 41/115 (35%), Positives = 68/115 (59%), Gaps = 1/115 (0%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + ++TG EA A +L+ +G + N + GEID+I +E +T +FVEV+YR+ Sbjct: 2 KKNNRETGAKAEAIACWFLKQQGYDVLEQNFYTKVGEIDIIAKEDQTLVFVEVKYRKDDK 61 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAF 127 G A +V + KQ K+ ++A ++L +++ SF+ RFDVV G ++ IK AF Sbjct: 62 KGYPAQAVDQRKQQKIRKSAMIYLKKNHLSFEQ-PIRFDVVEILGKKIRVIKHAF 115 >UniRef50_C7R9K3 Putative uncharacterized protein n=1 Tax=Kangiella koreensis DSM 16069 RepID=C7R9K3_KANKD Length = 122 Score = 150 bits (380), Expect = 1e-35, Method: Composition-based stats. Identities = 50/122 (40%), Positives = 72/122 (59%), Gaps = 5/122 (4%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 ++T+Q GD E A +L+ +GL + N N R GEIDLIM + +FVEVR+R + Y Sbjct: 1 MSTRQRGDHVELFAESYLKKQGLTLVEKNFNSRFGEIDLIMLDKSALVFVEVRFRANTSY 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE----VEWIKDAFND 129 G A +V KQ K+++TA+L+L + DCRFDVV+ T + +EW K+AF Sbjct: 61 GSGAETVNFRKQQKIIKTAQLYLQANK-KMQQRDCRFDVVSVTLSAQEPLIEWHKNAFQA 119 Query: 130 HS 131 S Sbjct: 120 PS 121 >UniRef50_A6FEY4 Putative uncharacterized protein n=1 Tax=Moritella sp. PE36 RepID=A6FEY4_9GAMM Length = 135 Score = 150 bits (380), Expect = 1e-35, Method: Composition-based stats. Identities = 47/134 (35%), Positives = 75/134 (55%), Gaps = 13/134 (9%) Query: 7 RSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGR-------- 58 R + + ++ G+ +E A +L+ +GL +A N R GEIDLI + G Sbjct: 2 RKATLNRKQPRKRGEYFEGIAAEFLQRQGLIILARNFACRQGEIDLICQHGASCDIKSST 61 Query: 59 ---TTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF 115 T +FVEV+YR+ YGGA +++ +KQ KL TA+ ++ RH + + CRFDV+A Sbjct: 62 TLPTLVFVEVKYRQYTHYGGAISAIPVAKQRKLRYTAQYYMVRHGINENYTPCRFDVIAI 121 Query: 116 TG--NEVEWIKDAF 127 G + ++WI +AF Sbjct: 122 EGCSDNIQWITNAF 135 >UniRef50_C4Z0D1 Putative endonuclease n=13 Tax=Clostridiales RepID=C4Z0D1_EUBE2 Length = 115 Score = 150 bits (379), Expect = 1e-35, Method: Composition-based stats. Identities = 38/113 (33%), Positives = 57/113 (50%), Gaps = 1/113 (0%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 + TG E A R+L G + N + GEID+I ++ +FVEV+YR + YG Sbjct: 4 NKRATGADKEQLAARYLVDNGYTVLERNFRNKTGEIDIIAKKDNYIVFVEVKYRSNNKYG 63 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAF 127 A +V KQ + + A+ ++ S + CRFDV+ G V IK+AF Sbjct: 64 YAVEAVNYRKQQIIRRVAQFYITTRYKS-CDIPCRFDVIGIDGETVTHIKNAF 115 >UniRef50_C4V2D8 Endonuclease n=1 Tax=Selenomonas flueggei ATCC 43531 RepID=C4V2D8_9FIRM Length = 118 Score = 150 bits (379), Expect = 2e-35, Method: Composition-based stats. Identities = 43/119 (36%), Positives = 61/119 (51%), Gaps = 6/119 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 ++ K GD EA A R+L +G R +A + GEIDLI ++ T +FVEV+ RRS Sbjct: 1 MSNKVLGDRGEACAARYLGAQGYRILAQKYRTKTGEIDLIAKDHDTLVFVEVKTRRSVRC 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG----NEVEWIKDAFN 128 G A +V KQ +++QTA L+L CRFD+V + +K AF Sbjct: 61 GLPAEAVNYRKQRRIIQTAMLYLCEKQMD--QTPCRFDIVEVYAAGSEWRIHHLKGAFE 117 >UniRef50_C4L9F6 Putative uncharacterized protein n=1 Tax=Tolumonas auensis DSM 9187 RepID=C4L9F6_TOLAT Length = 124 Score = 149 bits (378), Expect = 2e-35, Method: Composition-based stats. Identities = 52/113 (46%), Positives = 74/113 (65%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 + G +E QAR +LE +GL F+ AN + R GE+DLIMRE T +F+EVR+R S YG Sbjct: 12 NRRSKGQHYEQQARCFLEQQGLLFVCANYHCRQGELDLIMRERDTLVFIEVRFRASRDYG 71 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAF 127 GA +SVT +KQHK+ TAR +L + + CRFD+V++ + W+K+AF Sbjct: 72 GALSSVTPAKQHKIRHTARYYLMSQHINEAHQACRFDIVSYDDGQCSWLKNAF 124 >UniRef50_C0QTY9 UPF0102 protein PERMA_0362 n=2 Tax=Hydrogenothermaceae RepID=Y362_PERMH Length = 116 Score = 149 bits (378), Expect = 2e-35, Method: Composition-based stats. Identities = 35/112 (31%), Positives = 56/112 (50%), Gaps = 2/112 (1%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAA 78 G E +A +L G R + N R GEID+I + T + VEVR + S YG Sbjct: 6 KGKEGEDKAVEYLRNSGYRILERNFRSRFGEIDIIAEDNGTIVIVEVRSKGSTGYGYPEE 65 Query: 79 SVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAFNDH 130 S+ K K+++TA+ +L + + RFD+++ N + IK+AF+ Sbjct: 66 SIDHKKVRKIIKTAQFYLLKRDIKG--KQVRFDIISIVNNNIFHIKNAFDLD 115 >UniRef50_Q5ZR89 UPF0102 protein lpg2994 n=4 Tax=Legionella RepID=Y2994_LEGPH Length = 118 Score = 149 bits (378), Expect = 2e-35, Method: Composition-based stats. Identities = 43/116 (37%), Positives = 68/116 (58%), Gaps = 3/116 (2%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 T++ G E A +L+ GL + N + R GEIDLIMREG +FVEVR R + +GG Sbjct: 2 TQEKGKFAEQLALNYLKENGLALVMQNYHCRLGEIDLIMREGSYLVFVEVRSRSNMNFGG 61 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG--NEVEWIKDAFND 129 AS+T K+ K+++ ++ ++ D RFDV++ G N++ W+K+AF+ Sbjct: 62 GLASITYEKKQKIIKATSHYMIKYRIQ-DKFPIRFDVISIDGKSNKITWLKNAFDA 116 >UniRef50_A0KPY4 UPF0102 protein AHA_3896 n=4 Tax=Proteobacteria RepID=Y3896_AERHH Length = 130 Score = 149 bits (377), Expect = 3e-35, Method: Composition-based stats. Identities = 56/109 (51%), Positives = 76/109 (69%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAA 78 G +E A RWL+ +GL+ + N RGGEIDLIMR+G T +FVEVRYR +GGAAA Sbjct: 22 KGQHFEQLAERWLQARGLQPVTRNYRCRGGEIDLIMRQGETLVFVEVRYRSQTSHGGAAA 81 Query: 79 SVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAF 127 SVTR KQHK++ AR + +H + + CRFDV+AF G++ +WI++AF Sbjct: 82 SVTRCKQHKIVLAARHYFKQHAINEASQACRFDVIAFEGDQPDWIQNAF 130 >UniRef50_Q3JE65 UPF0102 protein Noc_0355 n=2 Tax=Nitrosococcus oceani RepID=Y355_NITOC Length = 124 Score = 148 bits (375), Expect = 4e-35, Method: Composition-based stats. Identities = 45/121 (37%), Positives = 67/121 (55%), Gaps = 5/121 (4%) Query: 12 RQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSA 71 + T + G+ E A +L+ +GLR N + R GEIDLIM + + +F+EVRYRR Sbjct: 2 KPATHRDKGEQAEQLACHYLQARGLRLTQRNYHCRLGEIDLIMEDRESLVFIEVRYRRKG 61 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT----GNEVEWIKDAF 127 +G A S+T +KQ +L+ A+ +L R CRFDVV T + + W++DAF Sbjct: 62 RFGDAIDSITPAKQARLIAAAQHYLQRTG-GAQNKPCRFDVVGITSEKGADNIMWLRDAF 120 Query: 128 N 128 Sbjct: 121 R 121 >UniRef50_Q6AJE4 UPF0102 protein DP2807 n=1 Tax=Desulfotalea psychrophila RepID=Y2807_DESPS Length = 128 Score = 148 bits (374), Expect = 5e-35, Method: Composition-based stats. Identities = 39/118 (33%), Positives = 63/118 (53%), Gaps = 7/118 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 K+ G E A R+L+ +G + N ++ GEID+I +EG +FVEV+ R ++ +G Sbjct: 5 RKKKGAEGEYLACRFLKKQGYVILQKNYRKKYGEIDIIAQEGGDLVFVEVKTRSNSDWGS 64 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-----VEWIKDAFN 128 A+VT+ KQ K+++ A+ +LA RFDV+ +E E I +AF Sbjct: 65 PVAAVTKQKQRKIIRVAQTYLAE--TELFDEAIRFDVIGIILDENSPPIFELIHNAFE 120 >UniRef50_D1RD77 Putative uncharacterized protein n=1 Tax=Legionella longbeachae D-4968 RepID=D1RD77_LEGLO Length = 119 Score = 148 bits (374), Expect = 6e-35, Method: Composition-based stats. Identities = 45/119 (37%), Positives = 68/119 (57%), Gaps = 3/119 (2%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + T++ G E +A L +GL+ + N R GEIDLIM + +F+EVR R S + Sbjct: 1 MRTQEKGRVAEEKALAHLTKQGLKLVMKNYRCRFGEIDLIMYDKDYLVFIEVRSRVSNQF 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN--EVEWIKDAFNDH 130 GG +SVT +K+ K+L+TA ++ H ++ RFDVV+ G+ + WIKDAF Sbjct: 61 GGGISSVTHTKRQKILKTASCFILEHQ-KYNQFGLRFDVVSIDGDAASISWIKDAFGAD 118 >UniRef50_B0U003 UPF0102 protein Fphi_0415 n=15 Tax=Francisella RepID=Y420_FRAP2 Length = 117 Score = 147 bits (373), Expect = 7e-35, Method: Composition-based stats. Identities = 41/115 (35%), Positives = 66/115 (57%), Gaps = 2/115 (1%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVN-ERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + T + G+ E QA ++L K L+ +A N GEID+I + T +F+EV+YR Sbjct: 1 MKTIEIGNKAEEQASKFLRTKNLQILAQNFKAFPYGEIDIIALDQNTLVFIEVKYRSKTK 60 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAF 127 + A +T SKQ KL+ A ++L + F+ +CRFD++A ++ WIK+AF Sbjct: 61 FAKAEEMLTYSKQQKLINAANIFLQENP-KFENYECRFDLIAINKEDINWIKNAF 114 >UniRef50_B2V8B3 UPF0102 protein SYO3AOP1_0546 n=2 Tax=Sulfurihydrogenibium RepID=Y546_SULSY Length = 115 Score = 147 bits (372), Expect = 9e-35, Method: Composition-based stats. Identities = 37/115 (32%), Positives = 63/115 (54%), Gaps = 2/115 (1%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + Q G +E +A R+LE G + + N + GEID+I +FVEV+ R + + Sbjct: 1 MDKTQKGKFFEDKAVRYLESIGYKVLHKNYRSKYGEIDIIAETDNVIVFVEVKGRFTENF 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAFN 128 G S+T+ K K+++TA ++ +N RFDVVA GN++ +++AF+ Sbjct: 61 GSGEESITKKKIDKIVKTALQFIEENNLQGKD--FRFDVVALKGNQIFHLENAFS 113 >UniRef50_C8X4T0 Putative uncharacterized protein n=1 Tax=Desulfohalobium retbaense DSM 5692 RepID=C8X4T0_DESRD Length = 134 Score = 147 bits (372), Expect = 9e-35, Method: Composition-based stats. Identities = 43/120 (35%), Positives = 64/120 (53%), Gaps = 6/120 (5%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 +TG E AR +LE G +A N GGE+DL+ R GR IFVEV+ R S+ Sbjct: 2 SARHLKTGRDGEEAARAYLESCGYVIVARNWRGGGGELDLVCRLGREIIFVEVKTRASSG 61 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF----TGNEVEWIKDAFN 128 ++T +KQ +L++ A +L+R+ CRFDV++ +GN+VE +AF Sbjct: 62 RTLPIQALTPAKQQRLIRAASAYLSRNR--LWETPCRFDVISVFSGPSGNQVEHCTNAFE 119 >UniRef50_Q47VU1 UPF0102 protein CPS_4433 n=1 Tax=Colwellia psychrerythraea 34H RepID=Y4433_COLP3 Length = 129 Score = 147 bits (371), Expect = 1e-34, Method: Composition-based stats. Identities = 49/124 (39%), Positives = 76/124 (61%), Gaps = 4/124 (3%) Query: 8 SGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRY 67 S + ++ G E+ A+++L +GLRFI N + R GEIDLIM +G T +FVEV+Y Sbjct: 6 KTSAKNTSSTDKGQVTESYAQQYLSKQGLRFIERNFHSRQGEIDLIMLDGDTYVFVEVKY 65 Query: 68 RRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN----EVEWI 123 R+S +GGA A+++ SKQ+K+ +L ++ + CR DVVA G+ +V W+ Sbjct: 66 RKSKGFGGAIAAISASKQNKVKHCITFYLHQNGLNEYNTPCRVDVVALEGDITQPQVTWL 125 Query: 124 KDAF 127 K+AF Sbjct: 126 KNAF 129 >UniRef50_D1KBC1 Putative uncharacterized protein n=1 Tax=uncultured SUP05 cluster bacterium RepID=D1KBC1_9GAMM Length = 123 Score = 147 bits (371), Expect = 1e-34, Method: Composition-based stats. Identities = 46/120 (38%), Positives = 70/120 (58%), Gaps = 7/120 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE--GRTTIFVEVRYRRSALY 73 ++ G+ E A +L GL I N + GEID+IM + +T +FVEVRYR++ + Sbjct: 5 KRKVGNQAEDIALEYLSTHGLELIEQNYLTKMGEIDIIMLDKSEQTLVFVEVRYRQNTYF 64 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN----EVEWIKDAFND 129 G AA +V ++KQ KL++TA+ +L +H + CRFDVV + ++ WIKDAF Sbjct: 65 GSAADTVDQNKQAKLVRTAQYYLQQH-SKYQEFICRFDVVGVESDLKYPKINWIKDAFGA 123 >UniRef50_C2KW07 Putative uncharacterized protein n=1 Tax=Oribacterium sinus F0268 RepID=C2KW07_9FIRM Length = 188 Score = 146 bits (370), Expect = 2e-34, Method: Composition-based stats. Identities = 36/113 (31%), Positives = 59/113 (52%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 + G +E +A +LE KG + N E+DL+ ++G F+EV+ R+ Y Sbjct: 76 SNTLKGKVFEDRAVAFLEEKGYEILERNSRFHHLEMDLVAKDGEMLCFIEVKGRKEHSYL 135 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAF 127 +V R KQ +L A +L +H+ S CRFDVV+ G +++ I++AF Sbjct: 136 SGVYAVDRGKQRRLRTWATAYLCKHSYSLTETACRFDVVSIEGEKIQLIQNAF 188 >UniRef50_B8J1T1 Putative uncharacterized protein n=1 Tax=Desulfovibrio desulfuricans subsp. desulfuricans str. ATCC 27774 RepID=B8J1T1_DESDA Length = 160 Score = 146 bits (369), Expect = 2e-34, Method: Composition-based stats. Identities = 44/133 (33%), Positives = 61/133 (45%), Gaps = 7/133 (5%) Query: 4 VPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFV 63 R + + G A E A L G G +A N + E+D++ +G T +FV Sbjct: 19 KSVRPATAPAAAHLRLGSAGEDAAAELLTGAGCTLLARNWRQARLELDMVCLDGDTIVFV 78 Query: 64 EVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN----E 119 EV+ R S YGG A +V SKQ L + AR WLA H CRFDV+ N Sbjct: 79 EVKTRSSERYGGPAYAVGLSKQRVLCRAARAWLAAH--EAWDKPCRFDVICVLRNGDTLH 136 Query: 120 VEWIKDAFN-DHS 131 +E + AF+ + Sbjct: 137 LEHFRHAFDCPPA 149 >UniRef50_A9NAA4 UPF0102 protein COXBURSA331_A1934 n=6 Tax=Coxiella burnetii RepID=Y1934_COXBR Length = 120 Score = 145 bits (367), Expect = 4e-34, Method: Composition-based stats. Identities = 46/114 (40%), Positives = 68/114 (59%), Gaps = 2/114 (1%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 T++ G E A R+L+ +GL FI N + GEIDLIM + +F+EVRYRR + + Sbjct: 5 TQKIGFNAEKTACRYLQKQGLSFITKNFRYKQGEIDLIMSDQSMLVFIEVRYRRFSDFIH 64 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-VEWIKDAFN 128 A+VT KQ +L++TA +L +H D + CRFD+V T + + WIK+A Sbjct: 65 PVATVTPLKQRRLIKTALHYLQKHRL-LDKISCRFDIVGITADRQITWIKNAIE 117 >UniRef50_C0GQX2 Putative uncharacterized protein n=1 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GQX2_9DELT Length = 132 Score = 145 bits (367), Expect = 4e-34, Method: Composition-based stats. Identities = 40/119 (33%), Positives = 55/119 (46%), Gaps = 5/119 (4%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 G E AR +L G R N RGGE+DL+ G +FVEV+ R Sbjct: 2 SAHNLNLGRYGEEVARDYLTENGYRIKERNWRARGGELDLVCTCGDCIVFVEVKTRAEEG 61 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE---VEWIKDAFN 128 G S+ +Q KLL+TA L+L+RHN + RFD + T +E I++A Sbjct: 62 MGHPLESLGFKQQKKLLRTAGLYLSRHNM--WSSQSRFDFICVTVGREVQIEHIQNAIE 118 >UniRef50_A1AN88 UPF0102 protein Ppro_1186 n=11 Tax=Deltaproteobacteria RepID=Y1186_PELPD Length = 140 Score = 145 bits (367), Expect = 4e-34, Method: Composition-based stats. Identities = 40/132 (30%), Positives = 61/132 (46%), Gaps = 9/132 (6%) Query: 4 VPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRT--TI 61 S + + TG E A +L +G R + N +GGE+D++ R + Sbjct: 8 RQESPSSTARPDNRNTGSRGEEIATSFLGQQGYRILERNFRCKGGELDIVARAPGERSLV 67 Query: 62 FVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF---TGN 118 FVEV+ RR YG +VT KQ ++ + A WL+R++ RFDV+A G Sbjct: 68 FVEVKTRRDRSYGPPQLAVTPFKQRQISKAALTWLSRNHLH--DSQARFDVIAILLEDGG 125 Query: 119 E--VEWIKDAFN 128 +E I +AF Sbjct: 126 RHSIEHIVNAFE 137 >UniRef50_D1VVL2 Putative uncharacterized protein n=1 Tax=Peptoniphilus lacrimalis 315-B RepID=D1VVL2_9FIRM Length = 117 Score = 145 bits (367), Expect = 4e-34, Method: Composition-based stats. Identities = 36/116 (31%), Positives = 61/116 (52%), Gaps = 4/116 (3%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + K+ G+ E+ A +L+ K + N G EID+I ++G +FVEV+ RR+ + Sbjct: 1 MNNKELGNFGESLATDFLQKKNYIILDRNYRALGTEIDIIAKDGEELVFVEVKTRRNHKF 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN--EVEWIKDAF 127 G A +VT K ++QTA +++ +H RFDV+ N + I++AF Sbjct: 61 GEAYEAVTEFKMRNIIQTANVYIYKH--ELYNTQVRFDVIEVYINEKRINHIENAF 114 >UniRef50_A5F986 UPF0102 protein VC0395_A0112/VC395_0597 n=28 Tax=Vibrio RepID=Y1312_VIBC3 Length = 122 Score = 145 bits (366), Expect = 6e-34, Method: Composition-based stats. Identities = 46/117 (39%), Positives = 74/117 (63%), Gaps = 2/117 (1%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 ++ G+ +E A +L +GL + NVN R GE+DLIMR+G T +FVEVRYR + +G Sbjct: 5 NSRHQGNHYEQMAADYLRRQGLTLVTQNVNYRFGELDLIMRDGNTLVFVEVRYRNNTQHG 64 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT--GNEVEWIKDAFND 129 AA +VTR+K+ +L++ A W+ + + + D RFDV+A G ++W+K+A + Sbjct: 65 HAAETVTRTKRARLIKAANCWMLANKMNSHSADFRFDVIAIHQQGQHIDWLKNAITE 121 >UniRef50_B6ELI6 UPF0102 protein VSAL_I2655 n=7 Tax=Vibrionaceae RepID=Y2655_ALISL Length = 123 Score = 145 bits (366), Expect = 6e-34, Method: Composition-based stats. Identities = 49/119 (41%), Positives = 70/119 (58%), Gaps = 2/119 (1%) Query: 11 PRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRS 70 ++ + G+ +E A+R+LE L FI N + GE+DLIMR+ + +FVEV+YR S Sbjct: 2 EKKPNKRIKGEYYELMAKRYLETHQLTFIERNFYSKTGELDLIMRDRDSFVFVEVKYRAS 61 Query: 71 ALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT--GNEVEWIKDAF 127 + YG A VT KQ KL +TA WL ++ S + RFDVVA G ++ WIK+A Sbjct: 62 SNYGSAQEMVTWQKQRKLQRTALFWLMKNGLSVEHTSFRFDVVAIHSQGQDINWIKNAI 120 >UniRef50_A0YER5 Putative uncharacterized protein n=1 Tax=marine gamma proteobacterium HTCC2143 RepID=A0YER5_9GAMM Length = 128 Score = 144 bits (365), Expect = 6e-34, Method: Composition-based stats. Identities = 46/118 (38%), Positives = 66/118 (55%), Gaps = 6/118 (5%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 G E +A WL+ +GL+ +A N + GEID+IM +G+ +FVEVRYR+SA +G Sbjct: 10 KNTNFGAYVEEKAYHWLQQQGLKSVALNYRCKTGEIDIIMLDGQQLVFVEVRYRKSASFG 69 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-----EVEWIKDAF 127 SV R KQ K+ + A +L F+ + CRFDV+A + W+KDAF Sbjct: 70 DGLESVDRRKQQKIQKAAAHFLTDRP-GFNHLPCRFDVIAAKPSSDSSLHWNWVKDAF 126 >UniRef50_B8CW28 Putative uncharacterized protein n=1 Tax=Halothermothrix orenii H 168 RepID=B8CW28_HALOH Length = 116 Score = 144 bits (365), Expect = 6e-34, Method: Composition-based stats. Identities = 40/118 (33%), Positives = 62/118 (52%), Gaps = 6/118 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + ++ GD E +A R+L+ KG + I N GEID+I + +FVEV+ RRS Y Sbjct: 1 MQNRELGDWGEKKAVRYLKSKGYQVIKTNYRCLIGEIDIIAIDNNFLVFVEVKTRRSIAY 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE----VEWIKDAF 127 G A +V KQ K+ + AR +L + + RFDV++ ++ IK+AF Sbjct: 61 GVPACAVNFDKQKKIRKVARHYLKSNMIN--KYQIRFDVISIIVKNNRGFLKHIKNAF 116 >UniRef50_C5BS52 Putative uncharacterized protein n=1 Tax=Teredinibacter turnerae T7901 RepID=C5BS52_TERTT Length = 129 Score = 144 bits (365), Expect = 6e-34, Method: Composition-based stats. Identities = 54/126 (42%), Positives = 79/126 (62%), Gaps = 2/126 (1%) Query: 3 TVPTRSGSPRQLTTKQT-GDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTI 61 P R+ + +Q T ++ GD E A+++L +GL +A N R GEIDLIM+ T + Sbjct: 4 PNPFRTPTGKQPTARRKTGDLAEDAAQQYLISQGLTPVARNYRSRFGEIDLIMQHASTLV 63 Query: 62 FVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVE 121 FVEVRYR ++ YG +AA+VT SKQ+K+ QTA+ ++ S + RFDVV +G + + Sbjct: 64 FVEVRYRANSRYGSSAATVTASKQNKIRQTAQQFIIDKKLS-ANLALRFDVVGMSGTQTQ 122 Query: 122 WIKDAF 127 WIK AF Sbjct: 123 WIKGAF 128 >UniRef50_A3XHA2 Putative uncharacterized protein n=1 Tax=Leeuwenhoekiella blandensis MED217 RepID=A3XHA2_9FLAO Length = 118 Score = 144 bits (365), Expect = 7e-34, Method: Composition-based stats. Identities = 31/118 (26%), Positives = 54/118 (45%), Gaps = 7/118 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + + G E A +LE KG + N E+D+I + + VEV+ R S + Sbjct: 1 MNHNELGKWGEEYAANYLEKKGYELLERNWFFNKAELDIIALKNNQLVVVEVKTRNSDFF 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN----EVEWIKDAF 127 G VT +K L++ ++ ++ ++ RFDV+A N ++E +DAF Sbjct: 61 GDPQDFVTPAKIKLLVKATNEYIISNDL---DLEVRFDVIAVLKNKTQEQLEHFEDAF 115 >UniRef50_Q7N090 UPF0102 protein plu4003 n=2 Tax=Enterobacteriaceae RepID=Y4003_PHOLL Length = 126 Score = 144 bits (364), Expect = 8e-34, Method: Composition-based stats. Identities = 59/119 (49%), Positives = 86/119 (72%) Query: 12 RQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSA 71 ++LT+ G +EAQA+ +L+ +GL FIAANV GGEIDLIM++ +T +F+EVR+R+S Sbjct: 4 KKLTSYLLGRNYEAQAKLFLQKQGLSFIAANVKVHGGEIDLIMKDKQTWVFIEVRFRKSG 63 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAFNDH 130 YG A A++TRSK+ KLL A +WL + F+T CRFD+ A TG + EW+++AFN + Sbjct: 64 QYGDALATITRSKRKKLLHAAAVWLFQRGECFETSSCRFDICAITGQQFEWLQNAFNQN 122 >UniRef50_A1SU47 UPF0102 protein Ping_1176 n=2 Tax=Psychromonas RepID=Y1176_PSYIN Length = 122 Score = 144 bits (364), Expect = 9e-34, Method: Composition-based stats. Identities = 46/118 (38%), Positives = 67/118 (56%), Gaps = 5/118 (4%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + G E QA +L+ +GLR I N R GEIDLIM + T +F+EVRYR+++ + Sbjct: 8 QASNSKGVLAEKQALSYLQEQGLRLICQNYYCRFGEIDLIMIDQDTLVFIEVRYRKNSDF 67 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT--GNEVEWIKDAFND 129 GG AS+ +SKQ K++ TA+ +L D CRFD +A WI++AF + Sbjct: 68 GGPFASINKSKQRKIITTAKHYL---RTLEDEPFCRFDAIAIDSKSTTPAWIQNAFQE 122 >UniRef50_Q2S9Y0 UPF0102 protein HCH_05895 n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Y5895_HAHCH Length = 124 Score = 144 bits (363), Expect = 1e-33, Method: Composition-based stats. Identities = 42/123 (34%), Positives = 68/123 (55%), Gaps = 5/123 (4%) Query: 9 GSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYR 68 R + + G A E+QA ++ +G + N +GGEIDLI R G +F+EVR+R Sbjct: 2 PFKRLIKSIDIGRAAESQAEKFARAQGFTIVERNFRCKGGEIDLIARHGEHLVFIEVRHR 61 Query: 69 RSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVV---AFTGNEVEWIKD 125 S +G AA S+T+ KQ +++ A ++L + + + CRFDV+ + +WI D Sbjct: 62 SSDKFGSAAESITQKKQQRIILAANIYLQKKGLT--NMPCRFDVIVGNLKSNTGFQWIPD 119 Query: 126 AFN 128 AF+ Sbjct: 120 AFS 122 >UniRef50_C0N677 Putative uncharacterized protein n=1 Tax=Methylophaga thiooxidans DMS010 RepID=C0N677_9GAMM Length = 120 Score = 144 bits (363), Expect = 1e-33, Method: Composition-based stats. Identities = 48/119 (40%), Positives = 73/119 (61%), Gaps = 6/119 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + ++ G E Q + L+ +G+R I N RGGEIDLIM++ T +F+EVRYR+SA + Sbjct: 1 MFAREKGQQIEKQVAKHLQKQGMRLITRNYQCRGGEIDLIMQDRETLVFIEVRYRQSARF 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF----TGNEVEWIKDAFN 128 G A SV ++KQ +++ TA +L + + CRFDVVA TG + +W+K+AF Sbjct: 61 GSALESVNKTKQSRIIHTAEHYLQQSRDGYQ--ACRFDVVAVSPAKTGYQFDWVKNAFQ 117 >UniRef50_B7GSU4 UPF0102 protein Blon_1698 n=12 Tax=Bifidobacterium RepID=Y1698_BIFLI Length = 124 Score = 143 bits (362), Expect = 1e-33, Method: Composition-based stats. Identities = 45/122 (36%), Positives = 59/122 (48%), Gaps = 5/122 (4%) Query: 11 PRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGR-TTIFVEVRYRR 69 R LT KQ G E A WLE G ++ N + R GE+D++M T +FVEV+ RR Sbjct: 3 DRNLTPKQFGALGEQYAAAWLEEHGWTTLSRNWHTRYGELDIVMLNPEYTVVFVEVKSRR 62 Query: 70 SALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE----VEWIKD 125 S YG ++T +KQH L + A WL RFDVV V I++ Sbjct: 63 SMHYGYPQEAITPAKQHNLRKAACDWLLDRRNRVPHTAVRFDVVTIVLRVGRPLVHHIEN 122 Query: 126 AF 127 AF Sbjct: 123 AF 124 >UniRef50_Q67PD3 UPF0102 protein STH1475 n=1 Tax=Symbiobacterium thermophilum RepID=Y1475_SYMTH Length = 118 Score = 143 bits (362), Expect = 1e-33, Method: Composition-based stats. Identities = 46/119 (38%), Positives = 64/119 (53%), Gaps = 7/119 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 +++ G+A E A +L G R IA NV R GEIDLI ++G +FVEV+ RR YG Sbjct: 2 SRRVGEAGEQAAAEFLTASGYRIIARNVRFRSGEIDLIAQDGGVLVFVEVKTRRGRRYGT 61 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-----EVEWIKDAFND 129 +VT +KQ +L + A L+LAR + CRFDVV I++AF+ Sbjct: 62 PGEAVTAAKQRRLARLASLYLARLGS--EPPPCRFDVVEVEPGPDGRLRCRLIQNAFHA 118 >UniRef50_C7RDQ1 Putative uncharacterized protein n=1 Tax=Anaerococcus prevotii DSM 20548 RepID=C7RDQ1_ANAPD Length = 115 Score = 143 bits (361), Expect = 2e-33, Method: Composition-based stats. Identities = 36/114 (31%), Positives = 62/114 (54%), Gaps = 4/114 (3%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 K+ GD E +L+ K +A N + GEID++ + +FVEV+ R++A + Sbjct: 4 KKEFGDYGENLVEGYLKDKSYEILARNYRKPFGEIDIVAKLSDMIVFVEVKTRKNANFAS 63 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF--TGNEVEWIKDAF 127 A +VT SKQ K++Q ++ +L +N + + RFDV E+ +I++AF Sbjct: 64 PAEAVTPSKQRKVIQASQAFLIENNMT--DMLMRFDVAEVIADKGEINYIENAF 115 >UniRef50_Q1Q244 Putative uncharacterized protein n=1 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q244_9BACT Length = 149 Score = 143 bits (361), Expect = 2e-33, Method: Composition-based stats. Identities = 37/127 (29%), Positives = 62/127 (48%), Gaps = 8/127 (6%) Query: 7 RSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVR 66 + S Q K G E A ++L+ KG + + N + GEID+I + + +FVEV+ Sbjct: 20 KKTSDVQPHKKALGKKGEVVAAKFLKKKGYKILQRNYRRKTGEIDIICYDRGSIVFVEVK 79 Query: 67 YRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF------TGNEV 120 R S YG +VT +K+ ++++ A ++A +D RFDVV+ + Sbjct: 80 TRGSDSYGPPELAVTEAKKKQIIKMASRYIAEKKVEG--IDLRFDVVSVFYPPAKKHPAI 137 Query: 121 EWIKDAF 127 K+AF Sbjct: 138 TLYKNAF 144 >UniRef50_Q0VS15 UPF0102 protein ABO_0585 n=2 Tax=Alcanivorax RepID=Y585_ALCBS Length = 125 Score = 142 bits (360), Expect = 3e-33, Method: Composition-based stats. Identities = 48/119 (40%), Positives = 69/119 (57%), Gaps = 6/119 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + K TG E +A +WL G+GL + N + R GEIDLI+ + T +F EVR+R+ Y Sbjct: 5 RSKKNTGRDAEKRAAKWLTGQGLSIVERNFHCRQGEIDLILLDQETLVFTEVRWRKHQSY 64 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-----VEWIKDAF 127 GGA ASV + KQ +L+ A+ +LARH CRFDV+ + +WI++AF Sbjct: 65 GGALASVDQHKQRRLINAAQHFLARHP-EHHHRPCRFDVLGMEPDSQQAVLYQWIQNAF 122 >UniRef50_B3EJJ5 UPF0102 protein Cphamn1_0017 n=1 Tax=Chlorobium phaeobacteroides BS1 RepID=Y017_CHLPB Length = 126 Score = 142 bits (359), Expect = 3e-33, Method: Composition-based stats. Identities = 38/122 (31%), Positives = 57/122 (46%), Gaps = 12/122 (9%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 G E A +L KG R + N R EID+I + RT F+EV+ R SA G Sbjct: 5 PHDLGRQGEHTAVTFLIEKGYRILQRNYRHRRNEIDIIALDRRTLCFIEVKTRSSASKGH 64 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN----------EVEWIKD 125 +VT KQ ++++ A +L+ + CRFDV+A + ++E I + Sbjct: 65 PLEAVTPEKQKEIIRAATAYLSAYPSPEPD--CRFDVIAIIAHDFTNGRIREFKLEHITN 122 Query: 126 AF 127 AF Sbjct: 123 AF 124 >UniRef50_B9MQX5 UPF0102 protein Athe_0977 n=1 Tax=Anaerocellum thermophilum DSM 6725 RepID=Y977_ANATD Length = 119 Score = 142 bits (359), Expect = 3e-33, Method: Composition-based stats. Identities = 42/120 (35%), Positives = 59/120 (49%), Gaps = 7/120 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + KQ G E A +L G + N R GEID+I +E +T +FVEV+ R+S + Sbjct: 1 MNLKQVGRFGENLAVDFLIKHGYEILRTNFRCRLGEIDIIAKEDKTIVFVEVKTRKSLKF 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF------TGNEVEWIKDAF 127 G + SV KQ + + A ++A H S D RFDVV ++ IKDAF Sbjct: 61 GLPSESVNFKKQLHIKKVAEYFIAYH-LSQDKYLYRFDVVEIFIDGKNNVTKINLIKDAF 119 >UniRef50_A4BQJ8 Putative uncharacterized protein n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BQJ8_9GAMM Length = 119 Score = 142 bits (359), Expect = 4e-33, Method: Composition-based stats. Identities = 49/118 (41%), Positives = 67/118 (56%), Gaps = 4/118 (3%) Query: 12 RQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSA 71 R + G EA+A +L+ +GLR + N + R GEIDLIM + +FVEVR R + Sbjct: 3 RGPNPRTLGKQAEARALEFLQRRGLRCLQRNFHTRLGEIDLIMEDTGEVVFVEVRQRATK 62 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG-NEVEWIKDAFN 128 +GGA SVT K+ +L+ AR +L H CRFDV+A G +EWI+DAF Sbjct: 63 RFGGALESVTPVKRQRLIAAARYYLLTH---APNAACRFDVIAIDGQGSIEWIRDAFQ 117 >UniRef50_Q7MNW2 UPF0102 protein VV0603 n=80 Tax=Vibrionales RepID=Y603_VIBVY Length = 122 Score = 142 bits (358), Expect = 4e-33, Method: Composition-based stats. Identities = 49/115 (42%), Positives = 76/115 (66%), Gaps = 2/115 (1%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 + + G+ +E+ A+ +L+ +GLRFI AN + GEIDLI +E +T +FVEV+YR+++ YG Sbjct: 5 SRRAIGNQYESLAKEYLQRQGLRFIEANFTTKVGEIDLIFKEAQTIVFVEVKYRKNSCYG 64 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF--TGNEVEWIKDAF 127 AA V +K +KL++TA LWL +H + RFDVVA G+++ WI +A Sbjct: 65 DAAEMVNPAKANKLIKTAYLWLNKHGYNACNTAMRFDVVAIHSNGHDINWIANAI 119 >UniRef50_B0G5Y9 Putative uncharacterized protein n=4 Tax=Clostridiales RepID=B0G5Y9_9FIRM Length = 122 Score = 142 bits (358), Expect = 4e-33, Method: Composition-based stats. Identities = 44/119 (36%), Positives = 63/119 (52%), Gaps = 6/119 (5%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + ++TG +E +A +LE G + + N R GEIDLI R+G +FVEV+YR + + Sbjct: 3 KKNNRRTGTGYERKAGAYLESLGYKIVTYNYRCRLGEIDLIARDGEYLVFVEVKYRTTGV 62 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG----NEVEWIKDAF 127 G A +V KQ + + A +L + V CRFDVVA G E+ KDAF Sbjct: 63 SGYPAEAVDARKQQTIAKCAMHFLMKQGND--DVPCRFDVVAIAGAEGQEEITLYKDAF 119 >UniRef50_C4GB01 Putative uncharacterized protein n=1 Tax=Shuttleworthia satelles DSM 14600 RepID=C4GB01_9FIRM Length = 116 Score = 142 bits (358), Expect = 5e-33, Method: Composition-based stats. Identities = 41/113 (36%), Positives = 60/113 (53%), Gaps = 1/113 (0%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 ++ G +E +A +L G+GL + N + R GEIDL+ REG +FVEV+YRRS +G Sbjct: 5 NRRKEGSFYERRAGDYLTGQGLTLVEFNFSCRLGEIDLVAREGTCLVFVEVKYRRSRRFG 64 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAF 127 +V SK + + A + H T RFD+VA G E+ + AF Sbjct: 65 LPEEAVGPSKMRTIRKVAGYYCLTHGIC-QTTPVRFDLVAIEGEEIRHYRGAF 116 >UniRef50_C6A8H5 Putative uncharacterized protein n=4 Tax=Bifidobacterium animalis subsp. lactis RepID=C6A8H5_BIFLB Length = 158 Score = 141 bits (357), Expect = 5e-33, Method: Composition-based stats. Identities = 41/119 (34%), Positives = 57/119 (47%), Gaps = 5/119 (4%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIM-REGRTTIFVEVRYRRSAL 72 LT KQ G E R WL +A N + R GEID+I T +FVEV+ RRS Sbjct: 40 LTAKQIGSLGERLCRAWLIEHHWHVLACNWHCRFGEIDIIALTSHSTIVFVEVKTRRSTS 99 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE----VEWIKDAF 127 G +V +KQ ++ + A WL H + + RFDV+A T + + +AF Sbjct: 100 CGIPEEAVHAAKQMRVRRAAICWLGEHGSTIRHIGVRFDVIAVTVTPTDVFIHHVPEAF 158 >UniRef50_B7RSM7 Putative uncharacterized protein n=1 Tax=marine gamma proteobacterium HTCC2148 RepID=B7RSM7_9GAMM Length = 124 Score = 141 bits (357), Expect = 6e-33, Method: Composition-based stats. Identities = 47/121 (38%), Positives = 69/121 (57%), Gaps = 7/121 (5%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 ++ KQ GD +E +A +L +G+ + N R GEIDLI R+ +F+EVR RR+ Sbjct: 3 GISMKQIGDEYERRAAHFLSQQGVEVLICNYRCRCGEIDLIARQNDYLVFIEVRARRNPR 62 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE------VEWIKDA 126 + AAASV KQ +LL+TA+ +L RH + CRFDV+ F + +WI+ A Sbjct: 63 FATAAASVDYRKQQRLLRTAQFFLQRH-TKLANLPCRFDVITFEPRQSTANDSPQWIRGA 121 Query: 127 F 127 F Sbjct: 122 F 122 >UniRef50_D0KYK3 Putative uncharacterized protein n=1 Tax=Halothiobacillus neapolitanus c2 RepID=D0KYK3_HALNC Length = 165 Score = 141 bits (356), Expect = 7e-33, Method: Composition-based stats. Identities = 50/141 (35%), Positives = 69/141 (48%), Gaps = 13/141 (9%) Query: 1 MATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTT 60 M + + TT G E A +L +GL+ I NV GEIDLIM++G T Sbjct: 25 MRGTDAELPNAKAQTTLARGHRAETMAAEYLSRQGLKLIDRNVRAGRGEIDLIMQDGATL 84 Query: 61 IFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-- 118 +FVEVR R++ + AA SV+ +K+ K+++TA L + CRFDVVA Sbjct: 85 VFVEVRARKAGAWVSAAESVSPAKRKKIIETAERLLNEKPV-WRKSPCRFDVVAIGLPSE 143 Query: 119 ----------EVEWIKDAFND 129 EV WI+DAF Sbjct: 144 SSSEPAAKQAEVNWIQDAFQA 164 >UniRef50_Q8R5S3 UPF0102 protein TTE1452 n=9 Tax=Thermoanaerobacterales RepID=Y1452_THETN Length = 122 Score = 141 bits (356), Expect = 7e-33, Method: Composition-based stats. Identities = 36/123 (29%), Positives = 61/123 (49%), Gaps = 9/123 (7%) Query: 12 RQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSA 71 +++ K G E A ++L KG + + N + GEIDLI +FVEV+ R S Sbjct: 2 KKVNKKTVGSVGEKIAAQYLSKKGYKILEKNFKCKIGEIDLIALYKNQIVFVEVKTRTSV 61 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF-------TGNEVEWIK 124 +G + +V KQ K+++ A++++A +F RFD++ T +V I Sbjct: 62 NFGLPSEAVDFHKQQKIVKIAQVYIAS--SNFKQYQPRFDIIEVYLNPEKLTLEKVNHIL 119 Query: 125 DAF 127 +AF Sbjct: 120 NAF 122 >UniRef50_A1ZF36 Putative uncharacterized protein n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZF36_9SPHI Length = 119 Score = 141 bits (356), Expect = 7e-33, Method: Composition-based stats. Identities = 34/116 (29%), Positives = 60/116 (51%), Gaps = 8/116 (6%) Query: 17 KQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGA 76 ++ G E A +++ KG + N + GEID+I + G +FVEV+ R S +G Sbjct: 6 QKKGKYGENLAAAFMQNKGYTLLERNYRYKRGEIDIIAQTGDVLVFVEVKLRSSDNFGLP 65 Query: 77 AASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG-----NEVEWIKDAF 127 SV+ ++Q+ ++QTA ++ + D RFD+VA ++ + +DAF Sbjct: 66 EESVSENQQNLIIQTAEQYIEEID---WESDIRFDIVAIELKSHQSPQITYFEDAF 118 >UniRef50_C6WYK5 Putative uncharacterized protein n=1 Tax=Methylotenera mobilis JLW8 RepID=C6WYK5_METML Length = 119 Score = 141 bits (356), Expect = 7e-33, Method: Composition-based stats. Identities = 49/117 (41%), Positives = 61/117 (52%), Gaps = 7/117 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 G E A +L+ GL I N GEIDLIMR+G+T +FVEVR R + + Sbjct: 7 AKNITEGQLAEQIAATFLQNNGLTVIEKNFRSAYGEIDLIMRDGKTLVFVEVRLRSNTKF 66 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVV---AFTGNEVEWIKDAF 127 GGA S+ SKQ KL +TA +L + CRFD + A VEWIKDAF Sbjct: 67 GGAGMSINASKQQKLTRTAERYLQING----DSACRFDAILMHALDITTVEWIKDAF 119 >UniRef50_D1CBL5 Putative uncharacterized protein n=1 Tax=Thermobaculum terrenum ATCC BAA-798 RepID=D1CBL5_THET1 Length = 125 Score = 141 bits (356), Expect = 8e-33, Method: Composition-based stats. Identities = 36/120 (30%), Positives = 51/120 (42%), Gaps = 6/120 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 +K G E A +L KG + IA N R GEID+I ++ +FVEV+ R S G Sbjct: 2 SKSLGRIGEDYACNFLLSKGYKLIARNWRCRQGEIDIIFQDKDEIVFVEVKTRSSLSLGT 61 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT---GNEV---EWIKDAFND 129 S+ K +LL A++W+ RFD V T V I++ Sbjct: 62 PEESIDMHKARQLLTLAKIWIFECYDGEKDPPVRFDAVTVTISRSGRVIDSNHIQNCIMP 121 >UniRef50_C9KJR5 Endonuclease n=2 Tax=Veillonellaceae RepID=C9KJR5_9FIRM Length = 132 Score = 141 bits (356), Expect = 8e-33, Method: Composition-based stats. Identities = 40/124 (32%), Positives = 61/124 (49%), Gaps = 6/124 (4%) Query: 9 GSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYR 68 + R + T G E A +LE G +A N GEID++ +GR FVEV+ R Sbjct: 10 NAERLMDTTTIGRQGEEAAAVFLERAGYEILARNFRTPRGEIDIVASKGRMLAFVEVKTR 69 Query: 69 RSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF----TGNEVEWIK 124 R+ +G AA+V KQ K++Q+A +L + + CRFDV+ ++E + Sbjct: 70 RTQRFGRPAAAVDYRKQQKIIQSAHWFLRQRHLEGCL--CRFDVIEIYRAGERWQIEHLP 127 Query: 125 DAFN 128 AF Sbjct: 128 GAFE 131 >UniRef50_A6VB97 UPF0102 protein PSPA7_4996 n=17 Tax=Pseudomonadaceae RepID=Y4996_PSEA7 Length = 125 Score = 141 bits (356), Expect = 8e-33, Method: Composition-based stats. Identities = 41/123 (33%), Positives = 64/123 (52%), Gaps = 7/123 (5%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + ++ G E A L +GL + N R GE+DL+M +G T +FVEVR RR Sbjct: 4 RANSRDKGRQAEEMACAHLLRQGLATLGKNWTCRRGELDLVMLDGDTVVFVEVRSRRHRA 63 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT------GNEVEWIKDA 126 +GGA S+ K+ +L+ +A L+L + + CRFDVV ++WI++A Sbjct: 64 WGGALESIDARKRQRLILSAELFLQQE-ARWAKRPCRFDVVTVDTSDGQSPPRLDWIQNA 122 Query: 127 FND 129 F+ Sbjct: 123 FDA 125 >UniRef50_A8SLV5 Putative uncharacterized protein n=1 Tax=Parvimonas micra ATCC 33270 RepID=A8SLV5_9FIRM Length = 114 Score = 140 bits (355), Expect = 8e-33, Method: Composition-based stats. Identities = 31/116 (26%), Positives = 57/116 (49%), Gaps = 4/116 (3%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + K G+ E A ++L KG + I N + GEID+I ++ +F+EV+ R++ + Sbjct: 1 MKAKDIGNLGEDMAVKFLLEKGYQIIERNFLKPFGEIDIIAKDKDFLVFIEVKARKNVNF 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN--EVEWIKDAF 127 G V K K+ A++++ N RFDV+ + ++ I++AF Sbjct: 61 GFPREFVNGIKIKKIQDVAQIYMMEKNLFGAK--IRFDVIEIIFDNYKITHIENAF 114 >UniRef50_A3WNE6 Predicted endonuclease n=1 Tax=Idiomarina baltica OS145 RepID=A3WNE6_9GAMM Length = 116 Score = 140 bits (355), Expect = 8e-33, Method: Composition-based stats. Identities = 35/116 (30%), Positives = 63/116 (54%), Gaps = 2/116 (1%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 ++T++ G E+QA R+L +GL + N GEID+I R+ T +FVEV+ R+++ + Sbjct: 1 MSTRKRGLEGESQASRYLRQQGLVIVQHNFRVPCGEIDIICRDSDTWVFVEVKRRQNSDF 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE--VEWIKDAF 127 +T + ++ + A+ +L + RFD++ ++ VEW KDAF Sbjct: 61 ASILEQITTRQCQRIRRAAQYFLVEQTVNEYLAKMRFDIITINDSQVTVEWYKDAF 116 >UniRef50_B8DRI1 Putative uncharacterized protein n=2 Tax=Desulfovibrio RepID=B8DRI1_DESVM Length = 146 Score = 140 bits (355), Expect = 1e-32, Method: Composition-based stats. Identities = 38/136 (27%), Positives = 62/136 (45%), Gaps = 7/136 (5%) Query: 1 MATVPTRSGSPRQLT-TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRT 59 M + T G E A R L +GLR +A N G E+D++ + T Sbjct: 1 MTPPRAAPPTTASATGNAAIGARGEEAAARLLAQRGLRVLARNWRHGGLELDIVCDDRGT 60 Query: 60 TIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE 119 +FVEV+ R ++ ++T +K+ KL++ AR +LA H+ CRFD+V + Sbjct: 61 LVFVEVKTRAASGPARPDEALTTAKRGKLVRAARQYLAAHDC--WDKPCRFDLVCVVHDG 118 Query: 120 ----VEWIKDAFNDHS 131 +E AF+ + Sbjct: 119 ATLTLEHYPHAFDLTA 134 >UniRef50_C0YUE8 Possible endonuclease n=2 Tax=Flavobacteriaceae RepID=C0YUE8_9FLAO Length = 125 Score = 140 bits (355), Expect = 1e-32, Method: Composition-based stats. Identities = 29/121 (23%), Positives = 53/121 (43%), Gaps = 8/121 (6%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 G E A +L+ G + + N + EID+I + I VEV+ R + + Sbjct: 4 ANHNDFGKMAEDLAVEYLKKCGYKILVRNFRFQKAEIDVIAEKDNQIIVVEVKARSTDAF 63 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-----VEWIKDAFN 128 +VT++K ++ A ++ N + RFD+++ +E +E I+DAF Sbjct: 64 MLPQEAVTKTKIKSIVSAANHYMEEFNK---DNEVRFDIISVLPDENKNLIIEHIEDAFE 120 Query: 129 D 129 Sbjct: 121 A 121 >UniRef50_Q7P0B3 UPF0102 protein CV_0654 n=1 Tax=Chromobacterium violaceum RepID=Y654_CHRVO Length = 112 Score = 140 bits (354), Expect = 1e-32, Method: Composition-based stats. Identities = 51/113 (45%), Positives = 71/113 (62%), Gaps = 4/113 (3%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 Q G E +A LE +GL+ +A N + RGGEIDLIMR+G +FVEVR+R + +GG Sbjct: 1 MNQAGRDAEDRALALLEKRGLKLVARNWHCRGGEIDLIMRDGDALVFVEVRHRGGSRFGG 60 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFD-VVAFTGNEVEWIKDAF 127 AA S+T +KQ KLL A ++L+ HN CRFD VV+ G+ +W+K+ Sbjct: 61 AADSITAAKQRKLLLAAEVYLSSHNI---DSPCRFDAVVSVGGDAPQWLKNVI 110 >UniRef50_Q1GYY7 UPF0102 protein Mfla_2283 n=1 Tax=Methylobacillus flagellatus KT RepID=Y2283_METFK Length = 113 Score = 140 bits (354), Expect = 1e-32, Method: Composition-based stats. Identities = 54/117 (46%), Positives = 71/117 (60%), Gaps = 7/117 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 KQ GD EA A R+L +GL IA N R GEIDL+M++G T +FVEVR R A +GG Sbjct: 1 MKQLGDDAEALAERYLIKQGLVVIARNYRCRFGEIDLVMKQGATIVFVEVRMRSHATFGG 60 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF---TGNEVEWIKDAFND 129 AAAS+ +K+ KL+ TA +L RH CRFD + + +EWI+DAF+ Sbjct: 61 AAASIHAAKRQKLILTAEHFLQRHGS----APCRFDAILLSKRDADGIEWIQDAFSA 113 >UniRef50_B4RXI2 Sigma-54 factor n=2 Tax=Alteromonas macleodii RepID=B4RXI2_ALTMD Length = 113 Score = 140 bits (353), Expect = 2e-32, Method: Composition-based stats. Identities = 34/109 (31%), Positives = 60/109 (55%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 +K G+A E +A +L +GL N + R GE+D++M++G T + +EV+YR+ +G Sbjct: 2 SKLQGNAAEDKACEYLLQQGLTLRCRNYHTRRGELDIVMQDGNTIVCIEVKYRKQNRFGS 61 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIK 124 A VT K ++ +L + + + R DV+A G+ +EW+K Sbjct: 62 AVEFVTAKKLQRIQAAFGFYLLDNGLNPASTPLRIDVIAIDGDNLEWLK 110 >UniRef50_A5N821 UPF0102 protein CKL_1410 n=16 Tax=Clostridium RepID=Y1410_CLOK5 Length = 122 Score = 139 bits (352), Expect = 2e-32, Method: Composition-based stats. Identities = 35/119 (29%), Positives = 54/119 (45%), Gaps = 8/119 (6%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 K G E A+ +L G + N + GEID+I ++G F+EV+ R LYG Sbjct: 5 NKDIGSLGEDIAKNYLNQIGYTVLERNFRCKVGEIDIIGKDGDYICFIEVKSRYGKLYGN 64 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF------TGNEVEWIKDAFN 128 SV K+ K+ + A +++ R + RFDV+ ++ IKDAF Sbjct: 65 PCESVNYPKRLKIYKAANIYMLRKKLF--KFNFRFDVIEIIFNTYNDVPSIKLIKDAFQ 121 >UniRef50_D0MIM6 Putative uncharacterized protein n=1 Tax=Rhodothermus marinus DSM 4252 RepID=D0MIM6_RHOM4 Length = 127 Score = 139 bits (352), Expect = 2e-32, Method: Composition-based stats. Identities = 41/128 (32%), Positives = 59/128 (46%), Gaps = 14/128 (10%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIM-------REGRTTIFVEVR 66 + T+ G E A +LE +G R +A EIDL+ +G +FVEV+ Sbjct: 1 MDTRTIGTRGEDLAAAYLEQQGYRILARQYRFERAEIDLVCFEPAPRPEDGGEIVFVEVK 60 Query: 67 YRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF-----TGNEVE 121 RR +G +VT KQ L++ AR +L H+ CRFDV+A E+E Sbjct: 61 TRRGLGFGRPEEAVTPEKQRHLIRAARAYLYEHH--LQRARCRFDVIAIVLHDDRPPEIE 118 Query: 122 WIKDAFND 129 +DAF Sbjct: 119 HFRDAFWA 126 >UniRef50_B2KC57 Putative uncharacterized protein n=1 Tax=Elusimicrobium minutum Pei191 RepID=B2KC57_ELUMP Length = 122 Score = 139 bits (351), Expect = 3e-32, Method: Composition-based stats. Identities = 39/117 (33%), Positives = 64/117 (54%), Gaps = 7/117 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGR-----TTIFVEVRYRRS 70 + G E A +L+ G + IA N + GE+D+I +G T +F+EV+ R Sbjct: 1 MNKLGVESENAAANFLKKNGYKIIARNYAVQTGEVDIIASQGGLLKQKTLVFIEVKGRAY 60 Query: 71 ALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAF 127 YGG A+VT++KQ+K++ A +++ + FD RFDVV ++E I++AF Sbjct: 61 KAYGGPLAAVTKAKQNKIISAATIYVKENFPKFD--SIRFDVVTVVDGKIEHIENAF 115 >UniRef50_A8G183 UPF0102 protein Ssed_4252 n=16 Tax=Shewanella RepID=Y4252_SHESH Length = 117 Score = 139 bits (351), Expect = 3e-32, Method: Composition-based stats. Identities = 40/117 (34%), Positives = 67/117 (57%), Gaps = 3/117 (2%) Query: 11 PRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRS 70 + + + G A E A +L +GL FI NV + GEIDL+M+ G+ IFVEV+YR Sbjct: 4 NNEPISAEHGQAGENLAMNYLLEQGLTFIERNVRFKFGEIDLVMKNGKEWIFVEVKYRSK 63 Query: 71 ALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAF 127 + YGGA +++ + +L + A ++ +N CRFD++A +++W+ +AF Sbjct: 64 SQYGGAINALSSGQIKRLRRAAEHYMQLNNI---DAICRFDLIAVDAGQIQWLPNAF 117 >UniRef50_C8W5C0 Putative uncharacterized protein n=1 Tax=Desulfotomaculum acetoxidans DSM 771 RepID=C8W5C0_DESAS Length = 119 Score = 139 bits (351), Expect = 3e-32, Method: Composition-based stats. Identities = 45/119 (37%), Positives = 61/119 (51%), Gaps = 9/119 (7%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 + G E+ A R+L KG I N R GEID+I RE T+FVEVR R + YG Sbjct: 4 KKQLLGRLGESVAARYLYSKGFIIIHQNFRCRLGEIDIIAREKGVTVFVEVRSRCGSSYG 63 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE------VEWIKDAF 127 SV KQ KL + A+ ++AR+ + D RFDVVA + +E ++AF Sbjct: 64 LPQESVVIKKQVKLRKLAQYYIARYALTG---DFRFDVVAVMFEQDNSIKLIEHFRNAF 119 >UniRef50_B5YFD1 UPF0102 protein DICTH_1420 n=2 Tax=Dictyoglomus RepID=Y1420_DICT6 Length = 118 Score = 139 bits (351), Expect = 3e-32, Method: Composition-based stats. Identities = 32/120 (26%), Positives = 61/120 (50%), Gaps = 8/120 (6%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + K+ G E +L +G + N GE+D+I ++G IF+EV+ RR+ + Sbjct: 1 MNNKEIGKLGEDFTIDFLNKRGFIILERNYKVPLGEVDIIAQKGDLLIFIEVKTRRNLDF 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE------VEWIKDAF 127 G A +V R+KQ ++ + A L+++ F RFD+++ ++ E++ +AF Sbjct: 61 GIPAEAVDRTKQTRIKKIAELYISTKKPKFKK--IRFDIMSIILSKSGKILDWEYLINAF 118 >UniRef50_A3M3I7 UPF0102 protein A1S_1049 n=12 Tax=Acinetobacter RepID=Y1049_ACIBT Length = 133 Score = 139 bits (350), Expect = 3e-32, Method: Composition-based stats. Identities = 44/131 (33%), Positives = 68/131 (51%), Gaps = 17/131 (12%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 +Q G E A + L+ + ++A+N + R GE+DLI++ G IFVEV+ R YG Sbjct: 4 AQQLGQWAEQTALKLLKEQNYEWVASNYHSRRGEVDLIVKRGNELIFVEVKARGQGNYGQ 63 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE---------------- 119 A VT SKQ K+++TA +L R+ S+ CRFDV+ F + Sbjct: 64 ACEMVTLSKQKKIIKTAMRFLQRYP-SYQDFYCRFDVICFDFPQKIAKTVQQDFSKFHYD 122 Query: 120 VEWIKDAFNDH 130 ++WI++AF Sbjct: 123 LQWIENAFTLD 133 >UniRef50_C6BVQ7 Putative uncharacterized protein n=1 Tax=Desulfovibrio salexigens DSM 2638 RepID=C6BVQ7_DESAD Length = 134 Score = 138 bits (348), Expect = 6e-32, Method: Composition-based stats. Identities = 35/120 (29%), Positives = 54/120 (45%), Gaps = 6/120 (5%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 G A E A +LE +G N + E+D+I + IFVEV+ R Sbjct: 2 SPRHLDFGQAGEDYAACFLENRGYFLRQRNWRWKQWELDIICEKDDELIFVEVKTRAGRS 61 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF----TGNEVEWIKDAFN 128 +VT +K+ KL++ A +L+ + CRFD+V TG E I++AF+ Sbjct: 62 AQSGIEAVTPAKRKKLVKAATRYLSAFD--LWERPCRFDLVIVNDDGTGFRAEHIENAFD 119 >UniRef50_A8PP71 Putative uncharacterized protein n=1 Tax=Rickettsiella grylli RepID=A8PP71_9COXI Length = 130 Score = 138 bits (348), Expect = 6e-32, Method: Composition-based stats. Identities = 42/127 (33%), Positives = 68/127 (53%), Gaps = 14/127 (11%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + T++ G E+ + +L + L+ I N GEIDLIM++ +F+EVRYR+S + Sbjct: 1 MNTQKLGHHIESLVQDYLRRQKLKRITRNFRCCFGEIDLIMKDKNVLVFIEVRYRQSLQF 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG-------------NEV 120 G + S+ KQ+K+++ A +L+ S + + CRFDVV +V Sbjct: 61 GNSLESIHAMKQNKIMKAAEYYLSSQRLS-EKIACRFDVVGVKPITQKLLAVSKLDSAQV 119 Query: 121 EWIKDAF 127 EWIK+AF Sbjct: 120 EWIKNAF 126 >UniRef50_A4J649 UPF0102 protein Dred_2035 n=1 Tax=Desulfotomaculum reducens MI-1 RepID=Y2035_DESRM Length = 122 Score = 137 bits (347), Expect = 8e-32, Method: Composition-based stats. Identities = 35/121 (28%), Positives = 58/121 (47%), Gaps = 8/121 (6%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREG-RTTIFVEVRYRRSA 71 + K G+ E +A ++++ G + N + GE+D+I + +F+EVR R Sbjct: 2 SIQRKALGNKGEEEACKYIQNLGYNIMERNYRCKIGELDIIAWDPVGMLVFLEVRSRSGR 61 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN------EVEWIKD 125 +G SV KQ+KL A+ +L F + CRFDV+ N E++ IK+ Sbjct: 62 AFGVPEESVNYRKQNKLRMLAQQFLLTK-SEFAKISCRFDVIGVYFNKEGSVQEIKHIKN 120 Query: 126 A 126 A Sbjct: 121 A 121 >UniRef50_Q0TPP8 UPF0102 protein CPF_1959 n=9 Tax=Clostridium perfringens RepID=Y1959_CLOP1 Length = 122 Score = 137 bits (347), Expect = 8e-32, Method: Composition-based stats. Identities = 36/119 (30%), Positives = 57/119 (47%), Gaps = 8/119 (6%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 K G E + ++L+ +G + N N GEID+I + F+EV+ R S +G Sbjct: 5 NKSIGFYGEDLSAKFLKKEGYSILEKNFNCSSGEIDIIAIKDEIISFIEVKSRFSNSFGN 64 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN------EVEWIKDAFN 128 SVT SKQ +++ A+ +L H RFDV+ + E+ ++KDAF Sbjct: 65 PKESVTCSKQGRIINAAKYYL--HVKKLYNYYIRFDVIEVNFHIDSSKYELNFLKDAFR 121 >UniRef50_UPI0000510419 hypothetical protein BlinB_18076 n=1 Tax=Brevibacterium linens BL2 RepID=UPI0000510419 Length = 132 Score = 137 bits (346), Expect = 1e-31, Method: Composition-based stats. Identities = 34/117 (29%), Positives = 55/117 (47%), Gaps = 2/117 (1%) Query: 2 ATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTI 61 T RS + L + G E A +L+ +G+ I N GEID+I ++G T + Sbjct: 3 PTTGRRSATTSGLRQRALGQTGEDLAADFLQRQGMVIIERNFRCPRGEIDIIAKDGDTIV 62 Query: 62 FVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN 118 FVEV+ RR+ G +VT +K K+ + +WL++ F R D + + Sbjct: 63 FVEVKTRRTLAQGSPLEAVTAAKLRKIRTLSGIWLSQQKDFFA--SIRIDALGIVMD 117 >UniRef50_A5EVA6 UPF0102 protein DNO_0639 n=2 Tax=Cardiobacteriaceae RepID=Y639_DICNV Length = 126 Score = 137 bits (346), Expect = 1e-31, Method: Composition-based stats. Identities = 45/121 (37%), Positives = 66/121 (54%), Gaps = 6/121 (4%) Query: 11 PRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRS 70 +++TTK+ G E A +L GL +A NV R GEIDLI ++ R +FVEVR RR+ Sbjct: 7 NKKMTTKKRGQYGELLAADYLTAHGLNIVAKNVYSRYGEIDLIAQDDRVLVFVEVRLRRA 66 Query: 71 ALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF----TGNEVEWIKDA 126 AA S+T K + Q+A+ +L ++ CRFD V T +E+EW+K+ Sbjct: 67 QALVSAAESITPEKLRRCYQSAQDYLQKNYAVPPD--CRFDAVLITQYQTHHEIEWLKNV 124 Query: 127 F 127 Sbjct: 125 I 125 >UniRef50_C9LW74 Endonuclease n=1 Tax=Selenomonas sputigena ATCC 35185 RepID=C9LW74_9FIRM Length = 121 Score = 137 bits (346), Expect = 1e-31, Method: Composition-based stats. Identities = 37/121 (30%), Positives = 60/121 (49%), Gaps = 6/121 (4%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 +TTK GD E A ++LE +G R + + GEID+I + +F+EV+ RR + Sbjct: 1 MTTKSFGDRGEDLAAQYLEKRGCRILERQFRAKTGEIDIIAEDRGALLFIEVKTRRPTRF 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG----NEVEWIKDAFND 129 G A +V +KQ ++ +TA L++ + CRFDV+ V ++AF Sbjct: 61 GAPAQAVGYTKQRRIFRTALLYMQKRAIGERF--CRFDVLEVLVMGGSYTVNHYENAFEF 118 Query: 130 H 130 Sbjct: 119 D 119 >UniRef50_Q31EY6 UPF0102 protein Tcr_1695 n=1 Tax=Thiomicrospira crunogena XCL-2 RepID=Y1695_THICR Length = 120 Score = 137 bits (345), Expect = 1e-31, Method: Composition-based stats. Identities = 46/115 (40%), Positives = 72/115 (62%), Gaps = 4/115 (3%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE-GRTTIFVEVRYRRSALYG 74 +++ G E QA WL+ + + +A N +GGEIDLI + T IF EV+YR+S+ +G Sbjct: 5 SQKIGQQKEQQAAVWLKTQAITIVAQNFRCKGGEIDLIGLDTDDTLIFFEVKYRQSSTFG 64 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN--EVEWIKDAF 127 A+ SVT KQ +L+Q A+ +L +H ++ + RFDV+ F N + EW++DAF Sbjct: 65 TASESVTPQKQQRLIQCAQNFLQKHP-NYQACNMRFDVLFFEDNQTQPEWLQDAF 118 >UniRef50_A1KWG5 UPF0102 protein NMC2069 n=28 Tax=Neisseriaceae RepID=Y2069_NEIMF Length = 115 Score = 137 bits (345), Expect = 1e-31, Method: Composition-based stats. Identities = 40/111 (36%), Positives = 65/111 (58%), Gaps = 3/111 (2%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 + G+A E A +L+ +G +A N + GEIDLI++ G +FVEV+YR++ +GG Sbjct: 4 NHKQGEAGEDAALAFLQSQGCTLLARNWHCAYGEIDLIVKNGGMILFVEVKYRKNRQFGG 63 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-VEWIKD 125 AA S++ SK KL ++ +L ++ + V CR D V G+ EWI++ Sbjct: 64 AAYSISPSKLLKLQRSVEYYLQQNRLT--NVPCRLDAVLIEGSRPPEWIQN 112 >UniRef50_A0LV62 UPF0102 protein Acel_1550 n=8 Tax=Actinomycetales RepID=Y1550_ACIC1 Length = 132 Score = 137 bits (345), Expect = 2e-31, Method: Composition-based stats. Identities = 39/126 (30%), Positives = 57/126 (45%), Gaps = 13/126 (10%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 + G E A + L+ G+ +A N R GE+D++ R+G T + EV+ RR +G Sbjct: 7 AREALGRFGEELAAQHLQTLGMTILARNWRCRSGELDIVARDGYTLVVCEVKTRRGVGFG 66 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNG--------SFDTVDCRFDVVAF-----TGNEVE 121 SVT K +L Q A WL H + RFDVVA G +E Sbjct: 67 EPLESVTPRKAARLRQLAVAWLTEHAATRVDTTEGTHGYTAVRFDVVAILHRKEDGPTIE 126 Query: 122 WIKDAF 127 +++ AF Sbjct: 127 YVRGAF 132 >UniRef50_Q2BGY7 Putative uncharacterized protein n=1 Tax=Neptuniibacter caesariensis RepID=Q2BGY7_9GAMM Length = 119 Score = 136 bits (344), Expect = 2e-31, Method: Composition-based stats. Identities = 50/119 (42%), Positives = 72/119 (60%), Gaps = 5/119 (4%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + ++ G E A +L + LR I N N R GEIDLIM++G + +F+EVR R A + Sbjct: 1 MDRRKRGKDAEQHALVYLSKQKLRLIEQNFNCRFGEIDLIMQDGESIVFIEVRLRTHAEF 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT----GNEVEWIKDAFN 128 GGAAASVT +KQ K+++TA+L+L++ +CRFDV+A+ W KDAF Sbjct: 61 GGAAASVTTTKQRKIIKTAQLYLSKRP-RLQNKNCRFDVIAYEYDAAPTHPLWYKDAFR 118 >UniRef50_B2A2P1 UPF0102 protein Nther_1376 n=1 Tax=Natranaerobius thermophilus JW/NM-WN-LF RepID=Y1376_NATTJ Length = 119 Score = 136 bits (344), Expect = 2e-31, Method: Composition-based stats. Identities = 44/120 (36%), Positives = 58/120 (48%), Gaps = 7/120 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVN-ERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + K G E AR +L KG + I N R GEIDLI IFVEV+ R S L Sbjct: 1 MNNKSKGRTAEKIARIFLLSKGYQIIFQNYRFSRLGEIDLICCFDNILIFVEVKSRSSLL 60 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-----EVEWIKDAF 127 +G +V KQ +L + A ++L N F RFDV+A N E+ ++DAF Sbjct: 61 WGQPEEAVGYEKQGQLKKLANIFLYEFN-EFTEYQIRFDVIAILNNNKVKCEISHLRDAF 119 >UniRef50_B3JMU0 Putative uncharacterized protein n=6 Tax=Bacteroidales RepID=B3JMU0_9BACE Length = 129 Score = 136 bits (344), Expect = 2e-31, Method: Composition-based stats. Identities = 29/126 (23%), Positives = 54/126 (42%), Gaps = 7/126 (5%) Query: 9 GSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYR 68 + + G E A +L KG N + E+D++ ++ I VEV+ R Sbjct: 5 AKNKMAKHNELGKEGENAAAEYLMSKGYSIRHRNWHSGKRELDIVAQKDGELIVVEVKTR 64 Query: 69 RSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE----VEWIK 124 R+ +G ++T K ++ + ++ R + RFD++ TG E +E I+ Sbjct: 65 RNEEFGKPEEAITDRKIRNIIISTDTYIKRFEI---DLPVRFDIITVTGTEPPFHIEHIQ 121 Query: 125 DAFNDH 130 +AF Sbjct: 122 EAFLPP 127 >UniRef50_B0VJC5 Putative uncharacterized protein n=1 Tax=Candidatus Cloacamonas acidaminovorans RepID=B0VJC5_9BACT Length = 128 Score = 136 bits (343), Expect = 2e-31, Method: Composition-based stats. Identities = 34/124 (27%), Positives = 60/124 (48%), Gaps = 7/124 (5%) Query: 12 RQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSA 71 ++ + + E A R+L G N ++ GEID+I+ + + +F EV+ R S Sbjct: 2 KKYSLQDFSHIGEDLAARYLVSNGYTITCRNYRKKYGEIDIIVEKDQHLVFCEVKTRTSH 61 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF------TGNEVEWIKD 125 A AS+ SKQ K+ +TA+L++ + F RFDV+ E++ ++ Sbjct: 62 SIEWALASIGFSKQRKISRTAQLYINENP-QFAKHIFRFDVLLVFYYENTDTFEIKHFEN 120 Query: 126 AFND 129 AF+ Sbjct: 121 AFDA 124 >UniRef50_A4FME3 UPF0102 protein SACE_6045 n=2 Tax=Actinomycetales RepID=Y6045_SACEN Length = 133 Score = 136 bits (343), Expect = 2e-31, Method: Composition-based stats. Identities = 37/118 (31%), Positives = 53/118 (44%), Gaps = 7/118 (5%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 G E A R+LE G+ + N GE+D++ +G T IF EV+ R YG Sbjct: 18 RRHALGVEGERLAARFLEEHGITVLERNWRCDRGELDIVATDGETVIFCEVKARSGVDYG 77 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-----EVEWIKDAF 127 +V+ K L AR WL+ N + T RFDVV+ +E ++ AF Sbjct: 78 APLNAVSPHKVRHLRALARTWLSERNLTGCTA--RFDVVSVLWPPGRPARIEHLEGAF 133 >UniRef50_A1SB01 UPF0102 protein Sama_3355 n=5 Tax=Shewanella RepID=Y3355_SHEAM Length = 108 Score = 135 bits (342), Expect = 3e-31, Method: Composition-based stats. Identities = 47/108 (43%), Positives = 66/108 (61%), Gaps = 3/108 (2%) Query: 20 GDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAAS 79 G E +A + L GLR A NV GEIDL+MREGR +FVEV++R +G A + Sbjct: 4 GQLAEDRAMKHLCAHGLRLEARNVRYPFGEIDLVMREGRVYVFVEVKFRTPKGFGDAVQA 63 Query: 80 VTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAF 127 ++ ++Q +L + A +L H CRFD+VA TG+++EWIKDAF Sbjct: 64 LSAAQQQRLRRAATHYLQCHRI---DAPCRFDMVAITGDKLEWIKDAF 108 >UniRef50_C2LNN6 Possible endonuclease n=3 Tax=Proteus RepID=C2LNN6_PROMI Length = 125 Score = 135 bits (342), Expect = 3e-31, Method: Composition-based stats. Identities = 52/117 (44%), Positives = 73/117 (62%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 +T G +E +A +L +GL+ I NV GEIDLIM+ RT IFVEVR+RRSA Sbjct: 4 PKSTYLVGQYYERKALNYLRQQGLKLIERNVRYPCGEIDLIMQGNRTWIFVEVRFRRSAQ 63 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAFND 129 +G A +SVT SK+ +L A WLA+ S +TV+CRFD+ AF ++ W+K+ + Sbjct: 64 FGDAISSVTYSKRRRLWYAANCWLAQRQQSIETVNCRFDICAFDQRQLIWLKNILDH 120 >UniRef50_A6TRS2 UPF0102 protein Amet_2739 n=1 Tax=Alkaliphilus metalliredigens QYMF RepID=Y2739_ALKMQ Length = 114 Score = 135 bits (342), Expect = 3e-31, Method: Composition-based stats. Identities = 40/115 (34%), Positives = 57/115 (49%), Gaps = 5/115 (4%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 +K G+ E ++LE KG R I N + GEID+I +G FVEV+ RRS YG Sbjct: 2 SKSLGELGERIIGQYLEKKGYRLIETNYRTKLGEIDIIAYKGTIIAFVEVKTRRSQSYGM 61 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN---EVEWIKDAF 127 +V KQ +L + A ++AR D RFDV ++ +I +AF Sbjct: 62 PCEAVNWQKQQRLHRVASHYIARKG--LINYDFRFDVAEVIIGKEKKIHYINNAF 114 >UniRef50_C6D2Y6 Putative uncharacterized protein n=1 Tax=Paenibacillus sp. JDR-2 RepID=C6D2Y6_PAESJ Length = 122 Score = 135 bits (342), Expect = 3e-31, Method: Composition-based stats. Identities = 47/120 (39%), Positives = 63/120 (52%), Gaps = 9/120 (7%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRR-SALY 73 +QTG A E A R+LE +G I N R GEID+I T +FVEVR RR + Sbjct: 5 RRRQTGLAGETAACRYLEKEGYNVIERNWRCRSGEIDIIATIDHTLVFVEVRTRRTGGRF 64 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE------VEWIKDAF 127 G AA SV R KQ ++ A+++L ++ RFDV+A T + V+ IK AF Sbjct: 65 GTAAESVDRRKQQQVALVAQVYLRMRQLTY--PPMRFDVIAVTMDRNDSISEVKHIKAAF 122 >UniRef50_C8PPR7 HD domain protein n=1 Tax=Treponema vincentii ATCC 35580 RepID=C8PPR7_9SPIO Length = 462 Score = 135 bits (342), Expect = 3e-31, Method: Composition-based stats. Identities = 39/128 (30%), Positives = 63/128 (49%), Gaps = 8/128 (6%) Query: 10 SPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRR 69 +++T ++ G A EA A +WLE G IA N R GEID+I + T IF EV+ Sbjct: 335 QQKKMTEERLGPAGEAFAAKWLERNGYSVIARNWRTRTGEIDIIAEKNETLIFFEVKTLP 394 Query: 70 SALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-------EVEW 122 + V KQ ++ +TA+ +L H ++ + RFDV+ + E Sbjct: 395 HTAFTDLDIIVGNRKQERICKTAKYFLLTHR-KYNKMHIRFDVLVLPFDPRTTEEAEPVH 453 Query: 123 IKDAFNDH 130 +++AF D+ Sbjct: 454 LENAFEDY 461 >UniRef50_C3W9T3 Endonuclease n=4 Tax=Fusobacterium RepID=C3W9T3_FUSMR Length = 120 Score = 135 bits (342), Expect = 3e-31, Method: Composition-based stats. Identities = 32/111 (28%), Positives = 59/111 (53%), Gaps = 2/111 (1%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 ++ GD +E +A + L +G + + N + GEID+I + T +FVEV+YR++ YG Sbjct: 3 NNREIGDKYEEKAVKLLISRGYKILERNYRVKAGEIDIIAKFEDTIVFVEVKYRKTLKYG 62 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 +V K ++ A+++L + RFD ++F G ++ W K+ Sbjct: 63 YGLEAVDYRKIRRIYNAAKVYLTLNKKLSSK--IRFDCISFLGEKISWTKN 111 >UniRef50_Q1NMK2 Putative uncharacterized protein n=1 Tax=delta proteobacterium MLMS-1 RepID=Q1NMK2_9DELT Length = 120 Score = 135 bits (341), Expect = 4e-31, Method: Composition-based stats. Identities = 44/120 (36%), Positives = 61/120 (50%), Gaps = 6/120 (5%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + G E AR WLEG G R + AN GE+DL+ EG +FVEV+ RR Sbjct: 2 TRQRQGLGRRGEQLARDWLEGAGYRILEANCRTSSGELDLVAEEGGELVFVEVKSRRGDA 61 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE----VEWIKDAFN 128 +G +V R KQ ++++ AR +L+R RFDVVA T +E +K+AF Sbjct: 62 FGSPLEAVDRRKQARIIRCAREYLSRRRSHG--RPARFDVVAVTFTGGKPAIEVVKNAFE 119 >UniRef50_A8ZV12 UPF0102 protein Dole_2298 n=2 Tax=Desulfobacteraceae RepID=Y2298_DESOH Length = 123 Score = 135 bits (341), Expect = 5e-31, Method: Composition-based stats. Identities = 40/118 (33%), Positives = 61/118 (51%), Gaps = 4/118 (3%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 +Q G E A R+L+ +G + N GEID+I ++ T FVEV+ RR+ YG Sbjct: 4 QRQQYGRQGEQAAERFLKKEGYTIVCRNYRTPVGEIDIIAKDKTTLAFVEVKARRTESYG 63 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE--VEWIKDAFNDH 130 S+T+ KQ K+ + A +L + RFDVV G + VE I++AF+ + Sbjct: 64 SPRLSITKDKQRKITRAALWYLKDTGQAGARA--RFDVVIVQGRDNSVELIRNAFDAN 119 >UniRef50_Q3IG11 UPF0102 protein PSHAa2523 n=3 Tax=Alteromonadales RepID=Y2523_PSEHT Length = 123 Score = 135 bits (340), Expect = 5e-31, Method: Composition-based stats. Identities = 39/117 (33%), Positives = 67/117 (57%), Gaps = 5/117 (4%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 +++ G +E QA+++L +GL I N GE+D+IM++G T +FVEV++R++ Sbjct: 9 QNSREKGQYYELQAQKYLVSQGLTAIERNYYCPFGELDVIMKDGNTLVFVEVKFRKNHAR 68 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN---EVEWIKDAF 127 GGA +++ KQ +L ++ +LA N + R D VA TG + W+K+ F Sbjct: 69 GGANYALSIQKQARLKRSIYHYLAAKNLT--NQPLRIDYVAITGEPSMHINWLKNVF 123 >UniRef50_C7HUU3 Endonuclease n=4 Tax=Anaerococcus RepID=C7HUU3_9FIRM Length = 118 Score = 135 bits (340), Expect = 5e-31, Method: Composition-based stats. Identities = 33/117 (28%), Positives = 56/117 (47%), Gaps = 4/117 (3%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 + GD E A ++LE KG + + N + GEID+I + FVEV+ R++ + Sbjct: 4 KKRTIGDFGEEIALKYLEKKGYQILDRNFLKYYGEIDIIAIKNDILTFVEVKTRKNDEFK 63 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF--TGNEVEWIKDAFND 129 A+ V KQ ++ +TA+ ++ + FDV + +IK+AF D Sbjct: 64 PASLDVDYYKQERIKKTAQAYIMEKDLGE--FLISFDVCEVYLENKTIHYIKNAFGD 118 >UniRef50_C7N589 Predicted endonuclease related to Holliday junction resolvase n=2 Tax=Slackia RepID=C7N589_SLAHD Length = 167 Score = 135 bits (340), Expect = 6e-31, Method: Composition-based stats. Identities = 34/126 (26%), Positives = 57/126 (45%), Gaps = 8/126 (6%) Query: 9 GSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE-GRTTIFVEVRY 67 R++ K+ G EA A L+ KG + N GE D+I + T +FVEV+ Sbjct: 41 RLSREMDPKELGRRGEACACMLLDYKGYEILERNWKCPAGEADIIAIDENGTLVFVEVKT 100 Query: 68 RRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-----VEW 122 RR G ++ R+K+ + + A +L+++ + RFD +A + V Sbjct: 101 RRGVENGLPEEAIGRAKRARYEKIAAYYLSQY--TGPDTALRFDTIALLVMDNYRALVRH 158 Query: 123 IKDAFN 128 I +AF Sbjct: 159 IVNAFG 164 >UniRef50_D0I4Y5 Endonuclease n=1 Tax=Grimontia hollisae CIP 101886 RepID=D0I4Y5_VIBHO Length = 122 Score = 134 bits (339), Expect = 6e-31, Method: Composition-based stats. Identities = 57/114 (50%), Positives = 76/114 (66%), Gaps = 2/114 (1%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 L KQTGD +E QA R+LE +GL + N +GGE+DLIMRE +FVEV+YR+ A Y Sbjct: 4 LNRKQTGDHYENQACRFLERQGLTTLDKNARFKGGELDLIMREKSCIVFVEVKYRKQASY 63 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG--NEVEWIKD 125 GGAAA+++R KQ ++L+ A LW+A+ S + RFD V F G N V WIK+ Sbjct: 64 GGAAATISRQKQQRMLKAAYLWMAKKGLSATHTEFRFDAVTFEGSVNSVNWIKN 117 >UniRef50_C7IKV6 Putative uncharacterized protein n=1 Tax=Clostridium papyrosolvens DSM 2782 RepID=C7IKV6_9CLOT Length = 126 Score = 134 bits (339), Expect = 6e-31, Method: Composition-based stats. Identities = 37/125 (29%), Positives = 60/125 (48%), Gaps = 13/125 (10%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNE-RGGEIDLIMREGRTTIFVEVRYRRSAL 72 ++ G E +A +L+ G + N R GEID+I + F+EV+ RR++ Sbjct: 4 ANKREIGAVGEREAAEFLQRNGYTILKINYRVGRLGEIDIIANDNEYICFIEVKTRRTST 63 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE----------VEW 122 +G +VT++KQ K+ Q A ++L N + RFDV+ N+ + Sbjct: 64 FGSPGEAVTKTKQQKIRQIAAIYLT--NTRKMDSNVRFDVIEILMNKSMESVNSIKSINL 121 Query: 123 IKDAF 127 IKDAF Sbjct: 122 IKDAF 126 >UniRef50_Q1MRU7 UPF0102 protein LI0223 n=1 Tax=Lawsonia intracellularis PHE/MN1-00 RepID=Y223_LAWIP Length = 132 Score = 134 bits (339), Expect = 6e-31, Method: Composition-based stats. Identities = 33/119 (27%), Positives = 61/119 (51%), Gaps = 6/119 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + + + G E+ A +L KG+ + N + EIDL+ ++ +T +FVEVR R++ Sbjct: 1 MKSCEIGQQGESAAALFLYNKGMSILERNWRKGRFEIDLVCQDIKTLVFVEVRTRKAKGM 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN----EVEWIKDAFN 128 ++T SK+ ++ +A+L+L ++ CRFDV+ E+E K+ F Sbjct: 61 LLPEQTLTISKRCNIIHSAQLYLMDKKD--WSMPCRFDVICIISKKTTLELEHYKNVFE 117 >UniRef50_Q2S1J6 UPF0102 protein SRU_1822 n=1 Tax=Salinibacter ruber DSM 13855 RepID=Y1822_SALRD Length = 122 Score = 134 bits (339), Expect = 7e-31, Method: Composition-based stats. Identities = 40/123 (32%), Positives = 54/123 (43%), Gaps = 8/123 (6%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMR--EGRTTIFVEVRYRRSA 71 TT GD E A L+G G +A N E+DL+ R + +FVEV+ R Sbjct: 2 ATTNDIGDRGEEIAAAHLDGAGYEILARNYRHSRNEVDLVCRETDAGEYVFVEVKTRSGT 61 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT----GNEVEWIKDAF 127 +G AS+T K+ L AR +L H RFDVVA EV+ ++AF Sbjct: 62 GFGAPEASITAKKRAALQHAARGYLHEHGAEGA--PARFDVVAVMLTGGPPEVQHYENAF 119 Query: 128 NDH 130 Sbjct: 120 WAD 122 >UniRef50_C0GHM5 Putative uncharacterized protein n=1 Tax=Dethiobacter alkaliphilus AHT 1 RepID=C0GHM5_9FIRM Length = 112 Score = 134 bits (339), Expect = 8e-31, Method: Composition-based stats. Identities = 38/114 (33%), Positives = 50/114 (43%), Gaps = 4/114 (3%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 K G E A L G +A N GEID++ + G +FVEV+ RRS+ G Sbjct: 1 MKTLGQKGEELAVDHLRRAGYLILARNWRCERGEIDIVAKAGNILVFVEVKTRRSSRLGT 60 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT--GNEVEWIKDAF 127 +V KQ KL A ++ + RFDV A N V IK+AF Sbjct: 61 PQEAVDFRKQEKLRHLAYRFINATGITAAEY--RFDVAAVNAKNNTVTIIKNAF 112 >UniRef50_C4FFG3 Putative uncharacterized protein n=1 Tax=Bifidobacterium angulatum DSM 20098 RepID=C4FFG3_9BIFI Length = 151 Score = 134 bits (338), Expect = 8e-31, Method: Composition-based stats. Identities = 46/132 (34%), Positives = 63/132 (47%), Gaps = 6/132 (4%) Query: 1 MATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRT- 59 M T+ R ++++Q G EA A WLEG +A N + R GE+D+I Sbjct: 21 METIEARLAGN-AVSSRQVGALGEAYAAAWLEGFDWLVLARNWHCRYGELDIIALSPERR 79 Query: 60 TIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE 119 IFVEV+ RR +G +VT SKQ L + A WL R+ RFDVV + ++ Sbjct: 80 IIFVEVKTRRGVRFGTPQEAVTPSKQTNLRRAALQWLERNGHLLRHNGMRFDVVTVSVHD 139 Query: 120 ----VEWIKDAF 127 V I AF Sbjct: 140 GQVAVHRIPGAF 151 >UniRef50_A6L1J0 UPF0102 protein BVU_1879 n=26 Tax=Bacteroidales RepID=Y1879_BACV8 Length = 121 Score = 134 bits (338), Expect = 8e-31, Method: Composition-based stats. Identities = 26/121 (21%), Positives = 53/121 (43%), Gaps = 7/121 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + G E +A +L KG N + E+D++ I +EV+ R++ + Sbjct: 2 AEHNEFGKEGEEEAAAYLIDKGYSIRHRNWHCGKKELDIVAEYRNELIVIEVKTRKNTRF 61 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE----VEWIKDAFND 129 G +VT K +++ + +L + + + RFD++ G + +E I++AF Sbjct: 62 GNPEDAVTDKKIRRIIASTDAYLRKFSV---DLPVRFDIITLVGEKTPFTIEHIEEAFYP 118 Query: 130 H 130 Sbjct: 119 P 119 >UniRef50_B3QZF2 UPF0102 protein Ctha_1382 n=1 Tax=Chloroherpeton thalassium ATCC 35110 RepID=Y1382_CHLT3 Length = 129 Score = 134 bits (338), Expect = 1e-30, Method: Composition-based stats. Identities = 37/116 (31%), Positives = 58/116 (50%), Gaps = 4/116 (3%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 T G E A +L+ G + + N EIDLI ++ FVEV+ R + YG Sbjct: 3 TNVAFGKKGEDMASAFLKKCGYQILRRNYRSGNNEIDLITKKDNIVAFVEVKTRHNLNYG 62 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAFNDH 130 A +VT SKQ +L++ A+ ++ + RFDVVA +E + ++AFN+ Sbjct: 63 HPAEAVTLSKQKELIKAAQNFINDNPSQGVDY--RFDVVAIILDESK--RNAFNEP 114 >UniRef50_D1U5W3 Putative uncharacterized protein n=1 Tax=Desulfovibrio aespoeensis Aspo-2 RepID=D1U5W3_9DELT Length = 130 Score = 134 bits (337), Expect = 1e-30, Method: Composition-based stats. Identities = 38/122 (31%), Positives = 62/122 (50%), Gaps = 6/122 (4%) Query: 11 PRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRS 70 ++ K GD E A R+LE +G+R + N R E+DL+ R+G T +FVEV+ R + Sbjct: 5 DKRTPAKWRGDLGEDAAARYLESRGMRVLDRNWRYRQWELDLVCRDGDTLVFVEVKTRVA 64 Query: 71 ALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE----VEWIKDA 126 A + R+K+ +L++ A +L+ CRFD+ A VE ++A Sbjct: 65 GSMSAPADGLGRAKRARLVKAAARYLSAKG--LWDEPCRFDLAAVVDTGVSMDVEHTENA 122 Query: 127 FN 128 F+ Sbjct: 123 FD 124 >UniRef50_A5Z6D1 Putative uncharacterized protein n=1 Tax=Eubacterium ventriosum ATCC 27560 RepID=A5Z6D1_9FIRM Length = 134 Score = 134 bits (337), Expect = 1e-30, Method: Composition-based stats. Identities = 31/115 (26%), Positives = 54/115 (46%), Gaps = 2/115 (1%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 L + G +E +L G + N + GEID+I ++ F+EV++R S Y Sbjct: 20 LNKRGRGSFYEDVCVEYLIKNGFDILHRNYRCKLGEIDIIAKKDDIIRFIEVKFRGSDSY 79 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAFN 128 G A +V KQ ++++ A +L + + V C FDV+ NE + + + Sbjct: 80 GSALEAVDFRKQRRIMRAASWFLNEYGLN--DVQCSFDVMTVENNEARYYFNCYG 132 >UniRef50_A5D1I2 UPF0102 protein PTH_1707 n=1 Tax=Pelotomaculum thermopropionicum SI RepID=Y1707_PELTS Length = 120 Score = 133 bits (336), Expect = 1e-30, Method: Composition-based stats. Identities = 41/119 (34%), Positives = 59/119 (49%), Gaps = 8/119 (6%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 K G E A R+LE KG R ++ N R GE+DL++ +G +FVEVR R YG Sbjct: 4 ARKLLGRMGEEAAARYLEKKGCRILSRNHCCRLGELDLVVSDGDVLVFVEVRARTGEEYG 63 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN------EVEWIKDAF 127 A S+T K+ +L A +L + CRFDV+A + +E ++AF Sbjct: 64 LAQESITGRKKSRLRLLAWQYLKEKGKTGS--MCRFDVIAVLFDREGRVKRLEHFENAF 120 >UniRef50_Q1MYA7 Putative uncharacterized protein n=1 Tax=Bermanella marisrubri RepID=Q1MYA7_9GAMM Length = 124 Score = 133 bits (336), Expect = 1e-30, Method: Composition-based stats. Identities = 55/122 (45%), Positives = 76/122 (62%), Gaps = 3/122 (2%) Query: 8 SGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRY 67 S S + +TG +EAQA+++L +GL FI NVN + GE+DLIM+ + +FVEVRY Sbjct: 3 SNSDHSKSKIETGSFYEAQAKQFLVNQGLIFIEQNVNFKTGELDLIMKHNKHLVFVEVRY 62 Query: 68 RRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG--NEVEWIKD 125 R S YGGA S+T SKQ ++ + A+ +L +H CR DVVAF G + WIK+ Sbjct: 63 RSSQDYGGAVTSITASKQARVRRAAQTYLQKH-FGNRPPPCRIDVVAFEGANTKAIWIKN 121 Query: 126 AF 127 AF Sbjct: 122 AF 123 >UniRef50_B3PLA2 Putative uncharacterized protein n=1 Tax=Cellvibrio japonicus Ueda107 RepID=B3PLA2_CELJU Length = 126 Score = 133 bits (336), Expect = 1e-30, Method: Composition-based stats. Identities = 46/121 (38%), Positives = 67/121 (55%), Gaps = 5/121 (4%) Query: 12 RQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSA 71 R + G EA+A+ +LE +GL N + GEIDLIM EG T +FVEVR R + Sbjct: 5 RDDSHLVDGARAEARAQAYLEQQGLTTWMKNYRCKTGEIDLIMCEGDTLVFVEVRLRTNR 64 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN----EVEWIKDAF 127 + A S+T +K+ K+++TA+ +L D CRFD++A + EWI+DAF Sbjct: 65 FFSSAVESITPAKRQKMIRTAQRFLQERGLV-DKHACRFDIIALDAKGQHAKPEWIRDAF 123 Query: 128 N 128 Sbjct: 124 G 124 >UniRef50_B8KR63 Putative uncharacterized protein n=1 Tax=gamma proteobacterium NOR51-B RepID=B8KR63_9GAMM Length = 128 Score = 133 bits (336), Expect = 1e-30, Method: Composition-based stats. Identities = 43/121 (35%), Positives = 69/121 (57%), Gaps = 6/121 (4%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 + G+ WE +A +L G GL I N GEIDLI + +FVEVR R+ + +G Sbjct: 1 MRSEGNQWEIKAASFLRGHGLTIIVQNFTCPFGEIDLIGDDQGVIVFVEVRKRKRSRFGN 60 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF-----TGNEVEWIKDAFNDH 130 AA+SV R+KQ K++++A +L +H CRFDV+A+ + +W++ AF+ + Sbjct: 61 AASSVGRAKQKKIIRSAAFYLQQHGA-MADTHCRFDVIAYDVGADDPDTPKWLRSAFSAN 119 Query: 131 S 131 + Sbjct: 120 A 120 >UniRef50_C4FZ58 Putative uncharacterized protein n=1 Tax=Abiotrophia defectiva ATCC 49176 RepID=C4FZ58_ABIDE Length = 113 Score = 133 bits (336), Expect = 2e-30, Method: Composition-based stats. Identities = 36/112 (32%), Positives = 58/112 (51%), Gaps = 3/112 (2%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE-GRTTIFVEVRYRRSALYGGAA 77 G E +A +L KG + + N + GEID+I + VEV+YR S +G Sbjct: 1 MGKEKEEKAAAYLISKGYKILEKNYLRKTGEIDIIAKSADGYLTAVEVKYRSSDRFGSPF 60 Query: 78 ASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-EVEWIKDAFN 128 ++VT KQ K+ +T +++ HN S + RFDV+ G+ +E + +AF Sbjct: 61 SAVTYIKQRKICKTLLFYMSEHNISP-DIKSRFDVIGIYGDGRLEHLVNAFE 111 >UniRef50_Q2YCL8 UPF0102 protein Nmul_A0195 n=3 Tax=Nitrosomonadaceae RepID=Y195_NITMU Length = 119 Score = 133 bits (336), Expect = 2e-30, Method: Composition-based stats. Identities = 49/118 (41%), Positives = 69/118 (58%), Gaps = 6/118 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 +T + G+ E A +L G L + N R GEIDLIMR+G T +FVEVR R + + Sbjct: 1 MTLRLKGNQAERYAEAFLAGHRLVLVQRNYRCRFGEIDLIMRDGETLVFVEVRMRTNRNF 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE---VEWIKDAFN 128 G A +S+T SKQ K+++ AR +L CRFD V +GNE +EWI++AF+ Sbjct: 61 GDAGSSITLSKQRKVVRAARHYLLSLRTEPC---CRFDAVLLSGNEGRDIEWIRNAFD 115 >UniRef50_B8HR21 Putative uncharacterized protein n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HR21_CYAP4 Length = 164 Score = 133 bits (336), Expect = 2e-30, Method: Composition-based stats. Identities = 34/107 (31%), Positives = 56/107 (52%), Gaps = 2/107 (1%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG- 74 ++Q GD E WL +G + + N GEIDLI+++ FVEV+ R + Sbjct: 2 SRQVGDVGEMLVAHWLTAQGWQIVQRNWQCCWGEIDLILQQDEWLAFVEVKTRSRGNWDQ 61 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVE 121 ++T +KQ KL +TA L+L+ + +F + CRFD+ + +V Sbjct: 62 DGLLAITPTKQRKLWKTATLFLSEYP-NFADLSCRFDLALVSYVKVH 107 >UniRef50_D2R2U4 Putative uncharacterized protein n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R2U4_9PLAN Length = 150 Score = 133 bits (336), Expect = 2e-30, Method: Composition-based stats. Identities = 37/122 (30%), Positives = 58/122 (47%), Gaps = 8/122 (6%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 L K G E A +L +G +A + GEIDL+ +GRT +FVEV+ R + + Sbjct: 23 LQPKSLGRRGEDAAALFLRARGYWIVARSYRTSLGEIDLVAVDGRTIVFVEVKTRVRSDH 82 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE------VEWIKDAF 127 G +V KQ +L + A +L RH+ + RFD+V+ +E ++ AF Sbjct: 83 GQPFDAVHPDKQRRLTRLAAAYLKRHDLT--RYASRFDIVSILWPGGRKQPLIEHLQHAF 140 Query: 128 ND 129 Sbjct: 141 EA 142 >UniRef50_C4F8U2 Putative uncharacterized protein n=2 Tax=Collinsella RepID=C4F8U2_9ACTN Length = 158 Score = 133 bits (335), Expect = 2e-30, Method: Composition-based stats. Identities = 35/126 (27%), Positives = 53/126 (42%), Gaps = 13/126 (10%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIM--REGRTTIFVEVRYRRS 70 ++ K G E A R+LE +G I N GE DL+ ++ + VEV+ RRS Sbjct: 32 GMSNKLLGSLGEELAARYLEQRGYDIIDRNYRCPEGEADLVAYDQDDDGVVLVEVKTRRS 91 Query: 71 ALY---GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG-----NEVEW 122 +VT KQ + + A + A H + RFDV+ T E+ Sbjct: 92 RSERGGAYPEEAVTPEKQRRYRRIALCYAADH---YPVPSIRFDVIGVTLRPANIGEIRH 148 Query: 123 IKDAFN 128 + AF+ Sbjct: 149 LCGAFD 154 >UniRef50_A3HX52 Putative uncharacterized protein n=1 Tax=Algoriphagus sp. PR1 RepID=A3HX52_9SPHI Length = 118 Score = 133 bits (335), Expect = 2e-30, Method: Composition-based stats. Identities = 32/119 (26%), Positives = 53/119 (44%), Gaps = 8/119 (6%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 ++G E A +WL KG + + N EIDLI+ + +FVEV++R + Sbjct: 2 AEHNRSGQLAEEMAAQWLISKGYQLLEKNYRHGYAEIDLILTHKKLLVFVEVKFRSGTGF 61 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-----VEWIKDAF 127 G A V +K+ +++ A ++ N D RFD+V + +DAF Sbjct: 62 GYAEEFVDYTKRKLIIKAADHYIHEKN---WKSDIRFDIVGVYRDRTGAINYRHFEDAF 117 >UniRef50_C8WAJ1 Putative uncharacterized protein n=1 Tax=Atopobium parvulum DSM 20469 RepID=C8WAJ1_ATOPD Length = 172 Score = 133 bits (335), Expect = 2e-30, Method: Composition-based stats. Identities = 33/136 (24%), Positives = 58/136 (42%), Gaps = 11/136 (8%) Query: 3 TVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIF 62 +++++Q G+ E A ++L +G + I N + GE+D++ ++G + Sbjct: 38 DAQESRIPLEEMSSRQIGEKGEEIAAKYLIKRGYKIIQTNWTCQIGEVDIVAQDGDNVVL 97 Query: 63 VEVRYRR---SALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG-- 117 VEV+ RR +V R+KQ K A ++ A H RFDVVA Sbjct: 98 VEVKTRRVLNKDDSIMPELAVNRAKQEKYRTLALMYAALHP---ALTSIRFDVVAINLVA 154 Query: 118 ---NEVEWIKDAFNDH 130 + + AF+ Sbjct: 155 PSTASLRHLIGAFSWD 170 >UniRef50_C9LM05 Endonuclease n=1 Tax=Dialister invisus DSM 15470 RepID=C9LM05_9FIRM Length = 117 Score = 132 bits (334), Expect = 3e-30, Method: Composition-based stats. Identities = 40/119 (33%), Positives = 59/119 (49%), Gaps = 7/119 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + G E +A +LE KG+ + N + GEIDLIM++G +F+EV+ RRS LY Sbjct: 1 MGNTAFGRMGEDRACLYLEEKGMTLVTRNFRCKHGEIDLIMKDGSVFVFIEVKTRRSRLY 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF-----TGNEVEWIKDAF 127 G +VT KQ + TA ++L + V RFDVV + ++AF Sbjct: 61 GEPIEAVTVYKQRHIRYTAEVFLLARHLH--DVRIRFDVVEVMMAPGRAVRLRHTRNAF 117 >UniRef50_D1NRZ9 Putative endonuclease n=1 Tax=Bifidobacterium gallicum DSM 20093 RepID=D1NRZ9_9BIFI Length = 144 Score = 132 bits (334), Expect = 3e-30, Method: Composition-based stats. Identities = 35/120 (29%), Positives = 54/120 (45%), Gaps = 5/120 (4%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREG-RTTIFVEVRYRRSA 71 +L+ + G EA + WL G R + N + R GE+DLI FVE++ RR Sbjct: 25 RLSARDLGSWGEAASACWLRTHGWRIVGHNWHCRYGELDLIALSATDELAFVEIKTRRGC 84 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN----EVEWIKDAF 127 +G +V +KQ L + A LW+ + + V RFDV+ + + AF Sbjct: 85 QFGTPIEAVGVTKQTNLRRAAMLWMLEADHHINHVGIRFDVIGVLVHAGRIRFTHVPHAF 144 >UniRef50_A5FR87 UPF0102 protein DehaBAV1_0707 n=5 Tax=Dehalococcoides RepID=Y707_DEHSB Length = 121 Score = 132 bits (333), Expect = 3e-30, Method: Composition-based stats. Identities = 38/119 (31%), Positives = 60/119 (50%), Gaps = 6/119 (5%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 K+TG+ E A +L+G G I N GEID++ ++G +F+EVR +R YG Sbjct: 4 NRKETGEFGEKLAAEYLKGMGYSIIQTNCRLPEGEIDIVGQDGEYLVFIEVRTKRRLGYG 63 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT----GNEVEWIKDAFND 129 A SVT K+ L+ +A ++ +H + CR D V+ +E IK+A + Sbjct: 64 LPAESVTPRKKAHLMASAESYIQKHR--LEHFPCRIDFVSVDLSQPEPRLELIKNALGE 120 >UniRef50_B3ES88 Putative uncharacterized protein n=1 Tax=Candidatus Amoebophilus asiaticus 5a2 RepID=B3ES88_AMOA5 Length = 122 Score = 132 bits (333), Expect = 3e-30, Method: Composition-based stats. Identities = 33/120 (27%), Positives = 59/120 (49%), Gaps = 7/120 (5%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + + Q G +E A +L+ KGL + N + EID+I ++ F+EV+ R SA Sbjct: 6 EASPHQLGKKYEDLATSYLQQKGLMIMVRNYRYKKAEIDIIAQKDACLYFIEVKARTSAK 65 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE----VEWIKDAFN 128 +G A V KQ + A ++ +++ RFD++A + +E+ +DAF+ Sbjct: 66 FGYPEAFVNTYKQQLIKAAAENYILQND---WNSSIRFDIIAILDQKGCINLEYFEDAFS 122 >UniRef50_A6LSN5 UPF0102 protein Cbei_1183 n=5 Tax=Clostridium RepID=Y1183_CLOB8 Length = 123 Score = 132 bits (333), Expect = 4e-30, Method: Composition-based stats. Identities = 35/121 (28%), Positives = 56/121 (46%), Gaps = 8/121 (6%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 K G E A+++LE + N GEID+I ++ I VEV+ R + YG Sbjct: 5 NKDIGSFSEDLAKKYLEKNDYSILDCNFKNFLGEIDIICKKNTLLIIVEVKSRYNNNYGL 64 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN------EVEWIKDAFND 129 SV SKQ +++ A ++ + ++ RFDV+ N ++ IKDAF Sbjct: 65 PRESVNFSKQRSIIKVANSYI--NYKRLPNINVRFDVIEVYLNLESTNFKINHIKDAFRL 122 Query: 130 H 130 + Sbjct: 123 N 123 >UniRef50_UPI00016929A4 hypothetical protein Plarl_14719 n=1 Tax=Paenibacillus larvae subsp. larvae BRL-230010 RepID=UPI00016929A4 Length = 134 Score = 132 bits (333), Expect = 4e-30, Method: Composition-based stats. Identities = 45/135 (33%), Positives = 66/135 (48%), Gaps = 9/135 (6%) Query: 1 MATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLI-MREGRT 59 M + + + + K G E A R+L+ KG + ++ N R GE+D+I + E R Sbjct: 1 MNSDEFQRETKKLDGRKALGKRGEEIAVRYLKEKGFQILSQNWRCRTGEVDIILLEEPRC 60 Query: 60 TIFVEVRYRR-SALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG- 117 IF EVR RR + +G AA S+ R KQ ++ QTA ++ H F+ RFDVV Sbjct: 61 LIFTEVRSRRVTGKFGSAAESINRRKQQQIRQTALYYVYVHPP-FNRYTIRFDVVTVEFF 119 Query: 118 -----NEVEWIKDAF 127 + IK AF Sbjct: 120 PEKEDPVIHHIKAAF 134 >UniRef50_B0C8B9 UPF0102 protein AM1_3954 n=1 Tax=Acaryochloris marina MBIC11017 RepID=Y3954_ACAM1 Length = 172 Score = 132 bits (333), Expect = 4e-30, Method: Composition-based stats. Identities = 35/124 (28%), Positives = 52/124 (41%), Gaps = 11/124 (8%) Query: 2 ATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMR------ 55 P RQ Q G+ E +WL + + + R GEID+I R Sbjct: 1 MPSPAAPRPNRQSRNLQVGEWGEQLVCQWLTQQQWHILDRRWHCRWGEIDIIARSNPPLP 60 Query: 56 ---EGRTTIFVEVRYRRSALYG-GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFD 111 FVEV+ RR+ + ++T KQ KL +TA+L+L +H + C+FD Sbjct: 61 GQDSNTRLAFVEVKTRRAQNWDADGLLAITPQKQQKLWKTAQLYLKKHP-ELAELFCQFD 119 Query: 112 VVAF 115 V Sbjct: 120 VALV 123 >UniRef50_C7MNC2 Predicted endonuclease related to Holliday junction resolvase n=1 Tax=Cryptobacterium curtum DSM 15641 RepID=C7MNC2_CRYCD Length = 186 Score = 132 bits (333), Expect = 4e-30, Method: Composition-based stats. Identities = 25/112 (22%), Positives = 49/112 (43%), Gaps = 1/112 (0%) Query: 8 SGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRY 67 + + + ++ G E A +L +G + N + GE D+I + + F+EV+ Sbjct: 57 APTNKSQNNRELGRRGEDAAAAFLTRRGYEIVERNWMCQAGEADIIAQGEGSIHFIEVKT 116 Query: 68 RRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE 119 R SA G + +V K+ + + A +L N + + FDV++ Sbjct: 117 RSSAARGFPSEAVDAKKRSRYERIAECYLRSCN-NLPEMRVTFDVISILATG 167 >UniRef50_C4K712 Putative uncharacterized protein n=1 Tax=Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum) RepID=C4K712_HAMD5 Length = 118 Score = 132 bits (333), Expect = 4e-30, Method: Composition-based stats. Identities = 52/118 (44%), Positives = 75/118 (63%), Gaps = 1/118 (0%) Query: 11 PRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRS 70 + L+ ++ G +E ARR+LE GL F +NV R EIDLIMR+ +T +FVEVR++R+ Sbjct: 2 DKTLSRREIGFRYEMIARRYLEKAGLVFKESNVTLRSAEIDLIMRDQKTWVFVEVRFKRN 61 Query: 71 ALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAFN 128 + +G AA S+ KQ +L A +WL++ F RFDV A TGN+ EW ++AFN Sbjct: 62 SFFGSAADSINNKKQKRLRDAAAIWLSKRGSHF-NTSYRFDVFAITGNQFEWFQNAFN 118 >UniRef50_A1K3T3 UPF0102 protein azo0871 n=1 Tax=Azoarcus sp. BH72 RepID=Y871_AZOSB Length = 137 Score = 132 bits (332), Expect = 4e-30, Method: Composition-based stats. Identities = 43/118 (36%), Positives = 63/118 (53%), Gaps = 3/118 (2%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 + G E +A L +G+R +A N + RGGE+DL+ G +FVEVR R + +GG Sbjct: 20 MQARGREGEERAAAHLAAQGVRILARNRHCRGGELDLVGLHGDMLVFVEVRMRANPRFGG 79 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE---VEWIKDAFNDH 130 AAAS+T K+ +++ A+ WLA CRFDVV G W++ AF+ Sbjct: 80 AAASITAEKRRRVILAAQWWLAGEGRRHAHRPCRFDVVLLEGPATTPPTWLQAAFDAD 137 >UniRef50_C4XKL5 Putative uncharacterized protein n=1 Tax=Desulfovibrio magneticus RS-1 RepID=C4XKL5_DESMR Length = 133 Score = 132 bits (332), Expect = 5e-30, Method: Composition-based stats. Identities = 38/120 (31%), Positives = 59/120 (49%), Gaps = 6/120 (5%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + G EA A L KG + N RGGE+DL+ R+G T +FVEV+ R + Sbjct: 2 TAKHLEFGREGEAAAEAHLIAKGFAVVTRNYRARGGEVDLVCRDGDTVVFVEVKARGEGM 61 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN----EVEWIKDAFN 128 G +VT +K+ ++++ A +L+ + + CRFDVVA + DAF+ Sbjct: 62 RGRPEEAVTPAKRRRIVRAAAQFLSERD--WWDRPCRFDVVAVESRSGHLTASHVADAFS 119 >UniRef50_C2G0R5 Possible endonuclease n=2 Tax=Sphingobacterium spiritivorum RepID=C2G0R5_9SPHI Length = 118 Score = 132 bits (332), Expect = 5e-30, Method: Composition-based stats. Identities = 36/117 (30%), Positives = 52/117 (44%), Gaps = 6/117 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + G E A L G + +A N E+D++ +G +FVEV+ R S + Sbjct: 2 AQHLEQGKKGEQMALSHLTALGYQILALNWRTGKLEVDILAYDGDILVFVEVKTRSSNAH 61 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE---VEWIKDAF 127 G A V KQ KL++ AR + D RFD+V+ E + IKDAF Sbjct: 62 GEPADFVDIQKQRKLIRAARACIEERGHQG---DIRFDIVSVYLGEPAYIHLIKDAF 115 >UniRef50_C6XZ20 Putative uncharacterized protein n=2 Tax=Pedobacter RepID=C6XZ20_PEDHD Length = 120 Score = 131 bits (331), Expect = 5e-30, Method: Composition-based stats. Identities = 33/121 (27%), Positives = 51/121 (42%), Gaps = 8/121 (6%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 T G E A +LE G R + N E+D+I + IFVEV+ R S Y Sbjct: 2 ATHNDLGWRGEQIAVEYLENLGYRILNRNWKCARAEVDVIADQEGKLIFVEVKTRSSTDY 61 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-----EVEWIKDAFN 128 G V+ K+ +L + ++ N + RFD++A ++ I+DAF Sbjct: 62 GQPEEFVSYKKERQLEFASSAYIEMRNHQGE---IRFDIIAIVFENKDIYKINHIEDAFW 118 Query: 129 D 129 Sbjct: 119 P 119 >UniRef50_Q0AFH8 UPF0102 protein Neut_1662 n=2 Tax=Proteobacteria RepID=Y1662_NITEC Length = 116 Score = 131 bits (331), Expect = 5e-30, Method: Composition-based stats. Identities = 53/116 (45%), Positives = 76/116 (65%), Gaps = 4/116 (3%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 +TK G E QA +L+ + L + N R GEIDLIM++G T +FVEVR R + L+G Sbjct: 3 STKNKGSDAEQQATIFLQQQQLTLLEKNYRCRFGEIDLIMQDGDTVVFVEVRMRVNQLFG 62 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-VEWIKDAFND 129 GAAAS+T +KQ KL + AR +LAR + + CRFD + +GN +EWI++AF++ Sbjct: 63 GAAASITPAKQLKLTRAARHYLARCD---EDFPCRFDAILISGNREIEWIQNAFDE 115 >UniRef50_D0LAN4 Putative uncharacterized protein n=1 Tax=Gordonia bronchialis DSM 43247 RepID=D0LAN4_GORB4 Length = 137 Score = 131 bits (331), Expect = 6e-30, Method: Composition-based stats. Identities = 37/138 (26%), Positives = 61/138 (44%), Gaps = 13/138 (9%) Query: 1 MATVPTRSGSPRQLTTKQ-TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRT 59 M PR+ ++ G E A ++ +G R + N R GE+DLI +GR Sbjct: 1 MTAHSAAEPGPRRADRRRHIGHLGEDIAAEFVTNRGWRVLHRNWRNRYGELDLIAADGRV 60 Query: 60 TIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN- 118 + VEV+ R S +Y +VT +K ++ + R+WL+ NGS+ RFDV++ + Sbjct: 61 LVVVEVKTRASLMYSDPLEAVTPAKLSRMRKLTRMWLSEQNGSWS--QIRFDVISVQLDP 118 Query: 119 ---------EVEWIKDAF 127 + F Sbjct: 119 HHPDDRASARIRHHLGVF 136 >UniRef50_C6IVU3 Putative uncharacterized protein n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6IVU3_9BACL Length = 130 Score = 131 bits (331), Expect = 6e-30, Method: Composition-based stats. Identities = 39/123 (31%), Positives = 55/123 (44%), Gaps = 9/123 (7%) Query: 12 RQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRS- 70 R K+ G E A L +G + N R GE+D+I R+ + VEVR R Sbjct: 10 RGDGRKERGRKAEQAACEHLISQGYTILERNWRCRSGELDIIARKRDVLVNVEVRSRSQQ 69 Query: 71 -ALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-----VEWIK 124 A +G A SV K ++ TA ++L H + RFDV+A T +E I+ Sbjct: 70 AAAFGTPAESVNARKIKQVRDTAAVYL--HRTGQSDANLRFDVIAVTFGRGDNIALEHIQ 127 Query: 125 DAF 127 AF Sbjct: 128 AAF 130 >UniRef50_D2SDZ8 Putative uncharacterized protein n=1 Tax=Geodermatophilus obscurus DSM 43160 RepID=D2SDZ8_9ACTO Length = 139 Score = 131 bits (331), Expect = 6e-30, Method: Composition-based stats. Identities = 44/135 (32%), Positives = 62/135 (45%), Gaps = 10/135 (7%) Query: 1 MATVPTRSGSP---RQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREG 57 + P RSG R TT G E A +L GLR + N R GE+D++ R+G Sbjct: 6 LRDRPDRSGPTTVGRVRTTSDLGAHGERIAAAYLTDSGLRVLDRNWRCRDGELDIVARDG 65 Query: 58 RTTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG 117 +F EV+ RR+ +G +V KQ +L A+ WLA H+ + RFDVV Sbjct: 66 DALVFCEVKTRRAVGFGHPVEAVGHVKQRRLRVLAQRWLAAHDERA--PELRFDVVGVLV 123 Query: 118 NE-----VEWIKDAF 127 V ++ AF Sbjct: 124 RVDRPALVTHLRAAF 138 >UniRef50_Q15PJ2 UPF0102 protein Patl_3694 n=1 Tax=Pseudoalteromonas atlantica T6c RepID=Y3694_PSEA6 Length = 114 Score = 131 bits (330), Expect = 8e-30, Method: Composition-based stats. Identities = 32/111 (28%), Positives = 54/111 (48%), Gaps = 5/111 (4%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAA 78 G EAQA +L+ +GL + N R GEID+IMR+ + +FVEV+YR +G A Sbjct: 2 KGAQGEAQALAYLKQQGLTLVTQNYRCRSGEIDIIMRDHQELVFVEVKYRSGQQFGSAVE 61 Query: 79 SVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT-----GNEVEWIK 124 K+ K + ++ + + + R D+V +++ W+K Sbjct: 62 FFHPHKRRKFESAIQHYMLDNKLNPSLIAHRIDIVGIDVLSNNNDKISWLK 112 >UniRef50_C2D6J2 Putative uncharacterized protein n=1 Tax=Atopobium vaginae DSM 15829 RepID=C2D6J2_9ACTN Length = 176 Score = 131 bits (330), Expect = 8e-30, Method: Composition-based stats. Identities = 34/132 (25%), Positives = 57/132 (43%), Gaps = 10/132 (7%) Query: 6 TRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEV 65 R + L +K+ G E A +LE + + N GE+D+I +G T+FVEV Sbjct: 44 PRKSAVNTLNSKELGALGENLACCFLERQDFEILDRNWKCADGEVDIIASKGDETVFVEV 103 Query: 66 RYRRSALYGG--AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF-----TGN 118 + R G +V + KQ + + AR + + + RFDV+A + Sbjct: 104 KTRLQNKSGELFPEIAVDKQKQSRYIALARSYNTAYPMCE---NIRFDVIALAILDDSHA 160 Query: 119 EVEWIKDAFNDH 130 ++ I+ AF Sbjct: 161 QLRHIQSAFEWD 172 >UniRef50_C9MX50 Endonuclease n=2 Tax=Leptotrichia RepID=C9MX50_9FUSO Length = 121 Score = 131 bits (330), Expect = 8e-30, Method: Composition-based stats. Identities = 40/123 (32%), Positives = 72/123 (58%), Gaps = 7/123 (5%) Query: 9 GSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIM--REGRTTIFVEVR 66 G + ++ G +E A+ +L +GL F+ +N R GEIDLI ++ +T +FVEV+ Sbjct: 2 GQEYSMNKREIGFKYENVAKEYLILQGLTFVESNFYTRFGEIDLIFFEKKSQTLVFVEVK 61 Query: 67 YRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG--NEVEWIK 124 YR++ +G A VT KQ+K+L +++++L + + R+D+V + +EW+K Sbjct: 62 YRKNDFFGSAIEMVTEEKQNKILASSQIYLLKK---EWDKNVRYDIVGVSRGSGSIEWLK 118 Query: 125 DAF 127 +AF Sbjct: 119 NAF 121 >UniRef50_UPI0001C37581 hypothetical protein RflaF_17327 n=1 Tax=Ruminococcus flavefaciens FD-1 RepID=UPI0001C37581 Length = 125 Score = 131 bits (330), Expect = 8e-30, Method: Composition-based stats. Identities = 36/120 (30%), Positives = 58/120 (48%), Gaps = 8/120 (6%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 +T +TG E +L G G R IA N +GGEID+I G FVEV+ R+ Sbjct: 1 MTKSETGKLGEESVCSYLLGMGYRIIARNYRIKGGEIDIIAENGDYIAFVEVKSRKPDSL 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDV--VAFTGNE---VEWIKDAFN 128 +V++ KQ +++TA + +H + RFDV V ++++ +AF+ Sbjct: 61 VSGFEAVSKRKQGLIIKTAADYCLKHPNVWQP---RFDVASVIIENGRVLSIDYVTNAFD 117 >UniRef50_C9R878 Putative uncharacterized protein n=1 Tax=Ammonifex degensii KC4 RepID=C9R878_AMMDK Length = 114 Score = 131 bits (330), Expect = 8e-30, Method: Composition-based stats. Identities = 42/114 (36%), Positives = 55/114 (48%), Gaps = 10/114 (8%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAA 78 G E A +L G I N R GEIDLI REG T +FVEVR R + +G Sbjct: 2 RGKRAEEVAAVYLRKAGWEIIERNYRCRWGEIDLIAREGETIVFVEVRSRSNLAFGLPEE 61 Query: 79 SVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-------EVEWIKD 125 S+ R KQ KL + AR +LAR CRFDV+A + + +++ Sbjct: 62 SIGRRKQEKLRKVARYFLARLGREL---PCRFDVIAVAWDAATGEIKSLRHLRN 112 >UniRef50_C7LP67 Putative uncharacterized protein n=1 Tax=Desulfomicrobium baculatum DSM 4028 RepID=C7LP67_DESBD Length = 134 Score = 130 bits (328), Expect = 1e-29, Method: Composition-based stats. Identities = 38/120 (31%), Positives = 58/120 (48%), Gaps = 6/120 (5%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 TG A E A +L KG+R + N GEIDLI + T +FVEV+ R A+ Sbjct: 2 AARHLITGQAGEELAAAFLVEKGMRIVERNFRCASGEIDLICEDAGTIVFVEVKTRSGAV 61 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG----NEVEWIKDAFN 128 G ++ +K+ +L++ L+L+RH + CRFD+V VE +D + Sbjct: 62 RGEPGEAIGPAKKKRLIKAGALYLSRHR--AWSRPCRFDLVGILFLHGETVVEHWEDIID 119 >UniRef50_D1PKZ1 Putative choloylglycine hydrolase n=1 Tax=Subdoligranulum variabile DSM 15176 RepID=D1PKZ1_9FIRM Length = 117 Score = 130 bits (328), Expect = 1e-29, Method: Composition-based stats. Identities = 37/117 (31%), Positives = 53/117 (45%), Gaps = 6/117 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 ++ G EA A ++ +G + N R GEIDLI+ + +F EV+ R + Sbjct: 2 SRNIGQKGEAIAAQYYRQRGYLVLGHNYRTRMGEIDLILYKEDLIVFAEVKTRTGRMLAT 61 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG----NEVEWIKDAFN 128 A +V KQ +L A +L N F + RFDVV T +V I DAF Sbjct: 62 PAEAVDLHKQQRLRLAAERYLQ--NSPFSEANVRFDVVEVTPAAKGWQVHCIMDAFQ 116 >UniRef50_Q1LHS4 UPF0102 protein Rmet_3430 n=2 Tax=Betaproteobacteria RepID=Y3430_RALME Length = 134 Score = 130 bits (328), Expect = 1e-29, Method: Composition-based stats. Identities = 49/132 (37%), Positives = 72/132 (54%), Gaps = 11/132 (8%) Query: 1 MATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMR-EGRT 59 M P S + + G E +A +L+ +GL + N +GGEIDLIMR T Sbjct: 1 MPARPASSTT-------RQGALAEDRALAYLQRQGLVAVERNYRCKGGEIDLIMRAADDT 53 Query: 60 TIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE 119 +FVEVR R +GGAAAS+T +KQ ++L+ A +LA + CR DVVA Sbjct: 54 LVFVEVRKRGGRGFGGAAASITLTKQRRVLRAASHYLATLD---RLPPCRVDVVALDPGR 110 Query: 120 VEWIKDAFNDHS 131 +EW+++AF+ + Sbjct: 111 LEWLRNAFDLGA 122 >UniRef50_C9MR57 Putative endonuclease n=2 Tax=Prevotella RepID=C9MR57_9BACT Length = 121 Score = 130 bits (328), Expect = 1e-29, Method: Composition-based stats. Identities = 31/119 (26%), Positives = 54/119 (45%), Gaps = 8/119 (6%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 G E A +L+ +G + N + ++D++ + + VEV+ R+ + Sbjct: 4 HNDIGKWGEEVAANYLQQQGYTILHRNWMYQHRDLDIVAMDAGALVIVEVKTRKDERFVN 63 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG-----NEVEWIKDAFND 129 A A+VT K L A ++ R+N S + RFD++ G +EV +KDAF Sbjct: 64 ADAAVTPQKVRSLSLAANAYVKRYNISLE---IRFDIITIVGCPDDKHEVRHVKDAFLP 119 >UniRef50_D2RIH7 Putative uncharacterized protein n=1 Tax=Acidaminococcus fermentans DSM 20731 RepID=D2RIH7_ACIFE Length = 118 Score = 130 bits (328), Expect = 1e-29, Method: Composition-based stats. Identities = 37/117 (31%), Positives = 57/117 (48%), Gaps = 6/117 (5%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 ++ G+ E A R+LE +G + N GEID+I R FVEV+ R S +G Sbjct: 4 QRRRFGNWGEDAAVRYLETRGYEILDRNYRSSWGEIDIIARYRGVLAFVEVKTRHSLKFG 63 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG--NEV--EWIKDAF 127 AA+VTR KQ +L +TA +L + RFD++ ++ +K+ F Sbjct: 64 RPAAAVTREKQIRLRKTAWCYLRENQVFRYRS--RFDIIEILDLYGKISLNHLKNCF 118 >UniRef50_C8WGY4 Putative uncharacterized protein n=2 Tax=Eggerthella lenta DSM 2243 RepID=C8WGY4_EGGLE Length = 173 Score = 130 bits (327), Expect = 1e-29, Method: Composition-based stats. Identities = 34/119 (28%), Positives = 57/119 (47%), Gaps = 7/119 (5%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 + G E A R+L+ +G + N GE D+I R+G + +FVEV+ R S G Sbjct: 55 RNAELGRRGEDAAARFLDRRGYEIVERNWTCAAGEADIIARDGDSVVFVEVKTRSSCDCG 114 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFD---VVAFTGNE--VEWIKDAFN 128 A +V +K+ + + A L+L + V RFD +VA + + + +AF+ Sbjct: 115 MPAEAVDEAKRDRYERIAALFLQGFDVV--DVPVRFDIVSIVAISPDRAMIRHHINAFS 171 >UniRef50_C1ZN06 Putative uncharacterized protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZN06_PLALI Length = 181 Score = 130 bits (327), Expect = 2e-29, Method: Composition-based stats. Identities = 40/131 (30%), Positives = 61/131 (46%), Gaps = 11/131 (8%) Query: 3 TVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIF 62 P S R L G+ EA+A ++L+ G + +A N+ R GEIDL+ EG T +F Sbjct: 52 RSPHSPSSHRTLN---IGEQGEARAEKYLKELGYQILARNLRTRLGEIDLLALEGETIVF 108 Query: 63 VEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN---- 118 +EV+ R+S G ++ KQ +L + A L + R DV+ TG Sbjct: 109 IEVKTRKSDARGRPEEAIHPRKQKQLSRVAMALLKSKG--WLHRQSRIDVITITGEPESP 166 Query: 119 --EVEWIKDAF 127 E+ + AF Sbjct: 167 DCELRHYRHAF 177 >UniRef50_Q3A2F1 UPF0102 protein Pcar_2217 n=2 Tax=Deltaproteobacteria RepID=Y2217_PELCD Length = 123 Score = 130 bits (327), Expect = 2e-29, Method: Composition-based stats. Identities = 37/120 (30%), Positives = 58/120 (48%), Gaps = 6/120 (5%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 G E A +L +G++ + N+ GE+D++ R R IFVEV+ RR +G Sbjct: 4 QRLSLGRWGEDIAAGYLRRQGMKILDRNIRTPVGELDIVARHKRMLIFVEVKTRRGISHG 63 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF----TGNEVEWIKDAFNDH 130 +V +KQ ++L+ A+ +LA D + RFDV+A EVE AF+ Sbjct: 64 YPQEAVGAAKQRQILRAAQWYLAERR--LDRLQPRFDVIAVRRRGDEAEVEHFPGAFDVD 121 >UniRef50_A4SC34 UPF0102 protein Cvib_0014 n=9 Tax=Chlorobiaceae RepID=Y014_PROVI Length = 131 Score = 130 bits (327), Expect = 2e-29, Method: Composition-based stats. Identities = 45/122 (36%), Positives = 56/122 (45%), Gaps = 12/122 (9%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAA 78 G E A +LE KG R + N EID+I +G T F+EV+ R SA G A Sbjct: 9 LGREGERIAAGFLEKKGYRIVQRNFRFHRNEIDIIAMDGETVCFIEVKTRSSATKGEPAE 68 Query: 79 SVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF----------TGNEVEWIKDAFN 128 +VT KQ ++ + A WLA S DCRFDVV VE DAF+ Sbjct: 69 AVTPGKQREIARAAEAWLAF--SSEGEPDCRFDVVGIIAEPLSGGRFRARSVELFADAFH 126 Query: 129 DH 130 D Sbjct: 127 DP 128 >UniRef50_A1U3H0 UPF0102 protein Maqu_2464 n=3 Tax=Marinobacter RepID=Y2464_MARAV Length = 123 Score = 130 bits (327), Expect = 2e-29, Method: Composition-based stats. Identities = 42/120 (35%), Positives = 62/120 (51%), Gaps = 7/120 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 +++ G +E A R+LE KG+R I NV+ RGGEIDLI + +F EVR+R Sbjct: 6 SRKLGQHYEGVAARYLESKGIRIIERNVHNRGGEIDLIGMDAEALVFFEVRFRADGALVD 65 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-----EVEWIKDAFNDH 130 +SV+ KQ +L++ A +L RH R DV+ T ++WIK+A Sbjct: 66 PISSVSAVKQQRLVRAASFYLHRHG--LWDRVSRIDVIGITPGHSSKYRIQWIKNAIQAD 123 >UniRef50_B7K4B3 Putative uncharacterized protein n=4 Tax=Cyanobacteria RepID=B7K4B3_CYAP8 Length = 144 Score = 130 bits (327), Expect = 2e-29, Method: Composition-based stats. Identities = 33/103 (32%), Positives = 43/103 (41%), Gaps = 4/103 (3%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRT--TIFVEVRYRRSALY 73 G E RWL+ +G + GGEIDLI T FVEV+ R + Sbjct: 1 MTSIGQLGENLVARWLQSQGWTILQQRWRCPGGEIDLIAHSQGTNLITFVEVKTRSRGNW 60 Query: 74 G-GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF 115 ++T KQ KL Q+A +LA + CRFDV Sbjct: 61 DADGLLAITPQKQVKLTQSAAYFLAEYP-HLADFPCRFDVALV 102 >UniRef50_C0ZFM4 Putative uncharacterized protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0ZFM4_BREBN Length = 125 Score = 130 bits (327), Expect = 2e-29, Method: Composition-based stats. Identities = 39/124 (31%), Positives = 58/124 (46%), Gaps = 13/124 (10%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 + G E A +L KG R + NV + GE+DLI +G+ +F+EVR RRS +G Sbjct: 4 RRRLLGQRGEQLAEGYLVNKGFRIVERNVRTKRGEMDLIALDGKCLVFIEVRTRRSQSFG 63 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT-----------GNEVEWI 123 A S+T K+ KL + A +L + + RFDV+A +E Sbjct: 64 TAGESITWKKKQKLRELALEYLQK--SAQPIPSFRFDVIAIYTGASTQGEDFMKPVIEHY 121 Query: 124 KDAF 127 + AF Sbjct: 122 ESAF 125 >UniRef50_C2HKC4 Possible endonuclease n=2 Tax=Finegoldia magna RepID=C2HKC4_PEPMA Length = 115 Score = 130 bits (327), Expect = 2e-29, Method: Composition-based stats. Identities = 31/113 (27%), Positives = 53/113 (46%), Gaps = 4/113 (3%) Query: 17 KQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGA 76 K G E A +L KG I N +ER GE+D++ + VEV+ R +G Sbjct: 5 KNRGKFAEDYACEYLIEKGYEIIDRNYSERIGELDIVCTYENYLVIVEVKARTDDKFGAP 64 Query: 77 AASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN--EVEWIKDAF 127 + VT KQ ++ +T +++ +++ RFDV+ + ++ DAF Sbjct: 65 SDFVTLGKQDRIRKTTEIYIDKND--LYDYQPRFDVIEIYLDNFKLNHYIDAF 115 >UniRef50_A1R7F9 UPF0102 protein AAur_2443 n=3 Tax=Micrococcaceae RepID=Y2443_ARTAT Length = 121 Score = 129 bits (326), Expect = 2e-29, Method: Composition-based stats. Identities = 31/112 (27%), Positives = 54/112 (48%), Gaps = 7/112 (6%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAA 78 G + EA A +LE +G+R + N GEID++ +G T + EV+ R+S YG Sbjct: 10 LGRSGEALAADFLENQGMRIVDRNWRCPDGEIDIVAIDGDTLVVAEVKTRKSLDYGHPFE 69 Query: 79 SVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVA-----FTGNEVEWIKD 125 +V +K +L + + W +H + R DVV+ ++E ++ Sbjct: 70 AVDAAKLARLHRLSSSWCRQHQLNAPRR--RIDVVSVIDNGVVEPQLEHLRG 119 >UniRef50_Q8R616 UPF0102 protein FN1370 n=9 Tax=Fusobacterium RepID=Y1370_FUSNN Length = 119 Score = 129 bits (326), Expect = 2e-29, Method: Composition-based stats. Identities = 31/112 (27%), Positives = 63/112 (56%), Gaps = 2/112 (1%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + T++ G+ +E ++ L + + + N + GEID+I + + IF+EV+YR++ + Sbjct: 1 MNTREIGNEYEDKSVEILVKEDYKILERNYQNKFGEIDIIAEKNKEIIFIEVKYRKTNKF 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 G +V R K K+L+ A ++ + RFD +++ G+E++WIK+ Sbjct: 61 GYGYEAVDRRKIMKILKLANYYIQSKK--YQDYKIRFDCMSYLGDELDWIKN 110 >UniRef50_Q1QVF6 UPF0102 protein Csal_2201 n=1 Tax=Chromohalobacter salexigens DSM 3043 RepID=Y2201_CHRSD Length = 123 Score = 129 bits (325), Expect = 3e-29, Method: Composition-based stats. Identities = 53/124 (42%), Positives = 72/124 (58%), Gaps = 7/124 (5%) Query: 10 SPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRR 69 S +++ G E +A WL GLR + AN + R GEIDLIMR+G T +F+EVR+RR Sbjct: 2 SNSNNDSRRRGLEMERRAADWLASHGLRLVDANQHARRGEIDLIMRDGDTLVFIEVRHRR 61 Query: 70 SALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN----EVEWIKD 125 A +G +VT +KQ +L+ AR +L R+ S CRFDVV TG EWI+ Sbjct: 62 DARHGHPFETVTAAKQRRLIGAARFYLHRNGLS---CACRFDVVGVTGTPPHLSFEWIRS 118 Query: 126 AFND 129 AF+ Sbjct: 119 AFDA 122 >UniRef50_A3N211 UPF0102 protein APL_1363 n=33 Tax=Pasteurellaceae RepID=Y1363_ACTP2 Length = 123 Score = 129 bits (325), Expect = 3e-29, Method: Composition-based stats. Identities = 58/116 (50%), Positives = 75/116 (64%), Gaps = 2/116 (1%) Query: 12 RQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSA 71 + LT + G +E +AR +LE GL+FIAAN + GE+DLIMR+G T +FVEVR R+S Sbjct: 5 KTLTKRSQGANFEQKAREFLERNGLKFIAANQQFKCGELDLIMRQGDTFVFVEVRQRKSN 64 Query: 72 LYGGAAASVTRSKQHKLLQTARLWL-ARHNGSFDTVDCRFDVVAFTGNEVE-WIKD 125 +G A S+ KQ K L A +WL RH S DT +CRFDVVAF GN+ WI + Sbjct: 65 RFGSAVESIDYRKQQKWLDAANMWLFTRHKQSLDTANCRFDVVAFEGNDPPLWIPN 120 >UniRef50_Q2JJU2 UPF0102 protein CYB_2119 n=3 Tax=Synechococcus RepID=Y2119_SYNJB Length = 133 Score = 129 bits (325), Expect = 3e-29, Method: Composition-based stats. Identities = 35/130 (26%), Positives = 61/130 (46%), Gaps = 11/130 (8%) Query: 9 GSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYR 68 PR+ + + TG+ E R++L +G + +A GE+DL+ + IFVEV+ R Sbjct: 3 PLPRRASLQNTGNVGEGWVRQYLCQQGWQILAQRWRCPWGELDLVAHKADVLIFVEVKTR 62 Query: 69 RSALYG-GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-------- 119 + G +V KQ +L++ A+ +L++H + CRFDV Sbjct: 63 SPGSWDRGGLLAVGIPKQRRLIRAAQAFLSQHP-HLSELSCRFDVALIERRASREGVSYA 121 Query: 120 -VEWIKDAFN 128 V+++ AF Sbjct: 122 LVDYLPAAFE 131 >UniRef50_A8UQV9 Putative uncharacterized protein n=1 Tax=Hydrogenivirga sp. 128-5-R1-1 RepID=A8UQV9_9AQUI Length = 111 Score = 129 bits (324), Expect = 4e-29, Method: Composition-based stats. Identities = 27/108 (25%), Positives = 50/108 (46%), Gaps = 3/108 (2%) Query: 18 QTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAA 77 + G +E +A +L+ KG +A N + R GEID++ R+G +FVEV+ + G A Sbjct: 2 RRGSEYEERACLYLQDKGYSIVARNYHCRSGEIDIVARQGGELVFVEVKGGKDTSLGHPA 61 Query: 78 ASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 K +++ A ++ R D+V + +E ++ Sbjct: 62 ERFNPRKLDRIIACAFRFMEEMGLEE---PFRVDLVVVLEDRIEHYEN 106 >UniRef50_Q313K2 UPF0102 protein Dde_1093 n=1 Tax=Desulfovibrio desulfuricans subsp. desulfuricans str. G20 RepID=Y1093_DESDG Length = 202 Score = 129 bits (324), Expect = 4e-29, Method: Composition-based stats. Identities = 29/111 (26%), Positives = 52/111 (46%), Gaps = 2/111 (1%) Query: 6 TRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEV 65 R + G E A +L G+R +A N E+D+I ++ T +F EV Sbjct: 3 ARRAAGSCPAHIAAGRLGEEAACAYLAASGMRILARNWRAGHLELDIIAQDNGTIVFAEV 62 Query: 66 RYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT 116 + R + ++T +K+ +L++ A +WL+ ++ CRFD+V T Sbjct: 63 KTRAARGLESPHEALTPAKRSRLVRAAGMWLSSNDM--WDRPCRFDLVCVT 111 >UniRef50_Q60CC4 UPF0102 protein MCA0184 n=1 Tax=Methylococcus capsulatus RepID=Y184_METCA Length = 123 Score = 129 bits (324), Expect = 4e-29, Method: Composition-based stats. Identities = 49/115 (42%), Positives = 63/115 (54%), Gaps = 7/115 (6%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAA 78 TG E+ +L +GLR I N R GEIDL+M EG T +FVEVRYR YGGA A Sbjct: 11 TGPQAESWTAEYLTARGLRLIERNYRCRLGEIDLVMAEGATLVFVEVRYRSGKRYGGALA 70 Query: 79 SVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN----EVEWIKDAFND 129 SV R K +LL TA+ ++ H + R DVVA + + EWI++A Sbjct: 71 SVDRHKCRRLLATAQHYMVEHRVTGA---VRLDVVAVSPGAAGPQAEWIRNAIEA 122 >UniRef50_Q1YQG9 Putative uncharacterized protein n=1 Tax=gamma proteobacterium HTCC2207 RepID=Q1YQG9_9GAMM Length = 133 Score = 129 bits (324), Expect = 4e-29, Method: Composition-based stats. Identities = 39/126 (30%), Positives = 60/126 (47%), Gaps = 11/126 (8%) Query: 10 SPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRR 69 P + +G E A +L+ KGL + N R GEIDLIMR+ +FVEVR+R Sbjct: 6 FPNNKKERLSGAEAEQLALDFLQAKGLELVVKNFRTRRGEIDLIMRDNAVLVFVEVRFRS 65 Query: 70 SALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE---------- 119 + +G A S+T K +L A+ ++ R + RFD VA + + Sbjct: 66 NLNFGTAEESITAQKCQRLSSAAQAYMQREGLTERVSG-RFDAVAISPAKPHRQSSGMYS 124 Query: 120 VEWIKD 125 + WI++ Sbjct: 125 INWIQN 130 >UniRef50_B0TH88 Putative uncharacterized protein n=1 Tax=Heliobacterium modesticaldum Ice1 RepID=B0TH88_HELMI Length = 119 Score = 128 bits (323), Expect = 5e-29, Method: Composition-based stats. Identities = 36/121 (29%), Positives = 55/121 (45%), Gaps = 7/121 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + G E +A + L G G I N GEIDLI+RE +FVEVR R S + Sbjct: 1 MNRVLLGRWGEERALQHLLGLGWSLICQNYRTPRGEIDLILRESNWIVFVEVRTRSSERF 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT-----GNEVEWIKDAFN 128 G +V K+ +L+ TA +L + G RFD+++ +++ I+ F Sbjct: 61 GRGEETVDYRKRRRLMATAGHFLGTYQGPPGDP--RFDLISILRLDSGEEQLQHIRGMFT 118 Query: 129 D 129 Sbjct: 119 P 119 >UniRef50_A5FKL6 UPF0102 protein Fjoh_1217 n=17 Tax=Bacteroidetes RepID=Y1217_FLAJ1 Length = 123 Score = 128 bits (323), Expect = 5e-29, Method: Composition-based stats. Identities = 28/119 (23%), Positives = 51/119 (42%), Gaps = 5/119 (4%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + G E A LE + + + N + E+D++ ++ + VEV+ R S + Sbjct: 2 AEHNELGKLGEDLAAEHLEKENYKILERNWVYKNAEVDILAQKENILVVVEVKTRSSLDF 61 Query: 74 GGAAASVTRSKQHKLLQTARLWL-ARHNGSFDTVDCRFDVVAFTGNE----VEWIKDAF 127 G V K L++ ++ R + ++ RFD+VA N +E + DAF Sbjct: 62 GSPQDFVKPKKIQLLIKAVNAYINYREKDFEEDINVRFDIVAIHKNGESFAIEHLTDAF 120 >UniRef50_C0VVC1 Endonuclease n=2 Tax=Corynebacterium glucuronolyticum RepID=C0VVC1_9CORY Length = 115 Score = 128 bits (323), Expect = 5e-29, Method: Composition-based stats. Identities = 43/111 (38%), Positives = 59/111 (53%), Gaps = 5/111 (4%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 G E ARR+ + +G F+AANV GEIDLIM+ G TT+FVEV+ R ++ G Sbjct: 4 KNLLLGRRGETIARRYYQDRGYGFVAANVRYTCGEIDLIMQHGDTTVFVEVKTRTNSAMG 63 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 GA +VT +K ++ + A WL RFDVV GNE+ + Sbjct: 64 GA-EAVTPAKLRRVQRAAMTWLEGKP----YRPIRFDVVEIIGNEITCFEG 109 >UniRef50_A1VIW8 UPF0102 protein Pnap_0271 n=10 Tax=Burkholderiales RepID=Y271_POLNA Length = 153 Score = 128 bits (323), Expect = 6e-29, Method: Composition-based stats. Identities = 58/141 (41%), Positives = 78/141 (55%), Gaps = 16/141 (11%) Query: 2 ATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERG---GEIDLIMR-EG 57 +T + P+Q+TTK GDA E+ AR +L G GLR+I +N G GEIDL+MR Sbjct: 15 STAGGAAALPKQVTTKSRGDAAESAARAYLVGAGLRWIESNYRTPGRGGGEIDLVMRVPD 74 Query: 58 RTTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG 117 T +FVEVR R SA +GGA AS++ KQ +++ AR +L R CRFDVV G Sbjct: 75 GTLVFVEVRQRSSASHGGAGASISAVKQRRIIFAARHYLMRF---ASLPPCRFDVVLVHG 131 Query: 118 ---------NEVEWIKDAFND 129 +EW+ AF+ Sbjct: 132 ALSGGESPQATIEWLPAAFDA 152 >UniRef50_A5WCR1 UPF0102 protein PsycPRwf_0497 n=1 Tax=Psychrobacter sp. PRwf-1 RepID=Y497_PSYWF Length = 142 Score = 128 bits (322), Expect = 6e-29, Method: Composition-based stats. Identities = 55/146 (37%), Positives = 78/146 (53%), Gaps = 20/146 (13%) Query: 1 MATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERG-GEIDLIMREGR- 58 M P SP+ ++ G +E A +L+ +GLR IA N + GEIDL++ E Sbjct: 1 MPANPELLISPK----QRQGGGYEQLAADFLQQQGLRLIARNWQQPKVGEIDLVLIEHGR 56 Query: 59 ---TTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF 115 +F EVR R+ YG A AS+TRSKQ KL++TAR +L RH+ + +CRFDVV F Sbjct: 57 SWNVLVFAEVRKRKLLGYGDALASITRSKQKKLIKTARYFL-RHHPEYADFECRFDVVGF 115 Query: 116 TGN----------EVEWIKDAFNDHS 131 T + EW++ AF + Sbjct: 116 TERTGRSGQGEPLQSEWLQGAFLAPA 141 >UniRef50_A3DDG4 UPF0102 protein Cthe_0758 n=6 Tax=Clostridia RepID=Y758_CLOTH Length = 130 Score = 128 bits (322), Expect = 6e-29, Method: Composition-based stats. Identities = 37/126 (29%), Positives = 57/126 (45%), Gaps = 12/126 (9%) Query: 12 RQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERG-GEIDLIMREGRTTIFVEVRYRRS 70 + + G EA A ++L+ + N R GEID+I RE FVEV+ R S Sbjct: 7 NKNNKRAAGSIGEAAAVQFLKENNYEILETNFRYRRLGEIDIISREKDYICFVEVKARSS 66 Query: 71 ALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF---------TGNEVE 121 YG +V KQ + + A+++L ++ + + RFDVV E+ Sbjct: 67 LGYGYPREAVNIRKQENIRRLAQIYLCKNRIN--DLKVRFDVVEVYMEKKGDDIEVKEIS 124 Query: 122 WIKDAF 127 IK+AF Sbjct: 125 LIKNAF 130 >UniRef50_B9ZKW9 Putative uncharacterized protein n=1 Tax=Thioalkalivibrio sp. K90mix RepID=B9ZKW9_9GAMM Length = 118 Score = 128 bits (322), Expect = 6e-29, Method: Composition-based stats. Identities = 43/112 (38%), Positives = 60/112 (53%), Gaps = 4/112 (3%) Query: 20 GDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAAS 79 G A E +A +L G+GL +A NV GE+DL+ REG T + VEVR R +GG A S Sbjct: 8 GQAAEDRAAHYLTGQGLILVARNVRRPWGELDLVAREGDTLVLVEVRKRSHRNFGGGAES 67 Query: 80 VTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-VEWIKDAFNDH 130 + K+ +LL+ A +L RFDVV G++ +EW+ DA Sbjct: 68 IDAGKRRRLLRAAEGYLQETRWQG---PVRFDVVLLDGDDTIEWLPDAIQGD 116 >UniRef50_Q11XW1 UPF0102 protein CHU_0465 n=1 Tax=Cytophaga hutchinsonii ATCC 33406 RepID=Y465_CYTH3 Length = 113 Score = 128 bits (322), Expect = 7e-29, Method: Composition-based stats. Identities = 40/113 (35%), Positives = 56/113 (49%), Gaps = 4/113 (3%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + Q G A E +A +LE +G + I N+ GEIDLI F+EV+YR+ Y Sbjct: 1 MEHIQKGIAGEQKACAFLEQQGYKIIEKNLRIGKGEIDLIAVHNNCMCFIEVKYRKHNRY 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEV-EWIKD 125 G VT+ K K+ +TA ++ N RFDVVA TG E+ + D Sbjct: 61 GFPEEFVTQKKLLKIQETAEAYIYTVNWQGR---IRFDVVAITGEELPVHLMD 110 >UniRef50_C6VV91 Putative uncharacterized protein n=2 Tax=Flexibacteraceae RepID=C6VV91_DYAFD Length = 119 Score = 128 bits (322), Expect = 7e-29, Method: Composition-based stats. Identities = 31/119 (26%), Positives = 53/119 (44%), Gaps = 8/119 (6%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 G E A +L KG + IA N EIDLI + IF+EV+ R +G Sbjct: 3 QANDLGRWGETTAASFLAEKGFKIIARNYRNWQSEIDLIAAKDDMLIFIEVKTRTGMAFG 62 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG-----NEVEWIKDAFN 128 V +K +++ A ++ + ++ D RFD+++ ++ I+DAF+ Sbjct: 63 MPEEFVNVTKARLIMRAAEQYI--FDVDWEN-DVRFDIISILVLPDGSTDIRHIEDAFS 118 >UniRef50_D1SBF6 Putative uncharacterized protein n=1 Tax=Micromonospora aurantiaca ATCC 27029 RepID=D1SBF6_9ACTO Length = 150 Score = 128 bits (322), Expect = 7e-29, Method: Composition-based stats. Identities = 44/136 (32%), Positives = 59/136 (43%), Gaps = 12/136 (8%) Query: 2 ATVPTRSGSPRQL-----TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE 56 + P S R L + G E A R L GLR +A N GEID+I E Sbjct: 17 SAPPATVASGRILAGMTNRNRAVGAYGERCALRHLIETGLRPVARNWRCPEGEIDIIAWE 76 Query: 57 GRTTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT 116 G EV+ RRS +G A +V R+K +L A WLA + + RFDV++ Sbjct: 77 GPVLAICEVKTRRSEQFGSPAEAVVRAKARRLRGLAARWLAETGTTAA--EVRFDVLSVR 134 Query: 117 -----GNEVEWIKDAF 127 VE ++ AF Sbjct: 135 LPLTGPARVEHLRGAF 150 >UniRef50_C1AG13 UPF0102 protein JTY_2914 n=20 Tax=Mycobacterium RepID=Y2914_MYCBT Length = 128 Score = 127 bits (321), Expect = 7e-29, Method: Composition-based stats. Identities = 40/126 (31%), Positives = 58/126 (46%), Gaps = 12/126 (9%) Query: 10 SPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREG--RTTIFVEVRY 67 + + +T Q G EA A +L GLR + N R GE+D+I + RT +FVEV+ Sbjct: 3 TLKTMTRVQLGAMGEALAVDYLTSMGLRILNRNWRCRYGELDVIACDAATRTVVFVEVKT 62 Query: 68 RRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN--------E 119 R YGG A +VT K +L + A LWLA + R DV+ E Sbjct: 63 RTGDGYGGLAHAVTERKVRRLRRLAGLWLADQEERWA--AVRIDVIGVRVGPKNSGRTPE 120 Query: 120 VEWIKD 125 + ++ Sbjct: 121 LTHLQG 126 >UniRef50_A6GLM3 Putative uncharacterized protein n=1 Tax=Limnobacter sp. MED105 RepID=A6GLM3_9BURK Length = 167 Score = 127 bits (321), Expect = 8e-29, Method: Composition-based stats. Identities = 44/128 (34%), Positives = 63/128 (49%), Gaps = 2/128 (1%) Query: 2 ATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTI 61 V ++ +L G E QA + LE GL + N R GEIDLIM G T + Sbjct: 38 PNVAEKAPPVHKLALLAEGQLAETQALQLLEKHGLILVTRNHRCRCGEIDLIMASGNTAV 97 Query: 62 FVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT-GNEV 120 VEVR R + +G A S++ KQ ++ + A+LW + RFDVVA G E Sbjct: 98 IVEVRLRNNKRHGSALESISSHKQARVSRCAKLWWVQQGQR-KFTHLRFDVVALENGTEP 156 Query: 121 EWIKDAFN 128 W+++A+ Sbjct: 157 RWVQNAWQ 164 >UniRef50_A4YJR8 UPF0102 protein BRADO0179 n=14 Tax=Rhizobiales RepID=Y179_BRASO Length = 141 Score = 127 bits (321), Expect = 8e-29, Method: Composition-based stats. Identities = 41/129 (31%), Positives = 63/129 (48%), Gaps = 4/129 (3%) Query: 2 ATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTI 61 A P ++ SP ++ +TG + EA+A L KG R +A GEIDLI R+ Sbjct: 14 APKPAKTASPERVAAFRTGLSAEARAAALLIAKGYRILAKRFRTPHGEIDLIARKRGLVA 73 Query: 62 FVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEV- 120 FVEV+ R A AA +VT +Q +++ A+ WL H ++ RFD + + Sbjct: 74 FVEVKAR--ASLDDAAYAVTPRQQQRIIDAAQAWLMAHP-DHAELELRFDAILVAPRSLP 130 Query: 121 EWIKDAFND 129 + AF+ Sbjct: 131 RHLMAAFDA 139 >UniRef50_D0GLC9 Putative uncharacterized protein n=1 Tax=Leptotrichia goodfellowii F0264 RepID=D0GLC9_9FUSO Length = 119 Score = 127 bits (320), Expect = 1e-28, Method: Composition-based stats. Identities = 42/121 (34%), Positives = 74/121 (61%), Gaps = 9/121 (7%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREG--RTTIFVEVRYRRS 70 + + ++ G +E A+ +LE + L FI +N + GEIDLI E T IFVEV+YR++ Sbjct: 2 RKSKREVGFEYEEIAKDYLEERKLLFIESNYYTKYGEIDLIFLEKSSETLIFVEVKYRKN 61 Query: 71 ALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE----VEWIKDA 126 +YG A +V + KQ K++Q+++++++++ R+DV+ GN+ + WIK+A Sbjct: 62 NIYGEAVEAVDKRKQEKIIQSSQIYISKNKWK---NSVRYDVIGIIGNKLKNDINWIKNA 118 Query: 127 F 127 F Sbjct: 119 F 119 >UniRef50_Q5R0L0 UPF0102 protein IL0423 n=1 Tax=Idiomarina loihiensis RepID=Y423_IDILO Length = 116 Score = 127 bits (320), Expect = 1e-28, Method: Composition-based stats. Identities = 35/111 (31%), Positives = 54/111 (48%), Gaps = 2/111 (1%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAA 78 TG E + +L+ L I N GGE+D+I R+G +F EV++R + Sbjct: 6 TGKRAELLSAEFLKKNNLTIICKNYRIDGGEVDIIARDGHYWVFCEVKFRDDESFAAVIE 65 Query: 79 SVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT--GNEVEWIKDAF 127 + + ++ TAR +L +N T RFDV+A ++EW KDAF Sbjct: 66 QIQPQQCRRIRYTARHYLLSNNIDEHTAAIRFDVIAIVGQPTKIEWFKDAF 116 >UniRef50_Q1D6H9 UPF0102 protein MXAN_3551 n=2 Tax=Cystobacterineae RepID=Y3551_MYXXD Length = 126 Score = 127 bits (320), Expect = 1e-28, Method: Composition-based stats. Identities = 38/125 (30%), Positives = 61/125 (48%), Gaps = 6/125 (4%) Query: 9 GSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYR 68 ++ G+A E A R+LE +G R N R GE+D++ FVEVR R Sbjct: 2 RRAAPAERREYGNAGEEAAVRFLEAQGWRVRDRNWTCRFGELDVVAERDDLVCFVEVRMR 61 Query: 69 RSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE----VEWIK 124 +A +G + SV+ +KQ ++++ A +L H+ RFDV++ G V+ I Sbjct: 62 STATWGDPSHSVSFAKQRRVVKAALRYLFAHDLRG--RMFRFDVISVVGRGERATVDHIP 119 Query: 125 DAFND 129 AF+ Sbjct: 120 GAFDA 124 >UniRef50_Q8XUC6 UPF0102 protein RSc3265 n=6 Tax=Proteobacteria RepID=Y3265_RALSO Length = 130 Score = 127 bits (320), Expect = 1e-28, Method: Composition-based stats. Identities = 49/111 (44%), Positives = 69/111 (62%), Gaps = 6/111 (5%) Query: 25 AQARRWLEGKGLRFIAANVNERGGEIDLIMRE-GRTTIFVEVR---YRRSALYGGAAASV 80 +A R+L+ +GL IA N + GEIDL+MR+ T +FVEVR R + +GGAAASV Sbjct: 22 DRALRYLQARGLSVIARNYRCKTGEIDLVMRDVAGTLVFVEVRARVARSAQRFGGAAASV 81 Query: 81 TRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAFNDHS 131 T +KQ +L+ A +LA H + CRFDV+A G +EW++DAF + Sbjct: 82 TPAKQRRLIAAAEDFLAGHP--GEVPACRFDVIAIDGTRIEWMRDAFGVEA 130 >UniRef50_A7NKS5 UPF0102 protein Rcas_2007 n=2 Tax=Roseiflexus RepID=Y2007_ROSCS Length = 124 Score = 127 bits (319), Expect = 1e-28, Method: Composition-based stats. Identities = 39/121 (32%), Positives = 56/121 (46%), Gaps = 7/121 (5%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 + GD E A R+L +G +A GEID++ R +FVEVR RR G Sbjct: 4 RRTRLGDWGETMAARFLARRGYEVLARKWRCAAGEIDIVARHDGDLVFVEVRTRRGRDPG 63 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN------EVEWIKDAFN 128 AA S+T +K+ +L+ A +LA H+ R DVVA + +E I A Sbjct: 64 MAAESITNAKRARLMALADAFLAAHDLP-SNTPWRIDVVAISVGLRAQEVSIEHIPYAVE 122 Query: 129 D 129 + Sbjct: 123 E 123 >UniRef50_A4AH12 Putative uncharacterized protein n=1 Tax=marine actinobacterium PHSC20C1 RepID=A4AH12_9ACTN Length = 118 Score = 127 bits (319), Expect = 1e-28, Method: Composition-based stats. Identities = 34/111 (30%), Positives = 50/111 (45%), Gaps = 7/111 (6%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAA 78 G E A L GL + N GE+D++ R+ +FVEV+ R S L+G Sbjct: 7 LGARGEELATDHLISAGLEILDRNWRCSQGELDIVARDQDDVVFVEVKTRSSVLFGHPFE 66 Query: 79 SVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-----EVEWIK 124 S+T +K +L + A +W H GS R D +A E+E +K Sbjct: 67 SITATKVARLRRLAAVWCDAHPGSGA--TVRIDAIAVIVPSRGAVEIEHLK 115 >UniRef50_Q4FQF2 UPF0102 protein Psyc_1908 n=2 Tax=Psychrobacter RepID=Y1908_PSYA2 Length = 173 Score = 127 bits (319), Expect = 1e-28, Method: Composition-based stats. Identities = 44/145 (30%), Positives = 70/145 (48%), Gaps = 30/145 (20%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANV-NERGGEIDLIMREGR----TTIFVEVRYRRS 70 ++ G +E A +L+ +GL IA N + GE+DL+M E T +F+EVR R Sbjct: 29 KQRQGGYFEQLACEFLQEQGLILIAKNWQRPKVGELDLVMLEKGQAWSTLVFIEVRQRNR 88 Query: 71 ALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVA---------------- 114 + +G AA SVT KQ K+++ AR +L +H + +CRFDV+A Sbjct: 89 SHFGDAALSVTAGKQRKIIKVARYFLHQHQ-KYSDYECRFDVIAYNTSNNKNSENETDIR 147 Query: 115 --------FTGNEVEWIKDAFNDHS 131 ++ EW++ AF + Sbjct: 148 LDNQLNQPLEKDQPEWLQGAFIASA 172 >UniRef50_A3YDY3 Putative uncharacterized protein n=1 Tax=Marinomonas sp. MED121 RepID=A3YDY3_9GAMM Length = 130 Score = 127 bits (319), Expect = 1e-28, Method: Composition-based stats. Identities = 38/119 (31%), Positives = 56/119 (47%), Gaps = 6/119 (5%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + + G E A+ +L + L I N + GEIDLI + +FVEVRYR+ Sbjct: 14 KKQSFNKGQLAEEAAKVFLLSQKLSMIEQNFICKLGEIDLICLDNGVIVFVEVRYRQDNS 73 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVA----FTGNEVEWIKDAF 127 G AA S+ KQ K+++ A+ WL ++ RFD V + W+K AF Sbjct: 74 RGSAAQSIHLGKQKKVIKAAQYWLLINHK--QDTPIRFDAVLFDQVIDNEHLTWLKSAF 130 >UniRef50_Q146Q2 UPF0102 protein Bxeno_A0149 n=9 Tax=Burkholderiaceae RepID=Y149_BURXL Length = 140 Score = 127 bits (319), Expect = 1e-28, Method: Composition-based stats. Identities = 52/117 (44%), Positives = 74/117 (63%), Gaps = 2/117 (1%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE-GRTTIFVEVRYRRSALYG 74 +K G A+EA+A+ +L+ + LRF+A NV RGGEIDL+MRE +FVEVR R YG Sbjct: 22 SKLVGAAFEARAQEFLQRQRLRFVARNVACRGGEIDLVMRERDGALVFVEVRARAQRRYG 81 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVD-CRFDVVAFTGNEVEWIKDAFNDH 130 GAAAS+ KQ ++++ A+ +LA + CRFDV+AF + W++DAF Sbjct: 82 GAAASIGWRKQQRIVRAAQHYLATRSSQLRDQPACRFDVIAFEAGRLVWLRDAFRAD 138 >UniRef50_D1W8G8 Putative uncharacterized protein n=1 Tax=Prevotella buccalis ATCC 35310 RepID=D1W8G8_9BACT Length = 134 Score = 127 bits (319), Expect = 2e-28, Method: Composition-based stats. Identities = 32/123 (26%), Positives = 51/123 (41%), Gaps = 10/123 (8%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGR--TTIFVEVRYRRSA 71 T + G E A +L +G EID+I T +FVEV+ RRS Sbjct: 12 ATHNKFGKWGEDTAVDYLHKQGYTIRERGWRHGKFEIDIIALSPDGITCVFVEVKTRRSD 71 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-----EVEWIKDA 126 + +V K L A +++ + + RFDV++ G+ ++E +DA Sbjct: 72 EVALPSDAVDEKKMRNLGIAADVYVKMFDIQEEL---RFDVISIVGSTAENMQIEHFEDA 128 Query: 127 FND 129 FN Sbjct: 129 FNP 131 >UniRef50_C0EXX9 Putative uncharacterized protein n=1 Tax=Eubacterium hallii DSM 3353 RepID=C0EXX9_9FIRM Length = 117 Score = 126 bits (318), Expect = 2e-28, Method: Composition-based stats. Identities = 37/114 (32%), Positives = 54/114 (47%), Gaps = 2/114 (1%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 K +G A E A +LEGKG+R + N GEID+I E + VEV+ R G Sbjct: 2 RKNSGGAAEEAAVLFLEGKGIRILERNFRSYHGEIDIIALEQEMILVVEVKMRSYGDCGT 61 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-EVEWIKDAFN 128 AA +V KQ ++ T + + + RFDV+ + WI++AF Sbjct: 62 AAEAVDFRKQKRICYTFNYYRMQRRL-AENTAVRFDVIEVDKDFRCHWIQNAFE 114 >UniRef50_A0Z7D7 Putative uncharacterized protein n=1 Tax=marine gamma proteobacterium HTCC2080 RepID=A0Z7D7_9GAMM Length = 123 Score = 126 bits (318), Expect = 2e-28, Method: Composition-based stats. Identities = 46/118 (38%), Positives = 63/118 (53%), Gaps = 5/118 (4%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 QTG E A+ +L +GLR +A NV R GEID+IM +G T +FVEVR R Sbjct: 5 NTQTGKDAEDYAQNFLITQGLRTVARNVCCRYGEIDIIMEQGITVVFVEVRLRAQKGLQT 64 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT----GNEVEWIKDAFND 129 A+SV+ KQ +L++TA L + R RFDV+A+ WI+ AF+ Sbjct: 65 GASSVSYRKQQRLIKTASLVIQRMP-ELQGRPVRFDVIAYDTLQKNRVPHWIQQAFDA 121 >UniRef50_C9RJM0 Putative uncharacterized protein n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RJM0_FIBSS Length = 138 Score = 126 bits (318), Expect = 2e-28, Method: Composition-based stats. Identities = 38/120 (31%), Positives = 57/120 (47%), Gaps = 7/120 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 + G+ E QA +L +G + + N GGE+D++ R+ T +FVEV+ + G Sbjct: 11 NRAKGNFIETQAVAFLMREGYQVVTRNYAYHGGELDIVARDNGTLVFVEVKSVWNNQEGN 70 Query: 76 AAASVTRSKQHKLLQTARLWLARHN---GSFDTVDCRFDVVAF----TGNEVEWIKDAFN 128 AA V KQ K+ QTA +LA CRFDV++ + IK+AF Sbjct: 71 PAARVNALKQKKIWQTACHFLATQKTIAPKGFDTPCRFDVLSARAYQEPLQFAHIKNAFE 130 >UniRef50_C8PW53 Putative uncharacterized protein n=1 Tax=Enhydrobacter aerosaccus SK60 RepID=C8PW53_9GAMM Length = 134 Score = 126 bits (317), Expect = 2e-28, Method: Composition-based stats. Identities = 46/130 (35%), Positives = 71/130 (54%), Gaps = 15/130 (11%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNER-GGEIDLIMREGR----TTIFVEVRY 67 ++ GD +E A+ +LE +GL F A N + + GE+DL+M E + VEVR Sbjct: 4 TAPKQRQGDYYETLAKHYLEAQGLTFFAKNWHYKNLGELDLVMLEPTQKIPCLVIVEVRQ 63 Query: 68 RRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG---------N 118 R+++ +G + S+T +KQ K+++T +L H FD D RFDVV++ G Sbjct: 64 RKASQFGTSLDSITPAKQRKIVKTTAAFLQAHP-QFDNFDIRFDVVSYEGAATAGQAVMP 122 Query: 119 EVEWIKDAFN 128 WIKDAF+ Sbjct: 123 TPTWIKDAFS 132 >UniRef50_D2RAN4 Putative uncharacterized protein n=1 Tax=Gardnerella vaginalis 409-05 RepID=D2RAN4_GARVA Length = 182 Score = 126 bits (317), Expect = 2e-28, Method: Composition-based stats. Identities = 35/108 (32%), Positives = 47/108 (43%), Gaps = 1/108 (0%) Query: 10 SPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRT-TIFVEVRYR 68 L +K+ G E A L KG I N + R GE+DL+M +FVEV+ R Sbjct: 40 QDDSLESKELGKLGETYATLRLIQKGWHVIDQNWHCRNGELDLVMITPEQKLVFVEVKTR 99 Query: 69 RSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT 116 RS G ++T+ K+ KL T WL RFD V+ Sbjct: 100 RSVRCGTPLEAITQEKRSKLRTTGMKWLEEFGSDIPHYRIRFDAVSIL 147 >UniRef50_A0YUK1 Putative uncharacterized protein n=2 Tax=Cyanobacteria RepID=A0YUK1_9CYAN Length = 176 Score = 126 bits (317), Expect = 2e-28, Method: Composition-based stats. Identities = 35/113 (30%), Positives = 52/113 (46%), Gaps = 12/113 (10%) Query: 18 QTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE----------GRTTIFVEVRY 67 + G E + WL+ +G + R GEIDLI RE T IFVEV+ Sbjct: 16 KIGTLGEQLVQAWLKQQGWEILFHQYRCRWGEIDLIAREVKDPKVQSKLDSTVIFVEVKT 75 Query: 68 RRSALYG-GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE 119 R + ++T SKQ KL+++A+++L+ H CRFDV + Sbjct: 76 RSKRNWDSDGLLAITPSKQTKLIKSAQIFLSDHP-ELADSPCRFDVALVRCDR 127 >UniRef50_A6SUE7 UPF0102 protein mma_0204 n=4 Tax=Betaproteobacteria RepID=Y204_JANMA Length = 123 Score = 126 bits (317), Expect = 3e-28, Method: Composition-based stats. Identities = 50/123 (40%), Positives = 75/123 (60%), Gaps = 4/123 (3%) Query: 8 SGSPRQLTTKQT-GDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVR 66 R T KQ G A E QA +L+ +GL+ + N +GGEIDL+M++G+ +FVEVR Sbjct: 4 PAFLRPRTAKQLAGQAGEDQALIYLQQQGLQLLERNFRCKGGEIDLLMQDGKALVFVEVR 63 Query: 67 YRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDA 126 R +GGAAAS+ +KQ +L+ A+++L R++ CRFDV+AF E+ W+K+A Sbjct: 64 MRSEKKFGGAAASIGTAKQKRLIIAAQIYLQRYSMP---PPCRFDVIAFDDKEMTWLKNA 120 Query: 127 FND 129 Sbjct: 121 IEA 123 >UniRef50_B8G6B1 UPF0102 protein Cagg_0930 n=3 Tax=Chloroflexus RepID=Y930_CHLAD Length = 123 Score = 126 bits (317), Expect = 3e-28, Method: Composition-based stats. Identities = 41/119 (34%), Positives = 59/119 (49%), Gaps = 9/119 (7%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 +Q GD E A +LE G IA N R GEID++ R+G +FVEVR RR Sbjct: 5 KRQLGDRGEQVAAVYLERCGYTIIARNWRCRNGEIDMVARDGDYLVFVEVRTRRDE---Y 61 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVA--FTGNEV---EWIKDAFND 129 A S+ K+ +L+ A +LA H+ +T R DV+A GN + + + A + Sbjct: 62 ALESLLMHKRQRLVTLAYHYLAEHDVP-ETTPWRIDVIALTVVGNRLVVTDHVMAAIGE 119 >UniRef50_C2KRF5 Possible endonuclease n=2 Tax=Mobiluncus mulieris RepID=C2KRF5_9ACTO Length = 173 Score = 126 bits (317), Expect = 3e-28, Method: Composition-based stats. Identities = 29/120 (24%), Positives = 53/120 (44%), Gaps = 3/120 (2%) Query: 2 ATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGR-TT 60 A P ++ + ++ G A E A +L+ +G + + N R GE+D++ Sbjct: 44 ALKPPKAPR-KNPHNRELGLAGEELAVEFLQTQGYQVLDRNWRCRAGEVDIVALSPDSVL 102 Query: 61 IFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEV 120 FVEV+ R + +G A ++T +K ++ W H F D D+V+ + V Sbjct: 103 AFVEVKTRSTRRHGTPAEAITYAKLTRMRCVMGAWFRVHEAPF-HHDVSLDLVSVEWDGV 161 >UniRef50_Q8DI54 UPF0102 protein tll1737 n=1 Tax=Thermosynechococcus elongatus BP-1 RepID=Y1737_THEEB Length = 124 Score = 125 bits (316), Expect = 4e-28, Method: Composition-based stats. Identities = 34/123 (27%), Positives = 55/123 (44%), Gaps = 8/123 (6%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE-GRTTIFVEVRYRRSALYG 74 + GD EA WL+ + + +A N + GE+D+I + G +FVEV+ R S + Sbjct: 1 MRHVGDRGEAVVAAWLQTQQCQILAQNWSCPWGELDIIACDPGGVVLFVEVKTRGSYNWD 60 Query: 75 -GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE----VEW-IKDAFN 128 +++ SKQ KL+ A+ +L + CRFDV + + AF Sbjct: 61 RDGLDAISPSKQRKLILAAQAFLESQP-QWQEHPCRFDVALVRHQRGAYHLHHYLAQAFT 119 Query: 129 DHS 131 S Sbjct: 120 LDS 122 >UniRef50_C2BVF9 Possible endonuclease n=1 Tax=Mobiluncus curtisii ATCC 43063 RepID=C2BVF9_9ACTO Length = 178 Score = 125 bits (315), Expect = 4e-28, Method: Composition-based stats. Identities = 27/124 (21%), Positives = 49/124 (39%), Gaps = 6/124 (4%) Query: 8 SGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREG-RTTIFVEVR 66 S L +Q G A E A L+ G + N R GE+D++ FVEV+ Sbjct: 53 SPPRNNLHNRQLGMAGEEVAAESLKAAGYVIVDRNWRCRAGEVDIVALSPEGVLGFVEVK 112 Query: 67 YRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-----VE 121 R + +G ++T K ++ + WLA+ + D+ + + V+ Sbjct: 113 TRSNHRHGLPIEAITMKKLARMRRVMGAWLAQRDIVPVHRAVSLDLCSVDWDGHGEPVVK 172 Query: 122 WIKD 125 ++ Sbjct: 173 HLQG 176 >UniRef50_D2MIZ7 Putative uncharacterized protein n=1 Tax=Candidatus Poribacteria sp. WGA-A3 RepID=D2MIZ7_9BACT Length = 128 Score = 125 bits (315), Expect = 4e-28, Method: Composition-based stats. Identities = 48/121 (39%), Positives = 68/121 (56%), Gaps = 7/121 (5%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 LT + G EA A R L KG R + NV GE+D++ R G T IFVEV+ RR+ Sbjct: 2 SLTRQLLGKEAEAAAERLLRQKGYRILDRNVRIGRGELDIVARVGETVIFVEVKARRTDR 61 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-----VEWIKDAF 127 YGG A +VT K+ +L+Q A +LARH + CRFDV+ + + +E +++AF Sbjct: 62 YGGVAHAVTARKERQLIQLAARYLARHR--LERQPCRFDVLLYDAGDPGSPSLEHVENAF 119 Query: 128 N 128 Sbjct: 120 E 120 >UniRef50_B1XJM9 Putative uncharacterized protein n=1 Tax=Synechococcus sp. PCC 7002 RepID=B1XJM9_SYNP2 Length = 140 Score = 125 bits (315), Expect = 4e-28, Method: Composition-based stats. Identities = 37/144 (25%), Positives = 65/144 (45%), Gaps = 18/144 (12%) Query: 1 MATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRT- 59 MA P SP +L G+ E ++ L+ + + +A R GE+DL+ +T Sbjct: 1 MADTP----SPEKLAALAVGEQGELFVQQHLKSQDWQIVATRWRCRWGELDLVAFHAQTK 56 Query: 60 -TIFVEVRYRRSALYGG-AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG 117 FVEV+ R+ ++T SKQ K ++ A +L++ ++T CRFDV T Sbjct: 57 ILAFVEVKTRQQHSLDYQGLLAITPSKQRKTIRAAMQFLSKFP-QYETYGCRFDVALVTY 115 Query: 118 NE----------VEWIKDAFNDHS 131 ++ +++ AF + Sbjct: 116 SKTATFPQGFRLATYLEGAFEADA 139 >UniRef50_C7R327 Putative uncharacterized protein n=1 Tax=Jonesia denitrificans DSM 20603 RepID=C7R327_JONDD Length = 129 Score = 125 bits (314), Expect = 5e-28, Method: Composition-based stats. Identities = 34/125 (27%), Positives = 54/125 (43%), Gaps = 16/125 (12%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNER---GGEIDLIMREGRTTIFVEVRYRRSA 71 T G E A +WL+ +G + N GEID+I R+G T + VEV+ R + Sbjct: 4 RTYTLGQTGETYAAQWLQKRGYAILERNWRAAYPMRGEIDIIARDGATLVIVEVKTRTTQ 63 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG-----------NEV 120 G + +VT K +L + A WL + + RFDV++ + Sbjct: 64 HCGHPSEAVTPRKLTQLRRLAAAWLT--HAGVRPRELRFDVISVLAPSNRYATPTNEWHI 121 Query: 121 EWIKD 125 + +KD Sbjct: 122 DHLKD 126 >UniRef50_C9LKQ6 Putative uncharacterized protein n=1 Tax=Prevotella tannerae ATCC 51259 RepID=C9LKQ6_9BACT Length = 131 Score = 125 bits (314), Expect = 5e-28, Method: Composition-based stats. Identities = 31/120 (25%), Positives = 50/120 (41%), Gaps = 7/120 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 Q G E A +L R + N E+D+I +FVEV+ R Sbjct: 4 HNQLGALGEEVAAHYLSQLEYRLLERNWRTGHLEVDIIADYYGEIVFVEVKTRSYEAEYT 63 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN----EVEWIKDAFNDHS 131 A +V R+K+ L++ A ++ H+ CRFD++ G +V DA++ S Sbjct: 64 ALEAVDRTKKKHLVRAAHDYMHLHHL---DAACRFDIITVVGREAPFQVTHYIDAYSPKS 120 >UniRef50_Q2KU88 UPF0102 protein BAV3162 n=1 Tax=Bordetella avium 197N RepID=Y3162_BORA1 Length = 145 Score = 125 bits (314), Expect = 6e-28, Method: Composition-based stats. Identities = 49/125 (39%), Positives = 66/125 (52%), Gaps = 2/125 (1%) Query: 7 RSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVR 66 R G R G EAQ R L +GLR +A N R GE+DLIM +G + VEVR Sbjct: 9 RRGLIRPDPRHAQGKRAEAQGLRLLRAQGLRLLARNARNRHGELDLIMLDGEVLVVVEVR 68 Query: 67 YRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDA 126 +R + +GGAAAS+ +KQ +L + A WLA RFDV+AF + W++ A Sbjct: 69 WRSGSAFGGAAASIGPAKQARLARAAACWLA--GSEHAGRRLRFDVLAFEAGQARWLRGA 126 Query: 127 FNDHS 131 F + Sbjct: 127 FEPPA 131 >UniRef50_A6VXY8 UPF0102 protein Mmwyl1_2395 n=1 Tax=Marinomonas sp. MWYL1 RepID=Y2395_MARMS Length = 127 Score = 125 bits (314), Expect = 6e-28, Method: Composition-based stats. Identities = 46/128 (35%), Positives = 64/128 (50%), Gaps = 6/128 (4%) Query: 4 VPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFV 63 P S R+ K GD E A +L +GLRF+ N R GEIDLI + T +FV Sbjct: 2 RPVTSFLNRKKAPKNNGDKAEQAAEAFLRKQGLRFVERNFFCRIGEIDLIFLDQNTYVFV 61 Query: 64 EVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVA----FTGNE 119 EVR+R + +G AA S+ +SK K+ +A LWL ++N RFD + Sbjct: 62 EVRFRANNTHGNAAESLGQSKLKKVRNSAALWLQKNNKV--NNSSRFDAILFDEKIDSQH 119 Query: 120 VEWIKDAF 127 + W+K F Sbjct: 120 LTWLKAVF 127 >UniRef50_Q6A7T5 UPF0102 protein PPA1431 n=3 Tax=Propionibacterium acnes RepID=Y1431_PROAC Length = 140 Score = 124 bits (312), Expect = 9e-28, Method: Composition-based stats. Identities = 34/130 (26%), Positives = 53/130 (40%), Gaps = 7/130 (5%) Query: 1 MATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTT 60 + T + R+ G E A +++E G IA N GEIDLI R+ +T Sbjct: 11 LTTPRGAALRGRRGCRPAFGAWGEDLAAQYVESLGWTIIARNWTCDVGEIDLIARDDQTV 70 Query: 61 IFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG--- 117 +F+EV+ R +G S+T +K KL + A WL + R D + Sbjct: 71 VFIEVKARSGTGFGDPLESITTAKVRKLHELALAWLVNQDDGV--HSVRIDAIGVMVRPG 128 Query: 118 --NEVEWIKD 125 V ++ Sbjct: 129 AEPTVTHVRG 138 >UniRef50_C9PT16 Endonuclease n=5 Tax=Prevotella RepID=C9PT16_9BACT Length = 124 Score = 124 bits (312), Expect = 9e-28, Method: Composition-based stats. Identities = 34/123 (27%), Positives = 49/123 (39%), Gaps = 10/123 (8%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGR--TTIFVEVRYRRSA 71 G E A L+ +G + + +IDL+ T +FVEV+ R S Sbjct: 2 AKHNDLGKWGEDFAAEHLQKQGYVIRDRDWHCGKRDIDLVAITADMATVVFVEVKTRTSN 61 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-----EVEWIKDA 126 A +V R K L A ++ + N RFDVV G ++E I+DA Sbjct: 62 EVSEPADAVNRQKIRNLGIAANNYIKQFNVVEQ---VRFDVVTIVGTSRENAQLEHIEDA 118 Query: 127 FND 129 FN Sbjct: 119 FNP 121 >UniRef50_C7MB82 Putative uncharacterized protein n=1 Tax=Brachybacterium faecium DSM 4810 RepID=C7MB82_BRAFD Length = 153 Score = 124 bits (312), Expect = 1e-27, Method: Composition-based stats. Identities = 39/120 (32%), Positives = 58/120 (48%), Gaps = 7/120 (5%) Query: 10 SPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRR 69 R +TT Q G A E A L +G + + N+ R GE+D++ + T +FVEV+ RR Sbjct: 33 RVRDMTTAQLGRAGEELAASHLSAQGWQIVERNLRLRQGELDIVALDHATLVFVEVKTRR 92 Query: 70 SALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-----EVEWIK 124 S + G A+VT K +L + A +L S D R DVVA +E ++ Sbjct: 93 SFVTGVPQAAVTPDKLRRLRRLAGEYLMER--STPHRDVRIDVVAVHAQLDGTFSIEHLE 150 >UniRef50_B1VG84 Putative uncharacterized protein n=1 Tax=Corynebacterium urealyticum DSM 7109 RepID=B1VG84_CORU7 Length = 154 Score = 124 bits (312), Expect = 1e-27, Method: Composition-based stats. Identities = 33/114 (28%), Positives = 52/114 (45%), Gaps = 3/114 (2%) Query: 9 GSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREG-RTTIFVEVRY 67 G +T G E A +L +G R + N + R E+D+I + +FVEV+Y Sbjct: 36 GQNSADSTVGVGRQGENLAGEYLVNQGWRIVERNWHCRFAELDIIALDPAGEMVFVEVKY 95 Query: 68 RRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVE 121 R+ ++G +VT++K ++ A WLA D RFDV+ V Sbjct: 96 RKDTVHGTGVEAVTQTKLRRMRLAAGKWLAEQQRGVDV--VRFDVIDVGPGGVR 147 >UniRef50_B0MPN2 Putative uncharacterized protein n=1 Tax=Eubacterium siraeum DSM 15702 RepID=B0MPN2_9FIRM Length = 132 Score = 124 bits (312), Expect = 1e-27, Method: Composition-based stats. Identities = 30/118 (25%), Positives = 51/118 (43%), Gaps = 11/118 (9%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAA 78 G E +L G I N + GEID++ +G +FVEV+ RR Sbjct: 10 KGKLGEDFTADYLIKNGYDIITRNYRKPCGEIDIVASKGDILVFVEVKTRRYRSLVSGVE 69 Query: 79 SVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN--------EVEWIKDAFN 128 +V K+ +++ TA +LA + + R+D+ T + + + KDAF+ Sbjct: 70 AVGYKKKGRIIATADCFLAEYG---EEKQIRYDIAEVTVSTGDAVRVIDFRYFKDAFD 124 >UniRef50_A0QVA9 UPF0102 protein MSMEG_2508 n=6 Tax=Corynebacterineae RepID=Y2508_MYCS2 Length = 124 Score = 124 bits (312), Expect = 1e-27, Method: Composition-based stats. Identities = 34/109 (31%), Positives = 52/109 (47%), Gaps = 4/109 (3%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREG--RTTIFVEVRYRRS 70 LT + G E A L GL+ +A N R GE+D+I + T +FVEV+ R Sbjct: 5 SLTRAELGALGEEVAVEHLAALGLKTLARNWRCRYGELDIIAEDAATGTVVFVEVKTRSG 64 Query: 71 ALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE 119 +GG A +VT K ++ + A +WLA + + + R DV+ Sbjct: 65 DGFGGLAEAVTPQKVRRIRRLAAIWLAAQDAHWAVL--RIDVIGVRVGR 111 >UniRef50_C2BNU0 Endonuclease n=2 Tax=Corynebacterium RepID=C2BNU0_9CORY Length = 142 Score = 124 bits (311), Expect = 1e-27, Method: Composition-based stats. Identities = 46/124 (37%), Positives = 62/124 (50%), Gaps = 10/124 (8%) Query: 7 RSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREG-RTTIFVEV 65 R + + + G EA A R+L +G IAANV+ R GEIDLI RE T +FVEV Sbjct: 18 RLATKKPRHKQVLGKRGEAFAARYLHERGAEIIAANVSYRVGEIDLIAREPNGTIVFVEV 77 Query: 66 RYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE----VE 121 + R ++ YG A +VT K +L + A WL S + RFDV+A +E Sbjct: 78 KTRANSNYGVA-EAVTPQKLARLRKAAAQWLDGKPLS----EVRFDVIALVAQGQGFVLE 132 Query: 122 WIKD 125 K Sbjct: 133 HFKG 136 >UniRef50_Q025A4 UPF0102 protein Acid_2433 n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Y2433_SOLUE Length = 132 Score = 123 bits (310), Expect = 2e-27, Method: Composition-based stats. Identities = 35/113 (30%), Positives = 57/113 (50%), Gaps = 7/113 (6%) Query: 20 GDAWEAQARRWLEGKGLRFIAANVNE--RGGEIDLIMREGRTTIFVEVRYRRSALYGGAA 77 G E A R+L +G +A N GEIDL++ +G FVEV+ R S +G Sbjct: 22 GRIGEDLAHRYLRSQGCTVVARNYRTLAGTGEIDLVVWDGGRLAFVEVKTRSSTDFGPPE 81 Query: 78 ASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG---NEVEWIKDAF 127 ++V K+ +L AR ++ R + + RFD+V+ ++EW++ AF Sbjct: 82 SAVDAEKRDRLRTAARDYVRRADVDW--KAVRFDIVSVILQASPKIEWLRGAF 132 >UniRef50_B1WNM5 Putative uncharacterized protein n=3 Tax=Chroococcales RepID=B1WNM5_CYAA5 Length = 153 Score = 123 bits (310), Expect = 2e-27, Method: Composition-based stats. Identities = 32/106 (30%), Positives = 45/106 (42%), Gaps = 4/106 (3%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREG--RTTIFVEVRYRRSALY 73 G E +WL + + + GEID+I + T IFVEV+ R+S + Sbjct: 12 MTSIGKIGEQFVAQWLISQSWQILHERWRSPWGEIDIIAQHHHSNTIIFVEVKTRKSKNW 71 Query: 74 G-GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN 118 +VT KQ K+ QTA +L + F CRFDV Sbjct: 72 DQSGILAVTPQKQAKITQTASYFLGEYP-QFSNFICRFDVALVHHE 116 >UniRef50_D1ANU1 Putative uncharacterized protein n=1 Tax=Sebaldella termitidis ATCC 33386 RepID=D1ANU1_SEBTE Length = 111 Score = 123 bits (310), Expect = 2e-27, Method: Composition-based stats. Identities = 38/112 (33%), Positives = 67/112 (59%), Gaps = 3/112 (2%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + ++ G +E A+ +L GL ++ +N GEIDLI ++ IFVEV+YR+++ Y Sbjct: 1 MNKREKGFKYENAAKDFLINNGLEYVRSNYYSEYGEIDLIFKDRDFLIFVEVKYRKNSDY 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 G A SVT++K K++ + +++ N + CR+D+VA G E+ W+K+ Sbjct: 61 GFAEESVTQAKLKKIINASLNYISEVNWNEG---CRYDLVAINGEEIIWVKN 109 >UniRef50_Q6AEA8 UPF0102 protein Lxx14785 n=1 Tax=Leifsonia xyli subsp. xyli RepID=Y1478_LEIXX Length = 118 Score = 123 bits (309), Expect = 2e-27, Method: Composition-based stats. Identities = 32/117 (27%), Positives = 46/117 (39%), Gaps = 7/117 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + G E+ A WLE G I N R GEID+I R G T+FVEV+ R + Y Sbjct: 2 AKKDELGRRGESVAAHWLEAHGYVLIGRNWRIRSGEIDIIARTGNITVFVEVKTRATTHY 61 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT-----GNEVEWIKD 125 G ++T K +L + W + R D + E+ + Sbjct: 62 GHPLEAITPEKAARLRRLTAEWCRTYGPLPG--ALRVDAIGVLNAWSANPEIHHLPG 116 >UniRef50_A6NUN6 Putative uncharacterized protein n=1 Tax=Bacteroides capillosus ATCC 29799 RepID=A6NUN6_9BACE Length = 119 Score = 123 bits (309), Expect = 2e-27, Method: Composition-based stats. Identities = 36/122 (29%), Positives = 57/122 (46%), Gaps = 11/122 (9%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + T G E+ L +G R +A+ R GEIDLI +G +FVEV+ R+S + Sbjct: 1 MNTSLLGRWGESLVAEELRRRGCRVVASGYRTRFGEIDLIAEDGPYLLFVEVKLRKSDRF 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT--------GNEVEWIKD 125 A V R KQ ++ TA ++LA++ RFDV + ++++ Sbjct: 61 APGRAFVDRGKQERIRTTAEIYLAQNPTERQP---RFDVAEVYAPQGTATAHPRIVYLEN 117 Query: 126 AF 127 AF Sbjct: 118 AF 119 >UniRef50_Q6FD45 UPF0102 protein ACIAD1132 n=4 Tax=Acinetobacter RepID=Y1132_ACIAD Length = 140 Score = 122 bits (308), Expect = 2e-27, Method: Composition-based stats. Identities = 40/143 (27%), Positives = 66/143 (46%), Gaps = 21/143 (14%) Query: 1 MATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTT 60 M + + + G E QA L+ G + + N + R GEIDLI+ + Sbjct: 1 MPDMKP----VQDIHAHHLGKWAENQALNILQANGFKLVIRNFHSRVGEIDLIVAKADEL 56 Query: 61 IFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT-GNE 119 IFVEV+ R Y A + S+Q K+++TA+ +L R+ + CRFDV+ F ++ Sbjct: 57 IFVEVKARTLGSYAAANEVLLVSQQRKIIKTAQYFLNRYP-DYQQFYCRFDVICFDFPHK 115 Query: 120 V---------------EWIKDAF 127 + +WI++AF Sbjct: 116 IAKTVQQDFSKLRYDQQWIENAF 138 >UniRef50_Q55761 UPF0102 protein sll0189 n=1 Tax=Synechocystis sp. PCC 6803 RepID=Y189_SYNY3 Length = 150 Score = 122 bits (308), Expect = 2e-27, Method: Composition-based stats. Identities = 34/103 (33%), Positives = 46/103 (44%), Gaps = 4/103 (3%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRT--TIFVEVRYRRSALY 73 G A E+ WLE +G + + GEIDLI T FVEV+ R + Sbjct: 1 MTDLGQAGESLVAAWLEQQGGKILQQRWRSPWGEIDLITHFPDTKIIAFVEVKTRSGGNW 60 Query: 74 G-GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF 115 G +V KQ K+ QTA +LA + +CRFDV+ Sbjct: 61 DQGGLLAVNARKQEKIWQTANHFLASQP-QWSDWNCRFDVMIV 102 >UniRef50_B4VZG7 Putative uncharacterized protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VZG7_9CYAN Length = 177 Score = 122 bits (308), Expect = 3e-27, Method: Composition-based stats. Identities = 33/135 (24%), Positives = 48/135 (35%), Gaps = 31/135 (22%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRT---------------- 59 T G E WL+ +G + + R GEIDLI Sbjct: 2 TNAKGQLGEQLVATWLQAQGWTILHHRWHCRWGEIDLIAYRDGEVKENPVSNRDINPQFH 61 Query: 60 -------------TIFVEVRYRRSALYG-GAAASVTRSKQHKLLQTARLWLARHNGSFDT 105 FVEV+ R + ++T SKQ K+ +TA L+LA + Sbjct: 62 LPTPLPANAESPILGFVEVKTRSRGNWDADGQLAITSSKQAKIWRTAELFLAENP-DLSD 120 Query: 106 VDCRFDVVAFTGNEV 120 + CRFDV + + Sbjct: 121 LPCRFDVALVRYHRI 135 >UniRef50_Q24UC6 UPF0102 protein DSY2577 n=2 Tax=Desulfitobacterium hafniense RepID=Y2577_DESHY Length = 121 Score = 122 bits (307), Expect = 3e-27, Method: Composition-based stats. Identities = 33/118 (27%), Positives = 54/118 (45%), Gaps = 8/118 (6%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 + G E A + + GL + N GE+D+I REG T IF+EVR R + G Sbjct: 4 HRQALGRYGEELAVKHIRQAGLTVLECNYRCPLGEMDIIAREGETIIFIEVRTRSTGSRG 63 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG-------NEVEWIKD 125 S+T K+ +L + A +L ++ + RFD++A ++ WI+ Sbjct: 64 WGEESITAKKRERLYRIATHYL-KYRNYKEWPSLRFDLIAIRCQDQEGKQPDIIWIRG 120 >UniRef50_B2S4F0 UPF0102 protein TPASS_0913 n=3 Tax=Treponema RepID=Y913_TREPS Length = 126 Score = 122 bits (307), Expect = 3e-27, Method: Composition-based stats. Identities = 38/124 (30%), Positives = 58/124 (46%), Gaps = 8/124 (6%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 K G EA A RWL +G I N GEID+I ++ T +FVEV+ R Sbjct: 2 PKHNKLLGAFGEAYAARWLATRGYIIITRNWRRATGEIDIIAQQDDTIVFVEVKTLRCTS 61 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-------EVEWIKD 125 Y A V + KQ ++ +TA+ +LA ++ + RFDV+ + ++ + Sbjct: 62 YADLAIIVGKRKQKRICETAKHFLASAR-EYNHMCARFDVIVLRSDPFRRQDVDIVHLPH 120 Query: 126 AFND 129 AF D Sbjct: 121 AFED 124 >UniRef50_D1BMP5 Putative uncharacterized protein n=3 Tax=Veillonella RepID=D1BMP5_VEIPT Length = 132 Score = 122 bits (306), Expect = 5e-27, Method: Composition-based stats. Identities = 36/133 (27%), Positives = 61/133 (45%), Gaps = 7/133 (5%) Query: 1 MATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTT 60 M T+ T +L +K+ G E A ++E GL + N R GEID+I + Sbjct: 1 MKTISTGKAFN-ELDSKELGKWGERVATNYIEKIGLTVVDTNYRTRLGEIDIIAKRDLVY 59 Query: 61 IFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLAR-HNGSFDTVDCRFDVVAFTGNE 119 F+E++ RR +G A +VT+ KQ + + A L+L + + FDV+ +E Sbjct: 60 HFIEIKARRGMQHGLAREAVTKKKQKHIKRAAMLFLYDLNQKKRRWKEISFDVIEVYLHE 119 Query: 120 -----VEWIKDAF 127 + ++ F Sbjct: 120 DFQSSIHYLPQCF 132 >UniRef50_A3NEP2 UPF0102 protein BURPS668_3819 n=83 Tax=Proteobacteria RepID=Y3819_BURP6 Length = 144 Score = 122 bits (306), Expect = 5e-27, Method: Composition-based stats. Identities = 59/131 (45%), Positives = 79/131 (60%), Gaps = 5/131 (3%) Query: 2 ATVPTRSGSPRQL-TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE-GRT 59 R PR+ + + G A+E +A+R+LE GL +A NV RGGEIDL+MRE T Sbjct: 14 PEAAPRDNFPREAGSKRGIGAAFETRAQRFLERAGLALVARNVTVRGGEIDLVMRERDGT 73 Query: 60 TIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE 119 +FVEVR R ++ YGGAAAS+ K+ +LL A + AR G+ CRFDVVAF G Sbjct: 74 LVFVEVRARANSRYGGAAASIGVRKRMRLLLAAHAFWARTGGANA---CRFDVVAFEGGR 130 Query: 120 VEWIKDAFNDH 130 + W++DAF Sbjct: 131 LVWLRDAFRAD 141 >UniRef50_A6LF20 UPF0102 protein BDI_2565 n=4 Tax=Bacteroidales RepID=Y2565_PARD8 Length = 121 Score = 122 bits (306), Expect = 5e-27, Method: Composition-based stats. Identities = 28/121 (23%), Positives = 48/121 (39%), Gaps = 7/121 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 G E++AR +L G + N + E+D+I + I VEV+ R Sbjct: 2 ARQNDMGREGESEARAYLVKHGYNVLHTNWHWHHYELDIIAVKEDELIVVEVKTRSEDFL 61 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE----VEWIKDAFND 129 +V K +++ A ++ N + RFD+V E ++ I+DAF Sbjct: 62 LSPEDAVDTKKIRRIVAAADAYVRYFNI---DLPVRFDIVTLIKKETGFLIDHIEDAFYA 118 Query: 130 H 130 Sbjct: 119 P 119 >UniRef50_Q3AC88 UPF0102 protein CHY_1414 n=1 Tax=Carboxydothermus hydrogenoformans Z-2901 RepID=Y1414_CARHZ Length = 118 Score = 121 bits (305), Expect = 5e-27, Method: Composition-based stats. Identities = 27/119 (22%), Positives = 57/119 (47%), Gaps = 6/119 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + ++ G WE A ++L KG + + N RGGEID++ ++G +F+EVR+R + Sbjct: 1 MNRRELGQKWEELAEQYLRKKGYKILTRNYQIRGGEIDIVAQDGEFLVFIEVRFRSDISF 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE----VEWIKDAFN 128 G + +V K+ L + ++++ + + R D + + V ++ + Sbjct: 61 GTPSETVNEKKKASLKKAIKVYIHEN--FLYHLQPRVDFIGIEQKDNRFFVNHYQNVLD 117 >UniRef50_A4A5E8 Putative uncharacterized protein n=2 Tax=unclassified Gammaproteobacteria RepID=A4A5E8_9GAMM Length = 118 Score = 121 bits (305), Expect = 7e-27, Method: Composition-based stats. Identities = 45/114 (39%), Positives = 65/114 (57%), Gaps = 7/114 (6%) Query: 20 GDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAAS 79 GD +EA++ L+ GLR + + GEID+I + +FVEVR RR +GGAAAS Sbjct: 4 GDDFEARSAALLKSYGLRILDTQYRCKAGEIDIIACDEHHLLFVEVRARRHRSHGGAAAS 63 Query: 80 VTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN------EVEWIKDAF 127 V R+KQ ++ + A +L RH + + CRFDV+A+ E WI+ AF Sbjct: 64 VNRAKQCRIARCAAYFLNRHP-QWCHLPCRFDVIAWEPGCAGQSFEARWIQAAF 116 >UniRef50_A9B5H2 UPF0102 protein Haur_0145 n=1 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=Y145_HERA2 Length = 124 Score = 121 bits (304), Expect = 7e-27, Method: Composition-based stats. Identities = 35/116 (30%), Positives = 56/116 (48%), Gaps = 6/116 (5%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 K G E A +L+ G + IA+ + R GEIDLI + T + +EVR RR +G Sbjct: 4 DRKALGRWGEQYAAEYLQQLGYQLIASGWHCRWGEIDLIAYDQATLVIIEVRTRRGTAHG 63 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNG--SFDTVDCRFDVVAFTG----NEVEWIK 124 AA S+T K+ +L + + +L + + D R D +A T ++E + Sbjct: 64 SAAESLTLKKRQRLARLLQAYLQALDAAQTPWLGDYRIDAIAITLSRGQPQLEHFQ 119 >UniRef50_C2CWR1 Endonuclease n=1 Tax=Gardnerella vaginalis ATCC 14019 RepID=C2CWR1_GARVA Length = 153 Score = 121 bits (304), Expect = 8e-27, Method: Composition-based stats. Identities = 36/118 (30%), Positives = 53/118 (44%), Gaps = 7/118 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREG-RTTIFVEVRYRRSALYG 74 K G+ E A L K + N + R GE+D++M + +F+EV+ RRS +G Sbjct: 37 NKTIGNLGEEYASLKLILKNWILLDRNWHSRFGELDVVMMDPFGRIVFIEVKTRRSVRFG 96 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-----EVEWIKDAF 127 +VT K K + WL HN F RFDVV+ + ++ I AF Sbjct: 97 TPLEAVTNEKCLKTHKAGFKWLDEHNF-FKHRKIRFDVVSILISKDKNIQLRHILGAF 153 >UniRef50_D1BJ87 Predicted endonuclease related to Holliday junction resolvase n=1 Tax=Sanguibacter keddieii DSM 10542 RepID=D1BJ87_SANKS Length = 127 Score = 121 bits (304), Expect = 9e-27, Method: Composition-based stats. Identities = 33/121 (27%), Positives = 48/121 (39%), Gaps = 8/121 (6%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + G E R LEG G + N GGE+DL+ +GR + +EV+ R Sbjct: 5 RTDRAAVGRYGEELVARMLEGAGWVVVDRNWRGTGGELDLVALDGRELVVIEVKTRTGLG 64 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGS---FDTVDCRFDVVAFT-----GNEVEWIK 124 YG + +VT K +L + A WLA R DVV +V+ + Sbjct: 65 YGHPSEAVTPRKLARLRRLAGEWLAGRAAETVPERPTSVRVDVVGVLLEKGRPPQVDHLV 124 Query: 125 D 125 Sbjct: 125 G 125 >UniRef50_C0BHK9 Putative uncharacterized protein n=1 Tax=Flavobacteria bacterium MS024-2A RepID=C0BHK9_9BACT Length = 120 Score = 120 bits (303), Expect = 1e-26, Method: Composition-based stats. Identities = 29/118 (24%), Positives = 50/118 (42%), Gaps = 7/118 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 G E +A +L KG + N EIDL+M++ + +EV+ R + + Sbjct: 2 AQHNLFGQEAEQKALSFLCNKGYVLLEKNYRFGKAEIDLLMKDKDLLVCIEVKARSTDFF 61 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVA--FTGNEV--EWIKDAF 127 G + +T K L+ +L HN + RFDV++ + + I+ AF Sbjct: 62 GTPESFITSKKIKLLVGAVNHYLEYHNL---DYEVRFDVLSYTIKNKKWICKHIESAF 116 >UniRef50_A1W341 UPF0102 protein Ajs_0414 n=11 Tax=Betaproteobacteria RepID=Y414_ACISJ Length = 132 Score = 120 bits (303), Expect = 1e-26, Method: Composition-based stats. Identities = 47/125 (37%), Positives = 67/125 (53%), Gaps = 7/125 (5%) Query: 9 GSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERG---GEIDLIMRE-GRTTIFVE 64 GS TT+ G A E +A L GL + N G GEIDLI+RE T +FVE Sbjct: 10 GSAPARTTRAAGQAGEDRALAHLTAAGLALVERNYRTPGRGGGEIDLILRERDGTLVFVE 69 Query: 65 VRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIK 124 VR R ++ YGGA S+ +KQ +++ A+ +L R CRFD V G+ ++W++ Sbjct: 70 VRSRGASAYGGAGGSIGVAKQRRIVFAAQHYLLRWP---APPPCRFDAVLIEGDRLQWLR 126 Query: 125 DAFND 129 AF+ Sbjct: 127 GAFDA 131 >UniRef50_C0WKI9 Endonuclease n=3 Tax=Actinomycetales RepID=C0WKI9_9CORY Length = 132 Score = 120 bits (302), Expect = 1e-26, Method: Composition-based stats. Identities = 42/122 (34%), Positives = 59/122 (48%), Gaps = 10/122 (8%) Query: 9 GSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE-GRTTIFVEVRY 67 + ++ G E+ A +L +G IAANV+ R GEIDLI RE T +FVEV+ Sbjct: 10 ATKHPRHRQELGKRGESFAAGYLRERGSDIIAANVSYRVGEIDLIAREPDGTIVFVEVKT 69 Query: 68 RRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF----TGNEVEWI 123 R +A +G A +VT K ++ + A WL RFDV+A G E+E Sbjct: 70 RSTASFGTA-EAVTPHKLARMRRAAVQWLDGKPL----ATVRFDVIALVVNGEGFELEHF 124 Query: 124 KD 125 Sbjct: 125 TG 126 >UniRef50_Q4JV13 UPF0102 protein jk1180 n=2 Tax=Corynebacterium jeikeium RepID=Y1180_CORJK Length = 131 Score = 120 bits (302), Expect = 1e-26, Method: Composition-based stats. Identities = 34/131 (25%), Positives = 52/131 (39%), Gaps = 4/131 (3%) Query: 1 MATVPTRS-GSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREG-R 58 M + G+ E A +L G + N R GE+DL+ R Sbjct: 1 MPDHQEGEGRRVTAANRRAVGNLGEDLAAEYLHRAGYEVLDRNFYTRYGELDLVARTPED 60 Query: 59 TTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTV-DCRFDVVAFTG 117 F+EV+YR SA GG A+V K ++ A LWL ++ R DV+ Sbjct: 61 DLAFIEVKYRTSASDGGGVAAVGPRKLRRIRTLAGLWLEQNREGVQFSGGLRVDVIDVGP 120 Query: 118 NEV-EWIKDAF 127 + V E ++ + Sbjct: 121 DGVREHVEGVW 131 >UniRef50_B0MUK7 Putative uncharacterized protein n=1 Tax=Alistipes putredinis DSM 17216 RepID=B0MUK7_9BACT Length = 121 Score = 120 bits (301), Expect = 2e-26, Method: Composition-based stats. Identities = 32/121 (26%), Positives = 54/121 (44%), Gaps = 8/121 (6%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 TT+ TG E A RWL G + N + E+D++ T F+EV+ RR Sbjct: 3 TTQHTGRLGEETAARWLLDHGFTLLHRNWRQGHYELDIVAARKGTLHFIEVKTRRRDGLT 62 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT-----GNEVEWIKDAFND 129 ++ K+ L++ A +L + + + +FD++A EV +I+DA Sbjct: 63 PPEQALDSHKRRALVRAANAYLTENPFAGE---VQFDLIAVETAPAGTPEVRYIEDAIEL 119 Query: 130 H 130 H Sbjct: 120 H 120 >UniRef50_B4WKR8 Putative uncharacterized protein n=1 Tax=Synechococcus sp. PCC 7335 RepID=B4WKR8_9SYNE Length = 144 Score = 120 bits (301), Expect = 2e-26, Method: Composition-based stats. Identities = 34/111 (30%), Positives = 51/111 (45%), Gaps = 10/111 (9%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMR--------EGRTTIFVEV 65 L + G+ E +WL K + + + R GEIDLI + + T F+EV Sbjct: 2 LNPQDLGNYGEQLVCQWLTQKNCQILQRQWHSRFGEIDLIAKGISGQGSLKAETLAFIEV 61 Query: 66 RYRRSALYG-GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF 115 + R + ++TRSKQ K+ TAR +L RH + CRFD+ Sbjct: 62 KTRSKGNWDADGLLALTRSKQQKIRMTARYFLVRHP-HLSELPCRFDLALV 111 >UniRef50_UPI00017886BD protein of unknown function UPF0102 n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI00017886BD Length = 127 Score = 120 bits (301), Expect = 2e-26, Method: Composition-based stats. Identities = 38/128 (29%), Positives = 61/128 (47%), Gaps = 9/128 (7%) Query: 7 RSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVR 66 S S ++ KQ G A E A L KG R + N R GE+D++ G T + +EVR Sbjct: 2 TSPSGKKDNRKQKGAAAEELAAAALIQKGYRILDRNWRCRFGELDIVAETGETLVVIEVR 61 Query: 67 YRRS-ALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF------TGNE 119 R +G + SV K ++ TA+ ++ H + RFDV++ T + Sbjct: 62 SRSGTTRFGTPSESVNARKVMQVRNTAQQYV--HQKRYYERTIRFDVISVMLREDMTADS 119 Query: 120 VEWIKDAF 127 ++ I++AF Sbjct: 120 MDHIENAF 127 >UniRef50_UPI0001BC5BE0 endonuclease n=3 Tax=Fusobacterium RepID=UPI0001BC5BE0 Length = 123 Score = 119 bits (300), Expect = 2e-26, Method: Composition-based stats. Identities = 31/117 (26%), Positives = 53/117 (45%), Gaps = 3/117 (2%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 +Q G+ +E +A L + + N GEID+I + +F+EV+YR++ + Sbjct: 2 QNNRQKGNEYEERAVNILRENQYQILERNFRIFQGEIDIIAEKDGVLVFIEVKYRKNRNF 61 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD-AFND 129 G +V K K+ + A + + R DV+ F G+ W KD A+ D Sbjct: 62 GYGKEAVDSRKLGKIFRVAEYY--KTYCGKQYQKMRIDVIHFLGDTYFWEKDVAWGD 116 >UniRef50_C0WCB9 Endonuclease n=1 Tax=Acidaminococcus sp. D21 RepID=C0WCB9_9FIRM Length = 117 Score = 119 bits (300), Expect = 2e-26, Method: Composition-based stats. Identities = 34/114 (29%), Positives = 55/114 (48%), Gaps = 6/114 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 + G E A +L +G A N + GE+DL+ R+G +FVEV+ RR+ LYG Sbjct: 4 RTRFGRWGERAAAAYLRHQGYIIEAQNYSSSHGELDLVARKGHLLVFVEVKSRRTDLYGR 63 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT----GNEVEWIKD 125 +VT K ++ +TA +L H D R+DV+ ++ +K+ Sbjct: 64 PRDAVTEEKAARIRETAYEYLQDHKRPGDR--IRYDVIEIMMLFGHFQLNHLKN 115 >UniRef50_Q2NZA5 UPF0102 protein XOO3617 n=18 Tax=Xanthomonadaceae RepID=Y3617_XANOM Length = 122 Score = 119 bits (300), Expect = 3e-26, Method: Composition-based stats. Identities = 54/120 (45%), Positives = 71/120 (59%), Gaps = 3/120 (2%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 +Q G EA AR LE GLR + N N RGGE+DL+MR+G++ +FVEVRYRR Sbjct: 2 PAARQQRGAGVEAAARALLEQAGLRLVVGNANYRGGELDLVMRDGQSLVFVEVRYRRDDR 61 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVV--AFTGNEVEWIKDAFNDH 130 +GG AASV K+ KL+ A+L+L H + CRFDVV + + WI+DAF Sbjct: 62 FGGGAASVDWRKRRKLVLAAQLFLGAHPA-LAALPCRFDVVDASGEPPVLHWIRDAFRAD 120 >UniRef50_A5IKG8 UPF0102 protein Tpet_0671 n=6 Tax=Thermotogaceae RepID=Y671_THEP1 Length = 108 Score = 119 bits (299), Expect = 3e-26, Method: Composition-based stats. Identities = 27/108 (25%), Positives = 48/108 (44%), Gaps = 5/108 (4%) Query: 20 GDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAAS 79 E A ++L+ KG + + N + GEID++ R+GR +FVEV+ + Sbjct: 4 WKEAEELACKFLKKKGYKILERNYRTKYGEIDIVARDGREIVFVEVK--SGSGKVDPLER 61 Query: 80 VTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAF 127 + K L QTAR ++ ++ R D V T ++ + + Sbjct: 62 IDLKKVRNLEQTARFYMIQNKLKG---PARVDFVRVTPEGIDHFEGIW 106 >UniRef50_Q1IJG5 UPF0102 protein Acid345_3985 n=1 Tax=Candidatus Koribacter versatilis Ellin345 RepID=Y3985_ACIBL Length = 143 Score = 119 bits (298), Expect = 4e-26, Method: Composition-based stats. Identities = 32/125 (25%), Positives = 53/125 (42%), Gaps = 9/125 (7%) Query: 9 GSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERG--GEIDLIMREGRTTIFVEVR 66 P + +TG E A +L G +A N E+D+I G F+EV+ Sbjct: 17 PEPDEPEHLKTGRRGEELAYFFLRKHGYTIVARNFRTPWHKSELDIIGWNGGILCFIEVK 76 Query: 67 YRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE----VEW 122 R + A A+V +K++ L + AR +L + + RFD+V + + Sbjct: 77 TRTTRDIATAEAAVDDTKRNDLRRVARHYLRQ---CAENTPTRFDIVTVYLDRPKPEITI 133 Query: 123 IKDAF 127 +K AF Sbjct: 134 LKSAF 138 >UniRef50_A0LCU4 UPF0102 protein Mmc1_3298 n=1 Tax=Magnetococcus sp. MC-1 RepID=Y3298_MAGSM Length = 124 Score = 119 bits (298), Expect = 4e-26, Method: Composition-based stats. Identities = 37/119 (31%), Positives = 56/119 (47%), Gaps = 5/119 (4%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 LT K G+ E A + ++ KG + N R GE+D+I G +F EV+ R+ A+ Sbjct: 4 LTPKSFGEQAEDFACKMMKKKGYHILQRNARSRYGELDIIALHGEVVVFCEVKARQGAVS 63 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT----GNEVEWIKDAFN 128 G A ++ KQ +L + A W + + CRFD V G E ++DAF Sbjct: 64 GSAGEAIDGRKQRQLGRLAEAWRLANPA-WMAAPCRFDAVLVAREAQGWHAEIVQDAFQ 121 >UniRef50_A9HJH4 UPF0102 protein GDI1964/Gdia_0189 n=1 Tax=Gluconacetobacter diazotrophicus PAl 5 RepID=Y1964_GLUDA Length = 127 Score = 118 bits (297), Expect = 4e-26, Method: Composition-based stats. Identities = 43/129 (33%), Positives = 60/129 (46%), Gaps = 5/129 (3%) Query: 1 MATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTT 60 M PTR R + Q G E A WL+ G + R GEIDL+ + G Sbjct: 1 MMAEPTRRR-VRGAASYQRGLQAEQVAGAWLQEHGWTILMHRARTRWGEIDLVAQRGAMI 59 Query: 61 IFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT-GNE 119 +F EV+ R Y AA S+ R++ +L+ A WL + + + RFDV+ T G+ Sbjct: 60 VFCEVKCRPH--YTTAAESLGRAQMRRLMNAA-AWLCAAHPGWIYDEMRFDVLLVTAGDA 116 Query: 120 VEWIKDAFN 128 V I DAF Sbjct: 117 VHHIADAFR 125 >UniRef50_D1AZ70 Putative uncharacterized protein n=1 Tax=Sulfurospirillum deleyianum DSM 6946 RepID=D1AZ70_SULD5 Length = 108 Score = 118 bits (296), Expect = 6e-26, Method: Composition-based stats. Identities = 30/107 (28%), Positives = 49/107 (45%), Gaps = 6/107 (5%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAA 78 G E +A +LE +G +A N + + GEID+I + F EV+Y + Sbjct: 5 IGKEAETKASAYLEKEGYTILARNFHSKFGEIDIIALKEDILHFCEVKY---SQKYDPLL 61 Query: 79 SVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 +T SK K++ T + H S+ D ++ G E+E IK+ Sbjct: 62 RITPSKMKKIITTIHYYFLTHPSSYCYQ---IDAISIKGEEIEIIKN 105 >UniRef50_UPI000197ABB4 hypothetical protein GHTCC_11038 n=2 Tax=Proteobacteria RepID=UPI000197ABB4 Length = 122 Score = 118 bits (296), Expect = 7e-26, Method: Composition-based stats. Identities = 41/119 (34%), Positives = 63/119 (52%) Query: 7 RSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVR 66 +S G A E + + +L +GL F N + GEIDL+M++ T +FVEV+ Sbjct: 3 KSAVNNTQNAYHRGLAVEQKVKAYLIAQGLVFKDENFRAKCGEIDLVMKDQDTWVFVEVK 62 Query: 67 YRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 YR +G AA +T SK+ KL +T +++A+H + +D R D+ A GN W K Sbjct: 63 YRARPTHGSAADMLTSSKRDKLTKTMYVYMAKHYLNPSIIDHRIDLFAVDGNRARWHKH 121 >UniRef50_C7PDZ7 Putative uncharacterized protein n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PDZ7_CHIPD Length = 118 Score = 118 bits (296), Expect = 7e-26, Method: Composition-based stats. Identities = 34/119 (28%), Positives = 46/119 (38%), Gaps = 7/119 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + G E A L IA N R EID+I +F EV+ S LY Sbjct: 2 ASHIALGKKGELIACGHLRLHHYEIIAVNWRHRRREIDIIASRDGCLVFFEVKTLASDLY 61 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG-----NEVEWIKDAF 127 G VT +K+ + A ++ R RFDV+A T E+ +DAF Sbjct: 62 GWPEKHVTAAKRRNIQAVASAYMDRMKQLPKV--IRFDVIAITFQPDGTYELVHFEDAF 118 >UniRef50_C7GYG3 Putative choloylglycine hydrolase n=1 Tax=Eubacterium saphenum ATCC 49989 RepID=C7GYG3_9FIRM Length = 109 Score = 118 bits (296), Expect = 7e-26, Method: Composition-based stats. Identities = 31/114 (27%), Positives = 57/114 (50%), Gaps = 5/114 (4%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + K+ G E +L+ + + N R EID+I +G T F+EV+ R S + Sbjct: 1 MENKKIGSLGEEMTCSYLKDRQFVVLEQNYRNRYAEIDVIALKGDTVHFIEVKTRCSEVA 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAF 127 G A+ +V SKQ+++ + A ++LA ++ + F VV +++ + AF Sbjct: 61 GRASEAVPVSKQNRIRRLAEIYLADND--LCDKNVEFHVVTIDLHDINY---AF 109 >UniRef50_A4A171 Putative uncharacterized protein n=2 Tax=Planctomycetaceae RepID=A4A171_9PLAN Length = 137 Score = 117 bits (295), Expect = 9e-26, Method: Composition-based stats. Identities = 37/106 (34%), Positives = 52/106 (49%), Gaps = 8/106 (7%) Query: 30 WLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAASVTRSKQHKLL 89 +L G IA + + GEID+I +GRT +FVEV+ R S+ + +V KQ KL Sbjct: 26 FLRQLGYVIIARSDRSKLGEIDIIAVDGRTVVFVEVKTRSSSDAAHPSEAVDTHKQAKLT 85 Query: 90 QTARLWLARHNGSFDTVDCRFDVVAFTG------NEVEWIKDAFND 129 + A +L RHN RFDV+A T +E +AF Sbjct: 86 RLAISYLRRHNLLECKA--RFDVIAITWPAAAQTPTIEHFLNAFEP 129 >UniRef50_B9CKG2 Putative uncharacterized protein n=1 Tax=Atopobium rimae ATCC 49626 RepID=B9CKG2_9ACTN Length = 118 Score = 117 bits (295), Expect = 1e-25, Method: Composition-based stats. Identities = 33/118 (27%), Positives = 48/118 (40%), Gaps = 12/118 (10%) Query: 22 AWEAQARRWLEGKGLRFIAANVNE-RGGEIDLIMREGRTTIFVEVRYRRSALYGG---AA 77 E A +L +G + I N GGE D+I ++G + VEV+ RR Sbjct: 1 MGEQLAADYLAERGYKIIQRNWRCKGGGEADIIAQDGDVYVMVEVKTRRMLQVDANLMPE 60 Query: 78 ASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT-----GNEVEWIKDAFNDH 130 +VT KQ + A L+LA H RFDV+A + + AF+ Sbjct: 61 LAVTAQKQRMYRKMALLYLAFHG---QVSMIRFDVIAINLVAEHNASLRHLIGAFSWD 115 >UniRef50_C1TKL9 Predicted endonuclease related to Holliday junction resolvase n=1 Tax=Dethiosulfovibrio peptidovorans DSM 11002 RepID=C1TKL9_9BACT Length = 123 Score = 117 bits (294), Expect = 1e-25, Method: Composition-based stats. Identities = 28/113 (24%), Positives = 49/113 (43%), Gaps = 3/113 (2%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + G E A R+L G + NV R GE+D++ R+G T + VEVR+R + + Sbjct: 2 TAPHLEKGKRGEDLACRYLRNLGWTVLERNVRFRRGELDIVARDGDTLVIVEVRFRTTGI 61 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 SV K +L+ ++ + + R D++A T + + Sbjct: 62 IMSPEDSVGPRKLRRLVIAGAAYVEKTGWNGF---WRIDLIALTERKGRLFLN 111 >UniRef50_C0QVG4 UPF0102 protein BHWA1_02005 n=2 Tax=Brachyspira RepID=Y2005_BRAHW Length = 121 Score = 117 bits (294), Expect = 1e-25, Method: Composition-based stats. Identities = 41/120 (34%), Positives = 58/120 (48%), Gaps = 7/120 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNER--GGEIDLIMREGRTTIFVEVRYRRSA 71 K G+ E A +LE G I N + GEIDL+M +G +F+EV+YRR Sbjct: 2 ANKKIIGNLGEDIALEYLEKLGYTLIERNFKGKKTRGEIDLVMTKGVVIVFIEVKYRRQG 61 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT----GNEVEWIKDAF 127 +G AA S++ K+ KL +TA +L SF C F V E+ +I+D F Sbjct: 62 SFGYAACSISDRKKKKLYETAEEYLIEKGLSF-NQKCSFGAVLIDDTHYNREISFIEDIF 120 >UniRef50_B8HA88 UPF0102 protein Achl_2213 n=1 Tax=Arthrobacter chlorophenolicus A6 RepID=Y2213_ARTCA Length = 132 Score = 117 bits (294), Expect = 1e-25, Method: Composition-based stats. Identities = 30/120 (25%), Positives = 46/120 (38%), Gaps = 9/120 (7%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAA 78 G E A +LE G+ + N GEID++ +G + EV+ RRS YG Sbjct: 7 LGRHGEDLAVGYLETLGMLIVERNWRCSEGEIDVVALDGDALVIAEVKTRRSLDYGHPFE 66 Query: 79 SVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-----VEWIKDA--FNDHS 131 +V K +L + W R DV+A + VE +K + + Sbjct: 67 AVGPDKLARLHRLGAAWCRDRELRMPLR--RVDVIAVVDDGGGSPVVEHLKGVAEWRSDA 124 >UniRef50_A1SLR5 UPF0102 protein Noca_3248 n=11 Tax=Actinomycetales RepID=Y3248_NOCSJ Length = 124 Score = 117 bits (293), Expect = 1e-25, Method: Composition-based stats. Identities = 32/111 (28%), Positives = 46/111 (41%), Gaps = 2/111 (1%) Query: 9 GSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYR 68 S + G E A R L G+G+ + N GEIDL++R+G + EV+ R Sbjct: 3 SSAAAAIKQALGAYGETLAARHLVGQGMVLLERNWRCEAGEIDLVLRDGDVLVVCEVKTR 62 Query: 69 RSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE 119 S YG +VT K +L + A W+ D R D+V Sbjct: 63 SSLRYGTPHEAVTDIKVARLRRLASRWVQDRGV--AVRDIRIDLVGIVRPR 111 >UniRef50_A7HN69 UPF0102 protein Fnod_1509 n=1 Tax=Fervidobacterium nodosum Rt17-B1 RepID=Y1509_FERNB Length = 133 Score = 117 bits (293), Expect = 1e-25, Method: Composition-based stats. Identities = 24/110 (21%), Positives = 49/110 (44%), Gaps = 1/110 (0%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 K+ E A ++L+ KG + + N GEID+I + IFVEV+ + Sbjct: 19 NKKEWQIAEELAVKYLKEKGYKILEKNFKTPYGEIDIIANKKDIIIFVEVKSGKGIR-IQ 77 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 + V K K++++A +L + + + + DV+ ++ ++ Sbjct: 78 PSERVDDKKYLKIVKSAEFYLEFYLKNKNYKISQIDVIEIINGNIKHYEN 127 >UniRef50_C0E2B0 Putative uncharacterized protein n=2 Tax=Corynebacterium matruchotii RepID=C0E2B0_9CORY Length = 156 Score = 117 bits (293), Expect = 2e-25, Method: Composition-based stats. Identities = 34/106 (32%), Positives = 55/106 (51%), Gaps = 6/106 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGR---TTIFVEVRYRRSAL 72 + G+ EA A ++ +G R +A NV GE+D+I R T +F+EV+ R + Sbjct: 36 AHRVGELGEATAAQFYRDEGYRILARNVRYPVGELDVIARAPDPSGTIVFIEVKTRTTLD 95 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN 118 +G A +VT K H++ + A WL + + + RFDVVA + Sbjct: 96 FGIA-EAVTPRKLHRMHRAAYRWLTERHVPWS--EVRFDVVAIYLD 138 >UniRef50_A6DUI2 Endonuclease n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DUI2_9BACT Length = 127 Score = 117 bits (293), Expect = 2e-25, Method: Composition-based stats. Identities = 36/127 (28%), Positives = 58/127 (45%), Gaps = 11/127 (8%) Query: 12 RQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERG-GEIDLIMREGRTTIFVEVRYRRS 70 ++ +TG EA A+R + G + N + GEID+I R+G T FVEV+ R Sbjct: 3 KKAKHLKTGRKGEAMAQRQMRRCGYEILRKNYSLEHIGEIDIIARDGGTLCFVEVKTRHQ 62 Query: 71 AL--YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEW---IK- 124 A ++ K+ ++ + A+ +L +H S RFD+V + W IK Sbjct: 63 NKTEDTSPAQAIDSKKRQRIAKCAKYYLKKH--SLTQCSFRFDIVEVILGKFFWQHQIKI 120 Query: 125 --DAFND 129 AF + Sbjct: 121 RTHAFGE 127 >UniRef50_A1VFE8 UPF0102 protein Dvul_2148 n=3 Tax=Desulfovibrio vulgaris RepID=Y2148_DESVV Length = 134 Score = 117 bits (293), Expect = 2e-25, Method: Composition-based stats. Identities = 38/118 (32%), Positives = 54/118 (45%), Gaps = 6/118 (5%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 TG E +A L+ G R IA N G E+D+I G T +FVEV+ R + Sbjct: 4 ARHATGQHGEDEAAALLQRTGHRIIARNWRHGGLELDIICETGDTIVFVEVKTRAAHGLT 63 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE----VEWIKDAFN 128 ++T K+H+L++ AR CRFD+V T +E I DAF+ Sbjct: 64 SPTDALTHQKRHRLIRAARA--WLAAADAWDRACRFDLVCVTQRGATCTLEHITDAFD 119 >UniRef50_C6WEV0 Putative uncharacterized protein n=1 Tax=Actinosynnema mirum DSM 43827 RepID=C6WEV0_ACTMD Length = 117 Score = 116 bits (292), Expect = 2e-25, Method: Composition-based stats. Identities = 28/116 (24%), Positives = 49/116 (42%), Gaps = 7/116 (6%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 + G E+ A R+LE +GL +A N GE+D++ +G + EV+ R + G Sbjct: 3 ASHVLGRLGESVACRYLERQGLVVLARNWRCASGELDVVATDGVRLVVCEVKCRSGSGRG 62 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-----EVEWIKD 125 + T + ++ +TA W H S V R D+V + ++ Sbjct: 63 DPLEAATPEQLDRVRRTAYRWRREHRLSG--VGVRVDLVGLEWPPGGPVRLRHVRG 116 >UniRef50_C8NTW9 Choloylglycine hydrolase n=1 Tax=Corynebacterium genitalium ATCC 33030 RepID=C8NTW9_9CORY Length = 125 Score = 116 bits (292), Expect = 2e-25, Method: Composition-based stats. Identities = 35/108 (32%), Positives = 45/108 (41%), Gaps = 6/108 (5%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMR-EGRTTIFVEVRYRRSA 71 G E + E G +A N R GEID+I TT+FVEV+ RR Sbjct: 7 PRDQYMLGALGETEVATRYEQAGYIIVARNYRCRDGEIDIIAMATDGTTVFVEVKTRRGT 66 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE 119 +GGA SVT K ++ + A WL RFDVV + Sbjct: 67 CFGGA-ESVTARKLARMRKAAVHWLRDKPFR----QVRFDVVEVLFDG 109 >UniRef50_Q47S60 UPF0102 protein Tfu_0669 n=2 Tax=Nocardiopsaceae RepID=Y669_THEFY Length = 124 Score = 116 bits (291), Expect = 2e-25, Method: Composition-based stats. Identities = 32/110 (29%), Positives = 48/110 (43%), Gaps = 3/110 (2%) Query: 7 RSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVR 66 R+ + + G E A R+L G+R + N R GEID++ R+ RT + VEV+ Sbjct: 2 RTRARLADQRRTLGQRGEELAARYLTRHGMRVLQRNWRCRDGEIDILARQDRTLVVVEVK 61 Query: 67 YRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT 116 R +G +V +K+ +L W H S R DVV Sbjct: 62 TRAGRRFGTPLEAVDETKRARLRALGYRWARDHGCSAR---IRVDVVGIL 108 >UniRef50_A0Q0X6 UPF0102 protein NT01CX_2205 n=3 Tax=Clostridium RepID=Y2205_CLONN Length = 120 Score = 116 bits (291), Expect = 3e-25, Method: Composition-based stats. Identities = 31/118 (26%), Positives = 53/118 (44%), Gaps = 8/118 (6%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 K G E + +L KG + + N R GEID+I F EV+ R + +G Sbjct: 5 NKPIGSYGEHISENFLVSKGHKILTKNFRCRSGEIDIISSHNNYICFTEVKTRYNYSFGI 64 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE------VEWIKDAF 127 SVT +K K+ TA+ ++ + + F+V+ N+ + +I++AF Sbjct: 65 PCESVTITKIKKIRNTAKFYIYINKLFKNNFK--FNVIEIILNKYSNDYSINFIENAF 120 >UniRef50_C7H7V1 Endonuclease n=2 Tax=Faecalibacterium prausnitzii RepID=C7H7V1_9FIRM Length = 120 Score = 115 bits (290), Expect = 3e-25, Method: Composition-based stats. Identities = 42/122 (34%), Positives = 58/122 (47%), Gaps = 8/122 (6%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE-GRTTIFVEVRYRRSAL 72 + +TG EA A R+ + +G +A N R GEIDLI+RE T + EV+ R Sbjct: 1 MDRAETGRTGEAVAARYYQKQGCELVAHNYRTRMGEIDLILREPDGTLVLCEVKTRSPDP 60 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG-----NEVEWIKDAF 127 AA+VT +KQ +L++TA +L + RFDV T V IK AF Sbjct: 61 LAAPAAAVTPAKQRRLIRTAEYYLQ--HTGQSDEPVRFDVAEVTPLDSGRWMVHIIKGAF 118 Query: 128 ND 129 Sbjct: 119 TA 120 >UniRef50_B2IUS6 Putative uncharacterized protein n=4 Tax=Nostocaceae RepID=B2IUS6_NOSP7 Length = 180 Score = 115 bits (290), Expect = 3e-25, Method: Composition-based stats. Identities = 31/130 (23%), Positives = 48/130 (36%), Gaps = 28/130 (21%) Query: 18 QTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGR------------------- 58 G E +WL+ G + R GEID+I + Sbjct: 11 DIGHLGEDLVAQWLQSTGWIILHRRFASRWGEIDIIAQHDGQTGEKLLTQHSLRAKRPAT 70 Query: 59 -------TTIFVEVRYRRSALYG-GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRF 110 FVEV+ R S + G +++T KQ K+ +TA ++LA++ CRF Sbjct: 71 ANSTQHSLLAFVEVKTRSSGSWDAGGRSAITPQKQAKISRTAGIFLAQYPEK-ADYSCRF 129 Query: 111 DVVAFTGNEV 120 DV + Sbjct: 130 DVAIVYCQRI 139 >UniRef50_C5CAH6 Holliday junction resolvase-like endonuclease n=1 Tax=Micrococcus luteus NCTC 2665 RepID=C5CAH6_MICLC Length = 139 Score = 115 bits (290), Expect = 3e-25, Method: Composition-based stats. Identities = 29/118 (24%), Positives = 41/118 (34%), Gaps = 2/118 (1%) Query: 2 ATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTI 61 A P + R G E A RWL +G N GE+D++ + Sbjct: 10 AQPPGERRASR--AHTALGRFGEDAAARWLAERGYVIADRNWRGEAGELDIVAHHAGWWV 67 Query: 62 FVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE 119 VEV+ R +G S+ R K +L + W+ H R D VA Sbjct: 68 GVEVKTRSGLAFGDPFESIDRRKLTRLHRLTAAWVRAHAADRRGTPWRVDAVAVLVPR 125 >UniRef50_A0LMM6 Putative uncharacterized protein n=1 Tax=Syntrophobacter fumaroxidans MPOB RepID=A0LMM6_SYNFM Length = 95 Score = 115 bits (290), Expect = 4e-25, Method: Composition-based stats. Identities = 37/94 (39%), Positives = 51/94 (54%), Gaps = 6/94 (6%) Query: 40 AANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARH 99 N GEIDLI+R+G+T +FVEV+ R + +G SV+ +KQ +L + A +L Sbjct: 2 ERNFRCAAGEIDLIVRDGKTLVFVEVKSRCGSRFGLPQESVSIAKQRRLTRLALWYLREK 61 Query: 100 NGSFDTVDCRFDVVAFTG----NEVEWIKDAFND 129 F+ RFDVVA T EV WI +AF Sbjct: 62 R--FEGHPARFDVVAVTWSGGKPEVTWIVNAFEA 93 >UniRef50_B6R5V9 Putative uncharacterized protein n=1 Tax=Pseudovibrio sp. JE062 RepID=B6R5V9_9RHOB Length = 135 Score = 115 bits (290), Expect = 4e-25, Method: Composition-based stats. Identities = 33/129 (25%), Positives = 56/129 (43%), Gaps = 4/129 (3%) Query: 2 ATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTI 61 T+S ++ + G E +A L G + + + GEIDLI + +T + Sbjct: 7 TPRATKSNLLKKQAAYRKGLQAELKAEMLLRQAGWQILERRYKTKQGEIDLIAEQDQTIV 66 Query: 62 FVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFD-VVAFTGNEV 120 FVEV+ RR G ++T+ Q ++ AR W++ H+ RFD V+ E Sbjct: 67 FVEVKARRGVDDG--LYAITQRSQRRIANAAREWVSHHHEVVGKT-LRFDAVILPKHGEA 123 Query: 121 EWIKDAFND 129 + + F Sbjct: 124 QHFPNLFEA 132 >UniRef50_Q118B0 UPF0102 protein Tery_0733 n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Y733_TRIEI Length = 180 Score = 115 bits (289), Expect = 4e-25, Method: Composition-based stats. Identities = 30/120 (25%), Positives = 47/120 (39%), Gaps = 14/120 (11%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRT------------TIF 62 TG E +WL +G + + GE+D++ + + F Sbjct: 14 QKIDTGILGEELVAKWLNLEGWQILHRRWQCPWGELDIVATKTTSSLRDSSNYKFPILAF 73 Query: 63 VEVRYRRSALYG-GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVE 121 VEV+ R + +VT SKQ KL +TA ++L+ CRFDV N + Sbjct: 74 VEVKTRSRGNWDQDGLLAVTESKQAKLWKTAEIFLSDRP-ELVDYSCRFDVALVRCNYIR 132 >UniRef50_C0BPF0 Putative uncharacterized protein n=1 Tax=Flavobacteria bacterium MS024-3C RepID=C0BPF0_9BACT Length = 119 Score = 115 bits (289), Expect = 4e-25, Method: Composition-based stats. Identities = 35/116 (30%), Positives = 50/116 (43%), Gaps = 7/116 (6%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 G E A+ +L KG A N R EID+I I VEV+ R + Sbjct: 4 HNDLGAKGERIAQEYLISKGYEIRAVNYRHRKAEIDIIALHENFLIVVEVKTRTAPTIVP 63 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT----GNEVEWIKDAF 127 V SK + L++ A ++ RH + RFD+V T ++E +KDAF Sbjct: 64 LIQLVPPSKINHLIRAANYYMNRHKV---HKEARFDIVYITMKAHSYDLEHLKDAF 116 >UniRef50_A8F4Z9 UPF0102 protein Tlet_0667 n=1 Tax=Thermotoga lettingae TMO RepID=Y667_THELT Length = 107 Score = 114 bits (287), Expect = 7e-25, Method: Composition-based stats. Identities = 30/106 (28%), Positives = 44/106 (41%), Gaps = 4/106 (3%) Query: 20 GDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAAS 79 E +A R+L KG + +A N R GEID+I R +FVEV+ + Sbjct: 3 WKEAEEKASRYLRHKGFKILARNYRTRFGEIDIIARYRGYLVFVEVK--SGNSFFLPRTR 60 Query: 80 VTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 V K + A ++ SF R DV+ T +E +D Sbjct: 61 VDLQKIRHIQLAANDYIMNTKDSFKGY--RIDVIEVTEKGIEHFED 104 >UniRef50_D2ATE9 Putative uncharacterized protein n=1 Tax=Streptosporangium roseum DSM 43021 RepID=D2ATE9_STRRD Length = 117 Score = 114 bits (287), Expect = 7e-25, Method: Composition-based stats. Identities = 33/103 (32%), Positives = 47/103 (45%), Gaps = 2/103 (1%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + G E A +L G++ + N GEID++ REGR + VEV+ R + Sbjct: 2 AAKDELGRHGEQVAVDYLLAHGMQILDRNWRCPDGEIDVVAREGRALVVVEVKTRSGRTH 61 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT 116 G A +VT K +L + WLA FD R DV+A Sbjct: 62 GTAFEAVTVVKLARLRRLTGRWLAERRERFD--SVRIDVIALE 102 >UniRef50_B7K7K1 Putative uncharacterized protein n=1 Tax=Cyanothece sp. PCC 7424 RepID=B7K7K1_CYAP7 Length = 148 Score = 114 bits (287), Expect = 8e-25, Method: Composition-based stats. Identities = 28/103 (27%), Positives = 45/103 (43%), Gaps = 4/103 (3%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE--GRTTIFVEVRYRRSALY 73 G+ E WL+ + + R GEID+I + + F+EV+ R S + Sbjct: 1 MTTIGELGEKLVSEWLKTQEWSILQHRWRCRWGEIDIISQSTTDHSLAFIEVKTRNSRNW 60 Query: 74 G-GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF 115 ++ KQ KL+++A L+L + S CRFDV Sbjct: 61 DSDGLLAINEKKQIKLIKSASLFLGEYP-SLALFPCRFDVALV 102 >UniRef50_C6XIG7 Putative uncharacterized protein n=1 Tax=Hirschia baltica ATCC 49814 RepID=C6XIG7_HIRBI Length = 138 Score = 114 bits (286), Expect = 1e-24, Method: Composition-based stats. Identities = 34/122 (27%), Positives = 59/122 (48%), Gaps = 4/122 (3%) Query: 9 GSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYR 68 ++ ++ G E A WL KG + + V R GEIDLI +GR F+EV+ R Sbjct: 20 SDSKRRKHEKRGRNAEWLASIWLRLKGYKILQKRVRMRTGEIDLIATKGRVIAFIEVKAR 79 Query: 69 RSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEV-EWIKDAF 127 ++ G SV + ++ +TA +W+A+ F + D R+D+V ++ +K + Sbjct: 80 KTINIG--LQSVPETSWRRISKTAEIWMAKK-TKFKSHDWRYDLVVVCPWKIPSHLKAFW 136 Query: 128 ND 129 Sbjct: 137 RP 138 >UniRef50_A0NXE9 Putative uncharacterized protein n=2 Tax=Labrenzia RepID=A0NXE9_9RHOB Length = 134 Score = 114 bits (286), Expect = 1e-24, Method: Composition-based stats. Identities = 38/135 (28%), Positives = 63/135 (46%), Gaps = 9/135 (6%) Query: 1 MATVPTRSG-----SPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMR 55 MA P SG + R+ G + E A +L G R + + GEIDLI + Sbjct: 1 MARAPGGSGKLPAETDRRRRAHALGLSAETLAAWYLRLTGWRILKRRYKTKAGEIDLIAK 60 Query: 56 EGRTTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF 115 +T F+EV+ R+S A +VT + Q ++ + A++++A H RFD++ Sbjct: 61 RRKTVAFIEVKARKSRQ--AALEAVTPASQKRITRAAKIFVAEHP-KAGFYTLRFDIIVV 117 Query: 116 TGNEV-EWIKDAFND 129 + E I +AF+ Sbjct: 118 RPRALPERIVNAFHA 132 >UniRef50_Q5FF38 UPF0102 protein ERGA_CDS_00540 n=5 Tax=canis group RepID=Y054_EHRRG Length = 127 Score = 114 bits (286), Expect = 1e-24, Method: Composition-based stats. Identities = 27/120 (22%), Positives = 50/120 (41%), Gaps = 6/120 (5%) Query: 12 RQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSA 71 ++L G E +L+ K I GEID+I + + +F+EV+ Sbjct: 8 KRLAYNTLGYLGEVLIILFLKCKLYHIIKHRYRCPLGEIDIIAHKNKQLVFIEVKTSLFN 67 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF-TGNEVEWIKDAFNDH 130 +T +Q +L++A+ ++A H F RFD+ F + I +A+ + Sbjct: 68 KNIP----ITYKQQKSILKSAKYFIAFHR-KFANYSIRFDLYFFSLSTGLTHIPNAWQEP 122 >UniRef50_C4LJB4 Putative uncharacterized protein n=1 Tax=Corynebacterium kroppenstedtii DSM 44385 RepID=C4LJB4_CORK4 Length = 154 Score = 113 bits (284), Expect = 2e-24, Method: Composition-based stats. Identities = 31/108 (28%), Positives = 47/108 (43%), Gaps = 8/108 (7%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMR-------EGRTTIFVEVRY 67 +T+ G E +A WL +G + N RGGE+D++ VEV+ Sbjct: 26 STRLLGRWGEDRAAEWLVRQGFVIVDRNWRFRGGELDIVATLNTDARNSPAVCAVVEVKT 85 Query: 68 RRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF 115 RR+ +GG ++TR KQ L + WLA H R D++ Sbjct: 86 RRTQFFGGGVEAITRKKQQTLRRGMSQWLAAHPDVHPQF-IRIDLIDI 132 >UniRef50_B2RLS5 UPF0102 protein PGN_1801 n=2 Tax=Porphyromonas gingivalis RepID=Y1801_PORG3 Length = 135 Score = 113 bits (284), Expect = 2e-24, Method: Composition-based stats. Identities = 24/120 (20%), Positives = 46/120 (38%), Gaps = 9/120 (7%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 G E A + L +G + A N E+D++ R + VEV+ R Sbjct: 2 ADHNDRGRQGEEIALKHLRQQGYQIEALNWQSGRRELDIVASTSRELVVVEVKTRTEGFL 61 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG------NEVEWIKDAF 127 +V K+ + ++A ++ + + RFDV++ +E ++AF Sbjct: 62 LAPEEAVDARKRRLISESAHHYVRMYAI---DLPVRFDVISVVLSADGSCKRIEHRENAF 118 >UniRef50_C1A4D5 Putative uncharacterized protein n=1 Tax=Gemmatimonas aurantiaca T-27 RepID=C1A4D5_GEMAT Length = 121 Score = 113 bits (284), Expect = 2e-24, Method: Composition-based stats. Identities = 35/120 (29%), Positives = 59/120 (49%), Gaps = 6/120 (5%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 ++ G E A RWL +G + +A +IDLIM+ + FVEV+ RR Sbjct: 2 TRARQELGLLGERIAARWLIREGWQLVAHRFRHGHRDIDLIMQREQEVAFVEVKARRGEA 61 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN----EVEWIKDAFN 128 +G +V K+ +L+++A++W+ RH ++ RFDV+ + V I+ AF Sbjct: 62 FGSPVEAVHARKRRELVRSAKVWVDRHGTEG--LEYRFDVLGILIDGQNVRVRHIEGAFQ 119 >UniRef50_C5CGT1 UPF0102 protein Kole_1919 n=1 Tax=Kosmotoga olearia TBF 19.5.1 RepID=Y1919_KOSOT Length = 115 Score = 113 bits (284), Expect = 2e-24, Method: Composition-based stats. Identities = 29/111 (26%), Positives = 55/111 (49%), Gaps = 5/111 (4%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 + + G +E +A ++L+ +G + +A NV GE+D++ R+G+T +FVEV+ Sbjct: 5 SLKKGKEFEERASKFLKKQGYKILARNVRYSFGELDIVARKGKTLVFVEVK--GGNPDFP 62 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT-GNEVEWIKD 125 V R+K +L A ++ + F+ R DV+ E+ +K Sbjct: 63 PRMRVDRAKLRRLELAAYKYIKDFSPKFEES--RLDVIEVLSNGEINHLKG 111 >UniRef50_A6GIE5 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6GIE5_9DELT Length = 125 Score = 113 bits (284), Expect = 2e-24, Method: Composition-based stats. Identities = 44/125 (35%), Positives = 61/125 (48%), Gaps = 6/125 (4%) Query: 8 SGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGR-----TTIF 62 + T+ G A E A R LE GL +A NV G E+DL+ E T +F Sbjct: 2 ASDEPSTHTRGRGLAAEQLAARQLERAGLTILARNVELSGAEVDLVASERDREGTPTIVF 61 Query: 63 VEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEW 122 VEVR R G A +V KQ ++ + A +L R + ++ V RFDV+A G W Sbjct: 62 VEVRSRADDRRGHPAQTVDARKQARVRRAATAYLVREDL-WERVAVRFDVIAIVGERATW 120 Query: 123 IKDAF 127 ++DAF Sbjct: 121 LRDAF 125 >UniRef50_Q0A6J0 UPF0102 protein Mlg_2205 n=2 Tax=Ectothiorhodospiraceae RepID=Y2205_ALHEH Length = 126 Score = 113 bits (283), Expect = 2e-24, Method: Composition-based stats. Identities = 52/124 (41%), Positives = 67/124 (54%), Gaps = 1/124 (0%) Query: 7 RSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVR 66 R +TG+ E +A L G+GL + N R GEIDLIMR+G +FVEVR Sbjct: 3 RRRPSAPAPHLETGNRGERRALEHLTGQGLELLECNFRCRAGEIDLIMRDGEVVVFVEVR 62 Query: 67 YRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDA 126 R YGGA AS+T +KQ +L + A WL RH + CRFDVV F G +W++ A Sbjct: 63 VRTHPGYGGALASITPAKQRRLARAAARWLQRHRLT-QRAVCRFDVVTFDGERPQWLRHA 121 Query: 127 FNDH 130 F Sbjct: 122 FTAP 125 >UniRef50_A8HYK3 UPF0102 protein AZC_4471 n=3 Tax=Rhizobiales RepID=Y4471_AZOC5 Length = 131 Score = 113 bits (283), Expect = 2e-24, Method: Composition-based stats. Identities = 35/128 (27%), Positives = 55/128 (42%), Gaps = 4/128 (3%) Query: 2 ATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTI 61 P R+ G A E +A LE + R +A + GE+DL+ R + Sbjct: 4 PPPPDTPARRRKQAAHARGLAAEDRAAAVLEAQSFRILARRLRTSAGELDLVARRDDLLV 63 Query: 62 FVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF-TGNEV 120 F EV+ RRS AA S+ ++ +++ A L+LA H + RFD + Sbjct: 64 FCEVKLRRS--LAEAAESLQLRQRRRIIAAAELFLADHP-ELAPLAMRFDAILLGRDGGA 120 Query: 121 EWIKDAFN 128 E ++ AF Sbjct: 121 EHLEGAFE 128 >UniRef50_B2UP21 Putative uncharacterized protein n=2 Tax=Verrucomicrobiaceae RepID=B2UP21_AKKM8 Length = 118 Score = 113 bits (283), Expect = 2e-24, Method: Composition-based stats. Identities = 36/115 (31%), Positives = 52/115 (45%), Gaps = 9/115 (7%) Query: 21 DAWEAQARRWLEGKGLRFIAANVN-ERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAAS 79 E A +L +G + N RGGE+D++ REG +FVEV+ R YGGA + Sbjct: 1 MYGELAAASFLRAEGCVILRRNWRPVRGGELDIVCREGECLVFVEVKTRTGNGYGGARRA 60 Query: 80 VTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT-----GNEVEWIKDAFND 129 V K+ + + A WL + V+ R+DVV E I+ AF Sbjct: 61 VNARKRALIRRGAAEWL---RLLPEPVNSRYDVVEVLYREGMPPEFRHIRGAFGA 112 >UniRef50_C4KCT6 Putative uncharacterized protein n=3 Tax=Betaproteobacteria RepID=C4KCT6_THASP Length = 150 Score = 112 bits (282), Expect = 2e-24, Method: Composition-based stats. Identities = 42/98 (42%), Positives = 56/98 (57%), Gaps = 3/98 (3%) Query: 35 GLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARL 94 GLR IA NV RGGE+DL+ + +FVEVR RR+ +GGAA S+T +KQ ++L A+ Sbjct: 52 GLRVIARNVRCRGGEVDLVCLDRSHVVFVEVRLRRNNRFGGAAESITAAKQRRVLIAAQW 111 Query: 95 WLARHNGSFDTVDCRFDVV---AFTGNEVEWIKDAFND 129 WL F CRFD V A + W+ AF+ Sbjct: 112 WLGGAGRRFRDAACRFDAVLLDALDPARIIWLPGAFDA 149 >UniRef50_Q3M3N9 UPF0102 protein Ava_4800 n=2 Tax=Nostocaceae RepID=Y4800_ANAVT Length = 151 Score = 112 bits (282), Expect = 3e-24, Method: Composition-based stats. Identities = 29/108 (26%), Positives = 50/108 (46%), Gaps = 7/108 (6%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE-----GRTTIFVEVRYR 68 ++ + E +WL+ G + + R GEID+I + FVEV+ R Sbjct: 1 MSHLNIANLGEDFVAQWLQSTGWMILNRQFSCRWGEIDIIAQHTRNNQESILAFVEVKTR 60 Query: 69 RSALYGG-AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF 115 + ++T KQ K+ +TAR++LA++ + + CRFDV Sbjct: 61 SPGNWDDGGRGAITLKKQAKIERTARIFLAKYPDKAEYI-CRFDVAIV 107 >UniRef50_C7QA44 Putative uncharacterized protein n=1 Tax=Catenulispora acidiphila DSM 44928 RepID=C7QA44_CATAD Length = 157 Score = 112 bits (282), Expect = 3e-24, Method: Composition-based stats. Identities = 30/142 (21%), Positives = 50/142 (35%), Gaps = 19/142 (13%) Query: 2 ATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNER----GGEIDLIMREG 57 T L+ Q G E A +L G + N R GE+D+I Sbjct: 14 PTDFPSQADHSTLSPGQLGREGEDLAAAYLTACGYHVLDRNWRWRGPDVRGELDIIALAS 73 Query: 58 RTTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVD--------CR 109 + +EV+ RR+A ++T +K+ +L + WLA H R Sbjct: 74 DLLVTIEVKTRRAATGARPFDAITEAKRARLWKLTNRWLAEHRLDPAVRHHLPRGIRGIR 133 Query: 110 FDVVAF-------TGNEVEWIK 124 DV+ T ++ ++ Sbjct: 134 LDVIGLIYPTDGHTEPTIDHLQ 155 >UniRef50_Q7UM23 UPF0102 protein RB9115 n=1 Tax=Rhodopirellula baltica RepID=Y9115_RHOBA Length = 167 Score = 112 bits (282), Expect = 3e-24, Method: Composition-based stats. Identities = 42/123 (34%), Positives = 57/123 (46%), Gaps = 11/123 (8%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIM--REGRTTIFVEVRYRRSALY 73 Q G E A + L KGL IA + ++R GEIDLI + R +FVEV+ + Sbjct: 39 NAQLGRRGEQAAAQLLRRKGLNVIAESESDRAGEIDLIALRKRPRLIVFVEVKTLSTTRP 98 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF-------TGNEVEWIKDA 126 G A V +KQ ++ + A +L R + CRFDVVA VE + A Sbjct: 99 GHPADRVDENKQARITRAALRYLKRKKLLG--ITCRFDVVAVWWPRDEPRPTRVEHYESA 156 Query: 127 FND 129 FN Sbjct: 157 FNA 159 >UniRef50_D1N5L9 Putative uncharacterized protein n=1 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1N5L9_9BACT Length = 132 Score = 112 bits (281), Expect = 3e-24, Method: Composition-based stats. Identities = 27/109 (24%), Positives = 49/109 (44%), Gaps = 2/109 (1%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + G EA A R LE KG +A N + GE+D++ R+G + +FVEV+ Sbjct: 5 RAAHLALGRRGEAAACRLLEAKGFDILARNWRVKAGELDIVARDGASVVFVEVKTLHRKG 64 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVE 121 + +++ ++ + A+L+L + RFD+V + Sbjct: 65 FFRPLDNLSAHQKKRNFHAAQLYLRM--IGGTGLPVRFDLVEVVASRWR 111 >UniRef50_Q0BTH9 UPF0102 protein GbCGDNIH1_0975 n=1 Tax=Granulibacter bethesdensis CGDNIH1 RepID=Y975_GRABC Length = 158 Score = 112 bits (281), Expect = 3e-24, Method: Composition-based stats. Identities = 33/120 (27%), Positives = 52/120 (43%), Gaps = 4/120 (3%) Query: 12 RQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSA 71 R + G E A + LE G + + + GEID++ VEV+YR + Sbjct: 37 RGGKASRDGLEAERIAAQALEADGWQILGRRLRTSAGEIDILAEMDGLLAIVEVKYRPTL 96 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT-GNEVEWIKDAFNDH 130 AA ++ ++ +L+ A LA+H + T RFDV+ +V I DAF Sbjct: 97 S--EAAHALGPRQRKRLIAAASYVLAQHP-EYGTEGVRFDVIVVDMAGQVRRITDAFRLD 153 >UniRef50_A4CH47 Putative uncharacterized protein n=1 Tax=Robiginitalea biformata HTCC2501 RepID=A4CH47_9FLAO Length = 128 Score = 112 bits (281), Expect = 4e-24, Method: Composition-based stats. Identities = 32/117 (27%), Positives = 53/117 (45%), Gaps = 7/117 (5%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 TT G E A R+L G R + N R EID++ + +EV+ R A Y Sbjct: 11 TTCDIGREGEDYAVRYLLASGYRILCRNYRYRRAEIDVLAFREGVLVVIEVKTRTRAFYE 70 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF----TGNEVEWIKDAF 127 + S+ RSK +L++ A ++ + + RFD++ G + ++DAF Sbjct: 71 ALSRSIPRSKIARLVRAADHYVRSNGLRAE---VRFDIIQVIRLREGYRLVHLEDAF 124 >UniRef50_B4U6T0 Putative uncharacterized protein n=1 Tax=Hydrogenobaculum sp. Y04AAS1 RepID=B4U6T0_HYDS0 Length = 110 Score = 112 bits (280), Expect = 4e-24, Method: Composition-based stats. Identities = 22/107 (20%), Positives = 43/107 (40%), Gaps = 2/107 (1%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAA 78 G +E A WL KG + + N + GEID+I + I EV+ + YG Sbjct: 2 KGKEFEDMAFSWLLEKGYKVLKRNHRCKRGEIDIIATKENKLIAFEVKGNNTDTYGLPEE 61 Query: 79 SVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 + R K ++ + + D + D + +++ +++ Sbjct: 62 RIDRLKLERIRLCLTEYALSNGIDLDNIQI--DAIFIYKDQIRHLEN 106 >UniRef50_D2LER1 Putative uncharacterized protein n=1 Tax=Rhodomicrobium vannielii ATCC 17100 RepID=D2LER1_RHOVA Length = 124 Score = 112 bits (280), Expect = 5e-24, Method: Composition-based stats. Identities = 36/123 (29%), Positives = 58/123 (47%), Gaps = 4/123 (3%) Query: 6 TRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEV 65 S + + + G A E +A+ LE K R +A +GGEIDL+ + G FVEV Sbjct: 3 AASRARNAPNSYKIGVAAETRAKLLLEAKSYRILAERYKTKGGEIDLVAQRGDHLAFVEV 62 Query: 66 RYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-VEWIK 124 + RR+ AA +V +Q ++ A ++L H FDV+ + + + I+ Sbjct: 63 KCRRTQE--EAAYAVLPRQQARIATAAEVFLGEH-AGLSHESASFDVILVSPTQGLSHIE 119 Query: 125 DAF 127 AF Sbjct: 120 QAF 122 >UniRef50_B6BHT8 Putative uncharacterized protein n=1 Tax=Campylobacterales bacterium GD 1 RepID=B6BHT8_9PROT Length = 109 Score = 112 bits (280), Expect = 5e-24, Method: Composition-based stats. Identities = 27/110 (24%), Positives = 53/110 (48%), Gaps = 5/110 (4%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 ++ GD E +A ++L G + N R GEID+I + FVEV+ + Sbjct: 2 SRAKGDLAEDRACKFLYENGFMLVDRNFYSRFGEIDIIATKDEVLHFVEVK--SGLDFES 59 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 A ++T K +L++T +++ ++ + V +D + T VE +++ Sbjct: 60 AIQNITPKKLSRLIRTGNVYMKKNKLDVNFV---YDAIVVTPKTVEIVEN 106 >UniRef50_Q83I01 UPF0102 protein TW312 n=2 Tax=Tropheryma whipplei RepID=Y312_TROW8 Length = 120 Score = 111 bits (279), Expect = 6e-24, Method: Composition-based stats. Identities = 26/114 (22%), Positives = 45/114 (39%), Gaps = 4/114 (3%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 + G E +A +L G + N R GE+D+I R+ + VEV+ + Sbjct: 6 SKYALGRIAEDKACNYLSVNGYIVLDRNWYCRFGELDIIARKNGVIVAVEVKGGKRNA-D 64 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG---NEVEWIKD 125 ++T K KL + WL + + +D R D V+ T ++ Sbjct: 65 YPICNITVKKLSKLTFLLKAWLHENKLNEFCIDLRIDAVSVTFIPELQIRHFVG 118 >UniRef50_Q2IJ48 UPF0102 protein Adeh_1910 n=4 Tax=Anaeromyxobacter RepID=Y1910_ANADE Length = 134 Score = 111 bits (279), Expect = 7e-24, Method: Composition-based stats. Identities = 42/119 (35%), Positives = 54/119 (45%), Gaps = 6/119 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 + G EA A WL +G R + N R GE+DL+ R+G +FVEVR R S GG Sbjct: 12 RQALGREGEALAAAWLAERGFRILDRNHRTRRGEVDLVCRDGEVLVFVEVRSRTSGAQGG 71 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN----EVEWIKDAFNDH 130 +V K +++ A W H G RFDVVA T VE AF+ Sbjct: 72 PEETVGPLKGRRVVAAATDWALGHGGLEQ--AIRFDVVAVTFGDGEPRVEHFPAAFDGD 128 >UniRef50_Q0F072 Putative uncharacterized protein n=1 Tax=Mariprofundus ferrooxydans PV-1 RepID=Q0F072_9PROT Length = 120 Score = 110 bits (277), Expect = 1e-23, Method: Composition-based stats. Identities = 33/124 (26%), Positives = 59/124 (47%), Gaps = 13/124 (10%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 ++T+ G+ E++A R+L+ G R + N GE+D++ G +FVEV+ Sbjct: 1 MSTRD-GNIGESEASRYLQHHGYRILDRNARLGRGELDIVALSGEIVVFVEVKA--HHNR 57 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE---------VEWIK 124 A +V K +L A++WLA H + ++ CRFD++ T +E ++ Sbjct: 58 ESALLAVHEDKCARLKSAAQMWLALHP-RYASLQCRFDLIIITPRVGLTAWLGSCIEHME 116 Query: 125 DAFN 128 D Sbjct: 117 DIIR 120 >UniRef50_A3VPC2 Putative uncharacterized protein n=1 Tax=Parvularcula bermudensis HTCC2503 RepID=A3VPC2_9PROT Length = 146 Score = 110 bits (277), Expect = 1e-23, Method: Composition-based stats. Identities = 36/131 (27%), Positives = 55/131 (41%), Gaps = 3/131 (2%) Query: 2 ATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTI 61 A + S L ++ G E +A +L+ KG V GEIDLI+ +G T Sbjct: 8 ARQSAKRQSAEYLAAERLGRRAERRAALFLQLKGYAIRDRRVRTPRGEIDLIVTKGSTLA 67 Query: 62 FVEVRYRRSAL-YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEV 120 F+EV+ R SA A V ++ + +W AR S RFD++ Sbjct: 68 FIEVKARTSADALQDPATLVPPQNWARIAAASAIWRAR--ASLMPKIVRFDLILVRRGIP 125 Query: 121 EWIKDAFNDHS 131 +KDA+ + Sbjct: 126 CHVKDAYRPDA 136 >UniRef50_D2NTN5 Predicted endonuclease distantly related to archaeal Holliday junction resolvase n=2 Tax=Rothia mucilaginosa RepID=D2NTN5_9MICC Length = 151 Score = 110 bits (276), Expect = 1e-23, Method: Composition-based stats. Identities = 42/142 (29%), Positives = 53/142 (37%), Gaps = 20/142 (14%) Query: 2 ATVPTRSGSPRQL------TTKQTGDAWEAQARRWLEGKGLRFIAANVNER--------G 47 A TRS R L G E R LE G R + N Sbjct: 8 ARAATRSAPNRPLLRRTSPRAHSVGRWGEELTARILETNGYRILERNWRPPAGLEHEQIR 67 Query: 48 GEIDLIMREG-RTTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTV 106 GE+DLI + +FVEV+ R S +G AS+ R K + A LW R + D Sbjct: 68 GELDLIAIDPEDELVFVEVKTRSSEDFGHPFASIDRDKARRTRSLAILWC-RLRENLDFP 126 Query: 107 DCRFDVVAFTGN----EVEWIK 124 R D +A TG E +K Sbjct: 127 RFRIDAIAVTGTCETFTFEHLK 148 >UniRef50_D1Y396 Putative uncharacterized protein n=1 Tax=Pyramidobacter piscolens W5455 RepID=D1Y396_9BACT Length = 164 Score = 110 bits (276), Expect = 1e-23, Method: Composition-based stats. Identities = 34/120 (28%), Positives = 51/120 (42%), Gaps = 3/120 (2%) Query: 4 VPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFV 63 G E A +L+ +GL+ + NV ER E+DL+ EG+T +FV Sbjct: 31 TQAERAFFLAKERAAIGRWAEELAAGFLQAQGLKILERNVRERFSELDLVALEGKTLVFV 90 Query: 64 EVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWI 123 EVR RR A ++ K +LL+ A L+ R + R D+V+ W Sbjct: 91 EVRCRRKNPVMSAQDTIGPLKWRRLLRGAELYTLRRQWRGE---WRLDLVSVDVGHERWH 147 >UniRef50_A4X4J0 UPF0102 protein Strop_1320 n=2 Tax=Micromonosporaceae RepID=Y1320_SALTO Length = 121 Score = 110 bits (276), Expect = 2e-23, Method: Composition-based stats. Identities = 40/119 (33%), Positives = 54/119 (45%), Gaps = 6/119 (5%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + G E A R L GLR +A N GEID+I +G EV+ RR+ Sbjct: 5 SRHNQSVGAYGERCALRHLITAGLRPVARNWRCPHGEIDIIAWDGPVLAICEVKTRRTDT 64 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT----GNEVEWIKDAF 127 +G A+VT +K +L A WLA D + RFDV++ VE +K AF Sbjct: 65 FGTPTAAVTGTKARRLRLLAARWLAETGTRAD--EVRFDVLSIRLTGGPPHVEHLKGAF 121 >UniRef50_A1HN64 Putative uncharacterized protein n=1 Tax=Thermosinus carboxydivorans Nor1 RepID=A1HN64_9FIRM Length = 78 Score = 110 bits (275), Expect = 2e-23, Method: Composition-based stats. Identities = 25/70 (35%), Positives = 34/70 (48%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAA 78 G E A +L G + + N R GEID++ T +FVEV+ R S +G A Sbjct: 2 MGKMGENAAADYLARNGYKILMRNYRCRIGEIDIVAERQGTIVFVEVKTRSSEKFGFPAE 61 Query: 79 SVTRSKQHKL 88 +V KQ KL Sbjct: 62 AVNYRKQQKL 71 >UniRef50_C2ANQ3 Predicted endonuclease related to Holliday junction resolvase n=1 Tax=Tsukamurella paurometabola DSM 20162 RepID=C2ANQ3_TSUPA Length = 122 Score = 110 bits (275), Expect = 2e-23, Method: Composition-based stats. Identities = 28/107 (26%), Positives = 40/107 (37%), Gaps = 7/107 (6%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNER----GGEIDLIMRE-GRTTIFVEVRYR 68 + + G A E +L G+G R + N GE+D+I + VEV+ R Sbjct: 1 MGNNEVGRAGEDLVCEYLTGRGWRVLDRNWRFSGSGLRGELDVIAQSADGVLAVVEVKTR 60 Query: 69 RSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF 115 YG +VT K +L WLA R DV + Sbjct: 61 SGTAYGSGFEAVTPRKVAQLRALTARWLAE--SENAYRRVRIDVASV 105 >UniRef50_UPI00019790C0 hypothetical protein HcinC1_06745 n=2 Tax=Helicobacter cinaedi CCUG 18818 RepID=UPI00019790C0 Length = 116 Score = 109 bits (274), Expect = 2e-23, Method: Composition-based stats. Identities = 28/112 (25%), Positives = 49/112 (43%), Gaps = 5/112 (4%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRR--SALY 73 ++ G E +A +L G I N R GEID+I + F+EV+ + Sbjct: 4 SRAKGKEAEDKACAFLRENGFEIIERNFFARYGEIDIIAQRDGILHFIEVKSASVGAKSG 63 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 ++T SK KL+ T +L+ N + + D + G +E+I++ Sbjct: 64 FEPIYNITPSKIEKLISTIGFYLSTQNLTQEYC---LDALIIKGGHIEFIEN 112 >UniRef50_C3XDT2 Putative uncharacterized protein n=1 Tax=Helicobacter bilis ATCC 43879 RepID=C3XDT2_9HELI Length = 112 Score = 109 bits (274), Expect = 2e-23, Method: Composition-based stats. Identities = 30/110 (27%), Positives = 52/110 (47%), Gaps = 6/110 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 +Q G +E A +L G FI N + R GEIDLIM++ F+EV+ S+ Sbjct: 4 MRQKGRYYEQVALEYLISLGFEFIEQNFHSRYGEIDLIMKKDSILHFIEVK---SSHCIN 60 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 A++T K +L +T ++L + D V+ + + +I++ Sbjct: 61 PLANITPKKLERLTKTIHVFLDQRQIVSHFC---IDAVSIYKDNITFIEN 107 >UniRef50_C1F7M4 Putative uncharacterized protein n=1 Tax=Acidobacterium capsulatum ATCC 51196 RepID=C1F7M4_ACIC5 Length = 157 Score = 109 bits (273), Expect = 3e-23, Method: Composition-based stats. Identities = 34/129 (26%), Positives = 50/129 (38%), Gaps = 9/129 (6%) Query: 7 RSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERG--GEIDLIMREGRTTIFVE 64 + SP + TG E A +L G +A G++DLI EG +E Sbjct: 26 SAASPEEPAHLTTGRRGELAAYGFLRRNGYTIVARGWRSHICPGDLDLIAWEGEHLCVIE 85 Query: 65 VRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-----E 119 V+ R + A A+V K+ L AR +L RFDVV+ + E Sbjct: 86 VKARTTRDVATAEAAVDHQKRRTLRMLARRYLRLAGIP--QSAARFDVVSVYFDSGHAPE 143 Query: 120 VEWIKDAFN 128 ++AF Sbjct: 144 FTLYRNAFG 152 >UniRef50_Q6NGK0 UPF0102 protein DIP1513 n=1 Tax=Corynebacterium diphtheriae RepID=Y1513_CORDI Length = 122 Score = 109 bits (273), Expect = 3e-23, Method: Composition-based stats. Identities = 32/106 (30%), Positives = 45/106 (42%), Gaps = 6/106 (5%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREG-RTTIFVEVRYRRSALY 73 E + +G A NV+ GEID+I +F+EV+ R S+ Sbjct: 5 HNHYLAVLGEDFVAQQYANEGYDITARNVSFSVGEIDIIATSPQGEVVFIEVKTRSSS-L 63 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE 119 AA +VT +K K+ + A WL D RFDVVA +E Sbjct: 64 MDAAEAVTPTKMRKIHRAASKWLQGKPF----ADIRFDVVAVHVDE 105 >UniRef50_UPI0001C31AB5 protein of unknown function UPF0102 n=1 Tax=Conexibacter woesei DSM 14684 RepID=UPI0001C31AB5 Length = 122 Score = 108 bits (272), Expect = 4e-23, Method: Composition-based stats. Identities = 28/118 (23%), Positives = 46/118 (38%), Gaps = 7/118 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 G E A LE +G + N R GE+D++ + +F EV+ RR Sbjct: 6 RHHLGRIGENLAVEHLERRGFVVLDRNYRTRWGELDVVACDDERIVFCEVKTRRLGSSA- 64 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN------EVEWIKDAF 127 + ++ +L + A WL + RFD V T + +E ++ AF Sbjct: 65 PLEGLREPQRRRLRRMAVSWLQAKPRRTYVPELRFDAVGVTIDATGQLVALEHLEGAF 122 >UniRef50_A8YJ85 Similar to Y189_SYNY3 UPF0102 protein sll0189 n=2 Tax=Microcystis aeruginosa RepID=A8YJ85_MICAE Length = 139 Score = 108 bits (272), Expect = 4e-23, Method: Composition-based stats. Identities = 25/103 (24%), Positives = 47/103 (45%), Gaps = 4/103 (3%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIM--REGRTTIFVEVRYRRSALY 73 G+ E WL+ + + GGEIDLI+ + FVEV+ R + + Sbjct: 1 MTTVGELGENLVADWLQLQQWHILQRRWRSGGGEIDLIVLSKSQAILAFVEVKTRSAGNW 60 Query: 74 G-GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF 115 G ++ K+ ++ + A+++L+ + + + CRFDV Sbjct: 61 DLGGKLAIDDRKKGRIYEAAQIFLSFYP-QWSDLTCRFDVALV 102 >UniRef50_B5JM98 Putative uncharacterized protein n=1 Tax=Verrucomicrobiae bacterium DG1235 RepID=B5JM98_9BACT Length = 141 Score = 108 bits (272), Expect = 4e-23, Method: Composition-based stats. Identities = 33/114 (28%), Positives = 48/114 (42%), Gaps = 5/114 (4%) Query: 6 TRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEV 65 R+ P G E +A + L+ KG + +A N EIDLI G+ +FVEV Sbjct: 10 GRASEPESAA---IGRRGEREAEKLLKRKGYQILARNWRSGRDEIDLICLHGKAVVFVEV 66 Query: 66 RYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE 119 R R+ S+ R K+ L + R + + RFDVV +E Sbjct: 67 RTRKVGALVSGYDSIDRRKREALRRVCRSYFGMMKPK--PITLRFDVVEIEHDE 118 >UniRef50_A7HZ51 UPF0102 protein Plav_3586 n=1 Tax=Parvibaculum lavamentivorans DS-1 RepID=Y3586_PARL1 Length = 133 Score = 108 bits (272), Expect = 5e-23, Method: Composition-based stats. Identities = 37/129 (28%), Positives = 53/129 (41%), Gaps = 4/129 (3%) Query: 2 ATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTI 61 + L + G E A L KG R +A + GEIDL++R GR Sbjct: 8 PRAARKGNPATGLAAYRLGLRAETLAVLLLRLKGYRVVARRLKTPAGEIDLVVRRGRALA 67 Query: 62 FVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEV- 120 VEV+ R AA ++ +Q +L + A L R+ F +D RFDVV Sbjct: 68 VVEVKARGEGD--AAAEALLPRQQRRLERAAAHLLGRYP-HFADLDLRFDVVLIVPRRWP 124 Query: 121 EWIKDAFND 129 + DA+ Sbjct: 125 RHLADAWRP 133 >UniRef50_D2L5G2 Putative uncharacterized protein n=1 Tax=Desulfovibrio sp. FW1012B RepID=D2L5G2_9DELT Length = 134 Score = 108 bits (271), Expect = 5e-23, Method: Composition-based stats. Identities = 39/118 (33%), Positives = 57/118 (48%), Gaps = 7/118 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYR-RSALYG 74 G EA A L G G R N RGGE+DLI +G T +FVEV+ R +L Sbjct: 5 HLLLGREGEAVAEALLVGAGFRVEVRNYRTRGGEVDLICLDGDTVVFVEVKARGPGSLLD 64 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE----VEWIKDAFN 128 +VT +K+ ++ + A +L+ ++ CRFDVVA + + + DAF Sbjct: 65 RPEEAVTPAKRGRIARAAAAFLSER--AWWDRPCRFDVVAVSVHGGRRTATHLPDAFG 120 >UniRef50_B6JM54 UPF0102 protein HPP12_0830 n=15 Tax=Epsilonproteobacteria RepID=Y830_HELP2 Length = 114 Score = 108 bits (270), Expect = 7e-23, Method: Composition-based stats. Identities = 24/111 (21%), Positives = 51/111 (45%), Gaps = 6/111 (5%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 ++ G E +A +L+ G + N + GEID+I + F+EV+ + Sbjct: 7 KHREKGLKAEEEACGFLKSLGFEMVERNFFSQFGEIDIIALKKGVLHFIEVKSGEN---F 63 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 ++T SK K+++T R +L++ + + D D + + E +++ Sbjct: 64 DPIYAITPSKLKKMIKTIRCYLSQKDPNSDFC---IDALIVKNGKFELLEN 111 >UniRef50_C7NGY4 Predicted endonuclease related to Holliday junction resolvase n=1 Tax=Kytococcus sedentarius DSM 20547 RepID=C7NGY4_KYTSD Length = 120 Score = 107 bits (269), Expect = 8e-23, Method: Composition-based stats. Identities = 30/119 (25%), Positives = 49/119 (41%), Gaps = 7/119 (5%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + G E A RW + +G R + N R GEIDL++ G + EV+ R + Sbjct: 2 TRERRTLGRRGEDIAARWWQERGARVLERNWRHRLGEIDLVVTSGPRLVVCEVKTRSTVA 61 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT-----GNEVEWIKDA 126 +G V ++ +L + WL H G + + R DV+ V+ + A Sbjct: 62 FGQPVEMVALPQRRRLRRLTAAWLQEHPGRWA--EVRIDVIGVLLPPGGPATVQHVPGA 118 >UniRef50_D1B6E0 Putative uncharacterized protein n=1 Tax=Thermanaerovibrio acidaminovorans DSM 6589 RepID=D1B6E0_THEAS Length = 118 Score = 107 bits (269), Expect = 9e-23, Method: Composition-based stats. Identities = 35/117 (29%), Positives = 49/117 (41%), Gaps = 8/117 (6%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + G E A R L G R + NV GEID++ +G +FVEVR R Sbjct: 2 EARNLARGALGEEMAVRHLIRMGWRILGRNVRYPFGEIDIVAHDGTELVFVEVRLR-GPG 60 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT----GNEVEWIKD 125 AA +V +K KL++ R + R D++A G VE I+D Sbjct: 61 PQRAAETVGPAKLRKLIRACRAFAESRG---YDGPFRIDLLAIDQGPCGYRVELIRD 114 >UniRef50_C3JBE2 Putative uncharacterized protein n=2 Tax=Bacteria RepID=C3JBE2_9PORP Length = 127 Score = 107 bits (269), Expect = 1e-22, Method: Composition-based stats. Identities = 28/123 (22%), Positives = 49/123 (39%), Gaps = 10/123 (8%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNE-RGGEIDLIMREGRTTIFVEVRYRRSAL 72 G E A R+LE + + N + E+D+ + R I +EV+ R Sbjct: 2 AQHNDLGVLGERAAYRYLEQLKYKILDTNWSIDGKKEVDIFATDERELIVIEVKTRNEDY 61 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN------EVEWIKDA 126 ++VTR KQ ++ ++ + RFDV+ + ++E+ KDA Sbjct: 62 SVSPLSAVTRRKQANIISLTNAYIRLKGITL---PIRFDVLTAVFHPFDQSFDIEYYKDA 118 Query: 127 FND 129 F Sbjct: 119 FRA 121 >UniRef50_A3JND8 Putative uncharacterized protein n=1 Tax=Rhodobacterales bacterium HTCC2150 RepID=A3JND8_9RHOB Length = 117 Score = 107 bits (269), Expect = 1e-22, Method: Composition-based stats. Identities = 28/117 (23%), Positives = 52/117 (44%), Gaps = 4/117 (3%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 T+ G + E R + +G + GEID+I RE IF+EV+ +S Sbjct: 3 GKTSYLAGLSAEEAVERHCKRRGKTILHRRWRGSVGEIDIIAREQDQVIFIEVK--KSKS 60 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG-NEVEWIKDAFN 128 + A + ++ ++Q ++ T +LA + RFDV +++ I++A Sbjct: 61 FYDAISHLSVAQQQRIYATGSEYLA-NEELGQNTPVRFDVALVDSMGQIKVIENAIG 116 >UniRef50_D0WR56 Putative endonuclease n=1 Tax=Actinomyces sp. oral taxon 848 str. F0332 RepID=D0WR56_9ACTO Length = 138 Score = 107 bits (268), Expect = 1e-22, Method: Composition-based stats. Identities = 32/122 (26%), Positives = 57/122 (46%), Gaps = 9/122 (7%) Query: 11 PRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLI--MREGRTTIFVEVRYR 68 P L ++ G E A R+L+ G +A N R GE+DL+ + R + VEV+ R Sbjct: 17 PEPLGNQELGKWGEELAARYLQAYGYVVLARNWRRRAGELDLVTACPQRRAVVAVEVKTR 76 Query: 69 RSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-----VEWI 123 + + G+ +++R+K +L + +WL + V D+VA T + + Sbjct: 77 NAEVSVGSVEAISRAKLARLRKLTGMWLQETGTRCERVCL--DLVAITVENDGSWLIRHL 134 Query: 124 KD 125 +D Sbjct: 135 RD 136 >UniRef50_C3XN83 Putative uncharacterized protein n=3 Tax=Helicobacter RepID=C3XN83_9HELI Length = 122 Score = 107 bits (268), Expect = 1e-22, Method: Composition-based stats. Identities = 26/95 (27%), Positives = 43/95 (45%), Gaps = 3/95 (3%) Query: 12 RQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSA 71 T Q G E A +LE +G A N N R GEID+I ++ FVEV+ Sbjct: 5 NTANTTQKGKEAEDFACAFLENEGYSIEARNFNTRFGEIDIIAKKDGILHFVEVKSGIG- 63 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTV 106 ++T +K K+++T ++L ++ + Sbjct: 64 --FEPIYNITPTKVQKIIKTIEIYLKEYHLNLPYC 96 >UniRef50_B9L042 Putative uncharacterized protein n=2 Tax=Thermomicrobia (class) RepID=B9L042_THERP Length = 125 Score = 106 bits (266), Expect = 2e-22, Method: Composition-based stats. Identities = 36/106 (33%), Positives = 54/106 (50%), Gaps = 1/106 (0%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + + G+A E A RWLE G +A N R GE+D++ +G + VEV+ RR A Sbjct: 1 MGRQCLGEAGERAAARWLEEAGWHVLARNWRCRQGELDIVALDGDVLVAVEVKVRRDAGN 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE 119 A +VT K +LL +LA H + + CR D++A T + Sbjct: 61 EPAEWAVTPRKGRRLLAALSAFLAAHPEHQERL-CRVDLIAVTVDR 105 >UniRef50_A6FT82 PII uridylyl-transferase n=5 Tax=Rhodobacterales RepID=A6FT82_9RHOB Length = 175 Score = 106 bits (266), Expect = 2e-22, Method: Composition-based stats. Identities = 32/129 (24%), Positives = 53/129 (41%), Gaps = 4/129 (3%) Query: 2 ATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTI 61 A + R L + G A E + +G+ + RGGEIDLI+R+G + Sbjct: 49 AGPAKTARRDRGLRSWLAGAAAEKIVALAYDKRGIDLLETRWRGRGGEIDLILRDGSEIV 108 Query: 62 FVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG-NEV 120 F EV+ RS A + ++ ++ A +L R RFD+ G Sbjct: 109 FCEVKAARSTQ--EAIQRLRPAQMRRIHAAASEYLGRVP-EGQLAQVRFDLAVVDGTGRA 165 Query: 121 EWIKDAFND 129 + +++AF Sbjct: 166 DILENAFGH 174 >UniRef50_Q5SLC1 UPF0102 protein TTHA0372 n=5 Tax=Thermaceae RepID=Y372_THET8 Length = 112 Score = 105 bits (264), Expect = 3e-22, Method: Composition-based stats. Identities = 36/109 (33%), Positives = 52/109 (47%), Gaps = 9/109 (8%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAA 78 G E +A R+L GKG R + N GE+DL M + + VEV+ R SA +G Sbjct: 5 RGRWAEEEALRFLLGKGYRLLWRNRRTPFGEVDLFMEKDGVYVVVEVKQRASARFGAPLE 64 Query: 79 SVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN----EVEWI 123 ++T K +LLQ+AR L R D + R + V G +E + Sbjct: 65 AITPGKVRRLLQSARFLLGR-----DDLPVRLEAVLVHGTPKDFRLEHL 108 >UniRef50_C3PH56 Putative uncharacterized protein n=1 Tax=Corynebacterium aurimucosum ATCC 700975 RepID=C3PH56_CORA7 Length = 145 Score = 105 bits (262), Expect = 6e-22, Method: Composition-based stats. Identities = 35/104 (33%), Positives = 53/104 (50%), Gaps = 3/104 (2%) Query: 18 QTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE-GRTTIFVEVRYRRSALYGGA 76 G A E A + +G + IAANV+ R GE+DL++RE T +F EV+ R + +G A Sbjct: 23 ALGKAGEKFAADFYRARGAQVIAANVHYRVGELDLVVRESDGTIVFCEVKTRATRNFGVA 82 Query: 77 AASVTRSKQHKLLQTARLWLA-RHNGSFDTVDCRFDVVAFTGNE 119 +VT K +L + A WL+ + + RFDV+ Sbjct: 83 -EAVTPRKLKRLRKAAAQWLSTARSENQALSKVRFDVLGLVATG 125 >UniRef50_Q2RJT6 UPF0102 protein Moth_0988 n=2 Tax=Clostridia RepID=Y988_MOOTA Length = 120 Score = 105 bits (262), Expect = 6e-22, Method: Composition-based stats. Identities = 39/121 (32%), Positives = 54/121 (44%), Gaps = 8/121 (6%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 +T ++ G EA A L G R + N GEID++ +G +F+EVR R S Sbjct: 2 TMTRRRRGQIGEAAAAALLADSGYRILERNYRCPLGEIDIVAAQGEEIVFIEVRTRSSQT 61 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE------VEWIKDA 126 +G SV K+ +L + A + CRFDVVA + VE IK A Sbjct: 62 FGTPQESVDGRKRLRLRRLAAY--YLGSRGLAGRSCRFDVVAVWLDRQERVAGVEVIKGA 119 Query: 127 F 127 F Sbjct: 120 F 120 >UniRef50_C4DPS8 Predicted endonuclease related to Holliday junction resolvase n=1 Tax=Stackebrandtia nassauensis DSM 44728 RepID=C4DPS8_9ACTO Length = 207 Score = 104 bits (261), Expect = 7e-22, Method: Composition-based stats. Identities = 25/98 (25%), Positives = 41/98 (41%) Query: 11 PRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRS 70 P + G E A L G+R + N GE+D+I E T+F EV+ RRS Sbjct: 2 PHDRRHLRLGCFGENLAVAHLRRDGMRVLQRNWRCEHGELDIIAIERGVTVFCEVKTRRS 61 Query: 71 ALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDC 108 +G ++ +K ++ + A W R+ + Sbjct: 62 LRFGTPMQAIDEAKALRIRRLAASWHRRYRDKPPWAEW 99 >UniRef50_O66457 UPF0102 protein aq_041 n=1 Tax=Aquifex aeolicus RepID=Y041_AQUAE Length = 103 Score = 104 bits (261), Expect = 7e-22, Method: Composition-based stats. Identities = 29/107 (27%), Positives = 45/107 (42%), Gaps = 10/107 (9%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAA 78 G +E A R+L+ KG + + N+ GEID++ + VEV+ + A Sbjct: 2 KGREYEDLAARYLKSKGYQILGRNLRSPYGEIDILAEFEGRKVIVEVKGSETFF---PAE 58 Query: 79 SVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 VT K K+++TA L S + VV +V KD Sbjct: 59 KVTPHKLSKIIRTAYEVLGEEPFSIE-------VVVVYRGKVYHYKD 98 >UniRef50_A9BFT1 UPF0102 protein Pmob_0702 n=1 Tax=Petrotoga mobilis SJ95 RepID=Y702_PETMO Length = 112 Score = 104 bits (261), Expect = 8e-22, Method: Composition-based stats. Identities = 26/105 (24%), Positives = 49/105 (46%), Gaps = 2/105 (1%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 G +E +A + + R IA N + R GEID+I + + +EV+ + +G Sbjct: 1 MNTKGKVYEDKAVSFFLNRDYRIIARNFSYRHGEIDIIALKNKILHLIEVKGGKET-FGD 59 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEV 120 A V K K+++ ++A H + + + DV++ T + V Sbjct: 60 PAFRVNSRKLKKIMKVGNYFIATHP-KLEFDEIQIDVISVTNDGV 103 >UniRef50_Q2KDE4 UPF0102 protein RHE_CH00320 n=4 Tax=Rhizobiales RepID=Y320_RHIEC Length = 122 Score = 104 bits (260), Expect = 9e-22, Method: Composition-based stats. Identities = 36/116 (31%), Positives = 54/116 (46%), Gaps = 4/116 (3%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + + G E A +L KG R +A R GEID++ R+G TIFVEV+ R Sbjct: 10 KRKALRRGRMSEYVAAAFLMLKGYRILALRHRTRLGEIDIVARKGDLTIFVEVKARH--G 67 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEV-EWIKDAF 127 A +V+ + Q ++ + LWLAR R+D++A + DAF Sbjct: 68 EAAAIDAVSVAAQKRIRAASDLWLARQADQARLSQ-RYDIIAVMPGRLPRHFPDAF 122 >UniRef50_A9IXC8 UPF0102 protein BT_1882 n=8 Tax=Rhizobiales RepID=Y1882_BART1 Length = 130 Score = 104 bits (260), Expect = 1e-21, Method: Composition-based stats. Identities = 33/123 (26%), Positives = 51/123 (41%), Gaps = 4/123 (3%) Query: 9 GSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYR 68 ++ + G E A WL KG + GEIDLI R G + VEV+ R Sbjct: 10 PKKQRQKSFYRGVRAEKLAAWWLRFKGFHIAEMRFKTKCGEIDLIARRGNLVLIVEVKAR 69 Query: 69 RSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEV-EWIKDAF 127 + A +V+R + ++ A +WLAR + + RFD++A + I F Sbjct: 70 ST--LLEAMEAVSRMNEKRIEAAADIWLARQK-DYALLSVRFDLIAILPWRWPKHIPAFF 126 Query: 128 NDH 130 Sbjct: 127 TSD 129 >UniRef50_B2IFF3 Putative uncharacterized protein n=1 Tax=Beijerinckia indica subsp. indica ATCC 9039 RepID=B2IFF3_BEII9 Length = 125 Score = 103 bits (258), Expect = 2e-21, Method: Composition-based stats. Identities = 32/125 (25%), Positives = 50/125 (40%), Gaps = 4/125 (3%) Query: 7 RSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVR 66 +S + + G E+ A WL + R + +GGEID++ G T F+EV+ Sbjct: 2 KSRKEARRRAHRFGLWAESLAILWLRMRFYRILDRRFFVKGGEIDIVAHRGDTIAFIEVK 61 Query: 67 YRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEV-EWIKD 125 R + A ++ K+ +L AR WLA H + V R D + I Sbjct: 62 ARPT--LDEALLAIDAVKRRRLSLAARYWLAAHPWAASHV-LRGDALCIAPWCWPRHIPA 118 Query: 126 AFNDH 130 A Sbjct: 119 AIPLD 123 >UniRef50_B2GFY9 Putative uncharacterized protein n=1 Tax=Kocuria rhizophila DC2201 RepID=B2GFY9_KOCRD Length = 144 Score = 103 bits (258), Expect = 2e-21, Method: Composition-based stats. Identities = 28/123 (22%), Positives = 38/123 (30%), Gaps = 9/123 (7%) Query: 3 TVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVN------ERGGEIDLIMRE 56 + + +P T G A E L G N GE+D++ Sbjct: 10 SRSSAVPTPDAPTALDVGRAGEDLIADLLARSGWSVRDRNWRPAPGPGRPRGELDIVAER 69 Query: 57 GRTTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT 116 G EV+ R A +G +V K +L AR W H DVVA Sbjct: 70 GGVVTVFEVKTRSGADFGHPCEAVGAEKLRRLHVLARAWAREHRDPRVPT---VDVVAVH 126 Query: 117 GNE 119 Sbjct: 127 WPR 129 >UniRef50_C0XSA5 Endonuclease n=1 Tax=Corynebacterium lipophiloflavum DSM 44291 RepID=C0XSA5_9CORY Length = 124 Score = 102 bits (256), Expect = 3e-21, Method: Composition-based stats. Identities = 35/122 (28%), Positives = 47/122 (38%), Gaps = 10/122 (8%) Query: 9 GSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREG-RTTIFVEVRY 67 A E A G + V + GEIDLI+RE T +FVEV+ Sbjct: 2 AQSNYAENHALALAGEKLAASTYSEMGYAIVGTRVRTKVGEIDLIVREETGTVVFVEVKT 61 Query: 68 RRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE----VEWI 123 RR +G AA +VT K + + A WLA + RFDV + Sbjct: 62 RRGRGFG-AAETVTAKKLRTMRRCAAEWLAGN----AYAPVRFDVAEVIVTGETMDIRLF 116 Query: 124 KD 125 +D Sbjct: 117 ED 118 >UniRef50_C7JH91 Putative uncharacterized protein n=8 Tax=Acetobacter pasteurianus RepID=C7JH91_ACEP3 Length = 149 Score = 102 bits (256), Expect = 3e-21, Method: Composition-based stats. Identities = 33/114 (28%), Positives = 50/114 (43%), Gaps = 4/114 (3%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 G A E QA WLE G + GEID++ + FVEV+ RRS Sbjct: 37 AYTQGVAAEQQACNWLEQDGWTVLLRRARTHRGEIDIVASKAVVLCFVEVKKRRS--IEE 94 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF-TGNEVEWIKDAFN 128 A S+ ++Q +L + A L +H + + RFD+ F +E ++D Sbjct: 95 ALVSLQPAQQRRLFRAAECLLQKHPY-WQYEEMRFDLFVFDDAGRMERLEDVIR 147 >UniRef50_A3K994 Putative uncharacterized protein n=7 Tax=Rhodobacterales RepID=A3K994_9RHOB Length = 159 Score = 102 bits (255), Expect = 3e-21, Method: Composition-based stats. Identities = 31/129 (24%), Positives = 56/129 (43%), Gaps = 4/129 (3%) Query: 2 ATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTI 61 R+ + G + E + + E +G +GGEIDLI+R+G I Sbjct: 33 PDAARRAKKDAGRIGYEAGASAELRVAQDYERRGFPLARRRWRGQGGEIDLIVRDGDGLI 92 Query: 62 FVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG-NEV 120 FVEV+ +S + AA ++R + ++++ A +L D RFD+ ++ Sbjct: 93 FVEVK--KSRSFRHAAERLSRRQMNRIISAAEEFLGTQPL-GSLTDVRFDLAMVDVYGQI 149 Query: 121 EWIKDAFND 129 I++A Sbjct: 150 RVIENAIGH 158 >UniRef50_A3UIK6 Putative uncharacterized protein n=1 Tax=Oceanicaulis alexandrii HTCC2633 RepID=A3UIK6_9RHOB Length = 128 Score = 102 bits (254), Expect = 5e-21, Method: Composition-based stats. Identities = 35/117 (29%), Positives = 58/117 (49%), Gaps = 4/117 (3%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + ++ G EA + WL KG R + GE+DL+ R G F+EV++R + Sbjct: 5 REQAERRGRRTEAISALWLRLKGWRILDERARTGVGELDLVARRGGVLAFIEVKHRPTVD 64 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEV-EWIKDAFN 128 A ++T +Q +L++ A LW +RH D + RFDV+ + I+ AF+ Sbjct: 65 --AARLAITPRQQMRLIRAASLWRSRH-AGIDHLQPRFDVMLWPAQGWPRHIQGAFS 118 >UniRef50_B0P7L1 Putative uncharacterized protein n=1 Tax=Anaerotruncus colihominis DSM 17241 RepID=B0P7L1_9FIRM Length = 132 Score = 102 bits (254), Expect = 5e-21, Method: Composition-based stats. Identities = 34/124 (27%), Positives = 52/124 (41%), Gaps = 9/124 (7%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 ++ + G A EA A LE +G R + N EIDLI + G FVEV+ R Sbjct: 1 MSARTYGAAGEAFAASALEAEGYRILERNWRSGRSEIDLIAQRGDIIAFVEVKTRGEHAL 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNG-SFDTVDCRFDVVAFT--------GNEVEWIK 124 AA VTR+++ ++ A +L + V RFDV + Sbjct: 61 AAPAAFVTRAQRRRIALAAVEYLRARGIYNTGAVQPRFDVFEIVTGGPDGACVTRFSHLV 120 Query: 125 DAFN 128 +A++ Sbjct: 121 NAYD 124 >UniRef50_A4EVA8 Putative uncharacterized protein n=1 Tax=Roseobacter sp. SK209-2-6 RepID=A4EVA8_9RHOB Length = 136 Score = 101 bits (253), Expect = 7e-21, Method: Composition-based stats. Identities = 35/120 (29%), Positives = 60/120 (50%), Gaps = 4/120 (3%) Query: 7 RSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVR 66 + G R L + +G + E QA R E G + GEIDLI+R+G T +F EV+ Sbjct: 15 KRGQNRGLRSHLSGLSAEHQAARAYEALGFEVVEERWRGEAGEIDLILRQGATWVFAEVK 74 Query: 67 YRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG-NEVEWIKD 125 +S + AA +++ + ++ Q+A L+L R + + R D V +VE +++ Sbjct: 75 --KSTDFETAATRISQKQVQRIRQSATLYLDRFP-NEQVEEVRLDAVLIDAEGQVEILEN 131 >UniRef50_Q7U7D4 UPF0102 protein SYNW1051 n=3 Tax=Synechococcus RepID=Y1051_SYNPX Length = 134 Score = 101 bits (252), Expect = 8e-21, Method: Composition-based stats. Identities = 24/90 (26%), Positives = 43/90 (47%), Gaps = 2/90 (2%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + + G E + L+ +G + + N + R GE+DL++ + + VEV+ RRS Sbjct: 17 PMKMQPPGAQAETRVSSLLQRQGWQLLDRNWSCRWGELDLVLHKNEQLLVVEVKKRRSLA 76 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGS 102 +G SV +K+ +L + W A H Sbjct: 77 WGP--WSVDPTKRRRLGRAISCWRAEHPIQ 104 >UniRef50_A1B931 Putative uncharacterized protein n=1 Tax=Paracoccus denitrificans PD1222 RepID=A1B931_PARDP Length = 137 Score = 101 bits (252), Expect = 8e-21, Method: Composition-based stats. Identities = 31/114 (27%), Positives = 44/114 (38%), Gaps = 4/114 (3%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 G E A R +G +A RGGEIDLI+ FVEV+ +S + Sbjct: 25 AYSAGRLAEESAAREYRRRGYEVMAERWRGRGGEIDLILCRDDEYTFVEVK--KSRFHDR 82 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT-GNEVEWIKDAFN 128 AA + + ++ A + R RFD VE I++AF Sbjct: 83 AAERIGARQIARICNAALEYCGRLPAGL-LTAMRFDAALVDQFGRVEIIENAFG 135 >UniRef50_B3T5F3 Putative uncharacterized protein family UPF0102 n=1 Tax=uncultured marine microorganism HF4000_ANIW141I9 RepID=B3T5F3_9ZZZZ Length = 172 Score = 101 bits (252), Expect = 8e-21, Method: Composition-based stats. Identities = 26/117 (22%), Positives = 40/117 (34%), Gaps = 8/117 (6%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNE-RGGEIDLIMREGRTTIFVEVRYRRSAL 72 ++ G E A KG + A N GEID+I + +F EV+ Sbjct: 55 QKKRKIGQWGERLAALEYYRKGYKVHALNYYCAPFGEIDIIAEKENELVFAEVKTAAGKT 114 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE----VEWIKD 125 GG V K +L ++ + D R DV A + ++ K Sbjct: 115 LGGVEGQVDEVKLQRLSNAIDKYIMDNEIQ---NDIRLDVFAIILGKNGPALKHFKG 168 >UniRef50_A6Q6T2 UPF0102 protein SUN_0231 n=3 Tax=Epsilonproteobacteria RepID=Y231_SULNB Length = 113 Score = 101 bits (252), Expect = 8e-21, Method: Composition-based stats. Identities = 31/111 (27%), Positives = 49/111 (44%), Gaps = 6/111 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERG-GEIDLIMREGRTTIFVEVRYRRSALYG 74 K GD E A +LE +G I N R GEID+I ++ F+EV+ Sbjct: 5 PKIFGDKSEDLATLFLEQEGFIVIERNYFARKLGEIDIIAQKDEVLHFIEVK--SGKADF 62 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 +VT K K++ +A ++ V D + G+EVE+I++ Sbjct: 63 DPVYNVTPDKLRKVINSAHYYMKSKKI---DVSFSVDALIIRGDEVEFIEN 110 >UniRef50_C9M9E0 Putative uncharacterized protein n=1 Tax=Jonquetella anthropi E3_33 E1 RepID=C9M9E0_9BACT Length = 155 Score = 100 bits (251), Expect = 1e-20, Method: Composition-based stats. Identities = 27/117 (23%), Positives = 41/117 (35%), Gaps = 7/117 (5%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSAL 72 + G E A L +G NV E+DLI VEVR R+ + Sbjct: 32 AQDSLAVGRWAEDLAADLLAEEGYSVCGRNVRVGPCELDLIGFIDGCLTAVEVRCRQKSR 91 Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG----NEVEWIKD 125 +V K + L++ R + ++ + R D+ A T W KD Sbjct: 92 LQSPEETVGPRKWNALVRGIRGYASQTGWNG---PMRIDLFAVTVCGRRWSARWYKD 145 >UniRef50_Q04SX0 UPF0102 protein LBJ_1427 n=4 Tax=Leptospira RepID=Y1427_LEPBJ Length = 116 Score = 100 bits (251), Expect = 1e-20, Method: Composition-based stats. Identities = 25/113 (22%), Positives = 44/113 (38%), Gaps = 2/113 (1%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 K GD E+ A +L G + N EID+I + F EV++ + Sbjct: 5 KKIKGDEGESIASDFLISIGHEILKRNYRFLYCEIDIISIKEEVLYFSEVKFWKEFESFD 64 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF-TGNEVEWIKDAF 127 + +KQ ++ + A +L+ + S F +V+ E+ D F Sbjct: 65 PRFTFNFAKQTRMRKAASGFLSEN-LSLQNHFVSFCLVSINEKKGCEYYPDLF 116 >UniRef50_Q0C451 UPF0102 protein HNE_0764 n=1 Tax=Hyphomonas neptunium ATCC 15444 RepID=Y764_HYPNA Length = 122 Score = 100 bits (250), Expect = 1e-20, Method: Composition-based stats. Identities = 39/121 (32%), Positives = 59/121 (48%), Gaps = 4/121 (3%) Query: 10 SPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRR 69 ++ + G E +A WL KG +AA V GGEIDLI R+GR FVEV+ R Sbjct: 2 PAKRQIAEARGRQAERRAALWLRLKGCSVLAARVKLPGGEIDLIARKGRLIAFVEVKAR- 60 Query: 70 SALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVE-WIKDAFN 128 A A +V+ H++ + A +W+ H F R+D++A + +KDA+ Sbjct: 61 -ARRDDALGAVSVQSWHRIARAAEVWMG-HRPKFAGYGWRYDLIALAPGSLPYHLKDAWR 118 Query: 129 D 129 Sbjct: 119 P 119 >UniRef50_Q31RH5 UPF0102 protein Synpcc7942_0312 n=2 Tax=Synechococcus elongatus RepID=Y312_SYNE7 Length = 142 Score = 100 bits (250), Expect = 1e-20, Method: Composition-based stats. Identities = 29/97 (29%), Positives = 43/97 (44%), Gaps = 2/97 (2%) Query: 21 DAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG-GAAAS 79 A EA W + L +A + R GE+DL+ +E F+EV+ RR + + Sbjct: 10 RAGEALVAAWCRDRRLEVLAERWHCRWGELDLVTQEDSALRFIEVKTRRQTGWDQSGLLA 69 Query: 80 VTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT 116 + +KQ L + A +LA V CRFDV Sbjct: 70 IGPAKQRCLSRAAACYLASLGNQAA-VACRFDVALVR 105 >UniRef50_A4QF37 UPF0102 protein cgR_1859 n=4 Tax=Corynebacterium RepID=Y1859_CORGB Length = 122 Score = 100 bits (250), Expect = 1e-20, Method: Composition-based stats. Identities = 31/107 (28%), Positives = 46/107 (42%), Gaps = 6/107 (5%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREG-RTTIFVEVRYRRSA 71 + + G E A + + NV GE+DLI+R +FVEV+ RR + Sbjct: 2 KTQKQYLGAFGEDVALQQYLDDQATLLDRNVRYSCGELDLIVRLASGVVVFVEVKTRRGS 61 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN 118 + AA +V K ++ + A LWL RFDVVA + Sbjct: 62 AFDSAA-AVNNQKMLRMRRAAALWLEGKP----YTPIRFDVVAIVLD 103 >UniRef50_B5Y8F8 Putative uncharacterized protein n=1 Tax=Coprothermobacter proteolyticus DSM 5265 RepID=B5Y8F8_COPPD Length = 115 Score = 100 bits (250), Expect = 2e-20, Method: Composition-based stats. Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 9/102 (8%) Query: 24 EAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAASVTRS 83 E + +L + R + NV GEID++ +GRT +FVEVRYR++ AA +V Sbjct: 7 EDRVASFLVSQKYRILDQNVVFPTGEIDIVALKGRTLVFVEVRYRKN---FDAAETVDSR 63 Query: 84 KQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 K +++Q A L+ + R DV A W K Sbjct: 64 KLERIMQCAYLY------TGGEQSYRIDVFACGPQGCHWYKG 99 >UniRef50_C5BWW3 UPF0102 protein Bcav_2532 n=21 Tax=Actinomycetales RepID=Y2532_BEUC1 Length = 118 Score = 100 bits (249), Expect = 2e-20, Method: Composition-based stats. Identities = 41/117 (35%), Positives = 56/117 (47%), Gaps = 7/117 (5%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 G E A RWLE +GL + N GE+DL+ R+G T +FVEV+ R S + Sbjct: 2 RAKDAIGAYGERVAGRWLEAEGLEVVERNWRCPDGELDLVARDGETLVFVEVKTRSSLAF 61 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-----EVEWIKD 125 G +VTR K +L + A WLA H+ + R DVVA VE ++ Sbjct: 62 GHPGEAVTRLKLARLRRLAARWLAEHDAHA--REVRIDVVAVLRTRAGAARVEHLRG 116 >UniRef50_D1NDV3 Fimbrial usher protein (Fragment) n=1 Tax=Haemophilus influenzae HK1212 RepID=D1NDV3_HAEIN Length = 323 Score = 100 bits (249), Expect = 2e-20, Method: Composition-based stats. Identities = 42/80 (52%), Positives = 51/80 (63%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 +Q G ++E QAR +LE KGL FIAAN N + GE+DLIM + T +FVEVR R + YG Sbjct: 167 KRQQGASFEHQARLFLESKGLTFIAANQNFKCGELDLIMNDKETIVFVEVRQRSHSAYGS 226 Query: 76 AAASVTRSKQHKLLQTARLW 95 A SV KQ K L A LW Sbjct: 227 AIESVDWRKQQKWLDAANLW 246 >UniRef50_A9I0M2 UPF0102 protein Bpet0439 n=14 Tax=Proteobacteria RepID=Y439_BORPD Length = 162 Score = 99.7 bits (248), Expect = 2e-20, Method: Composition-based stats. Identities = 56/115 (48%), Positives = 72/115 (62%), Gaps = 3/115 (2%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 T++TG A E QA R L G GL +A N++ R GEIDL+MR+G T + VEVR R + YGG Sbjct: 46 TQRTGTAHEDQALRLLAGAGLVPLARNLHCRAGEIDLVMRDGATLVLVEVRARANPRYGG 105 Query: 76 AAASVTRSKQHKLLQTARLW---LARHNGSFDTVDCRFDVVAFTGNEVEWIKDAF 127 AAASV R+K+ +LL+ A L LAR + RFDVVAF +W+ AF Sbjct: 106 AAASVGRAKRARLLRCAALLLPDLARRHWGGRIPPVRFDVVAFEAGRADWLPAAF 160 >UniRef50_B2S8H0 UPF0102 protein BAbS19_I01690 n=50 Tax=Rhizobiales RepID=Y1690_BRUA1 Length = 126 Score = 99.7 bits (248), Expect = 3e-20, Method: Composition-based stats. Identities = 37/109 (33%), Positives = 47/109 (43%), Gaps = 3/109 (2%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAA 78 G + E A L KG R +A R GEIDLI R G + VEV+ R A + A Sbjct: 14 RGHSAERLAAFALMLKGFRIVARRYRTRLGEIDLIARRGDLVLIVEVKAR--ASFEAAQF 71 Query: 79 SVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAF 127 +VT ++ A LWL R + RFD+VA AF Sbjct: 72 AVTPQAMRRIEAAADLWLQRQ-TDRARLSLRFDMVAVLPRRWPKHVPAF 119 >UniRef50_B0T377 UPF0102 protein Caul_0175 n=3 Tax=Caulobacteraceae RepID=Y175_CAUSK Length = 139 Score = 99.4 bits (247), Expect = 3e-20, Method: Composition-based stats. Identities = 29/129 (22%), Positives = 48/129 (37%), Gaps = 6/129 (4%) Query: 2 ATVPTRSGS--PRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRT 59 P R R + +G E A WL KG R + + GEIDL+ + Sbjct: 6 PLRPERQAQKQARGAAARLSGRRAEVLAALWLMAKGYRILGFRLATPLGEIDLLAQRRGV 65 Query: 60 TIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN- 118 VEV+ R A +VT ++ +L + +A + R D++A Sbjct: 66 LAVVEVKSR--TSLEAALEAVTYEQRSRLRRAGAH-IAANRAGLRDAVVRLDLIALAPGR 122 Query: 119 EVEWIKDAF 127 + +A+ Sbjct: 123 RPRHLLNAW 131 >UniRef50_A4ECJ4 Putative uncharacterized protein n=1 Tax=Collinsella aerofaciens ATCC 25986 RepID=A4ECJ4_9ACTN Length = 161 Score = 99.4 bits (247), Expect = 3e-20, Method: Composition-based stats. Identities = 26/128 (20%), Positives = 44/128 (34%), Gaps = 14/128 (10%) Query: 12 RQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEI--DLIMREGRTTIFVEVRYRR 69 + L+ ++ G E + +G + GE L+ + EV+ RR Sbjct: 33 KGLSPRELGMLGELITIDYFNERGYTLLEQGYRCTEGEADLVLLDELDDVVVMAEVKTRR 92 Query: 70 ----SALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-----EV 120 +V KQ + + A +L H + RFD V T E+ Sbjct: 93 VALDCNTRVFPEEAVDAQKQRRYRRIASCYLMEH---YPLKAIRFDAVGVTIRGGHIAEI 149 Query: 121 EWIKDAFN 128 E +AF+ Sbjct: 150 EHQYNAFD 157 >UniRef50_A8U078 Predicted endonuclease n=1 Tax=alpha proteobacterium BAL199 RepID=A8U078_9PROT Length = 151 Score = 99.4 bits (247), Expect = 3e-20, Method: Composition-based stats. Identities = 33/128 (25%), Positives = 58/128 (45%), Gaps = 4/128 (3%) Query: 3 TVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIF 62 + R+ + ++ G E A WL +G R +A V GE+DL++R G + Sbjct: 25 RPNGSAQLERRRSAERRGLRAEWLAALWLMLRGYRVLARRVRTPAGEVDLVVRRGSVVVA 84 Query: 63 VEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEV-E 121 VEV+ R + A SV+ ++H++ +LAR ++ RFD+VA + Sbjct: 85 VEVKARAT--LDAALDSVSSRQRHRVALGLESFLARRP-ELAGLNRRFDLVAVQPWRLPV 141 Query: 122 WIKDAFND 129 + D + Sbjct: 142 HLADVWRP 149 >UniRef50_Q0AK98 UPF0102 protein Mmar10_3014 n=1 Tax=Maricaulis maris MCS10 RepID=Y3014_MARMM Length = 127 Score = 99.0 bits (246), Expect = 4e-20, Method: Composition-based stats. Identities = 29/119 (24%), Positives = 51/119 (42%), Gaps = 4/119 (3%) Query: 10 SPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRR 69 + + + G E A WL KG R + GEIDL+ R G +F+EV+ R Sbjct: 2 TRARRQAEARGRWAEWLAMAWLVAKGYRLLDHRARTAAGEIDLVARRGEYLVFIEVKARA 61 Query: 70 SALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEV-EWIKDAF 127 + A S+ ++ ++ + A +W A S + R+D+V + + A+ Sbjct: 62 TR--AEALDSIGPRQRGRITRAASIWRAP-RSSLHHLHLRYDLVLVVPGRWPQHRRAAW 117 >UniRef50_Q2VYL8 UPF0102 protein amb4503 n=5 Tax=Alphaproteobacteria RepID=Y4503_MAGSA Length = 129 Score = 98.2 bits (244), Expect = 7e-20, Method: Composition-based stats. Identities = 32/135 (23%), Positives = 51/135 (37%), Gaps = 11/135 (8%) Query: 1 MATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERG----GEIDLIMRE 56 M + P ++ G E A WL KG +A + GE+DL+ R Sbjct: 1 MNSAPPSRA---HQAAQRRGKVAEGLAALWLRLKGYGILAKGLKSGRGSGAGEVDLVARR 57 Query: 57 GRTTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT 116 G FVEV+ R + A S+T ++ ++ + A + RFD+V Sbjct: 58 GDLVAFVEVKSRAT--LDQAIESLTPFQRQRIERAAAA-FLARRPELASCGVRFDMVLVA 114 Query: 117 GNEV-EWIKDAFNDH 130 + I DA+ Sbjct: 115 PWRLPRHIPDAWRID 129 >UniRef50_C6QFU1 Putative uncharacterized protein n=1 Tax=Hyphomicrobium denitrificans ATCC 51888 RepID=C6QFU1_9RHIZ Length = 129 Score = 97.4 bits (242), Expect = 1e-19, Method: Composition-based stats. Identities = 32/119 (26%), Positives = 50/119 (42%), Gaps = 3/119 (2%) Query: 2 ATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTI 61 R S + ++G E G R + GEIDLI +GR Sbjct: 6 NDDQARPLSDIRRRRYRSGLNAEMVVAAVYMALGHRILGRRFKTPVGEIDLIAIKGRRVA 65 Query: 62 FVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEV 120 FVEV+ R S+ A ++T + + ++ + A LWLAR+ + + D FD+V Sbjct: 66 FVEVKRRASSE--EAEDAITLTMRRRVRRAADLWLARNP-QYQSHDVGFDLVFVLPWRF 121 >UniRef50_Q16B02 UPF0102 protein RD1_1191 n=1 Tax=Roseobacter denitrificans OCh 114 RepID=Y1191_ROSDO Length = 129 Score = 97.1 bits (241), Expect = 2e-19, Method: Composition-based stats. Identities = 32/131 (24%), Positives = 59/131 (45%), Gaps = 5/131 (3%) Query: 1 MATVPTRSGSPRQLTT-KQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRT 59 M +P + G + EA R E G F A + GEIDL++R+ Sbjct: 1 MTQMPQTNARVHAGRMAYHAGLSAEASVIREYESHGYVFEAQRWRGQVGEIDLVLRKSGL 60 Query: 60 TIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT-GN 118 +FVEV+ +S + AA ++ +++ ++ T ++A+ D RFDV Sbjct: 61 VVFVEVK--KSKSFERAALRISPTQKRRIFATGEEFVAQEPQGL-LTDMRFDVALVDAAG 117 Query: 119 EVEWIKDAFND 129 V+ +++A ++ Sbjct: 118 AVQILENALSE 128 >UniRef50_Q0FQ74 Putative uncharacterized protein n=1 Tax=Roseovarius sp. HTCC2601 RepID=Q0FQ74_9RHOB Length = 153 Score = 96.7 bits (240), Expect = 2e-19, Method: Composition-based stats. Identities = 31/124 (25%), Positives = 52/124 (41%), Gaps = 4/124 (3%) Query: 7 RSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVR 66 R R + G + EAQ + +G R R GEIDLI+ +G IFVEV+ Sbjct: 33 RRRVERGALGHRAGLSAEAQVAQDYRRRGYRVAGQRWRGRSGEIDLILHDGDGLIFVEVK 92 Query: 67 YRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-VEWIKD 125 +S + A ++ + +LL+ + A ++ RFDV + +++ Sbjct: 93 --KSRSFDHAMQHLSSRQIARLLRAGEEF-AGTQPRGSLIEMRFDVALMNEQGMIRIVEN 149 Query: 126 AFND 129 A Sbjct: 150 ALGP 153 >UniRef50_A3TTV9 Putative uncharacterized protein n=1 Tax=Oceanicola batsensis HTCC2597 RepID=A3TTV9_9RHOB Length = 140 Score = 96.3 bits (239), Expect = 3e-19, Method: Composition-based stats. Identities = 35/123 (28%), Positives = 57/123 (46%), Gaps = 4/123 (3%) Query: 6 TRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEV 65 + R +G A E+ R E +G R + GGEIDLI+ E +FVEV Sbjct: 17 PAARRRRGEIAHLSGLAAESAVERTYEARGARVLHRRWRGPGGEIDLILAEPDRVVFVEV 76 Query: 66 RYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-VEWIK 124 + ++A +G AA V ++ ++ ++A ++ D R DV G VE ++ Sbjct: 77 K--KAATHGAAAERVRPAQVQRIARSAMAFVDTLP-GGALTDIRLDVALVDGGGAVELLE 133 Query: 125 DAF 127 +AF Sbjct: 134 NAF 136 >UniRef50_UPI00016C4BC8 hypothetical protein GobsU_17186 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4BC8 Length = 155 Score = 95.5 bits (237), Expect = 4e-19, Method: Composition-based stats. Identities = 41/122 (33%), Positives = 56/122 (45%), Gaps = 10/122 (8%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 + G E A +L G R +AANVN+R GE+DL+ +G T + VEVR SA Sbjct: 26 KRWFGRRSERAAANYLRGLRYRLLAANVNDRDGELDLLAIDGETLVIVEVRSTSSARPDA 85 Query: 76 A---AASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE------VEWIKDA 126 AASV KQ K+ + +L R + R+DV+ E V I+ A Sbjct: 86 IEQTAASVDLRKQRKITEATSRFLGRRRL-LGRIAVRYDVLVIAWPEHAREPAVRHIRHA 144 Query: 127 FN 128 F Sbjct: 145 FE 146 >UniRef50_B9KH25 Putative uncharacterized protein n=3 Tax=Anaplasma marginale RepID=B9KH25_ANAMF Length = 126 Score = 95.5 bits (237), Expect = 5e-19, Method: Composition-based stats. Identities = 31/129 (24%), Positives = 58/129 (44%), Gaps = 8/129 (6%) Query: 1 MATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTT 60 M T +R+ R L G A E + + + + GEIDLI++ GR Sbjct: 1 MCTSKSRASKVRSL----VGYAGELVVLLLRKARLHKVLHHRYRSPLGEIDLIVQNGREL 56 Query: 61 IFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-E 119 F+EV+ ++ + VT+ ++ +++TA+ +L+R+ F F+V F+ Sbjct: 57 HFIEVKTSMTSRFHEVP--VTKKQRRSVVRTAQYFLSRNP-QFSEHQISFEVYCFSPKSG 113 Query: 120 VEWIKDAFN 128 V +A+ Sbjct: 114 VTRFVNAWQ 122 >UniRef50_A7ZB75 UPF0102 protein Ccon26_01140 n=23 Tax=Epsilonproteobacteria RepID=Y114_CAMC1 Length = 113 Score = 94.7 bits (235), Expect = 7e-19, Method: Composition-based stats. Identities = 24/114 (21%), Positives = 48/114 (42%), Gaps = 6/114 (5%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMR-EGRTTIFVEVRYRRSA 71 L G + E +A +L G + N + + GEID+I + F+EV+ Sbjct: 2 GLKEYLFGKSSEDRACEFLRKLGFVILERNFHSKFGEIDIIALSSDKILHFIEVKATSGG 61 Query: 72 LYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 A + ++K K+L+T ++ ++ D + D++ E I++ Sbjct: 62 Y--EAEYRLNKAKYMKILKTINFYMMKN---EPNRDYQLDLLVVKNENFELIEN 110 >UniRef50_C2M936 Putative uncharacterized protein n=1 Tax=Porphyromonas uenonis 60-3 RepID=C2M936_9PORP Length = 136 Score = 94.7 bits (235), Expect = 7e-19, Method: Composition-based stats. Identities = 29/120 (24%), Positives = 54/120 (45%), Gaps = 9/120 (7%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 + G A E A+R+L + +R + N + EID+I +GR + VEV+ R Sbjct: 4 ANELGAAGERAAQRYLLSRHIRLLEINWRDPLCEIDIIASDGRHLLIVEVKSRMEYTATS 63 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVV------AFTGNEVEWIKDAFND 129 +V ++K H+++ + R + R+DV+ A E+++ K F+ Sbjct: 64 PLDAVDQAKAHQMMLGGMRYAQRMRINL---PIRYDVIEALYCPASDLFEIKYHKGYFSA 120 >UniRef50_B3CRA6 Putative uncharacterized protein n=2 Tax=Orientia tsutsugamushi RepID=B3CRA6_ORITI Length = 112 Score = 94.4 bits (234), Expect = 1e-18, Method: Composition-based stats. Identities = 28/114 (24%), Positives = 50/114 (43%), Gaps = 5/114 (4%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 ++ G E + FIA + GEID+I +G+ +F+EV+ RRS Sbjct: 3 SSYNLGVLAEWLIIARYSVRLYSFIAHRMRNSAGEIDIICTKGQVIVFIEVKARRSNFDN 62 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVE-WIKDAF 127 + K+ ++A L+L +N + D RFD+ + I++A+ Sbjct: 63 TIC---NYQQITKIRKSAELYLY-YNRQYSNFDVRFDLAIVRPMQWPLIIENAW 112 >UniRef50_B1ZZB1 Putative uncharacterized protein n=2 Tax=Opitutaceae RepID=B1ZZB1_OPITP Length = 141 Score = 94.0 bits (233), Expect = 1e-18, Method: Composition-based stats. Identities = 30/118 (25%), Positives = 48/118 (40%), Gaps = 12/118 (10%) Query: 18 QTGDAWEAQARRWLEG-KGLRFIAANVNERGG---EIDLIMREGRTTIFVEVRYRRSALY 73 G A E A WL+ +G R +A N E+DL+ R+ +FVEV+ R + Sbjct: 16 DAGAAGERLAAAWLQRERGFRVVARNWRNPRDRREELDLVCRDREVLVFVEVKSRAANAL 75 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE------VEWIKD 125 +V + K+ L + + +LAR + R DVV V ++ Sbjct: 76 VPGYYAVDKRKKRVLGRAIKAYLAR--LTAKPATFRLDVVEIAEGGGDAEPTVRHFEN 131 >UniRef50_A5V3S4 UPF0102 protein Swit_0572 n=4 Tax=Sphingomonadaceae RepID=Y572_SPHWW Length = 118 Score = 93.6 bits (232), Expect = 2e-18, Method: Composition-based stats. Identities = 32/119 (26%), Positives = 51/119 (42%), Gaps = 5/119 (4%) Query: 11 PRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRS 70 R+ ++ G E A WL G R + + V R GE+DLI R GRT FVEV+ R Sbjct: 2 NRRAAAERQGRTGERIAAWWLRLHGWRIVGSRVKTRRGEVDLIARRGRTLAFVEVKTR-- 59 Query: 71 ALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN-EVEWIKDAFN 128 G A ++ + ++ A L R+ + R DV+ + + ++ Sbjct: 60 GDAAGLATAIDEYRLRRVAAAAEALLPRYGVGVEN--VRIDVMLVRPWRRPVHLTNVWH 116 >UniRef50_Q7V7V8 UPF0102 protein PMT_0624 n=2 Tax=Prochlorococcus marinus RepID=Y624_PROMM Length = 126 Score = 93.6 bits (232), Expect = 2e-18, Method: Composition-based stats. Identities = 23/93 (24%), Positives = 45/93 (48%), Gaps = 1/93 (1%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + + G E + R L+ +G + ++ + R GE+DL++ + + + VEV+ RRS Sbjct: 2 MDSTGMGCWGEERVLRLLQKRGWQLVSQRWSCRYGELDLVVEKQQRVLVVEVKSRRSRGL 61 Query: 74 GG-AAASVTRSKQHKLLQTARLWLARHNGSFDT 105 + + KQ +L++ WLA H + Sbjct: 62 DHWGLCAFNKGKQLRLMRAIGCWLATHPYFAEH 94 >UniRef50_A8ERF6 UPF0102 protein Abu_0255 n=2 Tax=Campylobacterales RepID=Y255_ARCB4 Length = 110 Score = 93.2 bits (231), Expect = 2e-18, Method: Composition-based stats. Identities = 25/111 (22%), Positives = 55/111 (49%), Gaps = 6/111 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAAN-VNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 +K+ GD E +A +LE + N ++ GEID+I + + F+EV+ + Y Sbjct: 2 SKEKGDIAEKKAISFLEKSNFEIVEKNFYAKKLGEIDIIAQRNKIYHFIEVK--SANDYE 59 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 A ++T K K+ ++ ++ ++N + DV+ +++E +++ Sbjct: 60 TAINNITSQKLSKIKRSVDFYIQKNNLNISYS---IDVIIVVDDKIELLEN 107 >UniRef50_C1CZ90 UPF0102 protein Deide_03080 n=3 Tax=Deinococcus RepID=Y3080_DEIDV Length = 114 Score = 93.2 bits (231), Expect = 2e-18, Method: Composition-based stats. Identities = 37/90 (41%), Positives = 50/90 (55%), Gaps = 2/90 (2%) Query: 30 WLEGKGLRFIAANVNERGGEIDLIMREG-RTTIFVEVRYRRSALYGGAAASVTRSKQHKL 88 L+G G + N RGGEIDL+ RE T +F EVR RR+ +G AA SVT K + Sbjct: 13 HLQGLGRELLQRNYRMRGGEIDLVTREPCGTLVFTEVRQRRTRRHGSAAESVTSRKLALM 72 Query: 89 LQTARLWLARHNGSFDTVDCRFDVVAFTGN 118 + A+ +L R +G D + CR +VV G Sbjct: 73 HRAAQSYLIREHGR-DDLPCRLEVVTIDGP 101 >UniRef50_C0W187 Putative uncharacterized protein n=1 Tax=Actinomyces coleocanis DSM 15436 RepID=C0W187_9ACTO Length = 117 Score = 92.8 bits (230), Expect = 3e-18, Method: Composition-based stats. Identities = 31/108 (28%), Positives = 47/108 (43%), Gaps = 6/108 (5%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 ++ G E A +LE G + + NV +G EID+I E +FVEVR R + +G Sbjct: 4 NKQRVGKLGEDLAAEYLESLGWKILERNVTYKGAEIDIIALEDDVVVFVEVRTRTTDDWG 63 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEW 122 A S+T K L WL + + R D+V ++ Sbjct: 64 SALESLTPKKLASLRSGVVRWLLNQD---EYCKARIDMVTV---KLNH 105 >UniRef50_A4E8G7 Putative uncharacterized protein n=1 Tax=Collinsella aerofaciens ATCC 25986 RepID=A4E8G7_9ACTN Length = 220 Score = 92.4 bits (229), Expect = 4e-18, Method: Composition-based stats. Identities = 30/136 (22%), Positives = 47/136 (34%), Gaps = 14/136 (10%) Query: 3 TVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERG--GEIDLIMRE-GRT 59 T R+ P++ + R +LE KG + G IDL+ + T Sbjct: 85 TPQGRASEPKEQDMNDMKEKAMGAVRAFLERKGYEIVDEAWQGPEGIGGIDLVAVDEDGT 144 Query: 60 TIFVEVRYRRSALYGGAAASVTRSKQHKLLQT-ARLWLARHNGSFDTVDCRFDVVA---F 115 +FV+ R G A + L + A WLA + + RFD VA Sbjct: 145 LVFVDATVRIGTD-GFPEA----HRARGLREALAARWLAGNGDDYADTPVRFDEVAMMVV 199 Query: 116 TGNE--VEWIKDAFND 129 N + + F + Sbjct: 200 KENRALLRHHINCFGE 215 >UniRef50_A7BDE4 Putative uncharacterized protein n=1 Tax=Actinomyces odontolyticus ATCC 17982 RepID=A7BDE4_9ACTO Length = 146 Score = 92.0 bits (228), Expect = 5e-18, Method: Composition-based stats. Identities = 38/112 (33%), Positives = 48/112 (42%), Gaps = 8/112 (7%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVN-ERGGEIDLIMREG-----RTTIFVEVR 66 + + G A E AR LE +GLR + N R GE+D+I R+ T+ VEVR Sbjct: 6 RPDRRAIGAAGEYTARLALEEEGLRLLDTNWRDGRRGELDIIARDETDPSRSWTVIVEVR 65 Query: 67 YRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGN 118 R G A ASV K +L W H R DVVA T + Sbjct: 66 TRVGRRKGSALASVDHRKVARLRALTGAWCRAHGHLASR--VRIDVVAITVD 115 >UniRef50_A4WPR4 UPF0102 protein Rsph17025_0472 n=7 Tax=Rhodobacterales RepID=Y472_RHOS5 Length = 117 Score = 91.7 bits (227), Expect = 6e-18, Method: Composition-based stats. Identities = 34/115 (29%), Positives = 50/115 (43%), Gaps = 4/115 (3%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 + + G E R E A GEIDLI R+G IF+EV+ +S + Sbjct: 6 SHRAGFVAEEAVARIYERADRPVTARRWRGAAGEIDLIARDGAEVIFIEVK--KSKSHAA 63 Query: 76 AAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG-NEVEWIKDAFND 129 AAA ++R + ++ A +LA RFDV G +E I++AF Sbjct: 64 AAARLSRRQMERIYGAASEFLAGEPL-GQLTASRFDVALVDGMGRIEIIENAFAA 117 >UniRef50_B8GXN3 UPF0102 protein CCNA_00142 n=4 Tax=Caulobacteraceae RepID=Y142_CAUCN Length = 125 Score = 90.9 bits (225), Expect = 1e-17, Method: Composition-based stats. Identities = 31/126 (24%), Positives = 51/126 (40%), Gaps = 4/126 (3%) Query: 4 VPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFV 63 + R ++ G E A WL KG R + + GEIDL+ + G+ V Sbjct: 1 MAAGVRQSRGTAARKVGRRAEVIAALWLMAKGYRILGFRLATPLGEIDLLAQRGKVLAVV 60 Query: 64 EVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-VEW 122 EV+ R A +V +++ +L + A LA H + R D++A Sbjct: 61 EVKQR--TTIEDALDAVKPTQRERLRRAATH-LAAHRAGLRDLLVRLDLIAMAPGRPPRH 117 Query: 123 IKDAFN 128 + DA+ Sbjct: 118 LPDAWG 123 >UniRef50_Q28TZ5 Putative uncharacterized protein n=1 Tax=Jannaschia sp. CCS1 RepID=Q28TZ5_JANSC Length = 100 Score = 90.1 bits (223), Expect = 2e-17, Method: Composition-based stats. Identities = 28/99 (28%), Positives = 46/99 (46%), Gaps = 4/99 (4%) Query: 29 RWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAASVTRSKQHKL 88 R G R ++ GEIDL+M + IFVEV+ R+ + AA +++ + +L Sbjct: 2 RAYLDHGHRLVSRRWRGPAGEIDLVMEKDGEVIFVEVKASRT--HARAAEALSNRQIARL 59 Query: 89 LQTARLWLARHNGSFDTVDCRFDVVAFTG-NEVEWIKDA 126 L++A L RFDV G +++ I +A Sbjct: 60 LRSAEHCLGSFPKGLA-TPMRFDVALVDGQGQLDVIVNA 97 >UniRef50_Q2GDU7 Putative uncharacterized protein n=1 Tax=Neorickettsia sennetsu str. Miyayama RepID=Q2GDU7_NEOSM Length = 116 Score = 90.1 bits (223), Expect = 2e-17, Method: Composition-based stats. Identities = 21/105 (20%), Positives = 37/105 (35%), Gaps = 3/105 (2%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAA 78 G E L KG + E+DL+ + +FVEV++R S Sbjct: 13 VGRLAEMIVALHLSIKGYMLLCRRYRNPHCELDLVCIKHGVLLFVEVKFRSSLQ--VLET 70 Query: 79 SVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWI 123 V S+ K+ + + + + F V T + ++ I Sbjct: 71 MVDYSRMEKMYPASESFCSEFQLYYCLERV-FKVFLVTPSVIQVI 114 >UniRef50_A5GTL0 Restriction endonuclease-like n=1 Tax=Synechococcus sp. RCC307 RepID=A5GTL0_SYNR3 Length = 121 Score = 90.1 bits (223), Expect = 2e-17, Method: Composition-based stats. Identities = 28/107 (26%), Positives = 44/107 (41%), Gaps = 3/107 (2%) Query: 20 GDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG-AAA 78 G EAQ L + R + N R GE+DL++ + + + VEV+ RR Sbjct: 11 GAEAEAQVAVLLCRRHWRLLDCNWCCRWGELDLVLAKPQRLLLVEVKARRRWGLDHGGLL 70 Query: 79 SVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG-NEVEWIK 124 + K+ +L + R WLA H S+ + G +V W Sbjct: 71 ACGPRKRCRLARALRCWLAAHP-SYAFHSIEAHLALVDGEGQVRWFP 116 >UniRef50_Q7NEX4 UPF0102 protein gll3754 n=1 Tax=Gloeobacter violaceus RepID=Y3754_GLOVI Length = 126 Score = 90.1 bits (223), Expect = 2e-17, Method: Composition-based stats. Identities = 39/121 (32%), Positives = 50/121 (41%), Gaps = 8/121 (6%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 + E L +G +A RGGEIDL++R G FVEV+ R + Sbjct: 3 RRHRFALQAEIWVADHLAAQGGLVLARRWRCRGGEIDLVVRLGGVLCFVEVKARGGNSWD 62 Query: 75 GAA-ASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFT---GNEVE---WIKDAF 127 A +V KQ +LL A L+LA H CRFDV G V +I AF Sbjct: 63 SAGWEAVGAVKQRRLLLAAALFLAAHP-ELARSVCRFDVALVGRDPGGGVRLVAYIAGAF 121 Query: 128 N 128 Sbjct: 122 E 122 >UniRef50_B4CV56 Putative uncharacterized protein n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4CV56_9BACT Length = 89 Score = 89.7 bits (222), Expect = 2e-17, Method: Composition-based stats. Identities = 31/85 (36%), Positives = 43/85 (50%), Gaps = 5/85 (5%) Query: 50 IDLIMREGRTTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCR 109 +D++ R+ T +FVEV+ RRS +G A SVTR KQ + + A WL + D + R Sbjct: 1 MDIVCRDHDTLVFVEVKTRRSLTFGSPAESVTREKQKLIARGALAWLDLLG-NPDNILFR 59 Query: 110 FDVVAF----TGNEVEWIKDAFNDH 130 FD+V IKDAF Sbjct: 60 FDIVEIIFEEDVPTFHIIKDAFKLP 84 >UniRef50_Q3J5H3 UPF0102 protein RHOS4_03930 n=7 Tax=Rhodobacteraceae RepID=Y393_RHOS4 Length = 117 Score = 89.3 bits (221), Expect = 3e-17, Method: Composition-based stats. Identities = 35/111 (31%), Positives = 48/111 (43%), Gaps = 4/111 (3%) Query: 20 GDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAAS 79 G E R + G A GEIDLI REG IF+EV+ +S + AAA Sbjct: 10 GQTAEEAVARIYDRSGRPVAARRWRGVSGEIDLIAREGAEVIFIEVK--KSTSHAAAAAR 67 Query: 80 VTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG-NEVEWIKDAFND 129 ++R + ++ A +LA RFDV VE I++AF Sbjct: 68 LSRRQMDRIYGAASEFLAGEP-RGQLTASRFDVALVDALGRVEIIENAFAA 117 >UniRef50_C6V4V5 Putative uncharacterized protein n=1 Tax=Neorickettsia risticii str. Illinois RepID=C6V4V5_NEORI Length = 117 Score = 89.3 bits (221), Expect = 4e-17, Method: Composition-based stats. Identities = 19/105 (18%), Positives = 37/105 (35%), Gaps = 3/105 (2%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAA 78 G E L KG + E+DL+ + +F EV++R S Sbjct: 14 VGRLAEMIVALHLSIKGYMLLCRRYRNPHCELDLVCIKSGVLLFAEVKFRSSLQ--AVET 71 Query: 79 SVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWI 123 +V S+ ++ + + + + F V T + ++ I Sbjct: 72 TVDYSRMERMYPASESFCSEFQLYYYLERI-FKVFLITPSVIQVI 115 >UniRef50_A2VU88 Putative uncharacterized protein n=1 Tax=Burkholderia cenocepacia PC184 RepID=A2VU88_9BURK Length = 132 Score = 88.6 bits (219), Expect = 6e-17, Method: Composition-based stats. Identities = 48/129 (37%), Positives = 62/129 (48%), Gaps = 30/129 (23%) Query: 3 TVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE-GRTTI 61 P+ +K G A+E +AR++LE GL F+AANV RGGE+DL+MRE + Sbjct: 31 RPPSGDNFSGAARSKPVGAAFEQRARQFLERHGLGFVAANVTMRGGELDLVMREPDGMLV 90 Query: 62 FVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVE 121 FVEVR RRS +G AA CRFDVVAF + Sbjct: 91 FVEVRARRSTRHGAGAA-----------------------------CRFDVVAFEAGRLA 121 Query: 122 WIKDAFNDH 130 W++DAF Sbjct: 122 WLRDAFRTD 130 >UniRef50_B0S8P6 Endonuclease n=2 Tax=Leptospira biflexa serovar Patoc RepID=B0S8P6_LEPBA Length = 114 Score = 88.6 bits (219), Expect = 6e-17, Method: Composition-based stats. Identities = 19/106 (17%), Positives = 43/106 (40%), Gaps = 1/106 (0%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALY 73 + G E A +L+ + +N ++ GEID+I + T EV+ Sbjct: 1 MKKGTIGKKGEEFASFYLQSLEHTILFSNYRKKIGEIDIISIKNDTLHCSEVKTWNERFG 60 Query: 74 GGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE 119 + +K+ ++ + L+L + +F + F+++ T + Sbjct: 61 FHPKECLHATKRARMRKV-YLYLLQEIPAFYHLTPSFNLIHITEKK 105 >UniRef50_C4YXH4 Protein Mlr4633 n=1 Tax=Rickettsia endosymbiont of Ixodes scapularis RepID=C4YXH4_9RICK Length = 111 Score = 88.2 bits (218), Expect = 7e-17, Method: Composition-based stats. Identities = 22/98 (22%), Positives = 48/98 (48%), Gaps = 5/98 (5%) Query: 31 LEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAASVTRSKQHKLLQ 90 + K + + GEID+I + +F+EV+ R S + V+ ++Q ++ + Sbjct: 18 YKLKFYQILHHRKRYYVGEIDIIALCNKEIVFIEVKARSSKIDDR---FVSFNQQRRITR 74 Query: 91 TARLWLARHNGSFDTVDCRFDVVAFTGNEVE-WIKDAF 127 A ++L+ + + + RFD+V ++ IK+A+ Sbjct: 75 AAEMFLSSN-SKYRNYNIRFDLVIIRSYKLPIIIKNAW 111 >UniRef50_C8WWC7 Putative uncharacterized protein n=1 Tax=Alicyclobacillus acidocaldarius subsp. acidocaldarius DSM 446 RepID=C8WWC7_ALIAD Length = 120 Score = 87.8 bits (217), Expect = 1e-16, Method: Composition-based stats. Identities = 22/78 (28%), Positives = 38/78 (48%), Gaps = 1/78 (1%) Query: 13 QLTTKQTGDAWEAQARRWLEGK-GLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSA 71 ++ ++ G E+ ++LE G R I N R GE+DLI + VEV+ R S Sbjct: 2 RVNRRELGTLGESFVGQYLERCLGWRVIEKNWRTRFGELDLIAENEDELVAVEVKTRTSP 61 Query: 72 LYGGAAASVTRSKQHKLL 89 + G ++ ++ KL+ Sbjct: 62 IDGDPIYALRPAQIPKLV 79 >UniRef50_C7N7P3 Predicted endonuclease related to Holliday junction resolvase n=2 Tax=Coriobacteriaceae RepID=C7N7P3_SLAHD Length = 117 Score = 87.0 bits (215), Expect = 2e-16, Method: Composition-based stats. Identities = 19/115 (16%), Positives = 37/115 (32%), Gaps = 11/115 (9%) Query: 21 DAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE-GRTTIFVEVRYRRSALYGGAAAS 79 + RR+LE KG + +D I + +F++ +A G + Sbjct: 6 QRAKQGVRRYLELKGYEILEDGWCHGRDSVDFIATDEDDALVFIDCEVSENAGEGIPEEA 65 Query: 80 VTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-----VEWIKDAFND 129 R ++ A +LA + R+D+V + +A Sbjct: 66 PDRKAFERI---AAAYLAE--ADLSNTEVRYDIVGVLILGESRALIRHHINAITP 115 >UniRef50_Q1GJI4 UPF0102 protein TM1040_0449 n=9 Tax=Rhodobacteraceae RepID=Y449_SILST Length = 124 Score = 86.7 bits (214), Expect = 2e-16, Method: Composition-based stats. Identities = 31/100 (31%), Positives = 47/100 (47%), Gaps = 4/100 (4%) Query: 31 LEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAASVTRSKQHKLLQ 90 +GL + + GEIDLI+R+G T IF EV+ S AAA + ++ ++ Sbjct: 27 YLARGLTLVKSRWRGPHGEIDLILRDGETVIFAEVK--SSTTRDKAAARIKPAQMQRVFN 84 Query: 91 TARLWLARHNGSFDTVDCRFDVVAFTG-NEVEWIKDAFND 129 +A +L R DVV G EVE I++A+ Sbjct: 85 SAGAFLEGEPL-GQLTPARLDVVLVWGAGEVEIIENAYGH 123 >UniRef50_A5G0S4 UPF0102 protein Acry_2261 n=1 Tax=Acidiphilium cryptum JF-5 RepID=Y2261_ACICJ Length = 128 Score = 86.3 bits (213), Expect = 2e-16, Method: Composition-based stats. Identities = 33/112 (29%), Positives = 54/112 (48%), Gaps = 3/112 (2%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAA 78 G E + W +G +A + GE+DL++ + T +FVEV+ R + A Sbjct: 18 RGRDAERRVAGWYAAQGFVVLAQRLRTAAGELDLVVADRTTLVFVEVKARNALR--SAIE 75 Query: 79 SVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAFNDH 130 SV ++ +L+ A + LA + + RFDVV G++V I+DAF Sbjct: 76 SVAPRQRRRLVAAAAIVLAGQP-DWGRAETRFDVVLLVGDDVHAIRDAFRAD 126 >UniRef50_A5KGE0 Putative uncharacterized protein n=1 Tax=Campylobacter jejuni subsp. jejuni CG8486 RepID=A5KGE0_CAMJE Length = 84 Score = 85.5 bits (211), Expect = 5e-16, Method: Composition-based stats. Identities = 17/68 (25%), Positives = 34/68 (50%), Gaps = 2/68 (2%) Query: 20 GDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAAS 79 G E +A ++L+ +G + N + + GEID+I ++ F+EV++ ++ Sbjct: 9 GILGEDKACKFLKKQGFEILKRNFHSKFGEIDIIAKKDEILHFIEVKFTQNDYEVS--ER 66 Query: 80 VTRSKQHK 87 + R K K Sbjct: 67 LDRKKLRK 74 >UniRef50_A5GLH9 Restriction endonuclease-like n=5 Tax=Chroococcales RepID=A5GLH9_SYNPW Length = 142 Score = 85.1 bits (210), Expect = 7e-16, Method: Composition-based stats. Identities = 38/136 (27%), Positives = 57/136 (41%), Gaps = 15/136 (11%) Query: 1 MATVPTRSG-SPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRT 59 M R +LTT TG EA+A R L+G+G + + R GEIDL++ + Sbjct: 1 MLREQERHQWVNAKLTTATTGLWAEAKALRLLQGRGWTLLEKRWSCRYGEIDLLLCKANA 60 Query: 60 ----TIFVEVR-YRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVA 114 + VEV+ RR G A+ K+ +L T W+A + C+ +VV Sbjct: 61 PVPRLLAVEVKGRRRCGPDGWGLAAFDARKRQRLALTLNYWIALNP---RHACCQLEVVL 117 Query: 115 FTGN------EVEWIK 124 V W+K Sbjct: 118 ALVPLPPNHRPVRWLK 133 >UniRef50_A8LJ68 UPF0102 protein Dshi_2830 n=1 Tax=Dinoroseobacter shibae DFL 12 RepID=Y2830_DINSH Length = 134 Score = 84.7 bits (209), Expect = 9e-16, Method: Composition-based stats. Identities = 34/121 (28%), Positives = 55/121 (45%), Gaps = 4/121 (3%) Query: 6 TRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEV 65 R+ R +G A EA+ R +G +A GGE+DLI+R G +FVEV Sbjct: 14 ARARQARGTRAMLSGAAAEARVERAYRDRGCDVLATRWRGSGGEVDLIVRRGDLLVFVEV 73 Query: 66 RYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTG-NEVEWIK 124 + SA Y A S++ ++ ++ TA +L R ++ RFD+ G + Sbjct: 74 K--SSASYTRAIESLSLAQLTRIQNTALEFLDRSP-DLAGLEMRFDLAVVEGSGRFRVLA 130 Query: 125 D 125 + Sbjct: 131 N 131 >UniRef50_D1ATK0 Putative uncharacterized protein n=1 Tax=Anaplasma centrale str. Israel RepID=D1ATK0_ANACI Length = 151 Score = 82.8 bits (204), Expect = 4e-15, Method: Composition-based stats. Identities = 21/89 (23%), Positives = 43/89 (48%), Gaps = 4/89 (4%) Query: 40 AANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARH 99 GEIDLI++ GR F+EV+ ++ + VT ++ +++TA+ +L+RH Sbjct: 66 HHRYRSPLGEIDLIVQNGRELYFIEVKTSMTSRFREVP--VTGKQRRSIVRTAQYFLSRH 123 Query: 100 NGSFDTVDCRFDVVAFTGN-EVEWIKDAF 127 ++ F+V + + +A+ Sbjct: 124 PQFYEH-QISFEVYCISPRSGITRFVNAW 151 >UniRef50_B1M445 UPF0102 protein Mrad2831_2938 n=10 Tax=Alphaproteobacteria RepID=Y2938_METRJ Length = 129 Score = 82.4 bits (203), Expect = 4e-15, Method: Composition-based stats. Identities = 33/120 (27%), Positives = 51/120 (42%), Gaps = 4/120 (3%) Query: 9 GSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYR 68 G+ R+ + G E A L KG I V+ GGEIDL++R T +FVEV+ R Sbjct: 8 GADRRRAAYRFGHRAEWLALAALMLKGYWPIGRRVSVAGGEIDLVVRRWNTVVFVEVKAR 67 Query: 69 RSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVE-WIKDAF 127 A A ++ +K+ + + R W+ R+ R D V ++ F Sbjct: 68 --AKRDDAREAIDGAKRRRFSRAVRAWIGRNAWC-AGATFRADAVFVGHWAWPAHVERVF 124 >UniRef50_A9GEX7 UPF0102 protein sce2912 n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=Y2912_SORC5 Length = 144 Score = 80.1 bits (197), Expect = 2e-14, Method: Composition-based stats. Identities = 33/128 (25%), Positives = 51/128 (39%), Gaps = 6/128 (4%) Query: 6 TRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEV 65 +G+P + G E L +G+ +A N EID++ R+G +EV Sbjct: 17 GSAGAPAADARRALGARAEDAVVAHLAAQGVEIVARNARVGRLEIDVVARDGPVIAIIEV 76 Query: 66 RYRRSALYGGAAASVTRSKQHKLLQTARL-WLARHNGSFDTVDCRFDVVAFTG-----NE 119 R R + Y A S+ K+ ++ + W A + RFD + T Sbjct: 77 RTRGAGSYVRALDSIDARKRARVRRAGERLWRATFSRVRGVERMRFDAASVTFLPSGEAT 136 Query: 120 VEWIKDAF 127 VE IK AF Sbjct: 137 VEIIKAAF 144 >UniRef50_B6IVS9 Putative uncharacterized protein n=1 Tax=Rhodospirillum centenum SW RepID=B6IVS9_RHOCS Length = 126 Score = 80.1 bits (197), Expect = 2e-14, Method: Composition-based stats. Identities = 28/123 (22%), Positives = 49/123 (39%), Gaps = 5/123 (4%) Query: 4 VPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFV 63 P RS + + T ++ G E R L KG R +A + GE+D++ V Sbjct: 2 APVRSRTDYR-TAERLGRRAEWLCRLALLLKGYRILATRLRTPAGEVDILAERRGLLAVV 60 Query: 64 EVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEV-EW 122 EV+ R A A+VT + ++ + A+ A R+D++ Sbjct: 61 EVKARP--GLEAARAAVTEADWRRIARAAQG-YAAARPRLAGHAIRYDLMVVLPGRWPVH 117 Query: 123 IKD 125 ++D Sbjct: 118 LED 120 >UniRef50_C8WHA2 Putative uncharacterized protein n=1 Tax=Eggerthella lenta DSM 2243 RepID=C8WHA2_EGGLE Length = 123 Score = 79.3 bits (195), Expect = 3e-14, Method: Composition-based stats. Identities = 26/114 (22%), Positives = 48/114 (42%), Gaps = 13/114 (11%) Query: 25 AQARRWLEGKGLRFIAANVNER--GGEIDLIMREG--RTTIFVEVRYRRSALYGGAAASV 80 A R+LE +G +A G IDL+ R+ +FV+V R ++ G Sbjct: 13 EAAARFLEVRGYETLATGWKSPETRGTIDLVARDPESDDLVFVDVSARPNSGAGFGD--- 69 Query: 81 TRSKQHKLLQTARLWLARHNGSFDTVDCRFD---VVAFTGNE--VEWIKDAFND 129 R+ + + A WL ++ ++V RFD ++ + + +AF + Sbjct: 70 GRNDRETMELLAVSWLVENDF-AESVGVRFDKISMIVVGEDRALLRHHINAFGE 122 >UniRef50_B3DVH2 Predicted endonuclease n=2 Tax=Verrucomicrobia RepID=B3DVH2_METI4 Length = 78 Score = 79.3 bits (195), Expect = 3e-14, Method: Composition-based stats. Identities = 22/75 (29%), Positives = 31/75 (41%), Gaps = 7/75 (9%) Query: 59 TTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAF--- 115 +FVEV+ R S YG +V K+ L+ A +L V RFDVV Sbjct: 1 MLVFVEVKTRSSIQYGFPYEAVDAQKKRNLIAAAHAYLKL--LKNPVVAYRFDVVEVLFF 58 Query: 116 --TGNEVEWIKDAFN 128 T ++ +AF Sbjct: 59 KGTRPKITHYPNAFG 73 >UniRef50_C0W7A2 Putative uncharacterized protein (Fragment) n=1 Tax=Actinomyces urogenitalis DSM 15434 RepID=C0W7A2_9ACTO Length = 125 Score = 78.9 bits (194), Expect = 5e-14, Method: Composition-based stats. Identities = 21/79 (26%), Positives = 29/79 (36%), Gaps = 11/79 (13%) Query: 3 TVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNER-----GGEIDLIMREG 57 PT +QTG E A +L G + N R GEID++ E Sbjct: 44 QEPTGPQPFPSGDRRQTGRRGEDLAAAYLTDLGWTVLERNWRPRGLAGLRGEIDIVASEP 103 Query: 58 R------TTIFVEVRYRRS 70 T + VEV+ R + Sbjct: 104 SASAGRPTLVVVEVKTRST 122 >UniRef50_UPI000190D97F hypothetical protein SentesTyp_00923 n=1 Tax=Salmonella enterica subsp. enterica serovar Typhi str. E98-2068 RepID=UPI000190D97F Length = 82 Score = 78.6 bits (193), Expect = 7e-14, Method: Composition-based stats. Identities = 48/55 (87%), Positives = 51/55 (92%) Query: 32 EGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAASVTRSKQH 86 E KGLRFIAANV ERGGEIDLIMR+G+TT+FVEVRYRRS LYGGAAASVTRSKQ Sbjct: 2 ESKGLRFIAANVRERGGEIDLIMRDGKTTVFVEVRYRRSGLYGGAAASVTRSKQQ 56 >UniRef50_Q0I9S1 Uncharacterised protein family protein n=1 Tax=Synechococcus sp. CC9311 RepID=Q0I9S1_SYNS3 Length = 144 Score = 77.8 bits (191), Expect = 9e-14, Method: Composition-based stats. Identities = 24/95 (25%), Positives = 40/95 (42%), Gaps = 5/95 (5%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREG----RTTIFVEVRYRRS 70 ++ G E + L G R + N + R GEIDL+ + + VEV+ R Sbjct: 14 NSQALGAQAELYVKEVLLRHGWRLLEHNWSCRYGEIDLLFTKQSFPASRILVVEVKARHR 73 Query: 71 AL-YGGAAASVTRSKQHKLLQTARLWLARHNGSFD 104 + G A+ ++K+ L +T W A + S Sbjct: 74 SGLDGWGVAAFHQAKRRCLARTVECWRAANAWSEA 108 >UniRef50_B9XLY1 Putative uncharacterized protein n=1 Tax=bacterium Ellin514 RepID=B9XLY1_9BACT Length = 101 Score = 77.4 bits (190), Expect = 1e-13, Method: Composition-based stats. Identities = 24/61 (39%), Positives = 34/61 (55%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGG 75 Q G+ E A+++L +GL+F AN GEIDLI R+G +FVEV+ R S + Sbjct: 19 HLQHGELGERAAKKYLRKQGLKFFTANFKSDRGEIDLIFRDGDGLVFVEVKTRSSVDWNL 78 Query: 76 A 76 Sbjct: 79 P 79 >UniRef50_Q1GWI7 UPF0102 protein Sala_0262 n=5 Tax=Sphingomonadales RepID=Y262_SPHAL Length = 116 Score = 72.8 bits (178), Expect = 3e-12, Method: Composition-based stats. Identities = 22/101 (21%), Positives = 40/101 (39%), Gaps = 5/101 (4%) Query: 30 WLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAASVTRSKQHKLL 89 WL G R + + GE+DL+ R GRT F+EV++R ++ + ++ Sbjct: 20 WLRLHGWRIVGQRLRVPVGEVDLVARRGRTVAFIEVKWR--DRAADLDLAIDPYRLRRVA 77 Query: 90 QTARLWLARHNGSFDTVDCRFDVVAFTGNEV-EWIKDAFND 129 A + R +D R DV+ + + + Sbjct: 78 AAAEMLAPRFARPYDD--IRIDVMLLAPRRLPRHLVHVWQP 116 >UniRef50_C0BCN9 Putative uncharacterized protein n=1 Tax=Coprococcus comes ATCC 27758 RepID=C0BCN9_9FIRM Length = 70 Score = 70.1 bits (171), Expect = 2e-11, Method: Composition-based stats. Identities = 18/63 (28%), Positives = 34/63 (53%), Gaps = 2/63 (3%) Query: 66 RYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKD 125 ++R++ A +V+ KQ +L A +L ++ + + CRFDV+ GN++ K+ Sbjct: 2 KFRKTGGLSAALEAVSVPKQMRLSGAAVYYLMKNGCTE--IPCRFDVIGIAGNKISLRKN 59 Query: 126 AFN 128 AF Sbjct: 60 AFE 62 >UniRef50_Q4E9U1 Endonuclease (Fragment) n=5 Tax=Wolbachia RepID=Q4E9U1_9RICK Length = 107 Score = 70.1 bits (171), Expect = 2e-11, Method: Composition-based stats. Identities = 16/77 (20%), Positives = 36/77 (46%), Gaps = 5/77 (6%) Query: 36 LRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLW 95 I + GEIDLI+ + + IF+EV+ ++ + ++ +++ + Sbjct: 26 YNVIKRRYRCKFGEIDLIVSKKKELIFIEVKTSLLGKEIP----ISHLQCQSIINSSKYF 81 Query: 96 LARHNGSFDTVDCRFDV 112 L+++ SF R+D+ Sbjct: 82 LSKN-LSFLDYSVRYDL 97 >UniRef50_C8W847 Putative uncharacterized protein n=1 Tax=Atopobium parvulum DSM 20469 RepID=C8W847_ATOPD Length = 117 Score = 68.9 bits (168), Expect = 5e-11, Method: Composition-based stats. Identities = 21/107 (19%), Positives = 41/107 (38%), Gaps = 11/107 (10%) Query: 24 EAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAASVTRS 83 EA + L+ + + + N ID I+ + IFV+ + + Sbjct: 11 EAISAN-LKHRDIEVLEKNWAHGSDGIDFIVMDDEELIFVDTAT-KCGGFDVPREEPD-- 66 Query: 84 KQHKLLQTARLWLARHNGSFDTVDCRFDVVA--FTGNE---VEWIKD 125 Q + + A +LA + R+D+V+ TG+E + K+ Sbjct: 67 -QERFERIAAAYLAE-SEVEGLASIRYDIVSLLVTGSEKALLRHHKN 111 >UniRef50_Q73VP8 Putative uncharacterized protein n=1 Tax=Mycobacterium avium subsp. paratuberculosis RepID=Q73VP8_MYCPA Length = 98 Score = 64.3 bits (156), Expect = 1e-09, Method: Composition-based stats. Identities = 15/41 (36%), Positives = 19/41 (46%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE 56 Q G EA A L GLR + N R GE+D+I + Sbjct: 40 RIQLGAMGEALAVDHLTRMGLRVLHRNWRCRYGELDIIACD 80 >UniRef50_A8A9D3 Putative uncharacterized protein n=1 Tax=Ignicoccus hospitalis KIN4/I RepID=A8A9D3_IGNH4 Length = 201 Score = 63.2 bits (153), Expect = 2e-09, Method: Composition-based stats. Identities = 31/115 (26%), Positives = 44/115 (38%), Gaps = 12/115 (10%) Query: 11 PRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGG----EIDLIMREGRTTIFVEVR 66 + + A+E+ R LE G + NV R G E D+I +G I VE + Sbjct: 60 EHEAASYLNWKAFESYVARALEEAGFETL-KNVRVRAGDKLAEFDVIGYDGDKVIVVECK 118 Query: 67 YRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVE 121 R SA A V + K+ + A WLA+ V VV G + Sbjct: 119 -RWSAFRRSALLKVAEEHKAKVERAA-YWLAKLGKRALPV-----VVTLRGTPIR 166 >UniRef50_C7N801 Predicted endonuclease related to Holliday junction resolvase n=1 Tax=Slackia heliotrinireducens DSM 20476 RepID=C7N801_SLAHD Length = 130 Score = 62.0 bits (150), Expect = 5e-09, Method: Composition-based stats. Identities = 21/90 (23%), Positives = 36/90 (40%), Gaps = 5/90 (5%) Query: 25 AQARRWLEG-KGLRFIAANVNERGGEIDLIMREG--RTTIFVEVRYRRSALYGGAAASVT 81 A AR +L+ KG + + + ID I + +FVE+R R Y Sbjct: 17 ALARVFLQREKGFAILKDDFSRGLDSIDFIALDDTQTVIVFVEMRLRHENSYIDKNPRSD 76 Query: 82 RSKQHKLLQTARLWLARHNGSFDTVDCRFD 111 K + A +L+++ F ++ R D Sbjct: 77 FDK-RRFEHLALAFLSKYPCLF-NLEIRAD 104 >UniRef50_Q3AKE1 Uncharacterised protein family UPF0102 n=2 Tax=Chroococcales RepID=Q3AKE1_SYNSC Length = 114 Score = 61.6 bits (149), Expect = 8e-09, Method: Composition-based stats. Identities = 18/73 (24%), Positives = 34/73 (46%), Gaps = 1/73 (1%) Query: 35 GLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG-GAAASVTRSKQHKLLQTAR 93 G R + N + R GE+DL++ + VEV+ RR + + +K+ ++ + Sbjct: 22 GWRLLDRNWHCRWGELDLVLERQLLLLVVEVKGRRMGHHDRHGLDAFHSAKRRRMARAIS 81 Query: 94 LWLARHNGSFDTV 106 W A H S + + Sbjct: 82 CWRAVHPASAEQL 94 >UniRef50_Q8W6V7 Putative uncharacterized protein n=1 Tax=Synechococcus phage P60 RepID=Q8W6V7_9CAUD Length = 99 Score = 59.7 bits (144), Expect = 3e-08, Method: Composition-based stats. Identities = 29/112 (25%), Positives = 42/112 (37%), Gaps = 21/112 (18%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG 74 +T + G + E A L G NV G ID+++ +G T ++V+ Sbjct: 3 STHKRGASAELLACAALVDAGFEVF-RNV-TPDGPIDIVVWDGETFYPIDVKRASHY--- 57 Query: 75 GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDA 126 +KQ K A L L N + CR D +E WI DA Sbjct: 58 -------VNKQGK----ATLKLPAKNNEHALILCRTD-----KDEWVWINDA 93 >UniRef50_Q8TW03 Predicted endonuclease of the RecB family n=1 Tax=Methanopyrus kandleri RepID=Q8TW03_METKA Length = 258 Score = 58.5 bits (141), Expect = 6e-08, Method: Composition-based stats. Identities = 13/57 (22%), Positives = 21/57 (36%), Gaps = 5/57 (8%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERG-----GEIDLIMREGRTTIFVEVRY 67 + G + E A L +G +A N EID++ + VEV+ Sbjct: 1 MLRRGKSAEEIAASILRKEGFEVVARNYRVELEDELVAEIDIVAEKDGERYAVEVKA 57 >UniRef50_A6DBD4 Putative uncharacterized protein (Fragment) n=1 Tax=Caminibacter mediatlanticus TB-2 RepID=A6DBD4_9PROT Length = 43 Score = 58.5 bits (141), Expect = 6e-08, Method: Composition-based stats. Identities = 14/43 (32%), Positives = 24/43 (55%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE 56 + +K G+ E +A+ +L K R + N +GGEID+I + Sbjct: 1 MNSKSKGNIAEKKAKEYLLNKKFRIVETNFYCKGGEIDIIAYK 43 >UniRef50_C5U625 Putative uncharacterized protein n=1 Tax=Methanocaldococcus infernus ME RepID=C5U625_9EURY Length = 108 Score = 56.2 bits (135), Expect = 3e-07, Method: Composition-based stats. Identities = 17/57 (29%), Positives = 26/57 (45%), Gaps = 5/57 (8%) Query: 18 QTGDAWEAQARRWLEGKGLRFIAANV-----NERGGEIDLIMREGRTTIFVEVRYRR 69 + G E +A +L+ KG + I NV + E D+I + G VEV+ R Sbjct: 2 RKGKKKEGRAANYLKEKGYKIIGRNVIKRINQHKKAEYDIIAKRGNYKYAVEVKSGR 58 >UniRef50_Q5GSW9 RecB family endonuclease n=1 Tax=Wolbachia endosymbiont strain TRS of Brugia malayi RepID=Q5GSW9_WOLTR Length = 114 Score = 54.3 bits (130), Expect = 1e-06, Method: Composition-based stats. Identities = 14/70 (20%), Positives = 32/70 (45%), Gaps = 5/70 (7%) Query: 43 VNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGS 102 + EIDLI+ + + IF+EV+ ++ + ++ +++ +L+ S Sbjct: 33 YCCKFSEIDLIVSKKKELIFIEVKASLLGEDIL----ISYLQYQSIVNSSKYFLSEK-LS 87 Query: 103 FDTVDCRFDV 112 F R+D+ Sbjct: 88 FLDYPIRYDL 97 >UniRef50_A4GJ57 Putative uncharacterized protein n=1 Tax=uncultured marine Nitrospinaceae bacterium RepID=A4GJ57_9DELT Length = 64 Score = 54.3 bits (130), Expect = 1e-06, Method: Composition-based stats. Identities = 15/61 (24%), Positives = 30/61 (49%), Gaps = 7/61 (11%) Query: 73 YGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNE-----VEWIKDAF 127 +G ++T +KQ K++Q + +L + RFDVV T + +E ++++F Sbjct: 5 FGHQFDALTPTKQKKIIQITQSFLVQKRIP--DKSMRFDVVVLTLDRPDSCKIELLENSF 62 Query: 128 N 128 Sbjct: 63 Q 63 >UniRef50_UPI0001699F06 hypothetical protein Epers_29808 n=1 Tax=Endoriftia persephone 'Hot96_1+Hot96_2' RepID=UPI0001699F06 Length = 60 Score = 53.9 bits (129), Expect = 1e-06, Method: Composition-based stats. Identities = 18/40 (45%), Positives = 30/40 (75%) Query: 54 MREGRTTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTAR 93 M++G + +FVEVRYR+S +G AA S+T +K+ KL+ ++ Sbjct: 1 MQDGNSLVFVEVRYRKSDNFGSAAESITAAKRAKLIAASQ 40 >UniRef50_O33024 UPF0102 protein ML1607 n=2 Tax=Mycobacterium leprae RepID=Y1607_MYCLE Length = 96 Score = 53.9 bits (129), Expect = 2e-06, Method: Composition-based stats. Identities = 15/47 (31%), Positives = 20/47 (42%) Query: 10 SPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMRE 56 + + +T Q E A L GLR + N R GE D+I E Sbjct: 3 THKAMTRVQLEAMGEVFAVDNLTRMGLRGLHCNWRCRYGECDVIASE 49 >UniRef50_Q5NWY8 Putative uncharacterized protein n=1 Tax=Aromatoleum aromaticum EbN1 RepID=Q5NWY8_AZOSE Length = 196 Score = 53.5 bits (128), Expect = 2e-06, Method: Composition-based stats. Identities = 27/133 (20%), Positives = 49/133 (36%), Gaps = 20/133 (15%) Query: 9 GSPRQLTTKQTGDAWE----AQARRWLEGKGLRF-IAANVNERGGEIDLIMREGRTTIFV 63 R+ +Q G E A+ LE G + ++ RGG+IDLI + +++ V Sbjct: 57 NQHRRAQVRQHGQHVEAKCGQLAKTALESDGYTVALGQRLH-RGGDIDLIATKDGSSVVV 115 Query: 64 EVRY--------RRSALYGGAAASVTRSKQHKL-LQTARLWLARHNGSFDTVDCRFDV-- 112 E++ R A V +Q + + A +WL + + + + Sbjct: 116 ELKSFRYWGARGRDDWREKKAIEQV-LRQQDTIAAKAAVIWLPMASPTLWQLLWGYSFGG 174 Query: 113 --VAFTGNEVEWI 123 VA V + Sbjct: 175 RGVAVVRGGVRHL 187 >UniRef50_Q3BT99 Putative uncharacterized protein n=2 Tax=Xanthomonas RepID=Q3BT99_XANC5 Length = 708 Score = 51.6 bits (123), Expect = 8e-06, Method: Composition-based stats. Identities = 22/89 (24%), Positives = 33/89 (37%), Gaps = 8/89 (8%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLR-FIAANVNERGGEIDLIMRE-GRTTIFVEVRYRRS 70 +LTT+Q GD EA L +G +A G IDL+ R G F E++ Sbjct: 432 RLTTRQLGDIGEAIQTHELVKQGYSDIVAIKNRSGHG-IDLVGRNPGGELEFFEIKTSAK 490 Query: 71 A----LYGGAAASVTRSKQHKLLQTARLW 95 +G V + + + W Sbjct: 491 GMAPAQHGDPEQFV-AKRLERAIDAKGHW 518 >UniRef50_C1E903 Predicted protein n=1 Tax=Micromonas sp. RCC299 RepID=C1E903_9CHLO Length = 682 Score = 50.8 bits (121), Expect = 1e-05, Method: Composition-based stats. Identities = 17/66 (25%), Positives = 24/66 (36%), Gaps = 11/66 (16%) Query: 15 TTKQTGDAWEAQARRWLEGK--GLRFIAANVNERGGE----IDLIMR--EGRTTIFVEVR 66 ++ G EA R+L + G E D+ MR +FVEV+ Sbjct: 562 DNRRVGRWGEALVYRYLLQRHPGWTVT---WVNEHAESKSFYDVKMRNVRDGRIVFVEVK 618 Query: 67 YRRSAL 72 RSA Sbjct: 619 TTRSAD 624 >UniRef50_Q6MLA1 Putative uncharacterized protein n=1 Tax=Bdellovibrio bacteriovorus RepID=Q6MLA1_BDEBA Length = 97 Score = 50.8 bits (121), Expect = 1e-05, Method: Composition-based stats. Identities = 23/100 (23%), Positives = 45/100 (45%), Gaps = 21/100 (21%) Query: 29 RWLEGKGLRFIAANVNERGGEIDLIMREGR-TTIFVEVRYRRSALYGGAAASVTRSKQHK 87 ++ + K + V E+DL+ + R T + VEV+ + + +T+ ++ + Sbjct: 3 KYYQLKCCHLLGQRVKTPFAEVDLLFKTPRQTLLMVEVKTTNLSDFQP--FRITKKQKAR 60 Query: 88 LLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAF 127 L++ A L+LA R+DV+ E+ W AF Sbjct: 61 LVR-AMLFLAA----------RWDVLV----EIHW---AF 82 >UniRef50_Q9Y9F5 Putative uncharacterized protein n=1 Tax=Aeropyrum pernix RepID=Q9Y9F5_AERPE Length = 213 Score = 47.7 bits (113), Expect = 1e-04, Method: Composition-based stats. Identities = 15/61 (24%), Positives = 25/61 (40%), Gaps = 9/61 (14%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIAANVNER-------GGEIDLIMREGRTTIFVEVR 66 + ++ E A R LE G R + + GEID++ +G + VEV+ Sbjct: 1 MAGRRAWRNSEEIAARILEKSGFRVLD--FHVPIEDGGVEVGEIDIVAEKGGSRYSVEVK 58 Query: 67 Y 67 Sbjct: 59 A 59 >UniRef50_D0CIJ2 Putative uncharacterized protein n=2 Tax=Bacteria RepID=D0CIJ2_9SYNE Length = 46 Score = 43.5 bits (102), Expect = 0.002, Method: Composition-based stats. Identities = 8/19 (42%), Positives = 12/19 (63%) Query: 35 GLRFIAANVNERGGEIDLI 53 G R + N + R GE+DL+ Sbjct: 22 GWRLLDRNWHCRWGELDLV 40 >UniRef50_Q7VH71 Putative uncharacterized protein n=1 Tax=Helicobacter hepaticus RepID=Q7VH71_HELHP Length = 154 Score = 43.1 bits (101), Expect = 0.003, Method: Composition-based stats. Identities = 20/107 (18%), Positives = 38/107 (35%), Gaps = 19/107 (17%) Query: 10 SPRQLTTKQTGDAWEAQARRWLEGKGLRF----IAANVNERGGEIDLIMREGRTTIFVEV 65 S KQ GD +E Q R + +G + + ++G ID+I +G+ + ++ Sbjct: 28 SNNARHNKQKGDKYELQIVRHYKQQGYKVYPKGLKEGRRDKG--IDIIAYKGKEALLIQC 85 Query: 66 RYRRSALYGGAAASVTRSKQHKLLQT---ARLWLARHNGSFDTVDCR 109 + + KQ L +L ++ F R Sbjct: 86 KNWERSQV----------KQEHLRIFLGDCTAYLEQNQKIFAKRSVR 122 >UniRef50_B5IHF1 ATPase n=1 Tax=Aciduliprofundum boonei T469 RepID=B5IHF1_9EURY Length = 390 Score = 42.7 bits (100), Expect = 0.003, Method: Composition-based stats. Identities = 15/95 (15%), Positives = 34/95 (35%), Gaps = 21/95 (22%) Query: 11 PRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNER-GGEIDLIMREGRTTIFVEVRYRR 69 PR++ G +E L GL + E+D I+ + +EV+ Sbjct: 283 PREMD----GLLFENYVLSELIKMGLEP--RYWRTKSKAEVDFIVERDGKIVPIEVKL-- 334 Query: 70 SALYGGAAASVTRSKQHKLLQTARLWLARHNGSFD 104 ++K K+ ++ R ++ ++ + Sbjct: 335 ------------QAKPEKVEKSMRAFIEKYEPEYA 357 >UniRef50_D1TJS5 Putative uncharacterized protein n=1 Tax=Burkholderia sp. CCGE1002 RepID=D1TJS5_9BURK Length = 341 Score = 42.7 bits (100), Expect = 0.004, Method: Composition-based stats. Identities = 11/48 (22%), Positives = 19/48 (39%), Gaps = 2/48 (4%) Query: 21 DAWEAQARRWLEGKGLRFIAANVNE--RGGEIDLIMREGRTTIFVEVR 66 +E+ L G + N+ R E+D+ R+ VEV+ Sbjct: 8 QQFESIVAELLVKLGFEKVERNIAHPARRAEVDITFRKKSELAVVEVK 55 >UniRef50_B7KKS4 Putative uncharacterized protein n=1 Tax=Cyanothece sp. PCC 7424 RepID=B7KKS4_CYAP7 Length = 678 Score = 42.0 bits (98), Expect = 0.005, Method: Composition-based stats. Identities = 14/54 (25%), Positives = 24/54 (44%), Gaps = 3/54 (5%) Query: 20 GDAWEAQARRWLEGKGLRFIAANVNE-RGGEIDLIMREGRTTIFVEVRYRRSAL 72 G+ E +A ++ + G +NV+ DL + IFVEV+ S+ Sbjct: 517 GNFGEDKAIQFYQALGYEV--SNVSNQPQKGYDLECIKDGQEIFVEVKTISSSN 568 >UniRef50_A2BIV6 Endonuclease of RecB family n=1 Tax=Hyperthermus butylicus DSM 5456 RepID=A2BIV6_HYPBU Length = 235 Score = 42.0 bits (98), Expect = 0.006, Method: Composition-based stats. Identities = 17/67 (25%), Positives = 27/67 (40%), Gaps = 6/67 (8%) Query: 14 LTTKQTGDAWEAQARRWLEGKGLRFIA--ANVNERG---GEIDLIMR-EGRTTIFVEVRY 67 ++ + A E A R LE +G + V G E+D + R VEV+ Sbjct: 1 MSGMKRWHASERIAFRLLEEQGYEILEVHKRVRIEGVEVAEVDAVARGPDGELYAVEVKA 60 Query: 68 RRSALYG 74 R ++G Sbjct: 61 GRLDVHG 67 >UniRef50_A3DLW4 Endonuclease (RecB family)-like protein n=1 Tax=Staphylothermus marinus F1 RepID=A3DLW4_STAMF Length = 236 Score = 42.0 bits (98), Expect = 0.006, Method: Composition-based stats. Identities = 16/61 (26%), Positives = 29/61 (47%), Gaps = 6/61 (9%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNER-GG----EIDLIMREG-RTTIFVEVR 66 L+ K+ + E A ++LE +G + I +V + G EID I+ + VE++ Sbjct: 2 SLSAKRKWRSSEEIALQFLEQQGFKIIDKHVKVKIEGVEVSEIDAIVEDEKGEKYAVEIK 61 Query: 67 Y 67 Sbjct: 62 A 62 >UniRef50_B4S2M3 Putative transmembrane protein n=1 Tax=Alteromonas macleodii 'Deep ecotype' RepID=B4S2M3_ALTMD Length = 252 Score = 42.0 bits (98), Expect = 0.006, Method: Composition-based stats. Identities = 17/113 (15%), Positives = 36/113 (31%), Gaps = 15/113 (13%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYG- 74 + +E +G R G IDL +R+G + V+ + ++ G Sbjct: 91 RQLHWRNFEELVAEAYRRQGYRVTEGGF-GADGGIDLELRKGDERVIVQCKQWKAQKVGV 149 Query: 75 -------GAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEV 120 G + +K ++ + + + VV G+E+ Sbjct: 150 SVVREMFGVLTASNANKV--IIICSGKFTQQAIDFASDKP----VVLIDGDEL 196 >UniRef50_A7VFU4 Putative uncharacterized protein n=6 Tax=Bacteria RepID=A7VFU4_9CLOT Length = 416 Score = 41.6 bits (97), Expect = 0.008, Method: Composition-based stats. Identities = 11/86 (12%), Positives = 25/86 (29%), Gaps = 10/86 (11%) Query: 1 MATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRF-IAANVNERGGEIDLIMREGRT 59 + R G +E L +G + + E+D + ++ T Sbjct: 301 LCDPAFRYAVNGTRNM-DFGRVYENIVYLELRRRGYEVYVGKLYKK---EVDFVAKKRDT 356 Query: 60 TIFVEVRYRRSA-----LYGGAAASV 80 I+++V S ++ Sbjct: 357 LIYIQVSDNISDETTFEREYSPLLAI 382 >UniRef50_A8MBK5 Putative uncharacterized protein n=1 Tax=Caldivirga maquilingensis IC-167 RepID=A8MBK5_CALMQ Length = 211 Score = 41.2 bits (96), Expect = 0.011, Method: Composition-based stats. Identities = 14/56 (25%), Positives = 21/56 (37%), Gaps = 6/56 (10%) Query: 20 GDAWEAQARRWLEGKGLRFIAANVNERG-----GEIDLIMREG-RTTIFVEVRYRR 69 G +E L G R + V GE+DLI+ + VEV+ + Sbjct: 4 GVRFEDYVAELLSRLGFRVMDRRVKVTSNGVEVGEVDLIVEDECGNKYSVEVKSGK 59 >UniRef50_B1L6B5 Putative uncharacterized protein n=1 Tax=Candidatus Korarchaeum cryptofilum OPF8 RepID=B1L6B5_KORCO Length = 226 Score = 41.2 bits (96), Expect = 0.011, Method: Composition-based stats. Identities = 21/105 (20%), Positives = 37/105 (35%), Gaps = 17/105 (16%) Query: 10 SPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNE---RGGEIDLIMREGRTTIFVEVR 66 SP + T+ + +E R L+ G+ I + R EID++ + ++ + Sbjct: 57 SPEKATSLLSWKDFETFCMRGLQIHGMEAI-RGLRFKNDRRYEIDVLGIGEGLILLIDCK 115 Query: 67 YRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFD 111 + G SV R K A+ + RFD Sbjct: 116 MWSTKGRSGKIESVARDHLRK----AKAF---------DEAIRFD 147 >UniRef50_D2LQ48 DUF234 DEXX-box ATPase n=3 Tax=Aciduliprofundum boonei T469 RepID=D2LQ48_9EURY Length = 462 Score = 40.8 bits (95), Expect = 0.012, Method: Composition-based stats. Identities = 12/56 (21%), Positives = 22/56 (39%), Gaps = 3/56 (5%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAAN--VNERGGEIDLIMREGRTTIFVEVRYRR 69 G +E + + + + N RG EID++ +FVE ++R Sbjct: 342 NAHMGRIFEKIVAEIIAEQ-FKPLKMGSWWNRRGDEIDIVAELENEVLFVECKWRN 396 >UniRef50_Q2FT66 Putative uncharacterized protein n=1 Tax=Methanospirillum hungatei JF-1 RepID=Q2FT66_METHJ Length = 423 Score = 40.8 bits (95), Expect = 0.013, Method: Composition-based stats. Identities = 13/53 (24%), Positives = 22/53 (41%), Gaps = 5/53 (9%) Query: 16 TKQTGDAWEAQARRWLEGKGLRFIAANVNER-GGEIDLIMREGRTTIFVEVRY 67 ++ G ++E Q L +G R GE D I R + + ++V Y Sbjct: 309 SQDIGKSFENQVYIELIRRGYEV----WYFRDKGECDFIARRPGSMLAIQVSY 357 >UniRef50_C4F8E7 Putative uncharacterized protein n=1 Tax=Collinsella intestinalis DSM 13280 RepID=C4F8E7_9ACTN Length = 115 Score = 40.8 bits (95), Expect = 0.014, Method: Composition-based stats. Identities = 19/97 (19%), Positives = 32/97 (32%), Gaps = 16/97 (16%) Query: 21 DAWEAQARRWLEGKGLRFIA-ANVNERGGEIDLIMREG-RTTIFVEVRYRRSALYGGAAA 78 D E A+ +L K L+ + G+ D I + + V V R Sbjct: 4 DIGELIAKEFLLSKDLKSVDMTGYECDEGKADAICIDESGCHVLVNVETHRKRGVEEP-- 61 Query: 79 SVTRSKQ----HKLLQTARLWLARHNGSFDTVDCRFD 111 KQ ++ + +LA H + R+D Sbjct: 62 -----KQVYNVKRMRRVLMCYLADHP---EVKAARYD 90 >UniRef50_A9KZ92 Putative uncharacterized protein n=2 Tax=Shewanella baltica RepID=A9KZ92_SHEB9 Length = 500 Score = 40.4 bits (94), Expect = 0.017, Method: Composition-based stats. Identities = 10/61 (16%), Positives = 19/61 (31%), Gaps = 5/61 (8%) Query: 9 GSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGG-----EIDLIMREGRTTIFV 63 G ++ G +E + G + N R E D++ + F+ Sbjct: 334 GKVEGAASRLRGALFEYVVAEAMRASGYNGVEINKFCRNASGVQKEADVVCFNNKDVCFI 393 Query: 64 E 64 E Sbjct: 394 E 394 >UniRef50_Q50I46 Putative holliday junction resolvase n=1 Tax=Acidianus rod-shaped virus 1 RepID=Q50I46_9VIRU Length = 115 Score = 40.4 bits (94), Expect = 0.018, Method: Composition-based stats. Identities = 15/80 (18%), Positives = 27/80 (33%), Gaps = 15/80 (18%) Query: 17 KQTGDAWEAQARRWLEGKGLRFIAANVNERGGEI------DLIMREGRTTIFVEVRYRRS 70 +G +E QA WL+ G + I D+I + +EV+ + Sbjct: 6 HNSGRYFEYQAMEWLQSHGYQTI----RIPASAAGKQPLPDIIATKNSVVYAIEVKSTSN 61 Query: 71 ALYGGAAASVTRSKQHKLLQ 90 V + + KL + Sbjct: 62 R-----LVRVDKFQIDKLYR 76 >UniRef50_A1RWI2 Putative uncharacterized protein n=1 Tax=Thermofilum pendens Hrk 5 RepID=A1RWI2_THEPD Length = 203 Score = 40.0 bits (93), Expect = 0.024, Method: Composition-based stats. Identities = 15/56 (26%), Positives = 22/56 (39%), Gaps = 5/56 (8%) Query: 19 TGDAWEAQARRWLEGKGLRFIAANVNERG-----GEIDLIMREGRTTIFVEVRYRR 69 G EA AR LE +G + E+D++ R+ VEV+ R Sbjct: 3 RGVWVEALARLILELEGFSVEQTRLRLERNGVSLAEVDILARKNGEAYAVEVKSGR 58 >UniRef50_C5A3A3 Prokaryotic ATPase, AAA superfamily n=1 Tax=Thermococcus gammatolerans EJ3 RepID=C5A3A3_THEGJ Length = 468 Score = 40.0 bits (93), Expect = 0.024, Method: Composition-based stats. Identities = 16/82 (19%), Positives = 29/82 (35%), Gaps = 21/82 (25%) Query: 19 TGDAWEAQARRWLEGKG--------LRFIAANVNERGGEIDLIMREGRT--TIFVEVRYR 68 G +E AR +L + EID++ + + +FVEV++ Sbjct: 334 LGRPFEEIAREFLIEANRKNLLPFRFTKLGRWWRRGE-EIDIVALDEGSKKALFVEVKWS 392 Query: 69 RSALYGGAAASVTRSKQHKLLQ 90 +T K K+L+ Sbjct: 393 D----------LTAGKARKVLR 404 >UniRef50_Q3BXS5 Putative uncharacterized protein n=1 Tax=Xanthomonas campestris pv. vesicatoria str. 85-10 RepID=Q3BXS5_XANC5 Length = 382 Score = 39.7 bits (92), Expect = 0.028, Method: Composition-based stats. Identities = 16/80 (20%), Positives = 28/80 (35%), Gaps = 11/80 (13%) Query: 18 QTGDAWEAQARRWLEGKGLR-FIAANVNERGGEIDLIMR-EGRTTIFVEVRYR----RSA 71 + GD EA L KG +A + G ID++ + + EV+ Sbjct: 79 EIGDIGEAFVSHDLAKKGYTDLVAIQDKQGHG-IDVVGKNQEGKWESFEVKASVQGTARR 137 Query: 72 LYGGAAASVTRSKQHKLLQT 91 +G +T +L + Sbjct: 138 QFGNPEEFIT----DRLRKA 153 >UniRef50_C8SAI5 Restriction endonuclease n=1 Tax=Ferroglobus placidus DSM 10642 RepID=C8SAI5_FERPL Length = 261 Score = 39.7 bits (92), Expect = 0.032, Method: Composition-based stats. Identities = 13/52 (25%), Positives = 20/52 (38%), Gaps = 5/52 (9%) Query: 20 GDAWEAQARRWLEGKGLRFIAANV----NERGGEIDLIMREGRTTIFVEVRY 67 G +E R LE G NV E+D+I R+ +E ++ Sbjct: 85 GFNFEKFVARVLEEWGYST-ETNVTMKGRCVMQEVDVIARKDEEVYMIECKF 135 >UniRef50_C1MSX7 Predicted protein n=1 Tax=Micromonas pusilla CCMP1545 RepID=C1MSX7_9CHLO Length = 648 Score = 39.7 bits (92), Expect = 0.033, Method: Composition-based stats. Identities = 14/66 (21%), Positives = 23/66 (34%), Gaps = 11/66 (16%) Query: 15 TTKQTGDAWEAQARRWLEGK--GLRFIAANVNERGGE----IDLIMR--EGRTTIFVEVR 66 + G E+ +L + G R E D+ + +G TIFVEV+ Sbjct: 526 DNRAVGRWGESLVYHYLLSRHVGWRVT---WMNEEKETKSFYDIKLESADGAETIFVEVK 582 Query: 67 YRRSAL 72 + Sbjct: 583 TTKFGD 588 >UniRef50_A7H236 Putative uncharacterized protein n=1 Tax=Campylobacter jejuni subsp. doylei 269.97 RepID=A7H236_CAMJD Length = 161 Score = 39.7 bits (92), Expect = 0.033, Method: Composition-based stats. Identities = 21/115 (18%), Positives = 42/115 (36%), Gaps = 20/115 (17%) Query: 4 VPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAA--NVNERGGEIDLIMREGRTTI 61 T++ K+ GD +E Q + + +G + N ++ IDLI + + Sbjct: 25 PKTKNNQFNYFKNKKKGDLYEIQIGKMYQKQGYKVYFKGINEKKKDAGIDLIAYKDNEVL 84 Query: 62 FVEVRYRRSALYGGAAASVTRSKQHKLLQ-----TARLWLARHNGSFDTVDC-RF 110 ++ + +++ KQ L TA +L + F + RF Sbjct: 85 LIQCKNWQNSQI----------KQEHLRIFLGDCTA--YLEKEKHKFKNKEIKRF 127 >UniRef50_C6JN47 Helicase n=7 Tax=Bacteria RepID=C6JN47_FUSVA Length = 1624 Score = 39.7 bits (92), Expect = 0.034, Method: Composition-based stats. Identities = 10/86 (11%), Positives = 24/86 (27%), Gaps = 6/86 (6%) Query: 24 EAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAASV--T 81 E + ++L +G I ID++ + +E + V Sbjct: 1361 EEEVAKYLYARGYNIIQQ-WEVGAYRIDMVAFFENKRVAIEC---DGERWHSTEEQVKQD 1416 Query: 82 RSKQHKLLQTARLWLARHNGSFDTVD 107 +Q L + ++ + Sbjct: 1417 IERQDILERCGWDFIRIRGSRYFRNP 1442 >UniRef50_B0MYF3 Putative uncharacterized protein n=1 Tax=Alistipes putredinis DSM 17216 RepID=B0MYF3_9BACT Length = 242 Score = 39.3 bits (91), Expect = 0.037, Method: Composition-based stats. Identities = 17/74 (22%), Positives = 27/74 (36%), Gaps = 6/74 (8%) Query: 22 AWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAAASVT 81 +E KG R + + G ID+ + I V+V+ YGG+ V Sbjct: 5 EFEQLVCDHYRQKGYRVQTTSASNDYG-IDVFAEKKGEKIAVQVKM-----YGGSTRKVN 58 Query: 82 RSKQHKLLQTARLW 95 R +L A + Sbjct: 59 RQTMMELYGAAAYF 72 >UniRef50_Q05TI5 Putative uncharacterized protein n=1 Tax=Synechococcus sp. RS9916 RepID=Q05TI5_9SYNE Length = 89 Score = 39.3 bits (91), Expect = 0.038, Method: Composition-based stats. Identities = 10/47 (21%), Positives = 17/47 (36%), Gaps = 1/47 (2%) Query: 57 GRTTIFVEVRYRR-SALYGGAAASVTRSKQHKLLQTARLWLARHNGS 102 + VEV+ RR G A+ K +L + W + + Sbjct: 9 EGRLLVVEVKARRRCGRDGWGVAACNAGKLQRLARAMACWRMANPWT 55 >UniRef50_UPI0001699F07 LppC putative lipoprotein n=1 Tax=Endoriftia persephone 'Hot96_1+Hot96_2' RepID=UPI0001699F07 Length = 390 Score = 38.9 bits (90), Expect = 0.046, Method: Composition-based stats. Identities = 9/46 (19%), Positives = 14/46 (30%), Gaps = 1/46 (2%) Query: 4 VPTRSGSPRQLTTKQTGDAWEAQARRWLEGKG-LRFIAANVNERGG 48 P + G+ E A R L+ +G N + G Sbjct: 345 HPGSRRLTMKAKHLIRGEQAERLACRHLQARGTAAVWMRNYRSQAG 390 >UniRef50_A3CY54 Restriction endonuclease n=1 Tax=Methanoculleus marisnigri JR1 RepID=A3CY54_METMJ Length = 276 Score = 38.9 bits (90), Expect = 0.050, Method: Composition-based stats. Identities = 21/113 (18%), Positives = 31/113 (27%), Gaps = 17/113 (15%) Query: 8 SGSPRQLTTKQTGDA-----WEAQARRWLEGKGLRFIAANVN----ERGGEIDLIMREGR 58 + R + G +E R L G R E+D I Sbjct: 67 AAGHRTNLRRALGLLRSKPDFEEFVRVLLREHGYRV-ETGCVLAGLCGEHEVDAIAERDG 125 Query: 59 TTIFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFD 111 TIFVEV++ S VT + ++ + L D Sbjct: 126 ATIFVEVKHHASHH------RVTGLDEGRIARAIIEDLQE-GFRAGRCTVSID 171 >UniRef50_D2Q690 Putative uncharacterized protein n=6 Tax=Bacteria RepID=D2Q690_9BIFI Length = 453 Score = 38.5 bits (89), Expect = 0.065, Method: Composition-based stats. Identities = 16/63 (25%), Positives = 24/63 (38%), Gaps = 1/63 (1%) Query: 18 QTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRSALYGGAA 77 G E + L G N R GE+D +++ GR + VEV+ A Sbjct: 336 NMGAITENVVAQELRAHGFDLYYFN-TNRLGEVDFMVQSGRIVLPVEVKSGNDWTKHKAL 394 Query: 78 ASV 80 +V Sbjct: 395 DNV 397 >UniRef50_B8D4V0 Endonuclease (RecB family)-like protein n=1 Tax=Desulfurococcus kamchatkensis 1221n RepID=B8D4V0_DESK1 Length = 233 Score = 38.5 bits (89), Expect = 0.068, Method: Composition-based stats. Identities = 14/63 (22%), Positives = 25/63 (39%), Gaps = 6/63 (9%) Query: 13 QLTTKQTGDAWEAQARRWLEGKGLRFIAANVNER-----GGEIDLIMRE-GRTTIFVEVR 66 +++ + + E A +LE +G R + GE+D I G VE++ Sbjct: 2 SISSSRKWRSSELIALEYLEKQGFRIEETRKKIKIEGVEIGEVDAIAISPGGEKYAVEIK 61 Query: 67 YRR 69 R Sbjct: 62 AGR 64 >UniRef50_C3XFJ6 Putative uncharacterized protein n=1 Tax=Helicobacter bilis ATCC 43879 RepID=C3XFJ6_9HELI Length = 160 Score = 38.1 bits (88), Expect = 0.080, Method: Composition-based stats. Identities = 25/109 (22%), Positives = 42/109 (38%), Gaps = 23/109 (21%) Query: 9 GSPRQLTTKQTGDAWEAQARRWLEGKGLRF----IAANVNERGGEIDLIMREGRTTIFVE 64 KQ GD +E Q R+ + +G + I ++G IDLI +GR I ++ Sbjct: 32 AVDYAAINKQKGDDYERQIGRFYQQQGYKVYFKGIKEGRRDQG--IDLIAYKGREAILIQ 89 Query: 65 VRYRRSALYGGAAASVTRSKQHKLLQ-----TARLWLARHNGSFDTVDC 108 + + T+ KQ L TA +L ++ F + Sbjct: 90 CKNWEN----------TQVKQEHLRIFLGDCTA--YLEQNQKIFAKKNV 126 >UniRef50_B8D4V8 Predicted transcriptional regulator n=1 Tax=Desulfurococcus kamchatkensis 1221n RepID=B8D4V8_DESK1 Length = 296 Score = 38.1 bits (88), Expect = 0.086, Method: Composition-based stats. Identities = 15/52 (28%), Positives = 21/52 (40%), Gaps = 1/52 (1%) Query: 15 TTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVR 66 + TGD E R L G R + ID+I G T +F++V Sbjct: 3 RNRGTGDIAEE-VSRTLRKAGFRVEFLSYPTSARSIDIIACRGDTRVFIKVS 53 >UniRef50_B7AVX2 Putative uncharacterized protein n=1 Tax=Bacteroides pectinophilus ATCC 43243 RepID=B7AVX2_9BACE Length = 115 Score = 38.1 bits (88), Expect = 0.090, Method: Composition-based stats. Identities = 14/61 (22%), Positives = 22/61 (36%) Query: 11 PRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTIFVEVRYRRS 70 L G +E L KG R EID+I+++ I +EV+ + Sbjct: 12 ENTLAGSTKGGIFECAVADALFKKGYRLYFYKNETTKREIDVIIQQDGKVIPIEVKSGNT 71 Query: 71 A 71 Sbjct: 72 R 72 >UniRef50_A8MAQ2 Putative uncharacterized protein n=1 Tax=Caldivirga maquilingensis IC-167 RepID=A8MAQ2_CALMQ Length = 246 Score = 38.1 bits (88), Expect = 0.090, Method: Composition-based stats. Identities = 16/97 (16%), Positives = 33/97 (34%), Gaps = 26/97 (26%) Query: 31 LEGKGLRFIAANVNERGGEIDLIMREG-RTTIFVEVRYRRSALYGGAAASVTRSKQ---- 85 L G R I+ G IDL+ + + +EV+ G + +V + K+ Sbjct: 141 LIEDGFRVISREYETMVGSIDLLGEDKAGNLVVIEVKRS-----GASPEAVHQLKRYVDY 195 Query: 86 ------HKLLQ----------TARLWLARHNGSFDTV 106 ++++ +A +L + F Sbjct: 196 ITSKNPGRVVRGILVAAWISASAYRYLKDYGLEFRRY 232 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.310 0.176 0.568 Lambda K H 0.267 0.0542 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,260,497,809 Number of Sequences: 3077464 Number of extensions: 71485062 Number of successful extensions: 179742 Number of sequences better than 1.0e-01: 470 Number of HSP's better than 0.1 without gapping: 1251 Number of HSP's successfully gapped in prelim test: 132 Number of HSP's that attempted gapping in prelim test: 176389 Number of HSP's gapped (non-prelim): 1415 length of query: 131 length of database: 1,040,396,356 effective HSP length: 96 effective length of query: 35 effective length of database: 744,959,812 effective search space: 26073593420 effective search space used: 26073593420 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.8 bits) S2: 88 (38.1 bits)