BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (439 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P64427 UPF0748 lipoprotein yddW n=88 Tax=Enterobacteria... 905 0.0 UniRef50_C2DR58 Lipoprotein yddW n=6 Tax=Enterobacteriaceae RepI... 719 0.0 UniRef50_C1M4K1 Lipoprotein YddW n=4 Tax=Enterobacteriaceae RepI... 692 0.0 UniRef50_Q48C14 YngK protein n=53 Tax=Proteobacteria RepID=Q48C1... 466 e-130 UniRef50_A6WZY4 Putative uncharacterized protein n=10 Tax=Brucel... 433 e-120 UniRef50_A6TUC1 Putative uncharacterized protein n=5 Tax=Bacteri... 347 7e-94 UniRef50_C6J3R7 Putative uncharacterized protein n=1 Tax=Paeniba... 329 2e-88 UniRef50_C9XP71 Cell surface protein n=6 Tax=Clostridium RepID=C... 329 2e-88 UniRef50_C0Z8S4 Putative uncharacterized protein n=1 Tax=Breviba... 324 4e-87 UniRef50_O35015 UPF0748 protein yngK n=11 Tax=Bacteria RepID=YNG... 316 1e-84 UniRef50_UPI000178945D protein of unknown function DUF187 n=1 Ta... 315 3e-84 UniRef50_C5C4P8 Putative uncharacterized protein n=4 Tax=Bacteri... 307 5e-82 UniRef50_A7Z5C7 YngK n=5 Tax=Bacteria RepID=A7Z5C7_BACA2 306 1e-81 UniRef50_D2AWT1 FenI protein n=1 Tax=Streptosporangium roseum DS... 301 3e-80 UniRef50_Q47Q17 FenI protein n=9 Tax=Bacteria RepID=Q47Q17_THEFY 298 2e-79 UniRef50_D1S7M0 Putative uncharacterized protein n=1 Tax=Micromo... 296 1e-78 UniRef50_Q81DH4 FenI n=65 Tax=Bacteria RepID=Q81DH4_BACCR 294 5e-78 UniRef50_B8I4Q9 Putative uncharacterized protein n=2 Tax=Bacteri... 293 6e-78 UniRef50_C4RBZ7 FenI protein n=10 Tax=Actinomycetales RepID=C4RB... 291 3e-77 UniRef50_D2AR89 FenI protein n=9 Tax=Bacteria RepID=D2AR89_STRRD 291 3e-77 UniRef50_B1HPQ3 Hypothetical lipoprotein yddW n=2 Tax=Bacillacea... 290 7e-77 UniRef50_UPI00016A6D2C fenI protein n=1 Tax=Burkholderia oklahom... 289 1e-76 UniRef50_D1AYL2 Putative uncharacterized protein n=1 Tax=Strepto... 286 7e-76 UniRef50_D1A3Q7 Putative uncharacterized protein n=1 Tax=Thermom... 285 3e-75 UniRef50_A1V3X0 FenI protein n=36 Tax=Bacteria RepID=A1V3X0_BURMS 285 3e-75 UniRef50_A8MM80 Putative uncharacterized protein n=1 Tax=Alkalip... 282 2e-74 UniRef50_D0GIS1 YngK n=16 Tax=Bacteria RepID=D0GIS1_9FUSO 277 5e-73 UniRef50_C7IM14 Putative uncharacterized protein n=1 Tax=Clostri... 274 6e-72 UniRef50_A5FI17 Putative uncharacterized protein n=1 Tax=Flavoba... 271 5e-71 UniRef50_C5PKN1 Possible FenI n=1 Tax=Sphingobacterium spiritivo... 270 7e-71 UniRef50_C9L341 YngK protein n=45 Tax=Bacteroidales RepID=C9L341... 267 6e-70 UniRef50_A6NVH8 Putative uncharacterized protein n=1 Tax=Bactero... 266 1e-69 UniRef50_A3HZ09 FenI n=1 Tax=Algoriphagus sp. PR1 RepID=A3HZ09_9... 264 5e-69 UniRef50_A1ZQ43 YngK protein n=1 Tax=Microscilla marina ATCC 231... 262 2e-68 UniRef50_A9NEW1 Putative uncharacterized protein n=1 Tax=Acholep... 260 7e-68 UniRef50_UPI00016C0313 cell surface protein n=1 Tax=Epulopiscium... 260 8e-68 UniRef50_D2QEX0 Putative uncharacterized protein n=2 Tax=Flexiba... 257 6e-67 UniRef50_D1N426 Putative uncharacterized protein n=1 Tax=Victiva... 256 8e-67 UniRef50_C6XWP7 Putative uncharacterized protein n=1 Tax=Pedobac... 256 9e-67 UniRef50_C7PIN2 Putative uncharacterized protein n=1 Tax=Chitino... 256 9e-67 UniRef50_A5FAG6 Putative uncharacterized protein n=1 Tax=Flavoba... 247 5e-64 UniRef50_A4ASW6 FenI n=1 Tax=Flavobacteriales bacterium HTCC2170... 238 3e-61 UniRef50_Q7MXU6 YngK protein n=4 Tax=Porphyromonadaceae RepID=Q7... 235 2e-60 UniRef50_A6EKL7 Putative uncharacterized protein (Fragment) n=1 ... 234 5e-60 UniRef50_B4D6Q1 Putative uncharacterized protein n=2 Tax=Verruco... 226 2e-57 UniRef50_A9NEW0 Putative uncharacterized protein n=1 Tax=Acholep... 224 4e-57 UniRef50_B9XM08 Putative uncharacterized protein n=2 Tax=bacteri... 223 8e-57 UniRef50_Q8YW40 All1776 protein n=5 Tax=Nostocaceae RepID=Q8YW40... 223 9e-57 UniRef50_C3XYE7 Putative uncharacterized protein n=2 Tax=Branchi... 219 1e-55 UniRef50_B7AM83 Putative uncharacterized protein n=1 Tax=Bactero... 219 2e-55 UniRef50_B4VZ35 Putative uncharacterized protein n=1 Tax=Microco... 218 3e-55 UniRef50_UPI0001C160EA conserved hypothetical protein n=2 Tax=No... 218 4e-55 UniRef50_C0YRL9 FenI family protein n=3 Tax=Bacteroidetes RepID=... 214 5e-54 UniRef50_A0M6M5 Protein containing DUF187 n=4 Tax=Bacteroidetes ... 213 1e-53 UniRef50_Q110S6 Putative uncharacterized protein n=5 Tax=Bacteri... 212 2e-53 UniRef50_C9PUA7 FenI protein n=2 Tax=Prevotella RepID=C9PUA7_9BACT 210 1e-52 UniRef50_UPI00016C4E90 hypothetical protein GobsU_27726 n=1 Tax=... 209 2e-52 UniRef50_C1A9I5 Putative uncharacterized protein n=1 Tax=Gemmati... 208 4e-52 UniRef50_B0NT08 Putative uncharacterized protein n=2 Tax=Bactero... 207 6e-52 UniRef50_A6G0M0 Putative uncharacterized protein n=1 Tax=Plesioc... 205 3e-51 UniRef50_C1A7Q3 Putative uncharacterized protein n=1 Tax=Gemmati... 205 3e-51 UniRef50_C0EGV5 Putative uncharacterized protein n=1 Tax=Clostri... 202 2e-50 UniRef50_C6XWM5 Putative uncharacterized protein n=1 Tax=Pedobac... 202 3e-50 UniRef50_C1I7D2 Putative uncharacterized protein n=1 Tax=Clostri... 201 5e-50 UniRef50_A7VTI3 Putative uncharacterized protein n=1 Tax=Clostri... 200 7e-50 UniRef50_C0EWT6 Putative uncharacterized protein (Fragment) n=1 ... 190 1e-46 UniRef50_B0P7J3 Putative uncharacterized protein n=1 Tax=Anaerot... 189 1e-46 UniRef50_A9NEM7 Hypothetical surface-anchored protein n=2 Tax=Ac... 189 2e-46 UniRef50_C1Q9T9 Uncharacterized conserved protein n=3 Tax=Brachy... 187 9e-46 UniRef50_B2ULM6 Putative uncharacterized protein n=1 Tax=Akkerma... 187 9e-46 UniRef50_A9KK48 Putative uncharacterized protein n=1 Tax=Clostri... 185 4e-45 UniRef50_UPI0001745532 hypothetical protein VspiD_00105 n=1 Tax=... 183 1e-44 UniRef50_C3R8E6 S-layer protein n=24 Tax=Bacteroides RepID=C3R8E... 182 3e-44 UniRef50_A6L917 Putative uncharacterized protein n=5 Tax=Bactero... 179 1e-43 UniRef50_C9LEC6 YngK protein n=1 Tax=Prevotella tannerae ATCC 51... 178 3e-43 UniRef50_D1PRQ4 FenI protein n=1 Tax=Subdoligranulum variabile D... 178 4e-43 UniRef50_C7H8A9 FenI protein n=2 Tax=Faecalibacterium prausnitzi... 174 8e-42 UniRef50_Q7MWV9 YngK protein n=2 Tax=Porphyromonas gingivalis Re... 172 1e-41 UniRef50_B9Y560 Putative uncharacterized protein n=1 Tax=Holdema... 169 1e-40 UniRef50_B0MQ11 Putative uncharacterized protein n=1 Tax=Eubacte... 169 3e-40 UniRef50_C3QJ47 S-layer protein n=5 Tax=Bacteroides RepID=C3QJ47... 168 4e-40 UniRef50_C3J8B5 YngK protein n=2 Tax=Bacteria RepID=C3J8B5_9PORP 166 2e-39 UniRef50_B3QYB7 Putative uncharacterized protein n=1 Tax=Chloroh... 164 6e-39 UniRef50_UPI0001C37647 hypothetical protein RflaF_08645 n=1 Tax=... 162 2e-38 UniRef50_C4FZ05 Putative uncharacterized protein n=1 Tax=Abiotro... 162 3e-38 UniRef50_B0NXH7 Putative uncharacterized protein n=3 Tax=Clostri... 161 4e-38 UniRef50_C2M9G1 YngK protein n=1 Tax=Porphyromonas uenonis 60-3 ... 158 4e-37 UniRef50_A6DH63 Putative uncharacterized protein n=1 Tax=Lentisp... 150 1e-34 UniRef50_C2FS67 FenI family protein n=1 Tax=Sphingobacterium spi... 145 3e-33 UniRef50_B6YR88 Putative uncharacterized protein n=1 Tax=Candida... 143 1e-32 UniRef50_C5VL52 YngK protein n=3 Tax=Prevotella RepID=C5VL52_9BACT 142 2e-32 UniRef50_C9PZF4 YngK protein n=5 Tax=Prevotella RepID=C9PZF4_9BACT 141 4e-32 UniRef50_D1PA22 YngK protein n=1 Tax=Prevotella copri DSM 18205 ... 134 5e-30 UniRef50_C2L0K0 Lipoprotein yddW n=1 Tax=Oribacterium sinus F026... 121 4e-26 UniRef50_C2FS66 Putative uncharacterized protein n=1 Tax=Sphingo... 112 3e-23 UniRef50_C7GZF2 Putative lipoprotein n=1 Tax=Eubacterium saphenu... 100 1e-19 UniRef50_B0P7J4 Putative uncharacterized protein n=1 Tax=Anaerot... 95 4e-18 UniRef50_C7E4U8 Putative uncharacterized protein psa8 n=1 Tax=Pa... 92 4e-17 UniRef50_B0PF61 Putative uncharacterized protein n=1 Tax=Anaerot... 84 1e-14 UniRef50_B8HYQ9 Putative uncharacterized protein n=1 Tax=Cyanoth... 83 2e-14 UniRef50_P74735 Slr0592 protein n=1 Tax=Synechocystis sp. PCC 68... 82 4e-14 UniRef50_A0YS74 Putative uncharacterized protein n=2 Tax=Oscilla... 82 4e-14 UniRef50_B4VTS6 Putative uncharacterized protein n=1 Tax=Microco... 82 4e-14 UniRef50_Q8YV65 All2116 protein n=15 Tax=Cyanobacteria RepID=Q8Y... 82 5e-14 UniRef50_B0MQ12 Putative uncharacterized protein n=1 Tax=Eubacte... 81 8e-14 UniRef50_Q8YQA0 All3933 protein n=18 Tax=Cyanobacteria RepID=Q8Y... 81 8e-14 UniRef50_B9XI64 Putative uncharacterized protein n=1 Tax=bacteri... 78 6e-13 UniRef50_A8YI06 Similar to tr|Q8YPV9|Q8YPV9 n=8 Tax=Chroococcale... 78 6e-13 UniRef50_B4VPG3 Putative uncharacterized protein n=1 Tax=Microco... 78 7e-13 UniRef50_Q7NL32 Glr1294 protein n=1 Tax=Gloeobacter violaceus Re... 77 9e-13 UniRef50_C1D2P2 Putative uncharacterized protein n=2 Tax=Deinoco... 77 1e-12 UniRef50_Q8YXK2 All1210 protein n=4 Tax=Nostocaceae RepID=Q8YXK2... 77 2e-12 UniRef50_A8YDR3 Genome sequencing data, contig C294 n=9 Tax=Chro... 76 2e-12 UniRef50_UPI0001C16380 Protein of unknown function DUF187 n=1 Ta... 76 2e-12 UniRef50_B7JXY5 Putative uncharacterized protein n=9 Tax=Cyanoba... 76 2e-12 UniRef50_C6PCP2 Putative uncharacterized protein n=1 Tax=Thermoa... 76 3e-12 UniRef50_Q2JQ39 Putative uncharacterized protein n=1 Tax=Synecho... 75 7e-12 UniRef50_Q10YX0 Putative uncharacterized protein n=2 Tax=Cyanoba... 75 7e-12 UniRef50_B2IV00 Putative uncharacterized protein n=4 Tax=Cyanoba... 72 3e-11 UniRef50_C1D298 Putative uncharacterized protein n=1 Tax=Deinoco... 72 6e-11 UniRef50_Q8EPF4 Hypothetical conserved protein n=1 Tax=Oceanobac... 71 8e-11 UniRef50_Q6AHL3 Putative uncharacterized protein n=1 Tax=Leifson... 71 9e-11 UniRef50_B0VF99 Putative uncharacterized protein n=1 Tax=Candida... 70 1e-10 UniRef50_Q1IWF6 Putative uncharacterized protein n=3 Tax=Deinoco... 69 3e-10 UniRef50_B5W1E7 Putative uncharacterized protein n=2 Tax=Arthros... 69 3e-10 UniRef50_A0YRE2 Putative uncharacterized protein n=1 Tax=Lyngbya... 69 3e-10 UniRef50_A6CAJ3 Putative uncharacterized protein n=1 Tax=Plancto... 69 4e-10 UniRef50_B1WZU0 Putative uncharacterized protein n=2 Tax=Cyanoth... 68 6e-10 UniRef50_B4AVG6 Putative uncharacterized protein n=1 Tax=Cyanoth... 68 7e-10 UniRef50_Q8YLM8 Alr5270 protein n=12 Tax=Cyanobacteria RepID=Q8Y... 68 8e-10 UniRef50_B5WA73 Putative uncharacterized protein n=2 Tax=Arthros... 67 9e-10 UniRef50_Q8YK50 All8067 protein n=8 Tax=Cyanobacteria RepID=Q8YK... 65 5e-09 UniRef50_Q6ZE96 Slr7102 protein n=5 Tax=Cyanobacteria RepID=Q6ZE... 65 6e-09 UniRef50_Q7NJN0 Glr1802 protein n=1 Tax=Gloeobacter violaceus Re... 64 9e-09 UniRef50_C3R3M7 Putative uncharacterized protein n=2 Tax=Bactero... 64 1e-08 UniRef50_C3A5Y1 Putative uncharacterized protein n=1 Tax=Bacillu... 63 2e-08 UniRef50_B4WH89 Putative uncharacterized protein n=1 Tax=Synecho... 63 2e-08 UniRef50_A2C8D8 DUF187 n=12 Tax=Cyanobacteria RepID=A2C8D8_PROM3 62 6e-08 UniRef50_C5CIL6 Putative uncharacterized protein n=1 Tax=Kosmoto... 60 2e-07 UniRef50_Q3AJ74 Putative uncharacterized protein n=3 Tax=Chrooco... 60 2e-07 UniRef50_UPI0001AF05D8 hypothetical protein SghaA1_34850 n=1 Tax... 58 8e-07 UniRef50_P74629 Sll0736 protein n=1 Tax=Synechocystis sp. PCC 68... 55 5e-06 UniRef50_B4WJG2 Putative uncharacterized protein n=1 Tax=Synecho... 55 6e-06 UniRef50_C6IEW4 Putative uncharacterized protein n=4 Tax=Bactero... 52 5e-05 UniRef50_A7LVF6 Putative uncharacterized protein n=4 Tax=Bactero... 50 1e-04 UniRef50_Q2BFL2 Putative uncharacterized protein n=1 Tax=Bacillu... 50 2e-04 UniRef50_A8F7U2 Putative uncharacterized protein n=2 Tax=Thermot... 49 3e-04 UniRef50_UPI0001C1694B Protein of unknown function DUF187 n=1 Ta... 49 5e-04 UniRef50_C9KJL9 Alpha-galactosidase n=1 Tax=Mitsuokella multacid... 45 0.005 UniRef50_Q8YXF7 All1256 protein n=4 Tax=Nostocaceae RepID=Q8YXF7... 44 0.009 UniRef50_Q5N184 Putative uncharacterized protein n=2 Tax=Synecho... 44 0.011 UniRef50_A7VPY5 Putative uncharacterized protein n=1 Tax=Clostri... 44 0.012 UniRef50_C6VY08 Putative uncharacterized protein n=1 Tax=Dyadoba... 44 0.014 UniRef50_Q9P8N4 Alpha-galactosidase (Fragment) n=2 Tax=Lichtheim... 44 0.016 UniRef50_D1BUC2 Putative uncharacterized protein n=1 Tax=Xylanim... 43 0.021 UniRef50_C3QPV8 Glycoside hydrolase family 36 protein n=5 Tax=Ba... 43 0.022 UniRef50_Q8AAL7 S-layer related protein, sialic acid-specific 9-... 43 0.025 UniRef50_A6L961 Glycoside hydrolase family 36, candidate alpha-g... 42 0.038 UniRef50_Q114S3 Putative uncharacterized protein n=2 Tax=Oscilla... 42 0.051 UniRef50_B8HXB6 Putative uncharacterized protein n=1 Tax=Cyanoth... 41 0.086 >UniRef50_P64427 UPF0748 lipoprotein yddW n=88 Tax=Enterobacteriaceae RepID=YDDW_ECO57 Length = 439 Score = 905 bits (2340), Expect = 0.0, Method: Compositional matrix adjust. Identities = 439/439 (100%), Positives = 439/439 (100%) Query: 1 MDICSRNKKLTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIW 60 MDICSRNKKLTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIW Sbjct: 1 MDICSRNKKLTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIW 60 Query: 61 LATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPS 120 LATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPS Sbjct: 61 LATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPS 120 Query: 121 KILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNST 180 KILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNST Sbjct: 121 KILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNST 180 Query: 181 LSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYT 240 LSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYT Sbjct: 181 LSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYT 240 Query: 241 ESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWR 300 ESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWR Sbjct: 241 ESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWR 300 Query: 301 NRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWAD 360 NRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWAD Sbjct: 301 NRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWAD 360 Query: 361 VVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDY 420 VVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDY Sbjct: 361 VVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDY 420 Query: 421 LNKPQTQQAVSYLQSRWGS 439 LNKPQTQQAVSYLQSRWGS Sbjct: 421 LNKPQTQQAVSYLQSRWGS 439 >UniRef50_C2DR58 Lipoprotein yddW n=6 Tax=Enterobacteriaceae RepID=C2DR58_ECOLX Length = 349 Score = 719 bits (1857), Expect = 0.0, Method: Compositional matrix adjust. Identities = 347/349 (99%), Positives = 348/349 (99%) Query: 91 MIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAH 150 MIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAH Sbjct: 1 MIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAH 60 Query: 151 KRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIP 210 KRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIP Sbjct: 61 KRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIP 120 Query: 211 EVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNN 270 EVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNN Sbjct: 121 EVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNN 180 Query: 271 TQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQ 330 TQQLIAKVSHTIKSIKP VEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQ Sbjct: 181 TQQLIAKVSHTIKSIKPEVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQ 240 Query: 331 GLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMI 390 GLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMI Sbjct: 241 GLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMI 300 Query: 391 NGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQSRWGS 439 NGGVPELKKQLDLNDA+PEISGTILFREDYLNKPQTQQAVSYLQSRWGS Sbjct: 301 NGGVPELKKQLDLNDALPEISGTILFREDYLNKPQTQQAVSYLQSRWGS 349 >UniRef50_C1M4K1 Lipoprotein YddW n=4 Tax=Enterobacteriaceae RepID=C1M4K1_9ENTR Length = 441 Score = 692 bits (1787), Expect = 0.0, Method: Compositional matrix adjust. Identities = 332/424 (78%), Positives = 373/424 (87%), Gaps = 5/424 (1%) Query: 16 AILVALALLLCSCKSTPPESMVTP-PAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSS 74 A+LV LLL SC S PP TP P SKP QQS +P+RGIWLATVSRLDWPP+SS Sbjct: 21 AVLVGSMLLLGSCSSQPPGPKTTPLPPVSKP----QQSKEPVRGIWLATVSRLDWPPISS 76 Query: 75 VNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIG 134 VNIS+P R QQ+A+ DKLD+L+RLGINTVFFQVKPDGTALW SKILPWSD +TG IG Sbjct: 77 VNISSPAVRISQQQKALTDKLDNLKRLGINTVFFQVKPDGTALWKSKILPWSDTLTGTIG 136 Query: 135 ENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRD 194 ++PGYDPLQFMLDEAHKRGMKVHAW NPYRVSVNTKP T+ ELNSTLSQ P+SVYV HRD Sbjct: 137 QDPGYDPLQFMLDEAHKRGMKVHAWLNPYRVSVNTKPSTVSELNSTLSQTPSSVYVLHRD 196 Query: 195 WIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYR 254 WIRT+G+RFVLDPGIP+V+DWITSIVAEVV YPVDGVQFDDYFYTESPGS LND++T+R Sbjct: 197 WIRTAGERFVLDPGIPDVRDWITSIVAEVVENYPVDGVQFDDYFYTESPGSALNDSQTFR 256 Query: 255 KYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGA 314 +YG FASKADWRR+NTQ+LIA+VS TIK +KP VEFGVSPAGVWRNRSHDP GSDTRGA Sbjct: 257 RYGQGFASKADWRRDNTQRLIAQVSRTIKKLKPEVEFGVSPAGVWRNRSHDPAGSDTRGA 316 Query: 315 AAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIA 374 AAYDESYADTRRWV+ GLLDYIAPQ+YWPF+R AARYDVLAKWWADVVK T TRLYIG+A Sbjct: 317 AAYDESYADTRRWVQLGLLDYIAPQLYWPFARDAARYDVLAKWWADVVKSTNTRLYIGVA 376 Query: 375 FYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQ 434 YKVGEPS+ EPDW + GGVPELKKQLDLN++ P I+GTILFREDYLN+PQTQ+AV+Y++ Sbjct: 377 LYKVGEPSRKEPDWTVKGGVPELKKQLDLNESEPYINGTILFREDYLNQPQTQEAVTYIR 436 Query: 435 SRWG 438 +RWG Sbjct: 437 NRWG 440 >UniRef50_Q48C14 YngK protein n=53 Tax=Proteobacteria RepID=Q48C14_PSE14 Length = 393 Score = 466 bits (1198), Expect = e-130, Method: Compositional matrix adjust. Identities = 208/388 (53%), Positives = 280/388 (72%), Gaps = 1/388 (0%) Query: 52 SSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVK 111 +++ ++ W+ATV+ LDWP VSSV I++ +R Q++ + LD + + +N V FQV Sbjct: 4 ANKNLKATWVATVTNLDWPSVSSVAITDEAARVSKQKEELTGILDEIVAMKMNAVIFQVV 63 Query: 112 PDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKP 171 P A + S +LPWS +TG +G+NPG+DPL + +++AH R +++HAW NPYRVS+N Sbjct: 64 PCADAFYASDLLPWSKYLTGTLGKNPGFDPLAYAIEQAHARNIELHAWVNPYRVSMNASD 123 Query: 172 GTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDG 231 TI ELN++ S PASV+ H +W T+ +RFVL+PGIPEVQ W++SIV E+V++Y VD Sbjct: 124 ATIEELNNSSSDSPASVFKTHPEWTGTAANRFVLNPGIPEVQTWVSSIVEEIVTKYDVDA 183 Query: 232 VQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEF 291 +QFDDYFY E+ S L D+ TY+KY F +KADWRRNNT L+ I ++K V F Sbjct: 184 IQFDDYFYNETASSLLQDDATYQKYNTNFTTKADWRRNNTYSLVDTCHKKIAAVKADVLF 243 Query: 292 GVSPAGVWRNRSHDPLGSDTR-GAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAAR 350 GVSPAGVWRN+S DPLGSDT+ GA+ YD +YADTR+WV G++DYIAPQ+YWPF+R AR Sbjct: 244 GVSPAGVWRNKSDDPLGSDTQAGASNYDFAYADTRKWVIDGIIDYIAPQVYWPFAREVAR 303 Query: 351 YDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEI 410 YDV+ +WWAD V T T LYIG+A YKVG S+ EPDW + GGVPE+ +QLDLND++ E+ Sbjct: 304 YDVITQWWADTVSGTGTALYIGMALYKVGTASETEPDWTVEGGVPEITRQLDLNDSLTEV 363 Query: 411 SGTILFREDYLNKPQTQQAVSYLQSRWG 438 SG +LFR +L QTQQ V YL+ RW Sbjct: 364 SGCMLFRHMFLRASQTQQVVDYLKLRWA 391 >UniRef50_A6WZY4 Putative uncharacterized protein n=10 Tax=Brucellaceae RepID=A6WZY4_OCHA4 Length = 442 Score = 433 bits (1113), Expect = e-120, Method: Compositional matrix adjust. Identities = 190/380 (50%), Positives = 270/380 (71%), Gaps = 1/380 (0%) Query: 60 WLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWP 119 W+ATV LDWP SS I + R + Q++ ++ D GIN V FQV P A + Sbjct: 56 WIATVLNLDWPSRSSSRIEDDAERIKRQKEELLRLFDEASEHGINAVIFQVSPTADAFYQ 115 Query: 120 SKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNS 179 S LPWS +TG +G++PG+DPL+F + EAHKRG+++HAW NPYRVS++TKP T +EL + Sbjct: 116 SSYLPWSSYLTGTLGKDPGFDPLKFAIQEAHKRGIELHAWLNPYRVSMDTKPSTRKELRN 175 Query: 180 TLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFY 239 + ++ P SV+ H DW+ S DR+VLDPGIP V++W+T++ AEVV +Y +DG+QFDDYFY Sbjct: 176 SSNESPVSVFKSHPDWVGVSADRYVLDPGIPAVREWVTNVTAEVVQKYDIDGIQFDDYFY 235 Query: 240 TESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVW 299 E+ S+L+D+++Y ++G F+SK +WRR NT L+ ++S IK+IKP V FG+SP+GVW Sbjct: 236 YETASSKLDDDKSYARFGTRFSSKYEWRRYNTHTLVREISDKIKAIKPNVRFGISPSGVW 295 Query: 300 RNRSHDPLGSDTR-GAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWW 358 RN + DP GS TR G YD +ADTRRWV++G++DYIAPQIYW F R Y +AKWW Sbjct: 296 RNAADDPRGSATRAGKTNYDGDFADTRRWVKEGMIDYIAPQIYWSFGRKDVSYGTIAKWW 355 Query: 359 ADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFRE 418 AD V+ T+T LYIG+A Y+ G + +EP W GV E+K+QL+ N+++PE+ G+ILFR+ Sbjct: 356 ADTVRGTKTDLYIGLALYRAGSGTTLEPSWQAGEGVTEIKRQLEFNESLPEVKGSILFRQ 415 Query: 419 DYLNKPQTQQAVSYLQSRWG 438 +L+ P+ + +YL+ WG Sbjct: 416 GFLSDPKLKGVSNYLKKTWG 435 >UniRef50_A6TUC1 Putative uncharacterized protein n=5 Tax=Bacteria RepID=A6TUC1_ALKMQ Length = 731 Score = 347 bits (889), Expect = 7e-94, Method: Compositional matrix adjust. Identities = 175/430 (40%), Positives = 257/430 (59%), Gaps = 14/430 (3%) Query: 8 KKLTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRL 67 K ++I IL+ AL L S P + P T + +RG W++TV L Sbjct: 9 KLISICIVGILMITALPLHSFAIEEPWDQYNQYLPRETPVTKRH----LRGAWISTVINL 64 Query: 68 DWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSD 127 DWP V + I N R + ++ +I LD + +N VFFQV P+G A + S I+PWS Sbjct: 65 DWPSVETAKIKNDKERIQKSKEELIAILDKSVEMNMNAVFFQVSPEGDAFYNSNIVPWSR 124 Query: 128 LMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPAS 187 +TG G++PG+DPL F ++EAHKR +++HAWFNPYR+S+ TI LN S Sbjct: 125 YLTGTFGKDPGFDPLAFAIEEAHKRNLELHAWFNPYRISMYMNDSTIESLNI-----EKS 179 Query: 188 VYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRL 247 VY +H DW++++ RFV+DPGIP+ ++W+ EVV+ Y VDG+ FDDYFY E L Sbjct: 180 VYKEHPDWVKSAMSRFVIDPGIPQAREWVIKRTMEVVNDYDVDGIHFDDYFYYEKHVGEL 239 Query: 248 NDNETYRKYG-GAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDP 306 D +T+ +Y G F++ +WRRNNT L+ ++S+ I+ KP ++FG+SPAGVW N+ Sbjct: 240 EDQDTFSQYNLGQFSNLGEWRRNNTYLLVKELSNEIRKTKPWIKFGISPAGVWANKKDGH 299 Query: 307 L-GSDTR-GAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKP 364 L GS+T G YD S+ADT++WVE+ ++DYIAPQ+Y+ F+ +A Y +A WW++VV+ Sbjct: 300 LNGSNTSAGLPNYDRSFADTKKWVEEEIIDYIAPQVYFTFANPSAPYGEVANWWSNVVRG 359 Query: 365 TRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKP 424 LYIG A YKV + + + + N V E +Q N PE+ G+I+FR N Sbjct: 360 KNVHLYIGQALYKVNDNA--DQYFQGNHAVEEFVRQHKYNTMKPEVMGSIMFRFQNFNHG 417 Query: 425 QTQQAVSYLQ 434 QQ V+ ++ Sbjct: 418 NKQQVVNVMK 427 >UniRef50_C6J3R7 Putative uncharacterized protein n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6J3R7_9BACL Length = 545 Score = 329 bits (843), Expect = 2e-88, Method: Compositional matrix adjust. Identities = 176/422 (41%), Positives = 245/422 (58%), Gaps = 30/422 (7%) Query: 24 LLCSCKSTPPESMV--TPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPT 81 LL +T PE V P P +S +RG+W++TVS LDWP SS Sbjct: 144 LLSGSGATQPEPGVGGEDPQSDVPQPPAVDTSNGLRGVWVSTVSNLDWPSKSSYG----- 198 Query: 82 SRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDP 141 + Q+ + LD +Q +G+N VF QV+P A++PS +PWS +TG G++PGYDP Sbjct: 199 -KVEAQKAEYVQLLDEVQAMGMNAVFVQVRPSADAIYPSSQVPWSSYLTGTAGKDPGYDP 257 Query: 142 LQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPAS-VYVQHRDWIRTSG 200 LQF+++E H+RGM+ HAWFNP+R S S S+ PA+ V QH +WI Sbjct: 258 LQFLIEETHRRGMEFHAWFNPFRAST----------GSDASKLPANHVANQHPEWIVKFD 307 Query: 201 DRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFY--TESPGSRLNDNETYRKYGG 258 + ++PGIPE +D + S + EVV+ Y +DGV DDYFY E+ + D+ T++ Y Sbjct: 308 GKLYINPGIPEARDHVISAIMEVVNGYDIDGVHLDDYFYPTGETTSKKFADDATFKSYNS 367 Query: 259 -AFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAA-A 316 A+K DWRR+N Q + K+ I++ KP V FG+SP GVWRN+S+D GSDT+ + A Sbjct: 368 KKIATKGDWRRDNINQFVQKLGQRIEASKPYVSFGISPYGVWRNKSNDLTGSDTKASVTA 427 Query: 317 YDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFY 376 YD +YAD R W++ +DY+APQ+YW +R RYD+LA WWA V+ T +LYIG A Y Sbjct: 428 YDSTYADVRTWIKNEWIDYVAPQLYWSMTRKEVRYDLLADWWAQEVRGTNVKLYIGHAPY 487 Query: 377 KVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQSR 436 K+G P E W E+ QL+ N +PEISG+I F L K + LQS Sbjct: 488 KLGTP---EIGW---SSAQEIINQLEYNRQIPEISGSIFFSAKDLRK-NPLGLIPLLQSY 540 Query: 437 WG 438 +G Sbjct: 541 YG 542 >UniRef50_C9XP71 Cell surface protein n=6 Tax=Clostridium RepID=C9XP71_CLODC Length = 703 Score = 329 bits (843), Expect = 2e-88, Method: Compositional matrix adjust. Identities = 178/431 (41%), Positives = 247/431 (57%), Gaps = 51/431 (11%) Query: 12 IRRPAILV---ALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLD 68 +++ +ILV + L +CS S S + + MR W++TV LD Sbjct: 1 MKKISILVLSLIMTLTMCSVSSFADSS----------------NDKEMRAAWISTVYNLD 44 Query: 69 WPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDL 128 WP + + A+ Q++ D LD L+ +GINT QV+P AL+ S I PWS+ Sbjct: 45 WPKTKN-------NEAK-QKKEYTDLLDKLKSVGINTAVVQVRPKSDALYKSNINPWSEY 96 Query: 129 MTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASV 188 +TG G++PGYDPL F+++EAHKRGM+ HAWFNPYR+++ + ++ + PA Sbjct: 97 LTGTQGKDPGYDPLPFLIEEAHKRGMEFHAWFNPYRITMADE-----SIDKLPANHPAK- 150 Query: 189 YVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLN 248 ++ W+ G+++ DPG+PEV+ +I +AEVV Y +DGV FDDYFY PG N Sbjct: 151 --KNPSWVVKHGNKYYYDPGLPEVRKYIVDSIAEVVQNYDIDGVHFDDYFY---PGVSFN 205 Query: 249 DNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLG 308 D TY+KYG +K +WRR N L+ V +IKSIKP V FGVSPAG+WRN+S DP G Sbjct: 206 DTATYQKYGKG-QNKDNWRRENVNTLLRDVKASIKSIKPNVVFGVSPAGIWRNKSSDPTG 264 Query: 309 SDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTR 368 SDT G +Y +YADTR W++QGL+DY+ PQ+YWP AA Y L WWA+ VK T Sbjct: 265 SDTSGNESYVGTYADTRAWIKQGLIDYVVPQLYWPIGLKAADYSKLVAWWANEVKGTNVD 324 Query: 369 LYIGIAFYKVGEPSKIEPDWMINGG---VPELKKQLDLNDAVPEISGTILFR-EDYLNKP 424 LYIG YK G+ S GG E+ +Q+ LN EI G++ F +D N Sbjct: 325 LYIGQGIYKQGQSSY--------GGQNIAKEIVQQVTLNRKYSEIKGSMYFSAKDIANST 376 Query: 425 QTQQAVSYLQS 435 Q+ + L S Sbjct: 377 SIQKDLKSLYS 387 >UniRef50_C0Z8S4 Putative uncharacterized protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0Z8S4_BREBN Length = 540 Score = 324 bits (830), Expect = 4e-87, Method: Compositional matrix adjust. Identities = 169/395 (42%), Positives = 231/395 (58%), Gaps = 25/395 (6%) Query: 31 TPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQA 90 TPP + G+ P T ++ + G+W++TV LDWP SS + NP QQQ Sbjct: 154 TPPPQDILSGNGAMEPGTPVVTNGNLHGVWISTVYNLDWP--SSGSYGNPAK----QQQE 207 Query: 91 MIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAH 150 I LD LQ +G+N F QV+P G AL+PS + PWS +TG G++PGYDPL FM+ E H Sbjct: 208 YIQLLDELQAMGMNAAFVQVRPSGDALYPSTLTPWSRFLTGTPGKDPGYDPLAFMVQETH 267 Query: 151 KRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPAS-VYVQHRDWIRTSGDRFVLDPGI 209 +RGM+ HAWFNP+R + + K Q PA+ V QH DWI + + ++PG+ Sbjct: 268 RRGMQFHAWFNPFRATTDAK----------TDQLPANHVIKQHPDWIVNANKKLYINPGV 317 Query: 210 PEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRN 269 P + I + V EVV RY +DGV DDYFY + + SKADWRR+ Sbjct: 318 PAARQQIINEVMEVVQRYDIDGVHLDDYFYPSNVAFADDAAFKAYN-SKKIVSKADWRRD 376 Query: 270 NTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTR-GAAAYDESYADTRRWV 328 N Q + +++ +IKS+KP V+FG+SP GVWRN + DP GSDT+ G AYD +AD R W+ Sbjct: 377 NINQFVQQMNQSIKSVKPHVQFGISPFGVWRNSNVDPTGSDTKAGVTAYDHMFADVRTWI 436 Query: 329 EQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDW 388 +QG +DY+ PQIYW FS + A+YD L WWA+ V+ T +LYIG + YK+G E W Sbjct: 437 QQGWIDYVTPQIYWSFSFAPAQYDKLVTWWANEVQGTNVKLYIGHSPYKLGTA---EAGW 493 Query: 389 MINGGVPELKKQLDLNDAVPEISGTILFREDYLNK 423 E+ QL+ N VP++ G+I F L K Sbjct: 494 Q---SAQEIINQLNFNAMVPQVQGSIFFSAKDLRK 525 >UniRef50_O35015 UPF0748 protein yngK n=11 Tax=Bacteria RepID=YNGK_BACSU Length = 510 Score = 316 bits (809), Expect = 1e-84, Method: Compositional matrix adjust. Identities = 161/384 (41%), Positives = 230/384 (59%), Gaps = 23/384 (5%) Query: 43 SKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLG 102 S P QS + +R +W+A+V +DWP +++ + Q+Q I LD +Q++G Sbjct: 23 SVPFMANAQSDRELRAVWIASVLNIDWPSKKGLSV-------KEQKQEYIKLLDDVQKMG 75 Query: 103 INTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNP 162 +N V Q+KP A +PS PWS+ +TG G++PGYDPL FM++E HKR ++ HAWFNP Sbjct: 76 MNAVIVQIKPTADAFYPSAYGPWSEYLTGVQGKDPGYDPLAFMIEETHKRNLEFHAWFNP 135 Query: 163 YRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAE 222 YR+++N +LN PA +H DW+ G++ PGIPE +D+I + E Sbjct: 136 YRITMNHT-----DLNKLSEDHPAR---KHPDWVAAYGNQLYYHPGIPEARDFIVKGIEE 187 Query: 223 VVSRYPVDGVQFDDYFY-TESPGSRLNDNETYRKYG-GAFASKADWRRNNTQQLIAKVSH 280 VV Y +D V DDYFY + G D Y +YG AF++ DWRR+N QL+ +++ Sbjct: 188 VVKHYDIDAVHMDDYFYPYKIAGQEFPDQAQYEQYGKDAFSNIDDWRRDNVNQLVKQINQ 247 Query: 281 TIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTR-GAAAYDESYADTRRWVEQGLLDYIAPQ 339 TIK+ KP V+FG+SP GVWRN + DP GS+T+ G YD+ YADTR W+++G +DYIAPQ Sbjct: 248 TIKAAKPYVKFGISPFGVWRNAADDPTGSNTKAGVRNYDDLYADTRHWIQEGDIDYIAPQ 307 Query: 340 IYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKK 399 IYW +AA YDVLA WW++ VK LYIG A YK+ + +P W E + Sbjct: 308 IYWSIGFNAAAYDVLADWWSNEVKNRPVHLYIGQAAYKIN--NNFDPPW---SDPEEYVR 362 Query: 400 QLDLNDAVPEISGTILFREDYLNK 423 Q+ LN + + G++ F LNK Sbjct: 363 QITLNRQLELVKGSMHFSLKDLNK 386 >UniRef50_UPI000178945D protein of unknown function DUF187 n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI000178945D Length = 518 Score = 315 bits (806), Expect = 3e-84, Method: Compositional matrix adjust. Identities = 165/383 (43%), Positives = 225/383 (58%), Gaps = 31/383 (8%) Query: 45 PPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGIN 104 PP +T +RG W++TV LDWP + A QQ + I LD LQ +GIN Sbjct: 152 PPVST---GDEVRGAWISTVFNLDWPKTKT--------SAEQQQASYIALLDSLQDVGIN 200 Query: 105 TVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYR 164 TV+ QV+P G AL+PS ++PWS ++TG G +PGYDP+ FM++E H+R M+ HAWFNP+R Sbjct: 201 TVYVQVRPAGDALYPSTMVPWSKVLTGIQGADPGYDPVAFMVEETHRRNMEFHAWFNPFR 260 Query: 165 VSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVV 224 + + T S P+ V + H DWI +G + ++PGIPE + I + EVV Sbjct: 261 ANTDIL---------TASLHPSHVALSHPDWIVNTGKQLYINPGIPEARQHIIDTIMEVV 311 Query: 225 SRYPVDGVQFDDYFYTESPGSRLNDNETYRKY-GGAFASKADWRRNNTQQLIAKVSHTIK 283 + Y +DG+ DDYFY + + ND+ YR++ GA+A+ ADWRR N + + +I Sbjct: 312 NGYDIDGIHLDDYFYPSN--TVFNDDAAYREFNNGAYANLADWRRGNINAFVQSLGESIH 369 Query: 284 SIKPGVEFGVSPAGVWRNRSHDPLGSDTR-GAAAYDESYADTRRWVEQGLLDYIAPQIYW 342 +KP VE+G+SP GVWRN+S D GSDT+ G AYD YAD R W++ G +DY+APQIYW Sbjct: 370 RVKPDVEYGISPFGVWRNQSVDKTGSDTKAGVTAYDSMYADVRTWIQNGWIDYVAPQIYW 429 Query: 343 PFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLD 402 S AA YD L WWA V+ T L IG A YK+G E W E+ QL Sbjct: 430 SMSNPAADYDKLVDWWASEVQGTGVDLLIGHAPYKLGTS---EIGWQ---SASEIINQLK 483 Query: 403 LNDAVPEISGTILFR-EDYLNKP 424 N E+ G+I FR E+ L+ P Sbjct: 484 YNQNHAEVKGSIFFRAENILSNP 506 >UniRef50_C5C4P8 Putative uncharacterized protein n=4 Tax=Bacteria RepID=C5C4P8_BEUC1 Length = 538 Score = 307 bits (787), Expect = 5e-82, Method: Compositional matrix adjust. Identities = 161/406 (39%), Positives = 221/406 (54%), Gaps = 28/406 (6%) Query: 24 LLCSCKSTPPESMVTPPAGSKPPATTQQSSQP-------MRGIWLATVSRLDWPPVSSVN 76 L + S +T GS PA S+ P +R +W+++V +DWP + ++ Sbjct: 20 FLTLATAGVAASTLTVTVGSMTPAAATPSADPAAFLKRELRAMWISSVVNIDWPSATGLS 79 Query: 77 ISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGEN 136 A QQ + LD Q +N VF QV+P A WPS PWS +TG G++ Sbjct: 80 -------AEAQQAEYLHWLDVAQDFRLNAVFVQVRPTADAFWPSPHEPWSQYLTGVQGQD 132 Query: 137 PGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWI 196 PGYDPL F+++E HKR +++H W+NPYRVS+ P + + + PA V H DWI Sbjct: 133 PGYDPLAFIVEETHKRNLELHTWYNPYRVSMQADPAQL------VPEHPARV---HPDWI 183 Query: 197 RTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFY-TESPGSRLNDNETYRK 255 G + DPG+PE Q+ I + + V Y +DGV FDDYFY G + D ETY Sbjct: 184 WPYGGKLYFDPGLPETQEHIQAAILHSVENYDIDGVHFDDYFYPYPVAGQTIPDAETYAT 243 Query: 256 YGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAA 315 YG F DWRR+N I+ +S IK +KP V+FG+SP G+WRN + DPLGS TRG+ Sbjct: 244 YGAGFDDVGDWRRHNVDTFISSISARIKQVKPWVKFGISPFGIWRNDTTDPLGSATRGSQ 303 Query: 316 AYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAF 375 +YD +ADTR+WV +G LDYI PQ+YW + A Y VL WWADV + T LYIG A Sbjct: 304 SYDLQFADTRKWVLEGWLDYINPQVYWQIGLAVADYSVLVPWWADVAATSGTHLYIGEAL 363 Query: 376 YKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYL 421 YKV +P + N L D+ + V + G + F ++ Sbjct: 364 YKVTSGVFTDPAELAN----HLALDRDVTETVGPVHGNVYFSAKHV 405 >UniRef50_A7Z5C7 YngK n=5 Tax=Bacteria RepID=A7Z5C7_BACA2 Length = 512 Score = 306 bits (783), Expect = 1e-81, Method: Compositional matrix adjust. Identities = 162/380 (42%), Positives = 221/380 (58%), Gaps = 23/380 (6%) Query: 47 ATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTV 106 A+ Q + MR +W+A+V+ +DWP ++ Q++ LD +Q +G+N V Sbjct: 29 ASGTQPKREMRAVWIASVTNIDWPSKKGLSPEE-------QKREYSKLLDDVQEMGMNAV 81 Query: 107 FFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVS 166 Q+KP A +PS PWS+ +TG G+NPGYDPL F+++E HKR ++ HAWFNPYR++ Sbjct: 82 IVQIKPAADAFYPSDYGPWSEYLTGTQGKNPGYDPLAFLVEETHKRNLEFHAWFNPYRIT 141 Query: 167 VNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSR 226 +N LN+ PA H DW+ G + +PGIPEV+ +IT + EVVSR Sbjct: 142 MNHT-----NLNALSDDHPAR---SHPDWVAAYGKQLYYNPGIPEVRQFITDGIKEVVSR 193 Query: 227 YPVDGVQFDDYFY-TESPGSRLNDNETYRKYGGA-FASKADWRRNNTQQLIAKVSHTIKS 284 Y +D V DDYFY + G D Y +YG A FAS DWRR+N +L+ +++ TIK Sbjct: 194 YDIDAVHMDDYFYPYKIAGQEFPDQAEYERYGKAHFASIDDWRRDNVNRLVKEINQTIKR 253 Query: 285 IKPGVEFGVSPAGVWRNRSHDPLGSDT-RGAAAYDESYADTRRWVEQGLLDYIAPQIYWP 343 KP V+FG+SP GVWRN + DP GS+T G YD+ YADTR W+++G +DYIAPQIYW Sbjct: 254 EKPYVKFGISPFGVWRNAADDPTGSETAAGVRNYDDLYADTREWIQKGYIDYIAPQIYWS 313 Query: 344 FSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDL 403 AA YDVLA WW V LYIG A YK+ + +P W G E Q+ L Sbjct: 314 IGFKAAAYDVLADWWGKEVNNRPVHLYIGQAAYKIN--NNADPAWADPG---EYGGQITL 368 Query: 404 NDAVPEISGTILFREDYLNK 423 N I G++ F LN+ Sbjct: 369 NRGSAWIKGSLHFSLKDLNR 388 >UniRef50_D2AWT1 FenI protein n=1 Tax=Streptosporangium roseum DSM 43021 RepID=D2AWT1_STRRD Length = 533 Score = 301 bits (771), Expect = 3e-80, Method: Compositional matrix adjust. Identities = 160/367 (43%), Positives = 215/367 (58%), Gaps = 23/367 (6%) Query: 56 MRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGT 115 +RG+W+ATV +DWP + ++ + QQ + LD+ + +N VF QV+P Sbjct: 70 LRGVWIATVKNIDWPSRTGLSAAK-------QQAEYVRILDNAVKRRLNAVFVQVRPASD 122 Query: 116 ALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIR 175 AL+ S + PWS +TG G++PG+DPL F++ EAHKRG++ HAWFNPYR S + G + Sbjct: 123 ALYKSSLEPWSKFLTGTAGKDPGWDPLPFLVAEAHKRGLEFHAWFNPYRASYD---GDVS 179 Query: 176 ELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFD 235 +L + PA V H DWI +PG+P V+D +TS++ +VV RY VDGV FD Sbjct: 180 KLPA---DHPARV---HPDWIVKHEGLVYYNPGLPAVRDHVTSVITDVVKRYDVDGVHFD 233 Query: 236 DYFYTESPGS-RLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVS 294 DYFY GS + D +RKYG ADWRR+N +LIA+V + K V+FG+S Sbjct: 234 DYFYPYPGGSAQFADGAAFRKYGKG-EKLADWRRSNVDKLIAQVDEAVHGTKQHVKFGIS 292 Query: 295 PAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVL 354 P G+WRN++ DP GS T G +AYD YAD R W+ +G +DY+APQ+YWP AA YDVL Sbjct: 293 PFGIWRNKAQDPTGSATAGMSAYDSIYADARHWIRKGTVDYVAPQLYWPSGFKAADYDVL 352 Query: 355 AKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTI 414 WWA VK T LYIG A Y+VG S P W G EL L N ++ G + Sbjct: 353 MPWWAKEVKGTDVHLYIGQALYRVG--STDTPAWTRPG---ELPSHLTKNRKHKQVKGDV 407 Query: 415 LFREDYL 421 F L Sbjct: 408 YFNAKQL 414 >UniRef50_Q47Q17 FenI protein n=9 Tax=Bacteria RepID=Q47Q17_THEFY Length = 540 Score = 298 bits (764), Expect = 2e-79, Method: Compositional matrix adjust. Identities = 169/391 (43%), Positives = 222/391 (56%), Gaps = 33/391 (8%) Query: 33 PESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMI 92 PE T PA K + MRG+WL TV +DWP S P + QQ+ + Sbjct: 57 PEDCATDPAYPK---------RQMRGVWLTTVRNIDWP-------SEPGLSPQQQQEELT 100 Query: 93 DKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKR 152 LD LG+N VFF ++P A++ S PW+ +TG G +PGYDPL+F + EAH R Sbjct: 101 AFLDRAVELGLNAVFFHIRPTADAVYASDKEPWARYLTGTQGGDPGYDPLEFAVAEAHTR 160 Query: 153 GMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEV 212 G+++HAWFNPYRV +L P ++ +W+ D+ LDPG PEV Sbjct: 161 GLELHAWFNPYRVGWREA-----DLEHLADDHPVR---RNPEWMIVYDDQGYLDPGNPEV 212 Query: 213 QDWITSIVAEVVSRYPVDGVQFDDYFY-TESPGSRLNDNETYRKYGGAFASKADWRRNNT 271 ++W+ +VA+VV RY VDGV FDDYFY + G +D+ +++ +G F + WRR+N Sbjct: 213 REWVVDVVADVVERYDVDGVHFDDYFYPYPASGETFDDDASWQAHGDGFPDRDAWRRDNV 272 Query: 272 QQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQG 331 QLI +V + IKP V FGVSP G+WRNRS DP GS T G +YD +ADTR W+ +G Sbjct: 273 NQLIRQVHERVHDIKPWVRFGVSPFGIWRNRSSDPSGSATSGLQSYDALHADTRTWIREG 332 Query: 332 LLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMIN 391 +DY+ PQ+YWP +AA Y VLA WWA+ V T LYIG A Y+VGE W Sbjct: 333 WIDYVVPQLYWPQGFAAADYAVLAPWWAEEVAGTGVDLYIGQAAYRVGEDG-----WK-- 385 Query: 392 GGVPELKKQLDLNDAVPEISGTILFREDYLN 422 G L KQLD N PEI+G I F LN Sbjct: 386 -GADALAKQLDFNTQHPEITGDIYFSMKDLN 415 >UniRef50_D1S7M0 Putative uncharacterized protein n=1 Tax=Micromonospora aurantiaca ATCC 27029 RepID=D1S7M0_9ACTO Length = 555 Score = 296 bits (757), Expect = 1e-78, Method: Compositional matrix adjust. Identities = 162/399 (40%), Positives = 231/399 (57%), Gaps = 27/399 (6%) Query: 26 CSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRAR 85 ++P VT PA K + R +W+A+V+ +DWP S + ++ + Sbjct: 33 TGTATSPSTDCVTDPATPK---------RQFRAMWIASVTNIDWPSKGSWTAPDQVAKQK 83 Query: 86 VQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFM 145 + A LD Q+L N V QV+P A WPS PWS+ +TG G+NPG+DPL F+ Sbjct: 84 AEYLAW---LDLAQKLNHNAVVVQVRPTADAFWPSPYEPWSEYLTGVRGKNPGWDPLDFL 140 Query: 146 LDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWI------RTS 199 + E+HKR ++ HAWFNPYRVS+ G +L+ PA QH DW+ + Sbjct: 141 VAESHKRNLEFHAWFNPYRVSMPAPGGAGADLSQLAPDSPAR---QHPDWVFAYPPAGVA 197 Query: 200 GDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGS-RLNDNETYRKYGG 258 G R +PG+PEV++++ + + + V RY +DGV FDDYFY G+ ++ D+ T+ Y Sbjct: 198 GSRLYYNPGVPEVREFVQTAMMDAVKRYDIDGVHFDDYFYPYPSGTHQVPDDATFAAYNR 257 Query: 259 AFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYD 318 F KADWRR+N LI +++ IK++KP V+FGVSP G+WRN S DP GSDT G+ +YD Sbjct: 258 GFTDKADWRRDNINLLIQEMNAKIKAVKPYVKFGVSPFGIWRNASADPNGSDTTGSQSYD 317 Query: 319 ESYADTRRWVEQGLLDYIAPQIYWPFSR-SAARYDVLAKWWADVVKPTRTRLYIGIAFYK 377 AD+R+WV++ +DYI PQ+YW + AA Y L WWA+ V+ TR +LYIG A YK Sbjct: 318 IISADSRKWVKEEWIDYIVPQLYWYIGQYPAADYARLVPWWAEQVRGTRVQLYIGQADYK 377 Query: 378 VGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILF 416 G+P+ WM EL L LN + PE+ G + F Sbjct: 378 SGDPA-YGSFWM---NPQELSNHLTLNRSYPEVLGNVHF 412 >UniRef50_Q81DH4 FenI n=65 Tax=Bacteria RepID=Q81DH4_BACCR Length = 519 Score = 294 bits (752), Expect = 5e-78, Method: Compositional matrix adjust. Identities = 160/412 (38%), Positives = 224/412 (54%), Gaps = 25/412 (6%) Query: 17 ILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVN 76 I+ L ++ C P S + PP + T +R +W+A+V +DWP + + Sbjct: 2 IVKRLLMICCIVILFIPFSFI-PPHFTYAEVNTTYKKHELRAVWIASVLNIDWPSKTGLP 60 Query: 77 ISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGEN 136 I Q+Q I LD ++ G+N V Q+KP A +PS PWS+ +TG G++ Sbjct: 61 IEK-------QKQEFIRLLDDVKSTGMNAVVVQIKPTADAFYPSNYGPWSEYITGTQGKD 113 Query: 137 PGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWI 196 PGYDPL FM++E HKR ++ HAW NPYR+++N ++N + PA QH DW+ Sbjct: 114 PGYDPLAFMIEETHKRNIEFHAWINPYRITMNHT-----DINRLSNNHPAR---QHPDWV 165 Query: 197 RTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFY-TESPGSRLNDNETYRK 255 T G + +PGIPEV+ +IT E+V Y +D + DDYFY + G D +TY Sbjct: 166 VTYGGKLYYNPGIPEVKKFITEGALEIVENYDIDALHMDDYFYPYKVAGEEFPDQKTYET 225 Query: 256 Y-GGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTR-G 313 Y G F + DWRRNN +L+ ++ IK K V+FG+SP GVWRN + DP GS+T G Sbjct: 226 YNNGRFTNIEDWRRNNVNELVKDLNTAIKQEKSYVKFGISPFGVWRNIADDPTGSNTTAG 285 Query: 314 AAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGI 373 YD+ YADTR W+++G +DYI PQIYW + A YD+L WW LYIG Sbjct: 286 QRNYDDLYADTREWIQKGYIDYITPQIYWNIGFTPAAYDILVDWWVKETNNKPLHLYIGQ 345 Query: 374 AFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFR-EDYLNKP 424 A YK+ S P W E KQ+ LN P+I G++ F +D N P Sbjct: 346 AAYKINNNSV--PAW---SDPEEYPKQIALNRLYPDIKGSMHFSLKDINNNP 392 >UniRef50_B8I4Q9 Putative uncharacterized protein n=2 Tax=Bacteria RepID=B8I4Q9_CLOCE Length = 997 Score = 293 bits (751), Expect = 6e-78, Method: Compositional matrix adjust. Identities = 146/381 (38%), Positives = 224/381 (58%), Gaps = 19/381 (4%) Query: 42 GSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRL 101 G+ A T + +RG+W+A+VS +D+P S P A Q++ + D + + Q + Sbjct: 33 GNVSNAQTVSKIEDLRGVWIASVSNIDFP-------SKPGISAEKQKKELDDIISNAQYM 85 Query: 102 GINTVFFQVKPDGTALWPSKILPWSDLMTGKIGE--NPGYDPLQFMLDEAHKRGMKVHAW 159 G+N +FFQ++P G AL+ S I PWS +TGK G+ + G+DPL +++++AHK+G+++HAW Sbjct: 86 GLNAIFFQIRPTGDALYKSTIFPWSAYLTGKQGKENDNGFDPLAYIIEQAHKKGIQIHAW 145 Query: 160 FNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSI 219 NP R+S+ T +N PA + + + LDPG P IT Sbjct: 146 INPLRLSMGTTSNPTGNINVLSDNHPARKIPEAV--VAAPTGQLYLDPGNPAAIKLITDG 203 Query: 220 VAEVVSRYPVDGVQFDDYFY---TESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIA 276 VAE+V Y VDG+ FDDYFY +E G ND+ +Y KY G+F +K DWRRNN L+ Sbjct: 204 VAEIVKNYDVDGIHFDDYFYPSKSEGKGVDFNDSASYAKYKGSFKNKDDWRRNNINTLVK 263 Query: 277 KVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGA-AAYDESYADTRRWVEQGLLDY 335 +T+K+IKP V+FG+SP +W N+ + GSDT+G + Y + YAD+++WV++ +DY Sbjct: 264 STYNTVKNIKPSVQFGISPFAIWSNKDRNKEGSDTQGGISTYYDHYADSKKWVKEAYIDY 323 Query: 336 IAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVP 395 IAPQIYW A Y VL WW +V + T+ +LY+G A YK+ + ++ DW+ +P Sbjct: 324 IAPQIYWNIGFKVADYSVLVNWWKNVCRGTKVKLYVGHAAYKINDTTQ-SNDWLDPLQIP 382 Query: 396 ELKKQLDLNDAVPEISGTILF 416 KQ+ N + G+I + Sbjct: 383 ---KQIAYNRKSNSVDGSIFY 400 >UniRef50_C4RBZ7 FenI protein n=10 Tax=Actinomycetales RepID=C4RBZ7_9ACTO Length = 538 Score = 291 bits (746), Expect = 3e-77, Method: Compositional matrix adjust. Identities = 158/369 (42%), Positives = 221/369 (59%), Gaps = 18/369 (4%) Query: 56 MRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGT 115 R +W+++V +DWP +S + R Q+ + LD QRL N V QV+P Sbjct: 37 FRAMWISSVVNIDWPTKASQTAPD---RIAAQRAEYLGWLDLAQRLHHNAVVVQVRPTAD 93 Query: 116 ALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIR 175 ALWPS PWS+ +TG G++PG+DPL F++DEAHKR ++ HAWFNPYR+S+ G Sbjct: 94 ALWPSPHEPWSEYLTGVRGQDPGWDPLAFLVDEAHKRNLEFHAWFNPYRISMPAPGGAGA 153 Query: 176 ELNSTLSQQPASVYVQHRDWI------RTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPV 229 +L PA QH +W +G R +PGIP V++++ + + + V+RY V Sbjct: 154 DLAQLAPDHPAR---QHPEWTFAYPPAGVAGSRLYYNPGIPAVREFVQTAMMDAVTRYDV 210 Query: 230 DGVQFDDYFYTESPGS-RLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPG 288 DGV FDDYFY G+ ++ D+ T+ ++ F +ADWRR+N LI +++ IK+ KP Sbjct: 211 DGVHFDDYFYPYPSGTYQVPDDATFAEFNRGFTDRADWRRDNINLLIREMNDRIKAAKPW 270 Query: 289 VEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSR-S 347 V+FGVSP G+WRN S DPLGSDT G+ +YD ADTR+WV+Q +DYI PQ+YW + Sbjct: 271 VKFGVSPFGIWRNASVDPLGSDTTGSQSYDIISADTRKWVKQEWIDYIVPQLYWYIGQYP 330 Query: 348 AARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAV 407 AA Y L WWA+ V+ TR +LYIG A YK G+P+ W EL L LN + Sbjct: 331 AADYARLVPWWAETVRGTRVQLYIGQADYKSGDPA-YGSYWQ---NPRELSDHLTLNRSY 386 Query: 408 PEISGTILF 416 PE+ G + F Sbjct: 387 PEVLGNVHF 395 >UniRef50_D2AR89 FenI protein n=9 Tax=Bacteria RepID=D2AR89_STRRD Length = 520 Score = 291 bits (745), Expect = 3e-77, Method: Compositional matrix adjust. Identities = 149/365 (40%), Positives = 216/365 (59%), Gaps = 29/365 (7%) Query: 56 MRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGT 115 MRG+W+A+V ++WP S P A Q+ + LD Q +N VF Q++P Sbjct: 47 MRGMWIASVVNINWP-------SKPGLTADQQKAEYLAWLDLAQVRKLNAVFVQIRPTAD 99 Query: 116 ALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIR 175 A WPS PWS +TG G++PGYDPL F+++E HKRG+ HAWFNPYRVS+ P + Sbjct: 100 AFWPSPFEPWSQYLTGTQGQDPGYDPLAFVVEETHKRGLAFHAWFNPYRVSMQPDPSKL- 158 Query: 176 ELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFD 235 P +H DWI G + +PG+PEV+ ++ + + V++Y +DG+ FD Sbjct: 159 --------HPDHPGTKHPDWIVPYGGKLYYNPGMPEVRAFVQDAMMDAVTKYDIDGLHFD 210 Query: 236 DYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSP 295 DYFY + + +D+ + KYG F A WRRNN L+ ++ ++ KP + +G+SP Sbjct: 211 DYFYPVN-TTAFDDSAAFAKYGQGFPDLAAWRRNNVDLLVQEMQQRVRQAKPEIAWGISP 269 Query: 296 AGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLA 355 +G+WRN++ DPLGSDT G+ +YD +ADTR WV++G LDYIAPQ+YW +S A Y L Sbjct: 270 SGIWRNKTTDPLGSDTGGSQSYDNLHADTRGWVKKGWLDYIAPQLYWYIGQSNADYAKLV 329 Query: 356 KWWADVVKPTRTRLYIGIAFYK---VGEPSK-IEPDWMINGGVPELKKQLDLNDAVPEIS 411 WW+DV T T+L+IG A YK G+P++ +PD EL + L LN P++ Sbjct: 330 PWWSDVAAGTPTQLWIGQAAYKAGAAGQPAQWFQPD--------ELTRHLTLNRDHPQVG 381 Query: 412 GTILF 416 G I + Sbjct: 382 GDIWY 386 >UniRef50_B1HPQ3 Hypothetical lipoprotein yddW n=2 Tax=Bacillaceae RepID=B1HPQ3_LYSSC Length = 522 Score = 290 bits (742), Expect = 7e-77, Method: Compositional matrix adjust. Identities = 158/373 (42%), Positives = 217/373 (58%), Gaps = 33/373 (8%) Query: 18 LVAL-ALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVN 76 L+AL A++L C S P + V A+T Q + MR +W++TV LD + +N Sbjct: 10 LIALVAMILMLCLSAIPANTV--------KASTTQPKREMRAVWISTVLNLDMK--AGMN 59 Query: 77 ISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKI-GE 135 T AR LD L+ NTV +QV+P A++ S++ PWS +TGK G Sbjct: 60 KEQYTVWAR-------QTLDQLKANKFNTVIYQVRPTNDAMYASELAPWSSYITGKKQGT 112 Query: 136 NPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDW 195 NPGYDPL +++E+HKRGM++HAW NPYRV+++ + T P +V + H +W Sbjct: 113 NPGYDPLTILVEESHKRGMELHAWMNPYRVTMSGQKLT--------DLAPDNVAITHPNW 164 Query: 196 IRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLN-DNETYR 254 + G ++ L+PG+PEVQD++ IV E+V+ Y VD V DDYFY + + D Y+ Sbjct: 165 VVKYGKQYYLNPGLPEVQDYLVEIVRELVANYDVDAVHMDDYFYPYKIANEVFPDQAAYK 224 Query: 255 KYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTR-G 313 KYG +F DWRRNN +L+ + IK KP V+FG+SP GVWRN+S D GSDTR G Sbjct: 225 KYGASFNKVEDWRRNNVNRLVENLYTAIKETKPYVQFGISPFGVWRNKSLDKTGSDTRAG 284 Query: 314 AAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVV----KPTRTRL 369 YD+ YAD R W++ G +DYI PQIYW + S A+Y L WW+ V K L Sbjct: 285 VNNYDDLYADVRTWIQNGTIDYITPQIYWSRTLSVAKYGTLLDWWSHEVQTYAKMHPVHL 344 Query: 370 YIGIAFYKVGEPS 382 YIG+A YKVG S Sbjct: 345 YIGLADYKVGNDS 357 >UniRef50_UPI00016A6D2C fenI protein n=1 Tax=Burkholderia oklahomensis EO147 RepID=UPI00016A6D2C Length = 521 Score = 289 bits (740), Expect = 1e-76, Method: Compositional matrix adjust. Identities = 153/400 (38%), Positives = 216/400 (54%), Gaps = 23/400 (5%) Query: 20 ALALLLCSCKSTPPESMVTPPAGSKPPATTQQS-SQPMRGIWLATVSRLDWPPVSSVNIS 78 A+ L + + T M++ A + A ++ + + R W+A V +DWP + ++ Sbjct: 7 AIVLTISAGACTRSSDMISENAPNVACAVSRATPKRDFRAFWIAAVRNIDWPSREGLTVA 66 Query: 79 NPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPG 138 QQ+ + LD RL N V QV+P + WPS PWS+ +TG G +PG Sbjct: 67 E-------QQEELRKWLDLAVRLRYNAVILQVRPVSDSFWPSPFAPWSEFLTGTQGTDPG 119 Query: 139 YDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRT 198 YDPL F + EAH+R +++HAWFNPYR + NT +++ PA + HRDW+ + Sbjct: 120 YDPLAFAVAEAHRRNLELHAWFNPYRAARNT------QIDLLAPTHPARL---HRDWLVS 170 Query: 199 SGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTES-PGSRLNDNETYRKYG 257 ++ +PG+P ++ I + + V RY VDGV DD+FY G D TY +YG Sbjct: 171 YDNQLYFNPGVPAAREHIVDAIMDAVDRYDVDGVHLDDFFYPYPIAGETFADTATYMQYG 230 Query: 258 GAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTR-GAAA 316 F + ADWRR+N + +S IK++KP V+FG+SP VWRN S DP GS+T Sbjct: 231 AGFTTLADWRRHNVDVFVEMLSRRIKAVKPWVKFGISPFAVWRNASVDPQGSETSTDVQT 290 Query: 317 YDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFY 376 YD+ YADTRRW+ + +DYIAPQ+YW A YD + WW + V+ +R LYIG A Y Sbjct: 291 YDDQYADTRRWLRENWIDYIAPQVYWAQDFQRADYDKVVSWWVEQVRSSRAHLYIGQAAY 350 Query: 377 KVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILF 416 KVG S P W EL L N PE+ G I F Sbjct: 351 KVG-ISNQSPGW---ASPAELANHLAFNCKFPEVKGNIYF 386 >UniRef50_D1AYL2 Putative uncharacterized protein n=1 Tax=Streptobacillus moniliformis DSM 12112 RepID=D1AYL2_STRM9 Length = 437 Score = 286 bits (733), Expect = 7e-76, Method: Compositional matrix adjust. Identities = 141/373 (37%), Positives = 221/373 (59%), Gaps = 29/373 (7%) Query: 49 TQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFF 108 ++ ++ ++G+W ATV LD+P +S+ Q++ + + ++++++ G+N VFF Sbjct: 70 NRKINKNLKGVWAATVVNLDFPKTTSM---------EEQKREIDEMMENIKKWGLNAVFF 120 Query: 109 QVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVN 168 V+P AL+ S+ PWS +TG +PGYDPL++ + AHKRG+++HAW NPYR ++N Sbjct: 121 HVRPAADALYNSEFEPWSIYLTGTQNRHPGYDPLEYAIKAAHKRGIELHAWINPYRAAMN 180 Query: 169 TKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYP 228 T + + S + ++P +WI +F ++PG PEV ++++ + E+V +Y Sbjct: 181 TDLNKLSD-KSIVKRKP--------EWIFEYDGKFYMNPGNPEVVNYVSKAIEEIVEKYD 231 Query: 229 VDGVQFDDYFY-TESPGSRLNDN---ETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKS 284 +DG+ DDYFY S +L DN + + KYG + S+ DWRR+N +I +S ++ Sbjct: 232 IDGLHLDDYFYPYPSATLKLGDNVDQKEFEKYGSEYNSRGDWRRDNVNNMIKNLSVSVHK 291 Query: 285 IKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPF 344 IKP + FGVSP G+WRN D GS T+G +YD YAD+ +W+++G +DYIAPQIYW Sbjct: 292 IKPNLSFGVSPFGIWRNYETDARGSKTKGLQSYDSLYADSLKWMKEGWVDYIAPQIYWNI 351 Query: 345 SRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLN 404 A Y+ L KWWA+ K T T LY+G YK EP + EL+KQL LN Sbjct: 352 GFEKADYEELVKWWAEKSKETNTPLYVGHGVYKYIEPKPWKDS-------KELEKQLKLN 404 Query: 405 DAVPEISGTILFR 417 + + G+I FR Sbjct: 405 EKYDAVKGSIFFR 417 >UniRef50_D1A3Q7 Putative uncharacterized protein n=1 Tax=Thermomonospora curvata DSM 43183 RepID=D1A3Q7_THECD Length = 532 Score = 285 bits (728), Expect = 3e-75, Method: Compositional matrix adjust. Identities = 158/422 (37%), Positives = 229/422 (54%), Gaps = 40/422 (9%) Query: 7 NKKLTIRRPAILVALALLLCSCKST------------PPESMVTPPAGSKPPATTQQSSQ 54 K I+ + VA + LL C S P + + KPP + +++ Sbjct: 6 GSKERIKIASAAVAASGLLAGCTSAAGGEVGALRADAPIAAGMAECPDIKPPGDS--AAR 63 Query: 55 PMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDG 114 +RG+W+ATVS +DWP + ++ R + + + LD + LG+N VF QV+P Sbjct: 64 QVRGMWIATVSGIDWPS----DTAHSAERKKADYRKL---LDQARALGLNAVFVQVRPSA 116 Query: 115 TALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTI 174 A + S PWS ++G+ G +PG+D L+F + EAHKR ++ HAWFNPYRV+++ G + Sbjct: 117 DAFYDSPYEPWSQWISGEQGRDPGFDVLEFFVSEAHKRDLEFHAWFNPYRVALHNDRGKL 176 Query: 175 RELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQF 234 N PA ++ W+R + DPG+P+V++ +T +V +VV +Y +D V Sbjct: 177 HPDN------PAR---KNPSWVREYDGKLWYDPGLPQVRELVTKVVLDVVGKYDIDAVHL 227 Query: 235 DDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVS 294 DDYFY G D +TYR+YG SK DWRR N L+ + I KP V FG+S Sbjct: 228 DDYFYPYPSGGDFPDEDTYRRYGRGM-SKGDWRRANVDALVKGLHEEIHRAKPQVRFGIS 286 Query: 295 PAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVL 354 P GVWRNR DP GS T +YD+ YADTR+WV+QG +DYI PQ+YW +AA Y L Sbjct: 287 PFGVWRNRRSDPAGSQTTALQSYDDVYADTRKWVKQGWVDYITPQLYWEIGNAAADYSTL 346 Query: 355 AKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTI 414 WWA+ V+ T +L IG A Y+VGE + G EL + L +N ++ G + Sbjct: 347 VAWWAEQVEGTGVQLTIGQASYRVGERG-------FDAG--ELSRHLAVNARHRQVRGDV 397 Query: 415 LF 416 F Sbjct: 398 YF 399 >UniRef50_A1V3X0 FenI protein n=36 Tax=Bacteria RepID=A1V3X0_BURMS Length = 521 Score = 285 bits (728), Expect = 3e-75, Method: Compositional matrix adjust. Identities = 153/391 (39%), Positives = 213/391 (54%), Gaps = 27/391 (6%) Query: 28 CKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQ 87 C S+P P +P T + + RG W+A+V LDWP S P A Q Sbjct: 21 CASSP---QAVPEVACRPDETMPK--RQFRGTWIASVINLDWP-------SRPGLPAAAQ 68 Query: 88 QQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLD 147 Q + LD R+ N V QV+P A WPS PWS +TG G +PGYDPL F + Sbjct: 69 QAELSAWLDDAVRMNRNAVILQVRPTADAFWPSPFEPWSKYLTGAQGGDPGYDPLAFAVA 128 Query: 148 EAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDP 207 EAH+R +++HAWFNPYRV+++ + L++ ++ PA H DW+ G + +P Sbjct: 129 EAHRRNLELHAWFNPYRVAMDDR------LDALVATHPARA---HPDWVVRYGGKLYYNP 179 Query: 208 GIPEVQDWITSIVAEVVSRYPVDGVQFDDYFY-TESPGSRLNDNETYRKYGGAFASKADW 266 G+P + ++ + + V+RY +D V DDYFY G+ +D Y +YG FA+ ADW Sbjct: 180 GVPAARAFVVDAIMDAVARYDIDAVHLDDYFYPYPVAGATFDDASAYAQYGAGFATLADW 239 Query: 267 RRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAA-AYDESYADTR 325 RR+N +L+ ++ IK+ KP V+FG+SP VWRN + DP GS T + YD+ YADTR Sbjct: 240 RRDNVDRLVESLARRIKAAKPWVKFGISPFAVWRNAATDPQGSRTSASVQTYDDLYADTR 299 Query: 326 RWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIE 385 RWV + +DY+ PQ YW + A YD + WWA+ V+ LYIG A YKVG S Sbjct: 300 RWVRERWIDYVVPQAYWARGFAPADYDEVVAWWANEVRGRDAHLYIGQAAYKVGT-SNQS 358 Query: 386 PDWMINGGVPELKKQLDLNDAVPEISGTILF 416 P W EL + L N PE+ G + F Sbjct: 359 PGW---SDPDELSRHLAFNLTAPEVKGDVYF 386 >UniRef50_A8MM80 Putative uncharacterized protein n=1 Tax=Alkaliphilus oremlandii OhILAs RepID=A8MM80_ALKOO Length = 476 Score = 282 bits (721), Expect = 2e-74, Method: Compositional matrix adjust. Identities = 151/366 (41%), Positives = 207/366 (56%), Gaps = 27/366 (7%) Query: 56 MRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGT 115 +R W++TV LDWP + + + Q+ LD L+ G+N V Q+KP Sbjct: 113 LRATWISTVYNLDWPSKKGLAVED-------QKSEFTALLDGLKSAGLNAVMVQIKPSAD 165 Query: 116 ALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIR 175 + +PS+ PWS+ +TG G++PGY+PL FM++E HKR M+ HAWFNPYRVSV Sbjct: 166 SFYPSQYGPWSEYLTGVQGKDPGYNPLAFMIEETHKRNMEFHAWFNPYRVSVK------E 219 Query: 176 ELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFD 235 + N+ PA ++ DW+ + G + +PGIP VQ ++ + EVV Y +DGV D Sbjct: 220 DRNALAEGHPAK---KNPDWVVSYGGKLFYNPGIPAVQQFVIDSILEVVKNYNIDGVHLD 276 Query: 236 DYF--YTESPGSRLNDNETYRKY-GGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFG 292 DYF Y E G D E Y+ Y A +K WRRNN I + +IK K V G Sbjct: 277 DYFYPYPEKEGD-FPDEELYQSYRRTASETKEQWRRNNINDFIQNLYQSIKREKSTVVLG 335 Query: 293 VSPAGVWRNRSHDPLGSDTRGAA-AYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARY 351 VSP G+WRN++ DP GS+TRG +YD YADT+ W+E G LDYIAPQ+YW A Y Sbjct: 336 VSPFGIWRNKADDPKGSNTRGGVTSYDSLYADTKYWIENGWLDYIAPQVYWHIGYDRAEY 395 Query: 352 DVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEIS 411 L WW++VV+ + LYIG A YKV + P W G E+ Q++ N +PE+ Sbjct: 396 KELINWWSNVVQNKKVELYIGQAAYKV--EAGTTP-W---GNPLEILDQIEYNRMIPEVK 449 Query: 412 GTILFR 417 G+I FR Sbjct: 450 GSIFFR 455 >UniRef50_D0GIS1 YngK n=16 Tax=Bacteria RepID=D0GIS1_9FUSO Length = 330 Score = 277 bits (709), Expect = 5e-73, Method: Compositional matrix adjust. Identities = 148/325 (45%), Positives = 201/325 (61%), Gaps = 22/325 (6%) Query: 95 LDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGM 154 L+++++ +N VF Q+KP G A +PSK PWS+ +TG GENPGYDPL+FM++EAHKR + Sbjct: 2 LENVKKWNMNAVFVQIKPVGDAFYPSKYAPWSEYLTGVQGENPGYDPLKFMIEEAHKRNI 61 Query: 155 KVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQD 214 + HAWFNPYR+++ G RE LS+ ++ + +W G + L+PGIPEV D Sbjct: 62 EFHAWFNPYRLTM----GGGRE---KLSRD--NIGNKRPEWTVMYGGKLYLNPGIPEVND 112 Query: 215 WITSIVAEVVSRYPVDGVQFDDYFY-TESPGSRLNDNETYRKYGGAFASKADWRRNNTQQ 273 ++ + EVV +Y VDGV DDYFY + G D++ YRKYGG F++ DWRRNN + Sbjct: 113 YVVDSIVEVVKKYDVDGVHMDDYFYPYKVKGQEYPDSQQYRKYGGKFSNIGDWRRNNINK 172 Query: 274 LIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPL-GSDTR-GAAAYDESYADTRRWVEQG 331 LI K+ ++IK V FG+SP GVWRN S DP+ GS T+ G YD+ YAD W+++ Sbjct: 173 LIEKLHNSIKKENKNVSFGISPFGVWRNASTDPVRGSQTQAGVQNYDDLYADILYWMDKH 232 Query: 332 LLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMIN 391 +DY+APQIYW A Y L WW+ T T LYIG A YKV + S P+ Sbjct: 233 WIDYVAPQIYWVRGFKVADYSTLINWWSKYAGKTNTDLYIGHAAYKVNDWS--NPN---- 286 Query: 392 GGVPELKKQLDLNDAVPEISGTILF 416 EL +Q+ LN PEI G+I F Sbjct: 287 ----ELVEQVKLNRKYPEIKGSIFF 307 >UniRef50_C7IM14 Putative uncharacterized protein n=1 Tax=Clostridium papyrosolvens DSM 2782 RepID=C7IM14_9CLOT Length = 999 Score = 274 bits (700), Expect = 6e-72, Method: Compositional matrix adjust. Identities = 140/382 (36%), Positives = 223/382 (58%), Gaps = 21/382 (5%) Query: 42 GSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRL 101 G+ A T ++ +RG+W+A+V+ +D+P S P A Q++ + + + + + + Sbjct: 33 GNISNAQTVSKNEDLRGVWIASVANIDFP-------SKPGISADKQKKELDEIISNTKYM 85 Query: 102 GINTVFFQVKPDGTALWPSKILPWSDLMTGKIGE--NPGYDPLQFMLDEAHKRGMKVHAW 159 G+N +FFQV+P G AL+ S I PWS +TG+ G+ + G+DPL +++ +AHK G++VHAW Sbjct: 86 GLNAIFFQVRPTGDALYKSSIFPWSKYLTGQQGKENDGGFDPLAYIIKQAHKEGIQVHAW 145 Query: 160 FNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTS-GDRFVLDPGIPEVQDWITS 218 NP R+++ T + ++ + PA + D + + + LDPG P IT Sbjct: 146 LNPLRLTMGTTAKPDKNVSVLSANHPAR---KIPDAVVAAPTGQLYLDPGNPAAIKLITD 202 Query: 219 IVAEVVSRYPVDGVQFDDYFY---TESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLI 275 VAE+V Y VDG+ FDDYFY +E+ G ND+ ++ KY G F +K DWRRNN L+ Sbjct: 203 GVAEIVKNYDVDGIHFDDYFYPSKSETKGVDFNDSASFAKYKGNFKNKDDWRRNNINTLV 262 Query: 276 AKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGA-AAYDESYADTRRWVEQGLLD 334 T+K+IK V+FG+SP +W N+ + GS T+G + Y + YAD+++WV + +D Sbjct: 263 KNTYDTVKNIKNKVQFGISPFAIWSNKDRNIEGSSTQGGISTYYDHYADSKKWVREAYID 322 Query: 335 YIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGV 394 YIAPQIYW A Y VL WW +V T+ +LY+G A YK+ + ++ DW+ + Sbjct: 323 YIAPQIYWNMGFKIADYSVLVNWWKNVCSGTKVKLYVGHAAYKINDTTQ-SNDWLDPLQI 381 Query: 395 PELKKQLDLNDAVPEISGTILF 416 P KQ+ N ++G+I + Sbjct: 382 P---KQIAYNRKSNAVAGSIFY 400 >UniRef50_A5FI17 Putative uncharacterized protein n=1 Tax=Flavobacterium johnsoniae UW101 RepID=A5FI17_FLAJ1 Length = 523 Score = 271 bits (692), Expect = 5e-71, Method: Compositional matrix adjust. Identities = 141/365 (38%), Positives = 207/365 (56%), Gaps = 29/365 (7%) Query: 56 MRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGT 115 RG+W+ATV +DWP + N+ ++ ++ L+ ++L N V Q++ G Sbjct: 34 FRGVWIATVVNIDWPKTAIDNVEK-------EKADYLEILNTYKKLNYNAVIVQIRSVGD 86 Query: 116 ALWPSKILPWSDLMTGKIGE--NPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGT 173 A +PS+ PWS +TGK G NP YD L++M++EAH RG + HAW NPYR + Sbjct: 87 AFYPSEFAPWSRFLTGKEGTAPNPYYDALEWMIEEAHNRGFEFHAWLNPYRATF------ 140 Query: 174 IRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQ 233 +LN L ++ +H +W+ G ++ DP +PEVQ +T +V EVV +Y +D + Sbjct: 141 --DLNKNLLSPNHDIF-KHPEWMIEYGGKYYYDPALPEVQTHLTKVVKEVVDKYDIDAIH 197 Query: 234 FDDYFYTES-PGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFG 292 FDDYFY + PG ND +Y+KYG S ADWRR N + +S TIK+ KP V+FG Sbjct: 198 FDDYFYPYAVPGKVFNDTASYKKYGSGL-SLADWRRANVSNFVHTISTTIKASKPWVQFG 256 Query: 293 VSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYD 352 +SP GVWRN+S DP GS+T+ + YD+ YAD W++Q +DYI PQ+YW + A Y Sbjct: 257 ISPFGVWRNKSQDPKGSETQSTSNYDDLYADPVLWMDQKWIDYIMPQLYWSMNNPRASYS 316 Query: 353 VLAKWWADVVKPTRTRLYIGIAFYKV-GEPSKIEPDWMINGGVPELKKQLDLNDAVPEIS 411 L KWW++ T +YIG A YK+ G+ K W +P Q+D + ++ Sbjct: 317 KLVKWWSE--NANNTAIYIGHASYKIRGDGDK---SWYFATEIPT---QVDFARSFKNVN 368 Query: 412 GTILF 416 G+ F Sbjct: 369 GSAYF 373 >UniRef50_C5PKN1 Possible FenI n=1 Tax=Sphingobacterium spiritivorum ATCC 33861 RepID=C5PKN1_9SPHI Length = 508 Score = 270 bits (691), Expect = 7e-71, Method: Compositional matrix adjust. Identities = 142/376 (37%), Positives = 218/376 (57%), Gaps = 31/376 (8%) Query: 47 ATTQQS-SQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINT 105 A +Q S + +RG+W+ATV+ +DWP S + Q+Q +I+ LD QR G+N Sbjct: 21 AISQNSPKRELRGVWIATVANIDWP-------SRDNESSERQKQELINILDAHQRAGLNA 73 Query: 106 VFFQVKPDGTALWPSKILPWSDLMTGKIGE--NPGYDPLQFMLDEAHKRGMKVHAWFNPY 163 +FFQ++P A + PWS +TG G+ +P YDPL+F+++EAHKRGM++HAW NPY Sbjct: 74 IFFQIRPAADAFYAKGREPWSRYLTGVQGKAPSPFYDPLEFVIEEAHKRGMELHAWVNPY 133 Query: 164 RVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEV 223 R S P + + T ++ +W G +++ +PG+PEV+ +I ++ +V Sbjct: 134 RASTTLNPAHFSKDHITRTKP---------EWFFKYGGKYLFNPGLPEVRQYIIDVIMDV 184 Query: 224 VSRYPVDGVQFDDYFYT--ESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHT 281 V Y VDG+ FDDYFY ++ + L D T+ ++G FA+ DWRRNN LI + Sbjct: 185 VKNYDVDGIHFDDYFYPYPDARNTALPDAPTFHQFGKGFANIHDWRRNNVDLLIRDLGIA 244 Query: 282 IKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIY 341 IK KP +++G+SP G+W N+ +P GS+T G + Y YAD +W+++G +DYI PQIY Sbjct: 245 IKKEKPFIKYGISPFGIWDNKRDNPDGSNTSGLSGYRTLYADGVKWMKEGWIDYINPQIY 304 Query: 342 WPFSRSAARYDVLAKWWADVVKPTRTR-LYIGIAFYKVGEPSKIEPDWMINGGVPELKKQ 400 +PF+ AA +++L +WW K T R Y+G Y+V E P W G +P+ + Sbjct: 305 FPFNNRAAAFEILLEWWE---KHTYGRHFYVGHGAYRVTEK---RPGWTDKGQIPKQVRH 358 Query: 401 LDLNDAVPEISGTILF 416 L E+ G+I F Sbjct: 359 LRDQH---EVQGSIYF 371 >UniRef50_C9L341 YngK protein n=45 Tax=Bacteroidales RepID=C9L341_9BACE Length = 528 Score = 267 bits (683), Expect = 6e-70, Method: Compositional matrix adjust. Identities = 137/323 (42%), Positives = 191/323 (59%), Gaps = 30/323 (9%) Query: 40 PAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQ 99 P+G+K P + RG W+ V+ PT + + Q +ID+L+ LQ Sbjct: 49 PSGNKYP------KREFRGAWIQAVN--------GQFKGIPTGKLK---QTLIDQLNSLQ 91 Query: 100 RLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGE--NPGYDPLQFMLDEAHKRGMKVH 157 GIN + FQV+P+ AL+ S+ PWS +TG G+ +P +DP+QFM++E KR M+ H Sbjct: 92 GAGINAIIFQVRPEADALYASQHEPWSRFLTGTQGQIPSPMWDPMQFMIEECRKRNMEFH 151 Query: 158 AWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWIT 217 AW NPYRV + L + L+ P +Y QH +W T GD+ DP +PE +D+I Sbjct: 152 AWINPYRVKTS--------LKNQLA--PEHIYHQHPEWFVTYGDQLYFDPALPESRDYIC 201 Query: 218 SIVAEVVSRYPVDGVQFDDYFY-TESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIA 276 IV ++VSRY VD + DDYFY G D+ ++ +YGG F +KADWRR+N LI Sbjct: 202 KIVTDIVSRYDVDAIHMDDYFYPYPVKGMDFPDDASFARYGGGFTNKADWRRSNVNVLIK 261 Query: 277 KVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYI 336 K+ TI+++KP V+FG+SP G++RN+ DPLGSDT G YD+ YAD W +G +DY Sbjct: 262 KLHETIRAVKPWVKFGISPFGIYRNQKSDPLGSDTNGLQNYDDLYADVLLWAREGWIDYN 321 Query: 337 APQIYWPFSRSAARYDVLAKWWA 359 PQIYW AA Y+ L KWWA Sbjct: 322 IPQIYWEIGHKAADYETLVKWWA 344 >UniRef50_A6NVH8 Putative uncharacterized protein n=1 Tax=Bacteroides capillosus ATCC 29799 RepID=A6NVH8_9BACE Length = 606 Score = 266 bits (681), Expect = 1e-69, Method: Compositional matrix adjust. Identities = 141/382 (36%), Positives = 221/382 (57%), Gaps = 31/382 (8%) Query: 49 TQQSSQP------MRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLG 102 QQ++ P R +W+ATV LD+P ++ + ++A + L++ +G Sbjct: 34 AQQANAPSAARDDFRAVWVATVYNLDYPNAATTDADALKAQAD-------EILENCVDMG 86 Query: 103 INTVFFQVKPDGTALWPSKILPWSDLMTGKIGENP--GYDPLQFMLDEAHKRGMKVHAWF 160 +N V QV+P G AL+PS++ PWS +TG G P +DPL + ++ AH+ G+++HAW Sbjct: 87 MNAVILQVRPSGDALYPSELFPWSKYLTGASGLAPEDNFDPLAYWVERAHELGLELHAWI 146 Query: 161 NPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIV 220 NP+R++ G EL + ++ PA VQH +W+ + L+PG+PEV++ + Sbjct: 147 NPFRIT----KGGEAELAALDAKSPA---VQHPEWVVECDGNYYLNPGLPEVRELVIQGA 199 Query: 221 AEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSH 280 E+V Y VDGV DDYFY P ND+ +++YGG F + DWRR+N QLI + Sbjct: 200 EELVRNYDVDGVHLDDYFY---PSRSFNDDAAFQQYGGDFDNIGDWRRDNVNQLIQGLDQ 256 Query: 281 TIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGA-AAYDESYADTRRWVEQGLLDYIAPQ 339 + ++ P + FGVSP+GVW + +H GS T G +Y +YAD+R+WV++G +DYI PQ Sbjct: 257 RLHALDPELSFGVSPSGVWADSTHQSAGSATTGNYESYYAAYADSRKWVKEGWVDYICPQ 316 Query: 340 IYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKK 399 IYW Y+ +A+WW+D V+ T +LYIG+A Y + ++ P W G+ + K Sbjct: 317 IYWYIGHPTMDYETIARWWSDTVEGTGVKLYIGMADYLADDGTEGSP-W---NGLDAITK 372 Query: 400 QLDLNDAVPEISGTILFREDYL 421 QL LN + +SG + FR +L Sbjct: 373 QLTLNRELG-VSGEVHFRYKFL 393 >UniRef50_A3HZ09 FenI n=1 Tax=Algoriphagus sp. PR1 RepID=A3HZ09_9SPHI Length = 543 Score = 264 bits (674), Expect = 5e-69, Method: Compositional matrix adjust. Identities = 154/438 (35%), Positives = 220/438 (50%), Gaps = 53/438 (12%) Query: 9 KLTIRRPAILVALALLLCSCKSTP---PESMVTPPAGSKP-------------------- 45 K R I++A LLL +CKS+ P T P + P Sbjct: 2 KFLSRHLFIILAFGLLLSACKSSKNVTPGQQPTAPIQTSPNTDSGTNLPVKTLPKTPIAL 61 Query: 46 -PATTQQSSQP--MRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLG 102 P + Q P RG+W+ATV+ +DWP +P Q++ ++ LD+ + L Sbjct: 62 APLSYQMPEMPREFRGVWIATVANIDWP-------ISPDDPYEKQKRDFLEILDYYKSLN 114 Query: 103 INTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYD--PLQFMLDEAHKRGMKVHAWF 160 N V QV+ G A +PS + PWS +TGK G+ P + PL +M+ E+H RGM+ HAW Sbjct: 115 FNAVIVQVRTAGDAFFPSNLAPWSKYLTGKQGKAPNTNENPLTWMIHESHARGMEFHAWL 174 Query: 161 NPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIV 220 NPYR +++ K T P Y HR+W+ G ++ +PG+PEVQ + ++ Sbjct: 175 NPYRATMDLK---------TDELSPDHDYNAHRNWMVKYGTKYYYNPGLPEVQTHLLKVI 225 Query: 221 AEVVSRYPVDGVQFDDYFYTESPG-SRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVS 279 E+V Y VD + FDDYFY D TY KY + ++ DWRR+N QLI ++ Sbjct: 226 KEIVDNYDVDAIHFDDYFYPYKIAREEFPDRNTYNKYKKSGQTQDDWRRDNVNQLIFALN 285 Query: 280 HTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTR-GAAAYDESYADTRRWVEQGLLDYIAP 338 +TIK KP V+FG+SP GVWRN+ DP GS T+ G YD+ YAD W++ G +DY+ P Sbjct: 286 NTIKQSKPWVQFGISPFGVWRNQDKDPKGSPTQAGQTNYDDLYADVLLWMKNGWVDYMIP 345 Query: 339 QIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELK 398 Q+YW A + +L WWA T +YIG YK+ E S + W E+ Sbjct: 346 QLYWSMEHPLASHRILNDWWA--TNHNYTNIYIGNGPYKIREDS--DKAW---ENPKEIN 398 Query: 399 KQLDLNDAVPEISGTILF 416 Q+ +P I G F Sbjct: 399 NQISYTRTLPTIQGNAFF 416 >UniRef50_A1ZQ43 YngK protein n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZQ43_9SPHI Length = 517 Score = 262 bits (669), Expect = 2e-68, Method: Compositional matrix adjust. Identities = 144/382 (37%), Positives = 206/382 (53%), Gaps = 29/382 (7%) Query: 47 ATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTV 106 T ++ + R +WL T +D+P S +Q +I LD Q+ GIN + Sbjct: 35 VTKRKLKREFRAVWLTTFDHMDFPKEKGAPPSE-------HKQELIKLLDFHQKSGINAI 87 Query: 107 FFQVKPDGTALWPSKILPWSDLMTGKIGE--NPGYDPLQFMLDEAHKRGMKVHAWFNPYR 164 FFQV+P A + S+I WS +TGK G+ P +DPL+F++ E HKR +++HAW NPYR Sbjct: 88 FFQVRPAADAFYKSEIELWSQWLTGKQGKAPEPLWDPLEFLVTECHKRNIELHAWINPYR 147 Query: 165 VSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVV 224 N K + P + +H +W G +PGIP V+ ++ ++VA++ Sbjct: 148 AVYNIKHD---------ATAPNHITKRHPEWFVVYGKHKQFNPGIPAVRHYLKAVVADIA 198 Query: 225 SRYPVDGVQFDDYFYTESPGS-RLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIK 283 RY +DG+ FDDYFY G D T+ K+GG WRR N LI +V T++ Sbjct: 199 QRYDIDGIHFDDYFYPYKKGRLEFPDQSTFMKHGGNSKDVHHWRRQNVNSLIKEVHDTLQ 258 Query: 284 SIKPGVEFGVSPAGVWRNRSHDPLGSDTR-GAAAYDESYADTRRWVEQGLLDYIAPQIYW 342 SIKP ++FG+SP GVWRN+S DP GSDT+ G ++YD YAD +W+ +G +DY+ PQ+YW Sbjct: 259 SIKPYLKFGISPLGVWRNKSEDPNGSDTQVGQSSYDYLYADVLKWLRKGWIDYLVPQLYW 318 Query: 343 PFSRSAARYDVLAKWWADVVKPTRTR-LYIGIAFYKVGEPSKIEPDWMINGGVPELKKQL 401 A + LA WWA K +R +YIG AFYK+ + W V EL Q+ Sbjct: 319 SIEHPRASFKSLAFWWA---KHAYSRHIYIGHAFYKIKNDK--DDHW---KQVSELPNQV 370 Query: 402 DLNDAVPEISGTILFREDYLNK 423 + I G FR D+L K Sbjct: 371 RMTRQYRSILGNAYFRSDFLQK 392 >UniRef50_A9NEW1 Putative uncharacterized protein n=1 Tax=Acholeplasma laidlawii PG-8A RepID=A9NEW1_ACHLI Length = 1328 Score = 260 bits (664), Expect = 7e-68, Method: Compositional matrix adjust. Identities = 138/385 (35%), Positives = 211/385 (54%), Gaps = 29/385 (7%) Query: 46 PATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINT 105 P T + + +R +W+ATV+ +D I+ + A + Q +I L+ ++ L NT Sbjct: 890 PTTYTEKDKEIRAVWVATVANID--------ITQYDNEANYKNQ-IISILERMKELKFNT 940 Query: 106 VFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRV 165 +FFQ +P + +PS+ P S ++G G G+D L+F++ EAH RG++VHAW NPYRV Sbjct: 941 MFFQTRPMNDSFYPSEYAPMSRFLSGTEGVGVGWDVLEFLITEAHARGIEVHAWMNPYRV 1000 Query: 166 SVNTKPGTIREL----NSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVA 221 + + +L +S ++Q S VQ + G +L+PGIPEV+ ++ +IV Sbjct: 1001 ASGSTASIEDQLALLHDSNFAKQNPSYVVQDK------GGALILNPGIPEVRQYLYNIVD 1054 Query: 222 EVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHT 281 E++ Y +DGV FDDYFY+ S D + + Y S+ DWRR N + + Sbjct: 1055 EIMENYAIDGVHFDDYFYSYSGTEDSQDADAFLNYNPNNLSRDDWRRENVNMFVKTIYER 1114 Query: 282 IKSIKPG----VEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIA 337 +++ V+FG+SP G+WRN++ D LGS+++G ++Y YAD+R+WV++G L YI Sbjct: 1115 VEAHNEANDMHVKFGISPFGIWRNKTQDALGSNSQGLSSYSAQYADSRKWVKEGWLHYII 1174 Query: 338 PQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPEL 397 PQ+YW F S AR+ L WW DVVK T L IG FY+ E S +W E Sbjct: 1175 PQLYWQFDHSTARFADLVDWWVDVVKDTNVDLIIGQGFYRYAENSN---NWT---NESEF 1228 Query: 398 KKQLDLNDAVPEISGTILFREDYLN 422 +QL EI G+ +F LN Sbjct: 1229 LEQLRYMSQYDEIIGSSIFSYKTLN 1253 Score = 158 bits (400), Expect = 3e-37, Method: Compositional matrix adjust. Identities = 98/307 (31%), Positives = 148/307 (48%), Gaps = 38/307 (12%) Query: 95 LDHLQRLGINTVFFQVKPDGTALWPSKILP---WSDLMTGKIGENPGYDPLQFMLDEAHK 151 L++++ +N + AL+ S+I P W + + +DP+ + ++EAHK Sbjct: 67 LNNMEANNLNVAIVHFRTHNNALYKSEINPVASWFATVDFDV-----FDPMAYFIEEAHK 121 Query: 152 RGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPE 211 RG++ HAW NPYRV + GTI N PA++ G +L+P +P Sbjct: 122 RGIEFHAWLNPYRVLSTYQRGTIPASNP--QSNPANLLSNKE------GTAHILNPALPV 173 Query: 212 VQDWITSIVAEVVSRYPVDGVQFDDYFYTE-SPGSRLNDNETYRKYGG------AFASKA 264 V++ + + + E++ Y VD + FDDYFY E + G LND + A K+ Sbjct: 174 VREHVVNTILEIIENYNVDAIHFDDYFYMEMNNGGILNDPDQALFLSNPLGQPNTVAGKS 233 Query: 265 DWRRNNTQQLIAKVSHTIKSIKPG----VEFGVSPAGVWRN--------RSHDPL--GSD 310 +WRR I + S IK V+FG+SP G++RN + P+ GS Sbjct: 234 NWRRTQINTFIEQASQAIKDFNQANNRYVQFGISPTGIYRNGDGEVTYDQDGKPITNGSK 293 Query: 311 TRGAAAYDES-YADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRL 369 T+G Y +ADT W+ +G LDYI PQ YW + S A +D + WW VV+ L Sbjct: 294 TQGQEHYASYLFADTVHWISEGWLDYILPQSYWASTHSLAGFDKVMGWWDKVVRYLDVNL 353 Query: 370 YIGIAFY 376 Y GI Y Sbjct: 354 YSGIGLY 360 >UniRef50_UPI00016C0313 cell surface protein n=1 Tax=Epulopiscium sp. 'N.t. morphotype B' RepID=UPI00016C0313 Length = 539 Score = 260 bits (664), Expect = 8e-68, Method: Compositional matrix adjust. Identities = 157/443 (35%), Positives = 232/443 (52%), Gaps = 48/443 (10%) Query: 7 NKKLT--IRRPAILVALALLLCSCKSTPPESMVTPPAGS---KPPAT-TQQSSQPMRGIW 60 NKK I +I+ A +LL K P + P S K P T + ++ +R +W Sbjct: 2 NKKFIAWILGGSIVTAAVILLLVPKELPSMKFI-PNKNSLNKKNPITPLRPQNEEVRAVW 60 Query: 61 LATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPS 120 ++++ LD+P +S+N +NP + QQ I LD LQ +G NTV QV+P AL+ S Sbjct: 61 ISSIWGLDFP-YNSINRNNPAA----QQAEFISYLDELQEIGFNTVMVQVRPSADALYKS 115 Query: 121 KILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNST 180 I PW+ ++TG G++PGYDPL FM+D+ HKRGMK+HAW NPYRV+ K +++ Sbjct: 116 AINPWAAILTGTQGQDPGYDPLAFMIDQTHKRGMKLHAWINPYRVTTAGK-----GIDTL 170 Query: 181 LSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYT 240 ++ PA + + D + + + P + V+ I V E+V+ Y VDG+ DDYFY Sbjct: 171 VATHPARL---NPDMLISHKNALYYXPELDAVKSHIEETVKEIVTNYSVDGIHMDDYFYP 227 Query: 241 E---SPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAG 297 P + +T A RRN+ ++ ++ IK IKP VEFG+SP G Sbjct: 228 AWYPLPAGEDGNGKT-----------ATTRRNHVNDMVKRIHTAIKQIKPNVEFGISPIG 276 Query: 298 VWRNRSHDPLGSDTR-GAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAK 356 +W++ D GS+T G +Y YADTR W++ +DY+ PQIYW A Y+VL K Sbjct: 277 IWKDSITDITGSETSAGWNSYYAVYADTRAWIQNEWIDYVVPQIYWEIDNPVASYEVLVK 336 Query: 357 WWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILF 416 WWA+ VK T LYIG YK + E+ Q+ LND PEI G++ F Sbjct: 337 WWAEEVKNTNVDLYIGQGIYK-------------DAVAEEITTQILLNDLYPEIKGSVXF 383 Query: 417 REDYLNKPQTQQAVSYLQSRWGS 439 + + T L++ +G+ Sbjct: 384 AISDIIRKNTGNVRGQLEALFGT 406 >UniRef50_D2QEX0 Putative uncharacterized protein n=2 Tax=Flexibacteraceae RepID=D2QEX0_9SPHI Length = 570 Score = 257 bits (656), Expect = 6e-67, Method: Compositional matrix adjust. Identities = 137/364 (37%), Positives = 204/364 (56%), Gaps = 25/364 (6%) Query: 56 MRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGT 115 R +W+ATV+ +DWP + +++ QQ+ ++ D Q++G+N V QV+ Sbjct: 92 FRAVWVATVNNIDWPSKKGLPVAD-------QQREIVAMFDQHQQMGLNAVVVQVRSAAD 144 Query: 116 ALWPSKILPWSDLMTGKIG--ENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGT 173 A + PWS+ +TG+ G P YDPL+FM+D+AH RG++ HAWFN R + Sbjct: 145 AFYARGSEPWSEWLTGQQGLAPEPFYDPLEFMIDQAHGRGLEFHAWFNLDRAT------- 197 Query: 174 IRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQ 233 + T S P+++ + +W+ G R + + GIP V+ +I IVA VV Y VDG+ Sbjct: 198 ---FSKTASVAPSNIVNRKPEWMLMYGGRKLFNLGIPAVRSYIAGIVANVVREYDVDGIH 254 Query: 234 FDDYFYTES-PGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFG 292 FDDYFY + PG L D+ TY+ SK DWRR+N +L+ ++ +I++ KP V+FG Sbjct: 255 FDDYFYPYAEPGQVLRDDSTYKANSNGM-SKPDWRRDNVTKLVKELRDSIRANKPWVKFG 313 Query: 293 VSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYD 352 +SP G+W+N+S DP GS T G AY E YADTR+WV +GL+DY+ PQ+Y+ S Y Sbjct: 314 ISPFGIWKNKSSDPEGSATNGGQAYYELYADTRKWVREGLIDYVVPQVYFSSEFSKVPYK 373 Query: 353 VLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISG 412 L WW LYIG Y+VG S+ +P W P+ Q+ N + G Sbjct: 374 TLVDWWTRNCT-ENCHLYIGHGAYRVGRGSERDPGWWRPTEFPD---QMRYNRQQQVVKG 429 Query: 413 TILF 416 ++ F Sbjct: 430 SVFF 433 >UniRef50_D1N426 Putative uncharacterized protein n=1 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1N426_9BACT Length = 450 Score = 256 bits (655), Expect = 8e-67, Method: Compositional matrix adjust. Identities = 151/400 (37%), Positives = 218/400 (54%), Gaps = 33/400 (8%) Query: 51 QSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQV 110 Q ++ MRG+W+ATV +D+ + A ++ I +++LQR N +FFQV Sbjct: 58 QRAREMRGVWVATVENIDF---------GRHTDAAGFKRDFIAVVNNLQRAKFNAIFFQV 108 Query: 111 KPDGTALWPSKILPWSDLMTGKIGEN-PGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNT 169 +P A +PSK PWS MTGK G+ P +DPL FM+ EAHKRG++ HAW NPYRV+ Sbjct: 109 RPMCDAFYPSKHNPWSRWMTGKEGQAIPNFDPLAFMVAEAHKRGLEFHAWLNPYRVNAGA 168 Query: 170 KPGTIREL----NSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVS 225 + G L N + +++ + ++ + + + L+PG P V I +AE++ Sbjct: 169 QVGKTAYLKTLDNKSFAKRNPGLVLESK--LASGRYSLFLNPGEPRVVRHIADTIAEILE 226 Query: 226 RYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSI 285 YPVD + FDDYFY S + D+ ++++ S +WRR N + I V T+ + Sbjct: 227 NYPVDAIHFDDYFYLYSDIGTI-DSASFQRNNPGRLSLEEWRRGNVDKAIYTVKKTVDAY 285 Query: 286 --KPG--VEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIY 341 + G V FGVSP G+W N+ +P GS T G +Y YADTR WV +G +DYI PQ+Y Sbjct: 286 NRRSGRKVAFGVSPFGIWANKKSNPNGSLTGGKQSYYAQYADTRGWVRKGWVDYIIPQLY 345 Query: 342 WPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQL 401 WPFS A Y LA WW+D VK TR RL+IG Y+VG +P EL Q+ Sbjct: 346 WPFSHEVAAYAALADWWSDAVKGTRVRLFIGQGLYRVGAERIWQPR--------ELVDQM 397 Query: 402 DLNDAVPEISGTILFREDYLNKP---QTQQAVS-YLQSRW 437 N + + GT++F + P Q ++AVS LQ W Sbjct: 398 RYNQMLFNVDGTVIFSYRNVFMPGNGQMKEAVSRILQGCW 437 >UniRef50_C6XWP7 Putative uncharacterized protein n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6XWP7_PEDHD Length = 519 Score = 256 bits (655), Expect = 9e-67, Method: Compositional matrix adjust. Identities = 143/388 (36%), Positives = 208/388 (53%), Gaps = 32/388 (8%) Query: 18 LVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNI 77 L+ ALL TP + P+ P + RG+W+ATV+ +DWP +NI Sbjct: 4 LIIYALLTVII--TPISLIAQSPSKIAP-------KREFRGVWVATVANIDWPSKPGLNI 54 Query: 78 SNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENP 137 Q+Q +I L+ + G+N + QV+P A + PWS + GK G P Sbjct: 55 DQ-------QKQELIGLLEQHKANGMNAIILQVRPAADAFYLKSREPWSQWLMGKQGMAP 107 Query: 138 --GYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDW 195 GYDPL F + EAH RGM++HAWFNPYR + ++++ P + + D Sbjct: 108 APGYDPLAFAIKEAHSRGMELHAWFNPYRAT----------MSASAVVSPDHMTRKRPDL 157 Query: 196 IRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFY-TESPGSRLNDNETYR 254 G + DPGIPEV+++I ++ +VV Y VDG+ FDDYFY + G +ND T+ Sbjct: 158 FFVYGGKKQFDPGIPEVREYIVQVILDVVKGYDVDGIHFDDYFYPYKIAGQNINDAATFN 217 Query: 255 KYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGA 314 KY F++ ADWRRNN LI ++ +I K V+FGVSP G+W+N S D LGS T G Sbjct: 218 KYPNGFSNIADWRRNNVDLLIKQLDDSIHHYKKYVKFGVSPFGIWKNLSEDSLGSATNGL 277 Query: 315 AAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIA 374 + Y E YAD+R+WV++G +DYI PQ+Y+ F+R AA + +A WW + +YIG Sbjct: 278 SNYAELYADSRKWVKEGWVDYINPQVYFSFTRRAAPFATIADWWTN--NAFGRHVYIGHG 335 Query: 375 FYKVGEPS-KIEPDWMINGGVPELKKQL 401 Y + S + E W +P + + Sbjct: 336 AYLIHNGSTRKEAAWAFPNQIPNQIRHI 363 >UniRef50_C7PIN2 Putative uncharacterized protein n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PIN2_CHIPD Length = 509 Score = 256 bits (655), Expect = 9e-67, Method: Compositional matrix adjust. Identities = 134/326 (41%), Positives = 183/326 (56%), Gaps = 26/326 (7%) Query: 56 MRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGT 115 R +W+ATV +DWP + + Q+Q I+ LD QR G+N V Q++P Sbjct: 29 FRAVWIATVENIDWPSRKGLPVE-------TQKQEFINLLDKHQRNGMNAVIVQIRPAAD 81 Query: 116 ALWPSKILPWSDLMTGKIGE--NPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGT 173 A + S PWS+ ++G G+ NP YDPL+FML+E HKRGM+ HAWFNPYR + Sbjct: 82 AFYDSPFEPWSEYLSGVQGQAPNPYYDPLRFMLEETHKRGMEFHAWFNPYRAVIRNASA- 140 Query: 174 IRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQ 233 N +P W + DPGIPEV++++T I+ +VV RY +D V Sbjct: 141 ----NHISRMRP--------QWFVNFDGKKYFDPGIPEVREYVTQIIRDVVRRYDIDAVH 188 Query: 234 FDDYFY-TESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFG 292 FDDYFY PG DN +YR+YG K DWRR N +I VS IK KP V+FG Sbjct: 189 FDDYFYPYPVPGREFGDNNSYRQYGRNMM-KDDWRRWNVDTIIQMVSKMIKEEKPWVKFG 247 Query: 293 VSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYD 352 +SP G+WRN++ D GS T G + YD+ YAD R+W++ G +DY+APQ+YW A Y+ Sbjct: 248 ISPFGIWRNKNKDQDGSYTTGLSNYDDLYADVRKWLQNGWIDYVAPQLYWERGHRVANYE 307 Query: 353 VLAKWWADVVKPTRTRLYIGIAFYKV 378 +L WWA +YIG Y++ Sbjct: 308 LLLNWWAQ--HGYGRNVYIGHGVYRL 331 >UniRef50_A5FAG6 Putative uncharacterized protein n=1 Tax=Flavobacterium johnsoniae UW101 RepID=A5FAG6_FLAJ1 Length = 493 Score = 247 bits (631), Expect = 5e-64, Method: Compositional matrix adjust. Identities = 140/387 (36%), Positives = 207/387 (53%), Gaps = 31/387 (8%) Query: 56 MRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGT 115 MR W++TV +DWP S P + + MI LD+L+ +NTV FQ++P Sbjct: 27 MRAAWISTVDNIDWP-------SKPGLSDKQMKSEMIAILDNLRSNNLNTVIFQIRPTAD 79 Query: 116 ALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIR 175 A + S P S +TG G PG+DPLQ M+DEA KRGM VH W NPYRV +T + Sbjct: 80 AYYKSTKEPASHWITGTQGVAPGFDPLQMMIDEAGKRGMNVHVWLNPYRVQKDTVKDVLT 139 Query: 176 ELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFD 235 + + +Y + + T G +PG E +D+++S+V E+V Y + V D Sbjct: 140 KTH---------LYFKKPELFLTYGKSRYFNPGYKETRDFVSSVVGEIVRNYDIQAVHMD 190 Query: 236 DYFY-TESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVS 294 DYFY + G D + + K F K DWRR+N +I ++ TI + KP VEFG+S Sbjct: 191 DYFYPYKIAGQEFPDEKAFAKEPRQFKDKDDWRRDNVDLIIKQIRDTIIANKPEVEFGIS 250 Query: 295 PAGVWRNRSHDPLGSDT-RGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDV 353 P GVWRN + D GS+T GA YD+ YA+ +W ++ +DY+ PQ+YW A ++V Sbjct: 251 PFGVWRNIAKDSDGSNTVAGATNYDDLYANILKWQKENWIDYVTPQLYWHIGFDRANFEV 310 Query: 354 LAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGT 413 LAKWWA T +Y+G YK+ +K EP+W ++ KQ+++ +P+I G+ Sbjct: 311 LAKWWA--AHKYGTNVYVGHGDYKISNTAK-EPEWR---SPDQIVKQIEMIRKLPQIDGS 364 Query: 414 ILF-------REDYLNKPQTQQAVSYL 433 + F + D L P Q+ Y+ Sbjct: 365 MHFTASTFLKKGDTLRNPLIQKPYKYI 391 >UniRef50_A4ASW6 FenI n=1 Tax=Flavobacteriales bacterium HTCC2170 RepID=A4ASW6_9FLAO Length = 507 Score = 238 bits (608), Expect = 3e-61, Method: Compositional matrix adjust. Identities = 134/430 (31%), Positives = 214/430 (49%), Gaps = 43/430 (10%) Query: 12 IRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPP 71 +++ + + L ++ SC + P Q RG+W+ATV +DWP Sbjct: 2 LKKISHYLLLLIIFNSCDAIKP---------------IPQPRTEFRGVWVATVVNIDWP- 45 Query: 72 VSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTG 131 N Q+ + L+ +L NTV QV+ G + + SK PWS +TG Sbjct: 46 ------KNGLDAIEKQKADFLKILEFYDQLNFNTVIVQVRTAGDSFYDSKYAPWSRFLTG 99 Query: 132 KIGENP--GYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVY 189 G++ +D L +M+D+ H RG + HAW NPYR + + K + + + Sbjct: 100 TEGKSTEGHFDMLNWMIDQTHNRGFEFHAWLNPYRATFDLKTDVLSATHD---------F 150 Query: 190 VQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRL-N 248 H +W+ G+++ +PG+PEV++ + SI+ EVV++Y +D + FDDYFY + N Sbjct: 151 NLHPEWMLKYGNKYYYNPGLPEVRERLASIMGEVVTKYDIDAIHFDDYFYPYRIKDEIFN 210 Query: 249 DNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLG 308 D+ Y + + + +WRR+N L+ + T+K+IKP V+FG+SP GVW+N+S DP G Sbjct: 211 DSLAYNYHSFSGQTVENWRRSNIDSLVKNIHSTVKNIKPWVQFGISPFGVWKNKSTDPRG 270 Query: 309 SDTR-GAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRT 367 SDT+ G Y++ YAD W+ +G +DY+ PQ+YW A + + WW++ T Sbjct: 271 SDTKAGQTTYEDLYADPLTWMNEGWIDYLVPQVYWSMDLPVASHKKIVNWWSN--NSVNT 328 Query: 368 RLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQ 427 LYIG YK+ S D E+ QL L ++ G +LF L Sbjct: 329 NLYIGNGAYKIRSNSDKAWD-----DKKEMPNQLKLARKDSKVQGNVLFSAKSLMN-DNP 382 Query: 428 QAVSYLQSRW 437 V YL+ R+ Sbjct: 383 DVVEYLKRRF 392 >UniRef50_Q7MXU6 YngK protein n=4 Tax=Porphyromonadaceae RepID=Q7MXU6_PORGI Length = 512 Score = 235 bits (600), Expect = 2e-60, Method: Compositional matrix adjust. Identities = 127/364 (34%), Positives = 187/364 (51%), Gaps = 37/364 (10%) Query: 1 MDICSRNKKLTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIW 60 M CSR K LT + + L C K P + + R W Sbjct: 1 MYHCSR-KSLTFFLALLFCVMVLFSCGTKRKLPSQV-----------HADYPKREFRAAW 48 Query: 61 LATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPS 120 + TV + ++ +S P R+ +I +LD L+ G N + FQ++P+ A + S Sbjct: 49 IQTVYQGEYARLS------PAEARRL----LIGRLDKLKEAGCNAIIFQIRPESDAWYES 98 Query: 121 KILPWSDLMTGKIGE--NPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELN 178 I PWS +TG+ G+ P +DPL FM+ E HKRGM++HAW NPYR S + G Sbjct: 99 AIEPWSRFLTGRQGQAPTPFWDPLAFMVSECHKRGMELHAWINPYRASTSGTAGL----- 153 Query: 179 STLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYF 238 P Y ++ W T ++ DPG+P+ + +I IV ++ RY +D + DDYF Sbjct: 154 -----APNHPYHRYPQWFVTYNNQLYYDPGVPDCRAYICRIVRDITMRYDIDAIHMDDYF 208 Query: 239 Y-TESPGSRLNDNETYRKYGGA--FASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSP 295 Y G+ D++++R+YG F +K DWRR N +L+ ++ TI KP V FG+SP Sbjct: 209 YPYPVAGAAFPDDDSFRRYGQGYTFQTKGDWRRENVNKLVHEIKQTILQSKPWVRFGISP 268 Query: 296 AGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLA 355 G++RN+ P GS+T G YD+ YAD W ++G +DY+ PQIYW AA Y LA Sbjct: 269 FGIYRNKRTSPSGSETAGLQNYDDLYADVLLWQKRGWIDYVIPQIYWEIGHKAADYATLA 328 Query: 356 KWWA 359 +WW Sbjct: 329 EWWG 332 >UniRef50_A6EKL7 Putative uncharacterized protein (Fragment) n=1 Tax=Pedobacter sp. BAL39 RepID=A6EKL7_9SPHI Length = 391 Score = 234 bits (597), Expect = 5e-60, Method: Compositional matrix adjust. Identities = 122/296 (41%), Positives = 177/296 (59%), Gaps = 20/296 (6%) Query: 124 PWSDLMTGKIG--ENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTL 181 PWS + G+ G PGYDPL F + EAH RGM++HAWFNPYR +++ T+ + Sbjct: 4 PWSQWLMGRQGLAPGPGYDPLAFAIKEAHSRGMELHAWFNPYRATMSAN--TVTSADHMT 61 Query: 182 SQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTE 241 ++P D T G + DPGIPEV+++I ++ +VV Y VDG+ FDDYFY Sbjct: 62 RKRP--------DLFFTYGGKKQFDPGIPEVREYIVQVILDVVKGYDVDGIHFDDYFYPY 113 Query: 242 S-PGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWR 300 G R++D+ T+ KY F++K DWRRNN LI ++ +I K V+FG+SP G+W+ Sbjct: 114 PIAGQRISDDVTFSKYANGFSNKNDWRRNNVDLLIKQLDDSIHHYKKYVKFGISPFGIWK 173 Query: 301 NRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWAD 360 N++ D LGS T G + Y E YAD+R+WV++G +DYI PQIY+ F+R AA +D L WW++ Sbjct: 174 NKAEDTLGSATHGLSNYTELYADSRKWVKEGWVDYINPQIYFSFTRRAAPFDTLVNWWSN 233 Query: 361 VVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILF 416 LYIG A Y V + K+E W +P+ + L N+ + G++ F Sbjct: 234 --NAYGRHLYIGQAAYLVNQ--KMEAAWRNPSQIPDQVRYLRANN---RVQGSVYF 282 >UniRef50_B4D6Q1 Putative uncharacterized protein n=2 Tax=Verrucomicrobia RepID=B4D6Q1_9BACT Length = 388 Score = 226 bits (575), Expect = 2e-57, Method: Compositional matrix adjust. Identities = 125/336 (37%), Positives = 185/336 (55%), Gaps = 24/336 (7%) Query: 52 SSQP-MRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQV 110 ++QP RG W+ATV LDWP S +S +A+++ D D Q+L +N + QV Sbjct: 19 AAQPEFRGAWVATVFNLDWP--SKAGLSEAEQKAQLR-----DIFDRAQQLKLNAILLQV 71 Query: 111 KPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTK 170 + A + S+ PWS +TGK G +PGYDPL + + EAH RG+++HAWFNP+R TK Sbjct: 72 RSMSDACYASRREPWSTFLTGKQGVDPGYDPLAYAITEAHARGIELHAWFNPFR--AGTK 129 Query: 171 PGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVD 230 G+ N P +WIR G + LDPG P + ++ ++ +VV RY +D Sbjct: 130 GGSSCAANHVTRAHP--------EWIRPYGSQLWLDPGDPNARRYVLDVILDVVKRYDID 181 Query: 231 GVQFDDYFY-TESPGSRLNDNETYRKYGGAFA-SKADWRRNNTQQLIAKVSHTIKSIKPG 288 GV DDYFY G+ D+ T++KYG A S+ADWRR+N + + + H +K+ KP Sbjct: 182 GVHIDDYFYPYPVKGAEFPDDVTWQKYGMAGGKSRADWRRDNINRFVEAMYHEVKAAKPS 241 Query: 289 VEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSA 348 V G+SP G+WR + + + AY + YAD R W+ +G DY+APQ+YW Sbjct: 242 VRVGISPFGIWRPKVPATIEAQLD---AYAQLYADARYWLSEGWCDYLAPQLYWGIHPDK 298 Query: 349 ARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKI 384 + VL WW R ++ GIA ++G+P + Sbjct: 299 QSFPVLLNWWRQQSTAGRP-VWPGIATERIGKPYDV 333 >UniRef50_A9NEW0 Putative uncharacterized protein n=1 Tax=Acholeplasma laidlawii PG-8A RepID=A9NEW0_ACHLI Length = 404 Score = 224 bits (572), Expect = 4e-57, Method: Compositional matrix adjust. Identities = 127/386 (32%), Positives = 207/386 (53%), Gaps = 25/386 (6%) Query: 54 QPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPD 113 +P R W++ V +D P N+ +P+ + +V I+ LD + + +FFQV+ Sbjct: 30 KPFRAFWISNVLNIDLP-----NMKDPSYKDKV-----IEMLDTAKAYNMTAIFFQVRTT 79 Query: 114 GTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGT 173 A + SK+ P+S +TGK GE P +D L+F++ EA R ++VHAW NPYRVS+ T T Sbjct: 80 NDAFYKSKLNPYSRFLTGKEGEVPLFDVLEFVIKEAKNRSLEVHAWCNPYRVSMKTDM-T 138 Query: 174 IRELNSTLSQQPASVYVQHRDWIRTSGD-RFVLDPGIPEVQDWITSIVAEVVSRYPVDGV 232 E STL + +H +++ T + + +L+P EV+ +I + E+ Y VDG+ Sbjct: 139 KSEYLSTLDD--LNFAKRHPEFVITDKNGQLILNPAKEEVKTFIIDSMLEIADNYDVDGI 196 Query: 233 QFDDYFYTESPGSRL-NDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEF 291 FDDYFY + S ND + + + D+RRN +I ++ +K P + F Sbjct: 197 HFDDYFYPYAGLSDSDNDASDFEQRTDKSLTLGDFRRNQITDVIRNLNKALKEKHPNLRF 256 Query: 292 GVSPAGVWRNRSHDPLGS--DTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAA 349 GVSP G+W+ + D LGS D + + +YD YAD+ W+++G++DYI PQ+YW F A Sbjct: 257 GVSPFGIWKTKKSDELGSNVDPQCSQSYDNQYADSYLWIKEGIIDYIVPQLYWDFEHKLA 316 Query: 350 RYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPE 409 + LA WWA+V K + LYIG Y+ GE E + E+ QL + Sbjct: 317 PFADLALWWAEVCKGSNVDLYIGHGPYRYGEKGGYENPY-------EVVNQLKFANQFDN 369 Query: 410 ISGTILFR-EDYLNKPQTQQAVSYLQ 434 + G + F + ++++ + QQ + ++ Sbjct: 370 VVGNVFFTYKTFIDETKQQQGMHLVK 395 >UniRef50_B9XM08 Putative uncharacterized protein n=2 Tax=bacterium Ellin514 RepID=B9XM08_9BACT Length = 523 Score = 223 bits (569), Expect = 8e-57, Method: Compositional matrix adjust. Identities = 129/357 (36%), Positives = 197/357 (55%), Gaps = 35/357 (9%) Query: 33 PESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMI 92 P ++V P+ ++PPA+ ++ R +W+AT+ +DWP S P Q+ ++ Sbjct: 44 PAAVVYIPSTAQPPASNRE----FRAMWIATMVNIDWP-------SKPGLPVPQQKAELL 92 Query: 93 DKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPG--YDPLQFMLDEAH 150 LD +L +N V FQV+P A++ S I PWS +TG +G+ P YDPL F ++EAH Sbjct: 93 AILDCAVKLNLNAVIFQVRPGSDAMYASSIEPWSYYLTGAMGKAPAPFYDPLAFAVEEAH 152 Query: 151 KRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIP 210 KRG+++HA+FNP+R + +K N +P D +R G+ LDPG Sbjct: 153 KRGLELHAYFNPFRAAQPSKKWQFSS-NHISRTRP--------DLVRQYGNLLWLDPGER 203 Query: 211 EVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLN------DNETYRKY--GGAFAS 262 E QD + +V +VV+RY +D V FDDYFY N D++T++++ GG S Sbjct: 204 EAQDHVLKVVMDVVNRYDIDAVHFDDYFYPYKQQDARNRDIDFPDSKTWKRFVAGGGKLS 263 Query: 263 KADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYA 322 + DWRR N + +V +I + KP V+FG+SP G+W+ + P +G AYD YA Sbjct: 264 RDDWRRENINSFVHRVHDSIHAAKPWVKFGISPFGIWQP-GYPP---QVKGLNAYDSIYA 319 Query: 323 DTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVG 379 D+R+W+ G +DY++PQ+YW + VL KWW + R ++ GIA KVG Sbjct: 320 DSRKWLMNGWVDYLSPQLYWAVESPGQSFPVLLKWWLEQNSKNRN-VWPGIASEKVG 375 >UniRef50_Q8YW40 All1776 protein n=5 Tax=Nostocaceae RepID=Q8YW40_ANASP Length = 669 Score = 223 bits (569), Expect = 9e-57, Method: Compositional matrix adjust. Identities = 134/363 (36%), Positives = 194/363 (53%), Gaps = 31/363 (8%) Query: 17 ILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVN 76 I+ + L + +V PP + P + + RG W+ TV DWP + ++ Sbjct: 180 IIYQALVYLGQAEKIASVYLVIPPKPTLPTVRVSHNRE-FRGAWITTVWNSDWPSKAGLS 238 Query: 77 ISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGE- 135 ++ QQ ++ L LQ+L N V QV+P+G AL+ S++ PWS +TG G+ Sbjct: 239 VAQ-------QQAELVAILTRLQQLNFNAVILQVRPEGDALYASELEPWSAWLTGTPGKA 291 Query: 136 -NPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTK--PGTIRELNSTLSQQPASVYVQH 192 P YDPLQF + EAHKR ++VHAWFNPYR +T+ P ++ T P VY Sbjct: 292 PEPFYDPLQFAIAEAHKRNLEVHAWFNPYRAKTSTRSAPNVRPHISVT---NPEVVY--- 345 Query: 193 RDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTES-PGSRLNDNE 251 W G++ +DPGI VQD +++ +VV RY +D V DDYFY G D++ Sbjct: 346 -QW----GNQLWMDPGIKIVQDRAYNVIIDVVRRYDIDAVHLDDYFYPYPIQGQAFPDDK 400 Query: 252 TYRKY--GGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGS 309 TY Y G S DWRR N Q++ ++S IK+ K V+FG+SP G++R P G Sbjct: 401 TYAAYKSAGGQLSLNDWRRQNVDQMVLRLSQGIKATKSYVKFGISPFGIYR--PGQPPG- 457 Query: 310 DTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRL 369 G AY YAD ++W+EQG +DY+APQ+YW ++ Y VL KWW + + R + Sbjct: 458 -ITGLDAYSVLYADAKKWLEQGWVDYLAPQLYWRTDQTNQSYPVLLKWWTE-INSKRRHI 515 Query: 370 YIG 372 Y G Sbjct: 516 YAG 518 >UniRef50_C3XYE7 Putative uncharacterized protein n=2 Tax=Branchiostoma floridae RepID=C3XYE7_BRAFL Length = 576 Score = 219 bits (558), Expect = 1e-55, Method: Compositional matrix adjust. Identities = 137/378 (36%), Positives = 197/378 (52%), Gaps = 37/378 (9%) Query: 46 PATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINT 105 P T S+ RG+W+ATVS +DWP SS ++S +A ++ LD L +N Sbjct: 114 PTITPSPSREFRGVWVATVSNIDWP--SSRHLSTEQQKAE-----LVTILDRTVELNLNA 166 Query: 106 VFFQVKPDGTALWPSKILPWSDLMTGKIGE--NPGYDPLQFMLDEAHKRGMKVHAWFNPY 163 + FQV+P G A + S++ PWS + G+ G P YDPL F ++E+H+RG+++HAWFNPY Sbjct: 167 IVFQVRPAGDAFYDSQLEPWSYYLAGQHGSAPTPFYDPLAFAIEESHRRGIELHAWFNPY 226 Query: 164 RVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEV 223 R G N + P Y G+ +DPG V D ++ +V Sbjct: 227 RAKTKAA-GYSLASNHMAKRFPQYAY--------DYGNYIWMDPGAQVVADHTYDVIIDV 277 Query: 224 VSRYPVDGVQFDDYFY-TESPGSRLNDNETYRKY--GGAFASKADWRRNNTQQLIAKVSH 280 V RY VDG+ FDDYFY G D TY+ Y G SKADWRR+N +L+ +++ Sbjct: 278 VRRYDVDGIHFDDYFYPYPVSGVDFPDTATYQAYQTSGGTMSKADWRRDNVNRLVRRLNS 337 Query: 281 TIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQI 340 I + K V+FG+SP G+WR +P G G + YD YAD + W+EQGL+DY+APQ+ Sbjct: 338 GIHAEKSHVKFGISPFGIWR--PGNPAG--IVGFSQYDSLYADPKFWLEQGLVDYLAPQL 393 Query: 341 YWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQ 400 YW Y L WW D P + +Y G ++ + W ++ EL Q Sbjct: 394 YWMIDPPQQSYPALLDWWLD-QNPLQRHVYTGNYLSRI-----LTDGWPVS----ELVNQ 443 Query: 401 LDLN-DAVPEIS-GTILF 416 + L+ D +S G I+F Sbjct: 444 VSLSRDRADRLSLGNIMF 461 >UniRef50_B7AM83 Putative uncharacterized protein n=1 Tax=Bacteroides eggerthii DSM 20697 RepID=B7AM83_9BACE Length = 489 Score = 219 bits (557), Expect = 2e-55, Method: Compositional matrix adjust. Identities = 137/409 (33%), Positives = 201/409 (49%), Gaps = 49/409 (11%) Query: 18 LVALALLLCSC------KSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPP 71 LVAL SC ++ PPE TPP P + +RG W+ TV +DWP Sbjct: 9 LVALLAFAVSCSKDDDGENMPPEP--TPPVEKPEPQAVTLPQKELRGAWITTVWGIDWP- 65 Query: 72 VSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTG 131 + N A QQ+ D LD L +N VFFQ++ A + S+ WS +TG Sbjct: 66 MEDYN-------AATQQKKYTDYLDLLVANNMNAVFFQIRGMADAFYESQYESWSKNITG 118 Query: 132 KIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVN-TKPGTIRELNSTLSQQPASVYV 190 G+NPGYD L F+++EAHKRG++ HAW NPYR+S +K + EL++ + P + Sbjct: 119 TAGKNPGYDVLGFLVEEAHKRGLQFHAWMNPYRISTRASKNSSFAELDTKI---PVA--- 172 Query: 191 QHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFY-TESPGSRLND 249 W + + +P +PEVQ I IV E++++Y VDG+ DDYFY + G +ND Sbjct: 173 ----WTKDYNKIRIYNPAMPEVQTRIMDIVKEIITKYDVDGIHMDDYFYPSLEEGESMND 228 Query: 250 NETYRKYG-GAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLG 308 N Y KYG F S ++RRNN +I + I KPGV F VSPA N Sbjct: 229 NAEYEKYGKDKFKSIEEFRRNNVDVVIQNIQKVIIDTKPGVIFSVSPAANIDNN------ 282 Query: 309 SDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTR 368 Y + +AD R+W+++G +D I PQ+Y+ ++ W V +T Sbjct: 283 --------YSKLFADVRKWLKEGWVDVIIPQLYFATGTGKNSFNQFLDQWMQYV--NQTH 332 Query: 369 LYIGIAFYKVGEPSKIEPDW-MINGGVPELKKQLDLNDAVPEISGTILF 416 IG YK G +PD+ +LK Q + +++G++L+ Sbjct: 333 CLIGYGIYKFGS---TDPDYGNAFHSSADLKSQFEYASKKSKVNGSVLY 378 >UniRef50_B4VZ35 Putative uncharacterized protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VZ35_9CYAN Length = 665 Score = 218 bits (556), Expect = 3e-55, Method: Compositional matrix adjust. Identities = 126/375 (33%), Positives = 205/375 (54%), Gaps = 37/375 (9%) Query: 56 MRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGT 115 RG+W+ATV +DWPP ++++ QQ+ ++ +D + L +N + QV+P G Sbjct: 214 FRGVWVATVWNIDWPPQRGLSVAQ-------QQRELLQIIDRMAELQLNALILQVRPTGD 266 Query: 116 ALWPSKILPWSDLMTGKIGE--NPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGT 173 A + S++ PWS+ +TG G+ +P YDPL+F + H+R +++HAWFNPYR ++ Sbjct: 267 AFYASELEPWSEWLTGVQGQAPDPYYDPLEFAIAACHQRNIELHAWFNPYRAKTSS---- 322 Query: 174 IRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQ 233 +S+ S P + V H +++ G++ +DPG+ VQD +++ +VV RY VDG+ Sbjct: 323 ----HSSASVAP-HISVTHPEYVYKYGNQQWMDPGVKVVQDLTYNVIMDVVRRYDVDGIH 377 Query: 234 FDDYFYTES-PGSRLNDNETYRKYG--GAFASKADWRRNNTQQLIAKVSHTIKSIKPGVE 290 DDYFY G D++TY Y G S +DWRR N Q++ ++ I++ K V+ Sbjct: 378 LDDYFYPYPIAGEDFPDDKTYNAYQAEGGTLSLSDWRRENVNQMVQRLYKGIQATKKQVK 437 Query: 291 FGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAAR 350 FG+SP G++R P +G Y+ YAD ++W+E G +DYIAPQ+YW A Sbjct: 438 FGISPFGIYRP-GQPP---QIKGLDQYESLYADPKKWLEAGWIDYIAPQLYWRIDPPAQS 493 Query: 351 YDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLN-DAVPE 409 Y VL +WW D P + +Y G + + DW E ++Q+D+ + P+ Sbjct: 494 YPVLLEWWTD-NNPKQRHIYPGNRLSMLD-----DKDW----SFLEYERQVDITRNLAPQ 543 Query: 410 IS-GTILFREDYLNK 423 +S G I + N+ Sbjct: 544 LSLGNIFYNMKVFNE 558 >UniRef50_UPI0001C160EA conserved hypothetical protein n=2 Tax=Nostocaceae RepID=UPI0001C160EA Length = 668 Score = 218 bits (554), Expect = 4e-55, Method: Compositional matrix adjust. Identities = 134/370 (36%), Positives = 196/370 (52%), Gaps = 32/370 (8%) Query: 36 MVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKL 95 +VT P G K + Q + RG W+ V DWP S P Q+ +++ + Sbjct: 199 IVTLPLGVKTVKVSHQ--REFRGAWITVVWNSDWP-------SKPGLSVEQQKTELLEII 249 Query: 96 DHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPG--YDPLQFMLDEAHKRG 153 LQ N + QV+P+G A++ S I PWS MTG G+ P YDPL+F ++E HKR Sbjct: 250 KQLQSFNFNALILQVRPEGDAVYASPIEPWSAWMTGTQGKAPEPIYDPLEFAIEECHKRN 309 Query: 154 MKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQ 213 ++VHAWFNPYR TK G+ + ++ P VY W G++ +DPG VQ Sbjct: 310 IEVHAWFNPYRAKTTTKSGSNVSPHIAITN-PEVVY----RW----GNQLWMDPGAKIVQ 360 Query: 214 DWITSIVAEVVSRYPVDGVQFDDYFY-TESPGSRLNDNETYRKY--GGAFASKADWRRNN 270 D +++ +V++RY VDG+ DDYFY G D +TY Y G S DWRR N Sbjct: 361 DRAYNVIIDVLTRYDVDGIHLDDYFYPYPISGQSFPDEKTYSAYKNSGGKLSVEDWRREN 420 Query: 271 TQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQ 330 Q++ ++S IK IK V+FG+SP G++R P G G Y YAD+++W+++ Sbjct: 421 VNQMVWRLSEGIKKIKAHVKFGISPFGIYR--PGQPAG--IVGLDPYSVLYADSKKWLQE 476 Query: 331 GLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAF----YKVGEPSKIEP 386 G +DY+APQ+YW ++ Y+ L KWW +V R +Y G KV + S+IE Sbjct: 477 GWIDYLAPQLYWRTDQTQQSYETLLKWWTEVNTKQR-HIYAGNNLGQLDGKVWKNSEIEK 535 Query: 387 DWMINGGVPE 396 +I+ + E Sbjct: 536 QIVISRNLAE 545 >UniRef50_C0YRL9 FenI family protein n=3 Tax=Bacteroidetes RepID=C0YRL9_9FLAO Length = 538 Score = 214 bits (545), Expect = 5e-54, Method: Compositional matrix adjust. Identities = 114/338 (33%), Positives = 179/338 (52%), Gaps = 39/338 (11%) Query: 40 PAGSKPPATTQQSSQ------------PMRGIWLATVSRLDWPPVSSVNISNPTSRARVQ 87 PA + P T S++ RG W+A+V+ ++WP + + + Q Sbjct: 51 PAATNPATGTAASTEDNFRTNLPEIKREFRGAWIASVANINWPSRNDLTVEQ-------Q 103 Query: 88 QQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGE--NPGYDPLQFM 145 + I LD L+ N FQ++P AL+ S I PWS +TG+ G +P YDPLQF Sbjct: 104 KAEAISMLDMLKDNNFNAAIFQIRPSADALYTSNIEPWSYFLTGETGTAPSPNYDPLQFW 163 Query: 146 LDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVL 205 ++EAHKRG+++H W NPYR +T G + +L ++ + + + V R + Sbjct: 164 IEEAHKRGLELHVWLNPYRAH-HTNGGAVNKL--SMVNKLSDIVV------RLKNGMYWF 214 Query: 206 DPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFY---TESPGSRLNDNETYRKY--GGAF 260 DP P+ Q +++IV ++V RY +D + FDDYFY T + G+ DN ++ Y G Sbjct: 215 DPANPKTQGHVSNIVKDIVKRYDIDAIHFDDYFYPYATYNKGADFPDNASWNAYVSSGGT 274 Query: 261 ASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDES 320 S+ADWRR+N + + ++ I + K V FG+SP G+W+ P G G++ YDE Sbjct: 275 LSRADWRRDNVNKFVERIYKEIHAEKNNVRFGISPFGIWK--PGYPAG--IVGSSQYDEL 330 Query: 321 YADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWW 358 YAD + W+ +G +DY +PQ+YWP ++ L WW Sbjct: 331 YADAKLWLNKGWVDYFSPQLYWPIDSKGQSFEALLSWW 368 >UniRef50_A0M6M5 Protein containing DUF187 n=4 Tax=Bacteroidetes RepID=A0M6M5_GRAFK Length = 540 Score = 213 bits (542), Expect = 1e-53, Method: Compositional matrix adjust. Identities = 133/391 (34%), Positives = 188/391 (48%), Gaps = 62/391 (15%) Query: 2 DICSRNKKLTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQ------- 54 D C N R I + L L L +CKST V PP +P ++S Q Sbjct: 4 DCCLSNH----FRIPIFILLMLFLNACKST----KVAPPKPVEPQVEEEKSEQNVDQVPE 55 Query: 55 --------------------PMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDK 94 RG W+ATV+ ++WP S N+S +A I+ Sbjct: 56 VEEPEASSENKIVEPPIDIEEFRGAWIATVANINWP--SKNNLSTEAQKAEA-----IEM 108 Query: 95 LDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGE--NPGYDPLQFMLDEAHKR 152 LD L+ N V QV+P AL+ S+I PWS +TGK G+ P YDPL+F ++EAH R Sbjct: 109 LDFLENHNFNAVILQVRPQADALYDSEIEPWSYFLTGKSGKAPQPYYDPLKFWIEEAHNR 168 Query: 153 GMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEV 212 G+++H W NPYR T G S + P V + + +DPG +V Sbjct: 169 GLELHVWLNPYRAHHTT--GKEIGEKSIVKTNPELV-------MELKNGMWWMDPGSSKV 219 Query: 213 QDWITSIVAEVVSRYPVDGVQFDDYFY---TESPGSRLNDNETYRKY--GGAFASKADWR 267 QD ++V ++V RY +D V FDDYFY + + D +++ KY G S+ DWR Sbjct: 220 QDHSAAVVMDIVKRYDIDAVHFDDYFYPYASYNGKKDFPDEKSWEKYVNSGGELSRGDWR 279 Query: 268 RNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRW 327 R N I +++ IK+ K V+FG+SP G+WR P G G Y+E YAD + W Sbjct: 280 RKNVNDFIERIAEEIKAEKSFVKFGISPFGIWR--PGFPKG--ISGMDQYEELYADAKLW 335 Query: 328 VEQGLLDYIAPQIYWPFSRSAARYDVLAKWW 358 + +G +DY PQ+YWP + + VL WW Sbjct: 336 LNKGWIDYFTPQLYWPTRQIGQSFPVLLGWW 366 >UniRef50_Q110S6 Putative uncharacterized protein n=5 Tax=Bacteria RepID=Q110S6_TRIEI Length = 521 Score = 212 bits (540), Expect = 2e-53, Method: Compositional matrix adjust. Identities = 127/374 (33%), Positives = 188/374 (50%), Gaps = 38/374 (10%) Query: 8 KKLTIRRPAIL----VALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLAT 63 K + RR +L + L L L S P S P+ P + RG+W+A+ Sbjct: 26 KNIHFRRKNLLWSCFLILGLTLTQMSSYLPTSRAQQPSSFSP--------REFRGVWVAS 77 Query: 64 VSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKIL 123 V+ +DWP S P Q+ +++ L+ +Q L +N + QV+P+G A + S I Sbjct: 78 VANIDWP-------SQPGLPVTQQKTELLNILNRMQELNLNALVLQVRPNGDAFYNSTIE 130 Query: 124 PWSDLMTGKIGE--NPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTL 181 PWS +TGK G P YDPL+F + E+HKR +++HAWFNPYR ++ G+ ++ Sbjct: 131 PWSGWLTGKQGTPPQPYYDPLEFAIAESHKRNIELHAWFNPYRAQLSPNDGSFASNHAA- 189 Query: 182 SQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFY-T 240 V++ + G LDPG VQD + + +VV RY +D V FDDYFY Sbjct: 190 --------VKYPQYAYRYGKYVWLDPGAKVVQDQTFNTIIDVVRRYDIDAVHFDDYFYPY 241 Query: 241 ESPGSRLNDNETYRKY--GGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGV 298 G D +TY Y G S ++WRR N ++ ++ I + KP V+FG+SP G+ Sbjct: 242 PQGGQEFPDYQTYNSYKASGGTLSLSNWRRQNVNNMVERLYQGIHAEKPYVKFGISPFGI 301 Query: 299 WRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWW 358 + R +P G G Y+ YAD + W+ +G +DY+APQ+YW Y VL WW Sbjct: 302 Y--RPGNPPG--IVGLDQYESLYADVKLWLAKGWVDYLAPQLYWRIDPPKQSYPVLLNWW 357 Query: 359 ADVVKPTRTRLYIG 372 P R +Y G Sbjct: 358 LQ-QNPQRRHIYAG 370 >UniRef50_C9PUA7 FenI protein n=2 Tax=Prevotella RepID=C9PUA7_9BACT Length = 493 Score = 210 bits (534), Expect = 1e-52, Method: Compositional matrix adjust. Identities = 134/425 (31%), Positives = 213/425 (50%), Gaps = 49/425 (11%) Query: 8 KKLTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRL 67 KK+ + A+LV + C+ + P PP + + +RG+W+ATV L Sbjct: 6 KKIALALSAVLV----VACNHDDNILPNKPKKPDTPNPPTQSILPKKELRGVWMATVWGL 61 Query: 68 DWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSD 127 DWP A Q+ + I + L++ IN VF QV+ A + S PW Sbjct: 62 DWP--------RGEYNAESQKASYIAYMKALEKNNINAVFVQVRGRADAFYKSDYEPWCQ 113 Query: 128 LMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPAS 187 +TG++ ++PGYD L+FM+DEAHKRG+ HAWFNPYRV+ TK T + S+ P + Sbjct: 114 YLTGEVDKDPGYDVLRFMIDEAHKRGIAFHAWFNPYRVA--TKKATDAAFPALDSRIPQA 171 Query: 188 VYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFY-TESPGSR 246 + V ++ IR + +P +PEV+ I I+ +++++Y VDGV DDYFY + + G Sbjct: 172 MMVDYKT-IR------MYNPALPEVRQRIFDIIKDLITKYDVDGVHIDDYFYPSLTSGET 224 Query: 247 LNDNETYRKY------GGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWR 300 + D E Y+KY G + ++RRNN + + +++ +P V F VSPAG Sbjct: 225 IKDEEEYKKYAPKDNNGKPTITIEEFRRNNVDLAVKGIHDVVQATRPEVVFTVSPAG--- 281 Query: 301 NRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWAD 360 N ++ Y+ YAD +W +G + I PQ+Y+P +A ++ WW+ Sbjct: 282 NPDYN-----------YNTMYADVVKWSREGWTEAIIPQLYFPMGNAATNFNQRLIWWSQ 330 Query: 361 VVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFR-ED 419 + L+IG Y+ G+P P N EL KQ +++G++L+ +D Sbjct: 331 YT--FKNALFIGYGTYRFGDPK--SPAAYQNAS--ELAKQFAFASKYNKVTGSVLYSAKD 384 Query: 420 YLNKP 424 LN P Sbjct: 385 LLNNP 389 >UniRef50_UPI00016C4E90 hypothetical protein GobsU_27726 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4E90 Length = 481 Score = 209 bits (531), Expect = 2e-52, Method: Compositional matrix adjust. Identities = 123/333 (36%), Positives = 175/333 (52%), Gaps = 36/333 (10%) Query: 37 VTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLD 96 P + PPA ++ R +W+ATVS +DWP S P A Q++ ++ LD Sbjct: 14 CAPVRAADPPALKRE----FRAVWVATVSNIDWP-------SKPGLPADQQKKELLAILD 62 Query: 97 HLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKV 156 + L +N V FQV+P AL+ S++ PWS+ +TG+IG+ PGYDPL F + EAHKRG+++ Sbjct: 63 NAVELKLNAVIFQVRPMADALYASELEPWSEYLTGQIGKAPGYDPLAFAVTEAHKRGLEL 122 Query: 157 HAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHR-DWIRTSGDRFVLDPGIPEVQDW 215 HAWFNPYR S S PA + R D + G ++P PEVQ+ Sbjct: 123 HAWFNPYRA----------RHPSAKSPAPADHLTRKRPDLAKPYGTHAWMNPTNPEVQEH 172 Query: 216 ITSIVAEVVSRYPVDGVQFDDYFY------TESPGSRLNDNET---YRKYGGAFASKADW 266 + +VV RY +DG+ DDYFY T+ D++T Y+K GG S+ DW Sbjct: 173 SLRVFLDVVKRYDIDGIHIDDYFYPYKEKGTDGKVIPFPDDDTWEAYQKQGGKL-SRDDW 231 Query: 267 RRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRR 326 RR+ + ++ K KP V+ G+SP G+W R P G G Y E YAD + Sbjct: 232 RRDAVNVFVRRMYEETKKAKPWVKVGISPFGIW--RPGHPAG--IAGLDQYAELYADAKL 287 Query: 327 WVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWA 359 W +G +DY PQ+YWP ++ + L WWA Sbjct: 288 WFNEGWVDYFTPQLYWPIAQEKQSFPKLLDWWA 320 >UniRef50_C1A9I5 Putative uncharacterized protein n=1 Tax=Gemmatimonas aurantiaca T-27 RepID=C1A9I5_GEMAT Length = 534 Score = 208 bits (529), Expect = 4e-52, Method: Compositional matrix adjust. Identities = 114/321 (35%), Positives = 168/321 (52%), Gaps = 30/321 (9%) Query: 56 MRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGT 115 RG+W+A+V+ +DWP +++ + QQ ++ LD L +N V FQV+P Sbjct: 47 FRGVWVASVANIDWPSKRTLSTAE-------QQAELLALLDRAAELKLNAVIFQVRPAAD 99 Query: 116 ALWPSKILPWSDLMTGKIGENPG--YDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGT 173 AL+ S I PWS+ +TG G P +DPL F++ EAH RGM++HAWFNPYR Sbjct: 100 ALYESSIEPWSEYLTGAQGRRPEPFWDPLAFVIREAHARGMELHAWFNPYRA-------- 151 Query: 174 IRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQ 233 R ++ + + + ++ +DPG P V+ +V +VV RY +DGV Sbjct: 152 -RHTDARSPLARSHIARTNPALVKPYAGYLWMDPGEPAVRARTLRVVLDVVKRYDIDGVH 210 Query: 234 FDDYFYTESPGSR------LNDNETYRKY--GGAFASKADWRRNNTQQLIAKVSHTIKSI 285 DDYFY R D ++ +Y G +++DWRR+N +L+ ++ I Sbjct: 211 IDDYFYPYPENDRRGRAIAFPDTRSWTRYQKSGGKLTRSDWRRDNVNKLVEELYDGIHKT 270 Query: 286 KPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFS 345 KP V FG+SP G+WR RG AY++ YAD R+W+ +G LDY PQ+YWP + Sbjct: 271 KPWVRFGISPFGIWR----PGFPEQIRGLDAYEKLYADARKWLHEGWLDYFTPQLYWPTT 326 Query: 346 RSAARYDVLAKWWADVVKPTR 366 + Y VL WWA K R Sbjct: 327 KREQAYPVLLDWWATENKRAR 347 >UniRef50_B0NT08 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=B0NT08_BACSE Length = 486 Score = 207 bits (527), Expect = 6e-52, Method: Compositional matrix adjust. Identities = 128/410 (31%), Positives = 201/410 (49%), Gaps = 52/410 (12%) Query: 17 ILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQP---MRGIWLATVSRLDWPPVS 73 I + + L+L + E ++ + S+ T S P +RG+W+ATV LDWP Sbjct: 7 IYIIILLVLVAACGKDDEGILDDGSHSQGEGQTSSSVLPGKELRGVWIATVWGLDWPM-- 64 Query: 74 SVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKI 133 A VQ++ D LD L +N VFFQ++ A + S+ PWS +TG Sbjct: 65 ------EKYDADVQKKLYTDYLDLLVGYNMNAVFFQIRGMADAFYESEYEPWSKYITGSA 118 Query: 134 GENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVN-TKPGTIRELNSTLSQQPASVYVQH 192 G P YD L F+++EAHKRG++ HAW NPYR++ K +L++ + + Y + Sbjct: 119 GVRPDYDVLGFLVEEAHKRGIQFHAWLNPYRIATRANKNAAFPKLDAKIPMELVKDYEKI 178 Query: 193 RDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSR-LNDNE 251 R V +P +PEVQ+ I +IV E++++Y VDG+ DDYFY S +ND Sbjct: 179 R----------VYNPALPEVQERIVNIVKEIITKYDVDGIHMDDYFYPSLEASETMNDGA 228 Query: 252 TYRKYG-GAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAG-VWRNRSHDPLGS 309 ++KYG F + D+RRNN ++ + TI +P V F +SPA + RN Sbjct: 229 EFQKYGKDKFKNVEDFRRNNVNTVVRNIQKTIIETRPEVIFSISPAADMERN-------- 280 Query: 310 DTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRL 369 Y+ +AD W ++G +D + PQ+Y+ A +++ W+ L Sbjct: 281 -------YNTLFADVNTWAKEGWVDVVIPQLYFATGNDATSFNLRLDLWSQYT--YENHL 331 Query: 370 YIGIAFYKVGEP---SKIEPDWMINGGVPELKKQLDLNDAVPEISGTILF 416 IG YK G+ SK + +L KQ +L A P++ G++L+ Sbjct: 332 LIGYGIYKFGDSQYGSKFQSS-------DDLMKQFELASAKPKVKGSVLY 374 >UniRef50_A6G0M0 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G0M0_9DELT Length = 540 Score = 205 bits (522), Expect = 3e-51, Method: Compositional matrix adjust. Identities = 119/355 (33%), Positives = 189/355 (53%), Gaps = 37/355 (10%) Query: 53 SQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKP 112 ++ RG+W+ TV ++WP S+ A Q + +D + + +N + FQV+P Sbjct: 86 AREFRGVWVTTVYNINWP-------SSQGLSAAAAQAELASIVDTAEAVNLNAIVFQVRP 138 Query: 113 DGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPG 172 + A++ S + PWS ++G G +PG+DPL F+++EAH RG++VHAWFNPYR + Sbjct: 139 ESDAVYESSLEPWSRYLSGSQGGDPGFDPLAFLIEEAHARGIEVHAWFNPYRGAA----- 193 Query: 173 TIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGV 232 ++ ++ + +Q + T G +DPG +V++ +V +VV RY VDGV Sbjct: 194 -----SAGITLAEPHIALQLPEHAHTYGSSLWMDPGALDVREHTVDVVLDVVERYAVDGV 248 Query: 233 QFDDYFYTESPGSRLNDNETYRKY---GGAFASKADWRRNNTQQLIAKVSHTIKSIKPGV 289 DDYFY G D T+ Y GGA S+ DWRR+N L+ ++ TI + P Sbjct: 249 HLDDYFYPYPNGDDFPDALTWNAYLADGGAL-SQGDWRRDNVNALVEELHDTIAAADPDA 307 Query: 290 EFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAA 349 FG++P G++R + G Y E YAD W+E+G +DY+APQ+YWP + Sbjct: 308 RFGIAPFGIYRPG----IPEGIVGLDQYAELYADPVLWMEEGWVDYLAPQLYWPTYSAQQ 363 Query: 350 RYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLN 404 Y+VL WW+ + P R Y+ Y SK+ DW ++ E+ Q++L+ Sbjct: 364 TYEVLLDWWSS-IDPER---YVFTGNYL----SKLGDDWTLD----EMLYQVELS 406 >UniRef50_C1A7Q3 Putative uncharacterized protein n=1 Tax=Gemmatimonas aurantiaca T-27 RepID=C1A7Q3_GEMAT Length = 501 Score = 205 bits (521), Expect = 3e-51, Method: Compositional matrix adjust. Identities = 118/366 (32%), Positives = 184/366 (50%), Gaps = 34/366 (9%) Query: 53 SQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKP 112 ++ RG+W+ATV+ +DWP + ++I QQ + LD Q+ G+N V V+ Sbjct: 39 TREFRGMWIATVANIDWPSRTGLSIPQ-------QQAEFVALLDVAQQAGLNAVILHVRA 91 Query: 113 DGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPG 172 G AL+PS + PW +G G +PG+DPLQ+ ++++H RG+++HAWFNP+R + Sbjct: 92 AGDALYPSTLEPWMRSFSGTQGVDPGWDPLQYAIEQSHARGIELHAWFNPFRAGNASDTA 151 Query: 173 TIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGV 232 + E A + D +R + DPG D ++V++VV RY VDGV Sbjct: 152 RLAE---------AHFGRKRPDILRRYCSQLWFDPGEAATHDQAIAVVSDVVRRYAVDGV 202 Query: 233 QFDDYFY---TESPGSRLNDNETYRKY---GGAFASKADWRRNNTQQLIAKVSHTIKSIK 286 DDYFY + DN + Y GG A +ADWRR+N + + ++ T++ + Sbjct: 203 HIDDYFYPYPETGCTTDFPDNTAFAAYQRQGGTMA-RADWRRDNVNRFVERLYATVRGLS 261 Query: 287 PGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSR 346 G+SP G+W R +P G G +Y YAD+R W+++G DY APQ+YW + Sbjct: 262 RTARVGISPFGIW--RPGNPAG--ITGLDSYASIYADSRLWLQRGWADYFAPQLYWSSTS 317 Query: 347 SAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDA 406 Y+ L WW R L+ G+A Y++ + S V E+ Q+ + A Sbjct: 318 VGQNYNALLTWWTQ-QNTMRRHLWPGLASYRIADGSSAP------FAVTEISTQIGITRA 370 Query: 407 VPEISG 412 SG Sbjct: 371 QSSASG 376 >UniRef50_C0EGV5 Putative uncharacterized protein n=1 Tax=Clostridium methylpentosum DSM 5476 RepID=C0EGV5_9CLOT Length = 430 Score = 202 bits (515), Expect = 2e-50, Method: Compositional matrix adjust. Identities = 120/394 (30%), Positives = 201/394 (51%), Gaps = 33/394 (8%) Query: 32 PPESMVTPPAGSKPPATTQQSSQ--PMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQ 89 PP + T S+P ++ Q SQ PM G+ S L+W N + QQ Sbjct: 31 PPATGQTDFVSSEPSSSGGQESQKDPMEGMKAVWFSYLEW------NTMFKGASEEQFQQ 84 Query: 90 AMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEA 149 + LD+L +G NTV V+ G A++ S + PWS ++G +G++PGYDPL ++++A Sbjct: 85 KLGTVLDNLVSIGCNTVMMHVRAFGDAMYRSSVYPWSASVSGVLGKDPGYDPLSIIVEKA 144 Query: 150 HKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGI 209 H +G+ VHAW NP R + I + L Q A +++ +++ S ++L+P Sbjct: 145 HAKGIAVHAWINPMRTMTAAEFDQIGDC--ALKQWYAGAQ-RYQYYMKDSSGHYILNPAN 201 Query: 210 PEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRN 269 PEV+ I++ V E+V Y +DGV DDYFY ++ Y + WRR+ Sbjct: 202 PEVRKLISAGVTELVQNYDIDGVHIDDYFYPSGVDGLPENDAQYYQEAAPGTDIGSWRRD 261 Query: 270 NTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVE 329 T +++ ++ +K++KP + FG SP N +D Y D RW+ Sbjct: 262 ATTEMVREMHDAVKAVKPEIPFGASPQSSLTND--------------FDRLYIDIERWIS 307 Query: 330 QGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVG-----EPSKI 384 +GL+DY+ PQIY+ F ++ +D A W ++V +T LY+G+A YKVG + Sbjct: 308 EGLVDYLMPQIYFGFHNTSQPFDQTAAKWNELVG-DKTALYVGLATYKVGLENDQHAGEG 366 Query: 385 EPDWM--INGGVPELKKQLDLNDAVPEISGTILF 416 + +W+ NG L++Q+++ +++P G L+ Sbjct: 367 KTEWIDCFNGENNMLERQVEVLESLPNCKGYCLY 400 >UniRef50_C6XWM5 Putative uncharacterized protein n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6XWM5_PEDHD Length = 481 Score = 202 bits (513), Expect = 3e-50, Method: Compositional matrix adjust. Identities = 121/374 (32%), Positives = 185/374 (49%), Gaps = 49/374 (13%) Query: 54 QPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPD 113 + MR +W+A+V LDWP Q+Q ID L+ + L IN ++FQVK Sbjct: 46 KEMRAVWIASVYGLDWP--------QSVYTMAGQKQQYIDYLEKFKSLNINAIYFQVKGM 97 Query: 114 GTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGT 173 G A + S PWS +TG G +PGYD L+FM+DEAH R ++ HAW NPYR++ Sbjct: 98 GDAFYNSSYEPWSASITGTRGVDPGYDVLKFMIDEAHARDIEFHAWMNPYRIATRA---- 153 Query: 174 IRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQ 233 S+ S PA +W+ + +P +PEV+ + IV E +++Y VDG+ Sbjct: 154 -----SSASSFPALHSSVKPEWVLDFPTIRIYNPALPEVRQRLVDIVKETITKYDVDGIH 208 Query: 234 FDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGV 293 FDDYFY E G D + KYG A+ D+RR+N + I V I + KPGV F V Sbjct: 209 FDDYFYPE--GETFTDQADFTKYGAGMANIQDFRRDNVNKAIKGVYDIIVATKPGVVFSV 266 Query: 294 SPA-GVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYD 352 SPA + +N ++ YAD ++W ++G +D + PQ+Y + Sbjct: 267 SPAPEITKN---------------FNTLYADVKKWNQEGWVDVVIPQLYQEIGNQYNDFQ 311 Query: 353 V-LAKWWADVVKPTRTRLYIGIAFYKVGE---PSKIEPDWMINGGVPELKKQLDLNDAVP 408 + L++W + K L +G +Y+ G+ P+ + EL++Q DL Sbjct: 312 LRLSEWSQNSFKAA---LMVGHGYYRFGDATAPAAFQSS-------SELQRQFDLTRLNK 361 Query: 409 EISGTILFREDYLN 422 ++ G ++ YLN Sbjct: 362 KVVGNAMYSAKYLN 375 >UniRef50_C1I7D2 Putative uncharacterized protein n=1 Tax=Clostridium sp. 7_2_43FAA RepID=C1I7D2_9CLOT Length = 741 Score = 201 bits (511), Expect = 5e-50, Method: Compositional matrix adjust. Identities = 134/401 (33%), Positives = 198/401 (49%), Gaps = 45/401 (11%) Query: 54 QPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQ-QQAMIDKLDHLQRLGINTVFFQVKP 112 + R WL+TV +D V+S NP A + + + LD + L +N V FQV P Sbjct: 19 EQFRTAWLSTVVNIDISDVTS----NPNLSAEEEFKNDLSSILDRFEELNLNAVTFQVSP 74 Query: 113 DGTALWPSKILPWSDLMTGKIGEN----------PGYDPLQFMLDEAHKRGMKVHAWFNP 162 A +PS I PWS + K G N G+DPL++++ E H RGM+ HAWFNP Sbjct: 75 MLDAWYPSDIAPWSQYL-HKGGNNYTLQGKDPGFNGFDPLEWLISETHNRGMEFHAWFNP 133 Query: 163 YRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAE 222 YRV+ + E + L++ + H I ++ LDPG PEV D++ V E Sbjct: 134 YRVTNTVDKRPVSEKLNELAENNFARKNPH--LIYEFQNKLFLDPGRPEVIDYVVQRVEE 191 Query: 223 VVSRYPVDGVQFDDYFY----TESPG-----SRLNDNETYRKYGGAFA-----SKADWRR 268 V ++Y VD + FDDYFY +E+ ++ D +T+ Y F + A WR Sbjct: 192 VANKYNVDAIHFDDYFYPYKYSENNKDIYFYTQDLDKQTFIDYNRGFGEYNIENAAKWRE 251 Query: 269 NNTQQLIAKVSHTIKSIK----PGVEFGVSPAGVWRNRSHDPLGSDT--RGAAAYDESYA 322 NN LI + + I ++FGVSP G+W + + GS+T ++ + +A Sbjct: 252 NNIDILIKAIKDKVTDINITNNRSIQFGVSPFGIWGHAENYLEGSNTPTGSTSSLRDQFA 311 Query: 323 DTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKP-TRTRLYIGIAFYKVGEP 381 +TR+WV++GL+DY+ PQIYW F+ +AA Y L +WW + + LYIG YK Sbjct: 312 NTRKWVKEGLVDYLTPQIYWSFNTAAAPYGELLQWWDSQFEGINNSHLYIGHPNYKY--- 368 Query: 382 SKIEPDWMINGGVP-ELKKQLDLNDAVPEISGTILFREDYL 421 I+ W N P E+ QL N + G+ F D L Sbjct: 369 --IDASWDNNFKNPYEIANQLRFNQKFENVKGSAFFSFDKL 407 >UniRef50_A7VTI3 Putative uncharacterized protein n=1 Tax=Clostridium leptum DSM 753 RepID=A7VTI3_9CLOT Length = 434 Score = 200 bits (509), Expect = 7e-50, Method: Compositional matrix adjust. Identities = 132/421 (31%), Positives = 200/421 (47%), Gaps = 49/421 (11%) Query: 26 CSCKSTPPESMVTPPAGSKPPATTQQSSQPMRG-IWLATVSRLDWPPVSSVNISNPTSRA 84 T + +V PAG + Q+ QP G I L R W P S++ S Sbjct: 36 SGASQTASDQLVNTPAGEE-----SQAEQPPGGQIVLEGEMRAVWVPYLSLDQSKIGQGQ 90 Query: 85 RVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQF 144 Q+A + + + G+NT+ V+P G A++PS+I PWS L+TG G +PG+DPL++ Sbjct: 91 EAFQKAFDEIVSQAKEYGLNTLIVHVRPFGDAMYPSEIYPWSHLLTGTQGGDPGFDPLEY 150 Query: 145 MLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFV 204 M+ + H+ GM+ HAW NP R+ P + P +Y Q R+ + D +V Sbjct: 151 MVRKTHEAGMQFHAWLNPLRIQSKGTPSILA---------PDHLYTQWREDSDPNNDDWV 201 Query: 205 LD--------PGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKY 256 +D P PEV++ I + E+V YPVD + FDDYFY S G+ D + Y+ Y Sbjct: 202 VDWEEGKYFNPAYPEVREKIIEGIREIVENYPVDAIHFDDYFYPTSDGAF--DEKAYQAY 259 Query: 257 GGAFASKA-----DWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDT 311 + WR N L++ V IKSI P V+FG+SP G N + +G Sbjct: 260 TESVGEGVPLTLPQWRIANINTLVSGVYSAIKSINPQVQFGISPQGNITNDLN--MG--- 314 Query: 312 RGAAAYDESYADTRRWVEQ-GLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLY 370 AD W Q G +DY+ PQIY F ++ A W + +LY Sbjct: 315 ----------ADVETWASQKGYVDYLCPQIYVNFDHPLLPFNQTADQWRQMTTAEGVKLY 364 Query: 371 IGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAV 430 IG+A YK G W NG L+++++ + ++ G +L+ DY++ QTQ+ V Sbjct: 365 IGLAVYKAGSEDADSGTW--NGKTDILQREIEYSRSLG-CDGIMLYSWDYMDTSQTQEEV 421 Query: 431 S 431 + Sbjct: 422 A 422 >UniRef50_C0EWT6 Putative uncharacterized protein (Fragment) n=1 Tax=Eubacterium hallii DSM 3353 RepID=C0EWT6_9FIRM Length = 491 Score = 190 bits (482), Expect = 1e-46, Method: Compositional matrix adjust. Identities = 137/445 (30%), Positives = 210/445 (47%), Gaps = 56/445 (12%) Query: 8 KKLTIRRPAILVALALLLCSCKSTPPESMVTPPAGS--KPPATTQQSSQPM--RGIWLAT 63 KK++ R + + L K++ +S VT G+ K ++ QS+ M R +WL+ Sbjct: 41 KKISGRSSSEFINF-LFSTQEKASSNKSSVTSNKGNSKKGNNSSTQSADTMNYRAVWLSY 99 Query: 64 VSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKIL 123 + + SV +N +S + + L ++ +G N + QV+P G AL+ S Sbjct: 100 LEFNSYRK--SVKNNNESSFRKFYKHI----LQQIKTIGCNRIIVQVRPFGDALYASDYF 153 Query: 124 PWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQ 183 PW+ ++G G+NPGYDPL+ M + +HK G+ + AW NPYR+S +IR L+ T Sbjct: 154 PWAACISGTQGKNPGYDPLKIMTEMSHKEGISIEAWINPYRIS---SGNSIRSLSKT--- 207 Query: 184 QPA----SVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFY 239 PA SV R+ + G + +P V++ I V E+V Y VDG+ DDYFY Sbjct: 208 NPARKWFSVQNTKRNILSYEGSLY-YNPSSESVRNLIIQGVKEIVQNYNVDGIHMDDYFY 266 Query: 240 ---TESPGSRLNDNETYRKY------------------GGAFASKADWRRNNTQQLIAKV 278 TE + D Y++ S ADWRR+N +L++ + Sbjct: 267 PSFTEKNVTTAFDAPEYKQQLKTNLSSTDSTSLTSADKSSNEISLADWRRDNVNRLVSGI 326 Query: 279 SHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQ-GLLDYIA 337 +K I V FG+SPAG + D L SD E Y D WV Q G +DY+ Sbjct: 327 YKAVKEINSDVTFGISPAG-----NLDNLRSDL-------EYYVDIDTWVSQNGYVDYLM 374 Query: 338 PQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPEL 397 PQIYW F+ A +D + W +++ + +LYIG+ Y++G + D L Sbjct: 375 PQIYWGFTNEVAPFDKVTDAWCILMENSPVKLYIGLQLYRMGSTEPGQSDEKELQKTSLL 434 Query: 398 KKQLDLNDAVPEISGTILFREDYLN 422 KK+L +I G LF YL+ Sbjct: 435 KKELSYLKKQKKIEGYCLFSYQYLD 459 >UniRef50_B0P7J3 Putative uncharacterized protein n=1 Tax=Anaerotruncus colihominis DSM 17241 RepID=B0P7J3_9FIRM Length = 429 Score = 189 bits (481), Expect = 1e-46, Method: Compositional matrix adjust. Identities = 115/351 (32%), Positives = 169/351 (48%), Gaps = 56/351 (15%) Query: 41 AGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQ---AMIDKLDH 97 G+ + + +RG+W++ ++ P + + Q Q + D D Sbjct: 60 GGAHTAVLSSSVNGEVRGVWISYLTL------------EPMIKGKTQAQFVKNIGDAFDQ 107 Query: 98 LQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVH 157 G NTVF +P G AL+ S+ PWS +TG+ G +PGYDPL+ M+ AH+RG+++ Sbjct: 108 AADFGFNTVFVHARPFGDALYKSEYFPWSRYLTGEEGRDPGYDPLELMVSLAHERGLRIE 167 Query: 158 AWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFV-------LDPGIP 210 AW NPYRV ++ K P S Q + W+ + D + +PG Sbjct: 168 AWINPYRVRLDDK--------------PMSADNQAKKWLASGNDGALAWNGGVYYNPGSA 213 Query: 211 EVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLN-DNETYRKYGGAFASKADWRRN 269 ++ I + V E+V Y VDG+ FDDYFY P + L D TY+ G + ++ADWRR Sbjct: 214 AARELIVNGVREIVENYDVDGIHFDDYFY---PTTDLTFDAATYQASGSSL-TQADWRRE 269 Query: 270 NTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWV- 328 N +L+ V +K P FG+SP G D Y+ +AD R WV Sbjct: 270 NVNKLVHDVYAAVKEANPDCLFGISPQG----------NVDIN----YNGQFADVRTWVS 315 Query: 329 EQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVG 379 E G +DYI PQIY+ + A Y W ++K +LY+GIA YKVG Sbjct: 316 EPGYVDYICPQIYYGYRNGTAPYAETVALWDSMIKVDTIKLYVGIAAYKVG 366 >UniRef50_A9NEM7 Hypothetical surface-anchored protein n=2 Tax=Acholeplasma laidlawii PG-8A RepID=A9NEM7_ACHLI Length = 906 Score = 189 bits (481), Expect = 2e-46, Method: Compositional matrix adjust. Identities = 131/450 (29%), Positives = 216/450 (48%), Gaps = 70/450 (15%) Query: 18 LVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNI 77 LV L +L S + P + + K +++Q +R +W+ P V V+ Sbjct: 11 LVCLTTILLSGFTKPNSNDI------KSFEFEFETNQKLRAVWVT-------PIVGEVST 57 Query: 78 SNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENP 137 + + + M+D L+H + IN + F V+ AL+ S++ P + + G + N Sbjct: 58 FTTETAFKNEMNQMLDILEHYK---INALIFHVRTHNNALYDSELNPKATVF-GSVNFN- 112 Query: 138 GYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELN-----STLSQQPASVYVQH 192 +DPL ++++E RG++ HAW NPYRV N GT+ N S + P++ ++ Sbjct: 113 NFDPLLWLVNETQSRGIEFHAWLNPYRVGTN-YVGTMPAENPASNASNILSNPSNSALK- 170 Query: 193 RDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTE-SPGSRLN--- 248 +L+PG P V+D+I V E++ +YPVD + FDDYFYT L+ Sbjct: 171 -----------ILNPGEPVVRDFIVDTVIEIIEKYPVDAIHFDDYFYTNLGANGALSGAT 219 Query: 249 ------DNETYRKYGGAFAS-----KADWRRNNTQQLIAKVSHTIKSIK----PGVEFGV 293 D +TY YG F + KA+WRR+ ++ VS+ IK+ ++FG+ Sbjct: 220 TILDEPDQQTYVTYGSGFNTTSATDKANWRRHQVNTMVQAVSNAIKNYNQLNGKHIQFGI 279 Query: 294 SPAGVWRNR----SHDPLG------SDTRGAAAYDES-YADTRRWVEQGLLDYIAPQIYW 342 SP G+++N ++D G S T G Y ++D+ W+++G LDYIAPQ YW Sbjct: 280 SPTGIYKNGNGVVTYDEFGKPVTTGSLTTGQTHYSSYLFSDSLHWIKEGWLDYIAPQSYW 339 Query: 343 PFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLD 402 + SAA Y + WW VVK LY GI Y E + +W + E++ QL+ Sbjct: 340 ATNHSAASYYNVMGWWEKVVKYLDVNLYSGIGLYMADESTN-TFNW--KDDMLEMRTQLE 396 Query: 403 LNDAVPEISGTILFREDYL-NKPQTQQAVS 431 + + ++ G ++ Y+ N Q + S Sbjct: 397 YLETLNDVDGLSVYSYKYIRNHYNNQNSTS 426 >UniRef50_C1Q9T9 Uncharacterized conserved protein n=3 Tax=Brachyspira RepID=C1Q9T9_9SPIR Length = 605 Score = 187 bits (474), Expect = 9e-46, Method: Compositional matrix adjust. Identities = 125/390 (32%), Positives = 176/390 (45%), Gaps = 69/390 (17%) Query: 54 QPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPD 113 + R W +TV+ +DWP Q++ +I L+ L N VF QVKPD Sbjct: 48 REFRAAWFSTVANIDWPI--------KGGSENEQKKLIIKHLNTLYENNFNAVFVQVKPD 99 Query: 114 GTALWPSKILPWSDLMTGKIG--ENPGY----DPLQFMLDEAHKRGMKVHAWFNPYRVSV 167 ++PSKI P + G E Y D L+F++DEAHKR ++VHAWFNPYR+S+ Sbjct: 100 AGVIFPSKINPTTRYFFGTASSDEKDEYPFKTDMLKFIIDEAHKRNLEVHAWFNPYRMSL 159 Query: 168 NTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRY 227 E + + + I +R LDPG P +I V EVV Y Sbjct: 160 TYDTNKTYEEQFSKKNFIHTYVSNNLKPIHWYDNRIYLDPGEPISSKYIIDSVIEVVENY 219 Query: 228 PVDGVQFDDYFYTESPGSRLN--------------------DNETYRKYG--GAFASKAD 265 VDG+ FDDYFY + G + +N +Y YG G +A Sbjct: 220 DVDGIHFDDYFYQNAAGGKTYKDWPDRISAEKYGEKSGYDINNTSYDDYGVNGLYA---- 275 Query: 266 WRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSH--DPLGSDTRGAAA-----YD 318 WRR+N +L++ + IKS KP V++ +SPAGVWRN + + +GS A +D Sbjct: 276 WRRDNINRLVSDLYKEIKSRKPYVKWTISPAGVWRNNTKLSEYIGSKYGSATQSYNPNFD 335 Query: 319 ESYADTRRWVEQG------------------LLDYIAPQIYWPFSRSAARYDVLAKWWAD 360 +AD W+ G +D + PQ+YW A +D + KWW + Sbjct: 336 ALHADVLLWLLNGEKTSSLENASDKDGLNRMYIDAVIPQVYWSSYHKTAPFDTIVKWWVN 395 Query: 361 VVKPTRTR----LYIGIAFYKVGEPSKIEP 386 K R LYIG A YK+G + EP Sbjct: 396 EYKKARATNTADLYIGHALYKMGRETNTEP 425 >UniRef50_B2ULM6 Putative uncharacterized protein n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2ULM6_AKKM8 Length = 486 Score = 187 bits (474), Expect = 9e-46, Method: Compositional matrix adjust. Identities = 112/353 (31%), Positives = 180/353 (50%), Gaps = 32/353 (9%) Query: 26 CSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRAR 85 CS + +++ +G PA Q+ R W++TV +DWP S ++ Sbjct: 12 CSLLALASQALGWQTSGESVPAVPQE----FRAAWISTVHNIDWPSRSGLS-------GA 60 Query: 86 VQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFM 145 Q+ +++ L+ +L +N VF QV+P+ AL+ S + PWS ++G G NPGYDPL F Sbjct: 61 AQRAELLNILNTCAQLKLNAVFLQVRPNADALYRSSLEPWSQWLSGP-GVNPGYDPLAFA 119 Query: 146 LDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVL 205 + EAH+RG+++HAWFNP+R N K R + + D ++ +G ++ Sbjct: 120 IQEAHRRGIELHAWFNPFRAKANVKHAVGRN----------HISLTRPDLMKRNGSVLLI 169 Query: 206 DPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKAD 265 +P +D ++ +VV RY +DGV DDYFY R ++ G S+ Sbjct: 170 NPSASASRDHALKVIMDVVRRYDIDGVHLDDYFYPYPTPGRAWSPASFGD--GKSPSQ-- 225 Query: 266 WRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTR 325 RR + + ++KS KP V GVSP G+W R P G + G AY+ D R Sbjct: 226 -RRGYIDGFVQDMYKSVKSSKPWVRVGVSPFGIW--RPGVPGGIEA-GVDAYEHLACDAR 281 Query: 326 RWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKV 378 +W+ +G +DY+APQ+YW S + + L +WWA + +R ++ GIA ++ Sbjct: 282 KWLSRGWVDYLAPQLYWRCSPAKQSFPALMQWWA--AQNSRRPVWPGIATARI 332 >UniRef50_A9KK48 Putative uncharacterized protein n=1 Tax=Clostridium phytofermentans ISDg RepID=A9KK48_CLOPH Length = 490 Score = 185 bits (469), Expect = 4e-45, Method: Compositional matrix adjust. Identities = 124/378 (32%), Positives = 183/378 (48%), Gaps = 53/378 (14%) Query: 81 TSRARVQQQAMIDKL-DHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGY 139 T + + +A ID++ D++ +G+N V V+P G A++ S PWS ++G G++PG+ Sbjct: 135 TGYTKDEFEAQIDEMFDNVVDMGMNAVIVHVRPFGDAMYDSDYFPWSKYISGTQGKDPGF 194 Query: 140 DPLQFMLDEAHKRGMKVHAWFNPYRV-SVNTKPGTIRELNSTLSQQPASVYVQHRDWI-- 196 DPL++M++ AH RG++ HAW NPYR+ S NT T+ + PA R W+ Sbjct: 195 DPLEYMVEAAHDRGLQFHAWLNPYRITSKNTDVKTLA------TNNPA------RKWLTD 242 Query: 197 -RTSGDRFVL--------DPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFY--TESPGS 245 +TS DR VL +P +PEV+ I + V E+V Y VDG+ FDDYFY S Sbjct: 243 KKTSNDRNVLSFDGNLYYNPAVPEVRTLIRNGVLEIVRNYDVDGIHFDDYFYPTLGSNYE 302 Query: 246 RLNDNETYRKYGGAFASKA---------DWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPA 296 ++ D Y+ Y + + +WRR N LI + IK K V FG+SP Sbjct: 303 KVFDATEYKSYVDNYKKQGLDNYILPIDEWRRQNVNTLIKGIYSAIKLEKSDVVFGISPG 362 Query: 297 GVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQ-GLLDYIAPQIYWPFSRSAARYDVLA 355 G DT D Y D W+ + G +DYI PQ+YW F S +D + Sbjct: 363 GFL----------DTLRMK--DRYYVDVDTWLSKPGYVDYICPQLYWSFEHSQYPFDGIL 410 Query: 356 KWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTIL 415 W ++ K T +Y+GI YK S EPD+ N + L + + G + Sbjct: 411 NRWLELRKNTDVNVYVGIPVYK--SASNDEPDFKKNANI--LADMIITCRNSKLVDGYMF 466 Query: 416 FREDYLNKPQTQQAVSYL 433 FR D ++AV L Sbjct: 467 FRYDNFYSNTAKKAVKNL 484 >UniRef50_UPI0001745532 hypothetical protein VspiD_00105 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001745532 Length = 382 Score = 183 bits (465), Expect = 1e-44, Method: Compositional matrix adjust. Identities = 123/391 (31%), Positives = 194/391 (49%), Gaps = 36/391 (9%) Query: 52 SSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVK 111 +S MRG W+A+V L++P S +S RA +++ I L N++ QV+ Sbjct: 22 ASAEMRGAWVASVHNLNFP--SRTGLSADQQRAEIRRIINIAAACRL-----NSLMVQVR 74 Query: 112 PDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKP 171 P+G AL+ S++ PWS +TG G +PGYDPL + E +G+ +HAW NPYR S + Sbjct: 75 PEGDALYRSRLEPWSRFLTGTQGVDPGYDPLATFIAEGKSQGIAIHAWINPYRASTSKAG 134 Query: 172 GTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDG 231 ++ T+ P +V R G +DPG P V+ + +V ++V RY V G Sbjct: 135 KAENHISRTM---PGAV--------RRVGSMLWMDPGDPAVRQHVVRVVEDIVRRYAVRG 183 Query: 232 VQFDDYFY----TESPGSRLNDNETYRKY--GGAFASKADWRRNNTQQLIAKVSHTIKSI 285 V DDYFY T P D+ TY +Y GG +ADWRR N LI ++ + + Sbjct: 184 VILDDYFYPYPGTGLPRGTFPDDTTYGRYQAGGGRLDRADWRRENVNTLIRELHTVVHAN 243 Query: 286 KPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFS 345 + G FGVSP G++ R + P G + + E Y+D W+ +G +DY++PQ+YW + Sbjct: 244 RQGAWFGVSPFGIY--RPNVPRGVEAQ-LDQLTELYSDPVAWLREGTVDYLSPQLYWTDA 300 Query: 346 RSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLND 405 + +L W + V P ++ +A ++G N V E+ +QLD+ Sbjct: 301 GPQSFSSLLGWWRSSSVNPRGILVFPSLAADRLGGSH--------NWPVQEISRQLDIES 352 Query: 406 AVPEISGTILFREDYLNKPQTQQAVSYLQSR 436 ++ G I++ L + T+ LQ R Sbjct: 353 SIRPKGGFIIWSMAPLMR-NTKGVNGVLQGR 382 >UniRef50_C3R8E6 S-layer protein n=24 Tax=Bacteroides RepID=C3R8E6_9BACE Length = 559 Score = 182 bits (461), Expect = 3e-44, Method: Compositional matrix adjust. Identities = 105/293 (35%), Positives = 159/293 (54%), Gaps = 22/293 (7%) Query: 53 SQP---MRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQ 109 SQP +R WL T+ +DWP ++N S R QQ+ + D LD L+ NTV Q Sbjct: 20 SQPKYEIRATWLTTLGGMDWPRNKAINASG----IRRQQKELCDILDRLKAANFNTVLLQ 75 Query: 110 VKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNT 169 + G ++PS I +++ +TG G NPGYDPL F + E HKRGM++HAW V Sbjct: 76 TRLRGDMIYPSAIETFAESLTGSTGGNPGYDPLAFAIGECHKRGMELHAWI------VTI 129 Query: 170 KPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPV 229 G R++ Q +SV ++R + + LDPG P +++++ IV E+ SRY + Sbjct: 130 PAGNTRQVQ---LQGRSSVVRKNRTICKLYKGNWYLDPGNPGTKEYLSCIVKEITSRYDI 186 Query: 230 DGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGV 289 DG+ FD Y E D +TYRKYG K WRR+N ++ ++ IK+IKP V Sbjct: 187 DGIHFDYIRYPEQ-ADNFPDKDTYRKYGKGKELK-QWRRDNITDIVHRLYTDIKTIKPWV 244 Query: 290 EFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYW 342 + SP G +R+ + P +RG AY Y D ++W+++G+ D + P +Y+ Sbjct: 245 KVSSSPIGKYRDTNRYP----SRGWNAYHVVYQDAQKWLKEGIHDALFPMMYF 293 >UniRef50_A6L917 Putative uncharacterized protein n=5 Tax=Bacteroidales RepID=A6L917_PARD8 Length = 495 Score = 179 bits (455), Expect = 1e-43, Method: Compositional matrix adjust. Identities = 116/339 (34%), Positives = 165/339 (48%), Gaps = 30/339 (8%) Query: 54 QPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPD 113 + +R +WL TV LDWP + + + QQQA++D LD LQ N VF Q + Sbjct: 28 KEIRAVWLTTVYGLDWPHKPATT----EAGRKAQQQALLDILDRLQEANFNMVFIQARLR 83 Query: 114 GTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGT 173 G ++ S I P S +GK GE PGYDPL F++DE HKRGM+ HAWF V GT Sbjct: 84 GDVMYRSAIEPVSKTFSGKYGELPGYDPLAFVVDECHKRGMECHAWF------VTFPLGT 137 Query: 174 IRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQ 233 + S Q SV + + + LDPG+PE D+I S+V E+V+ Y +DG+ Sbjct: 138 EK---SVKEQGKLSVVKKKPKLCKRHNGEWYLDPGVPETADYILSLVKEIVNGYDIDGIH 194 Query: 234 FDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGV 293 FD Y E + D Y K G S ADWRR N +++ ++ +K KP V+ Sbjct: 195 FDYIRYPEE-AKKFPDKALYNK-SGKKKSLADWRRENINRMVYRIYDWVKQTKPWVQVSS 252 Query: 294 SPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDV 353 SP G + P G AY+ + D + W++QG D I P +Y+ + Sbjct: 253 SPLGKYNRIERVP----NAGWTAYESVFQDPKMWMQQGKQDMIVPMMYYLHKNF---FPF 305 Query: 354 LAKWWADVVKPTRTRLYI-GIAFYKVGEPSKIEPDWMIN 391 + W V RL + G+ Y++ K E DW +N Sbjct: 306 VDNW----VDNCNGRLVVPGLGAYRM---DKSEADWAVN 337 >UniRef50_C9LEC6 YngK protein n=1 Tax=Prevotella tannerae ATCC 51259 RepID=C9LEC6_9BACT Length = 537 Score = 178 bits (452), Expect = 3e-43, Method: Compositional matrix adjust. Identities = 120/367 (32%), Positives = 177/367 (48%), Gaps = 40/367 (10%) Query: 57 RGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTA 116 R +WL T LDWP +P + QQ+ + LD LQ +NTV FQV+ GT Sbjct: 29 RAVWLTTFLGLDWP-----KGHDPLT----QQKQLCRILDQLQAAKVNTVLFQVRLRGTT 79 Query: 117 LWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRE 176 + S I PW + TG G P YDPL F ++E H+RGM++HAW + V + Sbjct: 80 AYDSDIEPWDGIFTGTPGRRPTYDPLAFAIEECHRRGMELHAWMVAFPVC---------K 130 Query: 177 LNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDD 236 LN + SV +H + R S D++++DPG+P D++ ++ E+VS+Y VDG+ D Sbjct: 131 LNVLKALGTKSVVRKHPELCRRSDDQYIMDPGMPGTADYLANLCRELVSQYDVDGIHLDY 190 Query: 237 YFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPA 296 Y E+ G +D TYRKYG KA WRR+N +++ K+ +KS KP V +P Sbjct: 191 IRYPEA-GLHFDDAATYRKYGKGRELKA-WRRDNVTRVVEKIHEAVKSQKPWVRLSCAPV 248 Query: 297 GVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAK 356 G + D +RG A D D W+ +G +D + P +Y+ Y + Sbjct: 249 G----KYADLPRQSSRGWNARDAVGQDAVMWLNKGWMDVLFPMMYFD---GDNYYPFVLD 301 Query: 357 WWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDA-VPEISGTIL 415 W + + R + G+ Y + S E +W P L Q +N E G L Sbjct: 302 W---LERAERGTVAPGLGVYCL---SAGEKNW------PLLTLQRQMNFLRTAEAGGFAL 349 Query: 416 FREDYLN 422 FR D+L Sbjct: 350 FRSDFLT 356 >UniRef50_D1PRQ4 FenI protein n=1 Tax=Subdoligranulum variabile DSM 15176 RepID=D1PRQ4_9FIRM Length = 412 Score = 178 bits (451), Expect = 4e-43, Method: Compositional matrix adjust. Identities = 113/336 (33%), Positives = 169/336 (50%), Gaps = 45/336 (13%) Query: 51 QSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQV 110 + + P R +W VS L+W V+ S + +R + LD++ +G V QV Sbjct: 48 ERTAPYRAVW---VSYLEW---QQVDFSGADAFSR----DIAAMLDNIASVGATVVLAQV 97 Query: 111 KPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTK 170 +P G AL+PS P+S L TG G +PG+DPL +++ AH G+++ AW NPYR+ Sbjct: 98 RPFGDALYPSDYFPFSHLCTGIQGRDPGFDPLALLVEAAHASGLELEAWVNPYRLQAGGV 157 Query: 171 PGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVD 230 P + Q PA V H DW++ + LDP P+V+ +I V E+ Y +D Sbjct: 158 P-------ALCDQSPA---VTHPDWVKKTETGSYLDPANPDVRQYIADGVEELCRNYALD 207 Query: 231 GVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVE 290 G+ FDDYFY + S D Y S ADWRR+N L++ + H + + + GV Sbjct: 208 GIHFDDYFYPTT--SATFDAAEYAAAQTGL-SLADWRRDNVNALMS-LCHGVTA-RYGVR 262 Query: 291 FGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQ-GLLDYIAPQIYWPFSR--- 346 GV+P G DP YD Y+D RW+ Q G +DY+ PQ+YW + Sbjct: 263 LGVAPLG-------DP-------ELCYDGQYSDAARWLAQGGYVDYLMPQLYWGLTYEQN 308 Query: 347 --SAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGE 380 +A D LA WAD+ + LY+G+ Y++G+ Sbjct: 309 GDTAHSLDTLAARWADLPRAEGVALYVGLGAYRIGD 344 >UniRef50_C7H8A9 FenI protein n=2 Tax=Faecalibacterium prausnitzii RepID=C7H8A9_9FIRM Length = 425 Score = 174 bits (440), Expect = 8e-42, Method: Compositional matrix adjust. Identities = 107/297 (36%), Positives = 153/297 (51%), Gaps = 40/297 (13%) Query: 95 LDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGM 154 +D+ LG+NTV QV+P G AL+ S + PWS L TG G++PG+DPL +L EAH RG+ Sbjct: 93 MDNCLSLGLNTVIAQVRPFGDALYRSSLFPWSHLCTGVQGQDPGFDPLDVLLTEAHARGL 152 Query: 155 KVHAWFNPYRV-SVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQ 213 + AW NPYR S + P I E +S+ H +WI T + L+P IPE Sbjct: 153 SLEAWVNPYRFRSSASMPPAIAE---------SSLLNTHPEWICTVNEGAYLNPAIPEAA 203 Query: 214 DWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQ 273 D++ VAE+V Y VDG+ FDDYFY + S D + G + WRR N + Sbjct: 204 DYVVQGVAELVQNYAVDGIHFDDYFYPTTDPSI--DAAQFAASGETDLTA--WRRANVTR 259 Query: 274 LIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWV----E 329 L+ +K+ P + FGVSP G N +D +E Y D W+ Sbjct: 260 LVKAAHDAVKAADPTLRFGVSPQG---NPDND-----------RNEQYTDLSVWLTASGA 305 Query: 330 QGLLDYIAPQIYWPF----SRSAARY---DVLAKWWADVVKPTRTRLYIGIAFYKVG 379 ++DY+ PQIYW + S + R+ ++ A+W A + + T LY G+ Y+VG Sbjct: 306 DAVVDYLCPQIYWGYGYTLSSGSTRFSFENITAEWLA-LPRAESTALYFGLGAYRVG 361 >UniRef50_Q7MWV9 YngK protein n=2 Tax=Porphyromonas gingivalis RepID=Q7MWV9_PORGI Length = 515 Score = 172 bits (437), Expect = 1e-41, Method: Compositional matrix adjust. Identities = 134/418 (32%), Positives = 197/418 (47%), Gaps = 55/418 (13%) Query: 18 LVALALLL---CSCKSTPPESMVTPPAGSKPPATTQQS----------SQPMRGIWLATV 64 V++A LL C K P +PP P T + + MRG+WL T+ Sbjct: 10 FVSIAALLFAGCGTKKVAP----SPPTVKPLPDTVIAAPVIEPWVSPVREEMRGVWLTTI 65 Query: 65 SRLDWPPVSSVNISNPTSRA-RVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKIL 123 LDWP S+ PT+ R Q++ + LD L+R NTVFFQV+ G ++PS+I Sbjct: 66 YGLDWPQRSA-----PTAEGLRKQREELCRILDRLKREKFNTVFFQVRHRGDVIYPSEIE 120 Query: 124 PWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQ 183 P S + TG P YDPL+F L E HKRG+ HAW + T G + + S + Sbjct: 121 PQSTIFTGT--GKPDYDPLEFALKECHKRGLTFHAWL------IVTPLGPDKHIRSLKGE 172 Query: 184 QPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESP 243 SV +H +W + + L+PG+PE + + S+V E+V +YPVDG+ D Y E Sbjct: 173 ---SVKSRHPEWCVRHNNLWYLNPGVPEARAYFASLVREIVEKYPVDGIHLDYMRYPEK- 228 Query: 244 GSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRS 303 +D TY++YGG A WRR N L+A V P V+ V+ G R + Sbjct: 229 AKIFDDAATYKQYGGNM-DPAAWRRRNLSDLMADVHRAATEKTPWVQVSVATIGRLRKLA 287 Query: 304 HDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVK 363 G T AY+ + D W ++G +D++ P +Y+ R Y L W A + Sbjct: 288 GKRGGDWT----AYEGVHQDPVVWAQEGSVDFLVPMLYY---RDDLFYPFLEDWKAQL-- 338 Query: 364 PTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYL 421 + G+A Y+V + S+ P +I +Q+D + +G LFRED L Sbjct: 339 -PDLPIIPGLATYRVVDNSQW-PAQVIG-------EQIDSARHI-GFAGVCLFREDQL 386 >UniRef50_B9Y560 Putative uncharacterized protein n=1 Tax=Holdemania filiformis DSM 12042 RepID=B9Y560_9FIRM Length = 408 Score = 169 bits (429), Expect = 1e-40, Method: Compositional matrix adjust. Identities = 121/428 (28%), Positives = 201/428 (46%), Gaps = 52/428 (12%) Query: 10 LTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDW 69 + I+R I V + LL+ AG P T + +R W++ + Sbjct: 7 MHIKRILIAVFVILLIF--------------AGCHPRKKTGTMGE-VRAAWISYIE---- 47 Query: 70 PPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLM 129 +SS+ + N + +Q + L++L+ + NTV+ A +PS+ P + + Sbjct: 48 --LSSI-LDNRSETDYIQ--GVKTMLENLKAMNFNTVYVHASAFTDAYYPSQYYPTAQYV 102 Query: 130 TGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVY 189 G+IG+N YDP + +AH+ G + AW NP R S T + ++S + Q + Sbjct: 103 AGQIGQNVAYDPFGLFVQQAHEAGFHIEAWINPMR-SFRTDQESQIPVSSVIGQWLSDPT 161 Query: 190 VQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLND 249 ++ I GDR+ L+P PEV++ I ++ E+ YP+DG+ DDYFY + + Sbjct: 162 MRGTR-IVAEGDRWYLNPAYPEVRELICAVAKELAQNYPIDGLHLDDYFYPDGVSESFDQ 220 Query: 250 --NETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPL 307 + YR+ GG S DWRR N +++A + T+K + ++ G+SPAG N + Sbjct: 221 VAYQAYRQTGGEL-SLGDWRRQNINEMVASLYATVKQVDKTIQVGISPAG---NLEY--- 273 Query: 308 GSDTRGAAAYDESYADTRRWVEQ-GLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTR 366 + + Y D R WV G LDYI PQIY+ + +D K W D+ + T Sbjct: 274 --------SVESIYGDVREWVRHDGYLDYILPQIYFGYEHGTLPFDQCLKQWEDLTQGTS 325 Query: 367 TRLYIGIAFYKVGEPSKIEPD----WMINGGVPELKKQ-LDLNDAVPEISGTILFREDYL 421 T L +G+A YK+ D W + + LK+Q L+L D ++G +F + L Sbjct: 326 TELIVGLAAYKINTVDNYAKDGKYEWQQHDDI--LKRQILELRDHAA-VAGFSIFSYNSL 382 Query: 422 NKPQTQQA 429 +P + A Sbjct: 383 FQPAAENA 390 >UniRef50_B0MQ11 Putative uncharacterized protein n=1 Tax=Eubacterium siraeum DSM 15702 RepID=B0MQ11_9FIRM Length = 511 Score = 169 bits (427), Expect = 3e-40, Method: Compositional matrix adjust. Identities = 112/357 (31%), Positives = 173/357 (48%), Gaps = 38/357 (10%) Query: 81 TSRARVQQQAMI-DKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGY 139 T ++ Q ++ I D D+ LGINTV+ V+P G A++ S PWS TG IG++PGY Sbjct: 165 TGKSESQLRSGIGDYYDNCLSLGINTVYVHVRPFGDAIYKSDYFPWSKYCTGYIGKDPGY 224 Query: 140 DPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTS 199 DPL+ M+DEAH RG+ AW NP R + ST + + D+I Sbjct: 225 DPLKVMIDEAHARGISFQAWVNPLRCYYEDDAPDV----STAYKTGQWYDTKDGDYIVKV 280 Query: 200 GDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFY--TESPGSRLNDNETYRKYG 257 + L+P EV D I + AE+VS+Y VDGV DDYFY TE+ + N + Sbjct: 281 KSYWWLNPAYKEVTDLIANGAAELVSKYDVDGVHIDDYFYPTTEAYFDSIAFNAS----- 335 Query: 258 GAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAY 317 +++S + +R +N +++A + +KS P FGVS G N Sbjct: 336 -SYSSLSQFRLDNCSRMVADMYKAVKSHNPTALFGVSAQGNVTNNET------------- 381 Query: 318 DESYADTRRWV-EQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFY 376 + YAD +W E G +DY+APQIY+ F ++ + + W ++ T L G+A Y Sbjct: 382 -QLYADVEKWSKEDGYVDYMAPQIYYGFDNGGQPFEQVVERWDKMLAGTGKSLIPGLAVY 440 Query: 377 KVGEPSKIEPDWMING------GVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQ 427 K+G E +W +G +K+Q+ + G IL+ ++ +P + Sbjct: 441 KIG----TEDEWAGSGRYEWQNDKEIIKRQIVKSQKTSNYGGVILYSYQFIFEPDSN 493 >UniRef50_C3QJ47 S-layer protein n=5 Tax=Bacteroides RepID=C3QJ47_9BACE Length = 488 Score = 168 bits (425), Expect = 4e-40, Method: Compositional matrix adjust. Identities = 99/292 (33%), Positives = 154/292 (52%), Gaps = 19/292 (6%) Query: 51 QSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQV 110 Q +R W+ V LDWP + + P + R Q++ +ID LD L+ NT+ FQ Sbjct: 20 QPKHEVRAAWVTAVYGLDWPRTRA---TTPQT-IRKQKEELIDILDKLKAANFNTILFQT 75 Query: 111 KPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTK 170 + G L+PS I P++ ++TGK G NPGYDPL F ++E HKRGM+ HAW V Sbjct: 76 RTRGDVLYPSAIEPFNSILTGKTGGNPGYDPLAFAVEECHKRGMECHAWM------VTIP 129 Query: 171 PGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVD 230 G + + S SQ SV + ++ + L+PG P ++++ +V EVVS Y VD Sbjct: 130 LGNKKHVASLGSQ---SVTKRMKEICVPYKREYFLNPGHPATKEYLMKLVREVVSGYDVD 186 Query: 231 GVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVE 290 GV FD Y E+ D +R+Y + WRR+N +++ + +K++KP V+ Sbjct: 187 GVHFDYLRYPEN-APLFPDKYDFRRYNKG-RTLDQWRRDNISEIVRYIYKGVKAMKPWVK 244 Query: 291 FGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYW 342 P G +R+ S P +RG A+ Y D + W+ +G++D I P +Y+ Sbjct: 245 VSTCPVGKYRDTSRYP----SRGWNAFFTVYQDPQGWMGEGIMDQIYPMMYF 292 >UniRef50_C3J8B5 YngK protein n=2 Tax=Bacteria RepID=C3J8B5_9PORP Length = 535 Score = 166 bits (419), Expect = 2e-39, Method: Compositional matrix adjust. Identities = 96/291 (32%), Positives = 150/291 (51%), Gaps = 19/291 (6%) Query: 52 SSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVK 111 +++ +RG+WL T+ LDWP +V+ + S Q++ + LD L NTVFFQV+ Sbjct: 67 NAEAIRGVWLTTIYGLDWPSRRAVSTQDMVS----QRKELCRILDRLAESHFNTVFFQVR 122 Query: 112 PDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKP 171 G ++PSKI P + TG YDPLQF ++E HKRG+ +HAW + + + Sbjct: 123 HRGDVIYPSKIEPRVTVFTGGRNNYLDYDPLQFAIEECHKRGLSIHAWIVTFPLGNTSHV 182 Query: 172 GTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDG 231 ++ + SV+ +HRDW T + + L+PG PE + +ITS+V E+V RY +DG Sbjct: 183 QSLGD---------NSVWKKHRDWCFTLHNDWYLNPGHPEARSYITSVVREMVERYDLDG 233 Query: 232 VQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEF 291 V FD Y + + D Y +YG S +WR +N + +VS + S+KP + Sbjct: 234 VHFDYVRYPDKMREK-EDQNLYMRYGKG-RSLGEWRTSNISAFLKEVSTEVCSVKPHMLV 291 Query: 292 GVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYW 342 +P G R P G A + + D W +G +D+I P +Y+ Sbjct: 292 SAAPLGKLRVLPSMP----NVGWTARESVFQDPAAWYREGSVDFIVPMMYY 338 >UniRef50_B3QYB7 Putative uncharacterized protein n=1 Tax=Chloroherpeton thalassium ATCC 35110 RepID=B3QYB7_CHLT3 Length = 489 Score = 164 bits (415), Expect = 6e-39, Method: Compositional matrix adjust. Identities = 101/340 (29%), Positives = 168/340 (49%), Gaps = 43/340 (12%) Query: 54 QPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPD 113 + +RG+W+AT +DWP T Q++++ + +++ +N VFFQV+ Sbjct: 42 EQLRGVWIATAYGIDWPK---------TYDPEKQKESLQEIFHDIKKKNLNAVFFQVRIR 92 Query: 114 GTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGT 173 G L+ S P+S+++TG +G P YDP+ + + A + G++ HAWFN + +N K T Sbjct: 93 GDVLFYSPYEPFSNVLTGSLGVIPDYDPVAYAISLAKENGLEFHAWFN--TMILNGKNST 150 Query: 174 IRELNSTLSQQPASVYVQHRDWIRTSGDRFV------LDPGIPEVQDWITSIVAEVVSRY 227 + S+ A ++ H +WI + L+P +PEV+ + ++ + RY Sbjct: 151 PQ------SEGVAHIWQAHPEWIDKRARKNAWQKTAYLNPALPEVRAHLIRLITDFAERY 204 Query: 228 PVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKP 287 +DG+Q DDY P D+E + KY S DWRR N Q + + ++ KP Sbjct: 205 DIDGIQLDDYL--RYPTKDFPDDEEFEKYNPKKLSLDDWRRENINQFVGDLYDSLMQRKP 262 Query: 288 GVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRS 347 ++FGV+P GV+ D +Y + Y D+R WV + DY+APQIY+ ++ Sbjct: 263 YLKFGVTPIGVYTR------VDDVPAMESYHDVYQDSREWVRRKKCDYLAPQIYFHTGKT 316 Query: 348 AAR----------YDVLAKWWADVVKPTRTRLYIGIAFYK 377 A ++ L + W + P R LY+GI YK Sbjct: 317 TAADRRKNKTNPPFENLVRDWGGNM-PFR-HLYVGIGMYK 354 >UniRef50_UPI0001C37647 hypothetical protein RflaF_08645 n=1 Tax=Ruminococcus flavefaciens FD-1 RepID=UPI0001C37647 Length = 379 Score = 162 bits (410), Expect = 2e-38, Method: Compositional matrix adjust. Identities = 111/401 (27%), Positives = 186/401 (46%), Gaps = 49/401 (12%) Query: 16 AILVALALLLCSCK-STPPESMVTPPAGSKPPATTQQSSQPM-----RGIWLATVSRLDW 69 A++ A LL C + PE++ P + A + P+ +G+W+ + ++ Sbjct: 6 AVMALSAFLLGRCTPAAMPENLKQPDPAAVSEAAANKEYAPLNYEYQKGMWIPYLDYAEY 65 Query: 70 PPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLM 129 + + R R+ A G NTV+ ++P G A + S P + Sbjct: 66 MQGKTADDFRSAIRKRLSDAA---------DSGTNTVYVHIRPTGDAYYKSTFFPKGRYL 116 Query: 130 TGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVY 189 G YDPL+ MLDEAHK G+ VH W NP R+ + T+ +S +++Q S Sbjct: 117 DGD------YDPLEIMLDEAHKLGLSVHGWINPLRLQTAEEMETVP--DSAITKQWYSSG 168 Query: 190 VQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLND 249 I +G R L P PEV++ + + + E++ Y VDG+ DDYFY ++ S D Sbjct: 169 DSMN--IGETGGRLYLRPDSPEVRELLANEIREIIGSYDVDGIHIDDYFYPDTDPSF--D 224 Query: 250 NETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGS 309 +E++ G + +K WR + +++ + +K V FG+SP G R Sbjct: 225 SESFALSGESDLTK--WRTDAVSEMVKAMYSAVKDTDERVLFGISPQGNVR--------- 273 Query: 310 DTRGAAAYDESYADTRRWV-EQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTR 368 A Y+ YAD RRW+ E+G DYI PQIY+ F + + + W + + + R Sbjct: 274 -----ADYETQYADVRRWISEKGFCDYIVPQIYYGFKNETLPFTSVLEEWERMAENSNVR 328 Query: 369 LYIGIAFYKVGEPSK-----IEPDWMINGGVPELKKQLDLN 404 L IG+ YK+G+ + E +W+ + G+ + + Q L+ Sbjct: 329 LIIGLGAYKLGKEDRWAGESGESEWLDDPGIIDKQTQAVLD 369 >UniRef50_C4FZ05 Putative uncharacterized protein n=1 Tax=Abiotrophia defectiva ATCC 49176 RepID=C4FZ05_ABIDE Length = 562 Score = 162 bits (409), Expect = 3e-38, Method: Compositional matrix adjust. Identities = 105/295 (35%), Positives = 149/295 (50%), Gaps = 31/295 (10%) Query: 91 MIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAH 150 + + D + G+N ++ V+P A++ S PWS +GK G +PG+DPL M++ AH Sbjct: 220 ITEMFDKIAASGMNEIYVHVRPFSDAMYRSVYFPWSKYASGKQGVDPGFDPLAIMVNAAH 279 Query: 151 KRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDW-----IRTSGDRFVL 205 R +K+HA+ NPYRV G++ +NS PA ++ D + G + Sbjct: 280 TRNLKLHAYINPYRVCAEADFGSL-AVNS-----PAYKWLNDDDEENDRNVLKFGKMYYY 333 Query: 206 DPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFY--TESPGSRLNDNETYRKYGGAFA-- 261 +P +V + I + VAE+V Y VDGV FDDYFY S S D+E Y Y A Sbjct: 334 NPSSDDVINLINNGVAEIVKNYDVDGVIFDDYFYPTLGSNYSSKFDSEEYADYKLNTANP 393 Query: 262 -SKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDES 320 S DWRR+N +++ V T+KS FG+SPAG N A D+ Sbjct: 394 MSIVDWRRDNINKMVKTVYATVKSSGKNRTFGISPAGNLTN------------LRANDKY 441 Query: 321 YADTRRW-VEQGLLDYIAPQIYWPFSRSAARY-DVLAKWWADVVKPTRTRLYIGI 373 Y D RW E G +DYIAPQ YW F S + D ++KW A V P +LY+ + Sbjct: 442 YVDIDRWGRETGFVDYIAPQQYWGFEHSICPFEDNVSKWMAVVTNPN-VKLYVAL 495 >UniRef50_B0NXH7 Putative uncharacterized protein n=3 Tax=Clostridiales RepID=B0NXH7_9CLOT Length = 468 Score = 161 bits (408), Expect = 4e-38, Method: Compositional matrix adjust. Identities = 109/345 (31%), Positives = 163/345 (47%), Gaps = 27/345 (7%) Query: 101 LGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWF 160 LG+N V QV+P G A++ SK PWS ++GK G NPG+DPL+ M++ AH MK+ AW Sbjct: 138 LGMNRVIVQVRPFGDAIYKSKYFPWSKYISGKQGRNPGFDPLKIMVEVAHDNDMKIEAWV 197 Query: 161 NPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIV 220 NPYRV+ T ++L + R + + G +P V+ IT+ V Sbjct: 198 NPYRVT--TGSTNYKKLAKNNQARKWHAKKSTRRNVLSYGGSLYYNPSKKAVRTLITNGV 255 Query: 221 AEVVSRYPVDGVQFDDYFY---TESPGSRLND--NETYRKYGGAFASKADWRRNNTQQLI 275 E+V Y VDG+ DDYFY T+ + D Y S +RR L+ Sbjct: 256 KEIVQNYDVDGIHMDDYFYPSFTKRNVKKAFDAKEYKKSSYKKKKKSIYTYRRAQINTLV 315 Query: 276 AKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQG-LLD 334 ++ +KS+ P V +G+SPAG + D L S Y D +W+ +D Sbjct: 316 KQMKKAVKSVDPNVTYGISPAG-----NIDNLTSKY-------SYYVDIYKWLNSTEYVD 363 Query: 335 YIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYK----VGEPSKIEPDWMI 390 YI PQ+YW F A+++ + W K + +LYIGIA Y+ VG+ +W Sbjct: 364 YICPQVYWGFKHPTAKFNKVTDRWIKAAKSKKVKLYIGIAVYRAGHNVGQNRAERKEWKR 423 Query: 391 NGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQS 435 + V LKKQ+ + G F L + +AV+ L++ Sbjct: 424 DTKV--LKKQVQYARK-KHVDGFAFFDYQDLKSKTSAKAVNQLKT 465 >UniRef50_C2M9G1 YngK protein n=1 Tax=Porphyromonas uenonis 60-3 RepID=C2M9G1_9PORP Length = 530 Score = 158 bits (399), Expect = 4e-37, Method: Compositional matrix adjust. Identities = 112/374 (29%), Positives = 178/374 (47%), Gaps = 34/374 (9%) Query: 50 QQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDK-LDHLQRLGINTVFF 108 Q MR +WL T+ LDWP +++ T V+QQ +DK LD R GINTVF Sbjct: 57 QHPKAEMRAVWLTTIWGLDWPKMTA-----DTHAGMVRQQESLDKMLDDCVRAGINTVFL 111 Query: 109 QVKPDGTALWPSKILPWSDLMTGKIGENP-GYDPLQFMLDEAHKRGMKVHAWFNPYRVSV 167 QV+ G L+PS + P S ++ K G P GYDPLQ+ +D H RGM VHAW Y + Sbjct: 112 QVRMRGDLLYPSTLEPLSTTIS-KTGVLPEGYDPLQYAIDACHHRGMSVHAWMVSYPLGT 170 Query: 168 NTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRY 227 N +R L++Q Y H + G+ + +DP P V+ + +V ++V+RY Sbjct: 171 NDH---VR----ALAKQGKGFYAAHPEMCLRQGNAWFMDPAQPAVRTHMAQLVRDLVTRY 223 Query: 228 PVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKP 287 VDGV D Y + P S+ ND ++Y++ + WR N +I + T++ + P Sbjct: 224 DVDGVHLDYIRYPDGP-SKFNDLKSYQRMNPDRLPRMAWREANVTAMIDTLHRTLQEVAP 282 Query: 288 GVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRS 347 V + G ++ G G D+ D W ++G++D+I P IY+ Sbjct: 283 EVALSTACIGKYQQLPKPAPG----GYFCKDDVSQDPLVWFQRGIVDFIVPMIYY----K 334 Query: 348 AARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAV 407 ++ WA + P + G+ Y++ + S+ W + ++ QLD A Sbjct: 335 DGHFNYYIADWAKRIAP-HGPIVAGLGVYRLYDNSR----WKLQ----DIYNQLD-TLAQ 384 Query: 408 PEISGTILFREDYL 421 ++SG +R + L Sbjct: 385 YDLSGVSYYRAEQL 398 >UniRef50_A6DH63 Putative uncharacterized protein n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DH63_9BACT Length = 225 Score = 150 bits (378), Expect = 1e-34, Method: Compositional matrix adjust. Identities = 85/240 (35%), Positives = 125/240 (52%), Gaps = 24/240 (10%) Query: 56 MRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGT 115 MR W+A+V+ DWP ++++ QQ+ D LD +L +NT+ FQV+P G Sbjct: 1 MRAAWVASVANTDWPSKQGLSVAQ-------QQKECRDLLDLAVQLKLNTIIFQVRPHGD 53 Query: 116 ALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIR 175 AL+ S PWSD +TG G+ PGYDPLQ+ +D+ HKR +K+HAWFNPYRV P Sbjct: 54 ALYKSSFEPWSDRLTGIQGKYPGYDPLQYWIDQCHKRKLKIHAWFNPYRVQ---HPTVKE 110 Query: 176 ELNSTLSQQPASVYVQHRDWIRTSGDRFV-LDPGIPEVQDWITSIVAEVVSRYPVDGVQF 234 L S Q+ A + W +V L+P V+ ++ ++V + RY +DG+ Sbjct: 111 PLASNSLQRKA------KPWCIYLKKGYVWLNPASKAVRQYVQTVVFDCARRYNIDGIHL 164 Query: 235 DDYFYTES---PGSRLNDNETYRKYGGA----FASKADWRRNNTQQLIAKVSHTIKSIKP 287 DDYFY P + D++ Y Y + K WRR+ LI + +K +KP Sbjct: 165 DDYFYPYKDFLPATGFPDHKEYSAYLSSKPQKVMDKEMWRRHQVNTLIYSLHKGLKRLKP 224 >UniRef50_C2FS67 FenI family protein n=1 Tax=Sphingobacterium spiritivorum ATCC 33300 RepID=C2FS67_9SPHI Length = 327 Score = 145 bits (366), Expect = 3e-33, Method: Compositional matrix adjust. Identities = 78/198 (39%), Positives = 115/198 (58%), Gaps = 12/198 (6%) Query: 222 EVVSRYPVDGVQFDDYFYT--ESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVS 279 +VV Y VDG+ FDDYFY ++ + L D T+ ++G FA+ DWRRNN LI + Sbjct: 2 DVVKNYDVDGIHFDDYFYPYPDARNTALPDAPTFHQFGRGFANIHDWRRNNVDLLIRDLG 61 Query: 280 HTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQ 339 IK KP +++G+SP G+W N+ +P GS+T G + Y YAD +W+++G +DYI PQ Sbjct: 62 IAIKKEKPFIKYGISPFGIWDNKRDNPDGSNTSGLSGYRTLYADGVKWMKEGWIDYINPQ 121 Query: 340 IYWPFSRSAARYDVLAKWWADVVKPTRTR-LYIGIAFYKVGEPSKIEPDWMINGGVPELK 398 IY+PF+ AA +++L +WW K T R Y+G Y+V E P W G +P+ Sbjct: 122 IYFPFNNRAAAFEILLEWWE---KHTYGRHFYVGHGAYRVTEK---RPGWTDKGQIPKQV 175 Query: 399 KQLDLNDAVPEISGTILF 416 + L E+ G+I F Sbjct: 176 RHLRDQH---EVQGSIYF 190 >UniRef50_B6YR88 Putative uncharacterized protein n=1 Tax=Candidatus Azobacteroides pseudotrichonymphae genomovar. CFP2 RepID=B6YR88_AZOPC Length = 490 Score = 143 bits (360), Expect = 1e-32, Method: Compositional matrix adjust. Identities = 104/372 (27%), Positives = 170/372 (45%), Gaps = 47/372 (12%) Query: 56 MRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGT 115 +R +WL T LDWP + + Q++ +++ L L++ N VFFQ + G Sbjct: 27 IRAVWLTTNYALDWPTKPFTTLEDIDK----QKEELVNILCCLKKTNFNIVFFQTRLRGN 82 Query: 116 ALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIR 175 ++ SK+ P S + K G YDPL F ++E HK G++ HAWF Y + G Sbjct: 83 VVYDSKVEPLSPFIRNK-GYKVTYDPLAFAIEECHKLGLECHAWFVTYLLGAAEVKG--- 138 Query: 176 ELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFD 235 E N +L + + R LDPG E ++ SIV E+V +Y VDG+ D Sbjct: 139 EDNCSLVVKCNQLQT------RIYKGEIYLDPGDLETDRYLLSIVEEIVDKYDVDGIHMD 192 Query: 236 DYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSP 295 Y E P + D+ TY+ YG +K +WR++N + ++++ +K KP V+ + Sbjct: 193 YIRYPEKP-TEFPDDITYKYYGKG-KNKTEWRKDNINRFVSRLYDMVKGKKPWVQVSSAV 250 Query: 296 AGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYW------PFSRSAA 349 G++ + LG + + A +E Y D +W+ G D+I P +Y+ PF + Sbjct: 251 VGIYTRK----LGDNKKYWTA-NEVYQDPEQWLRMGKHDFIVPMMYYSGNLFFPFVQ--- 302 Query: 350 RYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPE 409 D A+ + V P GI Y++ E + +W + ++K N Sbjct: 303 --DWQARSYGRFVVP-------GIGIYRMDEK---DSNWDVQTVTEQIKSSRQHNTG--- 347 Query: 410 ISGTILFREDYL 421 G FR +YL Sbjct: 348 --GNAFFRANYL 357 >UniRef50_C5VL52 YngK protein n=3 Tax=Prevotella RepID=C5VL52_9BACT Length = 566 Score = 142 bits (359), Expect = 2e-32, Method: Compositional matrix adjust. Identities = 95/319 (29%), Positives = 143/319 (44%), Gaps = 38/319 (11%) Query: 48 TTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVF 107 T + + R +WL T++ LDWP N + ++Q+Q +ID LD Q+ INTV Sbjct: 28 TKRMPKRETRAVWLTTLASLDWPK----NYARSEESIKLQKQELIDILDKYQKANINTVL 83 Query: 108 FQVKPDGTALWPSKILPWSDLMTGKIGENP--GYDPLQFMLDEAHKRGMKVHAWFNPYRV 165 Q + ++PS I PW +TG G P GYDPL F ++E HKRGM++HAW V Sbjct: 84 LQARVRAATIYPSDIEPWDQCITGVEGRAPGYGYDPLSFAVEECHKRGMEIHAWIATIPV 143 Query: 166 SVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVS 225 G TL ++ IR LDP P V ++ S+ E+V Sbjct: 144 GAKNSLGC-----RTLMKKGFR--------IRNFSTGSYLDPADPSVAPYLASVCGEIVR 190 Query: 226 RYPVDGVQFDDYFYTES-PGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKS 284 +Y VDG+ D Y + P D +T D RR+N ++ + +K+ Sbjct: 191 KYDVDGINLDYIRYPDGWPRPSYRDGDT-----------PDQRRSNITAIVRAIHDEVKA 239 Query: 285 IKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPF 344 IKP V+ SP G + D ++ A+D + + W+ GL+D + P Y+ Sbjct: 240 IKPWVKMSCSPIG----KHADLSRYSSKNFNAHDRVSQEAQEWMRLGLMDQLYPMQYF-- 293 Query: 345 SRSAARYDVLAKWWADVVK 363 R Y +A W + K Sbjct: 294 -RGDNYYPFVADWVENAYK 311 >UniRef50_C9PZF4 YngK protein n=5 Tax=Prevotella RepID=C9PZF4_9BACT Length = 573 Score = 141 bits (356), Expect = 4e-32, Method: Compositional matrix adjust. Identities = 101/337 (29%), Positives = 156/337 (46%), Gaps = 43/337 (12%) Query: 8 KKLTIRRPAILVALALLLCSCKSTPPESMVTPPA--GSKPPATTQQSSQPMRGIWLATVS 65 K+L+ +L+ AL + S P G K P + +R +WL T+ Sbjct: 3 KQLSFSNRFLLLFFALSTATMLCAKSFSFFKPNGLNGWKLP------KREVRAVWLTTIG 56 Query: 66 RLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPW 125 LDWP + N A Q+Q + D LD LQR GINTV FQ + GT ++PS++ PW Sbjct: 57 GLDWPHSYAQN----ELMAGRQKQELRDILDKLQRAGINTVLFQARVRGTVVYPSQLEPW 112 Query: 126 SDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQP 185 ++G G +PGYDPL F ++E HKRGM++HAW V T P + + NS + Sbjct: 113 DGCLSGVPGRSPGYDPLAFAINECHKRGMELHAW-------VVTIP--VGKWNSLGCKTL 163 Query: 186 ASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGS 245 + Y I+ G+ +DP ++ + E+ RY VDG+ D Y P Sbjct: 164 RNKYPH---LIKRIGEEGYMDPENTATATYLANFCKEITDRYDVDGIHLD---YIRYP-- 215 Query: 246 RLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHD 305 ET++ D R N ++ + +K+ KP V++ SP G + + S Sbjct: 216 -----ETWK-----INIAHDAARRNITTIVRAIGEKVKASKPWVKYSCSPIGKFSDLSRF 265 Query: 306 PLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYW 342 + G AY + D + W+ GL+D + P +Y+ Sbjct: 266 A----SNGWNAYAKVCQDAQGWLRDGLMDALFPMMYF 298 >UniRef50_D1PA22 YngK protein n=1 Tax=Prevotella copri DSM 18205 RepID=D1PA22_9BACT Length = 582 Score = 134 bits (338), Expect = 5e-30, Method: Compositional matrix adjust. Identities = 91/320 (28%), Positives = 147/320 (45%), Gaps = 44/320 (13%) Query: 23 LLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTS 82 ++LCS + +S+V Q +R +WL T+ +DWP + + + Sbjct: 9 IVLCSVLAAKAQSIV---------FNNQVPKHEVRAVWLTTIGGIDWPH----SYAQSSY 55 Query: 83 RARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPL 142 A Q++ + D LD LQ INTV Q + GT ++PS PW ++G G +PGYD L Sbjct: 56 SAEKQKKELTDILDRLQLAKINTVLIQTRVRGTMIYPSAYEPWDGCLSGFPGRSPGYDAL 115 Query: 143 QFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDR 202 QF +DE HKRGM++HAW V G TL Q+ + I+ G Sbjct: 116 QFAIDECHKRGMELHAWVVTIPVGKWNALGC-----KTLRQKMPKL-------IKKIGAD 163 Query: 203 FVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFAS 262 +DP D++ +I E+ +Y VDG+ D Y E+ +++ E R+Y Sbjct: 164 GYMDPENSRTGDYLANICREITHKYNVDGIHLDYIRYPETWNIKVS-REQGRRY------ 216 Query: 263 KADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYA 322 ++ K+ +K+ KP V+ SP G + + S + G AY + Sbjct: 217 --------ITNIVRKIHDAVKAEKPWVKMSCSPVGKYDDLSR----YRSFGWNAYTKVCQ 264 Query: 323 DTRRWVEQGLLDYIAPQIYW 342 D + W++ GL+D + P +Y+ Sbjct: 265 DAQGWLKSGLMDELFPMMYF 284 >UniRef50_C2L0K0 Lipoprotein yddW n=1 Tax=Oribacterium sinus F0268 RepID=C2L0K0_9FIRM Length = 443 Score = 121 bits (304), Expect = 4e-26, Method: Compositional matrix adjust. Identities = 109/419 (26%), Positives = 176/419 (42%), Gaps = 70/419 (16%) Query: 45 PPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGIN 104 P + S + +R +W S LDW +N+ R + + +D+LQ+ G Sbjct: 63 PGVKSLSSQKELRAVWF---SYLDW-----INMPKEEQAFRAEAAKV---MDNLQKNGFQ 111 Query: 105 TVFFQVKPDGTALWPS-KILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPY 163 T+F V + + P+S M G N +DPL+ M+ EA K+G+ VHAWFNPY Sbjct: 112 TIFLHVHSHSDSYGKKMTVFPYSKFMPG----NGSFDPLEIMISEAKKKGISVHAWFNPY 167 Query: 164 RVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGD---------RFVLDPGIPEVQD 214 RVS + S P V + W RTSG+ ++ ++P ++ Sbjct: 168 RVSSSM---------SKWENIPEDSIV--KKWSRTSGEERNVLLHEGQYYINPSRAAGRE 216 Query: 215 WITSIVAEVVSRYPVDGVQFDDYFY----TESPGSRLNDNETYR-KYGGAFASKADWRRN 269 + + + E++ Y VDG+ FDDYFY G R ++ E K G S ++RRN Sbjct: 217 ALLASIKELLDNYAVDGIHFDDYFYPRVSLTEEGKRFDEPEYEEAKRQGETGSLTEYRRN 276 Query: 270 NTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRR-WV 328 L+ +V K + GV FGVSP P R + AY + D + Sbjct: 277 QVSLLLKQVHSLCK--ERGVVFGVSPV---------PNLQSLRSSVAY---FLDVDKIMA 322 Query: 329 EQGLLDYIAPQIYWPFSRSAAR-------YDVLAKWWADVVKPT--RTRLYIGIAFYKVG 379 + +DYI PQ+Y F + Y W ++ T + L +G+ Y+ G Sbjct: 323 SKDYIDYIMPQMYHGFRAKNGKGQEAPHAYMRSLGDWVNLTNSTGNQVELMLGLGLYRAG 382 Query: 380 EP---SKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQS 435 +W + LK+Q++ + G +F L + + Q+ + L+S Sbjct: 383 SSVWDGNPVSEWFTESDI--LKRQVEEARKTGIVKGYAVFAYQNLLEERAQRELGNLRS 439 >UniRef50_C2FS66 Putative uncharacterized protein n=1 Tax=Sphingobacterium spiritivorum ATCC 33300 RepID=C2FS66_9SPHI Length = 172 Score = 112 bits (280), Expect = 3e-23, Method: Compositional matrix adjust. Identities = 54/129 (41%), Positives = 79/129 (61%), Gaps = 10/129 (7%) Query: 47 ATTQQS-SQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINT 105 A +Q S + +RG+W+ATV+ +DWP S + Q+Q +I+ LD QR G+N Sbjct: 21 AISQNSPKRELRGVWIATVANIDWP-------SRDNESSERQKQELINILDAHQRAGLNA 73 Query: 106 VFFQVKPDGTALWPSKILPWSDLMTGKIGENPG--YDPLQFMLDEAHKRGMKVHAWFNPY 163 +FFQ++P A + PWS ++G G+ P YDPL+F+++EAHKRGM++HAW NPY Sbjct: 74 IFFQIRPAADAFYAKGREPWSRYLSGVQGKAPSPFYDPLEFVIEEAHKRGMELHAWVNPY 133 Query: 164 RVSVNTKPG 172 R S P Sbjct: 134 RASTTLNPA 142 >UniRef50_C7GZF2 Putative lipoprotein n=1 Tax=Eubacterium saphenum ATCC 49989 RepID=C7GZF2_9FIRM Length = 373 Score = 100 bits (248), Expect = 1e-19, Method: Compositional matrix adjust. Identities = 103/401 (25%), Positives = 165/401 (41%), Gaps = 93/401 (23%) Query: 56 MRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGT 115 M+ +W VS LD+ + N+ T ++ A I D +R GINT+FF V+ Sbjct: 27 MKAVW---VSFLDFQNLGLTNVREKT----FKKNAEIMVKD-AKRNGINTIFFHVRAFDD 78 Query: 116 ALWPSKIL-PWSDLMTGKIGENPG-----YDPLQFMLDEAHKRGMKVHAWFNPYRVSVNT 169 A + SK+ L T P YDPL+ + + AHK G+++HAW NPYRV Sbjct: 79 AAYKSKVFRAMRYLKTNASYAKPATSSFSYDPLKLVAEAAHKHGVQLHAWLNPYRV---- 134 Query: 170 KPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPV 229 G + L P + I V E+++ Y V Sbjct: 135 ------------------------------GYDYFLSPKSEYSTNRIIKAVNEILT-YKV 163 Query: 230 DGVQFDDYFYTESPG-SRLNDNETYRKYGGAFASKADW------RRNNTQQLIAKVSHTI 282 DG+ FDDYFY G RLN + Y +K D+ +R +LI +V+ Sbjct: 164 DGIHFDDYFYHAKKGYYRLNSKKQYSV--NPATAKKDYSPSSINKRRYVNKLIRRVN--- 218 Query: 283 KSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQ-GLLDYIAPQIY 341 K+ + F VSPAG N + S D W+ G +D I PQIY Sbjct: 219 KTTQGKALFSVSPAGNVDNCMN---------------SGVDLTTWLSNDGYVDMIMPQIY 263 Query: 342 WPFS-RSAARYDVLAKWWADVVKPTRTR--LYIGIAFYKVGEPSKIEPDWM-----INGG 393 W + ++ R + + ++ + + + IG+A Y+ GE + W I+G Sbjct: 264 WTDNWGASGRVKMFSSRLGQFMRKNKKKIPMVIGLALYRSGERGLGDKGWSMRGSNISGQ 323 Query: 394 VPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQ 434 + +++ + G LFR + L + + ++ V ++ Sbjct: 324 IKSIRRH--------GLGGYCLFRFNNLYQGRCKKEVKNMR 356 >UniRef50_B0P7J4 Putative uncharacterized protein n=1 Tax=Anaerotruncus colihominis DSM 17241 RepID=B0P7J4_9FIRM Length = 1211 Score = 95.1 bits (235), Expect = 4e-18, Method: Compositional matrix adjust. Identities = 89/365 (24%), Positives = 151/365 (41%), Gaps = 72/365 (19%) Query: 32 PPESMVTPPAGSKPPAT----TQQSSQP--------MRGIWLATVSRLDWPPVSSVNISN 79 PP++ PP + T Q+ +P MRG+ ++ + D+ ++N Sbjct: 92 PPDASSAPPDTAGETGTDSSDNGQADEPVYFNVPTEMRGVMIS--AGTDY-------LTN 142 Query: 80 PTSRARVQQQAMIDK-LDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPG 138 T + + +D+ L Q+L +NTV + + L+ S L + L G Sbjct: 143 GTDVSAQELATQLDEALAAAQQLTMNTVIIDTQYGDSVLFESSALESAPL---------G 193 Query: 139 YDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRT 198 D +++ +A + G V+A ++ V+T+ G Sbjct: 194 LDVTEYLCAKAREMGFYVYATYD-----VSTRSG-------------------------- 222 Query: 199 SGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGG 258 G+ D D + + Y DG+ D Y +SP + Y + GG Sbjct: 223 -GEGLTADGA---ALDDLAENIGAFAEAYKPDGILLDGYECADSPAAYAG----YLQSGG 274 Query: 259 AFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRG-AAAY 317 +A ++R + L+ + ++ PGV+ G+ VW+N DP GSDT+ A Sbjct: 275 GMGYEA-YQRQVPRALLETAAAAVRENAPGVQVGLYTQAVWQNSDADPDGSDTKAETTAL 333 Query: 318 DESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYK 377 ADTR +V+ GL D++ + Y + AR+ V+A WWA VV T T+LY+ A + Sbjct: 334 GTGNADTRAFVKDGLFDFVMVKNYGSTNEETARFGVVAAWWAGVVDGTDTKLYMMHAADR 393 Query: 378 VGEPS 382 VG S Sbjct: 394 VGTQS 398 >UniRef50_C7E4U8 Putative uncharacterized protein psa8 n=1 Tax=Pantoea stewartii subsp. stewartii DC283 RepID=C7E4U8_ERWST Length = 106 Score = 92.0 bits (227), Expect = 4e-17, Method: Compositional matrix adjust. Identities = 40/65 (61%), Positives = 49/65 (75%) Query: 373 IAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAVSY 432 +A YKVG PS IEPDW I GGVPE +QL LND++ E+ G +LFR +L +PQTQQ V Y Sbjct: 1 MALYKVGTPSAIEPDWTIEGGVPETTRQLGLNDSLEEVGGCMLFRHMFLREPQTQQVVDY 60 Query: 433 LQSRW 437 L+SRW Sbjct: 61 LRSRW 65 >UniRef50_B0PF61 Putative uncharacterized protein n=1 Tax=Anaerotruncus colihominis DSM 17241 RepID=B0PF61_9FIRM Length = 915 Score = 84.0 bits (206), Expect = 1e-14, Method: Compositional matrix adjust. Identities = 66/232 (28%), Positives = 108/232 (46%), Gaps = 27/232 (11%) Query: 6 RNKKLTIRRPAILVALALLLCSCKSTPPESMVTPPAGSK-PPATTQQSSQPMRGIWLATV 64 K TI R +L +LA++ +C ++T P SK PPA + + + + + T Sbjct: 3 HETKRTILR-TLLASLAIVAATCAVLYASDLLTSPISSKTPPAGIPAAGEQLHALIVRTR 61 Query: 65 SRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQ-RLGINTVFFQVKPDGTALWPSKIL 123 D+P S P A+ QQ+A +D++ G N VFF+ P AL+ S IL Sbjct: 62 GNADFP-------SAPGLSAK-QQRAQLDEIAAFAGEYGYNAVFFEAVPSCDALYRSSIL 113 Query: 124 PWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQ 183 P S G+ G +DPL ++++ + G++V+A +P+ VS Sbjct: 114 PSSAYWMGEQGAFAFFDPLDYLVNVCKESGIQVYAMIDPFAVSA----------EDLAES 163 Query: 184 QPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFD 235 PAS ++ +WI G RF +P VQ S+ AE+ +RY + G+ + Sbjct: 164 SPAS---KNPEWIAADG-RF--NPTELGVQQLAGSVAAELATRYDIAGIVLE 209 >UniRef50_B8HYQ9 Putative uncharacterized protein n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HYQ9_CYAP4 Length = 383 Score = 83.2 bits (204), Expect = 2e-14, Method: Compositional matrix adjust. Identities = 82/337 (24%), Positives = 134/337 (39%), Gaps = 94/337 (27%) Query: 44 KPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDK-LDHLQRLG 102 + PA S+Q RG+WL ++ A + ++D+ L L + G Sbjct: 25 RAPARPTASTQENRGLWLTSIGL-----------------AGLYHSTLLDETLSDLSQRG 67 Query: 103 INTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNP 162 NT++ V G L+PS+++P + + D L + E ++G+++ WF Sbjct: 68 FNTLYPAVWNRGQTLYPSRVVPAAFTLG---------DVLSTTVREGKQQGLRIIPWFE- 117 Query: 163 YRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIR------------------------- 197 Y + V + S L++Q H DW+ Sbjct: 118 YGLKVTDR--------SVLARQ-------HPDWLARDRNGRPYINPEPVNALPFPLKGLS 162 Query: 198 ---TSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTE-SPGSRLNDNETY 253 T D VL+P P+VQ+ I + +VV RY VDG+Q DD+F G + + Sbjct: 163 RSVTGADHVVLNPIHPQVQNLIVKMFVDVVKRYNVDGIQIDDHFALPVQLGYDSYTRQRF 222 Query: 254 RKYGGAFASK-------ADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDP 306 R+ G +WR N +L+ K+S IK KP + F ++P + P Sbjct: 223 RQEQGVEPPADPTDPAWMEWRANKLTELVGKISTAIKQQKPAIIFSIAP--------NPP 274 Query: 307 LGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWP 343 A AY + D WV +G +D + Q+Y P Sbjct: 275 -------AFAYRTTLQDWPTWVRRGYVDEVVVQVYRP 304 >UniRef50_P74735 Slr0592 protein n=1 Tax=Synechocystis sp. PCC 6803 RepID=P74735_SYNY3 Length = 491 Score = 82.0 bits (201), Expect = 4e-14, Method: Compositional matrix adjust. Identities = 74/287 (25%), Positives = 120/287 (41%), Gaps = 40/287 (13%) Query: 69 WPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDL 128 +P + V I+N + + Q + ++ L L NT++ V G L+ S+ L Sbjct: 48 FPEIRGVWITNNDTVHFLDQNRTTESINLLADLNFNTIYPVVWNSGYVLYESEFAKREGL 107 Query: 129 MTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASV 188 + G D L ++D+AH+R M V WF P S L ++ Sbjct: 108 QPFSPRGDQGQDVLADIIDKAHRRNMLVLPWF---EFGFKAPPM------SELVKRHPWW 158 Query: 189 YVQHRDWIRTS----GDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPG 244 + Q RD +TS G+ ++P P+VQ +IT +V + V++Y +DGVQFDD +T P Sbjct: 159 FTQKRDGTKTSVSAAGEVMWMNPFHPQVQTFITQLVMDAVNKYDLDGVQFDD--HTALPN 216 Query: 245 SRLNDNETYRKYGGAFASK----------ADWRRNNTQQLIAKVSHTIKSIKPGVEFGVS 294 DN T Y WR + + +++ IK+ KP + VS Sbjct: 217 EFGYDNYTISLYQQETKKTPPSNPKDPAWIRWRADKITAFMVQLNARIKAAKPNILVSVS 276 Query: 295 PAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIY 341 PA AY+ D W+ +G++D + Q+Y Sbjct: 277 PATY---------------NLAYNTFLQDWLDWIRKGIVDEVIVQVY 308 >UniRef50_A0YS74 Putative uncharacterized protein n=2 Tax=Oscillatoriales RepID=A0YS74_9CYAN Length = 1005 Score = 82.0 bits (201), Expect = 4e-14, Method: Compositional matrix adjust. Identities = 66/232 (28%), Positives = 101/232 (43%), Gaps = 46/232 (19%) Query: 95 LDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGM 154 D L + GINTVFF+ G ++P+++ P + +T G+DPL + AH+RGM Sbjct: 553 FDQLAQAGINTVFFETVNAGYPIYPTRVAPQQNPLT------QGWDPLASGVKLAHERGM 606 Query: 155 KVHAWFNPYRVSVNTKPGTIRELNSTLSQQPAS----VYVQHRDWI-RTSGDRF------ 203 ++HAW + T + ++TL QP S V H DW R S R Sbjct: 607 ELHAWLWTF--------ATANQRHNTLVNQPTSYLGPVLTAHPDWANRDSRGRVWHERDG 658 Query: 204 --VLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFY-TESPGSRLN------DNETYR 254 LDP EV+ +I +V E+V Y VDG+Q D Y + P N E +R Sbjct: 659 KAYLDPANREVRSYILRLVGEIVHNYDVDGIQLDYIRYPFQDPNRNFNFGYGTAGREQFR 718 Query: 255 KYGGA------------FASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVS 294 + G + ++R ++ +VS ++ P V F V+ Sbjct: 719 QLTGVDPISVSPKDSQLWQQWVNFRVEQVSTMVREVSQLLRKQYPDVIFSVA 770 >UniRef50_B4VTS6 Putative uncharacterized protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VTS6_9CYAN Length = 884 Score = 82.0 bits (201), Expect = 4e-14, Method: Compositional matrix adjust. Identities = 78/311 (25%), Positives = 132/311 (42%), Gaps = 65/311 (20%) Query: 66 RLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPW 125 +L P + ++ + T ++Q ++ D+L + G NTVFF+ ++PS++ P Sbjct: 396 QLAQPEIRAIWLDRGTIVKAKRKQDLVKLFDNLAKAGFNTVFFETVNASYPIYPSQVAP- 454 Query: 126 SDLMTGKIGENP---GYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLS 182 +NP G+DPL+ ++ AH+RGM++HAW + + N + + LN L Sbjct: 455 --------EQNPLVRGWDPLEAAVELAHERGMELHAWVWIF-AAANQRHNAL--LNQPLD 503 Query: 183 QQPASVYVQHRDW--IRTSGDRFV-------LDPGIPEVQDWITSIVAEVVSRYPVDGVQ 233 P+ V H DW G F DP PEV++++ +++ E+ +RY VDG+Q Sbjct: 504 Y-PSPVLAAHPDWAIFDKQGRLFAPNTRKAFFDPAHPEVREYLMALLEEIATRYDVDGIQ 562 Query: 234 FDDYFYTESPGSRLNDNETYRKYGGAFASK-----------------------ADWRRNN 270 D Y P N+TY YG A + ++R Sbjct: 563 LD---YIRYPFQDPRVNQTY-GYGVAARQQFKERTGVDPIEVYPRDRTLWQQWTEFRIRQ 618 Query: 271 TQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQ 330 +A VS + S +P + + A V+ P R ++E W +Q Sbjct: 619 VDSFVASVSARLLSQRPDL---ILSAAVF---PLPPAERQQRLQQNWEE-------WAKQ 665 Query: 331 GLLDYIAPQIY 341 G +D + P Y Sbjct: 666 GYIDLVVPMTY 676 >UniRef50_Q8YV65 All2116 protein n=15 Tax=Cyanobacteria RepID=Q8YV65_ANASP Length = 416 Score = 81.6 bits (200), Expect = 5e-14, Method: Compositional matrix adjust. Identities = 89/349 (25%), Positives = 148/349 (42%), Gaps = 81/349 (23%) Query: 16 AILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSV 75 A+++AL+++ S P + +TP A + +RG+WL Sbjct: 26 ALMMALSVVATVMLSFPLNAQITPSAAL---------ASELRGVWLT------------- 63 Query: 76 NISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGE 135 NI + R + + + KLD +L NTV+ V G L+PSK+ + ++ I Sbjct: 64 NIDSDVLFERDRLKTSLQKLD---KLNFNTVYPAVWNWGYTLYPSKVA--AKVIGRAIDP 118 Query: 136 NPGY---DPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQH 192 PG D L+ ++ E HK+G+ V WF + G + +S L++ Sbjct: 119 TPGLQGRDMLKEIVTEGHKQGLTVIPWF---------EFGFMAPADSLLAKN-------R 162 Query: 193 RDWI--RTSGDRFV---------LDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTE 241 W+ R++G R V L+P P+VQ +I ++ E+V Y +DG+QFDD+F Sbjct: 163 PQWLTSRSNGSRIVKEGIHDRVWLNPFRPDVQQFIQDLIVEIVRNYDIDGIQFDDHFGLP 222 Query: 242 SP-GSRLNDNETYRK-YGGAFASK-------ADWRRNNTQQLIAKVSHTIKSIKPGVEFG 292 S G Y+K + G SK WR + + ++ IK+ K Sbjct: 223 SELGYDAYTVALYKKEHRGQAPSKNPRDPEWLRWRASKITNFMQRIFKAIKATKKDCLVS 282 Query: 293 VSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIY 341 V+P +P +YD AD ++W GL++ + QIY Sbjct: 283 VAP---------NP------QRFSYDYFLADWQKWERMGLIEELVLQIY 316 >UniRef50_B0MQ12 Putative uncharacterized protein n=1 Tax=Eubacterium siraeum DSM 15702 RepID=B0MQ12_9FIRM Length = 990 Score = 81.3 bits (199), Expect = 8e-14, Method: Compositional matrix adjust. Identities = 94/422 (22%), Positives = 159/422 (37%), Gaps = 74/422 (17%) Query: 6 RNKKLTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVS 65 RNK IR A +++ +LL + E T S + +QS Q I ++ Sbjct: 2 RNK--FIRIMAGVLSAFMLLSQLTAVAEEK--TNENTSASAESAEQSKQTTPQIAEPKLT 57 Query: 66 RLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQ-VKPDGTALWPSKILP 124 + + +N+ + A + KLD L G+N V+ DGT + Sbjct: 58 LSNELKATVINLGDFA--AEKFGENFSKKLDTLIAYGMNGVYINPYGKDGTYYTTNM--- 112 Query: 125 WSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQ 184 N D L+ L+ A K+GM+ + +F ++N T++ Sbjct: 113 -----------NKSGDRLEKALEAATKKGMQRYVYF---------------DINKTMAAC 146 Query: 185 PASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPG 244 P + D++ S + +Y +G+ ++ T Sbjct: 147 PDG----------------------EDCYDYLVSEAHKFALKYRCNGIILTGFYGT---- 180 Query: 245 SRLNDNETYRKY--GGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNR 302 N+N Y +Y G+ +W + + + VS I+ + G+ VW N Sbjct: 181 ---NNNSAYEEYMKNGSGIGYKNWLYDTVEYKFSTVSGVIRLSDNSIAVGIDAKDVWANA 237 Query: 303 SHDPLGSDTRGA-AAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADV 361 S + GSDT A+ YADT+ +VE+GL D+I ++ + WW++V Sbjct: 238 SKNKKGSDTSAKYTAFYNGYADTKSFVEKGLTDFIVVNASGSLDNETVGFENVCSWWSNV 297 Query: 362 VKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYL 421 K + I FY V KI D G +L KQL D + SG++ + E L Sbjct: 298 AKSAK------IPFYIVHHNEKIGTDEDGWGVEDQLLKQLAKADELDNYSGSVFYSEKSL 351 Query: 422 NK 423 + Sbjct: 352 EE 353 >UniRef50_Q8YQA0 All3933 protein n=18 Tax=Cyanobacteria RepID=Q8YQA0_ANASP Length = 741 Score = 80.9 bits (198), Expect = 8e-14, Method: Compositional matrix adjust. Identities = 85/319 (26%), Positives = 134/319 (42%), Gaps = 72/319 (22%) Query: 46 PATTQQSSQP---MRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLG 102 PAT Q P +RG+WL ++ + +RA+VQ D L L+RL Sbjct: 59 PATAQFFQSPRQEIRGVWL-----------TNNDFDILRNRAKVQ-----DTLAQLRRLN 102 Query: 103 INTVFFQVKPDGTALWPSKILPWSDLMTGKIG------ENPGYDPLQFMLDEAHKRGMKV 156 NT++ V DG +PS + T ++G G D + ++ +A +G+ Sbjct: 103 FNTIYPVVWNDGYTKYPSAV-------TQRMGIPYFFRGTEGQDVIADIISQARSQGLLA 155 Query: 157 HAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTS----GDRFVLDPGIPEV 212 WF + G + L S L+ Q Q RD +TS G+ ++P PEV Sbjct: 156 IPWF---------EFGFMAPLTSELASQHPDWLTQKRDGTQTSISAAGEVAWMNPFHPEV 206 Query: 213 QDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNET---YRKYGG-------AFAS 262 Q +IT +V E++++Y DG+QFDD+ P D T YR+ G + Sbjct: 207 QQFITDLVVEIITKYNADGIQFDDHM--SLPVDFGYDKYTINLYRQETGNPPPSNPQAQA 264 Query: 263 KADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYA 322 WR + + +++ +K+ KP F VSP +D AY Sbjct: 265 WVKWRADKITAFMVQLNQAVKARKPNAIFAVSP------NYYD---------FAYKLQLQ 309 Query: 323 DTRRWVEQGLLDYIAPQIY 341 D WV G++D + Q+Y Sbjct: 310 DWLNWVRLGVVDELVVQVY 328 >UniRef50_B9XI64 Putative uncharacterized protein n=1 Tax=bacterium Ellin514 RepID=B9XI64_9BACT Length = 1083 Score = 78.2 bits (191), Expect = 6e-13, Method: Compositional matrix adjust. Identities = 108/437 (24%), Positives = 164/437 (37%), Gaps = 99/437 (22%) Query: 6 RNKKLTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVS 65 RN ++ IR I+ L +L ++ PA Q R W A V Sbjct: 8 RNVRMRIRCLVIMAGLWFVLA----------ISSPA------------QEFRAAW-ADVF 44 Query: 66 RLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPW 125 + + VN T + ++ + ++ +G+N+ A W S ILPW Sbjct: 45 HVGMGSQTEVNNMVATLVSGHYNAVIVQVVGYMDGIGVNS--------HGAHWKSNILPW 96 Query: 126 SDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNP-----YRVSVNTKPGTIRELNST 180 S +T G+DPL + +AH G++VHAW YRVS P N+T Sbjct: 97 SPRVTA------GFDPLAALCAQAHANGIEVHAWLGGSAGAMYRVSTAWPPAG----NAT 146 Query: 181 LSQQP----ASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDD 236 L+ P A + + LD G P+ Q++I SIV E+V+ YP+DG+ +DD Sbjct: 147 LTAHPEWFIAPLANSEGGAPVLVDGNYDLDMGSPDAQEYIVSIVRELVTNYPIDGINWDD 206 Query: 237 ----------YFY-----TESPGSRLNDNETYRKYGGAFASK-------ADWRRNNTQQL 274 + Y T P S L YR+ G + +++RR +L Sbjct: 207 ELNNAGYAAGFGYPALSQTNYPNSGLGR---YRRNTGYVGTPPNTDTAWSNYRRRFKNEL 263 Query: 275 IAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLD 334 +A+V I+SIK + S P G Y + D ++ G LD Sbjct: 264 MARVQAEIQSIKTNPRQPLRHTSAALAYSPYPTSCTFAGLVPYTY-FCDWAGMLQNGWLD 322 Query: 335 YIAPQIY---------------WPFSR----SAARYDVLAKWWADVVKPTRTRLYIGIAF 375 + PQ Y W ++R Y A+++ TR+ G A Sbjct: 323 AVIPQTYSLGTFTNWANFSASCWQYNRQIFPGIGAYLNTNASIANMIGYTRSIGLKGNAI 382 Query: 376 YKVGEPSK----IEPDW 388 Y G P E DW Sbjct: 383 YSYGVPHTNFVPAESDW 399 >UniRef50_A8YI06 Similar to tr|Q8YPV9|Q8YPV9 n=8 Tax=Chroococcales RepID=A8YI06_MICAE Length = 438 Score = 78.2 bits (191), Expect = 6e-13, Method: Compositional matrix adjust. Identities = 76/316 (24%), Positives = 128/316 (40%), Gaps = 66/316 (20%) Query: 47 ATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTV 106 A +Q +RG+W+ T + + R + QQ ++ L L N + Sbjct: 38 AFSQSRDPDIRGVWITTN-----------DTAMLMDRDKRQQ-----AIEQLVNLNFNAI 81 Query: 107 FFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVS 166 + V G AL+PS I + G D L ++++ RG+ V WF Sbjct: 82 YPVVWNSGYALYPSAIAQREGIQPFVPTGAQGQDILAELVEQTRGRGLLVIPWF------ 135 Query: 167 VNTKPGTIRELNSTLSQQPASVYVQHRD----WIRTSGDRFVLDPGIPEVQDWITSIVAE 222 + G + S L+ + + Q RD W+ +G+ L+P PEVQ+++ +V E Sbjct: 136 ---EFGFMAPPTSELALKHQNWLTQKRDGGTTWVGAAGEVVWLNPFRPEVQNFLRELVLE 192 Query: 223 VVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASK---------------ADWR 267 VV +Y ++G+QFDD+ L + Y Y A + WR Sbjct: 193 VVGQYDINGIQFDDHL-------SLPNEFGYDPYTIALYQQETEKTPPANPRDPEWTKWR 245 Query: 268 RNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRW 327 + +A + +I++IKP + ++P +P AY+ D W Sbjct: 246 ADKITAFLANLKQSIEAIKPNILLSIAP---------NPY------EFAYNGHLQDWLAW 290 Query: 328 VEQGLLDYIAPQIYWP 343 V QGL+D + Q+Y P Sbjct: 291 VRQGLVDELIVQVYRP 306 >UniRef50_B4VPG3 Putative uncharacterized protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VPG3_9CYAN Length = 406 Score = 77.8 bits (190), Expect = 7e-13, Method: Compositional matrix adjust. Identities = 90/349 (25%), Positives = 144/349 (41%), Gaps = 75/349 (21%) Query: 13 RRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPV 72 RR + + L L+ S +S+V P+ ++ T+ + +RG+WL V+ Sbjct: 8 RRFRVFLVLGLVF-SIVLLVAKSIVFSPSLARSQTPTKITE--IRGVWLTNVA------- 57 Query: 73 SSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGK 132 S + +P R Q L +L NTV+ V G +PS + L+ Sbjct: 58 -SGVLFSPWGINRAIAQ--------LSKLNFNTVYPVVWNRGHTFYPSAVATQEPLLA-- 106 Query: 133 IGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQH 192 I G D L +L + H++G++V WF + G + + S L+++ H Sbjct: 107 IMRLNG-DVLADILQQGHRQGLRVIPWF---------EYGFMTPIYSELARR-------H 149 Query: 193 RDWIRTSGDR---------FVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESP 243 WI S + L+P PEVQ I ++ EVVS+Y VDG+Q DD+F P Sbjct: 150 PTWITQSLTQKSDPENPQLLWLNPLHPEVQQLILDLIKEVVSQYDVDGIQLDDHF--GMP 207 Query: 244 GSRLNDNETYRKYGGAFASKA-----------DWRRNNTQQLIAKVSHTIKSIKPGVEFG 292 D T +Y + WR N + + ++ T+KSIKP Sbjct: 208 VELGYDPYTIERYQQEHYGNSPPNSPLNSEWMRWRANKISEFMGEIVQTVKSIKPDCIIS 267 Query: 293 VSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIY 341 +SP +P A AY D + WV++G +D + Q+Y Sbjct: 268 LSP---------NP------QAFAYKHYLQDWQTWVQRGWVDELVLQVY 301 >UniRef50_Q7NL32 Glr1294 protein n=1 Tax=Gloeobacter violaceus RepID=Q7NL32_GLOVI Length = 796 Score = 77.4 bits (189), Expect = 9e-13, Method: Compositional matrix adjust. Identities = 92/372 (24%), Positives = 151/372 (40%), Gaps = 80/372 (21%) Query: 22 ALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIW---LATVSRLDW--------- 69 ALL +ST PE PPA A Q++ + + + L T +R W Sbjct: 250 ALLTSDARSTAPEQF--PPAYRDAIARAQRTLKELPAMLKDGLDTQARAAWEDAIEDLWA 307 Query: 70 ----------PPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWP 119 P V ++ + T ++ + D L + GINTVFF+ G ++P Sbjct: 308 HYPTSQLAALPEVRAIWLDRGTIVKAGSEEGLTRIFDRLAQSGINTVFFETVNAGYTIYP 367 Query: 120 SKILPWSDLMTGKIGENP---GYDPLQFMLDEAHKRGMKVHAW---FNPYRVSVNTKPGT 173 S + P +NP G+DPL + AH+R M++HAW F N G Sbjct: 368 SAVAP---------AQNPLIRGWDPLAAAVRLAHERKMELHAWTWAFAAGNTRHNALIGK 418 Query: 174 IREL-NSTLSQQPASVYVQHRDWIRTSGD-RFVLDPGIPEVQDWITSIVAEVVSRYPVDG 231 ++ L+ P + +R +G + +DP PEV+ ++ S+ E+++ Y VDG Sbjct: 419 SQDFPGPVLAAHPGWAQSGRKGNLRPAGQPEYWMDPANPEVRAYLQSLYEEILTNYDVDG 478 Query: 232 VQFD----------DYFYTESPGSRLNDNETYRKYGGA----FASK-----ADWRRNNTQ 272 +QFD ++ SP +R ++ + G A + A W R + Sbjct: 479 LQFDYIRYPLQKNAGQYFGYSPAAR----RSFAQLTGVDPIDIAPEESSLWALWTRFKAE 534 Query: 273 QL---IAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVE 329 Q+ +A+ + ++ IKP + + A V+ N P G R D W Sbjct: 535 QVSSFVAESAEKLRRIKPRL---IVSAAVFPN----PPGERLR------LLQQDWEAWAI 581 Query: 330 QGLLDYIAPQIY 341 QG +D + P Y Sbjct: 582 QGNIDLLVPMTY 593 >UniRef50_C1D2P2 Putative uncharacterized protein n=2 Tax=Deinococcus RepID=C1D2P2_DEIDV Length = 521 Score = 77.0 bits (188), Expect = 1e-12, Method: Compositional matrix adjust. Identities = 94/371 (25%), Positives = 141/371 (38%), Gaps = 92/371 (24%) Query: 10 LTIRRPAILVALALLLCSCKSTPPES---------MVTPPAGSKPPATTQQSSQPMRGIW 60 +T + A+L A +LLL +C + P S + TP P QQ +RG+W Sbjct: 1 MTHKLTAVL-ATSLLLAACGTAPQSSDLDALDTQGVRTPGPHDSPRGRGQQE---LRGLW 56 Query: 61 LATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQR-LGINTVFFQVKPDGTALWP 119 V+ P ++ A ID L R + +N +F QV G Sbjct: 57 --------------VDAFGPG----MKTPAEIDVLVATARAMNVNVLFAQVGRRGDCYCN 98 Query: 120 SKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNS 179 + +P T G+DPL ++ +AH +G++VHAW + +T P T Sbjct: 99 NAAMP----RTNDPAVPAGFDPLADLITKAHAQGIQVHAWIITTAIWNSTTPPT------ 148 Query: 180 TLSQQPASVYVQH--------------RDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVS 225 PA + H D G+ ++LDPG P+ ++I ++ VV Sbjct: 149 ----DPAHAFNAHGLGKTGRDFWLMVKNDGTTRGGNDWLLDPGHPDAAEYIRNMYVSVVK 204 Query: 226 RYPVDGVQFDDYFYTE-SP-------GSRLNDNETYRKYGGAFASK-------ADWRRNN 270 Y VDG+QFD YT+ +P G E YR GA + WR Sbjct: 205 NYDVDGIQFDRVRYTDFNPVGGPSNWGYNPTALERYRAETGATGMPLPGDPQWSAWRMQQ 264 Query: 271 TQQLIAKVSHTIKSIKPGVE-------FGVSPAGVWRNRSHDPLGSDTRGAAAYDESYAD 323 L+ + + +K+ KP V +G PA P Y E D Sbjct: 265 VTNLVRETALAVKATKPDVSVNAATITYGAGPANETEWLRSRP----------YTEVLQD 314 Query: 324 TRRWVEQGLLD 334 WV++G LD Sbjct: 315 WVTWVKEGYLD 325 >UniRef50_Q8YXK2 All1210 protein n=4 Tax=Nostocaceae RepID=Q8YXK2_ANASP Length = 906 Score = 76.6 bits (187), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 90/372 (24%), Positives = 138/372 (37%), Gaps = 87/372 (23%) Query: 88 QQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENP---GYDPLQF 144 +Q + D L + GINT+FF+ G ++PSK+ P +NP G+DPL Sbjct: 438 EQELAKIFDRLAQAGINTIFFETINAGYTIYPSKVAP---------QQNPLIRGWDPLAS 488 Query: 145 MLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDW--------- 195 + AH RGM++HAW + EL + + P V + DW Sbjct: 489 GVKLAHARGMELHAWVWTFAAGNQRH----NELLNIPTNYPGPVLAANPDWANYDHQGQM 544 Query: 196 IRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRK 255 I + DP PE++ ++ + E++++Y VDG+Q D Y P D R Sbjct: 545 IPLGQTKPFFDPANPELRQYLLKLYEEIITKYKVDGLQLD---YIRYP---FQDPAAGRS 598 Query: 256 YGGAFASKA----------------------DWRRNNTQQL---IAKVSHTIKSIKPGVE 290 YG A++ W TQQ+ +A+VS ++ + Sbjct: 599 YGYGKAARTQFQQLTGVDPMKISPSQTQLWQQWTTFRTQQVDSFVAQVSQMLRQQDRNLI 658 Query: 291 FGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAAR 350 V+ PL R + W QG +D I P Y ++ R Sbjct: 659 LSVAVF---------PLPEYER----VQKIQQHWEIWARQGNIDLIIPMTY---AQDTVR 702 Query: 351 YDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEI 410 + LAK W + T L GI + + QL L +P + Sbjct: 703 FQTLAKPWITSTQLGSTLLIPGIRLLSLPTLGAFD--------------QLQLVRDLP-V 747 Query: 411 SGTILFREDYLN 422 SG LF + LN Sbjct: 748 SGYALFAAENLN 759 >UniRef50_A8YDR3 Genome sequencing data, contig C294 n=9 Tax=Chroococcales RepID=A8YDR3_MICAE Length = 875 Score = 76.3 bits (186), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 52/174 (29%), Positives = 83/174 (47%), Gaps = 32/174 (18%) Query: 95 LDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGM 154 D + GIN VFF+ ++PS++ P + +T G+DPL+ + AH+R M Sbjct: 425 FDRMAAAGINVVFFETVNASYTIYPSQVAPEQNPLTR------GWDPLKVAVKLAHERNM 478 Query: 155 KVHAWFNPYRVSVNTKPGTIRELNSTLSQQPAS----VYVQHRDWIRT--SGDRF----- 203 ++HAW + + + ++ + +QP + V ++ DW T SG F Sbjct: 479 EIHAWVWVFAAA--------NQAHNKVLEQPLNYLGPVLSRNSDWGATNKSGGSFDYSQG 530 Query: 204 ----VLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETY 253 DP PEVQ+++ S+ E+V Y VDG+Q D Y P N N+TY Sbjct: 531 TKKAFFDPANPEVQNYLLSLYEEIVKNYDVDGLQLD---YIRYPFQNQNYNQTY 581 >UniRef50_UPI0001C16380 Protein of unknown function DUF187 n=1 Tax=Raphidiopsis brookii D9 RepID=UPI0001C16380 Length = 289 Score = 76.3 bits (186), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 69/262 (26%), Positives = 121/262 (46%), Gaps = 49/262 (18%) Query: 43 SKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLG 102 S P T Q S + +RG+W V+S +++ R +V+ D + L+RL Sbjct: 43 SVPSVTAQMSREEIRGVW-----------VTSNDLNVFKDRDQVK-----DAVTKLRRLN 86 Query: 103 INTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNP 162 NT++ V G ++PS + D+ + G+D L ++++AH + + WF Sbjct: 87 FNTIYPVVWNSGYVMYPSNVAKSLDIQPFVFRGSDGHDILADIINQAHSQNLLAIPWFEF 146 Query: 163 YRVSVNT------KPGTIREL--NSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQD 214 ++ NT KP + ++ ST+S A G+ L+P P+VQ Sbjct: 147 GFMTPNTGELALNKPEWLTKMRDGSTVSMSAA-------------GEVSWLNPFHPQVQK 193 Query: 215 WITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNET---YRKYGGAF--ASKAD---- 265 +I ++ E+ + Y +DG+QFDD +T P D+ T Y++ G A+ D Sbjct: 194 FIIDLLVELTNNYDIDGIQFDD--HTSLPHQFGYDDYTVNLYKQETGKNPPANSQDSEWV 251 Query: 266 -WRRNNTQQLIAKVSHTIKSIK 286 WR N + + +++HT+K IK Sbjct: 252 AWRANKITEFMVRLNHTVKQIK 273 >UniRef50_B7JXY5 Putative uncharacterized protein n=9 Tax=Cyanobacteria RepID=B7JXY5_CYAP8 Length = 427 Score = 76.3 bits (186), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 89/366 (24%), Positives = 143/366 (39%), Gaps = 108/366 (29%) Query: 21 LALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNP 80 L L++ S + + PA S+ P+ +RG+WL +N Sbjct: 27 LFLVIFSLSVVLILATLQYPAQSRTPSE-------IRGVWL----------------TNI 63 Query: 81 TSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENP--- 137 S Q ++ D + L++L NT++ V G L+PS + K+ P Sbjct: 64 DSEVLFSQNSLSDGIRTLKQLNFNTLYPTVWNWGHTLYPSPV-------AKKVIGTPLDP 116 Query: 138 -----GYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQH 192 G D LQ ++D+ H+ M V WF + G + +S L+ +++ Sbjct: 117 TEGLQGRDMLQEIIDQGHQANMAVIPWF---------EFGFMAPADSQLA-------IKY 160 Query: 193 RDWI--RTSGD----------RFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYT 240 W+ R +GD R L+P PEVQ +ITS+V E+VS Y +DG+QFDD+F Sbjct: 161 PQWLTERQNGDKIWLEGNVHKRVWLNPLKPEVQQFITSLVTEIVSNYSIDGIQFDDHFGI 220 Query: 241 ESPGSRLNDNETYRKYGGAFASK-------------------------ADWRRNNTQQLI 275 P D+ T + Y K WR N + Sbjct: 221 --PFDFGYDDFTLQLYQQEHQGKLPPKPPQNVKTENNCSINSQEWKEWTQWRANKITGYM 278 Query: 276 AKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDY 335 ++ IK+I P V VSP + P + AD ++W +GL++ Sbjct: 279 TELFKAIKTINPNVIVSVSP-------NPQPFSVNCY--------LADWQQWERRGLVEE 323 Query: 336 IAPQIY 341 + Q+Y Sbjct: 324 LVLQVY 329 >UniRef50_C6PCP2 Putative uncharacterized protein n=1 Tax=Thermoanaerobacterium thermosaccharolyticum DSM 571 RepID=C6PCP2_CLOTS Length = 1117 Score = 75.9 bits (185), Expect = 3e-12, Method: Compositional matrix adjust. Identities = 67/280 (23%), Positives = 121/280 (43%), Gaps = 60/280 (21%) Query: 91 MIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENP---GYDPLQFMLD 147 ++ LD L+ + INT++ G ++P+ + +NP G+D L + Sbjct: 343 VVRNLDMLKSININTIYLDTFWSGYTIYPTN--------SKYTSQNPIYGGFDVLDAYIK 394 Query: 148 EAHKRGMKVHAWFNPYRVSVN--TKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDR--- 202 EAHKRGM V+AW + + + + G I++ ++P + V + + T Sbjct: 395 EAHKRGMVVYAWTENFLIGTSDVSDGGPIKK------EKPEWLMVSRKGYNYTLDKYGIK 448 Query: 203 -FVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNE---------T 252 + L+P IPE +D+++ + E+ S+Y +DG+QFD Y P S N+ Sbjct: 449 YYYLNPAIPEARDFLSELYKEIASKYDIDGIQFD---YIRFPNSNDYSNDFGYDDYTRNL 505 Query: 253 YRKYGGA----FASKAD-------WRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRN 301 +++Y G +D +R N + V ++ IKP ++ A VW N Sbjct: 506 FKQYAGVDPKYLNVNSDMWQLWNYFRMNIVNTFVYSVVSELRMIKPEIKIA---ADVWPN 562 Query: 302 RSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIY 341 +D SD + D++ W + +D + P Y Sbjct: 563 --YDTAPSDI---------FQDSKDWTLKNYIDTLNPMSY 591 >UniRef50_Q2JQ39 Putative uncharacterized protein n=1 Tax=Synechococcus sp. JA-2-3B'a(2-13) RepID=Q2JQ39_SYNJB Length = 850 Score = 74.7 bits (182), Expect = 7e-12, Method: Compositional matrix adjust. Identities = 77/286 (26%), Positives = 126/286 (44%), Gaps = 52/286 (18%) Query: 95 LDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGM 154 D L + G+NTVFF+ G A+ PS++ P + +T G DPL+ + AH+RG+ Sbjct: 403 FDRLAQAGLNTVFFETMNAGFAIHPSRVAPQQNPLT------RGRDPLRAAVRLAHERGL 456 Query: 155 KVHAWFNPYRVSVNTKPGTIRELNSTLSQQ-PASVYVQHRDW--IRTSGDRFV------- 204 ++HAW V NT+ + E+N L Q V H DW + G F Sbjct: 457 ELHAWIWTLAVG-NTRHNLLPEIN--LPQDYIGPVLTAHPDWANLDNRGRLFPRGQPETW 513 Query: 205 LDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFY-TESPGSR------LNDNETYRKYG 257 LDP P+V+ ++ ++ E+V Y VDG+ D Y ++ SR + +++ Sbjct: 514 LDPANPQVRSYLLALTRELVQDYQVDGIHLDYIRYPFQNAASRQVFGFGRAARQGFQQLS 573 Query: 258 GAFASKAD----------WRRNNTQQ---LIAKVSHTIKSIKPGVEFGVSPAGVWRNRSH 304 G + D W R TQQ ++ ++ T +S+ P V + A V+ Sbjct: 574 GVDPLELDPLRDRSLWQLWTRYRTQQVNEVVEAIARTARSLNPRV---ILSAAVYALPKQ 630 Query: 305 DPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAAR 350 + L R ++E W++ G LD + P Y +R A+ Sbjct: 631 ERL---QRLQQNWEE-------WIQAGELDLLIPLTYAGNTRRLAQ 666 >UniRef50_Q10YX0 Putative uncharacterized protein n=2 Tax=Cyanobacteria RepID=Q10YX0_TRIEI Length = 1099 Score = 74.7 bits (182), Expect = 7e-12, Method: Compositional matrix adjust. Identities = 49/162 (30%), Positives = 79/162 (48%), Gaps = 12/162 (7%) Query: 83 RARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPL 142 RAR ++ + + L GINTVFF+ G ++PS + P + +T +DPL Sbjct: 618 RAR-SERGLAGVFNRLAAAGINTVFFETINAGYTIYPSNVAPRQNPLT------TSWDPL 670 Query: 143 QFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTL----SQQPASVYVQHRDWIRT 198 + + AH+R M++H W + V + + +S L S P+ V R R Sbjct: 671 KAAVKLAHERNMELHPWIWAFAVGNKAHNQALGQGDSYLGPVISAHPSWVMTDKRGRKRH 730 Query: 199 SGD-RFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFY 239 D + +DP PEV+ ++ +I+ E+ SRY VDG+ D Y Sbjct: 731 PLDGKVYMDPANPEVRQYLLNIIDEIASRYEVDGIHLDYIRY 772 >UniRef50_B2IV00 Putative uncharacterized protein n=4 Tax=Cyanobacteria RepID=B2IV00_NOSP7 Length = 381 Score = 72.4 bits (176), Expect = 3e-11, Method: Compositional matrix adjust. Identities = 92/376 (24%), Positives = 149/376 (39%), Gaps = 83/376 (22%) Query: 57 RGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTA 116 RGIWL T S+ +Q + + +D L G N VF V Sbjct: 7 RGIWLTTTD----------------SKVLRSKQRIAEAMDLLAETGFNVVFPVVWNKAVT 50 Query: 117 LWPSKILPWSDLMTGKIGENP---GYDPLQFMLDEAHKRGMKVHAWFN-PYRVSVNTKPG 172 L+PS+ + T + +P G DPL+ ++ EA + G+KV WF + S N G Sbjct: 51 LYPSQTMQ----ETFGVEIDPMSVGRDPLEEVVVEARRVGLKVIPWFEYGFASSYNLNGG 106 Query: 173 TI---------RELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEV 223 + R+ N L + ++ D +VQ++ ++V EV Sbjct: 107 VLLQKKPEWAARDFNGNLLNKNGFEWLNALD---------------SQVQEFFLNLVLEV 151 Query: 224 VSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASK----------ADWRRNNTQQ 273 V Y VDGVQ DD P D T +Y + WR + Sbjct: 152 VKNYDVDGVQGDDRL-PAFPCEGGYDEGTVSRYRQEYDRNPPQNPKDRQWLQWRADILTD 210 Query: 274 LIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLL 333 +A++ +K++ P + ++P HD A+ E D+ W+++G++ Sbjct: 211 FLARLYGEVKAVNPNLLVAIAP------NIHD---------WAFQEYLQDSPTWLKRGIV 255 Query: 334 DYIAPQIY-WPFSRSAARYD-VLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMIN 391 D I PQIY F A D ++++ + D P RL GI K+G I P++++ Sbjct: 256 DMIQPQIYRRDFGSYCAIADKLVSQQFTDATLP---RLAPGI-LMKLGSYC-ISPEYLVQ 310 Query: 392 GGVPELKKQLDLNDAV 407 E +QL + V Sbjct: 311 A--IEYNRQLGIQGEV 324 >UniRef50_C1D298 Putative uncharacterized protein n=1 Tax=Deinococcus deserti VCD115 RepID=C1D298_DEIDV Length = 628 Score = 71.6 bits (174), Expect = 6e-11, Method: Compositional matrix adjust. Identities = 72/280 (25%), Positives = 112/280 (40%), Gaps = 59/280 (21%) Query: 92 IDKLDHLQR-LGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPG----YDPLQFML 146 +D+L R + INT+F Q G +LP + E+P +DPL +L Sbjct: 273 VDRLIADARAMNINTLFVQAVKRGDCYCNGSLLPRT--------EDPAVPAEFDPLADVL 324 Query: 147 DEAHKRGMKVHAWFNPYRVSVN------TKPGTIRELNSTLSQQPASVYVQHRDWI-RTS 199 +AH G+KVHAW P VS T P + + +Q DW+ R S Sbjct: 325 TKAHAHGIKVHAWVIPTAVSNRAVRYPVTNPEHVVNAHGEGDEQ---------DWLMRNS 375 Query: 200 GDRF------VLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLN----- 248 G LD G P+ + ++ + V + Y +DGVQ D Y + G+ + Sbjct: 376 GGSMWAGNDQQLDIGHPDARRYMVDAIQSVAAAYNIDGVQLDRVRYPDPSGTVQDWGYNP 435 Query: 249 --------DNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWR 300 ++ET A WRR L+++VS ++S +PG V+ Sbjct: 436 GAVAAYQAESETTETPAPGDARWTAWRREQVNALVSEVSGAVRSARPGTVISVAAITYG- 494 Query: 301 NRSHDPLGSDTRGAAAYDESYADTRR----WVEQGLLDYI 336 G TR A A +YA+ + W+ G +D + Sbjct: 495 ------AGPRTREAFASTRTYAEVLQDWPLWLADGNVDLV 528 >UniRef50_Q8EPF4 Hypothetical conserved protein n=1 Tax=Oceanobacillus iheyensis RepID=Q8EPF4_OCEIH Length = 502 Score = 71.2 bits (173), Expect = 8e-11, Method: Compositional matrix adjust. Identities = 80/313 (25%), Positives = 130/313 (41%), Gaps = 50/313 (15%) Query: 92 IDKL-DHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENP----GYDPLQFML 146 ID+L D + + +NT+ QV A + S +LP++ E+P G+DPL ++L Sbjct: 59 IDELVDDVHKANMNTIIAQVSRRHDAYYQSDVLPFT--------EDPSVPEGFDPLGYLL 110 Query: 147 DEAHKRGMKVHAWF--NPYRVSVN----TKPGTIRELNSTLSQQPASVYVQHRDWIRTSG 200 +AH++G++VHAW P SV + P I L+ +Q+ + W Sbjct: 111 TKAHEKGIEVHAWVVVGPMWHSVYGDAPSDPTHIWNLHGPDAQEES--------WATEDY 162 Query: 201 DRFV------LDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYR 254 + V LD G PE ++ + +V ++ Y VDGV D Y E G N R Sbjct: 163 NGNVPYWQPYLDLGHPEARNHVVDMVNDIAKNYEVDGVHLDYIRYPED-GKGYNATSLAR 221 Query: 255 KYGGAFASK---------ADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHD 305 + + W+ LI +V + ++ VE A V H+ Sbjct: 222 FHEETGRTDRPPVNDQEWIAWKVEQVDSLIKRVYTELLTVDSDVELS---AAVLSWGFHN 278 Query: 306 PLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFS--RSAARYDVLAKWWADVVK 363 P + ++ + + WV++G LDY Y + R A R+D +W D+ Sbjct: 279 PSNTHWWNMDPVQRAHQNWKEWVQEGYLDYAYVMNYDSDADPRRALRFDQWIEWQKDL-- 336 Query: 364 PTRTRLYIGIAFY 376 P + IG A Y Sbjct: 337 PRNRGIIIGPALY 349 >UniRef50_Q6AHL3 Putative uncharacterized protein n=1 Tax=Leifsonia xyli subsp. xyli RepID=Q6AHL3_LEIXX Length = 112 Score = 70.9 bits (172), Expect = 9e-11, Method: Compositional matrix adjust. Identities = 38/88 (43%), Positives = 56/88 (63%), Gaps = 3/88 (3%) Query: 293 VSPAGVWRNRSHDPLGSDTRGAAAY---DESYADTRRWVEQGLLDYIAPQIYWPFSRSAA 349 +SP G+W ++++D GSDT +++ + YADT WV+ G+LDYI PQ+YW + A Sbjct: 1 MSPFGIWEHKANDSRGSDTPTSSSSTYSKQVYADTLGWVKAGILDYIVPQVYWSSDQPVA 60 Query: 350 RYDVLAKWWADVVKPTRTRLYIGIAFYK 377 Y +A+WW + V+ T RLYIG YK Sbjct: 61 PYGEIARWWNNAVEGTNVRLYIGQPNYK 88 >UniRef50_B0VF99 Putative uncharacterized protein n=1 Tax=Candidatus Cloacamonas acidaminovorans RepID=B0VF99_9BACT Length = 482 Score = 70.5 bits (171), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 63/230 (27%), Positives = 99/230 (43%), Gaps = 52/230 (22%) Query: 133 IGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQH 192 I E+ +DPL ++L +AH++G+ V AW V N P RE + Q +Y H Sbjct: 81 ILEDASFDPLAYILKKAHQKGLAVQAWV----VVFNATP---REQSYI---QQNYIYNNH 130 Query: 193 RDWI------------RTSGDRFVLDPGIPEVQDWITSIVAEVVSRYP-VDGVQFDDYFY 239 +DWI R SG + +DPGIPEVQ+++ +I+ + YP +DG+ D Y Sbjct: 131 KDWITYNFNGSQMNIDRQSG--YFIDPGIPEVQEYLLNILGNLAGGYPELDGIHLDYIRY 188 Query: 240 TES-----PGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEF--G 292 ES P S NE + + +WR + +K I P ++ Sbjct: 189 PESDLGFHPVSLARYNEYCQ--NQEEITYNEWRIMQVTNFVENAYFQLKEINPTLQLTAA 246 Query: 293 VSPAGVWRNRSHDPLGSDTRGAAAYDESYA-DTRRWVEQGLLDYIAPQIY 341 V P A + YA D + W+++G++D + P Y Sbjct: 247 VVP-----------------DIAEANVDYAQDWQSWLKKGIIDRVYPMAY 279 >UniRef50_Q1IWF6 Putative uncharacterized protein n=3 Tax=Deinococcus RepID=Q1IWF6_DEIGD Length = 536 Score = 69.3 bits (168), Expect = 3e-10, Method: Compositional matrix adjust. Identities = 61/202 (30%), Positives = 92/202 (45%), Gaps = 22/202 (10%) Query: 101 LGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWF 160 LG+NT+F Q L LP ++T E +DPL + AH RGM+V AW Sbjct: 95 LGVNTLFVQAIRRADCLCRRSSLP---VITDADLEKD-FDPLAEVTRLAHARGMRVIAWV 150 Query: 161 NPYRVSV----NTKPGTIRELNSTLSQQPASVYVQHR-DWIRTSGDRFVLDPGIPEVQDW 215 + S N+ P + + +Q A+ ++ R D G LDP IP D+ Sbjct: 151 SVTGASNLRVPNSNPAHVSRQHG--AQAGAASWLSRRPDGSWQEGADGWLDPAIPAAADF 208 Query: 216 ITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNET---YRKYGGAFASKA-------D 265 + V +V YPVDGVQ D Y + G+ D +T YR GA + A D Sbjct: 209 MVGGVVSLVKHYPVDGVQLDRIRYPDG-GNWGYDPKTLARYRAETGAKGTPAPDDARWRD 267 Query: 266 WRRNNTQQLIAKVSHTIKSIKP 287 W+R L+ +++ +K+++P Sbjct: 268 WKREQVTLLVRRIALEVKAVRP 289 >UniRef50_B5W1E7 Putative uncharacterized protein n=2 Tax=Arthrospira RepID=B5W1E7_SPIMA Length = 910 Score = 69.3 bits (168), Expect = 3e-10, Method: Compositional matrix adjust. Identities = 54/179 (30%), Positives = 83/179 (46%), Gaps = 19/179 (10%) Query: 95 LDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGM 154 D L GINTVFF+ G ++PS++ P + +T G+DPL + A RGM Sbjct: 462 FDRLADAGINTVFFETVNAGYTIYPSRVAPSQNPLT------VGWDPLAAAVKLAKARGM 515 Query: 155 KVHAWFNPYRVSVNTKPGTIRE----LNSTLSQQP--ASVYVQHRDWIRTSGDRFVLDPG 208 ++HAW + ++ +R+ L LS P A++ Q R W + + LDP Sbjct: 516 ELHAWVWVFAIANQRHNALLRQPDSYLGPVLSAYPEWANLDNQGRTW-HENDRKAYLDPA 574 Query: 209 IPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWR 267 EV+ ++ +V E+ Y VDG+ D Y P N N +G AS+ +R Sbjct: 575 NREVRSYLLRLVGEIAHNYQVDGIHLD---YIRYPFQDANRN---FNFGYGTASRTQFR 627 >UniRef50_A0YRE2 Putative uncharacterized protein n=1 Tax=Lyngbya sp. PCC 8106 RepID=A0YRE2_9CYAN Length = 574 Score = 68.9 bits (167), Expect = 3e-10, Method: Compositional matrix adjust. Identities = 81/349 (23%), Positives = 140/349 (40%), Gaps = 82/349 (23%) Query: 18 LVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNI 77 ++ ALL +S+ P+ + P + +RG+WL + S + Sbjct: 175 FLSQALLKAEEESSVPKEYLVKAVEIDAP------NGEIRGVWLTNID--------SDVL 220 Query: 78 SNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENP 137 +PTS +++ +D L L NT++ V G +PS+++ ++ ++ P Sbjct: 221 FSPTS--------VVEAIDSLSELNFNTLYPVVWNRGFTQFPSQVM--KRIIGTELDPAP 270 Query: 138 ---GYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHR- 193 G D LQ ++ +A + M V WF E + Q S ++Q R Sbjct: 271 ELAGRDVLQEIITQAKAKNMSVMPWF---------------EFGFMVPQD--SQFLQSRP 313 Query: 194 DWIRTSGD-----------RFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYF---- 238 +WI T+ + R +P P+VQ +I ++ EVVS+Y +DG+QFDD+F Sbjct: 314 NWITTNKEGIPFVKEEDKYRVWFNPFNPQVQQFILDLIVEVVSKYDIDGIQFDDHFGLPF 373 Query: 239 ---YTESPGSRLNDNETYRKYGGAFASKAD---WRRNNTQQLIAKVSHTIKSIKPGVEFG 292 Y E S+L E K + D WR + + ++ +K KP Sbjct: 374 ELGYDEF-TSKLYQRENDGKLPPSDPKDQDWVKWRADKLTDFMMRLFWVVKDYKPDCIIS 432 Query: 293 VSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIY 341 +SP + AYD D W + G ++ + Q+Y Sbjct: 433 LSP---------------NPKSYAYDNYLQDWPTWEQSGFIEELVLQVY 466 >UniRef50_A6CAJ3 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CAJ3_9PLAN Length = 811 Score = 68.6 bits (166), Expect = 4e-10, Method: Compositional matrix adjust. Identities = 70/271 (25%), Positives = 101/271 (37%), Gaps = 68/271 (25%) Query: 97 HLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKV 156 L G N + + G A +PS +LP S K G D ++ L AH+ G++V Sbjct: 492 ELSDAGFNMIIPNMLWGGLAHYPSDVLPRSTTYE-KYG-----DQIEQCLKAAHQHGLEV 545 Query: 157 HAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWI 216 H W + +S T P + + SV + DW L+P PE Sbjct: 546 HVWKVNHNLS--TAPQAFVKKMRDAGRTQVSVTGEPSDW---------LNPAHPENFQLE 594 Query: 217 TSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRK-----------------YGGA 259 + EVV +YPVDG+ FD Y P R + ++ R+ Y G Sbjct: 595 VDSMLEVVRKYPVDGIHFD---YIRYPNDRHDYSDYSRQKFEADTGIKVQNWPADCYNGT 651 Query: 260 FASK-ADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYD 318 S+ DWR +L+ V + I+PG++ +AA Sbjct: 652 LKSQYRDWRAAQITRLVETVQREARKIRPGIKI----------------------SAAVF 689 Query: 319 ESYADTRRWVEQ--------GLLDYIAPQIY 341 Y D R WV Q G LD+I P Y Sbjct: 690 REYPDCREWVAQDWPLWAKNGYLDFICPMDY 720 >UniRef50_B1WZU0 Putative uncharacterized protein n=2 Tax=Cyanothece RepID=B1WZU0_CYAA5 Length = 421 Score = 68.2 bits (165), Expect = 6e-10, Method: Compositional matrix adjust. Identities = 77/328 (23%), Positives = 131/328 (39%), Gaps = 78/328 (23%) Query: 54 QPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPD 113 Q RG+WL V+ S + + +RA + L +L NTV+ V Sbjct: 45 QERRGVWLTNVAS------SVLFVPGSVNRA----------IKQLSQLHFNTVYPVVWNR 88 Query: 114 GTALWPSKILPWSDLMTGKIGE------NPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSV 167 G +PS + + M G+ E D L+ +++E+H+RG+ V WF Y + + Sbjct: 89 GHTFYPSSL---AKEMIGESQEPLLNWTRSNIDVLRVIIEESHQRGLAVIPWFE-YGLMI 144 Query: 168 ---------------NTKPGTIR-----ELNSTLSQQPASV---YVQHRDWIRTSGDRFV 204 +++ GT+ EL + ++ + + QH + + + Sbjct: 145 PRSSLIAQKHPDWLTHSQQGTVNTFFQDELKTKNKKKSTNFLENWSQH-SYQKRASQLVW 203 Query: 205 LDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASK- 263 L+P PEVQ I ++ E++ +Y VDGVQ DD+F P D T + Y K Sbjct: 204 LNPFHPEVQQLIKGLMLEIIMQYKVDGVQLDDHFGI--PVELGYDPLTIKLYQQEHEGKN 261 Query: 264 ----------ADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRG 313 +WR + + TIK + P + +SP Sbjct: 262 PPNDPYNAQWMNWRAKKLTAFMTDLVTTIKIVNPDILISLSPNSY--------------- 306 Query: 314 AAAYDESYADTRRWVEQGLLDYIAPQIY 341 + +Y D + WV+QGL+D + Q+Y Sbjct: 307 SFSYQNYLQDWKTWVKQGLIDELVLQVY 334 >UniRef50_B4AVG6 Putative uncharacterized protein n=1 Tax=Cyanothece sp. PCC 7822 RepID=B4AVG6_9CHRO Length = 423 Score = 67.8 bits (164), Expect = 7e-10, Method: Compositional matrix adjust. Identities = 101/397 (25%), Positives = 155/397 (39%), Gaps = 68/397 (17%) Query: 56 MRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGT 115 +RG+WL V S +Q +I+ L+ L G NTVF V G Sbjct: 4 IRGVWLTNVG----------------SEVLNSRQNIINALNLLADTGFNTVFPVVWNKGF 47 Query: 116 ALWPSKILPWSDLMTGKIGENP---GYDPLQFMLDEAHKRGMKVHAWFN-PYRVSVNTKP 171 +PS+++ L T +P G DPL +++ A G+ V WF + S Sbjct: 48 TQYPSQVM----LQTFNQEIDPAFAGRDPLAEVIEAAKNVGIDVIPWFEYGFACSYQKNG 103 Query: 172 GTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDG 231 G I ++ +P + + ++ PEVQ++I S+V EV Y V G Sbjct: 104 GHI------IASKPHWAAKDINNQLLNKNGFEWMNAFEPEVQNFILSLVLEVARNYDVAG 157 Query: 232 VQFDDYFYTESPGSRLNDNETYRKYGGAFASK----------ADWRRNNTQQLIAKVSHT 281 VQ DD P D +T +Y K WR + +S Sbjct: 158 VQGDDRL-PALPCEGGYDEKTRARYYSEQGVKPPQNIKDAKWLQWRAALLTNFLGNLSRE 216 Query: 282 IKSIKPGVEFGVSPAGVWRNRSHD-PLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQI 340 +K+IK + +S SH P G Y E D+ W+ Q ++D I PQ+ Sbjct: 217 VKAIKNDLLVSIS--------SHPYPFG--------YHEYLQDSPTWIRQKIVDVIHPQL 260 Query: 341 YWPFSRSAARYDVLAKWWADVVKP-TRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKK 399 Y R+ Y L + P TRL+ G+ ++ P K + D+ I+ PE Sbjct: 261 Y---RRTLKDYQALVETTLKQFSPDDLTRLFPGV-LIRLNAPGKPQ-DFHIS---PEQLW 312 Query: 400 QLDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQSR 436 Q L + I G + F + LN Q +LQ++ Sbjct: 313 QTILINRRLGIRGEVFFFFEELN-VNAQSLAQFLQAK 348 >UniRef50_Q8YLM8 Alr5270 protein n=12 Tax=Cyanobacteria RepID=Q8YLM8_ANASP Length = 420 Score = 67.8 bits (164), Expect = 8e-10, Method: Compositional matrix adjust. Identities = 58/194 (29%), Positives = 87/194 (44%), Gaps = 35/194 (18%) Query: 205 LDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKA 264 L+P PEVQ +I S+++EVV+ Y +DG+Q DD+F P D T Y K+ Sbjct: 184 LNPLHPEVQKFILSLISEVVTNYHIDGIQVDDHF--GMPVQFGYDPYTTELYQKEHKGKS 241 Query: 265 -----------DWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRG 313 WR N + + +VS +K IKP V+ +SP Sbjct: 242 PPRNHLDAEWMKWRANKITRFMTQVSQVVKEIKPSVKVSLSP---------------NSQ 286 Query: 314 AAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRL--YI 371 A AY D W++ GL+D + Q+Y +S+ Y++ VK RT++ I Sbjct: 287 AFAYKYYLQDWANWIKTGLVDELILQVYRN-DKSSFVYELEQP----AVKLARTQIPVAI 341 Query: 372 GIAFYKVGEPSKIE 385 GI+ + P KIE Sbjct: 342 GISTGTLRSPVKIE 355 >UniRef50_B5WA73 Putative uncharacterized protein n=2 Tax=Arthrospira RepID=B5WA73_SPIMA Length = 476 Score = 67.4 bits (163), Expect = 9e-10, Method: Compositional matrix adjust. Identities = 75/300 (25%), Positives = 122/300 (40%), Gaps = 56/300 (18%) Query: 56 MRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGT 115 +RG+W+ T N T + Q + + + L + NT++ V G Sbjct: 125 IRGVWMTT---------------NDTD-VLMNQPRLEEAVSKLAQFNFNTIYPVVWNSGY 168 Query: 116 ALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIR 175 + S ++ + + G D L +++ AH+ + V WF + G + Sbjct: 169 VTYKSSVVKEAGIQPFVRRGFQGQDMLADIIERAHRHNLLVLPWF---------EFGFMA 219 Query: 176 ELNSTLSQQPASVYVQHRDWIRTS----GDRFVLDPGIPEVQDWITSIVAEVVSRYPVDG 231 +S L+ + + Q RD +TS G+ L+P P+VQ ++T ++ EVV+ Y +DG Sbjct: 220 PPSSELALKHPNWLTQQRDGTKTSISAAGEVVWLNPFHPQVQKFMTDLIVEVVTDYDIDG 279 Query: 232 VQFDDYFYTESPGS--------RLNDNETYRKYGGAFASKA--DWRRNNTQQLIAKVSHT 281 VQFDD +T P + L ET R A WR + + ++ Sbjct: 280 VQFDD--HTSLPSTFGYDPYTISLYQRETNRTPPSNPQDPAWVRWRAHKITAFMRQLHQA 337 Query: 282 IKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIY 341 IK+ KP VSP N H AY+ D WV GL+D + Q+Y Sbjct: 338 IKAKKPHSIISVSP-----NPYH----------IAYNGHLQDWVTWVRDGLVDELVVQVY 382 >UniRef50_Q8YK50 All8067 protein n=8 Tax=Cyanobacteria RepID=Q8YK50_ANASP Length = 399 Score = 65.1 bits (157), Expect = 5e-09, Method: Compositional matrix adjust. Identities = 52/183 (28%), Positives = 86/183 (46%), Gaps = 25/183 (13%) Query: 72 VSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTG 131 + + +S + +Q + ++ + + GINT+ V +G ++ S D+M Sbjct: 113 IRGIYLSRYQATNNADEQTIRQRVRYYRSQGINTIIHGVWGNGCTMYKS------DVMQQ 166 Query: 132 KIGENPGYDPLQ-----FMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPA 186 +G + + Q +++DEAHK+GM+VHA+F + G + NS + Sbjct: 167 TLGYSSCPNQFQEKWLNWLIDEAHKQGMQVHAYF---------EKGIKIDKNSPIFDLAV 217 Query: 187 SV--YVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYP-VDGVQFDDY--FYTE 241 + V D D +VLD PEV + +I E V +YP VD VQ+DDY +Y E Sbjct: 218 AKNWMVPGIDKTYAGIDHYVLDVEKPEVATFFKNISVEFVKKYPNVDAVQWDDYLGYYAE 277 Query: 242 SPG 244 PG Sbjct: 278 LPG 280 >UniRef50_Q6ZE96 Slr7102 protein n=5 Tax=Cyanobacteria RepID=Q6ZE96_SYNY3 Length = 338 Score = 64.7 bits (156), Expect = 6e-09, Method: Compositional matrix adjust. Identities = 80/348 (22%), Positives = 142/348 (40%), Gaps = 102/348 (29%) Query: 8 KKL--TIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVS 65 KKL +++ PA+ V + LLL +C + P T + + M+G+WL V Sbjct: 2 KKLLKSLKWPALFVGIILLLAACH--------------RAPTRTAKETDKMKGVWLTDVG 47 Query: 66 RLDWPPVSSVNISNPTSRARVQQQAMIDK-LDHLQRLGINTVFFQVKPDGTALWPSKILP 124 + + ++D+ L H+ + + V+F V L+P++ Sbjct: 48 TMG-----------------LTYSTLLDETLHHISKSDYDRVYFSVYGLRGQLYPTR--Q 88 Query: 125 WSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQ 184 DL I + P + + M E+ ++G+K +AWF + Q Sbjct: 89 RGDL----IPKLPFPNAVGSMARESRRQGLKPYAWFE----------------YGLMLPQ 128 Query: 185 PASVYVQHRDWIRT--SGDRFV---------LDPGIPEVQDWITSIVAEVVSRYPVDGVQ 233 V + DW+ T +G++ + LDP PEV+ +I + + +++ + G+Q Sbjct: 129 FDPVAKNNPDWLLTMANGEQVIENHGVPMVWLDPSNPEVEAYILAHIDDILKEKSLAGIQ 188 Query: 234 FDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGV 293 DD++ R++G D+RR+ T L KV IK+ P E + Sbjct: 189 LDDHWAVP------------RQFG-------DYRRSLT-ALTTKVHEHIKTKNPEFELSL 228 Query: 294 SPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIY 341 SP +P + +E D RWV+QG++D + QIY Sbjct: 229 SP---------NPY------QFSLNEYNQDWLRWVKQGIVDEVVVQIY 261 >UniRef50_Q7NJN0 Glr1802 protein n=1 Tax=Gloeobacter violaceus RepID=Q7NJN0_GLOVI Length = 344 Score = 64.3 bits (155), Expect = 9e-09, Method: Compositional matrix adjust. Identities = 94/390 (24%), Positives = 156/390 (40%), Gaps = 86/390 (22%) Query: 56 MRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGT 115 +RG+WL +N SR ++A++ +D L + G N VF V G Sbjct: 4 LRGVWL----------------TNVGSRVLHSREAIVRAMDLLAQTGFNAVFPVVWNKGF 47 Query: 116 ALWPSKILPWSDLMTGKIGENPGY-----DPLQFMLDEAHKRGMK-VHAWFNPYRVSVNT 169 L+PS+I+ L I +P Y DPL +++ A + G++ V WF S Sbjct: 48 TLYPSRIM----LELFGIEIDPLYAEAKRDPLAEVIEAAGRAGIRMVIPWFEYGFASSPR 103 Query: 170 KPGTIRELNSTLSQQPASVYVQHRDWI-RTSGDRFVLDPGI-------PEVQDWITSIVA 221 G L +PA W R SG ++ G+ PEVQ+++ S++ Sbjct: 104 SDG-----GHILLTRPA--------WTARVSGGAPLVKNGLVWMNALDPEVQNFVLSLML 150 Query: 222 EVVSRYPVDGVQFDDYF--YTESPGSRLNDNETYRKYGGA----FASK---ADWRRNNTQ 272 EV + Y + GVQ DD G E +R+ G+ +AS+ WR + Sbjct: 151 EVATHYDIVGVQGDDRLPALPVEGGYDPRTVELFRETTGSDPPGWASEPGWVQWRADRLT 210 Query: 273 QLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGL 332 + + ++ IKS++P + ++P S P + + D W +G Sbjct: 211 EFLGRLYTQIKSVRPELLLSLAP-------SVYPF--------SLNHYLQDVAEWARRGW 255 Query: 333 LDYIAPQIYWP-FSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMIN 391 D + PQ+Y F + D + D R+ GIAF G + Sbjct: 256 FDLLHPQVYRENFGQYRREIDRFKR---DFPPEATGRIAPGIAFKANG----------VE 302 Query: 392 GGVPELKKQLDLNDAVPEISGTILFREDYL 421 GV ++++++ LN + G + F D L Sbjct: 303 IGVDDVRRRIALN-CERGLGGEVFFYFDGL 331 >UniRef50_C3R3M7 Putative uncharacterized protein n=2 Tax=Bacteroides sp. 2_2_4 RepID=C3R3M7_9BACE Length = 432 Score = 63.5 bits (153), Expect = 1e-08, Method: Compositional matrix adjust. Identities = 48/163 (29%), Positives = 74/163 (45%), Gaps = 11/163 (6%) Query: 95 LDHLQRLGINTVFFQVKP-DGTALWPSKIL-PWSDLMTGKIGENPGYDPLQFMLDEAHKR 152 LD + G N + V+P G AL+ S L P +DL I + Y LQF +DEAHKR Sbjct: 71 LDLAKSTGFNKIVVDVRPVQGDALFKSSYLTPLTDLAGTHIERDWNY--LQFFIDEAHKR 128 Query: 153 GMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRF----VLDPG 208 +KV + + + + + T + Y + + I D+ L+P Sbjct: 129 ELKVTVSATIFTAGLPSSKNGMAYRDDTWDGKTCLEYTKDQGLIDIKDDKTKVSAFLNPV 188 Query: 209 IPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNE 251 +PEVQD+ + + E+V+ Y DG D Y PG + +E Sbjct: 189 LPEVQDFCLNFIKELVTNYNFDGFALD---YCRYPGDESDFSE 228 >UniRef50_C3A5Y1 Putative uncharacterized protein n=1 Tax=Bacillus mycoides DSM 2048 RepID=C3A5Y1_BACMY Length = 143 Score = 63.2 bits (152), Expect = 2e-08, Method: Compositional matrix adjust. Identities = 35/126 (27%), Positives = 63/126 (50%), Gaps = 14/126 (11%) Query: 10 LTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDW 69 + ++R ++ + +L P S ++P + TT + + +R +W+A+V +DW Sbjct: 1 MIMKRLVMMCYIVILF------TPFSFISPHSTYAEVNTTYKKHE-LRAVWIASVLNIDW 53 Query: 70 PPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLM 129 P + + I Q+Q I LD ++ G+N V Q+KP A +PS PWS+ + Sbjct: 54 PSKTGLPIEK-------QKQEFIRLLDDVKNTGMNAVVVQIKPTADAFYPSNYGPWSEYI 106 Query: 130 TGKIGE 135 TG G+ Sbjct: 107 TGTQGK 112 >UniRef50_B4WH89 Putative uncharacterized protein n=1 Tax=Synechococcus sp. PCC 7335 RepID=B4WH89_9SYNE Length = 635 Score = 62.8 bits (151), Expect = 2e-08, Method: Compositional matrix adjust. Identities = 91/380 (23%), Positives = 148/380 (38%), Gaps = 97/380 (25%) Query: 16 AILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSV 75 AI+ +L + + P + +V PP S P T + +RG+W+ Sbjct: 231 AIICRASLSPNTVATVPSDRIVFPP--SLPTVATPTTE--LRGVWM-------------- 272 Query: 76 NISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILP---------WS 126 +N S + A+ L+ L +L N V+ V GT L+PS + + Sbjct: 273 --TNIDSDVLFSRSALEQALETLSKLNFNVVYPTVWNWGTTLYPSAVAERTIGYKQGLYP 330 Query: 127 DL-MTGKIGENPGY----DPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTL 181 DL TG+ E D LQ +++ AH R +KV WF + G + +S L Sbjct: 331 DLDRTGRKVELEAAQGDRDMLQEIIELAHSRNLKVMPWF---------EFGFMAPADSEL 381 Query: 182 SQQPASVYVQHRDWIRTSGD-----------RFVLDPGIPEVQDWITSIVAEVVSRYPVD 230 +++ H DW+ D R L+P EVQ ++ +++E+ + Y +D Sbjct: 382 ARR-------HPDWLTQKADGTLTTLEGEHERVWLNPFHLEVQTFLLQLISELSANYDID 434 Query: 231 GVQFDDYFYTESPGSRLNDNETYRKYGGAFASK---AD--------WRRNNTQQLIAKVS 279 G Q DD+ P + D T Y K AD WR + + +V Sbjct: 435 GFQVDDHMGL--PFAYGYDPYTINLYQQEHDGKSPPADPKDPEWTRWRADKITDFMDQVF 492 Query: 280 HTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQ 339 T+K+ +P VSP +P AY+ D WV++G ++ + Q Sbjct: 493 TTVKAQRPQAIMSVSP---------NP------HIFAYEYYLQDWDTWVKRGYVEELIIQ 537 Query: 340 IY--------WPFSRSAARY 351 +Y W + AA Y Sbjct: 538 LYRTDLGRFVWEMGQEAAEY 557 >UniRef50_A2C8D8 DUF187 n=12 Tax=Cyanobacteria RepID=A2C8D8_PROM3 Length = 410 Score = 61.6 bits (148), Expect = 6e-08, Method: Compositional matrix adjust. Identities = 67/276 (24%), Positives = 100/276 (36%), Gaps = 43/276 (15%) Query: 43 SKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLG 102 S+P A + Q+S P D P+ V ++N S + M + L R G Sbjct: 35 SQPCAISAQASTPPSVAQSGLRHLSDHLPIVGVWMTNSPSPLYYSRNLMHKAVKDLYRAG 94 Query: 103 INTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNP 162 ++ V G+ S P + K G DP+ + E H RGMKV WF Sbjct: 95 FTALYLNVWSRGSTFHRSNYAPVEGPLQ-KAGL--ALDPICTLRREGHARGMKVVPWFE- 150 Query: 163 YRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGD------------RFVLDPGIP 210 + A V H DW+ D R L+P P Sbjct: 151 ---------------YGLMEPDDAEVVKLHPDWVLARADGNPVVKMHGNHKRVWLNPAHP 195 Query: 211 EVQDWITSIVAEVVSRYPVDGVQFDDYF----------YTESPGSRLNDNETYRKYGGAF 260 EV+ +V EV+ R +DGVQ DD+F YT + + + R Y F Sbjct: 196 EVRARFIGVVIEVMKRCKMDGVQLDDHFAWPVQLGYDPYTVALYQQETGSLPPRDYSDRF 255 Query: 261 ASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPA 296 + WRR L+ ++ ++ K V ++P Sbjct: 256 WMQ--WRRRKLTGLLRELRQALEKEKLPVNISLAPG 289 >UniRef50_C5CIL6 Putative uncharacterized protein n=1 Tax=Kosmotoga olearia TBF 19.5.1 RepID=C5CIL6_KOSOT Length = 993 Score = 60.1 bits (144), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 68/277 (24%), Positives = 112/277 (40%), Gaps = 53/277 (19%) Query: 92 IDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHK 151 I+KL HL G N + +V GT + P K+ + K E DPL+ +++EAHK Sbjct: 352 IEKLAHL---GFNVLLPEVIWKGTTISP-KLTVYPQNEEFKDWEE---DPLEIIIEEAHK 404 Query: 152 RGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRF----VLDP 207 M+VHAW + V G + E N +++ P V GD Sbjct: 405 YDMEVHAWTWTFAV------GYLGESNELMNKNPHLVEKDRFGRTFAEGDNVKRAGFFSH 458 Query: 208 GIPEVQDWITSIVAEVVSRYP-VDGVQFDDYFYTESPGSRLNDN-------ETYRKYGGA 259 P+ ++ I S + EVV +YP +DG+ D Y S + D+ + +++ G Sbjct: 459 SNPKARELIKSAIKEVVEKYPEIDGINLD---YIRYENSDIIDHGYDDYSVKAFKEETGI 515 Query: 260 FASKAD-----------WRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLG 308 K + WR N + ++S +K+IKP + S D + Sbjct: 516 DPFKIEKYTKEEVLWHLWRENQVTSFVKEISEELKAIKPTIII-----------SADVIN 564 Query: 309 SDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFS 345 T + +++ W + G +D + P Y P S Sbjct: 565 LPTGAQHKFKQNWV---LWAKNGYVDALFPMAYTPSS 598 >UniRef50_Q3AJ74 Putative uncharacterized protein n=3 Tax=Chroococcales RepID=Q3AJ74_SYNSC Length = 390 Score = 59.7 bits (143), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 69/281 (24%), Positives = 114/281 (40%), Gaps = 40/281 (14%) Query: 74 SVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKI 133 V ++N S+ ++ + + LQ G N V V GT S+ P + K Sbjct: 48 GVWLTNSPSKLYYDRKRISAAMQQLQHAGFNRVVPNVWSRGTTFHRSRFAPVEPPLQ-KA 106 Query: 134 GENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNS-TLSQQPASVYVQH 192 G G DP+ + E +RG+KV WF + G + +S + + P+ V + Sbjct: 107 GV--GLDPICTLAAEGRRRGIKVMPWF---------EYGLMEPADSAVVHENPSWVLAKA 155 Query: 193 --RDWIRTSGDRFV--LDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYF-----YTESP 243 + W+ G+ + L+P PEV+ +V E + R P+DG+Q DD+F + P Sbjct: 156 NGQRWMAMHGNHRMAWLNPAHPEVRARFIGLVVETLKRCPMDGLQLDDHFAWPVHFGYDP 215 Query: 244 GS-RLNDNETYRKYGGAFASK--ADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWR 300 + L ET G +++ WRRN L+ ++ +K +SP Sbjct: 216 TTLALYRQETGLAPPGDHSNRYWMKWRRNQLTSLLRELRQRLKQEGLSTRISLSPG---- 271 Query: 301 NRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIY 341 P S AY+ D W GL++ + Q Y Sbjct: 272 -----PFRS------AYNLWLQDWELWALGGLIEELVVQNY 301 >UniRef50_UPI0001AF05D8 hypothetical protein SghaA1_34850 n=1 Tax=Streptomyces ghanaensis ATCC 14672 RepID=UPI0001AF05D8 Length = 522 Score = 57.8 bits (138), Expect = 8e-07, Method: Compositional matrix adjust. Identities = 69/291 (23%), Positives = 109/291 (37%), Gaps = 53/291 (18%) Query: 75 VNISNPTSRARVQQQAMI-DKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKI 133 V+ NP Q A++ D LD + N + Q + + P +D I Sbjct: 43 VDAFNPGIFTPAQVAALVEDALD----VNANALIVQTARRYDCFCNNALYPRTD---AAI 95 Query: 134 GENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQH- 192 P YDPL+ ++ + H G++VHAW N VNT +T + P V+ QH Sbjct: 96 APEP-YDPLEEIVRQGHAAGLQVHAWVN-----VNTMWN-----RTTPPRSPEHVFNQHG 144 Query: 193 ------RDWI--RTSGDRFV-----LDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFY 239 W+ + G V +DPG P D+I V +V Y VDGV D Y Sbjct: 145 PGATGADRWLNKKADGQELVGANAYVDPGHPAAVDYIVRGVQSIVRNYDVDGVNLDYVRY 204 Query: 240 TESPGSRLNDNETYRKYGGAFASKA---------------DWRRNNTQQLIAKVSHTIKS 284 + + + + Y + A +A DWRR+ L+ K+ + Sbjct: 205 PDGSSTTTHSDWGYNEVSVARFQQATGRTDIPLPSDTAWSDWRRSQVTNLVRKIYLGVWE 264 Query: 285 IKPGVEFGVSPAGVWRNRSHDPLGSDT-RGAAAYDESYADTRRWVEQGLLD 334 + P + H P + Y E D W+++G++D Sbjct: 265 VDPQARLSMDAI----TYGHGPQAVGGWQATRTYAEVLQDWAGWLDEGIMD 311 >UniRef50_P74629 Sll0736 protein n=1 Tax=Synechocystis sp. PCC 6803 RepID=P74629_SYNY3 Length = 408 Score = 55.1 bits (131), Expect = 5e-06, Method: Compositional matrix adjust. Identities = 63/282 (22%), Positives = 109/282 (38%), Gaps = 76/282 (26%) Query: 52 SSQPMRGIWLATV-SRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQV 110 S +RG+WL V S + + PV + + L+ NT++ V Sbjct: 38 SPNKIRGVWLTNVDSNVLYDPVQ-----------------LKTAIADLKSTNFNTLYPTV 80 Query: 111 KPDGTALWPSKILPWSDLMTGKIGENPG-YDPLQFMLDEAHKRGMKVHAWF--------- 160 DG L+PS + + K E G D L +++ A ++ ++V WF Sbjct: 81 WNDGHTLYPSAVA--QQWLGKKQDEKLGDRDMLGEVINLAKEKSLRVIPWFEFGFMAPAE 138 Query: 161 ------NPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQD 214 +P+ ++ N++ TI T+ R L+P PEVQ Sbjct: 139 SDWVKAHPHWLTTNSQGETIWLEGGTIP-------------------RVWLNPLHPEVQQ 179 Query: 215 WITSIVAEVVSRYPVDGVQFDDYF---------------YTESPGS------RLNDNETY 253 IT+++ ++V RY VDG+Q DD+F Y + G L+ N+ Sbjct: 180 LITALLVDLVRRYDVDGIQLDDHFGYPYSFGYDPITVALYRQETGQEPLPVPELDLNQNC 239 Query: 254 RKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSP 295 + DWR + + + +K++KP + +SP Sbjct: 240 VSSDPIWQQWTDWRSAKISRYVQSLVPILKAVKPNLTISISP 281 >UniRef50_B4WJG2 Putative uncharacterized protein n=1 Tax=Synechococcus sp. PCC 7335 RepID=B4WJG2_9SYNE Length = 453 Score = 54.7 bits (130), Expect = 6e-06, Method: Compositional matrix adjust. Identities = 90/403 (22%), Positives = 135/403 (33%), Gaps = 130/403 (32%) Query: 12 IRRPAILVALALLLCSCKSTPPESMVTPPAGSKPP-------ATTQQSSQPMRGIWLATV 64 I+R I L++ + P S++ P G +P + + +RG+W Sbjct: 16 IKRTGIFCVAVLVVF--LTGPLGSLLVPSTGGEPTTLDHLVGSKSSSLDSEVRGVW---- 69 Query: 65 SRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSK--- 121 ++N S + ++ L + NTV+ V G ++ S Sbjct: 70 ------------VTNVASSVFFMPWGIASTIEQLADMRFNTVYPVVWNRGQTIYRSDRMK 117 Query: 122 ------ILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIR 175 I P LM +P DPL M+ H++G++V WF + G + Sbjct: 118 EITQRDISPLVGLM------HPREDPLAEMIRRGHQKGLRVIPWF---------EYGFMV 162 Query: 176 ELNSTLSQQPASVYVQHRDWI--------RTSGDRFV----------------------- 204 L S L+Q H DW+ R S D FV Sbjct: 163 PLQSRLAQA-------HPDWLTARADGSQRLSEDTFVNGPIEETPELETASESAMARSKR 215 Query: 205 ---------------LDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLND 249 L+P P VQ I +V EV + Y VDG+QFDD+F P D Sbjct: 216 LHRLLKSGAPSELGWLNPLHPNVQALILDLVDEVTTYYDVDGIQFDDHF--SFPIEFGYD 273 Query: 250 NETYRKYGGAFASK------AD-----WRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGV 298 T Y + AD WR + + +K F +SP Sbjct: 274 AFTVALYEAEHEGQLPPLDPADKDWIHWRAEKLSGFVNTLQKRVKETCSDCVFSLSP--- 330 Query: 299 WRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIY 341 +P + AY D + W E+G LD + QIY Sbjct: 331 ------NP------ASYAYQYYAQDWQTWAEKGWLDELVVQIY 361 >UniRef50_C6IEW4 Putative uncharacterized protein n=4 Tax=Bacteroidales RepID=C6IEW4_9BACE Length = 490 Score = 52.0 bits (123), Expect = 5e-05, Method: Compositional matrix adjust. Identities = 73/350 (20%), Positives = 131/350 (37%), Gaps = 74/350 (21%) Query: 6 RNKKLTIRRPAI---LVALALLLC-SCKSTPPESMVTPPAGSK--PPATTQQSSQPMRGI 59 ++ K+ I++ I + +A LC +C + +G + P +S+P R I Sbjct: 29 KSYKMNIKKNIIKTFMGGIAACLCMACGGNDSKDYWGDTSGGEDEEPTENPNASKP-RYI 87 Query: 60 WLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPD-GTALW 118 W+ + S NI+ + A+ G + V+P G L+ Sbjct: 88 WIDAAANFPDFANSKENIARDLALAK--------------DAGFTDIVVDVRPTTGDVLF 133 Query: 119 PSKILPWSDLMTGKIGEN-------PGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKP 171 + ++ M IG N +D LQ +DEA K+G+++HA N + Sbjct: 134 KTNLVDQVKFMYAWIGSNYTKVERTATWDYLQAFVDEARKQGLRIHAAINTFVGGNQIDG 193 Query: 172 GT---IRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYP 228 GT R+ + ++ V + TS +P PEVQ ++ ++ ++ Y Sbjct: 194 GTGLLYRDQSKAEWATQMNMQVGITSVMNTSESTKFFNPAHPEVQTFLCDLLKDLAG-YD 252 Query: 229 VDGV--------------------QFDDYFYTESPGSRLNDN-------------ETYRK 255 +DG+ QF++Y G R+ + TY K Sbjct: 253 LDGIFLDRGRFLNLQADFSEESRKQFEEYM----GGIRIQNYPNDILAPGASSLPATYPK 308 Query: 256 YGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHD 305 Y ++R + K +K +KPG++FGV G W + +D Sbjct: 309 Y---LTKWLEFRAKVIYDFMQKARTAVKGVKPGIKFGVYVGG-WYSTYYD 354 >UniRef50_A7LVF6 Putative uncharacterized protein n=4 Tax=Bacteroides RepID=A7LVF6_BACOV Length = 395 Score = 50.4 bits (119), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 42/163 (25%), Positives = 78/163 (47%), Gaps = 16/163 (9%) Query: 89 QAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKIL---------PWSDLMTG--KIGENP 137 Q + D + L L +N++F + ++ S +L S L++G K ++P Sbjct: 47 QGVKDFVKTLDELNMNSIFLVSYAETKTIYRSDVLMHYSTYKTQEESYLLSGYSKQYQSP 106 Query: 138 GYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDW-I 196 DP++ ++DEAHK +KV WF Y +P I N L++ P + + ++ Sbjct: 107 TNDPVRDLIDEAHKHDIKVFFWFE-YGFMGEGRP--ISPNNPLLAKNPHWLGIDNQQHPA 163 Query: 197 RTSGDRFVLDPGIPEVQDWITSIVAEVVSRYP-VDGVQFDDYF 238 + + + P VQ+++ ++ E + YP +DG+Q DD F Sbjct: 164 NYNQHDYYFNAYNPAVQNFLIELIEEALMLYPDLDGIQGDDRF 206 >UniRef50_Q2BFL2 Putative uncharacterized protein n=1 Tax=Bacillus sp. NRRL B-14911 RepID=Q2BFL2_9BACI Length = 813 Score = 49.7 bits (117), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 45/164 (27%), Positives = 70/164 (42%), Gaps = 40/164 (24%) Query: 95 LDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMT--GKIGENP----------GYDPL 142 LD ++ G N+V+ + T W I P S+ MT G ++P G D L Sbjct: 381 LDRMEEAGFNSVYLE-----TTFWGYTIYP-SETMTEYGLPAQHPNFRNADYGKYGSDLL 434 Query: 143 QFMLDEAHKRGMKVHAWFN-----------PYRVSVNTKPGTIRELNSTLSQQPASVYVQ 191 Q + E KRG+ V AW + P + V+ + I+ N+T +P Sbjct: 435 QAYIKEGKKRGISVQAWTDGFMIGHSSLGLPSQFQVHPEWAAIQRSNTTGEPKP------ 488 Query: 192 HRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFD 235 TS + + LD PEVQ ++ I E+ S+Y + G+ D Sbjct: 489 -----DTSSNYYWLDIAQPEVQTFMLDIYKEMQSKYDIKGLNID 527 >UniRef50_A8F7U2 Putative uncharacterized protein n=2 Tax=Thermotogaceae RepID=A8F7U2_THELT Length = 367 Score = 49.3 bits (116), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 73/316 (23%), Positives = 128/316 (40%), Gaps = 64/316 (20%) Query: 99 QRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHA 158 + +G ++ QV A + S+ILP ++ ++ +P +DPL+ ++D A G+K+ A Sbjct: 45 KEVGATRIYVQVVGRADAYYNSEILPKAETLSEC---SPDFDPLKEIIDLAKISGIKISA 101 Query: 159 WFNPY------------RVSVNTKPGTIRELNSTLSQQPASV--YVQHRDWIRTSGDRFV 204 W N + + VN P I T Q S+ Y + I T G Sbjct: 102 WMNVFYAWPFGKKPVSEKHVVNVHPDWI-----TYDQNGKSMLEYASSPE-INTPG--LF 153 Query: 205 LDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKA 264 L+P + +V+ ++++I E+ Y VD + D Y P +T+ + A Sbjct: 154 LEPALEDVKKFVSNIAEEIAKNYDVDEIHLD---YIRYP------YKTFGYHPDAMKIYR 204 Query: 265 DWRRNNTQQ-------------LIAKVSHTIKSIKPGVE-FGVS-PAGVWRNRSHDPLGS 309 +W + Q+ I +VS T+K I V +G A V+ D + Sbjct: 205 EWLKKAIQEKKLTNLGEGFDLFRIQQVSDTVKLIYEKVHNYGKKLSAAVFAYYEQDAISQ 264 Query: 310 DTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRL 369 +G +W+E LDY A + + +R Y V +A ++ Sbjct: 265 RLQGWL----------QWLEGEYLDY-ACLMAYENNRDTVEYYVK---YAVKALGAAEKI 310 Query: 370 YIGIAFYKVGE-PSKI 384 +G+ YK+ E P K+ Sbjct: 311 RVGLGAYKMTENPEKL 326 >UniRef50_UPI0001C1694B Protein of unknown function DUF187 n=1 Tax=Raphidiopsis brookii D9 RepID=UPI0001C1694B Length = 166 Score = 48.5 bits (114), Expect = 5e-04, Method: Compositional matrix adjust. Identities = 31/97 (31%), Positives = 46/97 (47%), Gaps = 13/97 (13%) Query: 201 DRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAF 260 DR L+P PEVQ ++ +++ E+V Y +DG+QFDD+F P D+ T Y Sbjct: 32 DRVWLNPFHPEVQKFMENLIVEIVRNYDIDGIQFDDHFGL--PSELGYDSYTVGLYKQEH 89 Query: 261 ASKA-----------DWRRNNTQQLIAKVSHTIKSIK 286 KA WR + L+ +V IK+ K Sbjct: 90 QGKAPSENFQDPEWVKWRADKITNLMKRVFFAIKANK 126 >UniRef50_C9KJL9 Alpha-galactosidase n=1 Tax=Mitsuokella multacida DSM 20544 RepID=C9KJL9_9FIRM Length = 749 Score = 45.1 bits (105), Expect = 0.005, Method: Compositional matrix adjust. Identities = 30/102 (29%), Positives = 47/102 (46%), Gaps = 25/102 (24%) Query: 142 LQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDW------ 195 LQ + + AHKRG+K WF P VSV++ +Y H DW Sbjct: 394 LQGVAESAHKRGLKFGLWFEPEMVSVDS-----------------DLYRAHPDWALRSPS 436 Query: 196 --IRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFD 235 + S + VLD +V+D+I V +++ P+D V++D Sbjct: 437 YPMTFSRHQLVLDLSRADVRDYIVDSVCKILDTAPIDYVKWD 478 >UniRef50_Q8YXF7 All1256 protein n=4 Tax=Nostocaceae RepID=Q8YXF7_ANASP Length = 500 Score = 44.3 bits (103), Expect = 0.009, Method: Compositional matrix adjust. Identities = 38/153 (24%), Positives = 65/153 (42%), Gaps = 10/153 (6%) Query: 95 LDHLQRLGINTVFFQVKPDGTALWPSKILP--WSDLMTGKIGENPGYDPLQFMLDEAHKR 152 +D + G N V+ +V DG L P+ P W ++ K E D L + + +R Sbjct: 119 MDRMVNRGYNEVYLEVFYDGRVLLPASANPTVWPSVIRTKGAEK--VDLLATAIQKGRQR 176 Query: 153 GMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEV 212 G+KV+ W Y + R+ +++ Q ++ +G + +DP + Sbjct: 177 GLKVYGWL--YTNNFGYNYALRRDREGAIARNGKG---QTSLYVVDNGSQVFIDPYNEQA 231 Query: 213 QDWITSIVAEVVSRYPVDGVQFDDYFYTESPGS 245 + +V E+V R P DG+ FD Y GS Sbjct: 232 KRDYYRMVQEIVRRRP-DGLLFDYVRYPRQAGS 263 >UniRef50_Q5N184 Putative uncharacterized protein n=2 Tax=Synechococcus elongatus RepID=Q5N184_SYNP6 Length = 481 Score = 43.9 bits (102), Expect = 0.011, Method: Compositional matrix adjust. Identities = 43/156 (27%), Positives = 67/156 (42%), Gaps = 16/156 (10%) Query: 95 LDHLQRLGINTVFFQVKPDGTALWPSKILP--WSDLMTGKIGENPGYDPLQFMLDEAHK- 151 D LQ LG N VF + DG L P+ P W ++ PG + + + + K Sbjct: 115 FDGLQALGYNEVFIETFYDGRVLLPAADNPTVWPSVVA-----EPGLERVDLLAEAIRKG 169 Query: 152 --RGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGI 209 RGM V+AW + + R+ TL++ S + I + G + +DP Sbjct: 170 RERGMSVYAWLFTLNYGYSYSQRSDRQ--DTLARNGRS---ESSLEIVSGGAQVFVDPFN 224 Query: 210 PEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGS 245 P + +++ V+SR P DGV FD Y G+ Sbjct: 225 PVARQDYQTLLRSVLSRRP-DGVLFDYVRYPRGTGA 259 >UniRef50_A7VPY5 Putative uncharacterized protein n=1 Tax=Clostridium leptum DSM 753 RepID=A7VPY5_9CLOT Length = 721 Score = 43.9 bits (102), Expect = 0.012, Method: Compositional matrix adjust. Identities = 31/91 (34%), Positives = 43/91 (47%), Gaps = 12/91 (13%) Query: 145 MLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFV 204 ++D AH +GMK W V N K +RE + P + D I +G R V Sbjct: 380 IIDRAHAKGMKFGLWMELEAVGRNAK---LRE------EHPDFIMRNGSDEI--AGGR-V 427 Query: 205 LDPGIPEVQDWITSIVAEVVSRYPVDGVQFD 235 LD PEV DW+ V ++SRY +D + D Sbjct: 428 LDLSKPEVADWVEEEVESLISRYQLDMFRID 458 >UniRef50_C6VY08 Putative uncharacterized protein n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6VY08_DYAFD Length = 557 Score = 43.5 bits (101), Expect = 0.014, Method: Compositional matrix adjust. Identities = 47/210 (22%), Positives = 81/210 (38%), Gaps = 34/210 (16%) Query: 161 NPYRVSVNTKPGTIRELNST---------LSQQPASVYVQHRDWIRTS--------GDRF 203 NPY N +R+ + S+ S++ H DW S D + Sbjct: 100 NPYLKDNNVLADIVRKCHEKSIKVIVRFDFSRVHESIFKAHPDWCYISPKGERIINTDMY 159 Query: 204 VLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASK 263 V+ P VQ+ I+ EV++ +P+DG+ + PG ++N N KY G ++ Sbjct: 160 VVSINAPYVQEKAFRIIEEVINTFPIDGI------FLNMPGYQVN-NPYEGKYHGIDQNE 212 Query: 264 ADWRR----NNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAA--AY 317 D +R + + L + + + +EF + W R H + S A Y Sbjct: 213 YDRKRFAEYSGGKALPVEENKADPLFQKYLEFKKATVEDWSERLHKLVKSKNEQIAICTY 272 Query: 318 DESYADTRRWVEQGLLDYIAPQIYWPFSRS 347 + + D R Q + YWP++ S Sbjct: 273 SDKFVDIIRHESQSM----TTLPYWPYTAS 298 >UniRef50_Q9P8N4 Alpha-galactosidase (Fragment) n=2 Tax=Lichtheimia corymbifera RepID=Q9P8N4_9FUNG Length = 729 Score = 43.5 bits (101), Expect = 0.016, Method: Compositional matrix adjust. Identities = 25/102 (24%), Positives = 48/102 (47%), Gaps = 25/102 (24%) Query: 142 LQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWI----- 196 L+ + D H GM+ WF P V+ N+ ++Y +H DW+ Sbjct: 383 LKPLADHVHDLGMQFGVWFEPESVNPNS-----------------NLYREHPDWVLYYDG 425 Query: 197 ---RTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFD 235 + ++ +L+ G+PEVQD+I V+ ++ +D +++D Sbjct: 426 VPRYEARNQLLLNLGLPEVQDYIYDRVSSIIEENDIDYIKWD 467 >UniRef50_D1BUC2 Putative uncharacterized protein n=1 Tax=Xylanimonas cellulosilytica DSM 15894 RepID=D1BUC2_XYLCX Length = 806 Score = 43.1 bits (100), Expect = 0.021, Method: Compositional matrix adjust. Identities = 35/152 (23%), Positives = 65/152 (42%), Gaps = 10/152 (6%) Query: 89 QAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDE 148 +A+ ++ + G+N V+ QV G ++PS + L + + GYD L Sbjct: 367 EAVEATVEAMASAGVNEVYLQVLSGGYTIYPSAVAVAHGLPAVRP-DLAGYDALAAWKSA 425 Query: 149 AHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSG-----DRF 203 A + G+++HAW + +V G + + Q P + V T+ + Sbjct: 426 ADENGIELHAWIDGLQVGNELGDG----IGPIVQQHPEWLAVDRAHAGTTTATPSFNGFY 481 Query: 204 VLDPGIPEVQDWITSIVAEVVSRYPVDGVQFD 235 LD P + ++ + E+VSRY + G+ D Sbjct: 482 WLDITDPVARQYMIDVTTEMVSRYDLAGLNHD 513 >UniRef50_C3QPV8 Glycoside hydrolase family 36 protein n=5 Tax=Bacteroides RepID=C3QPV8_9BACE Length = 735 Score = 43.1 bits (100), Expect = 0.022, Method: Compositional matrix adjust. Identities = 35/124 (28%), Positives = 58/124 (46%), Gaps = 29/124 (23%) Query: 145 MLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTS----- 199 +L+ A K G+K W P E+ +T S+ +Y +H DWI + Sbjct: 381 LLENARKNGVKFGIWIEP-------------EMANTTSE----LYEKHPDWILKAPNREL 423 Query: 200 -----GDRFVLDPGIPEVQDWITSIVAEVVSRYP-VDGVQFDDYFYTESPGSR-LNDNET 252 G + VLD P+VQD+I +V +++ YP +D +++D + GS L+DN+ Sbjct: 424 VLGRGGTQVVLDLANPKVQDFIFGMVDNLMTNYPEIDYIKWDANMSIMNHGSNYLSDNDQ 483 Query: 253 YRKY 256 Y Sbjct: 484 SHMY 487 >UniRef50_Q8AAL7 S-layer related protein, sialic acid-specific 9-O-acetylesterase n=9 Tax=Bacteroidales RepID=Q8AAL7_BACTN Length = 884 Score = 42.7 bits (99), Expect = 0.025, Method: Compositional matrix adjust. Identities = 54/250 (21%), Positives = 102/250 (40%), Gaps = 40/250 (16%) Query: 85 RVQQQAMIDK-LDHLQRLGINTVFFQVKP-DGTALWPSKILPWSDLMTGKIGENPGYDPL 142 R + ID L+ ++ LG ++P G L+ S+ P + K + +D L Sbjct: 489 RFSHKDSIDYYLEKIKSLGFTHAVVDIRPITGEVLYKSEYAP--QMKEWKGAKAGDFDYL 546 Query: 143 QFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDR 202 + + + H+ G+++HA N + N + + S + + VY + I + ++ Sbjct: 547 GYFIKKGHELGLEIHASLNVFCAGHNYFDRGM--VYSGHPEWASMVYTPDKGIIPITEEK 604 Query: 203 F----VLDPGIPEVQDWITSIVAEVVSRYP-VDGVQFDDYFY---------------TES 242 +++P E + I +++ EVV++YP +DG+ D Y E Sbjct: 605 HKYGAMINPLNEEYRTHILNVLKEVVTKYPDLDGLMLDRVRYDGITADFSSLSRKKFEEY 664 Query: 243 PGSRLND--NETYR-------KY----GGAFASKADWRRNNTQQLIAKVSHTIKSIKPGV 289 G ++ + + +R KY G F +WR N +A +K+ P V Sbjct: 665 IGKKVANFPEDIFRWTKNADGKYTTQPGKYFRKWLEWRTKNITDFMALARKEVKAANPDV 724 Query: 290 EFGVSPAGVW 299 FG + G W Sbjct: 725 SFG-TYTGAW 733 >UniRef50_A6L961 Glycoside hydrolase family 36, candidate alpha-glycosidase n=6 Tax=Bacteroidales RepID=A6L961_PARD8 Length = 735 Score = 42.4 bits (98), Expect = 0.038, Method: Compositional matrix adjust. Identities = 34/134 (25%), Positives = 62/134 (46%), Gaps = 32/134 (23%) Query: 142 LQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTS-- 199 ++ ++ +A+K G+K W P E+ +T S+ +Y +H +W+ + Sbjct: 378 IEGLIADANKHGIKFGIWIEP-------------EMANTTSE----LYEKHPEWVLKAPN 420 Query: 200 --------GDRFVLDPGIPEVQDWITSIVAEVVSRYP-VDGVQFDDYFYTESPGSRL--- 247 G + VLD PEVQD+I IV +++ YP +D +++D + GS+ Sbjct: 421 REIVLGRGGTQVVLDLSNPEVQDFIFGIVDNLMTTYPEIDYIKWDANMSILNHGSQYLPS 480 Query: 248 -NDNETYRKYGGAF 260 + Y +Y G F Sbjct: 481 DQQSHMYIEYHGGF 494 >UniRef50_Q114S3 Putative uncharacterized protein n=2 Tax=Oscillatoriales RepID=Q114S3_TRIEI Length = 508 Score = 42.0 bits (97), Expect = 0.051, Method: Compositional matrix adjust. Identities = 45/168 (26%), Positives = 75/168 (44%), Gaps = 21/168 (12%) Query: 92 IDK-LDHLQRLGINTVFFQVKPDGTALWPSKILP--WSDLMTGKIGENPGYDPLQFMLD- 147 ID+ LD + G N V+ + DG L P+ P W ++ PGY+ + + D Sbjct: 116 IDRILDKIVNQGYNQVYIEAFYDGQVLLPAANNPTVWPSIL-----RVPGYENVDLLADS 170 Query: 148 --EAHKRGMKVHAWF--NPYRVSVNTKPGTIREL------NSTLSQQPASVYVQHRDWIR 197 +A +RG++ +AW + + + P + L +TL P +V +Q++ Sbjct: 171 LKKAKERGLRAYAWVFTMNFGYTYSQLPNRQQALARNGRGQTTLDVIPDNVSLQNQLGAS 230 Query: 198 TSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGS 245 + F+ DP P+ + +V EV+ R P GV FD Y GS Sbjct: 231 HAFHTFI-DPYSPQARQDYNVMVNEVLKRQP-QGVLFDYIRYLRGMGS 276 >UniRef50_B8HXB6 Putative uncharacterized protein n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HXB6_CYAP4 Length = 528 Score = 41.2 bits (95), Expect = 0.086, Method: Compositional matrix adjust. Identities = 55/229 (24%), Positives = 90/229 (39%), Gaps = 24/229 (10%) Query: 39 PPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHL 98 P A S+ A + +Q +R T WP V + + +Q + LD + Sbjct: 90 PAAQSQYQALVAEQAQSLRQCRSQT-----WPQVKGIWLQ--LFACDLQPGVLESVLDRI 142 Query: 99 QRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHA 158 G N ++ Q DG L P+ P L + D L ++ + +RG++V+A Sbjct: 143 VSQGYNRIYVQTFYDGQVLLPANRNPTPWLAVAQGSAFADRDLLAEVIQKGRERGLRVYA 202 Query: 159 WFNPYRVSVNTKPGTIRELNSTLSQ---QPASVYVQHRDWIRTSGDRF-----VLDPGIP 210 W + ++ + R+ TL Q QPA+ V H +G F +DP P Sbjct: 203 WVSG--MNYGSSYAQRRDRQQTLVQNGRQPATTPVGH------TGQGFEQTAIFIDPYHP 254 Query: 211 EVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGA 259 ++ ++ V+ R P DGV D Y L + + YG A Sbjct: 255 RTREDFQLMLQAVLQRQP-DGVLIDYLRYPRQSNPVLTEVKDLWIYGPA 302 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P64427 UPF0748 lipoprotein yddW n=88 Tax=Enterobacteria... 622 e-177 UniRef50_C1M4K1 Lipoprotein YddW n=4 Tax=Enterobacteriaceae RepI... 547 e-154 UniRef50_C2DR58 Lipoprotein yddW n=6 Tax=Enterobacteriaceae RepI... 501 e-140 UniRef50_A6WZY4 Putative uncharacterized protein n=10 Tax=Brucel... 468 e-130 UniRef50_Q48C14 YngK protein n=53 Tax=Proteobacteria RepID=Q48C1... 467 e-130 UniRef50_C9XP71 Cell surface protein n=6 Tax=Clostridium RepID=C... 452 e-125 UniRef50_A6TUC1 Putative uncharacterized protein n=5 Tax=Bacteri... 449 e-125 UniRef50_C6J3R7 Putative uncharacterized protein n=1 Tax=Paeniba... 448 e-124 UniRef50_O35015 UPF0748 protein yngK n=11 Tax=Bacteria RepID=YNG... 446 e-124 UniRef50_A7Z5C7 YngK n=5 Tax=Bacteria RepID=A7Z5C7_BACA2 442 e-122 UniRef50_Q81DH4 FenI n=65 Tax=Bacteria RepID=Q81DH4_BACCR 440 e-122 UniRef50_D1A3Q7 Putative uncharacterized protein n=1 Tax=Thermom... 435 e-120 UniRef50_C0Z8S4 Putative uncharacterized protein n=1 Tax=Breviba... 428 e-118 UniRef50_D1S7M0 Putative uncharacterized protein n=1 Tax=Micromo... 428 e-118 UniRef50_UPI000178945D protein of unknown function DUF187 n=1 Ta... 427 e-118 UniRef50_D2AWT1 FenI protein n=1 Tax=Streptosporangium roseum DS... 426 e-118 UniRef50_C4RBZ7 FenI protein n=10 Tax=Actinomycetales RepID=C4RB... 423 e-117 UniRef50_C5C4P8 Putative uncharacterized protein n=4 Tax=Bacteri... 422 e-116 UniRef50_B8I4Q9 Putative uncharacterized protein n=2 Tax=Bacteri... 420 e-116 UniRef50_UPI00016A6D2C fenI protein n=1 Tax=Burkholderia oklahom... 420 e-116 UniRef50_Q47Q17 FenI protein n=9 Tax=Bacteria RepID=Q47Q17_THEFY 420 e-116 UniRef50_A1V3X0 FenI protein n=36 Tax=Bacteria RepID=A1V3X0_BURMS 419 e-116 UniRef50_A8MM80 Putative uncharacterized protein n=1 Tax=Alkalip... 415 e-114 UniRef50_A3HZ09 FenI n=1 Tax=Algoriphagus sp. PR1 RepID=A3HZ09_9... 413 e-114 UniRef50_D1AYL2 Putative uncharacterized protein n=1 Tax=Strepto... 413 e-114 UniRef50_C7IM14 Putative uncharacterized protein n=1 Tax=Clostri... 410 e-113 UniRef50_D2AR89 FenI protein n=9 Tax=Bacteria RepID=D2AR89_STRRD 408 e-112 UniRef50_C5PKN1 Possible FenI n=1 Tax=Sphingobacterium spiritivo... 406 e-111 UniRef50_A5FI17 Putative uncharacterized protein n=1 Tax=Flavoba... 405 e-111 UniRef50_D2QEX0 Putative uncharacterized protein n=2 Tax=Flexiba... 404 e-111 UniRef50_A4ASW6 FenI n=1 Tax=Flavobacteriales bacterium HTCC2170... 404 e-111 UniRef50_C6XWP7 Putative uncharacterized protein n=1 Tax=Pedobac... 394 e-108 UniRef50_A1ZQ43 YngK protein n=1 Tax=Microscilla marina ATCC 231... 392 e-108 UniRef50_UPI00016C0313 cell surface protein n=1 Tax=Epulopiscium... 392 e-107 UniRef50_B1HPQ3 Hypothetical lipoprotein yddW n=2 Tax=Bacillacea... 391 e-107 UniRef50_C7PIN2 Putative uncharacterized protein n=1 Tax=Chitino... 390 e-107 UniRef50_A6NVH8 Putative uncharacterized protein n=1 Tax=Bactero... 389 e-106 UniRef50_A5FAG6 Putative uncharacterized protein n=1 Tax=Flavoba... 384 e-105 UniRef50_A9NEW1 Putative uncharacterized protein n=1 Tax=Acholep... 382 e-104 UniRef50_C9L341 YngK protein n=45 Tax=Bacteroidales RepID=C9L341... 381 e-104 UniRef50_UPI00016C4E90 hypothetical protein GobsU_27726 n=1 Tax=... 377 e-103 UniRef50_D0GIS1 YngK n=16 Tax=Bacteria RepID=D0GIS1_9FUSO 376 e-102 UniRef50_B4VZ35 Putative uncharacterized protein n=1 Tax=Microco... 374 e-102 UniRef50_D1N426 Putative uncharacterized protein n=1 Tax=Victiva... 374 e-102 UniRef50_C3XYE7 Putative uncharacterized protein n=2 Tax=Branchi... 373 e-102 UniRef50_Q8YW40 All1776 protein n=5 Tax=Nostocaceae RepID=Q8YW40... 369 e-100 UniRef50_C1A9I5 Putative uncharacterized protein n=1 Tax=Gemmati... 368 e-100 UniRef50_B4D6Q1 Putative uncharacterized protein n=2 Tax=Verruco... 367 e-100 UniRef50_Q7MXU6 YngK protein n=4 Tax=Porphyromonadaceae RepID=Q7... 367 e-100 UniRef50_UPI0001C160EA conserved hypothetical protein n=2 Tax=No... 365 2e-99 UniRef50_A9NEW0 Putative uncharacterized protein n=1 Tax=Acholep... 362 2e-98 UniRef50_Q110S6 Putative uncharacterized protein n=5 Tax=Bacteri... 359 1e-97 UniRef50_B9XM08 Putative uncharacterized protein n=2 Tax=bacteri... 354 4e-96 UniRef50_B0NT08 Putative uncharacterized protein n=2 Tax=Bactero... 351 3e-95 UniRef50_A6G0M0 Putative uncharacterized protein n=1 Tax=Plesioc... 351 3e-95 UniRef50_B7AM83 Putative uncharacterized protein n=1 Tax=Bactero... 351 3e-95 UniRef50_C6XWM5 Putative uncharacterized protein n=1 Tax=Pedobac... 345 1e-93 UniRef50_C1A7Q3 Putative uncharacterized protein n=1 Tax=Gemmati... 345 2e-93 UniRef50_C9PUA7 FenI protein n=2 Tax=Prevotella RepID=C9PUA7_9BACT 344 4e-93 UniRef50_A0M6M5 Protein containing DUF187 n=4 Tax=Bacteroidetes ... 344 5e-93 UniRef50_C0YRL9 FenI family protein n=3 Tax=Bacteroidetes RepID=... 342 2e-92 UniRef50_A7VTI3 Putative uncharacterized protein n=1 Tax=Clostri... 337 4e-91 UniRef50_C1I7D2 Putative uncharacterized protein n=1 Tax=Clostri... 337 5e-91 UniRef50_C9LEC6 YngK protein n=1 Tax=Prevotella tannerae ATCC 51... 335 1e-90 UniRef50_C3R8E6 S-layer protein n=24 Tax=Bacteroides RepID=C3R8E... 334 3e-90 UniRef50_A6L917 Putative uncharacterized protein n=5 Tax=Bactero... 334 4e-90 UniRef50_B2ULM6 Putative uncharacterized protein n=1 Tax=Akkerma... 331 3e-89 UniRef50_A9KK48 Putative uncharacterized protein n=1 Tax=Clostri... 331 4e-89 UniRef50_C0EGV5 Putative uncharacterized protein n=1 Tax=Clostri... 330 4e-89 UniRef50_Q7MWV9 YngK protein n=2 Tax=Porphyromonas gingivalis Re... 330 5e-89 UniRef50_C0EWT6 Putative uncharacterized protein (Fragment) n=1 ... 330 7e-89 UniRef50_C3QJ47 S-layer protein n=5 Tax=Bacteroides RepID=C3QJ47... 329 1e-88 UniRef50_UPI0001745532 hypothetical protein VspiD_00105 n=1 Tax=... 329 1e-88 UniRef50_B0P7J3 Putative uncharacterized protein n=1 Tax=Anaerot... 325 1e-87 UniRef50_A6EKL7 Putative uncharacterized protein (Fragment) n=1 ... 324 4e-87 UniRef50_C3J8B5 YngK protein n=2 Tax=Bacteria RepID=C3J8B5_9PORP 320 5e-86 UniRef50_C2M9G1 YngK protein n=1 Tax=Porphyromonas uenonis 60-3 ... 309 1e-82 UniRef50_C1Q9T9 Uncharacterized conserved protein n=3 Tax=Brachy... 308 2e-82 UniRef50_B9Y560 Putative uncharacterized protein n=1 Tax=Holdema... 302 3e-80 UniRef50_B6YR88 Putative uncharacterized protein n=1 Tax=Candida... 300 5e-80 UniRef50_B0MQ11 Putative uncharacterized protein n=1 Tax=Eubacte... 299 2e-79 UniRef50_C9PZF4 YngK protein n=5 Tax=Prevotella RepID=C9PZF4_9BACT 298 2e-79 UniRef50_B3QYB7 Putative uncharacterized protein n=1 Tax=Chloroh... 297 7e-79 UniRef50_B0NXH7 Putative uncharacterized protein n=3 Tax=Clostri... 295 3e-78 UniRef50_A9NEM7 Hypothetical surface-anchored protein n=2 Tax=Ac... 290 5e-77 UniRef50_D1PA22 YngK protein n=1 Tax=Prevotella copri DSM 18205 ... 288 2e-76 UniRef50_C7H8A9 FenI protein n=2 Tax=Faecalibacterium prausnitzi... 285 3e-75 UniRef50_C5VL52 YngK protein n=3 Tax=Prevotella RepID=C5VL52_9BACT 281 4e-74 UniRef50_C4FZ05 Putative uncharacterized protein n=1 Tax=Abiotro... 278 3e-73 UniRef50_UPI0001C37647 hypothetical protein RflaF_08645 n=1 Tax=... 277 8e-73 UniRef50_D1PRQ4 FenI protein n=1 Tax=Subdoligranulum variabile D... 273 1e-71 UniRef50_C2L0K0 Lipoprotein yddW n=1 Tax=Oribacterium sinus F026... 254 5e-66 UniRef50_A6DH63 Putative uncharacterized protein n=1 Tax=Lentisp... 235 4e-60 UniRef50_B4AVG6 Putative uncharacterized protein n=1 Tax=Cyanoth... 232 2e-59 UniRef50_B4VPG3 Putative uncharacterized protein n=1 Tax=Microco... 229 2e-58 UniRef50_B4WH89 Putative uncharacterized protein n=1 Tax=Synecho... 228 4e-58 UniRef50_Q8YXK2 All1210 protein n=4 Tax=Nostocaceae RepID=Q8YXK2... 228 4e-58 UniRef50_B5WA73 Putative uncharacterized protein n=2 Tax=Arthros... 224 7e-57 UniRef50_A8YI06 Similar to tr|Q8YPV9|Q8YPV9 n=8 Tax=Chroococcale... 222 2e-56 UniRef50_P74735 Slr0592 protein n=1 Tax=Synechocystis sp. PCC 68... 220 7e-56 UniRef50_Q8YV65 All2116 protein n=15 Tax=Cyanobacteria RepID=Q8Y... 219 1e-55 UniRef50_Q8YQA0 All3933 protein n=18 Tax=Cyanobacteria RepID=Q8Y... 216 1e-54 UniRef50_Q7NL32 Glr1294 protein n=1 Tax=Gloeobacter violaceus Re... 215 4e-54 UniRef50_B2IV00 Putative uncharacterized protein n=4 Tax=Cyanoba... 215 4e-54 UniRef50_C2FS67 FenI family protein n=1 Tax=Sphingobacterium spi... 214 5e-54 UniRef50_Q8YLM8 Alr5270 protein n=12 Tax=Cyanobacteria RepID=Q8Y... 213 1e-53 UniRef50_B4VTS6 Putative uncharacterized protein n=1 Tax=Microco... 212 3e-53 UniRef50_A0YRE2 Putative uncharacterized protein n=1 Tax=Lyngbya... 208 4e-52 UniRef50_B9XI64 Putative uncharacterized protein n=1 Tax=bacteri... 207 5e-52 UniRef50_B7JXY5 Putative uncharacterized protein n=9 Tax=Cyanoba... 206 1e-51 UniRef50_Q7NJN0 Glr1802 protein n=1 Tax=Gloeobacter violaceus Re... 204 4e-51 UniRef50_Q10YX0 Putative uncharacterized protein n=2 Tax=Cyanoba... 204 4e-51 UniRef50_Q8EPF4 Hypothetical conserved protein n=1 Tax=Oceanobac... 204 5e-51 UniRef50_C1D2P2 Putative uncharacterized protein n=2 Tax=Deinoco... 203 9e-51 UniRef50_B1WZU0 Putative uncharacterized protein n=2 Tax=Cyanoth... 203 1e-50 UniRef50_A0YS74 Putative uncharacterized protein n=2 Tax=Oscilla... 201 6e-50 UniRef50_B4WJG2 Putative uncharacterized protein n=1 Tax=Synecho... 199 2e-49 UniRef50_C7GZF2 Putative lipoprotein n=1 Tax=Eubacterium saphenu... 198 5e-49 UniRef50_A8YDR3 Genome sequencing data, contig C294 n=9 Tax=Chro... 195 3e-48 UniRef50_B8HYQ9 Putative uncharacterized protein n=1 Tax=Cyanoth... 194 4e-48 UniRef50_Q2JQ39 Putative uncharacterized protein n=1 Tax=Synecho... 187 8e-46 UniRef50_C1D298 Putative uncharacterized protein n=1 Tax=Deinoco... 186 1e-45 UniRef50_B0MQ12 Putative uncharacterized protein n=1 Tax=Eubacte... 185 2e-45 UniRef50_UPI0001C16380 Protein of unknown function DUF187 n=1 Ta... 179 2e-43 UniRef50_C6PCP2 Putative uncharacterized protein n=1 Tax=Thermoa... 178 3e-43 UniRef50_Q1IWF6 Putative uncharacterized protein n=3 Tax=Deinoco... 178 4e-43 UniRef50_B5W1E7 Putative uncharacterized protein n=2 Tax=Arthros... 178 6e-43 UniRef50_UPI0001AF05D8 hypothetical protein SghaA1_34850 n=1 Tax... 176 2e-42 UniRef50_A2C8D8 DUF187 n=12 Tax=Cyanobacteria RepID=A2C8D8_PROM3 175 3e-42 UniRef50_C5CIL6 Putative uncharacterized protein n=1 Tax=Kosmoto... 175 3e-42 UniRef50_Q3AJ74 Putative uncharacterized protein n=3 Tax=Chrooco... 170 8e-41 UniRef50_A6CAJ3 Putative uncharacterized protein n=1 Tax=Plancto... 168 5e-40 UniRef50_A7LVF6 Putative uncharacterized protein n=4 Tax=Bactero... 167 7e-40 UniRef50_C2FS66 Putative uncharacterized protein n=1 Tax=Sphingo... 167 1e-39 UniRef50_B0P7J4 Putative uncharacterized protein n=1 Tax=Anaerot... 166 2e-39 UniRef50_P74629 Sll0736 protein n=1 Tax=Synechocystis sp. PCC 68... 165 2e-39 UniRef50_B0VF99 Putative uncharacterized protein n=1 Tax=Candida... 165 4e-39 UniRef50_A8F7U2 Putative uncharacterized protein n=2 Tax=Thermot... 159 2e-37 UniRef50_C6IEW4 Putative uncharacterized protein n=4 Tax=Bactero... 152 3e-35 UniRef50_B0PF61 Putative uncharacterized protein n=1 Tax=Anaerot... 149 2e-34 UniRef50_Q6ZE96 Slr7102 protein n=5 Tax=Cyanobacteria RepID=Q6ZE... 148 5e-34 UniRef50_C3R3M7 Putative uncharacterized protein n=2 Tax=Bactero... 140 9e-32 UniRef50_Q2BFL2 Putative uncharacterized protein n=1 Tax=Bacillu... 116 2e-24 UniRef50_C3A5Y1 Putative uncharacterized protein n=1 Tax=Bacillu... 111 5e-23 UniRef50_Q8YK50 All8067 protein n=8 Tax=Cyanobacteria RepID=Q8YK... 110 7e-23 UniRef50_UPI0001C1694B Protein of unknown function DUF187 n=1 Ta... 106 2e-21 UniRef50_Q6AHL3 Putative uncharacterized protein n=1 Tax=Leifson... 100 2e-19 UniRef50_C7E4U8 Putative uncharacterized protein psa8 n=1 Tax=Pa... 84 1e-14 Sequences not found previously or not previously below threshold: UniRef50_A8F3E2 Putative uncharacterized protein n=1 Tax=Thermot... 162 2e-38 UniRef50_A7LVF0 Putative uncharacterized protein n=3 Tax=Bactero... 114 8e-24 UniRef50_D1BUC2 Putative uncharacterized protein n=1 Tax=Xylanim... 114 9e-24 UniRef50_Q8AAL7 S-layer related protein, sialic acid-specific 9-... 112 2e-23 UniRef50_P35824 S-layer-related protein n=1 Tax=Bacillus circula... 104 8e-21 UniRef50_UPI0001789939 S-layer domain protein n=1 Tax=Geobacillu... 99 4e-19 UniRef50_D1PX02 Putative uncharacterized protein n=2 Tax=Prevote... 98 8e-19 UniRef50_C6IVH6 S-layer domain-containing protein n=1 Tax=Paenib... 96 2e-18 UniRef50_C3Y3M5 Putative uncharacterized protein n=3 Tax=Branchi... 96 4e-18 UniRef50_UPI0001BC8648 hypothetical protein BacD2_02792 n=1 Tax=... 94 8e-18 UniRef50_C3R3K8 S-layer protein n=4 Tax=Bacteroides RepID=C3R3K8... 94 9e-18 UniRef50_A9KMJ8 Putative uncharacterized protein n=1 Tax=Clostri... 89 3e-16 UniRef50_A9KIP9 Putative uncharacterized protein n=1 Tax=Clostri... 86 2e-15 UniRef50_C5A3T6 Glycosyl hydrolase, putative n=1 Tax=Thermococcu... 84 1e-14 UniRef50_A8F7H1 Putative uncharacterized protein n=1 Tax=Thermot... 82 4e-14 UniRef50_A8F7H3 Putative uncharacterized protein n=1 Tax=Thermot... 79 3e-13 UniRef50_D1JA21 Conserved hypothetical membrane protein, DUF187 ... 77 1e-12 UniRef50_C6AUQ1 Putative uncharacterized protein n=2 Tax=Rhizobi... 77 1e-12 UniRef50_Q8YXF7 All1256 protein n=4 Tax=Nostocaceae RepID=Q8YXF7... 77 2e-12 UniRef50_B8HXB6 Putative uncharacterized protein n=1 Tax=Cyanoth... 77 2e-12 UniRef50_Q5N184 Putative uncharacterized protein n=2 Tax=Synecho... 76 3e-12 UniRef50_Q2RYS7 Tat (Twin-arginine translocation) pathway signal... 75 6e-12 UniRef50_Q3B486 Xylanase/chitin deacetylase-like n=2 Tax=Chlorob... 74 1e-11 UniRef50_Q67N80 Putative uncharacterized protein n=1 Tax=Symbiob... 73 2e-11 UniRef50_B0C6V7 Putative uncharacterized protein n=3 Tax=Cyanoba... 73 2e-11 UniRef50_B4D7E2 Putative uncharacterized protein n=1 Tax=Chthoni... 72 3e-11 UniRef50_Q11AV5 Putative uncharacterized protein n=1 Tax=Chelati... 72 3e-11 UniRef50_C1YLJ5 Uncharacterized conserved protein n=4 Tax=Bacter... 72 6e-11 UniRef50_D0MH73 Putative uncharacterized protein n=1 Tax=Rhodoth... 71 8e-11 UniRef50_D1JE50 Hypothetical secreted protein n=1 Tax=uncultured... 71 9e-11 UniRef50_Q114S3 Putative uncharacterized protein n=2 Tax=Oscilla... 71 1e-10 UniRef50_A1S0G8 Putative uncharacterized protein n=1 Tax=Thermof... 70 2e-10 UniRef50_B1XJ85 Putative uncharacterized protein n=3 Tax=Chrooco... 70 2e-10 UniRef50_A4AQ95 Putative uncharacterized protein n=2 Tax=Bactero... 69 3e-10 UniRef50_A5FIA1 Hypothetical lipoprotein n=2 Tax=Flavobacteriace... 69 3e-10 UniRef50_A0Z097 Putative uncharacterized protein n=2 Tax=Oscilla... 68 6e-10 UniRef50_C7PH83 Putative uncharacterized protein n=3 Tax=Sphingo... 67 1e-09 UniRef50_B9XFU7 Putative uncharacterized protein n=1 Tax=bacteri... 67 1e-09 UniRef50_A9WDB3 Putative uncharacterized protein n=5 Tax=Chlorof... 67 1e-09 UniRef50_B1WYP8 Putative uncharacterized protein n=4 Tax=Chrooco... 67 2e-09 UniRef50_UPI0001C391E2 hypothetical protein AplaP_16720 n=1 Tax=... 66 4e-09 UniRef50_B9YV30 Putative uncharacterized protein n=1 Tax='Nostoc... 66 4e-09 UniRef50_A0PYZ0 Putative uncharacterized protein n=3 Tax=Clostri... 65 4e-09 UniRef50_A6LHH0 Putative uncharacterized protein n=6 Tax=Bactero... 65 5e-09 UniRef50_UPI0001B9ED67 hypothetical protein GYMC10_3557 n=1 Tax=... 64 9e-09 UniRef50_D0LIN5 Putative uncharacterized protein n=1 Tax=Haliang... 64 1e-08 UniRef50_A3ZTB2 Putative uncharacterized protein n=1 Tax=Blastop... 64 1e-08 UniRef50_Q2JK94 Putative uncharacterized protein n=2 Tax=Synecho... 64 2e-08 UniRef50_UPI0001746A87 hypothetical protein VspiD_03245 n=1 Tax=... 62 3e-08 UniRef50_A1HM88 Polysaccharide deacetylase n=1 Tax=Thermosinus c... 62 3e-08 UniRef50_C7HH08 GTP-binding protein n=3 Tax=Clostridium thermoce... 62 4e-08 UniRef50_B8I5J6 Putative uncharacterized protein n=2 Tax=Clostri... 62 6e-08 UniRef50_D1CHN7 GTP-binding protein n=1 Tax=Thermobaculum terren... 61 7e-08 UniRef50_A4IKZ2 Alpha-amylase family protein n=12 Tax=Bacillacea... 61 9e-08 UniRef50_O26457 Conserved protein n=1 Tax=Methanothermobacter th... 61 1e-07 UniRef50_Q30RN5 Putative uncharacterized protein n=1 Tax=Sulfuri... 60 2e-07 UniRef50_B4D3R6 Putative uncharacterized protein n=1 Tax=Chthoni... 60 2e-07 UniRef50_B6W970 Putative uncharacterized protein n=1 Tax=Anaeroc... 60 2e-07 UniRef50_C7RFM8 Glycoside hydrolase clan GH-D n=34 Tax=Bacteria ... 60 2e-07 UniRef50_B3TAU5 Putative uncharacterized protein n=1 Tax=uncultu... 60 2e-07 UniRef50_A4FBJ1 Putative uncharacterized protein n=1 Tax=Sacchar... 59 2e-07 UniRef50_Q3A0T1 Putative uncharacterized protein n=1 Tax=Pelobac... 59 3e-07 UniRef50_Q5I942 Alpha-amylase n=1 Tax=Anaerobranca gottschalkii ... 59 3e-07 UniRef50_B6KGT3 1,4-alpha-glucan branching enzyme, putative n=5 ... 59 3e-07 UniRef50_C0A7S7 Putative uncharacterized protein n=1 Tax=Opituta... 59 4e-07 UniRef50_A9B0X0 Putative uncharacterized protein n=1 Tax=Herpeto... 59 4e-07 UniRef50_A4BGI0 Alpha-galactosidase n=1 Tax=Reinekea blandensis ... 58 5e-07 UniRef50_C9KJL9 Alpha-galactosidase n=1 Tax=Mitsuokella multacid... 58 6e-07 UniRef50_UPI00019691CD hypothetical protein BACCELL_01336 n=1 Ta... 58 6e-07 UniRef50_Q3A0V9 Putative uncharacterized protein n=1 Tax=Pelobac... 58 6e-07 UniRef50_Q5WAP8 Maltogenic amylase n=1 Tax=Bacillus clausii KSM-... 58 7e-07 UniRef50_A5UJP6 Putative cysteine protease (Transglutaminase-lik... 58 7e-07 UniRef50_C2KVT1 Putative uncharacterized protein n=1 Tax=Oribact... 58 7e-07 UniRef50_A6CFN7 Putative uncharacterized protein n=1 Tax=Plancto... 58 9e-07 UniRef50_D1AEP8 Putative uncharacterized protein n=1 Tax=Thermom... 57 1e-06 UniRef50_C0A376 Putative uncharacterized protein n=1 Tax=Opituta... 57 1e-06 UniRef50_UPI000178A7B6 hypothetical protein GYMC10_3553 n=1 Tax=... 57 2e-06 UniRef50_UPI00006CC013 Alpha amylase, catalytic domain containin... 57 2e-06 UniRef50_C1SH35 Putative uncharacterized protein n=1 Tax=Denitro... 56 3e-06 UniRef50_B7J5F9 GTP-binding protein n=2 Tax=Acidithiobacillus fe... 56 3e-06 UniRef50_C6VY08 Putative uncharacterized protein n=1 Tax=Dyadoba... 56 3e-06 UniRef50_C7HUF6 Sugar fermentation stimulation protein n=4 Tax=A... 56 3e-06 UniRef50_D2R498 Putative uncharacterized protein n=1 Tax=Pirellu... 56 4e-06 UniRef50_D2QE70 Alpha amylase catalytic region n=1 Tax=Spirosoma... 55 4e-06 UniRef50_B3DUS3 Trehalose synthase n=3 Tax=Bacteria RepID=B3DUS3... 55 4e-06 UniRef50_A6C749 Putative uncharacterized protein n=1 Tax=Plancto... 55 4e-06 UniRef50_UPI000197B402 hypothetical protein BACCOPRO_02222 n=1 T... 55 5e-06 UniRef50_Q9P8N4 Alpha-galactosidase (Fragment) n=2 Tax=Lichtheim... 55 5e-06 UniRef50_A6L961 Glycoside hydrolase family 36, candidate alpha-g... 55 5e-06 UniRef50_Q4L9B3 Similar to unknown protein n=4 Tax=Bacilli RepID... 55 6e-06 UniRef50_B0XSJ0 Alpha-glucosidase/alpha-amylase, putative n=4 Ta... 55 7e-06 UniRef50_C6CVL0 Alpha amylase catalytic region n=1 Tax=Paenibaci... 55 7e-06 UniRef50_B3JE63 Putative uncharacterized protein n=4 Tax=Bactero... 54 8e-06 UniRef50_A6C7C5 Putative uncharacterized protein n=1 Tax=Plancto... 54 9e-06 UniRef50_D0AJB4 Predicted protein n=3 Tax=cellular organisms Rep... 54 9e-06 UniRef50_A8RX71 Putative uncharacterized protein n=4 Tax=Clostri... 54 1e-05 UniRef50_Q02D31 Putative uncharacterized protein n=1 Tax=Candida... 54 1e-05 UniRef50_Q6VUG7 Dextranase 1 n=3 Tax=Paenibacillus RepID=Q6VUG7_... 54 1e-05 UniRef50_Q3JIJ6 Conserved domain protein n=67 Tax=Betaproteobact... 54 1e-05 UniRef50_C7M6E9 Glycoside hydrolase family 31 n=10 Tax=Bacteroid... 54 1e-05 UniRef50_Q5LE31 Putative uncharacterized protein n=14 Tax=Bacter... 54 1e-05 UniRef50_D2QQI8 Trehalose synthase n=1 Tax=Spirosoma linguale DS... 54 1e-05 >UniRef50_P64427 UPF0748 lipoprotein yddW n=88 Tax=Enterobacteriaceae RepID=YDDW_ECO57 Length = 439 Score = 622 bits (1605), Expect = e-177, Method: Composition-based stats. Identities = 439/439 (100%), Positives = 439/439 (100%) Query: 1 MDICSRNKKLTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIW 60 MDICSRNKKLTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIW Sbjct: 1 MDICSRNKKLTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIW 60 Query: 61 LATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPS 120 LATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPS Sbjct: 61 LATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPS 120 Query: 121 KILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNST 180 KILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNST Sbjct: 121 KILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNST 180 Query: 181 LSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYT 240 LSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYT Sbjct: 181 LSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYT 240 Query: 241 ESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWR 300 ESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWR Sbjct: 241 ESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWR 300 Query: 301 NRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWAD 360 NRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWAD Sbjct: 301 NRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWAD 360 Query: 361 VVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDY 420 VVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDY Sbjct: 361 VVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDY 420 Query: 421 LNKPQTQQAVSYLQSRWGS 439 LNKPQTQQAVSYLQSRWGS Sbjct: 421 LNKPQTQQAVSYLQSRWGS 439 >UniRef50_C1M4K1 Lipoprotein YddW n=4 Tax=Enterobacteriaceae RepID=C1M4K1_9ENTR Length = 441 Score = 547 bits (1408), Expect = e-154, Method: Composition-based stats. Identities = 329/428 (76%), Positives = 373/428 (87%), Gaps = 3/428 (0%) Query: 11 TIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWP 70 ++ A+LV LLL SC S PP TP P + QQS +P+RGIWLATVSRLDWP Sbjct: 16 NMKWFAVLVGSMLLLGSCSSQPPGPKTTPLP---PVSKPQQSKEPVRGIWLATVSRLDWP 72 Query: 71 PVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMT 130 P+SSVNIS+P R QQ+A+ DKLD+L+RLGINTVFFQVKPDGTALW SKILPWSD +T Sbjct: 73 PISSVNISSPAVRISQQQKALTDKLDNLKRLGINTVFFQVKPDGTALWKSKILPWSDTLT 132 Query: 131 GKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYV 190 G IG++PGYDPLQFMLDEAHKRGMKVHAW NPYRVSVNTKP T+ ELNSTLSQ P+SVYV Sbjct: 133 GTIGQDPGYDPLQFMLDEAHKRGMKVHAWLNPYRVSVNTKPSTVSELNSTLSQTPSSVYV 192 Query: 191 QHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDN 250 HRDWIRT+G+RFVLDPGIP+V+DWITSIVAEVV YPVDGVQFDDYFYTESPGS LND+ Sbjct: 193 LHRDWIRTAGERFVLDPGIPDVRDWITSIVAEVVENYPVDGVQFDDYFYTESPGSALNDS 252 Query: 251 ETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSD 310 +T+R+YG FASKADWRR+NTQ+LIA+VS TIK +KP VEFGVSPAGVWRNRSHDP GSD Sbjct: 253 QTFRRYGQGFASKADWRRDNTQRLIAQVSRTIKKLKPEVEFGVSPAGVWRNRSHDPAGSD 312 Query: 311 TRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLY 370 TRGAAAYDESYADTRRWV+ GLLDYIAPQ+YWPF+R AARYDVLAKWWADVVK T TRLY Sbjct: 313 TRGAAAYDESYADTRRWVQLGLLDYIAPQLYWPFARDAARYDVLAKWWADVVKSTNTRLY 372 Query: 371 IGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAV 430 IG+A YKVGEPS+ EPDW + GGVPELKKQLDLN++ P I+GTILFREDYLN+PQTQ+AV Sbjct: 373 IGVALYKVGEPSRKEPDWTVKGGVPELKKQLDLNESEPYINGTILFREDYLNQPQTQEAV 432 Query: 431 SYLQSRWG 438 +Y+++RWG Sbjct: 433 TYIRNRWG 440 >UniRef50_C2DR58 Lipoprotein yddW n=6 Tax=Enterobacteriaceae RepID=C2DR58_ECOLX Length = 349 Score = 501 bits (1289), Expect = e-140, Method: Composition-based stats. Identities = 347/349 (99%), Positives = 348/349 (99%) Query: 91 MIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAH 150 MIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAH Sbjct: 1 MIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAH 60 Query: 151 KRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIP 210 KRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIP Sbjct: 61 KRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIP 120 Query: 211 EVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNN 270 EVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNN Sbjct: 121 EVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNN 180 Query: 271 TQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQ 330 TQQLIAKVSHTIKSIKP VEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQ Sbjct: 181 TQQLIAKVSHTIKSIKPEVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQ 240 Query: 331 GLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMI 390 GLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMI Sbjct: 241 GLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMI 300 Query: 391 NGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQSRWGS 439 NGGVPELKKQLDLNDA+PEISGTILFREDYLNKPQTQQAVSYLQSRWGS Sbjct: 301 NGGVPELKKQLDLNDALPEISGTILFREDYLNKPQTQQAVSYLQSRWGS 349 >UniRef50_A6WZY4 Putative uncharacterized protein n=10 Tax=Brucellaceae RepID=A6WZY4_OCHA4 Length = 442 Score = 468 bits (1204), Expect = e-130, Method: Composition-based stats. Identities = 190/384 (49%), Positives = 270/384 (70%), Gaps = 1/384 (0%) Query: 56 MRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGT 115 W+ATV LDWP SS I + R + Q++ ++ D GIN V FQV P Sbjct: 52 FHASWIATVLNLDWPSRSSSRIEDDAERIKRQKEELLRLFDEASEHGINAVIFQVSPTAD 111 Query: 116 ALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIR 175 A + S LPWS +TG +G++PG+DPL+F + EAHKRG+++HAW NPYRVS++TKP T + Sbjct: 112 AFYQSSYLPWSSYLTGTLGKDPGFDPLKFAIQEAHKRGIELHAWLNPYRVSMDTKPSTRK 171 Query: 176 ELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFD 235 EL ++ ++ P SV+ H DW+ S DR+VLDPGIP V++W+T++ AEVV +Y +DG+QFD Sbjct: 172 ELRNSSNESPVSVFKSHPDWVGVSADRYVLDPGIPAVREWVTNVTAEVVQKYDIDGIQFD 231 Query: 236 DYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSP 295 DYFY E+ S+L+D+++Y ++G F+SK +WRR NT L+ ++S IK+IKP V FG+SP Sbjct: 232 DYFYYETASSKLDDDKSYARFGTRFSSKYEWRRYNTHTLVREISDKIKAIKPNVRFGISP 291 Query: 296 AGVWRNRSHDPLGSDTR-GAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVL 354 +GVWRN + DP GS TR G YD +ADTRRWV++G++DYIAPQIYW F R Y + Sbjct: 292 SGVWRNAADDPRGSATRAGKTNYDGDFADTRRWVKEGMIDYIAPQIYWSFGRKDVSYGTI 351 Query: 355 AKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTI 414 AKWWAD V+ T+T LYIG+A Y+ G + +EP W GV E+K+QL+ N+++PE+ G+I Sbjct: 352 AKWWADTVRGTKTDLYIGLALYRAGSGTTLEPSWQAGEGVTEIKRQLEFNESLPEVKGSI 411 Query: 415 LFREDYLNKPQTQQAVSYLQSRWG 438 LFR+ +L+ P+ + +YL+ WG Sbjct: 412 LFRQGFLSDPKLKGVSNYLKKTWG 435 >UniRef50_Q48C14 YngK protein n=53 Tax=Proteobacteria RepID=Q48C14_PSE14 Length = 393 Score = 467 bits (1201), Expect = e-130, Method: Composition-based stats. Identities = 208/390 (53%), Positives = 280/390 (71%), Gaps = 1/390 (0%) Query: 50 QQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQ 109 +++ ++ W+ATV+ LDWP VSSV I++ +R Q++ + LD + + +N V FQ Sbjct: 2 ATANKNLKATWVATVTNLDWPSVSSVAITDEAARVSKQKEELTGILDEIVAMKMNAVIFQ 61 Query: 110 VKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNT 169 V P A + S +LPWS +TG +G+NPG+DPL + +++AH R +++HAW NPYRVS+N Sbjct: 62 VVPCADAFYASDLLPWSKYLTGTLGKNPGFDPLAYAIEQAHARNIELHAWVNPYRVSMNA 121 Query: 170 KPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPV 229 TI ELN++ S PASV+ H +W T+ +RFVL+PGIPEVQ W++SIV E+V++Y V Sbjct: 122 SDATIEELNNSSSDSPASVFKTHPEWTGTAANRFVLNPGIPEVQTWVSSIVEEIVTKYDV 181 Query: 230 DGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGV 289 D +QFDDYFY E+ S L D+ TY+KY F +KADWRRNNT L+ I ++K V Sbjct: 182 DAIQFDDYFYNETASSLLQDDATYQKYNTNFTTKADWRRNNTYSLVDTCHKKIAAVKADV 241 Query: 290 EFGVSPAGVWRNRSHDPLGSDTR-GAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSA 348 FGVSPAGVWRN+S DPLGSDT+ GA+ YD +YADTR+WV G++DYIAPQ+YWPF+R Sbjct: 242 LFGVSPAGVWRNKSDDPLGSDTQAGASNYDFAYADTRKWVIDGIIDYIAPQVYWPFAREV 301 Query: 349 ARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVP 408 ARYDV+ +WWAD V T T LYIG+A YKVG S+ EPDW + GGVPE+ +QLDLND++ Sbjct: 302 ARYDVITQWWADTVSGTGTALYIGMALYKVGTASETEPDWTVEGGVPEITRQLDLNDSLT 361 Query: 409 EISGTILFREDYLNKPQTQQAVSYLQSRWG 438 E+SG +LFR +L QTQQ V YL+ RW Sbjct: 362 EVSGCMLFRHMFLRASQTQQVVDYLKLRWA 391 >UniRef50_C9XP71 Cell surface protein n=6 Tax=Clostridium RepID=C9XP71_CLODC Length = 703 Score = 452 bits (1163), Expect = e-125, Method: Composition-based stats. Identities = 170/424 (40%), Positives = 238/424 (56%), Gaps = 40/424 (9%) Query: 13 RRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPV 72 + ++++L + L C S + + MR W++TV LDWP Sbjct: 3 KISILVLSLIMTLTMC--------------SVSSFADSSNDKEMRAAWISTVYNLDWPKT 48 Query: 73 SSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGK 132 + Q++ D LD L+ +GINT QV+P AL+ S I PWS+ +TG Sbjct: 49 KNN--------EAKQKKEYTDLLDKLKSVGINTAVVQVRPKSDALYKSNINPWSEYLTGT 100 Query: 133 IGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQH 192 G++PGYDPL F+++EAHKRGM+ HAWFNPYR+++ + ++ + PA ++ Sbjct: 101 QGKDPGYDPLPFLIEEAHKRGMEFHAWFNPYRITMADE-----SIDKLPANHPA---KKN 152 Query: 193 RDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNET 252 W+ G+++ DPG+PEV+ +I +AEVV Y +DGV FDDYFY PG ND T Sbjct: 153 PSWVVKHGNKYYYDPGLPEVRKYIVDSIAEVVQNYDIDGVHFDDYFY---PGVSFNDTAT 209 Query: 253 YRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTR 312 Y+KYG +K +WRR N L+ V +IKSIKP V FGVSPAG+WRN+S DP GSDT Sbjct: 210 YQKYGKG-QNKDNWRRENVNTLLRDVKASIKSIKPNVVFGVSPAGIWRNKSSDPTGSDTS 268 Query: 313 GAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIG 372 G +Y +YADTR W++QGL+DY+ PQ+YWP AA Y L WWA+ VK T LYIG Sbjct: 269 GNESYVGTYADTRAWIKQGLIDYVVPQLYWPIGLKAADYSKLVAWWANEVKGTNVDLYIG 328 Query: 373 IAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYL-NKPQTQQAVS 431 YK G+ S + E+ +Q+ LN EI G++ F + N Q+ + Sbjct: 329 QGIYKQGQSSYGGQNI-----AKEIVQQVTLNRKYSEIKGSMYFSAKDIANSTSIQKDLK 383 Query: 432 YLQS 435 L S Sbjct: 384 SLYS 387 >UniRef50_A6TUC1 Putative uncharacterized protein n=5 Tax=Bacteria RepID=A6TUC1_ALKMQ Length = 731 Score = 449 bits (1155), Expect = e-125, Method: Composition-based stats. Identities = 175/437 (40%), Positives = 259/437 (59%), Gaps = 15/437 (3%) Query: 7 NKKLTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSR 66 +K ++I IL+ AL L S P + P T + +RG W++TV Sbjct: 8 SKLISICIVGILMITALPLHSFAIEEPWDQYNQYLPRETPVT----KRHLRGAWISTVIN 63 Query: 67 LDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWS 126 LDWP V + I N R + ++ +I LD + +N VFFQV P+G A + S I+PWS Sbjct: 64 LDWPSVETAKIKNDKERIQKSKEELIAILDKSVEMNMNAVFFQVSPEGDAFYNSNIVPWS 123 Query: 127 DLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPA 186 +TG G++PG+DPL F ++EAHKR +++HAWFNPYR+S+ TI LN Sbjct: 124 RYLTGTFGKDPGFDPLAFAIEEAHKRNLELHAWFNPYRISMYMNDSTIESLNIE-----K 178 Query: 187 SVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSR 246 SVY +H DW++++ RFV+DPGIP+ ++W+ EVV+ Y VDG+ FDDYFY E Sbjct: 179 SVYKEHPDWVKSAMSRFVIDPGIPQAREWVIKRTMEVVNDYDVDGIHFDDYFYYEKHVGE 238 Query: 247 LNDNETYRKYG-GAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRN-RSH 304 L D +T+ +Y G F++ +WRRNNT L+ ++S+ I+ KP ++FG+SPAGVW N + Sbjct: 239 LEDQDTFSQYNLGQFSNLGEWRRNNTYLLVKELSNEIRKTKPWIKFGISPAGVWANKKDG 298 Query: 305 DPLGSDTR-GAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVK 363 GS+T G YD S+ADT++WVE+ ++DYIAPQ+Y+ F+ +A Y +A WW++VV+ Sbjct: 299 HLNGSNTSAGLPNYDRSFADTKKWVEEEIIDYIAPQVYFTFANPSAPYGEVANWWSNVVR 358 Query: 364 PTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNK 423 LYIG A YKV + + + + N V E +Q N PE+ G+I+FR N Sbjct: 359 GKNVHLYIGQALYKVNDNA--DQYFQGNHAVEEFVRQHKYNTMKPEVMGSIMFRFQNFNH 416 Query: 424 PQTQQAVSYLQS-RWGS 439 QQ V+ ++ W + Sbjct: 417 GNKQQVVNVMKEDLWST 433 >UniRef50_C6J3R7 Putative uncharacterized protein n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6J3R7_9BACL Length = 545 Score = 448 bits (1151), Expect = e-124, Method: Composition-based stats. Identities = 170/424 (40%), Positives = 240/424 (56%), Gaps = 28/424 (6%) Query: 21 LALLLCSCKSTPPESMVT--PPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNIS 78 LL +T PE V P P +S +RG+W++TVS LDWP SS Sbjct: 141 TITLLSGSGATQPEPGVGGEDPQSDVPQPPAVDTSNGLRGVWVSTVSNLDWPSKSSY--- 197 Query: 79 NPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPG 138 + Q+ + LD +Q +G+N VF QV+P A++PS +PWS +TG G++PG Sbjct: 198 ---GKVEAQKAEYVQLLDEVQAMGMNAVFVQVRPSADAIYPSSQVPWSSYLTGTAGKDPG 254 Query: 139 YDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRT 198 YDPLQF+++E H+RGM+ HAWFNP+R S + + V QH +WI Sbjct: 255 YDPLQFLIEETHRRGMEFHAWFNPFRASTGSDASKLPA---------NHVANQHPEWIVK 305 Query: 199 SGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYT--ESPGSRLNDNETYRKY 256 + ++PGIPE +D + S + EVV+ Y +DGV DDYFY E+ + D+ T++ Y Sbjct: 306 FDGKLYINPGIPEARDHVISAIMEVVNGYDIDGVHLDDYFYPTGETTSKKFADDATFKSY 365 Query: 257 G-GAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGA- 314 A+K DWRR+N Q + K+ I++ KP V FG+SP GVWRN+S+D GSDT+ + Sbjct: 366 NSKKIATKGDWRRDNINQFVQKLGQRIEASKPYVSFGISPYGVWRNKSNDLTGSDTKASV 425 Query: 315 AAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIA 374 AYD +YAD R W++ +DY+APQ+YW +R RYD+LA WWA V+ T +LYIG A Sbjct: 426 TAYDSTYADVRTWIKNEWIDYVAPQLYWSMTRKEVRYDLLADWWAQEVRGTNVKLYIGHA 485 Query: 375 FYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQ 434 YK+G P + E+ QL+ N +PEISG+I F L K + LQ Sbjct: 486 PYKLGTP------EIGWSSAQEIINQLEYNRQIPEISGSIFFSAKDLRK-NPLGLIPLLQ 538 Query: 435 SRWG 438 S +G Sbjct: 539 SYYG 542 >UniRef50_O35015 UPF0748 protein yngK n=11 Tax=Bacteria RepID=YNGK_BACSU Length = 510 Score = 446 bits (1148), Expect = e-124, Method: Composition-based stats. Identities = 161/385 (41%), Positives = 228/385 (59%), Gaps = 23/385 (5%) Query: 43 SKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLG 102 S P QS + +R +W+A+V +DWP +++ + Q+Q I LD +Q++G Sbjct: 23 SVPFMANAQSDRELRAVWIASVLNIDWPSKKGLSV-------KEQKQEYIKLLDDVQKMG 75 Query: 103 INTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNP 162 +N V Q+KP A +PS PWS+ +TG G++PGYDPL FM++E HKR ++ HAWFNP Sbjct: 76 MNAVIVQIKPTADAFYPSAYGPWSEYLTGVQGKDPGYDPLAFMIEETHKRNLEFHAWFNP 135 Query: 163 YRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAE 222 YR+++N +LN PA +H DW+ G++ PGIPE +D+I + E Sbjct: 136 YRITMNHT-----DLNKLSEDHPAR---KHPDWVAAYGNQLYYHPGIPEARDFIVKGIEE 187 Query: 223 VVSRYPVDGVQFDDYFYTES-PGSRLNDNETYRKYGG-AFASKADWRRNNTQQLIAKVSH 280 VV Y +D V DDYFY G D Y +YG AF++ DWRR+N QL+ +++ Sbjct: 188 VVKHYDIDAVHMDDYFYPYKIAGQEFPDQAQYEQYGKDAFSNIDDWRRDNVNQLVKQINQ 247 Query: 281 TIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTR-GAAAYDESYADTRRWVEQGLLDYIAPQ 339 TIK+ KP V+FG+SP GVWRN + DP GS+T+ G YD+ YADTR W+++G +DYIAPQ Sbjct: 248 TIKAAKPYVKFGISPFGVWRNAADDPTGSNTKAGVRNYDDLYADTRHWIQEGDIDYIAPQ 307 Query: 340 IYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKK 399 IYW +AA YDVLA WW++ VK LYIG A YK+ +P W E + Sbjct: 308 IYWSIGFNAAAYDVLADWWSNEVKNRPVHLYIGQAAYKINNN--FDPPW---SDPEEYVR 362 Query: 400 QLDLNDAVPEISGTILFREDYLNKP 424 Q+ LN + + G++ F LNK Sbjct: 363 QITLNRQLELVKGSMHFSLKDLNKN 387 >UniRef50_A7Z5C7 YngK n=5 Tax=Bacteria RepID=A7Z5C7_BACA2 Length = 512 Score = 442 bits (1137), Expect = e-122, Method: Composition-based stats. Identities = 168/418 (40%), Positives = 232/418 (55%), Gaps = 32/418 (7%) Query: 10 LTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDW 69 + R +++ LA++L + T S A+ Q + MR +W+A+V+ +DW Sbjct: 1 MKSCRFSMIWFLAVVLTAGIFTFSAS---------AQASGTQPKREMRAVWIASVTNIDW 51 Query: 70 PPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLM 129 P + Q++ LD +Q +G+N V Q+KP A +PS PWS+ + Sbjct: 52 PSKKGL-------SPEEQKREYSKLLDDVQEMGMNAVIVQIKPAADAFYPSDYGPWSEYL 104 Query: 130 TGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVY 189 TG G+NPGYDPL F+++E HKR ++ HAWFNPYR+++N LN+ PA Sbjct: 105 TGTQGKNPGYDPLAFLVEETHKRNLEFHAWFNPYRITMNHT-----NLNALSDDHPAR-- 157 Query: 190 VQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTES-PGSRLN 248 H DW+ G + +PGIPEV+ +IT + EVVSRY +D V DDYFY G Sbjct: 158 -SHPDWVAAYGKQLYYNPGIPEVRQFITDGIKEVVSRYDIDAVHMDDYFYPYKIAGQEFP 216 Query: 249 DNETYRKYGGA-FASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPL 307 D Y +YG A FAS DWRR+N +L+ +++ TIK KP V+FG+SP GVWRN + DP Sbjct: 217 DQAEYERYGKAHFASIDDWRRDNVNRLVKEINQTIKREKPYVKFGISPFGVWRNAADDPT 276 Query: 308 GSDT-RGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTR 366 GS+T G YD+ YADTR W+++G +DYIAPQIYW AA YDVLA WW V Sbjct: 277 GSETAAGVRNYDDLYADTREWIQKGYIDYIAPQIYWSIGFKAAAYDVLADWWGKEVNNRP 336 Query: 367 TRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKP 424 LYIG A YK+ + +P W G E Q+ LN I G++ F LN+ Sbjct: 337 VHLYIGQAAYKINNNA--DPAWADPG---EYGGQITLNRGSAWIKGSLHFSLKDLNRN 389 >UniRef50_Q81DH4 FenI n=65 Tax=Bacteria RepID=Q81DH4_BACCR Length = 519 Score = 440 bits (1132), Expect = e-122, Method: Composition-based stats. Identities = 158/411 (38%), Positives = 221/411 (53%), Gaps = 24/411 (5%) Query: 17 ILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVN 76 I+ L ++ C P S + PP + T +R +W+A+V +DWP + + Sbjct: 2 IVKRLLMICCIVILFIPFSFI-PPHFTYAEVNTTYKKHELRAVWIASVLNIDWPSKTGLP 60 Query: 77 ISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGEN 136 I Q+Q I LD ++ G+N V Q+KP A +PS PWS+ +TG G++ Sbjct: 61 I-------EKQKQEFIRLLDDVKSTGMNAVVVQIKPTADAFYPSNYGPWSEYITGTQGKD 113 Query: 137 PGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWI 196 PGYDPL FM++E HKR ++ HAW NPYR+++N ++N + PA QH DW+ Sbjct: 114 PGYDPLAFMIEETHKRNIEFHAWINPYRITMNHT-----DINRLSNNHPAR---QHPDWV 165 Query: 197 RTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTES-PGSRLNDNETYRK 255 T G + +PGIPEV+ +IT E+V Y +D + DDYFY G D +TY Sbjct: 166 VTYGGKLYYNPGIPEVKKFITEGALEIVENYDIDALHMDDYFYPYKVAGEEFPDQKTYET 225 Query: 256 Y-GGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSD-TRG 313 Y G F + DWRRNN +L+ ++ IK K V+FG+SP GVWRN + DP GS+ T G Sbjct: 226 YNNGRFTNIEDWRRNNVNELVKDLNTAIKQEKSYVKFGISPFGVWRNIADDPTGSNTTAG 285 Query: 314 AAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGI 373 YD+ YADTR W+++G +DYI PQIYW + A YD+L WW LYIG Sbjct: 286 QRNYDDLYADTREWIQKGYIDYITPQIYWNIGFTPAAYDILVDWWVKETNNKPLHLYIGQ 345 Query: 374 AFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKP 424 A YK+ S P W E KQ+ LN P+I G++ F +N Sbjct: 346 AAYKINNNSV--PAW---SDPEEYPKQIALNRLYPDIKGSMHFSLKDINNN 391 >UniRef50_D1A3Q7 Putative uncharacterized protein n=1 Tax=Thermomonospora curvata DSM 43183 RepID=D1A3Q7_THECD Length = 532 Score = 435 bits (1118), Expect = e-120, Method: Composition-based stats. Identities = 156/429 (36%), Positives = 225/429 (52%), Gaps = 40/429 (9%) Query: 5 SRNKKLTIRRPAILVALALLLCSCKST------------PPESMVTPPAGSKPPATTQQS 52 K I+ + VA + LL C S P + + KPP + + Sbjct: 4 LSGSKERIKIASAAVAASGLLAGCTSAAGGEVGALRADAPIAAGMAECPDIKPPGDS--A 61 Query: 53 SQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKP 112 ++ +RG+W+ATVS +DWP ++ A ++ LD + LG+N VF QV+P Sbjct: 62 ARQVRGMWIATVSGIDWPSDTA-------HSAERKKADYRKLLDQARALGLNAVFVQVRP 114 Query: 113 DGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPG 172 A + S PWS ++G+ G +PG+D L+F + EAHKR ++ HAWFNPYRV+++ G Sbjct: 115 SADAFYDSPYEPWSQWISGEQGRDPGFDVLEFFVSEAHKRDLEFHAWFNPYRVALHNDRG 174 Query: 173 TIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGV 232 + P + ++ W+R + DPG+P+V++ +T +V +VV +Y +D V Sbjct: 175 KL---------HPDNPARKNPSWVREYDGKLWYDPGLPQVRELVTKVVLDVVGKYDIDAV 225 Query: 233 QFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFG 292 DDYFY G D +TYR+YG SK DWRR N L+ + I KP V FG Sbjct: 226 HLDDYFYPYPSGGDFPDEDTYRRYGRGM-SKGDWRRANVDALVKGLHEEIHRAKPQVRFG 284 Query: 293 VSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYD 352 +SP GVWRNR DP GS T +YD+ YADTR+WV+QG +DYI PQ+YW +AA Y Sbjct: 285 ISPFGVWRNRRSDPAGSQTTALQSYDDVYADTRKWVKQGWVDYITPQLYWEIGNAAADYS 344 Query: 353 VLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISG 412 L WWA+ V+ T +L IG A Y+VGE EL + L +N ++ G Sbjct: 345 TLVAWWAEQVEGTGVQLTIGQASYRVGERG---------FDAGELSRHLAVNARHRQVRG 395 Query: 413 TILFREDYL 421 + F L Sbjct: 396 DVYFSAKDL 404 >UniRef50_C0Z8S4 Putative uncharacterized protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0Z8S4_BREBN Length = 540 Score = 428 bits (1101), Expect = e-118, Method: Composition-based stats. Identities = 165/411 (40%), Positives = 228/411 (55%), Gaps = 24/411 (5%) Query: 28 CKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQ 87 TPP + G+ P T ++ + G+W++TV LDWP S Q Sbjct: 151 ATMTPPPQDILSGNGAMEPGTPVVTNGNLHGVWISTVYNLDWPSSGSYGNP------AKQ 204 Query: 88 QQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLD 147 QQ I LD LQ +G+N F QV+P G AL+PS + PWS +TG G++PGYDPL FM+ Sbjct: 205 QQEYIQLLDELQAMGMNAAFVQVRPSGDALYPSTLTPWSRFLTGTPGKDPGYDPLAFMVQ 264 Query: 148 EAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDP 207 E H+RGM+ HAWFNP+R + + K + V QH DWI + + ++P Sbjct: 265 ETHRRGMQFHAWFNPFRATTDAKTDQLPA---------NHVIKQHPDWIVNANKKLYINP 315 Query: 208 GIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWR 267 G+P + I + V EVV RY +DGV DDYFY + + SKADWR Sbjct: 316 GVPAARQQIINEVMEVVQRYDIDGVHLDDYFYPSNVAFADDAAFK-AYNSKKIVSKADWR 374 Query: 268 RNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTR-GAAAYDESYADTRR 326 R+N Q + +++ +IKS+KP V+FG+SP GVWRN + DP GSDT+ G AYD +AD R Sbjct: 375 RDNINQFVQQMNQSIKSVKPHVQFGISPFGVWRNSNVDPTGSDTKAGVTAYDHMFADVRT 434 Query: 327 WVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEP 386 W++QG +DY+ PQIYW FS + A+YD L WWA+ V+ T +LYIG + YK+G E Sbjct: 435 WIQQGWIDYVTPQIYWSFSFAPAQYDKLVTWWANEVQGTNVKLYIGHSPYKLGT---AEA 491 Query: 387 DWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQSRW 437 W E+ QL+ N VP++ G+I F L K + L S + Sbjct: 492 GWQ---SAQEIINQLNFNAMVPQVQGSIFFSAKDLRK-NPLGLLPALSSYY 538 >UniRef50_D1S7M0 Putative uncharacterized protein n=1 Tax=Micromonospora aurantiaca ATCC 27029 RepID=D1S7M0_9ACTO Length = 555 Score = 428 bits (1100), Expect = e-118, Method: Composition-based stats. Identities = 159/401 (39%), Positives = 226/401 (56%), Gaps = 27/401 (6%) Query: 26 CSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRAR 85 ++P VT PA + R +W+A+V+ +DWP S + + Sbjct: 33 TGTATSPSTDCVTDPAT---------PKRQFRAMWIASVTNIDWPSKGSWTAPD---QVA 80 Query: 86 VQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFM 145 Q+ + LD Q+L N V QV+P A WPS PWS+ +TG G+NPG+DPL F+ Sbjct: 81 KQKAEYLAWLDLAQKLNHNAVVVQVRPTADAFWPSPYEPWSEYLTGVRGKNPGWDPLDFL 140 Query: 146 LDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTS------ 199 + E+HKR ++ HAWFNPYRVS+ G +L+ PA QH DW+ Sbjct: 141 VAESHKRNLEFHAWFNPYRVSMPAPGGAGADLSQLAPDSPAR---QHPDWVFAYPPAGVA 197 Query: 200 GDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGS-RLNDNETYRKYGG 258 G R +PG+PEV++++ + + + V RY +DGV FDDYFY G+ ++ D+ T+ Y Sbjct: 198 GSRLYYNPGVPEVREFVQTAMMDAVKRYDIDGVHFDDYFYPYPSGTHQVPDDATFAAYNR 257 Query: 259 AFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYD 318 F KADWRR+N LI +++ IK++KP V+FGVSP G+WRN S DP GSDT G+ +YD Sbjct: 258 GFTDKADWRRDNINLLIQEMNAKIKAVKPYVKFGVSPFGIWRNASADPNGSDTTGSQSYD 317 Query: 319 ESYADTRRWVEQGLLDYIAPQIYWPFSR-SAARYDVLAKWWADVVKPTRTRLYIGIAFYK 377 AD+R+WV++ +DYI PQ+YW + AA Y L WWA+ V+ TR +LYIG A YK Sbjct: 318 IISADSRKWVKEEWIDYIVPQLYWYIGQYPAADYARLVPWWAEQVRGTRVQLYIGQADYK 377 Query: 378 VGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFRE 418 G+P+ EL L LN + PE+ G + F Sbjct: 378 SGDPAYGS----FWMNPQELSNHLTLNRSYPEVLGNVHFSA 414 >UniRef50_UPI000178945D protein of unknown function DUF187 n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI000178945D Length = 518 Score = 427 bits (1097), Expect = e-118, Method: Composition-based stats. Identities = 167/432 (38%), Positives = 234/432 (54%), Gaps = 32/432 (7%) Query: 12 IRRPAILVALALLL----CSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRL 67 I+ ++V L + + ++ GS + +RG W++TV L Sbjct: 112 IKNGRVMVPLRFISENLGVQVEWNQAAQRISLSTGSVVVPPPVSTGDEVRGAWISTVFNL 171 Query: 68 DWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSD 127 DWP + A QQ + I LD LQ +GINTV+ QV+P G AL+PS ++PWS Sbjct: 172 DWPKTKT--------SAEQQQASYIALLDSLQDVGINTVYVQVRPAGDALYPSTMVPWSK 223 Query: 128 LMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPAS 187 ++TG G +PGYDP+ FM++E H+R M+ HAWFNP+R + + T S P+ Sbjct: 224 VLTGIQGADPGYDPVAFMVEETHRRNMEFHAWFNPFRANTDIL---------TASLHPSH 274 Query: 188 VYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRL 247 V + H DWI +G + ++PGIPE + I + EVV+ Y +DG+ DDYFY + Sbjct: 275 VALSHPDWIVNTGKQLYINPGIPEARQHIIDTIMEVVNGYDIDGIHLDDYFYPS--NTVF 332 Query: 248 NDNETYRKY-GGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDP 306 ND+ YR++ GA+A+ ADWRR N + + +I +KP VE+G+SP GVWRN+S D Sbjct: 333 NDDAAYREFNNGAYANLADWRRGNINAFVQSLGESIHRVKPDVEYGISPFGVWRNQSVDK 392 Query: 307 LGSDTR-GAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPT 365 GSDT+ G AYD YAD R W++ G +DY+APQIYW S AA YD L WWA V+ T Sbjct: 393 TGSDTKAGVTAYDSMYADVRTWIQNGWIDYVAPQIYWSMSNPAADYDKLVDWWASEVQGT 452 Query: 366 RTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQ 425 L IG A YK+G E W E+ QL N E+ G+I FR + + Sbjct: 453 GVDLLIGHAPYKLGTS---EIGWQ---SASEIINQLKYNQNHAEVKGSIFFRAENILS-N 505 Query: 426 TQQAVSYLQSRW 437 LQS + Sbjct: 506 PLGIKDQLQSYY 517 >UniRef50_D2AWT1 FenI protein n=1 Tax=Streptosporangium roseum DSM 43021 RepID=D2AWT1_STRRD Length = 533 Score = 426 bits (1096), Expect = e-118, Method: Composition-based stats. Identities = 159/377 (42%), Positives = 215/377 (57%), Gaps = 23/377 (6%) Query: 46 PATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINT 105 + + +RG+W+ATV +DWP + + A QQ + LD+ + +N Sbjct: 60 TTDVRYPKRQLRGVWIATVKNIDWPSRTGL-------SAAKQQAEYVRILDNAVKRRLNA 112 Query: 106 VFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRV 165 VF QV+P AL+ S + PWS +TG G++PG+DPL F++ EAHKRG++ HAWFNPYR Sbjct: 113 VFVQVRPASDALYKSSLEPWSKFLTGTAGKDPGWDPLPFLVAEAHKRGLEFHAWFNPYRA 172 Query: 166 SVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVS 225 S + +++ + PA V H DWI +PG+P V+D +TS++ +VV Sbjct: 173 SYD------GDVSKLPADHPARV---HPDWIVKHEGLVYYNPGLPAVRDHVTSVITDVVK 223 Query: 226 RYPVDGVQFDDYFYTESPGS-RLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKS 284 RY VDGV FDDYFY GS + D +RKYG ADWRR+N +LIA+V + Sbjct: 224 RYDVDGVHFDDYFYPYPGGSAQFADGAAFRKYGKGEK-LADWRRSNVDKLIAQVDEAVHG 282 Query: 285 IKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPF 344 K V+FG+SP G+WRN++ DP GS T G +AYD YAD R W+ +G +DY+APQ+YWP Sbjct: 283 TKQHVKFGISPFGIWRNKAQDPTGSATAGMSAYDSIYADARHWIRKGTVDYVAPQLYWPS 342 Query: 345 SRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLN 404 AA YDVL WWA VK T LYIG A Y+VG S P W G EL L N Sbjct: 343 GFKAADYDVLMPWWAKEVKGTDVHLYIGQALYRVG--STDTPAWTRPG---ELPSHLTKN 397 Query: 405 DAVPEISGTILFREDYL 421 ++ G + F L Sbjct: 398 RKHKQVKGDVYFNAKQL 414 >UniRef50_C4RBZ7 FenI protein n=10 Tax=Actinomycetales RepID=C4RBZ7_9ACTO Length = 538 Score = 423 bits (1086), Expect = e-117, Method: Composition-based stats. Identities = 158/381 (41%), Positives = 220/381 (57%), Gaps = 18/381 (4%) Query: 46 PATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINT 105 P + R +W+++V +DWP +S + R Q+ + LD QRL N Sbjct: 27 PTDPAAPKRQFRAMWISSVVNIDWPTKASQTAPD---RIAAQRAEYLGWLDLAQRLHHNA 83 Query: 106 VFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRV 165 V QV+P ALWPS PWS+ +TG G++PG+DPL F++DEAHKR ++ HAWFNPYR+ Sbjct: 84 VVVQVRPTADALWPSPHEPWSEYLTGVRGQDPGWDPLAFLVDEAHKRNLEFHAWFNPYRI 143 Query: 166 SVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTS------GDRFVLDPGIPEVQDWITSI 219 S+ G +L PA QH +W G R +PGIP V++++ + Sbjct: 144 SMPAPGGAGADLAQLAPDHPAR---QHPEWTFAYPPAGVAGSRLYYNPGIPAVREFVQTA 200 Query: 220 VAEVVSRYPVDGVQFDDYFYTESPGSR-LNDNETYRKYGGAFASKADWRRNNTQQLIAKV 278 + + V+RY VDGV FDDYFY G+ + D+ T+ ++ F +ADWRR+N LI ++ Sbjct: 201 MMDAVTRYDVDGVHFDDYFYPYPSGTYQVPDDATFAEFNRGFTDRADWRRDNINLLIREM 260 Query: 279 SHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAP 338 + IK+ KP V+FGVSP G+WRN S DPLGSDT G+ +YD ADTR+WV+Q +DYI P Sbjct: 261 NDRIKAAKPWVKFGVSPFGIWRNASVDPLGSDTTGSQSYDIISADTRKWVKQEWIDYIVP 320 Query: 339 QIYWPFSR-SAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPEL 397 Q+YW + AA Y L WWA+ V+ TR +LYIG A YK G+P+ EL Sbjct: 321 QLYWYIGQYPAADYARLVPWWAETVRGTRVQLYIGQADYKSGDPAYGS----YWQNPREL 376 Query: 398 KKQLDLNDAVPEISGTILFRE 418 L LN + PE+ G + F Sbjct: 377 SDHLTLNRSYPEVLGNVHFSA 397 >UniRef50_C5C4P8 Putative uncharacterized protein n=4 Tax=Bacteria RepID=C5C4P8_BEUC1 Length = 538 Score = 422 bits (1085), Expect = e-116, Method: Composition-based stats. Identities = 160/410 (39%), Positives = 220/410 (53%), Gaps = 22/410 (5%) Query: 13 RRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPV 72 RR + +A A + S + SM TP A + + +R +W+++V +DWP Sbjct: 17 RRTFLTLATAGVAASTLTVTVGSM-TPAAATPSADPAAFLKRELRAMWISSVVNIDWPSA 75 Query: 73 SSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGK 132 + + A QQ + LD Q +N VF QV+P A WPS PWS +TG Sbjct: 76 TGL-------SAEAQQAEYLHWLDVAQDFRLNAVFVQVRPTADAFWPSPHEPWSQYLTGV 128 Query: 133 IGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQH 192 G++PGYDPL F+++E HKR +++H W+NPYRVS+ P + P H Sbjct: 129 QGQDPGYDPLAFIVEETHKRNLELHTWYNPYRVSMQADPAQLV---------PEHPARVH 179 Query: 193 RDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTES-PGSRLNDNE 251 DWI G + DPG+PE Q+ I + + V Y +DGV FDDYFY G + D E Sbjct: 180 PDWIWPYGGKLYFDPGLPETQEHIQAAILHSVENYDIDGVHFDDYFYPYPVAGQTIPDAE 239 Query: 252 TYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDT 311 TY YG F DWRR+N I+ +S IK +KP V+FG+SP G+WRN + DPLGS T Sbjct: 240 TYATYGAGFDDVGDWRRHNVDTFISSISARIKQVKPWVKFGISPFGIWRNDTTDPLGSAT 299 Query: 312 RGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYI 371 RG+ +YD +ADTR+WV +G LDYI PQ+YW + A Y VL WWADV + T LYI Sbjct: 300 RGSQSYDLQFADTRKWVLEGWLDYINPQVYWQIGLAVADYSVLVPWWADVAATSGTHLYI 359 Query: 372 GIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYL 421 G A YKV +P + N L D+ + V + G + F ++ Sbjct: 360 GEALYKVTSGVFTDPAELANH----LALDRDVTETVGPVHGNVYFSAKHV 405 >UniRef50_B8I4Q9 Putative uncharacterized protein n=2 Tax=Bacteria RepID=B8I4Q9_CLOCE Length = 997 Score = 420 bits (1080), Expect = e-116, Method: Composition-based stats. Identities = 155/439 (35%), Positives = 241/439 (54%), Gaps = 24/439 (5%) Query: 7 NKKLTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSR 66 NKK+ + ++ + L + K + G+ A T + +RG+W+A+VS Sbjct: 2 NKKIGVVCILLVFLMILPIAGYKLFSDRNY----DGNVSNAQTVSKIEDLRGVWIASVSN 57 Query: 67 LDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWS 126 +D+P P A Q++ + D + + Q +G+N +FFQ++P G AL+ S I PWS Sbjct: 58 IDFPSK-------PGISAEKQKKELDDIISNAQYMGLNAIFFQIRPTGDALYKSTIFPWS 110 Query: 127 DLMTGKIGE--NPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQ 184 +TGK G+ + G+DPL +++++AHK+G+++HAW NP R+S+ T +N Sbjct: 111 AYLTGKQGKENDNGFDPLAYIIEQAHKKGIQIHAWINPLRLSMGTTSNPTGNINVLSDNH 170 Query: 185 PASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYT---E 241 PA + + + LDPG P IT VAE+V Y VDG+ FDDYFY E Sbjct: 171 PARKIPEA--VVAAPTGQLYLDPGNPAAIKLITDGVAEIVKNYDVDGIHFDDYFYPSKSE 228 Query: 242 SPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRN 301 G ND+ +Y KY G+F +K DWRRNN L+ +T+K+IKP V+FG+SP +W N Sbjct: 229 GKGVDFNDSASYAKYKGSFKNKDDWRRNNINTLVKSTYNTVKNIKPSVQFGISPFAIWSN 288 Query: 302 RSHDPLGSDTRG-AAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWAD 360 + + GSDT+G + Y + YAD+++WV++ +DYIAPQIYW A Y VL WW + Sbjct: 289 KDRNKEGSDTQGGISTYYDHYADSKKWVKEAYIDYIAPQIYWNIGFKVADYSVLVNWWKN 348 Query: 361 VVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDY 420 V + T+ +LY+G A YK+ + ++ DW+ +P KQ+ N + G+I + Sbjct: 349 VCRGTKVKLYVGHAAYKINDTTQ-SNDWLDPLQIP---KQIAYNRKSNSVDGSIFYGYSK 404 Query: 421 LNKPQTQQAVSYLQSRWGS 439 L K T L+ + S Sbjct: 405 L-KNNTLGIKDKLKGIFVS 422 >UniRef50_UPI00016A6D2C fenI protein n=1 Tax=Burkholderia oklahomensis EO147 RepID=UPI00016A6D2C Length = 521 Score = 420 bits (1080), Expect = e-116, Method: Composition-based stats. Identities = 156/423 (36%), Positives = 220/423 (52%), Gaps = 24/423 (5%) Query: 20 ALALLLCSCKSTPPESMVTPPAGSKPPATTQQ-SSQPMRGIWLATVSRLDWPPVSSVNIS 78 A+ L + + T M++ A + A ++ + R W+A V +DWP + ++ Sbjct: 7 AIVLTISAGACTRSSDMISENAPNVACAVSRATPKRDFRAFWIAAVRNIDWPSREGLTVA 66 Query: 79 NPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPG 138 QQ+ + LD RL N V QV+P + WPS PWS+ +TG G +PG Sbjct: 67 E-------QQEELRKWLDLAVRLRYNAVILQVRPVSDSFWPSPFAPWSEFLTGTQGTDPG 119 Query: 139 YDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRT 198 YDPL F + EAH+R +++HAWFNPYR + NT+ + P HRDW+ + Sbjct: 120 YDPLAFAVAEAHRRNLELHAWFNPYRAARNTQIDLLA---------PTHPARLHRDWLVS 170 Query: 199 SGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTES-PGSRLNDNETYRKYG 257 ++ +PG+P ++ I + + V RY VDGV DD+FY G D TY +YG Sbjct: 171 YDNQLYFNPGVPAAREHIVDAIMDAVDRYDVDGVHLDDFFYPYPIAGETFADTATYMQYG 230 Query: 258 GAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTR-GAAA 316 F + ADWRR+N + +S IK++KP V+FG+SP VWRN S DP GS+T Sbjct: 231 AGFTTLADWRRHNVDVFVEMLSRRIKAVKPWVKFGISPFAVWRNASVDPQGSETSTDVQT 290 Query: 317 YDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFY 376 YD+ YADTRRW+ + +DYIAPQ+YW A YD + WW + V+ +R LYIG A Y Sbjct: 291 YDDQYADTRRWLRENWIDYIAPQVYWAQDFQRADYDKVVSWWVEQVRSSRAHLYIGQAAY 350 Query: 377 KVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQSR 436 KVG S P W EL L N PE+ G I F + + ++S L Sbjct: 351 KVG-ISNQSPGW---ASPAELANHLAFNCKFPEVKGNIYFSAKDVRADRL-GSISQLGHT 405 Query: 437 WGS 439 W S Sbjct: 406 WYS 408 >UniRef50_Q47Q17 FenI protein n=9 Tax=Bacteria RepID=Q47Q17_THEFY Length = 540 Score = 420 bits (1079), Expect = e-116, Method: Composition-based stats. Identities = 164/384 (42%), Positives = 216/384 (56%), Gaps = 24/384 (6%) Query: 40 PAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQ 99 P + MRG+WL TV +DWP S P + QQ+ + LD Sbjct: 55 PIPEDCATDPAYPKRQMRGVWLTTVRNIDWP-------SEPGLSPQQQQEELTAFLDRAV 107 Query: 100 RLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAW 159 LG+N VFF ++P A++ S PW+ +TG G +PGYDPL+F + EAH RG+++HAW Sbjct: 108 ELGLNAVFFHIRPTADAVYASDKEPWARYLTGTQGGDPGYDPLEFAVAEAHTRGLELHAW 167 Query: 160 FNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSI 219 FNPYRV +L P ++ +W+ D+ LDPG PEV++W+ + Sbjct: 168 FNPYRVGWRE-----ADLEHLADDHPVR---RNPEWMIVYDDQGYLDPGNPEVREWVVDV 219 Query: 220 VAEVVSRYPVDGVQFDDYFYTESP-GSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKV 278 VA+VV RY VDGV FDDYFY G +D+ +++ +G F + WRR+N QLI +V Sbjct: 220 VADVVERYDVDGVHFDDYFYPYPASGETFDDDASWQAHGDGFPDRDAWRRDNVNQLIRQV 279 Query: 279 SHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAP 338 + IKP V FGVSP G+WRNRS DP GS T G +YD +ADTR W+ +G +DY+ P Sbjct: 280 HERVHDIKPWVRFGVSPFGIWRNRSSDPSGSATSGLQSYDALHADTRTWIREGWIDYVVP 339 Query: 339 QIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELK 398 Q+YWP +AA Y VLA WWA+ V T LYIG A Y+VG E W G L Sbjct: 340 QLYWPQGFAAADYAVLAPWWAEEVAGTGVDLYIGQAAYRVG-----EDGWK---GADALA 391 Query: 399 KQLDLNDAVPEISGTILFREDYLN 422 KQLD N PEI+G I F LN Sbjct: 392 KQLDFNTQHPEITGDIYFSMKDLN 415 >UniRef50_A1V3X0 FenI protein n=36 Tax=Bacteria RepID=A1V3X0_BURMS Length = 521 Score = 419 bits (1078), Expect = e-116, Method: Composition-based stats. Identities = 154/410 (37%), Positives = 215/410 (52%), Gaps = 25/410 (6%) Query: 32 PPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAM 91 P +P T + RG W+A+V LDWP S P A QQ + Sbjct: 22 ASSPQAVPEVACRPDET--MPKRQFRGTWIASVINLDWP-------SRPGLPAAAQQAEL 72 Query: 92 IDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHK 151 LD R+ N V QV+P A WPS PWS +TG G +PGYDPL F + EAH+ Sbjct: 73 SAWLDDAVRMNRNAVILQVRPTADAFWPSPFEPWSKYLTGAQGGDPGYDPLAFAVAEAHR 132 Query: 152 RGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPE 211 R +++HAWFNPYRV+++ + L++ ++ PA H DW+ G + +PG+P Sbjct: 133 RNLELHAWFNPYRVAMDDR------LDALVATHPAR---AHPDWVVRYGGKLYYNPGVPA 183 Query: 212 VQDWITSIVAEVVSRYPVDGVQFDDYFYTES-PGSRLNDNETYRKYGGAFASKADWRRNN 270 + ++ + + V+RY +D V DDYFY G+ +D Y +YG FA+ ADWRR+N Sbjct: 184 ARAFVVDAIMDAVARYDIDAVHLDDYFYPYPVAGATFDDASAYAQYGAGFATLADWRRDN 243 Query: 271 TQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGA-AAYDESYADTRRWVE 329 +L+ ++ IK+ KP V+FG+SP VWRN + DP GS T + YD+ YADTRRWV Sbjct: 244 VDRLVESLARRIKAAKPWVKFGISPFAVWRNAATDPQGSRTSASVQTYDDLYADTRRWVR 303 Query: 330 QGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWM 389 + +DY+ PQ YW + A YD + WWA+ V+ LYIG A YKVG S P W Sbjct: 304 ERWIDYVVPQAYWARGFAPADYDEVVAWWANEVRGRDAHLYIGQAAYKVGT-SNQSPGW- 361 Query: 390 INGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQSRWGS 439 EL + L N PE+ G + F + + A + L W S Sbjct: 362 --SDPDELSRHLAFNLTAPEVKGDVYFSAKDVRADRL-GATTRLNRTWYS 408 >UniRef50_A8MM80 Putative uncharacterized protein n=1 Tax=Alkaliphilus oremlandii OhILAs RepID=A8MM80_ALKOO Length = 476 Score = 415 bits (1067), Expect = e-114, Method: Composition-based stats. Identities = 147/390 (37%), Positives = 204/390 (52%), Gaps = 26/390 (6%) Query: 51 QSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQV 110 +R W++TV LDWP + + + Q+ LD L+ G+N V Q+ Sbjct: 108 HKDAELRATWISTVYNLDWPSKKGLAVED-------QKSEFTALLDGLKSAGLNAVMVQI 160 Query: 111 KPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTK 170 KP + +PS+ PWS+ +TG G++PGY+PL FM++E HKR M+ HAWFNPYRVSV Sbjct: 161 KPSADSFYPSQYGPWSEYLTGVQGKDPGYNPLAFMIEETHKRNMEFHAWFNPYRVSVKED 220 Query: 171 PGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVD 230 + E ++ DW+ + G + +PGIP VQ ++ + EVV Y +D Sbjct: 221 RNALAE---------GHPAKKNPDWVVSYGGKLFYNPGIPAVQQFVIDSILEVVKNYNID 271 Query: 231 GVQFDDYFYTESPGS-RLNDNETYRKYGG-AFASKADWRRNNTQQLIAKVSHTIKSIKPG 288 GV DDYFY D E Y+ Y A +K WRRNN I + +IK K Sbjct: 272 GVHLDDYFYPYPEKEGDFPDEELYQSYRRTASETKEQWRRNNINDFIQNLYQSIKREKST 331 Query: 289 VEFGVSPAGVWRNRSHDPLGSDTR-GAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRS 347 V GVSP G+WRN++ DP GS+TR G +YD YADT+ W+E G LDYIAPQ+YW Sbjct: 332 VVLGVSPFGIWRNKADDPKGSNTRGGVTSYDSLYADTKYWIENGWLDYIAPQVYWHIGYD 391 Query: 348 AARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAV 407 A Y L WW++VV+ + LYIG A YKV E G E+ Q++ N + Sbjct: 392 RAEYKELINWWSNVVQNKKVELYIGQAAYKV------EAGTTPWGNPLEILDQIEYNRMI 445 Query: 408 PEISGTILFREDYLNKPQTQQAVSYLQSRW 437 PE+ G+I FR + L+ + Sbjct: 446 PEVKGSIFFRAKSIVN-NPLGLKDNLEKMY 474 >UniRef50_A3HZ09 FenI n=1 Tax=Algoriphagus sp. PR1 RepID=A3HZ09_9SPHI Length = 543 Score = 413 bits (1062), Expect = e-114, Method: Composition-based stats. Identities = 153/457 (33%), Positives = 225/457 (49%), Gaps = 54/457 (11%) Query: 9 KLTIRRPAILVALALLLCSCKSTP---PESMVTPPAGSKPPATT---------------- 49 K R I++A LLL +CKS+ P T P + P + Sbjct: 2 KFLSRHLFIILAFGLLLSACKSSKNVTPGQQPTAPIQTSPNTDSGTNLPVKTLPKTPIAL 61 Query: 50 -------QQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLG 102 + + RG+W+ATV+ +DWP +P Q++ ++ LD+ + L Sbjct: 62 APLSYQMPEMPREFRGVWIATVANIDWPI-------SPDDPYEKQKRDFLEILDYYKSLN 114 Query: 103 INTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYD--PLQFMLDEAHKRGMKVHAWF 160 N V QV+ G A +PS + PWS +TGK G+ P + PL +M+ E+H RGM+ HAW Sbjct: 115 FNAVIVQVRTAGDAFFPSNLAPWSKYLTGKQGKAPNTNENPLTWMIHESHARGMEFHAWL 174 Query: 161 NPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIV 220 NPYR +++ K T P Y HR+W+ G ++ +PG+PEVQ + ++ Sbjct: 175 NPYRATMDLK---------TDELSPDHDYNAHRNWMVKYGTKYYYNPGLPEVQTHLLKVI 225 Query: 221 AEVVSRYPVDGVQFDDYFYTES-PGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVS 279 E+V Y VD + FDDYFY D TY KY + ++ DWRR+N QLI ++ Sbjct: 226 KEIVDNYDVDAIHFDDYFYPYKIAREEFPDRNTYNKYKKSGQTQDDWRRDNVNQLIFALN 285 Query: 280 HTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTR-GAAAYDESYADTRRWVEQGLLDYIAP 338 +TIK KP V+FG+SP GVWRN+ DP GS T+ G YD+ YAD W++ G +DY+ P Sbjct: 286 NTIKQSKPWVQFGISPFGVWRNQDKDPKGSPTQAGQTNYDDLYADVLLWMKNGWVDYMIP 345 Query: 339 QIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELK 398 Q+YW A + +L WWA T +YIG YK+ E S + W E+ Sbjct: 346 QLYWSMEHPLASHRILNDWWA--TNHNYTNIYIGNGPYKIREDS--DKAW---ENPKEIN 398 Query: 399 KQLDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQS 435 Q+ +P I G F + K + + L+ Sbjct: 399 NQISYTRTLPTIQGNAFFSAKSM-KIKNRDVAQLLKG 434 >UniRef50_D1AYL2 Putative uncharacterized protein n=1 Tax=Streptobacillus moniliformis DSM 12112 RepID=D1AYL2_STRM9 Length = 437 Score = 413 bits (1061), Expect = e-114, Method: Composition-based stats. Identities = 138/381 (36%), Positives = 218/381 (57%), Gaps = 29/381 (7%) Query: 48 TTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVF 107 ++ ++ ++G+W ATV LD+P +S+ Q++ + + ++++++ G+N VF Sbjct: 69 ENRKINKNLKGVWAATVVNLDFPKTTSM---------EEQKREIDEMMENIKKWGLNAVF 119 Query: 108 FQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSV 167 F V+P AL+ S+ PWS +TG +PGYDPL++ + AHKRG+++HAW NPYR ++ Sbjct: 120 FHVRPAADALYNSEFEPWSIYLTGTQNRHPGYDPLEYAIKAAHKRGIELHAWINPYRAAM 179 Query: 168 NTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRY 227 NT + + S+ + +WI +F ++PG PEV ++++ + E+V +Y Sbjct: 180 NTDLNKLSDK---------SIVKRKPEWIFEYDGKFYMNPGNPEVVNYVSKAIEEIVEKY 230 Query: 228 PVDGVQFDDYFYTESPGS----RLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIK 283 +DG+ DDYFY + D + + KYG + S+ DWRR+N +I +S ++ Sbjct: 231 DIDGLHLDDYFYPYPSATLKLGDNVDQKEFEKYGSEYNSRGDWRRDNVNNMIKNLSVSVH 290 Query: 284 SIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWP 343 IKP + FGVSP G+WRN D GS T+G +YD YAD+ +W+++G +DYIAPQIYW Sbjct: 291 KIKPNLSFGVSPFGIWRNYETDARGSKTKGLQSYDSLYADSLKWMKEGWVDYIAPQIYWN 350 Query: 344 FSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDL 403 A Y+ L KWWA+ K T T LY+G YK EP + EL+KQL L Sbjct: 351 IGFEKADYEELVKWWAEKSKETNTPLYVGHGVYKYIEPKPWKDS-------KELEKQLKL 403 Query: 404 NDAVPEISGTILFREDYLNKP 424 N+ + G+I FR L + Sbjct: 404 NEKYDAVKGSIFFRYGTLLEN 424 >UniRef50_C7IM14 Putative uncharacterized protein n=1 Tax=Clostridium papyrosolvens DSM 2782 RepID=C7IM14_9CLOT Length = 999 Score = 410 bits (1054), Expect = e-113, Method: Composition-based stats. Identities = 147/440 (33%), Positives = 238/440 (54%), Gaps = 26/440 (5%) Query: 7 NKKLTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSR 66 NKK+ ++ L + + ++ G+ A T ++ +RG+W+A+V+ Sbjct: 2 NKKIVAVCILLVFLTILPIAGYRLFADKTY----EGNISNAQTVSKNEDLRGVWIASVAN 57 Query: 67 LDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWS 126 +D+P P A Q++ + + + + + +G+N +FFQV+P G AL+ S I PWS Sbjct: 58 IDFPSK-------PGISADKQKKELDEIISNTKYMGLNAIFFQVRPTGDALYKSSIFPWS 110 Query: 127 DLMTGKIGE--NPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQ 184 +TG+ G+ + G+DPL +++ +AHK G++VHAW NP R+++ T + ++ + Sbjct: 111 KYLTGQQGKENDGGFDPLAYIIKQAHKEGIQVHAWLNPLRLTMGTTAKPDKNVSVLSANH 170 Query: 185 PASVYVQHRDWIRTS-GDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYT--- 240 PA + D + + + LDPG P IT VAE+V Y VDG+ FDDYFY Sbjct: 171 PAR---KIPDAVVAAPTGQLYLDPGNPAAIKLITDGVAEIVKNYDVDGIHFDDYFYPSKS 227 Query: 241 ESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWR 300 E+ G ND+ ++ KY G F +K DWRRNN L+ T+K+IK V+FG+SP +W Sbjct: 228 ETKGVDFNDSASFAKYKGNFKNKDDWRRNNINTLVKNTYDTVKNIKNKVQFGISPFAIWS 287 Query: 301 NRSHDPLGSDTRG-AAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWA 359 N+ + GS T+G + Y + YAD+++WV + +DYIAPQIYW A Y VL WW Sbjct: 288 NKDRNIEGSSTQGGISTYYDHYADSKKWVREAYIDYIAPQIYWNMGFKIADYSVLVNWWK 347 Query: 360 DVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFRED 419 +V T+ +LY+G A YK+ + ++ DW+ +P KQ+ N ++G+I + Sbjct: 348 NVCSGTKVKLYVGHAAYKINDTTQ-SNDWLDPLQIP---KQIAYNRKSNAVAGSIFYGYA 403 Query: 420 YLNKPQTQQAVSYLQSRWGS 439 L T L+ + S Sbjct: 404 KLRD-NTLGIKDKLRGIFVS 422 >UniRef50_D2AR89 FenI protein n=9 Tax=Bacteria RepID=D2AR89_STRRD Length = 520 Score = 408 bits (1049), Expect = e-112, Method: Composition-based stats. Identities = 145/382 (37%), Positives = 214/382 (56%), Gaps = 21/382 (5%) Query: 52 SSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVK 111 + MRG+W+A+V ++WP P A Q+ + LD Q +N VF Q++ Sbjct: 43 PLRQMRGMWIASVVNINWPSK-------PGLTADQQKAEYLAWLDLAQVRKLNAVFVQIR 95 Query: 112 PDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKP 171 P A WPS PWS +TG G++PGYDPL F+++E HKRG+ HAWFNPYRVS+ P Sbjct: 96 PTADAFWPSPFEPWSQYLTGTQGQDPGYDPLAFVVEETHKRGLAFHAWFNPYRVSMQPDP 155 Query: 172 GTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDG 231 + P +H DWI G + +PG+PEV+ ++ + + V++Y +DG Sbjct: 156 SKL---------HPDHPGTKHPDWIVPYGGKLYYNPGMPEVRAFVQDAMMDAVTKYDIDG 206 Query: 232 VQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEF 291 + FDDYFY + + +D+ + KYG F A WRRNN L+ ++ ++ KP + + Sbjct: 207 LHFDDYFYPVN-TTAFDDSAAFAKYGQGFPDLAAWRRNNVDLLVQEMQQRVRQAKPEIAW 265 Query: 292 GVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARY 351 G+SP+G+WRN++ DPLGSDT G+ +YD +ADTR WV++G LDYIAPQ+YW +S A Y Sbjct: 266 GISPSGIWRNKTTDPLGSDTGGSQSYDNLHADTRGWVKKGWLDYIAPQLYWYIGQSNADY 325 Query: 352 DVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEIS 411 L WW+DV T T+L+IG A YK G + EL + L LN P++ Sbjct: 326 AKLVPWWSDVAAGTPTQLWIGQAAYKAGAAGQP----AQWFQPDELTRHLTLNRDHPQVG 381 Query: 412 GTILFREDYLNKPQTQQAVSYL 433 G I + + + + + Sbjct: 382 GDIWYNSGDVRDDRLGSVTTVV 403 >UniRef50_C5PKN1 Possible FenI n=1 Tax=Sphingobacterium spiritivorum ATCC 33861 RepID=C5PKN1_9SPHI Length = 508 Score = 406 bits (1042), Expect = e-111, Method: Composition-based stats. Identities = 135/378 (35%), Positives = 208/378 (55%), Gaps = 28/378 (7%) Query: 51 QSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQV 110 + +RG+W+ATV+ +DWP S + Q+Q +I+ LD QR G+N +FFQ+ Sbjct: 26 SPKRELRGVWIATVANIDWP-------SRDNESSERQKQELINILDAHQRAGLNAIFFQI 78 Query: 111 KPDGTALWPSKILPWSDLMTGKIGE--NPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVN 168 +P A + PWS +TG G+ +P YDPL+F+++EAHKRGM++HAW NPYR S Sbjct: 79 RPAADAFYAKGREPWSRYLTGVQGKAPSPFYDPLEFVIEEAHKRGMELHAWVNPYRASTT 138 Query: 169 TKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYP 228 P + + +W G +++ +PG+PEV+ +I ++ +VV Y Sbjct: 139 LNPAHFSK---------DHITRTKPEWFFKYGGKYLFNPGLPEVRQYIIDVIMDVVKNYD 189 Query: 229 VDGVQFDDYFYTESPG--SRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIK 286 VDG+ FDDYFY + L D T+ ++G FA+ DWRRNN LI + IK K Sbjct: 190 VDGIHFDDYFYPYPDARNTALPDAPTFHQFGKGFANIHDWRRNNVDLLIRDLGIAIKKEK 249 Query: 287 PGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSR 346 P +++G+SP G+W N+ +P GS+T G + Y YAD +W+++G +DYI PQIY+PF+ Sbjct: 250 PFIKYGISPFGIWDNKRDNPDGSNTSGLSGYRTLYADGVKWMKEGWIDYINPQIYFPFNN 309 Query: 347 SAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDA 406 AA +++L +WW Y+G Y+V ++ P W G +P+ + L Sbjct: 310 RAAAFEILLEWWEKHT--YGRHFYVGHGAYRV---TEKRPGWTDKGQIPKQVRHL---RD 361 Query: 407 VPEISGTILFREDYLNKP 424 E+ G+I F L Sbjct: 362 QHEVQGSIYFSSKSLMDN 379 >UniRef50_A5FI17 Putative uncharacterized protein n=1 Tax=Flavobacterium johnsoniae UW101 RepID=A5FI17_FLAJ1 Length = 523 Score = 405 bits (1040), Expect = e-111, Method: Composition-based stats. Identities = 137/377 (36%), Positives = 203/377 (53%), Gaps = 27/377 (7%) Query: 52 SSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVK 111 RG+W+ATV +DWP + N+ ++ ++ L+ ++L N V Q++ Sbjct: 30 PKNEFRGVWIATVVNIDWPKTAIDNV-------EKEKADYLEILNTYKKLNYNAVIVQIR 82 Query: 112 PDGTALWPSKILPWSDLMTGKIG--ENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNT 169 G A +PS+ PWS +TGK G NP YD L++M++EAH RG + HAW NPYR + + Sbjct: 83 SVGDAFYPSEFAPWSRFLTGKEGTAPNPYYDALEWMIEEAHNRGFEFHAWLNPYRATFDL 142 Query: 170 KPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPV 229 + P +H +W+ G ++ DP +PEVQ +T +V EVV +Y + Sbjct: 143 NKNLLS---------PNHDIFKHPEWMIEYGGKYYYDPALPEVQTHLTKVVKEVVDKYDI 193 Query: 230 DGVQFDDYFYTES-PGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPG 288 D + FDDYFY + PG ND +Y+KYG S ADWRR N + +S TIK+ KP Sbjct: 194 DAIHFDDYFYPYAVPGKVFNDTASYKKYGSGL-SLADWRRANVSNFVHTISTTIKASKPW 252 Query: 289 VEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSA 348 V+FG+SP GVWRN+S DP GS+T+ + YD+ YAD W++Q +DYI PQ+YW + Sbjct: 253 VQFGISPFGVWRNKSQDPKGSETQSTSNYDDLYADPVLWMDQKWIDYIMPQLYWSMNNPR 312 Query: 349 ARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVP 408 A Y L KWW++ T +YIG A YK+ + W E+ Q+D + Sbjct: 313 ASYSKLVKWWSE--NANNTAIYIGHASYKIR--GDGDKSWYF---ATEIPTQVDFARSFK 365 Query: 409 EISGTILFREDYLNKPQ 425 ++G+ F + Sbjct: 366 NVNGSAYFSAKWFMSKN 382 >UniRef50_D2QEX0 Putative uncharacterized protein n=2 Tax=Flexibacteraceae RepID=D2QEX0_9SPHI Length = 570 Score = 404 bits (1039), Expect = e-111, Method: Composition-based stats. Identities = 143/418 (34%), Positives = 220/418 (52%), Gaps = 29/418 (6%) Query: 23 LLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTS 82 +L+ + + P + ++ P + R +W+ATV+ +DWP + +++ Sbjct: 62 VLVNAVDTLPFDDTPEQILAARGPIP---PKREFRAVWVATVNNIDWPSKKGLPVAD--- 115 Query: 83 RARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIG--ENPGYD 140 QQ+ ++ D Q++G+N V QV+ A + PWS+ +TG+ G P YD Sbjct: 116 ----QQREIVAMFDQHQQMGLNAVVVQVRSAADAFYARGSEPWSEWLTGQQGLAPEPFYD 171 Query: 141 PLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSG 200 PL+FM+D+AH RG++ HAWFN R + + T S P+++ + +W+ G Sbjct: 172 PLEFMIDQAHGRGLEFHAWFNLDRATFS----------KTASVAPSNIVNRKPEWMLMYG 221 Query: 201 DRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTES-PGSRLNDNETYRKYGGA 259 R + + GIP V+ +I IVA VV Y VDG+ FDDYFY + PG L D+ TY+ Sbjct: 222 GRKLFNLGIPAVRSYIAGIVANVVREYDVDGIHFDDYFYPYAEPGQVLRDDSTYKANSNG 281 Query: 260 FASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDE 319 SK DWRR+N +L+ ++ +I++ KP V+FG+SP G+W+N+S DP GS T G AY E Sbjct: 282 M-SKPDWRRDNVTKLVKELRDSIRANKPWVKFGISPFGIWKNKSSDPEGSATNGGQAYYE 340 Query: 320 SYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVG 379 YADTR+WV +GL+DY+ PQ+Y+ S Y L WW LYIG Y+VG Sbjct: 341 LYADTRKWVREGLIDYVVPQVYFSSEFSKVPYKTLVDWWTRNCT-ENCHLYIGHGAYRVG 399 Query: 380 EPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQSRW 437 S+ +P W E Q+ N + G++ F L + LQ+ + Sbjct: 400 RGSERDPGWWRPT---EFPDQMRYNRQQQVVKGSVFFSAKNL-QINPLSIRDSLQTNF 453 >UniRef50_A4ASW6 FenI n=1 Tax=Flavobacteriales bacterium HTCC2170 RepID=A4ASW6_9FLAO Length = 507 Score = 404 bits (1037), Expect = e-111, Method: Composition-based stats. Identities = 134/430 (31%), Positives = 214/430 (49%), Gaps = 43/430 (10%) Query: 12 IRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPP 71 +++ + + L ++ SC + P Q RG+W+ATV +DWP Sbjct: 2 LKKISHYLLLLIIFNSCDAIKP---------------IPQPRTEFRGVWVATVVNIDWPK 46 Query: 72 VSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTG 131 N Q+ + L+ +L NTV QV+ G + + SK PWS +TG Sbjct: 47 -------NGLDAIEKQKADFLKILEFYDQLNFNTVIVQVRTAGDSFYDSKYAPWSRFLTG 99 Query: 132 KIGENP--GYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVY 189 G++ +D L +M+D+ H RG + HAW NPYR + + K + + + Sbjct: 100 TEGKSTEGHFDMLNWMIDQTHNRGFEFHAWLNPYRATFDLKTDVLSATHD---------F 150 Query: 190 VQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSR-LN 248 H +W+ G+++ +PG+PEV++ + SI+ EVV++Y +D + FDDYFY N Sbjct: 151 NLHPEWMLKYGNKYYYNPGLPEVRERLASIMGEVVTKYDIDAIHFDDYFYPYRIKDEIFN 210 Query: 249 DNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLG 308 D+ Y + + + +WRR+N L+ + T+K+IKP V+FG+SP GVW+N+S DP G Sbjct: 211 DSLAYNYHSFSGQTVENWRRSNIDSLVKNIHSTVKNIKPWVQFGISPFGVWKNKSTDPRG 270 Query: 309 SDTR-GAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRT 367 SDT+ G Y++ YAD W+ +G +DY+ PQ+YW A + + WW++ T Sbjct: 271 SDTKAGQTTYEDLYADPLTWMNEGWIDYLVPQVYWSMDLPVASHKKIVNWWSN--NSVNT 328 Query: 368 RLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQ 427 LYIG YK+ S + W E+ QL L ++ G +LF L Sbjct: 329 NLYIGNGAYKIRSNS--DKAWD---DKKEMPNQLKLARKDSKVQGNVLFSAKSLMN-DNP 382 Query: 428 QAVSYLQSRW 437 V YL+ R+ Sbjct: 383 DVVEYLKRRF 392 >UniRef50_C6XWP7 Putative uncharacterized protein n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6XWP7_PEDHD Length = 519 Score = 394 bits (1013), Expect = e-108, Method: Composition-based stats. Identities = 140/381 (36%), Positives = 202/381 (53%), Gaps = 26/381 (6%) Query: 36 MVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKL 95 ++TP + + + RG+W+ATV+ +DWP +NI Q+Q +I L Sbjct: 13 IITPISLIAQSPSKIAPKREFRGVWVATVANIDWPSKPGLNIDQ-------QKQELIGLL 65 Query: 96 DHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIG--ENPGYDPLQFMLDEAHKRG 153 + + G+N + QV+P A + PWS + GK G PGYDPL F + EAH RG Sbjct: 66 EQHKANGMNAIILQVRPAADAFYLKSREPWSQWLMGKQGMAPAPGYDPLAFAIKEAHSRG 125 Query: 154 MKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQ 213 M++HAWFNPYR +++ P + + D G + DPGIPEV+ Sbjct: 126 MELHAWFNPYRATMSAS----------AVVSPDHMTRKRPDLFFVYGGKKQFDPGIPEVR 175 Query: 214 DWITSIVAEVVSRYPVDGVQFDDYFYTES-PGSRLNDNETYRKYGGAFASKADWRRNNTQ 272 ++I ++ +VV Y VDG+ FDDYFY G +ND T+ KY F++ ADWRRNN Sbjct: 176 EYIVQVILDVVKGYDVDGIHFDDYFYPYKIAGQNINDAATFNKYPNGFSNIADWRRNNVD 235 Query: 273 QLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGL 332 LI ++ +I K V+FGVSP G+W+N S D LGS T G + Y E YAD+R+WV++G Sbjct: 236 LLIKQLDDSIHHYKKYVKFGVSPFGIWKNLSEDSLGSATNGLSNYAELYADSRKWVKEGW 295 Query: 333 LDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPS-KIEPDWMIN 391 +DYI PQ+Y+ F+R AA + +A WW + +YIG Y + S + E W Sbjct: 296 VDYINPQVYFSFTRRAAPFATIADWWTN--NAFGRHVYIGHGAYLIHNGSTRKEAAWAFP 353 Query: 392 GGVPELKKQLDLNDAVPEISG 412 +P Q+ I G Sbjct: 354 NQIP---NQIRHIRGSNLIQG 371 >UniRef50_A1ZQ43 YngK protein n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZQ43_9SPHI Length = 517 Score = 392 bits (1008), Expect = e-108, Method: Composition-based stats. Identities = 142/392 (36%), Positives = 206/392 (52%), Gaps = 28/392 (7%) Query: 47 ATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTV 106 T ++ + R +WL T +D+P S +Q +I LD Q+ GIN + Sbjct: 35 VTKRKLKREFRAVWLTTFDHMDFPKEKGAPPSE-------HKQELIKLLDFHQKSGINAI 87 Query: 107 FFQVKPDGTALWPSKILPWSDLMTGKIGENPG--YDPLQFMLDEAHKRGMKVHAWFNPYR 164 FFQV+P A + S+I WS +TGK G+ P +DPL+F++ E HKR +++HAW NPYR Sbjct: 88 FFQVRPAADAFYKSEIELWSQWLTGKQGKAPEPLWDPLEFLVTECHKRNIELHAWINPYR 147 Query: 165 VSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVV 224 N K + P + +H +W G +PGIP V+ ++ ++VA++ Sbjct: 148 AVYNIKHD---------ATAPNHITKRHPEWFVVYGKHKQFNPGIPAVRHYLKAVVADIA 198 Query: 225 SRYPVDGVQFDDYFYTESPGS-RLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIK 283 RY +DG+ FDDYFY G D T+ K+GG WRR N LI +V T++ Sbjct: 199 QRYDIDGIHFDDYFYPYKKGRLEFPDQSTFMKHGGNSKDVHHWRRQNVNSLIKEVHDTLQ 258 Query: 284 SIKPGVEFGVSPAGVWRNRSHDPLGSDTR-GAAAYDESYADTRRWVEQGLLDYIAPQIYW 342 SIKP ++FG+SP GVWRN+S DP GSDT+ G ++YD YAD +W+ +G +DY+ PQ+YW Sbjct: 259 SIKPYLKFGISPLGVWRNKSEDPNGSDTQVGQSSYDYLYADVLKWLRKGWIDYLVPQLYW 318 Query: 343 PFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLD 402 A + LA WWA +YIG AFYK+ + W V EL Q+ Sbjct: 319 SIEHPRASFKSLAFWWAK--HAYSRHIYIGHAFYKIKNDK--DDHWK---QVSELPNQVR 371 Query: 403 LNDAVPEISGTILFREDYLNKPQTQQAVSYLQ 434 + I G FR D+L + + L+ Sbjct: 372 MTRQYRSILGNAYFRSDFL-QKNPAKVTDTLR 402 >UniRef50_UPI00016C0313 cell surface protein n=1 Tax=Epulopiscium sp. 'N.t. morphotype B' RepID=UPI00016C0313 Length = 539 Score = 392 bits (1006), Expect = e-107, Method: Composition-based stats. Identities = 153/440 (34%), Positives = 225/440 (51%), Gaps = 42/440 (9%) Query: 7 NKKLT--IRRPAILVALALLLCSCKSTPPESMV---TPPAGSKPPATTQQSSQPMRGIWL 61 NKK I +I+ A +LL K P + P + ++ +R +W+ Sbjct: 2 NKKFIAWILGGSIVTAAVILLLVPKELPSMKFIPNKNSLNKKNPITPLRPQNEEVRAVWI 61 Query: 62 ATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSK 121 +++ LD+P +S+N +NP QQ I LD LQ +G NTV QV+P AL+ S Sbjct: 62 SSIWGLDFP-YNSINRNNPA----AQQAEFISYLDELQEIGFNTVMVQVRPSADALYKSA 116 Query: 122 ILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTL 181 I PW+ ++TG G++PGYDPL FM+D+ HKRGMK+HAW NPYRV+ K +++ + Sbjct: 117 INPWAAILTGTQGQDPGYDPLAFMIDQTHKRGMKLHAWINPYRVTTAGKG-----IDTLV 171 Query: 182 SQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTE 241 + PA + + D + + + P + V+ I V E+V+ Y VDG+ DDYFY Sbjct: 172 ATHPARL---NPDMLISHKNALYYXPELDAVKSHIEETVKEIVTNYSVDGIHMDDYFYPA 228 Query: 242 -SPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWR 300 P D A RRN+ ++ ++ IK IKP VEFG+SP G+W+ Sbjct: 229 WYPLPAGED---------GNGKTATTRRNHVNDMVKRIHTAIKQIKPNVEFGISPIGIWK 279 Query: 301 NRSHDPLGSDTR-GAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWA 359 + D GS+T G +Y YADTR W++ +DY+ PQIYW A Y+VL KWWA Sbjct: 280 DSITDITGSETSAGWNSYYAVYADTRAWIQNEWIDYVVPQIYWEIDNPVASYEVLVKWWA 339 Query: 360 DVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFRED 419 + VK T LYIG YK + E+ Q+ LND PEI G++ F Sbjct: 340 EEVKNTNVDLYIGQGIYK-------------DAVAEEITTQILLNDLYPEIKGSVXFAIS 386 Query: 420 YLNKPQTQQAVSYLQSRWGS 439 + + T L++ +G+ Sbjct: 387 DIIRKNTGNVRGQLEALFGT 406 >UniRef50_B1HPQ3 Hypothetical lipoprotein yddW n=2 Tax=Bacillaceae RepID=B1HPQ3_LYSSC Length = 522 Score = 391 bits (1004), Expect = e-107, Method: Composition-based stats. Identities = 161/417 (38%), Positives = 225/417 (53%), Gaps = 38/417 (9%) Query: 17 ILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVN 76 ++ +A++L C S P + V A+T Q + MR +W++TV LD + +N Sbjct: 10 LIALVAMILMLCLSAIPANTVK--------ASTTQPKREMRAVWISTVLNLDM--KAGMN 59 Query: 77 ISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGK-IGE 135 T AR LD L+ NTV +QV+P A++ S++ PWS +TGK G Sbjct: 60 KEQYTVWAR-------QTLDQLKANKFNTVIYQVRPTNDAMYASELAPWSSYITGKKQGT 112 Query: 136 NPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDW 195 NPGYDPL +++E+HKRGM++HAW NPYRV+++ + P +V + H +W Sbjct: 113 NPGYDPLTILVEESHKRGMELHAWMNPYRVTMSGQ--------KLTDLAPDNVAITHPNW 164 Query: 196 IRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTES-PGSRLNDNETYR 254 + G ++ L+PG+PEVQD++ IV E+V+ Y VD V DDYFY D Y+ Sbjct: 165 VVKYGKQYYLNPGLPEVQDYLVEIVRELVANYDVDAVHMDDYFYPYKIANEVFPDQAAYK 224 Query: 255 KYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTR-G 313 KYG +F DWRRNN +L+ + IK KP V+FG+SP GVWRN+S D GSDTR G Sbjct: 225 KYGASFNKVEDWRRNNVNRLVENLYTAIKETKPYVQFGISPFGVWRNKSLDKTGSDTRAG 284 Query: 314 AAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVV----KPTRTRL 369 YD+ YAD R W++ G +DYI PQIYW + S A+Y L WW+ V K L Sbjct: 285 VNNYDDLYADVRTWIQNGTIDYITPQIYWSRTLSVAKYGTLLDWWSHEVQTYAKMHPVHL 344 Query: 370 YIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVP-EISGTILFREDYLNKPQ 425 YIG+A YKVG S + W EL Q+ N + G + F + Sbjct: 345 YIGLADYKVGNDS--DAAWK---NKMELPSQILENRSEKVAADGQMHFSLRSFQSNK 396 >UniRef50_C7PIN2 Putative uncharacterized protein n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PIN2_CHIPD Length = 509 Score = 390 bits (1002), Expect = e-107, Method: Composition-based stats. Identities = 139/389 (35%), Positives = 196/389 (50%), Gaps = 34/389 (8%) Query: 52 SSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVK 111 + R +W+ATV +DWP + Q+Q I+ LD QR G+N V Q++ Sbjct: 25 PKREFRAVWIATVENIDWPSRKGL-------PVETQKQEFINLLDKHQRNGMNAVIVQIR 77 Query: 112 PDGTALWPSKILPWSDLMTGKIGE--NPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNT 169 P A + S PWS+ ++G G+ NP YDPL+FML+E HKRGM+ HAWFNPYR + Sbjct: 78 PAADAFYDSPFEPWSEYLSGVQGQAPNPYYDPLRFMLEETHKRGMEFHAWFNPYRAVIRN 137 Query: 170 KPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPV 229 + W + DPGIPEV++++T I+ +VV RY + Sbjct: 138 ASA-------------NHISRMRPQWFVNFDGKKYFDPGIPEVREYVTQIIRDVVRRYDI 184 Query: 230 DGVQFDDYFYTES-PGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPG 288 D V FDDYFY PG DN +YR+YG K DWRR N +I VS IK KP Sbjct: 185 DAVHFDDYFYPYPVPGREFGDNNSYRQYGRNM-MKDDWRRWNVDTIIQMVSKMIKEEKPW 243 Query: 289 VEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSA 348 V+FG+SP G+WRN++ D GS T G + YD+ YAD R+W++ G +DY+APQ+YW Sbjct: 244 VKFGISPFGIWRNKNKDQDGSYTTGLSNYDDLYADVRKWLQNGWIDYVAPQLYWERGHRV 303 Query: 349 ARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVP 408 A Y++L WWA +YIG Y++ + EL Q+ + Sbjct: 304 ANYELLLNWWAQ--HGYGRNVYIGHGVYRLRSNAAWSI-------PNELPVQITEVRTLN 354 Query: 409 EISGTILFREDYLNKPQTQQAVSYLQSRW 437 I G+ + N L++ + Sbjct: 355 TIQGSAFYSAKSFN-GNPLGIEDSLRNHF 382 >UniRef50_A6NVH8 Putative uncharacterized protein n=1 Tax=Bacteroides capillosus ATCC 29799 RepID=A6NVH8_9BACE Length = 606 Score = 389 bits (998), Expect = e-106, Method: Composition-based stats. Identities = 139/400 (34%), Positives = 222/400 (55%), Gaps = 31/400 (7%) Query: 43 SKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLG 102 ++ + R +W+ATV LD+P ++ + A + + L++ +G Sbjct: 34 AQQANAPSAARDDFRAVWVATVYNLDYPNAATTD-------ADALKAQADEILENCVDMG 86 Query: 103 INTVFFQVKPDGTALWPSKILPWSDLMTGKIGENP--GYDPLQFMLDEAHKRGMKVHAWF 160 +N V QV+P G AL+PS++ PWS +TG G P +DPL + ++ AH+ G+++HAW Sbjct: 87 MNAVILQVRPSGDALYPSELFPWSKYLTGASGLAPEDNFDPLAYWVERAHELGLELHAWI 146 Query: 161 NPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIV 220 NP+R++ G EL + ++ PA VQH +W+ + L+PG+PEV++ + Sbjct: 147 NPFRIT----KGGEAELAALDAKSPA---VQHPEWVVECDGNYYLNPGLPEVRELVIQGA 199 Query: 221 AEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSH 280 E+V Y VDGV DDYFY P ND+ +++YGG F + DWRR+N QLI + Sbjct: 200 EELVRNYDVDGVHLDDYFY---PSRSFNDDAAFQQYGGDFDNIGDWRRDNVNQLIQGLDQ 256 Query: 281 TIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRG-AAAYDESYADTRRWVEQGLLDYIAPQ 339 + ++ P + FGVSP+GVW + +H GS T G +Y +YAD+R+WV++G +DYI PQ Sbjct: 257 RLHALDPELSFGVSPSGVWADSTHQSAGSATTGNYESYYAAYADSRKWVKEGWVDYICPQ 316 Query: 340 IYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKK 399 IYW Y+ +A+WW+D V+ T +LYIG+A Y + ++ P G+ + K Sbjct: 317 IYWYIGHPTMDYETIARWWSDTVEGTGVKLYIGMADYLADDGTEGSP----WNGLDAITK 372 Query: 400 QLDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQSRWGS 439 QL LN + +SG + FR +L S L+ + + Sbjct: 373 QLTLNREL-GVSGEVHFRYKFL------AVNSNLKRLYET 405 >UniRef50_A5FAG6 Putative uncharacterized protein n=1 Tax=Flavobacterium johnsoniae UW101 RepID=A5FAG6_FLAJ1 Length = 493 Score = 384 bits (987), Expect = e-105, Method: Composition-based stats. Identities = 138/395 (34%), Positives = 205/395 (51%), Gaps = 31/395 (7%) Query: 48 TTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVF 107 + + + MR W++TV +DWP P + + MI LD+L+ +NTV Sbjct: 19 SQESPKREMRAAWISTVDNIDWPSK-------PGLSDKQMKSEMIAILDNLRSNNLNTVI 71 Query: 108 FQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSV 167 FQ++P A + S P S +TG G PG+DPLQ M+DEA KRGM VH W NPYRV Sbjct: 72 FQIRPTADAYYKSTKEPASHWITGTQGVAPGFDPLQMMIDEAGKRGMNVHVWLNPYRVQK 131 Query: 168 NTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRY 227 +T + + +Y + + T G +PG E +D+++S+V E+V Y Sbjct: 132 DTVKDVLTKT---------HLYFKKPELFLTYGKSRYFNPGYKETRDFVSSVVGEIVRNY 182 Query: 228 PVDGVQFDDYFYTES-PGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIK 286 + V DDYFY G D + + K F K DWRR+N +I ++ TI + K Sbjct: 183 DIQAVHMDDYFYPYKIAGQEFPDEKAFAKEPRQFKDKDDWRRDNVDLIIKQIRDTIIANK 242 Query: 287 PGVEFGVSPAGVWRNRSHDPLGSDT-RGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFS 345 P VEFG+SP GVWRN + D GS+T GA YD+ YA+ +W ++ +DY+ PQ+YW Sbjct: 243 PEVEFGISPFGVWRNIAKDSDGSNTVAGATNYDDLYANILKWQKENWIDYVTPQLYWHIG 302 Query: 346 RSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLND 405 A ++VLAKWWA T +Y+G YK+ + EP+W ++ KQ+++ Sbjct: 303 FDRANFEVLAKWWA--AHKYGTNVYVGHGDYKI-SNTAKEPEWR---SPDQIVKQIEMIR 356 Query: 406 AVPEISGTILFRE-------DYLNKPQTQQAVSYL 433 +P+I G++ F D L P Q+ Y+ Sbjct: 357 KLPQIDGSMHFTASTFLKKGDTLRNPLIQKPYKYI 391 >UniRef50_A9NEW1 Putative uncharacterized protein n=1 Tax=Acholeplasma laidlawii PG-8A RepID=A9NEW1_ACHLI Length = 1328 Score = 382 bits (981), Expect = e-104, Method: Composition-based stats. Identities = 132/396 (33%), Positives = 203/396 (51%), Gaps = 23/396 (5%) Query: 36 MVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKL 95 P T + + +R +W+ATV+ +D I+ + A + +I L Sbjct: 880 YYQTNTPVALPTTYTEKDKEIRAVWVATVANID--------ITQYDNEAN-YKNQIISIL 930 Query: 96 DHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMK 155 + ++ L NT+FFQ +P + +PS+ P S ++G G G+D L+F++ EAH RG++ Sbjct: 931 ERMKELKFNTMFFQTRPMNDSFYPSEYAPMSRFLSGTEGVGVGWDVLEFLITEAHARGIE 990 Query: 156 VHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRT-SGDRFVLDPGIPEVQD 214 VHAW NPYRV T + ++ Q+ ++ G +L+PGIPEV+ Sbjct: 991 VHAWMNPYRV---ASGSTASIEDQLALLHDSNFAKQNPSYVVQDKGGALILNPGIPEVRQ 1047 Query: 215 WITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQL 274 ++ +IV E++ Y +DGV FDDYFY+ S D + + Y S+ DWRR N Sbjct: 1048 YLYNIVDEIMENYAIDGVHFDDYFYSYSGTEDSQDADAFLNYNPNNLSRDDWRRENVNMF 1107 Query: 275 IAKVSHTIKSIKP----GVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQ 330 + + +++ V+FG+SP G+WRN++ D LGS+++G ++Y YAD+R+WV++ Sbjct: 1108 VKTIYERVEAHNEANDMHVKFGISPFGIWRNKTQDALGSNSQGLSSYSAQYADSRKWVKE 1167 Query: 331 GLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMI 390 G L YI PQ+YW F S AR+ L WW DVVK T L IG FY+ E S Sbjct: 1168 GWLHYIIPQLYWQFDHSTARFADLVDWWVDVVKDTNVDLIIGQGFYRYAENSN------N 1221 Query: 391 NGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQT 426 E +QL EI G+ +F LN Sbjct: 1222 WTNESEFLEQLRYMSQYDEIIGSSIFSYKTLNSNHA 1257 Score = 270 bits (690), Expect = 8e-71, Method: Composition-based stats. Identities = 103/408 (25%), Positives = 170/408 (41%), Gaps = 43/408 (10%) Query: 38 TPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDH 97 T A + P S R W+ P S+ + + + L++ Sbjct: 20 TNEAHAYPEPEVLTSPGEFRATWITHFIGS-MPAYST---------EQDFKSEVNSILNN 69 Query: 98 LQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVH 157 ++ +N + AL+ S+I P + +DP+ + ++EAHKRG++ H Sbjct: 70 MEANNLNVAIVHFRTHNNALYKSEINPVASWFATVD--FDVFDPMAYFIEEAHKRGIEFH 127 Query: 158 AWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWIT 217 AW NPYRV + GTI N PA++ G +L+P +P V++ + Sbjct: 128 AWLNPYRVLSTYQRGTIPASNP--QSNPANLLSNK------EGTAHILNPALPVVREHVV 179 Query: 218 SIVAEVVSRYPVDGVQFDDYFYTESPGSRL---NDNETYRKYGGAFAS----KADWRRNN 270 + + E++ Y VD + FDDYFY E + D + + K++WRR Sbjct: 180 NTILEIIENYNVDAIHFDDYFYMEMNNGGILNDPDQALFLSNPLGQPNTVAGKSNWRRTQ 239 Query: 271 TQQLIAKVSHTIK----SIKPGVEFGVSPAGVWRNRSHD----------PLGSDTRGAAA 316 I + S IK + V+FG+SP G++RN + GS T+G Sbjct: 240 INTFIEQASQAIKDFNQANNRYVQFGISPTGIYRNGDGEVTYDQDGKPITNGSKTQGQEH 299 Query: 317 YDES-YADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAF 375 Y +ADT W+ +G LDYI PQ YW + S A +D + WW VV+ LY GI Sbjct: 300 YASYLFADTVHWISEGWLDYILPQSYWASTHSLAGFDKVMGWWDKVVRYLDVNLYSGIGL 359 Query: 376 YKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNK 423 Y + + + E Q++ ++ G ++ + + Sbjct: 360 Y-LADAGIASNVYSWRDNPEEFSNQMEFLHSLESNQGFSIYSYNMIRD 406 >UniRef50_C9L341 YngK protein n=45 Tax=Bacteroidales RepID=C9L341_9BACE Length = 528 Score = 381 bits (979), Expect = e-104, Method: Composition-based stats. Identities = 143/386 (37%), Positives = 208/386 (53%), Gaps = 31/386 (8%) Query: 43 SKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLG 102 ++ P+ + + RG W+ V N +Q +ID+L+ LQ G Sbjct: 46 AQVPSGNKYPKREFRGAWIQAV-----------NGQFKGIPTGKLKQTLIDQLNSLQGAG 94 Query: 103 INTVFFQVKPDGTALWPSKILPWSDLMTGKIGE--NPGYDPLQFMLDEAHKRGMKVHAWF 160 IN + FQV+P+ AL+ S+ PWS +TG G+ +P +DP+QFM++E KR M+ HAW Sbjct: 95 INAIIFQVRPEADALYASQHEPWSRFLTGTQGQIPSPMWDPMQFMIEECRKRNMEFHAWI 154 Query: 161 NPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIV 220 NPYRV + K P +Y QH +W T GD+ DP +PE +D+I IV Sbjct: 155 NPYRVKTSLKNQL----------APEHIYHQHPEWFVTYGDQLYFDPALPESRDYICKIV 204 Query: 221 AEVVSRYPVDGVQFDDYFYTES-PGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVS 279 ++VSRY VD + DDYFY G D+ ++ +YGG F +KADWRR+N LI K+ Sbjct: 205 TDIVSRYDVDAIHMDDYFYPYPVKGMDFPDDASFARYGGGFTNKADWRRSNVNVLIKKLH 264 Query: 280 HTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQ 339 TI+++KP V+FG+SP G++RN+ DPLGSDT G YD+ YAD W +G +DY PQ Sbjct: 265 ETIRAVKPWVKFGISPFGIYRNQKSDPLGSDTNGLQNYDDLYADVLLWAREGWIDYNIPQ 324 Query: 340 IYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKK 399 IYW AA Y+ L KWWA L+IG + + I+ N + +L + Sbjct: 325 IYWEIGHKAADYETLVKWWA--THSENRPLFIGQSV-----SNTIQHADPKNPSINQLPR 377 Query: 400 QLDLNDAVPEISGTILFREDYLNKPQ 425 ++ L A I G+ + + + Q Sbjct: 378 KMALQRAYQTIGGSCQWYASAVVENQ 403 >UniRef50_UPI00016C4E90 hypothetical protein GobsU_27726 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4E90 Length = 481 Score = 377 bits (967), Expect = e-103, Method: Composition-based stats. Identities = 122/400 (30%), Positives = 186/400 (46%), Gaps = 37/400 (9%) Query: 47 ATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTV 106 A + R +W+ATVS +DWP P A Q++ ++ LD+ L +N V Sbjct: 20 ADPPALKREFRAVWVATVSNIDWPSK-------PGLPADQQKKELLAILDNAVELKLNAV 72 Query: 107 FFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVS 166 FQV+P AL+ S++ PWS+ +TG+IG+ PGYDPL F + EAHKRG+++HAWFNPYR Sbjct: 73 IFQVRPMADALYASELEPWSEYLTGQIGKAPGYDPLAFAVTEAHKRGLELHAWFNPYRAR 132 Query: 167 VNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSR 226 + + + D + G ++P PEVQ+ + +VV R Sbjct: 133 HPSAKSPAPA---------DHLTRKRPDLAKPYGTHAWMNPTNPEVQEHSLRVFLDVVKR 183 Query: 227 YPVDGVQFDDYFYTESPGSR------LNDNETYRKY--GGAFASKADWRRNNTQQLIAKV 278 Y +DG+ DDYFY D++T+ Y G S+ DWRR+ + ++ Sbjct: 184 YDIDGIHIDDYFYPYKEKGTDGKVIPFPDDDTWEAYQKQGGKLSRDDWRRDAVNVFVRRM 243 Query: 279 SHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAP 338 K KP V+ G+SP G+WR P G G Y E YAD + W +G +DY P Sbjct: 244 YEETKKAKPWVKVGISPFGIWR--PGHPAG--IAGLDQYAELYADAKLWFNEGWVDYFTP 299 Query: 339 QIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELK 398 Q+YWP ++ + L WWA + L+ G+ +V +K E+ Sbjct: 300 QLYWPIAQEKQSFPKLLDWWAGE-NTKKRHLWPGLYTSRVTGAAKG-------WNAKEIA 351 Query: 399 KQLDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQSRWG 438 Q+ + + G + F L + T L+ + Sbjct: 352 DQIAVTRQRSDTDGAVHFSAKALVR-NTGGIADELKQVYA 390 >UniRef50_D0GIS1 YngK n=16 Tax=Bacteria RepID=D0GIS1_9FUSO Length = 330 Score = 376 bits (964), Expect = e-102, Method: Composition-based stats. Identities = 145/343 (42%), Positives = 196/343 (57%), Gaps = 23/343 (6%) Query: 94 KLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRG 153 L+++++ +N VF Q+KP G A +PSK PWS+ +TG GENPGYDPL+FM++EAHKR Sbjct: 1 MLENVKKWNMNAVFVQIKPVGDAFYPSKYAPWSEYLTGVQGENPGYDPLKFMIEEAHKRN 60 Query: 154 MKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQ 213 ++ HAWFNPYR+++ + N + + +W G + L+PGIPEV Sbjct: 61 IEFHAWFNPYRLTMGGGREKLSRDN---------IGNKRPEWTVMYGGKLYLNPGIPEVN 111 Query: 214 DWITSIVAEVVSRYPVDGVQFDDYFYTES-PGSRLNDNETYRKYGGAFASKADWRRNNTQ 272 D++ + EVV +Y VDGV DDYFY G D++ YRKYGG F++ DWRRNN Sbjct: 112 DYVVDSIVEVVKKYDVDGVHMDDYFYPYKVKGQEYPDSQQYRKYGGKFSNIGDWRRNNIN 171 Query: 273 QLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDP-LGSDTR-GAAAYDESYADTRRWVEQ 330 +LI K+ ++IK V FG+SP GVWRN S DP GS T+ G YD+ YAD W+++ Sbjct: 172 KLIEKLHNSIKKENKNVSFGISPFGVWRNASTDPVRGSQTQAGVQNYDDLYADILYWMDK 231 Query: 331 GLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMI 390 +DY+APQIYW A Y L WW+ T T LYIG A YKV + Sbjct: 232 HWIDYVAPQIYWVRGFKVADYSTLINWWSKYAGKTNTDLYIGHAAYKVND---------- 281 Query: 391 NGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAVSYL 433 EL +Q+ LN PEI G+I F L P + + L Sbjct: 282 WSNPNELVEQVKLNRKYPEIKGSIFFSYKSL-VPNPKNVTNNL 323 >UniRef50_B4VZ35 Putative uncharacterized protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VZ35_9CYAN Length = 665 Score = 374 bits (961), Expect = e-102, Method: Composition-based stats. Identities = 124/414 (29%), Positives = 210/414 (50%), Gaps = 37/414 (8%) Query: 19 VALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNIS 78 VA + + +++ + + + RG+W+ATV +DWPP ++++ Sbjct: 177 VAAFIYQALVYTGRLDAIGSDYIIVRQKTLKLSHQREFRGVWVATVWNIDWPPQRGLSVA 236 Query: 79 NPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGE--N 136 QQ+ ++ +D + L +N + QV+P G A + S++ PWS+ +TG G+ + Sbjct: 237 Q-------QQRELLQIIDRMAELQLNALILQVRPTGDAFYASELEPWSEWLTGVQGQAPD 289 Query: 137 PGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWI 196 P YDPL+F + H+R +++HAWFNPYR ++ + V H +++ Sbjct: 290 PYYDPLEFAIAACHQRNIELHAWFNPYRAKTSSHSSASVA---------PHISVTHPEYV 340 Query: 197 RTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTES-PGSRLNDNETYRK 255 G++ +DPG+ VQD +++ +VV RY VDG+ DDYFY G D++TY Sbjct: 341 YKYGNQQWMDPGVKVVQDLTYNVIMDVVRRYDVDGIHLDDYFYPYPIAGEDFPDDKTYNA 400 Query: 256 YG--GAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRG 313 Y G S +DWRR N Q++ ++ I++ K V+FG+SP G++R +G Sbjct: 401 YQAEGGTLSLSDWRRENVNQMVQRLYKGIQATKKQVKFGISPFGIYRPGQPP----QIKG 456 Query: 314 AAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGI 373 Y+ YAD ++W+E G +DYIAPQ+YW A Y VL +WW D P + +Y G Sbjct: 457 LDQYESLYADPKKWLEAGWIDYIAPQLYWRIDPPAQSYPVLLEWWTD-NNPKQRHIYPGN 515 Query: 374 AFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAV-PEIS-GTILFREDYLNKPQ 425 + + DW E ++Q+D+ + P++S G I + N+ + Sbjct: 516 RLSMLD-----DKDW----SFLEYERQVDITRNLAPQLSLGNIFYNMKVFNENR 560 >UniRef50_D1N426 Putative uncharacterized protein n=1 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1N426_9BACT Length = 450 Score = 374 bits (959), Expect = e-102, Method: Composition-based stats. Identities = 151/401 (37%), Positives = 213/401 (53%), Gaps = 35/401 (8%) Query: 51 QSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQV 110 Q ++ MRG+W+ATV +D+ + A ++ I +++LQR N +FFQV Sbjct: 58 QRAREMRGVWVATVENIDFGRHT---------DAAGFKRDFIAVVNNLQRAKFNAIFFQV 108 Query: 111 KPDGTALWPSKILPWSDLMTGKIGEN-PGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNT 169 +P A +PSK PWS MTGK G+ P +DPL FM+ EAHKRG++ HAW NPYRV+ Sbjct: 109 RPMCDAFYPSKHNPWSRWMTGKEGQAIPNFDPLAFMVAEAHKRGLEFHAWLNPYRVNAGA 168 Query: 170 KPGTIRELNSTLSQQPASVYVQHRDWIRTSG-----DRFVLDPGIPEVQDWITSIVAEVV 224 + G L + ++ S ++ + S L+PG P V I +AE++ Sbjct: 169 QVGKTAYLKTLDNK---SFAKRNPGLVLESKLASGRYSLFLNPGEPRVVRHIADTIAEIL 225 Query: 225 SRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKS 284 YPVD + FDDYFY S + D+ ++++ S +WRR N + I V T+ + Sbjct: 226 ENYPVDAIHFDDYFYLYSDIGTI-DSASFQRNNPGRLSLEEWRRGNVDKAIYTVKKTVDA 284 Query: 285 IKPG----VEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQI 340 V FGVSP G+W N+ +P GS T G +Y YADTR WV +G +DYI PQ+ Sbjct: 285 YNRRSGRKVAFGVSPFGIWANKKSNPNGSLTGGKQSYYAQYADTRGWVRKGWVDYIIPQL 344 Query: 341 YWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQ 400 YWPFS A Y LA WW+D VK TR RL+IG Y+VG +P EL Q Sbjct: 345 YWPFSHEVAAYAALADWWSDAVKGTRVRLFIGQGLYRVGAERIWQP--------RELVDQ 396 Query: 401 LDLNDAVPEISGTILFREDYLNKP---QTQQAVS-YLQSRW 437 + N + + GT++F + P Q ++AVS LQ W Sbjct: 397 MRYNQMLFNVDGTVIFSYRNVFMPGNGQMKEAVSRILQGCW 437 >UniRef50_C3XYE7 Putative uncharacterized protein n=2 Tax=Branchiostoma floridae RepID=C3XYE7_BRAFL Length = 576 Score = 373 bits (957), Expect = e-102, Method: Composition-based stats. Identities = 131/398 (32%), Positives = 195/398 (48%), Gaps = 38/398 (9%) Query: 45 PPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGIN 104 P T S+ RG+W+ATVS +DWP ++ Q+ ++ LD L +N Sbjct: 113 APTITPSPSREFRGVWVATVSNIDWPSSRHLST-------EQQKAELVTILDRTVELNLN 165 Query: 105 TVFFQVKPDGTALWPSKILPWSDLMTGKIG--ENPGYDPLQFMLDEAHKRGMKVHAWFNP 162 + FQV+P G A + S++ PWS + G+ G P YDPL F ++E+H+RG+++HAWFNP Sbjct: 166 AIVFQVRPAGDAFYDSQLEPWSYYLAGQHGSAPTPFYDPLAFAIEESHRRGIELHAWFNP 225 Query: 163 YRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAE 222 YR ++ + + + G+ +DPG V D ++ + Sbjct: 226 YRAKTKAAGYSLAS---------NHMAKRFPQYAYDYGNYIWMDPGAQVVADHTYDVIID 276 Query: 223 VVSRYPVDGVQFDDYFYTES-PGSRLNDNETYRKYG--GAFASKADWRRNNTQQLIAKVS 279 VV RY VDG+ FDDYFY G D TY+ Y G SKADWRR+N +L+ +++ Sbjct: 277 VVRRYDVDGIHFDDYFYPYPVSGVDFPDTATYQAYQTSGGTMSKADWRRDNVNRLVRRLN 336 Query: 280 HTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQ 339 I + K V+FG+SP G+WR +P G G + YD YAD + W+EQGL+DY+APQ Sbjct: 337 SGIHAEKSHVKFGISPFGIWR--PGNPAG--IVGFSQYDSLYADPKFWLEQGLVDYLAPQ 392 Query: 340 IYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKK 399 +YW Y L WW D P + +Y G ++ + W V EL Sbjct: 393 LYWMIDPPQQSYPALLDWWLDQ-NPLQRHVYTGNYLSRI-----LTDGWP----VSELVN 442 Query: 400 QLDLNDAVPE--ISGTILFREDYLNKPQTQQAVSYLQS 435 Q+ L+ + G I+F + V +S Sbjct: 443 QVSLSRDRADRLSLGNIMFSMKPFRD-NSDGVVDAFKS 479 >UniRef50_Q8YW40 All1776 protein n=5 Tax=Nostocaceae RepID=Q8YW40_ANASP Length = 669 Score = 369 bits (947), Expect = e-100, Method: Composition-based stats. Identities = 137/432 (31%), Positives = 214/432 (49%), Gaps = 38/432 (8%) Query: 15 PAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSS 74 I+ + L + +V PP + P ++ RG W+ TV DWP + Sbjct: 178 ATIIYQALVYLGQAEKIASVYLVIPPKPTLPTVRVSH-NREFRGAWITTVWNSDWPSKAG 236 Query: 75 VNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIG 134 ++++ QQ ++ L LQ+L N V QV+P+G AL+ S++ PWS +TG G Sbjct: 237 LSVAQ-------QQAELVAILTRLQQLNFNAVILQVRPEGDALYASELEPWSAWLTGTPG 289 Query: 135 E--NPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQH 192 + P YDPLQF + EAHKR ++VHAWFNPYR +T+ + V + Sbjct: 290 KAPEPFYDPLQFAIAEAHKRNLEVHAWFNPYRAKTSTRSAPNVR---------PHISVTN 340 Query: 193 RDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTES-PGSRLNDNE 251 + + G++ +DPGI VQD +++ +VV RY +D V DDYFY G D++ Sbjct: 341 PEVVYQWGNQLWMDPGIKIVQDRAYNVIIDVVRRYDIDAVHLDDYFYPYPIQGQAFPDDK 400 Query: 252 TYRKYG--GAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGS 309 TY Y G S DWRR N Q++ ++S IK+ K V+FG+SP G++R P G Sbjct: 401 TYAAYKSAGGQLSLNDWRRQNVDQMVLRLSQGIKATKSYVKFGISPFGIYR--PGQPPG- 457 Query: 310 DTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRL 369 G AY YAD ++W+EQG +DY+APQ+YW ++ Y VL KWW + + R + Sbjct: 458 -ITGLDAYSVLYADAKKWLEQGWVDYLAPQLYWRTDQTNQSYPVLLKWWTE-INSKRRHI 515 Query: 370 YIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPE--ISGTILFREDYLNKPQTQ 427 Y G ++ W E++KQ+ ++ G I F + + + Sbjct: 516 YAGNNIGQLD-----GKAW----KNEEIEKQVKISRNQAGELSLGNIFFSVSSIIENRQD 566 Query: 428 QAVSYLQSRWGS 439 + ++ S + + Sbjct: 567 ISTTFQNSLYTT 578 >UniRef50_C1A9I5 Putative uncharacterized protein n=1 Tax=Gemmatimonas aurantiaca T-27 RepID=C1A9I5_GEMAT Length = 534 Score = 368 bits (944), Expect = e-100, Method: Composition-based stats. Identities = 124/385 (32%), Positives = 184/385 (47%), Gaps = 37/385 (9%) Query: 47 ATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTV 106 A + RG+W+A+V+ +DWP +++ + QQ ++ LD L +N V Sbjct: 38 AEPPPVLREFRGVWVASVANIDWPSKRTLSTAE-------QQAELLALLDRAAELKLNAV 90 Query: 107 FFQVKPDGTALWPSKILPWSDLMTGKIGE--NPGYDPLQFMLDEAHKRGMKVHAWFNPYR 164 FQV+P AL+ S I PWS+ +TG G P +DPL F++ EAH RGM++HAWFNPYR Sbjct: 91 IFQVRPAADALYESSIEPWSEYLTGAQGRRPEPFWDPLAFVIREAHARGMELHAWFNPYR 150 Query: 165 VSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVV 224 + + + + ++ +DPG P V+ +V +VV Sbjct: 151 ARHTDARSPLAR---------SHIARTNPALVKPYAGYLWMDPGEPAVRARTLRVVLDVV 201 Query: 225 SRYPVDGVQFDDYFYTESPGSR------LNDNETYRKYG--GAFASKADWRRNNTQQLIA 276 RY +DGV DDYFY R D ++ +Y G +++DWRR+N +L+ Sbjct: 202 KRYDIDGVHIDDYFYPYPENDRRGRAIAFPDTRSWTRYQKSGGKLTRSDWRRDNVNKLVE 261 Query: 277 KVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYI 336 ++ I KP V FG+SP G+WR RG AY++ YAD R+W+ +G LDY Sbjct: 262 ELYDGIHKTKPWVRFGISPFGIWRPG----FPEQIRGLDAYEKLYADARKWLHEGWLDYF 317 Query: 337 APQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPE 396 PQ+YWP ++ Y VL WWA K R L+ G + G V E Sbjct: 318 TPQLYWPTTKREQAYPVLLDWWATENKRAR-HLWPGNFTSRAGGRGSG------AFSVAE 370 Query: 397 LKKQLDLNDAVPEISGTILFREDYL 421 L +Q+ + SG + F Sbjct: 371 LMEQIRVTRLNAAASGNVHFSMKSF 395 >UniRef50_B4D6Q1 Putative uncharacterized protein n=2 Tax=Verrucomicrobia RepID=B4D6Q1_9BACT Length = 388 Score = 367 bits (942), Expect = e-100, Method: Composition-based stats. Identities = 124/390 (31%), Positives = 187/390 (47%), Gaps = 37/390 (9%) Query: 51 QSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQV 110 + RG W+ATV LDWP + + Q+ + D D Q+L +N + QV Sbjct: 19 AAQPEFRGAWVATVFNLDWPSKAGL-------SEAEQKAQLRDIFDRAQQLKLNAILLQV 71 Query: 111 KPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTK 170 + A + S+ PWS +TGK G +PGYDPL + + EAH RG+++HAWFNP+R Sbjct: 72 RSMSDACYASRREPWSTFLTGKQGVDPGYDPLAYAITEAHARGIELHAWFNPFRAGTKGG 131 Query: 171 PGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVD 230 V H +WIR G + LDPG P + ++ ++ +VV RY +D Sbjct: 132 SSCAAN----------HVTRAHPEWIRPYGSQLWLDPGDPNARRYVLDVILDVVKRYDID 181 Query: 231 GVQFDDYFYTES-PGSRLNDNETYRKYG-GAFASKADWRRNNTQQLIAKVSHTIKSIKPG 288 GV DDYFY G+ D+ T++KYG S+ADWRR+N + + + H +K+ KP Sbjct: 182 GVHIDDYFYPYPVKGAEFPDDVTWQKYGMAGGKSRADWRRDNINRFVEAMYHEVKAAKPS 241 Query: 289 VEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSA 348 V G+SP G+WR + + + AY + YAD R W+ +G DY+APQ+YW Sbjct: 242 VRVGISPFGIWRPKVPATIEAQ---LDAYAQLYADARYWLSEGWCDYLAPQLYWGIHPDK 298 Query: 349 ARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVP 408 + VL WW ++ GIA ++G+P V E+ +Q++L Sbjct: 299 QSFPVLLNWWRQQST-AGRPVWPGIATERIGKPY----------DVGEIARQIELTRQSL 347 Query: 409 EIS---GTILFREDYLNKPQTQQAVSYLQS 435 + G I + L L+ Sbjct: 348 PANGEPGNIQWSMKALMH-NQGGVADLLKR 376 >UniRef50_Q7MXU6 YngK protein n=4 Tax=Porphyromonadaceae RepID=Q7MXU6_PORGI Length = 512 Score = 367 bits (942), Expect = e-100, Method: Composition-based stats. Identities = 128/382 (33%), Positives = 189/382 (49%), Gaps = 38/382 (9%) Query: 1 MDICSRNKKLTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIW 60 M CSR K LT + + L C K P + R W Sbjct: 1 MYHCSR-KSLTFFLALLFCVMVLFSCGTKRKLPSQ-----------VHADYPKREFRAAW 48 Query: 61 LATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPS 120 + TV + ++ +S ++ +I +LD L+ G N + FQ++P+ A + S Sbjct: 49 IQTVYQGEYARLSPAEA----------RRLLIGRLDKLKEAGCNAIIFQIRPESDAWYES 98 Query: 121 KILPWSDLMTGKIGENP--GYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELN 178 I PWS +TG+ G+ P +DPL FM+ E HKRGM++HAW NPYR S + G Sbjct: 99 AIEPWSRFLTGRQGQAPTPFWDPLAFMVSECHKRGMELHAWINPYRASTSGTAGL----- 153 Query: 179 STLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYF 238 P Y ++ W T ++ DPG+P+ + +I IV ++ RY +D + DDYF Sbjct: 154 -----APNHPYHRYPQWFVTYNNQLYYDPGVPDCRAYICRIVRDITMRYDIDAIHMDDYF 208 Query: 239 YTES-PGSRLNDNETYRKYGGA--FASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSP 295 Y G+ D++++R+YG F +K DWRR N +L+ ++ TI KP V FG+SP Sbjct: 209 YPYPVAGAAFPDDDSFRRYGQGYTFQTKGDWRRENVNKLVHEIKQTILQSKPWVRFGISP 268 Query: 296 AGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLA 355 G++RN+ P GS+T G YD+ YAD W ++G +DY+ PQIYW AA Y LA Sbjct: 269 FGIYRNKRTSPSGSETAGLQNYDDLYADVLLWQKRGWIDYVIPQIYWEIGHKAADYATLA 328 Query: 356 KWWADVVKPTRTRLYIGIAFYK 377 +WW LY G + Sbjct: 329 EWWGRNSVGA-AHLYFGQDVKR 349 >UniRef50_UPI0001C160EA conserved hypothetical protein n=2 Tax=Nostocaceae RepID=UPI0001C160EA Length = 668 Score = 365 bits (936), Expect = 2e-99, Method: Composition-based stats. Identities = 132/420 (31%), Positives = 208/420 (49%), Gaps = 40/420 (9%) Query: 15 PAILVALALL-LCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVS 73 A+++ A++ L + +VT P G K + Q + RG W+ V DWP Sbjct: 177 VAVMMYQAIVHLGKMQKINSPYIVTLPLGVKTVKVSHQ--REFRGAWITVVWNSDWPSK- 233 Query: 74 SVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKI 133 P Q+ +++ + LQ N + QV+P+G A++ S I PWS MTG Sbjct: 234 ------PGLSVEQQKTELLEIIKQLQSFNFNALILQVRPEGDAVYASPIEPWSAWMTGTQ 287 Query: 134 GENPG--YDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQ 191 G+ P YDPL+F ++E HKR ++VHAWFNPYR TK G+ + + Sbjct: 288 GKAPEPIYDPLEFAIEECHKRNIEVHAWFNPYRAKTTTKSGSNVS---------PHIAIT 338 Query: 192 HRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTES-PGSRLNDN 250 + + + G++ +DPG VQD +++ +V++RY VDG+ DDYFY G D Sbjct: 339 NPEVVYRWGNQLWMDPGAKIVQDRAYNVIIDVLTRYDVDGIHLDDYFYPYPISGQSFPDE 398 Query: 251 ETYRKY--GGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLG 308 +TY Y G S DWRR N Q++ ++S IK IK V+FG+SP G++R P G Sbjct: 399 KTYSAYKNSGGKLSVEDWRRENVNQMVWRLSEGIKKIKAHVKFGISPFGIYR--PGQPAG 456 Query: 309 SDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTR 368 G Y YAD+++W+++G +DY+APQ+YW ++ Y+ L KWW + V + Sbjct: 457 --IVGLDPYSVLYADSKKWLQEGWIDYLAPQLYWRTDQTQQSYETLLKWWTE-VNTKQRH 513 Query: 369 LYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPE--ISGTILFREDYLNKPQT 426 +Y G ++ E++KQ+ ++ + E G I F L + + Sbjct: 514 IYAGNNLGQLDGKV---------WKNSEIEKQIVISRNLAENFSLGNIFFSMKSLAENRQ 564 >UniRef50_A9NEW0 Putative uncharacterized protein n=1 Tax=Acholeplasma laidlawii PG-8A RepID=A9NEW0_ACHLI Length = 404 Score = 362 bits (928), Expect = 2e-98, Method: Composition-based stats. Identities = 120/387 (31%), Positives = 199/387 (51%), Gaps = 25/387 (6%) Query: 54 QPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPD 113 +P R W++ V +D P + + + +I+ LD + + +FFQV+ Sbjct: 30 KPFRAFWISNVLNIDLPNMKDPS----------YKDKVIEMLDTAKAYNMTAIFFQVRTT 79 Query: 114 GTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGT 173 A + SK+ P+S +TGK GE P +D L+F++ EA R ++VHAW NPYRVS+ T Sbjct: 80 NDAFYKSKLNPYSRFLTGKEGEVPLFDVLEFVIKEAKNRSLEVHAWCNPYRVSMKTDMTK 139 Query: 174 IRELNSTLSQQPASVYVQHRDWIRT-SGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGV 232 L++ + +H +++ T + +L+P EV+ +I + E+ Y VDG+ Sbjct: 140 SEYLSTLDDL---NFAKRHPEFVITDKNGQLILNPAKEEVKTFIIDSMLEIADNYDVDGI 196 Query: 233 QFDDYFYTESPGSRL-NDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEF 291 FDDYFY + S ND + + + D+RRN +I ++ +K P + F Sbjct: 197 HFDDYFYPYAGLSDSDNDASDFEQRTDKSLTLGDFRRNQITDVIRNLNKALKEKHPNLRF 256 Query: 292 GVSPAGVWRNRSHDPLGS--DTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAA 349 GVSP G+W+ + D LGS D + + +YD YAD+ W+++G++DYI PQ+YW F A Sbjct: 257 GVSPFGIWKTKKSDELGSNVDPQCSQSYDNQYADSYLWIKEGIIDYIVPQLYWDFEHKLA 316 Query: 350 RYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPE 409 + LA WWA+V K + LYIG Y+ GE E + E+ QL + Sbjct: 317 PFADLALWWAEVCKGSNVDLYIGHGPYRYGEKGGYENPY-------EVVNQLKFANQFDN 369 Query: 410 ISGTILFRED-YLNKPQTQQAVSYLQS 435 + G + F ++++ + QQ + ++ Sbjct: 370 VVGNVFFTYKTFIDETKQQQGMHLVKK 396 >UniRef50_Q110S6 Putative uncharacterized protein n=5 Tax=Bacteria RepID=Q110S6_TRIEI Length = 521 Score = 359 bits (921), Expect = 1e-97, Method: Composition-based stats. Identities = 131/440 (29%), Positives = 203/440 (46%), Gaps = 50/440 (11%) Query: 8 KKLTIRRPAI----LVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLAT 63 K + RR + + L L L S P S P+ P + RG+W+A+ Sbjct: 26 KNIHFRRKNLLWSCFLILGLTLTQMSSYLPTSRAQQPSSFSP--------REFRGVWVAS 77 Query: 64 VSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKIL 123 V+ +DWP S P Q+ +++ L+ +Q L +N + QV+P+G A + S I Sbjct: 78 VANIDWP-------SQPGLPVTQQKTELLNILNRMQELNLNALVLQVRPNGDAFYNSTIE 130 Query: 124 PWSDLMTGKIG--ENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTL 181 PWS +TGK G P YDPL+F + E+HKR +++HAWFNPYR ++ G+ Sbjct: 131 PWSGWLTGKQGTPPQPYYDPLEFAIAESHKRNIELHAWFNPYRAQLSPNDGSFAS----- 185 Query: 182 SQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTE 241 V++ + G LDPG VQD + + +VV RY +D V FDDYFY Sbjct: 186 ----NHAAVKYPQYAYRYGKYVWLDPGAKVVQDQTFNTIIDVVRRYDIDAVHFDDYFYPY 241 Query: 242 S-PGSRLNDNETYRKY--GGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGV 298 G D +TY Y G S ++WRR N ++ ++ I + KP V+FG+SP G+ Sbjct: 242 PQGGQEFPDYQTYNSYKASGGTLSLSNWRRQNVNNMVERLYQGIHAEKPYVKFGISPFGI 301 Query: 299 WRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWW 358 +R +P G G Y+ YAD + W+ +G +DY+APQ+YW Y VL WW Sbjct: 302 YR--PGNPPG--IVGLDQYESLYADVKLWLAKGWVDYLAPQLYWRIDPPKQSYPVLLNWW 357 Query: 359 ADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPE--ISGTILF 416 P R +Y G ++ V E ++Q+ ++ G I + Sbjct: 358 LQQ-NPQRRHIYAGNFLSQLQVSG---------WPVSEFERQVAISRQRASQLSLGNIFY 407 Query: 417 REDYLNKPQTQQAVSYLQSR 436 + + + ++ Sbjct: 408 SMK-MFRDNVAGVNNVFKNY 426 >UniRef50_B9XM08 Putative uncharacterized protein n=2 Tax=bacterium Ellin514 RepID=B9XM08_9BACT Length = 523 Score = 354 bits (908), Expect = 4e-96, Method: Composition-based stats. Identities = 131/415 (31%), Positives = 211/415 (50%), Gaps = 44/415 (10%) Query: 33 PESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMI 92 P ++V P+ ++PPA S++ R +W+AT+ +DWP P Q+ ++ Sbjct: 44 PAAVVYIPSTAQPPA----SNREFRAMWIATMVNIDWPSK-------PGLPVPQQKAELL 92 Query: 93 DKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENP--GYDPLQFMLDEAH 150 LD +L +N V FQV+P A++ S I PWS +TG +G+ P YDPL F ++EAH Sbjct: 93 AILDCAVKLNLNAVIFQVRPGSDAMYASSIEPWSYYLTGAMGKAPAPFYDPLAFAVEEAH 152 Query: 151 KRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIP 210 KRG+++HA+FNP+R + +K + D +R G+ LDPG Sbjct: 153 KRGLELHAYFNPFRAAQPSKKWQFSS---------NHISRTRPDLVRQYGNLLWLDPGER 203 Query: 211 EVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGS------RLNDNETYRKY--GGAFAS 262 E QD + +V +VV+RY +D V FDDYFY D++T++++ GG S Sbjct: 204 EAQDHVLKVVMDVVNRYDIDAVHFDDYFYPYKQQDARNRDIDFPDSKTWKRFVAGGGKLS 263 Query: 263 KADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYA 322 + DWRR N + +V +I + KP V+FG+SP G+W+ +G AYD YA Sbjct: 264 RDDWRRENINSFVHRVHDSIHAAKPWVKFGISPFGIWQPGYPP----QVKGLNAYDSIYA 319 Query: 323 DTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPS 382 D+R+W+ G +DY++PQ+YW + VL KWW + ++ GIA KVG Sbjct: 320 DSRKWLMNGWVDYLSPQLYWAVESPGQSFPVLLKWWLEQ-NSKNRNVWPGIASEKVGRT- 377 Query: 383 KIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQSRW 437 W N + +++ + + + +G L+ + L + + A + Q + Sbjct: 378 -----WKANEIIRQIQ---IIREQAGDRAGEALYSAEGLVQNRGGLASALAQGVY 424 >UniRef50_B0NT08 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=B0NT08_BACSE Length = 486 Score = 351 bits (901), Expect = 3e-95, Method: Composition-based stats. Identities = 124/427 (29%), Positives = 196/427 (45%), Gaps = 44/427 (10%) Query: 17 ILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQ---SSQPMRGIWLATVSRLDWPPVS 73 I + + L+L + E ++ + S+ T + +RG+W+ATV LDWP Sbjct: 7 IYIIILLVLVAACGKDDEGILDDGSHSQGEGQTSSSVLPGKELRGVWIATVWGLDWPMEK 66 Query: 74 SVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKI 133 A VQ++ D LD L +N VFFQ++ A + S+ PWS +TG Sbjct: 67 --------YDADVQKKLYTDYLDLLVGYNMNAVFFQIRGMADAFYESEYEPWSKYITGSA 118 Query: 134 GENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHR 193 G P YD L F+++EAHKRG++ HAW NPYR++ + P Sbjct: 119 GVRPDYDVLGFLVEEAHKRGIQFHAWLNPYRIATRANKN---------AAFPKLDAKIPM 169 Query: 194 DWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSR-LNDNET 252 + ++ V +P +PEVQ+ I +IV E++++Y VDG+ DDYFY S +ND Sbjct: 170 ELVKDYEKIRVYNPALPEVQERIVNIVKEIITKYDVDGIHMDDYFYPSLEASETMNDGAE 229 Query: 253 YRKYGG-AFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDT 311 ++KYG F + D+RRNN ++ + TI +P V F +SPA Sbjct: 230 FQKYGKDKFKNVEDFRRNNVNTVVRNIQKTIIETRPEVIFSISPAADMER---------- 279 Query: 312 RGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYI 371 Y+ +AD W ++G +D + PQ+Y+ A +++ W+ L I Sbjct: 280 ----NYNTLFADVNTWAKEGWVDVVIPQLYFATGNDATSFNLRLDLWSQYT--YENHLLI 333 Query: 372 GIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYL--NKPQTQQA 429 G YK G+ +L KQ +L A P++ G++L+ L NK A Sbjct: 334 GYGIYKFGDSQYGSK----FQSSDDLMKQFELASAKPKVKGSVLYSAKNLVENKVGIADA 389 Query: 430 VSYLQSR 436 V + + Sbjct: 390 VKAIYGK 396 >UniRef50_A6G0M0 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G0M0_9DELT Length = 540 Score = 351 bits (900), Expect = 3e-95, Method: Composition-based stats. Identities = 116/390 (29%), Positives = 192/390 (49%), Gaps = 37/390 (9%) Query: 53 SQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKP 112 ++ RG+W+ TV ++WP ++ + + +D + + +N + FQV+P Sbjct: 86 AREFRGVWVTTVYNINWPSSQGLSAAAAQAELAS-------IVDTAEAVNLNAIVFQVRP 138 Query: 113 DGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPG 172 + A++ S + PWS ++G G +PG+DPL F+++EAH RG++VHAWFNPYR + + Sbjct: 139 ESDAVYESSLEPWSRYLSGSQGGDPGFDPLAFLIEEAHARGIEVHAWFNPYRGAASAG-- 196 Query: 173 TIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGV 232 ++ + +Q + T G +DPG +V++ +V +VV RY VDGV Sbjct: 197 --------ITLAEPHIALQLPEHAHTYGSSLWMDPGALDVREHTVDVVLDVVERYAVDGV 248 Query: 233 QFDDYFYTESPGSRLNDNETYRKY--GGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVE 290 DDYFY G D T+ Y G S+ DWRR+N L+ ++ TI + P Sbjct: 249 HLDDYFYPYPNGDDFPDALTWNAYLADGGALSQGDWRRDNVNALVEELHDTIAAADPDAR 308 Query: 291 FGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAAR 350 FG++P G++R + G Y E YAD W+E+G +DY+APQ+YWP + Sbjct: 309 FGIAPFGIYRPG----IPEGIVGLDQYAELYADPVLWMEEGWVDYLAPQLYWPTYSAQQT 364 Query: 351 YDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVP-- 408 Y+VL WW+ + ++ G K+G+ DW + E+ Q++L+ Sbjct: 365 YEVLLDWWSSI--DPERYVFTGNYLSKLGD------DWT----LDEMLYQVELSRLYSDQ 412 Query: 409 EISGTILFREDYLNKPQTQQAVSYLQSRWG 438 G + F + L + L +G Sbjct: 413 NSMGNVYFHVEPLQSDTLGINAALLDEFYG 442 >UniRef50_B7AM83 Putative uncharacterized protein n=1 Tax=Bacteroides eggerthii DSM 20697 RepID=B7AM83_9BACE Length = 489 Score = 351 bits (900), Expect = 3e-95, Method: Composition-based stats. Identities = 134/422 (31%), Positives = 195/422 (46%), Gaps = 49/422 (11%) Query: 10 LTIRRPAILVALALLLCSCKS------TPPESMVTPPAGSKPPATTQQSSQPMRGIWLAT 63 + + LVAL SC PPE TPP P + +RG W+ T Sbjct: 1 MKYLKYITLVALLAFAVSCSKDDDGENMPPEP--TPPVEKPEPQAVTLPQKELRGAWITT 58 Query: 64 VSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKIL 123 V +DWP A QQ+ D LD L +N VFFQ++ A + S+ Sbjct: 59 VWGIDWPM--------EDYNAATQQKKYTDYLDLLVANNMNAVFFQIRGMADAFYESQYE 110 Query: 124 PWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPG-TIRELNSTLS 182 WS +TG G+NPGYD L F+++EAHKRG++ HAW NPYR+S + EL++ + Sbjct: 111 SWSKNITGTAGKNPGYDVLGFLVEEAHKRGLQFHAWMNPYRISTRASKNSSFAELDTKIP 170 Query: 183 QQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTE- 241 W + + +P +PEVQ I IV E++++Y VDG+ DDYFY Sbjct: 171 VA----------WTKDYNKIRIYNPAMPEVQTRIMDIVKEIITKYDVDGIHMDDYFYPSL 220 Query: 242 SPGSRLNDNETYRKYGG-AFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWR 300 G +NDN Y KYG F S ++RRNN +I + I KPGV F VSPA Sbjct: 221 EEGESMNDNAEYEKYGKDKFKSIEEFRRNNVDVVIQNIQKVIIDTKPGVIFSVSPAANID 280 Query: 301 NRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWAD 360 N Y + +AD R+W+++G +D I PQ+Y+ ++ W Sbjct: 281 N--------------NYSKLFADVRKWLKEGWVDVIIPQLYFATGTGKNSFNQFLDQWMQ 326 Query: 361 VVKPTRTRLYIGIAFYKVGEPSKIEPDW-MINGGVPELKKQLDLNDAVPEISGTILFRED 419 V +T IG YK G +PD+ +LK Q + +++G++L+ Sbjct: 327 YVN--QTHCLIGYGIYKFGS---TDPDYGNAFHSSADLKSQFEYASKKSKVNGSVLYSIK 381 Query: 420 YL 421 + Sbjct: 382 DM 383 >UniRef50_C6XWM5 Putative uncharacterized protein n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6XWM5_PEDHD Length = 481 Score = 345 bits (886), Expect = 1e-93, Method: Composition-based stats. Identities = 121/387 (31%), Positives = 181/387 (46%), Gaps = 41/387 (10%) Query: 38 TPPAGSKPPATTQ--QSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKL 95 P PP T + MR +W+A+V LDWP Q+Q ID L Sbjct: 28 AKPDPVDPPVETSLLFPKKEMRAVWIASVYGLDWP--------QSVYTMAGQKQQYIDYL 79 Query: 96 DHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMK 155 + + L IN ++FQVK G A + S PWS +TG G +PGYD L+FM+DEAH R ++ Sbjct: 80 EKFKSLNINAIYFQVKGMGDAFYNSSYEPWSASITGTRGVDPGYDVLKFMIDEAHARDIE 139 Query: 156 VHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDW 215 HAW NPYR++ S+ S PA +W+ + +P +PEV+ Sbjct: 140 FHAWMNPYRIATRA---------SSASSFPALHSSVKPEWVLDFPTIRIYNPALPEVRQR 190 Query: 216 ITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLI 275 + IV E +++Y VDG+ FDDYFY E G D + KYG A+ D+RR+N + I Sbjct: 191 LVDIVKETITKYDVDGIHFDDYFYPE--GETFTDQADFTKYGAGMANIQDFRRDNVNKAI 248 Query: 276 AKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDY 335 V I + KPGV F VSPA ++ YAD ++W ++G +D Sbjct: 249 KGVYDIIVATKPGVVFSVSPAPEITK--------------NFNTLYADVKKWNQEGWVDV 294 Query: 336 IAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVP 395 + PQ+Y + + W+ + L +G +Y+ G+ + Sbjct: 295 VIPQLYQEIGNQYNDFQLRLSEWSQ--NSFKAALMVGHGYYRFGDATAP----AAFQSSS 348 Query: 396 ELKKQLDLNDAVPEISGTILFREDYLN 422 EL++Q DL ++ G ++ YLN Sbjct: 349 ELQRQFDLTRLNKKVVGNAMYSAKYLN 375 >UniRef50_C1A7Q3 Putative uncharacterized protein n=1 Tax=Gemmatimonas aurantiaca T-27 RepID=C1A7Q3_GEMAT Length = 501 Score = 345 bits (884), Expect = 2e-93, Method: Composition-based stats. Identities = 123/420 (29%), Positives = 197/420 (46%), Gaps = 44/420 (10%) Query: 23 LLLCSCKSTPPESMVTPPAGSKPPA--------TTQQSSQPMRGIWLATVSRLDWPPVSS 74 +L +C +P + P T ++ RG+W+ATV+ +DWP + Sbjct: 1 MLFAACGGSPADPTGPVVPPPVVPPPEPPVTPFTVPTITREFRGMWIATVANIDWPSRTG 60 Query: 75 VNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIG 134 ++I QQ + LD Q+ G+N V V+ G AL+PS + PW +G G Sbjct: 61 LSIPQ-------QQAEFVALLDVAQQAGLNAVILHVRAAGDALYPSTLEPWMRSFSGTQG 113 Query: 135 ENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRD 194 +PG+DPLQ+ ++++H RG+++HAWFNP+R + + E A + D Sbjct: 114 VDPGWDPLQYAIEQSHARGIELHAWFNPFRAGNASDTARLAE---------AHFGRKRPD 164 Query: 195 WIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESP---GSRLNDNE 251 +R + DPG D ++V++VV RY VDGV DDYFY + DN Sbjct: 165 ILRRYCSQLWFDPGEAATHDQAIAVVSDVVRRYAVDGVHIDDYFYPYPETGCTTDFPDNT 224 Query: 252 TYRKY--GGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGS 309 + Y G ++ADWRR+N + + ++ T++ + G+SP G+WR +P G Sbjct: 225 AFAAYQRQGGTMARADWRRDNVNRFVERLYATVRGLSRTARVGISPFGIWR--PGNPAG- 281 Query: 310 DTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRL 369 G +Y YAD+R W+++G DY APQ+YW + Y+ L WW R L Sbjct: 282 -ITGLDSYASIYADSRLWLQRGWADYFAPQLYWSSTSVGQNYNALLTWWTQQ-NTMRRHL 339 Query: 370 YIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEI----SGTILFREDYLNKPQ 425 + G+A Y++ + S V E+ Q+ + A SGTI + + + Sbjct: 340 WPGLASYRIADGS------SAPFAVTEISTQIGITRAQSSASGGPSGTIFYNASSVKNDR 393 >UniRef50_C9PUA7 FenI protein n=2 Tax=Prevotella RepID=C9PUA7_9BACT Length = 493 Score = 344 bits (882), Expect = 4e-93, Method: Composition-based stats. Identities = 123/424 (29%), Positives = 196/424 (46%), Gaps = 48/424 (11%) Query: 8 KKLTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRL 67 KK+ + A+LV + C+ + P PP + + +RG+W+ATV L Sbjct: 6 KKIALALSAVLV----VACNHDDNILPNKPKKPDTPNPPTQSILPKKELRGVWMATVWGL 61 Query: 68 DWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSD 127 DWP A Q+ + I + L++ IN VF QV+ A + S PW Sbjct: 62 DWP--------RGEYNAESQKASYIAYMKALEKNNINAVFVQVRGRADAFYKSDYEPWCQ 113 Query: 128 LMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPAS 187 +TG++ ++PGYD L+FM+DEAHKRG+ HAWFNPYRV+ +T + PA Sbjct: 114 YLTGEVDKDPGYDVLRFMIDEAHKRGIAFHAWFNPYRVATK---------KATDAAFPAL 164 Query: 188 VYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTE-SPGSR 246 + + + +P +PEV+ I I+ +++++Y VDGV DDYFY + G Sbjct: 165 DSRIPQAMMVDYKTIRMYNPALPEVRQRIFDIIKDLITKYDVDGVHIDDYFYPSLTSGET 224 Query: 247 LNDNETYRKY------GGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWR 300 + D E Y+KY G + ++RRNN + + +++ +P V F VSPAG Sbjct: 225 IKDEEEYKKYAPKDNNGKPTITIEEFRRNNVDLAVKGIHDVVQATRPEVVFTVSPAGNPD 284 Query: 301 NRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWAD 360 Y+ YAD +W +G + I PQ+Y+P +A ++ WW+ Sbjct: 285 Y--------------NYNTMYADVVKWSREGWTEAIIPQLYFPMGNAATNFNQRLIWWSQ 330 Query: 361 VVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDY 420 + L+IG Y+ G+P EL KQ +++G++L+ Sbjct: 331 YT--FKNALFIGYGTYRFGDPKSPAAY----QNASELAKQFAFASKYNKVTGSVLYSAKD 384 Query: 421 LNKP 424 L Sbjct: 385 LLNN 388 >UniRef50_A0M6M5 Protein containing DUF187 n=4 Tax=Bacteroidetes RepID=A0M6M5_GRAFK Length = 540 Score = 344 bits (881), Expect = 5e-93, Method: Composition-based stats. Identities = 138/459 (30%), Positives = 206/459 (44%), Gaps = 74/459 (16%) Query: 2 DICSRNKKLTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQ------- 54 D C N R I + L L L +CKST V PP +P ++S Q Sbjct: 4 DCCLSNH----FRIPIFILLMLFLNACKSTK----VAPPKPVEPQVEEEKSEQNVDQVPE 55 Query: 55 --------------------PMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDK 94 RG W+ATV+ ++WP ++++ Q+ I+ Sbjct: 56 VEEPEASSENKIVEPPIDIEEFRGAWIATVANINWPSKNNLST-------EAQKAEAIEM 108 Query: 95 LDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGEN--PGYDPLQFMLDEAHKR 152 LD L+ N V QV+P AL+ S+I PWS +TGK G+ P YDPL+F ++EAH R Sbjct: 109 LDFLENHNFNAVILQVRPQADALYDSEIEPWSYFLTGKSGKAPQPYYDPLKFWIEEAHNR 168 Query: 153 GMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEV 212 G+++H W NPYR T G S + P V + + +DPG +V Sbjct: 169 GLELHVWLNPYRAHHTT--GKEIGEKSIVKTNPELV-------MELKNGMWWMDPGSSKV 219 Query: 213 QDWITSIVAEVVSRYPVDGVQFDDYFYTESPG---SRLNDNETYRKY--GGAFASKADWR 267 QD ++V ++V RY +D V FDDYFY + D +++ KY G S+ DWR Sbjct: 220 QDHSAAVVMDIVKRYDIDAVHFDDYFYPYASYNGKKDFPDEKSWEKYVNSGGELSRGDWR 279 Query: 268 RNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRW 327 R N I +++ IK+ K V+FG+SP G+WR P G G Y+E YAD + W Sbjct: 280 RKNVNDFIERIAEEIKAEKSFVKFGISPFGIWR--PGFPKG--ISGMDQYEELYADAKLW 335 Query: 328 VEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPD 387 + +G +DY PQ+YWP + + VL WW ++ GI + Sbjct: 336 LNKGWIDYFTPQLYWPTRQIGQSFPVLLGWWESE-NVVGRHVWPGINLGLEDKEENKG-- 392 Query: 388 WMINGGVPELKKQLDLNDA-VPEISGTILFREDYLNKPQ 425 E+ Q+ ++ + + GT+ + L K Sbjct: 393 --------EIASQILISRGILRDNPGTVHWNIGPLMKND 423 >UniRef50_C0YRL9 FenI family protein n=3 Tax=Bacteroidetes RepID=C0YRL9_9FLAO Length = 538 Score = 342 bits (877), Expect = 2e-92, Method: Composition-based stats. Identities = 121/415 (29%), Positives = 200/415 (48%), Gaps = 41/415 (9%) Query: 29 KSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQ 88 +T P + + + RG W+A+V+ ++WP S Q+ Sbjct: 52 AATNPATGTAASTEDNFRTNLPEIKREFRGAWIASVANINWP-------SRNDLTVEQQK 104 Query: 89 QAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIG--ENPGYDPLQFML 146 I LD L+ N FQ++P AL+ S I PWS +TG+ G +P YDPLQF + Sbjct: 105 AEAISMLDMLKDNNFNAAIFQIRPSADALYTSNIEPWSYFLTGETGTAPSPNYDPLQFWI 164 Query: 147 DEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLD 206 +EAHKRG+++H W NPYR +T G + +L+ + + + + +R + D Sbjct: 165 EEAHKRGLELHVWLNPYRA-HHTNGGAVNKLS--MVNKLSDIV------VRLKNGMYWFD 215 Query: 207 PGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTE---SPGSRLNDNETYRKY--GGAFA 261 P P+ Q +++IV ++V RY +D + FDDYFY + G+ DN ++ Y G Sbjct: 216 PANPKTQGHVSNIVKDIVKRYDIDAIHFDDYFYPYATYNKGADFPDNASWNAYVSSGGTL 275 Query: 262 SKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESY 321 S+ADWRR+N + + ++ I + K V FG+SP G+W + P G G++ YDE Y Sbjct: 276 SRADWRRDNVNKFVERIYKEIHAEKNNVRFGISPFGIW--KPGYPAG--IVGSSQYDELY 331 Query: 322 ADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEP 381 AD + W+ +G +DY +PQ+YWP ++ L WW L+ G+ Sbjct: 332 ADAKLWLNKGWVDYFSPQLYWPIDSKGQSFEALLSWW-QSENTMNRHLWPGLNT------ 384 Query: 382 SKIEPDWMINGGVPELKKQLDLNDA-VPEISGTILFREDYLNKPQTQQAVSYLQS 435 + ++ E+K Q+D++ + +G I + L + + L+S Sbjct: 385 ----VEIKVSDRPTEIKNQIDISRNILKNDAGEIHWSIAGL--TRNPNMLPALKS 433 >UniRef50_A7VTI3 Putative uncharacterized protein n=1 Tax=Clostridium leptum DSM 753 RepID=A7VTI3_9CLOT Length = 434 Score = 337 bits (865), Expect = 4e-91, Method: Composition-based stats. Identities = 123/420 (29%), Positives = 196/420 (46%), Gaps = 41/420 (9%) Query: 25 LCSCKSTPPESMVTPPAGSKPPATTQQSSQ-----PMRGIWLATVSRLDWPPVSSVNISN 79 T + +V PAG + A Q MR +W+ + S++ S Sbjct: 35 ASGASQTASDQLVNTPAGEESQAEQPPGGQIVLEGEMRAVWVPYL---------SLDQSK 85 Query: 80 PTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGY 139 Q+A + + + G+NT+ V+P G A++PS+I PWS L+TG G +PG+ Sbjct: 86 IGQGQEAFQKAFDEIVSQAKEYGLNTLIVHVRPFGDAMYPSEIYPWSHLLTGTQGGDPGF 145 Query: 140 DPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTS 199 DPL++M+ + H+ GM+ HAW NP R+ P + + +Q + DW+ Sbjct: 146 DPLEYMVRKTHEAGMQFHAWLNPLRIQSKGTPSILAP-DHLYTQWREDSDPNNDDWVVDW 204 Query: 200 GDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYG-- 257 + +P PEV++ I + E+V YPVD + FDDYFY S G+ D + Y+ Y Sbjct: 205 EEGKYFNPAYPEVREKIIEGIREIVENYPVDAIHFDDYFYPTSDGAF--DEKAYQAYTES 262 Query: 258 ---GAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGA 314 G + WR N L++ V IKSI P V+FG+SP G N + Sbjct: 263 VGEGVPLTLPQWRIANINTLVSGVYSAIKSINPQVQFGISPQGNITNDLN---------- 312 Query: 315 AAYDESYADTRRWV-EQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGI 373 AD W ++G +DY+ PQIY F ++ A W + +LYIG+ Sbjct: 313 -----MGADVETWASQKGYVDYLCPQIYVNFDHPLLPFNQTADQWRQMTTAEGVKLYIGL 367 Query: 374 AFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAVSYL 433 A YK G W NG L+++++ + ++ G +L+ DY++ QTQ+ V+ + Sbjct: 368 AVYKAGSEDADSGTW--NGKTDILQREIEYSRSL-GCDGIMLYSWDYMDTSQTQEEVANV 424 >UniRef50_C1I7D2 Putative uncharacterized protein n=1 Tax=Clostridium sp. 7_2_43FAA RepID=C1I7D2_9CLOT Length = 741 Score = 337 bits (864), Expect = 5e-91, Method: Composition-based stats. Identities = 130/407 (31%), Positives = 193/407 (47%), Gaps = 41/407 (10%) Query: 46 PATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINT 105 P + R WL+TV +D V+S S + + LD + L +N Sbjct: 11 PDKFINPKEQFRTAWLSTVVNIDISDVTSNPN---LSAEEEFKNDLSSILDRFEELNLNA 67 Query: 106 VFFQVKPDGTALWPSKILPWSDLM------TGKIGENPGY---DPLQFMLDEAHKRGMKV 156 V FQV P A +PS I PWS + G++PG+ DPL++++ E H RGM+ Sbjct: 68 VTFQVSPMLDAWYPSDIAPWSQYLHKGGNNYTLQGKDPGFNGFDPLEWLISETHNRGMEF 127 Query: 157 HAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWI 216 HAWFNPYRV+ + E + L++ + ++ I ++ LDPG PEV D++ Sbjct: 128 HAWFNPYRVTNTVDKRPVSEKLNELAEN--NFARKNPHLIYEFQNKLFLDPGRPEVIDYV 185 Query: 217 TSIVAEVVSRYPVDGVQFDDYFYTESPGS-----RLN----DNETYRKYGGAFA-----S 262 V EV ++Y VD + FDDYFY D +T+ Y F + Sbjct: 186 VQRVEEVANKYNVDAIHFDDYFYPYKYSENNKDIYFYTQDLDKQTFIDYNRGFGEYNIEN 245 Query: 263 KADWRRNNTQQLIAKVSHTIKSI----KPGVEFGVSPAGVWRNRSHDPLGSDTR--GAAA 316 A WR NN LI + + I ++FGVSP G+W + + GS+T ++ Sbjct: 246 AAKWRENNIDILIKAIKDKVTDINITNNRSIQFGVSPFGIWGHAENYLEGSNTPTGSTSS 305 Query: 317 YDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKP-TRTRLYIGIAF 375 + +A+TR+WV++GL+DY+ PQIYW F+ +AA Y L +WW + + LYIG Sbjct: 306 LRDQFANTRKWVKEGLVDYLTPQIYWSFNTAAAPYGELLQWWDSQFEGINNSHLYIGHPN 365 Query: 376 YKVGEPSKIEPDWMINGGVP-ELKKQLDLNDAVPEISGTILFREDYL 421 YK I+ W N P E+ QL N + G+ F D L Sbjct: 366 YKY-----IDASWDNNFKNPYEIANQLRFNQKFENVKGSAFFSFDKL 407 >UniRef50_C9LEC6 YngK protein n=1 Tax=Prevotella tannerae ATCC 51259 RepID=C9LEC6_9BACT Length = 537 Score = 335 bits (860), Expect = 1e-90, Method: Composition-based stats. Identities = 118/387 (30%), Positives = 182/387 (47%), Gaps = 39/387 (10%) Query: 53 SQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKP 112 + R +WL T LDWP QQ+ + LD LQ +NTV FQV+ Sbjct: 25 RREYRAVWLTTFLGLDWPK---------GHDPLTQQKQLCRILDQLQAAKVNTVLFQVRL 75 Query: 113 DGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPG 172 GT + S I PW + TG G P YDPL F ++E H+RGM++HAW + V Sbjct: 76 RGTTAYDSDIEPWDGIFTGTPGRRPTYDPLAFAIEECHRRGMELHAWMVAFPVC------ 129 Query: 173 TIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGV 232 +LN + SV +H + R S D++++DPG+P D++ ++ E+VS+Y VDG+ Sbjct: 130 ---KLNVLKALGTKSVVRKHPELCRRSDDQYIMDPGMPGTADYLANLCRELVSQYDVDGI 186 Query: 233 QFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFG 292 D Y E G +D TYRKYG KA WRR+N +++ K+ +KS KP V Sbjct: 187 HLDYIRYPE-AGLHFDDAATYRKYGKGRELKA-WRRDNVTRVVEKIHEAVKSQKPWVRLS 244 Query: 293 VSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYD 352 +P G + + +RG A D D W+ +G +D + P +Y+ Y Sbjct: 245 CAPVGKYADLPR----QSSRGWNARDAVGQDAVMWLNKGWMDVLFPMMYFDGDN---YYP 297 Query: 353 VLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISG 412 + W + + R + G+ Y + S E +W + L++Q++ E G Sbjct: 298 FVLDW---LERAERGTVAPGLGVYCL---SAGEKNWPLLT----LQRQMNFLRT-AEAGG 346 Query: 413 TILFREDYLNKPQTQQAVSYLQSRWGS 439 LFR D+L T+ +L + + Sbjct: 347 FALFRSDFLTN-NTKGVYDWLAGEYTT 372 >UniRef50_C3R8E6 S-layer protein n=24 Tax=Bacteroides RepID=C3R8E6_9BACE Length = 559 Score = 334 bits (857), Expect = 3e-90, Method: Composition-based stats. Identities = 114/390 (29%), Positives = 184/390 (47%), Gaps = 34/390 (8%) Query: 50 QQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQ 109 Q +R WL T+ +DWP ++N S R QQ+ + D LD L+ NTV Q Sbjct: 20 SQPKYEIRATWLTTLGGMDWPRNKAINASG----IRRQQKELCDILDRLKAANFNTVLLQ 75 Query: 110 VKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNT 169 + G ++PS I +++ +TG G NPGYDPL F + E HKRGM++HAW Sbjct: 76 TRLRGDMIYPSAIETFAESLTGSTGGNPGYDPLAFAIGECHKRGMELHAWIVTIPAGNTR 135 Query: 170 KPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPV 229 + +SV ++R + + LDPG P +++++ IV E+ SRY + Sbjct: 136 QVQLQGR---------SSVVRKNRTICKLYKGNWYLDPGNPGTKEYLSCIVKEITSRYDI 186 Query: 230 DGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGV 289 DG+ FD Y E D +TYRKYG K WRR+N ++ ++ IK+IKP V Sbjct: 187 DGIHFDYIRYPEQ-ADNFPDKDTYRKYGKGKELK-QWRRDNITDIVHRLYTDIKTIKPWV 244 Query: 290 EFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAA 349 + SP G +R+ + P +RG AY Y D ++W+++G+ D + P +Y+ + Sbjct: 245 KVSSSPIGKYRDTNRYP----SRGWNAYHVVYQDAQKWLKEGIHDALFPMMYF---QGNN 297 Query: 350 RYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPE 409 Y W + + G+ Y + + N + E+ +QL + + Sbjct: 298 FYPFALDWKENC---GNRWIIPGLGIYFLSPNEQ-------NWPLDEIVRQLYFTRQI-K 346 Query: 410 ISGTILFREDYLNKPQTQQAVSYLQSRWGS 439 ++G FR +L T+ LQ + + Sbjct: 347 LNGQAYFRNRFLLN-NTKGIWDELQENFYT 375 >UniRef50_A6L917 Putative uncharacterized protein n=5 Tax=Bacteroidales RepID=A6L917_PARD8 Length = 495 Score = 334 bits (856), Expect = 4e-90, Method: Composition-based stats. Identities = 116/386 (30%), Positives = 176/386 (45%), Gaps = 34/386 (8%) Query: 52 SSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVK 111 + +R +WL TV LDWP + + + QQQA++D LD LQ N VF Q + Sbjct: 26 PKKEIRAVWLTTVYGLDWPHKPATT----EAGRKAQQQALLDILDRLQEANFNMVFIQAR 81 Query: 112 PDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKP 171 G ++ S I P S +GK GE PGYDPL F++DE HKRGM+ HAWF + + Sbjct: 82 LRGDVMYRSAIEPVSKTFSGKYGELPGYDPLAFVVDECHKRGMECHAWFVTFPLGTE--- 138 Query: 172 GTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDG 231 S Q SV + + + LDPG+PE D+I S+V E+V+ Y +DG Sbjct: 139 ------KSVKEQGKLSVVKKKPKLCKRHNGEWYLDPGVPETADYILSLVKEIVNGYDIDG 192 Query: 232 VQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEF 291 + FD Y E + D Y K G S ADWRR N +++ ++ +K KP V+ Sbjct: 193 IHFDYIRYPEE-AKKFPDKALYNK-SGKKKSLADWRRENINRMVYRIYDWVKQTKPWVQV 250 Query: 292 GVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARY 351 SP G + P G AY+ + D + W++QG D I P +Y+ + Sbjct: 251 SSSPLGKYNRIERVP----NAGWTAYESVFQDPKMWMQQGKQDMIVPMMYYLHKN---FF 303 Query: 352 DVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEIS 411 + W + + G+ Y++ K E DW +N ++ Q+D + + Sbjct: 304 PFVDNWVDNC---NGRLVVPGLGAYRMD---KSEADWAVN----DITDQIDYSRYYGG-A 352 Query: 412 GTILFREDYLNKPQTQQAVSYLQSRW 437 G FR + + L+ + Sbjct: 353 GCAFFRCGNVL-YNDKGLYKELRDNY 377 >UniRef50_B2ULM6 Putative uncharacterized protein n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2ULM6_AKKM8 Length = 486 Score = 331 bits (848), Expect = 3e-89, Method: Composition-based stats. Identities = 117/417 (28%), Positives = 193/417 (46%), Gaps = 40/417 (9%) Query: 24 LLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSR 83 CS + +++ +G PA Q R W++TV +DWP S + Sbjct: 10 FSCSLLALASQALGWQTSGESVPAVP----QEFRAAWISTVHNIDWPSRSGL-------S 58 Query: 84 ARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQ 143 Q+ +++ L+ +L +N VF QV+P+ AL+ S + PWS ++G G NPGYDPL Sbjct: 59 GAAQRAELLNILNTCAQLKLNAVFLQVRPNADALYRSSLEPWSQWLSG-PGVNPGYDPLA 117 Query: 144 FMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRF 203 F + EAH+RG+++HAWFNP+R N K R + + D ++ +G Sbjct: 118 FAIQEAHRRGIELHAWFNPFRAKANVKHAVGRN----------HISLTRPDLMKRNGSVL 167 Query: 204 VLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASK 263 +++P +D ++ +VV RY +DGV DDYFY R ++ G S Sbjct: 168 LINPSASASRDHALKVIMDVVRRYDIDGVHLDDYFYPYPTPGRAWSPASF----GDGKSP 223 Query: 264 ADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYAD 323 + RR + + ++KS KP V GVSP G+WR G G AY+ D Sbjct: 224 SQ-RRGYIDGFVQDMYKSVKSSKPWVRVGVSPFGIWR---PGVPGGIEAGVDAYEHLACD 279 Query: 324 TRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSK 383 R+W+ +G +DY+APQ+YW S + + L +WWA + +R ++ GIA ++ Sbjct: 280 ARKWLSRGWVDYLAPQLYWRCSPAKQSFPALMQWWA--AQNSRRPVWPGIATARIMSSED 337 Query: 384 IEPDWMINGGVPELKKQLDLNDAVPEIS-GTILFREDYLNKPQTQQAVSYLQSRWGS 439 E+ Q++ + ++ + G + + + YL + S Sbjct: 338 PG------RPASEIAAQVNYSRSLARTAPGQCFWSIKSIMR-NAGGIQKYLNRLYPS 387 >UniRef50_A9KK48 Putative uncharacterized protein n=1 Tax=Clostridium phytofermentans ISDg RepID=A9KK48_CLOPH Length = 490 Score = 331 bits (848), Expect = 4e-89, Method: Composition-based stats. Identities = 105/391 (26%), Positives = 170/391 (43%), Gaps = 39/391 (9%) Query: 55 PMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDG 114 + +W++ + + + + + D++ +G+N V V+P G Sbjct: 121 EFKAVWISYLE-----------FKSTGYTKDEFEAQIDEMFDNVVDMGMNAVIVHVRPFG 169 Query: 115 TALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTI 174 A++ S PWS ++G G++PG+DPL++M++ AH RG++ HAW NPYR++ Sbjct: 170 DAMYDSDYFPWSKYISGTQGKDPGFDPLEYMVEAAHDRGLQFHAWLNPYRITSKNTDVKT 229 Query: 175 RELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQF 234 N+ + + + + +P +PEV+ I + V E+V Y VDG+ F Sbjct: 230 LATNNPARKWLTDKKTSNDRNVLSFDGNLYYNPAVPEVRTLIRNGVLEIVRNYDVDGIHF 289 Query: 235 DDYFYTESPG--SRLNDNETYRKYGGAFASKA---------DWRRNNTQQLIAKVSHTIK 283 DDYFY ++ D Y+ Y + + +WRR N LI + IK Sbjct: 290 DDYFYPTLGSNYEKVFDATEYKSYVDNYKKQGLDNYILPIDEWRRQNVNTLIKGIYSAIK 349 Query: 284 SIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQ-GLLDYIAPQIYW 342 K V FG+SP G D Y D W+ + G +DYI PQ+YW Sbjct: 350 LEKSDVVFGISPGGFLDTLRMK------------DRYYVDVDTWLSKPGYVDYICPQLYW 397 Query: 343 PFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLD 402 F S +D + W ++ K T +Y+GI YK S EPD+ N + L + Sbjct: 398 SFEHSQYPFDGILNRWLELRKNTDVNVYVGIPVYK--SASNDEPDFKKNANI--LADMII 453 Query: 403 LNDAVPEISGTILFREDYLNKPQTQQAVSYL 433 + G + FR D ++AV L Sbjct: 454 TCRNSKLVDGYMFFRYDNFYSNTAKKAVKNL 484 >UniRef50_C0EGV5 Putative uncharacterized protein n=1 Tax=Clostridium methylpentosum DSM 5476 RepID=C0EGV5_9CLOT Length = 430 Score = 330 bits (847), Expect = 4e-89, Method: Composition-based stats. Identities = 117/409 (28%), Positives = 203/409 (49%), Gaps = 39/409 (9%) Query: 29 KSTPPESMVTPPAGSKPPATTQQSSQP-----MRGIWLATVSRLDWPPVSSVNISNPTSR 83 PP + T S+P ++ Q SQ M+ +W + L+W N + Sbjct: 28 GEFPPATGQTDFVSSEPSSSGGQESQKDPMEGMKAVWFS---YLEW------NTMFKGAS 78 Query: 84 ARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQ 143 QQ + LD+L +G NTV V+ G A++ S + PWS ++G +G++PGYDPL Sbjct: 79 EEQFQQKLGTVLDNLVSIGCNTVMMHVRAFGDAMYRSSVYPWSASVSGVLGKDPGYDPLS 138 Query: 144 FMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRF 203 ++++AH +G+ VHAW NP R + I + +Q + +++ +++ S + Sbjct: 139 IIVEKAHAKGIAVHAWINPMRTMTAAEFDQIGDC---ALKQWYAGAQRYQYYMKDSSGHY 195 Query: 204 VLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASK 263 +L+P PEV+ I++ V E+V Y +DGV DDYFY ++ Y + Sbjct: 196 ILNPANPEVRKLISAGVTELVQNYDIDGVHIDDYFYPSGVDGLPENDAQYYQEAAPGTDI 255 Query: 264 ADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYAD 323 WRR+ T +++ ++ +K++KP + FG SP N +D Y D Sbjct: 256 GSWRRDATTEMVREMHDAVKAVKPEIPFGASPQSSLTND--------------FDRLYID 301 Query: 324 TRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVG---- 379 RW+ +GL+DY+ PQIY+ F ++ +D A W ++V +T LY+G+A YKVG Sbjct: 302 IERWISEGLVDYLMPQIYFGFHNTSQPFDQTAAKWNELV-GDKTALYVGLATYKVGLEND 360 Query: 380 -EPSKIEPDWM--INGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQ 425 + + +W+ NG L++Q+++ +++P G L+ + P Sbjct: 361 QHAGEGKTEWIDCFNGENNMLERQVEVLESLPNCKGYCLYSYQSIFNPD 409 >UniRef50_Q7MWV9 YngK protein n=2 Tax=Porphyromonas gingivalis RepID=Q7MWV9_PORGI Length = 515 Score = 330 bits (846), Expect = 5e-89, Method: Composition-based stats. Identities = 125/427 (29%), Positives = 186/427 (43%), Gaps = 45/427 (10%) Query: 20 ALALLLCSCKS-----TPPESMVTPPAGSKPPATTQ---QSSQPMRGIWLATVSRLDWPP 71 ALL C + +PP P P + MRG+WL T+ LDWP Sbjct: 13 IAALLFAGCGTKKVAPSPPTVKPLPDTVIAAPVIEPWVSPVREEMRGVWLTTIYGLDWPQ 72 Query: 72 VSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTG 131 S+ R Q++ + LD L+R NTVFFQV+ G ++PS+I P S + TG Sbjct: 73 RSAPTAEG----LRKQREELCRILDRLKREKFNTVFFQVRHRGDVIYPSEIEPQSTIFTG 128 Query: 132 KIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQ 191 P YDPL+F L E HKRG+ HAW + S + SV + Sbjct: 129 T--GKPDYDPLEFALKECHKRGLTFHAWLIVTPLG---------PDKHIRSLKGESVKSR 177 Query: 192 HRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNE 251 H +W + + L+PG+PE + + S+V E+V +YPVDG+ D Y E +D Sbjct: 178 HPEWCVRHNNLWYLNPGVPEARAYFASLVREIVEKYPVDGIHLDYMRYPEK-AKIFDDAA 236 Query: 252 TYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDT 311 TY++YGG A WRR N L+A V P V+ V+ G R + G Sbjct: 237 TYKQYGGNM-DPAAWRRRNLSDLMADVHRAATEKTPWVQVSVATIGRLRKLAGKRGGD-- 293 Query: 312 RGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYI 371 AY+ + D W ++G +D++ P +Y+ R Y L W A + + Sbjct: 294 --WTAYEGVHQDPVVWAQEGSVDFLVPMLYY---RDDLFYPFLEDWKAQLP---DLPIIP 345 Query: 372 GIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAVS 431 G+A Y+V + S+ + +Q+D + +G LFRED L Sbjct: 346 GLATYRVVDNSQW--------PAQVIGEQIDSARHI-GFAGVCLFREDQLRHESN-GIPQ 395 Query: 432 YLQSRWG 438 ++ R+ Sbjct: 396 IIRERFA 402 >UniRef50_C0EWT6 Putative uncharacterized protein (Fragment) n=1 Tax=Eubacterium hallii DSM 3353 RepID=C0EWT6_9FIRM Length = 491 Score = 330 bits (846), Expect = 7e-89, Method: Composition-based stats. Identities = 128/447 (28%), Positives = 202/447 (45%), Gaps = 48/447 (10%) Query: 7 NKKLTIRRPAILVALALLLCSCKSTPPESMVTPPAGS--KPPATTQQSSQPM--RGIWLA 62 KK++ R + + L K++ +S VT G+ K ++ QS+ M R +WL+ Sbjct: 40 EKKISGRSSSEFINF-LFSTQEKASSNKSSVTSNKGNSKKGNNSSTQSADTMNYRAVWLS 98 Query: 63 TVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKI 122 + + SV +N +S + + L ++ +G N + QV+P G AL+ S Sbjct: 99 YLEFNSY--RKSVKNNNESSFRKFYKH----ILQQIKTIGCNRIIVQVRPFGDALYASDY 152 Query: 123 LPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLS 182 PW+ ++G G+NPGYDPL+ M + +HK G+ + AW NPYR+S +IR L+ T Sbjct: 153 FPWAACISGTQGKNPGYDPLKIMTEMSHKEGISIEAWINPYRIS---SGNSIRSLSKTNP 209 Query: 183 QQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYT-- 240 + + I + +P V++ I V E+V Y VDG+ DDYFY Sbjct: 210 ARKWFSVQNTKRNILSYEGSLYYNPSSESVRNLIIQGVKEIVQNYNVDGIHMDDYFYPSF 269 Query: 241 -ESPGSRLNDNETYRKY------------------GGAFASKADWRRNNTQQLIAKVSHT 281 E + D Y++ S ADWRR+N +L++ + Sbjct: 270 TEKNVTTAFDAPEYKQQLKTNLSSTDSTSLTSADKSSNEISLADWRRDNVNRLVSGIYKA 329 Query: 282 IKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWV-EQGLLDYIAPQI 340 +K I V FG+SPAG N D E Y D WV + G +DY+ PQI Sbjct: 330 VKEINSDVTFGISPAGNLDNLRSDL------------EYYVDIDTWVSQNGYVDYLMPQI 377 Query: 341 YWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQ 400 YW F+ A +D + W +++ + +LYIG+ Y++G + D LKK+ Sbjct: 378 YWGFTNEVAPFDKVTDAWCILMENSPVKLYIGLQLYRMGSTEPGQSDEKELQKTSLLKKE 437 Query: 401 LDLNDAVPEISGTILFREDYLNKPQTQ 427 L +I G LF YL+ + Sbjct: 438 LSYLKKQKKIEGYCLFSYQYLDCQNKK 464 >UniRef50_C3QJ47 S-layer protein n=5 Tax=Bacteroides RepID=C3QJ47_9BACE Length = 488 Score = 329 bits (843), Expect = 1e-88, Method: Composition-based stats. Identities = 108/388 (27%), Positives = 177/388 (45%), Gaps = 34/388 (8%) Query: 50 QQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQ 109 Q +R W+ V LDWP + R Q++ +ID LD L+ NT+ FQ Sbjct: 19 AQPKHEVRAAWVTAVYGLDWPRTRATT----PQTIRKQKEELIDILDKLKAANFNTILFQ 74 Query: 110 VKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNT 169 + G L+PS I P++ ++TGK G NPGYDPL F ++E HKRGM+ HAW + Sbjct: 75 TRTRGDVLYPSAIEPFNSILTGKTGGNPGYDPLAFAVEECHKRGMECHAWMVTIPLGNKK 134 Query: 170 KPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPV 229 ++ SV + ++ + L+PG P ++++ +V EVVS Y V Sbjct: 135 HVASLGSQ---------SVTKRMKEICVPYKREYFLNPGHPATKEYLMKLVREVVSGYDV 185 Query: 230 DGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGV 289 DGV FD Y E+ D +R+Y + WRR+N +++ + +K++KP V Sbjct: 186 DGVHFDYLRYPEN-APLFPDKYDFRRYNKG-RTLDQWRRDNISEIVRYIYKGVKAMKPWV 243 Query: 290 EFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAA 349 + P G +R+ S P +RG A+ Y D + W+ +G++D I P +Y+ + Sbjct: 244 KVSTCPVGKYRDTSRYP----SRGWNAFFTVYQDPQGWMGEGIMDQIYPMMYF---QGNN 296 Query: 350 RYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPE 409 Y W ++ G+ Y + E W E+ +Q++ + Sbjct: 297 FYPFALDWQEQ---SNGRQVIPGLGIYFL---HPDEGKWTR----DEIDRQMNFIRKQ-K 345 Query: 410 ISGTILFREDYLNKPQTQQAVSYLQSRW 437 ++G +R YL + TQ L + Sbjct: 346 MAGEGHYRVKYLME-NTQGIYDELSENF 372 >UniRef50_UPI0001745532 hypothetical protein VspiD_00105 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001745532 Length = 382 Score = 329 bits (843), Expect = 1e-88, Method: Composition-based stats. Identities = 116/395 (29%), Positives = 188/395 (47%), Gaps = 38/395 (9%) Query: 49 TQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFF 108 +S MRG W+A+V L++P + + A Q+ + ++ +N++ Sbjct: 19 AAPASAEMRGAWVASVHNLNFPSRTGL-------SADQQRAEIRRIINIAAACRLNSLMV 71 Query: 109 QVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVN 168 QV+P+G AL+ S++ PWS +TG G +PGYDPL + E +G+ +HAW NPYR S + Sbjct: 72 QVRPEGDALYRSRLEPWSRFLTGTQGVDPGYDPLATFIAEGKSQGIAIHAWINPYRASTS 131 Query: 169 TKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYP 228 ++ T+ +R G +DPG P V+ + +V ++V RY Sbjct: 132 KAGKAENHISRTM-----------PGAVRRVGSMLWMDPGDPAVRQHVVRVVEDIVRRYA 180 Query: 229 VDGVQFDDYFYTES----PGSRLNDNETYRKY--GGAFASKADWRRNNTQQLIAKVSHTI 282 V GV DDYFY P D+ TY +Y GG +ADWRR N LI ++ + Sbjct: 181 VRGVILDDYFYPYPGTGLPRGTFPDDTTYGRYQAGGGRLDRADWRRENVNTLIRELHTVV 240 Query: 283 KSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYW 342 + + G FGVSP G++R + P G + + E Y+D W+ +G +DY++PQ+YW Sbjct: 241 HANRQGAWFGVSPFGIYR--PNVPRGVEAQ-LDQLTELYSDPVAWLREGTVDYLSPQLYW 297 Query: 343 PFSRSAARYDVLAKWW-ADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQL 401 + L WW + V P ++ +A ++G V E+ +QL Sbjct: 298 -TDAGPQSFSSLLGWWRSSSVNPRGILVFPSLAADRLGGSHNW--------PVQEISRQL 348 Query: 402 DLNDAVPEISGTILFREDYLNKPQTQQAVSYLQSR 436 D+ ++ G I++ L + T+ LQ R Sbjct: 349 DIESSIRPKGGFIIWSMAPLMR-NTKGVNGVLQGR 382 >UniRef50_B0P7J3 Putative uncharacterized protein n=1 Tax=Anaerotruncus colihominis DSM 17241 RepID=B0P7J3_9FIRM Length = 429 Score = 325 bits (834), Expect = 1e-87, Method: Composition-based stats. Identities = 113/415 (27%), Positives = 180/415 (43%), Gaps = 43/415 (10%) Query: 31 TPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQA 90 P G+ + + +RG+W++ + ++ + Sbjct: 50 APDSRDRQALGGAHTAVLSSSVNGEVRGVWISYL---------TLEPMIKGKTQAQFVKN 100 Query: 91 MIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAH 150 + D D G NTVF +P G AL+ S+ PWS +TG+ G +PGYDPL+ M+ AH Sbjct: 101 IGDAFDQAADFGFNTVFVHARPFGDALYKSEYFPWSRYLTGEEGRDPGYDPLELMVSLAH 160 Query: 151 KRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIP 210 +RG+++ AW NPYRV ++ KP + Q D +PG Sbjct: 161 ERGLRIEAWINPYRVRLDDKPMS-------ADNQAKKWLASGNDGALAWNGGVYYNPGSA 213 Query: 211 EVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNN 270 ++ I + V E+V Y VDG+ FDDYFY + D TY+ G+ ++ADWRR N Sbjct: 214 AARELIVNGVREIVENYDVDGIHFDDYFYPTT--DLTFDAATYQA-SGSSLTQADWRREN 270 Query: 271 TQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQ 330 +L+ V +K P FG+SP G Y+ +AD R WV + Sbjct: 271 VNKLVHDVYAAVKEANPDCLFGISPQGNVD--------------INYNGQFADVRTWVSE 316 Query: 331 -GLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGE----PSKIE 385 G +DYI PQIY+ + A Y W ++K +LY+GIA YKVG + + Sbjct: 317 PGYVDYICPQIYYGYRNGTAPYAETVALWDSMIKVDTIKLYVGIAAYKVGTVDTWAGEGK 376 Query: 386 PDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLN---KPQTQQAVSYLQSRW 437 +W+ + L + + G +++ + L Q + + L+ + Sbjct: 377 NEWIDTTDI--LARMVKTARKAEHYGGIVIYSYESLFGDVSEQMKIERNNLKKVF 429 >UniRef50_A6EKL7 Putative uncharacterized protein (Fragment) n=1 Tax=Pedobacter sp. BAL39 RepID=A6EKL7_9SPHI Length = 391 Score = 324 bits (831), Expect = 4e-87, Method: Composition-based stats. Identities = 121/302 (40%), Positives = 174/302 (57%), Gaps = 20/302 (6%) Query: 123 LPWSDLMTGKIG--ENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNST 180 PWS + G+ G PGYDPL F + EAH RGM++HAWFNPYR +++ T + Sbjct: 3 EPWSQWLMGRQGLAPGPGYDPLAFAIKEAHSRGMELHAWFNPYRATMSANTVTSAD---- 58 Query: 181 LSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYT 240 + + D T G + DPGIPEV+++I ++ +VV Y VDG+ FDDYFY Sbjct: 59 ------HMTRKRPDLFFTYGGKKQFDPGIPEVREYIVQVILDVVKGYDVDGIHFDDYFYP 112 Query: 241 ES-PGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVW 299 G R++D+ T+ KY F++K DWRRNN LI ++ +I K V+FG+SP G+W Sbjct: 113 YPIAGQRISDDVTFSKYANGFSNKNDWRRNNVDLLIKQLDDSIHHYKKYVKFGISPFGIW 172 Query: 300 RNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWA 359 +N++ D LGS T G + Y E YAD+R+WV++G +DYI PQIY+ F+R AA +D L WW+ Sbjct: 173 KNKAEDTLGSATHGLSNYTELYADSRKWVKEGWVDYINPQIYFSFTRRAAPFDTLVNWWS 232 Query: 360 DVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFRED 419 + LYIG A Y V + K+E W +P+ Q+ A + G++ F Sbjct: 233 N--NAYGRHLYIGQAAYLVNQ--KMEAAWRNPSQIPD---QVRYLRANNRVQGSVYFSSK 285 Query: 420 YL 421 Sbjct: 286 SF 287 >UniRef50_C3J8B5 YngK protein n=2 Tax=Bacteria RepID=C3J8B5_9PORP Length = 535 Score = 320 bits (821), Expect = 5e-86, Method: Composition-based stats. Identities = 120/452 (26%), Positives = 196/452 (43%), Gaps = 50/452 (11%) Query: 1 MDICSRNKKLTIRRPAILVAL--ALLLCSCKSTPPESMVTPPAG-------------SKP 45 M SR+ R L LLL + V A Sbjct: 1 MQHHSRSFLFAHRLTQCFSLLIGVLLLSGVSACSSRKKVVSTAPLPAPPTPRIEVEIPVE 60 Query: 46 PATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINT 105 +++ +RG+WL T+ LDWP +V+ + S Q++ + LD L NT Sbjct: 61 KPVIPSNAEAIRGVWLTTIYGLDWPSRRAVSTQDMVS----QRKELCRILDRLAESHFNT 116 Query: 106 VFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRV 165 VFFQV+ G ++PSKI P + TG YDPLQF ++E HKRG+ +HAW + + Sbjct: 117 VFFQVRHRGDVIYPSKIEPRVTVFTGGRNNYLDYDPLQFAIEECHKRGLSIHAWIVTFPL 176 Query: 166 SVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVS 225 + ++ + SV+ +HRDW T + + L+PG PE + +ITS+V E+V Sbjct: 177 GNTSHVQSLGD---------NSVWKKHRDWCFTLHNDWYLNPGHPEARSYITSVVREMVE 227 Query: 226 RYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSI 285 RY +DGV FD Y + + D Y +YG S +WR +N + +VS + S+ Sbjct: 228 RYDLDGVHFDYVRYPDKMREK-EDQNLYMRYGKG-RSLGEWRTSNISAFLKEVSTEVCSV 285 Query: 286 KPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFS 345 KP + +P G R P G A + + D W +G +D+I P +Y+ Sbjct: 286 KPHMLVSAAPLGKLRVLPSMP----NVGWTARESVFQDPAAWYREGSVDFIVPMMYY--- 338 Query: 346 RSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLND 405 R L W A + + G+A Y+ + SK +++ Q++ ++ Sbjct: 339 RDNLFEPFLVDWKAQIP---GLPIVPGLAPYRTEDESKWTA--------RDIENQMNASE 387 Query: 406 AVPEISGTILFREDYLNKPQTQQAVSYLQSRW 437 + ++G +RE + +P + + Sbjct: 388 RM-GMAGICFYRELNI-RPNRNGVDRVITRHF 417 >UniRef50_C2M9G1 YngK protein n=1 Tax=Porphyromonas uenonis 60-3 RepID=C2M9G1_9PORP Length = 530 Score = 309 bits (791), Expect = 1e-82, Method: Composition-based stats. Identities = 103/380 (27%), Positives = 170/380 (44%), Gaps = 30/380 (7%) Query: 48 TTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVF 107 Q MR +WL T+ LDWP +++ + QQ+++ LD R GINTVF Sbjct: 55 APQHPKAEMRAVWLTTIWGLDWPKMTADTHAGMV----RQQESLDKMLDDCVRAGINTVF 110 Query: 108 FQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSV 167 QV+ G L+PS + P S ++ GYDPLQ+ +D H RGM VHAW Y + Sbjct: 111 LQVRMRGDLLYPSTLEPLSTTISKTGVLPEGYDPLQYAIDACHHRGMSVHAWMVSYPLGT 170 Query: 168 NTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRY 227 N L++Q Y H + G+ + +DP P V+ + +V ++V+RY Sbjct: 171 NDHV-------RALAKQGKGFYAAHPEMCLRQGNAWFMDPAQPAVRTHMAQLVRDLVTRY 223 Query: 228 PVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKP 287 VDGV D Y + P S+ ND ++Y++ + WR N +I + T++ + P Sbjct: 224 DVDGVHLDYIRYPDGP-SKFNDLKSYQRMNPDRLPRMAWREANVTAMIDTLHRTLQEVAP 282 Query: 288 GVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRS 347 V + G ++ G G D+ D W ++G++D+I P IY+ Sbjct: 283 EVALSTACIGKYQQLPKPAPG----GYFCKDDVSQDPLVWFQRGIVDFIVPMIYYKDGH- 337 Query: 348 AARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAV 407 ++ WA + P + G+ Y++ + S+ + ++ QLD Sbjct: 338 ---FNYYIADWAKRIAP-HGPIVAGLGVYRLYDNSRW--------KLQDIYNQLDTLAQY 385 Query: 408 PEISGTILFREDYLNKPQTQ 427 +SG +R + L + Q Sbjct: 386 D-LSGVSYYRAEQLLQMYNQ 404 >UniRef50_C1Q9T9 Uncharacterized conserved protein n=3 Tax=Brachyspira RepID=C1Q9T9_9SPIR Length = 605 Score = 308 bits (790), Expect = 2e-82, Method: Composition-based stats. Identities = 128/427 (29%), Positives = 183/427 (42%), Gaps = 67/427 (15%) Query: 54 QPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPD 113 + R W +TV+ +DWP Q++ +I L+ L N VF QVKPD Sbjct: 48 REFRAAWFSTVANIDWPIKGG--------SENEQKKLIIKHLNTLYENNFNAVFVQVKPD 99 Query: 114 GTALWPSKILPWSDLMTGKIGEN-----PG-YDPLQFMLDEAHKRGMKVHAWFNPYRVSV 167 ++PSKI P + G + P D L+F++DEAHKR ++VHAWFNPYR+S+ Sbjct: 100 AGVIFPSKINPTTRYFFGTASSDEKDEYPFKTDMLKFIIDEAHKRNLEVHAWFNPYRMSL 159 Query: 168 NTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRY 227 E + + + I +R LDPG P +I V EVV Y Sbjct: 160 TYDTNKTYEEQFSKKNFIHTYVSNNLKPIHWYDNRIYLDPGEPISSKYIIDSVIEVVENY 219 Query: 228 PVDGVQFDDYFYTESPG----SRLNDNETYRKYGG--------------AFASKADWRRN 269 VDG+ FDDYFY + G D + KYG WRR+ Sbjct: 220 DVDGIHFDDYFYQNAAGGKTYKDWPDRISAEKYGEKSGYDINNTSYDDYGVNGLYAWRRD 279 Query: 270 NTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSH------DPLGSDTRGAA-AYDESYA 322 N +L++ + IKS KP V++ +SPAGVWRN + GS T+ +D +A Sbjct: 280 NINRLVSDLYKEIKSRKPYVKWTISPAGVWRNNTKLSEYIGSKYGSATQSYNPNFDALHA 339 Query: 323 DTRRWVEQG------------------LLDYIAPQIYWPFSRSAARYDVLAKWWADVVKP 364 D W+ G +D + PQ+YW A +D + KWW + K Sbjct: 340 DVLLWLLNGEKTSSLENASDKDGLNRMYIDAVIPQVYWSSYHKTAPFDTIVKWWVNEYKK 399 Query: 365 TR----TRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDA--VPEISGTILFRE 418 R LYIG A YK+G + EP + + +Q+D + I G+ F Sbjct: 400 ARATNTADLYIGHALYKMGRETNTEP----WQNIELISEQIDYIRKIGINSIKGSSFFTM 455 Query: 419 DYLNKPQ 425 + K Sbjct: 456 HSMYKKD 462 >UniRef50_B9Y560 Putative uncharacterized protein n=1 Tax=Holdemania filiformis DSM 12042 RepID=B9Y560_9FIRM Length = 408 Score = 302 bits (772), Expect = 3e-80, Method: Composition-based stats. Identities = 115/427 (26%), Positives = 186/427 (43%), Gaps = 50/427 (11%) Query: 10 LTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDW 69 + I+R I V + LL+ AG P T +R W++ + Sbjct: 7 MHIKRILIAVFVILLIF--------------AGCHPRKKTGTMG-EVRAAWISYIELSSI 51 Query: 70 PPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLM 129 S Q + L++L+ + NTV+ A +PS+ P + + Sbjct: 52 LDNRSET---------DYIQGVKTMLENLKAMNFNTVYVHASAFTDAYYPSQYYPTAQYV 102 Query: 130 TGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVY 189 G+IG+N YDP + +AH+ G + AW NP R S T + ++S + Q + Sbjct: 103 AGQIGQNVAYDPFGLFVQQAHEAGFHIEAWINPMR-SFRTDQESQIPVSSVIGQWLSDPT 161 Query: 190 VQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLND 249 ++ I GDR+ L+P PEV++ I ++ E+ YP+DG+ DDYFY + D Sbjct: 162 MRGTR-IVAEGDRWYLNPAYPEVRELICAVAKELAQNYPIDGLHLDDYFYPDGVSESF-D 219 Query: 250 NETYRKY--GGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPL 307 Y+ Y G S DWRR N +++A + T+K + ++ G+SPAG Sbjct: 220 QVAYQAYRQTGGELSLGDWRRQNINEMVASLYATVKQVDKTIQVGISPAGNLEY------ 273 Query: 308 GSDTRGAAAYDESYADTRRWVE-QGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTR 366 + + Y D R WV G LDYI PQIY+ + +D K W D+ + T Sbjct: 274 --------SVESIYGDVREWVRHDGYLDYILPQIYFGYEHGTLPFDQCLKQWEDLTQGTS 325 Query: 367 TRLYIGIAFYKVGEPSKIEPD----WMINGGVPELKKQLDLNDAVPEISGTILFREDYLN 422 T L +G+A YK+ D W + + LK+Q+ ++G +F + L Sbjct: 326 TELIVGLAAYKINTVDNYAKDGKYEWQQHDDI--LKRQILELRDHAAVAGFSIFSYNSLF 383 Query: 423 KPQTQQA 429 +P + A Sbjct: 384 QPAAENA 390 >UniRef50_B6YR88 Putative uncharacterized protein n=1 Tax=Candidatus Azobacteroides pseudotrichonymphae genomovar. CFP2 RepID=B6YR88_AZOPC Length = 490 Score = 300 bits (769), Expect = 5e-80, Method: Composition-based stats. Identities = 98/390 (25%), Positives = 171/390 (43%), Gaps = 36/390 (9%) Query: 48 TTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVF 107 + +R +WL T LDWP + + Q++ +++ L L++ N VF Sbjct: 19 SALMPKNEIRAVWLTTNYALDWPTKPFTTLEDID----KQKEELVNILCCLKKTNFNIVF 74 Query: 108 FQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSV 167 FQ + G ++ SK+ P S + K G YDPL F ++E HK G++ HAWF Y + Sbjct: 75 FQTRLRGNVVYDSKVEPLSPFIRNK-GYKVTYDPLAFAIEECHKLGLECHAWFVTYLLGA 133 Query: 168 NTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRY 227 G + V ++ R LDPG E ++ SIV E+V +Y Sbjct: 134 AEVKG---------EDNCSLVVKCNQLQTRIYKGEIYLDPGDLETDRYLLSIVEEIVDKY 184 Query: 228 PVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKP 287 VDG+ D Y E P + D+ TY+ YG +K +WR++N + ++++ +K KP Sbjct: 185 DVDGIHMDYIRYPEKP-TEFPDDITYKYYGKG-KNKTEWRKDNINRFVSRLYDMVKGKKP 242 Query: 288 GVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRS 347 V+ + G++ + LG + + A +E Y D +W+ G D+I P +Y+ Sbjct: 243 WVQVSSAVVGIYTRK----LGDNKKYWTA-NEVYQDPEQWLRMGKHDFIVPMMYYS---G 294 Query: 348 AARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAV 407 + + W A + + GI Y++ E + +W + ++K N Sbjct: 295 NLFFPFVQDWQA---RSYGRFVVPGIGIYRMDEK---DSNWDVQTVTEQIKSSRQHNT-- 346 Query: 408 PEISGTILFREDYLNKPQTQQAVSYLQSRW 437 G FR +YL + +++ + Sbjct: 347 ---GGNAFFRANYLI-GNKKGIRDEIKNNF 372 >UniRef50_B0MQ11 Putative uncharacterized protein n=1 Tax=Eubacterium siraeum DSM 15702 RepID=B0MQ11_9FIRM Length = 511 Score = 299 bits (765), Expect = 2e-79, Method: Composition-based stats. Identities = 108/404 (26%), Positives = 181/404 (44%), Gaps = 34/404 (8%) Query: 27 SCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARV 86 + S +++ + T + ++G+W+ W ++ + Sbjct: 121 TVTSKKNDNVPAATDNLPVNSYTALNYNEVKGVWI-------WYSELYPILTGKSESQL- 172 Query: 87 QQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFML 146 + + D D+ LGINTV+ V+P G A++ S PWS TG IG++PGYDPL+ M+ Sbjct: 173 -RSGIGDYYDNCLSLGINTVYVHVRPFGDAIYKSDYFPWSKYCTGYIGKDPGYDPLKVMI 231 Query: 147 DEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLD 206 DEAH RG+ AW NP R ST + + D+I + L+ Sbjct: 232 DEAHARGISFQAWVNPLRCYYEDD----APDVSTAYKTGQWYDTKDGDYIVKVKSYWWLN 287 Query: 207 PGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADW 266 P EV D I + AE+VS+Y VDGV DDYFY + D+ + +++S + + Sbjct: 288 PAYKEVTDLIANGAAELVSKYDVDGVHIDDYFYPTT--EAYFDSIAFNA--SSYSSLSQF 343 Query: 267 RRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRR 326 R +N +++A + +KS P FGVS G N + YAD + Sbjct: 344 RLDNCSRMVADMYKAVKSHNPTALFGVSAQGNVTNNE--------------TQLYADVEK 389 Query: 327 WVEQ-GLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKI- 384 W ++ G +DY+APQIY+ F ++ + + W ++ T L G+A YK+G + Sbjct: 390 WSKEDGYVDYMAPQIYYGFDNGGQPFEQVVERWDKMLAGTGKSLIPGLAVYKIGTEDEWA 449 Query: 385 -EPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQ 427 + +K+Q+ + G IL+ ++ +P + Sbjct: 450 GSGRYEWQNDKEIIKRQIVKSQKTSNYGGVILYSYQFIFEPDSN 493 >UniRef50_C9PZF4 YngK protein n=5 Tax=Prevotella RepID=C9PZF4_9BACT Length = 573 Score = 298 bits (764), Expect = 2e-79, Method: Composition-based stats. Identities = 104/430 (24%), Positives = 176/430 (40%), Gaps = 55/430 (12%) Query: 8 KKLTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRL 67 K+L+ +L+ AL + S P + + + +R +WL T+ L Sbjct: 3 KQLSFSNRFLLLFFALSTATMLCAKSFSFFKPNGLN----GWKLPKREVRAVWLTTIGGL 58 Query: 68 DWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSD 127 DWP + N A Q+Q + D LD LQR GINTV FQ + GT ++PS++ PW Sbjct: 59 DWPHSYAQN----ELMAGRQKQELRDILDKLQRAGINTVLFQARVRGTVVYPSQLEPWDG 114 Query: 128 LMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPAS 187 ++G G +PGYDPL F ++E HKRGM++HAW V G N Sbjct: 115 CLSGVPGRSPGYDPLAFAINECHKRGMELHAWVVTIPVGKWNSLGCKTLRN--------- 165 Query: 188 VYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRL 247 ++ I+ G+ +DP ++ + E+ RY VDG+ D Y E+ + Sbjct: 166 ---KYPHLIKRIGEEGYMDPENTATATYLANFCKEITDRYDVDGIHLDYIRYPETWKINI 222 Query: 248 NDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPL 307 + R N ++ + +K+ KP V++ SP G + + S Sbjct: 223 AHDAA---------------RRNITTIVRAIGEKVKASKPWVKYSCSPIGKFSDLSRFA- 266 Query: 308 GSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRT 367 + G AY + D + W+ GL+D + P +Y+ + + W + Sbjct: 267 ---SNGWNAYAKVCQDAQGWLRDGLMDALFPMMYFQGNH---FFPFAIDW---AEQSYGR 317 Query: 368 RLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQ 427 L G+ Y + S E +W + + +++ + G FR + + Sbjct: 318 MLVPGLGIYFM---SPSEKNW----SLDVITREMQVARQYG--MGHAYFRSKFFTD-NLK 367 Query: 428 QAVSYLQSRW 437 +Y Q + Sbjct: 368 GIYTYAQRIF 377 >UniRef50_B3QYB7 Putative uncharacterized protein n=1 Tax=Chloroherpeton thalassium ATCC 35110 RepID=B3QYB7_CHLT3 Length = 489 Score = 297 bits (759), Expect = 7e-79, Method: Composition-based stats. Identities = 104/414 (25%), Positives = 177/414 (42%), Gaps = 52/414 (12%) Query: 15 PAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSS 74 IL+ ALL+ + + + G + +RG+W+AT +DWP Sbjct: 10 LFILIFSALLILDHTTLFSQPLKNRLNGDD-------EREQLRGVWIATAYGIDWPK--- 59 Query: 75 VNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIG 134 T Q++++ + +++ +N VFFQV+ G L+ S P+S+++TG +G Sbjct: 60 ------TYDPEKQKESLQEIFHDIKKKNLNAVFFQVRIRGDVLFYSPYEPFSNVLTGSLG 113 Query: 135 ENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRD 194 P YDP+ + + A + G++ HAWFN ++ + P + + R Sbjct: 114 VIPDYDPVAYAISLAKENGLEFHAWFNTMILNGKNSTPQSEGVAHIWQAHPEWIDKRARK 173 Query: 195 WIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYR 254 L+P +PEV+ + ++ + RY +DG+Q DD Y P D+E + Sbjct: 174 NAWQ--KTAYLNPALPEVRAHLIRLITDFAERYDIDGIQLDD--YLRYPTKDFPDDEEFE 229 Query: 255 KYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGA 314 KY S DWRR N Q + + ++ KP ++FGV+P GV+ P Sbjct: 230 KYNPKKLSLDDWRRENINQFVGDLYDSLMQRKPYLKFGVTPIGVYTRVDDVP------AM 283 Query: 315 AAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAA----------RYDVLAKWWADVVKP 364 +Y + Y D+R WV + DY+APQIY+ ++ A ++ L + W + Sbjct: 284 ESYHDVYQDSREWVRRKKCDYLAPQIYFHTGKTTAADRRKNKTNPPFENLVRDWGGNMPF 343 Query: 365 TRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFRE 418 LY+GI YK E Q++L + G I + Sbjct: 344 --RHLYVGIGMYK-------------PPIKEEWPHQVELAEK-AGAEGVIFYPY 381 >UniRef50_B0NXH7 Putative uncharacterized protein n=3 Tax=Clostridiales RepID=B0NXH7_9CLOT Length = 468 Score = 295 bits (754), Expect = 3e-78, Method: Composition-based stats. Identities = 120/458 (26%), Positives = 184/458 (40%), Gaps = 51/458 (11%) Query: 6 RNKKLTIRRPAILVALALL--LCSCKSTPPESMVTPPAGSKPPATTQQSSQ--------- 54 K I+ A+LV A+ LCS PA T S Sbjct: 31 HLKLGKIQWIALLVVSAMFVNLCSHMVLAAARDGNDPAQGTTAEATTASEATTETTTEQQ 90 Query: 55 -------PMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVF 107 + W + D+ + + R + + LG+N V Sbjct: 91 EETQSMGEYKAFWFS---FYDYDSYRAKYKKRTAANFRTY---FTGVVKKGKSLGMNRVI 144 Query: 108 FQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSV 167 QV+P G A++ SK PWS ++GK G NPG+DPL+ M++ AH MK+ AW NPYRV+ Sbjct: 145 VQVRPFGDAIYKSKYFPWSKYISGKQGRNPGFDPLKIMVEVAHDNDMKIEAWVNPYRVTT 204 Query: 168 NTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRY 227 + N+ + A + + + G +P V+ IT+ V E+V Y Sbjct: 205 GSTNYKKLAKNNQARKWHAKKSTRRN--VLSYGGSLYYNPSKKAVRTLITNGVKEIVQNY 262 Query: 228 PVDGVQFDDYFYTES-----PGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTI 282 VDG+ DDYFY + Y S +RR L+ ++ + Sbjct: 263 DVDGIHMDDYFYPSFTKRNVKKAFDAKEYKKSSYKKKKKSIYTYRRAQINTLVKQMKKAV 322 Query: 283 KSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQ-GLLDYIAPQIY 341 KS+ P V +G+SPAG N + Y D +W+ +DYI PQ+Y Sbjct: 323 KSVDPNVTYGISPAGNIDN------------LTSKYSYYVDIYKWLNSTEYVDYICPQVY 370 Query: 342 WPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYK----VGEPSKIEPDWMINGGVPEL 397 W F A+++ + W K + +LYIGIA Y+ VG+ +W + V L Sbjct: 371 WGFKHPTAKFNKVTDRWIKAAKSKKVKLYIGIAVYRAGHNVGQNRAERKEWKRDTKV--L 428 Query: 398 KKQLDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQS 435 KKQ+ + G F L + +AV+ L++ Sbjct: 429 KKQVQYARK-KHVDGFAFFDYQDLKSKTSAKAVNQLKT 465 >UniRef50_A9NEM7 Hypothetical surface-anchored protein n=2 Tax=Acholeplasma laidlawii PG-8A RepID=A9NEM7_ACHLI Length = 906 Score = 290 bits (743), Expect = 5e-77, Method: Composition-based stats. Identities = 126/445 (28%), Positives = 211/445 (47%), Gaps = 60/445 (13%) Query: 18 LVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNI 77 LV L +L S + P + + K +++Q +R +W+ P V V+ Sbjct: 11 LVCLTTILLSGFTKPNSNDI------KSFEFEFETNQKLRAVWVT-------PIVGEVST 57 Query: 78 SNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENP 137 + + + M+D L+H + IN + F V+ AL+ S++ P + + G + N Sbjct: 58 FTTETAFKNEMNQMLDILEHYK---INALIFHVRTHNNALYDSELNPKATVF-GSVNFN- 112 Query: 138 GYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIR 197 +DPL ++++E RG++ HAW NPYRV N + + + AS + + Sbjct: 113 NFDPLLWLVNETQSRGIEFHAWLNPYRVGTN----YVGTMPAENPASNASNILSNP---- 164 Query: 198 TSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGS----------RL 247 ++ +L+PG P V+D+I V E++ +YPVD + FDDYFYT + Sbjct: 165 SNSALKILNPGEPVVRDFIVDTVIEIIEKYPVDAIHFDDYFYTNLGANGALSGATTILDE 224 Query: 248 NDNETYRKYGGAFAS-----KADWRRNNTQQLIAKVSHTIKSIK----PGVEFGVSPAGV 298 D +TY YG F + KA+WRR+ ++ VS+ IK+ ++FG+SP G+ Sbjct: 225 PDQQTYVTYGSGFNTTSATDKANWRRHQVNTMVQAVSNAIKNYNQLNGKHIQFGISPTGI 284 Query: 299 WRNRS----------HDPLGSDTRGAAAYDES-YADTRRWVEQGLLDYIAPQIYWPFSRS 347 ++N + GS T G Y ++D+ W+++G LDYIAPQ YW + S Sbjct: 285 YKNGNGVVTYDEFGKPVTTGSLTTGQTHYSSYLFSDSLHWIKEGWLDYIAPQSYWATNHS 344 Query: 348 AARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAV 407 AA Y + WW VVK LY GI Y E + +W + E++ QL+ + + Sbjct: 345 AASYYNVMGWWEKVVKYLDVNLYSGIGLYMADESTNT-FNWKD--DMLEMRTQLEYLETL 401 Query: 408 PEISGTILFREDYL-NKPQTQQAVS 431 ++ G ++ Y+ N Q + S Sbjct: 402 NDVDGLSVYSYKYIRNHYNNQNSTS 426 >UniRef50_D1PA22 YngK protein n=1 Tax=Prevotella copri DSM 18205 RepID=D1PA22_9BACT Length = 582 Score = 288 bits (738), Expect = 2e-76, Method: Composition-based stats. Identities = 101/418 (24%), Positives = 165/418 (39%), Gaps = 60/418 (14%) Query: 18 LVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNI 77 ++LCS + +S+V Q +R +WL T+ +DWP + Sbjct: 4 FKIFFIVLCSVLAAKAQSIVFN---------NQVPKHEVRAVWLTTIGGIDWPH----SY 50 Query: 78 SNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENP 137 + + A Q++ + D LD LQ INTV Q + GT ++PS PW ++G G +P Sbjct: 51 AQSSYSAEKQKKELTDILDRLQLAKINTVLIQTRVRGTMIYPSAYEPWDGCLSGFPGRSP 110 Query: 138 GYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIR 197 GYD LQF +DE HKRGM++HAW V G + + I+ Sbjct: 111 GYDALQFAIDECHKRGMELHAWVVTIPVGKWNALGCKT------------LRQKMPKLIK 158 Query: 198 TSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYG 257 G +DP D++ +I E+ +Y VDG+ D Y E+ +++ + Sbjct: 159 KIGADGYMDPENSRTGDYLANICREITHKYNVDGIHLDYIRYPETWNIKVSREQG----- 213 Query: 258 GAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAY 317 R ++ K+ +K+ KP V+ SP G + + S + G AY Sbjct: 214 ----------RRYITNIVRKIHDAVKAEKPWVKMSCSPVGKYDDLSRY----RSFGWNAY 259 Query: 318 DESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYK 377 + D + W++ GL+D + P +Y+ Y W + G+ Y Sbjct: 260 TKVCQDAQGWLKSGLMDELFPMMYFKNEH---FYPFAIDWQEQ---SHGKIVVPGLGIYF 313 Query: 378 VGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQS 435 + E W IN E+ + G FR +L Q + + Q Sbjct: 314 LD---PKEGKWNINDVTAEMY----HIRNLG--MGYAFFRNKFLLD-NKQGILDFTQR 361 >UniRef50_C7H8A9 FenI protein n=2 Tax=Faecalibacterium prausnitzii RepID=C7H8A9_9FIRM Length = 425 Score = 285 bits (728), Expect = 3e-75, Method: Composition-based stats. Identities = 115/419 (27%), Positives = 173/419 (41%), Gaps = 49/419 (11%) Query: 19 VALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNIS 78 V+ AL +S P + P + S R +W++ + + Sbjct: 27 VSAALTYYLLRSIPAGNNAEPAPSPQAAPNPALPSGEWRAVWVSYLEFAEM--------- 77 Query: 79 NPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPG 138 S + +D+ LG+NTV QV+P G AL+ S + PWS L TG G++PG Sbjct: 78 -DFSSESAFRADAAALMDNCLSLGLNTVIAQVRPFGDALYRSSLFPWSHLCTGVQGQDPG 136 Query: 139 YDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRT 198 +DPL +L EAH RG+ + AW NPYR + +S L H +WI T Sbjct: 137 FDPLDVLLTEAHARGLSLEAWVNPYRFRSSASMPPAIAESSLL--------NTHPEWICT 188 Query: 199 SGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGG 258 + L+P IPE D++ VAE+V Y VDG+ FDDYFY + S D + G Sbjct: 189 VNEGAYLNPAIPEAADYVVQGVAELVQNYAVDGIHFDDYFYPTTDPSI--DAAQFAASGE 246 Query: 259 AFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYD 318 WRR N +L+ +K+ P + FGVSP G N + Sbjct: 247 --TDLTAWRRANVTRLVKAAHDAVKAADPTLRFGVSPQGNPDNDR--------------N 290 Query: 319 ESYADTRRWV----EQGLLDYIAPQIYWPF------SRSAARYDVLAKWWADVVKPTRTR 368 E Y D W+ ++DY+ PQIYW + + ++ + W + + T Sbjct: 291 EQYTDLSVWLTASGADAVVDYLCPQIYWGYGYTLSSGSTRFSFENITAEWLALPRAESTA 350 Query: 369 LYIGIAFYK--VGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQ 425 LY G+ Y+ VG+ L +Q+ + G L+R L + Sbjct: 351 LYFGLGAYRVGVGDGGANADSVSQWCTGSALARQVTDLRS-AGAGGWALYRYGSLFRSD 408 >UniRef50_C5VL52 YngK protein n=3 Tax=Prevotella RepID=C5VL52_9BACT Length = 566 Score = 281 bits (718), Expect = 4e-74, Method: Composition-based stats. Identities = 101/389 (25%), Positives = 164/389 (42%), Gaps = 49/389 (12%) Query: 48 TTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVF 107 T + + R +WL T++ LDWP N + ++Q+Q +ID LD Q+ INTV Sbjct: 28 TKRMPKRETRAVWLTTLASLDWPK----NYARSEESIKLQKQELIDILDKYQKANINTVL 83 Query: 108 FQVKPDGTALWPSKILPWSDLMTGKIGENP--GYDPLQFMLDEAHKRGMKVHAWFNPYRV 165 Q + ++PS I PW +TG G P GYDPL F ++E HKRGM++HAW V Sbjct: 84 LQARVRAATIYPSDIEPWDQCITGVEGRAPGYGYDPLSFAVEECHKRGMEIHAWIATIPV 143 Query: 166 SVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVS 225 G ++ IR LDP P V ++ S+ E+V Sbjct: 144 GAKNSLG-------------CRTLMKKGFRIRNFSTGSYLDPADPSVAPYLASVCGEIVR 190 Query: 226 RYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSI 285 +Y VDG+ D Y + + + D RR+N ++ + +K+I Sbjct: 191 KYDVDGINLDYIRYPDG----------WPRPSYRDGDTPDQRRSNITAIVRAIHDEVKAI 240 Query: 286 KPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFS 345 KP V+ SP G + S ++ A+D + + W+ GL+D + P Y+ Sbjct: 241 KPWVKMSCSPIGKHADLSRY----SSKNFNAHDRVSQEAQEWMRLGLMDQLYPMQYF--- 293 Query: 346 RSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLND 405 R Y +A W + K + G+ Y + E +W + +L +Q+ ++ Sbjct: 294 RGDNYYPFVADWVENAYK---REIVTGLGTYFLD---PREGNWTLG----DLTRQMYVSR 343 Query: 406 AVPEISGTILFREDYLNKPQTQQAVSYLQ 434 + G FR +L Q + + Sbjct: 344 DLG--VGHAHFRSYFL-TANKQGVYDFEK 369 >UniRef50_C4FZ05 Putative uncharacterized protein n=1 Tax=Abiotrophia defectiva ATCC 49176 RepID=C4FZ05_ABIDE Length = 562 Score = 278 bits (711), Expect = 3e-73, Method: Composition-based stats. Identities = 104/375 (27%), Positives = 168/375 (44%), Gaps = 35/375 (9%) Query: 53 SQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKP 112 + +R +W+ + S+ + + D + G+N ++ V+P Sbjct: 193 TNEVRAVWIT-----------FLEFSSKGYTVNSFTNQITEMFDKIAASGMNEIYVHVRP 241 Query: 113 DGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPG 172 A++ S PWS +GK G +PG+DPL M++ AH R +K+HA+ NPYRV G Sbjct: 242 FSDAMYRSVYFPWSKYASGKQGVDPGFDPLAIMVNAAHTRNLKLHAYINPYRVCAEADFG 301 Query: 173 TIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGV 232 ++ +NS + ++ + G + +P +V + I + VAE+V Y VDGV Sbjct: 302 SLA-VNSPAYKWLNDDDEENDRNVLKFGKMYYYNPSSDDVINLINNGVAEIVKNYDVDGV 360 Query: 233 QFDDYFYTESPGSRL--NDNETYRKY---GGAFASKADWRRNNTQQLIAKVSHTIKSIKP 287 FDDYFY + D+E Y Y S DWRR+N +++ V T+KS Sbjct: 361 IFDDYFYPTLGSNYSSKFDSEEYADYKLNTANPMSIVDWRRDNINKMVKTVYATVKSSGK 420 Query: 288 GVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQ-GLLDYIAPQIYWPFSR 346 FG+SPAG N + D+ Y D RW + G +DYIAPQ YW F Sbjct: 421 NRTFGISPAGNLTNLRAN------------DKYYVDIDRWGRETGFVDYIAPQQYWGFEH 468 Query: 347 SAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDA 406 S ++ W VV +LY+ + + ++ +W N + L + + Sbjct: 469 SICPFEDNVSKWMAVVTNPNVKLYVALPMHL--AQAQETSEWKNNHDI--LGRMVTSLRN 524 Query: 407 VPEISGTILFREDYL 421 +SG ++R Y+ Sbjct: 525 -KSLSGFSIYRYHYI 538 >UniRef50_UPI0001C37647 hypothetical protein RflaF_08645 n=1 Tax=Ruminococcus flavefaciens FD-1 RepID=UPI0001C37647 Length = 379 Score = 277 bits (707), Expect = 8e-73, Method: Composition-based stats. Identities = 108/416 (25%), Positives = 184/416 (44%), Gaps = 52/416 (12%) Query: 13 RRPAILVALALLLCSCK-STPPESMVTPPAGSKPPATTQQSSQPM-----RGIWLATVSR 66 + A++ A LL C + PE++ P + A + P+ +G+W+ + Sbjct: 3 KILAVMALSAFLLGRCTPAAMPENLKQPDPAAVSEAAANKEYAPLNYEYQKGMWIPYLDY 62 Query: 67 LDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWS 126 ++ A + A+ +L G NTV+ ++P G A + S P Sbjct: 63 AEYMQ---------GKTADDFRSAIRKRLSDAADSGTNTVYVHIRPTGDAYYKSTFFPKG 113 Query: 127 DLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPA 186 + G YDPL+ MLDEAHK G+ VH W NP R+ + T+ + T Sbjct: 114 RYLDG------DYDPLEIMLDEAHKLGLSVHGWINPLRLQTAEEMETVPDSAITK----Q 163 Query: 187 SVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSR 246 I +G R L P PEV++ + + + E++ Y VDG+ DDYFY ++ S Sbjct: 164 WYSSGDSMNIGETGGRLYLRPDSPEVRELLANEIREIIGSYDVDGIHIDDYFYPDTDPSF 223 Query: 247 LNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDP 306 D+E++ G + +K WR + +++ + +K V FG+SP G R Sbjct: 224 --DSESFALSGESDLTK--WRTDAVSEMVKAMYSAVKDTDERVLFGISPQGNVRAD---- 275 Query: 307 LGSDTRGAAAYDESYADTRRWVEQ-GLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPT 365 Y+ YAD RRW+ + G DYI PQIY+ F + + + W + + + Sbjct: 276 ----------YETQYADVRRWISEKGFCDYIVPQIYYGFKNETLPFTSVLEEWERMAENS 325 Query: 366 RTRLYIGIAFYKVGEPSKI-----EPDWMINGGVPELKKQLDLNDAVPEISGTILF 416 RL IG+ YK+G+ + E +W+ + G+ + + Q L+ + G ++ Sbjct: 326 NVRLIIGLGAYKLGKEDRWAGESGESEWLDDPGIIDKQTQAVLDSSA---DGYAVY 378 >UniRef50_D1PRQ4 FenI protein n=1 Tax=Subdoligranulum variabile DSM 15176 RepID=D1PRQ4_9FIRM Length = 412 Score = 273 bits (697), Expect = 1e-71, Method: Composition-based stats. Identities = 121/395 (30%), Positives = 178/395 (45%), Gaps = 46/395 (11%) Query: 33 PESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMI 92 P++ A + + + P R +W VS L+W V S A + + Sbjct: 30 PDTPSATAAPTPAATAVPERTAPYRAVW---VSYLEWQQV-------DFSGADAFSRDIA 79 Query: 93 DKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKR 152 LD++ +G V QV+P G AL+PS P+S L TG G +PG+DPL +++ AH Sbjct: 80 AMLDNIASVGATVVLAQVRPFGDALYPSDYFPFSHLCTGIQGRDPGFDPLALLVEAAHAS 139 Query: 153 GMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEV 212 G+++ AW NPYR+ P Q PA V H DW++ + LDP P+V Sbjct: 140 GLELEAWVNPYRLQAGGVPA-------LCDQSPA---VTHPDWVKKTETGSYLDPANPDV 189 Query: 213 QDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQ 272 + +I V E+ Y +DG+ FDDYFY + + D Y S ADWRR+N Sbjct: 190 RQYIADGVEELCRNYALDGIHFDDYFYPTTSATF--DAAEYAAAQTGL-SLADWRRDNVN 246 Query: 273 QLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQ-G 331 L++ + + GV GV+P G DP YD Y+D RW+ Q G Sbjct: 247 ALMSLCHGV--TARYGVRLGVAPLG-------DPE-------LCYDGQYSDAARWLAQGG 290 Query: 332 LLDYIAPQIYWPFSRSAA-----RYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEP 386 +DY+ PQ+YW + D LA WAD+ + LY+G+ Y++G+ Sbjct: 291 YVDYLMPQLYWGLTYEQNGDTAHSLDTLAARWADLPRAEGVALYVGLGAYRIGDGDGSTA 350 Query: 387 DWMINGGVPELKKQLDLNDAVPEISGTILFREDYL 421 L QLD + + I G L+R L Sbjct: 351 GAAEWQSGHALADQLDALETL-GIGGAGLYRYASL 384 >UniRef50_C2L0K0 Lipoprotein yddW n=1 Tax=Oribacterium sinus F0268 RepID=C2L0K0_9FIRM Length = 443 Score = 254 bits (649), Expect = 5e-66, Method: Composition-based stats. Identities = 97/412 (23%), Positives = 165/412 (40%), Gaps = 52/412 (12%) Query: 45 PPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGIN 104 P + S + +R +W + + ++ P + + +D+LQ+ G Sbjct: 63 PGVKSLSSQKELRAVWFSYLDWINMPK-----------EEQAFRAEAAKVMDNLQKNGFQ 111 Query: 105 TVFFQVKPDGTALWPS-KILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPY 163 T+F V + + P+S M G N +DPL+ M+ EA K+G+ VHAWFNPY Sbjct: 112 TIFLHVHSHSDSYGKKMTVFPYSKFMPG----NGSFDPLEIMISEAKKKGISVHAWFNPY 167 Query: 164 RVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEV 223 RVS + +S + + S + ++ ++P ++ + + + E+ Sbjct: 168 RVSSSMSKWENIPEDSIVKKW--SRTSGEERNVLLHEGQYYINPSRAAGREALLASIKEL 225 Query: 224 VSRYPVDGVQFDDYFYTESPGSRL---NDNETYR--KYGGAFASKADWRRNNTQQLIAKV 278 + Y VDG+ FDDYFY + D Y K G S ++RRN L+ +V Sbjct: 226 LDNYAVDGIHFDDYFYPRVSLTEEGKRFDEPEYEEAKRQGETGSLTEYRRNQVSLLLKQV 285 Query: 279 SHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRR-WVEQGLLDYIA 337 K GV FGVSP P R + AY + D + + +DYI Sbjct: 286 HSLCKE--RGVVFGVSPV---------PNLQSLRSSVAY---FLDVDKIMASKDYIDYIM 331 Query: 338 PQIYWPFSRSAA-------RYDVLAKWWADVVKPTR--TRLYIGIAFYKVGEP---SKIE 385 PQ+Y F Y W ++ T L +G+ Y+ G Sbjct: 332 PQMYHGFRAKNGKGQEAPHAYMRSLGDWVNLTNSTGNQVELMLGLGLYRAGSSVWDGNPV 391 Query: 386 PDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQSRW 437 +W + LK+Q++ + G +F L + + Q+ + L+S + Sbjct: 392 SEWFTESDI--LKRQVEEARKTGIVKGYAVFAYQNLLEERAQRELGNLRSVF 441 >UniRef50_A6DH63 Putative uncharacterized protein n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DH63_9BACT Length = 225 Score = 235 bits (598), Expect = 4e-60, Method: Composition-based stats. Identities = 81/239 (33%), Positives = 123/239 (51%), Gaps = 22/239 (9%) Query: 56 MRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGT 115 MR W+A+V+ DWP ++++ QQ+ D LD +L +NT+ FQV+P G Sbjct: 1 MRAAWVASVANTDWPSKQGLSVAQ-------QQKECRDLLDLAVQLKLNTIIFQVRPHGD 53 Query: 116 ALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIR 175 AL+ S PWSD +TG G+ PGYDPLQ+ +D+ HKR +K+HAWFNPYRV T + Sbjct: 54 ALYKSSFEPWSDRLTGIQGKYPGYDPLQYWIDQCHKRKLKIHAWFNPYRVQHPTVKEPLA 113 Query: 176 ELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFD 235 + +P +Y++ L+P V+ ++ ++V + RY +DG+ D Sbjct: 114 SNSLQRKAKPWCIYLKK--------GYVWLNPASKAVRQYVQTVVFDCARRYNIDGIHLD 165 Query: 236 DYFYTES---PGSRLNDNETYRKY----GGAFASKADWRRNNTQQLIAKVSHTIKSIKP 287 DYFY P + D++ Y Y K WRR+ LI + +K +KP Sbjct: 166 DYFYPYKDFLPATGFPDHKEYSAYLSSKPQKVMDKEMWRRHQVNTLIYSLHKGLKRLKP 224 >UniRef50_B4AVG6 Putative uncharacterized protein n=1 Tax=Cyanothece sp. PCC 7822 RepID=B4AVG6_9CHRO Length = 423 Score = 232 bits (591), Expect = 2e-59, Method: Composition-based stats. Identities = 96/397 (24%), Positives = 148/397 (37%), Gaps = 64/397 (16%) Query: 54 QPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPD 113 + +RG+WL V S +Q +I+ L+ L G NTVF V Sbjct: 2 KTIRGVWLTNV----------------GSEVLNSRQNIINALNLLADTGFNTVFPVVWNK 45 Query: 114 GTALWPSKILPWSDLMTGKIGENP---GYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTK 170 G +PS+++ L T +P G DPL +++ A G+ V WF K Sbjct: 46 GFTQYPSQVM----LQTFNQEIDPAFAGRDPLAEVIEAAKNVGIDVIPWFEYGFACSYQK 101 Query: 171 PGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVD 230 G ++ +P + + ++ PEVQ++I S+V EV Y V Sbjct: 102 NG-----GHIIASKPHWAAKDINNQLLNKNGFEWMNAFEPEVQNFILSLVLEVARNYDVA 156 Query: 231 GVQFDDYFYTESPGSRLNDNETYRKY----------GGAFASKADWRRNNTQQLIAKVSH 280 GVQ DD P D +T +Y A WR + +S Sbjct: 157 GVQGDD-RLPALPCEGGYDEKTRARYYSEQGVKPPQNIKDAKWLQWRAALLTNFLGNLSR 215 Query: 281 TIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQI 340 +K+IK + +S P G Y E D+ W+ Q ++D I PQ+ Sbjct: 216 EVKAIKNDLLVSISS-------HPYPFG--------YHEYLQDSPTWIRQKIVDVIHPQL 260 Query: 341 YWPFSRSAARYDVLAKWWADVVKPTR-TRLYIGIAFYKVGEPSKIEPDWMINGGVPELKK 399 Y R+ Y L + P TRL+ G+ ++ P K + PE Sbjct: 261 Y---RRTLKDYQALVETTLKQFSPDDLTRLFPGV-LIRLNAPGKPQD----FHISPEQLW 312 Query: 400 QLDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQSR 436 Q L + I G + F + LN Q +LQ++ Sbjct: 313 QTILINRRLGIRGEVFFFFEELN-VNAQSLAQFLQAK 348 >UniRef50_B4VPG3 Putative uncharacterized protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VPG3_9CYAN Length = 406 Score = 229 bits (583), Expect = 2e-58, Method: Composition-based stats. Identities = 96/445 (21%), Positives = 163/445 (36%), Gaps = 89/445 (20%) Query: 13 RRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPV 72 RR + + L L+ +S+V P+ ++ T+ + +RG+WL V+ Sbjct: 8 RRFRVFLVLGLVFSIVLLVA-KSIVFSPSLARSQTPTKIT--EIRGVWLTNVA------- 57 Query: 73 SSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGK 132 S + + L +L NTV+ V G +PS + L+ Sbjct: 58 ---------SGVLFSPWGINRAIAQLSKLNFNTVYPVVWNRGHTFYPSAVATQEPLL--A 106 Query: 133 IGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQH 192 I G D L +L + H++G++V WF G + + S L++ +H Sbjct: 107 IMRLNG-DVLADILQQGHRQGLRVIPWFE---------YGFMTPIYSELAR-------RH 149 Query: 193 RDWIRTSGDR---------FVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESP 243 WI S + L+P PEVQ I ++ EVVS+Y VDG+Q DD+F P Sbjct: 150 PTWITQSLTQKSDPENPQLLWLNPLHPEVQQLILDLIKEVVSQYDVDGIQLDDHFG--MP 207 Query: 244 GSRLNDNETYRKYGGAFAS-----------KADWRRNNTQQLIAKVSHTIKSIKPGVEFG 292 D T +Y WR N + + ++ T+KSIKP Sbjct: 208 VELGYDPYTIERYQQEHYGNSPPNSPLNSEWMRWRANKISEFMGEIVQTVKSIKPDCIIS 267 Query: 293 VSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYD 352 +SP A AY D + WV++G +D + Q+Y + + Sbjct: 268 LSP---------------NPQAFAYKHYLQDWQTWVQRGWVDELVLQVYRD---ELSSFT 309 Query: 353 VLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISG 412 A + + IGI + +P E +++ Q++ G Sbjct: 310 AELNQPAVRMARRTIPVSIGILTGTLADPISFE----------QIQAQVEAVRD-RAFDG 358 Query: 413 TILFREDYLNKPQTQQAVSYLQSRW 437 F + L T ++ + + Sbjct: 359 VSFFYWETLWSYLTPESPQQRRRGF 383 >UniRef50_B4WH89 Putative uncharacterized protein n=1 Tax=Synechococcus sp. PCC 7335 RepID=B4WH89_9SYNE Length = 635 Score = 228 bits (580), Expect = 4e-58, Method: Composition-based stats. Identities = 91/453 (20%), Positives = 158/453 (34%), Gaps = 109/453 (24%) Query: 15 PAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSS 74 AI+ +L + + P + +V PP+ T + +RG+W+ +D Sbjct: 230 AAIICRASLSPNTVATVPSDRIVFPPSLP----TVATPTTELRGVWMT---NID------ 276 Query: 75 VNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIG 134 S + A+ L+ L +L N V+ V GT L+PS + + K G Sbjct: 277 -------SDVLFSRSALEQALETLSKLNFNVVYPTVWNWGTTLYPSAVA--ERTIGYKQG 327 Query: 135 ENPG----------------YDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELN 178 P D LQ +++ AH R +KV WF G + + Sbjct: 328 LYPDLDRTGRKVELEAAQGDRDMLQEIIELAHSRNLKVMPWFE---------FGFMAPAD 378 Query: 179 STLSQQPASVYVQHRDWIRT----SGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQF 234 S L+++ Q D T +R L+P EVQ ++ +++E+ + Y +DG Q Sbjct: 379 SELARRHPDWLTQKADGTLTTLEGEHERVWLNPFHLEVQTFLLQLISELSANYDIDGFQV 438 Query: 235 DDYFYTESPGSRLNDNETYRKYGGAFAS-----------KADWRRNNTQQLIAKVSHTIK 283 DD+ P + D T Y WR + + +V T+K Sbjct: 439 DDHMG--LPFAYGYDPYTINLYQQEHDGKSPPADPKDPEWTRWRADKITDFMDQVFTTVK 496 Query: 284 SIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIY-- 341 + +P VSP AY+ D WV++G ++ + Q+Y Sbjct: 497 AQRPQAIMSVSP---------------NPHIFAYEYYLQDWDTWVKRGYVEELIIQLYRT 541 Query: 342 ------WPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVP 395 W + AA Y A PT + G+ + +P Sbjct: 542 DLGRFVWEMGQEAAEY-------AREHIPTAVGVLSGLK--------------GRSVPMP 580 Query: 396 ELKKQLDLNDAVPEISGTILFREDYLNKPQTQQ 428 +++Q++ +G F + L + Sbjct: 581 LIEEQVEAVRD-RGFAGVSFFFYETLWNLSNEG 612 >UniRef50_Q8YXK2 All1210 protein n=4 Tax=Nostocaceae RepID=Q8YXK2_ANASP Length = 906 Score = 228 bits (580), Expect = 4e-58, Method: Composition-based stats. Identities = 90/407 (22%), Positives = 143/407 (35%), Gaps = 83/407 (20%) Query: 57 RGIWLATVSRL--DWP--------PVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTV 106 R WLAT + L +P + SV + T +Q + D L + GINT+ Sbjct: 397 RQQWLATRTNLWKQFPTDRRLAPAEIRSVWLDRGTIVRAGSEQELAKIFDRLAQAGINTI 456 Query: 107 FFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVS 166 FF+ G ++PSK+ P + + G+DPL + AH RGM++HAW + Sbjct: 457 FFETINAGYTIYPSKVAPQQNPLIR------GWDPLASGVKLAHARGMELHAWVWTFAAG 510 Query: 167 VNTKPGTI----RELNSTLSQQPASVYVQHRDWIRTSGD-RFVLDPGIPEVQDWITSIVA 221 + L+ P H+ + G + DP PE++ ++ + Sbjct: 511 NQRHNELLNIPTNYPGPVLAANPDWANYDHQGQMIPLGQTKPFFDPANPELRQYLLKLYE 570 Query: 222 EVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGA---------------------- 259 E++++Y VDG+Q D Y D R YG Sbjct: 571 EIITKYKVDGLQLDYIRYP------FQDPAAGRSYGYGKAARTQFQQLTGVDPMKISPSQ 624 Query: 260 ---FASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAA 316 + +R +A+VS ++ + V+ PL R Sbjct: 625 TQLWQQWTTFRTQQVDSFVAQVSQMLRQQDRNLILSVAVF---------PLPEYER---- 671 Query: 317 YDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFY 376 + W QG +D I P Y ++ R+ LAK W + T L GI Sbjct: 672 VQKIQQHWEIWARQGNIDLIIPMTY---AQDTVRFQTLAKPWITSTQLGSTLLIPGIRLL 728 Query: 377 KVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNK 423 + + QL L +P +SG LF + LN Sbjct: 729 SLPTLGAFD--------------QLQLVRDLP-VSGYALFAAENLNN 760 >UniRef50_B5WA73 Putative uncharacterized protein n=2 Tax=Arthrospira RepID=B5WA73_SPIMA Length = 476 Score = 224 bits (570), Expect = 7e-57, Method: Composition-based stats. Identities = 77/383 (20%), Positives = 140/383 (36%), Gaps = 71/383 (18%) Query: 53 SQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKP 112 +RG+W+ T + + Q + + + L + NT++ V Sbjct: 122 KTEIRGVWMTT----------------NDTDVLMNQPRLEEAVSKLAQFNFNTIYPVVWN 165 Query: 113 DGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPG 172 G + S ++ + + G D L +++ AH+ + V WF G Sbjct: 166 SGYVTYKSSVVKEAGIQPFVRRGFQGQDMLADIIERAHRHNLLVLPWFE---------FG 216 Query: 173 TIRELNSTLSQQPASVYVQHRDWIRTS----GDRFVLDPGIPEVQDWITSIVAEVVSRYP 228 + +S L+ + + Q RD +TS G+ L+P P+VQ ++T ++ EVV+ Y Sbjct: 217 FMAPPSSELALKHPNWLTQQRDGTKTSISAAGEVVWLNPFHPQVQKFMTDLIVEVVTDYD 276 Query: 229 VDGVQFDDYFYTESPGSRLNDNETYRKY----------GGAFASKADWRRNNTQQLIAKV 278 +DGVQFDD+ T P + D T Y + WR + + ++ Sbjct: 277 IDGVQFDDH--TSLPSTFGYDPYTISLYQRETNRTPPSNPQDPAWVRWRAHKITAFMRQL 334 Query: 279 SHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAP 338 IK+ KP VSP AY+ D WV GL+D + Sbjct: 335 HQAIKAKKPHSIISVSP---------------NPYHIAYNGHLQDWVTWVRDGLVDELVV 379 Query: 339 QIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELK 398 Q+Y R + +A+ +K + ++ G+ + +++ + Sbjct: 380 QVY----RDELDF-FIAELNRPEMKAAQNKISTGVGILTGLRTRPVPINFIQSKVRAARD 434 Query: 399 KQLDLNDAVPEISGTILFREDYL 421 +Q G F + L Sbjct: 435 RQF----------GVAFFFYESL 447 >UniRef50_A8YI06 Similar to tr|Q8YPV9|Q8YPV9 n=8 Tax=Chroococcales RepID=A8YI06_MICAE Length = 438 Score = 222 bits (565), Expect = 2e-56, Method: Composition-based stats. Identities = 81/349 (23%), Positives = 135/349 (38%), Gaps = 67/349 (19%) Query: 9 KLTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLD 68 + +++ IL+ LA L + G A +Q +RG+W+ T Sbjct: 11 RQILKKFPILLFLASFL-----------IVVFLGYFSTAFSQSRDPDIRGVWITT----- 54 Query: 69 WPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDL 128 + + R + QQ ++ L L N ++ V G AL+PS I + Sbjct: 55 ------NDTAMLMDRDKRQQA-----IEQLVNLNFNAIYPVVWNSGYALYPSAIAQREGI 103 Query: 129 MTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASV 188 G D L ++++ RG+ V WF G + S L+ + + Sbjct: 104 QPFVPTGAQGQDILAELVEQTRGRGLLVIPWFE---------FGFMAPPTSELALKHQNW 154 Query: 189 YVQHRD----WIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPG 244 Q RD W+ +G+ L+P PEVQ+++ +V EVV +Y ++G+QFDD+ P Sbjct: 155 LTQKRDGGTTWVGAAGEVVWLNPFRPEVQNFLRELVLEVVGQYDINGIQFDDHL--SLPN 212 Query: 245 SRLNDNETYRKYGGAFAS----------KADWRRNNTQQLIAKVSHTIKSIKPGVEFGVS 294 D T Y WR + +A + +I++IKP + ++ Sbjct: 213 EFGYDPYTIALYQQETEKTPPANPRDPEWTKWRADKITAFLANLKQSIEAIKPNILLSIA 272 Query: 295 PAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWP 343 P AY+ D WV QGL+D + Q+Y P Sbjct: 273 P---------------NPYEFAYNGHLQDWLAWVRQGLVDELIVQVYRP 306 >UniRef50_P74735 Slr0592 protein n=1 Tax=Synechocystis sp. PCC 6803 RepID=P74735_SYNY3 Length = 491 Score = 220 bits (561), Expect = 7e-56, Method: Composition-based stats. Identities = 78/313 (24%), Positives = 123/313 (39%), Gaps = 62/313 (19%) Query: 46 PATTQQSSQPMRGIWLA---TVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLG 102 Q + +RG+W+ TV LD Q + ++ L L Sbjct: 41 QVQAQNAFPEIRGVWITNNDTVHFLD-------------------QNRTTESINLLADLN 81 Query: 103 INTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNP 162 NT++ V G L+ S+ L + G D L ++D+AH+R M V WF Sbjct: 82 FNTIYPVVWNSGYVLYESEFAKREGLQPFSPRGDQGQDVLADIIDKAHRRNMLVLPWFE- 140 Query: 163 YRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTS----GDRFVLDPGIPEVQDWITS 218 G S L ++ + Q RD +TS G+ ++P P+VQ +IT Sbjct: 141 --------FGFKAPPMSELVKRHPWWFTQKRDGTKTSVSAAGEVMWMNPFHPQVQTFITQ 192 Query: 219 IVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFAS----------KADWRR 268 +V + V++Y +DGVQFDD+ T P DN T Y WR Sbjct: 193 LVMDAVNKYDLDGVQFDDH--TALPNEFGYDNYTISLYQQETKKTPPSNPKDPAWIRWRA 250 Query: 269 NNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWV 328 + + +++ IK+ KP + VSPA AY+ D W+ Sbjct: 251 DKITAFMVQLNARIKAAKPNILVSVSPATY---------------NLAYNTFLQDWLDWI 295 Query: 329 EQGLLDYIAPQIY 341 +G++D + Q+Y Sbjct: 296 RKGIVDEVIVQVY 308 >UniRef50_Q8YV65 All2116 protein n=15 Tax=Cyanobacteria RepID=Q8YV65_ANASP Length = 416 Score = 219 bits (558), Expect = 1e-55, Method: Composition-based stats. Identities = 90/425 (21%), Positives = 154/425 (36%), Gaps = 85/425 (20%) Query: 15 PAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSS 74 A+++AL+++ S P + +TP A +RG+WL +D Sbjct: 25 FALMMALSVVATVMLSFPLNAQITPSAALAS---------ELRGVWLT---NID------ 66 Query: 75 VNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIG 134 S ++ + L L +L NTV+ V G L+PSK+ + ++ I Sbjct: 67 -------SDVLFERDRLKTSLQKLDKLNFNTVYPAVWNWGYTLYPSKVA--AKVIGRAID 117 Query: 135 ENPG---YDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQ 191 PG D L+ ++ E HK+G+ V WF G + +S L++ Sbjct: 118 PTPGLQGRDMLKEIVTEGHKQGLTVIPWFE---------FGFMAPADSLLAKNRPQWLTS 168 Query: 192 HRDWIRTSG----DRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRL 247 + R DR L+P P+VQ +I ++ E+V Y +DG+QFDD+F P Sbjct: 169 RSNGSRIVKEGIHDRVWLNPFRPDVQQFIQDLIVEIVRNYDIDGIQFDDHFG--LPSELG 226 Query: 248 NDNETYRKYGGAFASKA-----------DWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPA 296 D T Y +A WR + + ++ IK+ K V+P Sbjct: 227 YDAYTVALYKKEHRGQAPSKNPRDPEWLRWRASKITNFMQRIFKAIKATKKDCLVSVAP- 285 Query: 297 GVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAK 356 +YD AD ++W GL++ + QIY + + Sbjct: 286 --------------NPQRFSYDYFLADWQKWERMGLIEELVLQIYRD---DLNVFVQELE 328 Query: 357 WWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILF 416 + + IGI I+ +++ Q+ +G F Sbjct: 329 YPEVKAAKAHIPVSIGILSGLKNRSVPIQ----------QIQTQVQKVRD-RNFAGVSFF 377 Query: 417 REDYL 421 + L Sbjct: 378 FYETL 382 >UniRef50_Q8YQA0 All3933 protein n=18 Tax=Cyanobacteria RepID=Q8YQA0_ANASP Length = 741 Score = 216 bits (550), Expect = 1e-54, Method: Composition-based stats. Identities = 91/441 (20%), Positives = 153/441 (34%), Gaps = 99/441 (22%) Query: 22 ALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPT 81 L S + +TP Q Q +RG+WL N Sbjct: 42 VLFALSFTTVLLLQNLTP----ATAQFFQSPRQEIRGVWLT----------------NND 81 Query: 82 SRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDP 141 + + D L L+RL NT++ V DG +PS + + G G D Sbjct: 82 FDILRNRAKVQDTLAQLRRLNFNTIYPVVWNDGYTKYPSAVTQRMGIPYFFRGTE-GQDV 140 Query: 142 LQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTS-- 199 + ++ +A +G+ WF G + L S L+ Q Q RD +TS Sbjct: 141 IADIISQARSQGLLAIPWFE---------FGFMAPLTSELASQHPDWLTQKRDGTQTSIS 191 Query: 200 --GDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYG 257 G+ ++P PEVQ +IT +V E++++Y DG+QFDD+ P D T Y Sbjct: 192 AAGEVAWMNPFHPEVQQFITDLVVEIITKYNADGIQFDDHM--SLPVDFGYDKYTINLYR 249 Query: 258 GAFAS----------KADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPL 307 + WR + + +++ +K+ KP F VSP Sbjct: 250 QETGNPPPSNPQAQAWVKWRADKITAFMVQLNQAVKARKPNAIFAVSP------------ 297 Query: 308 GSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRT 367 AY D WV G++D + Q+Y ++ Sbjct: 298 ---NYYDFAYKLQLQDWLNWVRLGVVDELVVQVY---RNDLQSFN--------------- 336 Query: 368 RLYIGIAFYKVGEPSKIEPDWMINGGVPELK----KQLDLNDAVPEIS-------GTILF 416 K+ P IE +I G+ + +Q+ ++ ++ G + F Sbjct: 337 --------SKLITPEIIETQQLIPTGIGIMTGLRNRQVSMSQIQSQVRAAQERGLGAVFF 388 Query: 417 REDYLNKPQTQQAVSYLQSRW 437 + L + V+ Q+ + Sbjct: 389 YYESLWDY-APEPVAQRQASF 408 >UniRef50_Q7NL32 Glr1294 protein n=1 Tax=Gloeobacter violaceus RepID=Q7NL32_GLOVI Length = 796 Score = 215 bits (546), Expect = 4e-54, Method: Composition-based stats. Identities = 94/455 (20%), Positives = 161/455 (35%), Gaps = 86/455 (18%) Query: 19 VALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPM------------RGIWLATVSR 66 V ALL +ST PE PPA A Q++ + + R W + Sbjct: 247 VESALLTSDARSTAPEQF--PPAYRDAIARAQRTLKELPAMLKDGLDTQARAAWEDAIED 304 Query: 67 LDW-----------PPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGT 115 L W P V ++ + T ++ + D L + GINTVFF+ G Sbjct: 305 L-WAHYPTSQLAALPEVRAIWLDRGTIVKAGSEEGLTRIFDRLAQSGINTVFFETVNAGY 363 Query: 116 ALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIR 175 ++PS + P + + G+DPL + AH+R M++HAW + I Sbjct: 364 TIYPSAVAPAQNPLIR------GWDPLAAAVRLAHERKMELHAWTWAFAAGNTRHNALIG 417 Query: 176 ELNS----TLSQQPASVYVQHRDWIRTSGD-RFVLDPGIPEVQDWITSIVAEVVSRYPVD 230 + L+ P + +R +G + +DP PEV+ ++ S+ E+++ Y VD Sbjct: 418 KSQDFPGPVLAAHPGWAQSGRKGNLRPAGQPEYWMDPANPEVRAYLQSLYEEILTNYDVD 477 Query: 231 GVQFDDYFYT------ESPGSRLNDNETYRKYGGA------------FASKADWRRNNTQ 272 G+QFD Y + G ++ + G +A ++ Sbjct: 478 GLQFDYIRYPLQKNAGQYFGYSPAARRSFAQLTGVDPIDIAPEESSLWALWTRFKAEQVS 537 Query: 273 QLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGL 332 +A+ + ++ IKP + + +P G R D W QG Sbjct: 538 SFVAESAEKLRRIKPRLIVSAAVF-------PNPPGERLRLLQ------QDWEAWAIQGN 584 Query: 333 LDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMING 392 +D + P Y +R L + VK + + + + Sbjct: 585 IDLLVPMTYALNTRRLQ---QLVEPTLPGVKEAPVLILPSLNLMSLPQ------------ 629 Query: 393 GVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQ 427 +L+ QL +P G LF +L Q Sbjct: 630 --VQLRDQLQAVRDLPS-GGYSLFAAAHLADNHQQ 661 >UniRef50_B2IV00 Putative uncharacterized protein n=4 Tax=Cyanobacteria RepID=B2IV00_NOSP7 Length = 381 Score = 215 bits (546), Expect = 4e-54, Method: Composition-based stats. Identities = 89/386 (23%), Positives = 144/386 (37%), Gaps = 68/386 (17%) Query: 55 PMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDG 114 RGIWL T S+ +Q + + +D L G N VF V Sbjct: 5 ETRGIWLTT----------------TDSKVLRSKQRIAEAMDLLAETGFNVVFPVVWNKA 48 Query: 115 TALWPSKILPWSDLMTGKIGENP---GYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKP 171 L+PS+ + T + +P G DPL+ ++ EA + G+KV WF S Sbjct: 49 VTLYPSQTMQ----ETFGVEIDPMSVGRDPLEEVVVEARRVGLKVIPWFEYGFASSYNLN 104 Query: 172 GTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDG 231 G + L ++P + L+ +VQ++ ++V EVV Y VDG Sbjct: 105 GGV-----LLQKKPEWAARDFNGNLLNKNGFEWLNALDSQVQEFFLNLVLEVVKNYDVDG 159 Query: 232 VQFDDYFYTESPGSRLNDNETYRKY----------GGAFASKADWRRNNTQQLIAKVSHT 281 VQ DD P D T +Y WR + +A++ Sbjct: 160 VQGDD-RLPAFPCEGGYDEGTVSRYRQEYDRNPPQNPKDRQWLQWRADILTDFLARLYGE 218 Query: 282 IKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIY 341 +K++ P + ++P + A+ E D+ W+++G++D I PQIY Sbjct: 219 VKAVNPNLLVAIAP--------------NIHDW-AFQEYLQDSPTWLKRGIVDMIQPQIY 263 Query: 342 -WPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQ 400 F A D L T RL GI K+G I P++++ Sbjct: 264 RRDFGSYCAIADKLVSQ--QFTDATLPRLAPGI-LMKLGSYC-ISPEYLVQA-------- 311 Query: 401 LDLNDAVPEISGTILFREDYLNKPQT 426 ++ N + I G + F + L + Sbjct: 312 IEYNRQL-GIQGEVFFFYEGLRENNN 336 >UniRef50_C2FS67 FenI family protein n=1 Tax=Sphingobacterium spiritivorum ATCC 33300 RepID=C2FS67_9SPHI Length = 327 Score = 214 bits (545), Expect = 5e-54, Method: Composition-based stats. Identities = 75/206 (36%), Positives = 112/206 (54%), Gaps = 10/206 (4%) Query: 221 AEVVSRYPVDGVQFDDYFYTESPG--SRLNDNETYRKYGGAFASKADWRRNNTQQLIAKV 278 +VV Y VDG+ FDDYFY + L D T+ ++G FA+ DWRRNN LI + Sbjct: 1 MDVVKNYDVDGIHFDDYFYPYPDARNTALPDAPTFHQFGRGFANIHDWRRNNVDLLIRDL 60 Query: 279 SHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAP 338 IK KP +++G+SP G+W N+ +P GS+T G + Y YAD +W+++G +DYI P Sbjct: 61 GIAIKKEKPFIKYGISPFGIWDNKRDNPDGSNTSGLSGYRTLYADGVKWMKEGWIDYINP 120 Query: 339 QIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELK 398 QIY+PF+ AA +++L +WW Y+G Y+V ++ P W G +P+ Sbjct: 121 QIYFPFNNRAAAFEILLEWWEKHT--YGRHFYVGHGAYRV---TEKRPGWTDKGQIPKQV 175 Query: 399 KQLDLNDAVPEISGTILFREDYLNKP 424 + L E+ G+I F L Sbjct: 176 RHL---RDQHEVQGSIYFSSKSLMDN 198 >UniRef50_Q8YLM8 Alr5270 protein n=12 Tax=Cyanobacteria RepID=Q8YLM8_ANASP Length = 420 Score = 213 bits (541), Expect = 1e-53, Method: Composition-based stats. Identities = 86/429 (20%), Positives = 150/429 (34%), Gaps = 89/429 (20%) Query: 39 PPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHL 98 P + + ++ +RG+WL V+ S + +D L Sbjct: 27 PSFSNYEQKSNLPTTTEIRGVWLTNVA----------------SGVLFVPWGINRAIDQL 70 Query: 99 QRLGINTVFFQVKPDGTALWPSKILPW------SDLMTGKIGENPGYDPLQFMLDEAHKR 152 L NT++ V G + S L+ G G D L ++ A + Sbjct: 71 SALNFNTIYPVVWNRGYTFYKSSTAKSVTGSDTQPLLNFVHG---GQDVLAKIVALAKPK 127 Query: 153 GMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRF--------- 203 + V WF G + +S ++++ + + T + Sbjct: 128 NLSVIPWFE---------YGFMAPPDSVIAKRHPEWLTNGQGGVITISEMLPEESDNDPT 178 Query: 204 ----VLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGA 259 L+P PEVQ +I S+++EVV+ Y +DG+Q DD+F P D T Y Sbjct: 179 NKLVWLNPLHPEVQKFILSLISEVVTNYHIDGIQVDDHFG--MPVQFGYDPYTTELYQKE 236 Query: 260 FAS-----------KADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLG 308 WR N + + +VS +K IKP V+ +SP Sbjct: 237 HKGKSPPRNHLDAEWMKWRANKITRFMTQVSQVVKEIKPSVKVSLSP------------- 283 Query: 309 SDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTR 368 A AY D W++ GL+D + Q+Y + + + A + T+ Sbjct: 284 --NSQAFAYKYYLQDWANWIKTGLVDELILQVY---RNDKSSFVYELEQPAVKLARTQIP 338 Query: 369 LYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQ 428 + IGI+ + P KIE ++++Q+ G F + L T + Sbjct: 339 VAIGISTGTLRSPVKIE----------QIREQVQAVRD-RSFFGISFFYWESLWGYITPE 387 Query: 429 AVSYLQSRW 437 + Y + + Sbjct: 388 SPPYRRQVF 396 >UniRef50_B4VTS6 Putative uncharacterized protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VTS6_9CYAN Length = 884 Score = 212 bits (539), Expect = 3e-53, Method: Composition-based stats. Identities = 73/383 (19%), Positives = 145/383 (37%), Gaps = 69/383 (18%) Query: 70 PPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLM 129 P + ++ + T ++Q ++ D+L + G NTVFF+ ++PS++ P + + Sbjct: 400 PEIRAIWLDRGTIVKAKRKQDLVKLFDNLAKAGFNTVFFETVNASYPIYPSQVAPEQNPL 459 Query: 130 TGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVY 189 G+DPL+ ++ AH+RGM++HAW + + + + P+ V Sbjct: 460 VR------GWDPLEAAVELAHERGMELHAWVWIFAAANQRHNALLNQP----LDYPSPVL 509 Query: 190 VQHRDW---------IRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYT 240 H DW + + DP PEV++++ +++ E+ +RY VDG+Q D Y Sbjct: 510 AAHPDWAIFDKQGRLFAPNTRKAFFDPAHPEVREYLMALLEEIATRYDVDGIQLDYIRYP 569 Query: 241 -------ESPGSRLNDNETYRKYGGA------------FASKADWRRNNTQQLIAKVSHT 281 ++ G + + +++ G + ++R +A VS Sbjct: 570 FQDPRVNQTYGYGVAARQQFKERTGVDPIEVYPRDRTLWQQWTEFRIRQVDSFVASVSAR 629 Query: 282 IKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIY 341 + S +P + + P R + W +QG +D + P Y Sbjct: 630 LLSQRPDLILSAAVF------PLPPAERQQR-------LQQNWEEWAKQGYIDLVVPMTY 676 Query: 342 WPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQL 401 + L + R L GI + + ++ Q+ Sbjct: 677 ALDTEGLHS---LTQPLLTESTLNRVLLIPGIRLLNLPDVVAVD--------------QI 719 Query: 402 DLNDAVPEISGTILFREDYLNKP 424 L +P +G +F + LN+ Sbjct: 720 QLLRDLPA-NGYAVFAVENLNEN 741 >UniRef50_A0YRE2 Putative uncharacterized protein n=1 Tax=Lyngbya sp. PCC 8106 RepID=A0YRE2_9CYAN Length = 574 Score = 208 bits (528), Expect = 4e-52, Method: Composition-based stats. Identities = 78/424 (18%), Positives = 150/424 (35%), Gaps = 82/424 (19%) Query: 18 LVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNI 77 ++ ALL +S+ P+ + P + +RG+WL +D Sbjct: 175 FLSQALLKAEEESSVPKEYLVKAVEIDAP------NGEIRGVWLT---NID--------- 216 Query: 78 SNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENP 137 S ++++ +D L L NT++ V G +PS+++ ++ ++ P Sbjct: 217 ----SDVLFSPTSVVEAIDSLSELNFNTLYPVVWNRGFTQFPSQVMK--RIIGTELDPAP 270 Query: 138 ---GYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRD 194 G D LQ ++ +A + M V WF G + +S Q + +++ Sbjct: 271 ELAGRDVLQEIITQAKAKNMSVMPWFE---------FGFMVPQDSQFLQSRPNWITTNKE 321 Query: 195 WI----RTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDN 250 I R +P P+VQ +I ++ EVVS+Y +DG+QFDD+F P D Sbjct: 322 GIPFVKEEDKYRVWFNPFNPQVQQFILDLIVEVVSKYDIDGIQFDDHFG--LPFELGYDE 379 Query: 251 ETYRKYG-----------GAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVW 299 T + Y WR + + ++ +K KP +SP Sbjct: 380 FTSKLYQRENDGKLPPSDPKDQDWVKWRADKLTDFMMRLFWVVKDYKPDCIISLSP---- 435 Query: 300 RNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWA 359 + AYD D W + G ++ + Q+Y + + Sbjct: 436 -----------NPKSYAYDNYLQDWPTWEQSGFIEELVLQVYRD---DPKAFKADLEAAE 481 Query: 360 DVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFRED 419 + + +GI + + +++Q+ + + +G F + Sbjct: 482 VLNAKVNIPVAVGILTGLKNQSVPLST----------VQEQVAESRR-RKFAGVSFFFYE 530 Query: 420 YLNK 423 L Sbjct: 531 TLKS 534 >UniRef50_B9XI64 Putative uncharacterized protein n=1 Tax=bacterium Ellin514 RepID=B9XI64_9BACT Length = 1083 Score = 207 bits (527), Expect = 5e-52, Method: Composition-based stats. Identities = 99/441 (22%), Positives = 156/441 (35%), Gaps = 85/441 (19%) Query: 6 RNKKLTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVS 65 RN ++ IR I+ L +L +Q R W A V Sbjct: 8 RNVRMRIRCLVIMAGLWFVLAI----------------------SSPAQEFRAAW-ADVF 44 Query: 66 RLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPW 125 + + VN T + ++ + ++ +G+N+ A W S ILPW Sbjct: 45 HVGMGSQTEVNNMVATLVSGHYNAVIVQVVGYMDGIGVNS--------HGAHWKSNILPW 96 Query: 126 SDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNP-----YRVSVNTKPGTIRELNST 180 S +T G+DPL + +AH G++VHAW YRVS P L + Sbjct: 97 SPRVTA------GFDPLAALCAQAHANGIEVHAWLGGSAGAMYRVSTAWPPAGNATLTAH 150 Query: 181 LSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDD---- 236 A + + LD G P+ Q++I SIV E+V+ YP+DG+ +DD Sbjct: 151 PEWFIAPLANSEGGAPVLVDGNYDLDMGSPDAQEYIVSIVRELVTNYPIDGINWDDELNN 210 Query: 237 ------YFYTESPGSRLNDNET--YRKYGGAFAS-------KADWRRNNTQQLIAKVSHT 281 + Y + ++ YR+ G + +++RR +L+A+V Sbjct: 211 AGYAAGFGYPALSQTNYPNSGLGRYRRNTGYVGTPPNTDTAWSNYRRRFKNELMARVQAE 270 Query: 282 IKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIY 341 I+SIK + S P G Y + D ++ G LD + PQ Y Sbjct: 271 IQSIKTNPRQPLRHTSAALAYSPYPTSCTFAGLVPYT-YFCDWAGMLQNGWLDAVIPQTY 329 Query: 342 ---------------WPFSR----SAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPS 382 W ++R Y A+++ TR+ G A Y G P Sbjct: 330 SLGTFTNWANFSASCWQYNRQIFPGIGAYLNTNASIANMIGYTRSIGLKGNAIYSYGVPH 389 Query: 383 ----KIEPDWMINGGVPELKK 399 E DW Sbjct: 390 TNFVPAESDWWAYAAANVYTN 410 >UniRef50_B7JXY5 Putative uncharacterized protein n=9 Tax=Cyanobacteria RepID=B7JXY5_CYAP8 Length = 427 Score = 206 bits (525), Expect = 1e-51, Method: Composition-based stats. Identities = 94/432 (21%), Positives = 156/432 (36%), Gaps = 94/432 (21%) Query: 21 LALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNP 80 L L++ S + + PA S+ P+ +RG+WL + Sbjct: 27 LFLVIFSLSVVLILATLQYPAQSRTPS-------EIRGVWLTNI---------------- 63 Query: 81 TSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWS-DLMTGKIGENPGY 139 S Q ++ D + L++L NT++ V G L+PS + G Sbjct: 64 DSEVLFSQNSLSDGIRTLKQLNFNTLYPTVWNWGHTLYPSPVAKKVIGTPLDPTEGLQGR 123 Query: 140 DPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLS-QQPASVY-VQHRDWIR 197 D LQ ++D+ H+ M V WF G + +S L+ + P + Q+ D I Sbjct: 124 DMLQEIIDQGHQANMAVIPWFE---------FGFMAPADSQLAIKYPQWLTERQNGDKIW 174 Query: 198 TSGD---RFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYR 254 G+ R L+P PEVQ +ITS+V E+VS Y +DG+QFDD+F P D+ T + Sbjct: 175 LEGNVHKRVWLNPLKPEVQQFITSLVTEIVSNYSIDGIQFDDHFGI--PFDFGYDDFTLQ 232 Query: 255 KYGGAFAS-------------------------KADWRRNNTQQLIAKVSHTIKSIKPGV 289 Y WR N + ++ IK+I P V Sbjct: 233 LYQQEHQGKLPPKPPQNVKTENNCSINSQEWKEWTQWRANKITGYMTELFKAIKTINPNV 292 Query: 290 EFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAA 349 VSP + + AD ++W +GL++ + Q+Y + Sbjct: 293 IVSVSP---------------NPQPFSVNCYLADWQQWERRGLVEELVLQVY---RNNLN 334 Query: 350 RYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPE 409 + +GI G P + ++K Q++ + Sbjct: 335 SFKQELSRPEVQQAKKHIPFGVGIISGLKGRPV----------SMKQIKSQVETTRQQ-K 383 Query: 410 ISGTILFREDYL 421 +G F + L Sbjct: 384 FTGVSFFFYESL 395 >UniRef50_Q7NJN0 Glr1802 protein n=1 Tax=Gloeobacter violaceus RepID=Q7NJN0_GLOVI Length = 344 Score = 204 bits (520), Expect = 4e-51, Method: Composition-based stats. Identities = 88/385 (22%), Positives = 147/385 (38%), Gaps = 72/385 (18%) Query: 54 QPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPD 113 + +RG+WL V SR ++A++ +D L + G N VF V Sbjct: 2 ERLRGVWLTNV----------------GSRVLHSREAIVRAMDLLAQTGFNAVFPVVWNK 45 Query: 114 GTALWPSKILPWSDLMTGKIGENPGY-----DPLQFMLDEAHKRGMK-VHAWFNPYRVSV 167 G L+PS+I+ L I +P Y DPL +++ A + G++ V WF S Sbjct: 46 GFTLYPSRIM----LELFGIEIDPLYAEAKRDPLAEVIEAAGRAGIRMVIPWFEYGFASS 101 Query: 168 NTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRY 227 G L +PA ++ PEVQ+++ S++ EV + Y Sbjct: 102 PRSDG-----GHILLTRPAWTARVSGGAPLVKNGLVWMNALDPEVQNFVLSLMLEVATHY 156 Query: 228 PVDGVQFDDYFYTESPGSRLNDNET---YRKYGGAF-------ASKADWRRNNTQQLIAK 277 + GVQ DD P D T +R+ G+ WR + + + + Sbjct: 157 DIVGVQGDD-RLPALPVEGGYDPRTVELFRETTGSDPPGWASEPGWVQWRADRLTEFLGR 215 Query: 278 VSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIA 337 + IKS++P + ++P S P + + D W +G D + Sbjct: 216 LYTQIKSVRPELLLSLAP-------SVYP--------FSLNHYLQDVAEWARRGWFDLLH 260 Query: 338 PQIYWP-FSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPE 396 PQ+Y F + D + D R+ GIAF G + GV + Sbjct: 261 PQVYRENFGQYRREIDRFKR---DFPPEATGRIAPGIAFKANG----------VEIGVDD 307 Query: 397 LKKQLDLNDAVPEISGTILFREDYL 421 +++++ LN + G + F D L Sbjct: 308 VRRRIALN-CERGLGGEVFFYFDGL 331 >UniRef50_Q10YX0 Putative uncharacterized protein n=2 Tax=Cyanobacteria RepID=Q10YX0_TRIEI Length = 1099 Score = 204 bits (520), Expect = 4e-51, Method: Composition-based stats. Identities = 78/398 (19%), Positives = 141/398 (35%), Gaps = 77/398 (19%) Query: 46 PATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINT 105 P Q++ +R +WL T ++ + + L GINT Sbjct: 596 PTDGQRAGAEIRAVWL----------------DRGTIVRARSERGLAGVFNRLAAAGINT 639 Query: 106 VFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRV 165 VFF+ G ++PS + P + +T +DPL+ + AH+R M++H W + V Sbjct: 640 VFFETINAGYTIYPSNVAPRQNPLTTS------WDPLKAAVKLAHERNMELHPWIWAFAV 693 Query: 166 SVNTKPGTIRELNS----TLSQQPASVYVQHRDWIR-TSGDRFVLDPGIPEVQDWITSIV 220 + + +S +S P+ V R R + +DP PEV+ ++ +I+ Sbjct: 694 GNKAHNQALGQGDSYLGPVISAHPSWVMTDKRGRKRHPLDGKVYMDPANPEVRQYLLNII 753 Query: 221 AEVVSRYPVDGVQFDDYFYT--------ESPGSRLNDNETYRKYG-----------GAFA 261 E+ SRY VDG+ D Y S + N+ + YG Sbjct: 754 DEIASRYEVDGIHLDYIRYPFQNPERNFSYGYSTIARNQFRQLYGIDPMKISSRDRQNLW 813 Query: 262 SKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESY 321 +++ N +A S +K P + F V+ P R +D+ Sbjct: 814 RWTEFKINQVNSFVANTSSFLKKKYPRLIFSVAVF---------PFPRHQR----FDQIQ 860 Query: 322 ADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEP 381 D WV +D + P Y + R+ + + + T + + + E Sbjct: 861 QDWESWVMNEDIDLLTPMTY---ALDTNRFQRITQPLTNTGVLGSTLITPAVKLLNIPEI 917 Query: 382 SKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFRED 419 ++ Q+ +P G I+F + Sbjct: 918 VAVD--------------QIQAARDLPT-GGYIIFAAE 940 >UniRef50_Q8EPF4 Hypothetical conserved protein n=1 Tax=Oceanobacillus iheyensis RepID=Q8EPF4_OCEIH Length = 502 Score = 204 bits (519), Expect = 5e-51, Method: Composition-based stats. Identities = 76/383 (19%), Positives = 143/383 (37%), Gaps = 49/383 (12%) Query: 8 KKLTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRL 67 K++ + I++ + +++ S TP + A+ Q+ + +R W+ Sbjct: 2 KRIRTKGVTIVIVMLIVISSLTLTP---------FTTSEASFQKENPFIRAFWVQAF--- 49 Query: 68 DWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSD 127 + + + +D + + +NT+ QV A + S +LP Sbjct: 50 --------------EPGLKTPEEIDELVDDVHKANMNTIIAQVSRRHDAYYQSDVLP--- 92 Query: 128 LMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPAS 187 T G+DPL ++L +AH++G++VHAW + + + + Sbjct: 93 -FTEDPSVPEGFDPLGYLLTKAHEKGIEVHAWVVVGPMWHSVYGDAPSDPTHIWNLHGPD 151 Query: 188 VYVQHRDWIRTSGD----RFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESP 243 + +G+ + LD G PE ++ + +V ++ Y VDGV D Y E Sbjct: 152 AQEESWATEDYNGNVPYWQPYLDLGHPEARNHVVDMVNDIAKNYEVDGVHLDYIRYPEDG 211 Query: 244 -GSRLNDNETYRKYGG-------AFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSP 295 G + + G W+ LI +V + ++ VE Sbjct: 212 KGYNATSLARFHEETGRTDRPPVNDQEWIAWKVEQVDSLIKRVYTELLTVDSDVELS--- 268 Query: 296 AGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFS--RSAARYDV 353 A V H+P + ++ + + WV++G LDY Y + R A R+D Sbjct: 269 AAVLSWGFHNPSNTHWWNMDPVQRAHQNWKEWVQEGYLDYAYVMNYDSDADPRRALRFDQ 328 Query: 354 LAKWWADVVKPTRTRLYIGIAFY 376 +W D+ P + IG A Y Sbjct: 329 WIEWQKDL--PRNRGIIIGPALY 349 >UniRef50_C1D2P2 Putative uncharacterized protein n=2 Tax=Deinococcus RepID=C1D2P2_DEIDV Length = 521 Score = 203 bits (517), Expect = 9e-51, Method: Composition-based stats. Identities = 82/413 (19%), Positives = 139/413 (33%), Gaps = 63/413 (15%) Query: 10 LTIRRPAILVALALLLCSCKSTPPESMV---------TPPAGSKPPATTQQSSQPMRGIW 60 +T + A+L A +LLL +C + P S + TP P QQ +RG+W Sbjct: 1 MTHKLTAVL-ATSLLLAACGTAPQSSDLDALDTQGVRTPGPHDSPRGRGQQ---ELRGLW 56 Query: 61 LATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPS 120 + + + + + +N +F QV G + Sbjct: 57 VDAFG-----------------PGMKTPAEIDVLVATARAMNVNVLFAQVGRRGDCYCNN 99 Query: 121 KILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRV----SVNTKPGTIRE 176 +P T G+DPL ++ +AH +G++VHAW + + T P Sbjct: 100 AAMPR----TNDPAVPAGFDPLADLITKAHAQGIQVHAWIITTAIWNSTTPPTDPAHAFN 155 Query: 177 LNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDD 236 + + D G+ ++LDPG P+ ++I ++ VV Y VDG+QFD Sbjct: 156 AHGLGKTGRDFWLMVKNDGTTRGGNDWLLDPGHPDAAEYIRNMYVSVVKNYDVDGIQFDR 215 Query: 237 YFYTES-----PGSRLNDNETYRKYG----------GAFASKADWRRNNTQQLIAKVSHT 281 YT+ P + + +Y + WR L+ + + Sbjct: 216 VRYTDFNPVGGPSNWGYNPTALERYRAETGATGMPLPGDPQWSAWRMQQVTNLVRETALA 275 Query: 282 IKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIY 341 +K+ KP V + + ++ + Y E D WV++G LD Y Sbjct: 276 VKATKPDVSVNAATI---TYGAGPANETEWLRSRPYTEVLQDWVTWVKEGYLDVNVMMNY 332 Query: 342 WPFSRSAARYDVLAKWWADVVK-----PTRTRLYIGIAFYKVGEPSKIEPDWM 389 A + W G A Y S + W Sbjct: 333 KRDFVPAQS--LWFDQWNQFAASLQRVAPDVHQVSGSAIYLNDIASSVNQVWK 383 >UniRef50_B1WZU0 Putative uncharacterized protein n=2 Tax=Cyanothece RepID=B1WZU0_CYAA5 Length = 421 Score = 203 bits (516), Expect = 1e-50, Method: Composition-based stats. Identities = 88/450 (19%), Positives = 157/450 (34%), Gaps = 86/450 (19%) Query: 7 NKKLTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSR 66 N++ + R ++ + +S+ S Q RG+WL V+ Sbjct: 2 NRQFFLWRNRLICIALTFIILLILFVSQSIFQ----SSGKVIASSIFQERRGVWLTNVA- 56 Query: 67 LDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWS 126 S + P S R +Q L +L NTV+ V G +PS + Sbjct: 57 -------SSVLFVPGSVNRAIKQ--------LSQLHFNTVYPVVWNRGHTFYPSSLAKEM 101 Query: 127 DLMTGKIGENPGY---DPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQ 183 + + N D L+ +++E+H+RG+ V WF + + + + Sbjct: 102 IGESQEPLLNWTRSNIDVLRVIIEESHQRGLAVIPWFEYGLMIPRSSLIAQKHPDWLTHS 161 Query: 184 QPASVYVQHRD---------------------WIRTSGDRFVLDPGIPEVQDWITSIVAE 222 Q +V +D + + + L+P PEVQ I ++ E Sbjct: 162 QQGTVNTFFQDELKTKNKKKSTNFLENWSQHSYQKRASQLVWLNPFHPEVQQLIKGLMLE 221 Query: 223 VVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAF-----------ASKADWRRNNT 271 ++ +Y VDGVQ DD+F P D T + Y A +WR Sbjct: 222 IIMQYKVDGVQLDDHFGI--PVELGYDPLTIKLYQQEHEGKNPPNDPYNAQWMNWRAKKL 279 Query: 272 QQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQG 331 + + TIK + P + +SP + +Y D + WV+QG Sbjct: 280 TAFMTDLVTTIKIVNPDILISLSPNSY---------------SFSYQNYLQDWKTWVKQG 324 Query: 332 LLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMIN 391 L+D + Q+Y ++ + + + + IGI + P KIE Sbjct: 325 LIDELVLQVY---RNDMDSFNRELQESTVKLARQKIPVSIGILSGTLNNPVKIE------ 375 Query: 392 GGVPELKKQLDLNDAVPEISGTILFREDYL 421 ++++Q++ G F + L Sbjct: 376 ----QIRQQVEKVRQQ-GFDGVSFFYWESL 400 >UniRef50_A0YS74 Putative uncharacterized protein n=2 Tax=Oscillatoriales RepID=A0YS74_9CYAN Length = 1005 Score = 201 bits (510), Expect = 6e-50, Method: Composition-based stats. Identities = 72/333 (21%), Positives = 122/333 (36%), Gaps = 67/333 (20%) Query: 46 PATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINT 105 P +++ +R +WL T + + D L + GINT Sbjct: 520 PIDGERAGAEIRAVWL----------------DRGTIVQARGEAGLAKIFDQLAQAGINT 563 Query: 106 VFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRV 165 VFF+ G ++P+++ P + +T G+DPL + AH+RGM++HAW + Sbjct: 564 VFFETVNAGYPIYPTRVAPQQNPLT------QGWDPLASGVKLAHERGMELHAWLWTFAT 617 Query: 166 SVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTS---------GDRFVLDPGIPEVQDWI 216 + + + S L V H DW + LDP EV+ +I Sbjct: 618 ANQRHNTLVNQPTSYL----GPVLTAHPDWANRDSRGRVWHERDGKAYLDPANREVRSYI 673 Query: 217 TSIVAEVVSRYPVDGVQFDDYFYT-ESPGSRLND------NETYRKYGGA---------- 259 +V E+V Y VDG+Q D Y + P N E +R+ G Sbjct: 674 LRLVGEIVHNYDVDGIQLDYIRYPFQDPNRNFNFGYGTAGREQFRQLTGVDPISVSPKDS 733 Query: 260 --FASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAY 317 + ++R ++ +VS ++ P V F V V+ + D + Sbjct: 734 QLWQQWVNFRVEQVSTMVREVSQLLRKQYPDVIFSV---AVFPHPEQDRIR--------- 781 Query: 318 DESYADTRRWVEQGLLDYIAPQIYWPFSRSAAR 350 + W +Q +D + P Y + R Sbjct: 782 -KIQQHWETWAQQNYVDLVVPMTYSLDTNRLQR 813 >UniRef50_B4WJG2 Putative uncharacterized protein n=1 Tax=Synechococcus sp. PCC 7335 RepID=B4WJG2_9SYNE Length = 453 Score = 199 bits (506), Expect = 2e-49, Method: Composition-based stats. Identities = 92/477 (19%), Positives = 148/477 (31%), Gaps = 100/477 (20%) Query: 12 IRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPA-------TTQQSSQPMRGIWLATV 64 I+R I L++ + P S++ P G +P + +RG+W+ V Sbjct: 16 IKRTGIFCVAVLVVF--LTGPLGSLLVPSTGGEPTTLDHLVGSKSSSLDSEVRGVWVTNV 73 Query: 65 SRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILP 124 + S + ++ L + NTV+ V G ++ S + Sbjct: 74 A----------------SSVFFMPWGIASTIEQLADMRFNTVYPVVWNRGQTIYRSDRMK 117 Query: 125 ---WSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPY------------------ 163 D+ +P DPL M+ H++G++V WF Sbjct: 118 EITQRDISPLVGLMHPREDPLAEMIRRGHQKGLRVIPWFEYGFMVPLQSRLAQAHPDWLT 177 Query: 164 -------RVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIR-----TSGDRFVLDPGIPE 211 R+S +T E L S + + R + L+P P Sbjct: 178 ARADGSQRLSEDTFVNGPIEETPELETASESAMARSKRLHRLLKSGAPSELGWLNPLHPN 237 Query: 212 VQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAF----------- 260 VQ I +V EV + Y VDG+QFDD+F P D T Y Sbjct: 238 VQALILDLVDEVTTYYDVDGIQFDDHF--SFPIEFGYDAFTVALYEAEHEGQLPPLDPAD 295 Query: 261 ASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDES 320 WR + + +K F +SP + AY Sbjct: 296 KDWIHWRAEKLSGFVNTLQKRVKETCSDCVFSLSP---------------NPASYAYQYY 340 Query: 321 YADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGE 380 D + W E+G LD + QIY ++ R + IGI G Sbjct: 341 AQDWQTWAEKGWLDELVVQIY---RNDLDQFAAELTKETLQSIRDRIPVSIGILTGTWGS 397 Query: 381 PSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQSRW 437 P E ++ +Q+ + SG F D L T + + + Sbjct: 398 PIAFE----------QISQQVISSRDH-HFSGVSFFYWDTLWSYFTPEPPQQRRQNF 443 >UniRef50_C7GZF2 Putative lipoprotein n=1 Tax=Eubacterium saphenum ATCC 49989 RepID=C7GZF2_9FIRM Length = 373 Score = 198 bits (502), Expect = 5e-49, Method: Composition-based stats. Identities = 97/416 (23%), Positives = 166/416 (39%), Gaps = 83/416 (19%) Query: 37 VTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLD 96 + + + + M+ +W VS LD+ + N+ R + ++ + Sbjct: 8 IEQKKAPQKTVKVVKFNTEMKAVW---VSFLDFQNLGLTNV-----REKTFKKNAEIMVK 59 Query: 97 HLQRLGINTVFFQVKPDGTALWPSKILPWSDLM-TGKIGENPG-----YDPLQFMLDEAH 150 +R GINT+FF V+ A + SK+ + T P YDPL+ + + AH Sbjct: 60 DAKRNGINTIFFHVRAFDDAAYKSKVFRAMRYLKTNASYAKPATSSFSYDPLKLVAEAAH 119 Query: 151 KRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIP 210 K G+++HAW NPYRV G + L P Sbjct: 120 KHGVQLHAWLNPYRV----------------------------------GYDYFLSPKSE 145 Query: 211 EVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFAS-KADW--- 266 + I V E+++ Y VDG+ FDDYFY G +++ ++Y A+ K D+ Sbjct: 146 YSTNRIIKAVNEILT-YKVDGIHFDDYFYHAKKGYYRLNSK--KQYSVNPATAKKDYSPS 202 Query: 267 ---RRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYAD 323 +R +LI +V+ K+ + F VSPAG N + D Sbjct: 203 SINKRRYVNKLIRRVN---KTTQGKALFSVSPAGNVDNCMNSG---------------VD 244 Query: 324 TRRWVEQ-GLLDYIAPQIYWPFSRSA-ARYDVLAKWWADVVKPTRTRL--YIGIAFYKVG 379 W+ G +D I PQIYW + A R + + ++ + ++ IG+A Y+ G Sbjct: 245 LTTWLSNDGYVDMIMPQIYWTDNWGASGRVKMFSSRLGQFMRKNKKKIPMVIGLALYRSG 304 Query: 380 EPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQS 435 E + W + G + Q+ + G LFR + L + + ++ V ++ Sbjct: 305 ERGLGDKGWSMRGSN--ISGQIKSIRRH-GLGGYCLFRFNNLYQGRCKKEVKNMRK 357 >UniRef50_A8YDR3 Genome sequencing data, contig C294 n=9 Tax=Chroococcales RepID=A8YDR3_MICAE Length = 875 Score = 195 bits (496), Expect = 3e-48, Method: Composition-based stats. Identities = 84/419 (20%), Positives = 142/419 (33%), Gaps = 87/419 (20%) Query: 46 PATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINT 105 P Q + +R +WL T ++ + D + GIN Sbjct: 392 PTNRQFAQPEIRAMWL----------------DRGTIVQAKNEEDLAKVFDRMAAAGINV 435 Query: 106 VFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRV 165 VFF+ ++PS++ P + +T G+DPL+ + AH+R M++HAW + Sbjct: 436 VFFETVNASYTIYPSQVAPEQNPLTR------GWDPLKVAVKLAHERNMEIHAWVWVFAA 489 Query: 166 SVNTKP----GTIRELNSTLSQQPASVYVQHRDWIRTSG---DRFVLDPGIPEVQDWITS 218 + + L LS+ + DP PEVQ+++ S Sbjct: 490 ANQAHNKVLEQPLNYLGPVLSRNSDWGATNKSGGSFDYSQGTKKAFFDPANPEVQNYLLS 549 Query: 219 IVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGA------------------- 259 + E+V Y VDG+Q D Y P N N+TY YG + Sbjct: 550 LYEEIVKNYDVDGLQLDYIRY---PFQNQNYNQTYG-YGKSSRWLFKQMTGVDPITLNPR 605 Query: 260 ---FASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAA 316 + ++ +++VS +K I+P ++ + PL R Sbjct: 606 GALWEQWTSFKIRQVDTFVSQVSTRLKQIRPQLKMSAAVF---------PLEQKER---- 652 Query: 317 YDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFY 376 + W + +D I Y + + + D + GI Sbjct: 653 LYRIQQNWEEWGQNQWIDIIFLMTY---ALDTGTLEDKTQSLFDRQIAGNALIIPGIRLL 709 Query: 377 KVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQS 435 KV + I+ QL +P SG LF + LN P Q ++ +Q Sbjct: 710 KVPDQVTID--------------QLQFIRNLPT-SGFALFATENLN-PNLQTILNRIQG 752 >UniRef50_B8HYQ9 Putative uncharacterized protein n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HYQ9_CYAP4 Length = 383 Score = 194 bits (494), Expect = 4e-48, Method: Composition-based stats. Identities = 83/406 (20%), Positives = 153/406 (37%), Gaps = 76/406 (18%) Query: 44 KPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGI 103 + PA S+Q RG+WL ++ + ++ + L L + G Sbjct: 25 RAPARPTASTQENRGLWLTSIGLAGLYHSTLLD----------------ETLSDLSQRGF 68 Query: 104 NTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPY 163 NT++ V G L+PS+++P + + D L + E ++G+++ WF Y Sbjct: 69 NTLYPAVWNRGQTLYPSRVVPAAFTLG---------DVLSTTVREGKQQGLRIIPWFE-Y 118 Query: 164 RVSVNTKPGTIRELNSTLSQQPASVYVQHRDWI-------------RTSGDRFVLDPGIP 210 + V + R+ L++ + + + T D VL+P P Sbjct: 119 GLKVTDRSVLARQHPDWLARDRNGRPYINPEPVNALPFPLKGLSRSVTGADHVVLNPIHP 178 Query: 211 EVQDWITSIVAEVVSRYPVDGVQFDDYFY-TESPGSRLNDNETYRKYGG-------AFAS 262 +VQ+ I + +VV RY VDG+Q DD+F G + +R+ G + Sbjct: 179 QVQNLIVKMFVDVVKRYNVDGIQIDDHFALPVQLGYDSYTRQRFRQEQGVEPPADPTDPA 238 Query: 263 KADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYA 322 +WR N +L+ K+S IK KP + F ++P A AY + Sbjct: 239 WMEWRANKLTELVGKISTAIKQQKPAIIFSIAP---------------NPPAFAYRTTLQ 283 Query: 323 DTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPS 382 D WV +G +D + Q+Y P ++Y G+ Y Sbjct: 284 DWPTWVRRGYVDEVVVQVYRPTVAEM------------EAIAADPQIY-GLQAYAPVSLG 330 Query: 383 KIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQ 428 + +L +++ + + + +G LF ++ P + Sbjct: 331 LYAGPGLKAKTGQQLDREVAVTRRL-KYNGFALFTWEFAIGPLARG 375 >UniRef50_Q2JQ39 Putative uncharacterized protein n=1 Tax=Synechococcus sp. JA-2-3B'a(2-13) RepID=Q2JQ39_SYNJB Length = 850 Score = 187 bits (474), Expect = 8e-46, Method: Composition-based stats. Identities = 79/408 (19%), Positives = 136/408 (33%), Gaps = 84/408 (20%) Query: 43 SKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLG 102 + P + +R IWL T + + D L + G Sbjct: 367 AHYPVDRLTALSEVRAIWL----------------DRSTIVEAGSEAGLAQIFDRLAQAG 410 Query: 103 INTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNP 162 +NTVFF+ G A+ PS++ P + +T G DPL+ + AH+RG+++HAW Sbjct: 411 LNTVFFETMNAGFAIHPSRVAPQQNPLTR------GRDPLRAAVRLAHERGLELHAWIWT 464 Query: 163 YRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRF---------VLDPGIPEVQ 213 V NT+ + E+N V H DW LDP P+V+ Sbjct: 465 LAVG-NTRHNLLPEIN-LPQDYIGPVLTAHPDWANLDNRGRLFPRGQPETWLDPANPQVR 522 Query: 214 DWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLN-------DNETYRKYGGA------- 259 ++ ++ E+V Y VDG+ D Y + + +++ G Sbjct: 523 SYLLALTRELVQDYQVDGIHLDYIRYPFQNAASRQVFGFGRAARQGFQQLSGVDPLELDP 582 Query: 260 ------FASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRG 313 + +R +++ ++ T +S+ P V + Sbjct: 583 LRDRSLWQLWTRYRTQQVNEVVEAIARTARSLNPRVILSAAVYA-------------LPK 629 Query: 314 AAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGI 373 + W++ G LD + P Y +R A L + ++V T + Sbjct: 630 QERLQRLQQNWEEWIQAGELDLLIPLTYAGNTRRLA---QLVQPNLEIVSRFSTLFVPSL 686 Query: 374 AFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYL 421 +N E QL + +P G LF L Sbjct: 687 NL--------------LNLPPVEFLDQLQVVRDLPT-GGFALFSVRQL 719 >UniRef50_C1D298 Putative uncharacterized protein n=1 Tax=Deinococcus deserti VCD115 RepID=C1D298_DEIDV Length = 628 Score = 186 bits (473), Expect = 1e-45, Method: Composition-based stats. Identities = 76/370 (20%), Positives = 123/370 (33%), Gaps = 57/370 (15%) Query: 35 SMVTPPAGSKPPATTQQSSQP------MRGIWLATVSRLDWPPVSSVNISNPTSRARVQQ 88 V PA + A ++++ P MRG+W+ Sbjct: 228 KSVQKPAATHQVAESRRAPGPLQTGPAMRGLWVDAFG-----------------PGFKTP 270 Query: 89 QAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDE 148 + + + + INT+F Q G +LP T +DPL +L + Sbjct: 271 GEVDRLIADARAMNINTLFVQAVKRGDCYCNGSLLPR----TEDPAVPAEFDPLADVLTK 326 Query: 149 AHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDW-IRTSGDRFV--- 204 AH G+KVHAW P VS + ++ +DW +R SG Sbjct: 327 AHAHGIKVHAWVIPTAVSNRAVRYPVTNPEHVVNAHGEG---DEQDWLMRNSGGSMWAGN 383 Query: 205 ---LDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGS---RLNDNETYRKYG- 257 LD G P+ + ++ + V + Y +DGVQ D Y + G+ + Y Sbjct: 384 DQQLDIGHPDARRYMVDAIQSVAAAYNIDGVQLDRVRYPDPSGTVQDWGYNPGAVAAYQA 443 Query: 258 ---------GAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLG 308 A WRR L+++VS ++S +PG V+ + Sbjct: 444 ESETTETPAPGDARWTAWRREQVNALVSEVSGAVRSARPGTVISVAAI---TYGAGPRTR 500 Query: 309 SDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKP--TR 366 Y E D W+ G +D + Y + + D W K Sbjct: 501 EAFASTRTYAEVLQDWPLWLADGNVDLVVLMNYKREAHAGQARDF--DSWNRFAKSVKAG 558 Query: 367 TRLYIGIAFY 376 ++ G A Y Sbjct: 559 GQVAAGTALY 568 >UniRef50_B0MQ12 Putative uncharacterized protein n=1 Tax=Eubacterium siraeum DSM 15702 RepID=B0MQ12_9FIRM Length = 990 Score = 185 bits (470), Expect = 2e-45, Method: Composition-based stats. Identities = 90/423 (21%), Positives = 155/423 (36%), Gaps = 72/423 (17%) Query: 5 SRNKKLTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATV 64 RNK IR A +++ +LL + E T S + +QS Q I + Sbjct: 1 MRNK--FIRIMAGVLSAFMLLSQLTAVAEEK--TNENTSASAESAEQSKQTTPQIAEPKL 56 Query: 65 SRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILP 124 + + + +N+ + A + KLD L G+N V+ + Sbjct: 57 TLSNELKATVINLGDFA--AEKFGENFSKKLDTLIAYGMNGVYINPYGKDGTYY------ 108 Query: 125 WSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQ 184 N D L+ L+ A K+GM+ + +F ++N T++ Sbjct: 109 -------TTNMNKSGDRLEKALEAATKKGMQRYVYF---------------DINKTMAAC 146 Query: 185 PASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPG 244 P + D++ S + +Y +G+ ++ T Sbjct: 147 PDG----------------------EDCYDYLVSEAHKFALKYRCNGIILTGFYGT---- 180 Query: 245 SRLNDNETYRKY--GGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNR 302 N+N Y +Y G+ +W + + + VS I+ + G+ VW N Sbjct: 181 ---NNNSAYEEYMKNGSGIGYKNWLYDTVEYKFSTVSGVIRLSDNSIAVGIDAKDVWANA 237 Query: 303 SHDPLGSDTRG-AAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADV 361 S + GSDT A+ YADT+ +VE+GL D+I ++ + WW++V Sbjct: 238 SKNKKGSDTSAKYTAFYNGYADTKSFVEKGLTDFIVVNASGSLDNETVGFENVCSWWSNV 297 Query: 362 VKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYL 421 K + YI V KI D G +L KQL D + SG++ + E L Sbjct: 298 AKSAKIPFYI------VHHNEKIGTDEDGWGVEDQLLKQLAKADELDNYSGSVFYSEKSL 351 Query: 422 NKP 424 + Sbjct: 352 EEN 354 >UniRef50_UPI0001C16380 Protein of unknown function DUF187 n=1 Tax=Raphidiopsis brookii D9 RepID=UPI0001C16380 Length = 289 Score = 179 bits (453), Expect = 2e-43, Method: Composition-based stats. Identities = 58/259 (22%), Positives = 104/259 (40%), Gaps = 41/259 (15%) Query: 42 GSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRL 101 S P T Q S + +RG+W+ + + + D + L+RL Sbjct: 42 HSVPSVTAQMSREEIRGVWVTS----------------NDLNVFKDRDQVKDAVTKLRRL 85 Query: 102 GINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFN 161 NT++ V G ++PS + D+ + G+D L ++++AH + + WF Sbjct: 86 NFNTIYPVVWNSGYVMYPSNVAKSLDIQPFVFRGSDGHDILADIINQAHSQNLLAIPWFE 145 Query: 162 PYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDW----IRTSGDRFVLDPGIPEVQDWIT 217 G + L+ + RD + +G+ L+P P+VQ +I Sbjct: 146 ---------FGFMTPNTGELALNKPEWLTKMRDGSTVSMSAAGEVSWLNPFHPQVQKFII 196 Query: 218 SIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFAS----------KADWR 267 ++ E+ + Y +DG+QFDD+ T P D+ T Y WR Sbjct: 197 DLLVELTNNYDIDGIQFDDH--TSLPHQFGYDDYTVNLYKQETGKNPPANSQDSEWVAWR 254 Query: 268 RNNTQQLIAKVSHTIKSIK 286 N + + +++HT+K IK Sbjct: 255 ANKITEFMVRLNHTVKQIK 273 >UniRef50_C6PCP2 Putative uncharacterized protein n=1 Tax=Thermoanaerobacterium thermosaccharolyticum DSM 571 RepID=C6PCP2_CLOTS Length = 1117 Score = 178 bits (452), Expect = 3e-43, Method: Composition-based stats. Identities = 74/396 (18%), Positives = 141/396 (35%), Gaps = 82/396 (20%) Query: 50 QQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQ 109 R +W+ P +++ ++ LD L+ + INT++ Sbjct: 322 ASEKVESRAVWIR-------PKEKNLD-------------EVVRNLDMLKSININTIYLD 361 Query: 110 VKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNT 169 G ++P+ S T + G+D L + EAHKRGM V+AW + + + Sbjct: 362 TFWSGYTIYPTN----SKY-TSQNPIYGGFDVLDAYIKEAHKRGMVVYAWTENFLIGTSD 416 Query: 170 KPGTIRELNSTLSQQPASVYVQHRDW---IRTSG-DRFVLDPGIPEVQDWITSIVAEVVS 225 + + ++P + V + + + G + L+P IPE +D+++ + E+ S Sbjct: 417 ----VSDGGPIKKEKPEWLMVSRKGYNYTLDKYGIKYYYLNPAIPEARDFLSELYKEIAS 472 Query: 226 RYPVDGVQFDDYFYT---ESPGSRLNDNET---YRKYGGAFASKAD-----------WRR 268 +Y +DG+QFD + + D+ T +++Y G + +R Sbjct: 473 KYDIDGIQFDYIRFPNSNDYSNDFGYDDYTRNLFKQYAGVDPKYLNVNSDMWQLWNYFRM 532 Query: 269 NNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWV 328 N + V ++ IKP ++ A VW N P + + D++ W Sbjct: 533 NIVNTFVYSVVSELRMIKPEIKI---AADVWPNYDTAPS-----------DIFQDSKDWT 578 Query: 329 EQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDW 388 + +D + P Y ++A + + Sbjct: 579 LKNYIDTLNPMSY------NMSVSLVANDLKNTLDFASGH-----------SNVIPAIGT 621 Query: 389 MINGGVPELKKQLDLNDAVPEISGTILFREDYLNKP 424 I L KQ++ SG LF + L K Sbjct: 622 FIGTDNVTLLKQIEAIRD-NNASGVGLFEFESLFKN 656 >UniRef50_Q1IWF6 Putative uncharacterized protein n=3 Tax=Deinococcus RepID=Q1IWF6_DEIGD Length = 536 Score = 178 bits (451), Expect = 4e-43, Method: Composition-based stats. Identities = 82/368 (22%), Positives = 128/368 (34%), Gaps = 42/368 (11%) Query: 25 LCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRA 84 + + P TP + P +RG+WL + P + Sbjct: 36 AAASVTPVPPQAATPAILAPVPTPVPAPISSVRGLWL--------------DAFGPGLKT 81 Query: 85 RVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQF 144 Q + ++ LG+NT+F Q L LP ++T E +DPL Sbjct: 82 AAQ---VRRSVEDAASLGVNTLFVQAIRRADCLCRRSSLP---VITDADLEK-DFDPLAE 134 Query: 145 MLDEAHKRGMKVHAWFNPYRVSV----NTKPGTIRELNSTLSQQPASVYVQHRDWIRTSG 200 + AH RGM+V AW + S N+ P + + + AS + D G Sbjct: 135 VTRLAHARGMRVIAWVSVTGASNLRVPNSNPAHVSRQHGAQAGA-ASWLSRRPDGSWQEG 193 Query: 201 DRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNET---YRKYG 257 LDP IP D++ V +V YPVDGVQ D Y + G+ D +T YR Sbjct: 194 ADGWLDPAIPAAADFMVGGVVSLVKHYPVDGVQLDRIRYPD-GGNWGYDPKTLARYRAET 252 Query: 258 GAFAS-------KADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSD 310 GA + DW+R L+ +++ +K+++P + + Sbjct: 253 GAKGTPAPDDARWRDWKREQVTLLVRRIALEVKAVRPTAWVTAATIT-YGPPPPPGDLDA 311 Query: 311 TRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTR-- 368 Y + D W+ +GLLD Y + W R Sbjct: 312 FHKTRTYLDVLQDWPTWMREGLLDLNVLMNYKRDAVGEQ--GAWLDGWNAFAASVRGDAE 369 Query: 369 LYIGIAFY 376 + G A Y Sbjct: 370 VAGGTALY 377 >UniRef50_B5W1E7 Putative uncharacterized protein n=2 Tax=Arthrospira RepID=B5W1E7_SPIMA Length = 910 Score = 178 bits (450), Expect = 6e-43, Method: Composition-based stats. Identities = 68/329 (20%), Positives = 113/329 (34%), Gaps = 59/329 (17%) Query: 46 PATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINT 105 P ++S +R +WL T A + + D L GINT Sbjct: 429 PTDGERSGAEIRAVWL----------------DRGTIVAARGEAGLAQIFDRLADAGINT 472 Query: 106 VFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRV 165 VFF+ G ++PS++ P + +T G+DPL + A RGM++HAW + + Sbjct: 473 VFFETVNAGYTIYPSRVAPSQNPLT------VGWDPLAAAVKLAKARGMELHAWVWVFAI 526 Query: 166 SVNTKPGTIRELNS----TLSQQPASVYVQHRDWIRTSGDRF-VLDPGIPEVQDWITSIV 220 + +R+ +S LS P + ++ DR LDP EV+ ++ +V Sbjct: 527 ANQRHNALLRQPDSYLGPVLSAYPEWANLDNQGRTWHENDRKAYLDPANREVRSYLLRLV 586 Query: 221 AEVVSRYPVDGVQFDDYFYTESPGSRLND-------NETYRKYGGA------------FA 261 E+ Y VDG+ D Y +R + +R G + Sbjct: 587 GEIAHNYQVDGIHLDYIRYPFQDANRNFNFGYGTASRTQFRDLTGVDPISLTPRDGVLWQ 646 Query: 262 SKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESY 321 +R + + V + + P V + P + Sbjct: 647 QWTQFRSDQVTSFVRDVRQLLSTNYPNVILSAAVF-------PHPETERIA------KIQ 693 Query: 322 ADTRRWVEQGLLDYIAPQIYWPFSRSAAR 350 W QG LD + P Y + R Sbjct: 694 QHWEVWARQGYLDLLVPMTYSLDTNRLQR 722 >UniRef50_UPI0001AF05D8 hypothetical protein SghaA1_34850 n=1 Tax=Streptomyces ghanaensis ATCC 14672 RepID=UPI0001AF05D8 Length = 522 Score = 176 bits (445), Expect = 2e-42, Method: Composition-based stats. Identities = 63/328 (19%), Positives = 113/328 (34%), Gaps = 37/328 (11%) Query: 75 VNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIG 134 V+ NP Q A+++ + N + Q + + P +D I Sbjct: 43 VDAFNPGIFTPAQVAALVE---DALDVNANALIVQTARRYDCFCNNALYPRTD---AAIA 96 Query: 135 ENPGYDPLQFMLDEAHKRGMKVHAWFNPY----RVSVNTKPGTIRELNSTLSQQPASVYV 190 P YDPL+ ++ + H G++VHAW N R + P + + + Sbjct: 97 PEP-YDPLEEIVRQGHAAGLQVHAWVNVNTMWNRTTPPRSPEHVFNQHGPGATGADRWLN 155 Query: 191 QHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDN 250 + D G +DPG P D+I V +V Y VDGV D Y + + + + Sbjct: 156 KKADGQELVGANAYVDPGHPAAVDYIVRGVQSIVRNYDVDGVNLDYVRYPDGSSTTTHSD 215 Query: 251 ETYRKYG---------------GAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSP 295 Y + + + +DWRR+ L+ K+ + + P + Sbjct: 216 WGYNEVSVARFQQATGRTDIPLPSDTAWSDWRRSQVTNLVRKIYLGVWEVDPQARLSMDA 275 Query: 296 AGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIY---WPFSRSAARYD 352 + Y E D W+++G++D Y W ++ Sbjct: 276 I---TYGHGPQAVGGWQATRTYAEVLQDWAGWLDEGIMDTAVTMNYKRNWDPDQAL---- 328 Query: 353 VLAKWWADVVKPTRTRLYI-GIAFYKVG 379 + ++W + R + G A Y G Sbjct: 329 MFSEWSEFLADHQGERQAVNGPALYLNG 356 >UniRef50_A2C8D8 DUF187 n=12 Tax=Cyanobacteria RepID=A2C8D8_PROM3 Length = 410 Score = 175 bits (444), Expect = 3e-42, Method: Composition-based stats. Identities = 75/321 (23%), Positives = 108/321 (33%), Gaps = 58/321 (18%) Query: 43 SKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLG 102 S+P A + Q+S P D P+ V ++N S + M + L R G Sbjct: 35 SQPCAISAQASTPPSVAQSGLRHLSDHLPIVGVWMTNSPSPLYYSRNLMHKAVKDLYRAG 94 Query: 103 INTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNP 162 ++ V G+ S P + DP+ + E H RGMKV WF Sbjct: 95 FTALYLNVWSRGSTFHRSNYAPVEGPLQKAGL---ALDPICTLRREGHARGMKVVPWFE- 150 Query: 163 YRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGD------------RFVLDPGIP 210 + A V H DW+ D R L+P P Sbjct: 151 ---------------YGLMEPDDAEVVKLHPDWVLARADGNPVVKMHGNHKRVWLNPAHP 195 Query: 211 EVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFAS-------- 262 EV+ +V EV+ R +DGVQ DD+F P D T Y S Sbjct: 196 EVRARFIGVVIEVMKRCKMDGVQLDDHF--AWPVQLGYDPYTVALYQQETGSLPPRDYSD 253 Query: 263 --KADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDES 320 WRR L+ ++ ++ K V ++P G +R AY+ Sbjct: 254 RFWMQWRRRKLTGLLRELRQALEKEKLPVNISLAP-GPFR--------------FAYNNW 298 Query: 321 YADTRRWVEQGLLDYIAPQIY 341 D W L+D + Q Y Sbjct: 299 LQDWELWTVGKLIDELVVQNY 319 >UniRef50_C5CIL6 Putative uncharacterized protein n=1 Tax=Kosmotoga olearia TBF 19.5.1 RepID=C5CIL6_KOSOT Length = 993 Score = 175 bits (443), Expect = 3e-42, Method: Composition-based stats. Identities = 82/390 (21%), Positives = 140/390 (35%), Gaps = 67/390 (17%) Query: 61 LATVSRLDWPPV----SSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTA 116 L T S P ++ + + A + + ++ L LG N + +V GT Sbjct: 314 LTTFSYSLLPSRVVQTRAIWLDHGAMAATGGPENLRKTIEKLAHLGFNVLLPEVIWKGTT 373 Query: 117 LWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRE 176 + P K+ + K E DPL+ +++EAHK M+VHAW + V G + E Sbjct: 374 ISP-KLTVYPQNEEFKDWEE---DPLEIIIEEAHKYDMEVHAWTWTFAV------GYLGE 423 Query: 177 LNSTLSQQPASVYVQHRDWIRTSGDRF----VLDPGIPEVQDWITSIVAEVVSRYP-VDG 231 N +++ P V GD P+ ++ I S + EVV +YP +DG Sbjct: 424 SNELMNKNPHLVEKDRFGRTFAEGDNVKRAGFFSHSNPKARELIKSAIKEVVEKYPEIDG 483 Query: 232 VQFD----------DYFYTESPGSRLN-----DNETYRKYGGAFASKADWRRNNTQQLIA 276 + D D+ Y + D KY WR N + Sbjct: 484 INLDYIRYENSDIIDHGYDDYSVKAFKEETGIDPFKIEKYTKEEVLWHLWRENQVTSFVK 543 Query: 277 KVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYI 336 ++S +K+IKP + + P G+ + + W + G +D + Sbjct: 544 EISEELKAIKPTIIISADVINL-------PTGAQ-------HKFKQNWVLWAKNGYVDAL 589 Query: 337 APQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPE 396 P Y P S R + A+ + LY G++ + +N Sbjct: 590 FPMAYTP-SSDDLRIMIEAE---KSAVSGKVFLYPGMSLF-------------VNRDTES 632 Query: 397 LKKQLDLNDAVPEISGTILFREDYLNKPQT 426 + KQL + E+ G +F Y++ Sbjct: 633 VLKQLKILSE--ELDGLSMFALSYIDDFDN 660 >UniRef50_Q3AJ74 Putative uncharacterized protein n=3 Tax=Chroococcales RepID=Q3AJ74_SYNSC Length = 390 Score = 170 bits (431), Expect = 8e-41, Method: Composition-based stats. Identities = 79/370 (21%), Positives = 127/370 (34%), Gaps = 66/370 (17%) Query: 71 PVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMT 130 V ++N S+ ++ + + LQ G N V V GT S+ P + Sbjct: 45 STMGVWLTNSPSKLYYDRKRISAAMQQLQHAGFNRVVPNVWSRGTTFHRSRFAPVEPPLQ 104 Query: 131 GKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYV 190 G DP+ + E +RG+KV WF Y + + E P+ V Sbjct: 105 KAG---VGLDPICTLAAEGRRRGIKVMPWFE-YGLMEPADSAVVHE-------NPSWVLA 153 Query: 191 Q--HRDWIRTSGDRF--VLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSR 246 + + W+ G+ L+P PEV+ +V E + R P+DG+Q DD+F P Sbjct: 154 KANGQRWMAMHGNHRMAWLNPAHPEVRARFIGLVVETLKRCPMDGLQLDDHF--AWPVHF 211 Query: 247 LNDNET---YRKYGG-------AFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPA 296 D T YR+ G + WRRN L+ ++ +K +SP Sbjct: 212 GYDPTTLALYRQETGLAPPGDHSNRYWMKWRRNQLTSLLRELRQRLKQEGLSTRISLSPG 271 Query: 297 GVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRS-----AARY 351 P S AY+ D W GL++ + Q Y R Sbjct: 272 ---------PFRS------AYNLWLQDWELWALGGLIEELVVQNYAYSVRGFAKDLDQPA 316 Query: 352 DVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEIS 411 A+ W P++ + G + L++++ L Sbjct: 317 LRKARDWGI---PSQIGVLAGFG--------------KRTTSMAVLEQKVRLARQRG--H 357 Query: 412 GTILFREDYL 421 G I F + L Sbjct: 358 GVIFFYWEGL 367 >UniRef50_A6CAJ3 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CAJ3_9PLAN Length = 811 Score = 168 bits (424), Expect = 5e-40, Method: Composition-based stats. Identities = 75/387 (19%), Positives = 122/387 (31%), Gaps = 80/387 (20%) Query: 48 TTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVF 107 T + R +W S P R ++ L G N + Sbjct: 462 TRASPPREARAVW-----------DHSPTGPYPGDWNRTCKE--------LSDAGFNMII 502 Query: 108 FQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSV 167 + G A +PS +LP S D ++ L AH+ G++VH W + +S Sbjct: 503 PNMLWGGLAHYPSDVLPRSTTYEKYG------DQIEQCLKAAHQHGLEVHVWKVNHNLS- 555 Query: 168 NTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRY 227 T P + + SV + DW L+P PE + EVV +Y Sbjct: 556 -TAPQAFVKKMRDAGRTQVSVTGEPSDW---------LNPAHPENFQLEVDSMLEVVRKY 605 Query: 228 PVDGVQFDDYFYTESPGSRL-NDNETYRK-------------YGGAFASK-ADWRRNNTQ 272 PVDG+ FD Y + + Y G S+ DWR Sbjct: 606 PVDGIHFDYIRYPNDRHDYSDYSRQKFEADTGIKVQNWPADCYNGTLKSQYRDWRAAQIT 665 Query: 273 QLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGL 332 +L+ V + I+PG++ + + D R A D W + G Sbjct: 666 RLVETVQREARKIRPGIKISAAVFREYP---------DCREWVA-----QDWPLWAKNGY 711 Query: 333 LDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMING 392 LD+I P Y + + + + + +Y GI Sbjct: 712 LDFICPMDY--TDNDTQ-FRIWIEDQQKHLAGS-IPVYPGIGALSSRTT----------L 757 Query: 393 GVPELKKQLDLNDAVPEISGTILFRED 419 + Q+D+ + G +F + Sbjct: 758 SSDRILGQVDMTRKL-NAGGFTVFSLN 783 >UniRef50_A7LVF6 Putative uncharacterized protein n=4 Tax=Bacteroides RepID=A7LVF6_BACOV Length = 395 Score = 167 bits (423), Expect = 7e-40, Method: Composition-based stats. Identities = 78/412 (18%), Positives = 150/412 (36%), Gaps = 71/412 (17%) Query: 46 PATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINT 105 + + + Q +RG+W+ P + Q + D + L L +N+ Sbjct: 19 QSIAKTTEQGIRGVWVPA------PRFT---------PVLHSYQGVKDFVKTLDELNMNS 63 Query: 106 VFFQVKPDGTALWPSKIL---------PWSDLMTG--KIGENPGYDPLQFMLDEAHKRGM 154 +F + ++ S +L S L++G K ++P DP++ ++DEAHK + Sbjct: 64 IFLVSYAETKTIYRSDVLMHYSTYKTQEESYLLSGYSKQYQSPTNDPVRDLIDEAHKHDI 123 Query: 155 KVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDR-FVLDPGIPEVQ 213 KV WF + + I N L++ P + + ++ + + P VQ Sbjct: 124 KVFFWFEYGFMG---EGRPISPNNPLLAKNPHWLGIDNQQHPANYNQHDYYFNAYNPAVQ 180 Query: 214 DWITSIVAEVVSRY-PVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFAS---------- 262 +++ ++ E + Y +DG+Q DD F P + D T Y Sbjct: 181 NFLIELIEEALMLYPDLDGIQGDDRF-PAMPRNSGYDTYTVSLYQSQHQGNNPPVDYNNS 239 Query: 263 -KADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESY 321 WR + ++ IK+ P V +P + P + Sbjct: 240 EWVHWRLDILNTFAKRLYKRIKAKSPNVMISFAP-------NPYPWCE--------ENLM 284 Query: 322 ADTRRWVEQGLLDYIAPQIY-WPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGE 380 + RW ++ + D +A Q Y + A + K+ G+ + G Sbjct: 285 QEWPRWCKEKVCDLLAVQCYRYSVDAYRATVSEVLKYIHQ--NNPNQLFAPGMILME-GS 341 Query: 381 PSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAVSY 432 SK+ P+ L+KQL +N + I+ I F ++ P ++ + Sbjct: 342 NSKMSPE--------LLQKQLRINREL-GINSEIYFYNKGIDNPSVRKVLKQ 384 >UniRef50_C2FS66 Putative uncharacterized protein n=1 Tax=Sphingobacterium spiritivorum ATCC 33300 RepID=C2FS66_9SPHI Length = 172 Score = 167 bits (422), Expect = 1e-39, Method: Composition-based stats. Identities = 52/150 (34%), Positives = 80/150 (53%), Gaps = 18/150 (12%) Query: 51 QSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQV 110 + +RG+W+ATV+ +DWP S + Q+Q +I+ LD QR G+N +FFQ+ Sbjct: 26 SPKRELRGVWIATVANIDWP-------SRDNESSERQKQELINILDAHQRAGLNAIFFQI 78 Query: 111 KPDGTALWPSKILPWSDLMTGKIGE--NPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVN 168 +P A + PWS ++G G+ +P YDPL+F+++EAHKRGM++HAW NPYR S Sbjct: 79 RPAADAFYAKGREPWSRYLSGVQGKAPSPFYDPLEFVIEEAHKRGMELHAWVNPYRASTT 138 Query: 169 TKPGTIRELNSTLSQQPASVYVQHRDWIRT 198 P + + +W Sbjct: 139 LNPAHFSK---------DHITRTKPEWFLN 159 >UniRef50_B0P7J4 Putative uncharacterized protein n=1 Tax=Anaerotruncus colihominis DSM 17241 RepID=B0P7J4_9FIRM Length = 1211 Score = 166 bits (419), Expect = 2e-39, Method: Composition-based stats. Identities = 93/412 (22%), Positives = 156/412 (37%), Gaps = 78/412 (18%) Query: 26 CSCKSTPPESMVTPPAGSKPPATTQQSS------------QPMRGIWLATVSRLDWPPVS 73 + PP++ PP + T + MRG+ ++ + D+ Sbjct: 86 SAENPAPPDASSAPPDTAGETGTDSSDNGQADEPVYFNVPTEMRGVMIS--AGTDY---- 139 Query: 74 SVNISNPTSRARVQQQAMID-KLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGK 132 ++N T + + +D L Q+L +NTV + + L+ S L + L Sbjct: 140 ---LTNGTDVSAQELATQLDEALAAAQQLTMNTVIIDTQYGDSVLFESSALESAPL---- 192 Query: 133 IGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQH 192 G D +++ +A + G V+A Y VS Sbjct: 193 -----GLDVTEYLCAKAREMGFYVYA---TYDVST------------------------- 219 Query: 193 RDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNET 252 R+ G+ D D + + Y DG+ D Y +SP + Sbjct: 220 ----RSGGEGLTADGA---ALDDLAENIGAFAEAYKPDGILLDGYECADSPAAY----AG 268 Query: 253 YRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTR 312 Y + GG +A ++R + L+ + ++ PGV+ G+ VW+N DP GSDT+ Sbjct: 269 YLQSGGGMGYEA-YQRQVPRALLETAAAAVRENAPGVQVGLYTQAVWQNSDADPDGSDTK 327 Query: 313 G-AAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYI 371 A ADTR +V+ GL D++ + Y + AR+ V+A WWA VV T T+LY+ Sbjct: 328 AETTALGTGNADTRAFVKDGLFDFVMVKNYGSTNEETARFGVVAAWWAGVVDGTDTKLYM 387 Query: 372 GIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNK 423 A +VG S + +L Q+ + SG+ L Sbjct: 388 MHAADRVGTQS------VGWTVYEQLTAQIIRLEEAGGSSGSAFNSLAALRS 433 >UniRef50_P74629 Sll0736 protein n=1 Tax=Synechocystis sp. PCC 6803 RepID=P74629_SYNY3 Length = 408 Score = 165 bits (418), Expect = 2e-39, Method: Composition-based stats. Identities = 78/400 (19%), Positives = 135/400 (33%), Gaps = 83/400 (20%) Query: 49 TQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFF 108 + S +RG+WL V S + + L+ NT++ Sbjct: 35 SSASPNKIRGVWLTNV----------------DSNVLYDPVQLKTAIADLKSTNFNTLYP 78 Query: 109 QVKPDGTALWPSKILPWSDLMTGKIGENPG-YDPLQFMLDEAHKRGMKVHAWFNPYRVSV 167 V DG L+PS + + K E G D L +++ A ++ ++V WF Sbjct: 79 TVWNDGHTLYPSAVA--QQWLGKKQDEKLGDRDMLGEVINLAKEKSLRVIPWFE------ 130 Query: 168 NTKPGTIRELNSTLSQQPASVYVQHRDW--IRTSGD---RFVLDPGIPEVQDWITSIVAE 222 G + S + + I G R L+P PEVQ IT+++ + Sbjct: 131 ---FGFMAPAESDWVKAHPHWLTTNSQGETIWLEGGTIPRVWLNPLHPEVQQLITALLVD 187 Query: 223 VVSRYPVDGVQFDDYF-YTESPGSRLNDNETYRKYGGA--------------------FA 261 +V RY VDG+Q DD+F Y S G YR+ G + Sbjct: 188 LVRRYDVDGIQLDDHFGYPYSFGYDPITVALYRQETGQEPLPVPELDLNQNCVSSDPIWQ 247 Query: 262 SKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESY 321 DWR + + + +K++KP + +SP + + Sbjct: 248 QWTDWRSAKISRYVQSLVPILKAVKPNLTISISP---------------NPQTFSKNCFL 292 Query: 322 ADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEP 381 D + W E +++ + Q+Y A + + + + +GI G Sbjct: 293 LDWQTWHEAKVINELVLQVYRE---KQAAFTGELQQSSVQQTKQEIPVVVGI---LSGLK 346 Query: 382 SKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYL 421 ++ P I + Q +GT F + L Sbjct: 347 NRSIPSARIKQQAQWVDDQ--------NFAGTAFFFYESL 378 >UniRef50_B0VF99 Putative uncharacterized protein n=1 Tax=Candidatus Cloacamonas acidaminovorans RepID=B0VF99_9BACT Length = 482 Score = 165 bits (417), Expect = 4e-39, Method: Composition-based stats. Identities = 72/397 (18%), Positives = 148/397 (37%), Gaps = 79/397 (19%) Query: 53 SQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKP 112 + +R +W+ L W + ++++ + + N + +V+ Sbjct: 17 NAEIRSVWV-----LPWDIAT--------------EESIDEVIATAVSCNQNELLVEVRY 57 Query: 113 DGTAL---------WPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPY 163 AL +P+ P S ++ E+ +DPL ++L +AH++G+ V AW + Sbjct: 58 RADALFDTSKGAYLYPNP-EPKSYIL-----EDASFDPLAYILKKAHQKGLAVQAWVVVF 111 Query: 164 RVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTS----------GDRFVLDPGIPEVQ 213 + + Q +Y H+DWI + + +DPGIPEVQ Sbjct: 112 NATPREQSYI----------QQNYIYNNHKDWITYNFNGSQMNIDRQSGYFIDPGIPEVQ 161 Query: 214 DWITSIVAEVVSRYP-VDGVQFDDYFYTESPGSRLNDN-ETYRKY--GGAFASKADWRRN 269 +++ +I+ + YP +DG+ D Y ES + Y +Y + +WR Sbjct: 162 EYLLNILGNLAGGYPELDGIHLDYIRYPESDLGFHPVSLARYNEYCQNQEEITYNEWRIM 221 Query: 270 NTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVE 329 + +K I P ++ + + D D + W++ Sbjct: 222 QVTNFVENAYFQLKEINPTLQLTAAVVPDIAEANVD--------------YAQDWQSWLK 267 Query: 330 QGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWM 389 +G++D + P Y A++ + + + ++ IG+ + G + + Sbjct: 268 KGIIDRVYPMAY---DVQYAKFKKQLEQIKLLQ--MKEKIVIGLRAW-NGNGNSLAVGNG 321 Query: 390 INGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQT 426 + V E+ K++ L + +G LF L K Sbjct: 322 NSYNVKEIAKKITLTRDL-GFAGVSLFSYSGLQKGNA 357 >UniRef50_A8F3E2 Putative uncharacterized protein n=1 Tax=Thermotoga lettingae TMO RepID=A8F3E2_THELT Length = 961 Score = 162 bits (410), Expect = 2e-38, Method: Composition-based stats. Identities = 61/380 (16%), Positives = 119/380 (31%), Gaps = 59/380 (15%) Query: 71 PVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMT 130 + + N + + + + + + L +G N + +V G + SK+ S Sbjct: 315 QTRGIWLDNQSIKKTGSPERLRETIRKLHSIGFNMIIPEVIYKGKTM-ASKL---SYFPQ 370 Query: 131 GKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYV 190 + DPLQ ++DEA K ++VHAW + S + + P + Sbjct: 371 DDDFQRWSEDPLQVIVDEAKKLNIEVHAWCWVFAASSGGEENYF------IKNFPDWIEK 424 Query: 191 QHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFY----------- 239 I T P+ ++++ + E+ +Y +DG+ D Y Sbjct: 425 DKYGNIFTKNGTAWFSHSNPQTREYLIDGILEIAKKYEIDGINLDYIRYDGDEMGYDEHA 484 Query: 240 -TESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGV 298 D KY WR + ++ K++ + Sbjct: 485 VKSFMKETGVDPYKIEKYSKDQVIWHMWREEKINSFVEELYKRAKALNDRLLISADV--- 541 Query: 299 WRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWW 358 P S A +E + WV +D + P Y S ++ + Sbjct: 542 ------YPSLSG-----ARNEKKQNWEAWVRNKYIDALIPMNY---KGSIEDLKIVLEMQ 587 Query: 359 ADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFRE 418 LY G+ + +L +Q+ + SG +LF Sbjct: 588 TKF--KNMVYLYSGLQMINL-------------KSTEDLIEQIKTSINYLS-SGIVLFSL 631 Query: 419 DYLNKPQTQQAVSYLQSRWG 438 YL++ Y+++ +G Sbjct: 632 SYLDRYDE----DYIRNIFG 647 >UniRef50_A8F7U2 Putative uncharacterized protein n=2 Tax=Thermotogaceae RepID=A8F7U2_THELT Length = 367 Score = 159 bits (402), Expect = 2e-37, Method: Composition-based stats. Identities = 75/398 (18%), Positives = 142/398 (35%), Gaps = 82/398 (20%) Query: 64 VSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKIL 123 V+ P + + + + +I+ + +G ++ QV A + S+IL Sbjct: 13 VTTTLMPYPLGIWVVRDQITSIEKINRVIEI---AKEVGATRIYVQVVGRADAYYNSEIL 69 Query: 124 PWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQ 183 P ++ ++ +P +DPL+ ++D A G+K+ AW N + K + Sbjct: 70 PKAETLSEC---SPDFDPLKEIIDLAKISGIKISAWMNVFYAWPFGKKPVSEK------- 119 Query: 184 QPASVYVQHRDWIRTSGDR----------------FVLDPGIPEVQDWITSIVAEVVSRY 227 V H DWI + L+P + +V+ ++++I E+ Y Sbjct: 120 ---HVVNVHPDWITYDQNGKSMLEYASSPEINTPGLFLEPALEDVKKFVSNIAEEIAKNY 176 Query: 228 PVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQL------------- 274 VD + D Y +T+ + A +W + Q+ Sbjct: 177 DVDEIHLDYIRYPY---------KTFGYHPDAMKIYREWLKKAIQEKKLTNLGEGFDLFR 227 Query: 275 IAKVSHTIKSIKPGVE-FGVS-PAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGL 332 I +VS T+K I V +G A V+ D + +G +W+E Sbjct: 228 IQQVSDTVKLIYEKVHNYGKKLSAAVFAYYEQDAISQRLQG----------WLQWLEGEY 277 Query: 333 LDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMING 392 LDY Y +R Y V +A ++ +G+ YK+ E Sbjct: 278 LDYACLMAY-ENNRDTVEYYVK---YAVKALGAAEKIRVGLGAYKMTEN---------PE 324 Query: 393 GVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAV 430 + E+ K + EI ++F + L + ++ V Sbjct: 325 KLYEIAKSVVEKYRPDEI---LIFSFENLLDEKVRKYV 359 >UniRef50_C6IEW4 Putative uncharacterized protein n=4 Tax=Bacteroidales RepID=C6IEW4_9BACE Length = 490 Score = 152 bits (383), Expect = 3e-35, Method: Composition-based stats. Identities = 88/480 (18%), Positives = 154/480 (32%), Gaps = 90/480 (18%) Query: 6 RNKKLTIRRPAI---LVALALLLC-SCKSTPPESMVTPPAGSK--PPATTQQSSQPMRGI 59 ++ K+ I++ I + +A LC +C + +G + P +S+P R I Sbjct: 29 KSYKMNIKKNIIKTFMGGIAACLCMACGGNDSKDYWGDTSGGEDEEPTENPNASKP-RYI 87 Query: 60 WLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPD-GTALW 118 W+ + S NI+ L + G + V+P G L+ Sbjct: 88 WIDAAANFPDFANSKENIARD--------------LALAKDAGFTDIVVDVRPTTGDVLF 133 Query: 119 PSKILPWSDLMTGKIGEN-------PGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKP 171 + ++ M IG N +D LQ +DEA K+G+++HA N + Sbjct: 134 KTNLVDQVKFMYAWIGSNYTKVERTATWDYLQAFVDEARKQGLRIHAAINTFVGGNQIDG 193 Query: 172 GT---IRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYP 228 GT R+ + ++ V + TS +P PEVQ ++ ++ ++ Y Sbjct: 194 GTGLLYRDQSKAEWATQMNMQVGITSVMNTSESTKFFNPAHPEVQTFLCDLLKDLAG-YD 252 Query: 229 VDGV--------------------QFDDYFYTESPGSRLND------NETYRKYGGAFAS 262 +DG+ QF++Y + ND + Y Sbjct: 253 LDGIFLDRGRFLNLQADFSEESRKQFEEYMGGIRIQNYPNDILAPGASSLPATYPKYLTK 312 Query: 263 KADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAG----------VWRNRSHDPLGSDTR 312 ++R + K +K +KPG++FGV G W ++D Sbjct: 313 WLEFRAKVIYDFMQKARTAVKGVKPGIKFGVYVGGWYSTYYDVGVNWAASTYDTSRYYNW 372 Query: 313 GAAAYDESYADTRRWVEQGL---LDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRL 369 + Y G +D I Y + +W + Sbjct: 373 ATSKYKNY----------GYAACMDQILIGAYAS----PLKVHGTTEWTMEGFCSLAKDK 418 Query: 370 YIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLND---AVPEISGTILFREDYLNKPQT 426 G G P D N E + Q + + G LF +L K Sbjct: 419 IKGECPIVAGGPDVGNWD-TNNQATQEQENQAIVQSVKACMNVCDGYFLFDMIHLKKADQ 477 >UniRef50_B0PF61 Putative uncharacterized protein n=1 Tax=Anaerotruncus colihominis DSM 17241 RepID=B0PF61_9FIRM Length = 915 Score = 149 bits (376), Expect = 2e-34, Method: Composition-based stats. Identities = 60/232 (25%), Positives = 101/232 (43%), Gaps = 25/232 (10%) Query: 6 RNKKLTIRRPAILVALALLLCSCKSTPPESMVTPPAGSK-PPATTQQSSQPMRGIWLATV 64 K TI R +L +LA++ +C ++T P SK PPA + + + + + T Sbjct: 3 HETKRTILRT-LLASLAIVAATCAVLYASDLLTSPISSKTPPAGIPAAGEQLHALIVRTR 61 Query: 65 SRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILP 124 D+P P A+ Q+ + + G N VFF+ P AL+ S ILP Sbjct: 62 GNADFPSA-------PGLSAKQQRAQLDEIAAFAGEYGYNAVFFEAVPSCDALYRSSILP 114 Query: 125 WSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQ 184 S G+ G +DPL ++++ + G++V+A +P+ VS Sbjct: 115 SSAYWMGEQGAFAFFDPLDYLVNVCKESGIQVYAMIDPFAVSAE----------DLAESS 164 Query: 185 PASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDD 236 PAS ++ +WI G +P VQ S+ AE+ +RY + G+ + Sbjct: 165 PAS---KNPEWIAADGR---FNPTELGVQQLAGSVAAELATRYDIAGIVLEG 210 >UniRef50_Q6ZE96 Slr7102 protein n=5 Tax=Cyanobacteria RepID=Q6ZE96_SYNY3 Length = 338 Score = 148 bits (373), Expect = 5e-34, Method: Composition-based stats. Identities = 81/422 (19%), Positives = 154/422 (36%), Gaps = 113/422 (26%) Query: 11 TIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWP 70 +++ PA+ V + LLL +C + P T + + M+G+WL V + Sbjct: 7 SLKWPALFVGIILLLAACH--------------RAPTRTAKETDKMKGVWLTDVGTMGLT 52 Query: 71 PVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMT 130 + ++ + L H+ + + V+F V L+P++ DL+ Sbjct: 53 YSTLLD----------------ETLHHISKSDYDRVYFSVYGLRGQLYPTR--QRGDLIP 94 Query: 131 GKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYV 190 + P + + M E+ ++G+K +AWF + Q V Sbjct: 95 ----KLPFPNAVGSMARESRRQGLKPYAWFE----------------YGLMLPQFDPVAK 134 Query: 191 QHRDWIRTSGDR-----------FVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFY 239 + DW+ T + LDP PEV+ +I + + +++ + G+Q DD++ Sbjct: 135 NNPDWLLTMANGEQVIENHGVPMVWLDPSNPEVEAYILAHIDDILKEKSLAGIQLDDHWA 194 Query: 240 TESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVW 299 R++G D+RR+ L KV IK+ P E +SP Sbjct: 195 VP------------RQFG-------DYRRS-LTALTTKVHEHIKTKNPEFELSLSP---- 230 Query: 300 RNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWA 359 + +E D RWV+QG++D + QIY S A Sbjct: 231 -----------NPYQFSLNEYNQDWLRWVKQGIVDEVVVQIYRS---SPAEVQQAVNNSG 276 Query: 360 DVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFRED 419 + +G+ + +P ++ +K Q++ + G LF + Sbjct: 277 IYTASRYVPVGVGLYTGRKIKPFNLQS----------IKDQINAVEKQN--LGHSLFVWE 324 Query: 420 YL 421 ++ Sbjct: 325 FM 326 >UniRef50_C3R3M7 Putative uncharacterized protein n=2 Tax=Bacteroides sp. 2_2_4 RepID=C3R3M7_9BACE Length = 432 Score = 140 bits (353), Expect = 9e-32, Method: Composition-based stats. Identities = 65/337 (19%), Positives = 115/337 (34%), Gaps = 54/337 (16%) Query: 26 CSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRAR 85 CS K+ E P P + M +W + + Sbjct: 19 CSTKTVESELPEPNPPTPVIPEEPTPEKEKM--LWFDAEANFERFSK------------- 63 Query: 86 VQQQAMIDKLDHLQRLGINTVFFQVKP-DGTALWPSKIL-PWSDLMTGKIGENPGYDPLQ 143 ++ + LD + G N + V+P G AL+ S L P +DL I + Y LQ Sbjct: 64 --KENITYYLDLAKSTGFNKIVVDVRPVQGDALFKSSYLTPLTDLAGTHIERDWNY--LQ 119 Query: 144 FMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRF 203 F +DEAHKR +KV + + + + + T + Y + + I D+ Sbjct: 120 FFIDEAHKRELKVTVSATIFTAGLPSSKNGMAYRDDTWDGKTCLEYTKDQGLIDIKDDKT 179 Query: 204 ----VLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNE-TYRKY-- 256 L+P +PEVQD+ + + E+V+ Y DG D Y + + +Y Sbjct: 180 KVSAFLNPVLPEVQDFCLNFIKELVTNYNFDGFALDYCRYPGDESDFSEATKIAFEQYIG 239 Query: 257 ---------------------GGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSP 295 G + ++R + + +V IK+IKP ++ Sbjct: 240 KQLDRFPDDIFIWNTDGTKRTGTYYKKWWEFRSMVIRNFVERVRTEIKNIKPDIQL---- 295 Query: 296 AGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGL 332 W + + A+ + ++ W Sbjct: 296 -EYWAASWIHAIYGQGQNWASTEYDFSKEYSWASPEY 331 >UniRef50_Q2BFL2 Putative uncharacterized protein n=1 Tax=Bacillus sp. NRRL B-14911 RepID=Q2BFL2_9BACI Length = 813 Score = 116 bits (290), Expect = 2e-24, Method: Composition-based stats. Identities = 49/290 (16%), Positives = 89/290 (30%), Gaps = 54/290 (18%) Query: 89 QAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGY--------- 139 + LD ++ G N+V+ + G ++PS+ + G ++P + Sbjct: 375 AGVNQVLDRMEEAGFNSVYLETTFWGYTIYPSETM----TEYGLPAQHPNFRNADYGKYG 430 Query: 140 -DPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRT 198 D LQ + E KRG+ V AW + + + ++ + T Sbjct: 431 SDLLQAYIKEGKKRGISVQAWTDGFMIGHSSLGLPSQFQVHPEWAAIQRSNTTGEPKPDT 490 Query: 199 SGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLND-----NETY 253 S + + LD PEVQ ++ I E+ S+Y + G+ D Y + E Y Sbjct: 491 SSNYYWLDIAQPEVQTFMLDIYKEMQSKYDIKGLNIDYMRYPHQSFEKSYGFSEKVRELY 550 Query: 254 RKYGG-------------AFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWR 300 + G + A W + + + K + +P Sbjct: 551 KAKTGIDPMELSPTATPEEWEKWAGWIQQRENDFVDGLHTQSKKLNSKFMLTATP----- 605 Query: 301 NRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAAR 350 P D + +D + PQ Y S Sbjct: 606 --EPGPEAVLIS----------DWQE-----DIDGVIPQAYGHDFNSIQS 638 >UniRef50_A7LVF0 Putative uncharacterized protein n=3 Tax=Bacteroides RepID=A7LVF0_BACOV Length = 467 Score = 114 bits (285), Expect = 8e-24, Method: Composition-based stats. Identities = 74/468 (15%), Positives = 138/468 (29%), Gaps = 85/468 (18%) Query: 8 KKLTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRL 67 K L I L A+ + CS S ++ P + R IW+ + Sbjct: 3 KFLKILILTFLGAVTITSCSDDSDGIPGWPWNDNSTEKPDEPDVAEAKPRYIWIDAAANF 62 Query: 68 DWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPD-GTALWPSKILPWS 126 ++ + ++ ++ G + V+P G L+ + ++ Sbjct: 63 P--------------DYANSKENIAKDMEKIKAAGFTDIIVDVRPTTGDVLFNTNVVDQV 108 Query: 127 DLMTGKIGENPGY---------DPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRE- 176 M + N GY D LQ ++EA +G+KV+A N + + Sbjct: 109 KRM--DVWGNSGYSYYERTETWDYLQAFIEEARIQGLKVNASINTFVGGYLCPYNLGHDG 166 Query: 177 --LNSTLSQQPASVYVQHRDWIRTSG--------DRFVLDPGIPEVQDWITSIVAEVVSR 226 + ASV T +P +VQ+++ ++A++ + Sbjct: 167 VLFRDESKKGWASVANLADGLTNTMDLLDDETDYGAKFFNPANDDVQNFVLQLLADLA-K 225 Query: 227 YPVDGVQFDDYFYTESPGSRLNDN---ETYRKYGGA------------------------ 259 Y +DG+ D Y + + + + +Y G Sbjct: 226 YDLDGIILDRCRYDDYGLESDFSDISKQKFEEYIGETVANFPADIMAPGTDEIPSDQPVY 285 Query: 260 FASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDE 319 F ++R I K +KS+ ++FGV G W + + + Sbjct: 286 FKKWLEFRAKVIHDFIVKAREKVKSVNNNIKFGVY-VGAWYSTYYTSGVNWASPKYNTSA 344 Query: 320 SYADTRRWVEQGL--------LDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYI 371 Y +W LDYI Y + +W + L Sbjct: 345 YY---PKWATSDYKNYGYADHLDYIFLGAYASVNNIYGS----GEWTMEGFCKNGRELLQ 397 Query: 372 GIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFRED 419 G + G W G ++ +D + G F D Sbjct: 398 GDVPFAGGPDIGNSTGWTDGGQSAKIPDAID--ACISNSDG--FFAFD 441 >UniRef50_D1BUC2 Putative uncharacterized protein n=1 Tax=Xylanimonas cellulosilytica DSM 15894 RepID=D1BUC2_XYLCX Length = 806 Score = 114 bits (284), Expect = 9e-24, Method: Composition-based stats. Identities = 64/382 (16%), Positives = 124/382 (32%), Gaps = 73/382 (19%) Query: 63 TVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKI 122 T S P + + +A+ ++ + G+N V+ QV G ++PS + Sbjct: 341 TASYRSIPARVAESRGVWYRPEEKNPEAVEATVEAMASAGVNEVYLQVLSGGYTIYPSAV 400 Query: 123 LPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLS 182 + + + GYD L A + G+++HAW + +V G + + Sbjct: 401 A-VAHGLPAVRPDLAGYDALAAWKSAADENGIELHAWIDGLQVGNELGDG----IGPIVQ 455 Query: 183 QQPASVYVQH-----RDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDY 237 Q P + V + + LD P + ++ + E+VSRY + G+ D Sbjct: 456 QHPEWLAVDRAHAGTTTATPSFNGFYWLDITDPVARQYMIDVTTEMVSRYDLAGLNHDYM 515 Query: 238 FYTESPG----SRLNDN--ETYRKYGGAFA-------SKADWRRNNT------QQLIAKV 278 Y ++ +D+ Y+ G A W R +L+ + Sbjct: 516 RYWDNGNAQDSYNFSDDSRAAYQALTGVDPVTLSPEADAAAWERWKAFVSSEEDRLVRDI 575 Query: 279 SHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAP 338 ++K P +P A+ RW ++D + P Sbjct: 576 FRSVKKAAPTAVVSNAP--------------------EVGRENAEIGRW--NDVVDVVIP 613 Query: 339 QIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFY--KVGEPSKIEPDWMINGGVPE 396 Q Y +W D + + +Y G++ + G +E Sbjct: 614 QAYTAN---LDSIHQRVEWIQDTMTGGQL-VYTGLSAMYQRFGSARTVE----------- 658 Query: 397 LKKQLDLNDAVPEISGTILFRE 418 Q + E G+++F Sbjct: 659 ---QTQAARDLDE--GSVIFSW 675 >UniRef50_Q8AAL7 S-layer related protein, sialic acid-specific 9-O-acetylesterase n=9 Tax=Bacteroidales RepID=Q8AAL7_BACTN Length = 884 Score = 112 bits (281), Expect = 2e-23, Method: Composition-based stats. Identities = 45/262 (17%), Positives = 89/262 (33%), Gaps = 35/262 (13%) Query: 75 VNISNPTSRARVQQQAMIDK-LDHLQRLGINTVFFQVKP-DGTALWPSKILPWSDLMTGK 132 + I + R + ID L+ ++ LG ++P G L+ S+ P G Sbjct: 479 MWIDAEANFERFSHKDSIDYYLEKIKSLGFTHAVVDIRPITGEVLYKSEYAPQMKEWKGA 538 Query: 133 IGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQH 192 + +D L + + + H+ G+++HA N + N + + Sbjct: 539 KAGD--FDYLGYFIKKGHELGLEIHASLNVFCAGHNYFDRGMVYSGHPEWASMVYTPDKG 596 Query: 193 RDWIRTSGDRF--VLDPGIPEVQDWITSIVAEVVSRY-PVDGVQFDDYFYTESPGSRLN- 248 I ++ +++P E + I +++ EVV++Y +DG+ D Y + Sbjct: 597 IIPITEEKHKYGAMINPLNEEYRTHILNVLKEVVTKYPDLDGLMLDRVRYDGITADFSSL 656 Query: 249 ---------------------------DNETYRKYGGAFASKADWRRNNTQQLIAKVSHT 281 D + + G F +WR N +A Sbjct: 657 SRKKFEEYIGKKVANFPEDIFRWTKNADGKYTTQPGKYFRKWLEWRTKNITDFMALARKE 716 Query: 282 IKSIKPGVEFGVSPAGVWRNRS 303 +K+ P V FG + + Sbjct: 717 VKAANPDVSFGTYTGAWYPSYY 738 >UniRef50_C3A5Y1 Putative uncharacterized protein n=1 Tax=Bacillus mycoides DSM 2048 RepID=C3A5Y1_BACMY Length = 143 Score = 111 bits (278), Expect = 5e-23, Method: Composition-based stats. Identities = 34/126 (26%), Positives = 60/126 (47%), Gaps = 14/126 (11%) Query: 10 LTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDW 69 + ++R ++ + +L P S ++P + T +R +W+A+V +DW Sbjct: 1 MIMKRLVMMCYIVILFT------PFSFISPHSTYAE-VNTTYKKHELRAVWIASVLNIDW 53 Query: 70 PPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLM 129 P + + I Q+Q I LD ++ G+N V Q+KP A +PS PWS+ + Sbjct: 54 PSKTGLPI-------EKQKQEFIRLLDDVKNTGMNAVVVQIKPTADAFYPSNYGPWSEYI 106 Query: 130 TGKIGE 135 TG G+ Sbjct: 107 TGTQGK 112 >UniRef50_Q8YK50 All8067 protein n=8 Tax=Cyanobacteria RepID=Q8YK50_ANASP Length = 399 Score = 110 bits (276), Expect = 7e-23, Method: Composition-based stats. Identities = 59/276 (21%), Positives = 106/276 (38%), Gaps = 55/276 (19%) Query: 72 VSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWS---DL 128 + + +S + +Q + ++ + + GINT+ V +G ++ S ++ + Sbjct: 113 IRGIYLSRYQATNNADEQTIRQRVRYYRSQGINTIIHGVWGNGCTMYKSDVMQQTLGYSS 172 Query: 129 MTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTL--SQQPA 186 + E L +++DEAHK+GM+VHA+F G + NS + Sbjct: 173 CPNQFQEKW----LNWLIDEAHKQGMQVHAYFE---------KGIKIDKNSPIFDLAVAK 219 Query: 187 SVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYP-VDGVQFDDYFYTESPGS 245 + V D D +VLD PEV + +I E V +YP VD VQ+DDY Sbjct: 220 NWMVPGIDKTYAGIDHYVLDVEKPEVATFFKNISVEFVKKYPNVDAVQWDDYLG------ 273 Query: 246 RLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHD 305 Y K D R + + + +++ ++K V F + + + + Sbjct: 274 ----------YYAELPGKTD-RTKHLTKFVQQMTSSMKEANSLVSFDICHHNPYWAKKYF 322 Query: 306 PLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIY 341 AD +W +D + Q Y Sbjct: 323 A---------------ADWEQWG----VDRVFIQAY 339 >UniRef50_UPI0001C1694B Protein of unknown function DUF187 n=1 Tax=Raphidiopsis brookii D9 RepID=UPI0001C1694B Length = 166 Score = 106 bits (264), Expect = 2e-21, Method: Composition-based stats. Identities = 36/131 (27%), Positives = 56/131 (42%), Gaps = 17/131 (12%) Query: 174 IRELNSTLSQQPASVYVQHRDWIRTSG----DRFVLDPGIPEVQDWITSIVAEVVSRYPV 229 + +S L+Q RD + DR L+P PEVQ ++ +++ E+V Y + Sbjct: 1 MAPADSLLAQARPEWITTRRDGTKIVKEGIHDRVWLNPFHPEVQKFMENLIVEIVRNYDI 60 Query: 230 DGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKA-----------DWRRNNTQQLIAKV 278 DG+QFDD+F P D+ T Y KA WR + L+ +V Sbjct: 61 DGIQFDDHFG--LPSELGYDSYTVGLYKQEHQGKAPSENFQDPEWVKWRADKITNLMKRV 118 Query: 279 SHTIKSIKPGV 289 IK+ K + Sbjct: 119 FFAIKANKKEL 129 >UniRef50_P35824 S-layer-related protein n=1 Tax=Bacillus circulans RepID=SLAP_BACCI Length = 1616 Score = 104 bits (259), Expect = 8e-21, Method: Composition-based stats. Identities = 74/416 (17%), Positives = 128/416 (30%), Gaps = 75/416 (18%) Query: 71 PVSSVNISNPTSRARVQQ-QAMIDKLDHLQRLGINTVFFQVKP-DGTALWPSKILPWSDL 128 + + + + Q + + L + G+ +V F VK +G + L Sbjct: 514 KKVILWVDQAANARKFQTGDNVANFLRTAKENGVTSVVFDVKGVEGYVSYKKSTLTGRPY 573 Query: 129 MTG-----KIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLS- 182 ++ K G NP D LQ + + + G+ +H FN + + Sbjct: 574 VSAIKAPEKAGSNPDLDLLQEFIRYSRELGLDIHVSFNIFAEGSIASNEFALLDSHLDWE 633 Query: 183 ----QQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYF 238 + ++ G ++P EV+D+ + EV+ Y VDGV D Sbjct: 634 ERVYNAADNGQIKRLRESAKQGAVAFVNPSNDEVRDFQLKTIEEVLQNYDVDGVVLDRAR 693 Query: 239 YTESPGSR----------------------LNDNETY----RKYGGAFASKADWRRNNTQ 272 Y +D TY RK G ++R + Sbjct: 694 YDNESADFSDLTKAKFESFLGARGKQLQNWPDDVFTYAGNVRKDGPLIRDWWEFRSKTIK 753 Query: 273 QL---IAKVSHTIKSIK-PGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDE--------S 320 + +++ +K+ K +E G W + YDE Sbjct: 754 SFTSEVRQLTDRVKAEKGKKIEVSAY-VGSWFESYYLNGVHWGSTEFRYDERLRMKDKSV 812 Query: 321 YADTRRWVEQGLL---DYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYK 377 Y T + E G + D+I Y + Y L ++V LY GIA Sbjct: 813 Y--TPGYYESGYVKNLDFIMIGAYQTTAPEIEHYITL----GNIVTNGEVPLYAGIAL-- 864 Query: 378 VGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAVSYL 433 N P L++ + + G +LF +N P A+ L Sbjct: 865 ------------TNVQEPALQRDV-FQAGLVNTHGLMLFDASQVNWPVAGAALRNL 907 >UniRef50_Q6AHL3 Putative uncharacterized protein n=1 Tax=Leifsonia xyli subsp. xyli RepID=Q6AHL3_LEIXX Length = 112 Score = 99.8 bits (247), Expect = 2e-19, Method: Composition-based stats. Identities = 44/115 (38%), Positives = 63/115 (54%), Gaps = 6/115 (5%) Query: 293 VSPAGVWRNRSHDPLGSDTRGAAAY---DESYADTRRWVEQGLLDYIAPQIYWPFSRSAA 349 +SP G+W ++++D GSDT +++ + YADT WV+ G+LDYI PQ+YW + A Sbjct: 1 MSPFGIWEHKANDSRGSDTPTSSSSTYSKQVYADTLGWVKAGILDYIVPQVYWSSDQPVA 60 Query: 350 RYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLN 404 Y +A+WW + V+ T RLYIG YK E W E+ QL N Sbjct: 61 PYGEIARWWNNAVEGTNVRLYIGQPNYKYTLFGPKEVAWT---NPDEVPNQLLFN 112 >UniRef50_UPI0001789939 S-layer domain protein n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI0001789939 Length = 1549 Score = 98.7 bits (244), Expect = 4e-19, Method: Composition-based stats. Identities = 74/418 (17%), Positives = 134/418 (32%), Gaps = 80/418 (19%) Query: 70 PPVSSVNISNPTSRARVQQ--QAMIDKLDHLQRLGINTVFFQVKP-DGTALWPSKILPWS 126 P + + S AR + + ++ L + G+ +V F VK +G + L Sbjct: 513 PEKEVILWVDQASNARKFKTSEDVLAFLQKAKETGVTSVAFDVKGVEGYVSYKKNDLTGR 572 Query: 127 DLMTG-----KIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTL 181 ++ K G +P D LQ +D H G+++HA N + + Sbjct: 573 PYVSEIKAPEKAGASPDLDLLQEFIDHGHALGLEIHAAINVFAEGSIAHNEYAVLNDHLD 632 Query: 182 SQQPASVYVQHRDW-----IRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDD 236 ++ + + + G ++P EV+D+ E++ Y VDGV D Sbjct: 633 WEERVYFPENNGEIKRLRESKKQGLVAFVNPSNDEVRDYQLRTFEEIIKNYDVDGVVHDR 692 Query: 237 YFYTESPGSR----------------------LNDNETY----RKYGGAFASKADWRRNN 270 Y ND +Y R YG ++R Sbjct: 693 SRYDNEGADFSDETRVKFEQFLQARGKQLVNWPNDIFSYENNVRVYGPLIQDWWEFRSGT 752 Query: 271 TQQLIAKVSHTIKSIK----PGVEFGVSPAGVWRNRSHDPLGSDTRGAAAY--------- 317 Q +V + S + +E V L G+ + Sbjct: 753 IQSFFGEVKALVDSYEVSEGRKIEVSSY---VGSWYETYYLNGVNWGSKNFRFNPALGMP 809 Query: 318 -DESYADTRRWVEQGLLDYI-APQI--YWPFSRSAARYDVLAKWWADVVKPTRTRLYIGI 373 + Y T + + G ++Y+ I Y S+ +Y L ++V LY GI Sbjct: 810 DESVY--TEEYYQTGYIEYLDFLMIGAYQTTSQEIQKYITL----GNIVTNGEIPLYAGI 863 Query: 374 AFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAVS 431 A N P +++++ + +G +LF +N P A+ Sbjct: 864 AM--------------NNVQAPAVQREV-FQAGLKSTNGLMLFDASQVNWPIAAAALQ 906 >UniRef50_D1PX02 Putative uncharacterized protein n=2 Tax=Prevotella RepID=D1PX02_9BACT Length = 893 Score = 97.5 bits (241), Expect = 8e-19, Method: Composition-based stats. Identities = 51/270 (18%), Positives = 81/270 (30%), Gaps = 45/270 (16%) Query: 94 KLDHLQRLGINTVFFQVK-PDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKR 152 +D GI + +K G L+ SK G P +D + + EAHK+ Sbjct: 46 YVDKCHEAGITHLVLDIKDNTGEVLYDSKYTSRKRTWKGFTR--PDFDFINTFISEAHKQ 103 Query: 153 GMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWI----RTSGDRFVLDPG 208 GM + A N + + + + YV + + L+P Sbjct: 104 GMVIFAGMNIFADGSKAHGQPRGAVFGKNKKWQSINYVPGKGLVPVTELNGKTSMFLNPA 163 Query: 209 IPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNET-YRKY----------- 256 + +VQ +IV EVV R+ DG+ D Y + T + KY Sbjct: 164 LKDVQRHEINIVKEVVKRFKFDGIMLDRARYDCIDSDFSEASRTLFEKYIGEKLNKYPED 223 Query: 257 ----------------GGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAG--- 297 G + +WR + I V IK I P Sbjct: 224 IYEWKANDKGSFDRVPGPYYTKWIEWRASVIYGFIKDVRTAIKKIDPQCMLASYTGAWYP 283 Query: 298 -------VWRNRSHDPLGSDTRGAAAYDES 320 W ++ +DP Y + Sbjct: 284 TYYEVGVNWASKKYDPSKEFPWATPEYRQY 313 >UniRef50_C6IVH6 S-layer domain-containing protein n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6IVH6_9BACL Length = 1573 Score = 96.4 bits (238), Expect = 2e-18, Method: Composition-based stats. Identities = 63/404 (15%), Positives = 126/404 (31%), Gaps = 81/404 (20%) Query: 75 VNISNPTSRARVQQQAMIDK-LDHLQRLGINTVFFQVKP-DGTALWPSKILPWSDLMT-- 130 + + ++ + Q + L + G+ ++ VK +G + L ++ Sbjct: 542 LWVDQASNAKKFQTSEQVRAFLQKAKDTGVTSIALDVKGVEGYVSYKKNDLTGRPYVSEL 601 Query: 131 ---GKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPAS 187 G+ G NP D LQ +D H G+++HA N + + ++ Sbjct: 602 QAPGRAGANPDLDLLQEFIDHGHDLGLEIHAVVNVFAEGSIAYNEYAVLNDHLDWEERVH 661 Query: 188 VYVQHRDWIR-----TSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFY--- 239 + + R G ++P EV+++ E++ Y VDGV D Y Sbjct: 662 YAENNGEIKRLRESAKQGLVAFVNPANDEVREFELKTFEEILKNYDVDGVVHDRGRYDNE 721 Query: 240 -----------------------TESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIA 276 + P R G ++R Q Sbjct: 722 GADFSEETRVKFEQFLLQRGKQLNDWPNDIFYYENNVRVDGPLIQDWWEFRSGVIQSFFG 781 Query: 277 KVSHTIKSIK----PGVEFGVSPAGVWRNRSHDPLGSDTRGA-----------AAYDESY 321 +V + S + ++ + + + ++ + Y Y Sbjct: 782 EVKSLVDSYEAGSGRTIKVSSYVGSWYETYYLNGVNWASKNFRIHPSLGLPVESIYTPEY 841 Query: 322 ADTRRWVEQGLLDYI-APQI--YWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKV 378 DT G ++Y+ I Y S+ +Y L ++V LY GIA Sbjct: 842 YDT------GYIEYLDFLMIGAYQTTSQEIQKYITL----GNIVTNGEIPLYAGIAL--- 888 Query: 379 GEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLN 422 N +P +++++ + +G +LF +N Sbjct: 889 -----------NNVQLPAVQREV-FQAGLRTTNGLMLFDASQIN 920 >UniRef50_C3Y3M5 Putative uncharacterized protein n=3 Tax=Branchiostoma floridae RepID=C3Y3M5_BRAFL Length = 399 Score = 95.6 bits (236), Expect = 4e-18, Method: Composition-based stats. Identities = 63/340 (18%), Positives = 109/340 (32%), Gaps = 81/340 (23%) Query: 51 QSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQV 110 +R WL+ L+ GIN V+ V Sbjct: 104 SPYADVRATWLS------------------RYDHATSLAETAATFATLKAKGINRVYLNV 145 Query: 111 KPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTK 170 G + S+ + D L + ++E K G++V AWF Sbjct: 146 WASGQIYFQSRTFESLGIRGFVR------DVLGWAVEEGQKNGIEVWAWFE--------- 190 Query: 171 PGTIRELNSTLSQQPASVYVQHRDWIR-TSGDRFVLDPGIPEVQDWITSIVAEVVSRYP- 228 G S+ + S V WI+ +G+ + +D G +V D++ ++ + V YP Sbjct: 191 YGLKACWGSSPTVTVFSNKVYGLGWIKGQAGEYWWMDAGNTQVLDFLAGMMQDAVDNYPG 250 Query: 229 VDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPG 288 + GVQ DD+F + ++ G A+ N ++ +VS + Sbjct: 251 LAGVQLDDHF-----------VQPWQLGTGLVATMT----NAASHILGQVSGRV------ 289 Query: 289 VEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQ--GLLDYIAPQIYWPFSR 346 +SP + Y+ D W G +Y PQIY Sbjct: 290 ---SLSPIA-----------PPSLSLTNYN---VDWASWARDDIGFHEY-VPQIYRE--- 328 Query: 347 SAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEP 386 A+ ++ V +T+L G+ G P+ Sbjct: 329 DASVFNTDLDRVMSEVG--KTKLVPGLRCIGSGSPTTYSA 366 >UniRef50_UPI0001BC8648 hypothetical protein BacD2_02792 n=1 Tax=Bacteroides sp. D2 RepID=UPI0001BC8648 Length = 891 Score = 94.4 bits (233), Expect = 8e-18, Method: Composition-based stats. Identities = 54/308 (17%), Positives = 86/308 (27%), Gaps = 40/308 (12%) Query: 60 WLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDK-LDHLQRLGINTVFFQVK-PDGTAL 117 W+ V+ + + + R I +D GI + +K G L Sbjct: 14 WIGIVAFAS-GKPKVMWLDCSANFQRFSYPDSIRYYVDKCHEAGITHLVLDIKDNTGEVL 72 Query: 118 WPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIREL 177 +PSK P +D + ++ AH M + A N + N Sbjct: 73 YPSKYAIQKKNWKNFDR--PDFDFINTFIEAAHTHNMIIFAGMNIFADGQNIVKRGAVFD 130 Query: 178 NSTLSQQPASVYVQHRDWIRTSGDR--FVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFD 235 Q V + + + L+P + EVQ + I+ EVV Y DG+ D Sbjct: 131 KHKKWQAINYVPRKGLLPVTEIDGKPTMFLNPALKEVQKYEIDIIKEVVRNYAFDGIMLD 190 Query: 236 DYFYTESPGSRLNDNET-YRKY---------------------------GGAFASKADWR 267 Y +++ + K+ G + WR Sbjct: 191 RARYDCIDSDFSPESKKMFEKFIGKKVERFPEDIFEWRPNAEGGIDRVGGPYYHQWITWR 250 Query: 268 RNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRW 327 + I V +IK IKP + YD D W Sbjct: 251 TSVIYNFIKDVRTSIKKIKPECMLAAYTGA---WYPTYFEVGVNWASRQYD-VSKDF-SW 305 Query: 328 VEQGLLDY 335 DY Sbjct: 306 ATPDYKDY 313 >UniRef50_C3R3K8 S-layer protein n=4 Tax=Bacteroides RepID=C3R3K8_9BACE Length = 672 Score = 94.1 bits (232), Expect = 9e-18, Method: Composition-based stats. Identities = 55/334 (16%), Positives = 110/334 (32%), Gaps = 60/334 (17%) Query: 80 PTSRARVQQQAMIDKLDHLQRLGINTVFFQVK-PDGTALWPSKILPWSDLMT-----GKI 133 P ++ ++A+ +++ ++ G ++ VK P+G + L + +T K Sbjct: 275 PNAKVLTNREAVATMVNNAKKAGFTSIGLDVKGPEGYVSYRKNDLSKTPYLTATKNPNKQ 334 Query: 134 GENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSV---------NTKPGTIRELNSTLSQQ 184 ++ G+D L+ +L EAHK G+KV+ FN + + + + Sbjct: 335 VKDDGFDLLEVVLQEAHKIGLKVYTSFNFFTEGNITVNDYAILHEHKDWEEIVQRPEDKG 394 Query: 185 PASVYVQHRDWIRTSGDRF----VLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYT 240 + + + ++P EVQD+ V EV+ Y +DG+ D Y Sbjct: 395 KLLKITESTRGKEAAKGKLLALAFVNPSNKEVQDFQLLRVEEVLKNYDIDGIVLDRCRYD 454 Query: 241 ESPGSRLN-DNETYRKY--------------------------GGAFASKADWRRNNTQQ 273 + + +Y G F +R Sbjct: 455 NLYADFSHVTRNAFEEYLEKEGKILENFPADAFKIDKEGTLVKGQFFKEWITFRSQTICD 514 Query: 274 LIAKVSHTIKSIK----PGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDE--SYADTRRW 327 ++ + K P ++ G W + + YD+ S+ D+ + Sbjct: 515 FTGRIRSLVDKYKTEKNPDLKMAAY-VGSWYEVYYQNGVNWASNQFKYDDRLSFPDSEIY 573 Query: 328 VEQ-------GLLDYIAPQIYWPFSRSAARYDVL 354 E LD++ Y+ + RY L Sbjct: 574 GENYNKTSYLNNLDFLMIGTYYKTPKEVNRYITL 607 >UniRef50_A9KMJ8 Putative uncharacterized protein n=1 Tax=Clostridium phytofermentans ISDg RepID=A9KMJ8_CLOPH Length = 1263 Score = 89.0 bits (219), Expect = 3e-16, Method: Composition-based stats. Identities = 54/335 (16%), Positives = 104/335 (31%), Gaps = 57/335 (17%) Query: 89 QAMIDKLDHLQRLGINTVFFQVKP-DGTALWPSKILPWSDLMTGKIGENPG----YDPLQ 143 + + + + +R GI + F VK +G + + + MT N D L+ Sbjct: 461 EKIQKMMANAKRAGITALAFDVKGVEGYVSYKKATVSNTPYMTETKNPNKAVAMDIDFLE 520 Query: 144 FMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDW-------I 196 ML EAH G+K++A N + ++ +T Sbjct: 521 EMLAEAHANGIKLYASSNFFTEGNIATNDYAFDIRNTHPDWAEVFQTPEDKGELKSILNS 580 Query: 197 RTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLN-DNETYRK 255 + ++P EV+ +IV +V+ Y VDG+ D Y N E + Sbjct: 581 SRNSTLLFVNPANEEVRAHELAIVKDVLENYAVDGIILDRARYDNQYADFSNLSKEQFMA 640 Query: 256 Y-GGAFASKADW-------------------------RRNNTQQLIAKVSHTIKSIK--- 286 Y G + +W R + +++V I K Sbjct: 641 YLQGKGKTLQNWPDDAFKIKADGSMVTGQHYLEWLSYRSTVIESFVSEVRTLIDQYKTSQ 700 Query: 287 -PGVEFGVSPAGVWRNRSHD-------PLGSDTRGAAAYDESYADTRRWVEQGL---LDY 335 ++ + + + + R +E YA + + +D+ Sbjct: 701 NRNIDLAAYVGSWYESYYQNGVNWADSSFEYNERLGFPMEELYAKEFEYSKTSYVKHIDF 760 Query: 336 IAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLY 370 I Y+ +Y L +++ + LY Sbjct: 761 IMTGCYYTTEALMQKYTTL----NNILINNQVPLY 791 >UniRef50_A9KIP9 Putative uncharacterized protein n=1 Tax=Clostridium phytofermentans ISDg RepID=A9KIP9_CLOPH Length = 690 Score = 86.3 bits (212), Expect = 2e-15, Method: Composition-based stats. Identities = 58/344 (16%), Positives = 104/344 (30%), Gaps = 67/344 (19%) Query: 89 QAMIDKLDHLQRLGINTVFFQVKP-DGTALWPSKILPWSDLMTGKIGENPGY----DPLQ 143 + + + + GI VK +G A + L MT + D L+ Sbjct: 311 ERIESLIQTAKDAGITAFALDVKGCEGYAAYRKSTLTNVKYMTETTNPKKAFQMEIDFLE 370 Query: 144 FMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQH-RDWIRTSGDR 202 + AH G++V+A FN + +L +T P V H + Sbjct: 371 EFVKAAHASGLRVYASFNFFVEGNIASNDFAIDLPNT---HPDWAEVLHVPEDQGELKSV 427 Query: 203 F---------VLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGS-------- 245 ++P EVQD+ V E++ Y VDGV D Y Sbjct: 428 LETKRNCMLCYVNPANKEVQDFELLRVKELLDNYEVDGVIMDRTRYDNQYADFSEVTRIQ 487 Query: 246 -------------------RLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTI---- 282 D E +G + +R + Q ++ + Sbjct: 488 FVEYLKSKGKELVHWPKDIYSFDAEQKMIFGPLYLDWLTFRSSIIQGFARRLRGIVDEYA 547 Query: 283 KSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDE--------SYADTRRWVEQGLLD 334 K+ K + G W + + + Y+E Y T + + +D Sbjct: 548 KNQKRPIALAAY-VGSWFDLYYQNGVNWGSKDFRYNERLNFPVSKLY--TEDYSKTSYVD 604 Query: 335 YI-APQI--YWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAF 375 YI I Y+ S +Y + ++V + + ++ Sbjct: 605 YIDFLMIGCYYGTSEMIEKYTTI----GNIVTNHKVPMMASMSL 644 >UniRef50_C7E4U8 Putative uncharacterized protein psa8 n=1 Tax=Pantoea stewartii subsp. stewartii DC283 RepID=C7E4U8_ERWST Length = 106 Score = 83.6 bits (205), Expect = 1e-14, Method: Composition-based stats. Identities = 40/65 (61%), Positives = 49/65 (75%) Query: 373 IAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAVSY 432 +A YKVG PS IEPDW I GGVPE +QL LND++ E+ G +LFR +L +PQTQQ V Y Sbjct: 1 MALYKVGTPSAIEPDWTIEGGVPETTRQLGLNDSLEEVGGCMLFRHMFLREPQTQQVVDY 60 Query: 433 LQSRW 437 L+SRW Sbjct: 61 LRSRW 65 >UniRef50_C5A3T6 Glycosyl hydrolase, putative n=1 Tax=Thermococcus gammatolerans EJ3 RepID=C5A3T6_THEGJ Length = 909 Score = 83.6 bits (205), Expect = 1e-14, Method: Composition-based stats. Identities = 62/386 (16%), Positives = 120/386 (31%), Gaps = 89/386 (23%) Query: 70 PPVSSVNISNPTSRARVQQQAMIDKL-DHLQRLGINTVFFQVKPD-GTALWPSKILPWSD 127 P V + P + ++L L+ GI VF +VK G ++PSK+ P Sbjct: 65 PFDEKVYPTIPEDVKEKALKTAAERLVSELKEAGITDVFIEVKLTLGYVIYPSKVYPERT 124 Query: 128 LMTGKIGENPGY-----DPLQFMLDEAHKRGMKVHAWF----NPYRVSVNTKPGTIRELN 178 P Y + L+ +L+EAH+ G++VHAW + Y + + + Sbjct: 125 Y--------PAYPYNTTNILKPLLEEAHRNGIRVHAWMIVHYDKYFFGKTDPIWHVGKAS 176 Query: 179 STLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYF 238 P +R S ++ + +I E++S DG+ D Sbjct: 177 KNWEAYPV------PGRVRLSNKEYL---------KVLENIAKELISM-GFDGIHLDYIR 220 Query: 239 YTESPGSRLNDN-----------------------------------------ETYRKYG 257 Y S + ++ Y Sbjct: 221 YPHMVYSFSPKDLERAEEAGINVTKVTLAVEHTFYNDVPIPGTNKTMGPKDPYYIFKLYV 280 Query: 258 GAFASKADW---RRNNTQQLIAKVSHTIKSIKPG----VEFGVSPAGVWRNRSHDPLGSD 310 W RR + + ++ + S+K + W + L + Sbjct: 281 KGDKDIVKWFELRRKDVDSYVGNITQVVHSLKTWNGEKPIVSAALMPDWT--RDNILYPE 338 Query: 311 TRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLY 370 Y + ++D +V+ G +D++ P Y+ + + K + T++ Sbjct: 339 EFQIMHYAQVWSD---FVKLG-VDWLIPMAYFKDYGEPISWVGVVKGHLVGITGTKSVPL 394 Query: 371 IGIAFYKVGEPSKIEPDWMINGGVPE 396 +G+ Y + +E PE Sbjct: 395 VGVQSYGIPMEKVLEEKDFALSEFPE 420 >UniRef50_A8F7H1 Putative uncharacterized protein n=1 Tax=Thermotoga lettingae TMO RepID=A8F7H1_THELT Length = 370 Score = 81.7 bits (200), Expect = 4e-14, Method: Composition-based stats. Identities = 61/367 (16%), Positives = 107/367 (29%), Gaps = 69/367 (18%) Query: 93 DKLDHLQRLGINTVFFQVKPD-GTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHK 151 + L+ +G +F K GT WPSKI I + L + K Sbjct: 26 KAVKELKEMGFTDLFILAKGTTGTVYWPSKIA---------ISVSKNNAVLPKASEICKK 76 Query: 152 RGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPE 211 +++HAWF + K + ++ + Y V D Sbjct: 77 LNIRLHAWFIVSQDKSYLKLNPSSGMWGIPLEELSHEYRLGEHTCFRVSTSVV-DFTDQN 135 Query: 212 VQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDN-------------------ET 252 +++ S++ E+VS Y +G+ D Y +T Sbjct: 136 YREYFFSLIKEIVSNYEPEGIHLDYIRYPNGAWGWGPSQIHRLRIFDLDGEKLLKKAIQT 195 Query: 253 YRKYGG----------AFASKADW---RRNNTQQLIAKVSHTIKSIKPGVEFGVS--PAG 297 + + G W R ++ + S IK+++ + F + P G Sbjct: 196 WGRNGDGRSFLDAFEHGDPDVIKWVELRVDDVKDFTQATSQLIKNMRDSIIFSAALIPEG 255 Query: 298 VWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKW 357 N S S G Y D L D + P Y ++ W Sbjct: 256 GDPNPSERNFASIHCGQR-----YQDFAE-----LCDLMLPMAYH------QDFNRSNSW 299 Query: 358 WADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFR 417 D+ K T ++ +G I N E+ + + + G +F Sbjct: 300 IEDITKATH-KIAMG------KSRVVIGIQAHNNIRTHEVVEAIKIAQN-SGADGVCIFA 351 Query: 418 EDYLNKP 424 + K Sbjct: 352 FHEVFKN 358 >UniRef50_A8F7H3 Putative uncharacterized protein n=1 Tax=Thermotoga lettingae TMO RepID=A8F7H3_THELT Length = 560 Score = 79.4 bits (194), Expect = 3e-13, Method: Composition-based stats. Identities = 74/469 (15%), Positives = 139/469 (29%), Gaps = 121/469 (25%) Query: 13 RRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWL--ATVSRLDWP 70 R+ +I V +A + K + ++ +W+ +T++ L Sbjct: 158 RKISIDVFIATVSKILKEFSDTGKLPNDVILSKALLPFSWPAKIKAVWVWGSTLANLG-- 215 Query: 71 PVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPD-GTALWPSKILPWSDLM 129 + + L L+ +G + VK GT WPS+I Sbjct: 216 --------------------VENTLQQLKEIGFTDILLLVKGTSGTVNWPSQIA------ 249 Query: 130 TGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVY 189 +G + L G+++H WF V + T S + P +V Sbjct: 250 ---LGFSSDTTVLPRASKFCRTSGLRLHVWF----VCNQDQTFTSTYPESKMYGIPKTVD 302 Query: 190 VQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESP------ 243 + +T G E ++++ S++ EV+ Y DG+ FD Y Sbjct: 303 GDPQRAGKTVDFV-----GFDEYREYMESLIREVMEDYKPDGLHFDYIRYPTGAWGWGPA 357 Query: 244 --------------------------GSRLNDNETYRKYGGAFASKADW---RRNNTQQL 274 G+ ++ Y A+ W R Q+ Sbjct: 358 EIQTAMENGLTELDIQYLKNLAIQTWGTNGDNQSFINAYISGDATVTKWVEIRSKIVQKF 417 Query: 275 IAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTR----GAAAYDESYADTRRWVEQ 330 + +S K +K V + +P D+ G Y ++YA + Sbjct: 418 LQDLSSCAKQVKSDVIISAA-------LMPEPASLDSTEKAFGLVHYGQNYA-----IFS 465 Query: 331 GLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKP-------TRTRLYIGIAFYKVGEPSK 383 + I P Y Y + W + + T+L +G+ Y Sbjct: 466 DDCEMIVPMAYHR------DYGKDSSWITEEIFTGARQQIQANTKLVLGLQGYS------ 513 Query: 384 IEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAVSY 432 EL Q+ + G +FR + + ++A+ Sbjct: 514 -------PVTGDELA-QIINDCIDKNAEGICVFRAGTILNTEIEEALRN 554 >UniRef50_D1JA21 Conserved hypothetical membrane protein, DUF187 family n=2 Tax=uncultured archaeon RepID=D1JA21_9ARCH Length = 1594 Score = 77.5 bits (189), Expect = 1e-12, Method: Composition-based stats. Identities = 43/233 (18%), Positives = 82/233 (35%), Gaps = 42/233 (18%) Query: 94 KLDHLQRLGINTVFFQVKPD-GTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKR 152 ++ L+ ++TV +K D G +PS++ IG++ + + +D+AH+ Sbjct: 114 IINKLKSGNVSTVIINLKDDNGFVYFPSEVA-----EEDAIGQD--INVTKVFIDKAHEE 166 Query: 153 GMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEV 212 G++V A + +R T+ P V + D + P E Sbjct: 167 GLRVFAALSCFR------------DPITVGDHPEWSQVDNED----KRSEEWICPLNDEY 210 Query: 213 QDWITSIVAEVVSRYPVDGVQFDDYFYTESP---------------GSRLNDNETYRKYG 257 ++++ ++ EV+ Y +DGV D+ Y S G + +Y Sbjct: 211 KEYLLNLREEVLG-YDIDGVVLSDFGYAGSDYCFCDLCKRGFWNDTGIDPGKVDLANRYS 269 Query: 258 GAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSD 310 +WR +K I P + G + + P G D Sbjct: 270 SNTQKWFEWRATMVTDFFVSFCKQVKKIDPEITIGARMQNPFDDY--YPAGYD 320 >UniRef50_C6AUQ1 Putative uncharacterized protein n=2 Tax=Rhizobium leguminosarum RepID=C6AUQ1_RHILS Length = 702 Score = 76.7 bits (187), Expect = 1e-12, Method: Composition-based stats. Identities = 49/244 (20%), Positives = 84/244 (34%), Gaps = 33/244 (13%) Query: 63 TVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGT-ALWPSK 121 T+ DW ++ ++ +D +R N V G A +PS+ Sbjct: 11 TLRTPDWFKTATRWTQLTFVEDDPEKYDPAFWIDVFKRTKSNAVCLSA--GGYIAYYPSE 68 Query: 122 ILPW---SDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELN 178 + P+ S + K D ++D A K M V A +P+ + + + Sbjct: 69 V-PYHYVSKYLGDK-------DIFGALVDAARKLDMHVMARVDPHAIHDDAAKAHPEWVM 120 Query: 179 STLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGV-----Q 233 P + + D T+ +PEV V E+V +Y +D V Q Sbjct: 121 INADGTPRRHW-AYPDVWVTNAYGDYNSVFMPEV-------VKEIVRKYDIDAVFANRWQ 172 Query: 234 FDDYFYTESPGSRLNDNETYR------KYGGAFASKADWRRNNTQQLIAKVSHTIKSIKP 287 Y+E R D + A+ + WRR +IA+ +K+I+P Sbjct: 173 GHGVDYSEDSARRFKDMSGHALPVKPDAEDPAWQAWVQWRRRVLTDMIAQWDDAVKAIRP 232 Query: 288 GVEF 291 F Sbjct: 233 HASF 236 >UniRef50_Q8YXF7 All1256 protein n=4 Tax=Nostocaceae RepID=Q8YXF7_ANASP Length = 500 Score = 76.7 bits (187), Expect = 2e-12, Method: Composition-based stats. Identities = 40/180 (22%), Positives = 73/180 (40%), Gaps = 12/180 (6%) Query: 69 WPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILP--WS 126 WP V ++ + ++ A+ +D + G N V+ +V DG L P+ P W Sbjct: 95 WPNVQALWLR--LYPCDMKPGAIDQIMDRMVNRGYNEVYLEVFYDGRVLLPASANPTVWP 152 Query: 127 DLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPA 186 ++ K E D L + + +RG+KV+ W Y + R+ +++ Sbjct: 153 SVIRTKGAEKV--DLLATAIQKGRQRGLKVYGWL--YTNNFGYNYALRRDREGAIARNGK 208 Query: 187 SVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSR 246 ++ +G + +DP + + +V E+V R DG+ FD Y GS Sbjct: 209 GQTSL---YVVDNGSQVFIDPYNEQAKRDYYRMVQEIVRR-RPDGLLFDYVRYPRQAGSN 264 >UniRef50_B8HXB6 Putative uncharacterized protein n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HXB6_CYAP4 Length = 528 Score = 76.7 bits (187), Expect = 2e-12, Method: Composition-based stats. Identities = 35/178 (19%), Positives = 65/178 (36%), Gaps = 7/178 (3%) Query: 69 WPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDL 128 WP V + + +Q + LD + G N ++ Q DG L P+ P L Sbjct: 115 WPQVKGIWLQLFACD--LQPGVLESVLDRIVSQGYNRIYVQTFYDGQVLLPANRNPTPWL 172 Query: 129 MTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASV 188 + D L ++ + +RG++V+AW + ++ + R+ TL Q Sbjct: 173 AVAQGSAFADRDLLAEVIQKGRERGLRVYAWVS--GMNYGSSYAQRRDRQQTLVQNGRQP 230 Query: 189 YVQHRDWIRTSGDR--FVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPG 244 ++ +DP P ++ ++ + V + DGV D Y Sbjct: 231 ATTPVGHTGQGFEQTAIFIDPYHPRTREDF-QLMLQAVLQRQPDGVLIDYLRYPRQSN 287 >UniRef50_Q5N184 Putative uncharacterized protein n=2 Tax=Synechococcus elongatus RepID=Q5N184_SYNP6 Length = 481 Score = 75.9 bits (185), Expect = 3e-12, Method: Composition-based stats. Identities = 41/177 (23%), Positives = 68/177 (38%), Gaps = 8/177 (4%) Query: 69 WPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDL 128 WP ++ + ++ + D LQ LG N VF + DG L P+ P Sbjct: 91 WPRKMAIWVR--LYSCDLRPGGLDSLFDGLQALGYNEVFIETFYDGRVLLPAADNPTVWP 148 Query: 129 MTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASV 188 D L + + +RGM V+AW + ++ + TL++ S Sbjct: 149 SVVAEPGLERVDLLAEAIRKGRERGMSVYAWL--FTLNYGYSYSQRSDRQDTLARNGRSE 206 Query: 189 YVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGS 245 I + G + +DP P + +++ V+SR DGV FD Y G+ Sbjct: 207 SSLE---IVSGGAQVFVDPFNPVARQDYQTLLRSVLSR-RPDGVLFDYVRYPRGTGA 259 >UniRef50_Q2RYS7 Tat (Twin-arginine translocation) pathway signal sequence domain protein n=1 Tax=Salinibacter ruber DSM 13855 RepID=Q2RYS7_SALRD Length = 389 Score = 74.8 bits (182), Expect = 6e-12, Method: Composition-based stats. Identities = 63/359 (17%), Positives = 108/359 (30%), Gaps = 59/359 (16%) Query: 43 SKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLG 102 S PPA +S S P + N T V + + + L+ Sbjct: 24 STPPARAPSAS--------TNASDPSPPDSAPTNWVWMTPELDVSGEEWRRRFERLRAHN 75 Query: 103 INTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNP 162 I+ + QV + A + S LP L+ +L A + G++VH W Sbjct: 76 IDAILPQVYTNSAAYYGSDFLPVEGEW------------LETILPPAKEVGLEVHGWMVS 123 Query: 163 YRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAE 222 ++ +E + + V + ++ P P VQD+I V E Sbjct: 124 MPCTIPKIVNQHKEW--FVVNRNGESAVDNPAYVDYYK---FTCPNRPGVQDFIERRVEE 178 Query: 223 VVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRR-NNTQQLIAKVSHT 281 + S +DG+ FD + + + + Y + D+ + + + Sbjct: 179 ITSIDGLDGIHFDYIRFPDVVIAEALQPK-YGIVQHEEQAPYDYCYCDVCRSKFERDHGA 237 Query: 282 --IKSIKP-----GVEFGVSPAGVWRNRSHDPLGSDTRGAA------AYDESYADTRRWV 328 P F N P+ + A ++ W Sbjct: 238 DPYDLEDPTTSTAWRLFRYESITNLVNDRLIPIARENGKAVSAATFPNWEAVRQRWHHW- 296 Query: 329 EQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVK---------PTRTRLYIGIAFYKV 378 LDY+ P +Y F + A W D + TRLY G+ V Sbjct: 297 ---DLDYVHPMLYHNFYHAGA------NWVRDETRAGIERLRGQGRSTRLYSGLNVGAV 346 >UniRef50_Q3B486 Xylanase/chitin deacetylase-like n=2 Tax=Chlorobium/Pelodictyon group RepID=Q3B486_PELLD Length = 830 Score = 74.0 bits (180), Expect = 1e-11, Method: Composition-based stats. Identities = 40/228 (17%), Positives = 66/228 (28%), Gaps = 46/228 (20%) Query: 130 TGKIGENPGYDPLQFMLDEAHKRGMK--VHAWFNPYRVSVNTKPGTIRELNSTLSQQPAS 187 T G D L K+G+K VHAW L + + Sbjct: 292 TFYYGPKSADDVFGRTLALLRKQGLKIKVHAWL------------PALSDGRALKKHHSW 339 Query: 188 VYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRL 247 + + P PEV ++ S V +++ Y V+G+ D Y + Sbjct: 340 AMTAQNGRLSAH----WMSPANPEVAAYMKSTVTDIIRNYGVEGIHLDRLSYPDLDYDYS 395 Query: 248 NDN-ETYRKYGG-------------AFASKADWRRNNTQQLIAKVSHTI-KSIKPGVEFG 292 +N + G + S +WR + AKV + K+ VE+ Sbjct: 396 RENIRAFAAAKGLGSVPALNELLTLHYNSWVNWRSMRVSEYAAKVGEAVRKAGNEDVEYS 455 Query: 293 VSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQI 340 G +D S + D+I P + Sbjct: 456 AEMQGKRVFNFNDVALSGQEVSLLAQSF-------------DFIVPML 490 >UniRef50_Q67N80 Putative uncharacterized protein n=1 Tax=Symbiobacterium thermophilum RepID=Q67N80_SYMTH Length = 384 Score = 72.9 bits (177), Expect = 2e-11, Method: Composition-based stats. Identities = 64/356 (17%), Positives = 103/356 (28%), Gaps = 89/356 (25%) Query: 27 SCKSTPPESMVTPPAGSKPPATTQQSSQPMRGI----WLATVSRLDWPPVSSVNISNPTS 82 P G P ++ P+RG+ W A L WP Sbjct: 43 GGSGNAQAPENGPQPGDGPARPAREMPDPVRGLHLSGWYAGSPDLVWP------------ 90 Query: 83 RARVQQQAMIDKLDHLQRLGINTVFFQVKP-DGTALWPSKILPWSDLMTGKIGENPGYDP 141 LD + GINT+ +K DG W S + P + + + + Sbjct: 91 -----------LLDWAKEAGINTIVLDLKAEDGYLSWESDL-PLAQEIGANMRKIAD--- 135 Query: 142 LQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTS-- 199 L + EAH+RG W + +Y +W Sbjct: 136 LPAFVAEAHERGF----WV----------------AGRIVVMNDQWLYKARPEWAIPGFD 175 Query: 200 -GDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGG 258 G +DP V + + E ++ VD +QFD Y+E N Sbjct: 176 GGAYSFMDPANENVWKYNVDVAKEAIAA-GVDEIQFDYIRYSEHLREGYN---------- 224 Query: 259 AFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYD 318 + A+ R + +K + GV G G+ G D Y Sbjct: 225 GKDTTAEQRTKPINDFLRYAMAELKPL--GVVVGADVFGLTT---SVAEGDDMEIGQDYR 279 Query: 319 ESYADTRRWVEQGLLDYIAPQIY--------WPFSRSAA-RYDVLAKWWADVVKPT 365 + ++DYIAP +Y + A Y+ + ++ T Sbjct: 280 QI---------AEIVDYIAPMVYPSHYAPYTYGLDNPNAHPYETVYNSMKKALERT 326 >UniRef50_B0C6V7 Putative uncharacterized protein n=3 Tax=Cyanobacteria RepID=B0C6V7_ACAM1 Length = 522 Score = 72.9 bits (177), Expect = 2e-11, Method: Composition-based stats. Identities = 38/185 (20%), Positives = 73/185 (39%), Gaps = 13/185 (7%) Query: 68 DWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSD 127 +WP ++ + ++ + LD + G N V+ +V +G L P+ P + Sbjct: 92 NWPQNQAIWLR--LHECDLEPGVLDTLLDRIVSRGYNQVYVEVFYNGRVLLPAANNPTTW 149 Query: 128 LMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPAS 187 + + D L + + RG+KV+AW + ++ + G + NS L++ Sbjct: 150 SSEIRNPKYANRDLLAETIKKGRARGLKVYAWM--FSLNYGHQYGQRSDRNSVLARNGQG 207 Query: 188 VYV------QHRDWIRTSG--DRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFY 239 + +G DR +DP + + ++ ++ R P DGV FD Y Sbjct: 208 KTSLTLLDYADPNINLDNGDIDRAFVDPYSAQARQDYARMLQAILQRKP-DGVLFDYIRY 266 Query: 240 TESPG 244 G Sbjct: 267 PRQTG 271 >UniRef50_B4D7E2 Putative uncharacterized protein n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D7E2_9BACT Length = 423 Score = 72.5 bits (176), Expect = 3e-11, Method: Composition-based stats. Identities = 47/252 (18%), Positives = 78/252 (30%), Gaps = 35/252 (13%) Query: 135 ENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRD 194 G D LQ + + H M+ +A V G + + + H + Sbjct: 119 RADGVDTLQCIAEGCHAADMQCYASVRMNAVYPLKANGWVGDSMARFFNSKFWW--DHPE 176 Query: 195 WIRTSGD---RFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNE 251 W S D + L PEV+ + IV EV+ R VDGV D + G + + Sbjct: 177 WRVRSRDGREQPSLSYAFPEVRARVLGIVREVLER-DVDGVDLDFLRHPPCFGYEESLVK 235 Query: 252 TYRK---------YGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNR 302 Y+ +R L+ ++ + + Sbjct: 236 GYQDRFHLDPQTIPDDHDERWLHYRAELMTGLLREIRQAV--------------DEAAKK 281 Query: 303 SHDPLGSDTR-GAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADV 361 PLG R A Y D W+++ LLD + +D + + Sbjct: 282 KGRPLGLSARIDHANYLLWGCDVDVWLQERLLDILVV-----SQHGLGGWDFDLRPFVQK 336 Query: 362 VKPTRTRLYIGI 373 K T +Y+G Sbjct: 337 AKGTGCAVYLGE 348 >UniRef50_Q11AV5 Putative uncharacterized protein n=1 Tax=Chelativorans sp. BNC1 RepID=Q11AV5_MESSB Length = 685 Score = 72.1 bits (175), Expect = 3e-11, Method: Composition-based stats. Identities = 37/225 (16%), Positives = 79/225 (35%), Gaps = 41/225 (18%) Query: 90 AMIDKLDHLQRLGINTVFFQVK--PDGT--ALWPSKILPWSDLMTGKIGENPGYDPLQFM 145 + LD ++ +G N V G + +P+++ + + P D Sbjct: 40 DVEKVLDFIEDMGCN-----VWLVNGGGILSFYPTRLEHQTRNPY--LDRRPSGDLFGDA 92 Query: 146 LDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVL 205 ++ H+RG++V A + + + + P ++V+ + + + + Sbjct: 93 VEAGHRRGIRVMA-----------RMDFSKVNQAVADRHPDWLFVRPDGRRQAAEGQVSV 141 Query: 206 DPGIPEVQDWITSIVAEVVSRYPVDGV--------QFDDYFYTESPGSRLNDNETYRKYG 257 DP Q+ + +V E++ RYP+DG +FD + + + Sbjct: 142 DPSGDYYQEKLLEVVDEMIDRYPLDGFFFNRAGFNEFDYAMHYHGVSQSEASKRGFAAFS 201 Query: 258 GAF--------ASKADWR---RNNTQQLIAKVSHTIKSIKPGVEF 291 G + WR L ++S IK+ +P V Sbjct: 202 GGQQLPTGPESPNYDLWRAYCAKVVGDLWVRISAHIKTRRPDVAL 246 >UniRef50_C1YLJ5 Uncharacterized conserved protein n=4 Tax=Bacteria RepID=C1YLJ5_NOCDA Length = 705 Score = 71.7 bits (174), Expect = 6e-11, Method: Composition-based stats. Identities = 38/164 (23%), Positives = 62/164 (37%), Gaps = 20/164 (12%) Query: 140 DPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTS 199 DP +++ A M V A +P+ V + L T P + W+ Sbjct: 87 DPFGALVEGARGLDMHVMARVDPHAVHADAASAHPEWLARTADGSPVEHWGYPGIWLTC- 145 Query: 200 GDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGV-----QFDDYFYTESPGSRLNDNETY- 253 P P +D+IT + E+V+RY VD V Q Y+E+ D + Sbjct: 146 -------PFGPYNRDFITEVAREIVTRYDVDAVFANRWQGHGISYSEAALRSFRDETGFD 198 Query: 254 --RKYG----GAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEF 291 R+ G A+ + WRR L++ ++ I+P F Sbjct: 199 LPRREGDTSDPAWRAYVVWRRRKLSDLVSLWDQAVRDIRPHARF 242 >UniRef50_D0MH73 Putative uncharacterized protein n=1 Tax=Rhodothermus marinus DSM 4252 RepID=D0MH73_RHOM4 Length = 322 Score = 70.9 bits (172), Expect = 8e-11, Method: Composition-based stats. Identities = 48/332 (14%), Positives = 91/332 (27%), Gaps = 88/332 (26%) Query: 145 MLDEAHKRGMKVHAWFNPY----RVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSG 200 ++ A G+++HAW + + + + +++P V Sbjct: 42 LIPLAQAEGIELHAWIPTMMRAELLETHPDWYAVNREGVSTAEKPPYV-----------D 90 Query: 201 DRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYT-------------------E 241 L P +P V+ ++ + + G+ D + Sbjct: 91 YYRFLSPCVPGVRSYLADYYDRMAQIEGLAGLHLDYIRFPDVILPITLQPKYGLVQDREY 150 Query: 242 SPGSRLNDNETYRKY-------------GGAFASKADWRRNNTQQLIAKVSHTIKSI-KP 287 P E ++ A + +R + ++ +++ + + KP Sbjct: 151 PPFDYGYHPECRAQFKAQTGIDPLELEDPSANEAWRQFRYDQITAVVRQIAERVHARGKP 210 Query: 288 GVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRS 347 A V+ A D RW LD + P IY F Sbjct: 211 ------LTAAVFPTPE-----------IARTLVRQDWPRWP----LDAVMPMIYHNFYDK 249 Query: 348 AARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAV 407 + A R LY G+ ++ EL + +D Sbjct: 250 PVAWIETATREGVEALGGRIPLYSGL--------------FIPALTPEELAQAIDYA-LA 294 Query: 408 PEISGTILFREDYLNKPQTQQAVSYLQSRWGS 439 SG LF + L Q L++R S Sbjct: 295 GGASGVSLFNVESLTDAHWQ----MLKTRLAS 322 >UniRef50_D1JE50 Hypothetical secreted protein n=1 Tax=uncultured archaeon RepID=D1JE50_9ARCH Length = 419 Score = 70.9 bits (172), Expect = 9e-11, Method: Composition-based stats. Identities = 49/328 (14%), Positives = 97/328 (29%), Gaps = 86/328 (26%) Query: 111 KPDGTALWPSK------ILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYR 164 G ++ SK P++ +T + +DP+Q+++D + ++VHAW + Sbjct: 128 WASGILMYNSKNHQELVYGPFNRSITQE-----NFDPMQYLVDRCEENNIEVHAWVPVF- 181 Query: 165 VSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVV 224 I LS QP S + D + +V+++ SI+ E++ Sbjct: 182 ------KDHIGAEMLNLSLQPESEHSVKPDDV--------------DVREYELSIITEIL 221 Query: 225 SRYP-VDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIK 283 YP + G+ D S + N YR+ + D + V + K Sbjct: 222 QTYPRIKGINLDYIR------SGDDANNEYRQDPDVAKAVVD--------FVKDVENITK 267 Query: 284 SIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWP 343 + A V+ + +G D + + +D + P Y Sbjct: 268 KQNK-----ILSADVFSSDWAYKVGQDIEEISNH---------------VDVLMPMTYHG 307 Query: 344 FSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGE--------PSKIEPDWMINGGVP 395 + W P IG + + + Sbjct: 308 DN----------CLWGGSTDPEWVGEVIGNYKTRYNSRYILAGIQGYETRGKETFDVTPE 357 Query: 396 ELKKQLDLNDAVPEISGTILFREDYLNK 423 ++ ++ G LF + Sbjct: 358 QIITAVNSARE-NGADGFALFNYVSFKE 384 >UniRef50_Q114S3 Putative uncharacterized protein n=2 Tax=Oscillatoriales RepID=Q114S3_TRIEI Length = 508 Score = 70.6 bits (171), Expect = 1e-10, Method: Composition-based stats. Identities = 72/376 (19%), Positives = 123/376 (32%), Gaps = 63/376 (16%) Query: 8 KKLTIRRPAILVALALLLCSCKSTPP-----ESMVTPPAGSKPPATTQQSSQPMRGIWLA 62 KK+ R+ + +L +L S + K P S + +R L Sbjct: 9 KKIKSRKYFHINSLGAILFSLAVNLSPWFSAKVQAQTDIYCKLPPEAIASKENLRQAVLE 68 Query: 63 TVSRL---------------------DWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRL 101 +WP + + AR + LD + Sbjct: 69 GNKNAEKQYQDILIKHNREVGNCRMRNWPRTQGIWLRLYPCDAR--PGEIDRILDKIVNQ 126 Query: 102 GINTVFFQVKPDGTALWPSKILP--WSDLMTGKIGENPGY---DPLQFMLDEAHKRGMKV 156 G N V+ + DG L P+ P W ++ PGY D L L +A +RG++ Sbjct: 127 GYNQVYIEAFYDGQVLLPAANNPTVWPSIL-----RVPGYENVDLLADSLKKAKERGLRA 181 Query: 157 HAWFNPYRVSVNTKPGTIREL--------NSTLSQQPASVYVQHRDWIRTSGDRFVLDPG 208 +AW R+ +TL P +V +Q++ + +DP Sbjct: 182 YAWVFTMNFGYTYSQLPNRQQALARNGRGQTTLDVIPDNVSLQNQ-LGASHAFHTFIDPY 240 Query: 209 IPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRK--YGGAFASKADW 266 P+ + +V EV+ R GV FD Y GS ++ Y A + Sbjct: 241 SPQARQDYNVMVNEVLKR-QPQGVLFDYIRYLRGMGSDSVADQVKDLWIYSEASQNVLLQ 299 Query: 267 RRNN------TQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDES 320 R N ++ + K T + I G +P W+ + S + Sbjct: 300 RAKNEAGKELIRKFVDKGYVTSQEIN-----GRTP--KWQRFFSPSINSRLTERGLETQI 352 Query: 321 YADTRRWVEQGLLDYI 336 + + QG+LD++ Sbjct: 353 WELSVAHAAQGILDFL 368 >UniRef50_A1S0G8 Putative uncharacterized protein n=1 Tax=Thermofilum pendens Hrk 5 RepID=A1S0G8_THEPD Length = 718 Score = 70.2 bits (170), Expect = 2e-10, Method: Composition-based stats. Identities = 50/234 (21%), Positives = 82/234 (35%), Gaps = 39/234 (16%) Query: 78 SNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVK-PDGTALWPSKILPWSDLMTGKIGEN 136 + D + L +TV + G A + S + K+ Sbjct: 9 FEDKLAQYADRVTGADIVGLAAELHCDTVVIFARDAWGRAYYDSAVA-------RKVASL 61 Query: 137 PGYDPLQFMLDEAHKRGMKVHAWF----NPYRVSVNTKPGTIRELNSTLSQQPASVYVQH 192 D L+ +++EAH+RG+KV A NP S + + + V+ Sbjct: 62 KSRDLLREVVEEAHRRGIKVVAMIGHTTNPELYSSHPEWAQRDRNGRVIHMDTDPQGVKD 121 Query: 193 R-DWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFY-TESPGSRL--N 248 + W + LD ++ EV+ RY VDGV D + Y + + N Sbjct: 122 KVRWPLMCLNSPFLD--------YVLREAEEVL-RYGVDGVFLDSFRYMPDVERACFCEN 172 Query: 249 DNETYRKY-GGAFASKADW-----RRN-------NTQQLIAKVSHTIKSIKPGV 289 + Y + GG S+ DW RR N + L +V ++ KPG Sbjct: 173 CRKAYAEEVGGELPSEEDWDSEAFRRAFAWRYRVNVKAL-ERVKDFVRKAKPGA 225 >UniRef50_B1XJ85 Putative uncharacterized protein n=3 Tax=Chroococcales RepID=B1XJ85_SYNP2 Length = 506 Score = 69.8 bits (169), Expect = 2e-10, Method: Composition-based stats. Identities = 35/180 (19%), Positives = 68/180 (37%), Gaps = 12/180 (6%) Query: 69 WPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKIL--PWS 126 WP ++ + + LD + G + ++ +V D AL P PW Sbjct: 100 WPRHQAMWLRVYPCDTKAGMIE--RILDDIVNKGYDQLYLEVFADSQALLPKNQNNTPWP 157 Query: 127 DLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPA 186 + EN D + ++++ RG++V+AW + ++ G + S +++ Sbjct: 158 SRIQTPGLEN--RDVMAEIIEKGRARGLEVYAWL--FGMNYGYVYGNRGDRQSVMARNGY 213 Query: 187 SVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSR 246 G + DP Q ++V +++ R DG+ FD Y G+R Sbjct: 214 GETSLD---FVQDGAQAFADPYNAIAQADYKNLVNQMLQR-QPDGILFDYIRYPRGTGAR 269 >UniRef50_A4AQ95 Putative uncharacterized protein n=2 Tax=Bacteroidetes RepID=A4AQ95_9FLAO Length = 373 Score = 69.0 bits (167), Expect = 3e-10, Method: Composition-based stats. Identities = 46/323 (14%), Positives = 88/323 (27%), Gaps = 77/323 (23%) Query: 134 GENPG-YDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQH 192 G+NP Y + ++ EA GM+ H W N K + + + Sbjct: 62 GQNPATYKRVGALVKEA---GMEFHTWIPTMVQGENPKIAKDLYAH----NRNGESAFEK 114 Query: 193 RDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTES-PGSRLND-- 249 ++ L P ++++ + V VDG+ D + + L D Sbjct: 115 PAYVNYYK---FLCPNKEGTYEFLSDMYGSVAEVEEVDGIHLDYIRFPDVILAEGLWDKY 171 Query: 250 ----NETYRKY-------------------------GGAFASKADWRRNNTQQLIAKVSH 280 ++ + +Y +R + +++ ++S Sbjct: 172 GLVMDQEFPEYDYCYCEKCTSDFKELTGIDINEVEDPSQIQEWKQFRYDLITKMVNRLSK 231 Query: 281 TIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQI 340 + G N + P S + + +W LD I P Sbjct: 232 VVHEK-----------GKVLNAAVFPGPSIAKKL-----VRQEWNKW----DLDAIYPMN 271 Query: 341 YWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPE---- 396 Y + + W V G+ G P+ N PE Sbjct: 272 Y-------NDFYLKGPEWVGEVTKEEVAAVKGLKPIYSGLFICPNPENKTNENDPENHGL 324 Query: 397 LKKQLDLN---DAVPEISGTILF 416 L +++ +G LF Sbjct: 325 LPSEIETAIRTSMENGAAGICLF 347 >UniRef50_A5FIA1 Hypothetical lipoprotein n=2 Tax=Flavobacteriaceae RepID=A5FIA1_FLAJ1 Length = 360 Score = 69.0 bits (167), Expect = 3e-10, Method: Composition-based stats. Identities = 64/356 (17%), Positives = 100/356 (28%), Gaps = 71/356 (19%) Query: 108 FQVKPDGTAL-----WPSKILPWSDLMTGKIGENPGYDP--LQFMLDEAHKRGMKVHAWF 160 F V A + + + D ++ N DP L+ ++ A K G+KVHAW Sbjct: 30 FGVWTTADAKKSDADYTKEFKKYKDGGIDEVLINTTTDPQLLKRLVPLATKEGLKVHAWI 89 Query: 161 NPYRVSVNTKPGTIRELNSTLSQQPASVYVQHR-----DWIRTSGDRFVLDPGIPEVQDW 215 R +S Q P V D L P E ++ Sbjct: 90 ----------MAMNRPGDSVALQHPEWYQVSKEGKSCFDNRPYVDYYQWLCPTRKESREH 139 Query: 216 ITSIVAEVVSRYPVDGVQFDDYFYTE--SPGSRLNDNETYRKYGGAFASKADWRRNNTQQ 273 + +V E+ ++ V D + + P S L Y + D+ + Sbjct: 140 VLHLVEELAKVEGIESVHLDYIRFPDIFLPISLLP---KYNLVQDVELPQFDF--CYCDE 194 Query: 274 LIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLG---------------SDTRGAAAYD 318 + I P S W+N + + T Y Sbjct: 195 CVKA-FEKIHHKNPKESHNTSIDMEWKNFRLNAIRGVVDDAYKIAHKHNKQLTAAVFPYP 253 Query: 319 ESYAD------TRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIG 372 E AD +W +D + P IY F Y+ W K L Sbjct: 254 EM-ADHMVRQRWDKW----NIDEVYPMIYHSF------YNEEIDWVGYATKQGVEDL--- 299 Query: 373 IAFYKVGEPSKIEPDWMING-GVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQ 427 + +KI I G KQ L G F + L++ + Sbjct: 300 -----EDKKTKINTGIYIPGLKNDAELKQAILEAKENGAVGVSFFDGNALSESNLK 350 >UniRef50_A0Z097 Putative uncharacterized protein n=2 Tax=Oscillatoriales RepID=A0Z097_9CYAN Length = 533 Score = 68.2 bits (165), Expect = 6e-10, Method: Composition-based stats. Identities = 37/189 (19%), Positives = 73/189 (38%), Gaps = 23/189 (12%) Query: 69 WPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWP--SKILPWS 126 W ++ + Q A+ LD L G N V+ +V DG L P PW Sbjct: 89 WLKDQAIWLR--LYPCDAQPGAIDQILDDLVNRGYNKVYLEVFYDGQVLLPVSDNNTPWQ 146 Query: 127 DLMTGKIGENPGY---DPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQ 183 ++ +PG D + + +RG++V+AW + ++ + + + L++ Sbjct: 147 SVL-----RSPGTETVDLFAEAVQKGRRRGLEVYAW--AFLLNYGYTYTLLPDRQNVLAR 199 Query: 184 QPASVYV--------QHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFD 235 D+ + ++ +DP P+ + +++ ++SR GV FD Sbjct: 200 NGEGETTVTAIAGGSNSDDFGESYTNQGFVDPYNPQARQDYQTLLNAILSR-RPQGVLFD 258 Query: 236 DYFYTESPG 244 Y + G Sbjct: 259 YVRYPKGLG 267 >UniRef50_C7PH83 Putative uncharacterized protein n=3 Tax=Sphingobacteriales RepID=C7PH83_CHIPD Length = 361 Score = 67.5 bits (163), Expect = 1e-09, Method: Composition-based stats. Identities = 45/267 (16%), Positives = 71/267 (26%), Gaps = 69/267 (25%) Query: 147 DEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLD 206 A + G++ H W + E + H ++ G L Sbjct: 82 RAAKENGIQAHRWIWTMNRGEKELLASHPEWY--AKNRKGESCATHPPYV---GYYRWLC 136 Query: 207 PGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTE------------------SPGSRLN 248 P PEV +++ V+S+ VDG+ D Y + P Sbjct: 137 PSKPEVINYLKEQAEAVLSKDYVDGLHLDYVRYCDVVLPVNLWSNYGIDQSKELPEYDFC 196 Query: 249 DNETYR--------------KYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVS 294 ET R ++ S +R N ++ ++ K K Sbjct: 197 YCETCREKYKEQKGVDPLDMEHPDQIPSWRRFRYNRITNVVNNLAAVAKQHK-------K 249 Query: 295 PAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVL 354 P + D R D W LD + P IY F Y Sbjct: 250 PISAAVFPTPDIAKRIVR---------QDWTNWP----LDSVNPMIYHGF------YKED 290 Query: 355 AKWWADVVK------PTRTRLYIGIAF 375 W + V + LY G+ Sbjct: 291 VNWIGEAVAEGVGGLHGKFPLYAGLYL 317 >UniRef50_B9XFU7 Putative uncharacterized protein n=1 Tax=bacterium Ellin514 RepID=B9XFU7_9BACT Length = 430 Score = 67.1 bits (162), Expect = 1e-09, Method: Composition-based stats. Identities = 37/232 (15%), Positives = 72/232 (31%), Gaps = 35/232 (15%) Query: 138 GYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIR 197 +D L ++ H+ G+++HAW VS+N + + P + + Sbjct: 141 NFDSLGTAVEYGHQIGLQIHAW-----VSINEDDHGWGIQSEFSRKHPEFRWRKRNGAAY 195 Query: 198 TSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYG 257 S F PEVQ + +I+ E+++ Y +DG+ D + + D + G Sbjct: 196 HSQLSF----AFPEVQKYKLAILKELLANYKIDGIFLDWIRTGDVRDNSQTDADGVADSG 251 Query: 258 GAFASKADW---------------------RRNNTQQLIAKVSHTIKSIKPGVEFGVSPA 296 + R + + V +K P Sbjct: 252 YEEPLVKQFKKKFGVDPHEVSNGDERWVRLRAEPQTEFMRAVRK-LKYSNNR----RLPI 306 Query: 297 GVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSA 348 V G + D W +GL++++ Y+ +A Sbjct: 307 AVMVGHPWHYRGEMNKIDGNLRGLLLDVATWAREGLMNFVVAAGYYRDGGTA 358 >UniRef50_A9WDB3 Putative uncharacterized protein n=5 Tax=Chloroflexaceae RepID=A9WDB3_CHLAA Length = 693 Score = 66.7 bits (161), Expect = 1e-09, Method: Composition-based stats. Identities = 62/355 (17%), Positives = 113/355 (31%), Gaps = 48/355 (13%) Query: 93 DKLDHLQRLGINTVFFQVKPD-----GTALWPSKILPWSDLMTGKIGENPGYDPLQFMLD 147 LD + R +N + +K D G + S++ P + P D Q +L Sbjct: 366 RFLDLIDRTELNAIVIDIKSDLRDDLGMVYYDSQV-PLVRELG---LSTPRVD-FQSILA 420 Query: 148 EAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDP 207 +A +RG+ A RV + + + + S + S + D+ LDP Sbjct: 421 KAKERGIYTIA-----RVQLFSHDNALSDARPEWSIRLRSTGEVYADYPGPGIRYAYLDP 475 Query: 208 GIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPG--SRLNDNETYRKYGGAFASKAD 265 V D+ ++ E D + FD + + G D + + + Sbjct: 476 TNQNVWDYNIALAVEAAQM-GFDEINFDYIRFPDWFGTREEFRDKLLFSEPIDPVGNPG- 533 Query: 266 WR-RNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADT 324 R + + + + H + S G V G N + D + Sbjct: 534 -RMYDVIIEFMQRAHHAVNSA--GAFMSVDVFGRVVNGPSLTIAQDMARMGEHT------ 584 Query: 325 RRWVEQGLLDYIAPQIY---W--PFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVG 379 DY+ P Y W A + V+ ++ + Sbjct: 585 ---------DYVCPMPYPSLWWGGLENIAVPVKFPYETLQIAVRNGGRQMAGSYGRQRPW 635 Query: 380 EPSKIEPDW---MINGGVPELKKQLDLNDAVPE-ISGTILFREDYLNKPQTQQAV 430 +P W ++ G E++ Q+D + PE SG +L+ + K AV Sbjct: 636 LQDHTDP-WSPVVVEYGPAEVRAQIDATEEQPEAASGWLLYDSANIYKGAFNGAV 689 >UniRef50_B1WYP8 Putative uncharacterized protein n=4 Tax=Chroococcales RepID=B1WYP8_CYAA5 Length = 492 Score = 66.7 bits (161), Expect = 2e-09, Method: Composition-based stats. Identities = 36/180 (20%), Positives = 67/180 (37%), Gaps = 12/180 (6%) Query: 69 WPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWP--SKILPWS 126 WP ++ + R ++ LD + G NTV+ + + L P PW Sbjct: 87 WPQEQAIWLRLYPCDVRS--GSIDAVLDRIVSKGYNTVYIETFANSQVLLPPADNPTPWD 144 Query: 127 DLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPA 186 ++ EN D L + + +RG+K++AW + ++ + L++ Sbjct: 145 TVVRTSGVENV--DLLAQTIKKGRERGLKMYAWL--FTLNFGYAYAQRSDRQQVLARNGK 200 Query: 187 SVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSR 246 G + +DP + Q ++ V+ R DGV FD Y G++ Sbjct: 201 GEDSTS---YVHDGAQAFVDPYNRQAQTDYYRLLEAVLKR-RPDGVLFDYVRYPRGTGTQ 256 >UniRef50_UPI0001C391E2 hypothetical protein AplaP_16720 n=1 Tax=Arthrospira platensis str. Paraca RepID=UPI0001C391E2 Length = 333 Score = 65.5 bits (158), Expect = 4e-09, Method: Composition-based stats. Identities = 44/198 (22%), Positives = 73/198 (36%), Gaps = 19/198 (9%) Query: 86 VQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWP--SKILPWSDLMTGKIGENPGY---D 140 Q A+ LD + G N V+ +V D L P + PW +M +PG D Sbjct: 105 AQPGALEKLLDDIINKGYNQVYLEVFYDAQVLLPAANNPTPWPSVM-----RSPGLEQVD 159 Query: 141 PLQFMLDEAHKRGMKVHAWF------NPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRD 194 + + + +RG++V+AW Y + N + + D Sbjct: 160 LMAQTIQKGRQRGLQVYAWMFMLNFGYTYTLLPNRAGVLAVNGSGETTVTANLDGTASND 219 Query: 195 WIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTE--SPGSRLNDNET 252 + + ++ +DP P + +++ V+ R GV FD Y P S + + Sbjct: 220 FGESYINQGFVDPYHPTARQDFNTLLNAVLRR-RPSGVLFDYVRYPRGLGPQSVASRVKD 278 Query: 253 YRKYGGAFASKADWRRNN 270 YG A S R NN Sbjct: 279 LWIYGDASQSALLQRANN 296 >UniRef50_B9YV30 Putative uncharacterized protein n=1 Tax='Nostoc azollae' 0708 RepID=B9YV30_ANAAZ Length = 111 Score = 65.5 bits (158), Expect = 4e-09, Method: Composition-based stats. Identities = 21/78 (26%), Positives = 29/78 (37%), Gaps = 13/78 (16%) Query: 229 VDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKAD-----------WRRNNTQQLIAK 277 +DG+QFDD+F P D T Y +A WR N + + Sbjct: 2 IDGIQFDDHFG--LPSELGYDAYTVALYKKEHRGQAPSKNPKDPEWLGWRANKITNFMKR 59 Query: 278 VSHTIKSIKPGVEFGVSP 295 V IK+ K V+P Sbjct: 60 VFTAIKANKKNCLVSVAP 77 >UniRef50_A0PYZ0 Putative uncharacterized protein n=3 Tax=Clostridium RepID=A0PYZ0_CLONN Length = 656 Score = 65.2 bits (157), Expect = 4e-09, Method: Composition-based stats. Identities = 47/250 (18%), Positives = 81/250 (32%), Gaps = 56/250 (22%) Query: 200 GDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGA 259 ++L P+ +++I +++ E+ + V G+ D Y GS E Sbjct: 428 NGGYMLSYYYPKYRNYILNVLKEISNTENVQGITLDFCRYPYIMGSESTLKEKI------ 481 Query: 260 FASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDE 319 + + KV I S K V F DP Sbjct: 482 ---------DIMNYFMKKVRQEIPSKKITVRF----------PYLDPKSYGL-------- 514 Query: 320 SYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVL--AKWWADVVKPTRTRLYIGIAFYK 377 D + WV +GL+D I P S Y+ ++ +V+ T LY+GI+ Sbjct: 515 ---DVKTWVNEGLVDRIIP--------SVISYEDFFNLDKYSKLVRGTNVELYLGISANV 563 Query: 378 VGEPSKIEPDWMINGG--------VPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQA 429 G + + + + G PE G +F + LN ++ Sbjct: 564 EGGDATKDSESLDEDGKLPGNKYLTPEEYIYRAYEGYKAGAKG--IFMFNTLNALDIEKD 621 Query: 430 VSYLQSRWGS 439 VS + G+ Sbjct: 622 VSPMYKLIGN 631 >UniRef50_A6LHH0 Putative uncharacterized protein n=6 Tax=Bacteroidales RepID=A6LHH0_PARD8 Length = 448 Score = 65.2 bits (157), Expect = 5e-09, Method: Composition-based stats. Identities = 37/260 (14%), Positives = 71/260 (27%), Gaps = 65/260 (25%) Query: 129 MTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASV 188 + G + P D + + A K G++V+AW + ++ L + P Sbjct: 57 IDGVMLNAPTPDDYRAAIPIAQKHGIEVYAWLWTM--------NPEHDRDAILKEHPEWF 108 Query: 189 YVQHR-----DWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFD-------- 235 V D + P +PEV+++I + ++G+ D Sbjct: 109 SVNRNGQSLADTTAYVDYYKFMCPALPEVREFIKKKIEAYCEVEGLNGIAIDYNRFVDVI 168 Query: 236 --------------------DYFYTESPGSRLNDNETYRKYGGAFAS----KADWRRNNT 271 D+ Y + + Y S +R + Sbjct: 169 LPTTLWPKYGIVQDQEYPQWDFGYHPAMIEKFKAAYGYDPREQEDPSQDEKWLQFRCDQI 228 Query: 272 QQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQG 331 ++ ++ + S G S P A D +W Sbjct: 229 TEVANMIADVVHS-----------YGKKMAASPFPTP-----KMASKMVRQDWGKW---- 268 Query: 332 LLDYIAPQIYWPFSRSAARY 351 LD + P +Y F + Sbjct: 269 NLDIVFPMVYHNFYTEDISF 288 >UniRef50_UPI0001B9ED67 hypothetical protein GYMC10_3557 n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI0001B9ED67 Length = 707 Score = 64.0 bits (154), Expect = 9e-09, Method: Composition-based stats. Identities = 35/254 (13%), Positives = 77/254 (30%), Gaps = 46/254 (18%) Query: 66 RLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPW 125 + W + + + R + + L+ N + A +PS ++ Sbjct: 2 GMKWWSTNRLRLIQNNLRETDADMDVDLLIRELKSFQANVLMMNAGGI-FAFYPSSLMHQ 60 Query: 126 --SDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQ 183 + +T D L +++AH G++ A + + S + Sbjct: 61 YVTPYLTK--------DLLGEAVEKAHANGLRFIA-----------RFDFSKAHESLWMR 101 Query: 184 QPASVYVQHRDWIRTSGDRFV--LDPGIPEVQDWITSIVAEVVSRYPVDGV--------Q 233 P Y L+ +V+ + EV+ RYPVDG+ Sbjct: 102 YPEWFYRDREGREVNYNGIVHTCLNGAYQQVKS--LESIQEVLERYPVDGIFFNMFGYQH 159 Query: 234 FDDYFYTESPGSRLNDNETYRKYGGA------------FASKADWRRNNTQQLIAKVSHT 281 +D P N +R+ G ++ +++++ ++ Sbjct: 160 WDYSGNFYGPCYCPNCRLRFREVTGEDLLAYTGPEHPMHELYRSFQERTSREMLERIHGF 219 Query: 282 IKSIKPGVEFGVSP 295 +KS++ V Sbjct: 220 VKSLRSDVAISTYH 233 >UniRef50_D0LIN5 Putative uncharacterized protein n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LIN5_HALO1 Length = 446 Score = 64.0 bits (154), Expect = 1e-08, Method: Composition-based stats. Identities = 41/246 (16%), Positives = 83/246 (33%), Gaps = 44/246 (17%) Query: 97 HLQRLGINTVFFQVKP-DGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMK 155 L+R +N V K G LWPS + P S + + ++P + +++ H+ G+ Sbjct: 98 ELKRARLNAVVIDAKTDTGHILWPSDV-PMSKAVQMPLIKDP-----RALIERFHEHGIY 151 Query: 156 VHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDW 215 V A ++ + P +L + ++ W+ EV D+ Sbjct: 152 VIARIVCFK--DDELPLVRPDLGVRSGSRGGRLFRAGSRWLDQYAG---------EVHDY 200 Query: 216 ITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLI 275 + ++ E R+ VD +Q D + + GS+ + S R + Sbjct: 201 LIALAMEW-QRFGVDEIQLDYIRFPKGRGSQ---YAKWLHSNDQSPS----RDALIAGFL 252 Query: 276 AKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDY 335 ++ + V V G+ DP G + ++ Sbjct: 253 ERLDRAV-----EVPLSVDVFGLTTLVDGDPRG-----------LGQTIEKMAP--YVEA 294 Query: 336 IAPQIY 341 ++P +Y Sbjct: 295 VSPMMY 300 >UniRef50_A3ZTB2 Putative uncharacterized protein n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZTB2_9PLAN Length = 1292 Score = 64.0 bits (154), Expect = 1e-08, Method: Composition-based stats. Identities = 43/245 (17%), Positives = 81/245 (33%), Gaps = 35/245 (14%) Query: 93 DKLDHLQRLGINTVFFQVKPDGTALWPSKIL----PWSDLMTGKIGENPGY-DPLQFMLD 147 + +LQ G N V +G+AL+PS+++ + + G +P D L+ +L Sbjct: 514 RLIQYLQHAGYNGASISVLSEGSALFPSELIRTNPKYDKGVYFLDGRDPVQKDVLEMLLR 573 Query: 148 EAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDP 207 + + GM + + + + L + G P Sbjct: 574 QFDRAGMTLFPAIE-FNAPLESLEKLRESLGDERLDLVDDQGRAAIEISSAGGQGAYYSP 632 Query: 208 GIPEVQDWITSIVAEVVSRY----PVDGV--QFDDYFYTESPGSRLN-DNETYRKYGGAF 260 PEV+ + ++ E+V+RY GV Q + Y + PG++ D T ++ Sbjct: 633 LHPEVRAAMEQLIEELVNRYGHHPSFGGVTLQLNANGYAQLPGAKWGMDRATVNRFLQET 692 Query: 261 ----------------------ASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGV 298 + WR Q + ++ KP + AG Sbjct: 693 RIASSIEELGQRPSTAILAQHRQAWLAWRAEQMSQFYEAAAQIVRRKKPTGLLMLDTAGS 752 Query: 299 WRNRS 303 +R Sbjct: 753 FRGYD 757 >UniRef50_Q2JK94 Putative uncharacterized protein n=2 Tax=Synechococcus RepID=Q2JK94_SYNJB Length = 573 Score = 63.6 bits (153), Expect = 2e-08, Method: Composition-based stats. Identities = 31/183 (16%), Positives = 57/183 (31%), Gaps = 16/183 (8%) Query: 69 WPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDL 128 WP + ++ + + D + LG N VF + + + + Sbjct: 111 WPQIQAIWVR--LHPCDANPGVLDQVFDQVVNLGYNRVFIE------TFYDGRSILPDGA 162 Query: 129 MTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASV 188 P D L L A +RG+ +AW + ++ G + L++ + Sbjct: 163 GGIWPSLQPNADLLDLALKAARRRGLSAYAWV--FSLNFGYSYGQRPDRQVALARNGRGI 220 Query: 189 YVQHRDWIRTSG-----DRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESP 243 D +DP P + +V ++ R DG+ FD Y Sbjct: 221 TTVLDPATALGEDLGSPDEVFVDPFHPLARRDFAELVRRILQR-QPDGILFDYIRYPRGS 279 Query: 244 GSR 246 G Sbjct: 280 GGS 282 >UniRef50_UPI0001746A87 hypothetical protein VspiD_03245 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001746A87 Length = 542 Score = 62.5 bits (150), Expect = 3e-08, Method: Composition-based stats. Identities = 56/338 (16%), Positives = 100/338 (29%), Gaps = 75/338 (22%) Query: 117 LWPSKILPWSDLM--------------TGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNP 162 WPS++LP ++ + G D L+ +D H RG A Sbjct: 87 WWPSQVLPLAEHCEWFKRRFNLPKASTSFTSYVLRGGDFLKLTVDRCHARGA---ACLVS 143 Query: 163 YRVSVNTKPGTIRELNSTLSQQPASVYVQHRDW--------IRTSGDRFVLDPGIPEVQD 214 +R++ + ++ Y H + + S R V + IPEV++ Sbjct: 144 FRLNDVHHKESADAPGNSHIASVPQFYSDHPQYRLGEQSRETQISWARHVQNWTIPEVRE 203 Query: 215 WITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQL 274 + + E+ Y +DG++ D Y Y A + + RR + Sbjct: 204 FKFRFIEELSRNYDLDGIELDFYRA--------------AAYFRAGETTREQRRQIMTEF 249 Query: 275 IAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLD 334 + +V T+ + AG R S G +A+D D + + G +D Sbjct: 250 MGRVRRTLDDH--------ARAGRRRWLSVRIPGQ----LSAHDAQGIDVQAFAAAG-VD 296 Query: 335 YIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFY-------KVGEPSKIEPD 387 + ++ P RL G A Y ++ Sbjct: 297 MFNLSASYHVEQTT-------------SLPEIRRLVPGAAIYLELTHVVSFNGDTRNRSH 343 Query: 388 WMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQ 425 + L + G LF Y + Sbjct: 344 RRTTTEIATTTAHLAYSR---GADGLSLFNFQYYRDYR 378 >UniRef50_A1HM88 Polysaccharide deacetylase n=1 Tax=Thermosinus carboxydivorans Nor1 RepID=A1HM88_9FIRM Length = 621 Score = 62.5 bits (150), Expect = 3e-08, Method: Composition-based stats. Identities = 50/345 (14%), Positives = 105/345 (30%), Gaps = 51/345 (14%) Query: 80 PTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGY 139 + + + + + +L++ INTV Q I + + Sbjct: 297 YDASPQQMAENLETAIAYLRKAKINTVILQ--SFADEQGTGNIQELYFYTSAAPVKK--- 351 Query: 140 DPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTS 199 D L ++ H+ +V+AW P G + + + Y + Sbjct: 352 DVLNHIIQRLHREKFQVYAWM-PTLAGQWLLTGHPEDEVAAWDNKNKGWYRRAT------ 404 Query: 200 GDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTE-----SPGSRLNDNETYR 254 P P V + + ++ + P+DG+ F D Y SP ++ E + Sbjct: 405 -------PFSPRVAAELKKVFRDLAAYNPIDGILFQDDLYLNDYEDFSPAAKAAFREKFN 457 Query: 255 KY-GGAFASKADWRRNNTQQLIAKVS-------HTIKSIKPGVEFGVSPAGVWRNRSHDP 306 + A + R+ + ++ ++ +P +F R+ P Sbjct: 458 RELTPAALNDPQVRQEWINLKVQAMNKLTAELMDEVRRYRPQAKF---------ARNIYP 508 Query: 307 LGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTR 366 + A A+ D +++ L DY Y + + W + Sbjct: 509 SAVLSPEAKAW--LAQDYAEFLQ--LYDYTVIMCYPALEKVPKP-----ERWLADLAKAA 559 Query: 367 TRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEIS 411 Y G A V + + + LKKQ+ + ++ Sbjct: 560 LA-YPGAAERTVFKLQAYDWEKKRWIAPAVLKKQVAILKENGAVN 603 >UniRef50_C7HH08 GTP-binding protein n=3 Tax=Clostridium thermocellum RepID=C7HH08_CLOTM Length = 419 Score = 62.1 bits (149), Expect = 4e-08, Method: Composition-based stats. Identities = 48/273 (17%), Positives = 94/273 (34%), Gaps = 50/273 (18%) Query: 71 PVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMT 130 V + I+ ++ + + +++ ++ + +NTV VK DG + S++ + Sbjct: 81 KVKGLYITGTSAGNKKFMERLVNLINTTE---LNTVVLDVKEDGKVNYASEVESVKKI-- 135 Query: 131 GKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYV 190 G E D ++ H + V +R ++ L+ + + + Sbjct: 136 GAYHELYNVD---EVIKLLHDNNIYVIGRIVCFR-------------DNYLAGKRVDLAI 179 Query: 191 QHRDWI--RTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLN 248 + +D R +G +P EV + I E V + D +QFD + + + ++ Sbjct: 180 KRKDGSIWRENGSIAWTNPYNKEVWRYNIDIAKEAVKK-GFDEIQFDYVRFPAAGKNEVD 238 Query: 249 DNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLG 308 YG KAD + + + I K GV + D G Sbjct: 239 -------YGENPIPKAD----AISGFLKEAASEI--NKMGVPVSADIFAIVCETPGDTEG 285 Query: 309 SDTRGAAAYDESYADTRRWVEQGLLDYIAPQIY 341 + D +DYI+P IY Sbjct: 286 IG----QVLERIGMD---------IDYISPMIY 305 >UniRef50_B8I5J6 Putative uncharacterized protein n=2 Tax=Clostridium RepID=B8I5J6_CLOCE Length = 443 Score = 61.7 bits (148), Expect = 6e-08, Method: Composition-based stats. Identities = 54/314 (17%), Positives = 102/314 (32%), Gaps = 47/314 (14%) Query: 28 CKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQ 87 K+ P ++ + +T Q+ G+ L+ ++ V +V ++ P++ + Sbjct: 61 IKTKPSDTTSANSTAAASDSTKQEQIPQSTGLQLS--PGIEQVKVKAVYLTGPSAGSA-- 116 Query: 88 QQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLD 147 + + + +NTV VK DG + + + NP + ++ Sbjct: 117 -ARIDKIISMAKNTELNTVVIDVKEDGAVNYTTNLDLVKKYGKQVKYYNP-----KDVIK 170 Query: 148 EAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDP 207 + H G+ V R+ V P + + P+ W+ +G +P Sbjct: 171 KLHDNGIYVI-----GRIVVFKDPVLAKNRADLGVKAPSGKL-----WL-ENGTTPWTNP 219 Query: 208 GIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWR 267 + EV D+ +I E +S Y D +QFD + N +G KA+ Sbjct: 220 YMEEVWDYNLAIAKEAIS-YGFDEIQFDYVRFPTGGKKSFN-------FGTNVPEKAE-- 269 Query: 268 RNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRW 327 +AK + GV + D R + Y Sbjct: 270 --AINGFLAKSQKELHQE-LGVPVSADVFAIIIESKLDGESIGQRFQEVGKDIYC----- 321 Query: 328 VEQGLLDYIAPQIY 341 I+P IY Sbjct: 322 --------ISPMIY 327 >UniRef50_D1CHN7 GTP-binding protein n=1 Tax=Thermobaculum terrenum ATCC BAA-798 RepID=D1CHN7_THET1 Length = 410 Score = 61.3 bits (147), Expect = 7e-08, Method: Composition-based stats. Identities = 47/265 (17%), Positives = 86/265 (32%), Gaps = 48/265 (18%) Query: 91 MIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAH 150 + ++ +N + VK D +W SK+ P + G E+ LQ + + H Sbjct: 106 YRRFMRLIEETELNAIVINVKNDDGKVWTSKV-PLAR-QIGASYEDFH---LQEFVRDMH 160 Query: 151 KRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRT-SGDRFVLDPGI 209 +RG+ V F +R + + +P R + + +DP Sbjct: 161 RRGIYVIGRFTTFR------------DPTLATARPDMAVRDIRGGVWEDNKGHRWVDPFN 208 Query: 210 PEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYR-KYGGAFASKADWRR 268 +V + ++ E+ + +D +QFD + + T +Y R Sbjct: 209 KKVWRYFGDLLEEIAAS-GIDEIQFDYVRFPVDGDLSKVEYLTPSTRYN---------RP 258 Query: 269 NNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDP-LGSDTRGAAAYDESYADTRRW 327 + + + S I+ K V G+ + G A Y Sbjct: 259 DTIEAFLRYASSRIRPHK--VFISADTYGLTVWSEKEQGTGQVLERLAPY---------- 306 Query: 328 VEQGLLDYIAPQIY-WPFSRSAARY 351 LDY +P IY F+ Y Sbjct: 307 -----LDYYSPMIYPDHFAPGTGGY 326 >UniRef50_A4IKZ2 Alpha-amylase family protein n=12 Tax=Bacillaceae RepID=A4IKZ2_GEOTN Length = 511 Score = 60.9 bits (146), Expect = 9e-08, Method: Composition-based stats. Identities = 32/195 (16%), Positives = 66/195 (33%), Gaps = 22/195 (11%) Query: 59 IWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTV----FFQVKPDG 114 I + + +D VN+++P + + KLD+++ +G + F+ +P G Sbjct: 39 IMVDRFNNMDSTNDQDVNVNDPKGYFGGDLKGVTAKLDYIKEMGFTAIWLTPIFKNRPGG 98 Query: 115 TALWPSKILPWSDLMTGKIGENPGY---DPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKP 171 + + +P + D L+ ++ EAHKR MKV + V Sbjct: 99 YHGY---------WIEDFYEVDPHFGTLDDLKTLVKEAHKRDMKVI--LDFVANHVGYDH 147 Query: 172 GTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDP----GIPEVQDWITSIVAEVVSRY 227 + + P + P PEV++++ + Sbjct: 148 PWLHDPAKKDWFHPKKEIFDWNSQEQVENGWVYGLPDLAQENPEVKNYLIDAAKWWIKET 207 Query: 228 PVDGVQFDDYFYTES 242 +DG + D + Sbjct: 208 DIDGYRLDMVRHVPK 222 >UniRef50_O26457 Conserved protein n=1 Tax=Methanothermobacter thermautotrophicus str. Delta H RepID=O26457_METTH Length = 735 Score = 60.5 bits (145), Expect = 1e-07, Method: Composition-based stats. Identities = 28/200 (14%), Positives = 56/200 (28%), Gaps = 53/200 (26%) Query: 229 VDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPG 288 V G+ D Y + N T ++ ++S +KS+ PG Sbjct: 281 VGGIHLDYIRYPGNAYQVPNSTAT------------------ITGIVRRISEAVKSVNPG 322 Query: 289 VEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSA 348 + + + + Y + LD + P +Y R Sbjct: 323 LLLSAA-------LMPEKGSNAYYYGQNYTQL---------AEYLDVLVPMVYKGNYRED 366 Query: 349 ARYDVLAKWWADVV-----KPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDL 403 + W +V + +++ G+ Y+ P + G + + Sbjct: 367 SS------WIQNVTAYIKARSPGAQVWTGLQTYRSDSDVTPIPADELRGDIDSALR---- 416 Query: 404 NDAVPEISGTILFREDYLNK 423 G LFR ++K Sbjct: 417 ----GGADGYALFRYGLIDK 432 >UniRef50_Q30RN5 Putative uncharacterized protein n=1 Tax=Sulfurimonas denitrificans DSM 1251 RepID=Q30RN5_SULDN Length = 410 Score = 60.2 bits (144), Expect = 2e-07, Method: Composition-based stats. Identities = 38/268 (14%), Positives = 79/268 (29%), Gaps = 44/268 (16%) Query: 87 QQQAMIDKLDHLQRLGINTVFFQVKPD-GTALWPSKILPWSDLMTGKIGENPGYDPLQFM 145 Q + D L + INTV VK + G + + + K N ++ Sbjct: 102 QSPKLRDILKIIDETDINTVVVDVKNEYGHTSFKTSFEQANSYGADKSIINRN---IEEF 158 Query: 146 LDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVL 205 + + + A + + I ++ + + +++ H + + Sbjct: 159 MSIMRAKNIYTIARIVTF----KDELQAINNVDYAIKNRDGTIWRNH-------DNMAWV 207 Query: 206 DPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKAD 265 DP ++ S+ E ++ D + FD + G Y ++ Sbjct: 208 DPFDVRAHNYAISVAQE-AAKVGFDEINFDYIRFPAKDG---------LLYSKESTQES- 256 Query: 266 WRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTR 325 R + ++ K GV G D T + A Sbjct: 257 -RIKAISDFLNLAQERLR--KYGVFISADTYGNICWTKDDNNIGQTVSSMAPH------- 306 Query: 326 RWVEQGLLDYIAPQIYWPFSRSAARYDV 353 +DY+AP +Y P ++ + Sbjct: 307 -------VDYLAPMLY-PSGFASGSFSF 326 >UniRef50_B4D3R6 Putative uncharacterized protein n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D3R6_9BACT Length = 551 Score = 59.8 bits (143), Expect = 2e-07, Method: Composition-based stats. Identities = 45/232 (19%), Positives = 86/232 (37%), Gaps = 48/232 (20%) Query: 112 PDGTALWPSKIL-PWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTK 170 P G+A +PSK+L P D + + + G D ++ ++ E +R +V F +R+ Sbjct: 98 PLGSA-YPSKVLLPVDDPLV-RQWLDAGIDWVEQLVAETRRRKREV---FWNHRICEVEF 152 Query: 171 PGTIRELNSTLSQQPASVYVQHRDWIRTSG--DRFVLDPGIPEVQDWITSIVAEVVSRYP 228 + S+ P + VQH DW+ + + ++ + +I+ E+ +RY Sbjct: 153 VPGVGH-----SKTPHPLKVQHPDWVVAADWWPHGTWNLAAEGLRAYKVAILRELTTRYD 207 Query: 229 VDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADW-RRNNTQQLIAKVSHTIKSIKP 287 DG+Q D + ++ W R + + + +V + Sbjct: 208 FDGLQID-----------------FSRHIPCLPVGRQWELRESVTEFLREVRRMTLEV-- 248 Query: 288 GVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYA---DTRRWVEQGLLDYI 336 + PL + D +A D R W EQ L+D + Sbjct: 249 ------------AAQRGRPLLLAAKVPQTIDGCHADGFDVRAWAEQRLVDIL 288 >UniRef50_B6W970 Putative uncharacterized protein n=1 Tax=Anaerococcus hydrogenalis DSM 7454 RepID=B6W970_9FIRM Length = 422 Score = 59.8 bits (143), Expect = 2e-07, Method: Composition-based stats. Identities = 42/312 (13%), Positives = 105/312 (33%), Gaps = 56/312 (17%) Query: 5 SRNKKLTIRRPAILVALALLLCSCKSTP----PESMVTPPAGSKPPATTQQSSQPMR--- 57 +NKKL + L ++ SC E + +++ +P+ Sbjct: 1 MKNKKLCFC----FLILTIIFTSCSLNKDKETSEKDFSSTEKIIYSKNSKKEKKPLGEPY 56 Query: 58 --GIWLATVSRLDWPPVSSVNISNPTSRARVQQ---------------QAMIDKLDHLQR 100 G+ +D+ +++ S+ ++ +A ++ L Sbjct: 57 TVGV-TPDDYNMDYDTSRLKSLNEKKSKYYPEEGVKGIYLNAYTAANPKAFKKIMNLLDE 115 Query: 101 LGINTVFFQV---KPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVH 157 +N V V + T + + T +I + + +++ HK+G+ V Sbjct: 116 TKLNAVVLDVKDDWGNITCKFDTN-NKDIKYATHEILDA------EDFINKMHKKGIYVI 168 Query: 158 AWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSG-DRFVLDPGIPEVQDWI 216 ++ SV T+ + P + + +G ++P + EV+++ Sbjct: 169 GRITTFKDSVITE------------KHPDWGFKLDDGSLWKNGHGEAFMNPFMDEVRNYD 216 Query: 217 TSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETY--RKYGGAFASKADWRRNNTQQL 274 +AE+ + D +QFD + E + Y ++ + + D R + Sbjct: 217 LQ-IAELAANAGFDEIQFDYIRFAE-GFETFHGKLDYPKGRWEKSKMDEGDKRIDAITSF 274 Query: 275 IAKVSHTIKSIK 286 + + +++ Sbjct: 275 VKEAREMLQAYD 286 >UniRef50_C7RFM8 Glycoside hydrolase clan GH-D n=34 Tax=Bacteria RepID=C7RFM8_ANAPD Length = 722 Score = 59.8 bits (143), Expect = 2e-07, Method: Composition-based stats. Identities = 31/160 (19%), Positives = 60/160 (37%), Gaps = 21/160 (13%) Query: 142 LQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGD 201 L ++ + HK+GMK W P +SV++ + P + S + Sbjct: 388 LSELISDVHKKGMKFGLWVEPEMISVDSD---------LYRKHPDWAIQAPKRGHSYSRN 438 Query: 202 RFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFA 261 + VL+ PEV ++ I+ +++S + +D +++D + G+ + ET + Sbjct: 439 QLVLNLANPEVVAYLKEILDDLLSNHDIDYIKWDYNRNITNIGNGKDYLETMEQSHKYML 498 Query: 262 SKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRN 301 D L+ + V F G RN Sbjct: 499 GFYD--------LVKYL----TEKHSDVLFESCSGGGGRN 526 >UniRef50_B3TAU5 Putative uncharacterized protein n=1 Tax=uncultured marine crenarchaeote HF4000_APKG8G15 RepID=B3TAU5_9ARCH Length = 393 Score = 59.8 bits (143), Expect = 2e-07, Method: Composition-based stats. Identities = 36/231 (15%), Positives = 74/231 (32%), Gaps = 39/231 (16%) Query: 132 KIGENPGYDPLQFMLDEAHKRGMKVHAWF------NPYRVSVNTKPGTIRELNSTLSQQP 185 KI + G+D + + AH GM+ + + P + +++ + Sbjct: 76 KIKNDLGWDDFEVVPKIAHDNGMRANLYVSLFDEGWPLPSREEQEVSYHNKMHGQHTSWQ 135 Query: 186 ASVYVQHRDWIRTSGDRF-----VLDPGIPEVQDWITSIVAEVVSRYPVDGV-------- 232 ++ H ++ VL +V+ + + +++ Y DG+ Sbjct: 136 STFSHDHPEYTVMDQSGSKRQWGVLCLAYEQVRKYFVKLYLDLLDGYDFDGLFVCFRSQS 195 Query: 233 ---QFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRR---NNTQQLIAKVSHTIKSIK 286 +F D F P +N + DWR + + ++ Sbjct: 196 KPAEFADQFNFNEPVRNDFENIYGKDILKDGFDLQDWRDLVGKYITDFLRMLKKELRRKN 255 Query: 287 PGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIA 337 + G+ V PLG+ T + R+WV GL+D + Sbjct: 256 ISLSVGLPRGDVI----GPPLGNTTL----------EWRKWVRDGLVDELV 292 >UniRef50_A4FBJ1 Putative uncharacterized protein n=1 Tax=Saccharopolyspora erythraea NRRL 2338 RepID=A4FBJ1_SACEN Length = 501 Score = 59.4 bits (142), Expect = 2e-07, Method: Composition-based stats. Identities = 55/305 (18%), Positives = 100/305 (32%), Gaps = 61/305 (20%) Query: 94 KLDHLQRLGINTVFFQVKP-DGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKR 152 L+ ++ I+TV +K G + S + M +IG GY + +D+ H Sbjct: 202 VLEMARQGRIDTVELDIKDESGEVPYDSAV-----PMANQIGAVKGYYNARQAVDQLHGM 256 Query: 153 GMKVH----AWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPG 208 G++V A+ +P + + G + T QP W G + Sbjct: 257 GVRVVGRLVAFKDPILGEASWRGGHPERVVQTAGGQP---------WTGGYGGFAFTNFA 307 Query: 209 IPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRR 268 P V+ + I E S + FDD Y + ++ G + Sbjct: 308 DPVVRQYNIDIATEAAS------LGFDDVLYDYVRRPDGAIEQ--MRFPGLTTTPE---- 355 Query: 269 NNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWV 328 + + ++S G G S G+ NR + D R+ Sbjct: 356 AGIADFLRQTQPAVRS--RGALLGASVFGISVNRP--------------TQIAQDIRQMA 399 Query: 329 EQGLLDYIAPQIY---WP------FSRSAARYDVL---AKWWADVVKPTRTRLYIGIAFY 376 + DYIAP +Y W Y+++ +A V+ T ++ + + Sbjct: 400 Q--YTDYIAPMVYPSHWGPGEFGVADPDTQPYEIVRNSLAEFAKAVEGTDVQIIPWLQDF 457 Query: 377 KVGEP 381 +G Sbjct: 458 SLGAS 462 >UniRef50_Q3A0T1 Putative uncharacterized protein n=1 Tax=Pelobacter carbinolicus DSM 2380 RepID=Q3A0T1_PELCD Length = 640 Score = 59.0 bits (141), Expect = 3e-07, Method: Composition-based stats. Identities = 56/317 (17%), Positives = 108/317 (34%), Gaps = 60/317 (18%) Query: 85 RVQQQAMIDKLDHLQRLGINTVFFQVKPD------GTALW-PSKILPWSDLMTGKIGENP 137 ++ + LD ++ + I TV+ Q D AL+ P+ LP + Sbjct: 304 EQTERNLGLLLDRIKAMQITTVYLQAFADPDGDGVADALYFPNDYLPVRADL-------- 355 Query: 138 GYDPLQFMLDEAHKRGMKVHAWFNPYRVSV-NTKPGTIRELNSTLSQQPASVYVQHRDWI 196 ++ + + L + G+KV+AW + + + T P + Sbjct: 356 -FNRVAWQLK--TRAGVKVYAWLPVLGFEIGRPDLLVLSQQPETGKISPDPHAYKR---- 408 Query: 197 RTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKY 256 L P +PEV+ + I ++ DGV F D S +R+ Sbjct: 409 --------LSPFVPEVRALVHGIYGDLARYADFDGVLFHDDAVLSSFEDAQPAAMAWRRE 460 Query: 257 GG---------AFASKADWRRNNTQQLI---AKVSHTIKSIKPGVEFG--------VSPA 296 G + + W R T+ LI +++ +++ +P +E ++ Sbjct: 461 RGLKDDLAGVWSEEERTRWSRLKTEYLIVFTQELADAVRAYRPEIETARNLYAPAVLASD 520 Query: 297 GVWRNRSHDPLGSDTRGAAA-----YDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARY 351 G R L + A Y E +D W+ + L+ IA +P + Sbjct: 521 GKLRFSQDPGLFLEAYDFTALMAMPYLEGASDPESWLRK-LVAAIAC---FPDGLKKTVF 576 Query: 352 DVLAKWWADVVKPTRTR 368 ++ + W +P T Sbjct: 577 ELQSIDWNGRKQPVPTE 593 >UniRef50_Q5I942 Alpha-amylase n=1 Tax=Anaerobranca gottschalkii RepID=Q5I942_9FIRM Length = 532 Score = 59.0 bits (141), Expect = 3e-07, Method: Composition-based stats. Identities = 48/274 (17%), Positives = 90/274 (32%), Gaps = 37/274 (13%) Query: 10 LTIRRPAILVALALLLC--SCKSTPPESMVTPPAGSKPPATTQQSSQPMR--GIWLATVS 65 +T + +L+ L+ C S T Q GI T Sbjct: 1 MTSKILRVLLVFLLIFAIVGCTSDKQGPQETYKNIDDTVTHGQNYDGSFSREGIQEVTFE 60 Query: 66 RLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPW 125 + + N + T +I+ LD+++ LG+N ++ G + ++ + Sbjct: 61 NGVFYQIFVYNFRDSTGDGVGDLGGIIESLDYIESLGVNGIWLTPITHGASYHKYDVVDY 120 Query: 126 SDLMTGKIGENPGY----DPLQFMLDEAHKRGMKV------------HAWFNPYRVSVNT 169 +P + D + ++ EAHKRG+KV H WF N+ Sbjct: 121 -------YAVDPEFGTMED-FETLISEAHKRGIKVIIDLVINHTSDRHPWFKAAASDPNS 172 Query: 170 KPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFV-----LDPGIPEVQDWITSIVAEVV 224 K + +P S + F L+ P V++ + +A+ Sbjct: 173 KFRDYYIWAAHDEPRPGSGWRHLSGTTWFYLAHFWERMPDLNFDNPAVREEVKR-IAKFW 231 Query: 225 SRYPVDGVQFD--DYFYTESPGSRLNDNETYRKY 256 VDG + D + Y++ P + +Y Sbjct: 232 LDKGVDGFRLDAAKHLYSD-PAKNHQFWNEFYQY 264 >UniRef50_B6KGT3 1,4-alpha-glucan branching enzyme, putative n=5 Tax=Toxoplasma gondii RepID=B6KGT3_TOXGO Length = 891 Score = 59.0 bits (141), Expect = 3e-07, Method: Composition-based stats. Identities = 38/160 (23%), Positives = 63/160 (39%), Gaps = 11/160 (6%) Query: 77 ISNPTSRARVQQQAMIDKLDHLQRLGINTV-FFQVKPDGTALWPSKILPWSDLMTGKIGE 135 +P S R + ++DKLDH++ L N V F V+ G P L T Sbjct: 442 TFSPASADRTVFETLVDKLDHIRSLNFNAVEFLPVQEFGGEW---GYNPRLLLATHGKYG 498 Query: 136 NPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDW 195 P D L+ ++DE H++G+ V F+ + K ++ + +Y + Sbjct: 499 TP--DQLRKLVDECHRKGLAVI--FDLVLNHGSAKLNSLWNWDGYGKDASGGIYFEGGG- 553 Query: 196 IRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFD 235 SG EV+D I + + Y DG++ D Sbjct: 554 --ESGWGRKFSFHKREVRDMILAAALCFIEEYGADGLRLD 591 >UniRef50_C0A7S7 Putative uncharacterized protein n=1 Tax=Opitutaceae bacterium TAV2 RepID=C0A7S7_9BACT Length = 558 Score = 59.0 bits (141), Expect = 4e-07, Method: Composition-based stats. Identities = 40/266 (15%), Positives = 77/266 (28%), Gaps = 54/266 (20%) Query: 136 NPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDW 195 + G DPL+ + + +++ F R +N + + QH ++ Sbjct: 133 DAGIDPLRVVQKFTREHNIEL---FWSMR--MNDNHDGYADNYGPALFDATRLKTQHPEY 187 Query: 196 IR-------TSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLN 248 + G ++ PE+++ V EV Y VDGV+ D + + Sbjct: 188 LLGTRGQRLPYGSWSAVNYARPEIRELAFRYVEEVCLNYDVDGVELDFFRHPV----FFP 243 Query: 249 DNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLG 308 + + L+ ++ G R + Sbjct: 244 STTRGQPCTDDELNMM-------TGLLRRI-----------RVMADAEGR-RRGRPLLIA 284 Query: 309 SDTRGAAAY-DESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRT 367 +AAY D W+ LLD + Y+ + W D V R Sbjct: 285 VRVPDSAAYAKTIGLDLETWLSGDLLDLLIVGGYFQLNE-----------WTDSVALARK 333 Query: 368 R-------LYIGIAFYKVGEPSKIEP 386 +Y + +V + K + Sbjct: 334 HGAGFGIKVYPSLDDARVPDGPKFDA 359 >UniRef50_A9B0X0 Putative uncharacterized protein n=1 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9B0X0_HERA2 Length = 679 Score = 58.6 bits (140), Expect = 4e-07, Method: Composition-based stats. Identities = 57/366 (15%), Positives = 111/366 (30%), Gaps = 67/366 (18%) Query: 81 TSRARVQQQAMIDKLDHLQRLGINTVFFQVKPD-----GTALWPSKILPWSDLMTGKIGE 135 T+ + ++ + D + + +N V +K D G + S+ L+ Sbjct: 357 TAATGSSKASLSELFDLVDQTEVNAVVIDIKLDIAGDVGGVGYLSQH----PLVLAAETS 412 Query: 136 NPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDW 195 + D ++++ EA KR + + + N P Sbjct: 413 SDYLDM-EWIVAEARKRDIYLI------------GRMAVMRDNRLADAHPEWAAQSKATG 459 Query: 196 IRT--SGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETY 253 G LDP P V ++ I E+ + + D VQFD Y P N + Sbjct: 460 GVWEDDGGLKWLDPFNPNVTEYNVGIAKEIAA-FGFDEVQFD---YIRFPSDGSTSNLVF 515 Query: 254 RKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVE-----FGVSPAGVWRNRSHDPLG 308 K NN + + + + +K + F + G R+ +G Sbjct: 516 SKPIDPK--------NNPEVMYEAIGNVLKRAHGDINGSGAFFSIDVFGYATWRNMWEIG 567 Query: 309 SDTRGAAAYDESYADTRRWVEQGLLDYIAPQIY--------WPFSRSAA-RYDVLAKWWA 359 A + DY+ +Y F + A Y+++ Sbjct: 568 QSLEIMADHT---------------DYVCAMVYPSHYDRNELGFDNADAYPYEIVKDSIE 612 Query: 360 DVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFRED 419 K + + + + + ++P + G E++ Q+ V G IL+ Sbjct: 613 KGQKRMEGKYAVQRPWLQAFTATWLDP--VTPYGRTEVRAQMQAVAEVEGTYGWILWNAA 670 Query: 420 YLNKPQ 425 P Sbjct: 671 NYYDPD 676 >UniRef50_A4BGI0 Alpha-galactosidase n=1 Tax=Reinekea blandensis MED297 RepID=A4BGI0_9GAMM Length = 708 Score = 58.2 bits (139), Expect = 5e-07, Method: Composition-based stats. Identities = 52/298 (17%), Positives = 102/298 (34%), Gaps = 62/298 (20%) Query: 17 ILVALALLLCSCKSTPPESMVTPP-----AGSKPPATTQQSSQPMRG-IWLATVSRLDWP 70 I +A L + P +S TP + + A +Q+ + +R + +V+ + P Sbjct: 240 IQLAEWLYPGEIELAPGDSYQTPTIVGSWSQNGLNALSQRFHRYLRQQVLDPSVAEVPRP 299 Query: 71 PVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDG---TALWPSKILPWSD 127 + + ++ +D +G+ G + + W+ Sbjct: 300 VHLNTW---EGIYFDHTPEHLLRMVDQAADMGVERFILDDGWFGARDDDH--AGLGDWTV 354 Query: 128 LMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPAS 187 M G L +++D RGM+ WF P + + Sbjct: 355 NMQKHPGG------LHYLIDAVKARGMEFGLWFEP-----------------EMVNPDSD 391 Query: 188 VYVQHRDWIRTSGD--------RFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFY 239 +Y H DW+ D ++VLD PEV +++ + + +++ Y + +++D Sbjct: 392 LYRAHPDWVLQVEDYEQLLGRYQYVLDLSRPEVSEYLWNSIDAILTEYDIRYIKWDMNRD 451 Query: 240 TESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAG 297 PGS G AS R Q + ++ I+ P VE +G Sbjct: 452 LVQPGS------------GGVASVH---RQ--TQALYQLMARIREAHPHVEIENCSSG 492 >UniRef50_C9KJL9 Alpha-galactosidase n=1 Tax=Mitsuokella multacida DSM 20544 RepID=C9KJL9_9FIRM Length = 749 Score = 58.2 bits (139), Expect = 6e-07, Method: Composition-based stats. Identities = 38/182 (20%), Positives = 60/182 (32%), Gaps = 32/182 (17%) Query: 119 PSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELN 178 S + W M G LQ + + AHKRG+K WF P VSV++ Sbjct: 377 NSSLGDWVVNMEKLKGG------LQGVAESAHKRGLKFGLWFEPEMVSVDSD-------- 422 Query: 179 STLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYF 238 P + S + VLD +V+D+I V +++ P+D V++D Sbjct: 423 -LYRAHPDWALRSPSYPMTFSRHQLVLDLSRADVRDYIVDSVCKILDTAPIDYVKWDFNR 481 Query: 239 YTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSH---TIKSIKPGVEFGVSP 295 + S K + + + + I P V F Sbjct: 482 HLTDAFSS--------KLPPERQGEVR------TRFVLGLYDVLERITQTHPDVLFESCS 527 Query: 296 AG 297 G Sbjct: 528 GG 529 >UniRef50_UPI00019691CD hypothetical protein BACCELL_01336 n=1 Tax=Bacteroides cellulosilyticus DSM 14838 RepID=UPI00019691CD Length = 368 Score = 58.2 bits (139), Expect = 6e-07, Method: Composition-based stats. Identities = 37/246 (15%), Positives = 64/246 (26%), Gaps = 65/246 (26%) Query: 146 LDEAHK-RGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFV 204 ++ A +G KVHAW +VN + ++ Y H ++ Sbjct: 81 IEAAKSYQGAKVHAWM----FTVNAPGDSAALVHPEWFDVNRIGYNSH-EYDPYVKHYKW 135 Query: 205 LDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFY--------------------TESPG 244 L P +PE + ++ A + + V D Y + Sbjct: 136 LSPSVPEARQYMKDKAASYAALEGLTSVHLDFIRYNDAVLGRRLQQHKFKIQQDTYRAEY 195 Query: 245 SRLNDNETYRKYGGAFAS-------------KADWRRNNTQQLIAKVSHTIKSIKPGVEF 291 K+ F +R N L+ ++ S Sbjct: 196 DFGYHPVAIEKFKKLFGYSPLDLQAPWMSPEWLQFRLNEVTSLVNEIVEATHSE------ 249 Query: 292 GVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARY 351 G + + P + A Y D W +D + P Y F Y Sbjct: 250 -----GKLVSAAVFPYPTR-----ARMTVYQDWPTW----KIDIVCPMNYQSF------Y 289 Query: 352 DVLAKW 357 +W Sbjct: 290 SESLEW 295 >UniRef50_Q3A0V9 Putative uncharacterized protein n=1 Tax=Pelobacter carbinolicus DSM 2380 RepID=Q3A0V9_PELCD Length = 395 Score = 58.2 bits (139), Expect = 6e-07, Method: Composition-based stats. Identities = 30/152 (19%), Positives = 52/152 (34%), Gaps = 19/152 (12%) Query: 88 QQAMIDKLDHLQRLGINTVFFQVKPD-GTALWP--SKILPWSDLMTGKIGENPGYDPLQF 144 + + ++R+G++TV +V G +P + +P + D L Sbjct: 43 YPEVAAEFARMRRMGLDTVVLRVFQRPGDRFYPFSNPRVPAGVYFSTDAAPVVD-DVLGS 101 Query: 145 MLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFV 204 + H G+KV AW + S + Sbjct: 102 LTALGHAAGLKVFAWMTTLSTPLPGAESLGGRRYDPASARIVPCEA-------------- 147 Query: 205 LDPGIPEVQDWITSIVAEVVSRYPVDGVQFDD 236 LDP EVQ + ++ A++ RY +DGV D Sbjct: 148 LDPFRIEVQQRLGTLFADLA-RYDIDGVLLQD 178 >UniRef50_Q5WAP8 Maltogenic amylase n=1 Tax=Bacillus clausii KSM-K16 RepID=Q5WAP8_BACSK Length = 589 Score = 57.8 bits (138), Expect = 7e-07, Method: Composition-based stats. Identities = 44/265 (16%), Positives = 88/265 (33%), Gaps = 58/265 (21%) Query: 70 PPVSSVNISNPTSRARVQQ---QAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWS 126 P + S P + Q ++D LD+LQ+LGIN ++ + + + Sbjct: 152 PENTLAWGSAPPTATNYFGGDLQGIVDHLDYLQKLGINGIYLTPIFKAFSNHKYDTIDYL 211 Query: 127 DLMTGKIGENPGYDPLQFMLDEAHKRGMKV-------HA--WFNPYRVSVNTKPGTIREL 177 + E L+ ++DE HKRG++V HA +F P++ + + + Sbjct: 212 KVDPQFGDETT----LKLLVDECHKRGIRVMLDAVFNHAGLYFPPFQDVLKHQQESEYRD 267 Query: 178 NSTLSQQPASVY-VQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDD 236 + Q P + D + L+ V+D++ ++ + + +DG Sbjct: 268 WFHIRQFPVRAEEPPNYDTFAFTPLMPKLNTANEAVKDYLLNVATYWIKEFDIDG----- 322 Query: 237 YFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQ----LIAKVSHTIKSIKPGVEFG 292 WR + + + + +K++KP + Sbjct: 323 -----------------------------WRLDVANEVDHAFWREFRNRVKALKPDLYI- 352 Query: 293 VSPAGVWRNRSHDPLGSDTRGAAAY 317 +W N G G Y Sbjct: 353 --LGEIWHNAYPWLQGDQFDGVMNY 375 >UniRef50_A5UJP6 Putative cysteine protease (Transglutaminase-like superfamily) n=3 Tax=Methanobrevibacter smithii RepID=A5UJP6_METS3 Length = 1496 Score = 57.8 bits (138), Expect = 7e-07, Method: Composition-based stats. Identities = 46/277 (16%), Positives = 82/277 (29%), Gaps = 80/277 (28%) Query: 145 MLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFV 204 + +A+ G+KVH W + G + LNS S A + Sbjct: 752 WIKQANSHGIKVHIWMQVF-----YDGGWLSPLNSDGSINTA-----------------L 789 Query: 205 LDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKA 264 + I E + + + V GV FD Y + Y+ GG A Sbjct: 790 FNERIAEAKKY--------AALPGVAGVHFDYLRYPGT---------AYKHPGGTAA--- 829 Query: 265 DWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADT 324 + + + I+ + P + AY D Sbjct: 830 ------ISEFVKLATTAIRGVNPNCLISAAVM--------------PEKNDAY-VYGQDI 868 Query: 325 RRWVEQGLLDYIAPQIY-WPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSK 383 V LD I P +Y ++ + + +W+ + ++G+ Y Sbjct: 869 A--VISKYLDIIVPMVYKGNYNSGTSWISSITQWFVETSNGAAV--WVGLQTY------- 917 Query: 384 IEPDWMINGGVPELKK--QLDLNDAVPEISGTILFRE 418 + + + V EL K Q + G ++FR Sbjct: 918 VSDNDITKLPVSELSKDAQTAYD---AGAKGVMMFRW 951 >UniRef50_C2KVT1 Putative uncharacterized protein n=1 Tax=Oribacterium sinus F0268 RepID=C2KVT1_9FIRM Length = 435 Score = 57.8 bits (138), Expect = 7e-07, Method: Composition-based stats. Identities = 44/285 (15%), Positives = 92/285 (32%), Gaps = 47/285 (16%) Query: 59 IWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTAL- 117 IW + D V + I++ T+ + M D L ++ +N + +K D + Sbjct: 76 IWKKKDIQKDRVKVKGIYITDLTAGSPK----MEDILSKMKDTELNALVIDIKNDNGQIV 131 Query: 118 WP-SKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRE 176 + + T I ++ L +L + H++G+ + A +R + E Sbjct: 132 YQMNNGGQQEFYNTTNIVKD-----LPALLKKCHEQGLYLIARLVCFR------DPAMGE 180 Query: 177 LNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDD 236 ++ Q + + ++P + D+I S+ D +Q D Sbjct: 181 VHPEWMNQ-----KADGSLFKDNSGMTWINPYKKDYWDYIASVAERCADD-GFDEIQLDY 234 Query: 237 YFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPA 296 + G + Y + ++ + + +S + V F Sbjct: 235 VRFCTEKGMK---EVQYPEEAKTNKTQI------ITEFVQYMSDRL--ANKQVFFSTDVF 283 Query: 297 GVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIY 341 G D + A + Y+D +DY+ P IY Sbjct: 284 GTIIGSYVD--------STAVGQDYSDMAA-----SVDYMCPMIY 315 >UniRef50_A6CFN7 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CFN7_9PLAN Length = 565 Score = 57.8 bits (138), Expect = 9e-07, Method: Composition-based stats. Identities = 62/433 (14%), Positives = 128/433 (29%), Gaps = 77/433 (17%) Query: 18 LVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNI 77 ++ L LL + A K + Q M+ + W + Sbjct: 33 MLRLLLLPAIVIIGHTYLERSVDAAEKNATLSPSQQQSMQAA----RQKAAWKKRRIIFN 88 Query: 78 SNPTSRARVQQQAMIDKL-----DHLQRLGINTVFFQVKPDGTALWPSK----------I 122 ++ ++A L L+ ++ +F+ G + + Sbjct: 89 NDGNEPVYSLKEATPQALLDVRTSPLKGSQVDAIFYCTWSSGFSYFTHDTKVGNVFTETA 148 Query: 123 LPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLS 182 S+ T ++ +N G+DPL M D + +++ F +R +N L Sbjct: 149 NKLSNNKTAELIKN-GHDPLTVMSDWCKENDVEL---FWSFR--MNDTHDASSAWYGPLL 202 Query: 183 QQPASVYVQHRDWIR-------TSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFD 235 P + +H +W+ +G +D E+ D V EV Y VDGV+ D Sbjct: 203 FPP--LKKEHPEWLVGSAKEKPKNGRWTAVDFTHEEICDLAYRYVEEVCRNYDVDGVELD 260 Query: 236 DYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSP 295 + + ++ + G + L+ ++ + Sbjct: 261 FFRHLNY-----FKRVSWGEPAGDLEL------SRLNDLMRRIRTMADEVGQ-------- 301 Query: 296 AGVWRNRSHDPLGSDTRGAAAYDESYA-DTRRWVEQGLLDYIAPQIYWPFSRSAARYDVL 354 + + + Y D W+++ L+D + Y+ + Sbjct: 302 ----QRGRPILIAIRVPDSVEYARVLGLDVETWLKEDLVDIMTVTGYFRLNP-------- 349 Query: 355 AKWWADVVK---PTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEIS 411 W + V+ + +Y G + E + + E + +N I Sbjct: 350 ---WKESVELGHKYQVPVYAG-----LSESRQKDQRARKVYASTEGFRGRAMNAWSQGID 401 Query: 412 GTILFREDYLNKP 424 G LF P Sbjct: 402 GIYLFNSFNPRHP 414 >UniRef50_D1AEP8 Putative uncharacterized protein n=1 Tax=Thermomonospora curvata DSM 43183 RepID=D1AEP8_THECD Length = 540 Score = 57.1 bits (136), Expect = 1e-06, Method: Composition-based stats. Identities = 59/342 (17%), Positives = 112/342 (32%), Gaps = 65/342 (19%) Query: 81 TSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTAL-WPSKILPWSDLMTGKIGENPGY 139 T+ A + L ++ IN V +K + + + S++ P + + G Y Sbjct: 223 TALAWASKPLRERILKMIREKRINAVQLDIKDEDGIIGYDSQV-PLAREVKATRGI---Y 278 Query: 140 DPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTS 199 D + LD+ H ++V +R K L + P + Sbjct: 279 DA-RQALDQLHAMNVRVIGRIVAFRDPQLGKASWRAGKRDRLVRTPDGGA-----YGSQY 332 Query: 200 GDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGA 259 G + + PEV+ + + E +R D + +D + P + ++ G Sbjct: 333 GAQSFTNLAHPEVRKYNIDLAVE-AARLGFDEILYDYVRRPDGPLKSM-------RFPGL 384 Query: 260 FASKADWRRNNTQQLIAKVSHTIKSIKPGVEF-GVSPAGVWRNRSHDPLGSDTRGAAAYD 318 S D + IA+ ++ P F G S G+ R H Sbjct: 385 RGSVED----SVASFIAETRRLVR---PHGAFLGASVYGIAATRPH-------------- 423 Query: 319 ESYADTRRWVEQGLLDYIAPQIY--------WPFSRSAA-RYDVLAKWWA---DVVKPTR 366 E D + + +DY+AP +Y + A Y + V++ T Sbjct: 424 EIGQDIAKIGKH--VDYVAPMLYPSHWGAGEFGLKNPNAEPYKTVYASMLTFHKVLRGTS 481 Query: 367 TRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVP 408 ++ + + +G P GV E+K Q++ Sbjct: 482 AQIVPWLQDFSLGHPY----------GVAEVKAQIEAAAKTG 513 >UniRef50_C0A376 Putative uncharacterized protein n=1 Tax=Opitutaceae bacterium TAV2 RepID=C0A376_9BACT Length = 736 Score = 57.1 bits (136), Expect = 1e-06, Method: Composition-based stats. Identities = 45/258 (17%), Positives = 86/258 (33%), Gaps = 47/258 (18%) Query: 65 SRLDWPPVSSVNISNPTSRARVQ--QQAMIDKLDHLQRLGINTVFF---QVKPDGTALWP 119 + + W + P R +A+ L ++ LG +T++ Q P + Sbjct: 268 TFIGWRMIVGETTGTPPQRLEPYPTTEALAADLPRIKALGFDTIYLMPRQPFPG----YT 323 Query: 120 SKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKV-----------HAWFNPYRVSVN 168 + L L G G + + H GM V H + ++ Sbjct: 324 TASLTDPALQYGDGEGTGGR--FRSLTKSIHALGMSVIVDVVLHGGMDHGTLDFQDGIID 381 Query: 169 TKPGTIRELNSTL--------SQQPASVYVQHRDWIRTS-------GDRFVLDPGIPEVQ 213 + PG + N + + ++ H +W G D P Q Sbjct: 382 SLPGNWGKNNHLAHSEIWRSEAPKQHPLWEAHPEWFCQFEDGRAQIGYTRAFDLRHPGFQ 441 Query: 214 DWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQ 273 ++ +A +VS Y +DG +FD ++T D +A W+ Q Sbjct: 442 EYFVQSLASLVSEYGIDGFRFDAPWWTIFAYRWKEDA----------GYRASWQAGAAAQ 491 Query: 274 LIAKVSHTIKSIKPGVEF 291 L++++ ++ +KP F Sbjct: 492 LVSRLHLAVQRVKPDTIF 509 >UniRef50_UPI000178A7B6 hypothetical protein GYMC10_3553 n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI000178A7B6 Length = 699 Score = 56.7 bits (135), Expect = 2e-06, Method: Composition-based stats. Identities = 38/244 (15%), Positives = 74/244 (30%), Gaps = 38/244 (15%) Query: 97 HLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKV 156 L+ +G N + F A + +++ T G D L ++ H ++ Sbjct: 38 QLKEMGANAMVFNTGGI-YAWYDTQV----PYHTVNGYLPEGRDLLGELITACHLEELRF 92 Query: 157 HAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQ---HRDWIRTSGDRFV-------LD 206 A + + +S ++P + + I ++ Sbjct: 93 IA-----------RFDFSKADDSVYLRRPEWFVRKEGGRPEIIGAERPGPWPLLMSTCIN 141 Query: 207 PGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFAS---- 262 G D ++ E +SRYP+DG+ F++ Y R +YG Sbjct: 142 GGYRNA-DVAAPVLREALSRYPIDGIFFNNPGYVFCRCERCRRKYA-ERYGKDLPDSPQE 199 Query: 263 -KADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESY 321 + D+ + + IK +P V P ++ N D L + Sbjct: 200 LEPDFAAGCFDDNMKAMHDLIKQERPEV-----PMILYYNLHRDNLSKRVQITDMLCTEP 254 Query: 322 ADTR 325 D Sbjct: 255 QDVL 258 >UniRef50_UPI00006CC013 Alpha amylase, catalytic domain containing protein n=1 Tax=Tetrahymena thermophila RepID=UPI00006CC013 Length = 486 Score = 56.7 bits (135), Expect = 2e-06, Method: Composition-based stats. Identities = 30/189 (15%), Positives = 67/189 (35%), Gaps = 21/189 (11%) Query: 61 LATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPS 120 + L +++ + + + LD++Q +G N ++ PD +P+ Sbjct: 34 ITDRFALGDGSKPYCDLNKQPYYCGGNFKGIENNLDYIQGMGFNAIWISPVPDN---YPN 90 Query: 121 KILPWS-DLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNS 179 + ++ MT G D L+ ++D HKR + W V+ + + Sbjct: 91 QFHGYAAKNMTEINSHFGGADGLKSLIDACHKRDI----WVMIDVVANHMGNTDQNYSEN 146 Query: 180 TLSQQPASV----YVQHRDWIRTS---------GDRFVLDPGIPEVQDWITSIVAEVVSR 226 P + D+ + L+ V +++ + + ++V Sbjct: 147 IPFNSPDHYHSYCIISDSDFATKNMYNIQHCRLAGLADLNQENSYVSNYLVNWIKDLVQT 206 Query: 227 YPVDGVQFD 235 Y VDG++ D Sbjct: 207 YNVDGIRID 215 >UniRef50_C1SH35 Putative uncharacterized protein n=1 Tax=Denitrovibrio acetiphilus DSM 12809 RepID=C1SH35_9BACT Length = 383 Score = 55.9 bits (133), Expect = 3e-06, Method: Composition-based stats. Identities = 58/373 (15%), Positives = 107/373 (28%), Gaps = 79/373 (21%) Query: 90 AMIDKLDHLQRLGINTVFFQV-KPDGTALW--PSKILPWSDLMTGKIGENPGYDPLQFML 146 + + L+ GI++VF +V + S + D L + Sbjct: 41 NIDNFFAGLKERGIDSVFLRVFHNSIDRYHYLDTNEKCKSGVYFKTDSACVIRDVLGEAV 100 Query: 147 DEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLD 206 A K MKV AW +S KP + + D + + Sbjct: 101 SAARKYDMKVFAWMATRSLSFLKKPLYMEKEFRKGGLA----------------DGYGMS 144 Query: 207 PGIPEVQDWITSIVAEVVSRYPVDGVQF-DDYFYTESPGSRLNDNETYRKYGG------- 258 PE + + + ++ Y +DG+ F DD+ G+ + Y G Sbjct: 145 IFQPEAAERVKKLFRDLAF-YDIDGILFQDDFILRYREGASPYAVKAYEDDTGIKLSYNK 203 Query: 259 -------------------AFASKADWRRNNTQQLIAKVSHTIK----SIKPGVEFGVSP 295 F +W+ + + ++K + P + F Sbjct: 204 LFGCTGGNGITKVPGGCPDTFLPWTEWK----NSKMMEFYQSLKIESMKVNPDLVFA--- 256 Query: 296 AGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWP---------FSR 346 N ++ +G + Y +E G DY+A Y Sbjct: 257 ----GNVYYETPLEKRKGMSWYS---QSIGSMLEFGF-DYLAVMGYHDQIAGELNLVRDD 308 Query: 347 SAARYDVLAKWWADVVKPTRTRLY----IGIAFYKVGEPSKIEPDWMINGGVPELKKQLD 402 + +A D V + L I + K I + PE+ + L Sbjct: 309 ALNLVGQMADNLKDEVDASSRILMKVQRISFSNSKKLSDDNISSLCSMLSEHPEISRILL 368 Query: 403 LNDAVPEISGTIL 415 + V +++GT Sbjct: 369 PVNKVEDLAGTCF 381 >UniRef50_B7J5F9 GTP-binding protein n=2 Tax=Acidithiobacillus ferrooxidans RepID=B7J5F9_ACIF2 Length = 455 Score = 55.9 bits (133), Expect = 3e-06, Method: Composition-based stats. Identities = 47/339 (13%), Positives = 101/339 (29%), Gaps = 52/339 (15%) Query: 94 KLDHLQRLGINTVFFQVKPDGTAL-WPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKR 152 LD + + +N V VK D + + + + P + + + + ++ ++D+ H+ Sbjct: 119 ALDIIGKTDLNAVVIDVKSDRGMIAYKTDV-PLATEIGAQKMITIKH--IKRLMDDLHQE 175 Query: 153 GMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRF-VLDPGIPE 211 G+ A + + N +P I + DP + Sbjct: 176 GIYTIARIVVF------------KDNVLALARPDLAVRTAGGAIWKDREGLAWTDPFSKQ 223 Query: 212 VQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNT 271 V D+ + D +QFD + ++ G + + T R + Sbjct: 224 VWDYNIDVAVAAAKD-GFDEIQFDYVRFPDAKGLVFSRSTTEES-----------RVSAI 271 Query: 272 QQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQG 331 +A+ + I V G +D + +Q Sbjct: 272 SGFLAEARKRL--IPYNVFLSADIFGYVIWNRND------------TGIGQNLEEMAQQ- 316 Query: 332 LLDYIAPQIY---WPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDW 388 +DYI+P +Y + + VL + + G+ + + D+ Sbjct: 317 -VDYISPMLYPSGFQYGIPGYPNPVLHPHQIVYLSLRKAEERTGLPPVRFRPWLQAFRDY 375 Query: 389 MINGGV---PELKKQLDLNDAVPEISGTILFREDYLNKP 424 G E+ Q+D G +L+ + Sbjct: 376 AFGGKPFGGEEIAAQIDAAQTFGS-DGWMLWNPRNVYTT 413 >UniRef50_C6VY08 Putative uncharacterized protein n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6VY08_DYAFD Length = 557 Score = 55.9 bits (133), Expect = 3e-06, Method: Composition-based stats. Identities = 33/201 (16%), Positives = 70/201 (34%), Gaps = 44/201 (21%) Query: 116 ALWPSKILPWSDLMTGKIGENPGY---DPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPG 172 A +P+K+ NP + L ++ + H++ +KV F+ RV Sbjct: 88 AFYPTKL--------DFHYRNPYLKDNNVLADIVRKCHEKSIKVIVRFDFSRVH------ 133 Query: 173 TIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGV 232 S P Y+ + + D +V+ P VQ+ I+ EV++ +P+DG+ Sbjct: 134 -----ESIFKAHPDWCYISPKGERIINTDMYVVSINAPYVQEKAFRIIEEVINTFPIDGI 188 Query: 233 QFDDYFYTESPGSRL---------NDNETYRKYGGA-------------FASKADWRRNN 270 + Y + D + + +Y G F ++++ Sbjct: 189 FLNMPGYQVNNPYEGKYHGIDQNEYDRKRFAEYSGGKALPVEENKADPLFQKYLEFKKAT 248 Query: 271 TQQLIAKVSHTIKSIKPGVEF 291 + ++ +KS + Sbjct: 249 VEDWSERLHKLVKSKNEQIAI 269 >UniRef50_C7HUF6 Sugar fermentation stimulation protein n=4 Tax=Anaerococcus RepID=C7HUF6_9FIRM Length = 395 Score = 55.9 bits (133), Expect = 3e-06, Method: Composition-based stats. Identities = 34/204 (16%), Positives = 77/204 (37%), Gaps = 27/204 (13%) Query: 89 QAMIDKLDHLQRLGINTVFFQV---KPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFM 145 ++ LD + +NTV + + T + S T K+ + + Sbjct: 77 KSFKKNLDLIDSTKLNTVVIDIKDDWGNITCDFKSD-NKDIKYATDKVIDA------EDF 129 Query: 146 LDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSG-DRFV 204 +++ HK+G+ V ++ S+ T+ + P +V + T+G + Sbjct: 130 INKMHKKGIYVIGRITTFKDSIITE------------KHPDWGFVLEDGSLWTNGLNEAF 177 Query: 205 LDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASK- 263 ++P + +V+++ +AE+ + D +QFD + E N+ Y K K Sbjct: 178 MNPYLDDVRNYNLK-IAELAANVGFDEIQFDYVRFAE-GFETFNEKLDYSKGKWEKIQKK 235 Query: 264 -ADWRRNNTQQLIAKVSHTIKSIK 286 D R + + + +++ Sbjct: 236 DEDKRIDAITSFVKEAREMLQAYD 259 >UniRef50_D2R498 Putative uncharacterized protein n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R498_9PLAN Length = 1350 Score = 55.5 bits (132), Expect = 4e-06, Method: Composition-based stats. Identities = 40/247 (16%), Positives = 81/247 (32%), Gaps = 42/247 (17%) Query: 85 RVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKI-LPWSDLMTG---KIGENPGY- 139 + Q +D+L+ G + +G+ ++PSK+ P TG G++ Sbjct: 503 QTFYQGATRLVDYLRAHGYQGAVIPILSEGSTIYPSKLLEPTPKYDTGVFFASGQDVVRK 562 Query: 140 DPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTS 199 D L+ +L + G+ + + S TIR + + WI Sbjct: 563 DVLELLLRLFDRAGLTLIPALD--LSSPLPALETIRLGDPIRAMGIEPRGADELSWIERH 620 Query: 200 GDRF----VLDPGIPEVQDWITSIVAEVVSRY----PVDGVQF---DDYFYTESPGSRLN 248 G R +P VQ + +V E+ +RY + G+ D + + Sbjct: 621 GTRRGMGAYYNPLDGRVQKAMVDVVDEIATRYGSHPSLGGLSLGVRDWSYLVVADELTSC 680 Query: 249 DNETYRKY------------------------GGAFASKADWRRNNTQQLIAKVSHTIKS 284 D T ++ G + + WR + L+ ++ + + Sbjct: 681 DPATLTRFCTDTGVEIPATFATSPAARVELVTGPSRETWLTWRCDQIAILLEQMHAKLAA 740 Query: 285 IKPGVEF 291 +P + Sbjct: 741 RQPTAKL 747 >UniRef50_D2QE70 Alpha amylase catalytic region n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QE70_9SPHI Length = 958 Score = 55.1 bits (131), Expect = 4e-06, Method: Composition-based stats. Identities = 35/212 (16%), Positives = 67/212 (31%), Gaps = 35/212 (16%) Query: 85 RVQQQAMIDKLDHLQRLGINTV--FFQV-------KPDGTALWPSKILPWSDLMTGKIGE 135 Q + D L +L+RLGINT+ + P T Sbjct: 410 NRSYQTLTDSLAYLKRLGINTIELMPVTEFSGNDSWGYNPTFY---FAPDKAYGTKTA-- 464 Query: 136 NPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDW 195 L+ +D AHK+G+ V ++ ++ + A ++ Sbjct: 465 ------LRQFIDAAHKQGIAV---VLDMVLNQADYEFPYVKMYWAGDRPSADSPYFNQQA 515 Query: 196 IRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRK 255 F + P+ + ++ + + Y +DG +FD S ND + Sbjct: 516 THPYSVFFDFNHESPDTKAFVDRVCKYWLQEYKIDGFRFDLSKGFTQKNS-GNDVSAWGN 574 Query: 256 YGGAFASKADWRRNNTQQLIAKVSHTIKSIKP 287 Y + W+R + I+++ P Sbjct: 575 YDAGRIAI--WKR---------IYDQIRTVDP 595 >UniRef50_B3DUS3 Trehalose synthase n=3 Tax=Bacteria RepID=B3DUS3_METI4 Length = 1121 Score = 55.1 bits (131), Expect = 4e-06, Method: Composition-based stats. Identities = 42/180 (23%), Positives = 65/180 (36%), Gaps = 38/180 (21%) Query: 91 MIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFM---LD 147 + KLD+++ LG+N ++ +PS + + +P Y LQ L Sbjct: 54 ITQKLDYIKALGVNAIWL------LPFYPSPLKDDGYDIADYCSIHPDYGDLQDFKTFLK 107 Query: 148 EAHKRGMKV------------HAWFNPYRVSVNTK-----------PGTIRELNSTLSQQ 184 EAHKRG++V H WF RVS P +E Sbjct: 108 EAHKRGLRVITELVINHTSDQHPWFQRARVSPPGSLYRNYYVWSDTPQKYKEARIIFKDF 167 Query: 185 PASVYVQHRD-----WIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFY 239 +S + W R + L+ PEV+ I I+ + VDG++ D Y Sbjct: 168 ESSNWTWDPVAKAYFWHRFYSHQPDLNFDNPEVKKEIFKII-DFWLGMGVDGLRLDAVPY 226 >UniRef50_A6C749 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C749_9PLAN Length = 621 Score = 55.1 bits (131), Expect = 4e-06, Method: Composition-based stats. Identities = 52/314 (16%), Positives = 98/314 (31%), Gaps = 53/314 (16%) Query: 88 QQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENP---------- 137 + + + T +FQV PS++ +LM G + P Sbjct: 241 RADLAATFSAYRDSDFKTWWFQVGGADLVHHPSQV---GNLMGGHLDTFPREVDREYVES 297 Query: 138 -------GYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYV 190 G DPL+ ++EAH + ++ R + E + Y Sbjct: 298 VRHLHQQGIDPLKVAVEEAHSQDAEI---LVCLRAAGWKAAPPWEEF------FMSDFYE 348 Query: 191 QHRDW-IRTSGDRFVLDP--GIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRL 247 H +W + EVQD + + E + R DG F + Sbjct: 349 AHPEWRCIDYDGTPTMHLSYAAEEVQDHLIEVYREALQR-GADGAGFLFHRGMPMILWEE 407 Query: 248 NDNETY-RKYGGAFASKAD-------WRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVW 299 + + +++G A+ R + + K+ + V+ G + Sbjct: 408 PFCQRFIKEFGENPRDLAEDDPRVFQLRATIVTEFVRKIRAMLDETA--VQRGNAH---- 461 Query: 300 RNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDY--IAPQIYWPFSRSAARYDVLAKW 357 R TR D RW+E+ L+D IA Y+ + D + Sbjct: 462 RRLKLAVSTFSTRADNQKFGL--DVERWIEEKLIDQIGIAWFAYYTSGLKSKSGDT--AY 517 Query: 358 WADVVKPTRTRLYI 371 +A + + T +++ Sbjct: 518 YARITEGTDVKIFP 531 >UniRef50_UPI000197B402 hypothetical protein BACCOPRO_02222 n=1 Tax=Bacteroides coprophilus DSM 18228 RepID=UPI000197B402 Length = 363 Score = 55.1 bits (131), Expect = 5e-06, Method: Composition-based stats. Identities = 41/234 (17%), Positives = 78/234 (33%), Gaps = 44/234 (18%) Query: 73 SSVNISNPTSRARVQQQAMIDKLDHLQRLGINTV-FFQVKPDGTALWPSKILPWSDLMTG 131 S + + Q + + + G+NT+ F V +G A + S I+P Sbjct: 42 SGKGYAGLSLPEDYSQADPKEHFEWYKAHGVNTIQSFCVSHNGYAWYDSDIVPK------ 95 Query: 132 KIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQ 191 G + L+ ++D H+ GMKV +F+P + Sbjct: 96 VPGLRSNF--LKDLVDMGHREGMKVMGYFSP--------------------GTNVRWMKE 133 Query: 192 HRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTE---------S 242 H D + + F + E ++ +++ E VS+ +DG D F Sbjct: 134 HPDEVYDNQTMFHI-VYTSEYLTYLGNVIYEAVSKTGIDGFMIDALFTAPRDSAEAMKWM 192 Query: 243 PGSRLNDNETYRKYGGAFASKAD-----WRRNNTQQLIAKVSHTIKSIKPGVEF 291 P R E + + D ++R +T++ + + K P Sbjct: 193 PCERQIYEELFGEPFPGKEHITDEQVLEYKRRSTERCWDTIYQSAKRANPDCIV 246 >UniRef50_Q9P8N4 Alpha-galactosidase (Fragment) n=2 Tax=Lichtheimia corymbifera RepID=Q9P8N4_9FUNG Length = 729 Score = 55.1 bits (131), Expect = 5e-06, Method: Composition-based stats. Identities = 29/132 (21%), Positives = 51/132 (38%), Gaps = 28/132 (21%) Query: 127 DLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPA 186 D K G PL + H GM+ WF P V+ N+ Sbjct: 371 DWYPNKEKFPNGLKPLADHV---HDLGMQFGVWFEPESVNPNS----------------- 410 Query: 187 SVYVQHRDWIRTSG--------DRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYF 238 ++Y +H DW+ ++ +L+ G+PEVQD+I V+ ++ +D +++D Sbjct: 411 NLYREHPDWVLYYDGVPRYEARNQLLLNLGLPEVQDYIYDRVSSIIEENDIDYIKWDMNR 470 Query: 239 YTESPGSRLNDN 250 + D Sbjct: 471 PYQGVTMHHYDR 482 >UniRef50_A6L961 Glycoside hydrolase family 36, candidate alpha-glycosidase n=6 Tax=Bacteroidales RepID=A6L961_PARD8 Length = 735 Score = 55.1 bits (131), Expect = 5e-06, Method: Composition-based stats. Identities = 42/191 (21%), Positives = 73/191 (38%), Gaps = 32/191 (16%) Query: 145 MLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIR--TSGDR 202 ++ +A+K G+K W P + ++ + P V I G + Sbjct: 381 LIADANKHGIKFGIWIEPEMANTTSE---------LYEKHPEWVLKAPNREIVLGRGGTQ 431 Query: 203 FVLDPGIPEVQDWITSIVAEVVSRYP-VDGVQFDDYFYTESPGSRL--NDNET--YRKYG 257 VLD PEVQD+I IV +++ YP +D +++D + GS+ +D ++ Y +Y Sbjct: 432 VVLDLSNPEVQDFIFGIVDNLMTTYPEIDYIKWDANMSILNHGSQYLPSDQQSHMYIEYH 491 Query: 258 GAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWR-NRSHDPLGSDTRGAAA 316 G F KV I++ P + +G R N P + + Sbjct: 492 GGF---------------KKVCERIRAKYPDLTLQACASGGGRANYGVMPYFDEFWVSDN 536 Query: 317 YDESYADTRRW 327 D +W Sbjct: 537 TDALQRIYMQW 547 >UniRef50_Q4L9B3 Similar to unknown protein n=4 Tax=Bacilli RepID=Q4L9B3_STAHJ Length = 441 Score = 54.8 bits (130), Expect = 6e-06, Method: Composition-based stats. Identities = 56/345 (16%), Positives = 110/345 (31%), Gaps = 52/345 (15%) Query: 13 RRPAILVALALLLCSCKSTPPESMVTP--PAGSKPPATTQQSSQPMRGIWLATVSRLDWP 70 + A+L ALLL +C + S + +Q +++D+P Sbjct: 5 KIFAVLTTSALLLAACSNGDNSSSSGQKGDSQKNEQTNSQSEKLKKNNDKNKNENKVDYP 64 Query: 71 PVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMT 130 I ++ + + + + ++ +NT+ VK D + L T Sbjct: 65 KDGVKGIYVTSNSTEGDK--IDELIKFIKDSKLNTMVIDVKDDEGNI-------TMKLNT 115 Query: 131 GKIGENPG-YDPL--QFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPAS 187 G + D + + +L + H + A + + + P Sbjct: 116 GNKQVDKNTLDIVDGKKLLKKLHNNNIYPIARIVTF------------KDTKLAEEHPEW 163 Query: 188 VYVQHRDWIRTSG-DRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSR 246 + + + T+G ++P + EV D+ ++ D +QFD + E Sbjct: 164 SFKESDGSVWTNGKGDSFVNPFMKEVWDYDITVAKAAAKAGFQD-IQFDYVRFPE-GFEN 221 Query: 247 LNDNETYRK--YGGAFASKADWRRNNTQQLIAKVSHTIKSIKP-GVEFGVSPAGVWRNRS 303 D+ TY K Y + S D R + + + + K +KP GV G Sbjct: 222 EADSLTYSKGDYKNSKLSSGDQRVDTITKFLEHAN---KELKPMGVNVSADVFGYSALVK 278 Query: 304 HDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIY---WPFS 345 + P + + +D I+ IY W Sbjct: 279 NAPGIGQSFPKMS--------------ENVDAISSMIYPSHWSNG 309 >UniRef50_B0XSJ0 Alpha-glucosidase/alpha-amylase, putative n=4 Tax=Trichocomaceae RepID=B0XSJ0_ASPFC Length = 655 Score = 54.8 bits (130), Expect = 7e-06, Method: Composition-based stats. Identities = 45/248 (18%), Positives = 83/248 (33%), Gaps = 49/248 (19%) Query: 89 QAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDE 148 + ++ +LD+L+ LG + ++ T ++ S + + G NP Y ++ D Sbjct: 107 RGIVQRLDYLKDLGADMLWL------TPIYESPLEDQGYDIANYKGINPIYGTMKDWEDL 160 Query: 149 A---HKRGMKV------------HAWF--------NPYRVSVNTKPGTIRELNSTLSQQP 185 A H+RGMK+ HAWF NP R + G I L Sbjct: 161 AAEIHRRGMKIMMDMVFNHTSSQHAWFLESKKSRDNPKRDWYFWRRGKIGTNGERLPPNN 220 Query: 186 ASVYVQHRDWIRTSGDRFV-----------LDPGIPEVQDWITSIVAEVVSRYPVDGVQF 234 W L+ PEV++ + ++ + + DG +F Sbjct: 221 WESLFGGPAWKYDEHTDEWYMHLFSPSQPDLNWDNPEVRNAVFDVI-DFWGQKGTDGFRF 279 Query: 235 DDY----FYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVE 290 D P + + + + Y + + N + +++ + S Sbjct: 280 DVINLVSKTPGLPDAPIVNPDKYEQPAVPLFTNGP----NIHTYMHEMNRKVLSKYTNCT 335 Query: 291 FGVSPAGV 298 G P GV Sbjct: 336 VGEMPCGV 343 >UniRef50_C6CVL0 Alpha amylase catalytic region n=1 Tax=Paenibacillus sp. JDR-2 RepID=C6CVL0_PAESJ Length = 582 Score = 54.8 bits (130), Expect = 7e-06, Method: Composition-based stats. Identities = 37/184 (20%), Positives = 74/184 (40%), Gaps = 23/184 (12%) Query: 68 DWPPVSSVNISNPTSRARV--QQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPW 125 + P +S P + Q +I+++ HL LG+N V+ T ++ S Sbjct: 148 NDPEGTSPWGEQPEGESFFGGDLQGIINRIGHLNELGVNAVYL------TPVFRSPSNHK 201 Query: 126 SDLMTGKIGENPGY---DPLQFMLDEAHKRGMKV-------HAW--FNPYRVSVNTKPGT 173 D T +P + D L+ +++ HK G++V HA F P++ + + Sbjct: 202 YDT-TDYREVDPHFGDKDLLKMLVEVCHKHGIRVVLDAVFNHASEQFPPFQDVLEKGDQS 260 Query: 174 IRELNSTLSQQPASVY--VQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDG 231 + L+ P V + + D G+ L+ P+V++++ + +DG Sbjct: 261 EFKDWFHLNGFPVEVQDGIANYDTFGFYGNMPKLNTANPDVKNYLIETAVNWMKETGIDG 320 Query: 232 VQFD 235 + D Sbjct: 321 WRLD 324 >UniRef50_B3JE63 Putative uncharacterized protein n=4 Tax=Bacteroides RepID=B3JE63_9BACE Length = 738 Score = 54.4 bits (129), Expect = 8e-06, Method: Composition-based stats. Identities = 31/165 (18%), Positives = 62/165 (37%), Gaps = 41/165 (24%) Query: 145 MLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTS----- 199 ++ +A K G+K W P ++ + +Y +H +WI + Sbjct: 384 LIRDAKKHGIKFGIWIEP-----------------EMANTTSELYEKHPEWILKAPQRDP 426 Query: 200 -----GDRFVLDPGIPEVQDWITSIVAEVVSRYP-VDGVQFDDYFYTESPGS-RLNDNET 252 G + VLD PEVQD++ +V ++++ YP + +++D + GS L ++ Sbjct: 427 VLGRGGTQVVLDLANPEVQDFVFKVVDDLMTNYPEIAYIKWDANMAIMNHGSNYLPADKQ 486 Query: 253 YRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAG 297 Y K+ I++ P + +G Sbjct: 487 SHMYIEFHKG------------FEKICQRIRAKYPDLTIQACASG 519 >UniRef50_A6C7C5 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C7C5_9PLAN Length = 1225 Score = 54.4 bits (129), Expect = 9e-06, Method: Composition-based stats. Identities = 39/230 (16%), Positives = 75/230 (32%), Gaps = 43/230 (18%) Query: 103 INTVFFQVKPDGTALWPSK-ILPWSDLMTGK---IGENPGY-DPLQFMLDEAHKRGMKVH 157 N++ V DG+A++PS+ + P +G G++ D L+ + ++ + + Sbjct: 483 FNSILMAVSADGSAIYPSEFLAPTPRYDSGVYHSSGQDVVRKDVLELLFQIFNREKLTLV 542 Query: 158 AWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRF----VLDPGIPEVQ 213 S+ + +S S V + W G +P P VQ Sbjct: 543 P--ELQFSSMLNSLEQMLHDDSRNSAGIELVNSAGQTWREAKGTSRGQAPFYNPLDPRVQ 600 Query: 214 DWITSIVAEVVSRYPVD------GVQFDDYFYTESPG-SRLNDNETYRKYGGA------- 259 I I E+V RY VQ Y + PG D++T +++ Sbjct: 601 AEIVKIFRELVVRYKRHPSFQGVAVQLSLNGYLQLPGLDWGYDDQTVQQFQKETGIKIPF 660 Query: 260 ------------------FASKADWRRNNTQQLIAKVSHTIKSIKPGVEF 291 WR ++L ++ +++ P + Sbjct: 661 SSEANRYQDRYRFLTTTALPQWTQWRCQKIRELHQLLADVLQAESPDAQI 710 >UniRef50_D0AJB4 Predicted protein n=3 Tax=cellular organisms RepID=D0AJB4_ENTFC Length = 730 Score = 54.4 bits (129), Expect = 9e-06, Method: Composition-based stats. Identities = 30/166 (18%), Positives = 68/166 (40%), Gaps = 17/166 (10%) Query: 89 QAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDE 148 + + D +D LG TV + + + L ++ + + ++ ++D Sbjct: 346 KLITDLIDAANDLGFETVVLD-----DGWYGKRNSSKTSLGDWQVDTDKFPNGIESLVDY 400 Query: 149 AHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPG 208 A ++ + WF P +S N++ + + P V + S ++VLD Sbjct: 401 AKQKNIGFGIWFEPEMISPNSE---------LIQKHPDWVMRSKKYEPLLSRSQYVLDLT 451 Query: 209 IPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYR 254 EVQ +I ++E +++ V+ +++D + P + + TY Sbjct: 452 KQEVQQFIIKTLSETITKLRVNYIKWDMNRHISDP---FSQDSTYE 494 >UniRef50_A8RX71 Putative uncharacterized protein n=4 Tax=Clostridiales RepID=A8RX71_9CLOT Length = 465 Score = 54.4 bits (129), Expect = 1e-05, Method: Composition-based stats. Identities = 70/416 (16%), Positives = 117/416 (28%), Gaps = 68/416 (16%) Query: 12 IRRPAILVALALLLCSCKSTPPESMVT-----PPAGSKPPATTQQSSQPMRGIWLATVSR 66 ++R L L + C P +T A Q +Q ++ Sbjct: 1 MKRWIAAGILILAMTGCSRYEPAKEMTRAQEESVQSEASGAQNGQETQEAEAADSQVITI 60 Query: 67 LDWPPVSSVNISNPTSRARVQQQA--MIDKLDHLQRLGINTVFFQVKPDGTALWPSKILP 124 +PP + V + A V M ++ + R +N V VK D + + P Sbjct: 61 STYPPRNPVKVKGIYVSAYVAGTGDMMDKIIEEIDRTELNAVVIDVKDDQGRITYAMDSP 120 Query: 125 WSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQ 184 + + D M + G+ A +R Q+ Sbjct: 121 TVNEIGACQVFIQ--DMPALMAKL-KEHGIYTIARVVAFR------------DPYLAEQK 165 Query: 185 PASVYVQHRDWIRTSGDRF-VLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESP 243 P I ++P EV D++ V + D +QFD + Sbjct: 166 PEWSLHVADGKIYRDNKGLAWVNPYKKEVWDYLIE-VGKKAGEAGFDEIQFDYIRFAVDK 224 Query: 244 GSR---LNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWR 300 +D +T G ++A + I + K G+ G Sbjct: 225 TMNDVVFDDADT----QGRDKTQA------ITEFIGYAHDEL--AKEGLFVSADVFGTIM 272 Query: 301 NRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIY---WPFSR------SAARY 351 D A + Y D EQ LDYI P IY + Y Sbjct: 273 RSEEDAA--------AVGQEYED---MAEQ--LDYICPMIYPSHYGPGNFGIEYPDTQPY 319 Query: 352 DVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAV 407 D + + + +R L A K P + W+ + L+ + D Sbjct: 320 DTIL----NALNGSRELLA---ASAKEDAPQAVVRPWLQDFTASYLEHYIKYGDEQ 368 >UniRef50_Q02D31 Putative uncharacterized protein n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q02D31_SOLUE Length = 662 Score = 54.0 bits (128), Expect = 1e-05, Method: Composition-based stats. Identities = 23/177 (12%), Positives = 64/177 (36%), Gaps = 17/177 (9%) Query: 77 ISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGEN 136 +++ + + D R ++ +F + + ++ W + + Sbjct: 177 LADLGLDPPFRSNRLWAFFDSAYRSRVDVEYFAARWHKAGIAALQVAAWHNFEP-----D 231 Query: 137 PGYD-PLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDW 195 G D L+ +++ H++G+ V+AWF + + ++ A DW Sbjct: 232 AGRDEYLRKLIEACHRQGILVYAWFE-----LPHVSEKFWADHPEWREKTAVQQDAQLDW 286 Query: 196 IRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNET 252 R +++ + +++ +++ R+ DGV + ++ G+ T Sbjct: 287 ------RKLMNLSNRDCFRAVSAGARDLLGRFDWDGVNLAELYFESLEGTGNPSRFT 337 >UniRef50_Q6VUG7 Dextranase 1 n=3 Tax=Paenibacillus RepID=Q6VUG7_9BACL Length = 599 Score = 54.0 bits (128), Expect = 1e-05, Method: Composition-based stats. Identities = 44/239 (18%), Positives = 78/239 (32%), Gaps = 15/239 (6%) Query: 61 LATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPS 120 +A DW ++ +D L R IN + F L P Sbjct: 131 IAVDVSSDWGKFPRYGYLADFMTMEQAKRE--AVIDRLNRFHINGIQFYDWQWKHHL-PL 187 Query: 121 KIL---PWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIREL 177 KI P + ++ ++ +D AH+R M + Y + + +++ Sbjct: 188 KIESGKPAATW-PDIANREVSFETVKSYIDLAHERNMTAMNYNLLYGAYEDAEQDGVKKE 246 Query: 178 NSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFD-- 235 HR + D ++DP PE Q ++ EV P DG D Sbjct: 247 WGLYKDPLHEHPDLHRLPDSWASDIVLMDPANPEWQRYLIEKEKEVFRHLPFDGWHVDQL 306 Query: 236 ---DYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEF 291 Y +T G +N TY ++ A D + + + + + + V+F Sbjct: 307 GERGYLWTYD-GKEVNLPATYGQFLQAAKKALD--VDYVMNAVGQYGQGVIAAQAPVKF 362 >UniRef50_Q3JIJ6 Conserved domain protein n=67 Tax=Betaproteobacteria RepID=Q3JIJ6_BURP1 Length = 619 Score = 54.0 bits (128), Expect = 1e-05, Method: Composition-based stats. Identities = 48/344 (13%), Positives = 95/344 (27%), Gaps = 54/344 (15%) Query: 93 DKLDHLQRLGINTVFFQVKP-DGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHK 151 +D IN + +K G +PS S D ++ + H+ Sbjct: 317 AAVDLKGATAINALVVDMKGDRGITPYPSAARRASGAAARTPNAPVVRD-FAALVADLHR 375 Query: 152 RGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRF-VLDPGIP 210 RG+ + A + + + + P I ++ +DP + Sbjct: 376 RGLYLIARIVVF------------KDDPLAAAHPEWTVRDADGNIWRDREKLRWIDPSLR 423 Query: 211 EVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNN 270 E + E ++ D +QFD + ++ R + T R Sbjct: 424 ETWAHNLDVAEE-AAKLGFDEIQFDYVRFPDARELRFSVPNTRAN-----------RTAA 471 Query: 271 TQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQ 330 + + V G D Sbjct: 472 IAGFLRAARERLAPY--NVFVAADIFGYVCWNEDDTAIGQQIETLG-------------- 515 Query: 331 GLLDYIAPQIY-----WPF---SRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPS 382 G LDYI+P +Y W ++ A + + +V+ G+ F + Sbjct: 516 GPLDYISPMLYPSGFTWGLPGCTQPTADPGQIVRR--SLVEARSRTGLPGVRFRPWLQAF 573 Query: 383 KIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQT 426 + + E++ Q+D +A G +L+ P+ Sbjct: 574 RDYAFDRRDFAAAEIRAQVDAAEAADT-DGWMLWNARNRYDPRQ 616 >UniRef50_C7M6E9 Glycoside hydrolase family 31 n=10 Tax=Bacteroidetes RepID=C7M6E9_CAPOD Length = 527 Score = 54.0 bits (128), Expect = 1e-05, Method: Composition-based stats. Identities = 24/123 (19%), Positives = 50/123 (40%), Gaps = 5/123 (4%) Query: 143 QFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDR 202 + M+++ H+ G KV W P+ + + +R+ + ++ A DW +G Sbjct: 193 KEMINQLHQMGFKVMVWVCPFVSPDSQEYRFLRDKGYLVKKKGAD-TPAILDW--WNGLS 249 Query: 203 FVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFD--DYFYTESPGSRLNDNETYRKYGGAF 260 D PE + +++ ++ Y +DG +FD D + + D ++Y Sbjct: 250 ACYDLSNPEAFAYFVNMLKDLQKEYGIDGFKFDAGDPERYLAEDVDVFDQKSYDTEQTYL 309 Query: 261 ASK 263 K Sbjct: 310 WGK 312 >UniRef50_Q5LE31 Putative uncharacterized protein n=14 Tax=Bacteroides RepID=Q5LE31_BACFN Length = 688 Score = 53.6 bits (127), Expect = 1e-05, Method: Composition-based stats. Identities = 41/279 (14%), Positives = 78/279 (27%), Gaps = 83/279 (29%) Query: 138 GYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHR---- 193 GYD + + + G++ H W L+ P V + Sbjct: 400 GYD--ENLYRMCKEAGLEAHFWKWTM------------NRAELLNVHPDWFAVNRKGEST 445 Query: 194 -DWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTE----------S 242 D L P V ++ ++ VDGV D + + Sbjct: 446 HDKPAYVNYYRFLCPNHEGVAQYLADDYVKIAHLPYVDGVHLDYVRFPDVVLPVSLWKNY 505 Query: 243 PGSRLNDNETYR----------------------KYGGAFASKADWRRNNTQQLIAKVSH 280 + +++ Y KY S ++R + +++ +++ Sbjct: 506 GIEQTSEHPEYDYCYCDVCRTKFKEQTGRDPLELKYPMEDQSWINFRLDAISRVVDQITK 565 Query: 281 TIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQI 340 +K+ G + + P S + D W LD P I Sbjct: 566 AVKA-----------DGKAISAAVFPGPSMAKKM-----VRQDWGNWS----LDAYFPMI 605 Query: 341 YWPFSRSAARYDVLAKWWADVVKPT------RTRLYIGI 373 Y F Y +W V+ + R ++Y G+ Sbjct: 606 YNGF------YYEGPEWIGRSVQESVKTVDGRAKVYAGL 638 >UniRef50_D2QQI8 Trehalose synthase n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QQI8_9SPHI Length = 1114 Score = 53.6 bits (127), Expect = 1e-05, Method: Composition-based stats. Identities = 51/261 (19%), Positives = 90/261 (34%), Gaps = 56/261 (21%) Query: 89 QAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGY---DPLQFM 145 Q +++KLD+LQ LG+ ++ +PS + + NP Y + + + Sbjct: 40 QGLLEKLDYLQELGVTAIWL------LPFYPSPLRDDGYDIADYYTINPSYGTIEQFKTL 93 Query: 146 LDEAHKRGMKV------------HAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHR 193 L EAH+R +KV H WF R + P + + Q V + + Sbjct: 94 LREAHQRNLKVITELVINHSSDQHPWFQRARRAPKGSPEREYYVWTDDPTQFKDVRIIFQ 153 Query: 194 D-------WIRTSGDRFV---------LDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDY 237 D W + + + L+ P VQD + ++ + VDG + D Sbjct: 154 DFETSNWTWDQEAQQYYWHRFFHHQPDLNYDNPLVQDEVFKMI-DYWCELGVDGFRLDAV 212 Query: 238 FYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAG 297 Y + + G + + T + K+ + PGV F ++ A Sbjct: 213 PYL------------FEREGTNGENLPE-----THAFLKKLRKHVDDHFPGVVF-LAEAN 254 Query: 298 VWRNRSHDPLGSDTRGAAAYD 318 +W S G Y Sbjct: 255 MWPEDSASYFGDGDECHMNYH 275 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P64427 UPF0748 lipoprotein yddW n=88 Tax=Enterobacteria... 502 e-141 UniRef50_C1M4K1 Lipoprotein YddW n=4 Tax=Enterobacteriaceae RepI... 450 e-125 UniRef50_C2DR58 Lipoprotein yddW n=6 Tax=Enterobacteriaceae RepI... 397 e-109 UniRef50_C9XP71 Cell surface protein n=6 Tax=Clostridium RepID=C... 391 e-107 UniRef50_A6WZY4 Putative uncharacterized protein n=10 Tax=Brucel... 390 e-107 UniRef50_Q48C14 YngK protein n=53 Tax=Proteobacteria RepID=Q48C1... 383 e-104 UniRef50_C6J3R7 Putative uncharacterized protein n=1 Tax=Paeniba... 381 e-104 UniRef50_A7Z5C7 YngK n=5 Tax=Bacteria RepID=A7Z5C7_BACA2 376 e-102 UniRef50_UPI000178945D protein of unknown function DUF187 n=1 Ta... 373 e-102 UniRef50_D1A3Q7 Putative uncharacterized protein n=1 Tax=Thermom... 372 e-101 UniRef50_B8I4Q9 Putative uncharacterized protein n=2 Tax=Bacteri... 371 e-101 UniRef50_O35015 UPF0748 protein yngK n=11 Tax=Bacteria RepID=YNG... 371 e-101 UniRef50_A6TUC1 Putative uncharacterized protein n=5 Tax=Bacteri... 369 e-100 UniRef50_Q81DH4 FenI n=65 Tax=Bacteria RepID=Q81DH4_BACCR 369 e-100 UniRef50_C0Z8S4 Putative uncharacterized protein n=1 Tax=Breviba... 366 e-100 UniRef50_C7IM14 Putative uncharacterized protein n=1 Tax=Clostri... 366 e-100 UniRef50_C4RBZ7 FenI protein n=10 Tax=Actinomycetales RepID=C4RB... 360 6e-98 UniRef50_D2AWT1 FenI protein n=1 Tax=Streptosporangium roseum DS... 360 7e-98 UniRef50_D1S7M0 Putative uncharacterized protein n=1 Tax=Micromo... 359 1e-97 UniRef50_Q47Q17 FenI protein n=9 Tax=Bacteria RepID=Q47Q17_THEFY 359 1e-97 UniRef50_A8MM80 Putative uncharacterized protein n=1 Tax=Alkalip... 357 4e-97 UniRef50_UPI00016A6D2C fenI protein n=1 Tax=Burkholderia oklahom... 354 3e-96 UniRef50_A1V3X0 FenI protein n=36 Tax=Bacteria RepID=A1V3X0_BURMS 350 6e-95 UniRef50_C5C4P8 Putative uncharacterized protein n=4 Tax=Bacteri... 349 2e-94 UniRef50_D2AR89 FenI protein n=9 Tax=Bacteria RepID=D2AR89_STRRD 348 2e-94 UniRef50_D2QEX0 Putative uncharacterized protein n=2 Tax=Flexiba... 348 2e-94 UniRef50_A3HZ09 FenI n=1 Tax=Algoriphagus sp. PR1 RepID=A3HZ09_9... 348 3e-94 UniRef50_D1AYL2 Putative uncharacterized protein n=1 Tax=Strepto... 347 6e-94 UniRef50_C5PKN1 Possible FenI n=1 Tax=Sphingobacterium spiritivo... 342 1e-92 UniRef50_UPI00016C4E90 hypothetical protein GobsU_27726 n=1 Tax=... 336 1e-90 UniRef50_A6NVH8 Putative uncharacterized protein n=1 Tax=Bactero... 334 3e-90 UniRef50_B1HPQ3 Hypothetical lipoprotein yddW n=2 Tax=Bacillacea... 334 3e-90 UniRef50_C7PIN2 Putative uncharacterized protein n=1 Tax=Chitino... 332 1e-89 UniRef50_A5FI17 Putative uncharacterized protein n=1 Tax=Flavoba... 331 4e-89 UniRef50_A4ASW6 FenI n=1 Tax=Flavobacteriales bacterium HTCC2170... 331 4e-89 UniRef50_C6XWP7 Putative uncharacterized protein n=1 Tax=Pedobac... 329 9e-89 UniRef50_A1ZQ43 YngK protein n=1 Tax=Microscilla marina ATCC 231... 325 2e-87 UniRef50_C9L341 YngK protein n=45 Tax=Bacteroidales RepID=C9L341... 324 3e-87 UniRef50_A9NEW1 Putative uncharacterized protein n=1 Tax=Acholep... 324 4e-87 UniRef50_Q8YW40 All1776 protein n=5 Tax=Nostocaceae RepID=Q8YW40... 324 4e-87 UniRef50_UPI00016C0313 cell surface protein n=1 Tax=Epulopiscium... 323 7e-87 UniRef50_C3XYE7 Putative uncharacterized protein n=2 Tax=Branchi... 322 2e-86 UniRef50_B4VZ35 Putative uncharacterized protein n=1 Tax=Microco... 320 5e-86 UniRef50_B4D6Q1 Putative uncharacterized protein n=2 Tax=Verruco... 318 2e-85 UniRef50_UPI0001C160EA conserved hypothetical protein n=2 Tax=No... 318 3e-85 UniRef50_C1A9I5 Putative uncharacterized protein n=1 Tax=Gemmati... 317 5e-85 UniRef50_A5FAG6 Putative uncharacterized protein n=1 Tax=Flavoba... 315 2e-84 UniRef50_Q7MXU6 YngK protein n=4 Tax=Porphyromonadaceae RepID=Q7... 310 6e-83 UniRef50_D0GIS1 YngK n=16 Tax=Bacteria RepID=D0GIS1_9FUSO 307 4e-82 UniRef50_Q110S6 Putative uncharacterized protein n=5 Tax=Bacteri... 305 2e-81 UniRef50_B9XM08 Putative uncharacterized protein n=2 Tax=bacteri... 305 2e-81 UniRef50_C3J8B5 YngK protein n=2 Tax=Bacteria RepID=C3J8B5_9PORP 305 2e-81 UniRef50_D1N426 Putative uncharacterized protein n=1 Tax=Victiva... 305 3e-81 UniRef50_B0NT08 Putative uncharacterized protein n=2 Tax=Bactero... 302 1e-80 UniRef50_B7AM83 Putative uncharacterized protein n=1 Tax=Bactero... 302 2e-80 UniRef50_C3R8E6 S-layer protein n=24 Tax=Bacteroides RepID=C3R8E... 302 2e-80 UniRef50_C6XWM5 Putative uncharacterized protein n=1 Tax=Pedobac... 302 2e-80 UniRef50_B2ULM6 Putative uncharacterized protein n=1 Tax=Akkerma... 300 7e-80 UniRef50_C9LEC6 YngK protein n=1 Tax=Prevotella tannerae ATCC 51... 300 7e-80 UniRef50_A6G0M0 Putative uncharacterized protein n=1 Tax=Plesioc... 297 4e-79 UniRef50_C9PUA7 FenI protein n=2 Tax=Prevotella RepID=C9PUA7_9BACT 297 6e-79 UniRef50_Q7MWV9 YngK protein n=2 Tax=Porphyromonas gingivalis Re... 297 7e-79 UniRef50_C3QJ47 S-layer protein n=5 Tax=Bacteroides RepID=C3QJ47... 297 8e-79 UniRef50_C1A7Q3 Putative uncharacterized protein n=1 Tax=Gemmati... 295 2e-78 UniRef50_A7VTI3 Putative uncharacterized protein n=1 Tax=Clostri... 295 3e-78 UniRef50_A6L917 Putative uncharacterized protein n=5 Tax=Bactero... 293 9e-78 UniRef50_A9NEW0 Putative uncharacterized protein n=1 Tax=Acholep... 293 1e-77 UniRef50_C0YRL9 FenI family protein n=3 Tax=Bacteroidetes RepID=... 292 3e-77 UniRef50_A0M6M5 Protein containing DUF187 n=4 Tax=Bacteroidetes ... 287 4e-76 UniRef50_C1I7D2 Putative uncharacterized protein n=1 Tax=Clostri... 286 1e-75 UniRef50_B0P7J3 Putative uncharacterized protein n=1 Tax=Anaerot... 283 1e-74 UniRef50_UPI0001745532 hypothetical protein VspiD_00105 n=1 Tax=... 283 1e-74 UniRef50_C0EGV5 Putative uncharacterized protein n=1 Tax=Clostri... 282 2e-74 UniRef50_C2M9G1 YngK protein n=1 Tax=Porphyromonas uenonis 60-3 ... 281 4e-74 UniRef50_A9KK48 Putative uncharacterized protein n=1 Tax=Clostri... 277 8e-73 UniRef50_B0MQ11 Putative uncharacterized protein n=1 Tax=Eubacte... 275 2e-72 UniRef50_C9PZF4 YngK protein n=5 Tax=Prevotella RepID=C9PZF4_9BACT 267 5e-70 UniRef50_C0EWT6 Putative uncharacterized protein (Fragment) n=1 ... 267 5e-70 UniRef50_B6YR88 Putative uncharacterized protein n=1 Tax=Candida... 267 6e-70 UniRef50_A6EKL7 Putative uncharacterized protein (Fragment) n=1 ... 265 2e-69 UniRef50_B3QYB7 Putative uncharacterized protein n=1 Tax=Chloroh... 265 2e-69 UniRef50_C1Q9T9 Uncharacterized conserved protein n=3 Tax=Brachy... 262 2e-68 UniRef50_D1PA22 YngK protein n=1 Tax=Prevotella copri DSM 18205 ... 262 3e-68 UniRef50_C7H8A9 FenI protein n=2 Tax=Faecalibacterium prausnitzi... 258 2e-67 UniRef50_B9Y560 Putative uncharacterized protein n=1 Tax=Holdema... 257 7e-67 UniRef50_B0NXH7 Putative uncharacterized protein n=3 Tax=Clostri... 255 3e-66 UniRef50_C5VL52 YngK protein n=3 Tax=Prevotella RepID=C5VL52_9BACT 253 6e-66 UniRef50_A9NEM7 Hypothetical surface-anchored protein n=2 Tax=Ac... 237 6e-61 UniRef50_C4FZ05 Putative uncharacterized protein n=1 Tax=Abiotro... 237 6e-61 UniRef50_D1PRQ4 FenI protein n=1 Tax=Subdoligranulum variabile D... 237 1e-60 UniRef50_UPI0001C37647 hypothetical protein RflaF_08645 n=1 Tax=... 234 6e-60 UniRef50_B4VPG3 Putative uncharacterized protein n=1 Tax=Microco... 233 8e-60 UniRef50_Q8YXK2 All1210 protein n=4 Tax=Nostocaceae RepID=Q8YXK2... 226 9e-58 UniRef50_B4WH89 Putative uncharacterized protein n=1 Tax=Synecho... 223 8e-57 UniRef50_C2L0K0 Lipoprotein yddW n=1 Tax=Oribacterium sinus F026... 223 9e-57 UniRef50_Q8YLM8 Alr5270 protein n=12 Tax=Cyanobacteria RepID=Q8Y... 219 1e-55 UniRef50_B4VTS6 Putative uncharacterized protein n=1 Tax=Microco... 218 5e-55 UniRef50_Q8YQA0 All3933 protein n=18 Tax=Cyanobacteria RepID=Q8Y... 218 5e-55 UniRef50_Q8YV65 All2116 protein n=15 Tax=Cyanobacteria RepID=Q8Y... 216 1e-54 UniRef50_Q7NL32 Glr1294 protein n=1 Tax=Gloeobacter violaceus Re... 216 2e-54 UniRef50_B1WZU0 Putative uncharacterized protein n=2 Tax=Cyanoth... 213 1e-53 UniRef50_B4WJG2 Putative uncharacterized protein n=1 Tax=Synecho... 211 4e-53 UniRef50_Q10YX0 Putative uncharacterized protein n=2 Tax=Cyanoba... 211 4e-53 UniRef50_C1D2P2 Putative uncharacterized protein n=2 Tax=Deinoco... 211 4e-53 UniRef50_B5WA73 Putative uncharacterized protein n=2 Tax=Arthros... 211 5e-53 UniRef50_A8YDR3 Genome sequencing data, contig C294 n=9 Tax=Chro... 210 7e-53 UniRef50_A0YRE2 Putative uncharacterized protein n=1 Tax=Lyngbya... 208 2e-52 UniRef50_A8YI06 Similar to tr|Q8YPV9|Q8YPV9 n=8 Tax=Chroococcale... 207 8e-52 UniRef50_P74735 Slr0592 protein n=1 Tax=Synechocystis sp. PCC 68... 206 1e-51 UniRef50_B4AVG6 Putative uncharacterized protein n=1 Tax=Cyanoth... 205 2e-51 UniRef50_A0YS74 Putative uncharacterized protein n=2 Tax=Oscilla... 205 2e-51 UniRef50_B7JXY5 Putative uncharacterized protein n=9 Tax=Cyanoba... 205 3e-51 UniRef50_B2IV00 Putative uncharacterized protein n=4 Tax=Cyanoba... 203 7e-51 UniRef50_A6DH63 Putative uncharacterized protein n=1 Tax=Lentisp... 199 2e-49 UniRef50_C1D298 Putative uncharacterized protein n=1 Tax=Deinoco... 197 6e-49 UniRef50_Q8EPF4 Hypothetical conserved protein n=1 Tax=Oceanobac... 197 7e-49 UniRef50_Q2JQ39 Putative uncharacterized protein n=1 Tax=Synecho... 192 2e-47 UniRef50_C6PCP2 Putative uncharacterized protein n=1 Tax=Thermoa... 187 6e-46 UniRef50_B8HYQ9 Putative uncharacterized protein n=1 Tax=Cyanoth... 186 1e-45 UniRef50_B9XI64 Putative uncharacterized protein n=1 Tax=bacteri... 186 1e-45 UniRef50_B5W1E7 Putative uncharacterized protein n=2 Tax=Arthros... 185 2e-45 UniRef50_A8F3E2 Putative uncharacterized protein n=1 Tax=Thermot... 185 4e-45 UniRef50_Q7NJN0 Glr1802 protein n=1 Tax=Gloeobacter violaceus Re... 184 5e-45 UniRef50_Q1IWF6 Putative uncharacterized protein n=3 Tax=Deinoco... 182 2e-44 UniRef50_C5CIL6 Putative uncharacterized protein n=1 Tax=Kosmoto... 178 3e-43 UniRef50_C2FS67 FenI family protein n=1 Tax=Sphingobacterium spi... 176 2e-42 UniRef50_C6IEW4 Putative uncharacterized protein n=4 Tax=Bactero... 173 9e-42 UniRef50_A7LVF6 Putative uncharacterized protein n=4 Tax=Bactero... 172 2e-41 UniRef50_P74629 Sll0736 protein n=1 Tax=Synechocystis sp. PCC 68... 169 1e-40 UniRef50_Q3AJ74 Putative uncharacterized protein n=3 Tax=Chrooco... 169 2e-40 UniRef50_UPI0001AF05D8 hypothetical protein SghaA1_34850 n=1 Tax... 168 3e-40 UniRef50_A6CAJ3 Putative uncharacterized protein n=1 Tax=Plancto... 168 3e-40 UniRef50_B0VF99 Putative uncharacterized protein n=1 Tax=Candida... 168 4e-40 UniRef50_A2C8D8 DUF187 n=12 Tax=Cyanobacteria RepID=A2C8D8_PROM3 167 6e-40 UniRef50_C7GZF2 Putative lipoprotein n=1 Tax=Eubacterium saphenu... 167 9e-40 UniRef50_UPI0001789939 S-layer domain protein n=1 Tax=Geobacillu... 156 1e-36 UniRef50_P35824 S-layer-related protein n=1 Tax=Bacillus circula... 155 4e-36 UniRef50_A7LVF0 Putative uncharacterized protein n=3 Tax=Bactero... 154 4e-36 UniRef50_UPI0001C16380 Protein of unknown function DUF187 n=1 Ta... 154 4e-36 UniRef50_C6IVH6 S-layer domain-containing protein n=1 Tax=Paenib... 154 5e-36 UniRef50_B0MQ12 Putative uncharacterized protein n=1 Tax=Eubacte... 150 7e-35 UniRef50_C3R3M7 Putative uncharacterized protein n=2 Tax=Bactero... 146 1e-33 UniRef50_C2FS66 Putative uncharacterized protein n=1 Tax=Sphingo... 145 3e-33 UniRef50_B0P7J4 Putative uncharacterized protein n=1 Tax=Anaerot... 144 6e-33 UniRef50_A8F7U2 Putative uncharacterized protein n=2 Tax=Thermot... 143 1e-32 UniRef50_A9KMJ8 Putative uncharacterized protein n=1 Tax=Clostri... 140 9e-32 UniRef50_Q8AAL7 S-layer related protein, sialic acid-specific 9-... 138 4e-31 UniRef50_D1BUC2 Putative uncharacterized protein n=1 Tax=Xylanim... 136 2e-30 UniRef50_Q6ZE96 Slr7102 protein n=5 Tax=Cyanobacteria RepID=Q6ZE... 136 2e-30 UniRef50_Q2BFL2 Putative uncharacterized protein n=1 Tax=Bacillu... 135 3e-30 UniRef50_A9KIP9 Putative uncharacterized protein n=1 Tax=Clostri... 135 4e-30 UniRef50_D1PX02 Putative uncharacterized protein n=2 Tax=Prevote... 131 6e-29 UniRef50_C3R3K8 S-layer protein n=4 Tax=Bacteroides RepID=C3R3K8... 128 4e-28 UniRef50_UPI0001BC8648 hypothetical protein BacD2_02792 n=1 Tax=... 128 5e-28 UniRef50_B0PF61 Putative uncharacterized protein n=1 Tax=Anaerot... 126 2e-27 UniRef50_A9WDB3 Putative uncharacterized protein n=5 Tax=Chlorof... 125 4e-27 UniRef50_B2T9L3 Trehalose synthase n=119 Tax=Bacteria RepID=B2T9... 123 1e-26 UniRef50_Q5I942 Alpha-amylase n=1 Tax=Anaerobranca gottschalkii ... 120 1e-25 UniRef50_Q5WAP8 Maltogenic amylase n=1 Tax=Bacillus clausii KSM-... 119 2e-25 UniRef50_C5A3T6 Glycosyl hydrolase, putative n=1 Tax=Thermococcu... 118 3e-25 UniRef50_A9B0X0 Putative uncharacterized protein n=1 Tax=Herpeto... 118 4e-25 UniRef50_C7NWY1 Alpha amylase catalytic region n=1 Tax=Halomicro... 115 4e-24 UniRef50_A8F7H3 Putative uncharacterized protein n=1 Tax=Thermot... 115 4e-24 UniRef50_Q3JIJ6 Conserved domain protein n=67 Tax=Betaproteobact... 115 4e-24 UniRef50_Q8YK50 All8067 protein n=8 Tax=Cyanobacteria RepID=Q8YK... 114 5e-24 UniRef50_D2QQI8 Trehalose synthase n=1 Tax=Spirosoma linguale DS... 114 7e-24 UniRef50_Q114S3 Putative uncharacterized protein n=2 Tax=Oscilla... 113 1e-23 UniRef50_Q4L9B3 Similar to unknown protein n=4 Tax=Bacilli RepID... 113 1e-23 UniRef50_A8F7H1 Putative uncharacterized protein n=1 Tax=Thermot... 112 3e-23 UniRef50_B3DUS3 Trehalose synthase n=3 Tax=Bacteria RepID=B3DUS3... 111 4e-23 UniRef50_B8I5J6 Putative uncharacterized protein n=2 Tax=Clostri... 111 6e-23 UniRef50_D0MH73 Putative uncharacterized protein n=1 Tax=Rhodoth... 110 1e-22 UniRef50_A8RX71 Putative uncharacterized protein n=4 Tax=Clostri... 110 1e-22 UniRef50_A4BGI0 Alpha-galactosidase n=1 Tax=Reinekea blandensis ... 110 1e-22 UniRef50_A4IKZ2 Alpha-amylase family protein n=12 Tax=Bacillacea... 109 3e-22 UniRef50_D1C2R1 Trehalose synthase n=15 Tax=Bacteria RepID=D1C2R... 109 3e-22 UniRef50_Q30YU6 Alpha amylase, catalytic subdomain n=10 Tax=Bact... 108 3e-22 UniRef50_B7VK54 Neopullulanase n=26 Tax=Bacteria RepID=B7VK54_VIBSL 108 4e-22 UniRef50_P29964 Cyclomaltodextrinase n=13 Tax=Thermoanaerobacter... 108 5e-22 UniRef50_Q1J674 Neopullulanase / Cyclomaltodextrinase / Maltogen... 108 5e-22 UniRef50_Q08341 Cyclomaltodextrinase n=101 Tax=Bacteria RepID=CD... 108 5e-22 UniRef50_B7J5F9 GTP-binding protein n=2 Tax=Acidithiobacillus fe... 107 7e-22 UniRef50_A5FIA1 Hypothetical lipoprotein n=2 Tax=Flavobacteriace... 107 8e-22 UniRef50_C6XB31 Trehalose synthase n=19 Tax=Bacteria RepID=C6XB3... 107 9e-22 UniRef50_A3XP30 Trehalose synthase n=1 Tax=Leeuwenhoekiella blan... 107 1e-21 UniRef50_C4ZHD6 Neopullulanase n=1 Tax=Eubacterium rectale ATCC ... 106 2e-21 UniRef50_Q9R9H8 Intracellular maltogenic amylase n=70 Tax=Bacter... 106 2e-21 UniRef50_Q2RYS7 Tat (Twin-arginine translocation) pathway signal... 105 3e-21 UniRef50_D2L779 Trehalose synthase n=1 Tax=Desulfovibrio sp. FW1... 105 3e-21 UniRef50_C6IYE3 Alpha amylase catalytic region n=2 Tax=Bacillale... 105 3e-21 UniRef50_C7RFM8 Glycoside hydrolase clan GH-D n=34 Tax=Bacteria ... 105 4e-21 UniRef50_UPI0001973ACA hypothetical protein ClM62_04023 n=1 Tax=... 105 4e-21 UniRef50_A4VLG7 Alpha-amylase family protein n=2 Tax=Bacteria Re... 104 5e-21 UniRef50_C6IYD0 Neopullulanase n=1 Tax=Paenibacillus sp. oral ta... 104 5e-21 UniRef50_C6CVL0 Alpha amylase catalytic region n=1 Tax=Paenibaci... 104 9e-21 UniRef50_B6W970 Putative uncharacterized protein n=1 Tax=Anaeroc... 104 1e-20 UniRef50_B8HXB6 Putative uncharacterized protein n=1 Tax=Cyanoth... 103 1e-20 UniRef50_C6AUQ1 Putative uncharacterized protein n=2 Tax=Rhizobi... 103 1e-20 UniRef50_Q185X6 Cyclomaltodextrinase (Maltogenic alpha-amylase) ... 103 2e-20 UniRef50_Q8YXF7 All1256 protein n=4 Tax=Nostocaceae RepID=Q8YXF7... 102 2e-20 UniRef50_A6CFN7 Putative uncharacterized protein n=1 Tax=Plancto... 102 2e-20 UniRef50_Q11AV5 Putative uncharacterized protein n=1 Tax=Chelati... 102 2e-20 UniRef50_Q67N80 Putative uncharacterized protein n=1 Tax=Symbiob... 102 2e-20 UniRef50_B4D7E2 Putative uncharacterized protein n=1 Tax=Chthoni... 102 2e-20 UniRef50_C7HH08 GTP-binding protein n=3 Tax=Clostridium thermoce... 102 2e-20 UniRef50_Q6TXT5 AmyM n=1 Tax=uncultured bacterium RepID=Q6TXT5_9... 102 2e-20 UniRef50_B9HAL8 Predicted protein n=1 Tax=Populus trichocarpa Re... 102 3e-20 UniRef50_B6FM44 Putative uncharacterized protein n=3 Tax=Clostri... 102 3e-20 UniRef50_B8D2L1 Pullulanase n=1 Tax=Desulfurococcus kamchatkensi... 102 4e-20 UniRef50_D1AEP8 Putative uncharacterized protein n=1 Tax=Thermom... 102 4e-20 UniRef50_C1I251 Trehalose-6-phosphate hydrolase n=1 Tax=Clostrid... 101 6e-20 UniRef50_A6LHH0 Putative uncharacterized protein n=6 Tax=Bactero... 101 6e-20 UniRef50_Q5N184 Putative uncharacterized protein n=2 Tax=Synecho... 100 9e-20 UniRef50_C2KVT1 Putative uncharacterized protein n=1 Tax=Oribact... 100 9e-20 UniRef50_C5A1I5 Pullulan hydrolase type III (PulhA) n=3 Tax=Ther... 100 1e-19 UniRef50_C2CVQ9 Neopullulanase n=1 Tax=Gardnerella vaginalis ATC... 100 1e-19 UniRef50_D1JA21 Conserved hypothetical membrane protein, DUF187 ... 100 2e-19 UniRef50_B8MJB7 Maltase n=14 Tax=cellular organisms RepID=B8MJB7... 99 2e-19 UniRef50_C4ZDP5 Alpha-amylase n=4 Tax=Clostridiales RepID=C4ZDP5... 99 2e-19 UniRef50_A0KFK2 Glycosidase n=3 Tax=cellular organisms RepID=A0K... 99 2e-19 UniRef50_A4AQ95 Putative uncharacterized protein n=2 Tax=Bactero... 99 3e-19 UniRef50_C3A5Y1 Putative uncharacterized protein n=1 Tax=Bacillu... 99 3e-19 UniRef50_C3Y3M5 Putative uncharacterized protein n=3 Tax=Branchi... 99 3e-19 UniRef50_A0Z097 Putative uncharacterized protein n=2 Tax=Oscilla... 98 5e-19 UniRef50_C1XX00 Glycosidase n=1 Tax=Meiothermus silvanus DSM 994... 98 6e-19 UniRef50_A8UCA2 Oligo-1,6-glucosidase n=2 Tax=Carnobacterium sp.... 98 7e-19 UniRef50_Q0HUK8 Alpha amylase, catalytic region n=11 Tax=Bacteri... 98 8e-19 UniRef50_B8CY54 Alpha amylase n=2 Tax=Halothermothrix orenii Rep... 98 8e-19 UniRef50_A4FBJ1 Putative uncharacterized protein n=1 Tax=Sacchar... 98 9e-19 UniRef50_B0C6V7 Putative uncharacterized protein n=3 Tax=Cyanoba... 97 1e-18 UniRef50_UPI0001C42483 alpha amylase catalytic region n=1 Tax=Ba... 97 1e-18 UniRef50_A6L979 Putative uncharacterized protein n=6 Tax=Bactero... 97 1e-18 UniRef50_Q1WRI4 Oligo-1,6-glucosidase n=7 Tax=Firmicutes RepID=Q... 97 1e-18 UniRef50_D1CHN7 GTP-binding protein n=1 Tax=Thermobaculum terren... 97 2e-18 Sequences not found previously or not previously below threshold: UniRef50_A1TNR8 Trehalose synthase n=7 Tax=Betaproteobacteria Re... 107 9e-22 UniRef50_C6VVW7 Trehalose synthase n=1 Tax=Dyadobacter fermentan... 106 1e-21 UniRef50_B7K9W5 Trehalose synthase n=11 Tax=Bacteria RepID=B7K9W... 105 4e-21 UniRef50_A9B4Y8 Alpha amylase catalytic region n=1 Tax=Herpetosi... 102 3e-20 UniRef50_O06458 Trehalose synthase n=7 Tax=Thermaceae RepID=TRES... 101 7e-20 UniRef50_Q04KP3 Neopullulanase n=31 Tax=Bacteria RepID=Q04KP3_STRP2 99 2e-19 UniRef50_Q54S16 Putative uncharacterized protein n=1 Tax=Dictyos... 99 2e-19 UniRef50_Q045F6 Trehalose-6-phosphate hydrolase n=5 Tax=Lactobac... 98 5e-19 UniRef50_C7TI05 Neopullulanase (GH13) n=42 Tax=Lactobacillus Rep... 98 5e-19 UniRef50_B4CZ69 Trehalose synthase n=1 Tax=Chthoniobacter flavus... 98 8e-19 UniRef50_UPI0001C17075 Alpha amylase, catalytic region protein n... 98 9e-19 UniRef50_C1XGB4 Glycosidase n=2 Tax=Meiothermus RepID=C1XGB4_MEIRU 97 1e-18 UniRef50_B1YKQ1 Alpha amylase catalytic region n=2 Tax=Exiguobac... 97 1e-18 UniRef50_A3DDK1 Alpha amylase, catalytic region n=3 Tax=Clostrid... 97 1e-18 UniRef50_B7R259 Pullulanase type II, GH13 family n=1 Tax=Thermoc... 97 1e-18 UniRef50_Q9HHB0 Pullulanase n=1 Tax=Desulfurococcus mucosus RepI... 97 1e-18 >UniRef50_P64427 UPF0748 lipoprotein yddW n=88 Tax=Enterobacteriaceae RepID=YDDW_ECO57 Length = 439 Score = 502 bits (1293), Expect = e-141, Method: Composition-based stats. Identities = 439/439 (100%), Positives = 439/439 (100%) Query: 1 MDICSRNKKLTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIW 60 MDICSRNKKLTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIW Sbjct: 1 MDICSRNKKLTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIW 60 Query: 61 LATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPS 120 LATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPS Sbjct: 61 LATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPS 120 Query: 121 KILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNST 180 KILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNST Sbjct: 121 KILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNST 180 Query: 181 LSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYT 240 LSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYT Sbjct: 181 LSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYT 240 Query: 241 ESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWR 300 ESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWR Sbjct: 241 ESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWR 300 Query: 301 NRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWAD 360 NRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWAD Sbjct: 301 NRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWAD 360 Query: 361 VVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDY 420 VVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDY Sbjct: 361 VVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDY 420 Query: 421 LNKPQTQQAVSYLQSRWGS 439 LNKPQTQQAVSYLQSRWGS Sbjct: 421 LNKPQTQQAVSYLQSRWGS 439 >UniRef50_C1M4K1 Lipoprotein YddW n=4 Tax=Enterobacteriaceae RepID=C1M4K1_9ENTR Length = 441 Score = 450 bits (1158), Expect = e-125, Method: Composition-based stats. Identities = 329/428 (76%), Positives = 373/428 (87%), Gaps = 3/428 (0%) Query: 11 TIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWP 70 ++ A+LV LLL SC S PP TP P + QQS +P+RGIWLATVSRLDWP Sbjct: 16 NMKWFAVLVGSMLLLGSCSSQPPGPKTTPLP---PVSKPQQSKEPVRGIWLATVSRLDWP 72 Query: 71 PVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMT 130 P+SSVNIS+P R QQ+A+ DKLD+L+RLGINTVFFQVKPDGTALW SKILPWSD +T Sbjct: 73 PISSVNISSPAVRISQQQKALTDKLDNLKRLGINTVFFQVKPDGTALWKSKILPWSDTLT 132 Query: 131 GKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYV 190 G IG++PGYDPLQFMLDEAHKRGMKVHAW NPYRVSVNTKP T+ ELNSTLSQ P+SVYV Sbjct: 133 GTIGQDPGYDPLQFMLDEAHKRGMKVHAWLNPYRVSVNTKPSTVSELNSTLSQTPSSVYV 192 Query: 191 QHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDN 250 HRDWIRT+G+RFVLDPGIP+V+DWITSIVAEVV YPVDGVQFDDYFYTESPGS LND+ Sbjct: 193 LHRDWIRTAGERFVLDPGIPDVRDWITSIVAEVVENYPVDGVQFDDYFYTESPGSALNDS 252 Query: 251 ETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSD 310 +T+R+YG FASKADWRR+NTQ+LIA+VS TIK +KP VEFGVSPAGVWRNRSHDP GSD Sbjct: 253 QTFRRYGQGFASKADWRRDNTQRLIAQVSRTIKKLKPEVEFGVSPAGVWRNRSHDPAGSD 312 Query: 311 TRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLY 370 TRGAAAYDESYADTRRWV+ GLLDYIAPQ+YWPF+R AARYDVLAKWWADVVK T TRLY Sbjct: 313 TRGAAAYDESYADTRRWVQLGLLDYIAPQLYWPFARDAARYDVLAKWWADVVKSTNTRLY 372 Query: 371 IGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAV 430 IG+A YKVGEPS+ EPDW + GGVPELKKQLDLN++ P I+GTILFREDYLN+PQTQ+AV Sbjct: 373 IGVALYKVGEPSRKEPDWTVKGGVPELKKQLDLNESEPYINGTILFREDYLNQPQTQEAV 432 Query: 431 SYLQSRWG 438 +Y+++RWG Sbjct: 433 TYIRNRWG 440 >UniRef50_C2DR58 Lipoprotein yddW n=6 Tax=Enterobacteriaceae RepID=C2DR58_ECOLX Length = 349 Score = 397 bits (1020), Expect = e-109, Method: Composition-based stats. Identities = 347/349 (99%), Positives = 348/349 (99%) Query: 91 MIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAH 150 MIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAH Sbjct: 1 MIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAH 60 Query: 151 KRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIP 210 KRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIP Sbjct: 61 KRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIP 120 Query: 211 EVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNN 270 EVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNN Sbjct: 121 EVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNN 180 Query: 271 TQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQ 330 TQQLIAKVSHTIKSIKP VEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQ Sbjct: 181 TQQLIAKVSHTIKSIKPEVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQ 240 Query: 331 GLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMI 390 GLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMI Sbjct: 241 GLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMI 300 Query: 391 NGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQSRWGS 439 NGGVPELKKQLDLNDA+PEISGTILFREDYLNKPQTQQAVSYLQSRWGS Sbjct: 301 NGGVPELKKQLDLNDALPEISGTILFREDYLNKPQTQQAVSYLQSRWGS 349 >UniRef50_C9XP71 Cell surface protein n=6 Tax=Clostridium RepID=C9XP71_CLODC Length = 703 Score = 391 bits (1004), Expect = e-107, Method: Composition-based stats. Identities = 167/427 (39%), Positives = 236/427 (55%), Gaps = 41/427 (9%) Query: 13 RRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPV 72 + ++++L + L C + + MR W++TV LDWP Sbjct: 3 KISILVLSLIMTLTMCSVSSFADSSND--------------KEMRAAWISTVYNLDWPKT 48 Query: 73 SSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGK 132 + Q++ D LD L+ +GINT QV+P AL+ S I PWS+ +TG Sbjct: 49 K--------NNEAKQKKEYTDLLDKLKSVGINTAVVQVRPKSDALYKSNINPWSEYLTGT 100 Query: 133 IGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQH 192 G++PGYDPL F+++EAHKRGM+ HAWFNPYR+++ + ++ + P ++ Sbjct: 101 QGKDPGYDPLPFLIEEAHKRGMEFHAWFNPYRITMADES-----IDKLPANHP---AKKN 152 Query: 193 RDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNET 252 W+ G+++ DPG+PEV+ +I +AEVV Y +DGV FDDYFY PG ND T Sbjct: 153 PSWVVKHGNKYYYDPGLPEVRKYIVDSIAEVVQNYDIDGVHFDDYFY---PGVSFNDTAT 209 Query: 253 YRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTR 312 Y+KYG +K +WRR N L+ V +IKSIKP V FGVSPAG+WRN+S DP GSDT Sbjct: 210 YQKYGKG-QNKDNWRRENVNTLLRDVKASIKSIKPNVVFGVSPAGIWRNKSSDPTGSDTS 268 Query: 313 GAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIG 372 G +Y +YADTR W++QGL+DY+ PQ+YWP AA Y L WWA+ VK T LYIG Sbjct: 269 GNESYVGTYADTRAWIKQGLIDYVVPQLYWPIGLKAADYSKLVAWWANEVKGTNVDLYIG 328 Query: 373 IAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAVSY 432 YK G+ S + E+ +Q+ LN EI G++ F + + Sbjct: 329 QGIYKQGQSSYGGQNI-----AKEIVQQVTLNRKYSEIKGSMYFSAKDI--ANSTSIQKD 381 Query: 433 LQSRWGS 439 L+S + S Sbjct: 382 LKSLYSS 388 >UniRef50_A6WZY4 Putative uncharacterized protein n=10 Tax=Brucellaceae RepID=A6WZY4_OCHA4 Length = 442 Score = 390 bits (1001), Expect = e-107, Method: Composition-based stats. Identities = 190/384 (49%), Positives = 270/384 (70%), Gaps = 1/384 (0%) Query: 56 MRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGT 115 W+ATV LDWP SS I + R + Q++ ++ D GIN V FQV P Sbjct: 52 FHASWIATVLNLDWPSRSSSRIEDDAERIKRQKEELLRLFDEASEHGINAVIFQVSPTAD 111 Query: 116 ALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIR 175 A + S LPWS +TG +G++PG+DPL+F + EAHKRG+++HAW NPYRVS++TKP T + Sbjct: 112 AFYQSSYLPWSSYLTGTLGKDPGFDPLKFAIQEAHKRGIELHAWLNPYRVSMDTKPSTRK 171 Query: 176 ELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFD 235 EL ++ ++ P SV+ H DW+ S DR+VLDPGIP V++W+T++ AEVV +Y +DG+QFD Sbjct: 172 ELRNSSNESPVSVFKSHPDWVGVSADRYVLDPGIPAVREWVTNVTAEVVQKYDIDGIQFD 231 Query: 236 DYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSP 295 DYFY E+ S+L+D+++Y ++G F+SK +WRR NT L+ ++S IK+IKP V FG+SP Sbjct: 232 DYFYYETASSKLDDDKSYARFGTRFSSKYEWRRYNTHTLVREISDKIKAIKPNVRFGISP 291 Query: 296 AGVWRNRSHDPLGSDTR-GAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVL 354 +GVWRN + DP GS TR G YD +ADTRRWV++G++DYIAPQIYW F R Y + Sbjct: 292 SGVWRNAADDPRGSATRAGKTNYDGDFADTRRWVKEGMIDYIAPQIYWSFGRKDVSYGTI 351 Query: 355 AKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTI 414 AKWWAD V+ T+T LYIG+A Y+ G + +EP W GV E+K+QL+ N+++PE+ G+I Sbjct: 352 AKWWADTVRGTKTDLYIGLALYRAGSGTTLEPSWQAGEGVTEIKRQLEFNESLPEVKGSI 411 Query: 415 LFREDYLNKPQTQQAVSYLQSRWG 438 LFR+ +L+ P+ + +YL+ WG Sbjct: 412 LFRQGFLSDPKLKGVSNYLKKTWG 435 >UniRef50_Q48C14 YngK protein n=53 Tax=Proteobacteria RepID=Q48C14_PSE14 Length = 393 Score = 383 bits (982), Expect = e-104, Method: Composition-based stats. Identities = 208/390 (53%), Positives = 280/390 (71%), Gaps = 1/390 (0%) Query: 50 QQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQ 109 +++ ++ W+ATV+ LDWP VSSV I++ +R Q++ + LD + + +N V FQ Sbjct: 2 ATANKNLKATWVATVTNLDWPSVSSVAITDEAARVSKQKEELTGILDEIVAMKMNAVIFQ 61 Query: 110 VKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNT 169 V P A + S +LPWS +TG +G+NPG+DPL + +++AH R +++HAW NPYRVS+N Sbjct: 62 VVPCADAFYASDLLPWSKYLTGTLGKNPGFDPLAYAIEQAHARNIELHAWVNPYRVSMNA 121 Query: 170 KPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPV 229 TI ELN++ S PASV+ H +W T+ +RFVL+PGIPEVQ W++SIV E+V++Y V Sbjct: 122 SDATIEELNNSSSDSPASVFKTHPEWTGTAANRFVLNPGIPEVQTWVSSIVEEIVTKYDV 181 Query: 230 DGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGV 289 D +QFDDYFY E+ S L D+ TY+KY F +KADWRRNNT L+ I ++K V Sbjct: 182 DAIQFDDYFYNETASSLLQDDATYQKYNTNFTTKADWRRNNTYSLVDTCHKKIAAVKADV 241 Query: 290 EFGVSPAGVWRNRSHDPLGSDTR-GAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSA 348 FGVSPAGVWRN+S DPLGSDT+ GA+ YD +YADTR+WV G++DYIAPQ+YWPF+R Sbjct: 242 LFGVSPAGVWRNKSDDPLGSDTQAGASNYDFAYADTRKWVIDGIIDYIAPQVYWPFAREV 301 Query: 349 ARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVP 408 ARYDV+ +WWAD V T T LYIG+A YKVG S+ EPDW + GGVPE+ +QLDLND++ Sbjct: 302 ARYDVITQWWADTVSGTGTALYIGMALYKVGTASETEPDWTVEGGVPEITRQLDLNDSLT 361 Query: 409 EISGTILFREDYLNKPQTQQAVSYLQSRWG 438 E+SG +LFR +L QTQQ V YL+ RW Sbjct: 362 EVSGCMLFRHMFLRASQTQQVVDYLKLRWA 391 >UniRef50_C6J3R7 Putative uncharacterized protein n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6J3R7_9BACL Length = 545 Score = 381 bits (977), Expect = e-104, Method: Composition-based stats. Identities = 169/424 (39%), Positives = 238/424 (56%), Gaps = 28/424 (6%) Query: 21 LALLLCSCKSTPPESMVT--PPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNIS 78 LL +T PE V P P +S +RG+W++TVS LDWP SS Sbjct: 141 TITLLSGSGATQPEPGVGGEDPQSDVPQPPAVDTSNGLRGVWVSTVSNLDWPSKSSYG-- 198 Query: 79 NPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPG 138 + Q+ + LD +Q +G+N VF QV+P A++PS +PWS +TG G++PG Sbjct: 199 ----KVEAQKAEYVQLLDEVQAMGMNAVFVQVRPSADAIYPSSQVPWSSYLTGTAGKDPG 254 Query: 139 YDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRT 198 YDPLQF+++E H+RGM+ HAWFNP+R S + + V QH +WI Sbjct: 255 YDPLQFLIEETHRRGMEFHAWFNPFRASTGSDASKLPA---------NHVANQHPEWIVK 305 Query: 199 SGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSR--LNDNETYRKY 256 + ++PGIPE +D + S + EVV+ Y +DGV DDYFY + D+ T++ Y Sbjct: 306 FDGKLYINPGIPEARDHVISAIMEVVNGYDIDGVHLDDYFYPTGETTSKKFADDATFKSY 365 Query: 257 G-GAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGA- 314 A+K DWRR+N Q + K+ I++ KP V FG+SP GVWRN+S+D GSDT+ + Sbjct: 366 NSKKIATKGDWRRDNINQFVQKLGQRIEASKPYVSFGISPYGVWRNKSNDLTGSDTKASV 425 Query: 315 AAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIA 374 AYD +YAD R W++ +DY+APQ+YW +R RYD+LA WWA V+ T +LYIG A Sbjct: 426 TAYDSTYADVRTWIKNEWIDYVAPQLYWSMTRKEVRYDLLADWWAQEVRGTNVKLYIGHA 485 Query: 375 FYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQ 434 YK+G P + E+ QL+ N +PEISG+I F L K + LQ Sbjct: 486 PYKLGTPE------IGWSSAQEIINQLEYNRQIPEISGSIFFSAKDLRK-NPLGLIPLLQ 538 Query: 435 SRWG 438 S +G Sbjct: 539 SYYG 542 >UniRef50_A7Z5C7 YngK n=5 Tax=Bacteria RepID=A7Z5C7_BACA2 Length = 512 Score = 376 bits (964), Expect = e-102, Method: Composition-based stats. Identities = 164/432 (37%), Positives = 232/432 (53%), Gaps = 32/432 (7%) Query: 10 LTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDW 69 + R +++ LA++L + T S A+ Q + MR +W+A+V+ +DW Sbjct: 1 MKSCRFSMIWFLAVVLTAGIFTFSAS---------AQASGTQPKREMRAVWIASVTNIDW 51 Query: 70 PPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLM 129 P + Q++ LD +Q +G+N V Q+KP A +PS PWS+ + Sbjct: 52 PSKKGL-------SPEEQKREYSKLLDDVQEMGMNAVIVQIKPAADAFYPSDYGPWSEYL 104 Query: 130 TGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVY 189 TG G+NPGYDPL F+++E HKR ++ HAWFNPYR+++N + + Sbjct: 105 TGTQGKNPGYDPLAFLVEETHKRNLEFHAWFNPYRITMNHT--------NLNALSDDHPA 156 Query: 190 VQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTES-PGSRLN 248 H DW+ G + +PGIPEV+ +IT + EVVSRY +D V DDYFY G Sbjct: 157 RSHPDWVAAYGKQLYYNPGIPEVRQFITDGIKEVVSRYDIDAVHMDDYFYPYKIAGQEFP 216 Query: 249 DNETYRKYGGA-FASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPL 307 D Y +YG A FAS DWRR+N +L+ +++ TIK KP V+FG+SP GVWRN + DP Sbjct: 217 DQAEYERYGKAHFASIDDWRRDNVNRLVKEINQTIKREKPYVKFGISPFGVWRNAADDPT 276 Query: 308 GSDT-RGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTR 366 GS+T G YD+ YADTR W+++G +DYIAPQIYW AA YDVLA WW V Sbjct: 277 GSETAAGVRNYDDLYADTREWIQKGYIDYIAPQIYWSIGFKAAAYDVLADWWGKEVNNRP 336 Query: 367 TRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQT 426 LYIG A YK+ + +P W G E Q+ LN I G++ F LN+ Sbjct: 337 VHLYIGQAAYKINNNA--DPAWADPG---EYGGQITLNRGSAWIKGSLHFSLKDLNRNPL 391 Query: 427 QQAVSYLQSRWG 438 ++ + Sbjct: 392 DVKNRLIKDMYS 403 >UniRef50_UPI000178945D protein of unknown function DUF187 n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI000178945D Length = 518 Score = 373 bits (957), Expect = e-102, Method: Composition-based stats. Identities = 165/432 (38%), Positives = 234/432 (54%), Gaps = 32/432 (7%) Query: 12 IRRPAILVALALLL----CSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRL 67 I+ ++V L + + ++ GS + +RG W++TV L Sbjct: 112 IKNGRVMVPLRFISENLGVQVEWNQAAQRISLSTGSVVVPPPVSTGDEVRGAWISTVFNL 171 Query: 68 DWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSD 127 DWP + A QQ + I LD LQ +GINTV+ QV+P G AL+PS ++PWS Sbjct: 172 DWPKTKT--------SAEQQQASYIALLDSLQDVGINTVYVQVRPAGDALYPSTMVPWSK 223 Query: 128 LMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPAS 187 ++TG G +PGYDP+ FM++E H+R M+ HAWFNP+R + + T S P+ Sbjct: 224 VLTGIQGADPGYDPVAFMVEETHRRNMEFHAWFNPFRANTDIL---------TASLHPSH 274 Query: 188 VYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRL 247 V + H DWI +G + ++PGIPE + I + EVV+ Y +DG+ DDYFY + + Sbjct: 275 VALSHPDWIVNTGKQLYINPGIPEARQHIIDTIMEVVNGYDIDGIHLDDYFYPSN--TVF 332 Query: 248 NDNETYRKY-GGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDP 306 ND+ YR++ GA+A+ ADWRR N + + +I +KP VE+G+SP GVWRN+S D Sbjct: 333 NDDAAYREFNNGAYANLADWRRGNINAFVQSLGESIHRVKPDVEYGISPFGVWRNQSVDK 392 Query: 307 LGSDTR-GAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPT 365 GSDT+ G AYD YAD R W++ G +DY+APQIYW S AA YD L WWA V+ T Sbjct: 393 TGSDTKAGVTAYDSMYADVRTWIQNGWIDYVAPQIYWSMSNPAADYDKLVDWWASEVQGT 452 Query: 366 RTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQ 425 L IG A YK+G + E+ QL N E+ G+I FR + + Sbjct: 453 GVDLLIGHAPYKLGTSE------IGWQSASEIINQLKYNQNHAEVKGSIFFRAENILS-N 505 Query: 426 TQQAVSYLQSRW 437 LQS + Sbjct: 506 PLGIKDQLQSYY 517 >UniRef50_D1A3Q7 Putative uncharacterized protein n=1 Tax=Thermomonospora curvata DSM 43183 RepID=D1A3Q7_THECD Length = 532 Score = 372 bits (954), Expect = e-101, Method: Composition-based stats. Identities = 155/444 (34%), Positives = 225/444 (50%), Gaps = 36/444 (8%) Query: 5 SRNKKLTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPA----------TTQQSSQ 54 K I+ + VA + LL C S + A + A +++ Sbjct: 4 LSGSKERIKIASAAVAASGLLAGCTSAAGGEVGALRADAPIAAGMAECPDIKPPGDSAAR 63 Query: 55 PMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDG 114 +RG+W+ATVS +DWP + A ++ LD + LG+N VF QV+P Sbjct: 64 QVRGMWIATVSGIDWPS-------DTAHSAERKKADYRKLLDQARALGLNAVFVQVRPSA 116 Query: 115 TALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTI 174 A + S PWS ++G+ G +PG+D L+F + EAHKR ++ HAWFNPYRV+++ G + Sbjct: 117 DAFYDSPYEPWSQWISGEQGRDPGFDVLEFFVSEAHKRDLEFHAWFNPYRVALHNDRGKL 176 Query: 175 RELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQF 234 P + ++ W+R + DPG+P+V++ +T +V +VV +Y +D V Sbjct: 177 ---------HPDNPARKNPSWVREYDGKLWYDPGLPQVRELVTKVVLDVVGKYDIDAVHL 227 Query: 235 DDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVS 294 DDYFY G D +TYR+YG SK DWRR N L+ + I KP V FG+S Sbjct: 228 DDYFYPYPSGGDFPDEDTYRRYGRGM-SKGDWRRANVDALVKGLHEEIHRAKPQVRFGIS 286 Query: 295 PAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVL 354 P GVWRNR DP GS T +YD+ YADTR+WV+QG +DYI PQ+YW +AA Y L Sbjct: 287 PFGVWRNRRSDPAGSQTTALQSYDDVYADTRKWVKQGWVDYITPQLYWEIGNAAADYSTL 346 Query: 355 AKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTI 414 WWA+ V+ T +L IG A Y+VGE EL + L +N ++ G + Sbjct: 347 VAWWAEQVEGTGVQLTIGQASYRVGE---------RGFDAGELSRHLAVNARHRQVRGDV 397 Query: 415 LFREDYLNKPQTQQAVSYLQSRWG 438 F L + + +G Sbjct: 398 YFSAKDLVGDKGGATSRLRKDHYG 421 >UniRef50_B8I4Q9 Putative uncharacterized protein n=2 Tax=Bacteria RepID=B8I4Q9_CLOCE Length = 997 Score = 371 bits (953), Expect = e-101, Method: Composition-based stats. Identities = 150/440 (34%), Positives = 238/440 (54%), Gaps = 26/440 (5%) Query: 7 NKKLTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSR 66 NKK+ + ++ + L + K + G+ A T + +RG+W+A+VS Sbjct: 2 NKKIGVVCILLVFLMILPIAGYKLFSDRNY----DGNVSNAQTVSKIEDLRGVWIASVSN 57 Query: 67 LDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWS 126 +D+P + A Q++ + D + + Q +G+N +FFQ++P G AL+ S I PWS Sbjct: 58 IDFPSKPGI-------SAEKQKKELDDIISNAQYMGLNAIFFQIRPTGDALYKSTIFPWS 110 Query: 127 DLMTGKIG--ENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQ 184 +TGK G + G+DPL +++++AHK+G+++HAW NP R+S+ T +N Sbjct: 111 AYLTGKQGKENDNGFDPLAYIIEQAHKKGIQIHAWINPLRLSMGTTSNPTGNINVLSDNH 170 Query: 185 PASVYVQHRDWIRTS-GDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYT--- 240 P + + + + + LDPG P IT VAE+V Y VDG+ FDDYFY Sbjct: 171 P---ARKIPEAVVAAPTGQLYLDPGNPAAIKLITDGVAEIVKNYDVDGIHFDDYFYPSKS 227 Query: 241 ESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWR 300 E G ND+ +Y KY G+F +K DWRRNN L+ +T+K+IKP V+FG+SP +W Sbjct: 228 EGKGVDFNDSASYAKYKGSFKNKDDWRRNNINTLVKSTYNTVKNIKPSVQFGISPFAIWS 287 Query: 301 NRSHDPLGSDTRG-AAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWA 359 N+ + GSDT+G + Y + YAD+++WV++ +DYIAPQIYW A Y VL WW Sbjct: 288 NKDRNKEGSDTQGGISTYYDHYADSKKWVKEAYIDYIAPQIYWNIGFKVADYSVLVNWWK 347 Query: 360 DVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFRED 419 +V + T+ +LY+G A YK+ + ++ ++ KQ+ N + G+I + Sbjct: 348 NVCRGTKVKLYVGHAAYKINDTTQSN----DWLDPLQIPKQIAYNRKSNSVDGSIFYGYS 403 Query: 420 YLNKPQTQQAVSYLQSRWGS 439 L K T L+ + S Sbjct: 404 KL-KNNTLGIKDKLKGIFVS 422 >UniRef50_O35015 UPF0748 protein yngK n=11 Tax=Bacteria RepID=YNGK_BACSU Length = 510 Score = 371 bits (951), Expect = e-101, Method: Composition-based stats. Identities = 161/400 (40%), Positives = 227/400 (56%), Gaps = 24/400 (6%) Query: 43 SKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLG 102 S P QS + +R +W+A+V +DWP + + Q+Q I LD +Q++G Sbjct: 23 SVPFMANAQSDRELRAVWIASVLNIDWPSKKGL-------SVKEQKQEYIKLLDDVQKMG 75 Query: 103 INTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNP 162 +N V Q+KP A +PS PWS+ +TG G++PGYDPL FM++E HKR ++ HAWFNP Sbjct: 76 MNAVIVQIKPTADAFYPSAYGPWSEYLTGVQGKDPGYDPLAFMIEETHKRNLEFHAWFNP 135 Query: 163 YRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAE 222 YR+++N +LN P +H DW+ G++ PGIPE +D+I + E Sbjct: 136 YRITMNH-----TDLNKLSEDHP---ARKHPDWVAAYGNQLYYHPGIPEARDFIVKGIEE 187 Query: 223 VVSRYPVDGVQFDDYFYTES-PGSRLNDNETYRKYGGA-FASKADWRRNNTQQLIAKVSH 280 VV Y +D V DDYFY G D Y +YG F++ DWRR+N QL+ +++ Sbjct: 188 VVKHYDIDAVHMDDYFYPYKIAGQEFPDQAQYEQYGKDAFSNIDDWRRDNVNQLVKQINQ 247 Query: 281 TIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTR-GAAAYDESYADTRRWVEQGLLDYIAPQ 339 TIK+ KP V+FG+SP GVWRN + DP GS+T+ G YD+ YADTR W+++G +DYIAPQ Sbjct: 248 TIKAAKPYVKFGISPFGVWRNAADDPTGSNTKAGVRNYDDLYADTRHWIQEGDIDYIAPQ 307 Query: 340 IYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKK 399 IYW +AA YDVLA WW++ VK LYIG A YK+ +P W E + Sbjct: 308 IYWSIGFNAAAYDVLADWWSNEVKNRPVHLYIGQAAYKINNN--FDPPWS---DPEEYVR 362 Query: 400 QLDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQSRWGS 439 Q+ LN + + G++ F LNK L + S Sbjct: 363 QITLNRQLELVKGSMHFSLKDLNK-NPLGIKDSLSTDLYS 401 >UniRef50_A6TUC1 Putative uncharacterized protein n=5 Tax=Bacteria RepID=A6TUC1_ALKMQ Length = 731 Score = 369 bits (947), Expect = e-100, Method: Composition-based stats. Identities = 173/437 (39%), Positives = 258/437 (59%), Gaps = 15/437 (3%) Query: 7 NKKLTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSR 66 +K ++I IL+ AL L S P + P T + +RG W++TV Sbjct: 8 SKLISICIVGILMITALPLHSFAIEEPWDQYNQYLPRETPVT----KRHLRGAWISTVIN 63 Query: 67 LDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWS 126 LDWP V + I N R + ++ +I LD + +N VFFQV P+G A + S I+PWS Sbjct: 64 LDWPSVETAKIKNDKERIQKSKEELIAILDKSVEMNMNAVFFQVSPEGDAFYNSNIVPWS 123 Query: 127 DLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPA 186 +TG G++PG+DPL F ++EAHKR +++HAWFNPYR+S+ TI LN Sbjct: 124 RYLTGTFGKDPGFDPLAFAIEEAHKRNLELHAWFNPYRISMYMNDSTIESLNIE-----K 178 Query: 187 SVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSR 246 SVY +H DW++++ RFV+DPGIP+ ++W+ EVV+ Y VDG+ FDDYFY E Sbjct: 179 SVYKEHPDWVKSAMSRFVIDPGIPQAREWVIKRTMEVVNDYDVDGIHFDDYFYYEKHVGE 238 Query: 247 LNDNETYRKYG-GAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRS-- 303 L D +T+ +Y G F++ +WRRNNT L+ ++S+ I+ KP ++FG+SPAGVW N+ Sbjct: 239 LEDQDTFSQYNLGQFSNLGEWRRNNTYLLVKELSNEIRKTKPWIKFGISPAGVWANKKDG 298 Query: 304 HDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVK 363 H + + G YD S+ADT++WVE+ ++DYIAPQ+Y+ F+ +A Y +A WW++VV+ Sbjct: 299 HLNGSNTSAGLPNYDRSFADTKKWVEEEIIDYIAPQVYFTFANPSAPYGEVANWWSNVVR 358 Query: 364 PTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNK 423 LYIG A YKV + + + + N V E +Q N PE+ G+I+FR N Sbjct: 359 GKNVHLYIGQALYKVNDNA--DQYFQGNHAVEEFVRQHKYNTMKPEVMGSIMFRFQNFNH 416 Query: 424 PQTQQAVSYLQS-RWGS 439 QQ V+ ++ W + Sbjct: 417 GNKQQVVNVMKEDLWST 433 >UniRef50_Q81DH4 FenI n=65 Tax=Bacteria RepID=Q81DH4_BACCR Length = 519 Score = 369 bits (947), Expect = e-100, Method: Composition-based stats. Identities = 155/425 (36%), Positives = 218/425 (51%), Gaps = 25/425 (5%) Query: 17 ILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVN 76 I+ L ++ C P S + PP + T +R +W+A+V +DWP + + Sbjct: 2 IVKRLLMICCIVILFIPFSFI-PPHFTYAEVNTTYKKHELRAVWIASVLNIDWPSKTGL- 59 Query: 77 ISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGEN 136 Q+Q I LD ++ G+N V Q+KP A +PS PWS+ +TG G++ Sbjct: 60 ------PIEKQKQEFIRLLDDVKSTGMNAVVVQIKPTADAFYPSNYGPWSEYITGTQGKD 113 Query: 137 PGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWI 196 PGYDPL FM++E HKR ++ HAW NPYR+++N ++N + P QH DW+ Sbjct: 114 PGYDPLAFMIEETHKRNIEFHAWINPYRITMNH-----TDINRLSNNHP---ARQHPDWV 165 Query: 197 RTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTES-PGSRLNDNETYRK 255 T G + +PGIPEV+ +IT E+V Y +D + DDYFY G D +TY Sbjct: 166 VTYGGKLYYNPGIPEVKKFITEGALEIVENYDIDALHMDDYFYPYKVAGEEFPDQKTYET 225 Query: 256 Y-GGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSD-TRG 313 Y G F + DWRRNN +L+ ++ IK K V+FG+SP GVWRN + DP GS+ T G Sbjct: 226 YNNGRFTNIEDWRRNNVNELVKDLNTAIKQEKSYVKFGISPFGVWRNIADDPTGSNTTAG 285 Query: 314 AAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGI 373 YD+ YADTR W+++G +DYI PQIYW + A YD+L WW LYIG Sbjct: 286 QRNYDDLYADTREWIQKGYIDYITPQIYWNIGFTPAAYDILVDWWVKETNNKPLHLYIGQ 345 Query: 374 AFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAVSYL 433 A YK+ S E KQ+ LN P+I G++ F +N L Sbjct: 346 AAYKINNNSVP-----AWSDPEEYPKQIALNRLYPDIKGSMHFSLKDIN-NNPLGVKDRL 399 Query: 434 QSRWG 438 Sbjct: 400 SENIY 404 >UniRef50_C0Z8S4 Putative uncharacterized protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0Z8S4_BREBN Length = 540 Score = 366 bits (940), Expect = e-100, Method: Composition-based stats. Identities = 163/410 (39%), Positives = 226/410 (55%), Gaps = 24/410 (5%) Query: 29 KSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQ 88 TPP + G+ P T ++ + G+W++TV LDWP S QQ Sbjct: 152 TMTPPPQDILSGNGAMEPGTPVVTNGNLHGVWISTVYNLDWPSSGSYGNP------AKQQ 205 Query: 89 QAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDE 148 Q I LD LQ +G+N F QV+P G AL+PS + PWS +TG G++PGYDPL FM+ E Sbjct: 206 QEYIQLLDELQAMGMNAAFVQVRPSGDALYPSTLTPWSRFLTGTPGKDPGYDPLAFMVQE 265 Query: 149 AHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPG 208 H+RGM+ HAWFNP+R + + K + V QH DWI + + ++PG Sbjct: 266 THRRGMQFHAWFNPFRATTDAKTDQLPA---------NHVIKQHPDWIVNANKKLYINPG 316 Query: 209 IPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRR 268 +P + I + V EVV RY +DGV DDYFY + + SKADWRR Sbjct: 317 VPAARQQIINEVMEVVQRYDIDGVHLDDYFYPSNVAFADD-AAFKAYNSKKIVSKADWRR 375 Query: 269 NNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTR-GAAAYDESYADTRRW 327 +N Q + +++ +IKS+KP V+FG+SP GVWRN + DP GSDT+ G AYD +AD R W Sbjct: 376 DNINQFVQQMNQSIKSVKPHVQFGISPFGVWRNSNVDPTGSDTKAGVTAYDHMFADVRTW 435 Query: 328 VEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPD 387 ++QG +DY+ PQIYW FS + A+YD L WWA+ V+ T +LYIG + YK+G Sbjct: 436 IQQGWIDYVTPQIYWSFSFAPAQYDKLVTWWANEVQGTNVKLYIGHSPYKLGTAEAG--- 492 Query: 388 WMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQSRW 437 E+ QL+ N VP++ G+I F L K + L S + Sbjct: 493 ---WQSAQEIINQLNFNAMVPQVQGSIFFSAKDLRK-NPLGLLPALSSYY 538 >UniRef50_C7IM14 Putative uncharacterized protein n=1 Tax=Clostridium papyrosolvens DSM 2782 RepID=C7IM14_9CLOT Length = 999 Score = 366 bits (939), Expect = e-100, Method: Composition-based stats. Identities = 141/440 (32%), Positives = 231/440 (52%), Gaps = 26/440 (5%) Query: 7 NKKLTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSR 66 NKK+ ++ L + + ++ G+ A T ++ +RG+W+A+V+ Sbjct: 2 NKKIVAVCILLVFLTILPIAGYRLFADKTY----EGNISNAQTVSKNEDLRGVWIASVAN 57 Query: 67 LDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWS 126 +D+P + A Q++ + + + + + +G+N +FFQV+P G AL+ S I PWS Sbjct: 58 IDFPSKPGI-------SADKQKKELDEIISNTKYMGLNAIFFQVRPTGDALYKSSIFPWS 110 Query: 127 DLMTGKIG--ENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQ 184 +TG+ G + G+DPL +++ +AHK G++VHAW NP R+++ T + ++ + Sbjct: 111 KYLTGQQGKENDGGFDPLAYIIKQAHKEGIQVHAWLNPLRLTMGTTAKPDKNVSVLSANH 170 Query: 185 PASVYVQHRDWIRTS-GDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESP 243 P + D + + + LDPG P IT VAE+V Y VDG+ FDDYFY Sbjct: 171 P---ARKIPDAVVAAPTGQLYLDPGNPAAIKLITDGVAEIVKNYDVDGIHFDDYFYPSKS 227 Query: 244 ---GSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWR 300 G ND+ ++ KY G F +K DWRRNN L+ T+K+IK V+FG+SP +W Sbjct: 228 ETKGVDFNDSASFAKYKGNFKNKDDWRRNNINTLVKNTYDTVKNIKNKVQFGISPFAIWS 287 Query: 301 NRSHDPLGSDTRG-AAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWA 359 N+ + GS T+G + Y + YAD+++WV + +DYIAPQIYW A Y VL WW Sbjct: 288 NKDRNIEGSSTQGGISTYYDHYADSKKWVREAYIDYIAPQIYWNMGFKIADYSVLVNWWK 347 Query: 360 DVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFRED 419 +V T+ +LY+G A YK+ + ++ ++ KQ+ N ++G+I + Sbjct: 348 NVCSGTKVKLYVGHAAYKINDTTQSN----DWLDPLQIPKQIAYNRKSNAVAGSIFYGYA 403 Query: 420 YLNKPQTQQAVSYLQSRWGS 439 L T L+ + S Sbjct: 404 KLRD-NTLGIKDKLRGIFVS 422 >UniRef50_C4RBZ7 FenI protein n=10 Tax=Actinomycetales RepID=C4RBZ7_9ACTO Length = 538 Score = 360 bits (924), Expect = 6e-98, Method: Composition-based stats. Identities = 157/390 (40%), Positives = 220/390 (56%), Gaps = 18/390 (4%) Query: 46 PATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINT 105 P + R +W+++V +DWP +S + R Q+ + LD QRL N Sbjct: 27 PTDPAAPKRQFRAMWISSVVNIDWPTKASQTAPD---RIAAQRAEYLGWLDLAQRLHHNA 83 Query: 106 VFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRV 165 V QV+P ALWPS PWS+ +TG G++PG+DPL F++DEAHKR ++ HAWFNPYR+ Sbjct: 84 VVVQVRPTADALWPSPHEPWSEYLTGVRGQDPGWDPLAFLVDEAHKRNLEFHAWFNPYRI 143 Query: 166 SVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTS------GDRFVLDPGIPEVQDWITSI 219 S+ G +L P QH +W G R +PGIP V++++ + Sbjct: 144 SMPAPGGAGADLAQLAPDHP---ARQHPEWTFAYPPAGVAGSRLYYNPGIPAVREFVQTA 200 Query: 220 VAEVVSRYPVDGVQFDDYFYTESPGSRL-NDNETYRKYGGAFASKADWRRNNTQQLIAKV 278 + + V+RY VDGV FDDYFY G+ D+ T+ ++ F +ADWRR+N LI ++ Sbjct: 201 MMDAVTRYDVDGVHFDDYFYPYPSGTYQVPDDATFAEFNRGFTDRADWRRDNINLLIREM 260 Query: 279 SHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAP 338 + IK+ KP V+FGVSP G+WRN S DPLGSDT G+ +YD ADTR+WV+Q +DYI P Sbjct: 261 NDRIKAAKPWVKFGVSPFGIWRNASVDPLGSDTTGSQSYDIISADTRKWVKQEWIDYIVP 320 Query: 339 QIYWPFSR-SAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPEL 397 Q+YW + AA Y L WWA+ V+ TR +LYIG A YK G+P+ EL Sbjct: 321 QLYWYIGQYPAADYARLVPWWAETVRGTRVQLYIGQADYKSGDPAYGS----YWQNPREL 376 Query: 398 KKQLDLNDAVPEISGTILFREDYLNKPQTQ 427 L LN + PE+ G + F + + Sbjct: 377 SDHLTLNRSYPEVLGNVHFSAVQVRANRLG 406 >UniRef50_D2AWT1 FenI protein n=1 Tax=Streptosporangium roseum DSM 43021 RepID=D2AWT1_STRRD Length = 533 Score = 360 bits (923), Expect = 7e-98, Method: Composition-based stats. Identities = 158/388 (40%), Positives = 216/388 (55%), Gaps = 24/388 (6%) Query: 47 ATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTV 106 + + +RG+W+ATV +DWP + + A QQ + LD+ + +N V Sbjct: 61 TDVRYPKRQLRGVWIATVKNIDWPSRTGL-------SAAKQQAEYVRILDNAVKRRLNAV 113 Query: 107 FFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVS 166 F QV+P AL+ S + PWS +TG G++PG+DPL F++ EAHKRG++ HAWFNPYR S Sbjct: 114 FVQVRPASDALYKSSLEPWSKFLTGTAGKDPGWDPLPFLVAEAHKRGLEFHAWFNPYRAS 173 Query: 167 VNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSR 226 + +++ + PA V H DWI +PG+P V+D +TS++ +VV R Sbjct: 174 YD------GDVSKLPADHPARV---HPDWIVKHEGLVYYNPGLPAVRDHVTSVITDVVKR 224 Query: 227 YPVDGVQFDDYFYTESPGS-RLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSI 285 Y VDGV FDDYFY GS + D +RKYG ADWRR+N +LIA+V + Sbjct: 225 YDVDGVHFDDYFYPYPGGSAQFADGAAFRKYGKG-EKLADWRRSNVDKLIAQVDEAVHGT 283 Query: 286 KPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFS 345 K V+FG+SP G+WRN++ DP GS T G +AYD YAD R W+ +G +DY+APQ+YWP Sbjct: 284 KQHVKFGISPFGIWRNKAQDPTGSATAGMSAYDSIYADARHWIRKGTVDYVAPQLYWPSG 343 Query: 346 RSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLND 405 AA YDVL WWA VK T LYIG A Y+VG P W G EL L N Sbjct: 344 FKAADYDVLMPWWAKEVKGTDVHLYIGQALYRVGSTDT--PAWTRPG---ELPSHLTKNR 398 Query: 406 AVPEISGTILFREDYLNKPQTQQAVSYL 433 ++ G + F L + + Sbjct: 399 KHKQVKGDVYFNAKQLL-TNPLGVLDLI 425 >UniRef50_D1S7M0 Putative uncharacterized protein n=1 Tax=Micromonospora aurantiaca ATCC 27029 RepID=D1S7M0_9ACTO Length = 555 Score = 359 bits (920), Expect = 1e-97, Method: Composition-based stats. Identities = 154/399 (38%), Positives = 219/399 (54%), Gaps = 18/399 (4%) Query: 37 VTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLD 96 + + R +W+A+V+ +DWP S + Q+ + LD Sbjct: 35 TATSPSTDCVTDPATPKRQFRAMWIASVTNIDWPSKGSWTAPDQ---VAKQKAEYLAWLD 91 Query: 97 HLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKV 156 Q+L N V QV+P A WPS PWS+ +TG G+NPG+DPL F++ E+HKR ++ Sbjct: 92 LAQKLNHNAVVVQVRPTADAFWPSPYEPWSEYLTGVRGKNPGWDPLDFLVAESHKRNLEF 151 Query: 157 HAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTS------GDRFVLDPGIP 210 HAWFNPYRVS+ G +L+ P S QH DW+ G R +PG+P Sbjct: 152 HAWFNPYRVSMPAPGGAGADLSQLA---PDSPARQHPDWVFAYPPAGVAGSRLYYNPGVP 208 Query: 211 EVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRL-NDNETYRKYGGAFASKADWRRN 269 EV++++ + + + V RY +DGV FDDYFY G+ D+ T+ Y F KADWRR+ Sbjct: 209 EVREFVQTAMMDAVKRYDIDGVHFDDYFYPYPSGTHQVPDDATFAAYNRGFTDKADWRRD 268 Query: 270 NTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVE 329 N LI +++ IK++KP V+FGVSP G+WRN S DP GSDT G+ +YD AD+R+WV+ Sbjct: 269 NINLLIQEMNAKIKAVKPYVKFGVSPFGIWRNASADPNGSDTTGSQSYDIISADSRKWVK 328 Query: 330 QGLLDYIAPQIYWPFSR-SAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDW 388 + +DYI PQ+YW + AA Y L WWA+ V+ TR +LYIG A YK G+P+ Sbjct: 329 EEWIDYIVPQLYWYIGQYPAADYARLVPWWAEQVRGTRVQLYIGQADYKSGDPAYGS--- 385 Query: 389 MINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQ 427 EL L LN + PE+ G + F + + Sbjct: 386 -FWMNPQELSNHLTLNRSYPEVLGNVHFSAVQVRANRLG 423 >UniRef50_Q47Q17 FenI protein n=9 Tax=Bacteria RepID=Q47Q17_THEFY Length = 540 Score = 359 bits (920), Expect = 1e-97, Method: Composition-based stats. Identities = 162/384 (42%), Positives = 215/384 (55%), Gaps = 24/384 (6%) Query: 40 PAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQ 99 P + MRG+WL TV +DWP P + QQ+ + LD Sbjct: 55 PIPEDCATDPAYPKRQMRGVWLTTVRNIDWPS-------EPGLSPQQQQEELTAFLDRAV 107 Query: 100 RLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAW 159 LG+N VFF ++P A++ S PW+ +TG G +PGYDPL+F + EAH RG+++HAW Sbjct: 108 ELGLNAVFFHIRPTADAVYASDKEPWARYLTGTQGGDPGYDPLEFAVAEAHTRGLELHAW 167 Query: 160 FNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSI 219 FNPYRV +L P ++ +W+ D+ LDPG PEV++W+ + Sbjct: 168 FNPYRVG-----WREADLEHLADDHPV---RRNPEWMIVYDDQGYLDPGNPEVREWVVDV 219 Query: 220 VAEVVSRYPVDGVQFDDYFYTE-SPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKV 278 VA+VV RY VDGV FDDYFY + G +D+ +++ +G F + WRR+N QLI +V Sbjct: 220 VADVVERYDVDGVHFDDYFYPYPASGETFDDDASWQAHGDGFPDRDAWRRDNVNQLIRQV 279 Query: 279 SHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAP 338 + IKP V FGVSP G+WRNRS DP GS T G +YD +ADTR W+ +G +DY+ P Sbjct: 280 HERVHDIKPWVRFGVSPFGIWRNRSSDPSGSATSGLQSYDALHADTRTWIREGWIDYVVP 339 Query: 339 QIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELK 398 Q+YWP +AA Y VLA WWA+ V T LYIG A Y+VGE G L Sbjct: 340 QLYWPQGFAAADYAVLAPWWAEEVAGTGVDLYIGQAAYRVGEDG--------WKGADALA 391 Query: 399 KQLDLNDAVPEISGTILFREDYLN 422 KQLD N PEI+G I F LN Sbjct: 392 KQLDFNTQHPEITGDIYFSMKDLN 415 >UniRef50_A8MM80 Putative uncharacterized protein n=1 Tax=Alkaliphilus oremlandii OhILAs RepID=A8MM80_ALKOO Length = 476 Score = 357 bits (917), Expect = 4e-97, Method: Composition-based stats. Identities = 147/387 (37%), Positives = 204/387 (52%), Gaps = 26/387 (6%) Query: 54 QPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPD 113 +R W++TV LDWP + + + Q+ LD L+ G+N V Q+KP Sbjct: 111 AELRATWISTVYNLDWPSKKGLAVED-------QKSEFTALLDGLKSAGLNAVMVQIKPS 163 Query: 114 GTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGT 173 + +PS+ PWS+ +TG G++PGY+PL FM++E HKR M+ HAWFNPYRVSV Sbjct: 164 ADSFYPSQYGPWSEYLTGVQGKDPGYNPLAFMIEETHKRNMEFHAWFNPYRVSVKEDRNA 223 Query: 174 IRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQ 233 + E ++ DW+ + G + +PGIP VQ ++ + EVV Y +DGV Sbjct: 224 LAE---------GHPAKKNPDWVVSYGGKLFYNPGIPAVQQFVIDSILEVVKNYNIDGVH 274 Query: 234 FDDYFYTESPGS-RLNDNETYRKYGG-AFASKADWRRNNTQQLIAKVSHTIKSIKPGVEF 291 DDYFY D E Y+ Y A +K WRRNN I + +IK K V Sbjct: 275 LDDYFYPYPEKEGDFPDEELYQSYRRTASETKEQWRRNNINDFIQNLYQSIKREKSTVVL 334 Query: 292 GVSPAGVWRNRSHDPLGSDTR-GAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAAR 350 GVSP G+WRN++ DP GS+TR G +YD YADT+ W+E G LDYIAPQ+YW A Sbjct: 335 GVSPFGIWRNKADDPKGSNTRGGVTSYDSLYADTKYWIENGWLDYIAPQVYWHIGYDRAE 394 Query: 351 YDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEI 410 Y L WW++VV+ + LYIG A YKV E G E+ Q++ N +PE+ Sbjct: 395 YKELINWWSNVVQNKKVELYIGQAAYKV------EAGTTPWGNPLEILDQIEYNRMIPEV 448 Query: 411 SGTILFREDYLNKPQTQQAVSYLQSRW 437 G+I FR + L+ + Sbjct: 449 KGSIFFRAKSIV-NNPLGLKDNLEKMY 474 >UniRef50_UPI00016A6D2C fenI protein n=1 Tax=Burkholderia oklahomensis EO147 RepID=UPI00016A6D2C Length = 521 Score = 354 bits (909), Expect = 3e-96, Method: Composition-based stats. Identities = 151/428 (35%), Positives = 214/428 (50%), Gaps = 23/428 (5%) Query: 14 RPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVS 73 A+ + + + + + + + R W+A V +DWP Sbjct: 2 WCAVSAIVLTISAGACTRSSDMISENAPNVACAVSRATPKRDFRAFWIAAVRNIDWPSRE 61 Query: 74 SVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKI 133 + ++ QQ+ + LD RL N V QV+P + WPS PWS+ +TG Sbjct: 62 GLTVAE-------QQEELRKWLDLAVRLRYNAVILQVRPVSDSFWPSPFAPWSEFLTGTQ 114 Query: 134 GENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHR 193 G +PGYDPL F + EAH+R +++HAWFNPYR + NT+ + P HR Sbjct: 115 GTDPGYDPLAFAVAEAHRRNLELHAWFNPYRAARNTQIDLLA---------PTHPARLHR 165 Query: 194 DWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTES-PGSRLNDNET 252 DW+ + ++ +PG+P ++ I + + V RY VDGV DD+FY G D T Sbjct: 166 DWLVSYDNQLYFNPGVPAAREHIVDAIMDAVDRYDVDGVHLDDFFYPYPIAGETFADTAT 225 Query: 253 YRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTR 312 Y +YG F + ADWRR+N + +S IK++KP V+FG+SP VWRN S DP GS+T Sbjct: 226 YMQYGAGFTTLADWRRHNVDVFVEMLSRRIKAVKPWVKFGISPFAVWRNASVDPQGSETS 285 Query: 313 -GAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYI 371 YD+ YADTRRW+ + +DYIAPQ+YW A YD + WW + V+ +R LYI Sbjct: 286 TDVQTYDDQYADTRRWLRENWIDYIAPQVYWAQDFQRADYDKVVSWWVEQVRSSRAHLYI 345 Query: 372 GIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAVS 431 G A YKVG S P W EL L N PE+ G I F + + ++S Sbjct: 346 GQAAYKVGI-SNQSPGWA---SPAELANHLAFNCKFPEVKGNIYFSAKDVRADRL-GSIS 400 Query: 432 YLQSRWGS 439 L W S Sbjct: 401 QLGHTWYS 408 >UniRef50_A1V3X0 FenI protein n=36 Tax=Bacteria RepID=A1V3X0_BURMS Length = 521 Score = 350 bits (898), Expect = 6e-95, Method: Composition-based stats. Identities = 149/410 (36%), Positives = 208/410 (50%), Gaps = 25/410 (6%) Query: 32 PPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAM 91 P +P T + RG W+A+V LDWP + A QQ + Sbjct: 22 ASSPQAVPEVACRPDET--MPKRQFRGTWIASVINLDWPSRPGL-------PAAAQQAEL 72 Query: 92 IDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHK 151 LD R+ N V QV+P A WPS PWS +TG G +PGYDPL F + EAH+ Sbjct: 73 SAWLDDAVRMNRNAVILQVRPTADAFWPSPFEPWSKYLTGAQGGDPGYDPLAFAVAEAHR 132 Query: 152 RGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPE 211 R +++HAWFNPYRV+++ + + H DW+ G + +PG+P Sbjct: 133 RNLELHAWFNPYRVAMDDRLDALVAT---------HPARAHPDWVVRYGGKLYYNPGVPA 183 Query: 212 VQDWITSIVAEVVSRYPVDGVQFDDYFYTES-PGSRLNDNETYRKYGGAFASKADWRRNN 270 + ++ + + V+RY +D V DDYFY G+ +D Y +YG FA+ ADWRR+N Sbjct: 184 ARAFVVDAIMDAVARYDIDAVHLDDYFYPYPVAGATFDDASAYAQYGAGFATLADWRRDN 243 Query: 271 TQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGA-AAYDESYADTRRWVE 329 +L+ ++ IK+ KP V+FG+SP VWRN + DP GS T + YD+ YADTRRWV Sbjct: 244 VDRLVESLARRIKAAKPWVKFGISPFAVWRNAATDPQGSRTSASVQTYDDLYADTRRWVR 303 Query: 330 QGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWM 389 + +DY+ PQ YW + A YD + WWA+ V+ LYIG A YKVG S P W Sbjct: 304 ERWIDYVVPQAYWARGFAPADYDEVVAWWANEVRGRDAHLYIGQAAYKVGT-SNQSPGWS 362 Query: 390 INGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQSRWGS 439 EL + L N PE+ G + F + + A + L W S Sbjct: 363 ---DPDELSRHLAFNLTAPEVKGDVYFSAKDVRADRL-GATTRLNRTWYS 408 >UniRef50_C5C4P8 Putative uncharacterized protein n=4 Tax=Bacteria RepID=C5C4P8_BEUC1 Length = 538 Score = 349 bits (894), Expect = 2e-94, Method: Composition-based stats. Identities = 161/431 (37%), Positives = 225/431 (52%), Gaps = 31/431 (7%) Query: 13 RRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPV 72 RR + +A A + S + S +TP A + + +R +W+++V +DWP Sbjct: 17 RRTFLTLATAGVAASTLTVTVGS-MTPAAATPSADPAAFLKRELRAMWISSVVNIDWPSA 75 Query: 73 SSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGK 132 + + A QQ + LD Q +N VF QV+P A WPS PWS +TG Sbjct: 76 TGL-------SAEAQQAEYLHWLDVAQDFRLNAVFVQVRPTADAFWPSPHEPWSQYLTGV 128 Query: 133 IGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQH 192 G++PGYDPL F+++E HKR +++H W+NPYRVS+ P + P H Sbjct: 129 QGQDPGYDPLAFIVEETHKRNLELHTWYNPYRVSMQADPAQLV---------PEHPARVH 179 Query: 193 RDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTES-PGSRLNDNE 251 DWI G + DPG+PE Q+ I + + V Y +DGV FDDYFY G + D E Sbjct: 180 PDWIWPYGGKLYFDPGLPETQEHIQAAILHSVENYDIDGVHFDDYFYPYPVAGQTIPDAE 239 Query: 252 TYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDT 311 TY YG F DWRR+N I+ +S IK +KP V+FG+SP G+WRN + DPLGS T Sbjct: 240 TYATYGAGFDDVGDWRRHNVDTFISSISARIKQVKPWVKFGISPFGIWRNDTTDPLGSAT 299 Query: 312 RGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYI 371 RG+ +YD +ADTR+WV +G LDYI PQ+YW + A Y VL WWADV + T LYI Sbjct: 300 RGSQSYDLQFADTRKWVLEGWLDYINPQVYWQIGLAVADYSVLVPWWADVAATSGTHLYI 359 Query: 372 GIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDA----VPEISGTILFREDYLNKPQTQ 427 G A YKV +P EL L L+ V + G + F ++ Sbjct: 360 GEALYKVTSGVFTDP--------AELANHLALDRDVTETVGPVHGNVYFSAKHV-PADPA 410 Query: 428 QAVSYLQSRWG 438 ++S ++ + Sbjct: 411 GSMSLVRDGYY 421 >UniRef50_D2AR89 FenI protein n=9 Tax=Bacteria RepID=D2AR89_STRRD Length = 520 Score = 348 bits (893), Expect = 2e-94, Method: Composition-based stats. Identities = 144/388 (37%), Positives = 215/388 (55%), Gaps = 21/388 (5%) Query: 51 QSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQV 110 + MRG+W+A+V ++WP + A Q+ + LD Q +N VF Q+ Sbjct: 42 PPLRQMRGMWIASVVNINWPSKPGL-------TADQQKAEYLAWLDLAQVRKLNAVFVQI 94 Query: 111 KPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTK 170 +P A WPS PWS +TG G++PGYDPL F+++E HKRG+ HAWFNPYRVS+ Sbjct: 95 RPTADAFWPSPFEPWSQYLTGTQGQDPGYDPLAFVVEETHKRGLAFHAWFNPYRVSMQPD 154 Query: 171 PGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVD 230 P + P +H DWI G + +PG+PEV+ ++ + + V++Y +D Sbjct: 155 PSKL---------HPDHPGTKHPDWIVPYGGKLYYNPGMPEVRAFVQDAMMDAVTKYDID 205 Query: 231 GVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVE 290 G+ FDDYFY + + +D+ + KYG F A WRRNN L+ ++ ++ KP + Sbjct: 206 GLHFDDYFYPVN-TTAFDDSAAFAKYGQGFPDLAAWRRNNVDLLVQEMQQRVRQAKPEIA 264 Query: 291 FGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAAR 350 +G+SP+G+WRN++ DPLGSDT G+ +YD +ADTR WV++G LDYIAPQ+YW +S A Sbjct: 265 WGISPSGIWRNKTTDPLGSDTGGSQSYDNLHADTRGWVKKGWLDYIAPQLYWYIGQSNAD 324 Query: 351 YDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEI 410 Y L WW+DV T T+L+IG A YK G + EL + L LN P++ Sbjct: 325 YAKLVPWWSDVAAGTPTQLWIGQAAYKAGAAGQP----AQWFQPDELTRHLTLNRDHPQV 380 Query: 411 SGTILFREDYLNKPQTQQAVSYLQSRWG 438 G I + + + + + + Sbjct: 381 GGDIWYNSGDVRDDRLGSVTTVVTDHYT 408 >UniRef50_D2QEX0 Putative uncharacterized protein n=2 Tax=Flexibacteraceae RepID=D2QEX0_9SPHI Length = 570 Score = 348 bits (893), Expect = 2e-94, Method: Composition-based stats. Identities = 143/419 (34%), Positives = 219/419 (52%), Gaps = 29/419 (6%) Query: 23 LLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTS 82 +L+ + + P + ++ P + R +W+ATV+ +DWP + +++ Sbjct: 62 VLVNAVDTLPFDDTPEQILAARGPIP---PKREFRAVWVATVNNIDWPSKKGLPVAD--- 115 Query: 83 RARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIG--ENPGYD 140 QQ+ ++ D Q++G+N V QV+ A + PWS+ +TG+ G P YD Sbjct: 116 ----QQREIVAMFDQHQQMGLNAVVVQVRSAADAFYARGSEPWSEWLTGQQGLAPEPFYD 171 Query: 141 PLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSG 200 PL+FM+D+AH RG++ HAWFN R + + T S P+++ + +W+ G Sbjct: 172 PLEFMIDQAHGRGLEFHAWFNLDRAT----------FSKTASVAPSNIVNRKPEWMLMYG 221 Query: 201 DRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTE-SPGSRLNDNETYRKYGGA 259 R + + GIP V+ +I IVA VV Y VDG+ FDDYFY PG L D+ TY+ Sbjct: 222 GRKLFNLGIPAVRSYIAGIVANVVREYDVDGIHFDDYFYPYAEPGQVLRDDSTYKA-NSN 280 Query: 260 FASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDE 319 SK DWRR+N +L+ ++ +I++ KP V+FG+SP G+W+N+S DP GS T G AY E Sbjct: 281 GMSKPDWRRDNVTKLVKELRDSIRANKPWVKFGISPFGIWKNKSSDPEGSATNGGQAYYE 340 Query: 320 SYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVG 379 YADTR+WV +GL+DY+ PQ+Y+ S Y L WW LYIG Y+VG Sbjct: 341 LYADTRKWVREGLIDYVVPQVYFSSEFSKVPYKTLVDWWTRNCT-ENCHLYIGHGAYRVG 399 Query: 380 EPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQSRWG 438 S+ +P W E Q+ N + G++ F L + LQ+ + Sbjct: 400 RGSERDPGWWRPT---EFPDQMRYNRQQQVVKGSVFFSAKNL-QINPLSIRDSLQTNFY 454 >UniRef50_A3HZ09 FenI n=1 Tax=Algoriphagus sp. PR1 RepID=A3HZ09_9SPHI Length = 543 Score = 348 bits (892), Expect = 3e-94, Method: Composition-based stats. Identities = 151/461 (32%), Positives = 223/461 (48%), Gaps = 54/461 (11%) Query: 9 KLTIRRPAILVALALLLCSCKSTP---PESMVTPPAGSKPPATT---------------- 49 K R I++A LLL +CKS+ P T P + P + Sbjct: 2 KFLSRHLFIILAFGLLLSACKSSKNVTPGQQPTAPIQTSPNTDSGTNLPVKTLPKTPIAL 61 Query: 50 -------QQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLG 102 + + RG+W+ATV+ +DWP +P Q++ ++ LD+ + L Sbjct: 62 APLSYQMPEMPREFRGVWIATVANIDWP-------ISPDDPYEKQKRDFLEILDYYKSLN 114 Query: 103 INTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYD--PLQFMLDEAHKRGMKVHAWF 160 N V QV+ G A +PS + PWS +TGK G+ P + PL +M+ E+H RGM+ HAW Sbjct: 115 FNAVIVQVRTAGDAFFPSNLAPWSKYLTGKQGKAPNTNENPLTWMIHESHARGMEFHAWL 174 Query: 161 NPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIV 220 NPYR +++ K + P Y HR+W+ G ++ +PG+PEVQ + ++ Sbjct: 175 NPYRATMDLKTDELS---------PDHDYNAHRNWMVKYGTKYYYNPGLPEVQTHLLKVI 225 Query: 221 AEVVSRYPVDGVQFDDYFYTESPG-SRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVS 279 E+V Y VD + FDDYFY D TY KY + ++ DWRR+N QLI ++ Sbjct: 226 KEIVDNYDVDAIHFDDYFYPYKIAREEFPDRNTYNKYKKSGQTQDDWRRDNVNQLIFALN 285 Query: 280 HTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTR-GAAAYDESYADTRRWVEQGLLDYIAP 338 +TIK KP V+FG+SP GVWRN+ DP GS T+ G YD+ YAD W++ G +DY+ P Sbjct: 286 NTIKQSKPWVQFGISPFGVWRNQDKDPKGSPTQAGQTNYDDLYADVLLWMKNGWVDYMIP 345 Query: 339 QIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELK 398 Q+YW A + +L WWA T +YIG YK+ E S E+ Sbjct: 346 QLYWSMEHPLASHRILNDWWA--TNHNYTNIYIGNGPYKIREDS-----DKAWENPKEIN 398 Query: 399 KQLDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQSRWGS 439 Q+ +P I G F K + + L+ + Sbjct: 399 NQISYTRTLPTIQGNAFFSAKS-MKIKNRDVAQLLKGELYN 438 >UniRef50_D1AYL2 Putative uncharacterized protein n=1 Tax=Streptobacillus moniliformis DSM 12112 RepID=D1AYL2_STRM9 Length = 437 Score = 347 bits (889), Expect = 6e-94, Method: Composition-based stats. Identities = 138/381 (36%), Positives = 217/381 (56%), Gaps = 29/381 (7%) Query: 48 TTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVF 107 ++ ++ ++G+W ATV LD+P +S+ Q++ + + ++++++ G+N VF Sbjct: 69 ENRKINKNLKGVWAATVVNLDFPKTTSM---------EEQKREIDEMMENIKKWGLNAVF 119 Query: 108 FQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSV 167 F V+P AL+ S+ PWS +TG +PGYDPL++ + AHKRG+++HAW NPYR ++ Sbjct: 120 FHVRPAADALYNSEFEPWSIYLTGTQNRHPGYDPLEYAIKAAHKRGIELHAWINPYRAAM 179 Query: 168 NTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRY 227 NT + + S+ + +WI +F ++PG PEV ++++ + E+V +Y Sbjct: 180 NTDLNKLSD---------KSIVKRKPEWIFEYDGKFYMNPGNPEVVNYVSKAIEEIVEKY 230 Query: 228 PVDGVQFDDYFYTESPGS----RLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIK 283 +DG+ DDYFY + D + + KYG + S+ DWRR+N +I +S ++ Sbjct: 231 DIDGLHLDDYFYPYPSATLKLGDNVDQKEFEKYGSEYNSRGDWRRDNVNNMIKNLSVSVH 290 Query: 284 SIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWP 343 IKP + FGVSP G+WRN D GS T+G +YD YAD+ +W+++G +DYIAPQIYW Sbjct: 291 KIKPNLSFGVSPFGIWRNYETDARGSKTKGLQSYDSLYADSLKWMKEGWVDYIAPQIYWN 350 Query: 344 FSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDL 403 A Y+ L KWWA+ K T T LY+G YK EP EL+KQL L Sbjct: 351 IGFEKADYEELVKWWAEKSKETNTPLYVGHGVYKYIEPKP-------WKDSKELEKQLKL 403 Query: 404 NDAVPEISGTILFREDYLNKP 424 N+ + G+I FR L + Sbjct: 404 NEKYDAVKGSIFFRYGTLLEN 424 >UniRef50_C5PKN1 Possible FenI n=1 Tax=Sphingobacterium spiritivorum ATCC 33861 RepID=C5PKN1_9SPHI Length = 508 Score = 342 bits (878), Expect = 1e-92, Method: Composition-based stats. Identities = 133/388 (34%), Positives = 203/388 (52%), Gaps = 29/388 (7%) Query: 51 QSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQV 110 + +RG+W+ATV+ +DWP + Q+Q +I+ LD QR G+N +FFQ+ Sbjct: 26 SPKRELRGVWIATVANIDWPSR-------DNESSERQKQELINILDAHQRAGLNAIFFQI 78 Query: 111 KPDGTALWPSKILPWSDLMTGKIG--ENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVN 168 +P A + PWS +TG G +P YDPL+F+++EAHKRGM++HAW NPYR S Sbjct: 79 RPAADAFYAKGREPWSRYLTGVQGKAPSPFYDPLEFVIEEAHKRGMELHAWVNPYRASTT 138 Query: 169 TKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYP 228 P + + +W G +++ +PG+PEV+ +I ++ +VV Y Sbjct: 139 LNPAHFSK---------DHITRTKPEWFFKYGGKYLFNPGLPEVRQYIIDVIMDVVKNYD 189 Query: 229 VDGVQFDDYFYTESPGSR--LNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIK 286 VDG+ FDDYFY L D T+ ++G FA+ DWRRNN LI + IK K Sbjct: 190 VDGIHFDDYFYPYPDARNTALPDAPTFHQFGKGFANIHDWRRNNVDLLIRDLGIAIKKEK 249 Query: 287 PGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSR 346 P +++G+SP G+W N+ +P GS+T G + Y YAD +W+++G +DYI PQIY+PF+ Sbjct: 250 PFIKYGISPFGIWDNKRDNPDGSNTSGLSGYRTLYADGVKWMKEGWIDYINPQIYFPFNN 309 Query: 347 SAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDA 406 AA +++L +WW Y+G Y+V E ++ KQ+ Sbjct: 310 RAAAFEILLEWWEKHT--YGRHFYVGHGAYRVTEKRPG------WTDKGQIPKQVRHLRD 361 Query: 407 VPEISGTILFREDYLNKPQTQQAVSYLQ 434 E+ G+I F L +Q Sbjct: 362 QHEVQGSIYFSSKSLMD-NLAGLRDSMQ 388 >UniRef50_UPI00016C4E90 hypothetical protein GobsU_27726 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4E90 Length = 481 Score = 336 bits (861), Expect = 1e-90, Method: Composition-based stats. Identities = 119/401 (29%), Positives = 185/401 (46%), Gaps = 37/401 (9%) Query: 46 PATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINT 105 A + R +W+ATVS +DWP + A Q++ ++ LD+ L +N Sbjct: 19 AADPPALKREFRAVWVATVSNIDWPSKPGL-------PADQQKKELLAILDNAVELKLNA 71 Query: 106 VFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRV 165 V FQV+P AL+ S++ PWS+ +TG+IG+ PGYDPL F + EAHKRG+++HAWFNPYR Sbjct: 72 VIFQVRPMADALYASELEPWSEYLTGQIGKAPGYDPLAFAVTEAHKRGLELHAWFNPYRA 131 Query: 166 SVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVS 225 + + + D + G ++P PEVQ+ + +VV Sbjct: 132 RHPSAKSPAPA---------DHLTRKRPDLAKPYGTHAWMNPTNPEVQEHSLRVFLDVVK 182 Query: 226 RYPVDGVQFDDYFYTESPGSR------LNDNETYRKY--GGAFASKADWRRNNTQQLIAK 277 RY +DG+ DDYFY D++T+ Y G S+ DWRR+ + + Sbjct: 183 RYDIDGIHIDDYFYPYKEKGTDGKVIPFPDDDTWEAYQKQGGKLSRDDWRRDAVNVFVRR 242 Query: 278 VSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIA 337 + K KP V+ G+SP G+WR + G Y E YAD + W +G +DY Sbjct: 243 MYEETKKAKPWVKVGISPFGIWRPGHP----AGIAGLDQYAELYADAKLWFNEGWVDYFT 298 Query: 338 PQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPEL 397 PQ+YWP ++ + L WWA + L+ G+ +V +K E+ Sbjct: 299 PQLYWPIAQEKQSFPKLLDWWAGE-NTKKRHLWPGLYTSRVTGAAKG-------WNAKEI 350 Query: 398 KKQLDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQSRWG 438 Q+ + + G + F L + T L+ + Sbjct: 351 ADQIAVTRQRSDTDGAVHFSAKALVR-NTGGIADELKQVYA 390 >UniRef50_A6NVH8 Putative uncharacterized protein n=1 Tax=Bacteroides capillosus ATCC 29799 RepID=A6NVH8_9BACE Length = 606 Score = 334 bits (857), Expect = 3e-90, Method: Composition-based stats. Identities = 136/400 (34%), Positives = 215/400 (53%), Gaps = 28/400 (7%) Query: 43 SKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLG 102 ++ + R +W+ATV LD+P + T+ A + + L++ +G Sbjct: 34 AQQANAPSAARDDFRAVWVATVYNLDYPNAA-------TTDADALKAQADEILENCVDMG 86 Query: 103 INTVFFQVKPDGTALWPSKILPWSDLMTGKIGENP--GYDPLQFMLDEAHKRGMKVHAWF 160 +N V QV+P G AL+PS++ PWS +TG G P +DPL + ++ AH+ G+++HAW Sbjct: 87 MNAVILQVRPSGDALYPSELFPWSKYLTGASGLAPEDNFDPLAYWVERAHELGLELHAWI 146 Query: 161 NPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIV 220 NP+R++ + + S VQH +W+ + L+PG+PEV++ + Sbjct: 147 NPFRITKGGEAE-------LAALDAKSPAVQHPEWVVECDGNYYLNPGLPEVRELVIQGA 199 Query: 221 AEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSH 280 E+V Y VDGV DDYFY ND+ +++YGG F + DWRR+N QLI + Sbjct: 200 EELVRNYDVDGVHLDDYFYPS---RSFNDDAAFQQYGGDFDNIGDWRRDNVNQLIQGLDQ 256 Query: 281 TIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRG-AAAYDESYADTRRWVEQGLLDYIAPQ 339 + ++ P + FGVSP+GVW + +H GS T G +Y +YAD+R+WV++G +DYI PQ Sbjct: 257 RLHALDPELSFGVSPSGVWADSTHQSAGSATTGNYESYYAAYADSRKWVKEGWVDYICPQ 316 Query: 340 IYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKK 399 IYW Y+ +A+WW+D V+ T +LYIG+A Y + ++ P G+ + K Sbjct: 317 IYWYIGHPTMDYETIARWWSDTVEGTGVKLYIGMADYLADDGTEGSP----WNGLDAITK 372 Query: 400 QLDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQSRWGS 439 QL LN + +SG + FR +L + L W S Sbjct: 373 QLTLNREL-GVSGEVHFRYKFLAVNSN---LKRLYETWYS 408 >UniRef50_B1HPQ3 Hypothetical lipoprotein yddW n=2 Tax=Bacillaceae RepID=B1HPQ3_LYSSC Length = 522 Score = 334 bits (857), Expect = 3e-90, Method: Composition-based stats. Identities = 161/432 (37%), Positives = 228/432 (52%), Gaps = 38/432 (8%) Query: 14 RPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVS 73 + ++ +A++L C S P + V A+T Q + MR +W++TV LD + Sbjct: 7 KWKLIALVAMILMLCLSAIPANTVK--------ASTTQPKREMRAVWISTVLNLDM--KA 56 Query: 74 SVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGK- 132 +N T AR LD L+ NTV +QV+P A++ S++ PWS +TGK Sbjct: 57 GMNKEQYTVWARQT-------LDQLKANKFNTVIYQVRPTNDAMYASELAPWSSYITGKK 109 Query: 133 IGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQH 192 G NPGYDPL +++E+HKRGM++HAW NPYRV+++ + P +V + H Sbjct: 110 QGTNPGYDPLTILVEESHKRGMELHAWMNPYRVTMSGQK--------LTDLAPDNVAITH 161 Query: 193 RDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSR-LNDNE 251 +W+ G ++ L+PG+PEVQD++ IV E+V+ Y VD V DDYFY + D Sbjct: 162 PNWVVKYGKQYYLNPGLPEVQDYLVEIVRELVANYDVDAVHMDDYFYPYKIANEVFPDQA 221 Query: 252 TYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDT 311 Y+KYG +F DWRRNN +L+ + IK KP V+FG+SP GVWRN+S D GSDT Sbjct: 222 AYKKYGASFNKVEDWRRNNVNRLVENLYTAIKETKPYVQFGISPFGVWRNKSLDKTGSDT 281 Query: 312 R-GAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKP----TR 366 R G YD+ YAD R W++ G +DYI PQIYW + S A+Y L WW+ V+ Sbjct: 282 RAGVNNYDDLYADVRTWIQNGTIDYITPQIYWSRTLSVAKYGTLLDWWSHEVQTYAKMHP 341 Query: 367 TRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVP-EISGTILFREDYLNKPQ 425 LYIG+A YKVG S EL Q+ N + G + F + Sbjct: 342 VHLYIGLADYKVGNDS-----DAAWKNKMELPSQILENRSEKVAADGQMHFSLRSFQSNK 396 Query: 426 TQQAVSYLQSRW 437 A Q + Sbjct: 397 LGYATIVSQQLY 408 >UniRef50_C7PIN2 Putative uncharacterized protein n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PIN2_CHIPD Length = 509 Score = 332 bits (852), Expect = 1e-89, Method: Composition-based stats. Identities = 139/391 (35%), Positives = 196/391 (50%), Gaps = 34/391 (8%) Query: 51 QSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQV 110 + R +W+ATV +DWP + Q+Q I+ LD QR G+N V Q+ Sbjct: 24 PPKREFRAVWIATVENIDWPSRKGL-------PVETQKQEFINLLDKHQRNGMNAVIVQI 76 Query: 111 KPDGTALWPSKILPWSDLMTGKIGE--NPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVN 168 +P A + S PWS+ ++G G+ NP YDPL+FML+E HKRGM+ HAWFNPYR + Sbjct: 77 RPAADAFYDSPFEPWSEYLSGVQGQAPNPYYDPLRFMLEETHKRGMEFHAWFNPYRAVIR 136 Query: 169 TKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYP 228 + W + DPGIPEV++++T I+ +VV RY Sbjct: 137 NASA-------------NHISRMRPQWFVNFDGKKYFDPGIPEVREYVTQIIRDVVRRYD 183 Query: 229 VDGVQFDDYFYTES-PGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKP 287 +D V FDDYFY PG DN +YR+YG K DWRR N +I VS IK KP Sbjct: 184 IDAVHFDDYFYPYPVPGREFGDNNSYRQYGRNMM-KDDWRRWNVDTIIQMVSKMIKEEKP 242 Query: 288 GVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRS 347 V+FG+SP G+WRN++ D GS T G + YD+ YAD R+W++ G +DY+APQ+YW Sbjct: 243 WVKFGISPFGIWRNKNKDQDGSYTTGLSNYDDLYADVRKWLQNGWIDYVAPQLYWERGHR 302 Query: 348 AARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAV 407 A Y++L WWA +YIG Y++ + EL Q+ + Sbjct: 303 VANYELLLNWWAQ--HGYGRNVYIGHGVYRLRSNAAWSI-------PNELPVQITEVRTL 353 Query: 408 PEISGTILFREDYLNKPQTQQAVSYLQSRWG 438 I G+ + N L++ + Sbjct: 354 NTIQGSAFYSAKSFN-GNPLGIEDSLRNHFY 383 >UniRef50_A5FI17 Putative uncharacterized protein n=1 Tax=Flavobacterium johnsoniae UW101 RepID=A5FI17_FLAJ1 Length = 523 Score = 331 bits (848), Expect = 4e-89, Method: Composition-based stats. Identities = 136/385 (35%), Positives = 201/385 (52%), Gaps = 27/385 (7%) Query: 45 PPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGIN 104 RG+W+ATV +DWP + N+ ++ ++ L+ ++L N Sbjct: 23 SQEKIIYPKNEFRGVWIATVVNIDWPKTAIDNV-------EKEKADYLEILNTYKKLNYN 75 Query: 105 TVFFQVKPDGTALWPSKILPWSDLMTGKIG--ENPGYDPLQFMLDEAHKRGMKVHAWFNP 162 V Q++ G A +PS+ PWS +TGK G NP YD L++M++EAH RG + HAW NP Sbjct: 76 AVIVQIRSVGDAFYPSEFAPWSRFLTGKEGTAPNPYYDALEWMIEEAHNRGFEFHAWLNP 135 Query: 163 YRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAE 222 YR + + + P +H +W+ G ++ DP +PEVQ +T +V E Sbjct: 136 YRATFDLNKNLLS---------PNHDIFKHPEWMIEYGGKYYYDPALPEVQTHLTKVVKE 186 Query: 223 VVSRYPVDGVQFDDYFYTES-PGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHT 281 VV +Y +D + FDDYFY + PG ND +Y+KYG S ADWRR N + +S T Sbjct: 187 VVDKYDIDAIHFDDYFYPYAVPGKVFNDTASYKKYGSG-LSLADWRRANVSNFVHTISTT 245 Query: 282 IKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIY 341 IK+ KP V+FG+SP GVWRN+S DP GS+T+ + YD+ YAD W++Q +DYI PQ+Y Sbjct: 246 IKASKPWVQFGISPFGVWRNKSQDPKGSETQSTSNYDDLYADPVLWMDQKWIDYIMPQLY 305 Query: 342 WPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQL 401 W + A Y L KWW++ T +YIG A YK+ E+ Q+ Sbjct: 306 WSMNNPRASYSKLVKWWSE--NANNTAIYIGHASYKIRGD-----GDKSWYFATEIPTQV 358 Query: 402 DLNDAVPEISGTILFREDYLNKPQT 426 D + ++G+ F + Sbjct: 359 DFARSFKNVNGSAYFSAKWFMSKNL 383 >UniRef50_A4ASW6 FenI n=1 Tax=Flavobacteriales bacterium HTCC2170 RepID=A4ASW6_9FLAO Length = 507 Score = 331 bits (847), Expect = 4e-89, Method: Composition-based stats. Identities = 133/431 (30%), Positives = 211/431 (48%), Gaps = 43/431 (9%) Query: 12 IRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPP 71 +++ + + L ++ SC + P Q RG+W+ATV +DWP Sbjct: 2 LKKISHYLLLLIIFNSCDAIKP---------------IPQPRTEFRGVWVATVVNIDWPK 46 Query: 72 VSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTG 131 N Q+ + L+ +L NTV QV+ G + + SK PWS +TG Sbjct: 47 -------NGLDAIEKQKADFLKILEFYDQLNFNTVIVQVRTAGDSFYDSKYAPWSRFLTG 99 Query: 132 KIGENP--GYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVY 189 G++ +D L +M+D+ H RG + HAW NPYR + + K + + Sbjct: 100 TEGKSTEGHFDMLNWMIDQTHNRGFEFHAWLNPYRATFDLKTDVLSAT---------HDF 150 Query: 190 VQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSR-LN 248 H +W+ G+++ +PG+PEV++ + SI+ EVV++Y +D + FDDYFY N Sbjct: 151 NLHPEWMLKYGNKYYYNPGLPEVRERLASIMGEVVTKYDIDAIHFDDYFYPYRIKDEIFN 210 Query: 249 DNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLG 308 D+ Y + + + +WRR+N L+ + T+K+IKP V+FG+SP GVW+N+S DP G Sbjct: 211 DSLAYNYHSFSGQTVENWRRSNIDSLVKNIHSTVKNIKPWVQFGISPFGVWKNKSTDPRG 270 Query: 309 SDTR-GAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRT 367 SDT+ G Y++ YAD W+ +G +DY+ PQ+YW A + + WW++ T Sbjct: 271 SDTKAGQTTYEDLYADPLTWMNEGWIDYLVPQVYWSMDLPVASHKKIVNWWSN--NSVNT 328 Query: 368 RLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQ 427 LYIG YK+ S E+ QL L ++ G +LF L Sbjct: 329 NLYIGNGAYKIRSNS-----DKAWDDKKEMPNQLKLARKDSKVQGNVLFSAKSLM-NDNP 382 Query: 428 QAVSYLQSRWG 438 V YL+ R+ Sbjct: 383 DVVEYLKRRFY 393 >UniRef50_C6XWP7 Putative uncharacterized protein n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6XWP7_PEDHD Length = 519 Score = 329 bits (844), Expect = 9e-89, Method: Composition-based stats. Identities = 137/380 (36%), Positives = 199/380 (52%), Gaps = 24/380 (6%) Query: 36 MVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKL 95 ++TP + + + RG+W+ATV+ +DWP +NI Q+Q +I L Sbjct: 13 IITPISLIAQSPSKIAPKREFRGVWVATVANIDWPSKPGLNID-------QQKQELIGLL 65 Query: 96 DHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENP--GYDPLQFMLDEAHKRG 153 + + G+N + QV+P A + PWS + GK G P GYDPL F + EAH RG Sbjct: 66 EQHKANGMNAIILQVRPAADAFYLKSREPWSQWLMGKQGMAPAPGYDPLAFAIKEAHSRG 125 Query: 154 MKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQ 213 M++HAWFNPYR +++ P + + D G + DPGIPEV+ Sbjct: 126 MELHAWFNPYRATMSAS----------AVVSPDHMTRKRPDLFFVYGGKKQFDPGIPEVR 175 Query: 214 DWITSIVAEVVSRYPVDGVQFDDYFYTES-PGSRLNDNETYRKYGGAFASKADWRRNNTQ 272 ++I ++ +VV Y VDG+ FDDYFY G +ND T+ KY F++ ADWRRNN Sbjct: 176 EYIVQVILDVVKGYDVDGIHFDDYFYPYKIAGQNINDAATFNKYPNGFSNIADWRRNNVD 235 Query: 273 QLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGL 332 LI ++ +I K V+FGVSP G+W+N S D LGS T G + Y E YAD+R+WV++G Sbjct: 236 LLIKQLDDSIHHYKKYVKFGVSPFGIWKNLSEDSLGSATNGLSNYAELYADSRKWVKEGW 295 Query: 333 LDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMING 392 +DYI PQ+Y+ F+R AA + +A WW + +YIG Y + S Sbjct: 296 VDYINPQVYFSFTRRAAPFATIADWWTN--NAFGRHVYIGHGAYLIHNGST--RKEAAWA 351 Query: 393 GVPELKKQLDLNDAVPEISG 412 ++ Q+ I G Sbjct: 352 FPNQIPNQIRHIRGSNLIQG 371 >UniRef50_A1ZQ43 YngK protein n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZQ43_9SPHI Length = 517 Score = 325 bits (832), Expect = 2e-87, Method: Composition-based stats. Identities = 141/400 (35%), Positives = 206/400 (51%), Gaps = 27/400 (6%) Query: 42 GSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRL 101 + T ++ + R +WL T +D+P + +Q +I LD Q+ Sbjct: 30 SNNVKVTKRKLKREFRAVWLTTFDHMDFPKEKG-------APPSEHKQELIKLLDFHQKS 82 Query: 102 GINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPG--YDPLQFMLDEAHKRGMKVHAW 159 GIN +FFQV+P A + S+I WS +TGK G+ P +DPL+F++ E HKR +++HAW Sbjct: 83 GINAIFFQVRPAADAFYKSEIELWSQWLTGKQGKAPEPLWDPLEFLVTECHKRNIELHAW 142 Query: 160 FNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSI 219 NPYR N K + P + +H +W G +PGIP V+ ++ ++ Sbjct: 143 INPYRAVYNIKHD---------ATAPNHITKRHPEWFVVYGKHKQFNPGIPAVRHYLKAV 193 Query: 220 VAEVVSRYPVDGVQFDDYFYTESPGS-RLNDNETYRKYGGAFASKADWRRNNTQQLIAKV 278 VA++ RY +DG+ FDDYFY G D T+ K+GG WRR N LI +V Sbjct: 194 VADIAQRYDIDGIHFDDYFYPYKKGRLEFPDQSTFMKHGGNSKDVHHWRRQNVNSLIKEV 253 Query: 279 SHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTR-GAAAYDESYADTRRWVEQGLLDYIA 337 T++SIKP ++FG+SP GVWRN+S DP GSDT+ G ++YD YAD +W+ +G +DY+ Sbjct: 254 HDTLQSIKPYLKFGISPLGVWRNKSEDPNGSDTQVGQSSYDYLYADVLKWLRKGWIDYLV 313 Query: 338 PQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPEL 397 PQ+YW A + LA WWA +YIG AFYK+ V EL Sbjct: 314 PQLYWSIEHPRASFKSLAFWWAK--HAYSRHIYIGHAFYKI-----KNDKDDHWKQVSEL 366 Query: 398 KKQLDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQSRW 437 Q+ + I G FR D+L K + + Q + Sbjct: 367 PNQVRMTRQYRSILGNAYFRSDFLQKNPAKVTDTLRQQLY 406 >UniRef50_C9L341 YngK protein n=45 Tax=Bacteroidales RepID=C9L341_9BACE Length = 528 Score = 324 bits (831), Expect = 3e-87, Method: Composition-based stats. Identities = 143/386 (37%), Positives = 207/386 (53%), Gaps = 31/386 (8%) Query: 43 SKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLG 102 ++ P+ + + RG W+ V N +Q +ID+L+ LQ G Sbjct: 46 AQVPSGNKYPKREFRGAWIQAV-----------NGQFKGIPTGKLKQTLIDQLNSLQGAG 94 Query: 103 INTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPG--YDPLQFMLDEAHKRGMKVHAWF 160 IN + FQV+P+ AL+ S+ PWS +TG G+ P +DP+QFM++E KR M+ HAW Sbjct: 95 INAIIFQVRPEADALYASQHEPWSRFLTGTQGQIPSPMWDPMQFMIEECRKRNMEFHAWI 154 Query: 161 NPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIV 220 NPYRV + K P +Y QH +W T GD+ DP +PE +D+I IV Sbjct: 155 NPYRVKTSLKNQLA----------PEHIYHQHPEWFVTYGDQLYFDPALPESRDYICKIV 204 Query: 221 AEVVSRYPVDGVQFDDYFYTES-PGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVS 279 ++VSRY VD + DDYFY G D+ ++ +YGG F +KADWRR+N LI K+ Sbjct: 205 TDIVSRYDVDAIHMDDYFYPYPVKGMDFPDDASFARYGGGFTNKADWRRSNVNVLIKKLH 264 Query: 280 HTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQ 339 TI+++KP V+FG+SP G++RN+ DPLGSDT G YD+ YAD W +G +DY PQ Sbjct: 265 ETIRAVKPWVKFGISPFGIYRNQKSDPLGSDTNGLQNYDDLYADVLLWAREGWIDYNIPQ 324 Query: 340 IYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKK 399 IYW AA Y+ L KWWA L+IG + + I+ N + +L + Sbjct: 325 IYWEIGHKAADYETLVKWWA--THSENRPLFIGQSV-----SNTIQHADPKNPSINQLPR 377 Query: 400 QLDLNDAVPEISGTILFREDYLNKPQ 425 ++ L A I G+ + + + Q Sbjct: 378 KMALQRAYQTIGGSCQWYASAVVENQ 403 >UniRef50_A9NEW1 Putative uncharacterized protein n=1 Tax=Acholeplasma laidlawii PG-8A RepID=A9NEW1_ACHLI Length = 1328 Score = 324 bits (830), Expect = 4e-87, Method: Composition-based stats. Identities = 132/411 (32%), Positives = 206/411 (50%), Gaps = 25/411 (6%) Query: 36 MVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKL 95 P T + + +R +W+ATV+ +D + +I L Sbjct: 880 YYQTNTPVALPTTYTEKDKEIRAVWVATVANID---------ITQYDNEANYKNQIISIL 930 Query: 96 DHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMK 155 + ++ L NT+FFQ +P + +PS+ P S ++G G G+D L+F++ EAH RG++ Sbjct: 931 ERMKELKFNTMFFQTRPMNDSFYPSEYAPMSRFLSGTEGVGVGWDVLEFLITEAHARGIE 990 Query: 156 VHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTS-GDRFVLDPGIPEVQD 214 VHAW NPYRV T + ++ Q+ ++ G +L+PGIPEV+ Sbjct: 991 VHAWMNPYRV---ASGSTASIEDQLALLHDSNFAKQNPSYVVQDKGGALILNPGIPEVRQ 1047 Query: 215 WITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQL 274 ++ +IV E++ Y +DGV FDDYFY+ S D + + Y S+ DWRR N Sbjct: 1048 YLYNIVDEIMENYAIDGVHFDDYFYSYSGTEDSQDADAFLNYNPNNLSRDDWRRENVNMF 1107 Query: 275 IAKVSHTIKSIKP----GVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQ 330 + + +++ V+FG+SP G+WRN++ D LGS+++G ++Y YAD+R+WV++ Sbjct: 1108 VKTIYERVEAHNEANDMHVKFGISPFGIWRNKTQDALGSNSQGLSSYSAQYADSRKWVKE 1167 Query: 331 GLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMI 390 G L YI PQ+YW F S AR+ L WW DVVK T L IG FY+ E S Sbjct: 1168 GWLHYIIPQLYWQFDHSTARFADLVDWWVDVVKDTNVDLIIGQGFYRYAENSN------N 1221 Query: 391 NGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQ--QAVSYLQSRWGS 439 E +QL EI G+ +F LN A++ L+ + + Sbjct: 1222 WTNESEFLEQLRYMSQYDEIIGSSIFSYKTLNSNHALVSAALARLEGHYWT 1272 Score = 221 bits (562), Expect = 5e-56, Method: Composition-based stats. Identities = 102/409 (24%), Positives = 166/409 (40%), Gaps = 43/409 (10%) Query: 37 VTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLD 96 T A + P S R W+ + S + + + L+ Sbjct: 19 PTNEAHAYPEPEVLTSPGEFRATWIT----------HFIGSMPAYSTEQDFKSEVNSILN 68 Query: 97 HLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKV 156 +++ +N + AL+ S+I P + + +DP+ + ++EAHKRG++ Sbjct: 69 NMEANNLNVAIVHFRTHNNALYKSEINPVASWFATVDFD--VFDPMAYFIEEAHKRGIEF 126 Query: 157 HAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWI 216 HAW NPYRV + GTI N PA++ G +L+P +P V++ + Sbjct: 127 HAWLNPYRVLSTYQRGTIPASNP--QSNPANLLSNK------EGTAHILNPALPVVREHV 178 Query: 217 TSIVAEVVSRYPVDGVQFDDYFYTESPGS---RLNDNETYRKYGGAFAS----KADWRRN 269 + + E++ Y VD + FDDYFY E D + + K++WRR Sbjct: 179 VNTILEIIENYNVDAIHFDDYFYMEMNNGGILNDPDQALFLSNPLGQPNTVAGKSNWRRT 238 Query: 270 NTQQLIAKVSHTIK----SIKPGVEFGVSPAGVWRN----------RSHDPLGSDTRGAA 315 I + S IK + V+FG+SP G++RN GS T+G Sbjct: 239 QINTFIEQASQAIKDFNQANNRYVQFGISPTGIYRNGDGEVTYDQDGKPITNGSKTQGQE 298 Query: 316 AYDES-YADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIA 374 Y +ADT W+ +G LDYI PQ YW + S A +D + WW VV+ LY GI Sbjct: 299 HYASYLFADTVHWISEGWLDYILPQSYWASTHSLAGFDKVMGWWDKVVRYLDVNLYSGIG 358 Query: 375 FYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNK 423 Y + E Q++ ++ G ++ + + Sbjct: 359 LYLADAGIA-SNVYSWRDNPEEFSNQMEFLHSLESNQGFSIYSYNMIRD 406 >UniRef50_Q8YW40 All1776 protein n=5 Tax=Nostocaceae RepID=Q8YW40_ANASP Length = 669 Score = 324 bits (830), Expect = 4e-87, Method: Composition-based stats. Identities = 133/432 (30%), Positives = 207/432 (47%), Gaps = 37/432 (8%) Query: 15 PAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSS 74 A ++ AL+ + P P ++ RG W+ TV DWP + Sbjct: 177 VATIIYQALVYLGQAEKIASVYLVIPPKPTLPTVRVSHNREFRGAWITTVWNSDWPSKAG 236 Query: 75 VNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIG 134 + QQ ++ L LQ+L N V QV+P+G AL+ S++ PWS +TG G Sbjct: 237 L-------SVAQQQAELVAILTRLQQLNFNAVILQVRPEGDALYASELEPWSAWLTGTPG 289 Query: 135 --ENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQH 192 P YDPLQF + EAHKR ++VHAWFNPYR +T+ + V + Sbjct: 290 KAPEPFYDPLQFAIAEAHKRNLEVHAWFNPYRAKTSTRSAPNVRP---------HISVTN 340 Query: 193 RDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESP-GSRLNDNE 251 + + G++ +DPGI VQD +++ +VV RY +D V DDYFY G D++ Sbjct: 341 PEVVYQWGNQLWMDPGIKIVQDRAYNVIIDVVRRYDIDAVHLDDYFYPYPIQGQAFPDDK 400 Query: 252 TYRKYG--GAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGS 309 TY Y G S DWRR N Q++ ++S IK+ K V+FG+SP G++R + Sbjct: 401 TYAAYKSAGGQLSLNDWRRQNVDQMVLRLSQGIKATKSYVKFGISPFGIYRPGQPPGI-- 458 Query: 310 DTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRL 369 G AY YAD ++W+EQG +DY+APQ+YW ++ Y VL KWW ++ R + Sbjct: 459 --TGLDAYSVLYADAKKWLEQGWVDYLAPQLYWRTDQTNQSYPVLLKWWTEI-NSKRRHI 515 Query: 370 YIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPE--ISGTILFREDYLNKPQTQ 427 Y G ++ + E++KQ+ ++ G I F + + + Sbjct: 516 YAGNNIGQLDGKA---------WKNEEIEKQVKISRNQAGELSLGNIFFSVSSIIENRQD 566 Query: 428 QAVSYLQSRWGS 439 + ++ S + + Sbjct: 567 ISTTFQNSLYTT 578 >UniRef50_UPI00016C0313 cell surface protein n=1 Tax=Epulopiscium sp. 'N.t. morphotype B' RepID=UPI00016C0313 Length = 539 Score = 323 bits (828), Expect = 7e-87, Method: Composition-based stats. Identities = 147/439 (33%), Positives = 219/439 (49%), Gaps = 40/439 (9%) Query: 7 NKKLTIRRP--AILVALALLLCSCKSTPPESMVTPPAG---SKPPATTQQSSQPMRGIWL 61 NKK +I+ A +LL K P + P + ++ +R +W+ Sbjct: 2 NKKFIAWILGGSIVTAAVILLLVPKELPSMKFIPNKNSLNKKNPITPLRPQNEEVRAVWI 61 Query: 62 ATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSK 121 +++ LD+P S + QQ I LD LQ +G NTV QV+P AL+ S Sbjct: 62 SSIWGLDFP-----YNSINRNNPAAQQAEFISYLDELQEIGFNTVMVQVRPSADALYKSA 116 Query: 122 ILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTL 181 I PW+ ++TG G++PGYDPL FM+D+ HKRGMK+HAW NPYRV+ K +++ + Sbjct: 117 INPWAAILTGTQGQDPGYDPLAFMIDQTHKRGMKLHAWINPYRVTTAGKG-----IDTLV 171 Query: 182 SQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTE 241 + PA + + D + + + P + V+ I V E+V+ Y VDG+ DDYFY Sbjct: 172 ATHPARL---NPDMLISHKNALYYXPELDAVKSHIEETVKEIVTNYSVDGIHMDDYFYPA 228 Query: 242 SPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRN 301 ++ A RRN+ ++ ++ IK IKP VEFG+SP G+W++ Sbjct: 229 WYPLPAGED--------GNGKTATTRRNHVNDMVKRIHTAIKQIKPNVEFGISPIGIWKD 280 Query: 302 RSHDPLGSDTR-GAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWAD 360 D GS+T G +Y YADTR W++ +DY+ PQIYW A Y+VL KWWA+ Sbjct: 281 SITDITGSETSAGWNSYYAVYADTRAWIQNEWIDYVVPQIYWEIDNPVASYEVLVKWWAE 340 Query: 361 VVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDY 420 VK T LYIG YK + E+ Q+ LND PEI G++ F Sbjct: 341 EVKNTNVDLYIGQGIYK-------------DAVAEEITTQILLNDLYPEIKGSVXFAISD 387 Query: 421 LNKPQTQQAVSYLQSRWGS 439 + + T L++ +G+ Sbjct: 388 IIRKNTGNVRGQLEALFGT 406 >UniRef50_C3XYE7 Putative uncharacterized protein n=2 Tax=Branchiostoma floridae RepID=C3XYE7_BRAFL Length = 576 Score = 322 bits (824), Expect = 2e-86, Method: Composition-based stats. Identities = 130/404 (32%), Positives = 195/404 (48%), Gaps = 38/404 (9%) Query: 43 SKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLG 102 + P T S+ RG+W+ATVS +DWP + Q+ ++ LD L Sbjct: 111 TGAPTITPSPSREFRGVWVATVSNIDWPSSRHL-------STEQQKAELVTILDRTVELN 163 Query: 103 INTVFFQVKPDGTALWPSKILPWSDLMTGKIG--ENPGYDPLQFMLDEAHKRGMKVHAWF 160 +N + FQV+P G A + S++ PWS + G+ G P YDPL F ++E+H+RG+++HAWF Sbjct: 164 LNAIVFQVRPAGDAFYDSQLEPWSYYLAGQHGSAPTPFYDPLAFAIEESHRRGIELHAWF 223 Query: 161 NPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIV 220 NPYR ++ + + + G+ +DPG V D ++ Sbjct: 224 NPYRAKTKAAGYSLAS---------NHMAKRFPQYAYDYGNYIWMDPGAQVVADHTYDVI 274 Query: 221 AEVVSRYPVDGVQFDDYFYTES-PGSRLNDNETYRKYG--GAFASKADWRRNNTQQLIAK 277 +VV RY VDG+ FDDYFY G D TY+ Y G SKADWRR+N +L+ + Sbjct: 275 IDVVRRYDVDGIHFDDYFYPYPVSGVDFPDTATYQAYQTSGGTMSKADWRRDNVNRLVRR 334 Query: 278 VSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIA 337 ++ I + K V+FG+SP G+WR +P G G + YD YAD + W+EQGL+DY+A Sbjct: 335 LNSGIHAEKSHVKFGISPFGIWR--PGNPAG--IVGFSQYDSLYADPKFWLEQGLVDYLA 390 Query: 338 PQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPEL 397 PQ+YW Y L WW D P + +Y G ++ V EL Sbjct: 391 PQLYWMIDPPQQSYPALLDWWLDQ-NPLQRHVYTGNYLSRILTDG---------WPVSEL 440 Query: 398 KKQLDLNDAVPE--ISGTILFREDYLNKPQTQQAVSYLQSRWGS 439 Q+ L+ + G I+F + V +S+ + Sbjct: 441 VNQVSLSRDRADRLSLGNIMFSMKPFRD-NSDGVVDAFKSQVYT 483 >UniRef50_B4VZ35 Putative uncharacterized protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VZ35_9CYAN Length = 665 Score = 320 bits (821), Expect = 5e-86, Method: Composition-based stats. Identities = 123/426 (28%), Positives = 206/426 (48%), Gaps = 37/426 (8%) Query: 19 VALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNIS 78 VA + + +++ + + + RG+W+ATV +DWPP + Sbjct: 177 VAAFIYQALVYTGRLDAIGSDYIIVRQKTLKLSHQREFRGVWVATVWNIDWPPQRGL--- 233 Query: 79 NPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGE--N 136 QQ+ ++ +D + L +N + QV+P G A + S++ PWS+ +TG G+ + Sbjct: 234 ----SVAQQQRELLQIIDRMAELQLNALILQVRPTGDAFYASELEPWSEWLTGVQGQAPD 289 Query: 137 PGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWI 196 P YDPL+F + H+R +++HAWFNPYR ++ + V H +++ Sbjct: 290 PYYDPLEFAIAACHQRNIELHAWFNPYRAKTSSHSSASVAP---------HISVTHPEYV 340 Query: 197 RTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTES-PGSRLNDNETYRK 255 G++ +DPG+ VQD +++ +VV RY VDG+ DDYFY G D++TY Sbjct: 341 YKYGNQQWMDPGVKVVQDLTYNVIMDVVRRYDVDGIHLDDYFYPYPIAGEDFPDDKTYNA 400 Query: 256 YG--GAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRG 313 Y G S +DWRR N Q++ ++ I++ K V+FG+SP G++R +G Sbjct: 401 YQAEGGTLSLSDWRRENVNQMVQRLYKGIQATKKQVKFGISPFGIYRPGQPP----QIKG 456 Query: 314 AAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGI 373 Y+ YAD ++W+E G +DYIAPQ+YW A Y VL +WW D P + +Y G Sbjct: 457 LDQYESLYADPKKWLEAGWIDYIAPQLYWRIDPPAQSYPVLLEWWTDN-NPKQRHIYPGN 515 Query: 374 AFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPE--ISGTILFREDYLNKPQTQQAVS 431 + + DW E ++Q+D+ + G I + N+ + Sbjct: 516 RLSMLD-----DKDWSFL----EYERQVDITRNLAPQLSLGNIFYNMKVFNENRFGVVEK 566 Query: 432 YLQSRW 437 + S + Sbjct: 567 FQSSVY 572 >UniRef50_B4D6Q1 Putative uncharacterized protein n=2 Tax=Verrucomicrobia RepID=B4D6Q1_9BACT Length = 388 Score = 318 bits (815), Expect = 2e-85, Method: Composition-based stats. Identities = 124/390 (31%), Positives = 187/390 (47%), Gaps = 37/390 (9%) Query: 51 QSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQV 110 + RG W+ATV LDWP + + Q+ + D D Q+L +N + QV Sbjct: 19 AAQPEFRGAWVATVFNLDWPSKAGL-------SEAEQKAQLRDIFDRAQQLKLNAILLQV 71 Query: 111 KPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTK 170 + A + S+ PWS +TGK G +PGYDPL + + EAH RG+++HAWFNP+R Sbjct: 72 RSMSDACYASRREPWSTFLTGKQGVDPGYDPLAYAITEAHARGIELHAWFNPFRAGTKGG 131 Query: 171 PGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVD 230 V H +WIR G + LDPG P + ++ ++ +VV RY +D Sbjct: 132 SSCAA----------NHVTRAHPEWIRPYGSQLWLDPGDPNARRYVLDVILDVVKRYDID 181 Query: 231 GVQFDDYFYTES-PGSRLNDNETYRKYG-GAFASKADWRRNNTQQLIAKVSHTIKSIKPG 288 GV DDYFY G+ D+ T++KYG S+ADWRR+N + + + H +K+ KP Sbjct: 182 GVHIDDYFYPYPVKGAEFPDDVTWQKYGMAGGKSRADWRRDNINRFVEAMYHEVKAAKPS 241 Query: 289 VEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSA 348 V G+SP G+WR + + + AY + YAD R W+ +G DY+APQ+YW Sbjct: 242 VRVGISPFGIWRPKVPATIEAQ---LDAYAQLYADARYWLSEGWCDYLAPQLYWGIHPDK 298 Query: 349 ARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVP 408 + VL WW ++ GIA ++G+P V E+ +Q++L Sbjct: 299 QSFPVLLNWWRQQST-AGRPVWPGIATERIGKP----------YDVGEIARQIELTRQSL 347 Query: 409 EIS---GTILFREDYLNKPQTQQAVSYLQS 435 + G I + L L+ Sbjct: 348 PANGEPGNIQWSMKALMH-NQGGVADLLKR 376 >UniRef50_UPI0001C160EA conserved hypothetical protein n=2 Tax=Nostocaceae RepID=UPI0001C160EA Length = 668 Score = 318 bits (814), Expect = 3e-85, Method: Composition-based stats. Identities = 130/432 (30%), Positives = 210/432 (48%), Gaps = 41/432 (9%) Query: 15 PAILVALALL-LCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVS 73 A+++ A++ L + +VT P G K + Q + RG W+ V DWP Sbjct: 177 VAVMMYQAIVHLGKMQKINSPYIVTLPLGVKTVKVSHQ--REFRGAWITVVWNSDWPSKP 234 Query: 74 SVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKI 133 + Q+ +++ + LQ N + QV+P+G A++ S I PWS MTG Sbjct: 235 GL-------SVEQQKTELLEIIKQLQSFNFNALILQVRPEGDAVYASPIEPWSAWMTGTQ 287 Query: 134 GENPG--YDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQ 191 G+ P YDPL+F ++E HKR ++VHAWFNPYR TK G+ + + Sbjct: 288 GKAPEPIYDPLEFAIEECHKRNIEVHAWFNPYRAKTTTKSGSNVSP---------HIAIT 338 Query: 192 HRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTES-PGSRLNDN 250 + + + G++ +DPG VQD +++ +V++RY VDG+ DDYFY G D Sbjct: 339 NPEVVYRWGNQLWMDPGAKIVQDRAYNVIIDVLTRYDVDGIHLDDYFYPYPISGQSFPDE 398 Query: 251 ETYRKY--GGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLG 308 +TY Y G S DWRR N Q++ ++S IK IK V+FG+SP G++R + Sbjct: 399 KTYSAYKNSGGKLSVEDWRRENVNQMVWRLSEGIKKIKAHVKFGISPFGIYRPGQPAGIV 458 Query: 309 SDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTR 368 G Y YAD+++W+++G +DY+APQ+YW ++ Y+ L KWW + V + Sbjct: 459 ----GLDPYSVLYADSKKWLQEGWIDYLAPQLYWRTDQTQQSYETLLKWWTE-VNTKQRH 513 Query: 369 LYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPE--ISGTILFREDYLNKPQT 426 +Y G ++ E++KQ+ ++ + E G I F L + + Sbjct: 514 IYAGNNLGQLDGKV---------WKNSEIEKQIVISRNLAENFSLGNIFFSMKSLAENR- 563 Query: 427 QQAVSYLQSRWG 438 Q + + Sbjct: 564 QGIGDQFKQVYY 575 >UniRef50_C1A9I5 Putative uncharacterized protein n=1 Tax=Gemmatimonas aurantiaca T-27 RepID=C1A9I5_GEMAT Length = 534 Score = 317 bits (812), Expect = 5e-85, Method: Composition-based stats. Identities = 123/399 (30%), Positives = 185/399 (46%), Gaps = 38/399 (9%) Query: 47 ATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTV 106 A + RG+W+A+V+ +DWP +++ + QQ ++ LD L +N V Sbjct: 38 AEPPPVLREFRGVWVASVANIDWPSKRTLSTAE-------QQAELLALLDRAAELKLNAV 90 Query: 107 FFQVKPDGTALWPSKILPWSDLMTGKIG--ENPGYDPLQFMLDEAHKRGMKVHAWFNPYR 164 FQV+P AL+ S I PWS+ +TG G P +DPL F++ EAH RGM++HAWFNPYR Sbjct: 91 IFQVRPAADALYESSIEPWSEYLTGAQGRRPEPFWDPLAFVIREAHARGMELHAWFNPYR 150 Query: 165 VSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVV 224 + + + + ++ +DPG P V+ +V +VV Sbjct: 151 ARHTDARSPLAR---------SHIARTNPALVKPYAGYLWMDPGEPAVRARTLRVVLDVV 201 Query: 225 SRYPVDGVQFDDYFYTESPGSR------LNDNETYRKYG--GAFASKADWRRNNTQQLIA 276 RY +DGV DDYFY R D ++ +Y G +++DWRR+N +L+ Sbjct: 202 KRYDIDGVHIDDYFYPYPENDRRGRAIAFPDTRSWTRYQKSGGKLTRSDWRRDNVNKLVE 261 Query: 277 KVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYI 336 ++ I KP V FG+SP G+WR RG AY++ YAD R+W+ +G LDY Sbjct: 262 ELYDGIHKTKPWVRFGISPFGIWRPGFP----EQIRGLDAYEKLYADARKWLHEGWLDYF 317 Query: 337 APQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPE 396 PQ+YWP ++ Y VL WWA L+ G + G V E Sbjct: 318 TPQLYWPTTKREQAYPVLLDWWATE-NKRARHLWPGNFTSRAGGRGSGA------FSVAE 370 Query: 397 LKKQLDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQS 435 L +Q+ + SG + F L++ Sbjct: 371 LMEQIRVTRLNAAASGNVHFSMKSFL-INQAGMNDTLRA 408 >UniRef50_A5FAG6 Putative uncharacterized protein n=1 Tax=Flavobacterium johnsoniae UW101 RepID=A5FAG6_FLAJ1 Length = 493 Score = 315 bits (807), Expect = 2e-84, Method: Composition-based stats. Identities = 137/393 (34%), Positives = 204/393 (51%), Gaps = 31/393 (7%) Query: 50 QQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQ 109 + + MR W++TV +DWP + + + MI LD+L+ +NTV FQ Sbjct: 21 ESPKREMRAAWISTVDNIDWPSKPGL-------SDKQMKSEMIAILDNLRSNNLNTVIFQ 73 Query: 110 VKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNT 169 ++P A + S P S +TG G PG+DPLQ M+DEA KRGM VH W NPYRV +T Sbjct: 74 IRPTADAYYKSTKEPASHWITGTQGVAPGFDPLQMMIDEAGKRGMNVHVWLNPYRVQKDT 133 Query: 170 KPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPV 229 + + +Y + + T G +PG E +D+++S+V E+V Y + Sbjct: 134 VKDVLTKT---------HLYFKKPELFLTYGKSRYFNPGYKETRDFVSSVVGEIVRNYDI 184 Query: 230 DGVQFDDYFYTES-PGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPG 288 V DDYFY G D + + K F K DWRR+N +I ++ TI + KP Sbjct: 185 QAVHMDDYFYPYKIAGQEFPDEKAFAKEPRQFKDKDDWRRDNVDLIIKQIRDTIIANKPE 244 Query: 289 VEFGVSPAGVWRNRSHDPLGSDT-RGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRS 347 VEFG+SP GVWRN + D GS+T GA YD+ YA+ +W ++ +DY+ PQ+YW Sbjct: 245 VEFGISPFGVWRNIAKDSDGSNTVAGATNYDDLYANILKWQKENWIDYVTPQLYWHIGFD 304 Query: 348 AARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAV 407 A ++VLAKWWA T +Y+G YK+ + EP+W ++ KQ+++ + Sbjct: 305 RANFEVLAKWWA--AHKYGTNVYVGHGDYKI-SNTAKEPEWR---SPDQIVKQIEMIRKL 358 Query: 408 PEISGTILFRE-------DYLNKPQTQQAVSYL 433 P+I G++ F D L P Q+ Y+ Sbjct: 359 PQIDGSMHFTASTFLKKGDTLRNPLIQKPYKYI 391 >UniRef50_Q7MXU6 YngK protein n=4 Tax=Porphyromonadaceae RepID=Q7MXU6_PORGI Length = 512 Score = 310 bits (794), Expect = 6e-83, Method: Composition-based stats. Identities = 136/442 (30%), Positives = 205/442 (46%), Gaps = 54/442 (12%) Query: 1 MDICSRNKKLTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIW 60 M CSR K LT + + L C K P + R W Sbjct: 1 MYHCSR-KSLTFFLALLFCVMVLFSCGTKRKLP-----------SQVHADYPKREFRAAW 48 Query: 61 LATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPS 120 + TV + ++ ++ +I +LD L+ G N + FQ++P+ A + S Sbjct: 49 IQTVYQGEYAR----------LSPAEARRLLIGRLDKLKEAGCNAIIFQIRPESDAWYES 98 Query: 121 KILPWSDLMTGKIGE--NPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELN 178 I PWS +TG+ G+ P +DPL FM+ E HKRGM++HAW NPYR S + G Sbjct: 99 AIEPWSRFLTGRQGQAPTPFWDPLAFMVSECHKRGMELHAWINPYRASTSGTAGLA---- 154 Query: 179 STLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYF 238 P Y ++ W T ++ DPG+P+ + +I IV ++ RY +D + DDYF Sbjct: 155 ------PNHPYHRYPQWFVTYNNQLYYDPGVPDCRAYICRIVRDITMRYDIDAIHMDDYF 208 Query: 239 YTES-PGSRLNDNETYRKYGGA--FASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSP 295 Y G+ D++++R+YG F +K DWRR N +L+ ++ TI KP V FG+SP Sbjct: 209 YPYPVAGAAFPDDDSFRRYGQGYTFQTKGDWRRENVNKLVHEIKQTILQSKPWVRFGISP 268 Query: 296 AGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLA 355 G++RN+ P GS+T G YD+ YAD W ++G +DY+ PQIYW AA Y LA Sbjct: 269 FGIYRNKRTSPSGSETAGLQNYDDLYADVLLWQKRGWIDYVIPQIYWEIGHKAADYATLA 328 Query: 356 KWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTIL 415 +WW LY G D +L ++ + +V G L Sbjct: 329 EWWGRNSVGA-AHLYFGQ-------------DVKRTMTARQLASKMRMQRSV--AVGHAL 372 Query: 416 FREDYLNKPQTQQAVSYLQSRW 437 + + T+ L+SR+ Sbjct: 373 WPAGEVV-NNTEGVADSLRSRY 393 >UniRef50_D0GIS1 YngK n=16 Tax=Bacteria RepID=D0GIS1_9FUSO Length = 330 Score = 307 bits (787), Expect = 4e-82, Method: Composition-based stats. Identities = 142/343 (41%), Positives = 192/343 (55%), Gaps = 23/343 (6%) Query: 94 KLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRG 153 L+++++ +N VF Q+KP G A +PSK PWS+ +TG GENPGYDPL+FM++EAHKR Sbjct: 1 MLENVKKWNMNAVFVQIKPVGDAFYPSKYAPWSEYLTGVQGENPGYDPLKFMIEEAHKRN 60 Query: 154 MKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQ 213 ++ HAWFNPYR+++ + N + + +W G + L+PGIPEV Sbjct: 61 IEFHAWFNPYRLTMGGGREKLSRDN---------IGNKRPEWTVMYGGKLYLNPGIPEVN 111 Query: 214 DWITSIVAEVVSRYPVDGVQFDDYFYTES-PGSRLNDNETYRKYGGAFASKADWRRNNTQ 272 D++ + EVV +Y VDGV DDYFY G D++ YRKYGG F++ DWRRNN Sbjct: 112 DYVVDSIVEVVKKYDVDGVHMDDYFYPYKVKGQEYPDSQQYRKYGGKFSNIGDWRRNNIN 171 Query: 273 QLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDP--LGSDTRGAAAYDESYADTRRWVEQ 330 +LI K+ ++IK V FG+SP GVWRN S DP G YD+ YAD W+++ Sbjct: 172 KLIEKLHNSIKKENKNVSFGISPFGVWRNASTDPVRGSQTQAGVQNYDDLYADILYWMDK 231 Query: 331 GLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMI 390 +DY+APQIYW A Y L WW+ T T LYIG A YKV + Sbjct: 232 HWIDYVAPQIYWVRGFKVADYSTLINWWSKYAGKTNTDLYIGHAAYKVND---------- 281 Query: 391 NGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAVSYL 433 EL +Q+ LN PEI G+I F L P + + L Sbjct: 282 WSNPNELVEQVKLNRKYPEIKGSIFFSYKSLV-PNPKNVTNNL 323 >UniRef50_Q110S6 Putative uncharacterized protein n=5 Tax=Bacteria RepID=Q110S6_TRIEI Length = 521 Score = 305 bits (781), Expect = 2e-81, Method: Composition-based stats. Identities = 126/436 (28%), Positives = 203/436 (46%), Gaps = 42/436 (9%) Query: 8 KKLTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRL 67 K + RR +L + L+L + M + S+ + S + RG+W+A+V+ + Sbjct: 26 KNIHFRRKNLLWSCFLILGLTLT----QMSSYLPTSRAQQPSSFSPREFRGVWVASVANI 81 Query: 68 DWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSD 127 DWP P Q+ +++ L+ +Q L +N + QV+P+G A + S I PWS Sbjct: 82 DWPS-------QPGLPVTQQKTELLNILNRMQELNLNALVLQVRPNGDAFYNSTIEPWSG 134 Query: 128 LMTGKIGENP--GYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQP 185 +TGK G P YDPL+F + E+HKR +++HAWFNPYR ++ G+ Sbjct: 135 WLTGKQGTPPQPYYDPLEFAIAESHKRNIELHAWFNPYRAQLSPNDGSFAS--------- 185 Query: 186 ASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTE-SPG 244 V++ + G LDPG VQD + + +VV RY +D V FDDYFY G Sbjct: 186 NHAAVKYPQYAYRYGKYVWLDPGAKVVQDQTFNTIIDVVRRYDIDAVHFDDYFYPYPQGG 245 Query: 245 SRLNDNETYRKY--GGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNR 302 D +TY Y G S ++WRR N ++ ++ I + KP V+FG+SP G++R Sbjct: 246 QEFPDYQTYNSYKASGGTLSLSNWRRQNVNNMVERLYQGIHAEKPYVKFGISPFGIYRPG 305 Query: 303 SHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVV 362 + + G Y+ YAD + W+ +G +DY+APQ+YW Y VL WW Sbjct: 306 NPPGIV----GLDQYESLYADVKLWLAKGWVDYLAPQLYWRIDPPKQSYPVLLNWWLQQ- 360 Query: 363 KPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPE--ISGTILFREDY 420 P R +Y G ++ V E ++Q+ ++ G I + Sbjct: 361 NPQRRHIYAGNFLSQLQVSG---------WPVSEFERQVAISRQRASQLSLGNIFYSMK- 410 Query: 421 LNKPQTQQAVSYLQSR 436 + + + ++ Sbjct: 411 MFRDNVAGVNNVFKNY 426 >UniRef50_B9XM08 Putative uncharacterized protein n=2 Tax=bacterium Ellin514 RepID=B9XM08_9BACT Length = 523 Score = 305 bits (781), Expect = 2e-81, Method: Composition-based stats. Identities = 125/414 (30%), Positives = 196/414 (47%), Gaps = 40/414 (9%) Query: 35 SMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDK 94 + A S++ R +W+AT+ +DWP + Q+ ++ Sbjct: 42 PLPAAVVYIPSTAQPPASNREFRAMWIATMVNIDWPSKPGL-------PVPQQKAELLAI 94 Query: 95 LDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIG--ENPGYDPLQFMLDEAHKR 152 LD +L +N V FQV+P A++ S I PWS +TG +G P YDPL F ++EAHKR Sbjct: 95 LDCAVKLNLNAVIFQVRPGSDAMYASSIEPWSYYLTGAMGKAPAPFYDPLAFAVEEAHKR 154 Query: 153 GMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEV 212 G+++HA+FNP+R + +K + D +R G+ LDPG E Sbjct: 155 GLELHAYFNPFRAAQPSKKWQFSS---------NHISRTRPDLVRQYGNLLWLDPGEREA 205 Query: 213 QDWITSIVAEVVSRYPVDGVQFDDYFYTESPGS------RLNDNETYRKY--GGAFASKA 264 QD + +V +VV+RY +D V FDDYFY D++T++++ GG S+ Sbjct: 206 QDHVLKVVMDVVNRYDIDAVHFDDYFYPYKQQDARNRDIDFPDSKTWKRFVAGGGKLSRD 265 Query: 265 DWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADT 324 DWRR N + +V +I + KP V+FG+SP G+W+ +G AYD YAD+ Sbjct: 266 DWRRENINSFVHRVHDSIHAAKPWVKFGISPFGIWQPGYPP----QVKGLNAYDSIYADS 321 Query: 325 RRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKI 384 R+W+ G +DY++PQ+YW + VL KWW + ++ GIA KVG K Sbjct: 322 RKWLMNGWVDYLSPQLYWAVESPGQSFPVLLKWWLEQ-NSKNRNVWPGIASEKVGRTWKA 380 Query: 385 EPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQSRWG 438 + E + +G L+ + L + + A + Q + Sbjct: 381 NEIIRQIQIIRE---------QAGDRAGEALYSAEGLVQNRGGLASALAQGVYS 425 >UniRef50_C3J8B5 YngK protein n=2 Tax=Bacteria RepID=C3J8B5_9PORP Length = 535 Score = 305 bits (781), Expect = 2e-81, Method: Composition-based stats. Identities = 120/454 (26%), Positives = 196/454 (43%), Gaps = 50/454 (11%) Query: 1 MDICSRNKKLTIRRPAILVALA--LLLCSCKSTPPESMVTPPAG-------------SKP 45 M SR+ R L LLL + V A Sbjct: 1 MQHHSRSFLFAHRLTQCFSLLIGVLLLSGVSACSSRKKVVSTAPLPAPPTPRIEVEIPVE 60 Query: 46 PATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINT 105 +++ +RG+WL T+ LDWP +V+ + S Q++ + LD L NT Sbjct: 61 KPVIPSNAEAIRGVWLTTIYGLDWPSRRAVSTQDMVS----QRKELCRILDRLAESHFNT 116 Query: 106 VFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRV 165 VFFQV+ G ++PSKI P + TG YDPLQF ++E HKRG+ +HAW + + Sbjct: 117 VFFQVRHRGDVIYPSKIEPRVTVFTGGRNNYLDYDPLQFAIEECHKRGLSIHAWIVTFPL 176 Query: 166 SVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVS 225 + ++ + SV+ +HRDW T + + L+PG PE + +ITS+V E+V Sbjct: 177 GNTSHVQSLGD---------NSVWKKHRDWCFTLHNDWYLNPGHPEARSYITSVVREMVE 227 Query: 226 RYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSI 285 RY +DGV FD Y + + D Y +YG S +WR +N + +VS + S+ Sbjct: 228 RYDLDGVHFDYVRYPDKMREKE-DQNLYMRYGKG-RSLGEWRTSNISAFLKEVSTEVCSV 285 Query: 286 KPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFS 345 KP + +P G R P G A + + D W +G +D+I P +Y+ + Sbjct: 286 KPHMLVSAAPLGKLRVLPSMP----NVGWTARESVFQDPAAWYREGSVDFIVPMMYYRDN 341 Query: 346 RSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLND 405 L W A + + G+A Y+ + SK +++ Q++ ++ Sbjct: 342 ---LFEPFLVDWKAQI---PGLPIVPGLAPYRTEDESKW--------TARDIENQMNASE 387 Query: 406 AVPEISGTILFREDYLNKPQTQQAVSYLQSRWGS 439 + ++G +RE + P + + S Sbjct: 388 RM-GMAGICFYRELNIR-PNRNGVDRVITRHFIS 419 >UniRef50_D1N426 Putative uncharacterized protein n=1 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1N426_9BACT Length = 450 Score = 305 bits (780), Expect = 3e-81, Method: Composition-based stats. Identities = 148/401 (36%), Positives = 208/401 (51%), Gaps = 35/401 (8%) Query: 51 QSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQV 110 Q ++ MRG+W+ATV +D+ + A ++ I +++LQR N +FFQV Sbjct: 58 QRAREMRGVWVATVENIDFGRHT---------DAAGFKRDFIAVVNNLQRAKFNAIFFQV 108 Query: 111 KPDGTALWPSKILPWSDLMTGKIGEN-PGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNT 169 +P A +PSK PWS MTGK G+ P +DPL FM+ EAHKRG++ HAW NPYRV+ Sbjct: 109 RPMCDAFYPSKHNPWSRWMTGKEGQAIPNFDPLAFMVAEAHKRGLEFHAWLNPYRVNAGA 168 Query: 170 KPGTIRELNSTLSQQPASVYVQHRDWIRTS-----GDRFVLDPGIPEVQDWITSIVAEVV 224 + G L + ++ S ++ + S L+PG P V I +AE++ Sbjct: 169 QVGKTAYLKTLDNK---SFAKRNPGLVLESKLASGRYSLFLNPGEPRVVRHIADTIAEIL 225 Query: 225 SRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKS 284 YPVD + FDDYFY S + D+ ++++ S +WRR N + I V T+ + Sbjct: 226 ENYPVDAIHFDDYFYLYSDIGTI-DSASFQRNNPGRLSLEEWRRGNVDKAIYTVKKTVDA 284 Query: 285 IKP----GVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQI 340 V FGVSP G+W N+ +P GS T G +Y YADTR WV +G +DYI PQ+ Sbjct: 285 YNRRSGRKVAFGVSPFGIWANKKSNPNGSLTGGKQSYYAQYADTRGWVRKGWVDYIIPQL 344 Query: 341 YWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQ 400 YWPFS A Y LA WW+D VK TR RL+IG Y+VG +P EL Q Sbjct: 345 YWPFSHEVAAYAALADWWSDAVKGTRVRLFIGQGLYRVGAERIWQPR--------ELVDQ 396 Query: 401 LDLNDAVPEISGTILFREDYLNKPQTQQ----AVSYLQSRW 437 + N + + GT++F + P Q LQ W Sbjct: 397 MRYNQMLFNVDGTVIFSYRNVFMPGNGQMKEAVSRILQGCW 437 >UniRef50_B0NT08 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=B0NT08_BACSE Length = 486 Score = 302 bits (774), Expect = 1e-80, Method: Composition-based stats. Identities = 119/433 (27%), Positives = 198/433 (45%), Gaps = 42/433 (9%) Query: 10 LTIRRPAILVALALLLCSCKSTPPE--SMVTPPAGSKPPATTQQSSQPMRGIWLATVSRL 67 + + ++ L +L+ +C + G +++ + +RG+W+ATV L Sbjct: 1 MKHLKYIYIIILLVLVAACGKDDEGILDDGSHSQGEGQTSSSVLPGKELRGVWIATVWGL 60 Query: 68 DWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSD 127 DWP A VQ++ D LD L +N VFFQ++ A + S+ PWS Sbjct: 61 DWPMEK--------YDADVQKKLYTDYLDLLVGYNMNAVFFQIRGMADAFYESEYEPWSK 112 Query: 128 LMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPAS 187 +TG G P YD L F+++EAHKRG++ HAW NPYR++ + P Sbjct: 113 YITGSAGVRPDYDVLGFLVEEAHKRGIQFHAWLNPYRIATRANKN---------AAFPKL 163 Query: 188 VYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSR- 246 + ++ V +P +PEVQ+ I +IV E++++Y VDG+ DDYFY S Sbjct: 164 DAKIPMELVKDYEKIRVYNPALPEVQERIVNIVKEIITKYDVDGIHMDDYFYPSLEASET 223 Query: 247 LNDNETYRKYGGA-FASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHD 305 +ND ++KYG F + D+RRNN ++ + TI +P V F +SPA Sbjct: 224 MNDGAEFQKYGKDKFKNVEDFRRNNVNTVVRNIQKTIIETRPEVIFSISPAADMER---- 279 Query: 306 PLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPT 365 Y+ +AD W ++G +D + PQ+Y+ A +++ W+ Sbjct: 280 ----------NYNTLFADVNTWAKEGWVDVVIPQLYFATGNDATSFNLRLDLWSQYT--Y 327 Query: 366 RTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQ 425 L IG YK G+ +L KQ +L A P++ G++L+ L + + Sbjct: 328 ENHLLIGYGIYKFGDSQYGSK----FQSSDDLMKQFELASAKPKVKGSVLYSAKNLVENK 383 Query: 426 TQQAVSYLQSRWG 438 +++ +G Sbjct: 384 V-GIADAVKAIYG 395 >UniRef50_B7AM83 Putative uncharacterized protein n=1 Tax=Bacteroides eggerthii DSM 20697 RepID=B7AM83_9BACE Length = 489 Score = 302 bits (773), Expect = 2e-80, Method: Composition-based stats. Identities = 128/434 (29%), Positives = 189/434 (43%), Gaps = 42/434 (9%) Query: 10 LTIRRPAILVALALLLCSCKSTPP----ESMVTPPAGSKPPATTQQSSQPMRGIWLATVS 65 + + LVAL SC TPP P + +RG W+ TV Sbjct: 1 MKYLKYITLVALLAFAVSCSKDDDGENMPPEPTPPVEKPEPQAVTLPQKELRGAWITTVW 60 Query: 66 RLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPW 125 +DWP A QQ+ D LD L +N VFFQ++ A + S+ W Sbjct: 61 GIDWPM--------EDYNAATQQKKYTDYLDLLVANNMNAVFFQIRGMADAFYESQYESW 112 Query: 126 SDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQP 185 S +TG G+NPGYD L F+++EAHKRG++ HAW NPYR+S S Sbjct: 113 SKNITGTAGKNPGYDVLGFLVEEAHKRGLQFHAWMNPYRISTRASKN---------SSFA 163 Query: 186 ASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTE-SPG 244 W + + +P +PEVQ I IV E++++Y VDG+ DDYFY G Sbjct: 164 ELDTKIPVAWTKDYNKIRIYNPAMPEVQTRIMDIVKEIITKYDVDGIHMDDYFYPSLEEG 223 Query: 245 SRLNDNETYRKYGGA-FASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRS 303 +NDN Y KYG F S ++RRNN +I + I KPGV F VSPA N Sbjct: 224 ESMNDNAEYEKYGKDKFKSIEEFRRNNVDVVIQNIQKVIIDTKPGVIFSVSPAANIDN-- 281 Query: 304 HDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVK 363 Y + +AD R+W+++G +D I PQ+Y+ ++ W V Sbjct: 282 ------------NYSKLFADVRKWLKEGWVDVIIPQLYFATGTGKNSFNQFLDQWMQYVN 329 Query: 364 PTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNK 423 +T IG YK G + +LK Q + +++G++L+ + Sbjct: 330 --QTHCLIGYGIYKFGSTDPDYGN--AFHSSADLKSQFEYASKKSKVNGSVLYSIKDMVA 385 Query: 424 PQTQQAVSYLQSRW 437 + + ++ + Sbjct: 386 NKV-GIGTAIKEIY 398 >UniRef50_C3R8E6 S-layer protein n=24 Tax=Bacteroides RepID=C3R8E6_9BACE Length = 559 Score = 302 bits (773), Expect = 2e-80, Method: Composition-based stats. Identities = 113/390 (28%), Positives = 182/390 (46%), Gaps = 34/390 (8%) Query: 50 QQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQ 109 Q +R WL T+ +DWP ++N S R QQ+ + D LD L+ NTV Q Sbjct: 20 SQPKYEIRATWLTTLGGMDWPRNKAIN----ASGIRRQQKELCDILDRLKAANFNTVLLQ 75 Query: 110 VKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNT 169 + G ++PS I +++ +TG G NPGYDPL F + E HKRGM++HAW Sbjct: 76 TRLRGDMIYPSAIETFAESLTGSTGGNPGYDPLAFAIGECHKRGMELHAWIVTIPAGNTR 135 Query: 170 KPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPV 229 + +SV ++R + + LDPG P +++++ IV E+ SRY + Sbjct: 136 QVQLQGR---------SSVVRKNRTICKLYKGNWYLDPGNPGTKEYLSCIVKEITSRYDI 186 Query: 230 DGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGV 289 DG+ FD Y E D +TYRKYG WRR+N ++ ++ IK+IKP V Sbjct: 187 DGIHFDYIRYPEQ-ADNFPDKDTYRKYGKG-KELKQWRRDNITDIVHRLYTDIKTIKPWV 244 Query: 290 EFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAA 349 + SP G +R+ + P +RG AY Y D ++W+++G+ D + P +Y+ + Sbjct: 245 KVSSSPIGKYRDTNRYP----SRGWNAYHVVYQDAQKWLKEGIHDALFPMMYF---QGNN 297 Query: 350 RYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPE 409 Y W + + G+ Y + + N + E+ +QL + Sbjct: 298 FYPFALDWKENC---GNRWIIPGLGIYFLSPNEQ-------NWPLDEIVRQLYFTRQIK- 346 Query: 410 ISGTILFREDYLNKPQTQQAVSYLQSRWGS 439 ++G FR +L T+ LQ + + Sbjct: 347 LNGQAYFRNRFLL-NNTKGIWDELQENFYT 375 >UniRef50_C6XWM5 Putative uncharacterized protein n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6XWM5_PEDHD Length = 481 Score = 302 bits (772), Expect = 2e-80, Method: Composition-based stats. Identities = 122/402 (30%), Positives = 184/402 (45%), Gaps = 42/402 (10%) Query: 38 TPPAGSKPPATTQ--QSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKL 95 P PP T + MR +W+A+V LDWP Q+Q ID L Sbjct: 28 AKPDPVDPPVETSLLFPKKEMRAVWIASVYGLDWPQSV--------YTMAGQKQQYIDYL 79 Query: 96 DHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMK 155 + + L IN ++FQVK G A + S PWS +TG G +PGYD L+FM+DEAH R ++ Sbjct: 80 EKFKSLNINAIYFQVKGMGDAFYNSSYEPWSASITGTRGVDPGYDVLKFMIDEAHARDIE 139 Query: 156 VHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDW 215 HAW NPYR++ S+ S PA +W+ + +P +PEV+ Sbjct: 140 FHAWMNPYRIATRA---------SSASSFPALHSSVKPEWVLDFPTIRIYNPALPEVRQR 190 Query: 216 ITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLI 275 + IV E +++Y VDG+ FDDYFY E G D + KYG A+ D+RR+N + I Sbjct: 191 LVDIVKETITKYDVDGIHFDDYFYPE--GETFTDQADFTKYGAGMANIQDFRRDNVNKAI 248 Query: 276 AKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDY 335 V I + KPGV F VSPA ++ YAD ++W ++G +D Sbjct: 249 KGVYDIIVATKPGVVFSVSPAPEITK--------------NFNTLYADVKKWNQEGWVDV 294 Query: 336 IAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVP 395 + PQ+Y + + W+ + L +G +Y+ G+ + Sbjct: 295 VIPQLYQEIGNQYNDFQLRLSEWSQ--NSFKAALMVGHGYYRFGDATAPAA----FQSSS 348 Query: 396 ELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQSRW 437 EL++Q DL ++ G ++ YLN L + + Sbjct: 349 ELQRQFDLTRLNKKVVGNAMYSAKYLN-LNKVGITDKLAAIY 389 >UniRef50_B2ULM6 Putative uncharacterized protein n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2ULM6_AKKM8 Length = 486 Score = 300 bits (768), Expect = 7e-80, Method: Composition-based stats. Identities = 115/417 (27%), Positives = 189/417 (45%), Gaps = 40/417 (9%) Query: 24 LLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSR 83 CS + +++ +G PA Q R W++TV +DWP S + Sbjct: 10 FSCSLLALASQALGWQTSGESVPAVP----QEFRAAWISTVHNIDWPSRSGL-------S 58 Query: 84 ARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQ 143 Q+ +++ L+ +L +N VF QV+P+ AL+ S + PWS ++G G NPGYDPL Sbjct: 59 GAAQRAELLNILNTCAQLKLNAVFLQVRPNADALYRSSLEPWSQWLSG-PGVNPGYDPLA 117 Query: 144 FMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRF 203 F + EAH+RG+++HAWFNP+R N K R + + D ++ +G Sbjct: 118 FAIQEAHRRGIELHAWFNPFRAKANVKHAVGR----------NHISLTRPDLMKRNGSVL 167 Query: 204 VLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASK 263 +++P +D ++ +VV RY +DGV DDYFY R ++ Sbjct: 168 LINPSASASRDHALKVIMDVVRRYDIDGVHLDDYFYPYPTPGRAWSPASF-----GDGKS 222 Query: 264 ADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYAD 323 RR + + ++KS KP V GVSP G+WR G G AY+ D Sbjct: 223 PSQRRGYIDGFVQDMYKSVKSSKPWVRVGVSPFGIWRPGVP---GGIEAGVDAYEHLACD 279 Query: 324 TRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSK 383 R+W+ +G +DY+APQ+YW S + + L +WWA + +R ++ GIA ++ Sbjct: 280 ARKWLSRGWVDYLAPQLYWRCSPAKQSFPALMQWWA--AQNSRRPVWPGIATARIMSSED 337 Query: 384 IEPDWMINGGVPELKKQLDLNDAV-PEISGTILFREDYLNKPQTQQAVSYLQSRWGS 439 E+ Q++ + ++ G + + + YL + S Sbjct: 338 PGR------PASEIAAQVNYSRSLARTAPGQCFWSIKSIMR-NAGGIQKYLNRLYPS 387 >UniRef50_C9LEC6 YngK protein n=1 Tax=Prevotella tannerae ATCC 51259 RepID=C9LEC6_9BACT Length = 537 Score = 300 bits (767), Expect = 7e-80, Method: Composition-based stats. Identities = 118/387 (30%), Positives = 182/387 (47%), Gaps = 39/387 (10%) Query: 53 SQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKP 112 + R +WL T LDWP QQ+ + LD LQ +NTV FQV+ Sbjct: 25 RREYRAVWLTTFLGLDWPK---------GHDPLTQQKQLCRILDQLQAAKVNTVLFQVRL 75 Query: 113 DGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPG 172 GT + S I PW + TG G P YDPL F ++E H+RGM++HAW + V Sbjct: 76 RGTTAYDSDIEPWDGIFTGTPGRRPTYDPLAFAIEECHRRGMELHAWMVAFPVC------ 129 Query: 173 TIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGV 232 +LN + SV +H + R S D++++DPG+P D++ ++ E+VS+Y VDG+ Sbjct: 130 ---KLNVLKALGTKSVVRKHPELCRRSDDQYIMDPGMPGTADYLANLCRELVSQYDVDGI 186 Query: 233 QFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFG 292 D Y E+ G +D TYRKYG KA WRR+N +++ K+ +KS KP V Sbjct: 187 HLDYIRYPEA-GLHFDDAATYRKYGKGRELKA-WRRDNVTRVVEKIHEAVKSQKPWVRLS 244 Query: 293 VSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYD 352 +P G + + +RG A D D W+ +G +D + P +Y+ Y Sbjct: 245 CAPVGKYADLPR----QSSRGWNARDAVGQDAVMWLNKGWMDVLFPMMYFDGDN---YYP 297 Query: 353 VLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISG 412 + W + T + G+ Y + S E +W + L++Q++ E G Sbjct: 298 FVLDWLERAERGT---VAPGLGVYCL---SAGEKNWPLLT----LQRQMNFLRT-AEAGG 346 Query: 413 TILFREDYLNKPQTQQAVSYLQSRWGS 439 LFR D+L T+ +L + + Sbjct: 347 FALFRSDFLT-NNTKGVYDWLAGEYTT 372 >UniRef50_A6G0M0 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G0M0_9DELT Length = 540 Score = 297 bits (761), Expect = 4e-79, Method: Composition-based stats. Identities = 114/389 (29%), Positives = 190/389 (48%), Gaps = 37/389 (9%) Query: 54 QPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPD 113 + RG+W+ TV ++WP ++ + + +D + + +N + FQV+P+ Sbjct: 87 REFRGVWVTTVYNINWPSSQGLSAAAAQAELAS-------IVDTAEAVNLNAIVFQVRPE 139 Query: 114 GTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGT 173 A++ S + PWS ++G G +PG+DPL F+++EAH RG++VHAWFNPYR Sbjct: 140 SDAVYESSLEPWSRYLSGSQGGDPGFDPLAFLIEEAHARGIEVHAWFNPYR--------- 190 Query: 174 IRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQ 233 ++ ++ + +Q + T G +DPG +V++ +V +VV RY VDGV Sbjct: 191 -GAASAGITLAEPHIALQLPEHAHTYGSSLWMDPGALDVREHTVDVVLDVVERYAVDGVH 249 Query: 234 FDDYFYTESPGSRLNDNETYRKY--GGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEF 291 DDYFY G D T+ Y G S+ DWRR+N L+ ++ TI + P F Sbjct: 250 LDDYFYPYPNGDDFPDALTWNAYLADGGALSQGDWRRDNVNALVEELHDTIAAADPDARF 309 Query: 292 GVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARY 351 G++P G++R + G Y E YAD W+E+G +DY+APQ+YWP + Y Sbjct: 310 GIAPFGIYR----PGIPEGIVGLDQYAELYADPVLWMEEGWVDYLAPQLYWPTYSAQQTY 365 Query: 352 DVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVP--E 409 +VL WW+ + ++ G K+G+ + + E+ Q++L+ Sbjct: 366 EVLLDWWSSI--DPERYVFTGNYLSKLGD----------DWTLDEMLYQVELSRLYSDQN 413 Query: 410 ISGTILFREDYLNKPQTQQAVSYLQSRWG 438 G + F + L + L +G Sbjct: 414 SMGNVYFHVEPLQSDTLGINAALLDEFYG 442 >UniRef50_C9PUA7 FenI protein n=2 Tax=Prevotella RepID=C9PUA7_9BACT Length = 493 Score = 297 bits (760), Expect = 6e-79, Method: Composition-based stats. Identities = 121/432 (28%), Positives = 195/432 (45%), Gaps = 45/432 (10%) Query: 13 RRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPV 72 + L A+ ++ C+ + P PP + + +RG+W+ATV LDWP Sbjct: 7 KIALALSAVLVVACNHDDNILPNKPKKPDTPNPPTQSILPKKELRGVWMATVWGLDWPR- 65 Query: 73 SSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGK 132 A Q+ + I + L++ IN VF QV+ A + S PW +TG+ Sbjct: 66 -------GEYNAESQKASYIAYMKALEKNNINAVFVQVRGRADAFYKSDYEPWCQYLTGE 118 Query: 133 IGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQH 192 + ++PGYD L+FM+DEAHKRG+ HAWFNPYRV+ T + PA Sbjct: 119 VDKDPGYDVLRFMIDEAHKRGIAFHAWFNPYRVATKKA---------TDAAFPALDSRIP 169 Query: 193 RDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTES-PGSRLNDNE 251 + + + +P +PEV+ I I+ +++++Y VDGV DDYFY G + D E Sbjct: 170 QAMMVDYKTIRMYNPALPEVRQRIFDIIKDLITKYDVDGVHIDDYFYPSLTSGETIKDEE 229 Query: 252 TYRKY------GGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHD 305 Y+KY G + ++RRNN + + +++ +P V F VSPAG Sbjct: 230 EYKKYAPKDNNGKPTITIEEFRRNNVDLAVKGIHDVVQATRPEVVFTVSPAGNPDY---- 285 Query: 306 PLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPT 365 Y+ YAD +W +G + I PQ+Y+P +A ++ WW+ Sbjct: 286 ----------NYNTMYADVVKWSREGWTEAIIPQLYFPMGNAATNFNQRLIWWSQYTFKN 335 Query: 366 RTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQ 425 L+IG Y+ G+P EL KQ +++G++L+ L Sbjct: 336 --ALFIGYGTYRFGDPKSPAA----YQNASELAKQFAFASKYNKVTGSVLYSAKDLL-NN 388 Query: 426 TQQAVSYLQSRW 437 +S ++ + Sbjct: 389 PVDILSVIKDVY 400 >UniRef50_Q7MWV9 YngK protein n=2 Tax=Porphyromonas gingivalis RepID=Q7MWV9_PORGI Length = 515 Score = 297 bits (759), Expect = 7e-79, Method: Composition-based stats. Identities = 124/427 (29%), Positives = 189/427 (44%), Gaps = 45/427 (10%) Query: 20 ALALLLCSCKS---TPPESMVTPPAGSKPPATTQQS-----SQPMRGIWLATVSRLDWPP 71 ALL C + P V P + A + + MRG+WL T+ LDWP Sbjct: 13 IAALLFAGCGTKKVAPSPPTVKPLPDTVIAAPVIEPWVSPVREEMRGVWLTTIYGLDWPQ 72 Query: 72 VSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTG 131 S+ R Q++ + LD L+R NTVFFQV+ G ++PS+I P S + TG Sbjct: 73 RSAPTAEGL----RKQREELCRILDRLKREKFNTVFFQVRHRGDVIYPSEIEPQSTIFTG 128 Query: 132 KIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQ 191 P YDPL+F L E HKRG+ HAW + + ++ + SV + Sbjct: 129 TGK--PDYDPLEFALKECHKRGLTFHAWLIVTPLGPDKHIRSL---------KGESVKSR 177 Query: 192 HRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNE 251 H +W + + L+PG+PE + + S+V E+V +YPVDG+ D Y E +D Sbjct: 178 HPEWCVRHNNLWYLNPGVPEARAYFASLVREIVEKYPVDGIHLDYMRYPEK-AKIFDDAA 236 Query: 252 TYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDT 311 TY++YGG A WRR N L+A V P V+ V+ G R + G T Sbjct: 237 TYKQYGGNM-DPAAWRRRNLSDLMADVHRAATEKTPWVQVSVATIGRLRKLAGKRGGDWT 295 Query: 312 RGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYI 371 AY+ + D W ++G +D++ P +Y+ Y L W A + + Sbjct: 296 ----AYEGVHQDPVVWAQEGSVDFLVPMLYYRDD---LFYPFLEDWKAQL---PDLPIIP 345 Query: 372 GIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAVS 431 G+A Y+V + S+ + +Q+D + +G LFRED L Sbjct: 346 GLATYRVVDNSQW--------PAQVIGEQIDSARHI-GFAGVCLFREDQLRHESN-GIPQ 395 Query: 432 YLQSRWG 438 ++ R+ Sbjct: 396 IIRERFA 402 >UniRef50_C3QJ47 S-layer protein n=5 Tax=Bacteroides RepID=C3QJ47_9BACE Length = 488 Score = 297 bits (759), Expect = 8e-79, Method: Composition-based stats. Identities = 107/389 (27%), Positives = 176/389 (45%), Gaps = 34/389 (8%) Query: 50 QQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQ 109 Q +R W+ V LDWP + R Q++ +ID LD L+ NT+ FQ Sbjct: 19 AQPKHEVRAAWVTAVYGLDWPRTRATT----PQTIRKQKEELIDILDKLKAANFNTILFQ 74 Query: 110 VKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNT 169 + G L+PS I P++ ++TGK G NPGYDPL F ++E HKRGM+ HAW + Sbjct: 75 TRTRGDVLYPSAIEPFNSILTGKTGGNPGYDPLAFAVEECHKRGMECHAWMVTIPLGNKK 134 Query: 170 KPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPV 229 ++ + V + ++ + L+PG P ++++ +V EVVS Y V Sbjct: 135 HVASLGSQS---------VTKRMKEICVPYKREYFLNPGHPATKEYLMKLVREVVSGYDV 185 Query: 230 DGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGV 289 DGV FD Y E+ D +R+Y + WRR+N +++ + +K++KP V Sbjct: 186 DGVHFDYLRYPEN-APLFPDKYDFRRYNKG-RTLDQWRRDNISEIVRYIYKGVKAMKPWV 243 Query: 290 EFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAA 349 + P G +R+ S P +RG A+ Y D + W+ +G++D I P +Y+ + Sbjct: 244 KVSTCPVGKYRDTSRYP----SRGWNAFFTVYQDPQGWMGEGIMDQIYPMMYF---QGNN 296 Query: 350 RYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPE 409 Y W ++ G+ Y + E W E+ +Q++ Sbjct: 297 FYPFALDWQEQ---SNGRQVIPGLGIYFL---HPDEGKWTR----DEIDRQMNFIRKQK- 345 Query: 410 ISGTILFREDYLNKPQTQQAVSYLQSRWG 438 ++G +R YL + TQ L + Sbjct: 346 MAGEGHYRVKYLME-NTQGIYDELSENFY 373 >UniRef50_C1A7Q3 Putative uncharacterized protein n=1 Tax=Gemmatimonas aurantiaca T-27 RepID=C1A7Q3_GEMAT Length = 501 Score = 295 bits (756), Expect = 2e-78, Method: Composition-based stats. Identities = 127/430 (29%), Positives = 202/430 (46%), Gaps = 45/430 (10%) Query: 23 LLLCSCKSTPPESMVTPPAGSKPPA--------TTQQSSQPMRGIWLATVSRLDWPPVSS 74 +L +C +P + P T ++ RG+W+ATV+ +DWP + Sbjct: 1 MLFAACGGSPADPTGPVVPPPVVPPPEPPVTPFTVPTITREFRGMWIATVANIDWPSRTG 60 Query: 75 VNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIG 134 ++I QQ + LD Q+ G+N V V+ G AL+PS + PW +G G Sbjct: 61 LSIP-------QQQAEFVALLDVAQQAGLNAVILHVRAAGDALYPSTLEPWMRSFSGTQG 113 Query: 135 ENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRD 194 +PG+DPLQ+ ++++H RG+++HAWFNP+R + + E A + D Sbjct: 114 VDPGWDPLQYAIEQSHARGIELHAWFNPFRAGNASDTARLAE---------AHFGRKRPD 164 Query: 195 WIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESP---GSRLNDNE 251 +R + DPG D ++V++VV RY VDGV DDYFY + DN Sbjct: 165 ILRRYCSQLWFDPGEAATHDQAIAVVSDVVRRYAVDGVHIDDYFYPYPETGCTTDFPDNT 224 Query: 252 TYRKY--GGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGS 309 + Y G ++ADWRR+N + + ++ T++ + G+SP G+WR +P G Sbjct: 225 AFAAYQRQGGTMARADWRRDNVNRFVERLYATVRGLSRTARVGISPFGIWR--PGNPAG- 281 Query: 310 DTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRL 369 G +Y YAD+R W+++G DY APQ+YW + Y+ L WW R L Sbjct: 282 -ITGLDSYASIYADSRLWLQRGWADYFAPQLYWSSTSVGQNYNALLTWWTQQ-NTMRRHL 339 Query: 370 YIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEI----SGTILFREDYLNKPQ 425 + G+A Y++ + S V E+ Q+ + A SGTI + + K Sbjct: 340 WPGLASYRIADGS------SAPFAVTEISTQIGITRAQSSASGGPSGTIFYNASSV-KND 392 Query: 426 TQQAVSYLQS 435 V+ L+S Sbjct: 393 RGGFVTALKS 402 >UniRef50_A7VTI3 Putative uncharacterized protein n=1 Tax=Clostridium leptum DSM 753 RepID=A7VTI3_9CLOT Length = 434 Score = 295 bits (754), Expect = 3e-78, Method: Composition-based stats. Identities = 123/424 (29%), Positives = 197/424 (46%), Gaps = 41/424 (9%) Query: 25 LCSCKSTPPESMVTPPAGSKPPATTQQSSQ-----PMRGIWLATVSRLDWPPVSSVNISN 79 T + +V PAG + A Q MR +W+ + S++ S Sbjct: 35 ASGASQTASDQLVNTPAGEESQAEQPPGGQIVLEGEMRAVWVPYL---------SLDQSK 85 Query: 80 PTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGY 139 Q+A + + + G+NT+ V+P G A++PS+I PWS L+TG G +PG+ Sbjct: 86 IGQGQEAFQKAFDEIVSQAKEYGLNTLIVHVRPFGDAMYPSEIYPWSHLLTGTQGGDPGF 145 Query: 140 DPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTS 199 DPL++M+ + H+ GM+ HAW NP R+ P + + +Q + DW+ Sbjct: 146 DPLEYMVRKTHEAGMQFHAWLNPLRIQSKGTPSILAP-DHLYTQWREDSDPNNDDWVVDW 204 Query: 200 GDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYG-- 257 + +P PEV++ I + E+V YPVD + FDDYFY S G+ D + Y+ Y Sbjct: 205 EEGKYFNPAYPEVREKIIEGIREIVENYPVDAIHFDDYFYPTSDGAF--DEKAYQAYTES 262 Query: 258 ---GAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGA 314 G + WR N L++ V IKSI P V+FG+SP G N + Sbjct: 263 VGEGVPLTLPQWRIANINTLVSGVYSAIKSINPQVQFGISPQGNITNDLN---------- 312 Query: 315 AAYDESYADTRRWV-EQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGI 373 AD W ++G +DY+ PQIY F ++ A W + +LYIG+ Sbjct: 313 -----MGADVETWASQKGYVDYLCPQIYVNFDHPLLPFNQTADQWRQMTTAEGVKLYIGL 367 Query: 374 AFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAVSYL 433 A YK G W NG L+++++ + ++ G +L+ DY++ QTQ+ V+ + Sbjct: 368 AVYKAGSEDADSGTW--NGKTDILQREIEYSRSL-GCDGIMLYSWDYMDTSQTQEEVANV 424 Query: 434 QSRW 437 + Sbjct: 425 IKIF 428 >UniRef50_A6L917 Putative uncharacterized protein n=5 Tax=Bacteroidales RepID=A6L917_PARD8 Length = 495 Score = 293 bits (750), Expect = 9e-78, Method: Composition-based stats. Identities = 113/389 (29%), Positives = 176/389 (45%), Gaps = 34/389 (8%) Query: 50 QQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQ 109 + + +R +WL TV LDWP + + + QQQA++D LD LQ N VF Q Sbjct: 24 EPPKKEIRAVWLTTVYGLDWPHKPATT----EAGRKAQQQALLDILDRLQEANFNMVFIQ 79 Query: 110 VKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNT 169 + G ++ S I P S +GK GE PGYDPL F++DE HKRGM+ HAWF + + Sbjct: 80 ARLRGDVMYRSAIEPVSKTFSGKYGELPGYDPLAFVVDECHKRGMECHAWFVTFPLGTEK 139 Query: 170 KPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPV 229 +L+ V + + + LDPG+PE D+I S+V E+V+ Y + Sbjct: 140 SVKEQGKLS---------VVKKKPKLCKRHNGEWYLDPGVPETADYILSLVKEIVNGYDI 190 Query: 230 DGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGV 289 DG+ FD Y E + D Y K G S ADWRR N +++ ++ +K KP V Sbjct: 191 DGIHFDYIRYPE-EAKKFPDKALYNKSGKK-KSLADWRRENINRMVYRIYDWVKQTKPWV 248 Query: 290 EFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAA 349 + SP G + P G AY+ + D + W++QG D I P +Y+ Sbjct: 249 QVSSSPLGKYNRIERVP----NAGWTAYESVFQDPKMWMQQGKQDMIVPMMYYLHKN--- 301 Query: 350 RYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPE 409 + + W + + G+ Y++ + E DW +N ++ Q+D + Sbjct: 302 FFPFVDNWVDNC---NGRLVVPGLGAYRMDKS---EADWAVN----DITDQIDYSRYYGG 351 Query: 410 ISGTILFREDYLNKPQTQQAVSYLQSRWG 438 G FR + + L+ + Sbjct: 352 A-GCAFFRCGNVL-YNDKGLYKELRDNYY 378 >UniRef50_A9NEW0 Putative uncharacterized protein n=1 Tax=Acholeplasma laidlawii PG-8A RepID=A9NEW0_ACHLI Length = 404 Score = 293 bits (749), Expect = 1e-77, Method: Composition-based stats. Identities = 120/379 (31%), Positives = 189/379 (49%), Gaps = 24/379 (6%) Query: 54 QPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPD 113 +P R W++ V +D P + + + +I+ LD + + +FFQV+ Sbjct: 30 KPFRAFWISNVLNIDLPNMKDPS----------YKDKVIEMLDTAKAYNMTAIFFQVRTT 79 Query: 114 GTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGT 173 A + SK+ P+S +TGK GE P +D L+F++ EA R ++VHAW NPYRVS+ T Sbjct: 80 NDAFYKSKLNPYSRFLTGKEGEVPLFDVLEFVIKEAKNRSLEVHAWCNPYRVSMKTDMTK 139 Query: 174 IRELNSTLSQQPASVYVQHRDWIRTS-GDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGV 232 L++ +H +++ T + +L+P EV+ +I + E+ Y VDG+ Sbjct: 140 SEYLSTLDDLN---FAKRHPEFVITDKNGQLILNPAKEEVKTFIIDSMLEIADNYDVDGI 196 Query: 233 QFDDYFYTESPGSRL-NDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEF 291 FDDYFY + S ND + + + D+RRN +I ++ +K P + F Sbjct: 197 HFDDYFYPYAGLSDSDNDASDFEQRTDKSLTLGDFRRNQITDVIRNLNKALKEKHPNLRF 256 Query: 292 GVSPAGVWRNRSHDPLGS--DTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAA 349 GVSP G+W+ + D LGS D + + +YD YAD+ W+++G++DYI PQ+YW F A Sbjct: 257 GVSPFGIWKTKKSDELGSNVDPQCSQSYDNQYADSYLWIKEGIIDYIVPQLYWDFEHKLA 316 Query: 350 RYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPE 409 + LA WWA+V K + LYIG Y+ GE E E+ QL + Sbjct: 317 PFADLALWWAEVCKGSNVDLYIGHGPYRYGEKGGYE-------NPYEVVNQLKFANQFDN 369 Query: 410 ISGTILFREDYLNKPQTQQ 428 + G + F QQ Sbjct: 370 VVGNVFFTYKTFIDETKQQ 388 >UniRef50_C0YRL9 FenI family protein n=3 Tax=Bacteroidetes RepID=C0YRL9_9FLAO Length = 538 Score = 292 bits (746), Expect = 3e-77, Method: Composition-based stats. Identities = 117/415 (28%), Positives = 191/415 (46%), Gaps = 41/415 (9%) Query: 29 KSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQ 88 +T P + + + RG W+A+V+ ++WP + + Q+ Sbjct: 52 AATNPATGTAASTEDNFRTNLPEIKREFRGAWIASVANINWPSRNDL-------TVEQQK 104 Query: 89 QAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIG--ENPGYDPLQFML 146 I LD L+ N FQ++P AL+ S I PWS +TG+ G +P YDPLQF + Sbjct: 105 AEAISMLDMLKDNNFNAAIFQIRPSADALYTSNIEPWSYFLTGETGTAPSPNYDPLQFWI 164 Query: 147 DEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLD 206 +EAHKRG+++H W NPYR + +R + D Sbjct: 165 EEAHKRGLELHVWLNPYRAHHTNGGAVNKLSMVNKLSDIV---------VRLKNGMYWFD 215 Query: 207 PGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTE---SPGSRLNDNETYRKY--GGAFA 261 P P+ Q +++IV ++V RY +D + FDDYFY + G+ DN ++ Y G Sbjct: 216 PANPKTQGHVSNIVKDIVKRYDIDAIHFDDYFYPYATYNKGADFPDNASWNAYVSSGGTL 275 Query: 262 SKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESY 321 S+ADWRR+N + + ++ I + K V FG+SP G+W + P G G++ YDE Y Sbjct: 276 SRADWRRDNVNKFVERIYKEIHAEKNNVRFGISPFGIW--KPGYPAG--IVGSSQYDELY 331 Query: 322 ADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEP 381 AD + W+ +G +DY +PQ+YWP ++ L WW L+ G+ Sbjct: 332 ADAKLWLNKGWVDYFSPQLYWPIDSKGQSFEALLSWWQSE-NTMNRHLWPGL-------- 382 Query: 382 SKIEPDWMINGGVPELKKQLDLNDA-VPEISGTILFREDYLNKPQTQQAVSYLQS 435 + ++ E+K Q+D++ + +G I + L + + L+S Sbjct: 383 --NTVEIKVSDRPTEIKNQIDISRNILKNDAGEIHWSIAGL--TRNPNMLPALKS 433 >UniRef50_A0M6M5 Protein containing DUF187 n=4 Tax=Bacteroidetes RepID=A0M6M5_GRAFK Length = 540 Score = 287 bits (735), Expect = 4e-76, Method: Composition-based stats. Identities = 133/460 (28%), Positives = 205/460 (44%), Gaps = 76/460 (16%) Query: 2 DICSRNKKLTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQ------- 54 D C N R I + L L L +CKST V PP +P ++S Q Sbjct: 4 DCCLSN----HFRIPIFILLMLFLNACKSTK----VAPPKPVEPQVEEEKSEQNVDQVPE 55 Query: 55 --------------------PMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDK 94 RG W+ATV+ ++WP +++ Q+ I+ Sbjct: 56 VEEPEASSENKIVEPPIDIEEFRGAWIATVANINWPSKNNL-------STEAQKAEAIEM 108 Query: 95 LDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENP--GYDPLQFMLDEAHKR 152 LD L+ N V QV+P AL+ S+I PWS +TGK G+ P YDPL+F ++EAH R Sbjct: 109 LDFLENHNFNAVILQVRPQADALYDSEIEPWSYFLTGKSGKAPQPYYDPLKFWIEEAHNR 168 Query: 153 GMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWI-RTSGDRFVLDPGIPE 211 G+++H W NPYR T + S+ + + + + +DPG + Sbjct: 169 GLELHVWLNPYRAHHTTGKEIGEK----------SIVKTNPELVMELKNGMWWMDPGSSK 218 Query: 212 VQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGS---RLNDNETYRKY--GGAFASKADW 266 VQD ++V ++V RY +D V FDDYFY + + D +++ KY G S+ DW Sbjct: 219 VQDHSAAVVMDIVKRYDIDAVHFDDYFYPYASYNGKKDFPDEKSWEKYVNSGGELSRGDW 278 Query: 267 RRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRR 326 RR N I +++ IK+ K V+FG+SP G+WR + G Y+E YAD + Sbjct: 279 RRKNVNDFIERIAEEIKAEKSFVKFGISPFGIWRPGFPKGIS----GMDQYEELYADAKL 334 Query: 327 WVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEP 386 W+ +G +DY PQ+YWP + + VL WW ++ GI + Sbjct: 335 WLNKGWIDYFTPQLYWPTRQIGQSFPVLLGWWESE-NVVGRHVWPGINLGLEDKEENKG- 392 Query: 387 DWMINGGVPELKKQLDLNDA-VPEISGTILFREDYLNKPQ 425 E+ Q+ ++ + + GT+ + L K Sbjct: 393 ---------EIASQILISRGILRDNPGTVHWNIGPLMKND 423 >UniRef50_C1I7D2 Putative uncharacterized protein n=1 Tax=Clostridium sp. 7_2_43FAA RepID=C1I7D2_9CLOT Length = 741 Score = 286 bits (731), Expect = 1e-75, Method: Composition-based stats. Identities = 130/412 (31%), Positives = 192/412 (46%), Gaps = 41/412 (9%) Query: 41 AGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQR 100 P + R WL+TV +D +S V + S + + LD + Sbjct: 6 PQVTIPDKFINPKEQFRTAWLSTVVNID---ISDVTSNPNLSAEEEFKNDLSSILDRFEE 62 Query: 101 LGINTVFFQVKPDGTALWPSKILPWSDLM---------TGKIGENPGYDPLQFMLDEAHK 151 L +N V FQV P A +PS I PWS + GK G+DPL++++ E H Sbjct: 63 LNLNAVTFQVSPMLDAWYPSDIAPWSQYLHKGGNNYTLQGKDPGFNGFDPLEWLISETHN 122 Query: 152 RGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPE 211 RGM+ HAWFNPYRV+ + E + L++ + ++ I ++ LDPG PE Sbjct: 123 RGMEFHAWFNPYRVTNTVDKRPVSEKLNELAEN--NFARKNPHLIYEFQNKLFLDPGRPE 180 Query: 212 VQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLN---------DNETYRKYGGAF-- 260 V D++ V EV ++Y VD + FDDYFY D +T+ Y F Sbjct: 181 VIDYVVQRVEEVANKYNVDAIHFDDYFYPYKYSENNKDIYFYTQDLDKQTFIDYNRGFGE 240 Query: 261 ---ASKADWRRNNTQQLIAKVSHTIKSI----KPGVEFGVSPAGVWRNRSHDPLGSDTR- 312 + A WR NN LI + + I ++FGVSP G+W + + GS+T Sbjct: 241 YNIENAAKWRENNIDILIKAIKDKVTDINITNNRSIQFGVSPFGIWGHAENYLEGSNTPT 300 Query: 313 -GAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKP-TRTRLY 370 ++ + +A+TR+WV++GL+DY+ PQIYW F+ +AA Y L +WW + + LY Sbjct: 301 GSTSSLRDQFANTRKWVKEGLVDYLTPQIYWSFNTAAAPYGELLQWWDSQFEGINNSHLY 360 Query: 371 IGIAFYKVGEPSKIEPDWMINGGVP-ELKKQLDLNDAVPEISGTILFREDYL 421 IG YK I+ W N P E+ QL N + G+ F D L Sbjct: 361 IGHPNYKY-----IDASWDNNFKNPYEIANQLRFNQKFENVKGSAFFSFDKL 407 >UniRef50_B0P7J3 Putative uncharacterized protein n=1 Tax=Anaerotruncus colihominis DSM 17241 RepID=B0P7J3_9FIRM Length = 429 Score = 283 bits (723), Expect = 1e-74, Method: Composition-based stats. Identities = 112/413 (27%), Positives = 176/413 (42%), Gaps = 39/413 (9%) Query: 31 TPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQA 90 P G+ + + +RG+W++ ++ + + Sbjct: 50 APDSRDRQALGGAHTAVLSSSVNGEVRGVWISYLT---------LEPMIKGKTQAQFVKN 100 Query: 91 MIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAH 150 + D D G NTVF +P G AL+ S+ PWS +TG+ G +PGYDPL+ M+ AH Sbjct: 101 IGDAFDQAADFGFNTVFVHARPFGDALYKSEYFPWSRYLTGEEGRDPGYDPLELMVSLAH 160 Query: 151 KRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIP 210 +RG+++ AW NPYRV ++ KP + Q D +PG Sbjct: 161 ERGLRIEAWINPYRVRLDDKPMS-------ADNQAKKWLASGNDGALAWNGGVYYNPGSA 213 Query: 211 EVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNN 270 ++ I + V E+V Y VDG+ FDDYFY + + D TY+ G+ ++ADWRR N Sbjct: 214 AARELIVNGVREIVENYDVDGIHFDDYFYPTTDLTF--DAATYQA-SGSSLTQADWRREN 270 Query: 271 TQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQ 330 +L+ V +K P FG+SP G Y+ +AD R WV + Sbjct: 271 VNKLVHDVYAAVKEANPDCLFGISPQGNVD--------------INYNGQFADVRTWVSE 316 Query: 331 -GLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWM 389 G +DYI PQIY+ + A Y W ++K +LY+GIA YKVG + Sbjct: 317 PGYVDYICPQIYYGYRNGTAPYAETVALWDSMIKVDTIKLYVGIAAYKVGTVDTWAGEGK 376 Query: 390 ING--GVPELKKQLDLNDAVPEISGTILFREDYLN---KPQTQQAVSYLQSRW 437 L + + G +++ + L Q + + L+ + Sbjct: 377 NEWIDTTDILARMVKTARKAEHYGGIVIYSYESLFGDVSEQMKIERNNLKKVF 429 >UniRef50_UPI0001745532 hypothetical protein VspiD_00105 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001745532 Length = 382 Score = 283 bits (723), Expect = 1e-74, Method: Composition-based stats. Identities = 115/395 (29%), Positives = 183/395 (46%), Gaps = 38/395 (9%) Query: 49 TQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFF 108 +S MRG W+A+V L++P + + A Q+ + ++ +N++ Sbjct: 19 AAPASAEMRGAWVASVHNLNFPSRTGL-------SADQQRAEIRRIINIAAACRLNSLMV 71 Query: 109 QVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVN 168 QV+P+G AL+ S++ PWS +TG G +PGYDPL + E +G+ +HAW NPYR S Sbjct: 72 QVRPEGDALYRSRLEPWSRFLTGTQGVDPGYDPLATFIAEGKSQGIAIHAWINPYRAST- 130 Query: 169 TKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYP 228 S + + +R G +DPG P V+ + +V ++V RY Sbjct: 131 ----------SKAGKAENHISRTMPGAVRRVGSMLWMDPGDPAVRQHVVRVVEDIVRRYA 180 Query: 229 VDGVQFDDYFYTE----SPGSRLNDNETYRKY--GGAFASKADWRRNNTQQLIAKVSHTI 282 V GV DDYFY P D+ TY +Y GG +ADWRR N LI ++ + Sbjct: 181 VRGVILDDYFYPYPGTGLPRGTFPDDTTYGRYQAGGGRLDRADWRRENVNTLIRELHTVV 240 Query: 283 KSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYW 342 + + G FGVSP G++R + E Y+D W+ +G +DY++PQ+YW Sbjct: 241 HANRQGAWFGVSPFGIYR---PNVPRGVEAQLDQLTELYSDPVAWLREGTVDYLSPQLYW 297 Query: 343 PFSRSAARYDVLAKWW-ADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQL 401 + L WW + V P ++ +A ++G N V E+ +QL Sbjct: 298 -TDAGPQSFSSLLGWWRSSSVNPRGILVFPSLAADRLGGSH--------NWPVQEISRQL 348 Query: 402 DLNDAVPEISGTILFREDYLNKPQTQQAVSYLQSR 436 D+ ++ G I++ L + T+ LQ R Sbjct: 349 DIESSIRPKGGFIIWSMAPLMR-NTKGVNGVLQGR 382 >UniRef50_C0EGV5 Putative uncharacterized protein n=1 Tax=Clostridium methylpentosum DSM 5476 RepID=C0EGV5_9CLOT Length = 430 Score = 282 bits (720), Expect = 2e-74, Method: Composition-based stats. Identities = 118/409 (28%), Positives = 200/409 (48%), Gaps = 39/409 (9%) Query: 29 KSTPPESMVTPPAGSKPPATTQQSSQP-----MRGIWLATVSRLDWPPVSSVNISNPTSR 83 PP + T S+P ++ Q SQ M+ +W + L+W N + Sbjct: 28 GEFPPATGQTDFVSSEPSSSGGQESQKDPMEGMKAVWFS---YLEW------NTMFKGAS 78 Query: 84 ARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQ 143 QQ + LD+L +G NTV V+ G A++ S + PWS ++G +G++PGYDPL Sbjct: 79 EEQFQQKLGTVLDNLVSIGCNTVMMHVRAFGDAMYRSSVYPWSASVSGVLGKDPGYDPLS 138 Query: 144 FMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRF 203 ++++AH +G+ VHAW NP R + I + +Q + +++ +++ S + Sbjct: 139 IIVEKAHAKGIAVHAWINPMRTMTAAEFDQIGDC---ALKQWYAGAQRYQYYMKDSSGHY 195 Query: 204 VLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASK 263 +L+P PEV+ I++ V E+V Y +DGV DDYFY ++ Y + Sbjct: 196 ILNPANPEVRKLISAGVTELVQNYDIDGVHIDDYFYPSGVDGLPENDAQYYQEAAPGTDI 255 Query: 264 ADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYAD 323 WRR+ T +++ ++ +K++KP + FG SP N +D Y D Sbjct: 256 GSWRRDATTEMVREMHDAVKAVKPEIPFGASPQSSLTND--------------FDRLYID 301 Query: 324 TRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVG---- 379 RW+ +GL+DY+ PQIY+ F ++ +D A W ++V +T LY+G+A YKVG Sbjct: 302 IERWISEGLVDYLMPQIYFGFHNTSQPFDQTAAKWNELV-GDKTALYVGLATYKVGLEND 360 Query: 380 ---EPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQ 425 K E NG L++Q+++ +++P G L+ + P Sbjct: 361 QHAGEGKTEWIDCFNGENNMLERQVEVLESLPNCKGYCLYSYQSIFNPD 409 >UniRef50_C2M9G1 YngK protein n=1 Tax=Porphyromonas uenonis 60-3 RepID=C2M9G1_9PORP Length = 530 Score = 281 bits (718), Expect = 4e-74, Method: Composition-based stats. Identities = 103/381 (27%), Positives = 171/381 (44%), Gaps = 30/381 (7%) Query: 47 ATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTV 106 Q MR +WL T+ LDWP +++ + + QQ+++ LD R GINTV Sbjct: 54 PAPQHPKAEMRAVWLTTIWGLDWPKMTA----DTHAGMVRQQESLDKMLDDCVRAGINTV 109 Query: 107 FFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVS 166 F QV+ G L+PS + P S ++ GYDPLQ+ +D H RGM VHAW Y + Sbjct: 110 FLQVRMRGDLLYPSTLEPLSTTISKTGVLPEGYDPLQYAIDACHHRGMSVHAWMVSYPLG 169 Query: 167 VNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSR 226 N L++Q Y H + G+ + +DP P V+ + +V ++V+R Sbjct: 170 TNDHVRA-------LAKQGKGFYAAHPEMCLRQGNAWFMDPAQPAVRTHMAQLVRDLVTR 222 Query: 227 YPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIK 286 Y VDGV D Y + P S+ ND ++Y++ + WR N +I + T++ + Sbjct: 223 YDVDGVHLDYIRYPDGP-SKFNDLKSYQRMNPDRLPRMAWREANVTAMIDTLHRTLQEVA 281 Query: 287 PGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSR 346 P V + G ++ G G D+ D W ++G++D+I P IY+ Sbjct: 282 PEVALSTACIGKYQQLPKPAPG----GYFCKDDVSQDPLVWFQRGIVDFIVPMIYYKDGH 337 Query: 347 SAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDA 406 ++ WA + P + G+ Y++ + S+ + ++ QLD Sbjct: 338 ----FNYYIADWAKRIAPHG-PIVAGLGVYRLYDNSRW--------KLQDIYNQLDTLAQ 384 Query: 407 VPEISGTILFREDYLNKPQTQ 427 +SG +R + L + Q Sbjct: 385 YD-LSGVSYYRAEQLLQMYNQ 404 >UniRef50_A9KK48 Putative uncharacterized protein n=1 Tax=Clostridium phytofermentans ISDg RepID=A9KK48_CLOPH Length = 490 Score = 277 bits (707), Expect = 8e-73, Method: Composition-based stats. Identities = 105/391 (26%), Positives = 169/391 (43%), Gaps = 39/391 (9%) Query: 55 PMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDG 114 + +W++ + + + + + D++ +G+N V V+P G Sbjct: 121 EFKAVWISYLEF-----------KSTGYTKDEFEAQIDEMFDNVVDMGMNAVIVHVRPFG 169 Query: 115 TALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTI 174 A++ S PWS ++G G++PG+DPL++M++ AH RG++ HAW NPYR++ Sbjct: 170 DAMYDSDYFPWSKYISGTQGKDPGFDPLEYMVEAAHDRGLQFHAWLNPYRITSKNTDVKT 229 Query: 175 RELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQF 234 N+ + + + + +P +PEV+ I + V E+V Y VDG+ F Sbjct: 230 LATNNPARKWLTDKKTSNDRNVLSFDGNLYYNPAVPEVRTLIRNGVLEIVRNYDVDGIHF 289 Query: 235 DDYFYTESP--GSRLNDNETYRKYGGAFAS---------KADWRRNNTQQLIAKVSHTIK 283 DDYFY ++ D Y+ Y + +WRR N LI + IK Sbjct: 290 DDYFYPTLGSNYEKVFDATEYKSYVDNYKKQGLDNYILPIDEWRRQNVNTLIKGIYSAIK 349 Query: 284 SIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWV-EQGLLDYIAPQIYW 342 K V FG+SP G D Y D W+ + G +DYI PQ+YW Sbjct: 350 LEKSDVVFGISPGGFLDTLRM------------KDRYYVDVDTWLSKPGYVDYICPQLYW 397 Query: 343 PFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLD 402 F S +D + W ++ K T +Y+GI YK S EPD+ N + L + Sbjct: 398 SFEHSQYPFDGILNRWLELRKNTDVNVYVGIPVYK--SASNDEPDFKKNANI--LADMII 453 Query: 403 LNDAVPEISGTILFREDYLNKPQTQQAVSYL 433 + G + FR D ++AV L Sbjct: 454 TCRNSKLVDGYMFFRYDNFYSNTAKKAVKNL 484 >UniRef50_B0MQ11 Putative uncharacterized protein n=1 Tax=Eubacterium siraeum DSM 15702 RepID=B0MQ11_9FIRM Length = 511 Score = 275 bits (703), Expect = 2e-72, Method: Composition-based stats. Identities = 108/404 (26%), Positives = 180/404 (44%), Gaps = 34/404 (8%) Query: 27 SCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARV 86 + S +++ + T + ++G+W+ W ++ + Sbjct: 121 TVTSKKNDNVPAATDNLPVNSYTALNYNEVKGVWI-------WYSELYPILTGKSESQLR 173 Query: 87 QQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFML 146 + D D+ LGINTV+ V+P G A++ S PWS TG IG++PGYDPL+ M+ Sbjct: 174 S--GIGDYYDNCLSLGINTVYVHVRPFGDAIYKSDYFPWSKYCTGYIGKDPGYDPLKVMI 231 Query: 147 DEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLD 206 DEAH RG+ AW NP R ST + + D+I + L+ Sbjct: 232 DEAHARGISFQAWVNPLRCYYEDD----APDVSTAYKTGQWYDTKDGDYIVKVKSYWWLN 287 Query: 207 PGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADW 266 P EV D I + AE+VS+Y VDGV DDYFY + D+ + +++S + + Sbjct: 288 PAYKEVTDLIANGAAELVSKYDVDGVHIDDYFYPTTEAYF--DSIAFNA--SSYSSLSQF 343 Query: 267 RRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRR 326 R +N +++A + +KS P FGVS G N + YAD + Sbjct: 344 RLDNCSRMVADMYKAVKSHNPTALFGVSAQGNVTN--------------NETQLYADVEK 389 Query: 327 WVEQ-GLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKI- 384 W ++ G +DY+APQIY+ F ++ + + W ++ T L G+A YK+G + Sbjct: 390 WSKEDGYVDYMAPQIYYGFDNGGQPFEQVVERWDKMLAGTGKSLIPGLAVYKIGTEDEWA 449 Query: 385 -EPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQ 427 + +K+Q+ + G IL+ ++ +P + Sbjct: 450 GSGRYEWQNDKEIIKRQIVKSQKTSNYGGVILYSYQFIFEPDSN 493 >UniRef50_C9PZF4 YngK protein n=5 Tax=Prevotella RepID=C9PZF4_9BACT Length = 573 Score = 267 bits (683), Expect = 5e-70, Method: Composition-based stats. Identities = 102/431 (23%), Positives = 172/431 (39%), Gaps = 55/431 (12%) Query: 8 KKLTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRL 67 K+L+ +L+ AL + S P + + + +R +WL T+ L Sbjct: 3 KQLSFSNRFLLLFFALSTATMLCAKSFSFFKPNGLNG----WKLPKREVRAVWLTTIGGL 58 Query: 68 DWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSD 127 DWP + + A Q+Q + D LD LQR GINTV FQ + GT ++PS++ PW Sbjct: 59 DWPH----SYAQNELMAGRQKQELRDILDKLQRAGINTVLFQARVRGTVVYPSQLEPWDG 114 Query: 128 LMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPAS 187 ++G G +PGYDPL F ++E HKRGM++HAW V G N Sbjct: 115 CLSGVPGRSPGYDPLAFAINECHKRGMELHAWVVTIPVGKWNSLGCKTLRN--------- 165 Query: 188 VYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRL 247 ++ I+ G+ +DP ++ + E+ RY VDG+ D Y E+ + Sbjct: 166 ---KYPHLIKRIGEEGYMDPENTATATYLANFCKEITDRYDVDGIHLDYIRYPETWKINI 222 Query: 248 NDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPL 307 + R N ++ + +K+ KP V++ SP G + + S Sbjct: 223 AHDAA---------------RRNITTIVRAIGEKVKASKPWVKYSCSPIGKFSDLSRFAS 267 Query: 308 GSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRT 367 G AY + D + W+ GL+D + P +Y+ + + W Sbjct: 268 N----GWNAYAKVCQDAQGWLRDGLMDALFPMMYFQGNH---FFPFAIDWAEQ---SYGR 317 Query: 368 RLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQ 427 L G+ Y + K N + + +++ + G FR + + Sbjct: 318 MLVPGLGIYFMSPSEK-------NWSLDVITREMQVARQYG--MGHAYFRSKFFTD-NLK 367 Query: 428 QAVSYLQSRWG 438 +Y Q + Sbjct: 368 GIYTYAQRIFT 378 >UniRef50_C0EWT6 Putative uncharacterized protein (Fragment) n=1 Tax=Eubacterium hallii DSM 3353 RepID=C0EWT6_9FIRM Length = 491 Score = 267 bits (682), Expect = 5e-70, Method: Composition-based stats. Identities = 116/447 (25%), Positives = 187/447 (41%), Gaps = 48/447 (10%) Query: 7 NKKLTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQ----PMRGIWLA 62 KK++ R + + L K++ +S VT G+ + R +WL+ Sbjct: 40 EKKISGRSSSEFINF-LFSTQEKASSNKSSVTSNKGNSKKGNNSSTQSADTMNYRAVWLS 98 Query: 63 TVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKI 122 + + N + ++ L ++ +G N + QV+P G AL+ S Sbjct: 99 YLEFNSYRKSVKNNNESS------FRKFYKHILQQIKTIGCNRIIVQVRPFGDALYASDY 152 Query: 123 LPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLS 182 PW+ ++G G+NPGYDPL+ M + +HK G+ + AW NPYR+S ++ + N Sbjct: 153 FPWAACISGTQGKNPGYDPLKIMTEMSHKEGISIEAWINPYRISSGNSIRSLSKTNPARK 212 Query: 183 QQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTES 242 ++ I + +P V++ I V E+V Y VDG+ DDYFY Sbjct: 213 WFSVQNTKRN---ILSYEGSLYYNPSSESVRNLIIQGVKEIVQNYNVDGIHMDDYFYPSF 269 Query: 243 PGSRLNDNETYRKYGGAFA---------------------SKADWRRNNTQQLIAKVSHT 281 + +Y S ADWRR+N +L++ + Sbjct: 270 TEKNVTTAFDAPEYKQQLKTNLSSTDSTSLTSADKSSNEISLADWRRDNVNRLVSGIYKA 329 Query: 282 IKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWV-EQGLLDYIAPQI 340 +K I V FG+SPAG N D E Y D WV + G +DY+ PQI Sbjct: 330 VKEINSDVTFGISPAGNLDNLRSDL------------EYYVDIDTWVSQNGYVDYLMPQI 377 Query: 341 YWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQ 400 YW F+ A +D + W +++ + +LYIG+ Y++G + D LKK+ Sbjct: 378 YWGFTNEVAPFDKVTDAWCILMENSPVKLYIGLQLYRMGSTEPGQSDEKELQKTSLLKKE 437 Query: 401 LDLNDAVPEISGTILFREDYLNKPQTQ 427 L +I G LF YL+ + Sbjct: 438 LSYLKKQKKIEGYCLFSYQYLDCQNKK 464 >UniRef50_B6YR88 Putative uncharacterized protein n=1 Tax=Candidatus Azobacteroides pseudotrichonymphae genomovar. CFP2 RepID=B6YR88_AZOPC Length = 490 Score = 267 bits (682), Expect = 6e-70, Method: Composition-based stats. Identities = 97/391 (24%), Positives = 167/391 (42%), Gaps = 36/391 (9%) Query: 48 TTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVF 107 + +R +WL T LDWP + + Q++ +++ L L++ N VF Sbjct: 19 SALMPKNEIRAVWLTTNYALDWPTKPFTTLED----IDKQKEELVNILCCLKKTNFNIVF 74 Query: 108 FQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSV 167 FQ + G ++ SK+ P S + K G YDPL F ++E HK G++ HAWF Y + Sbjct: 75 FQTRLRGNVVYDSKVEPLSPFIRNK-GYKVTYDPLAFAIEECHKLGLECHAWFVTYLLGA 133 Query: 168 NTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRY 227 G + V ++ R LDPG E ++ SIV E+V +Y Sbjct: 134 AEVKG---------EDNCSLVVKCNQLQTRIYKGEIYLDPGDLETDRYLLSIVEEIVDKY 184 Query: 228 PVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKP 287 VDG+ D Y E P + D+ TY+ YG +K +WR++N + ++++ +K KP Sbjct: 185 DVDGIHMDYIRYPEKP-TEFPDDITYKYYGKG-KNKTEWRKDNINRFVSRLYDMVKGKKP 242 Query: 288 GVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRS 347 V+ + G++ + D T +E Y D +W+ G D+I P +Y+ Sbjct: 243 WVQVSSAVVGIYTRKLGDNKKYWTA-----NEVYQDPEQWLRMGKHDFIVPMMYYS---G 294 Query: 348 AARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAV 407 + + W A + + GI Y++ E N V + +Q+ + Sbjct: 295 NLFFPFVQDWQA---RSYGRFVVPGIGIYRMDEKDS-------NWDVQTVTEQIKSSRQH 344 Query: 408 PEISGTILFREDYLNKPQTQQAVSYLQSRWG 438 G FR +YL + +++ + Sbjct: 345 -NTGGNAFFRANYLI-GNKKGIRDEIKNNFY 373 >UniRef50_A6EKL7 Putative uncharacterized protein (Fragment) n=1 Tax=Pedobacter sp. BAL39 RepID=A6EKL7_9SPHI Length = 391 Score = 265 bits (678), Expect = 2e-69, Method: Composition-based stats. Identities = 117/304 (38%), Positives = 168/304 (55%), Gaps = 20/304 (6%) Query: 121 KILPWSDLMTGKIGENPG--YDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELN 178 PWS + G+ G PG YDPL F + EAH RGM++HAWFNPYR +++ T Sbjct: 1 SREPWSQWLMGRQGLAPGPGYDPLAFAIKEAHSRGMELHAWFNPYRATMSANTVTSA--- 57 Query: 179 STLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYF 238 + + D T G + DPGIPEV+++I ++ +VV Y VDG+ FDDYF Sbjct: 58 -------DHMTRKRPDLFFTYGGKKQFDPGIPEVREYIVQVILDVVKGYDVDGIHFDDYF 110 Query: 239 YTES-PGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAG 297 Y G R++D+ T+ KY F++K DWRRNN LI ++ +I K V+FG+SP G Sbjct: 111 YPYPIAGQRISDDVTFSKYANGFSNKNDWRRNNVDLLIKQLDDSIHHYKKYVKFGISPFG 170 Query: 298 VWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKW 357 +W+N++ D LGS T G + Y E YAD+R+WV++G +DYI PQIY+ F+R AA +D L W Sbjct: 171 IWKNKAEDTLGSATHGLSNYTELYADSRKWVKEGWVDYINPQIYFSFTRRAAPFDTLVNW 230 Query: 358 WADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFR 417 W++ LYIG A Y V + ++ Q+ A + G++ F Sbjct: 231 WSN--NAYGRHLYIGQAAYLVNQ-----KMEAAWRNPSQIPDQVRYLRANNRVQGSVYFS 283 Query: 418 EDYL 421 Sbjct: 284 SKSF 287 >UniRef50_B3QYB7 Putative uncharacterized protein n=1 Tax=Chloroherpeton thalassium ATCC 35110 RepID=B3QYB7_CHLT3 Length = 489 Score = 265 bits (677), Expect = 2e-69, Method: Composition-based stats. Identities = 104/419 (24%), Positives = 178/419 (42%), Gaps = 52/419 (12%) Query: 15 PAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSS 74 IL+ ALL+ + + + G + +RG+W+AT +DWP Sbjct: 10 LFILIFSALLILDHTTLFSQPLKNRLNGDD-------EREQLRGVWIATAYGIDWPK--- 59 Query: 75 VNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIG 134 T Q++++ + +++ +N VFFQV+ G L+ S P+S+++TG +G Sbjct: 60 ------TYDPEKQKESLQEIFHDIKKKNLNAVFFQVRIRGDVLFYSPYEPFSNVLTGSLG 113 Query: 135 ENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRD 194 P YDP+ + + A + G++ HAWFN ++ + P + + R Sbjct: 114 VIPDYDPVAYAISLAKENGLEFHAWFNTMILNGKNSTPQSEGVAHIWQAHPEWIDKRARK 173 Query: 195 WIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYR 254 L+P +PEV+ + ++ + RY +DG+Q DD Y P D+E + Sbjct: 174 NAWQ--KTAYLNPALPEVRAHLIRLITDFAERYDIDGIQLDD--YLRYPTKDFPDDEEFE 229 Query: 255 KYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGA 314 KY S DWRR N Q + + ++ KP ++FGV+P GV+ D Sbjct: 230 KYNPKKLSLDDWRRENINQFVGDLYDSLMQRKPYLKFGVTPIGVYTR------VDDVPAM 283 Query: 315 AAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAA----------RYDVLAKWWADVVKP 364 +Y + Y D+R WV + DY+APQIY+ ++ A ++ L + W + Sbjct: 284 ESYHDVYQDSREWVRRKKCDYLAPQIYFHTGKTTAADRRKNKTNPPFENLVRDWGGNMPF 343 Query: 365 TRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNK 423 LY+GI YK E Q++L + G I + + Sbjct: 344 --RHLYVGIGMYK-------------PPIKEEWPHQVELAEK-AGAEGVIFYPYHAIED 386 >UniRef50_C1Q9T9 Uncharacterized conserved protein n=3 Tax=Brachyspira RepID=C1Q9T9_9SPIR Length = 605 Score = 262 bits (670), Expect = 2e-68, Method: Composition-based stats. Identities = 126/427 (29%), Positives = 179/427 (41%), Gaps = 67/427 (15%) Query: 54 QPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPD 113 + R W +TV+ +DWP Q++ +I L+ L N VF QVKPD Sbjct: 48 REFRAAWFSTVANIDWPIKGG--------SENEQKKLIIKHLNTLYENNFNAVFVQVKPD 99 Query: 114 GTALWPSKILPWSDLMTGKIGENPG------YDPLQFMLDEAHKRGMKVHAWFNPYRVSV 167 ++PSKI P + G + D L+F++DEAHKR ++VHAWFNPYR+S+ Sbjct: 100 AGVIFPSKINPTTRYFFGTASSDEKDEYPFKTDMLKFIIDEAHKRNLEVHAWFNPYRMSL 159 Query: 168 NTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRY 227 E + + + I +R LDPG P +I V EVV Y Sbjct: 160 TYDTNKTYEEQFSKKNFIHTYVSNNLKPIHWYDNRIYLDPGEPISSKYIIDSVIEVVENY 219 Query: 228 PVDGVQFDDYFYTESPG----SRLNDNETYRKYGG--------------AFASKADWRRN 269 VDG+ FDDYFY + G D + KYG WRR+ Sbjct: 220 DVDGIHFDDYFYQNAAGGKTYKDWPDRISAEKYGEKSGYDINNTSYDDYGVNGLYAWRRD 279 Query: 270 NTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNR---SHDPLGSDTRGAAAY----DESYA 322 N +L++ + IKS KP V++ +SPAGVWRN S +Y D +A Sbjct: 280 NINRLVSDLYKEIKSRKPYVKWTISPAGVWRNNTKLSEYIGSKYGSATQSYNPNFDALHA 339 Query: 323 DTRRWVEQG------------------LLDYIAPQIYWPFSRSAARYDVLAKWWADVVKP 364 D W+ G +D + PQ+YW A +D + KWW + K Sbjct: 340 DVLLWLLNGEKTSSLENASDKDGLNRMYIDAVIPQVYWSSYHKTAPFDTIVKWWVNEYKK 399 Query: 365 TR----TRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDA--VPEISGTILFRE 418 R LYIG A YK+G + EP + + +Q+D + I G+ F Sbjct: 400 ARATNTADLYIGHALYKMGRETNTEP----WQNIELISEQIDYIRKIGINSIKGSSFFTM 455 Query: 419 DYLNKPQ 425 + K Sbjct: 456 HSMYKKD 462 >UniRef50_D1PA22 YngK protein n=1 Tax=Prevotella copri DSM 18205 RepID=D1PA22_9BACT Length = 582 Score = 262 bits (668), Expect = 3e-68, Method: Composition-based stats. Identities = 101/419 (24%), Positives = 164/419 (39%), Gaps = 60/419 (14%) Query: 18 LVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNI 77 ++LCS + +S+V Q +R +WL T+ +DWP + Sbjct: 4 FKIFFIVLCSVLAAKAQSIVFN---------NQVPKHEVRAVWLTTIGGIDWPH----SY 50 Query: 78 SNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENP 137 + + A Q++ + D LD LQ INTV Q + GT ++PS PW ++G G +P Sbjct: 51 AQSSYSAEKQKKELTDILDRLQLAKINTVLIQTRVRGTMIYPSAYEPWDGCLSGFPGRSP 110 Query: 138 GYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIR 197 GYD LQF +DE HKRGM++HAW V G + I+ Sbjct: 111 GYDALQFAIDECHKRGMELHAWVVTIPVGKWNALGCKTLRQ------------KMPKLIK 158 Query: 198 TSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYG 257 G +DP D++ +I E+ +Y VDG+ D Y E+ +++ + Sbjct: 159 KIGADGYMDPENSRTGDYLANICREITHKYNVDGIHLDYIRYPETWNIKVSREQG----- 213 Query: 258 GAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAY 317 R ++ K+ +K+ KP V+ SP G + + S + G AY Sbjct: 214 ----------RRYITNIVRKIHDAVKAEKPWVKMSCSPVGKYDDLSRY----RSFGWNAY 259 Query: 318 DESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYK 377 + D + W++ GL+D + P +Y+ Y W + G+ Y Sbjct: 260 TKVCQDAQGWLKSGLMDELFPMMYFKNEH---FYPFAIDWQEQ---SHGKIVVPGLGIYF 313 Query: 378 VGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQSR 436 + E W IN E+ + G FR +L Q + + Q Sbjct: 314 LD---PKEGKWNINDVTAEMY----HIRNLG--MGYAFFRNKFLLD-NKQGILDFTQRF 362 >UniRef50_C7H8A9 FenI protein n=2 Tax=Faecalibacterium prausnitzii RepID=C7H8A9_9FIRM Length = 425 Score = 258 bits (660), Expect = 2e-67, Method: Composition-based stats. Identities = 115/422 (27%), Positives = 174/422 (41%), Gaps = 49/422 (11%) Query: 19 VALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNIS 78 V+ AL +S P + P + S R +W++ + + Sbjct: 27 VSAALTYYLLRSIPAGNNAEPAPSPQAAPNPALPSGEWRAVWVSYLEFAEM--------- 77 Query: 79 NPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPG 138 S + +D+ LG+NTV QV+P G AL+ S + PWS L TG G++PG Sbjct: 78 -DFSSESAFRADAAALMDNCLSLGLNTVIAQVRPFGDALYRSSLFPWSHLCTGVQGQDPG 136 Query: 139 YDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRT 198 +DPL +L EAH RG+ + AW NPYR + +S L H +WI T Sbjct: 137 FDPLDVLLTEAHARGLSLEAWVNPYRFRSSASMPPAIAESSLL--------NTHPEWICT 188 Query: 199 SGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGG 258 + L+P IPE D++ VAE+V Y VDG+ FDDYFY + S D + G Sbjct: 189 VNEGAYLNPAIPEAADYVVQGVAELVQNYAVDGIHFDDYFYPTTDPSI--DAAQFAASGE 246 Query: 259 AFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYD 318 WRR N +L+ +K+ P + FGVSP G N + Sbjct: 247 --TDLTAWRRANVTRLVKAAHDAVKAADPTLRFGVSPQGNPDNDR--------------N 290 Query: 319 ESYADTRRWV----EQGLLDYIAPQIYWPFSRSAA------RYDVLAKWWADVVKPTRTR 368 E Y D W+ ++DY+ PQIYW + + + ++ + W + + T Sbjct: 291 EQYTDLSVWLTASGADAVVDYLCPQIYWGYGYTLSSGSTRFSFENITAEWLALPRAESTA 350 Query: 369 LYIGIAFYK--VGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQT 426 LY G+ Y+ VG+ L +Q+ + G L+R L + Sbjct: 351 LYFGLGAYRVGVGDGGANADSVSQWCTGSALARQVTDLRS-AGAGGWALYRYGSLFRSDE 409 Query: 427 QQ 428 Sbjct: 410 SG 411 >UniRef50_B9Y560 Putative uncharacterized protein n=1 Tax=Holdemania filiformis DSM 12042 RepID=B9Y560_9FIRM Length = 408 Score = 257 bits (656), Expect = 7e-67, Method: Composition-based stats. Identities = 112/435 (25%), Positives = 189/435 (43%), Gaps = 46/435 (10%) Query: 7 NKKLTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSR 66 ++ + I+R I V + LL+ + G P T +R W++ + Sbjct: 4 SELMHIKRILIAVFVILLIFA--------------GCHPRKKTGTMG-EVRAAWISYIE- 47 Query: 67 LDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWS 126 ++ Q + L++L+ + NTV+ A +PS+ P + Sbjct: 48 --------LSSILDNRSETDYIQGVKTMLENLKAMNFNTVYVHASAFTDAYYPSQYYPTA 99 Query: 127 DLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPA 186 + G+IG+N YDP + +AH+ G + AW NP R S T + ++S + Q + Sbjct: 100 QYVAGQIGQNVAYDPFGLFVQQAHEAGFHIEAWINPMR-SFRTDQESQIPVSSVIGQWLS 158 Query: 187 SVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSR 246 ++ I GDR+ L+P PEV++ I ++ E+ YP+DG+ DDYFY + Sbjct: 159 DPTMRGTR-IVAEGDRWYLNPAYPEVRELICAVAKELAQNYPIDGLHLDDYFYPDGVSES 217 Query: 247 LNDNETYRKY--GGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSH 304 D Y+ Y G S DWRR N +++A + T+K + ++ G+SPAG Sbjct: 218 F-DQVAYQAYRQTGGELSLGDWRRQNINEMVASLYATVKQVDKTIQVGISPAGNLEY--- 273 Query: 305 DPLGSDTRGAAAYDESYADTRRWVE-QGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVK 363 + + Y D R WV G LDYI PQIY+ + +D K W D+ + Sbjct: 274 -----------SVESIYGDVREWVRHDGYLDYILPQIYFGYEHGTLPFDQCLKQWEDLTQ 322 Query: 364 PTRTRLYIGIAFYKVG--EPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYL 421 T T L +G+A YK+ + + + LK+Q+ ++G +F + L Sbjct: 323 GTSTELIVGLAAYKINTVDNYAKDGKYEWQQHDDILKRQILELRDHAAVAGFSIFSYNSL 382 Query: 422 NKPQTQQAVSYLQSR 436 +P + A Q Sbjct: 383 FQPAAENAQRVSQEL 397 >UniRef50_B0NXH7 Putative uncharacterized protein n=3 Tax=Clostridiales RepID=B0NXH7_9CLOT Length = 468 Score = 255 bits (650), Expect = 3e-66, Method: Composition-based stats. Identities = 118/457 (25%), Positives = 179/457 (39%), Gaps = 47/457 (10%) Query: 6 RNKKLTIRRPAILVALALL--LCSCKSTPPESMVTPPAGSKPPATTQQSSQ--------- 54 K I+ A+LV A+ LCS PA T S Sbjct: 31 HLKLGKIQWIALLVVSAMFVNLCSHMVLAAARDGNDPAQGTTAEATTASEATTETTTEQQ 90 Query: 55 -------PMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVF 107 + W + D+ + + R ++ K + LG+N V Sbjct: 91 EETQSMGEYKAFWFS---FYDYDSYRAKYKKRTAANFRTYFTGVVKK---GKSLGMNRVI 144 Query: 108 FQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSV 167 QV+P G A++ SK PWS ++GK G NPG+DPL+ M++ AH MK+ AW NPYRV+ Sbjct: 145 VQVRPFGDAIYKSKYFPWSKYISGKQGRNPGFDPLKIMVEVAHDNDMKIEAWVNPYRVTT 204 Query: 168 NTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRY 227 + N+ + R + + G +P V+ IT+ V E+V Y Sbjct: 205 GSTNYKKLAKNNQARKW--HAKKSTRRNVLSYGGSLYYNPSKKAVRTLITNGVKEIVQNY 262 Query: 228 PVDGVQFDDYFYTES-----PGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTI 282 VDG+ DDYFY + Y S +RR L+ ++ + Sbjct: 263 DVDGIHMDDYFYPSFTKRNVKKAFDAKEYKKSSYKKKKKSIYTYRRAQINTLVKQMKKAV 322 Query: 283 KSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQ-GLLDYIAPQIY 341 KS+ P V +G+SPAG N + Y D +W+ +DYI PQ+Y Sbjct: 323 KSVDPNVTYGISPAGNIDN------------LTSKYSYYVDIYKWLNSTEYVDYICPQVY 370 Query: 342 WPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMING--GVPELKK 399 W F A+++ + W K + +LYIGIA Y+ G LKK Sbjct: 371 WGFKHPTAKFNKVTDRWIKAAKSKKVKLYIGIAVYRAGHNVGQNRAERKEWKRDTKVLKK 430 Query: 400 QLDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQSR 436 Q+ + G F L + +AV+ L++ Sbjct: 431 QVQYARK-KHVDGFAFFDYQDLKSKTSAKAVNQLKTV 466 >UniRef50_C5VL52 YngK protein n=3 Tax=Prevotella RepID=C5VL52_9BACT Length = 566 Score = 253 bits (647), Expect = 6e-66, Method: Composition-based stats. Identities = 103/423 (24%), Positives = 169/423 (39%), Gaps = 54/423 (12%) Query: 19 VALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNIS 78 + L LLC T +K + + R +WL T++ LDWP N + Sbjct: 4 LFLFTLLCVITMTTQAQTNLSDYLTK-----RMPKRETRAVWLTTLASLDWPK----NYA 54 Query: 79 NPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENP- 137 ++Q+Q +ID LD Q+ INTV Q + ++PS I PW +TG G P Sbjct: 55 RSEESIKLQKQELIDILDKYQKANINTVLLQARVRAATIYPSDIEPWDQCITGVEGRAPG 114 Query: 138 -GYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWI 196 GYDPL F ++E HKRGM++HAW V G ++ I Sbjct: 115 YGYDPLSFAVEECHKRGMEIHAWIATIPVGAKNSLGCRT-------------LMKKGFRI 161 Query: 197 RTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKY 256 R LDP P V ++ S+ E+V +Y VDG+ D Y + + + Sbjct: 162 RNFSTGSYLDPADPSVAPYLASVCGEIVRKYDVDGINLDYIRYPD----------GWPRP 211 Query: 257 GGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAA 316 D RR+N ++ + +K+IKP V+ SP G + S ++ A Sbjct: 212 SYRDGDTPDQRRSNITAIVRAIHDEVKAIKPWVKMSCSPIGKHADLSRY----SSKNFNA 267 Query: 317 YDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFY 376 +D + + W+ GL+D + P Y+ Y +A W + + + G+ Y Sbjct: 268 HDRVSQEAQEWMRLGLMDQLYPMQYFRGDN---YYPFVADWVEN---AYKREIVTGLGTY 321 Query: 377 KVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQSR 436 + N + +L +Q+ ++ + G FR +L Q + + Sbjct: 322 FLDPRE-------GNWTLGDLTRQMYVSRDLG--VGHAHFRSYFLT-ANKQGVYDFEKQF 371 Query: 437 WGS 439 + Sbjct: 372 NAT 374 >UniRef50_A9NEM7 Hypothetical surface-anchored protein n=2 Tax=Acholeplasma laidlawii PG-8A RepID=A9NEM7_ACHLI Length = 906 Score = 237 bits (604), Expect = 6e-61, Method: Composition-based stats. Identities = 124/445 (27%), Positives = 200/445 (44%), Gaps = 60/445 (13%) Query: 18 LVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNI 77 LV L +L S + P + + K +++Q +R +W+ V Sbjct: 11 LVCLTTILLSGFTKPNSNDI------KSFEFEFETNQKLRAVWVT----------PIVGE 54 Query: 78 SNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENP 137 + + + M LD L+ IN + F V+ AL+ S++ P + + N Sbjct: 55 VSTFTTETAFKNEMNQMLDILEHYKINALIFHVRTHNNALYDSELNPKATVFGSVNFNN- 113 Query: 138 GYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIR 197 +DPL ++++E RG++ HAW NPYRV N E AS + + Sbjct: 114 -FDPLLWLVNETQSRGIEFHAWLNPYRVGTNYVGTMPAEN----PASNASNILSNP---- 164 Query: 198 TSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGS----------RL 247 ++ +L+PG P V+D+I V E++ +YPVD + FDDYFYT + Sbjct: 165 SNSALKILNPGEPVVRDFIVDTVIEIIEKYPVDAIHFDDYFYTNLGANGALSGATTILDE 224 Query: 248 NDNETYRKYGGAF-----ASKADWRRNNTQQLIAKVSHTIKSIK----PGVEFGVSPAGV 298 D +TY YG F KA+WRR+ ++ VS+ IK+ ++FG+SP G+ Sbjct: 225 PDQQTYVTYGSGFNTTSATDKANWRRHQVNTMVQAVSNAIKNYNQLNGKHIQFGISPTGI 284 Query: 299 WRNR----------SHDPLGSDTRGAAAYDES-YADTRRWVEQGLLDYIAPQIYWPFSRS 347 ++N GS T G Y ++D+ W+++G LDYIAPQ YW + S Sbjct: 285 YKNGNGVVTYDEFGKPVTTGSLTTGQTHYSSYLFSDSLHWIKEGWLDYIAPQSYWATNHS 344 Query: 348 AARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAV 407 AA Y + WW VVK LY GI Y E + +W + E++ QL+ + + Sbjct: 345 AASYYNVMGWWEKVVKYLDVNLYSGIGLYMADESTNTF-NWKD--DMLEMRTQLEYLETL 401 Query: 408 PEISGTILFREDYL-NKPQTQQAVS 431 ++ G ++ Y+ N Q + S Sbjct: 402 NDVDGLSVYSYKYIRNHYNNQNSTS 426 >UniRef50_C4FZ05 Putative uncharacterized protein n=1 Tax=Abiotrophia defectiva ATCC 49176 RepID=C4FZ05_ABIDE Length = 562 Score = 237 bits (604), Expect = 6e-61, Method: Composition-based stats. Identities = 104/375 (27%), Positives = 168/375 (44%), Gaps = 35/375 (9%) Query: 53 SQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKP 112 + +R +W+ + S+ + + D + G+N ++ V+P Sbjct: 193 TNEVRAVWIT-----------FLEFSSKGYTVNSFTNQITEMFDKIAASGMNEIYVHVRP 241 Query: 113 DGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPG 172 A++ S PWS +GK G +PG+DPL M++ AH R +K+HA+ NPYRV G Sbjct: 242 FSDAMYRSVYFPWSKYASGKQGVDPGFDPLAIMVNAAHTRNLKLHAYINPYRVCAEADFG 301 Query: 173 TIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGV 232 ++ +NS + ++ + G + +P +V + I + VAE+V Y VDGV Sbjct: 302 SLA-VNSPAYKWLNDDDEENDRNVLKFGKMYYYNPSSDDVINLINNGVAEIVKNYDVDGV 360 Query: 233 QFDDYFYTESPGSRL--NDNETYRKY---GGAFASKADWRRNNTQQLIAKVSHTIKSIKP 287 FDDYFY + D+E Y Y S DWRR+N +++ V T+KS Sbjct: 361 IFDDYFYPTLGSNYSSKFDSEEYADYKLNTANPMSIVDWRRDNINKMVKTVYATVKSSGK 420 Query: 288 GVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQ-GLLDYIAPQIYWPFSR 346 FG+SPAG N + D+ Y D RW + G +DYIAPQ YW F Sbjct: 421 NRTFGISPAGNLTNLRAN------------DKYYVDIDRWGRETGFVDYIAPQQYWGFEH 468 Query: 347 SAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDA 406 S ++ W VV +LY+ + + ++ +W N + L + + Sbjct: 469 SICPFEDNVSKWMAVVTNPNVKLYVALPMHLA--QAQETSEWKNNHDI--LGRMVTSLRN 524 Query: 407 VPEISGTILFREDYL 421 +SG ++R Y+ Sbjct: 525 -KSLSGFSIYRYHYI 538 >UniRef50_D1PRQ4 FenI protein n=1 Tax=Subdoligranulum variabile DSM 15176 RepID=D1PRQ4_9FIRM Length = 412 Score = 237 bits (603), Expect = 1e-60, Method: Composition-based stats. Identities = 112/398 (28%), Positives = 172/398 (43%), Gaps = 46/398 (11%) Query: 33 PESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMI 92 P++ A + + + P R +W++ + S A + + Sbjct: 30 PDTPSATAAPTPAATAVPERTAPYRAVWVSYLE----------WQQVDFSGADAFSRDIA 79 Query: 93 DKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKR 152 LD++ +G V QV+P G AL+PS P+S L TG G +PG+DPL +++ AH Sbjct: 80 AMLDNIASVGATVVLAQVRPFGDALYPSDYFPFSHLCTGIQGRDPGFDPLALLVEAAHAS 139 Query: 153 GMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEV 212 G+++ AW NPYR+ P S V H DW++ + LDP P+V Sbjct: 140 GLELEAWVNPYRLQAGGVPAL----------CDQSPAVTHPDWVKKTETGSYLDPANPDV 189 Query: 213 QDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQ 272 + +I V E+ Y +DG+ FDDYFY + + + G S ADWRR+N Sbjct: 190 RQYIADGVEELCRNYALDGIHFDDYFYPTTSATFDAAEYAAAQTG---LSLADWRRDNVN 246 Query: 273 QLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQ-G 331 L++ + + GV GV+P G YD Y+D RW+ Q G Sbjct: 247 ALMSLCHGV--TARYGVRLGVAPLGD--------------PELCYDGQYSDAARWLAQGG 290 Query: 332 LLDYIAPQIYWP-----FSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEP 386 +DY+ PQ+YW +A D LA WAD+ + LY+G+ Y++G+ Sbjct: 291 YVDYLMPQLYWGLTYEQNGDTAHSLDTLAARWADLPRAEGVALYVGLGAYRIGDGDGSTA 350 Query: 387 DWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKP 424 L QLD + + I G L+R L Sbjct: 351 GAAEWQSGHALADQLDALETL-GIGGAGLYRYASLWAN 387 >UniRef50_UPI0001C37647 hypothetical protein RflaF_08645 n=1 Tax=Ruminococcus flavefaciens FD-1 RepID=UPI0001C37647 Length = 379 Score = 234 bits (596), Expect = 6e-60, Method: Composition-based stats. Identities = 106/417 (25%), Positives = 183/417 (43%), Gaps = 52/417 (12%) Query: 13 RRPAILVALALLLCSCKSTP-PESMVTPPAGSKPPATTQQSSQPM-----RGIWLATVSR 66 + A++ A LL C PE++ P + A + P+ +G+W+ + Sbjct: 3 KILAVMALSAFLLGRCTPAAMPENLKQPDPAAVSEAAANKEYAPLNYEYQKGMWIPYLDY 62 Query: 67 LDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWS 126 ++ A + A+ +L G NTV+ ++P G A + S P Sbjct: 63 AEYMQ---------GKTADDFRSAIRKRLSDAADSGTNTVYVHIRPTGDAYYKSTFFPKG 113 Query: 127 DLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPA 186 + + YDPL+ MLDEAHK G+ VH W NP R+ + T+ + + Sbjct: 114 RYL------DGDYDPLEIMLDEAHKLGLSVHGWINPLRLQTAEEMETVPDS----AITKQ 163 Query: 187 SVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSR 246 I +G R L P PEV++ + + + E++ Y VDG+ DDYFY ++ S Sbjct: 164 WYSSGDSMNIGETGGRLYLRPDSPEVRELLANEIREIIGSYDVDGIHIDDYFYPDTDPSF 223 Query: 247 LNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDP 306 D+E++ G + +K WR + +++ + +K V FG+SP G R Sbjct: 224 --DSESFALSGESDLTK--WRTDAVSEMVKAMYSAVKDTDERVLFGISPQGNVRAD---- 275 Query: 307 LGSDTRGAAAYDESYADTRRWVEQ-GLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPT 365 Y+ YAD RRW+ + G DYI PQIY+ F + + + W + + + Sbjct: 276 ----------YETQYADVRRWISEKGFCDYIVPQIYYGFKNETLPFTSVLEEWERMAENS 325 Query: 366 RTRLYIGIAFYKVGEPSKI-----EPDWMINGGVPELKKQLDLNDAVPEISGTILFR 417 RL IG+ YK+G+ + E +W+ + G+ + + Q L+ + G ++ Sbjct: 326 NVRLIIGLGAYKLGKEDRWAGESGESEWLDDPGIIDKQTQAVLDSS---ADGYAVYY 379 >UniRef50_B4VPG3 Putative uncharacterized protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VPG3_9CYAN Length = 406 Score = 233 bits (595), Expect = 8e-60, Method: Composition-based stats. Identities = 88/437 (20%), Positives = 154/437 (35%), Gaps = 73/437 (16%) Query: 13 RRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPV 72 RR + + L L+ +S+V P+ ++ T+ +RG+WL V+ Sbjct: 8 RRFRVFLVLGLVFSIVLLVA-KSIVFSPSLARSQTPTK--ITEIRGVWLTNVA------- 57 Query: 73 SSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGK 132 S + + L +L NTV+ V G +PS + L+ Sbjct: 58 ---------SGVLFSPWGINRAIAQLSKLNFNTVYPVVWNRGHTFYPSAVATQEPLLAIM 108 Query: 133 IGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQH 192 D L +L + H++G++V WF ++ + P + Sbjct: 109 RL---NGDVLADILQQGHRQGLRVIPWFEYGFMTPIYSE--------LARRHPTWITQSL 157 Query: 193 RDWIRTSGDRF-VLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNE 251 + L+P PEVQ I ++ EVVS+Y VDG+Q DD+F P D Sbjct: 158 TQKSDPENPQLLWLNPLHPEVQQLILDLIKEVVSQYDVDGIQLDDHFG--MPVELGYDPY 215 Query: 252 TYRKYGGAF-----------ASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWR 300 T +Y + WR N + + ++ T+KSIKP +SP Sbjct: 216 TIERYQQEHYGNSPPNSPLNSEWMRWRANKISEFMGEIVQTVKSIKPDCIISLSP----- 270 Query: 301 NRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWAD 360 A AY D + WV++G +D + Q+Y + + A Sbjct: 271 ----------NPQAFAYKHYLQDWQTWVQRGWVDELVLQVYRD---ELSSFTAELNQPAV 317 Query: 361 VVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDY 420 + + IGI + +P E +++ Q++ G F + Sbjct: 318 RMARRTIPVSIGILTGTLADPISFE----------QIQAQVEAVRD-RAFDGVSFFYWET 366 Query: 421 LNKPQTQQAVSYLQSRW 437 L T ++ + + Sbjct: 367 LWSYLTPESPQQRRRGF 383 >UniRef50_Q8YXK2 All1210 protein n=4 Tax=Nostocaceae RepID=Q8YXK2_ANASP Length = 906 Score = 226 bits (577), Expect = 9e-58, Method: Composition-based stats. Identities = 87/408 (21%), Positives = 138/408 (33%), Gaps = 85/408 (20%) Query: 57 RGIWLATVSRLDWPPVS-----------SVNISNPTSRARVQQQAMIDKLDHLQRLGINT 105 R WLAT + L W SV + T +Q + D L + GINT Sbjct: 397 RQQWLATRTNL-WKQFPTDRRLAPAEIRSVWLDRGTIVRAGSEQELAKIFDRLAQAGINT 455 Query: 106 VFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRV 165 +FF+ G ++PSK+ P + + G+DPL + AH RGM++HAW + Sbjct: 456 IFFETINAGYTIYPSKVAPQQNPLIR------GWDPLASGVKLAHARGMELHAWVWTFAA 509 Query: 166 SVNTKPGTIR----ELNSTLSQQPASVYVQHRDWIRTSG-DRFVLDPGIPEVQDWITSIV 220 + L+ P H+ + G + DP PE++ ++ + Sbjct: 510 GNQRHNELLNIPTNYPGPVLAANPDWANYDHQGQMIPLGQTKPFFDPANPELRQYLLKLY 569 Query: 221 AEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGA--------------------- 259 E++++Y VDG+Q D Y D R YG Sbjct: 570 EEIITKYKVDGLQLDYIRYP------FQDPAAGRSYGYGKAARTQFQQLTGVDPMKISPS 623 Query: 260 ----FASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAA 315 + +R +A+VS ++ + V+ Sbjct: 624 QTQLWQQWTTFRTQQVDSFVAQVSQMLRQQDRNLILSVAVFP-------------LPEYE 670 Query: 316 AYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAF 375 + W QG +D I P Y ++ R+ LAK W + T L GI Sbjct: 671 RVQKIQQHWEIWARQGNIDLIIPMTY---AQDTVRFQTLAKPWITSTQLGSTLLIPGIRL 727 Query: 376 YKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNK 423 + + QL L +P +SG LF + LN Sbjct: 728 LSLPTLGAFD--------------QLQLVRDLP-VSGYALFAAENLNN 760 >UniRef50_B4WH89 Putative uncharacterized protein n=1 Tax=Synechococcus sp. PCC 7335 RepID=B4WH89_9SYNE Length = 635 Score = 223 bits (569), Expect = 8e-57, Method: Composition-based stats. Identities = 87/451 (19%), Positives = 154/451 (34%), Gaps = 93/451 (20%) Query: 16 AILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSV 75 AI+ +L + + P + +V PP+ + +RG+W+ + Sbjct: 231 AIICRASLSPNTVATVPSDRIVFPPSLPTVAT----PTTELRGVWMTNI----------- 275 Query: 76 NISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGE 135 S + A+ L+ L +L N V+ V GT L+PS + + + K G Sbjct: 276 -----DSDVLFSRSALEQALETLSKLNFNVVYPTVWNWGTTLYPSAVAERT--IGYKQGL 328 Query: 136 NP----------------GYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNS 179 P D LQ +++ AH R +KV WF G + +S Sbjct: 329 YPDLDRTGRKVELEAAQGDRDMLQEIIELAHSRNLKVMPWFE---------FGFMAPADS 379 Query: 180 TLSQQPASVYVQHRDWIRT----SGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFD 235 L+++ Q D T +R L+P EVQ ++ +++E+ + Y +DG Q D Sbjct: 380 ELARRHPDWLTQKADGTLTTLEGEHERVWLNPFHLEVQTFLLQLISELSANYDIDGFQVD 439 Query: 236 DYFYTESPGSRLNDNETYRKYGGA-----------FASKADWRRNNTQQLIAKVSHTIKS 284 D+ P + D T Y WR + + +V T+K+ Sbjct: 440 DHMGL--PFAYGYDPYTINLYQQEHDGKSPPADPKDPEWTRWRADKITDFMDQVFTTVKA 497 Query: 285 IKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPF 344 +P VSP AY+ D WV++G ++ + Q+Y Sbjct: 498 QRPQAIMSVSP---------------NPHIFAYEYYLQDWDTWVKRGYVEELIIQLYRT- 541 Query: 345 SRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLN 404 R+ A +G+ G +P +++Q++ Sbjct: 542 --DLGRFVWEMGQEAAEYAREHIPTAVGVLSGLKGRSVP----------MPLIEEQVEAV 589 Query: 405 DAVPEISGTILFREDYLNKPQTQQAVSYLQS 435 +G F + L + S Q Sbjct: 590 RD-RGFAGVSFFFYETLWNLSNEGTPSQRQQ 619 >UniRef50_C2L0K0 Lipoprotein yddW n=1 Tax=Oribacterium sinus F0268 RepID=C2L0K0_9FIRM Length = 443 Score = 223 bits (569), Expect = 9e-57, Method: Composition-based stats. Identities = 94/423 (22%), Positives = 167/423 (39%), Gaps = 48/423 (11%) Query: 34 ESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMID 93 + + P + S + +R +W + + ++ P + + Sbjct: 52 GNFAYASNLTGPGVKSLSSQKELRAVWFSYLDWINMPKE-----------EQAFRAEAAK 100 Query: 94 KLDHLQRLGINTVFFQVKPDGTALWPS-KILPWSDLMTGKIGENPGYDPLQFMLDEAHKR 152 +D+LQ+ G T+F V + + P+S M G N +DPL+ M+ EA K+ Sbjct: 101 VMDNLQKNGFQTIFLHVHSHSDSYGKKMTVFPYSKFMPG----NGSFDPLEIMISEAKKK 156 Query: 153 GMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEV 212 G+ VHAWFNPYRVS + +S + + S + ++ ++P Sbjct: 157 GISVHAWFNPYRVSSSMSKWENIPEDSIVKKW--SRTSGEERNVLLHEGQYYINPSRAAG 214 Query: 213 QDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRL---NDNETYRK--YGGAFASKADWR 267 ++ + + + E++ Y VDG+ FDDYFY + D Y + G S ++R Sbjct: 215 REALLASIKELLDNYAVDGIHFDDYFYPRVSLTEEGKRFDEPEYEEAKRQGETGSLTEYR 274 Query: 268 RNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRR- 326 RN L+ +V K GV FGVSP P R + AY + D + Sbjct: 275 RNQVSLLLKQVHSLCKER--GVVFGVSPV---------PNLQSLRSSVAY---FLDVDKI 320 Query: 327 WVEQGLLDYIAPQIYWPFSRSAAR-------YDVLAKWWADVVKPTR--TRLYIGIAFYK 377 + +DYI PQ+Y F + Y W ++ T L +G+ Y+ Sbjct: 321 MASKDYIDYIMPQMYHGFRAKNGKGQEAPHAYMRSLGDWVNLTNSTGNQVELMLGLGLYR 380 Query: 378 VGEPSKIEPDWMINGGVPEL-KKQLDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQSR 436 G ++ K+Q++ + G +F L + + Q+ + L+S Sbjct: 381 AGSSVWDGNPVSEWFTESDILKRQVEEARKTGIVKGYAVFAYQNLLEERAQRELGNLRSV 440 Query: 437 WGS 439 + + Sbjct: 441 FQN 443 >UniRef50_Q8YLM8 Alr5270 protein n=12 Tax=Cyanobacteria RepID=Q8YLM8_ANASP Length = 420 Score = 219 bits (558), Expect = 1e-55, Method: Composition-based stats. Identities = 83/429 (19%), Positives = 149/429 (34%), Gaps = 83/429 (19%) Query: 36 MVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKL 95 + P + + ++ +RG+WL V+ S + + Sbjct: 24 LSVPSFSNYEQKSNLPTTTEIRGVWLTNVA----------------SGVLFVPWGINRAI 67 Query: 96 DHLQRLGINTVFFQVKPDGTALWPSKILPW---SDLMTGKIGENPGYDPLQFMLDEAHKR 152 D L L NT++ V G + S SD + G D L ++ A + Sbjct: 68 DQLSALNFNTIYPVVWNRGYTFYKSSTAKSVTGSDTQPLLNFVHGGQDVLAKIVALAKPK 127 Query: 153 GMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRF--------- 203 + V WF G + +S ++++ + + T + Sbjct: 128 NLSVIPWFE---------YGFMAPPDSVIAKRHPEWLTNGQGGVITISEMLPEESDNDPT 178 Query: 204 ----VLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGA 259 L+P PEVQ +I S+++EVV+ Y +DG+Q DD+F P D T Y Sbjct: 179 NKLVWLNPLHPEVQKFILSLISEVVTNYHIDGIQVDDHFG--MPVQFGYDPYTTELYQKE 236 Query: 260 FAS-----------KADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLG 308 WR N + + +VS +K IKP V+ +SP Sbjct: 237 HKGKSPPRNHLDAEWMKWRANKITRFMTQVSQVVKEIKPSVKVSLSP------------- 283 Query: 309 SDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTR 368 A AY D W++ GL+D + Q+Y + + + A + T+ Sbjct: 284 --NSQAFAYKYYLQDWANWIKTGLVDELILQVY---RNDKSSFVYELEQPAVKLARTQIP 338 Query: 369 LYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQ 428 + IGI+ + P + ++++Q+ G F + L T + Sbjct: 339 VAIGISTGTLRSPV----------KIEQIREQVQAVRD-RSFFGISFFYWESLWGYITPE 387 Query: 429 AVSYLQSRW 437 + Y + + Sbjct: 388 SPPYRRQVF 396 >UniRef50_B4VTS6 Putative uncharacterized protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VTS6_9CYAN Length = 884 Score = 218 bits (554), Expect = 5e-55, Method: Composition-based stats. Identities = 69/379 (18%), Positives = 140/379 (36%), Gaps = 61/379 (16%) Query: 70 PPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLM 129 P + ++ + T ++Q ++ D+L + G NTVFF+ ++PS++ P + + Sbjct: 400 PEIRAIWLDRGTIVKAKRKQDLVKLFDNLAKAGFNTVFFETVNASYPIYPSQVAPEQNPL 459 Query: 130 TGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIR----ELNSTLSQQP 185 G+DPL+ ++ AH+RGM++HAW + + + + L+ P Sbjct: 460 VR------GWDPLEAAVELAHERGMELHAWVWIFAAANQRHNALLNQPLDYPSPVLAAHP 513 Query: 186 ASVYVQHRDWIRTSGDRF-VLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESP- 243 + + R DP PEV++++ +++ E+ +RY VDG+Q D Y Sbjct: 514 DWAIFDKQGRLFAPNTRKAFFDPAHPEVREYLMALLEEIATRYDVDGIQLDYIRYPFQDP 573 Query: 244 ------GSRLNDNETYRKYGGA------------FASKADWRRNNTQQLIAKVSHTIKSI 285 G + + +++ G + ++R +A VS + S Sbjct: 574 RVNQTYGYGVAARQQFKERTGVDPIEVYPRDRTLWQQWTEFRIRQVDSFVASVSARLLSQ 633 Query: 286 KPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFS 345 +P + + + + W +QG +D + P Y + Sbjct: 634 RPDLILSAAVFPLPPAERQ-------------QRLQQNWEEWAKQGYIDLVVPMTYALDT 680 Query: 346 RSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLND 405 L + R L GI + + ++ Q+ L Sbjct: 681 EGLHS---LTQPLLTESTLNRVLLIPGIRLLNLPDVVAVD--------------QIQLLR 723 Query: 406 AVPEISGTILFREDYLNKP 424 +P +G +F + LN+ Sbjct: 724 DLP-ANGYAVFAVENLNEN 741 >UniRef50_Q8YQA0 All3933 protein n=18 Tax=Cyanobacteria RepID=Q8YQA0_ANASP Length = 741 Score = 218 bits (554), Expect = 5e-55, Method: Composition-based stats. Identities = 85/429 (19%), Positives = 148/429 (34%), Gaps = 73/429 (17%) Query: 23 LLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTS 82 LL + ++ Q Q +RG+WL Sbjct: 39 LLPVLFALSFTTVLLLQNLTPATAQFFQSPRQEIRGVWLTN----------------NDF 82 Query: 83 RARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPL 142 + + D L L+RL NT++ V DG +PS + + G G D + Sbjct: 83 DILRNRAKVQDTLAQLRRLNFNTIYPVVWNDGYTKYPSAVTQRMGIPYFFRGTE-GQDVI 141 Query: 143 QFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTS--- 199 ++ +A +G+ WF G + L S L+ Q Q RD +TS Sbjct: 142 ADIISQARSQGLLAIPWFE---------FGFMAPLTSELASQHPDWLTQKRDGTQTSISA 192 Query: 200 -GDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKY-- 256 G+ ++P PEVQ +IT +V E++++Y DG+QFDD+ P D T Y Sbjct: 193 AGEVAWMNPFHPEVQQFITDLVVEIITKYNADGIQFDDHM--SLPVDFGYDKYTINLYRQ 250 Query: 257 --------GGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLG 308 + WR + + +++ +K+ KP F VSP Sbjct: 251 ETGNPPPSNPQAQAWVKWRADKITAFMVQLNQAVKARKPNAIFAVSP------------- 297 Query: 309 SDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTR 368 AY D WV G++D + Q+Y ++ + T+ Sbjct: 298 --NYYDFAYKLQLQDWLNWVRLGVVDELVVQVY---RNDLQSFNSKL--ITPEIIETQQL 350 Query: 369 LYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQ 428 + GI ++ + +++ Q+ G + F + L + Sbjct: 351 IPTGIGIMTGLRNRQVS--------MSQIQSQVRAAQERG--LGAVFFYYESLWDY-APE 399 Query: 429 AVSYLQSRW 437 V+ Q+ + Sbjct: 400 PVAQRQASF 408 >UniRef50_Q8YV65 All2116 protein n=15 Tax=Cyanobacteria RepID=Q8YV65_ANASP Length = 416 Score = 216 bits (551), Expect = 1e-54, Method: Composition-based stats. Identities = 90/441 (20%), Positives = 158/441 (35%), Gaps = 86/441 (19%) Query: 15 PAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSS 74 A+++AL+++ S P + +TP A +RG+WL + Sbjct: 25 FALMMALSVVATVMLSFPLNAQITPSAALAS---------ELRGVWLTNI---------- 65 Query: 75 VNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIG 134 S ++ + L L +L NTV+ V G L+PSK+ + ++ I Sbjct: 66 ------DSDVLFERDRLKTSLQKLDKLNFNTVYPAVWNWGYTLYPSKVA--AKVIGRAID 117 Query: 135 ENP---GYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQ 191 P G D L+ ++ E HK+G+ V WF G + +S L++ Sbjct: 118 PTPGLQGRDMLKEIVTEGHKQGLTVIPWFE---------FGFMAPADSLLAKNRPQWLTS 168 Query: 192 HRDWIRTSG----DRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRL 247 + R DR L+P P+VQ +I ++ E+V Y +DG+QFDD+F P Sbjct: 169 RSNGSRIVKEGIHDRVWLNPFRPDVQQFIQDLIVEIVRNYDIDGIQFDDHFGL--PSELG 226 Query: 248 NDNETYRKYGGA-----------FASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPA 296 D T Y WR + + ++ IK+ K V+P Sbjct: 227 YDAYTVALYKKEHRGQAPSKNPRDPEWLRWRASKITNFMQRIFKAIKATKKDCLVSVAP- 285 Query: 297 GVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAK 356 +YD AD ++W GL++ + QIY + + Sbjct: 286 --------------NPQRFSYDYFLADWQKWERMGLIEELVLQIYRD---DLNVFVQELE 328 Query: 357 WWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILF 416 + + IGI + +++ Q+ +G F Sbjct: 329 YPEVKAAKAHIPVSIGILSGLKNRSVP----------IQQIQTQVQKVRD-RNFAGVSFF 377 Query: 417 REDYLNKPQTQQAVSYLQSRW 437 + L +Q+A + Q+ + Sbjct: 378 FYETLW-NLSQEASAKRQAGF 397 >UniRef50_Q7NL32 Glr1294 protein n=1 Tax=Gloeobacter violaceus RepID=Q7NL32_GLOVI Length = 796 Score = 216 bits (549), Expect = 2e-54, Method: Composition-based stats. Identities = 94/455 (20%), Positives = 161/455 (35%), Gaps = 86/455 (18%) Query: 19 VALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPM------------RGIWLATVSR 66 V ALL +ST PE PPA A Q++ + + R W + Sbjct: 247 VESALLTSDARSTAPEQF--PPAYRDAIARAQRTLKELPAMLKDGLDTQARAAWEDAIED 304 Query: 67 LDW-----------PPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGT 115 L W P V ++ + T ++ + D L + GINTVFF+ G Sbjct: 305 L-WAHYPTSQLAALPEVRAIWLDRGTIVKAGSEEGLTRIFDRLAQSGINTVFFETVNAGY 363 Query: 116 ALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIR 175 ++PS + P + + G+DPL + AH+R M++HAW + I Sbjct: 364 TIYPSAVAPAQNPLIR------GWDPLAAAVRLAHERKMELHAWTWAFAAGNTRHNALIG 417 Query: 176 E----LNSTLSQQPASVYVQHRDWIRTSGD-RFVLDPGIPEVQDWITSIVAEVVSRYPVD 230 + L+ P + +R +G + +DP PEV+ ++ S+ E+++ Y VD Sbjct: 418 KSQDFPGPVLAAHPGWAQSGRKGNLRPAGQPEYWMDPANPEVRAYLQSLYEEILTNYDVD 477 Query: 231 GVQFDDYFYT------ESPGSRLNDNETYRKYGGA------------FASKADWRRNNTQ 272 G+QFD Y + G ++ + G +A ++ Sbjct: 478 GLQFDYIRYPLQKNAGQYFGYSPAARRSFAQLTGVDPIDIAPEESSLWALWTRFKAEQVS 537 Query: 273 QLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGL 332 +A+ + ++ IKP + + +P G R D W QG Sbjct: 538 SFVAESAEKLRRIKPRLIVSAAVF-------PNPPGERLRL------LQQDWEAWAIQGN 584 Query: 333 LDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMING 392 +D + P Y +R L + VK + + + + Sbjct: 585 IDLLVPMTYALNTRRLQ---QLVEPTLPGVKEAPVLILPSLNLMSLPQ------------ 629 Query: 393 GVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQ 427 +L+ QL +P G LF +L Q Sbjct: 630 --VQLRDQLQAVRDLPS-GGYSLFAAAHLADNHQQ 661 >UniRef50_B1WZU0 Putative uncharacterized protein n=2 Tax=Cyanothece RepID=B1WZU0_CYAA5 Length = 421 Score = 213 bits (541), Expect = 1e-53, Method: Composition-based stats. Identities = 79/466 (16%), Positives = 155/466 (33%), Gaps = 86/466 (18%) Query: 7 NKKLTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSR 66 N++ + R ++ + +S+ Q RG+WL V+ Sbjct: 2 NRQFFLWRNRLICIALTFIILLILFVSQSIFQSS----GKVIASSIFQERRGVWLTNVA- 56 Query: 67 LDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWS 126 S ++ + L +L NTV+ V G +PS + Sbjct: 57 ---------------SSVLFVPGSVNRAIKQLSQLHFNTVYPVVWNRGHTFYPSSLAKEM 101 Query: 127 DLMTGKIGENPGY---DPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQ 183 + + N D L+ +++E+H+RG+ V WF + + + + Sbjct: 102 IGESQEPLLNWTRSNIDVLRVIIEESHQRGLAVIPWFEYGLMIPRSSLIAQKHPDWLTHS 161 Query: 184 QPASVYVQHRD---------------------WIRTSGDRFVLDPGIPEVQDWITSIVAE 222 Q +V +D + + + L+P PEVQ I ++ E Sbjct: 162 QQGTVNTFFQDELKTKNKKKSTNFLENWSQHSYQKRASQLVWLNPFHPEVQQLIKGLMLE 221 Query: 223 VVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFA-----------SKADWRRNNT 271 ++ +Y VDGVQ DD+F P D T + Y +WR Sbjct: 222 IIMQYKVDGVQLDDHFGI--PVELGYDPLTIKLYQQEHEGKNPPNDPYNAQWMNWRAKKL 279 Query: 272 QQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQG 331 + + TIK + P + +SP + +Y D + WV+QG Sbjct: 280 TAFMTDLVTTIKIVNPDILISLSP---------------NSYSFSYQNYLQDWKTWVKQG 324 Query: 332 LLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMIN 391 L+D + Q+Y ++ + + + + IGI + P Sbjct: 325 LIDELVLQVY---RNDMDSFNRELQESTVKLARQKIPVSIGILSGTLNNPV--------- 372 Query: 392 GGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQSRW 437 + ++++Q++ G F + L + ++ + + Sbjct: 373 -KIEQIRQQVEKVRQQ-GFDGVSFFYWESLWGYLSPESPYKRRRIF 416 >UniRef50_B4WJG2 Putative uncharacterized protein n=1 Tax=Synechococcus sp. PCC 7335 RepID=B4WJG2_9SYNE Length = 453 Score = 211 bits (537), Expect = 4e-53, Method: Composition-based stats. Identities = 89/477 (18%), Positives = 147/477 (30%), Gaps = 100/477 (20%) Query: 12 IRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPA-------TTQQSSQPMRGIWLATV 64 I+R I L++ + P S++ P G +P + +RG+W+ V Sbjct: 16 IKRTGIFCVAVLVVF--LTGPLGSLLVPSTGGEPTTLDHLVGSKSSSLDSEVRGVWVTNV 73 Query: 65 SRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILP 124 + S + ++ L + NTV+ V G ++ S + Sbjct: 74 A----------------SSVFFMPWGIASTIEQLADMRFNTVYPVVWNRGQTIYRSDRMK 117 Query: 125 ---WSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNP-YRVSVNTKPGTIRELNST 180 D+ +P DPL M+ H++G++V WF + V + ++ T Sbjct: 118 EITQRDISPLVGLMHPREDPLAEMIRRGHQKGLRVIPWFEYGFMVPLQSRLAQAHPDWLT 177 Query: 181 LSQQPASVYVQH----------RDWIRTSGDRF-------------------VLDPGIPE 211 + + + S L+P P Sbjct: 178 ARADGSQRLSEDTFVNGPIEETPELETASESAMARSKRLHRLLKSGAPSELGWLNPLHPN 237 Query: 212 VQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKY-----------GGAF 260 VQ I +V EV + Y VDG+QFDD+F P D T Y A Sbjct: 238 VQALILDLVDEVTTYYDVDGIQFDDHF--SFPIEFGYDAFTVALYEAEHEGQLPPLDPAD 295 Query: 261 ASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDES 320 WR + + +K F +SP + AY Sbjct: 296 KDWIHWRAEKLSGFVNTLQKRVKETCSDCVFSLSP---------------NPASYAYQYY 340 Query: 321 YADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGE 380 D + W E+G LD + QIY ++ R + IGI G Sbjct: 341 AQDWQTWAEKGWLDELVVQIY---RNDLDQFAAELTKETLQSIRDRIPVSIGILTGTWGS 397 Query: 381 PSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQSRW 437 P E ++ +Q+ + SG F D L T + + + Sbjct: 398 PIAFE----------QISQQVISSRDH-HFSGVSFFYWDTLWSYFTPEPPQQRRQNF 443 >UniRef50_Q10YX0 Putative uncharacterized protein n=2 Tax=Cyanobacteria RepID=Q10YX0_TRIEI Length = 1099 Score = 211 bits (537), Expect = 4e-53, Method: Composition-based stats. Identities = 76/401 (18%), Positives = 137/401 (34%), Gaps = 77/401 (19%) Query: 43 SKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLG 102 + P Q++ +R +WL T ++ + + L G Sbjct: 593 NNYPTDGQRAGAEIRAVWL----------------DRGTIVRARSERGLAGVFNRLAAAG 636 Query: 103 INTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNP 162 INTVFF+ G ++PS + P + +T +DPL+ + AH+R M++H W Sbjct: 637 INTVFFETINAGYTIYPSNVAPRQNPLTTS------WDPLKAAVKLAHERNMELHPWIWA 690 Query: 163 YRVSVNTKPGTI----RELNSTLSQQPASVYVQHRDWIRTS-GDRFVLDPGIPEVQDWIT 217 + V + L +S P+ V R R + +DP PEV+ ++ Sbjct: 691 FAVGNKAHNQALGQGDSYLGPVISAHPSWVMTDKRGRKRHPLDGKVYMDPANPEVRQYLL 750 Query: 218 SIVAEVVSRYPVDGVQFDDYFYTES-------PGSRLNDNETYRK-YG-----------G 258 +I+ E+ SRY VDG+ D Y G +R+ YG Sbjct: 751 NIIDEIASRYEVDGIHLDYIRYPFQNPERNFSYGYSTIARNQFRQLYGIDPMKISSRDRQ 810 Query: 259 AFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYD 318 +++ N +A S +K P + F V+ +D Sbjct: 811 NLWRWTEFKINQVNSFVANTSSFLKKKYPRLIFSVAVFP-------------FPRHQRFD 857 Query: 319 ESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKV 378 + D WV +D + P Y + R+ + + + T + + + Sbjct: 858 QIQQDWESWVMNEDIDLLTPMTY---ALDTNRFQRITQPLTNTGVLGSTLITPAVKLLNI 914 Query: 379 GEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFRED 419 E ++ Q+ +P G I+F + Sbjct: 915 PEIVAVD--------------QIQAARDLP-TGGYIIFAAE 940 >UniRef50_C1D2P2 Putative uncharacterized protein n=2 Tax=Deinococcus RepID=C1D2P2_DEIDV Length = 521 Score = 211 bits (537), Expect = 4e-53, Method: Composition-based stats. Identities = 83/445 (18%), Positives = 144/445 (32%), Gaps = 72/445 (16%) Query: 10 LTIRRPAILVALALLLCSCKSTPPESMV--TPPAGSKPPATTQQSS----QPMRGIWLAT 63 +T + A+L A +LLL +C + P S + G + P Q +RG+W+ Sbjct: 1 MTHKLTAVL-ATSLLLAACGTAPQSSDLDALDTQGVRTPGPHDSPRGRGQQELRGLWVDA 59 Query: 64 VSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKIL 123 + + + + +N +F QV G + + Sbjct: 60 F-----------------GPGMKTPAEIDVLVATARAMNVNVLFAQVGRRGDCYCNNAAM 102 Query: 124 PWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSV----NTKPGTIRELNS 179 P T G+DPL ++ +AH +G++VHAW + T P + Sbjct: 103 PR----TNDPAVPAGFDPLADLITKAHAQGIQVHAWIITTAIWNSTTPPTDPAHAFNAHG 158 Query: 180 TLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFY 239 + D G+ ++LDPG P+ ++I ++ VV Y VDG+QFD Y Sbjct: 159 LGKTGRDFWLMVKNDGTTRGGNDWLLDPGHPDAAEYIRNMYVSVVKNYDVDGIQFDRVRY 218 Query: 240 TES-----PGSRLNDNETYRKYG----------GAFASKADWRRNNTQQLIAKVSHTIKS 284 T+ P + + +Y + WR L+ + + +K+ Sbjct: 219 TDFNPVGGPSNWGYNPTALERYRAETGATGMPLPGDPQWSAWRMQQVTNLVRETALAVKA 278 Query: 285 IKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPF 344 KP V + +++ + + Y E D WV++G LD Y Sbjct: 279 TKPDVSVNAATITYGAGPANET---EWLRSRPYTEVLQDWVTWVKEGYLDVNVMMNYKRD 335 Query: 345 SRSAARYDVLAKWWADVVKP-----TRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKK 399 A + W G A Y S + W Sbjct: 336 FVPAQS--LWFDQWNQFAASLQRVAPDVHQVSGSAIYLNDIASSVNQVWKT--------- 384 Query: 400 QLDLNDAVPEISGTILFREDYLNKP 424 +SG + +K Sbjct: 385 ------RQAGLSGWAGYSYRTPDKD 403 >UniRef50_B5WA73 Putative uncharacterized protein n=2 Tax=Arthrospira RepID=B5WA73_SPIMA Length = 476 Score = 211 bits (536), Expect = 5e-53, Method: Composition-based stats. Identities = 79/401 (19%), Positives = 146/401 (36%), Gaps = 72/401 (17%) Query: 53 SQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKP 112 +RG+W+ T + + Q + + + L + NT++ V Sbjct: 122 KTEIRGVWMTT----------------NDTDVLMNQPRLEEAVSKLAQFNFNTIYPVVWN 165 Query: 113 DGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPG 172 G + S ++ + + G D L +++ AH+ + V WF G Sbjct: 166 SGYVTYKSSVVKEAGIQPFVRRGFQGQDMLADIIERAHRHNLLVLPWFE---------FG 216 Query: 173 TIRELNSTLSQQPASVYVQHRDWIRTS----GDRFVLDPGIPEVQDWITSIVAEVVSRYP 228 + +S L+ + + Q RD +TS G+ L+P P+VQ ++T ++ EVV+ Y Sbjct: 217 FMAPPSSELALKHPNWLTQQRDGTKTSISAAGEVVWLNPFHPQVQKFMTDLIVEVVTDYD 276 Query: 229 VDGVQFDDYFYTESPGSRLNDNETYRKY----------GGAFASKADWRRNNTQQLIAKV 278 +DGVQFDD+ T P + D T Y + WR + + ++ Sbjct: 277 IDGVQFDDH--TSLPSTFGYDPYTISLYQRETNRTPPSNPQDPAWVRWRAHKITAFMRQL 334 Query: 279 SHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAP 338 IK+ KP VSP AY+ D WV GL+D + Sbjct: 335 HQAIKAKKPHSIISVSP---------------NPYHIAYNGHLQDWVTWVRDGLVDELVV 379 Query: 339 QIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELK 398 Q+Y R + +A+ +K + ++ G+ + +++ + Sbjct: 380 QVY----RDELDF-FIAELNRPEMKAAQNKISTGVGILTGLRTRPVPINFIQSKVRAARD 434 Query: 399 KQLDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQSRWGS 439 +Q G F + L + ++ Q + S Sbjct: 435 RQF----------GVAFFFYESLWD-HAAEPLAQRQYSFQS 464 >UniRef50_A8YDR3 Genome sequencing data, contig C294 n=9 Tax=Chroococcales RepID=A8YDR3_MICAE Length = 875 Score = 210 bits (535), Expect = 7e-53, Method: Composition-based stats. Identities = 75/410 (18%), Positives = 139/410 (33%), Gaps = 65/410 (15%) Query: 57 RGIWLATVSRLDW--PPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDG 114 R +W + + P + ++ + T ++ + D + GIN VFF+ Sbjct: 385 RNLWDNYPTNRQFAQPEIRAMWLDRGTIVQAKNEEDLAKVFDRMAAAGINVVFFETVNAS 444 Query: 115 TALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKP--- 171 ++PS++ P + +T G+DPL+ + AH+R M++HAW + + Sbjct: 445 YTIYPSQVAPEQNPLTR------GWDPLKVAVKLAHERNMEIHAWVWVFAAANQAHNKVL 498 Query: 172 -GTIRELNSTLSQQPASVYVQHRDWIRTSG---DRFVLDPGIPEVQDWITSIVAEVVSRY 227 + L LS+ + DP PEVQ+++ S+ E+V Y Sbjct: 499 EQPLNYLGPVLSRNSDWGATNKSGGSFDYSQGTKKAFFDPANPEVQNYLLSLYEEIVKNY 558 Query: 228 PVDGVQFDDYFYTESPGSRLN------------------DNETYRKYGGAFASKADWRRN 269 VDG+Q D Y + D T G + ++ Sbjct: 559 DVDGLQLDYIRYPFQNQNYNQTYGYGKSSRWLFKQMTGVDPITLNPRGALWEQWTSFKIR 618 Query: 270 NTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVE 329 +++VS +K I+P ++ + + + + W + Sbjct: 619 QVDTFVSQVSTRLKQIRPQLKMSAAVFPLEQKERLY-------------RIQQNWEEWGQ 665 Query: 330 QGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWM 389 +D I Y + + + D + GI KV + I+ Sbjct: 666 NQWIDIIFLMTY---ALDTGTLEDKTQSLFDRQIAGNALIIPGIRLLKVPDQVTID---- 718 Query: 390 INGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQSRWGS 439 QL +P SG LF + LN P Q ++ +Q + Sbjct: 719 ----------QLQFIRNLP-TSGFALFATENLN-PNLQTILNRIQGSIIT 756 >UniRef50_A0YRE2 Putative uncharacterized protein n=1 Tax=Lyngbya sp. PCC 8106 RepID=A0YRE2_9CYAN Length = 574 Score = 208 bits (530), Expect = 2e-52, Method: Composition-based stats. Identities = 74/421 (17%), Positives = 141/421 (33%), Gaps = 76/421 (18%) Query: 21 LALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNP 80 + L E P + +RG+WL + Sbjct: 172 ITTFLSQALLKAEEESSVPKEYLVKAVEIDAPNGEIRGVWLTNI---------------- 215 Query: 81 TSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENP--- 137 S ++++ +D L L NT++ V G +PS+++ ++ ++ P Sbjct: 216 DSDVLFSPTSVVEAIDSLSELNFNTLYPVVWNRGFTQFPSQVMK--RIIGTELDPAPELA 273 Query: 138 GYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIR 197 G D LQ ++ +A + M V WF G + +S Q + +++ I Sbjct: 274 GRDVLQEIITQAKAKNMSVMPWFE---------FGFMVPQDSQFLQSRPNWITTNKEGIP 324 Query: 198 T----SGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETY 253 R +P P+VQ +I ++ EVVS+Y +DG+QFDD+F P D T Sbjct: 325 FVKEEDKYRVWFNPFNPQVQQFILDLIVEVVSKYDIDGIQFDDHFGL--PFELGYDEFTS 382 Query: 254 RKYG-----------GAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNR 302 + Y WR + + ++ +K KP +SP Sbjct: 383 KLYQRENDGKLPPSDPKDQDWVKWRADKLTDFMMRLFWVVKDYKPDCIISLSP------- 435 Query: 303 SHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVV 362 + AYD D W + G ++ + Q+Y + + + Sbjct: 436 --------NPKSYAYDNYLQDWPTWEQSGFIEELVLQVYRD---DPKAFKADLEAAEVLN 484 Query: 363 KPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLN 422 + +GI + + +++Q+ + + +G F + L Sbjct: 485 AKVNIPVAVGILTGLKNQSVPLST----------VQEQVAESRR-RKFAGVSFFFYETLK 533 Query: 423 K 423 Sbjct: 534 S 534 >UniRef50_A8YI06 Similar to tr|Q8YPV9|Q8YPV9 n=8 Tax=Chroococcales RepID=A8YI06_MICAE Length = 438 Score = 207 bits (526), Expect = 8e-52, Method: Composition-based stats. Identities = 88/429 (20%), Positives = 157/429 (36%), Gaps = 82/429 (19%) Query: 9 KLTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLD 68 + +++ IL+ LA L + G A +Q +RG+W+ T Sbjct: 11 RQILKKFPILLFLASFL-----------IVVFLGYFSTAFSQSRDPDIRGVWITT----- 54 Query: 69 WPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDL 128 + + R + QQ ++ L L N ++ V G AL+PS I + Sbjct: 55 ------NDTAMLMDRDKRQQA-----IEQLVNLNFNAIYPVVWNSGYALYPSAIAQREGI 103 Query: 129 MTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASV 188 G D L ++++ RG+ V WF G + S L+ + + Sbjct: 104 QPFVPTGAQGQDILAELVEQTRGRGLLVIPWFE---------FGFMAPPTSELALKHQNW 154 Query: 189 YVQHRD----WIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPG 244 Q RD W+ +G+ L+P PEVQ+++ +V EVV +Y ++G+QFDD+ P Sbjct: 155 LTQKRDGGTTWVGAAGEVVWLNPFRPEVQNFLRELVLEVVGQYDINGIQFDDH--LSLPN 212 Query: 245 SRLNDNETYRKYGGA----------FASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVS 294 D T Y WR + +A + +I++IKP + ++ Sbjct: 213 EFGYDPYTIALYQQETEKTPPANPRDPEWTKWRADKITAFLANLKQSIEAIKPNILLSIA 272 Query: 295 PAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVL 354 P AY+ D WV QGL+D + Q+Y P + Sbjct: 273 P---------------NPYEFAYNGHLQDWLAWVRQGLVDELIVQVYRPDLP---SFLKQ 314 Query: 355 AKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTI 414 + ++ T+ + G+ +P +++++ G I Sbjct: 315 IER--PEIQETQQTIPTGVGVLTGLRN--------RPIALPLIEEKVLAARQRG--LGVI 362 Query: 415 LFREDYLNK 423 F + L + Sbjct: 363 FFFYESLWQ 371 >UniRef50_P74735 Slr0592 protein n=1 Tax=Synechocystis sp. PCC 6803 RepID=P74735_SYNY3 Length = 491 Score = 206 bits (525), Expect = 1e-51, Method: Composition-based stats. Identities = 92/438 (21%), Positives = 154/438 (35%), Gaps = 78/438 (17%) Query: 13 RRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPV 72 ++P + + LL S P S+ Q + +RG+W+ Sbjct: 15 KKPFLHNLVLGLLISLAIVHPFSLFNQ-------VQAQNAFPEIRGVWITN--------- 58 Query: 73 SSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGK 132 + + Q + ++ L L NT++ V G L+ S+ L Sbjct: 59 -------NDTVHFLDQNRTTESINLLADLNFNTIYPVVWNSGYVLYESEFAKREGLQPFS 111 Query: 133 IGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQH 192 + G D L ++D+AH+R M V WF G S L ++ + Q Sbjct: 112 PRGDQGQDVLADIIDKAHRRNMLVLPWFE---------FGFKAPPMSELVKRHPWWFTQK 162 Query: 193 RDWIRTS----GDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLN 248 RD +TS G+ ++P P+VQ +IT +V + V++Y +DGVQFDD+ T P Sbjct: 163 RDGTKTSVSAAGEVMWMNPFHPQVQTFITQLVMDAVNKYDLDGVQFDDH--TALPNEFGY 220 Query: 249 DNETYRKY----------GGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGV 298 DN T Y + WR + + +++ IK+ KP + VSPA Sbjct: 221 DNYTISLYQQETKKTPPSNPKDPAWIRWRADKITAFMVQLNARIKAAKPNILVSVSPATY 280 Query: 299 WRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWW 358 AY+ D W+ +G++D + Q+Y + Sbjct: 281 ---------------NLAYNTFLQDWLDWIRKGIVDEVIVQVYRT------SLPTFTEPI 319 Query: 359 ADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFRE 418 L G P+K P ++N + + A G F Sbjct: 320 QRAEFRESKTLIPTAVGILTGLPTKQVPMPLVNDK-------VYASRAQG--MGVSFFYY 370 Query: 419 DYLNKPQTQQAVSYLQSR 436 L ++ +QS Sbjct: 371 QTLWDIAPEEKDDRIQSF 388 >UniRef50_B4AVG6 Putative uncharacterized protein n=1 Tax=Cyanothece sp. PCC 7822 RepID=B4AVG6_9CHRO Length = 423 Score = 205 bits (522), Expect = 2e-51, Method: Composition-based stats. Identities = 95/395 (24%), Positives = 145/395 (36%), Gaps = 64/395 (16%) Query: 56 MRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGT 115 +RG+WL V S +Q +I+ L+ L G NTVF V G Sbjct: 4 IRGVWLTNV----------------GSEVLNSRQNIINALNLLADTGFNTVFPVVWNKGF 47 Query: 116 ALWPSKILPWSDLMTGKIGENP---GYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPG 172 +PS+++ L T +P G DPL +++ A G+ V WF K G Sbjct: 48 TQYPSQVM----LQTFNQEIDPAFAGRDPLAEVIEAAKNVGIDVIPWFEYGFACSYQKNG 103 Query: 173 TIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGV 232 ++ +P + + ++ PEVQ++I S+V EV Y V GV Sbjct: 104 -----GHIIASKPHWAAKDINNQLLNKNGFEWMNAFEPEVQNFILSLVLEVARNYDVAGV 158 Query: 233 QFDDYFYTESPGSRLNDNETYRKY----------GGAFASKADWRRNNTQQLIAKVSHTI 282 Q DD P D +T +Y A WR + +S + Sbjct: 159 QGDD-RLPALPCEGGYDEKTRARYYSEQGVKPPQNIKDAKWLQWRAALLTNFLGNLSREV 217 Query: 283 KSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYW 342 K+IK + +S P G Y E D+ W+ Q ++D I PQ+Y Sbjct: 218 KAIKNDLLVSISS-------HPYPFG--------YHEYLQDSPTWIRQKIVDVIHPQLYR 262 Query: 343 PFSRSAARYDVLAKWWADVVKPTR-TRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQL 401 Y L + P TRL+ G+ ++ P K + PE Q Sbjct: 263 RT---LKDYQALVETTLKQFSPDDLTRLFPGV-LIRLNAPGKPQ----DFHISPEQLWQT 314 Query: 402 DLNDAVPEISGTILFREDYLNKPQTQQAVSYLQSR 436 L + I G + F + LN Q +LQ++ Sbjct: 315 ILINRRLGIRGEVFFFFEELN-VNAQSLAQFLQAK 348 >UniRef50_A0YS74 Putative uncharacterized protein n=2 Tax=Oscillatoriales RepID=A0YS74_9CYAN Length = 1005 Score = 205 bits (522), Expect = 2e-51, Method: Composition-based stats. Identities = 76/405 (18%), Positives = 136/405 (33%), Gaps = 81/405 (20%) Query: 43 SKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLG 102 P +++ +R +WL T + + D L + G Sbjct: 517 ENYPIDGERAGAEIRAVWL----------------DRGTIVQARGEAGLAKIFDQLAQAG 560 Query: 103 INTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNP 162 INTVFF+ G ++P+++ P + +T G+DPL + AH+RGM++HAW Sbjct: 561 INTVFFETVNAGYPIYPTRVAPQQNPLT------QGWDPLASGVKLAHERGMELHAWLWT 614 Query: 163 YRVSVNTKP----GTIRELNSTLSQQPASVYVQHRDWIRTS-GDRFVLDPGIPEVQDWIT 217 + + L L+ P R + + LDP EV+ +I Sbjct: 615 FATANQRHNTLVNQPTSYLGPVLTAHPDWANRDSRGRVWHERDGKAYLDPANREVRSYIL 674 Query: 218 SIVAEVVSRYPVDGVQFDDYFYTESPGSRLND-------NETYRKYGGA----------- 259 +V E+V Y VDG+Q D Y +R + E +R+ G Sbjct: 675 RLVGEIVHNYDVDGIQLDYIRYPFQDPNRNFNFGYGTAGREQFRQLTGVDPISVSPKDSQ 734 Query: 260 -FASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYD 318 + ++R ++ +VS ++ P V F V+ P R Sbjct: 735 LWQQWVNFRVEQVSTMVREVSQLLRKQYPDVIFSVAVF-------PHPEQDRIRK----- 782 Query: 319 ESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKV 378 W +Q +D + P Y + R + + + T + + Sbjct: 783 -IQQHWETWAQQNYVDLVVPMTYSLDTNRLQR---ITQPLTHSDRLGATLIVPSVKLL-- 836 Query: 379 GEPSKIEPDWMINGGVPELK--KQLDLNDAVPEISGTILFREDYL 421 +PE+ Q+ +P G +F + + Sbjct: 837 --------------DIPEIVAIDQIQALRDLP-TGGYSIFAVESI 866 >UniRef50_B7JXY5 Putative uncharacterized protein n=9 Tax=Cyanobacteria RepID=B7JXY5_CYAP8 Length = 427 Score = 205 bits (521), Expect = 3e-51, Method: Composition-based stats. Identities = 88/431 (20%), Positives = 151/431 (35%), Gaps = 86/431 (19%) Query: 20 ALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISN 79 L L++ S + + PA S+ P+ +RG+WL + Sbjct: 26 LLFLVIFSLSVVLILATLQYPAQSRTPS-------EIRGVWLTNI--------------- 63 Query: 80 PTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWS-DLMTGKIGENPG 138 S Q ++ D + L++L NT++ V G L+PS + G Sbjct: 64 -DSEVLFSQNSLSDGIRTLKQLNFNTLYPTVWNWGHTLYPSPVAKKVIGTPLDPTEGLQG 122 Query: 139 YDPLQFMLDEAHKRGMKVHAWFNP-YRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIR 197 D LQ ++D+ H+ M V WF + +++ T Q ++++ Sbjct: 123 RDMLQEIIDQGHQANMAVIPWFEFGFMAPADSQLAIKYPQWLTERQNGDKIWLEGN---- 178 Query: 198 TSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYG 257 R L+P PEVQ +ITS+V E+VS Y +DG+QFDD+F P D+ T + Y Sbjct: 179 -VHKRVWLNPLKPEVQQFITSLVTEIVSNYSIDGIQFDDHFGI--PFDFGYDDFTLQLYQ 235 Query: 258 -------------------------GAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFG 292 + WR N + ++ IK+I P V Sbjct: 236 QEHQGKLPPKPPQNVKTENNCSINSQEWKEWTQWRANKITGYMTELFKAIKTINPNVIVS 295 Query: 293 VSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYD 352 VSP + + AD ++W +GL++ + Q+Y + + Sbjct: 296 VSP---------------NPQPFSVNCYLADWQQWERRGLVEELVLQVY---RNNLNSFK 337 Query: 353 VLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISG 412 +GI G P + ++K Q++ +G Sbjct: 338 QELSRPEVQQAKKHIPFGVGIISGLKGRPV----------SMKQIKSQVETTRQQK-FTG 386 Query: 413 TILFREDYLNK 423 F + L Sbjct: 387 VSFFFYESLWN 397 >UniRef50_B2IV00 Putative uncharacterized protein n=4 Tax=Cyanobacteria RepID=B2IV00_NOSP7 Length = 381 Score = 203 bits (517), Expect = 7e-51, Method: Composition-based stats. Identities = 78/368 (21%), Positives = 133/368 (36%), Gaps = 48/368 (13%) Query: 71 PVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMT 130 + ++ S+ +Q + + +D L G N VF V L+PS+ + + Sbjct: 5 ETRGIWLTTTDSKVLRSKQRIAEAMDLLAETGFNVVFPVVWNKAVTLYPSQTMQET-FGV 63 Query: 131 GKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYV 190 + G DPL+ ++ EA + G+KV WF S G + L ++P Sbjct: 64 EIDPMSVGRDPLEEVVVEARRVGLKVIPWFEYGFASSYNLNGGV-----LLQKKPEWAAR 118 Query: 191 QHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDN 250 + L+ +VQ++ ++V EVV Y VDGVQ DD P D Sbjct: 119 DFNGNLLNKNGFEWLNALDSQVQEFFLNLVLEVVKNYDVDGVQGDD-RLPAFPCEGGYDE 177 Query: 251 ETYRKY----------GGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWR 300 T +Y WR + +A++ +K++ P + ++P Sbjct: 178 GTVSRYRQEYDRNPPQNPKDRQWLQWRADILTDFLARLYGEVKAVNPNLLVAIAP----- 232 Query: 301 NRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWW-- 358 A+ E D+ W+++G++D I PQIY Y +A Sbjct: 233 ----------NIHDWAFQEYLQDSPTWLKRGIVDMIQPQIYRRDF---GSYCAIADKLVS 279 Query: 359 ADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFRE 418 T RL GI K+G L + ++ N + I G + F Sbjct: 280 QQFTDATLPRLAPGI-LMKLGSYC---------ISPEYLVQAIEYNRQL-GIQGEVFFFY 328 Query: 419 DYLNKPQT 426 + L + Sbjct: 329 EGLRENNN 336 >UniRef50_A6DH63 Putative uncharacterized protein n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DH63_9BACT Length = 225 Score = 199 bits (505), Expect = 2e-49, Method: Composition-based stats. Identities = 81/239 (33%), Positives = 120/239 (50%), Gaps = 22/239 (9%) Query: 56 MRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGT 115 MR W+A+V+ DWP + QQ+ D LD +L +NT+ FQV+P G Sbjct: 1 MRAAWVASVANTDWPSKQGL-------SVAQQQKECRDLLDLAVQLKLNTIIFQVRPHGD 53 Query: 116 ALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIR 175 AL+ S PWSD +TG G+ PGYDPLQ+ +D+ HKR +K+HAWFNPYRV T + Sbjct: 54 ALYKSSFEPWSDRLTGIQGKYPGYDPLQYWIDQCHKRKLKIHAWFNPYRVQHPTVKEPLA 113 Query: 176 ELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFD 235 + +P +Y++ L+P V+ ++ ++V + RY +DG+ D Sbjct: 114 SNSLQRKAKPWCIYLKK--------GYVWLNPASKAVRQYVQTVVFDCARRYNIDGIHLD 165 Query: 236 DYFYTES---PGSRLNDNETYRKY----GGAFASKADWRRNNTQQLIAKVSHTIKSIKP 287 DYFY P + D++ Y Y K WRR+ LI + +K +KP Sbjct: 166 DYFYPYKDFLPATGFPDHKEYSAYLSSKPQKVMDKEMWRRHQVNTLIYSLHKGLKRLKP 224 >UniRef50_C1D298 Putative uncharacterized protein n=1 Tax=Deinococcus deserti VCD115 RepID=C1D298_DEIDV Length = 628 Score = 197 bits (501), Expect = 6e-49, Method: Composition-based stats. Identities = 73/411 (17%), Positives = 131/411 (31%), Gaps = 66/411 (16%) Query: 35 SMVTPPAGSKPPATTQQSSQP------MRGIWLATVSRLDWPPVSSVNISNPTSRARVQQ 88 V PA + A ++++ P MRG+W+ Sbjct: 228 KSVQKPAATHQVAESRRAPGPLQTGPAMRGLWVDAF-----------------GPGFKTP 270 Query: 89 QAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDE 148 + + + + INT+F Q G +LP T +DPL +L + Sbjct: 271 GEVDRLIADARAMNINTLFVQAVKRGDCYCNGSLLPR----TEDPAVPAEFDPLADVLTK 326 Query: 149 AHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPA----SVYVQHRDWIRTSGDRFV 204 AH G+KVHAW P VS + ++ +++ +G+ Sbjct: 327 AHAHGIKVHAWVIPTAVSNRAVRYPVTNPEHVVNAHGEGDEQDWLMRNSGGSMWAGNDQQ 386 Query: 205 LDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPG---SRLNDNETYRKYG---- 257 LD G P+ + ++ + V + Y +DGVQ D Y + G + Y Sbjct: 387 LDIGHPDARRYMVDAIQSVAAAYNIDGVQLDRVRYPDPSGTVQDWGYNPGAVAAYQAESE 446 Query: 258 ------GAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDT 311 A WRR L+++VS ++S +PG V+ Sbjct: 447 TTETPAPGDARWTAWRREQVNALVSEVSGAVRSARPGTVISVAAITYGAGPR---TREAF 503 Query: 312 RGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVK--PTRTRL 369 Y E D W+ G +D + Y + + D W K ++ Sbjct: 504 ASTRTYAEVLQDWPLWLADGNVDLVVLMNYKREAHAGQARDF--DSWNRFAKSVKAGGQV 561 Query: 370 YIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDY 420 G A Y + ++++ ++ + G + + Sbjct: 562 AAGTALYLNTAQENLV----------QVRRAVE-----QGLDGWVGYSYRT 597 >UniRef50_Q8EPF4 Hypothetical conserved protein n=1 Tax=Oceanobacillus iheyensis RepID=Q8EPF4_OCEIH Length = 502 Score = 197 bits (501), Expect = 7e-49, Method: Composition-based stats. Identities = 74/393 (18%), Positives = 143/393 (36%), Gaps = 49/393 (12%) Query: 8 KKLTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRL 67 K++ + I++ + +++ S TP + A+ Q+ + +R W+ Sbjct: 2 KRIRTKGVTIVIVMLIVISSLTLTP---------FTTSEASFQKENPFIRAFWVQAFE-- 50 Query: 68 DWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSD 127 + + + +D + + +NT+ QV A + S +LP Sbjct: 51 ---------------PGLKTPEEIDELVDDVHKANMNTIIAQVSRRHDAYYQSDVLP--- 92 Query: 128 LMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPAS 187 T G+DPL ++L +AH++G++VHAW + + + + Sbjct: 93 -FTEDPSVPEGFDPLGYLLTKAHEKGIEVHAWVVVGPMWHSVYGDAPSDPTHIWNLHGPD 151 Query: 188 VYVQHRDWIRTSGD----RFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESP 243 + +G+ + LD G PE ++ + +V ++ Y VDGV D Y E Sbjct: 152 AQEESWATEDYNGNVPYWQPYLDLGHPEARNHVVDMVNDIAKNYEVDGVHLDYIRYPEDG 211 Query: 244 -GSRLNDNETYRKYGG-------AFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSP 295 G + + G W+ LI +V + ++ VE Sbjct: 212 KGYNATSLARFHEETGRTDRPPVNDQEWIAWKVEQVDSLIKRVYTELLTVDSDVELS--- 268 Query: 296 AGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRS--AARYDV 353 A V H+P + ++ + + WV++G LDY Y + A R+D Sbjct: 269 AAVLSWGFHNPSNTHWWNMDPVQRAHQNWKEWVQEGYLDYAYVMNYDSDADPRRALRFDQ 328 Query: 354 LAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEP 386 +W D+ + + IG A Y + Sbjct: 329 WIEWQKDLPRNRG--IIIGPALYLNTVADSMNQ 359 >UniRef50_Q2JQ39 Putative uncharacterized protein n=1 Tax=Synechococcus sp. JA-2-3B'a(2-13) RepID=Q2JQ39_SYNJB Length = 850 Score = 192 bits (488), Expect = 2e-47, Method: Composition-based stats. Identities = 72/378 (19%), Positives = 129/378 (34%), Gaps = 64/378 (16%) Query: 71 PVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMT 130 V ++ + T + + D L + G+NTVFF+ G A+ PS++ P + +T Sbjct: 379 EVRAIWLDRSTIVEAGSEAGLAQIFDRLAQAGLNTVFFETMNAGFAIHPSRVAPQQNPLT 438 Query: 131 GKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPG------TIRELNSTLSQQ 184 G DPL+ + AH+RG+++HAW V + L+ Sbjct: 439 R------GRDPLRAAVRLAHERGLELHAWIWTLAVGNTRHNLLPEINLPQDYIGPVLTAH 492 Query: 185 PASVYVQHRDWIRTSGD-RFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTES- 242 P + +R + G LDP P+V+ ++ ++ E+V Y VDG+ D Y Sbjct: 493 PDWANLDNRGRLFPRGQPETWLDPANPQVRSYLLALTRELVQDYQVDGIHLDYIRYPFQN 552 Query: 243 ------PGSRLNDNETYRKYGGA-------------FASKADWRRNNTQQLIAKVSHTIK 283 G + +++ G + +R +++ ++ T + Sbjct: 553 AASRQVFGFGRAARQGFQQLSGVDPLELDPLRDRSLWQLWTRYRTQQVNEVVEAIARTAR 612 Query: 284 SIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWP 343 S+ P V + + W++ G LD + P Y Sbjct: 613 SLNPRVILSAAVYA-------------LPKQERLQRLQQNWEEWIQAGELDLLIPLTYAG 659 Query: 344 FSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDL 403 +R A L + ++V T + N E QL + Sbjct: 660 NTRRLA---QLVQPNLEIVSRFSTLFVPSLNLL--------------NLPPVEFLDQLQV 702 Query: 404 NDAVPEISGTILFREDYL 421 +P G LF L Sbjct: 703 VRDLP-TGGFALFSVRQL 719 >UniRef50_C6PCP2 Putative uncharacterized protein n=1 Tax=Thermoanaerobacterium thermosaccharolyticum DSM 571 RepID=C6PCP2_CLOTS Length = 1117 Score = 187 bits (475), Expect = 6e-46, Method: Composition-based stats. Identities = 72/396 (18%), Positives = 135/396 (34%), Gaps = 82/396 (20%) Query: 50 QQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQ 109 R +W+ ++ LD L+ + INT++ Sbjct: 322 ASEKVESRAVWI--------------------RPKEKNLDEVVRNLDMLKSININTIYLD 361 Query: 110 VKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNT 169 G ++P+ T + G+D L + EAHKRGM V+AW + + + Sbjct: 362 TFWSGYTIYPTNSK-----YTSQNPIYGGFDVLDAYIKEAHKRGMVVYAWTENFLIGTSD 416 Query: 170 KPGTIRELNSTLSQQPASVYVQHRDWIRTSGDR----FVLDPGIPEVQDWITSIVAEVVS 225 + + ++P + V + + T + L+P IPE +D+++ + E+ S Sbjct: 417 ----VSDGGPIKKEKPEWLMVSRKGYNYTLDKYGIKYYYLNPAIPEARDFLSELYKEIAS 472 Query: 226 RYPVDGVQFDDYFYT---ESPGSRLNDNET---YRKYGGA-----------FASKADWRR 268 +Y +DG+QFD + + D+ T +++Y G + +R Sbjct: 473 KYDIDGIQFDYIRFPNSNDYSNDFGYDDYTRNLFKQYAGVDPKYLNVNSDMWQLWNYFRM 532 Query: 269 NNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWV 328 N + V ++ IKP ++ A VW N P + + D++ W Sbjct: 533 NIVNTFVYSVVSELRMIKPEIKIA---ADVWPNYDTAPS-----------DIFQDSKDWT 578 Query: 329 EQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDW 388 + +D + P Y ++A + + Sbjct: 579 LKNYIDTLNPMSY------NMSVSLVANDLKNTLDFASGH-----------SNVIPAIGT 621 Query: 389 MINGGVPELKKQLDLNDAVPEISGTILFREDYLNKP 424 I L KQ++ SG LF + L K Sbjct: 622 FIGTDNVTLLKQIEAIRD-NNASGVGLFEFESLFKN 656 >UniRef50_B8HYQ9 Putative uncharacterized protein n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HYQ9_CYAP4 Length = 383 Score = 186 bits (472), Expect = 1e-45, Method: Composition-based stats. Identities = 81/413 (19%), Positives = 145/413 (35%), Gaps = 90/413 (21%) Query: 44 KPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGI 103 + PA S+Q RG+WL ++ + + + L L + G Sbjct: 25 RAPARPTASTQENRGLWLTSIGLAGLYHSTL----------------LDETLSDLSQRGF 68 Query: 104 NTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPY 163 NT++ V G L+PS+++P + + D L + E ++G+++ WF Sbjct: 69 NTLYPAVWNRGQTLYPSRVVPAAFTLG---------DVLSTTVREGKQQGLRIIPWFEY- 118 Query: 164 RVSVNTKPGTIRELNSTLSQQPASVYVQHRDWI--------------------RTSGDRF 203 + + + Q P + T D Sbjct: 119 -------GLKVTDRSVLARQHPDWLARDRNGRPYINPEPVNALPFPLKGLSRSVTGADHV 171 Query: 204 VLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYF-YTESPGSRLNDNETYRKYGG---- 258 VL+P P+VQ+ I + +VV RY VDG+Q DD+F G + +R+ G Sbjct: 172 VLNPIHPQVQNLIVKMFVDVVKRYNVDGIQIDDHFALPVQLGYDSYTRQRFRQEQGVEPP 231 Query: 259 ---AFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAA 315 + +WR N +L+ K+S IK KP + F ++P A Sbjct: 232 ADPTDPAWMEWRANKLTELVGKISTAIKQQKPAIIFSIAP---------------NPPAF 276 Query: 316 AYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAF 375 AY + D WV +G +D + Q+Y P ++Y G+ Sbjct: 277 AYRTTLQDWPTWVRRGYVDEVVVQVYRPTVAEM------------EAIAADPQIY-GLQA 323 Query: 376 YKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQ 428 Y + +L +++ + + +G LF ++ P + Sbjct: 324 YAPVSLGLYAGPGLKAKTGQQLDREVAVTRRLK-YNGFALFTWEFAIGPLARG 375 >UniRef50_B9XI64 Putative uncharacterized protein n=1 Tax=bacterium Ellin514 RepID=B9XI64_9BACT Length = 1083 Score = 186 bits (472), Expect = 1e-45, Method: Composition-based stats. Identities = 90/437 (20%), Positives = 153/437 (35%), Gaps = 83/437 (18%) Query: 6 RNKKLTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVS 65 RN ++ IR I+ L +L +Q R W A V Sbjct: 8 RNVRMRIRCLVIMAGLWFVLAI----------------------SSPAQEFRAAW-ADVF 44 Query: 66 RLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPW 125 + + VN T + ++ + ++ +G+N A W S ILPW Sbjct: 45 HVGMGSQTEVNNMVATLVSGHYNAVIVQVVGYMDGIGVN--------SHGAHWKSNILPW 96 Query: 126 SDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYR-VSVNTKPGTIRELNSTLSQQ 184 S +T G+DPL + +AH G++VHAW N+TL+ Sbjct: 97 SPRVT------AGFDPLAALCAQAHANGIEVHAWLGGSAGAMYRVSTAWPPAGNATLTAH 150 Query: 185 PASV----YVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDD---- 236 P + LD G P+ Q++I SIV E+V+ YP+DG+ +DD Sbjct: 151 PEWFIAPLANSEGGAPVLVDGNYDLDMGSPDAQEYIVSIVRELVTNYPIDGINWDDELNN 210 Query: 237 ------YFYTESPGSRLNDN--ETYRKYGGAFAS-------KADWRRNNTQQLIAKVSHT 281 + Y + ++ YR+ G + +++RR +L+A+V Sbjct: 211 AGYAAGFGYPALSQTNYPNSGLGRYRRNTGYVGTPPNTDTAWSNYRRRFKNELMARVQAE 270 Query: 282 IKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIY 341 I+SIK + S P G Y + D ++ G LD + PQ Y Sbjct: 271 IQSIKTNPRQPLRHTSAALAYSPYPTSCTFAGLVPYT-YFCDWAGMLQNGWLDAVIPQTY 329 Query: 342 WPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQL 401 S + A + A + +++ GI Y S + + Sbjct: 330 -----SLGTFTNWANFSASCWQ-YNRQIFPGIGAYLNTNAS--------------IANMI 369 Query: 402 DLNDAVPEISGTILFRE 418 ++ + G ++ Sbjct: 370 GYTRSI-GLKGNAIYSY 385 >UniRef50_B5W1E7 Putative uncharacterized protein n=2 Tax=Arthrospira RepID=B5W1E7_SPIMA Length = 910 Score = 185 bits (470), Expect = 2e-45, Method: Composition-based stats. Identities = 75/421 (17%), Positives = 131/421 (31%), Gaps = 78/421 (18%) Query: 43 SKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLG 102 P ++S +R +WL T A + + D L G Sbjct: 426 ENYPTDGERSGAEIRAVWL----------------DRGTIVAARGEAGLAQIFDRLADAG 469 Query: 103 INTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNP 162 INTVFF+ G ++PS++ P + +T G+DPL + A RGM++HAW Sbjct: 470 INTVFFETVNAGYTIYPSRVAPSQNPLT------VGWDPLAAAVKLAKARGMELHAWVWV 523 Query: 163 YRVSVNTKPGTI----RELNSTLSQQPASVYVQHRDWIRTSGDRF-VLDPGIPEVQDWIT 217 + ++ + L LS P + ++ DR LDP EV+ ++ Sbjct: 524 FAIANQRHNALLRQPDSYLGPVLSAYPEWANLDNQGRTWHENDRKAYLDPANREVRSYLL 583 Query: 218 SIVAEVVSRYPVDGVQFDDYFYTESPGSRLND-------NETYRKYGGA----------- 259 +V E+ Y VDG+ D Y +R + +R G Sbjct: 584 RLVGEIAHNYQVDGIHLDYIRYPFQDANRNFNFGYGTASRTQFRDLTGVDPISLTPRDGV 643 Query: 260 -FASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYD 318 + +R + + V + + P V + P Sbjct: 644 LWQQWTQFRSDQVTSFVRDVRQLLSTNYPNVILSAAVF-------PHPETERIAK----- 691 Query: 319 ESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKV 378 W QG LD + P Y + R ++ Sbjct: 692 -IQQHWEVWARQGYLDLLVPMTYSLDTNRLQRITQPL-----------------TGPQQL 733 Query: 379 GEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQSRWG 438 G +++ Q+ +P G +F + ++ Q + Q+ G Sbjct: 734 GLTLIAPSVKLLDIPNVVAIDQIQALRDLPS-GGYSIFAVETIDS-NLQGFLRRTQNNGG 791 Query: 439 S 439 + Sbjct: 792 N 792 >UniRef50_A8F3E2 Putative uncharacterized protein n=1 Tax=Thermotoga lettingae TMO RepID=A8F3E2_THELT Length = 961 Score = 185 bits (468), Expect = 4e-45, Method: Composition-based stats. Identities = 57/380 (15%), Positives = 114/380 (30%), Gaps = 59/380 (15%) Query: 71 PVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMT 130 + + N + + + + + + L +G N + +V G + S Sbjct: 315 QTRGIWLDNQSIKKTGSPERLRETIRKLHSIGFNMIIPEVIYKGKTM----ASKLSYFPQ 370 Query: 131 GKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYV 190 + DPLQ ++DEA K ++VHAW + S E N + P + Sbjct: 371 DDDFQRWSEDPLQVIVDEAKKLNIEVHAWCWVFAASSG------GEENYFIKNFPDWIEK 424 Query: 191 QHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYT---------- 240 I T P+ ++++ + E+ +Y +DG+ D Y Sbjct: 425 DKYGNIFTKNGTAWFSHSNPQTREYLIDGILEIAKKYEIDGINLDYIRYDGDEMGYDEHA 484 Query: 241 --ESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGV 298 D KY WR + ++ K++ + Sbjct: 485 VKSFMKETGVDPYKIEKYSKDQVIWHMWREEKINSFVEELYKRAKALNDRLLISADVYPS 544 Query: 299 WRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWW 358 ++ + WV +D + P Y S ++ + Sbjct: 545 LSGARNEKK--------------QNWEAWVRNKYIDALIPMNYKG---SIEDLKIVLEMQ 587 Query: 359 ADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFRE 418 LY G+ + +L +Q+ + SG +LF Sbjct: 588 TKF--KNMVYLYSGLQMINL-------------KSTEDLIEQIKTSINYLS-SGIVLFSL 631 Query: 419 DYLNKPQTQQAVSYLQSRWG 438 YL++ Y+++ +G Sbjct: 632 SYLDRYDE----DYIRNIFG 647 >UniRef50_Q7NJN0 Glr1802 protein n=1 Tax=Gloeobacter violaceus RepID=Q7NJN0_GLOVI Length = 344 Score = 184 bits (467), Expect = 5e-45, Method: Composition-based stats. Identities = 80/382 (20%), Positives = 139/382 (36%), Gaps = 62/382 (16%) Query: 56 MRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGT 115 +RG+WL V SR ++A++ +D L + G N VF V G Sbjct: 4 LRGVWLTNV----------------GSRVLHSREAIVRAMDLLAQTGFNAVFPVVWNKGF 47 Query: 116 ALWPSKIL-PWSDLMTGKIGENPGYDPLQFMLDEAHKRGMK-VHAWFNPYRVSVNTKPGT 173 L+PS+I+ + + DPL +++ A + G++ V WF S G Sbjct: 48 TLYPSRIMLELFGIEIDPLYAEAKRDPLAEVIEAAGRAGIRMVIPWFEYGFASSPRSDG- 106 Query: 174 IRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQ 233 L +PA ++ PEVQ+++ S++ EV + Y + GVQ Sbjct: 107 ----GHILLTRPAWTARVSGGAPLVKNGLVWMNALDPEVQNFVLSLMLEVATHYDIVGVQ 162 Query: 234 FDDYFYTESPGSRLNDNETYRKYGG----------AFASKADWRRNNTQQLIAKVSHTIK 283 DD P D T + + WR + + + ++ IK Sbjct: 163 GDD-RLPALPVEGGYDPRTVELFRETTGSDPPGWASEPGWVQWRADRLTEFLGRLYTQIK 221 Query: 284 SIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWP 343 S++P + ++P S P + + D W +G D + PQ+Y Sbjct: 222 SVRPELLLSLAP-------SVYP--------FSLNHYLQDVAEWARRGWFDLLHPQVYR- 265 Query: 344 FSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDL 403 + ++ D R+ GIAF G + GV ++++++ L Sbjct: 266 -ENFGQYRREIDRFKRDFPPEATGRIAPGIAFKANG----------VEIGVDDVRRRIAL 314 Query: 404 NDAVPEISGTILFREDYLNKPQ 425 N + G + F D L Sbjct: 315 N-CERGLGGEVFFYFDGLCAND 335 >UniRef50_Q1IWF6 Putative uncharacterized protein n=3 Tax=Deinococcus RepID=Q1IWF6_DEIGD Length = 536 Score = 182 bits (461), Expect = 2e-44, Method: Composition-based stats. Identities = 77/411 (18%), Positives = 117/411 (28%), Gaps = 56/411 (13%) Query: 25 LCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRA 84 + + P TP + P +RG+WL Sbjct: 36 AAASVTPVPPQAATPAILAPVPTPVPAPISSVRGLWLDAF-----------------GPG 78 Query: 85 RVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQF 144 + ++ LG+NT+F Q L LP +DPL Sbjct: 79 LKTAAQVRRSVEDAASLGVNTLFVQAIRRADCLCRRSSLPV----ITDADLEKDFDPLAE 134 Query: 145 MLDEAHKRGMKVHAWFNPYRVSV---NTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGD 201 + AH RGM+V AW + S AS + D G Sbjct: 135 VTRLAHARGMRVIAWVSVTGASNLRVPNSNPAHVSRQHGAQAGAASWLSRRPDGSWQEGA 194 Query: 202 RFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYG---- 257 LDP IP D++ V +V YPVDGVQ D Y + G+ D +T +Y Sbjct: 195 DGWLDPAIPAAADFMVGGVVSLVKHYPVDGVQLDRIRYPDG-GNWGYDPKTLARYRAETG 253 Query: 258 ------GAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDT 311 A DW+R L+ +++ +K+++P + + Sbjct: 254 AKGTPAPDDARWRDWKREQVTLLVRRIALEVKAVRPTAWVTAATIT-YGPPPPPGDLDAF 312 Query: 312 RGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTR--TRL 369 Y + D W+ +GLLD Y + W R + Sbjct: 313 HKTRTYLDVLQDWPTWMREGLLDLNVLMNYKRDAVGEQ--GAWLDGWNAFAASVRGDAEV 370 Query: 370 YIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDY 420 G A Y N V G + + Sbjct: 371 AGGTALYLNPPAVTAS----------------QANRTVGAGLGWVGYSYRT 405 >UniRef50_C5CIL6 Putative uncharacterized protein n=1 Tax=Kosmotoga olearia TBF 19.5.1 RepID=C5CIL6_KOSOT Length = 993 Score = 178 bits (452), Expect = 3e-43, Method: Composition-based stats. Identities = 77/390 (19%), Positives = 134/390 (34%), Gaps = 67/390 (17%) Query: 61 LATVSRLDWPPV----SSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTA 116 L T S P ++ + + A + + ++ L LG N + +V GT Sbjct: 314 LTTFSYSLLPSRVVQTRAIWLDHGAMAATGGPENLRKTIEKLAHLGFNVLLPEVIWKGTT 373 Query: 117 LWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRE 176 + P + + ++ DPL+ +++EAHK M+VHAW + V G + E Sbjct: 374 ISP----KLTVYPQNEEFKDWEEDPLEIIIEEAHKYDMEVHAWTWTFAV------GYLGE 423 Query: 177 LNSTLSQQPASVYVQHRDWIRTSGDRF----VLDPGIPEVQDWITSIVAEVVSRYP-VDG 231 N +++ P V GD P+ ++ I S + EVV +YP +DG Sbjct: 424 SNELMNKNPHLVEKDRFGRTFAEGDNVKRAGFFSHSNPKARELIKSAIKEVVEKYPEIDG 483 Query: 232 VQFDDYFYTESP---------------GSRLNDNETYRKYGGAFASKADWRRNNTQQLIA 276 + D Y S D KY WR N + Sbjct: 484 INLDYIRYENSDIIDHGYDDYSVKAFKEETGIDPFKIEKYTKEEVLWHLWRENQVTSFVK 543 Query: 277 KVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYI 336 ++S +K+IKP + + P G+ + + W + G +D + Sbjct: 544 EISEELKAIKPTIIISADVINL-------PTGAQ-------HKFKQNWVLWAKNGYVDAL 589 Query: 337 APQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPE 396 P Y P S + + LY G++ + +N Sbjct: 590 FPMAYTPSSDDL----RIMIEAEKSAVSGKVFLYPGMSLF-------------VNRDTES 632 Query: 397 LKKQLDLNDAVPEISGTILFREDYLNKPQT 426 + KQL + E+ G +F Y++ Sbjct: 633 VLKQLKILSE--ELDGLSMFALSYIDDFDN 660 >UniRef50_C2FS67 FenI family protein n=1 Tax=Sphingobacterium spiritivorum ATCC 33300 RepID=C2FS67_9SPHI Length = 327 Score = 176 bits (445), Expect = 2e-42, Method: Composition-based stats. Identities = 74/216 (34%), Positives = 109/216 (50%), Gaps = 11/216 (5%) Query: 221 AEVVSRYPVDGVQFDDYFYTESPGSR--LNDNETYRKYGGAFASKADWRRNNTQQLIAKV 278 +VV Y VDG+ FDDYFY L D T+ ++G FA+ DWRRNN LI + Sbjct: 1 MDVVKNYDVDGIHFDDYFYPYPDARNTALPDAPTFHQFGRGFANIHDWRRNNVDLLIRDL 60 Query: 279 SHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAP 338 IK KP +++G+SP G+W N+ +P GS+T G + Y YAD +W+++G +DYI P Sbjct: 61 GIAIKKEKPFIKYGISPFGIWDNKRDNPDGSNTSGLSGYRTLYADGVKWMKEGWIDYINP 120 Query: 339 QIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELK 398 QIY+PF+ AA +++L +WW Y+G Y+V E ++ Sbjct: 121 QIYFPFNNRAAAFEILLEWWEKHT--YGRHFYVGHGAYRVTEKRPG------WTDKGQIP 172 Query: 399 KQLDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQ 434 KQ+ E+ G+I F L +Q Sbjct: 173 KQVRHLRDQHEVQGSIYFSSKSLMD-NLAGLRDSMQ 207 >UniRef50_C6IEW4 Putative uncharacterized protein n=4 Tax=Bacteroidales RepID=C6IEW4_9BACE Length = 490 Score = 173 bits (439), Expect = 9e-42, Method: Composition-based stats. Identities = 81/481 (16%), Positives = 146/481 (30%), Gaps = 78/481 (16%) Query: 5 SRNKKLTIRRPAI---LVALALLLC-SCKSTPPESMVTPPAGSKPPATTQQSSQ-PMRGI 59 ++ K+ I++ I + +A LC +C + +G + T+ + R I Sbjct: 28 IKSYKMNIKKNIIKTFMGGIAACLCMACGGNDSKDYWGDTSGGEDEEPTENPNASKPRYI 87 Query: 60 WLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPD-GTALW 118 W+ + ++ + L + G + V+P G L+ Sbjct: 88 WIDAAANF--------------PDFANSKENIARDLALAKDAGFTDIVVDVRPTTGDVLF 133 Query: 119 PSKILPWSDLMTGKIGEN-------PGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKP 171 + ++ M IG N +D LQ +DEA K+G+++HA N + Sbjct: 134 KTNLVDQVKFMYAWIGSNYTKVERTATWDYLQAFVDEARKQGLRIHAAINTFVGGNQIDG 193 Query: 172 G---TIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYP 228 G R+ + ++ V + TS +P PEVQ ++ ++ ++ Y Sbjct: 194 GTGLLYRDQSKAEWATQMNMQVGITSVMNTSESTKFFNPAHPEVQTFLCDLLKDLAG-YD 252 Query: 229 VDGVQFDDYFYTESPGSR--------------------------LNDNETYRKYGGAFAS 262 +DG+ D + + Y Sbjct: 253 LDGIFLDRGRFLNLQADFSEESRKQFEEYMGGIRIQNYPNDILAPGASSLPATYPKYLTK 312 Query: 263 KADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYA 322 ++R + K +K +KPG++FGV G + S A+ YD S Sbjct: 313 WLEFRAKVIYDFMQKARTAVKGVKPGIKFGVYVGGWY---STYYDVGVNWAASTYDTSR- 368 Query: 323 DTRRWVEQGL--------LDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIA 374 W +D I Y + +W + G Sbjct: 369 -YYNWATSKYKNYGYAACMDQILIGAYAS----PLKVHGTTEWTMEGFCSLAKDKIKGEC 423 Query: 375 FYKVGEPSKIEPDWMINGGVPELKKQLDLND---AVPEISGTILFREDYLNKPQTQQAVS 431 G P D N E + Q + + G LF +L K Q Sbjct: 424 PIVAGGPDVGNWD-TNNQATQEQENQAIVQSVKACMNVCDGYFLFDMIHLKKADQWQYAK 482 Query: 432 Y 432 Sbjct: 483 E 483 >UniRef50_A7LVF6 Putative uncharacterized protein n=4 Tax=Bacteroides RepID=A7LVF6_BACOV Length = 395 Score = 172 bits (436), Expect = 2e-41, Method: Composition-based stats. Identities = 75/414 (18%), Positives = 148/414 (35%), Gaps = 71/414 (17%) Query: 46 PATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINT 105 + + + Q +RG+W+ P + Q + D + L L +N+ Sbjct: 19 QSIAKTTEQGIRGVWVPA------PRFT---------PVLHSYQGVKDFVKTLDELNMNS 63 Query: 106 VFFQVKPDGTALWPSKI-LPWSDLMTG----------KIGENPGYDPLQFMLDEAHKRGM 154 +F + ++ S + + +S T K ++P DP++ ++DEAHK + Sbjct: 64 IFLVSYAETKTIYRSDVLMHYSTYKTQEESYLLSGYSKQYQSPTNDPVRDLIDEAHKHDI 123 Query: 155 KVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDR-FVLDPGIPEVQ 213 KV WF + + I N L++ P + + ++ + + P VQ Sbjct: 124 KVFFWFEYGFMG---EGRPISPNNPLLAKNPHWLGIDNQQHPANYNQHDYYFNAYNPAVQ 180 Query: 214 DWITSIVAEVVSRY-PVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFAS---------- 262 +++ ++ E + Y +DG+Q DD + P + D T Y Sbjct: 181 NFLIELIEEALMLYPDLDGIQGDD-RFPAMPRNSGYDTYTVSLYQSQHQGNNPPVDYNNS 239 Query: 263 -KADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESY 321 WR + ++ IK+ P V +P + P + Sbjct: 240 EWVHWRLDILNTFAKRLYKRIKAKSPNVMISFAP-------NPYPWCEEN--------LM 284 Query: 322 ADTRRWVEQGLLDYIAPQIY-WPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGE 380 + RW ++ + D +A Q Y + A + K+ G+ + G Sbjct: 285 QEWPRWCKEKVCDLLAVQCYRYSVDAYRATVSEVLKYIHQ--NNPNQLFAPGMILME-GS 341 Query: 381 PSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQ 434 SK+ P+ L+KQL +N + I+ I F ++ P ++ + Sbjct: 342 NSKMSPE--------LLQKQLRINREL-GINSEIYFYNKGIDNPSVRKVLKQTY 386 >UniRef50_P74629 Sll0736 protein n=1 Tax=Synechocystis sp. PCC 6803 RepID=P74629_SYNY3 Length = 408 Score = 169 bits (429), Expect = 1e-40, Method: Composition-based stats. Identities = 74/421 (17%), Positives = 135/421 (32%), Gaps = 85/421 (20%) Query: 48 TTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVF 107 + S +RG+WL V S + + L+ NT++ Sbjct: 34 NSSASPNKIRGVWLTNV----------------DSNVLYDPVQLKTAIADLKSTNFNTLY 77 Query: 108 FQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSV 167 V DG L+PS + L + + D L +++ A ++ ++V WF + Sbjct: 78 PTVWNDGHTLYPSAVAQQ-WLGKKQDEKLGDRDMLGEVINLAKEKSLRVIPWFEFGFM-- 134 Query: 168 NTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGD----RFVLDPGIPEVQDWITSIVAEV 223 + + P + + R L+P PEVQ IT+++ ++ Sbjct: 135 ------APAESDWVKAHPHWLTTNSQGETIWLEGGTIPRVWLNPLHPEVQQLITALLVDL 188 Query: 224 VSRYPVDGVQFDDYF-YTESPGSRLNDNETYRKYGGA--------------------FAS 262 V RY VDG+Q DD+F Y S G YR+ G + Sbjct: 189 VRRYDVDGIQLDDHFGYPYSFGYDPITVALYRQETGQEPLPVPELDLNQNCVSSDPIWQQ 248 Query: 263 KADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYA 322 DWR + + + +K++KP + +SP + + Sbjct: 249 WTDWRSAKISRYVQSLVPILKAVKPNLTISISP---------------NPQTFSKNCFLL 293 Query: 323 DTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPS 382 D + W E +++ + Q+Y A + + + + +G G + Sbjct: 294 DWQTWHEAKVINELVLQVYRE---KQAAFTGELQQSSVQQTKQEIPVVVG---ILSGLKN 347 Query: 383 KIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNK------PQTQQAVSYLQSR 436 + P I + Q +GT F + L P Q Sbjct: 348 RSIPSARIKQQAQWVDDQ--------NFAGTAFFFYESLWNLEAETSPNPFGLKQQWQRL 399 Query: 437 W 437 + Sbjct: 400 Y 400 >UniRef50_Q3AJ74 Putative uncharacterized protein n=3 Tax=Chroococcales RepID=Q3AJ74_SYNSC Length = 390 Score = 169 bits (428), Expect = 2e-40, Method: Composition-based stats. Identities = 75/377 (19%), Positives = 123/377 (32%), Gaps = 66/377 (17%) Query: 71 PVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMT 130 V ++N S+ ++ + + LQ G N V V GT S+ P + Sbjct: 45 STMGVWLTNSPSKLYYDRKRISAAMQQLQHAGFNRVVPNVWSRGTTFHRSRFAPVEPPLQ 104 Query: 131 GKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYV 190 G DP+ + E +RG+KV WF Y + + E P+ V Sbjct: 105 KAGV---GLDPICTLAAEGRRRGIKVMPWFE-YGLMEPADSAVVHE-------NPSWVLA 153 Query: 191 Q--HRDWIRTSGDRF--VLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSR 246 + + W+ G+ L+P PEV+ +V E + R P+DG+Q DD+F P Sbjct: 154 KANGQRWMAMHGNHRMAWLNPAHPEVRARFIGLVVETLKRCPMDGLQLDDHF--AWPVHF 211 Query: 247 LNDNETYRKYGGAF----------ASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPA 296 D T Y WRRN L+ ++ +K +SP Sbjct: 212 GYDPTTLALYRQETGLAPPGDHSNRYWMKWRRNQLTSLLRELRQRLKQEGLSTRISLSPG 271 Query: 297 GVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRS-----AARY 351 +AY+ D W GL++ + Q Y R Sbjct: 272 ---------------PFRSAYNLWLQDWELWALGGLIEELVVQNYAYSVRGFAKDLDQPA 316 Query: 352 DVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEIS 411 A+ W P++ + G + L++++ L Sbjct: 317 LRKARDWG---IPSQIGVLAGFG--------------KRTTSMAVLEQKVRLARQRG--H 357 Query: 412 GTILFREDYLNKPQTQQ 428 G I F + L + Sbjct: 358 GVIFFYWEGLWGKHVAE 374 >UniRef50_UPI0001AF05D8 hypothetical protein SghaA1_34850 n=1 Tax=Streptomyces ghanaensis ATCC 14672 RepID=UPI0001AF05D8 Length = 522 Score = 168 bits (426), Expect = 3e-40, Method: Composition-based stats. Identities = 62/421 (14%), Positives = 125/421 (29%), Gaps = 60/421 (14%) Query: 21 LALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNP 80 + L + ++ PPA + + R W+ Sbjct: 6 VLTTLAAGLGLLFGAVTAPPAHAD---DGAAAPAQWRSYWVDAF---------------- 46 Query: 81 TSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYD 140 + + ++ + N + Q + + P T YD Sbjct: 47 -NPGIFTPAQVAALVEDALDVNANALIVQTARRYDCFCNNALYPR----TDAAIAPEPYD 101 Query: 141 PLQFMLDEAHKRGMKVHAWFNPY----RVSVNTKPGTIRELNSTLSQQPASVYVQHRDWI 196 PL+ ++ + H G++VHAW N R + P + + + + D Sbjct: 102 PLEEIVRQGHAAGLQVHAWVNVNTMWNRTTPPRSPEHVFNQHGPGATGADRWLNKKADGQ 161 Query: 197 RTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTES-----PGSRLNDNE 251 G +DPG P D+I V +V Y VDGV D Y + + Sbjct: 162 ELVGANAYVDPGHPAAVDYIVRGVQSIVRNYDVDGVNLDYVRYPDGSSTTTHSDWGYNEV 221 Query: 252 TYRKYG----------GAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRN 301 + ++ + + +DWRR+ L+ K+ + + P + Sbjct: 222 SVARFQQATGRTDIPLPSDTAWSDWRRSQVTNLVRKIYLGVWEVDPQARLSMDAI---TY 278 Query: 302 RSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADV 361 + Y E D W+++G++D Y ++ W++ Sbjct: 279 GHGPQAVGGWQATRTYAEVLQDWAGWLDEGIMDTAVTMNYKRNWDPDQ--ALMFSEWSEF 336 Query: 362 VK--PTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFRED 419 + + G A Y G + ++++ L + A +G + Sbjct: 337 LADHQGERQAVNGPALYLNGVADSLS----------QIREALRPSPAGNTAAGWSGYSYA 386 Query: 420 Y 420 Sbjct: 387 S 387 >UniRef50_A6CAJ3 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CAJ3_9PLAN Length = 811 Score = 168 bits (426), Expect = 3e-40, Method: Composition-based stats. Identities = 69/387 (17%), Positives = 119/387 (30%), Gaps = 80/387 (20%) Query: 48 TTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVF 107 T + R +W S P R ++ L G N + Sbjct: 462 TRASPPREARAVW-----------DHSPTGPYPGDWNRTCKE--------LSDAGFNMII 502 Query: 108 FQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSV 167 + G A +PS +LP S D ++ L AH+ G++VH W + +S Sbjct: 503 PNMLWGGLAHYPSDVLPRSTTYEKYG------DQIEQCLKAAHQHGLEVHVWKVNHNLST 556 Query: 168 NTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRY 227 P + + SV + DW L+P PE + EVV +Y Sbjct: 557 --APQAFVKKMRDAGRTQVSVTGEPSDW---------LNPAHPENFQLEVDSMLEVVRKY 605 Query: 228 PVDGVQFDDYFYT-ESPGSRLNDNETYRKYGG-AFASKA-------------DWRRNNTQ 272 PVDG+ FD Y + + + G + DWR Sbjct: 606 PVDGIHFDYIRYPNDRHDYSDYSRQKFEADTGIKVQNWPADCYNGTLKSQYRDWRAAQIT 665 Query: 273 QLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGL 332 +L+ V + I+PG++ + + + D W + G Sbjct: 666 RLVETVQREARKIRPGIKISAAVFREYPDCREW--------------VAQDWPLWAKNGY 711 Query: 333 LDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMING 392 LD+I P Y + + + + + +Y GI + Sbjct: 712 LDFICPMDY--TDNDTQ-FRIWIEDQQKHLAGS-IPVYPGIGALSSRTTLSSDR------ 761 Query: 393 GVPELKKQLDLNDAVPEISGTILFRED 419 + Q+D+ + G +F + Sbjct: 762 ----ILGQVDMTRKL-NAGGFTVFSLN 783 >UniRef50_B0VF99 Putative uncharacterized protein n=1 Tax=Candidatus Cloacamonas acidaminovorans RepID=B0VF99_9BACT Length = 482 Score = 168 bits (425), Expect = 4e-40, Method: Composition-based stats. Identities = 63/383 (16%), Positives = 138/383 (36%), Gaps = 51/383 (13%) Query: 53 SQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKP 112 + +R +W+ ++++ + + N + +V+ Sbjct: 17 NAEIRSVWV-------------------LPWDIATEESIDEVIATAVSCNQNELLVEVRY 57 Query: 113 DGTALWPSKILPWSDLMTG---KIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNT 169 AL+ + + I E+ +DPL ++L +AH++G+ V AW + + Sbjct: 58 RADALFDTSKGAYLYPNPEPKSYILEDASFDPLAYILKKAHQKGLAVQAWVVVFNATPRE 117 Query: 170 KPGTIRELNSTLSQQPASVYVQHRDWIRTSGDR--FVLDPGIPEVQDWITSIVAEVVSRY 227 + + + N + + + + +DPGIPEVQ+++ +I+ + Y Sbjct: 118 Q--SYIQQNYIYNNHKDWITYNFNGSQMNIDRQSGYFIDPGIPEVQEYLLNILGNLAGGY 175 Query: 228 P-VDGVQFDDYFYTESPGSRLN-DNETYRKY--GGAFASKADWRRNNTQQLIAKVSHTIK 283 P +DG+ D Y ES Y +Y + +WR + +K Sbjct: 176 PELDGIHLDYIRYPESDLGFHPVSLARYNEYCQNQEEITYNEWRIMQVTNFVENAYFQLK 235 Query: 284 SIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWP 343 I P ++ + + D D + W+++G++D + P Y Sbjct: 236 EINPTLQLTAAVVPDIAEANVD--------------YAQDWQSWLKKGIIDRVYPMAY-- 279 Query: 344 FSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDL 403 A++ + + + ++ IG+ + G + + + V E+ K++ L Sbjct: 280 -DVQYAKFKKQLEQIK--LLQMKEKIVIGLRAW-NGNGNSLAVGNGNSYNVKEIAKKITL 335 Query: 404 NDAVPEISGTILFREDYLNKPQT 426 + +G LF L K Sbjct: 336 TRDL-GFAGVSLFSYSGLQKGNA 357 >UniRef50_A2C8D8 DUF187 n=12 Tax=Cyanobacteria RepID=A2C8D8_PROM3 Length = 410 Score = 167 bits (423), Expect = 6e-40, Method: Composition-based stats. Identities = 81/396 (20%), Positives = 131/396 (33%), Gaps = 50/396 (12%) Query: 43 SKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLG 102 S+P A + Q+S P D P+ V ++N S + M + L R G Sbjct: 35 SQPCAISAQASTPPSVAQSGLRHLSDHLPIVGVWMTNSPSPLYYSRNLMHKAVKDLYRAG 94 Query: 103 INTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNP 162 ++ V G+ S P + DP+ + E H RGMKV WF Sbjct: 95 FTALYLNVWSRGSTFHRSNYAPVEGPLQKAGLA---LDPICTLRREGHARGMKVVPWFEY 151 Query: 163 YRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAE 222 + + + L++ + V+ + + R L+P PEV+ +V E Sbjct: 152 GLMEPDDAEVVKLHPDWVLARADGNPVVK----MHGNHKRVWLNPAHPEVRARFIGVVIE 207 Query: 223 VVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFAS----------KADWRRNNTQ 272 V+ R +DGVQ DD+F P D T Y S WRR Sbjct: 208 VMKRCKMDGVQLDDHF--AWPVQLGYDPYTVALYQQETGSLPPRDYSDRFWMQWRRRKLT 265 Query: 273 QLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGL 332 L+ ++ ++ K V ++P AY+ D W L Sbjct: 266 GLLRELRQALEKEKLPVNISLAPG---------------PFRFAYNNWLQDWELWTVGKL 310 Query: 333 LDYIAPQIYWPFSRSAARYDVLAKWWADVVKPT-RTRLYIGIAFYKVGEPSKIEPDWMIN 391 +D + Q Y + S + A P ++IG+ + Sbjct: 311 IDELVVQNY---AYSLKGFAKDLDQPALRKAPQWGLPVHIGV----------LAGFGKRT 357 Query: 392 GGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQ 427 +P L +++ L G I F + L Sbjct: 358 TPMPVLVEKVRLAAERG--HGVIYFYWEGLWGQHAG 391 >UniRef50_C7GZF2 Putative lipoprotein n=1 Tax=Eubacterium saphenum ATCC 49989 RepID=C7GZF2_9FIRM Length = 373 Score = 167 bits (422), Expect = 9e-40, Method: Composition-based stats. Identities = 95/419 (22%), Positives = 160/419 (38%), Gaps = 81/419 (19%) Query: 34 ESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMID 93 + + + + M+ +W VS LD+ N+ R + ++ Sbjct: 5 AEAIEQKKAPQKTVKVVKFNTEMKAVW---VSFLDFQ-----NLGLTNVREKTFKKNAEI 56 Query: 94 KLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENP------GYDPLQFMLD 147 + +R GINT+FF V+ A + SK+ + YDPL+ + + Sbjct: 57 MVKDAKRNGINTIFFHVRAFDDAAYKSKVFRAMRYLKTNASYAKPATSSFSYDPLKLVAE 116 Query: 148 EAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDP 207 AHK G+++HAW NPYRV G + L P Sbjct: 117 AAHKHGVQLHAWLNPYRV----------------------------------GYDYFLSP 142 Query: 208 GIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADW- 266 + I V E+++ Y VDG+ FDDYFY G +++ A A K D+ Sbjct: 143 KSEYSTNRIIKAVNEILT-YKVDGIHFDDYFYHAKKGYYRLNSKKQYSVNPATAKK-DYS 200 Query: 267 -----RRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESY 321 +R +LI +V+ T + F VSPAG N + + Sbjct: 201 PSSINKRRYVNKLIRRVNKTTQGK---ALFSVSPAGNVDNCMNSGV-------------- 243 Query: 322 ADTRRWVEQ-GLLDYIAPQIYWPFSRSA-ARYDVLAKWWADVVKPTRTRL--YIGIAFYK 377 D W+ G +D I PQIYW + A R + + ++ + ++ IG+A Y+ Sbjct: 244 -DLTTWLSNDGYVDMIMPQIYWTDNWGASGRVKMFSSRLGQFMRKNKKKIPMVIGLALYR 302 Query: 378 VGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQSR 436 GE + W + G + Q+ + G LFR + L + + ++ V ++ Sbjct: 303 SGERGLGDKGWSMRGSN--ISGQIKSIRRH-GLGGYCLFRFNNLYQGRCKKEVKNMRKI 358 >UniRef50_UPI0001789939 S-layer domain protein n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI0001789939 Length = 1549 Score = 156 bits (395), Expect = 1e-36, Method: Composition-based stats. Identities = 71/415 (17%), Positives = 131/415 (31%), Gaps = 74/415 (17%) Query: 70 PPVSSVNISNPTSRARVQQ--QAMIDKLDHLQRLGINTVFFQVKP-DGTALWPSKILPWS 126 P + + S AR + + ++ L + G+ +V F VK +G + L Sbjct: 513 PEKEVILWVDQASNARKFKTSEDVLAFLQKAKETGVTSVAFDVKGVEGYVSYKKNDLTGR 572 Query: 127 DLM-----TGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTL 181 + K G +P D LQ +D H G+++HA N + + Sbjct: 573 PYVSEIKAPEKAGASPDLDLLQEFIDHGHALGLEIHAAINVFAEGSIAHNEYAVLNDHLD 632 Query: 182 SQQPASVYVQHRDWIR-----TSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDD 236 ++ + + R G ++P EV+D+ E++ Y VDGV D Sbjct: 633 WEERVYFPENNGEIKRLRESKKQGLVAFVNPSNDEVRDYQLRTFEEIIKNYDVDGVVHDR 692 Query: 237 YFYTESPGS----------------------RLNDNETY----RKYGGAFASKADWRRNN 270 Y ND +Y R YG ++R Sbjct: 693 SRYDNEGADFSDETRVKFEQFLQARGKQLVNWPNDIFSYENNVRVYGPLIQDWWEFRSGT 752 Query: 271 TQQLIAKVSHTIKSI----KPGVEFGVSPAGVWRNRSHDPLGSDTRGA-------AAYDE 319 Q +V + S +E + + + ++ + Sbjct: 753 IQSFFGEVKALVDSYEVSEGRKIEVSSYVGSWYETYYLNGVNWGSKNFRFNPALGMPDES 812 Query: 320 SYADTRRWVEQGLL---DYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFY 376 Y T + + G + D++ Y S+ +Y L ++V LY GIA Sbjct: 813 VY--TEEYYQTGYIEYLDFLMIGAYQTTSQEIQKYITL----GNIVTNGEIPLYAGIAM- 865 Query: 377 KVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAVS 431 N P +++++ + +G +LF +N P A+ Sbjct: 866 -------------NNVQAPAVQREV-FQAGLKSTNGLMLFDASQVNWPIAAAALQ 906 >UniRef50_P35824 S-layer-related protein n=1 Tax=Bacillus circulans RepID=SLAP_BACCI Length = 1616 Score = 155 bits (391), Expect = 4e-36, Method: Composition-based stats. Identities = 75/416 (18%), Positives = 129/416 (31%), Gaps = 75/416 (18%) Query: 71 PVSSVNISNPTSRARVQQQA-MIDKLDHLQRLGINTVFFQVKP-DGTALWPSKILPWSDL 128 + + + + Q + + L + G+ +V F VK +G + L Sbjct: 514 KKVILWVDQAANARKFQTGDNVANFLRTAKENGVTSVVFDVKGVEGYVSYKKSTLTGRPY 573 Query: 129 M-----TGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQ 183 + K G NP D LQ + + + G+ +H FN + + + Sbjct: 574 VSAIKAPEKAGSNPDLDLLQEFIRYSRELGLDIHVSFNIFAEGSIASNEFALLDSHLDWE 633 Query: 184 QPASVYVQHRDWIR-----TSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYF 238 + + R G ++P EV+D+ + EV+ Y VDGV D Sbjct: 634 ERVYNAADNGQIKRLRESAKQGAVAFVNPSNDEVRDFQLKTIEEVLQNYDVDGVVLDRAR 693 Query: 239 YTESPGS----------------------RLNDNETY----RKYGGAFASKADWRRNNTQ 272 Y +D TY RK G ++R + Sbjct: 694 YDNESADFSDLTKAKFESFLGARGKQLQNWPDDVFTYAGNVRKDGPLIRDWWEFRSKTIK 753 Query: 273 QL---IAKVSHTIKSIK-PGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDE--------S 320 + +++ +K+ K +E + G W + YDE Sbjct: 754 SFTSEVRQLTDRVKAEKGKKIEVS-AYVGSWFESYYLNGVHWGSTEFRYDERLRMKDKSV 812 Query: 321 YADTRRWVEQGLL---DYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYK 377 Y T + E G + D+I Y + Y L ++V LY GIA Sbjct: 813 Y--TPGYYESGYVKNLDFIMIGAYQTTAPEIEHYITL----GNIVTNGEVPLYAGIAL-- 864 Query: 378 VGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAVSYL 433 N P L++ + + G +LF +N P A+ L Sbjct: 865 ------------TNVQEPALQRDV-FQAGLVNTHGLMLFDASQVNWPVAGAALRNL 907 >UniRef50_A7LVF0 Putative uncharacterized protein n=3 Tax=Bacteroides RepID=A7LVF0_BACOV Length = 467 Score = 154 bits (390), Expect = 4e-36, Method: Composition-based stats. Identities = 72/472 (15%), Positives = 141/472 (29%), Gaps = 71/472 (15%) Query: 8 KKLTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRL 67 K L I L A+ + CS S ++ P + R IW+ + Sbjct: 3 KFLKILILTFLGAVTITSCSDDSDGIPGWPWNDNSTEKPDEPDVAEAKPRYIWIDAAANF 62 Query: 68 DWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPD-GTALWPSKILP-- 124 ++ + ++ ++ G + V+P G L+ + ++ Sbjct: 63 --------------PDYANSKENIAKDMEKIKAAGFTDIIVDVRPTTGDVLFNTNVVDQV 108 Query: 125 -----WSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRE--- 176 W + +D LQ ++EA +G+KV+A N + + Sbjct: 109 KRMDVWGNSGYSYYERTETWDYLQAFIEEARIQGLKVNASINTFVGGYLCPYNLGHDGVL 168 Query: 177 LNSTLSQQPASVYVQHRDWIRTSG--------DRFVLDPGIPEVQDWITSIVAEVVSRYP 228 + ASV T +P +VQ+++ ++A++ Y Sbjct: 169 FRDESKKGWASVANLADGLTNTMDLLDDETDYGAKFFNPANDDVQNFVLQLLADLAK-YD 227 Query: 229 VDGVQFDDYFYTESPGSRLNDN---ETYRKYGGA------------------------FA 261 +DG+ D Y + + + + +Y G F Sbjct: 228 LDGIILDRCRYDDYGLESDFSDISKQKFEEYIGETVANFPADIMAPGTDEIPSDQPVYFK 287 Query: 262 SKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTR--GAAAYDE 319 ++R I K +KS+ ++FGV + + + +AY Sbjct: 288 KWLEFRAKVIHDFIVKAREKVKSVNNNIKFGVYVGAWYSTYYTSGVNWASPKYNTSAYYP 347 Query: 320 SYA--DTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYK 377 +A D + + LDYI Y + +W + L G + Sbjct: 348 KWATSDYKNYGYADHLDYIFLGAYASVNNIYGS----GEWTMEGFCKNGRELLQGDVPFA 403 Query: 378 VGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQA 429 G W G ++ +D + G F ++ K A Sbjct: 404 GGPDIGNSTGWTDGGQSAKIPDAIDA--CISNSDGFFAFDLCHVKKYDYWNA 453 >UniRef50_UPI0001C16380 Protein of unknown function DUF187 n=1 Tax=Raphidiopsis brookii D9 RepID=UPI0001C16380 Length = 289 Score = 154 bits (390), Expect = 4e-36, Method: Composition-based stats. Identities = 57/255 (22%), Positives = 105/255 (41%), Gaps = 33/255 (12%) Query: 42 GSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRL 101 S P T Q S + +RG+W+ + + + D + L+RL Sbjct: 42 HSVPSVTAQMSREEIRGVWVTS----------------NDLNVFKDRDQVKDAVTKLRRL 85 Query: 102 GINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFN 161 NT++ V G ++PS + D+ + G+D L ++++AH + + WF Sbjct: 86 NFNTIYPVVWNSGYVMYPSNVAKSLDIQPFVFRGSDGHDILADIINQAHSQNLLAIPWFE 145 Query: 162 PYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVA 221 ++ NT + + + S + +G+ L+P P+VQ +I ++ Sbjct: 146 FGFMTPNTGELALNKPEWLTKMRDGSTVS-----MSAAGEVSWLNPFHPQVQKFIIDLLV 200 Query: 222 EVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFAS----------KADWRRNNT 271 E+ + Y +DG+QFDD+ T P D+ T Y WR N Sbjct: 201 ELTNNYDIDGIQFDDH--TSLPHQFGYDDYTVNLYKQETGKNPPANSQDSEWVAWRANKI 258 Query: 272 QQLIAKVSHTIKSIK 286 + + +++HT+K IK Sbjct: 259 TEFMVRLNHTVKQIK 273 >UniRef50_C6IVH6 S-layer domain-containing protein n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6IVH6_9BACL Length = 1573 Score = 154 bits (390), Expect = 5e-36, Method: Composition-based stats. Identities = 58/410 (14%), Positives = 125/410 (30%), Gaps = 69/410 (16%) Query: 71 PVSSVNISNPTSRARVQ-QQAMIDKLDHLQRLGINTVFFQVKP-DGTALWPSKILPWSDL 128 + + ++ + Q + + L + G+ ++ VK +G + L Sbjct: 538 KEVILWVDQASNAKKFQTSEQVRAFLQKAKDTGVTSIALDVKGVEGYVSYKKNDLTGRPY 597 Query: 129 M-----TGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQ 183 + G+ G NP D LQ +D H G+++HA N + + + Sbjct: 598 VSELQAPGRAGANPDLDLLQEFIDHGHDLGLEIHAVVNVFAEGSIAYNEYAVLNDHLDWE 657 Query: 184 QPASVYVQHRDWIR-----TSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYF 238 + + + R G ++P EV+++ E++ Y VDGV D Sbjct: 658 ERVHYAENNGEIKRLRESAKQGLVAFVNPANDEVREFELKTFEEILKNYDVDGVVHDRGR 717 Query: 239 YT--------------------------ESPGSRLNDNETYRKYGGAFASKADWRRNNTQ 272 Y + P R G ++R Q Sbjct: 718 YDNEGADFSEETRVKFEQFLLQRGKQLNDWPNDIFYYENNVRVDGPLIQDWWEFRSGVIQ 777 Query: 273 QLIAKVSHTIKSI----KPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTR--- 325 +V + S ++ + + + ++ + Sbjct: 778 SFFGEVKSLVDSYEAGSGRTIKVSSYVGSWYETYYLNGVNWASKNFRIHPSLGLPVESIY 837 Query: 326 --RWVEQGLL---DYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGE 380 + + G + D++ Y S+ +Y L ++V LY GIA Sbjct: 838 TPEYYDTGYIEYLDFLMIGAYQTTSQEIQKYITL----GNIVTNGEIPLYAGIAL----- 888 Query: 381 PSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAV 430 N +P +++++ + +G +LF +N + ++ Sbjct: 889 ---------NNVQLPAVQREV-FQAGLRTTNGLMLFDASQINWAIAKASL 928 >UniRef50_B0MQ12 Putative uncharacterized protein n=1 Tax=Eubacterium siraeum DSM 15702 RepID=B0MQ12_9FIRM Length = 990 Score = 150 bits (379), Expect = 7e-35, Method: Composition-based stats. Identities = 91/436 (20%), Positives = 158/436 (36%), Gaps = 73/436 (16%) Query: 5 SRNKKLTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATV 64 RNK IR A +++ +LL + E T S + +QS Q I + Sbjct: 1 MRNK--FIRIMAGVLSAFMLLSQLTAVAEEK--TNENTSASAESAEQSKQTTPQIAEPKL 56 Query: 65 SRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILP 124 + + + +N+ + A + KLD L G+N V+ + + Sbjct: 57 TLSNELKATVINLGDFA--AEKFGENFSKKLDTLIAYGMNGVYINPYGKDGTYYTTN--- 111 Query: 125 WSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQ 184 N D L+ L+ A K+GM+ + +F ++N T++ Sbjct: 112 ----------MNKSGDRLEKALEAATKKGMQRYVYF---------------DINKTMAAC 146 Query: 185 PASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPG 244 P + D++ S + +Y +G+ ++ T Sbjct: 147 PDG----------------------EDCYDYLVSEAHKFALKYRCNGIILTGFYGT---- 180 Query: 245 SRLNDNETYRKY--GGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNR 302 N+N Y +Y G+ +W + + + VS I+ + G+ VW N Sbjct: 181 ---NNNSAYEEYMKNGSGIGYKNWLYDTVEYKFSTVSGVIRLSDNSIAVGIDAKDVWANA 237 Query: 303 SHDPLGSDTRG-AAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADV 361 S + GSDT A+ YADT+ +VE+GL D+I ++ + WW++V Sbjct: 238 SKNKKGSDTSAKYTAFYNGYADTKSFVEKGLTDFIVVNASGSLDNETVGFENVCSWWSNV 297 Query: 362 VKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYL 421 K + YI V KI D G +L KQL D + SG++ + E L Sbjct: 298 AKSAKIPFYI------VHHNEKIGTDEDGWGVEDQLLKQLAKADELDNYSGSVFYSEKSL 351 Query: 422 NKPQTQQAVSYLQSRW 437 + L + Sbjct: 352 -EENPMGTTDTLTKYF 366 >UniRef50_C3R3M7 Putative uncharacterized protein n=2 Tax=Bacteroides sp. 2_2_4 RepID=C3R3M7_9BACE Length = 432 Score = 146 bits (369), Expect = 1e-33, Method: Composition-based stats. Identities = 64/337 (18%), Positives = 114/337 (33%), Gaps = 54/337 (16%) Query: 26 CSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRAR 85 CS K+ E P P + M +W + + Sbjct: 19 CSTKTVESELPEPNPPTPVIPEEPTPEKEKM--LWFDAEANFERFSK------------- 63 Query: 86 VQQQAMIDKLDHLQRLGINTVFFQVKP-DGTALWPSKI-LPWSDLMTGKIGENPGYDPLQ 143 ++ + LD + G N + V+P G AL+ S P +DL I + Y LQ Sbjct: 64 --KENITYYLDLAKSTGFNKIVVDVRPVQGDALFKSSYLTPLTDLAGTHIERDWNY--LQ 119 Query: 144 FMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDR- 202 F +DEAHKR +KV + + + + + T + Y + + I D+ Sbjct: 120 FFIDEAHKRELKVTVSATIFTAGLPSSKNGMAYRDDTWDGKTCLEYTKDQGLIDIKDDKT 179 Query: 203 ---FVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNE-TYRKY-- 256 L+P +PEVQD+ + + E+V+ Y DG D Y + + +Y Sbjct: 180 KVSAFLNPVLPEVQDFCLNFIKELVTNYNFDGFALDYCRYPGDESDFSEATKIAFEQYIG 239 Query: 257 ---------------------GGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSP 295 G + ++R + + +V IK+IKP ++ Sbjct: 240 KQLDRFPDDIFIWNTDGTKRTGTYYKKWWEFRSMVIRNFVERVRTEIKNIKPDIQLEY-- 297 Query: 296 AGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGL 332 W + + A+ + ++ W Sbjct: 298 ---WAASWIHAIYGQGQNWASTEYDFSKEYSWASPEY 331 >UniRef50_C2FS66 Putative uncharacterized protein n=1 Tax=Sphingobacterium spiritivorum ATCC 33300 RepID=C2FS66_9SPHI Length = 172 Score = 145 bits (366), Expect = 3e-33, Method: Composition-based stats. Identities = 51/142 (35%), Positives = 78/142 (54%), Gaps = 10/142 (7%) Query: 51 QSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQV 110 + +RG+W+ATV+ +DWP + Q+Q +I+ LD QR G+N +FFQ+ Sbjct: 26 SPKRELRGVWIATVANIDWPSR-------DNESSERQKQELINILDAHQRAGLNAIFFQI 78 Query: 111 KPDGTALWPSKILPWSDLMTGKIG--ENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVN 168 +P A + PWS ++G G +P YDPL+F+++EAHKRGM++HAW NPYR S Sbjct: 79 RPAADAFYAKGREPWSRYLSGVQGKAPSPFYDPLEFVIEEAHKRGMELHAWVNPYRASTT 138 Query: 169 TKPGTIRELNSTLSQQPASVYV 190 P + + +P Sbjct: 139 LNPAHFSK-DHITRTKPEWFLN 159 >UniRef50_B0P7J4 Putative uncharacterized protein n=1 Tax=Anaerotruncus colihominis DSM 17241 RepID=B0P7J4_9FIRM Length = 1211 Score = 144 bits (363), Expect = 6e-33, Method: Composition-based stats. Identities = 88/422 (20%), Positives = 151/422 (35%), Gaps = 76/422 (18%) Query: 26 CSCKSTPPESMVTPPAGSKPPATTQQSS------------QPMRGIWLATVSRLDWPPVS 73 + PP++ PP + T + MRG+ ++ D+ Sbjct: 86 SAENPAPPDASSAPPDTAGETGTDSSDNGQADEPVYFNVPTEMRGVMISA--GTDYLTNG 143 Query: 74 SVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKI 133 + + + + + L Q+L +NTV + + L+ S L + L Sbjct: 144 TDVSAQELATQ------LDEALAAAQQLTMNTVIIDTQYGDSVLFESSALESAPL----- 192 Query: 134 GENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHR 193 G D +++ +A + G V+A Y VS + + + L Sbjct: 193 ----GLDVTEYLCAKAREMGFYVYA---TYDVSTRSGGEGLTADGAAL------------ 233 Query: 194 DWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETY 253 D + + Y DG+ D Y +SP + Y Sbjct: 234 --------------------DDLAENIGAFAEAYKPDGILLDGYECADSPAAY----AGY 269 Query: 254 RKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRG 313 + GG ++R + L+ + ++ PGV+ G+ VW+N DP GSDT+ Sbjct: 270 LQSGGGM-GYEAYQRQVPRALLETAAAAVRENAPGVQVGLYTQAVWQNSDADPDGSDTKA 328 Query: 314 -AAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIG 372 A ADTR +V+ GL D++ + Y + AR+ V+A WWA VV T T+LY+ Sbjct: 329 ETTALGTGNADTRAFVKDGLFDFVMVKNYGSTNEETARFGVVAAWWAGVVDGTDTKLYMM 388 Query: 373 IAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAVSY 432 A +VG S +L Q+ + SG+ L + Sbjct: 389 HAADRVGTQSVG------WTVYEQLTAQIIRLEEAGGSSGSAFNSLAALRSDPGGSTTTL 442 Query: 433 LQ 434 +Q Sbjct: 443 IQ 444 >UniRef50_A8F7U2 Putative uncharacterized protein n=2 Tax=Thermotogaceae RepID=A8F7U2_THELT Length = 367 Score = 143 bits (360), Expect = 1e-32, Method: Composition-based stats. Identities = 62/392 (15%), Positives = 135/392 (34%), Gaps = 58/392 (14%) Query: 64 VSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKIL 123 V+ P + + + + ++ + +G ++ QV A + S+IL Sbjct: 13 VTTTLMPYPLGIWVVRDQIT---SIEKINRVIEIAKEVGATRIYVQVVGRADAYYNSEIL 69 Query: 124 PWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQ 183 P ++ ++ +P +DPL+ ++D A G+K+ AW N + K + ++ Sbjct: 70 PKAETLSE---CSPDFDPLKEIIDLAKISGIKISAWMNVFYAWPFGKKPVSEK--HVVNV 124 Query: 184 QPASV-YVQHRDWIRTSGD-------RFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFD 235 P + Y Q+ + L+P + +V+ ++++I E+ Y VD + D Sbjct: 125 HPDWITYDQNGKSMLEYASSPEINTPGLFLEPALEDVKKFVSNIAEEIAKNYDVDEIHLD 184 Query: 236 DYFYTESPGSRLNDNET-YRKYGGAFASKAD----------WRRNNTQQLIAKVSHTIKS 284 Y D YR++ + +R + + + + Sbjct: 185 YIRYPYKTFGYHPDAMKIYREWLKKAIQEKKLTNLGEGFDLFRIQQVSDTVKLIYEKVHN 244 Query: 285 IKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPF 344 + A V+ D + +G +W+E LDY Y Sbjct: 245 YGKKL-----SAAVFAYYEQDAISQRLQG----------WLQWLEGEYLDYACLMAYENN 289 Query: 345 SRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLN 404 + Y +A ++ +G+ YK+ E + E+ K + Sbjct: 290 RDTVEYYVK----YAVKALGAAEKIRVGLGAYKMTEN---------PEKLYEIAKSVVEK 336 Query: 405 DAVPEISGTILFREDYLNKPQTQQAVSYLQSR 436 EI ++F + L + ++ V+ + Sbjct: 337 YRPDEI---LIFSFENLLDEKVRKYVADIARL 365 >UniRef50_A9KMJ8 Putative uncharacterized protein n=1 Tax=Clostridium phytofermentans ISDg RepID=A9KMJ8_CLOPH Length = 1263 Score = 140 bits (353), Expect = 9e-32, Method: Composition-based stats. Identities = 61/394 (15%), Positives = 121/394 (30%), Gaps = 76/394 (19%) Query: 89 QAMIDKLDHLQRLGINTVFFQVKP-DGTALWPSKILPWSDLMTGKIGENP----GYDPLQ 143 + + + + +R GI + F VK +G + + + MT N D L+ Sbjct: 461 EKIQKMMANAKRAGITALAFDVKGVEGYVSYKKATVSNTPYMTETKNPNKAVAMDIDFLE 520 Query: 144 FMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIR------ 197 ML EAH G+K++A N + ++ +T Sbjct: 521 EMLAEAHANGIKLYASSNFFTEGNIATNDYAFDIRNTHPDWAEVFQTPEDKGELKSILNS 580 Query: 198 -TSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLN-DNETYRK 255 + ++P EV+ +IV +V+ Y VDG+ D Y N E + Sbjct: 581 SRNSTLLFVNPANEEVRAHELAIVKDVLENYAVDGIILDRARYDNQYADFSNLSKEQFMA 640 Query: 256 Y--------------------------GGAFASKADWRRNNTQQLIAKVSHTIKSIK--- 286 Y G + +R + +++V I K Sbjct: 641 YLQGKGKTLQNWPDDAFKIKADGSMVTGQHYLEWLSYRSTVIESFVSEVRTLIDQYKTSQ 700 Query: 287 -PGVEFGVSPAGVWRNRSHDPLGSDTRGAAAY--------DESYADTRRWVEQGLL---D 334 ++ + G W + + + Y +E YA + + + D Sbjct: 701 NRNIDLA-AYVGSWYESYYQNGVNWADSSFEYNERLGFPMEELYAKEFEYSKTSYVKHID 759 Query: 335 YIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGV 394 +I Y+ +Y L +++ + LY I + Sbjct: 760 FIMTGCYYTTEALMQKYTTL----NNILINNQVPLYASIDL----------------TNL 799 Query: 395 PELKKQ-LDLNDAVPEISGTILFREDYLNKPQTQ 427 E Q + A G+++F +++ + + Sbjct: 800 SEAPDQRMIFQAAYQHSEGSMIFDLCFVDWDKIR 833 >UniRef50_Q8AAL7 S-layer related protein, sialic acid-specific 9-O-acetylesterase n=9 Tax=Bacteroidales RepID=Q8AAL7_BACTN Length = 884 Score = 138 bits (347), Expect = 4e-31, Method: Composition-based stats. Identities = 49/310 (15%), Positives = 98/310 (31%), Gaps = 38/310 (12%) Query: 71 PVSSVNISNPTSRARVQQQAMIDK-LDHLQRLGINTVFFQVKP-DGTALWPSKILPWSDL 128 + + I + R + ID L+ ++ LG ++P G L+ S+ P Sbjct: 475 KPALMWIDAEANFERFSHKDSIDYYLEKIKSLGFTHAVVDIRPITGEVLYKSEYAPQMKE 534 Query: 129 MTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASV 188 G + +D L + + + H+ G+++HA N + N + Sbjct: 535 WKGAKAGD--FDYLGYFIKKGHELGLEIHASLNVFCAGHNYFDRGMVYSGHPEWASMVYT 592 Query: 189 YVQH--RDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRY-PVDGVQFDDYFYTESPGS 245 + +++P E + I +++ EVV++Y +DG+ D Y Sbjct: 593 PDKGIIPITEEKHKYGAMINPLNEEYRTHILNVLKEVVTKYPDLDGLMLDRVRYDGITAD 652 Query: 246 RL----------------------------NDNETYRKYGGAFASKADWRRNNTQQLIAK 277 D + + G F +WR N +A Sbjct: 653 FSSLSRKKFEEYIGKKVANFPEDIFRWTKNADGKYTTQPGKYFRKWLEWRTKNITDFMAL 712 Query: 278 VSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGL---LD 334 +K+ P V FG + + + ++ + T + G +D Sbjct: 713 ARKEVKAANPDVSFGTYTGAWYPSYYEVGVNFASKEYDPGKDFSWATPEYKNYGYAELID 772 Query: 335 YIAPQIYWPF 344 A Y+ Sbjct: 773 LYATGNYYTD 782 >UniRef50_D1BUC2 Putative uncharacterized protein n=1 Tax=Xylanimonas cellulosilytica DSM 15894 RepID=D1BUC2_XYLCX Length = 806 Score = 136 bits (342), Expect = 2e-30, Method: Composition-based stats. Identities = 62/380 (16%), Positives = 122/380 (32%), Gaps = 69/380 (18%) Query: 63 TVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKI 122 T S P + + +A+ ++ + G+N V+ QV G ++PS + Sbjct: 341 TASYRSIPARVAESRGVWYRPEEKNPEAVEATVEAMASAGVNEVYLQVLSGGYTIYPSAV 400 Query: 123 LPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLS 182 + + + GYD L A + G+++HAW + +V G + + Sbjct: 401 A-VAHGLPAVRPDLAGYDALAAWKSAADENGIELHAWIDGLQVGNELGDG----IGPIVQ 455 Query: 183 QQPASVYVQH-----RDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDY 237 Q P + V + + LD P + ++ + E+VSRY + G+ D Sbjct: 456 QHPEWLAVDRAHAGTTTATPSFNGFYWLDITDPVARQYMIDVTTEMVSRYDLAGLNHDYM 515 Query: 238 FYTE----SPGSRLNDN--ETYRKYGGAFA----------SKADWRRNNT---QQLIAKV 278 Y + +D+ Y+ G + W+ + +L+ + Sbjct: 516 RYWDNGNAQDSYNFSDDSRAAYQALTGVDPVTLSPEADAAAWERWKAFVSSEEDRLVRDI 575 Query: 279 SHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAP 338 ++K P +P N A+ RW ++D + P Sbjct: 576 FRSVKKAAPTAVVSNAPEVGREN--------------------AEIGRW--NDVVDVVIP 613 Query: 339 QIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELK 398 Q Y +W D + +Y G++ G Sbjct: 614 QAYTAN---LDSIHQRVEWIQDTMTG-GQLVYTGLSAM------------YQRFGSARTV 657 Query: 399 KQLDLNDAVPEISGTILFRE 418 +Q + E G+++F Sbjct: 658 EQTQAARDLDE--GSVIFSW 675 >UniRef50_Q6ZE96 Slr7102 protein n=5 Tax=Cyanobacteria RepID=Q6ZE96_SYNY3 Length = 338 Score = 136 bits (341), Expect = 2e-30, Method: Composition-based stats. Identities = 73/420 (17%), Positives = 149/420 (35%), Gaps = 97/420 (23%) Query: 5 SRNKKLTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATV 64 + +++ PA+ V + LLL +C + P T + + M+G+WL V Sbjct: 1 MKKLLKSLKWPALFVGIILLLAACH--------------RAPTRTAKETDKMKGVWLTDV 46 Query: 65 SRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILP 124 + + + + L H+ + + V+F V L+P++ Sbjct: 47 GTMGLTYSTL----------------LDETLHHISKSDYDRVYFSVYGLRGQLYPTRQR- 89 Query: 125 WSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQ 184 I + P + + M E+ ++G+K +AWF + + + + Sbjct: 90 -----GDLIPKLPFPNAVGSMARESRRQGLKPYAWFEYGLM--------LPQFDPVAKNN 136 Query: 185 PASVYV-QHRDWIRTSGD--RFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTE 241 P + + + + + LDP PEV+ +I + + +++ + G+Q DD++ Sbjct: 137 PDWLLTMANGEQVIENHGVPMVWLDPSNPEVEAYILAHIDDILKEKSLAGIQLDDHW--- 193 Query: 242 SPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRN 301 R++G D+RR+ L KV IK+ P E +SP Sbjct: 194 ---------AVPRQFG-------DYRRS-LTALTTKVHEHIKTKNPEFELSLSP------ 230 Query: 302 RSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADV 361 + +E D RWV+QG++D + QIY Sbjct: 231 ---------NPYQFSLNEYNQDWLRWVKQGIVDEVVVQIYRSSPAEVQ---QAVNNSGIY 278 Query: 362 VKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYL 421 + +G+ + + + +K Q++ + G LF +++ Sbjct: 279 TASRYVPVGVGLYTGRK----------IKPFNLQSIKDQINAVEKQN--LGHSLFVWEFM 326 >UniRef50_Q2BFL2 Putative uncharacterized protein n=1 Tax=Bacillus sp. NRRL B-14911 RepID=Q2BFL2_9BACI Length = 813 Score = 135 bits (340), Expect = 3e-30, Method: Composition-based stats. Identities = 49/305 (16%), Positives = 89/305 (29%), Gaps = 46/305 (15%) Query: 70 PPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLM 129 P ++ + + LD ++ G N+V+ + G ++PS+ + L Sbjct: 356 PSHAAEQRAIWYRPEETTLAGVNQVLDRMEEAGFNSVYLETTFWGYTIYPSETMTEYGLP 415 Query: 130 TGKIGE-NPGY-----DPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQ 183 N Y D LQ + E KRG+ V AW + + + ++ + Sbjct: 416 AQHPNFRNADYGKYGSDLLQAYIKEGKKRGISVQAWTDGFMIGHSSLGLPSQFQVHPEWA 475 Query: 184 QPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESP 243 TS + + LD PEVQ ++ I E+ S+Y + G+ D Y Sbjct: 476 AIQRSNTTGEPKPDTSSNYYWLDIAQPEVQTFMLDIYKEMQSKYDIKGLNIDYMRYPHQS 535 Query: 244 GSRLN--DNETYRKY----------------GGAFASKADWRRNNTQQLIAKVSHTIKSI 285 + + Y + A W + + + K + Sbjct: 536 FEKSYGFSEKVRELYKAKTGIDPMELSPTATPEEWEKWAGWIQQRENDFVDGLHTQSKKL 595 Query: 286 KPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFS 345 +P +D Q +D + PQ Y Sbjct: 596 NSKFMLTATPEPGPEAV-----------------LISDW-----QEDIDGVIPQAYGHDF 633 Query: 346 RSAAR 350 S Sbjct: 634 NSIQS 638 >UniRef50_A9KIP9 Putative uncharacterized protein n=1 Tax=Clostridium phytofermentans ISDg RepID=A9KIP9_CLOPH Length = 690 Score = 135 bits (339), Expect = 4e-30, Method: Composition-based stats. Identities = 55/362 (15%), Positives = 106/362 (29%), Gaps = 64/362 (17%) Query: 70 PPVSSVNISNP-TSRARVQQQAMIDKLDHLQRLGINTVFFQVKP-DGTALWPSKILPWSD 127 P + + + + + + GI VK +G A + L Sbjct: 291 PKEIIMWAEQYVNAETTKTVERIESLIQTAKDAGITAFALDVKGCEGYAAYRKSTLTNVK 350 Query: 128 LMTGKIGENPGY----DPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQ 183 MT + D L+ + AH G++V+A FN + +L +T Sbjct: 351 YMTETTNPKKAFQMEIDFLEEFVKAAHASGLRVYASFNFFVEGNIASNDFAIDLPNT--- 407 Query: 184 QPASVYVQH-RD---------WIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQ 233 P V H + + + ++P EVQD+ V E++ Y VDGV Sbjct: 408 HPDWAEVLHVPEDQGELKSVLETKRNCMLCYVNPANKEVQDFELLRVKELLDNYEVDGVI 467 Query: 234 FDDYFYTESPGSR---------------------------LNDNETYRKYGGAFASKADW 266 D Y D E +G + + Sbjct: 468 MDRTRYDNQYADFSEVTRIQFVEYLKSKGKELVHWPKDIYSFDAEQKMIFGPLYLDWLTF 527 Query: 267 RRNNTQQLIAKVSHTI----KSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDE--- 319 R + Q ++ + K+ K + + G W + + + Y+E Sbjct: 528 RSSIIQGFARRLRGIVDEYAKNQKRPIALA-AYVGSWFDLYYQNGVNWGSKDFRYNERLN 586 Query: 320 ------SYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGI 373 D + +D++ Y+ S +Y ++V + + + Sbjct: 587 FPVSKLYTEDYSKTSYVDYIDFLMIGCYYGTSEMIEKYTT----IGNIVTNHKVPMMASM 642 Query: 374 AF 375 + Sbjct: 643 SL 644 >UniRef50_D1PX02 Putative uncharacterized protein n=2 Tax=Prevotella RepID=D1PX02_9BACT Length = 893 Score = 131 bits (329), Expect = 6e-29, Method: Composition-based stats. Identities = 65/406 (16%), Positives = 115/406 (28%), Gaps = 46/406 (11%) Query: 71 PVSSVNISNPTSRARVQQQAMIDK-LDHLQRLGINTVFFQVK-PDGTALWPSKILPWSDL 128 + + + R I +D GI + +K G L+ SK Sbjct: 22 KPKVMWLDCSANFERFSYPDSIRYYVDKCHEAGITHLVLDIKDNTGEVLYDSKYTSRKR- 80 Query: 129 MTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASV 188 T K P +D + + EAHK+GM + A N + + + + Sbjct: 81 -TWKGFTRPDFDFINTFISEAHKQGMVIFAGMNIFADGSKAHGQPRGAVFGKNKKWQSIN 139 Query: 189 YVQHRDWI----RTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPG 244 YV + + L+P + +VQ +IV EVV R+ DG+ D Y Sbjct: 140 YVPGKGLVPVTELNGKTSMFLNPALKDVQRHEINIVKEVVKRFKFDGIMLDRARYDCIDS 199 Query: 245 SRLNDNET-YRKY---------------------------GGAFASKADWRRNNTQQLIA 276 + T + KY G + +WR + I Sbjct: 200 DFSEASRTLFEKYIGEKLNKYPEDIYEWKANDKGSFDRVPGPYYTKWIEWRASVIYGFIK 259 Query: 277 KVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGL---L 333 V IK I P + + ++ E T + + G + Sbjct: 260 DVRTAIKKIDPQCMLASYTGAWYPTYYEVGVNWASKKYDPSKEFPWATPEYRQYGYAELI 319 Query: 334 DYIAPQIYWPFSRSAARYDVL---AKWWADVVKPTRTRLYIGIAFYK----VGEPSKIEP 386 D+ Y+ Y V G Y G Sbjct: 320 DFYTNGNYYSNVTIDDYYKSSGLHKNETDSEVSSGEYLCVEGGCKYTRRLLQGAKPFYGG 379 Query: 387 DWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAVSY 432 ++ + L+ Q + + E G ++F ++ + + Sbjct: 380 LYVEDYRKNALQFQKAVRMNLKESDGVMIFDIVHIILYDWWKELKE 425 >UniRef50_C3R3K8 S-layer protein n=4 Tax=Bacteroides RepID=C3R3K8_9BACE Length = 672 Score = 128 bits (321), Expect = 4e-28, Method: Composition-based stats. Identities = 58/336 (17%), Positives = 114/336 (33%), Gaps = 60/336 (17%) Query: 80 PTSRARVQQQAMIDKLDHLQRLGINTVFFQVK-PDGTALWPSKILPWSDLMT-----GKI 133 P ++ ++A+ +++ ++ G ++ VK P+G + L + +T K Sbjct: 275 PNAKVLTNREAVATMVNNAKKAGFTSIGLDVKGPEGYVSYRKNDLSKTPYLTATKNPNKQ 334 Query: 134 GENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPG----TIRELNSTLSQQPASV- 188 ++ G+D L+ +L EAHK G+KV+ FN + T + + Q+P Sbjct: 335 VKDDGFDLLEVVLQEAHKIGLKVYTSFNFFTEGNITVNDYAILHEHKDWEEIVQRPEDKG 394 Query: 189 ----YVQHRDWIRTSGDRF----VLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYT 240 + + + ++P EVQD+ V EV+ Y +DG+ D Y Sbjct: 395 KLLKITESTRGKEAAKGKLLALAFVNPSNKEVQDFQLLRVEEVLKNYDIDGIVLDRCRYD 454 Query: 241 ESPGSRLN-DNETYRKY--------------------------GGAFASKADWRRNNTQQ 273 + + +Y G F +R Sbjct: 455 NLYADFSHVTRNAFEEYLEKEGKILENFPADAFKIDKEGTLVKGQFFKEWITFRSQTICD 514 Query: 274 LIAKVSHTIKSIK----PGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDE--SYADTRRW 327 ++ + K P ++ + G W + + YD+ S+ D+ + Sbjct: 515 FTGRIRSLVDKYKTEKNPDLKMA-AYVGSWYEVYYQNGVNWASNQFKYDDRLSFPDSEIY 573 Query: 328 VEQ-------GLLDYIAPQIYWPFSRSAARYDVLAK 356 E LD++ Y+ + RY L Sbjct: 574 GENYNKTSYLNNLDFLMIGTYYKTPKEVNRYITLGN 609 >UniRef50_UPI0001BC8648 hypothetical protein BacD2_02792 n=1 Tax=Bacteroides sp. D2 RepID=UPI0001BC8648 Length = 891 Score = 128 bits (321), Expect = 5e-28, Method: Composition-based stats. Identities = 54/308 (17%), Positives = 87/308 (28%), Gaps = 40/308 (12%) Query: 60 WLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDK-LDHLQRLGINTVFFQVK-PDGTAL 117 W+ V+ + + + R I +D GI + +K G L Sbjct: 14 WIGIVAFAS-GKPKVMWLDCSANFQRFSYPDSIRYYVDKCHEAGITHLVLDIKDNTGEVL 72 Query: 118 WPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIREL 177 +PSK P +D + ++ AH M + A N + N Sbjct: 73 YPSKYAIQKKNWKNFDR--PDFDFINTFIEAAHTHNMIIFAGMNIFADGQNIVKRGAVFD 130 Query: 178 NSTLSQQPASVYVQHRDWIRTSGDRF--VLDPGIPEVQDWITSIVAEVVSRYPVDGVQFD 235 Q V + + + L+P + EVQ + I+ EVV Y DG+ D Sbjct: 131 KHKKWQAINYVPRKGLLPVTEIDGKPTMFLNPALKEVQKYEIDIIKEVVRNYAFDGIMLD 190 Query: 236 DYFYTESPGSRLNDNET-YRKY---------------------------GGAFASKADWR 267 Y +++ + K+ G + WR Sbjct: 191 RARYDCIDSDFSPESKKMFEKFIGKKVERFPEDIFEWRPNAEGGIDRVGGPYYHQWITWR 250 Query: 268 RNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRW 327 + I V +IK IKP + + YD D W Sbjct: 251 TSVIYNFIKDVRTSIKKIKPECMLAAYTGAWYPTYF---EVGVNWASRQYD-VSKDF-SW 305 Query: 328 VEQGLLDY 335 DY Sbjct: 306 ATPDYKDY 313 >UniRef50_B0PF61 Putative uncharacterized protein n=1 Tax=Anaerotruncus colihominis DSM 17241 RepID=B0PF61_9FIRM Length = 915 Score = 126 bits (315), Expect = 2e-27, Method: Composition-based stats. Identities = 58/233 (24%), Positives = 100/233 (42%), Gaps = 25/233 (10%) Query: 6 RNKKLTIRRPAILVALALLLCSCKSTPPESMVTPPAGSK-PPATTQQSSQPMRGIWLATV 64 K TI R +L +LA++ +C ++T P SK PPA + + + + + T Sbjct: 3 HETKRTILRT-LLASLAIVAATCAVLYASDLLTSPISSKTPPAGIPAAGEQLHALIVRTR 61 Query: 65 SRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILP 124 D+P P A+ Q+ + + G N VFF+ P AL+ S ILP Sbjct: 62 GNADFPS-------APGLSAKQQRAQLDEIAAFAGEYGYNAVFFEAVPSCDALYRSSILP 114 Query: 125 WSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQ 184 S G+ G +DPL ++++ + G++V+A +P+ VS Sbjct: 115 SSAYWMGEQGAFAFFDPLDYLVNVCKESGIQVYAMIDPFAVSAEDLAE------------ 162 Query: 185 PASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDY 237 +S ++ +WI G +P VQ S+ AE+ +RY + G+ + Sbjct: 163 -SSPASKNPEWIAADGR---FNPTELGVQQLAGSVAAELATRYDIAGIVLEGV 211 >UniRef50_A9WDB3 Putative uncharacterized protein n=5 Tax=Chloroflexaceae RepID=A9WDB3_CHLAA Length = 693 Score = 125 bits (313), Expect = 4e-27, Method: Composition-based stats. Identities = 58/356 (16%), Positives = 111/356 (31%), Gaps = 44/356 (12%) Query: 91 MIDKLDHLQRLGINTVFFQVKPD-----GTALWPSKILPWSDLMTGKIGENPGYDPLQFM 145 + LD + R +N + +K D G + S++ L+ P D Q + Sbjct: 364 VDRFLDLIDRTELNAIVIDIKSDLRDDLGMVYYDSQV----PLVRELGLSTPRVD-FQSI 418 Query: 146 LDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVL 205 L +A +RG+ A RV + + + + S + S + D+ L Sbjct: 419 LAKAKERGIYTIA-----RVQLFSHDNALSDARPEWSIRLRSTGEVYADYPGPGIRYAYL 473 Query: 206 DPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPG--SRLNDNETYRKYGGAFASK 263 DP V D+ ++ E D + FD + + G D + + + Sbjct: 474 DPTNQNVWDYNIALAVEAAQ-MGFDEINFDYIRFPDWFGTREEFRDKLLFSEPIDPVGNP 532 Query: 264 ADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYAD 323 + + + + H + S G V G N + D + Sbjct: 533 GR-MYDVIIEFMQRAHHAVNSA--GAFMSVDVFGRVVNGPSLTIAQDMARMGEHT----- 584 Query: 324 TRRWVEQGLLDYIAPQIYW-----PFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKV 378 DY+ P Y A + V+ ++ + Sbjct: 585 ----------DYVCPMPYPSLWWGGLENIAVPVKFPYETLQIAVRNGGRQMAGSYGRQRP 634 Query: 379 GEPSKIEP--DWMINGGVPELKKQLDLNDAVPEIS-GTILFREDYLNKPQTQQAVS 431 +P ++ G E++ Q+D + PE + G +L+ + K AV Sbjct: 635 WLQDHTDPWSPVVVEYGPAEVRAQIDATEEQPEAASGWLLYDSANIYKGAFNGAVR 690 >UniRef50_B2T9L3 Trehalose synthase n=119 Tax=Bacteria RepID=B2T9L3_BURPP Length = 1154 Score = 123 bits (309), Expect = 1e-26, Method: Composition-based stats. Identities = 55/328 (16%), Positives = 97/328 (29%), Gaps = 62/328 (18%) Query: 28 CKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVN------ISNPT 81 K P T P T + RG A W + + + Sbjct: 1 MKRDDPAETTTQSPVGNAPGNTAKPRTNRRGKPAALSDDPLWYKDAIIYQVHIKSFFDAN 60 Query: 82 SRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDP 141 + +I KLD++ LG+N ++ +PS + +P Y Sbjct: 61 NDGVGDFPGLIAKLDYIAELGVNAIWL------LPFYPSPRRDDGYDIADYRNVHPDYGN 114 Query: 142 L---QFMLDEAHKRGMKV------------HAWF-------------NPYRVSVNTKPGT 173 L + + EAH RG++V H WF N Y S + Sbjct: 115 LSDVKRFIQEAHARGIRVITELVINHTSDQHPWFQRARRAKPGSNHRNYYVWSDTDQKYQ 174 Query: 174 ---IRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVD 230 I ++S S W R + L+ P V + ++ + +D Sbjct: 175 ETRIIFIDSEPSNWTHDPVAGAYYWHRFYSHQPDLNFDNPAVLKEVLQVMRFWL-DMGID 233 Query: 231 GVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVE 290 G++ D Y + G + + T ++ K+ TI + P Sbjct: 234 GLRLDAVPYLV------------EREGTNNENLPE-----THAVLKKIRATIDAEYPNRM 276 Query: 291 FGVSPAGVWRNRSHDPLGSDTRGAAAYD 318 ++ A W + G + A+ Sbjct: 277 L-LAEANQWPEDVKEYFGDEDECHMAFH 303 >UniRef50_Q5I942 Alpha-amylase n=1 Tax=Anaerobranca gottschalkii RepID=Q5I942_9FIRM Length = 532 Score = 120 bits (300), Expect = 1e-25, Method: Composition-based stats. Identities = 45/269 (16%), Positives = 84/269 (31%), Gaps = 27/269 (10%) Query: 10 LTIRRPAILVALALLLC--SCKSTPPESMVTPPAGSKPPATTQQSSQPM--RGIWLATVS 65 +T + +L+ L+ C S T Q GI T Sbjct: 1 MTSKILRVLLVFLLIFAIVGCTSDKQGPQETYKNIDDTVTHGQNYDGSFSREGIQEVTFE 60 Query: 66 RLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPW 125 + + N + T +I+ LD+++ LG+N ++ G + ++ + Sbjct: 61 NGVFYQIFVYNFRDSTGDGVGDLGGIIESLDYIESLGVNGIWLTPITHGASYHKYDVVDY 120 Query: 126 SDLMTGKIGENPGYDPLQFMLDEAHKRGMKV------------HAWFNPYRVSVNTKPGT 173 + + ++ EAHKRG+KV H WF N+K Sbjct: 121 YAVDPEFGTME----DFETLISEAHKRGIKVIIDLVINHTSDRHPWFKAAASDPNSKFRD 176 Query: 174 IRELNSTLSQQPASVYVQHRDWIRTSGDRFV-----LDPGIPEVQDWITSIVAEVVSRYP 228 + +P S + F L+ P V++ + I + + Sbjct: 177 YYIWAAHDEPRPGSGWRHLSGTTWFYLAHFWERMPDLNFDNPAVREEVKRIAKFWLDK-G 235 Query: 229 VDGVQFDDYFYTE-SPGSRLNDNETYRKY 256 VDG + D + P + +Y Sbjct: 236 VDGFRLDAAKHLYSDPAKNHQFWNEFYQY 264 >UniRef50_Q5WAP8 Maltogenic amylase n=1 Tax=Bacillus clausii KSM-K16 RepID=Q5WAP8_BACSK Length = 589 Score = 119 bits (298), Expect = 2e-25, Method: Composition-based stats. Identities = 42/261 (16%), Positives = 86/261 (32%), Gaps = 50/261 (19%) Query: 70 PPVSSVNISNP---TSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWS 126 P + S P T+ Q ++D LD+LQ+LGIN ++ + + + Sbjct: 152 PENTLAWGSAPPTATNYFGGDLQGIVDHLDYLQKLGINGIYLTPIFKAFSNHKYDTIDYL 211 Query: 127 DLMTGKIGENPGYDPLQFMLDEAHKRGMKVH---------AWFNPYRVSVNTKPGTIREL 177 + E L+ ++DE HKRG++V +F P++ + + + Sbjct: 212 KVDPQFGDET----TLKLLVDECHKRGIRVMLDAVFNHAGLYFPPFQDVLKHQQESEYRD 267 Query: 178 NSTLSQQPASVYV-QHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDD 236 + Q P + D + L+ V+D++ ++ + + +DG + D Sbjct: 268 WFHIRQFPVRAEEPPNYDTFAFTPLMPKLNTANEAVKDYLLNVATYWIKEFDIDGWRLDV 327 Query: 237 YFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPA 296 + + + +K++KP + Sbjct: 328 ANEVDH------------------------------AFWREFRNRVKALKPDLYI---LG 354 Query: 297 GVWRNRSHDPLGSDTRGAAAY 317 +W N G G Y Sbjct: 355 EIWHNAYPWLQGDQFDGVMNY 375 >UniRef50_C5A3T6 Glycosyl hydrolase, putative n=1 Tax=Thermococcus gammatolerans EJ3 RepID=C5A3T6_THEGJ Length = 909 Score = 118 bits (296), Expect = 3e-25, Method: Composition-based stats. Identities = 56/358 (15%), Positives = 108/358 (30%), Gaps = 70/358 (19%) Query: 88 QQAMIDKLDHLQRLGINTVFFQVKPD-GTALWPSKILPWSDLMTGKIGENPGYDPLQFML 146 + A + L+ GI VF +VK G ++PSK+ P + L+ +L Sbjct: 84 KTAAERLVSELKEAGITDVFIEVKLTLGYVIYPSKVYPERTYPAYPYNTT---NILKPLL 140 Query: 147 DEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLD 206 +EAH+ G++VHAW + G + +R Sbjct: 141 EEAHRNGIRVHAWMIVH--YDKYFFGKTDPIWHVGKASKNWEAYPVPGRVR--------- 189 Query: 207 PGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDN---------------- 250 E + +I E++S DG+ D Y S + Sbjct: 190 LSNKEYLKVLENIAKELIS-MGFDGIHLDYIRYPHMVYSFSPKDLERAEEAGINVTKVTL 248 Query: 251 -------------------------ETYRKYGGAFASKADW---RRNNTQQLIAKVSHTI 282 ++ Y W RR + + ++ + Sbjct: 249 AVEHTFYNDVPIPGTNKTMGPKDPYYIFKLYVKGDKDIVKWFELRRKDVDSYVGNITQVV 308 Query: 283 KSIKPG----VEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAP 338 S+K + W + L + Y + ++D +V+ G +D++ P Sbjct: 309 HSLKTWNGEKPIVSAALMPDWT--RDNILYPEEFQIMHYAQVWSD---FVKLG-VDWLIP 362 Query: 339 QIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPE 396 Y+ + + K + T++ +G+ Y + +E PE Sbjct: 363 MAYFKDYGEPISWVGVVKGHLVGITGTKSVPLVGVQSYGIPMEKVLEEKDFALSEFPE 420 >UniRef50_A9B0X0 Putative uncharacterized protein n=1 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9B0X0_HERA2 Length = 679 Score = 118 bits (295), Expect = 4e-25, Method: Composition-based stats. Identities = 53/361 (14%), Positives = 108/361 (29%), Gaps = 57/361 (15%) Query: 81 TSRARVQQQAMIDKLDHLQRLGINTVFFQVKPD-----GTALWPSKILPWSDLMTGKIGE 135 T+ + ++ + D + + +N V +K D G + S+ + Sbjct: 357 TAATGSSKASLSELFDLVDQTEVNAVVIDIKLDIAGDVGGVGYLSQH-----PLVLAAET 411 Query: 136 NPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQH--R 193 + Y +++++ EA KR + + R N P Sbjct: 412 SSDYLDMEWIVAEARKRDIYLIGRMAVMR------------DNRLADAHPEWAAQSKATG 459 Query: 194 DWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETY 253 G LDP P V ++ I E+ + + D VQFD + + N + Sbjct: 460 GVWEDDGGLKWLDPFNPNVTEYNVGIAKEIAA-FGFDEVQFDYIRFPSDGSTS---NLVF 515 Query: 254 RKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRG 313 K + + ++ + I G F + G R+ +G Sbjct: 516 SKPI-DPKNNPEVMYEAIGNVLKRAHGDI--NGSGAFFSIDVFGYATWRNMWEIGQSLEI 572 Query: 314 AAAYDESYADTRRWVEQGLLDYIAPQIYWP---------FSRSAARYDVLAKWWADVVKP 364 A + DY+ +Y + A Y+++ K Sbjct: 573 MADHT---------------DYVCAMVYPSHYDRNELGFDNADAYPYEIVKDSIEKGQKR 617 Query: 365 TRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKP 424 + + + + + ++P + G E++ Q+ V G IL+ P Sbjct: 618 MEGKYAVQRPWLQAFTATWLDP--VTPYGRTEVRAQMQAVAEVEGTYGWILWNAANYYDP 675 Query: 425 Q 425 Sbjct: 676 D 676 >UniRef50_C7NWY1 Alpha amylase catalytic region n=1 Tax=Halomicrobium mukohataei DSM 12286 RepID=C7NWY1_HALMD Length = 587 Score = 115 bits (287), Expect = 4e-24, Method: Composition-based stats. Identities = 40/265 (15%), Positives = 73/265 (27%), Gaps = 50/265 (18%) Query: 81 TSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYD 140 S + +ID LD+L LGI ++ + + + + Sbjct: 172 DSFFGGDLEGIIDTLDYLADLGITALYLTPVFESLSNHKYNTADYEQIDPHFGDTE---- 227 Query: 141 PLQFMLDEAHKRGMKVHAW---------FNPYRVSVNTKPGTIRELNSTLSQQPASV-YV 190 L ++D AH RG++V F P++ + + + + P Sbjct: 228 TLSRLVDAAHDRGIRVMLDAVFNHCGRQFEPFQDVIEHGRESEYVDWFHIHEFPIQFEPR 287 Query: 191 QHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDN 250 D L+ PEVQ ++ + + +DG + D Sbjct: 288 PSYDTFGFESYMPKLNTENPEVQSYLIDVATHWIEETDIDGWRLDV-------------- 333 Query: 251 ETYRKYGGAFASKADWRRNNTQ-QLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGS 309 + Q +K +KP +W + G Sbjct: 334 -----------------ADEVDHQFWRAFRQAVKDVKPDAYI---LGEIWHDSRPWLRGD 373 Query: 310 DTRGAAAYDESYADTRRWVEQGLLD 334 Y YA ++ G LD Sbjct: 374 QFDAVMNYPFMYA-VDGFLSDGSLD 397 >UniRef50_A8F7H3 Putative uncharacterized protein n=1 Tax=Thermotoga lettingae TMO RepID=A8F7H3_THELT Length = 560 Score = 115 bits (287), Expect = 4e-24, Method: Composition-based stats. Identities = 70/464 (15%), Positives = 136/464 (29%), Gaps = 101/464 (21%) Query: 13 RRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWL--ATVSRLDWP 70 R+ +I V +A + K + ++ +W+ +T++ L Sbjct: 158 RKISIDVFIATVSKILKEFSDTGKLPNDVILSKALLPFSWPAKIKAVWVWGSTLANL--- 214 Query: 71 PVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPD-GTALWPSKILPWSDLM 129 + + L L+ +G + VK GT WPS+I Sbjct: 215 -------------------GVENTLQQLKEIGFTDILLLVKGTSGTVNWPSQIA------ 249 Query: 130 TGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVY 189 +G + L G+++H WF + T ++ P +V Sbjct: 250 ---LGFSSDTTVLPRASKFCRTSGLRLHVWFVCNQDQTFTSTYPESKMYGI----PKTVD 302 Query: 190 VQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLN- 248 + +T E ++++ S++ EV+ Y DG+ FD Y Sbjct: 303 GDPQRAGKTVDFVGF-----DEYREYMESLIREVMEDYKPDGLHFDYIRYPTGAWGWGPA 357 Query: 249 ------------------------------DNETY-RKYGGAFASKADW---RRNNTQQL 274 DN+++ Y A+ W R Q+ Sbjct: 358 EIQTAMENGLTELDIQYLKNLAIQTWGTNGDNQSFINAYISGDATVTKWVEIRSKIVQKF 417 Query: 275 IAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLD 334 + +S K +K V A + + G Y ++YA + + Sbjct: 418 LQDLSSCAKQVKSDVIIS---AALMPEPASLDSTEKAFGLVHYGQNYA-----IFSDDCE 469 Query: 335 YIAPQIYWPFSRSAARYDVLAKWW-ADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGG 393 I P Y + + + A T+L +G+ Y Sbjct: 470 MIVPMAYHRDYGKDSSWITEEIFTGARQQIQANTKLVLGLQGYS-------------PVT 516 Query: 394 VPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQSRW 437 EL Q+ + G +FR + + ++A+ + Sbjct: 517 GDELA-QIINDCIDKNAEGICVFRAGTILNTEIEEALRNTFKEF 559 >UniRef50_Q3JIJ6 Conserved domain protein n=67 Tax=Betaproteobacteria RepID=Q3JIJ6_BURP1 Length = 619 Score = 115 bits (287), Expect = 4e-24, Method: Composition-based stats. Identities = 48/345 (13%), Positives = 94/345 (27%), Gaps = 54/345 (15%) Query: 92 IDKLDHLQRLGINTVFFQVKPD-GTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAH 150 +D IN + +K D G +PS S D ++ + H Sbjct: 316 EAAVDLKGATAINALVVDMKGDRGITPYPSAARRASGAAARTPNAPVVRD-FAALVADLH 374 Query: 151 KRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRF-VLDPGI 209 +RG+ + A + + + + P I ++ +DP + Sbjct: 375 RRGLYLIARIVVF------------KDDPLAAAHPEWTVRDADGNIWRDREKLRWIDPSL 422 Query: 210 PEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRN 269 E + E D +QFD + ++ R + T R Sbjct: 423 RETWAHNLDVAEEAAKL-GFDEIQFDYVRFPDARELRFSVPNTRAN-----------RTA 470 Query: 270 NTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVE 329 + + V G D Sbjct: 471 AIAGFLRAARERLAPYN--VFVAADIFGYVCWNEDDTAIGQQIETLG------------- 515 Query: 330 QGLLDYIAPQIY-----WPF---SRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEP 381 G LDYI+P +Y W ++ A + + +V+ G+ F + Sbjct: 516 -GPLDYISPMLYPSGFTWGLPGCTQPTADPGQIVRR--SLVEARSRTGLPGVRFRPWLQA 572 Query: 382 SKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQT 426 + + E++ Q+D + + G +L+ P+ Sbjct: 573 FRDYAFDRRDFAAAEIRAQVDAAE-AADTDGWMLWNARNRYDPRQ 616 >UniRef50_Q8YK50 All8067 protein n=8 Tax=Cyanobacteria RepID=Q8YK50_ANASP Length = 399 Score = 114 bits (286), Expect = 5e-24, Method: Composition-based stats. Identities = 60/323 (18%), Positives = 117/323 (36%), Gaps = 50/323 (15%) Query: 22 ALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPV--SSVNISN 79 ++ + + +TP S+ T + + R + W + +S Sbjct: 64 SISIPTASATPKNSLPTSVNVASKNNKASSTITQERK---TVLPPNPWEKKLIRGIYLSR 120 Query: 80 PTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGY 139 + +Q + ++ + + GINT+ V +G ++ S ++ + + + Sbjct: 121 YQATNNADEQTIRQRVRYYRSQGINTIIHGVWGNGCTMYKSDVMQQTLGYSSCPNQFQE- 179 Query: 140 DPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTS 199 L +++DEAHK+GM+VHA+F K + + + V D Sbjct: 180 KWLNWLIDEAHKQGMQVHAYFE-----KGIKIDKNSPIFDLAVAK--NWMVPGIDKTYAG 232 Query: 200 GDRFVLDPGIPEVQDWITSIVAEVVSRYP-VDGVQFDDYFYTESPGSRLNDNETYRKYGG 258 D +VLD PEV + +I E V +YP VD VQ+DDY + D Sbjct: 233 IDHYVLDVEKPEVATFFKNISVEFVKKYPNVDAVQWDDYLGYYAELPGKTD--------- 283 Query: 259 AFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYD 318 R + + + +++ ++K V F + + + + Sbjct: 284 --------RTKHLTKFVQQMTSSMKEANSLVSFDICHHNPYWAKKYFA------------ 323 Query: 319 ESYADTRRWVEQGLLDYIAPQIY 341 AD +W +D + Q Y Sbjct: 324 ---ADWEQWG----VDRVFIQAY 339 >UniRef50_D2QQI8 Trehalose synthase n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QQI8_9SPHI Length = 1114 Score = 114 bits (285), Expect = 7e-24, Method: Composition-based stats. Identities = 53/295 (17%), Positives = 90/295 (30%), Gaps = 62/295 (21%) Query: 61 LATVSRLDWPPVSSVN------ISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDG 114 + + L W + + + Q +++KLD+LQ LG+ ++ Sbjct: 6 VEQLDNLRWYKDAIIYELHIKAFCDGNGDGIGDFQGLLEKLDYLQELGVTAIWL------ 59 Query: 115 TALWPSKILPWSDLMTGKIGENPGYDP---LQFMLDEAHKRGMKV------------HAW 159 +PS + + NP Y + +L EAH+R +KV H W Sbjct: 60 LPFYPSPLRDDGYDIADYYTINPSYGTIEQFKTLLREAHQRNLKVITELVINHSSDQHPW 119 Query: 160 FNPYRVSVNTKPGTIRELN----------------STLSQQPASVYVQHRDWIRTSGDRF 203 F R + P + S Q W R + Sbjct: 120 FQRARRAPKGSPEREYYVWTDDPTQFKDVRIIFQDFETSNWTWDQEAQQYYWHRFFHHQP 179 Query: 204 VLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASK 263 L+ P VQD + ++ VDG + D Y + + G + Sbjct: 180 DLNYDNPLVQDEVFKMIDYWC-ELGVDGFRLDAVPYL------------FEREGTNGENL 226 Query: 264 ADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYD 318 + T + K+ + PGV F ++ A +W S G Y Sbjct: 227 PE-----THAFLKKLRKHVDDHFPGVVF-LAEANMWPEDSASYFGDGDECHMNYH 275 >UniRef50_Q114S3 Putative uncharacterized protein n=2 Tax=Oscillatoriales RepID=Q114S3_TRIEI Length = 508 Score = 113 bits (282), Expect = 1e-23, Method: Composition-based stats. Identities = 67/371 (18%), Positives = 116/371 (31%), Gaps = 53/371 (14%) Query: 8 KKLTIRRPAILVALALLLCSCKSTPPE-----SMVTPPAGSKPPATTQQSSQPMRGIWLA 62 KK+ R+ + +L +L S K P S + +R L Sbjct: 9 KKIKSRKYFHINSLGAILFSLAVNLSPWFSAKVQAQTDIYCKLPPEAIASKENLRQAVLE 68 Query: 63 TVSRL---------------------DWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRL 101 +WP + + AR + LD + Sbjct: 69 GNKNAEKQYQDILIKHNREVGNCRMRNWPRTQGIWLRLYPCDAR--PGEIDRILDKIVNQ 126 Query: 102 GINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFN 161 G N V+ + DG L P+ P ++ D L L +A +RG++ +AW Sbjct: 127 GYNQVYIEAFYDGQVLLPAANNPTVWPSILRVPGYENVDLLADSLKKAKERGLRAYAWVF 186 Query: 162 PYRVSVNTKP--------GTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQ 213 +TL P +V +Q++ + +DP P+ + Sbjct: 187 TMNFGYTYSQLPNRQQALARNGRGQTTLDVIPDNVSLQNQ-LGASHAFHTFIDPYSPQAR 245 Query: 214 DWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRK--YGGAFASKADWRRNN- 270 +V EV+ R GV FD Y GS ++ Y A + R N Sbjct: 246 QDYNVMVNEVLKR-QPQGVLFDYIRYLRGMGSDSVADQVKDLWIYSEASQNVLLQRAKNE 304 Query: 271 -----TQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTR 325 ++ + K T + I G +P W+ + S + + + Sbjct: 305 AGKELIRKFVDKGYVTSQEIN-----GRTP--KWQRFFSPSINSRLTERGLETQIWELSV 357 Query: 326 RWVEQGLLDYI 336 QG+LD++ Sbjct: 358 AHAAQGILDFL 368 >UniRef50_Q4L9B3 Similar to unknown protein n=4 Tax=Bacilli RepID=Q4L9B3_STAHJ Length = 441 Score = 113 bits (282), Expect = 1e-23, Method: Composition-based stats. Identities = 58/428 (13%), Positives = 130/428 (30%), Gaps = 60/428 (14%) Query: 13 RRPAILVALALLLCSCKSTPPESMVTP--PAGSKPPATTQQSSQPMRGIWLATVSRLDWP 70 + A+L ALLL +C + S + +Q +++D+P Sbjct: 5 KIFAVLTTSALLLAACSNGDNSSSSGQKGDSQKNEQTNSQSEKLKKNNDKNKNENKVDYP 64 Query: 71 PVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMT 130 I ++ + + + + ++ +NT+ VK D + L T Sbjct: 65 KDGVKGIYVTSNSTEGDK--IDELIKFIKDSKLNTMVIDVKDDEGNI-------TMKLNT 115 Query: 131 GKIGENPG-YDPL--QFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPAS 187 G + D + + +L + H + A + + + P Sbjct: 116 GNKQVDKNTLDIVDGKKLLKKLHNNNIYPIARIVTF------------KDTKLAEEHPEW 163 Query: 188 VYVQHRDWIRTSG-DRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSR 246 + + + T+G ++P + EV D+ ++ +QFD + E Sbjct: 164 SFKESDGSVWTNGKGDSFVNPFMKEVWDYDITVAKAAAKA-GFQDIQFDYVRFPEG-FEN 221 Query: 247 LNDNETYRK--YGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSH 304 D+ TY K Y + S D R + + + + +K + GV G + Sbjct: 222 EADSLTYSKGDYKNSKLSSGDQRVDTITKFLEHANKELKPM--GVNVSADVFGYSALVKN 279 Query: 305 DPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWP---------FSRSAARYDVLA 355 P + + +D I+ IY + Y + Sbjct: 280 APGIGQSFPKMS--------------ENVDAISSMIYPSHWSNGDFGLDAPDTEPYKTVN 325 Query: 356 KWWADV---VKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISG 412 ++ + + I + + S + I+ + +Q+ ++ Sbjct: 326 RYIQKENSLLDSLGKKKPISRPWIQDFTASYLGNGNYIDYDAKAVSEQVQALKD-NGVNE 384 Query: 413 TILFREDY 420 +L+ Sbjct: 385 FLLWNAGN 392 >UniRef50_A8F7H1 Putative uncharacterized protein n=1 Tax=Thermotoga lettingae TMO RepID=A8F7H1_THELT Length = 370 Score = 112 bits (280), Expect = 3e-23, Method: Composition-based stats. Identities = 60/373 (16%), Positives = 112/373 (30%), Gaps = 69/373 (18%) Query: 87 QQQAMIDKLDHLQRLGINTVFFQVKPD-GTALWPSKILPWSDLMTGKIGENPGYDPLQFM 145 + + + L+ +G +F K GT WPSKI I + L Sbjct: 20 SEYGVEKAVKELKEMGFTDLFILAKGTTGTVYWPSKIA---------ISVSKNNAVLPKA 70 Query: 146 LDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVL 205 + K +++HAWF + K + ++ + Y V Sbjct: 71 SEICKKLNIRLHAWFIVSQDKSYLKLNPSSGMWGIPLEELSHEYRLGEHTCFRVSTSVV- 129 Query: 206 DPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDN--------------- 250 D +++ S++ E+VS Y +G+ D Y Sbjct: 130 DFTDQNYREYFFSLIKEIVSNYEPEGIHLDYIRYPNGAWGWGPSQIHRLRIFDLDGEKLL 189 Query: 251 ----ETYRKYG-------------GAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGV 293 +T+ + G + R ++ + S IK+++ + F Sbjct: 190 KKAIQTWGRNGDGRSFLDAFEHGDPDVIKWVELRVDDVKDFTQATSQLIKNMRDSIIFSA 249 Query: 294 S--PAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARY 351 + P G N S S G Y D L D + P Y + Sbjct: 250 ALIPEGGDPNPSERNFASIHCGQR-----YQDFAE-----LCDLMLPMAYH------QDF 293 Query: 352 DVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEIS 411 + W D+ K T ++ +G + +G + N E+ + + + Sbjct: 294 NRSNSWIEDITKATH-KIAMGKSRVVIGIQAH------NNIRTHEVVEAIKIAQN-SGAD 345 Query: 412 GTILFREDYLNKP 424 G +F + K Sbjct: 346 GVCIFAFHEVFKN 358 >UniRef50_B3DUS3 Trehalose synthase n=3 Tax=Bacteria RepID=B3DUS3_METI4 Length = 1121 Score = 111 bits (278), Expect = 4e-23, Method: Composition-based stats. Identities = 48/274 (17%), Positives = 86/274 (31%), Gaps = 56/274 (20%) Query: 76 NISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGE 135 + + + + KLD+++ LG+N ++ +PS + + Sbjct: 39 SFFDGNNDGIGDFVGITQKLDYIKALGVNAIWL------LPFYPSPLKDDGYDIADYCSI 92 Query: 136 NPGYDPLQFM---LDEAHKRGMKV------------HAWFNPYRVSVNTKPGTIRELNS- 179 +P Y LQ L EAHKRG++V H WF RVS + S Sbjct: 93 HPDYGDLQDFKTFLKEAHKRGLRVITELVINHTSDQHPWFQRARVSPPGSLYRNYYVWSD 152 Query: 180 ---------------TLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVV 224 S + W R + L+ PEV+ I I+ + Sbjct: 153 TPQKYKEARIIFKDFESSNWTWDPVAKAYFWHRFYSHQPDLNFDNPEVKKEIFKIIDFWL 212 Query: 225 SRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKS 284 VDG++ D Y + + G + + + T + ++ I Sbjct: 213 G-MGVDGLRLDAVPYL------------FEREGTSCENLPE-----THNFLKELRAYIDR 254 Query: 285 IKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYD 318 ++ A W + G+ A+ Sbjct: 255 HYENRML-LAEANQWPEDAVAYFGNGDECHMAFH 287 >UniRef50_B8I5J6 Putative uncharacterized protein n=2 Tax=Clostridium RepID=B8I5J6_CLOCE Length = 443 Score = 111 bits (276), Expect = 6e-23, Method: Composition-based stats. Identities = 51/338 (15%), Positives = 105/338 (31%), Gaps = 51/338 (15%) Query: 29 KSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQ 88 K+ P ++ + +T Q+ G+ L+ ++ V +V ++ P++ Sbjct: 62 KTKPSDTTSANSTAAASDSTKQEQIPQSTGLQLS--PGIEQVKVKAVYLTGPSA---GSA 116 Query: 89 QAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDE 148 + + + +NTV VK DG + + + NP + ++ + Sbjct: 117 ARIDKIISMAKNTELNTVVIDVKEDGAVNYTTNLDLVKKYGKQVKYYNP-----KDVIKK 171 Query: 149 AHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDW--IRTSGDRFVLD 206 H G+ V + + L++ A + V+ +G + Sbjct: 172 LHDNGIYVIGRIVVF-------------KDPVLAKNRADLGVKAPSGKLWLENGTTPWTN 218 Query: 207 PGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADW 266 P + EV D+ +I E +S Y D +QFD + N +G K Sbjct: 219 PYMEEVWDYNLAIAKEAIS-YGFDEIQFDYVRFPTGGKKSFN-------FGTNVPEK--- 267 Query: 267 RRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRR 326 +AK + GV + D R + Y Sbjct: 268 -AEAINGFLAKSQKELHQE-LGVPVSADVFAIIIESKLDGESIGQRFQEVGKDIYC---- 321 Query: 327 WVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKP 364 I+P IY + + +++ ++ Sbjct: 322 ---------ISPMIYPSHYANNSPKGIMSNGVGQMING 350 >UniRef50_D0MH73 Putative uncharacterized protein n=1 Tax=Rhodothermus marinus DSM 4252 RepID=D0MH73_RHOM4 Length = 322 Score = 110 bits (275), Expect = 1e-22, Method: Composition-based stats. Identities = 52/385 (13%), Positives = 98/385 (25%), Gaps = 113/385 (29%) Query: 93 DKLDHLQRLGINTVFFQ-VKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHK 151 L+ GI+ + + +G A + + ++ A Sbjct: 13 RTFARLKSSGIDALLLHENREEGPAFY------------------------ERLIPLAQA 48 Query: 152 RGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHR-----DWIRTSGDRFVLD 206 G+++HAW + L P V + L Sbjct: 49 EGIELHAWIPTMMRAE------------LLETHPDWYAVNREGVSTAEKPPYVDYYRFLS 96 Query: 207 PGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTE-------------------SPGSRL 247 P +P V+ ++ + + G+ D + + P Sbjct: 97 PCVPGVRSYLADYYDRMAQIEGLAGLHLDYIRFPDVILPITLQPKYGLVQDREYPPFDYG 156 Query: 248 NDNETYRKY-------------GGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVS 294 E ++ A + +R + ++ +++ + + + V Sbjct: 157 YHPECRAQFKAQTGIDPLELEDPSANEAWRQFRYDQITAVVRQIAERVHARGKPLTAAVF 216 Query: 295 PAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVL 354 P A D RW LD + P IY F + Sbjct: 217 PTPEI----------------ARTLVRQDWPRWP----LDAVMPMIYHNFYDKPVAWIET 256 Query: 355 AKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTI 414 A R LY G+ ++ EL + +D SG Sbjct: 257 ATREGVEALGGRIPLYSGL--------------FIPALTPEELAQAIDYA-LAGGASGVS 301 Query: 415 LFREDYLNKPQTQQAVSYLQSRWGS 439 LF + L Q L++R S Sbjct: 302 LFNVESLTDAHWQ----MLKTRLAS 322 >UniRef50_A8RX71 Putative uncharacterized protein n=4 Tax=Clostridiales RepID=A8RX71_9CLOT Length = 465 Score = 110 bits (275), Expect = 1e-22, Method: Composition-based stats. Identities = 61/409 (14%), Positives = 112/409 (27%), Gaps = 54/409 (13%) Query: 12 IRRPAILVALALLLCSCKSTPPESMVT-----PPAGSKPPATTQQSSQPMRGIWLATVSR 66 ++R L L + C P +T A Q +Q ++ Sbjct: 1 MKRWIAAGILILAMTGCSRYEPAKEMTRAQEESVQSEASGAQNGQETQEAEAADSQVITI 60 Query: 67 LDWPPVSSVNISNPTSRARVQQQA--MIDKLDHLQRLGINTVFFQVKPDGTALWPSKILP 124 +PP + V + A V M ++ + R +N V VK D + + P Sbjct: 61 STYPPRNPVKVKGIYVSAYVAGTGDMMDKIIEEIDRTELNAVVIDVKDDQGRITYAMDSP 120 Query: 125 WSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQ 184 + + + ++ + + G+ A +R Q+ Sbjct: 121 TVNEIGACQVFIQD---MPALMAKLKEHGIYTIARVVAFR------------DPYLAEQK 165 Query: 185 PASVY-VQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESP 243 P V R + ++P EV D++ + + D +QFD + Sbjct: 166 PEWSLHVADGKIYRDNKGLAWVNPYKKEVWDYLIEVGKK-AGEAGFDEIQFDYIRFAVDK 224 Query: 244 GSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRS 303 ++ + K + I + K G+ G Sbjct: 225 TM---NDVVFDDADTQGRDKTQ----AITEFIGYAHDEL--AKEGLFVSADVFGTIMRSE 275 Query: 304 HDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIY---WPFSRSAARYDVL--AKWW 358 D + EQ LDYI P IY + Y Sbjct: 276 EDAA-----------AVGQEYEDMAEQ--LDYICPMIYPSHYGPGNFGIEYPDTQPYDTI 322 Query: 359 ADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAV 407 + + +R L A K P + W+ + L+ + D Sbjct: 323 LNALNGSRELLA---ASAKEDAPQAVVRPWLQDFTASYLEHYIKYGDEQ 368 >UniRef50_A4BGI0 Alpha-galactosidase n=1 Tax=Reinekea blandensis MED297 RepID=A4BGI0_9GAMM Length = 708 Score = 110 bits (275), Expect = 1e-22, Method: Composition-based stats. Identities = 45/290 (15%), Positives = 92/290 (31%), Gaps = 40/290 (13%) Query: 17 ILVALALLLCSCKSTPPESMVTPP-----AGSKPPATTQQSSQPMRG-IWLATVSRLDWP 70 I +A L + P +S TP + + A +Q+ + +R + +V+ + P Sbjct: 240 IQLAEWLYPGEIELAPGDSYQTPTIVGSWSQNGLNALSQRFHRYLRQQVLDPSVAEVPRP 299 Query: 71 PVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMT 130 + + ++ +D +G+ G D Sbjct: 300 VHLNTW---EGIYFDHTPEHLLRMVDQAADMGVERFILDDGWFGAR--DDDHAGLGDWTV 354 Query: 131 GKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYV 190 G L +++D RGM+ WF P + + P V Sbjct: 355 NMQKHPGG---LHYLIDAVKARGMEFGLWFEP---------EMVNPDSDLYRAHPDWVLQ 402 Query: 191 QHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDN 250 ++VLD PEV +++ + + +++ Y + +++D PGS + Sbjct: 403 VEDYEQLLGRYQYVLDLSRPEVSEYLWNSIDAILTEYDIRYIKWDMNRDLVQPGSGGVAS 462 Query: 251 ETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWR 300 + Q + ++ I+ P VE +G R Sbjct: 463 -----------------VHRQTQALYQLMARIREAHPHVEIENCSSGGGR 495 >UniRef50_A4IKZ2 Alpha-amylase family protein n=12 Tax=Bacillaceae RepID=A4IKZ2_GEOTN Length = 511 Score = 109 bits (271), Expect = 3e-22, Method: Composition-based stats. Identities = 40/283 (14%), Positives = 88/283 (31%), Gaps = 60/283 (21%) Query: 59 IWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALW 118 I + + +D VN+++P + + KLD+++ +G ++ T ++ Sbjct: 39 IMVDRFNNMDSTNDQDVNVNDPKGYFGGDLKGVTAKLDYIKEMGFTAIWL------TPIF 92 Query: 119 PSKILPW-SDLMTGKIGENPGY---DPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKP--G 172 ++ + + +P + D L+ ++ EAHKR MKV F V + Sbjct: 93 KNRPGGYHGYWIEDFYEVDPHFGTLDDLKTLVKEAHKRDMKVILDFVANHVGYDHPWLHD 152 Query: 173 TIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGV 232 ++ ++ Q + L PEV++++ + +DG Sbjct: 153 PAKKDWFHPKKEIFDWNSQEQVENGWVYGLPDLAQENPEVKNYLIDAAKWWIKETDIDGY 212 Query: 233 QFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFG 292 + D + + + +K++K Sbjct: 213 RLDMVRHVPK------------------------------SFWQEFAKEVKAVKKDFFL- 241 Query: 293 VSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDY 335 VW + AD ++ G +DY Sbjct: 242 --LGEVWSDDPRYI---------------ADYGKYGIDGFVDY 267 >UniRef50_D1C2R1 Trehalose synthase n=15 Tax=Bacteria RepID=D1C2R1_SPHTD Length = 1121 Score = 109 bits (271), Expect = 3e-22, Method: Composition-based stats. Identities = 43/281 (15%), Positives = 85/281 (30%), Gaps = 56/281 (19%) Query: 69 WPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDL 128 + V + + + +++KLD++Q LG+ T++ +PS + Sbjct: 23 FYEVHVRAFYDSNADGIGDFRGLVEKLDYIQDLGVTTIWL------LPFYPSPLRDDGYD 76 Query: 129 MTGKIGENPGYDPLQF---MLDEAHKRGMKV------------HAWF------------- 160 + +P Y L+ + EAH+RG+ V H WF Sbjct: 77 IADYTNVHPDYGTLRDVRRFVREAHRRGLHVVTELVCNHTSDEHPWFQRARRAKPGSVWR 136 Query: 161 NPYRVSVNTKPGTIRELNSTLSQQPASVYVQ---HRDWIRTSGDRFVLDPGIPEVQDWIT 217 N Y S + ++ + W R + L+ P V+ I Sbjct: 137 NYYVWSDTPDKYRDARIIFKDFERSNWTWDPVAGAYYWHRFYSHQPDLNYDNPRVRREIF 196 Query: 218 SIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAK 277 + + +DG++ D Y Y + G + + T + + Sbjct: 197 R-ILDFWLDMGIDGLRLDAIPYL------------YEREGTNCENLPE-----THAFLKE 238 Query: 278 VSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYD 318 + I ++ A W + G A+ Sbjct: 239 LRAHIDERFSDRML-LAEANQWPEDAVQYFGDGDECHMAFH 278 >UniRef50_Q30YU6 Alpha amylase, catalytic subdomain n=10 Tax=Bacteria RepID=Q30YU6_DESDG Length = 1110 Score = 108 bits (270), Expect = 3e-22, Method: Composition-based stats. Identities = 49/274 (17%), Positives = 84/274 (30%), Gaps = 56/274 (20%) Query: 76 NISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGE 135 + + +I+KLD+LQ LG+ ++ +PS + + + Sbjct: 27 SFHDSDGDGMGDMAGLIEKLDYLQDLGVTALWL------LPFYPSPLRDDGYDIADYMSI 80 Query: 136 NPGYD---PLQFMLDEAHKRGMKV------------HAWF-------------NPYRVSV 167 NP Y + +L EAH RG++V HAWF + Y S Sbjct: 81 NPDYGSMADFRKLLREAHSRGLRVITELVLNHTSDQHAWFRRARRAPAGSEERDFYVWSD 140 Query: 168 NTK---PGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVV 224 + I + S + W R + L+ P V + ++ + Sbjct: 141 TSDRYKDARIIFKDFEPSNWSWDPVARAYYWHRFYHHQPDLNYENPAVHKAMFRVI-DFW 199 Query: 225 SRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKS 284 VDGV+ D Y Y + G + T + + I S Sbjct: 200 LDMGVDGVRLDAVPYL------------YEEEGTNCENLP-----RTHDFLKALRSHIDS 242 Query: 285 IKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYD 318 G ++ A W + G A+ Sbjct: 243 RFQGRML-LAEANQWPEDAARYFGDGDSCHMAFH 275 >UniRef50_B7VK54 Neopullulanase n=26 Tax=Bacteria RepID=B7VK54_VIBSL Length = 612 Score = 108 bits (269), Expect = 4e-22, Method: Composition-based stats. Identities = 44/285 (15%), Positives = 76/285 (26%), Gaps = 58/285 (20%) Query: 70 PPVSSVNISNPTSRAR--VQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSD 127 P + P S +IDKLD+LQ LG+N ++ A + + + Sbjct: 191 PANVQPWGTRPVSDNFMGGDLWGVIDKLDYLQDLGVNGLYLCPIFTANANHKYDTVDYYN 250 Query: 128 LMTGKIGENPGYDPLQFMLDEAHKRGMKVH--AWFNPYRVSVNTKPGTIRE-------LN 178 + G + ++DEAHKRGMK+ A FN + Sbjct: 251 VDPHFGGNEA----FKALVDEAHKRGMKIMLDAVFNHIGSQSPLWLDVVNNGAKSKYADW 306 Query: 179 STLSQQPASVYVQHRDWIRTSGDR---------FVLDPGIPEVQDWITSIVAEVVSRYPV 229 ++Q P DW + + L+ E + ++ + V + + Sbjct: 307 FWINQFPVYPDTPKEDWDFWNLNYETFANVVEMPKLNTENEECRAYLLDVARHWVEEFNI 366 Query: 230 DGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGV 289 DG + D + +K P Sbjct: 367 DGWRLDVANEVDH------------------------------AFWRDFRKVVKDANPDC 396 Query: 290 EFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLD 334 +W G Y + G +D Sbjct: 397 YI---LGEIWHEGMPWLRGDQYDSLMNYP-LTQAITDYFGLGDVD 437 >UniRef50_P29964 Cyclomaltodextrinase n=13 Tax=Thermoanaerobacterales RepID=CDAS_THEP3 Length = 574 Score = 108 bits (269), Expect = 5e-22, Method: Composition-based stats. Identities = 40/265 (15%), Positives = 73/265 (27%), Gaps = 50/265 (18%) Query: 66 RLDWPPVSSVNISNPTSRARV--QQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKIL 123 + + P PT+ + Q +IDK+D+L+ LGIN ++ + Sbjct: 147 KSNDPENVKPWGEKPTADSFFGGDLQGIIDKIDYLKDLGINAIYLTPIFLSHSTHKYDTT 206 Query: 124 PWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVH--AWFN-----PYRVSVNTKPGTIRE 176 + + + ++ + H G+KV A FN + K G + Sbjct: 207 DYYTIDPHFGDTQKA----RELVQKCHDNGIKVIFDAVFNHCGYDFFAFQDVIKNGKKSK 262 Query: 177 LNSTLSQQPASVYVQHRDWIRTSGDRFVLDP----GIPEVQDWITSIVAEVVSRYPVDGV 232 + + + D P PEVQ ++ + + +DG Sbjct: 263 YWDWFNIYEWPIKTHPKPSYEAFADTVWRMPKLMTKNPEVQKYLLEVAEYWIKEVDIDGW 322 Query: 233 QFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFG 292 + D + K +K+ KP Sbjct: 323 RLDVANEIDHH------------------------------FWRKFREVVKAAKPEAII- 351 Query: 293 VSPAGVWRNRSHDPLGSDTRGAAAY 317 VW + S G Y Sbjct: 352 --VGEVWHDASPWLRGDQFDSVMNY 374 >UniRef50_Q1J674 Neopullulanase / Cyclomaltodextrinase / Maltogenic alpha-amylase n=9 Tax=Firmicutes RepID=Q1J674_STRPF Length = 571 Score = 108 bits (269), Expect = 5e-22, Method: Composition-based stats. Identities = 40/272 (14%), Positives = 78/272 (28%), Gaps = 53/272 (19%) Query: 57 RGIWL--ATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDG 114 RG W + + W + P S A + + +KLD+L+ LGI ++ Sbjct: 150 RGDWEKGDSYVNMGWLEKPT-----PKSFAGGDLKGITEKLDYLKDLGITVIYLTPIFQS 204 Query: 115 TALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKV-------HAWFNPYRVSV 167 + I + + + YD LQ ++D AH+ G+K+ HA + Sbjct: 205 ISNHKYDISDYYAIDPQFGTK---YD-LQELIDLAHQMGIKIILDAVFNHASSDAVEFQD 260 Query: 168 NTKPGTIRELNSTLSQQPASVYVQ--HRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVS 225 + G + + + + + +VQD++ I + Sbjct: 261 VLRYGKESKFFDWFMTHDEHPSMDLVNYETFAGCNYMPKWNTSNRDVQDYLIEIGRYWIK 320 Query: 226 RYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSI 285 + +DG + D + + +K+ Sbjct: 321 EFCIDGWRLDVS------------------------------DEVSHDFWRRFRQAVKAE 350 Query: 286 KPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAY 317 K W + G G Y Sbjct: 351 KADAIL---IGENWHDAYPYLAGDQYDGIMNY 379 >UniRef50_Q08341 Cyclomaltodextrinase n=101 Tax=Bacteria RepID=CDAS_BACSH Length = 591 Score = 108 bits (269), Expect = 5e-22, Method: Composition-based stats. Identities = 38/253 (15%), Positives = 69/253 (27%), Gaps = 48/253 (18%) Query: 76 NISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGE 135 + + Q +ID LD+L LG+N ++F T + + Sbjct: 161 GTPSAGNFFGGDLQGVIDHLDYLSDLGVNALYFNPLFAATTNHKYDTADYMKIDPQFGTN 220 Query: 136 NPGYDPLQFMLDEAHKRGMKVHAW---------FNPYRVSVNTKPGTIRELNSTLSQQPA 186 L+ ++D H RGM+V F P+ +N + + + P Sbjct: 221 EK----LKELVDACHARGMRVLLDAVFNHCGHTFPPFVDVLNNGLNSRYADWFHVREWPL 276 Query: 187 SVYVQHR--DWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPG 244 V D L+ G EV+ ++ ++ + +DG + D + Sbjct: 277 RVVDGIPTYDTFAFEPIMPKLNTGNEEVKAYLLNVGRYWLEEMGLDGWRLDVANEVDH-- 334 Query: 245 SRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSH 304 Q + IK I P + + Sbjct: 335 ----------------------------QFWREFRSEIKRINPSAYI---LGEIMHDSMP 363 Query: 305 DPLGSDTRGAAAY 317 G Y Sbjct: 364 WLQGDQFDAVMNY 376 >UniRef50_B7J5F9 GTP-binding protein n=2 Tax=Acidithiobacillus ferrooxidans RepID=B7J5F9_ACIF2 Length = 455 Score = 107 bits (267), Expect = 7e-22, Method: Composition-based stats. Identities = 47/339 (13%), Positives = 101/339 (29%), Gaps = 52/339 (15%) Query: 92 IDKLDHLQRLGINTVFFQVKPD-GTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAH 150 LD + + +N V VK D G + + + +++ K+ ++ ++D+ H Sbjct: 117 ESALDIIGKTDLNAVVIDVKSDRGMIAYKTDVPLATEIGAQKMITIKH---IKRLMDDLH 173 Query: 151 KRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRF-VLDPGI 209 + G+ A + + N +P I + DP Sbjct: 174 QEGIYTIARIVVF------------KDNVLALARPDLAVRTAGGAIWKDREGLAWTDPFS 221 Query: 210 PEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRN 269 +V D+ + D +QFD + ++ G + + T + R + Sbjct: 222 KQVWDYNIDVAVAAAKD-GFDEIQFDYVRFPDAKGLVFSRSTT-----------EESRVS 269 Query: 270 NTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVE 329 +A+ + V G +D + + Sbjct: 270 AISGFLAEARKRLIPYN--VFLSADIFGYVIWNRND------------TGIGQNLEEMAQ 315 Query: 330 QGLLDYIAPQIY---WPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEP 386 Q +DYI+P +Y + + VL + + G+ + + Sbjct: 316 Q--VDYISPMLYPSGFQYGIPGYPNPVLHPHQIVYLSLRKAEERTGLPPVRFRPWLQAFR 373 Query: 387 DWMINGGV---PELKKQLDLNDAVPEISGTILFREDYLN 422 D+ G E+ Q+D G +L+ + Sbjct: 374 DYAFGGKPFGGEEIAAQIDAAQTF-GSDGWMLWNPRNVY 411 >UniRef50_A5FIA1 Hypothetical lipoprotein n=2 Tax=Flavobacteriaceae RepID=A5FIA1_FLAJ1 Length = 360 Score = 107 bits (267), Expect = 8e-22, Method: Composition-based stats. Identities = 54/354 (15%), Positives = 94/354 (26%), Gaps = 63/354 (17%) Query: 108 FQVKPDGTAL-----WPSKILPWSDLMTGKIGENPGYDP--LQFMLDEAHKRGMKVHAWF 160 F V A + + + D ++ N DP L+ ++ A K G+KVHAW Sbjct: 30 FGVWTTADAKKSDADYTKEFKKYKDGGIDEVLINTTTDPQLLKRLVPLATKEGLKVHAWI 89 Query: 161 NPYRVSVNTKPGTIRELNSTLSQQPASVYVQHR-----DWIRTSGDRFVLDPGIPEVQDW 215 R +S Q P V D L P E ++ Sbjct: 90 ----------MAMNRPGDSVALQHPEWYQVSKEGKSCFDNRPYVDYYQWLCPTRKESREH 139 Query: 216 ITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLI 275 + +V E+ ++ V D + + + Y + D+ + + Sbjct: 140 VLHLVEELAKVEGIESVHLDYIRFPDIFLPISLLPK-YNLVQDVELPQFDFC--YCDECV 196 Query: 276 AKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLG---------------SDTRGAAAY--- 317 I P S W+N + + T Y Sbjct: 197 K-AFEKIHHKNPKESHNTSIDMEWKNFRLNAIRGVVDDAYKIAHKHNKQLTAAVFPYPEM 255 Query: 318 --DESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAF 375 +W +D + P IY F Y+ W K G+ Sbjct: 256 ADHMVRQRWDKW----NIDEVYPMIYHSF------YNEEIDWVGYATKQ-------GVED 298 Query: 376 YKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQA 429 + + ++ KQ L G F + L++ + Sbjct: 299 LEDKKTKINTGIYIPGLKNDAELKQAILEAKENGAVGVSFFDGNALSESNLKTI 352 >UniRef50_A1TNR8 Trehalose synthase n=7 Tax=Betaproteobacteria RepID=A1TNR8_ACIAC Length = 1142 Score = 107 bits (267), Expect = 9e-22, Method: Composition-based stats. Identities = 53/325 (16%), Positives = 98/325 (30%), Gaps = 68/325 (20%) Query: 31 TPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVN------ISNPTSRA 84 P P P ++ + + T W + + + + Sbjct: 10 LPAADAAAGPLPEPGPVVMPETPE------IDTQGDPQWYRDAVIYQLNVKAFFDSNNDG 63 Query: 85 RVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPL-- 142 + + KLD+++ LG+NT++ +PS + ++ +P Y L Sbjct: 64 YGDFKGVTAKLDYVKDLGVNTIWLMP------FYPSPLRDDGYDISDYENVHPQYGTLAD 117 Query: 143 -QFMLDEAHKRGMKV------------HAWFNPYRVSVNTKPGTIRELNS---------- 179 + MLD AH RG++V H WF R + P + S Sbjct: 118 FKEMLDAAHARGLRVITELVINHTSSEHPWFQRARRAPPGSPERDFYVWSDTDQIYRGTR 177 Query: 180 ------TLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQ 233 S + W R + L+ P V + + + + VDG + Sbjct: 178 IIFTDTETSNWAWDPVAKQYYWHRFFSHQPDLNFDNPLVLEAVFKTMRFWL-DMGVDGFR 236 Query: 234 FDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGV 293 D Y + G + + + T +I K+ I + F + Sbjct: 237 LDAIPYLV------------ERDGTSNENLPE-----THAVIKKLRAAIDAEYRN-RFLL 278 Query: 294 SPAGVWRNRSHDPLGSDTRGAAAYD 318 + A +W + G AY Sbjct: 279 AEANMWPEDVREYFGDGDECHMAYH 303 >UniRef50_C6XB31 Trehalose synthase n=19 Tax=Bacteria RepID=C6XB31_METSD Length = 1199 Score = 107 bits (266), Expect = 9e-22, Method: Composition-based stats. Identities = 49/274 (17%), Positives = 92/274 (33%), Gaps = 56/274 (20%) Query: 76 NISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGE 135 + + + +I KLD++ LG+NT++ +PS ++ G Sbjct: 45 SFFDGNNDGIGDFAGLIQKLDYIVSLGVNTLWL------LPFYPSPRRDDGYDISDYRGV 98 Query: 136 NPGYDPL---QFMLDEAHKRGMKV------------HAWF-------------NPYRVSV 167 +P Y L + + EAHK G++V H WF N Y S Sbjct: 99 HPDYGNLFDVKRFIAEAHKHGLRVITELIINHTSDQHPWFQRARQAKPGSAARNFYVWSD 158 Query: 168 NTKPGTIRELNSTLSQQPASVYVQ---HRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVV 224 + + T +++ + W R + L+ P V + + S + E Sbjct: 159 TDTAYSQTRIIFTDTEKSNWTWDPVANAFYWHRFFSHQPDLNYDNPRVFNTVMS-IMEFW 217 Query: 225 SRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKS 284 VDG++ D Y + G + + + T Q++ K+ I Sbjct: 218 LDLGVDGLRLDAVPYL------------IEREGTSNENLPE-----THQILKKIRQFIDQ 260 Query: 285 IKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYD 318 P ++ A +W G++ A+ Sbjct: 261 RYPDRML-LAEANMWPEDVQYYFGNNDECHMAFH 293 >UniRef50_A3XP30 Trehalose synthase n=1 Tax=Leeuwenhoekiella blandensis MED217 RepID=A3XP30_9FLAO Length = 1104 Score = 107 bits (266), Expect = 1e-21, Method: Composition-based stats. Identities = 48/288 (16%), Positives = 92/288 (31%), Gaps = 62/288 (21%) Query: 68 DWPPVSSVN------ISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSK 121 +W + + + S + ++ KLD+L+ LG+ ++ +PS Sbjct: 12 NWYKDAIIYELHIKAFFDSNSDGIGDFEGLLQKLDYLEDLGVTAIWL------LPFYPSP 65 Query: 122 ILPWSDLMTGKIGENPGYD---PLQFMLDEAHKRGMKV------------HAWF------ 160 + + NP Y + ++EAHKRG+KV H WF Sbjct: 66 LRDDGYDIADYYSINPSYGEVEDFKRFIEEAHKRGLKVITELVINHTSDQHEWFQRARKA 125 Query: 161 -------NPYRVSVNTKPGTIRELNSTLSQQPASVYVQH---RDWIRTSGDRFVLDPGIP 210 + Y + N + + T ++ + W R + L+ P Sbjct: 126 PADSKYRDYYVWTDNVEKYKDARIIFTDTEPSNWTWDAEAEAYYWHRFFSHQPDLNFDNP 185 Query: 211 EVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNN 270 +VQ + +I+ VDG + D Y + + G + + Sbjct: 186 DVQQEVFNIMDYWC-DLGVDGFRLDAVPYL------------FERDGTNCENLPE----- 227 Query: 271 TQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYD 318 T + K+ I + + ++ A +W S G Y Sbjct: 228 THVFLKKLRAHIDANHDN-KLLLAEANMWPEDSAAYFGDGDECHMNYH 274 >UniRef50_C6VVW7 Trehalose synthase n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6VVW7_DYAFD Length = 1113 Score = 106 bits (265), Expect = 1e-21, Method: Composition-based stats. Identities = 49/290 (16%), Positives = 88/290 (30%), Gaps = 62/290 (21%) Query: 66 RLDWPPVSSVN------ISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWP 119 L W + + + + ++++LD+LQ LG+ ++ +P Sbjct: 11 NLHWYKDAIIYELHIKAFKDGNCDGIGDFKGLLEQLDYLQDLGVTAIWL------LPFYP 64 Query: 120 SKILPWSDLMTGKIGENPGYDP---LQFMLDEAHKRGMKV------------HAWF---- 160 S + + NP Y + L EAHKRG+KV H WF Sbjct: 65 SPLRDDGYDIADYYTINPSYGDIQEFKTFLREAHKRGLKVITELVINHTSDQHPWFQRAR 124 Query: 161 ---------NPYRVSVNTKPGTIRELNSTLSQQPASVYVQH---RDWIRTSGDRFVLDPG 208 N Y + + + ++ + W R + L+ Sbjct: 125 RAPKGSAYRNFYVWTDDPHQFKDARIIFQDYEKSNWTWDNEAEQYYWHRFFHHQPDLNYD 184 Query: 209 IPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRR 268 +VQ+ I I+ VDG + D Y + + G + + Sbjct: 185 SMDVQEEIFKIINFWCK-MGVDGFRLDAVPYL------------FERDGTNCENLPE--- 228 Query: 269 NNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYD 318 T + K+ + PG ++ A +W S G Y Sbjct: 229 --THAFLKKLRKYVDDRYPGTLL-LAEANMWPEDSAAYFGDGDECQMNYH 275 >UniRef50_C4ZHD6 Neopullulanase n=1 Tax=Eubacterium rectale ATCC 33656 RepID=C4ZHD6_EUBR3 Length = 588 Score = 106 bits (265), Expect = 2e-21, Method: Composition-based stats. Identities = 32/248 (12%), Positives = 77/248 (31%), Gaps = 49/248 (19%) Query: 85 RVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQF 144 + + + LD++++LGIN ++ + + + + ++ + L+ Sbjct: 182 GGTLKGICENLDYIEKLGINCIYLNPIFEAASYHKYDTIDYFEIDPCLGNKA----DLKE 237 Query: 145 MLDEAHKRGMKVHA--WFN-----PYRVSVNTKPGTIRELNSTLSQQPASVYVQHR---D 194 ++ + HKRG++V FN + + G + P ++ + Sbjct: 238 LVQQCHKRGIRVILDGVFNHCGADFFAFRDVRQKGKASRYYNWFYHLPETIQYADPPDYE 297 Query: 195 WIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYR 254 + L+ G PEV +++ ++ + +DG + D Sbjct: 298 AFAYVKEMPKLNTGNPEVVEYLCNVGTYWIREADIDGWRLDV------------------ 339 Query: 255 KYGGAFASKADWRRNNTQ-QLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRG 313 N + H ++++K + +W G Sbjct: 340 -------------ANEINHEFWRAFRHAVRAVKEDIFL---IGEIWEEAGIWLQGDQFDS 383 Query: 314 AAAYDESY 321 Y SY Sbjct: 384 TMNYTFSY 391 >UniRef50_Q9R9H8 Intracellular maltogenic amylase n=70 Tax=Bacteria RepID=BBMA2_BACSU Length = 588 Score = 106 bits (263), Expect = 2e-21, Method: Composition-based stats. Identities = 40/258 (15%), Positives = 73/258 (28%), Gaps = 47/258 (18%) Query: 70 PPVSSVNISNPTSRARV---QQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWS 126 P + S Q ++DKLD+L+ LG+N ++ + L + Sbjct: 156 PKNALPWGSKDPGVNDFFGGDLQGIVDKLDYLEDLGVNGIYLTPIFSAPSNHKYDTLDYF 215 Query: 127 DLMTGKIGENPGYDPLQFMLDEAHKRGMKVH--AWFNPYR-----VSVNTKPGTIRELNS 179 + + ++ + H+RGM++ A FN K G Sbjct: 216 SIDPHFGDPE----IFRTLVSQLHQRGMRIMLDAVFNHIGSASPQWQDVVKNGDQSRYKD 271 Query: 180 TLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFY 239 V + D + D L+ PEVQ ++ I + + +DG + D Sbjct: 272 WFHIHSFPVTDDNYDRFAFTADMPKLNTANPEVQKYLLDIALYWIREFDIDGWRLDVANE 331 Query: 240 TESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVW 299 + + + KP V +W Sbjct: 332 VDHV------------------------------FWKTFRQAVSTEKPDVYI---LGEIW 358 Query: 300 RNRSHDPLGSDTRGAAAY 317 + G + A Y Sbjct: 359 HSAEPWLRGDEFHAAMNY 376 >UniRef50_Q2RYS7 Tat (Twin-arginine translocation) pathway signal sequence domain protein n=1 Tax=Salinibacter ruber DSM 13855 RepID=Q2RYS7_SALRD Length = 389 Score = 105 bits (262), Expect = 3e-21, Method: Composition-based stats. Identities = 67/395 (16%), Positives = 118/395 (29%), Gaps = 56/395 (14%) Query: 43 SKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLG 102 S PPA +S S P + N T V + + + L+ Sbjct: 24 STPPARAPSAS--------TNASDPSPPDSAPTNWVWMTPELDVSGEEWRRRFERLRAHN 75 Query: 103 INTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNP 162 I+ + QV + A + S LP L+ +L A + G++VH W Sbjct: 76 IDAILPQVYTNSAAYYGSDFLPVEGEW------------LETILPPAKEVGLEVHGWMVS 123 Query: 163 YRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAE 222 ++ +E + V + ++ P P VQD+I V E Sbjct: 124 MPCTIPKIVNQHKEWFVV--NRNGESAVDNPAYVDYY---KFTCPNRPGVQDFIERRVEE 178 Query: 223 VVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWR-RNNTQQLIAKVSHT 281 + S +DG+ FD + + + + Y + D+ + + + Sbjct: 179 ITSIDGLDGIHFDYIRFPDVVIAEALQPK-YGIVQHEEQAPYDYCYCDVCRSKFERDHGA 237 Query: 282 --IKSIKP-----GVEFGVSPAGVWRNRSHDPLGSDTRGAA------AYDESYADTRRWV 328 P F N P+ + A ++ W Sbjct: 238 DPYDLEDPTTSTAWRLFRYESITNLVNDRLIPIARENGKAVSAATFPNWEAVRQRWHHW- 296 Query: 329 EQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDW 388 LDY+ P +Y F + A W D + RL G +++ Sbjct: 297 ---DLDYVHPMLYHNFYHAGA------NWVRDETRAGIERL------RGQGRSTRLYSGL 341 Query: 389 MINGGVPELKKQLDLNDAVPEISGTILFREDYLNK 423 + P ++L + SG LF +N Sbjct: 342 NVGAVAPGGLERLIEKAHEGDASGITLFAAGSMND 376 >UniRef50_D2L779 Trehalose synthase n=1 Tax=Desulfovibrio sp. FW1012B RepID=D2L779_9DELT Length = 1107 Score = 105 bits (262), Expect = 3e-21, Method: Composition-based stats. Identities = 51/287 (17%), Positives = 88/287 (30%), Gaps = 62/287 (21%) Query: 69 WPPVSSVN------ISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKI 122 W + + + + +LD+L LG++T++ +PS + Sbjct: 14 WYKDAVIYEVHVKAFMDGNGDGIGDFAGLTSRLDYLANLGVDTLWL------LPFYPSPL 67 Query: 123 LPWSDLMTGKIGENPGYDPL---QFMLDEAHKRGMKV------------HAWFNPYRVSV 167 + G +P Y L + L EAH RG+KV H WF RV+ Sbjct: 68 RDDGYDIADYYGIHPHYGTLRDFKDFLREAHDRGLKVITELVVNHTSDQHPWFKRSRVAP 127 Query: 168 NTKPGTIRELNSTLSQ-------------QPASVYVQH---RDWIRTSGDRFVLDPGIPE 211 P + S + Q W R + L+ P Sbjct: 128 PGDPWRDYYVWSKTPDKYRQARIIFKDFEHSNWTFDQEAGAYYWHRFYSHQPDLNYDNPR 187 Query: 212 VQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNT 271 V+ IT ++ +S VDG++ D Y Y + G + + T Sbjct: 188 VRKDITKVMDFWLSL-GVDGLRLDAVPYL------------YERRGTNCENLPE-----T 229 Query: 272 QQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYD 318 + K+ + + ++ A W + G AY Sbjct: 230 HGFLKKLRAHVDTRFKNRML-LAEANQWPEDAVAYFGDGDECHMAYH 275 >UniRef50_C6IYE3 Alpha amylase catalytic region n=2 Tax=Bacillales RepID=C6IYE3_9BACL Length = 564 Score = 105 bits (262), Expect = 3e-21, Method: Composition-based stats. Identities = 50/279 (17%), Positives = 81/279 (29%), Gaps = 45/279 (16%) Query: 26 CSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLD------WPPVSSVNISN 79 C TP T S P G A + +D + + + ++ Sbjct: 22 AGCTDTPGSKSTASEPKGNSNDTAATSPNPPEGGKAAAPNAVDSQPSTVFYEIFVRSFAD 81 Query: 80 PTSRARVQQQAMIDKLDHLQR--------LGINTVFFQVKPDGTALWPSKILPWSDLMTG 131 + +I KLD+L LGI ++ + + + D+ Sbjct: 82 SDGDGIGDFKGLISKLDYLNDGNPDTDTDLGIGGIWLMPINPSPSYHGYDVTNYRDINPD 141 Query: 132 KIGENPGYDPLQFMLDEAHKRGMKV------------HAWFN--------PYRVSVNTKP 171 D + L+EAHKRG+KV H WF PYR Sbjct: 142 YGTM----DDFRTFLNEAHKRGIKVIMDLVVNHTSKEHPWFTQSAADPNSPYRDWYVWAE 197 Query: 172 GTIRELNSTLSQ---QPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYP 228 R ++ T + P L+ PEV+ + I + + Sbjct: 198 DQGRAVSGTSAAGSGNPWHSLNGGHYLGIFWDGMPDLNLDNPEVRKEMIEIGQFWLEQ-G 256 Query: 229 VDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWR 267 VDG + D + ET A ++R Sbjct: 257 VDGFRLDAAKHIYEDLLTDKSQETTN---KNVAWWQEFR 292 >UniRef50_C7RFM8 Glycoside hydrolase clan GH-D n=34 Tax=Bacteria RepID=C7RFM8_ANAPD Length = 722 Score = 105 bits (261), Expect = 4e-21, Method: Composition-based stats. Identities = 54/346 (15%), Positives = 110/346 (31%), Gaps = 39/346 (11%) Query: 66 RLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPW 125 + + + + ++ + D +LGI G ++ L Sbjct: 317 NFAYEKRPILINNWEATYFDFDKEKLSSLTDEASKLGIELFVLDDGWFGNRFDDNRALGD 376 Query: 126 SDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQP 185 + KIG L ++ + HK+GMK W P +SV++ + P Sbjct: 377 WQVNEEKIGCK-----LSELISDVHKKGMKFGLWVEPEMISVDSD---------LYRKHP 422 Query: 186 ASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGS 245 + S ++ VL+ PEV ++ I+ +++S + +D +++D + G+ Sbjct: 423 DWAIQAPKRGHSYSRNQLVLNLANPEVVAYLKEILDDLLSNHDIDYIKWDYNRNITNIGN 482 Query: 246 RLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHD 305 + ET + D L+ ++ V F G RN Sbjct: 483 GKDYLETMEQSHKYMLGFYD--------LVKYLTE----KHSDVLFESCSGGGGRNDLGV 530 Query: 306 -PLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKP 364 + D ++ ++ P I A V P Sbjct: 531 MRYFPQVWASDNTDAISRLPIQYGST----FLYPTI-----SMGAHVSASPNHQMKRVTP 581 Query: 365 TRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEI 410 +TR G +++ + E+K+Q++L + I Sbjct: 582 LKTR---GHVAMMGNFGYELDLSKLSEEEKNEIKEQVNLYKEIRPI 624 >UniRef50_UPI0001973ACA hypothetical protein ClM62_04023 n=1 Tax=Clostridium sp. M62/1 RepID=UPI0001973ACA Length = 436 Score = 105 bits (261), Expect = 4e-21, Method: Composition-based stats. Identities = 54/389 (13%), Positives = 108/389 (27%), Gaps = 55/389 (14%) Query: 62 ATVSRLDWPPVSS-VNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPS 120 D P V ++ A ++ M ++ + R +N V K D + Sbjct: 88 DDAWSTDSPREPVKVKGIYISAYAAGSRERMARIIEEIDRTELNAVVIDFKDDNGNI--- 144 Query: 121 KILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNST 180 S ++ P L +L E + G+ A R + Sbjct: 145 TAEVESPMLQELGVCRPYISDLPGLLSELREHGIYTIARVVSLRDPKMGE---------- 194 Query: 181 LSQQPAS-VYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFY 239 +P + + +R S LDP EV++++ + + D VQFD + Sbjct: 195 --TRPQWCFQSESGEVLRDSDKMAWLDPYNEEVREYLAEVGRQAALA-GFDEVQFDYIRF 251 Query: 240 TESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVW 299 + + + + + + +L + + S GV G Sbjct: 252 STNNSLQKAAAKAAGESSK---------TDIITELAGYIYDELSSE--GVYVSADVFGAI 300 Query: 300 RNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIY---WPFSRSAARYDVLAK 356 + S D D LDYI P IY + + L Sbjct: 301 ISSSGDA-----------QAVGQDYAALASS--LDYICPMIYPSHYGDGNFGLDHPDLHP 347 Query: 357 WWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAV--------- 407 + + ++ + A + + + + W+ + LK + Sbjct: 348 YETIMGALADSKTALDTAGAETTQKTAVVRPWLQDFTASWLKNHIAYGGEEIRSQIQAVY 407 Query: 408 -PEISGTILFREDYLNKPQTQQAVSYLQS 435 IL+ ++ Q+ Sbjct: 408 DAGYEEWILWSASNKYHYGALESEEEQQN 436 >UniRef50_B7K9W5 Trehalose synthase n=11 Tax=Bacteria RepID=B7K9W5_CYAP7 Length = 1130 Score = 105 bits (261), Expect = 4e-21, Method: Composition-based stats. Identities = 48/286 (16%), Positives = 87/286 (30%), Gaps = 56/286 (19%) Query: 64 VSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKIL 123 V ++ + + KLD+LQ LG++ V+ +PS + Sbjct: 34 FKNAIIYEVPVRAFADSNGDGIGDFKGLTQKLDYLQDLGVSAVWV------LPFFPSPLK 87 Query: 124 PWSDLMTGKIGENPGYDPL---QFMLDEAHKRGMKV------------HAWF-------- 160 ++ NP Y L + L+ AH+RG++V H WF Sbjct: 88 DDGYDISDYNNVNPIYGTLEDFKEFLNAAHQRGIRVIIELIVNHTSDTHPWFQRARRAPK 147 Query: 161 -----NPYRVSVNTKP---GTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEV 212 + Y S I + S W R + L+ P V Sbjct: 148 GSVERDFYVWSDTPDKYREARIIFQDFESSNWSWDSVANAYYWHRFYSHQPDLNYDNPAV 207 Query: 213 QDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQ 272 + + ++ +S VDG++ D Y Y + G + + T Sbjct: 208 RQAVFEVLDFWLS-MGVDGLRLDAVPYL------------YERDGTNCENLPE-----TH 249 Query: 273 QLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYD 318 Q++ + I P ++ A W + G+ + Sbjct: 250 QILKDLRQYIDQKYPNRML-LAEANQWPEDAAAYYGNGDECHMNFH 294 >UniRef50_A4VLG7 Alpha-amylase family protein n=2 Tax=Bacteria RepID=A4VLG7_PSEU5 Length = 1087 Score = 104 bits (260), Expect = 5e-21, Method: Composition-based stats. Identities = 40/244 (16%), Positives = 80/244 (32%), Gaps = 55/244 (22%) Query: 76 NISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGE 135 + + + +IDKLD++ LG+NT++ +PS + G Sbjct: 8 SFYDSNNDGVGDFVGLIDKLDYIADLGVNTIWL------LPFYPSPRRDDGYDIADYRGV 61 Query: 136 NPGYDPLQF---MLDEAHKRGMKV------------HAWF-------------NPYRVSV 167 +P Y + + EAHKRG++V H WF N Y S Sbjct: 62 HPEYGNMADARRFIAEAHKRGLRVITELVINHTSDQHPWFQRARKAKKGSAARNFYVWSD 121 Query: 168 NTKPGTIRELNSTLSQQPASVYV---QHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVV 224 + +++ + + W R + L+ P+V + +++ + Sbjct: 122 TDDKYQGTRIIFLDTEKSNWTWDPVAKQYFWHRFYSHQPDLNFDNPQVMKAVLAVMRFWL 181 Query: 225 SRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKS 284 +DG++ D Y + G + + T ++ + I + Sbjct: 182 -DMGIDGLRLDAIPYLV------------ERDGTNNENLPE-----THAVLKAIRAEIDA 223 Query: 285 IKPG 288 P Sbjct: 224 NYPD 227 >UniRef50_C6IYD0 Neopullulanase n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6IYD0_9BACL Length = 576 Score = 104 bits (260), Expect = 5e-21, Method: Composition-based stats. Identities = 38/262 (14%), Positives = 74/262 (28%), Gaps = 48/262 (18%) Query: 68 DWPPVSSVNISNPTSRAR--VQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPW 125 + P S PT Q +I +LD+L LGIN ++ T+ + + Sbjct: 147 NDPESVEPWGSVPTRDNYMGGDLQGIIKQLDYLSGLGINALYLNPIFAATSNHKYDTVDY 206 Query: 126 SDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKP---------GTIRE 176 + LQ ++ + H+R +KV ++ + Sbjct: 207 FQIDPQFGTVQ----DLQELVQKCHERDIKVILDLVINHCGFYHPYFQDVVKKGEASVYK 262 Query: 177 LNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDD 236 +++ P D + L PEVQ ++ IV+ +DG + D Sbjct: 263 DWFYINRYPVHRSEDGYDSVGYYQWMPKLRTSNPEVQQYVYDIVSFWQKETGIDGWRID- 321 Query: 237 YFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPA 296 + D + ++ +K + P A Sbjct: 322 ----------VADEVEI-------------------SFLRELKSCVKKLNPNAII---IA 349 Query: 297 GVWRNRSHDPLGSDTRGAAAYD 318 +W + Y+ Sbjct: 350 EIWDDAKRLMASGGVDSVMNYE 371 >UniRef50_C6CVL0 Alpha amylase catalytic region n=1 Tax=Paenibacillus sp. JDR-2 RepID=C6CVL0_PAESJ Length = 582 Score = 104 bits (258), Expect = 9e-21, Method: Composition-based stats. Identities = 39/263 (14%), Positives = 78/263 (29%), Gaps = 50/263 (19%) Query: 68 DWPPVSSVNISNPTSRARV--QQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPW 125 + P +S P + Q +I+++ HL LG+N V+ + + Sbjct: 148 NDPEGTSPWGEQPEGESFFGGDLQGIINRIGHLNELGVNAVYLTPVFRSPSNHKYDTTDY 207 Query: 126 SDLMTGKIGENPGYDPLQFMLDEAHKRGMKV-------HAW--FNPYRVSVNTKPGTIRE 176 ++ + D L+ +++ HK G++V HA F P++ + + + Sbjct: 208 REVDPHFGDK----DLLKMLVEVCHKHGIRVVLDAVFNHASEQFPPFQDVLEKGDQSEFK 263 Query: 177 LNSTLSQQPASVYV--QHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQF 234 L+ P V + D G+ L+ P+V++++ + +DG + Sbjct: 264 DWFHLNGFPVEVQDGIANYDTFGFYGNMPKLNTANPDVKNYLIETAVNWMKETGIDGWRL 323 Query: 235 DDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVS 294 D + IK Sbjct: 324 DVANEIDHH------------------------------FWRDFRAAIKDANKEAFI--- 350 Query: 295 PAGVWRNRSHDPLGSDTRGAAAY 317 VW + LG Y Sbjct: 351 IGEVWSDSLRWLLGDQFDSVMNY 373 >UniRef50_B6W970 Putative uncharacterized protein n=1 Tax=Anaerococcus hydrogenalis DSM 7454 RepID=B6W970_9FIRM Length = 422 Score = 104 bits (258), Expect = 1e-20, Method: Composition-based stats. Identities = 57/445 (12%), Positives = 134/445 (30%), Gaps = 81/445 (18%) Query: 5 SRNKKLTIRRPAILVALALLLCSCKSTP----PESMVTPPAGSKPPATTQQSSQPMR--- 57 +NKKL + L ++ SC E + +++ +P+ Sbjct: 1 MKNKKLCFC----FLILTIIFTSCSLNKDKETSEKDFSSTEKIIYSKNSKKEKKPLGEPY 56 Query: 58 --GIWLATVSRLDW---------------PPVSSVNISNPTSRARVQQQAMIDKLDHLQR 100 G+ +D+ P V + +A ++ L Sbjct: 57 TVGV-TPDDYNMDYDTSRLKSLNEKKSKYYPEEGVKGIYLNAYTAANPKAFKKIMNLLDE 115 Query: 101 LGINTVFFQV---KPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVH 157 +N V V + T + + T +I + + +++ HK+G+ V Sbjct: 116 TKLNAVVLDVKDDWGNITCKFDTN-NKDIKYATHEILDA------EDFINKMHKKGIYVI 168 Query: 158 AWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSG-DRFVLDPGIPEVQDWI 216 ++ SV T+ + P + + +G ++P + EV+++ Sbjct: 169 GRITTFKDSVITE------------KHPDWGFKLDDGSLWKNGHGEAFMNPFMDEVRNYD 216 Query: 217 TSIVAEVVSRYPVDGVQFDDYFYTESPGSR-LNDNETYRKYGGAFASKADWRRNNTQQLI 275 I E+ + D +QFD + E + + ++ + + D R + + Sbjct: 217 LQIA-ELAANAGFDEIQFDYIRFAEGFETFHGKLDYPKGRWEKSKMDEGDKRIDAITSFV 275 Query: 276 AKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDY 335 + +++ +P G+ +G D D + Q D Sbjct: 276 KEAREMLQAYD-------TPCGIDVFGYSMQVGRA-------DGIGQDFKEMSNQA--DV 319 Query: 336 IAPQIY---WPFS------RSAARYDVLAKWW--ADVVKPTRTRLYIGIAFYKVGEPSKI 384 ++ IY W + Y+++ ++ V + + S I Sbjct: 320 MSSMIYPSHWGLNSFDIEKPDLEPYELVKRYLKEEQEVFSEIEHKPQSRPWIQDFTASWI 379 Query: 385 EPDWMINGGVPELKKQLDLNDAVPE 409 + ++ Q+ + Sbjct: 380 GAGNYMEYDKEAVEDQIKAIYDSGQ 404 >UniRef50_B8HXB6 Putative uncharacterized protein n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HXB6_CYAP4 Length = 528 Score = 103 bits (257), Expect = 1e-20, Method: Composition-based stats. Identities = 48/255 (18%), Positives = 83/255 (32%), Gaps = 18/255 (7%) Query: 18 LVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNI 77 + A LL + P + A A + + + WP V + + Sbjct: 75 IARQATLLQASLRGEPAAQSQYQALVAEQAQSLRQCRS-----------QTWPQVKGIWL 123 Query: 78 SNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENP 137 Q + LD + G N ++ Q DG L P+ P L + Sbjct: 124 QLFACDL--QPGVLESVLDRIVSQGYNRIYVQTFYDGQVLLPANRNPTPWLAVAQGSAFA 181 Query: 138 GYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIR 197 D L ++ + +RG++V+AW ++ + R+ TL Q Sbjct: 182 DRDLLAEVIQKGRERGLRVYAWV--SGMNYGSSYAQRRDRQQTLVQNGRQPATTPVGHTG 239 Query: 198 TSGDR--FVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRK 255 ++ +DP P ++ ++ V+ R DGV D Y L + + Sbjct: 240 QGFEQTAIFIDPYHPRTREDFQLMLQAVLQR-QPDGVLIDYLRYPRQSNPVLTEVKDLWI 298 Query: 256 YGGAFASKADWRRNN 270 YG A R N Sbjct: 299 YGPASRLTLRDRALN 313 >UniRef50_C6AUQ1 Putative uncharacterized protein n=2 Tax=Rhizobium leguminosarum RepID=C6AUQ1_RHILS Length = 702 Score = 103 bits (256), Expect = 1e-20, Method: Composition-based stats. Identities = 48/248 (19%), Positives = 80/248 (32%), Gaps = 29/248 (11%) Query: 63 TVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPS-- 120 T+ DW ++ ++ +D +R N V A +PS Sbjct: 11 TLRTPDWFKTATRWTQLTFVEDDPEKYDPAFWIDVFKRTKSNAVCLSAGGYI-AYYPSEV 69 Query: 121 KILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNST 180 S + K D ++D A K M V A +P+ + + + Sbjct: 70 PYHYVSKYLGDK-------DIFGALVDAARKLDMHVMARVDPHAIHDDAAKAHPEWVMIN 122 Query: 181 LSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGV-----QFD 235 P + + D T+ +PEV V E+V +Y +D V Q Sbjct: 123 ADGTPRRHW-AYPDVWVTNAYGDYNSVFMPEV-------VKEIVRKYDIDAVFANRWQGH 174 Query: 236 DYFYTESPGSRLND------NETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGV 289 Y+E R D A+ + WRR +IA+ +K+I+P Sbjct: 175 GVDYSEDSARRFKDMSGHALPVKPDAEDPAWQAWVQWRRRVLTDMIAQWDDAVKAIRPHA 234 Query: 290 EFGVSPAG 297 F + G Sbjct: 235 SFIPNMGG 242 >UniRef50_Q185X6 Cyclomaltodextrinase (Maltogenic alpha-amylase) n=7 Tax=Clostridium difficile RepID=Q185X6_CLOD6 Length = 627 Score = 103 bits (256), Expect = 2e-20, Method: Composition-based stats. Identities = 35/258 (13%), Positives = 72/258 (27%), Gaps = 54/258 (20%) Query: 77 ISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGEN 136 I + Q +I+ LD+L LGIN ++F G + + ++ + Sbjct: 204 IPTRDNFTGGDLQGVIEHLDYLSDLGINGIYFCPITIGKTNHRYDTVDYMEVDPTLGDKE 263 Query: 137 PGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQH---- 192 L+ +++EAHKR +K+ + +K N S+ Y++ Sbjct: 264 ----TLKKLIEEAHKRNIKIMLDAVFNHIGYYSKQWQDVVQNKENSRYKNWFYIKDMSKI 319 Query: 193 -------------RDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFY 239 + L+ EV ++ + + + +D + D Sbjct: 320 DTPIEQIDEKNIPYETFGCEKYMPKLNTENSEVIKYLLDVGKYWIQEFDIDAWRLDVSNE 379 Query: 240 TESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVW 299 + K +K +KP + +W Sbjct: 380 VDHV------------------------------FWRKFREEVKKVKPDIYI---LGEIW 406 Query: 300 RNRSHDPLGSDTRGAAAY 317 +G Y Sbjct: 407 HGSLPWLMGDQFDSVMNY 424 >UniRef50_Q8YXF7 All1256 protein n=4 Tax=Nostocaceae RepID=Q8YXF7_ANASP Length = 500 Score = 102 bits (255), Expect = 2e-20, Method: Composition-based stats. Identities = 63/402 (15%), Positives = 124/402 (30%), Gaps = 61/402 (15%) Query: 69 WPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDL 128 WP V ++ + + A+ +D + G N V+ +V DG L P+ P Sbjct: 95 WPNVQALWLRLYPCDMK--PGAIDQIMDRMVNRGYNEVYLEVFYDGRVLLPASANPTVWP 152 Query: 129 MTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASV 188 + D L + + +RG+KV+ W Y + R+ +++ Sbjct: 153 SVIRTKGAEKVDLLATAIQKGRQRGLKVYGWL--YTNNFGYNYALRRDREGAIARNGKGQ 210 Query: 189 YVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLN 248 ++ +G + +DP + + +V E+V R DG+ FD Y GS Sbjct: 211 TSL---YVVDNGSQVFIDPYNEQAKRDYYRMVQEIVRR-RPDGLLFDYVRYPRQAGSNSI 266 Query: 249 DNETYR--KYGGAFASKADWRRNN------TQQLIAKVSHTIKSIKP-GVEFGVSPAGVW 299 + Y A + R N ++ +++ T I + +W Sbjct: 267 ATKVADLWLYTEATQAALFRRAQNQKGLELIRRFLSQGYVTPADINDVDRLYPQEGEPMW 326 Query: 300 RNRSHDPLGSDTRGAAAYDESYADTRRWV------EQGLLDYIAP--------------- 338 + R P A W+ QG++D+++ Sbjct: 327 QGRIIAPQQKSLLTPTARQPILQ-MDLWLLAIAHSMQGIIDFVSLATHPAKQINIPTGVV 385 Query: 339 ----------QIYWPFSRSAARYDVLAKW----------WADVVKPTRTRLYIGIAFYKV 378 Q Y + R+ +W +V+ + L + K+ Sbjct: 386 FFPEGNQTVGQGYDSRLQPWDRFPNSLQWHPMSYATCGNVNCIVEQVQRVLSMAQPGTKI 445 Query: 379 GEPSKIEPDWMINGGVPELKKQLDLNDAV-PEISGTILFRED 419 ++ P L+ Q+ ++ G F Sbjct: 446 IPALAGNWGESVSNRPP-LEVQMQALRPFASKLKGVSHFAYS 486 >UniRef50_A6CFN7 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CFN7_9PLAN Length = 565 Score = 102 bits (255), Expect = 2e-20, Method: Composition-based stats. Identities = 58/440 (13%), Positives = 130/440 (29%), Gaps = 71/440 (16%) Query: 18 LVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNI 77 ++ L LL + A K + Q M+ + W + Sbjct: 33 MLRLLLLPAIVIIGHTYLERSVDAAEKNATLSPSQQQSMQAA----RQKAAWKKRRIIFN 88 Query: 78 SNPTSRARVQQQAM-IDKLDH----LQRLGINTVFFQVKPDGTALWPSKI---------- 122 ++ ++A LD L+ ++ +F+ G + + Sbjct: 89 NDGNEPVYSLKEATPQALLDVRTSPLKGSQVDAIFYCTWSSGFSYFTHDTKVGNVFTETA 148 Query: 123 LPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRE----LN 178 S+ T ++ +N G+DPL M D + +++ F +R++ + Sbjct: 149 NKLSNNKTAELIKN-GHDPLTVMSDWCKENDVEL---FWSFRMNDTHDASSAWYGPLLFP 204 Query: 179 STLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYF 238 + P + ++ +G +D E+ D V EV Y VDGV+ D + Sbjct: 205 PLKKEHPEWLVGSAKEKP-KNGRWTAVDFTHEEICDLAYRYVEEVCRNYDVDGVELDFFR 263 Query: 239 YTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGV 298 + ++ + G + L+ ++ + Sbjct: 264 HLNYFK-----RVSWGEPAGDLEL------SRLNDLMRRIRTMADEVGQ----------- 301 Query: 299 WRNRSHDPLGSDTRGAAAYDESY-ADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKW 357 + + + Y D W+++ L+D + Y+ Sbjct: 302 -QRGRPILIAIRVPDSVEYARVLGLDVETWLKEDLVDIMTVTGYFR-----------LNP 349 Query: 358 WADVVK---PTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTI 414 W + V+ + +Y G++ + + + E + +N I G Sbjct: 350 WKESVELGHKYQVPVYAGLSESR-----QKDQRARKVYASTEGFRGRAMNAWSQGIDGIY 404 Query: 415 LFREDYLNKPQTQQAVSYLQ 434 LF P ++ + Sbjct: 405 LFNSFNPRHPLWRELGDPTR 424 >UniRef50_Q11AV5 Putative uncharacterized protein n=1 Tax=Chelativorans sp. BNC1 RepID=Q11AV5_MESSB Length = 685 Score = 102 bits (255), Expect = 2e-20, Method: Composition-based stats. Identities = 38/243 (15%), Positives = 82/243 (33%), Gaps = 34/243 (13%) Query: 68 DWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSD 127 W + LD ++ +G N V+ + +P+++ + Sbjct: 19 PWYRSPFRLFQTNLLEPDADM-DVEKVLDFIEDMGCN-VWLVNGGGILSFYPTRLEHQTR 76 Query: 128 LMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPAS 187 + P D ++ H+RG++V A + +V+ + + P Sbjct: 77 NP--YLDRRPSGDLFGDAVEAGHRRGIRVMARMDFSKVN-----------QAVADRHPDW 123 Query: 188 VYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGV--------QFDDYFY 239 ++V+ + + + +DP Q+ + +V E++ RYP+DG +FD + Sbjct: 124 LFVRPDGRRQAAEGQVSVDPSGDYYQEKLLEVVDEMIDRYPLDGFFFNRAGFNEFDYAMH 183 Query: 240 TESPGSRLNDNETYRKY--------GGAFASKADWR---RNNTQQLIAKVSHTIKSIKPG 288 + + G + WR L ++S IK+ +P Sbjct: 184 YHGVSQSEASKRGFAAFSGGQQLPTGPESPNYDLWRAYCAKVVGDLWVRISAHIKTRRPD 243 Query: 289 VEF 291 V Sbjct: 244 VAL 246 >UniRef50_Q67N80 Putative uncharacterized protein n=1 Tax=Symbiobacterium thermophilum RepID=Q67N80_SYMTH Length = 384 Score = 102 bits (255), Expect = 2e-20, Method: Composition-based stats. Identities = 74/416 (17%), Positives = 123/416 (29%), Gaps = 99/416 (23%) Query: 26 CSCKSTPPESMVTPPAGSKPPATTQQSSQPM----RGI----WLATVSRLDWPPVSSVNI 77 S + P G +P + ++ M RG+ W A L WP Sbjct: 38 ASADPGGSGNAQAPENGPQPGDGPARPAREMPDPVRGLHLSGWYAGSPDLVWP------- 90 Query: 78 SNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKP-DGTALWPSKILPWSDLMTGKIGEN 136 LD + GINT+ +K DG W S + P + + + + Sbjct: 91 ----------------LLDWAKEAGINTIVLDLKAEDGYLSWESDL-PLAQEIGANMRKI 133 Query: 137 PGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWI 196 L + EAH+RG W + +Y +W Sbjct: 134 AD---LPAFVAEAHERG----FWV----------------AGRIVVMNDQWLYKARPEWA 170 Query: 197 ---RTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETY 253 G +DP V + + E ++ VD +QFD Y+E Sbjct: 171 IPGFDGGAYSFMDPANENVWKYNVDVAKEAIAA-GVDEIQFDYIRYSEHLREG------- 222 Query: 254 RKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRG 313 Y G + A+ R + +K + GV G G+ + G D Sbjct: 223 --YNGKDTT-AEQRTKPINDFLRYAMAELKPL--GVVVGADVFGL---TTSVAEGDDMEI 274 Query: 314 AAAYDESYADTRRWVEQGLLDYIAPQIY--------WPFSRSAA-RYDVLAKWWADVVKP 364 Y + ++DYIAP +Y + A Y+ + ++ Sbjct: 275 GQDYRQI---------AEIVDYIAPMVYPSHYAPYTYGLDNPNAHPYETVYNSMKKALER 325 Query: 365 TRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDY 420 T G+ K + I G E+ Q+ + I +L+ Sbjct: 326 TE-----GLPIEKHRPWIQDFSLGGITYGAAEVMAQVQALKDL-GIESFMLWDPSN 375 >UniRef50_B4D7E2 Putative uncharacterized protein n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D7E2_9BACT Length = 423 Score = 102 bits (254), Expect = 2e-20, Method: Composition-based stats. Identities = 53/320 (16%), Positives = 93/320 (29%), Gaps = 49/320 (15%) Query: 138 GYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIR 197 G D LQ + + H M+ +A V G + + + H +W Sbjct: 122 GVDTLQCIAEGCHAADMQCYASVRMNAVYPLKANGWVGDSMARFFNSKFWW--DHPEWRV 179 Query: 198 TSGD---RFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYR 254 S D + L PEV+ + IV EV+ R VDGV D + G + + Y+ Sbjct: 180 RSRDGREQPSLSYAFPEVRARVLGIVREVLER-DVDGVDLDFLRHPPCFGYEESLVKGYQ 238 Query: 255 K---------YGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHD 305 +R L+ ++ + + Sbjct: 239 DRFHLDPQTIPDDHDERWLHYRAELMTGLLREIRQAVDEAAKK-------------KGRP 285 Query: 306 PLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPT 365 S A Y D W+++ LLD + +D + + K T Sbjct: 286 LGLSARIDHANYLLWGCDVDVWLQERLLDILVV-----SQHGLGGWDFDLRPFVQKAKGT 340 Query: 366 RTRLYIG-IAFYKVGEPSKIE-----PDWMINGGVPELKKQLDLND----AVPEISGTIL 415 +Y+G A +P+ + P + + + +G + Sbjct: 341 GCAVYLGEEATIAGRDPTPGDVAKLKPGEKAPATATTMTEAMWFERARHWYEQGAAGIHI 400 Query: 416 FR------EDYLNKPQTQQA 429 F YL P +Q Sbjct: 401 FNGAPPSVLKYLGDPPAKQP 420 >UniRef50_C7HH08 GTP-binding protein n=3 Tax=Clostridium thermocellum RepID=C7HH08_CLOTM Length = 419 Score = 102 bits (254), Expect = 2e-20, Method: Composition-based stats. Identities = 50/369 (13%), Positives = 103/369 (27%), Gaps = 76/369 (20%) Query: 81 TSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYD 140 T + ++ M ++ + +NTV VK DG + S++ + N Sbjct: 88 TGTSAGNKKFMERLVNLINTTELNTVVLDVKEDGKVNYASEVESVKKIGAYHELYNVD-- 145 Query: 141 PLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSG 200 ++ H + V +R + + + R +G Sbjct: 146 ---EVIKLLHDNNIYVIGRIVCFRDNYLAGKRVDLA-----------IKRKDGSIWRENG 191 Query: 201 DRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAF 260 +P EV + I E V + D +QFD + + + ++ YG Sbjct: 192 SIAWTNPYNKEVWRYNIDIAKEAVKK-GFDEIQFDYVRFPAAGKNEVD-------YGENP 243 Query: 261 ASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDES 320 K + + + + I K GV + D G + Sbjct: 244 IPK----ADAISGFLKEAASEI--NKMGVPVSADIFAIVCETPGDTEG----IGQVLERI 293 Query: 321 YADTRRWVEQGLLDYIAPQIYWP-----------------------FSRSAARYDVLAKW 357 D +DYI+P IY + Y+V+ Sbjct: 294 GMD---------IDYISPMIYPSHFANASRGMMGNGKGQSINGILFTAPDLKPYEVVYNV 344 Query: 358 WADV------VKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEIS 411 V+ R ++ + + S + + + G ++++Q+ Sbjct: 345 LLKTKDRISKVEGYRAKV---RPYLQGFTASYLPKGYYQHYGPEQIRQQIKAVYD-AGYE 400 Query: 412 GTILFREDY 420 I + Sbjct: 401 EWIFWNAAN 409 >UniRef50_Q6TXT5 AmyM n=1 Tax=uncultured bacterium RepID=Q6TXT5_9BACT Length = 517 Score = 102 bits (254), Expect = 2e-20, Method: Composition-based stats. Identities = 39/224 (17%), Positives = 71/224 (31%), Gaps = 40/224 (17%) Query: 69 WPPVSSVN------ISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKI 122 WP + + + KLD+++ LG N ++F + Sbjct: 31 WPQAGVTYEIFVQSFYDSNGDSIGDFNGVTQKLDYVKELGANAIWFMPIMPSPTYHKYDV 90 Query: 123 LPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKV------------HAWF--------NP 162 + + D + +LDEAHKR +K+ H WF NP Sbjct: 91 TDYKAVHPDYG----TLDDFKKLLDEAHKRDIKIVIDLIINHTSNEHPWFLEAKSGRDNP 146 Query: 163 YRVSVNT----------KPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEV 212 YR TI + Q + + G L+ P+V Sbjct: 147 YRDYYVWAQKDTIADFLNKKTITFDLDNIRQWHDPGQGEDFYYGFFWGGMPDLNFDNPKV 206 Query: 213 QDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKY 256 ++ I I + VDG + D + L+++ ++++ Sbjct: 207 REEIYEIGRFWLEEVGVDGFRLDAAKHIFPDDRPLDNHAFWKEF 250 >UniRef50_B9HAL8 Predicted protein n=1 Tax=Populus trichocarpa RepID=B9HAL8_POPTR Length = 826 Score = 102 bits (253), Expect = 3e-20, Method: Composition-based stats. Identities = 36/204 (17%), Positives = 65/204 (31%), Gaps = 7/204 (3%) Query: 32 PPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAM 91 P ++ + + S G +L V +LD VN + Sbjct: 225 PQRDLIIYEMHVRGFTQHESSRTEFPGTYLGVVEKLDHLKELGVNCIELMPCHEFNELEY 284 Query: 92 IDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHK 151 L +N T + S + +S T G + + + ++ EAHK Sbjct: 285 YSYNSVLGDYKVN-----FWGYSTVNYFSPMTRYSSAGTRNCGRDA-INEFKLLVREAHK 338 Query: 152 RGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPE 211 RG++V + + G I + ++ SG + P Sbjct: 339 RGIEVFMDVVFNHTAEGNEKGPILSFRGV-DNSIYYMLAPKGEFYNYSGCGNTFNCNHPI 397 Query: 212 VQDWITSIVAEVVSRYPVDGVQFD 235 V+ +I + V+ VDG +FD Sbjct: 398 VRQFILDCLRYWVTEMHVDGFRFD 421 >UniRef50_B6FM44 Putative uncharacterized protein n=3 Tax=Clostridiales RepID=B6FM44_9CLOT Length = 587 Score = 102 bits (253), Expect = 3e-20, Method: Composition-based stats. Identities = 35/253 (13%), Positives = 70/253 (27%), Gaps = 55/253 (21%) Query: 81 TSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYD 140 R + KL +L+ LGIN ++ + + ++ + Sbjct: 174 DERFGGNLAGIRKKLSYLKELGINGIYLNPIMEAESNHKYDTTDYTKIDPHFGTNEE--- 230 Query: 141 PLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDW----- 195 ++ EAH G++V P + S+ V DW Sbjct: 231 -FAQLVKEAHGHGIRVMVDAVFNHSGSKFVPWRDVQEYGKESKYADWFMV--NDWSNIKK 287 Query: 196 -----------IRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPG 244 + L+ EV + I + + VDG++FD Sbjct: 288 KADTRDERFYSFAFIDNMPKLNTNNEEVIQYFCGICENWIKEFDVDGIRFDVGN------ 341 Query: 245 SRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSH 304 + + + ++ ++SIKP + +W + S Sbjct: 342 ------------------------EVSHRFLKRIREHVRSIKPDIYL---LGEIWHDASQ 374 Query: 305 DPLGSDTRGAAAY 317 +G + Y Sbjct: 375 WLMGDEYDSVMNY 387 >UniRef50_A9B4Y8 Alpha amylase catalytic region n=1 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9B4Y8_HERA2 Length = 477 Score = 102 bits (253), Expect = 3e-20, Method: Composition-based stats. Identities = 42/265 (15%), Positives = 75/265 (28%), Gaps = 52/265 (19%) Query: 68 DWPPVSSVNISNPTSRAR--VQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPW 125 + P + ++PT Q +IDKLD+L LGIN ++ T + Sbjct: 32 NDPANAQPWGTSPTLYNYMGGDLQGIIDKLDYLVDLGINALYLNPIFQATTSHKYNTFDY 91 Query: 126 SDLMTGKIGENPGYDPLQFMLDEAHKRGMKVH--AWFN-----PYRVSVNTKPGTIRELN 178 + + +L+EAH+RG+KV A FN + + G Sbjct: 92 FKIDPHFGTLE----TFKTLLNEAHRRGIKVILDAVFNHCGRGFFAFHDVIENGVHSPYT 147 Query: 179 ST-----LSQQP-ASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGV 232 + P S Y + + + P V+ ++ + + +DG Sbjct: 148 NWFHISRFPIHPYESRYAANYRTWWDFRELPKFNTDNPAVRKYLLDVARYWI-ELGIDGW 206 Query: 233 QFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFG 292 + D + + +K I P Sbjct: 207 RLDVPNEIDDH-----------------------------NFWREFRTIVKDINPEAYI- 236 Query: 293 VSPAGVWRNRSHDPLGSDTRGAAAY 317 +W + S G Y Sbjct: 237 --VGEIWTDGSAWLQGDQFDAVMNY 259 >UniRef50_B8D2L1 Pullulanase n=1 Tax=Desulfurococcus kamchatkensis 1221n RepID=B8D2L1_DESK1 Length = 687 Score = 102 bits (253), Expect = 4e-20, Method: Composition-based stats. Identities = 37/248 (14%), Positives = 71/248 (28%), Gaps = 50/248 (20%) Query: 83 RARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPL 142 + + +KLD+L+ LGI ++ ++ + + + D L Sbjct: 205 YFGGDLKGITEKLDYLKELGIGLIYLNPIFTSGSVHGYDVYDYYTVDPKFG----TLDDL 260 Query: 143 QFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQH---------- 192 +L+EAHKRG++V F P V + N S+ + + Sbjct: 261 MMLLNEAHKRGIRVIFDFVPDHVGLGFWAFQDVYRNGPSSRYWSWFIIYKWPFKLGDSSA 320 Query: 193 RDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNET 252 G L+ EV+ ++ ++ +S DG++ D Sbjct: 321 YKCWWGIGSLPQLNVLNKEVRQYLINVALYWLSI-GFDGLRIDAPL-------------- 365 Query: 253 YRKYGGAFASKADWRRNNTQ--QLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSD 310 + ++ +KS P +W R G Sbjct: 366 ----------------DVIDSENFFRELREAVKSRYPDAYI---VGEIWDYRPKWLRGEA 406 Query: 311 TRGAAAYD 318 Y Sbjct: 407 FDSLMNYY 414 >UniRef50_D1AEP8 Putative uncharacterized protein n=1 Tax=Thermomonospora curvata DSM 43183 RepID=D1AEP8_THECD Length = 540 Score = 102 bits (253), Expect = 4e-20, Method: Composition-based stats. Identities = 55/331 (16%), Positives = 104/331 (31%), Gaps = 63/331 (19%) Query: 92 IDKLDHLQRLGINTVFFQVKPDGTAL-WPSKILPWSDLMTGKIGENPGYDPLQFMLDEAH 150 L ++ IN V +K + + + S++ P + + G YD + LD+ H Sbjct: 234 ERILKMIREKRINAVQLDIKDEDGIIGYDSQV-PLAREVKATRGI---YDA-RQALDQLH 288 Query: 151 KRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIP 210 ++V +R K L + P + G + + P Sbjct: 289 AMNVRVIGRIVAFRDPQLGKASWRAGKRDRLVRTPDGGA-----YGSQYGAQSFTNLAHP 343 Query: 211 EVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNN 270 EV+ + + E R D + +D + P ++ G S D + Sbjct: 344 EVRKYNIDLAVEAA-RLGFDEILYDYVRRPDGPLKS-------MRFPGLRGSVED----S 391 Query: 271 TQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQ 330 IA+ ++ G G S G+ R E D + + Sbjct: 392 VASFIAETRRLVRPH--GAFLGASVYGIAATRP--------------HEIGQDIAKIGKH 435 Query: 331 GLLDYIAPQIY--------WPFSRSAA-RYDVLAKWWA---DVVKPTRTRLYIGIAFYKV 378 +DY+AP +Y + A Y + V++ T ++ + + + Sbjct: 436 --VDYVAPMLYPSHWGAGEFGLKNPNAEPYKTVYASMLTFHKVLRGTSAQIVPWLQDFSL 493 Query: 379 GEPSKIEPDWMINGGVPELKKQLDLNDAVPE 409 G P GV E+K Q++ Sbjct: 494 GHP----------YGVAEVKAQIEAAAKTGS 514 >UniRef50_C1I251 Trehalose-6-phosphate hydrolase n=1 Tax=Clostridium sp. 7_2_43FAA RepID=C1I251_9CLOT Length = 528 Score = 101 bits (251), Expect = 6e-20, Method: Composition-based stats. Identities = 50/263 (19%), Positives = 91/263 (34%), Gaps = 41/263 (15%) Query: 56 MRGIW--LATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPD 113 MR IW + P + + + + KLD+L +LG+ ++ Sbjct: 1 MRKIWWKETVFYEIYMPSFK-----DGNNDGIGDFKGITSKLDYLHKLGVKGIWL----- 50 Query: 114 GTALWPSKILPWSDLMTGKIGENPGYDPL---QFMLDEAHKRGMKV------------HA 158 T +PS + ++ + Y L + +L++AHK +KV H Sbjct: 51 -TPFYPSPKVDNGYDISDYYNIDKDYGTLEDFKDLLEKAHKLDIKVIGDIVLNHTSSEHP 109 Query: 159 WFNPYRVSVNTKPGTIRELNSTLSQQPASVY----------VQHRDWIRTSGDRFVLDPG 208 WF + S + + S + Q + + ++ L+ Sbjct: 110 WFKESKSSKDNEKRDFYIWRKDKPNNWESFFGEDAWEYDNVTQEYYYHAFAKEQVDLNWA 169 Query: 209 IPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRR 268 P+V D + ++ + VDG +FD + N+N G K D + Sbjct: 170 NPKVYDEMIKVLRFWL-DLGVDGFRFDVINFLTVNEDLSNNNPY--DEKGEQIHKFDKDQ 226 Query: 269 NNTQQLIAKVSHTIKSIKPGVEF 291 N +I K++ IKSIK Sbjct: 227 NGVLDIIKKLAKDIKSIKKDAFL 249 >UniRef50_A6LHH0 Putative uncharacterized protein n=6 Tax=Bacteroidales RepID=A6LHH0_PARD8 Length = 448 Score = 101 bits (251), Expect = 6e-20, Method: Composition-based stats. Identities = 43/333 (12%), Positives = 86/333 (25%), Gaps = 84/333 (25%) Query: 129 MTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASV 188 + G + P D + + A K G++V+AW + ++ L + P Sbjct: 57 IDGVMLNAPTPDDYRAAIPIAQKHGIEVYAWLWTM--------NPEHDRDAILKEHPEWF 108 Query: 189 YVQHR-----DWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYF----- 238 V D + P +PEV+++I + ++G+ D Sbjct: 109 SVNRNGQSLADTTAYVDYYKFMCPALPEVREFIKKKIEAYCEVEGLNGIAIDYNRFVDVI 168 Query: 239 -----------------------YTESPGSRLNDNETY----RKYGGAFASKADWRRNNT 271 Y + + Y ++ +R + Sbjct: 169 LPTTLWPKYGIVQDQEYPQWDFGYHPAMIEKFKAAYGYDPREQEDPSQDEKWLQFRCDQI 228 Query: 272 QQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQG 331 ++ ++ + S + P A D +W Sbjct: 229 TEVANMIADVVHSYGKKMAASPFPTPK----------------MASKMVRQDWGKW---- 268 Query: 332 LLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMIN 391 LD + P +Y F + K +T LY G+ Sbjct: 269 NLDIVFPMVYHNFYTEDISFISDCMIEDVRDKNPKTTLYCGL------------------ 310 Query: 392 GGVPELKKQLDLNDAVPEISGTILFREDYLNKP 424 +++ +D G +F L P Sbjct: 311 MVADDIENAMDAALNH-GAEGISIFTVSALRTP 342 >UniRef50_O06458 Trehalose synthase n=7 Tax=Thermaceae RepID=TRES_THETH Length = 963 Score = 101 bits (250), Expect = 7e-20, Method: Composition-based stats. Identities = 44/270 (16%), Positives = 86/270 (31%), Gaps = 51/270 (18%) Query: 76 NISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGE 135 + + + + + KL +L+ LG+NT++ + S + ++ Sbjct: 18 SFFDANNDGYGDFEGLRRKLPYLEELGVNTLWLMP------FFQSPLRDDGYDISDYYQI 71 Query: 136 NPGYDPLQFM-LDEAHKRGMKV------------HAWFN-------PYRVSVNTKPGTIR 175 P + L+ +DEAH RGMKV H WF P R + Sbjct: 72 LPVHGTLEDFTVDEAHGRGMKVIIELVLNHTSIDHPWFQEARKPNSPMRDWYVWSDTPEK 131 Query: 176 E-------LNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYP 228 + S + W R + L+ PEV+ I ++ Sbjct: 132 YKGVRVIFKDFETSNWTFDPVAKAYYWHRFYWHQPDLNWDSPEVEKAIHQVMFFWA-DLG 190 Query: 229 VDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPG 288 VDG + D Y Y + G + + + T + + ++ ++ Sbjct: 191 VDGFRLDAIPYL------------YEREGTSCENLPE-----TIEAVKRLRKALEERYGP 233 Query: 289 VEFGVSPAGVWRNRSHDPLGSDTRGAAAYD 318 + ++ +W + G AY+ Sbjct: 234 GKILLAEVNMWPEETLPYFGDGDGVHMAYN 263 >UniRef50_Q5N184 Putative uncharacterized protein n=2 Tax=Synechococcus elongatus RepID=Q5N184_SYNP6 Length = 481 Score = 100 bits (249), Expect = 9e-20, Method: Composition-based stats. Identities = 42/179 (23%), Positives = 68/179 (37%), Gaps = 8/179 (4%) Query: 69 WPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDL 128 WP ++ + + R + D LQ LG N VF + DG L P+ P Sbjct: 91 WPRKMAIWVRLYSCDLR--PGGLDSLFDGLQALGYNEVFIETFYDGRVLLPAADNPTVWP 148 Query: 129 MTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASV 188 D L + + +RGM V+AW + ++ + TL++ S Sbjct: 149 SVVAEPGLERVDLLAEAIRKGRERGMSVYAWL--FTLNYGYSYSQRSDRQDTLARNGRSE 206 Query: 189 YVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRL 247 I + G + +DP P + +++ V+SR DGV FD Y G+ Sbjct: 207 SSLE---IVSGGAQVFVDPFNPVARQDYQTLLRSVLSR-RPDGVLFDYVRYPRGTGAAS 261 >UniRef50_C2KVT1 Putative uncharacterized protein n=1 Tax=Oribacterium sinus F0268 RepID=C2KVT1_9FIRM Length = 435 Score = 100 bits (249), Expect = 9e-20, Method: Composition-based stats. Identities = 46/379 (12%), Positives = 101/379 (26%), Gaps = 65/379 (17%) Query: 59 IWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPD-GTAL 117 IW + D V + I++ T+ + M D L ++ +N + +K D G + Sbjct: 76 IWKKKDIQKDRVKVKGIYITDLTAGSPK----MEDILSKMKDTELNALVIDIKNDNGQIV 131 Query: 118 WPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIREL 177 + + L +L + H++G+ + A +R + Sbjct: 132 YQMNNGGQQEFYNTTNIVK----DLPALLKKCHEQGLYLIARLVCFRDPAMGEV------ 181 Query: 178 NSTLSQQPASV-YVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDD 236 P + + + ++P + D+I S+ D +Q D Sbjct: 182 ------HPEWMNQKADGSLFKDNSGMTWINPYKKDYWDYIASVAERCADD-GFDEIQLDY 234 Query: 237 YFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPA 296 + G + +Y + + + + +S + V F Sbjct: 235 VRFCTEKGMKEV------QYPEEAKTN---KTQIITEFVQYMSDRL--ANKQVFFSTDVF 283 Query: 297 GVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIY---WPFSR------S 347 G D D +DY+ P IY + Sbjct: 284 GTIIGSYVDS-----------TAVGQDYSDMAAS--VDYMCPMIYPSHYGDGNFGIEHPD 330 Query: 348 AARYDVLAKWWAD------VVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQL 401 Y + +VK + + + S ++ I + + Q+ Sbjct: 331 TDPYKTIYSALRSSQKELALVKSGDSHQATVRPWLQGFTASYLQH--YIPYEKEQFRAQI 388 Query: 402 DLNDAVPEISGTILFREDY 420 + + Sbjct: 389 QAVYD-SGYDEWLFWNAGS 406 >UniRef50_C5A1I5 Pullulan hydrolase type III (PulhA) n=3 Tax=Thermococcus RepID=C5A1I5_THEGJ Length = 790 Score = 100 bits (249), Expect = 1e-19, Method: Composition-based stats. Identities = 37/245 (15%), Positives = 70/245 (28%), Gaps = 46/245 (18%) Query: 83 RARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPL 142 + +KLD+L LG+ ++ ++ + + E L Sbjct: 374 YFGGDIAGITEKLDYLSSLGVRLIYLNPIFLSGSVHGYDTYDYYRVDPKFGTETE----L 429 Query: 143 QFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQH---------- 192 + L EAHKRG+KV F P + N SQ +++ Sbjct: 430 KLFLSEAHKRGIKVIFDFVPDHSGIGADQFLDVWKNGRESQYWNWYFIKRWPFKLGDGSA 489 Query: 193 RDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNET 252 + G L+ PEV++++ + + + DG++ D +P +N Sbjct: 490 YEGWWGLGSLPKLNTTNPEVREYLIGSALKWL-DFGFDGIRVD------TPADLVNA--- 539 Query: 253 YRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTR 312 + + +K P +W G Sbjct: 540 -------------------DEFFREFRERVKEKHPNAYL---VGEIWTLSPEWVKGDKFD 577 Query: 313 GAAAY 317 Y Sbjct: 578 SLMNY 582 >UniRef50_C2CVQ9 Neopullulanase n=1 Tax=Gardnerella vaginalis ATCC 14019 RepID=C2CVQ9_GARVA Length = 529 Score = 100 bits (248), Expect = 1e-19, Method: Composition-based stats. Identities = 33/243 (13%), Positives = 66/243 (27%), Gaps = 50/243 (20%) Query: 89 QAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDE 148 + +I+ LD+++ LG N ++ +L + + + D + ++ E Sbjct: 118 RGIIENLDYIEDLGFNCLYLNPIFKAAEYHRYDLLDYYHVCPNLGTD----DDFRELVSE 173 Query: 149 AHKRGMKVH-------------AWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDW 195 H RGM + A+ + + N++ ++P Sbjct: 174 VHNRGMHIIIDGVFNHSSWYFFAFDDVVKNGENSRYKDWFYGLKFPVKRPEDGKRPSYTC 233 Query: 196 IRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRK 255 L+ PEV+D+ + + + VDG + D + Sbjct: 234 FAYERKMPKLNTSNPEVRDYFMDVCRYWLEDFDVDGWRLDVANEVDK------------- 280 Query: 256 YGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAA 315 K K V A +W N G Sbjct: 281 -----------------DFWRAFRSVAKKTKKD---SVLIAEIWENSERWLQGDMFDSTM 320 Query: 316 AYD 318 Y+ Sbjct: 321 NYE 323 >UniRef50_D1JA21 Conserved hypothetical membrane protein, DUF187 family n=2 Tax=uncultured archaeon RepID=D1JA21_9ARCH Length = 1594 Score = 99.9 bits (247), Expect = 2e-19, Method: Composition-based stats. Identities = 37/227 (16%), Positives = 74/227 (32%), Gaps = 40/227 (17%) Query: 94 KLDHLQRLGINTVFFQVKPD-GTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKR 152 ++ L+ ++TV +K D G +PS++ + + +D+AH+ Sbjct: 114 IINKLKSGNVSTVIINLKDDNGFVYFPSEVAEEDAIGQDINVT-------KVFIDKAHEE 166 Query: 153 GMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEV 212 G++V A + +R T+ P V + D + P E Sbjct: 167 GLRVFAALSCFR------------DPITVGDHPEWSQVDNED----KRSEEWICPLNDEY 210 Query: 213 QDWITSIVAEVVSRYPVDGVQFDDYFY---------------TESPGSRLNDNETYRKYG 257 ++++ ++ EV+ Y +DGV D+ Y G + +Y Sbjct: 211 KEYLLNLREEVLG-YDIDGVVLSDFGYAGSDYCFCDLCKRGFWNDTGIDPGKVDLANRYS 269 Query: 258 GAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSH 304 +WR +K I P + G + + Sbjct: 270 SNTQKWFEWRATMVTDFFVSFCKQVKKIDPEITIGARMQNPFDDYYP 316 >UniRef50_B8MJB7 Maltase n=14 Tax=cellular organisms RepID=B8MJB7_TALSN Length = 1108 Score = 99.5 bits (246), Expect = 2e-19, Method: Composition-based stats. Identities = 38/261 (14%), Positives = 72/261 (27%), Gaps = 48/261 (18%) Query: 67 LDWPPVSSVNISNPTSRARVQQQAMIDK------LDHLQRLGINTVFFQVKPDGTALWPS 120 + W ++V P S + D LD+++ LG++ ++ ++ S Sbjct: 1 MSWWKNATVYQIYPASFKDSNGDGIGDIPGIHAQLDYIESLGVDAIWLCP------MYDS 54 Query: 121 KILPWSDLMTGKIGENPGYDPLQF---MLDEAHKRGMKV------------HAWFNPYRV 165 ++ P Y ++ +++ H+RGMK+ HAWF R Sbjct: 55 PQHDMGYDISNYEAVYPPYGTVKDVEALIEACHERGMKILLDLVINHTSHEHAWFKESRS 114 Query: 166 SVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFV--------------------L 205 S N+ + +W G L Sbjct: 115 SKNSPKRDW-YIWRPAKYDADGTRRPPNNWRSCFGGSVWEWDEETQEYYLHLFAPQQPDL 173 Query: 206 DPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKAD 265 + P + I E R +DG + D P + T A Sbjct: 174 NWENPATRAAIYESAMEFWLRKGIDGFRVDTVNMYSKPSDFPDAPVTDASLQWQSAHHLF 233 Query: 266 WRRNNTQQLIAKVSHTIKSIK 286 + I ++ + Sbjct: 234 CNGPRIHEYIREMGQVLLKYN 254 >UniRef50_Q04KP3 Neopullulanase n=31 Tax=Bacteria RepID=Q04KP3_STRP2 Length = 587 Score = 99.5 bits (246), Expect = 2e-19, Method: Composition-based stats. Identities = 39/266 (14%), Positives = 74/266 (27%), Gaps = 55/266 (20%) Query: 67 LDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWS 126 LDW Q +ID +D+LQ LGI ++ + T+ + Sbjct: 168 LDWDSSV---TPKSDDFFGGDLQGIIDHMDYLQDLGITGLYLCPIFESTSNHKYNTTDYF 224 Query: 127 DLMTGKIGENPGYDPLQFMLDEAHKRGMKVH--AWFNPYRVSVNTKPGTIRELNSTLSQQ 184 ++ + + ++D+AH RGMKV A FN ++ + + Sbjct: 225 EIDRHFGDKE----TFRELVDQAHHRGMKVMLDAVFNHIASQSLQWKNVVKNGEQSAYKD 280 Query: 185 PASVY---------VQHRDWIR----TSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDG 231 + V RD L+ PEV++++ + + + +D Sbjct: 281 WFHIQQFPVTTEKLVNKRDLPYHVFGFEDYMPKLNTANPEVKNYLLKVATYWIEEFNIDA 340 Query: 232 VQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEF 291 + D + Q + + P + Sbjct: 341 WRLDVANEIDH------------------------------QFWKDFRKAVLAKNPDLYI 370 Query: 292 GVSPAGVWRNRSHDPLGSDTRGAAAY 317 VW G + Y Sbjct: 371 ---LGEVWHTSQPWLNGDEFHAVMNY 393 >UniRef50_C4ZDP5 Alpha-amylase n=4 Tax=Clostridiales RepID=C4ZDP5_EUBR3 Length = 572 Score = 99.5 bits (246), Expect = 2e-19, Method: Composition-based stats. Identities = 35/267 (13%), Positives = 73/267 (27%), Gaps = 52/267 (19%) Query: 70 PPVSSVNISNPTSRAR---VQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWS 126 P + P + + +I+ LD+++ LGI+ V+ + + + Sbjct: 151 PVDKKLWYKAPITPMDDLHGNLRGIIEHLDYIKDLGIDVVYLTPIFKSNSCHKYDTIDYY 210 Query: 127 DLMTGKIGENPGYDPLQFMLDEAHKRGMKV-------HAWFNPYRVSVNTKPGTIRELNS 179 + L+ ++ ++H+RGMKV H + + G + Sbjct: 211 QVDPSFGTTE----DLKELVQKSHERGMKVVLDAVYNHTGREFFAFQDILEKGEKSKYLD 266 Query: 180 T-----LSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQF 234 L + + + G L+ PEV+ +IT + + +DG + Sbjct: 267 WYFIDELPPRGEWGEIPNFKCFGYYGGMPKLNLKNPEVEKYITDVACYWIKECDIDGWRL 326 Query: 235 DDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVS 294 D IK++K + Sbjct: 327 DVGDEISHF------------------------------FWKNFRKAIKAVKKDMLI--- 353 Query: 295 PAGVWRNRSHDPLGSDTRGAAAYDESY 321 +W G + Y Sbjct: 354 IGEIWHYAGDFLEGDEWDTVMNYPFYL 380 >UniRef50_Q54S16 Putative uncharacterized protein n=1 Tax=Dictyostelium discoideum RepID=Q54S16_DICDI Length = 770 Score = 99.5 bits (246), Expect = 2e-19, Method: Composition-based stats. Identities = 47/309 (15%), Positives = 96/309 (31%), Gaps = 70/309 (22%) Query: 23 LLLCSCKSTPPESMVTPPAGSKPPAT----TQQSSQPMRGIWLATVSRLDWPPVSSVN-- 76 +L S T + ++ P + ++ M +VS W + Sbjct: 14 MLSLSSGDTLTQQFISSPPLNNSTDEFLDYSEHERSEM------SVSNNLWYKEAIFYEV 67 Query: 77 ----ISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGK 132 + + +KLD+L LG++ ++ ++PS + ++ Sbjct: 68 YVRAFCDIEGTGNGGISGITNKLDYLHTLGVDCIWL------LPIYPSPLKDDGYDISDY 121 Query: 133 IGENPGYDPL---QFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVY 189 +P Y L + ++ H+R MK+ A F P S K L+ + V+ Sbjct: 122 CDIHPDYGTLNDFKILVKAVHERNMKIIADFIPNHCSDKHKWFQSARLSRDSPYRDYFVW 181 Query: 190 VQHRD---------------------------WIRTSGDRFVLDPGIPEVQDWITSIVAE 222 W R ++ L+ P+VQ + +I+ + Sbjct: 182 SDSPQKYKDARIIFLDVEQSNWTWDEAAGQYYWHRFYKEQPDLNFDNPKVQQEMLNII-D 240 Query: 223 VVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTI 282 +DG + D Y + + G + + + T + + K+ I Sbjct: 241 FWLNLGIDGFRVDAVPYL------------FEREGTSCENLPE-----THEFLKKMRKFI 283 Query: 283 KSIKPGVEF 291 PG Sbjct: 284 DDKYPGRII 292 >UniRef50_A0KFK2 Glycosidase n=3 Tax=cellular organisms RepID=A0KFK2_AERHH Length = 557 Score = 99.5 bits (246), Expect = 2e-19, Method: Composition-based stats. Identities = 55/302 (18%), Positives = 98/302 (32%), Gaps = 30/302 (9%) Query: 2 DICSRNKKLT--IRRPAILVALALLLCSCKSTPPESMV---TPPAGSKPPATTQ------ 50 S N + R L+ ALLL +C S PP + PPAT Sbjct: 11 YHHSGNITMKQGTLRHGGLIGAALLLTACGSGGGSGSSEPQQPPVTNPPPATASRACYGN 70 Query: 51 -QSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQ 109 Q + +R + S +D ++ + S+ Q +ID LDH++ L +N ++ Sbjct: 71 DQPACNLRTYQVMVESFVDGDGSANYGVGYGPSQHNGDLQGIIDSLDHIKSLNVNAIWLT 130 Query: 110 VKPD--GTALWPSKILPWSDLMTGKIGENPGYD---PLQFMLDEAHKRGMKVHAWFNPYR 164 D K+ +P + L+ ++D AH+RG+ V + Sbjct: 131 PVFDSCAGQGGDDKLDATGYFACDFFNVDPNFGSNVQLKQLVDGAHQRGLYV------FL 184 Query: 165 VSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVV 224 V + P + + V D P+ + + V Sbjct: 185 DGVFGHVNKVG----VSKPSPEGRLPALKSGGAGYPGQLV-DYSQPQSLAYFKEVARYWV 239 Query: 225 SRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKS 284 +Y +DG + D + R +E R + W ++ +V Sbjct: 240 EQYGIDGWRLDQAYQLGLDDWRAIRSEVERASAARKTAGQQW--GTLGYMVGEVWKGADE 297 Query: 285 IK 286 I+ Sbjct: 298 IR 299 >UniRef50_A4AQ95 Putative uncharacterized protein n=2 Tax=Bacteroidetes RepID=A4AQ95_9FLAO Length = 373 Score = 99.1 bits (245), Expect = 3e-19, Method: Composition-based stats. Identities = 48/406 (11%), Positives = 100/406 (24%), Gaps = 98/406 (24%) Query: 71 PVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMT 130 V + + D L+ GI+ + ++ P Sbjct: 22 KVKIPVYAWMGGPGEATDSVLKANFDDLKAKGIDGL----------MYNGGQNP------ 65 Query: 131 GKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYV 190 Y + ++ EA GM+ H W N K + + Sbjct: 66 ------ATYKRVGALVKEA---GMEFHTWIPTMVQGENPKIAKDLYAH----NRNGESAF 112 Query: 191 QHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTE-------SP 243 + ++ L P ++++ + V VDG+ D + + Sbjct: 113 EKPAYVNYY---KFLCPNKEGTYEFLSDMYGSVAEVEEVDGIHLDYIRFPDVILAEGLWD 169 Query: 244 GSRLNDNETYRKY-------------------------GGAFASKADWRRNNTQQLIAKV 278 L ++ + +Y +R + +++ ++ Sbjct: 170 KYGLVMDQEFPEYDYCYCEKCTSDFKELTGIDINEVEDPSQIQEWKQFRYDLITKMVNRL 229 Query: 279 SHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAP 338 S + G N + P S + + +W LD I P Sbjct: 230 SKVVHEK-----------GKVLNAAVFPGPSIAKKL-----VRQEWNKW----DLDAIYP 269 Query: 339 QIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPE-- 396 Y + + W V G+ G P+ N PE Sbjct: 270 MNY-------NDFYLKGPEWVGEVTKEEVAAVKGLKPIYSGLFICPNPENKTNENDPENH 322 Query: 397 --LKKQLDLN---DAVPEISGTILFREDYLNKPQTQQAVSYLQSRW 437 L +++ +G LF + + + + Sbjct: 323 GLLPSEIETAIRTSMENGAAGICLFTPGRMKAEHWEAFEKAIYKDY 368 >UniRef50_C3A5Y1 Putative uncharacterized protein n=1 Tax=Bacillus mycoides DSM 2048 RepID=C3A5Y1_BACMY Length = 143 Score = 98.7 bits (244), Expect = 3e-19, Method: Composition-based stats. Identities = 31/125 (24%), Positives = 54/125 (43%), Gaps = 14/125 (11%) Query: 10 LTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDW 69 + ++R ++ + +L P + T +R +W+A+V +DW Sbjct: 1 MIMKRLVMMCYIVILFT-------PFSFISPHSTYAEVNTTYKKHELRAVWIASVLNIDW 53 Query: 70 PPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLM 129 P + + Q+Q I LD ++ G+N V Q+KP A +PS PWS+ + Sbjct: 54 PSKTGL-------PIEKQKQEFIRLLDDVKNTGMNAVVVQIKPTADAFYPSNYGPWSEYI 106 Query: 130 TGKIG 134 TG G Sbjct: 107 TGTQG 111 >UniRef50_C3Y3M5 Putative uncharacterized protein n=3 Tax=Branchiostoma floridae RepID=C3Y3M5_BRAFL Length = 399 Score = 98.7 bits (244), Expect = 3e-19, Method: Composition-based stats. Identities = 63/373 (16%), Positives = 110/373 (29%), Gaps = 93/373 (24%) Query: 51 QSSQPMRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQV 110 +R WL+ L+ GIN V+ V Sbjct: 104 SPYADVRATWLSR------------------YDHATSLAETAATFATLKAKGINRVYLNV 145 Query: 111 KPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTK 170 G + S+ + D L + ++E K G++V AWF Sbjct: 146 WASGQIYFQSRTFESLGIRGFV------RDVLGWAVEEGQKNGIEVWAWFE--------- 190 Query: 171 PGTIRELNSTLSQQPASVYVQHRDWIR-TSGDRFVLDPGIPEVQDWITSIVAEVVSRY-P 228 G S+ + S V WI+ +G+ + +D G +V D++ ++ + V Y Sbjct: 191 YGLKACWGSSPTVTVFSNKVYGLGWIKGQAGEYWWMDAGNTQVLDFLAGMMQDAVDNYPG 250 Query: 229 VDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPG 288 + GVQ DD+F N ++ +VS + Sbjct: 251 LAGVQLDDHFVQPW---------------QLGTGLVATMTNAASHILGQVSGRV------ 289 Query: 289 VEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQ--GLLDYIAPQIYWPFSR 346 S P+ + Y+ D W G +Y PQIY Sbjct: 290 --------------SLSPIAPPSLSLTNYN---VDWASWARDDIGFHEY-VPQIYRE--- 328 Query: 347 SAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDA 406 A+ ++ V +T+L G+ G P+ Q+ ++ Sbjct: 329 DASVFNTDLDRVMSEV--GKTKLVPGLRCIGSGSPTTYSA-----------LSQM-ISRC 374 Query: 407 VPEISGTILFRED 419 E G ++ Sbjct: 375 EAEGVGYSVWYSR 387 >UniRef50_Q045F6 Trehalose-6-phosphate hydrolase n=5 Tax=Lactobacillales RepID=Q045F6_LACGA Length = 560 Score = 98.3 bits (243), Expect = 5e-19, Method: Composition-based stats. Identities = 46/343 (13%), Positives = 105/343 (30%), Gaps = 58/343 (16%) Query: 68 DWPPVSSVN------ISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSK 121 DW + V ++ +I+KLD+++ LG N ++ T ++ S Sbjct: 8 DWKKKAIVYEAYVQSFNDSDGDGIGDLPGLIEKLDYIKNLGANVIWL------TPIFKSP 61 Query: 122 ILPWSDLMTGKIGENPGY---DPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELN 178 ++ + NP Y + + +L +AH+R +K+ S K + N Sbjct: 62 LVDNGYDIADYEAINPIYGSMNDFKKLLQQAHERDLKIVMDLVVNHTSDQHKWFKESKKN 121 Query: 179 STLSQQPASVYVQHRD---------------------------WIRTSGDRFVLDPGIPE 211 ++ ++ + + L+ P+ Sbjct: 122 RNNKYSDYYIWRDPKEDGSAPTNLGSAFGGSAWTYVPERNQYYLHLFAKQQPDLNWDNPQ 181 Query: 212 VQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLN----DNETYRKYGGAFASKADWR 267 V++ I ++ + VDG + D + P + DN+ Y Y A+ Sbjct: 182 VRNDIYQMMKFWL-DMGVDGFRMDSISFISKPSKFEDAPLEDNKEYGAYYYGSANGP--- 237 Query: 268 RNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHD--PLGSDTRGAAAYDESYADTR 325 + + + +++ + S + G +P R P + ++ + D Sbjct: 238 --HIHEYLREMNQKVLSKYDVISIGETPHTTAREARLFVEPDRHELDMVFQFEHMHVDYG 295 Query: 326 RWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTR 368 + D S + W ++ + Sbjct: 296 EFGR--YSDVTFKMS--DLRNSMTSWQNDLSWNSNYLGNHDQP 334 >UniRef50_A0Z097 Putative uncharacterized protein n=2 Tax=Oscillatoriales RepID=A0Z097_9CYAN Length = 533 Score = 98.3 bits (243), Expect = 5e-19, Method: Composition-based stats. Identities = 42/227 (18%), Positives = 82/227 (36%), Gaps = 25/227 (11%) Query: 69 WPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWP--SKILPWS 126 W ++ + Q A+ LD L G N V+ +V DG L P PW Sbjct: 89 WLKDQAIWLR--LYPCDAQPGAIDQILDDLVNRGYNKVYLEVFYDGQVLLPVSDNNTPWQ 146 Query: 127 DLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPA 186 ++ E D + + +RG++V+AW + ++ + + + L++ Sbjct: 147 SVLRSPGTETV--DLFAEAVQKGRRRGLEVYAW--AFLLNYGYTYTLLPDRQNVLARNGE 202 Query: 187 SVYV--------QHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYF 238 D+ + ++ +DP P+ + +++ ++SR GV FD Sbjct: 203 GETTVTAIAGGSNSDDFGESYTNQGFVDPYNPQARQDYQTLLNAILSR-RPQGVLFDYVR 261 Query: 239 YTESPGSRLNDNETYRK--YGGAFASKADWRRNN------TQQLIAK 277 Y + G + YG A R NN ++ + + Sbjct: 262 YPKGLGGASVAAKVKDLWIYGEASQQAFLQRANNDRGQEFIRRFLRQ 308 >UniRef50_C7TI05 Neopullulanase (GH13) n=42 Tax=Lactobacillus RepID=C7TI05_LACRL Length = 583 Score = 98.3 bits (243), Expect = 5e-19, Method: Composition-based stats. Identities = 40/277 (14%), Positives = 78/277 (28%), Gaps = 54/277 (19%) Query: 71 PVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMT 130 P + Q ++D LD LQ LGIN ++F + + + ++ Sbjct: 167 PWRPTDHPGREDYYGGDLQGVLDHLDDLQALGINGLYFCPIFTAASNHKYDTIDYLNVDP 226 Query: 131 GKIGENPGYDPLQFMLDEAHKRGMKVH--AWFNPYRVSVNTKPGTIRELNSTLSQQPASV 188 + ++ AH+RGM+V A FN +R ++ + Sbjct: 227 AFGDKV----LFAKLIQAAHQRGMRVMLDAVFNHMGFGSMQWQDVLRNGEASRFASWFHI 282 Query: 189 YVQH---------------RDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQ 233 Y D L+ PEVQ+++ ++ + + +D + Sbjct: 283 YQTPVTPFHNPLKNAGQPQYDTFAFEEKMPKLNTANPEVQEYLLTVATYWIKTFDIDAWR 342 Query: 234 FDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGV 293 D + + T+ +IKP Sbjct: 343 LDVANEVDHH------------------------------FWKRFYATVTAIKPDFYV-- 370 Query: 294 SPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQ 330 VW G + G Y + + ++ Sbjct: 371 -LGEVWHRAQPWLNGDEFSGVMNYPFTQQIEDHFFKR 406 >UniRef50_C1XX00 Glycosidase n=1 Tax=Meiothermus silvanus DSM 9946 RepID=C1XX00_9DEIN Length = 481 Score = 98.0 bits (242), Expect = 6e-19, Method: Composition-based stats. Identities = 38/211 (18%), Positives = 75/211 (35%), Gaps = 16/211 (7%) Query: 68 DWPPVSSVNISNPTSRARV--QQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPW 125 + PP + PT + +I LD++ LG N ++ + Sbjct: 27 NDPPGTEPWGRAPTRDNFFGGDLEGIIQGLDYIADLGCNALYLTPIFKAATNHKYDTYDY 86 Query: 126 SDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAW---------FNPYRVSVNTKPGTIRE 176 + G++ +D L + E +RGM++ F P+R + G+ Sbjct: 87 FQIDPHF-GDDATFDRL---VAEVKRRGMRLVLDGVFNHCGVGFAPFRDLLEQGEGSPYR 142 Query: 177 LNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDD 236 T P + + G L+ P+V+ ++ +V + R +DG + D Sbjct: 143 DWFTPYSFPLKPELPNYATCGGVGWLPRLNTRNPQVEAFVHEVVLHWLER-GIDGWRMDV 201 Query: 237 YFYTESPGSRLNDNETYRKYGGAFASKADWR 267 + E+P + +Y A+ +WR Sbjct: 202 AYEIETPFWQRLRQAVKARYPEAYLVAEEWR 232 >UniRef50_A8UCA2 Oligo-1,6-glucosidase n=2 Tax=Carnobacterium sp. AT7 RepID=A8UCA2_9LACT Length = 562 Score = 98.0 bits (242), Expect = 7e-19, Method: Composition-based stats. Identities = 43/264 (16%), Positives = 88/264 (33%), Gaps = 46/264 (17%) Query: 60 WLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWP 119 W V +P + + + +I ++D++Q +G+N ++ + Sbjct: 9 WKNAVVYQIYPK----SFQDTNDDGIGDLKGIIKQMDYIQSIGVNMIWLNPV------FV 58 Query: 120 SKILPWSDLMTGKIGENPGYDPLQFM---LDEAHKRGMKV------------HAWF---- 160 S + + + + + M ++EAHKRG+KV H WF Sbjct: 59 SPQIDNGYDVANYYAIDDSFGTMADMEKVIEEAHKRGIKVMMDFVLNHTSDQHPWFQEAL 118 Query: 161 ----NPYRVSVNTKP--GTIRELNSTLSQQPASVYVQHR-----DWIRTSGDRFVLDPGI 209 N YR + G N+ S SV+ + + + + L+ Sbjct: 119 KGPGNLYRDYYIWQKATGKRSVPNNWGSFFGGSVWEKEPLGESFYFHLFAKEMPDLNWEN 178 Query: 210 PEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRN 269 PEV+ + + + +D ++ D + + + D E G ++ N Sbjct: 179 PEVRMAMADCANFWLDK-GIDALRLDAFIHVDKE-EGFPDVEITN--GAETELAENYYAN 234 Query: 270 --NTQQLIAKVSHTIKSIKPGVEF 291 + + S I+ P V Sbjct: 235 LPKVTDYMQEFSQRIRKNHPMVFL 258 >UniRef50_Q0HUK8 Alpha amylase, catalytic region n=11 Tax=Bacteria RepID=Q0HUK8_SHESR Length = 540 Score = 97.6 bits (241), Expect = 8e-19, Method: Composition-based stats. Identities = 45/299 (15%), Positives = 93/299 (31%), Gaps = 52/299 (17%) Query: 60 WLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWP 119 W V +P ++ + + +I KLD++ L ++ ++ + Sbjct: 7 WRGAVIYQIYPR----SLLDTNGDGVGDLRGIITKLDYIASLNVDAIWISP------FFK 56 Query: 120 SKILPWSDLMTGKIGENPGYDPLQ---FMLDEAHKRGMKV------------HAWF---- 160 S + + ++ +P + +Q ++++AH+RG+KV HAWF Sbjct: 57 SPMADFGYDISDYREVDPLFGSMQDFDELIEKAHQRGIKVIIDQVLSHTSDQHAWFIESR 116 Query: 161 ----NP----YRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSG---DRFVLDPGI 209 NP Y + + GT + A + R + ++ Sbjct: 117 ESRTNPKADWYVWADPKEDGTAPNNWLAIFGGCAWEWEPRRQQYYLHNFLRSQPDINFHN 176 Query: 210 PEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRK------YGGAFASK 263 PEV+ + V + + VDG + D + ++ + + Sbjct: 177 PEVRQAVLDNVEFWLKK-GVDGFRLDAITFCYHDEQLRDNPAKPKDKRQGRGFSEDNPYA 235 Query: 264 ADWRRNNTQQ-----LIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAY 317 + N + I ++ I V G A + R AY Sbjct: 236 YQYHYYNNDRPQTILFIEELRQLINRYPGAVTLGEVSAEDSLAVMAAYTKGEDRLHMAY 294 >UniRef50_B4CZ69 Trehalose synthase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4CZ69_9BACT Length = 1074 Score = 97.6 bits (241), Expect = 8e-19, Method: Composition-based stats. Identities = 43/274 (15%), Positives = 84/274 (30%), Gaps = 56/274 (20%) Query: 76 NISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGE 135 + + +I++LD+ LG+ ++ +PS + + Sbjct: 19 TFMDSDGDGLGDFRGLIERLDYFTELGVTALWL------LPFYPSPLKDDGYDIADYFAV 72 Query: 136 NPGY---DPLQFMLDEAHKRGMKVH------------AWF-------------NPYRVSV 167 +P Y D + LD AH+RG++V AWF + Y + Sbjct: 73 HPSYGTLDDFRTFLDLAHERGLRVITELVLNHTSDQNAWFQRARRAPKGSPERDFYVWTD 132 Query: 168 NTKP---GTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVV 224 + + I + S + W R + L+ P V + + + Sbjct: 133 DPRKYKEARIIFKDFEPSNWTWDPVAKAYFWHRFYAHQPDLNFDNPLVHAELFRAI-DFW 191 Query: 225 SRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKS 284 VDG++ D Y Y + G + + T I K+ I + Sbjct: 192 MAMGVDGLRLDAIPYL------------YEREGTNCENLDE-----THDFIRKLRAHIDT 234 Query: 285 IKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYD 318 P ++ A W + + G+ + Sbjct: 235 KFPNRML-LAEANQWPEDAVEYFGNGDECHMEFH 267 >UniRef50_B8CY54 Alpha amylase n=2 Tax=Halothermothrix orenii RepID=B8CY54_HALOH Length = 515 Score = 97.6 bits (241), Expect = 8e-19, Method: Composition-based stats. Identities = 42/263 (15%), Positives = 84/263 (31%), Gaps = 54/263 (20%) Query: 69 WPPVSSVNISNPTSRARVQQQAMIDKLDHL--------QRLGINTVFFQVKPDGTALWPS 120 + + + + + +I+KLD+L LG+N ++ + Sbjct: 34 YYEIFVRSFYDSDGDGIGDLKGIIEKLDYLNDGDPETIADLGVNGIWLMPIFKSPSYHGY 93 Query: 121 KILPWSDLMTGKIGENPGYDPLQF---MLDEAHKRGMKV------------HAWFNPYRV 165 + + + NP Y L+ +++ AH+RG+KV H WF Sbjct: 94 DVTDYYKI-------NPDYGTLEDFHKLVEAAHQRGIKVIIDLPINHTSERHPWFLKASR 146 Query: 166 SVNTKPGTIRELNSTLSQQPASVYVQHRDWIR---------TSGDRFVLDPGIPEVQDWI 216 N++ + + R W L+ PEVQ+ + Sbjct: 147 DKNSEYRDYYVWAGPDTDTKETKLDGGRVWHHSPTGMYYGYFWSGMPDLNYNNPEVQEKV 206 Query: 217 TSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRN----NTQ 272 I + + VDG + D + P +Y F +R+ Sbjct: 207 IEIAKYWLKQ-GVDGFRLDGAMHIFPP----------AQYDKNFTWWEKFRQEIEEVKPV 255 Query: 273 QLIAKVSHTIKSIKPGVEFGVSP 295 L+ +V +++ P ++G Sbjct: 256 YLVGEVWDISETVAPYFKYGFDS 278 >UniRef50_A4FBJ1 Putative uncharacterized protein n=1 Tax=Saccharopolyspora erythraea NRRL 2338 RepID=A4FBJ1_SACEN Length = 501 Score = 97.6 bits (241), Expect = 9e-19, Method: Composition-based stats. Identities = 56/348 (16%), Positives = 106/348 (30%), Gaps = 68/348 (19%) Query: 94 KLDHLQRLGINTVFFQVKPD-GTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKR 152 L+ ++ I+TV +K + G + S + M +IG GY + +D+ H Sbjct: 202 VLEMARQGRIDTVELDIKDESGEVPYDSAV-----PMANQIGAVKGYYNARQAVDQLHGM 256 Query: 153 GMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQ--HRDWIRTSGDRFVLDPGIP 210 G++V + I S P V + W G + P Sbjct: 257 GVRVVGRLVAF-------KDPILGEASWRGGHPERVVQTAGGQPWTGGYGGFAFTNFADP 309 Query: 211 EVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNN 270 V+ + I E S D V +D + + ++ G + Sbjct: 310 VVRQYNIDIATEAASL-GFDDVLYDYVRRPDGAIEQ-------MRFPGLTTTPE----AG 357 Query: 271 TQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRWVEQ 330 + + ++S G G S G+ NR + D R+ + Sbjct: 358 IADFLRQTQPAVRSR--GALLGASVFGISVNRP--------------TQIAQDIRQMAQ- 400 Query: 331 GLLDYIAPQIYWPF---------SRSAARYDVL---AKWWADVVKPTRTRLYIGIAFYKV 378 DYIAP +Y Y+++ +A V+ T ++ + + + Sbjct: 401 -YTDYIAPMVYPSHWGPGEFGVADPDTQPYEIVRNSLAEFAKAVEGTDVQIIPWLQDFSL 459 Query: 379 GEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQT 426 G G E+ Q+ + +L+ + Q Sbjct: 460 GAS----------YGPAEVAAQIRAAGD-GGMPSFLLWAANCRYHDQA 496 >UniRef50_UPI0001C17075 Alpha amylase, catalytic region protein n=4 Tax=Cyanobacteria RepID=UPI0001C17075 Length = 491 Score = 97.6 bits (241), Expect = 9e-19, Method: Composition-based stats. Identities = 39/253 (15%), Positives = 74/253 (29%), Gaps = 50/253 (19%) Query: 78 SNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENP 137 +++KLDH++ LG+N ++F + + + G Sbjct: 59 PTLQGYKGGDLWGVLEKLDHIENLGVNAIYFTPIFQSGSNHRYHTHDYYQVDPLLGGN-- 116 Query: 138 GYDPLQFMLDEAHKRGMKV-------HAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYV 190 + +L+EAHKR +K+ HA + + G Q + Sbjct: 117 --GAFRELLNEAHKRNIKIVLDGVFNHASRGIFFFHDILENGPNSPWVDWFKIQDWPLAP 174 Query: 191 QHRDWIRTSGDR------FVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPG 244 D V + P+V+++I I + + +DG + D ++PG Sbjct: 175 YTNDAPANYESWADIRSLPVFNHDHPDVREYIMQIAEYWIK-FGIDGWRLDVPNEIKTPG 233 Query: 245 SRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSH 304 + +K+I P VW + Sbjct: 234 -----------------------------FWQEFRERVKAINPDAYI---VGEVWVDSRE 261 Query: 305 DPLGSDTRGAAAY 317 G+ G Y Sbjct: 262 WLDGTQFDGVMNY 274 >UniRef50_B0C6V7 Putative uncharacterized protein n=3 Tax=Cyanobacteria RepID=B0C6V7_ACAM1 Length = 522 Score = 97.2 bits (240), Expect = 1e-18, Method: Composition-based stats. Identities = 36/185 (19%), Positives = 69/185 (37%), Gaps = 13/185 (7%) Query: 68 DWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSD 127 +WP ++ + + LD + G N V+ +V +G L P+ P + Sbjct: 92 NWPQNQAIWLRLHECDLE--PGVLDTLLDRIVSRGYNQVYVEVFYNGRVLLPAANNPTTW 149 Query: 128 LMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPA- 186 + + D L + + RG+KV+AW + ++ + G + NS L++ Sbjct: 150 SSEIRNPKYANRDLLAETIKKGRARGLKVYAWM--FSLNYGHQYGQRSDRNSVLARNGQG 207 Query: 187 -------SVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFY 239 + + DR +DP + + ++ ++ R DGV FD Y Sbjct: 208 KTSLTLLDYADPNINLDNGDIDRAFVDPYSAQARQDYARMLQAILQR-KPDGVLFDYIRY 266 Query: 240 TESPG 244 G Sbjct: 267 PRQTG 271 >UniRef50_C1XGB4 Glycosidase n=2 Tax=Meiothermus RepID=C1XGB4_MEIRU Length = 715 Score = 97.2 bits (240), Expect = 1e-18, Method: Composition-based stats. Identities = 41/275 (14%), Positives = 79/275 (28%), Gaps = 52/275 (18%) Query: 56 MRGIWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGT 115 W L + ++++L HL+ LG+N ++F D Sbjct: 260 FDQTWSGRPPYLSRWSDPPGDYHCCQQYYGGDLAGVLERLPHLRALGVNLIYFNPLFDSG 319 Query: 116 ALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIR 175 + + + L+ +L EA ++G++V F P + + Sbjct: 320 SAHGYDTHDYVRVSPKFGDNA----LLKRLLAEARRQGIRVIFDFVPNHTGLGH--WAFQ 373 Query: 176 ELNSTLSQQPASVYVQHRDWIRTSGDR------------FVLDPGIPEVQDWITSIVAEV 223 ++ + + R W T GD L+ PEVQD++ + Sbjct: 374 DVVRKGPESRYWDWYFIRRWPFTPGDGRAYVGWADLGSLPKLNTANPEVQDYLIRVSRFW 433 Query: 224 VSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIK 283 + + DG++ D + + + +K Sbjct: 434 L-NFGFDGIRVDVANEIS------------------------------TEFVQRWRAELK 462 Query: 284 SIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYD 318 ++KP V VW R G Y Sbjct: 463 ALKPEVYL---VGEVWDLRPQYLQGDQFDSLMNYT 494 >UniRef50_B1YKQ1 Alpha amylase catalytic region n=2 Tax=Exiguobacterium RepID=B1YKQ1_EXIS2 Length = 509 Score = 97.2 bits (240), Expect = 1e-18, Method: Composition-based stats. Identities = 28/206 (13%), Positives = 64/206 (31%), Gaps = 10/206 (4%) Query: 59 IWLATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALW 118 I + D N +P + + +LD+++ G +++ T ++ Sbjct: 36 IMVDRFENGDKSNDLEANPDDPKAFQGGDLAGVTKRLDYIKDQGFTSIWL------TPIF 89 Query: 119 PSKILPWSDLMTG-KIGENPGYDP---LQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTI 174 ++ + T +P + + ++ EAHKR +KV + N Sbjct: 90 KNRPNGYHGYWTDDYYEIDPHFGTKEEFKTLVKEAHKRDLKVVLDLVVNHLGPNHPLVKE 149 Query: 175 RELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQF 234 + Q + Q D + EV ++ + V +DG + Sbjct: 150 KPDWFHKEQTIMNWNNQAEVENNWLFDLPDFNTENKEVVKYLVDVANYWVDETGIDGYRL 209 Query: 235 DDYFYTESPGSRLNDNETYRKYGGAF 260 D + + +++ F Sbjct: 210 DTVRHVPPAFWKTFIPAVKKEHPDLF 235 >UniRef50_UPI0001C42483 alpha amylase catalytic region n=1 Tax=Bacillus pseudofirmus OF4 RepID=UPI0001C42483 Length = 891 Score = 96.8 bits (239), Expect = 1e-18, Method: Composition-based stats. Identities = 46/265 (17%), Positives = 80/265 (30%), Gaps = 54/265 (20%) Query: 83 RARVQQQAMIDKLDHLQRLGINTVFFQVKPDG---TALWPSKILPWSDLMTGKIGENPGY 139 + + KLD+++ LG++T++ +G P+ + Sbjct: 392 WMGGDLEGVHAKLDYIEELGVDTIWLSPVFEGPYSHGYHPTDFMSVDQNFGTLK------ 445 Query: 140 DPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTS 199 L+ ++DEAH R MKV F P S N S T Sbjct: 446 -VLKELIDEAHDRDMKVIYDFVPNHTSSEHPFFQDALENGEDSPYYDWYTFYEDGTYETF 504 Query: 200 GDR---FVLDPGIPEVQDWITS-IVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRK 255 + PE +D++ + +V + DG++ D Sbjct: 505 YGIEELPQFNNDHPEARDYMLNEVVPFWLEELEFDGLRLDYAK----------------- 547 Query: 256 YGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEF----------GVSPAGVWRNRSHD 305 G +++ D+R +KSI P + S AG Sbjct: 548 -GPSYSFWVDFR------------DKVKSIDPDMYVFGEVWDSREKISSYAGKLDGSLDF 594 Query: 306 PLGSDTRGAAAYDESYADTRRWVEQ 330 +G A+D S +VE+ Sbjct: 595 GFHDTFKGTFAFDGSMQSVVNYVEE 619 >UniRef50_A3DDK1 Alpha amylase, catalytic region n=3 Tax=Clostridium thermocellum RepID=A3DDK1_CLOTH Length = 575 Score = 96.8 bits (239), Expect = 1e-18, Method: Composition-based stats. Identities = 36/245 (14%), Positives = 72/245 (29%), Gaps = 49/245 (20%) Query: 84 ARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQ 143 + +ID+ DHL +LG++ V+ + ++ + ++ + L+ Sbjct: 154 HGGNIKGIIDRFDHLVKLGVDVVYLNPIFKSESYHRYDVVDYYEIDPMFGSKEE----LR 209 Query: 144 FMLDEAHKRGMKVHAW---------FNPYRVSVNTKPGTIRELNSTLSQQPASVY-VQHR 193 ++D HK G+KV F +R V + ++ P Y + Sbjct: 210 ELMDLCHKNGIKVIFDGVFNHSGDKFFAFRDVVEKGEKSKYANWYFINSFPVQGYPRPNY 269 Query: 194 DWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETY 253 + G L+ G PE + +V + VDG + D Sbjct: 270 ECFSFYGGMPKLNTGNPETAKYFLDVVKYWTVEFGVDGWRLDA----------------- 312 Query: 254 RKYGGAFASKADWRRNNTQQ-LIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTR 312 + + K+ +K + V ++ S G Sbjct: 313 --------------ADEVDRKFWRKLRDMLKDLNKDVVL---IGEIFDEASSWLWGDQFD 355 Query: 313 GAAAY 317 Y Sbjct: 356 SVINY 360 >UniRef50_A6L979 Putative uncharacterized protein n=6 Tax=Bacteroidales RepID=A6L979_PARD8 Length = 369 Score = 96.8 bits (239), Expect = 1e-18, Method: Composition-based stats. Identities = 45/352 (12%), Positives = 86/352 (24%), Gaps = 63/352 (17%) Query: 121 KILPWSDLMTGKIGENPGYDP--LQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELN 178 W + N G+D +Q AH GM+ HAW P + Sbjct: 44 DFSKWHAHGVDGMCYNAGHDTEKIQRAAKAAHANGMEYHAWI-PAMLQHGLDSTLYAVNR 102 Query: 179 STLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYF 238 S YV L P +++ + +V VD + D Sbjct: 103 KGESAYSVQAYVP---------YYKCLCPNQEGTAEFLLDLYGKVADIPEVDYIHLDYIR 153 Query: 239 YTESP-------------------GSRLNDNETYRKYGGAF-------------ASKADW 266 Y + ++ + A A + Sbjct: 154 YVDVILARGLWEKYGLVMDEEYPTADYCYCDKCVADFKAATGIDIKSVEDPSKCEEWAQF 213 Query: 267 RRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRR 326 R + +L+ ++ + V V P + A + + Sbjct: 214 RCDLITKLVNCIADEVHGKGKKVSAAVFPG---------------PDSHAKWMVRQEWNK 258 Query: 327 WVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEP 386 W +D P Y F A + + +Y G+ + + Sbjct: 259 W----NIDAFFPMNYNDFYLEDASWLAPIVKEEVAAVQGKKPVYSGLFICEDWQNKANIK 314 Query: 387 DWMINGGVPELKKQLDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQSRWG 438 D +G +P ++ +G LF + + + + Sbjct: 315 DPEGHGLIPSEIEEAVRGSMENGAAGVALFTPGNMTDEHWKAFDKAIHQPYT 366 >UniRef50_Q1WRI4 Oligo-1,6-glucosidase n=7 Tax=Firmicutes RepID=Q1WRI4_LACS1 Length = 552 Score = 96.8 bits (239), Expect = 1e-18, Method: Composition-based stats. Identities = 51/341 (14%), Positives = 112/341 (32%), Gaps = 56/341 (16%) Query: 68 DWPPVSSVN------ISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSK 121 DW V ++ Q +I KLD+L+ LG+N ++ T ++ S Sbjct: 3 DWKKKGVVYEIYVQSFNDTNDDGIGDIQGVIQKLDYLKELGVNILWL------TPIFESP 56 Query: 122 ILPWSDLMTGKIGENPGYDPL---QFMLDEAHKRGMKV------------HAWF------ 160 ++ ++ N Y + + ++ ++H +K+ H WF Sbjct: 57 LVDNGYDISNYQSINNIYGTMEDVEELIKKSHDYNIKIVMDLVVNHTSDQHKWFQESKKS 116 Query: 161 ------NPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRD---WIRTSGDRFVLDPGIPE 211 Y G+ + + A YV RD + ++ L+ E Sbjct: 117 KDNKYSEYYIWRDPRPDGSAPTNHGSAFGGSAWEYVPERDQYYLHLFAKEQPDLNWDNKE 176 Query: 212 VQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLN----DNETYRKYGGAFASKADWR 267 +++ I ++++ + +DG + D + P N DN+ Y Y A+ Sbjct: 177 LRNEIYNMMSFWAEK-GIDGFRMDSISFISKPQKFTNAPVVDNKEYGAYYYGIANG---- 231 Query: 268 RNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDESYADTRRW 327 NN + + +++ + S + G +P S ++ ++ + D ++ Sbjct: 232 -NNIHRYLKEMNERVLSKYDLITIGETPHTNTEEGSKFLDSNELDMIFQFEHMHVDYGKY 290 Query: 328 VEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTR 368 D S + W ++ + Sbjct: 291 GR--YSDVSFKMS--DLRESINHWQENLTWNSNYLGNHDQP 327 >UniRef50_B7R259 Pullulanase type II, GH13 family n=1 Tax=Thermococcus sp. AM4 RepID=B7R259_9EURY Length = 775 Score = 96.8 bits (239), Expect = 1e-18, Method: Composition-based stats. Identities = 37/247 (14%), Positives = 72/247 (29%), Gaps = 50/247 (20%) Query: 83 RARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPL 142 + +KLD+L LG+ ++ ++ + + E L Sbjct: 359 YFGGDIAGITEKLDYLSSLGVKLIYLNPIFLSGSVHGYDTYDYYRVDPKFGTEAE----L 414 Query: 143 QFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDR 202 + L EAH+RG++V F P + + ++ + P + + W GD Sbjct: 415 KLFLTEAHRRGIRVIFDFVPDHSGIGAE--QFLDVWKNGRKSPYWHWYFIKRWPFKLGDG 472 Query: 203 ------------FVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDN 250 L+ PEV+D++ + + + DG++ D +P +N Sbjct: 473 SAYEGWWGLGSLPKLNTTNPEVKDYLFGAAMKWL-DFGFDGIRVD------TPADLVNA- 524 Query: 251 ETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSD 310 + + IK P +W G Sbjct: 525 ---------------------DEFFREFRERIKERHPDAYL---VGEIWTLSPEWVRGDK 560 Query: 311 TRGAAAY 317 Y Sbjct: 561 FDSLMNY 567 >UniRef50_Q9HHB0 Pullulanase n=1 Tax=Desulfurococcus mucosus RepID=Q9HHB0_9CREN Length = 686 Score = 96.8 bits (239), Expect = 1e-18, Method: Composition-based stats. Identities = 36/248 (14%), Positives = 71/248 (28%), Gaps = 50/248 (20%) Query: 83 RARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPL 142 + + +KLD+L+ LG+ ++ ++ + + L Sbjct: 206 YFGGDLKGVTEKLDYLKELGVGLIYLNPIFLSGSVHGYDTYDYYTVDPKFGTLE----DL 261 Query: 143 QFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDW------- 195 + +++EAHKRG+KV F P V + N S + V + Sbjct: 262 KTLINEAHKRGIKVIFDFVPDHVGLGFWAFQDVYRNGRNSTYWSWFIVYKWRFKLGDPTA 321 Query: 196 ---IRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNET 252 G L+ EV+ ++ ++ +S DG++ D Sbjct: 322 YKCWWGIGSLPQLNVLNTEVRQYLINVALYWLSI-GFDGLRIDTPL-------------- 366 Query: 253 YRKYGGAFASKADWRRNNTQ--QLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSD 310 + ++ +KS P +W R G+ Sbjct: 367 ----------------DVIDSESFFRELREAVKSRYPDAYI---VGEIWDYRPEWLRGNA 407 Query: 311 TRGAAAYD 318 Y Sbjct: 408 FDSLMNYY 415 >UniRef50_D1CHN7 GTP-binding protein n=1 Tax=Thermobaculum terrenum ATCC BAA-798 RepID=D1CHN7_THET1 Length = 410 Score = 96.8 bits (239), Expect = 2e-18, Method: Composition-based stats. Identities = 48/335 (14%), Positives = 97/335 (28%), Gaps = 39/335 (11%) Query: 86 VQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFM 145 + ++ +N + VK D +W SK+ + + LQ Sbjct: 101 GDPDVYRRFMRLIEETELNAIVINVKNDDGKVWTSKVPLARQIGASYEDFH-----LQEF 155 Query: 146 LDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSG-DRFV 204 + + H+RG+ V F +R + +P R + Sbjct: 156 VRDMHRRGIYVIGRFTTFRDPT------------LATARPDMAVRDIRGGVWEDNKGHRW 203 Query: 205 LDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYTESPGSRLNDNETYRKYGGAFASKA 264 +DP +V + ++ E+ + +D +QFD + D + Sbjct: 204 VDPFNKKVWRYFGDLLEEIAAS-GIDEIQFDYVRFPVDG-----DLSKVEYLTPSTRYN- 256 Query: 265 DWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSH-DPLGSDTRGAAAYDESYAD 323 R + + + S I+ K V G+ G A Y + Y+ Sbjct: 257 --RPDTIEAFLRYASSRIRPHK--VFISADTYGLTVWSEKEQGTGQVLERLAPYLDYYSP 312 Query: 324 TRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSK 383 D+ AP SA Y+++ + + R + + Sbjct: 313 MI------YPDHFAPGTGGYRIPSAHPYEIIYESVVRAKRRLRGDNV--NTLVRPYLSAF 364 Query: 384 IEPDWMINGGVPELKKQLDLNDAVPEISGTILFRE 418 + + G+P+ Q + G I + Sbjct: 365 PDTQYGQPFGLPQWLAQKRAAED-AGAHGWIYWDA 398 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.312 0.123 0.337 Lambda K H 0.267 0.0374 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 2,439,755,524 Number of Sequences: 3077464 Number of extensions: 91188983 Number of successful extensions: 352049 Number of sequences better than 1.0e-01: 250 Number of HSP's better than 0.1 without gapping: 782 Number of HSP's successfully gapped in prelim test: 2211 Number of HSP's that attempted gapping in prelim test: 345009 Number of HSP's gapped (non-prelim): 3691 length of query: 439 length of database: 1,040,396,356 effective HSP length: 132 effective length of query: 307 effective length of database: 634,171,108 effective search space: 194690530156 effective search space used: 194690530156 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.5 bits) S2: 94 (40.9 bits)