BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (254 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P27840 Uncharacterized protein yigE n=68 Tax=Enterobact... 282 7e-75 UniRef50_B2II06 Putative uncharacterized protein n=5 Tax=Rhizobi... 268 1e-70 UniRef50_A9W4Y6 Putative uncharacterized protein n=4 Tax=Methylo... 241 1e-62 UniRef50_B9JX75 Putative uncharacterized protein n=1 Tax=Agrobac... 239 9e-62 UniRef50_Q98NI9 Mlr0120 protein n=3 Tax=Alphaproteobacteria RepI... 233 4e-60 UniRef50_A9CIN9 Putative uncharacterized protein n=6 Tax=Proteob... 224 3e-57 UniRef50_Q0BWY7 Putative lipoprotein n=1 Tax=Hyphomonas neptuniu... 222 5e-57 UniRef50_Q11X50 Putative uncharacterized protein n=2 Tax=Bactero... 222 1e-56 UniRef50_D2LB58 Putative uncharacterized protein n=1 Tax=Rhodomi... 220 4e-56 UniRef50_D0SJ31 Putative uncharacterized protein n=1 Tax=Acineto... 217 3e-55 UniRef50_Q26CZ6 Periplasmic protein n=1 Tax=Flavobacteria bacter... 217 3e-55 UniRef50_A9D6B9 Putative uncharacterized protein n=1 Tax=Hoeflea... 217 3e-55 UniRef50_C8PYM1 Putative uncharacterized protein n=1 Tax=Enhydro... 214 2e-54 UniRef50_Q1IX28 Periplasmic YigE-like protein n=3 Tax=Deinococcu... 214 2e-54 UniRef50_Q093S1 Putative uncharacterized protein n=1 Tax=Stigmat... 214 2e-54 UniRef50_C5CWT4 Periplasmic protein-like protein n=3 Tax=Bacteri... 211 2e-53 UniRef50_B9KP42 Periplasmic protein-like protein n=37 Tax=Rhodob... 211 3e-53 UniRef50_Q1QCK8 Periplasmic protein-like n=1 Tax=Psychrobacter c... 209 5e-53 UniRef50_A1B5U0 Putative uncharacterized protein n=1 Tax=Paracoc... 209 6e-53 UniRef50_D1AL67 Periplasmic protein-like protein n=1 Tax=Sebalde... 209 6e-53 UniRef50_Q1MEZ5 Conserved hypothetical exported protein n=45 Tax... 209 7e-53 UniRef50_Q0G184 Putative uncharacterized protein n=2 Tax=Auranti... 202 7e-51 UniRef50_A5EW16 Putative uncharacterized protein n=1 Tax=Dichelo... 201 3e-50 UniRef50_C7PF78 Putative uncharacterized protein n=1 Tax=Chitino... 197 2e-49 UniRef50_Q2NAA1 Putative uncharacterized protein n=3 Tax=Erythro... 197 3e-49 UniRef50_A5WGQ7 Periplasmic protein-like protein n=1 Tax=Psychro... 193 5e-48 UniRef50_B2HYZ5 Predicted periplasmic protein n=11 Tax=Acinetoba... 191 2e-47 UniRef50_C8N6Z2 Lipoprotein n=1 Tax=Cardiobacterium hominis ATCC... 189 6e-47 UniRef50_UPI0001744B4D hypothetical protein VspiD_05050 n=1 Tax=... 180 5e-44 UniRef50_Q5WVS5 Putative uncharacterized protein n=5 Tax=Legione... 165 1e-39 UniRef50_D1REU9 Putative uncharacterized protein n=1 Tax=Legione... 164 3e-39 UniRef50_D1CZ42 Putative uncharacterized protein n=1 Tax=Brucell... 160 3e-38 UniRef50_A9KDD2 Hypothetical exported protein n=6 Tax=Coxiella b... 153 5e-36 UniRef50_A5USB9 Putative uncharacterized protein n=2 Tax=Roseifl... 142 2e-32 UniRef50_A6LS70 Putative uncharacterized protein n=23 Tax=Clostr... 134 2e-30 UniRef50_C3QHD0 Exopolysaccharide biosynthesis protein n=2 Tax=B... 133 7e-30 UniRef50_D0WLU9 Putative uncharacterized protein n=1 Tax=Actinom... 133 7e-30 UniRef50_C9KSV8 N-acetylmuramoyl-L-alanine amidase/putative S-la... 132 1e-29 UniRef50_C2JZN3 N-acetylmuramoyl-L-alanine amidase/probable S-la... 132 2e-29 UniRef50_A0Q3C5 Conserved protein n=7 Tax=Clostridia RepID=A0Q3C... 130 4e-29 UniRef50_B7AQ96 Putative uncharacterized protein n=1 Tax=Bactero... 130 5e-29 UniRef50_C6XT14 Exopolysaccharide biosynthesis protein n=1 Tax=P... 130 6e-29 UniRef50_A0PY15 Conserved protein n=4 Tax=Clostridium RepID=A0PY... 130 6e-29 UniRef50_B2KU41 N-acetylmuramoyl-L-alanine amidase/putative S-la... 129 7e-29 UniRef50_C7TED9 N-acetylmuramoyl-L-alanine amidase n=2 Tax=Lacto... 129 9e-29 UniRef50_Q97FU3 Uncharaterized conserved protein, YOME B.subtili... 127 3e-28 UniRef50_B8I4Q1 Putative uncharacterized protein n=3 Tax=Clostri... 127 3e-28 UniRef50_A9B1E5 Putative uncharacterized protein n=1 Tax=Herpeto... 126 6e-28 UniRef50_C4G6X0 Putative uncharacterized protein n=2 Tax=Lactoba... 123 5e-27 UniRef50_Q892K3 N-acetylmuramoyl-L-alanine amidase/putative S-la... 123 6e-27 UniRef50_C0GEE0 Putative uncharacterized protein n=1 Tax=Dethiob... 122 9e-27 UniRef50_A9WEC1 Putative uncharacterized protein n=3 Tax=Chlorof... 122 9e-27 UniRef50_C1I4R7 Putative uncharacterized protein (Fragment) n=1 ... 122 1e-26 UniRef50_A4VXL8 Exopolysaccharide biosynthesis protein related t... 122 1e-26 UniRef50_C4FXK4 Putative uncharacterized protein n=1 Tax=Catonel... 120 3e-26 UniRef50_C2HB28 Exopolysaccharide biosynthesis protein n=4 Tax=E... 120 5e-26 UniRef50_C5S6T1 Putative uncharacterized protein n=1 Tax=Allochr... 119 7e-26 UniRef50_C6D6X3 Exopolysaccharide biosynthesis protein n=6 Tax=B... 119 1e-25 UniRef50_D1N9W8 Putative uncharacterized protein n=1 Tax=Victiva... 118 1e-25 UniRef50_Q73Q09 Putative uncharacterized protein n=1 Tax=Trepone... 118 1e-25 UniRef50_C2KZT9 Exopolysaccharide biosynthesis protein n=2 Tax=F... 117 2e-25 UniRef50_D1BL19 Putative uncharacterized protein n=4 Tax=Veillon... 117 4e-25 UniRef50_C8WTH1 N-acetylglucosamine-1-phosphodiester alpha-N-ace... 117 4e-25 UniRef50_B4CZJ8 Putative uncharacterized protein n=1 Tax=Chthoni... 116 7e-25 UniRef50_Q97FU6 Uncharaterized conserved protein, YOME B.subtili... 114 2e-24 UniRef50_C1CWE2 Putative LysM lysin domain protein, n=1 Tax=Dein... 113 4e-24 UniRef50_D2NR45 Exopolysaccharide biosynthesis protein related t... 112 9e-24 UniRef50_C8PNM8 Putative uncharacterized protein n=1 Tax=Trepone... 112 9e-24 UniRef50_B8FUP3 Putative uncharacterized protein n=2 Tax=Desulfi... 111 2e-23 UniRef50_B1BC21 Putative uncharacterized protein n=2 Tax=Clostri... 111 2e-23 UniRef50_UPI000178A82C copper amine oxidase domain protein n=1 T... 111 3e-23 UniRef50_Q1IXP5 Peptidoglycan-binding LysM domain-containing pro... 109 8e-23 UniRef50_C6J074 Copper amine oxidase domain-containing protein n... 109 9e-23 UniRef50_A0YND3 Putative uncharacterized protein n=1 Tax=Lyngbya... 109 1e-22 UniRef50_D0TN59 Predicted protein n=3 Tax=Bacteroides RepID=D0TN... 108 2e-22 UniRef50_B0TEY5 Putative uncharacterized protein n=1 Tax=Helioba... 108 2e-22 UniRef50_A7V127 Putative uncharacterized protein n=1 Tax=Bactero... 106 7e-22 UniRef50_B2J8B3 Putative uncharacterized protein n=1 Tax=Nostoc ... 106 7e-22 UniRef50_C0ZEQ6 Putative uncharacterized protein n=1 Tax=Breviba... 106 8e-22 UniRef50_A7GCS1 Putative uncharacterized protein n=12 Tax=Clostr... 106 1e-21 UniRef50_C3R3M8 Putative uncharacterized protein n=1 Tax=Bactero... 105 1e-21 UniRef50_Q4UP44 Putative uncharacterized protein n=4 Tax=Bacteri... 105 2e-21 UniRef50_C6IYX5 Putative uncharacterized protein n=1 Tax=Paeniba... 104 2e-21 UniRef50_Q8A0T0 Putative uncharacterized protein n=10 Tax=Bacter... 104 3e-21 UniRef50_UPI0001BC3362 hypothetical protein BcroD2_01243 n=1 Tax... 103 4e-21 UniRef50_B2UNL7 Putative uncharacterized protein n=1 Tax=Akkerma... 102 1e-20 UniRef50_C3R3L4 Putative uncharacterized protein n=2 Tax=Bactero... 102 1e-20 UniRef50_C1XS52 Predicted periplasmic protein (DUF2233) n=1 Tax=... 102 1e-20 UniRef50_B2V2N5 Putative uncharacterized protein n=8 Tax=Clostri... 102 2e-20 UniRef50_A7M0G9 Putative uncharacterized protein n=2 Tax=Bactero... 101 3e-20 UniRef50_UPI0001694670 hypothetical protein Plarl_22443 n=1 Tax=... 100 5e-20 UniRef50_Q8YKN4 All7259 protein n=2 Tax=Cyanobacteria RepID=Q8YK... 100 5e-20 UniRef50_C1ABL2 Putative uncharacterized protein n=1 Tax=Gemmati... 100 9e-20 UniRef50_UPI0001744905 hypothetical protein VspiD_09365 n=1 Tax=... 99 2e-19 UniRef50_B3CE38 Putative uncharacterized protein n=3 Tax=Bactero... 98 2e-19 UniRef50_C4ICA6 Peptidase, M56 family n=1 Tax=Clostridium butyri... 97 4e-19 UniRef50_C6PYU6 Putative uncharacterized protein n=1 Tax=Clostri... 97 7e-19 UniRef50_C6XXH4 Putative uncharacterized protein n=1 Tax=Pedobac... 96 8e-19 UniRef50_UPI0001746B2F hypothetical protein VspiD_16055 n=1 Tax=... 96 9e-19 UniRef50_B8J2Y6 Putative uncharacterized protein n=2 Tax=Desulfo... 96 1e-18 UniRef50_Q8YP57 All4343 protein n=5 Tax=Nostocaceae RepID=Q8YP57... 95 2e-18 UniRef50_A7LRK4 Putative uncharacterized protein n=1 Tax=Bactero... 95 2e-18 UniRef50_B2A8G9 Copper amine oxidase domain protein n=1 Tax=Natr... 95 2e-18 UniRef50_B3PTF7 Putative uncharacterized protein n=3 Tax=Rhizobi... 95 2e-18 UniRef50_A6TKB7 Exopolysaccharide biosynthesis protein n=1 Tax=A... 95 2e-18 UniRef50_A6L610 Putative uncharacterized protein n=1 Tax=Bactero... 95 3e-18 UniRef50_C5RID5 Putative uncharacterized protein n=1 Tax=Clostri... 94 3e-18 UniRef50_B7ASL4 Putative uncharacterized protein n=1 Tax=Bactero... 94 4e-18 UniRef50_C8WU56 N-acetylglucosamine-1-phosphodiester alpha-N-ace... 94 4e-18 UniRef50_C6D0A3 Exopolysaccharide biosynthesis protein n=1 Tax=P... 94 5e-18 UniRef50_A7LRK2 Putative uncharacterized protein n=1 Tax=Bactero... 94 5e-18 UniRef50_C6IP98 Putative uncharacterized protein n=2 Tax=Bactero... 93 8e-18 UniRef50_B4WS35 Putative uncharacterized protein n=1 Tax=Synecho... 93 9e-18 UniRef50_C6XT12 NHL repeat containing protein n=2 Tax=Pedobacter... 92 1e-17 UniRef50_B7DMS1 Copper amine oxidase domain protein n=3 Tax=Alic... 92 2e-17 UniRef50_B8CYN3 SpoIID/LytB domain protein n=1 Tax=Halothermothr... 92 2e-17 UniRef50_C4Z4Z5 Putative uncharacterized protein n=1 Tax=Eubacte... 92 2e-17 UniRef50_B4AZH7 Putative uncharacterized protein n=1 Tax=Cyanoth... 91 3e-17 UniRef50_C0ZFU4 Putative uncharacterized protein n=1 Tax=Breviba... 91 3e-17 UniRef50_C9RVV6 Putative uncharacterized protein n=3 Tax=Geobaci... 91 4e-17 UniRef50_A1HRE9 Exopolysaccharide biosynthesis protein n=1 Tax=T... 91 4e-17 UniRef50_B7H7U4 Putative uncharacterized protein n=27 Tax=Bacill... 90 4e-17 UniRef50_C4V4S8 Exopolysaccharide biosynthesis protein n=1 Tax=S... 90 4e-17 UniRef50_C5PL46 Exopolysaccharide biosynthesis protein n=2 Tax=S... 90 5e-17 UniRef50_B0BZE5 Putative uncharacterized protein n=1 Tax=Acaryoc... 90 6e-17 UniRef50_B5VVA8 S-layer domain protein n=3 Tax=Cyanobacteria Rep... 90 7e-17 UniRef50_C0WEQ2 Exopolysaccharide biosynthesis protein n=1 Tax=A... 90 7e-17 UniRef50_C6XWN0 Putative uncharacterized protein n=1 Tax=Pedobac... 89 1e-16 UniRef50_B4VX04 Putative uncharacterized protein n=1 Tax=Microco... 89 2e-16 UniRef50_O31980 SPBc2 prophage-derived uncharacterized protein y... 87 4e-16 UniRef50_B8HTR4 Putative uncharacterized protein n=1 Tax=Cyanoth... 87 4e-16 UniRef50_C4Z6E6 Putative uncharacterized protein n=1 Tax=Eubacte... 87 4e-16 UniRef50_C6JBU1 Putative uncharacterized protein n=1 Tax=Ruminoc... 87 4e-16 UniRef50_A7LVE9 Putative uncharacterized protein n=1 Tax=Bactero... 87 5e-16 UniRef50_A1SN25 Exopolysaccharide biosynthesis protein-like n=1 ... 87 5e-16 UniRef50_C9PX63 Putative uncharacterized protein n=1 Tax=Prevote... 87 5e-16 UniRef50_C6J7B9 Exopolysaccharide biosynthesis protein n=2 Tax=B... 87 5e-16 UniRef50_C6LDL7 Putative uncharacterized protein n=1 Tax=Bryante... 87 7e-16 UniRef50_C9KQW2 Putative secreted protein n=2 Tax=Veillonellacea... 87 7e-16 UniRef50_D1VRM0 Putative copper amine oxidase N-domain family n=... 87 8e-16 UniRef50_D2RLV8 N-acetylglucosamine-1-phosphodiester alpha-N-ace... 86 1e-15 UniRef50_A0YXN3 Putative uncharacterized protein n=1 Tax=Lyngbya... 86 1e-15 UniRef50_UPI0001BC335A hypothetical protein BcroD2_01203 n=1 Tax... 86 1e-15 UniRef50_B8HPJ4 Putative uncharacterized protein n=2 Tax=Cyanoth... 85 1e-15 UniRef50_UPI0000E45D54 PREDICTED: similar to N-acetylglucosamine... 85 2e-15 UniRef50_B7KAU9 Putative uncharacterized protein n=7 Tax=Chrooco... 85 2e-15 UniRef50_UPI0001C43112 hypothetical protein BpOF4_05820 n=1 Tax=... 85 3e-15 UniRef50_B0CAS6 Putative uncharacterized protein n=1 Tax=Acaryoc... 84 3e-15 UniRef50_UPI0001C3370C hypothetical protein UCYN_10670 n=1 Tax=c... 84 4e-15 UniRef50_Q9UK23 N-acetylglucosamine-1-phosphodiester alpha-N-ace... 84 4e-15 UniRef50_C0CND1 Putative uncharacterized protein n=1 Tax=Blautia... 84 5e-15 UniRef50_B1I1S0 N-acetylglucosamine-1-phosphodiester alpha-N-ace... 84 5e-15 UniRef50_B3RIP6 Putative uncharacterized protein (Fragment) n=2 ... 84 6e-15 UniRef50_C9LSB3 Putative secreted protein n=1 Tax=Selenomonas sp... 83 9e-15 UniRef50_P74396 Slr0280 protein n=1 Tax=Synechocystis sp. PCC 68... 83 9e-15 UniRef50_B9YC35 Putative uncharacterized protein n=2 Tax=Holdema... 83 1e-14 UniRef50_C6CV17 Exopolysaccharide biosynthesis protein n=1 Tax=P... 82 1e-14 UniRef50_B8HPB3 Putative uncharacterized protein n=1 Tax=Cyanoth... 82 1e-14 UniRef50_Q67T45 Putative uncharacterized protein n=1 Tax=Symbiob... 82 1e-14 UniRef50_A3DHF5 Ig-like, group 2 n=3 Tax=Clostridium thermocellu... 82 1e-14 UniRef50_B8G1I8 Peptidase M56 BlaR1 n=4 Tax=Desulfitobacterium h... 82 2e-14 UniRef50_UPI0001C16068 conserved hypothetical protein n=2 Tax=No... 82 2e-14 UniRef50_A0YL57 Putative uncharacterized protein n=1 Tax=Lyngbya... 82 3e-14 UniRef50_UPI00019088BB hypothetical protein RetlC8_25680 n=2 Tax... 81 3e-14 UniRef50_A4J956 Copper amine oxidase domain protein n=1 Tax=Desu... 81 4e-14 UniRef50_Q2BF40 Putative uncharacterized protein n=1 Tax=Bacillu... 80 4e-14 UniRef50_A8W171 Flagellar protein FliS n=1 Tax=Bacillus seleniti... 80 5e-14 UniRef50_D2V2G1 N-acetylglucosamine-1-phosphodiester alpha-N-ace... 80 5e-14 UniRef50_A6L611 Putative uncharacterized protein n=1 Tax=Bactero... 80 6e-14 UniRef50_Q8RCE6 Putative uncharacterized protein n=5 Tax=Thermoa... 80 9e-14 UniRef50_UPI000180BA0C PREDICTED: similar to predicted protein n... 80 9e-14 UniRef50_Q5N4C8 Putative uncharacterized protein n=2 Tax=Synecho... 80 9e-14 UniRef50_A9NEV6 Hypothetical surface-anchored protein n=1 Tax=Ac... 79 1e-13 UniRef50_B0C332 Putative uncharacterized protein n=2 Tax=Bacteri... 79 1e-13 UniRef50_A6TVJ8 Exopolysaccharide biosynthesis protein n=2 Tax=A... 79 2e-13 UniRef50_Q3AA51 Conserved domain protein n=1 Tax=Carboxydothermu... 78 2e-13 UniRef50_Q7X4R9 XcbC n=1 Tax=Neisseria meningitidis RepID=Q7X4R9... 78 2e-13 UniRef50_Q8DHU5 Tll1850 protein n=1 Tax=Thermosynechococcus elon... 78 3e-13 UniRef50_B1XK15 Putative uncharacterized protein n=1 Tax=Synecho... 78 3e-13 UniRef50_UPI0001C164F4 hypothetical protein CRD_01886 n=2 Tax=No... 78 4e-13 UniRef50_UPI00017896CA metallophosphoesterase n=1 Tax=Geobacillu... 77 5e-13 UniRef50_B5Y710 Copper amine oxidase N-domain family n=2 Tax=Cop... 77 6e-13 UniRef50_C6IEV9 Putative uncharacterized protein n=2 Tax=Bactero... 77 8e-13 UniRef50_B8I1Q9 Ig-like, group 2 n=3 Tax=Clostridium RepID=B8I1Q... 76 8e-13 UniRef50_UPI00016C4EC3 hypothetical protein GobsU_32169 n=1 Tax=... 76 9e-13 UniRef50_A7C442 Putative uncharacterized protein n=1 Tax=Beggiat... 76 1e-12 UniRef50_Q8YKH7 All7320 protein n=2 Tax=Cyanobacteria RepID=Q8YK... 75 2e-12 UniRef50_D1R528 Putative uncharacterized protein n=1 Tax=Parachl... 75 3e-12 UniRef50_B0JGJ2 Putative uncharacterized protein n=2 Tax=Microcy... 75 3e-12 UniRef50_B1X2V5 Putative uncharacterized protein n=2 Tax=Cyanoth... 74 4e-12 UniRef50_B9XE16 Putative uncharacterized protein n=1 Tax=bacteri... 74 4e-12 UniRef50_D1B6I7 N-acetylglucosamine-1-phosphodiester alpha-N-ace... 74 5e-12 UniRef50_A5D3R0 Hypothetical membrane protein n=1 Tax=Pelotomacu... 74 5e-12 UniRef50_UPI0001923977 PREDICTED: similar to predicted protein, ... 73 8e-12 UniRef50_A4XD34 Putative uncharacterized protein n=1 Tax=Salinis... 73 8e-12 UniRef50_UPI0001BC7E39 hypothetical protein BacD2_08600 n=1 Tax=... 73 9e-12 UniRef50_B1HN11 Putative uncharacterized protein n=2 Tax=Bacilla... 73 1e-11 UniRef50_B5W3X9 Putative uncharacterized protein n=3 Tax=Arthros... 72 1e-11 UniRef50_UPI0001C30FBA N-acetylglucosamine-1-phosphodiester alph... 72 1e-11 UniRef50_D2ASL7 Exopolysaccharide biosynthesis protein related t... 72 2e-11 UniRef50_A9V9Y5 Predicted protein n=1 Tax=Monosiga brevicollis R... 72 2e-11 UniRef50_D2AUR4 Exopolysaccharide biosynthesis protein related t... 72 2e-11 UniRef50_C0Z816 Putative uncharacterized protein n=1 Tax=Breviba... 72 2e-11 UniRef50_B3QZA6 Putative uncharacterized protein n=1 Tax=Chloroh... 72 2e-11 UniRef50_B5YE82 Putative uncharacterized protein n=2 Tax=Dictyog... 72 2e-11 UniRef50_A6TUG6 Copper amine oxidase domain protein n=1 Tax=Alka... 72 2e-11 UniRef50_A0LEU6 Putative uncharacterized protein n=1 Tax=Syntrop... 71 3e-11 UniRef50_A4XGY7 Putative uncharacterized protein n=2 Tax=Clostri... 71 3e-11 UniRef50_C5C0E0 Metallophosphoesterase n=1 Tax=Beutenbergia cave... 71 4e-11 UniRef50_C1A670 Putative uncharacterized protein n=1 Tax=Gemmati... 71 4e-11 UniRef50_B8I064 Exopolysaccharide biosynthesis protein n=1 Tax=C... 71 4e-11 UniRef50_C7IFA0 Exopolysaccharide biosynthesis protein n=1 Tax=C... 70 7e-11 UniRef50_A5D3T7 Hypothetical membrane protein n=1 Tax=Pelotomacu... 70 8e-11 UniRef50_C9N2Q2 Metallophosphoesterase n=2 Tax=Actinomycetales R... 70 9e-11 UniRef50_UPI0001C31921 Collagen triple helix repeat protein n=2 ... 69 1e-10 UniRef50_A7M0H0 Putative uncharacterized protein n=2 Tax=Bactero... 69 2e-10 UniRef50_Q01TI8 Putative uncharacterized protein n=1 Tax=Candida... 68 2e-10 UniRef50_A3DIP4 Exopolysaccharide biosynthesis protein n=3 Tax=C... 68 3e-10 UniRef50_A7HB86 Putative uncharacterized protein n=4 Tax=Anaerom... 68 3e-10 UniRef50_C2FS46 Putative uncharacterized protein n=2 Tax=Sphingo... 68 3e-10 UniRef50_Q2JUI0 Conserved domain protein n=2 Tax=Synechococcus R... 68 4e-10 UniRef50_Q5ULM2 Orf92 n=1 Tax=Lactobacillus phage LP65 RepID=Q5U... 67 4e-10 UniRef50_A4FAG7 Secreted protein n=5 Tax=Actinomycetales RepID=A... 67 4e-10 UniRef50_A1VEZ3 Putative uncharacterized protein n=4 Tax=Desulfo... 67 4e-10 UniRef50_B4VYL6 Tat pathway signal sequence domain protein n=1 T... 67 5e-10 UniRef50_C7LNU2 Putative uncharacterized protein n=1 Tax=Desulfo... 67 6e-10 UniRef50_A9QSN5 Exopolysaccharide biosynthesis protein n=4 Tax=L... 67 6e-10 UniRef50_A4FD37 Secreted protein n=1 Tax=Saccharopolyspora eryth... 67 7e-10 UniRef50_Q9L2D5 Putative secreted protein n=2 Tax=Streptomyces R... 67 8e-10 UniRef50_UPI0001744904 hypothetical protein VspiD_09360 n=1 Tax=... 67 8e-10 UniRef50_B5RQG1 Uncharacterized conserved protein n=20 Tax=Borre... 67 8e-10 UniRef50_A5ILT0 Putative uncharacterized protein n=6 Tax=Thermot... 66 9e-10 UniRef50_A4CSS0 Putative uncharacterized protein n=1 Tax=Synecho... 66 9e-10 UniRef50_B6V2M3 Gp2.43 n=1 Tax=Bacillus phage SPO1 RepID=B6V2M3_... 66 9e-10 UniRef50_C6WLB3 Metallophosphoesterase n=1 Tax=Actinosynnema mir... 66 1e-09 UniRef50_B4WHW3 Putative uncharacterized protein n=1 Tax=Synecho... 66 1e-09 UniRef50_D1VTW3 Copper amine oxidase N-domain superfamily n=1 Ta... 66 1e-09 UniRef50_A4FAL4 Putative uncharacterized protein n=2 Tax=Actinom... 65 2e-09 UniRef50_C0AEZ6 Putative uncharacterized protein n=1 Tax=Opituta... 65 3e-09 UniRef50_A5GW09 Putative uncharacterized protein SynRCC307_2165 ... 64 4e-09 UniRef50_Q7U4D6 Putative uncharacterized protein n=11 Tax=Cyanob... 64 4e-09 UniRef50_C8VW07 S-layer domain protein n=1 Tax=Desulfotomaculum ... 64 4e-09 UniRef50_C5CET4 Putative uncharacterized protein n=1 Tax=Kosmoto... 64 5e-09 UniRef50_C9M6C8 Putative uncharacterized protein n=1 Tax=Jonquet... 64 5e-09 UniRef50_Q30YC1 Putative uncharacterized protein n=1 Tax=Desulfo... 63 8e-09 UniRef50_B4WFN8 Putative uncharacterized protein n=1 Tax=Synecho... 63 8e-09 UniRef50_C8X0Z8 Putative uncharacterized protein n=1 Tax=Desulfo... 63 9e-09 UniRef50_Q0AWB0 Putative uncharacterized protein n=1 Tax=Syntrop... 63 1e-08 UniRef50_A3TM75 Putative uncharacterized protein n=1 Tax=Janibac... 63 1e-08 UniRef50_A7HN47 Putative uncharacterized protein n=1 Tax=Fervido... 63 1e-08 UniRef50_A6NQQ4 Putative uncharacterized protein n=1 Tax=Bactero... 62 1e-08 UniRef50_C1YVW0 Putative uncharacterized protein n=1 Tax=Nocardi... 62 2e-08 UniRef50_Q4ZC55 ORF005 n=1 Tax=Staphylococcus phage EW RepID=Q4Z... 62 2e-08 UniRef50_D2PRV8 Metallophosphoesterase n=1 Tax=Kribbella flavida... 62 3e-08 UniRef50_Q7NGC8 Glr3243 protein n=1 Tax=Gloeobacter violaceus Re... 62 3e-08 UniRef50_A6LP25 Putative uncharacterized protein n=1 Tax=Thermos... 61 3e-08 UniRef50_B7IEY1 Putative uncharacterized protein n=1 Tax=Thermos... 61 3e-08 UniRef50_Q03K73 Exopolysaccharide biosynthesis protein related t... 61 3e-08 UniRef50_C1TLP7 Sporulation related-protein with S-layer-like do... 61 3e-08 UniRef50_D2PZR6 Sporulation domain protein n=4 Tax=Actinomycetal... 60 5e-08 UniRef50_C6D5A3 Copper amine oxidase domain protein n=1 Tax=Paen... 60 8e-08 UniRef50_D2J8B1 Putative uncharacterized protein n=1 Tax=Staphyl... 59 1e-07 UniRef50_B2S1G8 Hypothetical cytosolic protein n=2 Tax=Borrelia ... 59 1e-07 UniRef50_B8FVQ0 Ig-like, group 2 n=2 Tax=Desulfitobacterium hafn... 59 2e-07 UniRef50_C9RD84 Copper amine oxidase domain protein n=1 Tax=Ammo... 58 2e-07 UniRef50_UPI00017890C7 copper amine oxidase domain protein n=1 T... 58 3e-07 UniRef50_C9PT69 Putative uncharacterized protein n=1 Tax=Prevote... 57 4e-07 UniRef50_A8F5X1 Putative uncharacterized protein n=1 Tax=Thermot... 57 5e-07 UniRef50_C7QCB3 Putative uncharacterized protein n=1 Tax=Catenul... 57 5e-07 UniRef50_A7SGX9 Predicted protein (Fragment) n=2 Tax=Nematostell... 57 6e-07 UniRef50_Q1MS76 Putative uncharacterized protein LI0093 n=1 Tax=... 57 6e-07 UniRef50_C4DE18 Putative uncharacterized protein n=1 Tax=Stackeb... 56 9e-07 UniRef50_A9GRW8 Putative uncharacterized protein n=1 Tax=Sorangi... 56 1e-06 UniRef50_C7PW43 Ig domain protein group 2 domain protein n=2 Tax... 56 1e-06 UniRef50_Q826N8 Putative secreted protein n=1 Tax=Streptomyces a... 56 1e-06 UniRef50_D1Y6Q3 Putative liporotein n=1 Tax=Pyramidobacter pisco... 55 2e-06 UniRef50_C7LY43 Putative uncharacterized protein n=1 Tax=Acidimi... 55 2e-06 UniRef50_A6G841 Putative uncharacterized protein n=1 Tax=Plesioc... 55 2e-06 UniRef50_UPI00019038D8 hypothetical protein Retl8_15906 n=1 Tax=... 55 3e-06 UniRef50_UPI00016BFF19 Ig-like, group 2 n=1 Tax=Epulopiscium sp.... 54 4e-06 UniRef50_C6IV65 Putative uncharacterized protein n=1 Tax=Paeniba... 54 5e-06 UniRef50_B2A2E0 Copper amine oxidase domain protein n=1 Tax=Natr... 54 5e-06 UniRef50_A3P9C8 Putative lipoprotein n=32 Tax=pseudomallei group... 54 6e-06 UniRef50_A3YXL4 Putative uncharacterized protein n=2 Tax=Chrooco... 52 1e-05 UniRef50_C6J2I2 Copper amine oxidase domain-containing protein n... 52 2e-05 UniRef50_C0DAA9 Putative uncharacterized protein n=1 Tax=Clostri... 52 3e-05 UniRef50_D2PYC0 Metallophosphoesterase n=1 Tax=Kribbella flavida... 50 5e-05 UniRef50_C7QHR1 Putative uncharacterized protein n=1 Tax=Catenul... 50 6e-05 UniRef50_C3YJA0 Putative uncharacterized protein n=3 Tax=Branchi... 50 7e-05 UniRef50_A6WEB7 Putative uncharacterized protein n=1 Tax=Kineoco... 50 1e-04 UniRef50_B8HP94 Polysaccharide deacetylase n=1 Tax=Cyanothece sp... 49 2e-04 UniRef50_A9BJK8 Putative uncharacterized protein n=1 Tax=Petroto... 48 3e-04 UniRef50_Q7NIQ9 Gll2123 protein n=1 Tax=Gloeobacter violaceus Re... 48 3e-04 UniRef50_B7KAR9 Polysaccharide deacetylase n=3 Tax=Cyanothece Re... 48 3e-04 UniRef50_Q72HQ9 Putative uncharacterized protein n=4 Tax=Thermac... 47 5e-04 UniRef50_C1XUX9 Putative uncharacterized protein n=1 Tax=Meiothe... 47 5e-04 UniRef50_A9EQ62 Putative uncharacterized protein n=1 Tax=Sorangi... 47 8e-04 UniRef50_B8CD22 Predicted protein n=1 Tax=Thalassiosira pseudona... 47 0.001 UniRef50_A4FIV8 Secreted protein n=1 Tax=Saccharopolyspora eryth... 46 0.001 UniRef50_A7MD65 Zgc:165534 protein n=3 Tax=Clupeocephala RepID=A... 45 0.002 UniRef50_UPI00016A4F20 hypothetical protein BthaT_13010 n=4 Tax=... 45 0.002 UniRef50_Q8YTL3 All2704 protein n=4 Tax=Nostocaceae RepID=Q8YTL3... 45 0.002 UniRef50_Q119M8 Putative uncharacterized protein n=1 Tax=Trichod... 45 0.002 UniRef50_A9V0B9 Predicted protein n=1 Tax=Monosiga brevicollis R... 44 0.004 UniRef50_Q2JPV6 Polysaccharide deacetylase family protein n=2 Ta... 43 0.007 UniRef50_B2HMV0 Lipoprotein LprO n=21 Tax=Mycobacterium RepID=B2... 43 0.009 UniRef50_Q92JI8 Uncharacterized protein RC0079 n=11 Tax=Ricketts... 43 0.011 UniRef50_UPI0001C1628D hypothetical protein CRC_02750 n=2 Tax=No... 42 0.014 UniRef50_C5KB48 Putative uncharacterized protein n=1 Tax=Perkins... 42 0.015 UniRef50_C4ICA7 Putative uncharacterized protein n=1 Tax=Clostri... 42 0.016 UniRef50_A9QSK3 Polysaccharide biosynthesis protein n=7 Tax=Stre... 42 0.024 UniRef50_A5N8M7 Predicted regulatory protein n=2 Tax=Clostridium... 41 0.045 >UniRef50_P27840 Uncharacterized protein yigE n=68 Tax=Enterobacteriaceae RepID=YIGE_ECOLI Length = 254 Score = 282 bits (722), Expect = 7e-75, Method: Composition-based stats. Identities = 254/254 (100%), Positives = 254/254 (100%) Query: 1 MAHQLLIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMY 60 MAHQLLIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMY Sbjct: 1 MAHQLLIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMY 60 Query: 61 WQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGE 120 WQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGE Sbjct: 61 WQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGE 120 Query: 121 GNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSK 180 GNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSK Sbjct: 121 GNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSK 180 Query: 181 IRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQ 240 IRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQ Sbjct: 181 IRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQ 240 Query: 241 RYPFVTMISVERKG 254 RYPFVTMISVERKG Sbjct: 241 RYPFVTMISVERKG 254 >UniRef50_B2II06 Putative uncharacterized protein n=5 Tax=Rhizobiales RepID=B2II06_BEII9 Length = 269 Score = 268 bits (685), Expect = 1e-70, Method: Composition-based stats. Identities = 73/245 (29%), Positives = 127/245 (51%), Gaps = 2/245 (0%) Query: 11 MITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWG 70 + T L R+FL L L A A L++ + + + ++++WQ+ G+ +G Sbjct: 17 IFTKLLMRVFLPLFLSAGTAWAEPCLPLTEEGINYVVCRFDTKRSDLRLFWQQPGGQPYG 76 Query: 71 TLHALLADINSQGQVQ-MAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGG 129 L A + +G+ AMN G++ E +P+GLYI+ G+ N+ +G GNF ++P G Sbjct: 77 GFAPLRAQLQPKGETLEFAMNAGMFQEDLSPVGLYIQEGRLLHPANMRNGPGNFHMKPNG 136 Query: 130 VFYVAGDKVGIVRLDAF-KTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGIN 188 +FY + G++ F ++ + +A QSGP+L+ N ++P+I P S KIRNGVG+ Sbjct: 137 IFYFSQTSAGVMETGRFLQSGLKPDYATQSGPLLVANNQLHPKIEPTGTSEKIRNGVGVR 196 Query: 189 KHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMI 248 + +F +S+ F+ FA + +L+ L+LDG+IS +Y Q P ++ Sbjct: 197 DNHEVIFAISEAPVTFFRFARLFRDRLHCPDALFLDGSISSLYAPSLNRDDQWRPIGPIV 256 Query: 249 SVERK 253 K Sbjct: 257 GAVSK 261 >UniRef50_A9W4Y6 Putative uncharacterized protein n=4 Tax=Methylobacterium extorquens group RepID=A9W4Y6_METEP Length = 258 Score = 241 bits (616), Expect = 1e-62, Method: Composition-based stats. Identities = 76/235 (32%), Positives = 123/235 (52%), Gaps = 4/235 (1%) Query: 21 LALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADIN 80 + + P A A+ TV+ + ERV+++W +G +G+L +L Sbjct: 26 VPVQAQPAPAAKGPCQAVEFEGQPYTVCTVDLRRERVRLFWLGTDGLPYGSLSSL--ADR 83 Query: 81 SQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGI 140 ++ AMN G+YD+ AP+GLY+E+G++ + A+G GNF ++P GVFYV GD+ G+ Sbjct: 84 QGPRLSFAMNAGMYDKGQAPVGLYVEDGRELKGASTANGPGNFHLKPNGVFYVKGDRAGV 143 Query: 141 VRLDAF-KTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNA-VFLLS 198 + + + FA QSGPML+ +G I+P+I + S KIRNGVG+ G+ VF +S Sbjct: 144 LDTGRYLRAKPAPDFATQSGPMLVIDGKIHPKISADGPSQKIRNGVGVRDGGHVAVFAIS 203 Query: 199 QQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVERK 253 ++ F FA K L+LDG++S +Y G P ++ + Sbjct: 204 ERPVTFGAFARLFKDSFGCRNALFLDGSVSSLYAPGLGRSDLSRPLGPLVGAVGR 258 >UniRef50_B9JX75 Putative uncharacterized protein n=1 Tax=Agrobacterium vitis S4 RepID=B9JX75_AGRVS Length = 274 Score = 239 bits (609), Expect = 9e-62, Method: Composition-based stats. Identities = 81/246 (32%), Positives = 131/246 (53%), Gaps = 4/246 (1%) Query: 12 ITLNLKRIFLALTLLPLFAVAAD--DCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAW 69 I + L I L + P A A + ++ + +P T ++++ + A+G+ + Sbjct: 26 IVVWLFAILSPLVISPERAEAEEQSCRDQTENGFAYRVCRFDPATRTIRIFNRNADGDVY 85 Query: 70 GTLHALLADINSQGQVQ-MAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPG 128 G AL + + Q + A+NGG+Y +P+GL+++ G + A G GNF+++P Sbjct: 86 GGFEALRSQLWQQRLILTFAVNGGMYHSDLSPVGLFVDYGMTRKTAETADGWGNFYLKPN 145 Query: 129 GVFYVAGDKVGIVRLDAFKTSK-EIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGI 187 GVF++ G++ F+T K E FA QSGPML+ +GV++P+ P S KIRNGVGI Sbjct: 146 GVFFLKDGHAGVLETGQFETQKIEADFATQSGPMLVIDGVLHPKFLPTSDSLKIRNGVGI 205 Query: 188 NKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTM 247 + G VF+LS+ FYD A + + +L LYLDGTIS + + YP + Sbjct: 206 DASGQVVFVLSKDPVRFYDMAAFFRDRLGAANALYLDGTISSLAEPMAGRIDRAYPLGPI 265 Query: 248 ISVERK 253 I+V + Sbjct: 266 IAVVDQ 271 >UniRef50_Q98NI9 Mlr0120 protein n=3 Tax=Alphaproteobacteria RepID=Q98NI9_RHILO Length = 263 Score = 233 bits (595), Expect = 4e-60, Method: Composition-based stats. Identities = 72/247 (29%), Positives = 123/247 (49%), Gaps = 3/247 (1%) Query: 10 GMITLNLKRIFLALTLLPLFAVA-ADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEA 68 G + L + + + V+ + + V+P+ ++++W+ G+ Sbjct: 15 GAVKAALPQAVASTMAFSQWFVSLPPCRDFAFEATSYLICEVDPKLYSIELFWKDPVGKP 74 Query: 69 WGTLHALLADINSQGQ-VQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRP 127 + +LH L A + G+ + A+N G+Y P+GLY+E G++ + SG GNF ++P Sbjct: 75 FQSLHNLDAAQRAAGRTMLFAINAGMYHPDLRPVGLYVERGREMAGVRTGSGSGNFSLQP 134 Query: 128 GGVFYVAGDKVGIVRLDAF-KTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVG 186 G+FY++G K + F + +A QSGPML+ +G ++P+ + S K R+GVG Sbjct: 135 NGIFYISGGKAAVRATRDFVRKRPSTDYATQSGPMLVIDGQLHPKFQSDGTSRKTRDGVG 194 Query: 187 INKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVT 246 + K G AVF +S NF+ FA + L + L+LDGTIS ++ + Sbjct: 195 VRKDGVAVFAISNGTVNFHTFARLFRDALGCDNALFLDGTISSLFAPAIGRNDDYWNLGP 254 Query: 247 MISVERK 253 MI V RK Sbjct: 255 MIGVFRK 261 >UniRef50_A9CIN9 Putative uncharacterized protein n=6 Tax=Proteobacteria RepID=A9CIN9_AGRT5 Length = 254 Score = 224 bits (570), Expect = 3e-57, Method: Composition-based stats. Identities = 69/213 (32%), Positives = 114/213 (53%), Gaps = 3/213 (1%) Query: 44 TVQAYTVNPQTERVKMYWQK-ANGEAWGTLHALLADINSQGQV-QMAMNGGIYDESYAPL 101 + +P +++Y Q +G+ + L + + Q AMNGG+Y Y+P+ Sbjct: 39 RYTVCSFDPAKNTIRIYDQDHVSGQGYRNFADLSSALWRQHMFSVFAMNGGMYHSDYSPV 98 Query: 102 GLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAF-KTSKEIQFAVQSGP 160 GL++ENG ++ ++ G GNF + P GVFY+ G+ G++ +A+ + FA QSGP Sbjct: 99 GLFVENGVERSPVSTRGGWGNFHLLPNGVFYLDGNTAGVLETEAYLAADPKPDFATQSGP 158 Query: 161 MLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQL 220 ML+ +G ++PR P+ S K RNGVG+++ G F +S+ FYDF + L+ Sbjct: 159 MLVIDGKLHPRFLPDSDSLKRRNGVGVSRDGMVHFAISETTVRFYDFGTLFRDVLDAPNA 218 Query: 221 LYLDGTISHMYMKGGAIPWQRYPFVTMISVERK 253 LYLDGTIS + + Q + +I+V + Sbjct: 219 LYLDGTISSVDIPAMNRRDQLFSMGPIIAVVDR 251 >UniRef50_Q0BWY7 Putative lipoprotein n=1 Tax=Hyphomonas neptunium ATCC 15444 RepID=Q0BWY7_HYPNA Length = 249 Score = 222 bits (567), Expect = 5e-57, Method: Composition-based stats. Identities = 66/238 (27%), Positives = 112/238 (47%), Gaps = 5/238 (2%) Query: 21 LALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADIN 80 A+ S L + + + ++++ + G +G L + Sbjct: 12 GAILSACNEVEEGPCQTRSFENLPYLVCSFDASQDTIRLFLRDETGVPFGQFDRLANHVA 71 Query: 81 -SQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVG 139 G + AMN G+Y + P+GLYIE G+ ++ L + G GNF + P GVF++ K G Sbjct: 72 SKGGNLVFAMNAGMYHDDRRPVGLYIEEGEAEMNLVRSPGPGNFGMLPNGVFWIDAGKAG 131 Query: 140 IVRLDAFKTSKE---IQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGN-AVF 195 + AF + +FA QSGPML+ +G ++P ++P+ S + RNGVG+++ G F Sbjct: 132 VSETLAFDERFKETPPRFATQSGPMLVIDGALHPALNPDGTSLRRRNGVGVSEDGRQVYF 191 Query: 196 LLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVERK 253 ++S NF+ FA + +L LYLDG +S Y+ ++ V R+ Sbjct: 192 VISDVPVNFHSFARLFRDELGTPNALYLDGAVSKAYVPALERSETGLDMGPIVGVIRE 249 >UniRef50_Q11X50 Putative uncharacterized protein n=2 Tax=Bacteroidetes RepID=Q11X50_CYTH3 Length = 244 Score = 222 bits (565), Expect = 1e-56, Method: Composition-based stats. Identities = 83/217 (38%), Positives = 124/217 (57%), Gaps = 3/217 (1%) Query: 39 SDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQ-VQMAMNGGIYDES 97 T+ V +YTV+PQ + ++ YW+ NGE ++ L A + S+G + A NGG+Y E Sbjct: 25 QQDTIDVISYTVDPQKDNLQFYWKNDNGEILKSIKKLKAYVESKGSTLLFATNGGMYKED 84 Query: 98 YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVA-GDKVGIVRLDAFKTSKEIQFAV 156 +PLGL+I+NG+ LN A G+GNF+++P GVFY+ ++ I + + F + I+FA Sbjct: 85 RSPLGLFIQNGKTVTPLNKAKGQGNFYMQPNGVFYITNDNEAVICKTEDFINNGNIKFAT 144 Query: 157 QSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLN 216 QSGPM++ N I+P + IRNGVGI + +F +S++ NF+DFA Y + L Sbjct: 145 QSGPMIIVNNQIHPSFIKGSKNLNIRNGVGILPNKKIIFAMSEKEVNFFDFALYFQN-LG 203 Query: 217 VEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVERK 253 E LYLDG +S Y+ F MI V K Sbjct: 204 CENALYLDGFVSRSYLLEKKWLQTDGEFGVMIGVTEK 240 >UniRef50_D2LB58 Putative uncharacterized protein n=1 Tax=Rhodomicrobium vannielii ATCC 17100 RepID=D2LB58_RHOVA Length = 247 Score = 220 bits (560), Expect = 4e-56, Method: Composition-based stats. Identities = 71/239 (29%), Positives = 114/239 (47%), Gaps = 4/239 (1%) Query: 19 IFLALTLLPLFAVAAD--DCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALL 76 F+A+ + AA + + V+++WQK +G + L AL Sbjct: 6 AFIAMAAFCGSSEAAAQTCKPYAFEGNGYTLCEASLDRFAVRLFWQKPDGGPYTYLSALP 65 Query: 77 ADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGD 136 G++ A+NGG++ Y P+GL++ENG++ V N G GNF +RP G+FY Sbjct: 66 KTDERGGRLAFALNGGMFHPDYKPVGLHVENGRELVRANTRPGPGNFHLRPNGIFYFGEA 125 Query: 137 KVGIVRLDAF-KTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVF 195 + G++ AF K + FA QSGPML+ +G ++PRI S+K R+GV + + VF Sbjct: 126 EAGVMETGAFLKKKPKANFATQSGPMLVIDGKLHPRIAKANVSAKPRDGVCVRGDKSVVF 185 Query: 196 LLSQQATNFYDFACYAKAKLNVEQLLYLDGTIS-HMYMKGGAIPWQRYPFVTMISVERK 253 +S F F + L L+LDG + +++ G + MI+V K Sbjct: 186 AISDGGVPFDTFMRLFRDGLKCRNALFLDGGTAPALFVPGTRSGNVLFGLGPMIAVYEK 244 >UniRef50_D0SJ31 Putative uncharacterized protein n=1 Tax=Acinetobacter junii SH205 RepID=D0SJ31_ACIJU Length = 252 Score = 217 bits (553), Expect = 3e-55, Method: Composition-based stats. Identities = 67/234 (28%), Positives = 122/234 (52%), Gaps = 4/234 (1%) Query: 22 ALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKAN-GEAWGTLHALLADIN 80 + A + ++ + + V+ ++++ + G+ + + +D+ Sbjct: 16 CMVFQATTVFAFEYQSIKFEDVQFEVIKVDDLK-DLQLFLKNPRIGDFYQKFSNIQSDLA 74 Query: 81 SQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGI 140 + +++ AMN G+Y ++ P+GLYIE ++ LN ++G GNFF++P GV I Sbjct: 75 ACKELRFAMNAGMYHPNFEPVGLYIEKKKKLSELNESTGFGNFFMQPNGVVVWNDHGAVI 134 Query: 141 VRLDAFK-TSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQ 199 +K + FA QSGPML+ G+IN + + S KIRNGVG+ + F++S+ Sbjct: 135 HSTADYKRANFTANFATQSGPMLVHKGLINSQFIKDSNSLKIRNGVGVRDD-HLYFVISE 193 Query: 200 QATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVERK 253 Q NFY FA + K +L V++ LYLDG+IS +Y+K ++Y ++ + + Sbjct: 194 QRINFYQFAKFFKHQLRVDEALYLDGSISSLYLKDIQRNDRKYNLGPIVGLTHQ 247 >UniRef50_Q26CZ6 Periplasmic protein n=1 Tax=Flavobacteria bacterium BBFL7 RepID=Q26CZ6_9BACT Length = 241 Score = 217 bits (552), Expect = 3e-55, Method: Composition-based stats. Identities = 68/222 (30%), Positives = 109/222 (49%), Gaps = 6/222 (2%) Query: 34 DDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQ-GQVQMAMNGG 92 D + D ++ ++ +++++YW + + T L + Q ++ AMN G Sbjct: 23 QDLIIKDDRFHIKV--IDLTKQKLQLYWLDQDNKPIETFEQLNMHVKQQDKRLVYAMNAG 80 Query: 93 IYDESYAPLGLYIENGQQKVALNLAS-GEGNFFIRPGGVFYVA-GDKVGIVRLDAFKTSK 150 +Y + ++P GLYIENG L+ + G GNF+++P GVFY+ K + Sbjct: 81 MYLKDHSPQGLYIENGTIHKQLDTVTVGYGNFYLQPNGVFYLTQDGKAQVTATPQLSNFS 140 Query: 151 EIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACY 210 I +A QSGPML+ N I+P + + IRN VGI G + +S++ NFYDFA + Sbjct: 141 NITYATQSGPMLVINDTIHPAFNKGSKNVHIRNAVGILPDGRILLAISKEKINFYDFATF 200 Query: 211 AKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVER 252 K + + LYLDG +S +Y + F MI V Sbjct: 201 FKNQ-GCKNALYLDGFVSRIYDPTINVEQMDGHFGVMIGVSD 241 >UniRef50_A9D6B9 Putative uncharacterized protein n=1 Tax=Hoeflea phototrophica DFL-43 RepID=A9D6B9_9RHIZ Length = 286 Score = 217 bits (552), Expect = 3e-55, Method: Composition-based stats. Identities = 66/234 (28%), Positives = 113/234 (48%), Gaps = 6/234 (2%) Query: 26 LPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINS---- 81 + T++PQT +++ ++ G+ G++ A++ + + Sbjct: 47 MTKPDWPEGCVEQVFEGARAILCTIDPQTHDMRLVYRDRMGDVLGSVSAVVDQLAAGAGT 106 Query: 82 QGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYV-AGDKVGI 140 ++ +AMN G+Y +P+GLY+EN + ALN G GNFF++P GVF+V G+ Sbjct: 107 DHKLVLAMNAGMYHADMSPVGLYVENSVEIAALNRDDGFGNFFLKPNGVFFVLKDGNAGV 166 Query: 141 VRLDAFK-TSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQ 199 + DA+ ++A QSGPML+ +GVI+PR P+ S IRNGVG+ G VF +++ Sbjct: 167 LETDAYAEADLSPEYATQSGPMLVIDGVIHPRFLPDGTSKFIRNGVGVRPDGKVVFAITR 226 Query: 200 QATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVERK 253 + FA + E L+ DG +S + + P + V + Sbjct: 227 DRVSLGSFARLFRDVAGCENALFFDGAVSSLALGSKMEIDSEEPAGPVAVVVAR 280 >UniRef50_C8PYM1 Putative uncharacterized protein n=1 Tax=Enhydrobacter aerosaccus SK60 RepID=C8PYM1_9GAMM Length = 271 Score = 214 bits (546), Expect = 2e-54, Method: Composition-based stats. Identities = 69/233 (29%), Positives = 106/233 (45%), Gaps = 15/233 (6%) Query: 34 DDCALSDPTLTVQAYTVNPQTE-RVKMYWQKANGE------AWGTLHALLADINSQGQVQ 86 DC ++ + ++WQ + + TL L + Sbjct: 38 PDCQRKSQPFDYSICELDAKNAANFSLHWQNPSSASHPLLLTFTTLRDYLVSEQPAKTLL 97 Query: 87 MAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAF 146 AMN G+YD ++AP+G + NG+Q ALNL G GNF + P GVF+ I + Sbjct: 98 FAMNAGMYDSNFAPIGYTVINGKQIRALNLKQGGGNFHLMPNGVFWQDRQGFYITESQSM 157 Query: 147 KTS----KEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHG--NAVFLLSQQ 200 + FA QSGPML+ +G I+P N S K RNG+G+ H F++S Sbjct: 158 AKKLASGAKPTFATQSGPMLVIDGNIHPAFDANSTSRKYRNGIGVCGHNPSRVKFVISDT 217 Query: 201 ATNFYDFACYAKAKLNVEQLLYLDGTI-SHMYMKGGAIPWQRYPFVTMISVER 252 +FY+FA K++L + L+LDG S +Y + + +Y MI+V + Sbjct: 218 PVSFYEFADLFKSQLGCDNALFLDGGSASALYSQTLSRNDNKY-MGVMIAVTQ 269 >UniRef50_Q1IX28 Periplasmic YigE-like protein n=3 Tax=Deinococcus RepID=Q1IX28_DEIGD Length = 317 Score = 214 bits (546), Expect = 2e-54, Method: Composition-based stats. Identities = 79/245 (32%), Positives = 126/245 (51%), Gaps = 11/245 (4%) Query: 15 NLKRIFLALTLLPLFAV-AADDCALSDPTLTVQAYTV---NPQTERVKMYWQKAN-GEAW 69 N+ RIF+ LLPL A A + T YTV + + + ++++W+ G+ + Sbjct: 77 NVLRIFV--LLLPLTACSQAGGLDVRRVTAEGMLYTVAAVDLKRDHLRLHWKNPATGQPY 134 Query: 70 GTLHALLADINSQG-QVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPG 128 T + A + G QV A N GIY PLGL++E G+ + LN A GNF + P Sbjct: 135 RTFAEVSARLRKDGEQVLFATNSGIYGPGLEPLGLHVEEGRTLIGLNNARSGGNFALLPN 194 Query: 129 GVFYVAGDKVGIVRLDAFKT-SKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGI 187 GVF+V G++ G+ A++ + + FA QSGP+L++ G ++P + +S K+R+GVG+ Sbjct: 195 GVFWVKGNQAGVTETQAYRRLNIQPTFATQSGPLLVQGGRLHPAFNKGSSSFKVRSGVGV 254 Query: 188 NKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTM 247 + G F +S NF+ FA + + L LYLDG+IS Q F + Sbjct: 255 CRDGRVRFAVSAGPVNFHSFAVFFRDVLGCPDALYLDGSISAYATPDADT--QVADFAGI 312 Query: 248 ISVER 252 ++ R Sbjct: 313 WTISR 317 >UniRef50_Q093S1 Putative uncharacterized protein n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q093S1_STIAU Length = 278 Score = 214 bits (545), Expect = 2e-54, Method: Composition-based stats. Identities = 76/264 (28%), Positives = 124/264 (46%), Gaps = 23/264 (8%) Query: 5 LLIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPT------------LTVQAYTVNP 52 LLIG G+ T A LL A +L PT T Y V+ Sbjct: 19 LLIGSGLGT-------GATHLLAAPHTPAATRSLQTPTGRVAARRIAYRGNTYDTYEVDL 71 Query: 53 QTERVKMYWQKANGEAWGTLHALLADIN-SQGQVQMAMNGGIYDESYAPLGLYIENGQQK 111 +++ Y+Q+ +G + +L L + ++ A N G++ + P+GLY+E+G++ Sbjct: 72 TQSKLRFYFQQPDGTPFSSLGNLRGWLQGRGKRLVFATNAGMFTPARRPVGLYVEDGREF 131 Query: 112 VALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSK--EIQFAVQSGPMLMENGVIN 169 V LN GNFF++P VF+V GI+ A+ ++ +A QSGP L+ +G ++ Sbjct: 132 VGLNTQEEAGNFFLKPNAVFFVTETGAGILESSAYAAHPPAKVLYATQSGPALLLHGQMH 191 Query: 170 PRIHPNVASSKIR-NGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTIS 228 P + R +GVGI VF ++QQA N ++FA + + + + LYLDG +S Sbjct: 192 PAFREGSRNLSPRRSGVGIVTPTRVVFAMTQQAVNLHEFASFFRDQFGCQDALYLDGVVS 251 Query: 229 HMYMKGGAIPWQRYPFVTMISVER 252 MY+ F MI++ Sbjct: 252 RMYLPALGRDELDGDFGAMIAISE 275 >UniRef50_C5CWT4 Periplasmic protein-like protein n=3 Tax=Bacteria RepID=C5CWT4_VARPS Length = 238 Score = 211 bits (537), Expect = 2e-53, Method: Composition-based stats. Identities = 67/210 (31%), Positives = 118/210 (56%), Gaps = 4/210 (1%) Query: 44 TVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQ-VQMAMNGGIYDESYAPLG 102 ++ + ER++++ +G + L A + ++ + + AMN G+Y ++P+G Sbjct: 27 RYTVVKIDVRRERLELFLHDDSGAPFKRFDRLEAWLAARNRQLVFAMNAGMYHADFSPVG 86 Query: 103 LYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKE--IQFAVQSGP 160 L ++ G+++ LNLA+G GNFF++P GVF V+ +V + + ++ A QSGP Sbjct: 87 LLVQEGREEAPLNLAAGAGNFFLKPNGVFLVSDAGPRVVESSEYAALPKEGVRLATQSGP 146 Query: 161 MLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQL 220 +L+ GV++P P+ S KIRNGVG++ H A+F++S+Q NFY+FA Y + L+ Sbjct: 147 LLLRRGVVHPAFIPDSDSRKIRNGVGVSGH-TAIFVISEQPVNFYEFALYFRDVLHCRDA 205 Query: 221 LYLDGTISHMYMKGGAIPWQRYPFVTMISV 250 LYLDGT+S ++ ++ V Sbjct: 206 LYLDGTVSALHSLALRRSDFTRELGPILGV 235 >UniRef50_B9KP42 Periplasmic protein-like protein n=37 Tax=Rhodobacterales RepID=B9KP42_RHOSK Length = 245 Score = 211 bits (536), Expect = 3e-53, Method: Composition-based stats. Identities = 69/241 (28%), Positives = 116/241 (48%), Gaps = 6/241 (2%) Query: 17 KRIFLALTLLPLF--AVAADDCALSDPTLTVQAYTVNPQT--ERVKMYWQKANGEAWGTL 72 R LA L L+ A A + A D T Y++ + ++++ +G +G+ Sbjct: 1 MRTRLAAILFALWPAACATAEPACRDLTFEGTRYSLCEAQAGDDIRIFQTAPDGRPYGSF 60 Query: 73 HALLADINSQGQ-VQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVF 131 + + ++ +G+ + AMN G+Y P+GL IE ++ L ++G GNF + P GVF Sbjct: 61 ERINSALDGEGRQLAFAMNAGMYHADRRPVGLLIEEEVERAPLVTSAGPGNFGLLPNGVF 120 Query: 132 YVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHG 191 V I + A QSGPML+ G ++PR + S IRNGVG++ G Sbjct: 121 CVGDGFRVIESRSFAAERPACRHASQSGPMLVIGGELHPRFLVHSDSRYIRNGVGVSADG 180 Query: 192 -NAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISV 250 AVF +S + F++F + +L + + LY DG+IS +Y +G P ++ + Sbjct: 181 RRAVFAISNRPVTFHEFGRLFRDELGLPEALYFDGSISRLYDRGARRSDWGTPMGPIVGL 240 Query: 251 E 251 Sbjct: 241 V 241 >UniRef50_Q1QCK8 Periplasmic protein-like n=1 Tax=Psychrobacter cryohalolentis K5 RepID=Q1QCK8_PSYCK Length = 276 Score = 209 bits (533), Expect = 5e-53, Method: Composition-based stats. Identities = 69/237 (29%), Positives = 117/237 (49%), Gaps = 9/237 (3%) Query: 24 TLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANG-EAWGTLHALLADINSQ 82 T ++ + + + T +Q+ + + + ++WQ+++ + T LL+ + Sbjct: 38 TASTDWSCQSHNTPFAYSTCHIQSDLLTNKRYSLALFWQQSDSRQPLLTFDNLLSTLPPS 97 Query: 83 GQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAG-DKVGIV 141 ++ AMN G+Y+E+YAP+G + ++ ALNL G GNF + P GV + KV I Sbjct: 98 QSLKFAMNAGMYNENYAPIGYTVIKSEEIRALNLKEGGGNFHLLPNGVLWWDKSGKVQIT 157 Query: 142 RLDAFKTSKE-----IQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFL 196 +A + +A QSGPML+ N I+P+ P+ S+KIRNG+G+ G+ F+ Sbjct: 158 ESNALAEQLKNGIAQPLYATQSGPMLVINDAIHPQFDPDGTSAKIRNGIGVCSDGSLQFV 217 Query: 197 LSQQATNFYDFACYAKAKLNVEQLLYLDGTI-SHMYMKGGAIPWQRYPFVTMISVER 252 S+ FY FA K +L L+LDG I S +Y ++ MI + Sbjct: 218 NSEAPVAFYQFASLFKNELKCPNALFLDGGIASALYAPTIDKHDKK-EMGVMIGLVE 273 >UniRef50_A1B5U0 Putative uncharacterized protein n=1 Tax=Paracoccus denitrificans PD1222 RepID=A1B5U0_PARDP Length = 251 Score = 209 bits (533), Expect = 6e-53, Method: Composition-based stats. Identities = 71/250 (28%), Positives = 113/250 (45%), Gaps = 8/250 (3%) Query: 12 ITLNLKR----IFLALTLLPLFAVAADDCALSDPTLTVQAYTVNP-QTERVKMYWQKANG 66 + ++LKR F AL + L A+A T+ Q ++++ +G Sbjct: 1 MKIDLKRRLGLAFGALIAMTLPALAGICEKRDFDGQGYVICTLTAGQEPGLRLWLNGPDG 60 Query: 67 EAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIR 126 G A+ + + AMN G+Y + P+GLY+ +G + L A G GNF + Sbjct: 61 RTLGDFTAVRRTLAQGESLGFAMNAGMYHPDFTPVGLYVSDGVSQHDLVTAGGGGNFGML 120 Query: 127 PGGVFYVAGDKVG-IVRLDAF-KTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNG 184 P GVF G + ++ AF K + + + A QSGPML+ +G ++PR + S IRNG Sbjct: 121 PNGVFCAGGARPYQVIESRAFAKAAPDCRLATQSGPMLVIDGALHPRFLVDSDSRYIRNG 180 Query: 185 VGINKHGNA-VFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYP 243 VG++ G F +S +A F+ F + L LY DG+IS +Y G Sbjct: 181 VGVSPDGQTAWFAISDRAVTFHQFGRLFRDGLGARDALYFDGSISRLYAPGLGRADFGRR 240 Query: 244 FVTMISVERK 253 +I + Sbjct: 241 LGPIIGYVGQ 250 >UniRef50_D1AL67 Periplasmic protein-like protein n=1 Tax=Sebaldella termitidis ATCC 33386 RepID=D1AL67_SEBTE Length = 266 Score = 209 bits (532), Expect = 6e-53, Method: Composition-based stats. Identities = 95/217 (43%), Positives = 133/217 (61%), Gaps = 4/217 (1%) Query: 38 LSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDES 97 + D T Y + E +KMYW+ N +A+ L + + N+ ++ A NGGIY E Sbjct: 52 IEDRGFT--VYKPDLNKEIIKMYWKDENNKAYSELSKFIQE-NTGNKINFATNGGIYSEE 108 Query: 98 YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQ 157 Y P GLYIEN + +NLA GEGNF+++P GVFY+ ++ I AF+ ++ I +A Q Sbjct: 109 YEPNGLYIENHKIISKINLADGEGNFYMQPNGVFYIQNNQPKISESKAFEYNENISYATQ 168 Query: 158 SGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNV 217 SGP+L+ENGVIN +I N S KIR+ VGI++ FL+S + NFYDF+ YA KLN Sbjct: 169 SGPLLIENGVINKKIGKNSESFKIRSAVGIDRENKVFFLMSSEKINFYDFSKYALDKLNC 228 Query: 218 EQLLYLDGTISHMYM-KGGAIPWQRYPFVTMISVERK 253 + LL+LDG IS MY IP Q YPF +I+ E++ Sbjct: 229 KDLLFLDGAISKMYFADEKKIPEQDYPFAVIITSEKR 265 >UniRef50_Q1MEZ5 Conserved hypothetical exported protein n=45 Tax=Rhizobiales RepID=Q1MEZ5_RHIL3 Length = 258 Score = 209 bits (532), Expect = 7e-53, Method: Composition-based stats. Identities = 63/255 (24%), Positives = 110/255 (43%), Gaps = 14/255 (5%) Query: 12 ITLNLKRIFLALTLLPLFAVAADDCALSDPTLT---VQAYTVNPQTERVKMYWQKANGEA 68 + ++ + LT A A + T+ P ++++W+ A+G Sbjct: 4 LKHSVLAAAIMLTATMTSLDQAHAQACEQESFEEAKYVVCTLEPGKADLRLFWKNADGAP 63 Query: 69 WGTLHALLADINSQGQ-VQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEG------ 121 + +L + ++G+ + A+N G+Y ++P+GLY+ENG++ N E Sbjct: 64 YRAFSSLAEAVRAEGRTLAFAVNAGMYRADFSPMGLYVENGRELNPANTTEAESSSGQVP 123 Query: 122 NFFIRPGGVFYVAGDKVGIVRLDAF-KTSKEIQFAVQSGPMLMENGVINPRIHPNVASSK 180 NF+ +P GVF++ GI+ D F K + +FA QSGPML+ +NP Sbjct: 124 NFYKKPNGVFFLGETGAGILPTDEFLKRRPKARFATQSGPMLVIANKLNPIFIVGSTDRT 183 Query: 181 IRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGT-ISHMYMKGGAIPW 239 R+GVG + G F +S+ NF+DFA + L L+LDG +Y Sbjct: 184 RRSGVGTCERGAVRFAISEDRVNFHDFARLFRDHLKCPDALFLDGGRGVGLYNPDMGHND 243 Query: 240 --QRYPFVTMISVER 252 + + + Sbjct: 244 WSWHGGYGPIFGLVE 258 >UniRef50_Q0G184 Putative uncharacterized protein n=2 Tax=Aurantimonadaceae RepID=Q0G184_9RHIZ Length = 268 Score = 202 bits (515), Expect = 7e-51, Method: Composition-based stats. Identities = 68/229 (29%), Positives = 115/229 (50%), Gaps = 5/229 (2%) Query: 28 LFAVAADDCALSDPT-LTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQ 86 L A C ++ + V + + + G + T A + G+V Sbjct: 41 LPAGHEGICRIAMAGSVETILCEVPLSSFDLHLRALDDAGRPYETFEKAAASL--SGEVV 98 Query: 87 MAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAF 146 +AMN G+Y E P+GL +++G+ L +G GNF +RP G+FY+ + + + + Sbjct: 99 LAMNAGMYHEDRRPVGLTVQDGRIVKKAVLGTGSGNFSLRPNGIFYLEDGRAFVRETERY 158 Query: 147 -KTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFL-LSQQATNF 204 S + A QSGPML+ G ++PR P S +RNGVG+++ G VFL L+++ NF Sbjct: 159 LGESHDPVLATQSGPMLLIGGKVHPRFIPTSDSLYVRNGVGVSEDGRTVFLALTRKPINF 218 Query: 205 YDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVERK 253 YDFA + + + V+ L+ DG +S + + I ++R M+ V +K Sbjct: 219 YDFALFFRDTVGVKDALFFDGQVSSLSYRAANIAYRRDRLGPMLLVTKK 267 >UniRef50_A5EW16 Putative uncharacterized protein n=1 Tax=Dichelobacter nodosus VCS1703A RepID=A5EW16_DICNV Length = 263 Score = 201 bits (510), Expect = 3e-50, Method: Composition-based stats. Identities = 78/260 (30%), Positives = 126/260 (48%), Gaps = 19/260 (7%) Query: 12 ITLNLKRIFLALTLLP-LFAVAADDCALSDP------TLTVQAY---TVNPQTERVKMYW 61 + + L++I + + L L AA Q+ P+ ++++ W Sbjct: 1 MLVALRKIIVPVILSSFLLETAAAHLDFKKVAGGNFARFHHQSVDYAVFMPEHDKIRFLW 60 Query: 62 QKANGEAWGTLHALLADINSQG-QVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGE 120 Q GE + T+H L + ++G QV MN GI++++ P GL+IE LN SG+ Sbjct: 61 QNDRGENYQTMHHALRALTNEGYQVHFLMNAGIFNQNAQPAGLWIEKKALLRPLNRRSGK 120 Query: 121 GNFFIRPGGVFYVAGDKVGIVRL-DAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASS 179 GNF I+P GVFY+ +K I+ + +AVQSGP+L+ +G IN R+ N ++ Sbjct: 121 GNFHIQPNGVFYLTQEKAHIITTVQWHNNPPKADYAVQSGPLLIIDGAINSRLPKNHKAA 180 Query: 180 KIRNGVGINKHGNAVFLLS----QQA--TNFYDFACYAKAKLNVEQLLYLDGTISHMYMK 233 RN V ++K F+++ A N Y FA A + +Q LYLDG++S Y+ Sbjct: 181 YKRNAVCVDKARRVYFVITTRYDDGAHFPNLYRFAH-ALQTIGCQQALYLDGSLSDFYLP 239 Query: 234 GGAIPWQRYPFVTMISVERK 253 + + F MI+V K Sbjct: 240 MESSRFHWQKFAGMIAVVSK 259 >UniRef50_C7PF78 Putative uncharacterized protein n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PF78_CHIPD Length = 273 Score = 197 bits (502), Expect = 2e-49, Method: Composition-based stats. Identities = 71/228 (31%), Positives = 119/228 (52%), Gaps = 8/228 (3%) Query: 34 DDCALSDPTLTVQAYTVNPQTERVKMYWQKANGE-AWGTLHALLADI-NSQGQVQMAMNG 91 + + A VNP + ++W A+ + + ++ AL + + + M NG Sbjct: 46 GEITFTHNGQQYDAIVVNPAVSDISLHWLSADQQTPYKSIQALQDVLLEKKKDILMITNG 105 Query: 92 GIYDESYAPLGLYIENGQQKVALNLA-SGEGNFFIRPGGVFYVAGDKVGIVRLDAF---- 146 G++ ++ P+GL+I G++ ++ A GNF+++P GVFY+ + + Sbjct: 106 GMFMKNNIPVGLFISQGRELRPIDAATDQPGNFYMQPNGVFYLDHTGPHVSTTTDYLKRS 165 Query: 147 KTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA-TNFY 205 + +I A QSGPML+ G+IN + +P + +R+GVGI +GN VF++S++A T FY Sbjct: 166 RAHSKIVAATQSGPMLVSKGIINAKFNPGSVNRNLRSGVGILSNGNVVFIISKEAQTTFY 225 Query: 206 DFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVERK 253 DFA KA+ + LYLDG IS MY+K F MI+V + Sbjct: 226 DFASIFKARFGCKDALYLDGAISKMYLKNSRPGDLNGDFGAMIAVTAR 273 >UniRef50_Q2NAA1 Putative uncharacterized protein n=3 Tax=Erythrobacter RepID=Q2NAA1_ERYLH Length = 277 Score = 197 bits (500), Expect = 3e-49, Method: Composition-based stats. Identities = 67/226 (29%), Positives = 104/226 (46%), Gaps = 11/226 (4%) Query: 33 ADDCALSDPTLTVQA---YTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAM 89 A + A T P R+ G + L A+ Sbjct: 58 AAESACERLTFQEVVLTHCVAVPAKHRITTVL----GPPHRSFAKLAEG--RSSAPVFAV 111 Query: 90 NGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAF--K 147 N G++D P+G Y+E+ ++ ALN G GNF ++P GVFY + + + ++F Sbjct: 112 NAGMFDGDGKPIGYYVEDSERLQALNTNDGAGNFHLKPNGVFYGSNGEWRVRTTESFLAN 171 Query: 148 TSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDF 207 S QF QSGPML+ +G ++P I + S +IRNGVG+++ G A F++S+ +F F Sbjct: 172 VSDRPQFGTQSGPMLLIDGKLHPEISEDGPSRQIRNGVGVDRQGRAHFVISEGPISFGKF 231 Query: 208 ACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVERK 253 A + + N LYLDG +S ++ R P MI VE + Sbjct: 232 ARFFRDVANTPNALYLDGNVSGLWDPANDRMDARAPIGPMIVVETR 277 >UniRef50_A5WGQ7 Periplasmic protein-like protein n=1 Tax=Psychrobacter sp. PRwf-1 RepID=A5WGQ7_PSYWF Length = 309 Score = 193 bits (491), Expect = 5e-48, Method: Composition-based stats. Identities = 61/207 (29%), Positives = 101/207 (48%), Gaps = 8/207 (3%) Query: 53 QTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKV 112 + + Q + E L+ D+ +++ A N G+YD ++AP+G + G+Q + Sbjct: 95 NQPQAAIVDQDKSHEPLYKFDTLIKDLPKDSELKFAANAGMYDGNFAPIGYTVIQGRQIL 154 Query: 113 ALNLASGEGNFFIRPGGVFYVAG-DKVGIVRLDAFKT-----SKEIQFAVQSGPMLMENG 166 +LNL G GNF + P GV + + V I + +A QSGPML+ +G Sbjct: 155 SLNLKQGGGNFHLLPNGVLWWDKANHVHITESTQLDAMLKSGEAKPWYATQSGPMLVIDG 214 Query: 167 VINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGT 226 I+P+ + + S KIRNGVG+ F+ S++ NFY FA + K L+ + L+LDG Sbjct: 215 HIHPKFNSDSTSKKIRNGVGVCDGSQIHFVTSREPVNFYQFARFFKEDLHCDNALFLDGG 274 Query: 227 -ISHMYMKGGAIPWQRYPFVTMISVER 252 S +Y A ++ M+ + Sbjct: 275 VASALYAPDVAAQEEKN-MGVMVGLIE 300 >UniRef50_B2HYZ5 Predicted periplasmic protein n=11 Tax=Acinetobacter RepID=B2HYZ5_ACIBC Length = 204 Score = 191 bits (485), Expect = 2e-47, Method: Composition-based stats. Identities = 61/205 (29%), Positives = 106/205 (51%), Gaps = 4/205 (1%) Query: 12 ITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKA-NGEAWG 70 + + + I + + A+A + + + T E+++++ + + + Sbjct: 1 MKILVLCI-VNFIIFTQSALALEYRQIRNTTDDQFEVIEISNLEQLRLFLKNPQTDQYYK 59 Query: 71 TLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGV 130 + + + + Q+ AMNGG++ ++P+GLYIENG++ LN G GNFF++P GV Sbjct: 60 SFDNIQYQLKACEQLTFAMNGGMFHSGFSPVGLYIENGRESQPLNEDKGWGNFFLQPNGV 119 Query: 131 FYVAGDKVGIVRLDAFKTSK-EIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINK 189 + I+ + +K + +A QSGPML+ NG INP N S KIRNGVG+ K Sbjct: 120 LAWNDKQAVILTTEQYKAKVFQPDYATQSGPMLVINGKINPLFLANSDSKKIRNGVGV-K 178 Query: 190 HGNAVFLLSQQATNFYDFACYAKAK 214 + F++S+ NFY FA + + K Sbjct: 179 NNKLYFVISKNRVNFYSFAQFFQKK 203 >UniRef50_C8N6Z2 Lipoprotein n=1 Tax=Cardiobacterium hominis ATCC 15826 RepID=C8N6Z2_9GAMM Length = 304 Score = 189 bits (481), Expect = 6e-47, Method: Composition-based stats. Identities = 75/219 (34%), Positives = 110/219 (50%), Gaps = 11/219 (5%) Query: 42 TLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQG-QVQMAMNGGIYDESYAP 100 + Y +P V ++W+ A+G A+ L L + G +V MN GIY E+ P Sbjct: 85 NVRYGIYQADPAQ--VSLHWKTADGSAYANLATLKRSLEQSGARVAFLMNAGIYSENDTP 142 Query: 101 LGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKT-SKEIQFAVQSG 159 GL+IE GQ V LN +G+GNF I+P GVFY+ K I A+ + +AVQSG Sbjct: 143 AGLWIERGQTLVPLNRKNGKGNFHIQPNGVFYIERGKARIQTSAAYHIGNHHPDWAVQSG 202 Query: 160 PMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQAT------NFYDFACYAKA 213 P+L+ +G NPR N++S RN V F+L++ +F+ FA + Sbjct: 203 PLLLLDGKPNPRFVKNLSSPHKRNAVCTTADNRLYFILTEDYDLGSEWPSFHRFAEALQH 262 Query: 214 KLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVER 252 L LYLDGT+S Y+ G A + +V +I+V Sbjct: 263 -LGCHDALYLDGTLSGWYIPGIAGTFHWTHYVGIIAVTT 300 >UniRef50_UPI0001744B4D hypothetical protein VspiD_05050 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001744B4D Length = 235 Score = 180 bits (456), Expect = 5e-44, Method: Composition-based stats. Identities = 63/227 (27%), Positives = 106/227 (46%), Gaps = 13/227 (5%) Query: 38 LSDPTLTVQAYTVNPQTE-RVKMYWQKANGEAWGTLHALLADINSQGQ-VQMAMNGGIYD 95 + V+ R+ + W +G+ G+ LL + QG+ ++ A N GIY+ Sbjct: 10 IEFEGAIYHVLRVDRADFSRLDLRWLGQDGKPLGSFGPLLQEAARQGRRIEFATNAGIYE 69 Query: 96 ESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAG-DKVGIVRLDAF-KTSKEIQ 153 P GL I G++ V LNLA GEGNF++ P GVFY+ G++ + ++ + + Sbjct: 70 RGPKPCGLTIAGGKELVPLNLAKGEGNFYLHPNGVFYLDDQTGAGVMTGAEYGQSGLQPR 129 Query: 154 FAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINK-HGNAVFLLSQ------QATNFYD 206 A QSGP+L+ G I+P + N + ++RN VG+ G VF++S F+ Sbjct: 130 LATQSGPILLRQGKIHPAFNFNSPNRRLRNAVGVRASDGQVVFVMSDREDRVKGRVTFHQ 189 Query: 207 FACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRYPFVTMISVER 252 + + L + L+LDG IS ++ F M + + Sbjct: 190 LSRFFLH-LGCQDALFLDGDISDFLFHPPAGAAVTPNTFAGMFVLWK 235 >UniRef50_Q5WVS5 Putative uncharacterized protein n=5 Tax=Legionella RepID=Q5WVS5_LEGPL Length = 258 Score = 165 bits (418), Expect = 1e-39, Method: Composition-based stats. Identities = 53/233 (22%), Positives = 91/233 (39%), Gaps = 27/233 (11%) Query: 11 MITLNLKRIFLALT---------LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYW 61 + + IF +T L P + L +P + + ++ ++ + Sbjct: 17 FLLILALAIFTPMTSYSASDWQELTPGIEYQDLEGGLLNPWSHIHVFRIDLNKNQMALVT 76 Query: 62 QKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEG 121 K + ++ + +++NGG +D + PLGL I N +Q+ L S Sbjct: 77 AKNLAQKNASVDQF----AEHSKALLSINGGFFDHEFNPLGLRINNKKQENPLKRISW-- 130 Query: 122 NFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKI 181 G+FYV +K I + F I FA+QSGP L+ G I P + VA Sbjct: 131 ------WGIFYVKDNKPRITNIRNFHYDSNIDFAIQSGPRLLIRGNI-PSLKAGVAD--- 180 Query: 182 RNGVGINKHGNAVFLLSQQ-ATNFYDFACYAKA-KLNVEQLLYLDGTISHMYM 232 R +GI G + L++ A + A ++ L+ + LDG S Sbjct: 181 RTALGITDDGKVIILVTTNAAMSTRQLAQIMRSPPLSCSDAINLDGGSSSQLY 233 >UniRef50_D1REU9 Putative uncharacterized protein n=1 Tax=Legionella longbeachae D-4968 RepID=D1REU9_LEGLO Length = 260 Score = 164 bits (414), Expect = 3e-39, Method: Composition-based stats. Identities = 54/223 (24%), Positives = 93/223 (41%), Gaps = 19/223 (8%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQ 84 L P + P V + V+ + ++ + K + ++ + Sbjct: 40 LSPGIEYQDLAGGILAPWSHVYVFRVDLKKNKLGLVNAKNLSLKYASV----NQFAEHSK 95 Query: 85 VQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLD 144 +++NGG +D + PLGL I NG+ + L S GVF++ +K I L Sbjct: 96 ALLSINGGFFDHKFNPLGLRITNGKLENPLKRISW--------WGVFFIKNNKAYISSLR 147 Query: 145 AFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQ-ATN 203 F+ +I FA+QSGP L+ N I P + P +A R+ +GI G + L++ A Sbjct: 148 QFQYDNDIDFAIQSGPRLLVNRKI-PSLKPGIAE---RSALGITADGKIILLVTTNAAMT 203 Query: 204 FYDFACYAKA-KLNVEQLLYLDGTISH-MYMKGGAIPWQRYPF 244 A ++ L+ + LDG S +Y G+ + F Sbjct: 204 TNKLAHLLRSPPLSCMDAINLDGGSSSQLYAHIGSFLLNVHGF 246 >UniRef50_D1CZ42 Putative uncharacterized protein n=1 Tax=Brucella sp. 83/13 RepID=D1CZ42_9RHIZ Length = 248 Score = 160 bits (406), Expect = 3e-38, Method: Composition-based stats. Identities = 46/167 (27%), Positives = 81/167 (48%), Gaps = 10/167 (5%) Query: 96 ESYAPLGLYIENGQQKVALNLASGEG------NFFIRPGGVFYVAGDKVGIVRLDAF-KT 148 ++PLGL+I +G+++ + A + NF+ +P G+F++ G++ + F K Sbjct: 82 AGFSPLGLFIADGKEQSPIQPAGAKTSDKPVPNFYKKPNGIFFLDESGAGLLPTEQFVKR 141 Query: 149 SKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFA 208 ++ A QSGPML+ +NP A R+GVG+ K G F++S A NF+DFA Sbjct: 142 RPKVWLATQSGPMLVIENRLNPIFIIGSADKSRRSGVGVCKDGVIHFVVSDDAVNFHDFA 201 Query: 209 CYAKAKLNVEQLLYLD-GTISHMYMKGGAIPWQ--RYPFVTMISVER 252 + + +L L+LD G + +Y + M ++ Sbjct: 202 RFFRDRLECPNALFLDGGGGAGLYDPALGRNDMSWHGGYGPMFALIE 248 >UniRef50_A9KDD2 Hypothetical exported protein n=6 Tax=Coxiella burnetii RepID=A9KDD2_COXBN Length = 255 Score = 153 bits (387), Expect = 5e-36, Method: Composition-based stats. Identities = 47/227 (20%), Positives = 86/227 (37%), Gaps = 22/227 (9%) Query: 26 LPLFAVAADDCALSDPTL--TVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQG 83 + V + S P L + A+ +NP+ + ++ A Sbjct: 36 MAYTVVTPAFSSESRPGLFTHLYAWKINPRQYHFNIVTA----KSLQQTALYAAQAAKIK 91 Query: 84 QVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRL 143 +A+NGG + + PLGL I + + +L S G+F + ++ I Sbjct: 92 DTVLAINGGFFTPNLEPLGLRISDNKVLSSLKRISW--------WGIFMIKNNRAAITSP 143 Query: 144 DAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA-- 201 ++ S EI FA+Q+GP L+ +G I P++ A R+ +G+ G+ + ++ Sbjct: 144 QNYRYSPEINFAIQAGPRLIIDGRI-PQLRGGSAQ---RSALGVTPTGDIIIAITDNNLL 199 Query: 202 TNFYDFACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRYPFVTM 247 A KL L LDG S +++ Q + Sbjct: 200 LTATQLA-ILLQKLGCSNALNLDGGTSSQLFVHTNNFSLQIPSLRPV 245 >UniRef50_A5USB9 Putative uncharacterized protein n=2 Tax=Roseiflexus RepID=A5USB9_ROSS1 Length = 282 Score = 142 bits (357), Expect = 2e-32, Method: Composition-based stats. Identities = 47/221 (21%), Positives = 92/221 (41%), Gaps = 23/221 (10%) Query: 37 ALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE 96 SDP + + A ++P T R+++ + + T + +A+NGG + Sbjct: 70 DSSDPPVPIYAVRLDPATIRLRIRYAPDAPQPLRTW-------FVAHRPLVAVNGGFFTA 122 Query: 97 SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYV-AGDKVGI--VRLDAFKTSKEIQ 153 L + +G G + GG+ +V I +R + + + + Sbjct: 123 ENRATALIVSDGTVY---------GTSYAGFGGMLAAAPDGRVWIQALRDEPYDPNIPLD 173 Query: 154 FAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLS-QQATNFYDFACYA- 211 A+QS PML+ G + I+ N R V I++ G + ++ A + + A + Sbjct: 174 QAIQSFPMLIYPGGVVASINDNGQ-RARRTVVAIDRAGRVLLIVCPTSAFSLQELATWLA 232 Query: 212 KAKLNVEQLLYLDGTISH-MYMKGGAIPWQRYPFVTMISVE 251 + + +++ L LDG S +++ GA+ WQ F + SV Sbjct: 233 SSDMEIDRALNLDGGSSSGIFVNAGAVRWQIDSFAALPSVI 273 >UniRef50_A6LS70 Putative uncharacterized protein n=23 Tax=Clostridium RepID=A6LS70_CLOB8 Length = 356 Score = 134 bits (338), Expect = 2e-30, Method: Composition-based stats. Identities = 35/218 (16%), Positives = 61/218 (27%), Gaps = 30/218 (13%) Query: 45 VQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE-------- 96 +P ++ + + G + I A+NGG + + Sbjct: 141 YYLVVKDPTRVKIGVSSK------LGVEGETTSTIAENNDAIAAINGGAFTDQSSAAQWT 194 Query: 97 --SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQF 154 G+ + G+ KV + F I GV V IQ Sbjct: 195 GNGGLASGIVMTGGEVKVNDVGDNPTTTFGIDKNGVMVVGD------YTVEKLKELGIQE 248 Query: 155 AVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA------TNFYDFA 208 A+ GP L+ NG + + + +G K G+ + L+ + Sbjct: 249 ALSFGPALIINGNMVKINGDGGFGTAPKTAIGQMKDGSIILLVIDGREIGSIGATLKELQ 308 Query: 209 CYAKAKLNVEQLLYLDGTIS-HMYMKGGAIPWQRYPFV 245 +L + LDG S +Y G Sbjct: 309 EIM-HQLGAWNAMNLDGGKSTTLYYYGEVRNKPSNSMG 345 >UniRef50_C3QHD0 Exopolysaccharide biosynthesis protein n=2 Tax=Bacteroides RepID=C3QHD0_9BACE Length = 311 Score = 133 bits (334), Expect = 7e-30, Method: Composition-based stats. Identities = 45/236 (19%), Positives = 80/236 (33%), Gaps = 25/236 (10%) Query: 24 TLLPLF-AVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGE----AWGTLH-ALLA 77 TL P A+ + + + + + V+ + + M E + LA Sbjct: 62 TLAPGVKALEMEILSATGMAVKMFVLEVDLKDTHLTMKASSPKDEGKLKTKQQMTLQALA 121 Query: 78 DINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDK 137 +V A+NG + P G+Y NG + F + G K Sbjct: 122 HDKQGSRVLAAVNGDFFATDGTPQGIYYRNGVCLKNTMTDNVCTFFAV-------TKGKK 174 Query: 138 VGIVRLDAFK-TSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFL 196 I D + EIQ AV LM NG + P+ + + + R +G+ + L Sbjct: 175 AVIGSYDEYDTYKDEIQEAVGGRVRLMTNGNVLPQ---TLTALEPRTAIGVTDNNVVYIL 231 Query: 197 LSQQATNFY-------DFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFV 245 ++ +Y + KA L + + LDG S ++ ++ F Sbjct: 232 VADGRNFWYSNGMRYAEMGAVMKA-LGAKDAINLDGGGSSTFIIRSKAGFEENRFA 286 >UniRef50_D0WLU9 Putative uncharacterized protein n=1 Tax=Actinomyces sp. oral taxon 848 str. F0332 RepID=D0WLU9_9ACTO Length = 447 Score = 133 bits (334), Expect = 7e-30, Method: Composition-based stats. Identities = 43/221 (19%), Positives = 68/221 (30%), Gaps = 28/221 (12%) Query: 40 DPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYA 99 + T+T TV T+ + AN + + + I S A+NG Y + Sbjct: 221 NNTVTYYVATVKL-TDATALKSAFANNQFGRNITQKTSTIASNNNAIFAINGDYY--GFR 277 Query: 100 PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSG 159 G+ I NG +G F+ Y + + + G Sbjct: 278 SSGIVIRNGVVYRDDGARAGLA-FYRDGSVKIYDE-----TSTNGQKLVKEGVWNTLSFG 331 Query: 160 PMLMENGVINPRIHP----------NVASSKIRNGVGINKHGNAVFLLSQQA-------T 202 P L++NG I I ++ ++ R VG K G VF++ Sbjct: 332 PSLVKNGKIVEGIDDVEIDTNFGNHSIQGNQPRTLVGAKKDGTLVFVVVDGRDAGYSRGV 391 Query: 203 NFYDFACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRY 242 + A + LDG S MY G I Sbjct: 392 TMTEAAKIMLEQ-GCVTAYNLDGGGSSTMYFNGEVINEPSN 431 >UniRef50_C9KSV8 N-acetylmuramoyl-L-alanine amidase/putative S-layer protein n=1 Tax=Bacteroides finegoldii DSM 17565 RepID=C9KSV8_9BACE Length = 390 Score = 132 bits (332), Expect = 1e-29, Method: Composition-based stats. Identities = 39/194 (20%), Positives = 69/194 (35%), Gaps = 19/194 (9%) Query: 60 YWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASG 119 + ++ + + A LA S +V A+NG + + P G+Y NG + Sbjct: 183 FLGTSSSYYYVSRDAALAYDKSGSRVLAAVNGDFFAKDGTPQGIYYRNGTCLKGTMTDNV 242 Query: 120 EGNFFIRPGGVFYVAGDKVGIVRLDAFK-TSKEIQFAVQSGPMLMENGVINPRIHPNVAS 178 F I + I D + + IQ AV LM NG + P+ V + Sbjct: 243 CTFFAITKN-------KRAIIGSYDEYDSYKENIQEAVGGRVRLMTNGNVLPQ---TVTA 292 Query: 179 SKIRNGVGINKHGNAVFLLSQQATNFY-------DFACYAKAKLNVEQLLYLDGTISHMY 231 + R +G+ L++ +Y + KA L + + LDG S + Sbjct: 293 LEPRTAIGVTDDNVVYILVADGRNFWYSNGMRYAEMGAVMKA-LGAKNAINLDGGGSSTF 351 Query: 232 MKGGAIPWQRYPFV 245 + ++ F Sbjct: 352 IIRKIAGFEDGRFA 365 >UniRef50_C2JZN3 N-acetylmuramoyl-L-alanine amidase/probable S-layer protein n=2 Tax=Lactobacillus rhamnosus RepID=C2JZN3_LACRH Length = 559 Score = 132 bits (331), Expect = 2e-29, Method: Composition-based stats. Identities = 45/222 (20%), Positives = 78/222 (35%), Gaps = 22/222 (9%) Query: 24 TLLPLFAVAA-DDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTL----HALLAD 78 TL P + S + +NP+ ++ A + A Sbjct: 125 TLTPGVTEQRLTYISQSGTQNKYYSVALNPKNPNTQLLTGTPGDGATSGVQTVSDQASAA 184 Query: 79 INSQGQVQMAMNGGIYD-ESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDK 137 I + QV A+NG ++ S P G I++G + A ++ E F I+ G + ++ Sbjct: 185 IKNGHQVVAAVNGDLFKIASGVPTGNVIKDGVELHAA-TSARESFFGIKKDGTPIIGDEQ 243 Query: 138 VGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLL 197 K ++Q A+ +L+ +G +N S+ R VGI G F++ Sbjct: 244 T------YQKVKGDLQQALGGRNILVADGKVNET-KAIGTDSEPRTAVGIKADGTVFFVV 296 Query: 198 SQQA-------TNFYDFACYAKAKLNVEQLLYLDGTISHMYM 232 + D A + L LDG S Y+ Sbjct: 297 VDGRQAPTSNGLSMVDLANLMIQR-GAVTALNLDGGGSSTYV 337 >UniRef50_A0Q3C5 Conserved protein n=7 Tax=Clostridia RepID=A0Q3C5_CLONN Length = 335 Score = 130 bits (327), Expect = 4e-29, Method: Composition-based stats. Identities = 39/232 (16%), Positives = 67/232 (28%), Gaps = 35/232 (15%) Query: 36 CALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIY 94 + + NP+ +V E G+ + + + A+N G + Sbjct: 105 REIHGDKFKGHLLVIKNPKKIKVGY------NEHLGSKGETTSAMAKRYNSIAAINAGGF 158 Query: 95 -------------DESYAPLGLYIENGQ-QKVALNLASGEGNFFIRPGGVFYVAGDKVGI 140 + + P G+ I NG+ L I G+ V + Sbjct: 159 VANNASSKDANPSETNGNPGGILISNGEIVYNNLRNNEKICIAGITADGILLVGNYNLDE 218 Query: 141 VRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQ 200 + ++ AV GP L+ NG + R +G K G+ +FL+ Sbjct: 219 M------MKLNVKDAVSFGPALIVNGQKTITSGDGGWGTAPRTAIGQRKDGSILFLVIDG 272 Query: 201 ------ATNFYDFACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRYPFV 245 A + + + LDG S MY G I Sbjct: 273 KYIGRLAVTLRELQDILY-EYGAYNAVNLDGGSSSTMYYNGKVISEPYKSTG 323 >UniRef50_B7AQ96 Putative uncharacterized protein n=1 Tax=Bacteroides pectinophilus ATCC 43243 RepID=B7AQ96_9BACE Length = 305 Score = 130 bits (326), Expect = 5e-29, Method: Composition-based stats. Identities = 41/222 (18%), Positives = 70/222 (31%), Gaps = 17/222 (7%) Query: 41 PTLTVQAYTVNPQTERVK-MYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYA 99 Y + Q + A+G + + + I +A+NG Y + Sbjct: 76 REYDTSIYVADIQLADASYLRAGLADGTFGRNVTEVTSQIAQDSNAILAINGDFY--GFR 133 Query: 100 PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQ-- 157 G + NG +GN + V Y G I + + + A Q Sbjct: 134 NKGYVMRNGYLYRETAQQGRQGNS-RQEDLVIYEDGHMDVIEENEVAAQTLKDSGASQIF 192 Query: 158 -SGPMLMENGVINPRIHPNVASS---KIRNGVGINKHGNAVFLLSQQA------TNFYDF 207 GP L++NG I + V S R +G+ + + +S Y Sbjct: 193 SFGPGLIKNGNITVDENSEVEQSMQSNPRTAIGMITPLHYIMAVSDGRTEASEGLTLYQL 252 Query: 208 ACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMIS 249 A K + + LDG S G + + + + IS Sbjct: 253 AQIMKGQ-DCVTAYNLDGGGSSTMWFNGEVVNKPTSYGSKIS 293 >UniRef50_C6XT14 Exopolysaccharide biosynthesis protein n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6XT14_PEDHD Length = 303 Score = 130 bits (326), Expect = 6e-29, Method: Composition-based stats. Identities = 34/213 (15%), Positives = 68/213 (31%), Gaps = 27/213 (12%) Query: 45 VQAYTVNPQTERVKMYWQKANGEAWGTLHALLAD----INSQGQVQMAMNGGIYDES-YA 99 + ++ + VK+ + + +V +NG ++ S Y Sbjct: 72 IFILKIDLKNPDVKLQAATPYDAPGYGSQTVPEMAKYVDAANNRVIAGINGDFFNTSSYV 131 Query: 100 PLGLYIENGQQKVALNLAS------GEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQ 153 PLG+ + G + G I G Y+ D +++ Sbjct: 132 PLGIIYKKGVAIKPAFTDNTDKPQQGLSFLGILANGKPYIGDK-----ETDYPTIKSQLK 186 Query: 154 FAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA-------TNFYD 206 A+ +G L+++ +I ++ + R GVGI F++ N+ + Sbjct: 187 EALGAGVFLVKD---YKKITQSIPTVDPRTGVGITDDDLVYFIVVDGRNFYNSNGINYQE 243 Query: 207 FACYAKAKLNVEQLLYLDGTISHMYMKGGAIPW 239 A V+ + LDG S +M Sbjct: 244 MGKIMYA-FGVKNAVNLDGGGSSTFMIKHPRVD 275 >UniRef50_A0PY15 Conserved protein n=4 Tax=Clostridium RepID=A0PY15_CLONN Length = 436 Score = 130 bits (326), Expect = 6e-29, Method: Composition-based stats. Identities = 41/230 (17%), Positives = 72/230 (31%), Gaps = 33/230 (14%) Query: 37 ALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE 96 + + V P ++++ + + + + + ++I + A+N G + + Sbjct: 200 DIKTNRFNGKMLIV-PNSKKIVIGFNEESP---SKVGKTTSEIAKENNAICAINAGGFTD 255 Query: 97 S------------------YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKV 138 P G+ I NG+ N G N I G F G + Sbjct: 256 DVSGKSAEVVLNPDSGYETRKPCGILIHNGEFVY--NDDKGRKNEKIDIVG-FSKRGKLI 312 Query: 139 GIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLS 198 + I+ AV GP L+ +G + R +G + G+ +FL+ Sbjct: 313 VGKYTLEELKNINIKEAVSFGPALIVDGNPVNILGDGGWGVAPRTAIGQRRDGSVLFLVI 372 Query: 199 QQA------TNFYDFACYAKAKLNVEQLLYLDGT-ISHMYMKGGAIPWQR 241 D K K LDG +S MY K I Sbjct: 373 DGRGFKSMGATIKDVQDIMK-KYGAVNASNLDGGTVSTMYYKDKVINKPC 421 >UniRef50_B2KU41 N-acetylmuramoyl-L-alanine amidase/putative S-layer protein (Fragment) n=1 Tax=Lactobacillus rhamnosus HN001 RepID=B2KU41_LACRH Length = 470 Score = 129 bits (325), Expect = 7e-29, Method: Composition-based stats. Identities = 45/222 (20%), Positives = 78/222 (35%), Gaps = 22/222 (9%) Query: 24 TLLPLFAVAA-DDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTL----HALLAD 78 TL P + S + +NP+ ++ A + A Sbjct: 125 TLTPGVTEQRLTYISQSGTQNKYYSVALNPKNPNTQLLTGTPGDGATSGVQTVSDQASAA 184 Query: 79 INSQGQVQMAMNGGIYD-ESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDK 137 I + QV A+NG ++ S P G I++G + A ++ E F I+ G + ++ Sbjct: 185 IKNGHQVVAAVNGDLFKIASGVPTGNVIKDGVELHAA-TSARESFFGIKKDGTPIIGDEQ 243 Query: 138 VGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLL 197 K ++Q A+ +L+ +G +N S+ R VGI G F++ Sbjct: 244 T------YQKVKGDLQQALGGRNILVADGKVNET-KAIGTDSEPRTAVGIKADGTVFFVV 296 Query: 198 SQQA-------TNFYDFACYAKAKLNVEQLLYLDGTISHMYM 232 + D A + L LDG S Y+ Sbjct: 297 VDGRQAPTSNGLSMVDLANLMIQR-GAVTALNLDGGGSSTYV 337 >UniRef50_C7TED9 N-acetylmuramoyl-L-alanine amidase n=2 Tax=Lactobacillus rhamnosus RepID=C7TED9_LACRG Length = 1561 Score = 129 bits (324), Expect = 9e-29, Method: Composition-based stats. Identities = 36/200 (18%), Positives = 71/200 (35%), Gaps = 20/200 (10%) Query: 45 VQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINS----QGQVQMAMNGGIYDE-SYA 99 + ++P+ + N + + N+ QV A+N Y+ + A Sbjct: 140 YYSVALDPKNPNTTLLAGMPNDGTKPGMQTVRNQANAAISHGQQVVAAVNADYYNMATGA 199 Query: 100 PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSG 159 PLG ++NG + + + E F I+ G + + ++Q AV Sbjct: 200 PLGNVVKNGTEIYSA-PDTNEAFFGIKKDGTPMIG------TAATYQQRKGDLQQAVGGP 252 Query: 160 PMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA-------TNFYDFACYAK 212 + +++G +N ++ VGI G F++ + DFA Sbjct: 253 SIFVKDGKVNATQVAGSEGNEPCTAVGIKADGTVFFVVIDGRQAPLSTGISVGDFAKLMI 312 Query: 213 AKLNVEQLLYLDGTISHMYM 232 + L+LDG S ++ Sbjct: 313 ER-GAVNALFLDGGGSATFV 331 >UniRef50_Q97FU3 Uncharaterized conserved protein, YOME B.subtilis ortholog n=6 Tax=Clostridium RepID=Q97FU3_CLOAB Length = 354 Score = 127 bits (320), Expect = 3e-28, Method: Composition-based stats. Identities = 40/238 (16%), Positives = 72/238 (30%), Gaps = 38/238 (15%) Query: 34 DDCALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGG 92 + + + + +P ++ + G ++I A+NGG Sbjct: 117 ECKKIQGNKFSGLMLVIHDPTKVKIGYTSK------LGVEGETTSEIAKHNNALAAVNGG 170 Query: 93 IYDESYA------------PLGLYIENGQQKVALNLAS---GEGNFFIRPGGVFYVAGDK 137 + E+ + P G+ I +G+ N +G I GV V Sbjct: 171 GFQENSSGSKVVWTGTGALPTGIIISDGKVVYPKNPDQLSIQKGTAAITKSGVLVVGDHS 230 Query: 138 VGIVRLDAFKTSKEIQFAVQSGPMLMENG--VINPRIHP--NVASSKIRNGVGINKHGNA 193 + + ++ + A+ GP L+ NG + ++ R +G K G Sbjct: 231 IREL------LNENVVEAINFGPTLIVNGVDQTRDSFGNSIDSQGAQPRTAIGQRKDGAI 284 Query: 194 VFLLSQQATNFYDFACY-----AKAKLNVEQLLYLDGTIS-HMYMKGGAIPWQRYPFV 245 + L A + N + LDG S MY G I F Sbjct: 285 LLLTVDGRQGLQMGATIKDIQKIMEQENAYNAVNLDGGASTTMYYNGHVINNPCDKFG 342 >UniRef50_B8I4Q1 Putative uncharacterized protein n=3 Tax=Clostridium RepID=B8I4Q1_CLOCE Length = 346 Score = 127 bits (319), Expect = 3e-28, Method: Composition-based stats. Identities = 31/227 (13%), Positives = 59/227 (25%), Gaps = 27/227 (11%) Query: 34 DDCALSDPTLTVQAYTVN-PQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGG 92 + + + V+ P +V + + I + A+NGG Sbjct: 120 EYFDVESRNFKGKMIIVDDPTRIKVGYSSKMPRS------GETTSSIARRNGAVAAINGG 173 Query: 93 IY------DESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGI-VRLDA 145 + +G I NG+ N + + + + + + A Sbjct: 174 GFIDKGWAGTGGVAIGFVISNGKYISGKLT-----NNYTKRDTIAFTKDGMLIVGKHSQA 228 Query: 146 FKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA---- 201 I+ + GP L+ NG R +G + G+ + L+ Sbjct: 229 ELAKYNIKEGISFGPPLIVNGKPTINKGDGGWGISPRTAIGQKEDGSVMLLVIDGRSLKS 288 Query: 202 --TNFYDFACYAKAKLNVEQLLYLDGTIS-HMYMKGGAIPWQRYPFV 245 + LDG S MY G + Sbjct: 289 FGATLKEVQDIMLEH-GAVNAANLDGGSSATMYYDGKVVNTPSDALG 334 >UniRef50_A9B1E5 Putative uncharacterized protein n=1 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9B1E5_HERA2 Length = 272 Score = 126 bits (317), Expect = 6e-28, Method: Composition-based stats. Identities = 42/238 (17%), Positives = 84/238 (35%), Gaps = 26/238 (10%) Query: 22 ALTLLPLFAVAADDCALSDPTL---TVQAYTVNPQTERVKMYWQKANGEAWGTLHALLAD 78 T+ + + VQ V+P R+++ + A+ G + A Sbjct: 46 PTTIDNQWQTLEPGLEFREIGYDITNVQILRVDPAYFRLRVGYDVASP---GRVSEWAAA 102 Query: 79 INSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKV 138 + +NGG +D L I +G G + GG+ V Sbjct: 103 LKP----VAVINGGYFDAQGRATALTIFDGVI---------NGTSYDGFGGMLAVDSADG 149 Query: 139 GIVRL---DAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVF 195 +R + +++ + A+QS PML+ +G + + + R+ V I++ G + Sbjct: 150 WSLRSLREQPYDSTEVLNQALQSAPMLVVHGAAIEQPNDDGD-RARRSVVAIDQTGRLLL 208 Query: 196 LLSQQA-TNFYDFACYA-KAKLNVEQLLYLDGTISH-MYMKGGAIPWQRYPFVTMISV 250 ++ D + + K L ++ L LDG S + + + V + V Sbjct: 209 MVCSWPSFTLTDLSQWLVKQDLAIDAALNLDGGSSTGLVVASENRSFNLDSLVRVPQV 266 >UniRef50_C4G6X0 Putative uncharacterized protein n=2 Tax=Lactobacillales RepID=C4G6X0_ABIDE Length = 345 Score = 123 bits (309), Expect = 5e-27, Method: Composition-based stats. Identities = 38/214 (17%), Positives = 63/214 (29%), Gaps = 22/214 (10%) Query: 41 PTLTVQAYTVNPQTERVK-MYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYA 99 Y + + + A + + + + +A+NG Y + Sbjct: 120 RKNNTTVYVADIKLSDSSYLKTALAYDSFGTNVTETTSSMATNNNAILAVNGDYYGADRS 179 Query: 100 PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAV--- 156 G I+NG + E P Y G I + V Sbjct: 180 --GYVIKNGVIYRNTVRSDSE-----YPDLAVYKDGSFKIIYETEVTAEELLADGVVNLF 232 Query: 157 QSGPMLMENGVINPRIHPNVAS---SKIRNGVGINKHGNAVFLLSQQA------TNFYDF 207 GP L+ENG I+ + V R +GI + + ++S + Y+ Sbjct: 233 AFGPSLVENGEISVDQNTEVRQAMTKNPRTAIGIVDKNHYILVVSDGRTSESEGLSLYEL 292 Query: 208 ACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQ 240 A K + LDG S MY G + Sbjct: 293 AEVLK-EYGATTAYNLDGGGSSTMYFNGNIVNNP 325 >UniRef50_Q892K3 N-acetylmuramoyl-L-alanine amidase/putative S-layer protein n=1 Tax=Clostridium tetani RepID=Q892K3_CLOTE Length = 708 Score = 123 bits (309), Expect = 6e-27, Method: Composition-based stats. Identities = 40/195 (20%), Positives = 68/195 (34%), Gaps = 21/195 (10%) Query: 53 QTERVKMYWQKANGEAWGTLHALL----ADINSQGQVQMAMNGGIY-DESYAPLGLYIEN 107 + RV + N + + +++ A I S V +NG Y + P+G+ +N Sbjct: 92 TSSRVGVKAGTPNNKDSYGMQSVIMQAKASIASGDNVVGGVNGDFYYTVTGEPIGIVYKN 151 Query: 108 GQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGV 167 G+ A N A+ F + G + K + +Q A+ +L+ G Sbjct: 152 GKAVKA-NHAAEWNFFGVLEDGTPIIGDGK------KYNEVKDSLQEALGGNAILVREGR 204 Query: 168 INPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA-------TNFYDFACYAKAKLNVEQL 220 I + + R VGI K G F+ + D A L + Sbjct: 205 IY-QTPSIGGYREPRTAVGIKKDGTIFFVTVDGRQEGHSAGISMPDLAQLMID-LGAVEA 262 Query: 221 LYLDGTISHMYMKGG 235 L LDG S ++ Sbjct: 263 LNLDGGGSSTFVSRK 277 >UniRef50_C0GEE0 Putative uncharacterized protein n=1 Tax=Dethiobacter alkaliphilus AHT 1 RepID=C0GEE0_9FIRM Length = 379 Score = 122 bits (307), Expect = 9e-27, Method: Composition-based stats. Identities = 32/201 (15%), Positives = 65/201 (32%), Gaps = 24/201 (11%) Query: 50 VNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQ 109 V+P RV + +G ++ I + +A+N + + P + G+ Sbjct: 181 VDPTKLRVAF-----AHDEYGAPRKPVSKIANSNNAILAINASGFSGN-VPFSPVVREGE 234 Query: 110 QKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVIN 169 + G I G+ +G + ++ + P+L+ NG + Sbjct: 235 VYSMDINHTPMG---ITACGMLMDSGKRGVEQMIED-----GAHQVITFRPVLVRNGQM- 285 Query: 170 PRIHPNVASSKIRNGVGINKHGNAVFLLSQQA-------TNFYDFACYAKAKLNVEQLLY 222 N + R +G ++G+ +F++ N D A + Sbjct: 286 TSTAQNNNTIHPRTAIGQKENGDLIFIVVDGRRNNWSTGINLGDLAQIFIDE-GAAWAYN 344 Query: 223 LDGTIS-HMYMKGGAIPWQRY 242 LDG S +Y G + Sbjct: 345 LDGGGSTTLYFNGKVLNKPSD 365 >UniRef50_A9WEC1 Putative uncharacterized protein n=3 Tax=Chloroflexus RepID=A9WEC1_CHLAA Length = 265 Score = 122 bits (307), Expect = 9e-27, Method: Composition-based stats. Identities = 40/222 (18%), Positives = 84/222 (37%), Gaps = 26/222 (11%) Query: 34 DDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGI 93 L P L VQ ++P R + + + L A + G A+NGG Sbjct: 55 AFRQLEAPGLPVQVVRIDPAHVRFVVGYDPTSPLT------LSAWVARYG-AVAAINGGF 107 Query: 94 YDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKV---GIVRLDAFKTSK 150 +D+ P+ L I N Q G ++ GG+F + + + + Sbjct: 108 FDQQGEPVALLISNQQVF---------GYSYVDQGGMFAIDEQGKPHLWSLADQPYDGTP 158 Query: 151 EIQFAVQSGPMLME-NGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA-TNFYDFA 208 A+Q P+L+ NG + R+ + ++++G + +++ A + +++ Sbjct: 159 -FVQAIQGWPLLVRTNGE--AAYTDDDGQRARRSAIALDRNGYVLLIVAPGATFSLAEWS 215 Query: 209 CYAKA-KLNVEQLLYLDGTISH-MYMKGGAIPWQRYPFVTMI 248 + + L++E + LDG S + + + F + Sbjct: 216 QFLASADLDIEIAVNLDGGSSSGLIAQSDQGGVRVDSFTPLP 257 >UniRef50_C1I4R7 Putative uncharacterized protein (Fragment) n=1 Tax=Clostridium sp. 7_2_43FAA RepID=C1I4R7_9CLOT Length = 894 Score = 122 bits (306), Expect = 1e-26, Method: Composition-based stats. Identities = 33/203 (16%), Positives = 67/203 (33%), Gaps = 22/203 (10%) Query: 42 TLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLAD----INSQGQVQMAMNGGIYDE- 96 + ++ + + V + N E+ L + + V +N Y+ Sbjct: 64 RIESFVIEIDTKNKNVSIEASTPNDESAYGLQPVRKQAEALLAKGENVVAGVNADFYNMA 123 Query: 97 SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAV 156 + P G+ +++G N F I G + + ++ A+ Sbjct: 124 TGEPNGVLLKDGVIIK--NHPESRKFFGILKDGSAVIGD------YNKFNEVKDNVEEAL 175 Query: 157 QSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA-------TNFYDFAC 209 +L+++G + A + R VGI +GN F+ + D A Sbjct: 176 GGNAILVKDGQVFET-PQTGADKEPRTAVGIKSNGNVFFITVDGRQEPYSAGLSMDDLAQ 234 Query: 210 YAKAKLNVEQLLYLDGTISHMYM 232 + + Q L LDG S ++ Sbjct: 235 LMIS-MGAIQALNLDGGGSTTHL 256 >UniRef50_A4VXL8 Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase n=12 Tax=Firmicutes RepID=A4VXL8_STRSY Length = 312 Score = 122 bits (305), Expect = 1e-26, Method: Composition-based stats. Identities = 43/211 (20%), Positives = 71/211 (33%), Gaps = 18/211 (8%) Query: 42 TLTVQAYTVNPQTERVKMYWQKANGEAWGT-LHALLADINSQGQVQMAMNGGIYDESYAP 100 T Y + Q + +GT + A ++ + +A+NG Y + Sbjct: 88 TNNTTVYVADIQVSSPEYLKTALAQNTYGTNVTAKTSETAAANNAILAVNGDYYGAN--S 145 Query: 101 LGLYIENGQQKVALNLASGE-GNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSG 159 G I+NG + G+ I G F V + K + + G Sbjct: 146 TGYVIKNGVLYRDTVRDNAAYGDLAIYADGSFEVIYENEI---TAQELIDKGVVNLLAFG 202 Query: 160 PMLMENGVINPRIHPNVA---SSKIRNGVGINKHGNAVFLLSQQA------TNFYDFACY 210 P L+ENG I V SS R+ +GI + + +++ + Y A Sbjct: 203 PSLVENGEIVVDTSTEVGRAMSSNPRSAIGIIDENHYIIVVADGRTSESQGLSLYQLAEV 262 Query: 211 AKAKLNVEQLLYLDGTISH-MYMKGGAIPWQ 240 K + + LDG S +Y G I Sbjct: 263 MK-QYGAQTAYNLDGGGSSTLYFNGQVINNP 292 >UniRef50_C4FXK4 Putative uncharacterized protein n=1 Tax=Catonella morbi ATCC 51271 RepID=C4FXK4_9FIRM Length = 305 Score = 120 bits (302), Expect = 3e-26, Method: Composition-based stats. Identities = 42/235 (17%), Positives = 76/235 (32%), Gaps = 19/235 (8%) Query: 24 TLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVK----MYWQKANGEAWGTLHALLADI 79 T+ A D A++ T + ++ T +K + A+ + A + Sbjct: 63 TVNTATAYEDDTKAIAIDTYERNSTQIHVATVTIKGDASIKTALADETYGRNVKAKTSTT 122 Query: 80 NSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVG 139 +A+NG Y G I NGQ + + + + I G F + + Sbjct: 123 AQSVNAVLAVNGDYY--GARDAGYVIRNGQLLRSDSQDPNQEDLVIYQDGSFEIIREGDI 180 Query: 140 IVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASS---KIRNGVGINKHGNAVFL 196 +K + GP L+E+ + V + R +GI + V + Sbjct: 181 ---TAQELLNKGAVQVLSFGPALIEDSQVAVDSTDEVGKAMASNPRTAIGIIDDKHYVLV 237 Query: 197 LSQQA------TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFV 245 +S + + A + K +L V LDG S G I + Sbjct: 238 VSDGRTDESKGLSLKELADFMK-ELKVTTAYNLDGGGSSTMYFNGQIINKPTTNG 291 >UniRef50_C2HB28 Exopolysaccharide biosynthesis protein n=4 Tax=Enterococcus faecium RepID=C2HB28_ENTFC Length = 308 Score = 120 bits (300), Expect = 5e-26, Method: Composition-based stats. Identities = 34/219 (15%), Positives = 63/219 (28%), Gaps = 18/219 (8%) Query: 39 SDPTLTVQAYTVNPQTERVKMYWQKANGEAWG-TLHALLADINSQGQVQMAMNGGIYDES 97 S+ Y + +G + + I + Q +A+NG Y Sbjct: 83 SERVDETTVYVADITVSDSSYLKTALANNTYGRNIKETTSAIAQEQQAILAINGDYY--G 140 Query: 98 YAPLGLYIENGQQKVALNLASGEG-NFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAV 156 + G + NG + I G F + + +++Q + Sbjct: 141 FRDKGYVLRNGTLYRDTPSDDETKEDLVIDKNGDFSIIKEAE---TSAEKLVEEDVQQVL 197 Query: 157 QSGPMLMENGVINPRIHPN---VASSKIRNGVGINKHGNAVFLLSQQA------TNFYDF 207 GP L+ENG + S R + + + ++S+ + + Sbjct: 198 SFGPALVENGEVTVSEDEEVSQSMKSNPRTAIAQVGTNHYLVVVSEGRTDDSQGLSLSEL 257 Query: 208 ACYAKAKLNVEQLLYLDGTIS-HMYMKGGAIPWQRYPFV 245 A K + LDG S +Y G I Sbjct: 258 ATVLKNH-GAKTAYNLDGGGSTTLYFNGKVINQTVGGSG 295 >UniRef50_C5S6T1 Putative uncharacterized protein n=1 Tax=Allochromatium vinosum DSM 180 RepID=C5S6T1_CHRVI Length = 272 Score = 119 bits (299), Expect = 7e-26, Method: Composition-based stats. Identities = 42/222 (18%), Positives = 74/222 (33%), Gaps = 20/222 (9%) Query: 22 ALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINS 81 P+ + + T+ + + R+ + + + + Sbjct: 43 PALEAPISHSERTLESSTGRTVRAHLALFDSRRYRLAVLDLGPD---LASASDWPEHTRA 99 Query: 82 QGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIV 141 G + A+NGG + PLGL I G++ GV Y + + Sbjct: 100 AG-LLAAVNGGFFHADGQPLGLVIAGGERLNRFETVK-------LLSGVLYGDARGIHLE 151 Query: 142 RLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA 201 R F++S I VQSGP L+E G + + S R + + + V ++ Sbjct: 152 RRARFQSSPGIDALVQSGPYLVEQGRAVRGLSTHDVSR--RTFIATDWRRHWVLGATRDG 209 Query: 202 TNFYDFACYA-----KAKLNVEQLLYLDGTISH--MYMKGGA 236 + A A VE+ L LDG S ++ G Sbjct: 210 LTLAELAEALATPGALAPWPVERALNLDGGTSTGFLFDPGAG 251 >UniRef50_C6D6X3 Exopolysaccharide biosynthesis protein n=6 Tax=Bacteria RepID=C6D6X3_PAESJ Length = 344 Score = 119 bits (298), Expect = 1e-25, Method: Composition-based stats. Identities = 39/219 (17%), Positives = 67/219 (30%), Gaps = 28/219 (12%) Query: 44 TVQAYTVNPQ-TERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLG 102 + Y + ++ + A + + I S A+NG Y + G Sbjct: 120 MITYYVADVAFNSKMNLLTAFAKDSFGTNITQNTSTIASNNNAVFAINGDYY--GFRSDG 177 Query: 103 LYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPML 162 + I NG G F + D+ D + A GP L Sbjct: 178 VVIRNGTVYRDEPARIGLAMF----NDGTMKSYDEEETSTDDLLAQ--GVTNAFSFGPAL 231 Query: 163 MENGVINPRI----------HPNVASSKIRNGVGINKHGNAVFLLSQQA-------TNFY 205 + +G I + ++ +S R G+G+ + VF++ Sbjct: 232 VTDGEIAGDFSHVEIDKNFGNRSIQNSNPRTGIGMISANHYVFVVVDGRSTGYSRGMTLT 291 Query: 206 DFACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRYP 243 +FA K +L + LDG S MY G + Sbjct: 292 EFADLFK-ELGATEAYNLDGGGSSTMYFMGRVVNNPLGK 329 >UniRef50_D1N9W8 Putative uncharacterized protein n=1 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1N9W8_9BACT Length = 275 Score = 118 bits (297), Expect = 1e-25, Method: Composition-based stats. Identities = 31/197 (15%), Positives = 65/197 (32%), Gaps = 25/197 (12%) Query: 45 VQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLG-- 102 + + T +++ + +G + T+ + A+N G + P+G Sbjct: 61 ISVLRADLSTPGLRLGLAECDGGNYETVSHFGRRL----DALAAVNAGFFAMKGNPMGVR 116 Query: 103 LYIENGQQKVA-LNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPM 161 +G+ A L F + G + IV F + + AV + Sbjct: 117 YLKIDGKVLNADLGGDPERAYFVLDQTG-------RPAIVGPADF-APERCRSAVYGNRL 168 Query: 162 LMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA------TNFYDFACYAKAKL 215 L+++G + P + + R G++ + + ++ F + A K L Sbjct: 169 LLKDGKVPP--LGDDKARHPRTAAGLSGN-TLLLVVIDGRARESAGVTFAELATLLKD-L 224 Query: 216 NVEQLLYLDGTISHMYM 232 + LDG S Sbjct: 225 GCTDAVNLDGGGSSTMW 241 >UniRef50_Q73Q09 Putative uncharacterized protein n=1 Tax=Treponema denticola RepID=Q73Q09_TREDE Length = 293 Score = 118 bits (297), Expect = 1e-25, Method: Composition-based stats. Identities = 35/212 (16%), Positives = 73/212 (34%), Gaps = 28/212 (13%) Query: 35 DCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADI--NSQGQVQMAMNGG 92 D L + A ++ ++K+ + + + + +A+N Sbjct: 58 HIKYEDYPLIIHAVKIDLTNPKLKIVVTEPALFNSKGMVKRETTLSFARRHNTVIALNAA 117 Query: 93 IYDE-------SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDA 145 ++ PLG++I+ F + G + ++ + I+ Sbjct: 118 FFNVISFSFSLRGEPLGIHIDKKINLSKP---------FPKYGALCFLDDNSAFIIESQN 168 Query: 146 F-KTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATN- 203 +I++AV ++++NG P I R VG+ G ++L + N Sbjct: 169 TEDIKADIEYAVSGNRIILKNGK--PIITNISKKENSRTCVGLADGGKTLYLFFAEGENK 226 Query: 204 ------FYDFACYAKAKLNVEQLLYLDGTISH 229 YD A + KL + ++LDG S Sbjct: 227 KKSRGITYDQAHFFMKKLGAQDAIHLDGGGSS 258 >UniRef50_C2KZT9 Exopolysaccharide biosynthesis protein n=2 Tax=Firmicutes RepID=C2KZT9_9FIRM Length = 438 Score = 117 bits (294), Expect = 2e-25, Method: Composition-based stats. Identities = 40/221 (18%), Positives = 78/221 (35%), Gaps = 18/221 (8%) Query: 39 SDPTLTVQAYTVNPQ-TERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDES 97 Y + + T+ + AN + +D+ + +A+NG Y Sbjct: 212 RYRAYDSNIYVADVEVTDGTSILSAFANNTYGRNITDTTSDMAEENNAVLAINGDYYGAR 271 Query: 98 YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQ 157 + G I NG + ++GE + G + +++ K+ + Sbjct: 272 QS--GYVIRNGVVYRS-QGSNGEDMVISKDGSLSFISESD----TTTDSLIQKQTWQVLS 324 Query: 158 SGPMLMENGVINPRIHPNVA---SSKIRNGVGINKHGNAVFLLSQQA------TNFYDFA 208 GP+L+ENG + + V +S R +G + +F++S + Y+ A Sbjct: 325 FGPVLVENGQVAVSENDEVGMAMASNPRTAIGTVAKNHYLFVVSDGRTSESAGLSLYELA 384 Query: 209 CYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMIS 249 + K+ L + LDG S + G + IS Sbjct: 385 NFMKS-LGATNVYNLDGGGSSTMVFQGEVVNNPTTNGNKIS 424 >UniRef50_D1BL19 Putative uncharacterized protein n=4 Tax=Veillonellaceae RepID=D1BL19_VEIPT Length = 312 Score = 117 bits (293), Expect = 4e-25, Method: Composition-based stats. Identities = 32/226 (14%), Positives = 63/226 (27%), Gaps = 32/226 (14%) Query: 38 LSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE 96 + + +P+ +V ++I A+NGG + + Sbjct: 90 IQSARYVGYILEIPDPRRIQVGT------AANIQEKGDTTSNIAKMNNAVAAINGGGFHD 143 Query: 97 ------SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKT-- 148 P G + +G+ + ++ E V +V K G + + Sbjct: 144 PNGTGTGRLPYGFILHDGEYVIGKDVGPDED--------VDFVGFSKAGNLIAGNYNKTQ 195 Query: 149 --SKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYD 206 + + GP L+ +G R +G K G +FL+ Y Sbjct: 196 LGDMKAMEGITFGPPLIVDGKKMITEGDGGWGVGPRTAIGQKKDGTVLFLVIDGRQPGYS 255 Query: 207 FACYAKA------KLNVEQLLYLDGTISH-MYMKGGAIPWQRYPFV 245 + + LDG S +Y+ G + Sbjct: 256 IGATLRDVQDILFEKGCYIAANLDGGSSSTLYLNGKVVNKPADLLG 301 >UniRef50_C8WTH1 N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase-like protein exopolysaccharide biosynthesis protein n=4 Tax=Alicyclobacillus acidocaldarius RepID=C8WTH1_ALIAD Length = 352 Score = 117 bits (293), Expect = 4e-25, Method: Composition-based stats. Identities = 47/243 (19%), Positives = 82/243 (33%), Gaps = 40/243 (16%) Query: 36 CALSDPTLTVQAYTV-NPQTERV-KMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGI 93 L +PT V +P+ RV + GE ++ + G + GG Sbjct: 121 ITLHEPTFNAFILLVKDPKRIRVVATKYLHVRGET------VMQMVQDSGAIAGINAGGF 174 Query: 94 YDESYA-----PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFK- 147 D ++ P G+ I +G+ S +P V + I + Sbjct: 175 VDTNWQGTGAYPQGITITDGKLVSMTGSPS-------QPQPVIAFTKEGQMIAGTYSLNQ 227 Query: 148 -TSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA----- 201 S ++ V GP+L+ENG P + + R +G K G + L++ Sbjct: 228 LRSLDVWQCVGFGPVLVENGK--PTVSAENYAVNPRTAIGQTKDGTVILLVTDGRYATGP 285 Query: 202 ----TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQ------RYPFVTMISVE 251 +F D A + + + LDG S ++ G + + T I V Sbjct: 286 NDVGASFADVARIML-QFHADIAANLDGGSSATFVYKGRMWNRPVDILGARAVATSIVVM 344 Query: 252 RKG 254 +G Sbjct: 345 PEG 347 >UniRef50_B4CZJ8 Putative uncharacterized protein n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4CZJ8_9BACT Length = 251 Score = 116 bits (290), Expect = 7e-25, Method: Composition-based stats. Identities = 50/245 (20%), Positives = 83/245 (33%), Gaps = 41/245 (16%) Query: 16 LKRIFLALTLLPLF------AV---------AADDCALSDPTLTVQAYTVNPQTERVKMY 60 + R F+ L +L L A A + ++ + A V Sbjct: 1 MYRFFVCLLVLALTTQLASAAWVLKESADRPAPTELEFTERHVQGDAGDVTLWVVTF--- 57 Query: 61 WQKANGEAWGTLHALLADI----NSQGQVQMA-MNGGIYDESYAPLGLYIENGQQKVALN 115 A+ + S+ + +A +NGG + PLGL + G + L Sbjct: 58 --NPKACAFAVMDNPTGAFDLGTASEKRGALAGVNGGYFHPDRTPLGLVVRQGVEIHPLE 115 Query: 116 LASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPN 175 A GV V + + R AFK S ++ A+Q+GP L+E P + Sbjct: 116 RAK-------LLSGVLSVMPTTITLQRTGAFKGSSAVREALQAGPFLIEKEKPIPGLEA- 167 Query: 176 VASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKL-----NVEQLLYLDGTISH- 229 R V N G FL+ + T A + + + LDG S Sbjct: 168 -TKEAARTVVFQNAKGRCGFLICKS-TTLAGMADLLATSSIFPEGKIIRAMNLDGGTSTA 225 Query: 230 MYMKG 234 ++++G Sbjct: 226 LWVRG 230 >UniRef50_Q97FU6 Uncharaterized conserved protein, YOME B.subtilis ortholog n=2 Tax=Clostridium RepID=Q97FU6_CLOAB Length = 347 Score = 114 bits (286), Expect = 2e-24, Method: Composition-based stats. Identities = 40/226 (17%), Positives = 69/226 (30%), Gaps = 27/226 (11%) Query: 40 DPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE-- 96 D T + +P ++ Q G + ++ + + A+NGG + + Sbjct: 116 DGKFTANVLIIKDPNRVKIGYAAQ------IGYVGETTREMAKRYKAVAAINGGYFKDTS 169 Query: 97 --------SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDK---VGIVRLDA 145 P G + NGQ + ++ + D VG Sbjct: 170 PNKQSGGVGAIPTGFIMSNGQIVYPQDNSNWSEITSEEENRALTIDKDGNLQVGGTYSPD 229 Query: 146 FKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFY 205 I+ AV + P L++NG N +V+ + R +G + +F++ Sbjct: 230 QLIKSGIREAVITEPYLIKNGK-NTIQANSVSGTNPRTAIGQRADKSIIFMVIDGRQGVK 288 Query: 206 DFA-----CYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRYPFV 245 A KL LDG S MY G I Sbjct: 289 LGATVGDVQVLMHKLGAVNAACLDGGGSTAMYYNGEIINNPSNATG 334 >UniRef50_C1CWE2 Putative LysM lysin domain protein, n=1 Tax=Deinococcus deserti VCD115 RepID=C1CWE2_DEIDV Length = 442 Score = 113 bits (284), Expect = 4e-24, Method: Composition-based stats. Identities = 41/257 (15%), Positives = 87/257 (33%), Gaps = 26/257 (10%) Query: 10 GMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAW 69 GM L +R+ + + A L + VQ V+ + V + + Sbjct: 189 GMKILVAQRVPVPIPPRA-TGKAVTFKQLRPLNIPVQLVRVDLRHRDVLVAPVLPHAGLV 247 Query: 70 GTLHALLADINSQGQVQMAMNGGIYDE-SYAPLGLYIENGQQKVALNLASGEGNFFIRPG 128 L A + + + Q +NG + +YAP G + G+ + P Sbjct: 248 FGLGARVGQLAQRSGAQALINGSYFHPRTYAPAGDIVMQGRML----------TWGRIPM 297 Query: 129 GVFYVAGDKVGI--VRLDAFKTSKEIQF-----AVQSGPMLMENGVINPRIH-----PNV 176 + ++ I + + + + +GP ++ G ++ + P + Sbjct: 298 ALAITPDNRATIRATTTPLLRRPLDTTWRGMETVIATGPRIVTGGAVHTNYNQVFRDPAL 357 Query: 177 ASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTIS-HMYMKGG 235 R+ VG++ + + V + ++ + +L V++ L LDG S + G Sbjct: 358 FGRAARSAVGLSSNRDLVMVSTRVRLTTTEMGKVM-TRLGVKEALLLDGGSSAGLAWNGR 416 Query: 236 AIPWQRYPFVTMISVER 252 A+ I V Sbjct: 417 AVLDSMRKVSYGIGVFT 433 >UniRef50_D2NR45 Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase n=3 Tax=Micrococcineae RepID=D2NR45_9MICC Length = 356 Score = 112 bits (281), Expect = 9e-24, Method: Composition-based stats. Identities = 35/214 (16%), Positives = 69/214 (32%), Gaps = 27/214 (12%) Query: 45 VQAYTVNPQTER-VKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGL 103 V ++ + + + + AN + + +++ S+ A+NG Y + G+ Sbjct: 131 VVSFVADIKLDNATLLRSAFANNKFGQNIIDTPSNMASEHNGIWAINGDYY--GFRTTGI 188 Query: 104 YIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLM 163 I NG G F+ Y S+ + + GP L+ Sbjct: 189 VIRNGVVYRDSGAREGLA-FYRDGSVKLYDE-----TATNAQTLVSEGVWNTLSFGPALV 242 Query: 164 ENGVINPRIHP----------NVASSKIRNGVGINKHGNAVFLLSQQA-------TNFYD 206 ++ I I ++ ++ R GVG+ + VF++ + Sbjct: 243 KDSAIVDGIDSVEVDTNFGNHSIQGNQPRTGVGVLGTNHLVFIVVDGRSTNYSRGVTMPE 302 Query: 207 FACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQ 240 FA K L LDG S + + + Sbjct: 303 FAQMFKD-LGCVSAYNLDGGGSSAMVFNNKLVNR 335 >UniRef50_C8PNM8 Putative uncharacterized protein n=1 Tax=Treponema vincentii ATCC 35580 RepID=C8PNM8_9SPIO Length = 306 Score = 112 bits (281), Expect = 9e-24, Method: Composition-based stats. Identities = 37/223 (16%), Positives = 71/223 (31%), Gaps = 32/223 (14%) Query: 29 FAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKAN----------GEAWGTLHALLAD 78 + A D A L V ++ V + +A GE Sbjct: 66 PGIEAADIADPQLPLIVHIVKIDLLNPSVSVITSEAALFKNTRGRIRGETTRDFALRHNT 125 Query: 79 INSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKV 138 I + N ++ +G++I + ++ N G F+ Sbjct: 126 IAAFNAAPFKTNSLLFSIYRTIVGIHITDFRRMSMPNERYGALLFY---------KDKTA 176 Query: 139 GIVRLDAFKT-SKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLL 197 I+ S ++++AV ++ NG I P+ + R VG+ G +F+ Sbjct: 177 RIIGSQTEDALSADVRYAVGGFWTILRNGTIVPQ---KLHRRDSRTAVGLADSGKTLFVA 233 Query: 198 S--------QQATNFYDFACYAKAKLNVEQLLYLDGTISHMYM 232 + + +F + A + L + L LDG S + Sbjct: 234 AVEGENKRKSRGLSFEETAMLMQ-TLGADDALQLDGGSSSTLV 275 >UniRef50_B8FUP3 Putative uncharacterized protein n=2 Tax=Desulfitobacterium hafniense RepID=B8FUP3_DESHD Length = 350 Score = 111 bits (278), Expect = 2e-23, Method: Composition-based stats. Identities = 37/224 (16%), Positives = 57/224 (25%), Gaps = 28/224 (12%) Query: 37 ALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYD 95 +S V NPQ R+ Q G +++ +N G + Sbjct: 124 EVSGKGFQGYLLKVGNPQRVRLAATDQ------LGDRGLKVSEFVENNHAVAGINAGGFA 177 Query: 96 E------SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTS 149 + P G+ I G+ N + I V V Sbjct: 178 DPGGVSFGGTPTGILITEGKIIHKDNWET-YSLIGITKHDVLVVGR------YTLEQIEE 230 Query: 150 KEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFAC 209 I+ AV GP L+ NG R +G G + L+ Sbjct: 231 LGIRDAVSFGPALIVNGEPMITYGDGGWGIAPRTAIGQTHDGTILLLVIDGRQ-LGSLGA 289 Query: 210 YAKA------KLNVEQLLYLDGTISH-MYMKGGAIPWQRYPFVT 246 K + LDG S + +G P+ Sbjct: 290 TLKDVQDILIEHGAVNGANLDGGSSSTLVYEGEVKNKPSSPYGP 333 >UniRef50_B1BC21 Putative uncharacterized protein n=2 Tax=Clostridium botulinum RepID=B1BC21_CLOBO Length = 326 Score = 111 bits (278), Expect = 2e-23, Method: Composition-based stats. Identities = 37/235 (15%), Positives = 68/235 (28%), Gaps = 35/235 (14%) Query: 38 LSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE 96 L + + NP+ RV + G + ++I A+NGG + + Sbjct: 100 LENSRFKAYLMEISNPKKVRVGY------AKKLGKVGEPTSEIAKDFNAIAAINGGSFTD 153 Query: 97 SYA-----------PLGLYIENGQQK-VALNLASGEGNFFIRPGGVFYVAGDKVGIVRLD 144 + P G+ + +G+ ++ + G + + +R Sbjct: 154 ETSNGTKYSGTGAFPEGVIMSHGKVIWKTVSTNTKIDIIAFNNEGKLILGKYTINELR-- 211 Query: 145 AFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQ---- 200 A+ P L+ +G A R +G K G +FL++ Sbjct: 212 ----KLNCIEALCYKPSLIVDGKKAKIKGDGGAGMAPRTAIGQKKDGTILFLVADGTMFK 267 Query: 201 --ATNFYDFACYAKAKLNVEQLLYLDGTIS-HMYMKGGAIPWQRYPFV--TMISV 250 + K LDG S MY G I + S+ Sbjct: 268 RDGLRMDELQDILYEK-GAYNATNLDGGSSATMYYDGEVINNPCDSVGERPIPSI 321 >UniRef50_UPI000178A82C copper amine oxidase domain protein n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI000178A82C Length = 377 Score = 111 bits (277), Expect = 3e-23, Method: Composition-based stats. Identities = 42/229 (18%), Positives = 80/229 (34%), Gaps = 28/229 (12%) Query: 41 PTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYA- 99 + + Q TV+ +V++ A +A L I + +A+NG +D + Sbjct: 161 RSFSAQVVTVSLLHPKVELDVVLAGNKAGKVED--LRSIAKRSNAVVAINGTFFDAYTSG 218 Query: 100 ----PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFA 155 P G + G + + + + G + + + ++ A Sbjct: 219 AYKAPYGYLVSKGNIFHKASGDNRTIFTYDSNNLATMMPG-----LDFKSVYETGRMEGA 273 Query: 156 VQSGPMLMENGVI----------NPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFY 205 +Q+GP L+ NG + +P+I + R+ +GI K + L + Sbjct: 274 LQAGPRLLTNGKVTLDVKKEGFKDPKILTGGGA---RSALGITKDHKLILLTT-GGATIP 329 Query: 206 DFACYAKAKLNVEQLLYLDGT-ISHMYMKGGAIPWQRYPFVTMISVERK 253 A K + Q + LDG S +Y G + I V+ K Sbjct: 330 QLAEIMK-QAGAYQAMNLDGGASSGLYYNGSYLTTPGRQISNAIVVKYK 377 >UniRef50_Q1IXP5 Peptidoglycan-binding LysM domain-containing protein n=2 Tax=Deinococcus RepID=Q1IXP5_DEIGD Length = 444 Score = 109 bits (273), Expect = 8e-23, Method: Composition-based stats. Identities = 39/230 (16%), Positives = 74/230 (32%), Gaps = 25/230 (10%) Query: 37 ALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE 96 L + VQ V+ + V + A ++ + Q +NG + Sbjct: 217 QLKALNIPVQVLRVDLRHRNVLVAPVLPRTGLGTAGGARVSTLARTSGAQAVVNGSYFHP 276 Query: 97 -SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVR--LDAFKTSKEIQ 153 SYAP G + G+ + P + ++ I+ E+ Sbjct: 277 RSYAPAGDLVVQGRLLA----------WGRIPVALAITPDNRAAIMTSTTPLLGRPLEVS 326 Query: 154 F-----AVQSGPMLMENGVINPRI-----HPNVASSKIRNGVGINKHGNAVFLLSQQATN 203 + + +GP ++ G + + P + R+ VG+ + + VF+ + Sbjct: 327 WHGMETVIATGPRILNGGTVVRQYASAFRDPALFGRAARSAVGLKSNRDLVFVTTHAKLT 386 Query: 204 FYDFACYAKAKLNVEQLLYLDGTIS-HMYMKGGAIPWQRYPFVTMISVER 252 + A+L V L LDG S + G A+ I V Sbjct: 387 TTEMGKVM-ARLGVRDALLLDGGSSAGLAWNGQAVLDSVRKVAYGIGVFT 435 >UniRef50_C6J074 Copper amine oxidase domain-containing protein n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6J074_9BACL Length = 406 Score = 109 bits (272), Expect = 9e-23, Method: Composition-based stats. Identities = 41/224 (18%), Positives = 82/224 (36%), Gaps = 22/224 (9%) Query: 41 PTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESY-- 98 + + Q T++ +V++ A G+A G + L + + + + +A+NG ++ Sbjct: 190 RSFSTQMVTISLMDPKVRLKVALA-GDAVGKVEEL-SSLAKRHKAVVAINGTFFNAYTDN 247 Query: 99 ---APLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFA 155 AP G + G+ K+ + + + GD DAF ++ A Sbjct: 248 AYKAPYGYIVSGGELKMKASGDKRTIFTYDSNLLARLIPGDDF----NDAFNAGT-MEGA 302 Query: 156 VQSGPMLMENGVINPRIHPNV-------ASSKIRNGVGINKHGNAVFLLSQQATNFYDFA 208 +Q+GP L+ NG + + R+ +G+ + + L + A Sbjct: 303 LQAGPRLVVNGKVAVDVKAEGFKDPKILTGGGARSALGLTRDHKLILLTT-GGATIPQLA 361 Query: 209 CYAKAKLNVEQLLYLDGT-ISHMYMKGGAIPWQRYPFVTMISVE 251 K + Q + LDG S +Y G + + V Sbjct: 362 EIMK-QAGAYQAMNLDGGASSGLYYNGKYLTQPGRKISNALIVT 404 >UniRef50_A0YND3 Putative uncharacterized protein n=1 Tax=Lyngbya sp. PCC 8106 RepID=A0YND3_9CYAN Length = 304 Score = 109 bits (272), Expect = 1e-22, Method: Composition-based stats. Identities = 32/217 (14%), Positives = 69/217 (31%), Gaps = 24/217 (11%) Query: 39 SDPTLTVQAYTVNPQTERVKMYW----QKANGEAWGTLHALLADINSQGQVQMAMNGGIY 94 L + ++ T +++ Q + + ++ + +Q+A+NG + Sbjct: 58 KPDPLMIHIVKIDLTTPGIELLVTPGEQGEDDQDIS--AQTTSEFLQKHYLQLAINGSFF 115 Query: 95 DESY--APLGLYIENGQQKVALNLASGEGNFFI---RPGGVFYVAGDKVGIVRLDAFKTS 149 Y P+ Y +G++ A +G + + V ++ K + F T Sbjct: 116 HPFYVHNPIDYYPNSGERVNIFGQAISQGKIYSIVNKGWSVLCISPKKKAEI---YFDTC 172 Query: 150 KEIQFAVQSGPMLMEN-GV-INPRIHPNVASSKIRNGVGINKHG-NAVFLLSQQA----- 201 + +G +++ + G I + + R V I+K G +L Sbjct: 173 PKNTLQGIAGNLILIDQGQPIKVKKFSDANQKFPRTAVAIDKTGETLWLILIDGRQSWYS 232 Query: 202 --TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGA 236 + VE L DG S + Sbjct: 233 KGVTLATLTNIIQELDGVETALNFDGGGSTTLVISEG 269 >UniRef50_D0TN59 Predicted protein n=3 Tax=Bacteroides RepID=D0TN59_9BACE Length = 315 Score = 108 bits (270), Expect = 2e-22, Method: Composition-based stats. Identities = 31/233 (13%), Positives = 66/233 (28%), Gaps = 21/233 (9%) Query: 23 LTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLA----- 77 + L + + + ++ V + Sbjct: 61 VVALGVTETDVHFQKADSRSTHIFIIDIDLNEPGVSLEVGMPYDADVRNNFQRQTLTEMA 120 Query: 78 --DINSQGQVQMAMNGGIYDESYAPL-GLYIENGQQKVALNLASGEGNFFIRPGGVFYVA 134 +V +N +D S + G NG + E + Sbjct: 121 DYADRPWHRVAAMINADFWDVSTMDIRGPIHRNGVILKNSFIFK-ETLPQQALSFIALTK 179 Query: 135 GDKVGIVRLDAFK-TSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNA 193 +K+ I ++ ++ SG +++ +G I+ +P + R +G + G+ Sbjct: 180 DNKMVIADSVEYRGMQYNLKEVTGSGVIVLRDGEISGATYPGID---PRTCLGYSDDGHV 236 Query: 194 VFLLSQQATNFY-------DFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPW 239 F+++ FY + KA L + LDG S + I Sbjct: 237 YFMVADGRVEFYSYGLTYPEMGSIMKA-LGCSWAVNLDGGGSTQMLIRHPIAD 288 >UniRef50_B0TEY5 Putative uncharacterized protein n=1 Tax=Heliobacterium modesticaldum Ice1 RepID=B0TEY5_HELMI Length = 327 Score = 108 bits (270), Expect = 2e-22, Method: Composition-based stats. Identities = 39/223 (17%), Positives = 65/223 (29%), Gaps = 27/223 (12%) Query: 37 ALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIY- 94 + T + V +P +V + + G + ++ + A+NGG + Sbjct: 98 DIQGYRFTGKVMIVHDPLRIKVAVSSK------LGEAGETVPEMARREGAVAAINGGGFI 151 Query: 95 DESYA-----PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTS 149 D + P G+ + GQ ++ E I G V +R Sbjct: 152 DPNGQGNGAYPDGITVSRGQFISVIDEDQKENIIGITKKGQMIVGRYSARELRS------ 205 Query: 150 KEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQAT------N 203 +I V GP L+ NG R G+G G+ + ++ Sbjct: 206 MDISEVVTFGPPLVVNGRPTITSGDGGWGVAPRTGIGQRSDGSIIMVVIDGRQIGSIGAT 265 Query: 204 FYDFACYAKAKLNVEQLLYLDGTIS-HMYMKGGAIPWQRYPFV 245 + K LDG S M G I F Sbjct: 266 LRELQDLLL-KYGAVTAGNLDGGASTTMVYNGKVINQPSSVFG 307 >UniRef50_A7V127 Putative uncharacterized protein n=1 Tax=Bacteroides uniformis ATCC 8492 RepID=A7V127_BACUN Length = 277 Score = 106 bits (265), Expect = 7e-22, Method: Composition-based stats. Identities = 36/224 (16%), Positives = 74/224 (33%), Gaps = 28/224 (12%) Query: 44 TVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE-SYAPLG 102 V + ++P+ R + E + A+NG +D + + Sbjct: 59 EVSIFEISPKRYRFDVLVHNPKEET--------SIAARHAGAVAAINGSYFDMKAGNSVC 110 Query: 103 LYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVR---LDAFKTSKEIQFAVQSG 159 ++G + G G + ++ ++ + + + + SG Sbjct: 111 YLRKDGVVIDTTST----GVLATVSNGAVLIKKGRLELIPWSKQEEKACTLKKGTVLASG 166 Query: 160 PMLMENGVINPRIHPN---VASSKIRNGVGINKHGNAVFLLSQQA-------TNFYDFAC 209 P+++++G + N V + R+ V + + G + ++ N + A Sbjct: 167 PLMLKDGQVCDLSGTNRNFVDTKHPRSAVALTREGKILLIVVDGRRKGKAEGINIPELAH 226 Query: 210 YAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVERK 253 L E L LDG S G A+P + S ERK Sbjct: 227 -MIRILGGEDALNLDGGGSSTLWSG-ALPDKGIANTPSGSAERK 268 >UniRef50_B2J8B3 Putative uncharacterized protein n=1 Tax=Nostoc punctiforme PCC 73102 RepID=B2J8B3_NOSP7 Length = 276 Score = 106 bits (265), Expect = 7e-22, Method: Composition-based stats. Identities = 37/195 (18%), Positives = 64/195 (32%), Gaps = 32/195 (16%) Query: 74 ALLADINSQGQVQMAMNGGIYDESYAPLGLYI-----------ENGQQKVALNLASGEGN 122 A + + + + +N G +D + Y+ EN + NL S Sbjct: 59 ATVEEFAQKHRAVAILNAGFFDPANQKTTSYVILQRKLVADPKENERLVNNPNLKSYLSQ 118 Query: 123 FFIRPGGVFYVAGDKVG---IVRLDAFKTSKEIQFAVQSGPMLM------ENG---VINP 170 F R Y G V ++ + ++ A+ +GP L+ + G N Sbjct: 119 IFNRTEFRRYSCGQTVRYDIVLHSASQPAGCQLVDAIGAGPSLLPELTLEKEGFVDNANK 178 Query: 171 RIHPNVASSKIRNGVGINKHGNAVFLLSQ-------QATNFYDFACYAKAKLNVEQLLYL 223 R R VGI G+ V ++ + A + K L ++ + L Sbjct: 179 RDALGSNQPNARTAVGITHDGSVVLVMVAQKPSAPANGISLPALANFMK-TLGADKAMNL 237 Query: 224 DGT-ISHMYMKGGAI 237 DG S +Y G Sbjct: 238 DGGSSSSLYYNGKTF 252 >UniRef50_C0ZEQ6 Putative uncharacterized protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0ZEQ6_BREBN Length = 356 Score = 106 bits (264), Expect = 8e-22, Method: Composition-based stats. Identities = 35/198 (17%), Positives = 58/198 (29%), Gaps = 16/198 (8%) Query: 46 QAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYI 105 Y +P R+ + +K G+ I A + Y + G+ I Sbjct: 144 IMYISDPSRVRLVVTNRKDRGDLLDEFVNKTGAIGIVNASGFA-DPDGYGKGARAYGVVI 202 Query: 106 ENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMEN 165 G+ N SGE + G ++ AV P L+ N Sbjct: 203 HEGKILQGYNPRSGETALGLTYDGKLITG------SYSAEQLVKMGVRDAVSFRPQLIVN 256 Query: 166 GV-INPRIHPNVASSKIRNGVGINKHGNAVFLLSQQAT-------NFYDFACYAKAKLNV 217 G + + R +G + G VF + + D A + V Sbjct: 257 GKNMFEGKPAKSWGIQPRTAIGQKEDGTIVFAVIDGRQPGHSIGASMNDMAELLAER-GV 315 Query: 218 EQLLYLDGTISHMYMKGG 235 + +DG S M + G Sbjct: 316 VTAMAMDGGSSSMMLHNG 333 >UniRef50_A7GCS1 Putative uncharacterized protein n=12 Tax=Clostridium RepID=A7GCS1_CLOBL Length = 339 Score = 106 bits (264), Expect = 1e-21, Method: Composition-based stats. Identities = 38/222 (17%), Positives = 64/222 (28%), Gaps = 37/222 (16%) Query: 37 ALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYD 95 ++ + NPQ ++ + G + + + + A+NGG + Sbjct: 116 DINTAKFDGYILEIKNPQKVKIGYT------KYMGKMGERTSKMAERHGAVAAVNGGGFR 169 Query: 96 E-----------SYAPLGLYIENGQQKVALNLASGEGNF-FIRPGGVFYVAGDKVGIVRL 143 + P GL I NG+ + + N G+ V V + Sbjct: 170 DVSSTGKLWTGTGAYPEGLVISNGKVIYNDFKSGQKVNVTAFTKEGLLVVGDHTVDEL-- 227 Query: 144 DAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA-- 201 + A+ L+ NG P R +G + G V L+ Sbjct: 228 ----LKMGVVEALSFRNTLIINGKPIPY----NEGINPRTAIGQKQDGTIVLLVIDGRRG 279 Query: 202 ----TNFYDFACYAKAKLNVEQLLYLDGTISH-MYMKGGAIP 238 + + V LDG S MY KG I Sbjct: 280 IKQGATLEEVENILLQR-GVVNASNLDGGSSSTMYYKGKVIN 320 >UniRef50_C3R3M8 Putative uncharacterized protein n=1 Tax=Bacteroides sp. 2_2_4 RepID=C3R3M8_9BACE Length = 329 Score = 105 bits (262), Expect = 1e-21, Method: Composition-based stats. Identities = 33/209 (15%), Positives = 59/209 (28%), Gaps = 53/209 (25%) Query: 78 DINSQGQVQMAMNGGIYDES-YAPLGLYIENGQQK---------VALNLASGEGNFFIRP 127 Q + + MNGG + + L G+ + G F + Sbjct: 101 WQAEQQKYPIIMNGGYFVMGAGKSVSLLCREGEVLAVNSQEEIRSQKSYYPTRGIFQLSK 160 Query: 128 GGVF-----YVAGDKVGIVRLDA------FKTSK-------------EIQFAVQSGPMLM 163 G F Y D V ++ + A+ GP+L+ Sbjct: 161 NGYFSTDWAYTTTDGVTYTYEQPSPNKSGYEPQPAPSAYFPTRGVKLNAETAIGGGPILL 220 Query: 164 ENGVINPRIHPNV---------ASSKIRNGVGINKHGNAVFLLSQQA--------TNFYD 206 ++G + + S R +G+ + +F + + N Sbjct: 221 KDGSVRNTFIEELFDEESGVAPESYHPRTAIGVTANNKVIFFVCEGRSVTEGVKGMNMAM 280 Query: 207 FACYAKAKLNVEQLLYLDGTISH-MYMKG 234 A K+ L + LDG S M + G Sbjct: 281 MANILKS-LGCVDAMNLDGGGSTCMLVNG 308 >UniRef50_Q4UP44 Putative uncharacterized protein n=4 Tax=Bacteria RepID=Q4UP44_XANC8 Length = 439 Score = 105 bits (261), Expect = 2e-21, Method: Composition-based stats. Identities = 39/257 (15%), Positives = 76/257 (29%), Gaps = 30/257 (11%) Query: 22 ALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHA-LLADIN 80 LTL P + P + + ++ T +++ + G A Sbjct: 173 PLTLAPGVRYWRQAIGGAQP-VMLHIAQIDLTTPGLQLVGTPGDRSDGGEFRATPTTAFV 231 Query: 81 SQGQVQMAMNGGIY---------DESYAPL-GLYIE-NGQQKVALNLASGEGNFFIRPGG 129 G + +A+N + D+ + P G + G A S R Sbjct: 232 RDGALTLAINADYFLPFDGGHLLDKPFVPAAGQGVTAEGLAIEAGRTDSAAATSDPRVNA 291 Query: 130 VFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNV---ASSKIRNGVG 186 V+ + + + V +GP+L+ +G PR + R+ VG Sbjct: 292 ALCVSQRDAVRIVRGS--CPAGSRLGVGAGPLLLLDGKRQPREASRAAYYDGPEPRSAVG 349 Query: 187 INKHGN-AVFLLSQQA-------TNFYDFACYAKAKLNVEQLLYLDGTISHMY---MKGG 235 +++ G+ +++ +L + LDG S + G Sbjct: 350 LDRSGHTLWMVVADGRQPGYSAGMTLDALTAVF-EQLGAHAAINLDGGGSSTLAARVDGD 408 Query: 236 AIPWQRYPFVTMISVER 252 R + ER Sbjct: 409 VRALNRPIHTGIPGRER 425 >UniRef50_C6IYX5 Putative uncharacterized protein n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6IYX5_9BACL Length = 347 Score = 104 bits (260), Expect = 2e-21, Method: Composition-based stats. Identities = 36/222 (16%), Positives = 71/222 (31%), Gaps = 29/222 (13%) Query: 37 ALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYD 95 +S + TV +P R+ + ++ GE ++ A + +NGG + Sbjct: 100 EISGKSYHGYVLTVNDPTKIRLGVPAKRGKGEKVSSMVARTGALA-------GVNGGGFA 152 Query: 96 E------SYAPLGLYIENGQQK-VALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKT 148 + + P+G+ I G+ ++ + + G + + Sbjct: 153 DPNWKGNGFKPIGVVISRGKLYYNGISSGAATQIVGLDKQGKMIAGKYTLEELD------ 206 Query: 149 SKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQAT------ 202 IQ AV P ++ NG R R +G + G +F++ Sbjct: 207 KLGIQEAVTFQPRIIVNGKGQIRSQKEGWGIAPRTAMGQREDGAILFVVIDGRQPGYSIG 266 Query: 203 -NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYP 243 + YD + LDG S + +K G + Sbjct: 267 ASLYDVQQIMLER-GAVIAANLDGGSSTVLVKEGGEIVNKPS 307 >UniRef50_Q8A0T0 Putative uncharacterized protein n=10 Tax=Bacteroides RepID=Q8A0T0_BACTN Length = 308 Score = 104 bits (259), Expect = 3e-21, Method: Composition-based stats. Identities = 41/214 (19%), Positives = 75/214 (35%), Gaps = 36/214 (16%) Query: 42 TLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDES---- 97 T ++ +NP+T K G A+ ++ I + Q A+NG +D + Sbjct: 82 TQSINILEINPKT-------GKKIGIAFTGQLEKISRIARKHQAIGAINGSYFDMTKGNS 134 Query: 98 --YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFA 155 + +G + + L L G + + G V + D+ +K +K A Sbjct: 135 VCFLKVGSQVVDTTSLDELKLRV-TGAVYEKKGKVKLIPWDR---QIEKNYKKNKGSVLA 190 Query: 156 VQSGPMLMENG------VINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA-------T 202 SGP+++++G N R+ + + + G +F+ Sbjct: 191 --SGPLMLKDGEYYDWSQCNANFIET---KHPRSAICLTEEGKILFVTVDGRSPENAVGI 245 Query: 203 NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGA 236 N + A L + L LDG S GA Sbjct: 246 NIPELAHLL-HVLGGKDALNLDGGGSTALWLSGA 278 >UniRef50_UPI0001BC3362 hypothetical protein BcroD2_01243 n=1 Tax=Butyrivibrio crossotus DSM 2876 RepID=UPI0001BC3362 Length = 356 Score = 103 bits (258), Expect = 4e-21, Method: Composition-based stats. Identities = 39/227 (17%), Positives = 64/227 (28%), Gaps = 27/227 (11%) Query: 37 ALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYD 95 +S T V +P + +G+ G + I + +A+N G ++ Sbjct: 137 EVSGSTFAGTMVVVTDPSR-----VFVGTSGDYKGEAGINVPAICDKYGATLAINAGGFE 191 Query: 96 E------SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTS 149 + PLG+ + GQ K N+ S VF + Sbjct: 192 DIGGVGNGGTPLGIVMSEGQLKYG-NVNSSYDLIGFDNNNVFVIGQ------MTGQQAID 244 Query: 150 KEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFAC 209 + I+ AV GP L+ NG R +G G + L+ + Sbjct: 245 RGIRDAVSFGPFLILNGTPLEVSGMGG-GLNPRTAIGQRADGAVLLLIIDGRQT-HSLGA 302 Query: 210 YAKA------KLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISV 250 LDG S + G I + V Sbjct: 303 SMNDLINVMLDFGAVNAANLDGGGSTVLYYDGEIKNKISSIYGARGV 349 >UniRef50_B2UNL7 Putative uncharacterized protein n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UNL7_AKKM8 Length = 249 Score = 102 bits (254), Expect = 1e-20, Method: Composition-based stats. Identities = 42/201 (20%), Positives = 76/201 (37%), Gaps = 19/201 (9%) Query: 40 DPTLTVQAYTVNPQTERVKMYWQKANGEA-WGTLHALLADINSQGQVQMAMNGGIYDES- 97 L V + + T R+ + + + +G+L + + +NGG + Sbjct: 34 RDKLNVYFFRSD--THRLLVRDEGSVKTPRYGSLDKAM----RKSPCVAGVNGGFFSADA 87 Query: 98 -YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAV 156 PLGL +++G++ L S + + GG + + ++R + +Q A+ Sbjct: 88 GGTPLGLVVQDGKRLSPLATGSFAVSGVVYEGGRDGLTLVRSSVLR--RMRRLPAMQAAI 145 Query: 157 QSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAK-- 214 Q GP L+ENG + N S R + + +S + A + A Sbjct: 146 QGGPFLVENGSAVKGL--NAQKSTYRTFIATDGGRRWCIGVSSS-LTLKELAAWLAAPGA 202 Query: 215 LN---VEQLLYLDGTISHMYM 232 L VE L LDG S + Sbjct: 203 LGNFRVETALNLDGGSSSAFW 223 >UniRef50_C3R3L4 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=C3R3L4_9BACE Length = 431 Score = 102 bits (254), Expect = 1e-20, Method: Composition-based stats. Identities = 34/218 (15%), Positives = 64/218 (29%), Gaps = 56/218 (25%) Query: 80 NSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVA----LNLASGEGNFFIRP-------- 127 + + MNGG + + A + L N ++ + G N P Sbjct: 202 AESVKPAIVMNGGYFASNGATVSLLYRNNVMLAPNLQSMSRSDGTSNVAFYPTRSAFGEI 261 Query: 128 -GGVFYV-----------------AGDKVGI----VRLDAFKTSK---EIQFAVQSGPML 162 G F V + +K G+ + + + + A+ GP+L Sbjct: 262 ENGKFEVNWVYTVSSGQTYAYPAPSPNKSGVSPMQIPSVNYPEGASIWKAKNAIGGGPVL 321 Query: 163 MENGVINPRIHP---------NVASSKIRNGVGINKHGNAVFLLSQQA--------TNFY 205 ++NG+ S+ R+ +GI +F + + Sbjct: 322 LKNGLYKNTWEAELFDTASGIGPTSNNPRSAIGITGDNRLIFFVCEGRNKTPNVPGFTLE 381 Query: 206 DFACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRY 242 + A + L + LDG S M + G Sbjct: 382 EVAYILRD-LGCLDAMNLDGGGSSCMLVNGQETIKPSD 418 >UniRef50_C1XS52 Predicted periplasmic protein (DUF2233) n=1 Tax=Meiothermus silvanus DSM 9946 RepID=C1XS52_9DEIN Length = 294 Score = 102 bits (253), Expect = 1e-20, Method: Composition-based stats. Identities = 41/253 (16%), Positives = 79/253 (31%), Gaps = 27/253 (10%) Query: 6 LIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKAN 65 LI + RI T + V + V A VN V + Sbjct: 48 LISNTIHPGQRLRIRPPATSFSVKLVTRPVLK-----VPVLAVHVNLAHPEVSIRSLLPP 102 Query: 66 GEAWGTLHALLADINSQGQVQMAMNGGIYDE-SYAPLGLYIENGQQKVALNLASGEGNFF 124 +L + + ++ A+NGG + ++ P G + G Q V ++ + Sbjct: 103 PGVGRG-GEVLQRLAWRTRLVAAINGGYFHPRTFWPAGDLVVGGHQLVKGSIQTALA--- 158 Query: 125 IRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPR------IHPNVAS 178 + ++ E A +GP ++ G + P + Sbjct: 159 -------ITPDKRARVMVGPQTWRGYETVIA--NGPYILRRGRLVVTPRAEGYNDPAIWG 209 Query: 179 SKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISH-MYMKGGAI 237 R+ VG+ +F+ ++ + AKL ++ + LDG S + KG + Sbjct: 210 RARRSAVGVVNERYLIFVSTKMELTLSELGKVM-AKLGAKEAIVLDGGSSTGLVWKGETL 268 Query: 238 PWQRYPFVTMISV 250 I + Sbjct: 269 IRPGRALSYGIGI 281 >UniRef50_B2V2N5 Putative uncharacterized protein n=8 Tax=Clostridium RepID=B2V2N5_CLOBA Length = 348 Score = 102 bits (253), Expect = 2e-20, Method: Composition-based stats. Identities = 38/223 (17%), Positives = 71/223 (31%), Gaps = 32/223 (14%) Query: 37 ALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIY- 94 + + NP +V M + G L +++ + A+NGG + Sbjct: 118 DIHTDRYDGYMLEIENPHKVKVAMT------KYLGKLGQKTSEMAEEHNAIAAINGGSFV 171 Query: 95 ----------DESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIV-RL 143 P G I +G+ + G+ N + + ++ + Sbjct: 172 DKSSDGITYAGTGGQPGGFVISSGKVVYPI----GKCNEHSVENVIAFTKKGQLIVGNHT 227 Query: 144 DAFKTSKEIQFAVQSG-PMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA- 201 A ++Q A+ P ++ NG+ + + R VG + G +FL Sbjct: 228 LAELKKLDVQEAMCFREPNVIINGIRQHKKEDYIDGINPRTAVGQKEDGTVLFLALDGRK 287 Query: 202 -----TNFYDFACYAKAKLNVEQLLYLDGTIS-HMYMKGGAIP 238 Y+ +++ LDG S MY KG I Sbjct: 288 LSKPGATIYEVQEIMRSR-GAINAGMLDGGYSTTMYYKGDVIN 329 >UniRef50_A7M0G9 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=A7M0G9_BACOV Length = 621 Score = 101 bits (251), Expect = 3e-20, Method: Composition-based stats. Identities = 37/179 (20%), Positives = 64/179 (35%), Gaps = 20/179 (11%) Query: 74 ALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYV 133 A + I + A+NG Y S P + + KVA + S + GV + Sbjct: 103 AKTSMIAKDKKALFAINGS-YSISGNPSTFTMVDKVVKVASTIESAS-----KVNGVIAI 156 Query: 134 AGDKVGIVRL----DAFKTSKEIQFAVQSGPMLMENGVINPRIHPNV--ASSKIRNGVGI 187 + V+ D E + A+ SGPML+ G + + R+ +GI Sbjct: 157 DAEGSVDVKSCTFSDYTDVEDEYESALASGPMLLMEGKVC-SFPQDAIYTQRMARSVIGI 215 Query: 188 NKHGNAVFLLSQQATN------FYDFACYAKAKLNVEQLLYL-DGTISHMYMKGGAIPW 239 G + L A + A + L ++ + L DG+ S ++ G + Sbjct: 216 TAQGKMMLLTIDGAITGNADGATLEEAAFIAKTLGMKNAVCLADGSSSTLWTSGKGVVN 274 >UniRef50_UPI0001694670 hypothetical protein Plarl_22443 n=1 Tax=Paenibacillus larvae subsp. larvae BRL-230010 RepID=UPI0001694670 Length = 363 Score = 100 bits (249), Expect = 5e-20, Method: Composition-based stats. Identities = 30/208 (14%), Positives = 62/208 (29%), Gaps = 15/208 (7%) Query: 40 DPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYA 99 D + Y +P++ RV + +K GE ++ + ++ +A Sbjct: 118 DYWVGKMMYVFDPRSIRVVVPGKKGEGERITSMVERTGAVAGVNGGGF-IDPDGLGNGFA 176 Query: 100 PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVR-LDAFKTSKEIQFAVQS 158 P+G + G+ I V + + I + + ++ AV Sbjct: 177 PIGAILSGGKVLYNDQKED------IPQHIVGFTDKGTLVIGKYSIDQLRAMKVSEAVSF 230 Query: 159 GPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQAT------NFYDFACYAK 212 P ++ NG R +G G +F++ + Sbjct: 231 YPRVIANGKPLITKGDGGWGRAPRTALGQRADGTVIFVVIDGRQAHSVGATLREVQDLLL 290 Query: 213 AKLNVEQLLYLDGTISHMYMKGGAIPWQ 240 + +LDG S +K + Q Sbjct: 291 EQ-GCINAGFLDGGASSEMVKDRKLLTQ 317 >UniRef50_Q8YKN4 All7259 protein n=2 Tax=Cyanobacteria RepID=Q8YKN4_ANASP Length = 245 Score = 100 bits (249), Expect = 5e-20, Method: Composition-based stats. Identities = 35/212 (16%), Positives = 62/212 (29%), Gaps = 39/212 (18%) Query: 64 ANGEAWGTLHALLA------DINSQGQVQMAMNGGIYDE-SYAPLGLYIENGQQKV-ALN 115 + AL A + + + N G +D + + GQ + Sbjct: 8 PANSPFVVTGALSAKVSTVEEFAQKHRAFAIFNAGFFDPANQKSTSYVVVTGQMVADPKD 67 Query: 116 LASGEGNFFIRP--GGVF-------YVAGDKVG---IVRLDAFKTSKEIQFAVQSGPMLM 163 N ++P +F Y+ G + ++ + + A+ +GP L+ Sbjct: 68 NERLVNNPQLKPYLNLIFNRSEFRRYLCGQTTRYDITLHNESPPANCRLVDAIGAGPRLL 127 Query: 164 ENGVINP-RIHPNVASS--------KIRNGVGINKHGNAVFLLS--------QQATNFYD 206 P N R VGI G+ + ++ + Sbjct: 128 PKLTSVPEGFVDNAKGRDALLSKQLNARTAVGITSEGSIILVMVAQKPSKPKNSGISLVQ 187 Query: 207 FACYAKAKLNVEQLLYLDGT-ISHMYMKGGAI 237 A K KL + LDG S +Y G A Sbjct: 188 LADLMK-KLGASAAMNLDGGSSSSLYYNGKAF 218 >UniRef50_C1ABL2 Putative uncharacterized protein n=1 Tax=Gemmatimonas aurantiaca T-27 RepID=C1ABL2_GEMAT Length = 311 Score = 99.7 bits (247), Expect = 9e-20, Method: Composition-based stats. Identities = 37/220 (16%), Positives = 80/220 (36%), Gaps = 33/220 (15%) Query: 29 FAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMA 88 +A + TV ++P + + +G+A A + N+ +A Sbjct: 68 WAEWPVQLGARGISTTVIVVDIDPARIALTLEIA-RDGDAL----APWSLDNAPKDAVIA 122 Query: 89 MNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAF-- 146 +N G + + P G + ++ A + F I + I+R D Sbjct: 123 LNAGQFTDDG-PWGWVVHRQREWQAPGVGPLSAAFVID-------TAGRAAILRADEIAE 174 Query: 147 -KTSKEIQFAVQSGPMLMENGVINPRI----HPNVASSKIRNGVGINKHGNAVFLLSQ-- 199 + + A+QS P+++ +G + P + ++ IR +G+ G+ + L++ Sbjct: 175 ARRRGGWEEALQSFPLILNDGALPPGLCAPGAVDLEHRDIRLTLGVLPDGHVLLALTRYA 234 Query: 200 ------QAT----NFYDFACYAKAKLNVEQLLYLDGTISH 229 + A + +L V + + LDG +S Sbjct: 235 GVGSAGNRLPIGPTTGEMATIMR-ELGVARAVMLDGGLSA 273 >UniRef50_UPI0001744905 hypothetical protein VspiD_09365 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001744905 Length = 251 Score = 98.6 bits (244), Expect = 2e-19, Method: Composition-based stats. Identities = 39/232 (16%), Positives = 78/232 (33%), Gaps = 28/232 (12%) Query: 23 LTLLPLFAVAADDCALSDPTLTVQAYTVN-----PQTERVKMYWQKANGEAWGTLHALLA 77 + + L A L A V P + + A +H Sbjct: 3 VIVETLPAQWTVRSQAGPVKLPGGAIQVKKQLAGPTEAELNLILFTAGKYEMRVVHQPER 62 Query: 78 ----DINSQGQVQMAM---NGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGV 130 + ++ + A+ NGG + + PLGL + +G + +S G GV Sbjct: 63 DKGVSLATKMRELGAIAGCNGGYFTPDFLPLGLEVSDGVRSGTFQRSSLLG-------GV 115 Query: 131 FYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKH 190 F V + +V D + K + +Q+GP L+ G+ + + R + ++ Sbjct: 116 FLVRHGRPAMVWKDEYIEQKGVTQLLQAGPRLVHAGLPVAGLEA--TKRRARTFILTDQA 173 Query: 191 GNAVFLLSQQATNFYDFACYAKAK-----LNVEQLLYLDGTISH-MYMKGGA 236 GN + + + + + V++ L DG S ++ + Sbjct: 174 GNWALGTCKS-VTLRELSDLLSTRALLPEVTVKRALNFDGGNSTGLWWRAEG 224 >UniRef50_B3CE38 Putative uncharacterized protein n=3 Tax=Bacteroides RepID=B3CE38_9BACE Length = 285 Score = 98.2 bits (243), Expect = 2e-19, Method: Composition-based stats. Identities = 32/219 (14%), Positives = 64/219 (29%), Gaps = 27/219 (12%) Query: 32 AADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNG 91 A+ +L V + P+ R + E ++ + +A+NG Sbjct: 49 EAEFVSLYGVPQHVTILEIKPERHRFDILIHSPKEET--------SNAARRSGAVVAING 100 Query: 92 GIYD-ESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSK 150 ++ + + ++G G G + K+ I+ Sbjct: 101 SYFNIKQGTSICYLRKDGVVVDTTAT----GVLSTVSNGAVKIDKGKLDIIAWKKQDEKT 156 Query: 151 ---EIQFAVQSGPMLMENGVINPRIHPN---VASSKIRNGVGINKHGNAVFLLSQQA--- 201 + + SGP+++ +G N V + R+ V + K G + Sbjct: 157 CEQKEGSILVSGPLMLLDGKTCDLSACNRSFVQTKHPRSAVALMKDGTVFLIAVAGRFEG 216 Query: 202 ----TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGA 236 N + + L + L LDG S A Sbjct: 217 KAEGINIPELTHLLR-VLGARKALNLDGGGSTTLWSASA 254 >UniRef50_C4ICA6 Peptidase, M56 family n=1 Tax=Clostridium butyricum E4 str. BoNT E BL5262 RepID=C4ICA6_CLOBU Length = 568 Score = 97.4 bits (241), Expect = 4e-19, Method: Composition-based stats. Identities = 26/182 (14%), Positives = 54/182 (29%), Gaps = 26/182 (14%) Query: 45 VQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIY---------- 94 +P+ +V + + + I A+NGG + Sbjct: 401 YYMEIKDPKRIKVGVAVK------LNEEGQTASKIAQNYNAVAAINGGGFLDQSSTGYWN 454 Query: 95 DESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFK--TSKEI 152 P+G+ + G+ + + +F + + IV + + K + Sbjct: 455 GTGGIPVGIIMSKGEVIYNDVEETEKTE-------LFAIDKQRQMIVGTYSVEDLKEKGV 507 Query: 153 QFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAK 212 Q AV GP L+ +G ++ R +G + G + L+ K Sbjct: 508 QEAVSFGPSLIIDGKMSEMTGDGGWGIAPRTAIGQKEDGTIILLVIDGR-GIGSLGATLK 566 Query: 213 AK 214 Sbjct: 567 ET 568 >UniRef50_C6PYU6 Putative uncharacterized protein n=1 Tax=Clostridium carboxidivorans P7 RepID=C6PYU6_9CLOT Length = 369 Score = 96.7 bits (239), Expect = 7e-19, Method: Composition-based stats. Identities = 34/223 (15%), Positives = 59/223 (26%), Gaps = 24/223 (10%) Query: 37 ALSDPTLTVQAYTV-NPQTERVKMYWQ-KANGEAWGTLHALLADINSQGQVQM---AMNG 91 + D V NP ++ + K G+ + + + NG Sbjct: 146 EIDDTKFHACILEVKNPTRMKIGYTNKLKEVGQKTSEIAEENGAAAAINGGGFTDKSSNG 205 Query: 92 GIYDESYA-PLGLYIENGQQKVALNLASGEGNF-FIRPGGVFYVAGDKVGIVRLDAFKTS 149 ++ + A P G+ I NG+ + + N G V V + Sbjct: 206 KLWTGTGAYPQGIVISNGKVVYSDVKNNEAVNVTAFTKDGKLIVGDHTVSEL------LR 259 Query: 150 KEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA------TN 203 + A+ L+ NG R +G G + L+ + Sbjct: 260 DNVTEAISFRNSLIINGKPVALAEEG---LNPRTAIGQKADGTIIMLVIDGRKGLKAGAS 316 Query: 204 FYDFACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRYPFV 245 + + LDG S MY G I Sbjct: 317 LKEVQNILLQR-GALNASSLDGGSSSTMYFNGEVINDPCDWNG 358 >UniRef50_C6XXH4 Putative uncharacterized protein n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6XXH4_PEDHD Length = 289 Score = 96.3 bits (238), Expect = 8e-19, Method: Composition-based stats. Identities = 29/174 (16%), Positives = 58/174 (33%), Gaps = 15/174 (8%) Query: 74 ALLADINSQGQVQMAMNGGIYD-ESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFY 132 + ++ A+NG +D ++ + G+ L + + V Sbjct: 85 KTTSTFGTENNALAAVNGSFFDVKNGGSVDFIKVGGKVLAENRLEKNDSRARHQQAAV-V 143 Query: 133 VAGDKVGIVR---LDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNV--ASSKIRNGVGI 187 ++ K+ + + ++ + + SGP+LM NG + + S R +GI Sbjct: 144 ISNGKLALKKWDGTADWEQRLTEENVLLSGPLLMLNGT-DEALDSTSFSRSRHPRTAIGI 202 Query: 188 NKHGNAVFLLSQQA------TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGG 235 +G + L + + A K L + LDG S G Sbjct: 203 KPNGRILLLTVDGRNSNSAGMSLTELAKTMKW-LGCTSSINLDGGGSTTLWVSG 255 >UniRef50_UPI0001746B2F hypothetical protein VspiD_16055 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001746B2F Length = 325 Score = 96.3 bits (238), Expect = 9e-19, Method: Composition-based stats. Identities = 47/214 (21%), Positives = 80/214 (37%), Gaps = 28/214 (13%) Query: 42 TLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPL 101 TV +NP + + ++ +G A T LA + A+ D + PL Sbjct: 91 RETVNVIEINPANYQFQTSFK--DGFALTTAKERLATE----RAAFAITANFRDPAGKPL 144 Query: 102 GLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEI-QFAVQSGP 160 GL + G Q+ F G F+V K F+ + + Q A Q P Sbjct: 145 GLVVHEGTQRNPT---------FPAWTGYFFVKAGKPWFGPKSLFEETPGVLQEASQGYP 195 Query: 161 MLMENGVI------NPRIHPNVASSKIRNGVGINKHGNAVFLL--SQQATNFYDFACYAK 212 LM+N + + + R G+ ++GN VF+L + N + A+ Sbjct: 196 SLMKNHTVFSYVDLPSTRYFDGNRVTYRALAGMKQNGNIVFILSGTGGVMNVSEVTALAQ 255 Query: 213 AKLNVEQLLYLDGTIS---HMYMKGGAIPWQRYP 243 +LNV+ LDG + + + G A + + Sbjct: 256 -RLNVQHATLLDGGRALQYSLKLHGAARHFTAFN 288 >UniRef50_B8J2Y6 Putative uncharacterized protein n=2 Tax=Desulfovibrio RepID=B8J2Y6_DESDA Length = 429 Score = 96.3 bits (238), Expect = 1e-18, Method: Composition-based stats. Identities = 38/208 (18%), Positives = 68/208 (32%), Gaps = 23/208 (11%) Query: 29 FAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMA 88 + + L+D + A ++P + + +G L+ Q + A Sbjct: 131 PGLDFGEFQLTDSEALLTALRIDPAHFDFILCARSQDGGNLRPLNQW----AEQYGLTAA 186 Query: 89 MNGGIYDESY-APLGLYIENGQQKVALNLASGEGNFFI-------RPGGVFYVAGDKVGI 140 +N +Y G +NG + G FF+ PG D Sbjct: 187 INASMYLPDGITSTGYMRQNGH-HNNKRVVQRFGAFFVAGPDSPDLPGAAIVDRDDPQWE 245 Query: 141 VRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQ 200 R+ + + +Q+ M + I P I + V + G +FL +Q Sbjct: 246 QRIGQY------RLVIQNYRMTSADRRI--LWSPGGPHYSI-SAVAQDGDGRILFLHCRQ 296 Query: 201 ATNFYDFA-CYAKAKLNVEQLLYLDGTI 227 Y FA LNV ++Y++G Sbjct: 297 PVEAYAFAQQLLHLPLNVRTVMYVEGGG 324 >UniRef50_Q8YP57 All4343 protein n=5 Tax=Nostocaceae RepID=Q8YP57_ANASP Length = 660 Score = 95.5 bits (236), Expect = 2e-18, Method: Composition-based stats. Identities = 39/239 (16%), Positives = 70/239 (29%), Gaps = 46/239 (19%) Query: 52 PQTERVKMYWQKANGEAWGTLHALLADINSQGQV---QMAMNGGIYDES----------- 97 P R + W G+ + +L + + + +A+N G Sbjct: 411 PILNRGAIAW-NDAGQFYFGRLSLQETLATSSNLRVPILALNSGYVQNGIARYTPAWGKM 469 Query: 98 YAPLG-----LYIENGQQKVAL-NLASGEGNFFIRPGGVFYVAGDKVGIVRLD-AFKTSK 150 Y PL + ++N + +G+ NF I G V T Sbjct: 470 YTPLTDNERIVIVQNNKITNQFPGNKAGQTNFPIPNNGYLLTLRGNATTVASQLPVGTDV 529 Query: 151 EIQFA------------VQSGPMLMENGVIN-----PRIHPN-VASSKIRNGVGINKHGN 192 +I A + +GP+L++N I + +A +R+G+ + Sbjct: 530 QITSATTPGEFNRYPHIIGAGPLLLQNSQIVLDAKSEQFSNAFIAERAVRSGICTTANNT 589 Query: 193 AVFLLSQQAT-----NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVT 246 + N + A K L L LDG S G + + Sbjct: 590 LLIAAVHNRAGGPGPNLAEHAQLMKL-LGCVNALNLDGGSSTSLYLSGQLLDRYPNTAA 647 Score = 47.7 bits (112), Expect = 3e-04, Method: Composition-based stats. Identities = 29/146 (19%), Positives = 50/146 (34%), Gaps = 9/146 (6%) Query: 23 LTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQ 82 +T L V VNP+T + + N + +L + Sbjct: 320 ITWATGLRWRQQFVNLGTNRFPVVLLEVNPRTIGLTLKPIVTNPDTLVGTAPILQT-AQR 378 Query: 83 GQVQMAMNGGIYDESYA-PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIV 141 A+NGG ++ + PLG +N Q + L G G FY ++ + Sbjct: 379 YFAVGAINGGYFNRNNRYPLGAIRQNNQWLSSPILNR--GAIAWNDAGQFYF--GRLSLQ 434 Query: 142 RLDAFKTSKE-IQFAVQSGPMLMENG 166 A ++ A+ SG ++NG Sbjct: 435 ETLATSSNLRVPILALNSGY--VQNG 458 >UniRef50_A7LRK4 Putative uncharacterized protein n=1 Tax=Bacteroides ovatus ATCC 8483 RepID=A7LRK4_BACOV Length = 326 Score = 95.1 bits (235), Expect = 2e-18, Method: Composition-based stats. Identities = 26/207 (12%), Positives = 60/207 (28%), Gaps = 27/207 (13%) Query: 46 QAYTVNPQTERVKMYWQKANGEA------WGTLHALLADINSQGQVQMAMNGGIYDES-- 97 V+ V + + + + +V + NG Y + Sbjct: 93 IIAEVDLNK-NVTIVTSTPDNKPEVGKILQQVTVQAEKAEAAGRKVILGTNGDFYSKKND 151 Query: 98 -YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFK-TSKEIQFA 155 + P GL+ ++G + F++ + I + FK +E+ A Sbjct: 152 LWIPGGLFYKDGVAIKTEIGWEADHVFYM-------LKDGTAHITSVPEFKLVEREVVHA 204 Query: 156 VQSGPMLMENGVINPRI--HPNVASSKIRNGVGINKHGN-AVFLLSQQATNFYDFACYAK 212 + ++++G + + N R VG++ + Y + Sbjct: 205 IGGWQRMVQDGEVVKNFTVNDNAMQFHPRTFVGVSADNRKVYLFVVDGRQPEYSNGMRLE 264 Query: 213 AKL------NVEQLLYLDGTISHMYMK 233 + Q +DG S ++ Sbjct: 265 DMMLLCQGAGCYQAFNMDGGGSTTMVR 291 >UniRef50_B2A8G9 Copper amine oxidase domain protein n=1 Tax=Natranaerobius thermophilus JW/NM-WN-LF RepID=B2A8G9_NATTJ Length = 718 Score = 95.1 bits (235), Expect = 2e-18, Method: Composition-based stats. Identities = 19/107 (17%), Positives = 34/107 (31%), Gaps = 14/107 (13%) Query: 147 KTSKEIQFAVQSGPMLMENGVINPRIHPN------VASSKIRNGVGINKHGNAVFLLSQQ 200 K +++ FA+ GP ++E G ++ R R VG+ + G + Sbjct: 596 KNVEDVVFALGGGPRILEKGEVDIRSMEEVISDNVSQGRSPRTAVGVTRDGQLLLTAVDG 655 Query: 201 A-------TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQ 240 + + K + + L LDG S M Sbjct: 656 RQSGLSIGMTLEELGNFMKDR-GAQDALNLDGGGSTMMWFDNEFQNN 701 Score = 50.0 bits (118), Expect = 8e-05, Method: Composition-based stats. Identities = 21/145 (14%), Positives = 44/145 (30%), Gaps = 25/145 (17%) Query: 37 ALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE 96 + + + ++P + + + + L + + A+NGG Y Sbjct: 391 GQENGPIKIHELRLDP---HGDVKPELIMAQDGFSGFERLDSMAKRNNAIAAINGGFYWR 447 Query: 97 SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAV 156 + P+GLYI + + FY + I R A Sbjct: 448 AGHPIGLYISDQRLIREPMPNRS---------AFFYSKDGEATIERT-----------AF 487 Query: 157 QSGPMLMENGVINPRIHPNVASSKI 181 G M +++ IN + + + Sbjct: 488 NGGLMYIDD--INTNLSIDGVNRSR 510 >UniRef50_B3PTF7 Putative uncharacterized protein n=3 Tax=Rhizobium RepID=B3PTF7_RHIE6 Length = 325 Score = 95.1 bits (235), Expect = 2e-18, Method: Composition-based stats. Identities = 38/207 (18%), Positives = 67/207 (32%), Gaps = 23/207 (11%) Query: 27 PLFAVAA-DDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQV 85 P F VA A + V+P R + + + + Sbjct: 83 PGFEVAELPVLADGREVDRIFLSRVDPARFRFVTHNAAPGDK---GIDEWEKTLP---NA 136 Query: 86 QMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRL-- 143 + +NG +D+ P +I G + G F D I L Sbjct: 137 VLIVNGSYFDKHGRPDTPFISEGIAMGPRQYDARA--------GAFTADKDTAEIRDLSH 188 Query: 144 -DAFKTSKEIQFAVQSGPMLM-ENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA 201 D A+ S P+L+ ++G + ++ R V + G V +++A Sbjct: 189 QDWQTAFVGASNAMVSYPLLIGDDGQTH--VNVKSRWLANRTFVAKDDLGRVVIGTTKEA 246 Query: 202 -TNFYDFACYAK-AKLNVEQLLYLDGT 226 + A + K + LN++ L LDG Sbjct: 247 FFSLDRLAQFLKTSPLNLKVALNLDGG 273 >UniRef50_A6TKB7 Exopolysaccharide biosynthesis protein n=1 Tax=Alkaliphilus metalliredigens QYMF RepID=A6TKB7_ALKMQ Length = 236 Score = 95.1 bits (235), Expect = 2e-18, Method: Composition-based stats. Identities = 27/179 (15%), Positives = 61/179 (34%), Gaps = 18/179 (10%) Query: 80 NSQGQVQMAMNGGIYDESYA-PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKV 138 + + A+NGG +D + P G++ + ++ + A + G ++ Sbjct: 47 ANGYKRIGAVNGGFFDGNRTLPYGMFYVDSGFLLSESWAGDAFLELVHENGKLHIDDITA 106 Query: 139 GIVRLDAFKTSKEIQFAVQSGPMLMENGVIN----PRIHPNVASSKIRNGVGINKHGNAV 194 ++ K+ +A+ L+ G +N + S R +G + N + Sbjct: 107 NQLKTKY----KKANWAISLSYSLVVGGKMNIMKGDKFPFTNQS-HPRTLIG-DNQENYI 160 Query: 195 FLLSQQATN------FYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTM 247 F++++ + A +L + DG S G I + Y + Sbjct: 161 FVVTEGRMTKEKGLTAVESARVML-ELGCNTAINADGGGSSAMDVEGKIQNKYYDNRAV 218 >UniRef50_A6L610 Putative uncharacterized protein n=1 Tax=Bacteroides vulgatus ATCC 8482 RepID=A6L610_BACV8 Length = 287 Score = 94.7 bits (234), Expect = 3e-18, Method: Composition-based stats. Identities = 26/184 (14%), Positives = 59/184 (32%), Gaps = 20/184 (10%) Query: 75 LLADINSQGQVQMAMNGGIYD-ESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYV 133 + + Q + A+NG + + +N +R G ++ Sbjct: 86 TTSQLAEQSRSSAAINGSYFSIKEGFSTCYLRKNEAVIDTTTTEER----HLRVNGAVHM 141 Query: 134 AGDKVGIVRLDAFKTSKEIQF---AVQSGPMLMENGVINPRIHPN---VASSKIRNGVGI 187 + + I+ + K + SGP+LM++G + + R+ + + Sbjct: 142 VDNNIRIIPWNDENEKKGFPLDGDILASGPLLMQDGKTCDFTTIDREFSETRHPRSAIAL 201 Query: 188 NKHGNAVFLLSQQA-------TNFYDFACYAKAKLNVEQLLYLDGTIS-HMYMKGGAIPW 239 K G+ + + + + A + L L LDG S +++ G + Sbjct: 202 TKEGDIMLVAVDGRAEGHADGMSIAELAYLLR-ILKAHCALNLDGGGSTTLWVNGQVVNH 260 Query: 240 QRYP 243 Sbjct: 261 PSDN 264 >UniRef50_C5RID5 Putative uncharacterized protein n=1 Tax=Clostridium cellulovorans 743B RepID=C5RID5_CLOCL Length = 347 Score = 94.3 bits (233), Expect = 3e-18, Method: Composition-based stats. Identities = 40/225 (17%), Positives = 67/225 (29%), Gaps = 39/225 (17%) Query: 38 LSDPTLTVQAYTV-NPQTER-VKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYD 95 + + +P + V + NG+ +++ A+NGG + Sbjct: 119 IEHDRYIAHILEIKDPTKIKAVMTKYVGKNGQK-------TSEMALDYDAIAAINGGAFA 171 Query: 96 E-----------SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRL- 143 + P G I NG N + V + K+ + Sbjct: 172 DVSASGQKWAGNGAIPGGFVITNGAIVYPKENV----NKYDVQNVVAFTKEGKLVVGDYC 227 Query: 144 DAFKTSKEIQFAVQSGP-MLMENG--VINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQ 200 + + A+ P ++ +G I ++ + R +G G V L+ Sbjct: 228 INDLMAMGVTEAMCFRPPSIIIDGVAQITDKLQDG---TNPRTAIGQKADGTVVLLVIDG 284 Query: 201 AT------NFYDFACYAKAKLNVEQLLYLDGT-ISHMYMKGGAIP 238 T YD K LNV LDG S MY G I Sbjct: 285 RTLSMPGATLYDVQQIFKD-LNVVNAGNLDGGYSSTMYFNGEIIN 328 >UniRef50_B7ASL4 Putative uncharacterized protein n=1 Tax=Bacteroides pectinophilus ATCC 43243 RepID=B7ASL4_9BACE Length = 367 Score = 94.3 bits (233), Expect = 4e-18, Method: Composition-based stats. Identities = 35/230 (15%), Positives = 67/230 (29%), Gaps = 18/230 (7%) Query: 29 FAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMA 88 + D + D T + + +++ + + ++A I ++ Sbjct: 134 SEEQSQDIEIVDIKGTTYRGKLMIIKDPSRVFVGTV-PQFFEGDGKVVAKIAARYNAVGG 192 Query: 89 MNGGIYDES-----YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRL 143 +NGG + + P+GL + +G+ F + + V Sbjct: 193 VNGGEFVDGELTYTAMPVGLVMTDGRIVNGDTATRCHVTGFTKDN-ILVVGNMTGQQALD 251 Query: 144 DAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQAT- 202 + I ++ GP L+ NG R VG G + L Sbjct: 252 MGMRDCVSISSSI--GPFLIINGEAQDVSGVGG-GLNPRTAVGQRADGAVLLLAIDGRQA 308 Query: 203 -----NFYDFACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRYPFVT 246 +F D + + +DG S MY +G I P Sbjct: 309 NSLGASFADLLYIMQ-QYGAVNASTMDGGTSTQMYYEGSVINTPYSPTGP 357 >UniRef50_C8WU56 N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase-like protein exopolysaccharide biosynthesis protein n=2 Tax=Alicyclobacillus acidocaldarius RepID=C8WU56_ALIAD Length = 296 Score = 94.0 bits (232), Expect = 4e-18, Method: Composition-based stats. Identities = 34/226 (15%), Positives = 62/226 (27%), Gaps = 35/226 (15%) Query: 38 LSDPTLTVQAYTV-NPQTER-VKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYD 95 + +P T V +P+ V+ + GE +NGG + Sbjct: 77 IHEPNFTAYVLWVRDPRRVEIVETRYAGDVGETVEQFVN-------DWHAVAGVNGGSFT 129 Query: 96 E------SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTS 149 + G+ I NG+ + F G + A + Sbjct: 130 DTNWQGTGGLVQGIVISNGRILKRASGPESIVGFT--------ADGRLISGTYTLAELQA 181 Query: 150 KEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA-------- 201 + A+ GP L++ G ++ R +G G + +++ Sbjct: 182 MGVTQALMFGPTLVDRG-VDQIQGAGDWGYAPRTAIGQTADGTVILMVTDGRELHGPADI 240 Query: 202 -TNFYDFACYAKAKLNVEQLLYLDGTIS-HMYMKGGAIPWQRYPFV 245 + D A + L LDG S + G I Sbjct: 241 GASLGDIARLMIS-LGAVTAANLDGGSSATLVYDGCLINQPTDILG 285 >UniRef50_C6D0A3 Exopolysaccharide biosynthesis protein n=1 Tax=Paenibacillus sp. JDR-2 RepID=C6D0A3_PAESJ Length = 349 Score = 94.0 bits (232), Expect = 5e-18, Method: Composition-based stats. Identities = 31/172 (18%), Positives = 53/172 (30%), Gaps = 22/172 (12%) Query: 79 INSQGQVQMAMNGGIYDE------SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFY 132 I + + A+N + + A G+ IE+G K N + E I GV Sbjct: 143 IAKRAKALAAINASGFVDLDGHGNGGASTGVVIEDGVIKSQ-NKNTKEFVAGITKDGVMI 201 Query: 133 VAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGN 192 + + +Q+A P L+ NG R +G G+ Sbjct: 202 TGKYSANEL------VNLGVQYAAGFKPQLIVNGQKMVE-GDGGWGWGPRTAIGQKADGS 254 Query: 193 AVFLLSQQATN------FYDFACYAKAKLNVEQLLYLDGT-ISHMYMKGGAI 237 +F++ + + + +DG S MY G I Sbjct: 255 IIFVVIDGRQTRSVGASIKEVQDLLYER-GAVNAMCMDGGSSSSMYFNGDNI 305 >UniRef50_A7LRK2 Putative uncharacterized protein n=1 Tax=Bacteroides ovatus ATCC 8483 RepID=A7LRK2_BACOV Length = 315 Score = 93.6 bits (231), Expect = 5e-18, Method: Composition-based stats. Identities = 28/225 (12%), Positives = 69/225 (30%), Gaps = 28/225 (12%) Query: 44 TVQAYTVNPQTERVKMYWQKANGEAWGTLH-------ALLADINSQGQVQMAMNGGIYDE 96 + T++ + A + V + +NG Y + Sbjct: 77 HIFVATIDLNELTFTPATKDDKNVPATGPESSAPLPIHAFAAEANGKTVWLGVNGDYYAD 136 Query: 97 S-YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFA 155 + +GL+ ++G + + + G YV +A + A Sbjct: 137 NPRRVMGLFYKDGVCINSQYFEGHDEVLYQLKNGETYVGQ------ADEALAHEANLLHA 190 Query: 156 VQSGPMLMENGVINPRIHP--NVASSKIRNGVGINKHGNAVFL-LSQQA---------TN 203 + +L+++GV+ ++ ++ R VG+++ +++ + Sbjct: 191 LGGYGLLVKDGVVQNFYEEMGDLQNTHPRTSVGLSQDRKTMYVFVVDGRRKDSFFALGLT 250 Query: 204 FYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMI 248 A KA + + LDG S + + P ++ Sbjct: 251 LPHLATMMKA-VGCYNAINLDGGGSTTLII-RKVNDGGKPTFPIL 293 >UniRef50_C6IP98 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=C6IP98_9BACE Length = 536 Score = 93.2 bits (230), Expect = 8e-18, Method: Composition-based stats. Identities = 44/277 (15%), Positives = 78/277 (28%), Gaps = 72/277 (25%) Query: 24 TLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQG 83 TL + L+ + T +V++ + A T+ A N G Sbjct: 242 TLPAEIELYETTSNLNGSNFHAWYAIGDLSTGKVEVRVHIPSSPA--TIDTQSASFN--G 297 Query: 84 QVQMAMNGGIYDESYAPLGLYIEN----------------GQQKVALNLASGEGNFFIRP 127 + +NGG + + G+ + N G + G F + Sbjct: 298 DCYLLVNGGYF-YNGNHTGIAVINSIKSGSVSAVRGSLKTGDTEYNSMYNVTRGTFGVDA 356 Query: 128 GG--------------VFYVA--------GDKVGIVRLDAFKTSKE--IQFAVQSGPMLM 163 G VFY +K GIV + T+ ++A+ +GP+L+ Sbjct: 357 SGKPNVVWTGTDASSNVFYFDRPLPSVKGENKYGIVTNENPTTAISWSPKYALSAGPVLL 416 Query: 164 ENGVINPRIHPNVASSKI--------------------RNGVGINKHGNAVFLLSQQAT- 202 ++ I + R +G + G V + Sbjct: 417 KDKKIPFDFTETSKGTDYYLSNYEIIPYDIFGANVTPDRTAIGYREDGKVVIFICDGRIT 476 Query: 203 -----NFYDFACYAKAKLNVEQLLYLDGTISHMYMKG 234 + A K L + LDG S + G Sbjct: 477 ASGGATLTELAQIMKG-LGCVGAINLDGGGSTGMVVG 512 >UniRef50_B4WS35 Putative uncharacterized protein n=1 Tax=Synechococcus sp. PCC 7335 RepID=B4WS35_9SYNE Length = 687 Score = 92.8 bits (229), Expect = 9e-18, Method: Composition-based stats. Identities = 25/168 (14%), Positives = 47/168 (27%), Gaps = 26/168 (15%) Query: 104 YIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQS----- 158 ++N + ++ + P Y+ + +F+ + + QS Sbjct: 507 TVQNHEVIAQKSMGKAGSSSVPIPRDGGYLLALRSYRSAGQSFQPGTPVLLSSQSQPAVF 566 Query: 159 ---------GPMLMENGVIN-----PRIHPN-VASSKIRNGVGINKHGNAVFLLSQQAT- 202 GP+L+ + I N + + R VG G + Sbjct: 567 EQYPNMIGGGPLLVRDRNIVLNPQLEGFSTNFIQGAAPRTAVGKTSDGTWIIATMHDRVG 626 Query: 203 ----NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVT 246 + A K +L L LDG S GG + + Sbjct: 627 GRGPTLTETAYIMK-QLGAVDALNLDGGSSSSLYLGGQLLNRHPRTAA 673 Score = 44.3 bits (103), Expect = 0.004, Method: Composition-based stats. Identities = 25/142 (17%), Positives = 47/142 (33%), Gaps = 15/142 (10%) Query: 26 LPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQV 85 P ++ V + V P + + + A + ++ + Q Sbjct: 349 APGLRWRQQYINVNQHRFPVYMFIVRPNPDALTLRPIHAASNTAIGIEPIVTT-AKRAQA 407 Query: 86 QMAMNGGIYDESYA-PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLD 144 A+N G ++ + PLG GQ L G +A + G + +D Sbjct: 408 IGAVNAGFFNRNNQLPLGAVRSAGQWISGPILGRG------------AMAWNDSGELVID 455 Query: 145 AFKTSKEIQFAVQ-SGPMLMEN 165 F S+ + V + P+L N Sbjct: 456 RFALSESVTTGVGEAFPILAVN 477 >UniRef50_C6XT12 NHL repeat containing protein n=2 Tax=Pedobacter heparinus DSM 2366 RepID=C6XT12_PEDHD Length = 646 Score = 92.4 bits (228), Expect = 1e-17, Method: Composition-based stats. Identities = 38/229 (16%), Positives = 70/229 (30%), Gaps = 29/229 (12%) Query: 23 LTLLPL-FAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADIN- 80 +T+ P + + + VN +V M + A Sbjct: 74 ITVAPGVTETDIHYTDTAGKAMHLFILKVNLNEPQVFMEVATPFNLPAYARQTVPAQAAE 133 Query: 81 ---SQGQVQMAMNGGIYDE-SYAPLGLYIENGQQK------VALNLASGEGNFFIRPGGV 130 + V +NG +D + P+G+ +NG L F + V Sbjct: 134 IDTATHMVIAGINGDFFDTSTGIPMGIVHKNGSIVKSTFNDNTLKPQQAVSFFGVTENNV 193 Query: 131 FYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKH 190 + + S ++ + SG ML+ N + + + R VG + + Sbjct: 194 PIID------FKSGYAALSSQLYNSTGSGVMLVNN---HLPVSQPYTAIDPRTSVGYDDN 244 Query: 191 GNAVFLLSQQA-------TNFYDFACYAKAKLNVEQLLYLDGTISHMYM 232 G F++ N+ A NV+ + LDG S +M Sbjct: 245 GIVYFVVIDGRDAPYSNGMNYAQLTSAFMA-FNVKNAVNLDGGGSSTFM 292 >UniRef50_B7DMS1 Copper amine oxidase domain protein n=3 Tax=Alicyclobacillus acidocaldarius RepID=B7DMS1_9BACL Length = 354 Score = 92.0 bits (227), Expect = 2e-17, Method: Composition-based stats. Identities = 41/164 (25%), Positives = 70/164 (42%), Gaps = 20/164 (12%) Query: 100 PLGLYIE--NGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQ 157 P G IE G+ K + G+ I V + +K F A+ Sbjct: 201 PDGYDIEIGAGEAKTPIVTRVHVGDPAILTDTVLALPSEKPV-----PFAAYPN---AIG 252 Query: 158 SGPMLMENGVINPR-------IHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACY 210 +GPML++NG I+ + + +R+ VGI++ G+ +FL +A N + A Sbjct: 253 AGPMLVQNGRIDVEPSLEGLDEPDILNAETLRSVVGIDRAGHLIFLTIHEA-NVWQEASI 311 Query: 211 AKAKLNVEQLLYLDGTIS-HMYMKGGAIPWQRYPFVTMISVERK 253 AKA L + + LDG S ++ +G + + T I V ++ Sbjct: 312 AKA-LGLWDAMNLDGGSSVGLWYEGRYLTPPKRALATAIVVVQR 354 >UniRef50_B8CYN3 SpoIID/LytB domain protein n=1 Tax=Halothermothrix orenii H 168 RepID=B8CYN3_HALOH Length = 833 Score = 92.0 bits (227), Expect = 2e-17, Method: Composition-based stats. Identities = 31/162 (19%), Positives = 56/162 (34%), Gaps = 22/162 (13%) Query: 96 ESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFA 155 + +P + +NG A + F V G+ +K+I+ A Sbjct: 667 KDGSPGTVIPDNGFIIQAHGQSRQFLKLFKEGDKVVLQNNFGPGLT-------NKDIKMA 719 Query: 156 VQSGPMLMENGVIN-----PRIHPNV-ASSKIRNGVGINKHGNAVFLLSQQA-------T 202 + +GP L++NG I P++ R +GI + + + Sbjct: 720 LGAGPTLIKNGKIYITGKAEGFQPDILRGRAPRTALGITSGNHLIMVTVDGRQPGFSIGM 779 Query: 203 NFYDFACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRYP 243 + A + K NV Q + LDG S M ++G + Sbjct: 780 TLEELAQFML-KYNVVQAMNLDGGASSRMVVRGYTMNNPSDK 820 Score = 43.1 bits (100), Expect = 0.008, Method: Composition-based stats. Identities = 15/69 (21%), Positives = 31/69 (44%), Gaps = 2/69 (2%) Query: 45 VQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLY 104 + ++ + + + A+G+ L L + + S + +NGG Y + PLGL+ Sbjct: 521 ITMLDLDLNNDFLYVEPFLASGK-LSGLSDL-SQVVSGKKALAGINGGFYSYTGRPLGLF 578 Query: 105 IENGQQKVA 113 + NG+ Sbjct: 579 MINGEIVSE 587 >UniRef50_C4Z4Z5 Putative uncharacterized protein n=1 Tax=Eubacterium eligens ATCC 27750 RepID=C4Z4Z5_EUBE2 Length = 388 Score = 92.0 bits (227), Expect = 2e-17, Method: Composition-based stats. Identities = 36/225 (16%), Positives = 65/225 (28%), Gaps = 19/225 (8%) Query: 34 DDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGI 93 D + D + + + +++ G ++ADI + +NGG Sbjct: 161 PDIEIVDVKGATYSGKLMIVKDPSRLFVGTVPEFTNGN-GMVVADIAKRYDAIGGVNGGE 219 Query: 94 YDESYA-----PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKT 148 + + P+GL +++G+ S I + + + Sbjct: 220 FVDGETTYTAMPIGLVMKDGEILNDNGGTSHVT--GITFDNKLVLGNMNAAKAKELNIRD 277 Query: 149 SKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQAT------ 202 I + GP L+ NG + + R +G G + L Sbjct: 278 CVSISNHI--GPFLIVNGEAQDIVGIAG-GTNPRTAIGQTADGKILLLAVDGRQPNSIGA 334 Query: 203 NFYDFACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRYPFVT 246 F D A+ +DG S MY G I P Sbjct: 335 TFSDLQDIM-AQYGAVNASTMDGGTSTQMYYDGEVINVPYSPTGP 378 >UniRef50_B4AZH7 Putative uncharacterized protein n=1 Tax=Cyanothece sp. PCC 7822 RepID=B4AZH7_9CHRO Length = 298 Score = 91.3 bits (225), Expect = 3e-17, Method: Composition-based stats. Identities = 37/273 (13%), Positives = 79/273 (28%), Gaps = 58/273 (21%) Query: 12 ITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYT--------VNPQTERVKMYWQK 63 + +K I ++L + + A + P + A + V Sbjct: 1 MNRLVKLIIISLIMGVVSACTPTQSSSEKPQRSESAVVQPEPLYKVYDLPQSTVHTL-TI 59 Query: 64 ANGEAWGTLHALLADI------NSQGQVQMAMNGGIYDES-YAPLGLYIENGQQKVALNL 116 + L + + A+NGG +D + I G+ Sbjct: 60 PVDSPYQVTVTLARSLETVENLAKKQGAMAAINGGFFDPNNGKTTSYIIHQGKIIADPKN 119 Query: 117 ASGEGNFFIRPGGVFYVAGD-----------KVGIVRLDAFKTSK-----EIQFAVQSGP 160 P Y+ + +F ++ ++ +GP Sbjct: 120 NERL---MKNPDLTRYLDKILNRSEWRRYQCGATVRYSISFHNQPTLTGCQLLDSLGAGP 176 Query: 161 MLM-------------ENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQ------- 200 L+ NG + + + R +GI +G+ +++++ Q Sbjct: 177 RLLPEMTAQTEGFIDLVNGTMI-KDALGLKEPNARTAIGITANGDLIWIMAAQKAHSSRA 235 Query: 201 -ATNFYDFACYAKAKLNVEQLLYLDGTISHMYM 232 + + A + K L V++ L LDG S + Sbjct: 236 TGLSLLELAEFLK-TLGVQEALNLDGGSSSTFY 267 >UniRef50_C0ZFU4 Putative uncharacterized protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0ZFU4_BREBN Length = 359 Score = 90.9 bits (224), Expect = 3e-17, Method: Composition-based stats. Identities = 28/218 (12%), Positives = 66/218 (30%), Gaps = 26/218 (11%) Query: 37 ALSDPTLTVQAYTV---NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGI 93 +L + V +P ++ + + + + + +A+N G Sbjct: 137 SLQEGGYRGYMAKVRLNDPNALKMVL-----ANNSVKSKGETTSQAGKRTGSILAINAGG 191 Query: 94 YDES----YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTS 149 + PLG+ + +G+ + + G ++ A T Sbjct: 192 FMSDKQGNLTPLGITVVDGK-IRTFSNNAKLSFVGFNNKGHLVGTS-----IKTQAQITQ 245 Query: 150 KEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA-------T 202 + I P L++ G P + + R +G +G+ + ++ Sbjct: 246 QGILQGASFLPRLLQGGKRLPIPREWANARQPRTLIGHFDNGDLLLIVIDGRRDGWSNGV 305 Query: 203 NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQ 240 + A + +V LDG S + G + + Sbjct: 306 TLEE-AQRKLQEWHVVDAYNLDGGGSSAFYYNGKLLNK 342 >UniRef50_C9RVV6 Putative uncharacterized protein n=3 Tax=Geobacillus RepID=C9RVV6_GEOSY Length = 652 Score = 90.9 bits (224), Expect = 4e-17, Method: Composition-based stats. Identities = 17/107 (15%), Positives = 39/107 (36%), Gaps = 13/107 (12%) Query: 135 GDKVGIVRLDAFKTSK--EIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGN 192 + + + + ++ A+ L+ +G + P ++ R VGI+K+GN Sbjct: 356 KEGDAVEISLQYDQPEWSGVKEALGGRYRLVADGKVQP---FSIEGVHPRTAVGIDKNGN 412 Query: 193 AVFLLSQQA-------TNFYDFACYAKAKLNVEQLLYLDGTISHMYM 232 + ++ + A +L + LDG S ++ Sbjct: 413 VMLIVVDGRQPAYSQGMTLNELAKLM-HELGAVDAMTLDGGGSSTFV 458 Score = 50.4 bits (119), Expect = 6e-05, Method: Composition-based stats. Identities = 20/124 (16%), Positives = 39/124 (31%), Gaps = 4/124 (3%) Query: 21 LALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWG---TLHALLA 77 ++ + + + V ++ ER+ + +N + G L Sbjct: 137 VSTRIASGVEKEEMEIVGARGKQHVYKLDIDTSNERMAIETALSNDQVLGIEPVLEQAKR 196 Query: 78 DINSQGQVQMAMNGGIYDESYAPLGLYIENGQ-QKVALNLASGEGNFFIRPGGVFYVAGD 136 G V A+NG + + +P L + G+ A+ F I G + Sbjct: 197 YDGRDGIVLAAVNGDYFKQDGSPTDLMVHRGEIVITNTTPAAERTIFGISADGKPMIGNP 256 Query: 137 KVGI 140 V I Sbjct: 257 DVQI 260 >UniRef50_A1HRE9 Exopolysaccharide biosynthesis protein n=1 Tax=Thermosinus carboxydivorans Nor1 RepID=A1HRE9_9FIRM Length = 487 Score = 90.9 bits (224), Expect = 4e-17, Method: Composition-based stats. Identities = 18/103 (17%), Positives = 35/103 (33%), Gaps = 13/103 (12%) Query: 150 KEIQFAVQSGPMLMENGVIN-----PRIHPN-VASSKIRNGVGINKHGNAVFLLSQQA-- 201 + A+ +GPML++NG I + R +G+ K G + ++ Sbjct: 364 DKTVHALGAGPMLLKNGSIYLTTKIEEFGSDVAGGRAPRTALGLTKDGRVLLVVVDGRQP 423 Query: 202 ----TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQ 240 + A +L + LDG S + + + Sbjct: 424 TSAGMTLLELA-LFLQELGAVDAMNLDGGGSSEMVINDKVVNK 465 >UniRef50_B7H7U4 Putative uncharacterized protein n=27 Tax=Bacillus cereus group RepID=B7H7U4_BACC4 Length = 365 Score = 90.5 bits (223), Expect = 4e-17, Method: Composition-based stats. Identities = 26/186 (13%), Positives = 52/186 (27%), Gaps = 18/186 (9%) Query: 66 GEAWGTLHALLADINSQGQVQMAMNGGIYDE------SYAPLGLYIENGQQKVALNLASG 119 G ++ + + +A+N + + G+ IENG+ + Sbjct: 135 GTQGANRGEKISVMAKRNHALVAVNASGFADETGRGGGNVATGIVIENGKAIDTNMDRNA 194 Query: 120 EGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASS 179 + G+ K++ A P L+ NG S Sbjct: 195 PTIITGLTKFGQMITGN-----YSTQQLLDKQVVSAAGFMPQLIVNGEKMITEGDGGWGS 249 Query: 180 KIRNGVGINKHGNAVFLLSQQATNFYDFACYAKA------KLNVEQLLYLDGTISHMYMK 233 R+ + + G +FL+ + K + + +DG S Sbjct: 250 APRSIMAQKEDGTIMFLVIDGRQT-HSIGATLKECQDILYEKGAINAMAMDGGSSATLYL 308 Query: 234 GGAIPW 239 GG + Sbjct: 309 GGKVIN 314 >UniRef50_C4V4S8 Exopolysaccharide biosynthesis protein n=1 Tax=Selenomonas flueggei ATCC 43531 RepID=C4V4S8_9FIRM Length = 491 Score = 90.5 bits (223), Expect = 4e-17, Method: Composition-based stats. Identities = 27/176 (15%), Positives = 51/176 (28%), Gaps = 24/176 (13%) Query: 77 ADINSQGQVQMAMNGGIYDESYAPLG--LYIENGQQ--KVALNLASGEGNFFIRPGGVFY 132 + + P G I NG+ + + + G Sbjct: 288 NAERGADNLIIYNRAYGRSTGTNPYGLEYVIRNGRVAEINTNDSLIPPDGYVVSVHGTLM 347 Query: 133 -------VAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVIN-----PRIHPNVA-SS 179 V ++ D + + +GP L+ENG ++ + ++ Sbjct: 348 DAFAAAGVRVGDPAVLTEDLGEPWNRAVQVLGAGPRLVENGSVHVTAGEEQFPGDIRYGR 407 Query: 180 KIRNGVGINKHGNAVFLLSQQA------TNFYDFACYAKAKLNVEQLLYLDGTISH 229 R VG+ + GN +F + +FA + V + LDG S Sbjct: 408 APRTAVGVTQKGNILFAVVDGRQSHSHGLTLTEFADLL-VQFGVRDAINLDGGGSS 462 Score = 41.2 bits (95), Expect = 0.037, Method: Composition-based stats. Identities = 17/110 (15%), Positives = 29/110 (26%), Gaps = 6/110 (5%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQ 84 L A D +T +P RV+ A ++ I Sbjct: 164 LAAGLTQREYVYADEDGPVTAYFIEADPARYRVR----PALARGIIPGRQTVSGIAQDTN 219 Query: 85 VQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVA 134 A+N + S +G+ +G + F + P F Sbjct: 220 AAAAINASYFALSGELIGITKIDGTVVSSTYFDRSA--FGVMPDNSFVFG 267 >UniRef50_C5PL46 Exopolysaccharide biosynthesis protein n=2 Tax=Sphingobacterium spiritivorum RepID=C5PL46_9SPHI Length = 288 Score = 90.5 bits (223), Expect = 5e-17, Method: Composition-based stats. Identities = 26/202 (12%), Positives = 58/202 (28%), Gaps = 19/202 (9%) Query: 44 TVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYD-ESYAPLG 102 + ++ Q + + T S+ A+NG ++ ++ Sbjct: 59 EINFIEIDLQKIKQPIRLAG-----LQTGFKNTTTFASEANALAAINGAFFNTKTGGGTT 113 Query: 103 LYIENGQQKVALNLASGEG-NFFIRPGGVFYVAGDKVGIVRLDA----FKTSKEIQFAVQ 157 L N Q L G+ R K+ I++ D + ++ + Sbjct: 114 LVRINKQLINETVLKEGKSPKRSFRSNAALAFDTKKIVIIKGDDRDSTWDKKIKMPNVMT 173 Query: 158 SGPMLMENG-VINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA------TNFYDFACY 210 GP+L+ + + R+ + + + + + + + Sbjct: 174 CGPLLLHKSHRAYLDSNAFNNNRHPRSAIALTTEHKLILITVDGRNAQAYGMSLIELSNV 233 Query: 211 AKAKLNVEQLLYLDGTISHMYM 232 K L + L LDG S Sbjct: 234 MKW-LKGKDALNLDGGGSTTLY 254 >UniRef50_B0BZE5 Putative uncharacterized protein n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0BZE5_ACAM1 Length = 584 Score = 90.1 bits (222), Expect = 6e-17, Method: Composition-based stats. Identities = 29/166 (17%), Positives = 52/166 (31%), Gaps = 24/166 (14%) Query: 103 LYIENGQQKVALNLASGEGNFFIRPGGVFYV-----------AGDKVGIVRLDAFKTSKE 151 + + N Q ++ +F I G V +G ++ I + Sbjct: 406 ITVINNQVVSEKV-STSTKSFAIPKNGYLLVLRSFDVGGALASGTQLQIQTATTPASFNG 464 Query: 152 IQFAVQSGPMLMENGVIN-----PRI-HPNVASSKIRNGVGINKHGNAVFLLSQQAT--- 202 V +GP+L+ NG + + P S R+G+G G + Sbjct: 465 FPNIVGAGPLLVSNGQVVLNAKAEKFRPPFDTQSAPRSGIGQTADGTILLAAVHNQVSGP 524 Query: 203 --NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVT 246 ++A + +L L LDG S GG + + Sbjct: 525 GPTLKEWALIMQ-RLGSVNALNLDGGSSTSLYLGGQLLDRHPVTAA 569 Score = 44.7 bits (104), Expect = 0.003, Method: Composition-based stats. Identities = 26/95 (27%), Positives = 40/95 (42%), Gaps = 2/95 (2%) Query: 26 LPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQV 85 P +L + V +NPQT +K+ N A +H LL+ + QV Sbjct: 249 APGILRQERVISLGNKQYPVTWLALNPQTPGLKLQPIWGNRNALLGIHPLLS-MAQGNQV 307 Query: 86 QMAMNGGIYDESY-APLGLYIENGQQKVALNLASG 119 A+N G ++ + PLG +NGQ + L G Sbjct: 308 AAAINAGYFNRNNKTPLGAIRQNGQWISSPILNRG 342 >UniRef50_B5VVA8 S-layer domain protein n=3 Tax=Cyanobacteria RepID=B5VVA8_SPIMA Length = 789 Score = 90.1 bits (222), Expect = 7e-17, Method: Composition-based stats. Identities = 27/162 (16%), Positives = 49/162 (30%), Gaps = 22/162 (13%) Query: 96 ESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFA 155 + P + NG V + S F + G ++ I + Sbjct: 629 DDQTPTAI-PTNGYLLVFRSFRSAVSAFGV---------GSRLTITATTTPSEFIDFPHI 678 Query: 156 VQSGPMLMENGVINPRIHPNV------ASSKIRNGVGINKHGNAVFLLSQQATN-----F 204 + GP+L++N I IR+ VG+ G + + N Sbjct: 679 MGGGPLLVQNRNIVVNAEAEGFNYWFGQQLAIRSAVGVTATGEVLMVTVHNRVNGAGPSL 738 Query: 205 YDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVT 246 + A + +L + LDG S + GG + + Sbjct: 739 TEMAKLMQ-QLGAIDAINLDGGSSTSLVLGGHLLNRTPDTAA 779 >UniRef50_C0WEQ2 Exopolysaccharide biosynthesis protein n=1 Tax=Acidaminococcus sp. D21 RepID=C0WEQ2_9FIRM Length = 470 Score = 90.1 bits (222), Expect = 7e-17, Method: Composition-based stats. Identities = 37/192 (19%), Positives = 63/192 (32%), Gaps = 32/192 (16%) Query: 76 LADINSQGQVQMAMNGGIYDESYAPLGL--YIENGQQKVALNLAS-------------GE 120 L + + + NG G+ I NG+ S G Sbjct: 266 LNRMRLENDLIFYNNGYDDTTDTNAAGVEVAIRNGRVIKTGTTGSMPMSWNMTVLSGHGT 325 Query: 121 GNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVIN------PRIHP 174 F+RP V GDKV I + + +GP+L+ +G++N Sbjct: 326 AADFLRPLAV----GDKVKIKTSLGSPLADKAPSVGTAGPLLVYDGLVNVTASLEEIPSD 381 Query: 175 NVASSKIRNGVGINKHGNAVFLLSQQA------TNFYDFACYAKAKLNVEQLLYLDGTIS 228 R VGI K G + +++ + A Y +L ++ + DG S Sbjct: 382 IADGRAPRTAVGIKKDGTILVVVADGRSSRSAGMTLPELARYLI-QLGADRAMNFDGGGS 440 Query: 229 HMYMKGGAIPWQ 240 + GA+ + Sbjct: 441 SEMVVNGAVKNR 452 >UniRef50_C6XWN0 Putative uncharacterized protein n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6XWN0_PEDHD Length = 328 Score = 89.0 bits (219), Expect = 1e-16, Method: Composition-based stats. Identities = 32/202 (15%), Positives = 64/202 (31%), Gaps = 18/202 (8%) Query: 42 TLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQG----QVQMAMNGGIYDES 97 + + V+ +T ++++ + L L + A+NG + + Sbjct: 99 PVRIFIMEVDMKTPKLEIQAMAPYNDYINGLQRLSEMCRDNELPGTNIVAAVNGDTFSTT 158 Query: 98 YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQ 157 AP L+ N + A+G F G + G ++ +I+ AV Sbjct: 159 GAPTSLFYINNRVYYGTV-ATGRTFFAAMKDGTIVIGGKDTK--GVERPVDKAQIKNAVG 215 Query: 158 SGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQ-------ATNFYDFACY 210 G + + I + S+ R +G N + ++ D Sbjct: 216 -GNQWLVDNNIKATLTDATISA--RTAIGYNANKVIYAIVVDGSQATYSNGLTLVDLRDI 272 Query: 211 AKAKLNVEQLLYLDGTISHMYM 232 A L + + LDG S + Sbjct: 273 M-AALGTKDAVNLDGASSSTLV 293 >UniRef50_B4VX04 Putative uncharacterized protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VX04_9CYAN Length = 681 Score = 88.6 bits (218), Expect = 2e-16, Method: Composition-based stats. Identities = 44/239 (18%), Positives = 74/239 (30%), Gaps = 45/239 (18%) Query: 52 PQTERVKMYWQKANGEAWGTLHALLADINSQGQ--VQMAMNGGIYDES-----------Y 98 P R + W + +G I + G+ + +N G Y Sbjct: 431 PILNRGAIAWTDQHQFKFGRFSLQETLITANGERFPSLFLNSGYVQAGISRYTPAWGVTY 490 Query: 99 APLG-----LYIENGQQKVAL-NLASGEGNFFIRPGGVFYV--AGDKVGIVRLDAFKTSK 150 PL ++N Q L +GE +F I G D I + T+ Sbjct: 491 TPLTDNEVIWVVQNNQITAQLPGGVAGEESFVIPVNGYLLTHRGHDPNAIAKSLTLGTTV 550 Query: 151 EIQF------------AVQSGPMLMENGVINPR------IHPNVASSKIRNGVGINKHGN 192 +I+ + +GP+L++N I + S IR+ +GI +G Sbjct: 551 QIEQKTLPVEFNDYPHILGAGPLLLQNRQIVLDAKAENFSNAFAQQSAIRSAIGITANGT 610 Query: 193 AVFLLSQQAT-----NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVT 246 + N + A + +L L LDG S GG + + Sbjct: 611 LIIAAMHNRVGGRGPNLTETAQLMQ-QLGAVDALNLDGGSSTGLYLGGHLLDRSPHTAA 668 Score = 45.8 bits (107), Expect = 0.001, Method: Composition-based stats. Identities = 18/96 (18%), Positives = 32/96 (33%), Gaps = 2/96 (2%) Query: 26 LPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQV 85 P L D + V+P+ +VK+ ++ L+ + Sbjct: 343 APGIWWRQRTVTLGDHQFPLVWLEVDPKNPQVKLSPMWSHPTTQVGTAPLIKT-AQLWKA 401 Query: 86 QMAMNGGIYDESYA-PLGLYIENGQQKVALNLASGE 120 A+NGG ++ + PLG +G L G Sbjct: 402 AAAINGGFFNRNNQLPLGAIRRDGYWYSGPILNRGA 437 >UniRef50_O31980 SPBc2 prophage-derived uncharacterized protein yomE n=2 Tax=root RepID=YOME_BACSU Length = 644 Score = 87.4 bits (215), Expect = 4e-16, Method: Composition-based stats. Identities = 31/239 (12%), Positives = 65/239 (27%), Gaps = 33/239 (13%) Query: 28 LFAVAADDCALSDPTLTVQAYTVNPQT-------------ERVKMYWQKANGEAWGTLHA 74 F V + + + V P+T + + T Sbjct: 58 YFTVTSSFKQDATLGIEYYVTKVTPKTTEAKKSMVQKTFAYDFEKSIDPTSSYFGTTNRE 117 Query: 75 LLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVA 134 + + + + +A+N + + +GL I++G + A G VF+ Sbjct: 118 TVLSMAKRKRSVVAINASGWRSNGEVMGLQIKDGVLYKDYDAAGYTGAEAC----VFF-D 172 Query: 135 GDKVGIVRLDAFKTS----KEIQFAVQSGPMLMENGVINPRIH---PNVASSKIRNGVGI 187 + + K + + G L+++ ++ R +G Sbjct: 173 DGTMKVYGNREVDADILISKGARNSFAFGIWLVKDSKPRTAQMTTWADLNVKHPRQAIGQ 232 Query: 188 NKHGNAVFLLSQQA------TNFYDFACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPW 239 G V + YD ++ LDG S ++G I Sbjct: 233 RSDGTLVIITVDGRSLRSSGITAYDMPSLFLSE-GCINAFLLDGGGSSQTAVEGKYINN 290 >UniRef50_B8HTR4 Putative uncharacterized protein n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HTR4_CYAP4 Length = 603 Score = 87.4 bits (215), Expect = 4e-16, Method: Composition-based stats. Identities = 26/159 (16%), Positives = 45/159 (28%), Gaps = 21/159 (13%) Query: 104 YIENGQQKVALNLASGEGNFFIRPGGVFYV---------AGDKVGIVRLDAFKTSKEIQF 154 + + A+ I P G V + I Sbjct: 423 VVNDRVAGQQTASANTPTPILIPPNGYLLVLRDVPLPVFGEGSLQIQMNALPADFNRFPQ 482 Query: 155 AVQSGPMLMENGVIN-----PRIHPNVA-SSKIRNGVGINKHGNAVFLLSQQAT-----N 203 + +GP+L+E G I + + R+G+G G + + + Sbjct: 483 ILGAGPLLLERGQIVLNPDLEQFGNGLDAQQAPRSGIGRTSTGQILLVTTHNRIGGAGPT 542 Query: 204 FYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRY 242 ++A K L L LDG S GG + + Sbjct: 543 LAEWAAILK-TLGAVDALNLDGGSSTALYLGGQLLDRHP 580 Score = 43.5 bits (101), Expect = 0.007, Method: Composition-based stats. Identities = 16/100 (16%), Positives = 34/100 (34%), Gaps = 2/100 (2%) Query: 22 ALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINS 81 + A+ + +NP +++ + + L + Sbjct: 260 NILWSSGVRWREQTLAVGGDRYPLTWLEINPHQAGLQLRPIWNQPDTLVGIQPLPR-LAQ 318 Query: 82 QGQVQMAMNGGIYDESYA-PLGLYIENGQQKVALNLASGE 120 + QV A+NGG ++ + PLG ++G + L G Sbjct: 319 RWQVAAAINGGFFNRNQQVPLGAIRQSGSWISSPILNRGA 358 >UniRef50_C4Z6E6 Putative uncharacterized protein n=1 Tax=Eubacterium eligens ATCC 27750 RepID=C4Z6E6_EUBE2 Length = 360 Score = 87.4 bits (215), Expect = 4e-16, Method: Composition-based stats. Identities = 35/225 (15%), Positives = 74/225 (32%), Gaps = 34/225 (15%) Query: 37 ALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIY- 94 +S + + + +P +V + WG L +I S +NGG+Y Sbjct: 119 EISGRSFFGKMLIIKDPSQVKVGTTY------PWGDYGKELHEIVSGAGAIAGVNGGLYV 172 Query: 95 ---DESYAPLGLYIENGQQ-KVALNLASGEGNFFIRPGGVFYVAG-DKVGIVRLDAFKTS 149 + +PLG+ +++G+ + + SG + + V D + +++ Sbjct: 173 SSGNRGGSPLGIVVQDGKITYNSPSALSGLYLIGLNKDNLLVVKDIDGMSAADFESYVNE 232 Query: 150 KEIQFAVQSG-----------PMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLS 198 I+ AV P+++ N + + + R +G G + L++ Sbjct: 233 AGIRDAVAFQEESSDSNNHFVPLIINNEARV--LKGQGSGANPRTAIGQRVDGAILLLVT 290 Query: 199 QQA-------TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGA 236 D + + LDG S + G Sbjct: 291 DGRGASGHLGATASDLISVMQ-EYGAVNAANLDGGSSSTMVYNGG 334 >UniRef50_C6JBU1 Putative uncharacterized protein n=1 Tax=Ruminococcus sp. 5_1_39BFAA RepID=C6JBU1_9FIRM Length = 291 Score = 87.4 bits (215), Expect = 4e-16, Method: Composition-based stats. Identities = 34/182 (18%), Positives = 63/182 (34%), Gaps = 16/182 (8%) Query: 65 NGEAWGTLHALLADINSQGQVQMAMNGGIYDESY---APLGLYIENGQQKVALNLASGEG 121 + +G +D S + +NG +D +PLG+ I+NG + Sbjct: 100 SNGTYGGSRQTTSDAVSSNGGIIGVNGSAFDYGTGKPSPLGMCIKNGIIYGDYMTSYSV- 158 Query: 122 NFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKI 181 ++ G Y G++ + + + GP+L+++G Sbjct: 159 -MAVKKDGTIYTPAQ--GLMGKNLLAAGVKDTY--NFGPVLIKDGEAQLPWTET-EKYYP 212 Query: 182 RNGVGINKHGNAVFLLSQ----QATNFYDFACYAKAKLNVEQLLYLDGTIS-HMYMKGGA 236 R VG+ K + V L++ N +D K+ LDG S +Y G Sbjct: 213 RTAVGMVKPNDYVLLVTDTGSYNGLNHWDMVNIFKS-YGCTYAYNLDGGGSATLYFNGKV 271 Query: 237 IP 238 + Sbjct: 272 MN 273 >UniRef50_A7LVE9 Putative uncharacterized protein n=1 Tax=Bacteroides ovatus ATCC 8483 RepID=A7LVE9_BACOV Length = 332 Score = 87.4 bits (215), Expect = 5e-16, Method: Composition-based stats. Identities = 34/212 (16%), Positives = 58/212 (27%), Gaps = 50/212 (23%) Query: 81 SQGQVQMAMNGGIYDESYAPLGLYIENGQQK--VALNLASGEGN----------FFIRPG 128 + + +NGG + E L L NG+ A N F Sbjct: 112 EENNSTIVINGGFFYEG--SLSLIWRNGEMVCKNNDVTAEDWTNGPFWYPVLAAFCEMND 169 Query: 129 GVF-------------YV----AGDKVGIVRLDAFKTSKEIQFA---VQSGPMLMENGVI 168 G F Y + K + F ++ + A + GP+L+ +G I Sbjct: 170 GSFKSMWTYTTLSNVTYWYSEPSPVKSETTPDENFPSTGTVLNAKTGIGGGPVLLLDGNI 229 Query: 169 NPRIHP------NVASSKIRNGVGINKHGNAVFLLSQQ--------ATNFYDFACYAKAK 214 ++ R+ +GI + + + + A K Sbjct: 230 KNTYEEEILSDIGATVNRPRSAIGITNDKKMILFVCEGDGMTTGVAGMTTENVANIMK-T 288 Query: 215 LNVEQLLYLDGTISH-MYMKGGAIPWQRYPFV 245 L + LDG S M + G Sbjct: 289 LGCTDAINLDGGGSSCMLVNGQETIKTSDSSG 320 >UniRef50_A1SN25 Exopolysaccharide biosynthesis protein-like n=1 Tax=Nocardioides sp. JS614 RepID=A1SN25_NOCSJ Length = 420 Score = 87.4 bits (215), Expect = 5e-16, Method: Composition-based stats. Identities = 26/170 (15%), Positives = 55/170 (32%), Gaps = 24/170 (14%) Query: 92 GIYDES-YAPLGLYIENGQ--------QKVALNLASGEGNFFIRP-GGVFYVA-GDKVGI 140 GIY G + GQ + +P G+ ++ G+ + Sbjct: 224 GIYTPRWGRTAGYGVTQGQTERVRAVTVVNGRVRTNRAKLSHDQPIKGLLFIGRGEGAKV 283 Query: 141 VRLDAFKTSKEIQFAVQSGPMLMENGV---INPRIHPNVASS--KIRNGVGINKH-GNAV 194 +R T ++++++Q P + +G ++ I + R VG++ G + Sbjct: 284 LRKLPKHTRIKVRWSLQGRPQMAISGNNFLVHDGIIRAIDDREMHPRTAVGVDSDTGEVL 343 Query: 195 FLLSQQAT------NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIP 238 L+ + A L ++ + LDG S + Sbjct: 344 LLVVDGRQADSRGYTMVELANLMVD-LGADEAVNLDGGGSSTMVGKNRRG 392 >UniRef50_C9PX63 Putative uncharacterized protein n=1 Tax=Prevotella sp. oral taxon 472 str. F0295 RepID=C9PX63_9BACT Length = 294 Score = 87.0 bits (214), Expect = 5e-16, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 68/207 (32%), Gaps = 29/207 (14%) Query: 44 TVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGL 103 TV + P+ ++ A+G A + ++ + + + +NG + + Sbjct: 66 TVTVAEITPKRS-LEFDIAIADG------GATVGEMAQRTKALVGINGSYFGMNKRSAIT 118 Query: 104 YIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVR-----LDAFKTSKEIQFAVQS 158 Y+ G+ + + +R G G K+ I+ + A S Sbjct: 119 YLRQGRTVLDTTTTAELA---LRVTGAIRTHGRKLRIMPWNKEIERRYHCRHGSTLA--S 173 Query: 159 GPMLMENGV---INPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA-------TNFYDFA 208 G +L+ G + V R+ + + G +F+ N + Sbjct: 174 GHLLLYRGQSILLRSSSMGFVVKKHPRSAIALTSRGTVLFVTVDGRHPGYAGGMNLIEL- 232 Query: 209 CYAKAKLNVEQLLYLDGTIS-HMYMKG 234 + +L + LDG S ++ KG Sbjct: 233 RHFLQQLGCTDAINLDGGGSTTLWAKG 259 >UniRef50_C6J7B9 Exopolysaccharide biosynthesis protein n=2 Tax=Bacillales RepID=C6J7B9_9BACL Length = 355 Score = 87.0 bits (214), Expect = 5e-16, Method: Composition-based stats. Identities = 43/236 (18%), Positives = 73/236 (30%), Gaps = 37/236 (15%) Query: 34 DDCALSDPTLTVQAYTVNPQTER---VKMYWQKANG-------EAWGTLHALLADINSQG 83 + + ++ Y VNP T R +K+ + + + G + + G Sbjct: 116 PFETIQSDRIRIELYKVNPGTYRGYAMKIRLKSPDAMKMTLGKDRLGGAETTMQAVQRYG 175 Query: 84 QVQMAMNGGIYDESYA--PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIV 141 V GG D PL I NGQ F +F+V ++ G + Sbjct: 176 AVAGINAGGFADSRGQRYPLSTTILNGQYVNG---------FEPSYKDLFFVGLNQSGQL 226 Query: 142 RLDAFKTSK-----EIQFAVQSGPMLMENGVINPRIHP--NVASSKIRNGVGINKHGNAV 194 F+ + + +F P+L++NGV P R +G K + Sbjct: 227 IGGKFQNKESLDKLKPKFGASFVPILLQNGVKLPIPDKWKTSPLRAPRTVIGNYKDDQLL 286 Query: 195 FLLSQQ-------ATNFYDFACYAKAKLNVEQLLYLDGTI-SHMYMKGGAIPWQRY 242 L+ + L V+ LDG S + + G I Sbjct: 287 VLVVDGDNEKGRSGATLEELQNKLAN-LGVQDAYNLDGGGSSSLVVNGRVINHPSD 341 >UniRef50_C6LDL7 Putative uncharacterized protein n=1 Tax=Bryantella formatexigens DSM 14469 RepID=C6LDL7_9FIRM Length = 400 Score = 86.6 bits (213), Expect = 7e-16, Method: Composition-based stats. Identities = 32/202 (15%), Positives = 56/202 (27%), Gaps = 23/202 (11%) Query: 39 SDPTLTVQAYTVNPQTERVKMYWQKANGEAWG-TLHALLADINSQGQVQMAMNGGIYDES 97 + + + + +G L+ ++ + + +A+NG Y + Sbjct: 170 EKYGTQISYVLADIYVGDITCLRTAFAQDTYGVGYSEKLSGMSDRMKAVLAVNGDSYSNN 229 Query: 98 Y-APLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAV 156 G I NG + + V G + A Sbjct: 230 RHRNNGTIIRNGVIYRSQATDAETC--------VLNWDGTMDIYTPDQMDIQKLIERGAY 281 Query: 157 QS---GPMLM-ENGVINPRIH--PNVASSKIRNGVGINKHGNAVFLLSQQA------TNF 204 QS GP L+ ENG + S R +G + G+ LL Sbjct: 282 QSWVFGPSLLDENGKAKDSFLTWDYIRQSHPRTAIGYYEPGHYCLLLVDGRQKASRGMFL 341 Query: 205 YDFACYAKAKLNVEQLLYLDGT 226 + A +L + LDG Sbjct: 342 DEMAQLF-EELGCKAAYNLDGG 362 >UniRef50_C9KQW2 Putative secreted protein n=2 Tax=Veillonellaceae RepID=C9KQW2_9FIRM Length = 503 Score = 86.6 bits (213), Expect = 7e-16, Method: Composition-based stats. Identities = 22/116 (18%), Positives = 40/116 (34%), Gaps = 13/116 (11%) Query: 136 DKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVIN-----PRIHPNVA-SSKIRNGVGINK 189 ++ + + + F + GP L+ENG ++ ++ R+ VGI K Sbjct: 370 GDPVMIEENLGDGWQNMDFIIGCGPRLVENGRVHVTVDEEDFPADIRIGRAPRSAVGITK 429 Query: 190 HGNAVFLLSQQA------TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPW 239 G + + D+A K + L LDG S + G + Sbjct: 430 DGRYLLAVVDGRQSHSVGLTLTDWAKLL-VKFGAQDALNLDGGGSSDLVVNGDVQN 484 >UniRef50_D1VRM0 Putative copper amine oxidase N-domain family n=1 Tax=Peptoniphilus lacrimalis 315-B RepID=D1VRM0_9FIRM Length = 361 Score = 86.6 bits (213), Expect = 8e-16, Method: Composition-based stats. Identities = 30/151 (19%), Positives = 49/151 (32%), Gaps = 11/151 (7%) Query: 102 GLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPM 161 G I G+ V + + V + + ++ V +GPM Sbjct: 207 GYIIYFGKDSVDKSYIDQRFKLGRKVELVLVDSKGNETFKYNGQDISYSKVTELVAAGPM 266 Query: 162 LMENGV-INPRIHPNVASSKI------RNGVGINKHGNAVFLLSQQATNFYDFACYAKAK 214 L++NG + N KI R+ +GI K+G + L + N A Sbjct: 267 LLQNGKNVVAESKNNYKEGKINSATGQRSAIGITKNGKVILLTA--VANVDKLALIMND- 323 Query: 215 LNVEQLLYLDGT-ISHMYMKGGAIPWQRYPF 244 L + LDG S ++ G I Sbjct: 324 LGCIDAMNLDGGASSALFANGKVIKNAGRNL 354 >UniRef50_D2RLV8 N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase-like exopolysaccharide biosynthesis protein n=1 Tax=Acidaminococcus fermentans DSM 20731 RepID=D2RLV8_ACIFE Length = 477 Score = 85.9 bits (211), Expect = 1e-15, Method: Composition-based stats. Identities = 28/137 (20%), Positives = 49/137 (35%), Gaps = 18/137 (13%) Query: 134 AGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVIN-----PRIHPNVA-SSKIRNGVGI 187 GD V + + + +GP L+ +G + I ++A R GVGI Sbjct: 342 TGDPVKVTQTLGNAAADSAPSVGSAGPQLVRDGRVQVTSEEEEIADDIALGRAPRTGVGI 401 Query: 188 NKHGNAVFLLSQQA------TNFYDFACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQ 240 K G + +++ +F Y +L ++ + DG S M + G + Sbjct: 402 KKDGTVLVVVADGRSDDSVGMTLTEFGRYF-VQLGADRAMNFDGGGSSEMVVNGKIMNDP 460 Query: 241 RY----PFVTMISVERK 253 P + V RK Sbjct: 461 SDGTERPVRVALGVFRK 477 >UniRef50_A0YXN3 Putative uncharacterized protein n=1 Tax=Lyngbya sp. PCC 8106 RepID=A0YXN3_9CYAN Length = 775 Score = 85.9 bits (211), Expect = 1e-15, Method: Composition-based stats. Identities = 27/167 (16%), Positives = 53/167 (31%), Gaps = 24/167 (14%) Query: 103 LYIENGQQKVALNLASGEGNFFIRPGGVF------------YVAGDKVGIVRLDAFKTSK 150 + +EN Q + + I G + G K+ I Sbjct: 596 ITVENNQLSRQIESNDDQTPIEIPQNGYLLTFRSFRSALSAFPLGGKIAITAKTTPSEFN 655 Query: 151 EIQFAVQSGPMLMENGVINPRIHPNV------ASSKIRNGVGINKHGNAVFLLSQQATN- 203 + + +GP+L++ G I IR+G+G+ +G+ + + Sbjct: 656 QYPHILGAGPLLLQQGQIVVDAEAEGFNIWFAKQRAIRSGIGVTANGDLLIVTVHNRVGG 715 Query: 204 ----FYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVT 246 + A + +L + L LDG S + GG + + Sbjct: 716 PGPDLTELAQLIQ-QLGAVEGLNLDGGSSTSLILGGHLLNRTADTAA 761 >UniRef50_UPI0001BC335A hypothetical protein BcroD2_01203 n=1 Tax=Butyrivibrio crossotus DSM 2876 RepID=UPI0001BC335A Length = 366 Score = 85.9 bits (211), Expect = 1e-15, Method: Composition-based stats. Identities = 34/209 (16%), Positives = 68/209 (32%), Gaps = 31/209 (14%) Query: 51 NPQTERVKMY--WQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESY----APLGLY 104 +P V W N +G L ++ + +NGG Y P GL Sbjct: 139 DPSKVSVATIYPWSDENKSKYGV---TLGELVTNAGAIAGINGGEYCSDGNWGGRPKGLV 195 Query: 105 IENGQQKVALNLASGEGNFFIRPGGVFYVAG-DKVGIVRLDAFKTSKEIQFAVQS----- 158 + NG+ + + G+ + + + + + +++ ++ I+ V Sbjct: 196 VSNGELQYN-SPQWGDVMVGFNEDNILVIKDLNGMSVGQIEEMVKTERIRDCVSFKDIDD 254 Query: 159 -----GPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA-------TNFYD 206 L+ NG + I+ + + + R +G G + ++ D Sbjct: 255 GDSNHFTKLIING-VATEINGSGSGANPRTCIGQRADGTVLMFVTDGRGASGHIGATAAD 313 Query: 207 FACYAKAKLNVEQLLYLDGT-ISHMYMKG 234 K + +DG S MY KG Sbjct: 314 LISVMK-EYGAVNAANIDGGSSSSMYYKG 341 >UniRef50_B8HPJ4 Putative uncharacterized protein n=2 Tax=Cyanothece sp. PCC 7425 RepID=B8HPJ4_CYAP4 Length = 338 Score = 85.5 bits (210), Expect = 1e-15, Method: Composition-based stats. Identities = 37/219 (16%), Positives = 68/219 (31%), Gaps = 18/219 (8%) Query: 27 PLFAVAADDCALSDPTL-TVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQV 85 P V + L+ P L +N T +K + L ++ + + Sbjct: 53 PFVGVTYINRRLTSPRLLNQHIVLINLATTGLKFRVTSPAADGSTALEKTIS-FTRRSKA 111 Query: 86 QMAMNGGIY----DESYAPLGLYIENGQQKVA-LNLASGEGNFFIRPGGVFYVAGDKVGI 140 Q+ +NG + LGL +G+ + N G NF F +G Sbjct: 112 QIGINGNFFQALSSTRAKVLGLAASSGRVYSSWSNGYQGAINFSSNRTATFVTPPSGLG- 170 Query: 141 VRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQ 200 T + + P+L++NG N R+ +G+ ++ + Sbjct: 171 TTTVPLLTPYNLVSGL---PVLVKNGQNVTVGVANPNEYAARSVIGLTQNQQLLLFAVDG 227 Query: 201 A-------TNFYDFACYAKAKLNVEQLLYLDGTISHMYM 232 N + A + V + LDG S + Sbjct: 228 PRSNVSTGMNQIELADLLISDFKVVHAVNLDGGGSSTLV 266 >UniRef50_UPI0000E45D54 PREDICTED: similar to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase precursor n=2 Tax=Strongylocentrotus purpuratus RepID=UPI0000E45D54 Length = 447 Score = 85.5 bits (210), Expect = 2e-15, Method: Composition-based stats. Identities = 37/220 (16%), Positives = 59/220 (26%), Gaps = 27/220 (12%) Query: 41 PTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE-SYA 99 VN V + +G A + Q +A+N G ++ S A Sbjct: 102 ERAPGHIVRVNSPARTVSVLEPFDSGGCTNHHRATVDSTAKQDNCLVAVNAGFFNPRSGA 161 Query: 100 PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSG 159 G + NG+ N + IR G + IQ G Sbjct: 162 CYGNVVSNGRLV-QTNGGLQNAHLGIRADGTLVFGYLSE---ENVLQTENPFIQLVGGVG 217 Query: 160 PMLMENGVINPRIHPNVA---------------SSKIRNGVGINKHGNAVFLLSQQ---- 200 L+ +G I R VG ++ G V + Sbjct: 218 W-LLRDGEIYVEESKKAECGDTEEASSVDLFFNMLSARVAVGSDEKGRLVIAVIDGQTLK 276 Query: 201 -ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPW 239 + FA + + V + DG S ++ G I Sbjct: 277 RGLSLLSFAKWLLSH-GVTNAINFDGGGSATFVVNGTIVN 315 >UniRef50_B7KAU9 Putative uncharacterized protein n=7 Tax=Chroococcales RepID=B7KAU9_CYAP7 Length = 644 Score = 85.1 bits (209), Expect = 2e-15, Method: Composition-based stats. Identities = 36/237 (15%), Positives = 65/237 (27%), Gaps = 43/237 (18%) Query: 52 PQTERVKMYWQKANGEAWGTLHALLADINSQGQ--VQMAMNGGIYDES-----------Y 98 P R + W G L I + G + +N G Y Sbjct: 395 PILNRGAIAWNDRGQVKMGRLRLQETVITNGGNRLPVLYLNSGYVQSGMARYTRDWGATY 454 Query: 99 APLG-----LYIENGQQKVALNLASGEGNFFIRPGG--VFYVAGDKV-----GIVRLDAF 146 PL + ++N Q N P + + + V I Sbjct: 455 TPLSDDELIITVQNNQVISQRQGGKAGQNVIPIPNDGYLLAIRKNSVPASALTIGTSLNL 514 Query: 147 KTSK------EIQFAVQSGPMLMENGVIN-----PRIHPNVASSKI-RNGVGINKHGNAV 194 ++ + +GP+L+ NG I + + K R+ + + G + Sbjct: 515 ESGTIPADFNNYPHILGAGPLLLLNGQIVLDVASEQFSKGFQNQKASRSAIATTRDGKLM 574 Query: 195 FLLSQQAT-----NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVT 246 + + + A ++ L L LDG S GG + + Sbjct: 575 VVAVHNRVGGSGASLPELAQILQS-LGAVDALNLDGGSSTSLALGGQLIDRSPVTAA 630 Score = 46.6 bits (109), Expect = 9e-04, Method: Composition-based stats. Identities = 16/99 (16%), Positives = 32/99 (32%), Gaps = 2/99 (2%) Query: 23 LTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQ 82 +T P L + V ++ ++ + + +N + + I Sbjct: 304 ITWTPGLIWRQKIIPLKGDSFPVTWLDIDLKSPNIFLKPVTSNPDTLEG-TEPIVTIGRN 362 Query: 83 GQVQMAMNGGIYDESYA-PLGLYIENGQQKVALNLASGE 120 A+NGG ++ + PLG N + L G Sbjct: 363 TTASAAINGGFFNRNNRLPLGAIRTNNRWVSGPILNRGA 401 >UniRef50_UPI0001C43112 hypothetical protein BpOF4_05820 n=1 Tax=Bacillus pseudofirmus OF4 RepID=UPI0001C43112 Length = 762 Score = 84.7 bits (208), Expect = 3e-15, Method: Composition-based stats. Identities = 28/158 (17%), Positives = 53/158 (33%), Gaps = 25/158 (15%) Query: 113 ALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAF------KTSKEIQFAVQSGPMLMENG 166 N + F I G + V + ++ +F + +GP L+ NG Sbjct: 249 GANATIPKDGFVISANGGPFRDALTGVSVGDELTVEASINDAWRDAEFILATGPTLVRNG 308 Query: 167 VINPRIHPNVA---SSKIRNGVGINKHGNAVFLLS--------QQATNFYDFACYAKAKL 215 + + + R VG + G +FL++ + A Y ++ + Sbjct: 309 QTSISMSTSSPFARERAPRTAVGASSDGTKLFLVTIDGRQSGYSNGVTIPELAAYMRS-I 367 Query: 216 NVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVERK 253 + LDG S + RYP+ +SV + Sbjct: 368 GAHNAINLDGGGSTTMV-------ARYPWADHVSVVNR 398 Score = 42.0 bits (97), Expect = 0.022, Method: Composition-based stats. Identities = 22/108 (20%), Positives = 43/108 (39%), Gaps = 6/108 (5%) Query: 45 VQAYTVNPQTER--VKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE-SYAPL 101 VQ V + +++Y+ G T +A+ +V A+N Y+ + P+ Sbjct: 69 VQVLDVQYRNPNVGLELYYPTPIGRVQTTSQQAMANTYENHRVVGAVNASFYNMSNGMPV 128 Query: 102 GLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTS 149 L +EN + L++ +G F ++ G + L F+T Sbjct: 129 NLLVENNKILNYGVLSNDQGGPV---NAPFAFGVNRNGALTLTDFQTK 173 >UniRef50_B0CAS6 Putative uncharacterized protein n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0CAS6_ACAM1 Length = 279 Score = 84.3 bits (207), Expect = 3e-15, Method: Composition-based stats. Identities = 36/224 (16%), Positives = 66/224 (29%), Gaps = 45/224 (20%) Query: 44 TVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGL 103 TV + P R + +G T+ +NGG +D + Sbjct: 34 TVHVLRI-PNHPRYTVRLDVVDG--LQTVADFAQGTPKP---VAVINGGYFDPANQLTTS 87 Query: 104 YIENGQQKVALNLASGEGNFFIRPGGVFYV--------------AGDKVGIVRLDAFKTS 149 YI G Q +A + P Y+ + Sbjct: 88 YIRRGGQILADPTQNSR--LVDNPDLKVYLPKILNRSEFRQYQCGAKTTYAITSYNQPIP 145 Query: 150 KE--IQFAVQSGPMLM-------------ENGVINPRIHPNVASSKIRNGVGINKHGNAV 194 + + +A+ +GP L+ +G + R R+ VGI G + Sbjct: 146 PDCTLNYALGAGPQLLPQLTSQAEGFTDSVDGQVI-RDAIGSRQPNARSAVGITDKGEVI 204 Query: 195 FLLSQQ------ATNFYDFACYAKAKLNVEQLLYLDGTISHMYM 232 ++L +Q + + A + + + L LDG S + Sbjct: 205 WVLVEQQSATKPGLSLPELADFMEQQ-GAASALNLDGGSSSSLV 247 >UniRef50_UPI0001C3370C hypothetical protein UCYN_10670 n=1 Tax=cyanobacterium UCYN-A RepID=UPI0001C3370C Length = 438 Score = 83.9 bits (206), Expect = 4e-15, Method: Composition-based stats. Identities = 23/129 (17%), Positives = 43/129 (33%), Gaps = 12/129 (9%) Query: 131 FYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVI-----NPRIHPNVA-SSKIRNG 184 + G + I K ++ + GP+L+ +G I + + + R+ Sbjct: 301 LFFIGSTLKIESKTVPKKFNQLSHILGGGPLLINDGSISLNVKDEKFTKSFQKQKASRSA 360 Query: 185 VGINKHGNAVFLLSQQ-----ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPW 239 +GI + + N + A + KL L LDG S + GG + Sbjct: 361 IGITNKDKTILVTVHNSINSNGVNLNEMAQIMQ-KLGSINALNLDGGGSTSLVLGGRLID 419 Query: 240 QRYPFVTMI 248 + I Sbjct: 420 RFPVTAAKI 428 Score = 45.0 bits (105), Expect = 0.002, Method: Composition-based stats. Identities = 18/94 (19%), Positives = 36/94 (38%), Gaps = 4/94 (4%) Query: 43 LTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYA-PL 101 V ++ ++ +V + + + L +I + +V A+NGG ++ + PL Sbjct: 121 FPVNLLEIDNKSSKVILRPIT-SNLNGQIGTSSLEEIAKKWRVVAAINGGFFNRNNRLPL 179 Query: 102 GLYIENGQQKVALNLASGEGNFFIRPGGVFYVAG 135 G N + L G G G F++ Sbjct: 180 GAIRHNNDWLSSPIL--GRGAVGWNENGKFFIDH 211 >UniRef50_Q9UK23 N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase n=29 Tax=Chordata RepID=NAGPA_HUMAN Length = 515 Score = 83.9 bits (206), Expect = 4e-15, Method: Composition-based stats. Identities = 33/224 (14%), Positives = 64/224 (28%), Gaps = 33/224 (14%) Query: 38 LSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADI---NSQGQVQMAMNGGIY 94 D + E ++ + G G A + ++A NGG + Sbjct: 85 FRDRAVAGHLTR---AVEPLRTFSVLEPGGPGGCAARRRATVEETARAADCRVAQNGGFF 141 Query: 95 DES-YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQ 153 + LG + + ++ + F IR G + + T Sbjct: 142 RMNSGECLGNVVSDERRVSSSGGLQNA-QFGIRRDGTLVTG----YLSEEEVLDTENPFV 196 Query: 154 FAVQSGPMLMENGVINP---------------RIHPNVASSKIRNGVGINKHGNAVFLLS 198 + L+ NG I V R +G ++ G V + Sbjct: 197 QLLSGVVWLIRNGSIYINESQATECDETQETGSFSKFVNVISARTAIGHDRKGQLVLFHA 256 Query: 199 QQ-----ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAI 237 N ++ A + + +V + LDG S ++ G + Sbjct: 257 DGHTEQRGINLWEMAEFLLKQ-DVVNAINLDGGGSATFVLNGTL 299 >UniRef50_C0CND1 Putative uncharacterized protein n=1 Tax=Blautia hydrogenotrophica DSM 10507 RepID=C0CND1_9FIRM Length = 454 Score = 83.9 bits (206), Expect = 5e-15, Method: Composition-based stats. Identities = 30/186 (16%), Positives = 59/186 (31%), Gaps = 20/186 (10%) Query: 66 GEAWGTLHALLADINSQGQVQMAMNGGIYD-ESYAPL-GLYIENGQQKVALNLASGEGNF 123 G +G ++ + + +NG + S P G + + ++G Sbjct: 233 GGTYGNPRRTVSQELADHNGVLGINGSGFSYSSGIPAPGKSMIKDRTVYEDVYSNGNIMC 292 Query: 124 FIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGV---INPRIHPNVASSK 180 GG+F G+ + + + + GP L+ENG I+ + Sbjct: 293 VTGEGGMFTAP---AGMTVQEMLQRDVKDTYC--FGPTLVENGEAFEISEQFQQTY--RY 345 Query: 181 IRNGVGINKHGNAVFLLSQQ-------ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMK 233 R VG+ G+ ++ + + L+ E LDG S + Sbjct: 346 QRTAVGMISPGDYYLVIVDGKGVGGSQGMTYEELQQVFLD-LDCEYAYNLDGGGSTTLVF 404 Query: 234 GGAIPW 239 G + Sbjct: 405 KGRVIN 410 >UniRef50_B1I1S0 N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase-like exopolysaccharide biosynthesis protein n=1 Tax=Candidatus Desulforudis audaxviator MP104C RepID=B1I1S0_DESAP Length = 345 Score = 83.9 bits (206), Expect = 5e-15, Method: Composition-based stats. Identities = 35/223 (15%), Positives = 65/223 (29%), Gaps = 35/223 (15%) Query: 37 ALSDPTLTVQAYTVNPQTER-VKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYD 95 L V P +++ +++ GE + G V GG Y Sbjct: 122 ELKGIGYRGYIAKVKPFDPGVLRVTYREGPGET------TSEAVRRTGAVLGVNGGGFYR 175 Query: 96 ------ESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFK-- 147 P+G + +G+ G F +F+ D G + F Sbjct: 176 APVDGLMHTLPIGNTMVDGKLV---------GGFQPPREDLFFAGFDGRGRLVGGIFNDR 226 Query: 148 ---TSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA--- 201 + V P+L+++ P + R +G +G+ + ++ Sbjct: 227 TALLGTGARQGVSFVPILIKDRQPVPIPEKWRNQRQPRTILGEYANGDLIMIVVDGRQAD 286 Query: 202 ----TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQ 240 D K V LDG S +++ G I + Sbjct: 287 WSSGVTLEDL-QVTLIKFGVIDAYNLDGGGSSVFVFGNQILNR 328 >UniRef50_B3RIP6 Putative uncharacterized protein (Fragment) n=2 Tax=Trichoplax adhaerens RepID=B3RIP6_TRIAD Length = 344 Score = 83.6 bits (205), Expect = 6e-15, Method: Composition-based stats. Identities = 38/205 (18%), Positives = 65/205 (31%), Gaps = 31/205 (15%) Query: 48 YTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYD-ESYAPLGLYIE 106 +P + Q G L + +A N G ++ E+ G I Sbjct: 12 VVEDPLRTISVLEPQNTGGCNMSKLSTVADTARKAH-CYVAENAGFFNTETGGCYGNIIS 70 Query: 107 NGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPM-LMEN 165 NG+ N+ + NF IR G V G + + + + SG + L+ N Sbjct: 71 NGRLVRLTNVQNV--NFGIRKNGSIIV-----GYLTEEEILDKENPFVQLVSGVIWLVRN 123 Query: 166 GVINPR---------------IHPNVASSKIRNGVGINKHGNAVFLLSQQ-----ATNFY 205 G + + + R +G +++GN + + + N Y Sbjct: 124 GKSYVKESMKMESNKHEETGTLKQFIEVKSARTAIGHDRNGNVMLMQIEGQTNARGLNLY 183 Query: 206 DFACYAKAKLNVEQLLYLDGTISHM 230 DFA + LDG S Sbjct: 184 DFAKKLIKS-GFVNAINLDGGGSST 207 >UniRef50_C9LSB3 Putative secreted protein n=1 Tax=Selenomonas sputigena ATCC 35185 RepID=C9LSB3_9FIRM Length = 475 Score = 83.2 bits (204), Expect = 9e-15, Method: Composition-based stats. Identities = 17/97 (17%), Positives = 36/97 (37%), Gaps = 13/97 (13%) Query: 155 AVQSGPMLMENGVIN-----PRIHPNVA-SSKIRNGVGINKHGNAVFLLSQQAT------ 202 + +GPML+++G+ + P++A R G+ G+ + + Sbjct: 361 VIGAGPMLVKDGIAHVTATEEEFPPDIARGRAPRTAFGVTAEGHYLLAVVDGRQPHSIGC 420 Query: 203 NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPW 239 + A + + Q + DG S + GG + Sbjct: 421 TLQEMAEFML-QFGAVQAINFDGGGSSALVVGGELEN 456 >UniRef50_P74396 Slr0280 protein n=1 Tax=Synechocystis sp. PCC 6803 RepID=P74396_SYNY3 Length = 610 Score = 82.8 bits (203), Expect = 9e-15, Method: Composition-based stats. Identities = 26/136 (19%), Positives = 49/136 (36%), Gaps = 17/136 (12%) Query: 127 PGGVFYVAG--DKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNV------AS 178 P GV V + G +AF + +GP+L++ G + Sbjct: 471 PAGVLAVGTTLNVNGRSTPEAFNAFPN---GMGAGPLLIDQGRMVLNATGEGFSSAFQQQ 527 Query: 179 SKIRNGVGINKHGNAVFLLSQQAT-----NFYDFACYAKAKLNVEQLLYLDGTISHMYMK 233 R+ + ++++GN + + S + +FA + +L L LDG S Sbjct: 528 RASRSAIAVDRNGNIILVASHNRVGGAGASLGEFAQILQ-QLGAVNALNLDGGSSTSLAL 586 Query: 234 GGAIPWQRYPFVTMIS 249 GG + + +S Sbjct: 587 GGQLLDRSPVTAARVS 602 Score = 42.3 bits (98), Expect = 0.016, Method: Composition-based stats. Identities = 16/76 (21%), Positives = 28/76 (36%), Gaps = 2/76 (2%) Query: 28 LFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQM 87 +S V T+NP++ + + AN A L I + + Sbjct: 278 GITWQQRFVNISGGQFPVTTVTINPRSPGISLRPLMANP-TMAQGTAPLVTIARDQRAAV 336 Query: 88 AMNGGIYDESYA-PLG 102 A+N G ++ + PLG Sbjct: 337 AINAGFFNRNNQLPLG 352 >UniRef50_B9YC35 Putative uncharacterized protein n=2 Tax=Holdemania filiformis DSM 12042 RepID=B9YC35_9FIRM Length = 368 Score = 82.8 bits (203), Expect = 1e-14, Method: Composition-based stats. Identities = 29/218 (13%), Positives = 61/218 (27%), Gaps = 26/218 (11%) Query: 36 CALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIY 94 L T + V +P V +G ++ + MN G + Sbjct: 147 IDLKGTTFEGKLMIVHDPSRVFVACNPNMDSGAPGYSVEKYIEL----NDAIAGMNAGGF 202 Query: 95 DESY------APLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKT 148 +++ G+ I +G+ + + I V Sbjct: 203 EDAGGNGNGGTAYGIVIHDGKLISG-SPSEFTPVIGINNANQLVVGD------MTAQQAL 255 Query: 149 SKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQAT------ 202 +I+ AV GP+ ++N + + R +G G + ++ Sbjct: 256 DYDIRDAVTFGPVFIKNWEVVFESGRH-PGLNPRTVIGQRYDGAFLLMVLDGRQPSSFGS 314 Query: 203 NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQ 240 + D + + + LDG S + + G Sbjct: 315 TYQDIIDIMQ-QYDAVNAANLDGGNSTVMVYDGETLNT 351 >UniRef50_C6CV17 Exopolysaccharide biosynthesis protein n=1 Tax=Paenibacillus sp. JDR-2 RepID=C6CV17_PAESJ Length = 355 Score = 82.4 bits (202), Expect = 1e-14, Method: Composition-based stats. Identities = 35/217 (16%), Positives = 59/217 (27%), Gaps = 19/217 (8%) Query: 38 LSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDES 97 + A + ++ + G LA + G V GG D Sbjct: 132 IKADNFQSYAMKIKLKSGDAMKMVLGND--KVGGAETTLAAVQRYGAVAGVNAGGFADGG 189 Query: 98 YA--PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFA 155 PL I NG + FF+ + G K + ++F Sbjct: 190 GKRYPLSTTILNGDYVEGFEP-TRADLFFVGLNASNKLVGGKF---TSKQQLDNLNVKFG 245 Query: 156 VQSGPMLMENGVI--NPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA-------TNFYD 206 P+L++NG P + + R + K G + +++ + Sbjct: 246 ASFVPVLLKNGSPTTIPSKWQSSPTRAPRTVIANYKDGQLLIIVADGRNEGGSSGATLAE 305 Query: 207 FACYAKAKLNVEQLLYLDGTI-SHMYMKGGAIPWQRY 242 +L LDG S M G I Sbjct: 306 M-QILLQRLGAVDGYNLDGGGSSSMIWNGRVINKPSD 341 >UniRef50_B8HPB3 Putative uncharacterized protein n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HPB3_CYAP4 Length = 304 Score = 82.4 bits (202), Expect = 1e-14, Method: Composition-based stats. Identities = 43/275 (15%), Positives = 78/275 (28%), Gaps = 49/275 (17%) Query: 5 LLIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKA 64 L+IG G+I + T L + Y++ + Sbjct: 11 LVIGLGLIGSTTACTQTSTTASSAPVAPTPPQPLQ-----YKVYSLPHSKIHTLVI---P 62 Query: 65 NGEAWGTLH------ALLADINSQGQVQMAMNGGIYDE-SYAPLGLYIENGQQK-VALNL 116 G + LA Q Q +NGG +D + + G+Q + Sbjct: 63 AGSTYEVTAAIAPDVQPLATFAQQHQAIAVLNGGFFDPVNGKSTSHVVLAGKQVANPQDN 122 Query: 117 ASGEGNFFIRP-----------GGVFYVAGDKVGIVR-LDAFKTSKEIQFAVQSGPMLME 164 N + P + I R E+ A+ +GP L+ Sbjct: 123 ERLIQNPDLIPYLPLILNRSELRNYRCAGQIRYEISRHDKPIPPGCELLMALGAGPQLLP 182 Query: 165 N------------GVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQ--------QATNF 204 G R R+ +G+ G+ V+L+ + Sbjct: 183 QNTSVQEGFMAYSGETITRDSLGSLYPNARSAIGLKADGSLVWLMVAERSDANQPGGLSL 242 Query: 205 YDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPW 239 + A + ++ L V + + LDG S + G + Sbjct: 243 PELAQFMQS-LGVVKGMNLDGGSSASFYYQGQTHF 276 >UniRef50_Q67T45 Putative uncharacterized protein n=1 Tax=Symbiobacterium thermophilum RepID=Q67T45_SYMTH Length = 921 Score = 82.4 bits (202), Expect = 1e-14, Method: Composition-based stats. Identities = 20/108 (18%), Positives = 42/108 (38%), Gaps = 10/108 (9%) Query: 132 YVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHG 191 ++ K G +++ S + +A+ L+ +G + + + AS + R+ VG + G Sbjct: 240 FLDPLKPGDPVTVSYRPSPAVAWAIGGQNYLVRDGAVVSGL--DNASRRPRSAVGFSADG 297 Query: 192 -NAVFLLSQQ------ATNFYDFACYAKAKLNVEQLLYLDGTISHMYM 232 L+ + + A + K+ L LDG S + Sbjct: 298 RRMYLLVIEGDSSRSVGATLAEMAAFMKS-FGAANALELDGGGSSTIV 344 >UniRef50_A3DHF5 Ig-like, group 2 n=3 Tax=Clostridium thermocellum RepID=A3DHF5_CLOTH Length = 929 Score = 82.4 bits (202), Expect = 1e-14, Method: Composition-based stats. Identities = 24/149 (16%), Positives = 45/149 (30%), Gaps = 24/149 (16%) Query: 104 YIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSK------------- 150 ++NG + G I G ++ L FK Sbjct: 213 VVDNGTVV---EIRQGLPAVEIPQNGYVIISRGANAQFLLQHFKVGDPVEISFSTVLDWQ 269 Query: 151 EIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGN-AVFLLSQQA------TN 203 +I+ AV +L+++G I + ++ R G +K G + + Sbjct: 270 KIEMAVTGSAILVKDGQIPEKFSYEISGVHPRTAAGTSKSGKELILVTVDGRQAASKGMT 329 Query: 204 FYDFACYAKAKLNVEQLLYLDGTISHMYM 232 + A + L + LDG S + Sbjct: 330 QRELANLMLS-LGAYNAINLDGGGSTSMV 357 >UniRef50_B8G1I8 Peptidase M56 BlaR1 n=4 Tax=Desulfitobacterium hafniense RepID=B8G1I8_DESHD Length = 747 Score = 82.0 bits (201), Expect = 2e-14, Method: Composition-based stats. Identities = 35/218 (16%), Positives = 58/218 (26%), Gaps = 31/218 (14%) Query: 47 AYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIY---DESYA--PL 101 +P+ + + E L ++ G + GGIY +E P Sbjct: 530 MLISDPKRVTLAV-----TEEIGTVEEKLTDMVSRSGAIAGINAGGIYLSLEEGNEVFPD 584 Query: 102 GLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPM 161 G+ ++NG+ + G V + K IQ V P Sbjct: 585 GITVQNGEVVYNNAGDQAVEFIGLDAEGKLITGPMNVQEI------KEKNIQEGVGFSPP 638 Query: 162 LMENGVINPRIH------PNVASSKIRNGVGINKHGNAVFLLSQQA-------TNFYDFA 208 L +NG R R G+G G +F++ D Sbjct: 639 LADNGTTLVREGKPAVPGDGGWGIAPRAGIGQRADGTLIFMVIDGRDPDWSIGATLKDME 698 Query: 209 CYAKAKLNVEQLLYLDGTIS-HMYMKGGAIPWQRYPFV 245 + + + L G M G + F Sbjct: 699 NLFL-EYGAVEAVNLSGGSMVEMVYDGKVLNKVSNIFG 735 >UniRef50_UPI0001C16068 conserved hypothetical protein n=2 Tax=Nostocaceae RepID=UPI0001C16068 Length = 613 Score = 82.0 bits (201), Expect = 2e-14, Method: Composition-based stats. Identities = 35/244 (14%), Positives = 65/244 (26%), Gaps = 57/244 (23%) Query: 52 PQTERVKMYWQKANGEAWGTLHALLADI------NSQGQVQMAMNGGIYD---------- 95 P R + W GE + +L + + +N G Sbjct: 367 PILNRGAIAWNYQ-GEFYFGRLSLNETLIVDQDNKQTSLPVLFLNSGYVQNGIARYTFAW 425 Query: 96 -ESYAPLG-----LYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGI--------- 140 +Y PL + ++NG+ + G I G + Sbjct: 426 GPNYVPLTNNETIITVQNGKI---TKQSPPGGAISIPGDGYLLILRGTAVSKTSLLSVGT 482 Query: 141 -------VRLDAFKTSKEIQFAVQSGPMLMENGVIN-----PRIHPN-VASSKIRNGVGI 187 F T I + +GP+L++N I + + +R+ + Sbjct: 483 KVNLESSTTPGEFNTYPHI---IGAGPLLIQNQRIVVDAKAEKFSQAFIKERAVRSAICT 539 Query: 188 NKHGNAVFLLSQQAT-----NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRY 242 + N + + A + K+ L LDG S GG + + Sbjct: 540 TNNDNLILAAVNNRVGGWGPTLEEHAQLMQ-KIGCTNALNLDGGSSTSLYLGGQLLDRFP 598 Query: 243 PFVT 246 Sbjct: 599 NTAA 602 Score = 46.2 bits (108), Expect = 0.001, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 58/200 (29%), Gaps = 34/200 (17%) Query: 23 LTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQ 82 +T L + V +N +T + + N + L + Sbjct: 276 ITWSKGLRWQQKFINLDKDSFPVVWLEINRKTSGLNLQPILPNPQTQTGTAPLTLT-AQR 334 Query: 83 GQVQMAMNGGIYDESYA-PLGLYIENGQQKVALNLASG------EGNFF---IRPGGVFY 132 A+NGG ++ + PLG +N Q L G +G F+ + Sbjct: 335 YSAMAAINGGYFNRNNQLPLGAVRQNDQWISGPILNRGAIAWNYQGEFYFGRLSLNETLI 394 Query: 133 VAGDK-----VGIVRLDAFKTSKEIQFAVQSGP-----------MLMENGVINPRIHPNV 176 V D + + + ++ GP + ++NG I + P Sbjct: 395 VDQDNKQTSLPVLFLNSGYVQNGIARYTFAWGPNYVPLTNNETIITVQNGKITKQSPPGG 454 Query: 177 ASSKIRNGVGINKHGNAVFL 196 + I G + L Sbjct: 455 -------AISIPGDGYLLIL 467 >UniRef50_A0YL57 Putative uncharacterized protein n=1 Tax=Lyngbya sp. PCC 8106 RepID=A0YL57_9CYAN Length = 620 Score = 81.6 bits (200), Expect = 3e-14, Method: Composition-based stats. Identities = 22/106 (20%), Positives = 36/106 (33%), Gaps = 12/106 (11%) Query: 154 FAVQSGPMLMENGVIN-----PRIHPN-VASSKIRNGVGINKHGNAVFLLSQQAT----- 202 + +GP+L+++G I R IR+ VG + + Sbjct: 504 QILAAGPLLLQSGEIVLDAPSERFSEAFSNQQAIRSAVGRTPDNKLLLVAVHNRPLGSGP 563 Query: 203 NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMI 248 N + A + KL + L LDG S GG + + I Sbjct: 564 NLTELAQILQ-KLGAVEALNLDGGSSTSLYLGGELIDRPAQTAAPI 608 Score = 45.0 bits (105), Expect = 0.002, Method: Composition-based stats. Identities = 17/109 (15%), Positives = 35/109 (32%), Gaps = 11/109 (10%) Query: 22 ALTLLPLFAVAADDCALSDP---------TLTVQAYTVNPQTERVKMYWQKANGEAWGTL 72 + +P + + V ++ ++ + + + L Sbjct: 270 NILWMPGLRWRQQYIEIPNSQPTASSLPNRFPVFWLEIDLTAPQLSLKPILSRNTSRVGL 329 Query: 73 HALLADINSQGQVQMAMNGGIYDESYA-PLGLYIENGQQKVALNLASGE 120 LL S+ Q A+NGG ++ + PLG G+ + L G Sbjct: 330 APLLKT-ASRSQALAAINGGFFNRNTLFPLGAIRRQGRWLSSPILNRGA 377 >UniRef50_UPI00019088BB hypothetical protein RetlC8_25680 n=2 Tax=Rhizobium etli RepID=UPI00019088BB Length = 332 Score = 81.2 bits (199), Expect = 3e-14, Method: Composition-based stats. Identities = 33/203 (16%), Positives = 65/203 (32%), Gaps = 17/203 (8%) Query: 28 LFAVAA-DDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQ 86 F VA A + ++P R ++ + + + Sbjct: 91 GFEVAELPVLADGREVDRIFLSRIDPMRFRFVVHNASQGDK---GIDEWEHALPK---AV 144 Query: 87 MAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAF 146 + +NG YD P +I G + G FF + D Sbjct: 145 LIVNGSYYDMHGRPDTPFISEGVAMGPRQYDAKAGAFFADAASADIRD-----LTHQDWG 199 Query: 147 KTSKEIQFAVQSGPMLM-ENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA-TNF 204 A+ S P+L+ ++G + ++ R V + G + +++A + Sbjct: 200 SALAGATNAMVSYPLLIGDDGQTH--VNVKSRWLANRTFVAKDGSGRILIGTTKEAFFSL 257 Query: 205 YDFACYAK-AKLNVEQLLYLDGT 226 A + K + L+++ L LDG Sbjct: 258 DRLAEFLKASPLDLKVALNLDGG 280 >UniRef50_A4J956 Copper amine oxidase domain protein n=1 Tax=Desulfotomaculum reducens MI-1 RepID=A4J956_DESRM Length = 480 Score = 80.9 bits (198), Expect = 4e-14, Method: Composition-based stats. Identities = 27/137 (19%), Positives = 47/137 (34%), Gaps = 16/137 (11%) Query: 119 GEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENG-----VINPRIH 173 G G+ + GV G K ++ I+ + PML+E G +N + Sbjct: 211 GWGSSAGQLVGV--AEGTKARVITEMPEDWQ-NIRHVLTGSPMLVEGGLPVDQAVNEGLW 267 Query: 174 PNVASSKIRNGVGINKHGNAVFLLSQQA------TNFYDFACYAKAKLNVEQLLYLDGTI 227 +V R +G+ G + ++ + A L Q + LDG Sbjct: 268 GSVLKYSPRTALGVTAQGKVLLVVVDGRQESSAGLTLEEMAYLMID-LGAVQAVGLDGGG 326 Query: 228 SH-MYMKGGAIPWQRYP 243 S M++KG + Sbjct: 327 SSEMWVKGKIVNNPSDK 343 Score = 40.4 bits (93), Expect = 0.052, Method: Composition-based stats. Identities = 23/131 (17%), Positives = 41/131 (31%), Gaps = 17/131 (12%) Query: 24 TLLPLFAVAADDCAL------------SDPTLTVQAYTVNPQTERVKMYWQKANGEAWGT 71 L + A AAD A + V+P + ++ G Sbjct: 16 MLWAVPAWAADQLAKGVQYRSFERNNWEGKPIKGHILEVDPGVKYTEIR--PVMGNEVFG 73 Query: 72 LHALLADINSQGQVQMAMNGGIYDES-YAPLGLYIENGQQKVALNLASGEGNFFIRPGGV 130 L+ + + A+NGG +D PLG I +G+ + + +F + G Sbjct: 74 QRENLSKMAQRTGAIAAVNGGFFDMGSGVPLGNLIIDGKPEY--ISDILKTSFGFKTSGG 131 Query: 131 FYVAGDKVGIV 141 + I Sbjct: 132 LKLGYLAPKIT 142 >UniRef50_Q2BF40 Putative uncharacterized protein n=1 Tax=Bacillus sp. NRRL B-14911 RepID=Q2BF40_9BACI Length = 657 Score = 80.5 bits (197), Expect = 4e-14, Method: Composition-based stats. Identities = 27/116 (23%), Positives = 45/116 (38%), Gaps = 11/116 (9%) Query: 134 AGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVAS---SKIRNGVGINKH 190 + ++ K ++ + SGP+L+ NG ++ + PN R V I+K Sbjct: 254 KPGDTVEIAINIDDKWKNSEYMLASGPLLVNNGKVDLGMDPNSTRARERAPRTAVAIDKT 313 Query: 191 GNAVFLLS-QQA------TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPW 239 + VFL++ N +FA Y KL + L LDG S + Sbjct: 314 MSKVFLVTVDGRLAESKGMNLTEFAQYL-VKLGAYKALNLDGGGSTAIIARKNGND 368 >UniRef50_A8W171 Flagellar protein FliS n=1 Tax=Bacillus selenitireducens MLS10 RepID=A8W171_9BACI Length = 750 Score = 80.5 bits (197), Expect = 5e-14, Method: Composition-based stats. Identities = 31/145 (21%), Positives = 59/145 (40%), Gaps = 18/145 (12%) Query: 105 IENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQF-------AVQ 157 I N ++ N + G+ FI G + G GI D + +I+ + Sbjct: 229 ITNMKEYGRRNASPIPGDGFIISGHGNRLDGLLDGIRAGDDIEVKVDIEDRWKDAEMIMA 288 Query: 158 SGPMLMENGVINPRIHPNVAS---SKIRNGVGINKHGNAVFLLSQQA-------TNFYDF 207 +GP+L++NG ++ + + ++ R+G+GI+ GN +F+ F Sbjct: 289 TGPLLVQNGRVDITMSSSASTYSVPNPRSGIGIDAQGNTMFVTVDGRQSGYSQGMTIPQF 348 Query: 208 ACYAKAKLNVEQLLYLDGTISHMYM 232 A Y + + + LDG S + Sbjct: 349 ANYMRDQ-GAVMAINLDGGGSTTMV 372 >UniRef50_D2V2G1 N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase n=1 Tax=Naegleria gruberi RepID=D2V2G1_NAEGR Length = 558 Score = 80.5 bits (197), Expect = 5e-14, Method: Composition-based stats. Identities = 30/193 (15%), Positives = 56/193 (29%), Gaps = 36/193 (18%) Query: 62 QKANGEAWGTLHALLADINSQGQ----VQMAMNGGIYD-ESYAPLGLYIENGQQKVALNL 116 + G + + A + DI N G ++ + LG + +G+ Sbjct: 189 RDLRGGCYYNVTAPVRDIAKYHANGYFCHYTTNAGFFNTHKHTCLGNVVSDGRISH--VS 246 Query: 117 ASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNV 176 + NF I G +++ D ++ + L+ G + Sbjct: 247 TNHNVNFGITKDGKYFIGY-------TDENTKLEDFDQMISGVIWLVRKGESYVDESSKI 299 Query: 177 AS---------------SKIRNGVGINKHGNAVFLLSQQ------ATNFYDFACYAKAKL 215 R+ +G +K G V + Y+ A +L Sbjct: 300 EDMSIQETGNAKRFITVRASRSALGHDKEGRLVLVSIDGDGNHNKGPTLYELATLMI-EL 358 Query: 216 NVEQLLYLDGTIS 228 VE + LDG S Sbjct: 359 GVENAINLDGGGS 371 >UniRef50_A6L611 Putative uncharacterized protein n=1 Tax=Bacteroides vulgatus ATCC 8482 RepID=A6L611_BACV8 Length = 308 Score = 80.1 bits (196), Expect = 6e-14, Method: Composition-based stats. Identities = 33/231 (14%), Positives = 63/231 (27%), Gaps = 38/231 (16%) Query: 33 ADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGG 92 A ++S V V+ + + + L+ + + A+NG Sbjct: 51 AGYDSISQAHQNVDVLEVDLTSPSYDIQL------VYEEHGDSLSSVAERNNAAAAING- 103 Query: 93 IYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGI---VRLDAFKTS 149 +Y +I+ G + +A N + + G F D I D+ S Sbjct: 104 ----TYEAEASFIKIGGRLLAQNRLDSTHIRYWKHEGAFLFDDDNKNIDIRFASDSTFLS 159 Query: 150 KEIQFAVQSGPMLMENGVINP-RIHPNVAS-----------------SKIRNGVGINKHG 191 + PML++N NV R V + +H Sbjct: 160 HPAANILSGAPMLIDNNDPVGLNFTGNVEGMDLNKLDYEDFRRHQGVRHPRTAVALTEHK 219 Query: 192 NAVFLLSQQA------TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGA 236 + + + + + + L LDG S + Sbjct: 220 KLLLITVDGRSTQAAGMSANELTRFLLTYFCPQSALNLDGGGSTTMWIASS 270 >UniRef50_Q8RCE6 Putative uncharacterized protein n=5 Tax=Thermoanaerobacterales RepID=Q8RCE6_THETN Length = 815 Score = 79.7 bits (195), Expect = 9e-14, Method: Composition-based stats. Identities = 21/95 (22%), Positives = 34/95 (35%), Gaps = 10/95 (10%) Query: 140 IVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLS- 198 I F ++I+ AV G +L++ G I P + R +G K V +++ Sbjct: 280 ITTNPPF---EDIKMAVSGGTILVKGGKIYP-FTHEIKGYAARTAIGYTKDKRYVLMVTV 335 Query: 199 QQA----TNFYDFACYAKAKLNVEQLLYLDGTISH 229 + A + L L LDG S Sbjct: 336 DGPPYRGMTQEELASLMLS-LGAYDALNLDGGGST 369 Score = 42.3 bits (98), Expect = 0.015, Method: Composition-based stats. Identities = 10/93 (10%), Positives = 33/93 (35%), Gaps = 3/93 (3%) Query: 43 LTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYD-ESYAPL 101 + + ++ + + + + + + ++ + A+NG +D ++ + Sbjct: 85 ININILKIDLKDPYLDLSVIFSPSGIKERM--PIREMANSYGAVAAINGDFFDTKTGFVI 142 Query: 102 GLYIENGQQKVALNLASGEGNFFIRPGGVFYVA 134 G +++G F+I G Y+ Sbjct: 143 GATVKDGNLITDPASNGKMATFYIDKTGTPYID 175 >UniRef50_UPI000180BA0C PREDICTED: similar to predicted protein n=1 Tax=Ciona intestinalis RepID=UPI000180BA0C Length = 621 Score = 79.7 bits (195), Expect = 9e-14, Method: Composition-based stats. Identities = 35/186 (18%), Positives = 60/186 (32%), Gaps = 17/186 (9%) Query: 70 GTLHALLADINSQGQVQMAMNGGIYDESYAP-LGLYIENGQQKVALNLASGEGNFFIRPG 128 G + +GQ +A NGG ++ LG I G+ + + +F I Sbjct: 123 GNQLDTSVNAARRGQCFVAQNGGYFNTKTQSCLGNVISRGRTLHTSDATNA--HFGILSN 180 Query: 129 GVFYVAGDKVGIVRLDAF-KTSKEIQFAVQSGPMLMENGVINPRIHPNVAS--------S 179 G V +R F + + V++G +E V Sbjct: 181 GSIVVGYISDADLRRLNFTNLVGGVIWLVRNGTSFVEESVSMESSDTEETGTLRYFSDVQ 240 Query: 180 KIRNGVGINKHGNAVFLLSQQ-----ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKG 234 R +G +KHG V + N F+ + L++ + LDG S + Sbjct: 241 SARTAIGHDKHGWVVLVQVDGQTGARGVNLNSFSKFLIEDLHLVNAINLDGGGSATLVIN 300 Query: 235 GAIPWQ 240 G + Sbjct: 301 GTLANT 306 >UniRef50_Q5N4C8 Putative uncharacterized protein n=2 Tax=Synechococcus elongatus RepID=Q5N4C8_SYNP6 Length = 605 Score = 79.7 bits (195), Expect = 9e-14, Method: Composition-based stats. Identities = 25/167 (14%), Positives = 48/167 (28%), Gaps = 25/167 (14%) Query: 104 YIENGQQKVALNLASGEGN-FFIRPGGVF------------YVAGDKVGIVRLDAFKTSK 150 ++ + N F I G V G + +++ Sbjct: 421 TVQGDRVVSQSQADKAGSNRFTIPRNGYLIVLRSANSLRTSLVNGTTIQVLQQAQPSQFD 480 Query: 151 EIQFAVQSGPMLMENGVINPRIHPNVASSK------IRNGVGINKHGNAVFLLSQ----- 199 A+ GP+L+++G + S R+ +G+ G V + + Sbjct: 481 RFPHALGGGPLLVKSGRVVVNPQAEGFSRAFEIEAAPRSAIGLMPDGRLVLVAAHEQNQG 540 Query: 200 QATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVT 246 Q A + +L V L DG S + G + + Sbjct: 541 QGPTLPQMAAIMQ-QLGVVDALNFDGGSSTSLIVNGQLVNRARGSAA 586 Score = 47.0 bits (110), Expect = 7e-04, Method: Composition-based stats. Identities = 14/96 (14%), Positives = 30/96 (31%), Gaps = 2/96 (2%) Query: 26 LPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQV 85 L + + T V ++ + V++ A + +L + Sbjct: 263 LAGTQLQQRQVTVDGATFPVFVIQLDLRQPNVRLAPIWAGNGSLEG-TQVLQAVARDRGA 321 Query: 86 QMAMNGGIYDESYA-PLGLYIENGQQKVALNLASGE 120 +A+N G ++ + PLG + L G Sbjct: 322 AIAINAGFFNRNNRLPLGAIRRDNIWYSGPILNRGA 357 >UniRef50_A9NEV6 Hypothetical surface-anchored protein n=1 Tax=Acholeplasma laidlawii PG-8A RepID=A9NEV6_ACHLI Length = 520 Score = 79.3 bits (194), Expect = 1e-13, Method: Composition-based stats. Identities = 14/100 (14%), Positives = 32/100 (32%), Gaps = 15/100 (15%) Query: 147 KTSKEIQFAVQSGPMLMENGVINPRIHPNVAS------SKIRNGVGINKHGNAVFLLSQQ 200 + ++ A+ +G +L+++G + ++ S R +G G F++ Sbjct: 278 NGFENVRNAIGTGQLLVKDGAVQHAAFKSLPSNNMAHFRHPRTAIGQKADGTVFFIVVDG 337 Query: 201 A--------TNFYDFACYAKAKLNVEQLLYLDGTISHMYM 232 + + K LDG S + Sbjct: 338 RDALSGKYGVKYSELGELMKMH-GAVTAFNLDGGGSSTML 376 >UniRef50_B0C332 Putative uncharacterized protein n=2 Tax=Bacteria RepID=B0C332_ACAM1 Length = 306 Score = 79.3 bits (194), Expect = 1e-13, Method: Composition-based stats. Identities = 31/226 (13%), Positives = 62/226 (27%), Gaps = 28/226 (12%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQK--ANGEAWGTLHALLADINSQ 82 L S V ++ +++ + + +Q Sbjct: 40 LFAGITYQRQVYT-SPRPYIVHIAKIDLTHPGIRVIATPGQPADDDNEFRAQPTSAFLTQ 98 Query: 83 GQVQMAMNGGIYDESY--APLGLYIENGQQKVALNLASGEGNFFIRP---GGVFYVAG-D 136 ++Q+AMN G + P G + L + G + P V Sbjct: 99 FRLQLAMNAGYFYHFNEKTPWDYAPHTGGRVNVLGQSISMGQPYSPPQKQWPVLCFDQSQ 158 Query: 137 KVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPR--IHPNVASSKIRNGVGINKHGN-A 193 + IV + AV +L +P + + R+ +++ G Sbjct: 159 RGRIVATG--HCPSDTLHAVAGNYIL------HPDQPLQLDSDKPYARSIAALDQTGTTL 210 Query: 194 VFLLSQQ-------ATNFYDFACYAKAKLNVEQLLYLDGTISHMYM 232 ++ F D K ++ + L LDG S + Sbjct: 211 WLIVVDGKQPDYSEGATFADIEQLIK-QIGADIALNLDGGGSTTLV 255 >UniRef50_A6TVJ8 Exopolysaccharide biosynthesis protein n=2 Tax=Alkaliphilus RepID=A6TVJ8_ALKMQ Length = 942 Score = 78.5 bits (192), Expect = 2e-13, Method: Composition-based stats. Identities = 24/144 (16%), Positives = 48/144 (33%), Gaps = 12/144 (8%) Query: 96 ESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFA 155 P NG VA +G G + + +I+ A Sbjct: 219 RRRQPATGIPSNGYVLVASQTETGWGRAGHLFDNLKVGDRLTLHQEIQPNLN---QIELA 275 Query: 156 VQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGN-AVFLLSQQATNFY------DFA 208 + G +L+++G + +VA + R+ +GI++ + + +NFY + Sbjct: 276 LGGGTLLVKDGQA-AHLTQSVAGAHPRSAIGISRDRKQVILVTIDGRSNFYHGVDGRELG 334 Query: 209 CYAKAKLNVEQLLYLDGTISHMYM 232 L + +DG S + Sbjct: 335 NILLG-LGAHDAIIMDGGGSTTMI 357 >UniRef50_Q3AA51 Conserved domain protein n=1 Tax=Carboxydothermus hydrogenoformans Z-2901 RepID=Q3AA51_CARHZ Length = 356 Score = 78.2 bits (191), Expect = 2e-13, Method: Composition-based stats. Identities = 27/153 (17%), Positives = 52/153 (33%), Gaps = 24/153 (15%) Query: 100 PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSG 159 P G + G NL + I P ++ ++ A+ +++G Sbjct: 211 PQGFVLNTGSLCPPDNLLNSNVTLKIEP-------ENQENVLWSKAYA-------VLEAG 256 Query: 160 PMLMENGVINPRIHPNV-------ASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAK 212 P L++ G I S R+G+G+ K+ + + ++A + Sbjct: 257 PYLVKEGKIIADPLKENFTHYKIKDGSFARSGIGVTKNKKLLLVTV-NRATIKEWAIIMQ 315 Query: 213 AKLNVEQLLYLDGT-ISHMYMKGGAIPWQRYPF 244 KL + LDG S +Y+ G + Sbjct: 316 -KLGAYYAMNLDGGASSGLYVNGKYLTKPGRLL 347 Score = 40.4 bits (93), Expect = 0.053, Method: Composition-based stats. Identities = 24/113 (21%), Positives = 45/113 (39%), Gaps = 6/113 (5%) Query: 7 IGKGMITLNLKRIFLALTLLPLF-AVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKAN 65 I G I L +F L L V +++ TV+ V+ T+++KM A Sbjct: 6 IALGAILLVYIILFSQLALGANSYQVFEKKLKINNKNFTVKGVIVDLNTKKLKMQTVLAK 65 Query: 66 GEAWGTLHALLADINSQGQVQMAMNGGIY---DESYAPLGLYIENGQQKVALN 115 + G + +L + + + + +NG + D P G + +G+ N Sbjct: 66 NQ-IGQVESLESMVKRKKGLIG-INGAFFSAYDAYKEPYGNLMIDGRLIRKGN 116 >UniRef50_Q7X4R9 XcbC n=1 Tax=Neisseria meningitidis RepID=Q7X4R9_NEIME Length = 256 Score = 78.2 bits (191), Expect = 2e-13, Method: Composition-based stats. Identities = 30/239 (12%), Positives = 69/239 (28%), Gaps = 31/239 (12%) Query: 16 LKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHAL 75 + IF+ + A C + +N + + + + + Sbjct: 7 ILSIFILSFFNSEYTYAQSLCIQQSSQNHIHIAKINLNCKGINLIATQEADK-----GMT 61 Query: 76 LADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAG 135 ++ + + +A+NG + Y P GL I + + + Sbjct: 62 VSQFARKYRTDIAINGSFFRTGYFPFGLAITDHKTWDKTRDVQKRVFLACNRQNRCMIED 121 Query: 136 D----------KVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGV 185 K+ + +F + + P+ G + + + + R V Sbjct: 122 KNMVSKVDDSWKLAVSGWQSFNPATKKFECSDDDPV----GCTHIKFI----TKQPRTMV 173 Query: 186 GIN-KHGNAVFLLSQQAT------NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAI 237 G++ K ++ + A A L + + + LDG S +KG Sbjct: 174 GLDEKRNYLYLVVIDGRLPKFKGATLNELGQLA-ASLKLTKAINLDGGGSSTMVKGYNR 231 >UniRef50_Q8DHU5 Tll1850 protein n=1 Tax=Thermosynechococcus elongatus BP-1 RepID=Q8DHU5_THEEB Length = 575 Score = 78.2 bits (191), Expect = 3e-13, Method: Composition-based stats. Identities = 28/148 (18%), Positives = 49/148 (33%), Gaps = 21/148 (14%) Query: 106 ENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMEN 165 G VA N S NF G V + I V +GP+L+E Sbjct: 420 SEGFLLVARNFNSALANFPP---------GAAVQLETTAVPAAFNRIPNIVGAGPLLVEQ 470 Query: 166 GVIN-----PRIHPNVA-SSKIRNGVGINKHGNAVFLLSQQAT-----NFYDFACYAKAK 214 G + + + + R+ +G G+ V++ + ++A + Sbjct: 471 GRVVLNAALEQFGAGLDAQAAPRSAMGNRSDGSIVWVTTHNRIGGMGPTLAEWAQIV-HR 529 Query: 215 LNVEQLLYLDGTISHMYMKGGAIPWQRY 242 L + + LDG S GG + + Sbjct: 530 LGLINAVNLDGGSSTALYLGGVLVDRHG 557 Score = 52.0 bits (123), Expect = 2e-05, Method: Composition-based stats. Identities = 18/96 (18%), Positives = 31/96 (32%), Gaps = 2/96 (2%) Query: 26 LPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQV 85 P L V +NPQ +++ + + L + ++ + Q Sbjct: 241 APGLRWQQQTVILGTRQFPVDLLIINPQQPGLRLRPLEISPTTLVGLATVP-ELAQRWQA 299 Query: 86 QMAMNGGIYDESYA-PLGLYIENGQQKVALNLASGE 120 A+NGG ++ PLG G L G Sbjct: 300 AAAINGGFFNRDRQAPLGAIRREGNWLSGPILNRGA 335 >UniRef50_B1XK15 Putative uncharacterized protein n=1 Tax=Synechococcus sp. PCC 7002 RepID=B1XK15_SYNP2 Length = 595 Score = 78.2 bits (191), Expect = 3e-13, Method: Composition-based stats. Identities = 19/117 (16%), Positives = 37/117 (31%), Gaps = 16/117 (13%) Query: 142 RLDAFKTSKEIQFAVQSGPMLMENGVIN-----PRIHPNVA-SSKIRNGVGINKHGNAVF 195 ++F T I V GP+L++NG + + S R+ + + + Sbjct: 472 TPNSFATLPNI---VGGGPLLLKNGQVVLNGQAEQFSTAFNIQSASRSAIARTRDNKILL 528 Query: 196 LLSQQ------ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVT 246 + ++A + +L L LDG S G + + Sbjct: 529 VTLHGAAEETAGATLNEWANILR-RLGATDALNLDGGGSSALALGANLSDRHPTTAG 584 Score = 47.7 bits (112), Expect = 3e-04, Method: Composition-based stats. Identities = 25/179 (13%), Positives = 53/179 (29%), Gaps = 18/179 (10%) Query: 26 LPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQV 85 LP ++ A S + V ++P ++++ + L LL Q Sbjct: 256 LPGVQWRQENFAASSGPVRVTWLEIDPTQRQLQLKPITPDNNTIVGLAPLLIQ-ADTNQA 314 Query: 86 QMAMNGGIYDESYA-PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLD 144 A+N G ++ + PLG+ ++ + + I G Sbjct: 315 IAAINAGFFNRNNQYPLGI-VQGNRALRSG---------PILNRGAVAWDNAGRWEFDRL 364 Query: 145 AFKTSKEIQFAVQSGPMLMENGVINPR--IHPNVASSKIRNGVGINKHGNAVFLLSQQA 201 +T + G L+ +G + ++ S+ V V + Sbjct: 365 KVETDIVAGNGERVGVELINSGYVKAGAALYDRAWGSRYTTAV----DHEIVLTVMTSG 419 >UniRef50_UPI0001C164F4 hypothetical protein CRD_01886 n=2 Tax=Nostocaceae RepID=UPI0001C164F4 Length = 300 Score = 77.8 bits (190), Expect = 4e-13, Method: Composition-based stats. Identities = 40/230 (17%), Positives = 78/230 (33%), Gaps = 28/230 (12%) Query: 42 TLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLA------DINSQGQVQMAMNGGIYD 95 + ++ + + + N + + + ++ + +NG Sbjct: 56 GIPFYQTIIDLEDPNILLTIGLPNSANFANTISRTNGDENFDQLVARSGAAVVVNGTFAY 115 Query: 96 ESYAPL--GLYIENGQQKVALNLASGEGNFFIRPG-GVFYVAGDKVGIVRLDAFKTSK-- 150 + G + G+ S NF G GV G+K ++ + Sbjct: 116 TNPQKTVMGNLVAGGRSLK----YSPWENFGTTLGLGV----GNKPEMITARVEGRPEWN 167 Query: 151 EIQFAVQSGPMLMENGVI--NPRI----HPNVASSKIRNGVGINKHGNAVFLL-SQQATN 203 + F++ SGP L+ NG + NPR+ P V + +R +G ++ G +FL + Sbjct: 168 KHWFSITSGPRLLRNGEVSVNPRLEGFKDPAVLGTSLRTAIGFSEDGKRLFLANFDEKLY 227 Query: 204 FYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPW-QRYPFVTMISVER 252 + A KA + + + LDG S I +I V Sbjct: 228 LEEEAEAMKA-IGCYEAMNLDGGPSRALASDNVILVPPARKLTNVILVYD 276 >UniRef50_UPI00017896CA metallophosphoesterase n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI00017896CA Length = 2050 Score = 77.0 bits (188), Expect = 5e-13, Method: Composition-based stats. Identities = 28/168 (16%), Positives = 59/168 (35%), Gaps = 23/168 (13%) Query: 72 LHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVF 131 L + + S + M + + D+ +P+G GQ ++ + + ++ G Sbjct: 230 LDIVSGRVASGETLTMKVVSVLKDQGNSPIG----QGQVVLSASGSQRSKLAGLKAG--- 282 Query: 132 YVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHG 191 G + ++ ++ A+ ML+++GV+ + R VG G Sbjct: 283 --DEVTAGFQLDNEWQ---DVTMAIGGTVMLVKDGVVQ---QHTDPAVHPRTVVGTKADG 334 Query: 192 NAVFLLSQQA-------TNFYDFACYAKAKLNVEQLLYLDGTISHMYM 232 + V N+ + +L V L LDG S ++ Sbjct: 335 SVVLFEVDGRQPGFSEGLNYIELGE-MLQELGVVNALNLDGGGSATFV 381 Score = 50.8 bits (120), Expect = 4e-05, Method: Composition-based stats. Identities = 21/107 (19%), Positives = 42/107 (39%), Gaps = 6/107 (5%) Query: 27 PLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWG--TLHALLA-DINSQG 83 P + V +P +++ +G+ +G + + + + Sbjct: 70 PGATYTWANMQKGSGEQKVHMVEFDPSQGNLELQPGLTDGKVYGMQGVSKMASDADKAGN 129 Query: 84 QVQMAMNGGIYDE-SYAPLGLYIENGQQKVALNLASGEGNFFIRPGG 129 +V A+NG YD + PLGL++ +G+ + SG F I+ G Sbjct: 130 RVIAAVNGDFYDMSTGIPLGLFMGDGELL--TDPPSGRNAFGIKQDG 174 >UniRef50_B5Y710 Copper amine oxidase N-domain family n=2 Tax=Coprothermobacter proteolyticus DSM 5265 RepID=B5Y710_COPPD Length = 485 Score = 77.0 bits (188), Expect = 6e-13, Method: Composition-based stats. Identities = 31/162 (19%), Positives = 47/162 (29%), Gaps = 16/162 (9%) Query: 100 PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSG 159 P G I G +V G Y V I + +G Sbjct: 332 PDGYVIHLGGTEVRFKDRFEVGTRLS------YRDIYDVRNSSNPEMWQEGVIWGTLSAG 385 Query: 160 PMLMENGVI--NPR-----IHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAK 212 P L+ NG I +P I R+ +GI ++ + + D A K Sbjct: 386 PRLITNGEITLDPASELLDIPKITGQPLTRSALGITQNNELLMVTV-SKCTIQDLATIMK 444 Query: 213 AKLNVEQLLYLDGTIS-HMYMKGGAIPWQRYPFVTMISVERK 253 L + LDG S +Y G + + V + Sbjct: 445 D-LGAYNAMNLDGGASTSLYANGKFLATPTRKISNALMVLPR 485 >UniRef50_C6IEV9 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=C6IEV9_9BACE Length = 343 Score = 76.6 bits (187), Expect = 8e-13, Method: Composition-based stats. Identities = 31/246 (12%), Positives = 62/246 (25%), Gaps = 56/246 (22%) Query: 45 VQAYTVNPQTERVK----MYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIY---DES 97 + + + + + + G ++ + + +NGG++ Sbjct: 79 AYIAVADMSKAKFEVLGDIAFSQEANGYGGKSIHTPSEFYESSKAPVVINGGLFFYSAGF 138 Query: 98 YAPLGLYIENGQQKVALNL---ASGEGNFFIRPGGVFYVAGDKVGIVRLDA--------- 145 Y L I GQ ++ G + Sbjct: 139 YYSQNLVIREGQLLAPNQNYYSKDWVTMWYPTLGAFCQMKDGTFQTTWTYQASDGINYCY 198 Query: 146 -------------------FKTSKEIQFAVQS--GP-MLMENGVINPR-----IHPNVAS 178 F + A G +L+ G I + + AS Sbjct: 199 PAPADNDINKDPLQAPSSTFPNGAKALEATTGIGGVTVLLRAGEIKNTYVEEMLDISAAS 258 Query: 179 SKIRNGVGINKHGNAVFLLSQQA--------TNFYDFACYAKAKLNVEQLLYLDGTISH- 229 ++ R +GI + + + + + A K L + L LDG S Sbjct: 259 NQPRTAIGITTNKKMIIFVCEGRNMTEGVAGLTTANVAKVMKD-LGCTEALNLDGGGSSC 317 Query: 230 MYMKGG 235 M + G Sbjct: 318 MLVNGK 323 >UniRef50_B8I1Q9 Ig-like, group 2 n=3 Tax=Clostridium RepID=B8I1Q9_CLOCE Length = 952 Score = 76.2 bits (186), Expect = 8e-13, Method: Composition-based stats. Identities = 24/157 (15%), Positives = 45/157 (28%), Gaps = 27/157 (17%) Query: 103 LYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQF-------- 154 + +E+G N + + G + D F +++ Sbjct: 210 MVVEDG-IVKEFNENKPSMD--MPKNGFVVLGAGSHIQYLKDNFNVGDPVEYNITMNVDT 266 Query: 155 -----AVQSGPMLMENGVINPRIHPNVAS---SKIRNGVGINKHGNAVFLL-SQQA---- 201 A+ G ML+++ + N S R +G +K G + + Sbjct: 267 NNMKMALTGGAMLVKDDKVLTSFSHNPVSPSTRASRTAIGTSKDGKTLIVAAVDGRSSAS 326 Query: 202 --TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGA 236 + A Y +L L LDG S + Sbjct: 327 IGMTQSELASYM-HELGCANALNLDGGGSTTLVARKQ 362 >UniRef50_UPI00016C4EC3 hypothetical protein GobsU_32169 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4EC3 Length = 279 Score = 76.2 bits (186), Expect = 9e-13, Method: Composition-based stats. Identities = 29/208 (13%), Positives = 65/208 (31%), Gaps = 32/208 (15%) Query: 45 VQAYTVNPQTERVKMYWQKANGE-AWGTLHALLADINSQGQVQMAMNGGIYDESYAP--- 100 A ++ + + NG+ T + + ++Q+A+N + + Sbjct: 52 GHAVRIDLKAAGIGFLATPGNGDRPGETDGLKTSTFLKRHKLQLAINAAPFGPIHKDEEK 111 Query: 101 ----LGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAV 156 +G+ + G+ + ++ I F + I+ AV Sbjct: 112 EQDVVGVQVSGGKLVSPAQPGYP---------ALLLAKDNRARI-AAPPFDL-EGIENAV 160 Query: 157 QSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGN-AVFLLSQQAT-------NFYDFA 208 ++++ G + S R G++ G V L+ + Sbjct: 161 GGFHIVLKGGEV----LTGDKSIHPRTAAGVSADGKTLVLLVIDGRQKDFSDGATTAEVG 216 Query: 209 CYAKAKLNVEQLLYLDGTISHMYMKGGA 236 + KA L + + LDG + + GA Sbjct: 217 EWLKA-LGCAEGINLDGGGTTTLVVAGA 243 >UniRef50_A7C442 Putative uncharacterized protein n=1 Tax=Beggiatoa sp. PS RepID=A7C442_9GAMM Length = 299 Score = 75.9 bits (185), Expect = 1e-12, Method: Composition-based stats. Identities = 32/223 (14%), Positives = 73/223 (32%), Gaps = 29/223 (13%) Query: 35 DCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALL-ADINSQGQVQMAMNGGI 93 + + + + +V+ ++ G + A + ++ ++Q+A+NG Sbjct: 44 EVRQTPRPIIIHFISVDLTKPNIRFLVTPGEVRDDGEIGARTTSQFLTEFKLQLAINGNF 103 Query: 94 YDESYAPL-------GLYIENGQQKVALNLASGEGNFFIRPGGVF---YVAGDKVGIVRL 143 + + PL + G + LAS G + + F Y++ D Sbjct: 104 FYP-FHPLFSVDFWNAYPKKRGDPVYVVGLASSHGQVYSQTKKSFETLYISADNQA---- 158 Query: 144 DAFKTSKEIQF-AVQSGPMLMENGVINPRIHPN--VASSKIRNGVGINKHGNAVFL-LSQ 199 F+TS + A+ + ++ G I R + ++K + + + Sbjct: 159 -RFQTSIGPLYHAISGRELFIKQGKIQGPFPKGAFNEKPYPRTALALDKTAKTLMIFVVD 217 Query: 200 Q-------ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGG 235 + A ++ + L LDG S + G Sbjct: 218 GKLKNYSEGVTLMELADIVQS-YGADMALNLDGGGSSTLVMEG 259 >UniRef50_Q8YKH7 All7320 protein n=2 Tax=Cyanobacteria RepID=Q8YKH7_ANASP Length = 314 Score = 75.5 bits (184), Expect = 2e-12, Method: Composition-based stats. Identities = 35/228 (15%), Positives = 66/228 (28%), Gaps = 30/228 (13%) Query: 36 CALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTL-------------HALLADINSQ 82 L + T++ T +K + + ++ ++ Sbjct: 55 IESKPRPLIIHIVTIDLNTPGIKPFITPDIENLSKNVGVGKQAIIDNETKARTTSEFVAE 114 Query: 83 GQVQMAMNGGIYDE--SYAPLGLYIENGQQKVALNLASGEGNFFIRP---GGVFYVAGDK 137 QV++A+NG + P Y +G L G + V + Sbjct: 115 FQVKLAINGSYFYPFKEVTPWHYYPHSGDTTKVLGQTISNGKIYANKKSSWYVLCFDNNN 174 Query: 138 VGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKI--RNGVGINKHGN-AV 194 + + K + +L+ G I+ N ++ K R I+K G Sbjct: 175 QAQIPGGE-ECPKNTIQGLAGDDVLVFQGKPKINIYANSSADKPYSRVVAAIDKTGKKLW 233 Query: 195 FLLSQQATNFY-------DFACYAKAKLNVEQLLYLDGTISHMYMKGG 235 +L Y + + AKL V + LDG S + Sbjct: 234 LVLVDGKQPLYSEGFTKRELTQFI-AKLGVYNAINLDGGGSTTLVVAN 280 >UniRef50_D1R528 Putative uncharacterized protein n=1 Tax=Parachlamydia acanthamoebae str. Hall's coccus RepID=D1R528_9CHLA Length = 380 Score = 74.7 bits (182), Expect = 3e-12, Method: Composition-based stats. Identities = 23/108 (21%), Positives = 36/108 (33%), Gaps = 17/108 (15%) Query: 151 EIQFAVQSGPMLMENGVINPRIHPNVASSK------IRNGVGINKHGNAVFLLSQQ---- 200 +I V P+L+ G + S R VGI ++GN +F++ Sbjct: 255 DIVHIVGGTPILVRGGRLVTDFSAEQTGSHFLNVRLARTAVGILENGNWLFVVVDGFYKN 314 Query: 201 -----ATNFYDFACYAKAKLNVEQLLYLDGTI-SHMYMKGGAIPWQRY 242 D A + KL + L L G S M +K + Sbjct: 315 IWNTKGITIPDLAELMQ-KLGCVEALNLCGGKCSTMVLKNVVVNDPPD 361 >UniRef50_B0JGJ2 Putative uncharacterized protein n=2 Tax=Microcystis aeruginosa RepID=B0JGJ2_MICAN Length = 607 Score = 74.7 bits (182), Expect = 3e-12, Method: Composition-based stats. Identities = 24/163 (14%), Positives = 52/163 (31%), Gaps = 27/163 (16%) Query: 101 LGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKV------------GIVRLDAFKT 148 GL ++ + LN + + I G + + F Sbjct: 429 TGLVVQGDRVTEKLNNLFPQDSIKIPENGYLVICRKTDISLNIGERVNLDSVTLPGDFAN 488 Query: 149 SKEIQFAVQSGPMLMENGVIN-----PRIHPNVASSKI-RNGVGINKHGNAVFLLSQQAT 202 +I + +GP+L++NG I + P + + R+ + +++ G + + Sbjct: 489 YPQI---LGAGPLLLQNGRIVLDGNAEKFSPAFQNQQASRSAIAVSREGKILLVAIHNRV 545 Query: 203 -----NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQ 240 + A + + L LDG S G + + Sbjct: 546 GGRGATLGELARILLL-MAAKDGLNLDGGSSTGIALAGYLLDR 587 Score = 51.2 bits (121), Expect = 3e-05, Method: Composition-based stats. Identities = 19/95 (20%), Positives = 35/95 (36%), Gaps = 2/95 (2%) Query: 27 PLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQ 86 P L V ++P+ ++ + AN + + L+ INS+ Sbjct: 275 PGLIWNQKYIQLDQDWFPVTWLEIDPRNPQITIKPITANSTSMRGTNPLI-TINSESNAV 333 Query: 87 MAMNGGIYDESYA-PLGLYIENGQQKVALNLASGE 120 +NGG ++ + PLG +G+ L G Sbjct: 334 AMINGGFFNRNNQLPLGAIRVDGKWLSGPILNRGA 368 >UniRef50_B1X2V5 Putative uncharacterized protein n=2 Tax=Cyanothece RepID=B1X2V5_CYAA5 Length = 309 Score = 74.3 bits (181), Expect = 4e-12, Method: Composition-based stats. Identities = 26/190 (13%), Positives = 59/190 (31%), Gaps = 32/190 (16%) Query: 74 ALLADINSQGQVQMAMNGGIYDE-SYAPLGLYIENGQQKV-ALNLASGEGNFFIRPGGVF 131 + D + + +NGG +D + I+ G+ N N + P Sbjct: 89 KTVEDFAQETEAIAVLNGGFFDPVNSQTTSYVIKEGEAIADPSNNPRLMDNPQLEPYLKQ 148 Query: 132 YVAGDK------------VGIVRLDAFKTSKEIQFAVQSGPMLM-------------ENG 166 + + + + ++ ++ GP L+ NG Sbjct: 149 ILNRSEFRRYQCNELTRYAITYHQEPVPENCQLTESIGGGPQLLPNLSAEEEAFFESVNG 208 Query: 167 VINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACY----AKAKLNVEQLLY 222 + R + + R + I G+ ++++ +Q + + L+V + Sbjct: 209 QV-TRDPLGLERANARTAIAITSSGDVLWIMVEQTSPSTGLSLLKLREFLESLDVTSAMN 267 Query: 223 LDGTISHMYM 232 LDG S + Sbjct: 268 LDGGSSSSFF 277 >UniRef50_B9XE16 Putative uncharacterized protein n=1 Tax=bacterium Ellin514 RepID=B9XE16_9BACT Length = 398 Score = 74.3 bits (181), Expect = 4e-12, Method: Composition-based stats. Identities = 18/144 (12%), Positives = 45/144 (31%), Gaps = 24/144 (16%) Query: 109 QQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVI 168 + ++++ I+PG + + + + + AV P+L+ +G Sbjct: 232 KMIISIDPKLASRFAGIQPGTILHFSTGTSRDIA--------KADTAVGGRPLLLVHGKE 283 Query: 169 NPRIHPNVAS-----SKIRNGVGINKHGNAVF-LLSQQA-------TNFYDFACYAKAKL 215 + R +G + F ++ + + A + + L Sbjct: 284 LETSKQKGNNAATIVRHPRTALG--WNARYFFLVVVDGRQKELSMGMSSQELAHFM-STL 340 Query: 216 NVEQLLYLDGTISHMYMKGGAIPW 239 + + LDG S + G + Sbjct: 341 GCTEAMNLDGGGSTTFWLDGKVVN 364 Score = 40.4 bits (93), Expect = 0.051, Method: Composition-based stats. Identities = 30/166 (18%), Positives = 62/166 (37%), Gaps = 16/166 (9%) Query: 11 MITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWG 70 +++++ + + + + ++ ++ + + + + A G+ G Sbjct: 19 LVSIHARAELTPIFSSLVPGLDYAHITETNHPWSIHVARLERSHKELDLVSTLAQGKIVG 78 Query: 71 ---TLHALLADINSQGQVQMAMNGGIYDE-----SYAPLGLYIENGQQKVALNLAS---- 118 + + G+ +A+NG + PLGL I NG+ A N AS Sbjct: 79 LSSVANQVKTFPAGSGKPLVAVNGDFFVIAKGPYQGDPLGLQILNGELVSAPNGASFWKD 138 Query: 119 GEGNFFIRPG----GVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGP 160 EGN F+ + G+K+ + +TSK + F GP Sbjct: 139 AEGNLFLDNVQSKFSILLPKGEKIPFGLNEQRQTSKAVLFTPAFGP 184 >UniRef50_D1B6I7 N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase-like exopolysaccharide biosynthesis protein n=1 Tax=Thermanaerovibrio acidaminovorans DSM 6589 RepID=D1B6I7_THEAS Length = 486 Score = 73.9 bits (180), Expect = 5e-12, Method: Composition-based stats. Identities = 39/172 (22%), Positives = 61/172 (35%), Gaps = 28/172 (16%) Query: 93 IYDESYAP-----LGLYIENGQQKVALNLASGEGNFFIRPGGVFYVA------GDKVGIV 141 Y +Y P L L +++G +F + G A GD + +V Sbjct: 306 FYGGAYRPGNQALLSLSVKDGIV----QDEPQGADFTLLANGRAAEALGSLNIGDTLQLV 361 Query: 142 RLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVAS----SKIRNGVGINKHGNAVFLL 197 R AF + + +Q GPM++EN R S R VGI++ G VF++ Sbjct: 362 RRFAFPAFEACRLVIQGGPMIVENRRYVNRSEGLSRSIRERRHPRTLVGIDEQG-LVFMV 420 Query: 198 SQQA------TNFYDFACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRY 242 + A A + + L LDG S M +G + Sbjct: 421 IDGRNGHSSGVTLEEAANLALEE-GLVAALNLDGGGSSQMIWRGVTVNIPSD 471 >UniRef50_A5D3R0 Hypothetical membrane protein n=1 Tax=Pelotomaculum thermopropionicum SI RepID=A5D3R0_PELTS Length = 485 Score = 73.9 bits (180), Expect = 5e-12, Method: Composition-based stats. Identities = 36/178 (20%), Positives = 58/178 (32%), Gaps = 26/178 (14%) Query: 94 YDESYAPLG---LYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLD------ 144 Y P G + + NG + G I G G+ Sbjct: 315 YKYDTTPPGRTAVVVRNG-----IVTGIRSGQVEIPEDGYVIWYGENNYERDDQFSAGRQ 369 Query: 145 -----AFKTSKEIQF--AVQSGPMLMENGVIN--PRIHPNVASSKIRNGVGINKHGNAVF 195 FK +++ +F + + P+L+ NG I P + R+ VG+ V Sbjct: 370 VDYRVTFKENQQARFKATISNYPLLLSNGAIALGDITEPKLTIGAPRSFVGVTWDNILVM 429 Query: 196 LLSQQATNFYDFACYAKAKLNVEQLLYLDGTIS-HMYMKGGAIPWQRYPFVTMISVER 252 A N ++ A K L ++ L LDG S +Y G I + V + Sbjct: 430 GTVDSA-NVWELAEVTKN-LGLKDALNLDGGASCGLYYDGAYIRQPGRLLSNCLVVIQ 485 >UniRef50_UPI0001923977 PREDICTED: similar to predicted protein, partial n=1 Tax=Hydra magnipapillata RepID=UPI0001923977 Length = 290 Score = 73.2 bits (178), Expect = 8e-12, Method: Composition-based stats. Identities = 34/186 (18%), Positives = 54/186 (29%), Gaps = 32/186 (17%) Query: 80 NSQGQVQMAMNGGIYDE------SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYV 133 Q ++A+N G ++ G I NG N NF IR G + Sbjct: 106 AKQQNCRIAVNAGFFNPFETDKDYGKCYGNIISNGNLVQD-NGGIQNANFGIRSDGTLVI 164 Query: 134 AGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENG---------------VINPRIHPNVAS 178 V K + +Q G +L NG + Sbjct: 165 GYLPEKEVID---KKNPFLQLLSGVGWIL-RNGSSYLKESEKAECKESETTGTLDKFFNV 220 Query: 179 SKIRNGVGINKHGNAVFLLSQQ-----ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMK 233 R +G + G+ + N Y+ Y K K+ + + DG S Y++ Sbjct: 221 KSARTMIGYDAKGHVHIVQFDGKTGKSGINLYEAVEYLK-KIGLINAINFDGGGSATYVQ 279 Query: 234 GGAIPW 239 I Sbjct: 280 DSIILN 285 >UniRef50_A4XD34 Putative uncharacterized protein n=1 Tax=Salinispora tropica CNB-440 RepID=A4XD34_SALTO Length = 430 Score = 73.2 bits (178), Expect = 8e-12, Method: Composition-based stats. Identities = 22/95 (23%), Positives = 32/95 (33%), Gaps = 8/95 (8%) Query: 153 QFAVQSGPMLMENGVIN-PRIHPNVASSKIRNGVGINKHGNAVFLLSQQA------TNFY 205 FAV L+++G I P + R G G V + T Sbjct: 314 TFAVNGRYRLVKDGQIVAPSGSDSFFDRHPRTIAGTTLDGKIVLVTIDGRQTTSVGTTMT 373 Query: 206 DFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQ 240 + A A A L + + LDG S G++ Q Sbjct: 374 ETASVA-AALGMHDAVNLDGGGSTTMSVEGSLVNQ 407 >UniRef50_UPI0001BC7E39 hypothetical protein BacD2_08600 n=1 Tax=Bacteroides sp. D2 RepID=UPI0001BC7E39 Length = 660 Score = 73.2 bits (178), Expect = 9e-12, Method: Composition-based stats. Identities = 22/218 (10%), Positives = 53/218 (24%), Gaps = 40/218 (18%) Query: 42 TLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPL 101 V ++ + ++ + A L+ + + +NG E Sbjct: 75 RQQVNVLEIDLSSPDYELEFVSAPQL------DSLSSVALKHDAVAGINGTYELE----A 124 Query: 102 GLYIENGQQKVALNLASGEGNFFIRPGGVFYVA---GDKVGIVRLDAFKTSKEIQFAVQS 158 NG + L G ++ G + Y G ++ + I Sbjct: 125 SFVKVNGSIISPITLPEGHLRYWKHEGAIAYDGYKVEIGYGTKESYSYNSMPNI---FSG 181 Query: 159 GPMLMENGVIN-PRIHPNVAS-----------------SKIRNGVGINKHGNAVFLLSQQ 200 P+L+++ ++ R V + + + + Sbjct: 182 APVLIDDYQPVGKTFIGDITGINLNSLDGEDYRRHQGVRHPRTAVALTEQNKLLLVTVDG 241 Query: 201 A------TNFYDFACYAKAKLNVEQLLYLDGTISHMYM 232 + + + L +DG S Sbjct: 242 RADLAAGMTAKELTSFINQYFKPQHALNVDGGGSTTMY 279 >UniRef50_B1HN11 Putative uncharacterized protein n=2 Tax=Bacillaceae RepID=B1HN11_LYSSC Length = 815 Score = 72.8 bits (177), Expect = 1e-11, Method: Composition-based stats. Identities = 21/103 (20%), Positives = 37/103 (35%), Gaps = 15/103 (14%) Query: 151 EIQFAVQSGPMLMENGVINPRIHPNVA---SSKIRNGVGINKHG-NAVFLLSQQA----- 201 + QF + +GPML+ NG ++ + N + R V ++ G + Sbjct: 266 DAQFILAAGPMLVRNGQVDISMPTNSGFASTRSPRTAVAVDATGTKVSLITIDGRLSGHS 325 Query: 202 --TNFYDFACYAKAKLNVEQLLYLDGTISHMYM---KGGAIPW 239 N D A + + + + LDG S + GG Sbjct: 326 NGVNLSDLASHLIS-IGATSAINLDGGGSTAMVARNPGGYFAN 367 >UniRef50_B5W3X9 Putative uncharacterized protein n=3 Tax=Arthrospira RepID=B5W3X9_SPIMA Length = 812 Score = 72.4 bits (176), Expect = 1e-11, Method: Composition-based stats. Identities = 18/115 (15%), Positives = 35/115 (30%), Gaps = 23/115 (20%) Query: 156 VQSGPMLMENGVIN-----PRIHPN-VASSKIRNGVGINKHG-----------NAVFLLS 198 + +GP+L+ + IR+ VG+ + + + ++ Sbjct: 689 LGAGPLLLRGNQVVLDARAENFSDAFNTQRAIRSAVGLKTNTPGRSGSDSPAVSLLLVVV 748 Query: 199 QQAT-----NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMI 248 + + A K +L L LDG S GG + + I Sbjct: 749 HPRLGGPGPSLAELAELMK-QLGATDALNLDGGSSTGLYLGGYLLDRPPQTAAPI 802 Score = 42.7 bits (99), Expect = 0.012, Method: Composition-based stats. Identities = 13/99 (13%), Positives = 33/99 (33%), Gaps = 4/99 (4%) Query: 37 ALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE 96 V ++ + + + + + + L+ + S + A+NGG ++ Sbjct: 458 GRESARFPVVWLEIDLNNQGISLQPILSRPGSRSGVSPLV-HVASSTRAAAAINGGFFNR 516 Query: 97 SYA-PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVA 134 + PLG + L G + + F++ Sbjct: 517 NNQYPLGAIRHQNRWLSGPILNRGAIAWTDQNQ--FFID 553 >UniRef50_UPI0001C30FBA N-acetylglucosamine-1-phosphodiester alpha-N- acetylglucosaminidase-like exopolysaccharide biosynthesis protein n=2 Tax=Conexibacter woesei DSM 14684 RepID=UPI0001C30FBA Length = 249 Score = 72.4 bits (176), Expect = 1e-11, Method: Composition-based stats. Identities = 26/196 (13%), Positives = 55/196 (28%), Gaps = 33/196 (16%) Query: 82 QGQVQMAMNGGIYDES-YAPLGLYIENGQ-QKVALNLASGEGNFFIRPGGVFYVAGDKVG 139 + A+ G + + PLG G A G V D Sbjct: 51 ENDRPEAIVAGFFVRDPHLPLGEVRVGGVPVVHEPVAAPWAGRRA-------CVHVDGEI 103 Query: 140 IVRLDAFKTSKEIQFAVQSGPMLMENGVIN--------------PRIHPNVAS-SKIRNG 184 + VQ+GP+L+ +G + ++ + R Sbjct: 104 RIAPREELADVGGGDLVQAGPLLVRDGTAAIVDGEDREGFSAGASQFDSDITAERHPRCA 163 Query: 185 VGINKHGNAVFLLSQQATN-------FYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAI 237 +G+++ + + + + A + + + LDG S + G + Sbjct: 164 LGVSED-ELLAVCCDGRRSGVDAGLDLAELARLMVS-FGAREAINLDGGGSATLVHRGHL 221 Query: 238 PWQRYPFVTMISVERK 253 + Y + E + Sbjct: 222 LNRPYADRDQPAPESR 237 >UniRef50_D2ASL7 Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase-like protein n=2 Tax=Actinomycetales RepID=D2ASL7_STRRD Length = 1138 Score = 72.0 bits (175), Expect = 2e-11, Method: Composition-based stats. Identities = 32/194 (16%), Positives = 54/194 (27%), Gaps = 27/194 (13%) Query: 62 QKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYA-----PLGLYIENGQQKVALNL 116 G + TL I G G Y A + + G + Sbjct: 183 ATPAGGSPITLTQFNQLIQGNGVGLFTPLWGSYGRGRAVEGAAAVTEVVLEGGVVTEVRT 242 Query: 117 ASGEGNFFI-------RPGGVFYVAGDKVGIVRLDAFKTSK----EIQFAVQSGPMLMEN 165 ++G G R G +A K G ++ ++ AV +L+++ Sbjct: 243 SAGSGPIPAGTAILLGRDAGASALAALKPGDRVEVRYQPKPSEGGAVKAAVGGSQILVKD 302 Query: 166 GVINPRIHPNVASSKIRNGVGINKHGN-AVFLLSQQA------TNFYDFACYAKAKLNVE 218 G + + + R VG + G L + A+L Sbjct: 303 G--VAQTSADNT-AHPRTAVGFSADGRKMYLLTVDGRQTDSRGVTLTELGA-MMAELGAH 358 Query: 219 QLLYLDGTISHMYM 232 L LDG S + Sbjct: 359 DALNLDGGGSSTML 372 >UniRef50_A9V9Y5 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9V9Y5_MONBE Length = 298 Score = 72.0 bits (175), Expect = 2e-11, Method: Composition-based stats. Identities = 33/189 (17%), Positives = 63/189 (33%), Gaps = 27/189 (14%) Query: 73 HALLADINSQGQVQMAMNGGIYDESYAP---LGLYIENGQQKVALNLASGEGNFFIRPGG 129 H +++ + A N G + + P G I +G + + NF + G Sbjct: 95 HRTVSEQAKLLTCEYATNAGFF--DFTPPACEGNLITDGVSIQ--HPCPNQVNFGRKL-G 149 Query: 130 VFYVA---GDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINP----RIHPN---VASS 179 + GD++ I + + + G L+ +G P V+ Sbjct: 150 MTCPDSTQGDRIVIGYMQEADIADLTELITGRGW-LIRHGQAYTNQSREFTPTDSFVSEK 208 Query: 180 KIRNGVGINKHGNAVFLLSQQ------ATNFYDFACYAKAKLNVEQLLYLDGTISHMY-M 232 R +G+ K G + L+ + ++ A +L+V Q + LDG S Sbjct: 209 APRTALGLTKDGAILSLVVDGIEEELVGPDLHEMASLLL-ELDVVQAINLDGGGSSTAVY 267 Query: 233 KGGAIPWQR 241 +G Sbjct: 268 QGHVFNMPH 276 >UniRef50_D2AUR4 Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase-like protein n=1 Tax=Streptosporangium roseum DSM 43021 RepID=D2AUR4_STRRD Length = 487 Score = 72.0 bits (175), Expect = 2e-11, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 50/183 (27%), Gaps = 17/183 (9%) Query: 73 HALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQ-QKVALNLASGEGNFFIRPGGVF 131 + G ++ ++ G + G + + + V Sbjct: 290 EEFGTKTAADGGAEIVVDAQGRIVKARAAGGVVPRGTYVLHGTGIMATWLLEHAQETSVM 349 Query: 132 YVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVAS-------SKIRNG 184 + KV +R + + G L+ NG + + + R Sbjct: 350 KL-DTKVIDLRTERAVPLTPETHIMGGGVGLLRNGRVRISAKADGHASVVMMLRRHPRTM 408 Query: 185 VGINKHGNAVFLLSQQA-------TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAI 237 VG+ K G + + + A + L +Q + DG S + G + Sbjct: 409 VGVTKSGGLILATVDGRNPGVTVGASMVEAAQLMRW-LGAKQAINFDGGGSTAMVVGHKV 467 Query: 238 PWQ 240 + Sbjct: 468 INR 470 >UniRef50_C0Z816 Putative uncharacterized protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0Z816_BREBN Length = 1054 Score = 71.6 bits (174), Expect = 2e-11, Method: Composition-based stats. Identities = 23/135 (17%), Positives = 47/135 (34%), Gaps = 12/135 (8%) Query: 129 GVFYVAGDKVGIVRLDAFKTSK---EIQFAVQSGPMLMENGVINPRIHPN--VASSKIRN 183 G F VG ++T+ ++ AV +L++ G + + S R Sbjct: 254 GAFLKQNFPVGATAAVEYQTTPQTLNLKQAVGGNVILVDQGKALTSFQADKSITSKTART 313 Query: 184 GVGINKHGNAVFLLS---QQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQ 240 VG+++ G +++++ Q + A A+L + + DG S + Sbjct: 314 SVGVSQDGKTLYMVTIDASQGVYLDELAKIM-AELGSYRAVNFDGGGSTTMA-TRMLGET 371 Query: 241 RYPFV--TMISVERK 253 ER+ Sbjct: 372 HANLANKPSGGAERR 386 Score = 40.4 bits (93), Expect = 0.061, Method: Composition-based stats. Identities = 13/133 (9%), Positives = 40/133 (30%), Gaps = 7/133 (5%) Query: 28 LFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQM 87 + + ++ +T+ V+ V++ + + + Sbjct: 51 GTTLQKYTKSFANQVVTIMVTKVDLNNPYVEVKPVYGTKGKLTD-KQTVTQMARETGAIA 109 Query: 88 AMNGGIYDES--YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDA 145 A+N + + AP G+ +++ + ++ L S + + V G Sbjct: 110 AINADFFHMTKRGAPFGIVMKDDELISSMGLVSYWYALGLTGDKMAIVDKFGFG----GK 165 Query: 146 FKTSKEIQFAVQS 158 +++Q Sbjct: 166 VTAPNGATYSIQG 178 >UniRef50_B3QZA6 Putative uncharacterized protein n=1 Tax=Chloroherpeton thalassium ATCC 35110 RepID=B3QZA6_CHLT3 Length = 280 Score = 71.6 bits (174), Expect = 2e-11, Method: Composition-based stats. Identities = 33/199 (16%), Positives = 71/199 (35%), Gaps = 15/199 (7%) Query: 44 TVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGL 103 + +NP+ K+ + + ++ A Q + A+N G++ Sbjct: 59 QIYVIRINPEHYAFKLMCASEHAKTPLSVKAWC----KQHGLISAINAGMFQADMLSAVS 114 Query: 104 YIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIV--RLDAFKTSKEIQFAVQSGPM 161 ++N L+ F P K I+ + + K + + G Sbjct: 115 LMKNFAHINNPRLSKDNTIFAFNPTKK---DLPKAQIIDRTVQNYDALKSVYQSQFQGIR 171 Query: 162 LMENGVINP-RIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKA-KLNVEQ 219 ++ G N + P+ S +G + GN +F+ S+ +DF +++++ Sbjct: 172 MIAPGRKNVWQEQPDEWSIA---ALGSDGDGNILFIFSRSPYTVHDFINILLELPIDIQR 228 Query: 220 LLYLDGTI-SHMYMKGGAI 237 +YLDG + +Y I Sbjct: 229 AMYLDGGAVAQLYFSNKHI 247 >UniRef50_B5YE82 Putative uncharacterized protein n=2 Tax=Dictyoglomus RepID=B5YE82_DICT6 Length = 691 Score = 71.6 bits (174), Expect = 2e-11, Method: Composition-based stats. Identities = 25/150 (16%), Positives = 54/150 (36%), Gaps = 23/150 (15%) Query: 98 YAPLGLY--IENGQQKVALNLAS---GEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEI 152 G+ I G +K + + G ++ +F + + I + Sbjct: 188 GKTSGIVSNIYYGVKKTPIKENTCIISLGGTALKYLPLFSIGKEIEIITEC---NPPIPL 244 Query: 153 QFAVQSGPMLMENGVINPR------IHPNV-ASSKIRNGVGINKHGNAVFLLSQQA---- 201 + A+ GP+L++NG I N+ S R +GI K+ + F++ + Sbjct: 245 KEAIGGGPILLKNGDIVLGNTDELAFDNNIVNSRHPRTIIGI-KNNSIYFIVIEGRKENS 303 Query: 202 --TNFYDFACYAKAKLNVEQLLYLDGTISH 229 + + K ++ + + +DG S Sbjct: 304 AGVSLKEACEILK-EMGINDAINMDGGGSS 332 >UniRef50_A6TUG6 Copper amine oxidase domain protein n=1 Tax=Alkaliphilus metalliredigens QYMF RepID=A6TUG6_ALKMQ Length = 491 Score = 71.6 bits (174), Expect = 2e-11, Method: Composition-based stats. Identities = 26/112 (23%), Positives = 45/112 (40%), Gaps = 10/112 (8%) Query: 150 KEIQFAVQSGPMLMENGVINPR-------IHPNVASSKIRNGVGINKHGNAVFLLSQQAT 202 KE+ A+ +GP L++NGVI + + R+ +G+ K V Sbjct: 258 KEVTSAIGAGPTLIKNGVITANGLSEGFFEDEILTNRGQRSFIGVTKENKLVMGTVPS-V 316 Query: 203 NFYDFACYAKAKLNVEQLLYLDGT-ISHMYMKGGAIPWQRYPFVTMISVERK 253 + + A AK +L + Q + LDG S + K + I + +K Sbjct: 317 SVKELAEIAK-ELGLYQAINLDGGASSGLIYKDRMVHAPGRLLSNAIVITKK 367 >UniRef50_A0LEU6 Putative uncharacterized protein n=1 Tax=Syntrophobacter fumaroxidans MPOB RepID=A0LEU6_SYNFM Length = 300 Score = 71.2 bits (173), Expect = 3e-11, Method: Composition-based stats. Identities = 34/209 (16%), Positives = 72/209 (34%), Gaps = 14/209 (6%) Query: 43 LTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESY-APL 101 + ++P+ K+ N T + A+N G+Y E A + Sbjct: 76 YRITVVRIDPRYYAFKLINASENTREKMTAREWSRQF----NLIAAVNAGMYQEDGLASV 131 Query: 102 GLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKT-SKEIQFAVQSGP 160 G ++N L + P G V ++ F + ++ + VQS Sbjct: 132 GY-MKNFDHVNNPRLGRDKTVLAFNPSG-PDVPEVQIIDRECQDFNSLRQKYRTFVQSIR 189 Query: 161 MLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKA-KLNVEQ 219 M+ + R S+ +G ++ G + L + +DF L++++ Sbjct: 190 MISCDRKNVWRQQAGRWSTV---AIGTDETGKVLLLFCRSPITVHDFIEVLLTLPLSLQR 246 Query: 220 LLYLDGT-ISHMYMK-GGAIPWQRYPFVT 246 +YL+G + +Y+ G + + Sbjct: 247 AMYLEGGPQASLYLSTGKTTLERYGSWEP 275 >UniRef50_A4XGY7 Putative uncharacterized protein n=2 Tax=Clostridia RepID=A4XGY7_CALS8 Length = 877 Score = 71.2 bits (173), Expect = 3e-11, Method: Composition-based stats. Identities = 19/87 (21%), Positives = 40/87 (45%), Gaps = 9/87 (10%) Query: 150 KEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLL-SQQA------T 202 ++I+ A L+++G I P +A R+ +GI+K G ++L+ Sbjct: 266 EKIKAAASGNTFLLKDGKI-PSFTHEIAGRHPRSAIGIDKTGRYLYLVAVDGRNGKSIGL 324 Query: 203 NFYDFACYAKAKLNVEQLLYLDGTISH 229 + + A + ++ ++V + LDG S Sbjct: 325 SQGELASFLQS-IDVWTAINLDGGYST 350 >UniRef50_C5C0E0 Metallophosphoesterase n=1 Tax=Beutenbergia cavernae DSM 12333 RepID=C5C0E0_BEUC1 Length = 1327 Score = 70.8 bits (172), Expect = 4e-11, Method: Composition-based stats. Identities = 18/91 (19%), Positives = 33/91 (36%), Gaps = 10/91 (10%) Query: 151 EIQFAVQSGPM--LMENGVINPRIHPNVASSKIRNGVGINKHGNA-VFLLSQQA------ 201 ++ A+ P L+E+G I V R VG ++ G F++ Sbjct: 291 DVAVALGGAPEDWLLEDGEITSATGGYVDVRHPRTAVGFDETGTTAYFVVVDGRQSHSIG 350 Query: 202 TNFYDFACYAKAKLNVEQLLYLDGTISHMYM 232 + + A+L + + LDG S + Sbjct: 351 MTLPELGRFL-AQLGADDAINLDGGGSSEMV 380 >UniRef50_C1A670 Putative uncharacterized protein n=1 Tax=Gemmatimonas aurantiaca T-27 RepID=C1A670_GEMAT Length = 426 Score = 70.8 bits (172), Expect = 4e-11, Method: Composition-based stats. Identities = 19/156 (12%), Positives = 43/156 (27%), Gaps = 22/156 (14%) Query: 99 APLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQS 158 G +G V + R V +V + + + + + Sbjct: 255 RSGGAIPRDGALLVGTGDRAAGVAAMSRFDTV------RVHLNTWPRLTSQRAPKAVIGG 308 Query: 159 GPMLMENGVINPR--------IHPNVASSKIRNGVGINKHGNA-VFLLSQQA------TN 203 P+++++G I N + R + +++ G + Sbjct: 309 WPLVLQDGENVAARAATLEGTISRNAEARHPRTAIAVSRSGQTAWLVTVDGRATNSVGMT 368 Query: 204 FYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPW 239 + A + + L L DG S + G + Sbjct: 369 LVELAEFLR-TLGAWHALNFDGGGSTTMVIDGRVVN 403 >UniRef50_B8I064 Exopolysaccharide biosynthesis protein n=1 Tax=Clostridium cellulolyticum H10 RepID=B8I064_CLOCE Length = 383 Score = 70.8 bits (172), Expect = 4e-11, Method: Composition-based stats. Identities = 31/221 (14%), Positives = 67/221 (30%), Gaps = 31/221 (14%) Query: 47 AYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYA----PLG 102 ++ R + ++ E+ G + + + + + Sbjct: 148 VLILDKMGARFETFYSNIFLESKGNRVKINEMNRVGKNDDIILYIDKFGNTNRAEVKSTS 207 Query: 103 LYIENGQQKVALNLASG------------EGNFFIRPGGVFYVAGDKVGIVRLDAFKTSK 150 L ++N + + G+ P + GDKV I + Sbjct: 208 LIVDNNKIISIIESTKEVNIKKGMYVISFYGDKSSLPDKIGLKTGDKVNIRIEPYLGYNY 267 Query: 151 EIQFAVQSGPMLMENGV-INP---RIHPNVASSKIRNGVGINKHGNAVFLLSQQA----- 201 A + G ML++NG + P + + + R +GI +G V +++ Sbjct: 268 ---QAYECGSMLVKNGKSVVPERDKWAGTLGNRDPRTVIGIKTNGKIVLVVADGRQPGYS 324 Query: 202 --TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQ 240 + + K+ V LDG + + G I + Sbjct: 325 EGMTGKEMGEFL-VKIGVRDAAMLDGGATSQMIINGRIQNR 364 Score = 41.2 bits (95), Expect = 0.031, Method: Composition-based stats. Identities = 16/68 (23%), Positives = 29/68 (42%), Gaps = 2/68 (2%) Query: 44 TVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGL 103 + +P+ ERV+ + +G L+DI + A+NGG + + P G+ Sbjct: 74 EIYMLEFDPRDERVEFKPALSFDNIFG--FEKLSDICKRNGAYAAVNGGFFYQFGDPAGM 131 Query: 104 YIENGQQK 111 +GQ Sbjct: 132 VAIDGQML 139 >UniRef50_C7IFA0 Exopolysaccharide biosynthesis protein n=1 Tax=Clostridium papyrosolvens DSM 2782 RepID=C7IFA0_9CLOT Length = 385 Score = 70.1 bits (170), Expect = 7e-11, Method: Composition-based stats. Identities = 34/221 (15%), Positives = 68/221 (30%), Gaps = 31/221 (14%) Query: 47 AYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAP----LG 102 V+ + R + ++ E G + + + + + Sbjct: 150 VLIVDKKGARFETFYSNITLEHKGNKIKINDMNRIGKNNDIVLYNDKFGSTNRAEIKNTT 209 Query: 103 LYIENGQQ------------KVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSK 150 + ++N + +N+ S G P + AGDKV I Sbjct: 210 IIVDNNVITTLVESTKEVNIRKGMNVISFYGGKESIPEKMGLKAGDKVNIRMEPYLGYRY 269 Query: 151 EIQFAVQSGPMLMENGVIN----PRIHPNVASSKIRNGVGINKHGNAVFLLSQQA----- 201 A + G ML+++G + + + R +GI G + L++ Sbjct: 270 ---QAYECGSMLVKDGKTVVPERDKWAGTLGNRDPRTVIGIKTDGKIIMLVADGRQPGYS 326 Query: 202 --TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQ 240 + Y KL V + LDG S + G++ + Sbjct: 327 EGMTGKEMGEYL-VKLGVRDVAMLDGGASSQMIINGSLRNR 366 Score = 44.7 bits (104), Expect = 0.003, Method: Composition-based stats. Identities = 25/157 (15%), Positives = 52/157 (33%), Gaps = 20/157 (12%) Query: 44 TVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGL 103 + +P+ ERV+ + +G L+DI + + A+NGG + + P G+ Sbjct: 76 EIYMLEFDPRDERVEFKPALSFDNIFG--FEKLSDICKRNEAYAAINGGFFYQFGEPTGM 133 Query: 104 YIENGQQKVA--------LNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQF- 154 +GQ A + G G+K+ I ++ + +I Sbjct: 134 VAIDGQMLTASTGLSPVLIVDKKGARFETFYSNITLEHKGNKIKINDMNRIGKNNDIVLY 193 Query: 155 ---------AVQSGPMLMENGVINPRIHPNVASSKIR 182 A ++ + + + + IR Sbjct: 194 NDKFGSTNRAEIKNTTIIVDNNVITTLVESTKEVNIR 230 >UniRef50_A5D3T7 Hypothetical membrane protein n=1 Tax=Pelotomaculum thermopropionicum SI RepID=A5D3T7_PELTS Length = 887 Score = 70.1 bits (170), Expect = 8e-11, Method: Composition-based stats. Identities = 26/154 (16%), Positives = 50/154 (32%), Gaps = 29/154 (18%) Query: 104 YIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQF--------- 154 ++NG + L G I P G + L+ ++ + Sbjct: 219 VVKNGVVQQVLTDQPG---VPIPPDGYVLRGHGQAARFILENLPAGSKVSYTYSVMPQGD 275 Query: 155 ----AVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFL------------LS 198 AV +L+E G + N+A R G++K G ++L + Sbjct: 276 KLFAAVGGQALLVEEGRLPAYFTQNIAGKHARTAAGVSKDGKTLYLVAVEKQSASDGTVV 335 Query: 199 QQATNFYDFACYAKAKLNVEQLLYLDGTISHMYM 232 + + A + + + V + + LDG S Sbjct: 336 SRGMTQEELAEFLIS-IGVWRAVNLDGGGSTTLA 368 >UniRef50_C9N2Q2 Metallophosphoesterase n=2 Tax=Actinomycetales RepID=C9N2Q2_9ACTO Length = 1163 Score = 69.7 bits (169), Expect = 9e-11, Method: Composition-based stats. Identities = 23/158 (14%), Positives = 48/158 (30%), Gaps = 25/158 (15%) Query: 96 ESYAPLGLY-IENGQQKVALNLASGEGNFFIRPGGVFYVA-------------GDKVGIV 141 + P+ + +G+ ++ G G+ + V V GD V I Sbjct: 240 DDARPVAEVAVRDGEVVS---VSDGPGSGPVPEDTVVLVGREAGAGLLAALEPGDPVKIA 296 Query: 142 RLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLS--- 198 + AV +L+ +G ++ R VG ++ G + +++ Sbjct: 297 YRARTDGGAVPRTAVGGRELLVVDGAAQNHDGEGNNTAAPRTAVGFSEDGRTMQVVTVDG 356 Query: 199 ----QQATNFYDFACYAKAKLNVEQLLYLDGTISHMYM 232 + + + L LDG S + Sbjct: 357 RQTDSGGVTLTELGEMMR-RAGSYSALNLDGGGSSTLV 393 >UniRef50_UPI0001C31921 Collagen triple helix repeat protein n=2 Tax=Conexibacter woesei DSM 14684 RepID=UPI0001C31921 Length = 1426 Score = 69.3 bits (168), Expect = 1e-10, Method: Composition-based stats. Identities = 22/146 (15%), Positives = 49/146 (33%), Gaps = 22/146 (15%) Query: 104 YIENGQQKVALNLASG----EGNFFI--RPGGVFYVAGDKVGIVRLDAF----KTSKEIQ 153 + +G+ + G+F++ R + + G A+ ++++Q Sbjct: 211 LVTDGRVVAVSDGVGAGEIPAGSFYLVGRESAADAIRALRAGDEVRLAYGLSGDVAQQLQ 270 Query: 154 FAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFL-LSQQA------TNFYD 206 FA+ +L+ +G + + S R +G G + L ++ Sbjct: 271 FAIGGNEVLVRDGQVV----GSDQSVHPRTAIGFKDGGRTLLLFVADGRQTQVLGMTTQK 326 Query: 207 FACYAKAKLNVEQLLYLDGTISHMYM 232 A + E + LDG S + Sbjct: 327 VAQLLRDA-GAETAMNLDGGGSTTLV 351 >UniRef50_A7M0H0 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=A7M0H0_BACOV Length = 354 Score = 68.5 bits (166), Expect = 2e-10, Method: Composition-based stats. Identities = 35/228 (15%), Positives = 70/228 (30%), Gaps = 50/228 (21%) Query: 45 VQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLY 104 V ++ + + K+ + NG++ + +NGG E +Y Sbjct: 108 VNVLEIDLLSNKYKVEFTYNNGDSL-------STTAQVRGAIGGINGGYEQE-----AIY 155 Query: 105 IE-NGQQKVALNLASGEGNFFIRPGGVFYVAG---------DKVGIVRLDAFKTSKEIQF 154 I NG + L G + + G Y G + G +D +K ++ Sbjct: 156 IRINGTNISEVTLPEGHLR-YWKHDGALYSDGKSDIGIIYGGRNGKAAIDTYK-QHSAKY 213 Query: 155 AVQSGPMLMENGVINPRIHPNVAS------------------SKIRNGVGINKHGNAVFL 196 + S P L+++ + R V + + + + + Sbjct: 214 LLASAPTLIDDYNPLGETFVGNYTMEQLESFDYEDYRRHQGVRHPRTVVAVTEDKDLLLV 273 Query: 197 LSQQA-------TNFYDFACYAKAKLNVEQLLYLDGTIS-HMYMKGGA 236 + + + K N + L +DG S MY+KG Sbjct: 274 TIDGRWAGKAEGMSAKEVTLFLKKHFNPQYALNMDGGGSTTMYVKGKG 321 >UniRef50_Q01TI8 Putative uncharacterized protein n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q01TI8_SOLUE Length = 340 Score = 68.1 bits (165), Expect = 2e-10, Method: Composition-based stats. Identities = 37/274 (13%), Positives = 76/274 (27%), Gaps = 53/274 (19%) Query: 20 FLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADI 79 F+ +TL+ + T+ +N + + G T+ D Sbjct: 34 FVGVTLITRTETSP-------RAETMHIAEINLNAPGIGVKLTSP-GGTLETVRQTTLDY 85 Query: 80 NSQGQVQMAMNGGIY----DESYAP--LGLYIENGQQKVALNLASGEGNFFIRPGGVFYV 133 +Q Q+A+NG + + +GL NG + + Sbjct: 86 LNQEHAQLAINGEFFLPFPSSDFNSMLIGLAASNGNVYSSFEAPVQSYAIVTDAPALNID 145 Query: 134 AGDKVGIVRLDA-------FKTSKEIQFAVQSGPMLMENG---------VINPR--IHPN 175 + IV + + + + ++ NG +P + P Sbjct: 146 QSNHASIVHDNTSFVDGKHVLENVTLWNTIAGSAQIITNGVASIPTYLDATHPNGLLTPG 205 Query: 176 VASS-----------KIRNGVGINKHGNAVFLLS------QQATNFYDFACYAKAKLNVE 218 +S R +G+++ +FL + + + A ++ Sbjct: 206 GPASYSNSNSWYNLINARTVIGLSQDNQTLFLFTVDNAGGSRGMTLPEVANLLIGDYSIY 265 Query: 219 QLLYLDGTISHMYMKGGAIPWQRYPFVTMISVER 252 L LDG S A+ I+V Sbjct: 266 NALNLDGGGSTSM----AMQDPVTGMGRFINVSS 295 >UniRef50_A3DIP4 Exopolysaccharide biosynthesis protein n=3 Tax=Clostridium thermocellum RepID=A3DIP4_CLOTH Length = 382 Score = 68.1 bits (165), Expect = 3e-10, Method: Composition-based stats. Identities = 18/113 (15%), Positives = 33/113 (29%), Gaps = 16/113 (14%) Query: 154 FAVQSGPMLMENGVINPRIHPN----VASSKIRNGVGINKHGNAVFLLSQQATNFY---- 205 A + G L+ +G + + + + R +G+ G V + Y Sbjct: 266 QAYECGSWLVRDGQVVAVDRDDWVGLLTNRDPRTAIGVKHDGKVVLVTVDGRQPGYSVGL 325 Query: 206 ---DFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPW----QRYPFVTMISVE 251 + A Y L ++ LDG S + + I V Sbjct: 326 SSRELAGYLL-TLGIKDAAMLDGGASTQMIVQNKTVNRLPARERMLGGGIVVV 377 >UniRef50_A7HB86 Putative uncharacterized protein n=4 Tax=Anaeromyxobacter RepID=A7HB86_ANADF Length = 287 Score = 68.1 bits (165), Expect = 3e-10, Method: Composition-based stats. Identities = 38/243 (15%), Positives = 80/243 (32%), Gaps = 21/243 (8%) Query: 18 RIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLA 77 R + LF + ++P +K+ A GE GTL A Sbjct: 44 RTLEPGLEMGLFDGPPAGEEAR----PIAVVRIDPARFELKLLNASAPGE--GTLRTARA 97 Query: 78 DINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDK 137 G A+N +Y E Y + ++ P Sbjct: 98 WAERAG-ASAAINASMYQEDYRTSVSLMRTRHHVNQRRVSKDRSVLAFDP---LARGASP 153 Query: 138 VGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIR----NGVGINKHGNA 193 V I+ D +++ A Q+ L+++ + NV + R +G++ G Sbjct: 154 VRIIDRD----CDDLERAAQTYGTLVQSIRLVSCDRKNVWAPSARRFSAAAIGVDAKGRV 209 Query: 194 VFLLSQQATNFYDFA-CYAKAKLNVEQLLYLDGT-ISHMYMKGGAI-PWQRYPFVTMISV 250 +F+ ++ ++ + + Q +Y++G + ++++GG F + Sbjct: 210 LFIHARTPWPVHELVNALLALPIELRQAMYVEGGPEAQLFVRGGGRQHEWVGGFEHVPQA 269 Query: 251 ERK 253 E + Sbjct: 270 ENR 272 >UniRef50_C2FS46 Putative uncharacterized protein n=2 Tax=Sphingobacterium spiritivorum RepID=C2FS46_9SPHI Length = 341 Score = 68.1 bits (165), Expect = 3e-10, Method: Composition-based stats. Identities = 31/213 (14%), Positives = 61/213 (28%), Gaps = 24/213 (11%) Query: 42 TLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADIN-----SQGQVQMAMNGGIYDE 96 +++ ++ ++ E L S G++ +A+NG Sbjct: 102 PVSMHVLEIDLSKPKLAAQALGPFNEVLYATQILPEMAKYNESGSGGKMMVAINGDAVLT 161 Query: 97 SYA----PLGLYIENGQQKVALNLASGEGN---FFIRPGGVFYVAGDKVGIVRLDAFKTS 149 S P G YI G+Q + F + GV ++ +A + Sbjct: 162 SGTTVNAPSGSYIRYGRQIKTNTTTATAFTIPYFAVTKAGVPFIGNRPSATYPAEAVDLN 221 Query: 150 KEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQ-------AT 202 + L+ N + I A+ R +GIN + ++ Sbjct: 222 TIYHLVSGTNW-LVFNNNL---ITSTTATVSARTAIGINADKKVICVVVDGGDDAFSTGI 277 Query: 203 NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGG 235 D K L + + +G +K Sbjct: 278 TLNDLGIVMK-TLGSSRAFFTNGGNFSAMVKRK 309 >UniRef50_Q2JUI0 Conserved domain protein n=2 Tax=Synechococcus RepID=Q2JUI0_SYNJA Length = 411 Score = 67.8 bits (164), Expect = 4e-10, Method: Composition-based stats. Identities = 16/128 (12%), Positives = 38/128 (29%), Gaps = 16/128 (12%) Query: 133 VAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNV------ASSKIRNGVG 186 + GD++ + + + +GP+L+ +G + R+ + Sbjct: 258 IPGDRLRLDWTVDPLELEAYPHILGAGPLLLLDGQVVLDAELEGFQPLFRRQQAARSAIC 317 Query: 187 INK---HGNAVFLLSQQ------ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAI 237 + + + L++ + A + +L L LDG S + G Sbjct: 318 LRQGQPDNRDLLLVAAGNAQENQGLTLLEMAQLLR-QLGCRHALNLDGGRSSTLVLGEEA 376 Query: 238 PWQRYPFV 245 Sbjct: 377 VNLEPEIG 384 Score = 41.2 bits (95), Expect = 0.031, Method: Composition-based stats. Identities = 20/92 (21%), Positives = 33/92 (35%), Gaps = 4/92 (4%) Query: 38 LSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDES 97 L D + V V+ +++ W L L A +G A+NGG ++ + Sbjct: 63 LEDRRILVSVVAVSLAAGQLRPIWADPAS--LVGLGELPAFSRERG-AVAAINGGFFNRN 119 Query: 98 -YAPLGLYIENGQQKVALNLASGEGNFFIRPG 128 PLG G+ + L G +F Sbjct: 120 TRQPLGAIRLEGRWISSPILGRGAIAWFDAAN 151 >UniRef50_Q5ULM2 Orf92 n=1 Tax=Lactobacillus phage LP65 RepID=Q5ULM2_9CAUD Length = 556 Score = 67.4 bits (163), Expect = 4e-10, Method: Composition-based stats. Identities = 33/202 (16%), Positives = 60/202 (29%), Gaps = 21/202 (10%) Query: 60 YWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESY-APLGLYIENGQQKVALNLAS 118 ++ + A+N G+++ S P+G I NG + + S Sbjct: 339 LALTSSDGSLSGTKRPTLRYAKDNDTIFAVNAGLFNVSTVEPVGQLIINGISLINTPMTS 398 Query: 119 GEGNFFIRPGGV--FYVAGDKVGIVRLDAFKTSK----EIQFAVQSGPMLMENGVINPRI 172 G I P + + T+ +++AV + L++N I Sbjct: 399 DNG-VTINPNECYPLAIDANGDLTTYPRNADTADMIAAGVKYAVTAWGKLVDNFEIATTD 457 Query: 173 HPN---VASSKIRNGVGINKHGNAVFLLSQ---QATN------FYDFACYAKAKLNVEQL 220 N IR +G ++G + + + A K V+ Sbjct: 458 IENEIVHNGRYIRQSIGQYQNGYYCVCTVDMTRGSVTNEAGLYYKELAQIFVDK-GVKFA 516 Query: 221 LYLDGTISHMYMKGGAIPWQRY 242 LDG S + G Y Sbjct: 517 FSLDGGGSAETVLGKRQLNPIY 538 >UniRef50_A4FAG7 Secreted protein n=5 Tax=Actinomycetales RepID=A4FAG7_SACEN Length = 434 Score = 67.4 bits (163), Expect = 4e-10, Method: Composition-based stats. Identities = 25/147 (17%), Positives = 48/147 (32%), Gaps = 21/147 (14%) Query: 103 LYIENGQQKV----ALNLASGEGNFFI--RPGGVFYVAGDKVGIVRLDAFKTSK----EI 152 + + +G+ A G+F + R GV + K G ++ + E Sbjct: 257 IVVRDGKVAEVRPEPGAGAIAAGDFVLVGREDGVGELDDLKPGDPVSVDYQLAPVGVPEF 316 Query: 153 QFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHG-NAVFLLSQQA------TNFY 205 +F V P+L +G P + + + R G + G + Sbjct: 317 RFVVGGFPIL-RDGTALPGL--DDQALAPRTSAGASADGKRVYLVAMDGRSQVSAGLTVS 373 Query: 206 DFACYAKAKLNVEQLLYLDGTISHMYM 232 + A K + + + LDG S + Sbjct: 374 ELADLLK-RSGADDAVNLDGGGSTTLV 399 >UniRef50_A1VEZ3 Putative uncharacterized protein n=4 Tax=Desulfovibrio vulgaris RepID=A1VEZ3_DESVV Length = 311 Score = 67.4 bits (163), Expect = 4e-10, Method: Composition-based stats. Identities = 28/182 (15%), Positives = 53/182 (29%), Gaps = 9/182 (4%) Query: 47 AYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDES-YAPLGLYI 105 A ++P ++ G +L A + + A+N +Y G Sbjct: 74 ALRIDPNLWDFSLHTATGEGGYPLSLGAWAEKL----NLGAAINSSMYLPDVRTSTGFLK 129 Query: 106 ENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMEN 165 F P D + + VQ+ ++ N Sbjct: 130 AGEHVNNPRVTTKFGSFFVAAPDDPTLPQADLLDRAIDPWAERLPHYNMVVQNYRLISTN 189 Query: 166 GVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKA-KLNVEQLLYLD 224 I I VG + G +FL ++ + FA A L++ ++Y++ Sbjct: 190 RRI--LWPQGGPEYSI-AAVGQDGSGAILFLHCREPMTAHAFASMLLALPLDIHDVMYVE 246 Query: 225 GT 226 G Sbjct: 247 GG 248 >UniRef50_B4VYL6 Tat pathway signal sequence domain protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VYL6_9CYAN Length = 299 Score = 67.0 bits (162), Expect = 5e-10, Method: Composition-based stats. Identities = 30/248 (12%), Positives = 68/248 (27%), Gaps = 29/248 (11%) Query: 24 TLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGT------LHALLA 77 + P+ A + + + + ++ + AN G + Sbjct: 32 LVSPVAAESVSFRRSTILGVPLYQTHIDLTNPDTFIAIGLANNSTLGNHQGAIGEESFGN 91 Query: 78 DINSQGQVQMAMNGGIYDES--YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAG 135 + +A +G + + +G + G + +R Sbjct: 92 MVRRYHAAVVA-SGTFFSKKDPKRLMGNMVSAGTFLKYSPWENYGTTLGLRV-------- 142 Query: 136 DKVGIVRLDAFKTSKEI---QFAVQSGPMLMENGVI-----NPRI-HPNVASSKIRNGVG 186 + + F++ GP L+ G + + P V R +G Sbjct: 143 GNQPELVTARVDGKPDWGQHWFSLTGGPRLLRKGKVWLAPRSEGFTDPRVMGVAHRCAIG 202 Query: 187 INKHGN-AVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRYPF 244 G V + + A +A + + + +DG S +Y +G + + Sbjct: 203 FPASGKKLVLVTFLAPLPLWREAKVMRA-IGCSEAMNIDGGSSSALYHRGRILVNPKRML 261 Query: 245 VTMISVER 252 I V Sbjct: 262 TNAIVVYD 269 >UniRef50_C7LNU2 Putative uncharacterized protein n=1 Tax=Desulfomicrobium baculatum DSM 4028 RepID=C7LNU2_DESBD Length = 276 Score = 67.0 bits (162), Expect = 6e-10, Method: Composition-based stats. Identities = 25/240 (10%), Positives = 70/240 (29%), Gaps = 24/240 (10%) Query: 9 KGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKAN--- 65 + + T + +A + + A L + + Q + + + + ++ Sbjct: 7 RAVFTCLVLCAPVASLHAEEWRLLAPGLELREFLIPDQVGDLEGRQSGMAVLRIDSDRFD 66 Query: 66 ---GEAWGTLHALLADINSQGQ-VQMAMNGGIYDES--YAPLGLYIENGQQKVALNLASG 119 G A GT ++ +N G++ G + + Sbjct: 67 VALGSALGTGRMRSMQEWARHSGFVAVINAGMFRADDRMRSTGYMRDAAVMINSF----- 121 Query: 120 EGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQF--AVQSGPMLMENGVINPRIHPNVA 177 G + L + + + +++N + R N+ Sbjct: 122 ---IHPNYGAFLAFQPRDPSLPALRWVDRKSDPDWQAVLADYDGIIQNYRLISRERENLW 178 Query: 178 SSKIR----NGVGINKHGNAVFLLSQQATNFYDFACYAKA-KLNVEQLLYLDGTISHMYM 232 R + +++ G +F+ + + ++FA L++ +Y++G Sbjct: 179 EPSDRRHSGAAIAMDREGRLLFIHCRARLSLHEFAQALIDLPLDLIGAMYVEGGADAAMY 238 >UniRef50_A9QSN5 Exopolysaccharide biosynthesis protein n=4 Tax=Lactococcus lactis RepID=A9QSN5_LACLK Length = 303 Score = 67.0 bits (162), Expect = 6e-10, Method: Composition-based stats. Identities = 30/184 (16%), Positives = 66/184 (35%), Gaps = 13/184 (7%) Query: 59 MYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDES-YAPLGLYIENGQQKVALNLA 117 + + ++ +++ + + MN ++ + G I NG+ N Sbjct: 117 LKTATSADSPVVSMSEVISKYPNS----LIMNASGFNMTTGKITGFQINNGKLFKDWNSD 172 Query: 118 SGEGN-FFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNV 176 N F G + D + K + + G +L+++G P Sbjct: 173 KRATNAFVFNKNG----SSDIYNSTTPASEILKKGAEMSFSFGSILIKDGKSLPS--DGT 226 Query: 177 ASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGA 236 + +I + +G +K N ++S +T + + KL++E + +DG S G Sbjct: 227 VNWEIHSFIGNDKDNNIYLIISDTSTGYQSIMEKFQ-KLHLENVQVMDGGGSSQMSLNGQ 285 Query: 237 IPWQ 240 I + Sbjct: 286 IIYP 289 >UniRef50_A4FD37 Secreted protein n=1 Tax=Saccharopolyspora erythraea NRRL 2338 RepID=A4FD37_SACEN Length = 519 Score = 66.6 bits (161), Expect = 7e-10, Method: Composition-based stats. Identities = 24/165 (14%), Positives = 44/165 (26%), Gaps = 24/165 (14%) Query: 102 GLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPM 161 G G L A+ R G +V A V GP Sbjct: 344 GPVPAGGTVVQGLGQAAEWLVAHARAGEPLWVDQQ--IREESGAPLRLGPSDDIVNGGPE 401 Query: 162 LMENGVINPRIHPNV-------------ASSKIRNGVGINKHGNAVFLLSQQATN----- 203 L+ +G + + + R+ +G++ G + ++ Sbjct: 402 LVRDGQVRINLQEDGIIHDAPSFAYTWGLKRNPRSVIGVDAQGRVILATTEGRMPGFSDG 461 Query: 204 --FYDFACYAKAKLNVEQLLYLDGTIS-HMYMKGGAIPWQRYPFV 245 + A + +A L + LDG S M + + Sbjct: 462 WGLPEAAEFVRA-LGAVDAMALDGGGSAGMVVDDRVVTTPSDATG 505 >UniRef50_Q9L2D5 Putative secreted protein n=2 Tax=Streptomyces RepID=Q9L2D5_STRCO Length = 428 Score = 66.6 bits (161), Expect = 8e-10, Method: Composition-based stats. Identities = 22/183 (12%), Positives = 45/183 (24%), Gaps = 14/183 (7%) Query: 63 KANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGN 122 A+ + + +++N P G G+ + Sbjct: 229 DADARFTEDDDPGAEAVVAADGTVLSLNPNGRGGVTVPTG-----GRVLQGTGTGADWLR 283 Query: 123 FFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIR 182 PG D + P L+ N + R Sbjct: 284 AHATPGTDLAFEERLHDERFGDDIPLDSSVDVVNGHYP-LVHNAQY--AYTGQNTAVDPR 340 Query: 183 NGVGINKHGNAVFLLSQQ-----ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAI 237 + + ++ G +F+ + +FA L L +DG S + A+ Sbjct: 341 SAIAVDGPGRTLFVTATGKSGRNGVTLDEFARILLD-LGAVDGLNMDGGGSTTLVVEQAV 399 Query: 238 PWQ 240 + Sbjct: 400 VNR 402 >UniRef50_UPI0001744904 hypothetical protein VspiD_09360 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001744904 Length = 258 Score = 66.6 bits (161), Expect = 8e-10, Method: Composition-based stats. Identities = 34/193 (17%), Positives = 61/193 (31%), Gaps = 24/193 (12%) Query: 46 QAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE-SYAPLGLY 104 Q T + +V++ ++ + E +H + + + NGG +D ++AP GL Sbjct: 50 QVVTFDASKVKVEVLARQ-DRETALPMHRWMTEA----RAIAGCNGGYFDPATFAPSGLQ 104 Query: 105 IENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSK--EIQFAVQSGPML 162 + G G G F V K I E + VQ P+L Sbjct: 105 VVEGLATGKYQQFGEWG-------GGFGVRSGKAQIWTEQEILAMPTFEAESFVQCSPVL 157 Query: 163 MENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAK------LN 216 + +G + R + + ++ + A + Sbjct: 158 V-DG-VRRFTGAGEDVRARRTFIAHDGGARWALGVTSG-IGLRELAELLVNQGAGLLGFK 214 Query: 217 VEQLLYLDGTISH 229 V + L LDG S Sbjct: 215 VSRALNLDGGPST 227 >UniRef50_B5RQG1 Uncharacterized conserved protein n=20 Tax=Borrelia RepID=B5RQG1_BORRA Length = 269 Score = 66.6 bits (161), Expect = 8e-10, Method: Composition-based stats. Identities = 27/200 (13%), Positives = 51/200 (25%), Gaps = 31/200 (15%) Query: 48 YTVNPQTERVKMYWQKANGEA----WGTLHALLADINSQGQVQMAMNGGIYDESYA---P 100 V + + +K K + + + +V +A+N Y P Sbjct: 46 VIVKIKNKDLKFIISKPIYDTKMNNYYFKGQTTSQFLISNKVDIAINTSPYTIKGTMFYP 105 Query: 101 LGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGP 160 G+YI N + G + K + Sbjct: 106 NGIYIYNKKLISHAKKDQGIIIIKNNQI------------ILNPKHNEIKNSDYGFGGFF 153 Query: 161 MLMENGVINPRIHPNVASSKIRNGVGINKHG-NAVFLLSQQA-------TNFYDFACYAK 212 L++NG N R +G +K + + + + + A Sbjct: 154 SLIKNGKYTKNFKEN---KHPRTIIGTDKENKHLYLITVEGRGTNNSKGISLNE-AIDLS 209 Query: 213 AKLNVEQLLYLDGTISHMYM 232 V + LDG S + Sbjct: 210 LSYGVTNSINLDGGGSSTLV 229 >UniRef50_A5ILT0 Putative uncharacterized protein n=6 Tax=Thermotogaceae RepID=A5ILT0_THEP1 Length = 553 Score = 66.2 bits (160), Expect = 9e-10, Method: Composition-based stats. Identities = 25/129 (19%), Positives = 44/129 (34%), Gaps = 20/129 (15%) Query: 143 LDAFKTSKEIQFAVQSGPMLMENGVINP-------RIHPNVA-SSKIRNGVGINKHGNAV 194 I+ AV+ GP+L++NG P R +A + R + K G Sbjct: 420 SLQPNIPLRIKQAVEGGPLLIQNGAPIPDAWEEKARYGGGIAYAKAPRTVIA-TKDGKLW 478 Query: 195 FLLSQQ------ATNFYDFACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRYPFVTM 247 FL+ + + + + ++ E + +DG S M + G + Sbjct: 479 FLVFEGYNHITRGLTYDELVDFLISR-GFEDAMCVDGGSSSVMAVAGSLFGRTENSTAAI 537 Query: 248 ---ISVERK 253 I V K Sbjct: 538 PVGIVVWEK 546 >UniRef50_A4CSS0 Putative uncharacterized protein n=1 Tax=Synechococcus sp. WH 7805 RepID=A4CSS0_SYNPV Length = 549 Score = 66.2 bits (160), Expect = 9e-10, Method: Composition-based stats. Identities = 24/175 (13%), Positives = 48/175 (27%), Gaps = 19/175 (10%) Query: 93 IYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEI 152 S L + I +G+ + + G VA + + + + + Sbjct: 368 YRSLSGEELAILIRDGRVTDQFSKTELARGVPLPEGASLVVARARAPLPAKPGDEVAIRL 427 Query: 153 ---------QFAVQSGPMLMENGVIN-----PRIHPN-VASSKIRNGVGINKHGNAVF-- 195 + + GP+L++ G + + + R VG + + Sbjct: 428 KVSSPVGERRQVMAGGPLLLKEGQVVLRGRQEGFSSGFLGQAAPRTVVGQDPKHRWMLTL 487 Query: 196 -LLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMIS 249 LS + A +L + L LDG S + I Sbjct: 488 EGLSGSDPTLLE-TTLALQQLGLSDALNLDGGSSTTMLIANRTVMTGRGVPPRIQ 541 >UniRef50_B6V2M3 Gp2.43 n=1 Tax=Bacillus phage SPO1 RepID=B6V2M3_BPSP1 Length = 437 Score = 66.2 bits (160), Expect = 9e-10, Method: Composition-based stats. Identities = 26/219 (11%), Positives = 57/219 (26%), Gaps = 18/219 (8%) Query: 39 SDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDES- 97 ++ +K+ N + ++ + N I++ + Sbjct: 30 KTDYFITHVPNLDKNGNLIKLRHGFQNDLINSGVGETARSFCNRHSASLVANASIWNTNN 89 Query: 98 YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQ 157 G+ I++G+ + + V + I Sbjct: 90 GLIRGVQIQDGKVIQDAKDTNSYTLGIKSDNTLVMYPPS----VTAEQVLADGCIDAITA 145 Query: 158 SGPMLMENGVINP----RIHPNVASSKIRNGVGINKHGNAVFLLSQQA------TNFYDF 207 PM +++G NV RN + + + +FL + + D Sbjct: 146 FYPM-IQDGAAFDLSGVTTVSNVTEHHPRNVIAQLPNKDLLFLTCEGRTKANQGMTYDDM 204 Query: 208 ACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRYPFV 245 A+ V LDG S ++G + Sbjct: 205 IRILLAR-GVTTAYCLDGGGSSQTVVRGHLVNNPLDNNG 242 >UniRef50_C6WLB3 Metallophosphoesterase n=1 Tax=Actinosynnema mirum DSM 43827 RepID=C6WLB3_ACTMD Length = 1118 Score = 66.2 bits (160), Expect = 1e-09, Method: Composition-based stats. Identities = 21/98 (21%), Positives = 32/98 (32%), Gaps = 13/98 (13%) Query: 143 LDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLS-QQA 201 A + A+ +L+ + + P S R VG + G +FLL+ Sbjct: 284 TRAGDGGSAPRAAIGGNQVLLRDSEVVAPDDP----SHPRTAVGFSADGRRMFLLTVDGR 339 Query: 202 -------TNFYDFACYAKAKLNVEQLLYLDGTISHMYM 232 N D A + L L LDG S + Sbjct: 340 QSAHLLGLNLKDVAEALRD-LGAHNALNLDGGGSSTLV 376 >UniRef50_B4WHW3 Putative uncharacterized protein n=1 Tax=Synechococcus sp. PCC 7335 RepID=B4WHW3_9SYNE Length = 335 Score = 66.2 bits (160), Expect = 1e-09, Method: Composition-based stats. Identities = 33/203 (16%), Positives = 56/203 (27%), Gaps = 48/203 (23%) Query: 74 ALLADINSQGQVQMAMNGGIYD-ESYAPLGLYIE----------NGQQKVALNLASGEGN 122 A + + +NGG +D + I N + NL Sbjct: 98 ATIEAFAERTNADYIINGGFFDPHNGKTTSHLISQEQTVSDPADNERLINNSNLGQYMAQ 157 Query: 123 FFIRPGGVFYVAGDKVGIVRLD-------------------AFKTSKEIQFAVQSGPMLM 163 R Y + R EI A+ +GP L+ Sbjct: 158 ILNRSEFRVYRCRQASVVERGGLEGSLTEEAVVYDITFHNAPPPDGCEIDTAIGAGPQLL 217 Query: 164 ------------ENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQ-----ATNFYD 206 + I R R+ +G+ G ++ ++ + Sbjct: 218 PADTSWVEGFIDYDDGILFRDAIGSRQPNARSAIGLYPDGAIALIMVEKSASSIGMTLLE 277 Query: 207 FACYAKAKLNVEQLLYLDGTISH 229 A +AK+ L + +LL LDG S Sbjct: 278 LADFAKS-LGITKLLNLDGGSSS 299 >UniRef50_D1VTW3 Copper amine oxidase N-domain superfamily n=1 Tax=Peptoniphilus lacrimalis 315-B RepID=D1VTW3_9FIRM Length = 765 Score = 66.2 bits (160), Expect = 1e-09, Method: Composition-based stats. Identities = 19/107 (17%), Positives = 42/107 (39%), Gaps = 12/107 (11%) Query: 135 GDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINP--RIHPNVASSKIRNGVGINKHGN 192 GDK+ I + S ++ + +L++N I P + ++ ++ R +GI G Sbjct: 246 GDKLKITYDIYPQKSWKML--IGGHSLLVDNSKIRPYKKDINSIGGTRARTCIGIADGGK 303 Query: 193 -AVFLLSQQA------TNFYDFACYAKAKLNVEQLLYLDGTISHMYM 232 + + + + + + L ++ L LDG S + Sbjct: 304 SVYIVSCEGRTKRSSGMSLNELSNFMVN-LGCQRALNLDGGGSTAMV 349 >UniRef50_A4FAL4 Putative uncharacterized protein n=2 Tax=Actinomycetales RepID=A4FAL4_SACEN Length = 1118 Score = 65.1 bits (157), Expect = 2e-09, Method: Composition-based stats. Identities = 15/88 (17%), Positives = 30/88 (34%), Gaps = 11/88 (12%) Query: 152 IQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHG-NAVFLLSQQA------TNF 204 + AV +L+ +GV+ + + R G + G + Sbjct: 301 PKAAVGGNKVLLRDGVVQ---QVDDTALHPRTAAGFSADGTRMWLVTIDGRQADSRGMTE 357 Query: 205 YDFACYAKAKLNVEQLLYLDGTISHMYM 232 + A + ++ L + L LDG S + Sbjct: 358 RELAEHLRS-LGADDALNLDGGGSSTLL 384 >UniRef50_C0AEZ6 Putative uncharacterized protein n=1 Tax=Opitutaceae bacterium TAV2 RepID=C0AEZ6_9BACT Length = 421 Score = 64.7 bits (156), Expect = 3e-09, Method: Composition-based stats. Identities = 20/104 (19%), Positives = 38/104 (36%), Gaps = 6/104 (5%) Query: 132 YVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHG 191 + + L A +++++ AV +L+ G + + R+ VG+ G Sbjct: 284 AILDINWRLTDLPAGVHTRDVRDAVSGNVILIAAGRLQEGGGAFWTTRHPRSAVGVAADG 343 Query: 192 -NAVFLLSQQATNFY---DFACY--AKAKLNVEQLLYLDGTISH 229 A+ +L + F D + A L + LDG S Sbjct: 344 RRALLVLVDGRSLFSAGMDLSALRDYLAHLGAHDAVNLDGGGSS 387 >UniRef50_A5GW09 Putative uncharacterized protein SynRCC307_2165 n=1 Tax=Synechococcus sp. RCC307 RepID=A5GW09_SYNR3 Length = 563 Score = 64.3 bits (155), Expect = 4e-09, Method: Composition-based stats. Identities = 28/123 (22%), Positives = 42/123 (34%), Gaps = 10/123 (8%) Query: 135 GDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVIN-----PRIHPNV-ASSKIRNGVGIN 188 GD V + R K E+ +Q GP+L+ G + R R+ VG + Sbjct: 430 GDGVSLERSMVPKAFAELPNLIQGGPLLLNQGKVVLNGKAERFSSAFMRQKAPRSVVGSD 489 Query: 189 KHGNAVFLLSQQAT---NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFV 245 + + Q + A + KL ++Q L LDG S M F Sbjct: 490 DELIWLLAVEGQGNAGPTLRETAELMQ-KLGLKQALNLDGGSSTRLMVRNRGQSSGRGFG 548 Query: 246 TMI 248 I Sbjct: 549 AAI 551 >UniRef50_Q7U4D6 Putative uncharacterized protein n=11 Tax=Cyanobacteria RepID=Q7U4D6_SYNPX Length = 589 Score = 64.3 bits (155), Expect = 4e-09, Method: Composition-based stats. Identities = 28/162 (17%), Positives = 48/162 (29%), Gaps = 19/162 (11%) Query: 103 LYIENGQQKVALNLASGEGNFFIRPGGVFYV---------AGDKVGIVRLDAFKTSKEIQ 153 L ++ G+ + AS I G V ++ + Sbjct: 418 LLVQGGRVTQRFDRASIRRGVLIPADGDLVVARGGTPLPAKPGDAVMLSQRTTSGLGDQA 477 Query: 154 FAVQSGPMLMENGVIN-----PRIHPNVAS-SKIRNGVGINKHGNAVFLL---SQQATNF 204 + GP+LM+ G I P+ + + R VG G + L + Sbjct: 478 NVLGGGPLLMQGGQIVLNGRAEGFSPDFLALAAPRTVVGQGTGGTWLLALRGAAGSDPTL 537 Query: 205 YDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVT 246 + A A +L ++ L LDG S + G Sbjct: 538 LETA-LAAQQLGLKDALNLDGGSSTTVVVAGRTVMNGRGSAP 578 >UniRef50_C8VW07 S-layer domain protein n=1 Tax=Desulfotomaculum acetoxidans DSM 771 RepID=C8VW07_DESAS Length = 921 Score = 64.3 bits (155), Expect = 4e-09, Method: Composition-based stats. Identities = 18/86 (20%), Positives = 35/86 (40%), Gaps = 8/86 (9%) Query: 151 EIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQ------ATNF 204 ++ A+ +L+++G + P + + R VGI ++L + + T Sbjct: 269 NLRAALGGNTLLVQDGQLAP-FTQEITGNYARTAVGIMPDNKTLYLAAAENGNGSVGTTQ 327 Query: 205 YDFACYAKAKLNVEQLLYLDGTISHM 230 A + A L V + + LDG S Sbjct: 328 TGMAEFLLA-LGVNRAVNLDGGGSTT 352 Score = 43.1 bits (100), Expect = 0.008, Method: Composition-based stats. Identities = 15/86 (17%), Positives = 28/86 (32%), Gaps = 2/86 (2%) Query: 44 TVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE-SYAPLG 102 + A V+ VK+ +L + G +NGG + ++ P+G Sbjct: 58 RIYAIKVDLSNPYVKIDTMIGADGTLNKAQSLTGMTSRTG-AVAGINGGFFQMKNHRPIG 116 Query: 103 LYIENGQQKVALNLASGEGNFFIRPG 128 L NG + + F + Sbjct: 117 LEFSNGNLVSSPAMREDMPGFAVTNN 142 >UniRef50_C5CET4 Putative uncharacterized protein n=1 Tax=Kosmotoga olearia TBF 19.5.1 RepID=C5CET4_KOSOT Length = 558 Score = 63.9 bits (154), Expect = 5e-09, Method: Composition-based stats. Identities = 22/103 (21%), Positives = 42/103 (40%), Gaps = 14/103 (13%) Query: 149 SKEIQFAVQSGPMLMENGVINPRIHPNVAS------SKIRNGVGINKHGNAVFLLSQQ-- 200 +++++FA++ GP+++ G + S R +GI K G +F++ Sbjct: 439 NEKLKFAIEGGPLIISRGKPVTEYEKSFYSSSLLDIRAPRTLIGITKSGTLMFMIIDGYQ 498 Query: 201 ----ATNFYDFACYAKAKLNVEQLLYLDGT-ISHMYMKGGAIP 238 F + + K N E L+ +DG S + KG Sbjct: 499 MKSYGLTFKEMVEFFTDK-NFEYLMCVDGGKSSALVFKGEVFS 540 Score = 43.1 bits (100), Expect = 0.010, Method: Composition-based stats. Identities = 20/101 (19%), Positives = 39/101 (38%), Gaps = 13/101 (12%) Query: 38 LSDPTLTVQAYTVNPQTERVKMYWQK---ANGEAWGTLHALLADINSQGQVQMAMNGGIY 94 +S + + A ++P+ + +GE+ ++ +NGG + Sbjct: 249 VSGRRIILTALELDPERFDIHPVLANGRIPSGESLLSMAKRYDAFA-------VINGGYF 301 Query: 95 DES-YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVA 134 D S + P+GL IE+G+ +L FF G + Sbjct: 302 DPSSFYPIGLLIEDGKLISLPSLERPL--FFQTEDGKMGIG 340 >UniRef50_C9M6C8 Putative uncharacterized protein n=1 Tax=Jonquetella anthropi E3_33 E1 RepID=C9M6C8_9BACT Length = 603 Score = 63.9 bits (154), Expect = 5e-09, Method: Composition-based stats. Identities = 22/103 (21%), Positives = 33/103 (32%), Gaps = 12/103 (11%) Query: 151 EIQFAVQSGPMLMENGVI---NPRIHPN-VASSKIRNGVGINKHGNAVFLLSQQATNFY- 205 A+Q GP+L+++G I N I + R VG + ++ Sbjct: 459 GTVGALQGGPLLLKDGKIQRMNEGIAVGVINRRHPRTLVGRIGKTVWWLAV-DGRAPWHS 517 Query: 206 -----DFACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRY 242 D A L LL LDG S + G + Sbjct: 518 SGLTLDEATTLGQYLGFTDLLNLDGGGSTELLYHGYPVNKPSD 560 >UniRef50_Q30YC1 Putative uncharacterized protein n=1 Tax=Desulfovibrio desulfuricans subsp. desulfuricans str. G20 RepID=Q30YC1_DESDG Length = 383 Score = 63.1 bits (152), Expect = 8e-09, Method: Composition-based stats. Identities = 32/245 (13%), Positives = 67/245 (27%), Gaps = 54/245 (22%) Query: 28 LFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQM 87 +A D + TV ++P+ +++Y G T + + Sbjct: 93 GLELAESSAVFRDTSGTVALLRIDPRHYSLQLYTISEQGGPPQTPSEW----AALYNLDA 148 Query: 88 AMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNF----FIRPGGVFYVAGDKVGIVRL 143 +N ++ + Y+ NG + G+F + P G + Sbjct: 149 VINASMFLPDGSTSTGYMRNGTAANNSRINQRFGSFLVFSPLPPHAAASDGQPPAGGTQP 208 Query: 144 DAFKTS--------------------------------------KEIQFAVQSGPMLMEN 165 D + + + VQ+ M+ + Sbjct: 209 DPYAPAAAHTTAARNTPNDAGSDNQQLPAADVLDRYADDWQTLLPRYRGVVQNFRMISAD 268 Query: 166 GVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKA---KLNVEQLLY 222 P S I +G + G +F+ S+ T + + Y L +Y Sbjct: 269 RK--PLWPEEGDSFSI-AAIGKDTQGRILFIHSRAQTTVRELSEYLLDICPSLGAT--MY 323 Query: 223 LDGTI 227 ++G Sbjct: 324 VEGGA 328 >UniRef50_B4WFN8 Putative uncharacterized protein n=1 Tax=Synechococcus sp. PCC 7335 RepID=B4WFN8_9SYNE Length = 309 Score = 63.1 bits (152), Expect = 8e-09, Method: Composition-based stats. Identities = 26/184 (14%), Positives = 51/184 (27%), Gaps = 18/184 (9%) Query: 67 EAWGTLHALLADINSQGQVQMAMNGGIYDESY--APLGLYIENGQQKVALNLASGEGNF- 123 + TL + ++Q+A+N ++ P G+ + LA +G Sbjct: 97 QPHETLAQKTSSFLKTHRLQLAVNANFFNPFNETTPWQYSPREGELTNLVGLAISDGQIV 156 Query: 124 --FIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKI 181 + + I + + + AV +G L P + + Sbjct: 157 SPGDKNYPALCFLEGRAEIRDEGV--CAPDTKQAV-AGLRLNLENRPPPDV-ETIYKFYP 212 Query: 182 RNGVGINKHGN-AVFLLSQQ-------ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMK 233 ++ G LL + A + +A L + LDG S Sbjct: 213 VCVAALDAEGTTLWLLLVDGKQPLYSEGMTRPEVADFLQA-LGATTAVQLDGGGSTTLAI 271 Query: 234 GGAI 237 Sbjct: 272 ASER 275 >UniRef50_C8X0Z8 Putative uncharacterized protein n=1 Tax=Desulfohalobium retbaense DSM 5692 RepID=C8X0Z8_DESRD Length = 302 Score = 63.1 bits (152), Expect = 9e-09, Method: Composition-based stats. Identities = 26/187 (13%), Positives = 62/187 (33%), Gaps = 10/187 (5%) Query: 44 TVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGL 103 + ++P+ R +Y A A T+ + + A+N +Y E Sbjct: 63 ELTVLRIDPEFFRFVLYSASAERGADRTVRQWVE----DKNLVAAINASMYWEDRETSTG 118 Query: 104 YIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFA--VQSGPM 161 + N + G FF+ + + L+ + Q+A +Q+ + Sbjct: 119 LMTNFGHVNNGRVHPEFGAFFVANPRRAQLPPVDILDRSLEQQWRKRVAQYATIIQNYRL 178 Query: 162 LMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFA-CYAKAKLNVEQL 220 L G + V + G+ +F+L + + + L++ Sbjct: 179 LDAKGE---NVWQASRQEHSSAAVAEDSQGHILFILQHEPVSVHALGSRLENLSLDLSTA 235 Query: 221 LYLDGTI 227 ++++G + Sbjct: 236 MFVEGGV 242 >UniRef50_Q0AWB0 Putative uncharacterized protein n=1 Tax=Syntrophomonas wolfei subsp. wolfei str. Goettingen RepID=Q0AWB0_SYNWW Length = 497 Score = 62.8 bits (151), Expect = 1e-08, Method: Composition-based stats. Identities = 26/174 (14%), Positives = 53/174 (30%), Gaps = 24/174 (13%) Query: 102 GLYIENGQQKVALNLA---SGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQF---- 154 G+ + NG + G + G Y+ ++ + ++ + F Sbjct: 168 GVIVSNGHVSSITTSSFNIPENGFAIVYNGASSYLVDERYKVGDEVYYEVIIKPTFTNPS 227 Query: 155 -------AVQSGPMLMENGVINPR-------IHPNVASSKIRNGVGINKHGNAVFLLSQQ 200 A+ +GP L+ NG + SS R+ +G G + Sbjct: 228 DWEEVQCAIGAGPSLIINGNVTASGEEEGFFEAKINTSSSPRSFIGATADGRIIMGNMDA 287 Query: 201 ATNFYDFACYAKAKLNVEQLLYLDGTIS-HMYMKGGAIPWQRYPFVTMISVERK 253 AT A ++ + + LDG S +Y + ++ + Sbjct: 288 ATLKKAAAAC--QRMGLVNAMCLDGGYSIALYYASAGVSLAGRDINNGLAFVGR 339 >UniRef50_A3TM75 Putative uncharacterized protein n=1 Tax=Janibacter sp. HTCC2649 RepID=A3TM75_9MICO Length = 1151 Score = 62.8 bits (151), Expect = 1e-08, Method: Composition-based stats. Identities = 15/93 (16%), Positives = 37/93 (39%), Gaps = 11/93 (11%) Query: 147 KTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLS-QQA---- 201 ++ A+ ML+++ V+ P+ + R +G + G+ +F+L+ Sbjct: 308 NEGANLKMAISGNTMLLKDNVVLPQTDKAI---HPRTAIGFDADGSTMFVLTVDGRMAAS 364 Query: 202 --TNFYDFACYAKAKLNVEQLLYLDGTISHMYM 232 + + + K ++ + LDG S + Sbjct: 365 RGMTYAETGAFLK-EVGATSGINLDGGGSSTML 396 >UniRef50_A7HN47 Putative uncharacterized protein n=1 Tax=Fervidobacterium nodosum Rt17-B1 RepID=A7HN47_FERNB Length = 528 Score = 62.8 bits (151), Expect = 1e-08, Method: Composition-based stats. Identities = 20/113 (17%), Positives = 41/113 (36%), Gaps = 14/113 (12%) Query: 137 KVGIVRLDAFKTSKEIQFAVQSGPMLMENGVIN-----PRIHPNV---ASSKIRNGVGIN 188 + +++ AV +GP+L+++ I ++ + R + I Sbjct: 402 GADVSVELYTDNGYKVKNAVGAGPLLIQDKKIIQDAAEEKLRYGGGIPTTRASRTIIAI- 460 Query: 189 KHGNAVFLLSQQ----ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAI 237 K G + + NF + A + +K E + LDG S + G + Sbjct: 461 KDGKVHLITIEGTNGTGMNFDEAAQFLLSK-GYESAMMLDGGGSTGMVYAGKL 512 >UniRef50_A6NQQ4 Putative uncharacterized protein n=1 Tax=Bacteroides capillosus ATCC 29799 RepID=A6NQQ4_9BACE Length = 1060 Score = 62.4 bits (150), Expect = 1e-08, Method: Composition-based stats. Identities = 22/134 (16%), Positives = 38/134 (28%), Gaps = 12/134 (8%) Query: 103 LYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPML 162 + I G+ +++N N + GD V I +E A+ L Sbjct: 295 IAIPEGKVVLSINNK---ANSYWLSNVKSLKPGDLVDIDVTTTDSIWQEADQAMGGLYKL 351 Query: 163 MENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYA------KAKLN 216 + G + + + VG+ G +F Y +L Sbjct: 352 VTAGKVESGLPTGQTAY---TAVGVKADGTVIFYTIDGKQPGYSVGASLTQVAMRLVELG 408 Query: 217 VEQLLYLDGTISHM 230 + LDG S Sbjct: 409 CVDAISLDGGGSTT 422 >UniRef50_C1YVW0 Putative uncharacterized protein n=1 Tax=Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 RepID=C1YVW0_NOCDA Length = 730 Score = 62.0 bits (149), Expect = 2e-08, Method: Composition-based stats. Identities = 23/123 (18%), Positives = 38/123 (30%), Gaps = 14/123 (11%) Query: 133 VAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGN 192 ++ V E + V +L+ +G P S R +G ++ G Sbjct: 243 LSEGDPVEVEHTLTAEGAEPRVVVGGRHVLVRDGEPVPVEDV---SRAPRTAIGFSEDGE 299 Query: 193 AVFLLS-QQAT------NFYDFACYAKAKLNVEQLLYLDGTISHMYM---KGGAIPWQRY 242 + +++ + A A EQ L LDG S + GG P R Sbjct: 300 VMHVVTADGRNRGHAGSTLAEVAELLAAS-GAEQALELDGGGSSTLLVREPGGVSPVLRN 358 Query: 243 PFV 245 Sbjct: 359 RAG 361 >UniRef50_Q4ZC55 ORF005 n=1 Tax=Staphylococcus phage EW RepID=Q4ZC55_9CAUD Length = 576 Score = 61.6 bits (148), Expect = 2e-08, Method: Composition-based stats. Identities = 22/162 (13%), Positives = 41/162 (25%), Gaps = 29/162 (17%) Query: 102 GLYIENGQQK-----VALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAV 156 G I NGQ V + G ++ + + Sbjct: 213 GEQIYNGQILETVKDYEPLKTRWTLAIADDNTLVSFPPGVTAKEIKDKGYNNT-----VS 267 Query: 157 QSGPMLMENGVINPR---IHPNVASSKIRNGVGINKHGNAVFLLSQQAT----------N 203 GP L+ +G I + N S R + + + +F Sbjct: 268 GFGP-LITDGQIVYKKGDYSTNSEESHPRQVICQLDNKDLLFFTCDGRVKSQGLLQKGMT 326 Query: 204 FYDFACYAKAKL-----NVEQLLYLDGTISHMYMKGGAIPWQ 240 + K++ ++ LDG S + G + Sbjct: 327 LSEVIETLKSEYPIGSNGIKFAYNLDGGGSSSSVLRGRRLNK 368 >UniRef50_D2PRV8 Metallophosphoesterase n=1 Tax=Kribbella flavida DSM 17836 RepID=D2PRV8_9ACTO Length = 1163 Score = 61.6 bits (148), Expect = 3e-08, Method: Composition-based stats. Identities = 16/79 (20%), Positives = 26/79 (32%), Gaps = 11/79 (13%) Query: 162 LMENGVINPRIHPNVASSKIRNGVGINKHGN-AVFLLSQQAT------NFYDFACYAKAK 214 L +G I + + + R +G + G V L + A Sbjct: 327 LARDGQI---LTVDDTALHPRTSIGFSADGRRMVLLTVDGRMVDSRGLTEKELARLMLD- 382 Query: 215 LNVEQLLYLDGTISHMYMK 233 L + +L LDG S +K Sbjct: 383 LGSDDVLNLDGGGSSTMLK 401 >UniRef50_Q7NGC8 Glr3243 protein n=1 Tax=Gloeobacter violaceus RepID=Q7NGC8_GLOVI Length = 540 Score = 61.6 bits (148), Expect = 3e-08, Method: Composition-based stats. Identities = 24/135 (17%), Positives = 47/135 (34%), Gaps = 19/135 (14%) Query: 125 IRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQ---------SGPMLMENGVIN-----P 170 + P G+ VA + L A + VQ +GP+L++N + Sbjct: 388 LPPDGLLLVARSEPLRTALRAVAAGTPVVLDVQPSGSGSLLGAGPLLVQNDKLVLDAQGE 447 Query: 171 RIHPNVASSK-IRNGVGINKHGNAVFLLSQQA----TNFYDFACYAKAKLNVEQLLYLDG 225 R P+V + R + + + ++ + +A + + L LDG Sbjct: 448 RFRPDVRAPGVARTAIARRGSLGILAVAARNGWAAGLSLESWANLLLQQFQADDALNLDG 507 Query: 226 TISHMYMKGGAIPWQ 240 S + GG + + Sbjct: 508 GGSSGFYLGGRLRDR 522 >UniRef50_A6LP25 Putative uncharacterized protein n=1 Tax=Thermosipho melanesiensis BI429 RepID=A6LP25_THEM4 Length = 534 Score = 61.2 bits (147), Expect = 3e-08, Method: Composition-based stats. Identities = 25/119 (21%), Positives = 40/119 (33%), Gaps = 18/119 (15%) Query: 147 KTSKEIQFAVQSGPMLMENGVINPRIHPNVA-----------SSKIRNGVGINKHGNAVF 195 I+ A+ +GP+L+ENG ++ + S R + I K G F Sbjct: 414 DFPFPIKHAIGAGPLLIENGK---KLIDSSEEKLRYSNGLALSKTTRTIIAITKEGRVDF 470 Query: 196 LLSQQATNF----YDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISV 250 ++ + N YD A + LDG S + + Q I V Sbjct: 471 IVIEGYNNTGGMNYDIATDFLISKGYFYAMMLDGGGSGAMVIQNEVVNQDGQIQRGIPV 529 >UniRef50_B7IEY1 Putative uncharacterized protein n=1 Tax=Thermosipho africanus TCF52B RepID=B7IEY1_THEAB Length = 535 Score = 61.2 bits (147), Expect = 3e-08, Method: Composition-based stats. Identities = 29/166 (17%), Positives = 46/166 (27%), Gaps = 19/166 (11%) Query: 103 LYIENGQQKVALNLASGEGNFFIRPGGVFYVA-----GDKVGIVRLDAFKTSKEIQFAVQ 157 I N Q + + + Y + I+ A+ Sbjct: 366 FVISNNQIISKEYVEKVPKDSMVLLITKKYDKYLKNIEVGSKVNLTINSDFPFPIKHAIG 425 Query: 158 SGPMLMENGVINPRIHPN--------VASSKIRNGVGINKHGNAVFLLSQQ-----ATNF 204 +GP+L+ENG S R + I K G F++ + N Sbjct: 426 AGPLLIENGKKLIDSDEEKLRYGNGLALSKTSRTIIAITKEGKVDFIVIEGYNDSPGMN- 484 Query: 205 YDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISV 250 YD A + + LDG S + + Q I V Sbjct: 485 YDIATEFLLEKGYFYAMMLDGGGSSAMVIQDEVVNQDGTIQRGIPV 530 >UniRef50_Q03K73 Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase n=3 Tax=Streptococcus thermophilus RepID=Q03K73_STRTD Length = 179 Score = 61.2 bits (147), Expect = 3e-08, Method: Composition-based stats. Identities = 17/112 (15%), Positives = 33/112 (29%), Gaps = 14/112 (12%) Query: 142 RLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASS---KIRNGVGI----NKHGNAV 194 + GP L+ENG + + V R + + ++ + + Sbjct: 33 TTAQKLVDSGVVNTFAFGPTLVENGKVAVSENEEVGQDMADNPRTAIVVNEESDRSVHYI 92 Query: 195 FLLSQQA------TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQ 240 ++S Y+ A K+ V LD S G + + Sbjct: 93 VIVSDGRTSESSGLTLYEMAELMKS-YGVMTGYNLDVGDSSTMYSNGQVINK 143 >UniRef50_C1TLP7 Sporulation related-protein with S-layer-like domain n=1 Tax=Dethiosulfovibrio peptidovorans DSM 11002 RepID=C1TLP7_9BACT Length = 565 Score = 61.2 bits (147), Expect = 3e-08, Method: Composition-based stats. Identities = 17/134 (12%), Positives = 52/134 (38%), Gaps = 12/134 (8%) Query: 116 LASGEGNFFIRPGGV-FYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLM-ENGVINPRIH 173 + E ++R G+ + ++ + + + + E + +Q+GPM++ G + Sbjct: 416 MKPEENILYLRTSGLGPFSDAAEIKLQTIWSDEAMIEAKQVIQAGPMILGLEGPFSSEWF 475 Query: 174 PNV--ASSKIRNGVGINKHGNAVFLLSQQATNFYD------FACYAKAKLNVEQLLYLDG 225 + R G + +++ ++++ A + + + + + +DG Sbjct: 476 SDSIINKRHPRTLAGWDGD-RLCWIVIDGRSSWHSDGATLSEAAFIARQAGLVKAINMDG 534 Query: 226 TISH-MYMKGGAIP 238 S ++ KG + Sbjct: 535 GGSSQLWWKGITVN 548 >UniRef50_D2PZR6 Sporulation domain protein n=4 Tax=Actinomycetales RepID=D2PZR6_9ACTO Length = 537 Score = 60.4 bits (145), Expect = 5e-08, Method: Composition-based stats. Identities = 18/112 (16%), Positives = 33/112 (29%), Gaps = 22/112 (19%) Query: 155 AVQSGPMLMENGVINPRIHPNV--------------ASSKIRNGVGINKHGNAVFLLSQQ 200 V GP L+ +G + + R G++ G V + + Sbjct: 415 VVNGGPELVRDGRLMATPKADGMAPAGNPNFYYGWVHKRNPRTIAGVDAQGRTVLITADG 474 Query: 201 A------TNFYDFACYAKAKLNVEQLLYLDGTISHMYM-KGGAIPWQRYPFV 245 + A AK+ L + + + LDG S + G + Sbjct: 475 RNVSSLGLGIAEAAAVAKS-LGLREAVNLDGGGSTTMVANGKVVNQPSDAAG 525 Score = 40.8 bits (94), Expect = 0.042, Method: Composition-based stats. Identities = 40/237 (16%), Positives = 70/237 (29%), Gaps = 42/237 (17%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQ 84 + + DD + S VQ T++P+ R + A+ + + + Sbjct: 162 IYTGWDGEPDDLSGSTGPWQVQVLTIDPKKFRGTL---DASYGLDLEARETTSTLATLTG 218 Query: 85 VQMAMNGGIYDES------YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKV 138 A+N G + P G+ + +G+ V G+ Sbjct: 219 ATAAVNAGFFVLDPKAGAPGDPAGVAVYDGRLVSEPTAGRPA-----------LVVGENA 267 Query: 139 GIVRLDAFKTSKEIQFAVQSGPMLMENG-VINPRIHPNVASSKIRNGVGINKHGNAVFLL 197 ++ F+ EI+ +G L +G P + IRN G Sbjct: 268 RGTSVERFRWRGEIR---GTGRPLPLDGLNRVPGL--------IRNCGGTTDD------- 309 Query: 198 SQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVERKG 254 A +D C +L Y +S G + R VT I +G Sbjct: 310 LPTAAPLHDVTCTDADELIAFDAAY---GVSTPSGPGAEVIVDRRGVVTAIRPATRG 363 >UniRef50_C6D5A3 Copper amine oxidase domain protein n=1 Tax=Paenibacillus sp. JDR-2 RepID=C6D5A3_PAESJ Length = 904 Score = 59.7 bits (143), Expect = 8e-08, Method: Composition-based stats. Identities = 25/149 (16%), Positives = 44/149 (29%), Gaps = 15/149 (10%) Query: 96 ESYAP-LGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQF 154 P G +G A+ ++ G V D + K Sbjct: 255 SDGKPLTGTIPADGYILRGHGTAAQFILEHLQVGQS--VTSDYSLVSATSGQKVDPTSFE 312 Query: 155 AVQSGPMLMENGVINPRIHPNVASSK-----IRNGVGINKHGNAVFLLSQQ------ATN 203 + G ++ N ++ R VG +K G V+L++ + N Sbjct: 313 MLVGGHTILVNNGAAATFSRDITGVSGSSYVSRTAVGYSKDGTKVYLITSEDYGDSTGLN 372 Query: 204 FYDFACYAKAKLNVEQLLYLDGTISHMYM 232 + KL V + + LDG S + Sbjct: 373 LKELQQVM-VKLGVYKGINLDGGGSTTMI 400 >UniRef50_D2J8B1 Putative uncharacterized protein n=1 Tax=Staphylococcus aureus RepID=D2J8B1_STAAU Length = 569 Score = 59.3 bits (142), Expect = 1e-07, Method: Composition-based stats. Identities = 19/182 (10%), Positives = 40/182 (21%), Gaps = 17/182 (9%) Query: 75 LLADINSQGQVQMAMNGG-IYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYV 133 + + N P G+ I NG+ ++ + G Sbjct: 182 TPREFAKRTGATFVSNASTGSGTQLLPHGVQIYNGKIIKSVKDYDALEQRWSLAIGEDNT 241 Query: 134 AGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPR---IHPNVASSKIRNGVGINKH 190 V + E G +++ I + PN R+ + + Sbjct: 242 LRTYAPNVNAETLLAQGETNVLSGFG-AFIQDNKITVKPGDFSPNTDVKHPRSVIAQLPN 300 Query: 191 GNAVFLLSQQA-----------TNFYDFACYAKAKLNVEQ-LLYLDGTISHMYMKGGAIP 238 + +F + LDG S ++ Sbjct: 301 KDIIFFACDGRENNNKGFVEKGMTLQEVGETLFKHYGEITLAYNLDGGGSTAHVLRSTKL 360 Query: 239 WQ 240 + Sbjct: 361 NK 362 >UniRef50_B2S1G8 Hypothetical cytosolic protein n=2 Tax=Borrelia RepID=B2S1G8_BORHD Length = 262 Score = 59.3 bits (142), Expect = 1e-07, Method: Composition-based stats. Identities = 24/202 (11%), Positives = 52/202 (25%), Gaps = 29/202 (14%) Query: 44 TVQAYTVNPQTERVKMYWQKANGE--AWGTLHALLADINSQGQVQMAMNGGIYDES---Y 98 + + + + + + + + +V +A+N Y+ + Sbjct: 44 NYVIVKIKNKNLKFIIPKPIYDQKMNNYYFKGQTTSQFLLSNKVDIAINTSPYEIKENMF 103 Query: 99 APLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQS 158 P GLYI + + A G + K + Sbjct: 104 YPNGLYIYDKKIISNAKKAQGIIIIKNNQI------------ILNPKQDEIKNSDYGFSG 151 Query: 159 GPMLMENGVINPRIHPNVASSKIRNGVGINKHG-NAVFLLSQQA-------TNFYDFACY 210 L+ NG N R +G +K + + + + + A Sbjct: 152 FFPLITNGNYTKNFKEN---KHPRTIIGTDKENKHLYLITVEGRGTNNSKGISLNE-AID 207 Query: 211 AKAKLNVEQLLYLDGTISHMYM 232 + + LDG S + Sbjct: 208 LSLNYAITNSINLDGGGSSTLV 229 >UniRef50_B8FVQ0 Ig-like, group 2 n=2 Tax=Desulfitobacterium hafniense RepID=B8FVQ0_DESHD Length = 913 Score = 58.9 bits (141), Expect = 2e-07, Method: Composition-based stats. Identities = 23/142 (16%), Positives = 43/142 (30%), Gaps = 24/142 (16%) Query: 114 LNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGP------------- 160 L + G+ I G V+ + L+ +++QF V S P Sbjct: 220 LEIREGQPAAEIPEEGFVVVSRGEQAAKLLEQAAPGEQLQFQVTSTPDWNDLKMSTTGTS 279 Query: 161 MLMENGVINPRIH---PNVASSKIRNGVGINKHGN-AVFLLSQQA------TNFYDFACY 210 +L+++G I + R G + G+ + + + A Sbjct: 280 LLIQDGEIPATFSYSTASFNQRNPRTMAGSTEDGSELILVTVDGRQDNSIGLTQQESAEL 339 Query: 211 AKAKLNVEQLLYLDGTISHMYM 232 +L Q + DG S Sbjct: 340 ML-ELGAYQAIMFDGGASTTMA 360 >UniRef50_C9RD84 Copper amine oxidase domain protein n=1 Tax=Ammonifex degensii KC4 RepID=C9RD84_AMMDK Length = 465 Score = 58.1 bits (139), Expect = 2e-07, Method: Composition-based stats. Identities = 25/174 (14%), Positives = 56/174 (32%), Gaps = 32/174 (18%) Query: 104 YIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQF--------- 154 +ENG G G + G G + F ++++ Sbjct: 183 VVENGVV-----TRMGSGPCPVPDNGYVIGFGPAAAAKFANRFYPGAKVEWWVVFEAKDG 237 Query: 155 ---------AVQSGPMLMENGVINPRIHPNVASSKIR-------NGVGINKHGNAVFLLS 198 +Q GP+L+++G I H + + + + +G + G V Sbjct: 238 APLQWSGRTVIQGGPLLLKDGAIVLDSHLDELYREPKFSRYGSWSFIGTDFEGCLVLGSV 297 Query: 199 QQATNFYDFACYAKAKLNVEQLLYLDGTIS-HMYMKGGAIPWQRYPFVTMISVE 251 + ++ A + + + + LDG S ++ +G + ++V Sbjct: 298 LGVDSLWNMARVLQ-QAGIRNAVCLDGNASCGLWYRGSYLVTPGRALSNCVAVT 350 Score = 40.8 bits (94), Expect = 0.047, Method: Composition-based stats. Identities = 23/98 (23%), Positives = 37/98 (37%), Gaps = 7/98 (7%) Query: 45 VQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYD--ESYAPLG 102 V V+ RV+++ A + LA + + A+NG ++ G Sbjct: 42 VHLIKVDISDPRVRVFPVLAQNKTGR--AESLASMACRVGAVAAVNGTFFNAYSDLTSWG 99 Query: 103 LYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGI 140 I+ GQ L SG G + PGG+ VA + Sbjct: 100 ALIDAGQVYR---LGSGSGALSLGPGGLAEVARLNSRV 134 >UniRef50_UPI00017890C7 copper amine oxidase domain protein n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI00017890C7 Length = 900 Score = 57.7 bits (138), Expect = 3e-07, Method: Composition-based stats. Identities = 17/100 (17%), Positives = 37/100 (37%), Gaps = 9/100 (9%) Query: 142 RLDAFKTSKEIQFAVQSGPMLMENGVINP--RIHPNVASSKIRNGVGINKHGNAVFLLSQ 199 + +Q + +L++NG R ++ ++ R VG +K G +L++ Sbjct: 297 KTGQKLDPTNLQMMIGGHTILVDNGKATSFSRNVNDLGGNRARTAVGYSKDGRYAYLIAT 356 Query: 200 Q------ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMK 233 + + K+ V + + LDG S + Sbjct: 357 ESNDNSKGMTLQQLQDFM-TKVGVWKGMNLDGGGSTTMVN 395 Score = 44.7 bits (104), Expect = 0.003, Method: Composition-based stats. Identities = 16/93 (17%), Positives = 31/93 (33%), Gaps = 3/93 (3%) Query: 45 VQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDES--YAPLG 102 V+ + VK+ G + T + + A+NG ++ AP+G Sbjct: 86 ANVIRVDLNNKYVKLDVMTGQGNQFTT-RQSTGGMAKENGAVAAINGDFFNTGREGAPMG 144 Query: 103 LYIENGQQKVALNLASGEGNFFIRPGGVFYVAG 135 + NG + + G F + G + Sbjct: 145 AQVSNGLMMSSPSDLKGMYAFAVTNDGKPILDE 177 >UniRef50_C9PT69 Putative uncharacterized protein n=1 Tax=Prevotella sp. oral taxon 472 str. F0295 RepID=C9PT69_9BACT Length = 814 Score = 57.4 bits (137), Expect = 4e-07, Method: Composition-based stats. Identities = 17/110 (15%), Positives = 32/110 (29%), Gaps = 22/110 (20%) Query: 148 TSKEIQFAVQSGPM--LMENGVINPRIHPNVASSKIRNGVGINKHGNAVF-LLSQQA--- 201 + + P ++++G +N N R G G + + Sbjct: 272 ATAPFTDVIGGDPRSPMLQDGTVNTTEIWN--ELHPRTGFGYTQDKKTAIHCVVDGRSTI 329 Query: 202 ---TNFYDFACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRYPFVTM 247 N + A K + + LDG S +++K F M Sbjct: 330 SAGANTKELAEIMKF-VGAYNAMNLDGGGSSCLFLK---------DFGPM 369 >UniRef50_A8F5X1 Putative uncharacterized protein n=1 Tax=Thermotoga lettingae TMO RepID=A8F5X1_THELT Length = 550 Score = 57.4 bits (137), Expect = 5e-07, Method: Composition-based stats. Identities = 19/146 (13%), Positives = 45/146 (30%), Gaps = 24/146 (16%) Query: 110 QKVALNLASGEGNFFIRPGGVFYVAG------DKVGIVRLDAFKTSKE----IQFAVQSG 159 +G + + G V ++ + + I + + I+ A+++G Sbjct: 377 VVDGKVNGTGWLSKAPKNGFVLAISSKYKKYLEGIQIGDTVEYIVNTNFPYPIKHAIEAG 436 Query: 160 PMLMENGVINPRIHPN--------VASSKIRNGVGINKHGNAVFLLS-----QQATNFYD 206 P+++ G P + +S R G V + N+ + Sbjct: 437 PLILYEGSPIPDRNDEKNRYGGSIARASATRTLAATLPDGKVVLAVINDQDGSGGVNYDE 496 Query: 207 FACYAKAKLNVEQLLYLDGTISHMYM 232 ++ K + DG S + + Sbjct: 497 LVEFSLKK-GFYSAMNFDGGSSSIMV 521 >UniRef50_C7QCB3 Putative uncharacterized protein n=1 Tax=Catenulispora acidiphila DSM 44928 RepID=C7QCB3_CATAD Length = 585 Score = 57.0 bits (136), Expect = 5e-07, Method: Composition-based stats. Identities = 33/206 (16%), Positives = 67/206 (32%), Gaps = 17/206 (8%) Query: 27 PLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQ 86 P+ VA +T ++ + ++ G + Sbjct: 341 PVVQVARVRPDAVYTGVTADVAVIDQKHSGFVLHPGHEGGLNSVITTVPNQIDANARPNL 400 Query: 87 MAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAF 146 +A+ G + S + G Y A +L +G + I G V R +F Sbjct: 401 IALFNGGFKISESHGGYYDHG---VTAASLVNGAASEVIFKDGHMAVGMWG----RDYSF 453 Query: 147 KTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKI--------RNGVGINKHGNAVFLLS 198 + + +I Q+ ++++ G + P I + + R+GVG+ G+ V+ + Sbjct: 454 QKNADIVSVRQNLKLMVDGGQVVPYIDDSSTWGRADHGSAAVWRSGVGVKADGDIVW-VG 512 Query: 199 QQATNFYDFACYAKAKLNVEQLLYLD 224 A K + + LD Sbjct: 513 GNELTAPSLARLLKDA-GAVRAMQLD 537 >UniRef50_A7SGX9 Predicted protein (Fragment) n=2 Tax=Nematostella vectensis RepID=A7SGX9_NEMVE Length = 442 Score = 57.0 bits (136), Expect = 6e-07, Method: Composition-based stats. Identities = 26/183 (14%), Positives = 54/183 (29%), Gaps = 27/183 (14%) Query: 39 SDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIY-DES 97 + V + G ++ + +A+ + Q + +A N G + ++ Sbjct: 61 RKADVQGHVSVVENPLNTFSILEPGEVGGCGKSVRSSVANSSRQKKCHVASNAGFFKTKN 120 Query: 98 YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVF---YVAGDKVGIVRLDAFKTSKEIQF 154 LG + NG+ + + NF IR G Y++ + V + + + Sbjct: 121 GNCLGNIVSNGKLVMDADGVQNA-NFGIRKDGTIVTGYLSENTVLDQENPFVQLVTGVIW 179 Query: 155 AVQSGPMLMENGVINPRIHPNVAS---------------SKIRNGVGINKHGNAVFLLSQ 199 L+ NG + R +G + G V + Sbjct: 180 -------LVRNGEVYVNASKKAECEDLQESGSVDLFVNVLAARTAIGHDAQGRVVIVQVD 232 Query: 200 QAT 202 T Sbjct: 233 GKT 235 >UniRef50_Q1MS76 Putative uncharacterized protein LI0093 n=1 Tax=Lawsonia intracellularis PHE/MN1-00 RepID=Q1MS76_LAWIP Length = 331 Score = 57.0 bits (136), Expect = 6e-07, Method: Composition-based stats. Identities = 29/219 (13%), Positives = 67/219 (30%), Gaps = 23/219 (10%) Query: 28 LFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQM 87 L+ + V +NPQ ++ G+A+ +L ++ Sbjct: 88 LWLGKFPGVTKAGDVFEVVMLKINPQYYDFSLHMASQTGKAF-SLQDWSNTY----ELSA 142 Query: 88 AMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFI-------RPGGVFYVAGDKVGI 140 +N +Y Y+ N ++ G FF+ P Sbjct: 143 VINASMYLPDGVTSTGYLRNHDHINNAHVGKRLGAFFVASPYNSTLPNADLLDRTSDNWE 202 Query: 141 VRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQ 200 + L +K VQ+ ++ N + S V + G F+ ++ Sbjct: 203 ILLPQYKI------VVQNYRVISANRQCLWSTKKVIHSIA---AVARDGKGYLFFIHTKY 253 Query: 201 ATNFYDFACYAKA-KLNVEQLLYLDGTI-SHMYMKGGAI 237 + DF + +++ ++Y++G + + + Sbjct: 254 PISDLDFGNLLLSLPIDIRIVMYVEGGSQAGLLINTSNF 292 >UniRef50_C4DE18 Putative uncharacterized protein n=1 Tax=Stackebrandtia nassauensis DSM 44728 RepID=C4DE18_9ACTO Length = 393 Score = 56.2 bits (134), Expect = 9e-07, Method: Composition-based stats. Identities = 16/87 (18%), Positives = 30/87 (34%), Gaps = 10/87 (11%) Query: 158 SGPMLMENGVINPRIHPNVASSKIRNGVGINKHG-NAVFLLSQQA------TNFYDFACY 210 G +L+++G + ++ + A+ R VG N G + A Sbjct: 279 GGLILIDDGTMLD-LNDDAATLAPRTAVGSNADGSKLYMVAVDGRSSTSVGATVKSMADI 337 Query: 211 AKAKLNVE-QLLYLDGTISHMYMKGGA 236 L + ++ LDG S + A Sbjct: 338 MVN-LGADHNVINLDGGGSTTLVARKA 363 >UniRef50_A9GRW8 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GRW8_SORC5 Length = 387 Score = 56.2 bits (134), Expect = 1e-06, Method: Composition-based stats. Identities = 26/202 (12%), Positives = 59/202 (29%), Gaps = 30/202 (14%) Query: 40 DPTLTVQAYTVNPQTERVKM-------YWQKANGEAWGTLHALLADINSQGQVQMAMNGG 92 + V+ V + G+ + L + ++ A NGG Sbjct: 140 RSWAELFIVAVDLARVDVHLMAGSREPAATTEEGKPYERLAKIPE--ADHERLLAAFNGG 197 Query: 93 IYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEI 152 E G+ + +G V G +A ++ Sbjct: 198 FMTEHGQ-WGMRV-DGVTLV--RPRDQGCTLARHRDGRLQIAP------WTRLSAGESDM 247 Query: 153 QFAVQSGPMLMENGVINPRIHP----------NVASSKIRNGVGINKHGNAVFLLSQQAT 202 Q+ Q+ +++ G ++P + + + R+ VG+++ G +++ T Sbjct: 248 QWWRQTPSCMVDEGELHPLLRAPQVRNWGATLDGNTVIRRSAVGLDRDGKVLYVGISNHT 307 Query: 203 NFYDFACYAKAKLNVEQLLYLD 224 A + + LD Sbjct: 308 TAPAIALGMQHA-GASAVAQLD 328 >UniRef50_C7PW43 Ig domain protein group 2 domain protein n=2 Tax=Catenulispora acidiphila DSM 44928 RepID=C7PW43_CATAD Length = 1174 Score = 55.8 bits (133), Expect = 1e-06, Method: Composition-based stats. Identities = 23/175 (13%), Positives = 56/175 (32%), Gaps = 14/175 (8%) Query: 44 TVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYD--ESYAPL 101 Q V+ V++ +++ + + + + +NG ++ S PL Sbjct: 95 RAQVMDVDLADPNVRLGVVESHDHLTDAADEVPSSMAHRTGAVGGVNGDFFEIYGSGRPL 154 Query: 102 GLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPM 161 G+ + +G+ + + + G + G + A + AV S Sbjct: 155 GMVVIDGRLVKSPDPTWNADLWVRHDGSIGIGTETYAGSLTDGAATAAITAVNAVNS--- 211 Query: 162 LMENGVINPRIHPNVASSKI---RNGVG--INKHGNAVFL--LSQQATNFYDFAC 209 +G R+ P++ + V + G + + ++ T A Sbjct: 212 --LSGNAIVRVTPDLGTPSPIAASTVVAGHLGADGTTLLVDSVTAGVTTLPQLAA 264 Score = 49.3 bits (116), Expect = 1e-04, Method: Composition-based stats. Identities = 14/107 (13%), Positives = 33/107 (30%), Gaps = 12/107 (11%) Query: 137 KVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASS--KIRNGVGINKHG-NA 193 I + ++ + G +L++NG + + ++ VG+++ G +A Sbjct: 286 GDQIAVSEKIGPDPDVVQGLSGGAILVQNGQRAVPLQGSGENNVDNPVTAVGVSQDGKHA 345 Query: 194 VFLLSQQ--------ATNFYDFACYAKAKLNVEQLLYLDGTISHMYM 232 VF A + + + D S + Sbjct: 346 VFAAFDGHQSEDVAQGLTRPQIAGWM-TQHGAYNAILFDSGGSTQMV 391 >UniRef50_Q826N8 Putative secreted protein n=1 Tax=Streptomyces avermitilis RepID=Q826N8_STRAW Length = 409 Score = 55.8 bits (133), Expect = 1e-06, Method: Composition-based stats. Identities = 19/87 (21%), Positives = 30/87 (34%), Gaps = 11/87 (12%) Query: 153 QFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGN-AVFLLSQQ------ATNFY 205 FA+ P+L G P + + +S +R VGI G + L Sbjct: 294 SFAIGGYPVL-RQGKPLPGL--DTVTSAVRTAVGIKDAGRRLLLLAIDGAAAYRSGLTIA 350 Query: 206 DFACYAKAKLNVEQLLYLDGTISHMYM 232 + A + L + LDG S + Sbjct: 351 EVASVMRG-LGATEAFSLDGGGSTTLV 376 >UniRef50_D1Y6Q3 Putative liporotein n=1 Tax=Pyramidobacter piscolens W5455 RepID=D1Y6Q3_9BACT Length = 572 Score = 55.1 bits (131), Expect = 2e-06, Method: Composition-based stats. Identities = 24/131 (18%), Positives = 48/131 (36%), Gaps = 17/131 (12%) Query: 134 AGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVI---NPRIHPNVASSKIRNGVGINKH 190 GD+V + ++ AVQ+GP+L G + R +G + Sbjct: 440 KGDRVTLETQWRETPPIDVASAVQAGPLLYAPGHQFWDEMLSLSILTLRHPRTLLGWDGK 499 Query: 191 GNAVFLLSQQATNFYDFACYAKA------KLNVEQLLYLDGTISH-MYMKGGAIP----- 238 +++ ++++ + +L + LL LDG S M+ G + Sbjct: 500 RMVW-IVADGRSSWHSRGLFLNEAEQLGRRLGLTALLNLDGGGSSEMWWDGHVVNAVSDG 558 Query: 239 -WQRYPFVTMI 248 +R P+ M+ Sbjct: 559 RERRMPYGLMV 569 >UniRef50_C7LY43 Putative uncharacterized protein n=1 Tax=Acidimicrobium ferrooxidans DSM 10331 RepID=C7LY43_ACIFD Length = 397 Score = 55.1 bits (131), Expect = 2e-06, Method: Composition-based stats. Identities = 35/172 (20%), Positives = 62/172 (36%), Gaps = 24/172 (13%) Query: 63 KANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGN 122 G W + + + + + A N G + G Y G+ V L +G + Sbjct: 182 PPGGGPWPYMAPITNPVAAD--LVAAFNSGFRMQDAN--GGYYAYGRTAVPL--RNGAAS 235 Query: 123 FFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPN---VASS 179 F I GV + K I Q+ L+ NG INP ++ + + Sbjct: 236 FVISTSGVPTIE------TWTHGNHVPKGIAVVRQNLIPLISNGRINPLVNSTNFAIWGA 289 Query: 180 KI-------RNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLD 224 + R+GVGI ++G V+ ++ + A A++ + LD Sbjct: 290 TVGNQLLVWRSGVGITRNGALVY-VTGPGLSVASLARLL-ARVGAVNAMELD 339 >UniRef50_A6G841 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G841_9DELT Length = 507 Score = 55.1 bits (131), Expect = 2e-06, Method: Composition-based stats. Identities = 21/141 (14%), Positives = 37/141 (26%), Gaps = 30/141 (21%) Query: 128 GGVFYVAGDKVGIVRLDAFKT-----SKEIQFAVQSGPMLMENGVIN------------- 169 G + V + A++ + + + GPML+E G + Sbjct: 337 NGFVVPTPETVEVGAEVAYEPLRGSGGRPLVAGIAGGPMLLEGGALTLDLRREDFWGSAP 396 Query: 170 ----PRIHPNVASSKIRNGVGINKHGNAVFLLSQQA-------TNFYDFACYAKAKLNVE 218 + + R VG++ VF+ +A L Sbjct: 397 PVTFSQDETGDQNLLPRLAVGLDHAQRLVFVAVDGRDFGRALGMTLGGVGEVLQA-LGCH 455 Query: 219 QLLYLDGTISHMYMKGGAIPW 239 LDG S + G Sbjct: 456 TATNLDGGASKRMVLRGRALD 476 Score = 42.7 bits (99), Expect = 0.013, Method: Composition-based stats. Identities = 25/146 (17%), Positives = 47/146 (32%), Gaps = 23/146 (15%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQ 84 + P A A ++ + + V+PQ R+ + E +G Sbjct: 166 VAPGLEHARVAQACAEGPVHLNVLRVDPQRVRLAV---DDRREGVRAGQPFTEWTRQRG- 221 Query: 85 VQMAMNGGIY----------DESYAPLGLYIENGQQKVALNLASG------EGNFFIRPG 128 A++GG + Y P+GL + G+ A G EG I P Sbjct: 222 ATAAVSGGFFLYSEPDIEAPSARYDPVGLLLGEGRCLSPPVFARGALLLDAEGGVAIEPL 281 Query: 129 GVFYVAGDKVGIVRLDAFKTSKEIQF 154 G + G + + ++ ++ Sbjct: 282 G---LGGTHLRLADGRPLDAAEAARW 304 >UniRef50_UPI00019038D8 hypothetical protein Retl8_15906 n=1 Tax=Rhizobium etli 8C-3 RepID=UPI00019038D8 Length = 91 Score = 54.7 bits (130), Expect = 3e-06, Method: Composition-based stats. Identities = 13/67 (19%), Positives = 29/67 (43%), Gaps = 4/67 (5%) Query: 32 AADDCALSDPTLT---VQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQ-VQM 87 A T T+ ++++W+ A+GE + +L + ++G+ + Sbjct: 23 QARAQQCGQETFDEAKYVVCTLEVGKVDLRLFWKGADGEPYRAFSSLADAVRAEGRKLIF 82 Query: 88 AMNGGIY 94 A+N G+Y Sbjct: 83 AVNAGMY 89 >UniRef50_UPI00016BFF19 Ig-like, group 2 n=1 Tax=Epulopiscium sp. 'N.t. morphotype B' RepID=UPI00016BFF19 Length = 935 Score = 54.3 bits (129), Expect = 4e-06, Method: Composition-based stats. Identities = 8/64 (12%), Positives = 15/64 (23%), Gaps = 7/64 (10%) Query: 175 NVASSKIRNGVGINKH-GNAVFLLSQQAT-----NFYDFACYAKAKLNVEQLLYLDGTIS 228 R + G + + + L + +YLDG S Sbjct: 299 GKQVRHPRTLIATTDKFGELLLITIDGRQYSAGATHDEVIQILLD-LGAKDAMYLDGGGS 357 Query: 229 HMYM 232 + Sbjct: 358 TTMV 361 >UniRef50_C6IV65 Putative uncharacterized protein n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6IV65_9BACL Length = 257 Score = 53.9 bits (128), Expect = 5e-06, Method: Composition-based stats. Identities = 39/244 (15%), Positives = 81/244 (33%), Gaps = 43/244 (17%) Query: 10 GMITLNLKRIFLALTLLPLF--------AVAADDCALSDPTLTVQAYTVNPQTERVKMYW 61 G+ ++ IFL + L+ + + ++ + + A V+P+ ++ Sbjct: 10 GIAMASVMAIFLVILLMIGWGSRYLLPRHYEYHETTAAN-GVKLHALVVDPERIELR--- 65 Query: 62 QKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEG 121 A + G MNGG + + A L L + N Q A G G Sbjct: 66 --AADQPLGRYR------------VYGMNGGFF-YNEAVLSLAVNNDQPVQGTAGAYGSG 110 Query: 122 NFFIRP-GGVFYVAGDK-----VGIVRLDAFKTSKEIQFAVQSGPML-MENGV------I 168 F + G G + + ++ Q G + + + + Sbjct: 111 WFNAKYARGTLVWDGATGSFSVQVVSAASELAVTDRTRYFAQGGVSMKLPDDAGWRAAAV 170 Query: 169 NPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKL---NVEQLLYLDG 225 PN +++R+G+ + +G +++ F K+ L + ++LDG Sbjct: 171 EAEHLPNPDENRLRSGLAYDANGQLWLIVTPTRCTAEAFRTAVKSALADGGLVDGIFLDG 230 Query: 226 TISH 229 S Sbjct: 231 DGSA 234 >UniRef50_B2A2E0 Copper amine oxidase domain protein n=1 Tax=Natranaerobius thermophilus JW/NM-WN-LF RepID=B2A2E0_NATTJ Length = 514 Score = 53.9 bits (128), Expect = 5e-06, Method: Composition-based stats. Identities = 26/132 (19%), Positives = 45/132 (34%), Gaps = 16/132 (12%) Query: 132 YVAGDKVGIVRLDAFKTSKEIQFA------VQSGPMLMENGVINPRIHPN------VASS 179 Y GDK + + + +E V +GP L+ NG + + + Sbjct: 229 YSPGDKAKLKPSVSDEDREEPIEIEDFIHMVGAGPKLVNNGREDVDLEKDQMTGERHTIK 288 Query: 180 KIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGT-ISHMYMKGGAIP 238 R+ +G + A N D A +L + + + LDG S +Y +G I Sbjct: 289 ARRSFIGY-NDNEVIMGTVDGA-NHEDTAAIC-VELGLTEAMALDGGASSGLYYEGDYIT 345 Query: 239 WQRYPFVTMISV 250 + V Sbjct: 346 RPGREISNALVV 357 >UniRef50_A3P9C8 Putative lipoprotein n=32 Tax=pseudomallei group RepID=A3P9C8_BURP0 Length = 563 Score = 53.5 bits (127), Expect = 6e-06, Method: Composition-based stats. Identities = 26/162 (16%), Positives = 40/162 (24%), Gaps = 35/162 (21%) Query: 107 NGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPML---- 162 NG L ++ PG V+ A + + A GP L Sbjct: 384 NGYVLQGLGASAAWLQAHATPGTRLAVSRRLSADGADLALASGTSLVEA---GPTLSVPN 440 Query: 163 MENGVINPRIHPNVAS--------------------SKIRNGVGINKHGNAVFLLSQQA- 201 + P V R G+ G + + Sbjct: 441 LAQSAAQEGFAPTVGGVDAGEGAAANGNWYNGWYVARNGRTAAGVAADGTILLVEIDGRQ 500 Query: 202 ------TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAI 237 T+ + A A L + LDG S + GG + Sbjct: 501 PALSVGTSIPETAAVM-AWLGATSAVNLDGGGSSNMVVGGKM 541 Score = 42.7 bits (99), Expect = 0.012, Method: Composition-based stats. Identities = 12/94 (12%), Positives = 26/94 (27%), Gaps = 9/94 (9%) Query: 45 VQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYD--------- 95 V ++P + G G ++ ++ +NGG + Sbjct: 178 VNVLAIDPSRAGAALSLALPGGNDLGAGGETVSAARARVNALAGVNGGFFTNINPFGAPL 237 Query: 96 ESYAPLGLYIENGQQKVALNLASGEGNFFIRPGG 129 +P+G + +G+ A G Sbjct: 238 PPRSPVGATVVDGRLVAAAIGRRPGLLLARDANG 271 >UniRef50_A3YXL4 Putative uncharacterized protein n=2 Tax=Chroococcales RepID=A3YXL4_9SYNE Length = 610 Score = 52.4 bits (124), Expect = 1e-05, Method: Composition-based stats. Identities = 29/177 (16%), Positives = 46/177 (25%), Gaps = 21/177 (11%) Query: 91 GGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSK 150 G S G+ I +G LN A + + V V +A + + Sbjct: 415 AGYQALSGNESGVLIRDGVVLQRLNGAQLQRGIPLGREDTLVVGRAGVMPPWPEASRLTL 474 Query: 151 EIQ----FAVQSGPMLME-----------NGVINPRIHPNVASSKIRNGVGINKHGNAVF 195 Q Q+ M NG R +G + F Sbjct: 475 SSQSSDPLGQQAYVMGGGPLLLLGGRVVLNGTAEGFSSAFQGQGAPRTVIG-SDGRQIWF 533 Query: 196 L----LSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMI 248 L + + A + +L + + L LDG S G + I Sbjct: 534 LTLQGVDHAGPTLGETATLLR-QLGLREALNLDGGSSTGLFVGNTQTVRGRGVAASI 589 >UniRef50_C6J2I2 Copper amine oxidase domain-containing protein n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6J2I2_9BACL Length = 930 Score = 51.6 bits (122), Expect = 2e-05, Method: Composition-based stats. Identities = 20/141 (14%), Positives = 47/141 (33%), Gaps = 12/141 (8%) Query: 102 GLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPM 161 G +G A A+ ++ G A ++ + +++ + + Sbjct: 284 GPVPADGYILRAHGTAADYVASHLQV-GQRVDAIYQLQSLTDGQLVDPADLKVMIGGHTL 342 Query: 162 LMENGVINP----RIHPNVASSKIRNGVGINKHGNAVFLLS------QQATNFYDFACYA 211 L++ G + + S+ R VG +K G ++++ + + Sbjct: 343 LVDQGKASAFTRSTTSISGGSAVARTAVGYSKDGKTAYIITAEKNSNSTGLTLKELQGFM 402 Query: 212 KAKLNVEQLLYLDGTISHMYM 232 + V + L LDG S + Sbjct: 403 -TGIGVWKGLNLDGGGSTTMV 422 >UniRef50_C0DAA9 Putative uncharacterized protein n=1 Tax=Clostridium asparagiforme DSM 15981 RepID=C0DAA9_9CLOT Length = 798 Score = 51.6 bits (122), Expect = 3e-05, Method: Composition-based stats. Identities = 16/75 (21%), Positives = 25/75 (33%), Gaps = 9/75 (12%) Query: 178 SSKIRNGVGINKHGNAVFLLSQQATNF------YDFACYAKAKL-NVEQLLYLDGTISHM 230 S R VG+ G L+ T A+ + NV ++ +DG S + Sbjct: 702 SKHPRTAVGVTDQGELFVLVFSGRTALSVGADYAQMGRIARTLVPNVRHMMNVDGGGSAV 761 Query: 231 Y--MKGGAIPWQRYP 243 + G YP Sbjct: 762 FGMAVGKVFVELSYP 776 >UniRef50_D2PYC0 Metallophosphoesterase n=1 Tax=Kribbella flavida DSM 17836 RepID=D2PYC0_9ACTO Length = 1094 Score = 50.4 bits (119), Expect = 5e-05, Method: Composition-based stats. Identities = 36/231 (15%), Positives = 66/231 (28%), Gaps = 31/231 (13%) Query: 28 LFAVAADDCALSDPTLTVQAYTVNPQTER--VKMYWQKANGEAWGTLHALLADINSQGQV 85 AVA S +A +PQ +++ + G + I Sbjct: 125 GPAVADGQLVKSQSEDPYRAVAFDPQGVGRILEVLFDGTAG-PYRLNRLNSPVIRKDEIG 183 Query: 86 QMAMNGGIYDESYAPLGLYIENGQQKVALNLAS------------GEGNFFIRPGGVFYV 133 G Y ++A G + + G R G + Sbjct: 184 AFTTLWGSYSRAHAVAGAAKVTEVVVAGDTVTAVAAAAGAGDIPAGTTILVGREAGADEL 243 Query: 134 AGDKVGIVRLDAFKTSKE----IQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINK 189 KVG AF ++ AV + +L++ G P + + R G+G + Sbjct: 244 GRLKVGDRVPVAFAPRASDGSVVRTAVGAHALLVKEGKPQP---ADDTAYAGRTGLGFSA 300 Query: 190 HGNAVFLLS--------QQATNFYDFACYAKAKLNVEQLLYLDGTISHMYM 232 G + ++S + + A+ + LDG S + Sbjct: 301 DGKKMVIVSIDSNRLTHSRGATLAEMGRILAAR-GAYVGVELDGGGSTTLV 350 >UniRef50_C7QHR1 Putative uncharacterized protein n=1 Tax=Catenulispora acidiphila DSM 44928 RepID=C7QHR1_CATAD Length = 636 Score = 50.4 bits (119), Expect = 6e-05, Method: Composition-based stats. Identities = 29/227 (12%), Positives = 64/227 (28%), Gaps = 23/227 (10%) Query: 8 GKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGE 67 G+G + + + + ++ + ++ Sbjct: 374 GEGTWSPAVVVNGVPVIQTAKLRSDPQHLE-----YLSAVAWMDQKHASFVLHPGSQQPG 428 Query: 68 AWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRP 127 G + + NG G + NG+ L F+ Sbjct: 429 TAGYNQTDHLSGDQFKNLIATWNGAFLLNPNDAHGGFYLNGKTYGTLVPGQASEVFY--K 486 Query: 128 GGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASS-------- 179 G V G + + + Q+ +L++NG +NP + + Sbjct: 487 DGTMNVGSWNSG----PGLQMAPNVVGVRQNLQLLVDNGQVNPSVDSDDKKLWGVTVKNA 542 Query: 180 --KIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLD 224 R+G+G+ GN V+ + A + A + + + + LD Sbjct: 543 YFVWRSGIGVTADGNLVYAM-GPALSVRTLAELLQ-RAGAVRGMELD 587 >UniRef50_C3YJA0 Putative uncharacterized protein n=3 Tax=Branchiostoma floridae RepID=C3YJA0_BRAFL Length = 851 Score = 50.0 bits (118), Expect = 7e-05, Method: Composition-based stats. Identities = 29/167 (17%), Positives = 52/167 (31%), Gaps = 26/167 (15%) Query: 54 TERVKMYWQKANGEAWGTLHALLADINSQGQ---VQMAMNGGIYD-ESYAPLGLYIENGQ 109 ++ + G + G + + Q +A+N G +D + A LG + +GQ Sbjct: 330 NNPLRTFSVVEPGGSNGCMEPRRRTVTQTSQTRTCHVALNAGFFDTRTGACLGNVVTDGQ 389 Query: 110 QKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPM-LMENGVI 168 + NF IR G V G + + SG + L+ N + Sbjct: 390 RVQDSGGIQNA-NFGIRKDGTIVV-----GYLSEQDVLREDNPFVQLVSGVIWLVRNATV 443 Query: 169 NPRIHPNVASSKI---------------RNGVGINKHGNAVFLLSQQ 200 + I R VG ++ G V + + Sbjct: 444 YVNESRTTECADIQETGTLDRFVNVVSARTAVGHDEEGRVVLVHIEG 490 >UniRef50_A6WEB7 Putative uncharacterized protein n=1 Tax=Kineococcus radiotolerans SRS30216 RepID=A6WEB7_KINRD Length = 986 Score = 49.7 bits (117), Expect = 1e-04, Method: Composition-based stats. Identities = 29/166 (17%), Positives = 49/166 (29%), Gaps = 16/166 (9%) Query: 73 HALLADINSQGQVQMAMNGGIYDESYAPLGLYI-ENGQQKVALNLASGEGNFFIRPGGVF 131 AL + G V++ + G AP + + +G VA + P G Sbjct: 194 RALTVADPAAGAVELEVRAGRVSAVRAPGAVPVPADGYVLVATGSRARA--LSATPVGAA 251 Query: 132 YVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSK--IRNGVGINK 189 +V L FA+ + L+ +G I P + + R +G Sbjct: 252 AGTDLRVRDDALSPGSRG----FALGARLELVRDGAIAPIDVADPTWAALRARTALGWTA 307 Query: 190 HGNAVFLLSQQ------ATNFYDFACYAKAKLNVEQLLYLDGTISH 229 G+ + L + A + LDG S Sbjct: 308 TGDLLLLTVDGGTSRSRGLTAVETAQRMVEA-GARGAVMLDGGGSA 352 >UniRef50_B8HP94 Polysaccharide deacetylase n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HP94_CYAP4 Length = 645 Score = 48.9 bits (115), Expect = 2e-04, Method: Composition-based stats. Identities = 27/189 (14%), Positives = 56/189 (29%), Gaps = 17/189 (8%) Query: 78 DINSQGQVQMAMNGGIYDESYAPLGLYI------ENGQQK----VALNLASGEGNFFIRP 127 +I ++ Q ++GG + + + I +NGQ +G I P Sbjct: 442 EILAKTQAVAGVDGGFFSLEFLDSNVMIGPVLSQKNGQFVPGNASENPRLNGRPLVLISP 501 Query: 128 GGVFYVA-GDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNV----ASSKIR 182 GV ++ A + L++ P +++ R Sbjct: 502 TGVRFIPFDASKHNSLEGIQAEDPGATDAFVAAAWLVKQNQPQPEQSFGNLFDFNAARHR 561 Query: 183 NGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTIS-HMYMKGGAIPWQR 241 GIN++G +S + + + LD S + +G ++ Sbjct: 562 AFWGINQNGQPTIGVSTEPVGSVELGEILYKA-GFRDAVMLDSGASTSLAYQGESLVGYT 620 Query: 242 YPFVTMISV 250 V + Sbjct: 621 PRPVPHVVA 629 >UniRef50_A9BJK8 Putative uncharacterized protein n=1 Tax=Petrotoga mobilis SJ95 RepID=A9BJK8_PETMO Length = 561 Score = 48.1 bits (113), Expect = 3e-04, Method: Composition-based stats. Identities = 35/236 (14%), Positives = 66/236 (27%), Gaps = 29/236 (12%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQ-TERVKMYWQKANGEAWGTLHALLADINSQG 83 LL L + + + L + + Q ++W K+ W L Sbjct: 313 LLHLSSYSRPALIIGTNFLDIDYIKLEYQLNIDNLLFWIKSINSTWKGDVKLYTHHYKGN 372 Query: 84 QVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVF--YVA----GDK 137 + N + EN + EG I + Y+ G K Sbjct: 373 ITETEENYVFFLID--------ENNRIISKNKTTPSEGEKLILVDKKYEKYLENISLGTK 424 Query: 138 VGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVAS--------SKIRNGVGINK 189 V + + + ++ GP+L+ + ++ S R V I+K Sbjct: 425 VDFTLNKSENLTNDPTLLLEGGPILIHSKYTQEQLDAEKKSYSNGIIYGKAPRTVVAIDK 484 Query: 190 HGNAVFLLSQQ------ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPW 239 N ++ + + + E + LDG S + G I Sbjct: 485 EQNINLMVIEGLDNPETGLTYDETRNLLFKIGEFEVAMMLDGGSSSIVYYEGEIQN 540 >UniRef50_Q7NIQ9 Gll2123 protein n=1 Tax=Gloeobacter violaceus RepID=Q7NIQ9_GLOVI Length = 518 Score = 48.1 bits (113), Expect = 3e-04, Method: Composition-based stats. Identities = 36/220 (16%), Positives = 72/220 (32%), Gaps = 28/220 (12%) Query: 53 QTERVKMYWQKANGEAWGTLHA-----LLADINSQGQVQMAMNGGIYDESYAP------L 101 RV++ W +G + T H+ + D + + +NGG + + + Sbjct: 288 THRRVRLVWL--SGGNYTTRHSEGNRYPVGDFIERERAVGGINGGFFAFAGLRATNSDMV 345 Query: 102 GLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQ---- 157 G Y+ + + + + RP + G + + F T E + + Sbjct: 346 GPYLSQNEGRFMPGAPEFDKSLRGRPVVLISATGLRFVPYSPETFDTEAEARAYLSDLSD 405 Query: 158 ---SGPMLMENGVINPRIH------PNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFA 208 +G L+ NG N + + R +GI+K G + + N A Sbjct: 406 LFVAGVWLVNNGQALTTEQIEQFRLSNHSEFRRRTFMGIDKAGLPMVGATLTNVNATQLA 465 Query: 209 CYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMI 248 A + + + + LD S + G + I Sbjct: 466 R-ALEEAGLREAVLLDSGFSTSLVHQGKVLVT-GHTAPSI 503 >UniRef50_B7KAR9 Polysaccharide deacetylase n=3 Tax=Cyanothece RepID=B7KAR9_CYAP7 Length = 627 Score = 48.1 bits (113), Expect = 3e-04, Method: Composition-based stats. Identities = 30/232 (12%), Positives = 72/232 (31%), Gaps = 26/232 (11%) Query: 42 TLTVQAYTVNPQTER--VKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYD---- 95 + Y + + + +++I + + A++GG + Sbjct: 368 NFNTRIYKQDFTVNDTQLTLITGGRPNTIHADTRYQVSEIIAGTGAEAAVDGGFFSLESL 427 Query: 96 ESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFK-------- 147 +S +G + G + I+ + ++ D+V V D K Sbjct: 428 DSNVMIGPVL--GHNTGEFIPGNAWEIPRIKGRPLVLMSSDRVRFVPFDPNKHNTYEGVI 485 Query: 148 ----TSKEIQFAVQSGPMLMENGVINP----RIHPNVASSKIRNGVGINKHGNAVFLLSQ 199 ++I A L+ + P + +++ R GIN+ G V +++ Sbjct: 486 SEATEGEKITDAFVGAAWLVRDNQPQPPEAFGELFDFEAARHRAFWGINQAGQPVIGVTK 545 Query: 200 QATNFYDFACYAKAKLNVEQLLYLDGTIS-HMYMKGGAIPWQRYPFVTMISV 250 + +L + + LD S + +G ++ V + Sbjct: 546 TMVGSVELGEIL-HQLGLRDAVMLDSGASTSLAYQGESLVGYTPRPVPHVVA 596 >UniRef50_Q72HQ9 Putative uncharacterized protein n=4 Tax=Thermaceae RepID=Q72HQ9_THET2 Length = 487 Score = 47.3 bits (111), Expect = 5e-04, Method: Composition-based stats. Identities = 20/111 (18%), Positives = 39/111 (35%), Gaps = 12/111 (10%) Query: 149 SKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGI--------NKHGNAVFLLSQQ 200 + ++A++ GP+L++ G R + K G L+S+ Sbjct: 374 NPPFRYALEGGPLLLKEGR-YAYDPAKENFKDPRPLQAVAPQAAVAWTKEGKLWLLVSE- 431 Query: 201 ATNFYDFACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRYPFVTMISV 250 T A + L L +DG S +++KG + ++S Sbjct: 432 PTTPGALARALLS-LGAWNALRMDGGGSAQLWVKGVLRSPYQGSPRPVVSA 481 >UniRef50_C1XUX9 Putative uncharacterized protein n=1 Tax=Meiothermus silvanus DSM 9946 RepID=C1XUX9_9DEIN Length = 497 Score = 47.0 bits (110), Expect = 5e-04, Method: Composition-based stats. Identities = 20/118 (16%), Positives = 44/118 (37%), Gaps = 10/118 (8%) Query: 137 KVGIVRLDAFKTSKEIQFAVQSGPMLMENGV--INPRIHPNVASS-----KIRNGVGINK 189 + G + + +A+++GP+L+++G NP + P ++ V + Sbjct: 368 RTGEILKLYGSLEPPLAYALEAGPLLIQSGAYAFNPNLEPFTDPRPLNATAPQSAVAWTQ 427 Query: 190 HGNAVFLLSQQATNFYDFACYAKA-KLNVEQLLYLDGTISH-MYMKGGAIPWQRYPFV 245 G ++S T A + N+ + +D S +Y++G P Sbjct: 428 DGRLWLVVSD-PTTPSTLARALQLYNPNIWGAIRMDAGGSAQLYVRGSLRTPLIEPQA 484 Score = 40.4 bits (93), Expect = 0.052, Method: Composition-based stats. Identities = 19/90 (21%), Positives = 33/90 (36%), Gaps = 11/90 (12%) Query: 24 TLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQG 83 + P F + L + +P R++ G+ L A L + Sbjct: 192 VIAPGFRYREVW-TFTPEPLRLYLVEADPGRWRME-----PVGQP--GLRAYLPSLAP-- 241 Query: 84 QVQMAMNGGIYDE-SYAPLGLYIENGQQKV 112 +NGG +D S P+GL+I++G Sbjct: 242 TALAILNGGYFDPKSGTPIGLWIKDGVALN 271 >UniRef50_A9EQ62 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9EQ62_SORC5 Length = 397 Score = 46.6 bits (109), Expect = 8e-04, Method: Composition-based stats. Identities = 24/192 (12%), Positives = 59/192 (30%), Gaps = 23/192 (11%) Query: 45 VQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQG---QVQMAMNGGIYDESYAPL 101 + ++ +++ + + G ++ A NGG Sbjct: 142 IAVVAIDLGRVDLRLVAGTKEPFSPDIPAERRPGLVPGGHAAELVAAFNGGFKAMHGH-Y 200 Query: 102 GLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPM 161 G+ ++ + A + + + G R+ A++ + P Sbjct: 201 GMMLDGDTFLPPRDRACTIALYRSGAVRIRTWPELRDGEARMAAYRQTP---------PC 251 Query: 162 LMENGVINPRIHPN---------VASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAK 212 L+E G ++ ++ + + R+ +G++ G +F +A A K Sbjct: 252 LVEQGELHHALYDSNRDWGATVSGETVIRRSALGVDATGKLLFYGLGEAVTARSLARGMK 311 Query: 213 AKLNVEQLLYLD 224 A + LD Sbjct: 312 AA-GAHDVAELD 322 >UniRef50_B8CD22 Predicted protein n=1 Tax=Thalassiosira pseudonana RepID=B8CD22_THAPS Length = 572 Score = 46.6 bits (109), Expect = 0.001, Method: Composition-based stats. Identities = 15/114 (13%), Positives = 29/114 (25%), Gaps = 32/114 (28%) Query: 146 FKTSKEIQFAVQSGPMLMENGV----------------INPRIHPNVA---SSKIRNGVG 186 + + AV GP+ ++ + + R G+G Sbjct: 404 YTLPTPLDNAVAGGPIFFDDNNDEQTMDLPSEDFKGSAPPVTFSQDETFDRNLLPRMGIG 463 Query: 187 INKHGN-----AVFLLSQQA-------TNFYDFACYAKAKLNVEQLLYLDGTIS 228 I + + V + + K L + + LDG S Sbjct: 464 ITNNDSSGEKELVCVAVDGRNLDRALGLTLQGTSDLLK-TLGCVKAMNLDGGSS 516 >UniRef50_A4FIV8 Secreted protein n=1 Tax=Saccharopolyspora erythraea NRRL 2338 RepID=A4FIV8_SACEN Length = 94 Score = 46.2 bits (108), Expect = 0.001, Method: Composition-based stats. Identities = 9/68 (13%), Positives = 20/68 (29%), Gaps = 9/68 (13%) Query: 186 GINKHGNAVFLLSQQA-------TNFYDFACYAKAKLNVEQLLYLDGTISHMY-MKGGAI 237 GI++ G + + + A + ++ L + LD S + + G Sbjct: 3 GIDQAGRLLPVTVDGRRPGSSAGFTLLEAARFMRS-LGAVNAMNLDSGGSTSFVVNGKPA 61 Query: 238 PWQRYPFV 245 Sbjct: 62 NSPSDATG 69 >UniRef50_A7MD65 Zgc:165534 protein n=3 Tax=Clupeocephala RepID=A7MD65_DANRE Length = 313 Score = 45.4 bits (106), Expect = 0.002, Method: Composition-based stats. Identities = 11/36 (30%), Positives = 18/36 (50%), Gaps = 1/36 (2%) Query: 202 TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAI 237 N + A + K + NV + LDG S Y+ G++ Sbjct: 1 MNLWQVAKFLKDQ-NVMNAINLDGGGSATYVLNGSL 35 >UniRef50_UPI00016A4F20 hypothetical protein BthaT_13010 n=4 Tax=pseudomallei group RepID=UPI00016A4F20 Length = 196 Score = 45.4 bits (106), Expect = 0.002, Method: Composition-based stats. Identities = 13/66 (19%), Positives = 23/66 (34%), Gaps = 8/66 (12%) Query: 178 SSKIRNGVGINKHGNAVFLLSQQAT-------NFYDFACYAKAKLNVEQLLYLDGTISHM 230 + R VG+ G+ + + + + A A L + LDG+ S Sbjct: 126 ARNGRTAVGVAADGSVLLVGIDGRQPVPGVGASVPETAAGM-AWLGAASAVTLDGSGSSN 184 Query: 231 YMKGGA 236 + GG Sbjct: 185 LVIGGK 190 >UniRef50_Q8YTL3 All2704 protein n=4 Tax=Nostocaceae RepID=Q8YTL3_ANASP Length = 310 Score = 45.0 bits (105), Expect = 0.002, Method: Composition-based stats. Identities = 29/198 (14%), Positives = 50/198 (25%), Gaps = 33/198 (16%) Query: 81 SQGQVQMAMNGGIYDESYAPLGLYIENGQQKVA--LNLASGEGNFFIRPGGVFYVAGDKV 138 + + A+N D P GL I G + N S G +P K Sbjct: 117 NGRRPIAAINADYIDPENKPQGLNISRGVEYSGDFKNKRSSFGISGGKPQ------ERKA 170 Query: 139 GIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNV----ASSKIRNGVGINKHGNAV 194 I +G G S R+ I G + Sbjct: 171 TIQAGRREINILNYNLVGGNG-RFYRQGKFKDICQDLGEFACKQSTNRSMAAITNKGYVI 229 Query: 195 FLLSQ-------------QATNFYDFACYA-----KAKLN-VEQLLYLDGTIS-HMYMKG 234 L++ Q F + L +++ + DG +S +Y Sbjct: 230 LLVNDIKANSNIEINSNNQELTPDKFDDVLEGISRQNCLGKIQEGILFDGGMSPGLYYNK 289 Query: 235 GAIPWQRYPFVTMISVER 252 P ++ + + Sbjct: 290 KIYVENPGPIGSVFLIYK 307 >UniRef50_Q119M8 Putative uncharacterized protein n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q119M8_TRIEI Length = 283 Score = 45.0 bits (105), Expect = 0.002, Method: Composition-based stats. Identities = 27/218 (12%), Positives = 59/218 (27%), Gaps = 34/218 (15%) Query: 51 NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQM-AMNGGIYDESYAPL-------- 101 + T++ T+ + +++ S+ NG + + Sbjct: 80 DLGTKKGAYGGNNP-QFERQTISQVWSNLYSENSSLFCITNGQFFRNDKSSSTALAFPLK 138 Query: 102 --GLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSG 159 G+ + +G F + V ++ I + I G Sbjct: 139 SDGIIVSDGY---------AGEIEFSHEKLMLEVWNNRALISKFQPNNLQFSIATNFIVG 189 Query: 160 PMLMENGVINPRIHPNVASSKIRNGVGI-NKHGNAV---FLLSQQATNFYDFACYAKAKL 215 + V R +G+ +K G+ + L+ A Sbjct: 190 --------LQENADKGVEDQTGRTFIGVQDKDGDRLYETILIFTSKQATQPHATNVLKSF 241 Query: 216 NVEQLLYLDGTISH-MYMKGGAIPWQRYPFVTMISVER 252 Q++ LDG S + +G + MI++ Sbjct: 242 GATQVMMLDGGGSTQLICQGNNYIDSQRTIPQMIAIFS 279 >UniRef50_A9V0B9 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9V0B9_MONBE Length = 623 Score = 44.3 bits (103), Expect = 0.004, Method: Composition-based stats. Identities = 16/83 (19%), Positives = 25/83 (30%), Gaps = 21/83 (25%) Query: 162 LMENGVINPRIHPNVASSKI---------------RNGVGINKHGNAVFLLSQQ-----A 201 L+ NG + S I R + + +G + Sbjct: 269 LVRNGQVYVNESIAYECSNIEESGSLQEFANLQSARTALAHDSNGAVRIVQHNGQSGHYG 328 Query: 202 TNFYDFACYAKAKLNVEQLLYLD 224 N Y+FA Y K + V + LD Sbjct: 329 INLYEFAKYLKQQ-GVVNAINLD 350 Score = 42.3 bits (98), Expect = 0.014, Method: Composition-based stats. Identities = 10/70 (14%), Positives = 25/70 (35%), Gaps = 1/70 (1%) Query: 50 VNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYD-ESYAPLGLYIENG 108 ++P+ G G ++++ ++ A N G ++ G + +G Sbjct: 101 LDPRRTLSVYEPGAPGGCGDGDHRTIVSETAARHDCIYATNAGFFNTHDGTCYGDIVSDG 160 Query: 109 QQKVALNLAS 118 + A N + Sbjct: 161 RLVQADNHTN 170 >UniRef50_Q2JPV6 Polysaccharide deacetylase family protein n=2 Tax=Synechococcus RepID=Q2JPV6_SYNJB Length = 723 Score = 43.5 bits (101), Expect = 0.007, Method: Composition-based stats. Identities = 25/181 (13%), Positives = 55/181 (30%), Gaps = 20/181 (11%) Query: 77 ADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVA------------LNLASGEGNFF 124 + + + Q +NG + + + G A G Sbjct: 339 STLAQRYQADAGINGSFFSIPWINSASNVMVGPAMAANHKTFIPGRPEDDQAIRGRPLVL 398 Query: 125 IRPGGVFYVAGDKVGIVRLDAFKT-SKEIQFAVQSGPMLMENG------VINPRIHPNVA 177 + + +V D + L+ + ++ +G L+++G IN + A Sbjct: 399 LGRDRLRFVPFDPDTMTHLENIRQLMPDVTDLFVAGLWLVKDGHALSPAEINSFRLASAA 458 Query: 178 SSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAI 237 + R G++ V +++ N A + + + LD S + G I Sbjct: 459 EFRPRAFFGVDDQERVVIGVTKTHVNAAILASLLPKT-GIREAVLLDSGFSTSLVYQGEI 517 Query: 238 P 238 Sbjct: 518 L 518 >UniRef50_B2HMV0 Lipoprotein LprO n=21 Tax=Mycobacterium RepID=B2HMV0_MYCMM Length = 381 Score = 43.1 bits (100), Expect = 0.009, Method: Composition-based stats. Identities = 34/213 (15%), Positives = 61/213 (28%), Gaps = 43/213 (20%) Query: 74 ALLADINSQGQVQMAMNGGIYDESY------------APLGLYIEN------------GQ 109 L GQ +A+N +D +PLG +++N G Sbjct: 149 PPLQAWQRMGQPTIAINANFFDVRGQKGGSWRTTGCSSPLGAFVDNTHGMGRANQAVTGT 208 Query: 110 QKVALNLASGEGNFF--------IRPGGVFYVAGDKVGI-----VRLDAFKTSKEIQFAV 156 A GN I GG YV K + +K +F Sbjct: 209 VAYAGKQGLSGGNEVWTSLTTMIIPVGGAPYVLRPKGRQDYDLATPVIQDLLNKNAKFVA 268 Query: 157 QSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLN 216 +G L+ G I + S R + +K + +++ + + + L Sbjct: 269 VAGIGLLSPGDI--GQLHDGGPSAARTALAYSKPKDEMYIFEGGSYTPDNIQDLFRG-LG 325 Query: 217 VEQLLYLDGTISHMYMKGGAIPWQRYPFVTMIS 249 + + LDG S + + + Sbjct: 326 SDTAILLDGGGSSAIVL---RRDTGGMWAGAGA 355 >UniRef50_Q92JI8 Uncharacterized protein RC0079 n=11 Tax=Rickettsia RepID=Y079_RICCN Length = 282 Score = 42.7 bits (99), Expect = 0.011, Method: Composition-based stats. Identities = 8/40 (20%), Positives = 17/40 (42%) Query: 160 PMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQ 199 P+L++NG + R +G+ G V ++ + Sbjct: 179 PLLVQNGKNVVDNPKQDDPAHARTALGVCNDGTIVIVVVE 218 >UniRef50_UPI0001C1628D hypothetical protein CRC_02750 n=2 Tax=Nostocaceae RepID=UPI0001C1628D Length = 633 Score = 42.3 bits (98), Expect = 0.014, Method: Composition-based stats. Identities = 29/190 (15%), Positives = 54/190 (28%), Gaps = 23/190 (12%) Query: 81 SQGQVQMAMNGGIYD----ESYAPLGLYIENGQQKVALNLASGEGN-----FFIRPGGVF 131 QV A++GG + +S +G + + + N + I P V Sbjct: 430 KDTQVVAAVDGGFFSLKYLDSNTMIGPVLSGNRGFIPGNASENLKLRDRPLVLINPHSVS 489 Query: 132 YVA------GDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNV----ASSKI 181 ++ GI F + L+ N +++ Sbjct: 490 FIPFVPETHNTLEGIQATSPENKGVTDTFVGAAW--LVRNNTPRTAADFGNLYDYDAARH 547 Query: 182 RNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTIS-HMYMKGGAIPWQ 240 R GIN G V +++ +L + LD S + +G ++ Sbjct: 548 RAFWGINLAGMPVIGVTKTPVGSVSLGEILY-QLGFRDAVMLDSGASTSLSYRGKSLVAY 606 Query: 241 RYPFVTMISV 250 V V Sbjct: 607 TPRPVPHAVV 616 >UniRef50_C5KB48 Putative uncharacterized protein n=1 Tax=Perkinsus marinus ATCC 50983 RepID=C5KB48_9ALVE Length = 925 Score = 42.3 bits (98), Expect = 0.015, Method: Composition-based stats. Identities = 24/172 (13%), Positives = 49/172 (28%), Gaps = 29/172 (16%) Query: 80 NSQGQVQMAMNGGIYDESY-APLGLYIENGQQKV-ALNLASGEGNFFIRPGGVFYVAGDK 137 + + + N ++ P +I++G + A + F P VF + D Sbjct: 716 KADENIVLLSNS--VQSNFQNPYDCFIQDGIMQRQGYASACPKYAFGDSPVDVFVLDSDT 773 Query: 138 V-----GIVRLDAFKTSKEIQFAVQSGPMLMENGVINPR-----------------IHPN 175 D + AV P + +G + P+ Sbjct: 774 EERRLRSCKVTDECNERFPWRRAVSGRPF-VTDGELRKIPHWDHEDGEEIKYGEVPWLPS 832 Query: 176 VASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTI 227 + + V ++ G V + Q Y+F + + L G+ Sbjct: 833 STEAAF-SAVCESRDGQVVLAYAIQPLTAYEFGRALIDS-GIHDAVLLGGSG 882 >UniRef50_C4ICA7 Putative uncharacterized protein n=1 Tax=Clostridium butyricum E4 str. BoNT E BL5262 RepID=C4ICA7_CLOBU Length = 42 Score = 42.3 bits (98), Expect = 0.016, Method: Composition-based stats. Identities = 10/40 (25%), Positives = 13/40 (32%), Gaps = 1/40 (2%) Query: 214 KLNVEQLLYLDGT-ISHMYMKGGAIPWQRYPFVTMISVER 252 KL + LDG S MY G I + + Sbjct: 3 KLGAVNAINLDGGKSSTMYYNGNTINETEGRKIPTAILVE 42 >UniRef50_A9QSK3 Polysaccharide biosynthesis protein n=7 Tax=Streptococcaceae RepID=A9QSK3_LACLK Length = 300 Score = 41.6 bits (96), Expect = 0.024, Method: Composition-based stats. Identities = 25/157 (15%), Positives = 52/157 (33%), Gaps = 15/157 (9%) Query: 51 NPQTERVKMYWQKANGEAWGTLHAL------LADINSQGQVQMAMNGGIYDE-SYAPLGL 103 + T + + ++ N E T+ ++++ ++ + MN +D + G Sbjct: 97 DLSTNNITI-YRINNPEVLKTVTNRTDQRMKMSEVIAKYPNALIMNASAFDMQTGQVAGF 155 Query: 104 YIENGQQKVALNLASGEG-NFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPML 162 I NG+ + + F I G + + + Q A G + Sbjct: 156 QINNGKLIQDWSPGTTTQYAFVINKDGSCKIYDSSTPALTI----IKNGGQQAYDFGTAI 211 Query: 163 MENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQ 199 + +G I P I + +K N +LS Sbjct: 212 IRDGKIQPSDGSVDWKIHI--FIANDKDNNLYAILSD 246 >UniRef50_A5N8M7 Predicted regulatory protein n=2 Tax=Clostridium kluyveri RepID=A5N8M7_CLOK5 Length = 535 Score = 40.8 bits (94), Expect = 0.045, Method: Composition-based stats. Identities = 15/80 (18%), Positives = 25/80 (31%), Gaps = 5/80 (6%) Query: 161 MLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQL 220 +++ NG + I+ N ++K N G + + Y Sbjct: 443 IIVHNGNV---IYKNSGNTKY-NIAGFTDKNVLISGEYSIGASLYQLQQILLEN-GAYTA 497 Query: 221 LYLDGTISHMYMKGGAIPWQ 240 LDG IS G I + Sbjct: 498 AVLDGGISSTMYYKGNIINK 517 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P27840 Uncharacterized protein yigE n=68 Tax=Enterobact... 232 7e-60 UniRef50_B2II06 Putative uncharacterized protein n=5 Tax=Rhizobi... 223 3e-57 UniRef50_Q98NI9 Mlr0120 protein n=3 Tax=Alphaproteobacteria RepI... 206 7e-52 UniRef50_B9JX75 Putative uncharacterized protein n=1 Tax=Agrobac... 203 4e-51 UniRef50_A9W4Y6 Putative uncharacterized protein n=4 Tax=Methylo... 203 5e-51 UniRef50_D2LB58 Putative uncharacterized protein n=1 Tax=Rhodomi... 189 6e-47 UniRef50_Q0BWY7 Putative lipoprotein n=1 Tax=Hyphomonas neptuniu... 188 1e-46 UniRef50_A9CIN9 Putative uncharacterized protein n=6 Tax=Proteob... 186 6e-46 UniRef50_C8PYM1 Putative uncharacterized protein n=1 Tax=Enhydro... 182 8e-45 UniRef50_Q26CZ6 Periplasmic protein n=1 Tax=Flavobacteria bacter... 182 9e-45 UniRef50_Q1MEZ5 Conserved hypothetical exported protein n=45 Tax... 181 1e-44 UniRef50_Q11X50 Putative uncharacterized protein n=2 Tax=Bactero... 180 4e-44 UniRef50_Q093S1 Putative uncharacterized protein n=1 Tax=Stigmat... 179 5e-44 UniRef50_Q1IX28 Periplasmic YigE-like protein n=3 Tax=Deinococcu... 178 1e-43 UniRef50_A9D6B9 Putative uncharacterized protein n=1 Tax=Hoeflea... 177 2e-43 UniRef50_C5CWT4 Periplasmic protein-like protein n=3 Tax=Bacteri... 176 7e-43 UniRef50_D0SJ31 Putative uncharacterized protein n=1 Tax=Acineto... 176 1e-42 UniRef50_B9KP42 Periplasmic protein-like protein n=37 Tax=Rhodob... 175 1e-42 UniRef50_A1B5U0 Putative uncharacterized protein n=1 Tax=Paracoc... 174 2e-42 UniRef50_Q0G184 Putative uncharacterized protein n=2 Tax=Auranti... 171 2e-41 UniRef50_D1AL67 Periplasmic protein-like protein n=1 Tax=Sebalde... 171 3e-41 UniRef50_C7PF78 Putative uncharacterized protein n=1 Tax=Chitino... 166 5e-40 UniRef50_Q1QCK8 Periplasmic protein-like n=1 Tax=Psychrobacter c... 163 6e-39 UniRef50_A5WGQ7 Periplasmic protein-like protein n=1 Tax=Psychro... 161 2e-38 UniRef50_A5EW16 Putative uncharacterized protein n=1 Tax=Dichelo... 161 2e-38 UniRef50_C8N6Z2 Lipoprotein n=1 Tax=Cardiobacterium hominis ATCC... 159 9e-38 UniRef50_Q2NAA1 Putative uncharacterized protein n=3 Tax=Erythro... 158 2e-37 UniRef50_B2HYZ5 Predicted periplasmic protein n=11 Tax=Acinetoba... 155 1e-36 UniRef50_A0Q3C5 Conserved protein n=7 Tax=Clostridia RepID=A0Q3C... 153 4e-36 UniRef50_UPI0001744B4D hypothetical protein VspiD_05050 n=1 Tax=... 152 1e-35 UniRef50_A6LS70 Putative uncharacterized protein n=23 Tax=Clostr... 152 1e-35 UniRef50_Q5WVS5 Putative uncharacterized protein n=5 Tax=Legione... 149 8e-35 UniRef50_D1REU9 Putative uncharacterized protein n=1 Tax=Legione... 148 2e-34 UniRef50_B8I4Q1 Putative uncharacterized protein n=3 Tax=Clostri... 146 7e-34 UniRef50_D0WLU9 Putative uncharacterized protein n=1 Tax=Actinom... 144 2e-33 UniRef50_A0PY15 Conserved protein n=4 Tax=Clostridium RepID=A0PY... 144 2e-33 UniRef50_C2JZN3 N-acetylmuramoyl-L-alanine amidase/probable S-la... 144 2e-33 UniRef50_C4G6X0 Putative uncharacterized protein n=2 Tax=Lactoba... 144 2e-33 UniRef50_C4FXK4 Putative uncharacterized protein n=1 Tax=Catonel... 142 8e-33 UniRef50_Q97FU3 Uncharaterized conserved protein, YOME B.subtili... 142 1e-32 UniRef50_B2KU41 N-acetylmuramoyl-L-alanine amidase/putative S-la... 141 2e-32 UniRef50_B7AQ96 Putative uncharacterized protein n=1 Tax=Bactero... 140 4e-32 UniRef50_C6XT14 Exopolysaccharide biosynthesis protein n=1 Tax=P... 140 4e-32 UniRef50_C7TED9 N-acetylmuramoyl-L-alanine amidase n=2 Tax=Lacto... 140 5e-32 UniRef50_B8FUP3 Putative uncharacterized protein n=2 Tax=Desulfi... 139 6e-32 UniRef50_C3QHD0 Exopolysaccharide biosynthesis protein n=2 Tax=B... 139 8e-32 UniRef50_A9KDD2 Hypothetical exported protein n=6 Tax=Coxiella b... 138 2e-31 UniRef50_C2KZT9 Exopolysaccharide biosynthesis protein n=2 Tax=F... 137 3e-31 UniRef50_D1CZ42 Putative uncharacterized protein n=1 Tax=Brucell... 137 4e-31 UniRef50_Q97FU6 Uncharaterized conserved protein, YOME B.subtili... 137 4e-31 UniRef50_A4VXL8 Exopolysaccharide biosynthesis protein related t... 136 7e-31 UniRef50_Q892K3 N-acetylmuramoyl-L-alanine amidase/putative S-la... 135 1e-30 UniRef50_A7GCS1 Putative uncharacterized protein n=12 Tax=Clostr... 134 2e-30 UniRef50_C2HB28 Exopolysaccharide biosynthesis protein n=4 Tax=E... 134 2e-30 UniRef50_C1I4R7 Putative uncharacterized protein (Fragment) n=1 ... 134 2e-30 UniRef50_C6IYX5 Putative uncharacterized protein n=1 Tax=Paeniba... 134 3e-30 UniRef50_C6D6X3 Exopolysaccharide biosynthesis protein n=6 Tax=B... 133 4e-30 UniRef50_B0TEY5 Putative uncharacterized protein n=1 Tax=Helioba... 133 6e-30 UniRef50_B1BC21 Putative uncharacterized protein n=2 Tax=Clostri... 132 1e-29 UniRef50_C9KSV8 N-acetylmuramoyl-L-alanine amidase/putative S-la... 131 2e-29 UniRef50_C0ZEQ6 Putative uncharacterized protein n=1 Tax=Breviba... 131 2e-29 UniRef50_C0GEE0 Putative uncharacterized protein n=1 Tax=Dethiob... 130 3e-29 UniRef50_A5USB9 Putative uncharacterized protein n=2 Tax=Roseifl... 129 7e-29 UniRef50_D1BL19 Putative uncharacterized protein n=4 Tax=Veillon... 128 2e-28 UniRef50_UPI000178A82C copper amine oxidase domain protein n=1 T... 127 4e-28 UniRef50_D1N9W8 Putative uncharacterized protein n=1 Tax=Victiva... 127 5e-28 UniRef50_B2V2N5 Putative uncharacterized protein n=8 Tax=Clostri... 125 2e-27 UniRef50_C8WTH1 N-acetylglucosamine-1-phosphodiester alpha-N-ace... 125 2e-27 UniRef50_UPI0001694670 hypothetical protein Plarl_22443 n=1 Tax=... 124 3e-27 UniRef50_O31980 SPBc2 prophage-derived uncharacterized protein y... 124 4e-27 UniRef50_C6PYU6 Putative uncharacterized protein n=1 Tax=Clostri... 124 4e-27 UniRef50_A7V127 Putative uncharacterized protein n=1 Tax=Bactero... 124 4e-27 UniRef50_C6J074 Copper amine oxidase domain-containing protein n... 123 5e-27 UniRef50_C1CWE2 Putative LysM lysin domain protein, n=1 Tax=Dein... 123 5e-27 UniRef50_UPI0000E45D54 PREDICTED: similar to N-acetylglucosamine... 122 8e-27 UniRef50_D0TN59 Predicted protein n=3 Tax=Bacteroides RepID=D0TN... 122 9e-27 UniRef50_D2NR45 Exopolysaccharide biosynthesis protein related t... 122 9e-27 UniRef50_UPI0001BC3362 hypothetical protein BcroD2_01243 n=1 Tax... 122 1e-26 UniRef50_C5S6T1 Putative uncharacterized protein n=1 Tax=Allochr... 121 3e-26 UniRef50_B3CE38 Putative uncharacterized protein n=3 Tax=Bactero... 121 3e-26 UniRef50_A9B1E5 Putative uncharacterized protein n=1 Tax=Herpeto... 119 7e-26 UniRef50_A7LRK2 Putative uncharacterized protein n=1 Tax=Bactero... 119 1e-25 UniRef50_C0ZFU4 Putative uncharacterized protein n=1 Tax=Breviba... 119 1e-25 UniRef50_A0YND3 Putative uncharacterized protein n=1 Tax=Lyngbya... 119 1e-25 UniRef50_A7LRK4 Putative uncharacterized protein n=1 Tax=Bactero... 118 2e-25 UniRef50_B9YC35 Putative uncharacterized protein n=2 Tax=Holdema... 117 5e-25 UniRef50_C3R3M8 Putative uncharacterized protein n=1 Tax=Bactero... 116 5e-25 UniRef50_C5RID5 Putative uncharacterized protein n=1 Tax=Clostri... 116 6e-25 UniRef50_A9WEC1 Putative uncharacterized protein n=3 Tax=Chlorof... 116 7e-25 UniRef50_Q1IXP5 Peptidoglycan-binding LysM domain-containing pro... 116 7e-25 UniRef50_B7H7U4 Putative uncharacterized protein n=27 Tax=Bacill... 116 8e-25 UniRef50_Q9UK23 N-acetylglucosamine-1-phosphodiester alpha-N-ace... 115 1e-24 UniRef50_C1XS52 Predicted periplasmic protein (DUF2233) n=1 Tax=... 115 1e-24 UniRef50_A6L611 Putative uncharacterized protein n=1 Tax=Bactero... 115 1e-24 UniRef50_B1I1S0 N-acetylglucosamine-1-phosphodiester alpha-N-ace... 115 1e-24 UniRef50_Q8A0T0 Putative uncharacterized protein n=10 Tax=Bacter... 115 1e-24 UniRef50_B7ASL4 Putative uncharacterized protein n=1 Tax=Bactero... 115 1e-24 UniRef50_C8WU56 N-acetylglucosamine-1-phosphodiester alpha-N-ace... 114 2e-24 UniRef50_C5PL46 Exopolysaccharide biosynthesis protein n=2 Tax=S... 114 2e-24 UniRef50_Q7X4R9 XcbC n=1 Tax=Neisseria meningitidis RepID=Q7X4R9... 114 3e-24 UniRef50_B4CZJ8 Putative uncharacterized protein n=1 Tax=Chthoni... 114 3e-24 UniRef50_C4Z6E6 Putative uncharacterized protein n=1 Tax=Eubacte... 114 3e-24 UniRef50_C6XWN0 Putative uncharacterized protein n=1 Tax=Pedobac... 114 4e-24 UniRef50_Q73Q09 Putative uncharacterized protein n=1 Tax=Trepone... 114 4e-24 UniRef50_C6D0A3 Exopolysaccharide biosynthesis protein n=1 Tax=P... 113 5e-24 UniRef50_C4Z4Z5 Putative uncharacterized protein n=1 Tax=Eubacte... 113 6e-24 UniRef50_A6L610 Putative uncharacterized protein n=1 Tax=Bactero... 112 8e-24 UniRef50_C8PNM8 Putative uncharacterized protein n=1 Tax=Trepone... 112 1e-23 UniRef50_UPI000180BA0C PREDICTED: similar to predicted protein n... 112 2e-23 UniRef50_C4ICA6 Peptidase, M56 family n=1 Tax=Clostridium butyri... 112 2e-23 UniRef50_Q4UP44 Putative uncharacterized protein n=4 Tax=Bacteri... 111 2e-23 UniRef50_C6CV17 Exopolysaccharide biosynthesis protein n=1 Tax=P... 111 2e-23 UniRef50_C6J7B9 Exopolysaccharide biosynthesis protein n=2 Tax=B... 111 2e-23 UniRef50_C6JBU1 Putative uncharacterized protein n=1 Tax=Ruminoc... 111 2e-23 UniRef50_C6XT12 NHL repeat containing protein n=2 Tax=Pedobacter... 110 4e-23 UniRef50_C6XXH4 Putative uncharacterized protein n=1 Tax=Pedobac... 110 4e-23 UniRef50_D2V2G1 N-acetylglucosamine-1-phosphodiester alpha-N-ace... 110 5e-23 UniRef50_B8CYN3 SpoIID/LytB domain protein n=1 Tax=Halothermothr... 109 9e-23 UniRef50_B3RIP6 Putative uncharacterized protein (Fragment) n=2 ... 109 1e-22 UniRef50_B8G1I8 Peptidase M56 BlaR1 n=4 Tax=Desulfitobacterium h... 109 1e-22 UniRef50_C0WEQ2 Exopolysaccharide biosynthesis protein n=1 Tax=A... 109 1e-22 UniRef50_B2J8B3 Putative uncharacterized protein n=1 Tax=Nostoc ... 108 2e-22 UniRef50_C0CND1 Putative uncharacterized protein n=1 Tax=Blautia... 108 2e-22 UniRef50_C4V4S8 Exopolysaccharide biosynthesis protein n=1 Tax=S... 108 2e-22 UniRef50_B2A8G9 Copper amine oxidase domain protein n=1 Tax=Natr... 107 3e-22 UniRef50_B5VVA8 S-layer domain protein n=3 Tax=Cyanobacteria Rep... 107 4e-22 UniRef50_C3R3L4 Putative uncharacterized protein n=2 Tax=Bactero... 107 5e-22 UniRef50_C9PX63 Putative uncharacterized protein n=1 Tax=Prevote... 106 6e-22 UniRef50_B0BZE5 Putative uncharacterized protein n=1 Tax=Acaryoc... 106 6e-22 UniRef50_B8HPJ4 Putative uncharacterized protein n=2 Tax=Cyanoth... 106 8e-22 UniRef50_B6V2M3 Gp2.43 n=1 Tax=Bacillus phage SPO1 RepID=B6V2M3_... 106 8e-22 UniRef50_UPI0001923977 PREDICTED: similar to predicted protein, ... 105 1e-21 UniRef50_B4VX04 Putative uncharacterized protein n=1 Tax=Microco... 105 1e-21 UniRef50_B4AZH7 Putative uncharacterized protein n=1 Tax=Cyanoth... 105 2e-21 UniRef50_B8HTR4 Putative uncharacterized protein n=1 Tax=Cyanoth... 105 2e-21 UniRef50_Q8YP57 All4343 protein n=5 Tax=Nostocaceae RepID=Q8YP57... 105 2e-21 UniRef50_A0YXN3 Putative uncharacterized protein n=1 Tax=Lyngbya... 104 4e-21 UniRef50_B0C332 Putative uncharacterized protein n=2 Tax=Bacteri... 103 5e-21 UniRef50_C6IP98 Putative uncharacterized protein n=2 Tax=Bactero... 103 5e-21 UniRef50_B7KAU9 Putative uncharacterized protein n=7 Tax=Chrooco... 103 6e-21 UniRef50_Q8YKH7 All7320 protein n=2 Tax=Cyanobacteria RepID=Q8YK... 102 9e-21 UniRef50_A7M0G9 Putative uncharacterized protein n=2 Tax=Bactero... 102 1e-20 UniRef50_A3DHF5 Ig-like, group 2 n=3 Tax=Clostridium thermocellu... 102 1e-20 UniRef50_A1HRE9 Exopolysaccharide biosynthesis protein n=1 Tax=T... 102 1e-20 UniRef50_UPI0001BC335A hypothetical protein BcroD2_01203 n=1 Tax... 102 1e-20 UniRef50_UPI0001BC7E39 hypothetical protein BacD2_08600 n=1 Tax=... 102 1e-20 UniRef50_A6TVJ8 Exopolysaccharide biosynthesis protein n=2 Tax=A... 102 1e-20 UniRef50_C9RVV6 Putative uncharacterized protein n=3 Tax=Geobaci... 101 3e-20 UniRef50_C7IFA0 Exopolysaccharide biosynthesis protein n=1 Tax=C... 101 3e-20 UniRef50_B4VYL6 Tat pathway signal sequence domain protein n=1 T... 100 3e-20 UniRef50_B3PTF7 Putative uncharacterized protein n=3 Tax=Rhizobi... 100 4e-20 UniRef50_B8I064 Exopolysaccharide biosynthesis protein n=1 Tax=C... 100 4e-20 UniRef50_A6TKB7 Exopolysaccharide biosynthesis protein n=1 Tax=A... 100 4e-20 UniRef50_B5RQG1 Uncharacterized conserved protein n=20 Tax=Borre... 100 4e-20 UniRef50_C9LSB3 Putative secreted protein n=1 Tax=Selenomonas sp... 100 6e-20 UniRef50_UPI0001C164F4 hypothetical protein CRD_01886 n=2 Tax=No... 100 7e-20 UniRef50_C9KQW2 Putative secreted protein n=2 Tax=Veillonellacea... 100 8e-20 UniRef50_A0YL57 Putative uncharacterized protein n=1 Tax=Lyngbya... 99 1e-19 UniRef50_C6LDL7 Putative uncharacterized protein n=1 Tax=Bryante... 99 1e-19 UniRef50_B4WS35 Putative uncharacterized protein n=1 Tax=Synecho... 99 2e-19 UniRef50_Q5N4C8 Putative uncharacterized protein n=2 Tax=Synecho... 99 2e-19 UniRef50_A7LVE9 Putative uncharacterized protein n=1 Tax=Bactero... 99 2e-19 UniRef50_B0CAS6 Putative uncharacterized protein n=1 Tax=Acaryoc... 99 2e-19 UniRef50_Q8RCE6 Putative uncharacterized protein n=5 Tax=Thermoa... 98 2e-19 UniRef50_B2UNL7 Putative uncharacterized protein n=1 Tax=Akkerma... 98 2e-19 UniRef50_Q8YKN4 All7259 protein n=2 Tax=Cyanobacteria RepID=Q8YK... 98 2e-19 UniRef50_A7C442 Putative uncharacterized protein n=1 Tax=Beggiat... 98 3e-19 UniRef50_UPI00016C4EC3 hypothetical protein GobsU_32169 n=1 Tax=... 98 3e-19 UniRef50_C2FS46 Putative uncharacterized protein n=2 Tax=Sphingo... 97 4e-19 UniRef50_C1ABL2 Putative uncharacterized protein n=1 Tax=Gemmati... 97 5e-19 UniRef50_B8HPB3 Putative uncharacterized protein n=1 Tax=Cyanoth... 97 5e-19 UniRef50_A0LEU6 Putative uncharacterized protein n=1 Tax=Syntrop... 97 6e-19 UniRef50_A7M0H0 Putative uncharacterized protein n=2 Tax=Bactero... 97 6e-19 UniRef50_B3QZA6 Putative uncharacterized protein n=1 Tax=Chloroh... 97 6e-19 UniRef50_UPI0001C16068 conserved hypothetical protein n=2 Tax=No... 97 7e-19 UniRef50_UPI0001744905 hypothetical protein VspiD_09365 n=1 Tax=... 97 7e-19 UniRef50_B8J2Y6 Putative uncharacterized protein n=2 Tax=Desulfo... 96 8e-19 UniRef50_D2RLV8 N-acetylglucosamine-1-phosphodiester alpha-N-ace... 96 8e-19 UniRef50_D2AUR4 Exopolysaccharide biosynthesis protein related t... 96 1e-18 UniRef50_UPI00019088BB hypothetical protein RetlC8_25680 n=2 Tax... 96 1e-18 UniRef50_UPI0001C43112 hypothetical protein BpOF4_05820 n=1 Tax=... 96 1e-18 UniRef50_UPI00017896CA metallophosphoesterase n=1 Tax=Geobacillu... 95 2e-18 UniRef50_A9V9Y5 Predicted protein n=1 Tax=Monosiga brevicollis R... 95 2e-18 UniRef50_UPI0001C3370C hypothetical protein UCYN_10670 n=1 Tax=c... 95 2e-18 UniRef50_P74396 Slr0280 protein n=1 Tax=Synechocystis sp. PCC 68... 95 2e-18 UniRef50_Q8DHU5 Tll1850 protein n=1 Tax=Thermosynechococcus elon... 94 4e-18 UniRef50_C6IEV9 Putative uncharacterized protein n=2 Tax=Bactero... 94 4e-18 UniRef50_C1A670 Putative uncharacterized protein n=1 Tax=Gemmati... 94 5e-18 UniRef50_A7HB86 Putative uncharacterized protein n=4 Tax=Anaerom... 94 5e-18 UniRef50_D1VRM0 Putative copper amine oxidase N-domain family n=... 93 9e-18 UniRef50_B4WFN8 Putative uncharacterized protein n=1 Tax=Synecho... 93 9e-18 UniRef50_A8W171 Flagellar protein FliS n=1 Tax=Bacillus seleniti... 93 1e-17 UniRef50_A4J956 Copper amine oxidase domain protein n=1 Tax=Desu... 92 2e-17 UniRef50_Q3AA51 Conserved domain protein n=1 Tax=Carboxydothermu... 92 2e-17 UniRef50_D2J8B1 Putative uncharacterized protein n=1 Tax=Staphyl... 92 2e-17 UniRef50_D2ASL7 Exopolysaccharide biosynthesis protein related t... 92 2e-17 UniRef50_C7QHR1 Putative uncharacterized protein n=1 Tax=Catenul... 92 2e-17 UniRef50_D1R528 Putative uncharacterized protein n=1 Tax=Parachl... 91 3e-17 UniRef50_B9XE16 Putative uncharacterized protein n=1 Tax=bacteri... 91 3e-17 UniRef50_UPI0001C30FBA N-acetylglucosamine-1-phosphodiester alph... 91 3e-17 UniRef50_A9NEV6 Hypothetical surface-anchored protein n=1 Tax=Ac... 91 3e-17 UniRef50_A9BJK8 Putative uncharacterized protein n=1 Tax=Petroto... 91 4e-17 UniRef50_D1B6I7 N-acetylglucosamine-1-phosphodiester alpha-N-ace... 91 5e-17 UniRef50_B8I1Q9 Ig-like, group 2 n=3 Tax=Clostridium RepID=B8I1Q... 90 8e-17 UniRef50_B0JGJ2 Putative uncharacterized protein n=2 Tax=Microcy... 90 8e-17 UniRef50_B7DMS1 Copper amine oxidase domain protein n=3 Tax=Alic... 89 9e-17 UniRef50_B7IEY1 Putative uncharacterized protein n=1 Tax=Thermos... 89 1e-16 UniRef50_A4CSS0 Putative uncharacterized protein n=1 Tax=Synecho... 89 1e-16 UniRef50_B2S1G8 Hypothetical cytosolic protein n=2 Tax=Borrelia ... 89 1e-16 UniRef50_C7LNU2 Putative uncharacterized protein n=1 Tax=Desulfo... 89 1e-16 UniRef50_B1XK15 Putative uncharacterized protein n=1 Tax=Synecho... 89 1e-16 UniRef50_B5Y710 Copper amine oxidase N-domain family n=2 Tax=Cop... 89 1e-16 UniRef50_Q9L2D5 Putative secreted protein n=2 Tax=Streptomyces R... 89 1e-16 UniRef50_A9QSN5 Exopolysaccharide biosynthesis protein n=4 Tax=L... 88 2e-16 UniRef50_A1SN25 Exopolysaccharide biosynthesis protein-like n=1 ... 88 2e-16 UniRef50_A6LP25 Putative uncharacterized protein n=1 Tax=Thermos... 88 3e-16 UniRef50_Q5ULM2 Orf92 n=1 Tax=Lactobacillus phage LP65 RepID=Q5U... 88 3e-16 UniRef50_Q01TI8 Putative uncharacterized protein n=1 Tax=Candida... 87 4e-16 UniRef50_A3DIP4 Exopolysaccharide biosynthesis protein n=3 Tax=C... 87 4e-16 UniRef50_B5W3X9 Putative uncharacterized protein n=3 Tax=Arthros... 87 4e-16 UniRef50_C9N2Q2 Metallophosphoesterase n=2 Tax=Actinomycetales R... 87 4e-16 UniRef50_Q7U4D6 Putative uncharacterized protein n=11 Tax=Cyanob... 87 5e-16 UniRef50_Q67T45 Putative uncharacterized protein n=1 Tax=Symbiob... 87 5e-16 UniRef50_Q2BF40 Putative uncharacterized protein n=1 Tax=Bacillu... 87 6e-16 UniRef50_A1VEZ3 Putative uncharacterized protein n=4 Tax=Desulfo... 87 7e-16 UniRef50_A5D3T7 Hypothetical membrane protein n=1 Tax=Pelotomacu... 86 9e-16 UniRef50_A5D3R0 Hypothetical membrane protein n=1 Tax=Pelotomacu... 86 1e-15 UniRef50_B5YE82 Putative uncharacterized protein n=2 Tax=Dictyog... 86 1e-15 UniRef50_A4FD37 Secreted protein n=1 Tax=Saccharopolyspora eryth... 86 1e-15 UniRef50_B1HN11 Putative uncharacterized protein n=2 Tax=Bacilla... 86 1e-15 UniRef50_C8X0Z8 Putative uncharacterized protein n=1 Tax=Desulfo... 86 2e-15 UniRef50_Q4ZC55 ORF005 n=1 Tax=Staphylococcus phage EW RepID=Q4Z... 85 2e-15 UniRef50_A5GW09 Putative uncharacterized protein SynRCC307_2165 ... 85 2e-15 UniRef50_A4XD34 Putative uncharacterized protein n=1 Tax=Salinis... 85 3e-15 UniRef50_B1X2V5 Putative uncharacterized protein n=2 Tax=Cyanoth... 84 3e-15 UniRef50_A7SGX9 Predicted protein (Fragment) n=2 Tax=Nematostell... 84 4e-15 UniRef50_UPI0001746B2F hypothetical protein VspiD_16055 n=1 Tax=... 84 4e-15 UniRef50_Q2JUI0 Conserved domain protein n=2 Tax=Synechococcus R... 84 5e-15 UniRef50_A5ILT0 Putative uncharacterized protein n=6 Tax=Thermot... 84 6e-15 UniRef50_C9RD84 Copper amine oxidase domain protein n=1 Tax=Ammo... 83 1e-14 UniRef50_C0Z816 Putative uncharacterized protein n=1 Tax=Breviba... 82 1e-14 UniRef50_A6TUG6 Copper amine oxidase domain protein n=1 Tax=Alka... 82 2e-14 UniRef50_C5C0E0 Metallophosphoesterase n=1 Tax=Beutenbergia cave... 82 2e-14 UniRef50_Q03K73 Exopolysaccharide biosynthesis protein related t... 81 3e-14 UniRef50_C9M6C8 Putative uncharacterized protein n=1 Tax=Jonquet... 81 3e-14 UniRef50_UPI0001C31921 Collagen triple helix repeat protein n=2 ... 81 4e-14 UniRef50_Q1MS76 Putative uncharacterized protein LI0093 n=1 Tax=... 81 4e-14 UniRef50_A4XGY7 Putative uncharacterized protein n=2 Tax=Clostri... 81 4e-14 UniRef50_C7QCB3 Putative uncharacterized protein n=1 Tax=Catenul... 81 5e-14 UniRef50_Q30YC1 Putative uncharacterized protein n=1 Tax=Desulfo... 81 5e-14 UniRef50_A7HN47 Putative uncharacterized protein n=1 Tax=Fervido... 81 5e-14 UniRef50_B7KAR9 Polysaccharide deacetylase n=3 Tax=Cyanothece Re... 80 7e-14 UniRef50_C6WLB3 Metallophosphoesterase n=1 Tax=Actinosynnema mir... 79 1e-13 UniRef50_A4FAG7 Secreted protein n=5 Tax=Actinomycetales RepID=A... 79 1e-13 UniRef50_B8FVQ0 Ig-like, group 2 n=2 Tax=Desulfitobacterium hafn... 79 1e-13 UniRef50_C1TLP7 Sporulation related-protein with S-layer-like do... 79 2e-13 UniRef50_B4WHW3 Putative uncharacterized protein n=1 Tax=Synecho... 79 2e-13 UniRef50_UPI0001744904 hypothetical protein VspiD_09360 n=1 Tax=... 78 2e-13 UniRef50_A3YXL4 Putative uncharacterized protein n=2 Tax=Chrooco... 78 2e-13 UniRef50_C6D5A3 Copper amine oxidase domain protein n=1 Tax=Paen... 78 3e-13 UniRef50_D2PZR6 Sporulation domain protein n=4 Tax=Actinomycetal... 77 4e-13 UniRef50_A3TM75 Putative uncharacterized protein n=1 Tax=Janibac... 77 4e-13 UniRef50_D1Y6Q3 Putative liporotein n=1 Tax=Pyramidobacter pisco... 77 5e-13 UniRef50_D2PYC0 Metallophosphoesterase n=1 Tax=Kribbella flavida... 77 6e-13 UniRef50_C1YVW0 Putative uncharacterized protein n=1 Tax=Nocardi... 76 9e-13 UniRef50_C3YJA0 Putative uncharacterized protein n=3 Tax=Branchi... 76 1e-12 UniRef50_Q0AWB0 Putative uncharacterized protein n=1 Tax=Syntrop... 76 1e-12 UniRef50_C5CET4 Putative uncharacterized protein n=1 Tax=Kosmoto... 76 1e-12 UniRef50_A4FAL4 Putative uncharacterized protein n=2 Tax=Actinom... 75 2e-12 UniRef50_C0AEZ6 Putative uncharacterized protein n=1 Tax=Opituta... 75 2e-12 UniRef50_B2A2E0 Copper amine oxidase domain protein n=1 Tax=Natr... 75 3e-12 UniRef50_C8VW07 S-layer domain protein n=1 Tax=Desulfotomaculum ... 75 3e-12 UniRef50_A6NQQ4 Putative uncharacterized protein n=1 Tax=Bactero... 74 4e-12 UniRef50_Q7NGC8 Glr3243 protein n=1 Tax=Gloeobacter violaceus Re... 74 4e-12 UniRef50_B8HP94 Polysaccharide deacetylase n=1 Tax=Cyanothece sp... 74 4e-12 UniRef50_C6IV65 Putative uncharacterized protein n=1 Tax=Paeniba... 74 5e-12 UniRef50_D1VTW3 Copper amine oxidase N-domain superfamily n=1 Ta... 74 5e-12 UniRef50_C6J2I2 Copper amine oxidase domain-containing protein n... 73 8e-12 UniRef50_UPI00017890C7 copper amine oxidase domain protein n=1 T... 72 2e-11 UniRef50_C9PT69 Putative uncharacterized protein n=1 Tax=Prevote... 72 3e-11 UniRef50_A9GRW8 Putative uncharacterized protein n=1 Tax=Sorangi... 71 3e-11 UniRef50_D2PRV8 Metallophosphoesterase n=1 Tax=Kribbella flavida... 71 3e-11 UniRef50_A8F5X1 Putative uncharacterized protein n=1 Tax=Thermot... 70 9e-11 UniRef50_C7PW43 Ig domain protein group 2 domain protein n=2 Tax... 69 1e-10 UniRef50_C4DE18 Putative uncharacterized protein n=1 Tax=Stackeb... 69 2e-10 UniRef50_Q7NIQ9 Gll2123 protein n=1 Tax=Gloeobacter violaceus Re... 68 3e-10 UniRef50_A3P9C8 Putative lipoprotein n=32 Tax=pseudomallei group... 67 5e-10 UniRef50_A6WEB7 Putative uncharacterized protein n=1 Tax=Kineoco... 67 6e-10 UniRef50_A6G841 Putative uncharacterized protein n=1 Tax=Plesioc... 67 7e-10 UniRef50_UPI00016BFF19 Ig-like, group 2 n=1 Tax=Epulopiscium sp.... 67 8e-10 UniRef50_Q72HQ9 Putative uncharacterized protein n=4 Tax=Thermac... 65 3e-09 UniRef50_Q826N8 Putative secreted protein n=1 Tax=Streptomyces a... 64 3e-09 UniRef50_C1XUX9 Putative uncharacterized protein n=1 Tax=Meiothe... 63 7e-09 UniRef50_A9EQ62 Putative uncharacterized protein n=1 Tax=Sorangi... 62 2e-08 UniRef50_C7LY43 Putative uncharacterized protein n=1 Tax=Acidimi... 61 4e-08 UniRef50_B8CD22 Predicted protein n=1 Tax=Thalassiosira pseudona... 59 1e-07 UniRef50_C0DAA9 Putative uncharacterized protein n=1 Tax=Clostri... 59 1e-07 UniRef50_A4FIV8 Secreted protein n=1 Tax=Saccharopolyspora eryth... 54 3e-06 UniRef50_UPI00016A4F20 hypothetical protein BthaT_13010 n=4 Tax=... 53 1e-05 UniRef50_A7MD65 Zgc:165534 protein n=3 Tax=Clupeocephala RepID=A... 48 2e-04 Sequences not found previously or not previously below threshold: UniRef50_B0JW05 Polysaccharide deacetylase family protein n=4 Ta... 65 2e-09 UniRef50_UPI0001C1628D hypothetical protein CRC_02750 n=2 Tax=No... 65 3e-09 UniRef50_Q2JPV6 Polysaccharide deacetylase family protein n=2 Ta... 64 3e-09 UniRef50_A9QSK3 Polysaccharide biosynthesis protein n=7 Tax=Stre... 56 9e-07 UniRef50_Q119M8 Putative uncharacterized protein n=1 Tax=Trichod... 55 2e-06 UniRef50_Q8DHE7 Tlr2012 protein n=1 Tax=Thermosynechococcus elon... 54 5e-06 UniRef50_A5N8M7 Predicted regulatory protein n=2 Tax=Clostridium... 50 5e-05 UniRef50_A9V0B9 Predicted protein n=1 Tax=Monosiga brevicollis R... 50 7e-05 UniRef50_Q92JI8 Uncharacterized protein RC0079 n=11 Tax=Ricketts... 49 2e-04 UniRef50_C6D289 Putative uncharacterized protein n=1 Tax=Paeniba... 48 3e-04 UniRef50_Q1IXC2 Putative uncharacterized protein n=3 Tax=Deinoco... 48 3e-04 UniRef50_B2HMV0 Lipoprotein LprO n=21 Tax=Mycobacterium RepID=B2... 48 3e-04 UniRef50_Q8YTL3 All2704 protein n=4 Tax=Nostocaceae RepID=Q8YTL3... 48 3e-04 UniRef50_C2LSG0 Putative uncharacterized protein n=1 Tax=Strepto... 47 5e-04 UniRef50_D2W0I7 Predicted protein n=1 Tax=Naegleria gruberi RepI... 45 0.002 UniRef50_A0YKD9 Putative uncharacterized protein n=1 Tax=Lyngbya... 44 0.004 UniRef50_UPI00019038D8 hypothetical protein Retl8_15906 n=1 Tax=... 44 0.006 UniRef50_C5KB48 Putative uncharacterized protein n=1 Tax=Perkins... 44 0.006 UniRef50_C7Q9L8 Putative uncharacterized protein n=1 Tax=Catenul... 43 0.012 UniRef50_C4ICA7 Putative uncharacterized protein n=1 Tax=Clostri... 43 0.012 UniRef50_A9GVQ9 Putative uncharacterized protein n=1 Tax=Sorangi... 42 0.018 UniRef50_Q9RZG9 Putative uncharacterized protein n=1 Tax=Deinoco... 42 0.020 UniRef50_C3QEU3 Predicted protein n=5 Tax=Bacteroides RepID=C3QE... 40 0.050 UniRef50_C7M125 Putative uncharacterized protein n=1 Tax=Acidimi... 40 0.073 >UniRef50_P27840 Uncharacterized protein yigE n=68 Tax=Enterobacteriaceae RepID=YIGE_ECOLI Length = 254 Score = 232 bits (592), Expect = 7e-60, Method: Composition-based stats. Identities = 254/254 (100%), Positives = 254/254 (100%) Query: 1 MAHQLLIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMY 60 MAHQLLIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMY Sbjct: 1 MAHQLLIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMY 60 Query: 61 WQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGE 120 WQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGE Sbjct: 61 WQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGE 120 Query: 121 GNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSK 180 GNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSK Sbjct: 121 GNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSK 180 Query: 181 IRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQ 240 IRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQ Sbjct: 181 IRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQ 240 Query: 241 RYPFVTMISVERKG 254 RYPFVTMISVERKG Sbjct: 241 RYPFVTMISVERKG 254 >UniRef50_B2II06 Putative uncharacterized protein n=5 Tax=Rhizobiales RepID=B2II06_BEII9 Length = 269 Score = 223 bits (569), Expect = 3e-57, Method: Composition-based stats. Identities = 72/245 (29%), Positives = 126/245 (51%), Gaps = 2/245 (0%) Query: 11 MITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWG 70 + T L R+FL L L A A L++ + + + ++++WQ+ G+ +G Sbjct: 17 IFTKLLMRVFLPLFLSAGTAWAEPCLPLTEEGINYVVCRFDTKRSDLRLFWQQPGGQPYG 76 Query: 71 TLHALLADIN-SQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGG 129 L A + ++ AMN G++ E +P+GLYI+ G+ N+ +G GNF ++P G Sbjct: 77 GFAPLRAQLQPKGETLEFAMNAGMFQEDLSPVGLYIQEGRLLHPANMRNGPGNFHMKPNG 136 Query: 130 VFYVAGDKVGIVRLDAF-KTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGIN 188 +FY + G++ F ++ + +A QSGP+L+ N ++P+I P S KIRNGVG+ Sbjct: 137 IFYFSQTSAGVMETGRFLQSGLKPDYATQSGPLLVANNQLHPKIEPTGTSEKIRNGVGVR 196 Query: 189 KHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMI 248 + +F +S+ F+ FA + +L+ L+LDG+IS +Y Q P ++ Sbjct: 197 DNHEVIFAISEAPVTFFRFARLFRDRLHCPDALFLDGSISSLYAPSLNRDDQWRPIGPIV 256 Query: 249 SVERK 253 K Sbjct: 257 GAVSK 261 >UniRef50_Q98NI9 Mlr0120 protein n=3 Tax=Alphaproteobacteria RepID=Q98NI9_RHILO Length = 263 Score = 206 bits (523), Expect = 7e-52, Method: Composition-based stats. Identities = 72/247 (29%), Positives = 122/247 (49%), Gaps = 3/247 (1%) Query: 10 GMITLNLKRIFLALTLLPLFAVA-ADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEA 68 G + L + + + V+ + + V+P+ ++++W+ G+ Sbjct: 15 GAVKAALPQAVASTMAFSQWFVSLPPCRDFAFEATSYLICEVDPKLYSIELFWKDPVGKP 74 Query: 69 WGTLHALLADINSQGQV-QMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRP 127 + +LH L A + G+ A+N G+Y P+GLY+E G++ + SG GNF ++P Sbjct: 75 FQSLHNLDAAQRAAGRTMLFAINAGMYHPDLRPVGLYVERGREMAGVRTGSGSGNFSLQP 134 Query: 128 GGVFYVAGDKVGIVRLDAF-KTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVG 186 G+FY++G K + F + +A QSGPML+ +G ++P+ + S K R+GVG Sbjct: 135 NGIFYISGGKAAVRATRDFVRKRPSTDYATQSGPMLVIDGQLHPKFQSDGTSRKTRDGVG 194 Query: 187 INKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVT 246 + K G AVF +S NF+ FA + L + L+LDGTIS ++ + Sbjct: 195 VRKDGVAVFAISNGTVNFHTFARLFRDALGCDNALFLDGTISSLFAPAIGRNDDYWNLGP 254 Query: 247 MISVERK 253 MI V RK Sbjct: 255 MIGVFRK 261 >UniRef50_B9JX75 Putative uncharacterized protein n=1 Tax=Agrobacterium vitis S4 RepID=B9JX75_AGRVS Length = 274 Score = 203 bits (517), Expect = 4e-51, Method: Composition-based stats. Identities = 80/246 (32%), Positives = 130/246 (52%), Gaps = 4/246 (1%) Query: 12 ITLNLKRIFLALTLLPLFAVAAD--DCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAW 69 I + L I L + P A A + ++ + +P T ++++ + A+G+ + Sbjct: 26 IVVWLFAILSPLVISPERAEAEEQSCRDQTENGFAYRVCRFDPATRTIRIFNRNADGDVY 85 Query: 70 GTLHALLADINSQG-QVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPG 128 G AL + + Q + A+NGG+Y +P+GL+++ G + A G GNF+++P Sbjct: 86 GGFEALRSQLWQQRLILTFAVNGGMYHSDLSPVGLFVDYGMTRKTAETADGWGNFYLKPN 145 Query: 129 GVFYVAGDKVGIVRLDAFKT-SKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGI 187 GVF++ G++ F+T E FA QSGPML+ +GV++P+ P S KIRNGVGI Sbjct: 146 GVFFLKDGHAGVLETGQFETQKIEADFATQSGPMLVIDGVLHPKFLPTSDSLKIRNGVGI 205 Query: 188 NKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTM 247 + G VF+LS+ FYD A + + +L LYLDGTIS + + YP + Sbjct: 206 DASGQVVFVLSKDPVRFYDMAAFFRDRLGAANALYLDGTISSLAEPMAGRIDRAYPLGPI 265 Query: 248 ISVERK 253 I+V + Sbjct: 266 IAVVDQ 271 >UniRef50_A9W4Y6 Putative uncharacterized protein n=4 Tax=Methylobacterium extorquens group RepID=A9W4Y6_METEP Length = 258 Score = 203 bits (516), Expect = 5e-51, Method: Composition-based stats. Identities = 76/237 (32%), Positives = 123/237 (51%), Gaps = 4/237 (1%) Query: 19 IFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLAD 78 + + P A A+ TV+ + ERV+++W +G +G+L +L Sbjct: 24 APVPVQAQPAPAAKGPCQAVEFEGQPYTVCTVDLRRERVRLFWLGTDGLPYGSLSSL--A 81 Query: 79 INSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKV 138 ++ AMN G+YD+ AP+GLY+E+G++ + A+G GNF ++P GVFYV GD+ Sbjct: 82 DRQGPRLSFAMNAGMYDKGQAPVGLYVEDGRELKGASTANGPGNFHLKPNGVFYVKGDRA 141 Query: 139 GIVRLDAF-KTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNA-VFL 196 G++ + + FA QSGPML+ +G I+P+I + S KIRNGVG+ G+ VF Sbjct: 142 GVLDTGRYLRAKPAPDFATQSGPMLVIDGKIHPKISADGPSQKIRNGVGVRDGGHVAVFA 201 Query: 197 LSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVERK 253 +S++ F FA K L+LDG++S +Y G P ++ + Sbjct: 202 ISERPVTFGAFARLFKDSFGCRNALFLDGSVSSLYAPGLGRSDLSRPLGPLVGAVGR 258 >UniRef50_D2LB58 Putative uncharacterized protein n=1 Tax=Rhodomicrobium vannielii ATCC 17100 RepID=D2LB58_RHOVA Length = 247 Score = 189 bits (481), Expect = 6e-47, Method: Composition-based stats. Identities = 70/239 (29%), Positives = 113/239 (47%), Gaps = 4/239 (1%) Query: 19 IFLALTLLPLFAVA--ADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALL 76 F+A+ + A + + V+++WQK +G + L AL Sbjct: 6 AFIAMAAFCGSSEAAAQTCKPYAFEGNGYTLCEASLDRFAVRLFWQKPDGGPYTYLSALP 65 Query: 77 ADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGD 136 G++ A+NGG++ Y P+GL++ENG++ V N G GNF +RP G+FY Sbjct: 66 KTDERGGRLAFALNGGMFHPDYKPVGLHVENGRELVRANTRPGPGNFHLRPNGIFYFGEA 125 Query: 137 KVGIVRLDAF-KTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVF 195 + G++ AF K + FA QSGPML+ +G ++PRI S+K R+GV + + VF Sbjct: 126 EAGVMETGAFLKKKPKANFATQSGPMLVIDGKLHPRIAKANVSAKPRDGVCVRGDKSVVF 185 Query: 196 LLSQQATNFYDFACYAKAKLNVEQLLYLDGTIS-HMYMKGGAIPWQRYPFVTMISVERK 253 +S F F + L L+LDG + +++ G + MI+V K Sbjct: 186 AISDGGVPFDTFMRLFRDGLKCRNALFLDGGTAPALFVPGTRSGNVLFGLGPMIAVYEK 244 >UniRef50_Q0BWY7 Putative lipoprotein n=1 Tax=Hyphomonas neptunium ATCC 15444 RepID=Q0BWY7_HYPNA Length = 249 Score = 188 bits (478), Expect = 1e-46, Method: Composition-based stats. Identities = 66/237 (27%), Positives = 111/237 (46%), Gaps = 5/237 (2%) Query: 22 ALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADIN- 80 A+ S L + + + ++++ + G +G L + Sbjct: 13 AILSACNEVEEGPCQTRSFENLPYLVCSFDASQDTIRLFLRDETGVPFGQFDRLANHVAS 72 Query: 81 SQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGI 140 G + AMN G+Y + P+GLYIE G+ ++ L + G GNF + P GVF++ K G+ Sbjct: 73 KGGNLVFAMNAGMYHDDRRPVGLYIEEGEAEMNLVRSPGPGNFGMLPNGVFWIDAGKAGV 132 Query: 141 VRLDAFKT---SKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGN-AVFL 196 AF +FA QSGPML+ +G ++P ++P+ S + RNGVG+++ G F+ Sbjct: 133 SETLAFDERFKETPPRFATQSGPMLVIDGALHPALNPDGTSLRRRNGVGVSEDGRQVYFV 192 Query: 197 LSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVERK 253 +S NF+ FA + +L LYLDG +S Y+ ++ V R+ Sbjct: 193 ISDVPVNFHSFARLFRDELGTPNALYLDGAVSKAYVPALERSETGLDMGPIVGVIRE 249 >UniRef50_A9CIN9 Putative uncharacterized protein n=6 Tax=Proteobacteria RepID=A9CIN9_AGRT5 Length = 254 Score = 186 bits (472), Expect = 6e-46, Method: Composition-based stats. Identities = 69/222 (31%), Positives = 117/222 (52%), Gaps = 3/222 (1%) Query: 35 DCALSDPTLTVQAYTVNPQTERVKMYWQK-ANGEAWGTLHALLADINSQGQ-VQMAMNGG 92 +++ + +P +++Y Q +G+ + L + + Q AMNGG Sbjct: 30 CKSINHAGGRYTVCSFDPAKNTIRIYDQDHVSGQGYRNFADLSSALWRQHMFSVFAMNGG 89 Query: 93 IYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAF-KTSKE 151 +Y Y+P+GL++ENG ++ ++ G GNF + P GVFY+ G+ G++ +A+ + Sbjct: 90 MYHSDYSPVGLFVENGVERSPVSTRGGWGNFHLLPNGVFYLDGNTAGVLETEAYLAADPK 149 Query: 152 IQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYA 211 FA QSGPML+ +G ++PR P+ S K RNGVG+++ G F +S+ FYDF Sbjct: 150 PDFATQSGPMLVIDGKLHPRFLPDSDSLKRRNGVGVSRDGMVHFAISETTVRFYDFGTLF 209 Query: 212 KAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVERK 253 + L+ LYLDGTIS + + Q + +I+V + Sbjct: 210 RDVLDAPNALYLDGTISSVDIPAMNRRDQLFSMGPIIAVVDR 251 >UniRef50_C8PYM1 Putative uncharacterized protein n=1 Tax=Enhydrobacter aerosaccus SK60 RepID=C8PYM1_9GAMM Length = 271 Score = 182 bits (462), Expect = 8e-45, Method: Composition-based stats. Identities = 71/264 (26%), Positives = 107/264 (40%), Gaps = 18/264 (6%) Query: 7 IGKGMITLNLKRIFLALTLLPLFAVAA-----DDCALSDPTLTVQAYTVNPQTE-RVKMY 60 I K + + L L A DC ++ + ++ Sbjct: 6 ITKTQAVVVSLCLALLLVASIGLARQFTVKTMPDCQRKSQPFDYSICELDAKNAANFSLH 65 Query: 61 WQKANGE------AWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVAL 114 WQ + + TL L + AMN G+YD ++AP+G + NG+Q AL Sbjct: 66 WQNPSSASHPLLLTFTTLRDYLVSEQPAKTLLFAMNAGMYDSNFAPIGYTVINGKQIRAL 125 Query: 115 NLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTS----KEIQFAVQSGPMLMENGVINP 170 NL G GNF + P GVF+ I + + FA QSGPML+ +G I+P Sbjct: 126 NLKQGGGNFHLMPNGVFWQDRQGFYITESQSMAKKLASGAKPTFATQSGPMLVIDGNIHP 185 Query: 171 RIHPNVASSKIRNGVGINKHG--NAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTIS 228 N S K RNG+G+ H F++S +FY+FA K++L + L+LDG + Sbjct: 186 AFDANSTSRKYRNGIGVCGHNPSRVKFVISDTPVSFYEFADLFKSQLGCDNALFLDGGSA 245 Query: 229 HMYMKGGAIPWQRYPFVTMISVER 252 MI+V + Sbjct: 246 SALYSQTLSRNDNKYMGVMIAVTQ 269 >UniRef50_Q26CZ6 Periplasmic protein n=1 Tax=Flavobacteria bacterium BBFL7 RepID=Q26CZ6_9BACT Length = 241 Score = 182 bits (462), Expect = 9e-45, Method: Composition-based stats. Identities = 67/222 (30%), Positives = 108/222 (48%), Gaps = 6/222 (2%) Query: 34 DDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADI-NSQGQVQMAMNGG 92 D + D ++ ++ +++++YW + + T L + ++ AMN G Sbjct: 23 QDLIIKDDRFHIKV--IDLTKQKLQLYWLDQDNKPIETFEQLNMHVKQQDKRLVYAMNAG 80 Query: 93 IYDESYAPLGLYIENGQQKVALNLAS-GEGNFFIRPGGVFYVA-GDKVGIVRLDAFKTSK 150 +Y + ++P GLYIENG L+ + G GNF+++P GVFY+ K + Sbjct: 81 MYLKDHSPQGLYIENGTIHKQLDTVTVGYGNFYLQPNGVFYLTQDGKAQVTATPQLSNFS 140 Query: 151 EIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACY 210 I +A QSGPML+ N I+P + + IRN VGI G + +S++ NFYDFA + Sbjct: 141 NITYATQSGPMLVINDTIHPAFNKGSKNVHIRNAVGILPDGRILLAISKEKINFYDFATF 200 Query: 211 AKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVER 252 K + + LYLDG +S +Y + F MI V Sbjct: 201 FKNQ-GCKNALYLDGFVSRIYDPTINVEQMDGHFGVMIGVSD 241 >UniRef50_Q1MEZ5 Conserved hypothetical exported protein n=45 Tax=Rhizobiales RepID=Q1MEZ5_RHIL3 Length = 258 Score = 181 bits (460), Expect = 1e-44, Method: Composition-based stats. Identities = 63/255 (24%), Positives = 110/255 (43%), Gaps = 14/255 (5%) Query: 12 ITLNLKRIFLALTLLPLFAVAADDCALSDPTLT---VQAYTVNPQTERVKMYWQKANGEA 68 + ++ + LT A A + T+ P ++++W+ A+G Sbjct: 4 LKHSVLAAAIMLTATMTSLDQAHAQACEQESFEEAKYVVCTLEPGKADLRLFWKNADGAP 63 Query: 69 WGTLHALLADINSQGQ-VQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEG------ 121 + +L + ++G+ + A+N G+Y ++P+GLY+ENG++ N E Sbjct: 64 YRAFSSLAEAVRAEGRTLAFAVNAGMYRADFSPMGLYVENGRELNPANTTEAESSSGQVP 123 Query: 122 NFFIRPGGVFYVAGDKVGIVRLDAF-KTSKEIQFAVQSGPMLMENGVINPRIHPNVASSK 180 NF+ +P GVF++ GI+ D F K + +FA QSGPML+ +NP Sbjct: 124 NFYKKPNGVFFLGETGAGILPTDEFLKRRPKARFATQSGPMLVIANKLNPIFIVGSTDRT 183 Query: 181 IRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGT-ISHMYMKGGAIPW 239 R+GVG + G F +S+ NF+DFA + L L+LDG +Y Sbjct: 184 RRSGVGTCERGAVRFAISEDRVNFHDFARLFRDHLKCPDALFLDGGRGVGLYNPDMGHND 243 Query: 240 --QRYPFVTMISVER 252 + + + Sbjct: 244 WSWHGGYGPIFGLVE 258 >UniRef50_Q11X50 Putative uncharacterized protein n=2 Tax=Bacteroidetes RepID=Q11X50_CYTH3 Length = 244 Score = 180 bits (457), Expect = 4e-44, Method: Composition-based stats. Identities = 81/218 (37%), Positives = 122/218 (55%), Gaps = 3/218 (1%) Query: 38 LSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADI-NSQGQVQMAMNGGIYDE 96 T+ V +YTV+PQ + ++ YW+ NGE ++ L A + + + A NGG+Y E Sbjct: 24 QQQDTIDVISYTVDPQKDNLQFYWKNDNGEILKSIKKLKAYVESKGSTLLFATNGGMYKE 83 Query: 97 SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVA-GDKVGIVRLDAFKTSKEIQFA 155 +PLGL+I+NG+ LN A G+GNF+++P GVFY+ ++ I + + F + I+FA Sbjct: 84 DRSPLGLFIQNGKTVTPLNKAKGQGNFYMQPNGVFYITNDNEAVICKTEDFINNGNIKFA 143 Query: 156 VQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKL 215 QSGPM++ N I+P + IRNGVGI + +F +S++ NF+DFA Y + L Sbjct: 144 TQSGPMIIVNNQIHPSFIKGSKNLNIRNGVGILPNKKIIFAMSEKEVNFFDFALYFQN-L 202 Query: 216 NVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVERK 253 E LYLDG +S Y+ F MI V K Sbjct: 203 GCENALYLDGFVSRSYLLEKKWLQTDGEFGVMIGVTEK 240 >UniRef50_Q093S1 Putative uncharacterized protein n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q093S1_STIAU Length = 278 Score = 179 bits (455), Expect = 5e-44, Method: Composition-based stats. Identities = 70/257 (27%), Positives = 122/257 (47%), Gaps = 9/257 (3%) Query: 5 LLIGKGMITLNLKRIFLALTLLPLFAVAADD-----CALSDPTLTVQAYTVNPQTERVKM 59 LLIG G+ T + T ++ ++ T Y V+ +++ Sbjct: 19 LLIGSGLGTGATHLLAAPHTPAATRSLQTPTGRVAARRIAYRGNTYDTYEVDLTQSKLRF 78 Query: 60 YWQKANGEAWGTLHALLADIN-SQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLAS 118 Y+Q+ +G + +L L + ++ A N G++ + P+GLY+E+G++ V LN Sbjct: 79 YFQQPDGTPFSSLGNLRGWLQGRGKRLVFATNAGMFTPARRPVGLYVEDGREFVGLNTQE 138 Query: 119 GEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSK--EIQFAVQSGPMLMENGVINPRIHPNV 176 GNFF++P VF+V GI+ A+ ++ +A QSGP L+ +G ++P Sbjct: 139 EAGNFFLKPNAVFFVTETGAGILESSAYAAHPPAKVLYATQSGPALLLHGQMHPAFREGS 198 Query: 177 ASSKIR-NGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGG 235 + R +GVGI VF ++QQA N ++FA + + + + LYLDG +S MY+ Sbjct: 199 RNLSPRRSGVGIVTPTRVVFAMTQQAVNLHEFASFFRDQFGCQDALYLDGVVSRMYLPAL 258 Query: 236 AIPWQRYPFVTMISVER 252 F MI++ Sbjct: 259 GRDELDGDFGAMIAISE 275 >UniRef50_Q1IX28 Periplasmic YigE-like protein n=3 Tax=Deinococcus RepID=Q1IX28_DEIGD Length = 317 Score = 178 bits (452), Expect = 1e-43, Method: Composition-based stats. Identities = 74/234 (31%), Positives = 122/234 (52%), Gaps = 9/234 (3%) Query: 3 HQLLIGKGMITLNLKRIFLALTLLPLFAVAADD----CALSDPTLTVQAYTVNPQTERVK 58 H L + N+ RIF+ LLPL A + ++ + V+ + + ++ Sbjct: 65 HHRLDMLSVRFPNVLRIFV--LLLPLTACSQAGGLDVRRVTAEGMLYTVAAVDLKRDHLR 122 Query: 59 MYWQKAN-GEAWGTLHALLADINSQG-QVQMAMNGGIYDESYAPLGLYIENGQQKVALNL 116 ++W+ G+ + T + A + G QV A N GIY PLGL++E G+ + LN Sbjct: 123 LHWKNPATGQPYRTFAEVSARLRKDGEQVLFATNSGIYGPGLEPLGLHVEEGRTLIGLNN 182 Query: 117 ASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKT-SKEIQFAVQSGPMLMENGVINPRIHPN 175 A GNF + P GVF+V G++ G+ A++ + + FA QSGP+L++ G ++P + Sbjct: 183 ARSGGNFALLPNGVFWVKGNQAGVTETQAYRRLNIQPTFATQSGPLLVQGGRLHPAFNKG 242 Query: 176 VASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISH 229 +S K+R+GVG+ + G F +S NF+ FA + + L LYLDG+IS Sbjct: 243 SSSFKVRSGVGVCRDGRVRFAVSAGPVNFHSFAVFFRDVLGCPDALYLDGSISA 296 >UniRef50_A9D6B9 Putative uncharacterized protein n=1 Tax=Hoeflea phototrophica DFL-43 RepID=A9D6B9_9RHIZ Length = 286 Score = 177 bits (450), Expect = 2e-43, Method: Composition-based stats. Identities = 66/234 (28%), Positives = 113/234 (48%), Gaps = 6/234 (2%) Query: 26 LPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQ--- 82 + T++PQT +++ ++ G+ G++ A++ + + Sbjct: 47 MTKPDWPEGCVEQVFEGARAILCTIDPQTHDMRLVYRDRMGDVLGSVSAVVDQLAAGAGT 106 Query: 83 -GQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYV-AGDKVGI 140 ++ +AMN G+Y +P+GLY+EN + ALN G GNFF++P GVF+V G+ Sbjct: 107 DHKLVLAMNAGMYHADMSPVGLYVENSVEIAALNRDDGFGNFFLKPNGVFFVLKDGNAGV 166 Query: 141 VRLDAFK-TSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQ 199 + DA+ ++A QSGPML+ +GVI+PR P+ S IRNGVG+ G VF +++ Sbjct: 167 LETDAYAEADLSPEYATQSGPMLVIDGVIHPRFLPDGTSKFIRNGVGVRPDGKVVFAITR 226 Query: 200 QATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVERK 253 + FA + E L+ DG +S + + P + V + Sbjct: 227 DRVSLGSFARLFRDVAGCENALFFDGAVSSLALGSKMEIDSEEPAGPVAVVVAR 280 >UniRef50_C5CWT4 Periplasmic protein-like protein n=3 Tax=Bacteria RepID=C5CWT4_VARPS Length = 238 Score = 176 bits (446), Expect = 7e-43, Method: Composition-based stats. Identities = 67/210 (31%), Positives = 118/210 (56%), Gaps = 4/210 (1%) Query: 44 TVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQ-VQMAMNGGIYDESYAPLG 102 ++ + ER++++ +G + L A + ++ + + AMN G+Y ++P+G Sbjct: 27 RYTVVKIDVRRERLELFLHDDSGAPFKRFDRLEAWLAARNRQLVFAMNAGMYHADFSPVG 86 Query: 103 LYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFK--TSKEIQFAVQSGP 160 L ++ G+++ LNLA+G GNFF++P GVF V+ +V + + ++ A QSGP Sbjct: 87 LLVQEGREEAPLNLAAGAGNFFLKPNGVFLVSDAGPRVVESSEYAALPKEGVRLATQSGP 146 Query: 161 MLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQL 220 +L+ GV++P P+ S KIRNGVG++ H A+F++S+Q NFY+FA Y + L+ Sbjct: 147 LLLRRGVVHPAFIPDSDSRKIRNGVGVSGH-TAIFVISEQPVNFYEFALYFRDVLHCRDA 205 Query: 221 LYLDGTISHMYMKGGAIPWQRYPFVTMISV 250 LYLDGT+S ++ ++ V Sbjct: 206 LYLDGTVSALHSLALRRSDFTRELGPILGV 235 >UniRef50_D0SJ31 Putative uncharacterized protein n=1 Tax=Acinetobacter junii SH205 RepID=D0SJ31_ACIJU Length = 252 Score = 176 bits (445), Expect = 1e-42, Method: Composition-based stats. Identities = 67/234 (28%), Positives = 122/234 (52%), Gaps = 4/234 (1%) Query: 22 ALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKAN-GEAWGTLHALLADIN 80 + A + ++ + + V+ ++++ + G+ + + +D+ Sbjct: 16 CMVFQATTVFAFEYQSIKFEDVQFEVIKVDDLK-DLQLFLKNPRIGDFYQKFSNIQSDLA 74 Query: 81 SQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGI 140 + +++ AMN G+Y ++ P+GLYIE ++ LN ++G GNFF++P GV I Sbjct: 75 ACKELRFAMNAGMYHPNFEPVGLYIEKKKKLSELNESTGFGNFFMQPNGVVVWNDHGAVI 134 Query: 141 VRLDAFK-TSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQ 199 +K + FA QSGPML+ G+IN + + S KIRNGVG+ + F++S+ Sbjct: 135 HSTADYKRANFTANFATQSGPMLVHKGLINSQFIKDSNSLKIRNGVGVRDD-HLYFVISE 193 Query: 200 QATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVERK 253 Q NFY FA + K +L V++ LYLDG+IS +Y+K ++Y ++ + + Sbjct: 194 QRINFYQFAKFFKHQLRVDEALYLDGSISSLYLKDIQRNDRKYNLGPIVGLTHQ 247 >UniRef50_B9KP42 Periplasmic protein-like protein n=37 Tax=Rhodobacterales RepID=B9KP42_RHOSK Length = 245 Score = 175 bits (444), Expect = 1e-42, Method: Composition-based stats. Identities = 64/242 (26%), Positives = 109/242 (45%), Gaps = 8/242 (3%) Query: 17 KRIFLALTLLPLF-----AVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGT 71 R LA L L+ L+ + ++++ +G +G+ Sbjct: 1 MRTRLAAILFALWPAACATAEPACRDLTFEGTRYSLCEA-QAGDDIRIFQTAPDGRPYGS 59 Query: 72 LHALLADINSQGQ-VQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGV 130 + + ++ +G+ + AMN G+Y P+GL IE ++ L ++G GNF + P GV Sbjct: 60 FERINSALDGEGRQLAFAMNAGMYHADRRPVGLLIEEEVERAPLVTSAGPGNFGLLPNGV 119 Query: 131 FYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKH 190 F V I + A QSGPML+ G ++PR + S IRNGVG++ Sbjct: 120 FCVGDGFRVIESRSFAAERPACRHASQSGPMLVIGGELHPRFLVHSDSRYIRNGVGVSAD 179 Query: 191 G-NAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMIS 249 G AVF +S + F++F + +L + + LY DG+IS +Y +G P ++ Sbjct: 180 GRRAVFAISNRPVTFHEFGRLFRDELGLPEALYFDGSISRLYDRGARRSDWGTPMGPIVG 239 Query: 250 VE 251 + Sbjct: 240 LV 241 >UniRef50_A1B5U0 Putative uncharacterized protein n=1 Tax=Paracoccus denitrificans PD1222 RepID=A1B5U0_PARDP Length = 251 Score = 174 bits (441), Expect = 2e-42, Method: Composition-based stats. Identities = 67/239 (28%), Positives = 104/239 (43%), Gaps = 4/239 (1%) Query: 19 IFLALTLLPLFAVAADDCALSDPTLTVQAYTVNP-QTERVKMYWQKANGEAWGTLHALLA 77 F AL + L A+A T+ Q ++++ +G G A+ Sbjct: 12 AFGALIAMTLPALAGICEKRDFDGQGYVICTLTAGQEPGLRLWLNGPDGRTLGDFTAVRR 71 Query: 78 DINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDK 137 + + AMN G+Y + P+GLY+ +G + L A G GNF + P GVF G + Sbjct: 72 TLAQGESLGFAMNAGMYHPDFTPVGLYVSDGVSQHDLVTAGGGGNFGMLPNGVFCAGGAR 131 Query: 138 V--GIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNA-V 194 I K + + + A QSGPML+ +G ++PR + S IRNGVG++ G Sbjct: 132 PYQVIESRAFAKAAPDCRLATQSGPMLVIDGALHPRFLVDSDSRYIRNGVGVSPDGQTAW 191 Query: 195 FLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVERK 253 F +S +A F+ F + L LY DG+IS +Y G +I + Sbjct: 192 FAISDRAVTFHQFGRLFRDGLGARDALYFDGSISRLYAPGLGRADFGRRLGPIIGYVGQ 250 >UniRef50_Q0G184 Putative uncharacterized protein n=2 Tax=Aurantimonadaceae RepID=Q0G184_9RHIZ Length = 268 Score = 171 bits (434), Expect = 2e-41, Method: Composition-based stats. Identities = 67/246 (27%), Positives = 118/246 (47%), Gaps = 8/246 (3%) Query: 14 LNLKRIFLALTLLPLFAVAADDCALSD----PTLTVQAYTVNPQTERVKMYWQKANGEAW 69 + L+P+ + A + ++ V + + + G + Sbjct: 24 PAVLATGFLSWLVPVPDLPAGHEGICRIAMAGSVETILCEVPLSSFDLHLRALDDAGRPY 83 Query: 70 GTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGG 129 T A + G+V +AMN G+Y E P+GL +++G+ L +G GNF +RP G Sbjct: 84 ETFEKAAASL--SGEVVLAMNAGMYHEDRRPVGLTVQDGRIVKKAVLGTGSGNFSLRPNG 141 Query: 130 VFYVAGDKVGIVRLDAFK-TSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGIN 188 +FY+ + + + + S + A QSGPML+ G ++PR P S +RNGVG++ Sbjct: 142 IFYLEDGRAFVRETERYLGESHDPVLATQSGPMLLIGGKVHPRFIPTSDSLYVRNGVGVS 201 Query: 189 KHGNAVFLL-SQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTM 247 + G VFL +++ NFYDFA + + + V+ L+ DG +S + + I ++R M Sbjct: 202 EDGRTVFLALTRKPINFYDFALFFRDTVGVKDALFFDGQVSSLSYRAANIAYRRDRLGPM 261 Query: 248 ISVERK 253 + V +K Sbjct: 262 LLVTKK 267 >UniRef50_D1AL67 Periplasmic protein-like protein n=1 Tax=Sebaldella termitidis ATCC 33386 RepID=D1AL67_SEBTE Length = 266 Score = 171 bits (432), Expect = 3e-41, Method: Composition-based stats. Identities = 94/219 (42%), Positives = 132/219 (60%), Gaps = 4/219 (1%) Query: 36 CALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYD 95 + D T Y + E +KMYW+ N +A+ L + + + ++ A NGGIY Sbjct: 50 KKIEDRGFT--VYKPDLNKEIIKMYWKDENNKAYSELSKFIQEN-TGNKINFATNGGIYS 106 Query: 96 ESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFA 155 E Y P GLYIEN + +NLA GEGNF+++P GVFY+ ++ I AF+ ++ I +A Sbjct: 107 EEYEPNGLYIENHKIISKINLADGEGNFYMQPNGVFYIQNNQPKISESKAFEYNENISYA 166 Query: 156 VQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKL 215 QSGP+L+ENGVIN +I N S KIR+ VGI++ FL+S + NFYDF+ YA KL Sbjct: 167 TQSGPLLIENGVINKKIGKNSESFKIRSAVGIDRENKVFFLMSSEKINFYDFSKYALDKL 226 Query: 216 NVEQLLYLDGTISHMYM-KGGAIPWQRYPFVTMISVERK 253 N + LL+LDG IS MY IP Q YPF +I+ E++ Sbjct: 227 NCKDLLFLDGAISKMYFADEKKIPEQDYPFAVIITSEKR 265 >UniRef50_C7PF78 Putative uncharacterized protein n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PF78_CHIPD Length = 273 Score = 166 bits (421), Expect = 5e-40, Method: Composition-based stats. Identities = 70/228 (30%), Positives = 114/228 (50%), Gaps = 8/228 (3%) Query: 34 DDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADIN--SQGQVQMAMNG 91 + + A VNP + ++W A+ + L D+ + + M NG Sbjct: 46 GEITFTHNGQQYDAIVVNPAVSDISLHWLSADQQTPYKSIQALQDVLLEKKKDILMITNG 105 Query: 92 GIYDESYAPLGLYIENGQQKVALNLA-SGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSK 150 G++ ++ P+GL+I G++ ++ A GNF+++P GVFY+ + + Sbjct: 106 GMFMKNNIPVGLFISQGRELRPIDAATDQPGNFYMQPNGVFYLDHTGPHVSTTTDYLKRS 165 Query: 151 ----EIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATN-FY 205 +I A QSGPML+ G+IN + +P + +R+GVGI +GN VF++S++A FY Sbjct: 166 RAHSKIVAATQSGPMLVSKGIINAKFNPGSVNRNLRSGVGILSNGNVVFIISKEAQTTFY 225 Query: 206 DFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVERK 253 DFA KA+ + LYLDG IS MY+K F MI+V + Sbjct: 226 DFASIFKARFGCKDALYLDGAISKMYLKNSRPGDLNGDFGAMIAVTAR 273 >UniRef50_Q1QCK8 Periplasmic protein-like n=1 Tax=Psychrobacter cryohalolentis K5 RepID=Q1QCK8_PSYCK Length = 276 Score = 163 bits (412), Expect = 6e-39, Method: Composition-based stats. Identities = 68/237 (28%), Positives = 116/237 (48%), Gaps = 9/237 (3%) Query: 24 TLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANG-EAWGTLHALLADINSQ 82 T ++ + + + T +Q+ + + + ++WQ+++ + T LL+ + Sbjct: 38 TASTDWSCQSHNTPFAYSTCHIQSDLLTNKRYSLALFWQQSDSRQPLLTFDNLLSTLPPS 97 Query: 83 GQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAG-DKVGIV 141 ++ AMN G+Y+E+YAP+G + ++ ALNL G GNF + P GV + KV I Sbjct: 98 QSLKFAMNAGMYNENYAPIGYTVIKSEEIRALNLKEGGGNFHLLPNGVLWWDKSGKVQIT 157 Query: 142 RLDAFKTSKE-----IQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFL 196 +A + +A QSGPML+ N I+P+ P+ S+KIRNG+G+ G+ F+ Sbjct: 158 ESNALAEQLKNGIAQPLYATQSGPMLVINDAIHPQFDPDGTSAKIRNGIGVCSDGSLQFV 217 Query: 197 LSQQATNFYDFACYAKAKLNVEQLLYLDGT-ISHMYMKGGAIPWQRYPFVTMISVER 252 S+ FY FA K +L L+LDG S +Y ++ MI + Sbjct: 218 NSEAPVAFYQFASLFKNELKCPNALFLDGGIASALYAPTIDKHDKK-EMGVMIGLVE 273 >UniRef50_A5WGQ7 Periplasmic protein-like protein n=1 Tax=Psychrobacter sp. PRwf-1 RepID=A5WGQ7_PSYWF Length = 309 Score = 161 bits (408), Expect = 2e-38, Method: Composition-based stats. Identities = 58/206 (28%), Positives = 98/206 (47%), Gaps = 6/206 (2%) Query: 53 QTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKV 112 + + Q + E L+ D+ +++ A N G+YD ++AP+G + G+Q + Sbjct: 95 NQPQAAIVDQDKSHEPLYKFDTLIKDLPKDSELKFAANAGMYDGNFAPIGYTVIQGRQIL 154 Query: 113 ALNLASGEGNFFIRPGGVFYVAG-DKVGIVRLDAFKT-----SKEIQFAVQSGPMLMENG 166 +LNL G GNF + P GV + + V I + +A QSGPML+ +G Sbjct: 155 SLNLKQGGGNFHLLPNGVLWWDKANHVHITESTQLDAMLKSGEAKPWYATQSGPMLVIDG 214 Query: 167 VINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGT 226 I+P+ + + S KIRNGVG+ F+ S++ NFY FA + K L+ + L+LDG Sbjct: 215 HIHPKFNSDSTSKKIRNGVGVCDGSQIHFVTSREPVNFYQFARFFKEDLHCDNALFLDGG 274 Query: 227 ISHMYMKGGAIPWQRYPFVTMISVER 252 ++ + M+ + Sbjct: 275 VASALYAPDVAAQEEKNMGVMVGLIE 300 >UniRef50_A5EW16 Putative uncharacterized protein n=1 Tax=Dichelobacter nodosus VCS1703A RepID=A5EW16_DICNV Length = 263 Score = 161 bits (408), Expect = 2e-38, Method: Composition-based stats. Identities = 77/260 (29%), Positives = 126/260 (48%), Gaps = 19/260 (7%) Query: 12 ITLNLKRIFLALTLLP-LFAVAADDCAL---------SDPTLTVQAYTVNPQTERVKMYW 61 + + L++I + + L L AA +V P+ ++++ W Sbjct: 1 MLVALRKIIVPVILSSFLLETAAAHLDFKKVAGGNFARFHHQSVDYAVFMPEHDKIRFLW 60 Query: 62 QKANGEAWGTLHALLADINSQG-QVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGE 120 Q GE + T+H L + ++G QV MN GI++++ P GL+IE LN SG+ Sbjct: 61 QNDRGENYQTMHHALRALTNEGYQVHFLMNAGIFNQNAQPAGLWIEKKALLRPLNRRSGK 120 Query: 121 GNFFIRPGGVFYVAGDKVGIVRL-DAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASS 179 GNF I+P GVFY+ +K I+ + +AVQSGP+L+ +G IN R+ N ++ Sbjct: 121 GNFHIQPNGVFYLTQEKAHIITTVQWHNNPPKADYAVQSGPLLIIDGAINSRLPKNHKAA 180 Query: 180 KIRNGVGINKHGNAVFLLS----QQA--TNFYDFACYAKAKLNVEQLLYLDGTISHMYMK 233 RN V ++K F+++ A N Y FA + + +Q LYLDG++S Y+ Sbjct: 181 YKRNAVCVDKARRVYFVITTRYDDGAHFPNLYRFAHALQ-TIGCQQALYLDGSLSDFYLP 239 Query: 234 GGAIPWQRYPFVTMISVERK 253 + + F MI+V K Sbjct: 240 MESSRFHWQKFAGMIAVVSK 259 >UniRef50_C8N6Z2 Lipoprotein n=1 Tax=Cardiobacterium hominis ATCC 15826 RepID=C8N6Z2_9GAMM Length = 304 Score = 159 bits (402), Expect = 9e-38, Method: Composition-based stats. Identities = 75/227 (33%), Positives = 112/227 (49%), Gaps = 11/227 (4%) Query: 34 DDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQG-QVQMAMNGG 92 + + + Y +P V ++W+ A+G A+ L L + G +V MN G Sbjct: 77 SYASTTYKNVRYGIYQADPAQ--VSLHWKTADGSAYANLATLKRSLEQSGARVAFLMNAG 134 Query: 93 IYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKT-SKE 151 IY E+ P GL+IE GQ V LN +G+GNF I+P GVFY+ K I A+ + Sbjct: 135 IYSENDTPAGLWIERGQTLVPLNRKNGKGNFHIQPNGVFYIERGKARIQTSAAYHIGNHH 194 Query: 152 IQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA------TNFY 205 +AVQSGP+L+ +G NPR N++S RN V F+L++ +F+ Sbjct: 195 PDWAVQSGPLLLLDGKPNPRFVKNLSSPHKRNAVCTTADNRLYFILTEDYDLGSEWPSFH 254 Query: 206 DFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVER 252 FA + L LYLDGT+S Y+ G A + +V +I+V Sbjct: 255 RFAEALQ-HLGCHDALYLDGTLSGWYIPGIAGTFHWTHYVGIIAVTT 300 >UniRef50_Q2NAA1 Putative uncharacterized protein n=3 Tax=Erythrobacter RepID=Q2NAA1_ERYLH Length = 277 Score = 158 bits (399), Expect = 2e-37, Method: Composition-based stats. Identities = 64/230 (27%), Positives = 106/230 (46%), Gaps = 8/230 (3%) Query: 26 LPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQV 85 A + L+ + + P R+ + + L Sbjct: 54 TSNVAAESACERLTFQEVVLTHCVAVPAKHRITTVLGPPH----RSFAKLAEG--RSSAP 107 Query: 86 QMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDA 145 A+N G++D P+G Y+E+ ++ ALN G GNF ++P GVFY + + + ++ Sbjct: 108 VFAVNAGMFDGDGKPIGYYVEDSERLQALNTNDGAGNFHLKPNGVFYGSNGEWRVRTTES 167 Query: 146 FKTS--KEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATN 203 F + QF QSGPML+ +G ++P I + S +IRNGVG+++ G A F++S+ + Sbjct: 168 FLANVSDRPQFGTQSGPMLLIDGKLHPEISEDGPSRQIRNGVGVDRQGRAHFVISEGPIS 227 Query: 204 FYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVERK 253 F FA + + N LYLDG +S ++ R P MI VE + Sbjct: 228 FGKFARFFRDVANTPNALYLDGNVSGLWDPANDRMDARAPIGPMIVVETR 277 >UniRef50_B2HYZ5 Predicted periplasmic protein n=11 Tax=Acinetobacter RepID=B2HYZ5_ACIBC Length = 204 Score = 155 bits (392), Expect = 1e-36, Method: Composition-based stats. Identities = 61/205 (29%), Positives = 106/205 (51%), Gaps = 4/205 (1%) Query: 12 ITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKA-NGEAWG 70 + + + I + + A+A + + + T E+++++ + + + Sbjct: 1 MKILVLCI-VNFIIFTQSALALEYRQIRNTTDDQFEVIEISNLEQLRLFLKNPQTDQYYK 59 Query: 71 TLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGV 130 + + + + Q+ AMNGG++ ++P+GLYIENG++ LN G GNFF++P GV Sbjct: 60 SFDNIQYQLKACEQLTFAMNGGMFHSGFSPVGLYIENGRESQPLNEDKGWGNFFLQPNGV 119 Query: 131 FYVAGDKVGIVRLDAFKTSK-EIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINK 189 + I+ + +K + +A QSGPML+ NG INP N S KIRNGVG+ K Sbjct: 120 LAWNDKQAVILTTEQYKAKVFQPDYATQSGPMLVINGKINPLFLANSDSKKIRNGVGV-K 178 Query: 190 HGNAVFLLSQQATNFYDFACYAKAK 214 + F++S+ NFY FA + + K Sbjct: 179 NNKLYFVISKNRVNFYSFAQFFQKK 203 >UniRef50_A0Q3C5 Conserved protein n=7 Tax=Clostridia RepID=A0Q3C5_CLONN Length = 335 Score = 153 bits (387), Expect = 4e-36, Method: Composition-based stats. Identities = 38/232 (16%), Positives = 66/232 (28%), Gaps = 35/232 (15%) Query: 36 CALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIY 94 + + NP+ +V E G+ + + + A+N G + Sbjct: 105 REIHGDKFKGHLLVIKNPKKIKVGY------NEHLGSKGETTSAMAKRYNSIAAINAGGF 158 Query: 95 -------------DESYAPLGLYIENGQQK-VALNLASGEGNFFIRPGGVFYVAGDKVGI 140 + + P G+ I NG+ L I G+ V + Sbjct: 159 VANNASSKDANPSETNGNPGGILISNGEIVYNNLRNNEKICIAGITADGILLVGNYNLDE 218 Query: 141 VRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQ 200 + ++ AV GP L+ NG + R +G K G+ +FL+ Sbjct: 219 MM------KLNVKDAVSFGPALIVNGQKTITSGDGGWGTAPRTAIGQRKDGSILFLVIDG 272 Query: 201 AT------NFYDFACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRYPFV 245 + + + LDG S MY G I Sbjct: 273 KYIGRLAVTLRELQDILY-EYGAYNAVNLDGGSSSTMYYNGKVISEPYKSTG 323 >UniRef50_UPI0001744B4D hypothetical protein VspiD_05050 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001744B4D Length = 235 Score = 152 bits (383), Expect = 1e-35, Method: Composition-based stats. Identities = 63/228 (27%), Positives = 104/228 (45%), Gaps = 13/228 (5%) Query: 37 ALSDPTLTVQAYTVNPQTE-RVKMYWQKANGEAWGTLHALLADINSQGQVQ-MAMNGGIY 94 + V+ R+ + W +G+ G+ LL + QG+ A N GIY Sbjct: 9 RIEFEGAIYHVLRVDRADFSRLDLRWLGQDGKPLGSFGPLLQEAARQGRRIEFATNAGIY 68 Query: 95 DESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAG-DKVGIVRLDAF-KTSKEI 152 + P GL I G++ V LNLA GEGNF++ P GVFY+ G++ + ++ + Sbjct: 69 ERGPKPCGLTIAGGKELVPLNLAKGEGNFYLHPNGVFYLDDQTGAGVMTGAEYGQSGLQP 128 Query: 153 QFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINK-HGNAVFLLSQ------QATNFY 205 + A QSGP+L+ G I+P + N + ++RN VG+ G VF++S F+ Sbjct: 129 RLATQSGPILLRQGKIHPAFNFNSPNRRLRNAVGVRASDGQVVFVMSDREDRVKGRVTFH 188 Query: 206 DFACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRYPFVTMISVER 252 + + L + L+LDG IS ++ F M + + Sbjct: 189 QLSRFFL-HLGCQDALFLDGDISDFLFHPPAGAAVTPNTFAGMFVLWK 235 >UniRef50_A6LS70 Putative uncharacterized protein n=23 Tax=Clostridium RepID=A6LS70_CLOB8 Length = 356 Score = 152 bits (383), Expect = 1e-35, Method: Composition-based stats. Identities = 36/227 (15%), Positives = 62/227 (27%), Gaps = 31/227 (13%) Query: 37 ALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYD 95 + + V +P ++ + G + I A+NGG + Sbjct: 132 DIENSKYNGYYLVVKDPTRVKIGV------SSKLGVEGETTSTIAENNDAIAAINGGAFT 185 Query: 96 E----------SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDA 145 + G+ + G+ KV + F I GV V Sbjct: 186 DQSSAAQWTGNGGLASGIVMTGGEVKVNDVGDNPTTTFGIDKNGVMVVGD------YTVE 239 Query: 146 FKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQAT--- 202 IQ A+ GP L+ NG + + + +G K G+ + L+ Sbjct: 240 KLKELGIQEALSFGPALIINGNMVKINGDGGFGTAPKTAIGQMKDGSIILLVIDGREIGS 299 Query: 203 ---NFYDFACYAKAKLNVEQLLYLDGTISHM-YMKGGAIPWQRYPFV 245 + +L + LDG S Y G Sbjct: 300 IGATLKELQEIM-HQLGAWNAMNLDGGKSTTLYYYGEVRNKPSNSMG 345 >UniRef50_Q5WVS5 Putative uncharacterized protein n=5 Tax=Legionella RepID=Q5WVS5_LEGPL Length = 258 Score = 149 bits (376), Expect = 8e-35, Method: Composition-based stats. Identities = 51/233 (21%), Positives = 89/233 (38%), Gaps = 27/233 (11%) Query: 11 MITLNLKRIFLALT---------LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYW 61 + + IF +T L P + L +P + + ++ ++ + Sbjct: 17 FLLILALAIFTPMTSYSASDWQELTPGIEYQDLEGGLLNPWSHIHVFRIDLNKNQMALVT 76 Query: 62 QKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEG 121 K + ++ + +++NGG +D + PLGL I N +Q+ L S Sbjct: 77 AKNLAQKNASVDQ----FAEHSKALLSINGGFFDHEFNPLGLRINNKKQENPLKRISWW- 131 Query: 122 NFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKI 181 G+FYV +K I + F I FA+QSGP L+ G I P + V Sbjct: 132 -------GIFYVKDNKPRITNIRNFHYDSNIDFAIQSGPRLLIRGNI-PSLKAGV---AD 180 Query: 182 RNGVGINKHGN-AVFLLSQQATNFYDFACYAKA-KLNVEQLLYLDGTISHMYM 232 R +GI G + + + A + A ++ L+ + LDG S Sbjct: 181 RTALGITDDGKVIILVTTNAAMSTRQLAQIMRSPPLSCSDAINLDGGSSSQLY 233 >UniRef50_D1REU9 Putative uncharacterized protein n=1 Tax=Legionella longbeachae D-4968 RepID=D1REU9_LEGLO Length = 260 Score = 148 bits (373), Expect = 2e-34, Method: Composition-based stats. Identities = 52/210 (24%), Positives = 86/210 (40%), Gaps = 18/210 (8%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQ 84 L P + P V + V+ + ++ + K + ++ + Sbjct: 40 LSPGIEYQDLAGGILAPWSHVYVFRVDLKKNKLGLVNAKNLSLKYASV----NQFAEHSK 95 Query: 85 VQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLD 144 +++NGG +D + PLGL I NG+ + L S G FFI+ K I L Sbjct: 96 ALLSINGGFFDHKFNPLGLRITNGKLENPLKRISWWGVFFIKNN--------KAYISSLR 147 Query: 145 AFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQ-ATN 203 F+ +I FA+QSGP L+ N I P + P +A R+ +GI G + L++ A Sbjct: 148 QFQYDNDIDFAIQSGPRLLVNRKI-PSLKPGIAE---RSALGITADGKIILLVTTNAAMT 203 Query: 204 FYDFACYAKA-KLNVEQLLYLDGTISHMYM 232 A ++ L+ + LDG S Sbjct: 204 TNKLAHLLRSPPLSCMDAINLDGGSSSQLY 233 >UniRef50_B8I4Q1 Putative uncharacterized protein n=3 Tax=Clostridium RepID=B8I4Q1_CLOCE Length = 346 Score = 146 bits (368), Expect = 7e-34, Method: Composition-based stats. Identities = 33/228 (14%), Positives = 58/228 (25%), Gaps = 29/228 (12%) Query: 34 DDCALSDPTLTVQAYTVN-PQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGG 92 + + + V+ P +V + + I + A+NGG Sbjct: 120 EYFDVESRNFKGKMIIVDDPTRIKVGYSSKMPRSG------ETTSSIARRNGAVAAINGG 173 Query: 93 IY------DESYAPLGLYIENGQQKVA--LNLASGEGNFFIRPGGVFYVAGDKVGIVRLD 144 + +G I NG+ N + G+ V Sbjct: 174 GFIDKGWAGTGGVAIGFVISNGKYISGKLTNNYTKRDTIAFTKDGMLIVGK------HSQ 227 Query: 145 AFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQAT-- 202 A I+ + GP L+ NG R +G + G+ + L+ + Sbjct: 228 AELAKYNIKEGISFGPPLIVNGKPTINKGDGGWGISPRTAIGQKEDGSVMLLVIDGRSLK 287 Query: 203 ----NFYDFACYAKAKLNVEQLLYLDGTIS-HMYMKGGAIPWQRYPFV 245 + LDG S MY G + Sbjct: 288 SFGATLKEVQDIMLEH-GAVNAANLDGGSSATMYYDGKVVNTPSDALG 334 >UniRef50_D0WLU9 Putative uncharacterized protein n=1 Tax=Actinomyces sp. oral taxon 848 str. F0332 RepID=D0WLU9_9ACTO Length = 447 Score = 144 bits (364), Expect = 2e-33, Method: Composition-based stats. Identities = 39/226 (17%), Positives = 65/226 (28%), Gaps = 27/226 (11%) Query: 31 VAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMN 90 V + T+T TV + AN + + + I S A+N Sbjct: 212 VEQIATGSGNNTVTYYVATVKLTDAT-ALKSAFANNQFGRNITQKTSTIASNNNAIFAIN 270 Query: 91 GGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSK 150 G Y + G+ I NG +G G + + + Sbjct: 271 GDYY--GFRSSGIVIRNGVVYRDDGARAG---LAFYRDGSVKIYDET---STNGQKLVKE 322 Query: 151 EIQFAVQSGPMLMENGVINPRIHP----------NVASSKIRNGVGINKHGNAVFLLSQQ 200 + + GP L++NG I I ++ ++ R VG K G VF++ Sbjct: 323 GVWNTLSFGPSLVKNGKIVEGIDDVEIDTNFGNHSIQGNQPRTLVGAKKDGTLVFVVVDG 382 Query: 201 AT-------NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPW 239 + A + LDG S G + Sbjct: 383 RDAGYSRGVTMTEAAKIMLEQ-GCVTAYNLDGGGSSTMYFNGEVIN 427 >UniRef50_A0PY15 Conserved protein n=4 Tax=Clostridium RepID=A0PY15_CLONN Length = 436 Score = 144 bits (364), Expect = 2e-33, Method: Composition-based stats. Identities = 37/232 (15%), Positives = 67/232 (28%), Gaps = 39/232 (16%) Query: 37 ALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE 96 + + V P ++++ + + + + G ++I + A+N G + + Sbjct: 200 DIKTNRFNGKMLIV-PNSKKIVIGFNEESPSKVG---KTTSEIAKENNAICAINAGGFTD 255 Query: 97 S------------------YAPLGLYIENGQQKVALNLASGE---GNFFIRPGGVFYVAG 135 P G+ I NG+ + G V Sbjct: 256 DVSGKSAEVVLNPDSGYETRKPCGILIHNGEFVYNDDKGRKNEKIDIVGFSKRGKLIVGK 315 Query: 136 DKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVF 195 + I+ AV GP L+ +G + R +G + G+ +F Sbjct: 316 ------YTLEELKNINIKEAVSFGPALIVDGNPVNILGDGGWGVAPRTAIGQRRDGSVLF 369 Query: 196 LLSQQA------TNFYDFACYAKAKLNVEQLLYLDGT-ISHMYMKGGAIPWQ 240 L+ D K K LDG +S MY K I Sbjct: 370 LVIDGRGFKSMGATIKDVQDIMK-KYGAVNASNLDGGTVSTMYYKDKVINKP 420 >UniRef50_C2JZN3 N-acetylmuramoyl-L-alanine amidase/probable S-layer protein n=2 Tax=Lactobacillus rhamnosus RepID=C2JZN3_LACRH Length = 559 Score = 144 bits (364), Expect = 2e-33, Method: Composition-based stats. Identities = 45/226 (19%), Positives = 74/226 (32%), Gaps = 22/226 (9%) Query: 23 LTLLPLFAVAA-DDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTL----HALLA 77 TL P + S + +NP+ ++ A + A Sbjct: 124 ATLTPGVTEQRLTYISQSGTQNKYYSVALNPKNPNTQLLTGTPGDGATSGVQTVSDQASA 183 Query: 78 DINSQGQVQMAMNGG-IYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGD 136 I + QV A+NG S P G I++G + A ++ E F I+ G + Sbjct: 184 AIKNGHQVVAAVNGDLFKIASGVPTGNVIKDGVELHAA-TSARESFFGIKKDGTPIIGD- 241 Query: 137 KVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFL 196 K ++Q A+ +L+ +G +N S+ R VGI G F+ Sbjct: 242 -----EQTYQKVKGDLQQALGGRNILVADGKVNET-KAIGTDSEPRTAVGIKADGTVFFV 295 Query: 197 LSQQAT-------NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGG 235 + + D A + L LDG S Y+ Sbjct: 296 VVDGRQAPTSNGLSMVDLANLMIQR-GAVTALNLDGGGSSTYVARE 340 >UniRef50_C4G6X0 Putative uncharacterized protein n=2 Tax=Lactobacillales RepID=C4G6X0_ABIDE Length = 345 Score = 144 bits (364), Expect = 2e-33, Method: Composition-based stats. Identities = 38/223 (17%), Positives = 66/223 (29%), Gaps = 17/223 (7%) Query: 37 ALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE 96 + TV + + A + + + + +A+NG Y Sbjct: 118 TVRKNNTTVYVADIKLSDSSY-LKTALAYDSFGTNVTETTSSMATNNNAILAVNGDYYGA 176 Query: 97 SYAPLGLYIENGQQKVALNL-ASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFA 155 + G I+NG S + + G F + + + + Sbjct: 177 DRS--GYVIKNGVIYRNTVRSDSEYPDLAVYKDGSFKIIYETEV---TAEELLADGVVNL 231 Query: 156 VQSGPMLMENGVINPRIHPNVAS---SKIRNGVGINKHGNAVFLLSQQATN------FYD 206 GP L+ENG I+ + V R +GI + + ++S T+ Y+ Sbjct: 232 FAFGPSLVENGEISVDQNTEVRQAMTKNPRTAIGIVDKNHYILVVSDGRTSESEGLSLYE 291 Query: 207 FACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMIS 249 A K + LDG S G I IS Sbjct: 292 LAEVLK-EYGATTAYNLDGGGSSTMYFNGNIVNNPTTNGHRIS 333 >UniRef50_C4FXK4 Putative uncharacterized protein n=1 Tax=Catonella morbi ATCC 51271 RepID=C4FXK4_9FIRM Length = 305 Score = 142 bits (359), Expect = 8e-33, Method: Composition-based stats. Identities = 44/240 (18%), Positives = 76/240 (31%), Gaps = 21/240 (8%) Query: 23 LTLLPLFAVAADDCALSDPTL-----TVQAYTVNPQTERVKMYWQKANGEAWGTLHALLA 77 T+ A D A++ T + TV + + A+ + A + Sbjct: 62 ATVNTATAYEDDTKAIAIDTYERNSTQIHVATVTIKG-DASIKTALADETYGRNVKAKTS 120 Query: 78 DINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDK 137 +A+NG Y G I NGQ + + + + I G F + + Sbjct: 121 TTAQSVNAVLAVNGDYY--GARDAGYVIRNGQLLRSDSQDPNQEDLVIYQDGSFEIIREG 178 Query: 138 VGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASS---KIRNGVGINKHGNAV 194 +K + GP L+E+ + V + R +GI + V Sbjct: 179 DI---TAQELLNKGAVQVLSFGPALIEDSQVAVDSTDEVGKAMASNPRTAIGIIDDKHYV 235 Query: 195 FLLSQQAT------NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMI 248 ++S T + + A + K +L V LDG S G I + I Sbjct: 236 LVVSDGRTDESKGLSLKELADFMK-ELKVTTAYNLDGGGSSTMYFNGQIINKPTTNGHNI 294 >UniRef50_Q97FU3 Uncharaterized conserved protein, YOME B.subtilis ortholog n=6 Tax=Clostridium RepID=Q97FU3_CLOAB Length = 354 Score = 142 bits (357), Expect = 1e-32, Method: Composition-based stats. Identities = 42/242 (17%), Positives = 74/242 (30%), Gaps = 40/242 (16%) Query: 31 VAADDCALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAM 89 + + + + +P ++ + G ++I A+ Sbjct: 114 EQIECKKIQGNKFSGLMLVIHDPTKVKIGYTSK------LGVEGETTSEIAKHNNALAAV 167 Query: 90 NGGIYDES------------YAPLGLYIENGQQKVALNLAS---GEGNFFIRPGGVFYVA 134 NGG + E+ P G+ I +G+ N +G I GV V Sbjct: 168 NGGGFQENSSGSKVVWTGTGALPTGIIISDGKVVYPKNPDQLSIQKGTAAITKSGVLVVG 227 Query: 135 GDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHP----NVASSKIRNGVGINKH 190 + + ++ + A+ GP L+ NGV R + ++ R +G K Sbjct: 228 DHSIREL------LNENVVEAINFGPTLIVNGVDQTRDSFGNSIDSQGAQPRTAIGQRKD 281 Query: 191 GNAVFLLSQQAT------NFYDFACYAKAKLNVEQLLYLDGTIS-HMYMKGGAIPWQRYP 243 G + L D + + N + LDG S MY G I Sbjct: 282 GAILLLTVDGRQGLQMGATIKDIQKIMEQE-NAYNAVNLDGGASTTMYYNGHVINNPCDK 340 Query: 244 FV 245 F Sbjct: 341 FG 342 >UniRef50_B2KU41 N-acetylmuramoyl-L-alanine amidase/putative S-layer protein (Fragment) n=1 Tax=Lactobacillus rhamnosus HN001 RepID=B2KU41_LACRH Length = 470 Score = 141 bits (356), Expect = 2e-32, Method: Composition-based stats. Identities = 45/226 (19%), Positives = 74/226 (32%), Gaps = 22/226 (9%) Query: 23 LTLLPLFAVAA-DDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTL----HALLA 77 TL P + S + +NP+ ++ A + A Sbjct: 124 ATLTPGVTEQRLTYISQSGTQNKYYSVALNPKNPNTQLLTGTPGDGATSGVQTVSDQASA 183 Query: 78 DINSQGQVQMAMNGG-IYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGD 136 I + QV A+NG S P G I++G + A ++ E F I+ G + Sbjct: 184 AIKNGHQVVAAVNGDLFKIASGVPTGNVIKDGVELHAA-TSARESFFGIKKDGTPIIGD- 241 Query: 137 KVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFL 196 K ++Q A+ +L+ +G +N S+ R VGI G F+ Sbjct: 242 -----EQTYQKVKGDLQQALGGRNILVADGKVNET-KAIGTDSEPRTAVGIKADGTVFFV 295 Query: 197 LSQQAT-------NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGG 235 + + D A + L LDG S Y+ Sbjct: 296 VVDGRQAPTSNGLSMVDLANLMIQR-GAVTALNLDGGGSSTYVARE 340 >UniRef50_B7AQ96 Putative uncharacterized protein n=1 Tax=Bacteroides pectinophilus ATCC 43243 RepID=B7AQ96_9BACE Length = 305 Score = 140 bits (353), Expect = 4e-32, Method: Composition-based stats. Identities = 40/222 (18%), Positives = 67/222 (30%), Gaps = 17/222 (7%) Query: 41 PTLTVQAYTVNPQTERVK-MYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYA 99 Y + Q + A+G + + + I +A+NG Y + Sbjct: 76 REYDTSIYVADIQLADASYLRAGLADGTFGRNVTEVTSQIAQDSNAILAINGDFY--GFR 133 Query: 100 PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLD---AFKTSKEIQFAV 156 G + NG +GN + V Y G I + Sbjct: 134 NKGYVMRNGYLYRETAQQGRQGNS-RQEDLVIYEDGHMDVIEENEVAAQTLKDSGASQIF 192 Query: 157 QSGPMLMENGVINPRIHPNVASS---KIRNGVGINKHGNAVFLLSQQAT------NFYDF 207 GP L++NG I + V S R +G+ + + +S T Y Sbjct: 193 SFGPGLIKNGNITVDENSEVEQSMQSNPRTAIGMITPLHYIMAVSDGRTEASEGLTLYQL 252 Query: 208 ACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMIS 249 A K + + LDG S G + + + + IS Sbjct: 253 AQIMKGQ-DCVTAYNLDGGGSSTMWFNGEVVNKPTSYGSKIS 293 >UniRef50_C6XT14 Exopolysaccharide biosynthesis protein n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6XT14_PEDHD Length = 303 Score = 140 bits (353), Expect = 4e-32, Method: Composition-based stats. Identities = 37/235 (15%), Positives = 68/235 (28%), Gaps = 29/235 (12%) Query: 25 LLPLFAVAADDCALSDPTL--TVQAYTVNPQTERVKMYWQKANGEAWGTLHALLAD---- 78 + P L + ++ + VK+ + Sbjct: 50 IQPGVEETDIHYQSQSGGLSTKIFILKIDLKNPDVKLQAATPYDAPGYGSQTVPEMAKYV 109 Query: 79 INSQGQVQMAMNGGIYDES-YAPLGLYIENGQQKVAL------NLASGEGNFFIRPGGVF 131 + +V +NG ++ S Y PLG+ + G G I G Sbjct: 110 DAANNRVIAGINGDFFNTSSYVPLGIIYKKGVAIKPAFTDNTDKPQQGLSFLGILANGKP 169 Query: 132 YVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHG 191 Y+ D +++ A+ +G L+++ + P V R GVGI Sbjct: 170 YIGDK-----ETDYPTIKSQLKEALGAGVFLVKDYKKITQSIPTVD---PRTGVGITDDD 221 Query: 192 NAVFLLSQQAT-------NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPW 239 F++ N+ + A V+ + LDG S +M Sbjct: 222 LVYFIVVDGRNFYNSNGINYQEMGKIMYA-FGVKNAVNLDGGGSSTFMIKHPRVD 275 >UniRef50_C7TED9 N-acetylmuramoyl-L-alanine amidase n=2 Tax=Lactobacillus rhamnosus RepID=C7TED9_LACRG Length = 1561 Score = 140 bits (352), Expect = 5e-32, Method: Composition-based stats. Identities = 39/223 (17%), Positives = 74/223 (33%), Gaps = 21/223 (9%) Query: 23 LTLLPLFAVAA-DDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLH----ALLA 77 TL P + + ++P+ + N + A Sbjct: 117 ATLAPGITEQKLTYLNQNGVQNKYYSVALDPKNPNTTLLAGMPNDGTKPGMQTVRNQANA 176 Query: 78 DINSQGQVQMAMNGGIYDE-SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGD 136 I+ QV A+N Y+ + APLG ++NG + + + F I+ G + Sbjct: 177 AISHGQQVVAAVNADYYNMATGAPLGNVVKNGTEIYSAPDTNEA-FFGIKKDGTPMIG-- 233 Query: 137 KVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFL 196 + ++Q AV + +++G +N ++ VGI G F+ Sbjct: 234 ----TAATYQQRKGDLQQAVGGPSIFVKDGKVNATQVAGSEGNEPCTAVGIKADGTVFFV 289 Query: 197 LSQQAT-------NFYDFACYAKAKLNVEQLLYLDGTISHMYM 232 + + DFA + L+LDG S ++ Sbjct: 290 VIDGRQAPLSTGISVGDFAKLMIER-GAVNALFLDGGGSATFV 331 >UniRef50_B8FUP3 Putative uncharacterized protein n=2 Tax=Desulfitobacterium hafniense RepID=B8FUP3_DESHD Length = 350 Score = 139 bits (351), Expect = 6e-32, Method: Composition-based stats. Identities = 37/225 (16%), Positives = 56/225 (24%), Gaps = 28/225 (12%) Query: 36 CALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIY 94 +S V NPQ R+ A + G +++ +N G + Sbjct: 123 VEVSGKGFQGYLLKVGNPQRVRL------AATDQLGDRGLKVSEFVENNHAVAGINAGGF 176 Query: 95 DE------SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKT 148 + P G+ I G+ N + I V V Sbjct: 177 ADPGGVSFGGTPTGILITEGKIIHKDNWET-YSLIGITKHDVLVVGR------YTLEQIE 229 Query: 149 SKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQAT------ 202 I+ AV GP L+ NG R +G G + L+ Sbjct: 230 ELGIRDAVSFGPALIVNGEPMITYGDGGWGIAPRTAIGQTHDGTILLLVIDGRQLGSLGA 289 Query: 203 NFYDFACYAKAKLNVEQLLYLDGTISHMYMK-GGAIPWQRYPFVT 246 D LDG S + G P+ Sbjct: 290 TLKDVQDILIEH-GAVNGANLDGGSSSTLVYEGEVKNKPSSPYGP 333 >UniRef50_C3QHD0 Exopolysaccharide biosynthesis protein n=2 Tax=Bacteroides RepID=C3QHD0_9BACE Length = 311 Score = 139 bits (350), Expect = 8e-32, Method: Composition-based stats. Identities = 41/235 (17%), Positives = 78/235 (33%), Gaps = 23/235 (9%) Query: 24 TLLPLF-AVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKA--NGEAWGTLHALLADIN 80 TL P A+ + + + + + V+ + + M G+ L + Sbjct: 62 TLAPGVKALEMEILSATGMAVKMFVLEVDLKDTHLTMKASSPKDEGKLKTKQQMTLQALA 121 Query: 81 ---SQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDK 137 +V A+NG + P G+Y NG + F + G + Sbjct: 122 HDKQGSRVLAAVNGDFFATDGTPQGIYYRNGVCLKNTMTDNVCTFFAVTKGKKAVIG--- 178 Query: 138 VGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLL 197 + EIQ AV LM NG + P+ + + + R +G+ + L+ Sbjct: 179 ---SYDEYDTYKDEIQEAVGGRVRLMTNGNVLPQ---TLTALEPRTAIGVTDNNVVYILV 232 Query: 198 SQQATNFY-------DFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFV 245 + +Y + KA L + + LDG S ++ ++ F Sbjct: 233 ADGRNFWYSNGMRYAEMGAVMKA-LGAKDAINLDGGGSSTFIIRSKAGFEENRFA 286 >UniRef50_A9KDD2 Hypothetical exported protein n=6 Tax=Coxiella burnetii RepID=A9KDD2_COXBN Length = 255 Score = 138 bits (347), Expect = 2e-31, Method: Composition-based stats. Identities = 48/227 (21%), Positives = 85/227 (37%), Gaps = 22/227 (9%) Query: 26 LPLFAVAADDCALSDPTL--TVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQG 83 + V + S P L + A+ +NP+ + ++ A Sbjct: 36 MAYTVVTPAFSSESRPGLFTHLYAWKINPRQYHFNIVTA----KSLQQTALYAAQAAKIK 91 Query: 84 QVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRL 143 +A+NGG + + PLGL I + + +L S G F I+ + I Sbjct: 92 DTVLAINGGFFTPNLEPLGLRISDNKVLSSLKRISWWGIFMIKNN--------RAAITSP 143 Query: 144 DAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA-- 201 ++ S EI FA+Q+GP L+ +G I P++ A R+ +G+ G+ + ++ Sbjct: 144 QNYRYSPEINFAIQAGPRLIIDGRI-PQLRGGSAQ---RSALGVTPTGDIIIAITDNNLL 199 Query: 202 TNFYDFACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRYPFVTM 247 A KL L LDG S +++ Q + Sbjct: 200 LTATQLA-ILLQKLGCSNALNLDGGTSSQLFVHTNNFSLQIPSLRPV 245 >UniRef50_C2KZT9 Exopolysaccharide biosynthesis protein n=2 Tax=Firmicutes RepID=C2KZT9_9FIRM Length = 438 Score = 137 bits (346), Expect = 3e-31, Method: Composition-based stats. Identities = 41/221 (18%), Positives = 78/221 (35%), Gaps = 18/221 (8%) Query: 39 SDPTLTVQAYTVNPQ-TERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDES 97 Y + + T+ + AN + +D+ + +A+NG Y Sbjct: 212 RYRAYDSNIYVADVEVTDGTSILSAFANNTYGRNITDTTSDMAEENNAVLAINGDYY--G 269 Query: 98 YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQ 157 G I NG + ++GE + G + +++ K+ + Sbjct: 270 ARQSGYVIRNGVVYRS-QGSNGEDMVISKDGSLSFISESD----TTTDSLIQKQTWQVLS 324 Query: 158 SGPMLMENGVINPRIHPNVA---SSKIRNGVGINKHGNAVFLLSQQATN------FYDFA 208 GP+L+ENG + + V +S R +G + +F++S T+ Y+ A Sbjct: 325 FGPVLVENGQVAVSENDEVGMAMASNPRTAIGTVAKNHYLFVVSDGRTSESAGLSLYELA 384 Query: 209 CYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMIS 249 + K+ L + LDG S + G + IS Sbjct: 385 NFMKS-LGATNVYNLDGGGSSTMVFQGEVVNNPTTNGNKIS 424 >UniRef50_D1CZ42 Putative uncharacterized protein n=1 Tax=Brucella sp. 83/13 RepID=D1CZ42_9RHIZ Length = 248 Score = 137 bits (345), Expect = 4e-31, Method: Composition-based stats. Identities = 46/167 (27%), Positives = 81/167 (48%), Gaps = 10/167 (5%) Query: 96 ESYAPLGLYIENGQQKVALNLASGEG------NFFIRPGGVFYVAGDKVGIVRLDAF-KT 148 ++PLGL+I +G+++ + A + NF+ +P G+F++ G++ + F K Sbjct: 82 AGFSPLGLFIADGKEQSPIQPAGAKTSDKPVPNFYKKPNGIFFLDESGAGLLPTEQFVKR 141 Query: 149 SKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFA 208 ++ A QSGPML+ +NP A R+GVG+ K G F++S A NF+DFA Sbjct: 142 RPKVWLATQSGPMLVIENRLNPIFIIGSADKSRRSGVGVCKDGVIHFVVSDDAVNFHDFA 201 Query: 209 CYAKAKLNVEQLLYLD-GTISHMYMKGGAIPW--QRYPFVTMISVER 252 + + +L L+LD G + +Y + M ++ Sbjct: 202 RFFRDRLECPNALFLDGGGGAGLYDPALGRNDMSWHGGYGPMFALIE 248 >UniRef50_Q97FU6 Uncharaterized conserved protein, YOME B.subtilis ortholog n=2 Tax=Clostridium RepID=Q97FU6_CLOAB Length = 347 Score = 137 bits (344), Expect = 4e-31, Method: Composition-based stats. Identities = 40/227 (17%), Positives = 69/227 (30%), Gaps = 27/227 (11%) Query: 39 SDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE- 96 D T + +P ++ Q G + ++ + + A+NGG + + Sbjct: 115 GDGKFTANVLIIKDPNRVKIGYAAQ------IGYVGETTREMAKRYKAVAAINGGYFKDT 168 Query: 97 ---------SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDK---VGIVRLD 144 P G + NGQ + ++ + D VG Sbjct: 169 SPNKQSGGVGAIPTGFIMSNGQIVYPQDNSNWSEITSEEENRALTIDKDGNLQVGGTYSP 228 Query: 145 AFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNF 204 I+ AV + P L++NG N +V+ + R +G + +F++ Sbjct: 229 DQLIKSGIREAVITEPYLIKNGK-NTIQANSVSGTNPRTAIGQRADKSIIFMVIDGRQGV 287 Query: 205 YDFA-----CYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRYPFV 245 A KL LDG S MY G I Sbjct: 288 KLGATVGDVQVLMHKLGAVNAACLDGGGSTAMYYNGEIINNPSNATG 334 >UniRef50_A4VXL8 Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase n=12 Tax=Firmicutes RepID=A4VXL8_STRSY Length = 312 Score = 136 bits (342), Expect = 7e-31, Method: Composition-based stats. Identities = 42/223 (18%), Positives = 72/223 (32%), Gaps = 17/223 (7%) Query: 37 ALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE 96 ++ TV + + + A + A ++ + +A+NG Y Sbjct: 85 TITTNNTTVYVADIQVSSPEY-LKTALAQNTYGTNVTAKTSETAAANNAILAVNGDYYGA 143 Query: 97 SYAPLGLYIENGQQKVALNLASGE-GNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFA 155 + G I+NG + G+ I G F V + K + Sbjct: 144 N--STGYVIKNGVLYRDTVRDNAAYGDLAIYADGSFEVIYENEI---TAQELIDKGVVNL 198 Query: 156 VQSGPMLMENGVINPRIHPNVA---SSKIRNGVGINKHGNAVFLLSQQATN------FYD 206 + GP L+ENG I V SS R+ +GI + + +++ T+ Y Sbjct: 199 LAFGPSLVENGEIVVDTSTEVGRAMSSNPRSAIGIIDENHYIIVVADGRTSESQGLSLYQ 258 Query: 207 FACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMIS 249 A K + + LDG S G + IS Sbjct: 259 LAEVMK-QYGAQTAYNLDGGGSSTLYFNGQVINNPTTNGNTIS 300 >UniRef50_Q892K3 N-acetylmuramoyl-L-alanine amidase/putative S-layer protein n=1 Tax=Clostridium tetani RepID=Q892K3_CLOTE Length = 708 Score = 135 bits (341), Expect = 1e-30, Method: Composition-based stats. Identities = 41/224 (18%), Positives = 72/224 (32%), Gaps = 21/224 (9%) Query: 24 TLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALL----ADI 79 + + A V + + RV + N + + +++ A I Sbjct: 63 IVPGVTEKAYRFIDKDGKKQYVSLMEIRWTSSRVGVKAGTPNNKDSYGMQSVIMQAKASI 122 Query: 80 NSQGQVQMAMNGGIYDE-SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKV 138 S V +NG Y + P+G+ +NG+ N A+ F + G + K Sbjct: 123 ASGDNVVGGVNGDFYYTVTGEPIGIVYKNGKAVK-ANHAAEWNFFGVLEDGTPIIGDGK- 180 Query: 139 GIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLS 198 + +Q A+ +L+ G I + + R VGI K G F+ Sbjct: 181 -----KYNEVKDSLQEALGGNAILVREGRIY-QTPSIGGYREPRTAVGIKKDGTIFFVTV 234 Query: 199 QQAT-------NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGG 235 + D A L + L LDG S ++ Sbjct: 235 DGRQEGHSAGISMPDLAQLMID-LGAVEALNLDGGGSSTFVSRK 277 >UniRef50_A7GCS1 Putative uncharacterized protein n=12 Tax=Clostridium RepID=A7GCS1_CLOBL Length = 339 Score = 134 bits (338), Expect = 2e-30, Method: Composition-based stats. Identities = 38/252 (15%), Positives = 73/252 (28%), Gaps = 36/252 (14%) Query: 9 KGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTV-NPQTERVKMYWQKANGE 67 KG+ +LK + + + + + + ++ + NPQ ++ + Sbjct: 88 KGISNSSLKENYGDIKIKNKYGNSVERYDINTAKFDGYILEIKNPQKVKIGYT------K 141 Query: 68 AWGTLHALLADINSQGQVQMAMNGGIY-----------DESYAPLGLYIENGQQKVALNL 116 G + + + + A+NGG + P GL I NG+ Sbjct: 142 YMGKMGERTSKMAERHGAVAAVNGGGFRDVSSTGKLWTGTGAYPEGLVISNGKVIYNDFK 201 Query: 117 ASGEGNF-FIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPN 175 + + N G+ V V + + A+ L+ NG P Sbjct: 202 SGQKVNVTAFTKEGLLVVGDHTVDEL------LKMGVVEALSFRNTLIINGKPIPY---- 251 Query: 176 VASSKIRNGVGINKHGNAVFLLSQQAT------NFYDFACYAKAKLNVEQLLYLDGTISH 229 R +G + G V L+ + + V LDG S Sbjct: 252 NEGINPRTAIGQKQDGTIVLLVIDGRRGIKQGATLEEVENILLQR-GVVNASNLDGGSSS 310 Query: 230 MYMKGGAIPWQR 241 G + + Sbjct: 311 TMYYKGKVINRP 322 >UniRef50_C2HB28 Exopolysaccharide biosynthesis protein n=4 Tax=Enterococcus faecium RepID=C2HB28_ENTFC Length = 308 Score = 134 bits (338), Expect = 2e-30, Method: Composition-based stats. Identities = 38/233 (16%), Positives = 71/233 (30%), Gaps = 18/233 (7%) Query: 20 FLALTLLPLFAVAADDCALSDPTL-TVQAYTVNPQTERVK-MYWQKANGEAWGTLHALLA 77 F + + + LS + Y + + AN + + Sbjct: 63 FEPVITDNSYQDENINITLSSERVDETTVYVADITVSDSSYLKTALANNTYGRNIKETTS 122 Query: 78 DINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGE-GNFFIRPGGVFYVAGD 136 I + Q +A+NG Y + G + NG + I G F + + Sbjct: 123 AIAQEQQAILAINGDYY--GFRDKGYVLRNGTLYRDTPSDDETKEDLVIDKNGDFSIIKE 180 Query: 137 KVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASS---KIRNGVGINKHGNA 193 +++Q + GP L+ENG + V+ S R + + Sbjct: 181 ---AETSAEKLVEEDVQQVLSFGPALVENGEVTVSEDEEVSQSMKSNPRTAIAQVGTNHY 237 Query: 194 VFLLSQQAT------NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQ 240 + ++S+ T + + A K + LDG S G + Q Sbjct: 238 LVVVSEGRTDDSQGLSLSELATVLKNH-GAKTAYNLDGGGSTTLYFNGKVINQ 289 >UniRef50_C1I4R7 Putative uncharacterized protein (Fragment) n=1 Tax=Clostridium sp. 7_2_43FAA RepID=C1I4R7_9CLOT Length = 894 Score = 134 bits (338), Expect = 2e-30, Method: Composition-based stats. Identities = 37/253 (14%), Positives = 73/253 (28%), Gaps = 34/253 (13%) Query: 25 LLPLFAVA-ADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLAD----I 79 L P + + ++ + + V + N E+ L + + Sbjct: 46 LAPGVIEKGYTFEDNTGKRIESFVIEIDTKNKNVSIEASTPNDESAYGLQPVRKQAEALL 105 Query: 80 NSQGQVQMAMNGGIYDE-SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKV 138 V +N Y+ + P G+ +++G N F I G + Sbjct: 106 AKGENVVAGVNADFYNMATGEPNGVLLKDGVIIK--NHPESRKFFGILKDGSAVIGD--- 160 Query: 139 GIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLS 198 + ++ A+ +L+++G + A + R VGI +GN F+ Sbjct: 161 ---YNKFNEVKDNVEEALGGNAILVKDGQVFET-PQTGADKEPRTAVGIKSNGNVFFITV 216 Query: 199 QQAT-------NFYDFACYAKAKLNVEQLLYLDGTISHMY-----------MKGGAIPWQ 240 + D A + + Q L LDG S + +K Sbjct: 217 DGRQEPYSAGLSMDDLAQLMIS-MGAIQALNLDGGGSTTHLSRIPGTDNLEVKNRPSDNS 275 Query: 241 RYPFVTMISVERK 253 + K Sbjct: 276 ERSVANSWMIISK 288 >UniRef50_C6IYX5 Putative uncharacterized protein n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6IYX5_9BACL Length = 347 Score = 134 bits (337), Expect = 3e-30, Method: Composition-based stats. Identities = 34/220 (15%), Positives = 66/220 (30%), Gaps = 19/220 (8%) Query: 37 ALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYD 95 +S + TV +P R+ + ++ GE ++ A + A + Sbjct: 100 EISGKSYHGYVLTVNDPTKIRLGVPAKRGKGEKVSSMVARTGALAGVNGGGFA-DPNWKG 158 Query: 96 ESYAPLGLYIENGQQK-VALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQF 154 + P+G+ I G+ ++ + + G IQ Sbjct: 159 NGFKPIGVVISRGKLYYNGISSGAATQIVGLDKQGKMIAGK------YTLEELDKLGIQE 212 Query: 155 AVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQAT-------NFYDF 207 AV P ++ NG R R +G + G +F++ + YD Sbjct: 213 AVTFQPRIIVNGKGQIRSQKEGWGIAPRTAMGQREDGAILFVVIDGRQPGYSIGASLYDV 272 Query: 208 ACYAKAKLNVEQLLYLDGTISHMYMK--GGAIPWQRYPFV 245 + LDG S + +K G + + Sbjct: 273 QQIMLER-GAVIAANLDGGSSTVLVKEGGEIVNKPSSEYG 311 >UniRef50_C6D6X3 Exopolysaccharide biosynthesis protein n=6 Tax=Bacteria RepID=C6D6X3_PAESJ Length = 344 Score = 133 bits (335), Expect = 4e-30, Method: Composition-based stats. Identities = 37/235 (15%), Positives = 65/235 (27%), Gaps = 27/235 (11%) Query: 36 CALSDPTLTVQAYTVNPQ-TERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIY 94 + + Y + ++ + A + + I S A+NG Y Sbjct: 112 VETGSGSDMITYYVADVAFNSKMNLLTAFAKDSFGTNITQNTSTIASNNNAVFAINGDYY 171 Query: 95 DESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQF 154 + G+ I NG G F G ++ + Sbjct: 172 --GFRSDGVVIRNGTVYRDEPARIGLAMF---NDGTM---KSYDEEETSTDDLLAQGVTN 223 Query: 155 AVQSGPMLMENGVINPRIH----------PNVASSKIRNGVGINKHGNAVFLLSQQA--- 201 A GP L+ +G I ++ +S R G+G+ + VF++ Sbjct: 224 AFSFGPALVTDGEIAGDFSHVEIDKNFGNRSIQNSNPRTGIGMISANHYVFVVVDGRSTG 283 Query: 202 ----TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVER 252 +FA K +L + LDG S G + V Sbjct: 284 YSRGMTLTEFADLFK-ELGATEAYNLDGGGSSTMYFMGRVVNNPLGKGNERGVSD 337 >UniRef50_B0TEY5 Putative uncharacterized protein n=1 Tax=Heliobacterium modesticaldum Ice1 RepID=B0TEY5_HELMI Length = 327 Score = 133 bits (334), Expect = 6e-30, Method: Composition-based stats. Identities = 38/223 (17%), Positives = 62/223 (27%), Gaps = 27/223 (12%) Query: 37 ALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGG-IY 94 + T + V +P +V + G + ++ + A+NGG Sbjct: 98 DIQGYRFTGKVMIVHDPLRIKVAV------SSKLGEAGETVPEMARREGAVAAINGGGFI 151 Query: 95 DESYA-----PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTS 149 D + P G+ + GQ ++ E I G V +R Sbjct: 152 DPNGQGNGAYPDGITVSRGQFISVIDEDQKENIIGITKKGQMIVGRYSARELRS------ 205 Query: 150 KEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQAT------N 203 +I V GP L+ NG R G+G G+ + ++ Sbjct: 206 MDISEVVTFGPPLVVNGRPTITSGDGGWGVAPRTGIGQRSDGSIIMVVIDGRQIGSIGAT 265 Query: 204 FYDFACYAKAKLNVEQLLYLDGTISHMY-MKGGAIPWQRYPFV 245 + K LDG S G I F Sbjct: 266 LRELQDLLL-KYGAVTAGNLDGGASTTMVYNGKVINQPSSVFG 307 >UniRef50_B1BC21 Putative uncharacterized protein n=2 Tax=Clostridium botulinum RepID=B1BC21_CLOBO Length = 326 Score = 132 bits (332), Expect = 1e-29, Method: Composition-based stats. Identities = 37/236 (15%), Positives = 65/236 (27%), Gaps = 35/236 (14%) Query: 38 LSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGG---- 92 L + + NP+ RV + G + ++I A+NGG Sbjct: 100 LENSRFKAYLMEISNPKKVRVGYA------KKLGKVGEPTSEIAKDFNAIAAINGGSFTD 153 Query: 93 -------IYDESYAPLGLYIENGQQK-VALNLASGEGNFFIRPGGVFYVAGDKVGIVRLD 144 P G+ + +G+ ++ + G + + +R Sbjct: 154 ETSNGTKYSGTGAFPEGVIMSHGKVIWKTVSTNTKIDIIAFNNEGKLILGKYTINELR-- 211 Query: 145 AFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQ---- 200 A+ P L+ +G A R +G K G +FL++ Sbjct: 212 ----KLNCIEALCYKPSLIVDGKKAKIKGDGGAGMAPRTAIGQKKDGTILFLVADGTMFK 267 Query: 201 --ATNFYDFACYAKAKLNVEQLLYLDGTIS-HMYMKGGAIPWQRYPFV--TMISVE 251 + K LDG S MY G I + S+ Sbjct: 268 RDGLRMDELQDILYEK-GAYNATNLDGGSSATMYYDGEVINNPCDSVGERPIPSIF 322 >UniRef50_C9KSV8 N-acetylmuramoyl-L-alanine amidase/putative S-layer protein n=1 Tax=Bacteroides finegoldii DSM 17565 RepID=C9KSV8_9BACE Length = 390 Score = 131 bits (330), Expect = 2e-29, Method: Composition-based stats. Identities = 37/193 (19%), Positives = 67/193 (34%), Gaps = 17/193 (8%) Query: 60 YWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASG 119 + ++ + + A LA S +V A+NG + + P G+Y NG + Sbjct: 183 FLGTSSSYYYVSRDAALAYDKSGSRVLAAVNGDFFAKDGTPQGIYYRNGTCLKGTMTDNV 242 Query: 120 EGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASS 179 F I + + + IQ AV LM NG + P+ V + Sbjct: 243 CTFFAITKNKRAIIG------SYDEYDSYKENIQEAVGGRVRLMTNGNVLPQ---TVTAL 293 Query: 180 KIRNGVGINKHGNAVFLLSQQATNFY-------DFACYAKAKLNVEQLLYLDGTISHMYM 232 + R +G+ L++ +Y + KA L + + LDG S ++ Sbjct: 294 EPRTAIGVTDDNVVYILVADGRNFWYSNGMRYAEMGAVMKA-LGAKNAINLDGGGSSTFI 352 Query: 233 KGGAIPWQRYPFV 245 ++ F Sbjct: 353 IRKIAGFEDGRFA 365 >UniRef50_C0ZEQ6 Putative uncharacterized protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0ZEQ6_BREBN Length = 356 Score = 131 bits (330), Expect = 2e-29, Method: Composition-based stats. Identities = 37/225 (16%), Positives = 64/225 (28%), Gaps = 17/225 (7%) Query: 28 LFAVAADDCALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQ 86 L V D + + + +P R+ + +K G+ I Sbjct: 125 LIEVEDIDVSKGSYYFKGKIMYISDPSRVRLVVTNRKDRGDLLDEFVNKTGAIGIVNASG 184 Query: 87 MAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAF 146 A + Y + G+ I G+ N SGE + G Sbjct: 185 FA-DPDGYGKGARAYGVVIHEGKILQGYNPRSGETALGLTYDGKLITG------SYSAEQ 237 Query: 147 KTSKEIQFAVQSGPMLMENGV-INPRIHPNVASSKIRNGVGINKHGNAVFLLSQQAT--- 202 ++ AV P L+ NG + + R +G + G VF + Sbjct: 238 LVKMGVRDAVSFRPQLIVNGKNMFEGKPAKSWGIQPRTAIGQKEDGTIVFAVIDGRQPGH 297 Query: 203 ----NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYP 243 + D A + V + +DG S M + G + Sbjct: 298 SIGASMNDMAELLAER-GVVTAMAMDGGSSSMMLHNGEAITKTSS 341 >UniRef50_C0GEE0 Putative uncharacterized protein n=1 Tax=Dethiobacter alkaliphilus AHT 1 RepID=C0GEE0_9FIRM Length = 379 Score = 130 bits (328), Expect = 3e-29, Method: Composition-based stats. Identities = 31/201 (15%), Positives = 63/201 (31%), Gaps = 23/201 (11%) Query: 50 VNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQ 109 V+P RV + +G ++ I + +A+N + P + G+ Sbjct: 181 VDPTKLRVAF-----AHDEYGAPRKPVSKIANSNNAILAINASGFS-GNVPFSPVVREGE 234 Query: 110 QKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVIN 169 + G I G+ +G + ++ + P+L+ NG + Sbjct: 235 VYSMDINHTPMG---ITACGMLMDSGKRGVEQMIED-----GAHQVITFRPVLVRNGQMT 286 Query: 170 PRIHPNVASSKIRNGVGINKHGNAVFLLSQQAT-------NFYDFACYAKAKLNVEQLLY 222 N + R +G ++G+ +F++ N D A + Sbjct: 287 ST-AQNNNTIHPRTAIGQKENGDLIFIVVDGRRNNWSTGINLGDLAQIFIDE-GAAWAYN 344 Query: 223 LDGTISHMYMKGGAIPWQRYP 243 LDG S G + + Sbjct: 345 LDGGGSTTLYFNGKVLNKPSD 365 >UniRef50_A5USB9 Putative uncharacterized protein n=2 Tax=Roseiflexus RepID=A5USB9_ROSS1 Length = 282 Score = 129 bits (325), Expect = 7e-29, Method: Composition-based stats. Identities = 47/234 (20%), Positives = 93/234 (39%), Gaps = 19/234 (8%) Query: 23 LTLLPLFAVAA--DDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADIN 80 + V SDP + + A ++P T R+++ + + T Sbjct: 54 IVAASGVEVRTFTTGEDSSDPPVPIYAVRLDPATIRLRIRYAPDAPQPLRTWF------- 106 Query: 81 SQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGI 140 + +A+NGG + L + +G + A G P G ++ Sbjct: 107 VAHRPLVAVNGGFFTAENRATALIVSDGTVY-GTSYAGFGGMLAAAPDGRVWI-----QA 160 Query: 141 VRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLS-Q 199 +R + + + + A+QS PML+ G + I+ N R V I++ G + ++ Sbjct: 161 LRDEPYDPNIPLDQAIQSFPMLIYPGGVVASINDNG-QRARRTVVAIDRAGRVLLIVCPT 219 Query: 200 QATNFYDFACYAKAK-LNVEQLLYLDGTISH-MYMKGGAIPWQRYPFVTMISVE 251 A + + A + + + +++ L LDG S +++ GA+ WQ F + SV Sbjct: 220 SAFSLQELATWLASSDMEIDRALNLDGGSSSGIFVNAGAVRWQIDSFAALPSVI 273 >UniRef50_D1BL19 Putative uncharacterized protein n=4 Tax=Veillonellaceae RepID=D1BL19_VEIPT Length = 312 Score = 128 bits (322), Expect = 2e-28, Method: Composition-based stats. Identities = 27/223 (12%), Positives = 52/223 (23%), Gaps = 27/223 (12%) Query: 37 ALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYD 95 + + +P+ +V ++I A+NGG + Sbjct: 89 KIQSARYVGYILEIPDPRRIQVGTAA------NIQEKGDTTSNIAKMNNAVAAINGGGFH 142 Query: 96 E------SYAPLGLYIENGQQK--VALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFK 147 + P G + +G+ + G Sbjct: 143 DPNGTGTGRLPYGFILHDGEYVIGKDVGPDEDVDFVGFSKAGNLIAGN------YNKTQL 196 Query: 148 TSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDF 207 + + GP L+ +G R +G K G +FL+ Y Sbjct: 197 GDMKAMEGITFGPPLIVDGKKMITEGDGGWGVGPRTAIGQKKDGTVLFLVIDGRQPGYSI 256 Query: 208 ACYAKA------KLNVEQLLYLDGTISHMYMKGGAIPWQRYPF 244 + + LDG S G + + Sbjct: 257 GATLRDVQDILFEKGCYIAANLDGGSSSTLYLNGKVVNKPADL 299 >UniRef50_UPI000178A82C copper amine oxidase domain protein n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI000178A82C Length = 377 Score = 127 bits (319), Expect = 4e-28, Method: Composition-based stats. Identities = 42/238 (17%), Positives = 76/238 (31%), Gaps = 22/238 (9%) Query: 29 FAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMA 88 A + + + Q TV+ +V++ A +A L I + +A Sbjct: 149 VTSARKTFKVGARSFSAQVVTVSLLHPKVELDVVLAGNKA--GKVEDLRSIAKRSNAVVA 206 Query: 89 MNGGIYDESYA-----PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRL 143 +NG +D + P G + G + + + + G V Sbjct: 207 INGTFFDAYTSGAYKAPYGYLVSKGNIFHKASGDNRTIFTYDSNNLATMMPGLDFKSVY- 265 Query: 144 DAFKTSKEIQFAVQSGPMLMENGVINPRIHPNV-------ASSKIRNGVGINKHGNAVFL 196 + ++ A+Q+GP L+ NG + + R+ +GI K + L Sbjct: 266 ----ETGRMEGALQAGPRLLTNGKVTLDVKKEGFKDPKILTGGGARSALGITKDHKLILL 321 Query: 197 LSQQATNFYDFACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRYPFVTMISVERK 253 + A K Q + LDG S +Y G + I V+ K Sbjct: 322 TT-GGATIPQLAEIMKQA-GAYQAMNLDGGASSGLYYNGSYLTTPGRQISNAIVVKYK 377 >UniRef50_D1N9W8 Putative uncharacterized protein n=1 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1N9W8_9BACT Length = 275 Score = 127 bits (318), Expect = 5e-28, Method: Composition-based stats. Identities = 32/231 (13%), Positives = 69/231 (29%), Gaps = 28/231 (12%) Query: 25 LLPLFA-VAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQG 83 L P D + + + T +++ + +G + ++ + Sbjct: 40 LAPGVRLTTFDRVRVFGKPQFISVLRADLSTPGLRLGLAECDGGNY----ETVSHFGRRL 95 Query: 84 QVQMAMNGGIYDESYAPLG--LYIENGQQKV-ALNLASGEGNFFIRPGGVFYVAGDKVGI 140 A+N G + P+G +G+ L F + G + G Sbjct: 96 DALAAVNAGFFAMKGNPMGVRYLKIDGKVLNADLGGDPERAYFVLDQTGRPAIVG----- 150 Query: 141 VRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQ 200 A + + AV +L+++G + P + + R G++ + + ++ Sbjct: 151 ---PADFAPERCRSAVYGNRLLLKDGKVPP--LGDDKARHPRTAAGLSGN-TLLLVVIDG 204 Query: 201 A------TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKG--GAIPWQRYP 243 F + A K L + LDG S G + Sbjct: 205 RARESAGVTFAELATLLKD-LGCTDAVNLDGGGSSTMWTRHHGVVNHPSDN 254 >UniRef50_B2V2N5 Putative uncharacterized protein n=8 Tax=Clostridium RepID=B2V2N5_CLOBA Length = 348 Score = 125 bits (313), Expect = 2e-27, Method: Composition-based stats. Identities = 34/228 (14%), Positives = 68/228 (29%), Gaps = 31/228 (13%) Query: 37 ALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGG--- 92 + + NP +V M + G L +++ + A+NGG Sbjct: 118 DIHTDRYDGYMLEIENPHKVKVAMT------KYLGKLGQKTSEMAEEHNAIAAINGGSFV 171 Query: 93 --------IYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIV-RL 143 P G I +G+ + G+ N + + ++ + Sbjct: 172 DKSSDGITYAGTGGQPGGFVISSGKVVYPI----GKCNEHSVENVIAFTKKGQLIVGNHT 227 Query: 144 DAFKTSKEIQFAVQSG-PMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA- 201 A ++Q A+ P ++ NG+ + + R VG + G +FL Sbjct: 228 LAELKKLDVQEAMCFREPNVIINGIRQHKKEDYIDGINPRTAVGQKEDGTVLFLALDGRK 287 Query: 202 -----TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPF 244 Y+ +++ LDG S G + + Sbjct: 288 LSKPGATIYEVQEIMRSR-GAINAGMLDGGYSTTMYYKGDVINSPNAW 334 >UniRef50_C8WTH1 N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase-like protein exopolysaccharide biosynthesis protein n=4 Tax=Alicyclobacillus acidocaldarius RepID=C8WTH1_ALIAD Length = 352 Score = 125 bits (313), Expect = 2e-27, Method: Composition-based stats. Identities = 40/242 (16%), Positives = 72/242 (29%), Gaps = 38/242 (15%) Query: 36 CALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIY 94 L +PT V +P+ RV + + + +N G + Sbjct: 121 ITLHEPTFNAFILLVKDPKRIRVV------ATKYLHVRGETVMQMVQDSGAIAGINAGGF 174 Query: 95 ------DESYAPLGLYIENGQQKVALN-LASGEGNFFIRPGGVFYVAGDKVGIVRLDAFK 147 P G+ I +G+ + + G Sbjct: 175 VDTNWQGTGAYPQGITITDGKLVSMTGSPSQPQPVIAFTKEGQMIAG------TYSLNQL 228 Query: 148 TSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQAT----- 202 S ++ V GP+L+ENG P + + R +G K G + L++ Sbjct: 229 RSLDVWQCVGFGPVLVENGK--PTVSAENYAVNPRTAIGQTKDGTVILLVTDGRYATGPN 286 Query: 203 ----NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQR------YPFVTMISVER 252 +F D A + + + LDG S ++ G + + T I V Sbjct: 287 DVGASFADVARIML-QFHADIAANLDGGSSATFVYKGRMWNRPVDILGARAVATSIVVMP 345 Query: 253 KG 254 +G Sbjct: 346 EG 347 >UniRef50_UPI0001694670 hypothetical protein Plarl_22443 n=1 Tax=Paenibacillus larvae subsp. larvae BRL-230010 RepID=UPI0001694670 Length = 363 Score = 124 bits (310), Expect = 3e-27, Method: Composition-based stats. Identities = 28/212 (13%), Positives = 59/212 (27%), Gaps = 15/212 (7%) Query: 39 SDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESY 98 D + Y +P++ RV + +K GE ++ + ++ + Sbjct: 117 GDYWVGKMMYVFDPRSIRVVVPGKKGEGERITSMVERTGAVAGVNGGGF-IDPDGLGNGF 175 Query: 99 APLGLYIENGQQKVALNLAS-GEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQ 157 AP+G + G+ + G + + ++ AV Sbjct: 176 APIGAILSGGKVLYNDQKEDIPQHIVGFTDKGTLVIGK------YSIDQLRAMKVSEAVS 229 Query: 158 SGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQAT------NFYDFACYA 211 P ++ NG R +G G +F++ + Sbjct: 230 FYPRVIANGKPLITKGDGGWGRAPRTALGQRADGTVIFVVIDGRQAHSVGATLREVQDLL 289 Query: 212 KAKLNVEQLLYLDGTISHMYMKGGAIPWQRYP 243 + +LDG S +K + Q Sbjct: 290 LEQ-GCINAGFLDGGASSEMVKDRKLLTQPSS 320 >UniRef50_O31980 SPBc2 prophage-derived uncharacterized protein yomE n=2 Tax=root RepID=YOME_BACSU Length = 644 Score = 124 bits (310), Expect = 4e-27, Method: Composition-based stats. Identities = 29/248 (11%), Positives = 64/248 (25%), Gaps = 33/248 (13%) Query: 29 FAVAADDCALSDPTLTVQAYTVNPQT-------------ERVKMYWQKANGEAWGTLHAL 75 F V + + + V P+T + + T Sbjct: 59 FTVTSSFKQDATLGIEYYVTKVTPKTTEAKKSMVQKTFAYDFEKSIDPTSSYFGTTNRET 118 Query: 76 LADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAG 135 + + + + +A+N + + +GL I++G + A G + Sbjct: 119 VLSMAKRKRSVVAINASGWRSNGEVMGLQIKDGVLYKDYDAAGYTGA-----EACVFFDD 173 Query: 136 DKVGIVRLDAFKTS----KEIQFAVQSGPMLMENGVI---NPRIHPNVASSKIRNGVGIN 188 + + K + + G L+++ ++ R +G Sbjct: 174 GTMKVYGNREVDADILISKGARNSFAFGIWLVKDSKPRTAQMTTWADLNVKHPRQAIGQR 233 Query: 189 KHGNAVFLLSQQA------TNFYDFACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQR 241 G V + YD ++ LDG S ++G I Sbjct: 234 SDGTLVIITVDGRSLRSSGITAYDMPSLFLSE-GCINAFLLDGGGSSQTAVEGKYINNIS 292 Query: 242 YPFVTMIS 249 + Sbjct: 293 DGIERAVV 300 >UniRef50_C6PYU6 Putative uncharacterized protein n=1 Tax=Clostridium carboxidivorans P7 RepID=C6PYU6_9CLOT Length = 369 Score = 124 bits (310), Expect = 4e-27, Method: Composition-based stats. Identities = 32/223 (14%), Positives = 57/223 (25%), Gaps = 24/223 (10%) Query: 37 ALSDPTLTVQAYTV-NPQTERVKMYWQ-KANGEAWGTLHALLADINSQGQVQMA---MNG 91 + D V NP ++ + K G+ + + NG Sbjct: 146 EIDDTKFHACILEVKNPTRMKIGYTNKLKEVGQKTSEIAEENGAAAAINGGGFTDKSSNG 205 Query: 92 GIYDESYA-PLGLYIENGQQKVALNLASGEGNF-FIRPGGVFYVAGDKVGIVRLDAFKTS 149 ++ + A P G+ I NG+ + + N G V V + Sbjct: 206 KLWTGTGAYPQGIVISNGKVVYSDVKNNEAVNVTAFTKDGKLIVGDHTVSEL------LR 259 Query: 150 KEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA------TN 203 + A+ L+ NG + R +G G + L+ + Sbjct: 260 DNVTEAISFRNSLIINGKPVALAEEGL---NPRTAIGQKADGTIIMLVIDGRKGLKAGAS 316 Query: 204 FYDFACYAKAKLNVEQLLYLDGTISHMYM-KGGAIPWQRYPFV 245 + + LDG S G I Sbjct: 317 LKEVQNILLQR-GALNASSLDGGSSSTMYFNGEVINDPCDWNG 358 >UniRef50_A7V127 Putative uncharacterized protein n=1 Tax=Bacteroides uniformis ATCC 8492 RepID=A7V127_BACUN Length = 277 Score = 124 bits (310), Expect = 4e-27, Method: Composition-based stats. Identities = 38/235 (16%), Positives = 76/235 (32%), Gaps = 28/235 (11%) Query: 33 ADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGG 92 A +L V + ++P+ R + E + A+NG Sbjct: 48 AVFSSLYGVPQEVSIFEISPKRYRFDVLVHNPKEET--------SIAARHAGAVAAINGS 99 Query: 93 IYDES-YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRL---DAFKT 148 +D + ++G + G G + ++ ++ + Sbjct: 100 YFDMKAGNSVCYLRKDGVVIDTTST----GVLATVSNGAVLIKKGRLELIPWSKQEEKAC 155 Query: 149 SKEIQFAVQSGPMLMENGVINPRIHPN---VASSKIRNGVGINKHGNAVFLLSQQAT--- 202 + + + SGP+++++G + N V + R+ V + + G + ++ Sbjct: 156 TLKKGTVLASGPLMLKDGQVCDLSGTNRNFVDTKHPRSAVALTREGKILLIVVDGRRKGK 215 Query: 203 ----NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVERK 253 N + A L E L LDG S G A+P + S ERK Sbjct: 216 AEGINIPELAH-MIRILGGEDALNLDGGGSSTLWSG-ALPDKGIANTPSGSAERK 268 >UniRef50_C6J074 Copper amine oxidase domain-containing protein n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6J074_9BACL Length = 406 Score = 123 bits (309), Expect = 5e-27, Method: Composition-based stats. Identities = 39/235 (16%), Positives = 79/235 (33%), Gaps = 22/235 (9%) Query: 32 AADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNG 91 A + + + Q T++ +V++ A L+ + + + +A+NG Sbjct: 181 ARKTFKVGGRSFSTQMVTISLMDPKVRLKVALAGDAV--GKVEELSSLAKRHKAVVAING 238 Query: 92 GIYDESY-----APLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAF 146 ++ AP G + G+ K+ + + + GD DAF Sbjct: 239 TFFNAYTDNAYKAPYGYIVSGGELKMKASGDKRTIFTYDSNLLARLIPGDDF----NDAF 294 Query: 147 KTSKEIQFAVQSGPMLMENGVINPRIHPNV-------ASSKIRNGVGINKHGNAVFLLSQ 199 ++ A+Q+GP L+ NG + + R+ +G+ + + L + Sbjct: 295 NAGT-MEGALQAGPRLVVNGKVAVDVKAEGFKDPKILTGGGARSALGLTRDHKLILLTT- 352 Query: 200 QATNFYDFACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRYPFVTMISVERK 253 A K Q + LDG S +Y G + + V + Sbjct: 353 GGATIPQLAEIMKQA-GAYQAMNLDGGASSGLYYNGKYLTQPGRKISNALIVTYQ 406 >UniRef50_C1CWE2 Putative LysM lysin domain protein, n=1 Tax=Deinococcus deserti VCD115 RepID=C1CWE2_DEIDV Length = 442 Score = 123 bits (309), Expect = 5e-27, Method: Composition-based stats. Identities = 43/250 (17%), Positives = 87/250 (34%), Gaps = 12/250 (4%) Query: 10 GMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAW 69 GM L +R+ + + A L + VQ V+ + V + + Sbjct: 189 GMKILVAQRVPVPIPPRA-TGKAVTFKQLRPLNIPVQLVRVDLRHRDVLVAPVLPHAGLV 247 Query: 70 GTLHALLADINSQGQVQMAMNGGIYDE-SYAPLGLYIENGQQKVALNLASGEGNFFIRPG 128 L A + + + Q +NG + +YAP G + G+ I P Sbjct: 248 FGLGARVGQLAQRSGAQALINGSYFHPRTYAPAGDIVMQGRML---TWGRIPMALAITPD 304 Query: 129 GVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRI-----HPNVASSKIRN 183 + ++R T + ++ + +GP ++ G ++ P + R+ Sbjct: 305 NRATIRATTTPLLRRPLDTTWRGMETVIATGPRIVTGGAVHTNYNQVFRDPALFGRAARS 364 Query: 184 GVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRY 242 VG++ + + V + ++ + +L V++ L LDG S + G A+ Sbjct: 365 AVGLSSNRDLVMVSTRVRLTTTEMGKVM-TRLGVKEALLLDGGSSAGLAWNGRAVLDSMR 423 Query: 243 PFVTMISVER 252 I V Sbjct: 424 KVSYGIGVFT 433 >UniRef50_UPI0000E45D54 PREDICTED: similar to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase precursor n=2 Tax=Strongylocentrotus purpuratus RepID=UPI0000E45D54 Length = 447 Score = 122 bits (307), Expect = 8e-27, Method: Composition-based stats. Identities = 37/226 (16%), Positives = 61/226 (26%), Gaps = 27/226 (11%) Query: 39 SDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE-S 97 S VN V + +G A + Q +A+N G ++ S Sbjct: 100 SGERAPGHIVRVNSPARTVSVLEPFDSGGCTNHHRATVDSTAKQDNCLVAVNAGFFNPRS 159 Query: 98 YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQ 157 A G + NG+ N + IR G + + +T V Sbjct: 160 GACYGNVVSNGRLVQ-TNGGLQNAHLGIRADGTLVFG----YLSEENVLQTENPFIQLVG 214 Query: 158 SGPMLMENGVINPRIHPN---------------VASSKIRNGVGINKHGNAVFLLSQQ-- 200 L+ +G I R VG ++ G V + Sbjct: 215 GVGWLLRDGEIYVEESKKAECGDTEEASSVDLFFNMLSARVAVGSDEKGRLVIAVIDGQT 274 Query: 201 ---ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYP 243 + FA + + V + DG S ++ G I Sbjct: 275 LKRGLSLLSFAKWLLSH-GVTNAINFDGGGSATFVVNGTIVNHPSD 319 >UniRef50_D0TN59 Predicted protein n=3 Tax=Bacteroides RepID=D0TN59_9BACE Length = 315 Score = 122 bits (307), Expect = 9e-27, Method: Composition-based stats. Identities = 28/233 (12%), Positives = 64/233 (27%), Gaps = 21/233 (9%) Query: 23 LTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLA----- 77 + L + + + ++ V + Sbjct: 61 VVALGVTETDVHFQKADSRSTHIFIIDIDLNEPGVSLEVGMPYDADVRNNFQRQTLTEMA 120 Query: 78 --DINSQGQVQMAMNGGIYDESYAPL-GLYIENGQQKVALNLASGEGNFFIRPGGVFYVA 134 +V +N +D S + G NG + + Sbjct: 121 DYADRPWHRVAAMINADFWDVSTMDIRGPIHRNGVILKNSFIFKE-TLPQQALSFIALTK 179 Query: 135 GDKVGIVRLDAFK-TSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNA 193 +K+ I ++ ++ SG +++ +G I+ +P + R +G + G+ Sbjct: 180 DNKMVIADSVEYRGMQYNLKEVTGSGVIVLRDGEISGATYPGID---PRTCLGYSDDGHV 236 Query: 194 VFLLSQQAT-------NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPW 239 F+++ + + KA L + LDG S + I Sbjct: 237 YFMVADGRVEFYSYGLTYPEMGSIMKA-LGCSWAVNLDGGGSTQMLIRHPIAD 288 >UniRef50_D2NR45 Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase n=3 Tax=Micrococcineae RepID=D2NR45_9MICC Length = 356 Score = 122 bits (307), Expect = 9e-27, Method: Composition-based stats. Identities = 33/213 (15%), Positives = 66/213 (30%), Gaps = 27/213 (12%) Query: 46 QAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYI 105 + + + AN + + +++ S+ A+NG Y + G+ I Sbjct: 134 FVADIKLDNATL-LRSAFANNKFGQNIIDTPSNMASEHNGIWAINGDYY--GFRTTGIVI 190 Query: 106 ENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMEN 165 NG G G + + S+ + + GP L+++ Sbjct: 191 RNGVVYRDSGAREG---LAFYRDGSVKLYDETA---TNAQTLVSEGVWNTLSFGPALVKD 244 Query: 166 GVINPRIHP----------NVASSKIRNGVGINKHGNAVFLLSQQATN-------FYDFA 208 I I ++ ++ R GVG+ + VF++ + +FA Sbjct: 245 SAIVDGIDSVEVDTNFGNHSIQGNQPRTGVGVLGTNHLVFIVVDGRSTNYSRGVTMPEFA 304 Query: 209 CYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQR 241 K L LDG S + + + Sbjct: 305 QMFKD-LGCVSAYNLDGGGSSAMVFNNKLVNRP 336 >UniRef50_UPI0001BC3362 hypothetical protein BcroD2_01243 n=1 Tax=Butyrivibrio crossotus DSM 2876 RepID=UPI0001BC3362 Length = 356 Score = 122 bits (305), Expect = 1e-26, Method: Composition-based stats. Identities = 40/229 (17%), Positives = 64/229 (27%), Gaps = 27/229 (11%) Query: 35 DCALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGI 93 +S T V +P + +G+ G + I + +A+N G Sbjct: 135 FYEVSGSTFAGTMVVVTDPSRV-----FVGTSGDYKGEAGINVPAICDKYGATLAINAGG 189 Query: 94 YDE------SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFK 147 +++ PLG+ + GQ K N+ S VF + Sbjct: 190 FEDIGGVGNGGTPLGIVMSEGQLKYG-NVNSSYDLIGFDNNNVFVIG------QMTGQQA 242 Query: 148 TSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATN---- 203 + I+ AV GP L+ NG R +G G + L+ Sbjct: 243 IDRGIRDAVSFGPFLILNGTPLEVSGMGG-GLNPRTAIGQRADGAVLLLIIDGRQTHSLG 301 Query: 204 --FYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISV 250 D LDG S + G I + V Sbjct: 302 ASMNDLINVMLD-FGAVNAANLDGGGSTVLYYDGEIKNKISSIYGARGV 349 >UniRef50_C5S6T1 Putative uncharacterized protein n=1 Tax=Allochromatium vinosum DSM 180 RepID=C5S6T1_CHRVI Length = 272 Score = 121 bits (303), Expect = 3e-26, Method: Composition-based stats. Identities = 45/243 (18%), Positives = 80/243 (32%), Gaps = 24/243 (9%) Query: 22 ALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINS 81 P+ + + T+ + + R+ + + + + Sbjct: 43 PALEAPISHSERTLESSTGRTVRAHLALFDSRRYRLAVLDLGPD---LASASDWPEHTRA 99 Query: 82 QGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIV 141 G + A+NGG + PLGL I G++ GV Y + + Sbjct: 100 AG-LLAAVNGGFFHADGQPLGLVIAGGERLNRFETVK-------LLSGVLYGDARGIHLE 151 Query: 142 RLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA 201 R F++S I VQSGP L+E G + + S R + + + V ++ Sbjct: 152 RRARFQSSPGIDALVQSGPYLVEQGRAVRGLSTHDVSR--RTFIATDWRRHWVLGATRDG 209 Query: 202 TNFYDFACYA-----KAKLNVEQLLYLDGTISH--MYMKGGA----IPWQRYPFVTMISV 250 + A A VE+ L LDG S ++ G R P ++ V Sbjct: 210 LTLAELAEALATPGALAPWPVERALNLDGGTSTGFLFDPGAGQEPIHLRARRPVRNLVGV 269 Query: 251 ERK 253 + Sbjct: 270 RAR 272 >UniRef50_B3CE38 Putative uncharacterized protein n=3 Tax=Bacteroides RepID=B3CE38_9BACE Length = 285 Score = 121 bits (303), Expect = 3e-26, Method: Composition-based stats. Identities = 31/222 (13%), Positives = 63/222 (28%), Gaps = 27/222 (12%) Query: 32 AADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNG 91 A+ +L V + P+ R + E ++ + +A+NG Sbjct: 49 EAEFVSLYGVPQHVTILEIKPERHRFDILIHSPKEET--------SNAARRSGAVVAING 100 Query: 92 GIYD-ESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSK 150 ++ + + ++G G G + K+ I+ Sbjct: 101 SYFNIKQGTSICYLRKDGVVVDTTAT----GVLSTVSNGAVKIDKGKLDIIAWKKQDEKT 156 Query: 151 ---EIQFAVQSGPMLMENGVINPRIH---PNVASSKIRNGVGINKHGNAVFLLSQQAT-- 202 + + SGP+++ +G V + R+ V + K G + Sbjct: 157 CEQKEGSILVSGPLMLLDGKTCDLSACNRSFVQTKHPRSAVALMKDGTVFLIAVAGRFEG 216 Query: 203 -----NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPW 239 N + + L + L LDG S A Sbjct: 217 KAEGINIPELTHLLRV-LGARKALNLDGGGSTTLWSASAPDN 257 >UniRef50_A9B1E5 Putative uncharacterized protein n=1 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9B1E5_HERA2 Length = 272 Score = 119 bits (299), Expect = 7e-26, Method: Composition-based stats. Identities = 38/235 (16%), Positives = 84/235 (35%), Gaps = 20/235 (8%) Query: 22 ALTLLPLFAVAADDCALSDPTL---TVQAYTVNPQTERVKMYWQKANGEAWGTLHALLAD 78 T+ + + VQ V+P R+++ + A+ +++ Sbjct: 46 PTTIDNQWQTLEPGLEFREIGYDITNVQILRVDPAYFRLRVGYDVASPG-------RVSE 98 Query: 79 INSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKV 138 + + +NGG +D L I +G + G + G + Sbjct: 99 WAAALKPVAVINGGYFDAQGRATALTIFDG-VINGTSYDGFGGMLAVDS-----ADGWSL 152 Query: 139 GIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLS 198 +R + +++ + A+QS PML+ +G + + + R+ V I++ G + ++ Sbjct: 153 RSLREQPYDSTEVLNQALQSAPMLVVHGAAIEQPNDDGD-RARRSVVAIDQTGRLLLMVC 211 Query: 199 QQA-TNFYDFACYA-KAKLNVEQLLYLDGTISH-MYMKGGAIPWQRYPFVTMISV 250 D + + K L ++ L LDG S + + + V + V Sbjct: 212 SWPSFTLTDLSQWLVKQDLAIDAALNLDGGSSTGLVVASENRSFNLDSLVRVPQV 266 >UniRef50_A7LRK2 Putative uncharacterized protein n=1 Tax=Bacteroides ovatus ATCC 8483 RepID=A7LRK2_BACOV Length = 315 Score = 119 bits (298), Expect = 1e-25, Method: Composition-based stats. Identities = 28/229 (12%), Positives = 71/229 (31%), Gaps = 28/229 (12%) Query: 40 DPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALL-------ADINSQGQVQMAMNGG 92 + + T++ + + A + V + +NG Sbjct: 73 GESQHIFVATIDLNELTFTPATKDDKNVPATGPESSAPLPIHAFAAEANGKTVWLGVNGD 132 Query: 93 IYDES-YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKE 151 Y ++ +GL+ ++G + + + G YV +A Sbjct: 133 YYADNPRRVMGLFYKDGVCINSQYFEGHDEVLYQLKNGETYVG------QADEALAHEAN 186 Query: 152 IQFAVQSGPMLMENGVINPRIHP--NVASSKIRNGVGINKHGNAVFL-LSQQAT------ 202 + A+ +L+++GV+ ++ ++ R VG+++ +++ + Sbjct: 187 LLHALGGYGLLVKDGVVQNFYEEMGDLQNTHPRTSVGLSQDRKTMYVFVVDGRRKDSFFA 246 Query: 203 ---NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMI 248 A KA + + LDG S + + P ++ Sbjct: 247 LGLTLPHLATMMKA-VGCYNAINLDGGGSTTLIIRK-VNDGGKPTFPIL 293 >UniRef50_C0ZFU4 Putative uncharacterized protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0ZFU4_BREBN Length = 359 Score = 119 bits (297), Expect = 1e-25, Method: Composition-based stats. Identities = 28/221 (12%), Positives = 67/221 (30%), Gaps = 26/221 (11%) Query: 37 ALSDPTLTVQAYTV---NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGI 93 +L + V +P ++ + + + + + +A+N G Sbjct: 137 SLQEGGYRGYMAKVRLNDPNALKMVL-----ANNSVKSKGETTSQAGKRTGSILAINAGG 191 Query: 94 Y----DESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTS 149 + + PLG+ + +G+ + + G ++ A T Sbjct: 192 FMSDKQGNLTPLGITVVDGK-IRTFSNNAKLSFVGFNNKGHLVGTS-----IKTQAQITQ 245 Query: 150 KEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQAT------- 202 + I P L++ G P + + R +G +G+ + ++ Sbjct: 246 QGILQGASFLPRLLQGGKRLPIPREWANARQPRTLIGHFDNGDLLLIVIDGRRDGWSNGV 305 Query: 203 NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYP 243 + A + +V LDG S + G + + Sbjct: 306 TLEE-AQRKLQEWHVVDAYNLDGGGSSAFYYNGKLLNKPSG 345 >UniRef50_A0YND3 Putative uncharacterized protein n=1 Tax=Lyngbya sp. PCC 8106 RepID=A0YND3_9CYAN Length = 304 Score = 119 bits (297), Expect = 1e-25, Method: Composition-based stats. Identities = 33/228 (14%), Positives = 70/228 (30%), Gaps = 19/228 (8%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMY--WQKANGEAWGTLHALLADINSQ 82 L + L + ++ T +++ + + ++ + Sbjct: 45 LFQGITYQRIRRS-KPDPLMIHIVKIDLTTPGIELLVTPGEQGEDDQDISAQTTSEFLQK 103 Query: 83 GQVQMAMNGGIYDESY--APLGLYIENGQQKVALNLASGEGNFFIRPGG---VFYVAGDK 137 +Q+A+NG + Y P+ Y +G++ A +G + V ++ K Sbjct: 104 HYLQLAINGSFFHPFYVHNPIDYYPNSGERVNIFGQAISQGKIYSIVNKGWSVLCISPKK 163 Query: 138 VGIVRLDAFKTSKEIQFAVQSGPMLMENGV-INPRIHPNVASSKIRNGVGINKHG-NAVF 195 + D K + +L++ G I + + R V I+K G Sbjct: 164 KAEIYFD--TCPKNTLQGIAGNLILIDQGQPIKVKKFSDANQKFPRTAVAIDKTGETLWL 221 Query: 196 LLSQQATNFYD-------FACYAKAKLNVEQLLYLDGTISHMYMKGGA 236 +L ++Y + VE L DG S + Sbjct: 222 ILIDGRQSWYSKGVTLATLTNIIQELDGVETALNFDGGGSTTLVISEG 269 >UniRef50_A7LRK4 Putative uncharacterized protein n=1 Tax=Bacteroides ovatus ATCC 8483 RepID=A7LRK4_BACOV Length = 326 Score = 118 bits (296), Expect = 2e-25, Method: Composition-based stats. Identities = 29/258 (11%), Positives = 65/258 (25%), Gaps = 40/258 (15%) Query: 28 LFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEA------WGTLHALLADINS 81 + + V+ V + + + + Sbjct: 75 GVRITDVIFTYCAKPTRMIIAEVDLNK-NVTIVTSTPDNKPEVGKILQQVTVQAEKAEAA 133 Query: 82 QGQVQMAMNGGIYDESY---APLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKV 138 +V + NG Y + P GL+ ++G + F++ G Sbjct: 134 GRKVILGTNGDFYSKKNDLWIPGGLFYKDGVAIKTEIGWEADHVFYMLKDG-------TA 186 Query: 139 GIVRLDAFK-TSKEIQFAVQSGPMLMENGVINPRI--HPNVASSKIRNGVGINKHGN-AV 194 I + FK +E+ A+ ++++G + + N R VG++ Sbjct: 187 HITSVPEFKLVEREVVHAIGGWQRMVQDGEVVKNFTVNDNAMQFHPRTFVGVSADNRKVY 246 Query: 195 FLLSQQATNFYDFACYAKAKL------NVEQLLYLDGTISHMYMKG---------GAIPW 239 + Y + + Q +DG S ++ + Sbjct: 247 LFVVDGRQPEYSNGMRLEDMMLLCQGAGCYQAFNMDGGGSTTMVRRVEKGSSVSFEVMNQ 306 Query: 240 QRYPFV----TMISVERK 253 + V K Sbjct: 307 PSDNPARSVINGLQVIEK 324 >UniRef50_B9YC35 Putative uncharacterized protein n=2 Tax=Holdemania filiformis DSM 12042 RepID=B9YC35_9FIRM Length = 368 Score = 117 bits (292), Expect = 5e-25, Method: Composition-based stats. Identities = 29/220 (13%), Positives = 61/220 (27%), Gaps = 26/220 (11%) Query: 34 DDCALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGG 92 + L T + V +P V +G ++ + MN G Sbjct: 145 EIIDLKGTTFEGKLMIVHDPSRVFVACNPNMDSGAPGYSVEKYIEL----NDAIAGMNAG 200 Query: 93 IYDESY------APLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAF 146 ++++ G+ I +G+ + + I V Sbjct: 201 GFEDAGGNGNGGTAYGIVIHDGKLISG-SPSEFTPVIGINNANQLVVGDMTA------QQ 253 Query: 147 KTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQAT---- 202 +I+ AV GP+ ++N + R +G G + ++ Sbjct: 254 ALDYDIRDAVTFGPVFIKNWEVVFESGR-HPGLNPRTVIGQRYDGAFLLMVLDGRQPSSF 312 Query: 203 --NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQ 240 + D + + + LDG S + + G Sbjct: 313 GSTYQDIIDIMQ-QYDAVNAANLDGGNSTVMVYDGETLNT 351 >UniRef50_C3R3M8 Putative uncharacterized protein n=1 Tax=Bacteroides sp. 2_2_4 RepID=C3R3M8_9BACE Length = 329 Score = 116 bits (291), Expect = 5e-25, Method: Composition-based stats. Identities = 38/288 (13%), Positives = 70/288 (24%), Gaps = 58/288 (20%) Query: 13 TLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTL 72 +K+ TL +V L + G+ + Sbjct: 41 LGWVKQTTEFGTLPEYISVYKSPSELEGMKAIAFIAVADMSKANFATI-----GDQIYSK 95 Query: 73 HALLADINSQGQVQMAMNGGIYDES-YAPLGLYIENGQQK---------VALNLASGEGN 122 Q + + MNGG + + L G+ + G Sbjct: 96 TPNQIWQAEQQKYPIIMNGGYFVMGAGKSVSLLCREGEVLAVNSQEEIRSQKSYYPTRGI 155 Query: 123 FFIRPGGVFYVA---GDKVGIVRLDAFK---------------------TSKEIQFAVQS 158 F + G F G+ + A+ Sbjct: 156 FQLSKNGYFSTDWAYTTTDGVTYTYEQPSPNKSGYEPQPAPSAYFPTRGVKLNAETAIGG 215 Query: 159 GPMLMENGVINPRIHPNV---------ASSKIRNGVGINKHGNAVFLLSQQA-------- 201 GP+L+++G + + S R +G+ + +F + + Sbjct: 216 GPILLKDGSVRNTFIEELFDEESGVAPESYHPRTAIGVTANNKVIFFVCEGRSVTEGVKG 275 Query: 202 TNFYDFACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRYPFVTMI 248 N A K+ L + LDG S M + G + I Sbjct: 276 MNMAMMANILKS-LGCVDAMNLDGGGSTCMLVNGQPVIKPSAGAQRAI 322 >UniRef50_C5RID5 Putative uncharacterized protein n=1 Tax=Clostridium cellulovorans 743B RepID=C5RID5_CLOCL Length = 347 Score = 116 bits (291), Expect = 6e-25, Method: Composition-based stats. Identities = 37/230 (16%), Positives = 62/230 (26%), Gaps = 32/230 (13%) Query: 37 ALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYD 95 + + +P + M + G +++ A+NGG + Sbjct: 118 KIEHDRYIAHILEIKDPTKIKAVMT------KYVGKNGQKTSEMALDYDAIAAINGGAFA 171 Query: 96 E-----------SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIV-RL 143 + P G I NG N + V + K+ + Sbjct: 172 DVSASGQKWAGNGAIPGGFVITNGAIVYP----KENVNKYDVQNVVAFTKEGKLVVGDYC 227 Query: 144 DAFKTSKEIQFAVQSGP-MLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQAT 202 + + A+ P ++ +GV + R +G G V L+ T Sbjct: 228 INDLMAMGVTEAMCFRPPSIIIDGVAQIT-DKLQDGTNPRTAIGQKADGTVVLLVIDGRT 286 Query: 203 ------NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVT 246 YD K LNV LDG S G I + Sbjct: 287 LSMPGATLYDVQQIFKD-LNVVNAGNLDGGYSSTMYFNGEIINSPNAWSG 335 >UniRef50_A9WEC1 Putative uncharacterized protein n=3 Tax=Chloroflexus RepID=A9WEC1_CHLAA Length = 265 Score = 116 bits (291), Expect = 7e-25, Method: Composition-based stats. Identities = 40/220 (18%), Positives = 82/220 (37%), Gaps = 27/220 (12%) Query: 34 DDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGI 93 L P L VQ ++P R + + + + ++ A+NGG Sbjct: 55 AFRQLEAPGLPVQVVRIDPAHVRFVVGYDPTSPLTL-------SAWVARYGAVAAINGGF 107 Query: 94 YDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRL--DAFKTSKE 151 +D+ P+ L I N Q G ++ GG+F + + D Sbjct: 108 FDQQGEPVALLISNQQV---------FGYSYVDQGGMFAIDEQGKPHLWSLADQPYDGTP 158 Query: 152 IQFAVQSGPMLME-NGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA-TNFYDFAC 209 A+Q P+L+ NG + R+ + ++++G + +++ A + +++ Sbjct: 159 FVQAIQGWPLLVRTNGEA--AYTDDDGQRARRSAIALDRNGYVLLIVAPGATFSLAEWSQ 216 Query: 210 YAKA-KLNVEQLLYLDGTISHMYM----KGGAIPWQRYPF 244 + + L++E + LDG S + +GG P Sbjct: 217 FLASADLDIEIAVNLDGGSSSGLIAQSDQGGVRVDSFTPL 256 >UniRef50_Q1IXP5 Peptidoglycan-binding LysM domain-containing protein n=2 Tax=Deinococcus RepID=Q1IXP5_DEIGD Length = 444 Score = 116 bits (291), Expect = 7e-25, Method: Composition-based stats. Identities = 38/226 (16%), Positives = 72/226 (31%), Gaps = 11/226 (4%) Query: 34 DDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGI 93 L + VQ V+ + V + A ++ + Q +NG Sbjct: 214 SFKQLKALNIPVQVLRVDLRHRNVLVAPVLPRTGLGTAGGARVSTLARTSGAQAVVNGSY 273 Query: 94 YDE-SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEI 152 + SYAP G + G+ I P + ++ + + Sbjct: 274 FHPRSYAPAGDLVVQGRLLA---WGRIPVALAITPDNRAAIMTSTTPLLGRPLEVSWHGM 330 Query: 153 QFAVQSGPMLMENGVINPRI-----HPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDF 207 + + +GP ++ G + + P + R+ VG+ + + VF+ + + Sbjct: 331 ETVIATGPRILNGGTVVRQYASAFRDPALFGRAARSAVGLKSNRDLVFVTTHAKLTTTEM 390 Query: 208 ACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRYPFVTMISVER 252 A+L V L LDG S + G A+ I V Sbjct: 391 GKVM-ARLGVRDALLLDGGSSAGLAWNGQAVLDSVRKVAYGIGVFT 435 >UniRef50_B7H7U4 Putative uncharacterized protein n=27 Tax=Bacillus cereus group RepID=B7H7U4_BACC4 Length = 365 Score = 116 bits (290), Expect = 8e-25, Method: Composition-based stats. Identities = 27/216 (12%), Positives = 55/216 (25%), Gaps = 23/216 (10%) Query: 43 LTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE------ 96 + T+ + + G ++ + + +A+N + + Sbjct: 115 FEGKLVTI---SNPFNVKLVSHQGTQGANRGEKISVMAKRNHALVAVNASGFADETGRGG 171 Query: 97 SYAPLGLYIENGQQKVALNLASGEGNF-FIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFA 155 G+ IENG+ + + G K++ A Sbjct: 172 GNVATGIVIENGKAIDTNMDRNAPTIITGLTKFGQMITGN------YSTQQLLDKQVVSA 225 Query: 156 VQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQAT------NFYDFAC 209 P L+ NG S R+ + + G +FL+ + Sbjct: 226 AGFMPQLIVNGEKMITEGDGGWGSAPRSIMAQKEDGTIMFLVIDGRQTHSIGATLKECQD 285 Query: 210 YAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFV 245 K + +DG S GG + Sbjct: 286 ILYEK-GAINAMAMDGGSSATLYLGGKVINSPSTLS 320 >UniRef50_Q9UK23 N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase n=29 Tax=Chordata RepID=NAGPA_HUMAN Length = 515 Score = 115 bits (289), Expect = 1e-24, Method: Composition-based stats. Identities = 33/222 (14%), Positives = 63/222 (28%), Gaps = 29/222 (13%) Query: 38 LSDPTLTVQAYT-VNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE 96 D + V P + G A + + ++A NGG + Sbjct: 85 FRDRAVAGHLTRAVEPLR-TFSVLEPGGPGGCAARRRATVEETARAADCRVAQNGGFFRM 143 Query: 97 S-YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFA 155 + LG + + ++ + F IR G + + T Sbjct: 144 NSGECLGNVVSDERRVSSSGGLQ-NAQFGIRRDGTLVTG----YLSEEEVLDTENPFVQL 198 Query: 156 VQSGPMLMENGVINPR---------------IHPNVASSKIRNGVGINKHGNAVFLLSQQ 200 + L+ NG I V R +G ++ G V + Sbjct: 199 LSGVVWLIRNGSIYINESQATECDETQETGSFSKFVNVISARTAIGHDRKGQLVLFHADG 258 Query: 201 -----ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAI 237 N ++ A + + +V + LDG S ++ G + Sbjct: 259 HTEQRGINLWEMAEFLLKQ-DVVNAINLDGGGSATFVLNGTL 299 >UniRef50_C1XS52 Predicted periplasmic protein (DUF2233) n=1 Tax=Meiothermus silvanus DSM 9946 RepID=C1XS52_9DEIN Length = 294 Score = 115 bits (289), Expect = 1e-24, Method: Composition-based stats. Identities = 39/254 (15%), Positives = 77/254 (30%), Gaps = 27/254 (10%) Query: 6 LIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKAN 65 LI + RI T + V + V A VN V + Sbjct: 48 LISNTIHPGQRLRIRPPATSFSVKLVTRPVL-----KVPVLAVHVNLAHPEVSIRSLLPP 102 Query: 66 GEAWGTLHALLADINSQGQVQMAMNGGIYDE-SYAPLGLYIENGQQKVALNLASGEGNFF 124 +L + + ++ A+NGG + ++ P G + G Q V Sbjct: 103 PGVGRG-GEVLQRLAWRTRLVAAINGGYFHPRTFWPAGDLVVGGHQLVK----------G 151 Query: 125 IRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRI------HPNVAS 178 + + ++ + + +GP ++ G + P + Sbjct: 152 SIQTALAITPDKRARVMVGPQTWR--GYETVIANGPYILRRGRLVVTPRAEGYNDPAIWG 209 Query: 179 SKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISH-MYMKGGAI 237 R+ VG+ +F+ ++ + AKL ++ + LDG S + KG + Sbjct: 210 RARRSAVGVVNERYLIFVSTKMELTLSELGKVM-AKLGAKEAIVLDGGSSTGLVWKGETL 268 Query: 238 PWQRYPFVTMISVE 251 I + Sbjct: 269 IRPGRALSYGIGIF 282 >UniRef50_A6L611 Putative uncharacterized protein n=1 Tax=Bacteroides vulgatus ATCC 8482 RepID=A6L611_BACV8 Length = 308 Score = 115 bits (288), Expect = 1e-24, Method: Composition-based stats. Identities = 32/255 (12%), Positives = 62/255 (24%), Gaps = 44/255 (17%) Query: 23 LTLLPLFAVA-ADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINS 81 L A ++S V V+ + + + L+ + Sbjct: 40 TIAPALIHYRFAGYDSISQAHQNVDVLEVDLTSPSYDIQLV------YEEHGDSLSSVAE 93 Query: 82 QGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIV 141 + A+NG E G+ L S ++ G + +K + Sbjct: 94 RNNAAAAINGTYEAE----ASFIKIGGRLLAQNRLDSTHIRYWKHEGAFLFDDDNKNIDI 149 Query: 142 R--LDAFKTSKEIQFAVQSGPMLMENGVIN------------------PRIHPNVASSKI 181 R D+ S + PML++N + Sbjct: 150 RFASDSTFLSHPAANILSGAPMLIDNNDPVGLNFTGNVEGMDLNKLDYEDFRRHQGVRHP 209 Query: 182 RNGVGINKHGNAVFLLSQQATN------FYDFACYAKAKLNVEQLLYLDGTISHMYMK-- 233 R V + +H + + + + + + L LDG S Sbjct: 210 RTAVALTEHKKLLLITVDGRSTQAAGMSANELTRFLLTYFCPQSALNLDGGGSTTMWIAS 269 Query: 234 -----GGAIPWQRYP 243 GG + Sbjct: 270 SEQRVGGVVNHPTDN 284 >UniRef50_B1I1S0 N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase-like exopolysaccharide biosynthesis protein n=1 Tax=Candidatus Desulforudis audaxviator MP104C RepID=B1I1S0_DESAP Length = 345 Score = 115 bits (288), Expect = 1e-24, Method: Composition-based stats. Identities = 31/231 (13%), Positives = 59/231 (25%), Gaps = 13/231 (5%) Query: 30 AVAADDCALSDPTLTVQAYTVNPQTERV-KMYWQKANGEAWGTLHALLADINSQGQVQMA 88 V L V P V ++ +++ GE + Sbjct: 115 RVEVKIFELKGIGYRGYIAKVKPFDPGVLRVTYREGPGETTSEAVRRTGAVLGVNGGGFY 174 Query: 89 MNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKT 148 P+G + +G+ + F G V G I Sbjct: 175 RAPVDGLMHTLPIGNTMVDGKLVGGFQPPREDLFFAGFDGRGRLVGG----IFNDRTALL 230 Query: 149 SKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQAT------ 202 + V P+L+++ P + R +G +G+ + ++ Sbjct: 231 GTGARQGVSFVPILIKDRQPVPIPEKWRNQRQPRTILGEYANGDLIMIVVDGRQADWSSG 290 Query: 203 -NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVER 252 D K V LDG S +++ G I + + Sbjct: 291 VTLEDL-QVTLIKFGVIDAYNLDGGGSSVFVFGNQILNRPSDGRERVVATN 340 >UniRef50_Q8A0T0 Putative uncharacterized protein n=10 Tax=Bacteroides RepID=Q8A0T0_BACTN Length = 308 Score = 115 bits (288), Expect = 1e-24, Method: Composition-based stats. Identities = 40/211 (18%), Positives = 72/211 (34%), Gaps = 26/211 (12%) Query: 40 DPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDES-Y 98 T ++ +NP+T K G A+ ++ I + Q A+NG +D + Sbjct: 80 QGTQSINILEINPKT-------GKKIGIAFTGQLEKISRIARKHQAIGAINGSYFDMTKG 132 Query: 99 APLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAF---KTSKEIQFA 155 + Q +L +R G Y KV ++ D K Sbjct: 133 NSVCFLKVGSQVVDTTSLDE----LKLRVTGAVYEKKGKVKLIPWDRQIEKNYKKNKGSV 188 Query: 156 VQSGPMLMENGVINPRIHPN---VASSKIRNGVGINKHGNAVFLLSQQA-------TNFY 205 + SGP+++++G N + + R+ + + + G +F+ N Sbjct: 189 LASGPLMLKDGEYYDWSQCNANFIETKHPRSAICLTEEGKILFVTVDGRSPENAVGINIP 248 Query: 206 DFACYAKAKLNVEQLLYLDGTISHMYMKGGA 236 + A L + L LDG S GA Sbjct: 249 ELAHLL-HVLGGKDALNLDGGGSTALWLSGA 278 >UniRef50_B7ASL4 Putative uncharacterized protein n=1 Tax=Bacteroides pectinophilus ATCC 43243 RepID=B7ASL4_9BACE Length = 367 Score = 115 bits (288), Expect = 1e-24, Method: Composition-based stats. Identities = 35/226 (15%), Positives = 66/226 (29%), Gaps = 23/226 (10%) Query: 34 DDCALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGG 92 + + T + + +P V + + ++A I ++ +NGG Sbjct: 142 EIVDIKGTTYRGKLMIIKDPSRVFVGTVP-----QFFEGDGKVVAKIAARYNAVGGVNGG 196 Query: 93 IYDES-----YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFK 147 + + P+GL + +G+ + A+ + V + Sbjct: 197 EFVDGELTYTAMPVGLVMTDGRIVNG-DTATRCHVTGFTKDNILVVGNMTGQQALDMGMR 255 Query: 148 TSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQAT----- 202 I ++ GP L+ NG R VG G + L Sbjct: 256 DCVSISSSI--GPFLIINGEAQDVSGVGG-GLNPRTAVGQRADGAVLLLAIDGRQANSLG 312 Query: 203 -NFYDFACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRYPFVT 246 +F D + + +DG S MY +G I P Sbjct: 313 ASFADLLYIMQ-QYGAVNASTMDGGTSTQMYYEGSVINTPYSPTGP 357 >UniRef50_C8WU56 N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase-like protein exopolysaccharide biosynthesis protein n=2 Tax=Alicyclobacillus acidocaldarius RepID=C8WU56_ALIAD Length = 296 Score = 114 bits (286), Expect = 2e-24, Method: Composition-based stats. Identities = 32/223 (14%), Positives = 62/223 (27%), Gaps = 32/223 (14%) Query: 38 LSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE 96 + +P T V +P+ + G + + + +NGG + + Sbjct: 77 IHEPNFTAYVLWVRDPRRVEIV------ETRYAGDVGETVEQFVNDWHAVAGVNGGSFTD 130 Query: 97 ------SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSK 150 G+ I NG+ + F G A + Sbjct: 131 TNWQGTGGLVQGIVISNGRILKRASGPESIVGF--TADGRLISG------TYTLAELQAM 182 Query: 151 EIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQAT-------- 202 + A+ GP L++ G ++ R +G G + +++ Sbjct: 183 GVTQALMFGPTLVDRG-VDQIQGAGDWGYAPRTAIGQTADGTVILMVTDGRELHGPADIG 241 Query: 203 -NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPF 244 + D A + L LDG S + G + Q Sbjct: 242 ASLGDIARLMIS-LGAVTAANLDGGSSATLVYDGCLINQPTDI 283 >UniRef50_C5PL46 Exopolysaccharide biosynthesis protein n=2 Tax=Sphingobacterium spiritivorum RepID=C5PL46_9SPHI Length = 288 Score = 114 bits (286), Expect = 2e-24, Method: Composition-based stats. Identities = 30/238 (12%), Positives = 64/238 (26%), Gaps = 25/238 (10%) Query: 25 LLPLFAVAADDCA-LSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQG 83 L P L + ++ Q + + T S+ Sbjct: 39 LKPGIIWKQGHFDNLFKAEQEINFIEIDLQKIKQPIRLAG-----LQTGFKNTTTFASEA 93 Query: 84 QVQMAMNGGIYDE-SYAPLGLYIENGQQKVALNLASGEG-NFFIRPGGVFYVAGDKVGIV 141 A+NG ++ + L N Q L G+ R K+ I+ Sbjct: 94 NALAAINGAFFNTKTGGGTTLVRINKQLINETVLKEGKSPKRSFRSNAALAFDTKKIVII 153 Query: 142 ----RLDAFKTSKEIQFAVQSGPMLM-ENGVINPRIHPNVASSKIRNGVGINKHGNAVFL 196 R + ++ + GP+L+ ++ + + R+ + + + + Sbjct: 154 KGDDRDSTWDKKIKMPNVMTCGPLLLHKSHRAYLDSNAFNNNRHPRSAIALTTEHKLILI 213 Query: 197 LSQQA------TNFYDFACYAKAKLNVEQLLYLDGTISHMYM-----KGGAIPWQRYP 243 + + + K + L LDG S K G + + Sbjct: 214 TVDGRNAQAYGMSLIELSNVMKWLKG-KDALNLDGGGSTTLYIKDEGKNGIVNYPTDN 270 >UniRef50_Q7X4R9 XcbC n=1 Tax=Neisseria meningitidis RepID=Q7X4R9_NEIME Length = 256 Score = 114 bits (285), Expect = 3e-24, Method: Composition-based stats. Identities = 29/247 (11%), Positives = 71/247 (28%), Gaps = 31/247 (12%) Query: 15 NLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHA 74 ++ IF+ + A C + +N + + + + + Sbjct: 6 SILSIFILSFFNSEYTYAQSLCIQQSSQNHIHIAKINLNCKGINLIATQEADK-----GM 60 Query: 75 LLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYV- 133 ++ + + +A+NG + Y P GL I + + + Sbjct: 61 TVSQFARKYRTDIAINGSFFRTGYFPFGLAITDHKTWDKTRDVQKRVFLACNRQNRCMIE 120 Query: 134 ---------AGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNG 184 K+ + +F + + P+ G + + + + R Sbjct: 121 DKNMVSKVDDSWKLAVSGWQSFNPATKKFECSDDDPV----GCTHIKFI----TKQPRTM 172 Query: 185 VGINKHGNAV-FLLSQQA------TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAI 237 VG+++ N + ++ + A L + + + LDG S +KG Sbjct: 173 VGLDEKRNYLYLVVIDGRLPKFKGATLNELGQ-LAASLKLTKAINLDGGGSSTMVKGYNR 231 Query: 238 PWQRYPF 244 Sbjct: 232 ISTLPAT 238 >UniRef50_B4CZJ8 Putative uncharacterized protein n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4CZJ8_9BACT Length = 251 Score = 114 bits (285), Expect = 3e-24, Method: Composition-based stats. Identities = 44/214 (20%), Positives = 67/214 (31%), Gaps = 22/214 (10%) Query: 27 PLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQ 86 +T+ T NP+ + GT + Sbjct: 34 TELEFTERHVQGDAGDVTLWVVTFNPKACAFAVMDNPTGAFDLGTASE-------KRGAL 86 Query: 87 MAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAF 146 +NGG + PLGL + G + L A GV V + + R AF Sbjct: 87 AGVNGGYFHPDRTPLGLVVRQGVEIHPLERAK-------LLSGVLSVMPTTITLQRTGAF 139 Query: 147 KTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYD 206 K S ++ A+Q+GP L+E P + R V N G FL+ + T Sbjct: 140 KGSSAVREALQAGPFLIEKEKPIPGLEA--TKEAARTVVFQNAKGRCGFLICKS-TTLAG 196 Query: 207 FACYAKA-----KLNVEQLLYLDGTISHMYMKGG 235 A + + + + LDG S G Sbjct: 197 MADLLATSSIFPEGKIIRAMNLDGGTSTALWVRG 230 >UniRef50_C4Z6E6 Putative uncharacterized protein n=1 Tax=Eubacterium eligens ATCC 27750 RepID=C4Z6E6_EUBE2 Length = 360 Score = 114 bits (285), Expect = 3e-24, Method: Composition-based stats. Identities = 35/225 (15%), Positives = 70/225 (31%), Gaps = 32/225 (14%) Query: 36 CALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIY 94 +S + + + +P +V + WG L +I S +NGG+Y Sbjct: 118 VEISGRSFFGKMLIIKDPSQVKVGTTY------PWGDYGKELHEIVSGAGAIAGVNGGLY 171 Query: 95 ----DESYAPLGLYIENGQQ-KVALNLASGEGNFFIRPGGVFYVAG-DKVGIVRLDAFKT 148 + +PLG+ +++G+ + + SG + + V D + +++ Sbjct: 172 VSSGNRGGSPLGIVVQDGKITYNSPSALSGLYLIGLNKDNLLVVKDIDGMSAADFESYVN 231 Query: 149 SKEIQFAVQS----------GPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLS 198 I+ AV L+ N + R +G G + L++ Sbjct: 232 EAGIRDAVAFQEESSDSNNHFVPLIINNEARVLKGQGS-GANPRTAIGQRVDGAILLLVT 290 Query: 199 QQA-------TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGA 236 D + + LDG S + G Sbjct: 291 DGRGASGHLGATASDLISVMQ-EYGAVNAANLDGGSSSTMVYNGG 334 >UniRef50_C6XWN0 Putative uncharacterized protein n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6XWN0_PEDHD Length = 328 Score = 114 bits (284), Expect = 4e-24, Method: Composition-based stats. Identities = 32/210 (15%), Positives = 63/210 (30%), Gaps = 18/210 (8%) Query: 34 DDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQG----QVQMAM 89 + + V+ +T ++++ + L L + A+ Sbjct: 91 AFLRKDKLPVRIFIMEVDMKTPKLEIQAMAPYNDYINGLQRLSEMCRDNELPGTNIVAAV 150 Query: 90 NGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTS 149 NG + + AP L+ N + A+G F G + G V + Sbjct: 151 NGDTFSTTGAPTSLFYINNRVYYGTV-ATGRTFFAAMKDGTIVIGGKDTKGV--ERPVDK 207 Query: 150 KEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQ-------AT 202 +I+ AV L++N + + R +G N + ++ Sbjct: 208 AQIKNAVGGNQWLVDNNIKATLTDATI---SARTAIGYNANKVIYAIVVDGSQATYSNGL 264 Query: 203 NFYDFACYAKAKLNVEQLLYLDGTISHMYM 232 D A L + + LDG S + Sbjct: 265 TLVDLRDIM-AALGTKDAVNLDGASSSTLV 293 >UniRef50_Q73Q09 Putative uncharacterized protein n=1 Tax=Treponema denticola RepID=Q73Q09_TREDE Length = 293 Score = 114 bits (284), Expect = 4e-24, Method: Composition-based stats. Identities = 35/220 (15%), Positives = 73/220 (33%), Gaps = 28/220 (12%) Query: 34 DDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTL--HALLADINSQGQVQMAMNG 91 D L + A ++ ++K+ + + + +A+N Sbjct: 57 SHIKYEDYPLIIHAVKIDLTNPKLKIVVTEPALFNSKGMVKRETTLSFARRHNTVIALNA 116 Query: 92 GIYDE-------SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLD 144 ++ PLG++I+ F + G + ++ + I+ Sbjct: 117 AFFNVISFSFSLRGEPLGIHIDKKINLSKP---------FPKYGALCFLDDNSAFIIESQ 167 Query: 145 A-FKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATN 203 +I++AV ++++NG P I R VG+ G ++L + N Sbjct: 168 NTEDIKADIEYAVSGNRIILKNGK--PIITNISKKENSRTCVGLADGGKTLYLFFAEGEN 225 Query: 204 -------FYDFACYAKAKLNVEQLLYLDGTISHMYMKGGA 236 YD A + KL + ++LDG S + Sbjct: 226 KKKSRGITYDQAHFFMKKLGAQDAIHLDGGGSSSLIIKKE 265 >UniRef50_C6D0A3 Exopolysaccharide biosynthesis protein n=1 Tax=Paenibacillus sp. JDR-2 RepID=C6D0A3_PAESJ Length = 349 Score = 113 bits (283), Expect = 5e-24, Method: Composition-based stats. Identities = 30/207 (14%), Positives = 56/207 (27%), Gaps = 29/207 (14%) Query: 42 TLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIY------ 94 + T+ NP ++ + I + + A+N + Sbjct: 112 NYKGKIITISNPNRVKLV-------SSKLSDHGEQIFVIAKRAKALAAINASGFVDLDGH 164 Query: 95 DESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQF 154 A G+ IE+G + N + E I GV + +Q+ Sbjct: 165 GNGGASTGVVIEDG-VIKSQNKNTKEFVAGITKDGVMITGK------YSANELVNLGVQY 217 Query: 155 AVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATN------FYDFA 208 A P L+ NG R +G G+ +F++ + Sbjct: 218 AAGFKPQLIVNGQKMV-EGDGGWGWGPRTAIGQKADGSIIFVVIDGRQTRSVGASIKEVQ 276 Query: 209 CYAKAKLNVEQLLYLDGTISHMYMKGG 235 + + +DG S G Sbjct: 277 DLLYER-GAVNAMCMDGGSSSSMYFNG 302 >UniRef50_C4Z4Z5 Putative uncharacterized protein n=1 Tax=Eubacterium eligens ATCC 27750 RepID=C4Z4Z5_EUBE2 Length = 388 Score = 113 bits (282), Expect = 6e-24, Method: Composition-based stats. Identities = 36/227 (15%), Positives = 65/227 (28%), Gaps = 19/227 (8%) Query: 32 AADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNG 91 D + D + + + +++ G ++ADI + +NG Sbjct: 159 ETPDIEIVDVKGATYSGKLMIVKDPSRLFVGTVPEFTNGN-GMVVADIAKRYDAIGGVNG 217 Query: 92 GIYDESYA-----PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAF 146 G + + P+GL +++G+ S I + + Sbjct: 218 GEFVDGETTYTAMPIGLVMKDGEILNDNGGTSHVT--GITFDNKLVLGNMNAAKAKELNI 275 Query: 147 KTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQAT---- 202 + I + GP L+ NG + + R +G G + L Sbjct: 276 RDCVSISNHI--GPFLIVNGEAQDIVGIAG-GTNPRTAIGQTADGKILLLAVDGRQPNSI 332 Query: 203 --NFYDFACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRYPFVT 246 F D A+ +DG S MY G I P Sbjct: 333 GATFSDLQDIM-AQYGAVNASTMDGGTSTQMYYDGEVINVPYSPTGP 378 >UniRef50_A6L610 Putative uncharacterized protein n=1 Tax=Bacteroides vulgatus ATCC 8482 RepID=A6L610_BACV8 Length = 287 Score = 112 bits (281), Expect = 8e-24, Method: Composition-based stats. Identities = 26/184 (14%), Positives = 56/184 (30%), Gaps = 19/184 (10%) Query: 74 ALLADINSQGQVQMAMNGGIYD-ESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFY 132 + + Q + A+NG + + +N +R G + Sbjct: 85 TTTSQLAEQSRSSAAINGSYFSIKEGFSTCYLRKNEAVIDTTTTEER----HLRVNGAVH 140 Query: 133 VAGDKVGIVRLDAFKTSKEIQ---FAVQSGPMLMENGVINPRIHPN---VASSKIRNGVG 186 + + + I+ + K + SGP+LM++G + + R+ + Sbjct: 141 MVDNNIRIIPWNDENEKKGFPLDGDILASGPLLMQDGKTCDFTTIDREFSETRHPRSAIA 200 Query: 187 INKHGNAVFLLSQQA-------TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPW 239 + K G+ + + + + A + L L LDG S G + Sbjct: 201 LTKEGDIMLVAVDGRAEGHADGMSIAELAYLLRI-LKAHCALNLDGGGSTTLWVNGQVVN 259 Query: 240 QRYP 243 Sbjct: 260 HPSD 263 >UniRef50_C8PNM8 Putative uncharacterized protein n=1 Tax=Treponema vincentii ATCC 35580 RepID=C8PNM8_9SPIO Length = 306 Score = 112 bits (279), Expect = 1e-23, Method: Composition-based stats. Identities = 39/231 (16%), Positives = 71/231 (30%), Gaps = 34/231 (14%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKAN----------GEAWGTLHA 74 L P A D A L V ++ V + +A GE Sbjct: 64 LSPGIE--AADIADPQLPLIVHIVKIDLLNPSVSVITSEAALFKNTRGRIRGETTRDFAL 121 Query: 75 LLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVA 134 I + N ++ +G++I + ++ N G F+ Sbjct: 122 RHNTIAAFNAAPFKTNSLLFSIYRTIVGIHITDFRRMSMPNERYGALLFY---------K 172 Query: 135 GDKVGIVRLDAFKT-SKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNA 193 I+ S ++++AV ++ NG I P+ + R VG+ G Sbjct: 173 DKTARIIGSQTEDALSADVRYAVGGFWTILRNGTIVPQ---KLHRRDSRTAVGLADSGKT 229 Query: 194 VFLL-SQQ-------ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGA 236 +F+ + +F + A + L + L LDG S + Sbjct: 230 LFVAAVEGENKRKSRGLSFEETAMLMQ-TLGADDALQLDGGSSSTLVLQEN 279 >UniRef50_UPI000180BA0C PREDICTED: similar to predicted protein n=1 Tax=Ciona intestinalis RepID=UPI000180BA0C Length = 621 Score = 112 bits (279), Expect = 2e-23, Method: Composition-based stats. Identities = 44/258 (17%), Positives = 76/258 (29%), Gaps = 38/258 (14%) Query: 8 GKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTV-NPQTERVKMYWQKANG 66 + + + R F++ L ++ + + T+ NP + G Sbjct: 68 SENLFPVTNTRYFVSDIALNQWST------YNKNHVYGHVTTIYNPSKTFSVLEAT--YG 119 Query: 67 EAWGTLHALLADINSQGQVQMAMNGGIYDESYAP-LGLYIENGQQKVALNLASGEGNFFI 125 G + +GQ +A NGG ++ LG I G+ + + F I Sbjct: 120 GCQGNQLDTSVNAARRGQCFVAQNGGYFNTKTQSCLGNVISRGRTLHTSDATNAH--FGI 177 Query: 126 RPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASS------ 179 G V DA V L+ NG ++ SS Sbjct: 178 LSNGSIVVG------YISDADLRRLNFTNLVGGVIWLVRNGTSFVEESVSMESSDTEETG 231 Query: 180 ---------KIRNGVGINKHGNAVFLLSQQ-----ATNFYDFACYAKAKLNVEQLLYLDG 225 R +G +KHG V + N F+ + L++ + LDG Sbjct: 232 TLRYFSDVQSARTAIGHDKHGWVVLVQVDGQTGARGVNLNSFSKFLIEDLHLVNAINLDG 291 Query: 226 TISHMYMKGGAIPWQRYP 243 S + G + Sbjct: 292 GGSATLVINGTLANTPSD 309 >UniRef50_C4ICA6 Peptidase, M56 family n=1 Tax=Clostridium butyricum E4 str. BoNT E BL5262 RepID=C4ICA6_CLOBU Length = 568 Score = 112 bits (279), Expect = 2e-23, Method: Composition-based stats. Identities = 27/189 (14%), Positives = 52/189 (27%), Gaps = 25/189 (13%) Query: 37 ALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIY- 94 L+ + +P+ +V + + + I A+NGG + Sbjct: 392 QLTTDKYKGYYMEIKDPKRIKVGVAVK------LNEEGQTASKIAQNYNAVAAINGGGFL 445 Query: 95 ---------DESYAPLGLYIENGQQKVALNLASGEG-NFFIRPGGVFYVAGDKVGIVRLD 144 P+G+ + G+ + + F I V Sbjct: 446 DQSSTGYWNGTGGIPVGIIMSKGEVIYNDVEETEKTELFAIDKQRQMIVG------TYSV 499 Query: 145 AFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNF 204 K +Q AV GP L+ +G ++ R +G + G + L+ Sbjct: 500 EDLKEKGVQEAVSFGPSLIIDGKMSEMTGDGGWGIAPRTAIGQKEDGTIILLVIDGR-GI 558 Query: 205 YDFACYAKA 213 K Sbjct: 559 GSLGATLKE 567 >UniRef50_Q4UP44 Putative uncharacterized protein n=4 Tax=Bacteria RepID=Q4UP44_XANC8 Length = 439 Score = 111 bits (278), Expect = 2e-23, Method: Composition-based stats. Identities = 37/256 (14%), Positives = 72/256 (28%), Gaps = 28/256 (10%) Query: 22 ALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHAL-LADIN 80 LTL P + + + ++ T +++ + G A Sbjct: 173 PLTLAPGVRYWRQAIGGAQ-PVMLHIAQIDLTTPGLQLVGTPGDRSDGGEFRATPTTAFV 231 Query: 81 SQGQVQMAMNGGIYDES---------YAPLGL--YIENGQQKVALNLASGEGNFFIRPGG 129 G + +A+N + + P G A S R Sbjct: 232 RDGALTLAINADYFLPFDGGHLLDKPFVPAAGQGVTAEGLAIEAGRTDSAAATSDPRVNA 291 Query: 130 VFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNV---ASSKIRNGVG 186 V+ + + + V +GP+L+ +G PR + R+ VG Sbjct: 292 ALCVSQRDAVRIVRGS--CPAGSRLGVGAGPLLLLDGKRQPREASRAAYYDGPEPRSAVG 349 Query: 187 INKHGN-AVFLLSQQATNFYDFACYA------KAKLNVEQLLYLDGTISHMY---MKGGA 236 +++ G+ +++ Y +L + LDG S + G Sbjct: 350 LDRSGHTLWMVVADGRQPGYSAGMTLDALTAVFEQLGAHAAINLDGGGSSTLAARVDGDV 409 Query: 237 IPWQRYPFVTMISVER 252 R + ER Sbjct: 410 RALNRPIHTGIPGRER 425 >UniRef50_C6CV17 Exopolysaccharide biosynthesis protein n=1 Tax=Paenibacillus sp. JDR-2 RepID=C6CV17_PAESJ Length = 355 Score = 111 bits (278), Expect = 2e-23, Method: Composition-based stats. Identities = 28/219 (12%), Positives = 53/219 (24%), Gaps = 21/219 (9%) Query: 38 LSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDES 97 + A + ++ + G + +N G + + Sbjct: 132 IKADNFQSYAMKIKLKSGD---AMKMVLGNDKVGGAETTLAAVQRYGAVAGVNAGGFADG 188 Query: 98 Y---APLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQF 154 PL I NG + F V G + ++F Sbjct: 189 GGKRYPLSTTILNGDYVEGFEPTRADLFFVGLNASNKLVGGK----FTSKQQLDNLNVKF 244 Query: 155 AVQSGPMLMENGVI--NPRIHPNVASSKIRNGVGINKHGNAVFLLSQQAT-------NFY 205 P+L++NG P + + R + K G + +++ Sbjct: 245 GASFVPVLLKNGSPTTIPSKWQSSPTRAPRTVIANYKDGQLLIIVADGRNEGGSSGATLA 304 Query: 206 DFACYAKAKLNVEQLLYLDGTISHMY-MKGGAIPWQRYP 243 + +L LDG S G I Sbjct: 305 EM-QILLQRLGAVDGYNLDGGGSSSMIWNGRVINKPSDG 342 >UniRef50_C6J7B9 Exopolysaccharide biosynthesis protein n=2 Tax=Bacillales RepID=C6J7B9_9BACL Length = 355 Score = 111 bits (277), Expect = 2e-23, Method: Composition-based stats. Identities = 39/234 (16%), Positives = 68/234 (29%), Gaps = 29/234 (12%) Query: 34 DDCALSDPTLTVQAYTVNPQTER---VKMYWQKAN------GEAWGTLHALLADINSQGQ 84 + + ++ Y VNP T R +K+ + + G+ + Sbjct: 116 PFETIQSDRIRIELYKVNPGTYRGYAMKIRLKSPDAMKMTLGKDRLGGAETTMQAVQRYG 175 Query: 85 VQMAMNGGIYDESY---APLGLYIENGQQKVALNLASGEGNF-FIRPGGVFYVAGDKVGI 140 +N G + +S PL I NGQ + + F + G + Sbjct: 176 AVAGINAGGFADSRGQRYPLSTTILNGQYVNGFEPSYKDLFFVGLNQSGQLIGGKFQNKE 235 Query: 141 VRLDAFKTSKEIQFAVQSGPMLMENGV--INPRIHPNVASSKIRNGVGINKHGNAVFLLS 198 + +F P+L++NGV P R +G K + L+ Sbjct: 236 SLD-----KLKPKFGASFVPILLQNGVKLPIPDKWKTSPLRAPRTVIGNYKDDQLLVLVV 290 Query: 199 QQ-------ATNFYDFACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRYPF 244 + A L V+ LDG S + + G I Sbjct: 291 DGDNEKGRSGATLEELQNKL-ANLGVQDAYNLDGGGSSSLVVNGRVINHPSDGT 343 >UniRef50_C6JBU1 Putative uncharacterized protein n=1 Tax=Ruminococcus sp. 5_1_39BFAA RepID=C6JBU1_9FIRM Length = 291 Score = 111 bits (277), Expect = 2e-23, Method: Composition-based stats. Identities = 37/214 (17%), Positives = 71/214 (33%), Gaps = 16/214 (7%) Query: 35 DCALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGI 93 + L ++ +Y V + +T K + +G +D S + +NG Sbjct: 69 NIKLKRKSVHGISYWVAHIKTSNAKQLKSALSNGTYGGSRQTTSDAVSSNGGIIGVNGSA 128 Query: 94 YDESY---APLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSK 150 +D +PLG+ I+NG + ++ G Y + + Sbjct: 129 FDYGTGKPSPLGMCIKNGIIYGDYMTS--YSVMAVKKDGTIYTPAQGLM----GKNLLAA 182 Query: 151 EIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQ----QATNFYD 206 ++ GP+L+++G R VG+ K + V L++ N +D Sbjct: 183 GVKDTYNFGPVLIKDGEAQLPWTET-EKYYPRTAVGMVKPNDYVLLVTDTGSYNGLNHWD 241 Query: 207 FACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQ 240 K+ LDG S G + + Sbjct: 242 MVNIFKS-YGCTYAYNLDGGGSATLYFNGKVMNK 274 >UniRef50_C6XT12 NHL repeat containing protein n=2 Tax=Pedobacter heparinus DSM 2366 RepID=C6XT12_PEDHD Length = 646 Score = 110 bits (276), Expect = 4e-23, Method: Composition-based stats. Identities = 39/271 (14%), Positives = 75/271 (27%), Gaps = 35/271 (12%) Query: 4 QLLIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQK 63 Q LI + + + + + + + VN +V M Sbjct: 56 QKLIDETSLIGTVISDDEITVAPGVTETDIHYTDTAGKAMHLFILKVNLNEPQVFMEVAT 115 Query: 64 ANGEAWGTLHALLADIN----SQGQVQMAMNGGIYDE-SYAPLGLYIENGQQK------V 112 + A + V +NG +D + P+G+ +NG Sbjct: 116 PFNLPAYARQTVPAQAAEIDTATHMVIAGINGDFFDTSTGIPMGIVHKNGSIVKSTFNDN 175 Query: 113 ALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRI 172 L F + V + + S ++ + SG ML+ N + + Sbjct: 176 TLKPQQAVSFFGVTENNVPIID------FKSGYAALSSQLYNSTGSGVMLVNN---HLPV 226 Query: 173 HPNVASSKIRNGVGINKHGNAVFLLSQQA-------TNFYDFACYAKAKLNVEQLLYLDG 225 + R VG + +G F++ N+ A NV+ + LDG Sbjct: 227 SQPYTAIDPRTSVGYDDNGIVYFVVIDGRDAPYSNGMNYAQLTSAFMA-FNVKNAVNLDG 285 Query: 226 TISHMYMKGG-------AIPWQRYPFVTMIS 249 S +M ++ Sbjct: 286 GGSSTFMTRNPVTNLLQVRNQPSDGTARAVA 316 >UniRef50_C6XXH4 Putative uncharacterized protein n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6XXH4_PEDHD Length = 289 Score = 110 bits (275), Expect = 4e-23, Method: Composition-based stats. Identities = 29/189 (15%), Positives = 53/189 (28%), Gaps = 24/189 (12%) Query: 74 ALLADINSQGQVQMAMNGGIYD-ESYAPLGLYIENGQQK------VALNLASGEGNFFIR 126 + ++ A+NG +D ++ + G+ + A + + Sbjct: 85 KTTSTFGTENNALAAVNGSFFDVKNGGSVDFIKVGGKVLAENRLEKNDSRARHQQAAVVI 144 Query: 127 PGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINP-RIHPNVASSKIRNGV 185 G + + + SGP+LM NG S R + Sbjct: 145 SNGKLALKKWDGTADWEQRLTE----ENVLLSGPLLMLNGTDEALDSTSFSRSRHPRTAI 200 Query: 186 GINKHGNAVFLLSQQA------TNFYDFACYAKAKLNVEQLLYLDGTISHMYMK-----G 234 GI +G + L + + A K L + LDG S G Sbjct: 201 GIKPNGRILLLTVDGRNSNSAGMSLTELAKTMK-WLGCTSSINLDGGGSTTLWVSGFPGG 259 Query: 235 GAIPWQRYP 243 G + + Sbjct: 260 GVVNYPTDN 268 >UniRef50_D2V2G1 N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase n=1 Tax=Naegleria gruberi RepID=D2V2G1_NAEGR Length = 558 Score = 110 bits (275), Expect = 5e-23, Method: Composition-based stats. Identities = 30/199 (15%), Positives = 59/199 (29%), Gaps = 36/199 (18%) Query: 66 GEAWGTLHALLADINSQGQ----VQMAMNGGIYDES-YAPLGLYIENGQQKVALNLASGE 120 G + + A + DI N G ++ + LG + +G+ + + Sbjct: 193 GGCYYNVTAPVRDIAKYHANGYFCHYTTNAGFFNTHKHTCLGNVVSDGRI--SHVSTNHN 250 Query: 121 GNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNV---- 176 NF I G +++ D ++ + L+ G + Sbjct: 251 VNFGITKDGKYFIG-------YTDENTKLEDFDQMISGVIWLVRKGESYVDESSKIEDMS 303 Query: 177 -----------ASSKIRNGVGINKHGNAVFLLSQQ------ATNFYDFACYAKAKLNVEQ 219 R+ +G +K G V + Y+ A +L VE Sbjct: 304 IQETGNAKRFITVRASRSALGHDKEGRLVLVSIDGDGNHNKGPTLYELATLMI-ELGVEN 362 Query: 220 LLYLDGTISHMYMKGGAIP 238 + LDG S ++ + Sbjct: 363 AINLDGGGSVTVVRDNDVV 381 >UniRef50_B8CYN3 SpoIID/LytB domain protein n=1 Tax=Halothermothrix orenii H 168 RepID=B8CYN3_HALOH Length = 833 Score = 109 bits (272), Expect = 9e-23, Method: Composition-based stats. Identities = 34/200 (17%), Positives = 66/200 (33%), Gaps = 28/200 (14%) Query: 58 KMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLA 117 + K G+ + ++ V ++ + +P + +NG A + Sbjct: 635 AVIINKYYGQVAPPAREWITELVVSNGVVQSI------KDGSPGTVIPDNGFIIQAHGQS 688 Query: 118 SGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVIN-----PRI 172 F V G+ +K+I+ A+ +GP L++NG I Sbjct: 689 RQFLKLFKEGDKVVLQNNFGPGLT-------NKDIKMALGAGPTLIKNGKIYITGKAEGF 741 Query: 173 HPNV-ASSKIRNGVGINKHGNAVFLLSQQA-------TNFYDFACYAKAKLNVEQLLYLD 224 P++ R +GI + + + + A + K NV Q + LD Sbjct: 742 QPDILRGRAPRTALGITSGNHLIMVTVDGRQPGFSIGMTLEELAQFML-KYNVVQAMNLD 800 Query: 225 GTISH-MYMKGGAIPWQRYP 243 G S M ++G + Sbjct: 801 GGASSRMVVRGYTMNNPSDK 820 Score = 47.8 bits (112), Expect = 3e-04, Method: Composition-based stats. Identities = 28/170 (16%), Positives = 59/170 (34%), Gaps = 22/170 (12%) Query: 45 VQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLY 104 + ++ + + + A+G+ G L+ + S + +NGG Y + PLGL+ Sbjct: 521 ITMLDLDLNNDFLYVEPFLASGKLSGLSD--LSQVVSGKKALAGINGGFYSYTGRPLGLF 578 Query: 105 IENGQQKVALNLASGEGNFFIRPGGVFY--------VAGDKVGIVRLDAFKTSKEIQFAV 156 + NG+ + G I P + + + + ++ AV Sbjct: 579 MINGEIVSEDIM--GRTALVITPDDIIIDRIDWTARLTNTRGEEILIEGANRRPRTDEAV 636 Query: 157 QSGPMLMEN---GVINPRIHPNVASSKIRNGVGIN-KHGNAVFLLSQQAT 202 + N G + P + + NGV + K G+ ++ Sbjct: 637 ------IINKYYGQVAPPAREWITELVVSNGVVQSIKDGSPGTVIPDNGF 680 >UniRef50_B3RIP6 Putative uncharacterized protein (Fragment) n=2 Tax=Trichoplax adhaerens RepID=B3RIP6_TRIAD Length = 344 Score = 109 bits (272), Expect = 1e-22, Method: Composition-based stats. Identities = 37/218 (16%), Positives = 63/218 (28%), Gaps = 30/218 (13%) Query: 48 YTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE-SYAPLGLYIE 106 +P + Q G L + +A N G ++ + G I Sbjct: 12 VVEDPLRTISVLEPQNTGGCNMSKLSTVADTARKAH-CYVAENAGFFNTETGGCYGNIIS 70 Query: 107 NGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENG 166 NG+ N+ + NF IR G V + + V L+ NG Sbjct: 71 NGRLVRLTNVQN--VNFGIRKNGSIIVG----YLTEEEILDKENPFVQLVSGVIWLVRNG 124 Query: 167 VINPRIH---------------PNVASSKIRNGVGINKHGNAVFLLSQQ-----ATNFYD 206 + + R +G +++GN + + + N YD Sbjct: 125 KSYVKESMKMESNKHEETGTLKQFIEVKSARTAIGHDRNGNVMLMQIEGQTNARGLNLYD 184 Query: 207 FACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRYP 243 FA + LDG S + G A+ + Sbjct: 185 FAKKLIKS-GFVNAINLDGGGSSTTAIDGIAVGYPSDH 221 >UniRef50_B8G1I8 Peptidase M56 BlaR1 n=4 Tax=Desulfitobacterium hafniense RepID=B8G1I8_DESHD Length = 747 Score = 109 bits (272), Expect = 1e-22, Method: Composition-based stats. Identities = 34/216 (15%), Positives = 60/216 (27%), Gaps = 30/216 (13%) Query: 47 AYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYA------P 100 +P+ + + E GT+ L D+ S+ +N G S P Sbjct: 530 MLISDPKRVTLAVT------EEIGTVEEKLTDMVSRSGAIAGINAGGIYLSLEEGNEVFP 583 Query: 101 LGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGP 160 G+ ++NG+ + G V + K IQ V P Sbjct: 584 DGITVQNGEVVYNNAGDQAVEFIGLDAEGKLITGPMNVQEI------KEKNIQEGVGFSP 637 Query: 161 MLMENGVINPRIH------PNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKA- 213 L +NG R R G+G G +F++ + K Sbjct: 638 PLADNGTTLVREGKPAVPGDGGWGIAPRAGIGQRADGTLIFMVIDGRDPDWSIGATLKDM 697 Query: 214 -----KLNVEQLLYLDGTISHMYMKGGAIPWQRYPF 244 + + + L G + G + + Sbjct: 698 ENLFLEYGAVEAVNLSGGSMVEMVYDGKVLNKVSNI 733 >UniRef50_C0WEQ2 Exopolysaccharide biosynthesis protein n=1 Tax=Acidaminococcus sp. D21 RepID=C0WEQ2_9FIRM Length = 470 Score = 109 bits (271), Expect = 1e-22, Method: Composition-based stats. Identities = 35/227 (15%), Positives = 66/227 (29%), Gaps = 29/227 (12%) Query: 55 ERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGL--YIENGQQKV 112 R+ + L + + + NG G+ I NG+ Sbjct: 245 PRLSFTGTVTRPDGAEMKITGLNRMRLENDLIFYNNGYDDTTDTNAAGVEVAIRNGRVIK 304 Query: 113 ALNLAS---GEGNFFIRPGGV------FYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLM 163 S + G GDKV I + + +GP+L+ Sbjct: 305 TGTTGSMPMSWNMTVLSGHGTAADFLRPLAVGDKVKIKTSLGSPLADKAPSVGTAGPLLV 364 Query: 164 ENGVINPR------IHPNVASSKIRNGVGINKHGNAVFLLSQQA------TNFYDFACYA 211 +G++N R VGI K G + +++ + A Y Sbjct: 365 YDGLVNVTASLEEIPSDIADGRAPRTAVGIKKDGTILVVVADGRSSRSAGMTLPELARYL 424 Query: 212 KAKLNVEQLLYLDGTISHMYMKGGAIPWQR-----YPFVTMISVERK 253 +L ++ + DG S + GA+ + P + + + Sbjct: 425 I-QLGADRAMNFDGGGSSEMVVNGAVKNRPSDGAERPVRVALGLFPR 470 >UniRef50_B2J8B3 Putative uncharacterized protein n=1 Tax=Nostoc punctiforme PCC 73102 RepID=B2J8B3_NOSP7 Length = 276 Score = 108 bits (270), Expect = 2e-22, Method: Composition-based stats. Identities = 41/269 (15%), Positives = 77/269 (28%), Gaps = 39/269 (14%) Query: 11 MITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWG 70 + T++ + + P + P + + Sbjct: 3 LWLNLRSSTQSISTVVSSPPKSIRYFERTLPQSIAHVLFI-PVNSKFLVTPA------LS 55 Query: 71 TLHALLADINSQGQVQMAMNGGIYDESYAPLGLYI-----------ENGQQKVALNLASG 119 A + + + + +N G +D + Y+ EN + NL S Sbjct: 56 QKVATVEEFAQKHRAVAILNAGFFDPANQKTTSYVILQRKLVADPKENERLVNNPNLKSY 115 Query: 120 EGNFFIRPGGVFYVAGDKVG---IVRLDAFKTSKEIQFAVQSGPMLM------ENG---V 167 F R Y G V ++ + ++ A+ +GP L+ + G Sbjct: 116 LSQIFNRTEFRRYSCGQTVRYDIVLHSASQPAGCQLVDAIGAGPSLLPELTLEKEGFVDN 175 Query: 168 INPRIHPNVASSKIRNGVGINKHGNAVFLLSQ-------QATNFYDFACYAKAKLNVEQL 220 N R R VGI G+ V ++ + A + K L ++ Sbjct: 176 ANKRDALGSNQPNARTAVGITHDGSVVLVMVAQKPSAPANGISLPALANFMK-TLGADKA 234 Query: 221 LYLDGT-ISHMYMKGGAIPWQRYPFVTMI 248 + LDG S +Y G + I Sbjct: 235 MNLDGGSSSSLYYNGKTFYGKVDLEGNPI 263 >UniRef50_C0CND1 Putative uncharacterized protein n=1 Tax=Blautia hydrogenotrophica DSM 10507 RepID=C0CND1_9FIRM Length = 454 Score = 108 bits (270), Expect = 2e-22, Method: Composition-based stats. Identities = 29/184 (15%), Positives = 54/184 (29%), Gaps = 16/184 (8%) Query: 66 GEAWGTLHALLADINSQGQVQMAMNG-GIYDESYAPL-GLYIENGQQKVALNLASGEGNF 123 G +G ++ + + +NG G S P G + + ++G Sbjct: 233 GGTYGNPRRTVSQELADHNGVLGINGSGFSYSSGIPAPGKSMIKDRTVYEDVYSNGNIMC 292 Query: 124 FIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNV-ASSKIR 182 GG+F ++++ GP L+ENG R Sbjct: 293 VTGEGGMFTAPAG-----MTVQEMLQRDVKDTYCFGPTLVENGEAFEISEQFQQTYRYQR 347 Query: 183 NGVGINKHGNAVFLLSQQ-------ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGG 235 VG+ G+ ++ + + L+ E LDG S + G Sbjct: 348 TAVGMISPGDYYLVIVDGKGVGGSQGMTYEELQQVFLD-LDCEYAYNLDGGGSTTLVFKG 406 Query: 236 AIPW 239 + Sbjct: 407 RVIN 410 >UniRef50_C4V4S8 Exopolysaccharide biosynthesis protein n=1 Tax=Selenomonas flueggei ATCC 43531 RepID=C4V4S8_9FIRM Length = 491 Score = 108 bits (270), Expect = 2e-22, Method: Composition-based stats. Identities = 28/198 (14%), Positives = 55/198 (27%), Gaps = 25/198 (12%) Query: 76 LADINSQGQVQMAMNGGIYDESYAPLG--LYIENGQQ--KVALNLASGEGNFFIRPGGVF 131 + + + P G I NG+ + + + G Sbjct: 287 VNAERGADNLIIYNRAYGRSTGTNPYGLEYVIRNGRVAEINTNDSLIPPDGYVVSVHGTL 346 Query: 132 Y-------VAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVIN-----PRIHPNVA-S 178 V ++ D + + +GP L+ENG ++ + ++ Sbjct: 347 MDAFAAAGVRVGDPAVLTEDLGEPWNRAVQVLGAGPRLVENGSVHVTAGEEQFPGDIRYG 406 Query: 179 SKIRNGVGINKHGNAVFLLSQQAT------NFYDFACYAKAKLNVEQLLYLDGTISHMY- 231 R VG+ + GN +F + +FA + V + LDG S Sbjct: 407 RAPRTAVGVTQKGNILFAVVDGRQSHSHGLTLTEFADLL-VQFGVRDAINLDGGGSSEIC 465 Query: 232 MKGGAIPWQRYPFVTMIS 249 G + + Sbjct: 466 ADGDVLNSPSDGSERAVG 483 Score = 48.9 bits (115), Expect = 2e-04, Method: Composition-based stats. Identities = 19/110 (17%), Positives = 31/110 (28%), Gaps = 6/110 (5%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQ 84 L A D +T +P RV+ A G G ++ I Sbjct: 164 LAAGLTQREYVYADEDGPVTAYFIEADPARYRVR--PALARGIIPG--RQTVSGIAQDTN 219 Query: 85 VQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVA 134 A+N + S +G+ +G + F + P F Sbjct: 220 AAAAINASYFALSGELIGITKIDGTVVSSTYFDRSA--FGVMPDNSFVFG 267 >UniRef50_B2A8G9 Copper amine oxidase domain protein n=1 Tax=Natranaerobius thermophilus JW/NM-WN-LF RepID=B2A8G9_NATTJ Length = 718 Score = 107 bits (268), Expect = 3e-22, Method: Composition-based stats. Identities = 20/121 (16%), Positives = 36/121 (29%), Gaps = 18/121 (14%) Query: 150 KEIQFAVQSGPMLMENGVINPRIHPN------VASSKIRNGVGINKHGNAVFLLSQQA-- 201 +++ FA+ GP ++E G ++ R R VG+ + G + Sbjct: 599 EDVVFALGGGPRILEKGEVDIRSMEEVISDNVSQGRSPRTAVGVTRDGQLLLTAVDGRQS 658 Query: 202 -----TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQR----YPFVTMISVER 252 + + K + + L LDG S M I + Sbjct: 659 GLSIGMTLEELGNFMKDR-GAQDALNLDGGGSTMMWFDNEFQNNPSNGIRNIGNSIVIRE 717 Query: 253 K 253 K Sbjct: 718 K 718 Score = 57.4 bits (137), Expect = 5e-07, Method: Composition-based stats. Identities = 19/105 (18%), Positives = 33/105 (31%), Gaps = 5/105 (4%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQ 84 + + + + + ++P VK A G L + + Sbjct: 379 IANGLKYTSIRKGQENGPIKIHELRLDP-HGDVKPELIMAQDGFSGF--ERLDSMAKRNN 435 Query: 85 VQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGG 129 A+NGG Y + P+GLYI + + FF G Sbjct: 436 AIAAINGGFYWRAGHPIGLYISDQRLIREPMPNRSA--FFYSKDG 478 >UniRef50_B5VVA8 S-layer domain protein n=3 Tax=Cyanobacteria RepID=B5VVA8_SPIMA Length = 789 Score = 107 bits (266), Expect = 4e-22, Method: Composition-based stats. Identities = 32/193 (16%), Positives = 57/193 (29%), Gaps = 29/193 (15%) Query: 84 QVQMAMNGGIYDESYAPLGL-----YIENGQQKVALNLASGEGNFFIRPGGVFYV----- 133 + +A + SY PL L + Q + L + I G V Sbjct: 590 KAGIARYTSEWGASYFPLTLNEIVVVVSGDQVTRQIELPDDQTPTAIPTNGYLLVFRSFR 649 Query: 134 -------AGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNV------ASSK 180 G ++ I + + GP+L++N I Sbjct: 650 SAVSAFGVGSRLTITATTTPSEFIDFPHIMGGGPLLVQNRNIVVNAEAEGFNYWFGQQLA 709 Query: 181 IRNGVGINKHGNAVFLLSQQATN-----FYDFACYAKAKLNVEQLLYLDGTISHMYMKGG 235 IR+ VG+ G + + N + A + +L + LDG S + GG Sbjct: 710 IRSAVGVTATGEVLMVTVHNRVNGAGPSLTEMAKLMQ-QLGAIDAINLDGGSSTSLVLGG 768 Query: 236 AIPWQRYPFVTMI 248 + + + Sbjct: 769 HLLNRTPDTAARV 781 >UniRef50_C3R3L4 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=C3R3L4_9BACE Length = 431 Score = 107 bits (266), Expect = 5e-22, Method: Composition-based stats. Identities = 38/276 (13%), Positives = 69/276 (25%), Gaps = 62/276 (22%) Query: 24 TLLPLFAVAADDCALSDPTLTVQAYTVNP-QTERVKMYWQKANGEAWGTLHALLADINSQ 82 TL A+ L + + + G A G Sbjct: 150 TLPDYLAIYKSPSTLKGKNAVAYIAVADMDKNASFSVL-----GNATGVKTLTQFYNAES 204 Query: 83 GQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLA-------------SGEGNFFIRPGG 129 + + MNGG + + A + L N + F G Sbjct: 205 VKPAIVMNGGYFASNGATVSLLYRNNVMLAPNLQSMSRSDGTSNVAFYPTRSAFGEIENG 264 Query: 130 VFYV---------------------AGDKVGIVRLDAFKTSK---EIQFAVQSGPMLMEN 165 F V +G + + + + A+ GP+L++N Sbjct: 265 KFEVNWVYTVSSGQTYAYPAPSPNKSGVSPMQIPSVNYPEGASIWKAKNAIGGGPVLLKN 324 Query: 166 GVINPRIHPNV---------ASSKIRNGVGINKHGNAVFLLSQQA--------TNFYDFA 208 G+ + S+ R+ +GI +F + + + A Sbjct: 325 GLYKNTWEAELFDTASGIGPTSNNPRSAIGITGDNRLIFFVCEGRNKTPNVPGFTLEEVA 384 Query: 209 CYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRYP 243 + L + LDG S M + G Sbjct: 385 YILRD-LGCLDAMNLDGGGSSCMLVNGQETIKPSDG 419 >UniRef50_C9PX63 Putative uncharacterized protein n=1 Tax=Prevotella sp. oral taxon 472 str. F0295 RepID=C9PX63_9BACT Length = 294 Score = 106 bits (265), Expect = 6e-22, Method: Composition-based stats. Identities = 31/240 (12%), Positives = 71/240 (29%), Gaps = 39/240 (16%) Query: 43 LTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLG 102 TV + P+ ++ A+G A + ++ + + + +NG + + Sbjct: 65 QTVTVAEITPKR-SLEFDIAIADGG------ATVGEMAQRTKALVGINGSYFGMNKRSAI 117 Query: 103 LYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIV-RLDAFKTSKEIQFA--VQSG 159 Y+ G+ + + +R G G K+ I+ + + + SG Sbjct: 118 TYLRQGRTVLDTTTTAE---LALRVTGAIRTHGRKLRIMPWNKEIERRYHCRHGSTLASG 174 Query: 160 PMLMENGV---INPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA-------TNFYDFAC 209 +L+ G + V R+ + + G +F+ N + Sbjct: 175 HLLLYRGQSILLRSSSMGFVVKKHPRSAIALTSRGTVLFVTVDGRHPGYAGGMNLIELRH 234 Query: 210 YAKAKLNVEQLLYLDGTISHMYM---------------KGGAIPWQRYPFVTMISVERKG 254 + + +L + LDG S + V ++G Sbjct: 235 FLQ-QLGCTDAINLDGGGSTTLWAKGFSTKGVANYPCDNRKFDHDGERKVANAVVVMKRG 293 >UniRef50_B0BZE5 Putative uncharacterized protein n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0BZE5_ACAM1 Length = 584 Score = 106 bits (265), Expect = 6e-22, Method: Composition-based stats. Identities = 31/177 (17%), Positives = 55/177 (31%), Gaps = 28/177 (15%) Query: 103 LYIENGQQKVALNLASGEGNFFIRPGGVFYV-----------AGDKVGIVRLDAFKTSKE 151 + + N Q +++ +F I G V +G ++ I + Sbjct: 406 ITVINNQVVSE-KVSTSTKSFAIPKNGYLLVLRSFDVGGALASGTQLQIQTATTPASFNG 464 Query: 152 IQFAVQSGPMLMENGVINPRI------HPNVASSKIRNGVGINKHGNAVFLLSQQ----- 200 V +GP+L+ NG + P S R+G+G G + Sbjct: 465 FPNIVGAGPLLVSNGQVVLNAKAEKFRPPFDTQSAPRSGIGQTADGTILLAAVHNQVSGP 524 Query: 201 ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFV----TMISVERK 253 ++A + +L L LDG S GG + + I V + Sbjct: 525 GPTLKEWALIMQ-RLGSVNALNLDGGSSTSLYLGGQLLDRHPVTAARVQNGIGVFWR 580 Score = 60.9 bits (146), Expect = 4e-08, Method: Composition-based stats. Identities = 32/140 (22%), Positives = 50/140 (35%), Gaps = 4/140 (2%) Query: 22 ALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINS 81 ++ P +L + V +NPQT +K+ N A +H LL+ Sbjct: 245 SILWAPGILRQERVISLGNKQYPVTWLALNPQTPGLKLQPIWGNRNALLGIHPLLSM-AQ 303 Query: 82 QGQVQMAMNGGIYDESY-APLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGI 140 QV A+N G ++ + PLG +NGQ + L G P G F + + Sbjct: 304 GNQVAAAINAGYFNRNNKTPLGAIRQNGQWISSPILN--RGVVAWNPQGQFQMGRLNLQQ 361 Query: 141 VRLDAFKTSKEIQFAVQSGP 160 V + I P Sbjct: 362 VLSTSGGKRLSIVSLDSGYP 381 >UniRef50_B8HPJ4 Putative uncharacterized protein n=2 Tax=Cyanothece sp. PCC 7425 RepID=B8HPJ4_CYAP4 Length = 338 Score = 106 bits (264), Expect = 8e-22, Method: Composition-based stats. Identities = 37/219 (16%), Positives = 67/219 (30%), Gaps = 18/219 (8%) Query: 27 PLFAVAADDCALSDPTL-TVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQV 85 P V + L+ P L +N T +K + L ++ + + Sbjct: 53 PFVGVTYINRRLTSPRLLNQHIVLINLATTGLKFRVTSPAADGSTALEKTIS-FTRRSKA 111 Query: 86 QMAMNGGIY----DESYAPLGLYIENGQQKVA-LNLASGEGNFFIRPGGVFYVAGDKVGI 140 Q+ +NG + LGL +G+ + N G NF F +G Sbjct: 112 QIGINGNFFQALSSTRAKVLGLAASSGRVYSSWSNGYQGAINFSSNRTATFVTPPSGLGT 171 Query: 141 VRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQ 200 + V P+L++NG N R+ +G+ ++ + Sbjct: 172 TTVPLLT----PYNLVSGLPVLVKNGQNVTVGVANPNEYAARSVIGLTQNQQLLLFAVDG 227 Query: 201 -------ATNFYDFACYAKAKLNVEQLLYLDGTISHMYM 232 N + A + V + LDG S + Sbjct: 228 PRSNVSTGMNQIELADLLISDFKVVHAVNLDGGGSSTLV 266 >UniRef50_B6V2M3 Gp2.43 n=1 Tax=Bacillus phage SPO1 RepID=B6V2M3_BPSP1 Length = 437 Score = 106 bits (264), Expect = 8e-22, Method: Composition-based stats. Identities = 27/221 (12%), Positives = 59/221 (26%), Gaps = 18/221 (8%) Query: 37 ALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE 96 ++ +K+ N + ++ + N I++ Sbjct: 28 TSKTDYFITHVPNLDKNGNLIKLRHGFQNDLINSGVGETARSFCNRHSASLVANASIWNT 87 Query: 97 S-YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFA 155 + G+ I++G+ + I+ + V + I Sbjct: 88 NNGLIRGVQIQDGKVIQDAKDTNSYT-LGIKSDNTLVM---YPPSVTAEQVLADGCIDAI 143 Query: 156 VQSGPMLMENGVIN----PRIHPNVASSKIRNGVGINKHGNAVFLLSQQA------TNFY 205 PM +++G NV RN + + + +FL + + Sbjct: 144 TAFYPM-IQDGAAFDLSGVTTVSNVTEHHPRNVIAQLPNKDLLFLTCEGRTKANQGMTYD 202 Query: 206 DFACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRYPFV 245 D A+ V LDG S ++G + Sbjct: 203 DMIRILLAR-GVTTAYCLDGGGSSQTVVRGHLVNNPLDNNG 242 >UniRef50_UPI0001923977 PREDICTED: similar to predicted protein, partial n=1 Tax=Hydra magnipapillata RepID=UPI0001923977 Length = 290 Score = 105 bits (262), Expect = 1e-21, Method: Composition-based stats. Identities = 33/244 (13%), Positives = 64/244 (26%), Gaps = 38/244 (15%) Query: 22 ALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINS 81 + + D T+ + + + +G A + + Sbjct: 54 STFITNYIGYETDHLQFGHRTVIKNPLKM------LSVLEPLKSGGCKTNSLAYVHESAK 107 Query: 82 QGQVQMAMNGGIYDES------YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAG 135 Q ++A+N G ++ G I NG N NF IR G + Sbjct: 108 QQNCRIAVNAGFFNPFETDKDYGKCYGNIISNGNLVQD-NGGIQNANFGIRSDGTLVIG- 165 Query: 136 DKVGIVRLDAFKTSKEIQFAVQSGPMLMENG---------------VINPRIHPNVASSK 180 + + + ++ NG + Sbjct: 166 ---YLPEKEVIDKKNPFLQLLSGVGWILRNGSSYLKESEKAECKESETTGTLDKFFNVKS 222 Query: 181 IRNGVGINKHGNAVFLLSQQ-----ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGG 235 R +G + G+ + N Y+ Y K K+ + + DG S Y++ Sbjct: 223 ARTMIGYDAKGHVHIVQFDGKTGKSGINLYEAVEYLK-KIGLINAINFDGGGSATYVQDS 281 Query: 236 AIPW 239 I Sbjct: 282 IILN 285 >UniRef50_B4VX04 Putative uncharacterized protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VX04_9CYAN Length = 681 Score = 105 bits (262), Expect = 1e-21, Method: Composition-based stats. Identities = 41/241 (17%), Positives = 72/241 (29%), Gaps = 45/241 (18%) Query: 52 PQTERVKMYWQKANGEAWGTLHALLADINSQGQ--VQMAMNGGI-----------YDESY 98 P R + W + +G I + G+ + +N G + +Y Sbjct: 431 PILNRGAIAWTDQHQFKFGRFSLQETLITANGERFPSLFLNSGYVQAGISRYTPAWGVTY 490 Query: 99 APLG-----LYIENGQQKVA-LNLASGEGNFFIRPGGVFYVAGD--------------KV 138 PL ++N Q +GE +F I G V Sbjct: 491 TPLTDNEVIWVVQNNQITAQLPGGVAGEESFVIPVNGYLLTHRGHDPNAIAKSLTLGTTV 550 Query: 139 GIVRLDAFKTSKEIQFAVQSGPMLMENGVINPR------IHPNVASSKIRNGVGINKHGN 192 I + + + +GP+L++N I + S IR+ +GI +G Sbjct: 551 QIEQKTLPVEFNDYPHILGAGPLLLQNRQIVLDAKAENFSNAFAQQSAIRSAIGITANGT 610 Query: 193 AVFLLSQQA-----TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTM 247 + N + A + +L L LDG S GG + + Sbjct: 611 LIIAAMHNRVGGRGPNLTETAQLMQ-QLGAVDALNLDGGSSTGLYLGGHLLDRSPHTAAR 669 Query: 248 I 248 + Sbjct: 670 V 670 Score = 59.3 bits (142), Expect = 1e-07, Method: Composition-based stats. Identities = 18/102 (17%), Positives = 34/102 (33%), Gaps = 2/102 (1%) Query: 23 LTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQ 82 + P L D + V+P+ +VK+ ++ L+ Sbjct: 340 ILWAPGIWWRQRTVTLGDHQFPLVWLEVDPKNPQVKLSPMWSHPTTQVGTAPLIKT-AQL 398 Query: 83 GQVQMAMNGGIYDESYA-PLGLYIENGQQKVALNLASGEGNF 123 + A+NGG ++ + PLG +G L G + Sbjct: 399 WKAAAAINGGFFNRNNQLPLGAIRRDGYWYSGPILNRGAIAW 440 >UniRef50_B4AZH7 Putative uncharacterized protein n=1 Tax=Cyanothece sp. PCC 7822 RepID=B4AZH7_9CHRO Length = 298 Score = 105 bits (261), Expect = 2e-21, Method: Composition-based stats. Identities = 38/277 (13%), Positives = 80/277 (28%), Gaps = 58/277 (20%) Query: 12 ITLNLKRIFLALTLLPLFAVAADDCALSDPT-------LTVQAYTV-NPQTERVKMYWQK 63 + +K I ++L + + A + P Y V + V Sbjct: 1 MNRLVKLIIISLIMGVVSACTPTQSSSEKPQRSESAVVQPEPLYKVYDLPQSTVHTLTI- 59 Query: 64 ANGEAWGTLH------ALLADINSQGQVQMAMNGGIYDES-YAPLGLYIENGQQKVALNL 116 + + ++ + A+NGG +D + I G+ Sbjct: 60 PVDSPYQVTVTLARSLETVENLAKKQGAMAAINGGFFDPNNGKTTSYIIHQGKIIADPKN 119 Query: 117 ASGEGNFFIRPGGVFYVAGD-----------KVGIVRLDAFKTSKEIQ-----FAVQSGP 160 P Y+ + +F + ++ +GP Sbjct: 120 NER---LMKNPDLTRYLDKILNRSEWRRYQCGATVRYSISFHNQPTLTGCQLLDSLGAGP 176 Query: 161 MLM-------------ENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQ------- 200 L+ NG + + + R +GI +G+ +++++ Q Sbjct: 177 RLLPEMTAQTEGFIDLVNGTMI-KDALGLKEPNARTAIGITANGDLIWIMAAQKAHSSRA 235 Query: 201 -ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGA 236 + + A + K L V++ L LDG S + G Sbjct: 236 TGLSLLELAEFLK-TLGVQEALNLDGGSSSTFYYQGK 271 >UniRef50_B8HTR4 Putative uncharacterized protein n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HTR4_CYAP4 Length = 603 Score = 105 bits (261), Expect = 2e-21, Method: Composition-based stats. Identities = 45/283 (15%), Positives = 78/283 (27%), Gaps = 41/283 (14%) Query: 6 LIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKAN 65 L+G + +R +A + F L + + +P R + W Sbjct: 307 LVGIQPLPRLAQRWQVAAAINGGFFNRNQQVPLGAIRQSGSWIS-SPILNRGAIGWNDQG 365 Query: 66 GEAWGTLHALLADINSQGQVQ-------------MAMNGGIYDESYAP-----LGLYIEN 107 G L I + GQ +A + +Y P + + N Sbjct: 366 EFTLGRLRLQQTLITASGQSLPINTLDSGFVQKGIARYTRAWGPTYTPRVAKETVITVVN 425 Query: 108 GQQK-VALNLASGEGNFFIRPGGVFYV---------AGDKVGIVRLDAFKTSKEIQFAVQ 157 + A+ I P G V + I + Sbjct: 426 DRVAGQQTASANTPTPILIPPNGYLLVLRDVPLPVFGEGSLQIQMNALPADFNRFPQILG 485 Query: 158 SGPMLMENGVIN--PRIHPNVASS----KIRNGVGINKHGNAVFLLSQQA-----TNFYD 206 +GP+L+E G I P + R+G+G G + + + + Sbjct: 486 AGPLLLERGQIVLNPDLEQFGNGLDAQQAPRSGIGRTSTGQILLVTTHNRIGGAGPTLAE 545 Query: 207 FACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMIS 249 +A K L L LDG S GG + + + Sbjct: 546 WAAILK-TLGAVDALNLDGGSSTALYLGGQLLDRHPVTSARVQ 587 >UniRef50_Q8YP57 All4343 protein n=5 Tax=Nostocaceae RepID=Q8YP57_ANASP Length = 660 Score = 105 bits (261), Expect = 2e-21, Method: Composition-based stats. Identities = 36/241 (14%), Positives = 67/241 (27%), Gaps = 46/241 (19%) Query: 52 PQTERVKMYWQKANGEAWGTLHALLADINSQGQV---QMAMNGGIYDES----------- 97 P R + W G+ + +L + + + +A+N G Sbjct: 411 PILNRGAIAW-NDAGQFYFGRLSLQETLATSSNLRVPILALNSGYVQNGIARYTPAWGKM 469 Query: 98 YAPLG-----LYIENGQQKVA-LNLASGEGNFFIRPGGVFYVAGDKV------------- 138 Y PL + ++N + +G+ NF I G Sbjct: 470 YTPLTDNERIVIVQNNKITNQFPGNKAGQTNFPIPNNGYLLTLRGNATTVASQLPVGTDV 529 Query: 139 GIVRLDAFKTSKEIQFAVQSGPMLMENGVINPR------IHPNVASSKIRNGVGINKHGN 192 I + +GP+L++N I + +A +R+G+ + Sbjct: 530 QITSATTPGEFNRYPHIIGAGPLLLQNSQIVLDAKSEQFSNAFIAERAVRSGICTTANNT 589 Query: 193 AVFLLSQQA-----TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTM 247 + N + A K L L LDG S G + + Sbjct: 590 LLIAAVHNRAGGPGPNLAEHAQLMKL-LGCVNALNLDGGSSTSLYLSGQLLDRYPNTAAR 648 Query: 248 I 248 + Sbjct: 649 V 649 Score = 65.9 bits (159), Expect = 1e-09, Method: Composition-based stats. Identities = 29/145 (20%), Positives = 47/145 (32%), Gaps = 7/145 (4%) Query: 23 LTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQ 82 +T L V VNP+T + + N + +L + Sbjct: 320 ITWATGLRWRQQFVNLGTNRFPVVLLEVNPRTIGLTLKPIVTNPDTLVGTAPILQT-AQR 378 Query: 83 GQVQMAMNGGIYDESYA-PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIV 141 A+NGG ++ + PLG +N Q + L G G FY + Sbjct: 379 YFAVGAINGGYFNRNNRYPLGAIRQNNQWLSSPILN--RGAIAWNDAGQFYFGRLSLQET 436 Query: 142 RLDAFKTSKEIQFAVQSGPMLMENG 166 + I A+ SG ++NG Sbjct: 437 LATSSNLRVPI-LALNSGY--VQNG 458 >UniRef50_A0YXN3 Putative uncharacterized protein n=1 Tax=Lyngbya sp. PCC 8106 RepID=A0YXN3_9CYAN Length = 775 Score = 104 bits (258), Expect = 4e-21, Method: Composition-based stats. Identities = 32/193 (16%), Positives = 62/193 (32%), Gaps = 29/193 (15%) Query: 84 QVQMAMNGGIYDESYAPLG-----LYIENGQQKVALNLASGEGNFFIRPGGVF------- 131 + +A + +SY PL + +EN Q + + I G Sbjct: 572 KAGIARYTPDWGKSYTPLTLNEVIITVENNQLSRQIESNDDQTPIEIPQNGYLLTFRSFR 631 Query: 132 -----YVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNV------ASSK 180 + G K+ I + + +GP+L++ G I Sbjct: 632 SALSAFPLGGKIAITAKTTPSEFNQYPHILGAGPLLLQQGQIVVDAEAEGFNIWFAKQRA 691 Query: 181 IRNGVGINKHGNAVFLLSQQATN-----FYDFACYAKAKLNVEQLLYLDGTISHMYMKGG 235 IR+G+G+ +G+ + + + A +L + L LDG S + GG Sbjct: 692 IRSGIGVTANGDLLIVTVHNRVGGPGPDLTELAQ-LIQQLGAVEGLNLDGGSSTSLILGG 750 Query: 236 AIPWQRYPFVTMI 248 + + + Sbjct: 751 HLLNRTADTAARV 763 Score = 47.4 bits (111), Expect = 5e-04, Method: Composition-based stats. Identities = 14/88 (15%), Positives = 29/88 (32%), Gaps = 3/88 (3%) Query: 23 LTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQ 82 + L+ V V + +K+ + +L + I + Sbjct: 437 ILWTTGLNWRQQYLTLNQDRFPVVWLEVK-RNSGLKLQPIWTDKTQMKGTASL-SQITNS 494 Query: 83 GQVQMAMNGGIYDESY-APLGLYIENGQ 109 A+NGG ++ + PLG +N + Sbjct: 495 WGSLAAINGGYFNRNNLLPLGTIRQNNK 522 >UniRef50_B0C332 Putative uncharacterized protein n=2 Tax=Bacteria RepID=B0C332_ACAM1 Length = 306 Score = 103 bits (257), Expect = 5e-21, Method: Composition-based stats. Identities = 27/222 (12%), Positives = 60/222 (27%), Gaps = 20/222 (9%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMY--WQKANGEAWGTLHALLADINSQ 82 L S V ++ +++ + + + +Q Sbjct: 40 LFAGITYQRQVYT-SPRPYIVHIAKIDLTHPGIRVIATPGQPADDDNEFRAQPTSAFLTQ 98 Query: 83 GQVQMAMNGGIYDESY--APLGLYIENGQQKVALNLASGEGNFFIRP---GGVFYVAGDK 137 ++Q+AMN G + P G + L + G + P V + Sbjct: 99 FRLQLAMNAGYFYHFNEKTPWDYAPHTGGRVNVLGQSISMGQPYSPPQKQWPVLCFDQSQ 158 Query: 138 VGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGN-AVFL 196 G + + AV +L + + + R+ +++ G + Sbjct: 159 RGRIV-ATGHCPSDTLHAVAGNYIL----HPDQPLQLDSDKPYARSIAALDQTGTTLWLI 213 Query: 197 LSQQATNFYDFACYAKA------KLNVEQLLYLDGTISHMYM 232 + Y ++ + L LDG S + Sbjct: 214 VVDGKQPDYSEGATFADIEQLIKQIGADIALNLDGGGSTTLV 255 >UniRef50_C6IP98 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=C6IP98_9BACE Length = 536 Score = 103 bits (257), Expect = 5e-21, Method: Composition-based stats. Identities = 41/293 (13%), Positives = 79/293 (26%), Gaps = 73/293 (24%) Query: 24 TLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQG 83 TL + L+ + T +V++ + + T+ A G Sbjct: 242 TLPAEIELYETTSNLNGSNFHAWYAIGDLSTGKVEVRVHIPS--SPATIDTQSASF--NG 297 Query: 84 QVQMAMNGGIYDESYAPLGLYIEN----------------GQQKVALNLASGEGNFFIRP 127 + +NGG + + G+ + N G + G F + Sbjct: 298 DCYLLVNGGYFY-NGNHTGIAVINSIKSGSVSAVRGSLKTGDTEYNSMYNVTRGTFGVDA 356 Query: 128 GGVFYV----------------------AGDKVGIVRLDAFKTSKE--IQFAVQSGPMLM 163 G V +K GIV + T+ ++A+ +GP+L+ Sbjct: 357 SGKPNVVWTGTDASSNVFYFDRPLPSVKGENKYGIVTNENPTTAISWSPKYALSAGPVLL 416 Query: 164 ENGVINPRIHPNVASSKI--------------------RNGVGINKHGNAVFLLSQQATN 203 ++ I + R +G + G V + Sbjct: 417 KDKKIPFDFTETSKGTDYYLSNYEIIPYDIFGANVTPDRTAIGYREDGKVVIFICDGRIT 476 Query: 204 ------FYDFACYAKAKLNVEQLLYLDGTISHMYMKG-GAIPWQRYPFVTMIS 249 + A K L + LDG S + G + ++S Sbjct: 477 ASGGATLTELAQIMK-GLGCVGAINLDGGGSTGMVVGDEHLNDMTGGNRAVVS 528 >UniRef50_B7KAU9 Putative uncharacterized protein n=7 Tax=Chroococcales RepID=B7KAU9_CYAP7 Length = 644 Score = 103 bits (256), Expect = 6e-21, Method: Composition-based stats. Identities = 36/239 (15%), Positives = 65/239 (27%), Gaps = 43/239 (17%) Query: 52 PQTERVKMYWQKANGEAWGTLHALLADINSQGQ--VQMAMNGGI-----------YDESY 98 P R + W G L I + G + +N G + +Y Sbjct: 395 PILNRGAIAWNDRGQVKMGRLRLQETVITNGGNRLPVLYLNSGYVQSGMARYTRDWGATY 454 Query: 99 APLG-----LYIENGQQKVALNLASGEGN-FFIRPGGVFY------VAGDKVGIVRLDAF 146 PL + ++N Q N I G V + I Sbjct: 455 TPLSDDELIITVQNNQVISQRQGGKAGQNVIPIPNDGYLLAIRKNSVPASALTIGTSLNL 514 Query: 147 KTSK------EIQFAVQSGPMLMENGVIN-----PRIHPNV-ASSKIRNGVGINKHGNAV 194 ++ + +GP+L+ NG I + R+ + + G + Sbjct: 515 ESGTIPADFNNYPHILGAGPLLLLNGQIVLDVASEQFSKGFQNQKASRSAIATTRDGKLM 574 Query: 195 FLLSQQAT-----NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMI 248 + + + A ++ L L LDG S GG + + + Sbjct: 575 VVAVHNRVGGSGASLPELAQILQS-LGAVDALNLDGGSSTSLALGGQLIDRSPVTAAKV 632 Score = 62.8 bits (151), Expect = 1e-08, Method: Composition-based stats. Identities = 17/138 (12%), Positives = 40/138 (28%), Gaps = 4/138 (2%) Query: 23 LTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQ 82 +T P L + V ++ ++ + + +N + ++ I Sbjct: 304 ITWTPGLIWRQKIIPLKGDSFPVTWLDIDLKSPNIFLKPVTSNPDTLEGTEPIV-TIGRN 362 Query: 83 GQVQMAMNGGIYDESYA-PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIV 141 A+NGG ++ + PLG N + L G G + ++ Sbjct: 363 TTASAAINGGFFNRNNRLPLGAIRTNNRWVSGPILN--RGAIAWNDRGQVKMGRLRLQET 420 Query: 142 RLDAFKTSKEIQFAVQSG 159 + + + Sbjct: 421 VITNGGNRLPVLYLNSGY 438 >UniRef50_Q8YKH7 All7320 protein n=2 Tax=Cyanobacteria RepID=Q8YKH7_ANASP Length = 314 Score = 102 bits (255), Expect = 9e-21, Method: Composition-based stats. Identities = 36/239 (15%), Positives = 66/239 (27%), Gaps = 31/239 (12%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLH----------- 73 L L + T++ T +K + + Sbjct: 45 LFRGIVYQR-LIESKPRPLIIHIVTIDLNTPGIKPFITPDIENLSKNVGVGKQAIIDNET 103 Query: 74 --ALLADINSQGQVQMAMNGGIYDESY--APLGLYIENGQQKVALNLASGEGNFFIRP-- 127 ++ ++ QV++A+NG + P Y +G L G + Sbjct: 104 KARTTSEFVAEFQVKLAINGSYFYPFKEVTPWHYYPHSGDTTKVLGQTISNGKIYANKKS 163 Query: 128 -GGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKI--RNG 184 V + + + K + +L+ G I+ N ++ K R Sbjct: 164 SWYVLCFDNNNQAQIPGGE-ECPKNTIQGLAGDDVLVFQGKPKINIYANSSADKPYSRVV 222 Query: 185 VGINKHGN-AVFLLSQQATNFY-------DFACYAKAKLNVEQLLYLDGTISHMYMKGG 235 I+K G +L Y + AKL V + LDG S + Sbjct: 223 AAIDKTGKKLWLVLVDGKQPLYSEGFTKRELTQ-FIAKLGVYNAINLDGGGSTTLVVAN 280 >UniRef50_A7M0G9 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=A7M0G9_BACOV Length = 621 Score = 102 bits (255), Expect = 1e-20, Method: Composition-based stats. Identities = 36/184 (19%), Positives = 63/184 (34%), Gaps = 18/184 (9%) Query: 74 ALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYV 133 A + I + A+NG S P + + KVA + S + GV + Sbjct: 103 AKTSMIAKDKKALFAINGSY-SISGNPSTFTMVDKVVKVASTIESAS-----KVNGVIAI 156 Query: 134 AGDKVGIVRL----DAFKTSKEIQFAVQSGPMLMENGVINPRIHPNV-ASSKIRNGVGIN 188 + V+ D E + A+ SGPML+ G + + R+ +GI Sbjct: 157 DAEGSVDVKSCTFSDYTDVEDEYESALASGPMLLMEGKVCSFPQDAIYTQRMARSVIGIT 216 Query: 189 KHGNAVFLLSQQATN------FYDFACYAKAKLNVEQLLYL-DGTISHMYMKGGAIPWQR 241 G + L A + A + L ++ + L DG+ S ++ G + Sbjct: 217 AQGKMMLLTIDGAITGNADGATLEEAAFIAKTLGMKNAVCLADGSSSTLWTSGKGVVNHP 276 Query: 242 YPFV 245 Sbjct: 277 VGNG 280 >UniRef50_A3DHF5 Ig-like, group 2 n=3 Tax=Clostridium thermocellum RepID=A3DHF5_CLOTH Length = 929 Score = 102 bits (254), Expect = 1e-20, Method: Composition-based stats. Identities = 24/181 (13%), Positives = 49/181 (27%), Gaps = 35/181 (19%) Query: 103 LYIENGQQKVALNLASGEGNFFIRPGGVFYVAGD-------------KVGIVRLDAFKTS 149 + ++NG + G I G ++ + Sbjct: 212 VVVDNGTVV---EIRQGLPAVEIPQNGYVIISRGANAQFLLQHFKVGDPVEISFSTVLDW 268 Query: 150 KEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGN-AVFLLSQQA------T 202 ++I+ AV +L+++G I + ++ R G +K G + + Sbjct: 269 QKIEMAVTGSAILVKDGQIPEKFSYEISGVHPRTAAGTSKSGKELILVTVDGRQAASKGM 328 Query: 203 NFYDFACYAKAKLNVEQLLYLDGTISHMYMKG-------GAIPWQR----YPFVTMISVE 251 + A + L + LDG S + + T I V Sbjct: 329 TQRELANLMLS-LGAYNAINLDGGGSTSMVSRIPGTNDLKVVNTPSDGALRSISTAIGVF 387 Query: 252 R 252 Sbjct: 388 S 388 >UniRef50_A1HRE9 Exopolysaccharide biosynthesis protein n=1 Tax=Thermosinus carboxydivorans Nor1 RepID=A1HRE9_9FIRM Length = 487 Score = 102 bits (254), Expect = 1e-20, Method: Composition-based stats. Identities = 19/123 (15%), Positives = 38/123 (30%), Gaps = 18/123 (14%) Query: 147 KTSKEIQFAVQSGPMLMENG------VINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQ 200 + A+ +GPML++NG I R +G+ K G + ++ Sbjct: 361 PVWDKTVHALGAGPMLLKNGSIYLTTKIEEFGSDVAGGRAPRTALGLTKDGRVLLVVVDG 420 Query: 201 A------TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPW-----QRYPFVTMIS 249 + A +L + LDG S + + + + ++ Sbjct: 421 RQPTSAGMTLLELA-LFLQELGAVDAMNLDGGGSSEMVINDKVVNKPSDGRERKVGSALA 479 Query: 250 VER 252 V Sbjct: 480 VIS 482 Score = 40.8 bits (94), Expect = 0.045, Method: Composition-based stats. Identities = 18/113 (15%), Positives = 37/113 (32%), Gaps = 5/113 (4%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQ 84 ++P + ++ T++ + + ANG G ++ + + Sbjct: 156 VMPGLTYTSWLSGRPYGPVSAHILTIDLKQ-GFVLKPVLANGVVQG--LDTVSAMARACR 212 Query: 85 VQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDK 137 A+NG + + LGL +G+ LA I P G + Sbjct: 213 AVAAVNGSYFAPTGEILGLLKLDGEIVSTPPLA--RTAMGIMPDGKIIMDQVT 263 >UniRef50_UPI0001BC335A hypothetical protein BcroD2_01203 n=1 Tax=Butyrivibrio crossotus DSM 2876 RepID=UPI0001BC335A Length = 366 Score = 102 bits (253), Expect = 1e-20, Method: Composition-based stats. Identities = 30/223 (13%), Positives = 70/223 (31%), Gaps = 27/223 (12%) Query: 36 CALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIY 94 +S + + + +P V + ++ E L ++ + +NGG Y Sbjct: 123 VEISGRSYYGKLMMIKDPSKVSVATIYPWSD-ENKSKYGVTLGELVTNAGAIAGINGGEY 181 Query: 95 DESY----APLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAG-DKVGIVRLDAFKTS 149 P GL + NG+ + + G+ + + + + + +++ + Sbjct: 182 CSDGNWGGRPKGLVVSNGELQYN-SPQWGDVMVGFNEDNILVIKDLNGMSVGQIEEMVKT 240 Query: 150 KEIQFAVQS----------GPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQ 199 + I+ V L+ NG + I+ + + + R +G G + ++ Sbjct: 241 ERIRDCVSFKDIDDGDSNHFTKLIING-VATEINGSGSGANPRTCIGQRADGTVLMFVTD 299 Query: 200 QA-------TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGG 235 D K + +DG S G Sbjct: 300 GRGASGHIGATAADLISVMK-EYGAVNAANIDGGSSSSMYYKG 341 >UniRef50_UPI0001BC7E39 hypothetical protein BacD2_08600 n=1 Tax=Bacteroides sp. D2 RepID=UPI0001BC7E39 Length = 660 Score = 102 bits (253), Expect = 1e-20, Method: Composition-based stats. Identities = 20/217 (9%), Positives = 50/217 (23%), Gaps = 34/217 (15%) Query: 42 TLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPL 101 V ++ + ++ + A L+ + + +NG E Sbjct: 75 RQQVNVLEIDLSSPDYELEFVSAPQL------DSLSSVALKHDAVAGINGTYELE----A 124 Query: 102 GLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPM 161 NG + L G ++ G + Y + + P+ Sbjct: 125 SFVKVNGSIISPITLPEGHLRYWKHEGAIAYDGYKVEIGYGTKESYSYNSMPNIFSGAPV 184 Query: 162 LMENGVIN-PRIHPNVAS-----------------SKIRNGVGINKHGNAVFLLSQQA-- 201 L+++ ++ R V + + + + Sbjct: 185 LIDDYQPVGKTFIGDITGINLNSLDGEDYRRHQGVRHPRTAVALTEQNKLLLVTVDGRAD 244 Query: 202 ----TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKG 234 + + + L +DG S Sbjct: 245 LAAGMTAKELTSFINQYFKPQHALNVDGGGSTTMYIR 281 >UniRef50_A6TVJ8 Exopolysaccharide biosynthesis protein n=2 Tax=Alkaliphilus RepID=A6TVJ8_ALKMQ Length = 942 Score = 102 bits (253), Expect = 1e-20, Method: Composition-based stats. Identities = 33/228 (14%), Positives = 68/228 (29%), Gaps = 27/228 (11%) Query: 49 TVNP---QTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYA-PLGLY 104 ++ + + + WGT + + +N + + P Sbjct: 168 RIDLSSINKYTDRYQYTMLIDKNWGTHTPGYNEKLLDMVEVIVINDEVAEIRRRQPATGI 227 Query: 105 IENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLME 164 NG VA +G G + + +I+ A+ G +L++ Sbjct: 228 PSNGYVLVASQTETGWGRAGHLFDNLKVGDRLTLHQEIQPNLN---QIELALGGGTLLVK 284 Query: 165 NGVINPRIHPNVASSKIRNGVGINKHGN-AVFLLSQQATNFY------DFACYAKAKLNV 217 +G + +VA + R+ +GI++ + + +NFY + L Sbjct: 285 DGQA-AHLTQSVAGAHPRSAIGISRDRKQVILVTIDGRSNFYHGVDGRELGNILL-GLGA 342 Query: 218 EQLLYLDGTISHMYMKGG-------AIPWQR----YPFVTMISVERKG 254 + +DG S + I V I+V K Sbjct: 343 HDAIIMDGGGSTTMIARELGEAKPQIINNPSEGVERRIVNGIAVLSKA 390 >UniRef50_C9RVV6 Putative uncharacterized protein n=3 Tax=Geobacillus RepID=C9RVV6_GEOSY Length = 652 Score = 101 bits (251), Expect = 3e-20, Method: Composition-based stats. Identities = 23/128 (17%), Positives = 41/128 (32%), Gaps = 17/128 (13%) Query: 135 GDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAV 194 GD V I ++ A+ L+ +G + P V R VGI+K+GN + Sbjct: 358 GDAVEISLQYDQPEWSGVKEALGGRYRLVADGKVQPFSIEGV---HPRTAVGIDKNGNVM 414 Query: 195 FLLSQQA-------TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKG------GAIPWQR 241 ++ + A +L + LDG S ++ Sbjct: 415 LIVVDGRQPAYSQGMTLNELAKLM-HELGAVDAMTLDGGGSSTFVVRQPNGQLKVENKPS 473 Query: 242 YPFVTMIS 249 F ++ Sbjct: 474 DGFARPVA 481 Score = 60.5 bits (145), Expect = 6e-08, Method: Composition-based stats. Identities = 20/124 (16%), Positives = 39/124 (31%), Gaps = 4/124 (3%) Query: 21 LALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWG---TLHALLA 77 ++ + + + V ++ ER+ + +N + G L Sbjct: 137 VSTRIASGVEKEEMEIVGARGKQHVYKLDIDTSNERMAIETALSNDQVLGIEPVLEQAKR 196 Query: 78 DINSQGQVQMAMNGGIYDESYAPLGLYIENGQ-QKVALNLASGEGNFFIRPGGVFYVAGD 136 G V A+NG + + +P L + G+ A+ F I G + Sbjct: 197 YDGRDGIVLAAVNGDYFKQDGSPTDLMVHRGEIVITNTTPAAERTIFGISADGKPMIGNP 256 Query: 137 KVGI 140 V I Sbjct: 257 DVQI 260 >UniRef50_C7IFA0 Exopolysaccharide biosynthesis protein n=1 Tax=Clostridium papyrosolvens DSM 2782 RepID=C7IFA0_9CLOT Length = 385 Score = 101 bits (251), Expect = 3e-20, Method: Composition-based stats. Identities = 29/219 (13%), Positives = 66/219 (30%), Gaps = 25/219 (11%) Query: 47 AYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAP----LG 102 V+ + R + ++ E G + + + + + Sbjct: 150 VLIVDKKGARFETFYSNITLEHKGNKIKINDMNRIGKNNDIVLYNDKFGSTNRAEIKNTT 209 Query: 103 LYIENGQQK--------VALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQF 154 + ++N V + +F+ + G K G + ++ Sbjct: 210 IIVDNNVITTLVESTKEVNIRKGMNVISFYGGKESIPEKMGLKAGDKVNIRMEPYLGYRY 269 Query: 155 -AVQSGPMLMENGVINP----RIHPNVASSKIRNGVGINKHGNAVFLLSQQA-------T 202 A + G ML+++G + + + R +GI G + L++ Sbjct: 270 QAYECGSMLVKDGKTVVPERDKWAGTLGNRDPRTVIGIKTDGKIIMLVADGRQPGYSEGM 329 Query: 203 NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQR 241 + Y KL V + LDG S + G++ + Sbjct: 330 TGKEMGEYL-VKLGVRDVAMLDGGASSQMIINGSLRNRP 367 Score = 53.9 bits (128), Expect = 5e-06, Method: Composition-based stats. Identities = 16/83 (19%), Positives = 31/83 (37%), Gaps = 2/83 (2%) Query: 29 FAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMA 88 + + +P+ ERV+ + +G L+DI + + A Sbjct: 61 VQYKHKTEIIKGNKQEIYMLEFDPRDERVEFKPALSFDNIFGF--EKLSDICKRNEAYAA 118 Query: 89 MNGGIYDESYAPLGLYIENGQQK 111 +NGG + + P G+ +GQ Sbjct: 119 INGGFFYQFGEPTGMVAIDGQML 141 >UniRef50_B4VYL6 Tat pathway signal sequence domain protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VYL6_9CYAN Length = 299 Score = 100 bits (250), Expect = 3e-20, Method: Composition-based stats. Identities = 29/247 (11%), Positives = 61/247 (24%), Gaps = 27/247 (10%) Query: 24 TLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLH------ALLA 77 + P+ A + + + + ++ + AN G + Sbjct: 32 LVSPVAAESVSFRRSTILGVPLYQTHIDLTNPDTFIAIGLANNSTLGNHQGAIGEESFGN 91 Query: 78 DINSQGQVQMAMNGGIYDES-YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGD 136 + +A + +G + G + +R Sbjct: 92 MVRRYHAAVVASGTFFSKKDPKRLMGNMVSAGTFLKYSPWENYGTTLGLRV--------G 143 Query: 137 KVGIVRLDAFKTSKEI---QFAVQSGPMLMENGVI------NPRIHPNVASSKIRNGVGI 187 + + F++ GP L+ G + P V R +G Sbjct: 144 NQPELVTARVDGKPDWGQHWFSLTGGPRLLRKGKVWLAPRSEGFTDPRVMGVAHRCAIGF 203 Query: 188 NKHGN-AVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAI-PWQRYPFV 245 G V + + A +A + + + +DG S G I + Sbjct: 204 PASGKKLVLVTFLAPLPLWREAKVMRA-IGCSEAMNIDGGSSSALYHRGRILVNPKRMLT 262 Query: 246 TMISVER 252 I V Sbjct: 263 NAIVVYD 269 >UniRef50_B3PTF7 Putative uncharacterized protein n=3 Tax=Rhizobium RepID=B3PTF7_RHIE6 Length = 325 Score = 100 bits (250), Expect = 4e-20, Method: Composition-based stats. Identities = 36/204 (17%), Positives = 63/204 (30%), Gaps = 17/204 (8%) Query: 27 PLFAVAA-DDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQV 85 P F VA A + V+P R + + L Sbjct: 83 PGFEVAELPVLADGREVDRIFLSRVDPARFRFVTHNAAPGDKGIDEWEKTLP------NA 136 Query: 86 QMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDA 145 + +NG +D+ P +I G + G F + D Sbjct: 137 VLIVNGSYFDKHGRPDTPFISEGIAMGPRQYDARAGAFTADKDTAEIRD-----LSHQDW 191 Query: 146 FKTSKEIQFAVQSGPMLM-ENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA-TN 203 A+ S P+L+ ++G + + R V + G V +++A + Sbjct: 192 QTAFVGASNAMVSYPLLIGDDGQTHVNVK--SRWLANRTFVAKDDLGRVVIGTTKEAFFS 249 Query: 204 FYDFACYAK-AKLNVEQLLYLDGT 226 A + K + LN++ L LDG Sbjct: 250 LDRLAQFLKTSPLNLKVALNLDGG 273 >UniRef50_B8I064 Exopolysaccharide biosynthesis protein n=1 Tax=Clostridium cellulolyticum H10 RepID=B8I064_CLOCE Length = 383 Score = 100 bits (249), Expect = 4e-20, Method: Composition-based stats. Identities = 30/222 (13%), Positives = 65/222 (29%), Gaps = 31/222 (13%) Query: 47 AYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYA----PLG 102 ++ R + ++ E+ G + + + + + Sbjct: 148 VLILDKMGARFETFYSNIFLESKGNRVKINEMNRVGKNDDIILYIDKFGNTNRAEVKSTS 207 Query: 103 LYIENGQQKVALNLASG------------EGNFFIRPGGVFYVAGDKVGIVRLDAFKTSK 150 L ++N + + G+ P + GDKV I + Sbjct: 208 LIVDNNKIISIIESTKEVNIKKGMYVISFYGDKSSLPDKIGLKTGDKVNIRIEPYLGYNY 267 Query: 151 EIQFAVQSGPMLMENGVINP----RIHPNVASSKIRNGVGINKHGNAVFLLSQQA----- 201 A + G ML++NG + + + R +GI +G V +++ Sbjct: 268 ---QAYECGSMLVKNGKSVVPERDKWAGTLGNRDPRTVIGIKTNGKIVLVVADGRQPGYS 324 Query: 202 --TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQR 241 + + K+ V LDG + + G I + Sbjct: 325 EGMTGKEMGEFL-VKIGVRDAAMLDGGATSQMIINGRIQNRP 365 Score = 52.0 bits (123), Expect = 2e-05, Method: Composition-based stats. Identities = 16/88 (18%), Positives = 32/88 (36%), Gaps = 2/88 (2%) Query: 29 FAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMA 88 + ++ + +P+ ERV+ + +G L+DI + A Sbjct: 59 VQYKSTTETINGYKQEIYMLEFDPRDERVEFKPALSFDNIFGF--EKLSDICKRNGAYAA 116 Query: 89 MNGGIYDESYAPLGLYIENGQQKVALNL 116 +NGG + + P G+ +GQ Sbjct: 117 VNGGFFYQFGDPAGMVAIDGQMLTTSTG 144 >UniRef50_A6TKB7 Exopolysaccharide biosynthesis protein n=1 Tax=Alkaliphilus metalliredigens QYMF RepID=A6TKB7_ALKMQ Length = 236 Score = 100 bits (249), Expect = 4e-20, Method: Composition-based stats. Identities = 32/221 (14%), Positives = 71/221 (32%), Gaps = 20/221 (9%) Query: 39 SDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADI--NSQGQVQMAMNGGIYDE 96 T+ V P+ V + + + + + + A+NGG +D Sbjct: 6 RRYDTTIHVLEV-PKQ-GVVIMPCLGDRTKRQPVQQIRHSYFEANGYKRIGAVNGGFFDG 63 Query: 97 SYA-PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFA 155 + P G++ + ++ + A + G ++ ++ K+ +A Sbjct: 64 NRTLPYGMFYVDSGFLLSESWAGDAFLELVHENGKLHIDDITANQLKTKY----KKANWA 119 Query: 156 VQSGPMLMENGVINP---RIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFY------D 206 + L+ G +N P S R +G + N +F++++ + Sbjct: 120 ISLSYSLVVGGKMNIMKGDKFPFTNQSHPRTLIG-DNQENYIFVVTEGRMTKEKGLTAVE 178 Query: 207 FACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTM 247 A +L + DG S G I + Y + Sbjct: 179 SARVML-ELGCNTAINADGGGSSAMDVEGKIQNKYYDNRAV 218 >UniRef50_B5RQG1 Uncharacterized conserved protein n=20 Tax=Borrelia RepID=B5RQG1_BORRA Length = 269 Score = 100 bits (249), Expect = 4e-20, Method: Composition-based stats. Identities = 28/217 (12%), Positives = 54/217 (24%), Gaps = 31/217 (14%) Query: 31 VAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEA----WGTLHALLADINSQGQVQ 86 + + V + + +K K + + + +V Sbjct: 29 LQPKYEIIKGSFQESNYVIVKIKNKDLKFIISKPIYDTKMNNYYFKGQTTSQFLISNKVD 88 Query: 87 MAMNGGIYDESYA---PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRL 143 +A+N Y P G+YI N + G I+ Sbjct: 89 IAINTSPYTIKGTMFYPNGIYIYNKKLISHAKKDQGIII------------IKNNQIILN 136 Query: 144 DAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHG-NAVFLLSQQA- 201 K + L++NG N R +G +K + + + Sbjct: 137 PKHNEIKNSDYGFGGFFSLIKNGKYTKNFKEN---KHPRTIIGTDKENKHLYLITVEGRG 193 Query: 202 ------TNFYDFACYAKAKLNVEQLLYLDGTISHMYM 232 + + A V + LDG S + Sbjct: 194 TNNSKGISLNE-AIDLSLSYGVTNSINLDGGGSSTLV 229 >UniRef50_C9LSB3 Putative secreted protein n=1 Tax=Selenomonas sputigena ATCC 35185 RepID=C9LSB3_9FIRM Length = 475 Score = 100 bits (248), Expect = 6e-20, Method: Composition-based stats. Identities = 22/196 (11%), Positives = 56/196 (28%), Gaps = 23/196 (11%) Query: 76 LADINSQGQVQMAMNGGIYDESYAPLG--LYIENGQQ--KVALNLASGEGNFFIRPGGVF 131 + + + + + G + G+ + + + I G Sbjct: 273 VDAERGEDSLVIYNHYYGSTTRTNEYGQEYIVRGGRVAAVNSSDSPIPKDGLVISVHGKA 332 Query: 132 YVAGDKVGIVRLDAFKTSKEIQF-----AVQSGPMLMENGVIN-----PRIHPNV-ASSK 180 A +V + + + + +GPML+++G+ + P++ Sbjct: 333 KDAFSQVKVGDAVRVAETIGAPWESLPTVIGAGPMLVKDGIAHVTATEEEFPPDIARGRA 392 Query: 181 IRNGVGINKHGNAVFLLSQQA------TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKG 234 R G+ G+ + + + A + + Q + DG S + G Sbjct: 393 PRTAFGVTAEGHYLLAVVDGRQPHSIGCTLQEMAEFML-QFGAVQAINFDGGGSSALVVG 451 Query: 235 GAIPW-QRYPFVTMIS 249 G + + Sbjct: 452 GELENSPSDGQERAVG 467 Score = 42.4 bits (98), Expect = 0.013, Method: Composition-based stats. Identities = 16/127 (12%), Positives = 31/127 (24%), Gaps = 6/127 (4%) Query: 24 TLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQG 83 T P ++ + LT +P R + + ++D+ Sbjct: 149 TPAPGLKLSTLKRLDARGRLTGWVLEADPARYR-AVPVLAKGAVPGRASVSAMSDMA--- 204 Query: 84 QVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRL 143 A+N + + LGL +G F G Y Sbjct: 205 GADAAINASYFAPNGEILGLLKMDGTIVGTTYFRRSAVGFA--ADGRAYFGPVDYSGTVT 262 Query: 144 DAFKTSK 150 ++ Sbjct: 263 LGKRSWP 269 >UniRef50_UPI0001C164F4 hypothetical protein CRD_01886 n=2 Tax=Nostocaceae RepID=UPI0001C164F4 Length = 300 Score = 99.8 bits (247), Expect = 7e-20, Method: Composition-based stats. Identities = 32/249 (12%), Positives = 74/249 (29%), Gaps = 27/249 (10%) Query: 20 FLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLA-- 77 + + A + + ++ + + + N + + Sbjct: 39 LTPTIIPTVRAYRSQI-----NGIPFYQTIIDLEDPNILLTIGLPNSANFANTISRTNGD 93 Query: 78 ----DINSQGQVQMAMNGGIY--DESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVF 131 + ++ + +NG + +G + G+ + + G Sbjct: 94 ENFDQLVARSGAAVVVNGTFAYTNPQKTVMGNLVAGGRSLKYSPWENFGTTLGLGVGNKP 153 Query: 132 YVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRI------HPNVASSKIRNGV 185 + +V + F++ SGP L+ NG ++ P V + +R + Sbjct: 154 EMITARVEGRPEWN-----KHWFSITSGPRLLRNGEVSVNPRLEGFKDPAVLGTSLRTAI 208 Query: 186 GINKHGNAVFLL-SQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQR-YP 243 G ++ G +FL + + A KA + + + LDG S I Sbjct: 209 GFSEDGKRLFLANFDEKLYLEEEAEAMKA-IGCYEAMNLDGGPSRALASDNVILVPPARK 267 Query: 244 FVTMISVER 252 +I V Sbjct: 268 LTNVILVYD 276 >UniRef50_C9KQW2 Putative secreted protein n=2 Tax=Veillonellaceae RepID=C9KQW2_9FIRM Length = 503 Score = 99.8 bits (247), Expect = 8e-20, Method: Composition-based stats. Identities = 29/200 (14%), Positives = 55/200 (27%), Gaps = 27/200 (13%) Query: 80 NSQGQVQMAMNGGIYDESYAPLG--LYIENGQQ--KVALNLASGEGNFFIRPGGVFYVAG 135 + + G + G+ + I G+ Sbjct: 305 RGADSLVIYNRAYGSSTGTNEYGREYIVRGGRVTDIRQNDSPIPADGVVISVHGMAADEL 364 Query: 136 DKVGI-----VRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVA------SSKIRNG 184 V + + + + + F + GP L+ENG ++ + R+ Sbjct: 365 GGVQVGDPVMIEENLGDGWQNMDFIIGCGPRLVENGRVHVTVDEEDFPADIRIGRAPRSA 424 Query: 185 VGINKHGNAVFLLSQQAT------NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIP 238 VGI K G + + D+A K + L LDG S + G + Sbjct: 425 VGITKDGRYLLAVVDGRQSHSVGLTLTDWAKLL-VKFGAQDALNLDGGGSSDLVVNGDVQ 483 Query: 239 W-----QRYPFVTMISVERK 253 Q + + +K Sbjct: 484 NSPSDGQERLVGDGLVLVKK 503 >UniRef50_A0YL57 Putative uncharacterized protein n=1 Tax=Lyngbya sp. PCC 8106 RepID=A0YL57_9CYAN Length = 620 Score = 99.0 bits (245), Expect = 1e-19, Method: Composition-based stats. Identities = 24/125 (19%), Positives = 40/125 (32%), Gaps = 12/125 (9%) Query: 135 GDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPR------IHPNVASSKIRNGVGIN 188 G +V I + + +GP+L+++G I IR+ VG Sbjct: 485 GTEVKIESQTDPPIWETYPQILAAGPLLLQSGEIVLDAPSERFSEAFSNQQAIRSAVGRT 544 Query: 189 KHGNAVFLLSQQAT-----NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYP 243 + + N + A + KL + L LDG S GG + + Sbjct: 545 PDNKLLLVAVHNRPLGSGPNLTELAQILQ-KLGAVEALNLDGGSSTSLYLGGELIDRPAQ 603 Query: 244 FVTMI 248 I Sbjct: 604 TAAPI 608 Score = 61.3 bits (147), Expect = 3e-08, Method: Composition-based stats. Identities = 22/147 (14%), Positives = 42/147 (28%), Gaps = 13/147 (8%) Query: 23 LTLLPLFAVAADDCALSD---------PTLTVQAYTVNPQTERVKMYWQKANGEAWGTLH 73 + +P + + V ++ ++ + + + L Sbjct: 271 ILWMPGLRWRQQYIEIPNSQPTASSLPNRFPVFWLEIDLTAPQLSLKPILSRNTSRVGLA 330 Query: 74 ALLADINSQGQVQMAMNGGIYDESYA-PLGLYIENGQQKVALNLASGEGNFFIRPGGVFY 132 LL S+ Q A+NGG ++ + PLG G+ + L G G Y Sbjct: 331 PLLKT-ASRSQALAAINGGFFNRNTLFPLGAIRRQGRWLSSPILN--RGAIGWTDQGEIY 387 Query: 133 VAGDKVGIVRLDAFKTSKEIQFAVQSG 159 + + A IQ Sbjct: 388 LDRLTRFETLITATGNRFPIQHLNSGY 414 >UniRef50_C6LDL7 Putative uncharacterized protein n=1 Tax=Bryantella formatexigens DSM 14469 RepID=C6LDL7_9FIRM Length = 400 Score = 99.0 bits (245), Expect = 1e-19, Method: Composition-based stats. Identities = 34/225 (15%), Positives = 59/225 (26%), Gaps = 23/225 (10%) Query: 39 SDPTLTVQAYTVNPQTERVKMYWQKANGEAWG-TLHALLADINSQGQVQMAMNGGIYDES 97 + + + + +G L+ ++ + + +A+NG Y + Sbjct: 170 EKYGTQISYVLADIYVGDITCLRTAFAQDTYGVGYSEKLSGMSDRMKAVLAVNGDSYSNN 229 Query: 98 Y-APLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAV 156 G I NG + + V G + A Sbjct: 230 RHRNNGTIIRNGVIYRSQATDAETC--------VLNWDGTMDIYTPDQMDIQKLIERGAY 281 Query: 157 Q---SGPMLM-ENGVINPRI--HPNVASSKIRNGVGINKHGNAVFLLSQQA------TNF 204 Q GP L+ ENG + S R +G + G+ LL Sbjct: 282 QSWVFGPSLLDENGKAKDSFLTWDYIRQSHPRTAIGYYEPGHYCLLLVDGRQKASRGMFL 341 Query: 205 YDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMIS 249 + A +L + LDG G Y +S Sbjct: 342 DEMAQLF-EELGCKAAYNLDGGHCSFMNFQGQTANHPYKPEHTVS 385 >UniRef50_B4WS35 Putative uncharacterized protein n=1 Tax=Synechococcus sp. PCC 7335 RepID=B4WS35_9SYNE Length = 687 Score = 98.6 bits (244), Expect = 2e-19, Method: Composition-based stats. Identities = 29/204 (14%), Positives = 57/204 (27%), Gaps = 42/204 (20%) Query: 86 QMAMNGGI-----------YDESYAPLG-----LYIENGQQK-VALNLASGEGNFFIRPG 128 +A+N G + +Y P+ + ++N + +G + I Sbjct: 473 ILAVNSGYVKAGIGRYTEGWGSTYTPIVDNEIIVTVQNHEVIAQKSMGKAGSSSVPIPRD 532 Query: 129 GVFYV-------------AGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPR---- 171 G + + G V + ++ + GP+L+ + I Sbjct: 533 GGYLLALRSYRSAGQSFQPGTPVLLSSQSQPAVFEQYPNMIGGGPLLVRDRNIVLNPQLE 592 Query: 172 --IHPNVASSKIRNGVGINKHGNAVFLLSQQA-----TNFYDFACYAKAKLNVEQLLYLD 224 + + R VG G + + A K +L L LD Sbjct: 593 GFSTNFIQGAAPRTAVGKTSDGTWIIATMHDRVGGRGPTLTETAYIMK-QLGAVDALNLD 651 Query: 225 GTISHMYMKGGAIPWQRYPFVTMI 248 G S GG + + + Sbjct: 652 GGSSSSLYLGGQLLNRHPRTAARV 675 Score = 52.8 bits (125), Expect = 1e-05, Method: Composition-based stats. Identities = 19/138 (13%), Positives = 37/138 (26%), Gaps = 4/138 (2%) Query: 23 LTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQ 82 + P ++ V + V P + + + A + ++ + Sbjct: 346 VAWAPGLRWRQQYINVNQHRFPVYMFIVRPNPDALTLRPIHAASNTAIGIEPIVTT-AKR 404 Query: 83 GQVQMAMNGGIYDESYA-PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIV 141 Q A+N G ++ + PLG GQ L G G G + + Sbjct: 405 AQAIGAVNAGFFNRNNQLPLGAVRSAGQWISGPIL--GRGAMAWNDSGELVIDRFALSES 462 Query: 142 RLDAFKTSKEIQFAVQSG 159 + I Sbjct: 463 VTTGVGEAFPILAVNSGY 480 >UniRef50_Q5N4C8 Putative uncharacterized protein n=2 Tax=Synechococcus elongatus RepID=Q5N4C8_SYNP6 Length = 605 Score = 98.6 bits (244), Expect = 2e-19, Method: Composition-based stats. Identities = 24/170 (14%), Positives = 50/170 (29%), Gaps = 25/170 (14%) Query: 103 LYIENGQQKVALNLASGEGN-FFIRPGGVFYV------------AGDKVGIVRLDAFKTS 149 + ++ + N F I G V G + +++ Sbjct: 420 VTVQGDRVVSQSQADKAGSNRFTIPRNGYLIVLRSANSLRTSLVNGTTIQVLQQAQPSQF 479 Query: 150 KEIQFAVQSGPMLMENGVINPRIHPNVASSK------IRNGVGINKHGNAVFLLSQQ--- 200 A+ GP+L+++G + S R+ +G+ G V + + + Sbjct: 480 DRFPHALGGGPLLVKSGRVVVNPQAEGFSRAFEIEAAPRSAIGLMPDGRLVLVAAHEQNQ 539 Query: 201 --ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMI 248 A + +L V L DG S + G + + + Sbjct: 540 GQGPTLPQMAAIMQ-QLGVVDALNFDGGSSTSLIVNGQLVNRARGSAARV 588 Score = 55.9 bits (133), Expect = 1e-06, Method: Composition-based stats. Identities = 18/149 (12%), Positives = 38/149 (25%), Gaps = 8/149 (5%) Query: 18 RIFLALTLLP----LFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLH 73 LT P + + T V ++ + V++ A + Sbjct: 251 SNPAPLTPAPPDLAGTQLQQRQVTVDGATFPVFVIQLDLRQPNVRLAPIWAGNGSLEGT- 309 Query: 74 ALLADINSQGQVQMAMNGGIYDESYA-PLGLYIENGQQKVALNLASGEGNFFIRPGGVFY 132 +L + +A+N G ++ + PLG + L G G Sbjct: 310 QVLQAVARDRGAAIAINAGFFNRNNRLPLGAIRRDNIWYSGPILN--RGAMAWNDQGEVL 367 Query: 133 VAGDKVGIVRLDAFKTSKEIQFAVQSGPM 161 + + + T + Sbjct: 368 IDRLGLQETLQLSSGTRIPLVALNSGYVR 396 >UniRef50_A7LVE9 Putative uncharacterized protein n=1 Tax=Bacteroides ovatus ATCC 8483 RepID=A7LVE9_BACOV Length = 332 Score = 98.6 bits (244), Expect = 2e-19, Method: Composition-based stats. Identities = 38/256 (14%), Positives = 65/256 (25%), Gaps = 57/256 (22%) Query: 37 ALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE 96 L N + GE G + + + +NGG + E Sbjct: 75 TLEGKKAIAYIAVGNMSKATFGVL-----GEKTG--LKKPKEFYEENNSTIVINGGFFYE 127 Query: 97 SYAPLGLYIENGQQK--VALNLASGEGN----------FFIRPGGVF------------- 131 L L NG+ A N F G F Sbjct: 128 G--SLSLIWRNGEMVCKNNDVTAEDWTNGPFWYPVLAAFCEMNDGSFKSMWTYTTLSNVT 185 Query: 132 ----YVAGDKVGIVRLDAFKTSKEIQFA---VQSGPMLMENGVINPRIHPNV------AS 178 + K + F ++ + A + GP+L+ +G I + Sbjct: 186 YWYSEPSPVKSETTPDENFPSTGTVLNAKTGIGGGPVLLLDGNIKNTYEEEILSDIGATV 245 Query: 179 SKIRNGVGINKHGNAVFLLSQQ--------ATNFYDFACYAKAKLNVEQLLYLDGTISH- 229 ++ R+ +GI + + + + A K L + LDG S Sbjct: 246 NRPRSAIGITNDKKMILFVCEGDGMTTGVAGMTTENVANIMK-TLGCTDAINLDGGGSSC 304 Query: 230 MYMKGGAIPWQRYPFV 245 M + G Sbjct: 305 MLVNGQETIKTSDSSG 320 >UniRef50_B0CAS6 Putative uncharacterized protein n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0CAS6_ACAM1 Length = 279 Score = 98.6 bits (244), Expect = 2e-19, Method: Composition-based stats. Identities = 37/231 (16%), Positives = 67/231 (29%), Gaps = 47/231 (20%) Query: 44 TVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQG-QVQMAMNGGIYDESYAPLG 102 TV + P R + +G +AD + +NGG +D + Sbjct: 34 TVHVLRI-PNHPRYTVRLDVVDGL------QTVADFAQGTPKPVAVINGGYFDPANQLTT 86 Query: 103 LYIENGQQKVALNLASGEGNFFIRPGGVFYVAG----------------DKVGIVRLDAF 146 YI G Q +A + P Y+ Sbjct: 87 SYIRRGGQILADPTQNSR--LVDNPDLKVYLPKILNRSEFRQYQCGAKTTYAITSYNQPI 144 Query: 147 KTSKEIQFAVQSGPMLM-------------ENGVINPRIHPNVASSKIRNGVGINKHGNA 193 + +A+ +GP L+ +G + R R+ VGI G Sbjct: 145 PPDCTLNYALGAGPQLLPQLTSQAEGFTDSVDGQVI-RDAIGSRQPNARSAVGITDKGEV 203 Query: 194 VFLLSQQ------ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIP 238 +++L +Q + + A + + + L LDG S + + Sbjct: 204 IWVLVEQQSATKPGLSLPELADFMEQQ-GAASALNLDGGSSSSLVYQDQVI 253 >UniRef50_Q8RCE6 Putative uncharacterized protein n=5 Tax=Thermoanaerobacterales RepID=Q8RCE6_THETN Length = 815 Score = 98.2 bits (243), Expect = 2e-19, Method: Composition-based stats. Identities = 37/223 (16%), Positives = 64/223 (28%), Gaps = 29/223 (13%) Query: 24 TLLPLFAVAADDCALS-DPTLTVQAYTVNPQTERVKMY------WQKANGEAWGTLHALL 76 T P +++ TV +N + + W + + A + L+ Sbjct: 169 TGTPYIDYWTKKMSITLPDGTTVFLAAINKISSTFQYTVMYTRDWYRFSPGANENVPQLV 228 Query: 77 ADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGD 136 + Q + + P E G A + G + PG Sbjct: 229 EVVVDQNDTVIEV------RQGQPSTEIPEGGYVL-AASGDIGNLLLRLSPG-----DKI 276 Query: 137 KVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVF- 195 + I F ++I+ AV G +L++ G I P + R +G K V Sbjct: 277 QKDITTNPPF---EDIKMAVSGGTILVKGGKIYP-FTHEIKGYAARTAIGYTKDKRYVLM 332 Query: 196 LLSQQ----ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKG 234 + + A + L L LDG S Sbjct: 333 VTVDGPPYRGMTQEELASLMLS-LGAYDALNLDGGGSTQMAVR 374 Score = 53.2 bits (126), Expect = 9e-06, Method: Composition-based stats. Identities = 10/102 (9%), Positives = 33/102 (32%), Gaps = 3/102 (2%) Query: 43 LTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE-SYAPL 101 + + ++ + + + + + + ++ + A+NG +D + + Sbjct: 85 ININILKIDLKDPYLDLSVIFSPSGIKERM--PIREMANSYGAVAAINGDFFDTKTGFVI 142 Query: 102 GLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRL 143 G +++G F+I G Y+ + Sbjct: 143 GATVKDGNLITDPASNGKMATFYIDKTGTPYIDYWTKKMSIT 184 >UniRef50_B2UNL7 Putative uncharacterized protein n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UNL7_AKKM8 Length = 249 Score = 98.2 bits (243), Expect = 2e-19, Method: Composition-based stats. Identities = 36/159 (22%), Positives = 62/159 (38%), Gaps = 12/159 (7%) Query: 81 SQGQVQMAMNGGIY--DESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKV 138 + +NGG + D PLGL +++G++ L S + + GG + + Sbjct: 70 RKSPCVAGVNGGFFSADAGGTPLGLVVQDGKRLSPLATGSFAVSGVVYEGGRDGLTLVRS 129 Query: 139 GIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLS 198 ++R + +Q A+Q GP L+ENG ++ S R + + +S Sbjct: 130 SVLR--RMRRLPAMQAAIQGGPFLVENGSAVKGLNA--QKSTYRTFIATDGGRRWCIGVS 185 Query: 199 QQATNFYDFACYAKAK--LN---VEQLLYLDGTISHMYM 232 + A + A L VE L LDG S + Sbjct: 186 SS-LTLKELAAWLAAPGALGNFRVETALNLDGGSSSAFW 223 >UniRef50_Q8YKN4 All7259 protein n=2 Tax=Cyanobacteria RepID=Q8YKN4_ANASP Length = 245 Score = 98.2 bits (243), Expect = 2e-19, Method: Composition-based stats. Identities = 34/236 (14%), Positives = 62/236 (26%), Gaps = 40/236 (16%) Query: 45 VQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE-SYAPLGL 103 + P + A + + + + + N G +D + Sbjct: 2 AHILLI-PANSPFVVT------GALSAKVSTVEEFAQKHRAFAIFNAGFFDPANQKSTSY 54 Query: 104 YIENG----------QQKVALNLASGEGNFFIRPGGVFYVAGDKVG---IVRLDAFKTSK 150 + G + L F R Y+ G + ++ + Sbjct: 55 VVVTGQMVADPKDNERLVNNPQLKPYLNLIFNRSEFRRYLCGQTTRYDITLHNESPPANC 114 Query: 151 EIQFAVQSGPMLMENGVINP-RIHPNVASS--------KIRNGVGINKHGNAVFLLS--- 198 + A+ +GP L+ P N R VGI G+ + ++ Sbjct: 115 RLVDAIGAGPRLLPKLTSVPEGFVDNAKGRDALLSKQLNARTAVGITSEGSIILVMVAQK 174 Query: 199 -----QQATNFYDFACYAKAKLNVEQLLYLDGT-ISHMYMKGGAIPWQRYPFVTMI 248 + A K KL + LDG S +Y G A + I Sbjct: 175 PSKPKNSGISLVQLADLMK-KLGASAAMNLDGGSSSSLYYNGKAFYGKFDLQGNPI 229 >UniRef50_A7C442 Putative uncharacterized protein n=1 Tax=Beggiatoa sp. PS RepID=A7C442_9GAMM Length = 299 Score = 97.9 bits (242), Expect = 3e-19, Method: Composition-based stats. Identities = 27/230 (11%), Positives = 66/230 (28%), Gaps = 22/230 (9%) Query: 24 TLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVK-MYWQKANGEAWGTLHALLADINSQ 82 TL + + + + +V+ ++ + + + ++ Sbjct: 34 TLFEGITYIREVRQ-TPRPIIIHFISVDLTKPNIRFLVTPGEVRDDGEIGARTTSQFLTE 92 Query: 83 GQVQMAMNGGIYDESYAPL-------GLYIENGQQKVALNLASGEGNFFIRPGGVFYVAG 135 ++Q+A+NG + + PL + G + LAS G + + F Sbjct: 93 FKLQLAINGNFFYP-FHPLFSVDFWNAYPKKRGDPVYVVGLASSHGQVYSQTKKSFETLY 151 Query: 136 DKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPN--VASSKIRNGVGINKHGNA 193 + A+ + ++ G I R + ++K Sbjct: 152 ISADNQARFQTSIGP-LYHAISGRELFIKQGKIQGPFPKGAFNEKPYPRTALALDKTAKT 210 Query: 194 VFL-LSQQ-------ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGG 235 + + + + A ++ + L LDG S + G Sbjct: 211 LMIFVVDGKLKNYSEGVTLMELADIVQS-YGADMALNLDGGGSSTLVMEG 259 >UniRef50_UPI00016C4EC3 hypothetical protein GobsU_32169 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4EC3 Length = 279 Score = 97.9 bits (242), Expect = 3e-19, Method: Composition-based stats. Identities = 29/228 (12%), Positives = 67/228 (29%), Gaps = 33/228 (14%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGE-AWGTLHALLADINSQG 83 L + + + A ++ + + NG+ T + + Sbjct: 33 LFSGVELTD-LIGDTPRLMKGHAVRIDLKAAGIGFLATPGNGDRPGETDGLKTSTFLKRH 91 Query: 84 QVQMAMNGGIYDESYA-------PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGD 136 ++Q+A+N + + +G+ + G+ + + Sbjct: 92 KLQLAINAAPFGPIHKDEEKEQDVVGVQVSGGKLVSPAQPGYPA---------LLLAKDN 142 Query: 137 KVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGN-AVF 195 + I I+ AV ++++ G + S R G++ G V Sbjct: 143 RARIAAPPFDLE--GIENAVGGFHIVLKGGEVLT----GDKSIHPRTAAGVSADGKTLVL 196 Query: 196 LLSQQAT-------NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGA 236 L+ + + KA L + + LDG + + GA Sbjct: 197 LVIDGRQKDFSDGATTAEVGEWLKA-LGCAEGINLDGGGTTTLVVAGA 243 >UniRef50_C2FS46 Putative uncharacterized protein n=2 Tax=Sphingobacterium spiritivorum RepID=C2FS46_9SPHI Length = 341 Score = 97.5 bits (241), Expect = 4e-19, Method: Composition-based stats. Identities = 36/282 (12%), Positives = 70/282 (24%), Gaps = 37/282 (13%) Query: 4 QLLIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQK 63 Q +I I + + + +++ ++ ++ Sbjct: 64 QSIISNTDIIKTFRLDSTTTVADGIVHTHIRYLNRLNLPVSMHVLEIDLSKPKLAAQALG 123 Query: 64 ANGEAWGTLHALLADIN-----SQGQVQMAMNGGIYDESYA----PLGLYIENGQQKVAL 114 E L S G++ +A+NG S P G YI G+Q Sbjct: 124 PFNEVLYATQILPEMAKYNESGSGGKMMVAINGDAVLTSGTTVNAPSGSYIRYGRQIKTN 183 Query: 115 NLASGEGN---FFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPR 171 + F + GV ++ +A + I V L+ N + Sbjct: 184 TTTATAFTIPYFAVTKAGVPFIGNRPSATYPAEAVDLNT-IYHLVSGTNWLVFNNNLITS 242 Query: 172 IHPNVASSKIRNGVGINKHGNAVFLLSQQAT-------NFYDFACYAKAKLNVEQLLYLD 224 V R +GIN + ++ D K L + + + Sbjct: 243 TTATV---SARTAIGINADKKVICVVVDGGDDAFSTGITLNDLGIVMK-TLGSSRAFFTN 298 Query: 225 GTISHMYMKGGA-------------IPWQRYPFVTMISVERK 253 G +K + I + Sbjct: 299 GGNFSAMVKRKEDAKGLRWDMLNRPVNKTGSATANGIGFVLR 340 >UniRef50_C1ABL2 Putative uncharacterized protein n=1 Tax=Gemmatimonas aurantiaca T-27 RepID=C1ABL2_GEMAT Length = 311 Score = 97.1 bits (240), Expect = 5e-19, Method: Composition-based stats. Identities = 36/239 (15%), Positives = 84/239 (35%), Gaps = 28/239 (11%) Query: 29 FAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMA 88 +A + TV ++P + + + G A + N+ +A Sbjct: 68 WAEWPVQLGARGISTTVIVVDIDPARIALTLEIARD-----GDALAPWSLDNAPKDAVIA 122 Query: 89 MNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKT 148 +N G + + P G + ++ A + F I G + + + Sbjct: 123 LNAGQFTDDG-PWGWVVHRQREWQAPGVGPLSAAFVIDTAGRAAILRAD----EIAEARR 177 Query: 149 SKEIQFAVQSGPMLMENGVINPRI----HPNVASSKIRNGVGINKHGNAVFLLSQQ---- 200 + A+QS P+++ +G + P + ++ IR +G+ G+ + L++ Sbjct: 178 RGGWEEALQSFPLILNDGALPPGLCAPGAVDLEHRDIRLTLGVLPDGHVLLALTRYAGVG 237 Query: 201 --------ATNFYDFACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRYPFVTMISV 250 + A + +L V + + LDG +S + ++ G + Q + + Sbjct: 238 SAGNRLPIGPTTGEMATIMR-ELGVARAVMLDGGLSAQLLVRDGPVTTQWHGLRRVPLA 295 >UniRef50_B8HPB3 Putative uncharacterized protein n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HPB3_CYAP4 Length = 304 Score = 97.1 bits (240), Expect = 5e-19, Method: Composition-based stats. Identities = 43/276 (15%), Positives = 76/276 (27%), Gaps = 53/276 (19%) Query: 5 LLIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTL-----TVQAYTVNPQTERVKM 59 L+IG G+I + T L + + P ++ Sbjct: 11 LVIGLGLIGSTTACTQTSTTASSAPVAPTPPQPLQYKVYSLPHSKIHTLVI-PAGSTYEV 69 Query: 60 YWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE-SYAPLGLYIENGQQKVALNLAS 118 A LA Q Q +NGG +D + + G+Q Sbjct: 70 TAAIAPD------VQPLATFAQQHQAIAVLNGGFFDPVNGKSTSHVVLAGKQVANPQDNE 123 Query: 119 GEGNFFIRPGGVFYV---------------AGDKVGIVR-LDAFKTSKEIQFAVQSGPML 162 P + Y+ + I R E+ A+ +GP L Sbjct: 124 RLIQ---NPDLIPYLPLILNRSELRNYRCAGQIRYEISRHDKPIPPGCELLMALGAGPQL 180 Query: 163 MEN------------GVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQ--------QAT 202 + G R R+ +G+ G+ V+L+ Sbjct: 181 LPQNTSVQEGFMAYSGETITRDSLGSLYPNARSAIGLKADGSLVWLMVAERSDANQPGGL 240 Query: 203 NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIP 238 + + A + ++ L V + + LDG S + G Sbjct: 241 SLPELAQFMQS-LGVVKGMNLDGGSSASFYYQGQTH 275 >UniRef50_A0LEU6 Putative uncharacterized protein n=1 Tax=Syntrophobacter fumaroxidans MPOB RepID=A0LEU6_SYNFM Length = 300 Score = 97.1 bits (240), Expect = 6e-19, Method: Composition-based stats. Identities = 32/210 (15%), Positives = 66/210 (31%), Gaps = 12/210 (5%) Query: 43 LTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLG 102 + ++P+ K+ N T + A+N G+Y E Sbjct: 76 YRITVVRIDPRYYAFKLINASENTREKMTAREWSRQF----NLIAAVNAGMYQEDGLASV 131 Query: 103 LYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFK-TSKEIQFAVQSGPM 161 Y++N L + P G V ++ F ++ + VQS M Sbjct: 132 GYMKNFDHVNNPRLGRDKTVLAFNPSG-PDVPEVQIIDRECQDFNSLRQKYRTFVQSIRM 190 Query: 162 LMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKA-KLNVEQL 220 + + R S+ +G ++ G + L + +DF L++++ Sbjct: 191 ISCDRKNVWRQQAGRWSTV---AIGTDETGKVLLLFCRSPITVHDFIEVLLTLPLSLQRA 247 Query: 221 LYLDGTISHMYMK--GGAIPWQRYPFVTMI 248 +YL+G G + + + Sbjct: 248 MYLEGGPQASLYLSTGKTTLERYGSWEPAL 277 >UniRef50_A7M0H0 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=A7M0H0_BACOV Length = 354 Score = 96.7 bits (239), Expect = 6e-19, Method: Composition-based stats. Identities = 32/238 (13%), Positives = 67/238 (28%), Gaps = 47/238 (19%) Query: 33 ADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGG 92 +S V ++ + + K+ + NG++ + +NGG Sbjct: 96 ERYDDVSKAQQIVNVLEIDLLSNKYKVEFTYNNGDSL-------STTAQVRGAIGGINGG 148 Query: 93 IYDESYAPLGLYIE-NGQQKVALNLASGEGNFFIRPGGVFYVAGD-KVGIVRLD------ 144 E+ +YI NG + L G + + G Y G +GI+ Sbjct: 149 YEQEA-----IYIRINGTNISEVTLPEGH-LRYWKHDGALYSDGKSDIGIIYGGRNGKAA 202 Query: 145 -AFKTSKEIQFAVQSGPMLMENGVI------------------NPRIHPNVASSKIRNGV 185 ++ + S P L+++ + R V Sbjct: 203 IDTYKQHSAKYLLASAPTLIDDYNPLGETFVGNYTMEQLESFDYEDYRRHQGVRHPRTVV 262 Query: 186 GINKHGNAVFLLSQQAT-------NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGA 236 + + + + + + + + K N + L +DG S G Sbjct: 263 AVTEDKDLLLVTIDGRWAGKAEGMSAKEVTLFLKKHFNPQYALNMDGGGSTTMYVKGK 320 >UniRef50_B3QZA6 Putative uncharacterized protein n=1 Tax=Chloroherpeton thalassium ATCC 35110 RepID=B3QZA6_CHLT3 Length = 280 Score = 96.7 bits (239), Expect = 6e-19, Method: Composition-based stats. Identities = 28/197 (14%), Positives = 60/197 (30%), Gaps = 8/197 (4%) Query: 44 TVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGL 103 + +NP+ K+ + + ++ A Q + A+N G++ Sbjct: 59 QIYVIRINPEHYAFKLMCASEHAKTPLSVKAW----CKQHGLISAINAGMFQADMLSAVS 114 Query: 104 YIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLM 163 ++N L+ F P + + Q Q M+ Sbjct: 115 LMKNFAHINNPRLSKDNTIFAFNPTKKDLPKAQIIDRTVQNYDALKSVYQSQFQGIRMIA 174 Query: 164 ENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKA-KLNVEQLLY 222 + P+ S +G + GN +F+ S+ +DF +++++ +Y Sbjct: 175 PGRKNVWQEQPDEWSIA---ALGSDGDGNILFIFSRSPYTVHDFINILLELPIDIQRAMY 231 Query: 223 LDGTISHMYMKGGAIPW 239 LDG Sbjct: 232 LDGGAVAQLYFSNKHIE 248 >UniRef50_UPI0001C16068 conserved hypothetical protein n=2 Tax=Nostocaceae RepID=UPI0001C16068 Length = 613 Score = 96.7 bits (239), Expect = 7e-19, Method: Composition-based stats. Identities = 36/251 (14%), Positives = 64/251 (25%), Gaps = 53/251 (21%) Query: 52 PQTERVKMYWQKANGEAWGTLHALLADINSQGQ-----VQMAMNGGI-----------YD 95 P R + W +G L I Q + +N G + Sbjct: 367 PILNRGAIAWNYQGEFYFGRLSLNETLIVDQDNKQTSLPVLFLNSGYVQNGIARYTFAWG 426 Query: 96 ESYAPLG-----LYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRL------- 143 +Y PL + ++NG+ + G I G + Sbjct: 427 PNYVPLTNNETIITVQNGKI---TKQSPPGGAISIPGDGYLLILRGTAVSKTSLLSVGTK 483 Query: 144 ------DAFKTSKEIQFAVQSGPMLMENGVINPR------IHPNVASSKIRNGVGINKHG 191 + +GP+L++N I + +R+ + + Sbjct: 484 VNLESSTTPGEFNTYPHIIGAGPLLIQNQRIVVDAKAEKFSQAFIKERAVRSAICTTNND 543 Query: 192 NAVFLLSQQA-----TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFV- 245 N + + A + K+ L LDG S GG + + Sbjct: 544 NLILAAVNNRVGGWGPTLEEHAQLMQ-KIGCTNALNLDGGSSTSLYLGGQLLDRFPNTAA 602 Query: 246 ---TMISVERK 253 I V K Sbjct: 603 RVHNGIGVFLK 613 Score = 63.6 bits (153), Expect = 7e-09, Method: Composition-based stats. Identities = 30/202 (14%), Positives = 54/202 (26%), Gaps = 38/202 (18%) Query: 23 LTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQ 82 +T L + V +N +T + + N + L + Sbjct: 276 ITWSKGLRWQQKFINLDKDSFPVVWLEINRKTSGLNLQPILPNPQTQTGTAPLTLT-AQR 334 Query: 83 GQVQMAMNGGIYDESYA-PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIV 141 A+NGG ++ + PLG +N Q L G G FY + Sbjct: 335 YSAMAAINGGYFNRNNQLPLGAVRQNDQWISGPILN--RGAIAWNYQGEFYFGRLSLNET 392 Query: 142 RLDAFKTSKE----------------IQFAVQSGP-----------MLMENGVINPRIHP 174 + + ++ GP + ++NG I + P Sbjct: 393 LIVDQDNKQTSLPVLFLNSGYVQNGIARYTFAWGPNYVPLTNNETIITVQNGKITKQSPP 452 Query: 175 NVASSKIRNGVGINKHGNAVFL 196 + I G + L Sbjct: 453 GG-------AISIPGDGYLLIL 467 >UniRef50_UPI0001744905 hypothetical protein VspiD_09365 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001744905 Length = 251 Score = 96.7 bits (239), Expect = 7e-19, Method: Composition-based stats. Identities = 40/227 (17%), Positives = 72/227 (31%), Gaps = 27/227 (11%) Query: 23 LTLLPLFAVAADDCALSDPTLTVQAYTVN-----PQTERVKMYWQKANGEAWGTLHAL-- 75 + + L A L A V P + + A +H Sbjct: 3 VIVETLPAQWTVRSQAGPVKLPGGAIQVKKQLAGPTEAELNLILFTAGKYEMRVVHQPER 62 Query: 76 -----LADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGV 130 LA + NGG + + PLGL + +G + +S G GV Sbjct: 63 DKGVSLATKMRELGAIAGCNGGYFTPDFLPLGLEVSDGVRSGTFQRSSLLG-------GV 115 Query: 131 FYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKH 190 F V + +V D + K + +Q+GP L+ G+ + + R + ++ Sbjct: 116 FLVRHGRPAMVWKDEYIEQKGVTQLLQAGPRLVHAGLPVAGLEA--TKRRARTFILTDQA 173 Query: 191 GNAVFLLSQQATNFYDFACY-----AKAKLNVEQLLYLDGTISHMYM 232 GN + + + ++ V++ L DG S Sbjct: 174 GNWALGTCKS-VTLRELSDLLSTRALLPEVTVKRALNFDGGNSTGLW 219 >UniRef50_B8J2Y6 Putative uncharacterized protein n=2 Tax=Desulfovibrio RepID=B8J2Y6_DESDA Length = 429 Score = 96.3 bits (238), Expect = 8e-19, Method: Composition-based stats. Identities = 34/202 (16%), Positives = 63/202 (31%), Gaps = 9/202 (4%) Query: 28 LFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQM 87 + + L+D + A ++P + + +G L+ Q + Sbjct: 130 EPGLDFGEFQLTDSEALLTALRIDPAHFDFILCARSQDGGNLRPLNQW----AEQYGLTA 185 Query: 88 AMNGGIYDESY-APLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAF 146 A+N +Y G +NG + F P V Sbjct: 186 AINASMYLPDGITSTGYMRQNGHHNNKRVVQRFGAFFVAGPDSPDLPGAAIVDRDDPQWE 245 Query: 147 KTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYD 206 + + + +Q+ M + I P I + V + G +FL +Q Y Sbjct: 246 QRIGQYRLVIQNYRMTSADRRIL--WSPGGPHYSI-SAVAQDGDGRILFLHCRQPVEAYA 302 Query: 207 FA-CYAKAKLNVEQLLYLDGTI 227 FA LNV ++Y++G Sbjct: 303 FAQQLLHLPLNVRTVMYVEGGG 324 >UniRef50_D2RLV8 N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase-like exopolysaccharide biosynthesis protein n=1 Tax=Acidaminococcus fermentans DSM 20731 RepID=D2RLV8_ACIFE Length = 477 Score = 96.3 bits (238), Expect = 8e-19, Method: Composition-based stats. Identities = 33/222 (14%), Positives = 65/222 (29%), Gaps = 32/222 (14%) Query: 61 WQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIE-NGQQKVALNLASG 119 + +G+ + + + ++ + +G G + G + +G Sbjct: 259 VTRPDGKTFAIGG--VDRMRLANELILFNDGYDDTTDTNAYGTEVRLAGGVVREIRKGAG 316 Query: 120 E-----GNFFIRPGGVFYV------AGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVI 168 G + G GD V + + + +GP L+ +G + Sbjct: 317 SMALTPGTTVLSGNGAAAAFLNGLRTGDPVKVTQTLGNAAADSAPSVGSAGPQLVRDGRV 376 Query: 169 NPRIHPN------VASSKIRNGVGINKHGNAVFLLSQQA------TNFYDFACYAKAKLN 216 R GVGI K G + +++ +F Y +L Sbjct: 377 QVTSEEEEIADDIALGRAPRTGVGIKKDGTVLVVVADGRSDDSVGMTLTEFGRYF-VQLG 435 Query: 217 VEQLLYLDGTISHMYMKGGAIPW-----QRYPFVTMISVERK 253 ++ + DG S + G I P + V RK Sbjct: 436 ADRAMNFDGGGSSEMVVNGKIMNDPSDGTERPVRVALGVFRK 477 >UniRef50_D2AUR4 Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase-like protein n=1 Tax=Streptosporangium roseum DSM 43021 RepID=D2AUR4_STRRD Length = 487 Score = 95.9 bits (237), Expect = 1e-18, Method: Composition-based stats. Identities = 23/187 (12%), Positives = 50/187 (26%), Gaps = 17/187 (9%) Query: 72 LHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQ-QKVALNLASGEGNFFIRPGGV 130 + G ++ ++ G + G + + + V Sbjct: 289 TEEFGTKTAADGGAEIVVDAQGRIVKARAAGGVVPRGTYVLHGTGIMATWLLEHAQETSV 348 Query: 131 FYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVAS-------SKIRN 183 + KV +R + + G L+ NG + + + R Sbjct: 349 MKLD-TKVIDLRTERAVPLTPETHIMGGGVGLLRNGRVRISAKADGHASVVMMLRRHPRT 407 Query: 184 GVGINKHGNAVFLLSQQAT-------NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGA 236 VG+ K G + + + A + L +Q + DG S + G Sbjct: 408 MVGVTKSGGLILATVDGRNPGVTVGASMVEAAQLMRW-LGAKQAINFDGGGSTAMVVGHK 466 Query: 237 IPWQRYP 243 + + Sbjct: 467 VINRPSD 473 >UniRef50_UPI00019088BB hypothetical protein RetlC8_25680 n=2 Tax=Rhizobium etli RepID=UPI00019088BB Length = 332 Score = 95.5 bits (236), Expect = 1e-18, Method: Composition-based stats. Identities = 29/227 (12%), Positives = 63/227 (27%), Gaps = 26/227 (11%) Query: 13 TLNLKRIFLALTLLPLFAVAADDCALS----------DPTLTVQAYTVNPQTERVKMYWQ 62 + + + + ++P R ++ Sbjct: 67 MQLALKSPVPAVQPGSLVWQEPEIGFEVAELPVLADGREVDRIFLSRIDPMRFRFVVHNA 126 Query: 63 KANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGN 122 + + + + +NG YD P +I G + G Sbjct: 127 SQGDK---GIDEWEHALPK---AVLIVNGSYYDMHGRPDTPFISEGVAMGPRQYDAKAGA 180 Query: 123 FFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLM-ENGVINPRIHPNVASSKI 181 FF + D A+ S P+L+ ++G + + Sbjct: 181 FFADAASADIRD-----LTHQDWGSALAGATNAMVSYPLLIGDDGQTHVNVK--SRWLAN 233 Query: 182 RNGVGINKHGNAVFLLSQQA-TNFYDFACYAK-AKLNVEQLLYLDGT 226 R V + G + +++A + A + K + L+++ L LDG Sbjct: 234 RTFVAKDGSGRILIGTTKEAFFSLDRLAEFLKASPLDLKVALNLDGG 280 >UniRef50_UPI0001C43112 hypothetical protein BpOF4_05820 n=1 Tax=Bacillus pseudofirmus OF4 RepID=UPI0001C43112 Length = 762 Score = 95.5 bits (236), Expect = 1e-18, Method: Composition-based stats. Identities = 34/213 (15%), Positives = 61/213 (28%), Gaps = 28/213 (13%) Query: 58 KMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLA 117 + + EA + DI + A N D+ + G+ N Sbjct: 197 AVLYTSGYKEATTGASQWVTDIVVTNTNKSAANFSFGDKITGTVSEIRRLGE---GANAT 253 Query: 118 SGEGNFFIRPGGVFYVAGDKVGIVRLDAF------KTSKEIQFAVQSGPMLMENGVI--- 168 + F I G + V + ++ +F + +GP L+ NG Sbjct: 254 IPKDGFVISANGGPFRDALTGVSVGDELTVEASINDAWRDAEFILATGPTLVRNGQTSIS 313 Query: 169 NPRIHPNVASSKIRNGVGINKHG-NAVFLLSQQAT-------NFYDFACYAKAKLNVEQL 220 P R VG + G + + A Y ++ + Sbjct: 314 MSTSSPFARERAPRTAVGASSDGTKLFLVTIDGRQSGYSNGVTIPELAAYMRS-IGAHNA 372 Query: 221 LYLDGTISHMYMKGGAIPWQRYPFVTMISVERK 253 + LDG S + YP+ +SV + Sbjct: 373 INLDGGGSTTMVAR-------YPWADHVSVVNR 398 >UniRef50_UPI00017896CA metallophosphoesterase n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI00017896CA Length = 2050 Score = 95.2 bits (235), Expect = 2e-18, Method: Composition-based stats. Identities = 32/201 (15%), Positives = 65/201 (32%), Gaps = 34/201 (16%) Query: 72 LHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVF 131 L + + S + M + + D+ +P+G GQ ++ + + ++ G Sbjct: 230 LDIVSGRVASGETLTMKVVSVLKDQGNSPIG----QGQVVLSASGSQRSKLAGLKAG--- 282 Query: 132 YVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHG 191 G + ++ ++ A+ ML+++GV+ P V R VG G Sbjct: 283 --DEVTAGFQLDNEWQ---DVTMAIGGTVMLVKDGVVQQHTDPAV---HPRTVVGTKADG 334 Query: 192 NAVFLLSQQAT-------NFYDFACYAKAKLNVEQLLYLDGTISHMYM-------KGGAI 237 + V N+ + +L V L LDG S ++ + + Sbjct: 335 SVVLFEVDGRQPGFSEGLNYIELGE-MLQELGVVNALNLDGGGSATFVARLPGETERKVL 393 Query: 238 PWQRYP----FVTMISVERKG 254 I + K Sbjct: 394 NSPSDGGERKTANGILLVNKA 414 Score = 63.6 bits (153), Expect = 6e-09, Method: Composition-based stats. Identities = 22/116 (18%), Positives = 41/116 (35%), Gaps = 6/116 (5%) Query: 27 PLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAW---GTLHALLADINSQG 83 P + V +P +++ +G+ + G + Sbjct: 70 PGATYTWANMQKGSGEQKVHMVEFDPSQGNLELQPGLTDGKVYGMQGVSKMASDADKAGN 129 Query: 84 QVQMAMNGGIYDE-SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKV 138 +V A+NG YD + PLGL++ +G+ + SG F I+ G K+ Sbjct: 130 RVIAAVNGDFYDMSTGIPLGLFMGDGELL--TDPPSGRNAFGIKQDGTSLYGSPKL 183 >UniRef50_A9V9Y5 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9V9Y5_MONBE Length = 298 Score = 95.2 bits (235), Expect = 2e-18, Method: Composition-based stats. Identities = 32/196 (16%), Positives = 61/196 (31%), Gaps = 27/196 (13%) Query: 66 GEAWGTLHALLADINSQGQVQMAMNGGIYDESYAP---LGLYIENGQQKVALNLASGEGN 122 G H +++ + A N G + + P G I +G + + N Sbjct: 88 GPNGCEHHRTVSEQAKLLTCEYATNAGFF--DFTPPACEGNLITDGVSIQ--HPCPNQVN 143 Query: 123 FFIRPGGVFYVA---GDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVA-- 177 F R G+ GD++ I + ++ + L+ +G Sbjct: 144 FG-RKLGMTCPDSTQGDRIVIGYMQEADI-ADLTELITGRGWLIRHGQAYTNQSREFTPT 201 Query: 178 -----SSKIRNGVGINKHGNAVFLLSQQ------ATNFYDFACYAKAKLNVEQLLYLDGT 226 R +G+ K G + L+ + ++ A +L+V Q + LDG Sbjct: 202 DSFVSEKAPRTALGLTKDGAILSLVVDGIEEELVGPDLHEMASLLL-ELDVVQAINLDGG 260 Query: 227 ISHM-YMKGGAIPWQR 241 S +G Sbjct: 261 GSSTAVYQGHVFNMPH 276 >UniRef50_UPI0001C3370C hypothetical protein UCYN_10670 n=1 Tax=cyanobacterium UCYN-A RepID=UPI0001C3370C Length = 438 Score = 95.2 bits (235), Expect = 2e-18, Method: Composition-based stats. Identities = 23/129 (17%), Positives = 42/129 (32%), Gaps = 12/129 (9%) Query: 131 FYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPN------VASSKIRNG 184 + G + I K ++ + GP+L+ +G I+ + R+ Sbjct: 301 LFFIGSTLKIESKTVPKKFNQLSHILGGGPLLINDGSISLNVKDEKFTKSFQKQKASRSA 360 Query: 185 VGINKHGNAVFLLSQQ-----ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPW 239 +GI + + N + A + KL L LDG S + GG + Sbjct: 361 IGITNKDKTILVTVHNSINSNGVNLNEMAQIMQ-KLGSINALNLDGGGSTSLVLGGRLID 419 Query: 240 QRYPFVTMI 248 + I Sbjct: 420 RFPVTAAKI 428 Score = 59.3 bits (142), Expect = 1e-07, Method: Composition-based stats. Identities = 21/142 (14%), Positives = 45/142 (31%), Gaps = 8/142 (5%) Query: 23 LTLLPLFAVAADDCALSDPT----LTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLAD 78 + +P + + V ++ ++ +V + + + L + Sbjct: 97 IVWVPGVIWRQQFITVKNKKGYNIFPVNLLEIDNKSSKVILRPIT-SNLNGQIGTSSLEE 155 Query: 79 INSQGQVQMAMNGGIYDESYA-PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDK 137 I + +V A+NGG ++ + PLG N + L G G G F++ Sbjct: 156 IAKKWRVVAAINGGFFNRNNRLPLGAIRHNNDWLSSPIL--GRGAVGWNENGKFFIDHLS 213 Query: 138 VGIVRLDAFKTSKEIQFAVQSG 159 + + IQ Sbjct: 214 LKEFLILNNGERISIQSLNSGY 235 >UniRef50_P74396 Slr0280 protein n=1 Tax=Synechocystis sp. PCC 6803 RepID=P74396_SYNY3 Length = 610 Score = 95.2 bits (235), Expect = 2e-18, Method: Composition-based stats. Identities = 40/240 (16%), Positives = 77/240 (32%), Gaps = 43/240 (17%) Query: 52 PQTERVKMYWQKANGEAWG--TLHALLADINSQGQVQMAMNGGI-----------YDESY 98 P R + W +G +L ++ + Q +N G + SY Sbjct: 364 PILNRGAIAWNDQGQTTFGRLSLSEIITTGSGQRLTANYLNSGYVQRGIARYTPAWGPSY 423 Query: 99 APLG-----LYIENGQQKVALN-LASGEGNFFIRPGGVFYVAGDK--------VGIVRLD 144 PL ++N Q +G+ I G + VG Sbjct: 424 IPLSDNEQVYVVQNSQVTAQYPLPKAGQQQMPIPSDGYLIIDRGNQIPAGVLAVGTTLNV 483 Query: 145 AFKTSKEIQFA----VQSGPMLMENGVINPR------IHPNVASSKIRNGVGINKHGNAV 194 +++ E A + +GP+L++ G + R+ + ++++GN + Sbjct: 484 NGRSTPEAFNAFPNGMGAGPLLIDQGRMVLNATGEGFSSAFQQQRASRSAIAVDRNGNII 543 Query: 195 FLLSQQAT-----NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMIS 249 + S + +FA + +L L LDG S GG + + +S Sbjct: 544 LVASHNRVGGAGASLGEFAQILQ-QLGAVNALNLDGGSSTSLALGGQLLDRSPVTAARVS 602 Score = 48.2 bits (113), Expect = 3e-04, Method: Composition-based stats. Identities = 15/81 (18%), Positives = 28/81 (34%), Gaps = 2/81 (2%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQ 84 +S V T+NP++ + + AN L+ I + Sbjct: 275 WTEGITWQQRFVNISGGQFPVTTVTINPRSPGISLRPLMANPTMAQGTAPLV-TIARDQR 333 Query: 85 VQMAMNGGIYDESYA-PLGLY 104 +A+N G ++ + PLG Sbjct: 334 AAVAINAGFFNRNNQLPLGAV 354 >UniRef50_Q8DHU5 Tll1850 protein n=1 Tax=Thermosynechococcus elongatus BP-1 RepID=Q8DHU5_THEEB Length = 575 Score = 94.0 bits (232), Expect = 4e-18, Method: Composition-based stats. Identities = 28/181 (15%), Positives = 58/181 (32%), Gaps = 25/181 (13%) Query: 91 GGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYV------------AGDKV 138 G ++ + + + + N Q + + I G V G V Sbjct: 385 GSYQGKTGSEVVITVRNEQVVGQQPINKDQT-VPIPSEGFLLVARNFNSALANFPPGAAV 443 Query: 139 GIVRLDAFKTSKEIQFAVQSGPMLMENGVIN-----PRIHPNVA-SSKIRNGVGINKHGN 192 + I V +GP+L+E G + + + + R+ +G G+ Sbjct: 444 QLETTAVPAAFNRIPNIVGAGPLLVEQGRVVLNAALEQFGAGLDAQAAPRSAMGNRSDGS 503 Query: 193 AVFLLSQQA-----TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTM 247 V++ + ++A +L + + LDG S GG + + T Sbjct: 504 IVWVTTHNRIGGMGPTLAEWAQI-VHRLGLINAVNLDGGSSTALYLGGVLVDRHGVTTTR 562 Query: 248 I 248 + Sbjct: 563 V 563 Score = 60.9 bits (146), Expect = 4e-08, Method: Composition-based stats. Identities = 20/114 (17%), Positives = 33/114 (28%), Gaps = 4/114 (3%) Query: 23 LTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQ 82 + P L V +NPQ +++ + + L + + + Sbjct: 238 IQWAPGLRWQQQTVILGTRQFPVDLLIINPQQPGLRLRPLEISPTTLVGLATVPE-LAQR 296 Query: 83 GQVQMAMNGGIYDESYA-PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAG 135 Q A+NGG ++ PLG G L G G V Sbjct: 297 WQAAAAINGGFFNRDRQAPLGAIRREGNWLSGPILN--RGAIGWDDRGQIVVGR 348 >UniRef50_C6IEV9 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=C6IEV9_9BACE Length = 343 Score = 94.0 bits (232), Expect = 4e-18, Method: Composition-based stats. Identities = 33/261 (12%), Positives = 60/261 (22%), Gaps = 56/261 (21%) Query: 38 LSDPTLTVQAYTVNPQTERVKMY----WQKANGEAWGTLHALLADINSQGQVQMAMNGG- 92 L+ + + ++ + + G ++ + + +NGG Sbjct: 72 LAGKKAIAYIAVADMSKAKFEVLGDIAFSQEANGYGGKSIHTPSEFYESSKAPVVINGGL 131 Query: 93 --IYDESYAPLGLYIENGQQKVALNLA----------SGEGNFFIRPGGVFYV------- 133 Y L I GQ G F G F Sbjct: 132 FFYSAGFYYSQNLVIREGQLLAPNQNYYSKDWVTMWYPTLGAFCQMKDGTFQTTWTYQAS 191 Query: 134 -----------------AGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPN- 175 + + E + +L+ G I Sbjct: 192 DGINYCYPAPADNDINKDPLQAPSSTFPNGAKALEATTGIGGVTVLLRAGEIKNTYVEEM 251 Query: 176 ----VASSKIRNGVGINKHGNAVFLLSQQAT--------NFYDFACYAKAKLNVEQLLYL 223 AS++ R +GI + + + + + A K L + L L Sbjct: 252 LDISAASNQPRTAIGITTNKKMIIFVCEGRNMTEGVAGLTTANVAKVMKD-LGCTEALNL 310 Query: 224 DGTISH-MYMKGGAIPWQRYP 243 DG S M + G Sbjct: 311 DGGGSSCMLVNGKETIKGSDG 331 >UniRef50_C1A670 Putative uncharacterized protein n=1 Tax=Gemmatimonas aurantiaca T-27 RepID=C1A670_GEMAT Length = 426 Score = 94.0 bits (232), Expect = 5e-18, Method: Composition-based stats. Identities = 21/186 (11%), Positives = 48/186 (25%), Gaps = 28/186 (15%) Query: 89 MNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKT 148 + S G +G V + R V +V + + Sbjct: 245 LYAAPDTASSRSGGAIPRDGALLVGTGDRAAGVAAMSRFDTV------RVHLNTWPRLTS 298 Query: 149 SKEIQFAVQSGPMLMENGVINP--------RIHPNVASSKIRNGVGINKHGNA-VFLLSQ 199 + + + P+++++G I N + R + +++ G + Sbjct: 299 QRAPKAVIGGWPLVLQDGENVAARAATLEGTISRNAEARHPRTAIAVSRSGQTAWLVTVD 358 Query: 200 QA------TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQR------YPFVTM 247 + A + + L L DG S + G + Sbjct: 359 GRATNSVGMTLVELAEFLR-TLGAWHALNFDGGGSTTMVIDGRVVNVPTDAAGEREVGNA 417 Query: 248 ISVERK 253 + V + Sbjct: 418 LIVRER 423 >UniRef50_A7HB86 Putative uncharacterized protein n=4 Tax=Anaeromyxobacter RepID=A7HB86_ANADF Length = 287 Score = 93.6 bits (231), Expect = 5e-18, Method: Composition-based stats. Identities = 34/243 (13%), Positives = 77/243 (31%), Gaps = 21/243 (8%) Query: 18 RIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLA 77 R + LF + ++P +K+ A GE Sbjct: 44 RTLEPGLEMGLFDGPPAGEEAR----PIAVVRIDPARFELKLLNASAPGE---GTLRTAR 96 Query: 78 DINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDK 137 + A+N +Y E Y + ++ P Sbjct: 97 AWAERAGASAAINASMYQEDYRTSVSLMRTRHHVNQRRVSKDRSVLAFDP---LARGASP 153 Query: 138 VGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIR----NGVGINKHGNA 193 V I+ D +++ A Q+ L+++ + NV + R +G++ G Sbjct: 154 VRIIDRD----CDDLERAAQTYGTLVQSIRLVSCDRKNVWAPSARRFSAAAIGVDAKGRV 209 Query: 194 VFLLSQQATNFYDFACYAKA-KLNVEQLLYLDGT-ISHMYMKGGAI-PWQRYPFVTMISV 250 +F+ ++ ++ A + + Q +Y++G + ++++GG F + Sbjct: 210 LFIHARTPWPVHELVNALLALPIELRQAMYVEGGPEAQLFVRGGGRQHEWVGGFEHVPQA 269 Query: 251 ERK 253 E + Sbjct: 270 ENR 272 >UniRef50_D1VRM0 Putative copper amine oxidase N-domain family n=1 Tax=Peptoniphilus lacrimalis 315-B RepID=D1VRM0_9FIRM Length = 361 Score = 92.8 bits (229), Expect = 9e-18, Method: Composition-based stats. Identities = 29/155 (18%), Positives = 52/155 (33%), Gaps = 11/155 (7%) Query: 102 GLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPM 161 G I G+ V + + V + + ++ V +GPM Sbjct: 207 GYIIYFGKDSVDKSYIDQRFKLGRKVELVLVDSKGNETFKYNGQDISYSKVTELVAAGPM 266 Query: 162 LMENGVINPRIHPNV-------ASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAK 214 L++NG N +++ R+ +GI K+G + L + N A Sbjct: 267 LLQNGKNVVAESKNNYKEGKINSATGQRSAIGITKNGKVILLTA--VANVDKLALIMND- 323 Query: 215 LNVEQLLYLDGTISH-MYMKGGAIPWQRYPFVTMI 248 L + LDG S ++ G I T++ Sbjct: 324 LGCIDAMNLDGGASSALFANGKVIKNAGRNLNTVL 358 >UniRef50_B4WFN8 Putative uncharacterized protein n=1 Tax=Synechococcus sp. PCC 7335 RepID=B4WFN8_9SYNE Length = 309 Score = 92.8 bits (229), Expect = 9e-18, Method: Composition-based stats. Identities = 28/241 (11%), Positives = 58/241 (24%), Gaps = 34/241 (14%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQK---------------ANGEAW 69 L + + ++ + + + Sbjct: 41 LFEGITYSR-YIEQQPRPQLIHLLEIDLSASGIVPFVTPGISKTSPKADREVDIEATQPH 99 Query: 70 GTLHALLADINSQGQVQMAMNGGIYDESY--APLGLYIENGQQKVALNLASGEGNFFIRP 127 TL + ++Q+A+N ++ P G+ + LA +G Sbjct: 100 ETLAQKTSSFLKTHRLQLAVNANFFNPFNETTPWQYSPREGELTNLVGLAISDGQIVSPG 159 Query: 128 GG---VFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNG 184 + I + + + AV + +EN P + Sbjct: 160 DKNYPALCFLEGRAEIRDEGV--CAPDTKQAVAGLRLNLENR--PPPDVETIYKFYPVCV 215 Query: 185 VGINKHGN-AVFLLSQQATNFY-------DFACYAKAKLNVEQLLYLDGTISHMYMKGGA 236 ++ G LL Y + A + +A L + LDG S Sbjct: 216 AALDAEGTTLWLLLVDGKQPLYSEGMTRPEVADFLQA-LGATTAVQLDGGGSTTLAIASE 274 Query: 237 I 237 Sbjct: 275 R 275 >UniRef50_A8W171 Flagellar protein FliS n=1 Tax=Bacillus selenitireducens MLS10 RepID=A8W171_9BACI Length = 750 Score = 92.8 bits (229), Expect = 1e-17, Method: Composition-based stats. Identities = 28/196 (14%), Positives = 58/196 (29%), Gaps = 24/196 (12%) Query: 60 YWQKANGEAWGTLHALLADINSQGQVQMA--MNGGIYDE---SYAPLGLYIENGQQKVAL 114 E Q+ + G I + +G Sbjct: 194 RTATPTNEFGREFTVTDTSKRINDQLSFGDSVRGSITNMKEYGRRNASPIPGDGFIISGH 253 Query: 115 NLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHP 174 G+ +V + D K+ + + +GP+L++NG ++ + Sbjct: 254 GNRLDG-----LLDGIRAGDDIEVKV---DIEDRWKDAEMIMATGPLLVQNGRVDITMSS 305 Query: 175 NVAS---SKIRNGVGINKHGNAVFLLSQQA-------TNFYDFACYAKAKLNVEQLLYLD 224 + ++ R+G+GI+ GN +F+ FA Y + + + LD Sbjct: 306 SASTYSVPNPRSGIGIDAQGNTMFVTVDGRQSGYSQGMTIPQFANYMRDQ-GAVMAINLD 364 Query: 225 GTISHMYMKGGAIPWQ 240 G S + + Sbjct: 365 GGGSTTMVARDFSRDR 380 >UniRef50_A4J956 Copper amine oxidase domain protein n=1 Tax=Desulfotomaculum reducens MI-1 RepID=A4J956_DESRM Length = 480 Score = 92.1 bits (227), Expect = 2e-17, Method: Composition-based stats. Identities = 26/145 (17%), Positives = 47/145 (32%), Gaps = 13/145 (8%) Query: 111 KVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENG---- 166 A + G + VA V + + + I+ + PML+E G Sbjct: 200 VKAPAGGYVLAGWGSSAGQLVGVAEGTKARVITEMPEDWQNIRHVLTGSPMLVEGGLPVD 259 Query: 167 -VINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQAT------NFYDFACYAKAKLNVEQ 219 +N + +V R +G+ G + ++ + A L Q Sbjct: 260 QAVNEGLWGSVLKYSPRTALGVTAQGKVLLVVVDGRQESSAGLTLEEMAYLMID-LGAVQ 318 Query: 220 LLYLDGTISH-MYMKGGAIPWQRYP 243 + LDG S M++KG + Sbjct: 319 AVGLDGGGSSEMWVKGKIVNNPSDK 343 Score = 46.2 bits (108), Expect = 0.001, Method: Composition-based stats. Identities = 17/104 (16%), Positives = 33/104 (31%), Gaps = 5/104 (4%) Query: 39 SDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDES- 97 + V+P + ++ G L+ + + A+NGG +D Sbjct: 43 EGKPIKGHILEVDPGVKYTEIRPVM--GNEVFGQRENLSKMAQRTGAIAAVNGGFFDMGS 100 Query: 98 YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIV 141 PLG I +G+ + +F + G + I Sbjct: 101 GVPLGNLIIDGK--PEYISDILKTSFGFKTSGGLKLGYLAPKIT 142 >UniRef50_Q3AA51 Conserved domain protein n=1 Tax=Carboxydothermus hydrogenoformans Z-2901 RepID=Q3AA51_CARHZ Length = 356 Score = 92.1 bits (227), Expect = 2e-17, Method: Composition-based stats. Identities = 22/155 (14%), Positives = 46/155 (29%), Gaps = 10/155 (6%) Query: 105 IENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLME 164 I G+ + P + + +++GP L++ Sbjct: 202 IVYGKTSIPPQGFVLNTGSLCPPDNLLNSNVTLKIEPENQENVLWSKAYAVLEAGPYLVK 261 Query: 165 NGVINPRIHPNV-------ASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNV 217 G I S R+G+G+ K+ + + ++A + KL Sbjct: 262 EGKIIADPLKENFTHYKIKDGSFARSGIGVTKNKKLLLVTV-NRATIKEWAIIMQ-KLGA 319 Query: 218 EQLLYLDGTISH-MYMKGGAIPWQRYPFVTMISVE 251 + LDG S +Y+ G + + + Sbjct: 320 YYAMNLDGGASSGLYVNGKYLTKPGRLLSNALVIS 354 Score = 43.9 bits (102), Expect = 0.005, Method: Composition-based stats. Identities = 21/109 (19%), Positives = 39/109 (35%), Gaps = 7/109 (6%) Query: 36 CALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGI-- 93 +++ TV+ V+ T+++KM A + L + + + + +NG Sbjct: 36 LKINNKNFTVKGVIVDLNTKKLKMQTVLAKNQIGQ--VESLESMVKRKKGLIGINGAFFS 93 Query: 94 -YDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIV 141 YD P G + +G+ N + I PG K I Sbjct: 94 AYDAYKEPYGNLMIDGRLIRKGNG--ERCSVGILPGNEVIFGYVKWDIS 140 >UniRef50_D2J8B1 Putative uncharacterized protein n=1 Tax=Staphylococcus aureus RepID=D2J8B1_STAAU Length = 569 Score = 91.7 bits (226), Expect = 2e-17, Method: Composition-based stats. Identities = 21/238 (8%), Positives = 55/238 (23%), Gaps = 31/238 (13%) Query: 29 FAVAADDCALSDPTL--TVQAYTV---NPQTERVKMYWQKANGEAWGTLHALLADINSQG 83 A ++ + T + + + +K+ + +G+ H + + Sbjct: 132 SAYYSEITTVKGRNFETTYYITHIPHKDKEGNLIKIK-RGISGDINKPDHITPREFAKRT 190 Query: 84 QVQMAMNGG-IYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVR 142 N P G+ I NG+ ++ + + D Sbjct: 191 GATFVSNASTGSGTQLLPHGVQIYNGKIIKSVKDYDALEQRWS-----LAIGEDNTLRTY 245 Query: 143 LDAFKTSK----EIQFAVQSGPMLMENGVINPR---IHPNVASSKIRNGVGINKHGNAVF 195 + +++ I + PN R+ + + + +F Sbjct: 246 APNVNAETLLAQGETNVLSGFGAFIQDNKITVKPGDFSPNTDVKHPRSVIAQLPNKDIIF 305 Query: 196 LLSQQA-----------TNFYDFACYAKAKLN-VEQLLYLDGTISHMYMKGGAIPWQR 241 + + LDG S ++ + Sbjct: 306 FACDGRENNNKGFVEKGMTLQEVGETLFKHYGEITLAYNLDGGGSTAHVLRSTKLNKS 363 >UniRef50_D2ASL7 Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase-like protein n=2 Tax=Actinomycetales RepID=D2ASL7_STRRD Length = 1138 Score = 91.7 bits (226), Expect = 2e-17, Method: Composition-based stats. Identities = 37/251 (14%), Positives = 63/251 (25%), Gaps = 42/251 (16%) Query: 8 GKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGE 67 G G+ +L + +A + A + G Sbjct: 144 GIGIHNGDLVQAPVAGHNNAVAVTADGVGRVLQMHFDGT---------------ATPAGG 188 Query: 68 AWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGL-----YIENGQQKVALNLASGEGN 122 + TL I G G Y A G + G + ++G G Sbjct: 189 SPITLTQFNQLIQGNGVGLFTPLWGSYGRGRAVEGAAAVTEVVLEGGVVTEVRTSAGSGP 248 Query: 123 FFIRPGGVF-----------YVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPR 171 + GD+V + ++ AV +L+++GV Sbjct: 249 IPAGTAILLGRDAGASALAALKPGDRVEVRYQPKPSEGGAVKAAVGGSQILVKDGVAQ-- 306 Query: 172 IHPNVASSKIRNGVGINKHGN-AVFLLSQQATN------FYDFACYAKAKLNVEQLLYLD 224 ++ R VG + G L + A+L L LD Sbjct: 307 -TSADNTAHPRTAVGFSADGRKMYLLTVDGRQTDSRGVTLTELGA-MMAELGAHDALNLD 364 Query: 225 GTISHMYMKGG 235 G S + Sbjct: 365 GGGSSTMLARE 375 >UniRef50_C7QHR1 Putative uncharacterized protein n=1 Tax=Catenulispora acidiphila DSM 44928 RepID=C7QHR1_CATAD Length = 636 Score = 91.7 bits (226), Expect = 2e-17, Method: Composition-based stats. Identities = 29/227 (12%), Positives = 64/227 (28%), Gaps = 23/227 (10%) Query: 8 GKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGE 67 G+G + + + + ++ + ++ Sbjct: 374 GEGTWSPAVVVNGVPVIQTAKLRSDPQHLE-----YLSAVAWMDQKHASFVLHPGSQQPG 428 Query: 68 AWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRP 127 G + + NG G + NG+ L F+ Sbjct: 429 TAGYNQTDHLSGDQFKNLIATWNGAFLLNPNDAHGGFYLNGKTYGTLVPGQASEVFY--K 486 Query: 128 GGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASS-------- 179 G V G + + + Q+ +L++NG +NP + + Sbjct: 487 DGTMNVGSWNSG----PGLQMAPNVVGVRQNLQLLVDNGQVNPSVDSDDKKLWGVTVKNA 542 Query: 180 --KIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLD 224 R+G+G+ GN V+ + A + A + + + + LD Sbjct: 543 YFVWRSGIGVTADGNLVYAM-GPALSVRTLAELLQ-RAGAVRGMELD 587 >UniRef50_D1R528 Putative uncharacterized protein n=1 Tax=Parachlamydia acanthamoebae str. Hall's coccus RepID=D1R528_9CHLA Length = 380 Score = 91.3 bits (225), Expect = 3e-17, Method: Composition-based stats. Identities = 23/114 (20%), Positives = 37/114 (32%), Gaps = 17/114 (14%) Query: 147 KTSKEIQFAVQSGPMLMENGVINPRIHPNVASSK------IRNGVGINKHGNAVFLLSQQ 200 + +I V P+L+ G + S R VGI ++GN +F++ Sbjct: 251 EEWSDIVHIVGGTPILVRGGRLVTDFSAEQTGSHFLNVRLARTAVGILENGNWLFVVVDG 310 Query: 201 ---------ATNFYDFACYAKAKLNVEQLLYLDGTI-SHMYMKGGAIPWQRYPF 244 D A + KL + L L G S M +K + Sbjct: 311 FYKNIWNTKGITIPDLAELMQ-KLGCVEALNLCGGKCSTMVLKNVVVNDPPDGT 363 Score = 43.2 bits (100), Expect = 0.010, Method: Composition-based stats. Identities = 16/123 (13%), Positives = 37/123 (30%), Gaps = 14/123 (11%) Query: 14 LNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLH 73 L + R +A+ L + + + V VNP ++ Sbjct: 28 LAIVRTAIAVELPEGISYSHIFLSDQTS---VHVLEVNP-------HFFDIIPVKNKGDV 77 Query: 74 ALLADINSQGQVQMAMNGGIYDESYA----PLGLYIENGQQKVALNLASGEGNFFIRPGG 129 ++ + + + A+NGG + P+G+ + + G + Sbjct: 78 EAVSSMAKRHKAIAAVNGGFFKMKGEFADLPMGILKIDNHWYGTPHKPRGAIGWSHADEK 137 Query: 130 VFY 132 V + Sbjct: 138 VLF 140 >UniRef50_B9XE16 Putative uncharacterized protein n=1 Tax=bacterium Ellin514 RepID=B9XE16_9BACT Length = 398 Score = 91.3 bits (225), Expect = 3e-17, Method: Composition-based stats. Identities = 18/143 (12%), Positives = 44/143 (30%), Gaps = 22/143 (15%) Query: 109 QQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVI 168 + ++++ I+PG + + + + + AV P+L+ +G Sbjct: 232 KMIISIDPKLASRFAGIQPGTILHFSTGTSRDIA--------KADTAVGGRPLLLVHGKE 283 Query: 169 NPRIHPNVAS-----SKIRNGVGINKHGNAVFLLSQQA-------TNFYDFACYAKAKLN 216 + R +G N ++ + + A + + L Sbjct: 284 LETSKQKGNNAATIVRHPRTALGWNA-RYFFLVVVDGRQKELSMGMSSQELAHFM-STLG 341 Query: 217 VEQLLYLDGTISHMYMKGGAIPW 239 + + LDG S + G + Sbjct: 342 CTEAMNLDGGGSTTFWLDGKVVN 364 Score = 41.6 bits (96), Expect = 0.022, Method: Composition-based stats. Identities = 29/166 (17%), Positives = 63/166 (37%), Gaps = 16/166 (9%) Query: 11 MITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWG 70 +++++ + + + + ++ ++ + + + + A G+ G Sbjct: 19 LVSIHARAELTPIFSSLVPGLDYAHITETNHPWSIHVARLERSHKELDLVSTLAQGKIVG 78 Query: 71 --TLHALLADI-NSQGQVQMAMNGGIYDES-----YAPLGLYIENGQQKVALNLASGE-- 120 ++ + G+ +A+NG + + PLGL I NG+ A N AS Sbjct: 79 LSSVANQVKTFPAGSGKPLVAVNGDFFVIAKGPYQGDPLGLQILNGELVSAPNGASFWKD 138 Query: 121 --GNFFIRPG----GVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGP 160 GN F+ + G+K+ + +TSK + F GP Sbjct: 139 AEGNLFLDNVQSKFSILLPKGEKIPFGLNEQRQTSKAVLFTPAFGP 184 >UniRef50_UPI0001C30FBA N-acetylglucosamine-1-phosphodiester alpha-N- acetylglucosaminidase-like exopolysaccharide biosynthesis protein n=2 Tax=Conexibacter woesei DSM 14684 RepID=UPI0001C30FBA Length = 249 Score = 91.3 bits (225), Expect = 3e-17, Method: Composition-based stats. Identities = 25/196 (12%), Positives = 55/196 (28%), Gaps = 32/196 (16%) Query: 81 SQGQVQMAMNGGIYDESYAPLGLYIENGQ-QKVALNLASGEGNFFIRPGGVFYVAGDKVG 139 + + + G + + PLG G A G V D Sbjct: 51 ENDRPEAIVAGFFVRDPHLPLGEVRVGGVPVVHEPVAAPWAGRRA-------CVHVDGEI 103 Query: 140 IVRLDAFKTSKEIQFAVQSGPMLMENGVINP--------------RIHPNVAS-SKIRNG 184 + VQ+GP+L+ +G + ++ + R Sbjct: 104 RIAPREELADVGGGDLVQAGPLLVRDGTAAIVDGEDREGFSAGASQFDSDITAERHPRCA 163 Query: 185 VGINKHGNAVFLLSQQATN-------FYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAI 237 +G+++ + + + + A + + + LDG S + G + Sbjct: 164 LGVSED-ELLAVCCDGRRSGVDAGLDLAELARLMVS-FGAREAINLDGGGSATLVHRGHL 221 Query: 238 PWQRYPFVTMISVERK 253 + Y + E + Sbjct: 222 LNRPYADRDQPAPESR 237 >UniRef50_A9NEV6 Hypothetical surface-anchored protein n=1 Tax=Acholeplasma laidlawii PG-8A RepID=A9NEV6_ACHLI Length = 520 Score = 91.3 bits (225), Expect = 3e-17, Method: Composition-based stats. Identities = 14/103 (13%), Positives = 31/103 (30%), Gaps = 15/103 (14%) Query: 147 KTSKEIQFAVQSGPMLMENGVINPRIHPNVAS------SKIRNGVGINKHGNAVFLLSQQ 200 + ++ A+ +G +L+++G + ++ S R +G G F++ Sbjct: 278 NGFENVRNAIGTGQLLVKDGAVQHAAFKSLPSNNMAHFRHPRTAIGQKADGTVFFIVVDG 337 Query: 201 ATNF--------YDFACYAKAKLNVEQLLYLDGTISHMYMKGG 235 + K LDG S + Sbjct: 338 RDALSGKYGVKYSELGELMKMH-GAVTAFNLDGGGSSTMLLRN 379 >UniRef50_A9BJK8 Putative uncharacterized protein n=1 Tax=Petrotoga mobilis SJ95 RepID=A9BJK8_PETMO Length = 561 Score = 90.9 bits (224), Expect = 4e-17, Method: Composition-based stats. Identities = 33/236 (13%), Positives = 64/236 (27%), Gaps = 29/236 (12%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQ-TERVKMYWQKANGEAWGTLHALLADINSQG 83 LL L + + + L + + Q ++W K+ W L Sbjct: 313 LLHLSSYSRPALIIGTNFLDIDYIKLEYQLNIDNLLFWIKSINSTWKGDVKLYTHHYKGN 372 Query: 84 QVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVF------YVAGDK 137 + N + EN + EG I + G K Sbjct: 373 ITETEENYVFFLID--------ENNRIISKNKTTPSEGEKLILVDKKYEKYLENISLGTK 424 Query: 138 VGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPN--------VASSKIRNGVGINK 189 V + + + ++ GP+L+ + ++ + R V I+K Sbjct: 425 VDFTLNKSENLTNDPTLLLEGGPILIHSKYTQEQLDAEKKSYSNGIIYGKAPRTVVAIDK 484 Query: 190 HGNAVFLLSQQ------ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPW 239 N ++ + + + E + LDG S + G I Sbjct: 485 EQNINLMVIEGLDNPETGLTYDETRNLLFKIGEFEVAMMLDGGSSSIVYYEGEIQN 540 Score = 45.5 bits (106), Expect = 0.002, Method: Composition-based stats. Identities = 14/108 (12%), Positives = 34/108 (31%), Gaps = 7/108 (6%) Query: 28 LFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQM 87 L+ L + V+P+ ++K+ + GT + + + Sbjct: 236 GLQYIEKTEELNGQRLKIYQLIVDPKIYQIKV-----DLNNLGTRSDVYS-FLKEKNPIF 289 Query: 88 AMNGGIYDESY-APLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVA 134 ++N +D P+G I +G + + + Y+ Sbjct: 290 SVNASFFDPQTLEPVGNIISDGALLHLSSYSRPALIIGTNFLDIDYIK 337 >UniRef50_D1B6I7 N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase-like exopolysaccharide biosynthesis protein n=1 Tax=Thermanaerovibrio acidaminovorans DSM 6589 RepID=D1B6I7_THEAS Length = 486 Score = 90.5 bits (223), Expect = 5e-17, Method: Composition-based stats. Identities = 40/225 (17%), Positives = 71/225 (31%), Gaps = 33/225 (14%) Query: 56 RVKMYWQKANGEAWGTLHALLA-----DINSQGQVQMAMNGGIYDESYAPL-GLYIENGQ 109 +++ + + ++ + GG Y L L +++G Sbjct: 268 DGSVFFGDGSASFGVSNGEWTLPIGDFNVPPKNGNLSIFYGGAYRPGNQALLSLSVKDGI 327 Query: 110 QKVALNLASGEGNFFIRPGGVFYVA------GDKVGIVRLDAFKTSKEIQFAVQSGPMLM 163 +F + G A GD + +VR AF + + +Q GPM++ Sbjct: 328 V----QDEPQGADFTLLANGRAAEALGSLNIGDTLQLVRRFAFPAFEACRLVIQGGPMIV 383 Query: 164 ENGVINPRIHPNVAS----SKIRNGVGINKHGNAVFLLSQQAT------NFYDFACYAKA 213 EN R S R VGI++ G VF++ + A A Sbjct: 384 ENRRYVNRSEGLSRSIRERRHPRTLVGIDEQG-LVFMVIDGRNGHSSGVTLEEAANLALE 442 Query: 214 KLNVEQLLYLDGTISHMYMKGGAIPW-----QRYPFVTMISVERK 253 + + L LDG S + G + P I + + Sbjct: 443 E-GLVAALNLDGGGSSQMIWRGVTVNIPSDGKERPLPYGIGLFPR 486 >UniRef50_B8I1Q9 Ig-like, group 2 n=3 Tax=Clostridium RepID=B8I1Q9_CLOCE Length = 952 Score = 89.8 bits (221), Expect = 8e-17, Method: Composition-based stats. Identities = 24/157 (15%), Positives = 45/157 (28%), Gaps = 27/157 (17%) Query: 103 LYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQF-------- 154 + +E+G N + + G + D F +++ Sbjct: 210 MVVEDG-IVKEFNENKPSMD--MPKNGFVVLGAGSHIQYLKDNFNVGDPVEYNITMNVDT 266 Query: 155 -----AVQSGPMLMENGVINPRIHPNVAS---SKIRNGVGINKHGNAVFLL-SQQA---- 201 A+ G ML+++ + N S R +G +K G + + Sbjct: 267 NNMKMALTGGAMLVKDDKVLTSFSHNPVSPSTRASRTAIGTSKDGKTLIVAAVDGRSSAS 326 Query: 202 --TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGA 236 + A Y +L L LDG S + Sbjct: 327 IGMTQSELASYM-HELGCANALNLDGGGSTTLVARKQ 362 >UniRef50_B0JGJ2 Putative uncharacterized protein n=2 Tax=Microcystis aeruginosa RepID=B0JGJ2_MICAN Length = 607 Score = 89.8 bits (221), Expect = 8e-17, Method: Composition-based stats. Identities = 25/181 (13%), Positives = 55/181 (30%), Gaps = 26/181 (14%) Query: 93 IYDESYAP-----LGLYIENGQQKVALNLASGEGNFFIRPGGVFYVA---------GDKV 138 + +Y P GL ++ + LN + + I G + G++V Sbjct: 416 DWGSNYHPLTERETGLVVQGDRVTEKLNNLFPQDSIKIPENGYLVICRKTDISLNIGERV 475 Query: 139 GIVRLDAFKTSKEIQFAVQSGPMLMENGVINPR------IHPNVASSKIRNGVGINKHGN 192 + + + +GP+L++NG I R+ + +++ G Sbjct: 476 NLDSVTLPGDFANYPQILGAGPLLLQNGRIVLDGNAEKFSPAFQNQQASRSAIAVSREGK 535 Query: 193 AVFLLSQQAT-----NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTM 247 + + + A + + L LDG S G + + Sbjct: 536 ILLVAIHNRVGGRGATLGELARILLL-MAAKDGLNLDGGSSTGIALAGYLLDRSAVTAAK 594 Query: 248 I 248 + Sbjct: 595 V 595 Score = 64.3 bits (155), Expect = 3e-09, Method: Composition-based stats. Identities = 20/120 (16%), Positives = 39/120 (32%), Gaps = 4/120 (3%) Query: 23 LTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQ 82 + P L V ++P+ ++ + AN + + L+ INS+ Sbjct: 271 IVWQPGLIWNQKYIQLDQDWFPVTWLEIDPRNPQITIKPITANSTSMRGTNPLI-TINSE 329 Query: 83 GQVQMAMNGGIYDESYA-PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIV 141 +NGG ++ + PLG +G+ L G G + + Sbjct: 330 SNAVAMINGGFFNRNNQLPLGAIRVDGKWLSGPILN--RGAIAWDNRGKIRIDRLSLEET 387 >UniRef50_B7DMS1 Copper amine oxidase domain protein n=3 Tax=Alicyclobacillus acidocaldarius RepID=B7DMS1_9BACL Length = 354 Score = 89.4 bits (220), Expect = 9e-17, Method: Composition-based stats. Identities = 40/164 (24%), Positives = 69/164 (42%), Gaps = 20/164 (12%) Query: 100 PLGLYIE--NGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQ 157 P G IE G+ K + G+ I V + +K A+ Sbjct: 201 PDGYDIEIGAGEAKTPIVTRVHVGDPAILTDTVLALPSEKPV--------PFAAYPNAIG 252 Query: 158 SGPMLMENGVINP-------RIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACY 210 +GPML++NG I+ + + +R+ VGI++ G+ +FL +A N + A Sbjct: 253 AGPMLVQNGRIDVEPSLEGLDEPDILNAETLRSVVGIDRAGHLIFLTIHEA-NVWQEASI 311 Query: 211 AKAKLNVEQLLYLDGTIS-HMYMKGGAIPWQRYPFVTMISVERK 253 AKA L + + LDG S ++ +G + + T I V ++ Sbjct: 312 AKA-LGLWDAMNLDGGSSVGLWYEGRYLTPPKRALATAIVVVQR 354 >UniRef50_B7IEY1 Putative uncharacterized protein n=1 Tax=Thermosipho africanus TCF52B RepID=B7IEY1_THEAB Length = 535 Score = 89.4 bits (220), Expect = 1e-16, Method: Composition-based stats. Identities = 28/166 (16%), Positives = 46/166 (27%), Gaps = 19/166 (11%) Query: 103 LYIENGQQKVALNLASGEGNFFI-----RPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQ 157 I N Q + + + + + I+ A+ Sbjct: 366 FVISNNQIISKEYVEKVPKDSMVLLITKKYDKYLKNIEVGSKVNLTINSDFPFPIKHAIG 425 Query: 158 SGPMLMENGVINPRIHPN--------VASSKIRNGVGINKHGNAVFLLSQQ-----ATNF 204 +GP+L+ENG S R + I K G F++ + N Sbjct: 426 AGPLLIENGKKLIDSDEEKLRYGNGLALSKTSRTIIAITKEGKVDFIVIEGYNDSPGMN- 484 Query: 205 YDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISV 250 YD A + + LDG S + + Q I V Sbjct: 485 YDIATEFLLEKGYFYAMMLDGGGSSAMVIQDEVVNQDGTIQRGIPV 530 Score = 50.5 bits (119), Expect = 6e-05, Method: Composition-based stats. Identities = 12/91 (13%), Positives = 31/91 (34%), Gaps = 5/91 (5%) Query: 24 TLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQG 83 TL ++ + ++P+ +V++ ++ L ++ Sbjct: 213 TLKDGLEWERKIEVFNEKKYLINYLHIDPK--KVEILPIISSNGI--GTRQDLREMLKNN 268 Query: 84 QVQMAMNGGIYDESYA-PLGLYIENGQQKVA 113 +N +D S P+ L I++G+ Sbjct: 269 NCIAGINANYFDPSTNIPIDLVIKDGKLLSD 299 >UniRef50_A4CSS0 Putative uncharacterized protein n=1 Tax=Synechococcus sp. WH 7805 RepID=A4CSS0_SYNPV Length = 549 Score = 89.4 bits (220), Expect = 1e-16, Method: Composition-based stats. Identities = 25/175 (14%), Positives = 49/175 (28%), Gaps = 19/175 (10%) Query: 93 IYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEI 152 S L + I +G+ + + G VA + + + + + Sbjct: 368 YRSLSGEELAILIRDGRVTDQFSKTELARGVPLPEGASLVVARARAPLPAKPGDEVAIRL 427 Query: 153 ---------QFAVQSGPMLMENGVINPR------IHPNVASSKIRNGVGINKHGNAVF-- 195 + + GP+L++ G + R + + R VG + + Sbjct: 428 KVSSPVGERRQVMAGGPLLLKEGQVVLRGRQEGFSSGFLGQAAPRTVVGQDPKHRWMLTL 487 Query: 196 -LLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMIS 249 LS + A +L + L LDG S + I Sbjct: 488 EGLSGSDPTLLET-TLALQQLGLSDALNLDGGSSTTMLIANRTVMTGRGVPPRIQ 541 >UniRef50_B2S1G8 Hypothetical cytosolic protein n=2 Tax=Borrelia RepID=B2S1G8_BORHD Length = 262 Score = 89.4 bits (220), Expect = 1e-16, Method: Composition-based stats. Identities = 26/214 (12%), Positives = 55/214 (25%), Gaps = 29/214 (13%) Query: 32 AADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGE--AWGTLHALLADINSQGQVQMAM 89 + S + + + + + + + + +V +A+ Sbjct: 32 QYEIIKSSFKESNYVIVKIKNKNLKFIIPKPIYDQKMNNYYFKGQTTSQFLLSNKVDIAI 91 Query: 90 NGGIYDES---YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAF 146 N Y+ + P GLYI + + A G I+ Sbjct: 92 NTSPYEIKENMFYPNGLYIYDKKIISNAKKAQGIII------------IKNNQIILNPKQ 139 Query: 147 KTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHG-NAVFLLSQQA---- 201 K + L+ NG N R +G +K + + + Sbjct: 140 DEIKNSDYGFSGFFPLITNGNYTKNFKEN---KHPRTIIGTDKENKHLYLITVEGRGTNN 196 Query: 202 ---TNFYDFACYAKAKLNVEQLLYLDGTISHMYM 232 + + A + + LDG S + Sbjct: 197 SKGISLNE-AIDLSLNYAITNSINLDGGGSSTLV 229 >UniRef50_C7LNU2 Putative uncharacterized protein n=1 Tax=Desulfomicrobium baculatum DSM 4028 RepID=C7LNU2_DESBD Length = 276 Score = 89.4 bits (220), Expect = 1e-16, Method: Composition-based stats. Identities = 24/245 (9%), Positives = 70/245 (28%), Gaps = 30/245 (12%) Query: 9 KGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLT------------VQAYTVNPQTER 56 + + T + +A + + A L + + + ++ Sbjct: 7 RAVFTCLVLCAPVASLHAEEWRLLAPGLELREFLIPDQVGDLEGRQSGMAVLRIDSDRFD 66 Query: 57 VKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDES--YAPLGLYIENGQQKVAL 114 V + G ++ +N G++ G + + Sbjct: 67 VALGSALGTGR-MRSMQEW----ARHSGFVAVINAGMFRADDRMRSTGYMRDAAVMINSF 121 Query: 115 NLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHP 174 + +P + +R K+ + Q + +++N + R Sbjct: 122 IHPNYGAFLAFQP------RDPSLPALRWVDRKSDPDWQAVLADYDGIIQNYRLISRERE 175 Query: 175 NVASSKIR----NGVGINKHGNAVFLLSQQATNFYDFACYAKA-KLNVEQLLYLDGTISH 229 N+ R + +++ G +F+ + + ++FA L++ +Y++G Sbjct: 176 NLWEPSDRRHSGAAIAMDREGRLLFIHCRARLSLHEFAQALIDLPLDLIGAMYVEGGADA 235 Query: 230 MYMKG 234 Sbjct: 236 AMYVD 240 >UniRef50_B1XK15 Putative uncharacterized protein n=1 Tax=Synechococcus sp. PCC 7002 RepID=B1XK15_SYNP2 Length = 595 Score = 89.4 bits (220), Expect = 1e-16, Method: Composition-based stats. Identities = 34/264 (12%), Positives = 71/264 (26%), Gaps = 34/264 (12%) Query: 10 GMITLNLKRIFLALTLLPLFAVAADDC-ALSDPTLTVQAYTVNPQTERVKMYWQKANGEA 68 G++ N + A + N + V++ Sbjct: 332 GIVQGNRALRSGPILNRGAVAWDNAGRWEFDRLKVETDIVAGNGERVGVELINSG----- 386 Query: 69 WGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPG 128 + A L D + A++ I G + + +G+ ++ I G Sbjct: 387 YVKAGAALYDRAWGSRYTTAVDHEIVLTVMTSGG---RDQVIRQETAGKAGQNSYEIPQG 443 Query: 129 GVFYVAGD--------KVGIVRLDAFKTSKE----IQFAVQSGPMLMENGVINPR----- 171 G V VG+ + + + V GP+L++NG + Sbjct: 444 GYLLVFRSFRTGAAKFPVGVTLERRPRFTPNSFATLPNIVGGGPLLLKNGQVVLNGQAEQ 503 Query: 172 -IHPNVASSKIRNGVGINKHGNAVFLLSQQ------ATNFYDFACYAKAKLNVEQLLYLD 224 S R+ + + + + ++A + +L L LD Sbjct: 504 FSTAFNIQSASRSAIARTRDNKILLVTLHGAAEETAGATLNEWANILR-RLGATDALNLD 562 Query: 225 GTISHMYMKGGAIPWQRYPFVTMI 248 G S G + + + Sbjct: 563 GGGSSALALGANLSDRHPTTAGRV 586 Score = 60.1 bits (144), Expect = 7e-08, Method: Composition-based stats. Identities = 24/182 (13%), Positives = 51/182 (28%), Gaps = 12/182 (6%) Query: 22 ALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINS 81 ++ LP ++ A S + V ++P ++++ + L LL Sbjct: 252 SVQWLPGVQWRQENFAASSGPVRVTWLEIDPTQRQLQLKPITPDNNTIVGLAPLLIQ-AD 310 Query: 82 QGQVQMAMNGGIYDESYA-PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGI 140 Q A+N G ++ + PLG+ + I G Sbjct: 311 TNQAIAAINAGFFNRNNQYPLGIV----------QGNRALRSGPILNRGAVAWDNAGRWE 360 Query: 141 VRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQ 200 +T + G L+ +G + + R ++ + S Sbjct: 361 FDRLKVETDIVAGNGERVGVELINSGYVKAGAALYDRAWGSRYTTAVDHEIVLTVMTSGG 420 Query: 201 AT 202 Sbjct: 421 RD 422 >UniRef50_B5Y710 Copper amine oxidase N-domain family n=2 Tax=Coprothermobacter proteolyticus DSM 5265 RepID=B5Y710_COPPD Length = 485 Score = 89.0 bits (219), Expect = 1e-16, Method: Composition-based stats. Identities = 32/204 (15%), Positives = 58/204 (28%), Gaps = 10/204 (4%) Query: 58 KMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLA 117 + W + + + L +N G + NG + ++ +G Sbjct: 284 RTMWGENSIRVFTPLRGATTRLNVDGINVIVRNGEVVEQVTGTNVPIPPDGYVIHLGGTE 343 Query: 118 SGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVI-------NP 170 + F + Y V I + +GP L+ NG I Sbjct: 344 VRFKDRFEVGTRLSYRDIYDVRNSSNPEMWQEGVIWGTLSAGPRLITNGEITLDPASELL 403 Query: 171 RIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTIS-H 229 I R+ +GI ++ + + D A K L + LDG S Sbjct: 404 DIPKITGQPLTRSALGITQNNELLMVTV-SKCTIQDLATIMKD-LGAYNAMNLDGGASTS 461 Query: 230 MYMKGGAIPWQRYPFVTMISVERK 253 +Y G + + V + Sbjct: 462 LYANGKFLATPTRKISNALMVLPR 485 Score = 42.8 bits (99), Expect = 0.012, Method: Composition-based stats. Identities = 18/134 (13%), Positives = 39/134 (29%), Gaps = 7/134 (5%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQ 84 + A+S T TV V +K+ A + + + + Sbjct: 139 IETGVESYTTTVAVSTGTATVNVVKVFLNDPTIKLEIVNAQDQI--GVLEPFESMVKRKN 196 Query: 85 VQMAMNGGIYDES-----YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVG 139 A+NG + + P + NG+ + F +A K+ Sbjct: 197 PLAAINGTFFQVADTSLPMEPAANLVINGRIEHLGTPEKYASTFAFTQDNQVDIANVKMK 256 Query: 140 IVRLDAFKTSKEIQ 153 + + + E++ Sbjct: 257 LQGEYTYIPNPELE 270 >UniRef50_Q9L2D5 Putative secreted protein n=2 Tax=Streptomyces RepID=Q9L2D5_STRCO Length = 428 Score = 89.0 bits (219), Expect = 1e-16, Method: Composition-based stats. Identities = 22/204 (10%), Positives = 45/204 (22%), Gaps = 15/204 (7%) Query: 53 QTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKV 112 + A+ + + +++N P G G+ Sbjct: 219 SRFKAPTPVPDADARFTEDDDPGAEAVVAADGTVLSLNPNGRGGVTVPTG-----GRVLQ 273 Query: 113 ALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRI 172 + PG D + P L+ N Sbjct: 274 GTGTGADWLRAHATPGTDLAFEERLHDERFGDDIPLDSSVDVVNGHYP-LVHNAQY--AY 330 Query: 173 HPNVASSKIRNGVGINKHGNAVFLLSQQ-----ATNFYDFACYAKAKLNVEQLLYLDGTI 227 + R+ + ++ G +F+ + +FA L L +DG Sbjct: 331 TGQNTAVDPRSAIAVDGPGRTLFVTATGKSGRNGVTLDEFARILLD-LGAVDGLNMDGGG 389 Query: 228 SHMYMKGGAIPW-QRYPFVTMISV 250 S + A+ Sbjct: 390 STTLVVEQAVVNRPSDSTGERPVA 413 >UniRef50_A9QSN5 Exopolysaccharide biosynthesis protein n=4 Tax=Lactococcus lactis RepID=A9QSN5_LACLK Length = 303 Score = 88.2 bits (217), Expect = 2e-16, Method: Composition-based stats. Identities = 29/185 (15%), Positives = 63/185 (34%), Gaps = 13/185 (7%) Query: 59 MYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDES-YAPLGLYIENGQQKVALNLA 117 + + ++ +++ + + MN ++ + G I NG+ N Sbjct: 117 LKTATSADSPVVSMSEVISKYPNS----LIMNASGFNMTTGKITGFQINNGKLFKDWNSD 172 Query: 118 SGEGN-FFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNV 176 N F G + D + K + + G +L+++G P Sbjct: 173 KRATNAFVFNKNG----SSDIYNSTTPASEILKKGAEMSFSFGSILIKDGKSLPSDGTVN 228 Query: 177 ASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGA 236 + +G +K N ++S +T + + KL++E + +DG S G Sbjct: 229 WEIH--SFIGNDKDNNIYLIISDTSTGYQSIMEKFQ-KLHLENVQVMDGGGSSQMSLNGQ 285 Query: 237 IPWQR 241 I + Sbjct: 286 IIYPS 290 >UniRef50_A1SN25 Exopolysaccharide biosynthesis protein-like n=1 Tax=Nocardioides sp. JS614 RepID=A1SN25_NOCSJ Length = 420 Score = 88.2 bits (217), Expect = 2e-16, Method: Composition-based stats. Identities = 27/189 (14%), Positives = 55/189 (29%), Gaps = 39/189 (20%) Query: 98 YAPLGLYIENGQ--------QKVALNLASGEGNFFIRP-GGVFYVAGD-KVGIVRLDAFK 147 G + GQ + +P G+ ++ ++R Sbjct: 231 GRTAGYGVTQGQTERVRAVTVVNGRVRTNRAKLSHDQPIKGLLFIGRGEGAKVLRKLPKH 290 Query: 148 TSKEIQFAVQSGPM--------LMENGVINPRIHPNVASSKIRNGVGINKH-GNAVFLLS 198 T ++++++Q P L+ +G+I + R VG++ G + L+ Sbjct: 291 TRIKVRWSLQGRPQMAISGNNFLVHDGIIRAI---DDREMHPRTAVGVDSDTGEVLLLVV 347 Query: 199 QQA------TNFYDFACYAKAKLNVEQLLYLDGTISHMYM------KGGAIPWQRYPF-- 244 + A L ++ + LDG S + K + F Sbjct: 348 DGRQADSRGYTMVELANLMVD-LGADEAVNLDGGGSSTMVGKNRRGKVAVLNDPSDGFQR 406 Query: 245 --VTMISVE 251 I V Sbjct: 407 WVANAIEVT 415 Score = 41.2 bits (95), Expect = 0.032, Method: Composition-based stats. Identities = 20/113 (17%), Positives = 37/113 (32%), Gaps = 5/113 (4%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQ 84 + P + + T++P+T +++ + A + DI + Sbjct: 86 VAPGVKFTRWSQTDARGPIVAHLLTIDPKTPGLRIDYASMGA---VRRVAPVRDILAVDN 142 Query: 85 VQMAMNGGIYDES--YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAG 135 +NG YD APLGL + + + FFI G + Sbjct: 143 AVAGVNGDFYDIGHTGAPLGLGKDRQRGLLHAREDGWNKAFFINRHGRAGIGD 195 >UniRef50_A6LP25 Putative uncharacterized protein n=1 Tax=Thermosipho melanesiensis BI429 RepID=A6LP25_THEM4 Length = 534 Score = 87.8 bits (216), Expect = 3e-16, Method: Composition-based stats. Identities = 32/165 (19%), Positives = 50/165 (30%), Gaps = 17/165 (10%) Query: 103 LYIENGQQK---VALNLASGEGNFFIRPGGVFYVAGDKVG--IVRLDAFKTSKEIQFAVQ 157 I++ + N G I Y++ G I + I+ A+ Sbjct: 365 YQIKDNKIISIGYIENAPEGAMVLSISKKYEKYLSNVTPGTKIDLVLNSDFPFPIKHAIG 424 Query: 158 SGPMLMENGVINPRIHPN--------VASSKIRNGVGINKHGNAVFLLSQQATNF----Y 205 +GP+L+ENG S R + I K G F++ + N Y Sbjct: 425 AGPLLIENGKKLIDSSEEKLRYSNGLALSKTTRTIIAITKEGRVDFIVIEGYNNTGGMNY 484 Query: 206 DFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISV 250 D A + LDG S + + Q I V Sbjct: 485 DIATDFLISKGYFYAMMLDGGGSGAMVIQNEVVNQDGQIQRGIPV 529 Score = 49.3 bits (116), Expect = 1e-04, Method: Composition-based stats. Identities = 14/118 (11%), Positives = 32/118 (27%), Gaps = 7/118 (5%) Query: 24 TLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQG 83 + ++ + ++P+ + L +I Sbjct: 212 IIKNGLIWERKVEKFNNEKYLINYLKIDPKKVEI----IPVISSKGIGTREDLREILKAN 267 Query: 84 QVQMAMNGGIYDESYA-PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGI 140 +N +D S P+ L I++G+ S F I ++ + I Sbjct: 268 NCIAGINANYFDPSTNLPIDLIIKDGKILSDK--YSLRPTFIITYTNEVFIKRINLEI 323 >UniRef50_Q5ULM2 Orf92 n=1 Tax=Lactobacillus phage LP65 RepID=Q5ULM2_9CAUD Length = 556 Score = 87.8 bits (216), Expect = 3e-16, Method: Composition-based stats. Identities = 31/224 (13%), Positives = 59/224 (26%), Gaps = 23/224 (10%) Query: 41 PTLTVQAYTVNPQTERVKMY----WQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE 96 + + T K+ ++ + A+N G+++ Sbjct: 316 SGASYVFVRIPKTTNTGKILSPKLALTSSDGSLSGTKRPTLRYAKDNDTIFAVNAGLFNV 375 Query: 97 SY-APLGLYIENGQ-QKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKT----SK 150 S P+G I NG + + + T + Sbjct: 376 STVEPVGQLIINGISLINTPMTSDNGVTINPNECYPLAIDANGDLTTYPRNADTADMIAA 435 Query: 151 EIQFAVQSGPMLMENGVINPRIHPN---VASSKIRNGVGINKHGNAVFLLSQ-------- 199 +++AV + L++N I N IR +G ++G Sbjct: 436 GVKYAVTAWGKLVDNFEIATTDIENEIVHNGRYIRQSIGQYQNGYYCVCTVDMTRGSVTN 495 Query: 200 -QATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRY 242 + + A K V+ LDG S + G Y Sbjct: 496 EAGLYYKELAQIFVDK-GVKFAFSLDGGGSAETVLGKRQLNPIY 538 >UniRef50_Q01TI8 Putative uncharacterized protein n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q01TI8_SOLUE Length = 340 Score = 87.5 bits (215), Expect = 4e-16, Method: Composition-based stats. Identities = 35/274 (12%), Positives = 70/274 (25%), Gaps = 53/274 (19%) Query: 20 FLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADI 79 F+ +TL+ + T+ +N + + G T+ D Sbjct: 34 FVGVTLITRTETSP-------RAETMHIAEINLNAPGIGVKLTSP-GGTLETVRQTTLDY 85 Query: 80 NSQGQVQMAMNGGIYDE----SYAP--LGLYIENGQQKVALNLASGEGNFFIRPGGVFYV 133 +Q Q+A+NG + + +GL NG + + Sbjct: 86 LNQEHAQLAINGEFFLPFPSSDFNSMLIGLAASNGNVYSSFEAPVQSYAIVTDAPALNID 145 Query: 134 AGDKVGIVRLDA-------FKTSKEIQFAVQSGPMLMENGVI------------------ 168 + IV + + + + ++ NGV Sbjct: 146 QSNHASIVHDNTSFVDGKHVLENVTLWNTIAGSAQIITNGVASIPTYLDATHPNGLLTPG 205 Query: 169 ----NPRIHPNVASSKIRNGVGINKHGNAVFL-LSQQ-----ATNFYDFACYAKAKLNVE 218 + R +G+++ +FL + A ++ Sbjct: 206 GPASYSNSNSWYNLINARTVIGLSQDNQTLFLFTVDNAGGSRGMTLPEVANLLIGDYSIY 265 Query: 219 QLLYLDGTISHMYMKGGAIPWQRYPFVTMISVER 252 L LDG S A+ I+V Sbjct: 266 NALNLDGGGSTSM----AMQDPVTGMGRFINVSS 295 >UniRef50_A3DIP4 Exopolysaccharide biosynthesis protein n=3 Tax=Clostridium thermocellum RepID=A3DIP4_CLOTH Length = 382 Score = 87.5 bits (215), Expect = 4e-16, Method: Composition-based stats. Identities = 23/175 (13%), Positives = 44/175 (25%), Gaps = 28/175 (16%) Query: 101 LGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQ------- 153 +EN + + I G+ + + I Sbjct: 207 TSYIVENNRVARKFRGDTECK---IPSDGMVITFYEPISSEEKFEVGDWIGIDIDPDFGP 263 Query: 154 --FAVQSGPMLMENGVINPRIHPN----VASSKIRNGVGINKHGNAVFLLSQQATNFY-- 205 A + G L+ +G + + + + R +G+ G V + Y Sbjct: 264 GFQAYECGSWLVRDGQVVAVDRDDWVGLLTNRDPRTAIGVKHDGKVVLVTVDGRQPGYSV 323 Query: 206 -----DFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPW----QRYPFVTMISVE 251 + A Y L ++ LDG S + + I V Sbjct: 324 GLSSRELAGYLL-TLGIKDAAMLDGGASTQMIVQNKTVNRLPARERMLGGGIVVV 377 >UniRef50_B5W3X9 Putative uncharacterized protein n=3 Tax=Arthrospira RepID=B5W3X9_SPIMA Length = 812 Score = 87.5 bits (215), Expect = 4e-16, Method: Composition-based stats. Identities = 22/133 (16%), Positives = 39/133 (29%), Gaps = 23/133 (17%) Query: 138 VGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPR------IHPNVASSKIRNGVGINKH- 190 V +V E + +GP+L+ + IR+ VG+ + Sbjct: 671 VQLVDETNPPDFAEYPHILGAGPLLLRGNQVVLDARAENFSDAFNTQRAIRSAVGLKTNT 730 Query: 191 -GN---------AVFLLSQQA-----TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGG 235 G + ++ + + A K +L L LDG S GG Sbjct: 731 PGRSGSDSPAVSLLLVVVHPRLGGPGPSLAELAELMK-QLGATDALNLDGGSSTGLYLGG 789 Query: 236 AIPWQRYPFVTMI 248 + + I Sbjct: 790 YLLDRPPQTAAPI 802 Score = 48.6 bits (114), Expect = 2e-04, Method: Composition-based stats. Identities = 14/131 (10%), Positives = 33/131 (25%), Gaps = 21/131 (16%) Query: 23 LTLLPLFAVAADDCAL-----------------SDPTLTVQAYTVNPQTERVKMYWQKAN 65 + L V ++ + + + + Sbjct: 427 IVWAEGLRWQQKYIDLNSHQRLFTPPPSNSTGRESARFPVVWLEIDLNNQGISLQPILSR 486 Query: 66 GEAWGTLHALLADINSQGQVQMAMNGGIYDESYA-PLGLYIENGQQKVALNLASGEGNFF 124 + + L+ + S + A+NGG ++ + PLG + L G Sbjct: 487 PGSRSGVSPLV-HVASSTRAAAAINGGFFNRNNQYPLGAIRHQNRWLSGPILN--RGAIA 543 Query: 125 IRPGGVFYVAG 135 F++ Sbjct: 544 WTDQNQFFIDR 554 >UniRef50_C9N2Q2 Metallophosphoesterase n=2 Tax=Actinomycetales RepID=C9N2Q2_9ACTO Length = 1163 Score = 87.5 bits (215), Expect = 4e-16, Method: Composition-based stats. Identities = 25/205 (12%), Positives = 54/205 (26%), Gaps = 32/205 (15%) Query: 59 MYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLY-IENGQQKVALNLA 117 + A + A + + + P+ + +G+ ++ Sbjct: 210 LAAYNAANVPAQGVGAYTSAWGGADRAPAV-------DDARPVAEVAVRDGEVVS---VS 259 Query: 118 SGEGNFFIRPGGVFYVA-------------GDKVGIVRLDAFKTSKEIQFAVQSGPMLME 164 G G+ + V V GD V I + AV +L+ Sbjct: 260 DGPGSGPVPEDTVVLVGREAGAGLLAALEPGDPVKIAYRARTDGGAVPRTAVGGRELLVV 319 Query: 165 NGVINPRIHPNVASSKIRNGVGINKHGNAVFLLS-------QQATNFYDFACYAKAKLNV 217 +G ++ R VG ++ G + +++ + + + Sbjct: 320 DGAAQNHDGEGNNTAAPRTAVGFSEDGRTMQVVTVDGRQTDSGGVTLTELGEMMR-RAGS 378 Query: 218 EQLLYLDGTISHMYMKGGAIPWQRY 242 L LDG S + Sbjct: 379 YSALNLDGGGSSTLVARQPGSDTLR 403 >UniRef50_Q7U4D6 Putative uncharacterized protein n=11 Tax=Cyanobacteria RepID=Q7U4D6_SYNPX Length = 589 Score = 87.1 bits (214), Expect = 5e-16, Method: Composition-based stats. Identities = 38/233 (16%), Positives = 66/233 (28%), Gaps = 37/233 (15%) Query: 52 PQTERVKMYWQKANGEAWGTLH--ALLADINSQGQVQMAMNGGI-----------YDESY 98 P R + W + +G L L + + +N G + Y Sbjct: 349 PILNRGVVAWGDNDQLQFGRLRLDQQLQVNGGRRRGLSYLNSGYVQRGLSRYTRAWGPIY 408 Query: 99 APLG-----LYIENGQQKVALNLASGEGNFFIRPGGVFYV---------AGDKVGIVRLD 144 PL L ++ G+ + AS I G V ++ Sbjct: 409 RPLSGEEEALLVQGGRVTQRFDRASIRRGVLIPADGDLVVARGGTPLPAKPGDAVMLSQR 468 Query: 145 AFKTSKEIQFAVQSGPMLMENGVIN-----PRIHPNVAS-SKIRNGVGINKHGNAVFL-- 196 + + GP+LM+ G I P+ + + R VG G + Sbjct: 469 TTSGLGDQANVLGGGPLLMQGGQIVLNGRAEGFSPDFLALAAPRTVVGQGTGGTWLLALR 528 Query: 197 -LSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMI 248 + + A A +L ++ L LDG S + G + Sbjct: 529 GAAGSDPTLLETA-LAAQQLGLKDALNLDGGSSTTVVVAGRTVMNGRGSAPRV 580 >UniRef50_Q67T45 Putative uncharacterized protein n=1 Tax=Symbiobacterium thermophilum RepID=Q67T45_SYMTH Length = 921 Score = 87.1 bits (214), Expect = 5e-16, Method: Composition-based stats. Identities = 24/132 (18%), Positives = 46/132 (34%), Gaps = 11/132 (8%) Query: 108 GQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGV 167 G+ A G F + K G +++ S + +A+ L+ +G Sbjct: 217 GRVTNVPVPADGFILVGTNEAARF-LDPLKPGDPVTVSYRPSPAVAWAIGGQNYLVRDGA 275 Query: 168 INPRIHPNVASSKIRNGVGINKHG-NAVFLLSQQ------ATNFYDFACYAKAKLNVEQL 220 + + + AS + R+ VG + G L+ + + A + K+ Sbjct: 276 VVSGL--DNASRRPRSAVGFSADGRRMYLLVIEGDSSRSVGATLAEMAAFMKS-FGAANA 332 Query: 221 LYLDGTISHMYM 232 L LDG S + Sbjct: 333 LELDGGGSSTIV 344 >UniRef50_Q2BF40 Putative uncharacterized protein n=1 Tax=Bacillus sp. NRRL B-14911 RepID=Q2BF40_9BACI Length = 657 Score = 86.7 bits (213), Expect = 6e-16, Method: Composition-based stats. Identities = 24/116 (20%), Positives = 40/116 (34%), Gaps = 11/116 (9%) Query: 134 AGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVA---SSKIRNGVGINKH 190 + ++ K ++ + SGP+L+ NG ++ + PN R V I+K Sbjct: 254 KPGDTVEIAINIDDKWKNSEYMLASGPLLVNNGKVDLGMDPNSTRARERAPRTAVAIDKT 313 Query: 191 -GNAVFLLSQQA------TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPW 239 + N +FA Y KL + L LDG S + Sbjct: 314 MSKVFLVTVDGRLAESKGMNLTEFAQYL-VKLGAYKALNLDGGGSTAIIARKNGND 368 >UniRef50_A1VEZ3 Putative uncharacterized protein n=4 Tax=Desulfovibrio vulgaris RepID=A1VEZ3_DESVV Length = 311 Score = 86.7 bits (213), Expect = 7e-16, Method: Composition-based stats. Identities = 30/182 (16%), Positives = 63/182 (34%), Gaps = 9/182 (4%) Query: 47 AYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIE 106 A ++P ++ G +L A + + A+N +Y +++ Sbjct: 74 ALRIDPNLWDFSLHTATGEGGYPLSLGAWAEKL----NLGAAINSSMYLPDVRTSTGFLK 129 Query: 107 NGQQKVALNLASGEGNFFIR-PGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMEN 165 G+ + + G+FF+ P D + + VQ+ ++ N Sbjct: 130 AGEHVNNPRVTTKFGSFFVAAPDDPTLPQADLLDRAIDPWAERLPHYNMVVQNYRLISTN 189 Query: 166 GVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKA-KLNVEQLLYLD 224 I I VG + G +FL ++ + FA A L++ ++Y++ Sbjct: 190 RRIL--WPQGGPEYSI-AAVGQDGSGAILFLHCREPMTAHAFASMLLALPLDIHDVMYVE 246 Query: 225 GT 226 G Sbjct: 247 GG 248 >UniRef50_A5D3T7 Hypothetical membrane protein n=1 Tax=Pelotomaculum thermopropionicum SI RepID=A5D3T7_PELTS Length = 887 Score = 86.3 bits (212), Expect = 9e-16, Method: Composition-based stats. Identities = 27/157 (17%), Positives = 50/157 (31%), Gaps = 29/157 (18%) Query: 104 YIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQF--------- 154 ++NG + L G I P G + L+ ++ + Sbjct: 219 VVKNGVVQQVLTDQPG---VPIPPDGYVLRGHGQAARFILENLPAGSKVSYTYSVMPQGD 275 Query: 155 ----AVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLL-------SQQ--- 200 AV +L+E G + N+A R G++K G ++L+ S Sbjct: 276 KLFAAVGGQALLVEEGRLPAYFTQNIAGKHARTAAGVSKDGKTLYLVAVEKQSASDGTVV 335 Query: 201 --ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGG 235 + A + + + V + + LDG S Sbjct: 336 SRGMTQEELAEFLIS-IGVWRAVNLDGGGSTTLAARH 371 Score = 41.2 bits (95), Expect = 0.033, Method: Composition-based stats. Identities = 15/112 (13%), Positives = 34/112 (30%), Gaps = 3/112 (2%) Query: 24 TLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQG 83 T+ + ++ L V + +K+ + + + Sbjct: 44 TVTRGAVLQTVRMTTNEGPLNVYILKADLSDPYLKVDTIVGADGTLAKN-QTVTAMAGRA 102 Query: 84 QVQMAMNGGIYDE--SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYV 133 A+NG + S P+GL + G+ + L S F + + + Sbjct: 103 GAVAAVNGDFFQMKESGRPIGLLYQGGRLIESPALRSDMYGFAVTKDKLPIL 154 >UniRef50_A5D3R0 Hypothetical membrane protein n=1 Tax=Pelotomaculum thermopropionicum SI RepID=A5D3R0_PELTS Length = 485 Score = 86.3 bits (212), Expect = 1e-15, Method: Composition-based stats. Identities = 35/175 (20%), Positives = 56/175 (32%), Gaps = 26/175 (14%) Query: 97 SYAPLG---LYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLD--------- 144 P G + + NG G I G G+ Sbjct: 318 DTTPPGRTAVVVRNGIV-----TGIRSGQVEIPEDGYVIWYGENNYERDDQFSAGRQVDY 372 Query: 145 --AFKTSKEIQF--AVQSGPMLMENGVINPRI--HPNVASSKIRNGVGINKHGNAVFLLS 198 FK +++ +F + + P+L+ NG I P + R+ VG+ V Sbjct: 373 RVTFKENQQARFKATISNYPLLLSNGAIALGDITEPKLTIGAPRSFVGVTWDNILVMGTV 432 Query: 199 QQATNFYDFACYAKAKLNVEQLLYLDGTIS-HMYMKGGAIPWQRYPFVTMISVER 252 A N ++ A K L ++ L LDG S +Y G I + V + Sbjct: 433 DSA-NVWELAEVTKN-LGLKDALNLDGGASCGLYYDGAYIRQPGRLLSNCLVVIQ 485 >UniRef50_B5YE82 Putative uncharacterized protein n=2 Tax=Dictyoglomus RepID=B5YE82_DICT6 Length = 691 Score = 86.3 bits (212), Expect = 1e-15, Method: Composition-based stats. Identities = 28/223 (12%), Positives = 68/223 (30%), Gaps = 29/223 (13%) Query: 45 VQAYTVNPQ-TERVKMYWQKANGEAWGT------LHALLADINSQGQVQMAMNGGIYDES 97 + + +N + + A G+ + + ++ + + Sbjct: 123 IDIFQINLKIKIGENIIPVNAINSPRGSDNLNLFTKYFGKETQIRENASAGIDIEVVLKD 182 Query: 98 -----YAPLGLY--IENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSK 150 G+ I G +K + + + + I + Sbjct: 183 KIPSLGKTSGIVSNIYYGVKKTPIKENTCIISLGGTALKYLPLFSIGKEIEIITECNPPI 242 Query: 151 EIQFAVQSGPMLMENGVINPRIHPN-------VASSKIRNGVGINKHGNAVFLLSQQA-- 201 ++ A+ GP+L++NG I V S R +GI K+ + F++ + Sbjct: 243 PLKEAIGGGPILLKNGDIVLGNTDELAFDNNIVNSRHPRTIIGI-KNNSIYFIVIEGRKE 301 Query: 202 ----TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQ 240 + + K ++ + + +DG S + G + Q Sbjct: 302 NSAGVSLKEACEILK-EMGINDAINMDGGGSSQKLIWGRLVNQ 343 >UniRef50_A4FD37 Secreted protein n=1 Tax=Saccharopolyspora erythraea NRRL 2338 RepID=A4FD37_SACEN Length = 519 Score = 85.9 bits (211), Expect = 1e-15, Method: Composition-based stats. Identities = 24/165 (14%), Positives = 45/165 (27%), Gaps = 24/165 (14%) Query: 102 GLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPM 161 G G L A+ R G +V + A V GP Sbjct: 344 GPVPAGGTVVQGLGQAAEWLVAHARAGEPLWVD--QQIREESGAPLRLGPSDDIVNGGPE 401 Query: 162 LMENGVINPRIHPNV-------------ASSKIRNGVGINKHGNAVFLLSQQA------- 201 L+ +G + + + R+ +G++ G + ++ Sbjct: 402 LVRDGQVRINLQEDGIIHDAPSFAYTWGLKRNPRSVIGVDAQGRVILATTEGRMPGFSDG 461 Query: 202 TNFYDFACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRYPFV 245 + A + +A L + LDG S M + + Sbjct: 462 WGLPEAAEFVRA-LGAVDAMALDGGGSAGMVVDDRVVTTPSDATG 505 >UniRef50_B1HN11 Putative uncharacterized protein n=2 Tax=Bacillaceae RepID=B1HN11_LYSSC Length = 815 Score = 85.9 bits (211), Expect = 1e-15, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 59/192 (30%), Gaps = 31/192 (16%) Query: 81 SQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFY-------- 132 + +++ ++G D + G + EGN + G Sbjct: 185 NAWGLELVVSGASQDTNTLHFGDQFSG--TVSHVTTYGAEGNSAVPADGFVISVQNKELA 242 Query: 133 -----VAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVA---SSKIRNG 184 ++ V L + + QF + +GPML+ NG ++ + N + R Sbjct: 243 AELSNISAGTNIDVSLSIDQKWMDAQFILAAGPMLVRNGQVDISMPTNSGFASTRSPRTA 302 Query: 185 VGINKHG-NAVFLLSQQA-------TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGA 236 V ++ G + N D A + + + + LDG S + Sbjct: 303 VAVDATGTKVSLITIDGRLSGHSNGVNLSDLASHLIS-IGATSAINLDGGGSTAMVAR-- 359 Query: 237 IPWQRYPFVTMI 248 F ++ Sbjct: 360 --NPGGYFANLV 369 >UniRef50_C8X0Z8 Putative uncharacterized protein n=1 Tax=Desulfohalobium retbaense DSM 5692 RepID=C8X0Z8_DESRD Length = 302 Score = 85.5 bits (210), Expect = 2e-15, Method: Composition-based stats. Identities = 26/186 (13%), Positives = 62/186 (33%), Gaps = 10/186 (5%) Query: 44 TVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGL 103 + ++P+ R +Y A A T+ + + A+N +Y E Sbjct: 63 ELTVLRIDPEFFRFVLYSASAERGADRTVRQWVE----DKNLVAAINASMYWEDRETSTG 118 Query: 104 YIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFA--VQSGPM 161 + N + G FF+ + + L+ + Q+A +Q+ + Sbjct: 119 LMTNFGHVNNGRVHPEFGAFFVANPRRAQLPPVDILDRSLEQQWRKRVAQYATIIQNYRL 178 Query: 162 LMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKA-KLNVEQL 220 L G + V + G+ +F+L + + + + L++ Sbjct: 179 LDAKGE---NVWQASRQEHSSAAVAEDSQGHILFILQHEPVSVHALGSRLENLSLDLSTA 235 Query: 221 LYLDGT 226 ++++G Sbjct: 236 MFVEGG 241 >UniRef50_Q4ZC55 ORF005 n=1 Tax=Staphylococcus phage EW RepID=Q4ZC55_9CAUD Length = 576 Score = 85.1 bits (209), Expect = 2e-15, Method: Composition-based stats. Identities = 26/229 (11%), Positives = 53/229 (23%), Gaps = 33/229 (14%) Query: 45 VQAYTV---NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPL 101 + + + +K+ + + + + N S L Sbjct: 152 YSIVRIPHKDRKGNVIKLKRGIEGSDKSHPEPVTATEFSKRSGATYVSNASTGSGSRVML 211 Query: 102 -GLYIENGQQK-----VALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFA 155 G I NGQ V + G ++ + + Sbjct: 212 HGEQIYNGQILETVKDYEPLKTRWTLAIADDNTLVSFPPGVTAKEIKDKGYNNT-----V 266 Query: 156 VQSGPMLMENGVIN---PRIHPNVASSKIRNGVGINKHGNAVFLLSQQA----------T 202 GP L+ +G I N S R + + + +F Sbjct: 267 SGFGP-LITDGQIVYKKGDYSTNSEESHPRQVICQLDNKDLLFFTCDGRVKSQGLLQKGM 325 Query: 203 NFYDFACYAKAKL-----NVEQLLYLDGTISHMYMKGGAIPWQRYPFVT 246 + K++ ++ LDG S + G + Sbjct: 326 TLSEVIETLKSEYPIGSNGIKFAYNLDGGGSSSSVLRGRRLNKVTDNNN 374 >UniRef50_A5GW09 Putative uncharacterized protein SynRCC307_2165 n=1 Tax=Synechococcus sp. RCC307 RepID=A5GW09_SYNR3 Length = 563 Score = 84.8 bits (208), Expect = 2e-15, Method: Composition-based stats. Identities = 45/285 (15%), Positives = 80/285 (28%), Gaps = 39/285 (13%) Query: 1 MAHQLLIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMY 60 ++ Q ++G G + + + F L + P +R + Sbjct: 269 LSRQNMVGLGSLLGLARSQGAVAAINGGFFNRIQALPLGGLRDDGDWLSG-PILDRGAIA 327 Query: 61 WQKANGEAWGTLH--ALLADINSQGQVQMAMNGG----------------IYDESYAPLG 102 W + + L + A+N G + Sbjct: 328 WAPQELPRFSRVRLNETLISDAGKRIQLNAINSGWVSKGVAQYNSLWGPRYKAITGREEA 387 Query: 103 LYIENGQQKVALNLASGEGNFFIRPGGVFYVA----------GDKVGIVRLDAFKTSKEI 152 + ++ Q + A +R G VA GD V + R K E+ Sbjct: 388 VLVQGQQVARRFSHAELSRGVGLRRGETLVVARGGAPLPLKAGDGVSLERSMVPKAFAEL 447 Query: 153 QFAVQSGPMLMENGVINPR------IHPNVASSKIRNGVGINKHGNAVFLLSQQA---TN 203 +Q GP+L+ G + + R+ VG + + + Q Sbjct: 448 PNLIQGGPLLLNQGKVVLNGKAERFSSAFMRQKAPRSVVGSDDELIWLLAVEGQGNAGPT 507 Query: 204 FYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMI 248 + A + KL ++Q L LDG S M F I Sbjct: 508 LRETAELMQ-KLGLKQALNLDGGSSTRLMVRNRGQSSGRGFGAAI 551 >UniRef50_A4XD34 Putative uncharacterized protein n=1 Tax=Salinispora tropica CNB-440 RepID=A4XD34_SALTO Length = 430 Score = 84.8 bits (208), Expect = 3e-15, Method: Composition-based stats. Identities = 21/113 (18%), Positives = 33/113 (29%), Gaps = 13/113 (11%) Query: 153 QFAVQSGPMLMENGVIN-PRIHPNVASSKIRNGVGINKHGNAVFLLSQQATN------FY 205 FAV L+++G I P + R G G V + Sbjct: 314 TFAVNGRYRLVKDGQIVAPSGSDSFFDRHPRTIAGTTLDGKIVLVTIDGRQTTSVGTTMT 373 Query: 206 DFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQR-----YPFVTMISVERK 253 + A A L + + LDG S G++ Q P + + Sbjct: 374 ETASV-AAALGMHDAVNLDGGGSTTMSVEGSLVNQPSGNEERPVGDALVYIDR 425 >UniRef50_B1X2V5 Putative uncharacterized protein n=2 Tax=Cyanothece RepID=B1X2V5_CYAA5 Length = 309 Score = 84.4 bits (207), Expect = 3e-15, Method: Composition-based stats. Identities = 26/193 (13%), Positives = 60/193 (31%), Gaps = 32/193 (16%) Query: 74 ALLADINSQGQVQMAMNGGIYDE-SYAPLGLYIENGQQKVALNLASG-EGNFFIRPGGVF 131 + D + + +NGG +D + I+ G+ + N + P Sbjct: 89 KTVEDFAQETEAIAVLNGGFFDPVNSQTTSYVIKEGEAIADPSNNPRLMDNPQLEPYLKQ 148 Query: 132 YVAGDK------------VGIVRLDAFKTSKEIQFAVQSGPMLM-------------ENG 166 + + + + ++ ++ GP L+ NG Sbjct: 149 ILNRSEFRRYQCNELTRYAITYHQEPVPENCQLTESIGGGPQLLPNLSAEEEAFFESVNG 208 Query: 167 VINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACY----AKAKLNVEQLLY 222 + R + + R + I G+ ++++ +Q + + L+V + Sbjct: 209 QV-TRDPLGLERANARTAIAITSSGDVLWIMVEQTSPSTGLSLLKLREFLESLDVTSAMN 267 Query: 223 LDGTISHMYMKGG 235 LDG S + G Sbjct: 268 LDGGSSSSFFYQG 280 >UniRef50_A7SGX9 Predicted protein (Fragment) n=2 Tax=Nematostella vectensis RepID=A7SGX9_NEMVE Length = 442 Score = 84.0 bits (206), Expect = 4e-15, Method: Composition-based stats. Identities = 25/178 (14%), Positives = 48/178 (26%), Gaps = 21/178 (11%) Query: 39 SDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIY-DES 97 + V + G ++ + +A+ + Q + +A N G + ++ Sbjct: 61 RKADVQGHVSVVENPLNTFSILEPGEVGGCGKSVRSSVANSSRQKKCHVASNAGFFKTKN 120 Query: 98 YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQ 157 LG + NG+ + + NF IR G + V Sbjct: 121 GNCLGNIVSNGKLVMDADGVQ-NANFGIRKDGTIVTG----YLSENTVLDQENPFVQLVT 175 Query: 158 SGPMLMENGVINPRIHPN---------------VASSKIRNGVGINKHGNAVFLLSQQ 200 L+ NG + V R +G + G V + Sbjct: 176 GVIWLVRNGEVYVNASKKAECEDLQESGSVDLFVNVLAARTAIGHDAQGRVVIVQVDG 233 >UniRef50_UPI0001746B2F hypothetical protein VspiD_16055 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001746B2F Length = 325 Score = 84.0 bits (206), Expect = 4e-15, Method: Composition-based stats. Identities = 44/192 (22%), Positives = 71/192 (36%), Gaps = 25/192 (13%) Query: 44 TVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGL 103 TV +NP + + ++ +G A T LA + A+ D + PLGL Sbjct: 93 TVNVIEINPANYQFQTSFK--DGFALTTAKERLAT----ERAAFAITANFRDPAGKPLGL 146 Query: 104 YIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEI-QFAVQSGPML 162 + G Q+ A G F+V K F+ + + Q A Q P L Sbjct: 147 VVHEGTQRNPTFPAW---------TGYFFVKAGKPWFGPKSLFEETPGVLQEASQGYPSL 197 Query: 163 MENGVINPRIH------PNVASSKIRNGVGINKHGNAVFLL--SQQATNFYDFACYAKAK 214 M+N + + + R G+ ++GN VF+L + N + + Sbjct: 198 MKNHTVFSYVDLPSTRYFDGNRVTYRALAGMKQNGNIVFILSGTGGVMNVSEVTA-LAQR 256 Query: 215 LNVEQLLYLDGT 226 LNV+ LDG Sbjct: 257 LNVQHATLLDGG 268 >UniRef50_Q2JUI0 Conserved domain protein n=2 Tax=Synechococcus RepID=Q2JUI0_SYNJA Length = 411 Score = 84.0 bits (206), Expect = 5e-15, Method: Composition-based stats. Identities = 20/157 (12%), Positives = 45/157 (28%), Gaps = 21/157 (13%) Query: 108 GQQKVALNLA----SGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLM 163 G+ V + + G G + GD++ + + + +GP+L+ Sbjct: 230 GKVTVPIPPKGYILAARGLEGALEAGKL-IPGDRLRLDWTVDPLELEAYPHILGAGPLLL 288 Query: 164 ENGVINPRIHPNV------ASSKIRNGVGINK---HGNAVFLLSQQ------ATNFYDFA 208 +G + R+ + + + + L++ + A Sbjct: 289 LDGQVVLDAELEGFQPLFRRQQAARSAICLRQGQPDNRDLLLVAAGNAQENQGLTLLEMA 348 Query: 209 CYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFV 245 + +L L LDG S + G Sbjct: 349 QLLR-QLGCRHALNLDGGRSSTLVLGEEAVNLEPEIG 384 Score = 40.5 bits (93), Expect = 0.053, Method: Composition-based stats. Identities = 18/118 (15%), Positives = 38/118 (32%), Gaps = 4/118 (3%) Query: 12 ITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGT 71 + + ++ + L D + V V+ +++ W + Sbjct: 37 VPAQTLLNGQPVRVVEGIPLYQRMVWLEDRRILVSVVAVSLAAGQLRPIWADPA--SLVG 94 Query: 72 LHALLADINSQGQVQMAMNGGIYDESYA-PLGLYIENGQQKVALNLASGEGNFFIRPG 128 L L + + A+NGG ++ + PLG G+ + L G +F Sbjct: 95 LGELP-AFSRERGAVAAINGGFFNRNTRQPLGAIRLEGRWISSPILGRGAIAWFDAAN 151 >UniRef50_A5ILT0 Putative uncharacterized protein n=6 Tax=Thermotogaceae RepID=A5ILT0_THEP1 Length = 553 Score = 83.6 bits (205), Expect = 6e-15, Method: Composition-based stats. Identities = 23/167 (13%), Positives = 51/167 (30%), Gaps = 21/167 (12%) Query: 103 LYIENGQQ-----KVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQ 157 +++ + K + + + + G I+ AV+ Sbjct: 375 FVVKDSKVSQIGYKSRAEDSEYVVSISKKYEKYLSDLKEGDGAYLSLQPNIPLRIKQAVE 434 Query: 158 SGPMLMENGVINPRIHPN--------VASSKIRNGVGINKHGNAVFLLSQQ------ATN 203 GP+L++NG P + R + K G FL+ + Sbjct: 435 GGPLLIQNGAPIPDAWEEKARYGGGIAYAKAPRTVIA-TKDGKLWFLVFEGYNHITRGLT 493 Query: 204 FYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISV 250 + + + ++ E + +DG S + G++ + I V Sbjct: 494 YDELVDFLISR-GFEDAMCVDGGSSSVMAVAGSLFGRTENSTAAIPV 539 Score = 40.8 bits (94), Expect = 0.047, Method: Composition-based stats. Identities = 16/104 (15%), Positives = 34/104 (32%), Gaps = 7/104 (6%) Query: 39 SDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESY 98 V ++P+ +K L ++ + + +NG +D Sbjct: 237 EGEKTVVNYLIMDPEKVTIKPVVS----GNGFGTIERLDEMVKRVEGIAGINGNYFDPVT 292 Query: 99 A-PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIV 141 P+GL + +G+ A+ G F I ++ V + Sbjct: 293 KFPIGLVVIDGKPYSAMFG--GRPVFAITEDNRVFIGRIIVDVT 334 >UniRef50_C9RD84 Copper amine oxidase domain protein n=1 Tax=Ammonifex degensii KC4 RepID=C9RD84_AMMDK Length = 465 Score = 82.8 bits (203), Expect = 1e-14, Method: Composition-based stats. Identities = 25/176 (14%), Positives = 56/176 (31%), Gaps = 32/176 (18%) Query: 102 GLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQF------- 154 + +ENG G G + G G + F ++++ Sbjct: 181 TVVVENGVV-----TRMGSGPCPVPDNGYVIGFGPAAAAKFANRFYPGAKVEWWVVFEAK 235 Query: 155 -----------AVQSGPMLMENGVINPRIHPNVASSKIR-------NGVGINKHGNAVFL 196 +Q GP+L+++G I H + + + + +G + G V Sbjct: 236 DGAPLQWSGRTVIQGGPLLLKDGAIVLDSHLDELYREPKFSRYGSWSFIGTDFEGCLVLG 295 Query: 197 LSQQATNFYDFACYAKAKLNVEQLLYLDGTIS-HMYMKGGAIPWQRYPFVTMISVE 251 + ++ A + + + LDG S ++ +G + ++V Sbjct: 296 SVLGVDSLWNMARVLQQA-GIRNAVCLDGNASCGLWYRGSYLVTPGRALSNCVAVT 350 >UniRef50_C0Z816 Putative uncharacterized protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0Z816_BREBN Length = 1054 Score = 82.4 bits (202), Expect = 1e-14, Method: Composition-based stats. Identities = 26/154 (16%), Positives = 48/154 (31%), Gaps = 20/154 (12%) Query: 86 QMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDA 145 N P NG A G F VG Sbjct: 222 LFVDNVAKEVRVNQPGVYIPYNGYVLWGHGAA-----------GAFLKQNFPVGATAAVE 270 Query: 146 FKTSK---EIQFAVQSGPMLMENGVINPRIHPN--VASSKIRNGVGINKHGNAVFLLS-- 198 ++T+ ++ AV +L++ G + + S R VG+++ G +++++ Sbjct: 271 YQTTPQTLNLKQAVGGNVILVDQGKALTSFQADKSITSKTARTSVGVSQDGKTLYMVTID 330 Query: 199 -QQATNFYDFACYAKAKLNVEQLLYLDGTISHMY 231 Q + A A+L + + DG S Sbjct: 331 ASQGVYLDELAKIM-AELGSYRAVNFDGGGSTTM 363 Score = 55.1 bits (131), Expect = 2e-06, Method: Composition-based stats. Identities = 14/133 (10%), Positives = 41/133 (30%), Gaps = 7/133 (5%) Query: 28 LFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQM 87 + + ++ +T+ V+ V++ T + + + Sbjct: 51 GTTLQKYTKSFANQVVTIMVTKVDLNNPYVEVKPVYGTKGKL-TDKQTVTQMARETGAIA 109 Query: 88 AMNGGIYDES--YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDA 145 A+N + + AP G+ +++ + ++ L S + + V G Sbjct: 110 AINADFFHMTKRGAPFGIVMKDDELISSMGLVSYWYALGLTGDKMAIVDKFGFGGKVT-- 167 Query: 146 FKTSKEIQFAVQS 158 +++Q Sbjct: 168 --APNGATYSIQG 178 >UniRef50_A6TUG6 Copper amine oxidase domain protein n=1 Tax=Alkaliphilus metalliredigens QYMF RepID=A6TUG6_ALKMQ Length = 491 Score = 82.1 bits (201), Expect = 2e-14, Method: Composition-based stats. Identities = 26/113 (23%), Positives = 45/113 (39%), Gaps = 10/113 (8%) Query: 149 SKEIQFAVQSGPMLMENGVINPR-------IHPNVASSKIRNGVGINKHGNAVFLLSQQA 201 KE+ A+ +GP L++NGVI + + R+ +G+ K V Sbjct: 257 WKEVTSAIGAGPTLIKNGVITANGLSEGFFEDEILTNRGQRSFIGVTKENKLVMGTVPS- 315 Query: 202 TNFYDFACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRYPFVTMISVERK 253 + + A AK +L + Q + LDG S + K + I + +K Sbjct: 316 VSVKELAEIAK-ELGLYQAINLDGGASSGLIYKDRMVHAPGRLLSNAIVITKK 367 >UniRef50_C5C0E0 Metallophosphoesterase n=1 Tax=Beutenbergia cavernae DSM 12333 RepID=C5C0E0_BEUC1 Length = 1327 Score = 81.7 bits (200), Expect = 2e-14, Method: Composition-based stats. Identities = 21/134 (15%), Positives = 45/134 (33%), Gaps = 10/134 (7%) Query: 108 GQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGP--MLMEN 165 G+ ++ + + + + D V + + ++ A+ P L+E+ Sbjct: 248 GEGRLPDGVRALVARPGAAADALATLVADDHVDVAYGLREDAGDVAVALGGAPEDWLLED 307 Query: 166 GVINPRIHPNVASSKIRNGVGINKHGNA-VFLLSQQA------TNFYDFACYAKAKLNVE 218 G I V R VG ++ G F++ + + A+L + Sbjct: 308 GEITSATGGYVDVRHPRTAVGFDETGTTAYFVVVDGRQSHSIGMTLPELGRFL-AQLGAD 366 Query: 219 QLLYLDGTISHMYM 232 + LDG S + Sbjct: 367 DAINLDGGGSSEMV 380 >UniRef50_Q03K73 Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase n=3 Tax=Streptococcus thermophilus RepID=Q03K73_STRTD Length = 179 Score = 81.3 bits (199), Expect = 3e-14, Method: Composition-based stats. Identities = 18/115 (15%), Positives = 34/115 (29%), Gaps = 14/115 (12%) Query: 142 RLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASS---KIRNGVGI----NKHGNAV 194 + GP L+ENG + + V R + + ++ + + Sbjct: 33 TTAQKLVDSGVVNTFAFGPTLVENGKVAVSENEEVGQDMADNPRTAIVVNEESDRSVHYI 92 Query: 195 FLLSQQAT------NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYP 243 ++S T Y+ A K+ V LD S G + + Sbjct: 93 VIVSDGRTSESSGLTLYEMAELMKS-YGVMTGYNLDVGDSSTMYSNGQVINKPTH 146 >UniRef50_C9M6C8 Putative uncharacterized protein n=1 Tax=Jonquetella anthropi E3_33 E1 RepID=C9M6C8_9BACT Length = 603 Score = 81.3 bits (199), Expect = 3e-14, Method: Composition-based stats. Identities = 25/120 (20%), Positives = 41/120 (34%), Gaps = 12/120 (10%) Query: 135 GDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVI---NPRIHPN-VASSKIRNGVGINKH 190 GD + + + ++ A+Q GP+L+++G I N I + R VG Sbjct: 443 GDPITLNVQWRDENAQGTVGALQGGPLLLKDGKIQRMNEGIAVGVINRRHPRTLVGRIGK 502 Query: 191 GNAVFLLSQQATNFY------DFACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRYP 243 +L ++ D A L LL LDG S + G + Sbjct: 503 -TVWWLAVDGRAPWHSSGLTLDEATTLGQYLGFTDLLNLDGGGSTELLYHGYPVNKPSDG 561 >UniRef50_UPI0001C31921 Collagen triple helix repeat protein n=2 Tax=Conexibacter woesei DSM 14684 RepID=UPI0001C31921 Length = 1426 Score = 80.9 bits (198), Expect = 4e-14, Method: Composition-based stats. Identities = 22/193 (11%), Positives = 51/193 (26%), Gaps = 20/193 (10%) Query: 51 NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQ 110 + + ++ +A + + +G Sbjct: 168 DLPVAALNAAGAVPADGYVAFTPKWGRSSRARSLAGVANVAEALVTDGRVV--AVSDG-- 223 Query: 111 KVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAF----KTSKEIQFAVQSGPMLMENG 166 A + +G R + + G A+ ++++QFA+ +L+ +G Sbjct: 224 VGAGEIPAGSFYLVGRESAADAIRALRAGDEVRLAYGLSGDVAQQLQFAIGGNEVLVRDG 283 Query: 167 VINPRIHPNVASSKIRNGVGINKHGN-AVFLLSQQA------TNFYDFACYAKAKLNVEQ 219 + S R +G G + ++ A + E Sbjct: 284 QVVGSD----QSVHPRTAIGFKDGGRTLLLFVADGRQTQVLGMTTQKVAQLLRDA-GAET 338 Query: 220 LLYLDGTISHMYM 232 + LDG S + Sbjct: 339 AMNLDGGGSTTLV 351 >UniRef50_Q1MS76 Putative uncharacterized protein LI0093 n=1 Tax=Lawsonia intracellularis PHE/MN1-00 RepID=Q1MS76_LAWIP Length = 331 Score = 80.9 bits (198), Expect = 4e-14, Method: Composition-based stats. Identities = 32/244 (13%), Positives = 75/244 (30%), Gaps = 26/244 (10%) Query: 28 LFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQM 87 L+ + V +NPQ ++ G+A+ +L ++ Sbjct: 88 LWLGKFPGVTKAGDVFEVVMLKINPQYYDFSLHMASQTGKAF-SLQDWSNT----YELSA 142 Query: 88 AMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFI-RPGGVFYVAGDKVGIVRLDAF 146 +N +Y Y+ N ++ G FF+ P D + + Sbjct: 143 VINASMYLPDGVTSTGYLRNHDHINNAHVGKRLGAFFVASPYNSTLPNADLLDRTSDNWE 202 Query: 147 KTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYD 206 + + VQ+ ++ N + S V + G F+ ++ + D Sbjct: 203 ILLPQYKIVVQNYRVISANRQCLWSTKKVIHSIA---AVARDGKGYLFFIHTKYPISDLD 259 Query: 207 FACYAKA-KLNVEQLLYLDGTISH----------MYMKGG------AIPWQRYPFVTMIS 249 F + +++ ++Y++G G +I P +I Sbjct: 260 FGNLLLSLPIDIRIVMYVEGGSQAGLLINTSNFKQLWMGKHPVSILSINNTSVPIPNVIG 319 Query: 250 VERK 253 ++++ Sbjct: 320 IKKR 323 >UniRef50_A4XGY7 Putative uncharacterized protein n=2 Tax=Clostridia RepID=A4XGY7_CALS8 Length = 877 Score = 80.9 bits (198), Expect = 4e-14, Method: Composition-based stats. Identities = 18/90 (20%), Positives = 39/90 (43%), Gaps = 9/90 (10%) Query: 150 KEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAV-FLLSQQAT------ 202 ++I+ A L+++G I P +A R+ +GI+K G + + Sbjct: 266 EKIKAAASGNTFLLKDGKI-PSFTHEIAGRHPRSAIGIDKTGRYLYLVAVDGRNGKSIGL 324 Query: 203 NFYDFACYAKAKLNVEQLLYLDGTISHMYM 232 + + A + ++ ++V + LDG S + Sbjct: 325 SQGELASFLQS-IDVWTAINLDGGYSTQLI 353 >UniRef50_C7QCB3 Putative uncharacterized protein n=1 Tax=Catenulispora acidiphila DSM 44928 RepID=C7QCB3_CATAD Length = 585 Score = 80.5 bits (197), Expect = 5e-14, Method: Composition-based stats. Identities = 33/206 (16%), Positives = 67/206 (32%), Gaps = 17/206 (8%) Query: 27 PLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQ 86 P+ VA +T ++ + ++ G + Sbjct: 341 PVVQVARVRPDAVYTGVTADVAVIDQKHSGFVLHPGHEGGLNSVITTVPNQIDANARPNL 400 Query: 87 MAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAF 146 +A+ G + S + G Y A +L +G + I G V R +F Sbjct: 401 IALFNGGFKISESHGGYYDHG---VTAASLVNGAASEVIFKDGHMAVGMWG----RDYSF 453 Query: 147 KTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKI--------RNGVGINKHGNAVFLLS 198 + + +I Q+ ++++ G + P I + + R+GVG+ G+ V+ + Sbjct: 454 QKNADIVSVRQNLKLMVDGGQVVPYIDDSSTWGRADHGSAAVWRSGVGVKADGDIVW-VG 512 Query: 199 QQATNFYDFACYAKAKLNVEQLLYLD 224 A K + + LD Sbjct: 513 GNELTAPSLARLLKDA-GAVRAMQLD 537 >UniRef50_Q30YC1 Putative uncharacterized protein n=1 Tax=Desulfovibrio desulfuricans subsp. desulfuricans str. G20 RepID=Q30YC1_DESDG Length = 383 Score = 80.5 bits (197), Expect = 5e-14, Method: Composition-based stats. Identities = 30/250 (12%), Positives = 69/250 (27%), Gaps = 54/250 (21%) Query: 23 LTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQ 82 +++ +A D + TV ++P+ +++Y G T + Sbjct: 88 ISVAQGLELAESSAVFRDTSGTVALLRIDPRHYSLQLYTISEQGGPPQTPSEW----AAL 143 Query: 83 GQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNF----FIRPGGVFYVAGDKV 138 + +N ++ + Y+ NG + G+F + P Sbjct: 144 YNLDAVINASMFLPDGSTSTGYMRNGTAANNSRINQRFGSFLVFSPLPPHAAASDGQPPA 203 Query: 139 GIVRLDAFKTS--------------------------------------KEIQFAVQSGP 160 G + D + + + VQ+ Sbjct: 204 GGTQPDPYAPAAAHTTAARNTPNDAGSDNQQLPAADVLDRYADDWQTLLPRYRGVVQNFR 263 Query: 161 MLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKA---KLNV 217 M+ + + S +G + G +F+ S+ T + + Y L Sbjct: 264 MISADRKPLWPEEGDSFSIA---AIGKDTQGRILFIHSRAQTTVRELSEYLLDICPSLGA 320 Query: 218 EQLLYLDGTI 227 +Y++G Sbjct: 321 T--MYVEGGA 328 >UniRef50_A7HN47 Putative uncharacterized protein n=1 Tax=Fervidobacterium nodosum Rt17-B1 RepID=A7HN47_FERNB Length = 528 Score = 80.5 bits (197), Expect = 5e-14, Method: Composition-based stats. Identities = 21/128 (16%), Positives = 41/128 (32%), Gaps = 16/128 (12%) Query: 137 KVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPN--------VASSKIRNGVGIN 188 + +++ AV +GP+L+++ I + R + I Sbjct: 402 GADVSVELYTDNGYKVKNAVGAGPLLIQDKKIIQDAAEEKLRYGGGIPTTRASRTIIAI- 460 Query: 189 KHGNAVFLLSQQ----ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIP--WQRY 242 K G + + NF + A + +K E + LDG S + G + Sbjct: 461 KDGKVHLITIEGTNGTGMNFDEAAQFLLSK-GYESAMMLDGGGSTGMVYAGKLVTINSPR 519 Query: 243 PFVTMISV 250 + V Sbjct: 520 NIPVALGV 527 >UniRef50_B7KAR9 Polysaccharide deacetylase n=3 Tax=Cyanothece RepID=B7KAR9_CYAP7 Length = 627 Score = 80.1 bits (196), Expect = 7e-14, Method: Composition-based stats. Identities = 30/245 (12%), Positives = 72/245 (29%), Gaps = 26/245 (10%) Query: 29 FAVAADDCALSDPTLTVQAYTVNPQTER--VKMYWQKANGEAWGTLHALLADINSQGQVQ 86 + + Y + + + +++I + + Sbjct: 355 WGGYPAPRYDGGFNFNTRIYKQDFTVNDTQLTLITGGRPNTIHADTRYQVSEIIAGTGAE 414 Query: 87 MAMNGGIYD----ESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVR 142 A++GG + +S +G + G + I+ + ++ D+V V Sbjct: 415 AAVDGGFFSLESLDSNVMIGPVL--GHNTGEFIPGNAWEIPRIKGRPLVLMSSDRVRFVP 472 Query: 143 LDAFK------------TSKEIQFAVQSGPMLMENGVINP----RIHPNVASSKIRNGVG 186 D K ++I A L+ + P + +++ R G Sbjct: 473 FDPNKHNTYEGVISEATEGEKITDAFVGAAWLVRDNQPQPPEAFGELFDFEAARHRAFWG 532 Query: 187 INKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMY-MKGGAIPWQRYPFV 245 IN+ G V +++ + +L + + LD S +G ++ V Sbjct: 533 INQAGQPVIGVTKTMVGSVELGEIL-HQLGLRDAVMLDSGASTSLAYQGESLVGYTPRPV 591 Query: 246 TMISV 250 + Sbjct: 592 PHVVA 596 >UniRef50_C6WLB3 Metallophosphoesterase n=1 Tax=Actinosynnema mirum DSM 43827 RepID=C6WLB3_ACTMD Length = 1118 Score = 79.4 bits (194), Expect = 1e-13, Method: Composition-based stats. Identities = 26/140 (18%), Positives = 40/140 (28%), Gaps = 21/140 (15%) Query: 104 YIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLM 163 E+G L +G + PG V A + A+ +L+ Sbjct: 253 VPEDGYVL--LGREAGATALAVAPGDHLSVR------YSTRAGDGGSAPRAAIGGNQVLL 304 Query: 164 ENGVINPRIHPNVASSKIRNGVGINKHGNAVFL-LSQQAT-------NFYDFACYAKAKL 215 + + P+ R VG + G +FL N D A + L Sbjct: 305 RDSEVVAPDDPS----HPRTAVGFSADGRRMFLLTVDGRQSAHLLGLNLKDVAEALRD-L 359 Query: 216 NVEQLLYLDGTISHMYMKGG 235 L LDG S + Sbjct: 360 GAHNALNLDGGGSSTLVARE 379 >UniRef50_A4FAG7 Secreted protein n=5 Tax=Actinomycetales RepID=A4FAG7_SACEN Length = 434 Score = 79.4 bits (194), Expect = 1e-13, Method: Composition-based stats. Identities = 27/181 (14%), Positives = 50/181 (27%), Gaps = 34/181 (18%) Query: 103 LYIENGQQKVALNLASGEGNFFI-------RPGGVFYVAGDKVGIVRLDAFKTSK----E 151 + + +G+ + G G R GV + K G ++ + E Sbjct: 257 IVVRDGK-VAEVRPEPGAGAIAAGDFVLVGREDGVGELDDLKPGDPVSVDYQLAPVGVPE 315 Query: 152 IQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHG-NAVFLLSQQAT------NF 204 +F V P+L +G P + + + R G + G + + Sbjct: 316 FRFVVGGFPIL-RDGTALPGL--DDQALAPRTSAGASADGKRVYLVAMDGRSQVSAGLTV 372 Query: 205 YDFACYAKAKLNVEQLLYLDGTISHMYMKGG-------AIPWQR----YPFVTMISVERK 253 + A K + + LDG S + P I + Sbjct: 373 SELADLLKRS-GADDAVNLDGGGSTTLVAREAGAQHVTVRNNPSDGRERPVANGIGIFSG 431 Query: 254 G 254 G Sbjct: 432 G 432 >UniRef50_B8FVQ0 Ig-like, group 2 n=2 Tax=Desulfitobacterium hafniense RepID=B8FVQ0_DESHD Length = 913 Score = 79.0 bits (193), Expect = 1e-13, Method: Composition-based stats. Identities = 22/152 (14%), Positives = 44/152 (28%), Gaps = 27/152 (17%) Query: 103 LYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGP-- 160 + + + G+ I G V+ + L+ +++QF V S P Sbjct: 212 IVVSGNTIL---EIREGQPAAEIPEEGFVVVSRGEQAAKLLEQAAPGEQLQFQVTSTPDW 268 Query: 161 -----------MLMENGVINPRIH---PNVASSKIRNGVGINKHGN-AVFLLSQQAT--- 202 +L+++G I + R G + G+ + + Sbjct: 269 NDLKMSTTGTSLLIQDGEIPATFSYSTASFNQRNPRTMAGSTEDGSELILVTVDGRQDNS 328 Query: 203 ---NFYDFACYAKAKLNVEQLLYLDGTISHMY 231 + A +L Q + DG S Sbjct: 329 IGLTQQESAELML-ELGAYQAIMFDGGASTTM 359 >UniRef50_C1TLP7 Sporulation related-protein with S-layer-like domain n=1 Tax=Dethiosulfovibrio peptidovorans DSM 11002 RepID=C1TLP7_9BACT Length = 565 Score = 78.6 bits (192), Expect = 2e-13, Method: Composition-based stats. Identities = 17/143 (11%), Positives = 52/143 (36%), Gaps = 12/143 (8%) Query: 112 VALNLASGEGNFFIRPGGV-FYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLM-ENGVIN 169 + E ++R G+ + ++ + + + + E + +Q+GPM++ G + Sbjct: 412 SNHFMKPEENILYLRTSGLGPFSDAAEIKLQTIWSDEAMIEAKQVIQAGPMILGLEGPFS 471 Query: 170 PRIHPNV--ASSKIRNGVGINKHGNAVFLLSQQATNFYD------FACYAKAKLNVEQLL 221 + R G + +++ ++++ A + + + + + Sbjct: 472 SEWFSDSIINKRHPRTLAGWDGD-RLCWIVIDGRSSWHSDGATLSEAAFIARQAGLVKAI 530 Query: 222 YLDGTISH-MYMKGGAIPWQRYP 243 +DG S ++ KG + Sbjct: 531 NMDGGGSSQLWWKGITVNLPSEG 553 >UniRef50_B4WHW3 Putative uncharacterized protein n=1 Tax=Synechococcus sp. PCC 7335 RepID=B4WHW3_9SYNE Length = 335 Score = 78.6 bits (192), Expect = 2e-13, Method: Composition-based stats. Identities = 34/206 (16%), Positives = 58/206 (28%), Gaps = 50/206 (24%) Query: 74 ALLADINSQGQVQMAMNGGIYDE-SYAPLGLYIE----------NGQQKVALNLASGEGN 122 A + + +NGG +D + I N + NL Sbjct: 98 ATIEAFAERTNADYIINGGFFDPHNGKTTSHLISQEQTVSDPADNERLINNSNLGQYMAQ 157 Query: 123 FFIRPGGVFYVAGDKVGIVR-------------------LDAFKTSKEIQFAVQSGPMLM 163 R Y + R EI A+ +GP L+ Sbjct: 158 ILNRSEFRVYRCRQASVVERGGLEGSLTEEAVVYDITFHNAPPPDGCEIDTAIGAGPQLL 217 Query: 164 -------------ENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQ-----ATNFY 205 ++G I R R+ +G+ G ++ ++ Sbjct: 218 PADTSWVEGFIDYDDG-ILFRDAIGSRQPNARSAIGLYPDGAIALIMVEKSASSIGMTLL 276 Query: 206 DFACYAKAKLNVEQLLYLDGTISHMY 231 + A +AK+ L + +LL LDG S Sbjct: 277 ELADFAKS-LGITKLLNLDGGSSSAL 301 >UniRef50_UPI0001744904 hypothetical protein VspiD_09360 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001744904 Length = 258 Score = 78.2 bits (191), Expect = 2e-13, Method: Composition-based stats. Identities = 40/242 (16%), Positives = 74/242 (30%), Gaps = 26/242 (10%) Query: 1 MAHQLLIGKGMITLNLKRIF-LALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKM 59 +A LL+ G+ + ++L + D + +Q T + +V++ Sbjct: 5 LALILLLATGLPGGGRLPAQDVVVSLGHGVVWSRKDISAPLEGW-LQVVTFDASKVKVEV 63 Query: 60 YWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE-SYAPLGLYIENGQQKVALNLAS 118 + + E +H + + NGG +D ++AP GL + G Sbjct: 64 LAR-QDRETALPMHRWMTEAR----AIAGCNGGYFDPATFAPSGLQVVEGLATGKYQQFG 118 Query: 119 GEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSK--EIQFAVQSGPMLMENGVINPRIHPNV 176 G G F V K I E + VQ P+L+ +G + Sbjct: 119 EWG-------GGFGVRSGKAQIWTEQEILAMPTFEAESFVQCSPVLV-DG-VRRFTGAGE 169 Query: 177 ASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAK------LNVEQLLYLDGTISHM 230 R + + ++ + A + V + L LDG S Sbjct: 170 DVRARRTFIAHDGGARWALGVTSG-IGLRELAELLVNQGAGLLGFKVSRALNLDGGPSTG 228 Query: 231 YM 232 Sbjct: 229 LW 230 >UniRef50_A3YXL4 Putative uncharacterized protein n=2 Tax=Chroococcales RepID=A3YXL4_9SYNE Length = 610 Score = 78.2 bits (191), Expect = 2e-13, Method: Composition-based stats. Identities = 30/177 (16%), Positives = 46/177 (25%), Gaps = 21/177 (11%) Query: 91 GGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSK 150 G S G+ I +G LN A + + V V +A + + Sbjct: 415 AGYQALSGNESGVLIRDGVVLQRLNGAQLQRGIPLGREDTLVVGRAGVMPPWPEASRLTL 474 Query: 151 EIQ----FAVQSGPMLME-----------NGVINPRIHPNVASSKIRNGVGINKHGNAVF 195 Q Q+ M NG R +G + F Sbjct: 475 SSQSSDPLGQQAYVMGGGPLLLLGGRVVLNGTAEGFSSAFQGQGAPRTVIGSDG-RQIWF 533 Query: 196 LLSQQ----ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMI 248 L Q + A + +L + + L LDG S G + I Sbjct: 534 LTLQGVDHAGPTLGETATLLR-QLGLREALNLDGGSSTGLFVGNTQTVRGRGVAASI 589 >UniRef50_C6D5A3 Copper amine oxidase domain protein n=1 Tax=Paenibacillus sp. JDR-2 RepID=C6D5A3_PAESJ Length = 904 Score = 78.2 bits (191), Expect = 3e-13, Method: Composition-based stats. Identities = 26/160 (16%), Positives = 49/160 (30%), Gaps = 16/160 (10%) Query: 86 QMAMNGGIYD-ESYAP-LGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRL 143 + +NG + P G +G A+ ++ G + + Sbjct: 244 ALVVNGVVQQISDGKPLTGTIPADGYILRGHGTAAQFILEHLQV-GQSVTSDYSLVSATS 302 Query: 144 DAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSK-----IRNGVGINKHGNAVFLLS 198 + V +L+ NG ++ R VG +K G V+L++ Sbjct: 303 GQKVDPTSFEMLVGGHTILVNNGAA-ATFSRDITGVSGSSYVSRTAVGYSKDGTKVYLIT 361 Query: 199 QQ------ATNFYDFACYAKAKLNVEQLLYLDGTISHMYM 232 + N + KL V + + LDG S + Sbjct: 362 SEDYGDSTGLNLKELQQVM-VKLGVYKGINLDGGGSTTMI 400 Score = 46.2 bits (108), Expect = 0.001, Method: Composition-based stats. Identities = 10/109 (9%), Positives = 30/109 (27%), Gaps = 3/109 (2%) Query: 31 VAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMN 90 + + + V+ V + + G L+ ++ + +N Sbjct: 76 LWTTTRSGKAAKANIHVIQVDLTNPYVNLNTMSGKNDTVGKLNTVMNMT-KENGAVAGIN 134 Query: 91 GGIY--DESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDK 137 ++ +P+G + +G + G F + + Sbjct: 135 ADVFITTTEGSPMGAQVTSGTLMTSPMQIKGMYAFAVTKDRKPVIDSYT 183 >UniRef50_D2PZR6 Sporulation domain protein n=4 Tax=Actinomycetales RepID=D2PZR6_9ACTO Length = 537 Score = 77.4 bits (189), Expect = 4e-13, Method: Composition-based stats. Identities = 26/178 (14%), Positives = 49/178 (27%), Gaps = 31/178 (17%) Query: 102 GLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQ-FAVQSGP 160 G E G+ A + G + + + + V GP Sbjct: 364 GAIPEGGRTVQATGSQVAALLAAAQVGRRLEI---RAVLTDARGRDVRLSPRTSVVNGGP 420 Query: 161 MLMENGVINPRIHPNV--------------ASSKIRNGVGINKHGNAVFLLSQQAT---- 202 L+ +G + + R G++ G V + + Sbjct: 421 ELVRDGRLMATPKADGMAPAGNPNFYYGWVHKRNPRTIAGVDAQGRTVLITADGRNVSSL 480 Query: 203 --NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQR------YPFVTMISVER 252 + A AK+ L + + + LDG S + G + Q P + + R Sbjct: 481 GLGIAEAAAVAKS-LGLREAVNLDGGGSTTMVANGKVVNQPSDAAGERPVGDALVITR 537 Score = 45.9 bits (107), Expect = 0.001, Method: Composition-based stats. Identities = 23/204 (11%), Positives = 52/204 (25%), Gaps = 19/204 (9%) Query: 27 PLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQ 86 + DD + S VQ T++P+ R + A+ + + + Sbjct: 164 TGWDGEPDDLSGSTGPWQVQVLTIDPKKFRGTL---DASYGLDLEARETTSTLATLTGAT 220 Query: 87 MAMNGGIYDES------YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVA-GDKVG 139 A+N G + P G+ + +G+ G + Sbjct: 221 AAVNAGFFVLDPKAGAPGDPAGVAVYDGRLVSEPTAGRPALVVGENARGTSVERFRWRGE 280 Query: 140 IVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKI-RNGVGINKHGNAVFLLS 198 I + P L+ N ++ ++ + + F + Sbjct: 281 IRGTGRPLPLDGLNRV----PGLIRN---CGGTTDDLPTAAPLHDVTCTDADELIAFDAA 333 Query: 199 QQATNFYD-FACYAKAKLNVEQLL 221 + A + V + Sbjct: 334 YGVSTPSGPGAEVIVDRRGVVTAI 357 >UniRef50_A3TM75 Putative uncharacterized protein n=1 Tax=Janibacter sp. HTCC2649 RepID=A3TM75_9MICO Length = 1151 Score = 77.4 bits (189), Expect = 4e-13, Method: Composition-based stats. Identities = 29/182 (15%), Positives = 58/182 (31%), Gaps = 26/182 (14%) Query: 77 ADINSQGQVQMAMNGGIYDESYAPLGL-----YIENGQQKVALNLASGEGN-------FF 124 I + G G Y + G + G ++ +GEG Sbjct: 223 TTIPTNGVTVFTPLWGDYTRTRVTGGATKVREVVTRGGVVESVATVAGEGQLDKDQLAIV 282 Query: 125 IRPGGVFYVAGDKVGIVRLDAF---KTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKI 181 R G +A VG + ++ A+ ML+++ V+ P+ + Sbjct: 283 GRGAGADTLAALNVGDKVSVDYALRNEGANLKMAISGNTMLLKDNVVLPQTDKAI---HP 339 Query: 182 RNGVGINKHGNAVFL-LSQQAT------NFYDFACYAKAKLNVEQLLYLDGTISHMYMKG 234 R +G + G+ +F+ + + + K ++ + LDG S + Sbjct: 340 RTAIGFDADGSTMFVLTVDGRMAASRGMTYAETGAFLK-EVGATSGINLDGGGSSTMLAR 398 Query: 235 GA 236 A Sbjct: 399 EA 400 >UniRef50_D1Y6Q3 Putative liporotein n=1 Tax=Pyramidobacter piscolens W5455 RepID=D1Y6Q3_9BACT Length = 572 Score = 77.1 bits (188), Expect = 5e-13, Method: Composition-based stats. Identities = 23/134 (17%), Positives = 47/134 (35%), Gaps = 15/134 (11%) Query: 134 AGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIH---PNVASSKIRNGVGINKH 190 GD+V + ++ AVQ+GP+L G + R +G + Sbjct: 440 KGDRVTLETQWRETPPIDVASAVQAGPLLYAPGHQFWDEMLSLSILTLRHPRTLLGWDGK 499 Query: 191 GNAVFLLSQQATNFYDFACYAKA------KLNVEQLLYLDGTISHMYMKGGAIPW----- 239 V++++ ++++ + +L + LL LDG S G + Sbjct: 500 -RMVWIVADGRSSWHSRGLFLNEAEQLGRRLGLTALLNLDGGGSSEMWWDGHVVNAVSDG 558 Query: 240 QRYPFVTMISVERK 253 + + V +K Sbjct: 559 RERRMPYGLMVLKK 572 >UniRef50_D2PYC0 Metallophosphoesterase n=1 Tax=Kribbella flavida DSM 17836 RepID=D2PYC0_9ACTO Length = 1094 Score = 76.7 bits (187), Expect = 6e-13, Method: Composition-based stats. Identities = 36/234 (15%), Positives = 66/234 (28%), Gaps = 31/234 (13%) Query: 28 LFAVAADDCALSDPTLTVQAYTVNPQTER--VKMYWQKANGEAWGTLHALLADINSQGQV 85 AVA S +A +PQ +++ + G + I Sbjct: 125 GPAVADGQLVKSQSEDPYRAVAFDPQGVGRILEVLFDGTAG-PYRLNRLNSPVIRKDEIG 183 Query: 86 QMAMNGGIYDESYAPLGLYIENGQQKVALNLA------------SGEGNFFIRPGGVFYV 133 G Y ++A G + +G R G + Sbjct: 184 AFTTLWGSYSRAHAVAGAAKVTEVVVAGDTVTAVAAAAGAGDIPAGTTILVGREAGADEL 243 Query: 134 AGDKVGIVRLDAFKTSKE----IQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINK 189 KVG AF ++ AV + +L++ G P + + R G+G + Sbjct: 244 GRLKVGDRVPVAFAPRASDGSVVRTAVGAHALLVKEGKPQP---ADDTAYAGRTGLGFSA 300 Query: 190 HGNAVFLLS--------QQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGG 235 G + ++S + + A+ + LDG S + Sbjct: 301 DGKKMVIVSIDSNRLTHSRGATLAEMGRILAAR-GAYVGVELDGGGSTTLVSRR 353 >UniRef50_C1YVW0 Putative uncharacterized protein n=1 Tax=Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 RepID=C1YVW0_NOCDA Length = 730 Score = 76.3 bits (186), Expect = 9e-13, Method: Composition-based stats. Identities = 32/250 (12%), Positives = 61/250 (24%), Gaps = 23/250 (9%) Query: 2 AHQLLIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYW 61 A Q +G GM L + A ++ A + + + Sbjct: 106 ATQAPLGAGMSDGRLLTSPDPGFANAVVIDAGGRGSVRQVAFEGTAS---LPSGDLDIDA 162 Query: 62 QKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNL----- 116 + L +D + + G A + G + Sbjct: 163 LNTSAVPADGLGLYTSDWGGHPRAHVVYEPGTSPGDTAVAEAVVSEGVVERVSVTPGSGP 222 Query: 117 ----ASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRI 172 + ++ V E + V +L+ +G P Sbjct: 223 IEEDEQVLVARGSAAERIADLSEGDPVEVEHTLTAEGAEPRVVVGGRHVLVRDGEPVPVE 282 Query: 173 HPNVASSKIRNGVGINKHGNAVFLLS-QQAT------NFYDFACYAKAKLNVEQLLYLDG 225 S R +G ++ G + +++ + A A EQ L LDG Sbjct: 283 DV---SRAPRTAIGFSEDGEVMHVVTADGRNRGHAGSTLAEVAELLAAS-GAEQALELDG 338 Query: 226 TISHMYMKGG 235 S + Sbjct: 339 GGSSTLLVRE 348 >UniRef50_C3YJA0 Putative uncharacterized protein n=3 Tax=Branchiostoma floridae RepID=C3YJA0_BRAFL Length = 851 Score = 76.3 bits (186), Expect = 1e-12, Method: Composition-based stats. Identities = 26/172 (15%), Positives = 47/172 (27%), Gaps = 21/172 (12%) Query: 45 VQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE-SYAPLGL 103 +N + + + + +A+N G +D + A LG Sbjct: 324 GHYTVINNPLRTFSVVEPGGSNGCMEPRRRTVTQTSQTRTCHVALNAGFFDTRTGACLGN 383 Query: 104 YIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLM 163 + +GQ+ NF IR G V + D + V L+ Sbjct: 384 VVTDGQRVQDSGGIQ-NANFGIRKDGTIVVG----YLSEQDVLREDNPFVQLVSGVIWLV 438 Query: 164 ENGVINP---------------RIHPNVASSKIRNGVGINKHGNAVFLLSQQ 200 N + + V R VG ++ G V + + Sbjct: 439 RNATVYVNESRTTECADIQETGTLDRFVNVVSARTAVGHDEEGRVVLVHIEG 490 >UniRef50_Q0AWB0 Putative uncharacterized protein n=1 Tax=Syntrophomonas wolfei subsp. wolfei str. Goettingen RepID=Q0AWB0_SYNWW Length = 497 Score = 75.9 bits (185), Expect = 1e-12, Method: Composition-based stats. Identities = 28/174 (16%), Positives = 54/174 (31%), Gaps = 24/174 (13%) Query: 102 GLYIENGQQ--KVALNLASGEGNFFIRPGGVF-YVAGDKVGIVRLDAFKTSKEIQF---- 154 G+ + NG + E F I G Y+ ++ + ++ + F Sbjct: 168 GVIVSNGHVSSITTSSFNIPENGFAIVYNGASSYLVDERYKVGDEVYYEVIIKPTFTNPS 227 Query: 155 -------AVQSGPMLMENGVINPRIHPNV-------ASSKIRNGVGINKHGNAVFLLSQQ 200 A+ +GP L+ NG + SS R+ +G G + Sbjct: 228 DWEEVQCAIGAGPSLIINGNVTASGEEEGFFEAKINTSSSPRSFIGATADGRIIMGNMDA 287 Query: 201 ATNFYDFACYAKAKLNVEQLLYLDGTIS-HMYMKGGAIPWQRYPFVTMISVERK 253 AT A ++ + + LDG S +Y + ++ + Sbjct: 288 ATLKKAAAAC--QRMGLVNAMCLDGGYSIALYYASAGVSLAGRDINNGLAFVGR 339 Score = 42.4 bits (98), Expect = 0.016, Method: Composition-based stats. Identities = 16/109 (14%), Positives = 37/109 (33%), Gaps = 8/109 (7%) Query: 30 AVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAM 89 A + + + V+ T++ + + ++ A + LA + + A+ Sbjct: 16 ASSDAYKSEKINGVGVKYVTLDMKDKNIQPRVLNAQNQI--CATESLASMAKKAGAFAAI 73 Query: 90 NGGIYDESY---APLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAG 135 NG ++ P G I++G+ ++ I G V Sbjct: 74 NGTYFEAYGGTPVPWGTIIKDGKVLH---ISQSGAVVGITSSGKLLVDR 119 >UniRef50_C5CET4 Putative uncharacterized protein n=1 Tax=Kosmotoga olearia TBF 19.5.1 RepID=C5CET4_KOSOT Length = 558 Score = 75.5 bits (184), Expect = 1e-12, Method: Composition-based stats. Identities = 21/101 (20%), Positives = 42/101 (41%), Gaps = 13/101 (12%) Query: 149 SKEIQFAVQSGPMLMENGVINPRIHPNVAS------SKIRNGVGINKHGNAVFLLSQQ-- 200 +++++FA++ GP+++ G + S R +GI K G +F++ Sbjct: 439 NEKLKFAIEGGPLIISRGKPVTEYEKSFYSSSLLDIRAPRTLIGITKSGTLMFMIIDGYQ 498 Query: 201 ----ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAI 237 F + + K N E L+ +DG S + G + Sbjct: 499 MKSYGLTFKEMVEFFTDK-NFEYLMCVDGGKSSALVFKGEV 538 Score = 50.5 bits (119), Expect = 6e-05, Method: Composition-based stats. Identities = 23/99 (23%), Positives = 41/99 (41%), Gaps = 7/99 (7%) Query: 38 LSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDES 97 +S + + A ++P+ R ++ ANG L + + +NGG +D S Sbjct: 249 VSGRRIILTALELDPE--RFDIHPVLANGRIPSGESLL--SMAKRYDAFAVINGGYFDPS 304 Query: 98 -YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAG 135 + P+GL IE+G+ +L FF G + Sbjct: 305 SFYPIGLLIEDGKLISLPSLERP--LFFQTEDGKMGIGR 341 >UniRef50_A4FAL4 Putative uncharacterized protein n=2 Tax=Actinomycetales RepID=A4FAL4_SACEN Length = 1118 Score = 74.7 bits (182), Expect = 2e-12, Method: Composition-based stats. Identities = 15/92 (16%), Positives = 30/92 (32%), Gaps = 11/92 (11%) Query: 152 IQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHG-NAVFLLSQQA------TNF 204 + AV +L+ +GV+ + + R G + G + Sbjct: 301 PKAAVGGNKVLLRDGVVQ---QVDDTALHPRTAAGFSADGTRMWLVTIDGRQADSRGMTE 357 Query: 205 YDFACYAKAKLNVEQLLYLDGTISHMYMKGGA 236 + A + ++ L + L LDG S + Sbjct: 358 RELAEHLRS-LGADDALNLDGGGSSTLLAREE 388 Score = 40.5 bits (93), Expect = 0.053, Method: Composition-based stats. Identities = 15/95 (15%), Positives = 32/95 (33%), Gaps = 5/95 (5%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQ 84 + P + D L +V+ R++ + + L++ ++ Sbjct: 81 VAPGLELTEFDRFGPAGWLRGDMLSVDLTESRLEPTYLHPGAA---SARTPLSEQAARIG 137 Query: 85 VQMAMNGGIYDES--YAPLGLYIENGQQKVALNLA 117 +NG +D APLG+ + G+ A Sbjct: 138 AVAGVNGDFFDIDATGAPLGVGLSGGELLNAPGSG 172 >UniRef50_C0AEZ6 Putative uncharacterized protein n=1 Tax=Opitutaceae bacterium TAV2 RepID=C0AEZ6_9BACT Length = 421 Score = 74.7 bits (182), Expect = 2e-12, Method: Composition-based stats. Identities = 20/105 (19%), Positives = 38/105 (36%), Gaps = 6/105 (5%) Query: 132 YVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHG 191 + + L A +++++ AV +L+ G + + R+ VG+ G Sbjct: 284 AILDINWRLTDLPAGVHTRDVRDAVSGNVILIAAGRLQEGGGAFWTTRHPRSAVGVAADG 343 Query: 192 -NAVFLLSQQATNFY---DFACY--AKAKLNVEQLLYLDGTISHM 230 A+ +L + F D + A L + LDG S Sbjct: 344 RRALLVLVDGRSLFSAGMDLSALRDYLAHLGAHDAVNLDGGGSSA 388 >UniRef50_B2A2E0 Copper amine oxidase domain protein n=1 Tax=Natranaerobius thermophilus JW/NM-WN-LF RepID=B2A2E0_NATTJ Length = 514 Score = 74.7 bits (182), Expect = 3e-12, Method: Composition-based stats. Identities = 27/154 (17%), Positives = 46/154 (29%), Gaps = 10/154 (6%) Query: 104 YIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLM 163 ENG + G V + ++ V +GP L+ Sbjct: 207 IPENGFVVELGTQQAVSQLPENYSPGDKAKLKPSVSDEDREEPIEIEDFIHMVGAGPKLV 266 Query: 164 ENGVINPRIHPN------VASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNV 217 NG + + + R+ +G N + A N D A +L + Sbjct: 267 NNGREDVDLEKDQMTGERHTIKARRSFIGYN-DNEVIMGTVDGA-NHEDTAAIC-VELGL 323 Query: 218 EQLLYLDGTISH-MYMKGGAIPWQRYPFVTMISV 250 + + LDG S +Y +G I + V Sbjct: 324 TEAMALDGGASSGLYYEGDYITRPGREISNALVV 357 >UniRef50_C8VW07 S-layer domain protein n=1 Tax=Desulfotomaculum acetoxidans DSM 771 RepID=C8VW07_DESAS Length = 921 Score = 74.7 bits (182), Expect = 3e-12, Method: Composition-based stats. Identities = 24/148 (16%), Positives = 45/148 (30%), Gaps = 24/148 (16%) Query: 103 LYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSK------------ 150 + ++N G I G L+ S Sbjct: 211 VVVQNDLVVSVNQNKPG---VPIPANGYVLEGHGAAAQFLLENLPVSSRVQTSYLVTPQT 267 Query: 151 -EIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQ------ATN 203 ++ A+ +L+++G + P + + R VGI ++L + + T Sbjct: 268 GNLRAALGGNTLLVQDGQLAP-FTQEITGNYARTAVGIMPDNKTLYLAAAENGNGSVGTT 326 Query: 204 FYDFACYAKAKLNVEQLLYLDGTISHMY 231 A + A L V + + LDG S Sbjct: 327 QTGMAEFLLA-LGVNRAVNLDGGGSTTL 353 Score = 48.2 bits (113), Expect = 2e-04, Method: Composition-based stats. Identities = 16/112 (14%), Positives = 32/112 (28%), Gaps = 6/112 (5%) Query: 24 TLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQG 83 + P + + A V+ VK+ +L + G Sbjct: 42 IISPGVVLQ----TYMYNKTRIYAIKVDLSNPYVKIDTMIGADGTLNKAQSLTGMTSRTG 97 Query: 84 QVQMAMNGGIYDE-SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVA 134 +NGG + ++ P+GL NG + + F + + Sbjct: 98 -AVAGINGGFFQMKNHRPIGLEFSNGNLVSSPAMREDMPGFAVTNNNQAIIG 148 >UniRef50_A6NQQ4 Putative uncharacterized protein n=1 Tax=Bacteroides capillosus ATCC 29799 RepID=A6NQQ4_9BACE Length = 1060 Score = 74.4 bits (181), Expect = 4e-12, Method: Composition-based stats. Identities = 22/135 (16%), Positives = 39/135 (28%), Gaps = 12/135 (8%) Query: 102 GLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPM 161 + I G+ +++N + N + GD V I +E A+ Sbjct: 294 SIAIPEGKVVLSINNKA---NSYWLSNVKSLKPGDLVDIDVTTTDSIWQEADQAMGGLYK 350 Query: 162 LMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYA------KAKL 215 L+ G + + + VG+ G +F Y +L Sbjct: 351 LVTAGKVESGLPTGQTAY---TAVGVKADGTVIFYTIDGKQPGYSVGASLTQVAMRLVEL 407 Query: 216 NVEQLLYLDGTISHM 230 + LDG S Sbjct: 408 GCVDAISLDGGGSTT 422 Score = 40.5 bits (93), Expect = 0.049, Method: Composition-based stats. Identities = 16/125 (12%), Positives = 31/125 (24%), Gaps = 13/125 (10%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINS--- 81 L + L T + A G+ + L Sbjct: 46 LTHQIFWSDTYSDLRTERYFTY-------TPNKNVTPVVAYGDKVTSRETLTTMAQRLEG 98 Query: 82 -QGQVQMAMNGG-IYDESYAPLGLYIENGQQKVALNLAS-GEGNFFIRPGGVFYVAGDKV 138 ++ +NG + PLG I +G + + G R G ++ + Sbjct: 99 EGKRLVGGINGDLYVMATGEPLGTVITDGVLRSVPGTNNQGYYAIGFRSDGTAFIGKPDL 158 Query: 139 GIVRL 143 + Sbjct: 159 TVTAT 163 >UniRef50_Q7NGC8 Glr3243 protein n=1 Tax=Gloeobacter violaceus RepID=Q7NGC8_GLOVI Length = 540 Score = 74.0 bits (180), Expect = 4e-12, Method: Composition-based stats. Identities = 28/191 (14%), Positives = 59/191 (30%), Gaps = 19/191 (9%) Query: 72 LHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVF 131 + AL G G + A + I + ++ + + P G+ Sbjct: 335 VGALNGTAAGAGLSVFTPEWGSLYLARAGETVAIARADRIESVVEPMADQIVSLPPDGLL 394 Query: 132 YVAGDKVGIVRLDAFKTSKEIQF---------AVQSGPMLMENGVIN-----PRIHPNVA 177 VA + L A + + +GP+L++N + R P+V Sbjct: 395 LVARSEPLRTALRAVAAGTPVVLDVQPSGSGSLLGAGPLLVQNDKLVLDAQGERFRPDVR 454 Query: 178 SSK-IRNGVGINKHGNAVFLLSQQAT----NFYDFACYAKAKLNVEQLLYLDGTISHMYM 232 + R + + + ++ + +A + + L LDG S + Sbjct: 455 APGVARTAIARRGSLGILAVAARNGWAAGLSLESWANLLLQQFQADDALNLDGGGSSGFY 514 Query: 233 KGGAIPWQRYP 243 GG + + Sbjct: 515 LGGRLRDRPEG 525 Score = 41.2 bits (95), Expect = 0.031, Method: Composition-based stats. Identities = 17/140 (12%), Positives = 33/140 (23%), Gaps = 7/140 (5%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQ 84 + + +P R+ + + G A L+ + + Sbjct: 213 WGVGLVYREVRLVWEGLSQQLHLLEFDPARVRLGLLRPPSLG-----ALAPLSALGNSQG 267 Query: 85 VQMAMNGGIYDESYA-PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDK-VGIVR 142 A+NGG ++ + LG +GQ G +R Sbjct: 268 AWGAVNGGFFNRNTREALGALRSDGQWWTGAVAGLPPRGAASWQDGRLDFDRLNWSAAIR 327 Query: 143 LDAFKTSKEIQFAVQSGPML 162 +G L Sbjct: 328 TGDQTLPVGALNGTAAGAGL 347 >UniRef50_B8HP94 Polysaccharide deacetylase n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HP94_CYAP4 Length = 645 Score = 74.0 bits (180), Expect = 4e-12, Method: Composition-based stats. Identities = 28/189 (14%), Positives = 57/189 (30%), Gaps = 17/189 (8%) Query: 78 DINSQGQVQMAMNGGIYDESYAPLGLYI------ENGQQK----VALNLASGEGNFFIRP 127 +I ++ Q ++GG + + + I +NGQ +G I P Sbjct: 442 EILAKTQAVAGVDGGFFSLEFLDSNVMIGPVLSQKNGQFVPGNASENPRLNGRPLVLISP 501 Query: 128 GGVFYVA-GDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHP----NVASSKIR 182 GV ++ A + L++ P + +++ R Sbjct: 502 TGVRFIPFDASKHNSLEGIQAEDPGATDAFVAAAWLVKQNQPQPEQSFGNLFDFNAARHR 561 Query: 183 NGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMY-MKGGAIPWQR 241 GIN++G +S + + K + LD S +G ++ Sbjct: 562 AFWGINQNGQPTIGVSTEPVGSVELGEILY-KAGFRDAVMLDSGASTSLAYQGESLVGYT 620 Query: 242 YPFVTMISV 250 V + Sbjct: 621 PRPVPHVVA 629 >UniRef50_C6IV65 Putative uncharacterized protein n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6IV65_9BACL Length = 257 Score = 74.0 bits (180), Expect = 5e-12, Method: Composition-based stats. Identities = 41/266 (15%), Positives = 83/266 (31%), Gaps = 44/266 (16%) Query: 10 GMITLNLKRIFLALTLLPLF--------AVAADDCALSDPTLTVQAYTVNPQTERVKMYW 61 G+ ++ IFL + L+ + + + + + A V+P+ ++ Sbjct: 10 GIAMASVMAIFLVILLMIGWGSRYLLPRHYEYHETTAA-NGVKLHALVVDPERIELR--- 65 Query: 62 QKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEG 121 A + G MNGG + + A L L + N Q A G G Sbjct: 66 --AADQPLGRYR------------VYGMNGGFFY-NEAVLSLAVNNDQPVQGTAGAYGSG 110 Query: 122 NFFIR-PGGVFYVAGDK-----VGIVRLDAFKTSKEIQFAVQSGPML-MENGV------I 168 F + G G + + ++ Q G + + + + Sbjct: 111 WFNAKYARGTLVWDGATGSFSVQVVSAASELAVTDRTRYFAQGGVSMKLPDDAGWRAAAV 170 Query: 169 NPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKL---NVEQLLYLDG 225 PN +++R+G+ + +G +++ F K+ L + ++LDG Sbjct: 171 EAEHLPNPDENRLRSGLAYDANGQLWLIVTPTRCTAEAFRTAVKSALADGGLVDGIFLDG 230 Query: 226 TISHMYMKGGAIPW-QRYPFVTMISV 250 S MI + Sbjct: 231 DGSAQLNAAETKLNGDSRDLRQMIVI 256 >UniRef50_D1VTW3 Copper amine oxidase N-domain superfamily n=1 Tax=Peptoniphilus lacrimalis 315-B RepID=D1VTW3_9FIRM Length = 765 Score = 74.0 bits (180), Expect = 5e-12, Method: Composition-based stats. Identities = 22/139 (15%), Positives = 45/139 (32%), Gaps = 24/139 (17%) Query: 135 GDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINP--RIHPNVASSKIRNGVGINKHGN 192 GDK+ I + K + + +L++N I P + ++ ++ R +GI G Sbjct: 246 GDKLKITYDIYPQ--KSWKMLIGGHSLLVDNSKIRPYKKDINSIGGTRARTCIGIADGGK 303 Query: 193 -AVFLLSQQA------TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGG-------AIP 238 + + + + + + L ++ L LDG S + + Sbjct: 304 SVYIVSCEGRTKRSSGMSLNELSNFMVN-LGCQRALNLDGGGSTAMVVRNLGDLNRTRVI 362 Query: 239 WQ-----RYPFVTMISVER 252 V I V Sbjct: 363 NPEGNKAERKVVNGIGVFN 381 Score = 45.1 bits (105), Expect = 0.002, Method: Composition-based stats. Identities = 21/131 (16%), Positives = 40/131 (30%), Gaps = 4/131 (3%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQ 84 + P S + + +K+ NG T A ++ + ++ Sbjct: 32 IAPGVVNFQYTVKTSKGKAILNVLKCDLNNPYIKLNTVAGNGSY--TEKASVSQMANRTN 89 Query: 85 VQMAMNGGIYDES--YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVR 142 NG + + PLG I +G K + + + +F I Y+ G Sbjct: 90 AVGLCNGDFFSMALQGVPLGPSIIDGDIKSSPGVLTDIYSFGIDSTNTAYILDTNFGGKV 149 Query: 143 LDAFKTSKEIQ 153 T+ I Sbjct: 150 TAPNGTTYNID 160 >UniRef50_C6J2I2 Copper amine oxidase domain-containing protein n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6J2I2_9BACL Length = 930 Score = 73.2 bits (178), Expect = 8e-12, Method: Composition-based stats. Identities = 20/141 (14%), Positives = 48/141 (34%), Gaps = 12/141 (8%) Query: 102 GLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPM 161 G +G A A+ ++ G A ++ + +++ + + Sbjct: 284 GPVPADGYILRAHGTAADYVASHLQV-GQRVDAIYQLQSLTDGQLVDPADLKVMIGGHTL 342 Query: 162 LMENGVINP----RIHPNVASSKIRNGVGINKHGNAVFLLSQQ------ATNFYDFACYA 211 L++ G + + S+ R VG +K G ++++ + + + Sbjct: 343 LVDQGKASAFTRSTTSISGGSAVARTAVGYSKDGKTAYIITAEKNSNSTGLTLKELQGFM 402 Query: 212 KAKLNVEQLLYLDGTISHMYM 232 + V + L LDG S + Sbjct: 403 -TGIGVWKGLNLDGGGSTTMV 422 >UniRef50_UPI00017890C7 copper amine oxidase domain protein n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI00017890C7 Length = 900 Score = 72.0 bits (175), Expect = 2e-11, Method: Composition-based stats. Identities = 21/138 (15%), Positives = 46/138 (33%), Gaps = 9/138 (6%) Query: 105 IENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLME 164 + G + + A+ E G A + + +Q + +L++ Sbjct: 260 VPEGSYILRTHGAAAEFVKNNLAVGQRLEADYALVSKKTGQKLDPTNLQMMIGGHTILVD 319 Query: 165 NGVINPRIHP--NVASSKIRNGVGINKHGNAVFLLSQQ------ATNFYDFACYAKAKLN 216 NG ++ ++ R VG +K G +L++ + + K+ Sbjct: 320 NGKATSFSRNVNDLGGNRARTAVGYSKDGRYAYLIATESNDNSKGMTLQQLQDFM-TKVG 378 Query: 217 VEQLLYLDGTISHMYMKG 234 V + + LDG S + Sbjct: 379 VWKGMNLDGGGSTTMVNR 396 Score = 53.9 bits (128), Expect = 5e-06, Method: Composition-based stats. Identities = 16/115 (13%), Positives = 34/115 (29%), Gaps = 3/115 (2%) Query: 23 LTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQ 82 + + + V+ + VK+ G + T + + Sbjct: 64 IITSGAVLMKYQYINSAGAKSLANVIRVDLNNKYVKLDVMTGQGNQFTT-RQSTGGMAKE 122 Query: 83 GQVQMAMNGGIYDES--YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAG 135 A+NG ++ AP+G + NG + + G F + G + Sbjct: 123 NGAVAAINGDFFNTGREGAPMGAQVSNGLMMSSPSDLKGMYAFAVTNDGKPILDE 177 >UniRef50_C9PT69 Putative uncharacterized protein n=1 Tax=Prevotella sp. oral taxon 472 str. F0295 RepID=C9PT69_9BACT Length = 814 Score = 71.7 bits (174), Expect = 3e-11, Method: Composition-based stats. Identities = 19/189 (10%), Positives = 45/189 (23%), Gaps = 38/189 (20%) Query: 76 LADINSQGQVQMAM--NGGIYDESYAPLGLYIE--NGQQKVALNLASGEGNFFIRPGGVF 131 + + Q+ + NG + + +GQ + ++ G Sbjct: 174 VNHLRGDNQLVLYNQHNGQYTHTDAKGTEVLVTLIDGQTWELNKEVRAKVVSVVQNKGNM 233 Query: 132 YVAGDKVGIVRLDAFKTS----------------------KEIQFAVQSGPM--LMENGV 167 ++ + + P ++++G Sbjct: 234 HIPAGSAVLSANGEIAKKLAALTAGTEVKIALNLSIGGATAPFTDVIGGDPRSPMLQDGT 293 Query: 168 INPRIHPNVASSKIRNGVGINKHGNA-VFLLSQQATNF------YDFACYAKAKLNVEQL 220 +N N R G G + + + + + A K + Sbjct: 294 VNTTEIWN--ELHPRTGFGYTQDKKTAIHCVVDGRSTISAGANTKELAEIMKF-VGAYNA 350 Query: 221 LYLDGTISH 229 + LDG S Sbjct: 351 MNLDGGGSS 359 >UniRef50_A9GRW8 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GRW8_SORC5 Length = 387 Score = 71.3 bits (173), Expect = 3e-11, Method: Composition-based stats. Identities = 27/216 (12%), Positives = 61/216 (28%), Gaps = 30/216 (13%) Query: 26 LPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQ-------KANGEAWGTLHALLAD 78 P + + V+ V + G+ + L + Sbjct: 126 APRLHKTLLHPDPNRSWAELFIVAVDLARVDVHLMAGSREPAATTEEGKPYERLAKIPE- 184 Query: 79 INSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKV 138 ++ A NGG E G+ + +G V G +A Sbjct: 185 -ADHERLLAAFNGGFMTEHGQ-WGMRV-DGVTLV--RPRDQGCTLARHRDGRLQIAP--- 236 Query: 139 GIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHP----------NVASSKIRNGVGIN 188 ++Q+ Q+ +++ G ++P + + + R+ VG++ Sbjct: 237 ---WTRLSAGESDMQWWRQTPSCMVDEGELHPLLRAPQVRNWGATLDGNTVIRRSAVGLD 293 Query: 189 KHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLD 224 + G +++ T A + + LD Sbjct: 294 RDGKVLYVGISNHTTAPAIALGMQHA-GASAVAQLD 328 >UniRef50_D2PRV8 Metallophosphoesterase n=1 Tax=Kribbella flavida DSM 17836 RepID=D2PRV8_9ACTO Length = 1163 Score = 70.9 bits (172), Expect = 3e-11, Method: Composition-based stats. Identities = 21/149 (14%), Positives = 41/149 (27%), Gaps = 20/149 (13%) Query: 103 LYIENGQQKVALNL-------ASGEGNFFIRPGGVFY--VAGDKVGIVRLDAFKTSKEIQ 153 + + +G + A+ + G V+ + + Sbjct: 259 VILRDGVVVSSSATPATTPLAANEQALIGREAGATALGAFEPGDKVEVKYGPRADAADAA 318 Query: 154 FAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGN-AVFLLSQQAT------NFYD 206 A+ L +G I + + + R +G + G V L + Sbjct: 319 VALSGNKQLARDGQI---LTVDDTALHPRTSIGFSADGRRMVLLTVDGRMVDSRGLTEKE 375 Query: 207 FACYAKAKLNVEQLLYLDGTISHMYMKGG 235 A L + +L LDG S +K Sbjct: 376 LARLMLD-LGSDDVLNLDGGGSSTMLKRS 403 >UniRef50_A8F5X1 Putative uncharacterized protein n=1 Tax=Thermotoga lettingae TMO RepID=A8F5X1_THELT Length = 550 Score = 69.7 bits (169), Expect = 9e-11, Method: Composition-based stats. Identities = 19/147 (12%), Positives = 44/147 (29%), Gaps = 24/147 (16%) Query: 110 QKVALNLASGEGNFFIRPGGVFYVAGD------KVGIVRLDAFKTSKE----IQFAVQSG 159 +G + + G V ++ + I + + I+ A+++G Sbjct: 377 VVDGKVNGTGWLSKAPKNGFVLAISSKYKKYLEGIQIGDTVEYIVNTNFPYPIKHAIEAG 436 Query: 160 PMLMENGVINPRIHPN--------VASSKIRNGVGINKHGNAVFLLS-----QQATNFYD 206 P+++ G P + +S R G V + N+ + Sbjct: 437 PLILYEGSPIPDRNDEKNRYGGSIARASATRTLAATLPDGKVVLAVINDQDGSGGVNYDE 496 Query: 207 FACYAKAKLNVEQLLYLDGTISHMYMK 233 ++ K + DG S + + Sbjct: 497 LVEFSLKK-GFYSAMNFDGGSSSIMVI 522 Score = 48.2 bits (113), Expect = 3e-04, Method: Composition-based stats. Identities = 20/186 (10%), Positives = 47/186 (25%), Gaps = 32/186 (17%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQ 84 + + + + V ++PQ +K + ++ + + Sbjct: 223 IAKGVYWEQKVEKIGNKDMLVNYLWIDPQFVDLKPEISSGGIGSLESVEK----MVIRKN 278 Query: 85 VQMAMNGGIYDES-YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRL 143 +N +D + P+GL I +G+ VF + Sbjct: 279 AVAGVNANYFDTNTGLPIGLLIVDGKILSMPYGDRP----------VFIQTFSNEVYISR 328 Query: 144 DAFKTSKEIQFAVQSGPMLME------NGVINPRIHPNVASSKIRNGVGINKHGNAVFLL 197 F + ++ + L++ G + S R F + Sbjct: 329 IYFDVNIKVGQLL----FLVKGINTIAQGEVLIFTPEFGLSIPYR-------DEMVYFSV 377 Query: 198 SQQATN 203 N Sbjct: 378 VDGKVN 383 >UniRef50_C7PW43 Ig domain protein group 2 domain protein n=2 Tax=Catenulispora acidiphila DSM 44928 RepID=C7PW43_CATAD Length = 1174 Score = 69.4 bits (168), Expect = 1e-10, Method: Composition-based stats. Identities = 26/201 (12%), Positives = 62/201 (30%), Gaps = 14/201 (6%) Query: 18 RIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLA 77 + ++TL +D Q V+ V++ +++ + + Sbjct: 69 AVSPSVTLTHGVQQHSDVLRTVGGAQRAQVMDVDLADPNVRLGVVESHDHLTDAADEVPS 128 Query: 78 DINSQGQVQMAMNGGIY--DESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAG 135 + + +NG + S PLG+ + +G+ + + + G + Sbjct: 129 SMAHRTGAVGGVNGDFFEIYGSGRPLGMVVIDGRLVKSPDPTWNADLWVRHDGSIGIGTE 188 Query: 136 DKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKI---RNGVG--INKH 190 G + A + AV S +G R+ P++ + V + Sbjct: 189 TYAGSLTDGAATAAITAVNAVNS-----LSGNAIVRVTPDLGTPSPIAASTVVAGHLGAD 243 Query: 191 GNAVFL--LSQQATNFYDFAC 209 G + + ++ T A Sbjct: 244 GTTLLVDSVTAGVTTLPQLAA 264 Score = 58.6 bits (140), Expect = 2e-07, Method: Composition-based stats. Identities = 14/107 (13%), Positives = 30/107 (28%), Gaps = 12/107 (11%) Query: 137 KVGIVRLDAFKTSKEIQFAVQSGPMLMENGV--INPRIHPNVASSKIRNGVGINKHG-NA 193 I + ++ + G +L++NG + + VG+++ G +A Sbjct: 286 GDQIAVSEKIGPDPDVVQGLSGGAILVQNGQRAVPLQGSGENNVDNPVTAVGVSQDGKHA 345 Query: 194 VFLLSQQ--------ATNFYDFACYAKAKLNVEQLLYLDGTISHMYM 232 VF A + + D S + Sbjct: 346 VFAAFDGHQSEDVAQGLTRPQIAGWMTQH-GAYNAILFDSGGSTQMV 391 >UniRef50_C4DE18 Putative uncharacterized protein n=1 Tax=Stackebrandtia nassauensis DSM 44728 RepID=C4DE18_9ACTO Length = 393 Score = 69.0 bits (167), Expect = 2e-10, Method: Composition-based stats. Identities = 28/180 (15%), Positives = 54/180 (30%), Gaps = 34/180 (18%) Query: 105 IENGQQKVALNLASGEGNFFIRPGGVFYVA-----------GDKVGIVRLDAFKTSKEIQ 153 I +G+ ++ A GEG + GD+V + A + ++ Sbjct: 217 IVDGKVTR-VSDAPGEGQIAKDATVLLARDKGVEHLNGLSEGDEVNVDYQLASEDGADLD 275 Query: 154 FAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHG-NAVFLLSQQAT------NFYD 206 A+ G +L+++G + ++ + A+ R VG N G + + Sbjct: 276 TAL-GGLILIDDGTML-DLNDDAATLAPRTAVGSNADGSKLYMVAVDGRSSTSVGATVKS 333 Query: 207 FACYAKAKLNV-EQLLYLDGTISHMYMKGG-------AIPWQRYP----FVTMISVERKG 254 A L ++ LDG S + + V K Sbjct: 334 MADIMVN-LGADHNVINLDGGGSTTLVARKAGNTATSVRNTPSDGSQRKVANGLGVFTKA 392 Score = 45.5 bits (106), Expect = 0.002, Method: Composition-based stats. Identities = 25/185 (13%), Positives = 50/185 (27%), Gaps = 27/185 (14%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQ 84 + P + TV+ E V++ A + + + + + Sbjct: 31 VAPGVTYEKKTISTPHGKSIGHILTVDLTREDVEVGLLTPGPVAATGV---VTSLADRVK 87 Query: 85 VQMAMNGGIYDES--YAPLGLYIENGQQKVALNLASGE------------GNFFIRPGGV 130 +NG ++ A +G I +G+ + F I G Sbjct: 88 AVAGVNGDFFNIGQTGAAVGPEILDGKDRKGPVPGKQRHGPTPPPGTDNDSVFGITKDGK 147 Query: 131 FYVAG---DKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGI 187 + D +F+ Q+A+ +GV S K R G Sbjct: 148 TVIDRLNVDGKATTAAGSFELKGLNQYAIT------VDGVGVFNADWGKTSRK-RATCGT 200 Query: 188 NKHGN 192 + + Sbjct: 201 DDDRD 205 >UniRef50_Q7NIQ9 Gll2123 protein n=1 Tax=Gloeobacter violaceus RepID=Q7NIQ9_GLOVI Length = 518 Score = 68.2 bits (165), Expect = 3e-10, Method: Composition-based stats. Identities = 32/218 (14%), Positives = 65/218 (29%), Gaps = 24/218 (11%) Query: 53 QTERVKMYWQKANGEAWG---TLHALLADINSQGQVQMAMNGGIYDESYA------PLGL 103 RV++ W + D + + +NGG + + +G Sbjct: 288 THRRVRLVWLSGGNYTTRHSEGNRYPVGDFIERERAVGGINGGFFAFAGLRATNSDMVGP 347 Query: 104 YIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQ------ 157 Y+ + + + + RP + G + + F T E + + Sbjct: 348 YLSQNEGRFMPGAPEFDKSLRGRPVVLISATGLRFVPYSPETFDTEAEARAYLSDLSDLF 407 Query: 158 -SGPMLMENGVINPRIH------PNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACY 210 +G L+ NG N + + R +GI+K G + + N A Sbjct: 408 VAGVWLVNNGQALTTEQIEQFRLSNHSEFRRRTFMGIDKAGLPMVGATLTNVNATQLARA 467 Query: 211 AKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMI 248 + + + + LD S + G + I Sbjct: 468 L-EEAGLREAVLLDSGFSTSLVHQGKVLVT-GHTAPSI 503 >UniRef50_A3P9C8 Putative lipoprotein n=32 Tax=pseudomallei group RepID=A3P9C8_BURP0 Length = 563 Score = 67.0 bits (162), Expect = 5e-10, Method: Composition-based stats. Identities = 25/163 (15%), Positives = 39/163 (23%), Gaps = 35/163 (21%) Query: 107 NGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPML---- 162 NG L ++ PG V+ A + + A GP L Sbjct: 384 NGYVLQGLGASAAWLQAHATPGTRLAVSRRLSADGADLALASGTSLVEA---GPTLSVPN 440 Query: 163 MENGVINPRIHPNVAS--------------------SKIRNGVGINKHGNAVFLLSQQAT 202 + P V R G+ G + + Sbjct: 441 LAQSAAQEGFAPTVGGVDAGEGAAANGNWYNGWYVARNGRTAAGVAADGTILLVEIDGRQ 500 Query: 203 -------NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIP 238 + + A A L + LDG S + GG + Sbjct: 501 PALSVGTSIPETAAVM-AWLGATSAVNLDGGGSSNMVVGGKMV 542 Score = 46.2 bits (108), Expect = 9e-04, Method: Composition-based stats. Identities = 14/112 (12%), Positives = 30/112 (26%), Gaps = 9/112 (8%) Query: 27 PLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQ 86 P + A + V ++P + G G ++ ++ Sbjct: 160 PGTRHTSLAGAPTTGPWIVNVLAIDPSRAGAALSLALPGGNDLGAGGETVSAARARVNAL 219 Query: 87 MAMNGGIYD---------ESYAPLGLYIENGQQKVALNLASGEGNFFIRPGG 129 +NGG + +P+G + +G+ A G Sbjct: 220 AGVNGGFFTNINPFGAPLPPRSPVGATVVDGRLVAAAIGRRPGLLLARDANG 271 >UniRef50_A6WEB7 Putative uncharacterized protein n=1 Tax=Kineococcus radiotolerans SRS30216 RepID=A6WEB7_KINRD Length = 986 Score = 66.7 bits (161), Expect = 6e-10, Method: Composition-based stats. Identities = 30/171 (17%), Positives = 50/171 (29%), Gaps = 16/171 (9%) Query: 71 TLHALLADINSQGQVQMAMNGGIYDESYAPLGL-YIENGQQKVALNLASGEGNFFIRPGG 129 AL + G V++ + G AP + +G VA + P G Sbjct: 192 GDRALTVADPAAGAVELEVRAGRVSAVRAPGAVPVPADGYVLVATGSRARA--LSATPVG 249 Query: 130 VFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSK--IRNGVGI 187 +V L FA+ + L+ +G I P + + R +G Sbjct: 250 AAAGTDLRVRDDALSPGSRG----FALGARLELVRDGAIAPIDVADPTWAALRARTALGW 305 Query: 188 NKHGNAVFLLSQQAT------NFYDFACYAKAKLNVEQLLYLDGTISHMYM 232 G+ + L T + A + LDG S + Sbjct: 306 TATGDLLLLTVDGGTSRSRGLTAVETAQRMVEA-GARGAVMLDGGGSAQLV 355 >UniRef50_A6G841 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G841_9DELT Length = 507 Score = 66.7 bits (161), Expect = 7e-10, Method: Composition-based stats. Identities = 22/160 (13%), Positives = 40/160 (25%), Gaps = 30/160 (18%) Query: 111 KVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKT-----SKEIQFAVQSGPMLMEN 165 V + + + G + V + A++ + + + GPML+E Sbjct: 320 IVGRRVVAVGRGLPVPLNGFVVPTPETVEVGAEVAYEPLRGSGGRPLVAGIAGGPMLLEG 379 Query: 166 GV--------------INPRIHPNVASSK---IRNGVGINKHGNAVFLLSQQA------- 201 G + + R VG++ VF+ Sbjct: 380 GALTLDLRREDFWGSAPPVTFSQDETGDQNLLPRLAVGLDHAQRLVFVAVDGRDFGRALG 439 Query: 202 TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQR 241 +A L LDG S + G Sbjct: 440 MTLGGVGEVLQA-LGCHTATNLDGGASKRMVLRGRALDLS 478 Score = 41.2 bits (95), Expect = 0.031, Method: Composition-based stats. Identities = 26/175 (14%), Positives = 50/175 (28%), Gaps = 35/175 (20%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQ 84 + P A A ++ + + V+PQ R+ + ++ A Q Sbjct: 166 VAPGLEHARVAQACAEGPVHLNVLRVDPQRVRLAVDDRREGVRAGQPFTEWT----RQRG 221 Query: 85 VQMAMNGGIY----------DESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVA 134 A++GG + Y P+GL + G+ A G + A Sbjct: 222 ATAAVSGGFFLYSEPDIEAPSARYDPVGLLLGEGRCLSPPVFARGA---------LLLDA 272 Query: 135 GDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINK 189 V I L + + + +G + R +G + Sbjct: 273 EGGVAIEPLG-----------LGGTHLRLADGRPLDAAEAARWNRS-RARIGPDA 315 >UniRef50_UPI00016BFF19 Ig-like, group 2 n=1 Tax=Epulopiscium sp. 'N.t. morphotype B' RepID=UPI00016BFF19 Length = 935 Score = 66.7 bits (161), Expect = 8e-10, Method: Composition-based stats. Identities = 8/60 (13%), Positives = 15/60 (25%), Gaps = 7/60 (11%) Query: 179 SKIRNGVGINKH-GNAVFLLSQQAT-----NFYDFACYAKAKLNVEQLLYLDGTISHMYM 232 R + G + + + L + +YLDG S + Sbjct: 303 RHPRTLIATTDKFGELLLITIDGRQYSAGATHDEVIQILLD-LGAKDAMYLDGGGSTTMV 361 >UniRef50_B0JW05 Polysaccharide deacetylase family protein n=4 Tax=Chroococcales RepID=B0JW05_MICAN Length = 616 Score = 64.7 bits (156), Expect = 2e-09, Method: Composition-based stats. Identities = 30/220 (13%), Positives = 62/220 (28%), Gaps = 19/220 (8%) Query: 49 TVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYD----ESYAPLGLY 104 V + + + + +I + + A++GG + +S +G Sbjct: 368 EVEFSSNTLVLISGGIPKTIHADSRYQVEEIIAGTEAIAAVDGGFFSLKELDSNQMIGPV 427 Query: 105 IE-NGQQKVALNLA----SGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSK----EIQFA 155 + NG G I V Y+ D L+ ++ A Sbjct: 428 LSENGGFIPGYEGEIDKLEGRPLVIITDRWVRYLPFDPARHNTLEGIAAEAGDDLKVTDA 487 Query: 156 VQSGPMLMENGVINPRIHPN----VASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYA 211 + L+++G + + R GIN+ G V +S+ + Sbjct: 488 FVAAAWLVKDGQPQSLESFGTLYGFDALRHRAFWGINQAGQPVIGVSRDPIDSMALGELL 547 Query: 212 KAKLNVEQLLYLDGTISHMY-MKGGAIPWQRYPFVTMISV 250 + + LD S +G + V + Sbjct: 548 VQA-GFREAVMLDSGASTSLAYQGQSQVHYTPRPVPHVVA 586 >UniRef50_Q72HQ9 Putative uncharacterized protein n=4 Tax=Thermaceae RepID=Q72HQ9_THET2 Length = 487 Score = 64.7 bits (156), Expect = 3e-09, Method: Composition-based stats. Identities = 20/111 (18%), Positives = 38/111 (34%), Gaps = 12/111 (10%) Query: 149 SKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVG--------INKHGNAVFLLSQQ 200 + ++A++ GP+L++ G R K G L+S+ Sbjct: 374 NPPFRYALEGGPLLLKEGR-YAYDPAKENFKDPRPLQAVAPQAAVAWTKEGKLWLLVSE- 431 Query: 201 ATNFYDFACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRYPFVTMISV 250 T A + L L +DG S +++KG + ++S Sbjct: 432 PTTPGALARALLS-LGAWNALRMDGGGSAQLWVKGVLRSPYQGSPRPVVSA 481 >UniRef50_UPI0001C1628D hypothetical protein CRC_02750 n=2 Tax=Nostocaceae RepID=UPI0001C1628D Length = 633 Score = 64.7 bits (156), Expect = 3e-09, Method: Composition-based stats. Identities = 34/244 (13%), Positives = 65/244 (26%), Gaps = 25/244 (10%) Query: 29 FAVAADDCALSDPTLTVQA----YTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQ 84 + ++ V Y VN + + G L D+ Q Sbjct: 376 WGGYPAPVDPTNFNFNVDIEKREYKVN--NTELILIGGGIPGTFHADSRYQLPDMLKDTQ 433 Query: 85 VQMAMNGGIYD----ESYAPLGLYIENGQQKVALNLAS-----GEGNFFIRPGGVFYVA- 134 V A++GG + +S +G + + + N + I P V ++ Sbjct: 434 VVAAVDGGFFSLKYLDSNTMIGPVLSGNRGFIPGNASENLKLRDRPLVLINPHSVSFIPF 493 Query: 135 ---GDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNV----ASSKIRNGVGI 187 +K + L+ N +++ R GI Sbjct: 494 VPETHNTLEGIQATSPENKGVTDTFVGAAWLVRNNTPRTAADFGNLYDYDAARHRAFWGI 553 Query: 188 NKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMY-MKGGAIPWQRYPFVT 246 N G V +++ +L + LD S +G ++ V Sbjct: 554 NLAGMPVIGVTKTPVGSVSLGEILY-QLGFRDAVMLDSGASTSLSYRGKSLVAYTPRPVP 612 Query: 247 MISV 250 V Sbjct: 613 HAVV 616 >UniRef50_Q826N8 Putative secreted protein n=1 Tax=Streptomyces avermitilis RepID=Q826N8_STRAW Length = 409 Score = 64.3 bits (155), Expect = 3e-09, Method: Composition-based stats. Identities = 31/180 (17%), Positives = 51/180 (28%), Gaps = 33/180 (18%) Query: 103 LYIENGQQKV------ALNLASGEGNFFIRPGGV-----FYVAGDKVGIVRLDAFKTSKE 151 + + +GQ + +A+G R G + RL A + Sbjct: 233 VTVRDGQVVSFADSPGSGPIAAGTTVLVGREAGAQQLRKLSTGEEVSVEHRLVAAGSQVA 292 Query: 152 IQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGN-AVFLLSQQ------ATNF 204 FA+ P+L G P + + +S +R VGI G + L Sbjct: 293 YSFAIGGYPVL-RQGKPLPGL--DTVTSAVRTAVGIKDAGRRLLLLAIDGAAAYRSGLTI 349 Query: 205 YDFACYAKAKLNVEQLLYLDGTISHMYMKGG-------AIPWQR----YPFVTMISVERK 253 + A + L + LDG S + P I V + Sbjct: 350 AEVASVMR-GLGATEAFSLDGGGSTTLVARAPGATSVTVRNHPSGGAERPVPNGIGVFTR 408 Score = 47.4 bits (111), Expect = 4e-04, Method: Composition-based stats. Identities = 25/187 (13%), Positives = 49/187 (26%), Gaps = 21/187 (11%) Query: 22 ALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINS 81 ++ L P D +V+ RV++ A A ++ + Sbjct: 38 SVPLAPGVEYTQFDIPADRGVTHAHLLSVDLADPRVRVGLLHPGAVA---ARAPVSRLAD 94 Query: 82 QGQVQMAMNGGIYD----------ESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVF 131 +NG ++ + + G I +G A + PG Sbjct: 95 SQGALAGINGDFFNITETQHPGVEATGSTDGPAIADGHTLKAAVPNGQRFGPALPPGTTT 154 Query: 132 --YVAGDKVGIVRLDAFKTSKEIQFAVQSGPM------LMENGVINPRIHPNVASSKIRN 183 + RLD + A P+ + G + + S+ R Sbjct: 155 EDVLGVGDDRRARLDRLALEGSVDTAAGELPLRGLNQYALPEGSVGAYTADWGSVSRARA 214 Query: 184 GVGINKH 190 G + Sbjct: 215 VCGTDTD 221 >UniRef50_Q2JPV6 Polysaccharide deacetylase family protein n=2 Tax=Synechococcus RepID=Q2JPV6_SYNJB Length = 723 Score = 64.3 bits (155), Expect = 3e-09, Method: Composition-based stats. Identities = 29/228 (12%), Positives = 63/228 (27%), Gaps = 28/228 (12%) Query: 52 PQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYD------ESYAPLGLYI 105 RV ++ + + Q +NG + S +G + Sbjct: 314 INQVRVLTLRGGRAATVHAERRYEVSTLAQRYQADAGINGSFFSIPWINSASNVMVGPAM 373 Query: 106 E-NGQQKVALNLASGEGNFFIR-----PGGVFYVAGDKVGIVRLDAF-KTSKEIQFAVQS 158 N + + + + +V D + L+ + ++ + Sbjct: 374 AANHKTFIPGRPEDDQAIRGRPLVLLGRDRLRFVPFDPDTMTHLENIRQLMPDVTDLFVA 433 Query: 159 GPMLMENG------VINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAK 212 G L+++G IN + A + R G++ V +++ N A Sbjct: 434 GLWLVKDGHALSPAEINSFRLASAAEFRPRAFFGVDDQERVVIGVTKTHVNAAILASLLP 493 Query: 213 AKLNVEQLLYLDGTISHMYMKGGAI--------PWQRYPFVTMISVER 252 + + + LD S + G I P I + Sbjct: 494 KT-GIREAVLLDSGFSTSLVYQGEILATGHAGPNQPSRPVPHAILLYD 540 >UniRef50_C1XUX9 Putative uncharacterized protein n=1 Tax=Meiothermus silvanus DSM 9946 RepID=C1XUX9_9DEIN Length = 497 Score = 63.2 bits (152), Expect = 7e-09, Method: Composition-based stats. Identities = 18/115 (15%), Positives = 41/115 (35%), Gaps = 9/115 (7%) Query: 134 AGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVI--NPRIHPNVASS-----KIRNGVG 186 + G + + +A+++GP+L+++G NP + P ++ V Sbjct: 365 PPVRTGEILKLYGSLEPPLAYALEAGPLLIQSGAYAFNPNLEPFTDPRPLNATAPQSAVA 424 Query: 187 INKHGNAVFLLSQQATNFYDFACYAKA-KLNVEQLLYLDGTISHMYMKGGAIPWQ 240 + G ++S T A + N+ + +D S G++ Sbjct: 425 WTQDGRLWLVVSD-PTTPSTLARALQLYNPNIWGAIRMDAGGSAQLYVRGSLRTP 478 Score = 41.2 bits (95), Expect = 0.030, Method: Composition-based stats. Identities = 23/124 (18%), Positives = 38/124 (30%), Gaps = 17/124 (13%) Query: 24 TLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQG 83 + P F + L + +P R++ G+ L A L + Sbjct: 192 VIAPGFRYREVW-TFTPEPLRLYLVEADPGRWRME-----PVGQP--GLRAYLPSLAPT- 242 Query: 84 QVQMAMNGGIYDE-SYAPLGLYIENGQQKV------ALNLASGEGNFFIRPGGVFYVAGD 136 +NGG +D S P+GL+I++G AL G V Sbjct: 243 -ALAILNGGYFDPKSGTPIGLWIKDGVALNFPFGRSALMWEQNRVFAGFPKFGTVIVTQS 301 Query: 137 KVGI 140 + Sbjct: 302 GQRL 305 >UniRef50_A9EQ62 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9EQ62_SORC5 Length = 397 Score = 61.6 bits (148), Expect = 2e-08, Method: Composition-based stats. Identities = 28/229 (12%), Positives = 63/229 (27%), Gaps = 24/229 (10%) Query: 8 GKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGE 67 G G+ T L+ + + ++ +++ Sbjct: 106 GDGVWTPIGDSAARPGEAQVLWK-SVVHPDPKRVFAAIAVVAIDLGRVDLRLVAGTKEPF 164 Query: 68 AWGTLHALLADINSQGQV---QMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFF 124 + + G A NGG G+ ++ + A + Sbjct: 165 SPDIPAERRPGLVPGGHAAELVAAFNGGFKAMHGH-YGMMLDGDTFLPPRDRACTIALYR 223 Query: 125 IRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKI--- 181 + + G R+ A++ + P L+E G ++ ++ + Sbjct: 224 SGAVRIRTWPELRDGEARMAAYRQTP---------PCLVEQGELHHALYDSNRDWGATVS 274 Query: 182 ------RNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLD 224 R+ +G++ G +F +A A KA + LD Sbjct: 275 GETVIRRSALGVDATGKLLFYGLGEAVTARSLARGMKAA-GAHDVAELD 322 >UniRef50_C7LY43 Putative uncharacterized protein n=1 Tax=Acidimicrobium ferrooxidans DSM 10331 RepID=C7LY43_ACIFD Length = 397 Score = 60.9 bits (146), Expect = 4e-08, Method: Composition-based stats. Identities = 35/172 (20%), Positives = 59/172 (34%), Gaps = 24/172 (13%) Query: 63 KANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGN 122 G W + + + + + A N G + G Y G+ V L +G + Sbjct: 182 PPGGGPWPYMAPITNPVAA--DLVAAFNSGFRMQDAN--GGYYAYGRTAVPL--RNGAAS 235 Query: 123 FFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHP-------- 174 F I GV + K I Q+ L+ NG INP ++ Sbjct: 236 FVISTSGVPTI------ETWTHGNHVPKGIAVVRQNLIPLISNGRINPLVNSTNFAIWGA 289 Query: 175 --NVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLD 224 R+GVGI ++G V+ ++ + A A++ + LD Sbjct: 290 TVGNQLLVWRSGVGITRNGALVY-VTGPGLSVASLARLL-ARVGAVNAMELD 339 >UniRef50_B8CD22 Predicted protein n=1 Tax=Thalassiosira pseudonana RepID=B8CD22_THAPS Length = 572 Score = 59.3 bits (142), Expect = 1e-07, Method: Composition-based stats. Identities = 15/119 (12%), Positives = 29/119 (24%), Gaps = 32/119 (26%) Query: 146 FKTSKEIQFAVQSGPMLMENGV----------------INPRIHPNVA---SSKIRNGVG 186 + + AV GP+ ++ + + R G+G Sbjct: 404 YTLPTPLDNAVAGGPIFFDDNNDEQTMDLPSEDFKGSAPPVTFSQDETFDRNLLPRMGIG 463 Query: 187 INKH-----GNAVFLLSQQAT-------NFYDFACYAKAKLNVEQLLYLDGTISHMYMK 233 I + V + + K L + + LDG S + Sbjct: 464 ITNNDSSGEKELVCVAVDGRNLDRALGLTLQGTSDLLK-TLGCVKAMNLDGGSSKRMVI 521 >UniRef50_C0DAA9 Putative uncharacterized protein n=1 Tax=Clostridium asparagiforme DSM 15981 RepID=C0DAA9_9CLOT Length = 798 Score = 59.0 bits (141), Expect = 1e-07, Method: Composition-based stats. Identities = 24/157 (15%), Positives = 41/157 (26%), Gaps = 36/157 (22%) Query: 123 FFIRPGGVFYVAGDKVGIVRLDAF----KTSKEIQFAVQSGPMLMENGVINPRIHPNVA- 177 F G + G + I + + +A G L+ +G ++ Sbjct: 620 FEAAGDGYYRWDGPEAVICLEPPEGIGAEDWDSVCWAFGGGMSLISDGESLFEQETGLSR 679 Query: 178 ----------------------SSKIRNGVGINKHGNAVFLLSQQATNF------YDFAC 209 S R VG+ G L+ T Sbjct: 680 LEDEGWLGPLSRQTQESEIHRLSKHPRTAVGVTDQGELFVLVFSGRTALSVGADYAQMGR 739 Query: 210 YAKAKL-NVEQLLYLDGTISHM--YMKGGAIPWQRYP 243 A+ + NV ++ +DG S + G YP Sbjct: 740 IARTLVPNVRHMMNVDGGGSAVFGMAVGKVFVELSYP 776 >UniRef50_A9QSK3 Polysaccharide biosynthesis protein n=7 Tax=Streptococcaceae RepID=A9QSK3_LACLK Length = 300 Score = 56.3 bits (134), Expect = 9e-07, Method: Composition-based stats. Identities = 29/208 (13%), Positives = 54/208 (25%), Gaps = 20/208 (9%) Query: 51 NPQTERVKMYWQKANGEAWGTLHALLAD-------INSQGQVQMAMNGGIYDESYAPLGL 103 + T + + ++ N E T+ I + ++ G Sbjct: 97 DLSTNNITI-YRINNPEVLKTVTNRTDQRMKMSEVIAKYPNALIMNASAFDMQTGQVAGF 155 Query: 104 YIENGQQKVALNLASGEG-NFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPML 162 I NG+ + + F I G + Q A G + Sbjct: 156 QINNGKLIQDWSPGTTTQYAFVINKDGSCKIYDS----STPALTIIKNGGQQAYDFGTAI 211 Query: 163 MENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAK--AKLNVEQL 220 + +G I P I + +K N +LS + K + L ++ + Sbjct: 212 IRDGKIQPSDGSVDWKIHI--FIANDKDNNLYAILSDTNAGYD---NIMKSVSNLKLKNM 266 Query: 221 LYLDGTISHMYMKGGAIPWQRYPFVTMI 248 L LD S + Sbjct: 267 LLLDSGGSSQLSVNDKTIVASQDDRAVP 294 >UniRef50_Q119M8 Putative uncharacterized protein n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q119M8_TRIEI Length = 283 Score = 55.1 bits (131), Expect = 2e-06, Method: Composition-based stats. Identities = 27/169 (15%), Positives = 51/169 (30%), Gaps = 27/169 (15%) Query: 105 IENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLME 164 I NGQ +S F ++ G+ V+ G + K E+ L+ Sbjct: 117 ITNGQFFRNDKSSSTALAFPLKSDGI-IVSDGYAGEIEFSHEKLMLEVWN----NRALIS 171 Query: 165 NGVIN---------------PRIHPNVASSKIRNGVGI-NKHGN----AVFLLSQQATNF 204 N V R +G+ +K G+ + + + + Sbjct: 172 KFQPNNLQFSIATNFIVGLQENADKGVEDQTGRTFIGVQDKDGDRLYETILIFTSKQATQ 231 Query: 205 YDFACYAKAKLNVEQLLYLDGTISHMYM-KGGAIPWQRYPFVTMISVER 252 K+ Q++ LDG S + +G + MI++ Sbjct: 232 PHATNVLKS-FGATQVMMLDGGGSTQLICQGNNYIDSQRTIPQMIAIFS 279 >UniRef50_A4FIV8 Secreted protein n=1 Tax=Saccharopolyspora erythraea NRRL 2338 RepID=A4FIV8_SACEN Length = 94 Score = 54.3 bits (129), Expect = 3e-06, Method: Composition-based stats. Identities = 9/68 (13%), Positives = 20/68 (29%), Gaps = 9/68 (13%) Query: 186 GINKHGNAVFLLSQQA-------TNFYDFACYAKAKLNVEQLLYLDGTISHMY-MKGGAI 237 GI++ G + + + A + ++ L + LD S + + G Sbjct: 3 GIDQAGRLLPVTVDGRRPGSSAGFTLLEAARFMRS-LGAVNAMNLDSGGSTSFVVNGKPA 61 Query: 238 PWQRYPFV 245 Sbjct: 62 NSPSDATG 69 >UniRef50_Q8DHE7 Tlr2012 protein n=1 Tax=Thermosynechococcus elongatus BP-1 RepID=Q8DHE7_THEEB Length = 332 Score = 53.9 bits (128), Expect = 5e-06, Method: Composition-based stats. Identities = 39/258 (15%), Positives = 60/258 (23%), Gaps = 65/258 (25%) Query: 34 DDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGI 93 V RV M A+G+ T A + AMN Sbjct: 16 HYSTHRVDQTQVYMVAAPMADYRVVMTEPAASGQD--TYLETTAAFARRTGAVAAMNTNF 73 Query: 94 Y--------------------------------------DESYAPLGL---YIENGQQKV 112 + P G+ YI NG+ Sbjct: 74 FRWLNAQSLRSFQDYQKWIMGESNPEGISYRALRQTCLSGRGTIPAGVAGAYIVNGRVVR 133 Query: 113 ALNLA-SGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPR 171 SG NF + G FY ++ V L++ G P Sbjct: 134 PYEGGYSGIVNFPAQGGIEFYRGR------------LPEQPFNVVSGSQQLLDGGRKLPV 181 Query: 172 IHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYD------FACYAKAKLNVEQLLYLDG 225 + S K + + VF++S N + V + +DG Sbjct: 182 DGSDRTSPK---AILGRRGNEYVFVVSDGRGNGGSPGLSFLQLQDFLLQQGVTEATAMDG 238 Query: 226 TISHMYMKGGAIPWQRYP 243 S + G + Sbjct: 239 GESATLVVEGQVKNHPRD 256 >UniRef50_UPI00016A4F20 hypothetical protein BthaT_13010 n=4 Tax=pseudomallei group RepID=UPI00016A4F20 Length = 196 Score = 52.8 bits (125), Expect = 1e-05, Method: Composition-based stats. Identities = 13/71 (18%), Positives = 23/71 (32%), Gaps = 8/71 (11%) Query: 178 SSKIRNGVGINKHGNAVFLLSQQAT-------NFYDFACYAKAKLNVEQLLYLDGTISHM 230 + R VG+ G+ + + + + A A L + LDG+ S Sbjct: 126 ARNGRTAVGVAADGSVLLVGIDGRQPVPGVGASVPETAAGM-AWLGAASAVTLDGSGSSN 184 Query: 231 YMKGGAIPWQR 241 + GG Sbjct: 185 LVIGGKTVRPS 195 >UniRef50_A5N8M7 Predicted regulatory protein n=2 Tax=Clostridium kluyveri RepID=A5N8M7_CLOK5 Length = 535 Score = 50.5 bits (119), Expect = 5e-05, Method: Composition-based stats. Identities = 22/209 (10%), Positives = 46/209 (22%), Gaps = 60/209 (28%) Query: 34 DDCALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGG 92 ++ ++ T + + NP V + + ++ + + A+N Sbjct: 369 EELGIATRTFRGKILVIKNPSKVEVGYT------KELLRNNKTTDELAKENKALCAINAS 422 Query: 93 IYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEI 152 K I + ++ Sbjct: 423 YISA----------------------------------------KADINSKRSLPETE-- 440 Query: 153 QFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAK 212 G +++ NG + + N N G + + Y Sbjct: 441 -----FG-IIVHNGNVIYKNSGNTKY----NIAGFTDKNVLISGEYSIGASLYQLQQILL 490 Query: 213 AKLNVEQLLYLDGTISHMYMKGGAIPWQR 241 LDG IS G I + Sbjct: 491 EN-GAYTAAVLDGGISSTMYYKGNIINKP 518 >UniRef50_A9V0B9 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9V0B9_MONBE Length = 623 Score = 50.1 bits (118), Expect = 7e-05, Method: Composition-based stats. Identities = 14/84 (16%), Positives = 23/84 (27%), Gaps = 21/84 (25%) Query: 161 MLMENGVINPRIH---------------PNVASSKIRNGVGINKHGNAVFLLSQQ----- 200 L+ NG + R + + +G + Sbjct: 268 WLVRNGQVYVNESIAYECSNIEESGSLQEFANLQSARTALAHDSNGAVRIVQHNGQSGHY 327 Query: 201 ATNFYDFACYAKAKLNVEQLLYLD 224 N Y+FA Y K + V + LD Sbjct: 328 GINLYEFAKYLKQQ-GVVNAINLD 350 Score = 49.7 bits (117), Expect = 1e-04, Method: Composition-based stats. Identities = 13/86 (15%), Positives = 30/86 (34%), Gaps = 3/86 (3%) Query: 50 VNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE-SYAPLGLYIENG 108 ++P+ G G ++++ ++ A N G ++ G + +G Sbjct: 101 LDPRRTLSVYEPGAPGGCGDGDHRTIVSETAARHDCIYATNAGFFNTHDGTCYGDIVSDG 160 Query: 109 QQKVALNLASGEGNFFIRPGGVFYVA 134 + A N + F +R V+ Sbjct: 161 RLVQADNHTN--VQFGVRHDNTIQVS 184 >UniRef50_Q92JI8 Uncharacterized protein RC0079 n=11 Tax=Rickettsia RepID=Y079_RICCN Length = 282 Score = 48.6 bits (114), Expect = 2e-04, Method: Composition-based stats. Identities = 15/89 (16%), Positives = 27/89 (30%), Gaps = 15/89 (16%) Query: 111 KVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINP 170 V +N + FI G + I V P+L++NG Sbjct: 145 VVNINDSVKLILEFIDKDGKLINLSNTASI---------------VTGIPLLVQNGKNVV 189 Query: 171 RIHPNVASSKIRNGVGINKHGNAVFLLSQ 199 + R +G+ G V ++ + Sbjct: 190 DNPKQDDPAHARTALGVCNDGTIVIVVVE 218 >UniRef50_A7MD65 Zgc:165534 protein n=3 Tax=Clupeocephala RepID=A7MD65_DANRE Length = 313 Score = 48.2 bits (113), Expect = 2e-04, Method: Composition-based stats. Identities = 11/36 (30%), Positives = 18/36 (50%), Gaps = 1/36 (2%) Query: 202 TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAI 237 N + A + K + NV + LDG S Y+ G++ Sbjct: 1 MNLWQVAKFLKDQ-NVMNAINLDGGGSATYVLNGSL 35 >UniRef50_C6D289 Putative uncharacterized protein n=1 Tax=Paenibacillus sp. JDR-2 RepID=C6D289_PAESJ Length = 274 Score = 48.2 bits (113), Expect = 3e-04, Method: Composition-based stats. Identities = 25/160 (15%), Positives = 51/160 (31%), Gaps = 16/160 (10%) Query: 87 MAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIR-PGGVFYVAGDK-----VGI 140 +NGG + E+ L + + +G G G ++ G G + Sbjct: 94 YGVNGGFFYENSL-LSIAVVDGLPVNGALDDYGSGAENVKYARGTLVWDGASDKLSVQVV 152 Query: 141 VRLDAFKTSKEIQFAVQSGPM--LMENG----VINPRIHPNVASSKIRNGVGINKHGNAV 194 + K +F Q G L ++ + P ++R+ ++ G Sbjct: 153 RQAADLKVMDHTRFWAQGGISMSLGQDRNWLEQVETEQAPYPDDDRLRSAAVYDREGVLY 212 Query: 195 FLLSQQATNFYDFACYAKAKLN---VEQLLYLDGTISHMY 231 ++S + F K+ + ++LDG S Sbjct: 213 LIVSSSKGSLQSFRDAILEKVGRGMLVDGIFLDGDGSSQL 252 >UniRef50_Q1IXC2 Putative uncharacterized protein n=3 Tax=Deinococcus RepID=Q1IXC2_DEIGD Length = 637 Score = 48.2 bits (113), Expect = 3e-04, Method: Composition-based stats. Identities = 19/133 (14%), Positives = 31/133 (23%), Gaps = 18/133 (13%) Query: 137 KVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFL 196 + Q A+ +GP+L++ G + V + + F Sbjct: 503 TATLNWQAQDAAWDTAQDALSAGPLLVQGGRVVLNAAREVFDT---SASIWRPTRQVAFG 559 Query: 197 LSQQATNF-------YDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPW--------QR 241 + + A A V + LD S G Sbjct: 560 VLDGQPTIAYLEYGTPETFAAALAAAGVRDAVRLDSGSSATAYVTGGYANLGGYLNTVWS 619 Query: 242 YPFVTMISVERKG 254 P I KG Sbjct: 620 RPVPNAIVFVPKG 632 >UniRef50_B2HMV0 Lipoprotein LprO n=21 Tax=Mycobacterium RepID=B2HMV0_MYCMM Length = 381 Score = 48.2 bits (113), Expect = 3e-04, Method: Composition-based stats. Identities = 34/210 (16%), Positives = 57/210 (27%), Gaps = 40/210 (19%) Query: 74 ALLADINSQGQVQMAMNGGIYDESYA------------PLGLYIEN------------GQ 109 L GQ +A+N +D PLG +++N G Sbjct: 149 PPLQAWQRMGQPTIAINANFFDVRGQKGGSWRTTGCSSPLGAFVDNTHGMGRANQAVTGT 208 Query: 110 QKVALNLASGEGNFF--------IRPGGVFYVAGDKVGIVRLD-----AFKTSKEIQFAV 156 A GN I GG YV K +K +F Sbjct: 209 VAYAGKQGLSGGNEVWTSLTTMIIPVGGAPYVLRPKGRQDYDLATPVIQDLLNKNAKFVA 268 Query: 157 QSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLN 216 +G L+ G I + S R + +K + +++ + + + L Sbjct: 269 VAGIGLLSPGDI--GQLHDGGPSAARTALAYSKPKDEMYIFEGGSYTPDNIQDLFR-GLG 325 Query: 217 VEQLLYLDGTISHMYMKGGAIPWQRYPFVT 246 + + LDG S + Sbjct: 326 SDTAILLDGGGSSAIVLRRDTGGMWAGAGA 355 >UniRef50_Q8YTL3 All2704 protein n=4 Tax=Nostocaceae RepID=Q8YTL3_ANASP Length = 310 Score = 48.2 bits (113), Expect = 3e-04, Method: Composition-based stats. Identities = 35/253 (13%), Positives = 67/253 (26%), Gaps = 55/253 (21%) Query: 46 QAYTVNPQTERV----------KMYWQKANGEA--------WGTLHALLADINSQGQVQM 87 NP++ + ++Y + A G+ + L + + + Sbjct: 64 HVIIFNPRSANLDFKVNLGLSHQLYTKDARGKIRREYIPKQFNELISDSNSTLNGRRPIA 123 Query: 88 AMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFK 147 A+N D P GL I G + F F ++G K + Sbjct: 124 AINADYIDPENKPQGLNISRGVEYSGD---------FKNKRSSFGISGGKPQERKATIQA 174 Query: 148 TSKEIQ----FAVQSGPMLMENGVINPRIHPNVA----SSKIRNGVGINKHGNAVFLLSQ 199 +EI V G S R+ I G + L++ Sbjct: 175 GRREINILNYNLVGGNGRFYRQGKFKDICQDLGEFACKQSTNRSMAAITNKGYVILLVND 234 Query: 200 -------------QATNFYDFACYA-----KAKLN-VEQLLYLDGTIS-HMYMKGGAIPW 239 Q F + L +++ + DG +S +Y Sbjct: 235 IKANSNIEINSNNQELTPDKFDDVLEGISRQNCLGKIQEGILFDGGMSPGLYYNKKIYVE 294 Query: 240 QRYPFVTMISVER 252 P ++ + + Sbjct: 295 NPGPIGSVFLIYK 307 >UniRef50_C2LSG0 Putative uncharacterized protein n=1 Tax=Streptococcus salivarius SK126 RepID=C2LSG0_STRSL Length = 339 Score = 47.4 bits (111), Expect = 5e-04, Method: Composition-based stats. Identities = 12/196 (6%), Positives = 42/196 (21%), Gaps = 31/196 (15%) Query: 53 QTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKV 112 + + + + + + +N G + Sbjct: 80 KDGKYILQKGRTEDNNPELTEQSIKYEAKRRDALALINAGFWSYEG-------------- 125 Query: 113 ALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQ-------FAVQSGPMLMEN 165 L+ + + G+ Y + ++ + + +L+++ Sbjct: 126 -LDRPFAQKEIELGKTGLLYGDDQNNITAGTYPNIDTAKMFTHMGSNGWDTGAFGILIKD 184 Query: 166 GVINPRIHPNV-ASSKIRNGVGINKHGNAVFLLSQQATNF-----YDFACYAKAKLNVEQ 219 ++ R+ G + + + ++ L Sbjct: 185 KKVDKTWEKGDPDQPNARSIYVETYDGIIRIIQTYGHNSLNKGLNHEDVYKLLKNLGYSN 244 Query: 220 ---LLYLDGTISHMYM 232 LDG + Sbjct: 245 IRLAFLLDGGGTTRMY 260 >UniRef50_D2W0I7 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2W0I7_NAEGR Length = 201 Score = 45.5 bits (106), Expect = 0.002, Method: Composition-based stats. Identities = 12/58 (20%), Positives = 18/58 (31%), Gaps = 1/58 (1%) Query: 74 ALLADINSQGQVQMAMNGGIYD-ESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGV 130 +D +A N G +D + LG I NG+ F + G Sbjct: 104 ERTSDTAKLNDCIVATNAGFFDVANGYCLGKVISNGKVLNDYGRVHPNAAFGLIDDGK 161 >UniRef50_A0YKD9 Putative uncharacterized protein n=1 Tax=Lyngbya sp. PCC 8106 RepID=A0YKD9_9CYAN Length = 83 Score = 43.9 bits (102), Expect = 0.004, Method: Composition-based stats. Identities = 10/58 (17%), Positives = 19/58 (32%), Gaps = 1/58 (1%) Query: 189 KHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVT 246 + +F+L+ D A K ++++ LDG S + G Sbjct: 4 ADFHLIFILTSPGKTQGDAAQLLKD-FGAKKVMMLDGGGSTQLIVSGRELVSSSDATP 60 >UniRef50_UPI00019038D8 hypothetical protein Retl8_15906 n=1 Tax=Rhizobium etli 8C-3 RepID=UPI00019038D8 Length = 91 Score = 43.9 bits (102), Expect = 0.006, Method: Composition-based stats. Identities = 12/64 (18%), Positives = 29/64 (45%), Gaps = 1/64 (1%) Query: 32 AADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQ-VQMAMN 90 A + T+ ++++W+ A+GE + +L + ++G+ + A+N Sbjct: 26 AQQCGQETFDEAKYVVCTLEVGKVDLRLFWKGADGEPYRAFSSLADAVRAEGRKLIFAVN 85 Query: 91 GGIY 94 G+Y Sbjct: 86 AGMY 89 >UniRef50_C5KB48 Putative uncharacterized protein n=1 Tax=Perkinsus marinus ATCC 50983 RepID=C5KB48_9ALVE Length = 925 Score = 43.5 bits (101), Expect = 0.006, Method: Composition-based stats. Identities = 22/180 (12%), Positives = 50/180 (27%), Gaps = 26/180 (14%) Query: 72 LHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQ-KVALNLASGEGNFFIRPGGV 130 + + ++ + + ++ + P +I++G + A + F P V Sbjct: 707 ITQWIRELTKADENIVLLSNSVQSNFQNPYDCFIQDGIMQRQGYASACPKYAFGDSPVDV 766 Query: 131 FYVAGDKVGIV-----RLDAFKTSKEIQFAVQSGPMLMENGV----INPRIHPN------ 175 F + D D + AV P + +G + Sbjct: 767 FVLDSDTEERRLRSCKVTDECNERFPWRRAVSGRP-FVTDGELRKIPHWDHEDGEEIKYG 825 Query: 176 ------VASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISH 229 ++ + V ++ G V + Q Y+F + + L S Sbjct: 826 EVPWLPSSTEAAFSAVCESRDGQVVLAYAIQPLTAYEFGRALIDS-GIHDAVLL--GGSG 882 >UniRef50_C7Q9L8 Putative uncharacterized protein n=1 Tax=Catenulispora acidiphila DSM 44928 RepID=C7Q9L8_CATAD Length = 206 Score = 42.8 bits (99), Expect = 0.012, Method: Composition-based stats. Identities = 25/135 (18%), Positives = 44/135 (32%), Gaps = 18/135 (13%) Query: 98 YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQ 157 G + +G+Q L F R G F V V ++ Q Sbjct: 2 GDSGGGFYLDGKQHGPLVPGVAAEVF--RRDGSFTVG------VWGRDVTLGSDVVGVRQ 53 Query: 158 SGPMLMENGVI--------NPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFAC 209 ++++ G I + A+ R+GVG+ G+ VF L + A Sbjct: 54 QRQLMVDGGRIAGDIDSLFTWGVTDGGATYVRRSGVGVTADGDVVFAL-GPTMSPRSLAT 112 Query: 210 YAKAKLNVEQLLYLD 224 + + + + LD Sbjct: 113 ALQ-RAGAVRAMELD 126 >UniRef50_C4ICA7 Putative uncharacterized protein n=1 Tax=Clostridium butyricum E4 str. BoNT E BL5262 RepID=C4ICA7_CLOBU Length = 42 Score = 42.8 bits (99), Expect = 0.012, Method: Composition-based stats. Identities = 7/26 (26%), Positives = 8/26 (30%) Query: 214 KLNVEQLLYLDGTISHMYMKGGAIPW 239 KL + LDG S G Sbjct: 3 KLGAVNAINLDGGKSSTMYYNGNTIN 28 >UniRef50_A9GVQ9 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GVQ9_SORC5 Length = 308 Score = 42.0 bits (97), Expect = 0.018, Method: Composition-based stats. Identities = 27/208 (12%), Positives = 55/208 (26%), Gaps = 16/208 (7%) Query: 53 QTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKV 112 K+ + + WG A V G + GL + G + Sbjct: 110 NGPDFKLIRRVNFTDFWGAARAQETATRRARVVVSGTFGTFNQPTGLAFGL--KAGGNLI 167 Query: 113 ALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRI 172 + A+ G + + + + + P ++ G ++ Sbjct: 168 SYGYAAPGGPSPEPHRVMLFAFNNARSRGWIGDYNRDSFTVS-----PDVV--GAVHVDA 220 Query: 173 HPNVASSKIRNGVGINKHGN-----AVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTI 227 R VG+ + + S + + A + +Q + LDG+ Sbjct: 221 DFRPGDLTGRTFVGVRDDDRDGNAETILVFSSSSATTWQ-ASTTISAFGAQQKVMLDGSY 279 Query: 228 SH-MYMKGGAIPWQRYPFVTMISVERKG 254 S + + GG V G Sbjct: 280 STYLIVDGGPRISTAGRLVPHGIAFYSG 307 >UniRef50_Q9RZG9 Putative uncharacterized protein n=1 Tax=Deinococcus radiodurans RepID=Q9RZG9_DEIRA Length = 270 Score = 42.0 bits (97), Expect = 0.020, Method: Composition-based stats. Identities = 32/211 (15%), Positives = 57/211 (27%), Gaps = 29/211 (13%) Query: 53 QTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAP------LGLYIE 106 + +++ K S +NGG Y P L L + Sbjct: 9 TYNGITLHYMKTTASNIVLRRINSNVTASGH---YGINGGFYIL-GEPIESQPLLSLTVN 64 Query: 107 NGQ-----QKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQ----FAVQ 157 N + SG N G VF+ + VR+ + + + + Q Sbjct: 65 NDVPVGSLIYQDTSYGSGWANVGYARGTVFHDTVSRTIGVRVVSNASQISVTNRSNYWAQ 124 Query: 158 SGPMLMENGVINPRIHPNVASSKI-------RNGVGINKHGNAVFLLSQQAT---NFYDF 207 G + N R + R G+ N+ G +++ F Sbjct: 125 GGVSMSLQNDANWRDIAVTQQNLPNPDGVIQRAGLVYNEAGYVYLVMTASGQLGATAGQF 184 Query: 208 ACYAKAKLNVEQLLYLDGTISHMYMKGGAIP 238 K L ++LD + S + Sbjct: 185 RQAIKQTLGALDGIFLDSSGSAQMLCAEFRN 215 >UniRef50_C3QEU3 Predicted protein n=5 Tax=Bacteroides RepID=C3QEU3_9BACE Length = 1114 Score = 40.5 bits (93), Expect = 0.050, Method: Composition-based stats. Identities = 28/222 (12%), Positives = 61/222 (27%), Gaps = 38/222 (17%) Query: 51 NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQ 110 + + Y ++ + + I + + +N G ++ + + Sbjct: 890 DANLYTFR-YVKEPHPGLTNPIGTKALFIIGKNNQPLKVNSGDFEATIT---------KI 939 Query: 111 KVALNLASGEGNFFIRPGGVFYVAGDKVGIV-----RLDAFKTSKEIQFAVQSGPMLMEN 165 + V V GDK + D + S E++ + P+ + N Sbjct: 940 IDGRGTTVEAPYVTDKNEWVLQVTGDKADELVQNLKTGDKVQISAELKIGSSTNPIKVHN 999 Query: 166 GVIN----------PRIHPNVASSKIRNGVGINKHG-NAVFLLSQQATN------FYDFA 208 + P + + +G+ + V T+ FY+ Sbjct: 1000 SSMYRYVYNGVYSAPPKKEDAETINPTTNLGMTQDKSKIVIFCVDGRTDSDRGLDFYEAY 1059 Query: 209 CYAKAKLNVEQLLYLDGTISHMYM-----KGGAIPWQRYPFV 245 K KL + ++ DG S + G I Sbjct: 1060 RVCK-KLGLYDVIRFDGGGSTVMWTYENGIGKVINHVSDTKG 1100 >UniRef50_C7M125 Putative uncharacterized protein n=1 Tax=Acidimicrobium ferrooxidans DSM 10331 RepID=C7M125_ACIFD Length = 344 Score = 40.1 bits (92), Expect = 0.073, Method: Composition-based stats. Identities = 32/226 (14%), Positives = 65/226 (28%), Gaps = 36/226 (15%) Query: 19 IFLALTLLP-------LFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQK-ANGEAWG 70 +T P V + A S V ++ V+++ G ++ Sbjct: 82 TGAPVTWTPAGAVGSGGAVVFTSEVAPSPGAAPVGVAWIDQAHAVVQLFAGTTQPGGSFR 141 Query: 71 TLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGV 130 + + + + A GG G + ++G L G + G Sbjct: 142 YQGMVPPSLVT--NLVAAFEGGFQ--FAVSNGGFEQDGVV--GAPLVEGAASLVELTNGR 195 Query: 131 FYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPN----------VASSK 180 + + A + Q+ +L+++G + P N + Sbjct: 196 VEIGAWGSEVGPSPA------VSAVRQNLTLLVDHGAVLPTASENPLVTWGYSLGNLLAT 249 Query: 181 IRNGVGINKHGNAVFLLSQQ--ATNFYDFACYAKAKLNVEQLLYLD 224 R+G+GI HGN V++ + + LD Sbjct: 250 WRSGLGITSHGNLVWVGGPGLSPATLGS----MLVWAGAVRGMQLD 291 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_B2II06 Putative uncharacterized protein n=5 Tax=Rhizobi... 196 4e-49 UniRef50_P27840 Uncharacterized protein yigE n=68 Tax=Enterobact... 196 7e-49 UniRef50_Q98NI9 Mlr0120 protein n=3 Tax=Alphaproteobacteria RepI... 182 7e-45 UniRef50_A9W4Y6 Putative uncharacterized protein n=4 Tax=Methylo... 180 4e-44 UniRef50_B9JX75 Putative uncharacterized protein n=1 Tax=Agrobac... 178 1e-43 UniRef50_C8PYM1 Putative uncharacterized protein n=1 Tax=Enhydro... 169 6e-41 UniRef50_D2LB58 Putative uncharacterized protein n=1 Tax=Rhodomi... 167 3e-40 UniRef50_Q0BWY7 Putative lipoprotein n=1 Tax=Hyphomonas neptuniu... 167 3e-40 UniRef50_Q1IX28 Periplasmic YigE-like protein n=3 Tax=Deinococcu... 163 6e-39 UniRef50_Q26CZ6 Periplasmic protein n=1 Tax=Flavobacteria bacter... 162 8e-39 UniRef50_A9CIN9 Putative uncharacterized protein n=6 Tax=Proteob... 162 9e-39 UniRef50_Q11X50 Putative uncharacterized protein n=2 Tax=Bactero... 160 3e-38 UniRef50_Q093S1 Putative uncharacterized protein n=1 Tax=Stigmat... 160 3e-38 UniRef50_A9D6B9 Putative uncharacterized protein n=1 Tax=Hoeflea... 160 3e-38 UniRef50_A0Q3C5 Conserved protein n=7 Tax=Clostridia RepID=A0Q3C... 159 7e-38 UniRef50_C5CWT4 Periplasmic protein-like protein n=3 Tax=Bacteri... 158 2e-37 UniRef50_D0SJ31 Putative uncharacterized protein n=1 Tax=Acineto... 158 2e-37 UniRef50_Q0G184 Putative uncharacterized protein n=2 Tax=Auranti... 157 3e-37 UniRef50_C4G6X0 Putative uncharacterized protein n=2 Tax=Lactoba... 156 7e-37 UniRef50_Q1MEZ5 Conserved hypothetical exported protein n=45 Tax... 155 2e-36 UniRef50_B9KP42 Periplasmic protein-like protein n=37 Tax=Rhodob... 153 5e-36 UniRef50_D1AL67 Periplasmic protein-like protein n=1 Tax=Sebalde... 152 7e-36 UniRef50_B8I4Q1 Putative uncharacterized protein n=3 Tax=Clostri... 152 8e-36 UniRef50_A5WGQ7 Periplasmic protein-like protein n=1 Tax=Psychro... 152 1e-35 UniRef50_D0WLU9 Putative uncharacterized protein n=1 Tax=Actinom... 152 1e-35 UniRef50_A1B5U0 Putative uncharacterized protein n=1 Tax=Paracoc... 151 3e-35 UniRef50_C7PF78 Putative uncharacterized protein n=1 Tax=Chitino... 150 3e-35 UniRef50_A6LS70 Putative uncharacterized protein n=23 Tax=Clostr... 150 4e-35 UniRef50_C4FXK4 Putative uncharacterized protein n=1 Tax=Catonel... 150 4e-35 UniRef50_B8FUP3 Putative uncharacterized protein n=2 Tax=Desulfi... 147 2e-34 UniRef50_A7GCS1 Putative uncharacterized protein n=12 Tax=Clostr... 147 3e-34 UniRef50_Q1QCK8 Periplasmic protein-like n=1 Tax=Psychrobacter c... 146 6e-34 UniRef50_A0PY15 Conserved protein n=4 Tax=Clostridium RepID=A0PY... 146 7e-34 UniRef50_D1REU9 Putative uncharacterized protein n=1 Tax=Legione... 146 7e-34 UniRef50_A4VXL8 Exopolysaccharide biosynthesis protein related t... 146 8e-34 UniRef50_B7AQ96 Putative uncharacterized protein n=1 Tax=Bactero... 145 2e-33 UniRef50_Q5WVS5 Putative uncharacterized protein n=5 Tax=Legione... 144 2e-33 UniRef50_Q2NAA1 Putative uncharacterized protein n=3 Tax=Erythro... 144 3e-33 UniRef50_C2KZT9 Exopolysaccharide biosynthesis protein n=2 Tax=F... 144 3e-33 UniRef50_Q97FU6 Uncharaterized conserved protein, YOME B.subtili... 144 3e-33 UniRef50_A5EW16 Putative uncharacterized protein n=1 Tax=Dichelo... 144 4e-33 UniRef50_C2HB28 Exopolysaccharide biosynthesis protein n=4 Tax=E... 142 9e-33 UniRef50_B0TEY5 Putative uncharacterized protein n=1 Tax=Helioba... 142 1e-32 UniRef50_C6XT14 Exopolysaccharide biosynthesis protein n=1 Tax=P... 141 2e-32 UniRef50_C3QHD0 Exopolysaccharide biosynthesis protein n=2 Tax=B... 141 2e-32 UniRef50_UPI0000E45D54 PREDICTED: similar to N-acetylglucosamine... 140 3e-32 UniRef50_C6D6X3 Exopolysaccharide biosynthesis protein n=6 Tax=B... 140 3e-32 UniRef50_Q892K3 N-acetylmuramoyl-L-alanine amidase/putative S-la... 140 4e-32 UniRef50_Q97FU3 Uncharaterized conserved protein, YOME B.subtili... 140 5e-32 UniRef50_C2JZN3 N-acetylmuramoyl-L-alanine amidase/probable S-la... 139 7e-32 UniRef50_C0GEE0 Putative uncharacterized protein n=1 Tax=Dethiob... 139 7e-32 UniRef50_D1N9W8 Putative uncharacterized protein n=1 Tax=Victiva... 139 8e-32 UniRef50_C8N6Z2 Lipoprotein n=1 Tax=Cardiobacterium hominis ATCC... 139 1e-31 UniRef50_C7TED9 N-acetylmuramoyl-L-alanine amidase n=2 Tax=Lacto... 138 2e-31 UniRef50_B2HYZ5 Predicted periplasmic protein n=11 Tax=Acinetoba... 138 2e-31 UniRef50_C1I4R7 Putative uncharacterized protein (Fragment) n=1 ... 138 2e-31 UniRef50_C6IYX5 Putative uncharacterized protein n=1 Tax=Paeniba... 137 3e-31 UniRef50_B1BC21 Putative uncharacterized protein n=2 Tax=Clostri... 137 4e-31 UniRef50_D1BL19 Putative uncharacterized protein n=4 Tax=Veillon... 136 7e-31 UniRef50_B2KU41 N-acetylmuramoyl-L-alanine amidase/putative S-la... 136 7e-31 UniRef50_UPI0001744B4D hypothetical protein VspiD_05050 n=1 Tax=... 135 1e-30 UniRef50_A0YND3 Putative uncharacterized protein n=1 Tax=Lyngbya... 134 4e-30 UniRef50_C8WTH1 N-acetylglucosamine-1-phosphodiester alpha-N-ace... 132 8e-30 UniRef50_UPI000178A82C copper amine oxidase domain protein n=1 T... 132 1e-29 UniRef50_C0ZEQ6 Putative uncharacterized protein n=1 Tax=Breviba... 131 2e-29 UniRef50_B2V2N5 Putative uncharacterized protein n=8 Tax=Clostri... 131 2e-29 UniRef50_C1CWE2 Putative LysM lysin domain protein, n=1 Tax=Dein... 131 2e-29 UniRef50_A9KDD2 Hypothetical exported protein n=6 Tax=Coxiella b... 130 3e-29 UniRef50_B1I1S0 N-acetylglucosamine-1-phosphodiester alpha-N-ace... 130 4e-29 UniRef50_C3R3M8 Putative uncharacterized protein n=1 Tax=Bactero... 129 9e-29 UniRef50_O31980 SPBc2 prophage-derived uncharacterized protein y... 129 1e-28 UniRef50_B7H7U4 Putative uncharacterized protein n=27 Tax=Bacill... 129 1e-28 UniRef50_C9KSV8 N-acetylmuramoyl-L-alanine amidase/putative S-la... 129 1e-28 UniRef50_UPI000180BA0C PREDICTED: similar to predicted protein n... 129 1e-28 UniRef50_C6J074 Copper amine oxidase domain-containing protein n... 129 1e-28 UniRef50_C5PL46 Exopolysaccharide biosynthesis protein n=2 Tax=S... 127 3e-28 UniRef50_C0ZFU4 Putative uncharacterized protein n=1 Tax=Breviba... 127 4e-28 UniRef50_D0TN59 Predicted protein n=3 Tax=Bacteroides RepID=D0TN... 126 8e-28 UniRef50_Q1IXP5 Peptidoglycan-binding LysM domain-containing pro... 125 9e-28 UniRef50_B3CE38 Putative uncharacterized protein n=3 Tax=Bactero... 125 9e-28 UniRef50_Q9UK23 N-acetylglucosamine-1-phosphodiester alpha-N-ace... 125 1e-27 UniRef50_D2NR45 Exopolysaccharide biosynthesis protein related t... 125 1e-27 UniRef50_A7LRK4 Putative uncharacterized protein n=1 Tax=Bactero... 125 1e-27 UniRef50_A6L610 Putative uncharacterized protein n=1 Tax=Bactero... 124 2e-27 UniRef50_C6D0A3 Exopolysaccharide biosynthesis protein n=1 Tax=P... 124 3e-27 UniRef50_C5RID5 Putative uncharacterized protein n=1 Tax=Clostri... 124 3e-27 UniRef50_C8WU56 N-acetylglucosamine-1-phosphodiester alpha-N-ace... 123 4e-27 UniRef50_C6J7B9 Exopolysaccharide biosynthesis protein n=2 Tax=B... 123 7e-27 UniRef50_Q8A0T0 Putative uncharacterized protein n=10 Tax=Bacter... 122 8e-27 UniRef50_A9WEC1 Putative uncharacterized protein n=3 Tax=Chlorof... 122 1e-26 UniRef50_C1XS52 Predicted periplasmic protein (DUF2233) n=1 Tax=... 122 1e-26 UniRef50_C6XXH4 Putative uncharacterized protein n=1 Tax=Pedobac... 122 1e-26 UniRef50_A5USB9 Putative uncharacterized protein n=2 Tax=Roseifl... 122 1e-26 UniRef50_UPI0001BC3362 hypothetical protein BcroD2_01243 n=1 Tax... 122 1e-26 UniRef50_A7V127 Putative uncharacterized protein n=1 Tax=Bactero... 122 1e-26 UniRef50_UPI0001923977 PREDICTED: similar to predicted protein, ... 122 2e-26 UniRef50_C6XT12 NHL repeat containing protein n=2 Tax=Pedobacter... 121 2e-26 UniRef50_B2J8B3 Putative uncharacterized protein n=1 Tax=Nostoc ... 121 2e-26 UniRef50_C4ICA6 Peptidase, M56 family n=1 Tax=Clostridium butyri... 121 3e-26 UniRef50_UPI0001694670 hypothetical protein Plarl_22443 n=1 Tax=... 121 3e-26 UniRef50_C6CV17 Exopolysaccharide biosynthesis protein n=1 Tax=P... 121 3e-26 UniRef50_C9PX63 Putative uncharacterized protein n=1 Tax=Prevote... 120 3e-26 UniRef50_C0CND1 Putative uncharacterized protein n=1 Tax=Blautia... 120 3e-26 UniRef50_B4CZJ8 Putative uncharacterized protein n=1 Tax=Chthoni... 120 4e-26 UniRef50_Q73Q09 Putative uncharacterized protein n=1 Tax=Trepone... 120 4e-26 UniRef50_D1CZ42 Putative uncharacterized protein n=1 Tax=Brucell... 120 5e-26 UniRef50_B8CYN3 SpoIID/LytB domain protein n=1 Tax=Halothermothr... 119 7e-26 UniRef50_A7LRK2 Putative uncharacterized protein n=1 Tax=Bactero... 119 8e-26 UniRef50_B9YC35 Putative uncharacterized protein n=2 Tax=Holdema... 119 1e-25 UniRef50_B7ASL4 Putative uncharacterized protein n=1 Tax=Bactero... 118 1e-25 UniRef50_Q7X4R9 XcbC n=1 Tax=Neisseria meningitidis RepID=Q7X4R9... 118 1e-25 UniRef50_C6XWN0 Putative uncharacterized protein n=1 Tax=Pedobac... 118 1e-25 UniRef50_A9B1E5 Putative uncharacterized protein n=1 Tax=Herpeto... 118 2e-25 UniRef50_A6L611 Putative uncharacterized protein n=1 Tax=Bactero... 118 2e-25 UniRef50_B3RIP6 Putative uncharacterized protein (Fragment) n=2 ... 118 2e-25 UniRef50_C6JBU1 Putative uncharacterized protein n=1 Tax=Ruminoc... 117 3e-25 UniRef50_C5S6T1 Putative uncharacterized protein n=1 Tax=Allochr... 116 6e-25 UniRef50_D2V2G1 N-acetylglucosamine-1-phosphodiester alpha-N-ace... 116 7e-25 UniRef50_Q8YP57 All4343 protein n=5 Tax=Nostocaceae RepID=Q8YP57... 116 7e-25 UniRef50_C4Z4Z5 Putative uncharacterized protein n=1 Tax=Eubacte... 115 1e-24 UniRef50_B8G1I8 Peptidase M56 BlaR1 n=4 Tax=Desulfitobacterium h... 115 1e-24 UniRef50_Q4UP44 Putative uncharacterized protein n=4 Tax=Bacteri... 115 1e-24 UniRef50_C6PYU6 Putative uncharacterized protein n=1 Tax=Clostri... 115 1e-24 UniRef50_B4VX04 Putative uncharacterized protein n=1 Tax=Microco... 115 2e-24 UniRef50_B8HTR4 Putative uncharacterized protein n=1 Tax=Cyanoth... 114 3e-24 UniRef50_B0BZE5 Putative uncharacterized protein n=1 Tax=Acaryoc... 114 3e-24 UniRef50_B0C332 Putative uncharacterized protein n=2 Tax=Bacteri... 114 3e-24 UniRef50_C3R3L4 Putative uncharacterized protein n=2 Tax=Bactero... 113 4e-24 UniRef50_C4Z6E6 Putative uncharacterized protein n=1 Tax=Eubacte... 112 1e-23 UniRef50_Q8YKH7 All7320 protein n=2 Tax=Cyanobacteria RepID=Q8YK... 112 1e-23 UniRef50_B2A8G9 Copper amine oxidase domain protein n=1 Tax=Natr... 112 1e-23 UniRef50_C0WEQ2 Exopolysaccharide biosynthesis protein n=1 Tax=A... 112 2e-23 UniRef50_A7C442 Putative uncharacterized protein n=1 Tax=Beggiat... 111 2e-23 UniRef50_C6LDL7 Putative uncharacterized protein n=1 Tax=Bryante... 111 2e-23 UniRef50_UPI0001C164F4 hypothetical protein CRD_01886 n=2 Tax=No... 111 3e-23 UniRef50_B5VVA8 S-layer domain protein n=3 Tax=Cyanobacteria Rep... 110 3e-23 UniRef50_A7LVE9 Putative uncharacterized protein n=1 Tax=Bactero... 110 3e-23 UniRef50_C8PNM8 Putative uncharacterized protein n=1 Tax=Trepone... 110 4e-23 UniRef50_B8HPJ4 Putative uncharacterized protein n=2 Tax=Cyanoth... 110 4e-23 UniRef50_B4VYL6 Tat pathway signal sequence domain protein n=1 T... 110 5e-23 UniRef50_UPI0001BC7E39 hypothetical protein BacD2_08600 n=1 Tax=... 110 6e-23 UniRef50_Q8RCE6 Putative uncharacterized protein n=5 Tax=Thermoa... 110 6e-23 UniRef50_A6TKB7 Exopolysaccharide biosynthesis protein n=1 Tax=A... 109 7e-23 UniRef50_C4V4S8 Exopolysaccharide biosynthesis protein n=1 Tax=S... 109 1e-22 UniRef50_B4AZH7 Putative uncharacterized protein n=1 Tax=Cyanoth... 109 1e-22 UniRef50_B4WFN8 Putative uncharacterized protein n=1 Tax=Synecho... 108 1e-22 UniRef50_B6V2M3 Gp2.43 n=1 Tax=Bacillus phage SPO1 RepID=B6V2M3_... 108 1e-22 UniRef50_UPI0001C16068 conserved hypothetical protein n=2 Tax=No... 108 2e-22 UniRef50_A0YL57 Putative uncharacterized protein n=1 Tax=Lyngbya... 108 2e-22 UniRef50_A1HRE9 Exopolysaccharide biosynthesis protein n=1 Tax=T... 107 3e-22 UniRef50_C9LSB3 Putative secreted protein n=1 Tax=Selenomonas sp... 107 3e-22 UniRef50_B7KAU9 Putative uncharacterized protein n=7 Tax=Chrooco... 107 4e-22 UniRef50_UPI0001BC335A hypothetical protein BcroD2_01203 n=1 Tax... 107 4e-22 UniRef50_B3QZA6 Putative uncharacterized protein n=1 Tax=Chloroh... 107 5e-22 UniRef50_A5GW09 Putative uncharacterized protein SynRCC307_2165 ... 106 6e-22 UniRef50_B4WS35 Putative uncharacterized protein n=1 Tax=Synecho... 106 6e-22 UniRef50_B0CAS6 Putative uncharacterized protein n=1 Tax=Acaryoc... 106 8e-22 UniRef50_A0YXN3 Putative uncharacterized protein n=1 Tax=Lyngbya... 106 9e-22 UniRef50_C6IP98 Putative uncharacterized protein n=2 Tax=Bactero... 105 1e-21 UniRef50_A0LEU6 Putative uncharacterized protein n=1 Tax=Syntrop... 105 1e-21 UniRef50_B8HPB3 Putative uncharacterized protein n=1 Tax=Cyanoth... 105 1e-21 UniRef50_C2FS46 Putative uncharacterized protein n=2 Tax=Sphingo... 105 1e-21 UniRef50_C9KQW2 Putative secreted protein n=2 Tax=Veillonellacea... 105 2e-21 UniRef50_Q8YKN4 All7259 protein n=2 Tax=Cyanobacteria RepID=Q8YK... 105 2e-21 UniRef50_A7M0H0 Putative uncharacterized protein n=2 Tax=Bactero... 104 2e-21 UniRef50_A3DHF5 Ig-like, group 2 n=3 Tax=Clostridium thermocellu... 104 3e-21 UniRef50_A6TVJ8 Exopolysaccharide biosynthesis protein n=2 Tax=A... 104 4e-21 UniRef50_UPI00016C4EC3 hypothetical protein GobsU_32169 n=1 Tax=... 103 4e-21 UniRef50_A9V9Y5 Predicted protein n=1 Tax=Monosiga brevicollis R... 103 7e-21 UniRef50_B7KAR9 Polysaccharide deacetylase n=3 Tax=Cyanothece Re... 102 1e-20 UniRef50_D2AUR4 Exopolysaccharide biosynthesis protein related t... 102 1e-20 UniRef50_B5RQG1 Uncharacterized conserved protein n=20 Tax=Borre... 102 1e-20 UniRef50_C6IEV9 Putative uncharacterized protein n=2 Tax=Bactero... 102 1e-20 UniRef50_D2J8B1 Putative uncharacterized protein n=1 Tax=Staphyl... 102 1e-20 UniRef50_C9RVV6 Putative uncharacterized protein n=3 Tax=Geobaci... 102 2e-20 UniRef50_B3PTF7 Putative uncharacterized protein n=3 Tax=Rhizobi... 100 4e-20 UniRef50_A7M0G9 Putative uncharacterized protein n=2 Tax=Bactero... 100 4e-20 UniRef50_B2UNL7 Putative uncharacterized protein n=1 Tax=Akkerma... 100 5e-20 UniRef50_UPI0001744905 hypothetical protein VspiD_09365 n=1 Tax=... 100 5e-20 UniRef50_B1XK15 Putative uncharacterized protein n=1 Tax=Synecho... 100 7e-20 UniRef50_UPI0001C43112 hypothetical protein BpOF4_05820 n=1 Tax=... 100 8e-20 UniRef50_P74396 Slr0280 protein n=1 Tax=Synechocystis sp. PCC 68... 99 9e-20 UniRef50_B8J2Y6 Putative uncharacterized protein n=2 Tax=Desulfo... 99 1e-19 UniRef50_Q4ZC55 ORF005 n=1 Tax=Staphylococcus phage EW RepID=Q4Z... 99 1e-19 UniRef50_UPI00019088BB hypothetical protein RetlC8_25680 n=2 Tax... 99 1e-19 UniRef50_B0JW05 Polysaccharide deacetylase family protein n=4 Ta... 99 2e-19 UniRef50_C7IFA0 Exopolysaccharide biosynthesis protein n=1 Tax=C... 99 2e-19 UniRef50_B5YE82 Putative uncharacterized protein n=2 Tax=Dictyog... 99 2e-19 UniRef50_B5Y710 Copper amine oxidase N-domain family n=2 Tax=Cop... 99 2e-19 UniRef50_Q5N4C8 Putative uncharacterized protein n=2 Tax=Synecho... 98 2e-19 UniRef50_B7IEY1 Putative uncharacterized protein n=1 Tax=Thermos... 98 3e-19 UniRef50_Q8DHU5 Tll1850 protein n=1 Tax=Thermosynechococcus elon... 97 4e-19 UniRef50_D1B6I7 N-acetylglucosamine-1-phosphodiester alpha-N-ace... 97 5e-19 UniRef50_UPI0001C30FBA N-acetylglucosamine-1-phosphodiester alph... 97 7e-19 UniRef50_A8W171 Flagellar protein FliS n=1 Tax=Bacillus seleniti... 97 8e-19 UniRef50_A4J956 Copper amine oxidase domain protein n=1 Tax=Desu... 96 8e-19 UniRef50_B1X2V5 Putative uncharacterized protein n=2 Tax=Cyanoth... 96 8e-19 UniRef50_C1A670 Putative uncharacterized protein n=1 Tax=Gemmati... 96 1e-18 UniRef50_C1YVW0 Putative uncharacterized protein n=1 Tax=Nocardi... 96 1e-18 UniRef50_D2RLV8 N-acetylglucosamine-1-phosphodiester alpha-N-ace... 96 1e-18 UniRef50_UPI0001744904 hypothetical protein VspiD_09360 n=1 Tax=... 95 1e-18 UniRef50_Q7U4D6 Putative uncharacterized protein n=11 Tax=Cyanob... 95 2e-18 UniRef50_A3DIP4 Exopolysaccharide biosynthesis protein n=3 Tax=C... 95 2e-18 UniRef50_D1R528 Putative uncharacterized protein n=1 Tax=Parachl... 95 2e-18 UniRef50_C7LNU2 Putative uncharacterized protein n=1 Tax=Desulfo... 95 2e-18 UniRef50_Q3AA51 Conserved domain protein n=1 Tax=Carboxydothermu... 95 3e-18 UniRef50_C8X0Z8 Putative uncharacterized protein n=1 Tax=Desulfo... 95 3e-18 UniRef50_UPI0001C3370C hypothetical protein UCYN_10670 n=1 Tax=c... 95 3e-18 UniRef50_D1VRM0 Putative copper amine oxidase N-domain family n=... 94 3e-18 UniRef50_UPI00017896CA metallophosphoesterase n=1 Tax=Geobacillu... 94 3e-18 UniRef50_D2ASL7 Exopolysaccharide biosynthesis protein related t... 94 3e-18 UniRef50_UPI0001C1628D hypothetical protein CRC_02750 n=2 Tax=No... 94 5e-18 UniRef50_Q01TI8 Putative uncharacterized protein n=1 Tax=Candida... 94 5e-18 UniRef50_B8HP94 Polysaccharide deacetylase n=1 Tax=Cyanothece sp... 93 8e-18 UniRef50_B9XE16 Putative uncharacterized protein n=1 Tax=bacteri... 93 9e-18 UniRef50_C1ABL2 Putative uncharacterized protein n=1 Tax=Gemmati... 93 9e-18 UniRef50_B2S1G8 Hypothetical cytosolic protein n=2 Tax=Borrelia ... 93 9e-18 UniRef50_B8I064 Exopolysaccharide biosynthesis protein n=1 Tax=C... 93 1e-17 UniRef50_A7SGX9 Predicted protein (Fragment) n=2 Tax=Nematostell... 93 1e-17 UniRef50_Q1MS76 Putative uncharacterized protein LI0093 n=1 Tax=... 93 1e-17 UniRef50_A6LP25 Putative uncharacterized protein n=1 Tax=Thermos... 92 1e-17 UniRef50_Q2JPV6 Polysaccharide deacetylase family protein n=2 Ta... 92 1e-17 UniRef50_A4CSS0 Putative uncharacterized protein n=1 Tax=Synecho... 92 1e-17 UniRef50_Q5ULM2 Orf92 n=1 Tax=Lactobacillus phage LP65 RepID=Q5U... 92 2e-17 UniRef50_A1VEZ3 Putative uncharacterized protein n=4 Tax=Desulfo... 92 2e-17 UniRef50_A7HB86 Putative uncharacterized protein n=4 Tax=Anaerom... 92 2e-17 UniRef50_B0JGJ2 Putative uncharacterized protein n=2 Tax=Microcy... 90 5e-17 UniRef50_A9QSN5 Exopolysaccharide biosynthesis protein n=4 Tax=L... 90 5e-17 UniRef50_B7DMS1 Copper amine oxidase domain protein n=3 Tax=Alic... 90 6e-17 UniRef50_A4XD34 Putative uncharacterized protein n=1 Tax=Salinis... 90 6e-17 UniRef50_B1HN11 Putative uncharacterized protein n=2 Tax=Bacilla... 90 8e-17 UniRef50_A4FD37 Secreted protein n=1 Tax=Saccharopolyspora eryth... 90 9e-17 UniRef50_C3YJA0 Putative uncharacterized protein n=3 Tax=Branchi... 89 1e-16 UniRef50_A9NEV6 Hypothetical surface-anchored protein n=1 Tax=Ac... 89 2e-16 UniRef50_B5W3X9 Putative uncharacterized protein n=3 Tax=Arthros... 89 2e-16 UniRef50_C9N2Q2 Metallophosphoesterase n=2 Tax=Actinomycetales R... 88 3e-16 UniRef50_Q9L2D5 Putative secreted protein n=2 Tax=Streptomyces R... 88 3e-16 UniRef50_A7HN47 Putative uncharacterized protein n=1 Tax=Fervido... 88 3e-16 UniRef50_Q2BF40 Putative uncharacterized protein n=1 Tax=Bacillu... 87 4e-16 UniRef50_B8I1Q9 Ig-like, group 2 n=3 Tax=Clostridium RepID=B8I1Q... 87 5e-16 UniRef50_B4WHW3 Putative uncharacterized protein n=1 Tax=Synecho... 87 6e-16 UniRef50_Q2JUI0 Conserved domain protein n=2 Tax=Synechococcus R... 87 6e-16 UniRef50_A5D3T7 Hypothetical membrane protein n=1 Tax=Pelotomacu... 85 1e-15 UniRef50_Q30YC1 Putative uncharacterized protein n=1 Tax=Desulfo... 85 2e-15 UniRef50_Q03K73 Exopolysaccharide biosynthesis protein related t... 85 2e-15 UniRef50_A5ILT0 Putative uncharacterized protein n=6 Tax=Thermot... 84 3e-15 UniRef50_C5C0E0 Metallophosphoesterase n=1 Tax=Beutenbergia cave... 84 3e-15 UniRef50_A1SN25 Exopolysaccharide biosynthesis protein-like n=1 ... 84 4e-15 UniRef50_A5D3R0 Hypothetical membrane protein n=1 Tax=Pelotomacu... 84 5e-15 UniRef50_C0Z816 Putative uncharacterized protein n=1 Tax=Breviba... 84 5e-15 UniRef50_Q7NIQ9 Gll2123 protein n=1 Tax=Gloeobacter violaceus Re... 84 6e-15 UniRef50_D2PZR6 Sporulation domain protein n=4 Tax=Actinomycetal... 84 6e-15 UniRef50_Q67T45 Putative uncharacterized protein n=1 Tax=Symbiob... 83 7e-15 UniRef50_B2A2E0 Copper amine oxidase domain protein n=1 Tax=Natr... 83 9e-15 UniRef50_A9V0B9 Predicted protein n=1 Tax=Monosiga brevicollis R... 83 1e-14 UniRef50_A6TUG6 Copper amine oxidase domain protein n=1 Tax=Alka... 82 1e-14 UniRef50_A9BJK8 Putative uncharacterized protein n=1 Tax=Petroto... 82 1e-14 UniRef50_C7QHR1 Putative uncharacterized protein n=1 Tax=Catenul... 82 2e-14 UniRef50_D1Y6Q3 Putative liporotein n=1 Tax=Pyramidobacter pisco... 81 3e-14 UniRef50_A4XGY7 Putative uncharacterized protein n=2 Tax=Clostri... 81 3e-14 UniRef50_UPI0001C31921 Collagen triple helix repeat protein n=2 ... 81 3e-14 UniRef50_B8FVQ0 Ig-like, group 2 n=2 Tax=Desulfitobacterium hafn... 81 4e-14 UniRef50_C6IV65 Putative uncharacterized protein n=1 Tax=Paeniba... 80 6e-14 UniRef50_C6WLB3 Metallophosphoesterase n=1 Tax=Actinosynnema mir... 80 6e-14 UniRef50_A4FAG7 Secreted protein n=5 Tax=Actinomycetales RepID=A... 80 9e-14 UniRef50_C1TLP7 Sporulation related-protein with S-layer-like do... 79 1e-13 UniRef50_C7PW43 Ig domain protein group 2 domain protein n=2 Tax... 79 1e-13 UniRef50_D1VTW3 Copper amine oxidase N-domain superfamily n=1 Ta... 79 2e-13 UniRef50_A3TM75 Putative uncharacterized protein n=1 Tax=Janibac... 78 3e-13 UniRef50_C9RD84 Copper amine oxidase domain protein n=1 Tax=Ammo... 78 3e-13 UniRef50_C9M6C8 Putative uncharacterized protein n=1 Tax=Jonquet... 78 4e-13 UniRef50_UPI0001746B2F hypothetical protein VspiD_16055 n=1 Tax=... 77 4e-13 UniRef50_C8VW07 S-layer domain protein n=1 Tax=Desulfotomaculum ... 77 5e-13 UniRef50_A4FAL4 Putative uncharacterized protein n=2 Tax=Actinom... 77 5e-13 UniRef50_C7QCB3 Putative uncharacterized protein n=1 Tax=Catenul... 77 5e-13 UniRef50_A3YXL4 Putative uncharacterized protein n=2 Tax=Chrooco... 76 9e-13 UniRef50_C5CET4 Putative uncharacterized protein n=1 Tax=Kosmoto... 76 1e-12 UniRef50_C9PT69 Putative uncharacterized protein n=1 Tax=Prevote... 76 1e-12 UniRef50_C6D5A3 Copper amine oxidase domain protein n=1 Tax=Paen... 75 2e-12 UniRef50_C6J2I2 Copper amine oxidase domain-containing protein n... 75 2e-12 UniRef50_UPI00017890C7 copper amine oxidase domain protein n=1 T... 75 2e-12 UniRef50_A6NQQ4 Putative uncharacterized protein n=1 Tax=Bactero... 74 5e-12 UniRef50_Q8DHE7 Tlr2012 protein n=1 Tax=Thermosynechococcus elon... 74 6e-12 UniRef50_Q0AWB0 Putative uncharacterized protein n=1 Tax=Syntrop... 74 6e-12 UniRef50_Q7NGC8 Glr3243 protein n=1 Tax=Gloeobacter violaceus Re... 73 1e-11 UniRef50_C0AEZ6 Putative uncharacterized protein n=1 Tax=Opituta... 73 1e-11 UniRef50_C4DE18 Putative uncharacterized protein n=1 Tax=Stackeb... 72 1e-11 UniRef50_A9GRW8 Putative uncharacterized protein n=1 Tax=Sorangi... 72 2e-11 UniRef50_A8F5X1 Putative uncharacterized protein n=1 Tax=Thermot... 72 2e-11 UniRef50_D2PRV8 Metallophosphoesterase n=1 Tax=Kribbella flavida... 72 2e-11 UniRef50_A9EQ62 Putative uncharacterized protein n=1 Tax=Sorangi... 70 5e-11 UniRef50_Q826N8 Putative secreted protein n=1 Tax=Streptomyces a... 70 6e-11 UniRef50_Q72HQ9 Putative uncharacterized protein n=4 Tax=Thermac... 70 8e-11 UniRef50_D2PYC0 Metallophosphoesterase n=1 Tax=Kribbella flavida... 67 4e-10 UniRef50_UPI00016BFF19 Ig-like, group 2 n=1 Tax=Epulopiscium sp.... 67 6e-10 UniRef50_A6G841 Putative uncharacterized protein n=1 Tax=Plesioc... 67 6e-10 UniRef50_A5N8M7 Predicted regulatory protein n=2 Tax=Clostridium... 67 8e-10 UniRef50_A3P9C8 Putative lipoprotein n=32 Tax=pseudomallei group... 66 1e-09 UniRef50_C2LSG0 Putative uncharacterized protein n=1 Tax=Strepto... 65 2e-09 UniRef50_Q1IXC2 Putative uncharacterized protein n=3 Tax=Deinoco... 65 3e-09 UniRef50_C1XUX9 Putative uncharacterized protein n=1 Tax=Meiothe... 65 3e-09 UniRef50_A6WEB7 Putative uncharacterized protein n=1 Tax=Kineoco... 64 5e-09 UniRef50_A9QSK3 Polysaccharide biosynthesis protein n=7 Tax=Stre... 63 7e-09 UniRef50_Q119M8 Putative uncharacterized protein n=1 Tax=Trichod... 61 3e-08 UniRef50_Q8YTL3 All2704 protein n=4 Tax=Nostocaceae RepID=Q8YTL3... 60 8e-08 UniRef50_B2HMV0 Lipoprotein LprO n=21 Tax=Mycobacterium RepID=B2... 60 9e-08 UniRef50_C6D289 Putative uncharacterized protein n=1 Tax=Paeniba... 59 1e-07 UniRef50_B8CD22 Predicted protein n=1 Tax=Thalassiosira pseudona... 59 2e-07 UniRef50_C0DAA9 Putative uncharacterized protein n=1 Tax=Clostri... 57 4e-07 UniRef50_C7LY43 Putative uncharacterized protein n=1 Tax=Acidimi... 55 2e-06 UniRef50_A4FIV8 Secreted protein n=1 Tax=Saccharopolyspora eryth... 55 2e-06 UniRef50_D2W0I7 Predicted protein n=1 Tax=Naegleria gruberi RepI... 55 3e-06 UniRef50_UPI00016A4F20 hypothetical protein BthaT_13010 n=4 Tax=... 52 1e-05 UniRef50_Q92JI8 Uncharacterized protein RC0079 n=11 Tax=Ricketts... 50 5e-05 UniRef50_A7MD65 Zgc:165534 protein n=3 Tax=Clupeocephala RepID=A... 50 6e-05 Sequences not found previously or not previously below threshold: UniRef50_B3VCD3 BJA-8 n=1 Tax=Carukia barnesi RepID=B3VCD3_CARBN 47 4e-04 UniRef50_A0YKD9 Putative uncharacterized protein n=1 Tax=Lyngbya... 47 8e-04 UniRef50_C4ICA7 Putative uncharacterized protein n=1 Tax=Clostri... 46 0.001 UniRef50_Q9RZG9 Putative uncharacterized protein n=1 Tax=Deinoco... 45 0.002 UniRef50_C7M125 Putative uncharacterized protein n=1 Tax=Acidimi... 43 0.007 UniRef50_B1FR41 YD repeat n=1 Tax=Burkholderia ambifaria IOP40-1... 43 0.011 UniRef50_UPI00019038D8 hypothetical protein Retl8_15906 n=1 Tax=... 42 0.022 UniRef50_C3QEU3 Predicted protein n=5 Tax=Bacteroides RepID=C3QE... 41 0.032 UniRef50_C5KB48 Putative uncharacterized protein n=1 Tax=Perkins... 41 0.034 UniRef50_B4DZG9 cDNA FLJ52812, highly similar to N-acetylglucosa... 40 0.063 UniRef50_D1BLE5 Putative uncharacterized protein n=3 Tax=Veillon... 40 0.064 UniRef50_A9GVQ9 Putative uncharacterized protein n=1 Tax=Sorangi... 40 0.064 >UniRef50_B2II06 Putative uncharacterized protein n=5 Tax=Rhizobiales RepID=B2II06_BEII9 Length = 269 Score = 196 bits (499), Expect = 4e-49, Method: Composition-based stats. Identities = 72/245 (29%), Positives = 126/245 (51%), Gaps = 2/245 (0%) Query: 11 MITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWG 70 + T L R+FL L L A A L++ + + + ++++WQ+ G+ +G Sbjct: 17 IFTKLLMRVFLPLFLSAGTAWAEPCLPLTEEGINYVVCRFDTKRSDLRLFWQQPGGQPYG 76 Query: 71 TLHALLADIN-SQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGG 129 L A + ++ AMN G++ E +P+GLYI+ G+ N+ +G GNF ++P G Sbjct: 77 GFAPLRAQLQPKGETLEFAMNAGMFQEDLSPVGLYIQEGRLLHPANMRNGPGNFHMKPNG 136 Query: 130 VFYVAGDKVGIVRLDAF-KTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGIN 188 +FY + G++ F ++ + +A QSGP+L+ N ++P+I P S KIRNGVG+ Sbjct: 137 IFYFSQTSAGVMETGRFLQSGLKPDYATQSGPLLVANNQLHPKIEPTGTSEKIRNGVGVR 196 Query: 189 KHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMI 248 + +F +S+ F+ FA + +L+ L+LDG+IS +Y Q P ++ Sbjct: 197 DNHEVIFAISEAPVTFFRFARLFRDRLHCPDALFLDGSISSLYAPSLNRDDQWRPIGPIV 256 Query: 249 SVERK 253 K Sbjct: 257 GAVSK 261 >UniRef50_P27840 Uncharacterized protein yigE n=68 Tax=Enterobacteriaceae RepID=YIGE_ECOLI Length = 254 Score = 196 bits (498), Expect = 7e-49, Method: Composition-based stats. Identities = 254/254 (100%), Positives = 254/254 (100%) Query: 1 MAHQLLIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMY 60 MAHQLLIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMY Sbjct: 1 MAHQLLIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMY 60 Query: 61 WQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGE 120 WQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGE Sbjct: 61 WQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGE 120 Query: 121 GNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSK 180 GNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSK Sbjct: 121 GNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSK 180 Query: 181 IRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQ 240 IRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQ Sbjct: 181 IRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQ 240 Query: 241 RYPFVTMISVERKG 254 RYPFVTMISVERKG Sbjct: 241 RYPFVTMISVERKG 254 >UniRef50_Q98NI9 Mlr0120 protein n=3 Tax=Alphaproteobacteria RepID=Q98NI9_RHILO Length = 263 Score = 182 bits (463), Expect = 7e-45, Method: Composition-based stats. Identities = 72/252 (28%), Positives = 122/252 (48%), Gaps = 3/252 (1%) Query: 5 LLIGKGMITLNLKRIFLALTLLPLFAVA-ADDCALSDPTLTVQAYTVNPQTERVKMYWQK 63 + G + L + + + V+ + + V+P+ ++++W+ Sbjct: 10 APLLLGAVKAALPQAVASTMAFSQWFVSLPPCRDFAFEATSYLICEVDPKLYSIELFWKD 69 Query: 64 ANGEAWGTLHALLADINSQGQV-QMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGN 122 G+ + +LH L A + G+ A+N G+Y P+GLY+E G++ + SG GN Sbjct: 70 PVGKPFQSLHNLDAAQRAAGRTMLFAINAGMYHPDLRPVGLYVERGREMAGVRTGSGSGN 129 Query: 123 FFIRPGGVFYVAGDKVGIVRLDAFKTS-KEIQFAVQSGPMLMENGVINPRIHPNVASSKI 181 F ++P G+FY++G K + F +A QSGPML+ +G ++P+ + S K Sbjct: 130 FSLQPNGIFYISGGKAAVRATRDFVRKRPSTDYATQSGPMLVIDGQLHPKFQSDGTSRKT 189 Query: 182 RNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQR 241 R+GVG+ K G AVF +S NF+ FA + L + L+LDGTIS ++ Sbjct: 190 RDGVGVRKDGVAVFAISNGTVNFHTFARLFRDALGCDNALFLDGTISSLFAPAIGRNDDY 249 Query: 242 YPFVTMISVERK 253 + MI V RK Sbjct: 250 WNLGPMIGVFRK 261 >UniRef50_A9W4Y6 Putative uncharacterized protein n=4 Tax=Methylobacterium extorquens group RepID=A9W4Y6_METEP Length = 258 Score = 180 bits (457), Expect = 4e-44, Method: Composition-based stats. Identities = 76/237 (32%), Positives = 123/237 (51%), Gaps = 4/237 (1%) Query: 19 IFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLAD 78 + + P A A+ TV+ + ERV+++W +G +G+L +L Sbjct: 24 APVPVQAQPAPAAKGPCQAVEFEGQPYTVCTVDLRRERVRLFWLGTDGLPYGSLSSL--A 81 Query: 79 INSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKV 138 ++ AMN G+YD+ AP+GLY+E+G++ + A+G GNF ++P GVFYV GD+ Sbjct: 82 DRQGPRLSFAMNAGMYDKGQAPVGLYVEDGRELKGASTANGPGNFHLKPNGVFYVKGDRA 141 Query: 139 GIVRLDAF-KTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNA-VFL 196 G++ + + FA QSGPML+ +G I+P+I + S KIRNGVG+ G+ VF Sbjct: 142 GVLDTGRYLRAKPAPDFATQSGPMLVIDGKIHPKISADGPSQKIRNGVGVRDGGHVAVFA 201 Query: 197 LSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVERK 253 +S++ F FA K L+LDG++S +Y G P ++ + Sbjct: 202 ISERPVTFGAFARLFKDSFGCRNALFLDGSVSSLYAPGLGRSDLSRPLGPLVGAVGR 258 >UniRef50_B9JX75 Putative uncharacterized protein n=1 Tax=Agrobacterium vitis S4 RepID=B9JX75_AGRVS Length = 274 Score = 178 bits (452), Expect = 1e-43, Method: Composition-based stats. Identities = 78/246 (31%), Positives = 127/246 (51%), Gaps = 4/246 (1%) Query: 12 ITLNLKRIFLALTLLPLFAV--AADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAW 69 I + L I L + P A ++ + +P T ++++ + A+G+ + Sbjct: 26 IVVWLFAILSPLVISPERAEAEEQSCRDQTENGFAYRVCRFDPATRTIRIFNRNADGDVY 85 Query: 70 GTLHALLADINSQG-QVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPG 128 G AL + + Q + A+NGG+Y +P+GL+++ G + A G GNF+++P Sbjct: 86 GGFEALRSQLWQQRLILTFAVNGGMYHSDLSPVGLFVDYGMTRKTAETADGWGNFYLKPN 145 Query: 129 GVFYVAGDKVGIVRLDAFK-TSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGI 187 GVF++ G++ F+ E FA QSGPML+ +GV++P+ P S KIRNGVGI Sbjct: 146 GVFFLKDGHAGVLETGQFETQKIEADFATQSGPMLVIDGVLHPKFLPTSDSLKIRNGVGI 205 Query: 188 NKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTM 247 + G VF+LS+ FYD A + + +L LYLDGTIS + + YP + Sbjct: 206 DASGQVVFVLSKDPVRFYDMAAFFRDRLGAANALYLDGTISSLAEPMAGRIDRAYPLGPI 265 Query: 248 ISVERK 253 I+V + Sbjct: 266 IAVVDQ 271 >UniRef50_C8PYM1 Putative uncharacterized protein n=1 Tax=Enhydrobacter aerosaccus SK60 RepID=C8PYM1_9GAMM Length = 271 Score = 169 bits (429), Expect = 6e-41, Method: Composition-based stats. Identities = 71/264 (26%), Positives = 107/264 (40%), Gaps = 18/264 (6%) Query: 7 IGKGMITLNLKRIFLALTLLPLFAVAA-----DDCALSDPTLTVQAYTVNPQTE-RVKMY 60 I K + + L L A DC ++ + ++ Sbjct: 6 ITKTQAVVVSLCLALLLVASIGLARQFTVKTMPDCQRKSQPFDYSICELDAKNAANFSLH 65 Query: 61 WQKANGE------AWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVAL 114 WQ + + TL L + AMN G+YD ++AP+G + NG+Q AL Sbjct: 66 WQNPSSASHPLLLTFTTLRDYLVSEQPAKTLLFAMNAGMYDSNFAPIGYTVINGKQIRAL 125 Query: 115 NLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTS----KEIQFAVQSGPMLMENGVINP 170 NL G GNF + P GVF+ I + + FA QSGPML+ +G I+P Sbjct: 126 NLKQGGGNFHLMPNGVFWQDRQGFYITESQSMAKKLASGAKPTFATQSGPMLVIDGNIHP 185 Query: 171 RIHPNVASSKIRNGVGINKHG--NAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTIS 228 N S K RNG+G+ H F++S +FY+FA K++L + L+LDG + Sbjct: 186 AFDANSTSRKYRNGIGVCGHNPSRVKFVISDTPVSFYEFADLFKSQLGCDNALFLDGGSA 245 Query: 229 HMYMKGGAIPWQRYPFVTMISVER 252 MI+V + Sbjct: 246 SALYSQTLSRNDNKYMGVMIAVTQ 269 >UniRef50_D2LB58 Putative uncharacterized protein n=1 Tax=Rhodomicrobium vannielii ATCC 17100 RepID=D2LB58_RHOVA Length = 247 Score = 167 bits (424), Expect = 3e-40, Method: Composition-based stats. Identities = 69/243 (28%), Positives = 112/243 (46%), Gaps = 4/243 (1%) Query: 15 NLKRIFLALTLLPLFAVA--ADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTL 72 F+A+ + A + + V+++WQK +G + L Sbjct: 2 LRATAFIAMAAFCGSSEAAAQTCKPYAFEGNGYTLCEASLDRFAVRLFWQKPDGGPYTYL 61 Query: 73 HALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFY 132 AL G++ A+NGG++ Y P+GL++ENG++ V N G GNF +RP G+FY Sbjct: 62 SALPKTDERGGRLAFALNGGMFHPDYKPVGLHVENGRELVRANTRPGPGNFHLRPNGIFY 121 Query: 133 VAGDKVGIVRLDAFKTS-KEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHG 191 + G++ AF + FA QSGPML+ +G ++PRI S+K R+GV + Sbjct: 122 FGEAEAGVMETGAFLKKKPKANFATQSGPMLVIDGKLHPRIAKANVSAKPRDGVCVRGDK 181 Query: 192 NAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTIS-HMYMKGGAIPWQRYPFVTMISV 250 + VF +S F F + L L+LDG + +++ G + MI+V Sbjct: 182 SVVFAISDGGVPFDTFMRLFRDGLKCRNALFLDGGTAPALFVPGTRSGNVLFGLGPMIAV 241 Query: 251 ERK 253 K Sbjct: 242 YEK 244 >UniRef50_Q0BWY7 Putative lipoprotein n=1 Tax=Hyphomonas neptunium ATCC 15444 RepID=Q0BWY7_HYPNA Length = 249 Score = 167 bits (423), Expect = 3e-40, Method: Composition-based stats. Identities = 65/228 (28%), Positives = 108/228 (47%), Gaps = 5/228 (2%) Query: 30 AVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADIN-SQGQVQMA 88 S L + + + ++++ + G +G L + G + A Sbjct: 21 VEEGPCQTRSFENLPYLVCSFDASQDTIRLFLRDETGVPFGQFDRLANHVASKGGNLVFA 80 Query: 89 MNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKT 148 MN G+Y + P+GLYIE G+ ++ L + G GNF + P GVF++ K G+ AF Sbjct: 81 MNAGMYHDDRRPVGLYIEEGEAEMNLVRSPGPGNFGMLPNGVFWIDAGKAGVSETLAFDE 140 Query: 149 ---SKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGN-AVFLLSQQATNF 204 +FA QSGPML+ +G ++P ++P+ S + RNGVG+++ G F++S NF Sbjct: 141 RFKETPPRFATQSGPMLVIDGALHPALNPDGTSLRRRNGVGVSEDGRQVYFVISDVPVNF 200 Query: 205 YDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVER 252 + FA + +L LYLDG +S Y+ ++ V R Sbjct: 201 HSFARLFRDELGTPNALYLDGAVSKAYVPALERSETGLDMGPIVGVIR 248 >UniRef50_Q1IX28 Periplasmic YigE-like protein n=3 Tax=Deinococcus RepID=Q1IX28_DEIGD Length = 317 Score = 163 bits (412), Expect = 6e-39, Method: Composition-based stats. Identities = 74/234 (31%), Positives = 122/234 (52%), Gaps = 9/234 (3%) Query: 3 HQLLIGKGMITLNLKRIFLALTLLPLFAVAAD----DCALSDPTLTVQAYTVNPQTERVK 58 H L + N+ RIF+ LLPL A + ++ + V+ + + ++ Sbjct: 65 HHRLDMLSVRFPNVLRIFV--LLLPLTACSQAGGLDVRRVTAEGMLYTVAAVDLKRDHLR 122 Query: 59 MYWQKAN-GEAWGTLHALLADINSQG-QVQMAMNGGIYDESYAPLGLYIENGQQKVALNL 116 ++W+ G+ + T + A + G QV A N GIY PLGL++E G+ + LN Sbjct: 123 LHWKNPATGQPYRTFAEVSARLRKDGEQVLFATNSGIYGPGLEPLGLHVEEGRTLIGLNN 182 Query: 117 ASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKT-SKEIQFAVQSGPMLMENGVINPRIHPN 175 A GNF + P GVF+V G++ G+ A++ + + FA QSGP+L++ G ++P + Sbjct: 183 ARSGGNFALLPNGVFWVKGNQAGVTETQAYRRLNIQPTFATQSGPLLVQGGRLHPAFNKG 242 Query: 176 VASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISH 229 +S K+R+GVG+ + G F +S NF+ FA + + L LYLDG+IS Sbjct: 243 SSSFKVRSGVGVCRDGRVRFAVSAGPVNFHSFAVFFRDVLGCPDALYLDGSISA 296 >UniRef50_Q26CZ6 Periplasmic protein n=1 Tax=Flavobacteria bacterium BBFL7 RepID=Q26CZ6_9BACT Length = 241 Score = 162 bits (411), Expect = 8e-39, Method: Composition-based stats. Identities = 67/223 (30%), Positives = 107/223 (47%), Gaps = 6/223 (2%) Query: 33 ADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADI-NSQGQVQMAMNG 91 D + D ++ ++ +++++YW + + T L + ++ AMN Sbjct: 22 TQDLIIKDDRFHIKV--IDLTKQKLQLYWLDQDNKPIETFEQLNMHVKQQDKRLVYAMNA 79 Query: 92 GIYDESYAPLGLYIENGQQKVALNL-ASGEGNFFIRPGGVFYV-AGDKVGIVRLDAFKTS 149 G+Y + ++P GLYIENG L+ G GNF+++P GVFY+ K + Sbjct: 80 GMYLKDHSPQGLYIENGTIHKQLDTVTVGYGNFYLQPNGVFYLTQDGKAQVTATPQLSNF 139 Query: 150 KEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFAC 209 I +A QSGPML+ N I+P + + IRN VGI G + +S++ NFYDFA Sbjct: 140 SNITYATQSGPMLVINDTIHPAFNKGSKNVHIRNAVGILPDGRILLAISKEKINFYDFAT 199 Query: 210 YAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVER 252 + K + + LYLDG +S +Y + F MI V Sbjct: 200 FFKNQ-GCKNALYLDGFVSRIYDPTINVEQMDGHFGVMIGVSD 241 >UniRef50_A9CIN9 Putative uncharacterized protein n=6 Tax=Proteobacteria RepID=A9CIN9_AGRT5 Length = 254 Score = 162 bits (410), Expect = 9e-39, Method: Composition-based stats. Identities = 69/222 (31%), Positives = 117/222 (52%), Gaps = 3/222 (1%) Query: 35 DCALSDPTLTVQAYTVNPQTERVKMYWQK-ANGEAWGTLHALLADINSQGQ-VQMAMNGG 92 +++ + +P +++Y Q +G+ + L + + Q AMNGG Sbjct: 30 CKSINHAGGRYTVCSFDPAKNTIRIYDQDHVSGQGYRNFADLSSALWRQHMFSVFAMNGG 89 Query: 93 IYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAF-KTSKE 151 +Y Y+P+GL++ENG ++ ++ G GNF + P GVFY+ G+ G++ +A+ + Sbjct: 90 MYHSDYSPVGLFVENGVERSPVSTRGGWGNFHLLPNGVFYLDGNTAGVLETEAYLAADPK 149 Query: 152 IQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYA 211 FA QSGPML+ +G ++PR P+ S K RNGVG+++ G F +S+ FYDF Sbjct: 150 PDFATQSGPMLVIDGKLHPRFLPDSDSLKRRNGVGVSRDGMVHFAISETTVRFYDFGTLF 209 Query: 212 KAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVERK 253 + L+ LYLDGTIS + + Q + +I+V + Sbjct: 210 RDVLDAPNALYLDGTISSVDIPAMNRRDQLFSMGPIIAVVDR 251 >UniRef50_Q11X50 Putative uncharacterized protein n=2 Tax=Bacteroidetes RepID=Q11X50_CYTH3 Length = 244 Score = 160 bits (406), Expect = 3e-38, Method: Composition-based stats. Identities = 82/217 (37%), Positives = 121/217 (55%), Gaps = 3/217 (1%) Query: 39 SDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADI-NSQGQVQMAMNGGIYDES 97 T+ V +YTV+PQ + ++ YW+ NGE ++ L A + + + A NGG+Y E Sbjct: 25 QQDTIDVISYTVDPQKDNLQFYWKNDNGEILKSIKKLKAYVESKGSTLLFATNGGMYKED 84 Query: 98 YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDK-VGIVRLDAFKTSKEIQFAV 156 +PLGL+I+NG+ LN A G+GNF+++P GVFY+ D I + + F + I+FA Sbjct: 85 RSPLGLFIQNGKTVTPLNKAKGQGNFYMQPNGVFYITNDNEAVICKTEDFINNGNIKFAT 144 Query: 157 QSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLN 216 QSGPM++ N I+P + IRNGVGI + +F +S++ NF+DFA Y + L Sbjct: 145 QSGPMIIVNNQIHPSFIKGSKNLNIRNGVGILPNKKIIFAMSEKEVNFFDFALYFQN-LG 203 Query: 217 VEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVERK 253 E LYLDG +S Y+ F MI V K Sbjct: 204 CENALYLDGFVSRSYLLEKKWLQTDGEFGVMIGVTEK 240 >UniRef50_Q093S1 Putative uncharacterized protein n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q093S1_STIAU Length = 278 Score = 160 bits (406), Expect = 3e-38, Method: Composition-based stats. Identities = 71/257 (27%), Positives = 123/257 (47%), Gaps = 9/257 (3%) Query: 5 LLIGKGMITLNLKRIFLALTLLPLFAVAADD-----CALSDPTLTVQAYTVNPQTERVKM 59 LLIG G+ T + T ++ ++ T Y V+ +++ Sbjct: 19 LLIGSGLGTGATHLLAAPHTPAATRSLQTPTGRVAARRIAYRGNTYDTYEVDLTQSKLRF 78 Query: 60 YWQKANGEAWGTLHALLADINSQG-QVQMAMNGGIYDESYAPLGLYIENGQQKVALNLAS 118 Y+Q+ +G + +L L + +G ++ A N G++ + P+GLY+E+G++ V LN Sbjct: 79 YFQQPDGTPFSSLGNLRGWLQGRGKRLVFATNAGMFTPARRPVGLYVEDGREFVGLNTQE 138 Query: 119 GEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQ--FAVQSGPMLMENGVINPRIHPNV 176 GNFF++P VF+V GI+ A+ + +A QSGP L+ +G ++P Sbjct: 139 EAGNFFLKPNAVFFVTETGAGILESSAYAAHPPAKVLYATQSGPALLLHGQMHPAFREGS 198 Query: 177 ASSKIR-NGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGG 235 + R +GVGI VF ++QQA N ++FA + + + + LYLDG +S MY+ Sbjct: 199 RNLSPRRSGVGIVTPTRVVFAMTQQAVNLHEFASFFRDQFGCQDALYLDGVVSRMYLPAL 258 Query: 236 AIPWQRYPFVTMISVER 252 F MI++ Sbjct: 259 GRDELDGDFGAMIAISE 275 >UniRef50_A9D6B9 Putative uncharacterized protein n=1 Tax=Hoeflea phototrophica DFL-43 RepID=A9D6B9_9RHIZ Length = 286 Score = 160 bits (406), Expect = 3e-38, Method: Composition-based stats. Identities = 64/234 (27%), Positives = 110/234 (47%), Gaps = 6/234 (2%) Query: 26 LPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINS---- 81 + T++PQT +++ ++ G+ G++ A++ + + Sbjct: 47 MTKPDWPEGCVEQVFEGARAILCTIDPQTHDMRLVYRDRMGDVLGSVSAVVDQLAAGAGT 106 Query: 82 QGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKV--G 139 ++ +AMN G+Y +P+GLY+EN + ALN G GNFF++P GVF+V D Sbjct: 107 DHKLVLAMNAGMYHADMSPVGLYVENSVEIAALNRDDGFGNFFLKPNGVFFVLKDGNAGV 166 Query: 140 IVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQ 199 + + ++A QSGPML+ +GVI+PR P+ S IRNGVG+ G VF +++ Sbjct: 167 LETDAYAEADLSPEYATQSGPMLVIDGVIHPRFLPDGTSKFIRNGVGVRPDGKVVFAITR 226 Query: 200 QATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVERK 253 + FA + E L+ DG +S + + P + V + Sbjct: 227 DRVSLGSFARLFRDVAGCENALFFDGAVSSLALGSKMEIDSEEPAGPVAVVVAR 280 >UniRef50_A0Q3C5 Conserved protein n=7 Tax=Clostridia RepID=A0Q3C5_CLONN Length = 335 Score = 159 bits (403), Expect = 7e-38, Method: Composition-based stats. Identities = 37/235 (15%), Positives = 66/235 (28%), Gaps = 34/235 (14%) Query: 30 AVAADDCALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMA 88 + + + NP+ +V E G+ + + + A Sbjct: 99 TNEIEKREIHGDKFKGHLLVIKNPKKIKVGY------NEHLGSKGETTSAMAKRYNSIAA 152 Query: 89 MNGGIY-------------DESYAPLGLYIENGQQK-VALNLASGEGNFFIRPGGVFYVA 134 +N G + + + P G+ I NG+ L I G+ V Sbjct: 153 INAGGFVANNASSKDANPSETNGNPGGILISNGEIVYNNLRNNEKICIAGITADGILLVG 212 Query: 135 GDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAV 194 ++ AV GP L+ NG + R +G K G+ + Sbjct: 213 N------YNLDEMMKLNVKDAVSFGPALIVNGQKTITSGDGGWGTAPRTAIGQRKDGSIL 266 Query: 195 FLLSQQ------ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYP 243 FL+ A + + + LDG S G + + Y Sbjct: 267 FLVIDGKYIGRLAVTLRELQDILY-EYGAYNAVNLDGGSSSTMYYNGKVISEPYK 320 >UniRef50_C5CWT4 Periplasmic protein-like protein n=3 Tax=Bacteria RepID=C5CWT4_VARPS Length = 238 Score = 158 bits (400), Expect = 2e-37, Method: Composition-based stats. Identities = 67/210 (31%), Positives = 118/210 (56%), Gaps = 4/210 (1%) Query: 44 TVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQ-VQMAMNGGIYDESYAPLG 102 ++ + ER++++ +G + L A + ++ + + AMN G+Y ++P+G Sbjct: 27 RYTVVKIDVRRERLELFLHDDSGAPFKRFDRLEAWLAARNRQLVFAMNAGMYHADFSPVG 86 Query: 103 LYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFK--TSKEIQFAVQSGP 160 L ++ G+++ LNLA+G GNFF++P GVF V+ +V + + ++ A QSGP Sbjct: 87 LLVQEGREEAPLNLAAGAGNFFLKPNGVFLVSDAGPRVVESSEYAALPKEGVRLATQSGP 146 Query: 161 MLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQL 220 +L+ GV++P P+ S KIRNGVG++ H A+F++S+Q NFY+FA Y + L+ Sbjct: 147 LLLRRGVVHPAFIPDSDSRKIRNGVGVSGH-TAIFVISEQPVNFYEFALYFRDVLHCRDA 205 Query: 221 LYLDGTISHMYMKGGAIPWQRYPFVTMISV 250 LYLDGT+S ++ ++ V Sbjct: 206 LYLDGTVSALHSLALRRSDFTRELGPILGV 235 >UniRef50_D0SJ31 Putative uncharacterized protein n=1 Tax=Acinetobacter junii SH205 RepID=D0SJ31_ACIJU Length = 252 Score = 158 bits (399), Expect = 2e-37, Method: Composition-based stats. Identities = 67/233 (28%), Positives = 123/233 (52%), Gaps = 4/233 (1%) Query: 23 LTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKAN-GEAWGTLHALLADINS 81 + A + ++ + + V+ + ++++ + G+ + + +D+ + Sbjct: 17 MVFQATTVFAFEYQSIKFEDVQFEVIKVD-DLKDLQLFLKNPRIGDFYQKFSNIQSDLAA 75 Query: 82 QGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIV 141 +++ AMN G+Y ++ P+GLYIE ++ LN ++G GNFF++P GV I Sbjct: 76 CKELRFAMNAGMYHPNFEPVGLYIEKKKKLSELNESTGFGNFFMQPNGVVVWNDHGAVIH 135 Query: 142 RLDAFKTSK-EIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQ 200 +K + FA QSGPML+ G+IN + + S KIRNGVG+ + F++S+Q Sbjct: 136 STADYKRANFTANFATQSGPMLVHKGLINSQFIKDSNSLKIRNGVGVRDD-HLYFVISEQ 194 Query: 201 ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVERK 253 NFY FA + K +L V++ LYLDG+IS +Y+K ++Y ++ + + Sbjct: 195 RINFYQFAKFFKHQLRVDEALYLDGSISSLYLKDIQRNDRKYNLGPIVGLTHQ 247 >UniRef50_Q0G184 Putative uncharacterized protein n=2 Tax=Aurantimonadaceae RepID=Q0G184_9RHIZ Length = 268 Score = 157 bits (397), Expect = 3e-37, Method: Composition-based stats. Identities = 66/249 (26%), Positives = 118/249 (47%), Gaps = 8/249 (3%) Query: 11 MITLNLKRIFLALTLLPLFAVAADDCALSD----PTLTVQAYTVNPQTERVKMYWQKANG 66 + + L+P+ + A + ++ V + + + G Sbjct: 21 LSFPAVLATGFLSWLVPVPDLPAGHEGICRIAMAGSVETILCEVPLSSFDLHLRALDDAG 80 Query: 67 EAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIR 126 + T A + G+V +AMN G+Y E P+GL +++G+ L +G GNF +R Sbjct: 81 RPYETFEKAAASL--SGEVVLAMNAGMYHEDRRPVGLTVQDGRIVKKAVLGTGSGNFSLR 138 Query: 127 PGGVFYVAGDKVGIVRLDAFKTSK-EIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGV 185 P G+FY+ + + + + + A QSGPML+ G ++PR P S +RNGV Sbjct: 139 PNGIFYLEDGRAFVRETERYLGESHDPVLATQSGPMLLIGGKVHPRFIPTSDSLYVRNGV 198 Query: 186 GINKHGNAVFLL-SQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPF 244 G+++ G VFL +++ NFYDFA + + + V+ L+ DG +S + + I ++R Sbjct: 199 GVSEDGRTVFLALTRKPINFYDFALFFRDTVGVKDALFFDGQVSSLSYRAANIAYRRDRL 258 Query: 245 VTMISVERK 253 M+ V +K Sbjct: 259 GPMLLVTKK 267 >UniRef50_C4G6X0 Putative uncharacterized protein n=2 Tax=Lactobacillales RepID=C4G6X0_ABIDE Length = 345 Score = 156 bits (394), Expect = 7e-37, Method: Composition-based stats. Identities = 38/223 (17%), Positives = 66/223 (29%), Gaps = 17/223 (7%) Query: 37 ALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE 96 + TV + + A + + + + +A+NG Y Sbjct: 118 TVRKNNTTVYVADIKLSDSSY-LKTALAYDSFGTNVTETTSSMATNNNAILAVNGDYYGA 176 Query: 97 SYAPLGLYIENGQQKVALNL-ASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFA 155 + G I+NG S + + G F + + + + Sbjct: 177 DRS--GYVIKNGVIYRNTVRSDSEYPDLAVYKDGSFKIIYET---EVTAEELLADGVVNL 231 Query: 156 VQSGPMLMENGVINPRIHPNVAS---SKIRNGVGINKHGNAVFLLSQQATN------FYD 206 GP L+ENG I+ + V R +GI + + ++S T+ Y+ Sbjct: 232 FAFGPSLVENGEISVDQNTEVRQAMTKNPRTAIGIVDKNHYILVVSDGRTSESEGLSLYE 291 Query: 207 FACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMIS 249 A K + LDG S G I IS Sbjct: 292 LAEVLK-EYGATTAYNLDGGGSSTMYFNGNIVNNPTTNGHRIS 333 >UniRef50_Q1MEZ5 Conserved hypothetical exported protein n=45 Tax=Rhizobiales RepID=Q1MEZ5_RHIL3 Length = 258 Score = 155 bits (391), Expect = 2e-36, Method: Composition-based stats. Identities = 60/232 (25%), Positives = 101/232 (43%), Gaps = 11/232 (4%) Query: 32 AADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQM-AMN 90 A S T+ P ++++W+ A+G + +L + ++G+ A+N Sbjct: 27 AQACEQESFEEAKYVVCTLEPGKADLRLFWKNADGAPYRAFSSLAEAVRAEGRTLAFAVN 86 Query: 91 GGIYDESYAPLGLYIENGQQKVALNLASGEG------NFFIRPGGVFYVAGDKVGIVRLD 144 G+Y ++P+GLY+ENG++ N E NF+ +P GVF++ GI+ D Sbjct: 87 AGMYRADFSPMGLYVENGRELNPANTTEAESSSGQVPNFYKKPNGVFFLGETGAGILPTD 146 Query: 145 AFK-TSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATN 203 F + +FA QSGPML+ +NP R+GVG + G F +S+ N Sbjct: 147 EFLKRRPKARFATQSGPMLVIANKLNPIFIVGSTDRTRRSGVGTCERGAVRFAISEDRVN 206 Query: 204 FYDFACYAKAKLNVEQLLYLDGT-ISHMYMKGGAIPWQR--YPFVTMISVER 252 F+DFA + L L+LDG +Y + + + Sbjct: 207 FHDFARLFRDHLKCPDALFLDGGRGVGLYNPDMGHNDWSWHGGYGPIFGLVE 258 >UniRef50_B9KP42 Periplasmic protein-like protein n=37 Tax=Rhodobacterales RepID=B9KP42_RHOSK Length = 245 Score = 153 bits (387), Expect = 5e-36, Method: Composition-based stats. Identities = 64/242 (26%), Positives = 106/242 (43%), Gaps = 8/242 (3%) Query: 17 KRIFLALTLLPLF-----AVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGT 71 R LA L L+ L+ + ++++ +G +G+ Sbjct: 1 MRTRLAAILFALWPAACATAEPACRDLTFEGTRYSLCEA-QAGDDIRIFQTAPDGRPYGS 59 Query: 72 LHALLADI-NSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGV 130 + + + Q+ AMN G+Y P+GL IE ++ L ++G GNF + P GV Sbjct: 60 FERINSALDGEGRQLAFAMNAGMYHADRRPVGLLIEEEVERAPLVTSAGPGNFGLLPNGV 119 Query: 131 FYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKH 190 F V I + A QSGPML+ G ++PR + S IRNGVG++ Sbjct: 120 FCVGDGFRVIESRSFAAERPACRHASQSGPMLVIGGELHPRFLVHSDSRYIRNGVGVSAD 179 Query: 191 G-NAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMIS 249 G AVF +S + F++F + +L + + LY DG+IS +Y +G P ++ Sbjct: 180 GRRAVFAISNRPVTFHEFGRLFRDELGLPEALYFDGSISRLYDRGARRSDWGTPMGPIVG 239 Query: 250 VE 251 + Sbjct: 240 LV 241 >UniRef50_D1AL67 Periplasmic protein-like protein n=1 Tax=Sebaldella termitidis ATCC 33386 RepID=D1AL67_SEBTE Length = 266 Score = 152 bits (385), Expect = 7e-36, Method: Composition-based stats. Identities = 94/219 (42%), Positives = 132/219 (60%), Gaps = 4/219 (1%) Query: 36 CALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYD 95 + D TV + E +KMYW+ N +A+ L + + + ++ A NGGIY Sbjct: 50 KKIEDRGFTVY--KPDLNKEIIKMYWKDENNKAYSELSKFIQEN-TGNKINFATNGGIYS 106 Query: 96 ESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFA 155 E Y P GLYIEN + +NLA GEGNF+++P GVFY+ ++ I AF+ ++ I +A Sbjct: 107 EEYEPNGLYIENHKIISKINLADGEGNFYMQPNGVFYIQNNQPKISESKAFEYNENISYA 166 Query: 156 VQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKL 215 QSGP+L+ENGVIN +I N S KIR+ VGI++ FL+S + NFYDF+ YA KL Sbjct: 167 TQSGPLLIENGVINKKIGKNSESFKIRSAVGIDRENKVFFLMSSEKINFYDFSKYALDKL 226 Query: 216 NVEQLLYLDGTISHMYMKG-GAIPWQRYPFVTMISVERK 253 N + LL+LDG IS MY IP Q YPF +I+ E++ Sbjct: 227 NCKDLLFLDGAISKMYFADEKKIPEQDYPFAVIITSEKR 265 >UniRef50_B8I4Q1 Putative uncharacterized protein n=3 Tax=Clostridium RepID=B8I4Q1_CLOCE Length = 346 Score = 152 bits (385), Expect = 8e-36, Method: Composition-based stats. Identities = 31/225 (13%), Positives = 56/225 (24%), Gaps = 28/225 (12%) Query: 34 DDCALSDPTLTVQAYTVN-PQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGG 92 + + + V+ P +V + + I + A+NGG Sbjct: 120 EYFDVESRNFKGKMIIVDDPTRIKVGYSSKMP------RSGETTSSIARRNGAVAAINGG 173 Query: 93 IY------DESYAPLGLYIENGQQKVA--LNLASGEGNFFIRPGGVFYVAGDKVGIVRLD 144 + +G I NG+ N + G+ V Sbjct: 174 GFIDKGWAGTGGVAIGFVISNGKYISGKLTNNYTKRDTIAFTKDGMLIVGK------HSQ 227 Query: 145 AFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQAT-- 202 A I+ + GP L+ NG R +G + G+ + L+ + Sbjct: 228 AELAKYNIKEGISFGPPLIVNGKPTINKGDGGWGISPRTAIGQKEDGSVMLLVIDGRSLK 287 Query: 203 ----NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYP 243 + LDG S G + Sbjct: 288 SFGATLKEVQDIMLEH-GAVNAANLDGGSSATMYYDGKVVNTPSD 331 >UniRef50_A5WGQ7 Periplasmic protein-like protein n=1 Tax=Psychrobacter sp. PRwf-1 RepID=A5WGQ7_PSYWF Length = 309 Score = 152 bits (384), Expect = 1e-35, Method: Composition-based stats. Identities = 56/206 (27%), Positives = 96/206 (46%), Gaps = 6/206 (2%) Query: 53 QTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKV 112 + + Q + E L+ D+ +++ A N G+YD ++AP+G + G+Q + Sbjct: 95 NQPQAAIVDQDKSHEPLYKFDTLIKDLPKDSELKFAANAGMYDGNFAPIGYTVIQGRQIL 154 Query: 113 ALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKT------SKEIQFAVQSGPMLMENG 166 +LNL G GNF + P GV + + + +A QSGPML+ +G Sbjct: 155 SLNLKQGGGNFHLLPNGVLWWDKANHVHITESTQLDAMLKSGEAKPWYATQSGPMLVIDG 214 Query: 167 VINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGT 226 I+P+ + + S KIRNGVG+ F+ S++ NFY FA + K L+ + L+LDG Sbjct: 215 HIHPKFNSDSTSKKIRNGVGVCDGSQIHFVTSREPVNFYQFARFFKEDLHCDNALFLDGG 274 Query: 227 ISHMYMKGGAIPWQRYPFVTMISVER 252 ++ + M+ + Sbjct: 275 VASALYAPDVAAQEEKNMGVMVGLIE 300 >UniRef50_D0WLU9 Putative uncharacterized protein n=1 Tax=Actinomyces sp. oral taxon 848 str. F0332 RepID=D0WLU9_9ACTO Length = 447 Score = 152 bits (383), Expect = 1e-35, Method: Composition-based stats. Identities = 39/238 (16%), Positives = 66/238 (27%), Gaps = 27/238 (11%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQ 84 V + T+T TV + AN + + + I S Sbjct: 206 ANGTVKVEQIATGSGNNTVTYYVATVKLTDAT-ALKSAFANNQFGRNITQKTSTIASNNN 264 Query: 85 VQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLD 144 A+NG Y + G+ I NG +G G + + Sbjct: 265 AIFAINGDYY--GFRSSGIVIRNGVVYRDDGARAG---LAFYRDGSVKIYDET---STNG 316 Query: 145 AFKTSKEIQFAVQSGPMLMENGVINPRIHP----------NVASSKIRNGVGINKHGNAV 194 + + + GP L++NG I I ++ ++ R VG K G V Sbjct: 317 QKLVKEGVWNTLSFGPSLVKNGKIVEGIDDVEIDTNFGNHSIQGNQPRTLVGAKKDGTLV 376 Query: 195 FLLSQQA-------TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFV 245 F++ + A + LDG S G + + Sbjct: 377 FVVVDGRDAGYSRGVTMTEAAKIMLEQ-GCVTAYNLDGGGSSTMYFNGEVINEPSNGG 433 >UniRef50_A1B5U0 Putative uncharacterized protein n=1 Tax=Paracoccus denitrificans PD1222 RepID=A1B5U0_PARDP Length = 251 Score = 151 bits (381), Expect = 3e-35, Method: Composition-based stats. Identities = 67/239 (28%), Positives = 104/239 (43%), Gaps = 4/239 (1%) Query: 19 IFLALTLLPLFAVAADDCALSDPTLTVQAYTVNP-QTERVKMYWQKANGEAWGTLHALLA 77 F AL + L A+A T+ Q ++++ +G G A+ Sbjct: 12 AFGALIAMTLPALAGICEKRDFDGQGYVICTLTAGQEPGLRLWLNGPDGRTLGDFTAVRR 71 Query: 78 DINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDK 137 + + AMN G+Y + P+GLY+ +G + L A G GNF + P GVF G + Sbjct: 72 TLAQGESLGFAMNAGMYHPDFTPVGLYVSDGVSQHDLVTAGGGGNFGMLPNGVFCAGGAR 131 Query: 138 V--GIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNA-V 194 I K + + + A QSGPML+ +G ++PR + S IRNGVG++ G Sbjct: 132 PYQVIESRAFAKAAPDCRLATQSGPMLVIDGALHPRFLVDSDSRYIRNGVGVSPDGQTAW 191 Query: 195 FLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVERK 253 F +S +A F+ F + L LY DG+IS +Y G +I + Sbjct: 192 FAISDRAVTFHQFGRLFRDGLGARDALYFDGSISRLYAPGLGRADFGRRLGPIIGYVGQ 250 >UniRef50_C7PF78 Putative uncharacterized protein n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PF78_CHIPD Length = 273 Score = 150 bits (380), Expect = 3e-35, Method: Composition-based stats. Identities = 70/226 (30%), Positives = 115/226 (50%), Gaps = 8/226 (3%) Query: 36 CALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADIN--SQGQVQMAMNGGI 93 + A VNP + ++W A+ + L D+ + + M NGG+ Sbjct: 48 ITFTHNGQQYDAIVVNPAVSDISLHWLSADQQTPYKSIQALQDVLLEKKKDILMITNGGM 107 Query: 94 YDESYAPLGLYIENGQQKVALNLASGE-GNFFIRPGGVFYVAGDKVGIVRLDAFKTSK-- 150 + ++ P+GL+I G++ ++ A+ + GNF+++P GVFY+ + + Sbjct: 108 FMKNNIPVGLFISQGRELRPIDAATDQPGNFYMQPNGVFYLDHTGPHVSTTTDYLKRSRA 167 Query: 151 --EIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATN-FYDF 207 +I A QSGPML+ G+IN + +P + +R+GVGI +GN VF++S++A FYDF Sbjct: 168 HSKIVAATQSGPMLVSKGIINAKFNPGSVNRNLRSGVGILSNGNVVFIISKEAQTTFYDF 227 Query: 208 ACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVERK 253 A KA+ + LYLDG IS MY+K F MI+V + Sbjct: 228 ASIFKARFGCKDALYLDGAISKMYLKNSRPGDLNGDFGAMIAVTAR 273 >UniRef50_A6LS70 Putative uncharacterized protein n=23 Tax=Clostridium RepID=A6LS70_CLOB8 Length = 356 Score = 150 bits (379), Expect = 4e-35, Method: Composition-based stats. Identities = 35/224 (15%), Positives = 63/224 (28%), Gaps = 30/224 (13%) Query: 37 ALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYD 95 + + V +P ++ + G + I A+NGG + Sbjct: 132 DIENSKYNGYYLVVKDPTRVKIGV------SSKLGVEGETTSTIAENNDAIAAINGGAFT 185 Query: 96 E----------SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDA 145 + G+ + G+ KV + F I GV V Sbjct: 186 DQSSAAQWTGNGGLASGIVMTGGEVKVNDVGDNPTTTFGIDKNGVMVVGD------YTVE 239 Query: 146 FKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA---- 201 IQ A+ GP L+ NG + + + +G K G+ + L+ Sbjct: 240 KLKELGIQEALSFGPALIINGNMVKINGDGGFGTAPKTAIGQMKDGSIILLVIDGREIGS 299 Query: 202 --TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYP 243 + +L + LDG S G + + Sbjct: 300 IGATLKELQEIM-HQLGAWNAMNLDGGKSTTLYYYGEVRNKPSN 342 >UniRef50_C4FXK4 Putative uncharacterized protein n=1 Tax=Catonella morbi ATCC 51271 RepID=C4FXK4_9FIRM Length = 305 Score = 150 bits (379), Expect = 4e-35, Method: Composition-based stats. Identities = 44/240 (18%), Positives = 76/240 (31%), Gaps = 21/240 (8%) Query: 23 LTLLPLFAVAADDCAL-----SDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLA 77 T+ A D A+ + + TV + + A+ + A + Sbjct: 62 ATVNTATAYEDDTKAIAIDTYERNSTQIHVATVTIKG-DASIKTALADETYGRNVKAKTS 120 Query: 78 DINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDK 137 +A+NG Y G I NGQ + + + + I G F + + Sbjct: 121 TTAQSVNAVLAVNGDYY--GARDAGYVIRNGQLLRSDSQDPNQEDLVIYQDGSFEIIREG 178 Query: 138 VGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVA---SSKIRNGVGINKHGNAV 194 +K + GP L+E+ + V +S R +GI + V Sbjct: 179 DI---TAQELLNKGAVQVLSFGPALIEDSQVAVDSTDEVGKAMASNPRTAIGIIDDKHYV 235 Query: 195 FLLSQQAT------NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMI 248 ++S T + + A + K +L V LDG S G I + I Sbjct: 236 LVVSDGRTDESKGLSLKELADFMK-ELKVTTAYNLDGGGSSTMYFNGQIINKPTTNGHNI 294 >UniRef50_B8FUP3 Putative uncharacterized protein n=2 Tax=Desulfitobacterium hafniense RepID=B8FUP3_DESHD Length = 350 Score = 147 bits (372), Expect = 2e-34, Method: Composition-based stats. Identities = 37/226 (16%), Positives = 57/226 (25%), Gaps = 28/226 (12%) Query: 36 CALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIY 94 +S V NPQ R+ A + G +++ +N G + Sbjct: 123 VEVSGKGFQGYLLKVGNPQRVRL------AATDQLGDRGLKVSEFVENNHAVAGINAGGF 176 Query: 95 DE------SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKT 148 + P G+ I G+ N + I V V Sbjct: 177 ADPGGVSFGGTPTGILITEGKIIHKDNWETYS-LIGITKHDVLVVG------RYTLEQIE 229 Query: 149 SKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQAT------ 202 I+ AV GP L+ NG R +G G + L+ Sbjct: 230 ELGIRDAVSFGPALIVNGEPMITYGDGGWGIAPRTAIGQTHDGTILLLVIDGRQLGSLGA 289 Query: 203 NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPW-QRYPFVTM 247 D LDG S + G + P+ Sbjct: 290 TLKDVQDILIEH-GAVNGANLDGGSSSTLVYEGEVKNKPSSPYGPR 334 >UniRef50_A7GCS1 Putative uncharacterized protein n=12 Tax=Clostridium RepID=A7GCS1_CLOBL Length = 339 Score = 147 bits (371), Expect = 3e-34, Method: Composition-based stats. Identities = 38/253 (15%), Positives = 72/253 (28%), Gaps = 36/253 (14%) Query: 8 GKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTV-NPQTERVKMYWQKANG 66 KG+ +LK + + + + + + ++ + NPQ ++ Sbjct: 87 DKGISNSSLKENYGDIKIKNKYGNSVERYDINTAKFDGYILEIKNPQKVKIGYT------ 140 Query: 67 EAWGTLHALLADINSQGQVQMAMNGGIY-----------DESYAPLGLYIENGQQKVALN 115 + G + + + + A+NGG + P GL I NG+ Sbjct: 141 KYMGKMGERTSKMAERHGAVAAVNGGGFRDVSSTGKLWTGTGAYPEGLVISNGKVIYNDF 200 Query: 116 LASGEGNF-FIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHP 174 + + N G+ V V + A+ L+ NG P Sbjct: 201 KSGQKVNVTAFTKEGLLVVGDHTVDE------LLKMGVVEALSFRNTLIINGKPIPY--- 251 Query: 175 NVASSKIRNGVGINKHGNAVFLLSQQA------TNFYDFACYAKAKLNVEQLLYLDGTIS 228 R +G + G V L+ + + V LDG S Sbjct: 252 -NEGINPRTAIGQKQDGTIVLLVIDGRRGIKQGATLEEVENILLQR-GVVNASNLDGGSS 309 Query: 229 HMYMKGGAIPWQR 241 G + + Sbjct: 310 STMYYKGKVINRP 322 >UniRef50_Q1QCK8 Periplasmic protein-like n=1 Tax=Psychrobacter cryohalolentis K5 RepID=Q1QCK8_PSYCK Length = 276 Score = 146 bits (369), Expect = 6e-34, Method: Composition-based stats. Identities = 67/238 (28%), Positives = 114/238 (47%), Gaps = 7/238 (2%) Query: 24 TLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANG-EAWGTLHALLADINSQ 82 T ++ + + + T +Q+ + + + ++WQ+++ + T LL+ + Sbjct: 38 TASTDWSCQSHNTPFAYSTCHIQSDLLTNKRYSLALFWQQSDSRQPLLTFDNLLSTLPPS 97 Query: 83 GQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAG-DKVGIV 141 ++ AMN G+Y+E+YAP+G + ++ ALNL G GNF + P GV + KV I Sbjct: 98 QSLKFAMNAGMYNENYAPIGYTVIKSEEIRALNLKEGGGNFHLLPNGVLWWDKSGKVQIT 157 Query: 142 RLDAFKTS-----KEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFL 196 +A + +A QSGPML+ N I+P+ P+ S+KIRNG+G+ G+ F+ Sbjct: 158 ESNALAEQLKNGIAQPLYATQSGPMLVINDAIHPQFDPDGTSAKIRNGIGVCSDGSLQFV 217 Query: 197 LSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVERKG 254 S+ FY FA K +L L+LDG I+ + MI + Sbjct: 218 NSEAPVAFYQFASLFKNELKCPNALFLDGGIASALYAPTIDKHDKKEMGVMIGLVESA 275 >UniRef50_A0PY15 Conserved protein n=4 Tax=Clostridium RepID=A0PY15_CLONN Length = 436 Score = 146 bits (368), Expect = 7e-34, Method: Composition-based stats. Identities = 31/232 (13%), Positives = 62/232 (26%), Gaps = 38/232 (16%) Query: 37 ALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE 96 + + V P ++++ + + + + + ++I + A+N G + + Sbjct: 200 DIKTNRFNGKMLIV-PNSKKIVIGFNEESPSK---VGKTTSEIAKENNAICAINAGGFTD 255 Query: 97 S------------------YAPLGLYIENGQQKVAL---NLASGEGNFFIRPGGVFYVAG 135 P G+ I NG+ G V Sbjct: 256 DVSGKSAEVVLNPDSGYETRKPCGILIHNGEFVYNDDKGRKNEKIDIVGFSKRGKLIVGK 315 Query: 136 DKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVF 195 + I+ AV GP L+ +G + R +G + G+ +F Sbjct: 316 ------YTLEELKNINIKEAVSFGPALIVDGNPVNILGDGGWGVAPRTAIGQRRDGSVLF 369 Query: 196 LLSQQA------TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQR 241 L+ D K K LDG + + Sbjct: 370 LVIDGRGFKSMGATIKDVQDIMK-KYGAVNASNLDGGTVSTMYYKDKVINKP 420 >UniRef50_D1REU9 Putative uncharacterized protein n=1 Tax=Legionella longbeachae D-4968 RepID=D1REU9_LEGLO Length = 260 Score = 146 bits (368), Expect = 7e-34, Method: Composition-based stats. Identities = 51/210 (24%), Positives = 85/210 (40%), Gaps = 18/210 (8%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQ 84 L P + P V + V+ + ++ + K + + + + Sbjct: 40 LSPGIEYQDLAGGILAPWSHVYVFRVDLKKNKLGLVNAKNLSLKYAS----VNQFAEHSK 95 Query: 85 VQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLD 144 +++NGG +D + PLGL I NG+ + L S G FFI+ K I L Sbjct: 96 ALLSINGGFFDHKFNPLGLRITNGKLENPLKRISWWGVFFIKNN--------KAYISSLR 147 Query: 145 AFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAV-FLLSQQATN 203 F+ +I FA+QSGP L+ N I P + P +A R+ +GI G + + + A Sbjct: 148 QFQYDNDIDFAIQSGPRLLVNRKI-PSLKPGIAE---RSALGITADGKIILLVTTNAAMT 203 Query: 204 FYDFACYAKA-KLNVEQLLYLDGTISHMYM 232 A ++ L+ + LDG S Sbjct: 204 TNKLAHLLRSPPLSCMDAINLDGGSSSQLY 233 >UniRef50_A4VXL8 Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase n=12 Tax=Firmicutes RepID=A4VXL8_STRSY Length = 312 Score = 146 bits (368), Expect = 8e-34, Method: Composition-based stats. Identities = 42/224 (18%), Positives = 72/224 (32%), Gaps = 17/224 (7%) Query: 36 CALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYD 95 ++ TV + + + A + A ++ + +A+NG Y Sbjct: 84 ETITTNNTTVYVADIQVSSPEY-LKTALAQNTYGTNVTAKTSETAAANNAILAVNGDYYG 142 Query: 96 ESYAPLGLYIENGQQKVALNLASGE-GNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQF 154 + G I+NG + G+ I G F V + K + Sbjct: 143 AN--STGYVIKNGVLYRDTVRDNAAYGDLAIYADGSFEVIYEN---EITAQELIDKGVVN 197 Query: 155 AVQSGPMLMENGVINPRIHPNVA---SSKIRNGVGINKHGNAVFLLSQQATN------FY 205 + GP L+ENG I V SS R+ +GI + + +++ T+ Y Sbjct: 198 LLAFGPSLVENGEIVVDTSTEVGRAMSSNPRSAIGIIDENHYIIVVADGRTSESQGLSLY 257 Query: 206 DFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMIS 249 A K + + LDG S G + IS Sbjct: 258 QLAEVMK-QYGAQTAYNLDGGGSSTLYFNGQVINNPTTNGNTIS 300 >UniRef50_B7AQ96 Putative uncharacterized protein n=1 Tax=Bacteroides pectinophilus ATCC 43243 RepID=B7AQ96_9BACE Length = 305 Score = 145 bits (365), Expect = 2e-33, Method: Composition-based stats. Identities = 38/227 (16%), Positives = 67/227 (29%), Gaps = 21/227 (9%) Query: 37 ALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE 96 L + ++ + + A+G + + + I +A+NG Y Sbjct: 74 ELREYDTSIYVADIQLADASY-LRAGLADGTFGRNVTEVTSQIAQDSNAILAINGDFY-- 130 Query: 97 SYAPLGLYIENGQQKVALNLASGEGN-----FFIRPGGVFYVAGDKVGIVRLDAFKTSKE 151 + G + NG +GN I G V + Sbjct: 131 GFRNKGYVMRNGYLYRETAQQGRQGNSRQEDLVIYEDGHMDVIEEN---EVAAQTLKDSG 187 Query: 152 IQFAVQSGPMLMENGVINPRIHPNVASS---KIRNGVGINKHGNAVFLLSQQAT------ 202 GP L++NG I + V S R +G+ + + +S T Sbjct: 188 ASQIFSFGPGLIKNGNITVDENSEVEQSMQSNPRTAIGMITPLHYIMAVSDGRTEASEGL 247 Query: 203 NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMIS 249 Y A K + + LDG S G + + + + IS Sbjct: 248 TLYQLAQIMKGQ-DCVTAYNLDGGGSSTMWFNGEVVNKPTSYGSKIS 293 >UniRef50_Q5WVS5 Putative uncharacterized protein n=5 Tax=Legionella RepID=Q5WVS5_LEGPL Length = 258 Score = 144 bits (364), Expect = 2e-33, Method: Composition-based stats. Identities = 49/233 (21%), Positives = 88/233 (37%), Gaps = 27/233 (11%) Query: 11 MITLNLKRIFLALT---------LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYW 61 + + IF +T L P + L +P + + ++ ++ + Sbjct: 17 FLLILALAIFTPMTSYSASDWQELTPGIEYQDLEGGLLNPWSHIHVFRIDLNKNQMALVT 76 Query: 62 QKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEG 121 + +A + + +++NGG +D + PLGL I N +Q+ L S G Sbjct: 77 A----KNLAQKNASVDQFAEHSKALLSINGGFFDHEFNPLGLRINNKKQENPLKRISWWG 132 Query: 122 NFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKI 181 F+++ K I + F I FA+QSGP L+ G I P + V Sbjct: 133 IFYVKDN--------KPRITNIRNFHYDSNIDFAIQSGPRLLIRGNI-PSLKAGV---AD 180 Query: 182 RNGVGINKHGNA-VFLLSQQATNFYDFACYAKA-KLNVEQLLYLDGTISHMYM 232 R +GI G + + + A + A ++ L+ + LDG S Sbjct: 181 RTALGITDDGKVIILVTTNAAMSTRQLAQIMRSPPLSCSDAINLDGGSSSQLY 233 >UniRef50_Q2NAA1 Putative uncharacterized protein n=3 Tax=Erythrobacter RepID=Q2NAA1_ERYLH Length = 277 Score = 144 bits (363), Expect = 3e-33, Method: Composition-based stats. Identities = 63/230 (27%), Positives = 105/230 (45%), Gaps = 8/230 (3%) Query: 26 LPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQV 85 A + L+ + + P R+ + + Sbjct: 54 TSNVAAESACERLTFQEVVLTHCVAVPAKHRITTVLGPPHRSFAKLAEGRSSA------P 107 Query: 86 QMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDA 145 A+N G++D P+G Y+E+ ++ ALN G GNF ++P GVFY + + + ++ Sbjct: 108 VFAVNAGMFDGDGKPIGYYVEDSERLQALNTNDGAGNFHLKPNGVFYGSNGEWRVRTTES 167 Query: 146 FKTS--KEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATN 203 F + QF QSGPML+ +G ++P I + S +IRNGVG+++ G A F++S+ + Sbjct: 168 FLANVSDRPQFGTQSGPMLLIDGKLHPEISEDGPSRQIRNGVGVDRQGRAHFVISEGPIS 227 Query: 204 FYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVERK 253 F FA + + N LYLDG +S ++ R P MI VE + Sbjct: 228 FGKFARFFRDVANTPNALYLDGNVSGLWDPANDRMDARAPIGPMIVVETR 277 >UniRef50_C2KZT9 Exopolysaccharide biosynthesis protein n=2 Tax=Firmicutes RepID=C2KZT9_9FIRM Length = 438 Score = 144 bits (363), Expect = 3e-33, Method: Composition-based stats. Identities = 40/236 (16%), Positives = 74/236 (31%), Gaps = 18/236 (7%) Query: 24 TLLPLFAVAADDCALSDPTLTVQAYTVNPQ-TERVKMYWQKANGEAWGTLHALLADINSQ 82 + + Y + + T+ + AN + +D+ + Sbjct: 197 VIGTYSDSKSKITVTRYRAYDSNIYVADVEVTDGTSILSAFANNTYGRNITDTTSDMAEE 256 Query: 83 GQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVR 142 +A+NG Y G I NG + + I G + Sbjct: 257 NNAVLAINGDYY--GARQSGYVIRNGVVYRSQGSNGED--MVISKDGSLSFISES---DT 309 Query: 143 LDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVA---SSKIRNGVGINKHGNAVFLLSQ 199 K+ + GP+L+ENG + + V +S R +G + +F++S Sbjct: 310 TTDSLIQKQTWQVLSFGPVLVENGQVAVSENDEVGMAMASNPRTAIGTVAKNHYLFVVSD 369 Query: 200 QATN------FYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMIS 249 T+ Y+ A + K+ L + LDG S + G + IS Sbjct: 370 GRTSESAGLSLYELANFMKS-LGATNVYNLDGGGSSTMVFQGEVVNNPTTNGNKIS 424 >UniRef50_Q97FU6 Uncharaterized conserved protein, YOME B.subtilis ortholog n=2 Tax=Clostridium RepID=Q97FU6_CLOAB Length = 347 Score = 144 bits (362), Expect = 3e-33, Method: Composition-based stats. Identities = 42/240 (17%), Positives = 71/240 (29%), Gaps = 32/240 (13%) Query: 39 SDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIY--- 94 D T + +P ++ Q G + ++ + + A+NGG + Sbjct: 115 GDGKFTANVLIIKDPNRVKIGYAAQ------IGYVGETTREMAKRYKAVAAINGGYFKDT 168 Query: 95 -------DESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDK---VGIVRLD 144 P G + NGQ + ++ + D VG Sbjct: 169 SPNKQSGGVGAIPTGFIMSNGQIVYPQDNSNWSEITSEEENRALTIDKDGNLQVGGTYSP 228 Query: 145 AFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNF 204 I+ AV + P L++NG N +V+ + R +G + +F++ Sbjct: 229 DQLIKSGIREAVITEPYLIKNGK-NTIQANSVSGTNPRTAIGQRADKSIIFMVIDGRQGV 287 Query: 205 YDFA-----CYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQR------YPFVTMISVERK 253 A KL LDG S G I +I VE K Sbjct: 288 KLGATVGDVQVLMHKLGAVNAACLDGGGSTAMYYNGEIINNPSNATGERAVPDIIYVEPK 347 >UniRef50_A5EW16 Putative uncharacterized protein n=1 Tax=Dichelobacter nodosus VCS1703A RepID=A5EW16_DICNV Length = 263 Score = 144 bits (362), Expect = 4e-33, Method: Composition-based stats. Identities = 76/260 (29%), Positives = 123/260 (47%), Gaps = 19/260 (7%) Query: 12 ITLNLKRIFLALTLLP-LFAVAADDCALS---------DPTLTVQAYTVNPQTERVKMYW 61 + + L++I + + L L AA +V P+ ++++ W Sbjct: 1 MLVALRKIIVPVILSSFLLETAAAHLDFKKVAGGNFARFHHQSVDYAVFMPEHDKIRFLW 60 Query: 62 QKANGEAWGTLHALLADINS-QGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGE 120 Q GE + T+H L + + QV MN GI++++ P GL+IE LN SG+ Sbjct: 61 QNDRGENYQTMHHALRALTNEGYQVHFLMNAGIFNQNAQPAGLWIEKKALLRPLNRRSGK 120 Query: 121 GNFFIRPGGVFYVAGDKVGIVRL-DAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASS 179 GNF I+P GVFY+ +K I+ + +AVQSGP+L+ +G IN R+ N ++ Sbjct: 121 GNFHIQPNGVFYLTQEKAHIITTVQWHNNPPKADYAVQSGPLLIIDGAINSRLPKNHKAA 180 Query: 180 KIRNGVGINKHGNAVFLLS----QQA--TNFYDFACYAKAKLNVEQLLYLDGTISHMYMK 233 RN V ++K F+++ A N Y FA + +Q LYLDG++S Y+ Sbjct: 181 YKRNAVCVDKARRVYFVITTRYDDGAHFPNLYRFAHAL-QTIGCQQALYLDGSLSDFYLP 239 Query: 234 GGAIPWQRYPFVTMISVERK 253 + + F MI+V K Sbjct: 240 MESSRFHWQKFAGMIAVVSK 259 >UniRef50_C2HB28 Exopolysaccharide biosynthesis protein n=4 Tax=Enterococcus faecium RepID=C2HB28_ENTFC Length = 308 Score = 142 bits (359), Expect = 9e-33, Method: Composition-based stats. Identities = 38/234 (16%), Positives = 73/234 (31%), Gaps = 18/234 (7%) Query: 20 FLALTLLPLFAVAADDCALSDPTL-TVQAYTVNPQTERVK-MYWQKANGEAWGTLHALLA 77 F + + + LS + Y + + AN + + Sbjct: 63 FEPVITDNSYQDENINITLSSERVDETTVYVADITVSDSSYLKTALANNTYGRNIKETTS 122 Query: 78 DINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVA-LNLASGEGNFFIRPGGVFYVAGD 136 I + Q +A+NG Y + G + NG + + + I G F + + Sbjct: 123 AIAQEQQAILAINGDYY--GFRDKGYVLRNGTLYRDTPSDDETKEDLVIDKNGDFSIIKE 180 Query: 137 KVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASS---KIRNGVGINKHGNA 193 +++Q + GP L+ENG + V+ S R + + Sbjct: 181 ---AETSAEKLVEEDVQQVLSFGPALVENGEVTVSEDEEVSQSMKSNPRTAIAQVGTNHY 237 Query: 194 VFLLSQQAT------NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQR 241 + ++S+ T + + A K + LDG S G + Q Sbjct: 238 LVVVSEGRTDDSQGLSLSELATVLKNH-GAKTAYNLDGGGSTTLYFNGKVINQT 290 >UniRef50_B0TEY5 Putative uncharacterized protein n=1 Tax=Heliobacterium modesticaldum Ice1 RepID=B0TEY5_HELMI Length = 327 Score = 142 bits (357), Expect = 1e-32, Method: Composition-based stats. Identities = 37/226 (16%), Positives = 65/226 (28%), Gaps = 26/226 (11%) Query: 32 AADDCALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMN 90 + + T + V +P +V + G + ++ + A+N Sbjct: 93 RIEVEDIQGYRFTGKVMIVHDPLRIKVAV------SSKLGEAGETVPEMARREGAVAAIN 146 Query: 91 GGIY-DESYA-----PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLD 144 GG + D + P G+ + GQ ++ E I G V +R Sbjct: 147 GGGFIDPNGQGNGAYPDGITVSRGQFISVIDEDQKENIIGITKKGQMIVGRYSARELRS- 205 Query: 145 AFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQAT-- 202 +I V GP L+ NG R G+G G+ + ++ Sbjct: 206 -----MDISEVVTFGPPLVVNGRPTITSGDGGWGVAPRTGIGQRSDGSIIMVVIDGRQIG 260 Query: 203 ----NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPF 244 + K LDG S + G + Q Sbjct: 261 SIGATLRELQDLLL-KYGAVTAGNLDGGASTTMVYNGKVINQPSSV 305 >UniRef50_C6XT14 Exopolysaccharide biosynthesis protein n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6XT14_PEDHD Length = 303 Score = 141 bits (356), Expect = 2e-32, Method: Composition-based stats. Identities = 38/253 (15%), Positives = 73/253 (28%), Gaps = 38/253 (15%) Query: 25 LLPLFAVAADDCAL--SDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINS- 81 + P + + ++ + VK+ + ++ Sbjct: 50 IQPGVEETDIHYQSQSGGLSTKIFILKIDLKNPDVKLQAATPYDAPGYG-SQTVPEMAKY 108 Query: 82 ----QGQVQMAMNGGIYDES-YAPLGLYIENGQQKVALNLAS------GEGNFFIRPGGV 130 +V +NG ++ S Y PLG+ + G + G I G Sbjct: 109 VDAANNRVIAGINGDFFNTSSYVPLGIIYKKGVAIKPAFTDNTDKPQQGLSFLGILANGK 168 Query: 131 FYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKH 190 Y+ G D +++ A+ +G L+++ + P V R GVGI Sbjct: 169 PYI-----GDKETDYPTIKSQLKEALGAGVFLVKDYKKITQSIPTVD---PRTGVGITDD 220 Query: 191 GNAVFLLSQQA-------TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPW---- 239 F++ N+ + A V+ + LDG S +M Sbjct: 221 DLVYFIVVDGRNFYNSNGINYQEMGKIMYA-FGVKNAVNLDGGGSSTFMIKHPRVDVWQI 279 Query: 240 ---QRYPFVTMIS 249 I+ Sbjct: 280 RNKPSDGSPRAIA 292 >UniRef50_C3QHD0 Exopolysaccharide biosynthesis protein n=2 Tax=Bacteroides RepID=C3QHD0_9BACE Length = 311 Score = 141 bits (356), Expect = 2e-32, Method: Composition-based stats. Identities = 42/246 (17%), Positives = 77/246 (31%), Gaps = 32/246 (13%) Query: 24 TLLPLF-AVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKA--NGEAWGTLHALLADIN 80 TL P A+ + + + + + V+ + + M G+ L + Sbjct: 62 TLAPGVKALEMEILSATGMAVKMFVLEVDLKDTHLTMKASSPKDEGKLKTKQQMTLQALA 121 Query: 81 ---SQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDK 137 +V A+NG + P G+Y NG + F + G + Sbjct: 122 HDKQGSRVLAAVNGDFFATDGTPQGIYYRNGVCLKNTMTDNVCTFFAVTKGKKAVIG--- 178 Query: 138 VGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLL 197 + EIQ AV LM NG + P + + + R +G+ + L+ Sbjct: 179 ---SYDEYDTYKDEIQEAVGGRVRLMTNGNVLP---QTLTALEPRTAIGVTDNNVVYILV 232 Query: 198 SQQATNFY-------DFACYAKAKLNVEQLLYLDGTISHMYMKGGA---------IPWQR 241 + +Y + KA L + + LDG S ++ I Sbjct: 233 ADGRNFWYSNGMRYAEMGAVMKA-LGAKDAINLDGGGSSTFIIRSKAGFEENRFAIRNWP 291 Query: 242 YPFVTM 247 Y + Sbjct: 292 YDNGGV 297 >UniRef50_UPI0000E45D54 PREDICTED: similar to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase precursor n=2 Tax=Strongylocentrotus purpuratus RepID=UPI0000E45D54 Length = 447 Score = 140 bits (354), Expect = 3e-32, Method: Composition-based stats. Identities = 37/227 (16%), Positives = 61/227 (26%), Gaps = 27/227 (11%) Query: 39 SDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE-S 97 S VN V + +G A + Q +A+N G ++ S Sbjct: 100 SGERAPGHIVRVNSPARTVSVLEPFDSGGCTNHHRATVDSTAKQDNCLVAVNAGFFNPRS 159 Query: 98 YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQ 157 A G + NG+ N + IR G + + +T V Sbjct: 160 GACYGNVVSNGRLVQ-TNGGLQNAHLGIRADGTLVFGY----LSEENVLQTENPFIQLVG 214 Query: 158 SGPMLMENGVINPRIHP---------------NVASSKIRNGVGINKHGNAVFLLSQQ-- 200 L+ +G I R VG ++ G V + Sbjct: 215 GVGWLLRDGEIYVEESKKAECGDTEEASSVDLFFNMLSARVAVGSDEKGRLVIAVIDGQT 274 Query: 201 ---ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPF 244 + FA + + V + DG S ++ G I Sbjct: 275 LKRGLSLLSFAKWLLSH-GVTNAINFDGGGSATFVVNGTIVNHPSDP 320 >UniRef50_C6D6X3 Exopolysaccharide biosynthesis protein n=6 Tax=Bacteria RepID=C6D6X3_PAESJ Length = 344 Score = 140 bits (354), Expect = 3e-32, Method: Composition-based stats. Identities = 37/235 (15%), Positives = 66/235 (28%), Gaps = 27/235 (11%) Query: 36 CALSDPTLTVQAYTVNPQ-TERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIY 94 + + Y + ++ + A + + I S A+NG Y Sbjct: 112 VETGSGSDMITYYVADVAFNSKMNLLTAFAKDSFGTNITQNTSTIASNNNAVFAINGDYY 171 Query: 95 DESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQF 154 + G+ I NG G F G + ++ + Sbjct: 172 --GFRSDGVVIRNGTVYRDEPARIGLAMF---NDGTMKSYDE---EETSTDDLLAQGVTN 223 Query: 155 AVQSGPMLMENGVINPRIH----------PNVASSKIRNGVGINKHGNAVFLLSQQA--- 201 A GP L+ +G I ++ +S R G+G+ + VF++ Sbjct: 224 AFSFGPALVTDGEIAGDFSHVEIDKNFGNRSIQNSNPRTGIGMISANHYVFVVVDGRSTG 283 Query: 202 ----TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVER 252 +FA K +L + LDG S G + V Sbjct: 284 YSRGMTLTEFADLFK-ELGATEAYNLDGGGSSTMYFMGRVVNNPLGKGNERGVSD 337 >UniRef50_Q892K3 N-acetylmuramoyl-L-alanine amidase/putative S-layer protein n=1 Tax=Clostridium tetani RepID=Q892K3_CLOTE Length = 708 Score = 140 bits (353), Expect = 4e-32, Method: Composition-based stats. Identities = 44/253 (17%), Positives = 76/253 (30%), Gaps = 32/253 (12%) Query: 24 TLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALL----ADI 79 + + A V + + RV + N + + +++ A I Sbjct: 63 IVPGVTEKAYRFIDKDGKKQYVSLMEIRWTSSRVGVKAGTPNNKDSYGMQSVIMQAKASI 122 Query: 80 NSQGQVQMAMNGGIYDE-SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKV 138 S V +NG Y + P+G+ +NG+ A N A+ F + G + K Sbjct: 123 ASGDNVVGGVNGDFYYTVTGEPIGIVYKNGKAVKA-NHAAEWNFFGVLEDGTPIIGDGK- 180 Query: 139 GIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLS 198 + +Q A+ +L+ G I + R VGI K G F+ Sbjct: 181 -----KYNEVKDSLQEALGGNAILVREGRIYQTPSIGG-YREPRTAVGIKKDGTIFFVTV 234 Query: 199 QQAT-------NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGG------AIPWQR---- 241 + D A L + L LDG S ++ + + Sbjct: 235 DGRQEGHSAGISMPDLAQLMID-LGAVEALNLDGGGSSTFVSRKLGSSDLILKNKPSGGI 293 Query: 242 -YPFVTMISVERK 253 V K Sbjct: 294 MRNVGNSWLVINK 306 >UniRef50_Q97FU3 Uncharaterized conserved protein, YOME B.subtilis ortholog n=6 Tax=Clostridium RepID=Q97FU3_CLOAB Length = 354 Score = 140 bits (352), Expect = 5e-32, Method: Composition-based stats. Identities = 37/236 (15%), Positives = 67/236 (28%), Gaps = 39/236 (16%) Query: 32 AADDCALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMN 90 + + + + +P ++ G ++I A+N Sbjct: 115 QIECKKIQGNKFSGLMLVIHDPTKVKIGYT------SKLGVEGETTSEIAKHNNALAAVN 168 Query: 91 GGIY------------DESYAPLGLYIENGQQKVALNLAS---GEGNFFIRPGGVFYVAG 135 GG + P G+ I +G+ N +G I GV V Sbjct: 169 GGGFQENSSGSKVVWTGTGALPTGIIISDGKVVYPKNPDQLSIQKGTAAITKSGVLVVGD 228 Query: 136 DKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHP----NVASSKIRNGVGINKHG 191 + ++ + A+ GP L+ NGV R + ++ R +G K G Sbjct: 229 HSIRE------LLNENVVEAINFGPTLIVNGVDQTRDSFGNSIDSQGAQPRTAIGQRKDG 282 Query: 192 NAVFLLSQQAT------NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQR 241 + L D + + N + LDG S G + Sbjct: 283 AILLLTVDGRQGLQMGATIKDIQKIMEQE-NAYNAVNLDGGASTTMYYNGHVINNP 337 >UniRef50_C2JZN3 N-acetylmuramoyl-L-alanine amidase/probable S-layer protein n=2 Tax=Lactobacillus rhamnosus RepID=C2JZN3_LACRH Length = 559 Score = 139 bits (351), Expect = 7e-32, Method: Composition-based stats. Identities = 45/226 (19%), Positives = 76/226 (33%), Gaps = 22/226 (9%) Query: 23 LTLLPLFAVAA-DDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLH----ALLA 77 TL P + S + +NP+ ++ A + A Sbjct: 124 ATLTPGVTEQRLTYISQSGTQNKYYSVALNPKNPNTQLLTGTPGDGATSGVQTVSDQASA 183 Query: 78 DINSQGQVQMAMNGGIYD-ESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGD 136 I + QV A+NG ++ S P G I++G + A ++ E F I+ G + Sbjct: 184 AIKNGHQVVAAVNGDLFKIASGVPTGNVIKDGVELHAA-TSARESFFGIKKDGTPIIGD- 241 Query: 137 KVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFL 196 K ++Q A+ +L+ +G +N S+ R VGI G F+ Sbjct: 242 -----EQTYQKVKGDLQQALGGRNILVADGKVNET-KAIGTDSEPRTAVGIKADGTVFFV 295 Query: 197 LSQQAT-------NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGG 235 + + D A + L LDG S Y+ Sbjct: 296 VVDGRQAPTSNGLSMVDLANLMIQR-GAVTALNLDGGGSSTYVARE 340 >UniRef50_C0GEE0 Putative uncharacterized protein n=1 Tax=Dethiobacter alkaliphilus AHT 1 RepID=C0GEE0_9FIRM Length = 379 Score = 139 bits (351), Expect = 7e-32, Method: Composition-based stats. Identities = 32/231 (13%), Positives = 63/231 (27%), Gaps = 24/231 (10%) Query: 27 PLFAVAADDCALSDPT-LTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQV 85 + S V+P RV + +G ++ I + Sbjct: 157 NGVNIEIRHVEKSHNRIWIATVKLVDPTKLRVAF-----AHDEYGAPRKPVSKIANSNNA 211 Query: 86 QMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDA 145 +A+N + P + G+ + G I G+ +G + Sbjct: 212 ILAINASGFS-GNVPFSPVVREGEVYSMDINHTPMG---ITACGMLMDSGKRGV-----E 262 Query: 146 FKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA---- 201 + P+L+ NG + N + R +G ++G+ +F++ Sbjct: 263 QMIEDGAHQVITFRPVLVRNGQM-TSTAQNNNTIHPRTAIGQKENGDLIFIVVDGRRNNW 321 Query: 202 ---TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMIS 249 N D A + LDG S G + + Sbjct: 322 STGINLGDLAQIFIDE-GAAWAYNLDGGGSTTLYFNGKVLNKPSDGRERPV 371 >UniRef50_D1N9W8 Putative uncharacterized protein n=1 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1N9W8_9BACT Length = 275 Score = 139 bits (350), Expect = 8e-32, Method: Composition-based stats. Identities = 32/231 (13%), Positives = 70/231 (30%), Gaps = 28/231 (12%) Query: 25 LLPLFAVAA-DDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQG 83 L P + D + + + T +++ + +G + ++ + Sbjct: 40 LAPGVRLTTFDRVRVFGKPQFISVLRADLSTPGLRLGLAECDGGNY----ETVSHFGRRL 95 Query: 84 QVQMAMNGGIYDESYAPLG--LYIENGQQKV-ALNLASGEGNFFIRPGGVFYVAGDKVGI 140 A+N G + P+G +G+ L F + G + G Sbjct: 96 DALAAVNAGFFAMKGNPMGVRYLKIDGKVLNADLGGDPERAYFVLDQTGRPAIVG----- 150 Query: 141 VRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQ 200 A + + AV +L+++G + P + + R G++ + + ++ Sbjct: 151 ---PADFAPERCRSAVYGNRLLLKDGKVPP--LGDDKARHPRTAAGLSGN-TLLLVVIDG 204 Query: 201 A------TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKG--GAIPWQRYP 243 F + A K L + LDG S G + Sbjct: 205 RARESAGVTFAELATLLKD-LGCTDAVNLDGGGSSTMWTRHHGVVNHPSDN 254 >UniRef50_C8N6Z2 Lipoprotein n=1 Tax=Cardiobacterium hominis ATCC 15826 RepID=C8N6Z2_9GAMM Length = 304 Score = 139 bits (349), Expect = 1e-31, Method: Composition-based stats. Identities = 75/227 (33%), Positives = 111/227 (48%), Gaps = 11/227 (4%) Query: 34 DDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQG-QVQMAMNGG 92 + + + Y +P V ++W+ A+G A+ L L + G +V MN G Sbjct: 77 SYASTTYKNVRYGIYQADPAQ--VSLHWKTADGSAYANLATLKRSLEQSGARVAFLMNAG 134 Query: 93 IYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKT-SKE 151 IY E+ P GL+IE GQ V LN +G+GNF I+P GVFY+ K I A+ + Sbjct: 135 IYSENDTPAGLWIERGQTLVPLNRKNGKGNFHIQPNGVFYIERGKARIQTSAAYHIGNHH 194 Query: 152 IQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA------TNFY 205 +AVQSGP+L+ +G NPR N++S RN V F+L++ +F+ Sbjct: 195 PDWAVQSGPLLLLDGKPNPRFVKNLSSPHKRNAVCTTADNRLYFILTEDYDLGSEWPSFH 254 Query: 206 DFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVER 252 FA L LYLDGT+S Y+ G A + +V +I+V Sbjct: 255 RFAEAL-QHLGCHDALYLDGTLSGWYIPGIAGTFHWTHYVGIIAVTT 300 >UniRef50_C7TED9 N-acetylmuramoyl-L-alanine amidase n=2 Tax=Lactobacillus rhamnosus RepID=C7TED9_LACRG Length = 1561 Score = 138 bits (348), Expect = 2e-31, Method: Composition-based stats. Identities = 40/225 (17%), Positives = 75/225 (33%), Gaps = 21/225 (9%) Query: 23 LTLLPLFAVAA-DDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLH----ALLA 77 TL P + + ++P+ + N + A Sbjct: 117 ATLAPGITEQKLTYLNQNGVQNKYYSVALDPKNPNTTLLAGMPNDGTKPGMQTVRNQANA 176 Query: 78 DINSQGQVQMAMNGGIYDE-SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGD 136 I+ QV A+N Y+ + APLG ++NG + + + E F I+ G + Sbjct: 177 AISHGQQVVAAVNADYYNMATGAPLGNVVKNGTEIYSA-PDTNEAFFGIKKDGTPMIG-- 233 Query: 137 KVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFL 196 + ++Q AV + +++G +N ++ VGI G F+ Sbjct: 234 ----TAATYQQRKGDLQQAVGGPSIFVKDGKVNATQVAGSEGNEPCTAVGIKADGTVFFV 289 Query: 197 LSQQAT-------NFYDFACYAKAKLNVEQLLYLDGTISHMYMKG 234 + + DFA + L+LDG S ++ Sbjct: 290 VIDGRQAPLSTGISVGDFAKLMIER-GAVNALFLDGGGSATFVAR 333 >UniRef50_B2HYZ5 Predicted periplasmic protein n=11 Tax=Acinetobacter RepID=B2HYZ5_ACIBC Length = 204 Score = 138 bits (347), Expect = 2e-31, Method: Composition-based stats. Identities = 63/207 (30%), Positives = 102/207 (49%), Gaps = 8/207 (3%) Query: 12 ITLNLKRI--FLALTLLPLFAVAADDCALSDPTLTVQAYTV-NPQTERVKMYWQKANGEA 68 + + + I F+ T L +D + N + R+ + + Sbjct: 1 MKILVLCIVNFIIFTQSALALEYRQIRNTTDDQFE--VIEISNLEQLRLFLK-NPQTDQY 57 Query: 69 WGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPG 128 + + + + + Q+ AMNGG++ ++P+GLYIENG++ LN G GNFF++P Sbjct: 58 YKSFDNIQYQLKACEQLTFAMNGGMFHSGFSPVGLYIENGRESQPLNEDKGWGNFFLQPN 117 Query: 129 GVFYVAGDKVGIVRLDAFKTSK-EIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGI 187 GV + I+ + +K + +A QSGPML+ NG INP N S KIRNGVG+ Sbjct: 118 GVLAWNDKQAVILTTEQYKAKVFQPDYATQSGPMLVINGKINPLFLANSDSKKIRNGVGV 177 Query: 188 NKHGNAVFLLSQQATNFYDFACYAKAK 214 K+ F++S+ NFY FA + + K Sbjct: 178 -KNNKLYFVISKNRVNFYSFAQFFQKK 203 >UniRef50_C1I4R7 Putative uncharacterized protein (Fragment) n=1 Tax=Clostridium sp. 7_2_43FAA RepID=C1I4R7_9CLOT Length = 894 Score = 138 bits (347), Expect = 2e-31, Method: Composition-based stats. Identities = 33/253 (13%), Positives = 70/253 (27%), Gaps = 33/253 (13%) Query: 24 TLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLAD----I 79 + + + ++ + + V + N E+ L + + Sbjct: 46 LAPGVIEKGYTFEDNTGKRIESFVIEIDTKNKNVSIEASTPNDESAYGLQPVRKQAEALL 105 Query: 80 NSQGQVQMAMNGGIYDE-SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKV 138 V +N Y+ + P G+ +++G F I G + Sbjct: 106 AKGENVVAGVNADFYNMATGEPNGVLLKDGVIIKNH--PESRKFFGILKDGSAVIGD--- 160 Query: 139 GIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLS 198 + ++ A+ +L+++G + + R VGI +GN F+ Sbjct: 161 ---YNKFNEVKDNVEEALGGNAILVKDGQVFETPQTGAD-KEPRTAVGIKSNGNVFFITV 216 Query: 199 QQAT-------NFYDFACYAKAKLNVEQLLYLDGTISHMY-----------MKGGAIPWQ 240 + D A + + Q L LDG S + +K Sbjct: 217 DGRQEPYSAGLSMDDLAQLMIS-MGAIQALNLDGGGSTTHLSRIPGTDNLEVKNRPSDNS 275 Query: 241 RYPFVTMISVERK 253 + K Sbjct: 276 ERSVANSWMIISK 288 >UniRef50_C6IYX5 Putative uncharacterized protein n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6IYX5_9BACL Length = 347 Score = 137 bits (345), Expect = 3e-31, Method: Composition-based stats. Identities = 35/226 (15%), Positives = 70/226 (30%), Gaps = 30/226 (13%) Query: 34 DDCALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGG 92 + +S + TV +P R+ + + ++ + ++ +NGG Sbjct: 97 EIEEISGKSYHGYVLTVNDPTKIRLGVPAK-------RGKGEKVSSMVARTGALAGVNGG 149 Query: 93 IYDE------SYAPLGLYIENGQQK-VALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDA 145 + + + P+G+ I G+ ++ + + G Sbjct: 150 GFADPNWKGNGFKPIGVVISRGKLYYNGISSGAATQIVGLDKQGKMIAGK------YTLE 203 Query: 146 FKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQAT--- 202 IQ AV P ++ NG R R +G + G +F++ Sbjct: 204 ELDKLGIQEAVTFQPRIIVNGKGQIRSQKEGWGIAPRTAMGQREDGAILFVVIDGRQPGY 263 Query: 203 ----NFYDFACYAKAKLNVEQLLYLDGTISHMYMK-GGAIPWQRYP 243 + YD + LDG S + +K GG I + Sbjct: 264 SIGASLYDVQQIMLER-GAVIAANLDGGSSTVLVKEGGEIVNKPSS 308 >UniRef50_B1BC21 Putative uncharacterized protein n=2 Tax=Clostridium botulinum RepID=B1BC21_CLOBO Length = 326 Score = 137 bits (344), Expect = 4e-31, Method: Composition-based stats. Identities = 32/223 (14%), Positives = 58/223 (26%), Gaps = 32/223 (14%) Query: 38 LSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIY-- 94 L + + NP+ RV + G + ++I A+NGG + Sbjct: 100 LENSRFKAYLMEISNPKKVRVGYA------KKLGKVGEPTSEIAKDFNAIAAINGGSFTD 153 Query: 95 ---------DESYAPLGLYIENGQQK-VALNLASGEGNFFIRPGGVFYVAGDKVGIVRLD 144 P G+ + +G+ ++ + G + Sbjct: 154 ETSNGTKYSGTGAFPEGVIMSHGKVIWKTVSTNTKIDIIAFNNEGKLILGK------YTI 207 Query: 145 AFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQ---- 200 A+ P L+ +G A R +G K G +FL++ Sbjct: 208 NELRKLNCIEALCYKPSLIVDGKKAKIKGDGGAGMAPRTAIGQKKDGTILFLVADGTMFK 267 Query: 201 --ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQR 241 + K LDG S G + Sbjct: 268 RDGLRMDELQDILYEK-GAYNATNLDGGSSATMYYDGEVINNP 309 >UniRef50_D1BL19 Putative uncharacterized protein n=4 Tax=Veillonellaceae RepID=D1BL19_VEIPT Length = 312 Score = 136 bits (343), Expect = 7e-31, Method: Composition-based stats. Identities = 27/224 (12%), Positives = 51/224 (22%), Gaps = 27/224 (12%) Query: 36 CALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIY 94 + + +P+ +V ++I A+NGG + Sbjct: 88 EKIQSARYVGYILEIPDPRRIQVGTAA------NIQEKGDTTSNIAKMNNAVAAINGGGF 141 Query: 95 D------ESYAPLGLYIENGQQK--VALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAF 146 P G + +G+ + G Sbjct: 142 HDPNGTGTGRLPYGFILHDGEYVIGKDVGPDEDVDFVGFSKAGNLIAGN------YNKTQ 195 Query: 147 KTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYD 206 + + GP L+ +G R +G K G +FL+ Y Sbjct: 196 LGDMKAMEGITFGPPLIVDGKKMITEGDGGWGVGPRTAIGQKKDGTVLFLVIDGRQPGYS 255 Query: 207 FACYAKA------KLNVEQLLYLDGTISHMYMKGGAIPWQRYPF 244 + + LDG S G + + Sbjct: 256 IGATLRDVQDILFEKGCYIAANLDGGSSSTLYLNGKVVNKPADL 299 >UniRef50_B2KU41 N-acetylmuramoyl-L-alanine amidase/putative S-layer protein (Fragment) n=1 Tax=Lactobacillus rhamnosus HN001 RepID=B2KU41_LACRH Length = 470 Score = 136 bits (342), Expect = 7e-31, Method: Composition-based stats. Identities = 45/226 (19%), Positives = 76/226 (33%), Gaps = 22/226 (9%) Query: 23 LTLLPLFAVAA-DDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLH----ALLA 77 TL P + S + +NP+ ++ A + A Sbjct: 124 ATLTPGVTEQRLTYISQSGTQNKYYSVALNPKNPNTQLLTGTPGDGATSGVQTVSDQASA 183 Query: 78 DINSQGQVQMAMNGGIYD-ESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGD 136 I + QV A+NG ++ S P G I++G + A ++ E F I+ G + Sbjct: 184 AIKNGHQVVAAVNGDLFKIASGVPTGNVIKDGVELHAA-TSARESFFGIKKDGTPIIGD- 241 Query: 137 KVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFL 196 K ++Q A+ +L+ +G +N S+ R VGI G F+ Sbjct: 242 -----EQTYQKVKGDLQQALGGRNILVADGKVNET-KAIGTDSEPRTAVGIKADGTVFFV 295 Query: 197 LSQQAT-------NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGG 235 + + D A + L LDG S Y+ Sbjct: 296 VVDGRQAPTSNGLSMVDLANLMIQR-GAVTALNLDGGGSSTYVARE 340 >UniRef50_UPI0001744B4D hypothetical protein VspiD_05050 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001744B4D Length = 235 Score = 135 bits (341), Expect = 1e-30, Method: Composition-based stats. Identities = 62/204 (30%), Positives = 96/204 (47%), Gaps = 12/204 (5%) Query: 36 CALSDPTLTVQAYTVNPQTE-RVKMYWQKANGEAWGTLHALLADINSQGQVQ-MAMNGGI 93 + V+ R+ + W +G+ G+ LL + QG+ A N GI Sbjct: 8 ERIEFEGAIYHVLRVDRADFSRLDLRWLGQDGKPLGSFGPLLQEAARQGRRIEFATNAGI 67 Query: 94 YDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAF--KTSKE 151 Y+ P GL I G++ V LNLA GEGNF++ P GVFY+ V A ++ + Sbjct: 68 YERGPKPCGLTIAGGKELVPLNLAKGEGNFYLHPNGVFYLDDQTGAGVMTGAEYGQSGLQ 127 Query: 152 IQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINK-HGNAVFLLSQ------QATNF 204 + A QSGP+L+ G I+P + N + ++RN VG+ G VF++S F Sbjct: 128 PRLATQSGPILLRQGKIHPAFNFNSPNRRLRNAVGVRASDGQVVFVMSDREDRVKGRVTF 187 Query: 205 YDFACYAKAKLNVEQLLYLDGTIS 228 + + + L + L+LDG IS Sbjct: 188 HQLSRFFL-HLGCQDALFLDGDIS 210 >UniRef50_A0YND3 Putative uncharacterized protein n=1 Tax=Lyngbya sp. PCC 8106 RepID=A0YND3_9CYAN Length = 304 Score = 134 bits (336), Expect = 4e-30, Method: Composition-based stats. Identities = 35/272 (12%), Positives = 76/272 (27%), Gaps = 36/272 (13%) Query: 13 TLNLKRIFLALT----LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMY--WQKANG 66 L + L L + L + ++ T +++ + Sbjct: 29 LGLLLKRPLPTAKQEQLFQGITYQRIRRS-KPDPLMIHIVKIDLTTPGIELLVTPGEQGE 87 Query: 67 EAWGTLHALLADINSQGQVQMAMNGGIYDESY--APLGLYIENGQQKVALNLASGEGNFF 124 + ++ + +Q+A+NG + Y P+ Y +G++ A +G + Sbjct: 88 DDQDISAQTTSEFLQKHYLQLAINGSFFHPFYVHNPIDYYPNSGERVNIFGQAISQGKIY 147 Query: 125 IRPGG---VFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVIN-PRIHPNVASSK 180 V ++ K + D K + +L++ G + + Sbjct: 148 SIVNKGWSVLCISPKKKAEIYFDT--CPKNTLQGIAGNLILIDQGQPIKVKKFSDANQKF 205 Query: 181 IRNGVGINKHG-NAVFLLSQQATNFYD-------FACYAKAKLNVEQLLYLDGTISHMYM 232 R V I+K G +L ++Y + VE L DG S + Sbjct: 206 PRTAVAIDKTGETLWLILIDGRQSWYSKGVTLATLTNIIQELDGVETALNFDGGGSTTLV 265 Query: 233 KGG----AIPWQR---------YPFVTMISVE 251 + P + + Sbjct: 266 ISEGTDTKVLNAPFHSRIPMRQRPVANHLGIY 297 >UniRef50_C8WTH1 N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase-like protein exopolysaccharide biosynthesis protein n=4 Tax=Alicyclobacillus acidocaldarius RepID=C8WTH1_ALIAD Length = 352 Score = 132 bits (333), Expect = 8e-30, Method: Composition-based stats. Identities = 44/273 (16%), Positives = 82/273 (30%), Gaps = 42/273 (15%) Query: 5 LLIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTV-NPQTERVKMYWQK 63 L G + +++R + P + L +PT V +P+ RV Sbjct: 94 NLSGPTIPIQDIQRRDFSHINDPTVQM----ITLHEPTFNAFILLVKDPKRIRVV----- 144 Query: 64 ANGEAWGTLHALLADINSQGQVQMAMNGGIY------DESYAPLGLYIENGQQKVALN-L 116 + + + +N G + P G+ I +G+ Sbjct: 145 -ATKYLHVRGETVMQMVQDSGAIAGINAGGFVDTNWQGTGAYPQGITITDGKLVSMTGSP 203 Query: 117 ASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNV 176 + + G S ++ V GP+L+ENG P + Sbjct: 204 SQPQPVIAFTKEGQMIAG------TYSLNQLRSLDVWQCVGFGPVLVENGK--PTVSAEN 255 Query: 177 ASSKIRNGVGINKHGNAVFLLSQQA---------TNFYDFACYAKAKLNVEQLLYLDGTI 227 + R +G K G + L++ +F D A + + + LDG Sbjct: 256 YAVNPRTAIGQTKDGTVILLVTDGRYATGPNDVGASFADVARIML-QFHADIAANLDGGS 314 Query: 228 SHMYMKGGAIPWQR------YPFVTMISVERKG 254 S ++ G + + T I V +G Sbjct: 315 SATFVYKGRMWNRPVDILGARAVATSIVVMPEG 347 >UniRef50_UPI000178A82C copper amine oxidase domain protein n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI000178A82C Length = 377 Score = 132 bits (332), Expect = 1e-29, Method: Composition-based stats. Identities = 41/238 (17%), Positives = 75/238 (31%), Gaps = 22/238 (9%) Query: 29 FAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMA 88 A + + + Q TV+ +V++ A +A L I + +A Sbjct: 149 VTSARKTFKVGARSFSAQVVTVSLLHPKVELDVVLAGNKA--GKVEDLRSIAKRSNAVVA 206 Query: 89 MNGGIYDESYA-----PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRL 143 +NG +D + P G + G + + + + G V Sbjct: 207 INGTFFDAYTSGAYKAPYGYLVSKGNIFHKASGDNRTIFTYDSNNLATMMPGLDFKSVY- 265 Query: 144 DAFKTSKEIQFAVQSGPMLMENGVINPRIHPNV-------ASSKIRNGVGINKHGNAVFL 196 + ++ A+Q+GP L+ NG + + R+ +GI K + L Sbjct: 266 ----ETGRMEGALQAGPRLLTNGKVTLDVKKEGFKDPKILTGGGARSALGITKDHKLILL 321 Query: 197 LSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGA-IPWQRYPFVTMISVERK 253 + A K Q + LDG S G+ + I V+ K Sbjct: 322 TT-GGATIPQLAEIMKQA-GAYQAMNLDGGASSGLYYNGSYLTTPGRQISNAIVVKYK 377 >UniRef50_C0ZEQ6 Putative uncharacterized protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0ZEQ6_BREBN Length = 356 Score = 131 bits (330), Expect = 2e-29, Method: Composition-based stats. Identities = 41/259 (15%), Positives = 74/259 (28%), Gaps = 36/259 (13%) Query: 15 NLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLH 73 NL + L L V D + + + +P R+ + +K G+ Sbjct: 112 NLVTVTLPEKPKELIEVEDIDVSKGSYYFKGKIMYISDPSRVRLVVTNRKDRGD------ 165 Query: 74 ALLADINSQGQVQMAMNGGIY------DESYAPLGLYIENGQQKVALNLASGEGNFFIRP 127 LL + ++ +N + + G+ I G+ N SGE + Sbjct: 166 -LLDEFVNKTGAIGIVNASGFADPDGYGKGARAYGVVIHEGKILQGYNPRSGETALGLTY 224 Query: 128 GGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGV-INPRIHPNVASSKIRNGVG 186 G ++ AV P L+ NG + + R +G Sbjct: 225 DGKLITG------SYSAEQLVKMGVRDAVSFRPQLIVNGKNMFEGKPAKSWGIQPRTAIG 278 Query: 187 INKHGNAVFLLSQQAT-------NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIP- 238 + G VF + + D A + V + +DG S M + G Sbjct: 279 QKEDGTIVFAVIDGRQPGHSIGASMNDMAELLAER-GVVTAMAMDGGSSSMMLHNGEAIT 337 Query: 239 ------WQRYPFVTMISVE 251 + +V Sbjct: 338 KTSSPYHRGRYLPNAWAVF 356 >UniRef50_B2V2N5 Putative uncharacterized protein n=8 Tax=Clostridium RepID=B2V2N5_CLOBA Length = 348 Score = 131 bits (330), Expect = 2e-29, Method: Composition-based stats. Identities = 34/230 (14%), Positives = 64/230 (27%), Gaps = 35/230 (15%) Query: 37 ALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIY- 94 + + NP +V M + G L +++ + A+NGG + Sbjct: 118 DIHTDRYDGYMLEIENPHKVKVAMT------KYLGKLGQKTSEMAEEHNAIAAINGGSFV 171 Query: 95 ----------DESYAPLGLYIENGQQKVALNLASGEGN---FFIRPGGVFYVAGDKVGIV 141 P G I +G+ + + G V Sbjct: 172 DKSSDGITYAGTGGQPGGFVISSGKVVYPIGKCNEHSVENVIAFTKKGQLIVGN------ 225 Query: 142 RLDAFKTSKEIQFAVQSG-PMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQ 200 A ++Q A+ P ++ NG+ + + R VG + G +FL Sbjct: 226 HTLAELKKLDVQEAMCFREPNVIINGIRQHKKEDYIDGINPRTAVGQKEDGTVLFLALDG 285 Query: 201 A------TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPF 244 Y+ +++ LDG S G + + Sbjct: 286 RKLSKPGATIYEVQEIMRSR-GAINAGMLDGGYSTTMYYKGDVINSPNAW 334 >UniRef50_C1CWE2 Putative LysM lysin domain protein, n=1 Tax=Deinococcus deserti VCD115 RepID=C1CWE2_DEIDV Length = 442 Score = 131 bits (329), Expect = 2e-29, Method: Composition-based stats. Identities = 43/250 (17%), Positives = 86/250 (34%), Gaps = 12/250 (4%) Query: 10 GMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAW 69 GM L +R+ + + A L + VQ V+ + V + + Sbjct: 189 GMKILVAQRVPVPIPPRA-TGKAVTFKQLRPLNIPVQLVRVDLRHRDVLVAPVLPHAGLV 247 Query: 70 GTLHALLADINSQGQVQMAMNGGIYDE-SYAPLGLYIENGQQKVALNLASGEGNFFIRPG 128 L A + + + Q +NG + +YAP G + G+ I P Sbjct: 248 FGLGARVGQLAQRSGAQALINGSYFHPRTYAPAGDIVMQGRML---TWGRIPMALAITPD 304 Query: 129 GVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRI-----HPNVASSKIRN 183 + ++R T + ++ + +GP ++ G ++ P + R+ Sbjct: 305 NRATIRATTTPLLRRPLDTTWRGMETVIATGPRIVTGGAVHTNYNQVFRDPALFGRAARS 364 Query: 184 GVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMY-MKGGAIPWQRY 242 VG++ + + V + ++ + +L V++ L LDG S G A+ Sbjct: 365 AVGLSSNRDLVMVSTRVRLTTTEMGKVM-TRLGVKEALLLDGGSSAGLAWNGRAVLDSMR 423 Query: 243 PFVTMISVER 252 I V Sbjct: 424 KVSYGIGVFT 433 >UniRef50_A9KDD2 Hypothetical exported protein n=6 Tax=Coxiella burnetii RepID=A9KDD2_COXBN Length = 255 Score = 130 bits (328), Expect = 3e-29, Method: Composition-based stats. Identities = 43/214 (20%), Positives = 74/214 (34%), Gaps = 19/214 (8%) Query: 22 ALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINS 81 + A + + A+ +NP+ + ++ A Sbjct: 34 PGMAYTVVTPAFSSESRPGLFTHLYAWKINPRQYHFNIVTA----KSLQQTALYAAQAAK 89 Query: 82 QGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIV 141 +A+NGG + + PLGL I + + +L S G F I+ + I Sbjct: 90 IKDTVLAINGGFFTPNLEPLGLRISDNKVLSSLKRISWWGIFMIKNN--------RAAIT 141 Query: 142 RLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA 201 ++ S EI FA+Q+GP L+ +G I S R+ +G+ G+ + ++ Sbjct: 142 SPQNYRYSPEINFAIQAGPRLIIDGRI----PQLRGGSAQRSALGVTPTGDIIIAITDNN 197 Query: 202 --TNFYDFACYAKAKLNVEQLLYLDGTISHMYMK 233 A KL L LDG S Sbjct: 198 LLLTATQLA-ILLQKLGCSNALNLDGGTSSQLFV 230 >UniRef50_B1I1S0 N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase-like exopolysaccharide biosynthesis protein n=1 Tax=Candidatus Desulforudis audaxviator MP104C RepID=B1I1S0_DESAP Length = 345 Score = 130 bits (327), Expect = 4e-29, Method: Composition-based stats. Identities = 32/237 (13%), Positives = 60/237 (25%), Gaps = 25/237 (10%) Query: 30 AVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAM 89 V L V P V ++ + + + Sbjct: 115 RVEVKIFELKGIGYRGYIAKVKPFDPGVLRVT------YREGPGETTSEAVRRTGAVLGV 168 Query: 90 NGGIYD-------ESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVR 142 NGG + P+G + +G+ + F G V GI Sbjct: 169 NGGGFYRAPVDGLMHTLPIGNTMVDGKLVGGFQPPREDLFFAGFDGRGRLVG----GIFN 224 Query: 143 LDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQAT 202 + V P+L+++ P + R +G +G+ + ++ Sbjct: 225 DRTALLGTGARQGVSFVPILIKDRQPVPIPEKWRNQRQPRTILGEYANGDLIMIVVDGRQ 284 Query: 203 -------NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVER 252 D K V LDG S +++ G I + + Sbjct: 285 ADWSSGVTLEDL-QVTLIKFGVIDAYNLDGGGSSVFVFGNQILNRPSDGRERVVATN 340 >UniRef50_C3R3M8 Putative uncharacterized protein n=1 Tax=Bacteroides sp. 2_2_4 RepID=C3R3M8_9BACE Length = 329 Score = 129 bits (324), Expect = 9e-29, Method: Composition-based stats. Identities = 38/288 (13%), Positives = 71/288 (24%), Gaps = 58/288 (20%) Query: 13 TLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTL 72 +K+ TL +V L + G+ + Sbjct: 41 LGWVKQTTEFGTLPEYISVYKSPSELEGMKAIAFIAVADMSKANFATI-----GDQIYSK 95 Query: 73 HALLADINSQGQVQMAMNGGIYDES-YAPLGLYIENGQQK---------VALNLASGEGN 122 Q + + MNGG + + L G+ + G Sbjct: 96 TPNQIWQAEQQKYPIIMNGGYFVMGAGKSVSLLCREGEVLAVNSQEEIRSQKSYYPTRGI 155 Query: 123 FFIRPGGVF---YVAGDKVGIVRLDAFK---------------------TSKEIQFAVQS 158 F + G F + G+ + A+ Sbjct: 156 FQLSKNGYFSTDWAYTTTDGVTYTYEQPSPNKSGYEPQPAPSAYFPTRGVKLNAETAIGG 215 Query: 159 GPMLMENGVINPRIHPNV---------ASSKIRNGVGINKHGNAVFLLSQQA-------- 201 GP+L+++G + + S R +G+ + +F + + Sbjct: 216 GPILLKDGSVRNTFIEELFDEESGVAPESYHPRTAIGVTANNKVIFFVCEGRSVTEGVKG 275 Query: 202 TNFYDFACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRYPFVTMI 248 N A K+ L + LDG S M + G + I Sbjct: 276 MNMAMMANILKS-LGCVDAMNLDGGGSTCMLVNGQPVIKPSAGAQRAI 322 >UniRef50_O31980 SPBc2 prophage-derived uncharacterized protein yomE n=2 Tax=root RepID=YOME_BACSU Length = 644 Score = 129 bits (324), Expect = 1e-28, Method: Composition-based stats. Identities = 33/247 (13%), Positives = 68/247 (27%), Gaps = 29/247 (11%) Query: 28 LFAVAADDCALSDPTLTVQAYTVNPQT-------------ERVKMYWQKANGEAWGTLHA 74 F V + + + V P+T + + T Sbjct: 58 YFTVTSSFKQDATLGIEYYVTKVTPKTTEAKKSMVQKTFAYDFEKSIDPTSSYFGTTNRE 117 Query: 75 LLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLA--SGEGNFFIRPGGVFY 132 + + + + +A+N + + +GL I++G + A +G G Sbjct: 118 TVLSMAKRKRSVVAINASGWRSNGEVMGLQIKDGVLYKDYDAAGYTGAEACVFFDDGTMK 177 Query: 133 VAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINP---RIHPNVASSKIRNGVGINK 189 V G++ SK + + G L+++ ++ R +G Sbjct: 178 VYGNR---EVDADILISKGARNSFAFGIWLVKDSKPRTAQMTTWADLNVKHPRQAIGQRS 234 Query: 190 HGNAVFLLSQQA------TNFYDFACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRY 242 G V + YD ++ LDG S ++G I Sbjct: 235 DGTLVIITVDGRSLRSSGITAYDMPSLFLSE-GCINAFLLDGGGSSQTAVEGKYINNISD 293 Query: 243 PFVTMIS 249 + Sbjct: 294 GIERAVV 300 >UniRef50_B7H7U4 Putative uncharacterized protein n=27 Tax=Bacillus cereus group RepID=B7H7U4_BACC4 Length = 365 Score = 129 bits (324), Expect = 1e-28, Method: Composition-based stats. Identities = 27/217 (12%), Positives = 55/217 (25%), Gaps = 23/217 (10%) Query: 43 LTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE------ 96 + T+ + + G ++ + + +A+N + + Sbjct: 115 FEGKLVTI---SNPFNVKLVSHQGTQGANRGEKISVMAKRNHALVAVNASGFADETGRGG 171 Query: 97 SYAPLGLYIENGQQKVALNLASGEGNF-FIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFA 155 G+ IENG+ + + G K++ A Sbjct: 172 GNVATGIVIENGKAIDTNMDRNAPTIITGLTKFGQMITGN------YSTQQLLDKQVVSA 225 Query: 156 VQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQAT------NFYDFAC 209 P L+ NG S R+ + + G +FL+ + Sbjct: 226 AGFMPQLIVNGEKMITEGDGGWGSAPRSIMAQKEDGTIMFLVIDGRQTHSIGATLKECQD 285 Query: 210 YAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVT 246 K + +DG S GG + Sbjct: 286 ILYEK-GAINAMAMDGGSSATLYLGGKVINSPSTLSH 321 >UniRef50_C9KSV8 N-acetylmuramoyl-L-alanine amidase/putative S-layer protein n=1 Tax=Bacteroides finegoldii DSM 17565 RepID=C9KSV8_9BACE Length = 390 Score = 129 bits (324), Expect = 1e-28, Method: Composition-based stats. Identities = 39/204 (19%), Positives = 67/204 (32%), Gaps = 26/204 (12%) Query: 60 YWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASG 119 + ++ + + A LA S +V A+NG + + P G+Y NG + Sbjct: 183 FLGTSSSYYYVSRDAALAYDKSGSRVLAAVNGDFFAKDGTPQGIYYRNGTCLKGTMTDNV 242 Query: 120 EGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASS 179 F I + + + IQ AV LM NG + P V + Sbjct: 243 CTFFAITKNKRAIIG------SYDEYDSYKENIQEAVGGRVRLMTNGNVLP---QTVTAL 293 Query: 180 KIRNGVGINKHGNAVFLLSQQATNFY-------DFACYAKAKLNVEQLLYLDGTISHMYM 232 + R +G+ L++ +Y + KA L + + LDG S ++ Sbjct: 294 EPRTAIGVTDDNVVYILVADGRNFWYSNGMRYAEMGAVMKA-LGAKNAINLDGGGSSTFI 352 Query: 233 KGG---------AIPWQRYPFVTM 247 AI Y + Sbjct: 353 IRKIAGFEDGRFAIRNWPYDNGGV 376 >UniRef50_UPI000180BA0C PREDICTED: similar to predicted protein n=1 Tax=Ciona intestinalis RepID=UPI000180BA0C Length = 621 Score = 129 bits (323), Expect = 1e-28, Method: Composition-based stats. Identities = 42/258 (16%), Positives = 71/258 (27%), Gaps = 38/258 (14%) Query: 8 GKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTV-NPQTERVKMYWQKANG 66 + + + R F++ L ++ + T+ NP + G Sbjct: 68 SENLFPVTNTRYFVSDIALNQWSTYNK------NHVYGHVTTIYNPSKTFSVLEAT--YG 119 Query: 67 EAWGTLHALLADINSQGQVQMAMNGGIYDESYA-PLGLYIENGQQKVALNLASGEGNFFI 125 G + +GQ +A NGG ++ LG I G+ + + F I Sbjct: 120 GCQGNQLDTSVNAARRGQCFVAQNGGYFNTKTQSCLGNVISRGRTLHTSDATNAH--FGI 177 Query: 126 RPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHP----------- 174 G V DA V L+ NG Sbjct: 178 LSNGSIVVGYI------SDADLRRLNFTNLVGGVIWLVRNGTSFVEESVSMESSDTEETG 231 Query: 175 ----NVASSKIRNGVGINKHGNAVFLLSQQ-----ATNFYDFACYAKAKLNVEQLLYLDG 225 R +G +KHG V + N F+ + L++ + LDG Sbjct: 232 TLRYFSDVQSARTAIGHDKHGWVVLVQVDGQTGARGVNLNSFSKFLIEDLHLVNAINLDG 291 Query: 226 TISHMYMKGGAIPWQRYP 243 S + G + Sbjct: 292 GGSATLVINGTLANTPSD 309 >UniRef50_C6J074 Copper amine oxidase domain-containing protein n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6J074_9BACL Length = 406 Score = 129 bits (323), Expect = 1e-28, Method: Composition-based stats. Identities = 36/255 (14%), Positives = 77/255 (30%), Gaps = 22/255 (8%) Query: 12 ITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGT 71 + + + A + + + Q T++ +V++ A Sbjct: 161 LPYTSRSALTSAGASKAIVSARKTFKVGGRSFSTQMVTISLMDPKVRLKVALAGD--AVG 218 Query: 72 LHALLADINSQGQVQMAMNGGIYDESY-----APLGLYIENGQQKVALNLASGEGNFFIR 126 L+ + + + +A+NG ++ AP G + G+ K+ + + Sbjct: 219 KVEELSSLAKRHKAVVAINGTFFNAYTDNAYKAPYGYIVSGGELKMKASGDKRTIFTYDS 278 Query: 127 PGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNV-------ASS 179 + GD A+Q+GP L+ NG + + Sbjct: 279 NLLARLIPGDDFNDAFNAGTMEG-----ALQAGPRLVVNGKVAVDVKAEGFKDPKILTGG 333 Query: 180 KIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISH-MYMKGGAIP 238 R+ +G+ + + L + A K Q + LDG S +Y G + Sbjct: 334 GARSALGLTRDHKLILLTT-GGATIPQLAEIMKQA-GAYQAMNLDGGASSGLYYNGKYLT 391 Query: 239 WQRYPFVTMISVERK 253 + V + Sbjct: 392 QPGRKISNALIVTYQ 406 >UniRef50_C5PL46 Exopolysaccharide biosynthesis protein n=2 Tax=Sphingobacterium spiritivorum RepID=C5PL46_9SPHI Length = 288 Score = 127 bits (320), Expect = 3e-28, Method: Composition-based stats. Identities = 29/238 (12%), Positives = 62/238 (26%), Gaps = 25/238 (10%) Query: 25 LLPLFAVAADDCA-LSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQG 83 L P L + ++ Q + + T S+ Sbjct: 39 LKPGIIWKQGHFDNLFKAEQEINFIEIDLQKIKQPIRLAG-----LQTGFKNTTTFASEA 93 Query: 84 QVQMAMNGGIYDE-SYAPLGLYIENGQQKVALNLASG-EGNFFIRPGGVFYVAGDKVGIV 141 A+NG ++ + L N Q L G R K+ I+ Sbjct: 94 NALAAINGAFFNTKTGGGTTLVRINKQLINETVLKEGKSPKRSFRSNAALAFDTKKIVII 153 Query: 142 ----RLDAFKTSKEIQFAVQSGPMLM-ENGVINPRIHPNVASSKIRNGVGINKHGNAVFL 196 R + ++ + GP+L+ ++ + + R+ + + + + Sbjct: 154 KGDDRDSTWDKKIKMPNVMTCGPLLLHKSHRAYLDSNAFNNNRHPRSAIALTTEHKLILI 213 Query: 197 LSQQA------TNFYDFACYAKAKLNVEQLLYLDGTISHMYMK-----GGAIPWQRYP 243 + + + K + L LDG S G + + Sbjct: 214 TVDGRNAQAYGMSLIELSNVMKWLKG-KDALNLDGGGSTTLYIKDEGKNGIVNYPTDN 270 >UniRef50_C0ZFU4 Putative uncharacterized protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0ZFU4_BREBN Length = 359 Score = 127 bits (319), Expect = 4e-28, Method: Composition-based stats. Identities = 29/226 (12%), Positives = 65/226 (28%), Gaps = 26/226 (11%) Query: 32 AADDCALSDPTLTVQAYTV---NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMA 88 +L + V +P ++ + + + + + +A Sbjct: 132 TVKVYSLQEGGYRGYMAKVRLNDPNALKMVL-----ANNSVKSKGETTSQAGKRTGSILA 186 Query: 89 MNGGIYDESY----APLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLD 144 +N G + PLG+ + +G+ + + G K Sbjct: 187 INAGGFMSDKQGNLTPLGITVVDGK-IRTFSNNAKLSFVGFNNKGHLVGTSIK-----TQ 240 Query: 145 AFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA--- 201 A T + I P L++ G P + + R +G +G+ + ++ Sbjct: 241 AQITQQGILQGASFLPRLLQGGKRLPIPREWANARQPRTLIGHFDNGDLLLIVIDGRRDG 300 Query: 202 ----TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYP 243 + A + +V LDG S + G + + Sbjct: 301 WSNGVTLEE-AQRKLQEWHVVDAYNLDGGGSSAFYYNGKLLNKPSG 345 >UniRef50_D0TN59 Predicted protein n=3 Tax=Bacteroides RepID=D0TN59_9BACE Length = 315 Score = 126 bits (316), Expect = 8e-28, Method: Composition-based stats. Identities = 30/257 (11%), Positives = 68/257 (26%), Gaps = 31/257 (12%) Query: 4 QLLIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQK 63 Q L+ + + L + + + ++ V + Sbjct: 42 QKLLAGSETVARVYTDTSFVVALGVTETDVHFQKADSRSTHIFIIDIDLNEPGVSLEVGM 101 Query: 64 ANGEAWGTL--HALLADINS-----QGQVQMAMNGGIYDESYAPL-GLYIENGQQKVALN 115 L ++ +V +N +D S + G NG Sbjct: 102 PYDADVRNNFQRQTLTEMADYADRPWHRVAAMINADFWDVSTMDIRGPIHRNGVILKNSF 161 Query: 116 ------LASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVIN 169 + +A ++ ++ SG +++ +G I+ Sbjct: 162 IFKETLPQQALSFIALTKDNKMVIAD------SVEYRGMQYNLKEVTGSGVIVLRDGEIS 215 Query: 170 PRIHPNVASSKIRNGVGINKHGNAVFLLSQQAT-------NFYDFACYAKAKLNVEQLLY 222 +P + R +G + G+ F+++ + + KA L + Sbjct: 216 GATYPGID---PRTCLGYSDDGHVYFMVADGRVEFYSYGLTYPEMGSIMKA-LGCSWAVN 271 Query: 223 LDGTISHMYMKGGAIPW 239 LDG S + I Sbjct: 272 LDGGGSTQMLIRHPIAD 288 >UniRef50_Q1IXP5 Peptidoglycan-binding LysM domain-containing protein n=2 Tax=Deinococcus RepID=Q1IXP5_DEIGD Length = 444 Score = 125 bits (315), Expect = 9e-28, Method: Composition-based stats. Identities = 38/226 (16%), Positives = 71/226 (31%), Gaps = 11/226 (4%) Query: 34 DDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGI 93 L + VQ V+ + V + A ++ + Q +NG Sbjct: 214 SFKQLKALNIPVQVLRVDLRHRNVLVAPVLPRTGLGTAGGARVSTLARTSGAQAVVNGSY 273 Query: 94 YDE-SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEI 152 + SYAP G + G+ I P + ++ + + Sbjct: 274 FHPRSYAPAGDLVVQGRLL---AWGRIPVALAITPDNRAAIMTSTTPLLGRPLEVSWHGM 330 Query: 153 QFAVQSGPMLMENGVINPRI-----HPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDF 207 + + +GP ++ G + + P + R+ VG+ + + VF+ + + Sbjct: 331 ETVIATGPRILNGGTVVRQYASAFRDPALFGRAARSAVGLKSNRDLVFVTTHAKLTTTEM 390 Query: 208 ACYAKAKLNVEQLLYLDGTISHMY-MKGGAIPWQRYPFVTMISVER 252 A+L V L LDG S G A+ I V Sbjct: 391 GKVM-ARLGVRDALLLDGGSSAGLAWNGQAVLDSVRKVAYGIGVFT 435 >UniRef50_B3CE38 Putative uncharacterized protein n=3 Tax=Bacteroides RepID=B3CE38_9BACE Length = 285 Score = 125 bits (315), Expect = 9e-28, Method: Composition-based stats. Identities = 31/227 (13%), Positives = 63/227 (27%), Gaps = 31/227 (13%) Query: 32 AADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNG 91 A+ +L V + P+ R + ++ + +A+NG Sbjct: 49 EAEFVSLYGVPQHVTILEIKPERHRFDILIHSP--------KEETSNAARRSGAVVAING 100 Query: 92 GIYD-ESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSK 150 ++ + + ++G G G + K+ I+ Sbjct: 101 SYFNIKQGTSICYLRKDGVVVDTTAT----GVLSTVSNGAVKIDKGKLDIIAWKKQDEKT 156 Query: 151 EIQ---FAVQSGPMLMENGVINPRIH---PNVASSKIRNGVGINKHGNAVFLLSQQA--- 201 Q + SGP+++ +G V + R+ V + K G + Sbjct: 157 CEQKEGSILVSGPLMLLDGKTCDLSACNRSFVQTKHPRSAVALMKDGTVFLIAVAGRFEG 216 Query: 202 ----TNFYDFACYAKAKLNVEQLLYLDGTISHMYM----KGGAIPWQ 240 N + + L + L LDG S I + Sbjct: 217 KAEGINIPELTHLLRV-LGARKALNLDGGGSTTLWSASAPDNGIVNK 262 >UniRef50_Q9UK23 N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase n=29 Tax=Chordata RepID=NAGPA_HUMAN Length = 515 Score = 125 bits (314), Expect = 1e-27, Method: Composition-based stats. Identities = 33/228 (14%), Positives = 63/228 (27%), Gaps = 29/228 (12%) Query: 38 LSDPTLTVQAYT-VNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE 96 D + V P + G A + + ++A NGG + Sbjct: 85 FRDRAVAGHLTRAVEPLR-TFSVLEPGGPGGCAARRRATVEETARAADCRVAQNGGFFRM 143 Query: 97 S-YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFA 155 + LG + + ++ + F IR G + + T Sbjct: 144 NSGECLGNVVSDERRVSSSGGLQ-NAQFGIRRDGTLVTGY----LSEEEVLDTENPFVQL 198 Query: 156 VQSGPMLMENGVINPR---------------IHPNVASSKIRNGVGINKHGNAVFLLSQQ 200 + L+ NG I V R +G ++ G V + Sbjct: 199 LSGVVWLIRNGSIYINESQATECDETQETGSFSKFVNVISARTAIGHDRKGQLVLFHADG 258 Query: 201 -----ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYP 243 N ++ A + + +V + LDG S ++ G + Sbjct: 259 HTEQRGINLWEMAEFLLKQ-DVVNAINLDGGGSATFVLNGTLASYPSD 305 >UniRef50_D2NR45 Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase n=3 Tax=Micrococcineae RepID=D2NR45_9MICC Length = 356 Score = 125 bits (314), Expect = 1e-27, Method: Composition-based stats. Identities = 33/215 (15%), Positives = 65/215 (30%), Gaps = 27/215 (12%) Query: 46 QAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYI 105 + + + AN + + +++ S+ A+NG Y + G+ I Sbjct: 134 FVADIKLDNATL-LRSAFANNKFGQNIIDTPSNMASEHNGIWAINGDYY--GFRTTGIVI 190 Query: 106 ENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMEN 165 NG G G + + S+ + + GP L+++ Sbjct: 191 RNGVVYRDSGAREG---LAFYRDGSVKLYDETA---TNAQTLVSEGVWNTLSFGPALVKD 244 Query: 166 GVINPRIHP----------NVASSKIRNGVGINKHGNAVFLLSQQA-------TNFYDFA 208 I I ++ ++ R GVG+ + VF++ +FA Sbjct: 245 SAIVDGIDSVEVDTNFGNHSIQGNQPRTGVGVLGTNHLVFIVVDGRSTNYSRGVTMPEFA 304 Query: 209 CYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYP 243 K L LDG S + + + Sbjct: 305 QMFKD-LGCVSAYNLDGGGSSAMVFNNKLVNRPQG 338 >UniRef50_A7LRK4 Putative uncharacterized protein n=1 Tax=Bacteroides ovatus ATCC 8483 RepID=A7LRK4_BACOV Length = 326 Score = 125 bits (314), Expect = 1e-27, Method: Composition-based stats. Identities = 26/257 (10%), Positives = 63/257 (24%), Gaps = 38/257 (14%) Query: 28 LFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGT------LHALLADINS 81 + + V+ V + + + + Sbjct: 75 GVRITDVIFTYCAKPTRMIIAEVDLNKN-VTIVTSTPDNKPEVGKILQQVTVQAEKAEAA 133 Query: 82 QGQVQMAMNGGIYDESY---APLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKV 138 +V + NG Y + P GL+ ++G + F++ G ++ Sbjct: 134 GRKVILGTNGDFYSKKNDLWIPGGLFYKDGVAIKTEIGWEADHVFYMLKDGTAHIT---- 189 Query: 139 GIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVA--SSKIRNGVGINKHGN-AVF 195 + +E+ A+ ++++G + N R VG++ Sbjct: 190 --SVPEFKLVEREVVHAIGGWQRMVQDGEVVKNFTVNDNAMQFHPRTFVGVSADNRKVYL 247 Query: 196 LLSQQATNFYDFACYAKAKL------NVEQLLYLDGTISHMYMKG---------GAIPWQ 240 + Y + + Q +DG S ++ + Sbjct: 248 FVVDGRQPEYSNGMRLEDMMLLCQGAGCYQAFNMDGGGSTTMVRRVEKGSSVSFEVMNQP 307 Query: 241 RYPFV----TMISVERK 253 + V K Sbjct: 308 SDNPARSVINGLQVIEK 324 >UniRef50_A6L610 Putative uncharacterized protein n=1 Tax=Bacteroides vulgatus ATCC 8482 RepID=A6L610_BACV8 Length = 287 Score = 124 bits (312), Expect = 2e-27, Method: Composition-based stats. Identities = 26/184 (14%), Positives = 55/184 (29%), Gaps = 19/184 (10%) Query: 75 LLADINSQGQVQMAMNGGIYD-ESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYV 133 + + Q + A+NG + + +N +R G ++ Sbjct: 86 TTSQLAEQSRSSAAINGSYFSIKEGFSTCYLRKNEAVIDTTTTEER----HLRVNGAVHM 141 Query: 134 AGDKVGIVRLDAFKTSKEIQ---FAVQSGPMLMENGVI---NPRIHPNVASSKIRNGVGI 187 + + I+ + K + SGP+LM++G + R+ + + Sbjct: 142 VDNNIRIIPWNDENEKKGFPLDGDILASGPLLMQDGKTCDFTTIDREFSETRHPRSAIAL 201 Query: 188 NKHGNAVFLLSQQA-------TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQ 240 K G+ + + + + A + L L LDG S G + Sbjct: 202 TKEGDIMLVAVDGRAEGHADGMSIAELAYLLRI-LKAHCALNLDGGGSTTLWVNGQVVNH 260 Query: 241 RYPF 244 Sbjct: 261 PSDN 264 >UniRef50_C6D0A3 Exopolysaccharide biosynthesis protein n=1 Tax=Paenibacillus sp. JDR-2 RepID=C6D0A3_PAESJ Length = 349 Score = 124 bits (312), Expect = 3e-27, Method: Composition-based stats. Identities = 30/233 (12%), Positives = 61/233 (26%), Gaps = 36/233 (15%) Query: 42 TLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIY------ 94 + T+ NP ++ + I + + A+N + Sbjct: 112 NYKGKIITISNPNRVKLV-------SSKLSDHGEQIFVIAKRAKALAAINASGFVDLDGH 164 Query: 95 DESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQF 154 A G+ IE+G + N + E I GV + +Q+ Sbjct: 165 GNGGASTGVVIEDG-VIKSQNKNTKEFVAGITKDGVMITGK------YSANELVNLGVQY 217 Query: 155 AVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATN------FYDFA 208 A P L+ NG R +G G+ +F++ + Sbjct: 218 AAGFKPQLIVNGQKMV-EGDGGWGWGPRTAIGQKADGSIIFVVIDGRQTRSVGASIKEVQ 276 Query: 209 CYAKAKLNVEQLLYLDGTISHMYMKGG-------AIPWQRYPFVTMISVERKG 254 + + +DG S G + + ++ + Sbjct: 277 DLLYER-GAVNAMCMDGGSSSSMYFNGDNITIPSSRNNIPRYLPNIWALIPQA 328 >UniRef50_C5RID5 Putative uncharacterized protein n=1 Tax=Clostridium cellulovorans 743B RepID=C5RID5_CLOCL Length = 347 Score = 124 bits (311), Expect = 3e-27, Method: Composition-based stats. Identities = 37/229 (16%), Positives = 62/229 (27%), Gaps = 32/229 (13%) Query: 37 ALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYD 95 + + +P + M + G +++ A+NGG + Sbjct: 118 KIEHDRYIAHILEIKDPTKIKAVMT------KYVGKNGQKTSEMALDYDAIAAINGGAFA 171 Query: 96 E-----------SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGI-VRL 143 + P G I NG N + V + K+ + Sbjct: 172 DVSASGQKWAGNGAIPGGFVITNGAIVYP----KENVNKYDVQNVVAFTKEGKLVVGDYC 227 Query: 144 DAFKTSKEIQFAVQSGP-MLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQAT 202 + + A+ P ++ +GV + R +G G V L+ T Sbjct: 228 INDLMAMGVTEAMCFRPPSIIIDGVAQIT-DKLQDGTNPRTAIGQKADGTVVLLVIDGRT 286 Query: 203 ------NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFV 245 YD K LNV LDG S G I + Sbjct: 287 LSMPGATLYDVQQIFKD-LNVVNAGNLDGGYSSTMYFNGEIINSPNAWS 334 >UniRef50_C8WU56 N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase-like protein exopolysaccharide biosynthesis protein n=2 Tax=Alicyclobacillus acidocaldarius RepID=C8WU56_ALIAD Length = 296 Score = 123 bits (310), Expect = 4e-27, Method: Composition-based stats. Identities = 31/223 (13%), Positives = 60/223 (26%), Gaps = 32/223 (14%) Query: 38 LSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIY-- 94 + +P T V +P+ + G + + + +NGG + Sbjct: 77 IHEPNFTAYVLWVRDPRRVEIV------ETRYAGDVGETVEQFVNDWHAVAGVNGGSFTD 130 Query: 95 ----DESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSK 150 G+ I NG+ + G A + Sbjct: 131 TNWQGTGGLVQGIVISNGRILKRASGPES--IVGFTADGRLISG------TYTLAELQAM 182 Query: 151 EIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA--------- 201 + A+ GP L++ G ++ R +G G + +++ Sbjct: 183 GVTQALMFGPTLVDRG-VDQIQGAGDWGYAPRTAIGQTADGTVILMVTDGRELHGPADIG 241 Query: 202 TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPF 244 + D A + L LDG S + G + Q Sbjct: 242 ASLGDIARLMIS-LGAVTAANLDGGSSATLVYDGCLINQPTDI 283 >UniRef50_C6J7B9 Exopolysaccharide biosynthesis protein n=2 Tax=Bacillales RepID=C6J7B9_9BACL Length = 355 Score = 123 bits (308), Expect = 7e-27, Method: Composition-based stats. Identities = 38/232 (16%), Positives = 67/232 (28%), Gaps = 28/232 (12%) Query: 34 DDCALSDPTLTVQAYTVNPQTER---VKMYWQKAN------GEAWGTLHALLADINSQGQ 84 + + ++ Y VNP T R +K+ + + G+ + Sbjct: 116 PFETIQSDRIRIELYKVNPGTYRGYAMKIRLKSPDAMKMTLGKDRLGGAETTMQAVQRYG 175 Query: 85 VQMAMNGGIYDESY---APLGLYIENGQQKVALNLASGEGNF-FIRPGGVFYVAGDKVGI 140 +N G + +S PL I NGQ + + F + G + Sbjct: 176 AVAGINAGGFADSRGQRYPLSTTILNGQYVNGFEPSYKDLFFVGLNQSGQLIGGKFQ--- 232 Query: 141 VRLDAFKTSKEIQFAVQSGPMLMENGV--INPRIHPNVASSKIRNGVGINKHGNAVFLLS 198 + +F P+L++NGV P R +G K + L+ Sbjct: 233 --NKESLDKLKPKFGASFVPILLQNGVKLPIPDKWKTSPLRAPRTVIGNYKDDQLLVLVV 290 Query: 199 QQ-------ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYP 243 + A L V+ LDG S + G + Sbjct: 291 DGDNEKGRSGATLEELQNKL-ANLGVQDAYNLDGGGSSSLVVNGRVINHPSD 341 >UniRef50_Q8A0T0 Putative uncharacterized protein n=10 Tax=Bacteroides RepID=Q8A0T0_BACTN Length = 308 Score = 122 bits (307), Expect = 8e-27, Method: Composition-based stats. Identities = 36/210 (17%), Positives = 69/210 (32%), Gaps = 26/210 (12%) Query: 40 DPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDES-Y 98 T ++ +NP+ K G A+ ++ I + Q A+NG +D + Sbjct: 80 QGTQSINILEINPK-------TGKKIGIAFTGQLEKISRIARKHQAIGAINGSYFDMTKG 132 Query: 99 APLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQF---A 155 + Q +L +R G Y KV ++ D + Sbjct: 133 NSVCFLKVGSQVVDTTSLDE----LKLRVTGAVYEKKGKVKLIPWDRQIEKNYKKNKGSV 188 Query: 156 VQSGPMLMENGVINPRIH---PNVASSKIRNGVGINKHGNAVFLLSQQA-------TNFY 205 + SGP+++++G + + R+ + + + G +F+ N Sbjct: 189 LASGPLMLKDGEYYDWSQCNANFIETKHPRSAICLTEEGKILFVTVDGRSPENAVGINIP 248 Query: 206 DFACYAKAKLNVEQLLYLDGTISHMYMKGG 235 + A L + L LDG S G Sbjct: 249 ELAHLL-HVLGGKDALNLDGGGSTALWLSG 277 >UniRef50_A9WEC1 Putative uncharacterized protein n=3 Tax=Chloroflexus RepID=A9WEC1_CHLAA Length = 265 Score = 122 bits (306), Expect = 1e-26, Method: Composition-based stats. Identities = 45/238 (18%), Positives = 88/238 (36%), Gaps = 31/238 (13%) Query: 24 TLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQG 83 TL A L P L VQ ++P R + + + L+ ++ Sbjct: 49 TLRTGIA----FRQLEAPGLPVQVVRIDPAHVRFVVGYDPTSP-------LTLSAWVARY 97 Query: 84 QVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRL 143 A+NGG +D+ P+ L I N Q G ++ GG+F + + Sbjct: 98 GAVAAINGGFFDQQGEPVALLISNQQV---------FGYSYVDQGGMFAIDEQGKPHLWS 148 Query: 144 --DAFKTSKEIQFAVQSGPMLME-NGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQ 200 D A+Q P+L+ NG + R+ + ++++G + +++ Sbjct: 149 LADQPYDGTPFVQAIQGWPLLVRTNGEA--AYTDDDGQRARRSAIALDRNGYVLLIVAPG 206 Query: 201 AT-NFYDFACYAKAK-LNVEQLLYLDGTISHMYMK----GGAIPWQRYPFVTMISVER 252 AT + +++ + + L++E + LDG S + GG P + + Sbjct: 207 ATFSLAEWSQFLASADLDIEIAVNLDGGSSSGLIAQSDQGGVRVDSFTPLPFALLILE 264 >UniRef50_C1XS52 Predicted periplasmic protein (DUF2233) n=1 Tax=Meiothermus silvanus DSM 9946 RepID=C1XS52_9DEIN Length = 294 Score = 122 bits (306), Expect = 1e-26, Method: Composition-based stats. Identities = 42/254 (16%), Positives = 79/254 (31%), Gaps = 27/254 (10%) Query: 6 LIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKAN 65 LI + RI T + V + V A VN V + Sbjct: 48 LISNTIHPGQRLRIRPPATSFSVKLVTRPVL-----KVPVLAVHVNLAHPEVSIRSLLPP 102 Query: 66 GEAWGTLHALLADINSQGQVQMAMNGGIYDE-SYAPLGLYIENGQQKVALNLASGEGNFF 124 +L + + ++ A+NGG + ++ P G + G Q V ++ + Sbjct: 103 PGVGRG-GEVLQRLAWRTRLVAAINGGYFHPRTFWPAGDLVVGGHQLVKGSI---QTALA 158 Query: 125 IRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNV------AS 178 I P V +T + + + +GP ++ G + Sbjct: 159 ITPDKRARVMVG---------PQTWRGYETVIANGPYILRRGRLVVTPRAEGYNDPAIWG 209 Query: 179 SKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMY-MKGGAI 237 R+ VG+ +F+ ++ + AKL ++ + LDG S KG + Sbjct: 210 RARRSAVGVVNERYLIFVSTKMELTLSELGKVM-AKLGAKEAIVLDGGSSTGLVWKGETL 268 Query: 238 PWQRYPFVTMISVE 251 I + Sbjct: 269 IRPGRALSYGIGIF 282 >UniRef50_C6XXH4 Putative uncharacterized protein n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6XXH4_PEDHD Length = 289 Score = 122 bits (306), Expect = 1e-26, Method: Composition-based stats. Identities = 33/240 (13%), Positives = 63/240 (26%), Gaps = 39/240 (16%) Query: 42 TLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYD-ESYAP 100 + + R + A + + ++ A+NG +D ++ Sbjct: 58 NQNISYLEIK-NKGRSPVLAISAEEKVL----KTTSTFGTENNALAAVNGSFFDVKNGGS 112 Query: 101 LGLYIENGQQK------VALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQF 154 + G+ + A + + G + + Sbjct: 113 VDFIKVGGKVLAENRLEKNDSRARHQQAAVVISNGKLALKKWDGTADWEQRLTE----EN 168 Query: 155 AVQSGPMLMENGV-INPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA------TNFYDF 207 + SGP+LM NG S R +GI +G + L + + Sbjct: 169 VLLSGPLLMLNGTDEALDSTSFSRSRHPRTAIGIKPNGRILLLTVDGRNSNSAGMSLTEL 228 Query: 208 ACYAKAKLNVEQLLYLDGTISHMYM----KGGAIPWQR-----------YPFVTMISVER 252 A K L + LDG S GG + +I V++ Sbjct: 229 AKTMK-WLGCTSSINLDGGGSTTLWVSGFPGGGVVNYPTDNKLWDHAGQRKVANVILVKK 287 >UniRef50_A5USB9 Putative uncharacterized protein n=2 Tax=Roseiflexus RepID=A5USB9_ROSS1 Length = 282 Score = 122 bits (306), Expect = 1e-26, Method: Composition-based stats. Identities = 47/234 (20%), Positives = 93/234 (39%), Gaps = 19/234 (8%) Query: 23 LTLLPLFAVAA--DDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADIN 80 + V SDP + + A ++P T R+++ + + T Sbjct: 54 IVAASGVEVRTFTTGEDSSDPPVPIYAVRLDPATIRLRIRYAPDAPQPLRT-------WF 106 Query: 81 SQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGI 140 + +A+NGG + L + +G + A G P G ++ Sbjct: 107 VAHRPLVAVNGGFFTAENRATALIVSDGTV-YGTSYAGFGGMLAAAPDGRVWIQA----- 160 Query: 141 VRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLS-Q 199 +R + + + + A+QS PML+ G + I+ N R V I++ G + ++ Sbjct: 161 LRDEPYDPNIPLDQAIQSFPMLIYPGGVVASINDNG-QRARRTVVAIDRAGRVLLIVCPT 219 Query: 200 QATNFYDFACYAKAK-LNVEQLLYLDGTISH-MYMKGGAIPWQRYPFVTMISVE 251 A + + A + + + +++ L LDG S +++ GA+ WQ F + SV Sbjct: 220 SAFSLQELATWLASSDMEIDRALNLDGGSSSGIFVNAGAVRWQIDSFAALPSVI 273 >UniRef50_UPI0001BC3362 hypothetical protein BcroD2_01243 n=1 Tax=Butyrivibrio crossotus DSM 2876 RepID=UPI0001BC3362 Length = 356 Score = 122 bits (306), Expect = 1e-26, Method: Composition-based stats. Identities = 41/229 (17%), Positives = 64/229 (27%), Gaps = 27/229 (11%) Query: 35 DCALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGI 93 +S T V +P V +G+ G + I + +A+N G Sbjct: 135 FYEVSGSTFAGTMVVVTDPSRVFV-----GTSGDYKGEAGINVPAICDKYGATLAINAGG 189 Query: 94 YDE------SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFK 147 +++ PLG+ + GQ K N+ S VF + Sbjct: 190 FEDIGGVGNGGTPLGIVMSEGQLKYG-NVNSSYDLIGFDNNNVFVIGQM------TGQQA 242 Query: 148 TSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATN---- 203 + I+ AV GP L+ NG R +G G + L+ Sbjct: 243 IDRGIRDAVSFGPFLILNGTPLEVSGMGG-GLNPRTAIGQRADGAVLLLIIDGRQTHSLG 301 Query: 204 --FYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISV 250 D LDG S + G I + V Sbjct: 302 ASMNDLINVMLD-FGAVNAANLDGGGSTVLYYDGEIKNKISSIYGARGV 349 >UniRef50_A7V127 Putative uncharacterized protein n=1 Tax=Bacteroides uniformis ATCC 8492 RepID=A7V127_BACUN Length = 277 Score = 122 bits (306), Expect = 1e-26, Method: Composition-based stats. Identities = 31/229 (13%), Positives = 67/229 (29%), Gaps = 31/229 (13%) Query: 35 DCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIY 94 +L V + ++P+ R + + A+NG + Sbjct: 50 FSSLYGVPQEVSIFEISPKRYRFDVLVHNP--------KEETSIAARHAGAVAAINGSYF 101 Query: 95 DE-SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSK--- 150 D + + ++G + G G + ++ ++ + Sbjct: 102 DMKAGNSVCYLRKDGVVIDTTST----GVLATVSNGAVLIKKGRLELIPWSKQEEKACTL 157 Query: 151 EIQFAVQSGPMLMENGVINPRIHPN---VASSKIRNGVGINKHGNAVFLLSQQA------ 201 + + SGP+++++G + N V + R+ V + + G + ++ Sbjct: 158 KKGTVLASGPLMLKDGQVCDLSGTNRNFVDTKHPRSAVALTREGKILLIVVDGRRKGKAE 217 Query: 202 -TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKG----GAIPWQRYPFV 245 N + A L E L LDG S G I Sbjct: 218 GINIPELAH-MIRILGGEDALNLDGGGSSTLWSGALPDKGIANTPSGSA 265 >UniRef50_UPI0001923977 PREDICTED: similar to predicted protein, partial n=1 Tax=Hydra magnipapillata RepID=UPI0001923977 Length = 290 Score = 122 bits (305), Expect = 2e-26, Method: Composition-based stats. Identities = 33/248 (13%), Positives = 64/248 (25%), Gaps = 38/248 (15%) Query: 22 ALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINS 81 + + D T+ + + + +G A + + Sbjct: 54 STFITNYIGYETDHLQFGHRTVIKNPLKM------LSVLEPLKSGGCKTNSLAYVHESAK 107 Query: 82 QGQVQMAMNGGIYDES------YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAG 135 Q ++A+N G ++ G I NG N NF IR G + Sbjct: 108 QQNCRIAVNAGFFNPFETDKDYGKCYGNIISNGNLVQD-NGGIQNANFGIRSDGTLVIGY 166 Query: 136 DKVGIVRLDAFKTSKEIQFAVQSGPMLMENG---------------VINPRIHPNVASSK 180 + + + ++ NG + Sbjct: 167 ----LPEKEVIDKKNPFLQLLSGVGWILRNGSSYLKESEKAECKESETTGTLDKFFNVKS 222 Query: 181 IRNGVGINKHGNAVFLLSQQ-----ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGG 235 R +G + G+ + N Y+ Y K K+ + + DG S Y++ Sbjct: 223 ARTMIGYDAKGHVHIVQFDGKTGKSGINLYEAVEYLK-KIGLINAINFDGGGSATYVQDS 281 Query: 236 AIPWQRYP 243 I Sbjct: 282 IILNYPSN 289 >UniRef50_C6XT12 NHL repeat containing protein n=2 Tax=Pedobacter heparinus DSM 2366 RepID=C6XT12_PEDHD Length = 646 Score = 121 bits (304), Expect = 2e-26, Method: Composition-based stats. Identities = 37/266 (13%), Positives = 74/266 (27%), Gaps = 23/266 (8%) Query: 3 HQLLIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQ 62 Q LI + + + + + + + VN +V M Sbjct: 55 TQKLIDETSLIGTVISDDEITVAPGVTETDIHYTDTAGKAMHLFILKVNLNEPQVFMEVA 114 Query: 63 KANGEAWGTLHALLADIN----SQGQVQMAMNGGIYDES-YAPLGLYIENGQQKVALNLA 117 + A + V +NG +D S P+G+ +NG + Sbjct: 115 TPFNLPAYARQTVPAQAAEIDTATHMVIAGINGDFFDTSTGIPMGIVHKNGSIVKSTFND 174 Query: 118 SGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVA 177 + + + S ++ + SG ML+ N + + Sbjct: 175 NTLKPQQAVSFFGVTENNVPIIDFKSGYAALSSQLYNSTGSGVMLVNN---HLPVSQPYT 231 Query: 178 SSKIRNGVGINKHGNAVFLLSQQA-------TNFYDFACYAKAKLNVEQLLYLDGTISHM 230 + R VG + +G F++ N+ A NV+ + LDG S Sbjct: 232 AIDPRTSVGYDDNGIVYFVVIDGRDAPYSNGMNYAQLTSAFMA-FNVKNAVNLDGGGSST 290 Query: 231 YMKGGAIPW-------QRYPFVTMIS 249 +M + ++ Sbjct: 291 FMTRNPVTNLLQVRNQPSDGTARAVA 316 >UniRef50_B2J8B3 Putative uncharacterized protein n=1 Tax=Nostoc punctiforme PCC 73102 RepID=B2J8B3_NOSP7 Length = 276 Score = 121 bits (304), Expect = 2e-26, Method: Composition-based stats. Identities = 39/257 (15%), Positives = 72/257 (28%), Gaps = 38/257 (14%) Query: 11 MITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWG 70 + T++ + + P + P + + Sbjct: 3 LWLNLRSSTQSISTVVSSPPKSIRYFERTLPQSIAHVLFI-PVNSKFLVTPA------LS 55 Query: 71 TLHALLADINSQGQVQMAMNGGIYDESYA-PLGLYI----------ENGQQKVALNLASG 119 A + + + + +N G +D + I EN + NL S Sbjct: 56 QKVATVEEFAQKHRAVAILNAGFFDPANQKTTSYVILQRKLVADPKENERLVNNPNLKSY 115 Query: 120 EGNFFIRPGGVFYVAGDKVG---IVRLDAFKTSKEIQFAVQSGPMLM------ENG---V 167 F R Y G V ++ + ++ A+ +GP L+ + G Sbjct: 116 LSQIFNRTEFRRYSCGQTVRYDIVLHSASQPAGCQLVDAIGAGPSLLPELTLEKEGFVDN 175 Query: 168 INPRIHPNVASSKIRNGVGINKHGNAVFLLSQ-------QATNFYDFACYAKAKLNVEQL 220 N R R VGI G+ V ++ + A + K L ++ Sbjct: 176 ANKRDALGSNQPNARTAVGITHDGSVVLVMVAQKPSAPANGISLPALANFMK-TLGADKA 234 Query: 221 LYLDGTISHMYMKGGAI 237 + LDG S G Sbjct: 235 MNLDGGSSSSLYYNGKT 251 >UniRef50_C4ICA6 Peptidase, M56 family n=1 Tax=Clostridium butyricum E4 str. BoNT E BL5262 RepID=C4ICA6_CLOBU Length = 568 Score = 121 bits (303), Expect = 3e-26, Method: Composition-based stats. Identities = 27/194 (13%), Positives = 53/194 (27%), Gaps = 25/194 (12%) Query: 32 AADDCALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMN 90 + L+ + +P+ +V + + + I A+N Sbjct: 387 EIELYQLTTDKYKGYYMEIKDPKRIKVGVAVK------LNEEGQTASKIAQNYNAVAAIN 440 Query: 91 GGIY----------DESYAPLGLYIENGQQKVALNLASGEG-NFFIRPGGVFYVAGDKVG 139 GG + P+G+ + G+ + + F I V Sbjct: 441 GGGFLDQSSTGYWNGTGGIPVGIIMSKGEVIYNDVEETEKTELFAIDKQRQMIVG----- 495 Query: 140 IVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQ 199 K +Q AV GP L+ +G ++ R +G + G + L+ Sbjct: 496 -TYSVEDLKEKGVQEAVSFGPSLIIDGKMSEMTGDGGWGIAPRTAIGQKEDGTIILLVID 554 Query: 200 QATNFYDFACYAKA 213 K Sbjct: 555 GR-GIGSLGATLKE 567 >UniRef50_UPI0001694670 hypothetical protein Plarl_22443 n=1 Tax=Paenibacillus larvae subsp. larvae BRL-230010 RepID=UPI0001694670 Length = 363 Score = 121 bits (303), Expect = 3e-26, Method: Composition-based stats. Identities = 27/220 (12%), Positives = 60/220 (27%), Gaps = 21/220 (9%) Query: 47 AYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIE 106 Y +P++ RV + +K GE ++ + ++ +AP+G + Sbjct: 125 MYVFDPRSIRVVVPGKKGEGERITSMVERTGAVAGVNGGGF-IDPDGLGNGFAPIGAILS 183 Query: 107 NGQQKVALNLAS-GEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMEN 165 G+ + G + + ++ AV P ++ N Sbjct: 184 GGKVLYNDQKEDIPQHIVGFTDKGTLVIGK------YSIDQLRAMKVSEAVSFYPRVIAN 237 Query: 166 GVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQAT------NFYDFACYAKAKLNVEQ 219 G R +G G +F++ + + Sbjct: 238 GKPLITKGDGGWGRAPRTALGQRADGTVIFVVIDGRQAHSVGATLREVQDLLLEQ-GCIN 296 Query: 220 LLYLDGTISHMYMKGGAIPWQR------YPFVTMISVERK 253 +LDG S +K + Q + + + Sbjct: 297 AGFLDGGASSEMVKDRKLLTQPSSRYGERRLPSGFLIFNR 336 >UniRef50_C6CV17 Exopolysaccharide biosynthesis protein n=1 Tax=Paenibacillus sp. JDR-2 RepID=C6CV17_PAESJ Length = 355 Score = 121 bits (303), Expect = 3e-26, Method: Composition-based stats. Identities = 27/218 (12%), Positives = 56/218 (25%), Gaps = 20/218 (9%) Query: 38 LSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDES 97 + A + ++ + G + +N G + + Sbjct: 132 IKADNFQSYAMKIKLKSGD---AMKMVLGNDKVGGAETTLAAVQRYGAVAGVNAGGFADG 188 Query: 98 Y---APLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQF 154 PL I NG + F G+ G + ++F Sbjct: 189 GGKRYPLSTTILNGDYVEGFEPTRADLFFV----GLNASNKLVGGKFTSKQQLDNLNVKF 244 Query: 155 AVQSGPMLMENGVI--NPRIHPNVASSKIRNGVGINKHGNAVFLLSQQAT-------NFY 205 P+L++NG P + + R + K G + +++ Sbjct: 245 GASFVPVLLKNGSPTTIPSKWQSSPTRAPRTVIANYKDGQLLIIVADGRNEGGSSGATLA 304 Query: 206 DFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYP 243 + +L LDG S + G + + Sbjct: 305 EM-QILLQRLGAVDGYNLDGGGSSSMIWNGRVINKPSD 341 >UniRef50_C9PX63 Putative uncharacterized protein n=1 Tax=Prevotella sp. oral taxon 472 str. F0295 RepID=C9PX63_9BACT Length = 294 Score = 120 bits (302), Expect = 3e-26, Method: Composition-based stats. Identities = 32/240 (13%), Positives = 72/240 (30%), Gaps = 39/240 (16%) Query: 43 LTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLG 102 TV + P+ ++ A+G A + ++ + + + +NG + + Sbjct: 65 QTVTVAEITPKRS-LEFDIAIADGG------ATVGEMAQRTKALVGINGSYFGMNKRSAI 117 Query: 103 LYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIV-RLDAFKTSKEIQFA--VQSG 159 Y+ G+ + + +R G G K+ I+ + + + SG Sbjct: 118 TYLRQGRTVLDTTTTAE---LALRVTGAIRTHGRKLRIMPWNKEIERRYHCRHGSTLASG 174 Query: 160 PMLMENGV---INPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA-------TNFYDFAC 209 +L+ G + V R+ + + G +F+ N + Sbjct: 175 HLLLYRGQSILLRSSSMGFVVKKHPRSAIALTSRGTVLFVTVDGRHPGYAGGMNLIELRH 234 Query: 210 YAKAKLNVEQLLYLDGTISHMYMKGG----AIPWQR-----------YPFVTMISVERKG 254 + +L + LDG S G + + V ++G Sbjct: 235 FL-QQLGCTDAINLDGGGSTTLWAKGFSTKGVANYPCDNRKFDHDGERKVANAVVVMKRG 293 >UniRef50_C0CND1 Putative uncharacterized protein n=1 Tax=Blautia hydrogenotrophica DSM 10507 RepID=C0CND1_9FIRM Length = 454 Score = 120 bits (302), Expect = 3e-26, Method: Composition-based stats. Identities = 28/184 (15%), Positives = 54/184 (29%), Gaps = 16/184 (8%) Query: 66 GEAWGTLHALLADINSQGQVQMAMNGGIY-DESYAPL-GLYIENGQQKVALNLASGEGNF 123 G +G ++ + + +NG + S P G + + ++G Sbjct: 233 GGTYGNPRRTVSQELADHNGVLGINGSGFSYSSGIPAPGKSMIKDRTVYEDVYSNGNIMC 292 Query: 124 FIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNV-ASSKIR 182 GG+F ++++ GP L+ENG R Sbjct: 293 VTGEGGMF-----TAPAGMTVQEMLQRDVKDTYCFGPTLVENGEAFEISEQFQQTYRYQR 347 Query: 183 NGVGINKHGNAVFLLSQQ-------ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGG 235 VG+ G+ ++ + + L+ E LDG S + G Sbjct: 348 TAVGMISPGDYYLVIVDGKGVGGSQGMTYEELQQVFLD-LDCEYAYNLDGGGSTTLVFKG 406 Query: 236 AIPW 239 + Sbjct: 407 RVIN 410 >UniRef50_B4CZJ8 Putative uncharacterized protein n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4CZJ8_9BACT Length = 251 Score = 120 bits (302), Expect = 4e-26, Method: Composition-based stats. Identities = 44/247 (17%), Positives = 74/247 (29%), Gaps = 25/247 (10%) Query: 15 NLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHA 74 + + +T+ T NP+ + GT Sbjct: 22 WVLKESADRPAPTELEFTERHVQGDAGDVTLWVVTFNPKACAFAVMDNPTGAFDLGTASE 81 Query: 75 LLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVA 134 + +NGG + PLGL + G + L A GV V Sbjct: 82 -------KRGALAGVNGGYFHPDRTPLGLVVRQGVEIHPLERAKLLS-------GVLSVM 127 Query: 135 GDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAV 194 + + R AFK S ++ A+Q+GP L+E P + R V N G Sbjct: 128 PTTITLQRTGAFKGSSAVREALQAGPFLIEKEKPIPGLEA--TKEAARTVVFQNAKGRCG 185 Query: 195 FLLSQQATNFYDFACYAKA-----KLNVEQLLYLDGTISHMYMKGGA---IPWQRYPFVT 246 FL+ + T A + + + + LDG S G + Sbjct: 186 FLICKS-TTLAGMADLLATSSIFPEGKIIRAMNLDGGTSTALWVRGTPPFYAREWKSVRN 244 Query: 247 MISVERK 253 +++ + Sbjct: 245 YLAIVPR 251 >UniRef50_Q73Q09 Putative uncharacterized protein n=1 Tax=Treponema denticola RepID=Q73Q09_TREDE Length = 293 Score = 120 bits (302), Expect = 4e-26, Method: Composition-based stats. Identities = 32/219 (14%), Positives = 69/219 (31%), Gaps = 26/219 (11%) Query: 34 DDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTL--HALLADINSQGQVQMAMNG 91 D L + A ++ ++K+ + + + +A+N Sbjct: 57 SHIKYEDYPLIIHAVKIDLTNPKLKIVVTEPALFNSKGMVKRETTLSFARRHNTVIALNA 116 Query: 92 GIYDE-------SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLD 144 ++ PLG++I+ ++ G ++ I + Sbjct: 117 AFFNVISFSFSLRGEPLGIHID--KKINLSKPFPKYGALCFLDDNSAFI------IESQN 168 Query: 145 AFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATN- 203 +I++AV ++++NG R VG+ G ++L + N Sbjct: 169 TEDIKADIEYAVSGNRIILKNGKPIITNI--SKKENSRTCVGLADGGKTLYLFFAEGENK 226 Query: 204 ------FYDFACYAKAKLNVEQLLYLDGTISHMYMKGGA 236 YD A + KL + ++LDG S + Sbjct: 227 KKSRGITYDQAHFFMKKLGAQDAIHLDGGGSSSLIIKKE 265 >UniRef50_D1CZ42 Putative uncharacterized protein n=1 Tax=Brucella sp. 83/13 RepID=D1CZ42_9RHIZ Length = 248 Score = 120 bits (301), Expect = 5e-26, Method: Composition-based stats. Identities = 46/167 (27%), Positives = 79/167 (47%), Gaps = 10/167 (5%) Query: 96 ESYAPLGLYIENGQQKVALNLASGEG------NFFIRPGGVFYVAGDKVGIVRLDAF-KT 148 ++PLGL+I +G+++ + A + NF+ +P G+F++ G++ + F K Sbjct: 82 AGFSPLGLFIADGKEQSPIQPAGAKTSDKPVPNFYKKPNGIFFLDESGAGLLPTEQFVKR 141 Query: 149 SKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFA 208 ++ A QSGPML+ +NP A R+GVG+ K G F++S A NF+DFA Sbjct: 142 RPKVWLATQSGPMLVIENRLNPIFIIGSADKSRRSGVGVCKDGVIHFVVSDDAVNFHDFA 201 Query: 209 CYAKAKLNVEQLLYLDGTISHMYM---KGGAIPWQRYPFVTMISVER 252 + + +L L+LDG G + M ++ Sbjct: 202 RFFRDRLECPNALFLDGGGGAGLYDPALGRNDMSWHGGYGPMFALIE 248 >UniRef50_B8CYN3 SpoIID/LytB domain protein n=1 Tax=Halothermothrix orenii H 168 RepID=B8CYN3_HALOH Length = 833 Score = 119 bits (299), Expect = 7e-26, Method: Composition-based stats. Identities = 32/216 (14%), Positives = 64/216 (29%), Gaps = 30/216 (13%) Query: 54 TERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVA 113 + K G+ + ++ V ++ + +P + +NG A Sbjct: 631 RTDEAVIINKYYGQVAPPAREWITELVVSNGVVQSI------KDGSPGTVIPDNGFIIQA 684 Query: 114 LNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIH 173 + F V G+ +K+I+ A+ +GP L++NG I Sbjct: 685 HGQSRQFLKLFKEGDKVVLQNNFGPGLT-------NKDIKMALGAGPTLIKNGKIYITGK 737 Query: 174 PNV------ASSKIRNGVGINKHGNAVFLLSQQA-------TNFYDFACYAKAKLNVEQL 220 R +GI + + + + A + K NV Q Sbjct: 738 AEGFQPDILRGRAPRTALGITSGNHLIMVTVDGRQPGFSIGMTLEELAQFML-KYNVVQA 796 Query: 221 LYLDGTISHMYMKGGAIPWQRYP---FVTMISVERK 253 + LDG S + G + ++ + Sbjct: 797 MNLDGGASSRMVVRGYTMNNPSDKRLISNGLLIKYR 832 Score = 53.5 bits (127), Expect = 7e-06, Method: Composition-based stats. Identities = 26/167 (15%), Positives = 56/167 (33%), Gaps = 18/167 (10%) Query: 45 VQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLY 104 + ++ + + + A+G+ G L+ + S + +NGG Y + PLGL+ Sbjct: 521 ITMLDLDLNNDFLYVEPFLASGKLSGLSD--LSQVVSGKKALAGINGGFYSYTGRPLGLF 578 Query: 105 IENGQQKVALNLASGEGNF------FIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQS 158 + NG+ + R + + + ++ AV Sbjct: 579 MINGEIVSEDIMGRTALVITPDDIIIDRIDWTARLTNTRGEEILIEGANRRPRTDEAV-- 636 Query: 159 GPMLMEN---GVINPRIHPNVASSKIRNGVGIN-KHGNAVFLLSQQA 201 + N G + P + + NGV + K G+ ++ Sbjct: 637 ----IINKYYGQVAPPAREWITELVVSNGVVQSIKDGSPGTVIPDNG 679 >UniRef50_A7LRK2 Putative uncharacterized protein n=1 Tax=Bacteroides ovatus ATCC 8483 RepID=A7LRK2_BACOV Length = 315 Score = 119 bits (299), Expect = 8e-26, Method: Composition-based stats. Identities = 28/241 (11%), Positives = 72/241 (29%), Gaps = 28/241 (11%) Query: 28 LFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALL-------ADIN 80 + + + T++ + + A Sbjct: 61 GIKLNELAFKTFGESQHIFVATIDLNELTFTPATKDDKNVPATGPESSAPLPIHAFAAEA 120 Query: 81 SQGQVQMAMNGGIYDES-YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVG 139 + V + +NG Y ++ +GL+ ++G + + + G YV Sbjct: 121 NGKTVWLGVNGDYYADNPRRVMGLFYKDGVCINSQYFEGHDEVLYQLKNGETYVGQA--- 177 Query: 140 IVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHP--NVASSKIRNGVGINKHGNAVFL- 196 +A + A+ +L+++GV+ ++ ++ R VG+++ +++ Sbjct: 178 ---DEALAHEANLLHALGGYGLLVKDGVVQNFYEEMGDLQNTHPRTSVGLSQDRKTMYVF 234 Query: 197 LSQQA---------TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTM 247 + A KA + + LDG S + + P + Sbjct: 235 VVDGRRKDSFFALGLTLPHLATMMKA-VGCYNAINLDGGGSTTLIIRK-VNDGGKPTFPI 292 Query: 248 I 248 + Sbjct: 293 L 293 >UniRef50_B9YC35 Putative uncharacterized protein n=2 Tax=Holdemania filiformis DSM 12042 RepID=B9YC35_9FIRM Length = 368 Score = 119 bits (298), Expect = 1e-25, Method: Composition-based stats. Identities = 29/223 (13%), Positives = 59/223 (26%), Gaps = 26/223 (11%) Query: 32 AADDCALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMN 90 + L T + V +P V +G ++ + MN Sbjct: 143 EIEIIDLKGTTFEGKLMIVHDPSRVFVACNPNMDSGAPGYSVEKYIEL----NDAIAGMN 198 Query: 91 GGIYDE------SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLD 144 G +++ G+ I +G+ + + I V Sbjct: 199 AGGFEDAGGNGNGGTAYGIVIHDGKLISG-SPSEFTPVIGINNANQLVVGDMTA------ 251 Query: 145 AFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQAT-- 202 +I+ AV GP+ ++N + R +G G + ++ Sbjct: 252 QQALDYDIRDAVTFGPVFIKNWEVVFESGR-HPGLNPRTVIGQRYDGAFLLMVLDGRQPS 310 Query: 203 ----NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQR 241 + D + + LDG S + + G Sbjct: 311 SFGSTYQDIIDIM-QQYDAVNAANLDGGNSTVMVYDGETLNTT 352 >UniRef50_B7ASL4 Putative uncharacterized protein n=1 Tax=Bacteroides pectinophilus ATCC 43243 RepID=B7ASL4_9BACE Length = 367 Score = 118 bits (297), Expect = 1e-25, Method: Composition-based stats. Identities = 33/227 (14%), Positives = 65/227 (28%), Gaps = 23/227 (10%) Query: 34 DDCALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGG 92 + + T + + +P V + + ++A I ++ +NGG Sbjct: 142 EIVDIKGTTYRGKLMIIKDPSRVFVGTVP-----QFFEGDGKVVAKIAARYNAVGGVNGG 196 Query: 93 IYDES-----YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFK 147 + + P+GL + +G+ + A+ + V + Sbjct: 197 EFVDGELTYTAMPVGLVMTDGRIVNG-DTATRCHVTGFTKDNILVVGNMTGQQALDMGMR 255 Query: 148 TSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQAT----- 202 I ++ GP L+ NG + R VG G + L Sbjct: 256 DCVSISSSI--GPFLIINGEAQ-DVSGVGGGLNPRTAVGQRADGAVLLLAIDGRQANSLG 312 Query: 203 -NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRY-PFVTM 247 +F D + +DG S G++ Y P Sbjct: 313 ASFADLLYIM-QQYGAVNASTMDGGTSTQMYYEGSVINTPYSPTGPR 358 >UniRef50_Q7X4R9 XcbC n=1 Tax=Neisseria meningitidis RepID=Q7X4R9_NEIME Length = 256 Score = 118 bits (297), Expect = 1e-25, Method: Composition-based stats. Identities = 29/254 (11%), Positives = 70/254 (27%), Gaps = 26/254 (10%) Query: 15 NLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHA 74 ++ IF+ + A C + +N + + + + + Sbjct: 6 SILSIFILSFFNSEYTYAQSLCIQQSSQNHIHIAKINLNCKGINLIATQEADK-----GM 60 Query: 75 LLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVA 134 ++ + + +A+NG + Y P GL I + + + Sbjct: 61 TVSQFARKYRTDIAINGSFFRTGYFPFGLAITDHKTWDKTRDVQKRVFLACNRQNRCMIE 120 Query: 135 GDKVGIVRLDAFKTSKEIQFAVQSGPMLME----NGVINPRIHPNVASSKIRNGVGINKH 190 + D++K + P + + H + + R VG+++ Sbjct: 121 DKNMVSKVDDSWKLAVSGWQ--SFNPATKKFECSDDDPVGCTHIKFITKQPRTMVGLDEK 178 Query: 191 -GNAVFLLSQQAT------NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIP----- 238 ++ + A L + + + LDG S +KG Sbjct: 179 RNYLYLVVIDGRLPKFKGATLNELGQ-LAASLKLTKAINLDGGGSSTMVKGYNRISTLPA 237 Query: 239 --WQRYPFVTMISV 250 + + V Sbjct: 238 TQKKERVVANHLGV 251 >UniRef50_C6XWN0 Putative uncharacterized protein n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6XWN0_PEDHD Length = 328 Score = 118 bits (297), Expect = 1e-25, Method: Composition-based stats. Identities = 31/276 (11%), Positives = 74/276 (26%), Gaps = 33/276 (11%) Query: 3 HQLLIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQ 62 + ++ + + + + V+ +T ++++ Sbjct: 60 TKKIMDNTTVIGTFISDETGSVTAGINITRLAFLRKDKLPVRIFIMEVDMKTPKLEIQAM 119 Query: 63 KANGEAWGTLHALLADINSQG-----QVQMAMNGGIYDESYAPLGLYIENGQQKVALNLA 117 + L L+++ + A+NG + + AP L+ N + A Sbjct: 120 APYNDYINGL-QRLSEMCRDNELPGTNIVAAVNGDTFSTTGAPTSLFYINNRVYYGTV-A 177 Query: 118 SGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVA 177 +G F G + ++ +I+ AV L++N + + Sbjct: 178 TGRTFFAAMKDGTIVI--GGKDTKGVERPVDKAQIKNAVGGNQWLVDNNIKATLTDATI- 234 Query: 178 SSKIRNGVGINKHGNAVFLLSQQ-------ATNFYDFACYAKAKLNVEQLLYLDGTISHM 230 R +G N + ++ D A L + + LDG S Sbjct: 235 --SARTAIGYNANKVIYAIVVDGSQATYSNGLTLVDLRDIM-AALGTKDAVNLDGASSST 291 Query: 231 YMKGG------AIPWQR-------YPFVTMISVERK 253 + + + + K Sbjct: 292 LVAKDLTKGTWNVLNKPALALNAERLIGNGLGFILK 327 >UniRef50_A9B1E5 Putative uncharacterized protein n=1 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9B1E5_HERA2 Length = 272 Score = 118 bits (296), Expect = 2e-25, Method: Composition-based stats. Identities = 34/222 (15%), Positives = 77/222 (34%), Gaps = 19/222 (8%) Query: 20 FLALTLLPLFAVAADDCALSDPTLTVQ---AYTVNPQTERVKMYWQKANGEAWGTLHALL 76 T+ + + + V+P R+++ + A+ + Sbjct: 44 AEPTTIDNQWQTLEPGLEFREIGYDITNVQILRVDPAYFRLRVGYDVASPG-------RV 96 Query: 77 ADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGD 136 ++ + + +NGG +D L I +G G G Sbjct: 97 SEWAAALKPVAVINGGYFDAQGRATALTIFDGVINGTSYDGFGGMLAVDSADG------W 150 Query: 137 KVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFL 196 + +R + +++ + A+QS PML+ +G + + + R+ V I++ G + + Sbjct: 151 SLRSLREQPYDSTEVLNQALQSAPMLVVHGAAIEQPNDDGD-RARRSVVAIDQTGRLLLM 209 Query: 197 LSQQAT-NFYDFACYA-KAKLNVEQLLYLDGTISHMYMKGGA 236 + + D + + K L ++ L LDG S + Sbjct: 210 VCSWPSFTLTDLSQWLVKQDLAIDAALNLDGGSSTGLVVASE 251 >UniRef50_A6L611 Putative uncharacterized protein n=1 Tax=Bacteroides vulgatus ATCC 8482 RepID=A6L611_BACV8 Length = 308 Score = 118 bits (295), Expect = 2e-25, Method: Composition-based stats. Identities = 31/255 (12%), Positives = 61/255 (23%), Gaps = 44/255 (17%) Query: 23 LTLLPLFAVAAD-DCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINS 81 L ++S V V+ + + + L+ + Sbjct: 40 TIAPALIHYRFAGYDSISQAHQNVDVLEVDLTSPSYDIQLV------YEEHGDSLSSVAE 93 Query: 82 QGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIV 141 + A+NG E G+ L S ++ G + +K + Sbjct: 94 RNNAAAAINGTYEAE----ASFIKIGGRLLAQNRLDSTHIRYWKHEGAFLFDDDNKNIDI 149 Query: 142 R--LDAFKTSKEIQFAVQSGPMLMENGVIN------------------PRIHPNVASSKI 181 R D+ S + PML++N + Sbjct: 150 RFASDSTFLSHPAANILSGAPMLIDNNDPVGLNFTGNVEGMDLNKLDYEDFRRHQGVRHP 209 Query: 182 RNGVGINKHGNAVFLLSQQA------TNFYDFACYAKAKLNVEQLLYLDGTISHMYM--- 232 R V + +H + + + + + + L LDG S Sbjct: 210 RTAVALTEHKKLLLITVDGRSTQAAGMSANELTRFLLTYFCPQSALNLDGGGSTTMWIAS 269 Query: 233 ----KGGAIPWQRYP 243 GG + Sbjct: 270 SEQRVGGVVNHPTDN 284 >UniRef50_B3RIP6 Putative uncharacterized protein (Fragment) n=2 Tax=Trichoplax adhaerens RepID=B3RIP6_TRIAD Length = 344 Score = 118 bits (295), Expect = 2e-25, Method: Composition-based stats. Identities = 38/214 (17%), Positives = 62/214 (28%), Gaps = 29/214 (13%) Query: 51 NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE-SYAPLGLYIENGQ 109 +P + Q G L +AD + +A N G ++ + G I NG+ Sbjct: 15 DPLRTISVLEPQNTGGCNMSKLS-TVADTARKAHCYVAENAGFFNTETGGCYGNIISNGR 73 Query: 110 QKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVIN 169 N+ + NF IR G V + + V L+ NG Sbjct: 74 LVRLTNVQN--VNFGIRKNGSIIVGY----LTEEEILDKENPFVQLVSGVIWLVRNGKSY 127 Query: 170 PRIH---------------PNVASSKIRNGVGINKHGNAVFLLSQQ-----ATNFYDFAC 209 + + R +G +++GN + + + N YDFA Sbjct: 128 VKESMKMESNKHEETGTLKQFIEVKSARTAIGHDRNGNVMLMQIEGQTNARGLNLYDFAK 187 Query: 210 YAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYP 243 + LDG S G Sbjct: 188 KLIKS-GFVNAINLDGGGSSTTAIDGIAVGYPSD 220 >UniRef50_C6JBU1 Putative uncharacterized protein n=1 Tax=Ruminococcus sp. 5_1_39BFAA RepID=C6JBU1_9FIRM Length = 291 Score = 117 bits (294), Expect = 3e-25, Method: Composition-based stats. Identities = 37/231 (16%), Positives = 71/231 (30%), Gaps = 20/231 (8%) Query: 22 ALTLLPLFAVAAD----DCALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALL 76 + L ++ +Y V + +T K + +G Sbjct: 52 PTVTQSGNTYEYKSSTLNIKLKRKSVHGISYWVAHIKTSNAKQLKSALSNGTYGGSRQTT 111 Query: 77 ADINSQGQVQMAMNGGIYDESY---APLGLYIENGQQKVALNLASGEGNFFIRPGGVFYV 133 +D S + +NG +D +PLG+ I+NG + ++ G Y Sbjct: 112 SDAVSSNGGIIGVNGSAFDYGTGKPSPLGMCIKNGIIYGDYMTSYS--VMAVKKDGTIYT 169 Query: 134 AGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNA 193 + + ++ GP+L+++G R VG+ K + Sbjct: 170 PAQGLM----GKNLLAAGVKDTYNFGPVLIKDGEAQLPWTET-EKYYPRTAVGMVKPNDY 224 Query: 194 VFLLSQ----QATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQ 240 V L++ N +D K+ LDG S G + + Sbjct: 225 VLLVTDTGSYNGLNHWDMVNIFKS-YGCTYAYNLDGGGSATLYFNGKVMNK 274 >UniRef50_C5S6T1 Putative uncharacterized protein n=1 Tax=Allochromatium vinosum DSM 180 RepID=C5S6T1_CHRVI Length = 272 Score = 116 bits (291), Expect = 6e-25, Method: Composition-based stats. Identities = 42/243 (17%), Positives = 78/243 (32%), Gaps = 24/243 (9%) Query: 22 ALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINS 81 P+ + + T+ + + R+ + + + + Sbjct: 43 PALEAPISHSERTLESSTGRTVRAHLALFDSRRYRLAVLDLGPD---LASASDWPEHTRA 99 Query: 82 QGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIV 141 + A+NGG + PLGL I G++ GV Y + + Sbjct: 100 A-GLLAAVNGGFFHADGQPLGLVIAGGERLNRFET-------VKLLSGVLYGDARGIHLE 151 Query: 142 RLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA 201 R F++S I VQSGP L+E G + + S R + + + V ++ Sbjct: 152 RRARFQSSPGIDALVQSGPYLVEQGRAVRGLSTHDVSR--RTFIATDWRRHWVLGATRDG 209 Query: 202 TNFYDFACYA-----KAKLNVEQLLYLDGTISHMYMKGGAIPWQR------YPFVTMISV 250 + A A VE+ L LDG S ++ + P ++ V Sbjct: 210 LTLAELAEALATPGALAPWPVERALNLDGGTSTGFLFDPGAGQEPIHLRARRPVRNLVGV 269 Query: 251 ERK 253 + Sbjct: 270 RAR 272 >UniRef50_D2V2G1 N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase n=1 Tax=Naegleria gruberi RepID=D2V2G1_NAEGR Length = 558 Score = 116 bits (291), Expect = 7e-25, Method: Composition-based stats. Identities = 30/199 (15%), Positives = 59/199 (29%), Gaps = 36/199 (18%) Query: 66 GEAWGTLHALLADINSQGQ----VQMAMNGGIYDES-YAPLGLYIENGQQKVALNLASGE 120 G + + A + DI N G ++ + LG + +G+ + + Sbjct: 193 GGCYYNVTAPVRDIAKYHANGYFCHYTTNAGFFNTHKHTCLGNVVSDGRI--SHVSTNHN 250 Query: 121 GNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHP------ 174 NF I G +++ D ++ + L+ G Sbjct: 251 VNFGITKDGKYFIG-------YTDENTKLEDFDQMISGVIWLVRKGESYVDESSKIEDMS 303 Query: 175 ---------NVASSKIRNGVGINKHGNAVFLLSQQ------ATNFYDFACYAKAKLNVEQ 219 + R+ +G +K G V + Y+ A +L VE Sbjct: 304 IQETGNAKRFITVRASRSALGHDKEGRLVLVSIDGDGNHNKGPTLYELATLMI-ELGVEN 362 Query: 220 LLYLDGTISHMYMKGGAIP 238 + LDG S ++ + Sbjct: 363 AINLDGGGSVTVVRDNDVV 381 >UniRef50_Q8YP57 All4343 protein n=5 Tax=Nostocaceae RepID=Q8YP57_ANASP Length = 660 Score = 116 bits (290), Expect = 7e-25, Method: Composition-based stats. Identities = 47/295 (15%), Positives = 86/295 (29%), Gaps = 49/295 (16%) Query: 6 LIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKAN 65 L+G I +R F + + + L Q + +P R + W A Sbjct: 366 LVGTAPILQTAQRYFAVGAINGGYFNRNNRYPLGAIRQNNQWLS-SPILNRGAIAWNDAG 424 Query: 66 GEAWG--TLHALLADINSQGQVQMAMNGGI-----------YDESYAPLG-----LYIEN 107 +G +L LA ++ +A+N G + + Y PL + ++N Sbjct: 425 QFYFGRLSLQETLATSSNLRVPILALNSGYVQNGIARYTPAWGKMYTPLTDNERIVIVQN 484 Query: 108 GQQKVA-LNLASGEGNFFIRPGGVFYVAGDKV-------------GIVRLDAFKTSKEIQ 153 + +G+ NF I G I Sbjct: 485 NKITNQFPGNKAGQTNFPIPNNGYLLTLRGNATTVASQLPVGTDVQITSATTPGEFNRYP 544 Query: 154 FAVQSGPMLMENGVINPR------IHPNVASSKIRNGVGINKHGNAVFLLSQQA-----T 202 + +GP+L++N I + +A +R+G+ + + Sbjct: 545 HIIGAGPLLLQNSQIVLDAKSEQFSNAFIAERAVRSGICTTANNTLLIAAVHNRAGGPGP 604 Query: 203 NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFV----TMISVERK 253 N + A K L L LDG S G + + I + + Sbjct: 605 NLAEHAQLMKL-LGCVNALNLDGGSSTSLYLSGQLLDRYPNTAARVHNGIGIFLR 658 Score = 75.8 bits (185), Expect = 1e-12, Method: Composition-based stats. Identities = 26/148 (17%), Positives = 43/148 (29%), Gaps = 7/148 (4%) Query: 20 FLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADI 79 +T L V VNP+T + + N + +L Sbjct: 317 PRDITWATGLRWRQQFVNLGTNRFPVVLLEVNPRTIGLTLKPIVTNPDTLVGTAPIL-QT 375 Query: 80 NSQGQVQMAMNGGIYDESYA-PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKV 138 + A+NGG ++ + PLG +N Q + L G G FY + Sbjct: 376 AQRYFAVGAINGGYFNRNNRYPLGAIRQNNQWLSSPILN--RGAIAWNDAGQFYFGRLSL 433 Query: 139 GIVRLDAFKTSKEIQFAVQSGPMLMENG 166 + I ++NG Sbjct: 434 QETLATSSNLRVPILALNSGY---VQNG 458 >UniRef50_C4Z4Z5 Putative uncharacterized protein n=1 Tax=Eubacterium eligens ATCC 27750 RepID=C4Z4Z5_EUBE2 Length = 388 Score = 115 bits (289), Expect = 1e-24, Method: Composition-based stats. Identities = 36/227 (15%), Positives = 63/227 (27%), Gaps = 24/227 (10%) Query: 34 DDCALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGG 92 + + T + + V +P V E ++ADI + +NGG Sbjct: 164 EIVDVKGATYSGKLMIVKDPSRLFVGTVP-----EFTNGNGMVVADIAKRYDAIGGVNGG 218 Query: 93 IYDESYA-----PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFK 147 + + P+GL +++G+ S I + + + Sbjct: 219 EFVDGETTYTAMPIGLVMKDGEILNDNGGTSHVT--GITFDNKLVLGNMNAAKAKELNIR 276 Query: 148 TSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQAT----- 202 I + GP L+ NG I + R +G G + L Sbjct: 277 DCVSISNHI--GPFLIVNGEAQ-DIVGIAGGTNPRTAIGQTADGKILLLAVDGRQPNSIG 333 Query: 203 -NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPW-QRYPFVTM 247 F D A+ +DG S G + P Sbjct: 334 ATFSDLQDIM-AQYGAVNASTMDGGTSTQMYYDGEVINVPYSPTGPR 379 >UniRef50_B8G1I8 Peptidase M56 BlaR1 n=4 Tax=Desulfitobacterium hafniense RepID=B8G1I8_DESHD Length = 747 Score = 115 bits (289), Expect = 1e-24, Method: Composition-based stats. Identities = 38/241 (15%), Positives = 67/241 (27%), Gaps = 37/241 (15%) Query: 37 ALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYD 95 L + + + +P+ + + E GT+ L D+ S+ +N G Sbjct: 519 NLHGFNIKGKVMLISDPKRVTLAVT------EEIGTVEEKLTDMVSRSGAIAGINAGGIY 572 Query: 96 ESYA------PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTS 149 S P G+ ++NG+ + G V + Sbjct: 573 LSLEEGNEVFPDGITVQNGEVVYNNAGDQAVEFIGLDAEGKLITGPMNVQEI------KE 626 Query: 150 KEIQFAVQSGPMLMENGVINPRIH------PNVASSKIRNGVGINKHGNAVFLLSQQATN 203 K IQ V P L +NG R R G+G G +F++ Sbjct: 627 KNIQEGVGFSPPLADNGTTLVREGKPAVPGDGGWGIAPRAGIGQRADGTLIFMVIDGRDP 686 Query: 204 FYDFACYAKA------KLNVEQLLYLDGTISHMYMKGGAIPWQ------RYPFVTMISVE 251 + K + + + L G + G + + P T V Sbjct: 687 DWSIGATLKDMENLFLEYGAVEAVNLSGGSMVEMVYDGKVLNKVSNIFGERPIPTGFVVM 746 Query: 252 R 252 Sbjct: 747 P 747 >UniRef50_Q4UP44 Putative uncharacterized protein n=4 Tax=Bacteria RepID=Q4UP44_XANC8 Length = 439 Score = 115 bits (289), Expect = 1e-24, Method: Composition-based stats. Identities = 33/236 (13%), Positives = 65/236 (27%), Gaps = 25/236 (10%) Query: 21 LALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHAL-LADI 79 LTL P + + ++ T +++ + G A Sbjct: 172 APLTLAPGVRYWRQAIG-GAQPVMLHIAQIDLTTPGLQLVGTPGDRSDGGEFRATPTTAF 230 Query: 80 NSQGQVQMAMNGGIYDES---------YAPLGL--YIENGQQKVALNLASGEGNFFIRPG 128 G + +A+N + + P G A S R Sbjct: 231 VRDGALTLAINADYFLPFDGGHLLDKPFVPAAGQGVTAEGLAIEAGRTDSAAATSDPRVN 290 Query: 129 GVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIH---PNVASSKIRNGV 185 V+ + + + V +GP+L+ +G PR + R+ V Sbjct: 291 AALCVSQRDAVRIVRGS--CPAGSRLGVGAGPLLLLDGKRQPREASRAAYYDGPEPRSAV 348 Query: 186 GINKHGN-AVFLLSQQATNFYDFACYA------KAKLNVEQLLYLDGTISHMYMKG 234 G+++ G+ +++ Y +L + LDG S Sbjct: 349 GLDRSGHTLWMVVADGRQPGYSAGMTLDALTAVFEQLGAHAAINLDGGGSSTLAAR 404 >UniRef50_C6PYU6 Putative uncharacterized protein n=1 Tax=Clostridium carboxidivorans P7 RepID=C6PYU6_9CLOT Length = 369 Score = 115 bits (288), Expect = 1e-24, Method: Composition-based stats. Identities = 30/224 (13%), Positives = 56/224 (25%), Gaps = 35/224 (15%) Query: 37 ALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAM------ 89 + D V NP ++ + ++I + A+ Sbjct: 146 EIDDTKFHACILEVKNPTRMKIGYT------NKLKEVGQKTSEIAEENGAAAAINGGGFT 199 Query: 90 ----NGGIY-DESYAPLGLYIENGQQKVALNLASGEGNF-FIRPGGVFYVAGDKVGIVRL 143 NG ++ P G+ I NG+ + + N G V V Sbjct: 200 DKSSNGKLWTGTGAYPQGIVISNGKVVYSDVKNNEAVNVTAFTKDGKLIVGDHTV----- 254 Query: 144 DAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA-- 201 + + A+ L+ NG + R +G G + L+ Sbjct: 255 -SELLRDNVTEAISFRNSLIINGKPVALAEEGLN---PRTAIGQKADGTIIMLVIDGRKG 310 Query: 202 ----TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQR 241 + + + LDG S G + Sbjct: 311 LKAGASLKEVQNILLQR-GALNASSLDGGSSSTMYFNGEVINDP 353 >UniRef50_B4VX04 Putative uncharacterized protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VX04_9CYAN Length = 681 Score = 115 bits (287), Expect = 2e-24, Method: Composition-based stats. Identities = 47/286 (16%), Positives = 83/286 (29%), Gaps = 46/286 (16%) Query: 7 IGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANG 66 +G + + A + F + L Y+ P R + W + Sbjct: 387 VGTAPLIKTAQLWKAAAAINGGFFNRNNQLPLGAIRRDGYWYSG-PILNRGAIAWTDQHQ 445 Query: 67 EAWG--TLHALLADINSQGQVQMAMNGGI-----------YDESYAPLG-----LYIENG 108 +G +L L N + + +N G + +Y PL ++N Sbjct: 446 FKFGRFSLQETLITANGERFPSLFLNSGYVQAGISRYTPAWGVTYTPLTDNEVIWVVQNN 505 Query: 109 QQKVA-LNLASGEGNFFIRPGGVFYVAGD--------------KVGIVRLDAFKTSKEIQ 153 Q +GE +F I G V I + + Sbjct: 506 QITAQLPGGVAGEESFVIPVNGYLLTHRGHDPNAIAKSLTLGTTVQIEQKTLPVEFNDYP 565 Query: 154 FAVQSGPMLMENGVINPR------IHPNVASSKIRNGVGINKHGNAVFLLSQQAT----- 202 + +GP+L++N I + S IR+ +GI +G + Sbjct: 566 HILGAGPLLLQNRQIVLDAKAENFSNAFAQQSAIRSAIGITANGTLIIAAMHNRVGGRGP 625 Query: 203 NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMI 248 N + A +L L LDG S GG + + + Sbjct: 626 NLTETAQLM-QQLGAVDALNLDGGSSTGLYLGGHLLDRSPHTAARV 670 Score = 74.3 bits (181), Expect = 4e-12, Method: Composition-based stats. Identities = 18/102 (17%), Positives = 34/102 (33%), Gaps = 2/102 (1%) Query: 23 LTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQ 82 + P L D + V+P+ +VK+ ++ L+ Sbjct: 340 ILWAPGIWWRQRTVTLGDHQFPLVWLEVDPKNPQVKLSPMWSHPTTQVGTAPLIKT-AQL 398 Query: 83 GQVQMAMNGGIYDESYA-PLGLYIENGQQKVALNLASGEGNF 123 + A+NGG ++ + PLG +G L G + Sbjct: 399 WKAAAAINGGFFNRNNQLPLGAIRRDGYWYSGPILNRGAIAW 440 >UniRef50_B8HTR4 Putative uncharacterized protein n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HTR4_CYAP4 Length = 603 Score = 114 bits (286), Expect = 3e-24, Method: Composition-based stats. Identities = 45/291 (15%), Positives = 80/291 (27%), Gaps = 45/291 (15%) Query: 6 LIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKAN 65 L+G + +R +A + F L + + +P R + W Sbjct: 307 LVGIQPLPRLAQRWQVAAAINGGFFNRNQQVPLGAIRQSGSWIS-SPILNRGAIGWNDQG 365 Query: 66 GEAWGTLH--ALLADINSQGQVQMAMNGGI-----------YDESYAP-----LGLYIEN 107 G L L + Q ++ G + +Y P + + N Sbjct: 366 EFTLGRLRLQQTLITASGQSLPINTLDSGFVQKGIARYTRAWGPTYTPRVAKETVITVVN 425 Query: 108 GQQK-VALNLASGEGNFFIRPGGVFYV---------AGDKVGIVRLDAFKTSKEIQFAVQ 157 + A+ I P G V + I + Sbjct: 426 DRVAGQQTASANTPTPILIPPNGYLLVLRDVPLPVFGEGSLQIQMNALPADFNRFPQILG 485 Query: 158 SGPMLMENGVIN--PRIHPNVA----SSKIRNGVGINKHGNAVFLLSQQAT-----NFYD 206 +GP+L+E G I P + R+G+G G + + + + Sbjct: 486 AGPLLLERGQIVLNPDLEQFGNGLDAQQAPRSGIGRTSTGQILLVTTHNRIGGAGPTLAE 545 Query: 207 FACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRY----PFVTMISVERK 253 +A K L L LDG S GG + + I + + Sbjct: 546 WAAILK-TLGAVDALNLDGGSSTALYLGGQLLDRHPVTSARVQNAIGLFLR 595 Score = 67.8 bits (164), Expect = 3e-10, Method: Composition-based stats. Identities = 26/188 (13%), Positives = 52/188 (27%), Gaps = 14/188 (7%) Query: 23 LTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQ 82 + A+ + +NP +++ + + L + + Sbjct: 261 ILWSSGVRWREQTLAVGGDRYPLTWLEINPHQAGLQLRPIWNQPDTLVGI-QPLPRLAQR 319 Query: 83 GQVQMAMNGGIYDESYA-PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIV 141 QV A+NGG ++ + PLG ++G + L G G F + ++ Sbjct: 320 WQVAAAINGGFFNRNQQVPLGAIRQSGSWISSPILN--RGAIGWNDQGEFTLGRLRLQQT 377 Query: 142 RLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA 201 + A S I ++ G I + R V + Sbjct: 378 LITASGQSLPINTLDSGF---VQKG-IARYTRAWGPTYTPRVA------KETVITVVNDR 427 Query: 202 TNFYDFAC 209 A Sbjct: 428 VAGQQTAS 435 >UniRef50_B0BZE5 Putative uncharacterized protein n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0BZE5_ACAM1 Length = 584 Score = 114 bits (286), Expect = 3e-24, Method: Composition-based stats. Identities = 44/295 (14%), Positives = 86/295 (29%), Gaps = 47/295 (15%) Query: 3 HQLLIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQ 62 L+G + + +A + + + L Q + +P R + W Sbjct: 289 RNALLGIHPLLSMAQGNQVAAAINAGYFNRNNKTPLGAIRQNGQWIS-SPILNRGVVAWN 347 Query: 63 KANGEAWG--TLHALLADINSQGQVQMAMNGGI-----------YDESYAPL-----GLY 104 G L +L+ + ++++ G + +Y P+ + Sbjct: 348 PQGQFQMGRLNLQQVLSTSGGKRLSIVSLDSGYPQKGIARYTPTWGPTYTPILKTEKIIT 407 Query: 105 IENGQQKVALNLASGEGNFFIRPGGVFYVAG-----------DKVGIVRLDAFKTSKEIQ 153 + N Q +++ +F I G V ++ I + Sbjct: 408 VINNQVVSE-KVSTSTKSFAIPKNGYLLVLRSFDVGGALASGTQLQIQTATTPASFNGFP 466 Query: 154 FAVQSGPMLMENGVINPRI------HPNVASSKIRNGVGINKHGNAVFLLSQQ-----AT 202 V +GP+L+ NG + P S R+G+G G + Sbjct: 467 NIVGAGPLLVSNGQVVLNAKAEKFRPPFDTQSAPRSGIGQTADGTILLAAVHNQVSGPGP 526 Query: 203 NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFV----TMISVERK 253 ++A +L L LDG S GG + + I V + Sbjct: 527 TLKEWALIM-QRLGSVNALNLDGGSSTSLYLGGQLLDRHPVTAARVQNGIGVFWR 580 Score = 74.7 bits (182), Expect = 2e-12, Method: Composition-based stats. Identities = 32/142 (22%), Positives = 51/142 (35%), Gaps = 4/142 (2%) Query: 20 FLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADI 79 ++ P +L + V +NPQT +K+ N A +H LL+ + Sbjct: 243 QRSILWAPGILRQERVISLGNKQYPVTWLALNPQTPGLKLQPIWGNRNALLGIHPLLS-M 301 Query: 80 NSQGQVQMAMNGGIYDESY-APLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKV 138 QV A+N G ++ + PLG +NGQ + L G P G F + + Sbjct: 302 AQGNQVAAAINAGYFNRNNKTPLGAIRQNGQWISSPILN--RGVVAWNPQGQFQMGRLNL 359 Query: 139 GIVRLDAFKTSKEIQFAVQSGP 160 V + I P Sbjct: 360 QQVLSTSGGKRLSIVSLDSGYP 381 >UniRef50_B0C332 Putative uncharacterized protein n=2 Tax=Bacteria RepID=B0C332_ACAM1 Length = 306 Score = 114 bits (285), Expect = 3e-24, Method: Composition-based stats. Identities = 30/253 (11%), Positives = 65/253 (25%), Gaps = 31/253 (12%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMY--WQKANGEAWGTLHALLADINSQ 82 L S V ++ +++ + + + +Q Sbjct: 40 LFAGITYQRQVYT-SPRPYIVHIAKIDLTHPGIRVIATPGQPADDDNEFRAQPTSAFLTQ 98 Query: 83 GQVQMAMNGGIYDESY--APLGLYIENGQQKVALNLASGEGNFFIRPGGV--FYVAGDKV 138 ++Q+AMN G + P G + L + G + P Sbjct: 99 FRLQLAMNAGYFYHFNEKTPWDYAPHTGGRVNVLGQSISMGQPYSPPQKQWPVLCFDQSQ 158 Query: 139 GIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGN-AVFLL 197 + + AV +L + + + R+ +++ G ++ Sbjct: 159 RGRIVATGHCPSDTLHAVAGNYIL----HPDQPLQLDSDKPYARSIAALDQTGTTLWLIV 214 Query: 198 SQQATNFYD----FACY--AKAKLNVEQLLYLDGTISHMYM----KGGAIPWQR------ 241 Y FA ++ + L LDG S + G + Sbjct: 215 VDGKQPDYSEGATFADIEQLIKQIGADIALNLDGGGSTTLVTSTRSGAKLLNAPIHGKWP 274 Query: 242 ---YPFVTMISVE 251 P T + + Sbjct: 275 MNERPVATHLGIY 287 >UniRef50_C3R3L4 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=C3R3L4_9BACE Length = 431 Score = 113 bits (284), Expect = 4e-24, Method: Composition-based stats. Identities = 37/281 (13%), Positives = 67/281 (23%), Gaps = 62/281 (22%) Query: 24 TLLPLFAVAADDCALSDPTLTVQAYTVNP-QTERVKMYWQKANGEAWGTLHALLADINSQ 82 TL A+ L + + + G A G Sbjct: 150 TLPDYLAIYKSPSTLKGKNAVAYIAVADMDKNASFSVL-----GNATGVKTLTQFYNAES 204 Query: 83 GQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLA-------------SGEGNFFIRPGG 129 + + MNGG + + A + L N + F G Sbjct: 205 VKPAIVMNGGYFASNGATVSLLYRNNVMLAPNLQSMSRSDGTSNVAFYPTRSAFGEIENG 264 Query: 130 VFYVAGDKVGIVRLDAFKTSK------------------------EIQFAVQSGPMLMEN 165 F V + + + A+ GP+L++N Sbjct: 265 KFEVNWVYTVSSGQTYAYPAPSPNKSGVSPMQIPSVNYPEGASIWKAKNAIGGGPVLLKN 324 Query: 166 GVINPRIHPNV---------ASSKIRNGVGINKHGNAVFLLSQQAT--------NFYDFA 208 G+ + S+ R+ +GI +F + + + A Sbjct: 325 GLYKNTWEAELFDTASGIGPTSNNPRSAIGITGDNRLIFFVCEGRNKTPNVPGFTLEEVA 384 Query: 209 CYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRYPFVTMI 248 + L + LDG S M + G + Sbjct: 385 YILRD-LGCLDAMNLDGGGSSCMLVNGQETIKPSDGAQRSV 424 >UniRef50_C4Z6E6 Putative uncharacterized protein n=1 Tax=Eubacterium eligens ATCC 27750 RepID=C4Z6E6_EUBE2 Length = 360 Score = 112 bits (281), Expect = 1e-23, Method: Composition-based stats. Identities = 38/251 (15%), Positives = 73/251 (29%), Gaps = 41/251 (16%) Query: 36 CALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIY 94 +S + + + +P +V + WG L +I S +NGG+Y Sbjct: 118 VEISGRSFFGKMLIIKDPSQVKVGTTY------PWGDYGKELHEIVSGAGAIAGVNGGLY 171 Query: 95 ----DESYAPLGLYIENGQQ-KVALNLASGEGNFFIRPGGVFYVAG-DKVGIVRLDAFKT 148 + +PLG+ +++G+ + + SG + + V D + +++ Sbjct: 172 VSSGNRGGSPLGIVVQDGKITYNSPSALSGLYLIGLNKDNLLVVKDIDGMSAADFESYVN 231 Query: 149 SKEIQFAVQS----------GPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLS 198 I+ AV L+ N + R +G G + L++ Sbjct: 232 EAGIRDAVAFQEESSDSNNHFVPLIINNEARVLKGQGS-GANPRTAIGQRVDGAILLLVT 290 Query: 199 QQA-------TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGA---------IPWQRY 242 D + LDG S + G + Sbjct: 291 DGRGASGHLGATASDLISVM-QEYGAVNAANLDGGSSSTMVYNGGYEMTSVTFYYQNSSW 349 Query: 243 PFVTMISVERK 253 T V K Sbjct: 350 KLPTAFVVMPK 360 >UniRef50_Q8YKH7 All7320 protein n=2 Tax=Cyanobacteria RepID=Q8YKH7_ANASP Length = 314 Score = 112 bits (281), Expect = 1e-23, Method: Composition-based stats. Identities = 37/269 (13%), Positives = 67/269 (24%), Gaps = 43/269 (15%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLH----------- 73 L L + T++ T +K + + Sbjct: 45 LFRGIVYQR-LIESKPRPLIIHIVTIDLNTPGIKPFITPDIENLSKNVGVGKQAIIDNET 103 Query: 74 --ALLADINSQGQVQMAMNGGIYDESY--APLGLYIENGQQKVALNLASGEGNFFIRP-- 127 ++ ++ QV++A+NG + P Y +G L G + Sbjct: 104 KARTTSEFVAEFQVKLAINGSYFYPFKEVTPWHYYPHSGDTTKVLGQTISNGKIYANKKS 163 Query: 128 GGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKI--RNGV 185 + + K + +L+ G I+ N ++ K R Sbjct: 164 SWYVLCFDNNNQAQIPGGEECPKNTIQGLAGDDVLVFQGKPKINIYANSSADKPYSRVVA 223 Query: 186 GINKHGN-AVFLLSQQATNFYDFA------CYAKAKLNVEQLLYLDGTISHMYMKGG--- 235 I+K G +L Y AKL V + LDG S + Sbjct: 224 AIDKTGKKLWLVLVDGKQPLYSEGFTKRELTQFIAKLGVYNAINLDGGGSTTLVVANPDK 283 Query: 236 -------------AIPWQRYPFVTMISVE 251 I + P + Sbjct: 284 EGKPKILNAPTHTKILMRDRPVANHLGFY 312 >UniRef50_B2A8G9 Copper amine oxidase domain protein n=1 Tax=Natranaerobius thermophilus JW/NM-WN-LF RepID=B2A8G9_NATTJ Length = 718 Score = 112 bits (280), Expect = 1e-23, Method: Composition-based stats. Identities = 31/281 (11%), Positives = 69/281 (24%), Gaps = 50/281 (17%) Query: 9 KGMITLNLKRIFLALTLLPLFAVAADDC-ALSDPTLTVQAYTVNPQTERVKMYWQKANGE 67 G+ + + I + F + D + ++ + + + Sbjct: 452 IGLYISDQRLIREPMPNRSAFFYSKDGEATIERTAFNGGLMYIDDINTNLSIDGVNRSRG 511 Query: 68 AWGTLHALLA----------------DINSQGQVQMAMNGGIYDESYAPLGLYIENGQQK 111 + +I + +A+N G ++G Sbjct: 512 REELIVYTPEQGNTTGTTSSTFRGHKEIVISDEEIIAINHG--------DSQIPDDGYVL 563 Query: 112 VALNL--ASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVIN 169 + + G+ +G + ++ FA+ GP ++E G ++ Sbjct: 564 SIHEQYVRANQDLIDELETGMTTKLHWNMGQSKNVE-----DVVFALGGGPRILEKGEVD 618 Query: 170 PRIHPN------VASSKIRNGVGINKHGNAVFLLSQQA-------TNFYDFACYAKAKLN 216 R R VG+ + G + + + K + Sbjct: 619 IRSMEEVISDNVSQGRSPRTAVGVTRDGQLLLTAVDGRQSGLSIGMTLEELGNFMKDR-G 677 Query: 217 VEQLLYLDGTISHMYMKGGAIPWQR----YPFVTMISVERK 253 + L LDG S M I + K Sbjct: 678 AQDALNLDGGGSTMMWFDNEFQNNPSNGIRNIGNSIVIREK 718 Score = 70.5 bits (171), Expect = 5e-11, Method: Composition-based stats. Identities = 20/131 (15%), Positives = 41/131 (31%), Gaps = 9/131 (6%) Query: 10 GMITLNLKRIFLALT----LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKAN 65 + + + ++F + + + + + + ++P VK A Sbjct: 360 TSLIVTIPKVFETILEEQEIANGLKYTSIRKGQENGPIKIHELRLDP-HGDVKPELIMAQ 418 Query: 66 GEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFI 125 G L + + A+NGG Y + P+GLYI + + FF Sbjct: 419 DGFSGF--ERLDSMAKRNNAIAAINGGFYWRAGHPIGLYISDQRLIREPMPN--RSAFFY 474 Query: 126 RPGGVFYVAGD 136 G + Sbjct: 475 SKDGEATIERT 485 >UniRef50_C0WEQ2 Exopolysaccharide biosynthesis protein n=1 Tax=Acidaminococcus sp. D21 RepID=C0WEQ2_9FIRM Length = 470 Score = 112 bits (279), Expect = 2e-23, Method: Composition-based stats. Identities = 35/227 (15%), Positives = 66/227 (29%), Gaps = 29/227 (12%) Query: 55 ERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGL--YIENGQQKV 112 R+ + L + + + NG G+ I NG+ Sbjct: 245 PRLSFTGTVTRPDGAEMKITGLNRMRLENDLIFYNNGYDDTTDTNAAGVEVAIRNGRVIK 304 Query: 113 ALNLAS---GEGNFFIRPGGVFYVA------GDKVGIVRLDAFKTSKEIQFAVQSGPMLM 163 S + G GDKV I + + +GP+L+ Sbjct: 305 TGTTGSMPMSWNMTVLSGHGTAADFLRPLAVGDKVKIKTSLGSPLADKAPSVGTAGPLLV 364 Query: 164 ENGVINPR------IHPNVASSKIRNGVGINKHGNAVFLLSQQA------TNFYDFACYA 211 +G++N R VGI K G + +++ + A Y Sbjct: 365 YDGLVNVTASLEEIPSDIADGRAPRTAVGIKKDGTILVVVADGRSSRSAGMTLPELARYL 424 Query: 212 KAKLNVEQLLYLDGTISHMYMKGGAIPWQR-----YPFVTMISVERK 253 +L ++ + DG S + GA+ + P + + + Sbjct: 425 I-QLGADRAMNFDGGGSSEMVVNGAVKNRPSDGAERPVRVALGLFPR 470 >UniRef50_A7C442 Putative uncharacterized protein n=1 Tax=Beggiatoa sp. PS RepID=A7C442_9GAMM Length = 299 Score = 111 bits (278), Expect = 2e-23, Method: Composition-based stats. Identities = 28/261 (10%), Positives = 70/261 (26%), Gaps = 36/261 (13%) Query: 24 TLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVK-MYWQKANGEAWGTLHALLADINSQ 82 TL + + + + +V+ ++ + + + ++ Sbjct: 34 TLFEGITYIREVRQ-TPRPIIIHFISVDLTKPNIRFLVTPGEVRDDGEIGARTTSQFLTE 92 Query: 83 GQVQMAMNGGIYDESYAPL-------GLYIENGQQKVALNLASGEGNFFIRPGGVFYVAG 135 ++Q+A+NG + + PL + G + LAS G + + F Sbjct: 93 FKLQLAINGNFFYP-FHPLFSVDFWNAYPKKRGDPVYVVGLASSHGQVYSQTKKSFETLY 151 Query: 136 DKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPN--VASSKIRNGVGINKHGNA 193 + A+ + ++ G I R + ++K Sbjct: 152 ISADNQARFQTSIGP-LYHAISGRELFIKQGKIQGPFPKGAFNEKPYPRTALALDKTAKT 210 Query: 194 VFL-LSQQ-------ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKG----------- 234 + + + + A ++ + L LDG S + Sbjct: 211 LMIFVVDGKLKNYSEGVTLMELADIVQS-YGADMALNLDGGGSSTLVMEGPSKKMVLLNM 269 Query: 235 ---GAIPWQRYPFVTMISVER 252 G I + + + Sbjct: 270 PIHGRIQGKERLIGNHLGIYT 290 >UniRef50_C6LDL7 Putative uncharacterized protein n=1 Tax=Bryantella formatexigens DSM 14469 RepID=C6LDL7_9FIRM Length = 400 Score = 111 bits (278), Expect = 2e-23, Method: Composition-based stats. Identities = 31/222 (13%), Positives = 58/222 (26%), Gaps = 17/222 (7%) Query: 39 SDPTLTVQAYTVNPQTERVK-MYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDES 97 + + + + A L+ ++ + + +A+NG Y + Sbjct: 170 EKYGTQISYVLADIYVGDITCLRTAFAQDTYGVGYSEKLSGMSDRMKAVLAVNGDSYSNN 229 Query: 98 Y-APLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAV 156 G I NG + + G + + + + Sbjct: 230 RHRNNGTIIRNGVIYRSQATDAETCVLNW--DGTMDIYTPDQMDI---QKLIERGAYQSW 284 Query: 157 QSGPMLM-ENGVINPRI--HPNVASSKIRNGVGINKHGNAVFLLSQQA------TNFYDF 207 GP L+ ENG + S R +G + G+ LL + Sbjct: 285 VFGPSLLDENGKAKDSFLTWDYIRQSHPRTAIGYYEPGHYCLLLVDGRQKASRGMFLDEM 344 Query: 208 ACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMIS 249 A +L + LDG G Y +S Sbjct: 345 AQLF-EELGCKAAYNLDGGHCSFMNFQGQTANHPYKPEHTVS 385 >UniRef50_UPI0001C164F4 hypothetical protein CRD_01886 n=2 Tax=Nostocaceae RepID=UPI0001C164F4 Length = 300 Score = 111 bits (277), Expect = 3e-23, Method: Composition-based stats. Identities = 32/248 (12%), Positives = 75/248 (30%), Gaps = 27/248 (10%) Query: 21 LALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLA--- 77 + + A + + ++ + + + N + + Sbjct: 40 TPTIIPTVRAYRSQI-----NGIPFYQTIIDLEDPNILLTIGLPNSANFANTISRTNGDE 94 Query: 78 ---DINSQGQVQMAMNGGIYDES--YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFY 132 + ++ + +NG + +G + G+ + + G Sbjct: 95 NFDQLVARSGAAVVVNGTFAYTNPQKTVMGNLVAGGRSLKYSPWENFGTTLGLGVGNKPE 154 Query: 133 VAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRI------HPNVASSKIRNGVG 186 + +V + + F++ SGP L+ NG ++ P V + +R +G Sbjct: 155 MITARV-----EGRPEWNKHWFSITSGPRLLRNGEVSVNPRLEGFKDPAVLGTSLRTAIG 209 Query: 187 INKHGNAVFLL-SQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQR-YPF 244 ++ G +FL + + A KA + + + LDG S I Sbjct: 210 FSEDGKRLFLANFDEKLYLEEEAEAMKA-IGCYEAMNLDGGPSRALASDNVILVPPARKL 268 Query: 245 VTMISVER 252 +I V Sbjct: 269 TNVILVYD 276 >UniRef50_B5VVA8 S-layer domain protein n=3 Tax=Cyanobacteria RepID=B5VVA8_SPIMA Length = 789 Score = 110 bits (276), Expect = 3e-23, Method: Composition-based stats. Identities = 25/181 (13%), Positives = 50/181 (27%), Gaps = 24/181 (13%) Query: 91 GGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGD------------KV 138 + + + + + Q + L + I G V ++ Sbjct: 602 ASYFPLTLNEIVVVVSGDQVTRQIELPDDQTPTAIPTNGYLLVFRSFRSAVSAFGVGSRL 661 Query: 139 GIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHP------NVASSKIRNGVGINKHGN 192 I + + GP+L++N I IR+ VG+ G Sbjct: 662 TITATTTPSEFIDFPHIMGGGPLLVQNRNIVVNAEAEGFNYWFGQQLAIRSAVGVTATGE 721 Query: 193 AVFLLSQQATN-----FYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTM 247 + + N + A +L + LDG S + GG + + Sbjct: 722 VLMVTVHNRVNGAGPSLTEMAKLM-QQLGAIDAINLDGGSSTSLVLGGHLLNRTPDTAAR 780 Query: 248 I 248 + Sbjct: 781 V 781 Score = 52.7 bits (125), Expect = 1e-05, Method: Composition-based stats. Identities = 16/121 (13%), Positives = 34/121 (28%), Gaps = 5/121 (4%) Query: 22 ALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINS 81 ++ + P L T+ P + + ++ L I Sbjct: 454 SILVYPGLRSRQQYLTLEGDKFPAVWLTITP-RSGLSIAPIWTETNQMRGTNSFL-AITK 511 Query: 82 QGQVQMAMNGGIYDESY-APLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGI 140 +NGG ++ + PLG+ ++ + L G G + K+ Sbjct: 512 TWSSLGGINGGYFNRNNLLPLGVIRKDNKWFSGPILN--RGAMAWDNNGRVRMGRLKLVE 569 Query: 141 V 141 Sbjct: 570 T 570 >UniRef50_A7LVE9 Putative uncharacterized protein n=1 Tax=Bacteroides ovatus ATCC 8483 RepID=A7LVE9_BACOV Length = 332 Score = 110 bits (276), Expect = 3e-23, Method: Composition-based stats. Identities = 37/270 (13%), Positives = 65/270 (24%), Gaps = 57/270 (21%) Query: 23 LTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQ 82 TL V L N + +K + + + Sbjct: 61 GTLPEHINVYKSPETLEGKKAIAYIAVGNMSKATFGVLGEKTGLKKPK-------EFYEE 113 Query: 83 GQVQMAMNGGIYDESYAPLGLYIENGQQK--VALNLASGEGN----------FFIRPGGV 130 + +NGG + E L L NG+ A N F G Sbjct: 114 NNSTIVINGGFFYEG--SLSLIWRNGEMVCKNNDVTAEDWTNGPFWYPVLAAFCEMNDGS 171 Query: 131 F-------------YVAGDKVGIVRLDAFKTS-------KEIQFAVQSGPMLMENGVINP 170 F Y + + + + + GP+L+ +G I Sbjct: 172 FKSMWTYTTLSNVTYWYSEPSPVKSETTPDENFPSTGTVLNAKTGIGGGPVLLLDGNIKN 231 Query: 171 RIHPNV------ASSKIRNGVGINKHGNAVFLLSQQ--------ATNFYDFACYAKAKLN 216 + ++ R+ +GI + + + + A K L Sbjct: 232 TYEEEILSDIGATVNRPRSAIGITNDKKMILFVCEGDGMTTGVAGMTTENVANIMK-TLG 290 Query: 217 VEQLLYLDGTISH-MYMKGGAIPWQRYPFV 245 + LDG S M + G Sbjct: 291 CTDAINLDGGGSSCMLVNGQETIKTSDSSG 320 >UniRef50_C8PNM8 Putative uncharacterized protein n=1 Tax=Treponema vincentii ATCC 35580 RepID=C8PNM8_9SPIO Length = 306 Score = 110 bits (276), Expect = 4e-23, Method: Composition-based stats. Identities = 38/230 (16%), Positives = 67/230 (29%), Gaps = 32/230 (13%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKAN---GEAWGTLHALLADINS 81 L P A D A L V ++ V + +A D Sbjct: 64 LSPGI--EAADIADPQLPLIVHIVKIDLLNPSVSVITSEAALFKNTRGRIRGETTRDFAL 121 Query: 82 QGQVQMAMNGGIYDES-------YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVA 134 + A N + + +G++I + ++ N G F+ Sbjct: 122 RHNTIAAFNAAPFKTNSLLFSIYRTIVGIHITDFRRMSMPNERYGALLFYKDKTAR---- 177 Query: 135 GDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAV 194 I S ++++AV ++ NG I P + R VG+ G + Sbjct: 178 ----IIGSQTEDALSADVRYAVGGFWTILRNGTIVP---QKLHRRDSRTAVGLADSGKTL 230 Query: 195 FLL-SQQ-------ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGA 236 F+ + +F + A L + L LDG S + Sbjct: 231 FVAAVEGENKRKSRGLSFEETAMLM-QTLGADDALQLDGGSSSTLVLQEN 279 >UniRef50_B8HPJ4 Putative uncharacterized protein n=2 Tax=Cyanothece sp. PCC 7425 RepID=B8HPJ4_CYAP4 Length = 338 Score = 110 bits (275), Expect = 4e-23, Method: Composition-based stats. Identities = 32/200 (16%), Positives = 60/200 (30%), Gaps = 17/200 (8%) Query: 46 QAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIY----DESYAPL 101 +N T +K + L ++ + + Q+ +NG + L Sbjct: 73 HIVLINLATTGLKFRVTSPAADGSTALEKTIS-FTRRSKAQIGINGNFFQALSSTRAKVL 131 Query: 102 GLYIENGQQKVA-LNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGP 160 GL +G+ + N G NF F +G + V P Sbjct: 132 GLAASSGRVYSSWSNGYQGAINFSSNRTATFVTPPSGLGTTTVPLLT----PYNLVSGLP 187 Query: 161 MLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQ-------ATNFYDFACYAKA 213 +L++NG N R+ +G+ ++ + N + A + Sbjct: 188 VLVKNGQNVTVGVANPNEYAARSVIGLTQNQQLLLFAVDGPRSNVSTGMNQIELADLLIS 247 Query: 214 KLNVEQLLYLDGTISHMYMK 233 V + LDG S + Sbjct: 248 DFKVVHAVNLDGGGSSTLVF 267 >UniRef50_B4VYL6 Tat pathway signal sequence domain protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VYL6_9CYAN Length = 299 Score = 110 bits (274), Expect = 5e-23, Method: Composition-based stats. Identities = 32/257 (12%), Positives = 71/257 (27%), Gaps = 21/257 (8%) Query: 11 MITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWG 70 ++ + P+ A + + + + ++ + AN G Sbjct: 19 LVLGGAALAQGLGLVSPVAAESVSFRRSTILGVPLYQTHIDLTNPDTFIAIGLANNSTLG 78 Query: 71 TLHALLAD-----INSQGQVQMAMNGGIYDES--YAPLGLYIENGQQKVALNLASGEGNF 123 + + + + + +G + + +G + G + Sbjct: 79 NHQGAIGEESFGNMVRRYHAAVVASGTFFSKKDPKRLMGNMVSAGTFLKYSPWENYGTTL 138 Query: 124 FIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVI------NPRIHPNVA 177 +R G + +V D + F++ GP L+ G + P V Sbjct: 139 GLRVGNQPELVTARV-----DGKPDWGQHWFSLTGGPRLLRKGKVWLAPRSEGFTDPRVM 193 Query: 178 SSKIRNGVGINKHGN-AVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGA 236 R +G G V + + A +A + + + +DG S G Sbjct: 194 GVAHRCAIGFPASGKKLVLVTFLAPLPLWREAKVMRA-IGCSEAMNIDGGSSSALYHRGR 252 Query: 237 I-PWQRYPFVTMISVER 252 I + I V Sbjct: 253 ILVNPKRMLTNAIVVYD 269 >UniRef50_UPI0001BC7E39 hypothetical protein BacD2_08600 n=1 Tax=Bacteroides sp. D2 RepID=UPI0001BC7E39 Length = 660 Score = 110 bits (274), Expect = 6e-23, Method: Composition-based stats. Identities = 21/230 (9%), Positives = 55/230 (23%), Gaps = 34/230 (14%) Query: 29 FAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMA 88 F + + + + V ++ + ++ + A L+ + + Sbjct: 62 FIWYSYNKSAFNARQQVNVLEIDLSSPDYELEFVSAPQLDS------LSSVALKHDAVAG 115 Query: 89 MNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKT 148 +NG E NG + L G ++ G + Y + Sbjct: 116 INGTYELE----ASFVKVNGSIISPITLPEGHLRYWKHEGAIAYDGYKVEIGYGTKESYS 171 Query: 149 SKEIQFAVQSGPMLMENGVIN-PRIHPNVAS-----------------SKIRNGVGINKH 190 + P+L+++ ++ R V + + Sbjct: 172 YNSMPNIFSGAPVLIDDYQPVGKTFIGDITGINLNSLDGEDYRRHQGVRHPRTAVALTEQ 231 Query: 191 GNAVFLLSQQA------TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKG 234 + + + + + L +DG S Sbjct: 232 NKLLLVTVDGRADLAAGMTAKELTSFINQYFKPQHALNVDGGGSTTMYIR 281 >UniRef50_Q8RCE6 Putative uncharacterized protein n=5 Tax=Thermoanaerobacterales RepID=Q8RCE6_THETN Length = 815 Score = 110 bits (274), Expect = 6e-23, Method: Composition-based stats. Identities = 34/271 (12%), Positives = 69/271 (25%), Gaps = 40/271 (14%) Query: 5 LLIGKGMITLNLKRIFLALTLLPLFAVAADDCALS-DPTLTVQAYTVNPQTERVKMY--- 60 LI + ++ T P +++ TV +N + + Sbjct: 150 NLITDPASNGKMATFYIDKTGTPYIDYWTKKMSITLPDGTTVFLAAINKISSTFQYTVMY 209 Query: 61 ---WQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLA 117 W + + A + L+ + Q + + P E G Sbjct: 210 TRDWYRFSPGANENVPQLVEVVVDQNDTVIEV------RQGQPSTEIPEGGYVL------ 257 Query: 118 SGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVA 177 + ++ + ++I+ AV G +L++ G I P + Sbjct: 258 ---AASGDIGNLLLRLSPGDKIQKDITTNPPFEDIKMAVSGGTILVKGGKIYP-FTHEIK 313 Query: 178 SSKIRNGVGINKHGNAVF-LLSQQ----ATNFYDFACYAKAKLNVEQLLYLDGTISHMY- 231 R +G K V + + A + L L LDG S Sbjct: 314 GYAARTAIGYTKDKRYVLMVTVDGPPYRGMTQEELASLMLS-LGAYDALNLDGGGSTQMA 372 Query: 232 ----------MKGGAIPWQRYPFVTMISVER 252 + + ++V Sbjct: 373 VRPLGETEAVLYNYSPNSYERKVPNGVAVFS 403 Score = 61.2 bits (147), Expect = 3e-08, Method: Composition-based stats. Identities = 10/105 (9%), Positives = 32/105 (30%), Gaps = 3/105 (2%) Query: 43 LTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE-SYAPL 101 + + ++ + + + + + ++ + A+NG +D + + Sbjct: 85 ININILKIDLKDPYLDLSVIFSPSGIKE--RMPIREMANSYGAVAAINGDFFDTKTGFVI 142 Query: 102 GLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAF 146 G +++G F+I G Y+ + Sbjct: 143 GATVKDGNLITDPASNGKMATFYIDKTGTPYIDYWTKKMSITLPD 187 >UniRef50_A6TKB7 Exopolysaccharide biosynthesis protein n=1 Tax=Alkaliphilus metalliredigens QYMF RepID=A6TKB7_ALKMQ Length = 236 Score = 109 bits (273), Expect = 7e-23, Method: Composition-based stats. Identities = 32/232 (13%), Positives = 72/232 (31%), Gaps = 23/232 (9%) Query: 37 ALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADI--NSQGQVQMAMNGGIY 94 + T+ V + V + + + + + + A+NGG + Sbjct: 4 TMRRYDTTIHVLEV--PKQGVVIMPCLGDRTKRQPVQQIRHSYFEANGYKRIGAVNGGFF 61 Query: 95 DESYA-PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQ 153 D + P G++ + ++ + A + G ++ ++ K+ Sbjct: 62 DGNRTLPYGMFYVDSGFLLSESWAGDAFLELVHENGKLHIDDITANQLKTKY----KKAN 117 Query: 154 FAVQSGPMLMENGVINP---RIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFY----- 205 +A+ L+ G +N P S R +G + N +F++++ Sbjct: 118 WAISLSYSLVVGGKMNIMKGDKFPFTNQSHPRTLIG-DNQENYIFVVTEGRMTKEKGLTA 176 Query: 206 -DFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQR---YPFVTMISVERK 253 + A +L + DG S G I + I + K Sbjct: 177 VESARVML-ELGCNTAINADGGGSSAMDVEGKIQNKYYDNRAVADGILIYTK 227 >UniRef50_C4V4S8 Exopolysaccharide biosynthesis protein n=1 Tax=Selenomonas flueggei ATCC 43531 RepID=C4V4S8_9FIRM Length = 491 Score = 109 bits (272), Expect = 1e-22, Method: Composition-based stats. Identities = 28/198 (14%), Positives = 52/198 (26%), Gaps = 25/198 (12%) Query: 76 LADINSQGQVQMAMNGGIYDESYAPLG--LYIENGQQ--KVALNLASGEGNFFIRPGGVF 131 + + + P G I NG+ + + + G Sbjct: 287 VNAERGADNLIIYNRAYGRSTGTNPYGLEYVIRNGRVAEINTNDSLIPPDGYVVSVHGTL 346 Query: 132 Y-------VAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNV------AS 178 V ++ D + + +GP L+ENG ++ Sbjct: 347 MDAFAAAGVRVGDPAVLTEDLGEPWNRAVQVLGAGPRLVENGSVHVTAGEEQFPGDIRYG 406 Query: 179 SKIRNGVGINKHGNAVFLLSQQAT------NFYDFACYAKAKLNVEQLLYLDGTISHMYM 232 R VG+ + GN +F + +FA + V + LDG S Sbjct: 407 RAPRTAVGVTQKGNILFAVVDGRQSHSHGLTLTEFADLLV-QFGVRDAINLDGGGSSEIC 465 Query: 233 KGGAIPW-QRYPFVTMIS 249 G + + Sbjct: 466 ADGDVLNSPSDGSERAVG 483 Score = 59.3 bits (142), Expect = 1e-07, Method: Composition-based stats. Identities = 19/142 (13%), Positives = 36/142 (25%), Gaps = 4/142 (2%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQ 84 L A D +T +P RV+ A G G ++ I Sbjct: 164 LAAGLTQREYVYADEDGPVTAYFIEADPARYRVR--PALARGIIPG--RQTVSGIAQDTN 219 Query: 85 VQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLD 144 A+N + S +G+ +G + VF + Sbjct: 220 AAAAINASYFALSGELIGITKIDGTVVSSTYFDRSAFGVMPDNSFVFGTVSYNGTVKLDR 279 Query: 145 AFKTSKEIQFAVQSGPMLMENG 166 + + +++ N Sbjct: 280 TSLPVSGVNAERGADNLIIYNR 301 >UniRef50_B4AZH7 Putative uncharacterized protein n=1 Tax=Cyanothece sp. PCC 7822 RepID=B4AZH7_9CHRO Length = 298 Score = 109 bits (272), Expect = 1e-22, Method: Composition-based stats. Identities = 33/205 (16%), Positives = 65/205 (31%), Gaps = 37/205 (18%) Query: 68 AWGTLHALLADINSQGQVQMAMNGGIYDES-YAPLGLYIENGQQK----------VALNL 116 + ++ + A+NGG +D + I G+ +L Sbjct: 70 TLARSLETVENLAKKQGAMAAINGGFFDPNNGKTTSYIIHQGKIIADPKNNERLMKNPDL 129 Query: 117 ASGEGNFFIRPGGVFYVAGDKVGIV---RLDAFKTSKEIQFAVQSGPMLM---------- 163 R Y G V T ++ ++ +GP L+ Sbjct: 130 TRYLDKILNRSEWRRYQCGATVRYSISFHNQPTLTGCQLLDSLGAGPRLLPEMTAQTEGF 189 Query: 164 ---ENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQ--------ATNFYDFACYAK 212 NG + + + R +GI +G+ +++++ Q + + A + K Sbjct: 190 IDLVNGTMI-KDALGLKEPNARTAIGITANGDLIWIMAAQKAHSSRATGLSLLELAEFLK 248 Query: 213 AKLNVEQLLYLDGTISHMYMKGGAI 237 L V++ L LDG S + G Sbjct: 249 -TLGVQEALNLDGGSSSTFYYQGKT 272 >UniRef50_B4WFN8 Putative uncharacterized protein n=1 Tax=Synechococcus sp. PCC 7335 RepID=B4WFN8_9SYNE Length = 309 Score = 108 bits (271), Expect = 1e-22, Method: Composition-based stats. Identities = 30/268 (11%), Positives = 64/268 (23%), Gaps = 43/268 (16%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGE---------------AW 69 L + + ++ + + + Sbjct: 41 LFEGITYSR-YIEQQPRPQLIHLLEIDLSASGIVPFVTPGISKTSPKADREVDIEATQPH 99 Query: 70 GTLHALLADINSQGQVQMAMNGGIYDESY--APLGLYIENGQQKVALNLASGEGNFFIRP 127 TL + ++Q+A+N ++ P G+ + LA +G Sbjct: 100 ETLAQKTSSFLKTHRLQLAVNANFFNPFNETTPWQYSPREGELTNLVGLAISDGQIVSPG 159 Query: 128 G-GVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVG 186 + + D + + + AV + +EN P + Sbjct: 160 DKNYPALCFLEGRAEIRDEGVCAPDTKQAVAGLRLNLENR--PPPDVETIYKFYPVCVAA 217 Query: 187 INKHGN-AVFLLSQQATNFY-------DFACYAKAKLNVEQLLYLDGTISHMY------- 231 ++ G LL Y + A + +A L + LDG S Sbjct: 218 LDAEGTTLWLLLVDGKQPLYSEGMTRPEVADFLQA-LGATTAVQLDGGGSTTLAIASERD 276 Query: 232 ------MKGGAIPWQRYPFVTMISVERK 253 + IP + + Sbjct: 277 VAIINSVIHAKIPGNERAVANHLGFFAR 304 >UniRef50_B6V2M3 Gp2.43 n=1 Tax=Bacillus phage SPO1 RepID=B6V2M3_BPSP1 Length = 437 Score = 108 bits (271), Expect = 1e-22, Method: Composition-based stats. Identities = 25/216 (11%), Positives = 59/216 (27%), Gaps = 17/216 (7%) Query: 37 ALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE 96 ++ +K+ N + ++ + N I++ Sbjct: 28 TSKTDYFITHVPNLDKNGNLIKLRHGFQNDLINSGVGETARSFCNRHSASLVANASIWNT 87 Query: 97 S-YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFA 155 + G+ I++G+ + I+ + V + A Sbjct: 88 NNGLIRGVQIQDGKVIQDAKDTNSYT-LGIKSDNTLVMYPPSV----TAEQVLADGCIDA 142 Query: 156 VQSGPMLMENGVIN----PRIHPNVASSKIRNGVGINKHGNAVFLLSQQA------TNFY 205 + + ++++G NV RN + + + +FL + + Sbjct: 143 ITAFYPMIQDGAAFDLSGVTTVSNVTEHHPRNVIAQLPNKDLLFLTCEGRTKANQGMTYD 202 Query: 206 DFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQR 241 D A+ V LDG S + G + Sbjct: 203 DMIRILLAR-GVTTAYCLDGGGSSQTVVRGHLVNNP 237 >UniRef50_UPI0001C16068 conserved hypothetical protein n=2 Tax=Nostocaceae RepID=UPI0001C16068 Length = 613 Score = 108 bits (269), Expect = 2e-22, Method: Composition-based stats. Identities = 43/299 (14%), Positives = 77/299 (25%), Gaps = 54/299 (18%) Query: 4 QLLIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQK 63 Q G +TL +R + + + L Q + P R + W Sbjct: 320 QTQTGTAPLTLTAQRYSAMAAINGGYFNRNNQLPLGAVRQNDQWISG-PILNRGAIAWNY 378 Query: 64 ANGEAWGTLHALLADINSQGQ-----VQMAMNGGI-----------YDESYAPLG----- 102 +G L I Q + +N G + +Y PL Sbjct: 379 QGEFYFGRLSLNETLIVDQDNKQTSLPVLFLNSGYVQNGIARYTFAWGPNYVPLTNNETI 438 Query: 103 LYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVG-------------IVRLDAFKTS 149 + ++NG+ + G I G + + Sbjct: 439 ITVQNGKI---TKQSPPGGAISIPGDGYLLILRGTAVSKTSLLSVGTKVNLESSTTPGEF 495 Query: 150 KEIQFAVQSGPMLMENGVINPR------IHPNVASSKIRNGVGINKHGNAVFLLSQQAT- 202 + +GP+L++N I + +R+ + + N + Sbjct: 496 NTYPHIIGAGPLLIQNQRIVVDAKAEKFSQAFIKERAVRSAICTTNNDNLILAAVNNRVG 555 Query: 203 ----NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFV----TMISVERK 253 + A K+ L LDG S GG + + I V K Sbjct: 556 GWGPTLEEHAQLM-QKIGCTNALNLDGGSSTSLYLGGQLLDRFPNTAARVHNGIGVFLK 613 Score = 74.7 bits (182), Expect = 2e-12, Method: Composition-based stats. Identities = 24/160 (15%), Positives = 47/160 (29%), Gaps = 4/160 (2%) Query: 19 IFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLAD 78 + +T L + V +N +T + + N + L Sbjct: 272 VPRDITWSKGLRWQQKFINLDKDSFPVVWLEINRKTSGLNLQPILPNPQTQTGTAPLTLT 331 Query: 79 INSQGQVQMAMNGGIYDESYA-PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDK 137 + A+NGG ++ + PLG +N Q L G + + F Sbjct: 332 -AQRYSAMAAINGGYFNRNNQLPLGAVRQNDQWISGPILNRGAIAWNYQGEFYFGRLSLN 390 Query: 138 VGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVA 177 ++ K + + SG ++NG+ Sbjct: 391 ETLIVDQDNKQTSLPVLFLNSGY--VQNGIARYTFAWGPN 428 >UniRef50_A0YL57 Putative uncharacterized protein n=1 Tax=Lyngbya sp. PCC 8106 RepID=A0YL57_9CYAN Length = 620 Score = 108 bits (269), Expect = 2e-22, Method: Composition-based stats. Identities = 41/284 (14%), Positives = 70/284 (24%), Gaps = 44/284 (15%) Query: 7 IGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANG 66 +G + R + F L + + +P R + W Sbjct: 327 VGLAPLLKTASRSQALAAINGGFFNRNTLFPLGAIRRQGRWLS-SPILNRGAIGWTDQGE 385 Query: 67 EAWGTLHALLADINSQGQ--VQMAMNGGI-----------YDESYAP-----LGLYIENG 108 L I + G +N G + +Y P + N Sbjct: 386 IYLDRLTRFETLITATGNRFPIQHLNSGYVEAGISRYTSDWGSTYFPVLDREWAMITRNH 445 Query: 109 QQKVALNLASGEGNFF--IRPGGVFYVAGDK-----------VGIVRLDAFKTSKEIQFA 155 + L + + V K V I + Sbjct: 446 RIIEHLFRERPIKLPVPILDDDYLLVVRNHKEKVTQLPIGTEVKIESQTDPPIWETYPQI 505 Query: 156 VQSGPMLMENGVINPR------IHPNVASSKIRNGVGINKHGNAVFLLSQQAT-----NF 204 + +GP+L+++G I IR+ VG + + N Sbjct: 506 LAAGPLLLQSGEIVLDAPSERFSEAFSNQQAIRSAVGRTPDNKLLLVAVHNRPLGSGPNL 565 Query: 205 YDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMI 248 + A KL + L LDG S GG + + I Sbjct: 566 TELAQIL-QKLGAVEALNLDGGSSTSLYLGGELIDRPAQTAAPI 608 Score = 75.5 bits (184), Expect = 2e-12, Method: Composition-based stats. Identities = 22/147 (14%), Positives = 42/147 (28%), Gaps = 13/147 (8%) Query: 23 LTLLPLFAVAADDCALSD---------PTLTVQAYTVNPQTERVKMYWQKANGEAWGTLH 73 + +P + + V ++ ++ + + + L Sbjct: 271 ILWMPGLRWRQQYIEIPNSQPTASSLPNRFPVFWLEIDLTAPQLSLKPILSRNTSRVGLA 330 Query: 74 ALLADINSQGQVQMAMNGGIYDESYA-PLGLYIENGQQKVALNLASGEGNFFIRPGGVFY 132 LL S+ Q A+NGG ++ + PLG G+ + L G G Y Sbjct: 331 PLLKT-ASRSQALAAINGGFFNRNTLFPLGAIRRQGRWLSSPILN--RGAIGWTDQGEIY 387 Query: 133 VAGDKVGIVRLDAFKTSKEIQFAVQSG 159 + + A IQ Sbjct: 388 LDRLTRFETLITATGNRFPIQHLNSGY 414 >UniRef50_A1HRE9 Exopolysaccharide biosynthesis protein n=1 Tax=Thermosinus carboxydivorans Nor1 RepID=A1HRE9_9FIRM Length = 487 Score = 107 bits (268), Expect = 3e-22, Method: Composition-based stats. Identities = 34/268 (12%), Positives = 71/268 (26%), Gaps = 34/268 (12%) Query: 9 KGMITLNLKRIFLALTLLPLFAVAADDCALSD---PTLTVQAYTVNPQTERVKMYWQKAN 65 G I LK ++ PL A +T Q Y P +R+ + Sbjct: 225 TGEILGLLKLDGEIVSTPPLARTAMGIMPDGKIIMDQVTYQGYVQLPSGDRLSLDGVNRE 284 Query: 66 GEA----WGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEG 121 + + + ++ + G + +G A + Sbjct: 285 RGVDEIILYSSRYGVTTGTNIYGIEYVIAGDEVKAVKTNDSVIPADGFVLSAHGRQAQA- 343 Query: 122 NFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENG------VINPRIHPN 175 + + + + A+ +GPML++NG I Sbjct: 344 --------LAGLKVGDKVKIHQSLGPVWDKTVHALGAGPMLLKNGSIYLTTKIEEFGSDV 395 Query: 176 VASSKIRNGVGINKHGNAVFLLSQQA------TNFYDFACYAKAKLNVEQLLYLDGTISH 229 R +G+ K G + ++ + A +L + LDG S Sbjct: 396 AGGRAPRTALGLTKDGRVLLVVVDGRQPTSAGMTLLELA-LFLQELGAVDAMNLDGGGSS 454 Query: 230 MYMKGGAIPWQR-----YPFVTMISVER 252 + + + + ++V Sbjct: 455 EMVINDKVVNKPSDGRERKVGSALAVIS 482 Score = 45.8 bits (107), Expect = 0.001, Method: Composition-based stats. Identities = 18/113 (15%), Positives = 37/113 (32%), Gaps = 5/113 (4%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQ 84 ++P + ++ T++ + + ANG G ++ + + Sbjct: 156 VMPGLTYTSWLSGRPYGPVSAHILTIDLKQ-GFVLKPVLANGVVQG--LDTVSAMARACR 212 Query: 85 VQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDK 137 A+NG + + LGL +G+ LA I P G + Sbjct: 213 AVAAVNGSYFAPTGEILGLLKLDGEIVSTPPLA--RTAMGIMPDGKIIMDQVT 263 >UniRef50_C9LSB3 Putative secreted protein n=1 Tax=Selenomonas sputigena ATCC 35185 RepID=C9LSB3_9FIRM Length = 475 Score = 107 bits (268), Expect = 3e-22, Method: Composition-based stats. Identities = 23/213 (10%), Positives = 53/213 (24%), Gaps = 31/213 (14%) Query: 50 VNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQ 109 V+ + + + + + +G A+N ++G Sbjct: 273 VDAERGEDSLVIYNHYYGSTTRTNEYGQEYIVRGGRVAAVNSS--------DSPIPKDGL 324 Query: 110 QKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVIN 169 A V V + + + +GPML+++G+ + Sbjct: 325 VISVHGKA---------KDAFSQVKVGDAVRVAETIGAPWESLPTVIGAGPMLVKDGIAH 375 Query: 170 PRIHPN------VASSKIRNGVGINKHGNAVFLLSQQAT------NFYDFACYAKAKLNV 217 R G+ G+ + + + A + + Sbjct: 376 VTATEEEFPPDIARGRAPRTAFGVTAEGHYLLAVVDGRQPHSIGCTLQEMAEFML-QFGA 434 Query: 218 EQLLYLDGTISHMYMKGGAIPW-QRYPFVTMIS 249 Q + DG S + GG + + Sbjct: 435 VQAINFDGGGSSALVVGGELENSPSDGQERAVG 467 Score = 48.9 bits (115), Expect = 2e-04, Method: Composition-based stats. Identities = 16/108 (14%), Positives = 28/108 (25%), Gaps = 4/108 (3%) Query: 24 TLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQG 83 T P ++ + LT +P R K A ++ ++ Sbjct: 149 TPAPGLKLSTLKRLDARGRLTGWVLEADPARYRAVPVLAKGA----VPGRASVSAMSDMA 204 Query: 84 QVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVF 131 A+N + + LGL +G F F Sbjct: 205 GADAAINASYFAPNGEILGLLKMDGTIVGTTYFRRSAVGFAADGRAYF 252 >UniRef50_B7KAU9 Putative uncharacterized protein n=7 Tax=Chroococcales RepID=B7KAU9_CYAP7 Length = 644 Score = 107 bits (267), Expect = 4e-22, Method: Composition-based stats. Identities = 35/246 (14%), Positives = 63/246 (25%), Gaps = 47/246 (19%) Query: 52 PQTERVKMYWQKANGEAWGTLHALLADINSQGQ--VQMAMNGGI-----------YDESY 98 P R + W G L I + G + +N G + +Y Sbjct: 395 PILNRGAIAWNDRGQVKMGRLRLQETVITNGGNRLPVLYLNSGYVQSGMARYTRDWGATY 454 Query: 99 APLG-----LYIENGQQKVALNLA-SGEGNFFIRPGGVFYVAGDKVG------------I 140 PL + ++N Q +G+ I G + Sbjct: 455 TPLSDDELIITVQNNQVISQRQGGKAGQNVIPIPNDGYLLAIRKNSVPASALTIGTSLNL 514 Query: 141 VRLDAFKTSKEIQFAVQSGPMLMENGVINPR------IHPNVASSKIRNGVGINKHGNAV 194 + +GP+L+ NG I R+ + + G + Sbjct: 515 ESGTIPADFNNYPHILGAGPLLLLNGQIVLDVASEQFSKGFQNQKASRSAIATTRDGKLM 574 Query: 195 FLLSQQAT-----NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFV---- 245 + + + A ++ L L LDG S GG + + Sbjct: 575 VVAVHNRVGGSGASLPELAQILQS-LGAVDALNLDGGSSTSLALGGQLIDRSPVTAAKVH 633 Query: 246 TMISVE 251 I + Sbjct: 634 NGIGIF 639 Score = 77.4 bits (189), Expect = 4e-13, Method: Composition-based stats. Identities = 17/138 (12%), Positives = 40/138 (28%), Gaps = 4/138 (2%) Query: 23 LTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQ 82 +T P L + V ++ ++ + + +N + ++ I Sbjct: 304 ITWTPGLIWRQKIIPLKGDSFPVTWLDIDLKSPNIFLKPVTSNPDTLEGTEPIVT-IGRN 362 Query: 83 GQVQMAMNGGIYDESYA-PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIV 141 A+NGG ++ + PLG N + L G G + ++ Sbjct: 363 TTASAAINGGFFNRNNRLPLGAIRTNNRWVSGPILN--RGAIAWNDRGQVKMGRLRLQET 420 Query: 142 RLDAFKTSKEIQFAVQSG 159 + + + Sbjct: 421 VITNGGNRLPVLYLNSGY 438 >UniRef50_UPI0001BC335A hypothetical protein BcroD2_01203 n=1 Tax=Butyrivibrio crossotus DSM 2876 RepID=UPI0001BC335A Length = 366 Score = 107 bits (267), Expect = 4e-22, Method: Composition-based stats. Identities = 30/225 (13%), Positives = 69/225 (30%), Gaps = 27/225 (12%) Query: 34 DDCALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGG 92 +S + + + +P V + + E L ++ + +NGG Sbjct: 121 KIVEISGRSYYGKLMMIKDPSKVSVATIYPW-SDENKSKYGVTLGELVTNAGAIAGINGG 179 Query: 93 IYDESY----APLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAG-DKVGIVRLDAFK 147 Y P GL + NG+ + + G+ + + + + + +++ Sbjct: 180 EYCSDGNWGGRPKGLVVSNGELQYN-SPQWGDVMVGFNEDNILVIKDLNGMSVGQIEEMV 238 Query: 148 TSKEIQFAVQS----------GPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLL 197 ++ I+ V L+ NG + I+ + + + R +G G + + Sbjct: 239 KTERIRDCVSFKDIDDGDSNHFTKLIING-VATEINGSGSGANPRTCIGQRADGTVLMFV 297 Query: 198 SQQA-------TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGG 235 + D K + +DG S G Sbjct: 298 TDGRGASGHIGATAADLISVMK-EYGAVNAANIDGGSSSSMYYKG 341 >UniRef50_B3QZA6 Putative uncharacterized protein n=1 Tax=Chloroherpeton thalassium ATCC 35110 RepID=B3QZA6_CHLT3 Length = 280 Score = 107 bits (266), Expect = 5e-22, Method: Composition-based stats. Identities = 31/229 (13%), Positives = 66/229 (28%), Gaps = 26/229 (11%) Query: 44 TVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGL 103 + +NP+ K+ + + ++ A Q + A+N G++ Sbjct: 59 QIYVIRINPEHYAFKLMCASEHAKTPLSVKAW----CKQHGLISAINAGMFQADMLSAVS 114 Query: 104 YIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLM 163 ++N L+ F P + + Q Q M+ Sbjct: 115 LMKNFAHINNPRLSKDNTIFAFNPTKKDLPKAQIIDRTVQNYDALKSVYQSQFQGIRMIA 174 Query: 164 ENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKA-KLNVEQLLY 222 + P+ S +G + GN +F+ S+ +DF +++++ +Y Sbjct: 175 PGRKNVWQEQPDEWSIA---ALGSDGDGNILFIFSRSPYTVHDFINILLELPIDIQRAMY 231 Query: 223 LDGTISHMYMKGGAIP------------------WQRYPFVTMISVERK 253 LDG P +I + +K Sbjct: 232 LDGGAVAQLYFSNKHIEIDESGVYESVLTTPVSSQTIAPIPNVIGIVKK 280 >UniRef50_A5GW09 Putative uncharacterized protein SynRCC307_2165 n=1 Tax=Synechococcus sp. RCC307 RepID=A5GW09_SYNR3 Length = 563 Score = 106 bits (265), Expect = 6e-22, Method: Composition-based stats. Identities = 42/285 (14%), Positives = 76/285 (26%), Gaps = 39/285 (13%) Query: 1 MAHQLLIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMY 60 ++ Q ++G G + + + F L + P +R + Sbjct: 269 LSRQNMVGLGSLLGLARSQGAVAAINGGFFNRIQALPLGGLRDDGDWLSG-PILDRGAIA 327 Query: 61 WQKANGEAWGTLH--ALLADINSQGQVQMAMNGG----------------IYDESYAPLG 102 W + + L + A+N G + Sbjct: 328 WAPQELPRFSRVRLNETLISDAGKRIQLNAINSGWVSKGVAQYNSLWGPRYKAITGREEA 387 Query: 103 LYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGI----------VRLDAFKTSKEI 152 + ++ Q + A +R G VA + R K E+ Sbjct: 388 VLVQGQQVARRFSHAELSRGVGLRRGETLVVARGGAPLPLKAGDGVSLERSMVPKAFAEL 447 Query: 153 QFAVQSGPMLMENGVINPR------IHPNVASSKIRNGVGINKHGNAVFLLSQQA---TN 203 +Q GP+L+ G + + R+ VG + + + Q Sbjct: 448 PNLIQGGPLLLNQGKVVLNGKAERFSSAFMRQKAPRSVVGSDDELIWLLAVEGQGNAGPT 507 Query: 204 FYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMI 248 + A KL ++Q L LDG S M F I Sbjct: 508 LRETAELM-QKLGLKQALNLDGGSSTRLMVRNRGQSSGRGFGAAI 551 >UniRef50_B4WS35 Putative uncharacterized protein n=1 Tax=Synechococcus sp. PCC 7335 RepID=B4WS35_9SYNE Length = 687 Score = 106 bits (265), Expect = 6e-22, Method: Composition-based stats. Identities = 42/290 (14%), Positives = 76/290 (26%), Gaps = 45/290 (15%) Query: 2 AHQLLIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYW 61 A IG I KR + F + L Q + P R M W Sbjct: 388 ASNTAIGIEPIVTTAKRAQAIGAVNAGFFNRNNQLPLGAVRSAGQWISG-PILGRGAMAW 446 Query: 62 QKANGEAWGTLH--ALLADINSQGQVQMAMNGGI-----------YDESYAPLG-----L 103 + + + +A+N G + +Y P+ + Sbjct: 447 NDSGELVIDRFALSESVTTGVGEAFPILAVNSGYVKAGIGRYTEGWGSTYTPIVDNEIIV 506 Query: 104 YIENGQQK-VALNLASGEGNFFIRPGGVFYV-------------AGDKVGIVRLDAFKTS 149 ++N + +G + I G + + G V + Sbjct: 507 TVQNHEVIAQKSMGKAGSSSVPIPRDGGYLLALRSYRSAGQSFQPGTPVLLSSQSQPAVF 566 Query: 150 KEIQFAVQSGPMLMENGVINPR------IHPNVASSKIRNGVGINKHGNAVFLLSQQAT- 202 ++ + GP+L+ + I + + R VG G + Sbjct: 567 EQYPNMIGGGPLLVRDRNIVLNPQLEGFSTNFIQGAAPRTAVGKTSDGTWIIATMHDRVG 626 Query: 203 ----NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMI 248 + A K +L L LDG S GG + + + Sbjct: 627 GRGPTLTETAYIMK-QLGAVDALNLDGGSSSSLYLGGQLLNRHPRTAARV 675 Score = 66.2 bits (160), Expect = 1e-09, Method: Composition-based stats. Identities = 19/138 (13%), Positives = 37/138 (26%), Gaps = 4/138 (2%) Query: 23 LTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQ 82 + P ++ V + V P + + + A + ++ + Sbjct: 346 VAWAPGLRWRQQYINVNQHRFPVYMFIVRPNPDALTLRPIHAASNTAIGIEPIVTT-AKR 404 Query: 83 GQVQMAMNGGIYDESYA-PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIV 141 Q A+N G ++ + PLG GQ L G G G + + Sbjct: 405 AQAIGAVNAGFFNRNNQLPLGAVRSAGQWISGPIL--GRGAMAWNDSGELVIDRFALSES 462 Query: 142 RLDAFKTSKEIQFAVQSG 159 + I Sbjct: 463 VTTGVGEAFPILAVNSGY 480 >UniRef50_B0CAS6 Putative uncharacterized protein n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0CAS6_ACAM1 Length = 279 Score = 106 bits (264), Expect = 8e-22, Method: Composition-based stats. Identities = 33/231 (14%), Positives = 63/231 (27%), Gaps = 43/231 (18%) Query: 42 TLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQG-QVQMAMNGGIYDESYA- 99 TV + P R + +G +AD + +NGG +D + Sbjct: 32 QATVHVLRI-PNHPRYTVRLDVVDGL------QTVADFAQGTPKPVAVINGGYFDPANQL 84 Query: 100 PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYV-------------AGDKVGIVRLDAF 146 GQ S + + + Sbjct: 85 TTSYIRRGGQILADPTQNSRLVDNPDLKVYLPKILNRSEFRQYQCGAKTTYAITSYNQPI 144 Query: 147 KTSKEIQFAVQSGPMLM-------------ENGVINPRIHPNVASSKIRNGVGINKHGNA 193 + +A+ +GP L+ +G + R R+ VGI G Sbjct: 145 PPDCTLNYALGAGPQLLPQLTSQAEGFTDSVDGQVI-RDAIGSRQPNARSAVGITDKGEV 203 Query: 194 VFLLSQQ------ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIP 238 +++L +Q + + A + + + L LDG S + + Sbjct: 204 IWVLVEQQSATKPGLSLPELADFMEQQ-GAASALNLDGGSSSSLVYQDQVI 253 >UniRef50_A0YXN3 Putative uncharacterized protein n=1 Tax=Lyngbya sp. PCC 8106 RepID=A0YXN3_9CYAN Length = 775 Score = 106 bits (264), Expect = 9e-22, Method: Composition-based stats. Identities = 32/202 (15%), Positives = 62/202 (30%), Gaps = 33/202 (16%) Query: 84 QVQMAMNGGIYDESYAPLGL-----YIENGQQKVALNLASGEGNFFIRPGGVFYVAG--- 135 + +A + +SY PL L +EN Q + + I G Sbjct: 572 KAGIARYTPDWGKSYTPLTLNEVIITVENNQLSRQIESNDDQTPIEIPQNGYLLTFRSFR 631 Query: 136 ---------DKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNV------ASSK 180 K+ I + + +GP+L++ G I Sbjct: 632 SALSAFPLGGKIAITAKTTPSEFNQYPHILGAGPLLLQQGQIVVDAEAEGFNIWFAKQRA 691 Query: 181 IRNGVGINKHGNAVFLLSQQATN-----FYDFACYAKAKLNVEQLLYLDGTISHMYMKGG 235 IR+G+G+ +G+ + + + A +L + L LDG S + GG Sbjct: 692 IRSGIGVTANGDLLIVTVHNRVGGPGPDLTELAQ-LIQQLGAVEGLNLDGGSSTSLILGG 750 Query: 236 AIPWQRYPFV----TMISVERK 253 + + + + + Sbjct: 751 HLLNRTADTAARVHNGLGLFWR 772 Score = 63.1 bits (152), Expect = 8e-09, Method: Composition-based stats. Identities = 21/138 (15%), Positives = 39/138 (28%), Gaps = 5/138 (3%) Query: 23 LTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQ 82 + L+ V V + +K+ + + L+ I + Sbjct: 437 ILWTTGLNWRQQYLTLNQDRFPVVWLEVK-RNSGLKLQPIWTDKTQMKGTAS-LSQITNS 494 Query: 83 GQVQMAMNGGIYDESY-APLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIV 141 A+NGG ++ + PLG +N + L G G +A K+ Sbjct: 495 WGSLAAINGGYFNRNNLLPLGTIRQNNKWFSGPILN--RGVMAWNDTGTVKMARLKLTET 552 Query: 142 RLDAFKTSKEIQFAVQSG 159 + EI Sbjct: 553 LTLSSGREFEIDLLNTGY 570 >UniRef50_C6IP98 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=C6IP98_9BACE Length = 536 Score = 105 bits (263), Expect = 1e-21, Method: Composition-based stats. Identities = 37/300 (12%), Positives = 71/300 (23%), Gaps = 76/300 (25%) Query: 24 TLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQG 83 TL + L+ + T +V++ + + G Sbjct: 242 TLPAEIELYETTSNLNGSNFHAWYAIGDLSTGKVEVRVHIPS----SPATIDTQSASFNG 297 Query: 84 QVQMAMNGGIYDESYAPLGLYIEN----------------GQQKVALNLASGEGNFFIRP 127 + +NGG + + G+ + N G + G F + Sbjct: 298 DCYLLVNGGYFY-NGNHTGIAVINSIKSGSVSAVRGSLKTGDTEYNSMYNVTRGTFGVDA 356 Query: 128 GGVFYV------------------------AGDKVGIVRLDAFKTSKEIQFAVQSGPMLM 163 G V + S ++A+ +GP+L+ Sbjct: 357 SGKPNVVWTGTDASSNVFYFDRPLPSVKGENKYGIVTNENPTTAISWSPKYALSAGPVLL 416 Query: 164 ENGVINPRIHPNVASSKI--------------------RNGVGINKHGNAVFLLSQQATN 203 ++ I + R +G + G V + Sbjct: 417 KDKKIPFDFTETSKGTDYYLSNYEIIPYDIFGANVTPDRTAIGYREDGKVVIFICDGRIT 476 Query: 204 ------FYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQ----RYPFVTMISVERK 253 + A K L + LDG S + G V+ I +K Sbjct: 477 ASGGATLTELAQIMK-GLGCVGAINLDGGGSTGMVVGDEHLNDMTGGNRAVVSTIGFFKK 535 >UniRef50_A0LEU6 Putative uncharacterized protein n=1 Tax=Syntrophobacter fumaroxidans MPOB RepID=A0LEU6_SYNFM Length = 300 Score = 105 bits (263), Expect = 1e-21, Method: Composition-based stats. Identities = 31/209 (14%), Positives = 64/209 (30%), Gaps = 10/209 (4%) Query: 43 LTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLG 102 + ++P+ K+ N T + A+N G+Y E Sbjct: 76 YRITVVRIDPRYYAFKLINASENTREKMTAREWSRQF----NLIAAVNAGMYQEDGLASV 131 Query: 103 LYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPML 162 Y++N L + P G + D ++ + VQS M+ Sbjct: 132 GYMKNFDHVNNPRLGRDKTVLAFNPSGPDVPEVQIIDRECQDFNSLRQKYRTFVQSIRMI 191 Query: 163 MENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKA-KLNVEQLL 221 + R S+ +G ++ G + L + +DF L++++ + Sbjct: 192 SCDRKNVWRQQAGRWSTV---AIGTDETGKVLLLFCRSPITVHDFIEVLLTLPLSLQRAM 248 Query: 222 YLDGTISHMYMK--GGAIPWQRYPFVTMI 248 YL+G G + + + Sbjct: 249 YLEGGPQASLYLSTGKTTLERYGSWEPAL 277 >UniRef50_B8HPB3 Putative uncharacterized protein n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HPB3_CYAP4 Length = 304 Score = 105 bits (262), Expect = 1e-21, Method: Composition-based stats. Identities = 39/267 (14%), Positives = 68/267 (25%), Gaps = 42/267 (15%) Query: 6 LIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKAN 65 LIG + + P S P + + P ++ A Sbjct: 17 LIGSTTACTQTSTTASSAPVAPTPPQPLQYKVYSLPHSKIHTLVI-PAGSTYEVTAAIAP 75 Query: 66 GEAWGTLHALLADINSQGQVQMAMNGGIYDE-SYAPLGLYIENGQQKVALNLASGEG--- 121 LA Q Q +NGG +D + + G+Q Sbjct: 76 D------VQPLATFAQQHQAIAVLNGGFFDPVNGKSTSHVVLAGKQVANPQDNERLIQNP 129 Query: 122 ----------NFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMEN------ 165 N E+ A+ +GP L+ Sbjct: 130 DLIPYLPLILNRSELRNYRCAGQIRYEISRHDKPIPPGCELLMALGAGPQLLPQNTSVQE 189 Query: 166 ------GVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQ--------QATNFYDFACYA 211 G R R+ +G+ G+ V+L+ + + A + Sbjct: 190 GFMAYSGETITRDSLGSLYPNARSAIGLKADGSLVWLMVAERSDANQPGGLSLPELAQFM 249 Query: 212 KAKLNVEQLLYLDGTISHMYMKGGAIP 238 ++ L V + + LDG S + G Sbjct: 250 QS-LGVVKGMNLDGGSSASFYYQGQTH 275 >UniRef50_C2FS46 Putative uncharacterized protein n=2 Tax=Sphingobacterium spiritivorum RepID=C2FS46_9SPHI Length = 341 Score = 105 bits (262), Expect = 1e-21, Method: Composition-based stats. Identities = 35/284 (12%), Positives = 72/284 (25%), Gaps = 39/284 (13%) Query: 3 HQLLIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQ 62 Q +I I + + + +++ ++ ++ Sbjct: 63 TQSIISNTDIIKTFRLDSTTTVADGIVHTHIRYLNRLNLPVSMHVLEIDLSKPKLAAQAL 122 Query: 63 KANGEAWGTLHALLADINS------QGQVQMAMNGGIYDESYA----PLGLYIENGQQKV 112 E +L ++ G++ +A+NG S P G YI G+Q Sbjct: 123 GPFNEVLY-ATQILPEMAKYNESGSGGKMMVAINGDAVLTSGTTVNAPSGSYIRYGRQIK 181 Query: 113 ALNLASGEGN---FFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVIN 169 + F + GV ++ G++ I V L+ N + Sbjct: 182 TNTTTATAFTIPYFAVTKAGVPFI-GNRPSATYPAEAVDLNTIYHLVSGTNWLVFNNNLI 240 Query: 170 PRIHPNVASSKIRNGVGINKHGNAVFLLSQQ-------ATNFYDFACYAKAKLNVEQLLY 222 V R +GIN + ++ D K L + + Sbjct: 241 TSTTATV---SARTAIGINADKKVICVVVDGGDDAFSTGITLNDLGIVMK-TLGSSRAFF 296 Query: 223 LDGTISHMYMKGGA-------------IPWQRYPFVTMISVERK 253 +G +K + I + Sbjct: 297 TNGGNFSAMVKRKEDAKGLRWDMLNRPVNKTGSATANGIGFVLR 340 >UniRef50_C9KQW2 Putative secreted protein n=2 Tax=Veillonellaceae RepID=C9KQW2_9FIRM Length = 503 Score = 105 bits (262), Expect = 2e-21, Method: Composition-based stats. Identities = 28/200 (14%), Positives = 54/200 (27%), Gaps = 27/200 (13%) Query: 80 NSQGQVQMAMNGGIYDESYAPLG--LYIENGQQ--KVALNLASGEGNFFIRPGGVFYVAG 135 + + G + G+ + I G+ Sbjct: 305 RGADSLVIYNRAYGSSTGTNEYGREYIVRGGRVTDIRQNDSPIPADGVVISVHGMAADEL 364 Query: 136 DKVGI-----VRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVA------SSKIRNG 184 V + + + + + F + GP L+ENG ++ + R+ Sbjct: 365 GGVQVGDPVMIEENLGDGWQNMDFIIGCGPRLVENGRVHVTVDEEDFPADIRIGRAPRSA 424 Query: 185 VGINKHGNAVFLLSQQAT------NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIP 238 VGI K G + + D+A K + L LDG S + G + Sbjct: 425 VGITKDGRYLLAVVDGRQSHSVGLTLTDWAKLLV-KFGAQDALNLDGGGSSDLVVNGDVQ 483 Query: 239 WQRYP-----FVTMISVERK 253 + + +K Sbjct: 484 NSPSDGQERLVGDGLVLVKK 503 Score = 47.0 bits (110), Expect = 7e-04, Method: Composition-based stats. Identities = 16/109 (14%), Positives = 31/109 (28%), Gaps = 6/109 (5%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQ 84 + P ++ +P R + A G+ G ++ I ++ Sbjct: 178 IEPGLTEHTYVYWDDYGKVSAWLLEADPARYR--LVPTLAKGKIPG--REAVSGIVARAG 233 Query: 85 VQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYV 133 +N + + LG+ NGQ F +R G Sbjct: 234 GVAGINASYFAPAGDILGVTQINGQTVGT--TYYTRSAFGLRKDGTPVF 280 >UniRef50_Q8YKN4 All7259 protein n=2 Tax=Cyanobacteria RepID=Q8YKN4_ANASP Length = 245 Score = 105 bits (261), Expect = 2e-21, Method: Composition-based stats. Identities = 30/223 (13%), Positives = 57/223 (25%), Gaps = 39/223 (17%) Query: 45 VQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYA-PLGL 103 + P + A + + + + + N G +D + Sbjct: 2 AHILLI-PANSPFVVT------GALSAKVSTVEEFAQKHRAFAIFNAGFFDPANQKSTSY 54 Query: 104 YIENG----------QQKVALNLASGEGNFFIRPGGVFYVAGDKVG---IVRLDAFKTSK 150 + G + L F R Y+ G + ++ + Sbjct: 55 VVVTGQMVADPKDNERLVNNPQLKPYLNLIFNRSEFRRYLCGQTTRYDITLHNESPPANC 114 Query: 151 EIQFAVQSGPMLMENGVINPRIHPN---------VASSKIRNGVGINKHGNAVFLLS--- 198 + A+ +GP L+ P + R VGI G+ + ++ Sbjct: 115 RLVDAIGAGPRLLPKLTSVPEGFVDNAKGRDALLSKQLNARTAVGITSEGSIILVMVAQK 174 Query: 199 -----QQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGA 236 + A K KL + LDG S G Sbjct: 175 PSKPKNSGISLVQLADLMK-KLGASAAMNLDGGSSSSLYYNGK 216 >UniRef50_A7M0H0 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=A7M0H0_BACOV Length = 354 Score = 104 bits (260), Expect = 2e-21, Method: Composition-based stats. Identities = 28/252 (11%), Positives = 65/252 (25%), Gaps = 49/252 (19%) Query: 32 AADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNG 91 +S V ++ + + K+ + NG++ + +NG Sbjct: 95 YERYDDVSKAQQIVNVLEIDLLSNKYKVEFTYNNGDSL-------STTAQVRGAIGGING 147 Query: 92 GIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSK- 150 G E NG + L G ++ G ++ +GI+ + Sbjct: 148 GYEQE----AIYIRINGTNISEVTLPEGHLRYWKHDGALYSDGKSDIGIIYGGRNGKAAI 203 Query: 151 ------EIQFAVQSGPMLMENGVI------------------NPRIHPNVASSKIRNGVG 186 ++ + S P L+++ + R V Sbjct: 204 DTYKQHSAKYLLASAPTLIDDYNPLGETFVGNYTMEQLESFDYEDYRRHQGVRHPRTVVA 263 Query: 187 INKHGNAVFLLSQQA-------TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGA--- 236 + + + + + + + + K N + L +DG S G Sbjct: 264 VTEDKDLLLVTIDGRWAGKAEGMSAKEVTLFLKKHFNPQYALNMDGGGSTTMYVKGKGAA 323 Query: 237 ---IPWQRYPFV 245 + Sbjct: 324 KTDVVNYPTDNG 335 >UniRef50_A3DHF5 Ig-like, group 2 n=3 Tax=Clostridium thermocellum RepID=A3DHF5_CLOTH Length = 929 Score = 104 bits (259), Expect = 3e-21, Method: Composition-based stats. Identities = 28/222 (12%), Positives = 58/222 (26%), Gaps = 36/222 (16%) Query: 52 PQTERVKMYWQKANGEAWGTLHALLADINS---QGQVQMAMNGGIYDESYAPLGLYIENG 108 P W A + + D+ + + P +NG Sbjct: 182 PFQYTDITIWTSAWDKYSLGVSQQYPDLVEVVVDNGTVVEI------RQGLPAVEIPQNG 235 Query: 109 QQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVI 168 ++ + + G ++ V ++I+ AV +L+++G I Sbjct: 236 YVIISRGANAQFLLQHFKVGDPVEISFSTV--------LDWQKIEMAVTGSAILVKDGQI 287 Query: 169 NPRIHPNVASSKIRNGVGINKHGN-AVFLLSQQA------TNFYDFACYAKAKLNVEQLL 221 + ++ R G +K G + + + A + L + Sbjct: 288 PEKFSYEISGVHPRTAAGTSKSGKELILVTVDGRQAASKGMTQRELANLMLS-LGAYNAI 346 Query: 222 YLDGTISHMYMKG-------GAIPWQRYP----FVTMISVER 252 LDG S + + T I V Sbjct: 347 NLDGGGSTSMVSRIPGTNDLKVVNTPSDGALRSISTAIGVFS 388 >UniRef50_A6TVJ8 Exopolysaccharide biosynthesis protein n=2 Tax=Alkaliphilus RepID=A6TVJ8_ALKMQ Length = 942 Score = 104 bits (259), Expect = 4e-21, Method: Composition-based stats. Identities = 33/228 (14%), Positives = 68/228 (29%), Gaps = 27/228 (11%) Query: 49 TVNP---QTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYA-PLGLY 104 ++ + + + WGT + + +N + + P Sbjct: 168 RIDLSSINKYTDRYQYTMLIDKNWGTHTPGYNEKLLDMVEVIVINDEVAEIRRRQPATGI 227 Query: 105 IENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLME 164 NG VA +G G + + +I+ A+ G +L++ Sbjct: 228 PSNGYVLVASQTETGWGRAGHLFDNLKVGDRLTLHQEIQPNLN---QIELALGGGTLLVK 284 Query: 165 NGVINPRIHPNVASSKIRNGVGINKHGN-AVFLLSQQATNFY------DFACYAKAKLNV 217 +G + +VA + R+ +GI++ + + +NFY + L Sbjct: 285 DGQA-AHLTQSVAGAHPRSAIGISRDRKQVILVTIDGRSNFYHGVDGRELGNILL-GLGA 342 Query: 218 EQLLYLDGTISHMYMKGG-------AIPWQR----YPFVTMISVERKG 254 + +DG S + I V I+V K Sbjct: 343 HDAIIMDGGGSTTMIARELGEAKPQIINNPSEGVERRIVNGIAVLSKA 390 Score = 43.9 bits (102), Expect = 0.005, Method: Composition-based stats. Identities = 19/161 (11%), Positives = 50/161 (31%), Gaps = 11/161 (6%) Query: 1 MAHQLLIGKGMITLNLKRIFLAL----TLLPLFAVAADDCALSDPTLTVQAYTVNPQTER 56 ++ LL+ G + + ++ L + A ++ + Sbjct: 13 LSTTLLLSMGTSVGFASGVINVADQQQQISNGVTHRNIVRFNANGWLNINALYIDLTADS 72 Query: 57 VKMYWQKANGEAWGTLHALLADIN-SQGQVQMAMNGGIYDESY--APLGLYIENGQQKVA 113 +++ K++ L+ + ++ V AMNG + +PLG I+ G+ + Sbjct: 73 LELDLLKSSKGVATK--ETLSTMVNARENVVGAMNGDFFYMLTPDSPLGAMIKGGEMISS 130 Query: 114 L--NLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEI 152 FF+ + + + ++ Sbjct: 131 PIDRSDQNYATFFVDTNKQAFASYWDRDLYITTDGGKRIDL 171 >UniRef50_UPI00016C4EC3 hypothetical protein GobsU_32169 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4EC3 Length = 279 Score = 103 bits (258), Expect = 4e-21, Method: Composition-based stats. Identities = 28/227 (12%), Positives = 66/227 (29%), Gaps = 33/227 (14%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGE-AWGTLHALLADINSQG 83 L + + + A ++ + + NG+ T + + Sbjct: 33 LFSGVELTD-LIGDTPRLMKGHAVRIDLKAAGIGFLATPGNGDRPGETDGLKTSTFLKRH 91 Query: 84 QVQMAMNGGIYDESYA-------PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGD 136 ++Q+A+N + + +G+ + G+ + + Sbjct: 92 KLQLAINAAPFGPIHKDEEKEQDVVGVQVSGGKLVSPAQPGYPA---------LLLAKDN 142 Query: 137 KVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGN-AVF 195 + I I+ AV ++++ G + S R G++ G V Sbjct: 143 RARIAAPPFDLE--GIENAVGGFHIVLKGGEVLT----GDKSIHPRTAAGVSADGKTLVL 196 Query: 196 LLSQQAT-------NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGG 235 L+ + + KA L + + LDG + + G Sbjct: 197 LVIDGRQKDFSDGATTAEVGEWLKA-LGCAEGINLDGGGTTTLVVAG 242 >UniRef50_A9V9Y5 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9V9Y5_MONBE Length = 298 Score = 103 bits (256), Expect = 7e-21, Method: Composition-based stats. Identities = 28/194 (14%), Positives = 56/194 (28%), Gaps = 20/194 (10%) Query: 64 ANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAP---LGLYIENGQQKVALNLASGE 120 G H +++ + A N G + + P G I +G Sbjct: 86 GIGPNGCEHHRTVSEQAKLLTCEYATNAGFF--DFTPPACEGNLITDGVSIQHPCPNQVN 143 Query: 121 GNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVA--- 177 + GD++ I + ++ + L+ +G Sbjct: 144 FGRKLGMTCPDSTQGDRIVIGYMQEADI-ADLTELITGRGWLIRHGQAYTNQSREFTPTD 202 Query: 178 ----SSKIRNGVGINKHGNAVFLLSQQ------ATNFYDFACYAKAKLNVEQLLYLDGTI 227 R +G+ K G + L+ + ++ A +L+V Q + LDG Sbjct: 203 SFVSEKAPRTALGLTKDGAILSLVVDGIEEELVGPDLHEMASLLL-ELDVVQAINLDGGG 261 Query: 228 SHMYMKGGAIPWQR 241 S + G + Sbjct: 262 SSTAVYQGHVFNMP 275 >UniRef50_B7KAR9 Polysaccharide deacetylase n=3 Tax=Cyanothece RepID=B7KAR9_CYAP7 Length = 627 Score = 102 bits (254), Expect = 1e-20, Method: Composition-based stats. Identities = 29/248 (11%), Positives = 73/248 (29%), Gaps = 27/248 (10%) Query: 29 FAVAADDCALSDPTLTVQAYTVNPQTER--VKMYWQKANGEAWGTLHALLADINSQGQVQ 86 + + Y + + + +++I + + Sbjct: 355 WGGYPAPRYDGGFNFNTRIYKQDFTVNDTQLTLITGGRPNTIHADTRYQVSEIIAGTGAE 414 Query: 87 MAMNGGIYD----ESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVR 142 A++GG + +S +G + G + I+ + ++ D+V V Sbjct: 415 AAVDGGFFSLESLDSNVMIGPVL--GHNTGEFIPGNAWEIPRIKGRPLVLMSSDRVRFVP 472 Query: 143 LD------------AFKTSKEIQFAVQSGPMLMENGVINPRIHP----NVASSKIRNGVG 186 D ++I A L+ + P + +++ R G Sbjct: 473 FDPNKHNTYEGVISEATEGEKITDAFVGAAWLVRDNQPQPPEAFGELFDFEAARHRAFWG 532 Query: 187 INKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGA--IPWQRYPF 244 IN+ G V +++ + +L + + LD S G + + P Sbjct: 533 INQAGQPVIGVTKTMVGSVELGEIL-HQLGLRDAVMLDSGASTSLAYQGESLVGYTPRPV 591 Query: 245 VTMISVER 252 ++++ Sbjct: 592 PHVVALFP 599 >UniRef50_D2AUR4 Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase-like protein n=1 Tax=Streptosporangium roseum DSM 43021 RepID=D2AUR4_STRRD Length = 487 Score = 102 bits (254), Expect = 1e-20, Method: Composition-based stats. Identities = 27/254 (10%), Positives = 68/254 (26%), Gaps = 20/254 (7%) Query: 6 LIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVK-MYWQKA 64 +G ++ L + ++ +++ + A + +K + Sbjct: 224 PMGISVVGGRLLSEAVPG--RSGLVISGRKVRITELKTVITAIPADGAKTEIKGINRAAG 281 Query: 65 NGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENG-QQKVALNLASGEGNF 123 E + G ++ ++ G + G + + Sbjct: 282 ADELVLYTEEFGTKTAADGGAEIVVDAQGRIVKARAAGGVVPRGTYVLHGTGIMATWLLE 341 Query: 124 FIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVAS----- 178 + V + + + A + E + G L+ NG + + + Sbjct: 342 HAQETSVMKLDTKVIDLRTERAVPLTPE-THIMGGGVGLLRNGRVRISAKADGHASVVMM 400 Query: 179 --SKIRNGVGINKHGNAVFLLSQQAT-------NFYDFACYAKAKLNVEQLLYLDGTISH 229 R VG+ K G + + + A + L +Q + DG S Sbjct: 401 LRRHPRTMVGVTKSGGLILATVDGRNPGVTVGASMVEAAQLMR-WLGAKQAINFDGGGST 459 Query: 230 MYMKGGAIPWQRYP 243 + G + + Sbjct: 460 AMVVGHKVINRPSD 473 Score = 50.8 bits (120), Expect = 4e-05, Method: Composition-based stats. Identities = 21/173 (12%), Positives = 45/173 (26%), Gaps = 19/173 (10%) Query: 39 SDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDES- 97 + ++ V+P+ R ++ + +NGG ++ Sbjct: 160 TTGPWDMRVLMVDPRAFRGSFKTSV---GTSVAKRETTTSMSKLTKAIAGVNGGFFNIHT 216 Query: 98 -----YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEI 152 P+G+ + G+ R + + I A K I Sbjct: 217 PKALQGDPMGISVVGGRLLSEAVPGRSGLVISGRKVRITELKTVITAIPADGAKTEIKGI 276 Query: 153 QFAVQSGPMLM----------ENGVINPRIHPNVASSKIRNGVGINKHGNAVF 195 A + +++ +G + K R G+ G V Sbjct: 277 NRAAGADELVLYTEEFGTKTAADGGAEIVVDAQGRIVKARAAGGVVPRGTYVL 329 >UniRef50_B5RQG1 Uncharacterized conserved protein n=20 Tax=Borrelia RepID=B5RQG1_BORRA Length = 269 Score = 102 bits (254), Expect = 1e-20, Method: Composition-based stats. Identities = 28/217 (12%), Positives = 54/217 (24%), Gaps = 31/217 (14%) Query: 31 VAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEA----WGTLHALLADINSQGQVQ 86 + + V + + +K K + + + +V Sbjct: 29 LQPKYEIIKGSFQESNYVIVKIKNKDLKFIISKPIYDTKMNNYYFKGQTTSQFLISNKVD 88 Query: 87 MAMNGGIYDESYA---PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRL 143 +A+N Y P G+YI N + G I+ Sbjct: 89 IAINTSPYTIKGTMFYPNGIYIYNKKLISHAKKDQGIII------------IKNNQIILN 136 Query: 144 DAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHG-NAVFLLSQQA- 201 K + L++NG N R +G +K + + + Sbjct: 137 PKHNEIKNSDYGFGGFFSLIKNGKYTKNFKEN---KHPRTIIGTDKENKHLYLITVEGRG 193 Query: 202 ------TNFYDFACYAKAKLNVEQLLYLDGTISHMYM 232 + + A V + LDG S + Sbjct: 194 TNNSKGISLNE-AIDLSLSYGVTNSINLDGGGSSTLV 229 >UniRef50_C6IEV9 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=C6IEV9_9BACE Length = 343 Score = 102 bits (254), Expect = 1e-20, Method: Composition-based stats. Identities = 32/266 (12%), Positives = 62/266 (23%), Gaps = 55/266 (20%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVK----MYWQKANGEAWGTLHALLADIN 80 L + L+ + + + + + + G ++ Sbjct: 59 LPDYIHIYKSPENLAGKKAIAYIAVADMSKAKFEVLGDIAFSQEANGYGGKSIHTPSEFY 118 Query: 81 SQGQVQMAMNGG-IYDESY--APLGLYIENGQQKVA----------LNLASGEGNFFIRP 127 + + +NGG + + L I GQ G F Sbjct: 119 ESSKAPVVINGGLFFYSAGFYYSQNLVIREGQLLAPNQNYYSKDWVTMWYPTLGAFCQMK 178 Query: 128 GGVFYV------------------------AGDKVGIVRLDAFKTSKEIQFAVQSGPMLM 163 G F + + E + +L+ Sbjct: 179 DGTFQTTWTYQASDGINYCYPAPADNDINKDPLQAPSSTFPNGAKALEATTGIGGVTVLL 238 Query: 164 ENGVINPRIHPN-----VASSKIRNGVGINKHGNAVFLLSQQA--------TNFYDFACY 210 G I AS++ R +GI + + + + + A Sbjct: 239 RAGEIKNTYVEEMLDISAASNQPRTAIGITTNKKMIIFVCEGRNMTEGVAGLTTANVAKV 298 Query: 211 AKAKLNVEQLLYLDGTISHMYMKGGA 236 K L + L LDG S + G Sbjct: 299 MKD-LGCTEALNLDGGGSSCMLVNGK 323 >UniRef50_D2J8B1 Putative uncharacterized protein n=1 Tax=Staphylococcus aureus RepID=D2J8B1_STAAU Length = 569 Score = 102 bits (254), Expect = 1e-20, Method: Composition-based stats. Identities = 22/238 (9%), Positives = 54/238 (22%), Gaps = 29/238 (12%) Query: 30 AVAADDCALSDPTL--TVQAYTV---NPQTERVKMYWQKANGEAWGTLHALLADINSQGQ 84 A ++ + T + + + +K+ +G+ H + + Sbjct: 133 AYYSEITTVKGRNFETTYYITHIPHKDKEGNLIKIKRGI-SGDINKPDHITPREFAKRTG 191 Query: 85 VQMAMNGG-IYDESYAPLGLYIENGQQKVALNLASGEGN---FFIRPGGVFYVAGDKVGI 140 N P G+ I NG+ ++ I V Sbjct: 192 ATFVSNASTGSGTQLLPHGVQIYNGKIIKSVKDYDALEQRWSLAIGEDNTLRTYAPNV-- 249 Query: 141 VRLDAFKTSKEIQFAVQSGPMLMENGVINP---RIHPNVASSKIRNGVGINKHGNAVFLL 197 ++ + +++ I PN R+ + + + +F Sbjct: 250 --NAETLLAQGETNVLSGFGAFIQDNKITVKPGDFSPNTDVKHPRSVIAQLPNKDIIFFA 307 Query: 198 SQQA-----------TNFYDFACYAKAKLN-VEQLLYLDGTISHMYMKGGAIPWQRYP 243 + + LDG S ++ + Sbjct: 308 CDGRENNNKGFVEKGMTLQEVGETLFKHYGEITLAYNLDGGGSTAHVLRSTKLNKSQD 365 >UniRef50_C9RVV6 Putative uncharacterized protein n=3 Tax=Geobacillus RepID=C9RVV6_GEOSY Length = 652 Score = 102 bits (253), Expect = 2e-20, Method: Composition-based stats. Identities = 24/135 (17%), Positives = 43/135 (31%), Gaps = 21/135 (15%) Query: 136 DKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVF 195 D V I ++ A+ L+ +G + P V R VGI+K+GN + Sbjct: 359 DAVEISLQYDQPEWSGVKEALGGRYRLVADGKVQPFSIEGV---HPRTAVGIDKNGNVML 415 Query: 196 LLSQQA-------TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGG-----AIPWQR-- 241 ++ + A +L + LDG S ++ + + Sbjct: 416 IVVDGRQPAYSQGMTLNELAKLM-HELGAVDAMTLDGGGSSTFVVRQPNGQLKVENKPSD 474 Query: 242 ---YPFVTMISVERK 253 P + V K Sbjct: 475 GFARPVANALLVVYK 489 Score = 62.4 bits (150), Expect = 1e-08, Method: Composition-based stats. Identities = 18/126 (14%), Positives = 40/126 (31%), Gaps = 8/126 (6%) Query: 21 LALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADIN 80 ++ + + + V ++ ER+ + +N + G + + Sbjct: 137 VSTRIASGVEKEEMEIVGARGKQHVYKLDIDTSNERMAIETALSNDQVLGI--EPVLEQA 194 Query: 81 SQGQ-----VQMAMNGGIYDESYAPLGLYIENGQ-QKVALNLASGEGNFFIRPGGVFYVA 134 + V A+NG + + +P L + G+ A+ F I G + Sbjct: 195 KRYDGRDGIVLAAVNGDYFKQDGSPTDLMVHRGEIVITNTTPAAERTIFGISADGKPMIG 254 Query: 135 GDKVGI 140 V I Sbjct: 255 NPDVQI 260 >UniRef50_B3PTF7 Putative uncharacterized protein n=3 Tax=Rhizobium RepID=B3PTF7_RHIE6 Length = 325 Score = 100 bits (250), Expect = 4e-20, Method: Composition-based stats. Identities = 37/233 (15%), Positives = 70/233 (30%), Gaps = 22/233 (9%) Query: 1 MAHQLLIGKGMITLNLKRIFLALTLL---PLFAVAA-DDCALSDPTLTVQAYTVNPQTER 56 ++ + + + + L P F VA A + V+P R Sbjct: 56 LSASMRLALSPVIPAVL--PGPLVWQEPEPGFEVAELPVLADGREVDRIFLSRVDPARFR 113 Query: 57 VKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNL 116 + + L + +NG +D+ P +I G Sbjct: 114 FVTHNAAPGDKGIDEWEKTLP------NAVLIVNGSYFDKHGRPDTPFISEGIAMGPRQY 167 Query: 117 ASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLM-ENGVINPRIHPN 175 + G F + D A+ S P+L+ ++G + + Sbjct: 168 DARAGAFTADKDTAEIR-----DLSHQDWQTAFVGASNAMVSYPLLIGDDGQTHVNVK-- 220 Query: 176 VASSKIRNGVGINKHGNAVFLLSQQA-TNFYDFACYAK-AKLNVEQLLYLDGT 226 R V + G V +++A + A + K + LN++ L LDG Sbjct: 221 SRWLANRTFVAKDDLGRVVIGTTKEAFFSLDRLAQFLKTSPLNLKVALNLDGG 273 >UniRef50_A7M0G9 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=A7M0G9_BACOV Length = 621 Score = 100 bits (249), Expect = 4e-20, Method: Composition-based stats. Identities = 35/184 (19%), Positives = 61/184 (33%), Gaps = 18/184 (9%) Query: 74 ALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYV 133 A + I + A+NG S P + + KVA + S + GV + Sbjct: 103 AKTSMIAKDKKALFAINGSY-SISGNPSTFTMVDKVVKVASTIES-----ASKVNGVIAI 156 Query: 134 AGDKVGI----VRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNV-ASSKIRNGVGIN 188 + D E + A+ SGPML+ G + + R+ +GI Sbjct: 157 DAEGSVDVKSCTFSDYTDVEDEYESALASGPMLLMEGKVCSFPQDAIYTQRMARSVIGIT 216 Query: 189 KHGNAVFLLSQQATNFY------DFACYAKAKLNVEQLLYL-DGTISHMYMKGGAIPWQR 241 G + L A + A + L ++ + L DG+ S ++ G + Sbjct: 217 AQGKMMLLTIDGAITGNADGATLEEAAFIAKTLGMKNAVCLADGSSSTLWTSGKGVVNHP 276 Query: 242 YPFV 245 Sbjct: 277 VGNG 280 >UniRef50_B2UNL7 Putative uncharacterized protein n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UNL7_AKKM8 Length = 249 Score = 100 bits (249), Expect = 5e-20, Method: Composition-based stats. Identities = 36/159 (22%), Positives = 62/159 (38%), Gaps = 12/159 (7%) Query: 81 SQGQVQMAMNGGIY--DESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKV 138 + +NGG + D PLGL +++G++ L S + + GG + + Sbjct: 70 RKSPCVAGVNGGFFSADAGGTPLGLVVQDGKRLSPLATGSFAVSGVVYEGGRDGLTLVRS 129 Query: 139 GIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLS 198 ++R + +Q A+Q GP L+ENG ++ S R + + +S Sbjct: 130 SVLR--RMRRLPAMQAAIQGGPFLVENGSAVKGLNA--QKSTYRTFIATDGGRRWCIGVS 185 Query: 199 QQATNFYDFACYAKAK--LN---VEQLLYLDGTISHMYM 232 + A + A L VE L LDG S + Sbjct: 186 SS-LTLKELAAWLAAPGALGNFRVETALNLDGGSSSAFW 223 >UniRef50_UPI0001744905 hypothetical protein VspiD_09365 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001744905 Length = 251 Score = 100 bits (249), Expect = 5e-20, Method: Composition-based stats. Identities = 41/231 (17%), Positives = 71/231 (30%), Gaps = 27/231 (11%) Query: 23 LTLLPLFAVAADDCALSDPTLTVQAYTVN-----PQTERVKMYWQKANGEAWGTLHAL-- 75 + + L A L A V P + + A +H Sbjct: 3 VIVETLPAQWTVRSQAGPVKLPGGAIQVKKQLAGPTEAELNLILFTAGKYEMRVVHQPER 62 Query: 76 -----LADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGV 130 LA + NGG + + PLGL + +G SG GGV Sbjct: 63 DKGVSLATKMRELGAIAGCNGGYFTPDFLPLGLEVSDGV-------RSGTFQRSSLLGGV 115 Query: 131 FYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKH 190 F V + +V D + K + +Q+GP L+ G+ + + R + ++ Sbjct: 116 FLVRHGRPAMVWKDEYIEQKGVTQLLQAGPRLVHAGLPVAGLEA--TKRRARTFILTDQA 173 Query: 191 GNAVFLLSQQATNFYDFACY-----AKAKLNVEQLLYLDGTISHMYMKGGA 236 GN + + + ++ V++ L DG S Sbjct: 174 GNWALGTCKS-VTLRELSDLLSTRALLPEVTVKRALNFDGGNSTGLWWRAE 223 >UniRef50_B1XK15 Putative uncharacterized protein n=1 Tax=Synechococcus sp. PCC 7002 RepID=B1XK15_SYNP2 Length = 595 Score = 99.7 bits (247), Expect = 7e-20, Method: Composition-based stats. Identities = 32/265 (12%), Positives = 68/265 (25%), Gaps = 34/265 (12%) Query: 9 KGMITLNLKRIFLALTLLPLFAVAADDC-ALSDPTLTVQAYTVNPQTERVKMYWQKANGE 67 G++ N + A + N + V++ Sbjct: 331 LGIVQGNRALRSGPILNRGAVAWDNAGRWEFDRLKVETDIVAGNGERVGVELI-----NS 385 Query: 68 AWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRP 127 + A L D + A++ I G + + +G+ ++ I Sbjct: 386 GYVKAGAALYDRAWGSRYTTAVDHEIVLTVMTSGG---RDQVIRQETAGKAGQNSYEIPQ 442 Query: 128 GGVFYVAGD------------KVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPR---- 171 GG V + + + V GP+L++NG + Sbjct: 443 GGYLLVFRSFRTGAAKFPVGVTLERRPRFTPNSFATLPNIVGGGPLLLKNGQVVLNGQAE 502 Query: 172 --IHPNVASSKIRNGVGINKHGNAVFLLSQQ------ATNFYDFACYAKAKLNVEQLLYL 223 S R+ + + + + ++A + +L L L Sbjct: 503 QFSTAFNIQSASRSAIARTRDNKILLVTLHGAAEETAGATLNEWANILR-RLGATDALNL 561 Query: 224 DGTISHMYMKGGAIPWQRYPFVTMI 248 DG S G + + + Sbjct: 562 DGGGSSALALGANLSDRHPTTAGRV 586 Score = 64.7 bits (156), Expect = 3e-09, Method: Composition-based stats. Identities = 23/181 (12%), Positives = 58/181 (32%), Gaps = 12/181 (6%) Query: 22 ALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINS 81 ++ LP ++ A S + V ++P ++++ + L LL Sbjct: 252 SVQWLPGVQWRQENFAASSGPVRVTWLEIDPTQRQLQLKPITPDNNTIVGLAPLLIQ-AD 310 Query: 82 QGQVQMAMNGGIYDESYA-PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGI 140 Q A+N G ++ + PLG ++ + + + + + G D++ + Sbjct: 311 TNQAIAAINAGFFNRNNQYPLG-IVQGNRALRSGPILNRGAVAWDNAGR---WEFDRLKV 366 Query: 141 VRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQ 200 + + G L+ +G + + R ++ + S Sbjct: 367 ETDIVAGNGERV------GVELINSGYVKAGAALYDRAWGSRYTTAVDHEIVLTVMTSGG 420 Query: 201 A 201 Sbjct: 421 R 421 >UniRef50_UPI0001C43112 hypothetical protein BpOF4_05820 n=1 Tax=Bacillus pseudofirmus OF4 RepID=UPI0001C43112 Length = 762 Score = 99.7 bits (247), Expect = 8e-20, Method: Composition-based stats. Identities = 37/228 (16%), Positives = 65/228 (28%), Gaps = 23/228 (10%) Query: 41 PTLTVQAYTVNPQTE-RVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYA 99 T ++N R + + EA + DI + A N D+ Sbjct: 179 NGQTYDLGSINGARGEREAVLYTSGYKEATTGASQWVTDIVVTNTNKSAANFSFGDKITG 238 Query: 100 PLGLYIENGQQKVALNLASG---EGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAV 156 + G+ A G N + V+ V ++ +F + Sbjct: 239 TVSEIRRLGEGANATIPKDGFVISANGGPFRDALTGVSVGDELTVEASINDAWRDAEFIL 298 Query: 157 QSGPMLMENGVI---NPRIHPNVASSKIRNGVGINKHG-NAVFLLSQQAT-------NFY 205 +GP L+ NG P R VG + G + Sbjct: 299 ATGPTLVRNGQTSISMSTSSPFARERAPRTAVGASSDGTKLFLVTIDGRQSGYSNGVTIP 358 Query: 206 DFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVERK 253 + A Y ++ + + LDG S + YP+ +SV + Sbjct: 359 ELAAYMRS-IGAHNAINLDGGGSTTMVAR-------YPWADHVSVVNR 398 Score = 47.3 bits (111), Expect = 5e-04, Method: Composition-based stats. Identities = 20/121 (16%), Positives = 40/121 (33%), Gaps = 10/121 (8%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERV--KMYWQKANGEAWGTLHALLADINSQ 82 L ++ + + S VQ V + V ++Y+ G T +A+ Sbjct: 49 LAEGVSLVRESYSGSGRNQAVQVLDVQYRNPNVGLELYYPTPIGRVQTTSQQAMANTYEN 108 Query: 83 GQVQMAMNGGIYDE-SYAPLGLYIENGQQKV-------ALNLASGEGNFFIRPGGVFYVA 134 +V A+N Y+ + P+ L +EN + + F + G + Sbjct: 109 HRVVGAVNASFYNMSNGMPVNLLVENNKILNYGVLSNDQGGPVNAPFAFGVNRNGALTLT 168 Query: 135 G 135 Sbjct: 169 D 169 >UniRef50_P74396 Slr0280 protein n=1 Tax=Synechocystis sp. PCC 6803 RepID=P74396_SYNY3 Length = 610 Score = 99.3 bits (246), Expect = 9e-20, Method: Composition-based stats. Identities = 37/248 (14%), Positives = 73/248 (29%), Gaps = 47/248 (18%) Query: 52 PQTERVKMYWQKANGEAWG--TLHALLADINSQGQVQMAMNGGI-----------YDESY 98 P R + W +G +L ++ + Q +N G + SY Sbjct: 364 PILNRGAIAWNDQGQTTFGRLSLSEIITTGSGQRLTANYLNSGYVQRGIARYTPAWGPSY 423 Query: 99 APLG-----LYIENGQQKVALN-LASGEGNFFIRPGGVFYVAGD------------KVGI 140 PL ++N Q +G+ I G + + + Sbjct: 424 IPLSDNEQVYVVQNSQVTAQYPLPKAGQQQMPIPSDGYLIIDRGNQIPAGVLAVGTTLNV 483 Query: 141 VRLDAFKTSKEIQFAVQSGPMLMENGVINPR------IHPNVASSKIRNGVGINKHGNAV 194 + + +GP+L++ G + R+ + ++++GN + Sbjct: 484 NGRSTPEAFNAFPNGMGAGPLLIDQGRMVLNATGEGFSSAFQQQRASRSAIAVDRNGNII 543 Query: 195 FLLSQQAT-----NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRY----PFV 245 + S + +FA +L L LDG S GG + + Sbjct: 544 LVASHNRVGGAGASLGEFAQIL-QQLGAVNALNLDGGSSTSLALGGQLLDRSPVTAARVS 602 Query: 246 TMISVERK 253 I V + Sbjct: 603 NAIGVFVR 610 Score = 62.4 bits (150), Expect = 1e-08, Method: Composition-based stats. Identities = 19/131 (14%), Positives = 37/131 (28%), Gaps = 2/131 (1%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQ 84 +S V T+NP++ + + AN L+ I + Sbjct: 275 WTEGITWQQRFVNISGGQFPVTTVTINPRSPGISLRPLMANPTMAQGTAPLVT-IARDQR 333 Query: 85 VQMAMNGGIYDESYA-PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRL 143 +A+N G ++ + PLG + L G + + F I Sbjct: 334 AAVAINAGFFNRNNQLPLGAVWSQQNWRSGPILNRGAIAWNDQGQTTFGRLSLSEIITTG 393 Query: 144 DAFKTSKEIQF 154 + + Sbjct: 394 SGQRLTANYLN 404 >UniRef50_B8J2Y6 Putative uncharacterized protein n=2 Tax=Desulfovibrio RepID=B8J2Y6_DESDA Length = 429 Score = 99.3 bits (246), Expect = 1e-19, Method: Composition-based stats. Identities = 34/201 (16%), Positives = 62/201 (30%), Gaps = 9/201 (4%) Query: 36 CALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYD 95 L+D + A ++P + + +G L+ Q + A+N +Y Sbjct: 138 FQLTDSEALLTALRIDPAHFDFILCARSQDGGNLRPLNQW----AEQYGLTAAINASMYL 193 Query: 96 ESY-APLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQF 154 G +NG + F P V + + + Sbjct: 194 PDGITSTGYMRQNGHHNNKRVVQRFGAFFVAGPDSPDLPGAAIVDRDDPQWEQRIGQYRL 253 Query: 155 AVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFA-CYAKA 213 +Q+ M + I P I + V + G +FL +Q Y FA Sbjct: 254 VIQNYRMTSADRRIL--WSPGGPHYSI-SAVAQDGDGRILFLHCRQPVEAYAFAQQLLHL 310 Query: 214 KLNVEQLLYLDGTISHMYMKG 234 LNV ++Y++G + Sbjct: 311 PLNVRTVMYVEGGGQAGLLVR 331 >UniRef50_Q4ZC55 ORF005 n=1 Tax=Staphylococcus phage EW RepID=Q4ZC55_9CAUD Length = 576 Score = 99.3 bits (246), Expect = 1e-19, Method: Composition-based stats. Identities = 27/240 (11%), Positives = 50/240 (20%), Gaps = 29/240 (12%) Query: 32 AADDCALSDPTLTVQAYTV---NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMA 88 D + + + + +K+ + + + + Sbjct: 139 EVDYTKGRKFDTSYSIVRIPHKDRKGNVIKLKRGIEGSDKSHPEPVTATEFSKRSGATYV 198 Query: 89 MNGGIYDESYAPL-GLYIENGQQ---KVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLD 144 N S L G I NGQ I V Sbjct: 199 SNASTGSGSRVMLHGEQIYNGQILETVKDYEPLKTRWTLAIADDNTLVSFPPGV----TA 254 Query: 145 AFKTSKEIQFAVQSGPMLMENGVIN---PRIHPNVASSKIRNGVGINKHGNAVFLLSQQA 201 K V L+ +G I N S R + + + +F Sbjct: 255 KEIKDKGYNNTVSGFGPLITDGQIVYKKGDYSTNSEESHPRQVICQLDNKDLLFFTCDGR 314 Query: 202 ----------TNFYDFACYAKAKL-----NVEQLLYLDGTISHMYMKGGAIPWQRYPFVT 246 + K++ ++ LDG S + G + Sbjct: 315 VKSQGLLQKGMTLSEVIETLKSEYPIGSNGIKFAYNLDGGGSSSSVLRGRRLNKVTDNNN 374 >UniRef50_UPI00019088BB hypothetical protein RetlC8_25680 n=2 Tax=Rhizobium etli RepID=UPI00019088BB Length = 332 Score = 99.0 bits (245), Expect = 1e-19, Method: Composition-based stats. Identities = 29/227 (12%), Positives = 63/227 (27%), Gaps = 26/227 (11%) Query: 13 TLNLKRIFLALTLLPLFAVAADDCAL----------SDPTLTVQAYTVNPQTERVKMYWQ 62 + + + + ++P R ++ Sbjct: 67 MQLALKSPVPAVQPGSLVWQEPEIGFEVAELPVLADGREVDRIFLSRIDPMRFRFVVHNA 126 Query: 63 KANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGN 122 + + + + +NG YD P +I G + G Sbjct: 127 SQGDK---GIDEWEHALPK---AVLIVNGSYYDMHGRPDTPFISEGVAMGPRQYDAKAGA 180 Query: 123 FFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLM-ENGVINPRIHPNVASSKI 181 FF + D A+ S P+L+ ++G + + Sbjct: 181 FFADAASADIR-----DLTHQDWGSALAGATNAMVSYPLLIGDDGQTHVNVK--SRWLAN 233 Query: 182 RNGVGINKHGNAVFLLSQQA-TNFYDFACYAK-AKLNVEQLLYLDGT 226 R V + G + +++A + A + K + L+++ L LDG Sbjct: 234 RTFVAKDGSGRILIGTTKEAFFSLDRLAEFLKASPLDLKVALNLDGG 280 >UniRef50_B0JW05 Polysaccharide deacetylase family protein n=4 Tax=Chroococcales RepID=B0JW05_MICAN Length = 616 Score = 98.6 bits (244), Expect = 2e-19, Method: Composition-based stats. Identities = 33/268 (12%), Positives = 71/268 (26%), Gaps = 26/268 (9%) Query: 6 LIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAY--TVNPQTERVKMYWQK 63 L+ G + R + + V + + + Sbjct: 327 LLTIGRFGQSRTREIVP----QAWGGDPLPRRDGGFNFATTIQKREVEFSSNTLVLISGG 382 Query: 64 ANGEAWGTLHALLADINSQGQVQMAMNGGIYD----ESYAPLGLYIE-NGQQKVALNLA- 117 + +I + + A++GG + +S +G + NG Sbjct: 383 IPKTIHADSRYQVEEIIAGTEAIAAVDGGFFSLKELDSNQMIGPVLSENGGFIPGYEGEI 442 Query: 118 ---SGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKE----IQFAVQSGPMLMENGVINP 170 G I V Y+ D L+ + A + L+++G Sbjct: 443 DKLEGRPLVIITDRWVRYLPFDPARHNTLEGIAAEAGDDLKVTDAFVAAAWLVKDGQPQS 502 Query: 171 RIHP----NVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGT 226 + + R GIN+ G V +S+ + + + LD Sbjct: 503 LESFGTLYGFDALRHRAFWGINQAGQPVIGVSRDPIDSMALGELLVQA-GFREAVMLDSG 561 Query: 227 ISHMYMKGG--AIPWQRYPFVTMISVER 252 S G + + P ++++ Sbjct: 562 ASTSLAYQGQSQVHYTPRPVPHVVALFP 589 >UniRef50_C7IFA0 Exopolysaccharide biosynthesis protein n=1 Tax=Clostridium papyrosolvens DSM 2782 RepID=C7IFA0_9CLOT Length = 385 Score = 98.6 bits (244), Expect = 2e-19, Method: Composition-based stats. Identities = 29/220 (13%), Positives = 65/220 (29%), Gaps = 25/220 (11%) Query: 47 AYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAP----LG 102 V+ + R + ++ E G + + + + + Sbjct: 150 VLIVDKKGARFETFYSNITLEHKGNKIKINDMNRIGKNNDIVLYNDKFGSTNRAEIKNTT 209 Query: 103 LYIENGQQK--------VALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQF 154 + ++N V + +F+ + G K G + ++ Sbjct: 210 IIVDNNVITTLVESTKEVNIRKGMNVISFYGGKESIPEKMGLKAGDKVNIRMEPYLGYRY 269 Query: 155 -AVQSGPMLMENGVINPRIHP----NVASSKIRNGVGINKHGNAVFLLSQQA-------T 202 A + G ML+++G + + R +GI G + L++ Sbjct: 270 QAYECGSMLVKDGKTVVPERDKWAGTLGNRDPRTVIGIKTDGKIIMLVADGRQPGYSEGM 329 Query: 203 NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRY 242 + Y KL V + LDG S + G++ + Sbjct: 330 TGKEMGEYLV-KLGVRDVAMLDGGASSQMIINGSLRNRPS 368 Score = 68.1 bits (165), Expect = 3e-10, Method: Composition-based stats. Identities = 18/124 (14%), Positives = 39/124 (31%), Gaps = 3/124 (2%) Query: 29 FAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMA 88 + + +P+ ERV+ + +G L+DI + + A Sbjct: 61 VQYKHKTEIIKGNKQEIYMLEFDPRDERVEFKPALSFDNIFGF--EKLSDICKRNEAYAA 118 Query: 89 MNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKT 148 +NGG + + P G+ +GQ + + G F + + Sbjct: 119 INGGFFYQFGEPTGMVAIDGQML-TASTGLSPVLIVDKKGARFETFYSNITLEHKGNKIK 177 Query: 149 SKEI 152 ++ Sbjct: 178 INDM 181 >UniRef50_B5YE82 Putative uncharacterized protein n=2 Tax=Dictyoglomus RepID=B5YE82_DICT6 Length = 691 Score = 98.6 bits (244), Expect = 2e-19, Method: Composition-based stats. Identities = 29/238 (12%), Positives = 71/238 (29%), Gaps = 31/238 (13%) Query: 44 TVQAYTVNPQ-TERVKMYWQKANGEAWGT------LHALLADINSQGQVQMAMNGGIYDE 96 + + +N + + A G+ + + ++ + + Sbjct: 122 IIDIFQINLKIKIGENIIPVNAINSPRGSDNLNLFTKYFGKETQIRENASAGIDIEVVLK 181 Query: 97 S-----YAPLGLY--IENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTS 149 G+ I G +K + + + + I + Sbjct: 182 DKIPSLGKTSGIVSNIYYGVKKTPIKENTCIISLGGTALKYLPLFSIGKEIEIITECNPP 241 Query: 150 KEIQFAVQSGPMLMENGVINPRIHPN-------VASSKIRNGVGINKHGNAVFLLSQQA- 201 ++ A+ GP+L++NG I V S R +GI K+ + F++ + Sbjct: 242 IPLKEAIGGGPILLKNGDIVLGNTDELAFDNNIVNSRHPRTIIGI-KNNSIYFIVIEGRK 300 Query: 202 -----TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQ--RYPFVTMISVER 252 + + K ++ + + +DG S + G + Q P ++ Sbjct: 301 ENSAGVSLKEACEILK-EMGINDAINMDGGGSSQKLIWGRLVNQKVERPVPVAFGIKN 357 Score = 44.6 bits (104), Expect = 0.003, Method: Composition-based stats. Identities = 17/104 (16%), Positives = 37/104 (35%), Gaps = 13/104 (12%) Query: 32 AADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNG 91 A + LT + P ++M + + L+++ + +NG Sbjct: 33 TAIYEKIVTDNLTYHVTKIEP-DPFLEMETIVSQNK--------LSEVYKYESYDLIING 83 Query: 92 GIYDE-SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVA 134 +D ++ P+GL ++NG+ L G F + + Sbjct: 84 NFFDPKTFEPVGLVVKNGELIH---LPIKRGVFGLTFDNKPIID 124 >UniRef50_B5Y710 Copper amine oxidase N-domain family n=2 Tax=Coprothermobacter proteolyticus DSM 5265 RepID=B5Y710_COPPD Length = 485 Score = 98.6 bits (244), Expect = 2e-19, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 56/204 (27%), Gaps = 10/204 (4%) Query: 58 KMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLA 117 + W + + + L +N G + NG + ++ +G Sbjct: 284 RTMWGENSIRVFTPLRGATTRLNVDGINVIVRNGEVVEQVTGTNVPIPPDGYVIHLGGTE 343 Query: 118 SGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNV- 176 + F + Y V I + +GP L+ NG I + Sbjct: 344 VRFKDRFEVGTRLSYRDIYDVRNSSNPEMWQEGVIWGTLSAGPRLITNGEITLDPASELL 403 Query: 177 ------ASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHM 230 R+ +GI ++ + + D A K L + LDG S Sbjct: 404 DIPKITGQPLTRSALGITQNNELLMVTV-SKCTIQDLATIMKD-LGAYNAMNLDGGASTS 461 Query: 231 YMKGGA-IPWQRYPFVTMISVERK 253 G + + V + Sbjct: 462 LYANGKFLATPTRKISNALMVLPR 485 Score = 50.4 bits (119), Expect = 5e-05, Method: Composition-based stats. Identities = 18/134 (13%), Positives = 39/134 (29%), Gaps = 7/134 (5%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQ 84 + A+S T TV V +K+ A + + + + Sbjct: 139 IETGVESYTTTVAVSTGTATVNVVKVFLNDPTIKLEIVNAQDQI--GVLEPFESMVKRKN 196 Query: 85 VQMAMNGGIYDES-----YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVG 139 A+NG + + P + NG+ + F +A K+ Sbjct: 197 PLAAINGTFFQVADTSLPMEPAANLVINGRIEHLGTPEKYASTFAFTQDNQVDIANVKMK 256 Query: 140 IVRLDAFKTSKEIQ 153 + + + E++ Sbjct: 257 LQGEYTYIPNPELE 270 >UniRef50_Q5N4C8 Putative uncharacterized protein n=2 Tax=Synechococcus elongatus RepID=Q5N4C8_SYNP6 Length = 605 Score = 98.2 bits (243), Expect = 2e-19, Method: Composition-based stats. Identities = 39/286 (13%), Positives = 75/286 (26%), Gaps = 44/286 (15%) Query: 5 LLIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKA 64 L G ++ + A+ + F + L Y+ P R M W Sbjct: 305 SLEGTQVLQAVARDRGAAIAINAGFFNRNNRLPLGAIRRDNIWYSG-PILNRGAMAWNDQ 363 Query: 65 NGEAWG--TLHALLADINSQGQVQMAMNGGIYDESY----------------APLGLYIE 106 L L + +A+N G + + ++ Sbjct: 364 GEVLIDRLGLQETLQLSSGTRIPLVALNSGYVRAGAARYTEAWGNSYQTILDNEVVVTVQ 423 Query: 107 NGQQKVALNLASGEGN-FFIRPGGVFYV------------AGDKVGIVRLDAFKTSKEIQ 153 + N F I G V G + +++ Sbjct: 424 GDRVVSQSQADKAGSNRFTIPRNGYLIVLRSANSLRTSLVNGTTIQVLQQAQPSQFDRFP 483 Query: 154 FAVQSGPMLMENGVINPRIHPNVASSK------IRNGVGINKHGNAVFLLSQQ-----AT 202 A+ GP+L+++G + S R+ +G+ G V + + + Sbjct: 484 HALGGGPLLVKSGRVVVNPQAEGFSRAFEIEAAPRSAIGLMPDGRLVLVAAHEQNQGQGP 543 Query: 203 NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMI 248 A +L V L DG S + G + + + Sbjct: 544 TLPQMAAIM-QQLGVVDALNFDGGSSTSLIVNGQLVNRARGSAARV 588 Score = 66.6 bits (161), Expect = 8e-10, Method: Composition-based stats. Identities = 18/149 (12%), Positives = 38/149 (25%), Gaps = 8/149 (5%) Query: 18 RIFLALTLLP----LFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLH 73 LT P + + T V ++ + V++ A + Sbjct: 251 SNPAPLTPAPPDLAGTQLQQRQVTVDGATFPVFVIQLDLRQPNVRLAPIWAGNGSLEGT- 309 Query: 74 ALLADINSQGQVQMAMNGGIYDESYA-PLGLYIENGQQKVALNLASGEGNFFIRPGGVFY 132 +L + +A+N G ++ + PLG + L G G Sbjct: 310 QVLQAVARDRGAAIAINAGFFNRNNRLPLGAIRRDNIWYSGPILN--RGAMAWNDQGEVL 367 Query: 133 VAGDKVGIVRLDAFKTSKEIQFAVQSGPM 161 + + + T + Sbjct: 368 IDRLGLQETLQLSSGTRIPLVALNSGYVR 396 >UniRef50_B7IEY1 Putative uncharacterized protein n=1 Tax=Thermosipho africanus TCF52B RepID=B7IEY1_THEAB Length = 535 Score = 98.2 bits (243), Expect = 3e-19, Method: Composition-based stats. Identities = 28/166 (16%), Positives = 44/166 (26%), Gaps = 19/166 (11%) Query: 103 LYIENGQQKVALNLASGEG-----NFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQ 157 I N Q + + + I+ A+ Sbjct: 366 FVISNNQIISKEYVEKVPKDSMVLLITKKYDKYLKNIEVGSKVNLTINSDFPFPIKHAIG 425 Query: 158 SGPMLMENGVINPRIHPN--------VASSKIRNGVGINKHGNAVFLLSQQ-----ATNF 204 +GP+L+ENG S R + I K G F++ + N Sbjct: 426 AGPLLIENGKKLIDSDEEKLRYGNGLALSKTSRTIIAITKEGKVDFIVIEGYNDSPGMN- 484 Query: 205 YDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISV 250 YD A + + LDG S + + Q I V Sbjct: 485 YDIATEFLLEKGYFYAMMLDGGGSSAMVIQDEVVNQDGTIQRGIPV 530 Score = 67.4 bits (163), Expect = 5e-10, Method: Composition-based stats. Identities = 12/91 (13%), Positives = 30/91 (32%), Gaps = 5/91 (5%) Query: 24 TLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQG 83 TL ++ + ++P+ V++ ++ L ++ Sbjct: 213 TLKDGLEWERKIEVFNEKKYLINYLHIDPKK--VEILPIISSNGI--GTRQDLREMLKNN 268 Query: 84 QVQMAMNGGIYDESYA-PLGLYIENGQQKVA 113 +N +D S P+ L I++G+ Sbjct: 269 NCIAGINANYFDPSTNIPIDLVIKDGKLLSD 299 >UniRef50_Q8DHU5 Tll1850 protein n=1 Tax=Thermosynechococcus elongatus BP-1 RepID=Q8DHU5_THEEB Length = 575 Score = 97.4 bits (241), Expect = 4e-19, Method: Composition-based stats. Identities = 37/291 (12%), Positives = 78/291 (26%), Gaps = 45/291 (15%) Query: 6 LIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKAN 65 L+G + +R A + F L + P R + W Sbjct: 284 LVGLATVPELAQRWQAAAAINGGFFNRDRQAPLGAIRREGNWLSG-PILNRGAIGWDDRG 342 Query: 66 G------------EAWGTLHALLADINSQGQVQMAMN-----GGIYDESYAPLGLYIENG 108 ++ + Q +A+ G ++ + + + + N Sbjct: 343 QIVVGRLSLQQRVRTPTGTVPIVTFNSGYVQAGLALYTPSWGGSYQGKTGSEVVITVRNE 402 Query: 109 QQKVALNLASGEGNFFIRPGGVFYVAGDK-----------VGIVRLDAFKTSKEIQFAVQ 157 Q + + G + V + I V Sbjct: 403 QVVGQQPINKDQTVPIPSEGFLLVARNFNSALANFPPGAAVQLETTAVPAAFNRIPNIVG 462 Query: 158 SGPMLMENGVINPRIHPN------VASSKIRNGVGINKHGNAVFLLSQQAT-----NFYD 206 +GP+L+E G + A + R+ +G G+ V++ + + Sbjct: 463 AGPLLVEQGRVVLNAALEQFGAGLDAQAAPRSAMGNRSDGSIVWVTTHNRIGGMGPTLAE 522 Query: 207 FACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRY----PFVTMISVERK 253 +A +L + + LDG S GG + + + V + Sbjct: 523 WAQI-VHRLGLINAVNLDGGSSTALYLGGVLVDRHGVTTTRVNNALGVFWQ 572 Score = 73.2 bits (178), Expect = 9e-12, Method: Composition-based stats. Identities = 19/105 (18%), Positives = 34/105 (32%), Gaps = 2/105 (1%) Query: 20 FLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADI 79 + P L V +NPQ +++ + + L A + ++ Sbjct: 235 PRTIQWAPGLRWQQQTVILGTRQFPVDLLIINPQQPGLRLRPLEISPTTLVGL-ATVPEL 293 Query: 80 NSQGQVQMAMNGGIYDESYA-PLGLYIENGQQKVALNLASGEGNF 123 + Q A+NGG ++ PLG G L G + Sbjct: 294 AQRWQAAAAINGGFFNRDRQAPLGAIRREGNWLSGPILNRGAIGW 338 >UniRef50_D1B6I7 N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase-like exopolysaccharide biosynthesis protein n=1 Tax=Thermanaerovibrio acidaminovorans DSM 6589 RepID=D1B6I7_THEAS Length = 486 Score = 97.0 bits (240), Expect = 5e-19, Method: Composition-based stats. Identities = 39/221 (17%), Positives = 69/221 (31%), Gaps = 25/221 (11%) Query: 56 RVKMYWQKANGEAWGTLHALLA-----DINSQGQVQMAMNGGIYDESYAPL-GLYIENGQ 109 +++ + + ++ + GG Y L L +++G Sbjct: 268 DGSVFFGDGSASFGVSNGEWTLPIGDFNVPPKNGNLSIFYGGAYRPGNQALLSLSVKDGI 327 Query: 110 QKVALNLASGEGNFFIRPGGVF--YVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGV 167 + A R GD + +VR AF + + +Q GPM++EN Sbjct: 328 VQDEPQGADFTLLANGRAAEALGSLNIGDTLQLVRRFAFPAFEACRLVIQGGPMIVENRR 387 Query: 168 INPRIHPNVAS----SKIRNGVGINKHGNAVFLLSQQA------TNFYDFACYAKAKLNV 217 R S R VGI++ G VF++ + A A + + Sbjct: 388 YVNRSEGLSRSIRERRHPRTLVGIDEQG-LVFMVIDGRNGHSSGVTLEEAANLALEE-GL 445 Query: 218 EQLLYLDGTISHMYMKGGAIPW-----QRYPFVTMISVERK 253 L LDG S + G + P I + + Sbjct: 446 VAALNLDGGGSSQMIWRGVTVNIPSDGKERPLPYGIGLFPR 486 Score = 40.0 bits (92), Expect = 0.070, Method: Composition-based stats. Identities = 17/103 (16%), Positives = 32/103 (31%), Gaps = 7/103 (6%) Query: 36 CALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYD 95 P L V+ +V + ++G A ++ ++ + +NGG Y Sbjct: 183 EESGGPKLYYAVLRVDTSKVQVDPVFA----GSYGMGRAPMSFLSQMSKAVAMVNGGYYY 238 Query: 96 ESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKV 138 +Y P+G + G G I G + Sbjct: 239 SAY-PIGTMVHRG--IPLGRPIMGRSAVGITQDGSVFFGDGSA 278 >UniRef50_UPI0001C30FBA N-acetylglucosamine-1-phosphodiester alpha-N- acetylglucosaminidase-like exopolysaccharide biosynthesis protein n=2 Tax=Conexibacter woesei DSM 14684 RepID=UPI0001C30FBA Length = 249 Score = 96.6 bits (239), Expect = 7e-19, Method: Composition-based stats. Identities = 27/189 (14%), Positives = 54/189 (28%), Gaps = 31/189 (16%) Query: 88 AMNGGIYDES-YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAF 146 A+ G + + PLG G V +A+ G V D + Sbjct: 57 AIVAGFFVRDPHLPLGEVRVGGVPVVHEPVAAPW------AGRRACVHVDGEIRIAPREE 110 Query: 147 KTSKEIQFAVQSGPMLMENGVINPRIHPNVAS---------------SKIRNGVGINKHG 191 VQ+GP+L+ +G + R +G+++ Sbjct: 111 LADVGGGDLVQAGPLLVRDGTAAIVDGEDREGFSAGASQFDSDITAERHPRCALGVSED- 169 Query: 192 NAVFLLSQQATN-------FYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPF 244 + + + + A + + + LDG S + G + + Y Sbjct: 170 ELLAVCCDGRRSGVDAGLDLAELARLMVS-FGAREAINLDGGGSATLVHRGHLLNRPYAD 228 Query: 245 VTMISVERK 253 + E + Sbjct: 229 RDQPAPESR 237 >UniRef50_A8W171 Flagellar protein FliS n=1 Tax=Bacillus selenitireducens MLS10 RepID=A8W171_9BACI Length = 750 Score = 96.6 bits (239), Expect = 8e-19, Method: Composition-based stats. Identities = 34/238 (14%), Positives = 69/238 (28%), Gaps = 32/238 (13%) Query: 26 LPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMY--------WQKANGEAWGTLHALLA 77 P+ A D + +V +N ++ E Sbjct: 152 QPVIAPFELDITAKTSSGSVNIDRINTSRGAGEVILYTPGQRRTATPTNEFGREFTVTDT 211 Query: 78 DINSQGQVQMA--MNGGIYDE---SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFY 132 Q+ + G I + +G IR Sbjct: 212 SKRINDQLSFGDSVRGSITNMKEYGRRNASPIPGDGFIISGHGNRLDGLLDGIRA----- 266 Query: 133 VAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVAS---SKIRNGVGINK 189 V++D K+ + + +GP+L++NG ++ + + ++ R+G+GI+ Sbjct: 267 ---GDDIEVKVDIEDRWKDAEMIMATGPLLVQNGRVDITMSSSASTYSVPNPRSGIGIDA 323 Query: 190 HGNAVFLLSQQA-------TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQ 240 GN +F+ FA Y + + + LDG S + + Sbjct: 324 QGNTMFVTVDGRQSGYSQGMTIPQFANYMRDQ-GAVMAINLDGGGSTTMVARDFSRDR 380 >UniRef50_A4J956 Copper amine oxidase domain protein n=1 Tax=Desulfotomaculum reducens MI-1 RepID=A4J956_DESRM Length = 480 Score = 96.3 bits (238), Expect = 8e-19, Method: Composition-based stats. Identities = 25/144 (17%), Positives = 43/144 (29%), Gaps = 12/144 (8%) Query: 111 KVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENG---- 166 A + G + VA V + + + I+ + PML+E G Sbjct: 200 VKAPAGGYVLAGWGSSAGQLVGVAEGTKARVITEMPEDWQNIRHVLTGSPMLVEGGLPVD 259 Query: 167 -VINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQAT------NFYDFACYAKAKLNVEQ 219 +N + +V R +G+ G + ++ + A L Q Sbjct: 260 QAVNEGLWGSVLKYSPRTALGVTAQGKVLLVVVDGRQESSAGLTLEEMAYLMID-LGAVQ 318 Query: 220 LLYLDGTISHMYMKGGAIPWQRYP 243 + LDG S G I Sbjct: 319 AVGLDGGGSSEMWVKGKIVNNPSD 342 Score = 60.4 bits (145), Expect = 6e-08, Method: Composition-based stats. Identities = 16/105 (15%), Positives = 31/105 (29%), Gaps = 8/105 (7%) Query: 11 MITLNLKRIFLALTLLPLFAVAADDCALS-----DPTLTVQAYTVNPQTERVKMYWQKAN 65 M+ + A + + V+P + ++ Sbjct: 10 MVLCLSMLWAVPAWAADQLAKGVQYRSFERNNWEGKPIKGHILEVDPGVKYTEIRPVM-- 67 Query: 66 GEAWGTLHALLADINSQGQVQMAMNGGIYDES-YAPLGLYIENGQ 109 G L+ + + A+NGG +D PLG I +G+ Sbjct: 68 GNEVFGQRENLSKMAQRTGAIAAVNGGFFDMGSGVPLGNLIIDGK 112 >UniRef50_B1X2V5 Putative uncharacterized protein n=2 Tax=Cyanothece RepID=B1X2V5_CYAA5 Length = 309 Score = 96.3 bits (238), Expect = 8e-19, Method: Composition-based stats. Identities = 32/241 (13%), Positives = 71/241 (29%), Gaps = 42/241 (17%) Query: 41 PTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE-SYA 99 PT V + P K+ + D + + +NGG +D + Sbjct: 63 PTAKVHTLLI-PSESNFKIDVA------VSETLKTVEDFAQETEAIAVLNGGFFDPVNSQ 115 Query: 100 PLGLYIENGQQKVALNLASGE----------GNFFIRPGGVFYVAGD---KVGIVRLDAF 146 I+ G+ + R Y + + Sbjct: 116 TTSYVIKEGEAIADPSNNPRLMDNPQLEPYLKQILNRSEFRRYQCNELTRYAITYHQEPV 175 Query: 147 KTSKEIQFAVQSGPMLM-------------ENGVINPRIHPNVASSKIRNGVGINKHGNA 193 + ++ ++ GP L+ NG + R + + R + I G+ Sbjct: 176 PENCQLTESIGGGPQLLPNLSAEEEAFFESVNGQV-TRDPLGLERANARTAIAITSSGDV 234 Query: 194 VFLLSQQ-----ATNFYDFACYAKAKLNVEQLLYLDGTISHM-YMKGGAIPWQRYPFVTM 247 ++++ +Q + + L+V + LDG S + +G + ++ + Sbjct: 235 LWIMVEQTSPSTGLSLLKLREFL-ESLDVTSAMNLDGGSSSSFFYQGDSFYGKQSTDGNV 293 Query: 248 I 248 I Sbjct: 294 I 294 >UniRef50_C1A670 Putative uncharacterized protein n=1 Tax=Gemmatimonas aurantiaca T-27 RepID=C1A670_GEMAT Length = 426 Score = 95.9 bits (237), Expect = 1e-18, Method: Composition-based stats. Identities = 20/176 (11%), Positives = 46/176 (26%), Gaps = 28/176 (15%) Query: 99 APLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQS 158 G +G V + R V +V + + + + + Sbjct: 255 RSGGAIPRDGALLVGTGDRAAGVAAMSRFDTV------RVHLNTWPRLTSQRAPKAVIGG 308 Query: 159 GPMLMENGVINP--------RIHPNVASSKIRNGVGINKHGNA-VFLLSQQA------TN 203 P+++++G I N + R + +++ G + Sbjct: 309 WPLVLQDGENVAARAATLEGTISRNAEARHPRTAIAVSRSGQTAWLVTVDGRATNSVGMT 368 Query: 204 FYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQR------YPFVTMISVERK 253 + A + + L L DG S + G + + V + Sbjct: 369 LVELAEFLR-TLGAWHALNFDGGGSTTMVIDGRVVNVPTDAAGEREVGNALIVRER 423 Score = 46.2 bits (108), Expect = 0.001, Method: Composition-based stats. Identities = 21/175 (12%), Positives = 44/175 (25%), Gaps = 12/175 (6%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQ-- 82 L P ++ Q + + +A G + + + Sbjct: 61 LAPGLRHEVRLDPRGPWRF--HIVEIDLQNPALSLDVVRAKDALQG--RERTSLMAKRVQ 116 Query: 83 ---GQVQMAMNGGIYD-ESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKV 138 +V++A+N +D + + G+ L L + + G + D Sbjct: 117 SDTSRVRVAVNADFFDLATGENENNQVLGGEWWKGLPLTDSPYDTYDNVHGQVAI--DPA 174 Query: 139 GIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNA 193 G + F + P+L N R + G Sbjct: 175 GGLTFGRFILDARAWTRTSAVPVLSINSKPKGRYESTALYTARYGATAPTDTGRV 229 >UniRef50_C1YVW0 Putative uncharacterized protein n=1 Tax=Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 RepID=C1YVW0_NOCDA Length = 730 Score = 95.9 bits (237), Expect = 1e-18, Method: Composition-based stats. Identities = 32/250 (12%), Positives = 62/250 (24%), Gaps = 23/250 (9%) Query: 2 AHQLLIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYW 61 A Q +G GM L + A ++ A + + + + Sbjct: 106 ATQAPLGAGMSDGRLLTSPDPGFANAVVIDAGGRGSVRQVAFEGTA---SLPSGDLDIDA 162 Query: 62 QKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGE- 120 + L +D + + G A + G + Sbjct: 163 LNTSAVPADGLGLYTSDWGGHPRAHVVYEPGTSPGDTAVAEAVVSEGVVERVSVTPGSGP 222 Query: 121 --------GNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRI 172 + ++ V E + V +L+ +G P Sbjct: 223 IEEDEQVLVARGSAAERIADLSEGDPVEVEHTLTAEGAEPRVVVGGRHVLVRDGEPVPVE 282 Query: 173 HPNVASSKIRNGVGINKHGNAVFLLS-QQAT------NFYDFACYAKAKLNVEQLLYLDG 225 S R +G ++ G + +++ + A A EQ L LDG Sbjct: 283 DV---SRAPRTAIGFSEDGEVMHVVTADGRNRGHAGSTLAEVAELLAAS-GAEQALELDG 338 Query: 226 TISHMYMKGG 235 S + Sbjct: 339 GGSSTLLVRE 348 >UniRef50_D2RLV8 N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase-like exopolysaccharide biosynthesis protein n=1 Tax=Acidaminococcus fermentans DSM 20731 RepID=D2RLV8_ACIFE Length = 477 Score = 95.9 bits (237), Expect = 1e-18, Method: Composition-based stats. Identities = 33/223 (14%), Positives = 65/223 (29%), Gaps = 32/223 (14%) Query: 60 YWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIE-NGQQKVALNLAS 118 + +G+ + + + ++ + +G G + G + + Sbjct: 258 TVTRPDGKTFAIGG--VDRMRLANELILFNDGYDDTTDTNAYGTEVRLAGGVVREIRKGA 315 Query: 119 GE-----GNFFIRPGGVF------YVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGV 167 G G + G GD V + + + +GP L+ +G Sbjct: 316 GSMALTPGTTVLSGNGAAAAFLNGLRTGDPVKVTQTLGNAAADSAPSVGSAGPQLVRDGR 375 Query: 168 INPRIHPN------VASSKIRNGVGINKHGNAVFLLSQQA------TNFYDFACYAKAKL 215 + R GVGI K G + +++ +F Y +L Sbjct: 376 VQVTSEEEEIADDIALGRAPRTGVGIKKDGTVLVVVADGRSDDSVGMTLTEFGRYFV-QL 434 Query: 216 NVEQLLYLDGTISHMYMKGGAIPWQR-----YPFVTMISVERK 253 ++ + DG S + G I P + V RK Sbjct: 435 GADRAMNFDGGGSSEMVVNGKIMNDPSDGTERPVRVALGVFRK 477 Score = 44.6 bits (104), Expect = 0.003, Method: Composition-based stats. Identities = 13/131 (9%), Positives = 29/131 (22%), Gaps = 8/131 (6%) Query: 28 LFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQM 87 L + T+ P ++ + G+ L+ I + Sbjct: 149 GLTYRESTVNLGAGRVKTYLLTLAPS-SDFRLDFIPGYGKTIQ--RGTLSMIQKRSGAVA 205 Query: 88 AMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFK 147 +N +D +G + + I G + + Sbjct: 206 LVNASYFDSDIWVVGNLKIQDKWLGMESTP--RTGLVIPRTGAPSIQPG---LSYSGTVT 260 Query: 148 TSKEIQFAVQS 158 FA+ Sbjct: 261 RPDGKTFAIGG 271 >UniRef50_UPI0001744904 hypothetical protein VspiD_09360 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001744904 Length = 258 Score = 95.5 bits (236), Expect = 1e-18, Method: Composition-based stats. Identities = 39/246 (15%), Positives = 74/246 (30%), Gaps = 26/246 (10%) Query: 1 MAHQLLIGKGMITLNLKRIF-LALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKM 59 +A LL+ G+ + ++L + D + +Q T + +V++ Sbjct: 5 LALILLLATGLPGGGRLPAQDVVVSLGHGVVWSRKDISAPLEGW-LQVVTFDASKVKVEV 63 Query: 60 YWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE-SYAPLGLYIENGQQKVALNLAS 118 + + E +H + + + NGG +D ++AP GL + G Sbjct: 64 L-ARQDRETALPMHRWMTEA----RAIAGCNGGYFDPATFAPSGLQVVEGLATGKYQQFG 118 Query: 119 GEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSK--EIQFAVQSGPMLMENGVINPRIHPNV 176 G G F V K I E + VQ P+L++ + Sbjct: 119 EWG-------GGFGVRSGKAQIWTEQEILAMPTFEAESFVQCSPVLVDG--VRRFTGAGE 169 Query: 177 ASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAK------LNVEQLLYLDGTISHM 230 R + + ++ + A + V + L LDG S Sbjct: 170 DVRARRTFIAHDGGARWALGVTSG-IGLRELAELLVNQGAGLLGFKVSRALNLDGGPSTG 228 Query: 231 YMKGGA 236 Sbjct: 229 LWGRDE 234 >UniRef50_Q7U4D6 Putative uncharacterized protein n=11 Tax=Cyanobacteria RepID=Q7U4D6_SYNPX Length = 589 Score = 95.5 bits (236), Expect = 2e-18, Method: Composition-based stats. Identities = 43/277 (15%), Positives = 72/277 (25%), Gaps = 38/277 (13%) Query: 8 GKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGE 67 G + + + + F L Q + P R + W + Sbjct: 306 GLRFLPQLSQPANAVIAINGGFFNRILQLPLGALRQQGQWLSG-PILNRGVVAWGDNDQL 364 Query: 68 AWGTLH--ALLADINSQGQVQMAMNGGI-----------YDESYAPLG-----LYIENGQ 109 +G L L + + +N G + Y PL L ++ G+ Sbjct: 365 QFGRLRLDQQLQVNGGRRRGLSYLNSGYVQRGLSRYTRAWGPIYRPLSGEEEALLVQGGR 424 Query: 110 QKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQ---------FAVQSGP 160 + AS I G VA + + + GP Sbjct: 425 VTQRFDRASIRRGVLIPADGDLVVARGGTPLPAKPGDAVMLSQRTTSGLGDQANVLGGGP 484 Query: 161 MLMENGVINPRIHPNVASS------KIRNGVGINKHGNAVF---LLSQQATNFYDFACYA 211 +LM+ G I S R VG G + + + A A Sbjct: 485 LLMQGGQIVLNGRAEGFSPDFLALAAPRTVVGQGTGGTWLLALRGAAGSDPTLLETA-LA 543 Query: 212 KAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMI 248 +L ++ L LDG S + G + Sbjct: 544 AQQLGLKDALNLDGGSSTTVVVAGRTVMNGRGSAPRV 580 >UniRef50_A3DIP4 Exopolysaccharide biosynthesis protein n=3 Tax=Clostridium thermocellum RepID=A3DIP4_CLOTH Length = 382 Score = 95.5 bits (236), Expect = 2e-18, Method: Composition-based stats. Identities = 23/186 (12%), Positives = 46/186 (24%), Gaps = 32/186 (17%) Query: 94 YDESYA----PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTS 149 + + +EN + + I G+ + + Sbjct: 196 FGPTNRADKLHTSYIVENNRVARKFRGDTECK---IPSDGMVITFYEPISSEEKFEVGDW 252 Query: 150 KEIQ---------FAVQSGPMLMENGVINPRIHPN----VASSKIRNGVGINKHGNAVFL 196 I A + G L+ +G + + + + R +G+ G V + Sbjct: 253 IGIDIDPDFGPGFQAYECGSWLVRDGQVVAVDRDDWVGLLTNRDPRTAIGVKHDGKVVLV 312 Query: 197 LSQQATNFY-------DFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPW----QRYPFV 245 Y + A Y L ++ LDG S + + Sbjct: 313 TVDGRQPGYSVGLSSRELAGYLL-TLGIKDAAMLDGGASTQMIVQNKTVNRLPARERMLG 371 Query: 246 TMISVE 251 I V Sbjct: 372 GGIVVV 377 Score = 43.9 bits (102), Expect = 0.005, Method: Composition-based stats. Identities = 15/116 (12%), Positives = 35/116 (30%), Gaps = 3/116 (2%) Query: 37 ALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE 96 ++ + ++ + VK+ A +G L DI A+N G + Sbjct: 68 EINGIKQKINILEIDLSSGGVKIKPALAFDTIYGF--QSLKDIAINNNAYAAVNAGFFYS 125 Query: 97 SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEI 152 P G+ + +G+ + + I+ + + +I Sbjct: 126 YGEPSGMVVIDGKVYTK-STGKYPVFVVQGKNAFLSEIKSNIWILHGNRRIAADDI 180 >UniRef50_D1R528 Putative uncharacterized protein n=1 Tax=Parachlamydia acanthamoebae str. Hall's coccus RepID=D1R528_9CHLA Length = 380 Score = 95.5 bits (236), Expect = 2e-18, Method: Composition-based stats. Identities = 21/129 (16%), Positives = 37/129 (28%), Gaps = 23/129 (17%) Query: 147 KTSKEIQFAVQSGPMLMENGVINPRIHPNVASSK------IRNGVGINKHGNAVFLLSQQ 200 + +I V P+L+ G + S R VGI ++GN +F++ Sbjct: 251 EEWSDIVHIVGGTPILVRGGRLVTDFSAEQTGSHFLNVRLARTAVGILENGNWLFVVVDG 310 Query: 201 ---------ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPW-------QRYPF 244 D A KL + L L G + + + Sbjct: 311 FYKNIWNTKGITIPDLAELM-QKLGCVEALNLCGGKCSTMVLKNVVVNDPPDGTRKGRKV 369 Query: 245 VTMISVERK 253 + + K Sbjct: 370 SDALVIIPK 378 Score = 58.1 bits (139), Expect = 3e-07, Method: Composition-based stats. Identities = 15/141 (10%), Positives = 37/141 (26%), Gaps = 14/141 (9%) Query: 15 NLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHA 74 + R +A+ L + + + +V VNP + Sbjct: 29 AIVRTAIAVELPEGISYSHIFLS---DQTSVHVLEVNPHFFDI-------IPVKNKGDVE 78 Query: 75 LLADINSQGQVQMAMNGGIYDESYA----PLGLYIENGQQKVALNLASGEGNFFIRPGGV 130 ++ + + + A+NGG + P+G+ + + G + V Sbjct: 79 AVSSMAKRHKAIAAVNGGFFKMKGEFADLPMGILKIDNHWYGTPHKPRGAIGWSHADEKV 138 Query: 131 FYVAGDKVGIVRLDAFKTSKE 151 + + Sbjct: 139 LFDQILTQIYGYSGNAVIPID 159 >UniRef50_C7LNU2 Putative uncharacterized protein n=1 Tax=Desulfomicrobium baculatum DSM 4028 RepID=C7LNU2_DESBD Length = 276 Score = 95.1 bits (235), Expect = 2e-18, Method: Composition-based stats. Identities = 25/245 (10%), Positives = 68/245 (27%), Gaps = 30/245 (12%) Query: 9 KGMITLNLKRIFLALT-------LLPLFAVAA-----DDCALSDPTLTVQAYTVNPQTER 56 + + T + +A L P + L + ++ Sbjct: 7 RAVFTCLVLCAPVASLHAEEWRLLAPGLELREFLIPDQVGDLEGRQSGMAVLRIDSDRFD 66 Query: 57 VKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDES--YAPLGLYIENGQQKVAL 114 V + G ++ +N G++ G + + Sbjct: 67 VALGSALGTGR-MRSMQEW----ARHSGFVAVINAGMFRADDRMRSTGYMRDAAVMINSF 121 Query: 115 NLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHP 174 + +P + +R K+ + Q + +++N + R Sbjct: 122 IHPNYGAFLAFQP------RDPSLPALRWVDRKSDPDWQAVLADYDGIIQNYRLISRERE 175 Query: 175 NVASSKIR----NGVGINKHGNAVFLLSQQATNFYDFACYAKA-KLNVEQLLYLDGTISH 229 N+ R + +++ G +F+ + + ++FA L++ +Y++G Sbjct: 176 NLWEPSDRRHSGAAIAMDREGRLLFIHCRARLSLHEFAQALIDLPLDLIGAMYVEGGADA 235 Query: 230 MYMKG 234 Sbjct: 236 AMYVD 240 >UniRef50_Q3AA51 Conserved domain protein n=1 Tax=Carboxydothermus hydrogenoformans Z-2901 RepID=Q3AA51_CARHZ Length = 356 Score = 94.7 bits (234), Expect = 3e-18, Method: Composition-based stats. Identities = 21/154 (13%), Positives = 42/154 (27%), Gaps = 10/154 (6%) Query: 105 IENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLME 164 I G+ + P + + +++GP L++ Sbjct: 202 IVYGKTSIPPQGFVLNTGSLCPPDNLLNSNVTLKIEPENQENVLWSKAYAVLEAGPYLVK 261 Query: 165 NGVINPRIHPNV-------ASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNV 217 G I S R+G+G+ K+ + + ++A KL Sbjct: 262 EGKIIADPLKENFTHYKIKDGSFARSGIGVTKNKKLLLVTV-NRATIKEWAIIM-QKLGA 319 Query: 218 EQLLYLDGTISHMYMKGGA-IPWQRYPFVTMISV 250 + LDG S G + + + Sbjct: 320 YYAMNLDGGASSGLYVNGKYLTKPGRLLSNALVI 353 Score = 50.8 bits (120), Expect = 4e-05, Method: Composition-based stats. Identities = 17/126 (13%), Positives = 42/126 (33%), Gaps = 5/126 (3%) Query: 33 ADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGG 92 +++ TV+ V+ T+++KM A + L + + + + +NG Sbjct: 33 EKKLKINNKNFTVKGVIVDLNTKKLKMQTVLAKNQI--GQVESLESMVKRKKGLIGINGA 90 Query: 93 IY---DESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTS 149 + D P G + +G+ N +F + ++ + Sbjct: 91 FFSAYDAYKEPYGNLMIDGRLIRKGNGERCSVGILPGNEVIFGYVKWDISVIFSTYSEEG 150 Query: 150 KEIQFA 155 E+ + Sbjct: 151 TELPYI 156 >UniRef50_C8X0Z8 Putative uncharacterized protein n=1 Tax=Desulfohalobium retbaense DSM 5692 RepID=C8X0Z8_DESRD Length = 302 Score = 94.7 bits (234), Expect = 3e-18, Method: Composition-based stats. Identities = 30/229 (13%), Positives = 71/229 (31%), Gaps = 26/229 (11%) Query: 44 TVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGL 103 + ++P+ R +Y A A T+ + + A+N +Y E Sbjct: 63 ELTVLRIDPEFFRFVLYSASAERGADRTVRQWVE----DKNLVAAINASMYWEDRETSTG 118 Query: 104 YIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFA--VQSGPM 161 + N + G FF+ + + L+ + Q+A +Q+ + Sbjct: 119 LMTNFGHVNNGRVHPEFGAFFVANPRRAQLPPVDILDRSLEQQWRKRVAQYATIIQNYRL 178 Query: 162 LMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKA-KLNVEQL 220 L G + V + G+ +F+L + + + + L++ Sbjct: 179 LDAKGE---NVWQASRQEHSSAAVAEDSQGHILFILQHEPVSVHALGSRLENLSLDLSTA 235 Query: 221 LYLDGTISHMYMKG---------GAIPWQ-------RYPFVTMISVERK 253 ++++G + G + P +I + R+ Sbjct: 236 MFVEGGVEASLAVRCPELEAFWNGRYDSRLLPAPVASRPVPNIIGITRR 284 >UniRef50_UPI0001C3370C hypothetical protein UCYN_10670 n=1 Tax=cyanobacterium UCYN-A RepID=UPI0001C3370C Length = 438 Score = 94.7 bits (234), Expect = 3e-18, Method: Composition-based stats. Identities = 24/136 (17%), Positives = 42/136 (30%), Gaps = 16/136 (11%) Query: 131 FYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHP------NVASSKIRNG 184 + G + I K ++ + GP+L+ +G I+ + R+ Sbjct: 301 LFFIGSTLKIESKTVPKKFNQLSHILGGGPLLINDGSISLNVKDEKFTKSFQKQKASRSA 360 Query: 185 VGINKHGNAVFLLSQQ-----ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPW 239 +GI + + N + A KL L LDG S + GG + Sbjct: 361 IGITNKDKTILVTVHNSINSNGVNLNEMAQIM-QKLGSINALNLDGGGSTSLVLGGRLID 419 Query: 240 QRYPFV----TMISVE 251 + I V Sbjct: 420 RFPVTAAKIHNGIGVF 435 Score = 75.5 bits (184), Expect = 2e-12, Method: Composition-based stats. Identities = 21/142 (14%), Positives = 45/142 (31%), Gaps = 8/142 (5%) Query: 23 LTLLPLFAVAADDCALSDPT----LTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLAD 78 + +P + + V ++ ++ +V + + + L + Sbjct: 97 IVWVPGVIWRQQFITVKNKKGYNIFPVNLLEIDNKSSKVILRPIT-SNLNGQIGTSSLEE 155 Query: 79 INSQGQVQMAMNGGIYDESYA-PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDK 137 I + +V A+NGG ++ + PLG N + L G G G F++ Sbjct: 156 IAKKWRVVAAINGGFFNRNNRLPLGAIRHNNDWLSSPIL--GRGAVGWNENGKFFIDHLS 213 Query: 138 VGIVRLDAFKTSKEIQFAVQSG 159 + + IQ Sbjct: 214 LKEFLILNNGERISIQSLNSGY 235 >UniRef50_D1VRM0 Putative copper amine oxidase N-domain family n=1 Tax=Peptoniphilus lacrimalis 315-B RepID=D1VRM0_9FIRM Length = 361 Score = 94.3 bits (233), Expect = 3e-18, Method: Composition-based stats. Identities = 30/225 (13%), Positives = 64/225 (28%), Gaps = 20/225 (8%) Query: 39 SDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESY 98 + V+P + + + + ++ ++ Y+ Sbjct: 146 EFNVFDLWYVNVDPTDTAGIFKFTNKYNKELNLKNGRVIEVVKDTVTKI------YEPKG 199 Query: 99 APL----GLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQF 154 G I G+ V + + V + + ++ Sbjct: 200 KISLPKDGYIIYFGKDSVDKSYIDQRFKLGRKVELVLVDSKGNETFKYNGQDISYSKVTE 259 Query: 155 AVQSGPMLMENGVINPRIHPNVA-------SSKIRNGVGINKHGNAVFLLSQQATNFYDF 207 V +GPML++NG N ++ R+ +GI K+G + L + N Sbjct: 260 LVAAGPMLLQNGKNVVAESKNNYKEGKINSATGQRSAIGITKNGKVILLTA--VANVDKL 317 Query: 208 ACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISVER 252 A L + LDG S G + + + + + Sbjct: 318 ALIMND-LGCIDAMNLDGGASSALFANGKVIKNAGRNLNTVLIFK 361 >UniRef50_UPI00017896CA metallophosphoesterase n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI00017896CA Length = 2050 Score = 94.3 bits (233), Expect = 3e-18, Method: Composition-based stats. Identities = 26/178 (14%), Positives = 52/178 (29%), Gaps = 34/178 (19%) Query: 95 DESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQF 154 D+ +P+G GQ ++ + + ++ +++ Sbjct: 253 DQGNSPIG----QGQVVLSASGSQRSKLAGLKA--------GDEVTAGFQLDNEWQDVTM 300 Query: 155 AVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQAT-------NFYDF 207 A+ ML+++GV+ P + R VG G+ V N+ + Sbjct: 301 AIGGTVMLVKDGVVQQHTDP---AVHPRTVVGTKADGSVVLFEVDGRQPGFSEGLNYIEL 357 Query: 208 ACYAKAKLNVEQLLYLDGTISHMYMK-------GGAIPWQRYP----FVTMISVERKG 254 +L V L LDG S ++ + I + K Sbjct: 358 GE-MLQELGVVNALNLDGGGSATFVARLPGETERKVLNSPSDGGERKTANGILLVNKA 414 Score = 64.7 bits (156), Expect = 3e-09, Method: Composition-based stats. Identities = 22/112 (19%), Positives = 43/112 (38%), Gaps = 10/112 (8%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADIN---- 80 + P + V +P +++ +G+ +G ++ + Sbjct: 68 IGPGATYTWANMQKGSGEQKVHMVEFDPSQGNLELQPGLTDGKVYGMQG--VSKMASDAD 125 Query: 81 -SQGQVQMAMNGGIYDES-YAPLGLYIENGQQKVALNLASGEGNFFIRPGGV 130 + +V A+NG YD S PLGL++ +G+ + SG F I+ G Sbjct: 126 KAGNRVIAAVNGDFYDMSTGIPLGLFMGDGELL--TDPPSGRNAFGIKQDGT 175 >UniRef50_D2ASL7 Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase-like protein n=2 Tax=Actinomycetales RepID=D2ASL7_STRRD Length = 1138 Score = 94.3 bits (233), Expect = 3e-18, Method: Composition-based stats. Identities = 37/251 (14%), Positives = 63/251 (25%), Gaps = 42/251 (16%) Query: 8 GKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGE 67 G G+ +L + +A + A + G Sbjct: 144 GIGIHNGDLVQAPVAGHNNAVAVTADGVGRVLQMHFDGT---------------ATPAGG 188 Query: 68 AWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGL-----YIENGQQKVALNLASGEGN 122 + TL I G G Y A G + G + ++G G Sbjct: 189 SPITLTQFNQLIQGNGVGLFTPLWGSYGRGRAVEGAAAVTEVVLEGGVVTEVRTSAGSGP 248 Query: 123 FFIRPGGVF-----------YVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPR 171 + GD+V + ++ AV +L+++GV Sbjct: 249 IPAGTAILLGRDAGASALAALKPGDRVEVRYQPKPSEGGAVKAAVGGSQILVKDGVAQTS 308 Query: 172 IHPNVASSKIRNGVGINKHGN-AVFLLSQQATN------FYDFACYAKAKLNVEQLLYLD 224 ++ R VG + G L + A+L L LD Sbjct: 309 AD---NTAHPRTAVGFSADGRKMYLLTVDGRQTDSRGVTLTELGA-MMAELGAHDALNLD 364 Query: 225 GTISHMYMKGG 235 G S + Sbjct: 365 GGGSSTMLARE 375 Score = 44.3 bits (103), Expect = 0.004, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 51/183 (27%), Gaps = 20/183 (10%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQ-TERVKMYWQKANGEAWGTLHALLADINSQG 83 + P +++ D + L A + V + + L+ + Sbjct: 69 VAPGITLSSFDRYDTAGWLRADAIAADLTAGASVDYVYSGEVSKT-----EPLSGPAKRS 123 Query: 84 QVQMAMNGGIYDES--YAPLGLYIENGQQKVAL--NLASGEGNFFIRPGGVFYVAGDKVG 139 + A+NG +D + A G+ I NG A + G V + D Sbjct: 124 RAVAAVNGDFFDINSSGAAQGIGIHNGDLVQAPVAGHNNAVAVTADGVGRVLQMHFDGTA 183 Query: 140 IVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQ 199 + T + +Q + G+ P + + + ++ + Sbjct: 184 TPAGGSPITLTQFNQLIQGNGV----GLFTPLWGSYGRGRAVEGAAAVTE------VVLE 233 Query: 200 QAT 202 Sbjct: 234 GGV 236 >UniRef50_UPI0001C1628D hypothetical protein CRC_02750 n=2 Tax=Nostocaceae RepID=UPI0001C1628D Length = 633 Score = 93.6 bits (231), Expect = 5e-18, Method: Composition-based stats. Identities = 32/254 (12%), Positives = 65/254 (25%), Gaps = 32/254 (12%) Query: 24 TLLPLFAVAADDCALSDPTLTVQA----YTVNPQTERVKMYWQKANGEAWGTLHALLADI 79 + ++ V Y VN + + G L D+ Sbjct: 371 IAPLAWGGYPAPVDPTNFNFNVDIEKREYKVN--NTELILIGGGIPGTFHADSRYQLPDM 428 Query: 80 NSQGQVQMAMNGGIYD----ESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAG 135 QV A++GG + +S +G + + + N +R + + Sbjct: 429 LKDTQVVAAVDGGFFSLKYLDSNTMIGPVLSGNR---GFIPGNASENLKLRDRPLVLINP 485 Query: 136 DKVGIVR------------LDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNV----ASS 179 V + +K + L+ N ++ Sbjct: 486 HSVSFIPFVPETHNTLEGIQATSPENKGVTDTFVGAAWLVRNNTPRTAADFGNLYDYDAA 545 Query: 180 KIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGA--I 237 + R GIN G V +++ +L + LD S G + Sbjct: 546 RHRAFWGINLAGMPVIGVTKTPVGSVSLGEILY-QLGFRDAVMLDSGASTSLSYRGKSLV 604 Query: 238 PWQRYPFVTMISVE 251 + P + + Sbjct: 605 AYTPRPVPHAVVLV 618 >UniRef50_Q01TI8 Putative uncharacterized protein n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q01TI8_SOLUE Length = 340 Score = 93.6 bits (231), Expect = 5e-18, Method: Composition-based stats. Identities = 32/250 (12%), Positives = 63/250 (25%), Gaps = 46/250 (18%) Query: 44 TVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAP--- 100 T+ +N + + G T+ D +Q Q+A+NG + + Sbjct: 51 TMHIAEINLNAPGIGVKLTSP-GGTLETVRQTTLDYLNQEHAQLAINGEFFLPFPSSDFN 109 Query: 101 ---LGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAF-------KTSK 150 +GL NG + + + IV + + Sbjct: 110 SMLIGLAASNGNVYSSFEAPVQSYAIVTDAPALNIDQSNHASIVHDNTSFVDGKHVLENV 169 Query: 151 EIQFAVQSGPMLMENGVI----------------------NPRIHPNVASSKIRNGVGIN 188 + + ++ NGV + R +G++ Sbjct: 170 TLWNTIAGSAQIITNGVASIPTYLDATHPNGLLTPGGPASYSNSNSWYNLINARTVIGLS 229 Query: 189 KHGNAVFL-LSQQ-----ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRY 242 + +FL + A ++ L LDG S A+ Sbjct: 230 QDNQTLFLFTVDNAGGSRGMTLPEVANLLIGDYSIYNALNLDGGGSTSM----AMQDPVT 285 Query: 243 PFVTMISVER 252 I+V Sbjct: 286 GMGRFINVSS 295 >UniRef50_B8HP94 Polysaccharide deacetylase n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HP94_CYAP4 Length = 645 Score = 93.2 bits (230), Expect = 8e-18, Method: Composition-based stats. Identities = 29/213 (13%), Positives = 63/213 (29%), Gaps = 18/213 (8%) Query: 57 VKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYD----ESYAPLGLYIE--NGQQ 110 + + + +I ++ Q ++GG + +S +G + NGQ Sbjct: 421 LILISGGRPITIHADSRYQVPEILAKTQAVAGVDGGFFSLEFLDSNVMIGPVLSQKNGQF 480 Query: 111 K----VALNLASGEGNFFIRPGGVFYVAGD-KVGIVRLDAFKTSKEIQFAVQSGPMLMEN 165 +G I P GV ++ D A + L++ Sbjct: 481 VPGNASENPRLNGRPLVLISPTGVRFIPFDASKHNSLEGIQAEDPGATDAFVAAAWLVKQ 540 Query: 166 GVINPRIHP----NVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLL 221 P + +++ R GIN++G +S + + + Sbjct: 541 NQPQPEQSFGNLFDFNAARHRAFWGINQNGQPTIGVSTEPVGSVELGEILYKA-GFRDAV 599 Query: 222 YLDGTISHMYMKGGA--IPWQRYPFVTMISVER 252 LD S G + + P ++++ Sbjct: 600 MLDSGASTSLAYQGESLVGYTPRPVPHVVALVP 632 >UniRef50_B9XE16 Putative uncharacterized protein n=1 Tax=bacterium Ellin514 RepID=B9XE16_9BACT Length = 398 Score = 92.8 bits (229), Expect = 9e-18, Method: Composition-based stats. Identities = 18/163 (11%), Positives = 47/163 (28%), Gaps = 27/163 (16%) Query: 107 NGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENG 166 + ++++ I+PG + + + + + AV P+L+ +G Sbjct: 230 EDKMIISIDPKLASRFAGIQPGTILHFSTGTSRDI--------AKADTAVGGRPLLLVHG 281 Query: 167 VINPRIHPNVAS-----SKIRNGVGINKHGNAVFLLSQQA-------TNFYDFACYAKAK 214 + R +G N ++ + + A + + Sbjct: 282 KELETSKQKGNNAATIVRHPRTALGWNA-RYFFLVVVDGRQKELSMGMSSQELAHFM-ST 339 Query: 215 LNVEQLLYLDGTISHMYMKGGAIPWQR-----YPFVTMISVER 252 L + + LDG S + G + + + + Sbjct: 340 LGCTEAMNLDGGGSTTFWLDGKVVNSPSDRHERSVANALIIMQ 382 Score = 47.0 bits (110), Expect = 6e-04, Method: Composition-based stats. Identities = 21/110 (19%), Positives = 41/110 (37%), Gaps = 10/110 (9%) Query: 19 IFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWG--TLHALL 76 + +L+P A ++ ++ + + + + A G+ G ++ + Sbjct: 29 TPIFSSLVPGLDYA--HITETNHPWSIHVARLERSHKELDLVSTLAQGKIVGLSSVANQV 86 Query: 77 ADI-NSQGQVQMAMNGGIYDE-----SYAPLGLYIENGQQKVALNLASGE 120 G+ +A+NG + PLGL I NG+ A N AS Sbjct: 87 KTFPAGSGKPLVAVNGDFFVIAKGPYQGDPLGLQILNGELVSAPNGASFW 136 >UniRef50_C1ABL2 Putative uncharacterized protein n=1 Tax=Gemmatimonas aurantiaca T-27 RepID=C1ABL2_GEMAT Length = 311 Score = 92.8 bits (229), Expect = 9e-18, Method: Composition-based stats. Identities = 36/239 (15%), Positives = 83/239 (34%), Gaps = 28/239 (11%) Query: 29 FAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMA 88 +A + TV ++P + + + G A + N+ +A Sbjct: 68 WAEWPVQLGARGISTTVIVVDIDPARIALTLEIARD-----GDALAPWSLDNAPKDAVIA 122 Query: 89 MNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKT 148 +N G + + P G + ++ A + F I G + + + Sbjct: 123 LNAGQFTDDG-PWGWVVHRQREWQAPGVGPLSAAFVIDTAGRAAILRAD----EIAEARR 177 Query: 149 SKEIQFAVQSGPMLMENGVINPRIHP----NVASSKIRNGVGINKHGNAVFLLSQQ---- 200 + A+QS P+++ +G + P + ++ IR +G+ G+ + L++ Sbjct: 178 RGGWEEALQSFPLILNDGALPPGLCAPGAVDLEHRDIRLTLGVLPDGHVLLALTRYAGVG 237 Query: 201 --------ATNFYDFACYAKAKLNVEQLLYLDGTISHM-YMKGGAIPWQRYPFVTMISV 250 + A + +L V + + LDG +S ++ G + Q + + Sbjct: 238 SAGNRLPIGPTTGEMATIMR-ELGVARAVMLDGGLSAQLLVRDGPVTTQWHGLRRVPLA 295 >UniRef50_B2S1G8 Hypothetical cytosolic protein n=2 Tax=Borrelia RepID=B2S1G8_BORHD Length = 262 Score = 92.8 bits (229), Expect = 9e-18, Method: Composition-based stats. Identities = 26/226 (11%), Positives = 57/226 (25%), Gaps = 31/226 (13%) Query: 29 FAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGE--AWGTLHALLADINSQGQVQ 86 + + S + + + + + + + + +V Sbjct: 29 VGMQYEIIKSSFKESNYVIVKIKNKNLKFIIPKPIYDQKMNNYYFKGQTTSQFLLSNKVD 88 Query: 87 MAMNGGIYDES---YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRL 143 +A+N Y+ + P GLYI + + A G I+ Sbjct: 89 IAINTSPYEIKENMFYPNGLYIYDKKIISNAKKAQGIII------------IKNNQIILN 136 Query: 144 DAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHG-NAVFLLSQQA- 201 K + L+ NG N R +G +K + + + Sbjct: 137 PKQDEIKNSDYGFSGFFPLITNGNYTKNFKEN---KHPRTIIGTDKENKHLYLITVEGRG 193 Query: 202 ------TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQR 241 + + A + + LDG S + + Sbjct: 194 TNNSKGISLNE-AIDLSLNYAITNSINLDGGGSSTLVT--KVNNLP 236 >UniRef50_B8I064 Exopolysaccharide biosynthesis protein n=1 Tax=Clostridium cellulolyticum H10 RepID=B8I064_CLOCE Length = 383 Score = 92.8 bits (229), Expect = 1e-17, Method: Composition-based stats. Identities = 31/220 (14%), Positives = 66/220 (30%), Gaps = 25/220 (11%) Query: 47 AYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYA----PLG 102 ++ R + ++ E+ G + + + + + Sbjct: 148 VLILDKMGARFETFYSNIFLESKGNRVKINEMNRVGKNDDIILYIDKFGNTNRAEVKSTS 207 Query: 103 LYIENGQQKVALNLASG----EGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQ----- 153 L ++N + + +G + I G DK+G+ D E Sbjct: 208 LIVDNNKIISIIESTKEVNIKKGMYVISFYGDKSSLPDKIGLKTGDKVNIRIEPYLGYNY 267 Query: 154 FAVQSGPMLMENGVINPRIHP----NVASSKIRNGVGINKHGNAVFLLSQQA-------T 202 A + G ML++NG + + R +GI +G V +++ Sbjct: 268 QAYECGSMLVKNGKSVVPERDKWAGTLGNRDPRTVIGIKTNGKIVLVVADGRQPGYSEGM 327 Query: 203 NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRY 242 + + K+ V LDG + + G I + Sbjct: 328 TGKEMGEFLV-KIGVRDAAMLDGGATSQMIINGRIQNRPS 366 Score = 64.3 bits (155), Expect = 4e-09, Method: Composition-based stats. Identities = 16/88 (18%), Positives = 32/88 (36%), Gaps = 2/88 (2%) Query: 29 FAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMA 88 + ++ + +P+ ERV+ + +G L+DI + A Sbjct: 59 VQYKSTTETINGYKQEIYMLEFDPRDERVEFKPALSFDNIFGF--EKLSDICKRNGAYAA 116 Query: 89 MNGGIYDESYAPLGLYIENGQQKVALNL 116 +NGG + + P G+ +GQ Sbjct: 117 VNGGFFYQFGDPAGMVAIDGQMLTTSTG 144 >UniRef50_A7SGX9 Predicted protein (Fragment) n=2 Tax=Nematostella vectensis RepID=A7SGX9_NEMVE Length = 442 Score = 92.8 bits (229), Expect = 1e-17, Method: Composition-based stats. Identities = 26/176 (14%), Positives = 49/176 (27%), Gaps = 21/176 (11%) Query: 43 LTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIY-DESYAPL 101 + V + G ++ + +A+ + Q + +A N G + ++ L Sbjct: 65 VQGHVSVVENPLNTFSILEPGEVGGCGKSVRSSVANSSRQKKCHVASNAGFFKTKNGNCL 124 Query: 102 GLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPM 161 G + NG+ + + NF IR G + V Sbjct: 125 GNIVSNGKLVMDADGVQ-NANFGIRKDGTIVTGY----LSENTVLDQENPFVQLVTGVIW 179 Query: 162 LMENGVINPRIHP---------------NVASSKIRNGVGINKHGNAVFLLSQQAT 202 L+ NG + V R +G + G V + T Sbjct: 180 LVRNGEVYVNASKKAECEDLQESGSVDLFVNVLAARTAIGHDAQGRVVIVQVDGKT 235 >UniRef50_Q1MS76 Putative uncharacterized protein LI0093 n=1 Tax=Lawsonia intracellularis PHE/MN1-00 RepID=Q1MS76_LAWIP Length = 331 Score = 92.8 bits (229), Expect = 1e-17, Method: Composition-based stats. Identities = 33/245 (13%), Positives = 77/245 (31%), Gaps = 26/245 (10%) Query: 27 PLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQ 86 L+ + V +NPQ ++ G+A+ L D ++ ++ Sbjct: 87 GLWLGKFPGVTKAGDVFEVVMLKINPQYYDFSLHMASQTGKAFS-----LQDWSNTYELS 141 Query: 87 MAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFI-RPGGVFYVAGDKVGIVRLDA 145 +N +Y Y+ N ++ G FF+ P D + + Sbjct: 142 AVINASMYLPDGVTSTGYLRNHDHINNAHVGKRLGAFFVASPYNSTLPNADLLDRTSDNW 201 Query: 146 FKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFY 205 + + VQ+ ++ N + S V + G F+ ++ + Sbjct: 202 EILLPQYKIVVQNYRVISANRQCLWSTKKVIHSIA---AVARDGKGYLFFIHTKYPISDL 258 Query: 206 DFACYAKA-KLNVEQLLYLDGTISH----------MYMKGG------AIPWQRYPFVTMI 248 DF + +++ ++Y++G G +I P +I Sbjct: 259 DFGNLLLSLPIDIRIVMYVEGGSQAGLLINTSNFKQLWMGKHPVSILSINNTSVPIPNVI 318 Query: 249 SVERK 253 ++++ Sbjct: 319 GIKKR 323 >UniRef50_A6LP25 Putative uncharacterized protein n=1 Tax=Thermosipho melanesiensis BI429 RepID=A6LP25_THEM4 Length = 534 Score = 92.4 bits (228), Expect = 1e-17, Method: Composition-based stats. Identities = 27/165 (16%), Positives = 46/165 (27%), Gaps = 17/165 (10%) Query: 103 LYIENGQQKV-----ALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQ 157 I++ + + + + I + I+ A+ Sbjct: 365 YQIKDNKIISIGYIENAPEGAMVLSISKKYEKYLSNVTPGTKIDLVLNSDFPFPIKHAIG 424 Query: 158 SGPMLMENGVINPRIHPN--------VASSKIRNGVGINKHGNAVFLLSQQATNF----Y 205 +GP+L+ENG S R + I K G F++ + N Y Sbjct: 425 AGPLLIENGKKLIDSSEEKLRYSNGLALSKTTRTIIAITKEGRVDFIVIEGYNNTGGMNY 484 Query: 206 DFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMISV 250 D A + LDG S + + Q I V Sbjct: 485 DIATDFLISKGYFYAMMLDGGGSGAMVIQNEVVNQDGQIQRGIPV 529 Score = 67.0 bits (162), Expect = 6e-10, Method: Composition-based stats. Identities = 11/91 (12%), Positives = 29/91 (31%), Gaps = 5/91 (5%) Query: 24 TLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQG 83 + ++ + ++P+ V++ ++ L +I Sbjct: 212 IIKNGLIWERKVEKFNNEKYLINYLKIDPKK--VEIIPVISSKGI--GTREDLREILKAN 267 Query: 84 QVQMAMNGGIYDESYA-PLGLYIENGQQKVA 113 +N +D S P+ L I++G+ Sbjct: 268 NCIAGINANYFDPSTNLPIDLIIKDGKILSD 298 >UniRef50_Q2JPV6 Polysaccharide deacetylase family protein n=2 Tax=Synechococcus RepID=Q2JPV6_SYNJB Length = 723 Score = 92.4 bits (228), Expect = 1e-17, Method: Composition-based stats. Identities = 29/227 (12%), Positives = 63/227 (27%), Gaps = 28/227 (12%) Query: 53 QTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIY------DESYAPLGLYI- 105 RV ++ + + Q +NG + S +G + Sbjct: 315 NQVRVLTLRGGRAATVHAERRYEVSTLAQRYQADAGINGSFFSIPWINSASNVMVGPAMA 374 Query: 106 ENGQQKVALNLASGEGNFFIR-----PGGVFYVAGDKVGIVRLDAF-KTSKEIQFAVQSG 159 N + + + + +V D + L+ + ++ +G Sbjct: 375 ANHKTFIPGRPEDDQAIRGRPLVLLGRDRLRFVPFDPDTMTHLENIRQLMPDVTDLFVAG 434 Query: 160 PMLMENG------VINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKA 213 L+++G IN + A + R G++ V +++ N A Sbjct: 435 LWLVKDGHALSPAEINSFRLASAAEFRPRAFFGVDDQERVVIGVTKTHVNAAILASLLPK 494 Query: 214 KLNVEQLLYLDGTISHMYMKGGAI--------PWQRYPFVTMISVER 252 + + + LD S + G I P I + Sbjct: 495 T-GIREAVLLDSGFSTSLVYQGEILATGHAGPNQPSRPVPHAILLYD 540 >UniRef50_A4CSS0 Putative uncharacterized protein n=1 Tax=Synechococcus sp. WH 7805 RepID=A4CSS0_SYNPV Length = 549 Score = 92.4 bits (228), Expect = 1e-17, Method: Composition-based stats. Identities = 36/278 (12%), Positives = 71/278 (25%), Gaps = 38/278 (13%) Query: 8 GKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGE 67 G + + + + F L L + P R + W Sbjct: 266 GLRFLNQLAQPVNSLAAINGGFFNRVRQLPLGAVRLDGVWLSG-PILNRGAVGWDGPGPL 324 Query: 68 AWGTLHALLADINSQGQ--VQMAMNGGI-----------YDESYAPLG-----LYIENGQ 109 + L + G+ +N G + Y L + I +G+ Sbjct: 325 LFDRLRLDQEMRVNGGRRWGLGFLNSGYVQRGLSRYTRAWGPIYRSLSGEELAILIRDGR 384 Query: 110 QKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEI---------QFAVQSGP 160 + + G VA + + + + + + + GP Sbjct: 385 VTDQFSKTELARGVPLPEGASLVVARARAPLPAKPGDEVAIRLKVSSPVGERRQVMAGGP 444 Query: 161 MLMENGVINPR------IHPNVASSKIRNGVGINKHGNAVF---LLSQQATNFYDFACYA 211 +L++ G + R + + R VG + + LS + A Sbjct: 445 LLLKEGQVVLRGRQEGFSSGFLGQAAPRTVVGQDPKHRWMLTLEGLSGSDPTLLET-TLA 503 Query: 212 KAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMIS 249 +L + L LDG S + I Sbjct: 504 LQQLGLSDALNLDGGSSTTMLIANRTVMTGRGVPPRIQ 541 Score = 42.3 bits (98), Expect = 0.016, Method: Composition-based stats. Identities = 24/195 (12%), Positives = 49/195 (25%), Gaps = 29/195 (14%) Query: 14 LNLKRIFLALTLLPLFAVAADDCALSDPTL------TVQAYTVNPQTERVKMYWQKANGE 67 F+A P A + D + ++ Y + + Sbjct: 202 GTRASPFVAARFTPEIQTAIRKGLILDMRVVQVGVKPLRIYRAGLPLGNESLLLRPLAPL 261 Query: 68 AWGTLHALLADINSQGQVQMAMNGGIYDESYA-PLGLYIENGQQKVALNLASGEGNF--- 123 T L + A+NGG ++ PLG +G L G + Sbjct: 262 KAQTGLRFLNQLAQPVNSLAAINGGFFNRVRQLPLGAVRLDGVWLSGPILNRGAVGWDGP 321 Query: 124 -----------------FIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQS--GPMLME 164 R G+ ++ V + I ++ +L+ Sbjct: 322 GPLLFDRLRLDQEMRVNGGRRWGLGFLNSGYVQRGLSRYTRAWGPIYRSLSGEELAILIR 381 Query: 165 NGVINPRIHPNVASS 179 +G + + + Sbjct: 382 DGRVTDQFSKTELAR 396 >UniRef50_Q5ULM2 Orf92 n=1 Tax=Lactobacillus phage LP65 RepID=Q5ULM2_9CAUD Length = 556 Score = 92.0 bits (227), Expect = 2e-17, Method: Composition-based stats. Identities = 30/221 (13%), Positives = 58/221 (26%), Gaps = 23/221 (10%) Query: 42 TLTVQAYTVNPQTERVKMY----WQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDES 97 + + T K+ ++ + A+N G+++ S Sbjct: 317 GASYVFVRIPKTTNTGKILSPKLALTSSDGSLSGTKRPTLRYAKDNDTIFAVNAGLFNVS 376 Query: 98 Y-APLGLYIENGQ-QKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTS----KE 151 P+G I NG + + + T+ Sbjct: 377 TVEPVGQLIINGISLINTPMTSDNGVTINPNECYPLAIDANGDLTTYPRNADTADMIAAG 436 Query: 152 IQFAVQSGPMLMENGVINPRIHPN---VASSKIRNGVGINKHGNAVFLLSQ--------- 199 +++AV + L++N I N IR +G ++G Sbjct: 437 VKYAVTAWGKLVDNFEIATTDIENEIVHNGRYIRQSIGQYQNGYYCVCTVDMTRGSVTNE 496 Query: 200 QATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQ 240 + + A K V+ LDG S + G Sbjct: 497 AGLYYKELAQIFVDK-GVKFAFSLDGGGSAETVLGKRQLNP 536 >UniRef50_A1VEZ3 Putative uncharacterized protein n=4 Tax=Desulfovibrio vulgaris RepID=A1VEZ3_DESVV Length = 311 Score = 92.0 bits (227), Expect = 2e-17, Method: Composition-based stats. Identities = 32/238 (13%), Positives = 72/238 (30%), Gaps = 25/238 (10%) Query: 34 DDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGI 93 ++ + A ++P ++ G +L A + + A+N + Sbjct: 61 SHPDTAEGPTVLVALRIDPNLWDFSLHTATGEGGYPLSLGAW----AEKLNLGAAINSSM 116 Query: 94 YDESYAPLGLYIENGQQKVALNLASGEGNFFI-RPGGVFYVAGDKVGIVRLDAFKTSKEI 152 Y +++ G+ + + G+FF+ P D + + Sbjct: 117 YLPDVRTSTGFLKAGEHVNNPRVTTKFGSFFVAAPDDPTLPQADLLDRAIDPWAERLPHY 176 Query: 153 QFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAK 212 VQ+ ++ N I S VG + G +FL ++ + FA Sbjct: 177 NMVVQNYRLISTNRRILWPQGGPEYSIA---AVGQDGSGAILFLHCREPMTAHAFASMLL 233 Query: 213 A-KLNVEQLLYLDGT----------ISHMYMKGGAIPWQRY------PFVTMISVERK 253 A L++ ++Y++G G P ++ ++ Sbjct: 234 ALPLDIHDVMYVEGGPQAGLLLHSQSQTRIWMGRHRADFWGTGNAEAPLPNILGARKR 291 >UniRef50_A7HB86 Putative uncharacterized protein n=4 Tax=Anaeromyxobacter RepID=A7HB86_ANADF Length = 287 Score = 91.6 bits (226), Expect = 2e-17, Method: Composition-based stats. Identities = 33/243 (13%), Positives = 72/243 (29%), Gaps = 21/243 (8%) Query: 18 RIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLA 77 R + LF + ++P +K+ A GE Sbjct: 44 RTLEPGLEMGLFDGPPAGEEAR----PIAVVRIDPARFELKLLNASAPGE---GTLRTAR 96 Query: 78 DINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDK 137 + A+N +Y E Y + ++ P Sbjct: 97 AWAERAGASAAINASMYQEDYRTSVSLMRTRHHVNQRRVSKDRSVLAFDP---LARGASP 153 Query: 138 VGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIR----NGVGINKHGNA 193 V I+ + +++ A Q+ L+++ + NV + R +G++ G Sbjct: 154 VRII----DRDCDDLERAAQTYGTLVQSIRLVSCDRKNVWAPSARRFSAAAIGVDAKGRV 209 Query: 194 VFLLSQQATNFYDFACYAKA-KLNVEQLLYLDGTISHMYMK--GGAIPWQRYPFVTMISV 250 +F+ ++ ++ A + + Q +Y++G GG F + Sbjct: 210 LFIHARTPWPVHELVNALLALPIELRQAMYVEGGPEAQLFVRGGGRQHEWVGGFEHVPQA 269 Query: 251 ERK 253 E + Sbjct: 270 ENR 272 >UniRef50_B0JGJ2 Putative uncharacterized protein n=2 Tax=Microcystis aeruginosa RepID=B0JGJ2_MICAN Length = 607 Score = 90.5 bits (223), Expect = 5e-17, Method: Composition-based stats. Identities = 25/187 (13%), Positives = 55/187 (29%), Gaps = 30/187 (16%) Query: 94 YDESYAP-----LGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAG---------DKVG 139 + +Y P GL ++ + LN + + I G + ++V Sbjct: 417 WGSNYHPLTERETGLVVQGDRVTEKLNNLFPQDSIKIPENGYLVICRKTDISLNIGERVN 476 Query: 140 IVRLDAFKTSKEIQFAVQSGPMLMENGVINPR------IHPNVASSKIRNGVGINKHGNA 193 + + + +GP+L++NG I R+ + +++ G Sbjct: 477 LDSVTLPGDFANYPQILGAGPLLLQNGRIVLDGNAEKFSPAFQNQQASRSAIAVSREGKI 536 Query: 194 VFLLSQQAT-----NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFV--- 245 + + + A + + L LDG S G + + Sbjct: 537 LLVAIHNRVGGRGATLGELARILLL-MAAKDGLNLDGGSSTGIALAGYLLDRSAVTAAKV 595 Query: 246 -TMISVE 251 I + Sbjct: 596 HNGIGIF 602 Score = 76.6 bits (187), Expect = 7e-13, Method: Composition-based stats. Identities = 21/138 (15%), Positives = 43/138 (31%), Gaps = 4/138 (2%) Query: 23 LTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQ 82 + P L V ++P+ ++ + AN + + L+ INS+ Sbjct: 271 IVWQPGLIWNQKYIQLDQDWFPVTWLEIDPRNPQITIKPITANSTSMRGTNPLIT-INSE 329 Query: 83 GQVQMAMNGGIYDESYA-PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIV 141 +NGG ++ + PLG +G+ L G G + + Sbjct: 330 SNAVAMINGGFFNRNNQLPLGAIRVDGKWLSGPILN--RGAIAWDNRGKIRIDRLSLEET 387 Query: 142 RLDAFKTSKEIQFAVQSG 159 + A + + Sbjct: 388 LITATGQRFPLTQLNSAF 405 >UniRef50_A9QSN5 Exopolysaccharide biosynthesis protein n=4 Tax=Lactococcus lactis RepID=A9QSN5_LACLK Length = 303 Score = 90.5 bits (223), Expect = 5e-17, Method: Composition-based stats. Identities = 30/217 (13%), Positives = 66/217 (30%), Gaps = 18/217 (8%) Query: 33 ADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGG 92 LS + + + ++ +++ + + MN Sbjct: 96 EKFTNLSSGNFKIY-----KAHSPQILKTATSADSPVVSMSEVISKYPNS----LIMNAS 146 Query: 93 IYDES-YAPLGLYIENGQQKVALNLASGEGN-FFIRPGGVFYVAGDKVGIVRLDAFKTSK 150 ++ + G I NG+ N N F G + + K Sbjct: 147 GFNMTTGKITGFQINNGKLFKDWNSDKRATNAFVFNKNGSSDIYNST----TPASEILKK 202 Query: 151 EIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACY 210 + + G +L+++G P + +G +K N ++S +T + Sbjct: 203 GAEMSFSFGSILIKDGKSLPSDGTVNWEIH--SFIGNDKDNNIYLIISDTSTGYQSIMEK 260 Query: 211 AKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTM 247 KL++E + +DG S G I + ++ Sbjct: 261 F-QKLHLENVQVMDGGGSSQMSLNGQIIYPSQDSRSV 296 >UniRef50_B7DMS1 Copper amine oxidase domain protein n=3 Tax=Alicyclobacillus acidocaldarius RepID=B7DMS1_9BACL Length = 354 Score = 90.1 bits (222), Expect = 6e-17, Method: Composition-based stats. Identities = 34/161 (21%), Positives = 59/161 (36%), Gaps = 10/161 (6%) Query: 101 LGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGP 160 + +G + G + D V + + A+ +GP Sbjct: 196 ITPIPPDGYDIEIGAGEAKTPIVTRVHVGDPAILTDTVLALPSEKPVPFAAYPNAIGAGP 255 Query: 161 MLMENGVINP-------RIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKA 213 ML++NG I+ + + +R+ VGI++ G+ +FL +A N + A AKA Sbjct: 256 MLVQNGRIDVEPSLEGLDEPDILNAETLRSVVGIDRAGHLIFLTIHEA-NVWQEASIAKA 314 Query: 214 KLNVEQLLYLDGTISHMYMKGGAIPWQR-YPFVTMISVERK 253 L + + LDG S G T I V ++ Sbjct: 315 -LGLWDAMNLDGGSSVGLWYEGRYLTPPKRALATAIVVVQR 354 Score = 39.6 bits (91), Expect = 0.088, Method: Composition-based stats. Identities = 18/73 (24%), Positives = 26/73 (35%), Gaps = 4/73 (5%) Query: 38 LSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDES 97 L+ VQ ++ ++ AN A LA I +Q A+NG +D Sbjct: 36 LTVDGQAVQELVIDLHAPGIEAKPAIANHRL--GTTASLAAIAAQNHAVAAINGTFFDAG 93 Query: 98 Y--APLGLYIENG 108 P G NG Sbjct: 94 GDNFPAGAIELNG 106 >UniRef50_A4XD34 Putative uncharacterized protein n=1 Tax=Salinispora tropica CNB-440 RepID=A4XD34_SALTO Length = 430 Score = 90.1 bits (222), Expect = 6e-17, Method: Composition-based stats. Identities = 20/113 (17%), Positives = 32/113 (28%), Gaps = 13/113 (11%) Query: 153 QFAVQSGPMLMENGVINPRIHPNVA-SSKIRNGVGINKHGNAVFLLSQQATN------FY 205 FAV L+++G I + R G G V + Sbjct: 314 TFAVNGRYRLVKDGQIVAPSGSDSFFDRHPRTIAGTTLDGKIVLVTIDGRQTTSVGTTMT 373 Query: 206 DFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQR-----YPFVTMISVERK 253 + A A L + + LDG S G++ Q P + + Sbjct: 374 ETASV-AAALGMHDAVNLDGGGSTTMSVEGSLVNQPSGNEERPVGDALVYIDR 425 Score = 44.3 bits (103), Expect = 0.004, Method: Composition-based stats. Identities = 12/101 (11%), Positives = 27/101 (26%), Gaps = 9/101 (8%) Query: 33 ADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGG 92 V T++P + + A D+ + + +N Sbjct: 79 DQIGDTPRGPWVVNVLTIDPTQSKGHLAATYGPDLAG---VEKTTDLVREADALVGVNAS 135 Query: 93 IYDES------YAPLGLYIENGQQKVALNLASGEGNFFIRP 127 + + P+GL I G+ + E + + Sbjct: 136 FFTFTASAEYPGDPVGLGIYGGRLLSEPTGDAAEADLVLDA 176 >UniRef50_B1HN11 Putative uncharacterized protein n=2 Tax=Bacillaceae RepID=B1HN11_LYSSC Length = 815 Score = 89.7 bits (221), Expect = 8e-17, Method: Composition-based stats. Identities = 32/246 (13%), Positives = 67/246 (27%), Gaps = 37/246 (15%) Query: 27 PLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQ 86 A+A + + Y ++ + ++ Sbjct: 137 SGKAIADYYTTNLSFQVNGKTYPIDLINS----ERGTNKNVLYTPEKKTTGTNA--WGLE 190 Query: 87 MAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFY-------------V 133 + ++G D + G + EGN + G + Sbjct: 191 LVVSGASQDTNTLHFGDQFSG--TVSHVTTYGAEGNSAVPADGFVISVQNKELAAELSNI 248 Query: 134 AGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVIN---PRIHPNVASSKIRNGVGINKH 190 + V L + + QF + +GPML+ NG ++ P ++ R V ++ Sbjct: 249 SAGTNIDVSLSIDQKWMDAQFILAAGPMLVRNGQVDISMPTNSGFASTRSPRTAVAVDAT 308 Query: 191 G-NAVFLLSQQA-------TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRY 242 G + N D A + + + + LDG S + Sbjct: 309 GTKVSLITIDGRLSGHSNGVNLSDLASHLIS-IGATSAINLDGGGSTAMVAR----NPGG 363 Query: 243 PFVTMI 248 F ++ Sbjct: 364 YFANLV 369 Score = 42.3 bits (98), Expect = 0.014, Method: Composition-based stats. Identities = 14/133 (10%), Positives = 26/133 (19%), Gaps = 15/133 (11%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINS--- 81 + P V VN K+ N + + Sbjct: 25 VSPQVNHVQQTYTSGSIRQFVNVLDVNLNNTYTKLEIGMPNP---INSLKTTSAMAKQNT 81 Query: 82 --QGQVQMAMNGGIYDESYAPLGLYIENGQQKV-------ALNLASGEGNFFIRPGGVFY 132 +V A+N + + P L E + + F + G Sbjct: 82 YDGHRVVGAVNASYFLGNGMPANLLAEKNEIVNYGILGDTYDSPTQKPVAFGLSKSGKAI 141 Query: 133 VAGDKVGIVRLDA 145 + Sbjct: 142 ADYYTTNLSFQVN 154 >UniRef50_A4FD37 Secreted protein n=1 Tax=Saccharopolyspora erythraea NRRL 2338 RepID=A4FD37_SACEN Length = 519 Score = 89.7 bits (221), Expect = 9e-17, Method: Composition-based stats. Identities = 24/162 (14%), Positives = 45/162 (27%), Gaps = 23/162 (14%) Query: 102 GLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPM 161 G G L A+ R G +V D+ A V GP Sbjct: 344 GPVPAGGTVVQGLGQAAEWLVAHARAGEPLWV--DQQIREESGAPLRLGPSDDIVNGGPE 401 Query: 162 LMENGVINPRIHPNV-------------ASSKIRNGVGINKHGNAVFLLSQQATN----- 203 L+ +G + + + R+ +G++ G + ++ Sbjct: 402 LVRDGQVRINLQEDGIIHDAPSFAYTWGLKRNPRSVIGVDAQGRVILATTEGRMPGFSDG 461 Query: 204 --FYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYP 243 + A + +A L + LDG S + + Sbjct: 462 WGLPEAAEFVRA-LGAVDAMALDGGGSAGMVVDDRVVTTPSD 502 >UniRef50_C3YJA0 Putative uncharacterized protein n=3 Tax=Branchiostoma floridae RepID=C3YJA0_BRAFL Length = 851 Score = 89.3 bits (220), Expect = 1e-16, Method: Composition-based stats. Identities = 26/172 (15%), Positives = 47/172 (27%), Gaps = 21/172 (12%) Query: 45 VQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE-SYAPLGL 103 +N + + + + +A+N G +D + A LG Sbjct: 324 GHYTVINNPLRTFSVVEPGGSNGCMEPRRRTVTQTSQTRTCHVALNAGFFDTRTGACLGN 383 Query: 104 YIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLM 163 + +GQ+ + NF IR G V + D + V L+ Sbjct: 384 VVTDGQRVQD-SGGIQNANFGIRKDGTIVVGY----LSEQDVLREDNPFVQLVSGVIWLV 438 Query: 164 ENGVINPRI---------------HPNVASSKIRNGVGINKHGNAVFLLSQQ 200 N + V R VG ++ G V + + Sbjct: 439 RNATVYVNESRTTECADIQETGTLDRFVNVVSARTAVGHDEEGRVVLVHIEG 490 >UniRef50_A9NEV6 Hypothetical surface-anchored protein n=1 Tax=Acholeplasma laidlawii PG-8A RepID=A9NEV6_ACHLI Length = 520 Score = 88.6 bits (218), Expect = 2e-16, Method: Composition-based stats. Identities = 15/119 (12%), Positives = 36/119 (30%), Gaps = 15/119 (12%) Query: 132 YVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVAS------SKIRNGV 185 + +V+ + ++ A+ +G +L+++G + ++ S R + Sbjct: 263 LITSSDTILVQELLGNGFENVRNAIGTGQLLVKDGAVQHAAFKSLPSNNMAHFRHPRTAI 322 Query: 186 GINKHGNAVFLLSQQATNF--------YDFACYAKAKLNVEQLLYLDGTISHMYMKGGA 236 G G F++ + K LDG S + + Sbjct: 323 GQKADGTVFFIVVDGRDALSGKYGVKYSELGELMKMH-GAVTAFNLDGGGSSTMLLRNS 380 >UniRef50_B5W3X9 Putative uncharacterized protein n=3 Tax=Arthrospira RepID=B5W3X9_SPIMA Length = 812 Score = 88.6 bits (218), Expect = 2e-16, Method: Composition-based stats. Identities = 22/134 (16%), Positives = 39/134 (29%), Gaps = 23/134 (17%) Query: 137 KVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPR------IHPNVASSKIRNGVGINKH 190 V +V E + +GP+L+ + IR+ VG+ + Sbjct: 670 PVQLVDETNPPDFAEYPHILGAGPLLLRGNQVVLDARAENFSDAFNTQRAIRSAVGLKTN 729 Query: 191 --GN---------AVFLLSQQA-----TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKG 234 G + ++ + + A K +L L LDG S G Sbjct: 730 TPGRSGSDSPAVSLLLVVVHPRLGGPGPSLAELAELMK-QLGATDALNLDGGSSTGLYLG 788 Query: 235 GAIPWQRYPFVTMI 248 G + + I Sbjct: 789 GYLLDRPPQTAAPI 802 Score = 64.7 bits (156), Expect = 3e-09, Method: Composition-based stats. Identities = 18/193 (9%), Positives = 47/193 (24%), Gaps = 27/193 (13%) Query: 23 LTLLPLFAVAADDCAL-----------------SDPTLTVQAYTVNPQTERVKMYWQKAN 65 + L V ++ + + + + Sbjct: 427 IVWAEGLRWQQKYIDLNSHQRLFTPPPSNSTGRESARFPVVWLEIDLNNQGISLQPILSR 486 Query: 66 GEAWGTLHALLADINSQGQVQMAMNGGIYDESYA-PLGLYIENGQQKVALNLASGEGNFF 124 + + L+ + S + A+NGG ++ + PLG + L G Sbjct: 487 PGSRSGVSPLV-HVASSTRAAAAINGGFFNRNNQYPLGAIRHQNRWLSGPILN--RGAIA 543 Query: 125 IRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENG--VINPRIHPNVASSKIR 182 F++ ++ + + + ++ G + R Sbjct: 544 WTDQNQFFIDRLELSQTLFRDRREPIPLARLNSAF---IQRGLSRYTSDWGSTYSPFTPR 600 Query: 183 NGVGINKHGNAVF 195 + I G + Sbjct: 601 E-IAITVQGETII 612 >UniRef50_C9N2Q2 Metallophosphoesterase n=2 Tax=Actinomycetales RepID=C9N2Q2_9ACTO Length = 1163 Score = 88.2 bits (217), Expect = 3e-16, Method: Composition-based stats. Identities = 25/205 (12%), Positives = 52/205 (25%), Gaps = 32/205 (15%) Query: 59 MYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLY-IENGQQKVALNLA 117 + A + A + + + P+ + +G+ ++ Sbjct: 210 LAAYNAANVPAQGVGAYTSAWGGADRAPAV-------DDARPVAEVAVRDGEVVS---VS 259 Query: 118 SGEGNFFIRPGGVFYV-------------AGDKVGIVRLDAFKTSKEIQFAVQSGPMLME 164 G G+ + V V GD V I + AV +L+ Sbjct: 260 DGPGSGPVPEDTVVLVGREAGAGLLAALEPGDPVKIAYRARTDGGAVPRTAVGGRELLVV 319 Query: 165 NGVINPRIHPNVASSKIRNGVGINKHGNAV-FLLSQQAT------NFYDFACYAKAKLNV 217 +G ++ R VG ++ G + + + + + Sbjct: 320 DGAAQNHDGEGNNTAAPRTAVGFSEDGRTMQVVTVDGRQTDSGGVTLTELGEMMR-RAGS 378 Query: 218 EQLLYLDGTISHMYMKGGAIPWQRY 242 L LDG S + Sbjct: 379 YSALNLDGGGSSTLVARQPGSDTLR 403 Score = 45.8 bits (107), Expect = 0.001, Method: Composition-based stats. Identities = 26/140 (18%), Positives = 51/140 (36%), Gaps = 17/140 (12%) Query: 2 AHQLLIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYW 61 A ++ G G+ T R + P +++ D SD L V +V+ V+ + Sbjct: 62 ARSVVDGDGIETARATR-----PVAPGVRLSSYDRLESDKWLRVDTLSVDLDGSGVRADY 116 Query: 62 QKANGEAWGTLHALLADINSQG------QVQMAMNGGIYDES--YAPLGLYIENGQQKVA 113 + A ++++ + + A+N +D + APLG I +G+ + Sbjct: 117 LSSGKVA---DRHTVSELAAGHDPGKGRRTVAAINADFFDINQTGAPLGPGINDGRTVHS 173 Query: 114 LNLA-SGEGNFFIRPGGVFY 132 + F R G Sbjct: 174 PATGVNRAVGFGPRNAGRVL 193 >UniRef50_Q9L2D5 Putative secreted protein n=2 Tax=Streptomyces RepID=Q9L2D5_STRCO Length = 428 Score = 87.8 bits (216), Expect = 3e-16, Method: Composition-based stats. Identities = 20/196 (10%), Positives = 45/196 (22%), Gaps = 14/196 (7%) Query: 53 QTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKV 112 + A+ + + +++N + G+ Sbjct: 219 SRFKAPTPVPDADARFTEDDDPGAEAVVAADGTVLSLN-----PNGRGGVTVPTGGRVLQ 273 Query: 113 ALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRI 172 + PG D + P L+ N Sbjct: 274 GTGTGADWLRAHATPGTDLAFEERLHDERFGDDIPLDSSVDVVNGHYP-LVHN--AQYAY 330 Query: 173 HPNVASSKIRNGVGINKHGNAVFLLSQQ-----ATNFYDFACYAKAKLNVEQLLYLDGTI 227 + R+ + ++ G +F+ + +FA L L +DG Sbjct: 331 TGQNTAVDPRSAIAVDGPGRTLFVTATGKSGRNGVTLDEFARILLD-LGAVDGLNMDGGG 389 Query: 228 SHMYMKGGAIPWQRYP 243 S + A+ + Sbjct: 390 STTLVVEQAVVNRPSD 405 >UniRef50_A7HN47 Putative uncharacterized protein n=1 Tax=Fervidobacterium nodosum Rt17-B1 RepID=A7HN47_FERNB Length = 528 Score = 87.8 bits (216), Expect = 3e-16, Method: Composition-based stats. Identities = 23/157 (14%), Positives = 45/157 (28%), Gaps = 16/157 (10%) Query: 108 GQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGV 167 G + N + + +++ AV +GP+L+++ Sbjct: 373 GYVEYVPNNSLVIAISNDIQKQYLPKISVGADVSVELYTDNGYKVKNAVGAGPLLIQDKK 432 Query: 168 INPRIHPN--------VASSKIRNGVGINKHGNAVFLLSQQ----ATNFYDFACYAKAKL 215 I + R + I K G + + NF + A + +K Sbjct: 433 IIQDAAEEKLRYGGGIPTTRASRTIIAI-KDGKVHLITIEGTNGTGMNFDEAAQFLLSK- 490 Query: 216 NVEQLLYLDGTISHMYMKGGAIP--WQRYPFVTMISV 250 E + LDG S + G + + V Sbjct: 491 GYESAMMLDGGGSTGMVYAGKLVTINSPRNIPVALGV 527 Score = 48.1 bits (113), Expect = 3e-04, Method: Composition-based stats. Identities = 13/85 (15%), Positives = 27/85 (31%), Gaps = 5/85 (5%) Query: 29 FAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMA 88 + T V ++P+ + + A L+ I SQ + Sbjct: 214 IDYIKKTETFAGRTFIVNYLILDPRFVDLVPVL----PKNGIGYTAQLSSILSQNGLIHG 269 Query: 89 MNGGIYDE-SYAPLGLYIENGQQKV 112 +N +D + P+ + I G+ Sbjct: 270 VNANYFDPATGMPIDIIISGGKVLS 294 >UniRef50_Q2BF40 Putative uncharacterized protein n=1 Tax=Bacillus sp. NRRL B-14911 RepID=Q2BF40_9BACI Length = 657 Score = 87.4 bits (215), Expect = 4e-16, Method: Composition-based stats. Identities = 27/146 (18%), Positives = 47/146 (32%), Gaps = 20/146 (13%) Query: 104 YIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLM 163 E+G A + + + + ++ K ++ + SGP+L+ Sbjct: 233 IPEDGFVLSAHGTSLPA---------LKEMKPGDTVEIAINIDDKWKNSEYMLASGPLLV 283 Query: 164 ENGVINPRIHPNVA---SSKIRNGVGINKH-GNAVFLLSQQA------TNFYDFACYAKA 213 NG ++ + PN R V I+K + N +FA Y Sbjct: 284 NNGKVDLGMDPNSTRARERAPRTAVAIDKTMSKVFLVTVDGRLAESKGMNLTEFAQYLV- 342 Query: 214 KLNVEQLLYLDGTISHMYMKGGAIPW 239 KL + L LDG S + Sbjct: 343 KLGAYKALNLDGGGSTAIIARKNGND 368 Score = 43.5 bits (101), Expect = 0.008, Method: Composition-based stats. Identities = 16/121 (13%), Positives = 37/121 (30%), Gaps = 3/121 (2%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTE--RVKMYWQKANGEAWGTLHALLADINSQ 82 + P + ++ +++ VN ++ + + Sbjct: 29 VSPGVKYKENREVINSYNQSIKFLEVNMADPYTKLDLSIPLPLNTISTVSAQAKLNHREG 88 Query: 83 GQVQMAMNGGIYDESYA-PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIV 141 +V A+NG +D S P+ L + +A+G + +P + K I Sbjct: 89 NRVVGAVNGSFFDMSTKLPMYLISYRNKVMNTGIIATGSDQYVNKPVAFGINSQGKPQIE 148 Query: 142 R 142 Sbjct: 149 S 149 >UniRef50_B8I1Q9 Ig-like, group 2 n=3 Tax=Clostridium RepID=B8I1Q9_CLOCE Length = 952 Score = 87.4 bits (215), Expect = 5e-16, Method: Composition-based stats. Identities = 25/186 (13%), Positives = 50/186 (26%), Gaps = 38/186 (20%) Query: 103 LYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQF-------- 154 + +E+G N + + G + D F +++ Sbjct: 210 MVVEDG-IVKEFNENKPSMD--MPKNGFVVLGAGSHIQYLKDNFNVGDPVEYNITMNVDT 266 Query: 155 -----AVQSGPMLMENGVINPRIHPNVAS---SKIRNGVGINKHGNAVFLL-SQQA---- 201 A+ G ML+++ + N S R +G +K G + + Sbjct: 267 NNMKMALTGGAMLVKDDKVLTSFSHNPVSPSTRASRTAIGTSKDGKTLIVAAVDGRSSAS 326 Query: 202 --TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGA------IPWQR-----YPFVTMI 248 + A Y +L L LDG S + + + + Sbjct: 327 IGMTQSELASYM-HELGCANALNLDGGGSTTLVARKQGTTGLSVQNRPSDGSQRGVGASL 385 Query: 249 SVERKG 254 + G Sbjct: 386 GIFSVG 391 Score = 51.6 bits (122), Expect = 3e-05, Method: Composition-based stats. Identities = 14/125 (11%), Positives = 34/125 (27%), Gaps = 9/125 (7%) Query: 24 TLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQG 83 T+ + + D + + V+ + VK+ G + + ++ Sbjct: 40 TITSGVTLESYDRFTTSGWIKSYVLRVDLSNKNVKVDTLVNKKSVVGY--STVLNLAKNS 97 Query: 84 QVQMAMNGGIYDES------YAPLGLYIENGQQ-KVALNLASGEGNFFIRPGGVFYVAGD 136 A+NG +D G + +G+ A + F + Sbjct: 98 GAIAAVNGSFFDFGPSGSGKGYTYGPVVSSGEIDLAATRDSKDTATFSLNDVNEALFTYW 157 Query: 137 KVGIV 141 + Sbjct: 158 NTKVE 162 >UniRef50_B4WHW3 Putative uncharacterized protein n=1 Tax=Synechococcus sp. PCC 7335 RepID=B4WHW3_9SYNE Length = 335 Score = 86.6 bits (213), Expect = 6e-16, Method: Composition-based stats. Identities = 38/256 (14%), Positives = 66/256 (25%), Gaps = 59/256 (23%) Query: 29 FAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMA 88 A T P T ++ + + A + + Sbjct: 62 VDSAYGYQKRYFKGAIAHVVT-QPPTVQLSIAVAQE--------LATIEAFAERTNADYI 112 Query: 89 MNGGIYDE-SYAPLGLYIE----------NGQQKVALNLASGEGNFFIRPGGVFYVAGDK 137 +NGG +D + I N + NL R Y Sbjct: 113 INGGFFDPHNGKTTSHLISQEQTVSDPADNERLINNSNLGQYMAQILNRSEFRVYRCRQA 172 Query: 138 VGIVR-------------------LDAFKTSKEIQFAVQSGPMLM-------------EN 165 + R EI A+ +GP L+ ++ Sbjct: 173 SVVERGGLEGSLTEEAVVYDITFHNAPPPDGCEIDTAIGAGPQLLPADTSWVEGFIDYDD 232 Query: 166 GVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQ-----ATNFYDFACYAKAKLNVEQL 220 G I R R+ +G+ G ++ ++ + A +AK+ L + +L Sbjct: 233 G-ILFRDAIGSRQPNARSAIGLYPDGAIALIMVEKSASSIGMTLLELADFAKS-LGITKL 290 Query: 221 LYLDGTISHMYMKGGA 236 L LDG S Sbjct: 291 LNLDGGSSSALSVAER 306 >UniRef50_Q2JUI0 Conserved domain protein n=2 Tax=Synechococcus RepID=Q2JUI0_SYNJA Length = 411 Score = 86.6 bits (213), Expect = 6e-16, Method: Composition-based stats. Identities = 39/296 (13%), Positives = 80/296 (27%), Gaps = 57/296 (19%) Query: 5 LLIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKA 64 L+G G + + + F L L + + +P R + W A Sbjct: 91 SLVGLGELPAFSRERGAVAAINGGFFNRNTRQPLGAIRLEGRWIS-SPILGRGAIAWFDA 149 Query: 65 NGEAWG---------TLHALLADINSQGQVQMAMNGGIYDES---YAP-LG--------- 102 G + L + + +N G Y P G Sbjct: 150 ANSPPGLEPVRFARLRMQIELRNDLGDRIPLVGINSGYILPGIAQYTPDWGSTYTTQTDD 209 Query: 103 ---LYIENGQQKVALNLA-SGEGNFFIRPGGVFY--------------VAGDKVGIVRLD 144 L ++ + + L+ +G+ I P G + GD++ + Sbjct: 210 EVLLIVQEDRLQQILSAGVAGKVTVPIPPKGYILAARGLEGALEAGKLIPGDRLRLDWTV 269 Query: 145 AFKTSKEIQFAVQSGPMLMENGVINPRIHPNV------ASSKIRNGVGINK---HGNAVF 195 + + +GP+L+ +G + R+ + + + + Sbjct: 270 DPLELEAYPHILGAGPLLLLDGQVVLDAELEGFQPLFRRQQAARSAICLRQGQPDNRDLL 329 Query: 196 LLSQQ------ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFV 245 L++ + A + +L L LDG S + G Sbjct: 330 LVAAGNAQENQGLTLLEMAQLLR-QLGCRHALNLDGGRSSTLVLGEEAVNLEPEIG 384 Score = 53.5 bits (127), Expect = 6e-06, Method: Composition-based stats. Identities = 18/118 (15%), Positives = 38/118 (32%), Gaps = 4/118 (3%) Query: 12 ITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGT 71 + + ++ + L D + V V+ +++ W + Sbjct: 37 VPAQTLLNGQPVRVVEGIPLYQRMVWLEDRRILVSVVAVSLAAGQLRPIWADPA--SLVG 94 Query: 72 LHALLADINSQGQVQMAMNGGIYDESYA-PLGLYIENGQQKVALNLASGEGNFFIRPG 128 L L + + A+NGG ++ + PLG G+ + L G +F Sbjct: 95 LGELP-AFSRERGAVAAINGGFFNRNTRQPLGAIRLEGRWISSPILGRGAIAWFDAAN 151 >UniRef50_A5D3T7 Hypothetical membrane protein n=1 Tax=Pelotomaculum thermopropionicum SI RepID=A5D3T7_PELTS Length = 887 Score = 85.5 bits (210), Expect = 1e-15, Method: Composition-based stats. Identities = 28/185 (15%), Positives = 55/185 (29%), Gaps = 40/185 (21%) Query: 104 YIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQF--------- 154 ++NG + L G I P G + L+ ++ + Sbjct: 219 VVKNGVVQQVLTDQPG---VPIPPDGYVLRGHGQAARFILENLPAGSKVSYTYSVMPQGD 275 Query: 155 ----AVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLL-------SQQ--- 200 AV +L+E G + N+A R G++K G ++L+ S Sbjct: 276 KLFAAVGGQALLVEEGRLPAYFTQNIAGKHARTAAGVSKDGKTLYLVAVEKQSASDGTVV 335 Query: 201 --ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGG------AIPWQR-----YPFVTM 247 + A + + + V + + LDG S ++ + Sbjct: 336 SRGMTQEELAEFLIS-IGVWRAVNLDGGGSTTLAARHLGDFNVSLINRPQGKSQRSVPNA 394 Query: 248 ISVER 252 I + Sbjct: 395 IGIFS 399 Score = 50.0 bits (118), Expect = 6e-05, Method: Composition-based stats. Identities = 15/112 (13%), Positives = 34/112 (30%), Gaps = 3/112 (2%) Query: 24 TLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQG 83 T+ + ++ L V + +K+ + + + Sbjct: 44 TVTRGAVLQTVRMTTNEGPLNVYILKADLSDPYLKVDTIVGADGTLAKN-QTVTAMAGRA 102 Query: 84 QVQMAMNGGIYDE--SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYV 133 A+NG + S P+GL + G+ + L S F + + + Sbjct: 103 GAVAAVNGDFFQMKESGRPIGLLYQGGRLIESPALRSDMYGFAVTKDKLPIL 154 >UniRef50_Q30YC1 Putative uncharacterized protein n=1 Tax=Desulfovibrio desulfuricans subsp. desulfuricans str. G20 RepID=Q30YC1_DESDG Length = 383 Score = 85.5 bits (210), Expect = 2e-15, Method: Composition-based stats. Identities = 29/254 (11%), Positives = 68/254 (26%), Gaps = 54/254 (21%) Query: 23 LTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQ 82 +++ +A D + TV ++P+ +++Y G ++ + Sbjct: 88 ISVAQGLELAESSAVFRDTSGTVALLRIDPRHYSLQLYTISEQGGPP----QTPSEWAAL 143 Query: 83 GQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFI----------------- 125 + +N ++ + Y+ NG + G+F + Sbjct: 144 YNLDAVINASMFLPDGSTSTGYMRNGTAANNSRINQRFGSFLVFSPLPPHAAASDGQPPA 203 Query: 126 -------------------------RPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGP 160 A D + D + VQ+ Sbjct: 204 GGTQPDPYAPAAAHTTAARNTPNDAGSDNQQLPAADVLDRYADDWQTLLPRYRGVVQNFR 263 Query: 161 MLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKA---KLNV 217 M+ + + S +G + G +F+ S+ T + + Y L Sbjct: 264 MISADRKPLWPEEGDSFSIA---AIGKDTQGRILFIHSRAQTTVRELSEYLLDICPSLGA 320 Query: 218 EQLLYLDGTISHMY 231 +Y++G Sbjct: 321 T--MYVEGGAQAAL 332 >UniRef50_Q03K73 Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase n=3 Tax=Streptococcus thermophilus RepID=Q03K73_STRTD Length = 179 Score = 84.7 bits (208), Expect = 2e-15, Method: Composition-based stats. Identities = 21/135 (15%), Positives = 38/135 (28%), Gaps = 17/135 (12%) Query: 122 NFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASS-- 179 + G F D + GP L+ENG + + V Sbjct: 16 IPAVYSDGSFKTFNDS---ETTAQKLVDSGVVNTFAFGPTLVENGKVAVSENEEVGQDMA 72 Query: 180 -KIRNGVGI----NKHGNAVFLLSQQAT------NFYDFACYAKAKLNVEQLLYLDGTIS 228 R + + ++ + + ++S T Y+ A K+ V LD S Sbjct: 73 DNPRTAIVVNEESDRSVHYIVIVSDGRTSESSGLTLYEMAELMKS-YGVMTGYNLDVGDS 131 Query: 229 HMYMKGGAIPWQRYP 243 G + + Sbjct: 132 STMYSNGQVINKPTH 146 >UniRef50_A5ILT0 Putative uncharacterized protein n=6 Tax=Thermotogaceae RepID=A5ILT0_THEP1 Length = 553 Score = 84.3 bits (207), Expect = 3e-15, Method: Composition-based stats. Identities = 24/174 (13%), Positives = 51/174 (29%), Gaps = 25/174 (14%) Query: 103 LYIENGQQ-----KVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQ 157 +++ + K + + + + G I+ AV+ Sbjct: 375 FVVKDSKVSQIGYKSRAEDSEYVVSISKKYEKYLSDLKEGDGAYLSLQPNIPLRIKQAVE 434 Query: 158 SGPMLMENGVINPRIHPN--------VASSKIRNGVGINKHGNAVFLLSQQ------ATN 203 GP+L++NG P + R + K G FL+ + Sbjct: 435 GGPLLIQNGAPIPDAWEEKARYGGGIAYAKAPRTVIA-TKDGKLWFLVFEGYNHITRGLT 493 Query: 204 FYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAI----PWQRYPFVTMISVERK 253 + + + ++ E + +DG S + G++ I V K Sbjct: 494 YDELVDFLISR-GFEDAMCVDGGSSSVMAVAGSLFGRTENSTAAIPVGIVVWEK 546 Score = 53.5 bits (127), Expect = 7e-06, Method: Composition-based stats. Identities = 23/181 (12%), Positives = 50/181 (27%), Gaps = 21/181 (11%) Query: 25 LLPLFAVAADDCALS-DPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQG 83 + V ++P+ +K L ++ + Sbjct: 222 VADGVVFERKIEDFGEGEKTVVNYLIMDPEKVTIKPVVS----GNGFGTIERLDEMVKRV 277 Query: 84 QVQMAMNGGIYDESYA-PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVR 142 + +NG +D P+GL + +G+ A+ G F I ++ V + Sbjct: 278 EGIAGINGNYFDPVTKFPIGLVVIDGKPYSAMFG--GRPVFAITEDNRVFIGRIIVDVT- 334 Query: 143 LDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQAT 202 +++ F V+ L E G + K F++ Sbjct: 335 ----LMMRDVLFLVKGINTLGE-GEVLVYTREFSKEIP-------EKDDRIYFVVKDSKV 382 Query: 203 N 203 + Sbjct: 383 S 383 >UniRef50_C5C0E0 Metallophosphoesterase n=1 Tax=Beutenbergia cavernae DSM 12333 RepID=C5C0E0_BEUC1 Length = 1327 Score = 84.3 bits (207), Expect = 3e-15, Method: Composition-based stats. Identities = 21/136 (15%), Positives = 45/136 (33%), Gaps = 10/136 (7%) Query: 108 GQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGP--MLMEN 165 G+ ++ + + + + D V + + ++ A+ P L+E+ Sbjct: 248 GEGRLPDGVRALVARPGAAADALATLVADDHVDVAYGLREDAGDVAVALGGAPEDWLLED 307 Query: 166 GVINPRIHPNVASSKIRNGVGINKHGNA-VFLLSQQA------TNFYDFACYAKAKLNVE 218 G I V R VG ++ G F++ + + A+L + Sbjct: 308 GEITSATGGYVDVRHPRTAVGFDETGTTAYFVVVDGRQSHSIGMTLPELGRFL-AQLGAD 366 Query: 219 QLLYLDGTISHMYMKG 234 + LDG S + Sbjct: 367 DAINLDGGGSSEMVAR 382 Score = 45.0 bits (105), Expect = 0.002, Method: Composition-based stats. Identities = 22/193 (11%), Positives = 49/193 (25%), Gaps = 20/193 (10%) Query: 24 TLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQG 83 + P + + D L+ + + T+ +++ + + A A + ++ Sbjct: 69 VIAPGLELTSFARIGEDGWLSGEVLVAHLGTDALEVGYVAPDDVAGN---ATVTEMAEAS 125 Query: 84 QVQMAMNGGIYDES--YAPLGLYIEN--GQQKVALNLASGEGNFFIRPGGVF--YVAGDK 137 A+NG +D + APLG+ ++ G K A F G Sbjct: 126 GAVAAVNGDFFDINNSGAPLGVAVDEETGLLKSAAPGRERAVAFDSAGVGRLAELFLEGS 185 Query: 138 VGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLL 197 + + G + + R V + + Sbjct: 186 ATFADHEIAIAGLNTTEVPEGG----------VAVFDSAWGDHTRTRVLADGEQGVE-VT 234 Query: 198 SQQATNFYDFACY 210 Sbjct: 235 VDGEGTVLAVGAV 247 >UniRef50_A1SN25 Exopolysaccharide biosynthesis protein-like n=1 Tax=Nocardioides sp. JS614 RepID=A1SN25_NOCSJ Length = 420 Score = 83.9 bits (206), Expect = 4e-15, Method: Composition-based stats. Identities = 24/171 (14%), Positives = 45/171 (26%), Gaps = 26/171 (15%) Query: 103 LYIENGQQKVA-----LNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQ 157 + + NG+ + + F R G + + Q A+ Sbjct: 249 VTVVNGRVRTNRAKLSHDQPIKGLLFIGRGEGAKVLRKLPKHTRIKVRWSLQGRPQMAIS 308 Query: 158 SGPMLMENGVINPRIHPNVASSKIRNGVGINKH-GNAVFLLSQQAT------NFYDFACY 210 L+ +G+I + R VG++ G + L+ + A Sbjct: 309 GNNFLVHDGIIRAI---DDREMHPRTAVGVDSDTGEVLLLVVDGRQADSRGYTMVELANL 365 Query: 211 AKAKLNVEQLLYLDGTISHMYMKGGAI------PWQRYPF----VTMISVE 251 L ++ + LDG S + F I V Sbjct: 366 MVD-LGADEAVNLDGGGSSTMVGKNRRGKVAVLNDPSDGFQRWVANAIEVT 415 Score = 47.3 bits (111), Expect = 4e-04, Method: Composition-based stats. Identities = 20/111 (18%), Positives = 37/111 (33%), Gaps = 5/111 (4%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQ 84 + P + + T++P+T +++ + A + DI + Sbjct: 86 VAPGVKFTRWSQTDARGPIVAHLLTIDPKTPGLRIDYASMG---AVRRVAPVRDILAVDN 142 Query: 85 VQMAMNGGIYDES--YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYV 133 +NG YD APLGL + + + FFI G + Sbjct: 143 AVAGVNGDFYDIGHTGAPLGLGKDRQRGLLHAREDGWNKAFFINRHGRAGI 193 >UniRef50_A5D3R0 Hypothetical membrane protein n=1 Tax=Pelotomaculum thermopropionicum SI RepID=A5D3R0_PELTS Length = 485 Score = 83.9 bits (206), Expect = 5e-15, Method: Composition-based stats. Identities = 33/178 (18%), Positives = 51/178 (28%), Gaps = 26/178 (14%) Query: 94 YDESYAPLG---LYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLD------ 144 Y P G + + NG G I G G+ Sbjct: 315 YKYDTTPPGRTAVVVRNGIV-----TGIRSGQVEIPEDGYVIWYGENNYERDDQFSAGRQ 369 Query: 145 -------AFKTSKEIQFAVQSGPMLMENGVINPRI--HPNVASSKIRNGVGINKHGNAVF 195 + + + P+L+ NG I P + R+ VG+ V Sbjct: 370 VDYRVTFKENQQARFKATISNYPLLLSNGAIALGDITEPKLTIGAPRSFVGVTWDNILVM 429 Query: 196 LLSQQATNFYDFACYAKAKLNVEQLLYLDGTIS-HMYMKGGAIPWQRYPFVTMISVER 252 A N ++ A K L ++ L LDG S +Y G I + V + Sbjct: 430 GTVDSA-NVWELAEVTKN-LGLKDALNLDGGASCGLYYDGAYIRQPGRLLSNCLVVIQ 485 Score = 45.4 bits (106), Expect = 0.002, Method: Composition-based stats. Identities = 22/165 (13%), Positives = 40/165 (24%), Gaps = 10/165 (6%) Query: 29 FAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMA 88 + S T VN V++ A LA + ++ A Sbjct: 171 TRYEQREVETSAGIKTANLVYVNMNNPAVELRPVMAEDRV--GRVEELASMAARTGAVAA 228 Query: 89 MNGGIYDE----SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLD 144 +NG ++ P G G +L S V+ D Sbjct: 229 INGTFFNAYDANDLMPHGTL---GANYSYYHLGSNATLGVDDRNRVYIGQLTPYVEGGTD 285 Query: 145 AFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKI-RNGVGIN 188 +A +G++ ++ R V + Sbjct: 286 GSWEWPNWWYAWGINHYYSPDGIVVFTPEYKYDTTPPGRTAVVVR 330 >UniRef50_C0Z816 Putative uncharacterized protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0Z816_BREBN Length = 1054 Score = 83.9 bits (206), Expect = 5e-15, Method: Composition-based stats. Identities = 22/142 (15%), Positives = 42/142 (29%), Gaps = 14/142 (9%) Query: 98 YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQ 157 P NG A G V + ++ AV Sbjct: 234 NQPGVYIPYNGYVLWGHGAAGAFLKQNFPVGATAAVEYQT--------TPQTLNLKQAVG 285 Query: 158 SGPMLMENGVINPRIHPN--VASSKIRNGVGINKHGNAVFLLS---QQATNFYDFACYAK 212 +L++ G + + S R VG+++ G +++++ Q + A Sbjct: 286 GNVILVDQGKALTSFQADKSITSKTARTSVGVSQDGKTLYMVTIDASQGVYLDELAKIM- 344 Query: 213 AKLNVEQLLYLDGTISHMYMKG 234 A+L + + DG S Sbjct: 345 AELGSYRAVNFDGGGSTTMATR 366 Score = 67.0 bits (162), Expect = 5e-10, Method: Composition-based stats. Identities = 13/138 (9%), Positives = 41/138 (29%), Gaps = 7/138 (5%) Query: 23 LTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQ 82 + + + ++ +T+ V+ V++ + + + Sbjct: 46 TPIGEGTTLQKYTKSFANQVVTIMVTKVDLNNPYVEVKPVYGTKGKLTDK-QTVTQMARE 104 Query: 83 GQVQMAMNGGIYDES--YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGI 140 A+N + + AP G+ +++ + ++ L S + + V G Sbjct: 105 TGAIAAINADFFHMTKRGAPFGIVMKDDELISSMGLVSYWYALGLTGDKMAIVDKFGFGG 164 Query: 141 VRLDAFKTSKEIQFAVQS 158 +++Q Sbjct: 165 KVT----APNGATYSIQG 178 >UniRef50_Q7NIQ9 Gll2123 protein n=1 Tax=Gloeobacter violaceus RepID=Q7NIQ9_GLOVI Length = 518 Score = 83.6 bits (205), Expect = 6e-15, Method: Composition-based stats. Identities = 31/218 (14%), Positives = 62/218 (28%), Gaps = 24/218 (11%) Query: 53 QTERVKMYWQKANGEAWG---TLHALLADINSQGQVQMAMNGGIYD------ESYAPLGL 103 RV++ W + D + + +NGG + + +G Sbjct: 288 THRRVRLVWLSGGNYTTRHSEGNRYPVGDFIERERAVGGINGGFFAFAGLRATNSDMVGP 347 Query: 104 YIENGQQKVALNLAS------GEGNFFIRPGGVFYVAGDKVG-IVRLDAFKTSKEIQFAV 156 Y+ + + G I G+ +V +A ++ Sbjct: 348 YLSQNEGRFMPGAPEFDKSLRGRPVVLISATGLRFVPYSPETFDTEAEARAYLSDLSDLF 407 Query: 157 QSGPMLMENGVINPRIH------PNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACY 210 +G L+ NG N + + R +GI+K G + + N A Sbjct: 408 VAGVWLVNNGQALTTEQIEQFRLSNHSEFRRRTFMGIDKAGLPMVGATLTNVNATQLARA 467 Query: 211 AKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMI 248 + + + + LD S + G + I Sbjct: 468 L-EEAGLREAVLLDSGFSTSLVHQGKVLVTG-HTAPSI 503 >UniRef50_D2PZR6 Sporulation domain protein n=4 Tax=Actinomycetales RepID=D2PZR6_9ACTO Length = 537 Score = 83.6 bits (205), Expect = 6e-15, Method: Composition-based stats. Identities = 27/177 (15%), Positives = 47/177 (26%), Gaps = 29/177 (16%) Query: 102 GLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPM 161 G E G+ A + G + V V GP Sbjct: 364 GAIPEGGRTVQATGSQVAALLAAAQVGRRLEIRA--VLTDARGRDVRLSPRTSVVNGGPE 421 Query: 162 LMENGVINPRIHPNV--------------ASSKIRNGVGINKHGNAVFLLSQQA------ 201 L+ +G + + R G++ G V + + Sbjct: 422 LVRDGRLMATPKADGMAPAGNPNFYYGWVHKRNPRTIAGVDAQGRTVLITADGRNVSSLG 481 Query: 202 TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQR------YPFVTMISVER 252 + A AK+ L + + + LDG S + G + Q P + + R Sbjct: 482 LGIAEAAAVAKS-LGLREAVNLDGGGSTTMVANGKVVNQPSDAAGERPVGDALVITR 537 Score = 58.1 bits (139), Expect = 2e-07, Method: Composition-based stats. Identities = 20/146 (13%), Positives = 39/146 (26%), Gaps = 14/146 (9%) Query: 27 PLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQ 86 + DD + S VQ T++P+ R + A+ + + + Sbjct: 164 TGWDGEPDDLSGSTGPWQVQVLTIDPKKFRGTL---DASYGLDLEARETTSTLATLTGAT 220 Query: 87 MAMNGGIYDES------YAPLGLYIENGQQKVALNLASGEGNFFIRPGGV-FYVAGDKVG 139 A+N G + P G+ + +G+ G + Sbjct: 221 AAVNAGFFVLDPKAGAPGDPAGVAVYDGRLVSEPTAGRPALVVGENARGTSVERFRWRGE 280 Query: 140 IVRLDAFKTSKEIQFAVQSGPMLMEN 165 I + P L+ N Sbjct: 281 IRGTGRPLPLDGLNRV----PGLIRN 302 >UniRef50_Q67T45 Putative uncharacterized protein n=1 Tax=Symbiobacterium thermophilum RepID=Q67T45_SYMTH Length = 921 Score = 83.2 bits (204), Expect = 7e-15, Method: Composition-based stats. Identities = 26/162 (16%), Positives = 50/162 (30%), Gaps = 20/162 (12%) Query: 108 GQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGV 167 G+ A G F K G +++ S + +A+ L+ +G Sbjct: 217 GRVTNVPVPADGFILVGTNEAARFL-DPLKPGDPVTVSYRPSPAVAWAIGGQNYLVRDGA 275 Query: 168 INPRIHPNVASSKIRNGVGINKHGN-AVFLLSQQ------ATNFYDFACYAKAKLNVEQL 220 + + + AS + R+ VG + G L+ + + A + K+ Sbjct: 276 VVSGL--DNASRRPRSAVGFSADGRRMYLLVIEGDSSRSVGATLAEMAAFMKS-FGAANA 332 Query: 221 LYLDGTISHMYMKGG-----AIPWQR----YPFVTMISVERK 253 L LDG S + + P I + + Sbjct: 333 LELDGGGSSTIVARRPGEPLTVLNLPASAQRPVPNGIGLFAQ 374 Score = 49.7 bits (117), Expect = 8e-05, Method: Composition-based stats. Identities = 13/114 (11%), Positives = 29/114 (25%), Gaps = 5/114 (4%) Query: 16 LKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHAL 75 + L + + T + + A G Sbjct: 35 VTARHEVTPLAQGLTLTNFQRLYPYGWVNGWLLTADLSQPTLTSDVITAPGLT---EREP 91 Query: 76 LADINSQGQVQMAMNGGIYDESYA--PLGLYIENGQQKVALNLASGEGNFFIRP 127 + D+ ++ A+NG + + LG ++ GQ + + G Sbjct: 92 VLDMAAREGAVAAINGDFFALGGSGIALGTVVKRGQYLQSPQPSWPNGAVVGTD 145 >UniRef50_B2A2E0 Copper amine oxidase domain protein n=1 Tax=Natranaerobius thermophilus JW/NM-WN-LF RepID=B2A2E0_NATTJ Length = 514 Score = 82.8 bits (203), Expect = 9e-15, Method: Composition-based stats. Identities = 27/154 (17%), Positives = 46/154 (29%), Gaps = 10/154 (6%) Query: 104 YIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLM 163 ENG + G V + ++ V +GP L+ Sbjct: 207 IPENGFVVELGTQQAVSQLPENYSPGDKAKLKPSVSDEDREEPIEIEDFIHMVGAGPKLV 266 Query: 164 ENGVINPRIHPN------VASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNV 217 NG + + + R+ +G N + A N D A +L + Sbjct: 267 NNGREDVDLEKDQMTGERHTIKARRSFIGYN-DNEVIMGTVDGA-NHEDTAAICV-ELGL 323 Query: 218 EQLLYLDGTISH-MYMKGGAIPWQRYPFVTMISV 250 + + LDG S +Y +G I + V Sbjct: 324 TEAMALDGGASSGLYYEGDYITRPGREISNALVV 357 >UniRef50_A9V0B9 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9V0B9_MONBE Length = 623 Score = 82.8 bits (203), Expect = 1e-14, Method: Composition-based stats. Identities = 30/251 (11%), Positives = 56/251 (22%), Gaps = 77/251 (30%) Query: 50 VNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE-SYAPLGLYIENG 108 ++P+ G G ++++ ++ A N G ++ G + +G Sbjct: 101 LDPRRTLSVYEPGAPGGCGDGDHRTIVSETAARHDCIYATNAGFFNTHDGTCYGDIVSDG 160 Query: 109 QQKVAL---------------------------NLASGEGNFFIRPGGVFYVAGDKVGIV 141 + A AS G V ++ Sbjct: 161 RLVQADNHTNVQFGVRHDNTIQVSVALFSNPKSQTASSRGLPASPIDRCMQVGYFQLDGN 220 Query: 142 RLDAFKTSKEIQFA--------------VQ--------------SGPMLMENGVINPRIH 173 A I A + L+ NG + Sbjct: 221 ETRAEDFQFLITGACHQKLAHEASESSMIGSCCFPHDTWAHKHIGAIWLVRNGQVYVNES 280 Query: 174 ---------------PNVASSKIRNGVGINKHGNAVFLLSQQ-----ATNFYDFACYAKA 213 R + + +G + N Y+FA Y K Sbjct: 281 IAYECSNIEESGSLQEFANLQSARTALAHDSNGAVRIVQHNGQSGHYGINLYEFAKYLKQ 340 Query: 214 KLNVEQLLYLD 224 + V + LD Sbjct: 341 Q-GVVNAINLD 350 >UniRef50_A6TUG6 Copper amine oxidase domain protein n=1 Tax=Alkaliphilus metalliredigens QYMF RepID=A6TUG6_ALKMQ Length = 491 Score = 82.4 bits (202), Expect = 1e-14, Method: Composition-based stats. Identities = 25/113 (22%), Positives = 44/113 (38%), Gaps = 10/113 (8%) Query: 149 SKEIQFAVQSGPMLMENGVINPR-------IHPNVASSKIRNGVGINKHGNAVFLLSQQA 201 KE+ A+ +GP L++NGVI + + R+ +G+ K V Sbjct: 257 WKEVTSAIGAGPTLIKNGVITANGLSEGFFEDEILTNRGQRSFIGVTKENKLVMGTVPS- 315 Query: 202 TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPW-QRYPFVTMISVERK 253 + + A AK +L + Q + LDG S + + I + +K Sbjct: 316 VSVKELAEIAK-ELGLYQAINLDGGASSGLIYKDRMVHAPGRLLSNAIVITKK 367 >UniRef50_A9BJK8 Putative uncharacterized protein n=1 Tax=Petrotoga mobilis SJ95 RepID=A9BJK8_PETMO Length = 561 Score = 82.4 bits (202), Expect = 1e-14, Method: Composition-based stats. Identities = 35/248 (14%), Positives = 67/248 (27%), Gaps = 30/248 (12%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQ-TERVKMYWQKANGEAWGTLHALLADINSQG 83 LL L + + + L + + Q ++W K+ W L Sbjct: 313 LLHLSSYSRPALIIGTNFLDIDYIKLEYQLNIDNLLFWIKSINSTWKGDVKLYTHHYKGN 372 Query: 84 QVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVF------YVAGDK 137 + N + EN + EG I + G K Sbjct: 373 ITETEENYVFFLID--------ENNRIISKNKTTPSEGEKLILVDKKYEKYLENISLGTK 424 Query: 138 VGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPN--------VASSKIRNGVGINK 189 V + + + ++ GP+L+ + ++ + R V I+K Sbjct: 425 VDFTLNKSENLTNDPTLLLEGGPILIHSKYTQEQLDAEKKSYSNGIIYGKAPRTVVAIDK 484 Query: 190 HGNAVFLLSQQ------ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPW-QRY 242 N ++ + + + E + LDG S + G I + Sbjct: 485 EQNINLMVIEGLDNPETGLTYDETRNLLFKIGEFEVAMMLDGGSSSIVYYEGEIQNFKNE 544 Query: 243 PFVTMISV 250 I V Sbjct: 545 KTRNYIPV 552 Score = 55.0 bits (131), Expect = 2e-06, Method: Composition-based stats. Identities = 11/106 (10%), Positives = 30/106 (28%), Gaps = 7/106 (6%) Query: 22 ALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINS 81 + + L+ L + V+P+ ++K+ + + Sbjct: 230 STSENEGLQYIEKTEELNGQRLKIYQLIVDPKIYQIKV------DLNNLGTRSDVYSFLK 283 Query: 82 QGQVQMAMNGGIYDESY-APLGLYIENGQQKVALNLASGEGNFFIR 126 + ++N +D P+G I +G + + Sbjct: 284 EKNPIFSVNASFFDPQTLEPVGNIISDGALLHLSSYSRPALIIGTN 329 >UniRef50_C7QHR1 Putative uncharacterized protein n=1 Tax=Catenulispora acidiphila DSM 44928 RepID=C7QHR1_CATAD Length = 636 Score = 81.6 bits (200), Expect = 2e-14, Method: Composition-based stats. Identities = 29/227 (12%), Positives = 63/227 (27%), Gaps = 23/227 (10%) Query: 8 GKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGE 67 G+G + + + + ++ + ++ Sbjct: 374 GEGTWSPAVVVNGVPVIQTAKLRSDPQHLE-----YLSAVAWMDQKHASFVLHPGSQQPG 428 Query: 68 AWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRP 127 G + + NG G + NG+ L F+ Sbjct: 429 TAGYNQTDHLSGDQFKNLIATWNGAFLLNPNDAHGGFYLNGKTYGTLVPGQASEVFY--K 486 Query: 128 GGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASS-------- 179 G V G + + + Q+ +L++NG +NP + + Sbjct: 487 DGTMNVGSWNSG----PGLQMAPNVVGVRQNLQLLVDNGQVNPSVDSDDKKLWGVTVKNA 542 Query: 180 --KIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLD 224 R+G+G+ GN V+ + A + A + + + LD Sbjct: 543 YFVWRSGIGVTADGNLVYAM-GPALSVRTLAELL-QRAGAVRGMELD 587 >UniRef50_D1Y6Q3 Putative liporotein n=1 Tax=Pyramidobacter piscolens W5455 RepID=D1Y6Q3_9BACT Length = 572 Score = 81.2 bits (199), Expect = 3e-14, Method: Composition-based stats. Identities = 24/146 (16%), Positives = 48/146 (32%), Gaps = 15/146 (10%) Query: 122 NFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIH---PNVAS 178 R GD+V + ++ AVQ+GP+L G + Sbjct: 428 LIVSRDPKFTLQKGDRVTLETQWRETPPIDVASAVQAGPLLYAPGHQFWDEMLSLSILTL 487 Query: 179 SKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKA------KLNVEQLLYLDGTISHMYM 232 R +G + V++++ ++++ + +L + LL LDG S Sbjct: 488 RHPRTLLGWDGK-RMVWIVADGRSSWHSRGLFLNEAEQLGRRLGLTALLNLDGGGSSEMW 546 Query: 233 KGGAIPW-----QRYPFVTMISVERK 253 G + + + V +K Sbjct: 547 WDGHVVNAVSDGRERRMPYGLMVLKK 572 >UniRef50_A4XGY7 Putative uncharacterized protein n=2 Tax=Clostridia RepID=A4XGY7_CALS8 Length = 877 Score = 81.2 bits (199), Expect = 3e-14, Method: Composition-based stats. Identities = 19/113 (16%), Positives = 43/113 (38%), Gaps = 9/113 (7%) Query: 127 PGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVG 186 +++D ++I+ A L+++G I P +A R+ +G Sbjct: 243 KNLKANFKVGDKVSLKIDLSIPLEKIKAAASGNTFLLKDGKI-PSFTHEIAGRHPRSAIG 301 Query: 187 INKHGNAV-FLLSQQA------TNFYDFACYAKAKLNVEQLLYLDGTISHMYM 232 I+K G + + + + A + ++ ++V + LDG S + Sbjct: 302 IDKTGRYLYLVAVDGRNGKSIGLSQGELASFLQS-IDVWTAINLDGGYSTQLI 353 Score = 44.6 bits (104), Expect = 0.003, Method: Composition-based stats. Identities = 9/101 (8%), Positives = 26/101 (25%), Gaps = 7/101 (6%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQ 84 + P + + + + ++ K+ G + Sbjct: 37 IAPRTYYEKYELLTDEGFVDINCIKLDLIDGGFDFDVLKSKVANTGDFVYNMVSSQIDKN 96 Query: 85 VQMAMNGGIYDESYA-------PLGLYIENGQQKVALNLAS 118 A+N + + P+G+ + G+ + N Sbjct: 97 PIAAINANFFYTNTKTDYNKIWPIGISVSGGKILSSPNNKQ 137 >UniRef50_UPI0001C31921 Collagen triple helix repeat protein n=2 Tax=Conexibacter woesei DSM 14684 RepID=UPI0001C31921 Length = 1426 Score = 81.2 bits (199), Expect = 3e-14, Method: Composition-based stats. Identities = 22/195 (11%), Positives = 51/195 (26%), Gaps = 20/195 (10%) Query: 51 NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQ 110 + + ++ +A + + +G Sbjct: 168 DLPVAALNAAGAVPADGYVAFTPKWGRSSRARSLAGVANVAEALVTDGRVV--AVSDG-- 223 Query: 111 KVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAF----KTSKEIQFAVQSGPMLMENG 166 A + +G R + + G A+ ++++QFA+ +L+ +G Sbjct: 224 VGAGEIPAGSFYLVGRESAADAIRALRAGDEVRLAYGLSGDVAQQLQFAIGGNEVLVRDG 283 Query: 167 VINPRIHPNVASSKIRNGVGINKHGN-AVFLLSQQA------TNFYDFACYAKAKLNVEQ 219 + S R +G G + ++ A + E Sbjct: 284 QVVGSDQ----SVHPRTAIGFKDGGRTLLLFVADGRQTQVLGMTTQKVAQLLRDA-GAET 338 Query: 220 LLYLDGTISHMYMKG 234 + LDG S + Sbjct: 339 AMNLDGGGSTTLVAR 353 Score = 44.3 bits (103), Expect = 0.004, Method: Composition-based stats. Identities = 21/181 (11%), Positives = 48/181 (26%), Gaps = 12/181 (6%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQ 84 + P ++ + Q T + + T ++ + Sbjct: 46 IGPGVSMRHLKTLEAGGWFDYQLLT---ARLQGGVVTSDLLSGDSVTEAGPISKKADRAG 102 Query: 85 VQMAMNGGIYDESYA--PLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVR 142 +NG +D + + P L + G+ + N + G+ + + Sbjct: 103 AVAGVNGDFFDINNSNAPQNLAVRGGELLKSANFGLTAPATGVTRDGIGQLLSTTLDAKA 162 Query: 143 L-DAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQA 201 A + P +G SS+ R+ G+ A L++ Sbjct: 163 TFGGADLPVAALNAAGAVP---ADG-YVAFTPKWGRSSRARSLAGV--ANVAEALVTDGR 216 Query: 202 T 202 Sbjct: 217 V 217 >UniRef50_B8FVQ0 Ig-like, group 2 n=2 Tax=Desulfitobacterium hafniense RepID=B8FVQ0_DESHD Length = 913 Score = 80.9 bits (198), Expect = 4e-14, Method: Composition-based stats. Identities = 19/176 (10%), Positives = 40/176 (22%), Gaps = 30/176 (17%) Query: 98 YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQ 157 P E G V+ + + PG +++ + Sbjct: 225 GQPAAEIPEEGFVVVSRGEQAAKLLEQAAPGEQLQFQVTS--------TPDWNDLKMSTT 276 Query: 158 SGPMLMENGVINPRIH---PNVASSKIRNGVGINKHG-NAVFLLSQQAT------NFYDF 207 +L+++G I + R G + G + + + Sbjct: 277 GTSLLIQDGEIPATFSYSTASFNQRNPRTMAGSTEDGSELILVTVDGRQDNSIGLTQQES 336 Query: 208 ACYAKAKLNVEQLLYLDGTISHMYMKG-------GAIPWQR----YPFVTMISVER 252 A +L Q + DG S + + I + Sbjct: 337 AELML-ELGAYQAIMFDGGASTTMAARQPGAFSPDVVNLPSEGILRNVASGIGIFS 391 Score = 51.2 bits (121), Expect = 3e-05, Method: Composition-based stats. Identities = 13/152 (8%), Positives = 36/152 (23%), Gaps = 21/152 (13%) Query: 24 TLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQG 83 T+ + + L + V+ VK+ + ++ ++ + + Sbjct: 40 TVTDGVTLENISRFTTGGWLNINVLRVDMTNPYVKIDTL--SNDSITDDLVSISALAEKE 97 Query: 84 QVQMAMNGGIYDES----YAPLGLYIENGQQKVALNLASGEGNF---------------F 124 A+N ++ G + G + N + Sbjct: 98 GAVAAVNSSFFNPFTAGKGYADGPTVRAGDLLSTSAWYNRSKNEMASLSVDYANQLLFHY 157 Query: 125 IRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAV 156 + D V + ++ Q Sbjct: 158 WKNDLTLITGNDTAFAVTQYNQPSRQDYQDLT 189 >UniRef50_C6IV65 Putative uncharacterized protein n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6IV65_9BACL Length = 257 Score = 80.1 bits (196), Expect = 6e-14, Method: Composition-based stats. Identities = 41/265 (15%), Positives = 82/265 (30%), Gaps = 42/265 (15%) Query: 10 GMITLNLKRIFLALTLLPLF-------AVAADDCALSDPTLTVQAYTVNPQTERVKMYWQ 62 G+ ++ IFL + L+ + + + + A V+P+ ++ Sbjct: 10 GIAMASVMAIFLVILLMIGWGSRYLLPRHYEYHETTAANGVKLHALVVDPERIELR---- 65 Query: 63 KANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGN 122 A + G MNGG + + A L L + N Q A G G Sbjct: 66 -AADQPLGRYR------------VYGMNGGFFY-NEAVLSLAVNNDQPVQGTAGAYGSGW 111 Query: 123 FFIR-PGGVFYVAGDK-----VGIVRLDAFKTSKEIQFAVQSGPML-MENGV------IN 169 F + G G + + ++ Q G + + + + Sbjct: 112 FNAKYARGTLVWDGATGSFSVQVVSAASELAVTDRTRYFAQGGVSMKLPDDAGWRAAAVE 171 Query: 170 PRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKL---NVEQLLYLDGT 226 PN +++R+G+ + +G +++ F K+ L + ++LDG Sbjct: 172 AEHLPNPDENRLRSGLAYDANGQLWLIVTPTRCTAEAFRTAVKSALADGGLVDGIFLDGD 231 Query: 227 ISHMYMKGGAIPW-QRYPFVTMISV 250 S MI + Sbjct: 232 GSAQLNAAETKLNGDSRDLRQMIVI 256 >UniRef50_C6WLB3 Metallophosphoesterase n=1 Tax=Actinosynnema mirum DSM 43827 RepID=C6WLB3_ACTMD Length = 1118 Score = 80.1 bits (196), Expect = 6e-14, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 48/187 (25%), Gaps = 27/187 (14%) Query: 57 VKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNL 116 W + + ++ + V + E+G L Sbjct: 212 FTPLWGEYTRSRSVRDAERVREVVVRNDVVTEV------RDTIGADRVPEDGYVL--LGR 263 Query: 117 ASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNV 176 +G + PG V A + A+ +L+ + + P Sbjct: 264 EAGATALAVAPGDHLSVRY------STRAGDGGSAPRAAIGGNQVLLRDSEVVAPDDP-- 315 Query: 177 ASSKIRNGVGINKHGNAVFL-LSQQAT-------NFYDFACYAKAKLNVEQLLYLDGTIS 228 S R VG + G +FL N D A + L L LDG S Sbjct: 316 --SHPRTAVGFSADGRRMFLLTVDGRQSAHLLGLNLKDVAEALRD-LGAHNALNLDGGGS 372 Query: 229 HMYMKGG 235 + Sbjct: 373 STLVARE 379 Score = 47.0 bits (110), Expect = 6e-04, Method: Composition-based stats. Identities = 17/100 (17%), Positives = 37/100 (37%), Gaps = 6/100 (6%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQ 84 + P + + D +D + A TV+ K+ + T A L++ + + Sbjct: 73 VAPGVELTSFDWYRADGWVRGDALTVDLAG-GTKVDYLDPGS---VTAAAPLSEQADRTR 128 Query: 85 VQMAMNGGIYDESYA--PLGLYIENGQQKVALNLASGEGN 122 A+NG +D + + P G + +G + + Sbjct: 129 AVAAVNGDFFDINNSDAPEGAAVRDGVPVKSASPGRERAV 168 >UniRef50_A4FAG7 Secreted protein n=5 Tax=Actinomycetales RepID=A4FAG7_SACEN Length = 434 Score = 79.7 bits (195), Expect = 9e-14, Method: Composition-based stats. Identities = 27/180 (15%), Positives = 50/180 (27%), Gaps = 32/180 (17%) Query: 103 LYIENGQQ--KVALNLA----SGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSK----EI 152 + + +G+ A +G+ R GV + K G ++ + E Sbjct: 257 IVVRDGKVAEVRPEPGAGAIAAGDFVLVGREDGVGELDDLKPGDPVSVDYQLAPVGVPEF 316 Query: 153 QFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHG-NAVFLLSQQA------TNFY 205 +F V P+L +G P + + + R G + G + Sbjct: 317 RFVVGGFPIL-RDGTALPGL--DDQALAPRTSAGASADGKRVYLVAMDGRSQVSAGLTVS 373 Query: 206 DFACYAKAKLNVEQLLYLDGTISHMYMKGG-------AIPWQR----YPFVTMISVERKG 254 + A K + + LDG S + P I + G Sbjct: 374 ELADLLKRS-GADDAVNLDGGGSTTLVAREAGAQHVTVRNNPSDGRERPVANGIGIFSGG 432 Score = 40.8 bits (94), Expect = 0.044, Method: Composition-based stats. Identities = 15/184 (8%), Positives = 41/184 (22%), Gaps = 20/184 (10%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQ 84 + P + + + V + T ++ + Sbjct: 66 VAPGVTFRSFTWETGHGSTQGYVLEADLANPLVDLGLLH---RKTVTEPTAISALADGQG 122 Query: 85 VQMAMNGGIYD---------ESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAG 135 +NG ++ + + +G I G+ + + G Sbjct: 123 AVAGVNGDFFNNTDEHEGVEPTGSAVGAEIAAGEARKGAVPDGQRFGPGLPEGTSTEDVF 182 Query: 136 DKVGIVRLDAFKTSKEIQFAVQSGPMLMEN--------GVINPRIHPNVASSKIRNGVGI 187 E + G + ++ G + +S++R G Sbjct: 183 GLGSDRTARVASVRLEGKIETGHGVLGVDGLNQYAIPVGGVGVFTEAWGTTSRVRATCGT 242 Query: 188 NKHG 191 + Sbjct: 243 DTDR 246 >UniRef50_C1TLP7 Sporulation related-protein with S-layer-like domain n=1 Tax=Dethiosulfovibrio peptidovorans DSM 11002 RepID=C1TLP7_9BACT Length = 565 Score = 79.3 bits (194), Expect = 1e-13, Method: Composition-based stats. Identities = 17/148 (11%), Positives = 48/148 (32%), Gaps = 19/148 (12%) Query: 113 ALNLASGEGNFFIRPGGVFYVAGDKVG---------IVRLDAFKTSKEIQFAVQSGPMLM 163 ++G +F + Y+ +G + + + + E + +Q+GPM++ Sbjct: 405 KARRSAGSNHFMKPEENILYLRTSGLGPFSDAAEIKLQTIWSDEAMIEAKQVIQAGPMIL 464 Query: 164 -ENGVINPRIHPNV--ASSKIRNGVGINKHGNAVFLLSQQATNFYD------FACYAKAK 214 G + + R G + +++ ++++ A + + Sbjct: 465 GLEGPFSSEWFSDSIINKRHPRTLAGWDGD-RLCWIVIDGRSSWHSDGATLSEAAFIARQ 523 Query: 215 LNVEQLLYLDGTISHMYMKGGAIPWQRY 242 + + + +DG S G Sbjct: 524 AGLVKAINMDGGGSSQLWWKGITVNLPS 551 >UniRef50_C7PW43 Ig domain protein group 2 domain protein n=2 Tax=Catenulispora acidiphila DSM 44928 RepID=C7PW43_CATAD Length = 1174 Score = 79.3 bits (194), Expect = 1e-13, Method: Composition-based stats. Identities = 26/201 (12%), Positives = 62/201 (30%), Gaps = 14/201 (6%) Query: 18 RIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLA 77 + ++TL +D Q V+ V++ +++ + + Sbjct: 69 AVSPSVTLTHGVQQHSDVLRTVGGAQRAQVMDVDLADPNVRLGVVESHDHLTDAADEVPS 128 Query: 78 DINSQGQVQMAMNGGIY--DESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAG 135 + + +NG + S PLG+ + +G+ + + + G + Sbjct: 129 SMAHRTGAVGGVNGDFFEIYGSGRPLGMVVIDGRLVKSPDPTWNADLWVRHDGSIGIGTE 188 Query: 136 DKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKI---RNGVG--INKH 190 G + A + AV S +G R+ P++ + V + Sbjct: 189 TYAGSLTDGAATAAITAVNAVNS-----LSGNAIVRVTPDLGTPSPIAASTVVAGHLGAD 243 Query: 191 GNAVFL--LSQQATNFYDFAC 209 G + + ++ T A Sbjct: 244 GTTLLVDSVTAGVTTLPQLAA 264 Score = 62.4 bits (150), Expect = 1e-08, Method: Composition-based stats. Identities = 26/244 (10%), Positives = 54/244 (22%), Gaps = 20/244 (8%) Query: 5 LLIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKA 64 +G +I L + L+ + T + A Sbjct: 152 RPLGMVVIDGRLVKSPDPTWNADLWVRHDGSIGIGTETYAGSLTD---GAATAAITAVNA 208 Query: 65 NGEAWGTLHALLADINSQGQVQMA--MNGGIYDESYAPL-GLYIENGQQKVA--LNLASG 119 G + A + G L + G + Sbjct: 209 VNSLSGNAIVRVTPDLGTPSPIAASTVVAGHLGADGTTLLVDSVTAGVTTLPQLAAGTED 268 Query: 120 EGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGV--INPRIHPNVA 177 + G + I + ++ + G +L++NG + + Sbjct: 269 LVGSGTQAGWLSQNIHMGDQIAVSEKIGPDPDVVQGLSGGAILVQNGQRAVPLQGSGENN 328 Query: 178 SSKIRNGVGINKHG-NAVFLLSQQAT--------NFYDFACYAKAKLNVEQLLYLDGTIS 228 VG+++ G +AVF A + + D S Sbjct: 329 VDNPVTAVGVSQDGKHAVFAAFDGHQSEDVAQGLTRPQIAGWMTQH-GAYNAILFDSGGS 387 Query: 229 HMYM 232 + Sbjct: 388 TQMV 391 >UniRef50_D1VTW3 Copper amine oxidase N-domain superfamily n=1 Tax=Peptoniphilus lacrimalis 315-B RepID=D1VTW3_9FIRM Length = 765 Score = 78.9 bits (193), Expect = 2e-13, Method: Composition-based stats. Identities = 18/131 (13%), Positives = 38/131 (29%), Gaps = 22/131 (16%) Query: 143 LDAFKTSKEIQFAVQSGPMLMENGVINPRIHP--NVASSKIRNGVGINKHGN-AVFLLSQ 199 K + + +L++N I P ++ ++ R +GI G + + Sbjct: 252 TYDIYPQKSWKMLIGGHSLLVDNSKIRPYKKDINSIGGTRARTCIGIADGGKSVYIVSCE 311 Query: 200 QA------TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGG-------AIPWQ-----R 241 + + + + L ++ L LDG S + + Sbjct: 312 GRTKRSSGMSLNELSNFMVN-LGCQRALNLDGGGSTAMVVRNLGDLNRTRVINPEGNKAE 370 Query: 242 YPFVTMISVER 252 V I V Sbjct: 371 RKVVNGIGVFN 381 Score = 58.1 bits (139), Expect = 3e-07, Method: Composition-based stats. Identities = 19/131 (14%), Positives = 37/131 (28%), Gaps = 4/131 (3%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQ 84 + P S + + +K+ NG ++ + ++ Sbjct: 32 IAPGVVNFQYTVKTSKGKAILNVLKCDLNNPYIKLNTVAGNGSYTEKAS--VSQMANRTN 89 Query: 85 VQMAMNGGIYDE--SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVR 142 NG + PLG I +G K + + + +F I Y+ G Sbjct: 90 AVGLCNGDFFSMALQGVPLGPSIIDGDIKSSPGVLTDIYSFGIDSTNTAYILDTNFGGKV 149 Query: 143 LDAFKTSKEIQ 153 T+ I Sbjct: 150 TAPNGTTYNID 160 >UniRef50_A3TM75 Putative uncharacterized protein n=1 Tax=Janibacter sp. HTCC2649 RepID=A3TM75_9MICO Length = 1151 Score = 78.2 bits (191), Expect = 3e-13, Method: Composition-based stats. Identities = 24/181 (13%), Positives = 53/181 (29%), Gaps = 26/181 (14%) Query: 77 ADINSQGQVQMAMNGGIYDESYAPLGL-----YIENGQQKVALNLASGEGNF-------- 123 I + G G Y + G + G ++ +GEG Sbjct: 223 TTIPTNGVTVFTPLWGDYTRTRVTGGATKVREVVTRGGVVESVATVAGEGQLDKDQLAIV 282 Query: 124 --FIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKI 181 + + V ++ A+ ML+++ V+ P+ + Sbjct: 283 GRGAGADTLAALNVGDKVSVDYALRNEGANLKMAISGNTMLLKDNVVLPQTDKAI---HP 339 Query: 182 RNGVGINKHGNAVFL-LSQQAT------NFYDFACYAKAKLNVEQLLYLDGTISHMYMKG 234 R +G + G+ +F+ + + + K ++ + LDG S + Sbjct: 340 RTAIGFDADGSTMFVLTVDGRMAASRGMTYAETGAFLK-EVGATSGINLDGGGSSTMLAR 398 Query: 235 G 235 Sbjct: 399 E 399 >UniRef50_C9RD84 Copper amine oxidase domain protein n=1 Tax=Ammonifex degensii KC4 RepID=C9RD84_AMMDK Length = 465 Score = 77.8 bits (190), Expect = 3e-13, Method: Composition-based stats. Identities = 25/176 (14%), Positives = 56/176 (31%), Gaps = 32/176 (18%) Query: 102 GLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQF------- 154 + +ENG G G + G G + F ++++ Sbjct: 181 TVVVENGVV-----TRMGSGPCPVPDNGYVIGFGPAAAAKFANRFYPGAKVEWWVVFEAK 235 Query: 155 -----------AVQSGPMLMENGVINPRIHPNVASSKIR-------NGVGINKHGNAVFL 196 +Q GP+L+++G I H + + + + +G + G V Sbjct: 236 DGAPLQWSGRTVIQGGPLLLKDGAIVLDSHLDELYREPKFSRYGSWSFIGTDFEGCLVLG 295 Query: 197 LSQQATNFYDFACYAKAKLNVEQLLYLDGTIS-HMYMKGGAIPWQRYPFVTMISVE 251 + ++ A + + + LDG S ++ +G + ++V Sbjct: 296 SVLGVDSLWNMARVLQQA-GIRNAVCLDGNASCGLWYRGSYLVTPGRALSNCVAVT 350 Score = 53.9 bits (128), Expect = 6e-06, Method: Composition-based stats. Identities = 26/115 (22%), Positives = 43/115 (37%), Gaps = 8/115 (6%) Query: 44 TVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE--SYAPL 101 V V+ RV+++ A + LA + + A+NG ++ Sbjct: 41 KVHLIKVDISDPRVRVFPVLAQNKT--GRAESLASMACRVGAVAAVNGTFFNAYSDLTSW 98 Query: 102 GLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIV-RLDAFKTSKEIQFA 155 G I+ GQ L SG G + PGG+ VA + R+D + +A Sbjct: 99 GALIDAGQVYR---LGSGSGALSLGPGGLAEVARLNSRVEGRIDEKDDWQHWWYA 150 >UniRef50_C9M6C8 Putative uncharacterized protein n=1 Tax=Jonquetella anthropi E3_33 E1 RepID=C9M6C8_9BACT Length = 603 Score = 77.8 bits (190), Expect = 4e-13, Method: Composition-based stats. Identities = 22/119 (18%), Positives = 38/119 (31%), Gaps = 12/119 (10%) Query: 136 DKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPN----VASSKIRNGVGINKHG 191 D + + + ++ A+Q GP+L+++G I + R VG Sbjct: 444 DPITLNVQWRDENAQGTVGALQGGPLLLKDGKIQRMNEGIAVGVINRRHPRTLVGRIGK- 502 Query: 192 NAVFLLSQQATNFY------DFACYAKAKLNVEQLLYLDGTISH-MYMKGGAIPWQRYP 243 +L ++ D A L LL LDG S + G + Sbjct: 503 TVWWLAVDGRAPWHSSGLTLDEATTLGQYLGFTDLLNLDGGGSTELLYHGYPVNKPSDG 561 >UniRef50_UPI0001746B2F hypothetical protein VspiD_16055 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001746B2F Length = 325 Score = 77.4 bits (189), Expect = 4e-13, Method: Composition-based stats. Identities = 44/192 (22%), Positives = 71/192 (36%), Gaps = 25/192 (13%) Query: 44 TVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGL 103 TV +NP + + ++ +G A T LA + A+ D + PLGL Sbjct: 93 TVNVIEINPANYQFQTSFK--DGFALTTAKERLAT----ERAAFAITANFRDPAGKPLGL 146 Query: 104 YIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEI-QFAVQSGPML 162 + G Q+ A G F+V K F+ + + Q A Q P L Sbjct: 147 VVHEGTQRNPTFPAWT---------GYFFVKAGKPWFGPKSLFEETPGVLQEASQGYPSL 197 Query: 163 MENGVINPRIH------PNVASSKIRNGVGINKHGNAVFLL--SQQATNFYDFACYAKAK 214 M+N + + + R G+ ++GN VF+L + N + + Sbjct: 198 MKNHTVFSYVDLPSTRYFDGNRVTYRALAGMKQNGNIVFILSGTGGVMNVSEVTA-LAQR 256 Query: 215 LNVEQLLYLDGT 226 LNV+ LDG Sbjct: 257 LNVQHATLLDGG 268 >UniRef50_C8VW07 S-layer domain protein n=1 Tax=Desulfotomaculum acetoxidans DSM 771 RepID=C8VW07_DESAS Length = 921 Score = 77.4 bits (189), Expect = 5e-13, Method: Composition-based stats. Identities = 25/192 (13%), Positives = 55/192 (28%), Gaps = 33/192 (17%) Query: 78 DINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDK 137 ++ Q + +++N P NG A+ + + Sbjct: 210 EVVVQNDLVVSVN------QNKPGVPIPANGYVLEGHGAAAQFLLENLPVSSRVQTSY-- 261 Query: 138 VGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLL 197 + ++ A+ +L+++G + P + + R VGI ++L Sbjct: 262 ------LVTPQTGNLRAALGGNTLLVQDGQLAP-FTQEITGNYARTAVGIMPDNKTLYLA 314 Query: 198 SQQ------ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGG------AIPWQR---- 241 + + T A + A L V + + LDG S ++ Sbjct: 315 AAENGNGSVGTTQTGMAEFLLA-LGVNRAVNLDGGGSTTLSARHAGDGEASLINHPQLTQ 373 Query: 242 -YPFVTMISVER 252 + V Sbjct: 374 QRLLPDAVGVFS 385 Score = 62.0 bits (149), Expect = 2e-08, Method: Composition-based stats. Identities = 19/135 (14%), Positives = 38/135 (28%), Gaps = 7/135 (5%) Query: 1 MAHQLLIGKGMITLNLKRIFLAL-TLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKM 59 +A L+ G + + P + + A V+ VK+ Sbjct: 18 LALGLVWNAGFTPAYADINPVNTEIISPGVVLQ----TYMYNKTRIYAIKVDLSNPYVKI 73 Query: 60 YWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE-SYAPLGLYIENGQQKVALNLAS 118 L + S+ +NGG + ++ P+GL NG + + Sbjct: 74 DTMIGADGTL-NKAQSLTGMTSRTGAVAGINGGFFQMKNHRPIGLEFSNGNLVSSPAMRE 132 Query: 119 GEGNFFIRPGGVFYV 133 F + + Sbjct: 133 DMPGFAVTNNNQAII 147 >UniRef50_A4FAL4 Putative uncharacterized protein n=2 Tax=Actinomycetales RepID=A4FAL4_SACEN Length = 1118 Score = 77.0 bits (188), Expect = 5e-13, Method: Composition-based stats. Identities = 29/276 (10%), Positives = 67/276 (24%), Gaps = 40/276 (14%) Query: 2 AHQLLIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYW 61 A +G G+ L + + AV L+ L + R+ Sbjct: 151 ATGAPLGVGLSGGELLNAPGSGH-NHVAAVGDRLGGLAQVFLEAAFTRADGAKHRIS--D 207 Query: 62 QKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEG 121 A + + + + + + + +G ++ + Sbjct: 208 LNAPKVSPNGIALYTSFWGAASRATAV------AGAARVTEAEVADGVVTRISDMPAEGS 261 Query: 122 NFFIR---------PGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRI 172 + + + V + + AV +L+ +GV+ Sbjct: 262 APGGTVRLLGVDAGADILRTLRQGERIEVHYAPRSEGELPKAAVGGNKVLLRDGVVQ--- 318 Query: 173 HPNVASSKIRNGVGINKHG-NAVFLLSQQA------TNFYDFACYAKAKLNVEQLLYLDG 225 + + R G + G + + A + ++ L + L LDG Sbjct: 319 QVDDTALHPRTAAGFSADGTRMWLVTIDGRQADSRGMTERELAEHLRS-LGADDALNLDG 377 Query: 226 TISHMYMKGGA------IPWQR-----YPFVTMISV 250 S + + P I Sbjct: 378 GGSSTLLAREEGGPAAVVHNAPSDGHERPVPNGIGF 413 Score = 50.0 bits (118), Expect = 7e-05, Method: Composition-based stats. Identities = 21/145 (14%), Positives = 42/145 (28%), Gaps = 11/145 (7%) Query: 8 GKGMITLNLKRIFLALT------LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYW 61 G+ T ++ T + P + D L +V+ R++ + Sbjct: 58 ADGIATGGVRSAEDIATGSVTKPVAPGLELTEFDRFGPAGWLRGDMLSVDLTESRLEPTY 117 Query: 62 QKANGEAWGTLHALLADINSQGQVQMAMNGGIYDES--YAPLGLYIENGQQKVALNLASG 119 + L++ ++ +NG +D APLG+ + G+ A Sbjct: 118 LHPG---AASARTPLSEQAARIGAVAGVNGDFFDIDATGAPLGVGLSGGELLNAPGSGHN 174 Query: 120 EGNFFIRPGGVFYVAGDKVGIVRLD 144 G + R D Sbjct: 175 HVAAVGDRLGGLAQVFLEAAFTRAD 199 >UniRef50_C7QCB3 Putative uncharacterized protein n=1 Tax=Catenulispora acidiphila DSM 44928 RepID=C7QCB3_CATAD Length = 585 Score = 77.0 bits (188), Expect = 5e-13, Method: Composition-based stats. Identities = 33/207 (15%), Positives = 67/207 (32%), Gaps = 17/207 (8%) Query: 26 LPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQV 85 P+ VA +T ++ + ++ G + Sbjct: 340 QPVVQVARVRPDAVYTGVTADVAVIDQKHSGFVLHPGHEGGLNSVITTVPNQIDANARPN 399 Query: 86 QMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDA 145 +A+ G + S + G Y A +L +G + I G V R + Sbjct: 400 LIALFNGGFKISESHGGYYDHG---VTAASLVNGAASEVIFKDGHMAVGMWG----RDYS 452 Query: 146 FKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKI--------RNGVGINKHGNAVFLL 197 F+ + +I Q+ ++++ G + P I + + R+GVG+ G+ V+ + Sbjct: 453 FQKNADIVSVRQNLKLMVDGGQVVPYIDDSSTWGRADHGSAAVWRSGVGVKADGDIVW-V 511 Query: 198 SQQATNFYDFACYAKAKLNVEQLLYLD 224 A K + + LD Sbjct: 512 GGNELTAPSLARLLKDA-GAVRAMQLD 537 >UniRef50_A3YXL4 Putative uncharacterized protein n=2 Tax=Chroococcales RepID=A3YXL4_9SYNE Length = 610 Score = 76.2 bits (186), Expect = 9e-13, Method: Composition-based stats. Identities = 44/278 (15%), Positives = 74/278 (26%), Gaps = 40/278 (14%) Query: 8 GKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGE 67 G +T +R + + + L + + P R + WQ Sbjct: 315 GLTSLTALAQREQALVAINGGYFNRIRRLPLGALKAEGRWLSG-PILNRGAIGWQPGGLP 373 Query: 68 AWGTLHALLADINSQGQ--VQMAMN----------------GGIYDESYAPLGLYIENGQ 109 ++G L I+ +GQ ++N G S G+ I +G Sbjct: 374 SFGRLALQEQLIDERGQRWPLSSLNSGYVQRGLARYTADWGAGYQALSGNESGVLIRDGV 433 Query: 110 QKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQF----AVQSGPMLME- 164 LN A + + V V +A + + Q Q+ M Sbjct: 434 VLQRLNGAQLQRGIPLGREDTLVVGRAGVMPPWPEASRLTLSSQSSDPLGQQAYVMGGGP 493 Query: 165 ----------NGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQ----ATNFYDFACY 210 NG R +G + FL Q + A Sbjct: 494 LLLLGGRVVLNGTAEGFSSAFQGQGAPRTVIGSDG-RQIWFLTLQGVDHAGPTLGETATL 552 Query: 211 AKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMI 248 + +L + + L LDG S G + I Sbjct: 553 LR-QLGLREALNLDGGSSTGLFVGNTQTVRGRGVAASI 589 Score = 46.2 bits (108), Expect = 0.001, Method: Composition-based stats. Identities = 19/109 (17%), Positives = 37/109 (33%), Gaps = 5/109 (4%) Query: 23 LTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQ 82 L ++ + + + + +PQ + + G L + + Sbjct: 268 ALLNRSVSINRQVLPVGSRRMLISSVRFDPQQSPLDLRLLTRPDGMQGLTS--LTALAQR 325 Query: 83 GQVQMAMNGGIYDESYA-PLGLYIENGQQKVALNLASGEGNFFIRPGGV 130 Q +A+NGG ++ PLG G+ L G +PGG+ Sbjct: 326 EQALVAINGGYFNRIRRLPLGALKAEGRWLSGPILN--RGAIGWQPGGL 372 >UniRef50_C5CET4 Putative uncharacterized protein n=1 Tax=Kosmotoga olearia TBF 19.5.1 RepID=C5CET4_KOSOT Length = 558 Score = 75.8 bits (185), Expect = 1e-12, Method: Composition-based stats. Identities = 21/107 (19%), Positives = 42/107 (39%), Gaps = 13/107 (12%) Query: 149 SKEIQFAVQSGPMLMENGVINPRIHPNVAS------SKIRNGVGINKHGNAVFLLSQQ-- 200 +++++FA++ GP+++ G + S R +GI K G +F++ Sbjct: 439 NEKLKFAIEGGPLIISRGKPVTEYEKSFYSSSLLDIRAPRTLIGITKSGTLMFMIIDGYQ 498 Query: 201 ----ATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYP 243 F + + K N E L+ +DG S + G + Sbjct: 499 MKSYGLTFKEMVEFFTDK-NFEYLMCVDGGKSSALVFKGEVFSSPSS 544 Score = 61.2 bits (147), Expect = 3e-08, Method: Composition-based stats. Identities = 23/97 (23%), Positives = 41/97 (42%), Gaps = 7/97 (7%) Query: 38 LSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE- 96 +S + + A ++P+ R ++ ANG L + + +NGG +D Sbjct: 249 VSGRRIILTALELDPE--RFDIHPVLANGRIPSG--ESLLSMAKRYDAFAVINGGYFDPS 304 Query: 97 SYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYV 133 S+ P+GL IE+G+ +L FF G + Sbjct: 305 SFYPIGLLIEDGKLISLPSLERP--LFFQTEDGKMGI 339 >UniRef50_C9PT69 Putative uncharacterized protein n=1 Tax=Prevotella sp. oral taxon 472 str. F0295 RepID=C9PT69_9BACT Length = 814 Score = 75.8 bits (185), Expect = 1e-12, Method: Composition-based stats. Identities = 21/211 (9%), Positives = 49/211 (23%), Gaps = 40/211 (18%) Query: 76 LADINSQGQVQMAM--NGGIYDESYAPLGLYIE--NGQQKVALNLASGEGNFFIRPGGVF 131 + + Q+ + NG + + +GQ + ++ G Sbjct: 174 VNHLRGDNQLVLYNQHNGQYTHTDAKGTEVLVTLIDGQTWELNKEVRAKVVSVVQNKGNM 233 Query: 132 YVAGDKVGIVRLDAFKTS----------------------KEIQFAVQSGPM--LMENGV 167 ++ + + P ++++G Sbjct: 234 HIPAGSAVLSANGEIAKKLAALTAGTEVKIALNLSIGGATAPFTDVIGGDPRSPMLQDGT 293 Query: 168 INPRIHPNVASSKIRNGVGINKHGNA-VFLLSQQATNF------YDFACYAKAKLNVEQL 220 +N N R G G + + + + + A K + Sbjct: 294 VNTTEIWN--ELHPRTGFGYTQDKKTAIHCVVDGRSTISAGANTKELAEIMKF-VGAYNA 350 Query: 221 LYLDGTISHMYMKG--GAIPWQRYPFVTMIS 249 + LDG S G + +S Sbjct: 351 MNLDGGGSSCLFLKDFGPMNKNSDGNERAVS 381 >UniRef50_C6D5A3 Copper amine oxidase domain protein n=1 Tax=Paenibacillus sp. JDR-2 RepID=C6D5A3_PAESJ Length = 904 Score = 75.5 bits (184), Expect = 2e-12, Method: Composition-based stats. Identities = 28/179 (15%), Positives = 50/179 (27%), Gaps = 25/179 (13%) Query: 97 SYAPL-GLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFA 155 PL G +G A+ ++ G + + + Sbjct: 256 DGKPLTGTIPADGYILRGHGTAAQFILEHLQV-GQSVTSDYSLVSATSGQKVDPTSFEML 314 Query: 156 VQSGPMLMENGVINPRIHP----NVASSKIRNGVGINKHGNAVFLLSQQ------ATNFY 205 V +L+ NG + +S R VG +K G V+L++ + N Sbjct: 315 VGGHTILVNNGAAATFSRDITGVSGSSYVSRTAVGYSKDGTKVYLITSEDYGDSTGLNLK 374 Query: 206 DFACYAKAKLNVEQLLYLDGTISHMYMKG------------GAIPWQRYPFVTMISVER 252 + KL V + + LDG S ++ + I V Sbjct: 375 ELQQVMV-KLGVYKGINLDGGGSTTMIERPLGETSLRLAHSTQYDTTQRSISNSIGVFT 432 Score = 54.7 bits (130), Expect = 3e-06, Method: Composition-based stats. Identities = 10/109 (9%), Positives = 30/109 (27%), Gaps = 3/109 (2%) Query: 31 VAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMN 90 + + + V+ V + + G L + ++ + +N Sbjct: 76 LWTTTRSGKAAKANIHVIQVDLTNPYVNLNTMSGKNDTVGKL-NTVMNMTKENGAVAGIN 134 Query: 91 GGIY--DESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDK 137 ++ +P+G + +G + G F + + Sbjct: 135 ADVFITTTEGSPMGAQVTSGTLMTSPMQIKGMYAFAVTKDRKPVIDSYT 183 >UniRef50_C6J2I2 Copper amine oxidase domain-containing protein n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6J2I2_9BACL Length = 930 Score = 75.1 bits (183), Expect = 2e-12, Method: Composition-based stats. Identities = 20/143 (13%), Positives = 46/143 (32%), Gaps = 12/143 (8%) Query: 102 GLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPM 161 G +G A A+ ++ G A ++ + +++ + + Sbjct: 284 GPVPADGYILRAHGTAADYVASHLQV-GQRVDAIYQLQSLTDGQLVDPADLKVMIGGHTL 342 Query: 162 LMENGVI----NPRIHPNVASSKIRNGVGINKHGNAVFLLS------QQATNFYDFACYA 211 L++ G + S+ R VG +K G ++++ + + Sbjct: 343 LVDQGKASAFTRSTTSISGGSAVARTAVGYSKDGKTAYIITAEKNSNSTGLTLKELQGFM 402 Query: 212 KAKLNVEQLLYLDGTISHMYMKG 234 + V + L LDG S + Sbjct: 403 -TGIGVWKGLNLDGGGSTTMVTR 424 Score = 62.0 bits (149), Expect = 2e-08, Method: Composition-based stats. Identities = 18/135 (13%), Positives = 36/135 (26%), Gaps = 5/135 (3%) Query: 5 LLIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTV--QAYTVNPQTERVKMYWQ 62 L + + +K +T S + V V+ V++ Sbjct: 71 ALTAEAAASQTVKLSEEMITSGAKLVKYEYTVTRSGKPVKVLTDVIEVDLTNPYVQLDVM 130 Query: 63 KANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYA--PLGLYIENGQQKVALNLASGE 120 G T + + + +NG + P+G + G + G Sbjct: 131 TGKGGQV-TTRQSVEGMVKETGAVAGVNGDFFATGGQGVPMGSAVSKGTVVTSPAQLQGM 189 Query: 121 GNFFIRPGGVFYVAG 135 F +R G + Sbjct: 190 YAFAVRNDGTPLIDR 204 >UniRef50_UPI00017890C7 copper amine oxidase domain protein n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI00017890C7 Length = 900 Score = 75.1 bits (183), Expect = 2e-12, Method: Composition-based stats. Identities = 21/138 (15%), Positives = 46/138 (33%), Gaps = 9/138 (6%) Query: 105 IENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLME 164 + G + + A+ E G A + + +Q + +L++ Sbjct: 260 VPEGSYILRTHGAAAEFVKNNLAVGQRLEADYALVSKKTGQKLDPTNLQMMIGGHTILVD 319 Query: 165 NGVINPRIHP--NVASSKIRNGVGINKHGNAVFLLSQQ------ATNFYDFACYAKAKLN 216 NG ++ ++ R VG +K G +L++ + + K+ Sbjct: 320 NGKATSFSRNVNDLGGNRARTAVGYSKDGRYAYLIATESNDNSKGMTLQQLQDFM-TKVG 378 Query: 217 VEQLLYLDGTISHMYMKG 234 V + + LDG S + Sbjct: 379 VWKGMNLDGGGSTTMVNR 396 Score = 67.4 bits (163), Expect = 4e-10, Method: Composition-based stats. Identities = 16/115 (13%), Positives = 34/115 (29%), Gaps = 3/115 (2%) Query: 23 LTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQ 82 + + + V+ + VK+ G + T + + Sbjct: 64 IITSGAVLMKYQYINSAGAKSLANVIRVDLNNKYVKLDVMTGQGNQF-TTRQSTGGMAKE 122 Query: 83 GQVQMAMNGGIYDES--YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAG 135 A+NG ++ AP+G + NG + + G F + G + Sbjct: 123 NGAVAAINGDFFNTGREGAPMGAQVSNGLMMSSPSDLKGMYAFAVTNDGKPILDE 177 >UniRef50_A6NQQ4 Putative uncharacterized protein n=1 Tax=Bacteroides capillosus ATCC 29799 RepID=A6NQQ4_9BACE Length = 1060 Score = 73.9 bits (180), Expect = 5e-12, Method: Composition-based stats. Identities = 22/135 (16%), Positives = 39/135 (28%), Gaps = 12/135 (8%) Query: 102 GLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPM 161 + I G+ +++N + N + GD V I +E A+ Sbjct: 294 SIAIPEGKVVLSINNKA---NSYWLSNVKSLKPGDLVDIDVTTTDSIWQEADQAMGGLYK 350 Query: 162 LMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYA------KAKL 215 L+ G + + + VG+ G +F Y +L Sbjct: 351 LVTAGKVESGLPTGQTAY---TAVGVKADGTVIFYTIDGKQPGYSVGASLTQVAMRLVEL 407 Query: 216 NVEQLLYLDGTISHM 230 + LDG S Sbjct: 408 GCVDAISLDGGGSTT 422 Score = 40.8 bits (94), Expect = 0.039, Method: Composition-based stats. Identities = 16/126 (12%), Positives = 33/126 (26%), Gaps = 15/126 (11%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQ-- 82 L + L T + A G+ + L + + Sbjct: 46 LTHQIFWSDTYSDLRTERYFTY-------TPNKNVTPVVAYGDKVTS-RETLTTMAQRLE 97 Query: 83 ---GQVQMAMNGG-IYDESYAPLGLYIENGQQKVALNLAS-GEGNFFIRPGGVFYVAGDK 137 ++ +NG + PLG I +G + + G R G ++ Sbjct: 98 GEGKRLVGGINGDLYVMATGEPLGTVITDGVLRSVPGTNNQGYYAIGFRSDGTAFIGKPD 157 Query: 138 VGIVRL 143 + + Sbjct: 158 LTVTAT 163 >UniRef50_Q8DHE7 Tlr2012 protein n=1 Tax=Thermosynechococcus elongatus BP-1 RepID=Q8DHE7_THEEB Length = 332 Score = 73.5 bits (179), Expect = 6e-12, Method: Composition-based stats. Identities = 34/257 (13%), Positives = 55/257 (21%), Gaps = 63/257 (24%) Query: 34 DDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGI 93 V RV M A+G+ T A + AMN Sbjct: 16 HYSTHRVDQTQVYMVAAPMADYRVVMTEPAASGQD--TYLETTAAFARRTGAVAAMNTNF 73 Query: 94 Y--------------------------------------DESYAPLGL---YIENGQQKV 112 + P G+ YI NG+ Sbjct: 74 FRWLNAQSLRSFQDYQKWIMGESNPEGISYRALRQTCLSGRGTIPAGVAGAYIVNGRVVR 133 Query: 113 ALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRI 172 G G + ++ V L++ G P Sbjct: 134 PYEGG-YSGIVNFPAQGGIEFYRGR----------LPEQPFNVVSGSQQLLDGGRKLPVD 182 Query: 173 HPNVASSKIRNGVGINKHGNAVFLLSQQATNFYD------FACYAKAKLNVEQLLYLDGT 226 + S K + + VF++S N + V + +DG Sbjct: 183 GSDRTSPK---AILGRRGNEYVFVVSDGRGNGGSPGLSFLQLQDFLLQQGVTEATAMDGG 239 Query: 227 ISHMYMKGGAIPWQRYP 243 S + G + Sbjct: 240 ESATLVVEGQVKNHPRD 256 >UniRef50_Q0AWB0 Putative uncharacterized protein n=1 Tax=Syntrophomonas wolfei subsp. wolfei str. Goettingen RepID=Q0AWB0_SYNWW Length = 497 Score = 73.5 bits (179), Expect = 6e-12, Method: Composition-based stats. Identities = 28/174 (16%), Positives = 50/174 (28%), Gaps = 24/174 (13%) Query: 102 GLYIENGQQKVALN-----LASGEGNFFIRPGGVFYVAGDKVGIV---------RLDAFK 147 G+ + NG +G + KVG Sbjct: 168 GVIVSNGHVSSITTSSFNIPENGFAIVYNGASSYLVDERYKVGDEVYYEVIIKPTFTNPS 227 Query: 148 TSKEIQFAVQSGPMLMENGVINPRIHPNV-------ASSKIRNGVGINKHGNAVFLLSQQ 200 +E+Q A+ +GP L+ NG + SS R+ +G G + Sbjct: 228 DWEEVQCAIGAGPSLIINGNVTASGEEEGFFEAKINTSSSPRSFIGATADGRIIMGNMDA 287 Query: 201 ATNFYDFACYAKAKLNVEQLLYLDGTIS-HMYMKGGAIPWQRYPFVTMISVERK 253 AT A ++ + + LDG S +Y + ++ + Sbjct: 288 ATLKKAAAAC--QRMGLVNAMCLDGGYSIALYYASAGVSLAGRDINNGLAFVGR 339 Score = 44.3 bits (103), Expect = 0.004, Method: Composition-based stats. Identities = 16/107 (14%), Positives = 37/107 (34%), Gaps = 8/107 (7%) Query: 32 AADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNG 91 + + + V+ T++ + + ++ A + T LA + + A+NG Sbjct: 18 SDAYKSEKINGVGVKYVTLDMKDKNIQPRVLNAQNQICAT--ESLASMAKKAGAFAAING 75 Query: 92 GIYDESY---APLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAG 135 ++ P G I++G+ ++ I G V Sbjct: 76 TYFEAYGGTPVPWGTIIKDGKVLH---ISQSGAVVGITSSGKLLVDR 119 >UniRef50_Q7NGC8 Glr3243 protein n=1 Tax=Gloeobacter violaceus RepID=Q7NGC8_GLOVI Length = 540 Score = 72.8 bits (177), Expect = 1e-11, Method: Composition-based stats. Identities = 31/222 (13%), Positives = 66/222 (29%), Gaps = 24/222 (10%) Query: 56 RVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALN 115 R+ G+ + AL G G + A + I + ++ Sbjct: 319 RLNWSAAIRTGDQTLPVGALNGTAAGAGLSVFTPEWGSLYLARAGETVAIARADRIESVV 378 Query: 116 LASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQ---------FAVQSGPMLMENG 166 + + P G+ VA + L A + + +GP+L++N Sbjct: 379 EPMADQIVSLPPDGLLLVARSEPLRTALRAVAAGTPVVLDVQPSGSGSLLGAGPLLVQND 438 Query: 167 VINPR-----IHPNVASSK-IRNGVGINKHGNAVFLLSQQAT----NFYDFACYAKAKLN 216 + P+V + R + + + ++ + +A + Sbjct: 439 KLVLDAQGERFRPDVRAPGVARTAIARRGSLGILAVAARNGWAAGLSLESWANLLLQQFQ 498 Query: 217 VEQLLYLDGTISHMYMKGGAIPWQR-----YPFVTMISVERK 253 + L LDG S + GG + + P + + K Sbjct: 499 ADDALNLDGGGSSGFYLGGRLRDRPEGSFERPVHNGLGLWLK 540 Score = 49.7 bits (117), Expect = 1e-04, Method: Composition-based stats. Identities = 17/140 (12%), Positives = 36/140 (25%), Gaps = 7/140 (5%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQ 84 + + +P R+ + A L+ + + Sbjct: 213 WGVGLVYREVRLVWEGLSQQLHLLEFDPARVRLGLLRP-----PSLGALAPLSALGNSQG 267 Query: 85 VQMAMNGGIYDESYA-PLGLYIENGQQKVALNLA-SGEGNFFIRPGGVFYVAGDKVGIVR 142 A+NGG ++ + LG +GQ G + G + + + +R Sbjct: 268 AWGAVNGGFFNRNTREALGALRSDGQWWTGAVAGLPPRGAASWQDGRLDFDRLNWSAAIR 327 Query: 143 LDAFKTSKEIQFAVQSGPML 162 +G L Sbjct: 328 TGDQTLPVGALNGTAAGAGL 347 >UniRef50_C0AEZ6 Putative uncharacterized protein n=1 Tax=Opitutaceae bacterium TAV2 RepID=C0AEZ6_9BACT Length = 421 Score = 72.8 bits (177), Expect = 1e-11, Method: Composition-based stats. Identities = 19/105 (18%), Positives = 36/105 (34%), Gaps = 8/105 (7%) Query: 133 VAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHG- 191 + + L A +++++ AV +L+ G + + R+ VG+ G Sbjct: 285 ILDINWRLTDLPAGVHTRDVRDAVSGNVILIAAGRLQEGGGAFWTTRHPRSAVGVAADGR 344 Query: 192 NAVFLLSQQA------TNFYDFACYAKAKLNVEQLLYLDGTISHM 230 A+ +L + Y A L + LDG S Sbjct: 345 RALLVLVDGRSLFSAGMDLSALRDYL-AHLGAHDAVNLDGGGSSA 388 Score = 48.1 bits (113), Expect = 3e-04, Method: Composition-based stats. Identities = 20/171 (11%), Positives = 41/171 (23%), Gaps = 13/171 (7%) Query: 23 LTLLPLFAVAADDCALSDPTLTVQAYTVNPQ-TERVKMYWQKANGEAWGTLHA-----LL 76 L P A + Q V+ Q +++ +A + Sbjct: 35 TPLAPGLVWIAAAGVRGGLLVETQVLDVDLQADAGLRLETLVGASQARADSGQYFLRSIP 94 Query: 77 ADINSQGQVQMAMNGGIYD--ESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFY-- 132 + + +N +D + P GL + G L + G Sbjct: 95 SQMQRDSGALAVINASFFDIKSTQTPSGLVVRGGWLLREPMLRDHPAVLSLADGRALIGT 154 Query: 133 --VAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKI 181 G + +E+ A L++ + P + Sbjct: 155 PGWNGRVSVRNKNGDTHAMREMPLA-GVNRTLLQEREVVLFCPPWQRTPDP 204 >UniRef50_C4DE18 Putative uncharacterized protein n=1 Tax=Stackebrandtia nassauensis DSM 44728 RepID=C4DE18_9ACTO Length = 393 Score = 72.4 bits (176), Expect = 1e-11, Method: Composition-based stats. Identities = 25/180 (13%), Positives = 53/180 (29%), Gaps = 34/180 (18%) Query: 105 IENGQQKVALNLASGEGNFFIRPGGVFYVAGD-----------KVGIVRLDAFKTSKEIQ 153 I +G+ ++ A GEG + +V + A + ++ Sbjct: 217 IVDGKVTR-VSDAPGEGQIAKDATVLLARDKGVEHLNGLSEGDEVNVDYQLASEDGADLD 275 Query: 154 FAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHG-NAVFLLSQQAT------NFYD 206 A+ +L+++G + ++ + A+ R VG N G + + Sbjct: 276 TALGGL-ILIDDGTML-DLNDDAATLAPRTAVGSNADGSKLYMVAVDGRSSTSVGATVKS 333 Query: 207 FACYAKAKLNV-EQLLYLDGTISHMYMKGG------AIPWQR-----YPFVTMISVERKG 254 A L ++ LDG S + ++ + V K Sbjct: 334 MADIMVN-LGADHNVINLDGGGSTTLVARKAGNTATSVRNTPSDGSQRKVANGLGVFTKA 392 Score = 56.6 bits (135), Expect = 8e-07, Method: Composition-based stats. Identities = 18/139 (12%), Positives = 37/139 (26%), Gaps = 7/139 (5%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQ 84 + P + TV+ E V++ A + + + + + Sbjct: 31 VAPGVTYEKKTISTPHGKSIGHILTVDLTREDVEVGLLTPGPVAATGV---VTSLADRVK 87 Query: 85 VQMAMNGGIYDES--YAPLGLYIENGQQKVALNLASGEGNFFIRP--GGVFYVAGDKVGI 140 +NG ++ A +G I +G+ + P K G Sbjct: 88 AVAGVNGDFFNIGQTGAAVGPEILDGKDRKGPVPGKQRHGPTPPPGTDNDSVFGITKDGK 147 Query: 141 VRLDAFKTSKEIQFAVQSG 159 +D + A S Sbjct: 148 TVIDRLNVDGKATTAAGSF 166 >UniRef50_A9GRW8 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GRW8_SORC5 Length = 387 Score = 72.0 bits (175), Expect = 2e-11, Method: Composition-based stats. Identities = 27/232 (11%), Positives = 61/232 (26%), Gaps = 28/232 (12%) Query: 8 GKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGE 67 G G + P + + V+ V + Sbjct: 110 GDGRWVPIADHAHPSD--APRLHKTLLHPDPNRSWAELFIVAVDLARVDVHLMAGSREPA 167 Query: 68 AWGTLHALLADINS-----QGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGN 122 A + ++ A NGG E G+ + +G + Sbjct: 168 ATTEEGKPYERLAKIPEADHERLLAAFNGGFMTEHGQ-WGMRV-DGVTL--VRPRDQGCT 223 Query: 123 FFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHP-------- 174 G +A ++Q+ Q+ +++ G ++P + Sbjct: 224 LARHRDGRLQIAP------WTRLSAGESDMQWWRQTPSCMVDEGELHPLLRAPQVRNWGA 277 Query: 175 --NVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLD 224 + + R+ VG+++ G +++ T A + + LD Sbjct: 278 TLDGNTVIRRSAVGLDRDGKVLYVGISNHTTAPAIALGMQHA-GASAVAQLD 328 >UniRef50_A8F5X1 Putative uncharacterized protein n=1 Tax=Thermotoga lettingae TMO RepID=A8F5X1_THELT Length = 550 Score = 72.0 bits (175), Expect = 2e-11, Method: Composition-based stats. Identities = 18/172 (10%), Positives = 44/172 (25%), Gaps = 24/172 (13%) Query: 105 IENGQQ-----KVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSG 159 + +G+ + + + I+ A+++G Sbjct: 377 VVDGKVNGTGWLSKAPKNGFVLAISSKYKKYLEGIQIGDTVEYIVNTNFPYPIKHAIEAG 436 Query: 160 PMLMENGVINPRIHPN--------VASSKIRNGVGINKHGNAVFLLS-----QQATNFYD 206 P+++ G P + +S R G V + N+ + Sbjct: 437 PLILYEGSPIPDRNDEKNRYGGSIARASATRTLAATLPDGKVVLAVINDQDGSGGVNYDE 496 Query: 207 FACYAKAKLNVEQLLYLDGTISHMYMKGGAIPW-----QRYPFVTMISVERK 253 ++ K + DG S + + + + V +K Sbjct: 497 LVEFSLKK-GFYSAMNFDGGSSSIMVIFDRVVSKVPTGWVRAVPASLLVVKK 547 Score = 66.2 bits (160), Expect = 9e-10, Method: Composition-based stats. Identities = 21/180 (11%), Positives = 40/180 (22%), Gaps = 20/180 (11%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQ 84 + + + + V ++PQ +K + + + + Sbjct: 223 IAKGVYWEQKVEKIGNKDMLVNYLWIDPQFVDLKPEISSGGIGSL----ESVEKMVIRKN 278 Query: 85 VQMAMNGGIYDES-YAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRL 143 +N +D + P+GL I +G+ V I Sbjct: 279 AVAGVNANYFDTNTGLPIGLLIVDGKILSMPYGDRPVFIQTFSNEVYISRIYFDVNIKVG 338 Query: 144 DAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATN 203 K I Q G + S R F + N Sbjct: 339 QLLFLVKGINTIAQ--------GEVLIFTPEFGLSIPYR-------DEMVYFSVVDGKVN 383 >UniRef50_D2PRV8 Metallophosphoesterase n=1 Tax=Kribbella flavida DSM 17836 RepID=D2PRV8_9ACTO Length = 1163 Score = 71.6 bits (174), Expect = 2e-11, Method: Composition-based stats. Identities = 24/216 (11%), Positives = 53/216 (24%), Gaps = 41/216 (18%) Query: 55 ERVKMYWQKANGEAWGTLHALLADI--NSQGQVQMAMNGGIYDESYAPLG--LYIENGQQ 110 +V++ + G + ++ + P+ + + +G Sbjct: 217 SKVELTNLNSPSVNAGGIGLYTPQWGEAARTRTV----------DGLPMVREVILRDGVV 266 Query: 111 KVALNL-------ASGEGNFFIRPGGVFY--VAGDKVGIVRLDAFKTSKEIQFAVQSGPM 161 + A+ + G V+ + + A+ Sbjct: 267 VSSSATPATTPLAANEQALIGREAGATALGAFEPGDKVEVKYGPRADAADAAVALSGNKQ 326 Query: 162 LMENGVINPRIHPNVASSKIRNGVGINKHGN-AVFLLSQQAT------NFYDFACYAKAK 214 L +G I + + R +G + G V L + A Sbjct: 327 LARDGQILT---VDDTALHPRTSIGFSADGRRMVLLTVDGRMVDSRGLTEKELARLMLD- 382 Query: 215 LNVEQLLYLDGTISHMYMKG-------GAIPWQRYP 243 L + +L LDG S +K + Sbjct: 383 LGSDDVLNLDGGGSSTMLKRSPGEATPEVVNKPSDG 418 Score = 46.6 bits (109), Expect = 7e-04, Method: Composition-based stats. Identities = 26/171 (15%), Positives = 53/171 (30%), Gaps = 15/171 (8%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQ 84 + P + D + L + V + + + +L+ ++ Sbjct: 90 VAPGVTYTSFDRLDARGWLRGDILVADLDAGGVTVDYLNPG---TVSGREVLSQQAARKG 146 Query: 85 VQMAMNGGIYDES--YAPLGLYIEN------GQQKVALNLASGEGNFFIRPGGVFYVAGD 136 +NG +D + APLG+ IE G+ E I G+ +A Sbjct: 147 AIAGVNGDFFDINDTGAPLGVGIERDADGGAGRFVNGPAAGHNETAV-IGTDGLGRIAQV 205 Query: 137 KVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGI 187 + D + E+ + P + G I +++ R G+ Sbjct: 206 FLAGSATDDDASKVELTNL--NSPS-VNAGGIGLYTPQWGEAARTRTVDGL 253 >UniRef50_A9EQ62 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9EQ62_SORC5 Length = 397 Score = 70.5 bits (171), Expect = 5e-11, Method: Composition-based stats. Identities = 28/229 (12%), Positives = 63/229 (27%), Gaps = 24/229 (10%) Query: 8 GKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGE 67 G G+ T L+ + + ++ +++ Sbjct: 106 GDGVWTPIGDSAARPGEAQVLWK-SVVHPDPKRVFAAIAVVAIDLGRVDLRLVAGTKEPF 164 Query: 68 AWGTLHALLADINSQGQV---QMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFF 124 + + G A NGG G+ ++ + A + Sbjct: 165 SPDIPAERRPGLVPGGHAAELVAAFNGGFKAMHGH-YGMMLDGDTFLPPRDRACTIALYR 223 Query: 125 IRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKI--- 181 + + G R+ A++ + P L+E G ++ ++ + Sbjct: 224 SGAVRIRTWPELRDGEARMAAYRQTP---------PCLVEQGELHHALYDSNRDWGATVS 274 Query: 182 ------RNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLD 224 R+ +G++ G +F +A A KA + LD Sbjct: 275 GETVIRRSALGVDATGKLLFYGLGEAVTARSLARGMKAA-GAHDVAELD 322 >UniRef50_Q826N8 Putative secreted protein n=1 Tax=Streptomyces avermitilis RepID=Q826N8_STRAW Length = 409 Score = 70.5 bits (171), Expect = 6e-11, Method: Composition-based stats. Identities = 36/236 (15%), Positives = 65/236 (27%), Gaps = 36/236 (15%) Query: 50 VNPQTERVKMYWQKANGEAWGTLHALLADI--NSQGQVQMAMNGGIYDE-SYAPLGLYIE 106 V+ + + G++ A AD S+ + + S + + Sbjct: 177 VDTAAGELPLRGLNQYALPEGSVGAYTADWGSVSRARAVCGTDTDRAAPCSTDTYEVTVR 236 Query: 107 NGQQKV------ALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKT-----SKEIQFA 155 +GQ + +A+G R G + G + FA Sbjct: 237 DGQVVSFADSPGSGPIAAGTTVLVGREAGAQQLRKLSTGEEVSVEHRLVAAGSQVAYSFA 296 Query: 156 VQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGN-AVFLLSQQ------ATNFYDFA 208 + P+L G P + + +S +R VGI G + L + A Sbjct: 297 IGGYPVL-RQGKPLPGL--DTVTSAVRTAVGIKDAGRRLLLLAIDGAAAYRSGLTIAEVA 353 Query: 209 CYAKAKLNVEQLLYLDGTISHMYMKGG------AIPWQR-----YPFVTMISVERK 253 + L + LDG S + + P I V + Sbjct: 354 SVMR-GLGATEAFSLDGGGSTTLVARAPGATSVTVRNHPSGGAERPVPNGIGVFTR 408 Score = 55.8 bits (133), Expect = 1e-06, Method: Composition-based stats. Identities = 24/188 (12%), Positives = 48/188 (25%), Gaps = 21/188 (11%) Query: 22 ALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINS 81 ++ L P D +V+ RV++ A ++ + Sbjct: 38 SVPLAPGVEYTQFDIPADRGVTHAHLLSVDLADPRVRVGLLHPG---AVAARAPVSRLAD 94 Query: 82 QGQVQMAMNGGIYD----------ESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVF 131 +NG ++ + + G I +G A + PG Sbjct: 95 SQGALAGINGDFFNITETQHPGVEATGSTDGPAIADGHTLKAAVPNGQRFGPALPPGTTT 154 Query: 132 --YVAGDKVGIVRLDAFKTSKEIQFAVQSGPM------LMENGVINPRIHPNVASSKIRN 183 + RLD + A P+ + G + + S+ R Sbjct: 155 EDVLGVGDDRRARLDRLALEGSVDTAAGELPLRGLNQYALPEGSVGAYTADWGSVSRARA 214 Query: 184 GVGINKHG 191 G + Sbjct: 215 VCGTDTDR 222 >UniRef50_Q72HQ9 Putative uncharacterized protein n=4 Tax=Thermaceae RepID=Q72HQ9_THET2 Length = 487 Score = 70.1 bits (170), Expect = 8e-11, Method: Composition-based stats. Identities = 29/223 (13%), Positives = 58/223 (26%), Gaps = 27/223 (12%) Query: 50 VNPQTERVKMYWQKANGEAWGTLHALLADINS--QGQVQMAMNGGI--YDESYAPLGLY- 104 V ++ + +A+ +V++ +N Y P GL Sbjct: 261 VTLSYPYGRVALLWDGFAFFLGFPQFVAEAVGPDGSRVRVGVNASRARYTAHTVP-GLVG 319 Query: 105 --------IENGQQKV--ALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQF 154 + + G++ + +VG + ++ Sbjct: 320 REGEGVALVREDRVVALLPAPAELPPGHWALTFPRDAPPFPLEVGGRLALYGTLNPPFRY 379 Query: 155 AVQSGPMLMENGVINPRIHPNVASSKIRNGVG--------INKHGNAVFLLSQQATNFYD 206 A++ GP+L++ G R K G L+S+ T Sbjct: 380 ALEGGPLLLKEGR-YAYDPAKENFKDPRPLQAVAPQAAVAWTKEGKLWLLVSE-PTTPGA 437 Query: 207 FACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYPFVTMIS 249 A + L L +DG S G + + Sbjct: 438 LARALLS-LGAWNALRMDGGGSAQLWVKGVLRSPYQGSPRPVV 479 Score = 43.1 bits (100), Expect = 0.008, Method: Composition-based stats. Identities = 17/90 (18%), Positives = 30/90 (33%), Gaps = 13/90 (14%) Query: 24 TLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQG 83 L P + L + R ++ G+ AL D+ Sbjct: 187 VLAPGVRYREVY-GFTPEPLRLYLVEAE----RGRLLPVGTPGK-----RALPKDLAP-- 234 Query: 84 QVQMAMNGGIYDE-SYAPLGLYIENGQQKV 112 +NGG +D S P+GL++++G Sbjct: 235 GALAVLNGGYFDPKSGTPIGLWVQDGVTLS 264 >UniRef50_D2PYC0 Metallophosphoesterase n=1 Tax=Kribbella flavida DSM 17836 RepID=D2PYC0_9ACTO Length = 1094 Score = 67.4 bits (163), Expect = 4e-10, Method: Composition-based stats. Identities = 35/232 (15%), Positives = 62/232 (26%), Gaps = 29/232 (12%) Query: 28 LFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQ-KANGEAWGTLHALLADINSQGQVQ 86 AVA S +A +PQ + + I Sbjct: 125 GPAVADGQLVKSQSEDPYRAVAFDPQGVGRILEVLFDGTAGPYRLNRLNSPVIRKDEIGA 184 Query: 87 MAMNGGIYDESYAPLGLYIENGQQKVALNL------------ASGEGNFFIRPGGVFYVA 134 G Y ++A G + +G R G + Sbjct: 185 FTTLWGSYSRAHAVAGAAKVTEVVVAGDTVTAVAAAAGAGDIPAGTTILVGREAGADELG 244 Query: 135 GDKVGIVRLDAFKTSKE----IQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKH 190 KVG AF ++ AV + +L++ G P + + R G+G + Sbjct: 245 RLKVGDRVPVAFAPRASDGSVVRTAVGAHALLVKEGKPQP---ADDTAYAGRTGLGFSAD 301 Query: 191 GNAVFLLS--------QQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKG 234 G + ++S + + A+ + LDG S + Sbjct: 302 GKKMVIVSIDSNRLTHSRGATLAEMGRILAAR-GAYVGVELDGGGSTTLVSR 352 Score = 45.4 bits (106), Expect = 0.002, Method: Composition-based stats. Identities = 15/106 (14%), Positives = 32/106 (30%), Gaps = 6/106 (5%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQ 84 + P + + +V+ ++ + T L ++ Sbjct: 50 VAPGVTAGSLETFDERGWQQGNTLSVDLTK-GARIDYLSPGQ---VTATKPLDQQANEAG 105 Query: 85 VQMAMNGGIYDES--YAPLGLYIENGQQKVALNLASGEGNFFIRPG 128 A+N +D + APLG + +GQ + + F G Sbjct: 106 AVAAVNADFFDINNSGAPLGPAVADGQLVKSQSEDPYRAVAFDPQG 151 >UniRef50_UPI00016BFF19 Ig-like, group 2 n=1 Tax=Epulopiscium sp. 'N.t. morphotype B' RepID=UPI00016BFF19 Length = 935 Score = 67.0 bits (162), Expect = 6e-10, Method: Composition-based stats. Identities = 10/81 (12%), Positives = 19/81 (23%), Gaps = 9/81 (11%) Query: 164 ENGVINPRIHPNVAS--SKIRNGVGINKH-GNAVFLLSQQAT-----NFYDFACYAKAKL 215 ++G V R + G + + + L Sbjct: 286 QDGHPYLGAANIVGKQVRHPRTLIATTDKFGELLLITIDGRQYSAGATHDEVIQILLD-L 344 Query: 216 NVEQLLYLDGTISHMYMKGGA 236 + +YLDG S + Sbjct: 345 GAKDAMYLDGGGSTTMVARDR 365 Score = 49.7 bits (117), Expect = 1e-04, Method: Composition-based stats. Identities = 24/193 (12%), Positives = 51/193 (26%), Gaps = 19/193 (9%) Query: 24 TLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQG 83 + + L V V+ VK+ A + + + Sbjct: 33 IITEGVLHTQTSLVTNIGWLDVNVLQVDLTNFNVKVAPIDAG---VFDTKQTIYKMATDT 89 Query: 84 QVQMAMNGGIYD-ESYAPL-GLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIV 141 A+N + S P G I +G K + E + VF + + Sbjct: 90 GAVAAVNADFFSLTSTVPSFGAAIADGAVKQIYSTELAEVGPEMNMASVFVDKNLNILMD 149 Query: 142 R--LDAFKTSKEIQFAVQSG----------PMLMENGVINPRIHPNVASSKIRNGVGINK 189 + + A +G P+++++ + + R + K Sbjct: 150 YIDTTVSLHASNGEVAYANGYNKIGTYFNLPVVLDD--QYTATTNEITAKFPRAATLVIK 207 Query: 190 HGNAVFLLSQQAT 202 +G + A Sbjct: 208 NGVVALHKTSGAV 220 >UniRef50_A6G841 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G841_9DELT Length = 507 Score = 67.0 bits (162), Expect = 6e-10, Method: Composition-based stats. Identities = 22/161 (13%), Positives = 40/161 (24%), Gaps = 30/161 (18%) Query: 111 KVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKT-----SKEIQFAVQSGPMLMEN 165 V + + + G + V + A++ + + + GPML+E Sbjct: 320 IVGRRVVAVGRGLPVPLNGFVVPTPETVEVGAEVAYEPLRGSGGRPLVAGIAGGPMLLEG 379 Query: 166 GV--------------INPRIHPNVASSK---IRNGVGINKHGNAVFLLSQQA------- 201 G + + R VG++ VF+ Sbjct: 380 GALTLDLRREDFWGSAPPVTFSQDETGDQNLLPRLAVGLDHAQRLVFVAVDGRDFGRALG 439 Query: 202 TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRY 242 +A L LDG S + G Sbjct: 440 MTLGGVGEVLQA-LGCHTATNLDGGASKRMVLRGRALDLSS 479 Score = 45.8 bits (107), Expect = 0.001, Method: Composition-based stats. Identities = 24/141 (17%), Positives = 41/141 (29%), Gaps = 15/141 (10%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQ 84 + P A A ++ + + V+PQ R+ + + + Q Sbjct: 166 VAPGLEHARVAQACAEGPVHLNVLRVDPQRVRLAV----DDRREGVRAGQPFTEWTRQRG 221 Query: 85 VQMAMNGGIY----------DESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVA 134 A++GG + Y P+GL + G+ A G GGV Sbjct: 222 ATAAVSGGFFLYSEPDIEAPSARYDPVGLLLGEGRCLSPPVFARGALLL-DAEGGVAIEP 280 Query: 135 GDKVGIVRLDAFKTSKEIQFA 155 G A + A Sbjct: 281 LGLGGTHLRLADGRPLDAAEA 301 >UniRef50_A5N8M7 Predicted regulatory protein n=2 Tax=Clostridium kluyveri RepID=A5N8M7_CLOK5 Length = 535 Score = 66.6 bits (161), Expect = 8e-10, Method: Composition-based stats. Identities = 21/211 (9%), Positives = 45/211 (21%), Gaps = 60/211 (28%) Query: 32 AADDCALSDPTLTVQAYTV-NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMN 90 ++ ++ T + + NP V + + ++ + + A+N Sbjct: 367 TIEELGIATRTFRGKILVIKNPSKVEVGYT------KELLRNNKTTDELAKENKALCAIN 420 Query: 91 GGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSK 150 K I + ++ Sbjct: 421 ASY----------------------------------------ISAKADINSKRSLPETE 440 Query: 151 EIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACY 210 +++ NG + + N N G + + Y Sbjct: 441 --------FGIIVHNGNVIYKNSGNTKY----NIAGFTDKNVLISGEYSIGASLYQLQQI 488 Query: 211 AKAKLNVEQLLYLDGTISHMYMKGGAIPWQR 241 LDG IS G I + Sbjct: 489 LLEN-GAYTAAVLDGGISSTMYYKGNIINKP 518 >UniRef50_A3P9C8 Putative lipoprotein n=32 Tax=pseudomallei group RepID=A3P9C8_BURP0 Length = 563 Score = 65.8 bits (159), Expect = 1e-09, Method: Composition-based stats. Identities = 29/201 (14%), Positives = 50/201 (24%), Gaps = 35/201 (17%) Query: 81 SQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGI 140 ++ ++ + NG + NG L ++ PG V+ + Sbjct: 358 ARYELVVDANGAVVAGHATLGAPPPPNGYVLQGLGASAAWLQAHATPGTRLAVSR---RL 414 Query: 141 VRLDAFKTSKEIQFAVQSGPML----MENGVINPRIHPNVAS------------------ 178 A V++GP L + P V Sbjct: 415 SADGADLALASGTSLVEAGPTLSVPNLAQSAAQEGFAPTVGGVDAGEGAAANGNWYNGWY 474 Query: 179 --SKIRNGVGINKHGNAVFLLSQQAT-------NFYDFACYAKAKLNVEQLLYLDGTISH 229 R G+ G + + + + A A L + LDG S Sbjct: 475 VARNGRTAAGVAADGTILLVEIDGRQPALSVGTSIPETAAVM-AWLGATSAVNLDGGGSS 533 Query: 230 MYMKGGAIPWQRYPFVTMISV 250 + GG + V V Sbjct: 534 NMVVGGKMVGHPSDAVGERGV 554 Score = 58.5 bits (140), Expect = 2e-07, Method: Composition-based stats. Identities = 14/113 (12%), Positives = 30/113 (26%), Gaps = 9/113 (7%) Query: 27 PLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQ 86 P + A + V ++P + G G ++ ++ Sbjct: 160 PGTRHTSLAGAPTTGPWIVNVLAIDPSRAGAALSLALPGGNDLGAGGETVSAARARVNAL 219 Query: 87 MAMNGGIYD---------ESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGV 130 +NGG + +P+G + +G+ A G Sbjct: 220 AGVNGGFFTNINPFGAPLPPRSPVGATVVDGRLVAAAIGRRPGLLLARDANGR 272 >UniRef50_C2LSG0 Putative uncharacterized protein n=1 Tax=Streptococcus salivarius SK126 RepID=C2LSG0_STRSL Length = 339 Score = 65.1 bits (157), Expect = 2e-09, Method: Composition-based stats. Identities = 12/198 (6%), Positives = 43/198 (21%), Gaps = 31/198 (15%) Query: 53 QTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKV 112 + + + + + + +N G + Sbjct: 80 KDGKYILQKGRTEDNNPELTEQSIKYEAKRRDALALINAGFWS---------------YE 124 Query: 113 ALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQ-------FAVQSGPMLMEN 165 L+ + + G+ Y + ++ + + +L+++ Sbjct: 125 GLDRPFAQKEIELGKTGLLYGDDQNNITAGTYPNIDTAKMFTHMGSNGWDTGAFGILIKD 184 Query: 166 GVINPRIHP-NVASSKIRNGVGINKHGNAVFLLSQQATNF-----YDFACYAKAKLNVEQ 219 ++ + R+ G + + + ++ L Sbjct: 185 KKVDKTWEKGDPDQPNARSIYVETYDGIIRIIQTYGHNSLNKGLNHEDVYKLLKNLGYSN 244 Query: 220 ---LLYLDGTISHMYMKG 234 LDG + Sbjct: 245 IRLAFLLDGGGTTRMYTR 262 >UniRef50_Q1IXC2 Putative uncharacterized protein n=3 Tax=Deinococcus RepID=Q1IXC2_DEIGD Length = 637 Score = 64.7 bits (156), Expect = 3e-09, Method: Composition-based stats. Identities = 28/198 (14%), Positives = 45/198 (22%), Gaps = 21/198 (10%) Query: 75 LLADINSQGQVQMAMNG--GIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVF- 131 LL G + NG +Y + NG + + P Sbjct: 438 LLTAFVGDGHSAVGGNGLTTLYLVPGTSTVARVVNGSNVPPAGTLAVTFDPARFPQLPRS 497 Query: 132 YVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSKIRNGVGINKHG 191 + Q A+ +GP+L++ G + V + + Sbjct: 498 AGQPLTATLNWQAQDAAWDTAQDALSAGPLLVQGGRVVLNAAREVFDT---SASIWRPTR 554 Query: 192 NAVFLLSQQATNF-------YDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPW----- 239 F + + A A V + LD S G Sbjct: 555 QVAFGVLDGQPTIAYLEYGTPETFAAALAAAGVRDAVRLDSGSSATAYVTGGYANLGGYL 614 Query: 240 ---QRYPFVTMISVERKG 254 P I KG Sbjct: 615 NTVWSRPVPNAIVFVPKG 632 >UniRef50_C1XUX9 Putative uncharacterized protein n=1 Tax=Meiothermus silvanus DSM 9946 RepID=C1XUX9_9DEIN Length = 497 Score = 64.7 bits (156), Expect = 3e-09, Method: Composition-based stats. Identities = 19/132 (14%), Positives = 45/132 (34%), Gaps = 14/132 (10%) Query: 134 AGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVI--NPRIHPNVASS-----KIRNGVG 186 + G + + +A+++GP+L+++G NP + P ++ V Sbjct: 365 PPVRTGEILKLYGSLEPPLAYALEAGPLLIQSGAYAFNPNLEPFTDPRPLNATAPQSAVA 424 Query: 187 INKHGNAVFLLSQQATNFYDFACYAKA-KLNVEQLLYLDGTISHMYMKGGAIPWQ----- 240 + G ++S T A + N+ + +D S G++ Sbjct: 425 WTQDGRLWLVVSD-PTTPSTLARALQLYNPNIWGAIRMDAGGSAQLYVRGSLRTPLIEPQ 483 Query: 241 RYPFVTMISVER 252 V +++ Sbjct: 484 ARKVVNGLALYP 495 Score = 45.8 bits (107), Expect = 0.001, Method: Composition-based stats. Identities = 21/124 (16%), Positives = 36/124 (29%), Gaps = 17/124 (13%) Query: 24 TLLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADINSQG 83 + P F + + L + +P R++ Q +L Sbjct: 192 VIAPGFRY-REVWTFTPEPLRLYLVEADPGRWRMEPVGQPGLRAYLPSLAPT-------- 242 Query: 84 QVQMAMNGGIYDE-SYAPLGLYIENGQQKV------ALNLASGEGNFFIRPGGVFYVAGD 136 +NGG +D S P+GL+I++G AL G V Sbjct: 243 -ALAILNGGYFDPKSGTPIGLWIKDGVALNFPFGRSALMWEQNRVFAGFPKFGTVIVTQS 301 Query: 137 KVGI 140 + Sbjct: 302 GQRL 305 >UniRef50_A6WEB7 Putative uncharacterized protein n=1 Tax=Kineococcus radiotolerans SRS30216 RepID=A6WEB7_KINRD Length = 986 Score = 63.9 bits (154), Expect = 5e-09, Method: Composition-based stats. Identities = 30/175 (17%), Positives = 49/175 (28%), Gaps = 16/175 (9%) Query: 69 WGTLHALLADINSQGQVQMAMNGGIYDESYAP-LGLYIENGQQKVALNLASGEGNFFIRP 127 AL + G V++ + G AP +G VA + P Sbjct: 190 GAGDRALTVADPAAGAVELEVRAGRVSAVRAPGAVPVPADGYVLVATGSRAR--ALSATP 247 Query: 128 GGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPNVASSK--IRNGV 185 G +V L FA+ + L+ +G I P + + R + Sbjct: 248 VGAAAGTDLRVRDDALSPGSRG----FALGARLELVRDGAIAPIDVADPTWAALRARTAL 303 Query: 186 GINKHGNAVFLLSQQAT------NFYDFACYAKAKLNVEQLLYLDGTISHMYMKG 234 G G+ + L T + A + LDG S + Sbjct: 304 GWTATGDLLLLTVDGGTSRSRGLTAVETAQRMVEA-GARGAVMLDGGGSAQLVAR 357 >UniRef50_A9QSK3 Polysaccharide biosynthesis protein n=7 Tax=Streptococcaceae RepID=A9QSK3_LACLK Length = 300 Score = 63.1 bits (152), Expect = 7e-09, Method: Composition-based stats. Identities = 32/209 (15%), Positives = 58/209 (27%), Gaps = 22/209 (10%) Query: 51 NPQTERVKMYWQKANGEAWGTLHALLAD-------INSQGQVQMAMNGGIYDE-SYAPLG 102 + T + + ++ N E T+ I + MN +D + G Sbjct: 97 DLSTNNITI-YRINNPEVLKTVTNRTDQRMKMSEVIAKYPNALI-MNASAFDMQTGQVAG 154 Query: 103 LYIENGQQKVALNLASG-EGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPM 161 I NG+ + + + F I G + Q A G Sbjct: 155 FQINNGKLIQDWSPGTTTQYAFVINKDGSCKIYDS----STPALTIIKNGGQQAYDFGTA 210 Query: 162 LMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKA--KLNVEQ 219 ++ +G I P I + +K N +LS + K+ L ++ Sbjct: 211 IIRDGKIQPSDGSVDWKIHI--FIANDKDNNLYAILSDTNAGYD---NIMKSVSNLKLKN 265 Query: 220 LLYLDGTISHMYMKGGAIPWQRYPFVTMI 248 +L LD S + Sbjct: 266 MLLLDSGGSSQLSVNDKTIVASQDDRAVP 294 >UniRef50_Q119M8 Putative uncharacterized protein n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q119M8_TRIEI Length = 283 Score = 61.2 bits (147), Expect = 3e-08, Method: Composition-based stats. Identities = 26/169 (15%), Positives = 50/169 (29%), Gaps = 27/169 (15%) Query: 105 IENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLME 164 I NGQ +S F ++ G+ V+ G + K E+ L+ Sbjct: 117 ITNGQFFRNDKSSSTALAFPLKSDGI-IVSDGYAGEIEFSHEKLMLEVWN----NRALIS 171 Query: 165 NGVI---------------NPRIHPNVASSKIRNGVGI-NKHGN----AVFLLSQQATNF 204 V R +G+ +K G+ + + + + Sbjct: 172 KFQPNNLQFSIATNFIVGLQENADKGVEDQTGRTFIGVQDKDGDRLYETILIFTSKQATQ 231 Query: 205 YDFACYAKAKLNVEQLLYLDGTISHMYM-KGGAIPWQRYPFVTMISVER 252 K+ Q++ LDG S + +G + MI++ Sbjct: 232 PHATNVLKS-FGATQVMMLDGGGSTQLICQGNNYIDSQRTIPQMIAIFS 279 >UniRef50_Q8YTL3 All2704 protein n=4 Tax=Nostocaceae RepID=Q8YTL3_ANASP Length = 310 Score = 60.1 bits (144), Expect = 8e-08, Method: Composition-based stats. Identities = 34/253 (13%), Positives = 63/253 (24%), Gaps = 55/253 (21%) Query: 46 QAYTVNPQTERV----------KMYWQKANGEAWGTLHALL--------ADINSQGQVQM 87 NP++ + ++Y + A G+ + + Sbjct: 64 HVIIFNPRSANLDFKVNLGLSHQLYTKDARGKIRREYIPKQFNELISDSNSTLNGRRPIA 123 Query: 88 AMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFK 147 A+N D P GL I G + F F ++G K + Sbjct: 124 AINADYIDPENKPQGLNISRGVEYSGD---------FKNKRSSFGISGGKPQERKATIQA 174 Query: 148 TSKEIQ----FAVQSGPMLMENGVINPRIHPNVA----SSKIRNGVGINKHGNAVFLLSQ 199 +EI V G S R+ I G + L++ Sbjct: 175 GRREINILNYNLVGGNGRFYRQGKFKDICQDLGEFACKQSTNRSMAAITNKGYVILLVND 234 Query: 200 -------------QATNFYDFACYA-----KAKLN-VEQLLYLDGTISHMYMKGGAI-PW 239 Q F + L +++ + DG +S I Sbjct: 235 IKANSNIEINSNNQELTPDKFDDVLEGISRQNCLGKIQEGILFDGGMSPGLYYNKKIYVE 294 Query: 240 QRYPFVTMISVER 252 P ++ + + Sbjct: 295 NPGPIGSVFLIYK 307 >UniRef50_B2HMV0 Lipoprotein LprO n=21 Tax=Mycobacterium RepID=B2HMV0_MYCMM Length = 381 Score = 59.7 bits (143), Expect = 9e-08, Method: Composition-based stats. Identities = 29/210 (13%), Positives = 53/210 (25%), Gaps = 40/210 (19%) Query: 74 ALLADINSQGQVQMAMNGGIYDESYA------------PLGLYIEN------------GQ 109 L GQ +A+N +D PLG +++N G Sbjct: 149 PPLQAWQRMGQPTIAINANFFDVRGQKGGSWRTTGCSSPLGAFVDNTHGMGRANQAVTGT 208 Query: 110 QKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLD-------------AFKTSKEIQFAV 156 A GN + V +K +F Sbjct: 209 VAYAGKQGLSGGNEVWTSLTTMIIPVGGAPYVLRPKGRQDYDLATPVIQDLLNKNAKFVA 268 Query: 157 QSGPMLMENGVINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLN 216 +G L+ G I + S R + +K + +++ + + + L Sbjct: 269 VAGIGLLSPGDI--GQLHDGGPSAARTALAYSKPKDEMYIFEGGSYTPDNIQDLFR-GLG 325 Query: 217 VEQLLYLDGTISHMYMKGGAIPWQRYPFVT 246 + + LDG S + Sbjct: 326 SDTAILLDGGGSSAIVLRRDTGGMWAGAGA 355 >UniRef50_C6D289 Putative uncharacterized protein n=1 Tax=Paenibacillus sp. JDR-2 RepID=C6D289_PAESJ Length = 274 Score = 59.3 bits (142), Expect = 1e-07, Method: Composition-based stats. Identities = 30/182 (16%), Positives = 58/182 (31%), Gaps = 17/182 (9%) Query: 87 MAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIR-PGGVFYVAGDK-----VGI 140 +NGG + E+ L + + +G G G ++ G G + Sbjct: 94 YGVNGGFFYENSL-LSIAVVDGLPVNGALDDYGSGAENVKYARGTLVWDGASDKLSVQVV 152 Query: 141 VRLDAFKTSKEIQFAVQSGPM--LMENG----VINPRIHPNVASSKIRNGVGINKHGNAV 194 + K +F Q G L ++ + P ++R+ ++ G Sbjct: 153 RQAADLKVMDHTRFWAQGGISMSLGQDRNWLEQVETEQAPYPDDDRLRSAAVYDREGVLY 212 Query: 195 FLLSQQATNFYDFACYAKAKLN---VEQLLYLDGTISHMY-MKGGAIPWQRYPFVTMISV 250 ++S + F K+ + ++LDG S + P V MI + Sbjct: 213 LIVSSSKGSLQSFRDAILEKVGRGMLVDGIFLDGDGSSQLRSAEKTLTGDNRPVVQMIRI 272 Query: 251 ER 252 R Sbjct: 273 VR 274 >UniRef50_B8CD22 Predicted protein n=1 Tax=Thalassiosira pseudonana RepID=B8CD22_THAPS Length = 572 Score = 58.5 bits (140), Expect = 2e-07, Method: Composition-based stats. Identities = 14/121 (11%), Positives = 25/121 (20%), Gaps = 30/121 (24%) Query: 143 LDAFKTSKEIQFAVQSGPMLMENGV-----------------INPRIHPNVASSK--IRN 183 + + AV GP+ ++ R Sbjct: 401 NVTYTLPTPLDNAVAGGPIFFDDNNDEQTMDLPSEDFKGSAPPVTFSQDETFDRNLLPRM 460 Query: 184 GVGINKH-----GNAVFLLSQQATNFYDFA------CYAKAKLNVEQLLYLDGTISHMYM 232 G+GI + V + L + + LDG S + Sbjct: 461 GIGITNNDSSGEKELVCVAVDGRNLDRALGLTLQGTSDLLKTLGCVKAMNLDGGSSKRMV 520 Query: 233 K 233 Sbjct: 521 I 521 >UniRef50_C0DAA9 Putative uncharacterized protein n=1 Tax=Clostridium asparagiforme DSM 15981 RepID=C0DAA9_9CLOT Length = 798 Score = 57.4 bits (137), Expect = 4e-07, Method: Composition-based stats. Identities = 24/157 (15%), Positives = 41/157 (26%), Gaps = 36/157 (22%) Query: 123 FFIRPGGVFYVAGDKVGIVRLDAF----KTSKEIQFAVQSGPMLMENGVINPRIHPNVA- 177 F G + G + I + + +A G L+ +G ++ Sbjct: 620 FEAAGDGYYRWDGPEAVICLEPPEGIGAEDWDSVCWAFGGGMSLISDGESLFEQETGLSR 679 Query: 178 ----------------------SSKIRNGVGINKHGNAVFLLSQQATNF------YDFAC 209 S R VG+ G L+ T Sbjct: 680 LEDEGWLGPLSRQTQESEIHRLSKHPRTAVGVTDQGELFVLVFSGRTALSVGADYAQMGR 739 Query: 210 YAKAKL-NVEQLLYLDGTISHM--YMKGGAIPWQRYP 243 A+ + NV ++ +DG S + G YP Sbjct: 740 IARTLVPNVRHMMNVDGGGSAVFGMAVGKVFVELSYP 776 >UniRef50_C7LY43 Putative uncharacterized protein n=1 Tax=Acidimicrobium ferrooxidans DSM 10331 RepID=C7LY43_ACIFD Length = 397 Score = 55.4 bits (132), Expect = 2e-06, Method: Composition-based stats. Identities = 35/172 (20%), Positives = 59/172 (34%), Gaps = 24/172 (13%) Query: 63 KANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGN 122 G W + + + + + A N G + G Y G+ V L +G + Sbjct: 182 PPGGGPWPYMAPITNPVAAD--LVAAFNSGFRMQDAN--GGYYAYGRTAVPLR--NGAAS 235 Query: 123 FFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHP-------- 174 F I GV + K I Q+ L+ NG INP ++ Sbjct: 236 FVISTSGVPTI------ETWTHGNHVPKGIAVVRQNLIPLISNGRINPLVNSTNFAIWGA 289 Query: 175 --NVASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLD 224 R+GVGI ++G V+ ++ + A A++ + LD Sbjct: 290 TVGNQLLVWRSGVGITRNGALVY-VTGPGLSVASLARLL-ARVGAVNAMELD 339 >UniRef50_A4FIV8 Secreted protein n=1 Tax=Saccharopolyspora erythraea NRRL 2338 RepID=A4FIV8_SACEN Length = 94 Score = 55.0 bits (131), Expect = 2e-06, Method: Composition-based stats. Identities = 10/80 (12%), Positives = 22/80 (27%), Gaps = 14/80 (17%) Query: 186 GINKHGNAVFLLSQQAT-------NFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIP 238 GI++ G + + + A + ++ L + LD S ++ G Sbjct: 3 GIDQAGRLLPVTVDGRRPGSSAGFTLLEAARFMRS-LGAVNAMNLDSGGSTSFVVNGKPA 61 Query: 239 WQR------YPFVTMISVER 252 + V Sbjct: 62 NSPSDATGERAVGDALVVVP 81 >UniRef50_D2W0I7 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2W0I7_NAEGR Length = 201 Score = 54.7 bits (130), Expect = 3e-06, Method: Composition-based stats. Identities = 18/101 (17%), Positives = 25/101 (24%), Gaps = 3/101 (2%) Query: 60 YWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYD-ESYAPLGLYIENGQQKVALNLAS 118 N +D +A N G +D + LG I NG+ Sbjct: 90 QPSTTNSFTDTPCKERTSDTAKLNDCIVATNAGFFDVANGYCLGKVISNGKVLNDYGRVH 149 Query: 119 GEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSG 159 F + G V +I VQ G Sbjct: 150 PNAAFGLIDDGK--GNAQYVFGYLNQTDYEKLKITQLVQVG 188 >UniRef50_UPI00016A4F20 hypothetical protein BthaT_13010 n=4 Tax=pseudomallei group RepID=UPI00016A4F20 Length = 196 Score = 52.4 bits (124), Expect = 1e-05, Method: Composition-based stats. Identities = 13/71 (18%), Positives = 23/71 (32%), Gaps = 8/71 (11%) Query: 178 SSKIRNGVGINKHGNAVFLLSQQAT-------NFYDFACYAKAKLNVEQLLYLDGTISHM 230 + R VG+ G+ + + + + A A L + LDG+ S Sbjct: 126 ARNGRTAVGVAADGSVLLVGIDGRQPVPGVGASVPETAAGM-AWLGAASAVTLDGSGSSN 184 Query: 231 YMKGGAIPWQR 241 + GG Sbjct: 185 LVIGGKTVRPS 195 >UniRef50_Q92JI8 Uncharacterized protein RC0079 n=11 Tax=Rickettsia RepID=Y079_RICCN Length = 282 Score = 50.4 bits (119), Expect = 5e-05, Method: Composition-based stats. Identities = 15/89 (16%), Positives = 27/89 (30%), Gaps = 15/89 (16%) Query: 111 KVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINP 170 V +N + FI G + I V P+L++NG Sbjct: 145 VVNINDSVKLILEFIDKDGKLINLSNTASI---------------VTGIPLLVQNGKNVV 189 Query: 171 RIHPNVASSKIRNGVGINKHGNAVFLLSQ 199 + R +G+ G V ++ + Sbjct: 190 DNPKQDDPAHARTALGVCNDGTIVIVVVE 218 >UniRef50_A7MD65 Zgc:165534 protein n=3 Tax=Clupeocephala RepID=A7MD65_DANRE Length = 313 Score = 50.4 bits (119), Expect = 6e-05, Method: Composition-based stats. Identities = 11/42 (26%), Positives = 18/42 (42%), Gaps = 1/42 (2%) Query: 202 TNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPWQRYP 243 N + A + K + NV + LDG S Y+ G++ Sbjct: 1 MNLWQVAKFLKDQ-NVMNAINLDGGGSATYVLNGSLASYPSD 41 >UniRef50_B3VCD3 BJA-8 n=1 Tax=Carukia barnesi RepID=B3VCD3_CARBN Length = 230 Score = 47.3 bits (111), Expect = 4e-04, Method: Composition-based stats. Identities = 20/115 (17%), Positives = 31/115 (26%), Gaps = 6/115 (5%) Query: 51 NPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE-SYAPLGLYIENGQ 109 NP + A S+ + +A N G + S +G NG+ Sbjct: 113 NPLRTFSVLEPLHAGSCNPWVAPRANVLTTSKKRCVVASNAGYFRTSSGGCIGNIFSNGR 172 Query: 110 QKVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLME 164 N NF IR G + V + + V L+ Sbjct: 173 LVQTSNG-IQNANFGIRKDGSIVL----VYLTEDNVRDEENPFVQLVSGVGWLLR 222 >UniRef50_A0YKD9 Putative uncharacterized protein n=1 Tax=Lyngbya sp. PCC 8106 RepID=A0YKD9_9CYAN Length = 83 Score = 46.6 bits (109), Expect = 8e-04, Method: Composition-based stats. Identities = 12/69 (17%), Positives = 21/69 (30%), Gaps = 6/69 (8%) Query: 189 KHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGTISHMYMKGGAIPW-----QRYP 243 + +F+L+ D A K ++++ LDG S + G Sbjct: 4 ADFHLIFILTSPGKTQGDAAQLLKD-FGAKKVMMLDGGGSTQLIVSGRELVSSSDATPRT 62 Query: 244 FVTMISVER 252 I V Sbjct: 63 IPQAIGVLS 71 >UniRef50_C4ICA7 Putative uncharacterized protein n=1 Tax=Clostridium butyricum E4 str. BoNT E BL5262 RepID=C4ICA7_CLOBU Length = 42 Score = 45.8 bits (107), Expect = 0.001, Method: Composition-based stats. Identities = 11/40 (27%), Positives = 13/40 (32%), Gaps = 2/40 (5%) Query: 214 KLNVEQLLYLDGTISHMYMKGGAIPW--QRYPFVTMISVE 251 KL + LDG S G + T I VE Sbjct: 3 KLGAVNAINLDGGKSSTMYYNGNTINETEGRKIPTAILVE 42 >UniRef50_Q9RZG9 Putative uncharacterized protein n=1 Tax=Deinococcus radiodurans RepID=Q9RZG9_DEIRA Length = 270 Score = 45.4 bits (106), Expect = 0.002, Method: Composition-based stats. Identities = 30/209 (14%), Positives = 53/209 (25%), Gaps = 29/209 (13%) Query: 55 ERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAP------LGLYIENG 108 + +++ K S +NGG Y P L L + N Sbjct: 11 NGITLHYMKTTASNIVLRRINSNVTASGH---YGINGGFYIL-GEPIESQPLLSLTVNND 66 Query: 109 Q-----QKVALNLASGEGNFFIRPGGVFYVAGDKV----GIVRLDAFKTSKEIQFAVQSG 159 + SG N G VF+ + + + + Q G Sbjct: 67 VPVGSLIYQDTSYGSGWANVGYARGTVFHDTVSRTIGVRVVSNASQISVTNRSNYWAQGG 126 Query: 160 PMLMENGVINPRIHPNVASSKI-------RNGVGINKHGNAVFLLSQQAT---NFYDFAC 209 + N R + R G+ N+ G +++ F Sbjct: 127 VSMSLQNDANWRDIAVTQQNLPNPDGVIQRAGLVYNEAGYVYLVMTASGQLGATAGQFRQ 186 Query: 210 YAKAKLNVEQLLYLDGTISHMYMKGGAIP 238 K L ++LD + S + Sbjct: 187 AIKQTLGALDGIFLDSSGSAQMLCAEFRN 215 >UniRef50_C7M125 Putative uncharacterized protein n=1 Tax=Acidimicrobium ferrooxidans DSM 10331 RepID=C7M125_ACIFD Length = 344 Score = 43.5 bits (101), Expect = 0.007, Method: Composition-based stats. Identities = 32/224 (14%), Positives = 65/224 (29%), Gaps = 32/224 (14%) Query: 19 IFLALTLLP-------LFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQK-ANGEAWG 70 +T P V + A S V ++ V+++ G ++ Sbjct: 82 TGAPVTWTPAGAVGSGGAVVFTSEVAPSPGAAPVGVAWIDQAHAVVQLFAGTTQPGGSFR 141 Query: 71 TLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQKVALNLASGEGNFFIRPGGV 130 + + + A GG + G + ++G L G + G Sbjct: 142 YQGMVPPSLV--TNLVAAFEGGF--QFAVSNGGFEQDGVV--GAPLVEGAASLVELTNGR 195 Query: 131 FYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINPRIHPN----------VASSK 180 + + A + Q+ +L+++G + P N + Sbjct: 196 VEIGAWGSEVGPSPA------VSAVRQNLTLLVDHGAVLPTASENPLVTWGYSLGNLLAT 249 Query: 181 IRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLD 224 R+G+GI HGN V++ + + LD Sbjct: 250 WRSGLGITSHGNLVWV--GGPGLSPATLGSMLVWAGAVRGMQLD 291 >UniRef50_B1FR41 YD repeat n=1 Tax=Burkholderia ambifaria IOP40-10 RepID=B1FR41_9BURK Length = 356 Score = 42.7 bits (99), Expect = 0.011, Method: Composition-based stats. Identities = 34/282 (12%), Positives = 67/282 (23%), Gaps = 70/282 (24%) Query: 27 PLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADIN---SQG 83 + A + + ++ + K+ + + + + + Sbjct: 84 TGLSAAFEARDTPHHIQELAYVVIDGRRYSGKVVTDRDDALINPVQLSWEQALALTRTGT 143 Query: 84 QVQMAMNGGIYDESYAPLGLYIEN---GQQKVALNLASGEGNFFIRPGGVFYVAGDKVGI 140 + +NGG ++ EN G+ G + D Sbjct: 144 GSFVCINGGFFNHKRMACEDAPENASIGKMVSDGLERPG-----------LPLPRDYADD 192 Query: 141 VRLDAFKTSKEIQFAVQSGPMLMENGVIN-----------------PRIHPNVA------ 177 R F+ I+ A P L E G P P++ Sbjct: 193 YRPLEFEDGSCIEVA----PRLSEGGKPVVTDETLADPKYDIKRTDPAWRPDITPGRLQH 248 Query: 178 --SSKIRNGVGINKH---GNAVFLLS---------QQATNFYDFACYA--KAKLN--VEQ 219 S R + G ++ + +A KL+ Sbjct: 249 AVSRHPRAAISQPAAWAAGKTRLIVGTAWDRNNNPDAGYSLAQWAQIVSRLDKLSTPPNS 308 Query: 220 LLYLDGTISHMYMK----GGAI----PWQRYPFVTMISVERK 253 LDG S + G I + P +I+ + Sbjct: 309 STNLDGGGSLALVAMTASGKRICVAQNGEGRPVANLIAFVER 350 >UniRef50_UPI00019038D8 hypothetical protein Retl8_15906 n=1 Tax=Rhizobium etli 8C-3 RepID=UPI00019038D8 Length = 91 Score = 42.0 bits (97), Expect = 0.022, Method: Composition-based stats. Identities = 11/64 (17%), Positives = 26/64 (40%), Gaps = 1/64 (1%) Query: 32 AADDCALSDPTLTVQAYTVNPQTERVKMYWQKANGEAWGTLHALLADI-NSQGQVQMAMN 90 A + T+ ++++W+ A+GE + +L + ++ A+N Sbjct: 26 AQQCGQETFDEAKYVVCTLEVGKVDLRLFWKGADGEPYRAFSSLADAVRAEGRKLIFAVN 85 Query: 91 GGIY 94 G+Y Sbjct: 86 AGMY 89 >UniRef50_C3QEU3 Predicted protein n=5 Tax=Bacteroides RepID=C3QEU3_9BACE Length = 1114 Score = 41.2 bits (95), Expect = 0.032, Method: Composition-based stats. Identities = 13/104 (12%), Positives = 33/104 (31%), Gaps = 19/104 (18%) Query: 25 LLPLFAVAADDCALSDPTLTVQAYTVNPQTERVKMYWQKAN-----------GEAWGTLH 73 L + ++ + A ++ +V A+ L Sbjct: 726 LASGIVY--KHYSFTNFNQNIYAIEIDMNNPKVTFETVMADEICPNPNGNNNSNNGKVLR 783 Query: 74 ALLADINSQ-----GQVQMAMNGGIYDE-SYAPLGLYIENGQQK 111 L++ ++ + + +N G ++ P G++IE G+ Sbjct: 784 ETLSETCTRRRDEGRNIIVGINTGFFNSHDGFPRGMHIEEGEPV 827 Score = 40.0 bits (92), Expect = 0.067, Method: Composition-based stats. Identities = 15/78 (19%), Positives = 27/78 (34%), Gaps = 9/78 (11%) Query: 163 MENGVI-NPRIHPNVASSKIRNGVGINKHG-NAVFLLSQQATN------FYDFACYAKAK 214 + NGV P + + +G+ + V T+ FY+ K K Sbjct: 1006 VYNGVYSAPPKKEDAETINPTTNLGMTQDKSKIVIFCVDGRTDSDRGLDFYEAYRVCK-K 1064 Query: 215 LNVEQLLYLDGTISHMYM 232 L + ++ DG S + Sbjct: 1065 LGLYDVIRFDGGGSTVMW 1082 >UniRef50_C5KB48 Putative uncharacterized protein n=1 Tax=Perkinsus marinus ATCC 50983 RepID=C5KB48_9ALVE Length = 925 Score = 41.2 bits (95), Expect = 0.034, Method: Composition-based stats. Identities = 23/190 (12%), Positives = 52/190 (27%), Gaps = 24/190 (12%) Query: 69 WGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLYIENGQQ-KVALNLASGEGNFFIRP 127 + + ++ + + ++ + P +I++G + A + F P Sbjct: 704 GYPITQWIRELTKADENIVLLSNSVQSNFQNPYDCFIQDGIMQRQGYASACPKYAFGDSP 763 Query: 128 GGVFYVAGDKVGIV-----RLDAFKTSKEIQFAVQSGPMLMENGV----INPRIHPN--- 175 VF + D D + AV P + +G + Sbjct: 764 VDVFVLDSDTEERRLRSCKVTDECNERFPWRRAVSGRP-FVTDGELRKIPHWDHEDGEEI 822 Query: 176 ---------VASSKIRNGVGINKHGNAVFLLSQQATNFYDFACYAKAKLNVEQLLYLDGT 226 ++ + V ++ G V + Q Y+F + + L G+ Sbjct: 823 KYGEVPWLPSSTEAAFSAVCESRDGQVVLAYAIQPLTAYEFGRALIDS-GIHDAVLLGGS 881 Query: 227 ISHMYMKGGA 236 G Sbjct: 882 GDVGVWIQGR 891 >UniRef50_B4DZG9 cDNA FLJ52812, highly similar to N-acetylglucosamine-1-phosphodiesteralpha-N- acetylglucosaminidase (EC 3.1.4.45) n=2 Tax=Mammalia RepID=B4DZG9_HUMAN Length = 161 Score = 40.4 bits (93), Expect = 0.063, Method: Composition-based stats. Identities = 11/76 (14%), Positives = 23/76 (30%), Gaps = 3/76 (3%) Query: 38 LSDPTLTVQAYT-VNPQTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDE 96 D + V P + G A + + ++A NGG + Sbjct: 85 FRDRAVAGHLTRAVEPLR-TFSVLEPGGPGGCAARRRATVEETARAADCRVAQNGGFFRM 143 Query: 97 S-YAPLGLYIENGQQK 111 + LG + + ++ Sbjct: 144 NSGECLGNVVSDERRV 159 >UniRef50_D1BLE5 Putative uncharacterized protein n=3 Tax=Veillonella RepID=D1BLE5_VEIPT Length = 446 Score = 40.4 bits (93), Expect = 0.064, Method: Composition-based stats. Identities = 37/264 (14%), Positives = 67/264 (25%), Gaps = 35/264 (13%) Query: 2 AHQLLIGKGMITLNLKRIFLALTLLPLFAVAADDCALSDPTLTVQAYTVNPQTERV--KM 59 AH +IG + + +R P + L + P + Sbjct: 187 AHTAIIGAKVKGGSFER--------PYSVASDGAIDLERISNRGT-LRYTPNRGYFIEEK 237 Query: 60 YWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAP--LGL--YIENGQQKVALN 115 + D + + + Y S G + NG+ Sbjct: 238 KPLLQAKTKNESFVITSVDTVRKENA-LTLYTSTYGPSTKTNEYGYEVTVANGKVISKQK 296 Query: 116 LAS--GEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFA--VQS-----GPMLMENG 166 S GE + + G A K+ + + +E+ G M+M+ G Sbjct: 297 GNSKIGENQYVLSGHGESGNALRKLKVGTPITIQNREELAQVSTTGGASLEVGTMVMKGG 356 Query: 167 VINPRIHPNVASSKIRNGVGINKHGNAVFLLSQQ------ATNFYDFACYAKAKLNVEQL 220 N R+ +G K + V L + + A +KL V Sbjct: 357 RYVGADESNNKG---RSFIGTTKEHDLVVLTVDKSELQSVGVTQKEGAQLL-SKLGVVDG 412 Query: 221 LYLDGTISHMYMKGGAIPWQRYPF 244 L S + + Sbjct: 413 AELSNQGSIDIVINDDYVHKSSAP 436 >UniRef50_A9GVQ9 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GVQ9_SORC5 Length = 308 Score = 40.4 bits (93), Expect = 0.064, Method: Composition-based stats. Identities = 27/209 (12%), Positives = 58/209 (27%), Gaps = 21/209 (10%) Query: 53 QTERVKMYWQKANGEAWGTLHALLADINSQGQVQMAMNGGIYDESYAPLGLY--IENGQQ 110 K+ + + WG A V G + P GL ++ G Sbjct: 110 NGPDFKLIRRVNFTDFWGAARAQETATRRARVVV----SGTFGTFNQPTGLAFGLKAGGN 165 Query: 111 KVALNLASGEGNFFIRPGGVFYVAGDKVGIVRLDAFKTSKEIQFAVQSGPMLMENGVINP 170 ++ A+ G + + + + + + + P ++ G ++ Sbjct: 166 LISYGYAAPGGPSPEPHRVMLFAFNNARSRGWIGDYN-----RDSFTVSPDVV--GAVHV 218 Query: 171 RIHPNVASSKIRNGVGINKHGN-----AVFLLSQQATNFYDFACYAKAKLNVEQLLYLDG 225 R VG+ + + S + + A + +Q + LDG Sbjct: 219 DADFRPGDLTGRTFVGVRDDDRDGNAETILVFSSSSATTWQ-ASTTISAFGAQQKVMLDG 277 Query: 226 TISHMYMKGG--AIPWQRYPFVTMISVER 252 + S + G I I+ Sbjct: 278 SYSTYLIVDGGPRISTAGRLVPHGIAFYS 306 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.312 0.136 0.344 Lambda K H 0.267 0.0421 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,088,754,819 Number of Sequences: 3077464 Number of extensions: 37446203 Number of successful extensions: 133997 Number of sequences better than 1.0e-01: 333 Number of HSP's better than 0.1 without gapping: 540 Number of HSP's successfully gapped in prelim test: 434 Number of HSP's that attempted gapping in prelim test: 130717 Number of HSP's gapped (non-prelim): 1407 length of query: 254 length of database: 1,040,396,356 effective HSP length: 126 effective length of query: 128 effective length of database: 652,635,892 effective search space: 83537394176 effective search space used: 83537394176 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.3 bits) S2: 91 (39.6 bits)