BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (330 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P37749 Uncharacterized protein yefG n=5 Tax=Escherichia... 682 0.0 UniRef50_Q4JZC8 Putative glycosyl transferase n=2 Tax=Streptococ... 183 8e-45 UniRef50_Q7P740 Nucleotide sugar synthetase n=1 Tax=Fusobacteriu... 181 5e-44 UniRef50_Q4JYT0 Putative glycosyl transferase n=1 Tax=Streptococ... 177 6e-43 UniRef50_D0R4M2 Putative glycosyltransferase n=1 Tax=Lactobacill... 173 9e-42 UniRef50_Q4JYV0 Putative glycosyl transferase n=2 Tax=Streptococ... 170 6e-41 UniRef50_Q1WU31 Galactofuranosyltransferase n=2 Tax=Lactobacillu... 169 2e-40 UniRef50_D0RXG2 Galactofuranose transferase n=13 Tax=Streptococc... 169 2e-40 UniRef50_D1PTN6 Putative uncharacterized protein n=1 Tax=Prevote... 167 4e-40 UniRef50_C2EH03 Possible galactofuranosyltransferase n=1 Tax=Lac... 167 6e-40 UniRef50_C0BRQ3 Putative uncharacterized protein n=2 Tax=Bifidob... 165 2e-39 UniRef50_B0N1W1 Putative uncharacterized protein n=1 Tax=Clostri... 165 2e-39 UniRef50_B0BR56 Glycosyltransferase n=1 Tax=Actinobacillus pleur... 164 4e-39 UniRef50_UPI000196921F hypothetical protein BACCELL_02894 n=1 Ta... 162 2e-38 UniRef50_D0BKT6 Galactofuranosyltransferase n=1 Tax=Granulicatel... 159 1e-37 UniRef50_B0P5G1 Putative uncharacterized protein n=1 Tax=Clostri... 159 2e-37 UniRef50_A8RK64 Putative uncharacterized protein n=1 Tax=Clostri... 158 3e-37 UniRef50_A0Z7X9 Putative uncharacterized protein n=1 Tax=marine ... 155 1e-36 UniRef50_C0WVC5 Possible galactofuranosyltransferase n=2 Tax=Lac... 155 3e-36 UniRef50_C7IU57 Putative uncharacterized protein n=1 Tax=Thermoa... 154 4e-36 UniRef50_C2F0P9 Galactofuranosyltransferase n=2 Tax=Lactobacillu... 150 6e-35 UniRef50_Q032N6 Glycosyltransferase n=1 Tax=Lactococcus lactis s... 147 5e-34 UniRef50_A7AHD2 Putative uncharacterized protein n=1 Tax=Parabac... 146 1e-33 UniRef50_Q03GL2 Glycosyltransferase n=1 Tax=Pediococcus pentosac... 145 2e-33 UniRef50_A5ZF92 Putative uncharacterized protein n=2 Tax=Bactero... 142 1e-32 UniRef50_C4Z1X5 Putative uncharacterized protein n=1 Tax=Eubacte... 142 2e-32 UniRef50_A2RHU2 Putative galactofuranose transferase n=1 Tax=Lac... 142 2e-32 UniRef50_UPI0001968A2E hypothetical protein BACCELL_04078 n=1 Ta... 137 4e-31 UniRef50_C2E8T4 Possible galactofuranosyltransferase n=1 Tax=Lac... 137 6e-31 UniRef50_C2EVL7 Possible galactofuranosyltransferase n=1 Tax=Lac... 137 6e-31 UniRef50_C2FKS4 Possible galactofuranosyltransferase n=1 Tax=Lac... 136 1e-30 UniRef50_UPI000196CD65 hypothetical protein CATMIT_02517 n=1 Tax... 136 1e-30 UniRef50_C9LJY2 Putative uncharacterized protein n=1 Tax=Prevote... 132 1e-29 UniRef50_C7TE97 Glycosyl transferase,galactofuranosyltransferase... 132 2e-29 UniRef50_C9LPN1 Galactofuranosyltransferase n=2 Tax=Veillonellac... 131 4e-29 UniRef50_C7XW37 Glycosyltransferase n=1 Tax=Lactobacillus coleoh... 128 4e-28 UniRef50_C0YXY6 Possible galactofuranosyltransferase n=5 Tax=Lac... 126 9e-28 UniRef50_D1PDY2 Putative galactofuranosyltransferase n=1 Tax=Pre... 125 3e-27 UniRef50_C6Z1L9 Glycosyltransferase n=1 Tax=Bacteroides sp. 4_3_... 122 2e-26 UniRef50_C3QC04 Galactofuranosyltransferase n=3 Tax=Bacteroides ... 114 7e-24 UniRef50_A7HN15 Galactofuranosyltransferase n=1 Tax=Fervidobacte... 107 4e-22 UniRef50_Q04DG9 Glycosyltransferase n=1 Tax=Oenococcus oeni PSU-... 105 3e-21 UniRef50_Q042V6 Glycosyltransferase n=4 Tax=Lactobacillus gasser... 96 1e-18 UniRef50_C9A0R8 Putative uncharacterized protein n=1 Tax=Enteroc... 93 1e-17 UniRef50_C7G7A4 Putative uncharacterized protein n=1 Tax=Rosebur... 92 4e-17 UniRef50_Q03A82 Glycosyltransferase n=1 Tax=Lactobacillus casei ... 87 8e-16 UniRef50_B1MXC4 Glycosyltransferase n=3 Tax=Leuconostoc RepID=B1... 74 8e-12 UniRef50_B1MVL6 Putative glycosyl transferase n=1 Tax=Leuconosto... 74 8e-12 UniRef50_C7TIE1 Glycosyl transferase, galactofuranosyltransferas... 70 1e-10 UniRef50_B1I7N2 Nss n=10 Tax=Streptococcus pneumoniae RepID=B1I7... 64 6e-09 UniRef50_C4ZG42 Putative uncharacterized protein n=1 Tax=Eubacte... 64 9e-09 UniRef50_A3CM54 Nucleotide sugar synthetase-like protein, putati... 62 2e-08 UniRef50_Q5ULS2 Orf42 n=1 Tax=Lactobacillus phage LP65 RepID=Q5U... 61 5e-08 UniRef50_Q3DVD0 Nucleotide sugar synthetase-like protein n=9 Tax... 60 1e-07 UniRef50_C0XA00 Possible galactofuranosyl transferase n=3 Tax=La... 52 4e-05 >UniRef50_P37749 Uncharacterized protein yefG n=5 Tax=Escherichia coli RepID=YEFG_ECOLI Length = 330 Score = 682 bits (1761), Expect = 0.0, Method: Compositional matrix adjust. Identities = 330/330 (100%), Positives = 330/330 (100%) Query: 1 MYFLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGL 60 MYFLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGL Sbjct: 1 MYFLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGL 60 Query: 61 ENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDELRGGGGSDSVRLATCDMVIS 120 ENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDELRGGGGSDSVRLATCDMVIS Sbjct: 61 ENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDELRGGGGSDSVRLATCDMVIS 120 Query: 121 HNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIYT 180 HNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIYT Sbjct: 121 HNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIYT 180 Query: 181 EGCDFTLFGVNYENKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLK 240 EGCDFTLFGVNYENKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLK Sbjct: 181 EGCDFTLFGVNYENKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLK 240 Query: 241 FNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQIS 300 FNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQIS Sbjct: 241 FNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQIS 300 Query: 301 ENTKIISQKIRTGSYFRDVLEEVIDDLKTR 330 ENTKIISQKIRTGSYFRDVLEEVIDDLKTR Sbjct: 301 ENTKIISQKIRTGSYFRDVLEEVIDDLKTR 330 >UniRef50_Q4JZC8 Putative glycosyl transferase n=2 Tax=Streptococcus pneumoniae RepID=Q4JZC8_STRPN Length = 357 Score = 183 bits (464), Expect = 8e-45, Method: Compositional matrix adjust. Identities = 109/297 (36%), Positives = 173/297 (58%), Gaps = 31/297 (10%) Query: 55 TFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLK----FRIVPLIHDIDELRGG------ 104 + L L+ DV+I+ PM + +F LLK + + +IHD++ LR G Sbjct: 66 SLLWRLKKNDVVIYQHPMYGV--RVANFAIPLLKKYKNIKFISVIHDLESLRKGIQGVIE 123 Query: 105 ------GGSDSVRLATCDMVISHNPQMTKYLSKY-MSQDKIKDIKIFDYLVSSDVEHRDV 157 +D L+ D VISHNP+MT+YL + ++ + +++IFDYL S++E + + Sbjct: 124 DNETTNAIADKELLSKFDKVISHNPKMTEYLEGIGIKKENLVELQIFDYLDPSEIEEK-I 182 Query: 158 TDKQRGVIYAGNLSRHKCSFIYTE-----GCDFTLFGVNYENKDNPKYLGSFDAQSPEKI 212 D GV+ AGNL++ K S+IY LFG N+ N++ P+ + F + P K+ Sbjct: 183 ED---GVVIAGNLAKGKSSYIYKLLENELNFKLNLFGPNFINEELPENVEYFGSLPPNKL 239 Query: 213 --NLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIV 270 L G +FGL+WDGDS+ETCSG G+YLK+NNPHKTSLYL+ +PV IW +AALA FI Sbjct: 240 PQKLVG-KFGLVWDGDSLETCSGNTGNYLKYNNPHKTSLYLASGIPVIIWKEAALAQFIE 298 Query: 271 DNRIGYAVGSIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDDL 327 +N +G V ++ E++ ++ +++ Y I NT + +K+R G ++R + + +D Sbjct: 299 ENNVGITVNNLSEIEFVMQNISEGEYLSIKRNTMQLGEKLRNGYFYRQAISKCKNDF 355 >UniRef50_Q7P740 Nucleotide sugar synthetase n=1 Tax=Fusobacterium nucleatum subsp. vincentii ATCC 49256 RepID=Q7P740_FUSNV Length = 357 Score = 181 bits (458), Expect = 5e-44, Method: Compositional matrix adjust. Identities = 106/299 (35%), Positives = 168/299 (56%), Gaps = 41/299 (13%) Query: 57 LCGLENKDVLIFNFPMAKPFWHILSFFHRLLK------FRIVPLIHDIDELRGGGGS--- 107 L L++ D + F FP+ H F H +LK +IV LIHD++ +R Sbjct: 68 LSVLKSGDSIFFQFPVV----HNSIFLHNILKRLKLKGIKIVVLIHDMESIRLISEKSLS 123 Query: 108 --DSVR--------LATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDV 157 +R L +I+ N M K+L ++ +++IFDYL+S +VE + + Sbjct: 124 FLQKLRIKIEEFEFLKASSYLITPNKYMRKFLEDKNITIQMGELEIFDYLISEEVEEKIL 183 Query: 158 TDK---QRGVIYAGNLSRHKCSFIYT--EGCDFTLFGVNY-ENKD----NPKYLGSFDAQ 207 K + ++ AGNLS+ K +++Y +F L+GVNY E+KD N Y GS+ A Sbjct: 184 EKKVSSKNSIVIAGNLSKEKSAYVYLLPTNLNFELYGVNYIEDKDSQSENINYNGSYMAD 243 Query: 208 SPEKINLPGM---QFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAA 264 LP + +FGL+WDG S+ETC G +G YL +NNPHK SLYL E+P+ IW+KAA Sbjct: 244 K-----LPAVLNGKFGLVWDGSSIETCKGGYGKYLMYNNPHKVSLYLVSEIPIIIWEKAA 298 Query: 265 LADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEV 323 LA FI++N+IG+ + S+ ++ E + ++ E YK + +NT I SQ++ G Y + ++ ++ Sbjct: 299 LASFIIENKIGFTINSLNDINEKLKGLSDEEYKVMKQNTVIFSQRLSKGFYLKKIIRDI 357 >UniRef50_Q4JYT0 Putative glycosyl transferase n=1 Tax=Streptococcus pneumoniae RepID=Q4JYT0_STRPN Length = 354 Score = 177 bits (448), Expect = 6e-43, Method: Compositional matrix adjust. Identities = 120/350 (34%), Positives = 193/350 (55%), Gaps = 29/350 (8%) Query: 1 MYFLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGL 60 +Y++++ S A KA D I D + +V + +V+ + KL L + Sbjct: 4 LYYIHEEFGSDSTAATKAPNDLQKIFQDCKFKPLVTLKKNSKIVRIFDYAFKLLLCLIRI 63 Query: 61 ENKDVLIFNFPMAKPFWHILSFFHRLLKFR---IVPLIHDIDELRGGGG-----SDSVRL 112 + D++IF FP A + +F +LL+++ ++ LI+D++ LR G S + Sbjct: 64 RSNDIVIFQFPFA-THGKLKNFLMKLLQYKKAKMIFLINDLESLRYSGNKKNLISKEQYI 122 Query: 113 ATCDMVISHNPQMTKYL-SKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLS 171 D++I HN +M ++L + +KI + +FDYL+ D +++ + V+ AGNLS Sbjct: 123 KNADVIICHNQRMKEFLIENKIDSEKIVVLGVFDYLL--DKFNKEKASFDKTVVIAGNLS 180 Query: 172 RHKCSFIYTE------GCDFTLFGVNY----ENKDNPKYLGSFDAQSPEKIN--LPGMQF 219 K ++ TE F L+G N+ N D Y GSF SPEKI L G F Sbjct: 181 PQKSGYL-TELLKNENRIKFNLYGPNFTSSTNNNDCVSYKGSF---SPEKIPFILEG-DF 235 Query: 220 GLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVG 279 GL+WDGDS+ TCSG G+YLK+NNPHK SL+++ ++PV IW ++AL+DF+ +N IG V Sbjct: 236 GLVWDGDSILTCSGITGEYLKYNNPHKVSLFIASKIPVIIWKQSALSDFVKENNIGIVVN 295 Query: 280 SIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDDLKT 329 + EMQEI+ +MT E Y+ EN + +S+K+R G + +E+ + +K Sbjct: 296 DLIEMQEIITNMTEEQYEIFRENIEQLSKKVRQGYFTNLAIEKSLSIIKN 345 >UniRef50_D0R4M2 Putative glycosyltransferase n=1 Tax=Lactobacillus johnsonii FI9785 RepID=D0R4M2_LACJF Length = 349 Score = 173 bits (438), Expect = 9e-42, Method: Compositional matrix adjust. Identities = 112/338 (33%), Positives = 183/338 (54%), Gaps = 26/338 (7%) Query: 12 RDAGFKARKDALDIAS--DYENISV---VNIPLWGGVVQRIISSVKLSTFLCGLENKDVL 66 ++AG KA D + IA ++E + V N V Q+I + +E+ +L Sbjct: 14 QNAGSKAPNDVVKIAEKLNFEKLFVNVHRNESALDKVKQQIEYKSNWKSVYSKIESNSIL 73 Query: 67 IFNFPMAKPFWHILSFFHRLLKFR------IVPLIHDIDELRGGGGSDSVR------LAT 114 + P+ + H LS H L K + ++ ++HD++ELR ++ + L Sbjct: 74 LLQVPI---YVHQLSRIHFLKKIKSQKKVKLIFVVHDVEELRVAFNNNFQKKQFEDMLKL 130 Query: 115 CDMVISHNPQMTKYLSKY-MSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRH 173 D+++ HN M + K ++KI ++KIFDYL + D+ + + K+ VI AGNL Sbjct: 131 ADVIVVHNEVMANFFEKKGFPKEKIVNLKIFDYLYNFDLNKKVIFSKK--VIIAGNLDEK 188 Query: 174 KCSFIYTEG---CDFTLFGVNYENKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVET 230 K ++ F L+G NY K++ K + E NL FGLIWDG+S+ET Sbjct: 189 KTEYLKKLDKIDAKFDLYGPNYVKKNSNKITYKGVVPANELPNLLDSGFGLIWDGNSIET 248 Query: 231 CSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDS 290 CSG FG+YLK+NNPHK SLYL+ LPVFIW KAA A F+ +N +GY + S+ ++ I++ Sbjct: 249 CSGYFGNYLKYNNPHKLSLYLTAGLPVFIWSKAAEAKFVDENHLGYTIDSLSDIPLILER 308 Query: 291 MTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDDLK 328 +T+ Y ++ +N +++ +KI G + L + I+++K Sbjct: 309 LTLADYNRLIKNVRLVGEKISRGDFMTVALTDAINNIK 346 >UniRef50_Q4JYV0 Putative glycosyl transferase n=2 Tax=Streptococcus pneumoniae RepID=Q4JYV0_STRPN Length = 356 Score = 170 bits (430), Expect = 6e-41, Method: Compositional matrix adjust. Identities = 113/349 (32%), Positives = 190/349 (54%), Gaps = 47/349 (13%) Query: 11 RRDAGFKARKDALDIASD--YENI---SVVNIPLWGGVVQRIISSVKLSTF----LCGLE 61 +++AG KAR+D DI Y+ + S +N VQR++ K+ L + Sbjct: 15 KKNAGGKARQDVTDILESIGYQKLIAESEMNERQELNAVQRLVHHYKVKKMWKKTLSVVG 74 Query: 62 NKDVLIFNFPMAKPFWHILSFFHRLLK------FRIVPLIHDIDELRGGGGSDSVRLAT- 114 D +I FP+ H L FF++++K ++ LIHD++ LR S S+ L + Sbjct: 75 KGDEVIIQFPLLN---HSL-FFNQVIKQLSKNGVKVYFLIHDLESLRWSQ-SKSISLKSR 129 Query: 115 -------------CDMVISHNPQMTKYLSKY-MSQDKIKDIKIFDYLVSSDVEHRDVTDK 160 + +I+HN +M Y+ Y + KI ++ FDY++ S E +++ + Sbjct: 130 IRLNIEEHSVLRLSEGIIAHNKKMKSYIKTYSVESSKIIPLETFDYIIPSYHERKNLDNF 189 Query: 161 QRG--VIYAGNLSRHKCSFIY--TEGCDFTLFGVNYENKDNPK--YLGSFDAQSPEKIN- 213 Q ++ AGNL +HK ++Y +F L+G+ YE D+ Y GSF PE++ Sbjct: 190 QLNAPIVIAGNLKQHKAGYVYHLPSNVEFNLYGIGYEQTDDKSVHYCGSF---MPEELPF 246 Query: 214 -LPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDN 272 L G FGL+WDG S E+C +G+YL+ NNPHKTSLYL+ +PV +W +AA+A FI +N Sbjct: 247 VLKG-SFGLVWDGPSSESCIETYGEYLRVNNPHKTSLYLASGIPVVVWSEAAIASFIKEN 305 Query: 273 RIGYAVGSIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLE 321 G V ++ E+ E++ +T++ Y+ + +NT+II +++R G Y + ++ Sbjct: 306 NCGILVSNLSELPELLSMITVDEYELMKKNTEIIGERLRQGFYTKQAVK 354 >UniRef50_Q1WU31 Galactofuranosyltransferase n=2 Tax=Lactobacillus salivarius RepID=Q1WU31_LACS1 Length = 335 Score = 169 bits (427), Expect = 2e-40, Method: Compositional matrix adjust. Identities = 96/241 (39%), Positives = 147/241 (60%), Gaps = 11/241 (4%) Query: 90 RIVPLIHDIDELRGGGGSDSVRLATCDM---VISHNPQMTKYLSKYMSQDKIKDIKIFDY 146 ++ LIHD++ LR G ++ L +M VI+HN +M +L + I D++IFDY Sbjct: 94 KVYILIHDLESLRFKNGGNNFELDLLNMSDGVIAHNKKMIDWLRNNGVEVPIVDLEIFDY 153 Query: 147 LVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIYTEGCDF--TLFGVNYENKDNPKYLGSF 204 + ++ + DK V YAGNL +K +F+ DF TLFG N+ KY+ Sbjct: 154 DNNIPLQENYIFDK--SVCYAGNL--NKATFLKEYEPDFKLTLFGPNFSPALMSKYIEYK 209 Query: 205 DAQSPEKI--NLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDK 262 + SP+++ L FGLIWDGDS + CSG +G+YLK+NNPHKTSLYLS +P+ IW + Sbjct: 210 GSLSPDELAKELLTQNFGLIWDGDSSKGCSGIYGEYLKYNNPHKTSLYLSSGMPIIIWRE 269 Query: 263 AALADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEE 322 AALA+F+ N++G V ++ +++ I+D MT E Y++I NT I+ K+R+G Y + + E Sbjct: 270 AALAEFVDKNKLGIVVDNLSQIKPILDKMTKEEYQEIKSNTIKIAHKLRSGFYIKKAITE 329 Query: 323 V 323 + Sbjct: 330 L 330 >UniRef50_D0RXG2 Galactofuranose transferase n=13 Tax=Streptococcus RepID=D0RXG2_9STRE Length = 351 Score = 169 bits (427), Expect = 2e-40, Method: Compositional matrix adjust. Identities = 122/361 (33%), Positives = 189/361 (52%), Gaps = 49/361 (13%) Query: 2 YFLND---LNFSRRDAGFKARKD--ALDIASDYENISVVNIPLWGGV----VQRIISSVK 52 Y+L D N ++AG KAR D A+ I+ YE + + + W + Q+ Sbjct: 3 YYLKDSFLHNEHEKNAGSKARNDVEAILISEGYEGLEL-KVENWYKMNFFKAQQHKYRAT 61 Query: 53 LSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLK------FRIVPLIHDIDELRGGGG 106 S F L D L+ FP+ H F +L+K + LIHD++ LR G Sbjct: 62 KSVF-DQLGAGDELVIQFPII----HHTFFISQLIKQAQKRGAKFYLLIHDVETLRHAAG 116 Query: 107 SD-----SVR--------LATCDMVISHNPQMTKYL-SKYMSQDKIKDIKIFDYLVSSDV 152 S+ VR L + D +I HN M K L + + DK+ ++IFDYL+ + Sbjct: 117 SEVKFRHKVRNYFQEKKALMSVDGIIVHNDIMKKVLVGQGVPADKMASLEIFDYLIP-NF 175 Query: 153 EHRDVTDKQRGVIYAGNLSRHKCSFIYT--EGCDFTLFGVNYENK---DNPKYLGSFDAQ 207 E + + K + +I AGNL+ K ++Y + + L+GV Y+ N Y GSF Sbjct: 176 EVQALPQKDQPIIVAGNLNPAKSGYLYNLPDQPAYNLYGVGYDESRALKNTSYFGSF--- 232 Query: 208 SPEKINLPGM---QFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAA 264 P+ +LP FGL+WDGDS ETC G++G+YL+FNN HK SLYL+ PV +W ++A Sbjct: 233 MPD--DLPAALEGSFGLVWDGDSSETCQGSYGNYLRFNNSHKASLYLASGFPVVVWKESA 290 Query: 265 LADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVI 324 LA FI++ G AV S+ +++ +++++T + Y +SEN K I + +R G Y R L+++ Sbjct: 291 LAHFILEKSCGIAVASLHDLEAVLENLTEKEYADLSENAKRIGKDLREGYYLRSALKKLN 350 Query: 325 D 325 D Sbjct: 351 D 351 >UniRef50_D1PTN6 Putative uncharacterized protein n=1 Tax=Prevotella bergensis DSM 17361 RepID=D1PTN6_9BACT Length = 338 Score = 167 bits (424), Expect = 4e-40, Method: Compositional matrix adjust. Identities = 106/327 (32%), Positives = 166/327 (50%), Gaps = 19/327 (5%) Query: 17 KARKDALDIASDYENISVVNIPLWGGVVQRIISSV-KLSTFLCGLENKDVLIFNFPMAKP 75 KA+KD + +++ + G + R ++ + + L L+ DVL +PM K Sbjct: 12 KAKKDIDTVVEQLGYVNLSKVQCGNGGIGRFLTKLLAMVNILTTLKRDDVLFLQYPMKK- 70 Query: 76 FWHILSFFHRLLKFRIVPLIHDIDELRGGG---GSDSVRLATCDMVISHNPQMTKYLSKY 132 F+ + L ++V +IHD+ R ++ + D +I+HNP MT+YL ++ Sbjct: 71 FYKMACTLAHLKGAKVVTVIHDLGAFRRHKLTPEQENRLFSKTDFLIAHNPTMTEYLQQH 130 Query: 133 MSQDKIKDIKIFDYLVSSDVEHRDVT--DKQRGVIYAGNLSRHKCSFIY-----TEGCDF 185 Q + + IFDYL + V + D R ++YAGNL + F+Y + Sbjct: 131 GFQGGVHHLGIFDYLSAKPVRQPNAQPHDPWR-IVYAGNLGVWRNEFLYHLDTAIKHWTL 189 Query: 186 TLFGVNYENKDNP----KYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKF 241 L+G +E K N Y G D S E I FGL+WDG SV+ C+GA+G+YLK Sbjct: 190 DLYGKGFEPKKNNCQKLTYHGFID--SDEFIERVDADFGLVWDGASVDECNGAWGEYLKI 247 Query: 242 NNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQISE 301 NNPHKTS YL +PV +W K+A+A FI N +G V S+ E+ ++ +T E Y+ + Sbjct: 248 NNPHKTSFYLRAGIPVIVWSKSAMAPFIRKNGLGLTVDSLAEIDSHLEQLTPEQYQAMRA 307 Query: 302 NTKIISQKIRTGSYFRDVLEEVIDDLK 328 N I QK+ TGS+ + L+ + K Sbjct: 308 NAYTIGQKLATGSHIKRGLDAAQEYFK 334 >UniRef50_C2EH03 Possible galactofuranosyltransferase n=1 Tax=Lactobacillus salivarius ATCC 11741 RepID=C2EH03_9LACO Length = 344 Score = 167 bits (422), Expect = 6e-40, Method: Compositional matrix adjust. Identities = 113/343 (32%), Positives = 185/343 (53%), Gaps = 29/343 (8%) Query: 2 YFLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLC--- 58 Y L+ + + +AG KA++D I S E ++ NI + + I S + +L Sbjct: 12 YLLSVYDKTEYNAGPKAKRDISRILS--EKLNFKNIEFYFNLDNTIFSKINKIKYLNWDI 69 Query: 59 --GLENK--DVLIFNFPMAKP--FWHILSFFHRLLKFRIVPLIHDIDELRGGGGSDSV-- 110 L+NK D + +P+ ILS R +K + ++HD++ LR ++ Sbjct: 70 PRKLKNKKIDNIFIQYPIYSTVVIKKILSSLDRDVK--VYYIVHDLESLRLFKNDENYLS 127 Query: 111 ----RLATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIY 166 RL D +ISHN MTK+L + + ++ D++IFDYL + + +K + Y Sbjct: 128 EEINRLNDADGIISHNSIMTKWLKENGVKTQVSDLEIFDYLTKNVAPESNSYEKT--LCY 185 Query: 167 AGNLSRHKCSFIYTEGCDFTLFGVNYENKDNPKYLGSFDAQSPEKINLPGM---QFGLIW 223 AGNL K F+ E ++G N K+ PK + +PE+ LP FGLIW Sbjct: 186 AGNL--QKSDFLVNEFYPIDVYGPN-PKKEYPKTVSYKGVFTPEE--LPKHLKENFGLIW 240 Query: 224 DGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKE 283 DG+ ++ C+G +G+Y+K+NNPHK SLYLS LPV IW+KAALA+F+ +++G VGS+ + Sbjct: 241 DGNRIDECNGVYGEYMKYNNPHKVSLYLSSGLPVIIWEKAALAEFVSKHQVGIVVGSLAQ 300 Query: 284 MQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDD 326 +Q + S+T E Y + N +++S+K++ G Y + +ID+ Sbjct: 301 LQNKLGSLTEEEYLNLRYNAQLVSEKLKNGYYIVKAVSNLIDN 343 >UniRef50_C0BRQ3 Putative uncharacterized protein n=2 Tax=Bifidobacterium RepID=C0BRQ3_9BIFI Length = 354 Score = 165 bits (418), Expect = 2e-39, Method: Compositional matrix adjust. Identities = 107/286 (37%), Positives = 160/286 (55%), Gaps = 28/286 (9%) Query: 55 TFLCGLENKDVLIFNFPMAKPFWHILSFFH----RLLKFR---IVPLIHDIDELRGGGGS 107 T CG DV++ FP+ ++ +S + R +K R V LIHD++ LRG + Sbjct: 66 TVRCG----DVVLVQFPLI--MYNKVSLYALPSVRRMKARGALFVFLIHDLETLRGYSYT 119 Query: 108 D--SVRLATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQR-GV 164 D + D++ISHNP+M++ L KY + I +I IFDYL+ + V +QR G+ Sbjct: 120 DFDKQWVTEADLLISHNPRMSEVLRKYGATVPIVEIGIFDYLLP---QANPVPMEQRHGI 176 Query: 165 IYAGNLSRHKCSFIYT-----EGCDFTLFGVNYENKDNPKYLGSFDAQSPEKI--NLPGM 217 AGNLS K ++Y D L+G Y+ + N K +P+++ L G Sbjct: 177 DIAGNLSHGKAEYVYRLAERFPKADINLYGPKYDRR-NGKTAWYRGIVAPDELPDKLEG- 234 Query: 218 QFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYA 277 +FGLIWDGDS++TC G +G YL NNPHK SLYL+ + PV IW+KAALA F+V+ +G A Sbjct: 235 RFGLIWDGDSLDTCGGYYGKYLTVNNPHKLSLYLAADKPVIIWNKAALAPFVVEQGVGVA 294 Query: 278 VGSIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEV 323 V S++E + MT Y ++ + QK+R G + R+V+ +V Sbjct: 295 VESLQEAMAVEYGMTQSEYARMVRRASQLGQKLREGWFTREVMAKV 340 >UniRef50_B0N1W1 Putative uncharacterized protein n=1 Tax=Clostridium ramosum DSM 1402 RepID=B0N1W1_9FIRM Length = 358 Score = 165 bits (417), Expect = 2e-39, Method: Compositional matrix adjust. Identities = 93/278 (33%), Positives = 157/278 (56%), Gaps = 17/278 (6%) Query: 60 LENKDVLIFNFPMAKPFWH--ILSFFHRLLKFRIVPLIHDIDELRGGGGS--DSVRL-AT 114 + + D+L+ +P K + ++ + K + +IHD+ ++ + ++L Sbjct: 74 IRDNDILVIQYPFGKYDVNDRQIAKIKKTKKVDFIAIIHDLPSIQDKTADKLEEIKLLKK 133 Query: 115 CDMVISHNPQMTKYLSKY-MSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRH 173 D+VI HN +M + L + + +K+ ++IFDYL + D++ R K G+ AGNLS Sbjct: 134 FDIVICHNKKMLEVLKELGIDNNKLVCLEIFDYLCNEDIKAR--VSKDDGITVAGNLSSS 191 Query: 174 KCSFIYT-------EGCDFTLFGVNYENKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGD 226 K +IY E F L+G N+E + Y GS + E I +GLIWDGD Sbjct: 192 KAGYIYKLLDKCNEENIIFNLYGPNFERDNESSYNGSLPPE--ELIKKIKGSYGLIWDGD 249 Query: 227 SVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQE 286 S+E C+G FG+Y K NNPH+ S+ L+ ++P+ IW +AAL DF++DN IG A+ S+K +++ Sbjct: 250 SLELCNGTFGEYQKINNPHRVSMNLAAKMPILIWKEAALKDFVIDNNIGVAIDSLKNIKD 309 Query: 287 IVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVI 324 I++S+ Y + +N + +S+KIR+G Y + + E I Sbjct: 310 ILNSIKDSDYDIMRDNLESVSKKIRSGYYTKKAINEAI 347 >UniRef50_B0BR56 Glycosyltransferase n=1 Tax=Actinobacillus pleuropneumoniae serovar 3 str. JL03 RepID=B0BR56_ACTPJ Length = 349 Score = 164 bits (415), Expect = 4e-39, Method: Compositional matrix adjust. Identities = 119/352 (33%), Positives = 192/352 (54%), Gaps = 40/352 (11%) Query: 2 YFLNDLNFSRRDAGFKARKDALDIA-SDYENISVVNIP-----LWGGVVQRIISSVK-LS 54 Y + +L+ AG KA +D +IA S +VV L +++++I + L Sbjct: 4 YQIVELSTEHNHAGSKAVQDVYEIALSMGYKANVVRTATSVDSLLAKILRQVIFFIDWLK 63 Query: 55 TFLCGLENKDVLIFNFPMAKPFWH-------ILSFFHRLLKFRIVPLIHDIDELRGGGGS 107 + N VLI N P++H IL+ R+ K + + L+HD++ELR + Sbjct: 64 IYFSIESNSIVLIQN-----PYYHKQLIRNWILNRLKRIKKVKFISLVHDVEELRKSLYN 118 Query: 108 DSVR------LATCDMVISHNPQMTKY-LSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDK 160 + + L+ D +I HN +M + + K S+DK+ + IFDYL S V+ + V+ Sbjct: 119 NYYKNEFETMLSLADSIIVHNDKMKSFFIKKGYSEDKLISLGIFDYLQKS-VDKKRVSF- 176 Query: 161 QRGVIYAGNLSRHKCSFIYTEGC----DFTLFGVNYENK----DNPKYLGSFDA-QSPEK 211 +R + AGNL K S+I G L+G N+E+ N +Y GSF A + P+K Sbjct: 177 ERAISVAGNLDIKKSSYIAQLGSLPAIKAHLYGPNFEHSLEAFPNIEYHGSFPATEIPQK 236 Query: 212 INLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVD 271 + + G FGL+WDG S+ETC+G FG+YL++NNPHK SLYLS +PV IWDKAA ADF+ Sbjct: 237 L-VSG--FGLVWDGQSIETCTGDFGEYLQYNNPHKLSLYLSSGMPVVIWDKAAEADFVKK 293 Query: 272 NRIGYAVGSIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEV 323 + +G V S+ E+Q+ ++ MT + ++++ N + + + +G Y + + E Sbjct: 294 HNVGLCVSSLSELQDKLNVMTEQEFEEMVNNVEKQTACLISGEYTKKAISEA 345 >UniRef50_UPI000196921F hypothetical protein BACCELL_02894 n=1 Tax=Bacteroides cellulosilyticus DSM 14838 RepID=UPI000196921F Length = 345 Score = 162 bits (409), Expect = 2e-38, Method: Compositional matrix adjust. Identities = 105/334 (31%), Positives = 172/334 (51%), Gaps = 15/334 (4%) Query: 2 YFLNDLNFSRRDAGFKARKDALDIASD--YENISVVNIPLWGGVVQRIISSVKLSTFLCG 59 Y+L+ +AG KA+ D + S Y+N + + +I+ + L Sbjct: 4 YYLSKNYNGLNNAGNKAKTDIEETLSKLGYKNAGLPQTTYSNKIAGFLITLAGVLKVLFT 63 Query: 60 LENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDELRG---GGGSDSVRLATCD 116 + DV++ +P K + + + H L + +++ +IHD+ R + RL D Sbjct: 64 VSANDVVVVQYPFKKYYSFVCNIIH-LKRGKVITIIHDLGTFRRKKLTAEQEIKRLNHSD 122 Query: 117 MVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSS-DVEHRDVTDKQRGVIYAGNLSRHKC 175 ++I HN +M +L + + ++IFDYL S + ++ K VIYAG L+ K Sbjct: 123 VLIVHNDKMEIWLKEQGYTKPMVCLEIFDYLSPSVNNNTQEPNQKPIKVIYAGALTYKKN 182 Query: 176 SFIYT-----EGCDFTLFGVNYEN-KDNPKYLGSFDAQSP--EKINLPGMQFGLIWDGDS 227 ++Y+ F L+G +E K K L F P + I FGLIW+GDS Sbjct: 183 RYLYSLNDVMSKWQFELYGGGFEEAKIEDKTLFKFKGFVPSDQLIEQVSAHFGLIWEGDS 242 Query: 228 VETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEI 287 + TCSG FG YL+ NNPHK SLY+ LP+ IW +AALA F+ +N+IG + S++E+ I Sbjct: 243 IHTCSGDFGIYLRINNPHKVSLYIRCNLPIIIWKEAALASFVAENKIGVCIDSLEELDSI 302 Query: 288 VDSMTIETYKQISENTKIISQKIRTGSYFRDVLE 321 + S++ E+Y ++ N K I++KI +G Y + +E Sbjct: 303 LSSISAESYNEMVRNIKEINKKIASGYYCKRAVE 336 >UniRef50_D0BKT6 Galactofuranosyltransferase n=1 Tax=Granulicatella elegans ATCC 700633 RepID=D0BKT6_9LACT Length = 331 Score = 159 bits (402), Expect = 1e-37, Method: Compositional matrix adjust. Identities = 109/345 (31%), Positives = 177/345 (51%), Gaps = 45/345 (13%) Query: 2 YFLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLE 61 Y+L + + AG KAR DA I + G +++ + L Sbjct: 3 YYLKENYAKAKHAGSKARLDAEKIMVE------------AGYAPYFLNN---HSNAVPLT 47 Query: 62 NKDVLIFNFPMAKPFWH-----ILSFFHRLLKFRIVPLIHDIDELRGGG----------- 105 DV++ FP+ W IL+ F + KF+ LIHDI+ LR Sbjct: 48 KDDVIVLQFPL---LWQSLKKQILTRFLKNRKFKAYLLIHDIESLRNRKIKTVKDFKHSI 104 Query: 106 ---GSDSVRLATCDMVISHNPQMTKYLSKY-MSQDKIKDIKIFDYLVSSDVEHRDVTDKQ 161 + L D +I+HN +M L + + ++KI +++FDY++ E + +K Sbjct: 105 IYFLQNKTVLEKVDGIIAHNDKMKAELVRLGIPEEKIVALEMFDYVIPH-YEEKTAYEKN 163 Query: 162 RGVIYAGNLSRHKCSFI--YTEGCDFTLFGVNYENKDNPKYLGSFDAQSPEKI--NLPGM 217 VI AGN K + +F+++G+N+E + PK + A SP+++ +L G Sbjct: 164 T-VIVAGNFDIRKTKYARQLPGNPEFSIYGINFEEEHLPKNVHYKGAFSPDELSHHLQG- 221 Query: 218 QFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYA 277 FGL+WDGDS TCSG +G+YLK NNPHK SLYL+ P+ +W ++ALADF+ N+ G Sbjct: 222 GFGLVWDGDSPHTCSGMYGEYLKMNNPHKASLYLASGFPIIVWSQSALADFVRQNKCGIL 281 Query: 278 VGSIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEE 322 V S+ E+ E ++S++ Y+++ +N+K I +KIR G + + LE+ Sbjct: 282 VDSLFEIAESLESLSENDYQEMIKNSKRIGKKIRNGIFLKTALEK 326 >UniRef50_B0P5G1 Putative uncharacterized protein n=1 Tax=Clostridium sp. SS2/1 RepID=B0P5G1_9CLOT Length = 359 Score = 159 bits (401), Expect = 2e-37, Method: Compositional matrix adjust. Identities = 99/286 (34%), Positives = 158/286 (55%), Gaps = 23/286 (8%) Query: 60 LENKDVLIFNFPMAKPFWHI--LSFFHRLLKFRIVPLIHDIDELRGG-----GGSDSVRL 112 L+ DVLI FP+ + F L + + + +IHD++ LR +RL Sbjct: 75 LKEGDVLIIQFPLQEGFIFASHLLKNLKKKNIKTIAVIHDLETLRITKDNTISKKRKIRL 134 Query: 113 ATCDM--------VISHNPQMTKYL-SKYMSQDKIKDIKIFDYLVSSDVE-HRDVTDKQR 162 ++ ++ HN M K L K +S+D + +K+FDYL+ E + K+ Sbjct: 135 YIEEIPTLKQFSKIVVHNQSMKKVLMKKGISEDSMVTLKMFDYLIKEGNELPGNTKSKEN 194 Query: 163 GVIYAGNLSRHKCSFIYTEGCD--FTLFGVNYEN-KDNP-KYLGSFDAQSPEKINLPGMQ 218 +I AGNLS +K ++Y D F L+G+NY DN KY GS+ + +L G Sbjct: 195 NIIIAGNLSSYKVGYVYELPHDVKFDLYGINYTGVTDNKIKYHGSYPSDELP-WHLKGA- 252 Query: 219 FGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAV 278 +GL+WDGD+ +TCSG FGDYL+ NNPHKTSLYL+ +P+ W+KAA+A ++ NR+G V Sbjct: 253 YGLVWDGDTAKTCSGIFGDYLRINNPHKTSLYLACGIPIITWNKAAIAQYVRKNRVGITV 312 Query: 279 GSIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVI 324 S+ E+ E + ++ + Y + +N K S+++R G Y + ++E + Sbjct: 313 SSLDEINEKLKDVSKDEYNLMRKNAKKCSERVRKGYYLKKAIQEAL 358 >UniRef50_A8RK64 Putative uncharacterized protein n=1 Tax=Clostridium bolteae ATCC BAA-613 RepID=A8RK64_9CLOT Length = 349 Score = 158 bits (399), Expect = 3e-37, Method: Compositional matrix adjust. Identities = 115/342 (33%), Positives = 177/342 (51%), Gaps = 39/342 (11%) Query: 11 RRDAGFKARKDALDIASD--YENISVVNIPLWGGVVQRIISSVKLSTFLCGLENK----D 64 + +AG KA D L ++ + Y+ I + VQ IIS V ++T+ L NK D Sbjct: 14 QNNAGSKAGNDVLRVSQECGYKLIPLYESNQVRTRVQDIISGV-IATY--SLRNKLVDGD 70 Query: 65 VLIFNFPMAKPFW-HILSFFHRL-LKFRIVPLIHDIDELR----GGGGSDSVR------L 112 +++ +P+ + +I R K RI LIHDID LR G G D ++ L Sbjct: 71 IVLMQYPLNRLLMKNIFRILKRCKSKIRIATLIHDIDYLRDIPLGDKGVDGMKVLELSLL 130 Query: 113 ATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSR 172 + D +I HNP M + L K + + +FDYL D +++ + VI AGNL Sbjct: 131 GSSDYLICHNPFMIRTLQKEKLSVEYISLDLFDYLY--DGTPATISEDKSTVIVAGNLLE 188 Query: 173 HKCSFIYTEGCD-----FTLFGVNYE----NKDNPKYLGSFDAQSPEKI--NLPGMQFGL 221 K ++Y D +L+G NY DN Y GSF P+++ NL G +GL Sbjct: 189 SKAGYLYQIKKDKHKFALSLYGSNYAVDKMQMDNATYHGSF---KPDELIANLYG-AYGL 244 Query: 222 IWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSI 281 +WDG S ETCSG++G YL+ NNPHK SLY++ +PV IW +AAL I +N +G+ + S+ Sbjct: 245 VWDGSSTETCSGSYGKYLRINNPHKVSLYIAAGIPVVIWKEAALCSLIEENALGFGISSL 304 Query: 282 KEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEV 323 E++E + S Y+ N + +K+ +G + + VL ++ Sbjct: 305 DELEEALKSHE-HLYQSYRNNVLNMKEKVCSGGFLKYVLVQI 345 >UniRef50_A0Z7X9 Putative uncharacterized protein n=1 Tax=marine gamma proteobacterium HTCC2080 RepID=A0Z7X9_9GAMM Length = 348 Score = 155 bits (393), Expect = 1e-36, Method: Compositional matrix adjust. Identities = 87/288 (30%), Positives = 156/288 (54%), Gaps = 18/288 (6%) Query: 57 LCGLENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDELRGGGGSDSVRLATC- 115 L L+ +L+ +P K ++ + +L +++ +IHD+ R +A+ Sbjct: 63 LLRLKRHSILVVQYP-TKKYYDFIVQIAKLKHCKVITIIHDLRSHRKQKMHVDKEMASLN 121 Query: 116 --DMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQR-GVIYAGNLSR 172 D+VI+HN MT +L + K ++ IFDYL + + +++AG L + Sbjct: 122 KNDVVIAHNSFMTAWLQDHGLTSKAVNLNIFDYLCELKASSTPTPPRDKFRLVFAGVLEK 181 Query: 173 HKCSFIYT----EGCDFT--LFGVNYENKDNPK-----YLGSFDAQSPEKINLPGMQFGL 221 K F+Y+ FT L+G+ + + + P+ Y G F A E ++ +FG+ Sbjct: 182 RKNGFLYSLDALNAKSFTCNLYGIGFNDSELPQDSIVTYQGVFPAD--EIVDRVEGEFGI 239 Query: 222 IWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSI 281 +WDG S++ C G+FG+YLK NNPHKTS+YL LP+ IWD+AA+A F+ D +G AV S+ Sbjct: 240 VWDGTSLDECKGSFGEYLKINNPHKTSMYLRAGLPIIIWDQAAIATFVQDKNVGIAVASL 299 Query: 282 KEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDDLKT 329 ++ E + S++ + Y+++ N + +SQ++ G++ +EE + L + Sbjct: 300 AQVDEALQSVSDDDYREMKRNAESVSQQLGEGAFLTAAVEEAMSQLAS 347 >UniRef50_C0WVC5 Possible galactofuranosyltransferase n=2 Tax=Lactobacillus fermentum RepID=C0WVC5_LACFE Length = 350 Score = 155 bits (391), Expect = 3e-36, Method: Compositional matrix adjust. Identities = 106/307 (34%), Positives = 160/307 (52%), Gaps = 28/307 (9%) Query: 39 LWGGVVQRI----ISSVKLSTFLCGLENKDVLIFNFPM--AKPFWHILSFFHRLLKFRIV 92 LW + RI +S L F + D ++ +P+ K I H+ ++ Sbjct: 53 LWDWINTRIHKYQLSRSVLPKFFKEHPDIDNVVIQYPLYSNKLIKQITDSVHQNSHAKLY 112 Query: 93 PLIHDIDELR--------GGGGSDSVRLATCDMVISHNPQMTKYLSKYMSQDKIKDIKIF 144 +IHD + +R G DS L+ D +I HN +M K+L + + + D+ IF Sbjct: 113 FIIHDAEMIRLYADEPKRAQGELDSFNLS--DGIIGHNAKMNKFLKEQGVKVPLVDLGIF 170 Query: 145 DYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIYTEGCDFTLFGVNYENKDNP--KYLG 202 DY ++ DK V YAGNL + F LFG N N Y G Sbjct: 171 DYDNPQPLQEYKGYDK--SVCYAGNLIDAEFLQDVHPTNRFDLFGPNPAESYNEGLNYKG 228 Query: 203 SFDAQSPEKINLPGM---QFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFI 259 F SP +LP FGL+W G SV+TC G FG YLK+NNPHKTSLYLS LPV I Sbjct: 229 QF---SP--TDLPAHMDENFGLVWHGTSVDTCDGVFGRYLKWNNPHKTSLYLSSGLPVII 283 Query: 260 WDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDV 319 WD+AALADF+++N +G + S+ ++ + +D++T E Y+Q+ +N + ++ ++RTG Y Sbjct: 284 WDQAALADFVLENGVGITISSLNDLNDKLDALTEEEYRQMHDNVQKVANQMRTGYYITHA 343 Query: 320 LEEVIDD 326 +E++I++ Sbjct: 344 MEKMINN 350 >UniRef50_C7IU57 Putative uncharacterized protein n=1 Tax=Thermoanaerobacter ethanolicus CCSD1 RepID=C7IU57_THEET Length = 356 Score = 154 bits (390), Expect = 4e-36, Method: Compositional matrix adjust. Identities = 95/295 (32%), Positives = 169/295 (57%), Gaps = 33/295 (11%) Query: 63 KDVLIFNFPMAKPFWHILSFFHRLLKFR-----IVPLIHDIDELRGGGGSDSVR-----L 112 K +++ ++P+ P L F + L+ R ++ +IHD++ LR ++V+ L Sbjct: 64 KVIIVTHYPLLNPV--ALKIFIQALELRRCDITLIGIIHDVNSLRYQQNENAVQREIQFL 121 Query: 113 ATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDY-------LVSSDVEHRD--VTDKQRG 163 D +ISHN MTK+L + + KI+++++FDY +V S++ + + + + Sbjct: 122 NMFDFLISHNSAMTKWLVEQGFKGKIQELELFDYKIDGNKNIVKSEINRTEGELKENRYI 181 Query: 164 VIYAGNLSRHKCSFIYT-EGCDFT-----LFGVNYEN----KDNPKYLGSFDAQSPEKIN 213 + +AGNL K FIY+ E +F+ L+G N+ + + N Y G +++ + Sbjct: 182 ITFAGNLDPQKSGFIYSLENVNFSNLFFYLYGPNFVSNQISEKNIIYKGVYESNLL-PLY 240 Query: 214 LPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNR 273 L G +GLIWDGDSV+TCSGA G+YLK+N+PHK SLY+ LPV IW KAA A+ + + Sbjct: 241 LEG-NWGLIWDGDSVKTCSGALGNYLKYNSPHKLSLYIVAGLPVIIWSKAAAAELVKKYK 299 Query: 274 IGYAVGSIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDDLK 328 IG + S++E+ ++S++ E Y+ EN I++ K++ G + + ++I+ ++ Sbjct: 300 IGIVIDSLEEIPVKLESISNEEYQNYRENVMILANKLKKGEFIIGAVNKIINSVE 354 >UniRef50_C2F0P9 Galactofuranosyltransferase n=2 Tax=Lactobacillus reuteri RepID=C2F0P9_LACRE Length = 334 Score = 150 bits (379), Expect = 6e-35, Method: Compositional matrix adjust. Identities = 98/333 (29%), Positives = 167/333 (50%), Gaps = 16/333 (4%) Query: 2 YFLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLE 61 Y ++ ++ + G KA+KD AS + V+++ ++ +QR + + L Sbjct: 3 YLISAIDPIKNSGGNKAKKDIDFFASQLNDTRVIHVKIYYTRLQRYLLTRLSIIKLVKTH 62 Query: 62 NKDVLIFNFPMAKPF--WHILSFFHRLLKFRIVPLIHDIDELR---GGGGSDSVRLATCD 116 D I FP++ P+ + + ++ IHD+ L+ + V D Sbjct: 63 PADRYILQFPISTPYVLRQFIEVIQKYTNAKVDLFIHDLPALQLSMDDKERELVLFNQVD 122 Query: 117 MVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVI-YAGNLSRHKC 175 +I HN M K+L Q + ++ +FDY ++ + D I + GNL++ Sbjct: 123 NLIVHNQAMKKWLVDNGVQTNMIELGLFDYDNEQPMQKKQEYDPANFTICFPGNLAKSTF 182 Query: 176 SFIYTEGCDFTLFGVNYENK--DNPKYLGSFDAQSPEKINLPGM---QFGLIWDGDSVET 230 ++G N + ++ +Y G + +PE+ LP FGLIWDG+S+ET Sbjct: 183 LTKVNLSHQLNIYGPNKLDSYPESIRYCGQY---TPEE--LPKHLTEDFGLIWDGNSIET 237 Query: 231 CSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDS 290 CSG FG+YLK+NNPHKTSLYLS +PV IWD+AALA I ++ +G + S+ E+ ++ S Sbjct: 238 CSGTFGEYLKYNNPHKTSLYLSTGIPVIIWDQAALAPLIKESGVGICISSLTELDSVLLS 297 Query: 291 MTIETYKQISENTKIISQKIRTGSYFRDVLEEV 323 +T E Y+ + + + QK+R G Y + L ++ Sbjct: 298 LTNEQYQLMKRKAEKLGQKLRKGYYTKHALTKL 330 >UniRef50_Q032N6 Glycosyltransferase n=1 Tax=Lactococcus lactis subsp. cremoris SK11 RepID=Q032N6_LACLS Length = 345 Score = 147 bits (371), Expect = 5e-34, Method: Compositional matrix adjust. Identities = 110/342 (32%), Positives = 174/342 (50%), Gaps = 26/342 (7%) Query: 2 YFLNDLNFSRRDAGFKARKDALDIASD--YENI-SVVNIPLWGGVVQRIISSVKLSTFLC 58 Y++N L +AG KA DA +I YE++ S VNI ++ + SV++ + Sbjct: 4 YYINALQKENMNAGSKAVNDATEIFEKMGYESLLSKVNIK--NIYLRTLFFSVQVMIRIL 61 Query: 59 GLENKDVLIFNFPMAKPFWHILSFF--HRLLKFRIVPLIHDIDELRGGGGSDSVRLATCD 116 L ++ NFP F I +F R ++ LIHDI ELR G + + + Sbjct: 62 FLPKNTKVVSNFPPIFFFERICLYFLKKRSKSLKVFILIHDIYELRIGKNNSTPYRNLLN 121 Query: 117 M------VISHNPQMTKYLSKY-MSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGN 169 I+HN +M +L K ++ I D++IFDYL S ++ + VI AGN Sbjct: 122 FKNSNFYFIAHNDKMVSWLVKEGYKKNNIIDLEIFDYL--SVIKEDAGGTYGKSVIIAGN 179 Query: 170 LSRHKCSFIY----TEGCDFTLFGVNY----ENKDNPKYLGSFDAQSPEKINLPGMQFGL 221 L+ K S++ DF L+G N E N Y GSF A E N+ +GL Sbjct: 180 LAPEKSSYLMELFKISEIDFNLYGPNVSSDVEKSKNVIYHGSFPAD--EIPNIIQGSYGL 237 Query: 222 IWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSI 281 IWD ++ +G +G+Y ++NNPHKTSLYL+ P+ +W+KAALA FI+++ +G+ V ++ Sbjct: 238 IWDSETTIGGTGKYGNYQRYNNPHKTSLYLAAGFPIIMWEKAALASFIMEHNLGFLVNTL 297 Query: 282 KEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEV 323 +E+ + + Y ++ EN + KIR G + + LE+ Sbjct: 298 EEIPSKIAKIKEVDYNRMRENVEKFGNKIRMGYFLTEALEKA 339 >UniRef50_A7AHD2 Putative uncharacterized protein n=1 Tax=Parabacteroides merdae ATCC 43184 RepID=A7AHD2_9PORP Length = 352 Score = 146 bits (368), Expect = 1e-33, Method: Compositional matrix adjust. Identities = 96/325 (29%), Positives = 164/325 (50%), Gaps = 16/325 (4%) Query: 14 AGFKARKDALDIASD--YENISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFP 71 AG KA+ D I D Y NI + ++ +I+ + + L + D+LI +P Sbjct: 16 AGSKAKTDMEQIMCDLGYRNIGFPCLVCSNKILGFVITLLSMIKVCFKLRSGDILIIQYP 75 Query: 72 MAKPFWHILSFFHRLLKFRIVPLIHDIDELRGGGGS---DSVRLATCDMVISHNPQMTKY 128 + K + + + H +++ LIHD+ R + + RL D +I+ N M+ + Sbjct: 76 LKKYYTLLCNIVH-YRGAKVITLIHDLGSFRRKRLTVLQEIDRLQNSDYLITLNDSMSAW 134 Query: 129 LSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIYT-----EGC 183 L + ++KI+DYL + V ++ V+YAG L +K F+Y Sbjct: 135 LQTKGCEVPKGELKIWDYLSPAIVLNKIEPATDYTVVYAGALGYNKNRFLYELDRLPRQW 194 Query: 184 DFTLFGVNYENKD--NPKYLGSFDAQSPEKINLPGMQ--FGLIWDGDSVETCSGAFGDYL 239 +++G E N +Y S++ P + +Q FGL+WDGDS E C+G +G+YL Sbjct: 195 HLSVYGKGLEADKILNKEYF-SYNGFLPADQLISSVQGDFGLVWDGDSYEACTGNYGEYL 253 Query: 240 KFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQI 299 ++NNPHK SLY+ LP+ IW+KAALA FI + IG + S++E+ ++ +T++ Y ++ Sbjct: 254 RYNNPHKVSLYVRCHLPLIIWEKAALAPFIKEKEIGICINSLEELDGKLEKLTVDDYFKM 313 Query: 300 SENTKIISQKIRTGSYFRDVLEEVI 324 IS + G +F L+E + Sbjct: 314 KSRVIEISNLLSVGYFFTKALDEAV 338 >UniRef50_Q03GL2 Glycosyltransferase n=1 Tax=Pediococcus pentosaceus ATCC 25745 RepID=Q03GL2_PEDPA Length = 338 Score = 145 bits (366), Expect = 2e-33, Method: Compositional matrix adjust. Identities = 102/341 (29%), Positives = 183/341 (53%), Gaps = 22/341 (6%) Query: 2 YFLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLW-GGVVQRIISSVKLSTFLCGL 60 Y L N + AG KA+KD I + + V I L +++ ++ ++ L Sbjct: 4 YVLRITNGQKNTAGDKAKKDITSILNK-QGFKSVEIRLRESKLIKLFTTNFMINKQLNNF 62 Query: 61 ENKDVLIFNFPMAKPFWHILSFFHRLLK--FRIVPLIHDIDELR------GGGGSDSVRL 112 + D+ + +PM F + ++ K R + +IHD++ LR + L Sbjct: 63 KKNDIFVIQYPMYSRFATKI-ILNKCEKKGIRTICVIHDLEALRLYKNDENKIAEEKAIL 121 Query: 113 ATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSR 172 + + +I HN +M ++L + + + ++IFDYL +D E V +K +I+AGNL + Sbjct: 122 SRFNCLIVHNEKMREWLVEQDVKVPMVSLQIFDYL--NDKELVKVENKL-NLIFAGNLEK 178 Query: 173 HKCSFIYTEGCDFTLFGVNYEN--KDNPKYLGSFDAQSPEKIN--LPGMQFGLIWDGDSV 228 + T+FGV+ + N Y G ++P+++ L G FGLIWDG+S+ Sbjct: 179 SAFLEKWNLEKKITVFGVHPSDLYPHNVIYKG---VKTPDELPKYLSG-SFGLIWDGNSI 234 Query: 229 ETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIV 288 ET +G +GDY K+NNPHK SLYLS LPV +W KAA+++FIV N++G ++ S+ ++++ + Sbjct: 235 ETNTGIYGDYTKYNNPHKVSLYLSSGLPVIVWKKAAISEFIVKNKLGISIDSLGDLEDSL 294 Query: 289 DSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDDLKT 329 + E Y + N + +++K+R G++ +E+ I+ +K+ Sbjct: 295 SKINAEKYTNMVSNVEKMARKLRKGTFTTKAVEKAINLIKS 335 >UniRef50_A5ZF92 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=A5ZF92_9BACE Length = 345 Score = 142 bits (359), Expect = 1e-32, Method: Compositional matrix adjust. Identities = 101/332 (30%), Positives = 165/332 (49%), Gaps = 23/332 (6%) Query: 13 DAGFKARKDALDI--ASDYENISVVNIPLWGGVVQ--RIISSVKLSTFLCGLENKDVLIF 68 +AG KA+ D I + + N+ + VV R + SV L + LC L DVL+ Sbjct: 14 NAGNKAKTDIEQIMESHGFRNVGLKQTRYRNVVVAFCRTLFSV-LKSILC-LRKGDVLVL 71 Query: 69 NFPMAKPFWHILSFFHRLLKFRIVPLIHDIDELRGGG---GSDSVRLATCDMVISHNPQM 125 +P+ K + + + H L ++V LIHD+ R + RL D VI H+ +M Sbjct: 72 QYPLKKYYAFVCNMAH-LRGCKVVTLIHDLGSFRRKKLTIPQEIARLDHSDCVIVHSERM 130 Query: 126 TKYLSKYMSQDKIKDIKIFDYLVSSD-VEHRDVTDKQRGVIYAGNLSRHKCSFIYTE--- 181 +L ++ + K++ ++IFDYL S V D +++ G LS + F+Y + Sbjct: 131 RDWLLEHGIKAKLQILEIFDYLSDSQPVAGNDSPKSPNRILFVGALSSYHNDFLYKQVNS 190 Query: 182 --GCDFTLFGVNYENKDNPKYLGSFD----AQSPEKINLPGMQFGLIWDGDSVETCSGAF 235 D L+G E + K G D S E I ++GL W G S+E SGA Sbjct: 191 PRSYDIVLYGSGLETE---KLEGKVDYKGFVSSDELIATAEGEYGLAWYGSSLEGGSGAL 247 Query: 236 GDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIET 295 G+YL++N PHK SLY+ LP+ +W+KA LA F+ N +G + S+ E+++I+ ++ Sbjct: 248 GEYLQYNAPHKMSLYIRCGLPIIVWEKAGLAPFVKKNNVGICISSLTELEDILPKISAGQ 307 Query: 296 YKQISENTKIISQKIRTGSYFRDVLEEVIDDL 327 Y ++ +N I+ K+ G Y +++ DL Sbjct: 308 YMEMKKNVLQIADKLSHGYYCFKAIKQACADL 339 >UniRef50_C4Z1X5 Putative uncharacterized protein n=1 Tax=Eubacterium eligens ATCC 27750 RepID=C4Z1X5_EUBE2 Length = 240 Score = 142 bits (358), Expect = 2e-32, Method: Compositional matrix adjust. Identities = 82/232 (35%), Positives = 135/232 (58%), Gaps = 15/232 (6%) Query: 108 DSVRLATCDMVISHNPQMTKYLSKY-MSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIY 166 D D VI+HN +M +YL ++ + + KI ++ IFDYL + + ++ + + + Sbjct: 13 DETMYEIADYVIAHNSKMKRYLIEHGVEESKIYELGIFDYLTNINPNNKSIR-YSKTLNI 71 Query: 167 AGNLSRHKCSFIYT-EGCD----FTLFGVNYE----NKDNPKYLGSFDA-QSPEKINLPG 216 AGNL +K ++I G D F L+G+N++ + Y G+F + + P ++ Sbjct: 72 AGNLDANKSNYIRELNGVDKTINFNLYGLNFDKNVLTSEAIHYKGAFPSDEIPSQLT--- 128 Query: 217 MQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGY 276 FGL+WDG++ C+G G+YLK+NNPHK SLY+ LPV IW +AA A+F+ N +G Sbjct: 129 EGFGLVWDGNTASCCAGNTGEYLKYNNPHKLSLYMVSGLPVVIWSQAAEAEFVKCNNVGL 188 Query: 277 AVGSIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDDLK 328 V SI++ D+++ Y ++ EN K +S K+R G Y R V++++I DLK Sbjct: 189 VVDSIEDFSIKFDNLSENDYYKMVENAKNVSYKLRNGEYLRKVIQDIIKDLK 240 >UniRef50_A2RHU2 Putative galactofuranose transferase n=1 Tax=Lactococcus lactis subsp. cremoris MG1363 RepID=A2RHU2_LACLM Length = 344 Score = 142 bits (358), Expect = 2e-32, Method: Compositional matrix adjust. Identities = 108/341 (31%), Positives = 182/341 (53%), Gaps = 29/341 (8%) Query: 12 RDAGFKARKDALDIASD--YENISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFN 69 + AG KA+ D+ I D Y+++ +I + +I++ + L ++ V+ N Sbjct: 9 QTAGAKAKIDSDTIFKDSGYKSLFSHHIQTNKVYINKILNIILGIISLTFIKKGSVITTN 68 Query: 70 FPM----AKPFWHILSFFHRLLKFRIVPLIHDIDELRGGGGSDSV------RLATCDMVI 119 +P K W+ L R+ K +++ LIHD+D +R + +L D +I Sbjct: 69 YPPNLIDKKIVWNYLYKIKRIKKIKLIILIHDLDFIRNNDNDSNQEKKYIEQLDVADAII 128 Query: 120 SHNPQMTKYL-SKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFI 178 HN +M L K +S+DK+ D+KIFDYL +D++ + + ++ AGNL K ++ Sbjct: 129 VHNTKMIDLLVEKGLSKDKLIDLKIFDYL--ADIKSSGGSYGNKFIV-AGNLDIQKSKYL 185 Query: 179 YT----EGCDFTLFGVNYE----NKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVET 230 +G F L+G Y + D KY GSF ++S N+ +GL+WD + + Sbjct: 186 SKISKIDGIYFNLYGPGYNQNDYDSDKSKYYGSFPSESIP--NVIQGSYGLVWDSEELSG 243 Query: 231 CSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDS 290 G +GDY ++NNPHKTSLYL+ PV +W+KAALA FIV+N +G+ V ++ E+ ++ Sbjct: 244 GVGPYGDYQRYNNPHKTSLYLAAGFPVVVWEKAALAPFIVENNLGFVVDNLDELPSKIEE 303 Query: 291 MTIETYKQISENTKIISQKIRTGSYFRDVL---EEVIDDLK 328 ++ + Y ++ N K I QKI +G + + L E VI++ K Sbjct: 304 ISEDEYNRMKLNVKEIGQKICSGYFLNEALKKAETVIEENK 344 >UniRef50_UPI0001968A2E hypothetical protein BACCELL_04078 n=1 Tax=Bacteroides cellulosilyticus DSM 14838 RepID=UPI0001968A2E Length = 355 Score = 137 bits (346), Expect = 4e-31, Method: Compositional matrix adjust. Identities = 97/333 (29%), Positives = 174/333 (52%), Gaps = 31/333 (9%) Query: 13 DAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGL--ENKDVLIF-- 68 +AG KA KD + + +V+ +P + ++I L LC N VL F Sbjct: 27 NAGSKAMKDIMALLDSKGYKAVLALPTRTNKIIKLIDIPILLFTLCFRVGRNGTVLYFVP 86 Query: 69 -NFPMAKPFWHILSFFHRLLKFRIVPLIHDIDELRGGGGSDSVR-----LATCDMVISHN 122 NF K +L FF+R+++F+++ I+DI+ +R + +A D++++ N Sbjct: 87 SNFQRIK----LLKFFNRIIRFKLICFINDIESMRMEKSKEYAHAEMNSIAVADIILAPN 142 Query: 123 PQMTKYL-SKYMSQDKIKDIKIFDYLVSSD---VEHR-DVTDKQRGVIYAGNLSRHKCSF 177 + L +KY + + I I+DYL + + EH ++ ++ V +AGNL +K F Sbjct: 143 DNSIQILQNKYHFTNHLVSIGIWDYLNNFEPIASEHTTNMVFNEKSVAFAGNL--NKAPF 200 Query: 178 I---YTEGCDFTLFGVNYENKD--NPKYLGSFDAQSPEKI--NLPGMQFGLIWDGDSVET 230 I + +F ++G N E K N +++G ++P+++ N+ +GL+WDG S+ T Sbjct: 201 INELSSVNLNFKIWGSNTEEKKDRNIEFMGK---KAPDELIENISQCTWGLVWDGISINT 257 Query: 231 CSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDS 290 C G G YL+FNN HK LYL+ +PV +W+++ +A F+ ++G V S+ + +I++ Sbjct: 258 CCGLLGTYLRFNNSHKCGLYLAARVPVIVWEESGMASFVNKYKVGICVSSLHDAADIINC 317 Query: 291 MTIETYKQISENTKIISQKIRTGSYFRDVLEEV 323 M + Y +N + I Q I G +F + LE+ Sbjct: 318 MDQKVYNIYKKNAQSIGQLISEGKFFLEALEKA 350 >UniRef50_C2E8T4 Possible galactofuranosyltransferase n=1 Tax=Lactobacillus ruminis ATCC 25644 RepID=C2E8T4_9LACO Length = 337 Score = 137 bits (345), Expect = 6e-31, Method: Compositional matrix adjust. Identities = 88/332 (26%), Positives = 172/332 (51%), Gaps = 24/332 (7%) Query: 10 SRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRII-SSVKLSTFLCGLENKDVLIF 68 ++ DAG KA+ D + ++ E V+++ L +++ + +KL G + +V I Sbjct: 15 AKNDAGPKAKTDINEFLTE-EGFKVMDLDLPEKRLEKFLFVHLKLKRLFKGRQFDNV-IL 72 Query: 69 NFPMAKPFW--HILSFFHRLLKFRIVPLIHDIDELRGGGGSDSVR------LATCDMVIS 120 +P F I+ ++ + + ++HD++ LR G+ + D +I Sbjct: 73 QYPFYSVFLTKKIIENAKKVTHGKFLIMVHDVETLRVYDGNKQFEKDEMEIFNSADGLIV 132 Query: 121 HNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSF--- 177 HN +M ++L ++ I + IFDY +D + + + Q+ + +AGNL K +F Sbjct: 133 HNSKMAEWLKQHGVTVPITILGIFDY--RNDCQKNERFEYQKSICFAGNL--EKSTFLKK 188 Query: 178 IYTEGCDFTLFGVNYENK--DNPKYLGSFDAQS-PEKINLPGMQFGLIWDGDSVETCSGA 234 + ++G + K Y G + P +N FGLIWDGD + C+G Sbjct: 189 VKLNDAKLDVYGPSPAQKYQKGVTYCGVYTPDDLPNHLN---ENFGLIWDGDEMSACTGV 245 Query: 235 FGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIE 294 FG+Y+++N PHKTSLYLS +PV IW +AA+A+F+ +N +G A+ ++ ++ ++ + Sbjct: 246 FGNYMRYNAPHKTSLYLSSGIPVIIWKEAAMAEFVSENEVGIAIENLNDLDNVLQKVDDA 305 Query: 295 TYKQISENTKIISQKIRTGSYFRDVLEEVIDD 326 Y+++ N +++++R+GSY ++ + + + D Sbjct: 306 GYRKMKSNALNLAERLRSGSYVKEAVRKALGD 337 >UniRef50_C2EVL7 Possible galactofuranosyltransferase n=1 Tax=Lactobacillus vaginalis ATCC 49540 RepID=C2EVL7_9LACO Length = 336 Score = 137 bits (345), Expect = 6e-31, Method: Compositional matrix adjust. Identities = 100/343 (29%), Positives = 169/343 (49%), Gaps = 31/343 (9%) Query: 2 YFLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLE 61 Y L + + G KA++DA+ IS+ +P + ++ SS L L Sbjct: 7 YVLEWHDSEKNTGGVKAKQDAVTFLKKDGFISI-EVP--SSKLGKVWSSFWARYILRNLS 63 Query: 62 NKDVLIFNFPMAKPF----WHILSFFHRLLKFRIVPLIHDIDELRGGGGS--DSVR---- 111 +++ +P KPF W L + K +++ LIHD++ +R S VR Sbjct: 64 G--IIVIQYPSGKPFLRKLW--LEAACKNKKLKVILLIHDLESIRFFNDSKYSDVRQSEF 119 Query: 112 --LATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGN 169 +A D +++ N +M L K I + +DY + + + D QR + YAGN Sbjct: 120 EFIAKADGLVALNERMKSLLVKGGIVKPITTLDAWDYDNKNPIIEKK--DYQRRICYAGN 177 Query: 170 LSRHKCSFIYTEGCDFTL--FGVNYEN--KDNPKYLGSFDAQS-PEKINLPGMQFGLIWD 224 L K F+ C ++ FG N E + KY+G F Q P +N +GL+WD Sbjct: 178 L--RKALFLSDLKCKTSIYVFGPNSETTFSKSIKYMGQFSPQKLPSHLN---GDYGLVWD 232 Query: 225 GDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEM 284 G S ETC G +G YL++N PHK SLY+S LPV +WDKAA+A+F+ +G + ++ ++ Sbjct: 233 GVSSETCKGMYGQYLRYNTPHKFSLYISSGLPVIVWDKAAIAEFVKKYNVGLTISNLNDI 292 Query: 285 QEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDDL 327 ++ S+ YK++ +N +++K+R G + + ++I + Sbjct: 293 DNLLHSVPSSQYKELQKNVIKVAEKMRNGQFLTTAINDLIKKI 335 >UniRef50_C2FKS4 Possible galactofuranosyltransferase n=1 Tax=Lactobacillus plantarum subsp. plantarum ATCC 14917 RepID=C2FKS4_LACPL Length = 343 Score = 136 bits (343), Expect = 1e-30, Method: Compositional matrix adjust. Identities = 96/330 (29%), Positives = 177/330 (53%), Gaps = 27/330 (8%) Query: 13 DAGFKARKD--ALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNF 70 +AG KA++D A+ + E +S+V IP V R++ ++++ + N+ +++ + Sbjct: 15 NAGSKAKQDIEAILFKAGLEKLSLV-IPT--NRVGRVLYAIRIWKKVFNGLNEGLIVVQY 71 Query: 71 PMAKPFW--HILSFFHRLLKFRIVPLIHDIDELR-GGGGSDSVR-----LATCDMVISHN 122 P+ ++ + +IV ++HDI+ LR D++ L D +I HN Sbjct: 72 PLYSKVITKQLVKEAGKRPNVKIVAIVHDIESLRIDVNHEDAINTEIDLLNGFDFLIVHN 131 Query: 123 PQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVI-YAGNLSRHKCSF---I 178 +M +L + + + FDYL V + K V+ +AGNL+ K SF I Sbjct: 132 TKMKSWLIENGLTIPSEVLGAFDYLSDFSVP---IQRKSGNVVNFAGNLA--KSSFLTKI 186 Query: 179 YTEGCDFTLFGVNYENKDNP-KYLGSFDAQSPEKINLPGMQ-FGLIWDGDSVETCSGAFG 236 + + ++G N + Y G + SPE+++ + +GL WDGDS+ TCSG +G Sbjct: 187 TSTDVKYHIYGPNPQKYSTALAYKGIY---SPEQLSEQFVSGYGLAWDGDSITTCSGVYG 243 Query: 237 DYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETY 296 +YLK NNPHK SLY+ LPV +WD +A++D++ N +G +V S+ E+ +I+ +T Y Sbjct: 244 EYLKINNPHKVSLYIRSGLPVIVWDDSAMSDWVQKNDLGLSVSSLAELGDIISGVTDHQY 303 Query: 297 KQISENTKIISQKIRTGSYFRDVLEEVIDD 326 + +EN ++++Q+++ G Y R+ +++ + Sbjct: 304 QIYTENARVVAQRMQQGLYIREAFTKLLKN 333 >UniRef50_UPI000196CD65 hypothetical protein CATMIT_02517 n=1 Tax=Catenibacterium mitsuokai DSM 15897 RepID=UPI000196CD65 Length = 349 Score = 136 bits (342), Expect = 1e-30, Method: Compositional matrix adjust. Identities = 85/251 (33%), Positives = 136/251 (54%), Gaps = 24/251 (9%) Query: 90 RIVPLIHDIDELRGGGGS--------DSVRLATCDMVISHNPQMTKYL-SKYMSQDKIKD 140 ++ LIHD+D LR + D D +I+HN M +YL S+ ++++KI + Sbjct: 103 KLAFLIHDLDSLRKLFLNAQDDFEYMDHKMYDISDYIIAHNDSMIEYLVSQGVAREKIHN 162 Query: 141 IKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIYT----EGCDFTLFGVNYENK- 195 + IFDYL S+ + R V AGNL K +++ + F L+GV+ + Sbjct: 163 LHIFDYLCDSN----NTIKFDRSVSIAGNLDEKKSNYLAKLKDIKAVHFDLYGVHLNEEI 218 Query: 196 --DNPKYLGSFDAQSPEKINLPGMQ-FGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLS 252 N Y G+F P++IN FGL+WDG S+E C G G+YLK+NNPHK SLYL Sbjct: 219 LASNITYHGAF---PPDEINNQLYSGFGLVWDGSSIERCDGNTGEYLKYNNPHKLSLYLV 275 Query: 253 MELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQISENTKIISQKIRT 312 +PV IW +AA A F+ + +G V S+ E+ E S++ E Y ++ + ++S++++ Sbjct: 276 SGIPVVIWKEAAEAKFVEEYGLGITVNSLDELGEKFASLSEEEYFEMVKRVAVVSERLKN 335 Query: 313 GSYFRDVLEEV 323 G Y ++E+ Sbjct: 336 GYYLTQAIKEI 346 >UniRef50_C9LJY2 Putative uncharacterized protein n=1 Tax=Prevotella tannerae ATCC 51259 RepID=C9LJY2_9BACT Length = 353 Score = 132 bits (333), Expect = 1e-29, Method: Compositional matrix adjust. Identities = 93/336 (27%), Positives = 161/336 (47%), Gaps = 25/336 (7%) Query: 11 RRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISS-----VKLSTFLCGLENKDV 65 R+ AG KA+ D DI + N+ L + I++ + T+ + DV Sbjct: 17 RQGAGNKAKGDYEDILV---QMGAHNLGLRRTYYKEYIAAFLTDLAGIVTYALSVRKGDV 73 Query: 66 LIFNFPMAKPFWHILSFFHRLLKFR---IVPLIHDIDELRGGG---GSDSVRLATCDMVI 119 + +P K F SF RL ++R + IHD+ R + RL+ D +I Sbjct: 74 VFLQYPTKKYF----SFMCRLARWREANSMAFIHDLGAFRRKKVTVKQEIRRLSNADYII 129 Query: 120 SHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFI- 178 + N M ++L + + + + DYL +S+ + T ++YAG++ K F+ Sbjct: 130 AANDTMAEWLKSHGLKRPCHGMGLHDYLSNSETVDKPATFPPHRIVYAGSIEERKNMFLT 189 Query: 179 ----YTEGCDFTLFGVNY-ENKDNPKYLGSFDAQSPEK-INLPGMQFGLIWDGDSVETCS 232 + ++G N+ + + L + +P+ I FGL+WDGDS+ C+ Sbjct: 190 KLSGVIRHGEIHVYGSNHIAALKSTRNLILHEPMTPDNFIATAKGDFGLVWDGDSLTACT 249 Query: 233 GAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMT 292 G FG+YL+ N PHK S YL LP+ IW ++ALAD + IG V I E+++ ++S+T Sbjct: 250 GDFGEYLRINTPHKASFYLRAGLPLIIWSRSALADIVDREGIGITVDRIDEIEDHIESLT 309 Query: 293 IETYKQISENTKIISQKIRTGSYFRDVLEEVIDDLK 328 + ++I +N K +SQ + G R +E+ + +K Sbjct: 310 GQEIRKIRDNVKRVSQDLADGLSMRRAVEKAMCRIK 345 >UniRef50_C7TE97 Glycosyl transferase,galactofuranosyltransferase n=2 Tax=Lactobacillus rhamnosus RepID=C7TE97_LACRG Length = 338 Score = 132 bits (332), Expect = 2e-29, Method: Compositional matrix adjust. Identities = 106/332 (31%), Positives = 171/332 (51%), Gaps = 33/332 (9%) Query: 13 DAGFKARKDALDIASDYENISVVNIPLW------GGVVQRIISSVKLSTFLCGLENKDVL 66 DAGFKAR D + S+ I IP +QR+ + LS L ++ VL Sbjct: 13 DAGFKARAD-VKYFSNRMGIKTAEIPATRVNLKINRELQRLRAVRSLSKKLSA--DQSVL 69 Query: 67 IFNFPMAKPFWHI-LSFFHR-LLKF--RIVPLIHDIDELRGGGGSDSVR-----LATCDM 117 I +P+ PF S H+ LLK + + L+HD+ ++G + S++ L D Sbjct: 70 I-QYPL--PFNSFDYSLLHKTLLKHNAKCIFLVHDLISVQGQAKNVSIKQEIEELKRADF 126 Query: 118 VISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKC-- 175 +I HN M +L K+ I FDY V DVE + + +++AGNL + K Sbjct: 127 LIVHNQAMQNFLEDQGLSQKMATINFFDYRV--DVEP-PIRSEVANIVFAGNLVKSKFLK 183 Query: 176 SFIYTEGCDFTLFGVNYENKDNPK---YLGSFDAQS-PEKINLPGMQFGLIWDGDSVETC 231 E + ++G + P+ + G+ D+ P K+ + G +GL+WDG S + Sbjct: 184 KLPQLEIFKWHVYGSGMTAEQFPESVVFHGAIDSGVLPSKL-VDG--WGLVWDGISTDRI 240 Query: 232 SGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSM 291 SG GDYL+ N+PHK SLYL+ LP+ +W ++ALA+ ++ +G AV ++ E++ ++ S+ Sbjct: 241 SGVSGDYLRLNSPHKASLYLASGLPLIVWRESALANVVLQLGLGIAVDNLMEIEPVIKSL 300 Query: 292 TIETYKQISENTKIISQKIRTGSYFRDVLEEV 323 + ++I N +IISQKIR G +D LE + Sbjct: 301 SHTQIEKIQTNVQIISQKIRNGGMLKDALESL 332 >UniRef50_C9LPN1 Galactofuranosyltransferase n=2 Tax=Veillonellaceae RepID=C9LPN1_9FIRM Length = 344 Score = 131 bits (329), Expect = 4e-29, Method: Compositional matrix adjust. Identities = 92/289 (31%), Positives = 158/289 (54%), Gaps = 28/289 (9%) Query: 53 LSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLK-FRI---VPLIHDIDELRGGGGSD 108 L+T ++N D+LI +P +H L +L+ FRI + L+HD+D +R G D Sbjct: 62 LNTIRKSVKNDDILIIQYPHYN--FHALG--EKLIDLFRIKNTILLVHDVDSVRYQTGID 117 Query: 109 S--VRLATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIY 166 L +V+ HN +M+ YL K+ + K +I IFDYL+ + + ++ +++ Sbjct: 118 EEIKLLNLAKVVLLHNQKMSDYLVKHGLKTKTVNINIFDYLLYNTPSQESFSFGKQ-IVF 176 Query: 167 AGNLSRHKCSFIYTEGCD-----FTLFG--VNYENKDNPK--YLGSFDAQSPEKI--NLP 215 AGNL K F+ G D +LFG ++ E K++ ++GS+ SP++I L Sbjct: 177 AGNLG--KSHFLNLMGQDSLGLSLSLFGPGLSEEMKESSHVHWMGSY---SPDEIPFKLK 231 Query: 216 GMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIG 275 G FGL+WDG S++ C G G Y+K N PHK +LY++ +PV W +AA+AD + +IG Sbjct: 232 G-SFGLVWDGTSLDECDGFMGRYMKINFPHKLALYIAAGIPVVTWSQAAIADIVKTYKIG 290 Query: 276 YAVGSIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVI 324 + V S++E+ +DS+ + Y + +N + +K+ +G + E+ + Sbjct: 291 FVVDSLREVSNYIDSINEKEYAEYKKNILKLQKKVMSGYFTALAFEKAV 339 >UniRef50_C7XW37 Glycosyltransferase n=1 Tax=Lactobacillus coleohominis 101-4-CHN RepID=C7XW37_9LACO Length = 338 Score = 128 bits (321), Expect = 4e-28, Method: Compositional matrix adjust. Identities = 80/277 (28%), Positives = 149/277 (53%), Gaps = 20/277 (7%) Query: 62 NKDVLIFNFPMAKPF--WHILSFFHRLLKFRIVPLIHDIDELR-----GGGGSDSVR-LA 113 N D L+ +P+ + I+ F + ++ ++HD++ LR G + + L Sbjct: 65 NIDELVVQYPVYSRYIIRAIIKNFRKYSNGKLYFIVHDLEGLRLYKDDSIFGIEEIEFLN 124 Query: 114 TCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRG-VIYAGNLSR 172 D +++HNP M KYL + + KI + FDYLV+ +++ + + +AGNL Sbjct: 125 LVDGIVAHNPSMKKYLEEKGVKSKITCLDFFDYLVNEKNIYKNQKNNMNDRICFAGNLD- 183 Query: 173 HKCSFIYTEGCD---FTLFGVNYEN--KDNPKYLGSFDAQSPEKINL-PGMQFGLIWDGD 226 K FI + ++G+N + KD +Y G F P+K+ L +FGL+WDGD Sbjct: 184 -KAPFINKMSLNSIKLDVYGINRSSLYKDGIEYKGVF---PPDKLPLILNEKFGLVWDGD 239 Query: 227 SVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQE 286 S++ C+G +G+Y+K+N+PHK SLYLS +P+ +W ++AL++ + +G +V ++K ++E Sbjct: 240 SIQCCNGTYGNYIKYNSPHKASLYLSAGIPIIVWKQSALSELVKKYNLGLSVNNLKNIEE 299 Query: 287 IVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEV 323 ++ + Y ++ N S+ I++G +E + Sbjct: 300 VLHKIPNCEYNELKSNAIQYSKVIKSGQNIIRAIESL 336 >UniRef50_C0YXY6 Possible galactofuranosyltransferase n=5 Tax=Lactobacillus RepID=C0YXY6_LACRE Length = 338 Score = 126 bits (317), Expect = 9e-28, Method: Compositional matrix adjust. Identities = 80/243 (32%), Positives = 133/243 (54%), Gaps = 19/243 (7%) Query: 90 RIVPLIHDIDELRGGGGS-----DSVRL-ATCDMVISHNPQMTKYLSKYMSQDKIKDIKI 143 +I+ +IHDI+ LR G + +R+ D +I HN +M K+L ++ + + Sbjct: 95 KIILIIHDIESLRLHYGEKGYIDEELRVFNMADGLIVHNAKMEKWLRDNGVTVPMESLGL 154 Query: 144 FDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIYT---EGCDFTLFGVNYENK--DNP 198 FDY + ++ ++ + V +AGNLS K F+ + +FG N K N Sbjct: 155 FDY--DNKIKLASGSNYETSVCFAGNLS--KAGFLEKLSLKRVKLNVFGPNPLEKYGANI 210 Query: 199 KYLGSFDAQSPEKI-NLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPV 257 Y G + P+++ N FGL+WDG + TC G FG+Y+KFNNPHK SLYLS +PV Sbjct: 211 VYKGQY---PPDELPNYLKGNFGLVWDGTTPITCDGLFGNYMKFNNPHKASLYLSSGIPV 267 Query: 258 FIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFR 317 +W +AA+AD + IG V S+ E+ E++ +++ Y ++ N K +++K+R+G Y + Sbjct: 268 VVWRQAAIADLVEKMNIGIVVDSLNELDEVLPNVSSIDYSELVNNAKEVAEKLRSGFYIK 327 Query: 318 DVL 320 + Sbjct: 328 TAI 330 >UniRef50_D1PDY2 Putative galactofuranosyltransferase n=1 Tax=Prevotella copri DSM 18205 RepID=D1PDY2_9BACT Length = 351 Score = 125 bits (313), Expect = 3e-27, Method: Compositional matrix adjust. Identities = 89/340 (26%), Positives = 161/340 (47%), Gaps = 32/340 (9%) Query: 9 FSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTF---LCG------ 59 +++ AG KA+ D + + + +N+ L R I + K+ F L G Sbjct: 14 YNQTSAGNKAKTDTEETLVE---MGAINLGL-----HRTIKNSKIFAFFRNLAGIIRACI 65 Query: 60 -LENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDELRG---GGGSDSVRLATC 115 L+ D+L +P+ K F I + R + + LIHDI +R + RL+ Sbjct: 66 LLKKGDILFLQYPIKKYFTFICTV-ARFKGAKTISLIHDIGSIRTHRLTTQQEVKRLSHS 124 Query: 116 DMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVS--SDVEHRDVTDKQRGVIYAGNLSRH 173 D +++ N +M ++L Q I+ + ++DY + H ++YAG + Sbjct: 125 DYILATNNKMKEWLISNNFQKPIEGLGLWDYRSPYFNKNSHPICNPGNISIVYAGAIHVR 184 Query: 174 KCSFIYT-----EGCDFTLFGVNYE---NKDNPKYLGSFDAQSPEKINLPGMQFGLIWDG 225 K F+ + + ++G E +NP Q E I FGL+WDG Sbjct: 185 KNPFLIQLSKKLKTWNLIIYGKKEELTGWANNPLITFKGFVQPDEFIRTVKADFGLVWDG 244 Query: 226 DSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQ 285 DS++TCSG FG+YLK+N PHK S YL LP+ IW +AA+ + + A+ ++ E++ Sbjct: 245 DSLDTCSGIFGEYLKWNTPHKVSFYLRAGLPIIIWKQAAVTPILEKAGVCIAINTLSELE 304 Query: 286 EIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVID 325 + ++ ++ + ++ ENTK +++++ G + R L+ + Sbjct: 305 QKLNELSSDELSKMKENTKRLAERLNQGFFLRQALDNYLS 344 >UniRef50_C6Z1L9 Glycosyltransferase n=1 Tax=Bacteroides sp. 4_3_47FAA RepID=C6Z1L9_9BACE Length = 400 Score = 122 bits (305), Expect = 2e-26, Method: Compositional matrix adjust. Identities = 85/262 (32%), Positives = 138/262 (52%), Gaps = 22/262 (8%) Query: 66 LIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDELRGGG----GSDSVRLATCDMVISH 121 +IFN+ AK + +L F R KF V L+HDI+ +R D + L D++I H Sbjct: 128 IIFNYHFAK--FILLIFKSRRCKF--VVLLHDIETIRQKRIKPIKMDRIILDLADVIIVH 183 Query: 122 NPQMTKYLS--KYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLS-----RHK 174 QM + +S K+ + FDYL S ++ D + +IYAGNL R Sbjct: 184 THQMAEKISCIDKCPNSKLIKLAFFDYLSSIEMIGND-SAANINLIYAGNLDKSLFLRRL 242 Query: 175 CSFIYTEGCDFTLFGV---NYENKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETC 231 + L+G N N + +Y G F A + I +GL+WDG+SV++C Sbjct: 243 QDVGFNNEFKMFLYGAYSDNIPNTEGVEYKGKFAADRFDSIE---GNWGLVWDGESVDSC 299 Query: 232 SGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSM 291 +G +G+YLK N+P K SLYL+ PV +W K+ALA ++ + ++G V S+K++++ + S+ Sbjct: 300 TGQYGEYLKINSPFKFSLYLAANRPVVVWSKSALASYVKEYKLGICVDSLKDIEKTIKSL 359 Query: 292 TIETYKQISENTKIISQKIRTG 313 TI+ I + S++I++G Sbjct: 360 TIDELVNIQSSVYEYSKRIKSG 381 >UniRef50_C3QC04 Galactofuranosyltransferase n=3 Tax=Bacteroides RepID=C3QC04_9BACE Length = 334 Score = 114 bits (284), Expect = 7e-24, Method: Compositional matrix adjust. Identities = 86/302 (28%), Positives = 145/302 (48%), Gaps = 21/302 (6%) Query: 14 AGFKARKDALDIA--SDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFP 71 A KA +D IA + YE ++ ++ ++ +K+ L N L+ +P Sbjct: 25 ASVKAPQDIHKIALQNGYEEYPIILRGYKNKLLFIVVLFLKMIRLAINLPNGATLLIQYP 84 Query: 72 MAKPFWHILSFFHRLLKFR-IVPLIHDIDELRGGG---GSDSVRLATCDMVISHNPQMTK 127 P +L F LK + ++ L+HDI+ +R G G ++ L+ D +I H P+M Sbjct: 85 SLNP--KMLYFIFPFLKKKYLITLLHDINSVREKGELSGFENKVLSNFDEIIVHTPEMQT 142 Query: 128 YLSKYMSQD-KIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHK--CSFIY-TEGC 183 Y + + K + F Y+ D E R ++ + V +AGN+ + F++ + Sbjct: 143 YFEQRLRPGIKYHYLGCFPYIAVPDKEARQLS---KQVCFAGNIDKSVFFSDFVFENKDL 199 Query: 184 DFTLFGVNYEN---KDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLK 240 D ++G N K+ +Y G F P+ I +GL+WDGDS ETCSG +G YLK Sbjct: 200 DLIVYGSCSSNNAMKNKYEYKGVF---KPDMIGHLEGSWGLVWDGDSTETCSGTWGSYLK 256 Query: 241 FNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQIS 300 PHK SLY+ LP+ +W +A+A + +G V S+ E+ + +++ YK+ Sbjct: 257 IIAPHKFSLYVLAGLPLIVWKDSAMAKLVEMKNLGITVTSLSEISARISAVSDNDYKEYC 316 Query: 301 EN 302 N Sbjct: 317 AN 318 >UniRef50_A7HN15 Galactofuranosyltransferase n=1 Tax=Fervidobacterium nodosum Rt17-B1 RepID=A7HN15_FERNB Length = 350 Score = 107 bits (268), Expect = 4e-22, Method: Compositional matrix adjust. Identities = 74/252 (29%), Positives = 134/252 (53%), Gaps = 26/252 (10%) Query: 94 LIHDIDELRGGGGSDSVR----LATCDMVISHNPQMTKYLSKYMS-QDKIKDIKIFDYLV 148 +IHDI+ +R D R + + H+ +M Y+ + + + KI + +FDY++ Sbjct: 102 VIHDIESIRLARSIDFTREKLVFSNFTHAVCHSKKMADYIKEKLGYKGKIYILGLFDYIL 161 Query: 149 SSDVEHRDVTDK-----QRGVIYAGNLSRHKCSFIY-----TEGCDFT--LFGVNYE--N 194 + V R ++ + + +AGNLS K +F+ ++T L+G Y+ Sbjct: 162 DTPVYERVMSKTLPSLGKYVISFAGNLS--KSTFLKKIIKEVNPLNYTVYLYGKGYDGDT 219 Query: 195 KDNP-KYLGSFDA-QSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLS 252 KD +Y G F + P KI FGL+WDG+ V SG G YLK+N+PHK SLY+ Sbjct: 220 KDGVLEYKGVFHPDELPYKIE---GHFGLVWDGEEVNGISGTVGHYLKYNSPHKASLYIV 276 Query: 253 MELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQISENTKIISQKIRT 312 LP+ +W ++A+ + + + IG+ V S+KE+ EI+ ++ + Y+ ENT + +K+ + Sbjct: 277 SGLPLIVWKESAIYETVKEYNIGFGVNSLKEIDEILSKVSEKDYQVWRENTIKLGKKLAS 336 Query: 313 GSYFRDVLEEVI 324 G ++++ ++ Sbjct: 337 GENVKEIINRIL 348 >UniRef50_Q04DG9 Glycosyltransferase n=1 Tax=Oenococcus oeni PSU-1 RepID=Q04DG9_OENOB Length = 306 Score = 105 bits (261), Expect = 3e-21, Method: Compositional matrix adjust. Identities = 87/293 (29%), Positives = 143/293 (48%), Gaps = 30/293 (10%) Query: 44 VQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLKF--RIVPLIHDIDEL 101 + R +V LS G D+L+ +P L+F L K R V LIHD + Sbjct: 26 INRAEKTVDLSVIKPG----DLLVHQYPSYLGDQWELNFQKELKKVGSRTVILIHDFETF 81 Query: 102 R-GGGGSDSVR---LATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDV 157 R S + L+T D +I+HN +MT L + I I++FDYL E Sbjct: 82 RIHDYKSKKIAFQVLSTADYLITHNKKMTNRL--FRINQNIFQIELFDYLSP---EKNKT 136 Query: 158 TDKQRGVIYAGNLSRHKCSFIYTEGCDFTLFGV-----NYENKDNPKYLGSFDAQSPEKI 212 T ++YAG+LS+ Y+ +FG + E D YL P+++ Sbjct: 137 TKIPTSLVYAGSLSKSSWIKNYSLKIPIDIFGRLPKKWSLEKND---YLVLHKPIIPDQL 193 Query: 213 NL-PGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVD 271 + ++GL+WD D E + +Y K N+PHK SLYL+ +PV +W+K+A+ F+++ Sbjct: 194 PIFLNNKWGLVWDEDQ-EKNKTNYQNYQKINSPHKLSLYLAANIPVIVWEKSAITKFVLE 252 Query: 272 NRIGYAVGSIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVI 324 N+IG A+ ++ E+ + + I+ S+N +S+KIR G + +L ++I Sbjct: 253 NKIGIAINNLAEIPDKIKKAEID-----SDNLDNLSKKIRGGYFTEKLLRKII 300 >UniRef50_Q042V6 Glycosyltransferase n=4 Tax=Lactobacillus gasseri RepID=Q042V6_LACGA Length = 353 Score = 96.3 bits (238), Expect = 1e-18, Method: Compositional matrix adjust. Identities = 59/246 (23%), Positives = 123/246 (50%), Gaps = 18/246 (7%) Query: 94 LIHDIDELRGGGGSDSVR------LATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYL 147 ++ DID LR S + R L + +IS N +MT++L + ++ D+L Sbjct: 105 ILEDIDPLRDKKMSTNDRKLGLESLNSNKGIISQNKKMTRFLVNQGVRVTTVELSALDFL 164 Query: 148 VSSDVEHRDVTDKQRGVIYAGNLSRHKCSFI-------YTEGCDFTLFGVNYENKD---N 197 VS+ E + ++Y GNLS + F+ + ++G+ +K N Sbjct: 165 VSNYKEKKHKKSADTIIVYGGNLSSEQAGFLNHLPISKSNNKIKYRVYGMGEMSKQLSSN 224 Query: 198 PKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPV 257 Y G F A+ E I+ +GL+W+ D ++ Y ++ PHK S+Y +P+ Sbjct: 225 AIYCGGFSAE--ESIDKLKGDWGLVWNNDGSKSNKSGQNSYYEYVCPHKLSMYAICGMPI 282 Query: 258 FIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFR 317 + K+A+ADF+++N+ G + +++E+++ +++++ + Y + +N I+ K+ G Y + Sbjct: 283 IVGKKSAMADFVINNKCGIVINNLEEIEKKINAISQQEYLEYQKNISKIASKMALGFYTQ 342 Query: 318 DVLEEV 323 + + ++ Sbjct: 343 NAIRKI 348 >UniRef50_C9A0R8 Putative uncharacterized protein n=1 Tax=Enterococcus gallinarum EG2 RepID=C9A0R8_ENTGA Length = 338 Score = 93.2 bits (230), Expect = 1e-17, Method: Compositional matrix adjust. Identities = 87/331 (26%), Positives = 148/331 (44%), Gaps = 46/331 (13%) Query: 13 DAGFKARKDALDIAS--DYENISVVNIPLWGGVVQRIIS--------SVKLSTFLCGLEN 62 D+ KA+ D DIA DY+ PL+ + R I + ++ G+ N Sbjct: 15 DSVKKAKADVCDIAKGMDYQ-------PLY---IYRYIDENEDDYALTSRIDGITAGVAN 64 Query: 63 KDVLIFNFPMAKPFWHILSFFHRLLKFRI--VPLIHDIDELRGGGGSDSVRL-ATCDMVI 119 +D++++ +P F R+ + I + IHD + LRG D L ++I Sbjct: 65 QDMVVYQYPSYNGAHFDRMFLQRMKQRGIYTILFIHDAEMLRGKVDFDEAALFNEATLLI 124 Query: 120 SHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVT----DKQRGVIYAGNLSRHKC 175 H+ M L + K+ FDY H++V+ ++ V++AGNL++ Sbjct: 125 VHSQAMQTALVERGVIRKMVQKPFFDY------RHKEVSVSHERPEKRVVFAGNLAKTLF 178 Query: 176 SFIYTEGCDFTLFGVNYENKDNP-----KYLGSFDAQSP-EKINLPGMQFGLIWDGDSVE 229 + + ++G E D P Y G F+ + K+ G FGL WD D + Sbjct: 179 LQQWPNRTEILVYG---EKNDRPFGANVHYCGVFEQEELIRKMEKNG--FGLAWD-DKL- 231 Query: 230 TCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVD 289 G + Y K+N PHK SLYLS+ +PV +W +AA+A+ + +G + I+E+ + Sbjct: 232 PAGGDYQQYTKYNAPHKISLYLSLGIPVIVWQQAAIAEMVQKLGLGIVIAGIEEIDHKLG 291 Query: 290 SMTIETYKQISENTKIISQKIRTGSYFRDVL 320 +T E ++ N S +R+G + R L Sbjct: 292 ELTDEEMLRMKNNVLSFSCLLRSGIFTRTAL 322 >UniRef50_C7G7A4 Putative uncharacterized protein n=1 Tax=Roseburia intestinalis L1-82 RepID=C7G7A4_9FIRM Length = 345 Score = 91.7 bits (226), Expect = 4e-17, Method: Compositional matrix adjust. Identities = 74/340 (21%), Positives = 153/340 (45%), Gaps = 22/340 (6%) Query: 1 MYFLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLC-G 59 +Y LN DA KA +D + +D + + ++P +++ L FL Sbjct: 5 IYVLNQRQDETFDAAGKAMRDVFSVLADKKAKIIWSVPKHCSKYLKLLDLPYLVLFLLFC 64 Query: 60 LENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDELRGGGGSDS----VR-LAT 114 ++ D + ++ P +L L K+RI+ I+D++ R G +D VR LA Sbjct: 65 VKKSDSVFYSIPENHLKIRLLKRLQLLKKYRIICFINDLNAFRYDGQNDGDPGEVRALAA 124 Query: 115 CDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSS-------DVEHRDVTDKQRGVIYA 167 D +++ N L K + + I+DY ++ ++ H + + + +A Sbjct: 125 ADKILAPNVNTVSMLKKNGISSDMIPVGIWDYRMNETQIAKIREISHAHKKENEVKIAFA 184 Query: 168 GNLSRHKCSFIYTEGCD--FTLFGVNYENKDNPKYLGSFDAQSPEKINLP----GMQFGL 221 GNL++ + + D L+G + ++ G + +P M +GL Sbjct: 185 GNLNKSEFLSVMEIPSDVRMELWGKLDQEREKTLADGCYYHGILSSDEIPFAVAEMDYGL 244 Query: 222 IWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSI 281 +WDG + G G+YL++NN HK +LYL+ +PV +W ++ +A+F+ ++ G + + Sbjct: 245 VWDGSGKDEIEGGLGEYLRYNNSHKCALYLASGIPVIVWSRSGMANFVREHACGITIDRL 304 Query: 282 KEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLE 321 ++ + + + Y+++ E ++ K+ G Y ++ Sbjct: 305 GDLDQAIHTA---DYEKLKEAALAVAPKLWEGYYLSQAID 341 >UniRef50_Q03A82 Glycosyltransferase n=1 Tax=Lactobacillus casei ATCC 334 RepID=Q03A82_LACC3 Length = 208 Score = 87.0 bits (214), Expect = 8e-16, Method: Compositional matrix adjust. Identities = 49/161 (30%), Positives = 83/161 (51%), Gaps = 6/161 (3%) Query: 161 QRGVIYAGNLSRHKCSFIYTEGCDFTLFGVNYE----NKDNPKYLGSFDAQSPEKINLPG 216 + +++AGN++ K E +FG ++ N Y GSF E N Sbjct: 33 HKKIVFAGNINNSKYLSQVPEHWHLDVFGGQPHQELLDRQNINYKGSFTPT--ELPNHFD 90 Query: 217 MQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGY 276 FGL+WD DS + G +Y + HK SLYL+ +PVFIW AA A+++ +N +G+ Sbjct: 91 GGFGLVWDSDSFDEVIGEPAEYNRLCYEHKLSLYLAKRMPVFIWKHAAAANWVTENHVGF 150 Query: 277 AVGSIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFR 317 AV ++ ++ I+++ T + Y + N +S+ IR G + + Sbjct: 151 AVENLADIWPIIENFTEDQYNAMQPNLARVSKLIRNGVFAK 191 >UniRef50_B1MXC4 Glycosyltransferase n=3 Tax=Leuconostoc RepID=B1MXC4_LEUCK Length = 327 Score = 73.9 bits (180), Expect = 8e-12, Method: Compositional matrix adjust. Identities = 69/285 (24%), Positives = 122/285 (42%), Gaps = 28/285 (9%) Query: 53 LSTFLCGLENKDVLIFNFP--MAKPFWHILSFFHRLLKFRIVP---LIHDIDELRGGGGS 107 ++++L + D+++ FP M++ F F + LK R V LIHDI+ LR Sbjct: 55 INSWLQQVNTSDIVLHQFPSYMSEKF---EVQFAKTLKARQVKRAILIHDIEPLRLMKHP 111 Query: 108 --DSVRLATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVI 165 + L D+VI H+ M L + +FDYL S + Sbjct: 112 IWEFDLLNLYDIVIVHSQAMKVQLQSLGVTSQFIIQPLFDYLGLS----YPFVSFSHEIN 167 Query: 166 YAGNLSRHKCSFIYTEGCDFTLFGVNYEN------KDNPKYLGSFDAQSPEKINLPGMQ- 218 +AG + + LFG + N Y G+ D PE++ + Sbjct: 168 FAGTFQKSPW-LQQAQNVHINLFGAKPKKWRDTTFPANVTYKGNLD---PEQLIMAFRDG 223 Query: 219 FGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAV 278 FGLIWD D + + Y K+N PHK SLY+ LP+ W ++A+ I + IG+ + Sbjct: 224 FGLIWDNDFEDKT---YKTYTKYNAPHKASLYIRAGLPLIAWRESAIGQIIAEQEIGFVI 280 Query: 279 GSIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEV 323 + ++ + T + +N + ++Q++ +G + + L ++ Sbjct: 281 DKLNQLPAQLSETTAAQFNLWQQNMQPLAQQLASGYFTKATLTQL 325 >UniRef50_B1MVL6 Putative glycosyl transferase n=1 Tax=Leuconostoc citreum KM20 RepID=B1MVL6_LEUCK Length = 559 Score = 73.9 bits (180), Expect = 8e-12, Method: Compositional matrix adjust. Identities = 71/292 (24%), Positives = 138/292 (47%), Gaps = 31/292 (10%) Query: 50 SVKLSTFLCGLENKDVLIFNFPMAKPFW--HILSFFHRLLKFRIVPLIHDIDELRGGGGS 107 + +L + + D +++ +P P ++L++FH ++ IHDI+ LR + Sbjct: 275 TAQLEKHCLQINSGDTVVWQYPKYSPQLELNMLNWFHNR-GIKVASFIHDINLLREEPLN 333 Query: 108 --------DSVRLATCDMVISHNPQMTKYLSKYMSQDKIKDI---KIFDYLVSSDVEHRD 156 D + L++ D I P+ + ++ K+K+I K +D+++ V Sbjct: 334 REHYLPEYDKILLSSFDANIV--PEKFEQALYSLANVKLKNIVALKPYDFIIQKPVLPAT 391 Query: 157 VTDKQRGVIYAGNLSRHKCSFIYTEGCDF--TLFG-VNYENKD--NPKYLGSFDAQSPEK 211 + + ++YAG+L++ F E DF T++G N+ + + NPK + + E Sbjct: 392 YS---QDIVYAGSLAK----FPALEDIDFNLTVYGEKNFSDVNFVNPKIIDGGFLPAEEL 444 Query: 212 INLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVD 271 + FGLIWD D A Y K+N P+K SLY+ LPV W ++A+A I Sbjct: 445 ASSLNNGFGLIWDEDRQNPYRQA---YTKWNWPYKFSLYMVSGLPVIAWSESAIAKLIES 501 Query: 272 NRIGYAVGSIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEV 323 +G+ V + ++ V S++ + +++ N I K+ G+ + L+++ Sbjct: 502 ENLGFIVTDLSQIASKVRSISQTEFNEMAANAAEIGNKLAHGNSTKTALKKL 553 >UniRef50_C7TIE1 Glycosyl transferase, galactofuranosyltransferase n=2 Tax=Lactobacillus rhamnosus RepID=C7TIE1_LACRL Length = 338 Score = 69.7 bits (169), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 48/160 (30%), Positives = 81/160 (50%), Gaps = 17/160 (10%) Query: 166 YAGNLSRHKCSFI--YTEGCDFTLFG-------VNYENKDNPKYLGSFDAQSPEKINLP- 215 YAGNL K F+ + E ++G + + D+ +YLGS+ E++ L Sbjct: 168 YAGNLVDRKAGFLQNFPENLHIKVYGSADGKTDLPFSLADSVEYLGSY---RQEELALAL 224 Query: 216 GMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIG 275 +GLIWD D F Y + N HK SLYLS+ LPV ++ A+ ++ +N +G Sbjct: 225 NDGYGLIWDEDK----EHHFDPYARINMTHKFSLYLSLGLPVIACNQTAIGRYVSENGLG 280 Query: 276 YAVGSIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSY 315 A+ S+ + I++ +T + + +I + IS IR+G + Sbjct: 281 IAIDSLDNLGNIIEGVTEDDFNRIVDKVANISDLIRSGRH 320 >UniRef50_B1I7N2 Nss n=10 Tax=Streptococcus pneumoniae RepID=B1I7N2_STRPI Length = 336 Score = 64.3 bits (155), Expect = 6e-09, Method: Compositional matrix adjust. Identities = 63/283 (22%), Positives = 124/283 (43%), Gaps = 16/283 (5%) Query: 52 KLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRL--LKFRIVPLIHDIDELRGGGG--- 106 +L + + D+L+F P F F +L ++ +I+ IHD+ L Sbjct: 54 RLDGIMASISIGDILVFQSPTWNGFEFDRLLFDKLKDMQVKIICFIHDVVPLMFDSNYYL 113 Query: 107 -SDSVRLAT-CDMVISHNPQM-TKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRG 163 D + + D++I + +M T+ + + ++ KI ++D+ D+ K+ Sbjct: 114 MKDYMYMYNLSDVLIVPSERMKTRLMEEGLTTKKILVQGMWDH--PHDLSLYTPAFKKE- 170 Query: 164 VIYAGNLSRHKCSFIYTEGCDFTLFGVNYENKDNPKYLGSFDAQSPEKI--NLPGMQFGL 221 + +AG+L R +++ +F E + + L + E++ L FGL Sbjct: 171 LFFAGSLERFPDLQNWSQDTPLRVFSNKGEASSSARNLSIEGWKKDEELLLELSKGGFGL 230 Query: 222 IWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSI 281 +W G Y N HK S YL+ +PV + + A FIVD +G+ S+ Sbjct: 231 VW---GTYQNDGESNQYYTLNISHKVSTYLTAGIPVIVPSSLSTAKFIVDQGLGFVANSL 287 Query: 282 KEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVI 324 +E+ IVD M ++ Y++++ K S ++ G + + + + I Sbjct: 288 EEVHAIVDKMNLQEYQEMTNRIKTFSYLLKEGYFTKKLFVDAI 330 >UniRef50_C4ZG42 Putative uncharacterized protein n=1 Tax=Eubacterium rectale ATCC 33656 RepID=C4ZG42_EUBR3 Length = 345 Score = 63.5 bits (153), Expect = 9e-09, Method: Compositional matrix adjust. Identities = 70/285 (24%), Positives = 122/285 (42%), Gaps = 22/285 (7%) Query: 52 KLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLKFR---IVPLIHDIDELRGGGGSD 108 +L + L D++IF +P + SF +++ +R ++ + DI +L Sbjct: 52 RLDGIIAPLNYGDIVIFQYPSWIGVNYDESFVNKIKSYRDTKLIIFVQDIQKLMFDSEQA 111 Query: 109 SV-----RLATCDMVISHNPQMTKYLSKY-MSQDKIKDIKIFDYLVSSDVEHRDVTDKQR 162 + L D++I + +M +YL + + + + I+D + SD+ D R Sbjct: 112 ILDMEIKTLNKADLLILPSKKMHRYLKENGLDEKPVIYQTIWD--MPSDICFVDHA-VTR 168 Query: 163 GVIYAGNLSRHKCSFIYTEGCDFTLFGVN---YENKDNPKYLGSFDAQSPEKINLPGMQF 219 +AGN +R Y + N EN D+ + G F+ Q L F Sbjct: 169 CFHFAGNYNRFPFLAEYHGKTPIYQYDANKPDRENDDSFCWKGYFE-QEKLMHELSKGGF 227 Query: 220 GLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVG 279 GL+W D F Y N P+K L+ +PV + F+ N +GYAV Sbjct: 228 GLVWSDDEY------FDRYYSMNQPYKLGTNLAAGIPVIVKRGCVHDKFVERNGLGYAVD 281 Query: 280 SIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVI 324 ++ E ++V S+T Y ++ N K I + I G+Y R +L++ I Sbjct: 282 TLDEADKLVQSITDAEYIELYRNVKNIQKLILDGAYTRKILQDAI 326 >UniRef50_A3CM54 Nucleotide sugar synthetase-like protein, putative n=7 Tax=Firmicutes RepID=A3CM54_STRSV Length = 334 Score = 62.4 bits (150), Expect = 2e-08, Method: Compositional matrix adjust. Identities = 65/290 (22%), Positives = 128/290 (44%), Gaps = 27/290 (9%) Query: 50 SVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFH------RLLKFRIVPLIHDIDELRG 103 S +L + + DV+I+ P W+ F ++L+ +++ IHD+ L Sbjct: 52 SRRLDGIMASVGYGDVVIYQ----SPTWNGREFDQAFISKLKILQAKLITFIHDVPPLMF 107 Query: 104 GGG----SDSVRLAT-CDMVISHNPQMT-KYLSKYMSQDKIKDIKIFDYLVSSDVEHRDV 157 + + + D VI + QM K +++ ++ DKI +++D+ + Sbjct: 108 PSNYYLMPEYIDMYNQSDAVIVPSEQMRDKLVAEGLTVDKILVQRMWDHPYDLPLHQPQF 167 Query: 158 TDKQRGVIYAGNLSRHKCSFIYTEGCDFTLFGVNYENKDNPKYLGSFDA--QSPEKI-NL 214 K + +AG++ R ++ +F E + NP+ S+ PE + L Sbjct: 168 APK---LYFAGSVERFPHLINWSYATPLEIFSP--EEESNPEANVSYRGWVSRPELLLEL 222 Query: 215 PGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRI 274 GL+W VE +Y N HK++ YL+ +PV + + A+ I D + Sbjct: 223 SKGGLGLVW---GVEENPADEPEYYGLNISHKSATYLAAGIPVIVPSYLSNAELIRDRGL 279 Query: 275 GYAVGSIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVI 324 G+ V S++E IV+++T E Y+ + E + S ++ G + + VL + + Sbjct: 280 GFVVDSLEEASRIVENLTAEEYQAMVERVRKFSFLLKEGYFSKKVLVDAV 329 >UniRef50_Q5ULS2 Orf42 n=1 Tax=Lactobacillus phage LP65 RepID=Q5ULS2_9CAUD Length = 337 Score = 61.2 bits (147), Expect = 5e-08, Method: Compositional matrix adjust. Identities = 67/248 (27%), Positives = 116/248 (46%), Gaps = 21/248 (8%) Query: 91 IVPLIHDIDELRGGGG--SDSVRLATC--DMVISHNPQMTKYLSKYMSQDKIKDIKIFDY 146 ++ L+HDI+ RG SD +L +V++ + +S I ++++ Y Sbjct: 93 VIGLVHDIEYARGFSTDFSDQYKLLKLYDGLVVTGHRIKAIIQESGISSIPITCMELWPY 152 Query: 147 LVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIYT-EGCDFT-LFG--VNYENKDNPKYLG 202 L + VEHR + R + YAGNLSR F + EG + ++G V+ N + + Sbjct: 153 LTNYVVEHRIEPNNNR-IEYAGNLSRSNGLFSKSLEGIEHVDVWGKQVDRSNSEKTGLVV 211 Query: 203 SFDAQSPEKINLPGM---QFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFI 259 A P+ +LP +GL+W D + DY K N HK SLYLS +LP+ + Sbjct: 212 QHGAVHPD--DLPARLYSGYGLVWYVDR------KYQDYTKINVSHKASLYLSAKLPLIV 263 Query: 260 WDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDV 319 + L++ + +IG V + E+ E + S + K ++ + I I +GS F + Sbjct: 264 SSSSYLSELVDKYKIGICVDRLDEIPEKLLSRN-DYCKYVNNIEEHIYDSISSGSCFTEP 322 Query: 320 LEEVIDDL 327 +++ L Sbjct: 323 FVDLMSKL 330 >UniRef50_Q3DVD0 Nucleotide sugar synthetase-like protein n=9 Tax=Streptococcus agalactiae RepID=Q3DVD0_STRAG Length = 335 Score = 59.7 bits (143), Expect = 1e-07, Method: Compositional matrix adjust. Identities = 69/290 (23%), Positives = 121/290 (41%), Gaps = 34/290 (11%) Query: 50 SVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLKF--RIVPLIHDIDELRGGGG- 106 S ++ + GL D+++F P F +L + RI+ +HDI L Sbjct: 52 STRMDGIIAGLGRGDIVVFQVPTWNSTEFDELFLDKLQAYGARIITFVHDIVPLMFESNF 111 Query: 107 --SDSV--RLATCDMVISHNPQMTKYL-SKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQ 161 D V D+VI M YL K M+ K+ +++D+ V+ D+ + Q Sbjct: 112 YLLDRVIDMYNRSDVVILPTKAMHDYLIEKGMTTSKVLYQEVWDHPVNIDLPRPEC---Q 168 Query: 162 RGVIYAGNLSRHKCSFIYTEGCDFTLFG----VNYEN-------KDNPKYLGSFDAQSPE 210 + + +AG++ R + E +G +N E KD+ + + S + Sbjct: 169 KVLSFAGDIQRFPFVNDWKENIPLIYYGDGSRLNSEANVHAQGWKDDVELMLSLSKRG-- 226 Query: 211 KINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIV 270 FGL W D E Y + N +K S +L+ LP+ + DFI Sbjct: 227 -------GFGLCWSEDREELVERR---YSRMNASYKLSTFLAAGLPIIANHDISSRDFIK 276 Query: 271 DNRIGYAVGSIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVL 320 + +G+ V +++E E +++M ETY EN + I+ +R G + +L Sbjct: 277 QHGLGFTVETLEEAVEKINNMEKETYDSYVENVEKIATLLRNGYITKKLL 326 >UniRef50_C0XA00 Possible galactofuranosyl transferase n=3 Tax=Lactobacillus RepID=C0XA00_9LACO Length = 337 Score = 51.6 bits (122), Expect = 4e-05, Method: Compositional matrix adjust. Identities = 28/106 (26%), Positives = 53/106 (50%), Gaps = 6/106 (5%) Query: 219 FGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAV 278 FGL+W + + +Y+ N +K S YL+ +P+ + K A+ I +IG Sbjct: 232 FGLVWSEEPY------WSEYMTMNTSYKLSTYLAAGIPIIVNSKTPEAETIKRKKIGIIA 285 Query: 279 GSIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVI 324 S+ E Q V + + YK+I++N + ++ IR G + + L + + Sbjct: 286 DSLAEAQAKVLQVNDDEYKEITDNVESFAKLIREGYFTKKALADAV 331 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P37749 Uncharacterized protein yefG n=5 Tax=Escherichia... 417 e-115 UniRef50_UPI000196921F hypothetical protein BACCELL_02894 n=1 Ta... 344 2e-93 UniRef50_A7AHD2 Putative uncharacterized protein n=1 Tax=Parabac... 333 4e-90 UniRef50_A5ZF92 Putative uncharacterized protein n=2 Tax=Bactero... 327 4e-88 UniRef50_D1PTN6 Putative uncharacterized protein n=1 Tax=Prevote... 326 8e-88 UniRef50_C2F0P9 Galactofuranosyltransferase n=2 Tax=Lactobacillu... 325 1e-87 UniRef50_C9LJY2 Putative uncharacterized protein n=1 Tax=Prevote... 324 2e-87 UniRef50_A0Z7X9 Putative uncharacterized protein n=1 Tax=marine ... 323 5e-87 UniRef50_C2E8T4 Possible galactofuranosyltransferase n=1 Tax=Lac... 316 1e-84 UniRef50_Q4JYT0 Putative glycosyl transferase n=1 Tax=Streptococ... 315 1e-84 UniRef50_Q03GL2 Glycosyltransferase n=1 Tax=Pediococcus pentosac... 311 2e-83 UniRef50_D1PDY2 Putative galactofuranosyltransferase n=1 Tax=Pre... 308 2e-82 UniRef50_C2EVL7 Possible galactofuranosyltransferase n=1 Tax=Lac... 304 4e-81 UniRef50_Q4JZC8 Putative glycosyl transferase n=2 Tax=Streptococ... 302 1e-80 UniRef50_C0WVC5 Possible galactofuranosyltransferase n=2 Tax=Lac... 301 3e-80 UniRef50_D0RXG2 Galactofuranose transferase n=13 Tax=Streptococc... 298 1e-79 UniRef50_C0BRQ3 Putative uncharacterized protein n=2 Tax=Bifidob... 298 2e-79 UniRef50_B0BR56 Glycosyltransferase n=1 Tax=Actinobacillus pleur... 298 2e-79 UniRef50_D0BKT6 Galactofuranosyltransferase n=1 Tax=Granulicatel... 298 2e-79 UniRef50_Q4JYV0 Putative glycosyl transferase n=2 Tax=Streptococ... 297 4e-79 UniRef50_Q1WU31 Galactofuranosyltransferase n=2 Tax=Lactobacillu... 296 7e-79 UniRef50_B0N1W1 Putative uncharacterized protein n=1 Tax=Clostri... 294 2e-78 UniRef50_C2EH03 Possible galactofuranosyltransferase n=1 Tax=Lac... 293 5e-78 UniRef50_C2FKS4 Possible galactofuranosyltransferase n=1 Tax=Lac... 293 7e-78 UniRef50_C0YXY6 Possible galactofuranosyltransferase n=5 Tax=Lac... 290 5e-77 UniRef50_Q032N6 Glycosyltransferase n=1 Tax=Lactococcus lactis s... 286 6e-76 UniRef50_D0R4M2 Putative glycosyltransferase n=1 Tax=Lactobacill... 279 8e-74 UniRef50_UPI000196CD65 hypothetical protein CATMIT_02517 n=1 Tax... 276 7e-73 UniRef50_C9A0R8 Putative uncharacterized protein n=1 Tax=Enteroc... 275 2e-72 UniRef50_C9LPN1 Galactofuranosyltransferase n=2 Tax=Veillonellac... 274 3e-72 UniRef50_B0P5G1 Putative uncharacterized protein n=1 Tax=Clostri... 273 5e-72 UniRef50_C7IU57 Putative uncharacterized protein n=1 Tax=Thermoa... 272 2e-71 UniRef50_A8RK64 Putative uncharacterized protein n=1 Tax=Clostri... 269 1e-70 UniRef50_C7XW37 Glycosyltransferase n=1 Tax=Lactobacillus coleoh... 268 2e-70 UniRef50_Q7P740 Nucleotide sugar synthetase n=1 Tax=Fusobacteriu... 267 3e-70 UniRef50_UPI0001968A2E hypothetical protein BACCELL_04078 n=1 Ta... 266 7e-70 UniRef50_C7G7A4 Putative uncharacterized protein n=1 Tax=Rosebur... 265 2e-69 UniRef50_C7TE97 Glycosyl transferase,galactofuranosyltransferase... 259 1e-67 UniRef50_A2RHU2 Putative galactofuranose transferase n=1 Tax=Lac... 258 2e-67 UniRef50_C3QC04 Galactofuranosyltransferase n=3 Tax=Bacteroides ... 256 6e-67 UniRef50_B1MXC4 Glycosyltransferase n=3 Tax=Leuconostoc RepID=B1... 248 3e-64 UniRef50_B1I7N2 Nss n=10 Tax=Streptococcus pneumoniae RepID=B1I7... 246 9e-64 UniRef50_A3CM54 Nucleotide sugar synthetase-like protein, putati... 245 2e-63 UniRef50_C6Z1L9 Glycosyltransferase n=1 Tax=Bacteroides sp. 4_3_... 241 2e-62 UniRef50_C4ZG42 Putative uncharacterized protein n=1 Tax=Eubacte... 239 9e-62 UniRef50_A7HN15 Galactofuranosyltransferase n=1 Tax=Fervidobacte... 239 1e-61 UniRef50_Q3DVD0 Nucleotide sugar synthetase-like protein n=9 Tax... 236 7e-61 UniRef50_C4Z1X5 Putative uncharacterized protein n=1 Tax=Eubacte... 232 2e-59 UniRef50_Q04DG9 Glycosyltransferase n=1 Tax=Oenococcus oeni PSU-... 231 3e-59 UniRef50_Q042V6 Glycosyltransferase n=4 Tax=Lactobacillus gasser... 224 4e-57 UniRef50_C7TIE1 Glycosyl transferase, galactofuranosyltransferas... 220 7e-56 UniRef50_C0XA00 Possible galactofuranosyl transferase n=3 Tax=La... 209 8e-53 UniRef50_B1MVL6 Putative glycosyl transferase n=1 Tax=Leuconosto... 206 8e-52 UniRef50_Q5ULS2 Orf42 n=1 Tax=Lactobacillus phage LP65 RepID=Q5U... 196 1e-48 UniRef50_Q03A82 Glycosyltransferase n=1 Tax=Lactobacillus casei ... 194 5e-48 Sequences not found previously or not previously below threshold: UniRef50_B5A7L9 Nucleotide sugar synthetase-like protein n=3 Tax... 191 2e-47 UniRef50_C4ZG41 Putative uncharacterized protein n=1 Tax=Eubacte... 111 2e-23 UniRef50_C2EWJ6 Putative uncharacterized protein n=1 Tax=Lactoba... 76 1e-12 UniRef50_C5RCT8 Possible transposase n=2 Tax=Lactobacillales Rep... 69 3e-10 UniRef50_C5RCT9 Putative uncharacterized protein n=1 Tax=Weissel... 60 8e-08 UniRef50_B0VJD0 Putative uncharacterized protein n=1 Tax=Candida... 60 1e-07 UniRef50_A8TGW1 Glycosyl transferase group 1 n=2 Tax=Methanococc... 52 2e-05 UniRef50_B8DZD9 Glycosyl transferase group 1 n=2 Tax=Bacteria Re... 50 1e-04 UniRef50_A3XA25 Glycosyl transferase, group 1 n=1 Tax=Roseobacte... 50 1e-04 UniRef50_B9KB31 Putative uncharacterized protein n=1 Tax=Thermot... 49 3e-04 UniRef50_C4N530 Putative glycosyltransferase n=1 Tax=Capnocytoph... 48 5e-04 UniRef50_B8EDB2 Putative uncharacterized protein n=1 Tax=Shewane... 48 6e-04 UniRef50_A8U9E3 Putative uncharacterized protein n=1 Tax=Carnoba... 46 0.001 UniRef50_C4Z1X4 Putative uncharacterized protein n=1 Tax=Eubacte... 45 0.003 UniRef50_A7ZC10 Glycosyl transferase, group 1 family protein n=1... 45 0.005 UniRef50_Q1NU87 Glycosyl transferase, group 1 n=2 Tax=Proteobact... 45 0.005 UniRef50_B5EGU3 Putative uncharacterized protein n=1 Tax=Geobact... 44 0.006 UniRef50_Q0W4G3 Glycosyltransferase (Group 1) n=1 Tax=uncultured... 44 0.007 UniRef50_Q83GT6 Glycosyltransferase domain-containing protein n=... 44 0.009 UniRef50_B5JPV4 Glycosyl transferase, group 2 family protein n=1... 43 0.011 UniRef50_Q1WS01 Glycosyltransferase n=2 Tax=Lactobacillus saliva... 43 0.012 UniRef50_D1Y981 Glycosyltransferase, group 1 family protein n=2 ... 43 0.012 UniRef50_C3NKC1 Glycosyl transferase group 1 n=4 Tax=Thermoprote... 43 0.012 UniRef50_UPI0001C4246F glycosyltransferase n=1 Tax=Bacillus pseu... 43 0.014 UniRef50_B3EBP3 Glycosyl transferase family 2 n=1 Tax=Geobacter ... 43 0.014 UniRef50_B5YCR2 WbpH n=10 Tax=Bacteria RepID=B5YCR2_DICT6 43 0.016 UniRef50_B8GDE8 Glycosyl transferase group 1 n=1 Tax=Methanospha... 43 0.016 UniRef50_B9P363 Glycosyl transferase, group 1 n=1 Tax=Prochloroc... 43 0.017 UniRef50_C2HRD4 Glycosyl transferase group 1 family protein n=1 ... 43 0.018 UniRef50_A8ZY94 Glycosyl transferase group 1 n=1 Tax=Desulfococc... 43 0.019 UniRef50_Q2LWM1 Glycosyltransferase n=1 Tax=Syntrophus aciditrop... 43 0.020 UniRef50_C6LKZ9 Putative glycosyl transferase n=1 Tax=Bryantella... 43 0.020 UniRef50_B9YG53 Glycosyl transferase group 1 n=1 Tax='Nostoc azo... 42 0.022 UniRef50_A8RK70 Putative uncharacterized protein n=2 Tax=Clostri... 42 0.022 UniRef50_A5D386 Glycosyltransferase n=1 Tax=Pelotomaculum thermo... 42 0.024 UniRef50_C8P2S3 Glycosyltransferase n=1 Tax=Erysipelothrix rhusi... 42 0.025 UniRef50_Q8AAS2 Lipopolysaccharide biosynthesis RfbU-related pro... 42 0.028 UniRef50_A8F8F9 Putative uncharacterized protein n=1 Tax=Thermot... 42 0.029 UniRef50_Q1IPW4 Glycosyl transferase, group 1 n=1 Tax=Candidatus... 42 0.030 UniRef50_Q2ILL1 Glycosyl transferase, group 1 n=1 Tax=Anaeromyxo... 42 0.031 UniRef50_C0BNJ6 Glycosyl transferase group 1 n=1 Tax=Flavobacter... 42 0.032 UniRef50_A8RE04 Putative uncharacterized protein n=1 Tax=Eubacte... 42 0.032 UniRef50_Q9YCS3 Glycosyl transferase, group 1 n=1 Tax=Aeropyrum ... 42 0.035 UniRef50_B5EVI1 Glycosyltransferase n=6 Tax=Vibrionales RepID=B5... 42 0.036 UniRef50_D2EEQ9 Glycosyl transferase group 1 n=1 Tax=Candidatus ... 41 0.039 UniRef50_A9BF89 Glycosyl transferase group 1 n=8 Tax=Thermotogac... 41 0.047 UniRef50_C5CAF6 Putative uncharacterized protein n=1 Tax=Microco... 41 0.049 UniRef50_C6P8E8 Glycosyl transferase group 1 n=1 Tax=Thermoanaer... 41 0.058 UniRef50_D1YZX9 Putative glycosyltransferase n=1 Tax=Methanocell... 41 0.061 UniRef50_D1JK96 WbpH n=1 Tax=Bacteroides sp. 2_1_16 RepID=D1JK96... 41 0.063 UniRef50_B7L1A4 Glycosyl transferase group 1 n=1 Tax=Methylobact... 41 0.065 UniRef50_Q467B6 Mannose-6-phosphate isomerase, bifunctional enzy... 41 0.066 UniRef50_Q39W08 Putative uncharacterized protein n=1 Tax=Geobact... 41 0.068 UniRef50_A3DHW2 Glycosyl transferase, group 1 n=3 Tax=Clostridiu... 41 0.069 UniRef50_B5IQU9 Glycosyl transferase, group 1 family protein n=1... 41 0.070 UniRef50_B5M6M0 Glycosyltransferase n=2 Tax=Kosmotoga olearia TB... 41 0.074 UniRef50_A6LJY0 Putative uncharacterized protein n=1 Tax=Thermos... 41 0.077 UniRef50_Q8RBZ2 Predicted glycosyltransferases n=1 Tax=Thermoana... 40 0.094 >UniRef50_P37749 Uncharacterized protein yefG n=5 Tax=Escherichia coli RepID=YEFG_ECOLI Length = 330 Score = 417 bits (1073), Expect = e-115, Method: Composition-based stats. Identities = 330/330 (100%), Positives = 330/330 (100%) Query: 1 MYFLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGL 60 MYFLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGL Sbjct: 1 MYFLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGL 60 Query: 61 ENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDELRGGGGSDSVRLATCDMVIS 120 ENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDELRGGGGSDSVRLATCDMVIS Sbjct: 61 ENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDELRGGGGSDSVRLATCDMVIS 120 Query: 121 HNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIYT 180 HNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIYT Sbjct: 121 HNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIYT 180 Query: 181 EGCDFTLFGVNYENKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLK 240 EGCDFTLFGVNYENKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLK Sbjct: 181 EGCDFTLFGVNYENKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLK 240 Query: 241 FNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQIS 300 FNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQIS Sbjct: 241 FNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQIS 300 Query: 301 ENTKIISQKIRTGSYFRDVLEEVIDDLKTR 330 ENTKIISQKIRTGSYFRDVLEEVIDDLKTR Sbjct: 301 ENTKIISQKIRTGSYFRDVLEEVIDDLKTR 330 >UniRef50_UPI000196921F hypothetical protein BACCELL_02894 n=1 Tax=Bacteroides cellulosilyticus DSM 14838 RepID=UPI000196921F Length = 345 Score = 344 bits (883), Expect = 2e-93, Method: Composition-based stats. Identities = 103/341 (30%), Positives = 172/341 (50%), Gaps = 15/341 (4%) Query: 2 YFLNDLNFSRRDAGFKARKDALDIAS--DYENISVVNIPLWGGVVQRIISSVKLSTFLCG 59 Y+L+ +AG KA+ D + S Y+N + + +I+ + L Sbjct: 4 YYLSKNYNGLNNAGNKAKTDIEETLSKLGYKNAGLPQTTYSNKIAGFLITLAGVLKVLFT 63 Query: 60 LENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDELRGGG---GSDSVRLATCD 116 + DV++ +P K + + + H L + +++ +IHD+ R + RL D Sbjct: 64 VSANDVVVVQYPFKKYYSFVCNIIH-LKRGKVITIIHDLGTFRRKKLTAEQEIKRLNHSD 122 Query: 117 MVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTD-KQRGVIYAGNLSRHKC 175 ++I HN +M +L + + ++IFDYL S + + K VIYAG L+ K Sbjct: 123 VLIVHNDKMEIWLKEQGYTKPMVCLEIFDYLSPSVNNNTQEPNQKPIKVIYAGALTYKKN 182 Query: 176 SFIYTEG-----CDFTLFGVNYEN---KDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDS 227 ++Y+ F L+G +E +D + S + I FGLIW+GDS Sbjct: 183 RYLYSLNDVMSKWQFELYGGGFEEAKIEDKTLFKFKGFVPSDQLIEQVSAHFGLIWEGDS 242 Query: 228 VETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEI 287 + TCSG FG YL+ NNPHK SLY+ LP+ IW +AALA F+ +N+IG + S++E+ I Sbjct: 243 IHTCSGDFGIYLRINNPHKVSLYIRCNLPIIIWKEAALASFVAENKIGVCIDSLEELDSI 302 Query: 288 VDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDDLK 328 + S++ E+Y ++ N K I++KI +G Y + +E L+ Sbjct: 303 LSSISAESYNEMVRNIKEINKKIASGYYCKRAVENAESLLQ 343 >UniRef50_A7AHD2 Putative uncharacterized protein n=1 Tax=Parabacteroides merdae ATCC 43184 RepID=A7AHD2_9PORP Length = 352 Score = 333 bits (855), Expect = 4e-90, Method: Composition-based stats. Identities = 95/338 (28%), Positives = 163/338 (48%), Gaps = 14/338 (4%) Query: 3 FLNDLNFSRRDAGFKARKDALDIASD--YENISVVNIPLWGGVVQRIISSVKLSTFLCGL 60 + + AG KA+ D I D Y NI + ++ +I+ + + L Sbjct: 5 YFSKCYKELYSAGSKAKTDMEQIMCDLGYRNIGFPCLVCSNKILGFVITLLSMIKVCFKL 64 Query: 61 ENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDELRGGGG---SDSVRLATCDM 117 + D+LI +P+ K + + + H +++ LIHD+ R + RL D Sbjct: 65 RSGDILIIQYPLKKYYTLLCNIVH-YRGAKVITLIHDLGSFRRKRLTVLQEIDRLQNSDY 123 Query: 118 VISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSF 177 +I+ N M+ +L + ++KI+DYL + V ++ V+YAG L +K F Sbjct: 124 LITLNDSMSAWLQTKGCEVPKGELKIWDYLSPAIVLNKIEPATDYTVVYAGALGYNKNRF 183 Query: 178 IYTEG-----CDFTLFGVNYENKD--NPKYLGS-FDAQSPEKINLPGMQFGLIWDGDSVE 229 +Y +++G E N +Y + + I+ FGL+WDGDS E Sbjct: 184 LYELDRLPRQWHLSVYGKGLEADKILNKEYFSYNGFLPADQLISSVQGDFGLVWDGDSYE 243 Query: 230 TCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVD 289 C+G +G+YL++NNPHK SLY+ LP+ IW+KAALA FI + IG + S++E+ ++ Sbjct: 244 ACTGNYGEYLRYNNPHKVSLYVRCHLPLIIWEKAALAPFIKEKEIGICINSLEELDGKLE 303 Query: 290 SMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDDL 327 +T++ Y ++ IS + G +F L+E + L Sbjct: 304 KLTVDDYFKMKSRVIEISNLLSVGYFFTKALDEAVKFL 341 >UniRef50_A5ZF92 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=A5ZF92_9BACE Length = 345 Score = 327 bits (838), Expect = 4e-88, Method: Composition-based stats. Identities = 95/339 (28%), Positives = 161/339 (47%), Gaps = 17/339 (5%) Query: 3 FLNDLNFSRRDAGFKARKDALDIAS--DYENISVVNIPLWGGVVQRIISSVKLSTFLCGL 60 +L+ +AG KA+ D I + N+ + VV + + + L Sbjct: 4 YLSRNYRGVDNAGNKAKTDIEQIMESHGFRNVGLKQTRYRNVVVAFCRTLFSVLKSILCL 63 Query: 61 ENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDELRGGGG---SDSVRLATCDM 117 DVL+ +P+ K + + + H L ++V LIHD+ R + RL D Sbjct: 64 RKGDVLVLQYPLKKYYAFVCNMAH-LRGCKVVTLIHDLGSFRRKKLTIPQEIARLDHSDC 122 Query: 118 VISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSS-DVEHRDVTDKQRGVIYAGNLSRHKCS 176 VI H+ +M +L ++ + K++ ++IFDYL S V D +++ G LS + Sbjct: 123 VIVHSERMRDWLLEHGIKAKLQILEIFDYLSDSQPVAGNDSPKSPNRILFVGALSSYHND 182 Query: 177 FIYTE-----GCDFTLFGVNYENKD---NPKYLGSFDAQSPEKINLPGMQFGLIWDGDSV 228 F+Y + D L+G E + Y G S E I ++GL W G S+ Sbjct: 183 FLYKQVNSPRSYDIVLYGSGLETEKLEGKVDYKG--FVSSDELIATAEGEYGLAWYGSSL 240 Query: 229 ETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIV 288 E SGA G+YL++N PHK SLY+ LP+ +W+KA LA F+ N +G + S+ E+++I+ Sbjct: 241 EGGSGALGEYLQYNAPHKMSLYIRCGLPIIVWEKAGLAPFVKKNNVGICISSLTELEDIL 300 Query: 289 DSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDDL 327 ++ Y ++ +N I+ K+ G Y +++ DL Sbjct: 301 PKISAGQYMEMKKNVLQIADKLSHGYYCFKAIKQACADL 339 >UniRef50_D1PTN6 Putative uncharacterized protein n=1 Tax=Prevotella bergensis DSM 17361 RepID=D1PTN6_9BACT Length = 338 Score = 326 bits (835), Expect = 8e-88, Method: Composition-based stats. Identities = 101/325 (31%), Positives = 163/325 (50%), Gaps = 13/325 (4%) Query: 16 FKARKDALDIASDYENISVVNIPLWGGVVQRIISSVK-LSTFLCGLENKDVLIFNFPMAK 74 KA+KD + +++ + G + R ++ + + L L+ DVL +PM K Sbjct: 11 NKAKKDIDTVVEQLGYVNLSKVQCGNGGIGRFLTKLLAMVNILTTLKRDDVLFLQYPMKK 70 Query: 75 PFWHILSFFHRLLKFRIVPLIHDIDELRGGG---GSDSVRLATCDMVISHNPQMTKYLSK 131 + + H L ++V +IHD+ R ++ + D +I+HNP MT+YL + Sbjct: 71 FYKMACTLAH-LKGAKVVTVIHDLGAFRRHKLTPEQENRLFSKTDFLIAHNPTMTEYLQQ 129 Query: 132 YMSQDKIKDIKIFDYLVSSDVEHRDVTDKQR-GVIYAGNLSRHKCSFIYT-----EGCDF 185 + Q + + IFDYL + V + ++YAGNL + F+Y + Sbjct: 130 HGFQGGVHHLGIFDYLSAKPVRQPNAQPHDPWRIVYAGNLGVWRNEFLYHLDTAIKHWTL 189 Query: 186 TLFGVNYENKDNPKYLGSFDA--QSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNN 243 L+G +E K N ++ S E I FGL+WDG SV+ C+GA+G+YLK NN Sbjct: 190 DLYGKGFEPKKNNCQKLTYHGFIDSDEFIERVDADFGLVWDGASVDECNGAWGEYLKINN 249 Query: 244 PHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQISENT 303 PHKTS YL +PV +W K+A+A FI N +G V S+ E+ ++ +T E Y+ + N Sbjct: 250 PHKTSFYLRAGIPVIVWSKSAMAPFIRKNGLGLTVDSLAEIDSHLEQLTPEQYQAMRANA 309 Query: 304 KIISQKIRTGSYFRDVLEEVIDDLK 328 I QK+ TGS+ + L+ + K Sbjct: 310 YTIGQKLATGSHIKRGLDAAQEYFK 334 >UniRef50_C2F0P9 Galactofuranosyltransferase n=2 Tax=Lactobacillus reuteri RepID=C2F0P9_LACRE Length = 334 Score = 325 bits (833), Expect = 1e-87, Method: Composition-based stats. Identities = 94/330 (28%), Positives = 162/330 (49%), Gaps = 10/330 (3%) Query: 2 YFLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLE 61 Y ++ ++ + G KA+KD AS + V+++ ++ +QR + + L Sbjct: 3 YLISAIDPIKNSGGNKAKKDIDFFASQLNDTRVIHVKIYYTRLQRYLLTRLSIIKLVKTH 62 Query: 62 NKDVLIFNFPMAKPF--WHILSFFHRLLKFRIVPLIHDIDEL---RGGGGSDSVRLATCD 116 D I FP++ P+ + + ++ IHD+ L + V D Sbjct: 63 PADRYILQFPISTPYVLRQFIEVIQKYTNAKVDLFIHDLPALQLSMDDKERELVLFNQVD 122 Query: 117 MVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQR-GVIYAGNLSRHKC 175 +I HN M K+L Q + ++ +FDY ++ + D + + GNL++ Sbjct: 123 NLIVHNQAMKKWLVDNGVQTNMIELGLFDYDNEQPMQKKQEYDPANFTICFPGNLAKSTF 182 Query: 176 SFIYTEGCDFTLFGVNYENK--DNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSG 233 ++G N + ++ +Y G + + E FGLIWDG+S+ETCSG Sbjct: 183 LTKVNLSHQLNIYGPNKLDSYPESIRYCGQYTPE--ELPKHLTEDFGLIWDGNSIETCSG 240 Query: 234 AFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTI 293 FG+YLK+NNPHKTSLYLS +PV IWD+AALA I ++ +G + S+ E+ ++ S+T Sbjct: 241 TFGEYLKYNNPHKTSLYLSTGIPVIIWDQAALAPLIKESGVGICISSLTELDSVLLSLTN 300 Query: 294 ETYKQISENTKIISQKIRTGSYFRDVLEEV 323 E Y+ + + + QK+R G Y + L ++ Sbjct: 301 EQYQLMKRKAEKLGQKLRKGYYTKHALTKL 330 >UniRef50_C9LJY2 Putative uncharacterized protein n=1 Tax=Prevotella tannerae ATCC 51259 RepID=C9LJY2_9BACT Length = 353 Score = 324 bits (832), Expect = 2e-87, Method: Composition-based stats. Identities = 89/341 (26%), Positives = 158/341 (46%), Gaps = 19/341 (5%) Query: 3 FLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRII-----SSVKLSTFL 57 +++ R+ AG KA+ D DI + N+ L + I + T+ Sbjct: 9 YVSRNYKGRQGAGNKAKGDYEDILVQ---MGAHNLGLRRTYYKEYIAAFLTDLAGIVTYA 65 Query: 58 CGLENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDELRGGG---GSDSVRLAT 114 + DV+ +P K F + R + + IHD+ R + RL+ Sbjct: 66 LSVRKGDVVFLQYPTKKYFSFMC-RLARWREANSMAFIHDLGAFRRKKVTVKQEIRRLSN 124 Query: 115 CDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHK 174 D +I+ N M ++L + + + + DYL +S+ + T ++YAG++ K Sbjct: 125 ADYIIAANDTMAEWLKSHGLKRPCHGMGLHDYLSNSETVDKPATFPPHRIVYAGSIEERK 184 Query: 175 CSFIYTE-----GCDFTLFGVNYENK-DNPKYLGSFDAQSPE-KINLPGMQFGLIWDGDS 227 F+ + ++G N+ + + L + +P+ I FGL+WDGDS Sbjct: 185 NMFLTKLSGVIRHGEIHVYGSNHIAALKSTRNLILHEPMTPDNFIATAKGDFGLVWDGDS 244 Query: 228 VETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEI 287 + C+G FG+YL+ N PHK S YL LP+ IW ++ALAD + IG V I E+++ Sbjct: 245 LTACTGDFGEYLRINTPHKASFYLRAGLPLIIWSRSALADIVDREGIGITVDRIDEIEDH 304 Query: 288 VDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDDLK 328 ++S+T + ++I +N K +SQ + G R +E+ + +K Sbjct: 305 IESLTGQEIRKIRDNVKRVSQDLADGLSMRRAVEKAMCRIK 345 >UniRef50_A0Z7X9 Putative uncharacterized protein n=1 Tax=marine gamma proteobacterium HTCC2080 RepID=A0Z7X9_9GAMM Length = 348 Score = 323 bits (828), Expect = 5e-87, Method: Composition-based stats. Identities = 95/344 (27%), Positives = 170/344 (49%), Gaps = 20/344 (5%) Query: 3 FLNDLNFSRRDAGFKARKDALDIAS--DYENISVVNIPLWGGVVQRIISSVKLSTFLCGL 60 F++ +R A KA+ D D ++NI + + + L L Sbjct: 7 FISRNYKARFSAAGKAKIDCEDALEKNGFKNIGLPRATYTSTLPNFFWTLFGTILGLLRL 66 Query: 61 ENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDELRGGGG---SDSVRLATCDM 117 + +L+ +P K + I+ +L +++ +IHD+ R + L D+ Sbjct: 67 KRHSILVVQYPTKKYYDFIV-QIAKLKHCKVITIIHDLRSHRKQKMHVDKEMASLNKNDV 125 Query: 118 VISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQR-GVIYAGNLSRHKCS 176 VI+HN MT +L + K ++ IFDYL + + +++AG L + K Sbjct: 126 VIAHNSFMTAWLQDHGLTSKAVNLNIFDYLCELKASSTPTPPRDKFRLVFAGVLEKRKNG 185 Query: 177 FIYTEG------CDFTLFGVNYENKDNP-----KYLGSFDAQSPEKINLPGMQFGLIWDG 225 F+Y+ L+G+ + + + P Y G F A E ++ +FG++WDG Sbjct: 186 FLYSLDALNAKSFTCNLYGIGFNDSELPQDSIVTYQGVFPAD--EIVDRVEGEFGIVWDG 243 Query: 226 DSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQ 285 S++ C G+FG+YLK NNPHKTS+YL LP+ IWD+AA+A F+ D +G AV S+ ++ Sbjct: 244 TSLDECKGSFGEYLKINNPHKTSMYLRAGLPIIIWDQAAIATFVQDKNVGIAVASLAQVD 303 Query: 286 EIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDDLKT 329 E + S++ + Y+++ N + +SQ++ G++ +EE + L + Sbjct: 304 EALQSVSDDDYREMKRNAESVSQQLGEGAFLTAAVEEAMSQLAS 347 >UniRef50_C2E8T4 Possible galactofuranosyltransferase n=1 Tax=Lactobacillus ruminis ATCC 25644 RepID=C2E8T4_9LACO Length = 337 Score = 316 bits (809), Expect = 1e-84, Method: Composition-based stats. Identities = 84/337 (24%), Positives = 168/337 (49%), Gaps = 16/337 (4%) Query: 1 MYFLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGL 60 +Y + ++ DAG KA+ D + ++ E V+++ L +++ + L Sbjct: 6 LYSVLRSLRAKNDAGPKAKTDINEFLTE-EGFKVMDLDLPEKRLEKFLFVHLKLKRLFKG 64 Query: 61 ENKDVLIFNFPMAKPF--WHILSFFHRLLKFRIVPLIHDIDELRGGGGSDSVR------L 112 D +I +P F I+ ++ + + ++HD++ LR G+ Sbjct: 65 RQFDNVILQYPFYSVFLTKKIIENAKKVTHGKFLIMVHDVETLRVYDGNKQFEKDEMEIF 124 Query: 113 ATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSR 172 + D +I HN +M ++L ++ I + IFDY +D + + + Q+ + +AGNL + Sbjct: 125 NSADGLIVHNSKMAEWLKQHGVTVPITILGIFDYR--NDCQKNERFEYQKSICFAGNLEK 182 Query: 173 HKCSF-IYTEGCDFTLFGVNYENK--DNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVE 229 + ++G + K Y G + + N FGLIWDGD + Sbjct: 183 STFLKKVKLNDAKLDVYGPSPAQKYQKGVTYCGVYTPD--DLPNHLNENFGLIWDGDEMS 240 Query: 230 TCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVD 289 C+G FG+Y+++N PHKTSLYLS +PV IW +AA+A+F+ +N +G A+ ++ ++ ++ Sbjct: 241 ACTGVFGNYMRYNAPHKTSLYLSSGIPVIIWKEAAMAEFVSENEVGIAIENLNDLDNVLQ 300 Query: 290 SMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDD 326 + Y+++ N +++++R+GSY ++ + + + D Sbjct: 301 KVDDAGYRKMKSNALNLAERLRSGSYVKEAVRKALGD 337 >UniRef50_Q4JYT0 Putative glycosyl transferase n=1 Tax=Streptococcus pneumoniae RepID=Q4JYT0_STRPN Length = 354 Score = 315 bits (808), Expect = 1e-84, Method: Composition-based stats. Identities = 111/346 (32%), Positives = 183/346 (52%), Gaps = 21/346 (6%) Query: 1 MYFLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGL 60 +Y++++ S A KA D I D + +V + +V+ + KL L + Sbjct: 4 LYYIHEEFGSDSTAATKAPNDLQKIFQDCKFKPLVTLKKNSKIVRIFDYAFKLLLCLIRI 63 Query: 61 ENKDVLIFNFPMAK--PFWHILSFFHRLLKFRIVPLIHDIDELRGGGG-----SDSVRLA 113 + D++IF FP A + L + K +++ LI+D++ LR G S + Sbjct: 64 RSNDIVIFQFPFATHGKLKNFLMKLLQYKKAKMIFLINDLESLRYSGNKKNLISKEQYIK 123 Query: 114 TCDMVISHNPQMTKYLSKY-MSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSR 172 D++I HN +M ++L + + +KI + +FDYL+ D +++ + V+ AGNLS Sbjct: 124 NADVIICHNQRMKEFLIENKIDSEKIVVLGVFDYLL--DKFNKEKASFDKTVVIAGNLSP 181 Query: 173 HKCSFI-----YTEGCDFTLFGVNY----ENKDNPKYLGSFDAQSPEKINLPGMQFGLIW 223 K ++ F L+G N+ N D Y GSF + I FGL+W Sbjct: 182 QKSGYLTELLKNENRIKFNLYGPNFTSSTNNNDCVSYKGSFSPEKIPFI--LEGDFGLVW 239 Query: 224 DGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKE 283 DGDS+ TCSG G+YLK+NNPHK SL+++ ++PV IW ++AL+DF+ +N IG V + E Sbjct: 240 DGDSILTCSGITGEYLKYNNPHKVSLFIASKIPVIIWKQSALSDFVKENNIGIVVNDLIE 299 Query: 284 MQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDDLKT 329 MQEI+ +MT E Y+ EN + +S+K+R G + +E+ + +K Sbjct: 300 MQEIITNMTEEQYEIFRENIEQLSKKVRQGYFTNLAIEKSLSIIKN 345 >UniRef50_Q03GL2 Glycosyltransferase n=1 Tax=Pediococcus pentosaceus ATCC 25745 RepID=Q03GL2_PEDPA Length = 338 Score = 311 bits (797), Expect = 2e-83, Method: Composition-based stats. Identities = 95/337 (28%), Positives = 171/337 (50%), Gaps = 14/337 (4%) Query: 2 YFLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLE 61 Y L N + AG KA+KD I + SV +++ ++ ++ L + Sbjct: 4 YVLRITNGQKNTAGDKAKKDITSILNKQGFKSVEIRLRESKLIKLFTTNFMINKQLNNFK 63 Query: 62 NKDVLIFNFPMAKPF-WHILSFFHRLLKFRIVPLIHDIDELRGGGGSDSVR------LAT 114 D+ + +PM F I+ R + +IHD++ LR ++ L+ Sbjct: 64 KNDIFVIQYPMYSRFATKIILNKCEKKGIRTICVIHDLEALRLYKNDENKIAEEKAILSR 123 Query: 115 CDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHK 174 + +I HN +M ++L + + + ++IFDYL ++ + + +I+AGNL + Sbjct: 124 FNCLIVHNEKMREWLVEQDVKVPMVSLQIFDYLNDKELVK---VENKLNLIFAGNLEKSA 180 Query: 175 CSFIYTEGCDFTLFGVNYEN--KDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCS 232 + T+FGV+ + N Y G E FGLIWDG+S+ET + Sbjct: 181 FLEKWNLEKKITVFGVHPSDLYPHNVIYKGVKTPD--ELPKYLSGSFGLIWDGNSIETNT 238 Query: 233 GAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMT 292 G +GDY K+NNPHK SLYLS LPV +W KAA+++FIV N++G ++ S+ ++++ + + Sbjct: 239 GIYGDYTKYNNPHKVSLYLSSGLPVIVWKKAAISEFIVKNKLGISIDSLGDLEDSLSKIN 298 Query: 293 IETYKQISENTKIISQKIRTGSYFRDVLEEVIDDLKT 329 E Y + N + +++K+R G++ +E+ I+ +K+ Sbjct: 299 AEKYTNMVSNVEKMARKLRKGTFTTKAVEKAINLIKS 335 >UniRef50_D1PDY2 Putative galactofuranosyltransferase n=1 Tax=Prevotella copri DSM 18205 RepID=D1PDY2_9BACT Length = 351 Score = 308 bits (788), Expect = 2e-82, Method: Composition-based stats. Identities = 82/340 (24%), Positives = 158/340 (46%), Gaps = 16/340 (4%) Query: 3 FLNDLNFSRRDAGFKARKDALDIASDYE--NISVVNIPLWGGVVQRIISSVKLSTFLCGL 60 +++ +++ AG KA+ D + + N+ + + + + L Sbjct: 8 YISRDYYNQTSAGNKAKTDTEETLVEMGAINLGLHRTIKNSKIFAFFRNLAGIIRACILL 67 Query: 61 ENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDELRGGG---GSDSVRLATCDM 117 + D+L +P+ K F I + R + + LIHDI +R + RL+ D Sbjct: 68 KKGDILFLQYPIKKYFTFICT-VARFKGAKTISLIHDIGSIRTHRLTTQQEVKRLSHSDY 126 Query: 118 VISHNPQMTKYLSKYMSQDKIKDIKIFDYLVS--SDVEHRDVTDKQRGVIYAGNLSRHKC 175 +++ N +M ++L Q I+ + ++DY + H ++YAG + K Sbjct: 127 ILATNNKMKEWLISNNFQKPIEGLGLWDYRSPYFNKNSHPICNPGNISIVYAGAIHVRKN 186 Query: 176 SFI-----YTEGCDFTLFGVNYE---NKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDS 227 F+ + + ++G E +NP Q E I FGL+WDGDS Sbjct: 187 PFLIQLSKKLKTWNLIIYGKKEELTGWANNPLITFKGFVQPDEFIRTVKADFGLVWDGDS 246 Query: 228 VETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEI 287 ++TCSG FG+YLK+N PHK S YL LP+ IW +AA+ + + A+ ++ E+++ Sbjct: 247 LDTCSGIFGEYLKWNTPHKVSFYLRAGLPIIIWKQAAVTPILEKAGVCIAINTLSELEQK 306 Query: 288 VDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDDL 327 ++ ++ + ++ ENTK +++++ G + R L+ + + Sbjct: 307 LNELSSDELSKMKENTKRLAERLNQGFFLRQALDNYLSVI 346 >UniRef50_C2EVL7 Possible galactofuranosyltransferase n=1 Tax=Lactobacillus vaginalis ATCC 49540 RepID=C2EVL7_9LACO Length = 336 Score = 304 bits (778), Expect = 4e-81, Method: Composition-based stats. Identities = 92/338 (27%), Positives = 160/338 (47%), Gaps = 21/338 (6%) Query: 2 YFLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLE 61 Y L + + G KA++DA+ IS I + + ++ SS L L Sbjct: 7 YVLEWHDSEKNTGGVKAKQDAVTFLKKDGFIS---IEVPSSKLGKVWSSFWARYILRNLS 63 Query: 62 NKDVLIFNFPMAKPFWHIL--SFFHRLLKFRIVPLIHDIDELRGG--------GGSDSVR 111 +++ +P KPF L + K +++ LIHD++ +R S+ Sbjct: 64 G--IIVIQYPSGKPFLRKLWLEAACKNKKLKVILLIHDLESIRFFNDSKYSDVRQSEFEF 121 Query: 112 LATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLS 171 +A D +++ N +M L K I + +DY + + + D QR + YAGNL Sbjct: 122 IAKADGLVALNERMKSLLVKGGIVKPITTLDAWDYDNKNPIIEK--KDYQRRICYAGNLR 179 Query: 172 RHKCSFIYTEGCDFTLFGVNYEN--KDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVE 229 + +FG N E + KY+G F Q + +GL+WDG S E Sbjct: 180 KALFLSDLKCKTSIYVFGPNSETTFSKSIKYMGQFSPQK--LPSHLNGDYGLVWDGVSSE 237 Query: 230 TCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVD 289 TC G +G YL++N PHK SLY+S LPV +WDKAA+A+F+ +G + ++ ++ ++ Sbjct: 238 TCKGMYGQYLRYNTPHKFSLYISSGLPVIVWDKAAIAEFVKKYNVGLTISNLNDIDNLLH 297 Query: 290 SMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDDL 327 S+ YK++ +N +++K+R G + + ++I + Sbjct: 298 SVPSSQYKELQKNVIKVAEKMRNGQFLTTAINDLIKKI 335 >UniRef50_Q4JZC8 Putative glycosyl transferase n=2 Tax=Streptococcus pneumoniae RepID=Q4JZC8_STRPN Length = 357 Score = 302 bits (773), Expect = 1e-80, Method: Composition-based stats. Identities = 114/354 (32%), Positives = 185/354 (52%), Gaps = 32/354 (9%) Query: 2 YFLNDLNFSRRDAGFKARKDALDIASDY--ENISVVNIPL-WGGVVQRIISS----VKLS 54 YF+ + AG KA D I+ + + I P V+Q++ Sbjct: 6 YFIKVEKDLKNTAGIKAPDDIEKISEELGMKEIRFPKFPFEKNKVIQKLWLFCVVGYNWI 65 Query: 55 TFLCGLENKDVLIFNFPMA--KPFWHILSFFHRLLKFRIVPLIHDIDELRGGGG------ 106 + L L+ DV+I+ PM + + + + + +IHD++ LR G Sbjct: 66 SLLWRLKKNDVVIYQHPMYGVRVANFAIPLLKKYKNIKFISVIHDLESLRKGIQGVIEDN 125 Query: 107 ------SDSVRLATCDMVISHNPQMTKYLSKYMSQD-KIKDIKIFDYLVSSDVEHRDVTD 159 +D L+ D VISHNP+MT+YL + + +++IFDYL S++E + Sbjct: 126 ETTNAIADKELLSKFDKVISHNPKMTEYLEGIGIKKENLVELQIFDYLDPSEIEEKI--- 182 Query: 160 KQRGVIYAGNLSRHKCSFIYTE-----GCDFTLFGVNYENKDNPKYLGSFDAQSPEKIN- 213 + GV+ AGNL++ K S+IY LFG N+ N++ P+ + F + P K+ Sbjct: 183 -EDGVVIAGNLAKGKSSYIYKLLENELNFKLNLFGPNFINEELPENVEYFGSLPPNKLPQ 241 Query: 214 LPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNR 273 +FGL+WDGDS+ETCSG G+YLK+NNPHKTSLYL+ +PV IW +AALA FI +N Sbjct: 242 KLVGKFGLVWDGDSLETCSGNTGNYLKYNNPHKTSLYLASGIPVIIWKEAALAQFIEENN 301 Query: 274 IGYAVGSIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDDL 327 +G V ++ E++ ++ +++ Y I NT + +K+R G ++R + + +D Sbjct: 302 VGITVNNLSEIEFVMQNISEGEYLSIKRNTMQLGEKLRNGYFYRQAISKCKNDF 355 >UniRef50_C0WVC5 Possible galactofuranosyltransferase n=2 Tax=Lactobacillus fermentum RepID=C0WVC5_LACFE Length = 350 Score = 301 bits (770), Expect = 3e-80, Method: Composition-based stats. Identities = 102/351 (29%), Positives = 163/351 (46%), Gaps = 30/351 (8%) Query: 2 YFLNDLNFSRRDAGFKARKDALDIASDYENISV------------VNIPLWGGVVQRI-- 47 Y ++ + S G KA D + + + LW + RI Sbjct: 4 YIISLKDPSGNVGGPKANMDNIKFLKEQMGFKELWLDYGWKDGFWWHHNLWDWINTRIHK 63 Query: 48 --ISSVKLSTFLCGLENKDVLIFNFPMA--KPFWHILSFFHRLLKFRIVPLIHDIDELRG 103 +S L F + D ++ +P+ K I H+ ++ +IHD + +R Sbjct: 64 YQLSRSVLPKFFKEHPDIDNVVIQYPLYSNKLIKQITDSVHQNSHAKLYFIIHDAEMIRL 123 Query: 104 GGGS------DSVRLATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDV 157 + D +I HN +M K+L + + + D+ IFDY ++ Sbjct: 124 YADEPKRAQGELDSFNLSDGIIGHNAKMNKFLKEQGVKVPLVDLGIFDYDNPQPLQEYKG 183 Query: 158 TDKQRGVIYAGNLSRHKCSFIYTEGCDFTLFGVNYENKDN--PKYLGSFDAQSPEKINLP 215 + V YAGNL + F LFG N N Y G F + Sbjct: 184 --YDKSVCYAGNLIDAEFLQDVHPTNRFDLFGPNPAESYNEGLNYKGQFSPT--DLPAHM 239 Query: 216 GMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIG 275 FGL+W G SV+TC G FG YLK+NNPHKTSLYLS LPV IWD+AALADF+++N +G Sbjct: 240 DENFGLVWHGTSVDTCDGVFGRYLKWNNPHKTSLYLSSGLPVIIWDQAALADFVLENGVG 299 Query: 276 YAVGSIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDD 326 + S+ ++ + +D++T E Y+Q+ +N + ++ ++RTG Y +E++I++ Sbjct: 300 ITISSLNDLNDKLDALTEEEYRQMHDNVQKVANQMRTGYYITHAMEKMINN 350 >UniRef50_D0RXG2 Galactofuranose transferase n=13 Tax=Streptococcus RepID=D0RXG2_9STRE Length = 351 Score = 298 bits (764), Expect = 1e-79, Method: Composition-based stats. Identities = 109/352 (30%), Positives = 172/352 (48%), Gaps = 31/352 (8%) Query: 2 YFLND---LNFSRRDAGFKARKDALDIA--SDYENISVVNIPLWGG----VVQRIISSVK 52 Y+L D N ++AG KAR D I YE + + + W Q+ Sbjct: 3 YYLKDSFLHNEHEKNAGSKARNDVEAILISEGYEGLEL-KVENWYKMNFFKAQQHKYRAT 61 Query: 53 LSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLK--FRIVPLIHDIDELRGGGG---- 106 + L D L+ FP+ + I + K + LIHD++ LR G Sbjct: 62 -KSVFDQLGAGDELVIQFPIIHHTFFISQLIKQAQKRGAKFYLLIHDVETLRHAAGSEVK 120 Query: 107 ---------SDSVRLATCDMVISHNPQMTKYLSKYMSQ-DKIKDIKIFDYLVSSDVEHRD 156 + L + D +I HN M K L DK+ ++IFDYL+ + E + Sbjct: 121 FRHKVRNYFQEKKALMSVDGIIVHNDIMKKVLVGQGVPADKMASLEIFDYLIPN-FEVQA 179 Query: 157 VTDKQRGVIYAGNLSRHKCSFIYTEG--CDFTLFGVNYENKDNPKYLGSFDAQSPEKINL 214 + K + +I AGNL+ K ++Y + L+GV Y+ K F + P+ + Sbjct: 180 LPQKDQPIIVAGNLNPAKSGYLYNLPDQPAYNLYGVGYDESRALKNTSYFGSFMPDDLPA 239 Query: 215 -PGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNR 273 FGL+WDGDS ETC G++G+YL+FNN HK SLYL+ PV +W ++ALA FI++ Sbjct: 240 ALEGSFGLVWDGDSSETCQGSYGNYLRFNNSHKASLYLASGFPVVVWKESALAHFILEKS 299 Query: 274 IGYAVGSIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVID 325 G AV S+ +++ +++++T + Y +SEN K I + +R G Y R L+++ D Sbjct: 300 CGIAVASLHDLEAVLENLTEKEYADLSENAKRIGKDLREGYYLRSALKKLND 351 >UniRef50_C0BRQ3 Putative uncharacterized protein n=2 Tax=Bifidobacterium RepID=C0BRQ3_9BIFI Length = 354 Score = 298 bits (763), Expect = 2e-79, Method: Composition-based stats. Identities = 108/345 (31%), Positives = 165/345 (47%), Gaps = 19/345 (5%) Query: 2 YFLNDLNFSRRDAGFKARKDALDIASD-----YENISVVNIPLWGGVVQRIISSVKLSTF 56 Y + + + AG KAR D + +E N + +V + Sbjct: 4 YVICERSLHHAHAGSKARDDIRQVLESQSWQPFEVRPGENKGYFDKLVCVGRTLAVWHRL 63 Query: 57 LCGLENKDVLIFNFP--MAKPFWHILSFFHRLLKFR---IVPLIHDIDELRGGG--GSDS 109 + DV++ FP M R +K R V LIHD++ LRG D Sbjct: 64 ERTVRCGDVVLVQFPLIMYNKVSLYALPSVRRMKARGALFVFLIHDLETLRGYSYTDFDK 123 Query: 110 VRLATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGN 169 + D++ISHNP+M++ L KY + I +I IFDYL+ + +++ G+ AGN Sbjct: 124 QWVTEADLLISHNPRMSEVLRKYGATVPIVEIGIFDYLLPQ--ANPVPMEQRHGIDIAGN 181 Query: 170 LSRHKCSFIYTE-----GCDFTLFGVNYENKDNPKYLGSFDAQSPEKINLPGMQFGLIWD 224 LS K ++Y D L+G Y+ ++ E + +FGLIWD Sbjct: 182 LSHGKAEYVYRLAERFPKADINLYGPKYDRRNGKTAWYRGIVAPDELPDKLEGRFGLIWD 241 Query: 225 GDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEM 284 GDS++TC G +G YL NNPHK SLYL+ + PV IW+KAALA F+V+ +G AV S++E Sbjct: 242 GDSLDTCGGYYGKYLTVNNPHKLSLYLAADKPVIIWNKAALAPFVVEQGVGVAVESLQEA 301 Query: 285 QEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDDLKT 329 + MT Y ++ + QK+R G + R+V+ +V L + Sbjct: 302 MAVEYGMTQSEYARMVRRASQLGQKLREGWFTREVMAKVQAVLPS 346 >UniRef50_B0BR56 Glycosyltransferase n=1 Tax=Actinobacillus pleuropneumoniae serovar 3 str. JL03 RepID=B0BR56_ACTPJ Length = 349 Score = 298 bits (763), Expect = 2e-79, Method: Composition-based stats. Identities = 108/346 (31%), Positives = 177/346 (51%), Gaps = 28/346 (8%) Query: 2 YFLNDLNFSRRDAGFKARKDALDI-ASDYENISVVNI-----PLWGGVVQRIISSVKLST 55 Y + +L+ AG KA +D +I S +VV L +++++I + Sbjct: 4 YQIVELSTEHNHAGSKAVQDVYEIALSMGYKANVVRTATSVDSLLAKILRQVIFFIDWLK 63 Query: 56 FLCGLENKDVLIFNFPMAKPF---WHILSFFHRLLKFRIVPLIHDIDELRGGGGSDSVR- 111 +E+ +++ P IL+ R+ K + + L+HD++ELR ++ + Sbjct: 64 IYFSIESNSIVLIQNPYYHKQLIRNWILNRLKRIKKVKFISLVHDVEELRKSLYNNYYKN 123 Query: 112 -----LATCDMVISHNPQMTKY-LSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVI 165 L+ D +I HN +M + + K S+DK+ + IFDYL S + +R + Sbjct: 124 EFETMLSLADSIIVHNDKMKSFFIKKGYSEDKLISLGIFDYLQKSV--DKKRVSFERAIS 181 Query: 166 YAGNLSRHKCSFIYTEG----CDFTLFGVNYENK----DNPKYLGSFDAQSPEKINLPGM 217 AGNL K S+I G L+G N+E+ N +Y GSF A E Sbjct: 182 VAGNLDIKKSSYIAQLGSLPAIKAHLYGPNFEHSLEAFPNIEYHGSFPAT--EIPQKLVS 239 Query: 218 QFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYA 277 FGL+WDG S+ETC+G FG+YL++NNPHK SLYLS +PV IWDKAA ADF+ + +G Sbjct: 240 GFGLVWDGQSIETCTGDFGEYLQYNNPHKLSLYLSSGMPVVIWDKAAEADFVKKHNVGLC 299 Query: 278 VGSIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEV 323 V S+ E+Q+ ++ MT + ++++ N + + + +G Y + + E Sbjct: 300 VSSLSELQDKLNVMTEQEFEEMVNNVEKQTACLISGEYTKKAISEA 345 >UniRef50_D0BKT6 Galactofuranosyltransferase n=1 Tax=Granulicatella elegans ATCC 700633 RepID=D0BKT6_9LACT Length = 331 Score = 298 bits (762), Expect = 2e-79, Method: Composition-based stats. Identities = 104/341 (30%), Positives = 170/341 (49%), Gaps = 37/341 (10%) Query: 2 YFLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLE 61 Y+L + + AG KAR DA I + + + L Sbjct: 3 YYLKENYAKAKHAGSKARLDAEKIMVEAGYAP---------------YFLNNHSNAVPLT 47 Query: 62 NKDVLIFNFPMA--KPFWHILSFFHRLLKFRIVPLIHDIDELRGGG-------------- 105 DV++ FP+ IL+ F + KF+ LIHDI+ LR Sbjct: 48 KDDVIVLQFPLLWQSLKKQILTRFLKNRKFKAYLLIHDIESLRNRKIKTVKDFKHSIIYF 107 Query: 106 GSDSVRLATCDMVISHNPQMTKYLSK-YMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGV 164 + L D +I+HN +M L + + ++KI +++FDY++ E ++ V Sbjct: 108 LQNKTVLEKVDGIIAHNDKMKAELVRLGIPEEKIVALEMFDYVIPHYEEK--TAYEKNTV 165 Query: 165 IYAGNLSRHKCSF--IYTEGCDFTLFGVNYENKDNPKYLGSFDAQSPEKI-NLPGMQFGL 221 I AGN K + +F+++G+N+E + PK + A SP+++ + FGL Sbjct: 166 IVAGNFDIRKTKYARQLPGNPEFSIYGINFEEEHLPKNVHYKGAFSPDELSHHLQGGFGL 225 Query: 222 IWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSI 281 +WDGDS TCSG +G+YLK NNPHK SLYL+ P+ +W ++ALADF+ N+ G V S+ Sbjct: 226 VWDGDSPHTCSGMYGEYLKMNNPHKASLYLASGFPIIVWSQSALADFVRQNKCGILVDSL 285 Query: 282 KEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEE 322 E+ E ++S++ Y+++ +N+K I +KIR G + + LE+ Sbjct: 286 FEIAESLESLSENDYQEMIKNSKRIGKKIRNGIFLKTALEK 326 >UniRef50_Q4JYV0 Putative glycosyl transferase n=2 Tax=Streptococcus pneumoniae RepID=Q4JYV0_STRPN Length = 356 Score = 297 bits (760), Expect = 4e-79, Method: Composition-based stats. Identities = 105/354 (29%), Positives = 180/354 (50%), Gaps = 36/354 (10%) Query: 2 YFLNDL---NFSRRDAGFKARKDALDIASDYENISV-----VNIPLWGGVVQRIISSVKL 53 YF+ + +++AG KAR+D DI + +N VQR++ K+ Sbjct: 3 YFVEETLLDEQDKKNAGGKARQDVTDILESIGYQKLIAESEMNERQELNAVQRLVHHYKV 62 Query: 54 STF----LCGLENKDVLIFNFPMAKPFWHILSFFHRLLK--FRIVPLIHDIDELRGG--- 104 L + D +I FP+ +L K ++ LIHD++ LR Sbjct: 63 KKMWKKTLSVVGKGDEVIIQFPLLNHSLFFNQVIKQLSKNGVKVYFLIHDLESLRWSQSK 122 Query: 105 --GGSDSVRLA--------TCDMVISHNPQMTKYLSKYMSQD-KIKDIKIFDYLVSSDVE 153 +RL + +I+HN +M Y+ Y + KI ++ FDY++ S E Sbjct: 123 SISLKSRIRLNIEEHSVLRLSEGIIAHNKKMKSYIKTYSVESSKIIPLETFDYIIPSYHE 182 Query: 154 HRDVTDKQRG--VIYAGNLSRHKCSFIY--TEGCDFTLFGVNYE--NKDNPKYLGSFDAQ 207 +++ + Q ++ AGNL +HK ++Y +F L+G+ YE + + Y GSF + Sbjct: 183 RKNLDNFQLNAPIVIAGNLKQHKAGYVYHLPSNVEFNLYGIGYEQTDDKSVHYCGSFMPE 242 Query: 208 SPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALAD 267 E + FGL+WDG S E+C +G+YL+ NNPHKTSLYL+ +PV +W +AA+A Sbjct: 243 --ELPFVLKGSFGLVWDGPSSESCIETYGEYLRVNNPHKTSLYLASGIPVVVWSEAAIAS 300 Query: 268 FIVDNRIGYAVGSIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLE 321 FI +N G V ++ E+ E++ +T++ Y+ + +NT+II +++R G Y + ++ Sbjct: 301 FIKENNCGILVSNLSELPELLSMITVDEYELMKKNTEIIGERLRQGFYTKQAVK 354 >UniRef50_Q1WU31 Galactofuranosyltransferase n=2 Tax=Lactobacillus salivarius RepID=Q1WU31_LACS1 Length = 335 Score = 296 bits (758), Expect = 7e-79, Method: Composition-based stats. Identities = 104/328 (31%), Positives = 169/328 (51%), Gaps = 9/328 (2%) Query: 3 FLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLE- 61 ++ N S+ +AG KA+ D + Y+ G + + L L+ Sbjct: 5 IVSMYNKSQNEAGPKAKIDVENFLKIYDFKIQDFYFYGGRRAELVSYRQSLFDIPFRLKG 64 Query: 62 NKDVLIFNFP-MAKPFWHILSFFHRLLKFRIVPLIHDIDELRGGG---GSDSVRLATCDM 117 + IF +P + + + + ++ LIHD++ LR + L D Sbjct: 65 RYENAIFQYPALNERTNKAIMRNLKKNSQKVYILIHDLESLRFKNGGNNFELDLLNMSDG 124 Query: 118 VISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSF 177 VI+HN +M +L + I D++IFDY + ++ + + V YAGNL++ Sbjct: 125 VIAHNKKMIDWLRNNGVEVPIVDLEIFDYDNNIPLQENYI--FDKSVCYAGNLNKATFLK 182 Query: 178 IYTEGCDFTLFGVNYENKDNPKYLGSFDAQSPEKI--NLPGMQFGLIWDGDSVETCSGAF 235 Y TLFG N+ KY+ + SP+++ L FGLIWDGDS + CSG + Sbjct: 183 EYEPDFKLTLFGPNFSPALMSKYIEYKGSLSPDELAKELLTQNFGLIWDGDSSKGCSGIY 242 Query: 236 GDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIET 295 G+YLK+NNPHKTSLYLS +P+ IW +AALA+F+ N++G V ++ +++ I+D MT E Sbjct: 243 GEYLKYNNPHKTSLYLSSGMPIIIWREAALAEFVDKNKLGIVVDNLSQIKPILDKMTKEE 302 Query: 296 YKQISENTKIISQKIRTGSYFRDVLEEV 323 Y++I NT I+ K+R+G Y + + E+ Sbjct: 303 YQEIKSNTIKIAHKLRSGFYIKKAITEL 330 >UniRef50_B0N1W1 Putative uncharacterized protein n=1 Tax=Clostridium ramosum DSM 1402 RepID=B0N1W1_9FIRM Length = 358 Score = 294 bits (754), Expect = 2e-78, Method: Composition-based stats. Identities = 105/345 (30%), Positives = 170/345 (49%), Gaps = 23/345 (6%) Query: 3 FLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLW------GGVVQRIISSVKLSTF 56 FL++ G KARKD I ++ P + V Sbjct: 11 FLSEQQKKEYLGGAKARKDIDLILKQLGYKEIICRPCRDFSSPKNVINSLYSIQVNWIKI 70 Query: 57 LCGLENKDVLIFNFPMAKPFWHILSF--FHRLLKFRIVPLIHDIDELRGG---GGSDSVR 111 + + D+L+ +P K + + K + +IHD+ ++ + Sbjct: 71 KRIIRDNDILVIQYPFGKYDVNDRQIAKIKKTKKVDFIAIIHDLPSIQDKTADKLEEIKL 130 Query: 112 LATCDMVISHNPQMTKYLSKYMS-QDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNL 170 L D+VI HN +M + L + +K+ ++IFDYL + D++ R K G+ AGNL Sbjct: 131 LKKFDIVICHNKKMLEVLKELGIDNNKLVCLEIFDYLCNEDIKAR--VSKDDGITVAGNL 188 Query: 171 SRHKCSFIYT-------EGCDFTLFGVNYENKDNPKYLGSFDAQSPEKINLPGMQFGLIW 223 S K +IY E F L+G N+E + Y GS + E I +GLIW Sbjct: 189 SSSKAGYIYKLLDKCNEENIIFNLYGPNFERDNESSYNGSLPPE--ELIKKIKGSYGLIW 246 Query: 224 DGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKE 283 DGDS+E C+G FG+Y K NNPH+ S+ L+ ++P+ IW +AAL DF++DN IG A+ S+K Sbjct: 247 DGDSLELCNGTFGEYQKINNPHRVSMNLAAKMPILIWKEAALKDFVIDNNIGVAIDSLKN 306 Query: 284 MQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDDLK 328 +++I++S+ Y + +N + +S+KIR+G Y + + E I L+ Sbjct: 307 IKDILNSIKDSDYDIMRDNLESVSKKIRSGYYTKKAINEAILKLE 351 >UniRef50_C2EH03 Possible galactofuranosyltransferase n=1 Tax=Lactobacillus salivarius ATCC 11741 RepID=C2EH03_9LACO Length = 344 Score = 293 bits (751), Expect = 5e-78, Method: Composition-based stats. Identities = 101/340 (29%), Positives = 173/340 (50%), Gaps = 23/340 (6%) Query: 2 YFLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFL---- 57 Y L+ + + +AG KA++D I S E ++ NI + + I S + +L Sbjct: 12 YLLSVYDKTEYNAGPKAKRDISRILS--EKLNFKNIEFYFNLDNTIFSKINKIKYLNWDI 69 Query: 58 ---CGLENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDELRGGGGSDSV---- 110 + D + +P+ ++ ++HD++ LR ++ Sbjct: 70 PRKLKNKKIDNIFIQYPIYSTVVIKKILSSLDRDVKVYYIVHDLESLRLFKNDENYLSEE 129 Query: 111 --RLATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAG 168 RL D +ISHN MTK+L + + ++ D++IFDYL + + ++ + YAG Sbjct: 130 INRLNDADGIISHNSIMTKWLKENGVKTQVSDLEIFDYLTKNVAPESN--SYEKTLCYAG 187 Query: 169 NLSRHKCSFIYTEGCDFTLFGVNYENK--DNPKYLGSFDAQSPEKINLPGMQFGLIWDGD 226 NL K F+ E ++G N + + Y G F + E FGLIWDG+ Sbjct: 188 NLQ--KSDFLVNEFYPIDVYGPNPKKEYPKTVSYKGVFTPE--ELPKHLKENFGLIWDGN 243 Query: 227 SVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQE 286 ++ C+G +G+Y+K+NNPHK SLYLS LPV IW+KAALA+F+ +++G VGS+ ++Q Sbjct: 244 RIDECNGVYGEYMKYNNPHKVSLYLSSGLPVIIWEKAALAEFVSKHQVGIVVGSLAQLQN 303 Query: 287 IVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDD 326 + S+T E Y + N +++S+K++ G Y + +ID+ Sbjct: 304 KLGSLTEEEYLNLRYNAQLVSEKLKNGYYIVKAVSNLIDN 343 >UniRef50_C2FKS4 Possible galactofuranosyltransferase n=1 Tax=Lactobacillus plantarum subsp. plantarum ATCC 14917 RepID=C2FKS4_LACPL Length = 343 Score = 293 bits (749), Expect = 7e-78, Method: Composition-based stats. Identities = 84/335 (25%), Positives = 165/335 (49%), Gaps = 15/335 (4%) Query: 2 YFLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLE 61 Y ++ +AG KA++D I + +++ + V R++ ++++ + Sbjct: 4 YLVSTKLEKNNNAGSKAKQDIEAILFK-AGLEKLSLVIPTNRVGRVLYAIRIWKKVFNGL 62 Query: 62 NKDVLIFNFPMAKPFW--HILSFFHRLLKFRIVPLIHDIDELRGGGGSD------SVRLA 113 N+ +++ +P+ ++ + +IV ++HDI+ LR + L Sbjct: 63 NEGLIVVQYPLYSKVITKQLVKEAGKRPNVKIVAIVHDIESLRIDVNHEDAINTEIDLLN 122 Query: 114 TCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRH 173 D +I HN +M +L + + + FDYL V V +AGNL++ Sbjct: 123 GFDFLIVHNTKMKSWLIENGLTIPSEVLGAFDYLSDFSV--PIQRKSGNVVNFAGNLAKS 180 Query: 174 KCS-FIYTEGCDFTLFGVNYENKDNP-KYLGSFDAQSPEKINLPGMQFGLIWDGDSVETC 231 I + + ++G N + Y G + + + + +GL WDGDS+ TC Sbjct: 181 SFLTKITSTDVKYHIYGPNPQKYSTALAYKGIYSPEQLSEQFVS--GYGLAWDGDSITTC 238 Query: 232 SGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSM 291 SG +G+YLK NNPHK SLY+ LPV +WD +A++D++ N +G +V S+ E+ +I+ + Sbjct: 239 SGVYGEYLKINNPHKVSLYIRSGLPVIVWDDSAMSDWVQKNDLGLSVSSLAELGDIISGV 298 Query: 292 TIETYKQISENTKIISQKIRTGSYFRDVLEEVIDD 326 T Y+ +EN ++++Q+++ G Y R+ +++ + Sbjct: 299 TDHQYQIYTENARVVAQRMQQGLYIREAFTKLLKN 333 >UniRef50_C0YXY6 Possible galactofuranosyltransferase n=5 Tax=Lactobacillus RepID=C0YXY6_LACRE Length = 338 Score = 290 bits (742), Expect = 5e-77, Method: Composition-based stats. Identities = 90/325 (27%), Positives = 154/325 (47%), Gaps = 15/325 (4%) Query: 10 SRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRI-ISSVKLSTFLCGLENKDVLIF 68 +AG KA+ D + V+Q+ ++ + + FL + D + Sbjct: 13 HDNNAGPKAKIDIENFLLKDGFEKWNFTINQESVLQKAKVAYIDVPRFLAKQNDIDEIFL 72 Query: 69 NFPMA-KPFWHILSFFHRLLKFRIVPLIHDIDELRGGGGS------DSVRLATCDMVISH 121 +P K L + + +I+ +IHDI+ LR G + D +I H Sbjct: 73 QYPTYSKIVTKQLVKRLQQMNSKIILIIHDIESLRLHYGEKGYIDEELRVFNMADGLIVH 132 Query: 122 NPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCS-FIYT 180 N +M K+L ++ + +FDY + ++ ++ + V +AGNLS+ + Sbjct: 133 NAKMEKWLRDNGVTVPMESLGLFDY--DNKIKLASGSNYETSVCFAGNLSKAGFLEKLSL 190 Query: 181 EGCDFTLFGVNYENK--DNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDY 238 + +FG N K N Y G + E N FGL+WDG + TC G FG+Y Sbjct: 191 KRVKLNVFGPNPLEKYGANIVYKGQYPPD--ELPNYLKGNFGLVWDGTTPITCDGLFGNY 248 Query: 239 LKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQ 298 +KFNNPHK SLYLS +PV +W +AA+AD + IG V S+ E+ E++ +++ Y + Sbjct: 249 MKFNNPHKASLYLSSGIPVVVWRQAAIADLVEKMNIGIVVDSLNELDEVLPNVSSIDYSE 308 Query: 299 ISENTKIISQKIRTGSYFRDVLEEV 323 + N K +++K+R+G Y + + + Sbjct: 309 LVNNAKEVAEKLRSGFYIKTAISNL 333 >UniRef50_Q032N6 Glycosyltransferase n=1 Tax=Lactococcus lactis subsp. cremoris SK11 RepID=Q032N6_LACLS Length = 345 Score = 286 bits (733), Expect = 6e-76, Method: Composition-based stats. Identities = 105/340 (30%), Positives = 170/340 (50%), Gaps = 22/340 (6%) Query: 2 YFLNDLNFSRRDAGFKARKDALDIASDYENISVV-NIPLWGGVVQRIISSVKLSTFLCGL 60 Y++N L +AG KA DA +I S++ + + ++ + SV++ + L Sbjct: 4 YYINALQKENMNAGSKAVNDATEIFEKMGYESLLSKVNIKNIYLRTLFFSVQVMIRILFL 63 Query: 61 ENKDVLIFNFPMAKPFWHILSFF--HRLLKFRIVPLIHDIDELRGGGGSDSVRLATCD-- 116 ++ NFP F I +F R ++ LIHDI ELR G + + + Sbjct: 64 PKNTKVVSNFPPIFFFERICLYFLKKRSKSLKVFILIHDIYELRIGKNNSTPYRNLLNFK 123 Query: 117 ----MVISHNPQMTKYLSKYMSQDK-IKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLS 171 I+HN +M +L K + I D++IFDYL S ++ + VI AGNL+ Sbjct: 124 NSNFYFIAHNDKMVSWLVKEGYKKNNIIDLEIFDYL--SVIKEDAGGTYGKSVIIAGNLA 181 Query: 172 RHKCSFIYTE----GCDFTLFGVNY----ENKDNPKYLGSFDAQSPEKINLPGMQFGLIW 223 K S++ DF L+G N E N Y GSF A E N+ +GLIW Sbjct: 182 PEKSSYLMELFKISEIDFNLYGPNVSSDVEKSKNVIYHGSFPAD--EIPNIIQGSYGLIW 239 Query: 224 DGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKE 283 D ++ +G +G+Y ++NNPHKTSLYL+ P+ +W+KAALA FI+++ +G+ V +++E Sbjct: 240 DSETTIGGTGKYGNYQRYNNPHKTSLYLAAGFPIIMWEKAALASFIMEHNLGFLVNTLEE 299 Query: 284 MQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEV 323 + + + Y ++ EN + KIR G + + LE+ Sbjct: 300 IPSKIAKIKEVDYNRMRENVEKFGNKIRMGYFLTEALEKA 339 >UniRef50_D0R4M2 Putative glycosyltransferase n=1 Tax=Lactobacillus johnsonii FI9785 RepID=D0R4M2_LACJF Length = 349 Score = 279 bits (715), Expect = 8e-74, Method: Composition-based stats. Identities = 108/345 (31%), Positives = 175/345 (50%), Gaps = 20/345 (5%) Query: 2 YFLNDLNFSRRDAGFKARKDALDIASDYENISV-----VNIPLWGGVVQRIISSVKLSTF 56 Y + + ++AG KA D + IA + N V Q+I + Sbjct: 4 YQIVMKTAAGQNAGSKAPNDVVKIAEKLNFEKLFVNVHRNESALDKVKQQIEYKSNWKSV 63 Query: 57 LCGLENKDVLIFNFPMAKPF---WHILSFFHRLLKFRIVPLIHDIDELR------GGGGS 107 +E+ +L+ P+ H L K +++ ++HD++ELR Sbjct: 64 YSKIESNSILLLQVPIYVHQLSRIHFLKKIKSQKKVKLIFVVHDVEELRVAFNNNFQKKQ 123 Query: 108 DSVRLATCDMVISHNPQMTKYLSKYM-SQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIY 166 L D+++ HN M + K ++KI ++KIFDYL + D+ + + K+ VI Sbjct: 124 FEDMLKLADVIVVHNEVMANFFEKKGFPKEKIVNLKIFDYLYNFDLNKKVIFSKK--VII 181 Query: 167 AGNLSRHKCSFIYTEG---CDFTLFGVNYENKDNPKYLGSFDAQSPEKINLPGMQFGLIW 223 AGNL K ++ F L+G NY K++ K + E NL FGLIW Sbjct: 182 AGNLDEKKTEYLKKLDKIDAKFDLYGPNYVKKNSNKITYKGVVPANELPNLLDSGFGLIW 241 Query: 224 DGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKE 283 DG+S+ETCSG FG+YLK+NNPHK SLYL+ LPVFIW KAA A F+ +N +GY + S+ + Sbjct: 242 DGNSIETCSGYFGNYLKYNNPHKLSLYLTAGLPVFIWSKAAEAKFVDENHLGYTIDSLSD 301 Query: 284 MQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDDLK 328 + I++ +T+ Y ++ +N +++ +KI G + L + I+++K Sbjct: 302 IPLILERLTLADYNRLIKNVRLVGEKISRGDFMTVALTDAINNIK 346 >UniRef50_UPI000196CD65 hypothetical protein CATMIT_02517 n=1 Tax=Catenibacterium mitsuokai DSM 15897 RepID=UPI000196CD65 Length = 349 Score = 276 bits (706), Expect = 7e-73, Method: Composition-based stats. Identities = 101/341 (29%), Positives = 158/341 (46%), Gaps = 25/341 (7%) Query: 2 YFLNDLNFSRRDAGFKARKDALDIASDYENISV-VNIPLWGGVVQRIISSVKLSTFLCGL 60 Y+L S DA KA +D I VN+ G + + + L + Sbjct: 12 YYLQVTIDSNLDASTKAVQDCNKILHQNGFAPFEVNLYKSGNKYIKKVHNFLAFNHLNKI 71 Query: 61 ENKDVLIFNFPMA--KPFWHILSFFHRLLKFRIVPLIHDIDELRGGGGS--------DSV 110 + +L+ P+ K + IL + ++ LIHD+D LR + D Sbjct: 72 DEGALLVVPHPLYVNKRYIDILEKVKQKKHIKLAFLIHDLDSLRKLFLNAQDDFEYMDHK 131 Query: 111 RLATCDMVISHNPQMTKYLSKYMS-QDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGN 169 D +I+HN M +YL ++KI ++ IFDYL S + R V AGN Sbjct: 132 MYDISDYIIAHNDSMIEYLVSQGVAREKIHNLHIFDYLCDS----NNTIKFDRSVSIAGN 187 Query: 170 LSRHKCSFIYTEG----CDFTLFGVNYENK---DNPKYLGSFDAQSPEKINLPGMQFGLI 222 L K +++ F L+GV+ + N Y G+F E N FGL+ Sbjct: 188 LDEKKSNYLAKLKDIKAVHFDLYGVHLNEEILASNITYHGAFPPD--EINNQLYSGFGLV 245 Query: 223 WDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIK 282 WDG S+E C G G+YLK+NNPHK SLYL +PV IW +AA A F+ + +G V S+ Sbjct: 246 WDGSSIERCDGNTGEYLKYNNPHKLSLYLVSGIPVVIWKEAAEAKFVEEYGLGITVNSLD 305 Query: 283 EMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEV 323 E+ E S++ E Y ++ + ++S++++ G Y ++E+ Sbjct: 306 ELGEKFASLSEEEYFEMVKRVAVVSERLKNGYYLTQAIKEI 346 >UniRef50_C9A0R8 Putative uncharacterized protein n=1 Tax=Enterococcus gallinarum EG2 RepID=C9A0R8_ENTGA Length = 338 Score = 275 bits (703), Expect = 2e-72, Method: Composition-based stats. Identities = 74/326 (22%), Positives = 135/326 (41%), Gaps = 10/326 (3%) Query: 2 YFLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLE 61 + + + D+ KA+ D DIA + + + ++ G+ Sbjct: 4 WVTTIIESNAADSVKKAKADVCDIAKGMDYQPLYIYRYIDENEDDYALTSRIDGITAGVA 63 Query: 62 NKDVLIFNFPMAKPFWHILSFFHRLLKFRI--VPLIHDIDELRGGGGSDSVRL-ATCDMV 118 N+D++++ +P F R+ + I + IHD + LRG D L ++ Sbjct: 64 NQDMVVYQYPSYNGAHFDRMFLQRMKQRGIYTILFIHDAEMLRGKVDFDEAALFNEATLL 123 Query: 119 ISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFI 178 I H+ M L + K+ FDY ++ V++AGNL++ Sbjct: 124 IVHSQAMQTALVERGVIRKMVQKPFFDYRHKEV--SVSHERPEKRVVFAGNLAKTLFLQQ 181 Query: 179 YTEGCDFTLFGVNYENK--DNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFG 236 + + ++G + N Y G F+ + + FGL WD G + Sbjct: 182 WPNRTEILVYGEKNDRPFGANVHYCGVFEQEEL-IRKMEKNGFGLAWDD--KLPAGGDYQ 238 Query: 237 DYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETY 296 Y K+N PHK SLYLS+ +PV +W +AA+A+ + +G + I+E+ + +T E Sbjct: 239 QYTKYNAPHKISLYLSLGIPVIVWQQAAIAEMVQKLGLGIVIAGIEEIDHKLGELTDEEM 298 Query: 297 KQISENTKIISQKIRTGSYFRDVLEE 322 ++ N S +R+G + R L + Sbjct: 299 LRMKNNVLSFSCLLRSGIFTRTALVD 324 >UniRef50_C9LPN1 Galactofuranosyltransferase n=2 Tax=Veillonellaceae RepID=C9LPN1_9FIRM Length = 344 Score = 274 bits (701), Expect = 3e-72, Method: Composition-based stats. Identities = 84/340 (24%), Positives = 157/340 (46%), Gaps = 18/340 (5%) Query: 3 FLNDLNFSRRDAGFKARKDALDIA-SDYENIS-VVNIPLWGGVVQRIISSVK----LSTF 56 ++ + +++ G K +D + Y+ IS + +P + + L+T Sbjct: 6 YIREKAPNQQHGGNKGVEDINTVLGKKYDEISELYTLPGRRHLFDYARFFTRNWSNLNTI 65 Query: 57 LCGLENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDELRGGGGSDS--VRLAT 114 ++N D+LI +P + L+HD+D +R G D L Sbjct: 66 RKSVKNDDILIIQYPHYNFHALGEKLIDLFRIKNTILLVHDVDSVRYQTGIDEEIKLLNL 125 Query: 115 CDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHK 174 +V+ HN +M+ YL K+ + K +I IFDYL+ + + + +++AGNL + Sbjct: 126 AKVVLLHNQKMSDYLVKHGLKTKTVNINIFDYLLYNTPSQESFSFG-KQIVFAGNLGKSH 184 Query: 175 CSFIY---TEGCDFTLFGVNYEN----KDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDS 227 + + G +LFG + ++GS+ E FGL+WDG S Sbjct: 185 FLNLMGQDSLGLSLSLFGPGLSEEMKESSHVHWMGSYSPD--EIPFKLKGSFGLVWDGTS 242 Query: 228 VETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEI 287 ++ C G G Y+K N PHK +LY++ +PV W +AA+AD + +IG+ V S++E+ Sbjct: 243 LDECDGFMGRYMKINFPHKLALYIAAGIPVVTWSQAAIADIVKTYKIGFVVDSLREVSNY 302 Query: 288 VDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDDL 327 +DS+ + Y + +N + +K+ +G + E+ + + Sbjct: 303 IDSINEKEYAEYKKNILKLQKKVMSGYFTALAFEKAVTMI 342 >UniRef50_B0P5G1 Putative uncharacterized protein n=1 Tax=Clostridium sp. SS2/1 RepID=B0P5G1_9CLOT Length = 359 Score = 273 bits (699), Expect = 5e-72, Method: Composition-based stats. Identities = 108/357 (30%), Positives = 176/357 (49%), Gaps = 34/357 (9%) Query: 1 MYFLNDL---NFSRRDAGFKARKDALDIASDYENISVVNIPLWGG----VVQRIISSVKL 53 +Y + + S A KAR D DI D++ + +++++ S L Sbjct: 5 IYIVKEHICSGESEFTAAGKARIDVEDILYDWKAKDIKIKIKKNSNENSILKKLFSHYHL 64 Query: 54 ----STFLCGLENKDVLIFNFPMAKPFWHI--LSFFHRLLKFRIVPLIHDIDELRGGGGS 107 L L+ DVLI FP+ + F L + + + +IHD++ LR + Sbjct: 65 YQIWKKSLDKLKEGDVLIIQFPLQEGFIFASHLLKNLKKKNIKTIAVIHDLETLRITKDN 124 Query: 108 -------------DSVRLATCDMVISHNPQMTKYL-SKYMSQDKIKDIKIFDYLVSSDVE 153 + L ++ HN M K L K +S+D + +K+FDYL+ E Sbjct: 125 TISKKRKIRLYIEEIPTLKQFSKIVVHNQSMKKVLMKKGISEDSMVTLKMFDYLIKEGNE 184 Query: 154 HR-DVTDKQRGVIYAGNLSRHKCSFIYTE--GCDFTLFGVNYE--NKDNPKYLGSFDAQS 208 + K+ +I AGNLS +K ++Y F L+G+NY + KY GS+ S Sbjct: 185 LPGNTKSKENNIIIAGNLSSYKVGYVYELPHDVKFDLYGINYTGVTDNKIKYHGSYP--S 242 Query: 209 PEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADF 268 E +GL+WDGD+ +TCSG FGDYL+ NNPHKTSLYL+ +P+ W+KAA+A + Sbjct: 243 DELPWHLKGAYGLVWDGDTAKTCSGIFGDYLRINNPHKTSLYLACGIPIITWNKAAIAQY 302 Query: 269 IVDNRIGYAVGSIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVID 325 + NR+G V S+ E+ E + ++ + Y + +N K S+++R G Y + ++E + Sbjct: 303 VRKNRVGITVSSLDEINEKLKDVSKDEYNLMRKNAKKCSERVRKGYYLKKAIQEALS 359 >UniRef50_C7IU57 Putative uncharacterized protein n=1 Tax=Thermoanaerobacter ethanolicus CCSD1 RepID=C7IU57_THEET Length = 356 Score = 272 bits (695), Expect = 2e-71, Method: Composition-based stats. Identities = 99/354 (27%), Positives = 174/354 (49%), Gaps = 30/354 (8%) Query: 2 YFLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLE 61 YFL AG KA+ DA + ++ + ++ S K L + Sbjct: 4 YFLTLKLTENYTAGSKAKIDAEYFLYQSGFKKL-DLYEGRTKIHKLTSVFKKLRDLPLNK 62 Query: 62 NKDVLIFNFPMAKPF---WHILSFFHRLLKFRIVPLIHDIDELRGGGGS-----DSVRLA 113 K +++ ++P+ P I + R ++ +IHD++ LR + L Sbjct: 63 GKVIIVTHYPLLNPVALKIFIQALELRRCDITLIGIIHDVNSLRYQQNENAVQREIQFLN 122 Query: 114 TCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVE---------HRDVTDKQRGV 164 D +ISHN MTK+L + + KI+++++FDY + + ++ + + + Sbjct: 123 MFDFLISHNSAMTKWLVEQGFKGKIQELELFDYKIDGNKNIVKSEINRTEGELKENRYII 182 Query: 165 IYAGNLSRHKCSFIYTE------GCDFTLFGVNYEN----KDNPKYLGSFDAQSPEKINL 214 +AGNL K FIY+ F L+G N+ + + N Y G +++ Sbjct: 183 TFAGNLDPQKSGFIYSLENVNFSNLFFYLYGPNFVSNQISEKNIIYKGVYESNLLPL--Y 240 Query: 215 PGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRI 274 +GLIWDGDSV+TCSGA G+YLK+N+PHK SLY+ LPV IW KAA A+ + +I Sbjct: 241 LEGNWGLIWDGDSVKTCSGALGNYLKYNSPHKLSLYIVAGLPVIIWSKAAAAELVKKYKI 300 Query: 275 GYAVGSIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDDLK 328 G + S++E+ ++S++ E Y+ EN I++ K++ G + + ++I+ ++ Sbjct: 301 GIVIDSLEEIPVKLESISNEEYQNYRENVMILANKLKKGEFIIGAVNKIINSVE 354 >UniRef50_A8RK64 Putative uncharacterized protein n=1 Tax=Clostridium bolteae ATCC BAA-613 RepID=A8RK64_9CLOT Length = 349 Score = 269 bits (688), Expect = 1e-70, Method: Composition-based stats. Identities = 106/339 (31%), Positives = 164/339 (48%), Gaps = 29/339 (8%) Query: 9 FSRRDAGFKARKDALDIASD--YENISVVNIPLWGGVVQRIISSVKLS-TFLCGLENKDV 65 + +AG KA D L ++ + Y+ I + VQ IIS V + + L + D+ Sbjct: 12 NGQNNAGSKAGNDVLRVSQECGYKLIPLYESNQVRTRVQDIISGVIATYSLRNKLVDGDI 71 Query: 66 LIFNFPMAKPFWHILSFFHRLLK--FRIVPLIHDIDELRGGGGSDS----------VRLA 113 ++ +P+ + + + K RI LIHDID LR D L Sbjct: 72 VLMQYPLNRLLMKNIFRILKRCKSKIRIATLIHDIDYLRDIPLGDKGVDGMKVLELSLLG 131 Query: 114 TCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRH 173 + D +I HNP M + L K + + +FDYL D +++ + VI AGNL Sbjct: 132 SSDYLICHNPFMIRTLQKEKLSVEYISLDLFDYLY--DGTPATISEDKSTVIVAGNLLES 189 Query: 174 KCSFIYTEGCD-----FTLFGVNYE----NKDNPKYLGSFDAQSPEKINLPGMQFGLIWD 224 K ++Y D +L+G NY DN Y GSF E I +GL+WD Sbjct: 190 KAGYLYQIKKDKHKFALSLYGSNYAVDKMQMDNATYHGSFKPD--ELIANLYGAYGLVWD 247 Query: 225 GDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEM 284 G S ETCSG++G YL+ NNPHK SLY++ +PV IW +AAL I +N +G+ + S+ E+ Sbjct: 248 GSSTETCSGSYGKYLRINNPHKVSLYIAAGIPVVIWKEAALCSLIEENALGFGISSLDEL 307 Query: 285 QEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEV 323 +E + S Y+ N + +K+ +G + + VL ++ Sbjct: 308 EEALKSH-EHLYQSYRNNVLNMKEKVCSGGFLKYVLVQI 345 >UniRef50_C7XW37 Glycosyltransferase n=1 Tax=Lactobacillus coleohominis 101-4-CHN RepID=C7XW37_9LACO Length = 338 Score = 268 bits (685), Expect = 2e-70, Method: Composition-based stats. Identities = 84/335 (25%), Positives = 159/335 (47%), Gaps = 14/335 (4%) Query: 3 FLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLEN 62 L + + AG KA D I S+ S +++ + L N Sbjct: 6 LLTFKDIGKNHAGPKATHDIELILSNNGFKSKEFHLNLNSKIEKWYYAHFYFVKLFKNTN 65 Query: 63 KDVLIFNFPMAKPFWH--ILSFFHRLLKFRIVPLIHDIDELRGGGG------SDSVRLAT 114 D L+ +P+ + I+ F + ++ ++HD++ LR + L Sbjct: 66 IDELVVQYPVYSRYIIRAIIKNFRKYSNGKLYFIVHDLEGLRLYKDDSIFGIEEIEFLNL 125 Query: 115 CDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDK-QRGVIYAGNLSRH 173 D +++HNP M KYL + + KI + FDYLV+ +++ + + +AGNL + Sbjct: 126 VDGIVAHNPSMKKYLEEKGVKSKITCLDFFDYLVNEKNIYKNQKNNMNDRICFAGNLDKA 185 Query: 174 KCSFIYTEG-CDFTLFGVNYEN--KDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVET 230 + ++G+N + KD +Y G F I +FGL+WDGDS++ Sbjct: 186 PFINKMSLNSIKLDVYGINRSSLYKDGIEYKGVFPPDKLPLI--LNEKFGLVWDGDSIQC 243 Query: 231 CSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDS 290 C+G +G+Y+K+N+PHK SLYLS +P+ +W ++AL++ + +G +V ++K ++E++ Sbjct: 244 CNGTYGNYIKYNSPHKASLYLSAGIPIIVWKQSALSELVKKYNLGLSVNNLKNIEEVLHK 303 Query: 291 MTIETYKQISENTKIISQKIRTGSYFRDVLEEVID 325 + Y ++ N S+ I++G +E + + Sbjct: 304 IPNCEYNELKSNAIQYSKVIKSGQNIIRAIESLEN 338 >UniRef50_Q7P740 Nucleotide sugar synthetase n=1 Tax=Fusobacterium nucleatum subsp. vincentii ATCC 49256 RepID=Q7P740_FUSNV Length = 357 Score = 267 bits (684), Expect = 3e-70, Method: Composition-based stats. Identities = 104/356 (29%), Positives = 181/356 (50%), Gaps = 36/356 (10%) Query: 2 YFLNDLNFSRRDAGFKARKDALDIASDYENISV-----VNIPLWGGVVQRIISSVKLSTF 56 + + L+ + A KAR D +I + ++ +++I + + Sbjct: 4 FIVEKLSKLEKTAWSKARNDVEEILISEGYQPLEIFSNLDDRSNMSTIKKIRAHFHMKKI 63 Query: 57 ----LCGLENKDVLIFNFPMAKPFWHILSFFH--RLLKFRIVPLIHDIDELRGGGGS--- 107 L L++ D + F FP+ + + +L +IV LIHD++ +R Sbjct: 64 WEKKLSVLKSGDSIFFQFPVVHNSIFLHNILKRLKLKGIKIVVLIHDMESIRLISEKSLS 123 Query: 108 ----------DSVRLATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDV 157 + L +I+ N M K+L ++ +++IFDYL+S +VE + + Sbjct: 124 FLQKLRIKIEEFEFLKASSYLITPNKYMRKFLEDKNITIQMGELEIFDYLISEEVEEKIL 183 Query: 158 TDK---QRGVIYAGNLSRHKCSFIY--TEGCDFTLFGVNY-----ENKDNPKYLGSFDAQ 207 K + ++ AGNLS+ K +++Y +F L+GVNY +N Y GS+ A Sbjct: 184 EKKVSSKNSIVIAGNLSKEKSAYVYLLPTNLNFELYGVNYIEDKDSQSENINYNGSYMAD 243 Query: 208 SPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALAD 267 + +FGL+WDG S+ETC G +G YL +NNPHK SLYL E+P+ IW+KAALA Sbjct: 244 K--LPAVLNGKFGLVWDGSSIETCKGGYGKYLMYNNPHKVSLYLVSEIPIIIWEKAALAS 301 Query: 268 FIVDNRIGYAVGSIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEV 323 FI++N+IG+ + S+ ++ E + ++ E YK + +NT I SQ++ G Y + ++ ++ Sbjct: 302 FIIENKIGFTINSLNDINEKLKGLSDEEYKVMKQNTVIFSQRLSKGFYLKKIIRDI 357 >UniRef50_UPI0001968A2E hypothetical protein BACCELL_04078 n=1 Tax=Bacteroides cellulosilyticus DSM 14838 RepID=UPI0001968A2E Length = 355 Score = 266 bits (681), Expect = 7e-70, Method: Composition-based stats. Identities = 86/336 (25%), Positives = 167/336 (49%), Gaps = 15/336 (4%) Query: 2 YFLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCG-L 60 Y++ + +AG KA KD + + +V+ +P + ++I L LC + Sbjct: 16 YYIKFGISANPNAGSKAMKDIMALLDSKGYKAVLALPTRTNKIIKLIDIPILLFTLCFRV 75 Query: 61 ENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDELRGGGGSDSVR-----LATC 115 +++ P +L FF+R+++F+++ I+DI+ +R + +A Sbjct: 76 GRNGTVLYFVPSNFQRIKLLKFFNRIIRFKLICFINDIESMRMEKSKEYAHAEMNSIAVA 135 Query: 116 DMVISHNPQMTKYLS-KYMSQDKIKDIKIFDYLVSSDV----EHRDVTDKQRGVIYAGNL 170 D++++ N + L KY + + I I+DYL + + ++ ++ V +AGNL Sbjct: 136 DIILAPNDNSIQILQNKYHFTNHLVSIGIWDYLNNFEPIASEHTTNMVFNEKSVAFAGNL 195 Query: 171 SRHKCSF-IYTEGCDFTLFGVNYENKD--NPKYLGSFDAQSPEKINLPGMQFGLIWDGDS 227 ++ + + +F ++G N E K N +++G + N+ +GL+WDG S Sbjct: 196 NKAPFINELSSVNLNFKIWGSNTEEKKDRNIEFMGKKAPDELIE-NISQCTWGLVWDGIS 254 Query: 228 VETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEI 287 + TC G G YL+FNN HK LYL+ +PV +W+++ +A F+ ++G V S+ + +I Sbjct: 255 INTCCGLLGTYLRFNNSHKCGLYLAARVPVIVWEESGMASFVNKYKVGICVSSLHDAADI 314 Query: 288 VDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEV 323 ++ M + Y +N + I Q I G +F + LE+ Sbjct: 315 INCMDQKVYNIYKKNAQSIGQLISEGKFFLEALEKA 350 >UniRef50_C7G7A4 Putative uncharacterized protein n=1 Tax=Roseburia intestinalis L1-82 RepID=C7G7A4_9FIRM Length = 345 Score = 265 bits (677), Expect = 2e-69, Method: Composition-based stats. Identities = 71/343 (20%), Positives = 150/343 (43%), Gaps = 22/343 (6%) Query: 1 MYFLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLC-G 59 +Y LN DA KA +D + +D + + ++P +++ L FL Sbjct: 5 IYVLNQRQDETFDAAGKAMRDVFSVLADKKAKIIWSVPKHCSKYLKLLDLPYLVLFLLFC 64 Query: 60 LENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDELRGGGGSDSV-----RLAT 114 ++ D + ++ P +L L K+RI+ I+D++ R G +D LA Sbjct: 65 VKKSDSVFYSIPENHLKIRLLKRLQLLKKYRIICFINDLNAFRYDGQNDGDPGEVRALAA 124 Query: 115 CDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSS-------DVEHRDVTDKQRGVIYA 167 D +++ N L K + + I+DY ++ ++ H + + + +A Sbjct: 125 ADKILAPNVNTVSMLKKNGISSDMIPVGIWDYRMNETQIAKIREISHAHKKENEVKIAFA 184 Query: 168 GNLSRHKCSFIY--TEGCDFTLFGVNYENKDNPKYLGSFDAQSPEKINLP----GMQFGL 221 GNL++ + + L+G + ++ G + +P M +GL Sbjct: 185 GNLNKSEFLSVMEIPSDVRMELWGKLDQEREKTLADGCYYHGILSSDEIPFAVAEMDYGL 244 Query: 222 IWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSI 281 +WDG + G G+YL++NN HK +LYL+ +PV +W ++ +A+F+ ++ G + + Sbjct: 245 VWDGSGKDEIEGGLGEYLRYNNSHKCALYLASGIPVIVWSRSGMANFVREHACGITIDRL 304 Query: 282 KEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVI 324 ++ + + + Y+++ E ++ K+ G Y ++ Sbjct: 305 GDLDQAIHT---ADYEKLKEAALAVAPKLWEGYYLSQAIDSAC 344 >UniRef50_C7TE97 Glycosyl transferase,galactofuranosyltransferase n=2 Tax=Lactobacillus rhamnosus RepID=C7TE97_LACRG Length = 338 Score = 259 bits (661), Expect = 1e-67, Method: Composition-based stats. Identities = 95/335 (28%), Positives = 156/335 (46%), Gaps = 27/335 (8%) Query: 7 LNFSRRDAGFKARKDALDIASDYENISVVNIPLW------GGVVQRIISSVKLSTFLCGL 60 N DAGFKAR D S+ I IP +QR+ + LS L Sbjct: 7 YNSKSFDAGFKARADV-KYFSNRMGIKTAEIPATRVNLKINRELQRLRAVRSLSK---KL 62 Query: 61 ENKDVLIFNFPMAKPFW-HILSFFHRLL---KFRIVPLIHDIDELRGGGGS-----DSVR 111 ++ +P+ PF S H+ L + + L+HD+ ++G + + Sbjct: 63 SADQSVLIQYPL--PFNSFDYSLLHKTLLKHNAKCIFLVHDLISVQGQAKNVSIKQEIEE 120 Query: 112 LATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLS 171 L D +I HN M +L K+ I FDY V + + + +++AGNL Sbjct: 121 LKRADFLIVHNQAMQNFLEDQGLSQKMATINFFDYRVDVE---PPIRSEVANIVFAGNLV 177 Query: 172 RHKCS--FIYTEGCDFTLFGVNYENKDNPKYLGSFDA-QSPEKINLPGMQFGLIWDGDSV 228 + K E + ++G + P+ + A S + +GL+WDG S Sbjct: 178 KSKFLKKLPQLEIFKWHVYGSGMTAEQFPESVVFHGAIDSGVLPSKLVDGWGLVWDGIST 237 Query: 229 ETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIV 288 + SG GDYL+ N+PHK SLYL+ LP+ +W ++ALA+ ++ +G AV ++ E++ ++ Sbjct: 238 DRISGVSGDYLRLNSPHKASLYLASGLPLIVWRESALANVVLQLGLGIAVDNLMEIEPVI 297 Query: 289 DSMTIETYKQISENTKIISQKIRTGSYFRDVLEEV 323 S++ ++I N +IISQKIR G +D LE + Sbjct: 298 KSLSHTQIEKIQTNVQIISQKIRNGGMLKDALESL 332 >UniRef50_A2RHU2 Putative galactofuranose transferase n=1 Tax=Lactococcus lactis subsp. cremoris MG1363 RepID=A2RHU2_LACLM Length = 344 Score = 258 bits (660), Expect = 2e-67, Method: Composition-based stats. Identities = 102/336 (30%), Positives = 168/336 (50%), Gaps = 26/336 (7%) Query: 9 FSRRDAGFKARKDALDIASD--YENISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVL 66 + AG KA+ D+ I D Y+++ +I + +I++ + L ++ V+ Sbjct: 6 PIDQTAGAKAKIDSDTIFKDSGYKSLFSHHIQTNKVYINKILNIILGIISLTFIKKGSVI 65 Query: 67 IFNFPM----AKPFWHILSFFHRLLKFRIVPLIHDIDELRGGGGSDSVR------LATCD 116 N+P K W+ L R+ K +++ LIHD+D +R + L D Sbjct: 66 TTNYPPNLIDKKIVWNYLYKIKRIKKIKLIILIHDLDFIRNNDNDSNQEKKYIEQLDVAD 125 Query: 117 MVISHNPQMTKYLSKYMS-QDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKC 175 +I HN +M L + +DK+ D+KIFDYL I AGNL K Sbjct: 126 AIIVHNTKMIDLLVEKGLSKDKLIDLKIFDYLADI---KSSGGSYGNKFIVAGNLDIQKS 182 Query: 176 SFIYT----EGCDFTLFGVNYE----NKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDS 227 ++ +G F L+G Y + D KY GSF ++S N+ +GL+WD + Sbjct: 183 KYLSKISKIDGIYFNLYGPGYNQNDYDSDKSKYYGSFPSES--IPNVIQGSYGLVWDSEE 240 Query: 228 VETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEI 287 + G +GDY ++NNPHKTSLYL+ PV +W+KAALA FIV+N +G+ V ++ E+ Sbjct: 241 LSGGVGPYGDYQRYNNPHKTSLYLAAGFPVVVWEKAALAPFIVENNLGFVVDNLDELPSK 300 Query: 288 VDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEV 323 ++ ++ + Y ++ N K I QKI +G + + L++ Sbjct: 301 IEEISEDEYNRMKLNVKEIGQKICSGYFLNEALKKA 336 >UniRef50_C3QC04 Galactofuranosyltransferase n=3 Tax=Bacteroides RepID=C3QC04_9BACE Length = 334 Score = 256 bits (655), Expect = 6e-67, Method: Composition-based stats. Identities = 85/313 (27%), Positives = 144/313 (46%), Gaps = 21/313 (6%) Query: 14 AGFKARKDALDIA--SDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFP 71 A KA +D IA + YE ++ ++ ++ +K+ L N L+ +P Sbjct: 25 ASVKAPQDIHKIALQNGYEEYPIILRGYKNKLLFIVVLFLKMIRLAINLPNGATLLIQYP 84 Query: 72 MAKPFWHILSFFHRLLKFR-IVPLIHDIDELRGG---GGSDSVRLATCDMVISHNPQMTK 127 P +L F LK + ++ L+HDI+ +R G ++ L+ D +I H P+M Sbjct: 85 SLNP--KMLYFIFPFLKKKYLITLLHDINSVREKGELSGFENKVLSNFDEIIVHTPEMQT 142 Query: 128 YLSKYM-SQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIY---TEGC 183 Y + + K + F Y+ D E R ++ + V +AGN+ + + + Sbjct: 143 YFEQRLRPGIKYHYLGCFPYIAVPDKEARQLS---KQVCFAGNIDKSVFFSDFVFENKDL 199 Query: 184 DFTLFGVNYEN---KDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLK 240 D ++G N K+ +Y G F P+ I +GL+WDGDS ETCSG +G YLK Sbjct: 200 DLIVYGSCSSNNAMKNKYEYKGVF---KPDMIGHLEGSWGLVWDGDSTETCSGTWGSYLK 256 Query: 241 FNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQIS 300 PHK SLY+ LP+ +W +A+A + +G V S+ E+ + +++ YK+ Sbjct: 257 IIAPHKFSLYVLAGLPLIVWKDSAMAKLVEMKNLGITVTSLSEISARISAVSDNDYKEYC 316 Query: 301 ENTKIISQKIRTG 313 N + G Sbjct: 317 ANILKFQPVLLKG 329 >UniRef50_B1MXC4 Glycosyltransferase n=3 Tax=Leuconostoc RepID=B1MXC4_LEUCK Length = 327 Score = 248 bits (633), Expect = 3e-64, Method: Composition-based stats. Identities = 68/317 (21%), Positives = 123/317 (38%), Gaps = 20/317 (6%) Query: 17 KARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPF 76 KA+ D IA + + ++++L + D+++ FP Sbjct: 19 KAKADYAHIADQSGWTVLPLARYNDARYDDATRTQFINSWLQQVNTSDIVLHQFPSYMSE 78 Query: 77 WHILSFFHRLL--KFRIVPLIHDIDELRGGGGS--DSVRLATCDMVISHNPQMTKYLSKY 132 + F L + + LIHDI+ LR + L D+VI H+ M L Sbjct: 79 KFEVQFAKTLKARQVKRAILIHDIEPLRLMKHPIWEFDLLNLYDIVIVHSQAMKVQLQSL 138 Query: 133 MSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIYTEGCDFTLFGVNY 192 + +FDYL S + +AG + + LFG Sbjct: 139 GVTSQFIIQPLFDYLGLS----YPFVSFSHEINFAGTFQKSPWLQQA-QNVHINLFGAKP 193 Query: 193 EN------KDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHK 246 + N Y G+ D + + I FGLIWD D + + Y K+N PHK Sbjct: 194 KKWRDTTFPANVTYKGNLDPE--QLIMAFRDGFGLIWDNDFEDK---TYKTYTKYNAPHK 248 Query: 247 TSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQISENTKII 306 SLY+ LP+ W ++A+ I + IG+ + + ++ + T + +N + + Sbjct: 249 ASLYIRAGLPLIAWRESAIGQIIAEQEIGFVIDKLNQLPAQLSETTAAQFNLWQQNMQPL 308 Query: 307 SQKIRTGSYFRDVLEEV 323 +Q++ +G + + L ++ Sbjct: 309 AQQLASGYFTKATLTQL 325 >UniRef50_B1I7N2 Nss n=10 Tax=Streptococcus pneumoniae RepID=B1I7N2_STRPI Length = 336 Score = 246 bits (628), Expect = 9e-64, Method: Composition-based stats. Identities = 68/317 (21%), Positives = 128/317 (40%), Gaps = 16/317 (5%) Query: 18 ARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFW 77 A+ IAS V + +L + + D+L+F P F Sbjct: 20 AQNAVQKIASQLGFREVGIYFYNIASDSPSEMNKRLDGIMASISIGDILVFQSPTWNGFE 79 Query: 78 HILSFFHRLL--KFRIVPLIHDIDELRGGGG----SDSVRL-ATCDMVISHNPQMTKYL- 129 F +L + +I+ IHD+ L D + + D++I + +M L Sbjct: 80 FDRLLFDKLKDMQVKIICFIHDVVPLMFDSNYYLMKDYMYMYNLSDVLIVPSERMKTRLM 139 Query: 130 SKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIYTEGCDFTLFG 189 + ++ KI ++D+ D+ K + + +AG+L R +++ +F Sbjct: 140 EEGLTTKKILVQGMWDH--PHDLSLYTPAFK-KELFFAGSLERFPDLQNWSQDTPLRVFS 196 Query: 190 VNYENKDNPKYLGSFDAQSPEKI--NLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKT 247 E + + L + E++ L FGL+W G Y N HK Sbjct: 197 NKGEASSSARNLSIEGWKKDEELLLELSKGGFGLVW---GTYQNDGESNQYYTLNISHKV 253 Query: 248 SLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQISENTKIIS 307 S YL+ +PV + + A FIVD +G+ S++E+ IVD M ++ Y++++ K S Sbjct: 254 STYLTAGIPVIVPSSLSTAKFIVDQGLGFVANSLEEVHAIVDKMNLQEYQEMTNRIKTFS 313 Query: 308 QKIRTGSYFRDVLEEVI 324 ++ G + + + + I Sbjct: 314 YLLKEGYFTKKLFVDAI 330 >UniRef50_A3CM54 Nucleotide sugar synthetase-like protein, putative n=7 Tax=Firmicutes RepID=A3CM54_STRSV Length = 334 Score = 245 bits (625), Expect = 2e-63, Method: Composition-based stats. Identities = 67/318 (21%), Positives = 130/318 (40%), Gaps = 19/318 (5%) Query: 18 ARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFW 77 A+ D +A + + S +L + + DV+I+ P Sbjct: 20 AQNDVTKLAKQLGFNELSFYFYDIYSDSQSELSRRLDGIMASVGYGDVVIYQSPTWNGRE 79 Query: 78 HILSFFHRLL--KFRIVPLIHDIDELRGGGG----SDSV-RLATCDMVISHNPQMTKYLS 130 +F +L + +++ IHD+ L + + D VI + QM L Sbjct: 80 FDQAFISKLKILQAKLITFIHDVPPLMFPSNYYLMPEYIDMYNQSDAVIVPSEQMRDKLV 139 Query: 131 -KYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIYTEGCDFTLFG 189 + ++ DKI +++D+ + K + +AG++ R ++ +F Sbjct: 140 AEGLTVDKILVQRMWDHPYDLPLHQPQFAPK---LYFAGSVERFPHLINWSYATPLEIFS 196 Query: 190 VNYENKDNPKYLGSFDA--QSPEKI-NLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHK 246 E + NP+ S+ PE + L GL+W VE +Y N HK Sbjct: 197 P--EEESNPEANVSYRGWVSRPELLLELSKGGLGLVW---GVEENPADEPEYYGLNISHK 251 Query: 247 TSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQISENTKII 306 ++ YL+ +PV + + A+ I D +G+ V S++E IV+++T E Y+ + E + Sbjct: 252 SATYLAAGIPVIVPSYLSNAELIRDRGLGFVVDSLEEASRIVENLTAEEYQAMVERVRKF 311 Query: 307 SQKIRTGSYFRDVLEEVI 324 S ++ G + + VL + + Sbjct: 312 SFLLKEGYFSKKVLVDAV 329 >UniRef50_C6Z1L9 Glycosyltransferase n=1 Tax=Bacteroides sp. 4_3_47FAA RepID=C6Z1L9_9BACE Length = 400 Score = 241 bits (616), Expect = 2e-62, Method: Composition-based stats. Identities = 86/352 (24%), Positives = 158/352 (44%), Gaps = 32/352 (9%) Query: 7 LNFSRRDAGFKARKDALDIASDYENISVV---------NIPLWGGVVQRIISSVKLSTFL 57 ++ ++ +A K R D + A + I + ++ + + L + Sbjct: 51 ISENKYNAASKPRNDTITTAIRLGFKPFIFNSRILGRSKIRYFRTILFWLNQILLLVSIC 110 Query: 58 CGLEN--KDVLIFNFPMAKPFWHILSFF---HRLLKFRIVPLIHDIDELRGGG----GSD 108 N V+ +P +H F + + + V L+HDI+ +R D Sbjct: 111 FRCRNIKDSVIFIQYPFIIFNYHFAKFILLIFKSRRCKFVVLLHDIETIRQKRIKPIKMD 170 Query: 109 SVRLATCDMVISHNPQMTKYL--SKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIY 166 + L D++I H QM + + K+ + FDYL S ++ D + +IY Sbjct: 171 RIILDLADVIIVHTHQMAEKISCIDKCPNSKLIKLAFFDYLSSIEMIGND-SAANINLIY 229 Query: 167 AGNLSRHKCS-----FIYTEGCDFTLFGV---NYENKDNPKYLGSFDAQSPEKINLPGMQ 218 AGNL + + L+G N N + +Y G F A + I Sbjct: 230 AGNLDKSLFLRRLQDVGFNNEFKMFLYGAYSDNIPNTEGVEYKGKFAADRFDSIE---GN 286 Query: 219 FGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAV 278 +GL+WDG+SV++C+G +G+YLK N+P K SLYL+ PV +W K+ALA ++ + ++G V Sbjct: 287 WGLVWDGESVDSCTGQYGEYLKINSPFKFSLYLAANRPVVVWSKSALASYVKEYKLGICV 346 Query: 279 GSIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDDLKTR 330 S+K++++ + S+TI+ I + S++I++G + + L + Sbjct: 347 DSLKDIEKTIKSLTIDELVNIQSSVYEYSKRIKSGKMLETAIFSSLKLLHEK 398 >UniRef50_C4ZG42 Putative uncharacterized protein n=1 Tax=Eubacterium rectale ATCC 33656 RepID=C4ZG42_EUBR3 Length = 345 Score = 239 bits (610), Expect = 9e-62, Method: Composition-based stats. Identities = 68/285 (23%), Positives = 121/285 (42%), Gaps = 22/285 (7%) Query: 52 KLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLKF---RIVPLIHDIDELRGGGGS- 107 +L + L D++IF +P + SF +++ + +++ + DI +L Sbjct: 52 RLDGIIAPLNYGDIVIFQYPSWIGVNYDESFVNKIKSYRDTKLIIFVQDIQKLMFDSEQA 111 Query: 108 ----DSVRLATCDMVISHNPQMTKYLSKYMS-QDKIKDIKIFDYLVSSDVEHRDVTDKQR 162 + L D++I + +M +YL + + + I+D + SD+ D R Sbjct: 112 ILDMEIKTLNKADLLILPSKKMHRYLKENGLDEKPVIYQTIWD--MPSDICFVDHA-VTR 168 Query: 163 GVIYAGNLSRHKCSFIYTEGCDFTLFGVN---YENKDNPKYLGSFDAQSPEKINLPGMQF 219 +AGN +R Y + N EN D+ + G F+ + L F Sbjct: 169 CFHFAGNYNRFPFLAEYHGKTPIYQYDANKPDRENDDSFCWKGYFEQEKL-MHELSKGGF 227 Query: 220 GLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVG 279 GL+W D F Y N P+K L+ +PV + F+ N +GYAV Sbjct: 228 GLVWSDDEY------FDRYYSMNQPYKLGTNLAAGIPVIVKRGCVHDKFVERNGLGYAVD 281 Query: 280 SIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVI 324 ++ E ++V S+T Y ++ N K I + I G+Y R +L++ I Sbjct: 282 TLDEADKLVQSITDAEYIELYRNVKNIQKLILDGAYTRKILQDAI 326 >UniRef50_A7HN15 Galactofuranosyltransferase n=1 Tax=Fervidobacterium nodosum Rt17-B1 RepID=A7HN15_FERNB Length = 350 Score = 239 bits (610), Expect = 1e-61, Method: Composition-based stats. Identities = 82/345 (23%), Positives = 149/345 (43%), Gaps = 22/345 (6%) Query: 2 YFLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLE 61 Y S+ +AG+KA+ D I + V RI S +L + Sbjct: 8 YVPYFHWDSKFNAGYKAKNDVEIIFESAKFKRVDIFKKASDSNSRIFSLSRLISLYLKRN 67 Query: 62 --NKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDELRGGGGSDSVR----LATC 115 N ++ F + + +IHDI+ +R D R + Sbjct: 68 FANNAIVFFQNGTGLDLLIAPALRKAFKNAKRCIVIHDIESIRLARSIDFTREKLVFSNF 127 Query: 116 DMVISHNPQMTKYLSKY-MSQDKIKDIKIFDYLVSSDVEHRDVTD-----KQRGVIYAGN 169 + H+ +M Y+ + + KI + +FDY++ + V R ++ + + +AGN Sbjct: 128 THAVCHSKKMADYIKEKLGYKGKIYILGLFDYILDTPVYERVMSKTLPSLGKYVISFAGN 187 Query: 170 LSRHKCSF-----IYTEGCDFTLFGVNYENKDN---PKYLGSFDAQSPEKINLPGMQFGL 221 LS+ + L+G Y+ +Y G F E FGL Sbjct: 188 LSKSTFLKKIIKEVNPLNYTVYLYGKGYDGDTKDGVLEYKGVFHPD--ELPYKIEGHFGL 245 Query: 222 IWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSI 281 +WDG+ V SG G YLK+N+PHK SLY+ LP+ +W ++A+ + + + IG+ V S+ Sbjct: 246 VWDGEEVNGISGTVGHYLKYNSPHKASLYIVSGLPLIVWKESAIYETVKEYNIGFGVNSL 305 Query: 282 KEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDD 326 KE+ EI+ ++ + Y+ ENT + +K+ +G ++++ ++ Sbjct: 306 KEIDEILSKVSEKDYQVWRENTIKLGKKLASGENVKEIINRILSK 350 >UniRef50_Q3DVD0 Nucleotide sugar synthetase-like protein n=9 Tax=Streptococcus agalactiae RepID=Q3DVD0_STRAG Length = 335 Score = 236 bits (603), Expect = 7e-61, Method: Composition-based stats. Identities = 65/285 (22%), Positives = 115/285 (40%), Gaps = 16/285 (5%) Query: 50 SVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLL--KFRIVPLIHDIDELRGGGGS 107 S ++ + GL D+++F P F +L RI+ +HDI L Sbjct: 52 STRMDGIIAGLGRGDIVVFQVPTWNSTEFDELFLDKLQAYGARIITFVHDIVPLMFESNF 111 Query: 108 -----DSVRLATCDMVISHNPQMTKYL-SKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQ 161 D+VI M YL K M+ K+ +++D+ V+ D+ + Q Sbjct: 112 YLLDRVIDMYNRSDVVILPTKAMHDYLIEKGMTTSKVLYQEVWDHPVNIDLPRPEC---Q 168 Query: 162 RGVIYAGNLSRHKCSFIYTEGCDFTLFGVN--YENKDNPKYLGSFDAQSPEKINLPGMQF 219 + + +AG++ R + E +G ++ N G D F Sbjct: 169 KVLSFAGDIQRFPFVNDWKENIPLIYYGDGSRLNSEANVHAQGWKDDVELMLSLSKRGGF 228 Query: 220 GLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVG 279 GL W D E Y + N +K S +L+ LP+ + DFI + +G+ V Sbjct: 229 GLCWSEDREELVER---RYSRMNASYKLSTFLAAGLPIIANHDISSRDFIKQHGLGFTVE 285 Query: 280 SIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVI 324 +++E E +++M ETY EN + I+ +R G + +L + + Sbjct: 286 TLEEAVEKINNMEKETYDSYVENVEKIATLLRNGYITKKLLIDAV 330 >UniRef50_C4Z1X5 Putative uncharacterized protein n=1 Tax=Eubacterium eligens ATCC 27750 RepID=C4Z1X5_EUBE2 Length = 240 Score = 232 bits (591), Expect = 2e-59, Method: Composition-based stats. Identities = 81/233 (34%), Positives = 131/233 (56%), Gaps = 13/233 (5%) Query: 106 GSDSVRLATCDMVISHNPQMTKYLSKYMSQ-DKIKDIKIFDYLVSSDVEHRDVTDKQRGV 164 D D VI+HN +M +YL ++ + KI ++ IFDYL + + ++ + + + Sbjct: 11 HIDETMYEIADYVIAHNSKMKRYLIEHGVEESKIYELGIFDYLTNINPNNKSI-RYSKTL 69 Query: 165 IYAGNLSRHKCSFIYTEG-----CDFTLFGVNYEN----KDNPKYLGSFDAQSPEKINLP 215 AGNL +K ++I +F L+G+N++ + Y G+F S E + Sbjct: 70 NIAGNLDANKSNYIRELNGVDKTINFNLYGLNFDKNVLTSEAIHYKGAFP--SDEIPSQL 127 Query: 216 GMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIG 275 FGL+WDG++ C+G G+YLK+NNPHK SLY+ LPV IW +AA A+F+ N +G Sbjct: 128 TEGFGLVWDGNTASCCAGNTGEYLKYNNPHKLSLYMVSGLPVVIWSQAAEAEFVKCNNVG 187 Query: 276 YAVGSIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDDLK 328 V SI++ D+++ Y ++ EN K +S K+R G Y R V++++I DLK Sbjct: 188 LVVDSIEDFSIKFDNLSENDYYKMVENAKNVSYKLRNGEYLRKVIQDIIKDLK 240 >UniRef50_Q04DG9 Glycosyltransferase n=1 Tax=Oenococcus oeni PSU-1 RepID=Q04DG9_OENOB Length = 306 Score = 231 bits (589), Expect = 3e-59, Method: Composition-based stats. Identities = 90/321 (28%), Positives = 149/321 (46%), Gaps = 32/321 (9%) Query: 14 AGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMA 73 KA++D IA NI + R +V LS + D+L+ +P Sbjct: 4 GADKAKEDFAKIAE--------NIGFGILKINRAEKTVDLSVI----KPGDLLVHQYPSY 51 Query: 74 KPFWHILSFFHRLLKF--RIVPLIHDIDELR----GGGGSDSVRLATCDMVISHNPQMTK 127 L+F L K R V LIHD + R L+T D +I+HN +MT Sbjct: 52 LGDQWELNFQKELKKVGSRTVILIHDFETFRIHDYKSKKIAFQVLSTADYLITHNKKMTN 111 Query: 128 YLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIYTEGCDFTL 187 L + I I++FDYL E T ++YAG+LS+ Y+ + Sbjct: 112 RL--FRINQNIFQIELFDYLSP---EKNKTTKIPTSLVYAGSLSKSSWIKNYSLKIPIDI 166 Query: 188 FG--VNYENKDNPKYLGSFDAQSPEKINL-PGMQFGLIWDGDSVETCSGAFGDYLKFNNP 244 FG + + YL P+++ + ++GL+WD D E + +Y K N+P Sbjct: 167 FGRLPKKWSLEKNDYLVLHKPIIPDQLPIFLNNKWGLVWDEDQ-EKNKTNYQNYQKINSP 225 Query: 245 HKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQISENTK 304 HK SLYL+ +PV +W+K+A+ F+++N+IG A+ ++ E+ + + I+ S+N Sbjct: 226 HKLSLYLAANIPVIVWEKSAITKFVLENKIGIAINNLAEIPDKIKKAEID-----SDNLD 280 Query: 305 IISQKIRTGSYFRDVLEEVID 325 +S+KIR G + +L ++I Sbjct: 281 NLSKKIRGGYFTEKLLRKIIS 301 >UniRef50_Q042V6 Glycosyltransferase n=4 Tax=Lactobacillus gasseri RepID=Q042V6_LACGA Length = 353 Score = 224 bits (571), Expect = 4e-57, Method: Composition-based stats. Identities = 70/333 (21%), Positives = 147/333 (44%), Gaps = 25/333 (7%) Query: 14 AGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIF---NF 70 AG K +D + I + V + R + L L ++ + Sbjct: 26 AGNKFPRDIISIFEKNDYTPVYIREGYVKK--RPWEFLNDVYQLIRLPRNSIVFYIDRVH 83 Query: 71 PMAKPFWHILSFFHRLLKFRIVPLIHDIDELRGGGGSDSVR------LATCDMVISHNPQ 124 P +++ R + ++ DID LR S + R L + +IS N + Sbjct: 84 P--NLSRNLVYSILRRKNIKSFSILEDIDPLRDKKMSTNDRKLGLESLNSNKGIISQNKK 141 Query: 125 MTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFI------ 178 MT++L + ++ D+LVS+ E + ++Y GNLS + F+ Sbjct: 142 MTRFLVNQGVRVTTVELSALDFLVSNYKEKKHKKSADTIIVYGGNLSSEQAGFLNHLPIS 201 Query: 179 -YTEGCDFTLFGVNYENKD---NPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGA 234 + ++G+ +K N Y G F A+ E I+ +GL+W+ D ++ Sbjct: 202 KSNNKIKYRVYGMGEMSKQLSSNAIYCGGFSAE--ESIDKLKGDWGLVWNNDGSKSNKSG 259 Query: 235 FGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIE 294 Y ++ PHK S+Y +P+ + K+A+ADF+++N+ G + +++E+++ +++++ + Sbjct: 260 QNSYYEYVCPHKLSMYAICGMPIIVGKKSAMADFVINNKCGIVINNLEEIEKKINAISQQ 319 Query: 295 TYKQISENTKIISQKIRTGSYFRDVLEEVIDDL 327 Y + +N I+ K+ G Y ++ + ++ + Sbjct: 320 EYLEYQKNISKIASKMALGFYTQNAIRKIEKKI 352 >UniRef50_C7TIE1 Glycosyl transferase, galactofuranosyltransferase n=2 Tax=Lactobacillus rhamnosus RepID=C7TIE1_LACRL Length = 338 Score = 220 bits (560), Expect = 7e-56, Method: Composition-based stats. Identities = 70/344 (20%), Positives = 131/344 (38%), Gaps = 30/344 (8%) Query: 2 YFLNDLNFSRRDAGFKARKDALDIA--SDYENISVVNIPLWGGVVQRIISSVKLSTFLCG 59 Y ++ L A KA+ D + + + W R ++ + Sbjct: 3 YVISPLQPDTDQATVKAKMDTAYFFGKVGFRELFLSRYVFWNDEHWR----SEILGIIAT 58 Query: 60 LENKDVLIFNFPMAK--PFWHILSFFHRLLKFRIVPLIHDIDELRG----GGGSDSVRLA 113 + DV+I+ P + I+ +HD++ LR G Sbjct: 59 VGKGDVVIYQIPTYAEPSVEKAVVELVHKQGALIIAFVHDVEYLRFPDSYDKGQVLSFFK 118 Query: 114 TCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRH 173 + D +I + + L+ + + Y + + YAGNL Sbjct: 119 SFDALIVGTQLVKEKLAADGVNIPMIPSGPWGYRQPI---AYRRPSFSKTLHYAGNLVDR 175 Query: 174 KCSFI--YTEGCDFTLFGV-------NYENKDNPKYLGSFDAQSPEKINLPGMQFGLIWD 224 K F+ + E ++G + D+ +YLGS+ + E +GLIWD Sbjct: 176 KAGFLQNFPENLHIKVYGSADGKTDLPFSLADSVEYLGSYRQE--ELALALNDGYGLIWD 233 Query: 225 GDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEM 284 D F Y + N HK SLYLS+ LPV ++ A+ ++ +N +G A+ S+ + Sbjct: 234 EDK----EHHFDPYARINMTHKFSLYLSLGLPVIACNQTAIGRYVSENGLGIAIDSLDNL 289 Query: 285 QEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDDLK 328 I++ +T + + +I + IS IR+G + + + + +K Sbjct: 290 GNIIEGVTEDDFNRIVDKVANISDLIRSGRHNQMAALQAVLAVK 333 >UniRef50_C0XA00 Possible galactofuranosyl transferase n=3 Tax=Lactobacillus RepID=C0XA00_9LACO Length = 337 Score = 209 bits (533), Expect = 8e-53, Method: Composition-based stats. Identities = 51/289 (17%), Positives = 108/289 (37%), Gaps = 21/289 (7%) Query: 49 SSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHR---LLKFRIVPLIHDIDELRGGG 105 S + + L D +IF P + + + IHD+ L Sbjct: 51 KSTRFDGIIASLSVGDTVIFQSPNWIAIEWDQALIDHVNIYPNVKKIIFIHDVIPLMFES 110 Query: 106 G-----SDSVRLATCDMVISHNPQMTKYLSKYMSQDK-IKDIKIFDYLVSSDVEHRDVTD 159 D++I + +M +L + ++K +D+ + + Sbjct: 111 NRYLLPQHIDYYNKADVLIVPSKKMYDFLRENGLKEKPYVVQHFWDH-YPCQINYFVTPQ 169 Query: 160 KQRGVIYAGNLSRHKCSFIY-TEGCDFTLFG---VNYENKDNPKYLGSFDAQSPEKINLP 215 + + +AGN + + +F +E+ N +++G + + Sbjct: 170 NNKVINFAGNADKFDFVNNWGNPRVKLQVFSDPCKKFED-QNLEFMGWKNDPILLEELRR 228 Query: 216 GMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIG 275 FGL+W + + +Y+ N +K S YL+ +P+ + K A+ I +IG Sbjct: 229 SGGFGLVWSEEPY------WSEYMTMNTSYKLSTYLAAGIPIIVNSKTPEAETIKRKKIG 282 Query: 276 YAVGSIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVI 324 S+ E Q V + + YK+I++N + ++ IR G + + L + + Sbjct: 283 IIADSLAEAQAKVLQVNDDEYKEITDNVESFAKLIREGYFTKKALADAV 331 >UniRef50_B1MVL6 Putative glycosyl transferase n=1 Tax=Leuconostoc citreum KM20 RepID=B1MVL6_LEUCK Length = 559 Score = 206 bits (525), Expect = 8e-52, Method: Composition-based stats. Identities = 70/324 (21%), Positives = 138/324 (42%), Gaps = 27/324 (8%) Query: 17 KARKDALDIASDYENISV-VNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKP 75 K R D A + + P + +L + + D +++ +P P Sbjct: 244 KPRNDVSKAAMAMGYTPIDFDTPYIDNEK---WMTAQLEKHCLQINSGDTVVWQYPKYSP 300 Query: 76 FW--HILSFFHRLLKFRIVPLIHDIDELRGGGGS--------DSVRLATCDMVISHNPQM 125 ++L++FH ++ IHDI+ LR + D + L++ D I Sbjct: 301 QLELNMLNWFHNR-GIKVASFIHDINLLREEPLNREHYLPEYDKILLSSFDANIVPEKFE 359 Query: 126 TK-YLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIYTEGCD 184 Y + I +K +D+++ V + + ++YAG+L++ + + Sbjct: 360 QALYSLANVKLKNIVALKPYDFIIQKPVLPATYS---QDIVYAGSLAKFPA--LEDIDFN 414 Query: 185 FTLFG-VNYENKD--NPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKF 241 T++G N+ + + NPK + + E + FGLIWD D A Y K+ Sbjct: 415 LTVYGEKNFSDVNFVNPKIIDGGFLPAEELASSLNNGFGLIWDEDRQNPYRQA---YTKW 471 Query: 242 NNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQISE 301 N P+K SLY+ LPV W ++A+A I +G+ V + ++ V S++ + +++ Sbjct: 472 NWPYKFSLYMVSGLPVIAWSESAIAKLIESENLGFIVTDLSQIASKVRSISQTEFNEMAA 531 Query: 302 NTKIISQKIRTGSYFRDVLEEVID 325 N I K+ G+ + L+++ + Sbjct: 532 NAAEIGNKLAHGNSTKTALKKLEN 555 >UniRef50_Q5ULS2 Orf42 n=1 Tax=Lactobacillus phage LP65 RepID=Q5ULS2_9CAUD Length = 337 Score = 196 bits (498), Expect = 1e-48, Method: Composition-based stats. Identities = 74/336 (22%), Positives = 130/336 (38%), Gaps = 20/336 (5%) Query: 2 YFLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLE 61 Y D + DA K RKD +I + +S S + + Sbjct: 5 YTELDSDCIAYDASVKPRKDIEEIVA-INFLSFPLSIPSLKYGDDRNSDEYMRKVASKVS 63 Query: 62 NKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDELRGGG---GSDSVRLATCDMV 118 DV++ P L ++ L+HDI+ RG L D + Sbjct: 64 KGDVVLIQTPAYIADEIGLVNKLHDRGAIVIGLVHDIEYARGFSTDFSDQYKLLKLYDGL 123 Query: 119 ISHNPQMTKYLSKYMSQD-KIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSF 177 + ++ + + I ++++ YL + VEHR + R + YAGNLSR F Sbjct: 124 VVTGHRIKAIIQESGISSIPITCMELWPYLTNYVVEHRIEPNNNR-IEYAGNLSRSNGLF 182 Query: 178 IYTEGC--DFTLFGV--NYENKDNPKYLGSFDAQSPEKIN-LPGMQFGLIWDGDSVETCS 232 + ++G + N + + A P+ + +GL+W D Sbjct: 183 SKSLEGIEHVDVWGKQVDRSNSEKTGLVVQHGAVHPDDLPARLYSGYGLVWYVDR----- 237 Query: 233 GAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMT 292 + DY K N HK SLYLS +LP+ + + L++ + +IG V + E+ E + ++ Sbjct: 238 -KYQDYTKINVSHKASLYLSAKLPLIVSSSSYLSELVDKYKIGICVDRLDEIPEKL--LS 294 Query: 293 IETYKQISENTKI-ISQKIRTGSYFRDVLEEVIDDL 327 Y + N + I I +GS F + +++ L Sbjct: 295 RNDYCKYVNNIEEHIYDSISSGSCFTEPFVDLMSKL 330 >UniRef50_Q03A82 Glycosyltransferase n=1 Tax=Lactobacillus casei ATCC 334 RepID=Q03A82_LACC3 Length = 208 Score = 194 bits (492), Expect = 5e-48, Method: Composition-based stats. Identities = 53/199 (26%), Positives = 92/199 (46%), Gaps = 7/199 (3%) Query: 130 SKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIYTEGCDFTLFG 189 + + I F Y D + + +++AGN++ K E +FG Sbjct: 3 KELDFKGPIIPQGPFSYRFIED-DDPVPPKFHKKIVFAGNINNSKYLSQVPEHWHLDVFG 61 Query: 190 VNYE----NKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPH 245 ++ N Y GSF E N FGL+WD DS + G +Y + H Sbjct: 62 GQPHQELLDRQNINYKGSFTPT--ELPNHFDGGFGLVWDSDSFDEVIGEPAEYNRLCYEH 119 Query: 246 KTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQISENTKI 305 K SLYL+ +PVFIW AA A+++ +N +G+AV ++ ++ I+++ T + Y + N Sbjct: 120 KLSLYLAKRMPVFIWKHAAAANWVTENHVGFAVENLADIWPIIENFTEDQYNAMQPNLAR 179 Query: 306 ISQKIRTGSYFRDVLEEVI 324 +S+ IR G + + + + Sbjct: 180 VSKLIRNGVFAKHAALDAL 198 >UniRef50_B5A7L9 Nucleotide sugar synthetase-like protein n=3 Tax=Streptococcus RepID=B5A7L9_STRPA Length = 330 Score = 191 bits (486), Expect = 2e-47, Method: Composition-based stats. Identities = 54/284 (19%), Positives = 111/284 (39%), Gaps = 20/284 (7%) Query: 50 SVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLL--KFRIVPLIHDIDELRGGGGS 107 S +L + GL + DV+IF P ++L +IV IHD+ L G Sbjct: 52 SKRLDGIVAGLRHGDVVIFQTPTWNTTEFDEKLMNKLKLYDIKIVLFIHDVVPLMFSGNF 111 Query: 108 -----DSVRLATCDMVISHNPQMTKYLSKYMSQ-DKIKDIKIFDYLVSSDVEHRDVTDKQ 161 D+V++ + +M L + K ++D+ + + + + Sbjct: 112 YLMDRTIAYYNKADVVVAPSQKMIDKLRDFGMNVSKTVVQGMWDHPTQAPMFPAGL---K 168 Query: 162 RGVIYAGNLSRHKCSFIYTEGCDFTLFG-VNYENKDNPKYLGSFDAQSPEKINLPGMQFG 220 R + + GN R + ++ N E N ++ + + FG Sbjct: 169 REIHFPGNPERFSFVKEWKYDIPLKVYTWQNVELPQNVH-KINYRPDEQLLMEMSQGGFG 227 Query: 221 LIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGS 280 L+W D + +Y +K +L+ +PV + + A + I +N +G+ V Sbjct: 228 LVWMDDKDK-------EYQSLYCSYKLGSFLAAGIPVIVQEGIANQELIENNGLGWIVKD 280 Query: 281 IKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVI 324 ++E V ++ + Y ++ +N + + +R G + R +L E + Sbjct: 281 VEEAIMKVKNVNEDEYIELVKNVRSFNPILRKGFFTRRLLTESV 324 >UniRef50_C4ZG41 Putative uncharacterized protein n=1 Tax=Eubacterium rectale ATCC 33656 RepID=C4ZG41_EUBR3 Length = 303 Score = 111 bits (279), Expect = 2e-23, Method: Composition-based stats. Identities = 60/330 (18%), Positives = 108/330 (32%), Gaps = 49/330 (14%) Query: 2 YFLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLE 61 Y + L R++A A + V++ + L Sbjct: 9 YNIGGLIGLRQNAVKNAG-------ETLGFKEMSLFKFPDTYDSDDELHVRMDGIIASLC 61 Query: 62 NKDVLIFNFPMAK---PFWHILSFFHRLLKFRIVPLIHDIDELRGGGG----SDSVRLAT 114 +D++IF P + + R +IV I +I R + L Sbjct: 62 PEDIVIFQHPSGESPRYDGFLFEHLRRYHGTKIVAFIQEIASDRDDSEYSLSDEITLLNR 121 Query: 115 CDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHK 174 DM I + + Y ++K YL+ + + D + N ++K Sbjct: 122 ADMFIFASAALRDYYIANGLKEK-------PYLIQN------IPDYMTDIC--ANEHKNK 166 Query: 175 CSFIYTEGCDFTLFGVNYENKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGA 234 +I E + +N N + +Y +S + L FGLIWD D Sbjct: 167 KLYIMAETSQ-NEYPLNNLNIEVVQYDEYHVTES--ILRLSDGGFGLIWDTDEQA----- 218 Query: 235 FGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIE 294 LY++ +PV + + ++ DN IG A +++ I S + + Sbjct: 219 ------------LGLYMAAGIPVIVKKGLSCEKYVTDNEIGAAATDFEDVYRIAVSESED 266 Query: 295 TYKQISENTKIISQKIRTGSYFRDVLEEVI 324 Q N K + G Y R +L + + Sbjct: 267 KLSQYYANVKKLQDLFINGIYTRKLLLDTL 296 >UniRef50_C2EWJ6 Putative uncharacterized protein n=1 Tax=Lactobacillus vaginalis ATCC 49540 RepID=C2EWJ6_9LACO Length = 146 Score = 76.1 bits (186), Expect = 1e-12, Method: Composition-based stats. Identities = 27/126 (21%), Positives = 45/126 (35%), Gaps = 12/126 (9%) Query: 153 EHRDVTDKQRGVIYAGN-LSRHKCSFI---YTEGCDFTLFGVNYENK--DNPKYLGSFDA 206 ++ + +AG+ + K F + +F + N +++GS Sbjct: 9 SGKEKPHYAPIINFAGDPTNPEKYGFGGTWFNPDVKLRVFTSPKDWGVGRNIEFVGSMPD 68 Query: 207 QSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALA 266 + FGLIW GD + +Y+K N K S YL+ LPV + Sbjct: 69 IALLNDIRRTGGFGLIWSGDPY------WLEYMKHNCSFKLSTYLAAGLPVIVNSGTPAR 122 Query: 267 DFIVDN 272 D I Sbjct: 123 DIIEKK 128 >UniRef50_C5RCT8 Possible transposase n=2 Tax=Lactobacillales RepID=C5RCT8_WEIPA Length = 307 Score = 68.8 bits (167), Expect = 3e-10, Method: Composition-based stats. Identities = 31/157 (19%), Positives = 52/157 (33%), Gaps = 11/157 (7%) Query: 69 NFPMAKPFWHILSFFHRLLKFRIVPLIHDIDELRGGGGSDSVRLATCDMVISHNPQMTKY 128 F L + K + + IH + G + L D++ + Sbjct: 157 QFQTEDSLDRFLVSQFNVYKEKSLKRIH--RGFKIGVDEEVALLNKFDLITLPSIAAENI 214 Query: 129 LSKYMSQDK-IKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIYTEGCD-FT 186 L K I FD+L + + V +AGN+S K F+ Sbjct: 215 LRKQGLIVPTIIQQGPFDFLTQAPEVSSIFSS---IVNFAGNISFSKVGFLRDINTPNIL 271 Query: 187 LFGVN--YENKDNPKYLGSFDAQSPEKINLPGMQFGL 221 +FG N + +N Y+G F + + I +GL Sbjct: 272 VFGSNLDFTLPNNVSYMGKF--DNDDLIPKLNSGYGL 306 >UniRef50_C5RCT9 Putative uncharacterized protein n=1 Tax=Weissella paramesenteroides ATCC 33313 RepID=C5RCT9_WEIPA Length = 84 Score = 60.3 bits (145), Expect = 8e-08, Method: Composition-based stats. Identities = 15/78 (19%), Positives = 40/78 (51%), Gaps = 1/78 (1%) Query: 249 LYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQISENTKIISQ 308 +YL+ + + + +++DN+ G + +I+ + + + S++ + Y ++ K Sbjct: 1 MYLAAGIVPIADHASNVGKWLIDNKCGITIPNIESLDDAIQSISRQEYDELEIAVKSQQN 60 Query: 309 KIRTGSYFRDVLEEVIDD 326 K+R G Y + L ++I+ Sbjct: 61 KVRQGYYTQK-LVKLINK 77 >UniRef50_B0VJD0 Putative uncharacterized protein n=1 Tax=Candidatus Cloacamonas acidaminovorans RepID=B0VJD0_9BACT Length = 377 Score = 60.0 bits (144), Expect = 1e-07, Method: Composition-based stats. Identities = 43/276 (15%), Positives = 81/276 (29%), Gaps = 57/276 (20%) Query: 82 FFHRLLKFRIVPLIHDIDEL-------RGGGGSDS--------VRLATCDMVISHNPQMT 126 + KF++V +H+ L R + D V + N ++ Sbjct: 88 ILKKRQKFKVVFDVHEFFALSFSERFPRFLRYPAYLFYQLSLKQLMKIADAVFTVNQEIC 147 Query: 127 KYLSKYMSQDKIKDI------KIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRH------- 173 L + + ++DY + + + IY G L+ Sbjct: 148 NQLLGRNKRIPSLVLPNYPVKNVWDYECNIPGSLEQLCQMKFDFIYTGGLTEDRGIYKIL 207 Query: 174 ------KCSFIYTEGCDFTLF-----------GVNYENKDNPKYLGSFDAQSPEKINLPG 216 K F + + F +N N + Y S+ + L Sbjct: 208 KVVSLLKHDFPFLKVLILGKFLKPETEKRFNQSINDYNLNAIIYYQSWIPAEKIGLLLKR 267 Query: 217 MQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGY 276 +FGL W L+ + P K YLS LPV + I N +G Sbjct: 268 CRFGL-W-------IFNPKNRRLRLSTPLKVLEYLSAGLPVITIKTPLMKALIEKNGVGI 319 Query: 277 A----VGSIKEMQEIVDSMTIETYKQISENTKIISQ 308 ++ + + ++ Y +S+ +S+ Sbjct: 320 CSPYQSKALADACAKMLKLSDNEYNAMSKKCLELSE 355 >UniRef50_A8TGW1 Glycosyl transferase group 1 n=2 Tax=Methanococcus voltae RepID=A8TGW1_METVO Length = 378 Score = 52.3 bits (124), Expect = 2e-05, Method: Composition-based stats. Identities = 46/306 (15%), Positives = 103/306 (33%), Gaps = 54/306 (17%) Query: 33 SVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLKFRIV 92 + N+PL+ +I+ + + D + + K ++ + V Sbjct: 68 FIKNLPLFYKKAYKILKKLDFDAI--HTHDFDTAFLGYVIKK---QGKKNTNKTNPIKWV 122 Query: 93 PLIHDI-DELRGGGGS---------DSVRLATCDMVISHNPQMTKYLSKYMSQDKIK-DI 141 IHD+ + D + + D +I N + + + ++ K+ I Sbjct: 123 YDIHDLYESFIEKNNPNLAKLISKMDVILMKNADDLIVVNEKFINLIDERINDKKVLGKI 182 Query: 142 KIF-DYLVSSDVEHRDVTDKQRGVIY-AGNLSRHKCSFIYTE-----GCDFTLFGVN--- 191 KI + + + + DK +++ G LS+ + T+ G+ Sbjct: 183 KIVRNTINPPKITLKSPADKPDFMVFYGGVLSKTRYIMEMINICEELDIKMTIAGMGVLE 242 Query: 192 ------YENKDNPKYLGSFDAQSP-EKINLPGMQFGLIWDGDSVETCSGAFGDYLKFN-- 242 + N ++LG +++N + F + ++ N Sbjct: 243 NEIIAHSKESKNIRFLGKLPHDKLLDEMNNYSLNF-------------AIYDPVIRNNQL 289 Query: 243 -NPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVG----SIKEMQEIVDSMTIETYK 297 P+K + M +P+ + + + D + N G V S+KE + S E + Sbjct: 290 ATPNKLFESMCMGIPIIVTKGSVMGDIVEKNNCGLTVDFDEKSVKEAILKLKS-DKEFFN 348 Query: 298 QISENT 303 +S+N Sbjct: 349 TLSKNA 354 >UniRef50_B8DZD9 Glycosyl transferase group 1 n=2 Tax=Bacteria RepID=B8DZD9_DICTD Length = 373 Score = 49.9 bits (118), Expect = 1e-04, Method: Composition-based stats. Identities = 56/325 (17%), Positives = 110/325 (33%), Gaps = 51/325 (15%) Query: 18 ARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFW 77 A+ D +I I ++ +P+ ++R+I + ++ F+ P P Sbjct: 38 AQHDKEEIVDG---IHLIPLPIVRSRIRRMIYLPIRALKEALKLKANIYHFHDPELIPIG 94 Query: 78 HILSFFHRLLKFRIVPLIH-DIDE-----------LRGG-----GGSDSVRLATCDMVIS 120 +L F + +++ +H D+ + LRG + D +I+ Sbjct: 95 VLLKVFAK---GKVIYDVHEDVPKQIMSKYWIPKKLRGIISFIVNLGEKKFSFLFDAIIT 151 Query: 121 HNPQMTKYLSKYMSQDKIKDIKIFDYLVSS-DVEHRDVTDKQRGVIYAGNLSRHKCSFIY 179 L + IK + L +V+ ++ D +IY G LS+ + Sbjct: 152 ATD---DILKNFSFYRNAISIKNYPMLSKFLEVKGKEKKDDVFKIIYIGGLSKIRGISEV 208 Query: 180 TEGCDFTLFGVNYENKDNPKYLGSFDAQSPEKINLPGMQFG----LIW-----------D 224 + ++ V+ + G F EK F L W D Sbjct: 209 VKALEY----VDSNKEVRLILCGKFSPIEYEKEVRNLKGFEKVDYLGWLEPDEVVNKLVD 264 Query: 225 GDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIK-- 282 D+ C +Y+ P K Y++ LPV + + + N G V + Sbjct: 265 VDAGIVCLHPITNYVTA-LPVKLFEYMAAGLPVIASNFPLWREIVEGNNCGICVDPLNPK 323 Query: 283 EMQEIVDSMTI--ETYKQISENTKI 305 E+ E + + + +++ EN K Sbjct: 324 EIAEAIKYLIEHLDKAQKMGENGKK 348 >UniRef50_A3XA25 Glycosyl transferase, group 1 n=1 Tax=Roseobacter sp. MED193 RepID=A3XA25_9RHOB Length = 408 Score = 49.6 bits (117), Expect = 1e-04, Method: Composition-based stats. Identities = 16/124 (12%), Positives = 36/124 (29%), Gaps = 17/124 (13%) Query: 67 IFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDE--------------LRGGGGSDSVRL 112 I+ + + + ++ ++HDI+ L+ + + L Sbjct: 115 IYAYIPSVLTLYGAKVLKMRSGAPLIAIVHDIESGLAHSLGITSKPIMLKIMRMVERIGL 174 Query: 113 ATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSR 172 D VI M + + I + I+ V+Y+GN + Sbjct: 175 NFADHVIVLTEGMKNEIIDIGCRKPIDVLPIW---SQVADIAPIDDAGPVRVMYSGNFGK 231 Query: 173 HKCS 176 + Sbjct: 232 KQNL 235 >UniRef50_B9KB31 Putative uncharacterized protein n=1 Tax=Thermotoga neapolitana DSM 4359 RepID=B9KB31_THENN Length = 370 Score = 48.8 bits (115), Expect = 3e-04, Method: Composition-based stats. Identities = 60/339 (17%), Positives = 108/339 (31%), Gaps = 53/339 (15%) Query: 3 FLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLEN 62 + ++GF+ R I EN V L + + L E+ Sbjct: 33 VIYQYRTKESESGFEERG-IEYIPLKCENTGSVLRKLSERRIFD-----EKICHLVERED 86 Query: 63 KDVL-IFNFPMAKPFWHILSFFHRLLKFRIVPLIH----------------DIDEL---R 102 DVL + +FP KP L + +I+ IH D+ E R Sbjct: 87 YDVLYLHHFPATKPLKPFL--ITKKQGKKIIYDIHEYHPQNFLNVLPRPLSDLKEFFMWR 144 Query: 103 GGGGSDSVRLATCDMVISHNPQMTKYL--SKYMSQDKIKDIKIFDYLVSSDVEHRDVTDK 160 L D+ I + + + ++ K + +S D K Sbjct: 145 IFKKQ----LELSDLCIFVSEETRDEIVAKTGLAPSKTFVVP----NYASLKIEPDSGRK 196 Query: 161 QRGVIYAG----NLSRHKCSF--IYTEGCDFTLFGVNYEN-KDNPKYLGSFDAQSPEKIN 213 ++ +I G NL+ K + +G F + G+ + D P SF Sbjct: 197 RKEIIMVGKTQRNLTYEKKLIKALIEKGFSFRVIGMESKVFSDVPHTYTSFLPYEKMMEE 256 Query: 214 LPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDK-AALADFIVDN 272 + F L+ S T + PHK ++ PV + ++A + + Sbjct: 257 ISKGMFSLV----SYSTIGREDYKNDLYALPHKFYDSIAAGTPVVVKKSFVSMARLVKEL 312 Query: 273 RIGYAVGSIKEMQEIVDSMTI--ETYKQISENTKIISQK 309 IG + ++ + + + Y++I EN K Sbjct: 313 EIGVVIDP-SNTEDSLRKIEDACQRYERILENIKKHQNL 350 >UniRef50_C4N530 Putative glycosyltransferase n=1 Tax=Capnocytophaga canimorsus RepID=C4N530_9FLAO Length = 403 Score = 47.6 bits (112), Expect = 5e-04, Method: Composition-based stats. Identities = 44/314 (14%), Positives = 92/314 (29%), Gaps = 54/314 (17%) Query: 37 IPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIH 96 + L+ + + L D +I + P ++ +L K I + Sbjct: 78 VRLFFNYYSFAFFACLKALCLSFRNRYDAIIVHEPSPIIQFYPALLLKKLQKTPIYFWVM 137 Query: 97 DI--DELRGGGGSDSVRL-------------ATCDMVISHNPQMTKYLSKYMSQDKIKDI 141 D+ + L GG + + + ++I+ L K DKI+ Sbjct: 138 DLWPESLEIAGGVRNKIVLGYYERLVKKFYNNSEKILITSKGFRKSILQKGDFSDKIEYF 197 Query: 142 KIF--DYLVSSDVEHRDVTDKQR-GVIYAGNLSRHKCSFIY---------TEGCDFTLFG 189 + D +V D+ + V++AGN+ + + F + G Sbjct: 198 PNWAEDSIVEGDMSYPTPELPSGFRVMFAGNIGEAQDMENIMRATLILKEEKNIQFIIVG 257 Query: 190 ------------VNYENKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGD 237 N +D +G + ++ L+ F Sbjct: 258 DGRKMPFVQDFIKNNSLQDTVHCVGKYPVEAMYSF-FSKADLMLV-----SLKNDKIFN- 310 Query: 238 YLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAV-----GSIKEMQEIVDSMT 292 P K Y++ P+ AD I + G+ + S+ ++ Sbjct: 311 ---LTMPAKIQAYMAASKPIIAMINGEGADIIKEANCGFTIPAGDYKSLSDIILKSSKFK 367 Query: 293 IETYKQISENTKII 306 E +++ +N K Sbjct: 368 KEELEKLGKNGKEF 381 >UniRef50_B8EDB2 Putative uncharacterized protein n=1 Tax=Shewanella baltica OS223 RepID=B8EDB2_SHEB2 Length = 388 Score = 47.6 bits (112), Expect = 6e-04, Method: Composition-based stats. Identities = 49/306 (16%), Positives = 97/306 (31%), Gaps = 55/306 (17%) Query: 47 IISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDE------ 100 + + + + L K V + P I+ + +L + V + DI Sbjct: 84 VFFMLWVFSILLLKRPKKVYVSTDPPV-LVPFIVMIYCKLFRANYVYHLQDIHPEAANVV 142 Query: 101 -------LRGGGGSDSVRLATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVE 153 R G DS+ + D++I+ +M + + + L + V Sbjct: 143 IPVKPLLFRVLKGMDSITMRHADLLITITKEMAEEIRNRSLTVSPIKL-----LANPSVS 197 Query: 154 HRDVT---DKQRGVIYAGN---LSRHKCSFIYTEG-------CDFTLFGVNY-------- 192 V K+ G + GN L R + F G Sbjct: 198 FEHVAVPLAKKTGFTFCGNAGRLQRMPILIQAIKQYCQAGGTLQFVFAGAGVYANQLQDL 257 Query: 193 -ENKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYL 251 E N Y G A +++ ++ L+ D V +F P K+S Y+ Sbjct: 258 AETYVNVSYKGLVSASEAAQLS-ADYEWALLPIEDEV----------TRFAFPSKSSSYV 306 Query: 252 SMELPVFI--WDKAALADFIVDNRIGYAVG-SIKEMQEIVDSMTIETYKQISENTKIISQ 308 + D ++A+++ N +G + ++ + + ++ Y + N + Sbjct: 307 FSGAKILAVCGDYTSVAEWVTTNCLGVVIEPNVDSLCQTFFAIESGGYDKFQFNIEREQL 366 Query: 309 KIRTGS 314 K R G Sbjct: 367 KKRLGF 372 >UniRef50_A8U9E3 Putative uncharacterized protein n=1 Tax=Carnobacterium sp. AT7 RepID=A8U9E3_9LACT Length = 402 Score = 46.5 bits (109), Expect = 0.001, Method: Composition-based stats. Identities = 46/269 (17%), Positives = 96/269 (35%), Gaps = 40/269 (14%) Query: 22 ALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGL-ENKDVLIFNFPMAKPFWHIL 80 D S YE +N+P+ ++ I ++++ + + + D +I + + PF I+ Sbjct: 75 ISDFISSYETY-TINLPIIKHELRFIEYKIQINKWYEKMDKETDKIIIIYDLYIPFLKII 133 Query: 81 SFFHR-LLKFRIVPLIHDI---------------DELRGGGGSDSVRLATCDMVISHNPQ 124 + +IV +I D+ + L + D + Q Sbjct: 134 KWMKDTYENVKIVVMIPDLVGKYRNNSIKSKTKRNLLERKVDKTFELMNQADGYLLITEQ 193 Query: 125 MTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIYTEGCD 184 +++++ + ++ I D +V+ + +KQ +Y+G LS Y Sbjct: 194 ISRFIEDEN-KPRMVIDGIVD---DKNVKFKITNNKQTIFMYSGLLSSQ-----YNVDKL 244 Query: 185 FTLFGVNYENKDNPKYLGSFDAQSPEKINLPGMQ---FGLI---------WDGDSVETCS 232 +F +N E +L + + + +GLI + D + Sbjct: 245 IDIF-LNLEENQAQLWLCGYGELEAKLKKIESTNIKFYGLIPKKEVSDLEYQADVLINPR 303 Query: 233 GAFGDYLKFNNPHKTSLYLSMELPVFIWD 261 G+Y K++ P K YL PV + Sbjct: 304 SNKGEYTKYSFPSKNLEYLLKGKPVICYK 332 >UniRef50_C4Z1X4 Putative uncharacterized protein n=1 Tax=Eubacterium eligens ATCC 27750 RepID=C4Z1X4_EUBE2 Length = 93 Score = 45.3 bits (106), Expect = 0.003, Method: Composition-based stats. Identities = 16/90 (17%), Positives = 29/90 (32%), Gaps = 1/90 (1%) Query: 2 YFLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLE 61 Y++N +AG KA D +I G++ +I + L + Sbjct: 5 YYINIKMKENNNAGSKAVNDCNNILKQCGIEPYTLNIKGEGLLGKINKVFEFEK-LKKIP 63 Query: 62 NKDVLIFNFPMAKPFWHILSFFHRLLKFRI 91 VL P+ + + K +I Sbjct: 64 ENSVLFIQHPIYINKNYYIDVLKNTKKKKI 93 >UniRef50_A7ZC10 Glycosyl transferase, group 1 family protein n=1 Tax=Campylobacter concisus 13826 RepID=A7ZC10_CAMC1 Length = 368 Score = 44.5 bits (104), Expect = 0.005, Method: Composition-based stats. Identities = 44/284 (15%), Positives = 90/284 (31%), Gaps = 39/284 (13%) Query: 48 ISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDELRGGG-- 105 I + L + NKD + + + + + + ++ ++V H+ L Sbjct: 76 IFNFFLKKIINNYLNKDAIFYTRHLKIAKFLLEN---KMPDQKVVFEAHECFTLGNKALY 132 Query: 106 GSDSVRLATCDMVISHNPQMTKYLSK----YMSQDKIKDIKIFDYLVSSDVEHRDVTDKQ 161 + L D + SHN L K ++ + D + + Sbjct: 133 DMEKEILQNADFIFSHNSSTLSELRKFFGLQIANSAVVYNG-----CKQDYDFKKKDFDF 187 Query: 162 RGVIYAGNLSRHKCS-----FIYTEGCDFTLFGVNYENK----------DNPKYLGSFDA 206 + Y G+ K F L+G N N + L F Sbjct: 188 SSINYYGSFLLWKGLDLMLDFALKTNIKLELYGKNSGNSFMTLKNTLKEREIENLVCFKG 247 Query: 207 QSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALA 266 P+ + LI + +++ DY ++ P K Y++ V + +A Sbjct: 248 LLPQNEVVKS----LI-ENNTILIIPSVKSDYSLYSTPLKLFEYMANSNVVLAPNFPPVA 302 Query: 267 DFIVDNRIGYAVG-----SIKEMQEIVDSMTIETYKQISENTKI 305 + + D G+ S++E + ++ E +IS+N Sbjct: 303 EIVKDGENGFLYEAGDEKSLEEKFNYIKTLGNEELNKISKNAYE 346 >UniRef50_Q1NU87 Glycosyl transferase, group 1 n=2 Tax=Proteobacteria RepID=Q1NU87_9DELT Length = 420 Score = 44.5 bits (104), Expect = 0.005, Method: Composition-based stats. Identities = 42/297 (14%), Positives = 91/297 (30%), Gaps = 52/297 (17%) Query: 22 ALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILS 81 D+ + ++ ++ + + L ++I P + Sbjct: 77 IDDVTVERLSLPTEIGRPLVRILNALRLGISLMYRAVTRRYDVIMISTVPPVLG-GFSAA 135 Query: 82 FFHRLLKFRIVPLIHDIDE--------------LRGGGGSDSVRLATCDMVISHNPQMTK 127 RL R + DI D+ D ++ + M Sbjct: 136 LAARLSNARFIYHCMDIHPEIGRISGEFAQPIVFSTLRKLDNWSCRQADPIVVLSRDMET 195 Query: 128 YLSK--YMSQDKIKDIKIFDYLVSSDVEHRDVTDKQR---GVIYAGNLSRHKCSFIYTE- 181 L + + +++ + F S + + D R V+YAGN+ R + + E Sbjct: 196 TLRERAGGHRFRVEVLNNFPLPDSGEDLKPEEFDSHRDGLTVLYAGNVGRFQGLQMAVEA 255 Query: 182 --------GCDFTLFGVNYENKD----------NPKYLGSFDAQSPEKINLPGMQFGLIW 223 +F + G + ++LG+ ++ K + G + Sbjct: 256 MTKLKERTDIEFLVMGDGVAKSELQAQVEKSGAKVRFLGAQSVET-AKAAMRSADIGYV- 313 Query: 224 DGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVF--IWDKAALADFIVDNRIGYAV 278 + ++ P KT YL P+ + D++ LA I ++ G++V Sbjct: 314 ---------SLVPEMYRYAYPSKTMNYLEQGCPIIAAVEDESGLAKEIWEDGCGFSV 361 >UniRef50_B5EGU3 Putative uncharacterized protein n=1 Tax=Geobacter bemidjiensis Bem RepID=B5EGU3_GEOBB Length = 414 Score = 44.2 bits (103), Expect = 0.006, Method: Composition-based stats. Identities = 13/63 (20%), Positives = 25/63 (39%), Gaps = 1/63 (1%) Query: 243 NPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQISEN 302 + K +LYL +PV + + + R G + + E+ V+ + Y+ EN Sbjct: 306 SSEKMALYLQSGVPVIAYANESYELLMEHYRCGELIRDMSELPAAVERI-EADYEGYREN 364 Query: 303 TKI 305 Sbjct: 365 ALS 367 >UniRef50_Q0W4G3 Glycosyltransferase (Group 1) n=1 Tax=uncultured methanogenic archaeon RC-I RepID=Q0W4G3_UNCMA Length = 352 Score = 44.2 bits (103), Expect = 0.007, Method: Composition-based stats. Identities = 29/167 (17%), Positives = 62/167 (37%), Gaps = 15/167 (8%) Query: 29 YENISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLK 88 Y NI++ I ++++V + N D++ + ++ K Sbjct: 42 YSNITIHEI---SWKKTELVANVLEIRKIVKSFNPDII----HFTSFHFILILLVPFFKK 94 Query: 89 FRIVPLIHDIDELRGGGGSDSV-----RLATCDMVISHNPQMTKYLSKYMS-QDKIKDIK 142 +RIV HD+D +G L D++I+H + L + + KI + Sbjct: 95 YRIVVTAHDVDAHQGTDNFFYKFVLDQYLKLGDLLITHGKNLKDRLVEKGFDESKIFILP 154 Query: 143 IFDY--LVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIYTEGCDFTL 187 DY ++ VE + + +++ G + ++K E + Sbjct: 155 HGDYSFFLNYSVEKNSSVENRDTLLFFGRILKYKGLNYLLESLKLVI 201 >UniRef50_Q83GT6 Glycosyltransferase domain-containing protein n=2 Tax=Tropheryma whipplei RepID=Q83GT6_TROWT Length = 876 Score = 43.8 bits (102), Expect = 0.009, Method: Composition-based stats. Identities = 15/83 (18%), Positives = 32/83 (38%), Gaps = 10/83 (12%) Query: 250 YLSMELPVFIWDKAALADFIVDNRIGYAVG-----SIKEMQEIVDSMTIETYKQISENTK 304 YL LP+ I D A ++ + +G V S+ + E + + N + Sbjct: 739 YLWAGLPMVITDGDVFAQYVKEYNLGLVVEQGNVRSLADALEKIL-FDQDFILACKRNIE 797 Query: 305 IISQKIRTGSYFRDVLEEVIDDL 327 ++ ++ +VL +I + Sbjct: 798 EFRRR----FFWEEVLRPLIRRI 816 >UniRef50_B5JPV4 Glycosyl transferase, group 2 family protein n=1 Tax=Verrucomicrobiae bacterium DG1235 RepID=B5JPV4_9BACT Length = 634 Score = 43.4 bits (101), Expect = 0.011, Method: Composition-based stats. Identities = 17/127 (13%), Positives = 39/127 (30%), Gaps = 19/127 (14%) Query: 66 LIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDELRGG----------------GGSDS 109 + + P + L + + HDI LR + Sbjct: 355 VFLSRPHISIKYIDL--IKERSSAKTLYFGHDIHHLRLELEAIYKDVEELKILKTRKQEI 412 Query: 110 VRLATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGN 169 D ++ + + Y++ +K ++ I+ Y +S E + G+++ GN Sbjct: 413 ELWEKADYLLYPSKEEVDYIASKGFAEKAVEVPIYFYDPNSKPERLPFEESD-GILFVGN 471 Query: 170 LSRHKCS 176 + Sbjct: 472 FNHPPNL 478 >UniRef50_Q1WS01 Glycosyltransferase n=2 Tax=Lactobacillus salivarius RepID=Q1WS01_LACS1 Length = 350 Score = 43.4 bits (101), Expect = 0.012, Method: Composition-based stats. Identities = 30/185 (16%), Positives = 65/185 (35%), Gaps = 18/185 (9%) Query: 26 ASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHR 85 ++ VNI +V +I +K+ F E I P + + Sbjct: 49 LQKLDDKVSVNIYTGNNLVSKIWFIMKIFIF----EKDTNYISLSPSLILIANKVRKLCN 104 Query: 86 LLKFRIVPLIHDIDELRGGGGSDSVRLATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFD 145 K++I+ IH L+ D +L D ++ + + + L + + + + IF+ Sbjct: 105 -KKYKIISWIH--FSLKNQDMFDPGKLVNADGHLAISSVIREQLLELGVKSD-EIMTIFN 160 Query: 146 YLVSSDVEHRDVTDKQRGVIYAG--------NLSRHKCSFIYTEGCDFTL--FGVNYENK 195 + + + + YAG N+S +G ++ L +G + + Sbjct: 161 PIERHNSIPEVKKENYLNLFYAGRMTFDGQKNISELLSGISKIKGINYHLDMYGSGEDLE 220 Query: 196 DNPKY 200 +Y Sbjct: 221 KCKEY 225 >UniRef50_D1Y981 Glycosyltransferase, group 1 family protein n=2 Tax=Propionibacterium acnes RepID=D1Y981_PROAC Length = 391 Score = 43.4 bits (101), Expect = 0.012, Method: Composition-based stats. Identities = 12/81 (14%), Positives = 33/81 (40%), Gaps = 1/81 (1%) Query: 243 NPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQISEN 302 +P+K Y++ L V K + D I D+ +G V S ++ ++ + + Sbjct: 302 SPNKLYDYMAAGLAVVSNAKVPIRDVISDDEVGACVDS-TDLVAGIERVRDADEATMKRW 360 Query: 303 TKIISQKIRTGSYFRDVLEEV 323 + + + + ++++ Sbjct: 361 HERARELMANKYSLQASVDKL 381 >UniRef50_C3NKC1 Glycosyl transferase group 1 n=4 Tax=Thermoprotei RepID=C3NKC1_SULIN Length = 373 Score = 43.4 bits (101), Expect = 0.012, Method: Composition-based stats. Identities = 16/69 (23%), Positives = 34/69 (49%), Gaps = 2/69 (2%) Query: 255 LPVFIWDKAALADFIVDNRIGYAVG--SIKEMQEIVDSMTIETYKQISENTKIISQKIRT 312 LPV I D+ A+ +G + +IKE++E + +M YK++ I ++ Sbjct: 302 LPVIINDELGTANLATRYNVGLVIHGINIKEIKEFLRNMNENKYKELQLGIAKIQKEYTW 361 Query: 313 GSYFRDVLE 321 ++ + ++E Sbjct: 362 ENHTKKLVE 370 >UniRef50_UPI0001C4246F glycosyltransferase n=1 Tax=Bacillus pseudofirmus OF4 RepID=UPI0001C4246F Length = 419 Score = 43.0 bits (100), Expect = 0.014, Method: Composition-based stats. Identities = 38/246 (15%), Positives = 77/246 (31%), Gaps = 52/246 (21%) Query: 102 RGGGGSDSVRLATCDMVISHNP------QMTKYLSKYMSQDKI---------KDIKIFDY 146 RG + D +I + K+ + + + DI+ F+ Sbjct: 162 RGLTAGEQWIYKKADALIFTKEGDTDYIKEKKWDIEQGGEINLDKCHYINNGVDIESFEL 221 Query: 147 L-VSSDVEHRDVTDKQRGVIYAG---------NLSRHKCSFIYTEGCDFTLFGVN----- 191 L ++ V+ D++ + V+Y G NL E F ++G Sbjct: 222 LASNNKVDDEDLSSAKFNVVYVGAIRPVNNVGNLLDAASLLKDKEDIQFLIYGDGNQKEM 281 Query: 192 ------YENKDNPKYLGSFDAQ-SPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNP 244 EN N K G + + P ++ + + S ++ + N+ Sbjct: 282 LEKRVVEENLTNVKLKGFVNKRLIPYILSKSSVN---------ILNYSQTQYNWTRGNSS 332 Query: 245 HKTSLYLSMELPVFIWDKAALADFIVDNRIGY-----AVGSIKEMQEIVDSMTIETYKQI 299 +K Y++ P+ K + + G + + + + E Y I Sbjct: 333 NKLFEYMASGKPIISTVKMG-YSILDKYQCGIELEKSTPEELANAIIEIRNFSEEQYNAI 391 Query: 300 SENTKI 305 S+N K Sbjct: 392 SKNAKK 397 >UniRef50_B3EBP3 Glycosyl transferase family 2 n=1 Tax=Geobacter lovleyi SZ RepID=B3EBP3_GEOLS Length = 669 Score = 43.0 bits (100), Expect = 0.014, Method: Composition-based stats. Identities = 11/61 (18%), Positives = 29/61 (47%) Query: 246 KTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQISENTKI 305 K + YL LP+ + + ++ + +G+ VGS+ + ++ +++ +Q N + Sbjct: 316 KIATYLQYGLPIVVNEIGEMSRHVRQFGLGWVVGSVTDTGRVLANLSRSDLEQSGLNAEQ 375 Query: 306 I 306 Sbjct: 376 F 376 >UniRef50_B5YCR2 WbpH n=10 Tax=Bacteria RepID=B5YCR2_DICT6 Length = 373 Score = 43.0 bits (100), Expect = 0.016, Method: Composition-based stats. Identities = 46/312 (14%), Positives = 99/312 (31%), Gaps = 57/312 (18%) Query: 31 NISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLKFR 90 + ++ +P G ++R+I + L N + + + P + +LL R Sbjct: 47 GVHILPLPKVGSRLERVIKQPWRALRLALKTNSSI----YHLHDPELIPIGLILKLLGKR 102 Query: 91 IVPLIHDIDELRGGGGS-----------------DSVRLATCDMVISHNPQMTKYLSKYM 133 ++ H+ L+ + D ++ P +T+ K Sbjct: 103 VIFDSHEDVPLQLLSKPYLNRFALRMLSQVFSIFEKYSCRYFDGIVCATPSITEKFLKIN 162 Query: 134 SQDKIKDIKIFDYLVSSD-VEHRDVTDKQRGVIYAGNLSRHKCSFIYTEGCDFTLFGVNY 192 ++ + L S + H +K + Y G +S+ + + +F Sbjct: 163 PNS--VNVNNYPLLEESKFLYHNYNENKMNEICYIGGISQIRGINELIKALEFV------ 214 Query: 193 ENKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDY-------------- 238 DN + + + +S E G W + G Y Sbjct: 215 ---DNVRLNLAGNFESAELEKRIKGMKG--WKKVNYYGFVGRENVYEIMARSKAGVVIFS 269 Query: 239 ---LKFNN-PHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIK--EMQEIVDSMT 292 N+ P+K Y+S LPV + + + + G V + E+ + + + Sbjct: 270 PLPNHINSQPNKMFEYMSAGLPVITSNFPLWREIVERDNCGICVDPLNPKEIADAIRYII 329 Query: 293 I--ETYKQISEN 302 E K++ +N Sbjct: 330 AHPEEAKKMGDN 341 >UniRef50_B8GDE8 Glycosyl transferase group 1 n=1 Tax=Methanosphaerula palustris E1-9c RepID=B8GDE8_METPE Length = 364 Score = 43.0 bits (100), Expect = 0.016, Method: Composition-based stats. Identities = 28/143 (19%), Positives = 52/143 (36%), Gaps = 14/143 (9%) Query: 42 GVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKP-FWHILSFFHRLLKFRIVPLIHDIDE 100 ++ + ++ FL + +D I + F +L F HR I+ IHD++ Sbjct: 64 NFIKNTLIFTRVIRFLRTVREEDPDIIHVNGYSLWFSLLLPFLHRYP---IITTIHDVNP 120 Query: 101 LRGGGGSDSVR-----LATCDMVISHNPQMTKYLSKYM-SQDKIKDIKIFDYLVSSDVEH 154 G D D +I H ++ K + I I DY S E Sbjct: 121 HTGSRQFDQTIARRLFFWYSDALIVH----GEWAKKQLSVSTPIYVIPHGDYSFFSTCEG 176 Query: 155 RDVTDKQRGVIYAGNLSRHKCSF 177 +V ++ +++ G + +K Sbjct: 177 EEVAEEVGTILFFGRIEDYKGLQ 199 >UniRef50_B9P363 Glycosyl transferase, group 1 n=1 Tax=Prochlorococcus marinus str. MIT 9202 RepID=B9P363_PROMA Length = 407 Score = 43.0 bits (100), Expect = 0.017, Method: Composition-based stats. Identities = 50/349 (14%), Positives = 97/349 (27%), Gaps = 58/349 (16%) Query: 24 DIASDYENISVVNIPLWGGVVQRIISSVKLSTF--LCGLENKDVLIFNFPMAKPFWHILS 81 I + + IS ++T L + D+ I+ + + Sbjct: 67 KILLHRVFLFPSHDKSSIKRAINYISFAIMATLYGLFKINKPDI-IYAYHPPLTVGICGA 125 Query: 82 FFHRLLKFRIVPLIHDIDE--------------LRGGGGSDSVRLATCDMVISHNPQMTK 127 + K +V I D+ L+ D +I + Sbjct: 126 ILKKFYKVPLVYDIQDMWPDSLKATGMVNSKLILKITSKLCKKTYKLSDKIIVLSNGFRN 185 Query: 128 YLSKYMS-QDKIKDIKIFDYLVSSDVEHRDV---TDKQRGVIYAGNLSRHKCSFIYTEGC 183 L K + KI+ I + + + ++ ++ K+ +I+AGN+ + + Sbjct: 186 CLIKRGVEKSKIEIIYNWSNIDNKNINTSNIIQINKKKFNIIFAGNIGKAQSL------- 238 Query: 184 DFTLFGVNYENKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETC--SGAFGDYL-- 239 + L+ N + INL + D G YL Sbjct: 239 ETLLYAAEIIKTKNQNIDFYIIGDGIDLINLKNQVKNMHLDNIKFIPRIEPKYIGGYLNK 298 Query: 240 --------------KFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVG-----S 280 K P KT Y++ P+ + AD I + G Sbjct: 299 ADAFLVHLRNNSLFKITIPSKTQTYMAFGKPIIMAVNGDAADLIKEAECGIVTEPQNPKQ 358 Query: 281 IKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDDLKT 329 + E + S + +I N K + + ID+ + Sbjct: 359 LAIAIEKLVSYKKKRLNKIGLNGLNFYNK-------NLAINKGIDNFAS 400 >UniRef50_C2HRD4 Glycosyl transferase group 1 family protein n=1 Tax=Vibrio cholerae bv. albensis VL426 RepID=C2HRD4_VIBCH Length = 378 Score = 42.6 bits (99), Expect = 0.018, Method: Composition-based stats. Identities = 16/70 (22%), Positives = 30/70 (42%), Gaps = 2/70 (2%) Query: 241 FNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVG--SIKEMQEIVDSMTIETYKQ 298 F P+K Y +PV + D +A+ + G + S++ + E VD + + Sbjct: 289 FCMPNKLFEYAMAGIPVIVSDMKEMAEAVQTADFGVVLTEYSVESINEAVDRLAERDLTE 348 Query: 299 ISENTKIISQ 308 +S N +Q Sbjct: 349 LSNNAYQFAQ 358 >UniRef50_A8ZY94 Glycosyl transferase group 1 n=1 Tax=Desulfococcus oleovorans Hxd3 RepID=A8ZY94_DESOH Length = 522 Score = 42.6 bits (99), Expect = 0.019, Method: Composition-based stats. Identities = 18/88 (20%), Positives = 43/88 (48%), Gaps = 5/88 (5%) Query: 246 KTSLYLSMELPVFIWDKAALADFIVDNRIGYAV--GSIKEMQEIVDSMTIETYKQI---S 300 K YLS LPV A+ + GY + S+ + ++++S+ + ++++ S Sbjct: 358 KFIEYLSAGLPVITSPLIEQANIVNRYDCGYILKDNSVDNLVDVLESILAQGHQELAVKS 417 Query: 301 ENTKIISQKIRTGSYFRDVLEEVIDDLK 328 EN +++ ++++VL + + + K Sbjct: 418 ENALAAAKQYFDWHHYKEVLVQAVLNNK 445 >UniRef50_Q2LWM1 Glycosyltransferase n=1 Tax=Syntrophus aciditrophicus SB RepID=Q2LWM1_SYNAS Length = 379 Score = 42.6 bits (99), Expect = 0.020, Method: Composition-based stats. Identities = 11/70 (15%), Positives = 31/70 (44%), Gaps = 5/70 (7%) Query: 241 FNNPHKTSLYLSMELPVFIWDKA-ALADFIVDNRIGYAVGSIK--EMQEIVDSM--TIET 295 + PHK Y++ + V + A +A F+ + + G V + ++ + +D + + + Sbjct: 284 YAMPHKMFDYMAAGMAVICPEFAMEVAPFVKEAKCGLLVDTANPADLAKKLDELVSSPDL 343 Query: 296 YKQISENTKI 305 ++ + Sbjct: 344 IHEMGVRAQK 353 >UniRef50_C6LKZ9 Putative glycosyl transferase n=1 Tax=Bryantella formatexigens DSM 14469 RepID=C6LKZ9_9FIRM Length = 413 Score = 42.6 bits (99), Expect = 0.020, Method: Composition-based stats. Identities = 32/212 (15%), Positives = 78/212 (36%), Gaps = 27/212 (12%) Query: 112 LATCDMVISHNP-QMTKYLSKYMSQDKIKDIKIFDY-LVSSDVEHRDVTDKQRGVIYAGN 169 ++ D +I + + ++ + + +I I Y L + H + ++Y G+ Sbjct: 192 ISKADHIICTSEFAKSSLIANNIDEKRIHVIS---YGLEQNKKSHNIGKPGKLSLLYVGS 248 Query: 170 LSRHKCSF--------IYTEGCDFTLFGVNYENKDNPKYLGSFDAQSPEKINLPGMQFGL 221 +S K + I +E + L G NY + + + + + + L Sbjct: 249 VSCEKGLYFLLEAVKRINSEEIELVLVGKNYIDDKLLEPYKKWCNFIGDIPHTQVENYYL 308 Query: 222 IWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSI 281 D + T +FG S +S +P+ A AD+I + G+ + + Sbjct: 309 NADVFILPTLFDSFGRV--------VSEAMSYGIPIISTSNAGAADYIKNGENGFVIPA- 359 Query: 282 KEMQEIVDSM-----TIETYKQISENTKIISQ 308 ++ +V+ + + K + + + ++ Sbjct: 360 GDIDSMVEKIRYFLLNRDEVKIMGKKAQTTAE 391 >UniRef50_B9YG53 Glycosyl transferase group 1 n=1 Tax='Nostoc azollae' 0708 RepID=B9YG53_ANAAZ Length = 475 Score = 42.2 bits (98), Expect = 0.022, Method: Composition-based stats. Identities = 45/328 (13%), Positives = 100/328 (30%), Gaps = 66/328 (20%) Query: 30 ENISVVNIPLWGGVVQRIISSVK--LSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLL 87 + V + + G V ++ +++ L + L+ + P P L H L Sbjct: 137 RSTQVWSGRIRGKAVNGVLFTLRAFLHIIRNFRRHNVFLVTSAPPFLPIAGYL--AHLCL 194 Query: 88 KFRIVPLIHDIDE-----LRGGGGS----------DSVRLATCDMVISHNPQMTKYLSKY 132 K V LI+D+ L+ + + + ++ +P M K + Sbjct: 195 KISYVCLIYDLYPDIAIALQVIKRNHWLAGFWRQLNRMMWRKSKGIVVLSPDMKKRVIAI 254 Query: 133 MSQD--KIKDIKIF---DYLVSSDVEHRDVTDKQR-----GVIYAGNLSR--------HK 174 + K+ I + D +V E ++ V+Y+GN+ R Sbjct: 255 CPEVADKVSVIHSWGDPDLIVPIAKEINWFAEEHNLVNKFTVLYSGNMGRCHDMDTILET 314 Query: 175 CSFIYTEGCDFTLFGVNYENKDNPKYLGSFDAQSPEKIN----------LPGMQFGLIWD 224 + E F G + K + + + + L L+ Sbjct: 315 AKQLRNEPIQFVCIGSGAKRKSFIEAVNKSGVTNFLFLPYQDKQVLPYSLTACDLSLVSV 374 Query: 225 GDSVETCSGAFGDYLKFNNPHKTSLYLSMELPV--FIWDKAALADFIVDNRIGYAVGSIK 282 +E+ P K L+ P+ + L I + G V + Sbjct: 375 EAGMES----------LVAPSKLYPALAAGRPIAAICSKYSYLRQLIAEGNCGVCVENGD 424 Query: 283 EMQEIVDSMTIETYKQISENTKIISQKI 310 S+ + + ++ + + +++ + Sbjct: 425 -------SLALAEFIRLLNSDRQLAELM 445 >UniRef50_A8RK70 Putative uncharacterized protein n=2 Tax=Clostridium RepID=A8RK70_9CLOT Length = 405 Score = 42.2 bits (98), Expect = 0.022, Method: Composition-based stats. Identities = 45/320 (14%), Positives = 103/320 (32%), Gaps = 46/320 (14%) Query: 22 ALDIASDYENISVVNIPLWGGVVQRIISS---VKLSTFLCGLENKDVLIFNFPMAKPFWH 78 + + + I+ + ++ S V L+ + ++ +D + + F Sbjct: 77 IATVLLEQQFINAIKKYYSNTKFDLVLYSTPPVTLARVVAYIKKRDKALSYLMLKDIFPQ 136 Query: 79 ILSFFHRLLKFRIVPLIHDIDELRGGGGSDSVRLATCDMVISHNPQMTKYLSKY--MSQD 136 L K + +I+ + + D++ + +Y+ ++ + + Sbjct: 137 NSIDLGILKKTGLKGVIY-----KYFSLKEQKLYKLSDVIGCTSEANIRYVKEHDELDKK 191 Query: 137 KIKD----------IKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKC-SFIYT----- 180 KI + + + D + ++ +Y GNL R + FI Sbjct: 192 KIIEFCPNCSDWYDLSLPDNGKKEVRNKYGLPVDKKIFVYGGNLGRPQDVPFIVKCLEAC 251 Query: 181 ---EGCDFTLFGVNYENKDNPKYLGSFDAQSPEKI-NLPGMQFGLIWDGDSVETCSGAFG 236 + F + G E +Y+ + LP ++ DS+ C Sbjct: 252 KDMKNVYFLVVGSGTEKHYLDEYVEKESCSHVRVMGQLPKQEY------DSMVACCDCGI 305 Query: 237 DYLKF-----NNPHKTSLYLSMELPVFIWDKAA--LADFIVDNRIGY--AVGSIKEMQEI 287 +L + N P + Y+ +PV A + D + DN G+ I+ + Sbjct: 306 IFLDYRFTVPNTPSRLLAYIQAGIPVLTCTDPATDVGDIVEDNGFGWQCTSDKIENFVRL 365 Query: 288 VDSMTIETYK-QISENTKII 306 V+ + ++ EN Sbjct: 366 VEHIANLDIDPRMKENGLKY 385 >UniRef50_A5D386 Glycosyltransferase n=1 Tax=Pelotomaculum thermopropionicum SI RepID=A5D386_PELTS Length = 400 Score = 42.2 bits (98), Expect = 0.024, Method: Composition-based stats. Identities = 12/51 (23%), Positives = 21/51 (41%), Gaps = 2/51 (3%) Query: 246 KTSLYLSMELPVFIWDKAALADFIVDNRIGYAV--GSIKEMQEIVDSMTIE 294 K YL+ LPV I D LAD + G ++++ + ++ Sbjct: 306 KVFSYLACGLPVVIPDIPDLADVVRRAGCGLVAAPDRLEDLAAALKAVLDN 356 >UniRef50_C8P2S3 Glycosyltransferase n=1 Tax=Erysipelothrix rhusiopathiae ATCC 19414 RepID=C8P2S3_ERYRH Length = 404 Score = 42.2 bits (98), Expect = 0.025, Method: Composition-based stats. Identities = 44/253 (17%), Positives = 84/253 (33%), Gaps = 50/253 (19%) Query: 102 RGGGGSDSVRLATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQ 161 R ++ LA D+ I N Q ++K IK ++L+ + + Sbjct: 176 RQFILKNNEYLAESDITIVPNSISINR-VCGRVQAEVKQIKRNEFLLPLE---------K 225 Query: 162 RGVIYAGNLSRHKC-SFI---------YTEGCDFTLFGVNYENKDNPKYL---------- 201 + ++YAGNL + + F+ F G + KY Sbjct: 226 KIIVYAGNLGKPQSIDFLIESLEKIKESENFVHFAFCGSGTDANKLKKYCIDNPTQCSYY 285 Query: 202 GSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWD 261 G +++ + +GLI + N P + Y+ LP+ Sbjct: 286 GQLSKLKSDEL-ISISDYGLI----------LLDARFTIPNIPSRMLSYMKFGLPLIALT 334 Query: 262 K--AALADFIVDNRIGYAVGS----IKEMQEIVDSMTIETYKQISENTKIISQKIRTGSY 315 + D I++N +GY S +K + + ++ E Y + S + + Sbjct: 335 DINTDIKDTIIENNLGYWAESRGEEMKNIINSIHGLSDENYVKSSNCVIKYVKDMCN--- 391 Query: 316 FRDVLEEVIDDLK 328 +E+I LK Sbjct: 392 TEKGYKEIIKQLK 404 >UniRef50_Q8AAS2 Lipopolysaccharide biosynthesis RfbU-related protein n=1 Tax=Bacteroides thetaiotaomicron RepID=Q8AAS2_BACTN Length = 368 Score = 42.2 bits (98), Expect = 0.028, Method: Composition-based stats. Identities = 53/328 (16%), Positives = 109/328 (33%), Gaps = 62/328 (18%) Query: 20 KDALDIASDYENISVVNIPLWGGVVQRI--ISSVKLSTFLCGLENKDVLIFNFPMAKPFW 77 KD + + + +P G+ Q I I + + G + V+++NFP Sbjct: 43 KDIKNALAVVDGFVSTPVPYPIGIKQWIHQICTFISIKIILGRKPDYVVLYNFPAIAS-- 100 Query: 78 HILSFFHRLLKFRIVPLIHDI------------DELRGGGGSDSVRL--ATCDMVISHNP 123 + + ++HD+ D +R + +R D VI+ + Sbjct: 101 ---LKILKACHKHGIKVVHDLTEWESNNRWSPSDMMRKIDINLRMRYCVKKMDGVIAISR 157 Query: 124 QMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQR-GVIYAGN--------LSRHK 174 + Y KY + I D R+++ + ++YAG+ L Sbjct: 158 YLYDYYKKY--TNTILVPPTVDLTAGKWNRQRELSAGDKIKLVYAGSAGFGVKDRLDTIA 215 Query: 175 CSFIYTEGCDFTL-----------FGVNYENKDNPKYLGSFDAQSPEKINLPGMQFG-LI 222 + + F + +G ++ N + G K + F LI Sbjct: 216 KAIVKFPNMQFDVIGMTEGQYVSGYGELPKDCKNILFHGRLPHTETVK-AVQDADFQFLI 274 Query: 223 WDGDSVETCSGAFGDYLKFNN--PHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGS 280 D + LK N P K ++ PV + ++D++ D + G+ V Sbjct: 275 RDSN------------LKNNAGFPTKFVESITCCTPVIATLTSNISDYLKDGKNGFVVDD 322 Query: 281 ---IKEMQEIVDSMTIETYKQISENTKI 305 + ++ ++ ++ Q+ E K Sbjct: 323 SHSLDDVFGLISKLSPSEIIQMKEACKN 350 >UniRef50_A8F8F9 Putative uncharacterized protein n=1 Tax=Thermotoga lettingae TMO RepID=A8F8F9_THELT Length = 363 Score = 41.9 bits (97), Expect = 0.029, Method: Composition-based stats. Identities = 44/270 (16%), Positives = 95/270 (35%), Gaps = 35/270 (12%) Query: 62 NKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDELRGGGGSDSVR---------- 111 N DV+ F++ ++ +L K +I IH+I + G Sbjct: 84 NPDVVYFHYLPFTGSG-MIKRLKQLGK-KIFFEIHEIIPEQFMGKYAIFSPVKSLIWKEF 141 Query: 112 ---LATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAG 168 + D VI + + Y+ K + F L + + + K + ++ G Sbjct: 142 STSIRLSDGVICISEDIAMYVFDRCGIQK----EFF-ILPNMALMEIESNAKSKEIVLVG 196 Query: 169 NLSRHKC------SFIYTEGCDFTLFGVNYENKDNPKYLGSFDAQSPEKIN-LPGMQFGL 221 SR + G F + G+ + + Y E + L F L Sbjct: 197 KDSRELFYEKEILRKLIDSGFRFKVIGLKSDLFKDIPYEYVPFLPYDEMMEQLSRASFSL 256 Query: 222 IWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDK-AALADFIVDNRIGYAVGS 280 I G+ S + +Y +F+ P+K ++ PV + ++ + +G + Sbjct: 257 ISYGNEK---SRDYKNY-EFSMPNKLFDSIAAGTPVIVRRSFVSMVKIVERFGVGVVIEP 312 Query: 281 IKEMQEIVDSMTI--ETYKQISENTKIISQ 308 ++++ V+ + + Y +I N ++ + Sbjct: 313 -RDVESSVEKILKAYDDYDRILSNLRVCKK 341 >UniRef50_Q1IPW4 Glycosyl transferase, group 1 n=1 Tax=Candidatus Koribacter versatilis Ellin345 RepID=Q1IPW4_ACIBL Length = 346 Score = 41.9 bits (97), Expect = 0.030, Method: Composition-based stats. Identities = 11/42 (26%), Positives = 21/42 (50%) Query: 251 LSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMT 292 ++ PV W + AL + + D GY V S++ M + ++ Sbjct: 265 MACGTPVITWRRGALPEIVADGVTGYIVDSLEAMVSAISDVS 306 >UniRef50_Q2ILL1 Glycosyl transferase, group 1 n=1 Tax=Anaeromyxobacter dehalogenans 2CP-C RepID=Q2ILL1_ANADE Length = 463 Score = 41.9 bits (97), Expect = 0.031, Method: Composition-based stats. Identities = 15/72 (20%), Positives = 30/72 (41%), Gaps = 4/72 (5%) Query: 241 FNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVG--SIKEMQEIVDSMTIE--TY 296 + P+K Y+ M LPV +A + + +G V +E+ V+ + + Sbjct: 358 YCAPNKLFEYMMMGLPVVAPSFPGMARIVAGDDVGLCVDPSRPEEIAAAVNRLARDPAAR 417 Query: 297 KQISENTKIISQ 308 ++ N +SQ Sbjct: 418 ARMRANGLRLSQ 429 >UniRef50_C0BNJ6 Glycosyl transferase group 1 n=1 Tax=Flavobacteria bacterium MS024-3C RepID=C0BNJ6_9BACT Length = 400 Score = 41.9 bits (97), Expect = 0.032, Method: Composition-based stats. Identities = 48/343 (13%), Positives = 97/343 (28%), Gaps = 63/343 (18%) Query: 19 RKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLE------NKDVLIFNFPM 72 + D +I ++ G R+ + L ++ + D + P Sbjct: 52 KSDDFEILNEIPVYRANLFSRRNGGALRLFINYFSFAILASVKVRKIKGSFDAIFVYEPS 111 Query: 73 AKPFWHILSFFHRLLKFRIVPLIHDI--DELRGGGG-SDSVRLATCDMVI--SHNPQMTK 127 F + K I D+ + L GG + L + + +N + Sbjct: 112 PITVGIPAIFAKKRFKAPIYFWAQDLWPESLVAAGGVKNKFILEFFNSLTKWIYNHSIKV 171 Query: 128 YLSKYMSQDKIKDIKI------FDYLVSSDVEHRDVTDKQR----------GVIYAGNLS 171 + +D I D I F Y ++ ++ + + + +I+AGN+ Sbjct: 172 LIQSNGFRDYILDQGIPNDKILF-YPNPTEDFYKPLQEVKEYQEFFEKENFNIIFAGNIG 230 Query: 172 RHKC--------SFIYTEGCDFTLFGVNYENKDNP------------KYLGSFDAQSPEK 211 + + I + G + +LGSF E Sbjct: 231 EAQSFITIIEAINNIKELPIKVNVLGDGRYKETAIGLIKDKGLESHFNFLGSFPPT--EM 288 Query: 212 INLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVD 271 L+ S P K YL+ P+ A + D Sbjct: 289 PKFFSHADALL--------VSLKKDKIFSLTIPAKVQSYLACGKPIIASIDGEGAKIVSD 340 Query: 272 NRIGYAV---GS--IKEMQEIVDSMTIETYKQISENTKIISQK 309 + G S + + + + ++ T Q+ N + +K Sbjct: 341 AKCGVTSPAEDSIALSNIIKELMALNKSTLNQMGNNGRAYYEK 383 >UniRef50_A8RE04 Putative uncharacterized protein n=1 Tax=Eubacterium dolichum DSM 3991 RepID=A8RE04_9FIRM Length = 745 Score = 41.9 bits (97), Expect = 0.032, Method: Composition-based stats. Identities = 36/218 (16%), Positives = 75/218 (34%), Gaps = 32/218 (14%) Query: 109 SVRLATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVS-SDVEHRDVTDKQRG--VI 165 + L D VI+ + L++++ + K I YL S+ E ++ + Sbjct: 148 NQSLNAADFVITECDLYREILAEFLDPEHTKTI----YLTKGSEFEQPMISKPMETLHIC 203 Query: 166 YAGNLS---------RHKCSFIYTEGCDFTLFGVNYENKDNPKYLGSFDAQSPEKINLPG 216 Y G+++ R + + G D + AQ E I G Sbjct: 204 YLGSINNIISIDMIVRFLKTLQNYRPIVVDIIGKGETKDDFIR---KLKAQGIETIYH-G 259 Query: 217 MQFGLIWDGDSVE-TCSGAFGDYLKFNN-----PHKTSLYLSMELPVFIWDKAALADFIV 270 FG D + FG + N K+ Y LP+ K ++ Sbjct: 260 ALFG----EDKWKIMNQCHFGINMMINTVRVGLTMKSVDYFEAGLPILNNIKGDTWTYVD 315 Query: 271 DNRIGYAVG--SIKEMQEIVDSMTIETYKQISENTKII 306 + +G+ V +I+++ + + ++++ N + + Sbjct: 316 NFNLGFNVDEKNIEDVARKLAKLDERQFEEMQRNVRDV 353 >UniRef50_Q9YCS3 Glycosyl transferase, group 1 n=1 Tax=Aeropyrum pernix RepID=Q9YCS3_AERPE Length = 383 Score = 41.9 bits (97), Expect = 0.035, Method: Composition-based stats. Identities = 44/286 (15%), Positives = 92/286 (32%), Gaps = 40/286 (13%) Query: 23 LDIASDYENISVVN--IPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHIL 80 I + +N IP + I ++ + + ++ D+ I P+ +L Sbjct: 60 EKIFERSFHFPSINTRIPSLEIAAKIIEYTLSIISITAEAKHYDIAIAQDPITATIAIML 119 Query: 81 SFFHRLLKFRIVPLIHDIDELRGGGGSDSVRLA---TCDMVISHNPQMTKYLSKYMSQDK 137 + + +++ H+ + L D+V + ++ + K ++ Sbjct: 120 KQKNYIS--KVILQSHNFTSPTRSKLYKFLDLYTTTHSDIVWCLSNRLAEIRRKLGAKYT 177 Query: 138 I-KDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIYTE----------GCDFT 186 + I I D D K ++Y G+LS+ K I E Sbjct: 178 VQTPICIRD--DVIDKTLNYTRRKSNDIVYIGSLSKDKGVDILLELVKTFTKNGNDTIIH 235 Query: 187 LFGVNY----------ENKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFG 236 + G E Y G + +I G++ S E+ + Sbjct: 236 IVGKGLLYEKIFERIGEINKRVIYYGPQPLKRALQIA-SRASLGVVLTRPSYESLTTD-- 292 Query: 237 DYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIK 282 P K +YL+ PV + + ++ ++ + G VG +K Sbjct: 293 -------PMKPKVYLAAHTPVILPEYFEISSYVNRFKAGMVVGKLK 331 >UniRef50_B5EVI1 Glycosyltransferase n=6 Tax=Vibrionales RepID=B5EVI1_VIBFM Length = 378 Score = 41.9 bits (97), Expect = 0.036, Method: Composition-based stats. Identities = 29/206 (14%), Positives = 72/206 (34%), Gaps = 19/206 (9%) Query: 106 GSDSVRLATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDY-LVSSDVEH-RDVTDKQRG 163 + + D++++ + ++ K + ++ + D+ L + V +D+ + +R Sbjct: 155 QHEQKLINKADLILAASKKLLIKFPKG--KTQLLTHGV-DFTLFNQPVPRAKDLPNDERP 211 Query: 164 VI-YAGNLSRHKCSFIYTEG------CDFTLFGVNYENKDNPKYLGSFDAQSPEKINLPG 216 + + G+LS + + F G N + K + P+ Sbjct: 212 IAGFYGSLSDWLDYDLLNQVIAENPLWHFVFIGKNELTYNPFKNHPNLHLLGPKMHYHLP 271 Query: 217 MQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGY 276 + W + + + ++ NP K YL+ P+ ALA +I + Sbjct: 272 R-YSQHWQANLLPFVD---NEQIRACNPLKLLEYLATGTPIISTSFPALAPYIAEIH--- 324 Query: 277 AVGSIKEMQEIVDSMTIETYKQISEN 302 V S ++ ++++ N Sbjct: 325 TVNSTQDFTTHLNNIHSNWLNSSVNN 350 >UniRef50_D2EEQ9 Glycosyl transferase group 1 n=1 Tax=Candidatus Parvarchaeum acidiphilum ARMAN-4 RepID=D2EEQ9_9EURY Length = 341 Score = 41.5 bits (96), Expect = 0.039, Method: Composition-based stats. Identities = 46/284 (16%), Positives = 90/284 (31%), Gaps = 76/284 (26%) Query: 86 LLKFRIVPLIHDI------------DELRGGGGSDSVRLA--TCDMVISHNPQMTKYLSK 131 L K +I+ +HD+ LRG + A D ++ ++ Q L Sbjct: 85 LRKRKIILTVHDLAVFGTMKVKGVYGSLRGMSFQKQFKFAVERADTIMVNSTQTRDELIN 144 Query: 132 YMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHK------CSFIYTE---G 182 + D K + D L D K+ + Y G +R K FI + Sbjct: 145 ILKTDPGKI--VVDNLGIEDKFKPLEVKKEGIIGYFGGFNRRKRVDKLIDDFIASSLNAK 202 Query: 183 CDFTLFGVNYE---NKDNPKYLGS--FDAQSPEK-------------INLPGMQFGLIWD 224 +FG N + K + GS F + PE+ FGL Sbjct: 203 LKLVIFGNNEDYPLLKKKYEKFGSIIFKEKIPEEEIVKVINSFDFFVYPTSYEGFGL--- 259 Query: 225 GDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEM 284 + C ++ +P FI+ A + + + I Sbjct: 260 --PLLEC-------------------IACGIPSFIYKDAVIPEEVKKYAIEI-------- 290 Query: 285 QEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDDLK 328 + +D + + Y ++ + ++K++ + ++++ K Sbjct: 291 -DKLDDIITKDYNELQKEFTEKAKKVKEEFSWDKCRANLLEEYK 333 >UniRef50_A9BF89 Glycosyl transferase group 1 n=8 Tax=Thermotogaceae RepID=A9BF89_PETMO Length = 369 Score = 41.5 bits (96), Expect = 0.047, Method: Composition-based stats. Identities = 52/322 (16%), Positives = 118/322 (36%), Gaps = 43/322 (13%) Query: 19 RKDALDIASDYENISVVNI-PLWGGVVQRIISSVKLSTFLCGL---ENKDVLIFNFPMAK 74 + D + + + + I G ++++I+ L +C L EN D+L + +A Sbjct: 38 KDDKEYTDGNIKYLPIKEINETIGNPLKKLINRRPLDKKICDLVAEENYDILYMHHFLAS 97 Query: 75 PFWHILSFFHRLLKFRIVPLIHDIDELRGGGGSD---------------SVRLATCDMVI 119 + K +IV IH+ + +L D+ I Sbjct: 98 KPLDPFKIAKKRNK-KIVYDIHEYHPENFLAELEGMIGNLKVKTVWRFFKKQLDLSDLAI 156 Query: 120 SHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSD---VEHRDVTDKQRGVIYAGNL------ 170 + + + + DK K Y++ + + D+ K++ ++ G + Sbjct: 157 FVSEETRNDVVNKTNIDKEKT-----YIIPNYANFIIKPDIQKKRKEIVLVGKVTRKIED 211 Query: 171 SRHKCSFIYTEGCDFTLFGVNYENKDNPKYLGSFDAQSPEKIN-LPGMQFGLIWDGDSVE 229 + + +G F + G++ + + + + E +N L F LI S Sbjct: 212 EKKILKSLIEKGFSFKIIGMDSKEFMDITHESTEFLPYDEMMNELSNSLFSLI----SYN 267 Query: 230 TCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDK-AALADFIVDNRIGYAVGSIKEMQEIV 288 T + PHK L+ PV + + ++A + + +G + +++E V Sbjct: 268 TVKNRDYKNDIYALPHKFYDSLAAGTPVIVKESFVSMAKQVENLGLGVVIDP-SKVEESV 326 Query: 289 DSMTI--ETYKQISENTKIISQ 308 + +T + Y++I +N + + Sbjct: 327 EKITNAYKNYEKIIKNVEKHQK 348 >UniRef50_C5CAF6 Putative uncharacterized protein n=1 Tax=Micrococcus luteus NCTC 2665 RepID=C5CAF6_MICLC Length = 710 Score = 41.5 bits (96), Expect = 0.049, Method: Composition-based stats. Identities = 20/67 (29%), Positives = 32/67 (47%), Gaps = 3/67 (4%) Query: 221 LIWDGDSVETCSGAF---GDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYA 277 L W + +T G G+Y++F+ P+KT + PV + +ADF+ NR+G Sbjct: 606 LEWYVPATKTVVGLVLLGGEYVRFSFPYKTMSLIERGYPVLCFADMGIADFLERNRVGLG 665 Query: 278 VGSIKEM 284 V E Sbjct: 666 VARSSEA 672 >UniRef50_C6P8E8 Glycosyl transferase group 1 n=1 Tax=Thermoanaerobacterium thermosaccharolyticum DSM 571 RepID=C6P8E8_CLOTS Length = 370 Score = 41.1 bits (95), Expect = 0.058, Method: Composition-based stats. Identities = 19/103 (18%), Positives = 42/103 (40%), Gaps = 3/103 (2%) Query: 231 CSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAV--GSIKEMQEIV 288 C+ + +K++ P K Y++ LPV D L I +N G + I +++ + Sbjct: 264 CTLFPTELIKYSFPLKAIEYMAAGLPVIATDIGDLGKLIKENECGITIKYSVIDFVEKTI 323 Query: 289 DSM-TIETYKQISENTKIISQKIRTGSYFRDVLEEVIDDLKTR 330 D + + +N + ++ F+ + + + L R Sbjct: 324 DLIENRDKMSIYGQNGRNFAKSFDWKELFKKEMSIIFEKLDKR 366 >UniRef50_D1YZX9 Putative glycosyltransferase n=1 Tax=Methanocella paludicola SANAE RepID=D1YZX9_METPS Length = 374 Score = 41.1 bits (95), Expect = 0.061, Method: Composition-based stats. Identities = 13/80 (16%), Positives = 32/80 (40%), Gaps = 7/80 (8%) Query: 240 KFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAV--GSIKEMQEIVDSMTIETYK 297 K+ +P+K + P+ + + A + + + G V G I+ ++ ++ + + Sbjct: 281 KYESPNKLFEAMMCGKPIIVNSEIAASRIVKEENCGILVPYGDIEALENLIKMLKND--- 337 Query: 298 QISENTKIISQKIRTGSYFR 317 EN K + R + Sbjct: 338 --PENRKKLGDNGRNAYISK 355 >UniRef50_D1JK96 WbpH n=1 Tax=Bacteroides sp. 2_1_16 RepID=D1JK96_9BACE Length = 375 Score = 41.1 bits (95), Expect = 0.063, Method: Composition-based stats. Identities = 36/224 (16%), Positives = 63/224 (28%), Gaps = 33/224 (14%) Query: 85 RLLKFRIVPLIHDIDELRGGGGSDSVRLATCDMVISHNPQMTKYLSKYMSQDKIK-DIKI 143 + L ++ V + + ++ D I K ++ + + Sbjct: 124 KRLVYKFVGFV--------YKQYEMIKCKEFDAAIVCYHWTRDRFKKVNDNVELVLNFPL 175 Query: 144 FDYLVSSDVEHRDVTDKQRGVIYAG------NLSRHKCSFIYTEGCDFTLFG------VN 191 D E T + YAG N+ + F L G + Sbjct: 176 ID--RDKVKERPLRTTNDIKICYAGTISDAWNIPTLINAIENLNDVKFNLAGWTDDELMG 233 Query: 192 YENK----DNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKT 247 + Y G Q + G+ S C G G+ NN K Sbjct: 234 RMKSLIGWEKVNYFGKLPKQEVNEKVYSHSDIGVALYHYSP-LCKGKIGNMS--NN--KL 288 Query: 248 SLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSM 291 YL M +PV D + + N G V + I++++ Sbjct: 289 FEYLLMGMPVICTDFDLWKEVVEKNHCGICVNP-SDANAIMEAI 331 >UniRef50_B7L1A4 Glycosyl transferase group 1 n=1 Tax=Methylobacterium chloromethanicum CM4 RepID=B7L1A4_METC4 Length = 388 Score = 40.7 bits (94), Expect = 0.065, Method: Composition-based stats. Identities = 24/133 (18%), Positives = 51/133 (38%), Gaps = 14/133 (10%) Query: 56 FLCGLENK-DVLIFNFPMAKPFWH-ILSFFHRLLKFRIVPLIHDIDELRGG---GGSDSV 110 L L+NK D+ +F + P ++ F R L +R +HD + + Sbjct: 86 VLAVLKNKPDICLFQAWIKLPLLDAMIIRFLRTLGYRCFVTVHDAAPHEQKWWHSVTIPI 145 Query: 111 RLATCDMVISHNPQMTKYLSKYMSQDKIKDI--KIFDYLVSSDVEHRDVTDKQRG----- 163 + D I H+ + L K + + I + D+ S ++ ++ Sbjct: 146 FFKSFDGGICHSNKAIDVLRKLGVRTPLTKIPHGLLDHYQKSSIDKEQISLPPIADDPRV 205 Query: 164 --VIYAGNLSRHK 174 +++ G+++R K Sbjct: 206 FVILFFGHVTRRK 218 >UniRef50_Q467B6 Mannose-6-phosphate isomerase, bifunctional enzyme n=15 Tax=cellular organisms RepID=Q467B6_METBF Length = 458 Score = 40.7 bits (94), Expect = 0.066, Method: Composition-based stats. Identities = 13/55 (23%), Positives = 27/55 (49%) Query: 251 LSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQISENTKI 305 ++ PV ++ ++ + I D G+ V S++E E V + + K+ EN + Sbjct: 269 MASGTPVIAMNRGSMPELIRDGETGFLVNSVEEAAEAVQKLGSISRKKCRENVEK 323 >UniRef50_Q39W08 Putative uncharacterized protein n=1 Tax=Geobacter metallireducens GS-15 RepID=Q39W08_GEOMG Length = 359 Score = 40.7 bits (94), Expect = 0.068, Method: Composition-based stats. Identities = 26/147 (17%), Positives = 54/147 (36%), Gaps = 11/147 (7%) Query: 38 PLWGGVVQRIISSVKLSTFLCGL--ENKDVLIFNFPMAKPFWH-----ILSFFHRLLKFR 90 LW V KLS L + DV+ +P W +L ++ + K + Sbjct: 51 KLWSPYVAESWRFRKLSFILKSIDQRKPDVIFMQYPAEGYGWSLVPHMLLIYYVFIKKIK 110 Query: 91 IVPLIHDIDELRGGGGSDSVR-LATCDMVISHNPQMTKYLSKY--MSQDKIKDIKIFDYL 147 + ++H+ L L+ +I L+K+ K + I Sbjct: 111 FITVLHEYSSLSWKSRFFIRNILSKSSQLIFTTNFELNNLAKHVPGILSKANVLPILS-N 169 Query: 148 VSSDVEHRDVTDKQRGVIYAGNLSRHK 174 + + E + ++D+ ++Y G++ +K Sbjct: 170 IPNPKEIKQISDRSIDILYFGHIRPNK 196 >UniRef50_A3DHW2 Glycosyl transferase, group 1 n=3 Tax=Clostridium thermocellum RepID=A3DHW2_CLOTH Length = 347 Score = 40.7 bits (94), Expect = 0.069, Method: Composition-based stats. Identities = 48/336 (14%), Positives = 105/336 (31%), Gaps = 46/336 (13%) Query: 17 KARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPF 76 K + I + +V+ + + + I K +N +L K Sbjct: 23 KVKSLTEAIKDNIGEHNVLCVDTYNWKKRPIRLLRKCFRLARKCKNIVILPAQN-GIKVL 81 Query: 77 WHILSFFHRLLKFRIVPLIHD--IDELRGGGGSDSVRLATCDMVISHNPQMTKYLSKYMS 134 + S ++L ++ ++ + L D + M++ L ++ Sbjct: 82 VPLFSLINKLFGRKLFYVVIGGWLPTFLKNYKWLVSWLHHMDGIFVETASMSEKLIEFGL 141 Query: 135 QDKIK-----DIKIFD-------YLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIYTEG 182 ++ + ++I D + + + K++G+ A N + E Sbjct: 142 KNVLVMPNFRQLRIVDINELQDTHALPYKLCTFSRVLKEKGIEDAINAVIKVNTDCGREV 201 Query: 183 CDFTLFGVNYENKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFN 242 C ++G E KY +F A + E DY Sbjct: 202 CTLDIYGQIDE-----KYKDAFWAIMSNVPAYIKYK-----GEAPYEKAVDVLKDYYLML 251 Query: 243 NPHKTSLY-------------LSMELPVFIWDKAALADFIVDNRIG--YAVGSIKEMQEI 287 P Y + LPV D ++ + D + G + IKE+ EI Sbjct: 252 FP----TYYEGEGFAGTIIDAFASGLPVIASDWRYNSEIVQDYKTGRIFRTKDIKELAEI 307 Query: 288 VDSMTI--ETYKQISENTKIISQKIRTGSYFRDVLE 321 + + ++ +N ++K +G+ + ++E Sbjct: 308 ILYCLEHGDEVMEMKKNCIEEARKYTSGNAIKKLIE 343 >UniRef50_B5IQU9 Glycosyl transferase, group 1 family protein n=1 Tax=Thermococcus barophilus MP RepID=B5IQU9_9EURY Length = 396 Score = 40.7 bits (94), Expect = 0.070, Method: Composition-based stats. Identities = 48/335 (14%), Positives = 108/335 (32%), Gaps = 40/335 (11%) Query: 17 KARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPF 76 K D + + + Y+ I + + I + + ++ +V++ P P Sbjct: 61 KKDGDIIRLFT-YQPIKKDASLIERTLYYTIFPVLASIWLVFNRKSSNVILVTSPP--PQ 117 Query: 77 WHILSFFHRLLKFRIVPLIHDI-------------DEL--RGGGGSDSVRLATCDMVISH 121 ++++ +L++ +++ + D+ L R +S L D V Sbjct: 118 MYLIALIGKLMRKKVIVDVRDLFLDVSVNLGFIKKGSLIERIFRFLESKALQKADAVTLV 177 Query: 122 NPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDV----TDKQRGVIYAG--------- 168 P++ L + + K Y+V + V+ + ++ ++YAG Sbjct: 178 TPKIRHQLVEEYGINPAKC-----YVVPNGVDLETFKCDKSKRKLQMVYAGYFGHAQDFD 232 Query: 169 NLSRHKCSFIYTEGCDFTLFGVNYENKD---NPKYLGSFDAQSPEKINLPGMQFGLIWDG 225 + E L G +D N + LG + L+ Sbjct: 233 TFLKGYALLRENERVPLILAGGGETLEDVLKNVEKLGISKWIKYVGMLSRKDVVKLL-CS 291 Query: 226 DSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQ 285 S+ + LK+ P K YL+ LP + ++R G + +E+ Sbjct: 292 SSIGVAPIKVDESLKYAIPSKIYEYLACGLPFIGVGVGEIEKIAEESRAGCVGKTPEEVA 351 Query: 286 EIVDSMTIETYKQISENTKIISQKIRTGSYFRDVL 320 E + + +++ + S + L Sbjct: 352 ECIMKLLNSNLEKLKVRALRYVTRFSRESSAKKFL 386 >UniRef50_B5M6M0 Glycosyltransferase n=2 Tax=Kosmotoga olearia TBF 19.5.1 RepID=B5M6M0_KOSOT Length = 381 Score = 40.7 bits (94), Expect = 0.074, Method: Composition-based stats. Identities = 41/292 (14%), Positives = 96/292 (32%), Gaps = 42/292 (14%) Query: 44 VQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDELRG 103 V R ++ + L I +F KP + +I+ H+ Sbjct: 77 VNRYKYEREVLRIVDKLSFDLAYIHHFATVKPL--AIFRLLSRKNVKIITDFHEYVPEEY 134 Query: 104 GGGSDSV---------------RLATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLV 148 G + + + D + + + +L D+K F++ Sbjct: 135 LFGVEQIPRSIKMWLGQKLYRHMIMKSDGTVFVSKK---FLEDAKEWKP--DLKAFNFPN 189 Query: 149 --SSDVEHRDVTDKQRGVIYAGN----LSRHKCSFIYTEGCDFTLFGVNYENKDNPKYLG 202 + + + +Q+ VI+AG + F F++ + + K + Sbjct: 190 YGNLKIPPINQIKRQKEVIFAGTTERKIENELKIFEILNSKGFSIVSIGTDIKAPFEIKK 249 Query: 203 SFDAQSPEKIN-LPGMQFGLIWDGDSVETCSGAFGDYLK---FNNPHKTSLYLSMELPVF 258 + + I + F ++ +C G+YL ++ P+K ++ PV Sbjct: 250 LPFLKYEKMIERISNAAFSIV-----SYSCRNKKGNYLNKYVYSMPNKFFDSIAAGTPVI 304 Query: 259 IWDK-AALADFIVDNRIGYAV--GSIKEMQEIVDSM--TIETYKQISENTKI 305 + + + I ++ IG + + KE E + + + Y+++ N Sbjct: 305 LDKDFLGMRELIENDGIGVVIDRDNPKESAEKITAFWESKVEYEKLLLNISR 356 >UniRef50_A6LJY0 Putative uncharacterized protein n=1 Tax=Thermosipho melanesiensis BI429 RepID=A6LJY0_THEM4 Length = 368 Score = 40.7 bits (94), Expect = 0.077, Method: Composition-based stats. Identities = 36/282 (12%), Positives = 88/282 (31%), Gaps = 41/282 (14%) Query: 43 VVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDELR 102 + S+ + F ++++ P H + K + ++L Sbjct: 96 YFHYFLVSMPVKAFKVAKNKGKKVVYDLHEYHPENHFKNLKGLAKKVK--------EKLM 147 Query: 103 GGGGSDSVRLATCDMVISHNPQ----MTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVT 158 + D +I + + M L + I + Sbjct: 148 WKVIKNQFFF--SDKLIFVSEEARNDMLNILKTHKDSIVI---------PNYANIKLKSP 196 Query: 159 DKQRGVIYAG----NLSRHKC--SFIYTEGCDFTLFGVNYENKDNP--KYLGSFDAQSPE 210 +K + ++ G N+ + + EG + G+ ++ KY Sbjct: 197 EKIKEIVIVGKTPRNIQNEREILKNLNKEGFSIKIVGIKTNILNDIPCKYTDVLPYDKM- 255 Query: 211 KINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDK-AALADFI 269 + + F LI S ++ + +Y+ ++ PHK ++ PV + ++ + + Sbjct: 256 MVEVSKSAFSLI----SYKSFGQEYKNYI-YSFPHKFFDSIAAGTPVIVNRSFVSMKNEV 310 Query: 270 VDNRIGYAVG--SIKEMQEIVDSMTIETYKQISENTKIISQK 309 IG + ++KE + Y++ EN + + Sbjct: 311 EKYGIGIVIEPQNVKESVRKILEAYKN-YEKFLENIETYKDR 351 >UniRef50_Q8RBZ2 Predicted glycosyltransferases n=1 Tax=Thermoanaerobacter tengcongensis RepID=Q8RBZ2_THETN Length = 411 Score = 40.3 bits (93), Expect = 0.094, Method: Composition-based stats. Identities = 16/91 (17%), Positives = 38/91 (41%), Gaps = 9/91 (9%) Query: 243 NPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGS-----IKEMQEIVDSMTIETYK 297 +P+K YL+ P I + +A + I + G V + E + +++ + Sbjct: 324 SPNKIFDYLASGRP-IISNVSASKEIIEEANAGIIVPPENPKLLAEGILKIKNLSEKERN 382 Query: 298 QISENTKIISQKIRTGSYFRDVLEEVIDDLK 328 Q+ N + + + + E++I +L+ Sbjct: 383 QMGLNGRKY---VEQHYDIKKLTEKLIKELE 410 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P37749 Uncharacterized protein yefG n=5 Tax=Escherichia... 350 3e-95 UniRef50_UPI000196921F hypothetical protein BACCELL_02894 n=1 Ta... 335 2e-90 UniRef50_A0Z7X9 Putative uncharacterized protein n=1 Tax=marine ... 330 3e-89 UniRef50_A7AHD2 Putative uncharacterized protein n=1 Tax=Parabac... 328 1e-88 UniRef50_C2F0P9 Galactofuranosyltransferase n=2 Tax=Lactobacillu... 327 4e-88 UniRef50_C9LJY2 Putative uncharacterized protein n=1 Tax=Prevote... 323 8e-87 UniRef50_A5ZF92 Putative uncharacterized protein n=2 Tax=Bactero... 322 1e-86 UniRef50_C2E8T4 Possible galactofuranosyltransferase n=1 Tax=Lac... 319 5e-86 UniRef50_D1PTN6 Putative uncharacterized protein n=1 Tax=Prevote... 319 7e-86 UniRef50_C0BRQ3 Putative uncharacterized protein n=2 Tax=Bifidob... 315 1e-84 UniRef50_Q03GL2 Glycosyltransferase n=1 Tax=Pediococcus pentosac... 309 7e-83 UniRef50_Q1WU31 Galactofuranosyltransferase n=2 Tax=Lactobacillu... 309 8e-83 UniRef50_Q4JYT0 Putative glycosyl transferase n=1 Tax=Streptococ... 306 7e-82 UniRef50_C0YXY6 Possible galactofuranosyltransferase n=5 Tax=Lac... 306 8e-82 UniRef50_D1PDY2 Putative galactofuranosyltransferase n=1 Tax=Pre... 304 2e-81 UniRef50_C0WVC5 Possible galactofuranosyltransferase n=2 Tax=Lac... 303 5e-81 UniRef50_C2FKS4 Possible galactofuranosyltransferase n=1 Tax=Lac... 303 8e-81 UniRef50_C2EVL7 Possible galactofuranosyltransferase n=1 Tax=Lac... 301 3e-80 UniRef50_Q4JZC8 Putative glycosyl transferase n=2 Tax=Streptococ... 299 6e-80 UniRef50_D0RXG2 Galactofuranose transferase n=13 Tax=Streptococc... 298 1e-79 UniRef50_B0BR56 Glycosyltransferase n=1 Tax=Actinobacillus pleur... 298 2e-79 UniRef50_Q4JYV0 Putative glycosyl transferase n=2 Tax=Streptococ... 298 3e-79 UniRef50_D0BKT6 Galactofuranosyltransferase n=1 Tax=Granulicatel... 295 2e-78 UniRef50_C9A0R8 Putative uncharacterized protein n=1 Tax=Enteroc... 291 3e-77 UniRef50_C9LPN1 Galactofuranosyltransferase n=2 Tax=Veillonellac... 288 2e-76 UniRef50_B0N1W1 Putative uncharacterized protein n=1 Tax=Clostri... 288 2e-76 UniRef50_C2EH03 Possible galactofuranosyltransferase n=1 Tax=Lac... 285 1e-75 UniRef50_C7IU57 Putative uncharacterized protein n=1 Tax=Thermoa... 285 1e-75 UniRef50_C7XW37 Glycosyltransferase n=1 Tax=Lactobacillus coleoh... 279 7e-74 UniRef50_UPI000196CD65 hypothetical protein CATMIT_02517 n=1 Tax... 277 3e-73 UniRef50_D0R4M2 Putative glycosyltransferase n=1 Tax=Lactobacill... 276 5e-73 UniRef50_Q032N6 Glycosyltransferase n=1 Tax=Lactococcus lactis s... 276 6e-73 UniRef50_B0P5G1 Putative uncharacterized protein n=1 Tax=Clostri... 275 1e-72 UniRef50_Q7P740 Nucleotide sugar synthetase n=1 Tax=Fusobacteriu... 269 7e-71 UniRef50_B1MXC4 Glycosyltransferase n=3 Tax=Leuconostoc RepID=B1... 267 4e-70 UniRef50_C7TE97 Glycosyl transferase,galactofuranosyltransferase... 266 1e-69 UniRef50_A8RK64 Putative uncharacterized protein n=1 Tax=Clostri... 262 1e-68 UniRef50_UPI0001968A2E hypothetical protein BACCELL_04078 n=1 Ta... 260 5e-68 UniRef50_A3CM54 Nucleotide sugar synthetase-like protein, putati... 260 6e-68 UniRef50_A7HN15 Galactofuranosyltransferase n=1 Tax=Fervidobacte... 257 3e-67 UniRef50_C7TIE1 Glycosyl transferase, galactofuranosyltransferas... 254 2e-66 UniRef50_Q3DVD0 Nucleotide sugar synthetase-like protein n=9 Tax... 253 7e-66 UniRef50_B1I7N2 Nss n=10 Tax=Streptococcus pneumoniae RepID=B1I7... 252 1e-65 UniRef50_C3QC04 Galactofuranosyltransferase n=3 Tax=Bacteroides ... 250 5e-65 UniRef50_C6Z1L9 Glycosyltransferase n=1 Tax=Bacteroides sp. 4_3_... 249 1e-64 UniRef50_C7G7A4 Putative uncharacterized protein n=1 Tax=Rosebur... 248 2e-64 UniRef50_A2RHU2 Putative galactofuranose transferase n=1 Tax=Lac... 246 7e-64 UniRef50_B5A7L9 Nucleotide sugar synthetase-like protein n=3 Tax... 243 5e-63 UniRef50_C0XA00 Possible galactofuranosyl transferase n=3 Tax=La... 241 2e-62 UniRef50_C4ZG42 Putative uncharacterized protein n=1 Tax=Eubacte... 238 2e-61 UniRef50_Q042V6 Glycosyltransferase n=4 Tax=Lactobacillus gasser... 235 1e-60 UniRef50_C4Z1X5 Putative uncharacterized protein n=1 Tax=Eubacte... 227 3e-58 UniRef50_Q04DG9 Glycosyltransferase n=1 Tax=Oenococcus oeni PSU-... 225 1e-57 UniRef50_B1MVL6 Putative glycosyl transferase n=1 Tax=Leuconosto... 219 1e-55 UniRef50_Q5ULS2 Orf42 n=1 Tax=Lactobacillus phage LP65 RepID=Q5U... 217 5e-55 UniRef50_Q03A82 Glycosyltransferase n=1 Tax=Lactobacillus casei ... 197 5e-49 UniRef50_C4ZG41 Putative uncharacterized protein n=1 Tax=Eubacte... 192 1e-47 UniRef50_B8EDB2 Putative uncharacterized protein n=1 Tax=Shewane... 127 4e-28 UniRef50_B9KB31 Putative uncharacterized protein n=1 Tax=Thermot... 127 4e-28 UniRef50_B8DZD9 Glycosyl transferase group 1 n=2 Tax=Bacteria Re... 122 2e-26 UniRef50_B0VJD0 Putative uncharacterized protein n=1 Tax=Candida... 118 3e-25 UniRef50_C4N530 Putative glycosyltransferase n=1 Tax=Capnocytoph... 117 5e-25 UniRef50_A8TGW1 Glycosyl transferase group 1 n=2 Tax=Methanococc... 110 1e-22 UniRef50_A8U9E3 Putative uncharacterized protein n=1 Tax=Carnoba... 98 3e-19 UniRef50_C2EWJ6 Putative uncharacterized protein n=1 Tax=Lactoba... 94 5e-18 UniRef50_C5RCT8 Possible transposase n=2 Tax=Lactobacillales Rep... 92 3e-17 UniRef50_A3XA25 Glycosyl transferase, group 1 n=1 Tax=Roseobacte... 89 2e-16 UniRef50_C5RCT9 Putative uncharacterized protein n=1 Tax=Weissel... 78 3e-13 Sequences not found previously or not previously below threshold: UniRef50_A6T0A3 Glycosyltransferase n=4 Tax=Betaproteobacteria R... 78 5e-13 UniRef50_B5YCR2 WbpH n=10 Tax=Bacteria RepID=B5YCR2_DICT6 77 1e-12 UniRef50_B5JXL1 WblG protein n=2 Tax=Gammaproteobacteria RepID=B... 72 2e-11 UniRef50_B9P363 Glycosyl transferase, group 1 n=1 Tax=Prochloroc... 70 1e-10 UniRef50_A9BF89 Glycosyl transferase group 1 n=8 Tax=Thermotogac... 69 3e-10 UniRef50_C0BNJ6 Glycosyl transferase group 1 n=1 Tax=Flavobacter... 68 3e-10 UniRef50_Q1ILE2 Glycosyl transferase, group 1 n=1 Tax=Candidatus... 67 1e-09 UniRef50_A6LCB7 Glycosyltransferase family 4 n=6 Tax=Bacteroidal... 65 3e-09 UniRef50_B3CFJ1 Putative uncharacterized protein n=1 Tax=Bactero... 65 3e-09 UniRef50_A3CXX8 Glycosyl transferase, group 1 n=4 Tax=Methanomic... 63 1e-08 UniRef50_A8F8F9 Putative uncharacterized protein n=1 Tax=Thermot... 62 2e-08 UniRef50_A7ZC10 Glycosyl transferase, group 1 family protein n=1... 61 8e-08 UniRef50_B5M6M0 Glycosyltransferase n=2 Tax=Kosmotoga olearia TB... 60 1e-07 UniRef50_A3WUD4 Predicted glycosyltransferase n=1 Tax=Nitrobacte... 60 1e-07 UniRef50_Q8AAS2 Lipopolysaccharide biosynthesis RfbU-related pro... 60 1e-07 UniRef50_B6YUV9 Glycosyltransferase n=1 Tax=Thermococcus onnurin... 60 1e-07 UniRef50_A4XKF8 Glycosyl transferase, group 1 n=1 Tax=Caldicellu... 59 2e-07 UniRef50_Q1NU87 Glycosyl transferase, group 1 n=2 Tax=Proteobact... 59 3e-07 UniRef50_C1XLL8 Glycosyltransferase n=1 Tax=Meiothermus ruber DS... 58 3e-07 UniRef50_C6LL02 Putative glycosyltransferase n=1 Tax=Bryantella ... 58 4e-07 UniRef50_C8P2S3 Glycosyltransferase n=1 Tax=Erysipelothrix rhusi... 58 5e-07 UniRef50_Q7NQL7 Probable glycosyltransferase n=1 Tax=Chromobacte... 58 5e-07 UniRef50_C4G144 Putative uncharacterized protein n=1 Tax=Abiotro... 58 6e-07 UniRef50_C5U4Y0 Glycosyl transferase group 1 n=1 Tax=Methanocald... 58 6e-07 UniRef50_C6I9C3 Glycosyltransferase n=2 Tax=Bacteroides RepID=C6... 57 7e-07 UniRef50_A7ZF00 Glycosyl transferase, group 1 n=1 Tax=Campylobac... 57 7e-07 UniRef50_B5IQU9 Glycosyl transferase, group 1 family protein n=1... 57 9e-07 UniRef50_A5FL46 Glycosyltransferase family 4 n=1 Tax=Flavobacter... 57 1e-06 UniRef50_C5A2Y6 Glycosyltransferase, family 1, putative n=1 Tax=... 57 1e-06 UniRef50_UPI0001C4246F glycosyltransferase n=1 Tax=Bacillus pseu... 57 1e-06 UniRef50_B9Z8K7 Glycosyl transferase group 1 n=1 Tax=Lutiella ni... 57 1e-06 UniRef50_D1N7C5 Glycosyl transferase group 1 n=1 Tax=Victivallis... 56 2e-06 UniRef50_C4L5H1 Glycosyl transferase group 1 n=1 Tax=Exiguobacte... 56 2e-06 UniRef50_C6LKZ9 Putative glycosyl transferase n=1 Tax=Bryantella... 56 2e-06 UniRef50_D1JK96 WbpH n=1 Tax=Bacteroides sp. 2_1_16 RepID=D1JK96... 56 2e-06 UniRef50_A3I1Z7 Putative glycosyltransferase protein n=1 Tax=Alg... 56 2e-06 UniRef50_A3U615 Lipopolysaccharide biosynthesis protein, putativ... 56 2e-06 UniRef50_A8RK70 Putative uncharacterized protein n=2 Tax=Clostri... 55 3e-06 UniRef50_Q2C5V3 Putative uncharacterized protein n=1 Tax=Photoba... 55 3e-06 UniRef50_A9A104 Glycosyl transferase group 1 n=1 Tax=Desulfococc... 55 3e-06 UniRef50_C2HRD4 Glycosyl transferase group 1 family protein n=1 ... 55 3e-06 UniRef50_C7M4B3 Glycosyl transferase group 1 n=6 Tax=Bacteroidet... 55 3e-06 UniRef50_C1CBD3 Capsular polysaccharide biosynthesis protein Cps... 55 3e-06 UniRef50_Q2YUX0 Capsular polysaccharide synthesis enzyme CapL n=... 55 4e-06 UniRef50_O26550 LPS biosynthesis RfbU related protein n=1 Tax=Me... 55 5e-06 UniRef50_Q9UZI9 Putative glycosyltransferase, family 1 n=1 Tax=P... 54 6e-06 UniRef50_B5JPV4 Glycosyl transferase, group 2 family protein n=1... 54 8e-06 UniRef50_C7PLX0 Glycosyl transferase group 1 n=1 Tax=Chitinophag... 54 9e-06 UniRef50_Q1GZQ5 Glycosyl transferase, group 1 n=1 Tax=Methylobac... 53 1e-05 UniRef50_B0RZK4 Capsular polysaccharide synthesis related protei... 53 1e-05 UniRef50_Q8RBZ2 Predicted glycosyltransferases n=1 Tax=Thermoana... 53 1e-05 UniRef50_D1PQ72 Capsular polysaccharide biosynthesis protein Cps... 53 1e-05 UniRef50_A1TKR8 Glycosyl transferase, group 1 n=1 Tax=Acidovorax... 53 1e-05 UniRef50_C7PGD4 Glycosyl transferase group 1 n=2 Tax=Chitinophag... 53 1e-05 UniRef50_Q1Q6V4 Putative uncharacterized protein n=1 Tax=Candida... 53 1e-05 UniRef50_C4Z1X4 Putative uncharacterized protein n=1 Tax=Eubacte... 53 1e-05 UniRef50_A6A5R7 Glycosyl transferase, group 1 n=2 Tax=Vibrio cho... 53 1e-05 UniRef50_C2HHA8 Glycosyltransferase n=1 Tax=Finegoldia magna ATC... 53 1e-05 UniRef50_C9RVT7 Glycosyl transferase group 1 n=4 Tax=Bacillaceae... 53 2e-05 UniRef50_B2UPC2 Glycosyl transferase group 1 n=1 Tax=Akkermansia... 53 2e-05 UniRef50_B6FJA1 Putative uncharacterized protein n=1 Tax=Clostri... 53 2e-05 UniRef50_A6LJY0 Putative uncharacterized protein n=1 Tax=Thermos... 53 2e-05 UniRef50_C1TR28 Glycosyltransferase n=1 Tax=Dethiosulfovibrio pe... 53 2e-05 UniRef50_C6A091 Glycosyltransferase n=1 Tax=Thermococcus sibiric... 53 2e-05 UniRef50_A0PZR3 Putative glycosyl transferase n=14 Tax=Clostridi... 52 3e-05 UniRef50_Q1WS01 Glycosyltransferase n=2 Tax=Lactobacillus saliva... 52 3e-05 UniRef50_A7V9M4 Putative uncharacterized protein n=1 Tax=Bactero... 52 3e-05 UniRef50_C3WZE2 Glycosyltransferase n=2 Tax=Fusobacterium RepID=... 52 3e-05 UniRef50_Q8CX81 Glycosyltransferase (Capsular polysaccharide syn... 52 3e-05 UniRef50_B9DUI8 Putative glycosyl transferase n=1 Tax=Streptococ... 52 3e-05 UniRef50_Q2RKP7 Glycosyl transferase, group 1 n=1 Tax=Moorella t... 52 3e-05 UniRef50_C6P8E8 Glycosyl transferase group 1 n=1 Tax=Thermoanaer... 52 3e-05 UniRef50_B5IDW8 Glycosyl transferase, group 1 family protein n=2... 52 3e-05 UniRef50_C1RJB3 Glycosyltransferase n=1 Tax=Cellulomonas flavige... 52 3e-05 UniRef50_B2T0K0 Glycosyl transferase family 2 n=1 Tax=Burkholder... 52 4e-05 UniRef50_Q9WZ99 Lipopolysaccharide biosynthesis protein n=1 Tax=... 52 4e-05 UniRef50_Q13D55 Glycosyl transferase, group 1 n=1 Tax=Rhodopseud... 52 4e-05 UniRef50_A3HV73 Predicted glycosyltransferase n=1 Tax=Algoriphag... 52 4e-05 UniRef50_Q2LWM1 Glycosyltransferase n=1 Tax=Syntrophus aciditrop... 51 5e-05 UniRef50_Q2ILL1 Glycosyl transferase, group 1 n=1 Tax=Anaeromyxo... 51 5e-05 UniRef50_Q6LYB1 Glycosyl transferase, group 1 n=1 Tax=Methanococ... 51 6e-05 UniRef50_A8ZY94 Glycosyl transferase group 1 n=1 Tax=Desulfococc... 51 6e-05 UniRef50_A1EL38 Epimerase/dehydratase n=3 Tax=Vibrio cholerae Re... 51 6e-05 UniRef50_C5EKE4 Glycosyl transferase n=1 Tax=Clostridiales bacte... 51 6e-05 UniRef50_D1YZX9 Putative glycosyltransferase n=1 Tax=Methanocell... 51 6e-05 UniRef50_Q47GL0 Colanic acid biosynthesis glycosyl-transferase n... 51 6e-05 UniRef50_B7JGK2 Glycosyl transferase, group 1 family protein n=1... 51 6e-05 UniRef50_B5EVI1 Glycosyltransferase n=6 Tax=Vibrionales RepID=B5... 51 7e-05 UniRef50_B8E1B1 Glycosyl transferase group 1 n=1 Tax=Dictyoglomu... 51 8e-05 UniRef50_B9ZCT2 Glycosyl transferase group 1 n=1 Tax=Natrialba m... 50 9e-05 UniRef50_Q1Q1W3 Similar to capsular polysaccharide biosynthesis ... 50 9e-05 UniRef50_B6BKD9 Capsular polysaccharide biosynthesis protein Cps... 50 1e-04 UniRef50_B1CBD7 Putative uncharacterized protein n=1 Tax=Anaerof... 50 1e-04 UniRef50_B2UYP0 WblI protein n=6 Tax=Clostridium RepID=B2UYP0_CLOBA 50 1e-04 UniRef50_Q2FQD1 Glycosyl transferase, group 1 n=1 Tax=Methanospi... 50 1e-04 UniRef50_A6L9I7 Glycosyltransferase family 4 n=5 Tax=Bacteroidal... 50 1e-04 UniRef50_C6PFB8 Glycosyl transferase group 1 n=2 Tax=Bacteria Re... 50 1e-04 UniRef50_C2WF92 Glycosyl transferase group 1 n=2 Tax=Bacillus ce... 50 1e-04 UniRef50_A8RE04 Putative uncharacterized protein n=1 Tax=Eubacte... 50 1e-04 UniRef50_C0C2C9 Putative uncharacterized protein n=1 Tax=Clostri... 50 1e-04 UniRef50_A3SJ03 Probable glycosyltransferase n=1 Tax=Roseovarius... 50 1e-04 UniRef50_B7AUJ6 Putative uncharacterized protein n=1 Tax=Bactero... 50 1e-04 UniRef50_Q0AZ01 Capsular polysaccharide biosynthesis protein Cps... 50 2e-04 UniRef50_Q0W4G3 Glycosyltransferase (Group 1) n=1 Tax=uncultured... 49 2e-04 UniRef50_B0VF96 Putative uncharacterized protein n=1 Tax=Candida... 49 2e-04 UniRef50_Q5LH96 Possible capsular polysaccharide related protein... 49 2e-04 UniRef50_A5FN98 Candidate alpha-glycosyltransferase; Glycosyltra... 49 3e-04 UniRef50_B5YA59 Glycosyl transferase, group 1 family protein n=1... 49 3e-04 UniRef50_A8UK17 Putative uncharacterized protein n=1 Tax=Flavoba... 48 3e-04 UniRef50_Q9YCS3 Glycosyl transferase, group 1 n=1 Tax=Aeropyrum ... 48 3e-04 UniRef50_A6GZ44 Probable L-fucosamine transferase n=1 Tax=Flavob... 48 3e-04 UniRef50_Q2LPR2 Glycosyltransferase n=1 Tax=Syntrophus aciditrop... 48 3e-04 UniRef50_B4S4G0 Glycosyl transferase group 1 n=1 Tax=Prosthecoch... 48 4e-04 UniRef50_C0VZT2 Glycosyltransferase n=1 Tax=Actinomyces coleocan... 48 4e-04 UniRef50_B8DZE0 Glycosyl transferase group 1 n=1 Tax=Dictyoglomu... 48 4e-04 UniRef50_A5I5D7 Glycosyl transferase, group 1 family n=3 Tax=Clo... 48 4e-04 UniRef50_Q55374 Slr0907 protein n=2 Tax=Chroococcales RepID=Q553... 48 4e-04 UniRef50_Q73L31 Glycosyl transferase, group 1 family protein n=1... 48 4e-04 UniRef50_B5IAN8 Glycosyl transferase, group 1 family protein n=3... 48 4e-04 UniRef50_C6IGY3 Putative uncharacterized protein n=1 Tax=Bactero... 48 5e-04 UniRef50_A5I5E4 Glycosyl transferase, group 1 family n=2 Tax=Clo... 48 5e-04 UniRef50_A4SDY1 Glycosyl transferase, group 1 n=1 Tax=Chlorobium... 48 5e-04 UniRef50_C3AT67 Glycosyl transferase, group 1 n=3 Tax=Bacillus R... 48 5e-04 UniRef50_B7KM09 Glycosyl transferase group 1 n=5 Tax=Cyanobacter... 48 5e-04 UniRef50_D1Y981 Glycosyltransferase, group 1 family protein n=2 ... 48 6e-04 UniRef50_B0KUI7 Glycosyl transferase group 1 n=1 Tax=Pseudomonas... 48 6e-04 UniRef50_B9EAW2 Putative uncharacterized protein n=1 Tax=Macroco... 48 6e-04 UniRef50_B3QYT6 Glycosyl transferase group 1 n=1 Tax=Chloroherpe... 48 6e-04 UniRef50_P72922 Slr1085 protein n=1 Tax=Synechocystis sp. PCC 68... 48 7e-04 UniRef50_B5YDS1 Glycosyltransferase n=1 Tax=Dictyoglomus thermop... 47 7e-04 UniRef50_B9MRN9 Glycosyl transferase group 1 n=1 Tax=Anaerocellu... 47 7e-04 UniRef50_B3QUF3 Glycosyl transferase group 1 n=1 Tax=Chloroherpe... 47 7e-04 UniRef50_C2CI25 L-fucosamine transferase n=1 Tax=Anaerococcus te... 47 7e-04 UniRef50_B9YG53 Glycosyl transferase group 1 n=1 Tax='Nostoc azo... 47 7e-04 UniRef50_C7NW83 Glycosyl transferase group 1 n=1 Tax=Halomicrobi... 47 8e-04 UniRef50_A6DJF7 Putative glycosyl transferase n=1 Tax=Lentisphae... 47 0.001 UniRef50_A1S0Y8 Glycosyl transferase, group 1 n=1 Tax=Thermofilu... 47 0.001 UniRef50_C1I3K1 Glycosyltransferase n=1 Tax=Clostridium sp. 7_2_... 47 0.001 UniRef50_A6AFA8 Putative uncharacterized protein n=1 Tax=Vibrio ... 47 0.001 UniRef50_B0G7U5 Putative uncharacterized protein n=2 Tax=Lachnos... 47 0.001 UniRef50_A5I5D9 Glycosyl transferase, group 1 family n=3 Tax=Clo... 47 0.001 UniRef50_D2QRL3 Glycosyl transferase group 1 n=1 Tax=Spirosoma l... 46 0.001 UniRef50_Q2N6D5 Glycosyl transferase, group 1 family protein n=1... 46 0.001 UniRef50_C1FUA9 Glycosyl transferase, group 1 family n=1 Tax=Clo... 46 0.001 UniRef50_B1C0Q5 Putative uncharacterized protein n=1 Tax=Clostri... 46 0.001 UniRef50_Q20YQ8 Glycosyl transferase, group 1 n=1 Tax=Rhodopseud... 46 0.001 UniRef50_C1DVV7 Putative glycosyl transferase, group 1 n=2 Tax=S... 46 0.001 UniRef50_Q1AYI8 Glycosyl transferase, group 1 n=1 Tax=Rubrobacte... 46 0.001 UniRef50_B9YBN8 Putative uncharacterized protein n=1 Tax=Holdema... 46 0.002 UniRef50_B4BR56 Glycosyl transferase group 1 n=1 Tax=Geobacillus... 46 0.002 UniRef50_A0PZ08 Putative uncharacterized protein n=1 Tax=Clostri... 46 0.002 UniRef50_A5D386 Glycosyltransferase n=1 Tax=Pelotomaculum thermo... 46 0.002 UniRef50_D2F291 Glycosyl transferase n=1 Tax=Bacteroides sp. D20... 46 0.002 UniRef50_A9BJB8 Glycosyl transferase group 1 n=1 Tax=Petrotoga m... 46 0.002 UniRef50_B1YL51 Glycosyl transferase group 1 n=1 Tax=Exiguobacte... 46 0.002 UniRef50_A5N237 Predicted glycosyltransferase n=2 Tax=Clostridiu... 46 0.002 UniRef50_A3TY33 Putative uncharacterized protein n=1 Tax=Oceanic... 46 0.002 UniRef50_C5CAF6 Putative uncharacterized protein n=1 Tax=Microco... 46 0.002 UniRef50_C1FUA3 Glycosyl transferase, group 1 family n=1 Tax=Clo... 46 0.002 UniRef50_B3EBP3 Glycosyl transferase family 2 n=1 Tax=Geobacter ... 46 0.002 UniRef50_Q5V5W8 Glycosyltransferase n=1 Tax=Haloarcula marismort... 46 0.002 UniRef50_A9KI03 Glycosyl transferase group 1 n=1 Tax=Clostridium... 46 0.003 UniRef50_B5EGU3 Putative uncharacterized protein n=1 Tax=Geobact... 45 0.003 UniRef50_A6TJS0 Glycosyl transferase, group 1 n=1 Tax=Alkaliphil... 45 0.003 UniRef50_Q7M7N2 CAPSULAR POLYSACCHARIDE SYNTHESIS ENZYME CAP5I ,... 45 0.003 UniRef50_C3WK21 L-fucosamine transferase n=3 Tax=Fusobacterium R... 45 0.003 UniRef50_D1Y955 Glycosyltransferase, group 1 family protein n=3 ... 45 0.003 UniRef50_B0KTZ0 Glycosyl transferase group 1 n=1 Tax=Pseudomonas... 45 0.003 UniRef50_C6NU25 Glycosyl transferase, group 1 n=1 Tax=Acidithiob... 45 0.003 UniRef50_B3QYV4 Glycosyl transferase group 1 n=1 Tax=Chloroherpe... 45 0.003 UniRef50_B8E2Q1 Glycosyl transferase group 1 n=1 Tax=Dictyoglomu... 45 0.003 UniRef50_B6IYH8 Glycosyl transferase, group 1 family protein n=1... 45 0.003 UniRef50_B9YYP4 Glycosyl transferase group 1 n=1 Tax=Lutiella ni... 45 0.003 UniRef50_C2INF3 Glycosyl transferase family 2 n=1 Tax=Vibrio cho... 45 0.003 UniRef50_Q83GT6 Glycosyltransferase domain-containing protein n=... 45 0.003 UniRef50_B1C0Q1 Putative uncharacterized protein n=1 Tax=Clostri... 45 0.003 UniRef50_A0YEZ9 Putative uncharacterized protein n=1 Tax=marine ... 45 0.004 UniRef50_Q39W08 Putative uncharacterized protein n=1 Tax=Geobact... 45 0.004 UniRef50_C6JNS5 Putative uncharacterized protein n=1 Tax=Fusobac... 45 0.004 UniRef50_Q12KU7 Glycosyl transferase, group 1 n=1 Tax=Shewanella... 45 0.004 UniRef50_D1C8N3 Glycosyl transferase group 1 n=1 Tax=Sphaerobact... 45 0.004 UniRef50_A3DHW2 Glycosyl transferase, group 1 n=3 Tax=Clostridiu... 45 0.004 >UniRef50_P37749 Uncharacterized protein yefG n=5 Tax=Escherichia coli RepID=YEFG_ECOLI Length = 330 Score = 350 bits (899), Expect = 3e-95, Method: Composition-based stats. Identities = 330/330 (100%), Positives = 330/330 (100%) Query: 1 MYFLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGL 60 MYFLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGL Sbjct: 1 MYFLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGL 60 Query: 61 ENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDELRGGGGSDSVRLATCDMVIS 120 ENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDELRGGGGSDSVRLATCDMVIS Sbjct: 61 ENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDELRGGGGSDSVRLATCDMVIS 120 Query: 121 HNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIYT 180 HNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIYT Sbjct: 121 HNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIYT 180 Query: 181 EGCDFTLFGVNYENKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLK 240 EGCDFTLFGVNYENKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLK Sbjct: 181 EGCDFTLFGVNYENKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLK 240 Query: 241 FNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQIS 300 FNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQIS Sbjct: 241 FNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQIS 300 Query: 301 ENTKIISQKIRTGSYFRDVLEEVIDDLKTR 330 ENTKIISQKIRTGSYFRDVLEEVIDDLKTR Sbjct: 301 ENTKIISQKIRTGSYFRDVLEEVIDDLKTR 330 >UniRef50_UPI000196921F hypothetical protein BACCELL_02894 n=1 Tax=Bacteroides cellulosilyticus DSM 14838 RepID=UPI000196921F Length = 345 Score = 335 bits (858), Expect = 2e-90, Method: Composition-based stats. Identities = 101/341 (29%), Positives = 169/341 (49%), Gaps = 15/341 (4%) Query: 2 YFLNDLNFSRRDAGFKARKDALDIASDYENI--SVVNIPLWGGVVQRIISSVKLSTFLCG 59 Y+L+ +AG KA+ D + S + + +I+ + L Sbjct: 4 YYLSKNYNGLNNAGNKAKTDIEETLSKLGYKNAGLPQTTYSNKIAGFLITLAGVLKVLFT 63 Query: 60 LENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDELRGGG---GSDSVRLATCD 116 + DV++ +P K + + + H L + +++ +IHD+ R + RL D Sbjct: 64 VSANDVVVVQYPFKKYYSFVCNIIH-LKRGKVITIIHDLGTFRRKKLTAEQEIKRLNHSD 122 Query: 117 MVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTD-KQRGVIYAGNLSRHKC 175 ++I HN +M +L + + ++IFDYL S + + K VIYAG L+ K Sbjct: 123 VLIVHNDKMEIWLKEQGYTKPMVCLEIFDYLSPSVNNNTQEPNQKPIKVIYAGALTYKKN 182 Query: 176 SFIYTEG-----CDFTLFGVNYEN---KDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDS 227 ++Y+ F L+G +E +D + S + I FGLIW+GDS Sbjct: 183 RYLYSLNDVMSKWQFELYGGGFEEAKIEDKTLFKFKGFVPSDQLIEQVSAHFGLIWEGDS 242 Query: 228 VETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEI 287 + TCSG FG YL+ NNPHK SLY+ LP+ IW +AALA F+ +N+IG + S++E+ I Sbjct: 243 IHTCSGDFGIYLRINNPHKVSLYIRCNLPIIIWKEAALASFVAENKIGVCIDSLEELDSI 302 Query: 288 VDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDDLK 328 + S++ E+Y ++ N K I++KI +G Y + +E L+ Sbjct: 303 LSSISAESYNEMVRNIKEINKKIASGYYCKRAVENAESLLQ 343 >UniRef50_A0Z7X9 Putative uncharacterized protein n=1 Tax=marine gamma proteobacterium HTCC2080 RepID=A0Z7X9_9GAMM Length = 348 Score = 330 bits (847), Expect = 3e-89, Method: Composition-based stats. Identities = 92/344 (26%), Positives = 164/344 (47%), Gaps = 20/344 (5%) Query: 3 FLNDLNFSRRDAGFKARKDALDIASDYENI--SVVNIPLWGGVVQRIISSVKLSTFLCGL 60 F++ +R A KA+ D D + + + L L Sbjct: 7 FISRNYKARFSAAGKAKIDCEDALEKNGFKNIGLPRATYTSTLPNFFWTLFGTILGLLRL 66 Query: 61 ENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDELRGGG---GSDSVRLATCDM 117 + +L+ +P K + I+ +L +++ +IHD+ R + L D+ Sbjct: 67 KRHSILVVQYPTKKYYDFIVQ-IAKLKHCKVITIIHDLRSHRKQKMHVDKEMASLNKNDV 125 Query: 118 VISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEH-RDVTDKQRGVIYAGNLSRHKCS 176 VI+HN MT +L + K ++ IFDYL + +++AG L + K Sbjct: 126 VIAHNSFMTAWLQDHGLTSKAVNLNIFDYLCELKASSTPTPPRDKFRLVFAGVLEKRKNG 185 Query: 177 FIYTEG------CDFTLFGVNYENKDN-----PKYLGSFDAQSPEKINLPGMQFGLIWDG 225 F+Y+ L+G+ + + + Y G F A E ++ +FG++WDG Sbjct: 186 FLYSLDALNAKSFTCNLYGIGFNDSELPQDSIVTYQGVFPAD--EIVDRVEGEFGIVWDG 243 Query: 226 DSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQ 285 S++ C G+FG+YLK NNPHKTS+YL LP+ IWD+AA+A F+ D +G AV S+ ++ Sbjct: 244 TSLDECKGSFGEYLKINNPHKTSMYLRAGLPIIIWDQAAIATFVQDKNVGIAVASLAQVD 303 Query: 286 EIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDDLKT 329 E + S++ + Y+++ N + +SQ++ G++ +EE + L + Sbjct: 304 EALQSVSDDDYREMKRNAESVSQQLGEGAFLTAAVEEAMSQLAS 347 >UniRef50_A7AHD2 Putative uncharacterized protein n=1 Tax=Parabacteroides merdae ATCC 43184 RepID=A7AHD2_9PORP Length = 352 Score = 328 bits (842), Expect = 1e-88, Method: Composition-based stats. Identities = 93/338 (27%), Positives = 161/338 (47%), Gaps = 14/338 (4%) Query: 3 FLNDLNFSRRDAGFKARKDALDIASDYEN--ISVVNIPLWGGVVQRIISSVKLSTFLCGL 60 + + AG KA+ D I D I + ++ +I+ + + L Sbjct: 5 YFSKCYKELYSAGSKAKTDMEQIMCDLGYRNIGFPCLVCSNKILGFVITLLSMIKVCFKL 64 Query: 61 ENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDELRGGGG---SDSVRLATCDM 117 + D+LI +P+ K + + + H +++ LIHD+ R + RL D Sbjct: 65 RSGDILIIQYPLKKYYTLLCNIVH-YRGAKVITLIHDLGSFRRKRLTVLQEIDRLQNSDY 123 Query: 118 VISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSF 177 +I+ N M+ +L + ++KI+DYL + V ++ V+YAG L +K F Sbjct: 124 LITLNDSMSAWLQTKGCEVPKGELKIWDYLSPAIVLNKIEPATDYTVVYAGALGYNKNRF 183 Query: 178 IYTEG-----CDFTLFGVNYENKD--NPKYLGS-FDAQSPEKINLPGMQFGLIWDGDSVE 229 +Y +++G E N +Y + + I+ FGL+WDGDS E Sbjct: 184 LYELDRLPRQWHLSVYGKGLEADKILNKEYFSYNGFLPADQLISSVQGDFGLVWDGDSYE 243 Query: 230 TCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVD 289 C+G +G+YL++NNPHK SLY+ LP+ IW+KAALA FI + IG + S++E+ ++ Sbjct: 244 ACTGNYGEYLRYNNPHKVSLYVRCHLPLIIWEKAALAPFIKEKEIGICINSLEELDGKLE 303 Query: 290 SMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDDL 327 +T++ Y ++ IS + G +F L+E + L Sbjct: 304 KLTVDDYFKMKSRVIEISNLLSVGYFFTKALDEAVKFL 341 >UniRef50_C2F0P9 Galactofuranosyltransferase n=2 Tax=Lactobacillus reuteri RepID=C2F0P9_LACRE Length = 334 Score = 327 bits (838), Expect = 4e-88, Method: Composition-based stats. Identities = 94/330 (28%), Positives = 162/330 (49%), Gaps = 10/330 (3%) Query: 2 YFLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLE 61 Y ++ ++ + G KA+KD AS + V+++ ++ +QR + + L Sbjct: 3 YLISAIDPIKNSGGNKAKKDIDFFASQLNDTRVIHVKIYYTRLQRYLLTRLSIIKLVKTH 62 Query: 62 NKDVLIFNFPMAKPF--WHILSFFHRLLKFRIVPLIHDIDEL---RGGGGSDSVRLATCD 116 D I FP++ P+ + + ++ IHD+ L + V D Sbjct: 63 PADRYILQFPISTPYVLRQFIEVIQKYTNAKVDLFIHDLPALQLSMDDKERELVLFNQVD 122 Query: 117 MVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDK-QRGVIYAGNLSRHKC 175 +I HN M K+L Q + ++ +FDY ++ + D + + GNL++ Sbjct: 123 NLIVHNQAMKKWLVDNGVQTNMIELGLFDYDNEQPMQKKQEYDPANFTICFPGNLAKSTF 182 Query: 176 SFIYTEGCDFTLFGVNYENK--DNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSG 233 ++G N + ++ +Y G + + E FGLIWDG+S+ETCSG Sbjct: 183 LTKVNLSHQLNIYGPNKLDSYPESIRYCGQYTPE--ELPKHLTEDFGLIWDGNSIETCSG 240 Query: 234 AFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTI 293 FG+YLK+NNPHKTSLYLS +PV IWD+AALA I ++ +G + S+ E+ ++ S+T Sbjct: 241 TFGEYLKYNNPHKTSLYLSTGIPVIIWDQAALAPLIKESGVGICISSLTELDSVLLSLTN 300 Query: 294 ETYKQISENTKIISQKIRTGSYFRDVLEEV 323 E Y+ + + + QK+R G Y + L ++ Sbjct: 301 EQYQLMKRKAEKLGQKLRKGYYTKHALTKL 330 >UniRef50_C9LJY2 Putative uncharacterized protein n=1 Tax=Prevotella tannerae ATCC 51259 RepID=C9LJY2_9BACT Length = 353 Score = 323 bits (827), Expect = 8e-87, Method: Composition-based stats. Identities = 87/339 (25%), Positives = 157/339 (46%), Gaps = 13/339 (3%) Query: 3 FLNDLNFSRRDAGFKARKDALDIASDYE--NISVVNIPLWGGVVQRIISSVKLSTFLCGL 60 +++ R+ AG KA+ D DI N+ + + + + T+ + Sbjct: 9 YVSRNYKGRQGAGNKAKGDYEDILVQMGAHNLGLRRTYYKEYIAAFLTDLAGIVTYALSV 68 Query: 61 ENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDELRGGG---GSDSVRLATCDM 117 DV+ +P K F + R + + IHD+ R + RL+ D Sbjct: 69 RKGDVVFLQYPTKKYFSFMC-RLARWREANSMAFIHDLGAFRRKKVTVKQEIRRLSNADY 127 Query: 118 VISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSF 177 +I+ N M ++L + + + + DYL +S+ + T ++YAG++ K F Sbjct: 128 IIAANDTMAEWLKSHGLKRPCHGMGLHDYLSNSETVDKPATFPPHRIVYAGSIEERKNMF 187 Query: 178 IYTE-----GCDFTLFGVNYENK-DNPKYLGSFDAQSPE-KINLPGMQFGLIWDGDSVET 230 + + ++G N+ + + L + +P+ I FGL+WDGDS+ Sbjct: 188 LTKLSGVIRHGEIHVYGSNHIAALKSTRNLILHEPMTPDNFIATAKGDFGLVWDGDSLTA 247 Query: 231 CSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDS 290 C+G FG+YL+ N PHK S YL LP+ IW ++ALAD + IG V I E+++ ++S Sbjct: 248 CTGDFGEYLRINTPHKASFYLRAGLPLIIWSRSALADIVDREGIGITVDRIDEIEDHIES 307 Query: 291 MTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDDLKT 329 +T + ++I +N K +SQ + G R +E+ + +K Sbjct: 308 LTGQEIRKIRDNVKRVSQDLADGLSMRRAVEKAMCRIKE 346 >UniRef50_A5ZF92 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=A5ZF92_9BACE Length = 345 Score = 322 bits (826), Expect = 1e-86, Method: Composition-based stats. Identities = 94/339 (27%), Positives = 159/339 (46%), Gaps = 17/339 (5%) Query: 3 FLNDLNFSRRDAGFKARKDALDIASDYENI--SVVNIPLWGGVVQRIISSVKLSTFLCGL 60 +L+ +AG KA+ D I + + VV + + + L Sbjct: 4 YLSRNYRGVDNAGNKAKTDIEQIMESHGFRNVGLKQTRYRNVVVAFCRTLFSVLKSILCL 63 Query: 61 ENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDELRGGGG---SDSVRLATCDM 117 DVL+ +P+ K + + + H L ++V LIHD+ R + RL D Sbjct: 64 RKGDVLVLQYPLKKYYAFVCNMAH-LRGCKVVTLIHDLGSFRRKKLTIPQEIARLDHSDC 122 Query: 118 VISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSS-DVEHRDVTDKQRGVIYAGNLSRHKCS 176 VI H+ +M +L ++ + K++ ++IFDYL S V D +++ G LS + Sbjct: 123 VIVHSERMRDWLLEHGIKAKLQILEIFDYLSDSQPVAGNDSPKSPNRILFVGALSSYHND 182 Query: 177 FIYTE-----GCDFTLFGVNYENKD---NPKYLGSFDAQSPEKINLPGMQFGLIWDGDSV 228 F+Y + D L+G E + Y G S E I ++GL W G S+ Sbjct: 183 FLYKQVNSPRSYDIVLYGSGLETEKLEGKVDYKG--FVSSDELIATAEGEYGLAWYGSSL 240 Query: 229 ETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIV 288 E SGA G+YL++N PHK SLY+ LP+ +W+KA LA F+ N +G + S+ E+++I+ Sbjct: 241 EGGSGALGEYLQYNAPHKMSLYIRCGLPIIVWEKAGLAPFVKKNNVGICISSLTELEDIL 300 Query: 289 DSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDDL 327 ++ Y ++ +N I+ K+ G Y +++ DL Sbjct: 301 PKISAGQYMEMKKNVLQIADKLSHGYYCFKAIKQACADL 339 >UniRef50_C2E8T4 Possible galactofuranosyltransferase n=1 Tax=Lactobacillus ruminis ATCC 25644 RepID=C2E8T4_9LACO Length = 337 Score = 319 bits (819), Expect = 5e-86, Method: Composition-based stats. Identities = 83/337 (24%), Positives = 167/337 (49%), Gaps = 16/337 (4%) Query: 1 MYFLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGL 60 +Y + ++ DAG KA+ D + ++ V+++ L +++ + L Sbjct: 6 LYSVLRSLRAKNDAGPKAKTDINEFLTEEGFK-VMDLDLPEKRLEKFLFVHLKLKRLFKG 64 Query: 61 ENKDVLIFNFPMAKPF--WHILSFFHRLLKFRIVPLIHDIDELRGGGGS------DSVRL 112 D +I +P F I+ ++ + + ++HD++ LR G+ + Sbjct: 65 RQFDNVILQYPFYSVFLTKKIIENAKKVTHGKFLIMVHDVETLRVYDGNKQFEKDEMEIF 124 Query: 113 ATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSR 172 + D +I HN +M ++L ++ I + IFDY +D + + + Q+ + +AGNL + Sbjct: 125 NSADGLIVHNSKMAEWLKQHGVTVPITILGIFDYR--NDCQKNERFEYQKSICFAGNLEK 182 Query: 173 HKCSFIYTE-GCDFTLFGVNYENK--DNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVE 229 ++G + K Y G + + N FGLIWDGD + Sbjct: 183 STFLKKVKLNDAKLDVYGPSPAQKYQKGVTYCGVYTPD--DLPNHLNENFGLIWDGDEMS 240 Query: 230 TCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVD 289 C+G FG+Y+++N PHKTSLYLS +PV IW +AA+A+F+ +N +G A+ ++ ++ ++ Sbjct: 241 ACTGVFGNYMRYNAPHKTSLYLSSGIPVIIWKEAAMAEFVSENEVGIAIENLNDLDNVLQ 300 Query: 290 SMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDD 326 + Y+++ N +++++R+GSY ++ + + + D Sbjct: 301 KVDDAGYRKMKSNALNLAERLRSGSYVKEAVRKALGD 337 >UniRef50_D1PTN6 Putative uncharacterized protein n=1 Tax=Prevotella bergensis DSM 17361 RepID=D1PTN6_9BACT Length = 338 Score = 319 bits (819), Expect = 7e-86, Method: Composition-based stats. Identities = 101/326 (30%), Positives = 162/326 (49%), Gaps = 13/326 (3%) Query: 16 FKARKDALDIASDYENISVVNIPLWGGVVQRIISSVK-LSTFLCGLENKDVLIFNFPMAK 74 KA+KD + +++ + G + R ++ + + L L+ DVL +PM K Sbjct: 11 NKAKKDIDTVVEQLGYVNLSKVQCGNGGIGRFLTKLLAMVNILTTLKRDDVLFLQYPMKK 70 Query: 75 PFWHILSFFHRLLKFRIVPLIHDIDELRGGG---GSDSVRLATCDMVISHNPQMTKYLSK 131 + + H L ++V +IHD+ R ++ + D +I+HNP MT+YL + Sbjct: 71 FYKMACTLAH-LKGAKVVTVIHDLGAFRRHKLTPEQENRLFSKTDFLIAHNPTMTEYLQQ 129 Query: 132 YMSQDKIKDIKIFDYLVSSDVEHRDVTDKQR-GVIYAGNLSRHKCSFIYTEG-----CDF 185 + Q + + IFDYL + V + ++YAGNL + F+Y Sbjct: 130 HGFQGGVHHLGIFDYLSAKPVRQPNAQPHDPWRIVYAGNLGVWRNEFLYHLDTAIKHWTL 189 Query: 186 TLFGVNYENKDNPKYLGSFDA--QSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNN 243 L+G +E K N ++ S E I FGL+WDG SV+ C+GA+G+YLK NN Sbjct: 190 DLYGKGFEPKKNNCQKLTYHGFIDSDEFIERVDADFGLVWDGASVDECNGAWGEYLKINN 249 Query: 244 PHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQISENT 303 PHKTS YL +PV +W K+A+A FI N +G V S+ E+ ++ +T E Y+ + N Sbjct: 250 PHKTSFYLRAGIPVIVWSKSAMAPFIRKNGLGLTVDSLAEIDSHLEQLTPEQYQAMRANA 309 Query: 304 KIISQKIRTGSYFRDVLEEVIDDLKT 329 I QK+ TGS+ + L+ + K Sbjct: 310 YTIGQKLATGSHIKRGLDAAQEYFKE 335 >UniRef50_C0BRQ3 Putative uncharacterized protein n=2 Tax=Bifidobacterium RepID=C0BRQ3_9BIFI Length = 354 Score = 315 bits (808), Expect = 1e-84, Method: Composition-based stats. Identities = 107/345 (31%), Positives = 163/345 (47%), Gaps = 19/345 (5%) Query: 2 YFLNDLNFSRRDAGFKARKDALDIASDYENISVVNIP-----LWGGVVQRIISSVKLSTF 56 Y + + + AG KAR D + P + +V + Sbjct: 4 YVICERSLHHAHAGSKARDDIRQVLESQSWQPFEVRPGENKGYFDKLVCVGRTLAVWHRL 63 Query: 57 LCGLENKDVLIFNFP--MAKPFWHILSFFHRLLKFR---IVPLIHDIDELRGGG--GSDS 109 + DV++ FP M R +K R V LIHD++ LRG D Sbjct: 64 ERTVRCGDVVLVQFPLIMYNKVSLYALPSVRRMKARGALFVFLIHDLETLRGYSYTDFDK 123 Query: 110 VRLATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGN 169 + D++ISHNP+M++ L KY + I +I IFDYL+ + +++ G+ AGN Sbjct: 124 QWVTEADLLISHNPRMSEVLRKYGATVPIVEIGIFDYLLPQ--ANPVPMEQRHGIDIAGN 181 Query: 170 LSRHKCSFIYTE-----GCDFTLFGVNYENKDNPKYLGSFDAQSPEKINLPGMQFGLIWD 224 LS K ++Y D L+G Y+ ++ E + +FGLIWD Sbjct: 182 LSHGKAEYVYRLAERFPKADINLYGPKYDRRNGKTAWYRGIVAPDELPDKLEGRFGLIWD 241 Query: 225 GDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEM 284 GDS++TC G +G YL NNPHK SLYL+ + PV IW+KAALA F+V+ +G AV S++E Sbjct: 242 GDSLDTCGGYYGKYLTVNNPHKLSLYLAADKPVIIWNKAALAPFVVEQGVGVAVESLQEA 301 Query: 285 QEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDDLKT 329 + MT Y ++ + QK+R G + R+V+ +V L + Sbjct: 302 MAVEYGMTQSEYARMVRRASQLGQKLREGWFTREVMAKVQAVLPS 346 >UniRef50_Q03GL2 Glycosyltransferase n=1 Tax=Pediococcus pentosaceus ATCC 25745 RepID=Q03GL2_PEDPA Length = 338 Score = 309 bits (792), Expect = 7e-83, Method: Composition-based stats. Identities = 95/337 (28%), Positives = 171/337 (50%), Gaps = 14/337 (4%) Query: 2 YFLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLE 61 Y L N + AG KA+KD I + SV +++ ++ ++ L + Sbjct: 4 YVLRITNGQKNTAGDKAKKDITSILNKQGFKSVEIRLRESKLIKLFTTNFMINKQLNNFK 63 Query: 62 NKDVLIFNFPMAKPF-WHILSFFHRLLKFRIVPLIHDIDELRGGGGSDSVR------LAT 114 D+ + +PM F I+ R + +IHD++ LR ++ L+ Sbjct: 64 KNDIFVIQYPMYSRFATKIILNKCEKKGIRTICVIHDLEALRLYKNDENKIAEEKAILSR 123 Query: 115 CDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHK 174 + +I HN +M ++L + + + ++IFDYL ++ + + +I+AGNL + Sbjct: 124 FNCLIVHNEKMREWLVEQDVKVPMVSLQIFDYLNDKEL---VKVENKLNLIFAGNLEKSA 180 Query: 175 CSFIYTEGCDFTLFGVNYEN--KDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCS 232 + T+FGV+ + N Y G E FGLIWDG+S+ET + Sbjct: 181 FLEKWNLEKKITVFGVHPSDLYPHNVIYKGVKTPD--ELPKYLSGSFGLIWDGNSIETNT 238 Query: 233 GAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMT 292 G +GDY K+NNPHK SLYLS LPV +W KAA+++FIV N++G ++ S+ ++++ + + Sbjct: 239 GIYGDYTKYNNPHKVSLYLSSGLPVIVWKKAAISEFIVKNKLGISIDSLGDLEDSLSKIN 298 Query: 293 IETYKQISENTKIISQKIRTGSYFRDVLEEVIDDLKT 329 E Y + N + +++K+R G++ +E+ I+ +K+ Sbjct: 299 AEKYTNMVSNVEKMARKLRKGTFTTKAVEKAINLIKS 335 >UniRef50_Q1WU31 Galactofuranosyltransferase n=2 Tax=Lactobacillus salivarius RepID=Q1WU31_LACS1 Length = 335 Score = 309 bits (792), Expect = 8e-83, Method: Composition-based stats. Identities = 104/329 (31%), Positives = 163/329 (49%), Gaps = 11/329 (3%) Query: 3 FLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLE- 61 ++ N S+ +AG KA+ D + Y+ G + + L L+ Sbjct: 5 IVSMYNKSQNEAGPKAKIDVENFLKIYDFKIQDFYFYGGRRAELVSYRQSLFDIPFRLKG 64 Query: 62 NKDVLIFNFPMAK-PFWHILSFFHRLLKFRIVPLIHDIDELRGGG---GSDSVRLATCDM 117 + IF +P + + ++ LIHD++ LR + L D Sbjct: 65 RYENAIFQYPALNERTNKAIMRNLKKNSQKVYILIHDLESLRFKNGGNNFELDLLNMSDG 124 Query: 118 VISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSF 177 VI+HN +M +L + I D++IFDY + ++ + + V YAGNL++ Sbjct: 125 VIAHNKKMIDWLRNNGVEVPIVDLEIFDYDNNIPLQENYI--FDKSVCYAGNLNKATFLK 182 Query: 178 IYTEGCDFTLFGVNYEN---KDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGA 234 Y TLFG N+ +Y GS K L FGLIWDGDS + CSG Sbjct: 183 EYEPDFKLTLFGPNFSPALMSKYIEYKGSLSPDELAK-ELLTQNFGLIWDGDSSKGCSGI 241 Query: 235 FGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIE 294 +G+YLK+NNPHKTSLYLS +P+ IW +AALA+F+ N++G V ++ +++ I+D MT E Sbjct: 242 YGEYLKYNNPHKTSLYLSSGMPIIIWREAALAEFVDKNKLGIVVDNLSQIKPILDKMTKE 301 Query: 295 TYKQISENTKIISQKIRTGSYFRDVLEEV 323 Y++I NT I+ K+R+G Y + + E+ Sbjct: 302 EYQEIKSNTIKIAHKLRSGFYIKKAITEL 330 >UniRef50_Q4JYT0 Putative glycosyl transferase n=1 Tax=Streptococcus pneumoniae RepID=Q4JYT0_STRPN Length = 354 Score = 306 bits (784), Expect = 7e-82, Method: Composition-based stats. Identities = 108/346 (31%), Positives = 182/346 (52%), Gaps = 21/346 (6%) Query: 1 MYFLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGL 60 +Y++++ S A KA D I D + +V + +V+ + KL L + Sbjct: 4 LYYIHEEFGSDSTAATKAPNDLQKIFQDCKFKPLVTLKKNSKIVRIFDYAFKLLLCLIRI 63 Query: 61 ENKDVLIFNFPMAKP--FWHILSFFHRLLKFRIVPLIHDIDELRGGGGSDS-----VRLA 113 + D++IF FP A + L + K +++ LI+D++ LR G + + Sbjct: 64 RSNDIVIFQFPFATHGKLKNFLMKLLQYKKAKMIFLINDLESLRYSGNKKNLISKEQYIK 123 Query: 114 TCDMVISHNPQMTKYLSKYMS-QDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSR 172 D++I HN +M ++L + +KI + +FDYL+ D +++ + V+ AGNLS Sbjct: 124 NADVIICHNQRMKEFLIENKIDSEKIVVLGVFDYLL--DKFNKEKASFDKTVVIAGNLSP 181 Query: 173 HKCSFIYTE-----GCDFTLFGVNYENKDN----PKYLGSFDAQSPEKINLPGMQFGLIW 223 K ++ F L+G N+ + N Y GSF + + FGL+W Sbjct: 182 QKSGYLTELLKNENRIKFNLYGPNFTSSTNNNDCVSYKGSFSPEK--IPFILEGDFGLVW 239 Query: 224 DGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKE 283 DGDS+ TCSG G+YLK+NNPHK SL+++ ++PV IW ++AL+DF+ +N IG V + E Sbjct: 240 DGDSILTCSGITGEYLKYNNPHKVSLFIASKIPVIIWKQSALSDFVKENNIGIVVNDLIE 299 Query: 284 MQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDDLKT 329 MQEI+ +MT E Y+ EN + +S+K+R G + +E+ + +K Sbjct: 300 MQEIITNMTEEQYEIFRENIEQLSKKVRQGYFTNLAIEKSLSIIKN 345 >UniRef50_C0YXY6 Possible galactofuranosyltransferase n=5 Tax=Lactobacillus RepID=C0YXY6_LACRE Length = 338 Score = 306 bits (784), Expect = 8e-82, Method: Composition-based stats. Identities = 90/329 (27%), Positives = 154/329 (46%), Gaps = 15/329 (4%) Query: 10 SRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRI-ISSVKLSTFLCGLENKDVLIF 68 +AG KA+ D + V+Q+ ++ + + FL + D + Sbjct: 13 HDNNAGPKAKIDIENFLLKDGFEKWNFTINQESVLQKAKVAYIDVPRFLAKQNDIDEIFL 72 Query: 69 NFPMA-KPFWHILSFFHRLLKFRIVPLIHDIDELRGGG------GSDSVRLATCDMVISH 121 +P K L + + +I+ +IHDI+ LR + D +I H Sbjct: 73 QYPTYSKIVTKQLVKRLQQMNSKIILIIHDIESLRLHYGEKGYIDEELRVFNMADGLIVH 132 Query: 122 NPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCS-FIYT 180 N +M K+L ++ + +FDY + ++ ++ + V +AGNLS+ + Sbjct: 133 NAKMEKWLRDNGVTVPMESLGLFDY--DNKIKLASGSNYETSVCFAGNLSKAGFLEKLSL 190 Query: 181 EGCDFTLFGVNYENK--DNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDY 238 + +FG N K N Y G + E N FGL+WDG + TC G FG+Y Sbjct: 191 KRVKLNVFGPNPLEKYGANIVYKGQYPPD--ELPNYLKGNFGLVWDGTTPITCDGLFGNY 248 Query: 239 LKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQ 298 +KFNNPHK SLYLS +PV +W +AA+AD + IG V S+ E+ E++ +++ Y + Sbjct: 249 MKFNNPHKASLYLSSGIPVVVWRQAAIADLVEKMNIGIVVDSLNELDEVLPNVSSIDYSE 308 Query: 299 ISENTKIISQKIRTGSYFRDVLEEVIDDL 327 + N K +++K+R+G Y + + + L Sbjct: 309 LVNNAKEVAEKLRSGFYIKTAISNLEKGL 337 >UniRef50_D1PDY2 Putative galactofuranosyltransferase n=1 Tax=Prevotella copri DSM 18205 RepID=D1PDY2_9BACT Length = 351 Score = 304 bits (780), Expect = 2e-81, Method: Composition-based stats. Identities = 82/340 (24%), Positives = 158/340 (46%), Gaps = 16/340 (4%) Query: 3 FLNDLNFSRRDAGFKARKDALDIASDYE--NISVVNIPLWGGVVQRIISSVKLSTFLCGL 60 +++ +++ AG KA+ D + + N+ + + + + L Sbjct: 8 YISRDYYNQTSAGNKAKTDTEETLVEMGAINLGLHRTIKNSKIFAFFRNLAGIIRACILL 67 Query: 61 ENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDELRGGG---GSDSVRLATCDM 117 + D+L +P+ K F I + R + + LIHDI +R + RL+ D Sbjct: 68 KKGDILFLQYPIKKYFTFICT-VARFKGAKTISLIHDIGSIRTHRLTTQQEVKRLSHSDY 126 Query: 118 VISHNPQMTKYLSKYMSQDKIKDIKIFDYLVS--SDVEHRDVTDKQRGVIYAGNLSRHKC 175 +++ N +M ++L Q I+ + ++DY + H ++YAG + K Sbjct: 127 ILATNNKMKEWLISNNFQKPIEGLGLWDYRSPYFNKNSHPICNPGNISIVYAGAIHVRKN 186 Query: 176 SFIYT-----EGCDFTLFGVNYE---NKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDS 227 F+ + + ++G E +NP Q E I FGL+WDGDS Sbjct: 187 PFLIQLSKKLKTWNLIIYGKKEELTGWANNPLITFKGFVQPDEFIRTVKADFGLVWDGDS 246 Query: 228 VETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEI 287 ++TCSG FG+YLK+N PHK S YL LP+ IW +AA+ + + A+ ++ E+++ Sbjct: 247 LDTCSGIFGEYLKWNTPHKVSFYLRAGLPIIIWKQAAVTPILEKAGVCIAINTLSELEQK 306 Query: 288 VDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDDL 327 ++ ++ + ++ ENTK +++++ G + R L+ + + Sbjct: 307 LNELSSDELSKMKENTKRLAERLNQGFFLRQALDNYLSVI 346 >UniRef50_C0WVC5 Possible galactofuranosyltransferase n=2 Tax=Lactobacillus fermentum RepID=C0WVC5_LACFE Length = 350 Score = 303 bits (776), Expect = 5e-81, Method: Composition-based stats. Identities = 102/351 (29%), Positives = 163/351 (46%), Gaps = 30/351 (8%) Query: 2 YFLNDLNFSRRDAGFKARKDALDIA-SDYENISVV-----------NIPLWGGVVQRI-- 47 Y ++ + S G KA D + + + LW + RI Sbjct: 4 YIISLKDPSGNVGGPKANMDNIKFLKEQMGFKELWLDYGWKDGFWWHHNLWDWINTRIHK 63 Query: 48 --ISSVKLSTFLCGLENKDVLIFNFPMA--KPFWHILSFFHRLLKFRIVPLIHDIDELRG 103 +S L F + D ++ +P+ K I H+ ++ +IHD + +R Sbjct: 64 YQLSRSVLPKFFKEHPDIDNVVIQYPLYSNKLIKQITDSVHQNSHAKLYFIIHDAEMIRL 123 Query: 104 GGGS------DSVRLATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDV 157 + D +I HN +M K+L + + + D+ IFDY ++ Sbjct: 124 YADEPKRAQGELDSFNLSDGIIGHNAKMNKFLKEQGVKVPLVDLGIFDYDNPQPLQEYKG 183 Query: 158 TDKQRGVIYAGNLSRHKCSFIYTEGCDFTLFGVNYENKDN--PKYLGSFDAQSPEKINLP 215 + V YAGNL + F LFG N N Y G F + Sbjct: 184 --YDKSVCYAGNLIDAEFLQDVHPTNRFDLFGPNPAESYNEGLNYKGQFSPT--DLPAHM 239 Query: 216 GMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIG 275 FGL+W G SV+TC G FG YLK+NNPHKTSLYLS LPV IWD+AALADF+++N +G Sbjct: 240 DENFGLVWHGTSVDTCDGVFGRYLKWNNPHKTSLYLSSGLPVIIWDQAALADFVLENGVG 299 Query: 276 YAVGSIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDD 326 + S+ ++ + +D++T E Y+Q+ +N + ++ ++RTG Y +E++I++ Sbjct: 300 ITISSLNDLNDKLDALTEEEYRQMHDNVQKVANQMRTGYYITHAMEKMINN 350 >UniRef50_C2FKS4 Possible galactofuranosyltransferase n=1 Tax=Lactobacillus plantarum subsp. plantarum ATCC 14917 RepID=C2FKS4_LACPL Length = 343 Score = 303 bits (775), Expect = 8e-81, Method: Composition-based stats. Identities = 84/335 (25%), Positives = 162/335 (48%), Gaps = 15/335 (4%) Query: 2 YFLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLE 61 Y ++ +AG KA++D I + + + V R++ ++++ + Sbjct: 4 YLVSTKLEKNNNAGSKAKQDIEAILFKAGLEKLSLV-IPTNRVGRVLYAIRIWKKVFNGL 62 Query: 62 NKDVLIFNFPMAKPF--WHILSFFHRLLKFRIVPLIHDIDELRGGGGSD------SVRLA 113 N+ +++ +P+ ++ + +IV ++HDI+ LR + L Sbjct: 63 NEGLIVVQYPLYSKVITKQLVKEAGKRPNVKIVAIVHDIESLRIDVNHEDAINTEIDLLN 122 Query: 114 TCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRH 173 D +I HN +M +L + + + FDYL V V +AGNL++ Sbjct: 123 GFDFLIVHNTKMKSWLIENGLTIPSEVLGAFDYLSDFSV--PIQRKSGNVVNFAGNLAKS 180 Query: 174 KCS-FIYTEGCDFTLFGVNYEN-KDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETC 231 I + + ++G N + Y G + + + +GL WDGDS+ TC Sbjct: 181 SFLTKITSTDVKYHIYGPNPQKYSTALAYKGIYSPE--QLSEQFVSGYGLAWDGDSITTC 238 Query: 232 SGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSM 291 SG +G+YLK NNPHK SLY+ LPV +WD +A++D++ N +G +V S+ E+ +I+ + Sbjct: 239 SGVYGEYLKINNPHKVSLYIRSGLPVIVWDDSAMSDWVQKNDLGLSVSSLAELGDIISGV 298 Query: 292 TIETYKQISENTKIISQKIRTGSYFRDVLEEVIDD 326 T Y+ +EN ++++Q+++ G Y R+ +++ + Sbjct: 299 TDHQYQIYTENARVVAQRMQQGLYIREAFTKLLKN 333 >UniRef50_C2EVL7 Possible galactofuranosyltransferase n=1 Tax=Lactobacillus vaginalis ATCC 49540 RepID=C2EVL7_9LACO Length = 336 Score = 301 bits (770), Expect = 3e-80, Method: Composition-based stats. Identities = 91/338 (26%), Positives = 160/338 (47%), Gaps = 21/338 (6%) Query: 2 YFLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLE 61 Y L + + G KA++DA+ IS+ + + ++ SS L L Sbjct: 7 YVLEWHDSEKNTGGVKAKQDAVTFLKKDGFISI---EVPSSKLGKVWSSFWARYILRNLS 63 Query: 62 NKDVLIFNFPMAKPFWHIL--SFFHRLLKFRIVPLIHDIDELRGG--------GGSDSVR 111 +++ +P KPF L + K +++ LIHD++ +R S+ Sbjct: 64 G--IIVIQYPSGKPFLRKLWLEAACKNKKLKVILLIHDLESIRFFNDSKYSDVRQSEFEF 121 Query: 112 LATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLS 171 +A D +++ N +M L K I + +DY + + + D QR + YAGNL Sbjct: 122 IAKADGLVALNERMKSLLVKGGIVKPITTLDAWDYDNKNPIIEK--KDYQRRICYAGNLR 179 Query: 172 RHKCSFIYTEGCDFTLFGVNYEN--KDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVE 229 + +FG N E + KY+G F Q + +GL+WDG S E Sbjct: 180 KALFLSDLKCKTSIYVFGPNSETTFSKSIKYMGQFSPQK--LPSHLNGDYGLVWDGVSSE 237 Query: 230 TCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVD 289 TC G +G YL++N PHK SLY+S LPV +WDKAA+A+F+ +G + ++ ++ ++ Sbjct: 238 TCKGMYGQYLRYNTPHKFSLYISSGLPVIVWDKAAIAEFVKKYNVGLTISNLNDIDNLLH 297 Query: 290 SMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDDL 327 S+ YK++ +N +++K+R G + + ++I + Sbjct: 298 SVPSSQYKELQKNVIKVAEKMRNGQFLTTAINDLIKKI 335 >UniRef50_Q4JZC8 Putative glycosyl transferase n=2 Tax=Streptococcus pneumoniae RepID=Q4JZC8_STRPN Length = 357 Score = 299 bits (767), Expect = 6e-80, Method: Composition-based stats. Identities = 112/356 (31%), Positives = 178/356 (50%), Gaps = 36/356 (10%) Query: 2 YFLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLW---GGVVQRIISSV----KLS 54 YF+ + AG KA D I+ + + V+Q++ Sbjct: 6 YFIKVEKDLKNTAGIKAPDDIEKISEELGMKEIRFPKFPFEKNKVIQKLWLFCVVGYNWI 65 Query: 55 TFLCGLENKDVLIFNFPMA--KPFWHILSFFHRLLKFRIVPLIHDIDELRGGGGS----- 107 + L L+ DV+I+ PM + + + + + +IHD++ LR G Sbjct: 66 SLLWRLKKNDVVIYQHPMYGVRVANFAIPLLKKYKNIKFISVIHDLESLRKGIQGVIEDN 125 Query: 108 -------DSVRLATCDMVISHNPQMTKYLSKYMSQD-KIKDIKIFDYLVSSDVEHRDVTD 159 D L+ D VISHNP+MT+YL + + +++IFDYL S++E + Sbjct: 126 ETTNAIADKELLSKFDKVISHNPKMTEYLEGIGIKKENLVELQIFDYLDPSEIEEKI--- 182 Query: 160 KQRGVIYAGNLSRHKCSFIYTE-----GCDFTLFGVNY---ENKDNPKYLGSFDAQSPEK 211 + GV+ AGNL++ K S+IY LFG N+ E +N +Y GS Sbjct: 183 -EDGVVIAGNLAKGKSSYIYKLLENELNFKLNLFGPNFINEELPENVEYFGSLPPNK--L 239 Query: 212 INLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVD 271 +FGL+WDGDS+ETCSG G+YLK+NNPHKTSLYL+ +PV IW +AALA FI + Sbjct: 240 PQKLVGKFGLVWDGDSLETCSGNTGNYLKYNNPHKTSLYLASGIPVIIWKEAALAQFIEE 299 Query: 272 NRIGYAVGSIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDDL 327 N +G V ++ E++ ++ +++ Y I NT + +K+R G ++R + + +D Sbjct: 300 NNVGITVNNLSEIEFVMQNISEGEYLSIKRNTMQLGEKLRNGYFYRQAISKCKNDF 355 >UniRef50_D0RXG2 Galactofuranose transferase n=13 Tax=Streptococcus RepID=D0RXG2_9STRE Length = 351 Score = 298 bits (764), Expect = 1e-79, Method: Composition-based stats. Identities = 105/350 (30%), Positives = 165/350 (47%), Gaps = 31/350 (8%) Query: 2 YFLND---LNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKL----S 54 Y+L D N ++AG KAR D I + + + Sbjct: 3 YYLKDSFLHNEHEKNAGSKARNDVEAILISEGYEGLELKVENWYKMNFFKAQQHKYRATK 62 Query: 55 TFLCGLENKDVLIFNFPMAKPFWHILSFFHR--LLKFRIVPLIHDIDELRGGGG------ 106 + L D L+ FP+ + I + + LIHD++ LR G Sbjct: 63 SVFDQLGAGDELVIQFPIIHHTFFISQLIKQAQKRGAKFYLLIHDVETLRHAAGSEVKFR 122 Query: 107 -------SDSVRLATCDMVISHNPQMTKYLSKYMSQ-DKIKDIKIFDYLVSSDVEHRDVT 158 + L + D +I HN M K L DK+ ++IFDYL+ + E + + Sbjct: 123 HKVRNYFQEKKALMSVDGIIVHNDIMKKVLVGQGVPADKMASLEIFDYLIPN-FEVQALP 181 Query: 159 DKQRGVIYAGNLSRHKCSFIYTE--GCDFTLFGVNYENK---DNPKYLGSFDAQSPEKIN 213 K + +I AGNL+ K ++Y + L+GV Y+ N Y GSF + Sbjct: 182 QKDQPIIVAGNLNPAKSGYLYNLPDQPAYNLYGVGYDESRALKNTSYFGSFMPD--DLPA 239 Query: 214 LPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNR 273 FGL+WDGDS ETC G++G+YL+FNN HK SLYL+ PV +W ++ALA FI++ Sbjct: 240 ALEGSFGLVWDGDSSETCQGSYGNYLRFNNSHKASLYLASGFPVVVWKESALAHFILEKS 299 Query: 274 IGYAVGSIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEV 323 G AV S+ +++ +++++T + Y +SEN K I + +R G Y R L+++ Sbjct: 300 CGIAVASLHDLEAVLENLTEKEYADLSENAKRIGKDLREGYYLRSALKKL 349 >UniRef50_B0BR56 Glycosyltransferase n=1 Tax=Actinobacillus pleuropneumoniae serovar 3 str. JL03 RepID=B0BR56_ACTPJ Length = 349 Score = 298 bits (763), Expect = 2e-79, Method: Composition-based stats. Identities = 106/350 (30%), Positives = 172/350 (49%), Gaps = 28/350 (8%) Query: 2 YFLNDLNFSRRDAGFKARKDALDIASDYENISVVNIP------LWGGVVQRIISSVKLST 55 Y + +L+ AG KA +D +IA + V L +++++I + Sbjct: 4 YQIVELSTEHNHAGSKAVQDVYEIALSMGYKANVVRTATSVDSLLAKILRQVIFFIDWLK 63 Query: 56 FLCGLENKDVLIFNFPMAKPF---WHILSFFHRLLKFRIVPLIHDIDELR------GGGG 106 +E+ +++ P IL+ R+ K + + L+HD++ELR Sbjct: 64 IYFSIESNSIVLIQNPYYHKQLIRNWILNRLKRIKKVKFISLVHDVEELRKSLYNNYYKN 123 Query: 107 SDSVRLATCDMVISHNPQMTKYLSKYMS-QDKIKDIKIFDYLVSSDVEHRDVTDKQRGVI 165 L+ D +I HN +M + K +DK+ + IFDYL S + +R + Sbjct: 124 EFETMLSLADSIIVHNDKMKSFFIKKGYSEDKLISLGIFDYLQKSV--DKKRVSFERAIS 181 Query: 166 YAGNLSRHKCSFIYTEG----CDFTLFGVNYENK----DNPKYLGSFDAQSPEKINLPGM 217 AGNL K S+I G L+G N+E+ N +Y GSF A E Sbjct: 182 VAGNLDIKKSSYIAQLGSLPAIKAHLYGPNFEHSLEAFPNIEYHGSFPAT--EIPQKLVS 239 Query: 218 QFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYA 277 FGL+WDG S+ETC+G FG+YL++NNPHK SLYLS +PV IWDKAA ADF+ + +G Sbjct: 240 GFGLVWDGQSIETCTGDFGEYLQYNNPHKLSLYLSSGMPVVIWDKAAEADFVKKHNVGLC 299 Query: 278 VGSIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDDL 327 V S+ E+Q+ ++ MT + ++++ N + + + +G Y + + E + Sbjct: 300 VSSLSELQDKLNVMTEQEFEEMVNNVEKQTACLISGEYTKKAISEAERVI 349 >UniRef50_Q4JYV0 Putative glycosyl transferase n=2 Tax=Streptococcus pneumoniae RepID=Q4JYV0_STRPN Length = 356 Score = 298 bits (762), Expect = 3e-79, Method: Composition-based stats. Identities = 104/354 (29%), Positives = 177/354 (50%), Gaps = 36/354 (10%) Query: 2 YFLNDL---NFSRRDAGFKARKDALDIASDYENISVV-----NIPLWGGVVQRIISSVK- 52 YF+ + +++AG KAR+D DI ++ N VQR++ K Sbjct: 3 YFVEETLLDEQDKKNAGGKARQDVTDILESIGYQKLIAESEMNERQELNAVQRLVHHYKV 62 Query: 53 ---LSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLK--FRIVPLIHDIDELRGGGGS 107 L + D +I FP+ +L K ++ LIHD++ LR Sbjct: 63 KKMWKKTLSVVGKGDEVIIQFPLLNHSLFFNQVIKQLSKNGVKVYFLIHDLESLRWSQSK 122 Query: 108 DSVR-------------LATCDMVISHNPQMTKYLSKYMSQD-KIKDIKIFDYLVSSDVE 153 L + +I+HN +M Y+ Y + KI ++ FDY++ S E Sbjct: 123 SISLKSRIRLNIEEHSVLRLSEGIIAHNKKMKSYIKTYSVESSKIIPLETFDYIIPSYHE 182 Query: 154 HRDVTDKQRG--VIYAGNLSRHKCSFIYTE--GCDFTLFGVNYE--NKDNPKYLGSFDAQ 207 +++ + Q ++ AGNL +HK ++Y +F L+G+ YE + + Y GSF + Sbjct: 183 RKNLDNFQLNAPIVIAGNLKQHKAGYVYHLPSNVEFNLYGIGYEQTDDKSVHYCGSFMPE 242 Query: 208 SPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALAD 267 E + FGL+WDG S E+C +G+YL+ NNPHKTSLYL+ +PV +W +AA+A Sbjct: 243 --ELPFVLKGSFGLVWDGPSSESCIETYGEYLRVNNPHKTSLYLASGIPVVVWSEAAIAS 300 Query: 268 FIVDNRIGYAVGSIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLE 321 FI +N G V ++ E+ E++ +T++ Y+ + +NT+II +++R G Y + ++ Sbjct: 301 FIKENNCGILVSNLSELPELLSMITVDEYELMKKNTEIIGERLRQGFYTKQAVK 354 >UniRef50_D0BKT6 Galactofuranosyltransferase n=1 Tax=Granulicatella elegans ATCC 700633 RepID=D0BKT6_9LACT Length = 331 Score = 295 bits (755), Expect = 2e-78, Method: Composition-based stats. Identities = 105/348 (30%), Positives = 167/348 (47%), Gaps = 41/348 (11%) Query: 2 YFLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLE 61 Y+L + + AG KAR DA I + + + L Sbjct: 3 YYLKENYAKAKHAGSKARLDAEKIMVEAGYAP---------------YFLNNHSNAVPLT 47 Query: 62 NKDVLIFNFPMA--KPFWHILSFFHRLLKFRIVPLIHDIDELRGGG-------------- 105 DV++ FP+ IL+ F + KF+ LIHDI+ LR Sbjct: 48 KDDVIVLQFPLLWQSLKKQILTRFLKNRKFKAYLLIHDIESLRNRKIKTVKDFKHSIIYF 107 Query: 106 GSDSVRLATCDMVISHNPQMTKYLSKYMS-QDKIKDIKIFDYLVSSDVEHRDVTDKQRGV 164 + L D +I+HN +M L + ++KI +++FDY++ E ++ V Sbjct: 108 LQNKTVLEKVDGIIAHNDKMKAELVRLGIPEEKIVALEMFDYVIPHYEEK--TAYEKNTV 165 Query: 165 IYAGNLSRHKCSF--IYTEGCDFTLFGVNYENK---DNPKYLGSFDAQSPEKINLPGMQF 219 I AGN K + +F+++G+N+E + N Y G+F E + F Sbjct: 166 IVAGNFDIRKTKYARQLPGNPEFSIYGINFEEEHLPKNVHYKGAFSPD--ELSHHLQGGF 223 Query: 220 GLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVG 279 GL+WDGDS TCSG +G+YLK NNPHK SLYL+ P+ +W ++ALADF+ N+ G V Sbjct: 224 GLVWDGDSPHTCSGMYGEYLKMNNPHKASLYLASGFPIIVWSQSALADFVRQNKCGILVD 283 Query: 280 SIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDDL 327 S+ E+ E ++S++ Y+++ +N+K I +KIR G + + LE+ L Sbjct: 284 SLFEIAESLESLSENDYQEMIKNSKRIGKKIRNGIFLKTALEKCERSL 331 >UniRef50_C9A0R8 Putative uncharacterized protein n=1 Tax=Enterococcus gallinarum EG2 RepID=C9A0R8_ENTGA Length = 338 Score = 291 bits (744), Expect = 3e-77, Method: Composition-based stats. Identities = 72/331 (21%), Positives = 133/331 (40%), Gaps = 10/331 (3%) Query: 2 YFLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLE 61 + + + D+ KA+ D DIA + + + ++ G+ Sbjct: 4 WVTTIIESNAADSVKKAKADVCDIAKGMDYQPLYIYRYIDENEDDYALTSRIDGITAGVA 63 Query: 62 NKDVLIFNFPMAKPFWHILSFFHRLL--KFRIVPLIHDIDELRGGGGSDSV-RLATCDMV 118 N+D++++ +P F R+ + IHD + LRG D ++ Sbjct: 64 NQDMVVYQYPSYNGAHFDRMFLQRMKQRGIYTILFIHDAEMLRGKVDFDEAALFNEATLL 123 Query: 119 ISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFI 178 I H+ M L + K+ FDY ++ V++AGNL++ Sbjct: 124 IVHSQAMQTALVERGVIRKMVQKPFFDYRHKEV--SVSHERPEKRVVFAGNLAKTLFLQQ 181 Query: 179 YTEGCDFTLFGVNYENK--DNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFG 236 + + ++G + N Y G F+ + + FGL WD G + Sbjct: 182 WPNRTEILVYGEKNDRPFGANVHYCGVFEQEEL-IRKMEKNGFGLAWDD--KLPAGGDYQ 238 Query: 237 DYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETY 296 Y K+N PHK SLYLS+ +PV +W +AA+A+ + +G + I+E+ + +T E Sbjct: 239 QYTKYNAPHKISLYLSLGIPVIVWQQAAIAEMVQKLGLGIVIAGIEEIDHKLGELTDEEM 298 Query: 297 KQISENTKIISQKIRTGSYFRDVLEEVIDDL 327 ++ N S +R+G + R L + + Sbjct: 299 LRMKNNVLSFSCLLRSGIFTRTALVDSEVKI 329 >UniRef50_C9LPN1 Galactofuranosyltransferase n=2 Tax=Veillonellaceae RepID=C9LPN1_9FIRM Length = 344 Score = 288 bits (737), Expect = 2e-76, Method: Composition-based stats. Identities = 81/340 (23%), Positives = 155/340 (45%), Gaps = 18/340 (5%) Query: 3 FLNDLNFSRRDAGFKARKDALDIA-SDYENI-SVVNIPLWGGVVQRIISSVK----LSTF 56 ++ + +++ G K +D + Y+ I + +P + + L+T Sbjct: 6 YIREKAPNQQHGGNKGVEDINTVLGKKYDEISELYTLPGRRHLFDYARFFTRNWSNLNTI 65 Query: 57 LCGLENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDELRGGG--GSDSVRLAT 114 ++N D+LI +P + L+HD+D +R + L Sbjct: 66 RKSVKNDDILIIQYPHYNFHALGEKLIDLFRIKNTILLVHDVDSVRYQTGIDEEIKLLNL 125 Query: 115 CDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHK 174 +V+ HN +M+ YL K+ + K +I IFDYL+ + + +++AGNL + Sbjct: 126 AKVVLLHNQKMSDYLVKHGLKTKTVNINIFDYLLYNTPSQESF-SFGKQIVFAGNLGKSH 184 Query: 175 CSFIY---TEGCDFTLFGVNYENK----DNPKYLGSFDAQSPEKINLPGMQFGLIWDGDS 227 + + G +LFG + + ++GS+ E FGL+WDG S Sbjct: 185 FLNLMGQDSLGLSLSLFGPGLSEEMKESSHVHWMGSYSPD--EIPFKLKGSFGLVWDGTS 242 Query: 228 VETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEI 287 ++ C G G Y+K N PHK +LY++ +PV W +AA+AD + +IG+ V S++E+ Sbjct: 243 LDECDGFMGRYMKINFPHKLALYIAAGIPVVTWSQAAIADIVKTYKIGFVVDSLREVSNY 302 Query: 288 VDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDDL 327 +DS+ + Y + +N + +K+ +G + E+ + + Sbjct: 303 IDSINEKEYAEYKKNILKLQKKVMSGYFTALAFEKAVTMI 342 >UniRef50_B0N1W1 Putative uncharacterized protein n=1 Tax=Clostridium ramosum DSM 1402 RepID=B0N1W1_9FIRM Length = 358 Score = 288 bits (737), Expect = 2e-76, Method: Composition-based stats. Identities = 103/345 (29%), Positives = 170/345 (49%), Gaps = 23/345 (6%) Query: 3 FLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLW------GGVVQRIISSVKLSTF 56 FL++ G KARKD I ++ P + V Sbjct: 11 FLSEQQKKEYLGGAKARKDIDLILKQLGYKEIICRPCRDFSSPKNVINSLYSIQVNWIKI 70 Query: 57 LCGLENKDVLIFNFPMAKPFWHI--LSFFHRLLKFRIVPLIHDIDELRGG---GGSDSVR 111 + + D+L+ +P K + ++ + K + +IHD+ ++ + Sbjct: 71 KRIIRDNDILVIQYPFGKYDVNDRQIAKIKKTKKVDFIAIIHDLPSIQDKTADKLEEIKL 130 Query: 112 LATCDMVISHNPQMTKYLSKYMS-QDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNL 170 L D+VI HN +M + L + +K+ ++IFDYL + D+ + K G+ AGNL Sbjct: 131 LKKFDIVICHNKKMLEVLKELGIDNNKLVCLEIFDYLCNEDI--KARVSKDDGITVAGNL 188 Query: 171 SRHKCSFIYTE-------GCDFTLFGVNYENKDNPKYLGSFDAQSPEKINLPGMQFGLIW 223 S K +IY F L+G N+E + Y GS + E I +GLIW Sbjct: 189 SSSKAGYIYKLLDKCNEENIIFNLYGPNFERDNESSYNGSLPPE--ELIKKIKGSYGLIW 246 Query: 224 DGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKE 283 DGDS+E C+G FG+Y K NNPH+ S+ L+ ++P+ IW +AAL DF++DN IG A+ S+K Sbjct: 247 DGDSLELCNGTFGEYQKINNPHRVSMNLAAKMPILIWKEAALKDFVIDNNIGVAIDSLKN 306 Query: 284 MQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDDLK 328 +++I++S+ Y + +N + +S+KIR+G Y + + E I L+ Sbjct: 307 IKDILNSIKDSDYDIMRDNLESVSKKIRSGYYTKKAINEAILKLE 351 >UniRef50_C2EH03 Possible galactofuranosyltransferase n=1 Tax=Lactobacillus salivarius ATCC 11741 RepID=C2EH03_9LACO Length = 344 Score = 285 bits (730), Expect = 1e-75, Method: Composition-based stats. Identities = 93/338 (27%), Positives = 164/338 (48%), Gaps = 19/338 (5%) Query: 2 YFLNDLNFSRRDAGFKARKDALDIA-SDYENISVVNIPLWGG----VVQRIISSVKLSTF 56 Y L+ + + +AG KA++D I ++ + +I Sbjct: 12 YLLSVYDKTEYNAGPKAKRDISRILSEKLNFKNIEFYFNLDNTIFSKINKIKYLNWDIPR 71 Query: 57 LCGLENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDELRGGGGSDSVR----- 111 + D + +P+ ++ ++HD++ LR ++ Sbjct: 72 KLKNKKIDNIFIQYPIYSTVVIKKILSSLDRDVKVYYIVHDLESLRLFKNDENYLSEEIN 131 Query: 112 -LATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNL 170 L D +ISHN MTK+L + + ++ D++IFDYL + + ++ + YAGNL Sbjct: 132 RLNDADGIISHNSIMTKWLKENGVKTQVSDLEIFDYLTKNV--APESNSYEKTLCYAGNL 189 Query: 171 SRHKCSFIYTEGCDFTLFGVNYENK--DNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSV 228 + F+ E ++G N + + Y G F + E FGLIWDG+ + Sbjct: 190 QKS--DFLVNEFYPIDVYGPNPKKEYPKTVSYKGVFTPE--ELPKHLKENFGLIWDGNRI 245 Query: 229 ETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIV 288 + C+G +G+Y+K+NNPHK SLYLS LPV IW+KAALA+F+ +++G VGS+ ++Q + Sbjct: 246 DECNGVYGEYMKYNNPHKVSLYLSSGLPVIIWEKAALAEFVSKHQVGIVVGSLAQLQNKL 305 Query: 289 DSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDD 326 S+T E Y + N +++S+K++ G Y + +ID+ Sbjct: 306 GSLTEEEYLNLRYNAQLVSEKLKNGYYIVKAVSNLIDN 343 >UniRef50_C7IU57 Putative uncharacterized protein n=1 Tax=Thermoanaerobacter ethanolicus CCSD1 RepID=C7IU57_THEET Length = 356 Score = 285 bits (729), Expect = 1e-75, Method: Composition-based stats. Identities = 100/355 (28%), Positives = 172/355 (48%), Gaps = 30/355 (8%) Query: 2 YFLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLE 61 YFL AG KA+ DA + + ++ S K L + Sbjct: 4 YFLTLKLTENYTAGSKAKIDAEYFLYQSGFKKLDLYE-GRTKIHKLTSVFKKLRDLPLNK 62 Query: 62 NKDVLIFNFPMAKPF---WHILSFFHRLLKFRIVPLIHDIDELRGGGGS-----DSVRLA 113 K +++ ++P+ P I + R ++ +IHD++ LR + L Sbjct: 63 GKVIIVTHYPLLNPVALKIFIQALELRRCDITLIGIIHDVNSLRYQQNENAVQREIQFLN 122 Query: 114 TCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVE---------HRDVTDKQRGV 164 D +ISHN MTK+L + + KI+++++FDY + + ++ + + + Sbjct: 123 MFDFLISHNSAMTKWLVEQGFKGKIQELELFDYKIDGNKNIVKSEINRTEGELKENRYII 182 Query: 165 IYAGNLSRHKCSFIYTE------GCDFTLFGVNY----ENKDNPKYLGSFDAQSPEKINL 214 +AGNL K FIY+ F L+G N+ ++ N Y G + +S Sbjct: 183 TFAGNLDPQKSGFIYSLENVNFSNLFFYLYGPNFVSNQISEKNIIYKGVY--ESNLLPLY 240 Query: 215 PGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRI 274 +GLIWDGDSV+TCSGA G+YLK+N+PHK SLY+ LPV IW KAA A+ + +I Sbjct: 241 LEGNWGLIWDGDSVKTCSGALGNYLKYNSPHKLSLYIVAGLPVIIWSKAAAAELVKKYKI 300 Query: 275 GYAVGSIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDDLKT 329 G + S++E+ ++S++ E Y+ EN I++ K++ G + + ++I+ ++ Sbjct: 301 GIVIDSLEEIPVKLESISNEEYQNYRENVMILANKLKKGEFIIGAVNKIINSVEE 355 >UniRef50_C7XW37 Glycosyltransferase n=1 Tax=Lactobacillus coleohominis 101-4-CHN RepID=C7XW37_9LACO Length = 338 Score = 279 bits (715), Expect = 7e-74, Method: Composition-based stats. Identities = 83/335 (24%), Positives = 159/335 (47%), Gaps = 14/335 (4%) Query: 3 FLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLEN 62 L + + AG KA D I S+ S +++ + L N Sbjct: 6 LLTFKDIGKNHAGPKATHDIELILSNNGFKSKEFHLNLNSKIEKWYYAHFYFVKLFKNTN 65 Query: 63 KDVLIFNFPMAKPFWH--ILSFFHRLLKFRIVPLIHDIDELRGGGGS------DSVRLAT 114 D L+ +P+ + I+ F + ++ ++HD++ LR + L Sbjct: 66 IDELVVQYPVYSRYIIRAIIKNFRKYSNGKLYFIVHDLEGLRLYKDDSIFGIEEIEFLNL 125 Query: 115 CDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQR-GVIYAGNLSRH 173 D +++HNP M KYL + + KI + FDYLV+ +++ + + +AGNL + Sbjct: 126 VDGIVAHNPSMKKYLEEKGVKSKITCLDFFDYLVNEKNIYKNQKNNMNDRICFAGNLDKA 185 Query: 174 KCSFIYTEG-CDFTLFGVNYEN--KDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVET 230 + ++G+N + KD +Y G F + +FGL+WDGDS++ Sbjct: 186 PFINKMSLNSIKLDVYGINRSSLYKDGIEYKGVFPPDK--LPLILNEKFGLVWDGDSIQC 243 Query: 231 CSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDS 290 C+G +G+Y+K+N+PHK SLYLS +P+ +W ++AL++ + +G +V ++K ++E++ Sbjct: 244 CNGTYGNYIKYNSPHKASLYLSAGIPIIVWKQSALSELVKKYNLGLSVNNLKNIEEVLHK 303 Query: 291 MTIETYKQISENTKIISQKIRTGSYFRDVLEEVID 325 + Y ++ N S+ I++G +E + + Sbjct: 304 IPNCEYNELKSNAIQYSKVIKSGQNIIRAIESLEN 338 >UniRef50_UPI000196CD65 hypothetical protein CATMIT_02517 n=1 Tax=Catenibacterium mitsuokai DSM 15897 RepID=UPI000196CD65 Length = 349 Score = 277 bits (709), Expect = 3e-73, Method: Composition-based stats. Identities = 101/342 (29%), Positives = 157/342 (45%), Gaps = 25/342 (7%) Query: 2 YFLNDLNFSRRDAGFKARKDALDIASDYENISV-VNIPLWGGVVQRIISSVKLSTFLCGL 60 Y+L S DA KA +D I VN+ G + + + L + Sbjct: 12 YYLQVTIDSNLDASTKAVQDCNKILHQNGFAPFEVNLYKSGNKYIKKVHNFLAFNHLNKI 71 Query: 61 ENKDVLIFNFPMA--KPFWHILSFFHRLLKFRIVPLIHDIDELR--------GGGGSDSV 110 + +L+ P+ K + IL + ++ LIHD+D LR D Sbjct: 72 DEGALLVVPHPLYVNKRYIDILEKVKQKKHIKLAFLIHDLDSLRKLFLNAQDDFEYMDHK 131 Query: 111 RLATCDMVISHNPQMTKYLSKYMS-QDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGN 169 D +I+HN M +YL ++KI ++ IFDYL S + R V AGN Sbjct: 132 MYDISDYIIAHNDSMIEYLVSQGVAREKIHNLHIFDYLCDS----NNTIKFDRSVSIAGN 187 Query: 170 LSRHKCSFIYTEG----CDFTLFGVNYENK---DNPKYLGSFDAQSPEKINLPGMQFGLI 222 L K +++ F L+GV+ + N Y G+F E N FGL+ Sbjct: 188 LDEKKSNYLAKLKDIKAVHFDLYGVHLNEEILASNITYHGAFPPD--EINNQLYSGFGLV 245 Query: 223 WDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIK 282 WDG S+E C G G+YLK+NNPHK SLYL +PV IW +AA A F+ + +G V S+ Sbjct: 246 WDGSSIERCDGNTGEYLKYNNPHKLSLYLVSGIPVVIWKEAAEAKFVEEYGLGITVNSLD 305 Query: 283 EMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVI 324 E+ E S++ E Y ++ + ++S++++ G Y ++E+ Sbjct: 306 ELGEKFASLSEEEYFEMVKRVAVVSERLKNGYYLTQAIKEIE 347 >UniRef50_D0R4M2 Putative glycosyltransferase n=1 Tax=Lactobacillus johnsonii FI9785 RepID=D0R4M2_LACJF Length = 349 Score = 276 bits (707), Expect = 5e-73, Method: Composition-based stats. Identities = 107/346 (30%), Positives = 174/346 (50%), Gaps = 20/346 (5%) Query: 2 YFLNDLNFSRRDAGFKARKDALDIASDYENISVV-----NIPLWGGVVQRIISSVKLSTF 56 Y + + ++AG KA D + IA + N V Q+I + Sbjct: 4 YQIVMKTAAGQNAGSKAPNDVVKIAEKLNFEKLFVNVHRNESALDKVKQQIEYKSNWKSV 63 Query: 57 LCGLENKDVLIFNFPMAKPF---WHILSFFHRLLKFRIVPLIHDIDELR------GGGGS 107 +E+ +L+ P+ H L K +++ ++HD++ELR Sbjct: 64 YSKIESNSILLLQVPIYVHQLSRIHFLKKIKSQKKVKLIFVVHDVEELRVAFNNNFQKKQ 123 Query: 108 DSVRLATCDMVISHNPQMTKYLSKYMS-QDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIY 166 L D+++ HN M + K ++KI ++KIFDYL + D+ + + + VI Sbjct: 124 FEDMLKLADVIVVHNEVMANFFEKKGFPKEKIVNLKIFDYLYNFDLNKKVI--FSKKVII 181 Query: 167 AGNLSRHKCSFIYTEGC---DFTLFGVNYENKDNPKYLGSFDAQSPEKINLPGMQFGLIW 223 AGNL K ++ F L+G NY K++ K + E NL FGLIW Sbjct: 182 AGNLDEKKTEYLKKLDKIDAKFDLYGPNYVKKNSNKITYKGVVPANELPNLLDSGFGLIW 241 Query: 224 DGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKE 283 DG+S+ETCSG FG+YLK+NNPHK SLYL+ LPVFIW KAA A F+ +N +GY + S+ + Sbjct: 242 DGNSIETCSGYFGNYLKYNNPHKLSLYLTAGLPVFIWSKAAEAKFVDENHLGYTIDSLSD 301 Query: 284 MQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDDLKT 329 + I++ +T+ Y ++ +N +++ +KI G + L + I+++K Sbjct: 302 IPLILERLTLADYNRLIKNVRLVGEKISRGDFMTVALTDAINNIKE 347 >UniRef50_Q032N6 Glycosyltransferase n=1 Tax=Lactococcus lactis subsp. cremoris SK11 RepID=Q032N6_LACLS Length = 345 Score = 276 bits (707), Expect = 6e-73, Method: Composition-based stats. Identities = 103/341 (30%), Positives = 168/341 (49%), Gaps = 22/341 (6%) Query: 2 YFLNDLNFSRRDAGFKARKDALDIASDYENISV-VNIPLWGGVVQRIISSVKLSTFLCGL 60 Y++N L +AG KA DA +I S+ + + ++ + SV++ + L Sbjct: 4 YYINALQKENMNAGSKAVNDATEIFEKMGYESLLSKVNIKNIYLRTLFFSVQVMIRILFL 63 Query: 61 ENKDVLIFNFPMAKPFWHILSFFHRL--LKFRIVPLIHDIDELRGGGGSDSVRLATCD-- 116 ++ NFP F I +F + ++ LIHDI ELR G + + + Sbjct: 64 PKNTKVVSNFPPIFFFERICLYFLKKRSKSLKVFILIHDIYELRIGKNNSTPYRNLLNFK 123 Query: 117 ----MVISHNPQMTKYLSKYMSQDK-IKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLS 171 I+HN +M +L K + I D++IFDYL ++ + VI AGNL+ Sbjct: 124 NSNFYFIAHNDKMVSWLVKEGYKKNNIIDLEIFDYLS--VIKEDAGGTYGKSVIIAGNLA 181 Query: 172 RHKCSFIYTE----GCDFTLFGVNY----ENKDNPKYLGSFDAQSPEKINLPGMQFGLIW 223 K S++ DF L+G N E N Y GSF A E N+ +GLIW Sbjct: 182 PEKSSYLMELFKISEIDFNLYGPNVSSDVEKSKNVIYHGSFPAD--EIPNIIQGSYGLIW 239 Query: 224 DGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKE 283 D ++ +G +G+Y ++NNPHKTSLYL+ P+ +W+KAALA FI+++ +G+ V +++E Sbjct: 240 DSETTIGGTGKYGNYQRYNNPHKTSLYLAAGFPIIMWEKAALASFIMEHNLGFLVNTLEE 299 Query: 284 MQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVI 324 + + + Y ++ EN + KIR G + + LE+ Sbjct: 300 IPSKIAKIKEVDYNRMRENVEKFGNKIRMGYFLTEALEKAE 340 >UniRef50_B0P5G1 Putative uncharacterized protein n=1 Tax=Clostridium sp. SS2/1 RepID=B0P5G1_9CLOT Length = 359 Score = 275 bits (704), Expect = 1e-72, Method: Composition-based stats. Identities = 106/357 (29%), Positives = 173/357 (48%), Gaps = 34/357 (9%) Query: 1 MYFLNDLN---FSRRDAGFKARKDALDIASDYENISVVNIPLWGG----VVQRIISSVK- 52 +Y + + S A KAR D DI D++ + +++++ S Sbjct: 5 IYIVKEHICSGESEFTAAGKARIDVEDILYDWKAKDIKIKIKKNSNENSILKKLFSHYHL 64 Query: 53 ---LSTFLCGLENKDVLIFNFPMAKPFWHI--LSFFHRLLKFRIVPLIHDIDELRGGGGS 107 L L+ DVLI FP+ + F L + + + +IHD++ LR + Sbjct: 65 YQIWKKSLDKLKEGDVLIIQFPLQEGFIFASHLLKNLKKKNIKTIAVIHDLETLRITKDN 124 Query: 108 -------------DSVRLATCDMVISHNPQMTKYLSKYMS-QDKIKDIKIFDYLVSSDVE 153 + L ++ HN M K L K +D + +K+FDYL+ E Sbjct: 125 TISKKRKIRLYIEEIPTLKQFSKIVVHNQSMKKVLMKKGISEDSMVTLKMFDYLIKEGNE 184 Query: 154 HR-DVTDKQRGVIYAGNLSRHKCSFIYTE--GCDFTLFGVNYE--NKDNPKYLGSFDAQS 208 + K+ +I AGNLS +K ++Y F L+G+NY + KY GS+ S Sbjct: 185 LPGNTKSKENNIIIAGNLSSYKVGYVYELPHDVKFDLYGINYTGVTDNKIKYHGSYP--S 242 Query: 209 PEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADF 268 E +GL+WDGD+ +TCSG FGDYL+ NNPHKTSLYL+ +P+ W+KAA+A + Sbjct: 243 DELPWHLKGAYGLVWDGDTAKTCSGIFGDYLRINNPHKTSLYLACGIPIITWNKAAIAQY 302 Query: 269 IVDNRIGYAVGSIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVID 325 + NR+G V S+ E+ E + ++ + Y + +N K S+++R G Y + ++E + Sbjct: 303 VRKNRVGITVSSLDEINEKLKDVSKDEYNLMRKNAKKCSERVRKGYYLKKAIQEALS 359 >UniRef50_Q7P740 Nucleotide sugar synthetase n=1 Tax=Fusobacterium nucleatum subsp. vincentii ATCC 49256 RepID=Q7P740_FUSNV Length = 357 Score = 269 bits (689), Expect = 7e-71, Method: Composition-based stats. Identities = 104/356 (29%), Positives = 179/356 (50%), Gaps = 36/356 (10%) Query: 2 YFLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGV-----VQRIISSVKLSTF 56 + + L+ + A KAR D +I + +++I + + Sbjct: 4 FIVEKLSKLEKTAWSKARNDVEEILISEGYQPLEIFSNLDDRSNMSTIKKIRAHFHMKKI 63 Query: 57 ----LCGLENKDVLIFNFPMAKPFWHILSFFH--RLLKFRIVPLIHDIDELRGGGGS--- 107 L L++ D + F FP+ + + +L +IV LIHD++ +R Sbjct: 64 WEKKLSVLKSGDSIFFQFPVVHNSIFLHNILKRLKLKGIKIVVLIHDMESIRLISEKSLS 123 Query: 108 ----------DSVRLATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDV 157 + L +I+ N M K+L ++ +++IFDYL+S +VE + + Sbjct: 124 FLQKLRIKIEEFEFLKASSYLITPNKYMRKFLEDKNITIQMGELEIFDYLISEEVEEKIL 183 Query: 158 TDK---QRGVIYAGNLSRHKCSFIYTE--GCDFTLFGVNY-----ENKDNPKYLGSFDAQ 207 K + ++ AGNLS+ K +++Y +F L+GVNY +N Y GS+ A Sbjct: 184 EKKVSSKNSIVIAGNLSKEKSAYVYLLPTNLNFELYGVNYIEDKDSQSENINYNGSYMAD 243 Query: 208 SPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALAD 267 + +FGL+WDG S+ETC G +G YL +NNPHK SLYL E+P+ IW+KAALA Sbjct: 244 K--LPAVLNGKFGLVWDGSSIETCKGGYGKYLMYNNPHKVSLYLVSEIPIIIWEKAALAS 301 Query: 268 FIVDNRIGYAVGSIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEV 323 FI++N+IG+ + S+ ++ E + ++ E YK + +NT I SQ++ G Y + ++ ++ Sbjct: 302 FIIENKIGFTINSLNDINEKLKGLSDEEYKVMKQNTVIFSQRLSKGFYLKKIIRDI 357 >UniRef50_B1MXC4 Glycosyltransferase n=3 Tax=Leuconostoc RepID=B1MXC4_LEUCK Length = 327 Score = 267 bits (683), Expect = 4e-70, Method: Composition-based stats. Identities = 66/320 (20%), Positives = 122/320 (38%), Gaps = 20/320 (6%) Query: 14 AGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMA 73 KA+ D IA + + ++++L + D+++ FP Sbjct: 16 GALKAKADYAHIADQSGWTVLPLARYNDARYDDATRTQFINSWLQQVNTSDIVLHQFPSY 75 Query: 74 KPFWHILSFFH--RLLKFRIVPLIHDIDELRGGGGS--DSVRLATCDMVISHNPQMTKYL 129 + F + + + LIHDI+ LR + L D+VI H+ M L Sbjct: 76 MSEKFEVQFAKTLKARQVKRAILIHDIEPLRLMKHPIWEFDLLNLYDIVIVHSQAMKVQL 135 Query: 130 SKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIYTEGCDFTLFG 189 + +FDYL + +AG + + LFG Sbjct: 136 QSLGVTSQFIIQPLFDYL----GLSYPFVSFSHEINFAGTFQKSPWLQQA-QNVHINLFG 190 Query: 190 VNYEN------KDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNN 243 + N Y G+ D + + I FGLIWD D + + Y K+N Sbjct: 191 AKPKKWRDTTFPANVTYKGNLDPE--QLIMAFRDGFGLIWDNDFEDK---TYKTYTKYNA 245 Query: 244 PHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQISENT 303 PHK SLY+ LP+ W ++A+ I + IG+ + + ++ + T + +N Sbjct: 246 PHKASLYIRAGLPLIAWRESAIGQIIAEQEIGFVIDKLNQLPAQLSETTAAQFNLWQQNM 305 Query: 304 KIISQKIRTGSYFRDVLEEV 323 + ++Q++ +G + + L ++ Sbjct: 306 QPLAQQLASGYFTKATLTQL 325 >UniRef50_C7TE97 Glycosyl transferase,galactofuranosyltransferase n=2 Tax=Lactobacillus rhamnosus RepID=C7TE97_LACRG Length = 338 Score = 266 bits (679), Expect = 1e-69, Method: Composition-based stats. Identities = 84/330 (25%), Positives = 149/330 (45%), Gaps = 17/330 (5%) Query: 7 LNFSRRDAGFKARKDALDIASDYENI--SVVNIPLWGGVVQRIISSVKLSTFLCGLENKD 64 N DAGFKAR D ++ + + + + + + + L Sbjct: 7 YNSKSFDAGFKARADVKYFSNRMGIKTAEIPATRVNLKINRELQRLRAVRSLSKKLSADQ 66 Query: 65 VLIFNFPMAKPFWHILSFFHR---LLKFRIVPLIHDIDELRGGGG-----SDSVRLATCD 116 ++ +P+ S H+ + + L+HD+ ++G + L D Sbjct: 67 SVLIQYPLP-FNSFDYSLLHKTLLKHNAKCIFLVHDLISVQGQAKNVSIKQEIEELKRAD 125 Query: 117 MVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCS 176 +I HN M +L K+ I FDY V + + + +++AGNL + K Sbjct: 126 FLIVHNQAMQNFLEDQGLSQKMATINFFDYRVDVE---PPIRSEVANIVFAGNLVKSKFL 182 Query: 177 FIYT--EGCDFTLFGVNYENKDNPKYLGSFDA-QSPEKINLPGMQFGLIWDGDSVETCSG 233 E + ++G + P+ + A S + +GL+WDG S + SG Sbjct: 183 KKLPQLEIFKWHVYGSGMTAEQFPESVVFHGAIDSGVLPSKLVDGWGLVWDGISTDRISG 242 Query: 234 AFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTI 293 GDYL+ N+PHK SLYL+ LP+ +W ++ALA+ ++ +G AV ++ E++ ++ S++ Sbjct: 243 VSGDYLRLNSPHKASLYLASGLPLIVWRESALANVVLQLGLGIAVDNLMEIEPVIKSLSH 302 Query: 294 ETYKQISENTKIISQKIRTGSYFRDVLEEV 323 ++I N +IISQKIR G +D LE + Sbjct: 303 TQIEKIQTNVQIISQKIRNGGMLKDALESL 332 >UniRef50_A8RK64 Putative uncharacterized protein n=1 Tax=Clostridium bolteae ATCC BAA-613 RepID=A8RK64_9CLOT Length = 349 Score = 262 bits (670), Expect = 1e-68, Method: Composition-based stats. Identities = 105/347 (30%), Positives = 164/347 (47%), Gaps = 30/347 (8%) Query: 2 YFLNDL-NFSRRDAGFKARKDALDIASDYENI--SVVNIPLWGGVVQRIISSVKLSTFLC 58 Y L+ + + +AG KA D L ++ + + VQ IIS V + L Sbjct: 4 YILSVVGANGQNNAGSKAGNDVLRVSQECGYKLIPLYESNQVRTRVQDIISGVIATYSLR 63 Query: 59 -GLENKDVLIFNFPMAKPFWHILSFFHRLLKFRI--VPLIHDIDELRGGGGSDS------ 109 L + D+++ +P+ + + + K +I LIHDID LR D Sbjct: 64 NKLVDGDIVLMQYPLNRLLMKNIFRILKRCKSKIRIATLIHDIDYLRDIPLGDKGVDGMK 123 Query: 110 ----VRLATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVI 165 L + D +I HNP M + L K + + +FDYL D +++ + VI Sbjct: 124 VLELSLLGSSDYLICHNPFMIRTLQKEKLSVEYISLDLFDYLY--DGTPATISEDKSTVI 181 Query: 166 YAGNLSRHKCSFIYTEG-----CDFTLFGVNYENK----DNPKYLGSFDAQSPEKINLPG 216 AGNL K ++Y +L+G NY DN Y GSF E I Sbjct: 182 VAGNLLESKAGYLYQIKKDKHKFALSLYGSNYAVDKMQMDNATYHGSFKPD--ELIANLY 239 Query: 217 MQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGY 276 +GL+WDG S ETCSG++G YL+ NNPHK SLY++ +PV IW +AAL I +N +G+ Sbjct: 240 GAYGLVWDGSSTETCSGSYGKYLRINNPHKVSLYIAAGIPVVIWKEAALCSLIEENALGF 299 Query: 277 AVGSIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEV 323 + S+ E++E + S Y+ N + +K+ +G + + VL ++ Sbjct: 300 GISSLDELEEALKSH-EHLYQSYRNNVLNMKEKVCSGGFLKYVLVQI 345 >UniRef50_UPI0001968A2E hypothetical protein BACCELL_04078 n=1 Tax=Bacteroides cellulosilyticus DSM 14838 RepID=UPI0001968A2E Length = 355 Score = 260 bits (665), Expect = 5e-68, Method: Composition-based stats. Identities = 85/338 (25%), Positives = 166/338 (49%), Gaps = 15/338 (4%) Query: 2 YFLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISS-VKLSTFLCGL 60 Y++ + +AG KA KD + + +V+ +P + ++I + L T + Sbjct: 16 YYIKFGISANPNAGSKAMKDIMALLDSKGYKAVLALPTRTNKIIKLIDIPILLFTLCFRV 75 Query: 61 ENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDELRGGGGSDSVR-----LATC 115 +++ P +L FF+R+++F+++ I+DI+ +R + +A Sbjct: 76 GRNGTVLYFVPSNFQRIKLLKFFNRIIRFKLICFINDIESMRMEKSKEYAHAEMNSIAVA 135 Query: 116 DMVISHNPQMTKYLS-KYMSQDKIKDIKIFDYLVSSDV----EHRDVTDKQRGVIYAGNL 170 D++++ N + L KY + + I I+DYL + + ++ ++ V +AGNL Sbjct: 136 DIILAPNDNSIQILQNKYHFTNHLVSIGIWDYLNNFEPIASEHTTNMVFNEKSVAFAGNL 195 Query: 171 SRHKCSF-IYTEGCDFTLFGVNYENKD--NPKYLGSFDAQSPEKINLPGMQFGLIWDGDS 227 ++ + + +F ++G N E K N +++G N+ +GL+WDG S Sbjct: 196 NKAPFINELSSVNLNFKIWGSNTEEKKDRNIEFMGKKAPDEL-IENISQCTWGLVWDGIS 254 Query: 228 VETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEI 287 + TC G G YL+FNN HK LYL+ +PV +W+++ +A F+ ++G V S+ + +I Sbjct: 255 INTCCGLLGTYLRFNNSHKCGLYLAARVPVIVWEESGMASFVNKYKVGICVSSLHDAADI 314 Query: 288 VDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVID 325 ++ M + Y +N + I Q I G +F + LE+ Sbjct: 315 INCMDQKVYNIYKKNAQSIGQLISEGKFFLEALEKAEK 352 >UniRef50_A3CM54 Nucleotide sugar synthetase-like protein, putative n=7 Tax=Firmicutes RepID=A3CM54_STRSV Length = 334 Score = 260 bits (664), Expect = 6e-68, Method: Composition-based stats. Identities = 65/317 (20%), Positives = 122/317 (38%), Gaps = 17/317 (5%) Query: 18 ARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFW 77 A+ D +A + + S +L + + DV+I+ P Sbjct: 20 AQNDVTKLAKQLGFNELSFYFYDIYSDSQSELSRRLDGIMASVGYGDVVIYQSPTWNGRE 79 Query: 78 HILSFFHRLL--KFRIVPLIHDIDELRGGGG-----SDSVRLATCDMVISHNPQMTKYLS 130 +F +L + +++ IHD+ L D VI + QM L Sbjct: 80 FDQAFISKLKILQAKLITFIHDVPPLMFPSNYYLMPEYIDMYNQSDAVIVPSEQMRDKLV 139 Query: 131 KYMSQ-DKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIYTEGCDFTLFG 189 DKI +++D+ + K + +AG++ R ++ +F Sbjct: 140 AEGLTVDKILVQRMWDHPYDLPLHQPQFAPK---LYFAGSVERFPHLINWSYATPLEIFS 196 Query: 190 VNYEN--KDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKT 247 E+ + N Y G + L GL+W VE +Y N HK+ Sbjct: 197 PEEESNPEANVSYRGWVSRPEL-LLELSKGGLGLVW---GVEENPADEPEYYGLNISHKS 252 Query: 248 SLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQISENTKIIS 307 + YL+ +PV + + A+ I D +G+ V S++E IV+++T E Y+ + E + S Sbjct: 253 ATYLAAGIPVIVPSYLSNAELIRDRGLGFVVDSLEEASRIVENLTAEEYQAMVERVRKFS 312 Query: 308 QKIRTGSYFRDVLEEVI 324 ++ G + + VL + + Sbjct: 313 FLLKEGYFSKKVLVDAV 329 >UniRef50_A7HN15 Galactofuranosyltransferase n=1 Tax=Fervidobacterium nodosum Rt17-B1 RepID=A7HN15_FERNB Length = 350 Score = 257 bits (658), Expect = 3e-67, Method: Composition-based stats. Identities = 82/345 (23%), Positives = 149/345 (43%), Gaps = 22/345 (6%) Query: 2 YFLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLE 61 Y S+ +AG+KA+ D I + V RI S +L + Sbjct: 8 YVPYFHWDSKFNAGYKAKNDVEIIFESAKFKRVDIFKKASDSNSRIFSLSRLISLYLKRN 67 Query: 62 --NKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDELRGGGGSDSVR----LATC 115 N ++ F + + +IHDI+ +R D R + Sbjct: 68 FANNAIVFFQNGTGLDLLIAPALRKAFKNAKRCIVIHDIESIRLARSIDFTREKLVFSNF 127 Query: 116 DMVISHNPQMTKYLSKY-MSQDKIKDIKIFDYLVSSDVEHRDVTD-----KQRGVIYAGN 169 + H+ +M Y+ + + KI + +FDY++ + V R ++ + + +AGN Sbjct: 128 THAVCHSKKMADYIKEKLGYKGKIYILGLFDYILDTPVYERVMSKTLPSLGKYVISFAGN 187 Query: 170 LSRHKCSF-----IYTEGCDFTLFGVNYENKDN---PKYLGSFDAQSPEKINLPGMQFGL 221 LS+ + L+G Y+ +Y G F E FGL Sbjct: 188 LSKSTFLKKIIKEVNPLNYTVYLYGKGYDGDTKDGVLEYKGVFHPD--ELPYKIEGHFGL 245 Query: 222 IWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSI 281 +WDG+ V SG G YLK+N+PHK SLY+ LP+ +W ++A+ + + + IG+ V S+ Sbjct: 246 VWDGEEVNGISGTVGHYLKYNSPHKASLYIVSGLPLIVWKESAIYETVKEYNIGFGVNSL 305 Query: 282 KEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDD 326 KE+ EI+ ++ + Y+ ENT + +K+ +G ++++ ++ Sbjct: 306 KEIDEILSKVSEKDYQVWRENTIKLGKKLASGENVKEIINRILSK 350 >UniRef50_C7TIE1 Glycosyl transferase, galactofuranosyltransferase n=2 Tax=Lactobacillus rhamnosus RepID=C7TIE1_LACRL Length = 338 Score = 254 bits (650), Expect = 2e-66, Method: Composition-based stats. Identities = 67/342 (19%), Positives = 126/342 (36%), Gaps = 26/342 (7%) Query: 2 YFLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLE 61 Y ++ L A KA+ D + + S ++ + + Sbjct: 3 YVISPLQPDTDQATVKAKMDTAYFFGKVGFRELFLSRYVFWNDEHWRS--EILGIIATVG 60 Query: 62 NKDVLIFNFPMAK--PFWHILSFFHRLLKFRIVPLIHDIDELRG----GGGSDSVRLATC 115 DV+I+ P + I+ +HD++ LR G + Sbjct: 61 KGDVVIYQIPTYAEPSVEKAVVELVHKQGALIIAFVHDVEYLRFPDSYDKGQVLSFFKSF 120 Query: 116 DMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHK- 174 D +I + + L+ + + Y + + YAGNL K Sbjct: 121 DALIVGTQLVKEKLAADGVNIPMIPSGPWGYRQPI---AYRRPSFSKTLHYAGNLVDRKA 177 Query: 175 -CSFIYTEGCDFTLFGV-------NYENKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGD 226 + E ++G + D+ +YLGS+ + +GLIWD D Sbjct: 178 GFLQNFPENLHIKVYGSADGKTDLPFSLADSVEYLGSYRQEELAL--ALNDGYGLIWDED 235 Query: 227 SVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQE 286 F Y + N HK SLYLS+ LPV ++ A+ ++ +N +G A+ S+ + Sbjct: 236 K----EHHFDPYARINMTHKFSLYLSLGLPVIACNQTAIGRYVSENGLGIAIDSLDNLGN 291 Query: 287 IVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDDLK 328 I++ +T + + +I + IS IR+G + + + + +K Sbjct: 292 IIEGVTEDDFNRIVDKVANISDLIRSGRHNQMAALQAVLAVK 333 >UniRef50_Q3DVD0 Nucleotide sugar synthetase-like protein n=9 Tax=Streptococcus agalactiae RepID=Q3DVD0_STRAG Length = 335 Score = 253 bits (646), Expect = 7e-66, Method: Composition-based stats. Identities = 65/316 (20%), Positives = 117/316 (37%), Gaps = 16/316 (5%) Query: 22 ALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILS 81 D+ + S ++ + GL D+++F P Sbjct: 24 VKDVGRQLGYDEMGIYFYNDHAETHGERSTRMDGIIAGLGRGDIVVFQVPTWNSTEFDEL 83 Query: 82 FFHRLL--KFRIVPLIHDIDELRGGGGS-----DSVRLATCDMVISHNPQMTKYLSKYMS 134 F +L RI+ +HDI L D+VI M YL + Sbjct: 84 FLDKLQAYGARIITFVHDIVPLMFESNFYLLDRVIDMYNRSDVVILPTKAMHDYLIEKGM 143 Query: 135 Q-DKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIYTEGCDFTLFGVNY- 192 K+ +++D+ V+ D+ + Q+ + +AG++ R + E +G Sbjct: 144 TTSKVLYQEVWDHPVNIDLPRPEC---QKVLSFAGDIQRFPFVNDWKENIPLIYYGDGSR 200 Query: 193 -ENKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYL 251 ++ N G D FGL W D E Y + N +K S +L Sbjct: 201 LNSEANVHAQGWKDDVELMLSLSKRGGFGLCWSEDREELVER---RYSRMNASYKLSTFL 257 Query: 252 SMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQISENTKIISQKIR 311 + LP+ + DFI + +G+ V +++E E +++M ETY EN + I+ +R Sbjct: 258 AAGLPIIANHDISSRDFIKQHGLGFTVETLEEAVEKINNMEKETYDSYVENVEKIATLLR 317 Query: 312 TGSYFRDVLEEVIDDL 327 G + +L + + L Sbjct: 318 NGYITKKLLIDAVHML 333 >UniRef50_B1I7N2 Nss n=10 Tax=Streptococcus pneumoniae RepID=B1I7N2_STRPI Length = 336 Score = 252 bits (645), Expect = 1e-65, Method: Composition-based stats. Identities = 65/317 (20%), Positives = 121/317 (38%), Gaps = 16/317 (5%) Query: 18 ARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFW 77 A+ IAS V + +L + + D+L+F P F Sbjct: 20 AQNAVQKIASQLGFREVGIYFYNIASDSPSEMNKRLDGIMASISIGDILVFQSPTWNGFE 79 Query: 78 HILSFFHRLL--KFRIVPLIHDIDELRGGGG-----SDSVRLATCDMVISHNPQMTKYLS 130 F +L + +I+ IHD+ L D++I + +M L Sbjct: 80 FDRLLFDKLKDMQVKIICFIHDVVPLMFDSNYYLMKDYMYMYNLSDVLIVPSERMKTRLM 139 Query: 131 KYMSQ-DKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIYTEGCDFTLFG 189 + KI ++D+ + ++ + +AG+L R +++ +F Sbjct: 140 EEGLTTKKILVQGMWDHPHDLSLY---TPAFKKELFFAGSLERFPDLQNWSQDTPLRVFS 196 Query: 190 VNYENKDNPKYLGSFDAQSPE--KINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKT 247 E + + L + E + L FGL+W G Y N HK Sbjct: 197 NKGEASSSARNLSIEGWKKDEELLLELSKGGFGLVW---GTYQNDGESNQYYTLNISHKV 253 Query: 248 SLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQISENTKIIS 307 S YL+ +PV + + A FIVD +G+ S++E+ IVD M ++ Y++++ K S Sbjct: 254 STYLTAGIPVIVPSSLSTAKFIVDQGLGFVANSLEEVHAIVDKMNLQEYQEMTNRIKTFS 313 Query: 308 QKIRTGSYFRDVLEEVI 324 ++ G + + + + I Sbjct: 314 YLLKEGYFTKKLFVDAI 330 >UniRef50_C3QC04 Galactofuranosyltransferase n=3 Tax=Bacteroides RepID=C3QC04_9BACE Length = 334 Score = 250 bits (638), Expect = 5e-65, Method: Composition-based stats. Identities = 82/313 (26%), Positives = 139/313 (44%), Gaps = 21/313 (6%) Query: 14 AGFKARKDALDIASDYEN--ISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFP 71 A KA +D IA ++ ++ ++ +K+ L N L+ +P Sbjct: 25 ASVKAPQDIHKIALQNGYEEYPIILRGYKNKLLFIVVLFLKMIRLAINLPNGATLLIQYP 84 Query: 72 MAKPFWHILSFFHRLLKFR-IVPLIHDIDELRGG---GGSDSVRLATCDMVISHNPQMTK 127 P +L F LK + ++ L+HDI+ +R G ++ L+ D +I H P+M Sbjct: 85 SLNP--KMLYFIFPFLKKKYLITLLHDINSVREKGELSGFENKVLSNFDEIIVHTPEMQT 142 Query: 128 YLSKYM-SQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIY---TEGC 183 Y + + K + F Y+ D E R ++ + V +AGN+ + + + Sbjct: 143 YFEQRLRPGIKYHYLGCFPYIAVPDKEARQLS---KQVCFAGNIDKSVFFSDFVFENKDL 199 Query: 184 DFTLFGVNYEN---KDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLK 240 D ++G N K+ +Y G F I +GL+WDGDS ETCSG +G YLK Sbjct: 200 DLIVYGSCSSNNAMKNKYEYKGVFKPD---MIGHLEGSWGLVWDGDSTETCSGTWGSYLK 256 Query: 241 FNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQIS 300 PHK SLY+ LP+ +W +A+A + +G V S+ E+ + +++ YK+ Sbjct: 257 IIAPHKFSLYVLAGLPLIVWKDSAMAKLVEMKNLGITVTSLSEISARISAVSDNDYKEYC 316 Query: 301 ENTKIISQKIRTG 313 N + G Sbjct: 317 ANILKFQPVLLKG 329 >UniRef50_C6Z1L9 Glycosyltransferase n=1 Tax=Bacteroides sp. 4_3_47FAA RepID=C6Z1L9_9BACE Length = 400 Score = 249 bits (636), Expect = 1e-64, Method: Composition-based stats. Identities = 84/354 (23%), Positives = 155/354 (43%), Gaps = 32/354 (9%) Query: 5 NDLNFSRRDAGFKARKDALDIASDYENISVVN---------IPLWGGVVQRIISSVKLST 55 ++ ++ +A K R D + A + I + ++ + + L + Sbjct: 49 RFISENKYNAASKPRNDTITTAIRLGFKPFIFNSRILGRSKIRYFRTILFWLNQILLLVS 108 Query: 56 FLCGLE--NKDVLIFNFPMAK---PFWHILSFFHRLLKFRIVPLIHDIDELRGGG----G 106 V+ +P F + + + + V L+HDI+ +R Sbjct: 109 ICFRCRNIKDSVIFIQYPFIIFNYHFAKFILLIFKSRRCKFVVLLHDIETIRQKRIKPIK 168 Query: 107 SDSVRLATCDMVISHNPQMTKYL--SKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGV 164 D + L D++I H QM + + K+ + FDYL S ++ D + Sbjct: 169 MDRIILDLADVIIVHTHQMAEKISCIDKCPNSKLIKLAFFDYLSSIEMIGNDSAAN-INL 227 Query: 165 IYAGNLSRHKCS-----FIYTEGCDFTLFGV---NYENKDNPKYLGSFDAQSPEKINLPG 216 IYAGNL + + L+G N N + +Y G F A + I Sbjct: 228 IYAGNLDKSLFLRRLQDVGFNNEFKMFLYGAYSDNIPNTEGVEYKGKFAADRFDSIE--- 284 Query: 217 MQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGY 276 +GL+WDG+SV++C+G +G+YLK N+P K SLYL+ PV +W K+ALA ++ + ++G Sbjct: 285 GNWGLVWDGESVDSCTGQYGEYLKINSPFKFSLYLAANRPVVVWSKSALASYVKEYKLGI 344 Query: 277 AVGSIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDDLKTR 330 V S+K++++ + S+TI+ I + S++I++G + + L + Sbjct: 345 CVDSLKDIEKTIKSLTIDELVNIQSSVYEYSKRIKSGKMLETAIFSSLKLLHEK 398 >UniRef50_C7G7A4 Putative uncharacterized protein n=1 Tax=Roseburia intestinalis L1-82 RepID=C7G7A4_9FIRM Length = 345 Score = 248 bits (633), Expect = 2e-64, Method: Composition-based stats. Identities = 71/343 (20%), Positives = 151/343 (44%), Gaps = 22/343 (6%) Query: 1 MYFLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFL-CG 59 +Y LN DA KA +D + +D + + ++P +++ L FL Sbjct: 5 IYVLNQRQDETFDAAGKAMRDVFSVLADKKAKIIWSVPKHCSKYLKLLDLPYLVLFLLFC 64 Query: 60 LENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDELRGGGGS-----DSVRLAT 114 ++ D + ++ P +L L K+RI+ I+D++ R G + + LA Sbjct: 65 VKKSDSVFYSIPENHLKIRLLKRLQLLKKYRIICFINDLNAFRYDGQNDGDPGEVRALAA 124 Query: 115 CDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSS-------DVEHRDVTDKQRGVIYA 167 D +++ N L K + + I+DY ++ ++ H + + + +A Sbjct: 125 ADKILAPNVNTVSMLKKNGISSDMIPVGIWDYRMNETQIAKIREISHAHKKENEVKIAFA 184 Query: 168 GNLSRHKCSFIYTE--GCDFTLFGVNYENKDNPKYLGSFDA---QSPEKI-NLPGMQFGL 221 GNL++ + + L+G + ++ G + S E + M +GL Sbjct: 185 GNLNKSEFLSVMEIPSDVRMELWGKLDQEREKTLADGCYYHGILSSDEIPFAVAEMDYGL 244 Query: 222 IWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSI 281 +WDG + G G+YL++NN HK +LYL+ +PV +W ++ +A+F+ ++ G + + Sbjct: 245 VWDGSGKDEIEGGLGEYLRYNNSHKCALYLASGIPVIVWSRSGMANFVREHACGITIDRL 304 Query: 282 KEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVI 324 ++ + + + Y+++ E ++ K+ G Y ++ Sbjct: 305 GDLDQAIHT---ADYEKLKEAALAVAPKLWEGYYLSQAIDSAC 344 >UniRef50_A2RHU2 Putative galactofuranose transferase n=1 Tax=Lactococcus lactis subsp. cremoris MG1363 RepID=A2RHU2_LACLM Length = 344 Score = 246 bits (629), Expect = 7e-64, Method: Composition-based stats. Identities = 102/345 (29%), Positives = 167/345 (48%), Gaps = 26/345 (7%) Query: 6 DLNFSRRDAGFKARKDALDIASDYENISV--VNIPLWGGVVQRIISSVKLSTFLCGLENK 63 + AG KA+ D+ I D S+ +I + +I++ + L ++ Sbjct: 3 FSIPIDQTAGAKAKIDSDTIFKDSGYKSLFSHHIQTNKVYINKILNIILGIISLTFIKKG 62 Query: 64 DVLIFNFPM----AKPFWHILSFFHRLLKFRIVPLIHDIDELRG---GGGSDSVR---LA 113 V+ N+P K W+ L R+ K +++ LIHD+D +R + L Sbjct: 63 SVITTNYPPNLIDKKIVWNYLYKIKRIKKIKLIILIHDLDFIRNNDNDSNQEKKYIEQLD 122 Query: 114 TCDMVISHNPQMTKYLSKYMS-QDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSR 172 D +I HN +M L + +DK+ D+KIFDYL I AGNL Sbjct: 123 VADAIIVHNTKMIDLLVEKGLSKDKLIDLKIFDYLADIKSSG---GSYGNKFIVAGNLDI 179 Query: 173 HKCSFIYTE----GCDFTLFGVNYE----NKDNPKYLGSFDAQSPEKINLPGMQFGLIWD 224 K ++ G F L+G Y + D KY GSF ++S N+ +GL+WD Sbjct: 180 QKSKYLSKISKIDGIYFNLYGPGYNQNDYDSDKSKYYGSFPSES--IPNVIQGSYGLVWD 237 Query: 225 GDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEM 284 + + G +GDY ++NNPHKTSLYL+ PV +W+KAALA FIV+N +G+ V ++ E+ Sbjct: 238 SEELSGGVGPYGDYQRYNNPHKTSLYLAAGFPVVVWEKAALAPFIVENNLGFVVDNLDEL 297 Query: 285 QEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDDLKT 329 ++ ++ + Y ++ N K I QKI +G + + L++ ++ Sbjct: 298 PSKIEEISEDEYNRMKLNVKEIGQKICSGYFLNEALKKAETVIEE 342 >UniRef50_B5A7L9 Nucleotide sugar synthetase-like protein n=3 Tax=Streptococcus RepID=B5A7L9_STRPA Length = 330 Score = 243 bits (621), Expect = 5e-63, Method: Composition-based stats. Identities = 59/333 (17%), Positives = 121/333 (36%), Gaps = 20/333 (6%) Query: 1 MYFLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGL 60 +Y N S + + D+A + S +L + GL Sbjct: 3 VYITNINGQSIQSTAQLCQNTVTDVAVSLGYRELGIYCYQIHTDSESELSKRLDGIVAGL 62 Query: 61 ENKDVLIFNFPMAKPFWHILSFFHRLL--KFRIVPLIHDIDELRGGGGS-----DSVRLA 113 + DV+IF P ++L +IV IHD+ L G Sbjct: 63 RHGDVVIFQTPTWNTTEFDEKLMNKLKLYDIKIVLFIHDVVPLMFSGNFYLMDRTIAYYN 122 Query: 114 TCDMVISHNPQMTKYLSKYMSQ-DKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSR 172 D+V++ + +M L + K ++D+ + + + +R + + GN R Sbjct: 123 KADVVVAPSQKMIDKLRDFGMNVSKTVVQGMWDHPTQAPMFPAGL---KREIHFPGNPER 179 Query: 173 HKCSFIYTEGCDFTLFG-VNYENKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETC 231 + ++ N E N ++ + + FGL+W D + Sbjct: 180 FSFVKEWKYDIPLKVYTWQNVELPQNVH-KINYRPDEQLLMEMSQGGFGLVWMDDKDK-- 236 Query: 232 SGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSM 291 +Y +K +L+ +PV + + A + I +N +G+ V ++E V ++ Sbjct: 237 -----EYQSLYCSYKLGSFLAAGIPVIVQEGIANQELIENNGLGWIVKDVEEAIMKVKNV 291 Query: 292 TIETYKQISENTKIISQKIRTGSYFRDVLEEVI 324 + Y ++ +N + + +R G + R +L E + Sbjct: 292 NEDEYIELVKNVRSFNPILRKGFFTRRLLTESV 324 >UniRef50_C0XA00 Possible galactofuranosyl transferase n=3 Tax=Lactobacillus RepID=C0XA00_9LACO Length = 337 Score = 241 bits (616), Expect = 2e-62, Method: Composition-based stats. Identities = 51/319 (15%), Positives = 108/319 (33%), Gaps = 19/319 (5%) Query: 18 ARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFW 77 A++ I + S + + L D +IF P Sbjct: 20 AQQMVAKIGVQMGMNELGIYAYHWKEEPDQAKSTRFDGIIASLSVGDTVIFQSPNWIAIE 79 Query: 78 HILSFFHR---LLKFRIVPLIHDIDELRGGGG-----SDSVRLATCDMVISHNPQMTKYL 129 + + + IHD+ L D++I + +M +L Sbjct: 80 WDQALIDHVNIYPNVKKIIFIHDVIPLMFESNRYLLPQHIDYYNKADVLIVPSKKMYDFL 139 Query: 130 SKYMSQ-DKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIY-TEGCDFTL 187 + + +D+ + + + + +AGN + + + Sbjct: 140 RENGLKEKPYVVQHFWDH-YPCQINYFVTPQNNKVINFAGNADKFDFVNNWGNPRVKLQV 198 Query: 188 FGV--NYENKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPH 245 F N +++G + + FGL+W + + +Y+ N + Sbjct: 199 FSDPCKKFEDQNLEFMGWKNDPILLEELRRSGGFGLVWSEEPY------WSEYMTMNTSY 252 Query: 246 KTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQISENTKI 305 K S YL+ +P+ + K A+ I +IG S+ E Q V + + YK+I++N + Sbjct: 253 KLSTYLAAGIPIIVNSKTPEAETIKRKKIGIIADSLAEAQAKVLQVNDDEYKEITDNVES 312 Query: 306 ISQKIRTGSYFRDVLEEVI 324 ++ IR G + + L + + Sbjct: 313 FAKLIREGYFTKKALADAV 331 >UniRef50_C4ZG42 Putative uncharacterized protein n=1 Tax=Eubacterium rectale ATCC 33656 RepID=C4ZG42_EUBR3 Length = 345 Score = 238 bits (607), Expect = 2e-61, Method: Composition-based stats. Identities = 71/319 (22%), Positives = 125/319 (39%), Gaps = 22/319 (6%) Query: 18 ARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFW 77 A + L +A + +L + L D++IF +P Sbjct: 18 AHRRVLKVAQSIGCHEMGLSFYPLKPDYAKEIDKRLDGIIAPLNYGDIVIFQYPSWIGVN 77 Query: 78 HILSFFHR---LLKFRIVPLIHDIDELRGGGGS-----DSVRLATCDMVISHNPQMTKYL 129 + SF ++ +++ + DI +L + L D++I + +M +YL Sbjct: 78 YDESFVNKIKSYRDTKLIIFVQDIQKLMFDSEQAILDMEIKTLNKADLLILPSKKMHRYL 137 Query: 130 SKYMS-QDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIYTEGCDFTLF 188 + + + I+D + SD+ D R +AGN +R Y + Sbjct: 138 KENGLDEKPVIYQTIWD--MPSDICFVDHA-VTRCFHFAGNYNRFPFLAEYHGKTPIYQY 194 Query: 189 GVN---YENKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPH 245 N EN D+ + G F+ + L FGL+W D F Y N P+ Sbjct: 195 DANKPDRENDDSFCWKGYFEQEKL-MHELSKGGFGLVWSDDEY------FDRYYSMNQPY 247 Query: 246 KTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQISENTKI 305 K L+ +PV + F+ N +GYAV ++ E ++V S+T Y ++ N K Sbjct: 248 KLGTNLAAGIPVIVKRGCVHDKFVERNGLGYAVDTLDEADKLVQSITDAEYIELYRNVKN 307 Query: 306 ISQKIRTGSYFRDVLEEVI 324 I + I G+Y R +L++ I Sbjct: 308 IQKLILDGAYTRKILQDAI 326 >UniRef50_Q042V6 Glycosyltransferase n=4 Tax=Lactobacillus gasseri RepID=Q042V6_LACGA Length = 353 Score = 235 bits (600), Expect = 1e-60, Method: Composition-based stats. Identities = 68/331 (20%), Positives = 145/331 (43%), Gaps = 21/331 (6%) Query: 14 AGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNF-PM 72 AG K +D + I + V + R + L L ++ + Sbjct: 26 AGNKFPRDIISIFEKNDYTPVYIREGYVKK--RPWEFLNDVYQLIRLPRNSIVFYIDRVH 83 Query: 73 AKPFWHILSFFHRLLKFRIVPLIHDIDELRGGGGSDSVR------LATCDMVISHNPQMT 126 +++ R + ++ DID LR S + R L + +IS N +MT Sbjct: 84 PNLSRNLVYSILRRKNIKSFSILEDIDPLRDKKMSTNDRKLGLESLNSNKGIISQNKKMT 143 Query: 127 KYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFI-------Y 179 ++L + ++ D+LVS+ E + ++Y GNLS + F+ Sbjct: 144 RFLVNQGVRVTTVELSALDFLVSNYKEKKHKKSADTIIVYGGNLSSEQAGFLNHLPISKS 203 Query: 180 TEGCDFTLFGVN---YENKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFG 236 + ++G+ + N Y G F A+ E I+ +GL+W+ D ++ Sbjct: 204 NNKIKYRVYGMGEMSKQLSSNAIYCGGFSAE--ESIDKLKGDWGLVWNNDGSKSNKSGQN 261 Query: 237 DYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETY 296 Y ++ PHK S+Y +P+ + K+A+ADF+++N+ G + +++E+++ +++++ + Y Sbjct: 262 SYYEYVCPHKLSMYAICGMPIIVGKKSAMADFVINNKCGIVINNLEEIEKKINAISQQEY 321 Query: 297 KQISENTKIISQKIRTGSYFRDVLEEVIDDL 327 + +N I+ K+ G Y ++ + ++ + Sbjct: 322 LEYQKNISKIASKMALGFYTQNAIRKIEKKI 352 >UniRef50_C4Z1X5 Putative uncharacterized protein n=1 Tax=Eubacterium eligens ATCC 27750 RepID=C4Z1X5_EUBE2 Length = 240 Score = 227 bits (580), Expect = 3e-58, Method: Composition-based stats. Identities = 81/240 (33%), Positives = 132/240 (55%), Gaps = 13/240 (5%) Query: 99 DELRGGGGSDSVRLATCDMVISHNPQMTKYLSKYMSQ-DKIKDIKIFDYLVSSDVEHRDV 157 + D D VI+HN +M +YL ++ + KI ++ IFDYL + + ++ + Sbjct: 4 ESQHIFEHIDETMYEIADYVIAHNSKMKRYLIEHGVEESKIYELGIFDYLTNINPNNKSI 63 Query: 158 TDKQRGVIYAGNLSRHKCSFIYTEG-----CDFTLFGVNYEN----KDNPKYLGSFDAQS 208 + + AGNL +K ++I +F L+G+N++ + Y G+F S Sbjct: 64 -RYSKTLNIAGNLDANKSNYIRELNGVDKTINFNLYGLNFDKNVLTSEAIHYKGAFP--S 120 Query: 209 PEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADF 268 E + FGL+WDG++ C+G G+YLK+NNPHK SLY+ LPV IW +AA A+F Sbjct: 121 DEIPSQLTEGFGLVWDGNTASCCAGNTGEYLKYNNPHKLSLYMVSGLPVVIWSQAAEAEF 180 Query: 269 IVDNRIGYAVGSIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDDLK 328 + N +G V SI++ D+++ Y ++ EN K +S K+R G Y R V++++I DLK Sbjct: 181 VKCNNVGLVVDSIEDFSIKFDNLSENDYYKMVENAKNVSYKLRNGEYLRKVIQDIIKDLK 240 >UniRef50_Q04DG9 Glycosyltransferase n=1 Tax=Oenococcus oeni PSU-1 RepID=Q04DG9_OENOB Length = 306 Score = 225 bits (575), Expect = 1e-57, Method: Composition-based stats. Identities = 88/321 (27%), Positives = 146/321 (45%), Gaps = 32/321 (9%) Query: 14 AGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMA 73 KA++D IA + + + R +V LS + D+L+ +P Sbjct: 4 GADKAKEDFAKIAENIGFGIL--------KINRAEKTVDLSVI----KPGDLLVHQYPSY 51 Query: 74 KPFWHILSFFHRLLKF--RIVPLIHDIDELRGG----GGSDSVRLATCDMVISHNPQMTK 127 L+F L K R V LIHD + R L+T D +I+HN +MT Sbjct: 52 LGDQWELNFQKELKKVGSRTVILIHDFETFRIHDYKSKKIAFQVLSTADYLITHNKKMTN 111 Query: 128 YLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIYTEGCDFTL 187 L + I I++FDYL E T ++YAG+LS+ Y+ + Sbjct: 112 RL--FRINQNIFQIELFDYLSP---EKNKTTKIPTSLVYAGSLSKSSWIKNYSLKIPIDI 166 Query: 188 FG--VNYENKDNPKYLGSFDAQSPE-KINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNP 244 FG + + YL P+ ++GL+WD D E + +Y K N+P Sbjct: 167 FGRLPKKWSLEKNDYLVLHKPIIPDQLPIFLNNKWGLVWDED-QEKNKTNYQNYQKINSP 225 Query: 245 HKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQISENTK 304 HK SLYL+ +PV +W+K+A+ F+++N+IG A+ ++ E+ + + I+ S+N Sbjct: 226 HKLSLYLAANIPVIVWEKSAITKFVLENKIGIAINNLAEIPDKIKKAEID-----SDNLD 280 Query: 305 IISQKIRTGSYFRDVLEEVID 325 +S+KIR G + +L ++I Sbjct: 281 NLSKKIRGGYFTEKLLRKIIS 301 >UniRef50_B1MVL6 Putative glycosyl transferase n=1 Tax=Leuconostoc citreum KM20 RepID=B1MVL6_LEUCK Length = 559 Score = 219 bits (558), Expect = 1e-55, Method: Composition-based stats. Identities = 70/324 (21%), Positives = 136/324 (41%), Gaps = 27/324 (8%) Query: 17 KARKDALDIASDYENISV-VNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKP 75 K R D A + + P + +L + + D +++ +P P Sbjct: 244 KPRNDVSKAAMAMGYTPIDFDTPYIDNEK---WMTAQLEKHCLQINSGDTVVWQYPKYSP 300 Query: 76 FW--HILSFFHRLLKFRIVPLIHDIDELRGGGGS--------DSVRLATCDMVISHNPQM 125 ++L++FH ++ IHDI+ LR + D + L++ D I Sbjct: 301 QLELNMLNWFHN-RGIKVASFIHDINLLREEPLNREHYLPEYDKILLSSFDANIVPEKFE 359 Query: 126 TK-YLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIYTEGCD 184 Y + I +K +D+++ V + ++YAG+L++ + Sbjct: 360 QALYSLANVKLKNIVALKPYDFIIQKPVL---PATYSQDIVYAGSLAKFPALEDI--DFN 414 Query: 185 FTLFG-VNYENKD--NPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKF 241 T++G N+ + + NPK + + E + FGLIWD D A Y K+ Sbjct: 415 LTVYGEKNFSDVNFVNPKIIDGGFLPAEELASSLNNGFGLIWDEDRQNPYRQA---YTKW 471 Query: 242 NNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQISE 301 N P+K SLY+ LPV W ++A+A I +G+ V + ++ V S++ + +++ Sbjct: 472 NWPYKFSLYMVSGLPVIAWSESAIAKLIESENLGFIVTDLSQIASKVRSISQTEFNEMAA 531 Query: 302 NTKIISQKIRTGSYFRDVLEEVID 325 N I K+ G+ + L+++ + Sbjct: 532 NAAEIGNKLAHGNSTKTALKKLEN 555 >UniRef50_Q5ULS2 Orf42 n=1 Tax=Lactobacillus phage LP65 RepID=Q5ULS2_9CAUD Length = 337 Score = 217 bits (552), Expect = 5e-55, Method: Composition-based stats. Identities = 73/336 (21%), Positives = 128/336 (38%), Gaps = 20/336 (5%) Query: 2 YFLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLE 61 Y D + DA K RKD +I + +S S + + Sbjct: 5 YTELDSDCIAYDASVKPRKDIEEIVA-INFLSFPLSIPSLKYGDDRNSDEYMRKVASKVS 63 Query: 62 NKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDELRGGG---GSDSVRLATCDMV 118 DV++ P L ++ L+HDI+ RG L D + Sbjct: 64 KGDVVLIQTPAYIADEIGLVNKLHDRGAIVIGLVHDIEYARGFSTDFSDQYKLLKLYDGL 123 Query: 119 ISHNPQMTKYLSKYMSQD-KIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSF 177 + ++ + + I ++++ YL + VEHR + + YAGNLSR F Sbjct: 124 VVTGHRIKAIIQESGISSIPITCMELWPYLTNYVVEHRIEPNN-NRIEYAGNLSRSNGLF 182 Query: 178 IYTEGC--DFTLFGV--NYENKDNPKYLGSFDAQSPE-KINLPGMQFGLIWDGDSVETCS 232 + ++G + N + + A P+ +GL+W D Sbjct: 183 SKSLEGIEHVDVWGKQVDRSNSEKTGLVVQHGAVHPDDLPARLYSGYGLVWYVDR----- 237 Query: 233 GAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMT 292 + DY K N HK SLYLS +LP+ + + L++ + +IG V + E+ E + ++ Sbjct: 238 -KYQDYTKINVSHKASLYLSAKLPLIVSSSSYLSELVDKYKIGICVDRLDEIPEKL--LS 294 Query: 293 IETYKQISENTKI-ISQKIRTGSYFRDVLEEVIDDL 327 Y + N + I I +GS F + +++ L Sbjct: 295 RNDYCKYVNNIEEHIYDSISSGSCFTEPFVDLMSKL 330 >UniRef50_Q03A82 Glycosyltransferase n=1 Tax=Lactobacillus casei ATCC 334 RepID=Q03A82_LACC3 Length = 208 Score = 197 bits (501), Expect = 5e-49, Method: Composition-based stats. Identities = 55/209 (26%), Positives = 94/209 (44%), Gaps = 10/209 (4%) Query: 125 MTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIYTEGCD 184 M K L + I F Y + + + +++AGN++ K E Sbjct: 1 MKKEL---DFKGPIIPQGPFSYRF-IEDDDPVPPKFHKKIVFAGNINNSKYLSQVPEHWH 56 Query: 185 FTLFGVNYENK----DNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLK 240 +FG + N Y GSF E N FGL+WD DS + G +Y + Sbjct: 57 LDVFGGQPHQELLDRQNINYKGSFTPT--ELPNHFDGGFGLVWDSDSFDEVIGEPAEYNR 114 Query: 241 FNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQIS 300 HK SLYL+ +PVFIW AA A+++ +N +G+AV ++ ++ I+++ T + Y + Sbjct: 115 LCYEHKLSLYLAKRMPVFIWKHAAAANWVTENHVGFAVENLADIWPIIENFTEDQYNAMQ 174 Query: 301 ENTKIISQKIRTGSYFRDVLEEVIDDLKT 329 N +S+ IR G + + + + + Sbjct: 175 PNLARVSKLIRNGVFAKHAALDALLAVNE 203 >UniRef50_C4ZG41 Putative uncharacterized protein n=1 Tax=Eubacterium rectale ATCC 33656 RepID=C4ZG41_EUBR3 Length = 303 Score = 192 bits (489), Expect = 1e-47, Method: Composition-based stats. Identities = 61/336 (18%), Positives = 110/336 (32%), Gaps = 49/336 (14%) Query: 2 YFLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLE 61 Y + L R++A A + V++ + L Sbjct: 9 YNIGGLIGLRQNAVKNAG-------ETLGFKEMSLFKFPDTYDSDDELHVRMDGIIASLC 61 Query: 62 NKDVLIFNFPMAK---PFWHILSFFHRLLKFRIVPLIHDIDELRGGGG----SDSVRLAT 114 +D++IF P + + R +IV I +I R + L Sbjct: 62 PEDIVIFQHPSGESPRYDGFLFEHLRRYHGTKIVAFIQEIASDRDDSEYSLSDEITLLNR 121 Query: 115 CDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHK 174 DM I + + Y ++K YL+ + + D + N ++K Sbjct: 122 ADMFIFASAALRDYYIANGLKEK-------PYLIQN------IPDYMTDIC--ANEHKNK 166 Query: 175 CSFIYTEGCDFTLFGVNYENKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGA 234 +I E + +N N + +Y +S + L FGLIWD D Sbjct: 167 KLYIMAETSQ-NEYPLNNLNIEVVQYDEYHVTES--ILRLSDGGFGLIWDTDEQA----- 218 Query: 235 FGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIE 294 LY++ +PV + + ++ DN IG A +++ I S + + Sbjct: 219 ------------LGLYMAAGIPVIVKKGLSCEKYVTDNEIGAAATDFEDVYRIAVSESED 266 Query: 295 TYKQISENTKIISQKIRTGSYFRDVLEEVIDDLKTR 330 Q N K + G Y R +L + + + R Sbjct: 267 KLSQYYANVKKLQDLFINGIYTRKLLLDTLILCRER 302 >UniRef50_B8EDB2 Putative uncharacterized protein n=1 Tax=Shewanella baltica OS223 RepID=B8EDB2_SHEB2 Length = 388 Score = 127 bits (320), Expect = 4e-28, Method: Composition-based stats. Identities = 50/314 (15%), Positives = 99/314 (31%), Gaps = 55/314 (17%) Query: 39 LWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDI 98 + V + + + + L K V + P I+ + +L + V + DI Sbjct: 76 IIRRAVDAVFFMLWVFSILLLKRPKKVYVSTDPP-VLVPFIVMIYCKLFRANYVYHLQDI 134 Query: 99 DE-------------LRGGGGSDSVRLATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFD 145 R G DS+ + D++I+ +M + + + Sbjct: 135 HPEAANVVIPVKPLLFRVLKGMDSITMRHADLLITITKEMAEEIRNRSLTVSPIKL---- 190 Query: 146 YLVSSDVEHRDVT---DKQRGVIYAGN---LSRHKCSFIYTE-------GCDFTLFGVNY 192 L + V V K+ G + GN L R + F G Sbjct: 191 -LANPSVSFEHVAVPLAKKTGFTFCGNAGRLQRMPILIQAIKQYCQAGGTLQFVFAGAGV 249 Query: 193 ---------ENKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNN 243 E N Y G A +++ ++ L+ D V +F Sbjct: 250 YANQLQDLAETYVNVSYKGLVSASEAAQLS-ADYEWALLPIEDEV----------TRFAF 298 Query: 244 PHKTSLYLSMELPV--FIWDKAALADFIVDNRIGYAVG-SIKEMQEIVDSMTIETYKQIS 300 P K+S Y+ + D ++A+++ N +G + ++ + + ++ Y + Sbjct: 299 PSKSSSYVFSGAKILAVCGDYTSVAEWVTTNCLGVVIEPNVDSLCQTFFAIESGGYDKFQ 358 Query: 301 ENTKIISQKIRTGS 314 N + K R G Sbjct: 359 FNIEREQLKKRLGF 372 >UniRef50_B9KB31 Putative uncharacterized protein n=1 Tax=Thermotoga neapolitana DSM 4359 RepID=B9KB31_THENN Length = 370 Score = 127 bits (320), Expect = 4e-28, Method: Composition-based stats. Identities = 60/340 (17%), Positives = 109/340 (32%), Gaps = 53/340 (15%) Query: 3 FLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLEN 62 + ++GF+ R I EN V L + + L E+ Sbjct: 33 VIYQYRTKESESGFEERG-IEYIPLKCENTGSVLRKLSERRIFD-----EKICHLVERED 86 Query: 63 KDVL-IFNFPMAKPFWHILSFFHRLLKFRIVPLIH----------------DIDEL---R 102 DVL + +FP KP L + +I+ IH D+ E R Sbjct: 87 YDVLYLHHFPATKPLKPFL--ITKKQGKKIIYDIHEYHPQNFLNVLPRPLSDLKEFFMWR 144 Query: 103 GGGGSDSVRLATCDMVISHNPQMTKYL--SKYMSQDKIKDIKIFDYLVSSDVEHRDVTDK 160 L D+ I + + + ++ K + + +S D K Sbjct: 145 IFKKQ----LELSDLCIFVSEETRDEIVAKTGLAPSKTFVVPNY----ASLKIEPDSGRK 196 Query: 161 QRGVIYAG----NLSRHKCSF--IYTEGCDFTLFGVNYEN-KDNPKYLGSFDAQSPEKIN 213 ++ +I G NL+ K + +G F + G+ + D P SF Sbjct: 197 RKEIIMVGKTQRNLTYEKKLIKALIEKGFSFRVIGMESKVFSDVPHTYTSFLPYEKMMEE 256 Query: 214 LPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDK-AALADFIVDN 272 + F L+ S T + PHK ++ PV + ++A + + Sbjct: 257 ISKGMFSLV----SYSTIGREDYKNDLYALPHKFYDSIAAGTPVVVKKSFVSMARLVKEL 312 Query: 273 RIGYAVGSIKEMQEIVDSMTI--ETYKQISENTKIISQKI 310 IG + ++ + + + Y++I EN K Sbjct: 313 EIGVVIDP-SNTEDSLRKIEDACQRYERILENIKKHQNLF 351 >UniRef50_B8DZD9 Glycosyl transferase group 1 n=2 Tax=Bacteria RepID=B8DZD9_DICTD Length = 373 Score = 122 bits (306), Expect = 2e-26, Method: Composition-based stats. Identities = 56/325 (17%), Positives = 110/325 (33%), Gaps = 51/325 (15%) Query: 18 ARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFW 77 A+ D +I I ++ +P+ ++R+I + ++ F+ P P Sbjct: 38 AQHDKEEIVDG---IHLIPLPIVRSRIRRMIYLPIRALKEALKLKANIYHFHDPELIPIG 94 Query: 78 HILSFFHRLLKFRIVPLIH-DIDE-----------LRGG-----GGSDSVRLATCDMVIS 120 +L F + +++ +H D+ + LRG + D +I+ Sbjct: 95 VLLKVFAK---GKVIYDVHEDVPKQIMSKYWIPKKLRGIISFIVNLGEKKFSFLFDAIIT 151 Query: 121 HNPQMTKYLSKYMSQDKIKDIKIFDYLVSS-DVEHRDVTDKQRGVIYAGNLSRHKCSFIY 179 L + IK + L +V+ ++ D +IY G LS+ + Sbjct: 152 ATD---DILKNFSFYRNAISIKNYPMLSKFLEVKGKEKKDDVFKIIYIGGLSKIRGISEV 208 Query: 180 TEGCDFTLFGVNYENKDNPKYLGSFDAQSPEKINLPGMQFG----LIW-----------D 224 + ++ V+ + G F EK F L W D Sbjct: 209 VKALEY----VDSNKEVRLILCGKFSPIEYEKEVRNLKGFEKVDYLGWLEPDEVVNKLVD 264 Query: 225 GDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIK-- 282 D+ C +Y+ P K Y++ LPV + + + N G V + Sbjct: 265 VDAGIVCLHPITNYVT-ALPVKLFEYMAAGLPVIASNFPLWREIVEGNNCGICVDPLNPK 323 Query: 283 EMQEIVDSMTI--ETYKQISENTKI 305 E+ E + + + +++ EN K Sbjct: 324 EIAEAIKYLIEHLDKAQKMGENGKK 348 >UniRef50_B0VJD0 Putative uncharacterized protein n=1 Tax=Candidatus Cloacamonas acidaminovorans RepID=B0VJD0_9BACT Length = 377 Score = 118 bits (296), Expect = 3e-25, Method: Composition-based stats. Identities = 51/315 (16%), Positives = 95/315 (30%), Gaps = 60/315 (19%) Query: 65 VLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDEL-------RGGGGSDS-------- 109 ++I P+ IL + KF++V +H+ L R Sbjct: 74 IIICVEPLTLLIAWIL---KKRQKFKVVFDVHEFFALSFSERFPRFLRYPAYLFYQLSLK 130 Query: 110 VRLATCDMVISHNPQMTKYLSKYMSQDKIKDIK------IFDYLVSSDVEHRDVTDKQRG 163 + D V + N ++ L + + ++DY + + + Sbjct: 131 QLMKIADAVFTVNQEICNQLLGRNKRIPSLVLPNYPVKNVWDYECNIPGSLEQLCQMKFD 190 Query: 164 VIYAGNLSRH-------------KCSFIYTEGCDFTLF-----------GVNYENKDNPK 199 IY G L+ K F + + F +N N + Sbjct: 191 FIYTGGLTEDRGIYKILKVVSLLKHDFPFLKVLILGKFLKPETEKRFNQSINDYNLNAII 250 Query: 200 YLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFI 259 Y S+ + L +FGL W L+ + P K YLS LPV Sbjct: 251 YYQSWIPAEKIGLLLKRCRFGL-W-------IFNPKNRRLRLSTPLKVLEYLSAGLPVIT 302 Query: 260 WDKAALADFIVDNRIGYA----VGSIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSY 315 + I N +G ++ + + ++ Y +S+ +S+ Sbjct: 303 IKTPLMKALIEKNGVGICSPYQSKALADACAKMLKLSDNEYNAMSKKCLELSENKYNWET 362 Query: 316 FRDVLEEVIDDLKTR 330 L +VI+ L + Sbjct: 363 MEPELFKVINGLGKK 377 >UniRef50_C4N530 Putative glycosyltransferase n=1 Tax=Capnocytophaga canimorsus RepID=C4N530_9FLAO Length = 403 Score = 117 bits (294), Expect = 5e-25, Method: Composition-based stats. Identities = 44/314 (14%), Positives = 92/314 (29%), Gaps = 54/314 (17%) Query: 37 IPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIH 96 + L+ + + L D +I + P ++ +L K I + Sbjct: 78 VRLFFNYYSFAFFACLKALCLSFRNRYDAIIVHEPSPIIQFYPALLLKKLQKTPIYFWVM 137 Query: 97 DI--DELRGGGGSDSVRL-------------ATCDMVISHNPQMTKYLSKYMSQDKIKDI 141 D+ + L GG + + + ++I+ L K DKI+ Sbjct: 138 DLWPESLEIAGGVRNKIVLGYYERLVKKFYNNSEKILITSKGFRKSILQKGDFSDKIEYF 197 Query: 142 KIF--DYLVSSDVEHRDVT-DKQRGVIYAGNLSRHKCSFIY---------TEGCDFTLFG 189 + D +V D+ + V++AGN+ + + F + G Sbjct: 198 PNWAEDSIVEGDMSYPTPELPSGFRVMFAGNIGEAQDMENIMRATLILKEEKNIQFIIVG 257 Query: 190 ------------VNYENKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGD 237 N +D +G + ++ L+ F Sbjct: 258 DGRKMPFVQDFIKNNSLQDTVHCVGKYPVEAMYS-FFSKADLMLV-----SLKNDKIFN- 310 Query: 238 YLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAV-----GSIKEMQEIVDSMT 292 P K Y++ P+ AD I + G+ + S+ ++ Sbjct: 311 ---LTMPAKIQAYMAASKPIIAMINGEGADIIKEANCGFTIPAGDYKSLSDIILKSSKFK 367 Query: 293 IETYKQISENTKII 306 E +++ +N K Sbjct: 368 KEELEKLGKNGKEF 381 >UniRef50_A8TGW1 Glycosyl transferase group 1 n=2 Tax=Methanococcus voltae RepID=A8TGW1_METVO Length = 378 Score = 110 bits (274), Expect = 1e-22, Method: Composition-based stats. Identities = 49/337 (14%), Positives = 109/337 (32%), Gaps = 58/337 (17%) Query: 6 DLNFSRRDAGFKARKDALDIASDYEN----ISVVNIPLWGGVVQRIISSVKLSTFLCGLE 61 D + K + + I + N+PL+ +I+ + Sbjct: 37 DRDCKNPSETTKEGIEIIRIPVKASYGSMKDFIKNLPLFYKKAYKILKKLDFDAI--HTH 94 Query: 62 NKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDI-DELRGGGGS---------DSVR 111 + D + + K ++ + V IHD+ + D + Sbjct: 95 DFDTAFLGYVIKK---QGKKNTNKTNPIKWVYDIHDLYESFIEKNNPNLAKLISKMDVIL 151 Query: 112 LATCDMVISHNPQMTKYLSKYMSQDKIK-DIKIF-DYLVSSDVEHRDVTDKQRGVIY-AG 168 + D +I N + + + ++ K+ IKI + + + + DK +++ G Sbjct: 152 MKNADDLIVVNEKFINLIDERINDKKVLGKIKIVRNTINPPKITLKSPADKPDFMVFYGG 211 Query: 169 NLSRHKCSFIY-----TEGCDFTLFGVN---------YENKDNPKYLGSFDAQSP-EKIN 213 LS+ + T+ G+ + N ++LG +++N Sbjct: 212 VLSKTRYIMEMINICEELDIKMTIAGMGVLENEIIAHSKESKNIRFLGKLPHDKLLDEMN 271 Query: 214 LPGMQFGLIWDGDSVETCSGAFGDYLKFN---NPHKTSLYLSMELPVFIWDKAALADFIV 270 + F + ++ N P+K + M +P+ + + + D + Sbjct: 272 NYSLNF-------------AIYDPVIRNNQLATPNKLFESMCMGIPIIVTKGSVMGDIVE 318 Query: 271 DNRIGYAVG----SIKEMQEIVDSMTIETYKQISENT 303 N G V S+KE + S E + +S+N Sbjct: 319 KNNCGLTVDFDEKSVKEAILKLKS-DKEFFNTLSKNA 354 >UniRef50_A8U9E3 Putative uncharacterized protein n=1 Tax=Carnobacterium sp. AT7 RepID=A8U9E3_9LACT Length = 402 Score = 98.5 bits (244), Expect = 3e-19, Method: Composition-based stats. Identities = 46/269 (17%), Positives = 96/269 (35%), Gaps = 40/269 (14%) Query: 22 ALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGL-ENKDVLIFNFPMAKPFWHIL 80 D S YE +N+P+ ++ I ++++ + + + D +I + + PF I+ Sbjct: 75 ISDFISSYETY-TINLPIIKHELRFIEYKIQINKWYEKMDKETDKIIIIYDLYIPFLKII 133 Query: 81 SFFHR-LLKFRIVPLIHDI---------------DELRGGGGSDSVRLATCDMVISHNPQ 124 + +IV +I D+ + L + D + Q Sbjct: 134 KWMKDTYENVKIVVMIPDLVGKYRNNSIKSKTKRNLLERKVDKTFELMNQADGYLLITEQ 193 Query: 125 MTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIYTEGCD 184 +++++ + ++ I D +V+ + +KQ +Y+G LS Y Sbjct: 194 ISRFIEDEN-KPRMVIDGIVD---DKNVKFKITNNKQTIFMYSGLLS-----SQYNVDKL 244 Query: 185 FTLFGVNYENKDNPKYLGSFDAQSPEKINLPGMQ---FGLI---------WDGDSVETCS 232 +F +N E +L + + + +GLI + D + Sbjct: 245 IDIF-LNLEENQAQLWLCGYGELEAKLKKIESTNIKFYGLIPKKEVSDLEYQADVLINPR 303 Query: 233 GAFGDYLKFNNPHKTSLYLSMELPVFIWD 261 G+Y K++ P K YL PV + Sbjct: 304 SNKGEYTKYSFPSKNLEYLLKGKPVICYK 332 >UniRef50_C2EWJ6 Putative uncharacterized protein n=1 Tax=Lactobacillus vaginalis ATCC 49540 RepID=C2EWJ6_9LACO Length = 146 Score = 94.3 bits (233), Expect = 5e-18, Method: Composition-based stats. Identities = 27/126 (21%), Positives = 45/126 (35%), Gaps = 12/126 (9%) Query: 153 EHRDVTDKQRGVIYAGN-LSRHKCSFI---YTEGCDFTLFGVNYENK--DNPKYLGSFDA 206 ++ + +AG+ + K F + +F + N +++GS Sbjct: 9 SGKEKPHYAPIINFAGDPTNPEKYGFGGTWFNPDVKLRVFTSPKDWGVGRNIEFVGSMPD 68 Query: 207 QSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALA 266 + FGLIW GD + +Y+K N K S YL+ LPV + Sbjct: 69 IALLNDIRRTGGFGLIWSGDPY------WLEYMKHNCSFKLSTYLAAGLPVIVNSGTPAR 122 Query: 267 DFIVDN 272 D I Sbjct: 123 DIIEKK 128 >UniRef50_C5RCT8 Possible transposase n=2 Tax=Lactobacillales RepID=C5RCT8_WEIPA Length = 307 Score = 91.9 bits (227), Expect = 3e-17, Method: Composition-based stats. Identities = 31/157 (19%), Positives = 52/157 (33%), Gaps = 11/157 (7%) Query: 69 NFPMAKPFWHILSFFHRLLKFRIVPLIHDIDELRGGGGSDSVRLATCDMVISHNPQMTKY 128 F L + K + + IH + G + L D++ + Sbjct: 157 QFQTEDSLDRFLVSQFNVYKEKSLKRIH--RGFKIGVDEEVALLNKFDLITLPSIAAENI 214 Query: 129 LSKYMSQDK-IKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIYTEGCD-FT 186 L K I FD+L + + V +AGN+S K F+ Sbjct: 215 LRKQGLIVPTIIQQGPFDFLTQAPEVSSIFSS---IVNFAGNISFSKVGFLRDINTPNIL 271 Query: 187 LFGVN--YENKDNPKYLGSFDAQSPEKINLPGMQFGL 221 +FG N + +N Y+G F + + I +GL Sbjct: 272 VFGSNLDFTLPNNVSYMGKF--DNDDLIPKLNSGYGL 306 >UniRef50_A3XA25 Glycosyl transferase, group 1 n=1 Tax=Roseobacter sp. MED193 RepID=A3XA25_9RHOB Length = 408 Score = 88.9 bits (219), Expect = 2e-16, Method: Composition-based stats. Identities = 38/325 (11%), Positives = 84/325 (25%), Gaps = 44/325 (13%) Query: 19 RKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLENKD-VLIFNFPMAKPFW 77 + +A + + T D I+ + + Sbjct: 70 KIKIERVAVSPRGKG----GFKDRLKNDLRYLKSCLTHAIKGSYSDAEAIYAYIPSVLTL 125 Query: 78 HILSFFHRLLKFRIVPLIHDIDE--------------LRGGGGSDSVRLATCDMVISHNP 123 + ++ ++HDI+ L+ + + L D VI Sbjct: 126 YGAKVLKMRSGAPLIAIVHDIESGLAHSLGITSKPIMLKIMRMVERIGLNFADHVIVLTE 185 Query: 124 QMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCS------- 176 M + + I + I+ V+Y+GN + + Sbjct: 186 GMKNEIIDIGCRKPIDVLPIW---SQVADIAPIDDAGPVRVMYSGNFGKKQNLDQLLPLL 242 Query: 177 ---FIYTEGCDFTLFGVNYENKDNPKYLGSFDAQSPEKINLPGMQ-FGLIWDGDSVETCS 232 + + G E + + S + + L F +V Sbjct: 243 SHISSTLPAVEIVMRGGGSERPRIEEEVQKRGITSAQFLELAPSDAFMASLQSANVHLVP 302 Query: 233 GAFGDYLKFNNPHKTSLYLSMELPV--FIWDKAALADFIVDNRIGYAVGSIK------EM 284 A + + P K +S P + L D+ G V E+ Sbjct: 303 QAL-NVANYALPSKLFSVMSAGRPFVCIAEKNSPLDVLAQDSGAGICVYPEDEAKLCQEV 361 Query: 285 QEIVDSMTIETYKQISENTKIISQK 309 + ++ + + ++ E+ + QK Sbjct: 362 EALLSDIPRQQ--KMGESGRQFVQK 384 >UniRef50_C5RCT9 Putative uncharacterized protein n=1 Tax=Weissella paramesenteroides ATCC 33313 RepID=C5RCT9_WEIPA Length = 84 Score = 78.5 bits (192), Expect = 3e-13, Method: Composition-based stats. Identities = 15/78 (19%), Positives = 40/78 (51%), Gaps = 1/78 (1%) Query: 249 LYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQISENTKIISQ 308 +YL+ + + + +++DN+ G + +I+ + + + S++ + Y ++ K Sbjct: 1 MYLAAGIVPIADHASNVGKWLIDNKCGITIPNIESLDDAIQSISRQEYDELEIAVKSQQN 60 Query: 309 KIRTGSYFRDVLEEVIDD 326 K+R G Y + L ++I+ Sbjct: 61 KVRQGYYTQK-LVKLINK 77 >UniRef50_A6T0A3 Glycosyltransferase n=4 Tax=Betaproteobacteria RepID=A6T0A3_JANMA Length = 406 Score = 78.1 bits (191), Expect = 5e-13, Method: Composition-based stats. Identities = 45/314 (14%), Positives = 89/314 (28%), Gaps = 54/314 (17%) Query: 37 IPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIH 96 I L + I+S + S +L + D++ P F L K ++ + Sbjct: 77 IRLALNYLSFIVSGLLFSPWLLRKKRYDIVFVYAPSPILQALPAIFIAWLKKCGVIVWVQ 136 Query: 97 DIDE--------------LRGGGGSDSVRLATCDMVISHNPQMTKYLSKYMSQDKIKDIK 142 D+ LR G D+++ + +++ +I Sbjct: 137 DLWPESLEATGYVRNPRILRWVAGMVRFIYRHTDLLLVQSRAFEAKVAELAPGKRIAYYP 196 Query: 143 I-FD--YLVSSDVEHRDVTDKQ--RGVIYAGNLSRHKCSFIYTE---------GCDFTLF 188 D + + + V++AGNL + E +F +F Sbjct: 197 NSVDSTFAEPFSGTLPHIPQLEAGFSVMFAGNLGAGQAVETIVEAAALVRDHAEINFVVF 256 Query: 189 G-----------VNYENKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGD 237 G +N N LG F ++ F S + Sbjct: 257 GHGSRFAWMQEEINRRGLSNIHLLGRFPIETMP-------GFM---QKASALLVTLTDQP 306 Query: 238 YLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAV-----GSIKEMQEIVDSMT 292 P K Y++ P+ A + + + G + + + + MT Sbjct: 307 IFSLTVPSKVQAYMAAGRPILACLNGEGARLVAEAQAGLVIPAESASGLADSILQLYKMT 366 Query: 293 IETYKQISENTKII 306 +Q+ EN + Sbjct: 367 CAEREQMGENGRRY 380 >UniRef50_B5YCR2 WbpH n=10 Tax=Bacteria RepID=B5YCR2_DICT6 Length = 373 Score = 76.5 bits (187), Expect = 1e-12, Method: Composition-based stats. Identities = 49/316 (15%), Positives = 100/316 (31%), Gaps = 57/316 (18%) Query: 30 ENISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLKF 89 + + ++ +P G ++R+I + L N + + P P IL Sbjct: 46 DGVHILPLPKVGSRLERVIKQPWRALRLALKTNSSIYHLHDPELIPIGLILKLL----GK 101 Query: 90 RIVPLIHDIDELRGGGGSDSVRLAT-----------------CDMVISHNPQMTKYLSKY 132 R++ H+ L+ R A D ++ P +T+ K Sbjct: 102 RVIFDSHEDVPLQLLSKPYLNRFALRMLSQVFSIFEKYSCRYFDGIVCATPSITEKFLKI 161 Query: 133 MSQDKIKDIKIFDYLVSSD-VEHRDVTDKQRGVIYAGNLSRHKCSFIYTEGCDFTLFGVN 191 ++ + L S + H +K + Y G +S+ + + +F Sbjct: 162 NPNS--VNVNNYPLLEESKFLYHNYNENKMNEICYIGGISQIRGINELIKALEFV----- 214 Query: 192 YENKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDY------------- 238 DN + + + +S E G W + G Y Sbjct: 215 ----DNVRLNLAGNFESAELEKRIKGMKG--WKKVNYYGFVGRENVYEIMARSKAGVVIF 268 Query: 239 ----LKFNN-PHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIK--EMQEIVDSM 291 N+ P+K Y+S LPV + + + + G V + E+ + + + Sbjct: 269 SPLPNHINSQPNKMFEYMSAGLPVITSNFPLWREIVERDNCGICVDPLNPKEIADAIRYI 328 Query: 292 TI--ETYKQISENTKI 305 E K++ +N + Sbjct: 329 IAHPEEAKKMGDNGRR 344 >UniRef50_B5JXL1 WblG protein n=2 Tax=Gammaproteobacteria RepID=B5JXL1_9GAMM Length = 379 Score = 72.3 bits (176), Expect = 2e-11, Method: Composition-based stats. Identities = 49/319 (15%), Positives = 91/319 (28%), Gaps = 57/319 (17%) Query: 27 SDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRL 86 + I + R+ S +L DV + P P+ + + Sbjct: 51 ESRDGIQFYGVEKGQSRYTRMRRSSRLVYLKAKSLKADVYHLHDPELLPY----ALKLQK 106 Query: 87 LKFRIVPLIH-DIDELRGGGGSDSVRLATC----------------DMVISHNPQMTKYL 129 R++ H D+ + + + VI+ P + Sbjct: 107 QGARVIFDAHEDLPKQLLSKPYINPLVRRAIAWLLGVYEAYVCRRLSGVITATPTIRDKF 166 Query: 130 SKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHK-------CSFIYTEG 182 +K + DI + L E D K V Y G +S + Sbjct: 167 AKINTT--TTDINNYPLLGELSTESSDWVRKSPSVCYIGGISTIRGCAELVEAMQSVGGP 224 Query: 183 CDFTLFGVNYENK------------DNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVET 230 + G N+ + + YLG + + + M GL+ S Sbjct: 225 VKLQMAG-NFTDSTIESRCKQSGGWERVNYLGYLNREEVRDLLAVSMA-GLVTFYPSPNH 282 Query: 231 CSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSI--KEMQEIV 288 P+K Y+S +PV + I N G V + +E+ + + Sbjct: 283 VDAQ---------PNKMFEYMSSGIPVIGSRFPLWQEIIEGNNCGICVDPLDPEEVAKAI 333 Query: 289 DSM--TIETYKQISENTKI 305 + +T + + EN K Sbjct: 334 SFIVQNPDTAESMGENGKR 352 >UniRef50_B9P363 Glycosyl transferase, group 1 n=1 Tax=Prochlorococcus marinus str. MIT 9202 RepID=B9P363_PROMA Length = 407 Score = 69.6 bits (169), Expect = 1e-10, Method: Composition-based stats. Identities = 45/323 (13%), Positives = 95/323 (29%), Gaps = 39/323 (12%) Query: 24 DIASDYENISVVNIPLWGGVVQRIISSVKLSTF--LCGLENKDVLIFNFPMAKPFWHILS 81 I + + IS ++T L + D++ P + Sbjct: 67 KILLHRVFLFPSHDKSSIKRAINYISFAIMATLYGLFKINKPDIIYAYHPPLT-VGICGA 125 Query: 82 FFHRLLKFRIVPLIHDIDE--------------LRGGGGSDSVRLATCDMVISHNPQMTK 127 + K +V I D+ L+ D +I + Sbjct: 126 ILKKFYKVPLVYDIQDMWPDSLKATGMVNSKLILKITSKLCKKTYKLSDKIIVLSNGFRN 185 Query: 128 YLSKYMS-QDKIKDIKIFDYLVSSDVEHRDV---TDKQRGVIYAGNLSRHKCS------- 176 L K + KI+ I + + + ++ ++ K+ +I+AGN+ + + Sbjct: 186 CLIKRGVEKSKIEIIYNWSNIDNKNINTSNIIQINKKKFNIIFAGNIGKAQSLETLLYAA 245 Query: 177 ---FIYTEGCDFTLFGVNYENKDNPKYLGSFDAQSPEKINLPGMQF--GLIWDGDSVETC 231 + DF + G + + + + + + I ++ G + D+ Sbjct: 246 EIIKTKNQNIDFYIIGDGIDLINLKNQVKNMHLDNIKFIPRIEPKYIGGYLNKADAFLVH 305 Query: 232 SGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGS-----IKEMQE 286 K P KT Y++ P+ + AD I + G + E Sbjct: 306 LRN-NSLFKITIPSKTQTYMAFGKPIIMAVNGDAADLIKEAECGIVTEPQNPKQLAIAIE 364 Query: 287 IVDSMTIETYKQISENTKIISQK 309 + S + +I N K Sbjct: 365 KLVSYKKKRLNKIGLNGLNFYNK 387 >UniRef50_A9BF89 Glycosyl transferase group 1 n=8 Tax=Thermotogaceae RepID=A9BF89_PETMO Length = 369 Score = 68.8 bits (167), Expect = 3e-10, Method: Composition-based stats. Identities = 49/321 (15%), Positives = 111/321 (34%), Gaps = 41/321 (12%) Query: 19 RKDALDIASDYENISVVNI-PLWGGVVQRIISSVKLSTFLCGL---ENKDVLIFNFPMAK 74 + D + + + + I G ++++I+ L +C L EN D+L + Sbjct: 38 KDDKEYTDGNIKYLPIKEINETIGNPLKKLINRRPLDKKICDLVAEENYDILYMHH-FLA 96 Query: 75 PFWHILSFFHRLLKFRIVPLIHDIDELRGGGGSD---------------SVRLATCDMVI 119 + +IV IH+ + +L D+ I Sbjct: 97 SKPLDPFKIAKKRNKKIVYDIHEYHPENFLAELEGMIGNLKVKTVWRFFKKQLDLSDLAI 156 Query: 120 SHNPQMTKYLSK--YMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNL-----SR 172 + + + + ++K I + ++ + D+ K++ ++ G + Sbjct: 157 FVSEETRNDVVNKTNIDKEKTYIIPNY----ANFIIKPDIQKKRKEIVLVGKVTRKIEDE 212 Query: 173 HKCSF-IYTEGCDFTLFGVNYEN-KDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVET 230 K + +G F + G++ + D F L F LI S T Sbjct: 213 KKILKSLIEKGFSFKIIGMDSKEFMDITHESTEFLPYDEMMNELSNSLFSLI----SYNT 268 Query: 231 CSGAFGDYLKFNNPHKTSLYLSMELPVFIWDK-AALADFIVDNRIGYAVGSIKEMQEIVD 289 + PHK L+ PV + + ++A + + +G + +++E V+ Sbjct: 269 VKNRDYKNDIYALPHKFYDSLAAGTPVIVKESFVSMAKQVENLGLGVVIDP-SKVEESVE 327 Query: 290 SMTI--ETYKQISENTKIISQ 308 +T + Y++I +N + + Sbjct: 328 KITNAYKNYEKIIKNVEKHQK 348 >UniRef50_C0BNJ6 Glycosyl transferase group 1 n=1 Tax=Flavobacteria bacterium MS024-3C RepID=C0BNJ6_9BACT Length = 400 Score = 68.4 bits (166), Expect = 3e-10, Method: Composition-based stats. Identities = 45/343 (13%), Positives = 96/343 (27%), Gaps = 63/343 (18%) Query: 19 RKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLE------NKDVLIFNFPM 72 + D +I ++ G R+ + L ++ + D + P Sbjct: 52 KSDDFEILNEIPVYRANLFSRRNGGALRLFINYFSFAILASVKVRKIKGSFDAIFVYEPS 111 Query: 73 AKPFWHILSFFHRLLKFRIVPLIHDI--DELRGGGGSDSVR-LATCDMVI--SHNPQMTK 127 F + K I D+ + L GG + L + + +N + Sbjct: 112 PITVGIPAIFAKKRFKAPIYFWAQDLWPESLVAAGGVKNKFILEFFNSLTKWIYNHSIKV 171 Query: 128 YLSKYMSQDKIKDIKIFD-----YLVSSDVEHRD----------VTDKQRGVIYAGNLSR 172 + +D I D I + Y ++ ++ + +I+AGN+ Sbjct: 172 LIQSNGFRDYILDQGIPNDKILFYPNPTEDFYKPLQEVKEYQEFFEKENFNIIFAGNIGE 231 Query: 173 HKC--------SFIYTEGCDFTLFGVNYENKDNP------------KYLGSFDA-QSPEK 211 + + I + G + +LGSF + P+ Sbjct: 232 AQSFITIIEAINNIKELPIKVNVLGDGRYKETAIGLIKDKGLESHFNFLGSFPPTEMPKF 291 Query: 212 INLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVD 271 + S P K YL+ P+ A + D Sbjct: 292 FSHADA-----------LLVSLKKDKIFSLTIPAKVQSYLACGKPIIASIDGEGAKIVSD 340 Query: 272 NRIGYAVG-----SIKEMQEIVDSMTIETYKQISENTKIISQK 309 + G ++ + + + ++ T Q+ N + +K Sbjct: 341 AKCGVTSPAEDSIALSNIIKELMALNKSTLNQMGNNGRAYYEK 383 >UniRef50_Q1ILE2 Glycosyl transferase, group 1 n=1 Tax=Candidatus Koribacter versatilis Ellin345 RepID=Q1ILE2_ACIBL Length = 392 Score = 66.9 bits (162), Expect = 1e-09, Method: Composition-based stats. Identities = 42/317 (13%), Positives = 95/317 (29%), Gaps = 54/317 (17%) Query: 27 SDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRL 86 + + + + +P+ +R +V V F+ P P +L ++ Sbjct: 55 EERDGVRIHAVPIPKNRRERTTRTVWQVYRKVLGLKPQVAHFHDPELIPVGMLL----KM 110 Query: 87 LKFRIVPLIH-DIDELRGGG----------------GSDSVRLATCDMVISHNPQMTKYL 129 R+V +H D G +S A D +I+ + Sbjct: 111 RGIRVVYDVHEDYSATMMDKEWLPLPVRWLARAGVVGFESAGCAMFDQIITATEG----I 166 Query: 130 SKYMSQDKIKDIKIFDYLVSSDVEHRDV-TDKQRGVIYAGNLSRHKCSFIYTEGCDFTLF 188 ++ M K ++ F L + + V++ G L + + E Sbjct: 167 AERMPPKKTVPVQNFPMLAEFPTAGGVPYSSRDNVVVFVGGLGLIRGAKEMVEAIQL--- 223 Query: 189 GVNYENKDNPKYLGSFDAQSPEKIN-------LPGMQFGLIWDGDSVETCSG-------- 233 + NPK L + P + +G + ++ G Sbjct: 224 ---VPDHLNPKLLIVGPLEPPVSPEWIAAIDVKKRVTWGGVKRRHELKDIFGVARCGIIA 280 Query: 234 --AFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIK--EMQEIVD 289 ++L P+K Y++ LP+ D + R G +V + + + + Sbjct: 281 FLPLKNHL-NAQPNKMFEYMAAGLPLVASDFPYMRKVTDGARCGISVDPLSAQSIADAMQ 339 Query: 290 SMTIE--TYKQISENTK 304 + +++ + + Sbjct: 340 WIFEHPAEAEEMGKRGR 356 >UniRef50_A6LCB7 Glycosyltransferase family 4 n=6 Tax=Bacteroidales RepID=A6LCB7_PARD8 Length = 396 Score = 65.4 bits (158), Expect = 3e-09, Method: Composition-based stats. Identities = 38/282 (13%), Positives = 82/282 (29%), Gaps = 40/282 (14%) Query: 59 GLENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDEL-------RGGGG----- 106 +++ D +I P + F + ++V + D+ L GG Sbjct: 104 RIKSYDKVIIQTPPVLAAASAMLLFRCCYRKKVVLNVSDLWPLSAVELGAMKEGGVYHGV 163 Query: 107 ---SDSVRLATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRG 163 + + ++ Y+ +Y + + + + K Sbjct: 164 MGRIERFLYRKATACQGQSKEIVDYIKRYEPGKASFLYRNLQHRFVLPNQTPS-SRKPFR 222 Query: 164 VIYAGNLSRHKCSFIYTE-------GCDFTLFGVNY-----ENKDNPKYLGSFDAQSPEK 211 ++YAG L + E G + LFG E+ G F S K Sbjct: 223 IVYAGLLGVAQNILELIECVDFKGMGAELHLFGGGNQALEIEDYVRTHDKGVFYHGSLPK 282 Query: 212 INLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVD 271 + ++ P K L + +P+ + + + Sbjct: 283 ERMREE-------LTRYHASIIPLTVRIRGAVPSKLFDLLPLGVPILFCGGGEGEEIVKE 335 Query: 272 NRIGYAV-----GSIKEMQEIVDSMTIETYKQISENTKIISQ 308 N++G + + + + + E Y+Q+ N +SQ Sbjct: 336 NQLGLVSAPGDYERLSKNVQAMSHLPDEEYRQLKANCLRLSQ 377 >UniRef50_B3CFJ1 Putative uncharacterized protein n=1 Tax=Bacteroides intestinalis DSM 17393 RepID=B3CFJ1_9BACE Length = 413 Score = 65.4 bits (158), Expect = 3e-09, Method: Composition-based stats. Identities = 51/288 (17%), Positives = 81/288 (28%), Gaps = 63/288 (21%) Query: 37 IPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIH 96 + L + + SS F N D +I + + +L K + + Sbjct: 88 LRLMINYLSFVFSSCFNVFFYFAWRNYDAVIVHEVSPIFQAYPAILLRKLRKVPVYLWVL 147 Query: 97 DIDELRGGGGSDSV--------------RLATCDMVISHNPQMTK-YLSKYMSQDKIKDI 141 DI G CD ++ + + T+ LSK DKIK Sbjct: 148 DIWPDAMMSGGGIKNRKILSFVNRLVVNIYGQCDRILISSKRFTESILSKGDFVDKIKYF 207 Query: 142 KIF-DYLVSSDVEHRDVTDKQ-RGVIYAGNLSRHKCSFIY---------TEGCDFTLFGV 190 + D L+ D E+ ++ AGNL R + + + G Sbjct: 208 PNWSDDLLKVDSEYPIPQLPDGFKIMLAGNLGRSQNLDAVVQLILSLRDIKDLKWIFIGN 267 Query: 191 NYENK------------DNPKYLGSFDAQSPE--------KINLPGMQFGLIWDGDSVET 230 E + D LG F ++ + F Sbjct: 268 GSEKEWLDNFIEANKLSDVAFTLGRFPLEAMPGFFKKANALLVTLRSGF----------- 316 Query: 231 CSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAV 278 +L P + Y+S PV AD I + GYAV Sbjct: 317 ------PHLGMVVPARLQAYMSAGRPVLAMIGNGGADVIKEANCGYAV 358 >UniRef50_A3CXX8 Glycosyl transferase, group 1 n=4 Tax=Methanomicrobia RepID=A3CXX8_METMJ Length = 402 Score = 63.1 bits (152), Expect = 1e-08, Method: Composition-based stats. Identities = 36/283 (12%), Positives = 84/283 (29%), Gaps = 51/283 (18%) Query: 56 FLCGLENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDI-------------DEL- 101 L D ++ + P R + + + + D+ + Sbjct: 100 LLFTRSRFDAIMTSAPPL-FTGIPGYVLKRTSRVKWILDVRDLWIDASIGLGFLREGSIY 158 Query: 102 -RGGGGSDSVRLATCDMVISHNPQMTKYLSKY-MSQDKIKDIKIFDYLVSSDVEHRDVTD 159 R + + LA D++ ++ + +S + ++ + V+++ Sbjct: 159 ERMSRKFEQMCLARADLIGVTTEELGRRISSHYRVTAPMELMPN---GVNTEFFQPTDGG 215 Query: 160 KQRGVIYAGNLSRHKCSFIYTE---------GCDFTLFGVNYENK------------DNP 198 K+R +IYAGN+ + F + G + D+ Sbjct: 216 KKRQIIYAGNVGHAQDLDKVALAIKSMNGTYNLKFVIVGDGDTRESLERLVKAESLTDSV 275 Query: 199 KYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVF 258 + G+ E L L+ V L++ P K Y++ +P Sbjct: 276 IFTGTLP--REEIPRLLSES--LV----GVAPLKRLAN--LEYAAPTKAYEYMACGIPFV 325 Query: 259 IWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQISE 301 +A ++ G + E S ++ +++ E Sbjct: 326 GCGNGEIAQLARESGAGVIADNTPEAIAATLSALLDDPEKMEE 368 >UniRef50_A8F8F9 Putative uncharacterized protein n=1 Tax=Thermotoga lettingae TMO RepID=A8F8F9_THELT Length = 363 Score = 62.3 bits (150), Expect = 2e-08, Method: Composition-based stats. Identities = 47/304 (15%), Positives = 98/304 (32%), Gaps = 40/304 (13%) Query: 28 DYENISVVNIPLWGGVVQRIISSVKLSTFLCGL---ENKDVLIFNFPMAKPFWHILSFFH 84 D E I +P +++ L N DV+ F++ I Sbjct: 47 DIELIEFEKLPPSRNILKWFKKWKNFDERLFNRILEVNPDVVYFHYLPFTGSGMIKRL-- 104 Query: 85 RLLKFRIVPLIHDIDELRGGGGSDS-------------VRLATCDMVISHNPQMTKYLSK 131 + L +I IH+I + G + D VI + + Y+ Sbjct: 105 KQLGKKIFFEIHEIIPEQFMGKYAIFSPVKSLIWKEFSTSIRLSDGVICISEDIAMYVFD 164 Query: 132 Y-MSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKC------SFIYTEGCD 184 Q + + + + + K + ++ G SR + G Sbjct: 165 RCGIQKEFFILP------NMALMEIESNAKSKEIVLVGKDSRELFYEKEILRKLIDSGFR 218 Query: 185 FTLFGVNYENKDNPKYLGS-FDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNN 243 F + G+ + + Y F L F LI G+ S + +Y +F+ Sbjct: 219 FKVIGLKSDLFKDIPYEYVPFLPYDEMMEQLSRASFSLISYGNEK---SRDYKNY-EFSM 274 Query: 244 PHKTSLYLSMELPVFIWDK-AALADFIVDNRIGYAVGSIKEMQEIVDSMTI--ETYKQIS 300 P+K ++ PV + ++ + +G + ++++ V+ + + Y +I Sbjct: 275 PNKLFDSIAAGTPVIVRRSFVSMVKIVERFGVGVVIEP-RDVESSVEKILKAYDDYDRIL 333 Query: 301 ENTK 304 N + Sbjct: 334 SNLR 337 >UniRef50_A7ZC10 Glycosyl transferase, group 1 family protein n=1 Tax=Campylobacter concisus 13826 RepID=A7ZC10_CAMC1 Length = 368 Score = 60.7 bits (146), Expect = 8e-08, Method: Composition-based stats. Identities = 44/305 (14%), Positives = 96/305 (31%), Gaps = 30/305 (9%) Query: 23 LDIASDYENISVVNIPLWGGVV-QRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILS 81 A + ++ + + I + L + NKD + + Sbjct: 50 EKFALKLKASDILFVRKKILFLKSNKIFNFFLKKIINNYLNKDAIFY---TRHLKIAKFL 106 Query: 82 FFHRLLKFRIVPLIHDIDEL--RGGGGSDSVRLATCDMVISHNPQMTKYLSKYMSQDKIK 139 +++ ++V H+ L + + L D + SHN L K+ +I Sbjct: 107 LENKMPDQKVVFEAHECFTLGNKALYDMEKEILQNADFIFSHNSSTLSELRKF-FGLQIA 165 Query: 140 DIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIY-----TEGCDFTLFGVNYEN 194 + + D + + + Y G+ K + L+G N N Sbjct: 166 NSAVVYNGCKQDYDFKKKDFDFSSINYYGSFLLWKGLDLMLDFALKTNIKLELYGKNSGN 225 Query: 195 KDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVET---------CSGAFGDYLKFNNPH 245 ++ + +I GL+ + V++ DY ++ P Sbjct: 226 S----FMTLKNTLKEREIENLVCFKGLLPQNEVVKSLIENNTILIIPSVKSDYSLYSTPL 281 Query: 246 KTSLYLSMELPVFIWDKAALADFIVDNRIGYAVG-----SIKEMQEIVDSMTIETYKQIS 300 K Y++ V + +A+ + D G+ S++E + ++ E +IS Sbjct: 282 KLFEYMANSNVVLAPNFPPVAEIVKDGENGFLYEAGDEKSLEEKFNYIKTLGNEELNKIS 341 Query: 301 ENTKI 305 +N Sbjct: 342 KNAYE 346 >UniRef50_B5M6M0 Glycosyltransferase n=2 Tax=Kosmotoga olearia TBF 19.5.1 RepID=B5M6M0_KOSOT Length = 381 Score = 60.0 bits (144), Expect = 1e-07, Method: Composition-based stats. Identities = 39/293 (13%), Positives = 90/293 (30%), Gaps = 38/293 (12%) Query: 44 VQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDELRG 103 V R ++ + L I +F KP +I+ H+ Sbjct: 77 VNRYKYEREVLRIVDKLSFDLAYIHHFATVKPLAIF--RLLSRKNVKIITDFHEYVPEEY 134 Query: 104 GGGSDSV---------------RLATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLV 148 G + + + D + + + + ++ K + + L Sbjct: 135 LFGVEQIPRSIKMWLGQKLYRHMIMKSDGTVFVSKKFLEDAKEWKPDLKAFNFPNYGNLK 194 Query: 149 SSDVEHRDVTDKQRGVIYAGN----LSRHKCSFIYTEGCDFTLFGVNYENKDNPKYLGS- 203 + +Q+ VI+AG + F F++ + + K + Sbjct: 195 IPPINQ---IKRQKEVIFAGTTERKIENELKIFEILNSKGFSIVSIGTDIKAPFEIKKLP 251 Query: 204 FDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLK---FNNPHKTSLYLSMELPVFIW 260 F + F ++ +C G+YL ++ P+K ++ PV + Sbjct: 252 FLKYEKMIERISNAAFSIV-----SYSCRNKKGNYLNKYVYSMPNKFFDSIAAGTPVILD 306 Query: 261 DK-AALADFIVDNRIGYAV--GSIKEMQEIVDSM--TIETYKQISENTKIISQ 308 + + I ++ IG + + KE E + + + Y+++ N Sbjct: 307 KDFLGMRELIENDGIGVVIDRDNPKESAEKITAFWESKVEYEKLLLNISRKQD 359 >UniRef50_A3WUD4 Predicted glycosyltransferase n=1 Tax=Nitrobacter sp. Nb-311A RepID=A3WUD4_9BRAD Length = 410 Score = 60.0 bits (144), Expect = 1e-07, Method: Composition-based stats. Identities = 47/307 (15%), Positives = 94/307 (30%), Gaps = 41/307 (13%) Query: 58 CGLENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDE-------LRGGGGSDSV 110 D +++ +P F L + + I D+ + G + + Sbjct: 98 FRARRFD-VVYCYPPITG-GLAAIFATLLSRRPFLIDIQDLWPDSVVKSGMAGTRKMEKI 155 Query: 111 R-------LATCDMVISHNPQMTKYLSKYMSQ-DKIKDIKIF-DYLVSSDVEHRDVTDK- 160 +++ + + L + Q DKI + + D ++ D++ Sbjct: 156 LALMCDFVYRRAAGIVAQSKGIKTRLIERDVQPDKISVVYNWADERAAAPAGLADLSSYG 215 Query: 161 ---QRGVIYAGNLSRHKCSFIYT----------EGCDFTLFGVNYENKDNPKYLGSFDAQ 207 + ++Y GNL R + + L G E+ + + A+ Sbjct: 216 FDNKFNIVYGGNLGRVQGLEVMVRAAHFARRKVPHLQLLLIGDGIESDSLKQLVQQLGAE 275 Query: 208 SPEK-INLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALA 266 + +P G I+ V + P KT Y++M P+ I + A Sbjct: 276 NVRIAPGVPRRMIGDIFAAADVLAMHLWSDPLFRITIPQKTQFYMAMGKPILIGVEGEAA 335 Query: 267 DFIVDNRIGYAVGS-----IKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLE 321 DF+ G AV S + + + M E + +I F + Sbjct: 336 DFVTQAGAGVAVPSDNVQAMADAMIRLSLMPKELLTDMGRRGHEAYWRI---FSFSTAIA 392 Query: 322 EVIDDLK 328 E L+ Sbjct: 393 ETESTLQ 399 >UniRef50_Q8AAS2 Lipopolysaccharide biosynthesis RfbU-related protein n=1 Tax=Bacteroides thetaiotaomicron RepID=Q8AAS2_BACTN Length = 368 Score = 59.6 bits (143), Expect = 1e-07, Method: Composition-based stats. Identities = 49/327 (14%), Positives = 104/327 (31%), Gaps = 60/327 (18%) Query: 20 KDALDIASDYENISVVNIPLWGGVVQRI--ISSVKLSTFLCGLENKDVLIFNFPMAKPFW 77 KD + + + +P G+ Q I I + + G + V+++NFP Sbjct: 43 KDIKNALAVVDGFVSTPVPYPIGIKQWIHQICTFISIKIILGRKPDYVVLYNFPAIASLK 102 Query: 78 HILSFFHRLLKFRIVPLIHDI----------DELRGGGGSDSVRL----ATCDMVISHNP 123 + ++ +HD+ ++R+ D VI+ + Sbjct: 103 --ILKACHKHGIKV---VHDLTEWESNNRWSPSDMMRKIDINLRMRYCVKKMDGVIAISR 157 Query: 124 QMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQR-GVIYAGN--------LSRHK 174 + Y KY + I D R+++ + ++YAG+ L Sbjct: 158 YLYDYYKKY--TNTILVPPTVDLTAGKWNRQRELSAGDKIKLVYAGSAGFGVKDRLDTIA 215 Query: 175 CSFIYTEGCDFTL-----------FGVNYENKDNPKYLGSFDAQSPEKINLPGMQFGLIW 223 + + F + +G ++ N + G K + Sbjct: 216 KAIVKFPNMQFDVIGMTEGQYVSGYGELPKDCKNILFHGRLPHTETVKA---------VQ 266 Query: 224 DGDSVETCSGAFGDYLKFN--NPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGS- 280 D D LK N P K ++ PV + ++D++ D + G+ V Sbjct: 267 DADFQFLIR---DSNLKNNAGFPTKFVESITCCTPVIATLTSNISDYLKDGKNGFVVDDS 323 Query: 281 --IKEMQEIVDSMTIETYKQISENTKI 305 + ++ ++ ++ Q+ E K Sbjct: 324 HSLDDVFGLISKLSPSEIIQMKEACKN 350 >UniRef50_B6YUV9 Glycosyltransferase n=1 Tax=Thermococcus onnurineus NA1 RepID=B6YUV9_THEON Length = 363 Score = 59.6 bits (143), Expect = 1e-07, Method: Composition-based stats. Identities = 35/262 (13%), Positives = 83/262 (31%), Gaps = 52/262 (19%) Query: 43 VVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDI-DEL 101 ++ + +K+ + + D I F ++ V +HD+ + + Sbjct: 77 YLRSLFLLMKMDFDIIHTHDFDTAILGF-----------ILKKMKNKVWVYDVHDLYESM 125 Query: 102 ----------RGGGGSDSVRLATCDMVISHNPQMTKYLSKYMSQDKIKDIKIF--DYLVS 149 R DS D+V + + ++ + + + + DYL Sbjct: 126 VKLVSGEKLARIVRYVDSYFQKNADLVFTASQKVKQVVQRNNKNV-FVILNTVNPDYL-- 182 Query: 150 SDVEHRDVTDKQR-GVIYAGNLSRHKCSFIYTE-----GCDFTLFGVNYENKDNP----- 198 +++ + Y G LS ++ + G + + G + + + Sbjct: 183 -----KNLPKYPTFTIFYGGVLSGNRFLLEMLKIAERLGVTYRVAGKGWVDVEEILKKQL 237 Query: 199 -KYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPV 257 K F + L + ++ + P+K S++ P+ Sbjct: 238 GKNFLGFIPHEQIFVELERAHL--------TFAIYDPRFENIRLSLPNKVFEAASVKTPI 289 Query: 258 FIWDKAALADFIVDNRIGYAVG 279 + ALA+ + +IG+AV Sbjct: 290 LVSRGTALAELVESMKIGWAVE 311 >UniRef50_A4XKF8 Glycosyl transferase, group 1 n=1 Tax=Caldicellulosiruptor saccharolyticus DSM 8903 RepID=A4XKF8_CALS8 Length = 365 Score = 59.2 bits (142), Expect = 2e-07, Method: Composition-based stats. Identities = 37/314 (11%), Positives = 87/314 (27%), Gaps = 62/314 (19%) Query: 39 LWGGVVQRIISSVKLSTFLCGLENK--DVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIH 96 + R S + ++ + D++ F+ P + +++ +H Sbjct: 51 YGLKKLSRAKRYKNYSKIIRIVKEEKPDIIHFHDPDLLLLALYFKLILKK---KVIYDVH 107 Query: 97 DIDELRGG-----------------GGSDSVRLATCDMVISHNPQMTKYLSKYMSQDKIK 139 + L + D V+ +K+ ++K+ Sbjct: 108 EDYSLAFKDREYLPKLLRNLFSSIFNLFEKNVSKLFDGVVVVTE---DIFNKFNCKNKVI 164 Query: 140 DIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSF--------IYTEGCDFTLFGV- 190 +K F + + D ++Y G++S + + + G Sbjct: 165 -LKNFPTIDMYEKREEYNLDGTINLVYIGSVSYQRGITNLILAVKDLQNLNIKLDIVGPA 223 Query: 191 ---NYENK----DNPKYLGSFDAQSPEKINLPGM---QFGLIWDGDSVETCSGAFGDYLK 240 NY + +N K ++ F YL Sbjct: 224 ESSNYFEEIKKFENEKIKIWGRVPKSCVPDILKNAHIGF----------VTLLPLKRYLT 273 Query: 241 FNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQIS 300 + P K Y++ +P+ + I + G V ++++I ++ Sbjct: 274 -SLPLKLFEYMAAGVPIVASNFELWEGIIESSNCGIIVDP-TDIEQIKSAI-----LYFY 326 Query: 301 ENTKIISQKIRTGS 314 N + I K + G Sbjct: 327 NNRQEIINKGQNGY 340 >UniRef50_Q1NU87 Glycosyl transferase, group 1 n=2 Tax=Proteobacteria RepID=Q1NU87_9DELT Length = 420 Score = 58.8 bits (141), Expect = 3e-07, Method: Composition-based stats. Identities = 40/260 (15%), Positives = 82/260 (31%), Gaps = 51/260 (19%) Query: 61 ENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDE--------------LRGGGG 106 DV++ + + RL R + DI Sbjct: 115 RRYDVIMISTVPPVLGGFSAALAARLSNARFIYHCMDIHPEIGRISGEFAQPIVFSTLRK 174 Query: 107 SDSVRLATCDMVISHNPQMTKYLSKY--MSQDKIKDIKIFDYLVSSDVEHRDVTDKQRG- 163 D+ D ++ + M L + + +++ + F S + + D R Sbjct: 175 LDNWSCRQADPIVVLSRDMETTLRERAGGHRFRVEVLNNFPLPDSGEDLKPEEFDSHRDG 234 Query: 164 --VIYAGNLSRHKCSFIYTE---------GCDFTLFGVNYENKD----------NPKYLG 202 V+YAGN+ R + + E +F + G + ++LG Sbjct: 235 LTVLYAGNVGRFQGLQMAVEAMTKLKERTDIEFLVMGDGVAKSELQAQVEKSGAKVRFLG 294 Query: 203 SFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVF--IW 260 + ++ K + G + + ++ P KT YL P+ + Sbjct: 295 AQSVETA-KAAMRSADIGYV----------SLVPEMYRYAYPSKTMNYLEQGCPIIAAVE 343 Query: 261 DKAALADFIVDNRIGYAVGS 280 D++ LA I ++ G++V Sbjct: 344 DESGLAKEIWEDGCGFSVPP 363 >UniRef50_C1XLL8 Glycosyltransferase n=1 Tax=Meiothermus ruber DSM 1279 RepID=C1XLL8_MEIRU Length = 381 Score = 58.4 bits (140), Expect = 3e-07, Method: Composition-based stats. Identities = 45/276 (16%), Positives = 80/276 (28%), Gaps = 57/276 (20%) Query: 43 VVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIH-----D 97 + R++ +V E D+ F+ P P +L RL ++V +H D Sbjct: 63 RLGRMLGTVWAVYKQALAERGDIYHFHDPELIPVGMLL----RLQGKKVVYDVHEDVPTD 118 Query: 98 I-------DELRGGGGSDSVRL-----ATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFD 145 I LRG S L + +++ + + K ++ F Sbjct: 119 ILSKRWIPKPLRGLVASSMRLLERLAGSLLSGIVTVTEPIAARFPAH----KTILLQNFP 174 Query: 146 YLVSSDVEHRDVTDKQR-GVIYAGNLSRHKCSFIY--------TEGCDFTLFG----VNY 192 + + V+YAG+++ + F TL G Sbjct: 175 HPQEIAALGSTSYQNRPARVLYAGSITAVRGLFEMLEAMQHLQNPEVRLTLIGAFAPPGL 234 Query: 193 --ENKDNPKY-----LGSF-DAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNP 244 E + +P Y LG ++ E + + +L + P Sbjct: 235 RAEAEQHPGYRYTDFLGYKNRPETLELLASSRIG----------LAVLHPIPSFL-VSQP 283 Query: 245 HKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGS 280 K Y+ +P D I D G V Sbjct: 284 TKLYEYMMAGIPFVASDFPLWRRSIGDVSCGLFVNP 319 >UniRef50_C6LL02 Putative glycosyltransferase n=1 Tax=Bryantella formatexigens DSM 14469 RepID=C6LL02_9FIRM Length = 396 Score = 58.4 bits (140), Expect = 4e-07, Method: Composition-based stats. Identities = 45/298 (15%), Positives = 93/298 (31%), Gaps = 47/298 (15%) Query: 20 KDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHI 79 KD Y+ + + ++ V+ +++++ + + L++ DVL F Sbjct: 56 KDIEYRL--YKRKNSDHGNIFSRYVRDTLTNIREAIGILKLKDVDVL---FEDVSYSSFW 110 Query: 80 LSFFHRLLKFRIVPLIHDIDEL---------------RGGGGSDSVRLATCDMVISHNPQ 124 ++ ++V ++ D+ + D +I + Sbjct: 111 AVKAAKMKGIKVVAMLQDVWPDNAVQSHLISEGSFLYKYFEMWQKSVYMKADKLICISDD 170 Query: 125 MTKYLSKYMSQ-DKIKDIKIFDY---------LVSSDVEHRDVTDKQRGVIYAGNLSRHK 174 M ++ DKI+ I + Y + V+ ++ + IYAGN+ + + Sbjct: 171 MKDFIVSKGVDADKIEVIYNWGYSDEVVDISWEENEFVKKYNLDKDKFYAIYAGNIGKMQ 230 Query: 175 CSFIY---------TEGCDFTLFGVN--YENKDNPKYLGSFDAQSPEKINLPGMQFGLIW 223 I E F + G + P + + I+ Sbjct: 231 NVEIVVNAAKELQDREDIQFLIIGDGARKTAIEKMATDVKNITMLPMQPSEISTH---IY 287 Query: 224 DGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVF--IWDKAALADFIVDNRIGYAVG 279 V G K P KT + LS PV ++ A I ++ G +V Sbjct: 288 SAAGVNIIPLVAGG-TKTAMPSKTGIVLSCGQPVVFAFGGESRFAKMIKESGSGASVN 344 >UniRef50_C8P2S3 Glycosyltransferase n=1 Tax=Erysipelothrix rhusiopathiae ATCC 19414 RepID=C8P2S3_ERYRH Length = 404 Score = 58.0 bits (139), Expect = 5e-07, Method: Composition-based stats. Identities = 51/367 (13%), Positives = 106/367 (28%), Gaps = 73/367 (19%) Query: 19 RKDALDI-ASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFW 77 + + + I + + V+ I ++ L ++ D++I+ P Sbjct: 54 KNNIIKIKVPQVSGLKSKIVKGINTVLFPHILWNRM-KSLINVKKFDLVIYPTPPISIER 112 Query: 78 HILSFFHRLLKFRIVPLIHDIDE---------------LRGGGGSDSVRLATCDMVISHN 122 I + + L+ DI + + D + + Sbjct: 113 IIQKIKKSNKNIQTLLLLKDIFPQNAVDLEYFSKSSIIFKYFRKQEMRLYNVSDSIGCMS 172 Query: 123 PQMTKYLSKYMSQDKIKDIKIFDYLV--------------SSDVEHRDVTDKQRGVIYAG 168 +++ K DI I + + +++ ++YAG Sbjct: 173 QGNRQFILKNNEYLAESDITIVPNSISINRVCGRVQAEVKQIKRNEFLLPLEKKIIVYAG 232 Query: 169 NLSRHKC----------SFIYTEGCDFTLFGVNYENK-------DNPKYLGSFDA----Q 207 NL + + F G + DNP + + Sbjct: 233 NLGKPQSIDFLIESLEKIKESENFVHFAFCGSGTDANKLKKYCIDNPTQCSYYGQLSKLK 292 Query: 208 SPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFI--WDKAAL 265 S E I++ +GLI N P + Y+ LP+ + Sbjct: 293 SDELISIS--DYGLILLDARFTIP----------NIPSRMLSYMKFGLPLIALTDINTDI 340 Query: 266 ADFIVDNRIGYAVGS----IKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLE 321 D I++N +GY S +K + + ++ E Y + S + + + Sbjct: 341 KDTIIENNLGYWAESRGEEMKNIINSIHGLSDENYVKSSNCVIKYVKDMCN---TEKGYK 397 Query: 322 EVIDDLK 328 E+I LK Sbjct: 398 EIIKQLK 404 >UniRef50_Q7NQL7 Probable glycosyltransferase n=1 Tax=Chromobacterium violaceum RepID=Q7NQL7_CHRVO Length = 367 Score = 57.7 bits (138), Expect = 5e-07, Method: Composition-based stats. Identities = 44/275 (16%), Positives = 80/275 (29%), Gaps = 48/275 (17%) Query: 38 PLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIH- 96 P GG + R+ +VK DV+ F+ P P + ++V +H Sbjct: 58 PKRGGRLARMTGTVKRVYQAALRLRPDVVHFHDPELIPAG----VRLKQAGIKVVYDVHE 113 Query: 97 DID-----------ELRGGGGSDSVRLA-----TCDMVISHNPQMTKYLSKYMSQDKIKD 140 D+ +R L D +++ P + K + + D Sbjct: 114 DVPRQILAKHWIPGAVRPLVSGGFETLEDWAARRFDAIVTSTPHIRKRFERLGA--NALD 171 Query: 141 IKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIYTEGCD-----FTLFGVNYENK 195 + F ++ V ++ V Y G +SR + L G E+ Sbjct: 172 VCNFP-ILEELVRDTPWESRRNEVCYIGGISRIRGIEPIVAALPDTSTRLNLAGPWSESD 230 Query: 196 DNPKYL---GSFDAQSPEKINL-------PGMQFGLIWDGDSVETCSGAFGDYLKFNNPH 245 K G ++ + GL+ +Y+ P Sbjct: 231 LRAKVTAEPGWARVNDLGVLDRKGVAEVLARSKIGLV--------TLFPTPNYVD-ALPI 281 Query: 246 KTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGS 280 K Y++ +PV D + + D G V Sbjct: 282 KLFEYMAAGMPVIASDFPVWREIVADAGCGVLVDP 316 >UniRef50_C4G144 Putative uncharacterized protein n=1 Tax=Abiotrophia defectiva ATCC 49176 RepID=C4G144_ABIDE Length = 387 Score = 57.7 bits (138), Expect = 6e-07, Method: Composition-based stats. Identities = 34/324 (10%), Positives = 90/324 (27%), Gaps = 37/324 (11%) Query: 28 DYENISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLL 87 Y++I +N+P+ ++ + L + K+ + ++ RL Sbjct: 76 KYKHIKTINLPIIKQILNAVNVYFN---ILNQEDKKNSFVICDGLSYLASKAAVLACRLK 132 Query: 88 KFRIVPLIHDIDEL-----RGGGGSDSVRLATCDMVISHNPQMTKYLSKYMSQDKIKDI- 141 K + V +I D+ E + + +M L + + Sbjct: 133 KIKSVVIITDLPEFLVGTDKRAAKRYKRLFDKFSAYVVLTEKMAVRL--GYTDKPYVVLE 190 Query: 142 KIFDYLVSSDVEHRDVTDKQRGVIYAGNLSR--------HKCSFIYTEGCDFTLFGVNYE 193 D ++ ++ ++YAG + + + ++G Sbjct: 191 GQVDSREKREIPG-QKQFNKKIIMYAGIVQKLYGLKILTEGFIKANLNDYELHIYGNGDY 249 Query: 194 NKD-----NPKYLGSFDAQSPE--KINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHK 246 +++ P + + L+ +Y K++ P K Sbjct: 250 SEEIDRISEIHKNVRHFPSQPNSTIVEKEKEAYLLV-------NPRPTTEEYTKYSFPSK 302 Query: 247 TSLYLSMELPVFIWDKAALADFIVDNRIGY---AVGSIKEMQEIVDSMTIETYKQISENT 303 Y+ V + + + +V I + V ++ E Sbjct: 303 NMEYMVSGTAVLTTALPGMPEEYKKHVYLIEDESVDGISNAFKKVAGLSDEEVLNKGRLA 362 Query: 304 KIISQKIRTGSYFRDVLEEVIDDL 327 + + + + E+I+ + Sbjct: 363 REFVLSEKNNKIQTEKIIELINRI 386 >UniRef50_C5U4Y0 Glycosyl transferase group 1 n=1 Tax=Methanocaldococcus infernus ME RepID=C5U4Y0_9EURY Length = 324 Score = 57.7 bits (138), Expect = 6e-07, Method: Composition-based stats. Identities = 45/289 (15%), Positives = 105/289 (36%), Gaps = 61/289 (21%) Query: 44 VQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIH--DIDEL 101 + +I+++++ + E+ D++ ++ F + + +H D+ L Sbjct: 57 LTYLINAIRIGKEILKKEDIDLIHSHY----AFPQGCVGSYLRKYCPHILTLHGSDVLFL 112 Query: 102 R---GGGGSDSVRLATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVT 158 R G + L D +I + YL+ + ++ + + + E +++ Sbjct: 113 RKSFLGRLFFNYSLRGADKIICVSK----YLASQIDRESVV-------IYNGVDEGKNLG 161 Query: 159 DKQRGVIYAGNLSRHKCSFIYTE-----GCDFTLFGVNYENKDNPKYLGSFDAQSPEK-- 211 D G+ Y G+ + K + + F + G +N++N +YLG + K Sbjct: 162 DHGFGL-YVGSFVKQKGLDLLLKAIEGIDFKFKIIGGLGKNRENIEYLGKLSHEETLKYM 220 Query: 212 -------INLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAA 264 + FG++ ++ E PV + Sbjct: 221 GMCSFLVVPSRVEGFGIV------------------------ALEAMACEKPVIAMNTGG 256 Query: 265 LADFIVDNRIGYAVGSIKEMQEIVDSMTIET--YKQISENTKIISQKIR 311 L + +++ G+ V +KEM+E + + + K++ N K S+K Sbjct: 257 LREIVINGYNGFLVNDVKEMREKIKLLIEDEDLRKELGRNAKKFSKKFS 305 >UniRef50_C6I9C3 Glycosyltransferase n=2 Tax=Bacteroides RepID=C6I9C3_9BACE Length = 396 Score = 57.3 bits (137), Expect = 7e-07, Method: Composition-based stats. Identities = 48/352 (13%), Positives = 107/352 (30%), Gaps = 54/352 (15%) Query: 17 KARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTF-LCGLENKDVLIFNFPMAKP 75 K D +DI + S V L + S L + + D +I P Sbjct: 60 KENLDGIDIFRFWLFASNVKRVLPRVLSMLSFSFSVLFSLKYVRKKRFDFIIVESPPLTL 119 Query: 76 FWHILSFFHRLLKFRIVPLIHDIDEL---------------RGGGGSDSVRLATCDMVIS 120 F ++ K +++ I D+ L + + Sbjct: 120 GLSG-YFLSKVCKSKMIMNISDLWPLSARELGVLTDGVIYCMLEKL-EYFLYKKSVACMG 177 Query: 121 HNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIY- 179 + ++ Y+S++ + ++ ++ T+ ++YAG L + Sbjct: 178 QSQEIVSYISQHGASRTYLFRNGVTPERFQNIPNKKRTNGNLIIVYAGLLGVAQGILEIC 237 Query: 180 ------TEGCDFTLFGVNYEN-----------KDNPKYLGSFDAQSPEKINLPGMQFGLI 222 + G +F ++G E + + G E +L Sbjct: 238 QKIDFKSLGTEFHIYGAGGEQHLIEEFLLTNSERGISFHG--RVSRDEIPSLLKQA---- 291 Query: 223 WDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIK 282 D + FG P K ++ +P+ + I +N +G+ S + Sbjct: 292 -DVTLIPLVKNIFG-----AVPSKIYESMAAGVPILFAGEGEGQRIIEENCLGWVSRS-R 344 Query: 283 EMQEIVDSM-----TIETYKQISENTKIISQKIRTGSYFRDVLEEVIDDLKT 329 + +++++++ Q EN K ++ + L + + L T Sbjct: 345 DYEKLIENIKLIRSNDIDMLQKRENCKNAAENLFNRPKQIKALFQYLSQLNT 396 >UniRef50_A7ZF00 Glycosyl transferase, group 1 n=1 Tax=Campylobacter concisus 13826 RepID=A7ZF00_CAMC1 Length = 401 Score = 57.3 bits (137), Expect = 7e-07, Method: Composition-based stats. Identities = 50/350 (14%), Positives = 107/350 (30%), Gaps = 50/350 (14%) Query: 22 ALDIASDYENISVVNIPLWGGVVQRIISSVKLS--------TFLCGLENKDVLIFNFPMA 73 I S S+ G ++RI++ L + N DV+I P Sbjct: 58 ISHIDSGVSFFSIKTPFYKGNGLRRIVNMFAFVLNLIKCTKKLLVEIGNIDVIIMASPHP 117 Query: 74 KPFWHILSFFHRLLKFRIVPLIHDIDEL---------------RGGGGSDSVRLATCDMV 118 F K +++ I DI L ++ D + Sbjct: 118 -FAIFAAKFMANKAKAKLIIDIKDIWPLSITELTSAKKYHPFVMLTKLTELFAYRVQDKI 176 Query: 119 ISHNPQMTKYLSKYMSQDKIKDIKI-----FDYLVSSDVEHRDVTDKQRGVIYAGNLSRH 173 IS + +Y + K I F ++ + ++ + V Y G +++ Sbjct: 177 ISPLNNINEYFYDSGLNIQAKHIPTGLDLDFYENINHENLQINIPSDKFIVGYIGGITKS 236 Query: 174 K-----------CSFIYTEGCDFTLFGVNYENKDNPKYLGSFDAQSPEKINLPGMQFGLI 222 Y + F + G + K + F ++ Sbjct: 237 NAIEFLLESANYFLQNY-KDILFLVVGDGSYKNNLVK-KYNSSNILYIDAVKKEEAFMIM 294 Query: 223 WDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGS-- 280 D + Y +P K + Y+ +P+ D ++ G +V S Sbjct: 295 SKCDVLYRAMLPLKIYSYGISPLKMNEYMFAGVPIVHSFDYKEHDIVMKVGCGISVKSGS 354 Query: 281 ---IKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDDL 327 I+ + +M E +++ +N K ++ ++ +++++I+ L Sbjct: 355 MLEIQNAILKIYNMPKEEREKMGQNGKEY---VKNNLSYKVLVKKMIEVL 401 >UniRef50_B5IQU9 Glycosyl transferase, group 1 family protein n=1 Tax=Thermococcus barophilus MP RepID=B5IQU9_9EURY Length = 396 Score = 56.9 bits (136), Expect = 9e-07, Method: Composition-based stats. Identities = 44/299 (14%), Positives = 98/299 (32%), Gaps = 39/299 (13%) Query: 39 LWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDI 98 + + I + + ++ +V++ P P ++++ +L++ +++ + D+ Sbjct: 82 IERTLYYTIFPVLASIWLVFNRKSSNVILVTSPP--PQMYLIALIGKLMRKKVIVDVRDL 139 Query: 99 -------------DEL--RGGGGSDSVRLATCDMVISHNPQMTKYLSKYMSQDKIKDIKI 143 L R +S L D V P++ L + + K Sbjct: 140 FLDVSVNLGFIKKGSLIERIFRFLESKALQKADAVTLVTPKIRHQLVEEYGINPAKC--- 196 Query: 144 FDYLVSSDVEHR----DVTDKQRGVIYAGNLSRHK----CSFIY-----TEGCDFTLFGV 190 Y+V + V+ D + ++ ++YAG + Y E L G Sbjct: 197 --YVVPNGVDLETFKCDKSKRKLQMVYAGYFGHAQDFDTFLKGYALLRENERVPLILAGG 254 Query: 191 NYENKD---NPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKT 247 +D N + LG + L+ S+ + LK+ P K Sbjct: 255 GETLEDVLKNVEKLGISKWIKYVGMLSRKDVVKLL-CSSSIGVAPIKVDESLKYAIPSKI 313 Query: 248 SLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQISENTKII 306 YL+ LP + ++R G + +E+ E + + +++ Sbjct: 314 YEYLACGLPFIGVGVGEIEKIAEESRAGCVGKTPEEVAECIMKLLNSNLEKLKVRALRY 372 >UniRef50_A5FL46 Glycosyltransferase family 4 n=1 Tax=Flavobacterium johnsoniae UW101 RepID=A5FL46_FLAJ1 Length = 395 Score = 56.9 bits (136), Expect = 1e-06, Method: Composition-based stats. Identities = 30/307 (9%), Positives = 91/307 (29%), Gaps = 34/307 (11%) Query: 32 ISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLKFRI 91 + ++ +V + S L +L + +I P L + +I Sbjct: 75 YPSNSSNIFKRIVSTLSFSTTLFFYLLFSKIPKKVIVQSPPL-LLSFTAVLALWLRRKKI 133 Query: 92 VPLIHDI---------------DELRGGGGSDSVRLATCDMVISHNPQMTKYLSKYMSQD 136 + + D+ + + + + + ++ ++ + Sbjct: 134 ILNVSDLWPTAAIELGVLKKNSISHKFLLFIERFIYRKANHIFGQSNEIIDHIHSIFPEK 193 Query: 137 KIKDIKIF-DYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSF-------IYTEGCDFTLF 188 K + + D+ + + +E + + + YAG L + F + + +F Sbjct: 194 KCFLYRNYPDHFIENRLEEKQNLSEPIKLFYAGLLGVAQGVFELIQELDLKNLNIELHIF 253 Query: 189 GVNYENKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKF--NNPHK 246 G E +L + + + + + P K Sbjct: 254 GDGAEKNQILNFLEKNQTEKIIFHGMLERN---VLHKTLQNLDIALVPLKTRIYGSVPSK 310 Query: 247 TSLYLSMELPVFIWDKAALADFIVDNRIGYAV-----GSIKEMQEIVDSMTIETYKQISE 301 Y ++ PV + + + +N +G+ V ++ + + + E + + + Sbjct: 311 IFEYSALGFPVLYFGGGEGENIVEENNLGWVVPVEDFKNLNDTLKQISEFGKEEIQIMKK 370 Query: 302 NTKIISQ 308 + ++ Sbjct: 371 TIFLHAK 377 >UniRef50_C5A2Y6 Glycosyltransferase, family 1, putative n=1 Tax=Thermococcus gammatolerans EJ3 RepID=C5A2Y6_THEGJ Length = 380 Score = 56.5 bits (135), Expect = 1e-06, Method: Composition-based stats. Identities = 50/313 (15%), Positives = 88/313 (28%), Gaps = 54/313 (17%) Query: 24 DIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCG-----LENKDVLIFNFPMAKPFWH 78 + E + V IP+ +KL F ++NKD L Sbjct: 49 KLFEIVEGVRVCRIPIASKYASFFDFFIKLPLFYLKAVLYVVKNKDGLYAIHANDFDTAP 108 Query: 79 ILSFFHRLLKFRIVPLIHDIDELRGGGGSDSVRLA---------------TCDMVISHN- 122 + F RLL + + IHD+ R + L D VI+ Sbjct: 109 LAFFLSRLLGVKFIYDIHDLYHSRISLLREKKTLNFLQRLILSLEIIHAKLADSVITVTR 168 Query: 123 ------PQMTKYLSKYMS-QDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNL-SRHK 174 + ++L DK+ + L + R+ +K V Y G + S Sbjct: 169 SLGGRHKGVKEFLVNRGVVPDKVYVVWNTPELEFFPLIKRERGEK-FTVGYIGTIRSVSS 227 Query: 175 CSFIYTE-----GCDFTLFGVNYENKDNPKYLGSFDA-----QSPEKINLPGMQFGLIWD 224 ++ G L E ++ Sbjct: 228 FIPLFEVARRDRRLKLLFVGSGASKGKIANLLTERYPNVDVEFIDEVPYHNVFKY----- 282 Query: 225 GDSVETCSGAFGDY-----LKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVG 279 C + Y +K K + M +PV + + DF+V R G + Sbjct: 283 ---YTLCDAVYSMYPPTDNIKRALAVKMFESIVMGIPVIVNRDTLMEDFVVLYRCGVSSN 339 Query: 280 -SIKEMQEIVDSM 291 S ++M ++ + Sbjct: 340 LSTEDMANALEKI 352 >UniRef50_UPI0001C4246F glycosyltransferase n=1 Tax=Bacillus pseudofirmus OF4 RepID=UPI0001C4246F Length = 419 Score = 56.5 bits (135), Expect = 1e-06, Method: Composition-based stats. Identities = 37/245 (15%), Positives = 73/245 (29%), Gaps = 50/245 (20%) Query: 102 RGGGGSDSVRLATCDMVISHNP------QMTKYLSKYMSQDKI---------KDIKIFDY 146 RG + D +I + K+ + + + DI+ F+ Sbjct: 162 RGLTAGEQWIYKKADALIFTKEGDTDYIKEKKWDIEQGGEINLDKCHYINNGVDIESFEL 221 Query: 147 L-VSSDVEHRDVTDKQRGVIYAG---------NLSRHKCSFIYTEGCDFTLFGVNY---- 192 L ++ V+ D++ + V+Y G NL E F ++G Sbjct: 222 LASNNKVDDEDLSSAKFNVVYVGAIRPVNNVGNLLDAASLLKDKEDIQFLIYGDGNQKEM 281 Query: 193 -------ENKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPH 245 EN N K G + ++ S ++ + N+ + Sbjct: 282 LEKRVVEENLTNVKLKG--FVNKRLIPYILSKS------SVNILNYSQTQYNWTRGNSSN 333 Query: 246 KTSLYLSMELPVFIWDKAALADFIVDNRIGY-----AVGSIKEMQEIVDSMTIETYKQIS 300 K Y++ P+ K + + G + + + + E Y IS Sbjct: 334 KLFEYMASGKPIISTVKMG-YSILDKYQCGIELEKSTPEELANAIIEIRNFSEEQYNAIS 392 Query: 301 ENTKI 305 +N K Sbjct: 393 KNAKK 397 >UniRef50_B9Z8K7 Glycosyl transferase group 1 n=1 Tax=Lutiella nitroferrum 2002 RepID=B9Z8K7_9NEIS Length = 369 Score = 56.5 bits (135), Expect = 1e-06, Method: Composition-based stats. Identities = 45/315 (14%), Positives = 95/315 (30%), Gaps = 58/315 (18%) Query: 18 ARKDALDIASDYENISVVNI-------PLWGGVVQRIISSVKLSTFLCGLENKDVLIFNF 70 A D I +D + + + P GG + R+ +V+ +V F+ Sbjct: 28 AGHDVTLIVADGQGEEIRDGVRIHDVGPKTGGRLARMTGTVERVYRAALALKPEVAHFHD 87 Query: 71 PMAKPFWHILSFFHRLLKFRIVPLIH-DID-----------ELRGGGGSDSVRLAT---- 114 P P + R ++V +H D+ R L Sbjct: 88 PELIP----AALKLRRAGIKVVYDVHEDVPRQVLAKHWIPGAARPLVSKGVELLEHYAAR 143 Query: 115 -CDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDV-TDKQRGVIYAGNLSR 172 D +++ P + + + ++ I + +Y + +++ ++ V Y G +SR Sbjct: 144 RFDAIVAATPLIRRRFAALGAR----AIDVCNYPILAELVRDTPWEARRNEVCYLGGISR 199 Query: 173 HKCSFIYTEGCD-----FTLFGVNYENKDNPKYLGSFDAQSPEKINLPG----------M 217 + L G+ E + + + + Sbjct: 200 TRGIAPIITALPSTATRLNLAGLWSETELKAELQQQPGWARVNDLGVLDRAGVAEVLAAS 259 Query: 218 QFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYA 277 + GL+ + ++ + L P K Y++ LPV D D + G Sbjct: 260 KVGLV-----TLLPTPSYVESL----PIKLFEYMAAGLPVIASDFPLWRDIVDGAGCGLL 310 Query: 278 VGSIKEMQEIVDSMT 292 V + I ++ Sbjct: 311 VDP-NDAAAIASAIN 324 >UniRef50_D1N7C5 Glycosyl transferase group 1 n=1 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1N7C5_9BACT Length = 342 Score = 56.1 bits (134), Expect = 2e-06, Method: Composition-based stats. Identities = 41/324 (12%), Positives = 91/324 (28%), Gaps = 74/324 (22%) Query: 43 VVQRIISSVKLSTFLCGL--ENKDVLI-FNFPMAKPFWHILSFFHRLLKFRIVPLIHDID 99 +Q+ + L L DV+ P IL+ +F + L +DI Sbjct: 12 RIQKFLVISWHFFRLAFLHVRRNDVVFAVTNPA--FIIFILAVLRSFRRFEYILLAYDIF 69 Query: 100 E-------LRGGGGSDSVR--------LATCDMVISHNPQMTKYLSKYMSQ-DKIKDIKI 143 L + D VI M L + ++ I Sbjct: 70 PENLVAAGLARQKSFHYRVVKKIFDWSYSRADRVIVIGRDMANILKNKGVENRRLLLIPN 129 Query: 144 F--------------DYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFI--------YTE 181 + YL ++++ + ++AGN+ R + ++ Sbjct: 130 WSDCSRIQPTSPRNNPYLCQLGIQNK------KVFLFAGNIGRVQGINNLLTAITLVKSK 183 Query: 182 GCDFTLFGVN--------YENKDNPKYLGSFDAQSPE-KINLPGMQFGLIWDGDSVETCS 232 F G + + + + E + D Sbjct: 184 QAVFLFIGSGAMADTVRMQQAESRYHNIYYLEPLPLEKQPEFLNA-------CDVAIVTL 236 Query: 233 GAFGDYLKFNNPHKTSLYLSMELPV--FIWDKAALADFIVDNRIGYAVGS-----IKEMQ 285 G + L P K+ ++ P+ + +A I + + G+ V + E+ Sbjct: 237 GT--NMLGLGVPSKSYFSMAAGKPLLYIGEHDSEIAQVIAEEQCGWQVEPHEPARLAELI 294 Query: 286 EIVDSMTIETYKQISENTKIISQK 309 +++ ++ E + + ++K Sbjct: 295 DLICALPEEKLTAAGQAARRTAEK 318 >UniRef50_C4L5H1 Glycosyl transferase group 1 n=1 Tax=Exiguobacterium sp. AT1b RepID=C4L5H1_EXISA Length = 405 Score = 56.1 bits (134), Expect = 2e-06, Method: Composition-based stats. Identities = 44/303 (14%), Positives = 91/303 (30%), Gaps = 41/303 (13%) Query: 28 DYENISVVNIPLWGGVVQRIISSVKLSTFLCGLENK-----------DVLIFNFPMAKPF 76 + + + + +L FL + DV+ P Sbjct: 61 ELNDQPFIKRLITKKRKHTSNMVSRLFLFLEQMVKGIREVRKLELKPDVVFATTPS-FFM 119 Query: 77 WHILSFFHRLLKFRIVPLIHDI--DELRGGGGS------------DSVRLATCDMVISHN 122 + ++ R + + + D+ + ++G G + D VI ++ Sbjct: 120 AFVGAYAKRKYRVPFILDVRDLWPESVKGVGVFKYDWVLTPAFWMEKRLYRVADEVIINS 179 Query: 123 PQMTKYLSKYMS-QDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIY-- 179 YL + +KI + +E +D + ++YAGN+ + + Sbjct: 180 EGFRSYLRQRGVPNEKIHYMPNSIREAERTLERTIPSDDRMEILYAGNMGLAQDVSLLLE 239 Query: 180 -------TEGCDFTLFGVNYENKD-NPKYLGSFDAQSPEKINLPGMQ-FGLIWDGDSVET 230 F L G Y ++ +P + F I + D V Sbjct: 240 LAERFRDEPRIHFKLIGYGYRKEELKQTIKDRGFQNFLFLEAMPRTEAFQAIKNAD-VAF 298 Query: 231 CSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAV--GSIKEMQEIV 288 S + P K Y+++ P+ A+ I +GY I E++ I+ Sbjct: 299 VSLIEQEVFDTVIPGKLIDYMAVGKPIVAAVSGHAANVIEAAEVGYVSRKRDIDEIERIL 358 Query: 289 DSM 291 + Sbjct: 359 RKL 361 >UniRef50_C6LKZ9 Putative glycosyl transferase n=1 Tax=Bryantella formatexigens DSM 14469 RepID=C6LKZ9_9FIRM Length = 413 Score = 55.7 bits (133), Expect = 2e-06, Method: Composition-based stats. Identities = 33/209 (15%), Positives = 72/209 (34%), Gaps = 21/209 (10%) Query: 112 LATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLS 171 ++ D +I + L +K + + L + H + ++Y G++S Sbjct: 192 ISKADHIICTSEFAKSSLIANNIDEKRIHVISYG-LEQNKKSHNIGKPGKLSLLYVGSVS 250 Query: 172 RHKCSF--------IYTEGCDFTLFGVNYENKDNPKYLGSFDAQSPEKINLPGMQFGLIW 223 K + I +E + L G NY + + + + + + L Sbjct: 251 CEKGLYFLLEAVKRINSEEIELVLVGKNYIDDKLLEPYKKWCNFIGDIPHTQVENYYLNA 310 Query: 224 DGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVG--SI 281 D + T +FG S +S +P+ A AD+I + G+ + I Sbjct: 311 DVFILPTLFDSFGRV--------VSEAMSYGIPIISTSNAGAADYIKNGENGFVIPAGDI 362 Query: 282 KEMQEIVDS--MTIETYKQISENTKIISQ 308 M E + + + K + + + ++ Sbjct: 363 DSMVEKIRYFLLNRDEVKIMGKKAQTTAE 391 >UniRef50_D1JK96 WbpH n=1 Tax=Bacteroides sp. 2_1_16 RepID=D1JK96_9BACE Length = 375 Score = 55.7 bits (133), Expect = 2e-06, Method: Composition-based stats. Identities = 38/244 (15%), Positives = 68/244 (27%), Gaps = 38/244 (15%) Query: 84 HRLLKFRIVPLIHDIDELRGGGGSDSVRLATCDMVISHNPQMTKYLSKYMSQDKIKDIKI 143 + L ++ V + + ++ D I K D ++ + Sbjct: 123 LKRLVYKFVGFV--------YKQYEMIKCKEFDAAIVCYHWTRDRFKK--VNDNVELVLN 172 Query: 144 FDYLV-SSDVEHRDVTDKQRGVIYAGNLSRHKCS------FIYTEGCDFTLFG------V 190 F + E T + YAG +S F L G + Sbjct: 173 FPLIDRDKVKERPLRTTNDIKICYAGTISDAWNIPTLINAIENLNDVKFNLAGWTDDELM 232 Query: 191 NYENK----DNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHK 246 + Y G Q + G+ S C G G+ + N K Sbjct: 233 GRMKSLIGWEKVNYFGKLPKQEVNEKVYSHSDIGVALYHYSPL-CKGKIGN-MSNN---K 287 Query: 247 TSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSM-----TIETYKQISE 301 YL M +PV D + + N G V + I++++ + Q+ Sbjct: 288 LFEYLLMGMPVICTDFDLWKEVVEKNHCGICVNP-SDANAIMEAIVYIQKNQKEAYQMGL 346 Query: 302 NTKI 305 N + Sbjct: 347 NGQK 350 >UniRef50_A3I1Z7 Putative glycosyltransferase protein n=1 Tax=Algoriphagus sp. PR1 RepID=A3I1Z7_9SPHI Length = 398 Score = 55.7 bits (133), Expect = 2e-06, Method: Composition-based stats. Identities = 42/321 (13%), Positives = 87/321 (27%), Gaps = 55/321 (17%) Query: 31 NISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLKFR 90 N N + + +S + E D+++ + L + + Sbjct: 67 NAPDSNKDSFLKRAFNALKFAFISIYFALKEPHDLVLSSSGPITTAIPGL-ISKKFRSKK 125 Query: 91 IVPLIHDIDEL--------------RGGGGSDSVRLATCDMVISHNPQMTKYLSKYMSQD 136 V + D+ + + + D+V++ + M + K Sbjct: 126 FVFEVRDLWPAGGIQLGKINNLFAQKIALSFEKLIYKNSDLVVACSVGMEDGVKKVNPDK 185 Query: 137 KIKDIK------IFDYLVSSDVEHRDVTDKQRGVIYAGNLS--------RHKCSFIYTEG 182 I +F + + + IYAG+L EG Sbjct: 186 PTLVIPNSSDVVLFSSITDRPSGFKSEWENTCNFIYAGSLGLMDECEQIIKGFIDSRMEG 245 Query: 183 CDFTLFGVNYENK------------DNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVET 230 G E +N ++G E + S T Sbjct: 246 IHMFFLGDGAERNHLETLAKQNGLQENIHFMGLLP--KKELVKWFQAA------RASFVT 297 Query: 231 CSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIK--EMQEIV 288 + L N+P+K + +PV K +A + ++ G V M E + Sbjct: 298 FKNI--EVLHTNSPNKLFDSFAAGIPVIQSTKGWIATLVNESNCGINVDPEDPKSMAEAI 355 Query: 289 DSMTIET--YKQISENTKIIS 307 + +++ N K ++ Sbjct: 356 LYLRDNPTIAEEMGGNAKKLA 376 >UniRef50_A3U615 Lipopolysaccharide biosynthesis protein, putative glycosyltransferase n=1 Tax=Croceibacter atlanticus HTCC2559 RepID=A3U615_9FLAO Length = 388 Score = 55.7 bits (133), Expect = 2e-06, Method: Composition-based stats. Identities = 43/324 (13%), Positives = 104/324 (32%), Gaps = 48/324 (14%) Query: 38 PLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHD 97 W + + SV L + C + +V+I P + + K +++ + D Sbjct: 81 NKWKRLFSMLSFSVSLLFYRCFKKPAEVVIIQSPPL-LIAYSAIRLYASKKRKVILNVSD 139 Query: 98 I--------DELRGGG----GSDSVRLA--TCDMVISHNPQMTKYLSKYMSQDKIKDIKI 143 + L+ G DMV+ + ++ ++ Q ++ + Sbjct: 140 LWPQAGLDLGALKKGKFYNYLKTIEAYNYTKADMVMGQSQEILDHVRSIHPQKEMVLYRN 199 Query: 144 FDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIY------TEGCDFTLFGVNYENKDN 197 F + + + + +IYAG L + E + ++G E + Sbjct: 200 FP-DIKFNTDSTISINAPVKIIYAGLLGVAQGILELCKHIKLPENVELHIYGDGPEKQAL 258 Query: 198 PKYLGS--------FDAQSPEKINLPGMQ--FGLIWDGDSVETCSGAFGDYLKFNNPHKT 247 +YL S S +++ L+ + + P K Sbjct: 259 LQYLDSANKNNIICHGEVSKSELHTLYENHHIALVPLVRPILG-----------SVPSKI 307 Query: 248 SLYLSMELPVFIWDKAALADFIVDNRIGYAVG--SIKEMQEIVDSMTIETYKQISENTKI 305 LP+ + ++ +GY + KE+ E + ++ ++ K Sbjct: 308 FEIAHFGLPILYMAGGEGEHIVKEHALGYVINTQDYKELNETLCTLKVKDLLDNRIEIKA 367 Query: 306 ISQKIRTGSYFRDVLEEVIDDLKT 329 I++K L+++++ ++ Sbjct: 368 IAKKH---FVIEKQLQQLLNAIEA 388 >UniRef50_A8RK70 Putative uncharacterized protein n=2 Tax=Clostridium RepID=A8RK70_9CLOT Length = 405 Score = 55.4 bits (132), Expect = 3e-06, Method: Composition-based stats. Identities = 46/362 (12%), Positives = 108/362 (29%), Gaps = 67/362 (18%) Query: 1 MYFLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIIS---SVKLSTFL 57 +Y ++ + ++D+ +I ++ L + ++ + Sbjct: 35 VYVVSPIERKNNKKTHIVKEDSWEIL-KVRTGNIQQTNLIEKGIATVLLEQQFINAIKKY 93 Query: 58 CGLENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDE----------------- 100 D+++++ P ++++ + K ++ DI Sbjct: 94 YSNTKFDLVLYSTPP-VTLARVVAYIKKRDKALSYLMLKDIFPQNSIDLGILKKTGLKGV 152 Query: 101 -LRGGGGSDSVRLATCDMVISHNPQMTKYLSKYM--SQDKIKD----------IKIFDYL 147 + + D++ + +Y+ ++ + KI + + + D Sbjct: 153 IYKYFSLKEQKLYKLSDVIGCTSEANIRYVKEHDELDKKKIIEFCPNCSDWYDLSLPDNG 212 Query: 148 VSSDVEHRDVTDKQRGVIYAGNLSRHKCS-FIYT--------EGCDFTLFGVNYENKDNP 198 + ++ +Y GNL R + FI + F + G E Sbjct: 213 KKEVRNKYGLPVDKKIFVYGGNLGRPQDVPFIVKCLEACKDMKNVYFLVVGSGTEKHYLD 272 Query: 199 KYL----GSFDAQSPEKINLPGMQFGLIWDGDSVETCSG---AFGDY--LKFNNPHKTSL 249 +Y+ S + DS+ C F DY N P + Sbjct: 273 EYVEKESCSHVRVMGQLPKQEY---------DSMVACCDCGIIFLDYRFTVPNTPSRLLA 323 Query: 250 YLSMELPVFIWDKAA--LADFIVDNRIGY--AVGSIKEMQEIVDSMTIETYK-QISENTK 304 Y+ +PV A + D + DN G+ I+ +V+ + ++ EN Sbjct: 324 YIQAGIPVLTCTDPATDVGDIVEDNGFGWQCTSDKIENFVRLVEHIANLDIDPRMKENGL 383 Query: 305 II 306 Sbjct: 384 KY 385 >UniRef50_Q2C5V3 Putative uncharacterized protein n=1 Tax=Photobacterium sp. SKA34 RepID=Q2C5V3_9GAMM Length = 366 Score = 55.4 bits (132), Expect = 3e-06, Method: Composition-based stats. Identities = 36/288 (12%), Positives = 93/288 (32%), Gaps = 41/288 (14%) Query: 64 DVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDELRGGGGSDS-------------- 109 D +I + P+ L F + +++ ++ + + Sbjct: 94 DTIIL-YSGYSPYLFRLIPFCKKNNIKLIFDC--VEWYQPKSKIEYLYKPYYWNIELSMR 150 Query: 110 VRLATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGN 169 L CD +I + + Y S I + + ++ + + ++YAGN Sbjct: 151 YLLKKCDYIICISKYLENYYKASGSGVVRIPPTI-KFNKNIIPVPNEIENNKIKLVYAGN 209 Query: 170 LSRHKCSFIYTEGC-------DFTLFGVNYENKDNPKYLGSFDAQSPEKINLPGMQFGLI 222 E + + GV+ N+ + Y G I F ++ Sbjct: 210 PGHKDLLNDIIEAINGLEDYFELHIVGVSGVNQKSIYYYGYLPHYKSLNIVKS-CHFSIL 268 Query: 223 WDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYA----- 277 ++ + +G K + +PV + + DF+ G+ Sbjct: 269 LRPNNKVSNAG---------FSTKIVESMCNGIPVIANNTGDIKDFVNKEN-GFLFEGED 318 Query: 278 VGSIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVID 325 ++ + + + +++ Y ++S+N + + ++ VL +++ Sbjct: 319 SSALHVILKKISNISDANYNKLSKNAFMTAVDFFHIDCYKKVLRDIVS 366 >UniRef50_A9A104 Glycosyl transferase group 1 n=1 Tax=Desulfococcus oleovorans Hxd3 RepID=A9A104_DESOH Length = 353 Score = 55.4 bits (132), Expect = 3e-06, Method: Composition-based stats. Identities = 44/328 (13%), Positives = 106/328 (32%), Gaps = 46/328 (14%) Query: 12 RDAGFKARKDALDIASDYENI-----SVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVL 66 +AGF D + + I + QR++ N D+ Sbjct: 12 YNAGF----DVTYVVPGTGDKIFDGVKFSFIRPGKNLFQRLVIKPYKIYKAAVKLNADIY 67 Query: 67 IFNFPMAKPFWHILSFFHRLLKFRIVPLIH-DID---ELRGGGG--------------SD 108 F+ P P+ ++L +++ IH D++ + R G + Sbjct: 68 HFHDPELIPYGYLLLKA----GHKVIYDIHEDLENKIKDRRIKGLGYLLPFFAAYVGKIE 123 Query: 109 SVRLATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAG 168 I+ N + L+ + + + I + + ++ + + V+YAG Sbjct: 124 KYFCKRFTYNITVNQDIKTKLNIKNVEI-VTNYPIVELFKRKEADNVRSINDEFTVVYAG 182 Query: 169 NLSRHKCSFIYTEGCDF----------TLFGVNYENKDNPKYLGSFDAQSPEKINLPGMQ 218 L+R + + F + + G + + Sbjct: 183 LLNRIRGIKEIVDAMSFLRGTARLLLLGKWQDKLYQDECMHSEGWKYTEFKGFLP-LEEA 241 Query: 219 FGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAV 278 + +I + D ++L F+ P+K Y++ P+ + + D D + Sbjct: 242 YDVIQNSDVGVVNFYPLKNHL-FSMPNKAFEYMAAGKPMVMSNFDYWKDLFADCALFSNP 300 Query: 279 GSIKEMQEIVDSMTIET--YKQISENTK 304 + +E+ ++ + + + ++S+N K Sbjct: 301 ENSEEIAANIEKLASDRQLFNELSKNAK 328 >UniRef50_C2HRD4 Glycosyl transferase group 1 family protein n=1 Tax=Vibrio cholerae bv. albensis VL426 RepID=C2HRD4_VIBCH Length = 378 Score = 55.4 bits (132), Expect = 3e-06, Method: Composition-based stats. Identities = 37/232 (15%), Positives = 76/232 (32%), Gaps = 35/232 (15%) Query: 102 RGGGGSDSVRLATCDMVISHNPQMTKYLSK-YMSQDKIKDIKIFDYLVSSDVEHRDVT-- 158 R +S+ + +CD++I + + + Y + + + + + Sbjct: 137 RISKYIESLFIRSCDLIIVVGENIADWYANAYKIERPLVVKNSPRFRLQAKKNLIRERLG 196 Query: 159 --DKQRGVIYAGNLSRHKCSFIYTEGCD----------FTLFGVNYEN----KDNPKYLG 202 Q+ ++Y G L + + + + F +G + + Sbjct: 197 ILPNQKILLYQGGLMKGRGVQLILDAFKERKDSHVVAVFMGYGDLTTEIEQAAKKHQNIF 256 Query: 203 SFDAQSPE--KINLPGMQFG--LIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVF 258 F A SP G LI TC + F P+K Y +PV Sbjct: 257 YFPAVSPNVVLDYTASADIGISLI-----ENTCLSYY-----FCMPNKLFEYAMAGIPVI 306 Query: 259 IWDKAALADFIVDNRIGYAVG--SIKEMQEIVDSMTIETYKQISENTKIISQ 308 + D +A+ + G + S++ + E VD + ++S N +Q Sbjct: 307 VSDMKEMAEAVQTADFGVVLTEYSVESINEAVDRLAERDLTELSNNAYQFAQ 358 >UniRef50_C7M4B3 Glycosyl transferase group 1 n=6 Tax=Bacteroidetes RepID=C7M4B3_CAPOD Length = 400 Score = 55.4 bits (132), Expect = 3e-06, Method: Composition-based stats. Identities = 48/306 (15%), Positives = 91/306 (29%), Gaps = 62/306 (20%) Query: 51 VKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDE---------- 100 + E D++++ P +S+ + K L+ DI Sbjct: 86 LWAIKRYLSNEKFDMVLYTTPP-ITLLKPISYIKKRDKAYTYLLLKDIFPQNAVDLGLMK 144 Query: 101 -----LRGGGGSDSVRLATCDMVISHNPQMTKYLSKYMSQDKIKDIKI---------FD- 145 + + D + + +L + + + ++I FD Sbjct: 145 EGSFLHKIFVKKEKKLYQISDTIGCMSQANVDFLLHHHPEIPQQKVEINPNSITPISFDK 204 Query: 146 --YLVSSDVEHRDVTDKQRGVIYAGNLSRHKCS--------FIYTEGCDFTLFGVNYE-- 193 + + + ++ +Y GNL + + E F + G E Sbjct: 205 SEAERKAIKQKYHLPLDKKIFVYGGNLGKPQGLDFLLDTIQATKNEDVYFLIVGNGTEYH 264 Query: 194 ---------NKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNP 244 N L S + L GLI+ N P Sbjct: 265 RIKIWFEEQKPQNAMLLSSLPKDDYD-HLLSSCDVGLIFLDKRFTIP----------NFP 313 Query: 245 HKTSLYLSMELPVFI--WDKAALADFIVDNRIGYAV--GSIKEMQEIVDSMTIETYKQIS 300 + YL MELPV + D I + + GY V G + +MQ+++D + + + Sbjct: 314 SRLLSYLEMELPVIAATDPHTDIGDVIEEAQCGYKVLSGDLIQMQKVIDHLLMSDLDALG 373 Query: 301 ENTKII 306 N K + Sbjct: 374 RNAKNL 379 >UniRef50_C1CBD3 Capsular polysaccharide biosynthesis protein Cps4F n=10 Tax=Streptococcus pneumoniae RepID=C1CBD3_STRP7 Length = 408 Score = 55.4 bits (132), Expect = 3e-06, Method: Composition-based stats. Identities = 26/237 (10%), Positives = 64/237 (27%), Gaps = 46/237 (19%) Query: 101 LRGGGGSDSVRLATCDMVISHNPQMTKYLSKY--MSQDKIKDIKIFDYLVSSDVEHRDVT 158 + D + +P Y +++ KI + Y + Sbjct: 159 FKLFKFISKKVYRASDYIFVTSPSFKNYFVNQFDITEQKITYLP--QYAEDLFIPDESRV 216 Query: 159 DKQR-GVIYAGNLSRHKCSF-------------IYTEGCDFTLFGVNYE----------- 193 +K+ + +AGN+ + + + F G E Sbjct: 217 NKESVDLTFAGNIGKAQNLETILKAASLIEKNTDLPKKIQFHFVGDGTELLSMKALAHEL 276 Query: 194 NKDNPKYLGSFDAQSPEKINLPGMQFGLI-WDGDSVETCSGAFGDYLKFNNPHKTSLYLS 252 N + G + L+ GDS+ + + P K Y++ Sbjct: 277 ELKNVSFYGRRSLEEMPTFYK-KSDAMLVSLIGDSIVSRT----------IPGKVQSYMA 325 Query: 253 MELPVFIWDKAALADFIVDNRIGYA-----VGSIKEMQEIVDSMTIETYKQISENTK 304 P+ + + + G+ V + + ++ E +++ + + Sbjct: 326 AGKPIIGAISGDTKTIVEEAKCGFVSPEQDVEQLAQNICKFSMLSTEEQRELGKQAR 382 >UniRef50_Q2YUX0 Capsular polysaccharide synthesis enzyme CapL n=55 Tax=Bacillales RepID=Q2YUX0_STAAB Length = 401 Score = 55.0 bits (131), Expect = 4e-06, Method: Composition-based stats. Identities = 36/318 (11%), Positives = 94/318 (29%), Gaps = 43/318 (13%) Query: 29 YENISVVNIPLWGGVVQRIISSVKLSTFLCGLENKD-VLIFNFPMAKPFWHILSFFHRLL 87 + N G ++ K + + D +L+++ P P + HRLL Sbjct: 68 LKYSRFNNKSKVGRIINFFSLFSKFVINIPKMLKYDQILVYSNPPILPLIPDV--LHRLL 125 Query: 88 KFRIVPLIHDIDEL------------RGGGGSDSV---RLATCDMVISHNPQMTKYLSKY 132 K + +++DI + + VI +M YL + Sbjct: 126 KKKYSFVVYDIAPDNAIKTGATRPGSMIDKLMRYINKHVYKNAENVIVLGTEMKNYLVNH 185 Query: 133 MSQD---KIKDIKIF-------DYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIYTEG 182 I I + D + +D + ++Y+GN+ + + Sbjct: 186 QISKNPDNIHVIPNWYDMRQLQDNRIYNDTFKAYREQYDKIILYSGNMGQLQDMETLISF 245 Query: 183 CDFT---------LFGVNYENKDNPKYLGSFDAQSPEKINLPGM-QFGLIWDGDSVETCS 232 L G + D + ++ + + + + + Sbjct: 246 LKLNKDQPQTLTILCGHGKKFADVKTAIEDHRIENVKMFEFLTGTDYADV-LKIADVCIA 304 Query: 233 GAFGDYLKFNNPHKTSLYLSMELP--VFIWDKAALADFIVDNRIGYAVGS--IKEMQEIV 288 + + P K YL+ + P + + ++ + + G + + + + Sbjct: 305 SLIKEGVGLGVPSKNYGYLAAKKPLVLIMDKQSDIVQHVEQYDAGIQIDNGDAHAIYNFI 364 Query: 289 DSMTIETYKQISENTKII 306 ++ + + ++ E + Sbjct: 365 NTHSSKELHEMGERAHQL 382 >UniRef50_O26550 LPS biosynthesis RfbU related protein n=1 Tax=Methanothermobacter thermautotrophicus str. Delta H RepID=O26550_METTH Length = 411 Score = 54.6 bits (130), Expect = 5e-06, Method: Composition-based stats. Identities = 41/345 (11%), Positives = 104/345 (30%), Gaps = 81/345 (23%) Query: 27 SDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRL 86 + + V+ +++ + S++ + DV+ ++ P I H + Sbjct: 94 DQRKGLPVLIRKYGSLLLRSLWYSIR--------KPFDVIHAHY--LFPTGFIGLLCHWI 143 Query: 87 LKFRIVPLIH--DIDEL-RGGGGSDSV---RLATCDMVISHNPQMTKYLS-KYMS-QDKI 138 +V +H D+++L R + L VI+ + + + + ++ + K+ Sbjct: 144 SGKPLVVTVHGSDVNKLARKNSLLSKISGFILRRTSAVIAVSRDLGEKVVNEFGVDRGKV 203 Query: 139 KDIKI-FDYLVSSDVEHRDV------TDKQRGVIYAGNLSRHKCSF--------IYTEGC 183 I + D + ++ + K R V++ GN+ K + + + Sbjct: 204 HVINMGVDTDIFMPLDRDECRERLGLPLKGRVVLFVGNIIPSKGVYYLIESLKDLELDDV 263 Query: 184 DFTLFGVNYENKDNPKYLG----------SFDAQSPEK------------INLPGMQFGL 221 + G + + G F + + FGL Sbjct: 264 KCIILGAPVDEEYLRTLRGLAESMDSDVEFFGPVPYTEVPTWMNAADVFVLPSLEEGFGL 323 Query: 222 IWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSI 281 + L+ PV + +F+ D+ GY V Sbjct: 324 V------------------------ALEALACGTPVIATATGGIMEFVRDSETGYTVPPG 359 Query: 282 KE--MQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVI 324 + + + + + E+ + ++ +E+V+ Sbjct: 360 DSRAIADRIKRILDPENRAEVESMRERGMRLAESFSTIKQIEKVL 404 >UniRef50_Q9UZI9 Putative glycosyltransferase, family 1 n=1 Tax=Pyrococcus abyssi RepID=Q9UZI9_PYRAB Length = 377 Score = 54.2 bits (129), Expect = 6e-06, Method: Composition-based stats. Identities = 45/313 (14%), Positives = 87/313 (27%), Gaps = 55/313 (17%) Query: 26 ASDYENISVVNIPLWGGVVQRIISSVKLSTFLCG-----LENKDVLIFNFPMAKPFWHIL 80 + I V I + VKL F L+N+D L + Sbjct: 49 FDMVDGIKVYRISIISRYASLFDFLVKLPLFYLNAMLYILKNRDGLYAIHANDFDTAPLA 108 Query: 81 SFFHRLLKFRIVPLIHDIDELRGGGGSD---------------SVRLATCDMVISHN--- 122 F R+L + + IHD+ R + + D VI+ + Sbjct: 109 FFISRILGVKFIFDIHDLYYTRISLLEEQEKDTILRKILRRTEILFAKLSDSVITVSRSI 168 Query: 123 ----PQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFI 178 + ++L + +++ V + I G + + Sbjct: 169 GGKHKGLKEFLVNSGVPPDKIYV-VWN--APDPRVFPRVKRHKHRGIVVGYIGTIRSISN 225 Query: 179 YTEGCDFTLFGVNYENKDNPKYLGSFDAQSPEKINLPGMQF--------GLIWDGDSVE- 229 + + ++GS E L +++ + E Sbjct: 226 FIPLFEI----AKENMLLRIIFVGS-GPLKDEIRELLSIKYPNIRVDFIESVPYEKVSEY 280 Query: 230 --TCSGAFGDY-----LKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVG--- 279 C + Y +K K + M +PV + + DF+ + G AV Sbjct: 281 YTLCDVIYSVYPMTTNIKMAIAVKMLESIIMGIPVIVNKDTLMEDFVNIYKCGVAVDMNV 340 Query: 280 -SIKEMQEIVDSM 291 SIK + + + Sbjct: 341 GSIKNALDKIKKI 353 >UniRef50_B5JPV4 Glycosyl transferase, group 2 family protein n=1 Tax=Verrucomicrobiae bacterium DG1235 RepID=B5JPV4_9BACT Length = 634 Score = 53.8 bits (128), Expect = 8e-06, Method: Composition-based stats. Identities = 14/119 (11%), Positives = 35/119 (29%), Gaps = 17/119 (14%) Query: 74 KPFWHILSFFHRLLKFRIVPLIHDIDELRGG----------------GGSDSVRLATCDM 117 + + + HDI LR + D Sbjct: 361 HISIKYIDLIKERSSAKTLYFGHDIHHLRLELEAIYKDVEELKILKTRKQEIELWEKADY 420 Query: 118 VISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCS 176 ++ + + Y++ +K ++ I+ Y + R ++ G+++ GN + Sbjct: 421 LLYPSKEEVDYIASKGFAEKAVEVPIYFYD-PNSKPERLPFEESDGILFVGNFNHPPNL 478 >UniRef50_C7PLX0 Glycosyl transferase group 1 n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PLX0_CHIPD Length = 407 Score = 53.8 bits (128), Expect = 9e-06, Method: Composition-based stats. Identities = 35/297 (11%), Positives = 77/297 (25%), Gaps = 47/297 (15%) Query: 22 ALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILS 81 N++ + + I + + F DVL+ P P + + Sbjct: 62 IEITRVKAGNLNKDKLLQRALRLVLISLRLAFALFRKAKRGDDVLLVTNPA--PLLLLAA 119 Query: 82 FFHRLLKFRIVPLIHDIDE---------------LRGGGGSDSVRLATCDMVISHNPQMT 126 R R ++HD+ R + + +++ M Sbjct: 120 RICRWKGLRCYTIVHDVFPENLVVAKLVKPGSLPYRFLKSIFNAAYSRMTVLLVLGRDMK 179 Query: 127 KYLSKYMSQDK----IKDIKIF---DYLVSSDVEHRDVTDK-----QRGVIYAGNLSRHK 174 + M++ I I+ + + + D + + +AGNL R + Sbjct: 180 ALFEEKMAKYSYRPLIHIIENWADIEIISPQDKMGNGLVQDLHIGEKIIFQFAGNLGRVQ 239 Query: 175 CSFIY--------TEGCDFTLFGVNYENKDNPKYLGSF----DAQSPEKINLPGMQFGLI 222 F G K+ Y+ + + QF Sbjct: 240 GLQELFAIIREVRNPLLHFMFIGEGASKKELQAYVEAHQLTNVSILDSFPRSKQQQF--- 296 Query: 223 WDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPV--FIWDKAALADFIVDNRIGYA 277 S + P K+ L+ P+ + + ++ +G+ Sbjct: 297 -LNASDIGIVSLQDGMVGLGVPSKSYNILAAGKPILYIGDRSGEIGQMVEEHGVGWC 352 >UniRef50_Q1GZQ5 Glycosyl transferase, group 1 n=1 Tax=Methylobacillus flagellatus KT RepID=Q1GZQ5_METFK Length = 382 Score = 53.4 bits (127), Expect = 1e-05, Method: Composition-based stats. Identities = 42/295 (14%), Positives = 87/295 (29%), Gaps = 31/295 (10%) Query: 25 IASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFH 84 I +++ R I + S ++ F+F + F Sbjct: 52 IFMQLTFVNIWGKDATWRRGARYILGLCRSIKHAKKNYSNIAHFHFFHVGVLEFLSVLFF 111 Query: 85 RLLKFRIVPLIHDIDELRGGGGSDSVR---LATCDMVISHNPQMTKYLSKYMSQD--KIK 139 RL F++V +HD++ + G + C+ +I HN L + KI Sbjct: 112 RLFGFKVVATVHDVESFKPGLTFSILLKWTYFLCNQLIVHNQVSKDELINRSNVSINKIH 171 Query: 140 DIKIFDYLVSSDVEHR--------DVTDKQRGVIYAGNLSRHKCS----------FIYTE 181 I Y+ +V+ ++ +++ G + K + Sbjct: 172 VIPHGSYIGLVAPSMPKADARQALNVSSDEKVILFFGQIKEVKGLDLLIEALGLVKDKLK 231 Query: 182 GCDFTLFGVNYENKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGD---SVETCSGAFGDY 238 + G + KY +++ I D + Y Sbjct: 232 PFKLVIAGK-VWKDNFSKYQELISKNGLNDFCKLDIRY--IPDQEISMFYSAADLIVLPY 288 Query: 239 LKFNNPHKTSLYLSMELPVFIWDKAALADFIV--DNRIGYAVGSIKEMQEIVDSM 291 K + +S PV D +A I +N ++ G+ ++ E + + Sbjct: 289 RKIYQSGVLLMAMSYGTPVLASDLPGMAQIINDGENGFLFSAGNYIDLAEKLVKI 343 >UniRef50_B0RZK4 Capsular polysaccharide synthesis related protein n=2 Tax=Finegoldia magna RepID=B0RZK4_FINM2 Length = 380 Score = 53.4 bits (127), Expect = 1e-05, Method: Composition-based stats. Identities = 52/332 (15%), Positives = 107/332 (32%), Gaps = 46/332 (13%) Query: 31 NISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRL-LKF 89 NI N ++ S K L ++ + + + K Sbjct: 60 NIDADNSNPMKRLIPTYKFSQKAYKILERIKPD---LIHVQSYDMLEIATKYKKNNDNKV 116 Query: 90 RIVPLIHDID-ELRGGGGS-------------DSVRLATCDMVISHNPQMTKYLSKYMSQ 135 +I+ + DI L S ++ LA D++I + + ++ S+ Sbjct: 117 KIIYEVPDIHRYLTDDKKSFPMNIVSSILKKRENNMLAFVDLMIMTSMKFWEHFDGKYSK 176 Query: 136 DKIKDIKIFDYLVSSDVEHRDVTDKQRG---VIYAGNL---SRHKCSFIYTEGCDFTLFG 189 D + + L + Q V Y G L + K EG + L Sbjct: 177 DNLVFMPNIPNLELFKDYDKLRVKNQHDTFTVGYIGGLRYLNELKKLVKAMEGLEMNLMM 236 Query: 190 VNYEN----------KDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYL 239 +E+ KD +Y G F D + + A + Sbjct: 237 AGFESGTYFKELSETKDFIEYRGKFYYDDEIAELYSKC--------DCIFSVYDASMKNV 288 Query: 240 KFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVG--SIKEMQEIVDSMTIET-- 295 + P+K + ELP+ + L++ + + +G AV S +EM+ ++ + + Sbjct: 289 RIALPNKLYESIHAELPIIVARDTYLSEVVNEWNVGVAVDHESDEEMRNVLIKLRDDQSF 348 Query: 296 YKQISENTKIISQKIRTGSYFRDVLEEVIDDL 327 Y + EN + + ++ +L+ + + Sbjct: 349 YHSLQENCRKMQSELNPQKNNERLLKRIENLF 380 >UniRef50_Q8RBZ2 Predicted glycosyltransferases n=1 Tax=Thermoanaerobacter tengcongensis RepID=Q8RBZ2_THETN Length = 411 Score = 53.4 bits (127), Expect = 1e-05, Method: Composition-based stats. Identities = 51/364 (14%), Positives = 106/364 (29%), Gaps = 74/364 (20%) Query: 23 LDIASDYENISVVNIPL----WGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWH 78 ++ + + P W VV I + ++ E D++I Sbjct: 63 EEVFDGVRFVWLNTFPYTKNDWRRVVNMISYAFRVIKVAKKFEKPDIII-----GSSMHF 117 Query: 79 ILS----FFHRLLKFRIVPLIHDIDE----LRGGGGS-----------DSVRLATCDMVI 119 + R K R + + D+ G + + +I Sbjct: 118 FAPLAGWWLSRKYKARFIFEVRDLWPQTAIDMGAIKENSILAKLLYIWEKFMYERAEKII 177 Query: 120 SHNPQMTKYLSKYMSQDKIKD-------IKIFDYLVSSDVEHRDVTDKQR-----GVIYA 167 P Y++K ++ I+ F+ +S D ++ V+YA Sbjct: 178 VLLPDAKSYIAKRGMPEQKIVWIPNGVNIERFETDISIDENLEVFEVFKKYKDKFKVVYA 237 Query: 168 GNLSRHKCSFIYTE---------GCDFTLFGVN--------YENKDNPKYLGSFDAQSPE 210 G + E F L G NK + S + Sbjct: 238 GAHGPANGLEVVIETAELLRDYEDIQFILIGDGVGKESLIEMANKKKLTNIVFLSPISKK 297 Query: 211 KINLPGMQFGLIWDGDSVETCSGAFGDYLKF-NNPHKTSLYLSMELPVFIWDKAALADFI 269 I + + S D K+ +P+K YL+ P+ + +A + I Sbjct: 298 FIPT-------VLRKADLLLHSLKHMDVFKYGISPNKIFDYLASGRPIIS-NVSASKEII 349 Query: 270 VDNRIGYAVGS-----IKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVI 324 + G V + E + +++ + Q+ N + + + + E++I Sbjct: 350 EEANAGIIVPPENPKLLAEGILKIKNLSEKERNQMGLNGRKY---VEQHYDIKKLTEKLI 406 Query: 325 DDLK 328 +L+ Sbjct: 407 KELE 410 >UniRef50_D1PQ72 Capsular polysaccharide biosynthesis protein Cps4F n=1 Tax=Subdoligranulum variabile DSM 15176 RepID=D1PQ72_9FIRM Length = 407 Score = 53.4 bits (127), Expect = 1e-05, Method: Composition-based stats. Identities = 37/293 (12%), Positives = 75/293 (25%), Gaps = 60/293 (20%) Query: 42 GVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDE- 100 SS + + L E+ DV+ N + R ++V D+ Sbjct: 82 NYYSYAWSSSQYARHL--REDYDVVFTNQTSPVMMSSAAFAYARRHGKKVVMYCMDLWPA 139 Query: 101 -------------LRGGGGSDSVRLATCDMVISHNPQMTKYLSK-YMSQD-KIKDIKI-- 143 R D ++ + YL + + D KI + Sbjct: 140 CLAAGGLGESSPVYRFFDRESRRLYNQPDRILITSRMFRAYLVERHGVDDGKIAYLPQYA 199 Query: 144 ---FDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIYTE---------GCDFTLFGVN 191 FD L + D +++AGN+ + E + + G Sbjct: 200 AARFDAL-PPPASGKQTVD----LMFAGNVGAAQSLTTVLEAAALLRDQPQLRWHIVGDG 254 Query: 192 YE-----------NKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLK 240 E D + G + L+ + ++ Sbjct: 255 SELAHLQKLAAEKQLDCVIFHGRKPPEEMP-RYYAMADAMLV---------TLTADPFIS 304 Query: 241 FNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYA--VGSIKEMQEIVDSM 291 P K Y++ P+ + + R G+ + ++ + V Sbjct: 305 LTLPGKVQTYMAAGKPILAAAAGEIPQVLAAARCGWCAKAENAADLAQKVRQF 357 >UniRef50_A1TKR8 Glycosyl transferase, group 1 n=1 Tax=Acidovorax citrulli AAC00-1 RepID=A1TKR8_ACIAC Length = 401 Score = 53.0 bits (126), Expect = 1e-05, Method: Composition-based stats. Identities = 35/310 (11%), Positives = 90/310 (29%), Gaps = 47/310 (15%) Query: 37 IPLWGGVVQRIISSVKLSTFLCGLENKDVLIFN-FPMAKPFWHILSFFHRLLKFRIVPLI 95 +W + + S + D++ + +PM ++ +I+ Sbjct: 87 FRIWSYRISSLNSFDQYIFDRMSEYQVDIVHVHDYPMLA----AGVALAKMRGVKIIYDA 142 Query: 96 HDIDEL---------RGGGGSDSVRLATCDMVISHNPQMTKYL-SKYMSQDKIKDIKIFD 145 H++ ++S + D I+ NP + + +Y + + Sbjct: 143 HELYYAQTQLPVSIQEKYKQNESRLMRHVDAAITVNPYIADIIAKRYSVKTPWVIMNAAP 202 Query: 146 YLV----SSDVEHRDVTDKQRGVIYAGNLSRHKCS-------FIYTEGCDFTLFGVNY-- 192 ++ R V+Y G +S ++ E + G Sbjct: 203 PRAVSHGDLLRARFNLPSSTRIVVYQGWISDNRGIDCAVEAAKYLAENIALVVIGYGDYE 262 Query: 193 -------ENKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPH 245 E+ D + + ++++ D + F +P+ Sbjct: 263 ATLRKMVEDHDLSSRVFFYGGVPSDELHALTCG------ADLGIIPYHGVDENNFFCSPN 316 Query: 246 KTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTI-----ETYKQIS 300 K + +P D L D + G + ++ Q S+ E + ++ Sbjct: 317 KLFEFAVANIPFVCNDLPFLRDIVEKFGNG-VISDLRSPQAAAQSINQVFADPEKFSRMK 375 Query: 301 ENTKIISQKI 310 E ++ +++ Sbjct: 376 EGAELAGREL 385 >UniRef50_C7PGD4 Glycosyl transferase group 1 n=2 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PGD4_CHIPD Length = 415 Score = 53.0 bits (126), Expect = 1e-05, Method: Composition-based stats. Identities = 44/311 (14%), Positives = 90/311 (28%), Gaps = 59/311 (18%) Query: 44 VQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHD--IDEL 101 +S+ L + DV+I P + + + + + I D I+ Sbjct: 99 FSFAVSAFFKVLQLLPRKKFDVVISVVPPFHL-GLLAVLYKKFRGAKFLYHIQDMQIEAA 157 Query: 102 RGGGGS------------DSVRLATCDMVISHNPQMTKYLSKYMSQDKIKDIKIF-DYLV 148 R + DMV S + M + + + +D I + D + Sbjct: 158 RDLNMIKSPKVIKALFGLERYIFKNADMVSSISDGMMRKIQEKAGKD-IFFFPNWVDVSL 216 Query: 149 SSDVEHRDVTD-------KQRGVIYAGNLSRHKCSFIYTEG---------CDFTLFGVNY 192 +E R + V+Y+G + + + F + G Sbjct: 217 FHPIEDRIKLKTAYGFDVNDKIVLYSGAIGEKQGLESILKTADTLRADTQLKFLICGSGP 276 Query: 193 ENKD--------NPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNP 244 + + F Q EK N D G+ P Sbjct: 277 YKEKLKADAEALGLSNVIFFPLQPFEKFNEFLN------MADVHLVIQ--KGNASDLVMP 328 Query: 245 HKTSLYLSME--LPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQISEN 302 K + L++ + D +L D + + +G V + + ++T + + +N Sbjct: 329 SKLTTILAVGGLALITANDGTSLHDLVKKHNMGILVKAEDQ-----QALTDGIVRAMGDN 383 Query: 303 TKIISQKIRTG 313 ++ R G Sbjct: 384 AAELA---RNG 391 >UniRef50_Q1Q6V4 Putative uncharacterized protein n=1 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q6V4_9BACT Length = 404 Score = 53.0 bits (126), Expect = 1e-05, Method: Composition-based stats. Identities = 28/216 (12%), Positives = 67/216 (31%), Gaps = 37/216 (17%) Query: 111 RLATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNL 170 D+ IS + + + + + KI + F V+ + ++ ++Y G L Sbjct: 183 IYDLVDVFISPSMFLKNKVEEMGFKGKIIYLPNF---VNLEDYRPQYDWQENAIVYFGRL 239 Query: 171 SRHKCSFIYT-----EGCDFTLFGVNYENK-----------DNPKYLGSFDAQSPEKINL 214 S+ K F + G + N K+LG + K + Sbjct: 240 SKEKGLFTLIEAMKGLNTKLKIIGEGPIKEGLKLSVRSLELKNIKFLGYKAGEEL-KDEI 298 Query: 215 PGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRI 274 F ++ + + NNP ++ P + + + DN Sbjct: 299 RKSMFVILPSE---------WYE----NNPRSIIEGFALGKPAIGARIGGIPELVKDNET 345 Query: 275 GYAVG--SIKEMQEIVDSMTIE--TYKQISENTKII 306 G + +++ + + ++ +N + + Sbjct: 346 GLTFEPWNADDLKRRISQLIENPSEISRMGKNARKM 381 >UniRef50_C4Z1X4 Putative uncharacterized protein n=1 Tax=Eubacterium eligens ATCC 27750 RepID=C4Z1X4_EUBE2 Length = 93 Score = 53.0 bits (126), Expect = 1e-05, Method: Composition-based stats. Identities = 16/90 (17%), Positives = 29/90 (32%), Gaps = 1/90 (1%) Query: 2 YFLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLE 61 Y++N +AG KA D +I G++ +I + L + Sbjct: 5 YYINIKMKENNNAGSKAVNDCNNILKQCGIEPYTLNIKGEGLLGKINKVFEFEK-LKKIP 63 Query: 62 NKDVLIFNFPMAKPFWHILSFFHRLLKFRI 91 VL P+ + + K +I Sbjct: 64 ENSVLFIQHPIYINKNYYIDVLKNTKKKKI 93 >UniRef50_A6A5R7 Glycosyl transferase, group 1 n=2 Tax=Vibrio cholerae RepID=A6A5R7_VIBCH Length = 379 Score = 53.0 bits (126), Expect = 1e-05, Method: Composition-based stats. Identities = 43/261 (16%), Positives = 80/261 (30%), Gaps = 41/261 (15%) Query: 47 IISSVKLSTFLCGLENKDVLIFN--FPMAKPFWHILSFFHRLLKFRIVPLIHDI------ 98 + L L N +I F P + + +++ ++D Sbjct: 77 LKFQFFLLAKLFKFRNDYKIIHAADFDTILPALFMKLILRK----KVIYDVYDFYVDAFS 132 Query: 99 --DELR-GGGGSDSVRLATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFD--YLVSSDVE 153 LR D + D VI N + +S Q D Y ++ V Sbjct: 133 VPKILRGFIKKIDLFSMGVVDGVIITNESRFEQISGSSPQRICVIHNTPDVSYNINYQVM 192 Query: 154 HRDVTDKQRGVIYAGNLSRHKCS------FIYTEGCDFTLFGVNYENKDNPKYLGSFDAQ 207 + + + V Y G L ++ F + G Y G Sbjct: 193 TSNKDEFKINVAYVGILQPNRLLEEILEVFARNPSWKLDIAGFGVLEDLVVSYAGK---- 248 Query: 208 SPEKINLPGMQFGLIWDGDSVETCSGA---FGDYL------KFNNPHKTSLYLSMELPVF 258 +P I G I D++ S A F Y +F++P+K + + P+ Sbjct: 249 NPNIIFH-----GRISYDDAIRVNSNADLLFATYDPNVPNHRFSSPNKLYEAMLLSKPII 303 Query: 259 IWDKAALADFIVDNRIGYAVG 279 + + + + D IG+++ Sbjct: 304 VCNSTGIDALVNDEGIGFSID 324 >UniRef50_C2HHA8 Glycosyltransferase n=1 Tax=Finegoldia magna ATCC 53516 RepID=C2HHA8_PEPMA Length = 408 Score = 53.0 bits (126), Expect = 1e-05, Method: Composition-based stats. Identities = 46/341 (13%), Positives = 91/341 (26%), Gaps = 66/341 (19%) Query: 19 RKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLEN---KDVLIFNFPMAKP 75 +K +I + + + G R+I+ ++ L + +++ Sbjct: 55 KKYKEEIIDSLKFVYIKAKNYTGNGKDRVINMIQFYKNLLSVSKKYSGVDVVYCSAPHSL 114 Query: 76 FWHILSFFHRLLKFRIVPLIHDI--DELRGGGGS-------------DSVRLATCDMVIS 120 W + +++ D+ + G + + D +I Sbjct: 115 TWLASRKIAKNNHAKLICETRDLWPETFIEMGKFSKNHPVAKILYAIEKSVYKSSDALIF 174 Query: 121 HNPQMTKYLSKYMSQD-------KIKDIKIFDYLVSSDVEHRDVTDKQR--GVIYAGNLS 171 YL ++ F+ + D + V+Y G L Sbjct: 175 TMEGGKDYLKSRGINRDNVFHINNGVVLEDFNESIKKYHLEDVDLDDKNIFKVVYTGALG 234 Query: 172 RH----------KCSFIYTEGCDFTLFGVN-----------YENKDNPKYLGSFDAQSPE 210 R Y E ++G ++ N + GS Sbjct: 235 RANQVDTLIDAMNKLSDY-EDIKLIVYGKGEFEASLIKKVRENSQKNVVFKGS--VDKRY 291 Query: 211 KINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPH-KTSLYLSMELPVFIWDKAALADFI 269 ++ + +G + K+ K YL+ P+ K D I Sbjct: 292 VPSIVT--------RSDLNVITGQDINLYKYGMSFNKLFEYLAANKPILSNLKCN-YDII 342 Query: 270 VDNRIGYAVGS-----IKEMQEIVDSMTIETYKQISENTKI 305 G V S +KE ++ YKQ EN+K Sbjct: 343 EKFNCGKTVKSGSADALKEGILYFYNLKDIDYKQFCENSKE 383 >UniRef50_C9RVT7 Glycosyl transferase group 1 n=4 Tax=Bacillaceae RepID=C9RVT7_GEOSY Length = 418 Score = 52.7 bits (125), Expect = 2e-05, Method: Composition-based stats. Identities = 41/349 (11%), Positives = 97/349 (27%), Gaps = 68/349 (19%) Query: 16 FKARKDALDIASDYENISVVNIPLWGGVVQRIISSV---------KLSTFLCGLENKDVL 66 K + + ++ I V ++ + + L + DV+ Sbjct: 50 PKPYRGLFYLFEQWDGIPVHRTWIYPSPKGSFWKRLASYFSFTFSSFYSLLVKAKPTDVI 109 Query: 67 IFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDE--------------LRGGGGSDSVRL 112 I N P +L + + V + DI +R + Sbjct: 110 ICNSPPL-FLGITGYVGAKLKRAKFVFNVADIWPESAVELGILKNRLFIRMARWLELFLY 168 Query: 113 ATCDMVISHNPQMTKYLSKYM-------SQDKIKDIKIFDYLVSSDVEHRDVT-DKQRGV 164 + + + Y+ + + +F L + ++ + + Sbjct: 169 RKAWKIAAATEGIRDYMIEQGKAPEDVFLLPNGVNTDVFRPLPKNKKLLAELGLEGKVVF 228 Query: 165 IYAGNLSRHKCS----------FIYTEGCDFTLFGVNYENK-----------DNPKYLGS 203 YAG + + E F G E + DN + GS Sbjct: 229 TYAGTMGYAQGLDSVLRAAAIVKAKDERAHFLFVGDGQEREKLMALKEELGLDNVTFYGS 288 Query: 204 FDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKA 263 + +I + ++ S+ G P K ++ PV + Sbjct: 289 VPVEKMPEIFSIT-DYSIV----SLRNIDLFKG-----ARPSKIFPAIATGTPVLYCGEG 338 Query: 264 ALADFIVDNRIGYAV--GSIKEMQEI---VDSMTIETYKQISENTKIIS 307 A+ + G + +++ + + E Y++++EN + ++ Sbjct: 339 ESAEILETYHCGKIAPPENPEQIAAAVLELLCLPREEYEKMAENGRKLA 387 >UniRef50_B2UPC2 Glycosyl transferase group 1 n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UPC2_AKKM8 Length = 390 Score = 52.7 bits (125), Expect = 2e-05, Method: Composition-based stats. Identities = 45/339 (13%), Positives = 85/339 (25%), Gaps = 77/339 (22%) Query: 17 KARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFL---CGLENKDVLIFNFPMA 73 K R+ A DI + + L + + L DV+IF Sbjct: 66 KIRRSAADILVCK--TPLFSSALGCRLYAAARYTSGFKKALTNHLNKHQYDVVIFGSGFE 123 Query: 74 KPFWHILSFFHRLLKFRIVPLIH--------DIDEL--RGGGGSDSVRLATCDMVISHNP 123 L+ L RI+ H ++ R + D +I + Sbjct: 124 DSLLLALTKNKLLPSMRILTWSHASYDNYFTNMGSFFSRYMKEAIKAYYHRFDEIIVLSD 183 Query: 124 Q-MTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHK-------- 174 ++ K+ + Y ++ R T + +Y G LS K Sbjct: 184 GDEKEFREKHHLPARRI------YNPNTMNPARKSTLTSKTFVYVGALSHQKGTDLAVRA 237 Query: 175 --CSFIYTEGCDFTLFGVNYENKDNPKYLGS-------------------FDAQSPEKIN 213 + + ++G +Y+ S F S Sbjct: 238 FHKFIETDQEWNLHIYGEGPLKGWIEEYVSSNGLHHRIILHGPCGNMEEEFPRHSILLFP 297 Query: 214 LPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNR 273 FGL+ VE + LP+ D + + + Sbjct: 298 SRCEGFGLV----QVEA--------------------MCCGLPILAADIPICREIVEKHH 333 Query: 274 IGYA--VGSIKEMQEIVDSMTIETYKQISENTKIISQKI 310 G + +++ + MT + N + Sbjct: 334 AGILFESDNPEDLCRAMREMTASDLSSYAANGLAAAPLF 372 >UniRef50_B6FJA1 Putative uncharacterized protein n=1 Tax=Clostridium nexile DSM 1787 RepID=B6FJA1_9CLOT Length = 415 Score = 52.7 bits (125), Expect = 2e-05, Method: Composition-based stats. Identities = 57/370 (15%), Positives = 113/370 (30%), Gaps = 82/370 (22%) Query: 23 LDIASDYENISVVNIPLWGGVVQRIISSVKL-------STFLCGLENKDVLIFNFPMAKP 75 +I E V + G +QRI++ V+L E DV+ + P Sbjct: 59 EEIVDGIEYTFVKSRDYRGNGLQRILNMVELPFQMWKTMKLFFKKEKPDVIYTSSPDL-F 117 Query: 76 FWHILSFFHRLLKFRIVPLIHDI--DELRGGGGS-------------DSVRLATCDMVIS 120 F R K +V + D+ + + G + D +I Sbjct: 118 VAFFALVFGRKKKIPVVVEVRDLWPESIVEYNGMSRKNPIIQILYQLEKWIYKKADRLIF 177 Query: 121 HNPQMTKYLSKYMSQDKI-----------KDIKIFDYLV-SSDVEHRDVTDKQRGVIYAG 168 P +Y+ I D++ F+Y ++ V D+ + ++Y G Sbjct: 178 TMPGGKEYIKDKGWDKAIDLRKVHHINNGVDLEEFEYNKRNNHVCDEDLESNDKKIVYVG 237 Query: 169 NLSRHKCSFIYTE-----------GCDFTLFGVNYENKD----------NPKYLGSFDAQ 207 ++ + F ++G E + N + G + Sbjct: 238 SIRLANNLGQLIKAAKVLKEKHRDDIKFLIYGDGTEKEQLEKFACEEKLNVVFKGK--VE 295 Query: 208 SPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNP-HKTSLYLSMELPVFIWDKAALA 266 + D G K+ +K Y++ E P + + Sbjct: 296 KKYIPYILSK-------ADVNIINVKNTGL-TKYGCSWNKLFEYIASENP-IVCNFPQKY 346 Query: 267 DFIVDNRIGYA---------VGSIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFR 317 D I + +G + +KE+ + +T E + + EN K + +K Y Sbjct: 347 DLINEYHLGKSEKFASSRDYAKRLKELVD----ITEEERRVVRENAKQL-KKEYDYQYLT 401 Query: 318 DVLEEVIDDL 327 LE++ ++ Sbjct: 402 TCLEKIFSNV 411 >UniRef50_A6LJY0 Putative uncharacterized protein n=1 Tax=Thermosipho melanesiensis BI429 RepID=A6LJY0_THEM4 Length = 368 Score = 52.7 bits (125), Expect = 2e-05, Method: Composition-based stats. Identities = 36/312 (11%), Positives = 97/312 (31%), Gaps = 49/312 (15%) Query: 27 SDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRL 86 Y + I + + + + + D++ F++ + Sbjct: 60 KKYFGKTSTLINNFSSIKKFDRKIFDMIKTF----DYDIIYFHY-FLVSMPVKAFKVAKN 114 Query: 87 LKFRIVPLIHDIDE---LRGGGGSDSVR------------LATCDMVISHNPQ----MTK 127 ++V +H+ + G D +I + + M Sbjct: 115 KGKKVVYDLHEYHPENHFKNLKGLAKKVKEKLMWKVIKNQFFFSDKLIFVSEEARNDMLN 174 Query: 128 YLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAG----NLSRHKCS--FIYTE 181 L + +D I + + +K + ++ G N+ + + E Sbjct: 175 ILKTH--KDSIV-------IPNYANIKLKSPEKIKEIVIVGKTPRNIQNEREILKNLNKE 225 Query: 182 GCDFTLFGVNYENKDNPKYLGS-FDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLK 240 G + G+ ++ + + + F LI S ++ + +Y+ Sbjct: 226 GFSIKIVGIKTNILNDIPCKYTDVLPYDKMMVEVSKSAFSLI----SYKSFGQEYKNYI- 280 Query: 241 FNNPHKTSLYLSMELPVFIWDK-AALADFIVDNRIGYAVGSIKEMQEIVDSMTI--ETYK 297 ++ PHK ++ PV + ++ + + IG + + ++E V + + Y+ Sbjct: 281 YSFPHKFFDSIAAGTPVIVNRSFVSMKNEVEKYGIGIVIEP-QNVKESVRKILEAYKNYE 339 Query: 298 QISENTKIISQK 309 + EN + + Sbjct: 340 KFLENIETYKDR 351 >UniRef50_C1TR28 Glycosyltransferase n=1 Tax=Dethiosulfovibrio peptidovorans DSM 11002 RepID=C1TR28_9BACT Length = 378 Score = 52.7 bits (125), Expect = 2e-05, Method: Composition-based stats. Identities = 39/288 (13%), Positives = 88/288 (30%), Gaps = 61/288 (21%) Query: 46 RIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIH-DID----- 99 RII + + D+ + P +L L +++ H D+ Sbjct: 64 RIIKTARKVVDKAIATKADLYHIHDP-----ELLLFSKKLLKHGKVIYDAHEDVPRQILS 118 Query: 100 ------ELR-----GGGGSDSVRLATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLV 148 LR ++ + + V++ P ++ K + I I +Y Sbjct: 119 KGWIPRPLRRPLSFITEKVENHYVKRLNGVVTATP----FIKKRFIKINTHSIDINNYPK 174 Query: 149 SSDVEHRDVTDKQRG-VIYAGNLSRHKCSFIYTE-----GCDFTLFGV--NYENKDNPK- 199 ++ + +++ + Y G +S + F + L G N + +D K Sbjct: 175 LKELGDIEHASEKKRLICYVGGISTIRGIFEMVQAMQYVNGKLLLAGSFSNQKERDLVKT 234 Query: 200 -----------YLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTS 248 Y + ++ GL+ +Y+ P K Sbjct: 235 FPGWRKVIELGYCDRKKVKKILSLSRA----GLV--------VLRPTINYID-ALPVKLF 281 Query: 249 LYLSMELPVFIWDKAALADFIVDNRIGYAVGSIK--EMQEIVDSMTIE 294 Y++ E+PV + ++ G V + E+ + ++ + Sbjct: 282 EYMASEIPVVASSFPLWIKIVSSSKCGICVNPLNPKEIGKAINWILDN 329 >UniRef50_C6A091 Glycosyltransferase n=1 Tax=Thermococcus sibiricus MM 739 RepID=C6A091_THESM Length = 355 Score = 52.7 bits (125), Expect = 2e-05, Method: Composition-based stats. Identities = 41/261 (15%), Positives = 72/261 (27%), Gaps = 40/261 (15%) Query: 28 DYENISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLL 87 ++ + L + +KL + D + F Sbjct: 59 KAGYGPLMALKLPLFYLNAFRVILKLKPDAIHTHDFDTAVL----------GFFFKKLKK 108 Query: 88 KFRIVPLIHDI-DEL----RGGGGS---DSVRLATCDMVISHNPQMTKYLSKYMSQDKIK 139 K V +HD+ + R DS+ + D+VI+ N +M K L + ++ + Sbjct: 109 KVMWVYDVHDLYESFVRNNRVSSLIKRLDSIIMGKADIVITVNQEMMKILLER-ARPNMT 167 Query: 140 DIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIY-----TEGCDFTLFGVNYEN 194 I I + + V + K + Y G LS + T + G Sbjct: 168 VI-IMNTINPFGVSQK--KAKIFTIFYGGVLSEGRFVKEIVDISSTLDVSIRIAGSGTLK 224 Query: 195 KD----NPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLY 250 + + YLG L I SV K +P+K Sbjct: 225 DEIKSSSAVYLG-HVPHERALEELSRAHVTFILYDPSVLNN--------KIASPNKLFEA 275 Query: 251 LSMELPVFIWDKAALADFIVD 271 + PV + Sbjct: 276 MWTGTPVLVVKGTLPEKLAEK 296 >UniRef50_A0PZR3 Putative glycosyl transferase n=14 Tax=Clostridium RepID=A0PZR3_CLONN Length = 374 Score = 52.3 bits (124), Expect = 3e-05, Method: Composition-based stats. Identities = 40/285 (14%), Positives = 93/285 (32%), Gaps = 30/285 (10%) Query: 69 NFPMAKPFWHILSFFHRLLKFRIVPLIHDID-----------------ELRGGGGSDSVR 111 + P + + +I DI Sbjct: 86 HIPKYPFLDFSFFKYCKSHNIKIGLFYRDIHWKFEQYKNNVGTAKRIISTMFYNYDLKKY 145 Query: 112 LATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLS 171 D++ + +M KYL + + KI+++ + +D + Y G +S Sbjct: 146 KELVDVLYLPSKEMFKYL-EIDFKGKIEELPPGSEKNDRVDLVKYKSDDYLNIFYVGGIS 204 Query: 172 RHKC----SFIYTEG---CDFTLFGVNYENKDNPKYLGSFDAQSPEKINLPGMQ-FGLIW 223 F T+ + + ++ + I+ G + + + Sbjct: 205 TELYDIRELFKVANELPWIKLTVCCRESDWNNVADAYLNYLNERISIIHKSGEELYAYVK 264 Query: 224 DGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVG-SIK 282 D + +Y +F P K Y+S + P+ +F+ N IG+++ S + Sbjct: 265 KADILNLFIRPT-EYWEFAVPVKLFTYISYKKPIISAKNTVTGNFVEKNNIGWSIDYSKE 323 Query: 283 EMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDDL 327 E+ +++ + K+I E T + + + ++ ++V+ DL Sbjct: 324 ELIKMLTELKNNR-KEIVEKTSNMEKVLEENTWNSRA-KKVVKDL 366 >UniRef50_Q1WS01 Glycosyltransferase n=2 Tax=Lactobacillus salivarius RepID=Q1WS01_LACS1 Length = 350 Score = 52.3 bits (124), Expect = 3e-05, Method: Composition-based stats. Identities = 47/321 (14%), Positives = 98/321 (30%), Gaps = 44/321 (13%) Query: 26 ASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHR 85 ++ VNI +V +I +K+ F E I P + + Sbjct: 49 LQKLDDKVSVNIYTGNNLVSKIWFIMKIFIF----EKDTNYISLSPSLILIANKVRKLCN 104 Query: 86 LLKFRIVPLIHDIDELRGGGGSDSVRLATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFD 145 K++I+ IH L+ D +L D ++ + + + L + + + + IF+ Sbjct: 105 -KKYKIISWIH--FSLKNQDMFDPGKLVNADGHLAISSVIREQLLELGVKSD-EIMTIFN 160 Query: 146 YLVSSDVEHRDVTDKQRGVIYAG--------NLSRHKCSFIYTE--GCDFTLFGVNYENK 195 + + + + YAG N+S + ++G + + Sbjct: 161 PIERHNSIPEVKKENYLNLFYAGRMTFDGQKNISELLSGISKIKGINYHLDMYGSGEDLE 220 Query: 196 DNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHK-------TS 248 +Y G + W G + K Sbjct: 221 KCKEY-GKKLGINGNI----------TWHGWTKNLWEEINVRPSALVMSSKYEGLPMIML 269 Query: 249 LYLSMELPVFIWDKAALADFIVDNRIGYAVGS--IKEMQEIVDSMTIETYKQISENTKII 306 +S +PV D + + GY S I+E+ + + ++ + TK I Sbjct: 270 ESISRGIPVVTTKFDGYEDIVKEGVNGYTYKSGNIEELGQKLIKISEQKMS-----TKDI 324 Query: 307 SQKIRTGSYFRDVLEEVIDDL 327 I Y + + + + L Sbjct: 325 QDSIEN-YYTENYFKNLENAL 344 >UniRef50_A7V9M4 Putative uncharacterized protein n=1 Tax=Bacteroides uniformis ATCC 8492 RepID=A7V9M4_BACUN Length = 408 Score = 52.3 bits (124), Expect = 3e-05, Method: Composition-based stats. Identities = 49/348 (14%), Positives = 103/348 (29%), Gaps = 67/348 (19%) Query: 16 FKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCG-----LENK--DVLIF 68 K + +I + E + G V RI S L L N D++I Sbjct: 47 KKQPCEYREIVDNVEYCWIPVNRYKGNGVGRIFSMFLFVFKLYHYFVEYLRNFEPDIVIA 106 Query: 69 NFPMAKPFWHILSFFHRLLKFRIVPLIHDIDEL---------------RGGGGSDSVRLA 113 + + + +++ +HD+ L + +++ Sbjct: 107 SS-TYPLDIYPAYKIAKHYHAKLIYEVHDLWPLSPMELGGYSKKHPFIQIMQKAENDCYR 165 Query: 114 TCDMVISHNPQMTKYLSKYMS-QDKIKDIK----IFDYLVSSDVEHRD-------VTDKQ 161 D V+S P+ +++ ++ + K + + D+ + + + Sbjct: 166 YVDTVVSMLPKAEEHMREHGLGKGKFHYVPNGIVLSDWNNPKGIPEEHGLLLSRFQKEGK 225 Query: 162 RGVIYAGNLSRHKCSF--------IYTEGCDFTLFGVNYENKDNPKY-------LGSFDA 206 V +AG + + + L G E ++ KY F Sbjct: 226 FIVGFAGAHGIANSLYAVIDAVSSLAEQNVVLVLVGGGQEKENLIKYAHKKEIVNVYFLP 285 Query: 207 --QSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKF-NNPHKTSLYLSMELPVFIWDKA 263 NL V +F +P+K Y+ P+ A Sbjct: 286 TIDKLAIPNLLKEM--------DVLYIGLQKQSLFRFGISPNKMFDYMMAAKPIIQAIDA 337 Query: 264 ALADFIVDNRIGYAV--GSIKEMQEIV---DSMTIETYKQISENTKII 306 + + + G V ++ E+ + + SM E +++ EN K Sbjct: 338 GN-NLVGEADCGIDVEPDNVGEISKAILALKSMPEEERRRLGENGKKF 384 >UniRef50_C3WZE2 Glycosyltransferase n=2 Tax=Fusobacterium RepID=C3WZE2_9FUSO Length = 413 Score = 51.9 bits (123), Expect = 3e-05, Method: Composition-based stats. Identities = 43/328 (13%), Positives = 95/328 (28%), Gaps = 54/328 (16%) Query: 44 VQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDE--- 100 +Q + S L D++I + P + + + I D+ Sbjct: 90 IQFFFKVIFFSKKLLKDSKPDIIIASSPHPFNGLAGMYLAKKYK-CPFIIEIRDLWPETW 148 Query: 101 ------------LRGGGGSDSVRLATCDMVISHNPQMTKYLSKYMSQDKIKD-------I 141 + + D +I+ Y + K + + Sbjct: 149 VAMGATTRKSILYKFFAYIEKKLYKNADKIITLTAN-KDYYTSIGIDGKKVEIISNGVDL 207 Query: 142 KIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCS--------FIYTEGCDFTLFGVNYE 193 + +D + + ++Y G+ S+ + E F L G Sbjct: 208 ESYDSNLKEYKSPLTFSKDNFNILYTGSHSQGDALDILIETAELLSKEKIVFHLVGEGVI 267 Query: 194 NKD-------NPKYLGSFDAQ--SPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNP 244 ++ N F E +L D+V Y +P Sbjct: 268 KEELKKRVKENNINNVKFYDIVKKYEIPSLLKES-------DAVIMLLRDIPLYKYGMSP 320 Query: 245 HKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGS-----IKEMQEIVDSMTIETYKQI 299 +K YL+ P+ + + D + + G +V + +KE + MTI+ + + Sbjct: 321 NKMYEYLASTKPII-FSGSVANDMVKEANAGVSVEAENPKKLKEGILSLQKMTIDEREVL 379 Query: 300 SENTKIISQKIRTGSYFRDVLEEVIDDL 327 + + ++ +E++I +L Sbjct: 380 GKKGRKYVEENYDTKVLSKKIEKIILNL 407 >UniRef50_Q8CX81 Glycosyltransferase (Capsular polysaccharide synthesis) n=3 Tax=Bacillaceae RepID=Q8CX81_OCEIH Length = 386 Score = 51.9 bits (123), Expect = 3e-05, Method: Composition-based stats. Identities = 46/329 (13%), Positives = 103/329 (31%), Gaps = 60/329 (18%) Query: 18 ARKDALDIASDYENISVVNIPL--WGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKP 75 A++D S+ + + + +I L + ++R+ + N DV F+ P P Sbjct: 40 AKEDKN---SERKEVPIKHIKLKSYTSRLKRMTIGALAAYKQAKKLNADVYHFHDPELLP 96 Query: 76 FWHILSFFHRLLKFRIVPLIHD--IDELRGGGGSDSVRLATCDMVISHN-PQMTKYLSKY 132 +L + ++ IH+ I + +I+ M ++ SK Sbjct: 97 VGWLL----KNKSNHVIYDIHEDYITSIMQKDYMSRPIRK----LIAFTYKTMERFFSKN 148 Query: 133 M---SQDKIK--------DIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIYTE 181 M +K I + + EH + ++Y GN++ + + Sbjct: 149 MELCLAEKYYQDIYPTGKCILNYPTINQKISEHHRTGTPEYKLLYTGNVTLDR-GALIHA 207 Query: 182 GCDFT-------------------LFGVNYENKDNPKYLG--SFDAQSPEKINLPGMQFG 220 ++ KDN + G F + + + Sbjct: 208 RIPVIDERMEVYYVGKCPNQLAEQIYNKAATRKDNIQIEGIDQFVEKEDIEERYLQHNW- 266 Query: 221 LIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGS 280 + Y+K K Y++ +PV + +FI ++ G V Sbjct: 267 -----LAGIALFPPTEHYMKKELT-KFFEYMNAGIPVICSNFPVWENFINKHQCGITVDP 320 Query: 281 IK--EMQEIVDSMTI--ETYKQISENTKI 305 E+++ + + + +++ N K Sbjct: 321 YNDQEIKDAISYLVENPDKAEEMGANGKK 349 >UniRef50_B9DUI8 Putative glycosyl transferase n=1 Tax=Streptococcus uberis 0140J RepID=B9DUI8_STRU0 Length = 347 Score = 51.9 bits (123), Expect = 3e-05, Method: Composition-based stats. Identities = 38/289 (13%), Positives = 91/289 (31%), Gaps = 29/289 (10%) Query: 28 DYENISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLL 87 + I + + +++R+ + + D+L+ + + Sbjct: 53 KKKYILQIIFSYFFSIIRRLWHLYFI------IPKTDILLIQKAVIPKLKPTFLKHLKRK 106 Query: 88 KFRIVPLIHDIDELRGGGGSDSVRLATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYL 147 K RIV + D L + D++I N + Y S+Y + D Sbjct: 107 KIRIVFDVDDAIYL-LKNDNSQEIAKNVDLIICGNDNLRTYYSQY--NKNCVVLPTTD-N 162 Query: 148 VSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIYTEGCDFTLFGVNYENKDNPKYLGSFDAQ 207 + + D T + + + G+ + K + + + E + Sbjct: 163 TNKYKPYWDDTFNNKTIGWIGSRTTIKNLHEIVKPINQLV-----EKYPQVSFKII-SND 216 Query: 208 SPEKINLPGMQFGLIWDGDSVETCSGAF---------GDYLKFNNPHKTSLYLSMELPVF 258 + + + + W D+ ++ + K YL+M+ PV Sbjct: 217 ALDFPKIIKNTKLIKWSSDTYLEDLSQLTVGIMPLKDDEFNRGKCGFKLIQYLNMKKPVV 276 Query: 259 IWDKAALADFIVDNRIGYAVGSIKEMQEIVDSM--TIETYKQISENTKI 305 + + I +N G+ V I++ + ++++ E Y N + Sbjct: 277 GSNVGVNGEIINNN--GFVVNDIEDWFKNLETLLFNQEEYNNCQTNIEN 323 >UniRef50_Q2RKP7 Glycosyl transferase, group 1 n=1 Tax=Moorella thermoacetica ATCC 39073 RepID=Q2RKP7_MOOTA Length = 381 Score = 51.9 bits (123), Expect = 3e-05, Method: Composition-based stats. Identities = 33/262 (12%), Positives = 71/262 (27%), Gaps = 50/262 (19%) Query: 80 LSFFHRLLKFRIVPLIHD--------IDELRGGGGS--DSVRLATCDMVISHNPQMTKYL 129 L RI+ I D I L + L D VI + + + Sbjct: 110 CLLSKALGGKRIIYDIFDLYADSRRNIPTLIRKLLHVLEFKALEWVDAVILADESRREQI 169 Query: 130 SKYMSQDKIKDIKIFDYLVSSDV------EHRDVTDKQRGVIYAGNLSRHKCSFIY---- 179 + + I Y DV + ++Y G L + Sbjct: 170 AGTRPRRLITI-----YNSPPDVLDTLRRNGPPPRLTELYLVYVGLLQVERGLLEMMEIL 224 Query: 180 --TEGCDFTLFGVNYENKD----------NPKYLGSFDAQSPEKINLPGMQFGLIWDGDS 227 L G+ +++ N + G + L + D Sbjct: 225 GRHPEWHLDLAGLGGGDEERILGLARSLPNVTWHG---------PIIYKRALKLSYAADV 275 Query: 228 VETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVG--SIKEMQ 285 + ++++P+K + + PV + + + G V + ++ Sbjct: 276 LIATYDPTIPNHRYSSPNKVFEAMMLAKPVVVARNTGIDRLVEKINCGLVVPYGDVAALE 335 Query: 286 EIVDSMTIETY--KQISENTKI 305 + + + +Q+ EN + Sbjct: 336 AALIRLARDPALRQQLGENGRR 357 >UniRef50_C6P8E8 Glycosyl transferase group 1 n=1 Tax=Thermoanaerobacterium thermosaccharolyticum DSM 571 RepID=C6P8E8_CLOTS Length = 370 Score = 51.9 bits (123), Expect = 3e-05, Method: Composition-based stats. Identities = 35/233 (15%), Positives = 71/233 (30%), Gaps = 47/233 (20%) Query: 105 GGSDSVRLATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHR-----DVTD 159 + + + D+ S N Q+ + L K M+ I Y + + V + V Sbjct: 130 RNMEKYCIESSDITFSVNEQLIE-LRKNMT-------GITPYYIPNGVNYELFKGDKVKH 181 Query: 160 KQRGVIYAGNLSRHKCSFI---------YTEGCDFTLFGVNYENK------------DNP 198 ++++G+L + + G D Sbjct: 182 NGTILVFSGSLEHWAGIEMPIKALPILRRELDVSMMILGRGKYEPVLKKLSRDYKVNDFV 241 Query: 199 KYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVF 258 +LG + GL C+ + +K++ P K Y++ LPV Sbjct: 242 HFLGKVKYRDLPLHFK-KSDIGL---------CTLFPTELIKYSFPLKAIEYMAAGLPVI 291 Query: 259 IWDKAALADFIVDNRIGYAVG-SIKEMQEIVDSM--TIETYKQISENTKIISQ 308 D L I +N G + S+ + E + + +N + ++ Sbjct: 292 ATDIGDLGKLIKENECGITIKYSVIDFVEKTIDLIENRDKMSIYGQNGRNFAK 344 >UniRef50_B5IDW8 Glycosyl transferase, group 1 family protein n=2 Tax=Aciduliprofundum boonei T469 RepID=B5IDW8_9EURY Length = 371 Score = 51.9 bits (123), Expect = 3e-05, Method: Composition-based stats. Identities = 39/308 (12%), Positives = 90/308 (29%), Gaps = 50/308 (16%) Query: 57 LCGLENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHD-------------IDELRG 103 + E D++ N + +L +++V HD I + Sbjct: 78 ILKKEKFDIVHAND---FDTLPLAIMLKKLHGWKVVYDAHDHYSSMIADVLPTLIPSIIF 134 Query: 104 GGGSDSVRLATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDY-LVSSDVEH---RDVTD 159 + L D I+ + + + L + + + DY L + V+ + + Sbjct: 135 K--LEKYLLKFTDARIAASEAIARELDTLPFEIVLNAKNLKDYTLTQAKVQQFRAKINPE 192 Query: 160 KQRGVIYAGNLSRHKCSFIYT------EGCDFTLFGVNY---------ENKDNPKYLGSF 204 + ++Y G L + G ++ N +Y+G Sbjct: 193 GKFLIVYIGILKLWTPLPQIIQAVKKLPEVKLIVGGKGPHENEIIGMIKDAKNIEYVGW- 251 Query: 205 DAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAA 264 D V S + Y + +K L+ P+ Sbjct: 252 -VNKKYIPLYTLAS-------DLVVLPSNSAKLYTRVAVANKIMEGLAAGKPLIAGTNTE 303 Query: 265 LADFIVDNRIGYAVG--SIKEMQEIVDSMTIET--YKQISENTKIISQKIRTGSYFRDVL 320 + + G + + ++ + + YK+ ++N ++ ++K + D L Sbjct: 304 GGKIVRECNAGLLCDYGDVGCLVNSINKLMKDKELYKKYAKNARVCAEKKYNWNIMSDRL 363 Query: 321 EEVIDDLK 328 + LK Sbjct: 364 IRLYKSLK 371 >UniRef50_C1RJB3 Glycosyltransferase n=1 Tax=Cellulomonas flavigena DSM 20109 RepID=C1RJB3_9CELL Length = 412 Score = 51.9 bits (123), Expect = 3e-05, Method: Composition-based stats. Identities = 30/199 (15%), Positives = 52/199 (26%), Gaps = 32/199 (16%) Query: 111 RLATCDMVISHNPQMTKYLSKYMS-QDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGN 169 + V+ +P M L K+ + + + V YAGN Sbjct: 167 LYSFAAAVVVISPYMRDALIARGIDPGKLHVVFNWSPDETPCASRAPSDAAGCTVTYAGN 226 Query: 170 LSRHKCS----------FIYTEGCDFTLFGVNYENKD---------NPKYLGSFDAQSPE 210 L + G G + ++ G A Sbjct: 227 LGSAQGLDTAVQAARIVADELPGFRLLFVGSGTMESELRRLADGLECVEFRGRVPADKMP 286 Query: 211 KINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIV 270 +I F L+ GD L P K LS +PV + + + Sbjct: 287 EI-YAETSFQLV--------ILRPTGD-LAGAVPSKLQASLSAGMPVICSAPGSAPEIVR 336 Query: 271 DNRIGYAVG--SIKEMQEI 287 G+ +++E+ E Sbjct: 337 SAAAGFTASPGNVEELAEA 355 >UniRef50_B2T0K0 Glycosyl transferase family 2 n=1 Tax=Burkholderia phytofirmans PsJN RepID=B2T0K0_BURPP Length = 817 Score = 51.5 bits (122), Expect = 4e-05, Method: Composition-based stats. Identities = 38/211 (18%), Positives = 65/211 (30%), Gaps = 41/211 (19%) Query: 88 KFRIVPLIH-----------DIDELRGGGGSDSVRLATCDMVISHNPQMTKYLS---KYM 133 + V +H D R + + + VI + M YL+ + + Sbjct: 564 GLKWVLDVHGVVPEEFRMHNDFFSARIHDDCERMAVENASRVIVVSQAMGNYLAHKYREL 623 Query: 134 SQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIYTE-------GCDFT 186 + K+ + IF L R D + VIYAG + + D+ Sbjct: 624 LRAKVIVLPIF-ALEEGSEGARPYKDGRPTVIYAGGTQKWQNVPAMMNVVARALDHADWW 682 Query: 187 LFGVNYENKDN-----PKYLGSF---DAQSPEKINLP-GMQFGLIWDGDSVETCSGAFGD 237 ++ + E N + +F A E + FG I D V Sbjct: 683 MYTPDVEVMRNAAAPEVQNHPNFHVASATRAELNEVYQQCHFGFILRDDMVVN------- 735 Query: 238 YLKFNNPHKTSLYLSMELPVFIWDKAALADF 268 + P K YL+ + + D + DF Sbjct: 736 --HVSCPTKLIEYLAFGIVPIV-DTPNIGDF 763 >UniRef50_Q9WZ99 Lipopolysaccharide biosynthesis protein n=1 Tax=Thermotoga maritima RepID=Q9WZ99_THEMA Length = 434 Score = 51.5 bits (122), Expect = 4e-05, Method: Composition-based stats. Identities = 43/348 (12%), Positives = 96/348 (27%), Gaps = 47/348 (13%) Query: 21 DALDIASDYENISVVNIPLWGGVVQRI-ISSVKLSTFLCGLENKDVLIFNFPMAKPFWHI 79 D I I + G I + + + DV + P + Sbjct: 85 DIEVIRVRLPYIERHQLLRRGVEHFEIALKMFSYAKEYLRNKRVDVSLVYSPPITLYKTA 144 Query: 80 LSFFHRLLKFRIVPLIHDIDE--------------LRGGGGSDSVRLATCDMVISHNPQM 125 RL V + D+ +R + D++ H+ + Sbjct: 145 WK-VKRLKDAPFVLNVQDLFPQAAIDLGILKNPLLIRLFKQVEKKAYQLADLITVHSERN 203 Query: 126 TKYLSK--YMSQDKIKDIKIF-------DYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCS 176 +++ K+ ++ + +D + ++ V +AG L + Sbjct: 204 KEFVKSVLNGDGRKVLVMENWVDENEIKPGDKINDFSIKHGLTEKFVVSFAGTLGFSQDM 263 Query: 177 FIYT---------EGCDFTLFGVNYENKDNPKYLGSFDAQSPE-KINLPGMQFGLIWDGD 226 + + F + G +++ K S + Q+ ++P + L+ Sbjct: 264 EVIIRAANELKEYKDIVFIIVGNGVRLEESKKLAESLNLQNIRFIPSVPREIYPLVLHSS 323 Query: 227 SVETCSGAFGDYLKFNNPHKTSLYLSMELPVFI--WDKAALADFIVDNRIGYAV-----G 279 V + P K +S +PV + + G+A+ Sbjct: 324 DVSLATLTKDVKTPV-VPSKILSIMSAGIPVIAVMNLEGDAPKLVEKANAGFAIPAGDYK 382 Query: 280 SIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDDL 327 S+ E ++ E + + N + I R E+ Sbjct: 383 SLAEKILLLYK-NPELRESLGRNGRRY---IEENLSSRKAAEKYEKIF 426 >UniRef50_Q13D55 Glycosyl transferase, group 1 n=1 Tax=Rhodopseudomonas palustris BisB5 RepID=Q13D55_RHOPS Length = 422 Score = 51.5 bits (122), Expect = 4e-05, Method: Composition-based stats. Identities = 35/299 (11%), Positives = 76/299 (25%), Gaps = 61/299 (20%) Query: 42 GVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDI--D 99 + ++++ + +L D + L + + + D+ Sbjct: 96 NYISFVVTATLIGPWLLRGRRFDAIFVYAVSPILQAIPAIAIKWLKRAPLTVWVQDLWPQ 155 Query: 100 ELR---GGGGSDSV---------RLATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYL 147 L + D+++ + ++ + ++ + Sbjct: 156 SLEVTGFVKNKSILAAVGTLTRWIYRRSDLLLVQSQGFVPEVTAMAGEVPVRY-----HP 210 Query: 148 VSSDVEHRDVTDKQR--------GVIYAGN----------LSRHKCSFIYTEGCDFTLFG 189 D++ + R V++AGN L+ + + G Sbjct: 211 NPGDLQDQRFWGGDRVYTLRPGFNVVFAGNVGTAQSPDTLLNVARELSNQP-DIRIVVVG 269 Query: 190 VNY-----------ENKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDY 238 + DN ++ G F A I S G Sbjct: 270 TGSRLDWLRQEAKEQRLDNLEFAGRFPASYMAGIFA----------QASALLVVLGKGTI 319 Query: 239 LKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAV--GSIKEMQEIVDSMTIET 295 L P K S YL+ P+ A I + G AV + + + +M Sbjct: 320 LTQTVPSKVSSYLAAGKPIIGSIDGEGAHVIREAGAGIAVPAEDVHALASAIVAMRDAD 378 >UniRef50_A3HV73 Predicted glycosyltransferase n=1 Tax=Algoriphagus sp. PR1 RepID=A3HV73_9SPHI Length = 414 Score = 51.5 bits (122), Expect = 4e-05, Method: Composition-based stats. Identities = 41/339 (12%), Positives = 102/339 (30%), Gaps = 62/339 (18%) Query: 36 NIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLI 95 V S L L D+L I + R + + + Sbjct: 73 KFGFIKRVKAFFTFSYLAKRLLKRLSRPDLLYITS-TPLTTGWIGLWAKRKMALPYIFEV 131 Query: 96 HDIDE--------------LRGGGGSDSVRLATCDMVISHNPQMTKYLSKYMSQDKIK-- 139 D+ ++ ++ +++ +P + ++L K +++ Sbjct: 132 RDLWPQAPIEVKAIKNPLAIKYLKKVEAKIYRHALKLVALSPGIAEHLRKVSPDSQVQLI 191 Query: 140 -----DIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSR----------HKCSFIYTEGCD 184 F + + + Y G L + K + + Sbjct: 192 PNFSDVQTFFPQKKQERILQKYQLKDELTFAYTGALGKVNAVEELLALAKVAQQKGKNWQ 251 Query: 185 FTLFGVNYENK-----------DNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSG 233 F + G + + N ++ F ++ L + F W S Sbjct: 252 FIIMGKGSQEQALKQLASKSQLQNVIFV-PFGSKEKVNEVLSMVDF--AW-------ISF 301 Query: 234 AFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAV--GSIKEMQEIVDSM 291 A LK N+P+K ++ + + K + + N++G G I+E +++ M Sbjct: 302 AHLPVLKTNSPNKFFDAIAAGKAILVNHKGWVYQLTIKNKLGLTFLPGKIEESFHLIEEM 361 Query: 292 TIET--YKQISENTKIISQKIRTGSYFRD-VLEEVIDDL 327 ++ +N S+++ + ++ + +I+ + Sbjct: 362 ESNPVLLDRMKQN----SRRLAENYFCKEIAISRLINVI 396 >UniRef50_Q2LWM1 Glycosyltransferase n=1 Tax=Syntrophus aciditrophicus SB RepID=Q2LWM1_SYNAS Length = 379 Score = 51.1 bits (121), Expect = 5e-05, Method: Composition-based stats. Identities = 22/182 (12%), Positives = 59/182 (32%), Gaps = 24/182 (13%) Query: 132 YMSQDKIKDIKIFDYLV---SSDVEHRDVTDKQRGVIYAGNLSRHKCSFIYTEGCDFTLF 188 S + + +F + + + + K ++ G+ + + + Sbjct: 188 KGSDVTLIHLGLFGKIRGWPQTLEALKTMKQKNVRLVVIGDFN---DGSRADFDSVVSAY 244 Query: 189 GVNYENKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTS 248 G+N Y + +L GL+ G + PHK Sbjct: 245 GLN---DRVVVYD--WMPFEDAFKHLMQAHIGLV------VFQPGILNHV--YAMPHKMF 291 Query: 249 LYLSMELPVFIWDKA-ALADFIVDNRIGYAVG--SIKEMQEIVDSM--TIETYKQISENT 303 Y++ + V + A +A F+ + + G V + ++ + +D + + + ++ Sbjct: 292 DYMAAGMAVICPEFAMEVAPFVKEAKCGLLVDTANPADLAKKLDELVSSPDLIHEMGVRA 351 Query: 304 KI 305 + Sbjct: 352 QK 353 >UniRef50_Q2ILL1 Glycosyl transferase, group 1 n=1 Tax=Anaeromyxobacter dehalogenans 2CP-C RepID=Q2ILL1_ANADE Length = 463 Score = 51.1 bits (121), Expect = 5e-05, Method: Composition-based stats. Identities = 16/77 (20%), Positives = 32/77 (41%), Gaps = 6/77 (7%) Query: 236 GDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSI--KEMQEIVDSMTI 293 +Y + P+K Y+ M LPV +A + + +G V +E+ V+ + Sbjct: 355 NNY--YCAPNKLFEYMMMGLPVVAPSFPGMARIVAGDDVGLCVDPSRPEEIAAAVNRLAR 412 Query: 294 ETY--KQISENTKIISQ 308 + ++ N +SQ Sbjct: 413 DPAARARMRANGLRLSQ 429 >UniRef50_Q6LYB1 Glycosyl transferase, group 1 n=1 Tax=Methanococcus maripaludis RepID=Q6LYB1_METMP Length = 364 Score = 51.1 bits (121), Expect = 6e-05, Method: Composition-based stats. Identities = 38/250 (15%), Positives = 69/250 (27%), Gaps = 42/250 (16%) Query: 81 SFFHRLLKFRIVPLIHDID--ELR-------GGGGSDSVRLATCDMVISHNP--QMTKYL 129 + + K ++ IHD+ LR D D++I N +M + Sbjct: 105 NKLKKKFKVPLIYDIHDLYESYLRNNKFLKIIIRNLDIFYCKKADLIIIVNDSFKMRPQI 164 Query: 130 SKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIYTEGCDFTLFG 189 K + + + + G+ YAG L + +F Sbjct: 165 KGR----KTVLL----MNTPLLRGNVFSKNTKEGIFYAGGLQESRKMD--------FVFE 208 Query: 190 VNYENKDNPKYLGSFDAQSPEKINL---PGMQFGLIWDGDSVET---CSGAFGDYL---- 239 N E G G I D E C Y Sbjct: 209 ANKELNQRVTIAGDGPLLEEYMKEYDSNLNTFLGRIPAKDVEEKTKNCKLIIAPYDPFYL 268 Query: 240 --KFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIG--YAVGSIKEMQEIVDSMTIET 295 + +P+K + P + + + D + R G + G I++ E ++ + Sbjct: 269 NNRLASPNKLFEAMKYGKPSLVSEGTVMGDIVKKERCGETFRYGDIEDFIEKC-NLIEKN 327 Query: 296 YKQISENTKI 305 Y + N Sbjct: 328 YDFYALNAYS 337 >UniRef50_A8ZY94 Glycosyl transferase group 1 n=1 Tax=Desulfococcus oleovorans Hxd3 RepID=A8ZY94_DESOH Length = 522 Score = 51.1 bits (121), Expect = 6e-05, Method: Composition-based stats. Identities = 35/250 (14%), Positives = 81/250 (32%), Gaps = 36/250 (14%) Query: 106 GSDSVRLATCDMVISHNPQMTKYLSKY-MSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGV 164 + + D+VI+ + ++ K+ + I I ++ + + Sbjct: 205 LFEQQFIKYADLVITTSDGHASWMKKHFGCRCPILTIHNCVSADLDAIKPKTYAGGPIKL 264 Query: 165 IYAGNLSRHKCSFIYTEGCDFTLFGVNYEN--------------------KDNPKYLGSF 204 + G LS + G+ +L Sbjct: 265 YFHG-LSDAPKKIDVMVKAISRVPGIELVLRCVASENLLAVKKLVNDLGVSHKVHFLDLV 323 Query: 205 DAQSPEKINLPGMQFGL-IWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKA 263 + G+ IW E C L+ +K YLS LPV Sbjct: 324 TPEEVAFYANRDGDLGIHIW---ETENCVNT----LR-ALTNKFIEYLSAGLPVITSPLI 375 Query: 264 ALADFIVDNRIGYAV--GSIKEMQEIVDSMTIETYKQI---SENTKIISQKIRTGSYFRD 318 A+ + GY + S+ + ++++S+ + ++++ SEN +++ ++++ Sbjct: 376 EQANIVNRYDCGYILKDNSVDNLVDVLESILAQGHQELAVKSENALAAAKQYFDWHHYKE 435 Query: 319 VLEEVIDDLK 328 VL + + + K Sbjct: 436 VLVQAVLNNK 445 >UniRef50_A1EL38 Epimerase/dehydratase n=3 Tax=Vibrio cholerae RepID=A1EL38_VIBCH Length = 378 Score = 51.1 bits (121), Expect = 6e-05, Method: Composition-based stats. Identities = 37/302 (12%), Positives = 87/302 (28%), Gaps = 40/302 (13%) Query: 39 LWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHD- 97 + I+ + + TF + + + PF + + +IV H Sbjct: 51 CYKVENSWIVGELGIRTFFISNKAIGDITHVHGVWTPFEFFVIKAAKKRNSKIVVSPHGA 110 Query: 98 IDELRGGGGS----------DSVRLATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYL 147 ++ L D++I ++ + L + I I L Sbjct: 111 LEPWAFQSKGLKKRIAWYLYQKRILTEADLIIVNSQKERDNLRRLGLNGPIAVIPNGIIL 170 Query: 148 VSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIYTEGCDFTLFGVNYENKDNPKYLGSFDAQ 207 D + +++++ ++Y L + K + + V + G D Sbjct: 171 DGYDKQSAMKSEREKIILYFSRLDKKKGIELLIKAWR----KVKDKRGYKLHIQGYGDQS 226 Query: 208 SPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLY----------------L 251 ++ +FGL + + ++ + ++ + S Y L Sbjct: 227 YRLFLHDMVSKFGL---EEEILFIDPSYDE-KRWASFFSASFYILPSYSENFGITVAEAL 282 Query: 252 SMELPVFIWDKAALADFIVDNRIGYAV----GSIKEMQEIVDSMTIETYKQISENTKIIS 307 LP + D + IG++V +I E + + + Sbjct: 283 ISGLPAITTTEMPWEDLTNE-GIGWSVKCNESAIAEAITEAIGIDESKLALMRRRAVDYA 341 Query: 308 QK 309 +K Sbjct: 342 EK 343 >UniRef50_C5EKE4 Glycosyl transferase n=1 Tax=Clostridiales bacterium 1_7_47FAA RepID=C5EKE4_9FIRM Length = 391 Score = 51.1 bits (121), Expect = 6e-05, Method: Composition-based stats. Identities = 45/327 (13%), Positives = 97/327 (29%), Gaps = 60/327 (18%) Query: 20 KDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHI 79 D Y+ S + + ++ +++++ + + ++ + ++F FW I Sbjct: 56 NDIEYHL--YKRKSSRHANIISRYLRDTLTNIREAIGILRIKGGN-VLFEDVSYSSFWPI 112 Query: 80 LSFFHRLLKFRIVPLIHDI------------DELRGGGGSD---SVRLATCDMVISHNPQ 124 L + +V ++ D+ ++ + D +I + Sbjct: 113 L--AAKSKGLHVVAMLQDVWPDNAVQSGLIKEKSILYKYFEAWQLYVYRKSDRIICISDD 170 Query: 125 MTKYLSKYMS-QDKIKDIKIFDY---------LVSSDVEHRDVTDKQRGVIYAGNLSRHK 174 M ++ + KI+ I + Y + V+ + +YAGN+ + + Sbjct: 171 MKSFIVDKGVDEKKIEVIYNWGYSDSTVNISWDKNEFVKKYGLVPDVFYAVYAGNIGKMQ 230 Query: 175 CSFIYTEGCDFTLFGVNYENKDNPKYLGSFDAQSPEKINLPGMQFGL------------- 221 + E ++ +L D + E I L + L Sbjct: 231 NVEMIVETAR------KLKDNKKIHFLIIGDGVNREAIELRIRTYQLMNVTMLPMQPSEL 284 Query: 222 ---IWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVF--IWDKAALADFIVDNRI-- 274 I+ V G +K P KT + LS PV + I Sbjct: 285 ATHIYSAAGVNIIPLVPGG-IKTAMPSKTGIVLSCGRPVIFCFGKNTSFEHKFRHRNIPI 343 Query: 275 ---GYAVGSIKEMQEIVDSMTIETYKQ 298 GY V + + + Y Sbjct: 344 RITGYDVNKLANLICELSMQPSSEYNN 370 >UniRef50_D1YZX9 Putative glycosyltransferase n=1 Tax=Methanocella paludicola SANAE RepID=D1YZX9_METPS Length = 374 Score = 51.1 bits (121), Expect = 6e-05, Method: Composition-based stats. Identities = 23/197 (11%), Positives = 60/197 (30%), Gaps = 38/197 (19%) Query: 146 YLVSSDVEHRDVTDKQRG-------VIYAGNLSRHKCSFIYTE------GCDFTLFGVNY 192 Y+ +S + ++ + + YAG + +++ + L G Sbjct: 172 YIYNSPPDRFNLQTSKIKDAADEMVIFYAGAMHKYRGIDHMIKAIENIDNVKLVLVGPGS 231 Query: 193 EN----------KDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFN 242 + + KY+G + D + + K+ Sbjct: 232 DVLPYMEQINRYNNKIKYIGWLPTYEDVLLKTMEA--------DVLFRFNDPRVLKSKYE 283 Query: 243 NPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVG--SIKEMQEIVDSMTIETYKQIS 300 +P+K + P+ + + A + + + G V I+ ++ ++ + + Sbjct: 284 SPNKLFEAMMCGKPIIVNSEIAASRIVKEENCGILVPYGDIEALENLIKMLKND-----P 338 Query: 301 ENTKIISQKIRTGSYFR 317 EN K + R + Sbjct: 339 ENRKKLGDNGRNAYISK 355 >UniRef50_Q47GL0 Colanic acid biosynthesis glycosyl-transferase n=2 Tax=Proteobacteria RepID=Q47GL0_DECAR Length = 396 Score = 50.7 bits (120), Expect = 6e-05, Method: Composition-based stats. Identities = 49/315 (15%), Positives = 101/315 (32%), Gaps = 71/315 (22%) Query: 64 DVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDE--------------LRGGGGSDS 109 D +++ P I++ R R +I DI R S Sbjct: 100 DGVVWYSPT-IFLGPIVNALKREGDCRGYLIIRDIFPEWAADMGLMGRGLPYRFFKAIAS 158 Query: 110 VRLATCDMVISHNP----QMTKYLSKYMSQDKIKDIKIF-----DYLVSSDVEHRDVTDK 160 + + D++ P +L K +++ + + D S V + + Sbjct: 159 YQYSVADVIGVQTPGNLTYFDDWLVKSG--GRLEVLHNWLARTTDTGCSISVANTRLA-G 215 Query: 161 QRGVIYAGN----------LSRHKCSFIYTEGCDFTLFGVNYE--------NKDNPKYLG 202 + +YAGN L + + F G E + Sbjct: 216 RTVFVYAGNMGVAQGVRILLDLAENLKDRS-DIGFLFVGRGSEVPCLIASAEHRGLDNVV 274 Query: 203 SFDAQSPEKINLP--GMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIW 260 FD P++I GL+ + N P K Y+ LPV Sbjct: 275 FFDEIDPDEIPGLYAQCHIGLV----------ALDPRHKTHNIPGKFLSYMLSGLPVLAS 324 Query: 261 DKAA--LADFIVDNRIGYA-----VGSIKEMQ-EIVDSMTIETYKQISENTKIISQKIRT 312 L D I+ ++G V S+ ++ ++VD+++ + ++ E + +++K+ Sbjct: 325 INPGNDLVDLILQEKVGGVCTDSSVASLAQLARDLVDNVSND--REKGERCRRLAEKL-- 380 Query: 313 GSYFRDVLEEVIDDL 327 + + +++ L Sbjct: 381 -FSPQAAVRQIVTAL 394 >UniRef50_B7JGK2 Glycosyl transferase, group 1 family protein n=16 Tax=Bacillus cereus group RepID=B7JGK2_BACC0 Length = 413 Score = 50.7 bits (120), Expect = 6e-05, Method: Composition-based stats. Identities = 50/346 (14%), Positives = 105/346 (30%), Gaps = 46/346 (13%) Query: 23 LDIASDYENISVVNIPLWGGVVQRIIS-----SVKLSTFLCGLENKDVLIFNFPMAKPFW 77 +I D I + +R+I + E D + + P Sbjct: 60 ENIEKDIVRIHPKTRKYTRNLFRRLILYIEVAFRLILAICKDKEKYDYIFVSTPS-IFIP 118 Query: 78 HILSFFHRLLKFRIVPLIHDIDE--------------LRGGGGSDSVRLATCDMVISHNP 123 F R +K +++ + D+ L+ + D +I ++ Sbjct: 119 VAGMFAKRKMKAKLIVDVRDLWPESLIGIGFFNKNWILKFAYKLEYKIYHAADNIIINSK 178 Query: 124 QMTKYLSKYMSQDKIKDIKIFDYLVSSDVE--HRDVTDKQRGVIYAGNLS---------- 171 Y+S + + L ++ + Q VIY GN+ Sbjct: 179 GFYSYISSTGIAPNRISF-MPNSLTEKELSTVPKKNISDQLTVIYTGNIGLAQDIEKLIL 237 Query: 172 RHKCSFIYTEGCDFTLFGVNYENKD---NPKYLGSFDAQSPEKINLPGMQFGLIWDGDSV 228 + Y + F + G Y+ K+ + + + I D D + Sbjct: 238 IAEHLKEY-KNISFKIIGYGYQKKELGESIEAK-QLPNMQLIEPKNREDTLAEIVDAD-I 294 Query: 229 ETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGS--IKEMQE 286 S D K P K Y+SM P+ + I D + G+ + E+ + Sbjct: 295 AYVSLVEKDVFKKVLPGKVMDYMSMRKPIVADVAGYAKEVIEDAQCGFVTEDRTVAELSD 354 Query: 287 IVDSMTIETY--KQISENTKIISQKIRTGSYFRDVLEEVIDDLKTR 330 + + + ++ EN + + ++ +E ++ L+ R Sbjct: 355 YIIKLAQDKQLRNRLGENGYQYAFRTLR---WKTNIETLLKILEER 397 >UniRef50_B5EVI1 Glycosyltransferase n=6 Tax=Vibrionales RepID=B5EVI1_VIBFM Length = 378 Score = 50.7 bits (120), Expect = 7e-05, Method: Composition-based stats. Identities = 29/215 (13%), Positives = 65/215 (30%), Gaps = 41/215 (19%) Query: 108 DSVRLATCDMVISHNPQM-------TKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDK 160 + + D++++ + ++ L + F +D+ + Sbjct: 157 EQKLINKADLILAASKKLLIKFPKGKTQLLTHGVD--------FTLFNQPVPRAKDLPND 208 Query: 161 QRGVI-YAGNLSRHKCSFIYTEG------CDFTLFGVN------YENKDNPKYLGSFDAQ 207 +R + + G+LS + + F G N ++N N LG Sbjct: 209 ERPIAGFYGSLSDWLDYDLLNQVIAENPLWHFVFIGKNELTYNPFKNHPNLHLLG--PKM 266 Query: 208 SPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALAD 267 W + + + ++ NP K YL+ P+ ALA Sbjct: 267 HYHLPRYSQH-----WQANLLPFVD---NEQIRACNPLKLLEYLATGTPIISTSFPALAP 318 Query: 268 FIVDNRIGYAVGSIKEMQEIVDSMTIETYKQISEN 302 +I + V S ++ ++++ N Sbjct: 319 YIAEIH---TVNSTQDFTTHLNNIHSNWLNSSVNN 350 >UniRef50_B8E1B1 Glycosyl transferase group 1 n=1 Tax=Dictyoglomus turgidum DSM 6724 RepID=B8E1B1_DICTD Length = 388 Score = 50.7 bits (120), Expect = 8e-05, Method: Composition-based stats. Identities = 24/204 (11%), Positives = 56/204 (27%), Gaps = 44/204 (21%) Query: 48 ISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVP--------LIHDID 99 I L + D++ + P + IV +H I Sbjct: 75 IYFAFLVEDFFKKQKFDIVHSHHPFVIG--KTALKLAKKYHIPIVFTHHTQYHKYVHYIP 132 Query: 100 ------ELRGGGGSDSVRLATCDMVISHNPQMTKYLSKYMSQDKI------KDIKIFDYL 147 + D+VI+ ++ + + + +I D+ +++ Sbjct: 133 LVPEKISAKIAIKESVKYANQVDLVIAPTKEIKDMIINFGVKTRIEILPTGIDLSMWEEP 192 Query: 148 VSSDVEHRDVTDKQRGVIYAGNLSRHKC----------SFIYTEGCDFTLFGVNYEN--- 194 + + ++R ++YAG L++ K E + G E Sbjct: 193 IQEEFLRNFPWKEKRILLYAGRLAKEKNIEFIFTSLEKLLKKREDIILLVVGDGDERKNL 252 Query: 195 ---------KDNPKYLGSFDAQSP 209 +D ++G + Sbjct: 253 ENLVKRLNLEDKIVFMGWHPREEL 276 >UniRef50_B9ZCT2 Glycosyl transferase group 1 n=1 Tax=Natrialba magadii ATCC 43099 RepID=B9ZCT2_NATMA Length = 404 Score = 50.3 bits (119), Expect = 9e-05, Method: Composition-based stats. Identities = 39/247 (15%), Positives = 83/247 (33%), Gaps = 42/247 (17%) Query: 113 ATCDMVISHNPQMTKYLSKYMSQDKIKDIK-----IFDYLVSSDVEHRDVTDKQRGVIYA 167 + DMV+ + M +YL ++ + + D + + + D++ ++Y Sbjct: 161 NSSDMVLPISDAMKRYLQNNSYTTPMQTLPTGAEVVGDIPTENILREKYGIDEEYVLLYM 220 Query: 168 GNLSRHK---CSFIYTEGCDFTLFGVNYE---------------------NKDNPKYLGS 203 G++S + F + + VN D K+ G Sbjct: 221 GSMSPSRKLEFLFDVLNVLEID-YDVNLVMVGGRLKSNRERLKQAANKKGVSDIVKFTGW 279 Query: 204 FDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKA 263 + + + GL L N P KT Y+S+ PV Sbjct: 280 VSDRVQIQSAIATADVGL---------SPLPTDSVLSTNAPIKTLEYMSLGTPVVASTTP 330 Query: 264 ALADFIVDNRIGYAVG-SIKEMQEIVDSM--TIETYKQISENTKIISQKIRTGSYFRDVL 320 D + +R G AV +K + +D + + E ++ + + + R D++ Sbjct: 331 DQQDVLKTSRAGLAVDYKVKSFVDAIDELLSSEERRNRMGKRGRDYIRNNRNFGVLSDLV 390 Query: 321 EEVIDDL 327 E++ + + Sbjct: 391 EDIYNQV 397 >UniRef50_Q1Q1W3 Similar to capsular polysaccharide biosynthesis glycosyltransferase n=1 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q1W3_9BACT Length = 379 Score = 50.3 bits (119), Expect = 9e-05, Method: Composition-based stats. Identities = 54/330 (16%), Positives = 103/330 (31%), Gaps = 49/330 (14%) Query: 34 VVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVP 93 + N + V + I S + L + N +I + ++ R + Sbjct: 58 IPNKKKFKFVPEIIRSIFYVWLKLKKISNLQNVIV----YNSSTYFWTYIFRKKTIPTIL 113 Query: 94 LIHD----IDELRGGGGS-------DSVRLATCDMVISHNPQMTKYLSKYMSQ--DKIKD 140 +IH I + G D + + D VI + + +Y + K+ Sbjct: 114 IIHGTNMPITSMSVGRKKAFFVACSDRLAIKKADRVILVSQEGLQYYQDKHPKYKSKMVF 173 Query: 141 IKIFDYLVSSDVEHRDVTDKQ------RGVIYAGNLSRHKCSFIYTEGCDFTLFGVNYEN 194 F + R+++ K + Y G L+ K + +F E Sbjct: 174 YPTFSEDSLFYPKDRNISKKDLFLAGKNILTYVGRLNVQKKVSLV-----IRIFSQILEI 228 Query: 195 KDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTS------ 248 K N D E + + GL + F N S Sbjct: 229 KTNSHLCIVGDGPDRETLIDLTKKLGLGKYVTFYGNVCHE--EIPTFFNASDLSFTLSYW 286 Query: 249 --------LYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTI-ETYKQI 299 L+ PV + D A I + + G+ + S E + ++ I + Y Q Sbjct: 287 EGTAQTILESLACGTPVIVSDVADNRQIITNGKDGFVLESDDEEKGAAYAIKIMDNYDQF 346 Query: 300 SENTKIISQKIRTGSYFRDVLEEVIDDLKT 329 S+N +K Y ++ ++I ++K+ Sbjct: 347 SKNALDKGKK----YYASAIVPKIIQEIKS 372 >UniRef50_B6BKD9 Capsular polysaccharide biosynthesis protein Cps4F, putative n=1 Tax=Campylobacterales bacterium GD 1 RepID=B6BKD9_9PROT Length = 401 Score = 50.3 bits (119), Expect = 1e-04, Method: Composition-based stats. Identities = 33/215 (15%), Positives = 68/215 (31%), Gaps = 40/215 (18%) Query: 139 KDIKIF-DYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCS-----------FIYTEGCDFT 186 + + D L S D + +AGN+ + + Y E Sbjct: 196 HYLPNWADDLDMSLESFEFCKDNKVHFTFAGNIGKVQNLENIIKAFSLLPIEYQERTQLN 255 Query: 187 LFGVNYENKD---------NPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGD 237 + G ++ N + G + F ++ D Sbjct: 256 IIGDGSNLEELKQISSSKHNIVFHGKKPREEMAMYYKAS-DFLIVSLVDKPI-------- 306 Query: 238 YLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYA-----VGSIKEMQEIVDSMT 292 P KT Y+S + P+ AD I +N +GY V IK + ++ Sbjct: 307 -FSVTVPAKTQTYISAKKPILAIINGETADIIKENNLGYCAHPNDVNEIKNIFIKSINLG 365 Query: 293 IETYKQISENTKIISQKIRTGSYFRDVLEEVIDDL 327 E ++N + +++ I + + + + +++L Sbjct: 366 EEEKLAFTKNCEYLTENI----FNKTKIIDSLEEL 396 >UniRef50_B1CBD7 Putative uncharacterized protein n=1 Tax=Anaerofustis stercorihominis DSM 17244 RepID=B1CBD7_9FIRM Length = 374 Score = 50.0 bits (118), Expect = 1e-04, Method: Composition-based stats. Identities = 50/320 (15%), Positives = 99/320 (30%), Gaps = 55/320 (17%) Query: 27 SDYENISVVNIPLWGG---VVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFF 83 YE+ +V + + + R++ + K N D+ F+ P P+ L Sbjct: 43 KSYESKNVHIVGIGTNNRGRISRMLFTAKDIYKKALEINADIYHFHDPELLPYGLKLKKK 102 Query: 84 HRLLKFRIVPLIHD-----------IDEL------RGGGGSDSVRLATCDMVISHNPQMT 126 + ++ H+ I + + +S D +I P M Sbjct: 103 GKK----VIFDSHENYSEQIKEKYYIPKFLRGFISKIFKSYESKVSKKIDGLIFPCP-MF 157 Query: 127 KYLSKYMSQDKIKDIKIFDYLVS--SDVEHRDVTDKQRGVIYAGNLSRHKCSFI-----Y 179 + I L + E + + V YAG L+ ++ Y Sbjct: 158 GKHPFEGRSKRCVYINNTPILEELYNKYEESEKDYSKPVVCYAGGLNYNRGITHLINACY 217 Query: 180 TEGCDFTL--------FG---VNYENKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSV 228 G L +G E + + G + + +N+ G +V Sbjct: 218 KSGAKLILGGNFVPASYGEELKKTEEYECVDFRGYLN--RDDILNMYKES----TIGANV 271 Query: 229 ETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAV--GSIKEMQE 286 G + + N K ++SM LPV D + I G V +I E++ Sbjct: 272 LLNVGQYA--VLSNLSTKIYEFMSMGLPVISNDYPYAREVIEKYNFGIVVNSDNIDEIEN 329 Query: 287 IVDSMTIE--TYKQISENTK 304 + ++ +++ N + Sbjct: 330 AIKYLSENPKEAEEMGRNGR 349 >UniRef50_B2UYP0 WblI protein n=6 Tax=Clostridium RepID=B2UYP0_CLOBA Length = 408 Score = 50.0 bits (118), Expect = 1e-04, Method: Composition-based stats. Identities = 40/305 (13%), Positives = 90/305 (29%), Gaps = 45/305 (14%) Query: 40 WGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDID 99 + + ++++ + K + + + ++ IHD+ Sbjct: 81 RIKNISTFMYKLRMNYKKIADKYKPDAVIASSTYPLDIYPAHRIAKRCDAKLCFEIHDLW 140 Query: 100 EL--RGGGGSDSV-------------RLATCDMVISHNPQMTKYLSKYMS-QDKIKDI-- 141 L GG D+++S P K++ + DK + Sbjct: 141 PLSPMEIGGFSEKNPAIVVLQRAEDFAYKNSDVIVSILPNADKHIRERGFSTDKYVYVPN 200 Query: 142 KIFDYLVSSDVEHRDVTDKQ-------RGVIYAGNLSRHKCS--------FIYTEGCDFT 186 I + + + + V Y GN S E + Sbjct: 201 GIIPGEKKNPPTEKTIEKLKELKNQGYFLVGYTGNHSPANVLDTMIDAAKKTKDEKVKYV 260 Query: 187 LFGVNYENKDNPKYLGSFDAQSPEKI-NLPGMQFGLIWDGDSVETCSGAFGDYLKFN--- 242 L G + Y + D Q+ E + + + ++ C + FN Sbjct: 261 LVGKGNVKDELINYAKTNDVQNVEFLDPVLKDNMDNV--LQLLDICYISLKKQNLFNYGV 318 Query: 243 NPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAV-----GSIKEMQEIVDSMTIETYK 297 +P+K Y+ PV +A+ + D+ G V ++ E + ++ + Sbjct: 319 SPNKLFDYMMAARPVIYAIEASNDP-VKDSNCGITVPAENPDAVVEAVLKIKELSDDEKN 377 Query: 298 QISEN 302 ++ +N Sbjct: 378 KMGQN 382 >UniRef50_Q2FQD1 Glycosyl transferase, group 1 n=1 Tax=Methanospirillum hungatei JF-1 RepID=Q2FQD1_METHJ Length = 369 Score = 50.0 bits (118), Expect = 1e-04, Method: Composition-based stats. Identities = 30/278 (10%), Positives = 75/278 (26%), Gaps = 38/278 (13%) Query: 81 SFFHRLLKFRIVPLIHDIDELRGGGGSDSV---------RLATCDMVISHNPQMTKYLSK 131 + L I+ I D + + D +I + + + Sbjct: 102 LIISKFLSRPIIYDIFDFYSDQIAFSPTIRKIVESIDCYFMKHVDEIILVDSSRIHQIKQ 161 Query: 132 YMSQDKIKDIKIFDYLVSS----DVEHRDVTDKQRGVIYAGNLSRHKCSFIYTE------ 181 ++ Y D + + +AG LS + + Sbjct: 162 GNYKNVSII-----YNSPPKELADKLRTSNQSENFKIFFAGGLSLDRDISSIIKACGDIT 216 Query: 182 GCDFTLFGVNYENKDNPKYL-----GSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFG 236 F + G + F + + + D + Sbjct: 217 DIFFEIAGYGPRVAELLNICKTNSRVQFLGEINYDTVILKS-----FQADLLFAFYDPKV 271 Query: 237 DYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVG--SIKEMQEIVDSMTIE 294 + +P+K + P+ + +A + + G V I ++E V + + Sbjct: 272 PNNFYASPNKLFEAMMCGKPILVNSGTTMAKIVSEENCGLVVPYGDIPSIREAVLKIKND 331 Query: 295 TY--KQISENTKIISQKIRTGSYFRDVLEEVIDDLKTR 330 + +++ +N K ++ + L ++ + TR Sbjct: 332 KHFQEKLGKNGKRAFERTFNWNIMEKRLIKIYQRILTR 369 >UniRef50_A6L9I7 Glycosyltransferase family 4 n=5 Tax=Bacteroidales RepID=A6L9I7_PARD8 Length = 401 Score = 50.0 bits (118), Expect = 1e-04, Method: Composition-based stats. Identities = 33/286 (11%), Positives = 71/286 (24%), Gaps = 55/286 (19%) Query: 37 IPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIH 96 I L + + + L D + + + I Sbjct: 78 IRLALNYLSFVFFAS--LYVLTHRIETDSIFCFGTSPVFQMYPALLLKKKTGVNASLWIQ 135 Query: 97 DI--DELR----GGGGSDSVRLAT--------CDMVISHNP-QMTKYLSKYMSQDKIKDI 141 D+ + + G L+ D++ +P LSK +DK+ + Sbjct: 136 DLWPESVAAASGLKSGFVMNLLSKLVTGIYCRTDILFVQSPAFFESVLSKGNFKDKLIYV 195 Query: 142 KIF--DYLVSS---DVEHRDVTDKQRGVIYAGNLSRHK-CSFIYT--------EGCDFTL 187 + D + ++ + V++AGN+ + I + + Sbjct: 196 PNWAEDVFMKEISDSNKYESLMPNGFKVMFAGNIGAAQDFGSIIQAAVLTRHLPDIKWII 255 Query: 188 FGVNYENKD------------NPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAF 235 G D +LG + + L+ Sbjct: 256 VGDGRMKSDIEQKVQSLKLNDTVFFLGRYPVEEMPDFFSL-ADVMLV----------SLK 304 Query: 236 GDYL-KFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGS 280 +Y+ P K Y++ P+ + + G S Sbjct: 305 EEYIFSLTIPSKVQAYMASAKPMVTMLSGMGNKIVEEANCGLTTNS 350 >UniRef50_C6PFB8 Glycosyl transferase group 1 n=2 Tax=Bacteria RepID=C6PFB8_CLOTS Length = 374 Score = 50.0 bits (118), Expect = 1e-04, Method: Composition-based stats. Identities = 37/312 (11%), Positives = 80/312 (25%), Gaps = 49/312 (15%) Query: 28 DYENISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLL 87 + + W + + + + D +I P L Sbjct: 64 GSGIRNFPYLVKWNIYLLIWLFKNHNTYDYIHACDFDTII---PTIIT--------KYLF 112 Query: 88 KFRIVPLIHDIDELRGGG----------GSDSVRLATCDMVISHNPQMTKYLSKYMSQDK 137 K ++V I D D + D VI + K + + Sbjct: 113 KKKVVYDIFDFYSDMLRKVPSAIKKLIKKVDFFCINRVDAVIIADECRKKQIKGSNPKRL 172 Query: 138 IKDIKIFDYLVSSDVEHRDVTDKQR------GVIYAGNLSRHKCSFIY------TEGCDF 185 I Y D+ + D + + + Y G L + Sbjct: 173 IVI-----YNTPEDINNYDNCFEYKEYGAQLRIAYVGLLQIERGLIEMIKVVQRHPNWML 227 Query: 186 TLFGVNYENKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYL---KFN 242 L G + + + +F L +S + + + +F+ Sbjct: 228 YLAGFGGDENEILSHCSNFSNIKFY--GRVNYDIALK-INNSSDVMFATYDPSIPNHRFS 284 Query: 243 NPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVG-----SIKEMQEIVDSMTIETYK 297 + +K + + P+ + + + + R+G V ++ + M + Sbjct: 285 SANKLFEAMMLGKPIIVAKNTGMDELVEKYRLGEIVEYGDMYDLENALKNFSQMNLYDRD 344 Query: 298 QISENTKIISQK 309 S K I K Sbjct: 345 SFSIRVKEIYNK 356 >UniRef50_C2WF92 Glycosyl transferase group 1 n=2 Tax=Bacillus cereus group RepID=C2WF92_BACCE Length = 384 Score = 49.6 bits (117), Expect = 1e-04, Method: Composition-based stats. Identities = 38/285 (13%), Positives = 86/285 (30%), Gaps = 49/285 (17%) Query: 61 ENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDELR------GGGGSDSVRLAT 114 + D+ N P I + +++ H++ R G + + Sbjct: 95 KKYDIYHSNDLNTLPQGFICAKILGRK--KLIYDSHEVQTSRTGYNSSIYGIMEKFFIKF 152 Query: 115 CDMVISHNPQMTKYLSK-YMSQDKIKDIKIFDYLVSSDVEHR-------DVTDKQRGVIY 166 CD++I N KY Y K+ + ++ ++ +++ + ++Y Sbjct: 153 CDVMIMENHTRAKYTEDLYGFYPKVI--HNYPFVSRPELSKSIDLHGMLNISRDEPILLY 210 Query: 167 AGNLSRHKCSFIYTEGCDFTLFGV------------------NYENKDNPKYLGSFDAQS 208 G + + + GV + E +D ++L Q Sbjct: 211 QGGIQIGRGLDKLVQAVPLFKRGVVVFIGDGRIKPELQQMVQDMELEDRVRFLPKVPVQ- 269 Query: 209 PEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADF 268 + I+ + + F Y + +K Y+ +PV + Sbjct: 270 -DLIHYTKNAY-----LGFQVLNNVCFNHYS--ASSNKLFEYMMSGVPVIACQFPEIQGV 321 Query: 269 IVDNRIGYAVGSIK--EMQEIVDSMTI--ETYKQISENTKIISQK 309 + IG V S + + V+ + E +++ N K Sbjct: 322 VEKENIGVCVDSHDPASIADGVNYLLDHPEEREKMKVNCLQSRNK 366 >UniRef50_A8RE04 Putative uncharacterized protein n=1 Tax=Eubacterium dolichum DSM 3991 RepID=A8RE04_9FIRM Length = 745 Score = 49.6 bits (117), Expect = 1e-04, Method: Composition-based stats. Identities = 42/336 (12%), Positives = 100/336 (29%), Gaps = 68/336 (20%) Query: 16 FKARKDALDIASDYENISVVNIPLWGG----VVQRIISSVKLSTFLCGLENKDVLIFNFP 71 KA+ + + + P + + + + + D LI+ Sbjct: 41 NKAKI----VDKRAGTTYLKSRPYYKNISLSRIFSHFLFARSAYHYACKKQPD-LIYAVI 95 Query: 72 MAKPFWHILSFFHRLLKFRIVPLIHDI--DELRGGG-------------GSDSVRLATCD 116 L + + K R++ I+D+ + + L D Sbjct: 96 PPNTLGKYLKKYKKKNKVRLIFDIYDLWPESFVNQKFAGIARPFFLKWASFRNQSLNAAD 155 Query: 117 MVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRG---VIYAGNLSRH 173 VI+ + L++++ + K I YL + + K + Y G+++ Sbjct: 156 FVITECDLYREILAEFLDPEHTKTI----YLTKGSEFEQPMISKPMETLHICYLGSINNI 211 Query: 174 ----------KCSFIYTEGCDFTLFGVNYENKDNPK----------YLG-SFDAQSPEKI 212 K Y + G D + Y G F + + Sbjct: 212 ISIDMIVRFLKTLQNYRP-IVVDIIGKGETKDDFIRKLKAQGIETIYHGALFGEDKWKIM 270 Query: 213 NLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDN 272 N FG+ + ++ K+ Y LP+ K ++ + Sbjct: 271 N--QCHFGI-----------NMMINTVRVGLTMKSVDYFEAGLPILNNIKGDTWTYVDNF 317 Query: 273 RIGYAVG--SIKEMQEIVDSMTIETYKQISENTKII 306 +G+ V +I+++ + + ++++ N + + Sbjct: 318 NLGFNVDEKNIEDVARKLAKLDERQFEEMQRNVRDV 353 >UniRef50_C0C2C9 Putative uncharacterized protein n=1 Tax=Clostridium hylemonae DSM 15053 RepID=C0C2C9_9CLOT Length = 375 Score = 49.6 bits (117), Expect = 1e-04, Method: Composition-based stats. Identities = 30/262 (11%), Positives = 74/262 (28%), Gaps = 29/262 (11%) Query: 69 NFPMAKPFWHILSFFHRLLKFRIVPLIHDID-----------------ELRGGGGSDSVR 111 + P+ F + +I DI + Sbjct: 87 HLPLYPFLDFGFFKFCKKHNIKIALFYRDIHWKFGQYKAKVPLYQRMVTIPMYRYDLFQY 146 Query: 112 LATCDMVISHNPQMTKYLSKYMSQDKIKDIKIF--DYLVSSDVEHRDVTDKQRGVIYAG- 168 D++ M KY++ + K+ + +V E+ + +D + Y G Sbjct: 147 QKYLDIIYLPTLNMGKYVNGLGNVSKVDTLPPGADKRIVDDKSENYNTSDSTINIFYVGG 206 Query: 169 -----NLSRHKCSFIYTEGCDFTLFGVNYENKDNPKYLGSFDAQSPEKINLPGMQFGLIW 223 N + L E + + + ++ G + + Sbjct: 207 VLGIYNFEKLLKIAKQKSYVRLKLCCRENEWNMAKHKYSDYLTERVDIVHKHGKELEQYY 266 Query: 224 DGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVG---- 279 + +C Y++ P K Y+S ++P+ A +F+ G+++ Sbjct: 267 AWADLCSCYFEPSLYMEMAVPIKLLEYVSFQVPIIATKGTAAGNFVEKYGCGFSIPYDEG 326 Query: 280 SIKEMQEIVDSMTIETYKQISE 301 ++ + E + + + Sbjct: 327 KLENVLETIHEDKRLLLNKYKQ 348 >UniRef50_A3SJ03 Probable glycosyltransferase n=1 Tax=Roseovarius nubinhibens ISM RepID=A3SJ03_9RHOB Length = 281 Score = 49.6 bits (117), Expect = 1e-04, Method: Composition-based stats. Identities = 29/244 (11%), Positives = 68/244 (27%), Gaps = 49/244 (20%) Query: 88 KFRIVPLIHDIDELRGGGGSDSVRLATCDMVISHNPQMTKY---------------LSKY 132 +++ +H+ + ++ + ++ + K Sbjct: 3 GIKVIYDVHEDYP--EAVSENYRLPKVARKLLPPIVRFVEWFSSPFFSSIVTVTPQIQKR 60 Query: 133 MSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYA--GNLSRHKCSFIYTEGCDFTLFGV 190 K ++ + LV E D + R + +A G ++R++ G Sbjct: 61 FPSKKTILVRNWP-LVEEFHEPVDTPMRDRPMEFAYIGTITRNRNILGM-----IDAVGS 114 Query: 191 NYENKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFG-------------- 236 E + G F ++ + L WD + G Sbjct: 115 LRETGATLRLAGDFTIEADRHVALTRTG----WDRVKFDGWVSREGVADILANARAGLVV 170 Query: 237 ----DYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVG--SIKEMQEIVDS 290 ++ P K Y++ +PV D + + + G V + +E+ + Sbjct: 171 LRPVEHEMLTLPIKLFEYMAAGVPVISSDFPLWREIVEEVGCGLLVDPENPEEIAAAMRW 230 Query: 291 MTIE 294 M Sbjct: 231 MIEN 234 >UniRef50_B7AUJ6 Putative uncharacterized protein n=1 Tax=Bacteroides pectinophilus ATCC 43243 RepID=B7AUJ6_9BACE Length = 390 Score = 49.6 bits (117), Expect = 1e-04, Method: Composition-based stats. Identities = 56/325 (17%), Positives = 109/325 (33%), Gaps = 56/325 (17%) Query: 19 RKDALDIASDYENISVVNIPLWGGVVQRIISSVKLST----FLCGLENKDVLIFNF-PMA 73 R + SD +I + + V + L + ++ ++ DV+ P Sbjct: 36 RHRVETMFSDNVHIHRMPLFQEKTKVIQRTLRFLLFSLECLWIGLTKDADVVFCGSGPPT 95 Query: 74 KPFWHILSFFHRLLKFRIVPLIHDI--DEL-------------RGGGGSDSVRLATCDMV 118 + IL H+L ++V + D + L G + D + Sbjct: 96 QGV--ILGLIHKLTHKKVVYNLQDAFPESLVTTGITSEGSAVYNVGLKMERFTYNNVDRI 153 Query: 119 ISHNPQM-TKYLSKYMSQDKIKDIKIF--DY-----LVSSDVEHR-DVTDKQRGVIYAGN 169 I+ + M +SK + DK+ + + D +D+ R D++ + V YAGN Sbjct: 154 ITISESMFGNMISKVDNPDKVSMVYNWLDDTVHHVERQDNDLFERFDLSRDRFIVTYAGN 213 Query: 170 LSRHKCSFIYTE---------GCDFTLFGVNYENKDNPKYLGSFDAQSPEKINLPGMQ-- 218 + + + E +F +FG +D Y + + L G Sbjct: 214 VGKAQGIETLIEAADILKSDRDIEFCIFGAGASLEDIKAYAAGKGLDNVRFLPLLGKDDI 273 Query: 219 ---FGLIWDGD-SVETCSGAFGDYLKFNNPHKTSLYLSMELPVFI--WDKAALADFIVDN 272 + L GD S+ C G P KT ++ + + + + L + + Sbjct: 274 SKVYSL---GDVSLVMCRKGVG---TSGMPSKTWSIIAAQTALIVSFDAGSELYNMVSSG 327 Query: 273 RIGYAVG--SIKEMQEIVDSMTIET 295 G AV E+ + + M + Sbjct: 328 NCGIAVDAERPAELADAILKMKSDK 352 >UniRef50_Q0AZ01 Capsular polysaccharide biosynthesis protein Cps4F n=1 Tax=Syntrophomonas wolfei subsp. wolfei str. Goettingen RepID=Q0AZ01_SYNWW Length = 397 Score = 49.6 bits (117), Expect = 2e-04, Method: Composition-based stats. Identities = 24/237 (10%), Positives = 67/237 (28%), Gaps = 36/237 (15%) Query: 111 RLATCDMVISHNPQMTKYLSK-YMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGN 169 D + + KY + I+ + ++ + D++ +++AGN Sbjct: 163 IYKRADKLAISSKLFQKYFEEVIGIDSDIQYLPVYAESLFEDIQSECHESGIINLVFAGN 222 Query: 170 LSRHKCSFIYTE---------GCDFTLFGVNYENKD-----------NPKYLGSFDAQSP 209 + + + ++ + G + + N + G + Sbjct: 223 IGEMQSVETIIKAANELKDFDKINWHIVGDGSDRINCEELAIEFGLSNVIFYGQRPIEDM 282 Query: 210 EKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFI 269 F L+ + + + P+K Y++ P+ I Sbjct: 283 PDFYSMADAF-LV---------TLKANKEISYTLPNKVQSYMAAGKPIIGAIDGETRLVI 332 Query: 270 VDNRIGYAVGSIKEMQEIV----DSMTIETYKQISENTKI-ISQKIRTGSYFRDVLE 321 + G + + + + E + + EN + + + R ++E Sbjct: 333 EEAGCGLCCEAEDYIALVELVRDFAFNTEKHNVMGENARKYYQEHFDKKIFMRSLIE 389 >UniRef50_Q0W4G3 Glycosyltransferase (Group 1) n=1 Tax=uncultured methanogenic archaeon RC-I RepID=Q0W4G3_UNCMA Length = 352 Score = 49.2 bits (116), Expect = 2e-04, Method: Composition-based stats. Identities = 30/202 (14%), Positives = 69/202 (34%), Gaps = 24/202 (11%) Query: 31 NISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLKFR 90 ++ + + + + +++ + I +F +L F K+R Sbjct: 42 YSNITIHEISWKKTELVANVLEIRKIVKSFNPD---IIHFTSFHFILILLVPF--FKKYR 96 Query: 91 IVPLIHDIDELRGGGGSDSVR-----LATCDMVISHNPQMTKYLSKYMS-QDKIKDIKIF 144 IV HD+D +G L D++I+H + L + + KI + Sbjct: 97 IVVTAHDVDAHQGTDNFFYKFVLDQYLKLGDLLITHGKNLKDRLVEKGFDESKIFILPHG 156 Query: 145 DY--LVSSDVEHRDVTDKQRGVIYAGNLSRHKCSF----------IYTEGCDFTLFGVNY 192 DY ++ VE + + +++ G + ++K + G Sbjct: 157 DYSFFLNYSVEKNSSVENRDTLLFFGRILKYKGLNYLLESLKLVIQEHPDVKLIVAGKGN 216 Query: 193 ENKDNPKYLGSFDAQSPEKINL 214 ++ + SF A++ + N Sbjct: 217 MDEYR-DLVQSFKAENLDIHNY 237 >UniRef50_B0VF96 Putative uncharacterized protein n=1 Tax=Candidatus Cloacamonas acidaminovorans RepID=B0VF96_9BACT Length = 437 Score = 49.2 bits (116), Expect = 2e-04, Method: Composition-based stats. Identities = 37/303 (12%), Positives = 88/303 (29%), Gaps = 58/303 (19%) Query: 36 NIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLI 95 N+ + G + + ++ + N + ++ + P + + + + ++ Sbjct: 107 NVFIPDGEILWLPFAIHKLKKIMATNNINQVLVSVPPYSL-IFLAKYLKKHYQTKVCLDF 165 Query: 96 HDIDELRGGGGS--------------DSVRLATCDMVISHNPQMTKYLSK---YMSQDKI 138 D G + + D VI + M + Y+ ++K Sbjct: 166 RDPWSFGIGRKYLKPPDWVTAIENRWEKEIVTRADQVICVSSVMIEEFRHLYPYLDKNKF 225 Query: 139 KDIKIFDYLVSSDVEHRDVTDKQRGVIYAGN--------------LSRHKCSFIYTEGCD 184 I V + + + +IY G+ L + + Sbjct: 226 VCITNGYDEKDFPVSLPPIRNAKFTIIYTGSFYDELQPDILWQAILELIQEGCLNPRKIA 285 Query: 185 FTLFGVNYEN------------KDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCS 232 ++G N+ N + G + ++ +I D + Sbjct: 286 VEIYGRNFRNFVLGKYITDPILNQIVHFHGYINHRNSIQILRA---------ADVLLLYL 336 Query: 233 GAFGDYLKFNNPHKTSLYLSMELPV--FIWDKAALADFIVDNRIGYAVG--SIKEMQEIV 288 G+ G+ + K YL P+ I A AD + + SI ++E + Sbjct: 337 GS-GEAQRAIVTAKVFEYLRSGKPILAIIDSSGAAADILKPANTAFIADSSSIHSIKETL 395 Query: 289 DSM 291 ++ Sbjct: 396 GNL 398 >UniRef50_Q5LH96 Possible capsular polysaccharide related protein n=1 Tax=Bacteroides fragilis NCTC 9343 RepID=Q5LH96_BACFN Length = 365 Score = 48.8 bits (115), Expect = 2e-04, Method: Composition-based stats. Identities = 43/327 (13%), Positives = 90/327 (27%), Gaps = 42/327 (12%) Query: 32 ISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFF-HRLLKFR 90 + +P G V + + + L + M L+ + Sbjct: 52 TDLGLVPNGKGYVGKFFRATSKLKKIFNLYRNENC-----MYFACSFDLALITLLFSNKK 106 Query: 91 IVPLIHDIDE-------LR-GGGGSDSVRLATCDMVISHNPQMTKYLSKYMSQDKIKDIK 142 + I D+ +R D + + + + KYL +K Sbjct: 107 YIYQISDLVYGYFPTTLIRSFFKTIDKLIIKKSITTVVTSEGFIKYLGSKNIANKFIYQP 166 Query: 143 ---IFDYLVSSDVEHRDVTDKQRGVIYAG------NLSRHKCSFIYTEGCDFTLFGVNYE 193 + L + T+ + G L K Y F +G Sbjct: 167 NNLPKEVLGKNIEISECPTNDHFIFSFIGFVRADSVLLFAKTVGEYFPNYSFHFYGKAMN 226 Query: 194 ---------NKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNP 244 N Y G + + N+ D V +C ++ P Sbjct: 227 MTVIDNLIKEYPNIHYFGPYK-YPDDLQNIYEK-------VDLVVSCYDVKSLNVRLAEP 278 Query: 245 HKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVG--SIKEMQEIVDSMTIETYKQISEN 302 +K + P+ + LA+ + +G+ V ++ ++E + +T K Sbjct: 279 NKLFESIYYGKPIIVSSHTFLAEKVNSMGVGFGVDASDMQTIKEFISGLTSTEIKNRINK 338 Query: 303 TKIISQKIRTGSYFRDVLEEVIDDLKT 329 K I + +L+++ + K Sbjct: 339 IKSIPKATMIDDGAIQLLKKIETNFKN 365 >UniRef50_A5FN98 Candidate alpha-glycosyltransferase; Glycosyltransferase family 4 n=1 Tax=Flavobacterium johnsoniae UW101 RepID=A5FN98_FLAJ1 Length = 365 Score = 48.8 bits (115), Expect = 3e-04, Method: Composition-based stats. Identities = 48/298 (16%), Positives = 90/298 (30%), Gaps = 33/298 (11%) Query: 35 VNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPL 94 V+ + +V D+L + + +L +I + Sbjct: 52 VDYLFLNKKNKIDFKAVFKLRKFLKNNKVDILHAH-----SSSFFTAVLVKLTLIKIKII 106 Query: 95 IHDIDE----LRGGGGSDSVRLATC-DMVISHNPQMTKYLSKYMSQDKIKDIKIF--DYL 147 HD LR + VIS N + + Y+ +K F D+ Sbjct: 107 WHDHYGINQDLRLRKSLILKYSSLFFKGVISVNTALKDWAVSYLLCSNVKYFPNFIEDFY 166 Query: 148 VSSDVEHRDVTDKQRGVIYAGNLSRHKC----------SFIYTEGCDFTLFGVNYENKDN 197 S+ + +R + A NL K F LFG ++++ + Sbjct: 167 ASNQKIALAGEEGKRIICVA-NLRPQKNHLFLLDTANLIKDKFPDWSFHLFGKDFKDSYS 225 Query: 198 PKYLGSFDAQSPEKINLPGMQFGLIWDGDSVET-CSGAFGDYLKFNNPHKTSLYLSMELP 256 ++ FD ++ +G I D +V T C A L P Y +LP Sbjct: 226 AEF---FDKIQDLQLCETVFFYGSIDDVSNVLTQCDVAVLPSLSEGLPLAVLEYGLHKLP 282 Query: 257 VFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTI-----ETYKQISENTKIISQK 309 V + + I G + + + ++T + ++ +N Q Sbjct: 283 VIATNVGEIKKIITSENNGVIIEA-NNTYQFTQALTDLIIQKDKRVKMGKNLNEFIQL 339 >UniRef50_B5YA59 Glycosyl transferase, group 1 family protein n=1 Tax=Dictyoglomus thermophilum H-6-12 RepID=B5YA59_DICT6 Length = 399 Score = 48.8 bits (115), Expect = 3e-04, Method: Composition-based stats. Identities = 34/246 (13%), Positives = 79/246 (32%), Gaps = 34/246 (13%) Query: 113 ATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRG-------VI 165 D+VI+ + ++ + L ++ + I+ + L ++ K ++ Sbjct: 156 NHSDLVIAPSTKIKRLLKEFGVKKPIEVLPNGIDLDKFRKIPKNEARKDLNLPLNAVLLL 215 Query: 166 YAGNLSRHKCSFIYTEGCDFTLFGVNYENKDNPKYLGSFD--AQSPEKINLPGMQFGLIW 223 + G L + K E + N D YL L L Sbjct: 216 FVGRLGKEKNIEFLIEVLEII-----KNNTDKLIYLVIVGDNPDKRVMEELKNKAKALNV 270 Query: 224 DGDSVETCSGAFGDYLKFNNPHKTSLY--------------LSMELPVFIWDKAALADFI 269 ++ T + +K ++ ++ LPV + A++DF+ Sbjct: 271 YDRTIFTGYLDYDKVIKAYYASDIFVFSSITETQGLVILEAMASGLPVVAIEDDAISDFV 330 Query: 270 VDNRIGYAVGSIKEMQEIVDS------MTIETYKQISENTKIISQKIRTGSYFRDVLEEV 323 + G+ + + +E ++I + Y+++S N S+ + +L Sbjct: 331 KNGINGFLIPNNQEAKKIFSEKIITLIENRDLYEKMSINALDNSKLFHIKILNKKLLSLY 390 Query: 324 IDDLKT 329 +K Sbjct: 391 EYLIKE 396 >UniRef50_A8UK17 Putative uncharacterized protein n=1 Tax=Flavobacteriales bacterium ALC-1 RepID=A8UK17_9FLAO Length = 388 Score = 48.4 bits (114), Expect = 3e-04, Method: Composition-based stats. Identities = 43/337 (12%), Positives = 100/337 (29%), Gaps = 52/337 (15%) Query: 24 DIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPM-AKPFWHILSF 82 + + S N+ + + S L + +D +I P ++ Sbjct: 69 KVLNVLGYYS--NVSPRRVISNFLFSIQLFFILLFKVSKQDKIIL--PSRPVELIFFIAM 124 Query: 83 FHRLLKFRIVPLIHD-------IDELRGGGGSDS-------VRLATCDMVISHNPQMTKY 128 L +I I D I+ R + L + P + Sbjct: 125 LKLLRGVKIYLDIQDIWPDALEIENKRKKRIFEIYCNLYLKPSLKHYTGTLHVAPSFKLW 184 Query: 129 LSKYMSQDK--IKDIKIFDYL-VSSDVEHRDVTDKQRGVIYAGN---LSRHKCSFIYTEG 182 L +Y + + + ++ ++ + V A + + Sbjct: 185 LRRYAKKTPSSFVSLGWENERWSDVVLKEYKESNVVKMVCVAQLQHQIDVMPILEVLRNN 244 Query: 183 CDFTLF-----GVNYENKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGD 237 L G + Y+ + + + E + D E Sbjct: 245 KKLHLTILGEDGTGERYNEVINYINTHNITNIEILGKI----------DRQEMVKHLMDK 294 Query: 238 YLKF------NNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVG-SIKEMQEIVDS 290 L + P+K Y++ LP+ + ++F+V+N IG+ + +++ ++ S Sbjct: 295 DLGVLPMITSSLPNKIFDYMAAMLPIIVLGDNDSSNFVVENDIGWQCNFNSEDLDVLLQS 354 Query: 291 MTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDDL 327 + + + + I ++ R VL + I D+ Sbjct: 355 LKAKDIQSKKKQVVSIRD-----NFSRQVLHKKIKDI 386 >UniRef50_Q9YCS3 Glycosyl transferase, group 1 n=1 Tax=Aeropyrum pernix RepID=Q9YCS3_AERPE Length = 383 Score = 48.4 bits (114), Expect = 3e-04, Method: Composition-based stats. Identities = 40/286 (13%), Positives = 91/286 (31%), Gaps = 40/286 (13%) Query: 23 LDIASDYENISVVNIPLWGGVV--QRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHIL 80 I + +N + + + I ++ + + ++ D+ I P+ +L Sbjct: 60 EKIFERSFHFPSINTRIPSLEIAAKIIEYTLSIISITAEAKHYDIAIAQDPITATIAIML 119 Query: 81 SFFHRLLKFRIVPLIHDIDE---LRGGGGSDSVRLATCDMVISHNPQMTKYLSKYMSQDK 137 + +++ H+ + D D+V + ++ + K ++ Sbjct: 120 K--QKNYISKVILQSHNFTSPTRSKLYKFLDLYTTTHSDIVWCLSNRLAEIRRKLGAKYT 177 Query: 138 I-KDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIYTE----------GCDFT 186 + I I D ++ + + K ++Y G+LS+ K I E Sbjct: 178 VQTPICIRDDVIDKTLNY--TRRKSNDIVYIGSLSKDKGVDILLELVKTFTKNGNDTIIH 235 Query: 187 LFGVNY----------ENKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFG 236 + G E Y G G++ S E+ + Sbjct: 236 IVGKGLLYEKIFERIGEINKRVIYYGP-QPLKRALQIASRASLGVVLTRPSYESLTTD-- 292 Query: 237 DYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIK 282 P K +YL+ PV + + ++ ++ + G VG +K Sbjct: 293 -------PMKPKVYLAAHTPVILPEYFEISSYVNRFKAGMVVGKLK 331 >UniRef50_A6GZ44 Probable L-fucosamine transferase n=1 Tax=Flavobacterium psychrophilum JIP02/86 RepID=A6GZ44_FLAPJ Length = 400 Score = 48.4 bits (114), Expect = 3e-04, Method: Composition-based stats. Identities = 48/347 (13%), Positives = 107/347 (30%), Gaps = 47/347 (13%) Query: 3 FLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLE- 61 + ++ A D + I + G+ I L L Sbjct: 36 IVTPSERRKKIATNLVVNDKVTILQVKTFNIQKTNKIEKGIGTLAIEYQYLHAIKKHLSG 95 Query: 62 -NKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDE---------------LRGGG 105 D+++++ P F+ +++F + L+ DI + Sbjct: 96 YKFDLVLYSTPP-ITFYKVINFIKKRDNAYAYLLLKDIFPQNAVDMKMIKQGGFLHKMFV 154 Query: 106 GSDSVRLATCDMVISHNPQMTKYLSKYMSQDK---------IKDIKIFDYLVSSDVE--- 153 + D + + ++ K+ + + K DY + Sbjct: 155 KKEKKLYKISDTIGCMSQANVDFVLKHNPEIPRDKIEVNPNSIEPKFIDYSAIEKKQIKV 214 Query: 154 HRDVTDKQRGVIYAGNLSRHKCS--------FIYTEGCDFTLFGVNYENKDNPKYLGSFD 205 ++ ++ ++Y GNL + + + F + G E ++ + Sbjct: 215 KYNLPLDRKILVYGGNLGKPQGLEFLLETIAKVAVSDVFFLIVGDGTEYAKINQWFSNNK 274 Query: 206 AQSPEKINLP-GMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFI--WDK 262 Q+ I + L+ + ++ N P + YL M++PV Sbjct: 275 PQNANLIKFLPKNDYDLLLAACDIGMIFLDK-NFTIPNFPSRLLSYLEMKIPVIAATDIN 333 Query: 263 AALADFIVDNRIGYAVGS--IKEM-QEIVDSMTIETYKQISENTKII 306 + I D G +V + I EM Q I+ ++ +SEN++ + Sbjct: 334 TDIGKIITDANCGASVTAGEIGEMRQAILKCLSNIE--SMSENSRRL 378 >UniRef50_Q2LPR2 Glycosyltransferase n=1 Tax=Syntrophus aciditrophicus SB RepID=Q2LPR2_SYNAS Length = 387 Score = 48.4 bits (114), Expect = 3e-04, Method: Composition-based stats. Identities = 45/321 (14%), Positives = 104/321 (32%), Gaps = 32/321 (9%) Query: 22 ALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFP---MAKPFWH 78 + + + +QR + + D + N P + Sbjct: 56 VNNAVIGGRATHLGSKREISSEIQRTRRILTELKRKISMCRPDAVHLNSPCGRFGIMRDY 115 Query: 79 ILSFFHRLLKFRIVPL----IHD-IDELRGGGGSDSVRLATCDMVISHNPQMTKYLSKYM 133 + + + ++ I D I+ R A+ + ++ N +++ K+ Sbjct: 116 LCALAVKKKGIPVIVHFRCNIEDQINNSRLSLYFFKKLAASANSILVLNSPSKEFVLKHA 175 Query: 134 SQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIYT------EGCDFTL 187 +D + D + + ++ V++ GN+ K + G +F L Sbjct: 176 RRDSRQVANFIDDDYVIEQPKPISPEIRK-VLFVGNVLESKGAKEIVSAASDFPGTEFIL 234 Query: 188 FGVNYENKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKT 247 G + + G+ + + L+ + D S G + Sbjct: 235 AGPVEDKVSSLNAPGNVRLLRKQIPHEEIRD--LLDEADVFLFPSYTEG------FSNAM 286 Query: 248 SLYLSMELPVFIWDKAALADFIVDNRIGYAV--GSIKEMQEIVDSMTIETY-KQISE-NT 303 ++ +P+ A AD I D+ G V GS+ ++ + S++ + +++S N Sbjct: 287 LEAMARGVPIIATGVGANADMIEDSG-GMLVRAGSVPDIVHSLRSISPASVRERMSRWNV 345 Query: 304 KIISQKIRTGSYFRDVLEEVI 324 K+R V+E +I Sbjct: 346 ----NKVRDAYLINRVMECLI 362 >UniRef50_B4S4G0 Glycosyl transferase group 1 n=1 Tax=Prosthecochloris aestuarii DSM 271 RepID=B4S4G0_PROA2 Length = 374 Score = 48.4 bits (114), Expect = 4e-04, Method: Composition-based stats. Identities = 14/79 (17%), Positives = 33/79 (41%), Gaps = 5/79 (6%) Query: 108 DSVRLATCDMVISHNPQMTKYLSKY--MSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVI 165 + + D +I +M KY+++ ++KI + + V +++ + K+ +I Sbjct: 143 EKKLFTSADAIIVTTEEMRKYITERIKGVEEKISVLPNY---VDTELFKPEEAAKEFDLI 199 Query: 166 YAGNLSRHKCSFIYTEGCD 184 + G S K E + Sbjct: 200 FIGRFSEQKNLKSLLEAIE 218 >UniRef50_C0VZT2 Glycosyltransferase n=1 Tax=Actinomyces coleocanis DSM 15436 RepID=C0VZT2_9ACTO Length = 360 Score = 48.4 bits (114), Expect = 4e-04, Method: Composition-based stats. Identities = 17/74 (22%), Positives = 35/74 (47%), Gaps = 2/74 (2%) Query: 236 GDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVG-SIKEMQEIVDSMTIE 294 +Y +F P K YL LPVF F+ N G+ V ++ ++ ++S++ Sbjct: 268 DEYWRFAVPIKLFDYLGHGLPVFATADTWAGKFVSQNECGWTVEYDVEAIKTQLESLSSL 327 Query: 295 TYKQ-ISENTKIIS 307 ++ + + + K +S Sbjct: 328 SWVEDMEAHIKTVS 341 >UniRef50_B8DZE0 Glycosyl transferase group 1 n=1 Tax=Dictyoglomus turgidum DSM 6724 RepID=B8DZE0_DICTD Length = 415 Score = 48.4 bits (114), Expect = 4e-04, Method: Composition-based stats. Identities = 34/252 (13%), Positives = 71/252 (28%), Gaps = 45/252 (17%) Query: 108 DSVRLATCDMVISHNPQMTKYLSKYMSQ-DKIKDIKI------FDYLVSSDVEHRDVTDK 160 + D +I P+ +Y+ K +KI I F + + T Sbjct: 167 EKFLYKRADKIIVLLPRANEYIEKLGISPEKIVWIPNGVDFERFQFKNGGSLRDETYTSD 226 Query: 161 QRGVIYAGNLSRHKCSFIYTE----------GCDFTLFGVNYEN--------KDNPKYLG 202 + V Y G + + + E F G E K+ L Sbjct: 227 EFIVTYTGAIGKANNLDVAVEAAKILQKDYPNIKFLFVGDGPEKGRLLEIVKKEKINNLE 286 Query: 203 SFDAQSPE-KINLPGMQFGLIWDGDSVETCSGAFGDY-LKFNNPHKTSLYLSMELPVFIW 260 + + + D + Y + N K YL+ P+ Sbjct: 287 FKAPLPKDRIVEIIQKA-----DVLFLALKDSPLYKYGISLN---KLFDYLASGKPIIFS 338 Query: 261 DKAALADFIVDNRIGYAVGS-----IKEMQEIVDSMTIETYKQISENTKIISQKIRTGSY 315 + + + + G V + + + M+ E + + N + +K + Sbjct: 339 SNSINNP-VDEAKAGITVPPDNPQALADAIIKLYKMSPEERRAMGLNGRKYVEK----YH 393 Query: 316 FRDVLEEVIDDL 327 VL + ++ + Sbjct: 394 SIPVLVDKLEKI 405 >UniRef50_A5I5D7 Glycosyl transferase, group 1 family n=3 Tax=Clostridium botulinum A RepID=A5I5D7_CLOBH Length = 392 Score = 48.4 bits (114), Expect = 4e-04, Method: Composition-based stats. Identities = 46/333 (13%), Positives = 105/333 (31%), Gaps = 62/333 (18%) Query: 27 SDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRL 86 + + L V+ E+ DV + P +IL+ + Sbjct: 50 EGIKIKRLFKYNLGTTVLVEKNLLAHFDLINNLSESFDVYHCHDTETWPIGYILA---KR 106 Query: 87 LKFRIVPLIHDIDELRGGGG----------------SDSVRLATCDMVISHNPQMTKYLS 130 + + H+ + + CD I+ N +++ L Sbjct: 107 DGAKFICDSHEFFPDYICKEWHASDFKYELTKLLVIARGEYIKYCDGAITVNEIISEELY 166 Query: 131 KYMSQDKIKDIKIFDYLVSSDVEHR-----------DVTDKQRGVIYAGNLSRHKCS--- 176 K K + I++ ++V + +++ ++ ++++G + + Sbjct: 167 NQ-FNLKYKPLVIYNTRSINEVSKKYNIGIDIRKKYNISKDKKILLFSGTVEPSRGLDLV 225 Query: 177 ---FIYTEGCDFTLFGVNYEN----------KDNPKYLGSFDAQSPEKINLPGMQFG--L 221 Y E C F + G + N K N K F + + F L Sbjct: 226 IKSMPYVENCVFIIAGGDKFNCIEKLQELVQKYNVKDKVIFTGKLNYQELFNYGFFADLL 285 Query: 222 IWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVG-- 279 ++ G + ++ P+K Y+ P+ I + ++ + IG + Sbjct: 286 VYLGIPTVKN-------MDYSAPNKFFDYMMAGKPMIISNLKFMSSVVTRYDIGEIIDIK 338 Query: 280 --SIKEMQEIVDSM--TIETYKQISENTKIISQ 308 + KE+ ++S+ + E + S+N I Sbjct: 339 NINFKEIGYKINSLIHSDEKSCKYSKNIFKIQD 371 >UniRef50_Q55374 Slr0907 protein n=2 Tax=Chroococcales RepID=Q55374_SYNY3 Length = 1014 Score = 48.0 bits (113), Expect = 4e-04, Method: Composition-based stats. Identities = 39/273 (14%), Positives = 81/273 (29%), Gaps = 36/273 (13%) Query: 67 IFNFPMAKPFWHILSFFHRLLKFRIVPLIHD---IDELRGGGGSDSVRLATCDMVISHNP 123 + P + I RL + I D I + L D+ + + Sbjct: 106 FLSVPYFPEDYQISIALKRLYNCPLCVFIMDDQNIYSPQVPDSLVDELLYRADICLGISR 165 Query: 124 QMTKYLSKYMSQD----KIKDIKIFDYLVS--SDVEHRDVTDKQRGVIYAGNLSRHKCSF 177 + + +L+ K++GV+ GN+ + Sbjct: 166 PLCDAYEAKFKRKFWFVPPVVQG---HLIKTEPPQSWDRSPSKRQGVMI-GNVWSQQWLD 221 Query: 178 IYT-----EGCDFTLFG-VNYEN---------KDNPKYLGSFDAQSPEKINLPGMQFGLI 222 G +G N + +D + G F + L F +I Sbjct: 222 QLRSLCRATGVKIDWYGNPNRDWLNFNEAELGEDGINFKG-FVPEDELIDILRTADFAVI 280 Query: 223 WDGDSVETCSGAFGDYLKFNNPHK-TSLYLSMELPVFI--WDKAALADFIVDNRIG-YAV 278 G S + SG + K + P + + + LP+ + ++A+A F+ IG Sbjct: 281 PTGISDQ--SGDRPELTKLSLPSRSCFITATANLPILVVGSPESAVAQFVQSQGIGTVCA 338 Query: 279 GSIKEMQEIVDSMTIETYKQISENTKIISQKIR 311 ++ +D + + + + Q + Sbjct: 339 YQPEDFSAKIDYLLENQ-GPMRKQAFKLGQNLS 370 >UniRef50_Q73L31 Glycosyl transferase, group 1 family protein n=1 Tax=Treponema denticola RepID=Q73L31_TREDE Length = 385 Score = 48.0 bits (113), Expect = 4e-04, Method: Composition-based stats. Identities = 43/332 (12%), Positives = 101/332 (30%), Gaps = 54/332 (16%) Query: 32 ISVVNIPLWGGVVQRIISSVKLSTFLCGLE--NKDVLIFNFPMAKPFWHILSFFHRLLKF 89 I + +IP + RI +K + ++ N D++ + +F + L Sbjct: 54 IRIPSIPFFKWSEFRIGLFLKHTKAYNKVKALNFDIVHTQ--TEFSMGNFGTFIAKDLNI 111 Query: 90 RIVPLIHDI--DELRGGGGS------------DSVRLATCDMVISHNPQMTKYLSKYMSQ 135 + H + + +A VI+ + L Y + Sbjct: 112 PCIHTYHTVYEEYTHYISNFGKSPLKKVVRKLSKRYIAHFSGVIAPTEKTRDLLISYGVK 171 Query: 136 DKIKDIK-----------IFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIYTEGCD 184 +KI + I D +S ++ ++ +I+ G +S+ K Sbjct: 172 NKIYVVPTGINLEKFKKDIPDAETNSLLKSFNIKKDSFKLIFLGRISKEKNI-----ETL 226 Query: 185 FTLFGVNYENKDNPKYLGSF-DAQSPEKINLPGMQFGLIWDGDSVETCSGAFGD---YLK 240 + +N + + E L + + T Y K Sbjct: 227 INIMPKIVSENNNIQLIIVGDGPDRLELEERVRY---LDLQDNVIFTNRIPNDKVPIYYK 283 Query: 241 ----FNNPHKT-------SLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVD 289 F +P KT ++ +PV ++D + ++ + G E+ + + Sbjct: 284 AADLFISPSKTETQGLTILEAMAAGVPVLVYDDTNIKGLVLHKKTGLLFKENDELLDNIK 343 Query: 290 SM--TIETYKQISENTKIISQKIRTGSYFRDV 319 E + ++ I++ + ++ + V Sbjct: 344 FALNNKEEIQSYAKEAFKIAEDFSSANFAKKV 375 >UniRef50_B5IAN8 Glycosyl transferase, group 1 family protein n=3 Tax=Aciduliprofundum boonei T469 RepID=B5IAN8_9EURY Length = 375 Score = 48.0 bits (113), Expect = 4e-04, Method: Composition-based stats. Identities = 45/359 (12%), Positives = 105/359 (29%), Gaps = 71/359 (19%) Query: 2 YFLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLE 61 Y ++ + +R+ G K+ + + V ++P + + ++ + + + Sbjct: 19 YVVHRYSTMQRNDGHKSYVITTKLPNTKNFEYVDSVPYY--RLSKVGMGTSIIRKIKKIN 76 Query: 62 NKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDI---DELRGGGGSDSVRLATCDMV 118 + + + F I+ IHD+ G DS + Sbjct: 77 PD---LIHTHSYIAAPVLSYFHKINPNIPILRHIHDVYIGKYEEYSGWEDSKMYERFEGF 133 Query: 119 I---------SHNPQMTKYLSKYMSQD-------KIKDIKIFDYLVSSDVEHRDVTDKQR 162 I + + L + DI+ F + + + K + Sbjct: 134 IIKLPYTAYITPSKYTKDKLIELGLPKERIHVVHPGVDIEKFGNSNRNYLREKYNIPKDK 193 Query: 163 GVI-YAGNLSRHK---CSFIYTEGCD---FTLFGVNYEN--------------------- 194 +I + G LS K + L G N Sbjct: 194 KIIGFVGRLSTGKGPQYLIEAAKDLKEAYIVLVGPNPNPKTSGILGIESMLRSLVKKYRM 253 Query: 195 KDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSME 254 +D + G + F L S+ G + L+ Sbjct: 254 EDRVIFAGKIRDEEVPLYYDSFDIFCL----PSISEGFGMS-----------IAEALAAG 298 Query: 255 LPVFIWDKAALADFIVD--NRIGYAVGSIKEMQEIVDSMTIET--YKQISENTKIISQK 309 PV ++ A+ + + D N + + +++E ++ + + Y+++ +NT+ +K Sbjct: 299 KPVVSFNITAIPEIVKDGYNGLLAMPKDVDDLKEKLEMLINDERLYERLKKNTRSSVEK 357 >UniRef50_C6IGY3 Putative uncharacterized protein n=1 Tax=Bacteroides sp. 1_1_6 RepID=C6IGY3_9BACE Length = 420 Score = 48.0 bits (113), Expect = 5e-04, Method: Composition-based stats. Identities = 41/235 (17%), Positives = 78/235 (33%), Gaps = 25/235 (10%) Query: 110 VRLATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGN 169 + + ++ M + K + I I D + + + T ++ V+Y G Sbjct: 192 EAINNSNGLVLLTEAMMDFYQKDL--KHIVMEGIVD-VGTMGKSDVEPTTDKKVVLYTGT 248 Query: 170 LSRHKCSFIYTEGCDFTLFGVNYENKDNPKYLGSFDAQSPEKIN---LPGMQ---FGLIW 223 L + E + + S E IN + FGL+ Sbjct: 249 LRKIFGIMNLVEAFKMV-------KDRDVELWICGSGDSKEAINEAARIDSRIKFFGLVD 301 Query: 224 DGDSVE---------TCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRI 274 ++E + G+Y K++ P KT YL V I + + + D Sbjct: 302 SETALEMQHKATILVNPRTSEGEYTKYSFPSKTMEYLLAGRSVIINHLSGIPEEYYDYVY 361 Query: 275 GYAVGSIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDDLKT 329 S++ + E + S+ K+ E K Q I + +E VI +++ Sbjct: 362 TPKDESVEALAECISSVIHLDIKEREERAKKGRQFIIEKKNSKVQMERVIKMIES 416 >UniRef50_A5I5E4 Glycosyl transferase, group 1 family n=2 Tax=Clostridium botulinum A RepID=A5I5E4_CLOBH Length = 401 Score = 48.0 bits (113), Expect = 5e-04, Method: Composition-based stats. Identities = 45/311 (14%), Positives = 101/311 (32%), Gaps = 45/311 (14%) Query: 48 ISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDELRGGGGS 107 + S+ L L + +K +I + + + + I+ I Sbjct: 103 LYSLLLIDKLKKINSKAKII-----YEDREFYPDAIRDYNETKGICTINKIIYSYYMSLW 157 Query: 108 DSVRLATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYA 167 + ++ A D VI + + K + ++K+ I+++ + + K +IY Sbjct: 158 EIMKAAKSDQVIVTDKNIFKRFKNRLGKNKVNI--IYNFTNIEPIHSMNHEKKTYDIIYC 215 Query: 168 GNLSRHKCSFIYTE----------GCDFTLFGV--------------NYENKDNPKYLGS 203 G +++ + F + F L G +N LG Sbjct: 216 GGVTKVRGIFQVVKAVKLAKNDGYNLKFLLIGPIDSNFKNELLDYINKNNLNENVDILG- 274 Query: 204 FDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKA 263 E + + K N P K Y++ LPV D Sbjct: 275 -RINFSELPKYLEKS--------KIGIVTLLSIPKYKKNIPMKQFEYMAYGLPVVGSDLP 325 Query: 264 ALADFIVDNRIGYAVGSIK--EMQEIVDSM--TIETYKQISENTKIISQKIRTGSYFRDV 319 + +F ++ G V + E+ + + ++ + E Y ++S+N ++ + Sbjct: 326 PIKEFTGESNSGILVNPMNEREIWKAIKALLESEELYYELSKNGIQAVKEKYNWKCSEEK 385 Query: 320 LEEVIDDLKTR 330 L + ++L + Sbjct: 386 LISIYNNLLDK 396 >UniRef50_A4SDY1 Glycosyl transferase, group 1 n=1 Tax=Chlorobium phaeovibrioides DSM 265 RepID=A4SDY1_PROVI Length = 366 Score = 48.0 bits (113), Expect = 5e-04, Method: Composition-based stats. Identities = 29/220 (13%), Positives = 69/220 (31%), Gaps = 34/220 (15%) Query: 106 GSDSVRLATCDMVISHNPQMTKYLSKY--MSQDKIKDIKIFDYLVSSDVEHRDVTDKQRG 163 +S + D + P M + + K + K+ I + V++++ + K Sbjct: 141 NYESKLFSAADGIEVTTPMMQESIEKRISGTNGKVTVIPNY---VNTELFAPSSSVKDID 197 Query: 164 VIYAGNLSRHKCSF-----IYTEGCDFTLFGVNYENKDNPKYLGSFDAQSPEKINLPGMQ 218 +++ G L+ K + + G + +Q+ E Sbjct: 198 ILFIGRLNPQKNLQTLLKALQGLKLKIAIIGKGPLEDE-------LKSQAQELELDIE-- 248 Query: 219 FGLIWDGDSVETCSGAFGDYLKF--------NNPHKTSLYLSMELPVFIWDKAALADFIV 270 W G+ + + K +P ++ LPV D + + I Sbjct: 249 ----WPGNIPNPDLPNWLNRSKIFVLPSHYEGHPKTLIEAMATGLPVIGGDAPGIREIIS 304 Query: 271 DNRIGY-AVGSIKEMQEIVDSM--TIETYKQISENTKIIS 307 + G+ + E V+ + + + +S N + + Sbjct: 305 HLKTGWLCPTDTGGISEAVEELMASEALRQHLSNNARGFA 344 >UniRef50_C3AT67 Glycosyl transferase, group 1 n=3 Tax=Bacillus RepID=C3AT67_BACMY Length = 373 Score = 48.0 bits (113), Expect = 5e-04, Method: Composition-based stats. Identities = 49/330 (14%), Positives = 101/330 (30%), Gaps = 61/330 (18%) Query: 18 ARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFW 77 ARKD NI + IP + RII KL + +K ++ + P Sbjct: 42 ARKDYD---EKIGNIQIFAIPKSKNFLSRIILQPKLLCKILKTRSK-IVHLHNPDTILLG 97 Query: 78 HILSFFHRLLKFRIVPLIHDIDELRGG-----------------GGSDSVRLATCDMVIS 120 +L F++ +++ H+ R G + + D I Sbjct: 98 FLLKMFNK----KVIYDTHEDFSKRILIREWIPYSYRKLIAKVVTGLELMASRCFDGFIV 153 Query: 121 HNPQMTKYLSKYMS--QDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHK---- 174 + + I ++ + ++ V+Y G +S + Sbjct: 154 TQEGLLAKYKNALLIENAPIHQGELI--TRAYELSKLISKSNYIRVVYVGGISEQRGLRQ 211 Query: 175 ---CSFIYTEGCDFTLFGVNYENKDN-----------PKYLGSFDAQSPEKINLPGMQFG 220 + L+ + ++K+ YLG + + G Sbjct: 212 IVQALEEINKTYSCRLWLIGPDSKEINELQKLNGWKYVDYLGKLPQEKA-FSYVIKSDIG 270 Query: 221 LIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVG- 279 L D V+ +P+K Y+++ +P D I + GY V Sbjct: 271 LCTILDVVDHAE---------TSPNKIFEYMTLGIPFIASDFPKWEKKIGCSNAGYFVDP 321 Query: 280 -SIKEMQEIVDSMTIETYKQ--ISENTKII 306 +I ++ +++ + + + +S N K Sbjct: 322 QNINKIADLILKIGSDENLKSYLSNNGKKY 351 >UniRef50_B7KM09 Glycosyl transferase group 1 n=5 Tax=Cyanobacteria RepID=B7KM09_CYAP7 Length = 566 Score = 47.6 bits (112), Expect = 5e-04, Method: Composition-based stats. Identities = 48/304 (15%), Positives = 89/304 (29%), Gaps = 51/304 (16%) Query: 22 ALDIASDY----ENISVVNIPLWGGVVQRIISSVKLSTFLCGL--ENKDVLIFNFPMAKP 75 N + G ++ I+ ++ L + +L+ P Sbjct: 72 IETFLGGMVKRTRATQFWNGRIRGKLINGILFFLRAGIHLLKNVQKENKLLLTTAPA--F 129 Query: 76 FWHILSFFHRLLKFRIVPLIHDIDE-----LRGGGGSDSV-----RLA-----TCDMVIS 120 + F K V LI+D+ L+ + + L + +I Sbjct: 130 LIFLGYFLKVFRKISYVCLIYDLYPDVAVQLKVVSPKNLIVKLWEFLNVKTWEKAEKIIV 189 Query: 121 HNPQMTKYLSKYMSQ--DKIKDIKIF---DYLVSSDVEHRDVTDK-----QRGVIYAGNL 170 N M + Q DKI I + ++V D + V+Y+GN+ Sbjct: 190 LNSSMKNRILAKHPQFYDKISVIHNWADPKWIVPLDKSDNWFAQNHNLVDKFTVLYSGNM 249 Query: 171 SRHKC--------SFIYTEGCDFTLFGVNYENKDNPKYLGSFDAQSPEKINLPGMQ---F 219 R + F G E + K + S+ ++ + Q + Sbjct: 250 GRCHDVTTILDAVLQLQNAPIQFVFIGGGAEYEKLLKQVKSWGLKNCLFLPYQDKQILPY 309 Query: 220 GLIWDGDSVETCS-GAFGDYLKFNNPHKTSLYLSMELPV--FIWDKAALADFIVDNRIGY 276 L S+ + G G P K L+ P+ + L I D + G Sbjct: 310 SLTACDLSIVSIKPGMEG----IVAPSKFYSMLAAGRPIVAICEKHSYLRQIINDAKCGI 365 Query: 277 AVGS 280 A+ + Sbjct: 366 AIEN 369 >UniRef50_D1Y981 Glycosyltransferase, group 1 family protein n=2 Tax=Propionibacterium acnes RepID=D1Y981_PROAC Length = 391 Score = 47.6 bits (112), Expect = 6e-04, Method: Composition-based stats. Identities = 13/86 (15%), Positives = 35/86 (40%), Gaps = 1/86 (1%) Query: 243 NPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQISEN 302 +P+K Y++ L V K + D I D+ +G V S ++ ++ + + Sbjct: 302 SPNKLYDYMAAGLAVVSNAKVPIRDVISDDEVGACVDS-TDLVAGIERVRDADEATMKRW 360 Query: 303 TKIISQKIRTGSYFRDVLEEVIDDLK 328 + + + + ++++ L+ Sbjct: 361 HERARELMANKYSLQASVDKLARVLE 386 >UniRef50_B0KUI7 Glycosyl transferase group 1 n=1 Tax=Pseudomonas putida GB-1 RepID=B0KUI7_PSEPG Length = 415 Score = 47.6 bits (112), Expect = 6e-04, Method: Composition-based stats. Identities = 40/264 (15%), Positives = 71/264 (26%), Gaps = 44/264 (16%) Query: 82 FFHRLLKFRIVPLIHDID-------ELRGGGGS-DSVRLATCDMVISHNPQMTKYL-SKY 132 R + R+V H+I R G + + D I+ KY Y Sbjct: 133 LAARFSRARLVYDAHEISTSREGYSSFRRLVGFVEKQLMPKVDGSITTTDARAKYFARAY 192 Query: 133 MSQDKIKDIKIFDYLVSSDVEHRDVTDKQ-----RGVIYAGNLSRHKCSFIYT------E 181 ++ L +R + + ++Y G L + + Sbjct: 193 GIARPTV-LQNRPRLTHCKNSNRIREELELKASWPIILYQGGLQQGRGLEKLIRTAADVP 251 Query: 182 GCDFTLFGVNYENKDNPKYLGSFDAQS----------PEKINL-PGMQFGLIWDGDSVET 230 F G D Q + + G+ + T Sbjct: 252 NAYFVFIGGGRLALDLAHLRDKLGLQERVHFIPTVSLSQLPSYTASADIGVQPIEN---T 308 Query: 231 CSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAV---GSIKEMQEI 287 C Y +N K YL LPV D + + N +G V S + Sbjct: 309 C---LNHYTTDSN--KLFEYLIAGLPVVATDFPEIRRIVRSNNVGLLVPANDSSSLAGAL 363 Query: 288 VDSMTIETYKQ-ISENTKIISQKI 310 + +T + + N + + K+ Sbjct: 364 IQLVTDLELRSTFATNARSTAGKL 387 >UniRef50_B9EAW2 Putative uncharacterized protein n=1 Tax=Macrococcus caseolyticus JCSC5402 RepID=B9EAW2_MACCJ Length = 384 Score = 47.6 bits (112), Expect = 6e-04, Method: Composition-based stats. Identities = 39/357 (10%), Positives = 97/357 (27%), Gaps = 48/357 (13%) Query: 4 LNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLENK 63 L +A K + + + + + N L V++ + ++ + + + Sbjct: 41 LKQSYKEGYNAVRKHKTELPNFIVN----RINNNKLKTIVLKHLPNAFLMMKMIQEGYKQ 96 Query: 64 DVLIFNFPMAKPFWHILSFFHRLLKFRI-VPLIHDIDELRGGGGS------DSVRLATCD 116 D I++ ++ RI + H+++ R S + + D Sbjct: 97 DADIYHSHDLNTLIQGIACAKLRTDKRILIFDAHEVNTSRTNYKSGLVGAIEKFLIRFTD 156 Query: 117 MVISHNPQMTKYLSKYMSQDKIKDIKIFD--YLVSSDVEHRDVTDKQRGVIYAGNLSRHK 174 I N Y + + + Y + + IY G L + Sbjct: 157 RTIVENETRATYHELLYGYRPM-SLYNYSEYYDIDEVEAINLPLKYDKTFIYQGGLQEGR 215 Query: 175 CSFIYTEGCDFTLFGVN-------------------YENKDNPKYLGSFDAQSPEK-INL 214 N + +D + G +S Sbjct: 216 GLERLLRAFKAADIPANLLMVGDGKIRPQLEALTRSFNLQDKVTFTGRVPYESLRSYTKA 275 Query: 215 PGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRI 274 F + F Y + +K Y+ +PV + + + + I Sbjct: 276 AYAGF--------QILENVNFNHYS--ASSNKLYEYMMAHVPVIATNLLEIKNVVEKEGI 325 Query: 275 GYAV--GSIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDDLKT 329 G + S +E+ + + M + + K + + + +++++ Sbjct: 326 GLIIKHDSEEELTDAIRKMFEDE--TMRNAMKERMKVSKEQYNWEKEKQKLLNLYHD 380 >UniRef50_B3QYT6 Glycosyl transferase group 1 n=1 Tax=Chloroherpeton thalassium ATCC 35110 RepID=B3QYT6_CHLT3 Length = 397 Score = 47.6 bits (112), Expect = 6e-04, Method: Composition-based stats. Identities = 31/280 (11%), Positives = 75/280 (26%), Gaps = 53/280 (18%) Query: 83 FHRLLKFRIVPLIHDI--DEL--------RGGGGSDSVRLATCDMVISHNPQMTKYLSKY 132 + + + +IHD+ + + + + D +IS K + Sbjct: 135 ILQKHGKKSIVIIHDVVFELFQPKYKWITKIVSKIEIDAVQQADHIISLTEHDAKRFQQL 194 Query: 133 MSQDKIKDIKIFDYLVSSDVEHRDVTD--------KQRGVIYAGNLSRHKCSFIYTEGC- 183 + + + + V RD ++ +++ G + + + Sbjct: 195 GVKKPMTVLPL---PTEKPVIQRDAVKSVQEKYQIDEKSLMFIGGYNHNVHAAEKIMDII 251 Query: 184 -------DFTLFGVNYE-------NKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVE 229 F + G N +G F L+ V Sbjct: 252 APKLPEYKFYILGSVCNAKEIVQREAKNVIKVGRVSDDEKHAFYEL-CTFCLV----PVF 306 Query: 230 TCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVD 289 G F K LS + + A ++ G + E + Sbjct: 307 GVPGGFST--------KLVEALSYGMVILTTALGARGVTFEHDKHGVVCDDFESYPERIR 358 Query: 290 SMTIETYKQISENTKIISQKIRTGSYFRDVLEEVIDDLKT 329 S+ + S+ +S+K ++ V + + +++ Sbjct: 359 SLNDSQKQAYSQAALELSEK----YNYKTVYRDYLPIIES 394 >UniRef50_P72922 Slr1085 protein n=1 Tax=Synechocystis sp. PCC 6803 RepID=P72922_SYNY3 Length = 424 Score = 47.6 bits (112), Expect = 7e-04, Method: Composition-based stats. Identities = 40/295 (13%), Positives = 81/295 (27%), Gaps = 59/295 (20%) Query: 33 SVVNIPLWGGVVQRIISSVKLSTFLCGLEN-KD-VLIFNFPMAKPFWHILSFFHRLLKFR 90 + L G + I+ ++ L D +L+ P + H L K Sbjct: 69 RLWPQRLRGRAIAGILYCLRAIVKLRLGSRLGDLILVTTEPPY--LMVVAYILHLLYKKP 126 Query: 91 IVPLIHDIDE---LRGGGGSDSV-------RLA-----TCDMVISHNPQMTKYLSKY--M 133 + LI+D+ ++ G + L + +I + M K ++ Sbjct: 127 YICLIYDLYPDVAVKLGVAKEKDAIVKLWRWLNRLTWQKAEAIIVLSESMAKVIADQQPA 186 Query: 134 SQDKIKDIKIF-DYL-------VSSDVEHRDVTDKQRGVIYAGNLSRHKCSF-------- 177 KI+ + + D + + R D+ V+Y+GNL R Sbjct: 187 LAGKIEVVHNWADGVLIQPRAKADNWFAQRHGLDRTFTVLYSGNLGRCHDLETVMEAARL 246 Query: 178 IYTEGCDFTLFGVNYENKDNPKYLGSF---------DAQSPEKINLPG-MQFGLIWDGDS 227 + E F G + +++ P L+ Sbjct: 247 LQQEEVQFVFIGAGAKAPVCQEFVQRHQLTNCLFLPFQPKPVLPFSLTACDLSLV----- 301 Query: 228 VETCSGAFGDYLKFNNPHKTSLYLSMELPV--FIWDKAALADFIVDNRIGYAVGS 280 P K L+ + + L + I + + G + + Sbjct: 302 -----SILPQVEGLVVPSKFYGCLAAGTAIAAICPPHSYLREIIAEAQCGATIDN 351 >UniRef50_B5YDS1 Glycosyltransferase n=1 Tax=Dictyoglomus thermophilum H-6-12 RepID=B5YDS1_DICT6 Length = 387 Score = 47.3 bits (111), Expect = 7e-04, Method: Composition-based stats. Identities = 21/147 (14%), Positives = 54/147 (36%), Gaps = 18/147 (12%) Query: 48 ISSVKLSTFLCGLENKDVLIFNFP---------MAKPFWHILSFFHRLLKFRI---VPLI 95 + +N D++ + P +AK + + F H + +PLI Sbjct: 75 LYFSFFVEDFFKKQNFDIVHSHHPFVIGKTALKLAKKYRIPIVFTHHTQYHKYVHYIPLI 134 Query: 96 HDIDELRGGGGSDSVRLATCDMVISHNPQMTKYLSKYMSQDKIKDIK------IFDYLVS 149 + + D+VI+ ++ + + + + +I+ + +++ +S Sbjct: 135 PEKISAKFAIQESVKYANQVDLVIAPTKEIKEMIINFGVKTRIEILPTGIDFSLWEKDIS 194 Query: 150 SDVEHRDVTDKQRGVIYAGNLSRHKCS 176 + +R ++YAG L++ K Sbjct: 195 EEFLKNFPWKDKRILLYAGRLAKEKNI 221 >UniRef50_B9MRN9 Glycosyl transferase group 1 n=1 Tax=Anaerocellum thermophilum DSM 6725 RepID=B9MRN9_ANATD Length = 366 Score = 47.3 bits (111), Expect = 7e-04, Method: Composition-based stats. Identities = 42/302 (13%), Positives = 93/302 (30%), Gaps = 49/302 (16%) Query: 43 VVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDEL- 101 +R + +K+ + + D++ F+ P FF + +++ +H+ L Sbjct: 58 RSKRYKNYLKIIRIIKEFKP-DIIHFHDPDLLVLSLYFKFFLKK---KVIYDVHEDYPLA 113 Query: 102 -RGGGGSDSVRLATCDMVISHNPQMTKYLSKYMSQ-----------DKIKDIKIFDYLVS 149 R ++ + + L + + IK + + Sbjct: 114 FRDRKYFPEPFTMLFSILFNLAEKTISRLLDGVVTVTEDIYLKFNCKNKEVIKNYPVAET 173 Query: 150 SDVEHRDVTDKQRGVIYAGNLSRHKCS--------FIYTEGCDFTLFGVNYENKDNPKYL 201 E D +IY G++S+ + ++ + G +N KY Sbjct: 174 FLEEKDGTVDGTLNLIYIGSVSQSRGITNLILAVKSLHELDIRLDIVGP----AENQKYF 229 Query: 202 GSFDAQSPEKINLPGMQFGLIWDGDSVETCSG---------AFGDYLKFNNPHKTSLYLS 252 E+I + +G + D G Y K + P K Y++ Sbjct: 230 EEIKKYEDERIRI----WGRVPKKDIPGILKGSHVGFVTLLPLSRY-KTSLPLKLFEYMA 284 Query: 253 MELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQISENTKIISQKIRT 312 ++ V + I + G V ++Q++ +++ N I +K Sbjct: 285 AKVAVVASNFELWRKIIEEADCGVLVDP-TDIQDVKNAI-----LFFYNNRDQIIKKGLN 338 Query: 313 GS 314 G Sbjct: 339 GY 340 >UniRef50_B3QUF3 Glycosyl transferase group 1 n=1 Tax=Chloroherpeton thalassium ATCC 35110 RepID=B3QUF3_CHLT3 Length = 387 Score = 47.3 bits (111), Expect = 7e-04, Method: Composition-based stats. Identities = 48/325 (14%), Positives = 100/325 (30%), Gaps = 59/325 (18%) Query: 45 QRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIH----DIDE 100 + + + + DV++ P A P+ + +R + + H D+ Sbjct: 81 WFYMDAFRDIRKWHREKPIDVVVSRDPGALPYM---AKLNRSKQIAVFYQPHNFYADL-S 136 Query: 101 LRGGGGS---------DSVRLATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSD 151 +R + + V+ ++ K I K Sbjct: 137 VRPDVNPKNAKKYHLLEKKYIPKMTGVLCLQDSQAEWFRKAFPSQNILVAKPGMMRTQP- 195 Query: 152 VEHRDVTDKQRGVIYAGNLSRHK--------CSFIYTEGCDFTLFG-------------- 189 H ++R + Y G+L K + E L G Sbjct: 196 --HAGKGFQRRLIGYVGSLQLKKGVETLLRAFQILEPEKFKVILVGGRNQHEMAELKQRI 253 Query: 190 --VNYENKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKT 247 + E+K S+ + G+I D + YL P+K Sbjct: 254 YELGLEDKVLITGWVSYAQVEIYLEKISV---GIIPLSDEF------YNRYLT--APNKL 302 Query: 248 SLYLSMELPVFIWDKAALADFIVDNRIGYAV--GSIKEMQEIVDSM--TIETYKQISENT 303 YLS +P+ D ++ DFI + G V + + + + + + ++Y+ Sbjct: 303 FDYLSRGIPIVASDLPSIRDFIAEGNEGLFVPPENPEALAAAIRKIFESEQSYEAFHARA 362 Query: 304 KIISQKIRTGSYFRDVLEEVIDDLK 328 + + K + R+++E++ LK Sbjct: 363 RKSAIKYLWENQARNMIEQIQKCLK 387 >UniRef50_C2CI25 L-fucosamine transferase n=1 Tax=Anaerococcus tetradius ATCC 35098 RepID=C2CI25_9FIRM Length = 409 Score = 47.3 bits (111), Expect = 7e-04, Method: Composition-based stats. Identities = 50/355 (14%), Positives = 106/355 (29%), Gaps = 47/355 (13%) Query: 17 KARKDALDIASDYENI-SVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKP 75 K D + NI I + ++ D++I+ P Sbjct: 58 KFINDIHYLDVKIGNITKTNKIEKGIATINIENQFLRAIKVWLSDVKFDIVIYPTPP-IT 116 Query: 76 FWHILSFFHRLLKFRIVPLIHDIDE---------------LRGGGGSDSVRLATCDMVIS 120 F+ ++ + R ++ DI + + D + Sbjct: 117 FYKVIKYLKRRDGALTYLMLKDIFPQNAVDLGFFSKKSLIYKYFRKKEKKLYEISDYIGC 176 Query: 121 HNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDV------------EHRDVTDKQRGVIYAG 168 + YL K K ++IF + E ++ +R IY G Sbjct: 177 MSKANVDYLLKCNPSLDEKKVEIFPNSIEPVNLSIDEEEIYFVREKYNLPHDKRIFIYGG 236 Query: 169 NLSRHKCS---------FIYTEGCDFTLFGVNYENKDNPKYLGSFDAQSPEKINLP-GMQ 218 NL + + + F + G E + K + + + Sbjct: 237 NLGKPQGLTFLLNCIESIKSIDDIMFLMVGNGTEFEYIVKMKDELNLYNLRIMAKLSKDD 296 Query: 219 FG-LIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPV--FIWDKAALADFIVDNRIG 275 F L+ S + N P + Y+ ++P+ + + + I+ N +G Sbjct: 297 FDKLV--ASSDVGIISLDHRFTIPNYPSRILSYMQSKIPILAITDNNSDIGLDIIKNNMG 354 Query: 276 YAV--GSIKEMQEIVDSM-TIETYKQISENTKIISQKIRTGSYFRDVLEEVIDDL 327 + SI E +EI+ + T ISE + + ++ + + +I+ + Sbjct: 355 WRCKKNSINEFREIISEIMTDFNIHNISEKKENSFEYLKNNFDIKKSIMNIINKI 409 >UniRef50_B9YG53 Glycosyl transferase group 1 n=1 Tax='Nostoc azollae' 0708 RepID=B9YG53_ANAAZ Length = 475 Score = 47.3 bits (111), Expect = 7e-04, Method: Composition-based stats. Identities = 44/294 (14%), Positives = 79/294 (26%), Gaps = 62/294 (21%) Query: 47 IISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDELRGGGG 106 L + L+ + P P L+ LK V LI+D+ Sbjct: 156 FTLRAFLHIIRNFRRHNVFLVTSAPPFLPIAGYLAHLC--LKISYVCLIYDLYPDIAIAL 213 Query: 107 SDSVR----------LA-----TCDMVISHNPQMTKYLSKYMSQ--DKIKDIKIF---DY 146 R L ++ +P M K + + DK+ I + D Sbjct: 214 QVIKRNHWLAGFWRQLNRMMWRKSKGIVVLSPDMKKRVIAICPEVADKVSVIHSWGDPDL 273 Query: 147 LVSSDVEHRDVTD-----KQRGVIYAGNLSR--------HKCSFIYTEGCDFTLFGVNYE 193 +V E + + V+Y+GN+ R + E F G + Sbjct: 274 IVPIAKEINWFAEEHNLVNKFTVLYSGNMGRCHDMDTILETAKQLRNEPIQFVCIGSGAK 333 Query: 194 NKDNPKYLGS-------FDA--QSPEKINLPG-MQFGLIWDGDSVETCSGAFGDYLKFNN 243 K + + F L+ +E+ Sbjct: 334 RKSFIEAVNKSGVTNFLFLPYQDKQVLPYSLTACDLSLVSVEAGMES----------LVA 383 Query: 244 PHKTSLYLSMELPV--FIWDKAALADFIVDNRIGYAVGS-----IKEMQEIVDS 290 P K L+ P+ + L I + G V + + E +++S Sbjct: 384 PSKLYPALAAGRPIAAICSKYSYLRQLIAEGNCGVCVENGDSLALAEFIRLLNS 437 >UniRef50_C7NW83 Glycosyl transferase group 1 n=1 Tax=Halomicrobium mukohataei DSM 12286 RepID=C7NW83_HALMD Length = 416 Score = 47.3 bits (111), Expect = 8e-04, Method: Composition-based stats. Identities = 39/272 (14%), Positives = 75/272 (27%), Gaps = 44/272 (16%) Query: 39 LWGGVVQRIISSVKLSTFLCGLE-NKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHD 97 ++ +L DV+I P + F L + I D Sbjct: 85 FVDRLLYYFTFVAHALFWLFQRRQKYDVIITTSPP-IFTGMVALPFSMLGSLNWILDIRD 143 Query: 98 I-------------DELRGGGGSDSV--RLATCDMVISHNPQMTKYLSKY-MSQDKIKDI 141 + D + L D++ T L + + +I I Sbjct: 144 LWIDVSSDLGFISEDGIITKSSRRYQAATLRQADLITVTTHGTTTQLRERYDFETEISVI 203 Query: 142 KIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRH-------KCSFIYTEGCDFTLFGVNYEN 194 V + V + + + +IY GNL + F + G Sbjct: 204 PN---GVDTSVFTPEPSSNEVELIYTGNLGYGQDLETCIRALHYTESDVRFRIVGDGDLR 260 Query: 195 KDNPKYLGS-FDAQSPEKINLPGMQFGLIWDGD-SVETCSGAFG-------DYLKFNNPH 245 + + + + + GL+ + A G D L++ P Sbjct: 261 PELVELAEKIGVSDQVDFM-------GLVPREQIPQLLGTAAIGVAPLKEQDSLEYAVPT 313 Query: 246 KTSLYLSMELPVFIWDKAALADFIVDNRIGYA 277 K Y + ELPV + + + + ++ G Sbjct: 314 KLYEYWACELPVLALGQGTIEEIVSESGAGVV 345 >UniRef50_A6DJF7 Putative glycosyl transferase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DJF7_9BACT Length = 406 Score = 46.9 bits (110), Expect = 0.001, Method: Composition-based stats. Identities = 31/211 (14%), Positives = 63/211 (29%), Gaps = 32/211 (15%) Query: 104 GGGSDSVRLATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDK--Q 161 + D V + M L +++ IKIF V + + Sbjct: 164 KQMQQIKINQSFDEVFVVSQYMKDELITQGFEER--KIKIFP-PVPLPKRQVPINTYSDE 220 Query: 162 RGVIYAGNLSRHK-------CSFIYTEGCDFTLFGVNYENKDNPKYLGSFDAQSPEKINL 214 +++A + R K + +FG +Y + + Sbjct: 221 NIIVFATQIIRGKGLDCLINALSLVKNDFKLYVFGSGSHK----EYCQQLVKDLNLEDKI 276 Query: 215 PGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKT--------SLYLSMELPVFIWDKAALA 266 F S E S + + P +L LPV +D + Sbjct: 277 IFKGF------VSQEELSNIYASAVMGTVPSVWPEPIATVGLEFLRHGLPVIGFDAGGIK 330 Query: 267 DFIVDNRIGYAVG--SIKEMQEIVDSMTIET 295 D+++D+ G+ + I+ M + +D + + Sbjct: 331 DWLIDDTSGFLIPWMDIQAMADKIDLLLNDK 361 >UniRef50_A1S0Y8 Glycosyl transferase, group 1 n=1 Tax=Thermofilum pendens Hrk 5 RepID=A1S0Y8_THEPD Length = 380 Score = 46.9 bits (110), Expect = 0.001, Method: Composition-based stats. Identities = 44/305 (14%), Positives = 82/305 (26%), Gaps = 56/305 (18%) Query: 61 ENKDVLIFNFPMAKPFWHILSFFHRLL--KFRIVPLIH----DIDELRGGGGSDSVRLAT 114 + DVL P P +S + V + H D R + Sbjct: 88 KQYDVLYI--PAYSPNELTVSLLKKSKALSVPAVAVFHCMLADNVLARLYTPLYIAAFNS 145 Query: 115 CDMVISHNPQMTKYLSKYMS-QDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRH 173 D + N +L + ++KI+ I + + + +++ G L + Sbjct: 146 FDKLHVLNRFQRNFLKSHGIPEEKIEFIPNGVDTSTFQLCRDPSASEDFNIVFVGRLLKD 205 Query: 174 K------------CSFIYTEGCDFTLFGVNYENKD---------NPKYLGSFDAQSPEKI 212 K + FT+ G +D N +LG ++ I Sbjct: 206 KGVDTLLRIIYLINDELNLHDVKFTIVGSGPLEEDIKKLAQKYQNVVFLGYVKHENMPSI 265 Query: 213 NLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDN 272 F L S + G P + LP + D + D Sbjct: 266 YREANLFLL---------PSRSEG------MPLSLLEAQACGLPAVASKIPGVLDIVRDG 310 Query: 273 RIGYAVG--------SIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVLEEVI 324 G V S E + + + Y +++ + I V+ ++ Sbjct: 311 VTGRLVDAEDVRGFVSAIEECYRLWESSPQEYYNLNKKIREY---IVRNYDLEVVVGKIE 367 Query: 325 DDLKT 329 L Sbjct: 368 KMLHE 372 >UniRef50_C1I3K1 Glycosyltransferase n=1 Tax=Clostridium sp. 7_2_43FAA RepID=C1I3K1_9CLOT Length = 369 Score = 46.9 bits (110), Expect = 0.001, Method: Composition-based stats. Identities = 44/305 (14%), Positives = 101/305 (33%), Gaps = 52/305 (17%) Query: 16 FKARKDALDIASDYENISVVNIPL------WGGVVQRIISSVKLSTFLCGLENKDVLIFN 69 KA D I + + ++ + G + R K++ + D+ F+ Sbjct: 28 SKAGYDVSLIINSDHDKTLYGTKIKALDHSNNGRLHRFFKKSKVALDKAMELDADLYHFH 87 Query: 70 FPMAKPFWHILSFFHRLLKFRIVPLIHD-----------IDEL-------RGGGGSDSVR 111 P L + +++ +H+ + + R + R Sbjct: 88 DPELIKLGMAL----KKKGKKVIYDVHEDVPKQILAKSYLGPMWVRKTISRAYNFYEKSR 143 Query: 112 LATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLS 171 D VI+ + ++ + +K+ I D + ++ R+ DK +IY G+++ Sbjct: 144 SENFDAVIAASDELAAKF-NNTNSISVKNFAIRDVIENAKPIKREDNDK-FVIIYVGSIT 201 Query: 172 RHKCSFIYTEGCDFTLFGVNYENKDNPKYLGSFDAQSPEKINLPGMQ------FGLIWDG 225 + + + + + K LG+F+++ ++ + FG + Sbjct: 202 KIRGIKELIQVTEL------FLGKVELWILGTFESEELKEECMSLEGYKYCKYFGALPVK 255 Query: 226 DSVE--------TCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYA 277 D C+ + K + P K Y++ E P+ + D F +G Sbjct: 256 DVYSYIKASDLGMCTLYPTENYKESIPIKVLEYMACEKPLVLSDFEFWKKFF--GNVGKY 313 Query: 278 VGSIK 282 V + Sbjct: 314 VDPLD 318 >UniRef50_A6AFA8 Putative uncharacterized protein n=1 Tax=Vibrio cholerae 623-39 RepID=A6AFA8_VIBCH Length = 366 Score = 46.9 bits (110), Expect = 0.001, Method: Composition-based stats. Identities = 47/333 (14%), Positives = 106/333 (31%), Gaps = 42/333 (12%) Query: 27 SDYENISVVNIPLWGGVVQRIISSVKLSTFLCG--LENKDVLIFNFPMAKPFWHILSFFH 84 + + I +G + ++ + + ++ L+NKD + + + F Sbjct: 45 KNDDYIPFKYKGEYGKKLSSLVGFIIWNVYIFQFLLKNKDKINAVHVVNLDSIIPVVIFR 104 Query: 85 RLLKFRIVPLIHD---------IDELRGGGGSDSVRLATCDMVISHNPQMTKYLSKYMSQ 135 + KF ++ I+D + ++ + D+++ + Q K Sbjct: 105 FIKKFNLIYDIYDCYAESHSLGFFATKFFYQAERLVCKYSDLIVLPHEQRLKQSKIESLF 164 Query: 136 DKIKDIKIFDYLVSSDVEHRDVTDKQRGVI---YAGNLSRHKCSFIYT-------EGCDF 185 K I+ L++S + ++++ +I YAGNL Sbjct: 165 HKTIIIENVP-LINSLKCYGEISNHTNDIINLIYAGNLQSEHRGLENLIKIVPLFPHVRL 223 Query: 186 TLFG---------VNYENKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFG 236 T+ G N DN Y GS I G +C Sbjct: 224 TICGDGELKKYVFENANKHDNIIYKGSISYAELMMEIALSD----IIVGLYYLSCKNHI- 278 Query: 237 DYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAV-GSIKEMQEIVDSMTIET 295 F +P+K +L+ P+ + G+++ ++ E++ + + Sbjct: 279 ----FASPNKYYEHLAFGKPMITTKGTPPGADVERENTGWSIGDHYNDLYELISKINLSD 334 Query: 296 YKQISENTKIISQKIRTGSYFRDVLEEVIDDLK 328 + N + + Y +EE ++ +K Sbjct: 335 IVEKGSNAHSLWISKYSNYYNLK-VEEYMERIK 366 >UniRef50_B0G7U5 Putative uncharacterized protein n=2 Tax=Lachnospiraceae RepID=B0G7U5_9FIRM Length = 353 Score = 46.9 bits (110), Expect = 0.001, Method: Composition-based stats. Identities = 46/291 (15%), Positives = 90/291 (30%), Gaps = 59/291 (20%) Query: 61 ENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIH----D-----IDELRGGGGSDSVR 111 + D++ N P + + ++ ++V H D I R Sbjct: 41 KEADIVHIN--TVFPDSVLAACLAKMQGKKVVYYGHSTMEDFRNSFIGSNRFAPLFKKWI 98 Query: 112 ---LATCDMVISHNPQMTKYLSKYMSQDKI------KDIKIFDYLVSSD-----VEHRDV 157 D+VI+ L Y Q KI D + F E + Sbjct: 99 CFCYGLGDIVITPTEYSKGLLKSYGIQKKIYAVSNGVDTEFFQSRNKEKEKQELHEKYKI 158 Query: 158 TDKQRGVIYAGNLSRHKCSFIYTE------GCDFTLFGVNYE-------------NKDNP 198 ++ V+ AG+L + K + + F FG + +N Sbjct: 159 PMNRKVVVSAGHLIQRKGILDFLKLAKMMPETTFIWFGGGNDSLVTQEVKEAVKKKSENV 218 Query: 199 KYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVF 258 + G +A+ F S E G L+ E+PV Sbjct: 219 IFAGYVEAEELRNAYCGADAFAFF----SYEETEGIV-----------VLEALACEVPVV 263 Query: 259 IWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQISENTKIISQK 309 + D ++I D Y V +++E + + + E +++ N + ++++ Sbjct: 264 LRDIPVYKEWIQDQMQAYKVHNLEEYKHNLMHIFKEDQEELKGNARELAEQ 314 >UniRef50_A5I5D9 Glycosyl transferase, group 1 family n=3 Tax=Clostridium botulinum A RepID=A5I5D9_CLOBH Length = 406 Score = 46.9 bits (110), Expect = 0.001, Method: Composition-based stats. Identities = 53/379 (13%), Positives = 114/379 (30%), Gaps = 54/379 (14%) Query: 2 YFLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLE 61 Y + + R+D+ ++ I + I T+L L Sbjct: 31 YDVTCIYVGRKDSSGITKEGIKYIEVKPCYKVYNALDKIFEYKNFIYERFSYDTYLKILN 90 Query: 62 NK-----DVLIFNFPMAKPFWHILSFFHR-LLKFRIVPLIHD------IDELRGGGGSDS 109 D+ + I + K +IV +H+ +D + + Sbjct: 91 KCKEVKCDLYHLHD---IYLLQICENLKKMYFKPKIVYDVHESYVDIVLDYNKNKRNINK 147 Query: 110 VRLATC------------DMVISHNPQMTKYLSKYMSQDKIKDIKIFD---YLVSSDVEH 154 D +I+ + + KY+ K+ I + YL+ + E Sbjct: 148 YLFYLYLYFWEKVKALKCDFIINVEENINRTFEKYLGDSKVDLIYNYPLEEYLMKNSKEE 207 Query: 155 RDVTDKQRGVIYAGNLSRHKC----------SFIYTEGCDFTLFGV--NYENKDNPKYLG 202 + DK+ ++Y G +++ + Y + G + + + K Sbjct: 208 TKLEDKKYDLVYCGGITKIRGVMNILEAINIGKQYKKDISVIFIGPINDSNLQRDIKNYI 267 Query: 203 SFDAQSPEKINLPGMQFGLIWDGDSVETCS----GAFGDYLKFNNPHKTSLYLSMELPVF 258 + S + F +W+ Y+K P K Y+ M LPV Sbjct: 268 EKNDLSDNVFFKGKISFERVWNYYKESKIGLVPLHDIRKYVK-AIPIKMFEYMIMGLPVI 326 Query: 259 IWDKAALADFI--VDNRIGYAVGSIK---EMQEIVDSMTIET--YKQISENTKIISQKIR 311 + + + + G V +I+ E + + + Y + S N + + I Sbjct: 327 GSNLPHIREVVLNEKYICGEVVNNIENPKEFWHSIYKILSDQELYNKYSINARQSIKYIY 386 Query: 312 TGSYFRDVLEEVIDDLKTR 330 S L + +++ + Sbjct: 387 NWSIMEKKLLSIYNNILDK 405 >UniRef50_D2QRL3 Glycosyl transferase group 1 n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QRL3_9SPHI Length = 375 Score = 46.5 bits (109), Expect = 0.001, Method: Composition-based stats. Identities = 47/320 (14%), Positives = 100/320 (31%), Gaps = 64/320 (20%) Query: 31 NISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLKFR 90 I + +P + V+ R++ + C ++ P PF + RLL + Sbjct: 48 GIRFICLPYFRRVIWRLLITCPFIVLRCIWLRPQLVHVYAPEFLPFAY----VFRLLGAQ 103 Query: 91 IVPLI----HDIDELRGGGGSD--SVRLATCDMVISHNPQMTKYLS--KYMSQDKIKDIK 142 ++ + H L+ D + Q YL ++ + Sbjct: 104 VIYEVQENLHKKLPLKTSNNGALLRQMFRLFDRL----AQRHFYLIFTEHGYLSTYMQLA 159 Query: 143 -----IFDYLVSSDVEH----RDVTDKQRGVIYAGNLS----------RHKCSFIYTEGC 183 +++Y + S +E + + + Y G LS I Sbjct: 160 RPHVVVYNYPLLSFLEPFYTPYNPSSETPSFFYIGLLSFDRAVDTLVDSFAKLGITYPRF 219 Query: 184 DFTLFG------VNYEN-------KDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVET 230 LFG N EN +D+ + G + Q GL ++ Sbjct: 220 IVHLFGRRTFTDTNLENLSGYARIRDHLHFYG-YTDQRLAFPYARDATAGL-----ALLK 273 Query: 231 CSGAFGD-YLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVG--SIKEMQEI 287 G + + Y K Y+++ LPV D D + + G+ V + ++ + Sbjct: 274 PVGDYPESYTT-----KLFEYMALGLPVITSDFPLYRDIVDRHHCGFCVSPYNAAQVADS 328 Query: 288 VDSMTI--ETYKQISENTKI 305 + + + +++ + + Sbjct: 329 LAYLIENPDEARRMGQRGRQ 348 >UniRef50_Q2N6D5 Glycosyl transferase, group 1 family protein n=1 Tax=Erythrobacter litoralis HTCC2594 RepID=Q2N6D5_ERYLH Length = 379 Score = 46.5 bits (109), Expect = 0.001, Method: Composition-based stats. Identities = 31/251 (12%), Positives = 76/251 (30%), Gaps = 44/251 (17%) Query: 90 RIVPLIHDIDELR---------GGGGSDSVRLATCDMVISHNPQMTKYLSK-YMSQDKIK 139 R++ H+++ R ++ + D I + + + + Y + Sbjct: 121 RLIYDAHELETERAGWSGGLKKLARLAERALIGFADETIVVSGAIADWYANTYGMEKPHL 180 Query: 140 DIKIFDYL----VSSDVEHRDVTDKQRGVIYA--GNLSRHKCSFI-------YTEGCDFT 186 + + + + D ++ +I+ G L + + E Sbjct: 181 IRNMPEAIPGTTSNGDGLRAELGLGGDDIIFVYLGRLGHGRGIPLLVAAFRNVGEDRHLV 240 Query: 187 LFGVNY-------ENKDNPKYLGSFDAQSPEKINLPGM---QFGLIWDGDSVETCSGAFG 236 L G + D S + I F +I D Sbjct: 241 LLGDGPMQGDLLKDTADAANIHVLEPVPSEQVIGYVASADIGFSMIEDVALS-------- 292 Query: 237 DYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGS-IKEMQEIVDSMTIET 295 ++ P+K L V + + +A+F+ G+ V + E+ ++V+S+T Sbjct: 293 --YRYCLPNKLFESRRAGLAVIVSNLVEMANFVRAYGGGWIVENDPDELAKLVNSLTRVD 350 Query: 296 YKQISENTKII 306 + + + + Sbjct: 351 IAAVKKEARPV 361 >UniRef50_C1FUA9 Glycosyl transferase, group 1 family n=1 Tax=Clostridium botulinum A2 str. Kyoto RepID=C1FUA9_CLOBJ Length = 394 Score = 46.5 bits (109), Expect = 0.001, Method: Composition-based stats. Identities = 49/349 (14%), Positives = 107/349 (30%), Gaps = 62/349 (17%) Query: 27 SDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRL 86 + +V ++ + ++ LE F+ L Sbjct: 58 KGKKYQNVFRRSIFYNETFSFETYEEIFNIAKKLECN-AYHFHDLYLNLIGKRLKNLS-- 114 Query: 87 LKFRIVPLIHD-----IDELRGGGGSDSVRLA---------------TCDMVISHNPQMT 126 K +++ +H+ I E R G + D +I+ + Sbjct: 115 FKPKVIYDVHECYPEQIREYRKYSGLKKIINNIYSYFIRFWEVYCCKNYDYIITTESSVN 174 Query: 127 KYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIYTE----- 181 K Y+ ++K+ I++Y E D +K+ IY G ++R + + E Sbjct: 175 KKFRSYIGKNKVDI--IYNYANFDVKEFLDFNEKEFDAIYCGGINRIRSAMELLEVANIA 232 Query: 182 -----GCDFTLFGV-----------NYENKDNPKYLGSFD---AQSPEKINLPGMQFGLI 222 L G N+ K+N + + + GL Sbjct: 233 KNHMPDFKLLLLGPITGQDLKRDMKNFIEKNNLENNVILKDRVPFPEVEKYYAKSKIGL- 291 Query: 223 WDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIK 282 ++ S F ++ KT Y++ LP+ + +A +I + G V + Sbjct: 292 ----AIFKPSLTFKKTVQI----KTFEYMAFGLPMVGSNFGNIAKYIKEANTGITVNPLS 343 Query: 283 --EMQEIVDSMTIET--YKQISENTKIISQKIRTGSYFRDVLEEVIDDL 327 E+ + + + + Y S+N + + + L + + + Sbjct: 344 PQEIWKAIHKILQDKNSYDVYSKNGINAVNEKYNWNIMKRELLRIYNKI 392 >UniRef50_B1C0Q5 Putative uncharacterized protein n=1 Tax=Clostridium spiroforme DSM 1552 RepID=B1C0Q5_9FIRM Length = 423 Score = 46.5 bits (109), Expect = 0.001, Method: Composition-based stats. Identities = 53/356 (14%), Positives = 104/356 (29%), Gaps = 66/356 (18%) Query: 30 ENISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLKF 89 + + + R + + +I + + + L Sbjct: 79 RFKLLPENSVVIKRIIRYLLLNIKQYRTARNLSNCDVILAGSTPPTQGIVAALLGKKLCL 138 Query: 90 RIVPLIHDIDE---------------LRGGGGSDSVRLATCDMVISHNPQMTKYLSKYMS 134 +V ++ DI + G + D +I + L Sbjct: 139 PVVYIVQDIFPDSLVSTGISSEKSLFFKIGKLIEKYTYKHADKIIVICDEFKHNLVDKGV 198 Query: 135 -QDKIKDIKIF---DYLVSSD------VEHRDVTDKQRGVIYAGNLSRHK---------C 175 +KIK I + D ++ E ++ V YAGN+ + + Sbjct: 199 LAEKIKVIYNWINADEVIPISRNSNKLFEEYNLDKNNFFVTYAGNMGKAQDIDTIINVAK 258 Query: 176 SFIYTEGCDFTLFGV-----------NYENKDNPKYLGSFDAQSPEKINLPGMQFGLIWD 224 + F LFG N E +N L P+ G + Sbjct: 259 IMQEYKDIKFILFGSGDGKKYYENLINSEKINNITIL----PIQPQNRVSEVYSLGNV-- 312 Query: 225 GDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFI--WDKAALADFIVDNRIGYAVGS-- 280 S+ +C G K P KT ++ V + L + I D++ G A S Sbjct: 313 --SIVSCKKGAG---KTALPSKTWSIMATATAVITNFDKDSELNNIINDSKSGIACESGN 367 Query: 281 IKEMQEIVDSMTIETY--KQISENTKIISQK-IRTGSYFRD---VLEEVIDDLKTR 330 + E++ + + + ++ N + ++ + + + VL E I K R Sbjct: 368 VMEIKHAILKLYDDRALCSKMGNNGREYIKRNLDSNMCTKKYIQVLNEAISIKKDR 423 >UniRef50_Q20YQ8 Glycosyl transferase, group 1 n=1 Tax=Rhodopseudomonas palustris BisB18 RepID=Q20YQ8_RHOPB Length = 398 Score = 46.5 bits (109), Expect = 0.001, Method: Composition-based stats. Identities = 29/147 (19%), Positives = 43/147 (29%), Gaps = 32/147 (21%) Query: 149 SSDVEHRDVTDKQRGVIYAGNLSRHK-------CSFIYTEGCDFTLFGVNYE-------- 193 + D+ + + Y G +S K + F G E Sbjct: 209 NIDLPPELAAIPRPRLGYVGVISDFKIDLELLQTLAVAHPDWHFVFIGDEREGQHSDVVT 268 Query: 194 ---NKDNPKYLGSFDAQSPEKINLPGMQF--GLIWDGDSVETCSGAFGDYLKFNNPHKTS 248 N +LG +S + + F GL+ DY + P K Sbjct: 269 RMAQLSNVHFLGW---RSYQDLPRYLAGFDVGLLPQ---------LINDYTRAMFPMKFF 316 Query: 249 LYLSMELPVFIWDKAALADFIVDNRIG 275 YL+ LPV AL D + IG Sbjct: 317 EYLAAGLPVVATPLPALRDLAAVHGIG 343 >UniRef50_C1DVV7 Putative glycosyl transferase, group 1 n=2 Tax=Sulfurihydrogenibium RepID=C1DVV7_SULAA Length = 380 Score = 46.5 bits (109), Expect = 0.001, Method: Composition-based stats. Identities = 46/321 (14%), Positives = 108/321 (33%), Gaps = 45/321 (14%) Query: 18 ARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFW 77 A +D+ + I++ + L R+I + + EN DV+ P P Sbjct: 42 AEEDSCYYKEKTKVITLKSPQLLFSKNYRLIINPIKLAKVIEEENPDVVEIGSPFLIPS- 100 Query: 78 HILSFFHRLLKFRIVPLIH-DIDE-----------LR-GGGGSDSVRLATCDMVISHNPQ 124 ++++ F++V H D+++ +R ++ D+VI+ + Sbjct: 101 -VVNYLKEKSGFKVVGFFHSDLEKVSTNLLKGKNIIRPIIKKYVYKTYSSMDLVIAPSNY 159 Query: 125 MTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIYTEGCD 184 + YL I+++++ + + DV DK + YA + + K IY Sbjct: 160 IKNYLR----SINIQNVEVVYHGIDLDVFTNTHVDKSLRLHYA--IEKDKVVLIYVGRFS 213 Query: 185 --------FTLF-GVNYENKDNPKYLGSF-DAQSPEKINLPGMQFGLIWDGDSVETCSGA 234 +F +N+ +L E + F ++ + E + Sbjct: 214 PDKNFSHLLKIFKTLNFIKPKKFHFLLVGDGPLKEEVYDQLESDFTVVDYIEDREEIAKL 273 Query: 235 FGDYLKFNNPHKTSLY-------LSMELPVFIWDKAALADFIVDNRIGYAVGSIK----- 282 + F K+ + + LPV + + + + + S+ Sbjct: 274 YKISDIFVTASKSDTFGISLIEAQACGLPVVAYKENSFPEICYYKDL-LCTDSLDFIYKV 332 Query: 283 -EMQEIVDSMTIETYKQISEN 302 + E ++ + + +Q + Sbjct: 333 VNLSESLERVEKDKIQQFVKK 353 >UniRef50_Q1AYI8 Glycosyl transferase, group 1 n=1 Tax=Rubrobacter xylanophilus DSM 9941 RepID=Q1AYI8_RUBXD Length = 361 Score = 46.5 bits (109), Expect = 0.001, Method: Composition-based stats. Identities = 28/175 (16%), Positives = 62/175 (35%), Gaps = 14/175 (8%) Query: 14 AGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLE--NKDVLIFNFP 71 A +A + I + + ++ + R + ++LS + + D++ +P Sbjct: 28 ALSEAGMNVE-ILTTTGPVPDDSLGVDIHASVRNWNLLRLSEAMREIRAIRPDIVHLQYP 86 Query: 72 MAKPFWHILSFFHRLLKFRIVPLIHD---IDELRGGGGSDSVRLATCDMVISHNPQMTKY 128 A+ +L L + +V IH+ + LR L D V++ + Sbjct: 87 TAEYKAGLLPQALVLSRVPLVVTIHEASYVHILRRISL--YPFLILADRVVATTRFEADF 144 Query: 129 LSKY--MSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIYTE 181 L + K+ + I SS + + ++Y G ++ K + E Sbjct: 145 LVGLYPSVRKKLSVVSI----GSSIPAGPVLVRDRNTIMYFGLIAPRKGIEEFLE 195 >UniRef50_B9YBN8 Putative uncharacterized protein n=1 Tax=Holdemania filiformis DSM 12042 RepID=B9YBN8_9FIRM Length = 355 Score = 46.1 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 36/325 (11%), Positives = 95/325 (29%), Gaps = 56/325 (17%) Query: 35 VNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPL 94 + P ++ + + + L K +I + Sbjct: 58 PDRPGITKKTFDLLMLFSIVRHIQKERFDAIYF-----ESLHTWNLPIMMMSGKVKIFQV 112 Query: 95 IHDIDELRGGGG------SDSVRLATCDMVISHNPQMTKYLSKY--MSQDKIKDIKIFDY 146 IH++ G + + D ++ N + + + +S D++K ++++ Sbjct: 113 IHEVIPHEGDSQVKMINLMNKAVVKFADTIVLRNKKYIQTMIDRYDISPDRVKYLELW-- 170 Query: 147 LVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIYTE------GCDFTLFG----------V 190 V++ G ++ +K E F + G Sbjct: 171 ---RRYPEYIAPVHSGRVLFFGRINPYKGVDNLLEIVRLCPEIQFNVIGRVDPQMQDVVD 227 Query: 191 NYENKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLY 250 + N K + + + ++ + + SG D K++ Sbjct: 228 QLAKEMNVKLNNDYVTDEEMRKAFINCDWVIVPYNSASQ--SGIIIDAYKYS-------- 277 Query: 251 LSMELPVFIWDKAALADFIVDNRIGYAV-----GSIKEMQEIVDSMTIETYKQISENTKI 305 PV + A+++ + +++ GY V ++ + ++ I+ Y +S Sbjct: 278 ----RPVIAFVVGAISEQVDNHKSGYLVRAGDNKKFADILKKAMNLNIDEYNAMSRYAYQ 333 Query: 306 ISQKIRTGSYFRDVLEEVIDDLKTR 330 K +E + L+ + Sbjct: 334 YGSKK---YATPGAVERFVKLLEEK 355 >UniRef50_B4BR56 Glycosyl transferase group 1 n=1 Tax=Geobacillus sp. G11MC16 RepID=B4BR56_9BACI Length = 401 Score = 46.1 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 36/314 (11%), Positives = 98/314 (31%), Gaps = 42/314 (13%) Query: 33 SVVNIPLWGGVVQRIISSVKLST-----FLCGLENKDVLIFNFPMAKPFWHILSFFHRLL 87 SV N + R+I +++ L DV+ + P R Sbjct: 74 SVKNRKYARTIWNRLIYYLEIMLRFILFILTDQRKYDVVFVSSPP-IFIGFTGMLAKRKY 132 Query: 88 KFRIVPLIHDI--DELRGGGGS------------DSVRLATCDMVISHNPQMTKYLSKYM 133 + ++V + D+ + L+G + D ++ ++ +++ Sbjct: 133 RAKLVLDVRDLWPESLKGVNVFNNRLILAVFQWLEKKLYNESDHIVINSLG----FLEHI 188 Query: 134 SQDKIKDIKIFDYLVSSDVEHRDV-----TDKQRGVIYAGNLSRHKCSFIYTE------- 181 + ++ ++ E + + V+YAGN+ + S + E Sbjct: 189 CTKSSVPAENVSFIPNAAREVEIIDRHSRKPPRFSVVYAGNIGLAQDSLLLMELAEELRK 248 Query: 182 -GCDFTLFGVNYENKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETC-SGAFGDYL 239 ++ G K ++ + + + + + L + + + Sbjct: 249 HEITISVVGYGLRRKQFADFVKVNNLNNVKIMPALSRKECLEFIATHQAAIVTLNSSEVF 308 Query: 240 KFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIK--EMQEIVDSMTI--ET 295 K P K Y++ +P+ I ++G+ + EM + + + Sbjct: 309 KTVLPGKIIDYMTCGVPIVAAVSGYSKQVIESEQVGFVSENRDRQEMINYILYLKNNPDL 368 Query: 296 YKQISENTKIISQK 309 ++++ N ++ Sbjct: 369 AEEMANNCTNYVKR 382 >UniRef50_A0PZ08 Putative uncharacterized protein n=1 Tax=Clostridium novyi NT RepID=A0PZ08_CLONN Length = 374 Score = 46.1 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 39/311 (12%), Positives = 96/311 (30%), Gaps = 64/311 (20%) Query: 29 YENISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLK 88 + +N L + + +L +L + +++ + F ++L + + K Sbjct: 35 FNKKIFLNYKLGNIIKRFFSVKKELINYLKDEKPTTIIVSGY-FIGLFHNVLKRYIKRNK 93 Query: 89 FRIVPLIH--------------------DIDELRGGGGSDSVRLATCDMVISHNPQMTKY 128 +I+ +H I L + L+ + + + +M KY Sbjct: 94 CKIIYDMHGCVEEAIEYVHPRIKVKFLSKIYYL-FTKHQEKKLLSISNGIFIVSHEMEKY 152 Query: 129 LSKYMSQ---------DKIKDIKIF---DYLVSSDVEHR---DVTDKQRGVIYAGNLSRH 173 + IF D ++S ++ R + + + +Y+G +S+ Sbjct: 153 VKIKYPNIANKLQFYYVPCGIDSIFRSVDERLNSRIKWRKKLSLNNDETVFVYSGGMSKW 212 Query: 174 KCSFIYTE----------GCDFTLFGVNYENKDNP---KYLGSFDAQSPEKINLPG---- 216 + +F E +N Y + +S + ++ Sbjct: 213 QKINEIINLYNKLSKDIPNSRLCIFTGEVEKVNNIVDPLYKDKYIIKSLKSKDVINALTA 272 Query: 217 MQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAAL-ADFIVDNRIG 275 FG+I D++ P+K S Y+ L + I + + + +G Sbjct: 273 CDFGIILRDDNLTNN---------VAFPNKVSEYIEAGLNIIISESLRTPKEIVTKYNLG 323 Query: 276 YAVGSIKEMQE 286 + + + Sbjct: 324 IGIKNDLNIDN 334 >UniRef50_A5D386 Glycosyltransferase n=1 Tax=Pelotomaculum thermopropionicum SI RepID=A5D386_PELTS Length = 400 Score = 46.1 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 12/51 (23%), Positives = 21/51 (41%), Gaps = 2/51 (3%) Query: 246 KTSLYLSMELPVFIWDKAALADFIVDNRIGYAV--GSIKEMQEIVDSMTIE 294 K YL+ LPV I D LAD + G ++++ + ++ Sbjct: 306 KVFSYLACGLPVVIPDIPDLADVVRRAGCGLVAAPDRLEDLAAALKAVLDN 356 >UniRef50_D2F291 Glycosyl transferase n=1 Tax=Bacteroides sp. D20 RepID=D2F291_9BACE Length = 407 Score = 46.1 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 51/324 (15%), Positives = 103/324 (31%), Gaps = 70/324 (21%) Query: 59 GLENKDVLIFNFPMAKPFWHILS--FFHRLLKFRIVPLIHDIDELR-------GGGGSDS 109 ++ D +I + P + +L +L K L+HDI G Sbjct: 98 QIKEGDEVII---LTNPAFFMLFMPLVRKLTKCHYHILVHDIFPENLVSLGNLGRKSFLY 154 Query: 110 VRLAT--------CDMVISHNPQMTKYL----SKYMSQDKIKDIKIFDYLVSSDVEHRDV 157 L D IS M + + I + D + ++ Sbjct: 155 AFLKKIFDWAYGTADSCISIGEDMRQVVLRKTKGQNFLRLITNWADVDEVKPFPKVDTNL 214 Query: 158 TD-------KQRGVIYAGNLSRHKCS--------FIYTEGCDFTLFGVN---------YE 193 + +AGNL + + + + C F G E Sbjct: 215 FKEMESSLTGKIVFQFAGNLGKAQGLDNLMNAIDMVKNKDCKFLFVGAGAKRNDIESFAE 274 Query: 194 NKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSM 253 +N Y G F ++S + L G++ G +G P K+ ++ Sbjct: 275 RHENTVYAG-FRSRSNQNDFLNACDIGIV------TLADGMYG----LGVPSKSYNIMAT 323 Query: 254 ELPV--FIWDKAALADFIVDNRIGYAV--GSIKEMQEIVDSMTIE--TYKQISENTKIIS 307 P+ + +A I IG+ V + +++ ++ + E +++ N ++ Sbjct: 324 GKPILYIGESDSEIALCIKRYNIGWVVEPNNPSLLKDKIEDILREPSEIQEMGANALTVA 383 Query: 308 QKIRTGSYFRD-VLEEVIDDLKTR 330 + + ++ VLE+ D +K R Sbjct: 384 NE----FFAKNVVLEQYHDFIKKR 403 >UniRef50_A9BJB8 Glycosyl transferase group 1 n=1 Tax=Petrotoga mobilis SJ95 RepID=A9BJB8_PETMO Length = 385 Score = 46.1 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 40/311 (12%), Positives = 95/311 (30%), Gaps = 42/311 (13%) Query: 33 SVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFW-HILSFFHRLLKFRI 91 + QRI F N D++ + P + ++S +L Sbjct: 55 EFPAVKFLFEKEQRIALPFSPEIFKLKELNLDIIHSHDPFSMGILARVVSRMLKLKHVAT 114 Query: 92 -----VPLIHDIDEL-RGGGGSDSVRLA----TCDMVISHNPQMTKYLSKYMSQDKIKD- 140 +H + + R + D +I+ + + L +Y + Sbjct: 115 HHTMYDYYLHYLPLIVRPQPEFVQRLIKNWCLKTDKIIAPTDNIKETLVEYGVPSEHIVT 174 Query: 141 ------IKIFDYLVSSDV--EHRDVTDKQRGVIYAGNLSRHKCSFIYTEGCDFTLFGVNY 192 + FD ++ D+ E+ + + R +++ G L + K +F Sbjct: 175 IPTGIDLASFDKPINWDLKKEYPQIKEDDRILLFVGRLGKEKNIS-----FLLKVFKKVL 229 Query: 193 ENKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGD------SVETCSGAFGDYLKFNNPHK 246 ++ K++ E++ M + + + YL + Sbjct: 230 MDETKVKFVIVGGGPEKEELEKLAMDLNIAENVIFTGPQPREKVIDAYKQAYLFIFASYT 289 Query: 247 ------TSLYLSMELPVFIWDKAALADFI-VDNRIGYAVGSIKEMQ---EIVDSMTI-ET 295 ++ PV K + D + ++ G + + E EI+ + T Sbjct: 290 ETQGLVILESMAAGTPVVALGKLGVYDILSQEDAGGIMIKELNEDDFSHEILKVLKESST 349 Query: 296 YKQISENTKII 306 Y+++ + +I Sbjct: 350 YEELKKKAEIF 360 >UniRef50_B1YL51 Glycosyl transferase group 1 n=1 Tax=Exiguobacterium sibiricum 255-15 RepID=B1YL51_EXIS2 Length = 405 Score = 46.1 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 32/233 (13%), Positives = 71/233 (30%), Gaps = 48/233 (20%) Query: 81 SFFHRLLKFRIVPLIHDI--DEL--------RGGGGS----DSVRLATCDMVISHNPQMT 126 L K ++ + D+ + L R + D ++ ++ Sbjct: 122 VVAKHLKKAPLILDVRDLWPESLVGVGITKSRLLLAPLYWLEKWMYRQADQIVINSEGFR 181 Query: 127 KYLSKYMS-QDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIY------ 179 Y+ K +KI I + + R +Q V+Y GN+ + F+ Sbjct: 182 SYIEKKGIAPEKIHYIPN-SIEENEWLIKRRKVSEQVRVVYTGNIGLAQDVFLLLDVAEQ 240 Query: 180 ---TEGCDFTLFGVNYENKD-----------NPKYLGSFD-AQSPEKINLPGMQFGLIWD 224 + +F + G Y + N ++ + + +++ + F + + Sbjct: 241 LKEDKHIEFHVVGYGYHKEKFEAHVLERGLTNIHFMNAMPRWDALKQLAKSDIAFATLVE 300 Query: 225 GDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYA 277 + +T P K Y++M + A I D G+ Sbjct: 301 STAFDTV-----------TPGKIIDYMAMGCAIVGAVSGHAAKVIEDAGAGFV 342 >UniRef50_A5N237 Predicted glycosyltransferase n=2 Tax=Clostridium kluyveri RepID=A5N237_CLOK5 Length = 373 Score = 46.1 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 37/313 (11%), Positives = 84/313 (26%), Gaps = 60/313 (19%) Query: 28 DYENISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLL 87 +Y + D L + ++ L Sbjct: 63 EYGTGYKQIFKFLNFKEDVYKYLKDKDFQALHCHDFDGLFIGY-----------NINKRL 111 Query: 88 KFRIVPLIHDIDELRGGGGS--------------DSVRLATCDMVISHNPQMTKYLSKYM 133 K ++ HD+ + + + D I P+M + K Sbjct: 112 KLKLTYDEHDLFYMYFYNRKGLLNKIIYHFIILLERHMVKKADTHIVVTPKMKEAYKK-- 169 Query: 134 SQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGN---LSRHKCSFI----YTEGCDFT 186 I + Y + + +D + + G+ K Y + Sbjct: 170 ISKNIYIVNNAPYKSLFNHIEK-TSDNLLRIGFIGSVRYYDEIKALIDAAQKYDKAVKVI 228 Query: 187 LFGVNYENKD---------NPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGD 237 + G + N + G+++ E++ + T + GD Sbjct: 229 ICGWGIYAEQLAGYSKKFSNVEIKGAYNISELEELYK-----------NIDITYAFYPGD 277 Query: 238 YLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVG--SI-KEMQEIVDSMTIE 294 + P+K + E P+ + N GY + ++ +E++ I++ + + Sbjct: 278 TATISMPNKFYESIITETPIIANKVTEFGHEVWKNNFGYGIEGKNLKEEIEHIIEKLLKD 337 Query: 295 --TYKQISENTKI 305 I EN + Sbjct: 338 PAEKNSIIENMRK 350 >UniRef50_A3TY33 Putative uncharacterized protein n=1 Tax=Oceanicola batsensis HTCC2597 RepID=A3TY33_9RHOB Length = 389 Score = 46.1 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 47/304 (15%), Positives = 87/304 (28%), Gaps = 60/304 (19%) Query: 21 DALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLI-FNFPMAKPFWHI 79 D + + + L V+ ++ E DV+ +P Sbjct: 61 DVARVWVFAREKRSILLRLANVVIYCARLFFRILR-----ERPDVVTASTYPPVVAAM-S 114 Query: 80 LSFFHRLLKFRIVPLIHDIDE------------------LRGGGGSDSVRLATCDMVISH 121 + +L+ R V + DI LR D++ L +++ Sbjct: 115 AALAAKLVGARFVYHLQDIHPEVSRLSGSALGRFPVFGLLRWL---DTLTLRMAARIVTL 171 Query: 122 NPQMTKYLSKYMSQ--DKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKC---- 175 + M + L + ++ KI+ I D + VI+AGNL R++ Sbjct: 172 SDDMAETLRQRDARLAGKIRIINNLSLDEGRAPVPAPREDGRFRVIFAGNLGRYQDLPLV 231 Query: 176 ------SFIYTEGCDFTLFGVNYENKDNPKYLGSFD--------AQSPEKINLPGMQFGL 221 F + G ++ + G + L G+ Sbjct: 232 AAGIARLFDRHPELELFFLGNGALERELKESWGDHPQVRFHPFVPFDEARAMLAESDLGI 291 Query: 222 IWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPV--FIWDKAALADFIVDNRIGYAVG 279 + S+ A P K Y S+ LPV + + +A + G V Sbjct: 292 V----SIMPGLSA------VAYPSKLLTYQSLGLPVLAIVDPDSHIARDLAATGAGVVVR 341 Query: 280 SIKE 283 E Sbjct: 342 DRSE 345 >UniRef50_C5CAF6 Putative uncharacterized protein n=1 Tax=Micrococcus luteus NCTC 2665 RepID=C5CAF6_MICLC Length = 710 Score = 45.7 bits (107), Expect = 0.002, Method: Composition-based stats. Identities = 20/70 (28%), Positives = 32/70 (45%), Gaps = 3/70 (4%) Query: 221 LIWDGDSVETCSGAF---GDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYA 277 L W + +T G G+Y++F+ P+KT + PV + +ADF+ NR+G Sbjct: 606 LEWYVPATKTVVGLVLLGGEYVRFSFPYKTMSLIERGYPVLCFADMGIADFLERNRVGLG 665 Query: 278 VGSIKEMQEI 287 V E Sbjct: 666 VARSSEAIRA 675 >UniRef50_C1FUA3 Glycosyl transferase, group 1 family n=1 Tax=Clostridium botulinum A2 str. Kyoto RepID=C1FUA3_CLOBJ Length = 392 Score = 45.7 bits (107), Expect = 0.002, Method: Composition-based stats. Identities = 48/323 (14%), Positives = 88/323 (27%), Gaps = 59/323 (18%) Query: 19 RKDALDIASDYENISVVNIPLWGGVVQRI---ISSVKLSTFLCGLENKDVLIFN----FP 71 + D + + + + L I S +++ + +I+ +P Sbjct: 70 KFDIITNLNMSTVNKLAKVILKDNYDIYYLNGIYSPQVAKIIKNKNKNKKVIWQLQESYP 129 Query: 72 MAKPFWHILSFFHRLLKFRIVPLIHDIDELRGGGGSDSVRLATCDMVISHNPQMTKYLSK 131 + F + + D +I + + KY Sbjct: 130 DYIRDYISTQSFIKNFNKYLYSF--------YINLYQKYYSTLFDYIIVTDDSIKKYF-N 180 Query: 132 YMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRH----------KCSFIYTE 181 + DKI I Y SS + K+ +IY G ++ K Sbjct: 181 PIINDKIVAI----YNYSSLEAYGTNVSKEYDLIYCGGITTARGAMSILKAVKIGKEIKS 236 Query: 182 GCDFTLFGVNYENK---------------DNPKYLGSFDAQSPEKINLPGMQFGLIWDGD 226 G E N ++GS L + GL Sbjct: 237 DIKMVFVGPVSEKDLRTKMDRYIMKNNLGKNIFFIGSISFDKIGS-YLAKSKIGL----- 290 Query: 227 SVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIK--EM 284 Y K N K Y+ LP+ D + F+ ++ G V E+ Sbjct: 291 ---APLFPISKYKK-NISMKIFEYMQYGLPIVGSDFGPIKSFLEESNSGICVNPEDGTEI 346 Query: 285 QEIVDSMTIET--YKQISENTKI 305 + + S+ + Y + S+N K Sbjct: 347 WKAIKSILEDEKLYMKYSQNGKK 369 >UniRef50_B3EBP3 Glycosyl transferase family 2 n=1 Tax=Geobacter lovleyi SZ RepID=B3EBP3_GEOLS Length = 669 Score = 45.7 bits (107), Expect = 0.002, Method: Composition-based stats. Identities = 11/61 (18%), Positives = 29/61 (47%) Query: 246 KTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQISENTKI 305 K + YL LP+ + + ++ + +G+ VGS+ + ++ +++ +Q N + Sbjct: 316 KIATYLQYGLPIVVNEIGEMSRHVRQFGLGWVVGSVTDTGRVLANLSRSDLEQSGLNAEQ 375 Query: 306 I 306 Sbjct: 376 F 376 >UniRef50_Q5V5W8 Glycosyltransferase n=1 Tax=Haloarcula marismortui RepID=Q5V5W8_HALMA Length = 416 Score = 45.7 bits (107), Expect = 0.002, Method: Composition-based stats. Identities = 27/171 (15%), Positives = 52/171 (30%), Gaps = 43/171 (25%) Query: 156 DVTDKQRGVIYAGNLSRHKCSF--------IYTEGCDFTLFGVNYE------------NK 195 + ++Y GN+ + + ++ L G E + Sbjct: 215 TPENNDSRIVYTGNIGSAQALEPCIRAMQHVSSDNALLQLVGDGDEVSRLKSVTERLGLE 274 Query: 196 DNPKYLGSFDAQSPEKINLPGMQ-FGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSME 254 D +++G + GL DS E L + P K Y++ Sbjct: 275 DRVEFVGL--VDRERIPEILDSATVGLAPIKDSPE---------LDYAIPTKLYEYMACS 323 Query: 255 LPVFIWDKAALADFIVDNRIGY-----------AVGSIKEMQEIVDSMTIE 294 LPV + + + F D +G A+ ++ E E +M + Sbjct: 324 LPVVVTGRGEIKRFTTDTDVGIHTEPDPESIAVAIETLLENPEKRVAMGQD 374 >UniRef50_A9KI03 Glycosyl transferase group 1 n=1 Tax=Clostridium phytofermentans ISDg RepID=A9KI03_CLOPH Length = 418 Score = 45.7 bits (107), Expect = 0.003, Method: Composition-based stats. Identities = 28/264 (10%), Positives = 69/264 (26%), Gaps = 45/264 (17%) Query: 31 NISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLL-KF 89 N+ +N+P+ + + + ++ + M F HIL + R+ Sbjct: 89 NVGFLNLPIIKNFSRYHSLKTYFKKWALNKSGETKIVIAYAMTFTFTHILRYVKRINSNI 148 Query: 90 RIVPLIHDIDELR-----------GGGGSDSVRL----ATCDMVISHNPQMTKYLSKYMS 134 ++ D+ + + + D + M L Sbjct: 149 ITCLIVPDLPQYMNLTYNKKVIYTLIKNIEINLIKSDTNYIDSYVLLTEYMRDAL---NI 205 Query: 135 QDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSR--------HKCSFIYTEGCDFT 186 I+ + +V + + ++Y+G L+ + + Sbjct: 206 NVPYVVIEGISTNLFDNVNGVPEDNGIKTILYSGGLNEKYGVIKLIQAFEKLQEKNYQLI 265 Query: 187 LFGVNYEN---------KDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGD 237 + G + G + + + + Sbjct: 266 ICGSGDAEAYIKKAATRDKRIIFKGLLKREEVLALQKS---------ATVLVNPRPNNEE 316 Query: 238 YLKFNNPHKTSLYLSMELPVFIWD 261 Y K++ P K YLS +P+ + Sbjct: 317 YTKYSFPSKNLEYLSSGIPLISYR 340 >UniRef50_B5EGU3 Putative uncharacterized protein n=1 Tax=Geobacter bemidjiensis Bem RepID=B5EGU3_GEOBB Length = 414 Score = 45.3 bits (106), Expect = 0.003, Method: Composition-based stats. Identities = 13/60 (21%), Positives = 24/60 (40%), Gaps = 1/60 (1%) Query: 246 KTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQISENTKI 305 K +LYL +PV + + + R G + + E+ V+ + Y+ EN Sbjct: 309 KMALYLQSGVPVIAYANESYELLMEHYRCGELIRDMSELPAAVERI-EADYEGYRENALS 367 >UniRef50_A6TJS0 Glycosyl transferase, group 1 n=1 Tax=Alkaliphilus metalliredigens QYMF RepID=A6TJS0_ALKMQ Length = 405 Score = 45.3 bits (106), Expect = 0.003, Method: Composition-based stats. Identities = 33/305 (10%), Positives = 84/305 (27%), Gaps = 51/305 (16%) Query: 48 ISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDI--------- 98 + + ++ D++I + P ++ K V I D+ Sbjct: 91 FMFSSVFYSMKKIDKPDIIITSSPT-FFSIFSGYWYSLRKKADFVLEIRDLWPAAMIELG 149 Query: 99 ---DEL--RGGGGSDSVRLATCDMVISHNPQMTKYLSKYMSQ-DKIKDIK-------IFD 145 + R + +I + DK+ I + Sbjct: 150 VMKEGFITRVLEKMELFFYRKSKKLIMVTQSFKDNVVNRGISGDKVHVITNGVNQDLFYP 209 Query: 146 YLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSF-IYTEGCDFTLFGVNYENKDNPKYLGSF 204 + ++ ++ + + V Y G + I ++ N +++ Sbjct: 210 KEKNQELINKHNLEDKFVVSYVGAHGISQNLSTILEVAKKLRIY-------KNIEFVFVG 262 Query: 205 DAQSPEKINLPGMQFGL----IWDGDSVETCSGAFG------------DYLKFNNPHKTS 248 + +K+ + L D E + + K P K Sbjct: 263 EGAEKDKLKQILREEELKNVQFIDAQPKELIPEFYNLSDLCLIPLKNIELFKTFIPSKMF 322 Query: 249 LYLSMELPVFIWDKAALADFIVDNRIGYAV--GSIKEMQEIVDSM--TIETYKQISENTK 304 ++ +P+ + A + D++ V + E+ ++ + E Y Q+ + Sbjct: 323 EIMACGVPIVASLEGEAAQILQDSKAAVVVKPDNSDEIAAAIEELINDKEKYNQMKASGP 382 Query: 305 IISQK 309 +K Sbjct: 383 EFVEK 387 >UniRef50_Q7M7N2 CAPSULAR POLYSACCHARIDE SYNTHESIS ENZYME CAP5I , n=2 Tax=Wolinella succinogenes RepID=Q7M7N2_WOLSU Length = 385 Score = 45.3 bits (106), Expect = 0.003, Method: Composition-based stats. Identities = 17/95 (17%), Positives = 35/95 (36%), Gaps = 3/95 (3%) Query: 234 AFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIK--EMQEIVDSM 291 G K+ P+K ++ L + I +A + +G M + ++ + Sbjct: 286 PTGFNTKYALPNKFFEFIQARLAIAIGPSLEMARIVQKENLGIISKDFTPLSMAKALNQL 345 Query: 292 TIETYKQISENTKIISQKIRTGSYFRDVLEEVIDD 326 T E +N + KI +L+EV++ Sbjct: 346 THEEILAYKQNADR-AAKIYNAKENEKILQEVMER 379 >UniRef50_C3WK21 L-fucosamine transferase n=3 Tax=Fusobacterium RepID=C3WK21_9FUSO Length = 389 Score = 45.3 bits (106), Expect = 0.003, Method: Composition-based stats. Identities = 32/227 (14%), Positives = 67/227 (29%), Gaps = 28/227 (12%) Query: 101 LRGGGGSDSVRLATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDK 160 + + + D + + Y+ K + + F +K Sbjct: 149 FKYFKRKEKLLYEISDYIGCMSKGNMDYILKNNPGISQEKVYYFPNTKKDTGNRSMDFEK 208 Query: 161 QR-GVIYAGN-------LSRHKCS--FIYTEGCDFTLFGVNYENKDNPKY-LGSFDAQSP 209 ++ +Y GN L+ F + +F G E +Y + + Sbjct: 209 EKLQFVYGGNMGLPQGVLNIAPAITYFKNDKDIEFIFVGKGTEWNKINEYFKEQKNVKVL 268 Query: 210 EKINLPGMQFGLIWDGDSVETCSGAF----GDYLKFNNPHKTSLYLSMELPVFI--WDKA 263 E + + +C F + N P +T YL +P+ Sbjct: 269 ESLPREE-------YEKLLSSCDAGFIFLDSRFTIPNYPSRTLAYLEKGIPIIAATDKNT 321 Query: 264 ALADFIVDNRIGY--AVGSIKEMQEIVDSMTIET--YKQISENTKII 306 + + + DN +G I + E + M K+ S+N + + Sbjct: 322 DIRNLVQDNNVGLWSCSDDIASLIENIKIMKENKEIRKEFSKNAREL 368 >UniRef50_D1Y955 Glycosyltransferase, group 1 family protein n=3 Tax=Propionibacterium acnes RepID=D1Y955_PROAC Length = 375 Score = 45.3 bits (106), Expect = 0.003, Method: Composition-based stats. Identities = 12/59 (20%), Positives = 26/59 (44%), Gaps = 1/59 (1%) Query: 237 DYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVG-SIKEMQEIVDSMTIE 294 +Y F P K Y+ P+ + + D + + +G+ V S+ E +++ +T Sbjct: 280 EYRDFAAPLKLFEYIGNGKPIIATEDTFVGDVVTRDELGWTVKASVVEFAALLEQLTQH 338 >UniRef50_B0KTZ0 Glycosyl transferase group 1 n=1 Tax=Pseudomonas putida GB-1 RepID=B0KTZ0_PSEPG Length = 402 Score = 45.3 bits (106), Expect = 0.003, Method: Composition-based stats. Identities = 34/220 (15%), Positives = 67/220 (30%), Gaps = 39/220 (17%) Query: 109 SVRLATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAG 168 + D V++ M S + K+ I D + +Y+G Sbjct: 176 YRIVNGADSVVAITDAMLDVFSPKL--KKVVVEGIADSDYVNRPSSSVRKPY---FLYSG 230 Query: 169 NLSRH--------KCSFIYTEGCDFTLFGVNYENKD---------NPKYLGSFDAQSPEK 211 L R E + G + + KYLG D + + Sbjct: 231 TLDRRYGIRKLLDAFVESNIENYHLYICGDGDDRANVESVSATNARVKYLGQLDRNAVLQ 290 Query: 212 INLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKAAL-ADFIV 270 + + + + K++ P K Y+S +PV ++ + A++ Sbjct: 291 LQR---------EASLLINPRDNESAFTKYSFPSKIIEYMSSGVPVMMYALDGIPAEY-- 339 Query: 271 DNRIGYAV----GSIKEMQEIVDSMTIETYKQISENTKII 306 R Y V +++M V S I+ ++ + K Sbjct: 340 -YRFCYLVPPGADGLRDMLAKVASFDIDELVEMGQTAKKF 378 >UniRef50_C6NU25 Glycosyl transferase, group 1 n=1 Tax=Acidithiobacillus caldus ATCC 51756 RepID=C6NU25_9GAMM Length = 475 Score = 45.3 bits (106), Expect = 0.003, Method: Composition-based stats. Identities = 26/157 (16%), Positives = 55/157 (35%), Gaps = 28/157 (17%) Query: 49 SSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDELRGGGGSD 108 S L+ + ++ D+ + + + F RL + ++HD+ +LR D Sbjct: 123 HSSGLNNYPINVKPGDI-FISLDLYRKFNFKALQELRLQGLKTYFVVHDLLDLRSCCLGD 181 Query: 109 SVR--------------------LATCDMVISHNPQMTKYL-----SKYMSQDKIKDIKI 143 S L+T D +I + + L K + ++K I Sbjct: 182 SDLTSSLIAHIARRTYRNWLHGVLSTSDGIICVSKSIADELLDWLNKKGIYENKNLQIGF 241 Query: 144 FDYLVSSDVEHRDVTDKQRGVIYAGNLS-RHKCSFIY 179 F YL + + +++ L+ ++K F+ Sbjct: 242 F-YLGADFINLKELGTYGVPDCIHTTLNAKNKPIFLM 277 >UniRef50_B3QYV4 Glycosyl transferase group 1 n=1 Tax=Chloroherpeton thalassium ATCC 35110 RepID=B3QYV4_CHLT3 Length = 426 Score = 45.3 bits (106), Expect = 0.003, Method: Composition-based stats. Identities = 31/320 (9%), Positives = 75/320 (23%), Gaps = 52/320 (16%) Query: 33 SVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLKFRIV 92 ++ ++G V I ++ + + F L +L F ++ Sbjct: 80 KLLQRKIFGKVKSSEIFFLRAVGHILKNSPDGKRVVVITRNTTFLFYLVMLKKLFGFTVL 139 Query: 93 PLIHDID--------------EL--RGGGGS--DSVRLATCDMVISHNPQMTKYLSKYMS 134 H R + L C +I + Sbjct: 140 FEAHGYHGTVNLPNLPLRPPLSFLQRYSSYRLLEQFLLNQCSGLICITRPQCQLYRADFV 199 Query: 135 QDKIKDIKIFDYLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIYTEGCDFTLFGVNYEN 194 + + + ++ ++Y G L+ D + + Sbjct: 200 KIPTAVLPL-GSRPPEMKTAALPQFSKKRLVYIGRLT-------THIDVDLMIQAIKAIA 251 Query: 195 KDNPK--YLGSFDAQSPEKINLP------GMQFGL-IWDG---------DSVETCSGAFG 236 D ++G F L W + + Sbjct: 252 SDGISLVWVGLKSGDIEVLAKKIREAGLPEGAFLLKGWMAHKDMAALLREETSVGLATYK 311 Query: 237 DYLK---FNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGY---AVGSIKEMQEIVDS 290 + P K Y ++ LPV + D + D G + + + + Sbjct: 312 PTYRSAVVTCPTKIFDYYAVGLPVIAAKLPTVEDLVTDGHHGVLYDTANAHESLVAAISR 371 Query: 291 MTIET--YKQISENTKIISQ 308 + + Y ++ + ++ Sbjct: 372 LCTDEALYSKMQASVLAAAE 391 >UniRef50_B8E2Q1 Glycosyl transferase group 1 n=1 Tax=Dictyoglomus turgidum DSM 6724 RepID=B8E2Q1_DICTD Length = 402 Score = 45.3 bits (106), Expect = 0.003, Method: Composition-based stats. Identities = 41/261 (15%), Positives = 83/261 (31%), Gaps = 42/261 (16%) Query: 102 RGGGGSDSVRLATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDK- 160 R D++I+ + ++ + L + Q I+ + L + K Sbjct: 145 RIAIWISREYCNHSDLIIAPSTKIKRLLKNFGIQKPIEILPNGIDLDRFKKIPKPEARKS 204 Query: 161 ---QRGVI---YAGNLSRHKCSFIYTEGCDFTLFGVNYENKDNPKYLGSFD--AQSPEKI 212 VI + G L + K E + EN + YL Sbjct: 205 LGLPTDVILLLFVGRLGKEKNIEFLIEVMKYI-----KENNEKLIYLVIVGDNPDKRVME 259 Query: 213 NLPGMQFGL-IWDGDSVETCSGAFGDYLKFNNPHKTS-----------------LYLSME 254 L L ++D T + +Y + + S ++ Sbjct: 260 ELKNKAKTLNVYD----RTIFTGYLEYERVIEAYYASDIFVFSSITETQGLVILEAMASG 315 Query: 255 LPVFIWDKAALADFIVDNRIGYAVGSIKE----MQEIVDSMTIET--YKQISENTKIISQ 308 LPV D A++DF+ D G+ V + +E E + ++ + Y ++S + S+ Sbjct: 316 LPVVAIDDDAISDFVKDGINGFLVPNNQENKRLFSEKIKNLIEDKDLYTKMSLHALETSR 375 Query: 309 KIRTGSYFRDVLEEVIDDLKT 329 + + +L D ++ Sbjct: 376 SFHIKNLNKKLLALYEDLIRE 396 >UniRef50_B6IYH8 Glycosyl transferase, group 1 family protein n=1 Tax=Rhodospirillum centenum SW RepID=B6IYH8_RHOCS Length = 400 Score = 45.3 bits (106), Expect = 0.003, Method: Composition-based stats. Identities = 26/199 (13%), Positives = 47/199 (23%), Gaps = 52/199 (26%) Query: 106 GSDSVRLATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTD-KQRGV 164 + D + + + + + Y V + R + + Sbjct: 154 ERELEEYDLADRITVPSGAVRDSFVAQGVAP--GKLAVVPYGVDLGLFRRVAPRAPEFRI 211 Query: 165 IYAGNLSRHKCSFIY--------TEGCDFTLFGVNYENKDNP--------KYLGS----- 203 ++ G LS K G L G +LGS Sbjct: 212 LFVGGLSVRKGLHDLFAAVRLAGIPGARLVLAGGGLPETARLLAAAGVPFDWLGSLSWSD 271 Query: 204 ----FDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFI 259 F A + + FGL+ S ++ PV + Sbjct: 272 LVREFSAAAAFVLPSIEDGFGLV------------------------VSQAMACGCPVIV 307 Query: 260 WDKAALADFIVDNRIGYAV 278 AD + + G+ V Sbjct: 308 SRNVGAADLVQEGVTGFVV 326 >UniRef50_B9YYP4 Glycosyl transferase group 1 n=1 Tax=Lutiella nitroferrum 2002 RepID=B9YYP4_9NEIS Length = 391 Score = 45.3 bits (106), Expect = 0.003, Method: Composition-based stats. Identities = 27/236 (11%), Positives = 71/236 (30%), Gaps = 40/236 (16%) Query: 105 GGSDSVRLATCDMVISHNPQMTKYLSKYMSQDK----IKDIKIFDYLVSSDVE---HRDV 157 + V + + + ++ + I L + ++ R Sbjct: 147 KAREQFVYNNAAGVFALTSLLIDDIKQHYAIAHDRFCILPDGFDPALAQAAMQRHADRPR 206 Query: 158 TDKQRGVIYAGNLSRHK-------CSFIYTEGCDFTLFGVNYEN-------------KDN 197 V+Y G+L K + + + G + Sbjct: 207 QPGPVRVLYLGSLHPWKGVGTLIDALPLVQSELELVIAGGEPHRIDELRARAAALGVEKP 266 Query: 198 PKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPV 257 +LG +F +I D ++ +P K Y++M P+ Sbjct: 267 VHFLGKVPPAE---------RFDVIAQADICALPLTNSSIASRYTSPLKLFEYMAMGKPI 317 Query: 258 FIWDKAALADFIVDN--RIGYAVGSIKEMQEIVDSMT--IETYKQISENTKIISQK 309 I D ++ + + D + + + + + ++D + ++ +++ EN +S + Sbjct: 318 VIADLPSIKEIVTDQVSAVFFEAENKESLAAVLDKLAMRKQSQQELGENAARLSSR 373 >UniRef50_C2INF3 Glycosyl transferase family 2 n=1 Tax=Vibrio cholerae TMA 21 RepID=C2INF3_VIBCH Length = 1223 Score = 45.3 bits (106), Expect = 0.003, Method: Composition-based stats. Identities = 38/247 (15%), Positives = 72/247 (29%), Gaps = 47/247 (19%) Query: 45 QRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDI--DELR 102 ++ ++ T L L + + + + + IH + +E R Sbjct: 496 RKNKFLARMLTLLLALRCRKIYFHSVLRMHDSRFGYLMYFPF--IKKAIDIHGVVPEEFR 553 Query: 103 GGGGS---------DSVRLATCDMVISHNPQMTKYLSKY---MSQDKIKDIKIFDYLVSS 150 G + + + ++V++ M +YL+ Q K + IF ++ Sbjct: 554 YYGDYYSACLYEKYEKIAVKNANIVLTVTDAMKQYLADKYRIDLQVKGVTLPIFQ-NANN 612 Query: 151 DVEHRDVTDKQRGVIYAGNLSRHK-------CSFIYTEGCDFTLFGVNYEN--------- 194 D ++ +Q VIYAG L + + + F E Sbjct: 613 DECNKSSDSEQLKVIYAGGLHKWQQVDKMLSAIEQVRDKFKIYFFCPQPEEISQRLSPEA 672 Query: 195 --KDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLS 252 N + G FG I D + + P K YL Sbjct: 673 FKSANIEI-GCKSPDELRHFYQIS-DFGFILREDIIVNN---------VSCPTKLIEYLD 721 Query: 253 ME-LPVF 258 + +PV Sbjct: 722 YDIIPVI 728 >UniRef50_Q83GT6 Glycosyltransferase domain-containing protein n=2 Tax=Tropheryma whipplei RepID=Q83GT6_TROWT Length = 876 Score = 45.0 bits (105), Expect = 0.003, Method: Composition-based stats. Identities = 14/82 (17%), Positives = 34/82 (41%), Gaps = 8/82 (9%) Query: 250 YLSMELPVFIWDKAALADFIVDNRIGYAVG--SIKEMQEIVDSM--TIETYKQISENTKI 305 YL LP+ I D A ++ + +G V +++ + + ++ + + N + Sbjct: 739 YLWAGLPMVITDGDVFAQYVKEYNLGLVVEQGNVRSLADALEKILFDQDFILACKRNIEE 798 Query: 306 ISQKIRTGSYFRDVLEEVIDDL 327 R ++ +VL +I + Sbjct: 799 F----RRRFFWEEVLRPLIRRI 816 >UniRef50_B1C0Q1 Putative uncharacterized protein n=1 Tax=Clostridium spiroforme DSM 1552 RepID=B1C0Q1_9FIRM Length = 323 Score = 45.0 bits (105), Expect = 0.003, Method: Composition-based stats. Identities = 43/268 (16%), Positives = 84/268 (31%), Gaps = 41/268 (15%) Query: 49 SSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPL-IHDIDELRGGGGS 107 ++ D L + + K IL + + + +++ + + G Sbjct: 1 MFPLRLKKFIKKKHYD-LCLHTQIDKISLEILKKYCKKINIKLIYDAVEWFSPEQFKRGD 59 Query: 108 DSVRLATCD-----------MVISHNPQMTKYLSKYMSQDKIKDIKI-FDYLVSSDVEHR 155 S+ D VI+ + YL K+ IK +KI V + + + Sbjct: 60 KSISYKRNDSYNTKWIDKQYYVIAISE----YLKKHFLDRNIKVLKIPVILDVQNIISKK 115 Query: 156 DVTDKQRGVIYAGNLSRHKCSFIYTEGC-----------DFTLFGVNYENKDNPKYLGSF 204 ++++ + ++YAG+ + F G F + GV E N + + Sbjct: 116 NISNDKLVIMYAGSPGKKDYLFEILNGILLLKKEEQRKLKFIIIGVTREQLVNICSVENN 175 Query: 205 DAQSPEKINLPGMQFGLIWDGDSV---------ETCSGAFGDYLKFNNPHKTSLYLSMEL 255 ++ + I D + Y K P K L+ Sbjct: 176 IIDKLSEVIEIKGR---ISRADVLCELEKANFTILIRSEKQRYAKAGFPTKFVESLATGT 232 Query: 256 PVFIWDKAALADFIVDNRIGYAVGSIKE 283 PV + L D+++D V S E Sbjct: 233 PVISNLTSDLNDYLIDKTNSIVVDSCDE 260 >UniRef50_A0YEZ9 Putative uncharacterized protein n=1 Tax=marine gamma proteobacterium HTCC2143 RepID=A0YEZ9_9GAMM Length = 391 Score = 45.0 bits (105), Expect = 0.004, Method: Composition-based stats. Identities = 41/291 (14%), Positives = 76/291 (26%), Gaps = 40/291 (13%) Query: 17 KARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPM-AKP 75 K + D I + + I + L + L +E + F + Sbjct: 59 KEKIDLSSIPENIKVIPLPIFYLPFAKSYLKLGEQHFKKALRAIERHQI---QFDLIYCH 115 Query: 76 FWHIL----SFFHRLLKFRIVPLIH--DIDELRGG----GGSDSVRLATCDMVISHNPQM 125 F + +V H D+ L + L D VI+ + Sbjct: 116 FVWSAGYAGARLKETFHKPLVVTGHGYDVYSLPFKNEHWRYQITWTLNQADQVITVSQSN 175 Query: 126 TKYLSKYMSQDKIKDIK------IFDYLVSSDVEHRDVTDKQRGVIYA-GNLSRHKCSFI 178 L I+ I +F+ + + + +I A GNL K + Sbjct: 176 RNILRSLDISCPIEVIPNGVSKSLFEVQDQAACRRELAMPQDQKIILAIGNLLPIKGHEL 235 Query: 179 YTEGCDFTLFGVNYENKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDY 238 D + + + + + + Q GL D + Sbjct: 236 LISAFDLV-----DQEQRSCHLVIIGSGECLPLLKKQASQLGLA-DKITFTGAIAHDQLQ 289 Query: 239 LKFN------NPHKTSLY-------LSMELPVFIWDKAALADFIVDNRIGY 276 N P K + L+ +PV + I + +GY Sbjct: 290 TWINGADLLAMPSKKESFGVVQIEALACGVPVVATKNGGSEEIITSDTVGY 340 >UniRef50_Q39W08 Putative uncharacterized protein n=1 Tax=Geobacter metallireducens GS-15 RepID=Q39W08_GEOMG Length = 359 Score = 45.0 bits (105), Expect = 0.004, Method: Composition-based stats. Identities = 31/180 (17%), Positives = 61/180 (33%), Gaps = 21/180 (11%) Query: 38 PLWGGVVQRIISSVKLSTFLCGL--ENKDVLIFNFPM----AKPFWHILSFFHRL-LKFR 90 LW V KLS L + DV+ +P H+L ++ K + Sbjct: 51 KLWSPYVAESWRFRKLSFILKSIDQRKPDVIFMQYPAEGYGWSLVPHMLLIYYVFIKKIK 110 Query: 91 IVPLIHDIDELRGGGGSDSV-RLATCDMVISHNPQMTKYLSKY--MSQDKIKDIKIFDYL 147 + ++H+ L L+ +I L+K+ K + I Sbjct: 111 FITVLHEYSSLSWKSRFFIRNILSKSSQLIFTTNFELNNLAKHVPGILSKANVLPILS-N 169 Query: 148 VSSDVEHRDVTDKQRGVIYAGNLSRHK----------CSFIYTEGCDFTLFGVNYENKDN 197 + + E + ++D+ ++Y G++ +K S I + L G + +N Sbjct: 170 IPNPKEIKQISDRSIDILYFGHIRPNKGIEDFLRVICSSKIVNKNLSIKLVGQIPKGYEN 229 >UniRef50_C6JNS5 Putative uncharacterized protein n=1 Tax=Fusobacterium varium ATCC 27725 RepID=C6JNS5_FUSVA Length = 420 Score = 45.0 bits (105), Expect = 0.004, Method: Composition-based stats. Identities = 51/348 (14%), Positives = 107/348 (30%), Gaps = 70/348 (20%) Query: 19 RKDALDIASDYENISVVNIPLWGGVVQRII----SSVKLSTFLCGLENKDVLIFNFPMAK 74 + + E I + ++RI + L+ ++ L KD+ + + Sbjct: 57 KMYISEKYEGIEFIFIKTRTYTNNGIERIKNFIDYYINLNKYIKKLGKKDIPDIIYASSP 116 Query: 75 PFWHILSFFHRLLKFRI--VPLIHDI--------DELRGGGGS-------DSVRLATCDM 117 +LS K +I + I D + + + + Sbjct: 117 HPLALLSGIKNSRKLKIPCIGEIRDFWPEVFFLGGKFKEKSLIGKLLLKGEKYLYKNLNA 176 Query: 118 VISHNPQMTKYLSKYMSQ---------DKI------KDIKIFD-YLVSSDVEHRDVTDKQ 161 +I +Y+ ++ KI D+ FD L + D+ ++ Sbjct: 177 LIFLKEGDKEYIKEHKWSLEQGGDIDLKKIYYINNGVDLDEFDKNLNEYKYQDEDLESEK 236 Query: 162 RGVIYAGNLSRHKCS--------FIYTEGCDFTLFGVNYENK-----------DNPKYLG 202 +IY G + + I E +F +FG E DN K+ G Sbjct: 237 FKIIYIGAIRKVNNIERIVEVAKKIKMEEIEFLIFGDGNERAILELKCKKEGIDNVKFKG 296 Query: 203 SFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDK 262 + + ++ S + ++ + N+ +K Y++ P+ K Sbjct: 297 Y--VEKKYIPYILSKA------NLNILNYSQSEYNWKRGNSSNKLFEYMASGKPILSTVK 348 Query: 263 AALADFIVDNRIGYAV--GSIKEMQEIV---DSMTIETYKQISENTKI 305 I + G + +I E + +MT E Y+ + N + Sbjct: 349 MG-YSIIEKYKCGLELETENIDEFYHKILEMKNMTKEKYRLMGLNARN 395 >UniRef50_Q12KU7 Glycosyl transferase, group 1 n=1 Tax=Shewanella denitrificans OS217 RepID=Q12KU7_SHEDO Length = 357 Score = 45.0 bits (105), Expect = 0.004, Method: Composition-based stats. Identities = 38/271 (14%), Positives = 76/271 (28%), Gaps = 23/271 (8%) Query: 41 GGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHILSFFHRLLKFRIVPLIHDIDE 100 + I S VK + + DV+I + F + + V D Sbjct: 56 RRKSKVIYSYVKRLFQIITIHKYDVVIIEKELFPYFPATIERLLNFFNVKYVVDYDDAIF 115 Query: 101 LR-----------GGGGSDSVRLATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVS 149 +V + +VI+ N + Y +++ + D + Sbjct: 116 QMYEESSNWMVKFFLKDKINVVMNNAHIVITGNDFLFSKAKFYGAKNVTVIPTVID--ID 173 Query: 150 SDVEHRDVTDKQRGVIYAGNLSRHKCSFIYTEGCDFTLFGVNYENKDNP---KYLGSFDA 206 V+ + V + G+ S K + + N +G F Sbjct: 174 RYVQESTRQEVPT-VCWIGSPSTQKYVVE-LKDVLIRVCLENKAKLVLVGATTSVGDFFP 231 Query: 207 QSPEKI---NLPGMQFGLIWDGDSVETCSGAFGDYLKFNNPHKTSLYLSMELPVFIWDKA 263 +I F + + + + K +K Y++ PV + Sbjct: 232 DIDLEIIPWEESTEAFIIKHSDIGIMPIASTNWERGK--CGYKLIQYMATGKPVVASNFG 289 Query: 264 ALADFIVDNRIGYAVGSIKEMQEIVDSMTIE 294 A D + + G V S E E + + + Sbjct: 290 ANIDIVKNECCGLLVNSDDEWYEALTQLIKD 320 >UniRef50_D1C8N3 Glycosyl transferase group 1 n=1 Tax=Sphaerobacter thermophilus DSM 20745 RepID=D1C8N3_SPHTD Length = 410 Score = 45.0 bits (105), Expect = 0.004, Method: Composition-based stats. Identities = 11/63 (17%), Positives = 25/63 (39%), Gaps = 4/63 (6%) Query: 234 AFGDYLKFNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIK--EMQEIVDSM 291 + +Y+ P K Y++ LPV + ++ F+ D G V ++ + + Sbjct: 292 PYTEYVTH--PVKLFEYMAWGLPVICSNLPNMSRFVRDGEYGLVVDPRDPADIAAAITRL 349 Query: 292 TIE 294 + Sbjct: 350 ARD 352 >UniRef50_A3DHW2 Glycosyl transferase, group 1 n=3 Tax=Clostridium thermocellum RepID=A3DHW2_CLOTH Length = 347 Score = 45.0 bits (105), Expect = 0.004, Method: Composition-based stats. Identities = 47/336 (13%), Positives = 104/336 (30%), Gaps = 46/336 (13%) Query: 17 KARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPF 76 K + I + +V+ + + + I K +N +L + Sbjct: 23 KVKSLTEAIKDNIGEHNVLCVDTYNWKKRPIRLLRKCFRLARKCKNIVILPAQNGIKVLV 82 Query: 77 WHILSFFHRLLKFRIVPLIHD--IDELRGGGGSDSVRLATCDMVISHNPQMTKYLSKYMS 134 S ++L ++ ++ + L D + M++ L ++ Sbjct: 83 PLF-SLINKLFGRKLFYVVIGGWLPTFLKNYKWLVSWLHHMDGIFVETASMSEKLIEFGL 141 Query: 135 QDKIK-----DIKIFD-------YLVSSDVEHRDVTDKQRGVIYAGNLSRHKCSFIYTEG 182 ++ + ++I D + + + K++G+ A N + E Sbjct: 142 KNVLVMPNFRQLRIVDINELQDTHALPYKLCTFSRVLKEKGIEDAINAVIKVNTDCGREV 201 Query: 183 CDFTLFGVNYENKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLKFN 242 C ++G E KY +F A + E DY Sbjct: 202 CTLDIYGQIDE-----KYKDAFWAIMSNVPAYIKYK-----GEAPYEKAVDVLKDYYLML 251 Query: 243 NPHKTSLY-------------LSMELPVFIWDKAALADFIVDNRIG--YAVGSIKEMQEI 287 P Y + LPV D ++ + D + G + IKE+ EI Sbjct: 252 FP----TYYEGEGFAGTIIDAFASGLPVIASDWRYNSEIVQDYKTGRIFRTKDIKELAEI 307 Query: 288 VDSMTI--ETYKQISENTKIISQKIRTGSYFRDVLE 321 + + ++ +N ++K +G+ + ++E Sbjct: 308 ILYCLEHGDEVMEMKKNCIEEARKYTSGNAIKKLIE 343 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.313 0.144 0.417 Lambda K H 0.267 0.0444 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,971,250,961 Number of Sequences: 3077464 Number of extensions: 82922375 Number of successful extensions: 275881 Number of sequences better than 1.0e-01: 250 Number of HSP's better than 0.1 without gapping: 234 Number of HSP's successfully gapped in prelim test: 689 Number of HSP's that attempted gapping in prelim test: 274335 Number of HSP's gapped (non-prelim): 1090 length of query: 330 length of database: 1,040,396,356 effective HSP length: 129 effective length of query: 201 effective length of database: 643,403,500 effective search space: 129324103500 effective search space used: 129324103500 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.3 bits) S2: 93 (40.3 bits)