BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (344 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P11290 Uncharacterized glycosyltransferase yibD n=58 Ta... 721 0.0 UniRef50_B4SXB8 Glycosyl transferase, group 2 family protein n=3... 597 e-169 UniRef50_A8GFL8 Glycosyl transferase family 2 n=1 Tax=Serratia p... 323 4e-87 UniRef50_D1RQX9 Putative glycosyl transferase n=1 Tax=Serratia o... 321 3e-86 UniRef50_B5XTH2 Glycosyl transferase, group 2 family protein n=2... 236 7e-61 UniRef50_B4F184 Glycosyl transferase n=2 Tax=Proteus mirabilis R... 158 3e-37 UniRef50_D1NJF8 Glycosyl transferase family 2 n=1 Tax=Clostridiu... 145 2e-33 UniRef50_C4K3K5 Glycosyltransferase group 2 n=3 Tax=Enterobacter... 142 2e-32 UniRef50_C3WCV9 Glycosyl transferase n=1 Tax=Fusobacterium morti... 140 8e-32 UniRef50_D1BQ92 Glycosyl transferase family 2 n=1 Tax=Veillonell... 140 9e-32 UniRef50_Q84GX9 Orf34 n=2 Tax=Photorhabdus luminescens RepID=Q84... 135 2e-30 UniRef50_A6LE28 Glycosyltransferase family 2 n=1 Tax=Parabactero... 134 6e-30 UniRef50_D2RIJ6 Glycosyl transferase family 2 n=1 Tax=Acidaminoc... 133 1e-29 UniRef50_A8UPC2 Glycosyltransferase n=1 Tax=Flavobacteriales bac... 131 3e-29 UniRef50_B7HZB4 Beta-1,3-N-acetylglucosaminyltransferase n=37 Ta... 130 7e-29 UniRef50_P71059 Uncharacterized glycosyltransferase epsJ n=3 Tax... 128 3e-28 UniRef50_C4FFR1 Putative uncharacterized protein n=1 Tax=Bifidob... 127 4e-28 UniRef50_Q02W55 Glycosyltransferase n=3 Tax=Lactococcus lactis R... 127 6e-28 UniRef50_Q93DZ7 Eps1I n=1 Tax=Streptococcus thermophilus RepID=Q... 125 2e-27 UniRef50_C3AT96 Glycosyltransferase family 2 n=4 Tax=Bacillus Re... 125 2e-27 UniRef50_B1JIU1 Glycosyl transferase family 2 n=28 Tax=Enterobac... 125 2e-27 UniRef50_Q9RG46 Cps1I n=2 Tax=Streptococcus suis RepID=Q9RG46_STRSU 125 2e-27 UniRef50_C2FUI9 Glycosyltransferase yibD n=3 Tax=Bacteroidetes R... 125 3e-27 UniRef50_B1R2U2 Glycosyl transferase, group 2 family protein n=4... 125 3e-27 UniRef50_D0GIA2 Glycosyltransferase, group 2 family n=1 Tax=Lept... 124 3e-27 UniRef50_A4E6Y2 Putative uncharacterized protein n=1 Tax=Collins... 124 4e-27 UniRef50_UPI0001BC61E6 putative glycosyl transferase n=1 Tax=Fus... 124 6e-27 UniRef50_B1QTB6 Glycosyl transferase, group 2 family protein n=4... 124 7e-27 UniRef50_Q93TI7 Putative glycosyltransferase CpsIVM n=1 Tax=Stre... 123 8e-27 UniRef50_C3R6R9 Glycosyltransferase involved capsular polysaccha... 123 8e-27 UniRef50_B7AIU6 Putative uncharacterized protein n=1 Tax=Bactero... 123 9e-27 UniRef50_D0GIA1 Glycosyltransferase n=1 Tax=Leptotrichia goodfel... 122 2e-26 UniRef50_UPI0001B4A4FE ss-1,4-galactosyltransferase n=1 Tax=Bact... 122 2e-26 UniRef50_B2DG34 Eps5I n=3 Tax=Streptococcus pneumoniae RepID=B2D... 122 2e-26 UniRef50_C4Z667 Putative uncharacterized protein n=1 Tax=Eubacte... 121 3e-26 UniRef50_C3WBU5 Predicted protein n=3 Tax=Fusobacterium RepID=C3... 121 3e-26 UniRef50_A8ULG5 Capsular polysaccharide biosynthsis protein n=1 ... 121 3e-26 UniRef50_A9VPZ1 Glycosyl transferase family 2 n=3 Tax=Firmicutes... 121 3e-26 UniRef50_B5CU67 Putative uncharacterized protein n=1 Tax=Bactero... 121 4e-26 UniRef50_C1ME88 Glycosyltransferase n=2 Tax=Enterobacteriaceae R... 121 4e-26 UniRef50_O86893 Putative glycosyl transferase n=1 Tax=Streptococ... 121 4e-26 UniRef50_B0MQU6 Putative uncharacterized protein n=1 Tax=Eubacte... 121 4e-26 UniRef50_A7VPB2 Putative uncharacterized protein n=1 Tax=Clostri... 121 4e-26 UniRef50_B2UQP2 Glycosyl transferase family 2 n=1 Tax=Akkermansi... 121 5e-26 UniRef50_Q8XN37 Capsular polysaccharide biosynthsis protein n=1 ... 120 5e-26 UniRef50_C3ADH9 Putative uncharacterized protein n=1 Tax=Bacillu... 120 6e-26 UniRef50_B5CP79 Putative uncharacterized protein n=1 Tax=Ruminoc... 120 7e-26 UniRef50_Q8GPB8 Eps6I n=5 Tax=Streptococcus RepID=Q8GPB8_STRTR 120 8e-26 UniRef50_C7H3W5 Glycosyl transferase CpsJ n=1 Tax=Faecalibacteri... 119 1e-25 UniRef50_B2UYP7 Minor teichoic acid biosynthesis protein GgaB n=... 119 1e-25 UniRef50_C9LAR4 Glycosyl transferase, family 2/glycosyl transfer... 119 1e-25 UniRef50_D2EWI1 Glycoside transferase family 2 protein n=3 Tax=B... 119 2e-25 UniRef50_C7H3W1 Capsular polysaccharide biosynthesis protein Cps... 119 2e-25 UniRef50_A6BH37 Putative uncharacterized protein n=1 Tax=Dorea l... 118 2e-25 UniRef50_A8SCX5 Putative uncharacterized protein n=1 Tax=Faecali... 118 3e-25 UniRef50_B0P2L4 Putative uncharacterized protein n=1 Tax=Clostri... 118 3e-25 UniRef50_A6L5V6 Glycosyltransferase family 2 n=2 Tax=Bacteroides... 118 3e-25 UniRef50_A8U9G0 Capsular polysaccharide biosynthsis protein n=1 ... 118 3e-25 UniRef50_C6WZX8 Beta-1,3-glucosyltransferase n=1 Tax=Flavobacter... 118 4e-25 UniRef50_C8WIW0 Glycosyl transferase family 2 n=1 Tax=Eggerthell... 117 4e-25 UniRef50_C9MPZ6 Putative glycosyl transferase n=1 Tax=Prevotella... 117 5e-25 UniRef50_D1QMD5 Glycosyl transferase, group 2 family n=1 Tax=Pre... 117 5e-25 UniRef50_A5ZCV6 Putative uncharacterized protein n=1 Tax=Bactero... 117 5e-25 UniRef50_C1I2E9 Glycosyl transferase n=1 Tax=Clostridium sp. 7_2... 117 8e-25 UniRef50_C3QCN3 Glycoside transferase family 2 n=6 Tax=Bacteroid... 116 1e-24 UniRef50_A7VCQ5 Putative uncharacterized protein n=1 Tax=Clostri... 116 1e-24 UniRef50_D2BMH0 Glycosyltransferase, family 2 n=2 Tax=Lactococcu... 116 1e-24 UniRef50_D1QQ98 Capsular polysaccharide biosythesis protein CpsI... 116 1e-24 UniRef50_B5LBK9 Putative glycosyl transferase n=1 Tax=Streptococ... 116 1e-24 UniRef50_D1PWT5 Exopolysaccharide biosynthesis protein n=1 Tax=P... 116 1e-24 UniRef50_A7VPB1 Putative uncharacterized protein n=1 Tax=Clostri... 116 1e-24 UniRef50_C0F023 Putative uncharacterized protein n=1 Tax=Eubacte... 116 2e-24 UniRef50_C9BVV0 Glycosyl transferase n=3 Tax=Enterococcus faeciu... 115 2e-24 UniRef50_B3JEB6 Putative uncharacterized protein n=1 Tax=Bactero... 115 2e-24 UniRef50_P71057 Putative glycosyltransferase epsH n=3 Tax=Bacill... 115 2e-24 UniRef50_B0NP43 Putative uncharacterized protein n=1 Tax=Bactero... 115 2e-24 UniRef50_D0BME7 Capsular polysaccharide biosynthesis protein Cps... 115 2e-24 UniRef50_C7H3W2 Capsular polysaccharide biosynthesis protein n=1... 115 2e-24 UniRef50_B3Z5I4 Glycosyltransferase n=2 Tax=Bacillus cereus grou... 115 2e-24 UniRef50_C0YL33 Glycosyltransferase n=1 Tax=Chryseobacterium gle... 115 3e-24 UniRef50_C4ZG45 Glycosyltransferase Family 2 modular protein n=1... 115 3e-24 UniRef50_C9RJ54 Glycosyl transferase family 2 n=1 Tax=Fibrobacte... 115 3e-24 UniRef50_C3Q1B0 Glycosyltransferase family 2 protein n=2 Tax=Bac... 114 4e-24 UniRef50_A5KMN1 Putative uncharacterized protein n=1 Tax=Ruminoc... 114 4e-24 UniRef50_A3KLK7 Beta-1,3-N-acetylglucosaminyl transferase n=3 Ta... 114 4e-24 UniRef50_C0X9Z9 Glycosyltransferase n=1 Tax=Lactobacillus gasser... 114 4e-24 UniRef50_D0R4M4 Glycosyltransferase n=1 Tax=Lactobacillus johnso... 114 4e-24 UniRef50_C6ID60 Glycosyltransferase n=5 Tax=Bacteroides RepID=C6... 114 4e-24 UniRef50_Q815A0 Glycosyltransferase n=6 Tax=Bacillus cereus RepI... 114 5e-24 UniRef50_C3PZ30 Glycosyl transferase n=4 Tax=Bacteroides RepID=C... 114 5e-24 UniRef50_Q5LBM4 Putative glycosyltransferase protein n=4 Tax=Bac... 114 6e-24 UniRef50_B4AD18 Glycosyltransferase n=18 Tax=Enterococcus RepID=... 114 7e-24 UniRef50_C7X8L4 Glycoside transferase, family 2 n=1 Tax=Parabact... 114 7e-24 UniRef50_UPI0001C377F0 glycosyl transferase, group 2 family prot... 114 7e-24 UniRef50_A3CP06 Cell-wall biogenesis glycosyltransferase, putati... 114 7e-24 UniRef50_O06035 EpsG n=1 Tax=Lactococcus lactis subsp. cremoris ... 113 8e-24 UniRef50_B0ACE9 Putative uncharacterized protein n=2 Tax=Firmicu... 113 1e-23 UniRef50_C6LFI9 Glycosyl transferase, group 2 family n=1 Tax=Bry... 113 1e-23 UniRef50_Q9F0B5 Beta(1,4)galactosyltransferase EpsM n=1 Tax=Lact... 113 1e-23 UniRef50_C9K1G9 Putative glycosyltransferase n=1 Tax=Klebsiella ... 113 1e-23 UniRef50_A3F4D9 EpsM n=1 Tax=Lactococcus lactis subsp. cremoris ... 113 1e-23 UniRef50_B5CU57 Putative uncharacterized protein n=1 Tax=Bactero... 112 1e-23 UniRef50_B3H2A3 Putative glycosyltransferase n=1 Tax=Actinobacil... 112 1e-23 UniRef50_B1C0P6 Putative uncharacterized protein n=1 Tax=Clostri... 112 1e-23 UniRef50_C8NFZ4 Glycosyl transferase CpsJ(V) n=1 Tax=Granulicate... 112 2e-23 UniRef50_B7CD61 Putative uncharacterized protein n=1 Tax=Eubacte... 112 2e-23 UniRef50_O87182 CpsIaI n=15 Tax=Streptococcus agalactiae RepID=O... 112 2e-23 UniRef50_Q03KL3 Glycosyltransferase involved in cell wall biogen... 112 2e-23 UniRef50_B0ABV6 Putative uncharacterized protein n=1 Tax=Clostri... 112 2e-23 UniRef50_B7GUL7 Glycosyl transferase, family 2 n=1 Tax=Bifidobac... 112 2e-23 UniRef50_B7HXA1 N-acetylglucosaminyltransferase n=2 Tax=Bacillus... 112 2e-23 UniRef50_A5ZUY3 Putative uncharacterized protein n=1 Tax=Ruminoc... 112 2e-23 UniRef50_C2M8C3 Glycosyl transferase, family 2 n=1 Tax=Capnocyto... 112 2e-23 UniRef50_A1YVC6 Glycosyltransferase n=1 Tax=Lactobacillus johnso... 112 2e-23 UniRef50_C8WM64 Glycosyl transferase family 2 n=1 Tax=Eggerthell... 112 2e-23 UniRef50_B2UQN8 Glycosyl transferase family 2 n=1 Tax=Akkermansi... 112 3e-23 UniRef50_C6IJ34 Glycoside transferase family 2 n=2 Tax=Bacteroid... 111 3e-23 UniRef50_C0BRK2 Putative uncharacterized protein n=1 Tax=Bifidob... 111 3e-23 UniRef50_B1C3W3 Putative uncharacterized protein n=1 Tax=Clostri... 111 3e-23 UniRef50_D0ANJ0 Putative uncharacterized protein n=1 Tax=Enteroc... 111 3e-23 UniRef50_C9BUH6 Glycosyltransferase n=1 Tax=Enterococcus faecium... 111 3e-23 UniRef50_UPI0001C4194D glycosyl transferase GT2 family n=1 Tax=M... 111 3e-23 UniRef50_Q5LIR2 Putative glycosyltransferase n=6 Tax=Bacteroides... 111 4e-23 UniRef50_A8U9H0 Putative capsular polysaccharide glycosyltransfe... 111 4e-23 UniRef50_C2M970 Glycosyl transferase family 2 n=1 Tax=Porphyromo... 111 4e-23 UniRef50_B9YAB2 Putative uncharacterized protein n=1 Tax=Holdema... 111 5e-23 UniRef50_C9LBS0 Glycosyl transferase, group 2 family n=1 Tax=Bla... 111 5e-23 UniRef50_Q74L33 Putative uncharacterized protein n=1 Tax=Lactoba... 111 5e-23 UniRef50_Q15RB7 Glycosyl transferase, family 2 n=1 Tax=Pseudoalt... 110 5e-23 UniRef50_A6BIH4 Putative uncharacterized protein n=1 Tax=Dorea l... 110 5e-23 UniRef50_A7V2P1 Putative uncharacterized protein n=3 Tax=Bactero... 110 6e-23 UniRef50_Q6ASC1 Related to glycosyltransferase involved capsular... 110 6e-23 UniRef50_C6Z980 Glycosyltransferase family 2 n=1 Tax=Bacteroides... 110 7e-23 UniRef50_A6NTS0 Putative uncharacterized protein n=1 Tax=Bactero... 110 8e-23 UniRef50_C4Z1X9 Putative uncharacterized protein n=1 Tax=Eubacte... 110 8e-23 UniRef50_D0TDE5 Glycosyltransferase family 2 n=1 Tax=Bacteroides... 110 9e-23 UniRef50_D2Q666 Glycosyl transferase, family 2 n=4 Tax=cellular ... 110 1e-22 UniRef50_A7I1X7 Ss-1,4-galactosyltransferase n=1 Tax=Campylobact... 110 1e-22 UniRef50_B3C8H4 Putative uncharacterized protein n=2 Tax=Bactero... 110 1e-22 UniRef50_UPI0001968EF9 hypothetical protein BACCELL_04174 n=1 Ta... 109 1e-22 UniRef50_C2C0Z6 Glycosyltransferase n=3 Tax=Listeria RepID=C2C0Z... 109 1e-22 UniRef50_UPI0001C41E5E glycosyl transferase GT2 family n=1 Tax=M... 109 1e-22 UniRef50_A5UNM1 Glycosyltransferase/CDP-glycerol:poly(Glyceropho... 109 1e-22 UniRef50_C5NWF7 Glycosyl transferase, group 2 family n=2 Tax=Fir... 109 1e-22 UniRef50_A8U9H2 Glycosyl transferase, family 2/glycosyl transfer... 109 1e-22 UniRef50_C5VL41 Glycosyl transferase, group 2 family n=2 Tax=Pre... 109 1e-22 UniRef50_B0N0F0 Putative uncharacterized protein n=1 Tax=Clostri... 109 2e-22 UniRef50_C2DBB8 Glycosyltransferase n=19 Tax=Bacteria RepID=C2DB... 109 2e-22 UniRef50_Q8A777 Glycoside transferase family 2 n=1 Tax=Bacteroid... 109 2e-22 UniRef50_B0ACD9 Putative uncharacterized protein n=1 Tax=Clostri... 109 2e-22 UniRef50_C2M966 Glycosyl transferase family protein n=1 Tax=Porp... 109 2e-22 UniRef50_D1PQG3 Glycosyltransferase n=1 Tax=Subdoligranulum vari... 109 2e-22 UniRef50_C4FMC2 Putative uncharacterized protein n=3 Tax=Firmicu... 108 2e-22 UniRef50_UPI0001851388 hypothetical protein Bcoam_19734 n=1 Tax=... 108 2e-22 UniRef50_D2RJ84 Glycosyl transferase family 2 n=1 Tax=Acidaminoc... 108 2e-22 UniRef50_B6FJ78 Putative uncharacterized protein n=1 Tax=Clostri... 108 2e-22 UniRef50_A6NTT2 Putative uncharacterized protein n=1 Tax=Bactero... 108 2e-22 UniRef50_D0WAZ5 Glycosyl transferase, group 2 family n=5 Tax=Nei... 108 2e-22 UniRef50_D0BKT8 Glycosyl transferase, group 2 n=1 Tax=Granulicat... 108 3e-22 UniRef50_D0BKT9 Capsular polysaccharide biosynthesis protein Cps... 108 3e-22 UniRef50_D0TAC7 N-acetylglucosaminyltransferase n=1 Tax=Bacteroi... 108 3e-22 UniRef50_C4ZHK7 Glycosyltransferase Family 2 modular protein n=1... 108 3e-22 UniRef50_C6IMR5 Glycoside transferase family 2 n=1 Tax=Bacteroid... 108 3e-22 UniRef50_A6NTT3 Putative uncharacterized protein n=1 Tax=Bactero... 108 3e-22 UniRef50_A8SZ40 Putative uncharacterized protein n=1 Tax=Coproco... 108 4e-22 UniRef50_B2V0U1 Glycosyl transferase, group 2 family protein n=1... 108 4e-22 UniRef50_C4G9K6 Putative uncharacterized protein n=1 Tax=Shuttle... 108 4e-22 UniRef50_C0QZN4 Putative glycosyl transferase family 2 protein n... 108 4e-22 UniRef50_A6L2Z0 Glycosyltransferase family 2 n=1 Tax=Bacteroides... 107 5e-22 UniRef50_Q1Z875 Glycosyltransferase n=1 Tax=Photobacterium profu... 107 5e-22 UniRef50_UPI00019688E7 hypothetical protein BACCELL_03020 n=1 Ta... 107 5e-22 UniRef50_A8RE09 Putative uncharacterized protein n=1 Tax=Eubacte... 107 5e-22 UniRef50_Q8ABQ9 Glycoside transferase family 2 n=1 Tax=Bacteroid... 107 5e-22 UniRef50_Q2NH61 Predicted glycosyltransferase n=1 Tax=Methanosph... 107 6e-22 UniRef50_C2FKV3 Glycosyltransferase n=1 Tax=Lactobacillus planta... 107 6e-22 UniRef50_Q8XN54 Capsular polysaccharide biosynthsis protein n=1 ... 107 7e-22 UniRef50_B2ILQ1 Ss-1,4-galactosyltransferase n=6 Tax=Streptococc... 107 7e-22 UniRef50_C6JQJ1 Glycosyl transferase n=1 Tax=Fusobacterium variu... 107 8e-22 UniRef50_Q1WU29 Glycosyltransferase n=3 Tax=Lactobacillus RepID=... 107 8e-22 UniRef50_C3QVK6 Glycosyl transferase n=4 Tax=Bacteroides RepID=C... 107 8e-22 UniRef50_A7V9T0 Putative uncharacterized protein n=1 Tax=Bactero... 107 8e-22 UniRef50_C5VMR5 Glycosyl transferase, family 2 n=1 Tax=Prevotell... 107 8e-22 UniRef50_Q93DZ8 EpsG n=2 Tax=Lactobacillales RepID=Q93DZ8_STRTR 107 9e-22 UniRef50_C7UWL4 Glycosyl transferase n=6 Tax=Enterococcus RepID=... 107 9e-22 UniRef50_C6X0E9 Beta-1,3-glucosyltransferase n=1 Tax=Flavobacter... 107 9e-22 UniRef50_A6L7A5 Glycosyltransferase family 2 n=6 Tax=Bacteroides... 106 9e-22 UniRef50_C6IMR6 Putative uncharacterized protein n=1 Tax=Bactero... 106 1e-21 UniRef50_Q9F0B8 Beta(1,4)galactosyltransferase EpsJ n=1 Tax=Lact... 106 1e-21 UniRef50_B0P2L5 Putative uncharacterized protein n=1 Tax=Clostri... 106 1e-21 UniRef50_C0B702 Putative uncharacterized protein n=1 Tax=Coproco... 106 1e-21 UniRef50_C3WCW1 Sugar transferase n=1 Tax=Fusobacterium mortifer... 106 1e-21 UniRef50_A2STD8 Glycosyl transferase, family 2 n=1 Tax=Methanoco... 106 1e-21 UniRef50_B0ZTM8 Putative uncharacterized protein n=1 Tax=Campylo... 106 1e-21 UniRef50_Q5L9S8 Putative LPS biosynthesis related glycosyltransf... 106 1e-21 UniRef50_C0BRQ0 Putative uncharacterized protein n=2 Tax=Bifidob... 106 1e-21 UniRef50_B5JRG3 Glycosyl transferase, group 2 family protein n=1... 106 1e-21 UniRef50_UPI0001BC33C7 N-acetylglucosaminyltransferase n=1 Tax=B... 106 2e-21 UniRef50_B0NG25 Putative uncharacterized protein n=1 Tax=Clostri... 106 2e-21 UniRef50_A4VTV1 Glycosyltransferases involved in cell wall bioge... 106 2e-21 UniRef50_Q8RSN9 Putative glycosyl transferase n=1 Tax=Streptococ... 105 2e-21 UniRef50_Q8A2D4 Glycoside transferase family 2 n=2 Tax=Bacteroid... 105 2e-21 UniRef50_A6BH35 Putative uncharacterized protein n=1 Tax=Dorea l... 105 2e-21 UniRef50_B0P4B2 Putative uncharacterized protein n=1 Tax=Clostri... 105 2e-21 UniRef50_B0P2L7 Putative uncharacterized protein n=1 Tax=Clostri... 105 2e-21 UniRef50_Q1WRW5 Glycosyltransferase n=1 Tax=Lactobacillus saliva... 105 2e-21 UniRef50_B3CA81 Putative uncharacterized protein n=1 Tax=Bactero... 105 2e-21 UniRef50_C1QCB7 Glycosyl transferase n=1 Tax=Brachyspira murdoch... 105 2e-21 UniRef50_A1R005 SS-1,4-galactosyltransferase n=23 Tax=Borrelia R... 105 2e-21 UniRef50_B2RH99 Probable glycosyltransferase n=2 Tax=Porphyromon... 105 3e-21 UniRef50_A8EVS8 Probable glycosyltransferase n=1 Tax=Arcobacter ... 105 3e-21 UniRef50_C2LRT9 Glycosyl transferase, family 2/glycosyl transfer... 105 3e-21 UniRef50_C6LFJ2 Glycosyl transferase, family 2/glycosyl transfer... 105 3e-21 UniRef50_C7G7U9 Putative capsular polysaccharide biosythesis pro... 105 3e-21 UniRef50_C8PLK6 Ss-1,4-galactosyltransferase n=1 Tax=Campylobact... 105 4e-21 UniRef50_D2Q641 Glycosyl transferase family 2 n=2 Tax=Bifidobact... 104 4e-21 UniRef50_D1P3V3 Glycosyl transferase, group 2 family protein n=1... 104 4e-21 UniRef50_B2VF64 Predicted glycosyl transferase, family 2 n=3 Tax... 104 4e-21 UniRef50_A6BHE9 Putative uncharacterized protein n=1 Tax=Dorea l... 104 5e-21 UniRef50_Q8GPA4 Eps7G n=1 Tax=Streptococcus thermophilus RepID=Q... 104 5e-21 UniRef50_C6I7L6 Glycoside hydrolase n=1 Tax=Bacteroides sp. 3_2_... 104 5e-21 UniRef50_C5E891 Eps7G n=1 Tax=Bifidobacterium longum subsp. infa... 104 5e-21 UniRef50_A5ZUJ1 Putative uncharacterized protein n=1 Tax=Ruminoc... 104 5e-21 UniRef50_A5ZFA7 Putative uncharacterized protein n=1 Tax=Bactero... 104 5e-21 UniRef50_C0WDB4 Ss-1,4-galactosyltransferase n=1 Tax=Acidaminoco... 104 6e-21 UniRef50_D0T9V5 Glycosyltransferase family 2 n=1 Tax=Bacteroides... 104 6e-21 UniRef50_UPI0001BC50FC capsular polysaccharide biosynthsis prote... 103 6e-21 UniRef50_D1PNS5 Capsular polysaccharide biosythesis protein CpsI... 103 6e-21 UniRef50_D0BKT7 Capsular polysaccharide biosynthesis protein Cps... 103 6e-21 UniRef50_D0DSK1 Glycosyl transferase n=1 Tax=Lactobacillus ferme... 103 7e-21 UniRef50_Q4JZC9 Putative glycosyl transferase n=2 Tax=Streptococ... 103 7e-21 UniRef50_A7B4B8 Putative uncharacterized protein n=1 Tax=Ruminoc... 103 7e-21 UniRef50_A6BHF1 Putative uncharacterized protein n=1 Tax=Dorea l... 103 8e-21 UniRef50_A6BH38 Putative uncharacterized protein n=1 Tax=Dorea l... 103 8e-21 UniRef50_C9RN10 Glycosyl transferase family 2 n=1 Tax=Fibrobacte... 103 8e-21 UniRef50_Q14I39 Glycosyl transferase family protein n=18 Tax=Fra... 103 8e-21 UniRef50_B2UQ58 Glycosyl transferase family 2 n=1 Tax=Akkermansi... 103 8e-21 UniRef50_B0P6V0 Putative uncharacterized protein n=1 Tax=Anaerot... 103 8e-21 UniRef50_A4CKB6 Putative uncharacterized protein n=2 Tax=Flavoba... 103 9e-21 UniRef50_C3JBV0 Capsular polysaccharide biosynthsis protein n=2 ... 103 9e-21 UniRef50_Q2NH62 Predicted glycosyltransferase n=1 Tax=Methanosph... 103 9e-21 UniRef50_D2TC09 Predicted glycosyl transferase, family 2 n=2 Tax... 103 9e-21 UniRef50_D0WW97 Putative uncharacterized protein n=2 Tax=Vibrio ... 103 9e-21 UniRef50_C9LAR5 Putative beta1,3-glucosyltransferase WaaV n=1 Ta... 103 1e-20 UniRef50_Q9CH23 Teichoic acid biosynthesis protein n=1 Tax=Lacto... 103 1e-20 UniRef50_A3KLK8 Cps5J protein n=2 Tax=Streptococcus agalactiae R... 103 1e-20 UniRef50_Q8XN34 Spore coat polysaccharide biosynthesis protein n... 103 1e-20 UniRef50_C5NWI7 Glycosyl transferase, group 2 family n=1 Tax=Gem... 103 1e-20 UniRef50_B0NLZ9 Putative uncharacterized protein n=1 Tax=Bactero... 103 1e-20 UniRef50_D2BMR3 Glycosyltransferase, family 2 n=1 Tax=Lactococcu... 103 1e-20 UniRef50_B6W7G9 Putative uncharacterized protein n=1 Tax=Anaeroc... 102 1e-20 >UniRef50_P11290 Uncharacterized glycosyltransferase yibD n=58 Tax=Enterobacteriaceae RepID=YIBD_ECOLI Length = 344 Score = 721 bits (1860), Expect = 0.0, Method: Compositional matrix adjust. Identities = 344/344 (100%), Positives = 344/344 (100%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY Sbjct: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC Sbjct: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN 180 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN Sbjct: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN 180 Query: 181 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYI 240 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYI Sbjct: 181 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYI 240 Query: 241 KITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMY 300 KITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMY Sbjct: 241 KITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMY 300 Query: 301 KRLITNVRSVKVGYQALLWSFRLWQWRDKTRSHHRITRSAFNLR 344 KRLITNVRSVKVGYQALLWSFRLWQWRDKTRSHHRITRSAFNLR Sbjct: 301 KRLITNVRSVKVGYQALLWSFRLWQWRDKTRSHHRITRSAFNLR 344 >UniRef50_B4SXB8 Glycosyl transferase, group 2 family protein n=36 Tax=Enterobacteriaceae RepID=B4SXB8_SALNS Length = 344 Score = 597 bits (1538), Expect = e-169, Method: Compositional matrix adjust. Identities = 276/343 (80%), Positives = 303/343 (88%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M NS K+S+I+PLYNAG DF CM SLI QTW+ALEIII+NDGSTD+SVEIAK+YAE+Y Sbjct: 1 MKNSKTKVSIIVPLYNAGADFNACMASLIAQTWSALEIIIVNDGSTDHSVEIAKHYAEHY 60 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 PHVRLLHQANAGASVARN G++ ATG YVAFVDADD+VYP MYETLMTMAL DDLDVAQC Sbjct: 61 PHVRLLHQANAGASVARNLGLQAATGDYVAFVDADDQVYPKMYETLMTMALNDDLDVAQC 120 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN 180 NADWC R+TG WQSIPTDRLRSTGVL+GPDWLRM L+SRRWTHVVWMGVYRR +I NN Sbjct: 121 NADWCVRKTGHAWQSIPTDRLRSTGVLSGPDWLRMALASRRWTHVVWMGVYRRALITDNN 180 Query: 181 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYI 240 I F+ GLHHQDI+W+TE MFNA R RYTEQSLYKY+LH+ SVSRL RQGNKNLNYQRHYI Sbjct: 181 ITFVPGLHHQDILWSTEVMFNATRVRYTEQSLYKYFLHDNSVSRLQRQGNKNLNYQRHYI 240 Query: 241 KITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMY 300 KITRLLEKLNR+YA +I IYPEF QQIT+EALRVCHAVRKEPDILTRQRMIAEIFTSGMY Sbjct: 241 KITRLLEKLNRDYARRIPIYPEFRQQITWEALRVCHAVRKEPDILTRQRMIAEIFTSGMY 300 Query: 301 KRLITNVRSVKVGYQALLWSFRLWQWRDKTRSHHRITRSAFNL 343 +R++ NVRS K YQ LLWSFRLWQWRDKT SH R+ R A NL Sbjct: 301 RRMMANVRSAKAAYQTLLWSFRLWQWRDKTLSHRRMARKALNL 343 >UniRef50_A8GFL8 Glycosyl transferase family 2 n=1 Tax=Serratia proteamaculans 568 RepID=A8GFL8_SERP5 Length = 332 Score = 323 bits (829), Expect = 4e-87, Method: Compositional matrix adjust. Identities = 156/320 (48%), Positives = 209/320 (65%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 LSVI+PL+N G+ F + SL+ Q LE+II+NDGSTD S +IA YA+ + H+ ++ Sbjct: 2 LSVIVPLHNVGELFEPFLASLLAQNERRLEVIIVNDGSTDGSGDIAHRYAQQHTHITVID 61 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 Q NAG S ARN G+ VA GKYVAF DADD + P MY TL+ +A ++ LDV QCN + F Sbjct: 62 QLNAGVSCARNAGLAVARGKYVAFPDADDLLAPEMYSTLIELAEQNQLDVMQCNGERYFT 121 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 + E IP RL T V++G WL L SR++ HVVW+ VYR D I ++ + F GL Sbjct: 122 DQDELQPIIPLKRLGDTDVISGVQWLERALESRKFIHVVWLAVYRLDFIKQHRLYFEPGL 181 Query: 188 HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRLLE 247 HHQDI WTTE MFNA R +Y ++LY+ +H S+S R G N+ YQRHY+KI +L Sbjct: 182 HHQDIPWTTEVMFNAQRVKYLGKALYRQRVHGQSISNRRRTGKANVEYQRHYMKIVAMLV 241 Query: 248 KLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLITNV 307 LNR YADKI I P FH QIT EAL +CH++R+EP++ + R+ E F G+ + ++ N Sbjct: 242 ALNRRYADKIAIRPAFHWQITREALGICHSIRREPELAAQCRITDEFFQLGIDRAMVDNA 301 Query: 308 RSVKVGYQALLWSFRLWQWR 327 R +K G+ +LW RL QWR Sbjct: 302 RGLKQGWHVILWLHRLKQWR 321 >UniRef50_D1RQX9 Putative glycosyl transferase n=1 Tax=Serratia odorifera 4Rx13 RepID=D1RQX9_SEROD Length = 335 Score = 321 bits (822), Expect = 3e-86, Method: Compositional matrix adjust. Identities = 153/321 (47%), Positives = 206/321 (64%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 LSVI+PL+NAG F M SL+ Q LE+II+NDGSTD S IA YA YPH+ ++ Sbjct: 5 LSVIVPLHNAGALFEPFMASLLAQDERRLEVIIVNDGSTDGSGAIAHRYARQYPHITVID 64 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 Q N+G S ARN G+ +A GKYVAF DADD + P MY TL+ +A ++ LDV QCN F Sbjct: 65 QPNSGVSCARNAGLAIAQGKYVAFPDADDLLAPEMYSTLVALAEQEQLDVMQCNGQRYFS 124 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 E + IP RL TGV++G WL L +R++ HVVW+ VYR D I ++ + F GL Sbjct: 125 RENELQEIIPQKRLGDTGVISGAQWLERALKTRKFIHVVWLAVYRLDFIKRHQLYFEPGL 184 Query: 188 HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRLLE 247 HHQDI WTTE MFNA R +Y ++LY+ +H+ S+S R G N+ YQRHY+KI +L Sbjct: 185 HHQDIPWTTEVMFNAQRVKYLSKALYRQRVHDQSISNRRRTGKANVEYQRHYMKIVEMLV 244 Query: 248 KLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLITNV 307 LN+ YA K++I P FH QI EAL +CH++R+EP+ + ++ AE + G+ + + N Sbjct: 245 VLNQRYASKMVIRPAFHWQIAREALGICHSIRREPEPAAQCQITAEFYQRGIDRTMAANA 304 Query: 308 RSVKVGYQALLWSFRLWQWRD 328 R +K + LW RL QWRD Sbjct: 305 RGLKQIWHVTLWLHRLKQWRD 325 >UniRef50_B5XTH2 Glycosyl transferase, group 2 family protein n=21 Tax=Enterobacteriaceae RepID=B5XTH2_KLEP3 Length = 329 Score = 236 bits (603), Expect = 7e-61, Method: Compositional matrix adjust. Identities = 120/311 (38%), Positives = 176/311 (56%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M+ T LS++ +YN E L Q + E+I++NDGSTDNS+ + + + Sbjct: 1 MSQTPLLSIVAAVYNGEKFLAQFFECLEQQQLESYELILVNDGSTDNSLAVIAEWRDRLQ 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 +V++L Q N G SVARN G+ A+GKY+AF D DD++YP MY TL+ MA ++ LD+A CN Sbjct: 61 NVQVLEQENQGVSVARNTGLAAASGKYLAFPDIDDKLYPGMYRTLLEMAEKEHLDIATCN 120 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNI 181 + + + E+ P DRL STGVL G WL+ L SR++ HV W+ +YR D I ++ Sbjct: 121 GTYVYEKRRESHPIFPLDRLPSTGVLPGHVWLKQALDSRKFLHVTWLNIYRHDFIRQHRF 180 Query: 182 KFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIK 241 F GL HQDI WTTE + A R +YT Q Y YY+H+ SVS + + RHY+K Sbjct: 181 HFEPGLRHQDIPWTTEALLAAERVQYTSQQFYDYYIHSESVSHKPDNDDTLMRSARHYMK 240 Query: 242 ITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYK 301 I +LE +N+ Y DK+ QI E L + H D + +I E F G+++ Sbjct: 241 ILEMLEAINQRYPDKVRHIAACRWQIAKEGLGIIHTFDSMKDESKKHVIIKEFFDRGIWR 300 Query: 302 RLITNVRSVKV 312 + N + ++ Sbjct: 301 LIWKNACTFRL 311 >UniRef50_B4F184 Glycosyl transferase n=2 Tax=Proteus mirabilis RepID=B4F184_PROMH Length = 327 Score = 158 bits (399), Expect = 3e-37, Method: Compositional matrix adjust. Identities = 82/248 (33%), Positives = 136/248 (54%), Gaps = 3/248 (1%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 T +SVI+P++N +ESL+ QT+ E+IIINDGSTDNSV+IA Y + Sbjct: 2 KTPIISVIVPMFNEASRITRLLESLLCQTFHDFEVIIINDGSTDNSVDIAMLYCQQDNRF 61 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 L HQ N G S ARN G++ A G ++ F D+DD + P + +A E +DV N + Sbjct: 62 HLYHQTNQGLSSARNTGLKYAQGDWIVFFDSDDFIKPQLLAHWHQLACEQHIDVLIGNGE 121 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 + Q+ R V++G +W+ ++ +W H VW+ R ++I K++++F Sbjct: 122 RYDVHNPQKHQTTIHQRQPYQQVISGQEWVIHAVTQHQWPHFVWLQFIRHEIIKKHHLRF 181 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSR---LHRQGNKNLNYQRHYI 240 I GL+H+DI+WTT+ R + +Q LY Y + S++R ++ + +Y + I Sbjct: 182 IDGLYHEDILWTTQLALVTQRIAFDDQPLYYYCANPNSITRKPNPQKEAKRAQSYLQIVI 241 Query: 241 KITRLLEK 248 ++ +L +K Sbjct: 242 QLLQLADK 249 >UniRef50_D1NJF8 Glycosyl transferase family 2 n=1 Tax=Clostridium thermocellum JW20 RepID=D1NJF8_CLOTM Length = 322 Score = 145 bits (367), Expect = 2e-33, Method: Compositional matrix adjust. Identities = 84/253 (33%), Positives = 140/253 (55%), Gaps = 14/253 (5%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +SVI+P+YN C+ES++ QT++ LEII++NDGS D EI YA+ V+++H Sbjct: 4 ISVIVPVYNVEKYLPACIESILCQTYSELEIILVNDGSPDKCPEICDEYAKKDSRVKVIH 63 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 Q+N G S ARN GI+ ++GKY+ FVD+DD + PTM E L T ++ + +A C + F Sbjct: 64 QSNQGLSSARNTGIDNSSGKYIIFVDSDDTIEPTMVEDLYTCVIKYNCSMAACGRKYVF- 122 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 E G+ +P + + V + D +R + + W +Y+R++ +NI+F G Sbjct: 123 EDGKVVCKVPDNIEK---VFSFSDAIREMNTYFYFDMSAWAKIYKRELF--DNIRFPVGR 177 Query: 188 HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR-LL 246 +D + + + + Y + LY Y +S+SR NKN+N+ +I R + Sbjct: 178 LSEDFFIMYKLIALSQKVAYISKPLYNYLQRKSSISR-----NKNINHD--FITAAREQM 230 Query: 247 EKLNRNYADKIMI 259 E L++NY + +I Sbjct: 231 EFLDKNYPELAII 243 >UniRef50_C4K3K5 Glycosyltransferase group 2 n=3 Tax=Enterobacteriaceae RepID=C4K3K5_HAMD5 Length = 346 Score = 142 bits (358), Expect = 2e-32, Method: Compositional matrix adjust. Identities = 94/277 (33%), Positives = 146/277 (52%), Gaps = 17/277 (6%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 MNS + LSVIIP++NA +D ESL Q E+I INDGSTD S I + A+ Sbjct: 1 MNSID-LSVIIPVFNAENDLMELFESLKNQKNLFYEVIAINDGSTDQSQIILEKIAQIDK 59 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 L+ Q + G SVARN+GI A G+++AF D+DD + P +ET + A + DLD N Sbjct: 60 RWVLISQPHRGVSVARNQGIARARGRWIAFTDSDDWLAPQAFETWVQQAEKGDLDCLVGN 119 Query: 122 ADWCFRETGETWQSIPTDRLRST--GVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKN 179 FR + T R +LTG +W+ + W H V++ + +R ++KN Sbjct: 120 G---FRFKLNPQKPTETLLYRQPWGKILTGREWIIQSVHKNEWPHYVFLQLVKRAFLIKN 176 Query: 180 NIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSR---LHRQGNKNLNYQ 236 ++FI + H+DI+WTTE A R + + LY Y ++ S+++ + R ++ ++Y Sbjct: 177 QLRFIPNMLHEDILWTTELAEVAQRMGFCAEPLYGYRINPESITQSTSIERLAHRAISY- 235 Query: 237 RHYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALR 273 ++EKL N A+ + FHQ + +A R Sbjct: 236 ------LNIIEKL-LNKAEAVKQDKAFHQALLKQAHR 265 >UniRef50_C3WCV9 Glycosyl transferase n=1 Tax=Fusobacterium mortiferum ATCC 9817 RepID=C3WCV9_FUSMR Length = 331 Score = 140 bits (352), Expect = 8e-32, Method: Compositional matrix adjust. Identities = 91/323 (28%), Positives = 167/323 (51%), Gaps = 24/323 (7%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPH-VRL 65 K+SVI+P+YN + C++S++ QT +EII+INDGSTDNS I + Y++ YP +R Sbjct: 5 KVSVIVPIYNTEKFLKKCIDSIVNQTLEEIEIILINDGSTDNSHNICEEYSKKYPQKIRY 64 Query: 66 LHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWC 125 ++ N G S +RN G+ +A G+Y+AFVD+DD + MY+ + A +++LD+ C + Sbjct: 65 INNKNIGCSASRNLGMSLAQGEYIAFVDSDDYIDKEMYQEMYEKAKKENLDIVVCGITYV 124 Query: 126 FRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIA 185 +ET + SIP++ + + L + RM + +YR+++I NI F Sbjct: 125 NQETKKRISSIPSNISKKSDYLIYKN--RMANPVNK--------LYRKNII--KNILFPI 172 Query: 186 GLHH-QDIVWTTEFMFNALRARYTEQSLYKYYLH-NTSVSRLHRQGNKNLNYQRHYIKIT 243 H +DIV+ + + A + + E+S Y Y H N S+ L ++ + ++ YI + Sbjct: 173 DTHSGEDIVFCYKSILKAKKINHIEKSYYYYIYHGNNSIFNLEKRLGIFIAFKELYIYLK 232 Query: 244 RLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRL 303 + +Y I +F++ + A+R + P +++ + E + YK L Sbjct: 233 ------DNDYLKNKEIIKKFYENFNFYAIRGAFFMLLNPKLVSNEEY--EKYDKLFYKEL 284 Query: 304 ITNVRSVKVGYQALLWSFRLWQW 326 ++ + + L++ +R W Sbjct: 285 -KKIKFLPFKSKVLVYYYRNIIW 306 >UniRef50_D1BQ92 Glycosyl transferase family 2 n=1 Tax=Veillonella parvula DSM 2008 RepID=D1BQ92_VEIPT Length = 322 Score = 140 bits (352), Expect = 9e-32, Method: Compositional matrix adjust. Identities = 80/217 (36%), Positives = 122/217 (56%), Gaps = 9/217 (4%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +SVI+P+YN C+ES+ QT+ LEII++NDGSTD S +I YA ++++H Sbjct: 4 ISVIVPVYNVESYVAECIESIQNQTYMNLEIILVNDGSTDASGDICDKYAAYDERIQVIH 63 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 + NAG S ARN GIE A G Y+ FVD+DD + PTMYE ++ + E DLD+ +C A FR Sbjct: 64 KENAGVSAARNTGIESANGDYIGFVDSDDYIAPTMYEDMLKLMAEHDLDIIECTA---FR 120 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 G+T +I S + + L+M + VW +Y+R VI ++++F G Sbjct: 121 NNGDT--NIEGCNDGSLEIFNRDEALKMAMYD--CFVAVWSQLYKRRVI--SDVRFPVGR 174 Query: 188 HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSR 224 +D + F+ N R + + LY Y L+ S ++ Sbjct: 175 KFEDSAVSYLFIANTKRVGHINRCLYYYRLNPNSTTQ 211 >UniRef50_Q84GX9 Orf34 n=2 Tax=Photorhabdus luminescens RepID=Q84GX9_PHOLU Length = 328 Score = 135 bits (340), Expect = 2e-30, Method: Compositional matrix adjust. Identities = 73/218 (33%), Positives = 118/218 (54%), Gaps = 2/218 (0%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +++IIPL+N ++S+ Q E++IINDGSTDNS+ + Y + +R+ + Sbjct: 6 ITIIIPLFNNESYIACLLDSISVQENVNFEVVIINDGSTDNSLGVINEYKKRDNRIRIFN 65 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 Q N G SVARN GI+ A G++V F+D+DD + P + + A+ LDV N F Sbjct: 66 QENRGVSVARNHGIKYARGEWVLFIDSDDWLKPQILRKWLDQAIAQKLDVLIGNG-VSFT 124 Query: 128 ETGETWQSIPTDRLRSTG-VLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAG 186 E ++ R + G VLTG +W+ + R W H VW+ + ++++I+ N + F Sbjct: 125 EEPYLQENKRITRRQPYGRVLTGKEWIEYCVECREWPHFVWLQLIKKELIINNQLSFKDN 184 Query: 187 LHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSR 224 H+DI+WT A R + E+ LY Y + S++R Sbjct: 185 TLHEDILWTINLSLIANRIGFCEELLYCYRHNLESITR 222 >UniRef50_A6LE28 Glycosyltransferase family 2 n=1 Tax=Parabacteroides distasonis ATCC 8503 RepID=A6LE28_PARD8 Length = 329 Score = 134 bits (336), Expect = 6e-30, Method: Compositional matrix adjust. Identities = 76/234 (32%), Positives = 126/234 (53%), Gaps = 20/234 (8%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQ--TWTALEIIIINDGSTDNSVEIAKYYAEN 59 MN+ S+IIP YNA C+ES + Q + EII I+DGS DN++ I +A N Sbjct: 1 MNTELLTSIIIPCYNAEKYIHKCLESCLNQRLDYNTFEIIAIDDGSEDNTLNILHEFAHN 60 Query: 60 YP--HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDV 117 P H+++L Q NAG S ARN G+E A GKY+ ++D+DD ++ A+++ LD+ Sbjct: 61 NPQLHMQILTQKNAGQSRARNHGLEYAHGKYILYLDSDDFFVDNTISEILYTAIDNQLDM 120 Query: 118 -------AQCNADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGV 170 N + + + +PTD +++G D+++ + + +VW + Sbjct: 121 LWFDHQHVDENYNLLPLPQADCKKHVPTD------IMSGIDFIKKAFNH---SGMVWQFI 171 Query: 171 YRRDVIVKNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSR 224 +R+D ++KNNI F G+ QDIV+T + ++ R +Y Y Y + + SV+R Sbjct: 172 FRKDFLIKNNILFFDGIIMQDIVYTLQCLYKCKRVQYYPICAYNYLIRSNSVTR 225 >UniRef50_D2RIJ6 Glycosyl transferase family 2 n=1 Tax=Acidaminococcus fermentans DSM 20731 RepID=D2RIJ6_ACIFE Length = 310 Score = 133 bits (334), Expect = 1e-29, Method: Compositional matrix adjust. Identities = 74/220 (33%), Positives = 124/220 (56%), Gaps = 6/220 (2%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 LSVI+P+Y C++SL+ QT+ L+II+++DGS DNS I YYA+ + ++H Sbjct: 6 LSVIVPVYKVEKFLDRCIQSLLKQTYRNLQIILVDDGSPDNSGIICDYYAKKDFRIEVIH 65 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 Q+N G S ARNRG+ A G+Y+AF+D+DD + P MYETL+ + +++LDVA+C+ D Sbjct: 66 QSNKGLSGARNRGLIEARGEYIAFLDSDDWIEPDMYETLIGLMEKNNLDVARCSID---- 121 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 ET ++ + T + + + VVW VY+RD++ + I G Sbjct: 122 ETDGSFHRLKTPSKQYANRVYENNTCLDLYFHEFLCKVVWNAVYKRDIV--DGILSPEGH 179 Query: 188 HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHR 227 +D + ++F R T + L+ Y+++ ++R H+ Sbjct: 180 QSEDNYVSGRYLFRCKRIWITSKVLHHYWMNPDGITRAHK 219 >UniRef50_A8UPC2 Glycosyltransferase n=1 Tax=Flavobacteriales bacterium ALC-1 RepID=A8UPC2_9FLAO Length = 329 Score = 131 bits (330), Expect = 3e-29, Method: Compositional matrix adjust. Identities = 76/219 (34%), Positives = 125/219 (57%), Gaps = 11/219 (5%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S+IIP+YN D C+ESLI QT+ LEII++NDGS D+S I YA N +++ H Sbjct: 5 ISIIIPVYNVEDYLDRCIESLINQTYKNLEIILVNDGSKDSSGAICDRYALNDDRIKVYH 64 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWC-F 126 N G+S+ARN G++ + G Y+ FVD+DD V P M+E L++ A++++L V + ++ Sbjct: 65 IENGGSSIARNYGLKKSKGDYIGFVDSDDWVKPEMFEQLISFAIKNNLKVIETSSVHSHL 124 Query: 127 RETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAG 186 + G +I T ++ L+ + ++R+ VW +Y+R ++ + FI G Sbjct: 125 VKEGTVKNNIITAKIEDKNTA-----LKRIIKNKRFA--VWRRIYQRSIL--QDRYFIEG 175 Query: 187 LHHQDIVWTTEFMFNALRARYTEQSLYKYYLHN-TSVSR 224 + HQD+ +T + + + Y E Y Y + N TSV R Sbjct: 176 ILHQDVYYTLDIINEVSQIGYFENQHYVYNVQNPTSVIR 214 >UniRef50_B7HZB4 Beta-1,3-N-acetylglucosaminyltransferase n=37 Tax=Bacillus cereus group RepID=B7HZB4_BACC7 Length = 326 Score = 130 bits (327), Expect = 7e-29, Method: Compositional matrix adjust. Identities = 86/262 (32%), Positives = 136/262 (51%), Gaps = 38/262 (14%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 MM +SV+IPLYNA M+S++ QT+ +EI+I++DGS D S I K AE Y Sbjct: 1 MMKQNQLVSVVIPLYNAEKYIEETMQSILDQTYKNIEIVIVDDGSKDQSPSIVKNLAEKY 60 Query: 61 P-HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQ 119 P V+ +HQ N G SVARN GIE A+G+Y+AF+D+DD +PT E + ++++D Sbjct: 61 PGQVKYVHQKNQGVSVARNTGIENASGEYIAFLDSDDLWHPTKIEKQVKSIHKNNMDACY 120 Query: 120 CNADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVV------WMGVYRR 173 C + ETGE V ++++ +S TH V W +++R Sbjct: 121 CGYMNFYEETGEK-------------VEHTTNFIKGDMSKAFLTHQVVAQTSTW--IFKR 165 Query: 174 DVIVKNNIKFIAGLH-HQDIVWTTEFMFNALRAR---YTEQSLYKYYLHNTSVSRLHRQG 229 +++ +NI+F G +D+ EF+F + Y ++ L Y R+ +G Sbjct: 166 AIVMDHNIRFTPGCSWGEDL----EFLFKLMSVTNVCYVDEYLTYY--------RILSEG 213 Query: 230 NKNLNYQRHYIKITRLLEKLNR 251 N + Y+ + +K T+ LE NR Sbjct: 214 NLSSKYKDYELKTTKELEVFNR 235 >UniRef50_P71059 Uncharacterized glycosyltransferase epsJ n=3 Tax=Bacillus subtilis group RepID=EPSJ_BACSU Length = 344 Score = 128 bits (322), Expect = 3e-28, Method: Compositional matrix adjust. Identities = 76/240 (31%), Positives = 129/240 (53%), Gaps = 17/240 (7%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S+I+P+YN C++SL+ QT + +EII++NDG+ D S EIA+ YA+ +R++H Sbjct: 5 VSIIVPMYNVEPFIEECIDSLLRQTLSDIEIILVNDGTPDRSGEIAEDYAKRDARIRVIH 64 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 QAN G S ARN GI+ A G Y+ FVD DD V M++ L A ++ LD+ C ++ Sbjct: 65 QANGGLSSARNTGIKAARGTYIGFVDGDDYVSSAMFQRLTEEAEQNQLDIVGCG---FYK 121 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRM--GLSSRRWTHVVWMGVYRRDVIVKNNIKFIA 185 ++ + +P +L + VLT P+ R+ VW +YRR++ + N+ F Sbjct: 122 QSSDRRTYVPP-QLEANRVLTKPEMTEQLKHAHETRFIWYVWRYLYRRELFERANLLF-- 178 Query: 186 GLHHQDIVWTTEFMFN------ALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHY 239 +DI + + FN A R + ++ LY Y + S++ + + + + Q+ Y Sbjct: 179 ---DEDIRFAEDSPFNLSAFREAERVKMLDEGLYIYRENPNSLTEIPYKPAMDEHLQKQY 235 >UniRef50_C4FFR1 Putative uncharacterized protein n=1 Tax=Bifidobacterium angulatum DSM 20098 RepID=C4FFR1_9BIFI Length = 327 Score = 127 bits (320), Expect = 4e-28, Method: Compositional matrix adjust. Identities = 65/217 (29%), Positives = 114/217 (52%), Gaps = 6/217 (2%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +SV++P+YN C++S++ QT+T +EII+++DGSTD S I Y+ P + ++H Sbjct: 9 ISVVVPVYNVKPYVAKCLDSILRQTYTNIEIIVVDDGSTDGSELICDDYSNKDPRITVIH 68 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 Q N G + ARN GI+ A G+Y+ FVD+DD + P MYE L+ A +++ +A C ++ F Sbjct: 69 QRNTGLAAARNTGIDAAHGEYLGFVDSDDFIEPFMYEKLLNAAQKNNCVLAVCGINYVF- 127 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 + IP + V PD + + R + W +Y+ ++ + I+F G Sbjct: 128 ---DNGTVIPKANIEPDQVFDFPDAITEMNTYRLFDMGAWSKLYKSELF--DGIRFPVGK 182 Query: 188 HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSR 224 +D + A + + + Y YY S+++ Sbjct: 183 LSEDFFIMFKIFDRAQKVAFVSDACYNYYQRTNSITK 219 >UniRef50_Q02W55 Glycosyltransferase n=3 Tax=Lactococcus lactis RepID=Q02W55_LACLS Length = 319 Score = 127 bits (319), Expect = 6e-28, Method: Compositional matrix adjust. Identities = 73/209 (34%), Positives = 114/209 (54%), Gaps = 1/209 (0%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K+SVI+P+YN+ RTC+ESLI QT +EII++NDGSTD S++ E Y +RL+ Sbjct: 5 KVSVIVPIYNSEKYIRTCLESLIKQTLKPIEIILVNDGSTDRSIDHIFDLIEKYNFIRLI 64 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 N+GA+ ARN+G+ +A G Y++FVD+DD V+ T E L A LD+ + + + Sbjct: 65 EIENSGAAEARNKGLALARGNYISFVDSDDFVHSTFLEKLYQEAENKKLDIICGSFEATY 124 Query: 127 RETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAG 186 + + LR+ GV G + + R + ++W +YRR+ + KN++ Sbjct: 125 KNKVSKKMERNNELLRA-GVQEGIVLFKKQIQLRNYIPMIWAYMYRREFLTKNSLWLSPK 183 Query: 187 LHHQDIVWTTEFMFNALRARYTEQSLYKY 215 H+D +T A RA+ E Y Y Sbjct: 184 TIHEDEEFTPRVWNLAKRAKIIETFEYIY 212 >UniRef50_Q93DZ7 Eps1I n=1 Tax=Streptococcus thermophilus RepID=Q93DZ7_STRTR Length = 316 Score = 125 bits (315), Expect = 2e-27, Method: Compositional matrix adjust. Identities = 81/226 (35%), Positives = 125/226 (55%), Gaps = 20/226 (8%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S+I+P YN RTC+ES++ QT+ +E+II+NDGSTD S+ + ++ ++++++ Sbjct: 5 ISIIVPTYNVEKYIRTCIESILAQTYRNIEVIIVNDGSTDQSLAVISDLICSHHNIKVIN 64 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 Q N G SVARN GI+ ATGKY+AFVDADD++ P +L +A + D+ + FR Sbjct: 65 QKNQGLSVARNTGIDAATGKYIAFVDADDKIKPDFVSSLYQIADKTGADIVR----GSFR 120 Query: 128 ETGETWQSIPTDRLRSTGVLT--GPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIA 185 + +IP + V T G L LSS + VVW +YR D I N+I+F Sbjct: 121 DFN---GNIPKGWVPDFNVPTNYGTIVLDQFLSS-NISFVVWSSIYRLDFINSNHIRFTP 176 Query: 186 GLHHQDIVWTT-EFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGN 230 G+ +D +T +M L A E + Y Y R++R G+ Sbjct: 177 GILFEDADFTIRAYMLAKLVATSPEPN-YAY--------RINRPGS 213 >UniRef50_C3AT96 Glycosyltransferase family 2 n=4 Tax=Bacillus RepID=C3AT96_BACMY Length = 343 Score = 125 bits (315), Expect = 2e-27, Method: Compositional matrix adjust. Identities = 80/216 (37%), Positives = 115/216 (53%), Gaps = 10/216 (4%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 + K+SVIIP+YNAG C+ESL+ QT E I INDGS D+S EI + Y + + Sbjct: 2 SQKVSVIIPVYNAGKYITRCIESLLNQTLQECEFIFINDGSIDDSREIIEGYKQVDDRII 61 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 L++Q N G S+ARN+G+ +A G+YV FVDADD + MYETL + + DV N Sbjct: 62 LINQVNQGVSIARNKGLHLAAGEYVGFVDADDCIEKDMYETLYNSVKQSNCDVVISN--- 118 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGP----DWLRMGLSSRRWTHVVWMGVYRRDVIVKNN 180 F+ E + I T VL + L L + VV +YR ++I +NN Sbjct: 119 -FKGETEGHKVITTYPFPIDIVLEKDYIEQEVLPYFLKADNLNTVV-NKIYRNNLIKENN 176 Query: 181 IKFIAGLH-HQDIVWTTEFMFNALRARYTEQSLYKY 215 +KF +G+ +D ++ F A R +Y + Y Y Sbjct: 177 VKFPSGIALGEDGMFNVAFFSGATRMKYVDYVGYHY 212 >UniRef50_B1JIU1 Glycosyl transferase family 2 n=28 Tax=Enterobacteriaceae RepID=B1JIU1_YERPY Length = 341 Score = 125 bits (314), Expect = 2e-27, Method: Compositional matrix adjust. Identities = 74/241 (30%), Positives = 116/241 (48%), Gaps = 27/241 (11%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 MNS + +SVI+P+YN + ++SL +QT +LEII +NDGSTD+S+ + + A Sbjct: 1 MNSVD-ISVIVPVYNCANYLTDLLDSLRSQTGVSLEIIAVNDGSTDHSLALLEQVAHIDS 59 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 + +++Q N G S ARN GI A G +VAF D DD + P T + + LDV N Sbjct: 60 RLVIINQQNQGLSEARNVGIAKAKGNWVAFADGDDWLAPHALSTWLQQVKQQKLDVLIGN 119 Query: 122 A-------------------DWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRW 162 W G+ W + + G W+ + + + W Sbjct: 120 GYQFSSNPERKPVTPLLGQQPWGATLNGQQWIILSVKQ-------NGQQWIILSVKQKEW 172 Query: 163 THVVWMGVYRRDVIVKNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSV 222 H W+ + RRDVI + ++FI + H+DI+WTT+ A R + Y Y + S+ Sbjct: 173 PHYAWLQLIRRDVISTHQLQFIRDMLHEDILWTTQLALVAQRIGFCATPFYGYRTNPDSI 232 Query: 223 S 223 + Sbjct: 233 T 233 >UniRef50_Q9RG46 Cps1I n=2 Tax=Streptococcus suis RepID=Q9RG46_STRSU Length = 322 Score = 125 bits (314), Expect = 2e-27, Method: Compositional matrix adjust. Identities = 78/219 (35%), Positives = 122/219 (55%), Gaps = 12/219 (5%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +SVI+P+YN D C+ S+I QT+T LE+I++NDGSTD+S +I Y +N ++ Sbjct: 5 ISVIVPIYNVQDYLDKCINSIINQTYTNLEVILVNDGSTDDSEKICLNYMKNDGRIKYYK 64 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF- 126 + N G + ARN G+E ATGKY+AFVD+DD + M+E + E + D+A+ D+C Sbjct: 65 KINGGLADARNFGLEHATGKYIAFVDSDDYIEVAMFERMHDNITEYNADIAE--IDFCLV 122 Query: 127 RETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAG 186 E G T + R + VLT + ++ LS + VW +Y RD+I +IKF Sbjct: 123 DENGYTKKK----RNSNFHVLTREETVKEFLSGSNIENNVWCKLYSRDII--KDIKFQIN 176 Query: 187 LHH--QDIVWTTEFMFNALRARY-TEQSLYKYYLHNTSV 222 +D+++ E + N R T + Y Y + N+S+ Sbjct: 177 NRSIGEDLLFNLEVLNNVTRVVVDTREYYYNYVIRNSSL 215 >UniRef50_C2FUI9 Glycosyltransferase yibD n=3 Tax=Bacteroidetes RepID=C2FUI9_9SPHI Length = 334 Score = 125 bits (313), Expect = 3e-27, Method: Compositional matrix adjust. Identities = 74/222 (33%), Positives = 123/222 (55%), Gaps = 15/222 (6%) Query: 6 NKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYA----ENYP 61 N +S+IIP++NA + + C+ SL+ QT+ LEI+ IND S DNS++I + + E Y Sbjct: 8 NSVSIIIPVFNADQNLKDCINSLLNQTYKNLEILFINDCSIDNSIQIIESFKEALLEKYG 67 Query: 62 HVRLL-HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 V++L H N G + ARN G++ ATG Y+ +VDADD + E L+ A++++ D+ C Sbjct: 68 RVQVLSHSYNKGVAAARNTGLDNATGDYIYYVDADDYIDHDAIEQLVNEAIKENADIVGC 127 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN 180 N + F + T ++ T L + + G+ RW +WM + +RD+ KN Sbjct: 128 NWNLTFSKNERT-----MNQPHFTSPLEAIEKMLYGV--MRWN--LWMFLVKRDLYTKNE 178 Query: 181 IKFIAGLH-HQDIVWTTEFMFNALRARYTEQSLYKYYLHNTS 221 I+FI G + +D++ + NA + + ++LY Y N S Sbjct: 179 IQFIPGQNIGEDMLVMFKLFVNAQKVCFLNKALYHYRQDNAS 220 >UniRef50_B1R2U2 Glycosyl transferase, group 2 family protein n=4 Tax=Firmicutes RepID=B1R2U2_CLOBU Length = 353 Score = 125 bits (313), Expect = 3e-27, Method: Compositional matrix adjust. Identities = 63/208 (30%), Positives = 112/208 (53%), Gaps = 7/208 (3%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S+I+P+YN C++S++ QT+ E+I+++DGSTD S +I + + ++++H Sbjct: 4 ISIIVPIYNVEKYVEKCVDSILKQTYINFELILVDDGSTDMSGKICDKFKKEDDRIKVIH 63 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 + N G S ARN G+++ATGKY+ F+D+DD + TMYE + A+EDD D+ C D + Sbjct: 64 KKNGGLSSARNEGVKLATGKYIGFIDSDDYIDKTMYEKMFNKAIEDDSDIVIC--DMTYN 121 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 G + I T GVL D + L + + +Y+R++ + F G Sbjct: 122 LNG---KDISTTIFEDFGVLNREDTILKYLENDYFRSHAQNKLYKRELF--EYVSFPEGK 176 Query: 188 HHQDIVWTTEFMFNALRARYTEQSLYKY 215 ++D+ + + N + + + LY Y Sbjct: 177 LYEDVATFYKLIHNCNKISFVNEKLYYY 204 >UniRef50_D0GIA2 Glycosyltransferase, group 2 family n=1 Tax=Leptotrichia goodfellowii F0264 RepID=D0GIA2_9FUSO Length = 322 Score = 124 bits (312), Expect = 3e-27, Method: Compositional matrix adjust. Identities = 76/261 (29%), Positives = 134/261 (51%), Gaps = 10/261 (3%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY--PHVR 64 KLS+I+P+YN C+ SL EIII+NDGSTD+S++I + + +N+ ++ Sbjct: 2 KLSIIVPVYNVEKYIEKCLNSLCNPEIEK-EIIIVNDGSTDSSLKIVEEFKKNHDKENII 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++ Q N G ARN G+ A G+YV+F+D+DD V +YE ++ ++D +D + Sbjct: 61 IISQENKGLPEARNTGLRKAKGEYVSFIDSDDFVDKKLYEIMVKEVIKDRVDYGIGKYSY 120 Query: 125 CF-----RETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKN 179 + +E ++ D +S V G + L++ ++ VW +YRR+ +V+N Sbjct: 121 FYENKDKKEYNDSEIKDIIDSFQSNSVKKGKELLKILKKEDKYGPEVWDDIYRREFLVEN 180 Query: 180 NIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHY 239 N+ F H+D ++T E A + +Y Y Y S+ + + N + Q++ Sbjct: 181 NLFFKPDRLHEDEIFTLEVFLKAEKVKYYAVPFYCYLQRENSIMKTQKVKN-FTDMQKNI 239 Query: 240 IKITRLL-EKLNRNYADKIMI 259 + +LL E+ N KI+I Sbjct: 240 FDMEKLLTEEKNDKDIQKILI 260 >UniRef50_A4E6Y2 Putative uncharacterized protein n=1 Tax=Collinsella aerofaciens ATCC 25986 RepID=A4E6Y2_9ACTN Length = 335 Score = 124 bits (311), Expect = 4e-27, Method: Compositional matrix adjust. Identities = 73/229 (31%), Positives = 123/229 (53%), Gaps = 19/229 (8%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 LS+++P+YN+ + C+ SL++QT LEI++I+DGSTD+S+E K A +R+ Sbjct: 6 LSIVVPIYNSSETIEECVASLVSQTINDLEILLIDDGSTDDSLEKCKKLASKDNRIRVFT 65 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 + NAG +ARN G+++ATG+++AFVD+DD MYETL++ A + DV +C Sbjct: 66 KENAGQGIARNYGLDLATGEFIAFVDSDDTCTKAMYETLLSRARDSGSDVVV--GGYCDL 123 Query: 128 ETGETWQSIPTD--RLRS--------TGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIV 177 G +S P LRS +++ P G VW VYRR +I Sbjct: 124 LNGAVVESHPAGSMELRSKDEIDRYMADLVSTP-----GHDDSAGCIAVWDSVYRRLLIE 178 Query: 178 KNNIKFIA--GLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSR 224 +N ++F + ++ +D+V+ + ++ LY+Y++ S S+ Sbjct: 179 ENGVQFPSERDVYSEDLVFKLRALASSSSVSVVPNCLYEYHVSGKSYSK 227 >UniRef50_UPI0001BC61E6 putative glycosyl transferase n=1 Tax=Fusobacterium gonidiaformans ATCC 25563 RepID=UPI0001BC61E6 Length = 318 Score = 124 bits (310), Expect = 6e-27, Method: Compositional matrix adjust. Identities = 73/227 (32%), Positives = 121/227 (53%), Gaps = 12/227 (5%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPH-VRL 65 +LS+IIP+YN R C+++ I E+I+I+DGS DNS I + Y + +P + Sbjct: 5 ELSIIIPVYNVELYLRKCLDT-IYPLQMKKEVILIDDGSQDNSFGIMQEYKDKFPEETII 63 Query: 66 LHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNA--- 122 + Q N G S RNRG+EVA GKYVAF+D+DD + YE ++++D+ N Sbjct: 64 ISQENKGISDVRNRGLEVALGKYVAFLDSDDFIDTIKYEEFFKRLKQEEVDILHGNGFYY 123 Query: 123 --DWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN 180 D + ET SI D L G ++ G + + VW+ +YRR+ +++N+ Sbjct: 124 QIDEKKEKIFETEDSIFID-----TTLLGKEFYEKGYNLQIHRDYVWLNIYRREYLLENS 178 Query: 181 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHR 227 + F G+ ++D +++ E + A + RY Y Y ++ S++R R Sbjct: 179 LFFREGITYEDKIFSQEAFWKAEKIRYVPMFFYYYRQNSESITRKPR 225 >UniRef50_B1QTB6 Glycosyl transferase, group 2 family protein n=4 Tax=Bacteria RepID=B1QTB6_CLOBU Length = 326 Score = 124 bits (310), Expect = 7e-27, Method: Compositional matrix adjust. Identities = 72/228 (31%), Positives = 118/228 (51%), Gaps = 10/228 (4%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +SV+IP+YN C+ES+I QT+T L+II+++DGSTDNS I Y + + ++H Sbjct: 7 ISVVIPVYNVEKYLPKCIESIINQTYTNLQIILVDDGSTDNSGSICDEYKKKDTRIMVIH 66 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 + N G S ARN GI+ + GKY+ FVD+DD + MYE +M + + +++ N + F Sbjct: 67 KKNGGLSDARNVGIKYSKGKYIGFVDSDDYINKKMYEIMMENMVNNKANISIVNRYYVF- 125 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 E G+ + + S V++ + + + + W +Y +++ N+I+F G Sbjct: 126 EDGQEFLRYSIN--ESIKVMSNLEAIEEMNNFSTFDMAAWDKLYEKELF--NDIEFPVGK 181 Query: 188 HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNY 235 +D R Y LY YY N S+SR NK +N+ Sbjct: 182 LSEDFYIMYLLFEKCKRIVYDSSPLYYYYQRNNSISR-----NKKINF 224 >UniRef50_Q93TI7 Putative glycosyltransferase CpsIVM n=1 Tax=Streptococcus agalactiae RepID=Q93TI7_STRAG Length = 283 Score = 123 bits (309), Expect = 8e-27, Method: Compositional matrix adjust. Identities = 71/210 (33%), Positives = 112/210 (53%), Gaps = 6/210 (2%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K+SVI+P+YN+ C++S+ QT+ LEII++NDGSTD S E+ + ++ ++ Sbjct: 13 KISVIVPVYNSKQYLIACVDSIRKQTYKNLEIILVNDGSTDGSKELCEEIRKSDERIKTF 72 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 H+ N G S ARN GI +TG + FVD+DD + P MYETL+ + ++ +D QCN + Sbjct: 73 HKTNGGQSSARNLGILYSTGDLIGFVDSDDTIDPKMYETLLNIYEDEQVDWVQCNHKKIY 132 Query: 127 RETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAG 186 + + P VL D+L LS+ + V G+ RD+ +K IKF Sbjct: 133 SNGVNLYYNGP----EYYNVLNKQDFLYEFLSTNKIFSSVCEGLLSRDLALK--IKFREE 186 Query: 187 LHHQDIVWTTEFMFNALRARYTEQSLYKYY 216 ++D + + + NA + Q Y YY Sbjct: 187 KKYEDTQFYFDLIKNANKFVIISQPFYNYY 216 >UniRef50_C3R6R9 Glycosyltransferase involved capsular polysaccharide biosynthesis n=2 Tax=Bacteroides dorei RepID=C3R6R9_9BACE Length = 325 Score = 123 bits (309), Expect = 8e-27, Method: Compositional matrix adjust. Identities = 69/196 (35%), Positives = 112/196 (57%), Gaps = 13/196 (6%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQ--TWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 K+S+I+P+YN D C+ SL+ Q ++ EIII+NDGSTD+S+EI + Y E YP+ + Sbjct: 3 KISLIVPVYNVEDFVARCLNSLLEQNMSYEDYEIIIVNDGSTDSSLEIIQPYLEKYPNCK 62 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 L+ + N G S ARN G+ VA G+Y+ FVD+DD + P + + L +DLDV +W Sbjct: 63 LIDKPNGGLSSARNAGLNVALGEYIWFVDSDDFIMPNLLDDLYKECSSNDLDVLW--MEW 120 Query: 125 CFRETGETWQSIP----TDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN 180 R E +P T R + + V+ G ++ S+ + W +Y+R + KN+ Sbjct: 121 --RRVSELGIELPQFYCTYRNKISHVINGVQFMEQVCSTYLFA---WAFLYKRCFLQKNS 175 Query: 181 IKFIAGLHHQDIVWTT 196 F+ G+ ++D+ + T Sbjct: 176 FLFMEGILYEDVEFNT 191 >UniRef50_B7AIU6 Putative uncharacterized protein n=1 Tax=Bacteroides eggerthii DSM 20697 RepID=B7AIU6_9BACE Length = 342 Score = 123 bits (309), Expect = 9e-27, Method: Compositional matrix adjust. Identities = 74/232 (31%), Positives = 118/232 (50%), Gaps = 20/232 (8%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K+S+I+P+Y C+E+LI Q+ +EII+INDGS DNS EI YA ++++ Sbjct: 5 KISIIVPIYKVEKYLERCIETLINQSIKNIEIILINDGSPDNSSEICDKYALTDLRIKVI 64 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN----- 121 H+ N G +ARN G+EVATG Y+AFVD+DD V MY+ L A + D CN Sbjct: 65 HKENEGLGLARNTGLEVATGTYIAFVDSDDYVDINMYKKLAEYADKHHCDAVFCNFYTEI 124 Query: 122 --ADWCFRETGET-----WQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRD 174 W R + E W+ + + P R++ VW +Y+ Sbjct: 125 NNGKW--RNSSEVSDYELWEKSKIHSFMLDMIASKPH----EKLERKYQMSVWHAIYKHT 178 Query: 175 VIVKNNIKFIAG--LHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSR 224 +I K NI+F++ + +DI + +F+ + R Y + Y Y ++ S+++ Sbjct: 179 IIDKENIRFLSERVVVSEDIPFQVDFLSASNRIGYIPNAYYYYCKNDNSLTK 230 >UniRef50_D0GIA1 Glycosyltransferase n=1 Tax=Leptotrichia goodfellowii F0264 RepID=D0GIA1_9FUSO Length = 342 Score = 122 bits (306), Expect = 2e-26, Method: Compositional matrix adjust. Identities = 67/222 (30%), Positives = 114/222 (51%), Gaps = 6/222 (2%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV-RL 65 KLS+I+P+YN R C++S+ E+I++NDGS D ++ IAK Y E Y + + Sbjct: 2 KLSIIVPMYNVEKYLRQCLDSIYKLDKIEKEVILVNDGSKDETLNIAKEYGEKYKEITTV 61 Query: 66 LHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMY--ETLMTMALEDDLDVAQCNAD 123 + Q N+G S+ RN+G+E A G Y+ F D DD + ++ E L + + D+ N Sbjct: 62 IDQPNSGVSITRNKGLEKAVGDYIYFFDGDDFLNVDIFQEEVLKLFSENGETDILYGNRI 121 Query: 124 WCFRETG-ETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIK 182 F E E IP + V TGP+++ L + W V +Y+R+++++NNI Sbjct: 122 LYFNEKRLEMTYEIPQE--MENKVWTGPEYMMRALKEKFWNVQVGTAIYKRELLIRNNID 179 Query: 183 FIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSR 224 F H+D ++T + + A + R + + Y S+ + Sbjct: 180 FPEKRIHEDELFTIKVLNAAKKVRAVNKIFFYYMQREGSIMK 221 >UniRef50_UPI0001B4A4FE ss-1,4-galactosyltransferase n=1 Tax=Bacteroides sp. 2_1_7 RepID=UPI0001B4A4FE Length = 332 Score = 122 bits (306), Expect = 2e-26, Method: Compositional matrix adjust. Identities = 72/219 (32%), Positives = 118/219 (53%), Gaps = 8/219 (3%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +SVI+P+YN D +TC S+++Q++ +E+I++NDGSTD S EIAK + V L+ Sbjct: 6 VSVIVPIYNVEDYLQTCCNSILSQSFNNIEVILVNDGSTDGSCEIAKQIEKKDSRVILID 65 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 Q N G S ARN G++VA G YV FVD+DD + + L ++ DV+ C A + Sbjct: 66 QRNGGLSAARNAGLKVAKGDYVLFVDSDDWIEKNCIQELYQDIQRNNSDVSCCLAQYI-- 123 Query: 128 ETGETWQSIPTDRLRSTGV--LTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIA 185 +I R S G+ LTG + L L + + W +YRRD + +N + F Sbjct: 124 ---SAMGTIRRGR-SSFGINSLTGDEILPNALLVKTFPTSAWGKLYRRDFLHENGLLFKE 179 Query: 186 GLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSR 224 G+ ++D +++ + A + + ++ L+ S+SR Sbjct: 180 GIVNEDTLFSIQIACLAKKISFVDKVLFDIREREGSISR 218 >UniRef50_B2DG34 Eps5I n=3 Tax=Streptococcus pneumoniae RepID=B2DG34_STRPN Length = 333 Score = 122 bits (305), Expect = 2e-26, Method: Compositional matrix adjust. Identities = 65/214 (30%), Positives = 110/214 (51%), Gaps = 5/214 (2%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 L+++IP+YN + C++S+I+Q W E+I++NDGSTD S I + YA+ Y + ++H Sbjct: 8 LTIVIPVYNVEKYLKRCIDSVISQEWDKYEVILVNDGSTDASPNICEEYAQKYHFISVIH 67 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 + N G S RN G+ A GKYV F+D+DD + M+ L M +E + D+ Q F Sbjct: 68 KENGGLSETRNTGLSHANGKYVFFLDSDDWITKDMFRNLSKMIMEQNYDILQFGMQ-MFH 126 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 E +++ D + LS T +Y+R++ KN I+F G Sbjct: 127 SEREELKNVQCKEKN----FNSSDAFKNMLSVEGITSFATDKIYKRELFEKNGIEFPIGY 182 Query: 188 HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTS 221 ++D+ + + +A + T Q Y Y++ N + Sbjct: 183 FYEDLGTVYKLILSAKKIYLTTQVYYCYFIGNDA 216 >UniRef50_C4Z667 Putative uncharacterized protein n=1 Tax=Eubacterium eligens ATCC 27750 RepID=C4Z667_EUBE2 Length = 329 Score = 121 bits (304), Expect = 3e-26, Method: Compositional matrix adjust. Identities = 64/216 (29%), Positives = 110/216 (50%), Gaps = 8/216 (3%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 LS+I+P+YN D C S+I Q + +EI++I+DGSTDNS EI Y+ E Y ++R+ H Sbjct: 13 LSIIVPVYNVQDYLEECCNSIINQKYLNIEILLIDDGSTDNSGEICDYFGEKYSNIRVFH 72 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 + N G + RN GI A G+Y+ FVD+DD +Y MY +M + LE + + CN D Sbjct: 73 RENEGLAATRNFGIAQAQGQYLGFVDSDDWIYENMYSDMMKIVLEKNAQIVCCNFDL--- 129 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 E + I + + + + + + ++++++ N++F G Sbjct: 130 ---EIDKRIIKRGKNGIDIYNKNEAIVNLMYPQFYQFYAPNKIFKKELF--QNVEFPIGK 184 Query: 188 HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVS 223 H +DI+ + + + + QS Y Y S++ Sbjct: 185 HFEDIITIYKVFLKSEKVYFIWQSGYVYRQRKDSIT 220 >UniRef50_C3WBU5 Predicted protein n=3 Tax=Fusobacterium RepID=C3WBU5_FUSMR Length = 329 Score = 121 bits (304), Expect = 3e-26, Method: Compositional matrix adjust. Identities = 66/218 (30%), Positives = 122/218 (55%), Gaps = 5/218 (2%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP-HVRLL 66 LS+I+P+YN + C++SL + E+I++NDGS DNS +I + + YP ++ Sbjct: 3 LSIIVPIYNVEEYLAECLKSLYKISNLRYEVILVNDGSKDNSYQIMEEFKRLYPKQTVIV 62 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 ++ N G S ARN G+++A GKY++F+D+DD + +E + ++ +D+A N + Sbjct: 63 NKENGGLSSARNAGLKIAKGKYISFIDSDDFIDTDEFEKFVIEGIKSRVDIAVGNMRYYV 122 Query: 127 RETGETWQSIPTDRL-RSTGVLTGPDWL-RMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 G + + RL + +GV+TG D+L R+ + + + V +YR+D ++KN + F Sbjct: 123 --PGRIGEPLFRSRLVKDSGVVTGIDFLWRVFQAPKCYREEVVDDIYRKDFLIKNKLFFN 180 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSV 222 + H+D +TT A + +Y +++ Y Y S+ Sbjct: 181 EEIVHEDSEFTTFAYLKAKKVKYIDKAFYFYRQREGSI 218 >UniRef50_A8ULG5 Capsular polysaccharide biosynthsis protein n=1 Tax=Flavobacteriales bacterium ALC-1 RepID=A8ULG5_9FLAO Length = 344 Score = 121 bits (304), Expect = 3e-26, Method: Compositional matrix adjust. Identities = 73/238 (30%), Positives = 122/238 (51%), Gaps = 14/238 (5%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTA--LEIIIINDGSTDNSVEIAKYYAENYPHVR 64 KLS+IIP+YN TC++S++ Q E+IIINDGSTDNS +I K Y+ENY ++ Sbjct: 2 KLSIIIPVYNVEKYISTCLDSIVNQDLKKDEYEVIIINDGSTDNSGKIVKTYSENYSNIF 61 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 L Q N G ARN+G+ A G +V F+D DD + + +TL + A++ +LDV ++ Sbjct: 62 LYTQKNGGVGSARNKGLSYAKGDFVYFLDPDDYLAMNVLKTLTSTAVQYNLDVLTFDSTS 121 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 + + + V+TG D+ ++ +++ VW+ + R ++ ++KFI Sbjct: 122 FNDSSVLPENNSNNSEANLSDVITGIDY----IAKTKYSSTVWLFLINRSFLIDIDLKFI 177 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKI 242 G +D ++T E R + + Y Y R+ + N HY+K+ Sbjct: 178 KGRWMEDAIFTLELFIKVKRMAHLNINAYFY--------RVSSESAMNSYEPNHYLKL 227 >UniRef50_A9VPZ1 Glycosyl transferase family 2 n=3 Tax=Firmicutes RepID=A9VPZ1_BACWK Length = 324 Score = 121 bits (304), Expect = 3e-26, Method: Compositional matrix adjust. Identities = 64/217 (29%), Positives = 116/217 (53%), Gaps = 10/217 (4%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K+S+I+P+Y C++S++ QT+ E+I+++DGS D EI YA+ ++++ Sbjct: 4 KISIIVPVYKVERYIHKCIDSILVQTFEDFELILVDDGSPDTCGEICDRYADKDMRIKVI 63 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 H+ N G S ARN G+E A G Y+ FVD+DD + P MYE L M +E D+A C++ + Sbjct: 64 HKENGGLSSARNAGLENAKGDYIGFVDSDDWIEPDMYELLYGMCVEQGCDIAICSSQIHY 123 Query: 127 RETGETWQSIP-TDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIA 185 T + P R+T ++ L + + VVW +++R+++ +IKF Sbjct: 124 SNRIVTSSNHPFIIHDRNTA-------MKTMLEGKLYDEVVWTKLFKRNLL--EDIKFPV 174 Query: 186 GLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSV 222 G+ ++D +T + + + + + Y Y + S+ Sbjct: 175 GMSYEDTAFTYKVIHKCKKVCFIGEPKYHYIKRDNSM 211 >UniRef50_B5CU67 Putative uncharacterized protein n=1 Tax=Bacteroides plebeius DSM 17135 RepID=B5CU67_9BACE Length = 331 Score = 121 bits (303), Expect = 4e-26, Method: Compositional matrix adjust. Identities = 74/236 (31%), Positives = 122/236 (51%), Gaps = 22/236 (9%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +SVI+P+YN + C+ES++ QT+ LEII+++DGS+D S EI YY + ++H Sbjct: 2 ISVIVPIYNVAEYLPACIESILNQTYKNLEIILVDDGSSDKSGEICDYYVKRDTRCIVIH 61 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 Q+N G S ARN G+E ATG+Y++F+D DD ++P M TL ++D D + Sbjct: 62 QSNKGLSEARNAGLECATGEYISFIDGDDYIHPKMLATLYEALQQEDYDFSMI------- 114 Query: 128 ETGETW-QSIPTDRLRSTGVLTGPDWLRMGL---SSRRWT------HVVWMGVYRRDVIV 177 +TW I + ++ L GL SS++ + VW +YRR++I Sbjct: 115 AHTQTWNHHIEKEPKMINSIIFETSDLMQGLYNVSSKKISLPECNFQAVWNKLYRRNLIG 174 Query: 178 KNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNL 233 + IA +D + + +A + +Q LY + +S++ H+ NKN Sbjct: 175 SSRFITIAT---EDTEFNNKIFLKTQKAIFLDQCLYYWVQRPSSIT--HQSVNKNF 225 >UniRef50_C1ME88 Glycosyltransferase n=2 Tax=Enterobacteriaceae RepID=C1ME88_9ENTR Length = 325 Score = 121 bits (303), Expect = 4e-26, Method: Compositional matrix adjust. Identities = 69/216 (31%), Positives = 112/216 (51%), Gaps = 4/216 (1%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +SVIIP+YNA ++ +E ++ + +E+II+NDGS+DN+ I + V+++ Sbjct: 1 MSVIIPVYNAEKTIKSLIEDILIEDRINIEVIIVNDGSSDNTSTIINSIKDG--RVQVIE 58 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 Q N G ARN G++ G++V F+DADD+V A + DV NA W Sbjct: 59 QNNKGVYAARNNGLDKHKGQWVMFLDADDDVSEGFIYNRYIQASSHNADVLLSNA-WKLW 117 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 G+ + + T++ VL+G W+ + R W H +W+ + R D I NN+ F G Sbjct: 118 HEGDNKKPVHTNQPYDE-VLSGNAWMSHAVKVREWPHYLWLQIVRSDYIRNNNLSFHDGR 176 Query: 188 HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVS 223 H+DI+WT E R ++ Y Y +N S++ Sbjct: 177 SHKDILWTMELALANGRFYISKDMDYTYKYNNESIT 212 >UniRef50_O86893 Putative glycosyl transferase n=1 Tax=Streptococcus pneumoniae RepID=O86893_STRPN Length = 323 Score = 121 bits (303), Expect = 4e-26, Method: Compositional matrix adjust. Identities = 66/215 (30%), Positives = 113/215 (52%), Gaps = 4/215 (1%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +SVI+P+YN D R ++SL+ QT+ E+I++NDGSTDNS EI Y + Y ++ + H Sbjct: 2 ISVIVPVYNVADYLRFALDSLLEQTYKDFEVILVNDGSTDNSGEICDEYGKLYDNIHVFH 61 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 + N G S ARN G+E + G+++ F+D+DD P E L+T+ + D+D+ + Sbjct: 62 KKNGGLSDARNFGLEKSRGEFITFLDSDDYFEPYALELLITIQKKYDVDIVSTKGGITYS 121 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 + + + D L + +LT ++L + T W +Y+RD+ F G Sbjct: 122 HDIYSKKLMAEDYL-TVKILTNKEFLAAVYYNDEMTVSAWGKLYKRDLF---KTIFPKGK 177 Query: 188 HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSV 222 ++D+ E + N +T+ +Y YY S+ Sbjct: 178 IYEDLYVVAERLLNIKTVAHTDLPIYHYYQRQGSI 212 >UniRef50_B0MQU6 Putative uncharacterized protein n=1 Tax=Eubacterium siraeum DSM 15702 RepID=B0MQU6_9FIRM Length = 325 Score = 121 bits (303), Expect = 4e-26, Method: Compositional matrix adjust. Identities = 67/203 (33%), Positives = 107/203 (52%), Gaps = 4/203 (1%) Query: 24 CMESLITQTWT-ALEIIIINDGSTDNSVEIAKYYAENYPHVRLLHQANAGASVARNRGIE 82 C++SL+ Q + EIII+NDG+ DNS+ IA +A Y +R++ Q NAG S ARN G+ Sbjct: 29 CLDSLVRQNFGYKYEIIIVNDGTKDNSMTIADRFASKYDFIRIITQQNAGLSAARNTGLA 88 Query: 83 VATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFRETGETWQSIPTDRLR 142 A G+Y+AFVD+DD V P + T+A++++ D+ QC F ++ + + + R Sbjct: 89 NARGEYIAFVDSDDFVSPFYISEMYTLAVKNNADIVQCRYCNYFEKSDRSINVLLSHR-- 146 Query: 143 STGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGLHHQDIVWTTEFMFNA 202 GV+ G +S + W +YRR + NNI + G +D++ T A Sbjct: 147 -NGVIGGKKAFEKLISDVSVRNYAWNKMYRRSLFSDNNISYPVGKCFEDVITTPRLFACA 205 Query: 203 LRARYTEQSLYKYYLHNTSVSRL 225 +T + LY Y S++ L Sbjct: 206 ENVAFTRKVLYSYAHRPDSITGL 228 >UniRef50_A7VPB2 Putative uncharacterized protein n=1 Tax=Clostridium leptum DSM 753 RepID=A7VPB2_9CLOT Length = 332 Score = 121 bits (303), Expect = 4e-26, Method: Compositional matrix adjust. Identities = 62/208 (29%), Positives = 109/208 (52%), Gaps = 4/208 (1%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S++IP+YN R ++S+ QT+ E II++DGSTD EI Y Y + + + Sbjct: 17 ISIVIPVYNVDMFLRKALQSVADQTFQNYEAIIVDDGSTDRCPEIIAKYVRKYSNFKSVR 76 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 Q NAG S ARN G+ +A G+++AF+D+DD PT E + + +D D++ C+ + F Sbjct: 77 QENAGLSAARNTGLRLAQGEFIAFMDSDDYYQPTFLEDMYNACVANDCDISYCSFYFYFP 136 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 G IP T V L + + + H W +Y+R + ++N++ + + Sbjct: 137 YNG---LRIPLPFTCRTAVYESRQALNLLIRDKSMHHFAWNKLYKRSLFLENDVTY-TQM 192 Query: 188 HHQDIVWTTEFMFNALRARYTEQSLYKY 215 + +D+ + + ++A R T + LY Y Sbjct: 193 YFEDVATSPKLYYHAKRVAVTSKPLYNY 220 >UniRef50_B2UQP2 Glycosyl transferase family 2 n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UQP2_AKKM8 Length = 358 Score = 121 bits (303), Expect = 5e-26, Method: Compositional matrix adjust. Identities = 76/234 (32%), Positives = 117/234 (50%), Gaps = 10/234 (4%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S+I+P YN CMESL Q+ +EII +NDGS+D + EI + + + VR++ Sbjct: 5 VSIIVPCYNVAAYLDQCMESLAGQSMEDIEIICVNDGSSDRTAEILREWRDRDGRVRVID 64 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN-ADWCF 126 + N+G S ARN G+E A GKY+ FVD DD V M+ L A+E D DVA C ++C Sbjct: 65 RKNSGVSAARNSGMEAAAGKYIGFVDPDDVVERNMFRRLFDAAVEKDADVAVCGYHEFCD 124 Query: 127 RETGETWQSIPTDRLRSTGVLT--GPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 R + +S + S G + R G R VW + RR+++ +N ++F+ Sbjct: 125 RGGMDMPESGWSP---SAGFFPEEKAERFRRGTPWSRCAGTVWNKLIRRELLEENGVRFV 181 Query: 185 AGLHH-QDIVWTTEFMFNALRARYTEQSLYKYYLH---NTSVSRLHRQGNKNLN 234 GL +D+ + + A R LY Y + S R R G+ ++ Sbjct: 182 PGLRQGEDLYFCLMLLTVAPRLLILPDRLYHYRRERPGSASCGRDPRLGDFRMD 235 >UniRef50_Q8XN37 Capsular polysaccharide biosynthsis protein n=1 Tax=Clostridium perfringens RepID=Q8XN37_CLOPE Length = 326 Score = 120 bits (302), Expect = 5e-26, Method: Compositional matrix adjust. Identities = 66/219 (30%), Positives = 119/219 (54%), Gaps = 2/219 (0%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K+S+IIP+YN + C+++ + QT +EII I+DGSTD S +I YA+ Y ++++ Sbjct: 3 KVSIIIPIYNVENYLDKCLDTAVNQTLGCIEIIGIDDGSTDKSRDILNKYAKKYSFIKII 62 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 ++ N G S ARN GI A G+Y+ F+D+DD + + +A ++ LD+ +A+ Sbjct: 63 NKENGGLSSARNTGIINAKGEYIYFLDSDDYIEYDSMKICYEIATKNKLDIVSFDAESFV 122 Query: 127 RETGETWQSIPTDR--LRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 G T + DR + + G ++ + ++ +T + YRR+ ++ NN+ F Sbjct: 123 DGEGFTKKLKKYDRKDILDGNIKMGCEYFCELIKNKCYTASACLNFYRREFLIDNNLFFY 182 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVS 223 G+ H+D ++T + A + RY LY L + S++ Sbjct: 183 DGILHEDELYTVKAYTLANKVRYIPYKLYFRRLRSNSIT 221 >UniRef50_C3ADH9 Putative uncharacterized protein n=1 Tax=Bacillus mycoides DSM 2048 RepID=C3ADH9_BACMY Length = 338 Score = 120 bits (301), Expect = 6e-26, Method: Compositional matrix adjust. Identities = 71/218 (32%), Positives = 120/218 (55%), Gaps = 11/218 (5%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +SVI+P+YN C+ES++ Q + +E++++NDGSTD EI + Y E +++++ Sbjct: 8 ISVIVPIYNVEKWLARCIESVLKQPFVNIELVLVNDGSTDKCGEICEEYLEKDNRIKVIY 67 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 + N G S ARN GIE ATGKY+ F+D DDEV + L +A ED +C+A C Sbjct: 68 KENGGVSSARNAGIEAATGKYIVFIDPDDEVSENYFTRLYGIA-ED----KKCDAVICGY 122 Query: 128 ETGETWQS-IPTDRLRSTGVLTGPDWL--RMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 +T S IP +L + V+ G D++ + S+ W +Y +I + NI+F Sbjct: 123 KTVPNNNSIIPNFKLDT--VMNGRDFVLSSPNVHSKNDLCFAWRYIYNLSIIKEENIRFN 180 Query: 185 AGLH-HQDIVWTTEFMFNALRARYTEQSLYKYYLHNTS 221 + +D+++ EF+ ++ R + LY Y ++N + Sbjct: 181 EQVFIGEDVIFNLEFLLHSQRVCAVSEVLYFYTINNPN 218 >UniRef50_B5CP79 Putative uncharacterized protein n=1 Tax=Ruminococcus lactaris ATCC 29176 RepID=B5CP79_9FIRM Length = 315 Score = 120 bits (301), Expect = 7e-26, Method: Compositional matrix adjust. Identities = 64/216 (29%), Positives = 111/216 (51%), Gaps = 5/216 (2%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +SVI+P+Y + C++S++ QT+ LE+I+++DGS D I YA VR++H Sbjct: 5 ISVIVPVYKVEKYLKRCVDSILAQTYPCLEVILVDDGSPDGCPAICDEYAREDRRVRVIH 64 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW-CF 126 + N G S ARN GI+ A GK++ FVD+DD V+P YE L+ E+ D+A C+ C Sbjct: 65 KENGGLSDARNAGIDAAKGKFLGFVDSDDYVHPRFYELLLQALKEEGADIAGCDVKKVCK 124 Query: 127 RETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAG 186 E E + P R +G + + + + V W +Y++++ I+F G Sbjct: 125 TEKIEEKEQQPIQRTVYSGREATANLYDAQMYLK--SVVAWNKLYKKELF--EEIRFPKG 180 Query: 187 LHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSV 222 H+D + + ++ + Y ++ Y Y+ + Sbjct: 181 KLHEDEFTSYKLLYQSESVVYINRAYYFYFQREDGI 216 >UniRef50_Q8GPB8 Eps6I n=5 Tax=Streptococcus RepID=Q8GPB8_STRTR Length = 328 Score = 120 bits (301), Expect = 8e-26, Method: Compositional matrix adjust. Identities = 69/212 (32%), Positives = 110/212 (51%), Gaps = 5/212 (2%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 L+V+IP+YN + C+ES++ Q W +I++++DGSTD+S +I YA+ Y + ++H Sbjct: 7 LTVVIPVYNVEKYLKRCVESVLVQEWHNYDILLVDDGSTDSSPQICDDYAKVYDFISVIH 66 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 + N G S ARN GI A G+YV F+D+DD + P + L + D D+ N ++ Sbjct: 67 KKNGGLSAARNTGISNAEGEYVYFLDSDDWIEPNTFSDLAEVIESDQYDIISFNQEFVKS 126 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 E I +D R T LTG + L S T +YR+ + KN I+F G Sbjct: 127 E----HDIIKSDSKR-TKRLTGKEALIDMFSYGFVTGFATDKIYRKALFTKNTIQFPVGK 181 Query: 188 HHQDIVWTTEFMFNALRARYTEQSLYKYYLHN 219 +++D+ + A + T Q Y Y + N Sbjct: 182 YYEDLGTNYKLFLAAKKVYATNQKYYHYLIDN 213 >UniRef50_C7H3W5 Glycosyl transferase CpsJ n=1 Tax=Faecalibacterium prausnitzii A2-165 RepID=C7H3W5_9FIRM Length = 324 Score = 119 bits (298), Expect = 1e-25, Method: Compositional matrix adjust. Identities = 79/251 (31%), Positives = 128/251 (50%), Gaps = 18/251 (7%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S+I+P+YN C+ SL+ Q++ +E+++++DGSTD S + YA VR+LH Sbjct: 5 VSIIVPVYNVKPYLNRCVNSLLGQSYQNMELLLVDDGSTDGSETLCGEYAAQDARVRVLH 64 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW--- 124 + N G S ARN G++ A G+Y++FVD DD V P E L ALE Q AD+ Sbjct: 65 KKNGGLSDARNAGVDAAKGEYLSFVDGDDWVSPYYIENLYR-ALE------QAGADFSAS 117 Query: 125 CFRET--GETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIK 182 CF E G+ QS+PT+RL + +L+ + LR L +Y+R + ++ Sbjct: 118 CFEEVFEGQPVQSVPTERLEAFEILSREECLRRILYQEGMEVTTPTKLYKRALF--EGVR 175 Query: 183 FIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKI 242 + G ++DI E A +A + Y Y+ +S+ + K L+ RH Sbjct: 176 YPVGKLYEDIPVAYEVTKRAHKAAHIANRDYYYFQRGSSIQNMAFNPRK-LDSVRH---C 231 Query: 243 TRLLEKLNRNY 253 L+E + R++ Sbjct: 232 HALMENVKRDF 242 >UniRef50_B2UYP7 Minor teichoic acid biosynthesis protein GgaB n=4 Tax=Clostridium RepID=B2UYP7_CLOBA Length = 335 Score = 119 bits (298), Expect = 1e-25, Method: Compositional matrix adjust. Identities = 75/233 (32%), Positives = 128/233 (54%), Gaps = 15/233 (6%) Query: 3 NSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPH 62 NS +S+I+P+Y A D + S++ QT E+I+++DGS DNS +I A N Sbjct: 14 NSEVLISIIMPIYKAEDYLNNSITSILNQTLENFELILVDDGSPDNSGKICDELALNDNR 73 Query: 63 VRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNA 122 ++++H+ N GAS ARN G+++A G+++AFVD+DD + P M+ETL +A E + D++QCN Sbjct: 74 IKVIHKENGGASTARNAGLDIAQGEFIAFVDSDDWIEPNMFETLFNLAKEHNADISQCNY 133 Query: 123 DWCFRETGETWQSIPTDR--LRSTGVLTGPDWL--RMGLSSRRWTHVVWMGVYRRDVIVK 178 + E + I D+ + S + + L M +S T V+W +Y++ + Sbjct: 134 ---IKVENEDEKIINDDKEIIISFNNIESLNNLYNEMYVS----TVVLWNKIYKKSLF-- 184 Query: 179 NNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNK 231 N I+F +D + +F + + YT + L YY NT S ++ + +K Sbjct: 185 NKIRFPNMRIFEDEAIMYKLLFESKKLVYTNKKL--YYYRNTPDSIMNAKFDK 235 >UniRef50_C9LAR4 Glycosyl transferase, family 2/glycosyl transferase family 8 n=1 Tax=Blautia hansenii DSM 20583 RepID=C9LAR4_RUMHA Length = 329 Score = 119 bits (298), Expect = 1e-25, Method: Compositional matrix adjust. Identities = 67/232 (28%), Positives = 124/232 (53%), Gaps = 6/232 (2%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S+IIP+YN D C++S+ QT+ LEII++NDGSTD S E+A + + P ++L++ Sbjct: 7 ISIIIPIYNVEDYLERCIKSIQQQTYKNLEIILVNDGSTDGSTEVALEFCQMDPRIKLVN 66 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 + N G S ARN GI+V GKY+ FVD+DD V E L + ++ +A C+ + Sbjct: 67 KKNGGLSDARNVGIDVCKGKYIVFVDSDDYVEARYVELLYKSLVSNNAQMAVCSFNIVND 126 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVV-WMGVYRRDVIVKNNIKFIAG 186 + + ++ + ++TG + L + + + +VV W +Y +++ +++F G Sbjct: 127 NGSVIKRKLVSE--KKNEIITGREMLNRVMRADGYKYVVAWNKIYASNLL--EDLRFEKG 182 Query: 187 LHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRH 238 ++D + E +N + + + LY Y S+ +L N+ + Q+ Sbjct: 183 KLYEDEFFNFEVSYNLTKVSFVAEPLYNYVQRAGSI-KLSELTNEKIEMQKE 233 >UniRef50_D2EWI1 Glycoside transferase family 2 protein n=3 Tax=Bacteroides RepID=D2EWI1_9BACE Length = 336 Score = 119 bits (298), Expect = 2e-25, Method: Compositional matrix adjust. Identities = 72/246 (29%), Positives = 128/246 (52%), Gaps = 18/246 (7%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S+I+P+YN D C++S+ QT+ LEII+I+DGS+D S E+ +A+ ++++H Sbjct: 10 ISIIVPVYNVKDYLEKCLQSICVQTYKNLEIILIDDGSSDGSGELCDLFAQRDGRIKVIH 69 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 Q NAG S ARNRG+ VA G+++ FVD+DD + P MYE L + + D++ C+ + Sbjct: 70 QTNAGQSAARNRGLAVAQGEFLGFVDSDDWIEPDMYEFLYHLLKANGADISICSH---YI 126 Query: 128 ETGETWQSIPTDRLRSTGVLTG---PDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 ET ++ T S+G + + +R + +R + +W +Y+R + I F Sbjct: 127 ET-----AVKTRVKHSSGQFSSFSREEAIRTLVEDKRIRNYMWDKLYKRQLFA--GIYFP 179 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 +DI + + + + + Y Y S + QG K NY++ Y+ Sbjct: 180 VNRVFEDIAVSYQIFYKTQKVVMQDCPKYHYLKREGSTT----QG-KLYNYEKEYLLFQT 234 Query: 245 LLEKLN 250 + E++ Sbjct: 235 VYEQVK 240 >UniRef50_C7H3W1 Capsular polysaccharide biosynthesis protein CpsI n=1 Tax=Faecalibacterium prausnitzii A2-165 RepID=C7H3W1_9FIRM Length = 327 Score = 119 bits (298), Expect = 2e-25, Method: Compositional matrix adjust. Identities = 73/232 (31%), Positives = 120/232 (51%), Gaps = 10/232 (4%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M+N T +S+I+P+YN C++S++ QT+ LEII+++D STDNS+ I + AE Sbjct: 1 MLNDT--VSIIVPIYNKEKYLEKCLDSILGQTYRDLEIILVDDESTDNSLAICQRCAEKD 58 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 P +++ H++N G S ARN G+E +TG ++F D DD ++ E L + E ++A C Sbjct: 59 PRIKIYHKSNGGVSSARNLGLEKSTGTLISFADPDDSLHAECIERLKRVLDETGAEIAYC 118 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN 180 TG T S + DW +SR+ V W VYRR V + Sbjct: 119 YGLDLLGTTGRTQTKSDETEKVSVMPVRRYDW-----NSRKAHTVCWGAVYRR--AVTQD 171 Query: 181 IKFIAGLH-HQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNK 231 + F L +D ++ + + A + +++LY YY+++ SV+ H K Sbjct: 172 VAFDTDLKIGEDTLFLAKCIRKAEKIACIDKALYYYYVNDDSVTSGHYSAGK 223 >UniRef50_A6BH37 Putative uncharacterized protein n=1 Tax=Dorea longicatena DSM 13814 RepID=A6BH37_9FIRM Length = 323 Score = 118 bits (296), Expect = 2e-25, Method: Compositional matrix adjust. Identities = 68/239 (28%), Positives = 124/239 (51%), Gaps = 12/239 (5%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S+IIP+Y C++S+++QT++ LEII+++DGS D S I +A +R++H Sbjct: 5 ISIIIPVYKVEKYLDKCIKSIVSQTYSNLEIILVDDGSPDKSGLICDKWATLDSRIRVVH 64 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 Q NAGA ARN +++A G++++FVD+DD + M++ L++ E+D+D+ +C F Sbjct: 65 QKNAGAGAARNVALKLAQGEFISFVDSDDYLSVNMFKNLLSY-FEEDIDIVECE----FI 119 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 E ++ V + + +R ++ + ++W +YRR I N+ F G Sbjct: 120 SVEEDDVLFIDEKSVQARVFSSEEAMREHIADHYFRQLIWNKLYRRSCI--KNVYFPEGK 177 Query: 188 HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNK-----NLNYQRHYIK 241 D W + + A + + + LY Y + SV + + N+ Y+ YIK Sbjct: 178 KIDDEFWMYKVIGRAKQLVHCNEKLYAYRQQDNSVMHMLKPENRIQAIEAKGYRHQYIK 236 >UniRef50_A8SCX5 Putative uncharacterized protein n=1 Tax=Faecalibacterium prausnitzii M21/2 RepID=A8SCX5_9FIRM Length = 322 Score = 118 bits (296), Expect = 3e-25, Method: Compositional matrix adjust. Identities = 70/232 (30%), Positives = 117/232 (50%), Gaps = 14/232 (6%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 LS+I+P+Y + + C++S++ QT+T E+I++ DGS D + A +R+LH Sbjct: 2 LSIIVPVYKVENYLQKCIDSILAQTFTDFELILVEDGSPDGCPALCDAAAAKDARIRVLH 61 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 Q N G S ARN G++VA G+++ FVD+DD + P MYETL D+A C D+ Sbjct: 62 QKNGGLSAARNAGLDVARGEWIGFVDSDDYIAPEMYETLYKAVQSTGADLALC--DYAAV 119 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 + T +P + + TG + L+ ++ W +YRR ++ +++ G Sbjct: 120 DEAGT-PCLPPYTGLAQRIFTGRELLKKATNT--MAQPAWNKLYRR--VIFAQLRYPEGK 174 Query: 188 HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHY 239 ++DI E N +A + LY Y +S+ G+K L RH+ Sbjct: 175 LNEDIFVLPEICLNTKKAVVVPKELYYYVQRGSSIM----NGSKTL---RHF 219 >UniRef50_B0P2L4 Putative uncharacterized protein n=1 Tax=Clostridium sp. SS2/1 RepID=B0P2L4_9CLOT Length = 332 Score = 118 bits (295), Expect = 3e-25, Method: Compositional matrix adjust. Identities = 62/216 (28%), Positives = 121/216 (56%), Gaps = 6/216 (2%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +SVIIP YN D C++S+I+QT+ LEII+++DGS D++ EI YA+ ++++H Sbjct: 6 VSVIIPAYNIEDYIGRCLDSIISQTYKNLEIIVVDDGSRDHTGEILDNYAKKDRRIKVIH 65 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 + N G S ARN+GIE A G Y+ F+D DD + MY+TL+ + E++ D+A C F Sbjct: 66 KENGGVSSARNKGIEAAEGDYIGFIDGDDLIESEMYKTLVDLLEEENADIAHCGYQMVFP 125 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 + + + + ++++T + L+ LS + +Y++++I + + Sbjct: 126 DRIDYYHNTGKKKIQTT-----EEGLKDLLSGEMIEPGLVNKIYKKELIKNCRLDETVKI 180 Query: 188 HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVS 223 ++D++ + + ++ Y + + Y Y + ++S + Sbjct: 181 -NEDLLMNYQLFKLSQKSVYYDITPYSYMIRSSSAT 215 >UniRef50_A6L5V6 Glycosyltransferase family 2 n=2 Tax=Bacteroides RepID=A6L5V6_BACV8 Length = 344 Score = 118 bits (295), Expect = 3e-25, Method: Compositional matrix adjust. Identities = 69/235 (29%), Positives = 123/235 (52%), Gaps = 22/235 (9%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAE-----NYP 61 K+S+I+P+YN CM+SL+ QT LEII+++DGS DNS + Y ++P Sbjct: 5 KVSIIVPVYNVEKYLDRCMQSLLGQTLKELEIILVDDGSPDNSPALCDNYESASDNLHWP 64 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 ++++H+ N G ARN G+E+ATG+Y+AF+D+DD V MYE L A D+DV CN Sbjct: 65 IIKVIHKINEGLGFARNSGLEIATGEYIAFLDSDDFVDVNMYEQLYLKAKAKDIDVVYCN 124 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTG----PDWLRMGLSSR-------RWTHVVWMGV 170 + +++ + P + + G D+L ++ R+ V + Sbjct: 125 CNI-YKDKKHIY---PRKDVEEEVIFRGREAVDDFLLDMIAPLPEYHHDVRYMMSVGHAI 180 Query: 171 YRRDVIVKNNIKFIA--GLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVS 223 YR + +++ ++F++ +DI++ +F+ + Y L+ Y ++ S+S Sbjct: 181 YRHSIFMEHRVQFVSEREFVSEDIIFDIDFLSHVRSVCYLPDCLHYYVVNPYSLS 235 >UniRef50_A8U9G0 Capsular polysaccharide biosynthsis protein n=1 Tax=Carnobacterium sp. AT7 RepID=A8U9G0_9LACT Length = 336 Score = 118 bits (295), Expect = 3e-25, Method: Compositional matrix adjust. Identities = 62/211 (29%), Positives = 121/211 (57%), Gaps = 4/211 (1%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQ-TWTALEIIIINDGSTDNSVEIAKYYAENYPHVRL 65 K+SVI+P+YN + C+ES+I+Q + +E+I+I+DGSTD S +I K Y E+Y ++ Sbjct: 5 KISVIVPVYNVENYLEDCIESVISQKNFQDIELILIDDGSTDQSSKIIKKYTESYTNILD 64 Query: 66 LHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWC 125 +Q NAG S ARN+GI+ A G+Y+ F+D+DD + E L +A+++DL++ Sbjct: 65 YYQENAGQSKARNKGIQHAKGEYIYFLDSDDLIPDLAMEHLYNLAVKEDLELILFEGKSF 124 Query: 126 FRETGETWQSIPTDRLRST---GVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIK 182 F ++ + + + R+ +L G D ++ + + ++ V++KN+I Sbjct: 125 FDKSVDNMKIGNMNYERNKVYDKILNGQDLFIEMTVNKDFYASPCLQFIKKKVLLKNSIY 184 Query: 183 FIAGLHHQDIVWTTEFMFNALRARYTEQSLY 213 F+ G+ H+D +++ + + + + + + L+ Sbjct: 185 FLEGIIHEDELFSYQLLMKSKKVKCIKDKLF 215 >UniRef50_C6WZX8 Beta-1,3-glucosyltransferase n=1 Tax=Flavobacteriaceae bacterium 3519-10 RepID=C6WZX8_FLAB3 Length = 361 Score = 118 bits (295), Expect = 4e-25, Method: Compositional matrix adjust. Identities = 65/224 (29%), Positives = 117/224 (52%), Gaps = 14/224 (6%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP-HVRL 65 K+SVI+P+Y + C+ SL+ QT +EII++NDGSTD S EI +++ +P +R Sbjct: 30 KVSVIVPVYQGEKELSKCLASLVNQTLDDIEIIVVNDGSTDGSGEIITEFSQRFPSKIRA 89 Query: 66 LHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWC 125 + N G S ARN G++ ATG+Y+ FVDADDEV + + +A + ++ CN Sbjct: 90 FTKENGGLSDARNFGLDRATGQYIGFVDADDEVSANTFAEMYELAEKHAAEMVICNIQKV 149 Query: 126 FRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMG---VYRRDVIVKNNIK 182 E G+ Q + T + P+ + + + + + ++R+D++ +N + Sbjct: 150 -DEAGKITQKL-------TQIPNMPERIVLEDHFSVFADISYFACNKIFRKDIL--HNKR 199 Query: 183 FIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLH 226 F G+H++DI + + T+ Y+Y+ S+++ H Sbjct: 200 FKKGIHYEDIQLIPQLLLECRVIAQTQNYHYQYFERRGSITKTH 243 >UniRef50_C8WIW0 Glycosyl transferase family 2 n=1 Tax=Eggerthella lenta DSM 2243 RepID=C8WIW0_EGGLE Length = 340 Score = 117 bits (294), Expect = 4e-25, Method: Compositional matrix adjust. Identities = 70/208 (33%), Positives = 112/208 (53%), Gaps = 2/208 (0%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S+I+P+YN C+ SL QT++ E I I+DGSTD ++E A+ A + + Sbjct: 4 VSIIVPMYNVRPYIGECLASLKRQTFSDFEAICIDDGSTDGTLEAARNAAGDDERFVFVG 63 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 Q NAG SVARN GI +ATG+ V F+D+DD +TL A D+LDV +A + Sbjct: 64 QVNAGQSVARNVGIGLATGRSVLFLDSDDYYEDRALQTLFDHAERDELDVLFFSARTVYE 123 Query: 128 --ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIA 185 E+ ++ DR G++TG + L +R ++ + + RRD + ++ I F Sbjct: 124 DAESRARYRDDYEDRAAIEGIMTGQELLERFAENRSFSVSPALQLIRRDFLTESGISFFE 183 Query: 186 GLHHQDIVWTTEFMFNALRARYTEQSLY 213 G+ H+D ++T + A RA + + LY Sbjct: 184 GIVHEDNLFTGLILAQASRAAFLNEQLY 211 >UniRef50_C9MPZ6 Putative glycosyl transferase n=1 Tax=Prevotella veroralis F0319 RepID=C9MPZ6_9BACT Length = 346 Score = 117 bits (294), Expect = 5e-25, Method: Compositional matrix adjust. Identities = 81/305 (26%), Positives = 142/305 (46%), Gaps = 36/305 (11%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K S+++P+YNA R C+ SL+ Q + EI+++NDGSTD+S+ I + YA ++++L Sbjct: 5 KFSIVVPVYNAEGFLRQCLNSLLRQGLDSYEILLVNDGSTDSSLSICQEYANENSNIKVL 64 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPT-MYETLMTMALEDDLDVAQCNADWC 125 Q N G ARN G+E A G++V VD+DD + +Y + ++D D+ Q Sbjct: 65 SQTNQGVCAARNNGLEHAMGEWVILVDSDDYLLDNGLYTAFSAVEKQEDYDIIQ------ 118 Query: 126 FRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVV---WMGVYRRDVIVKNNI- 181 ++ + + W P + L S G GL R + V W+ YRRD + KN+I Sbjct: 119 YKSSYDFW---PKNILTSEVTFNGE-----GLELVRQSGFVSFCWLCFYRRDFLNKNHIH 170 Query: 182 ---KFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRH 238 K+I G +D ++ + R LY+Y +H S + +++ Y R Sbjct: 171 FNNKYIVG---EDQLFVANVFLHNPRTAVVSTDLYRYIVHENSATT-----KRDVKYTR- 221 Query: 239 YIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSG 298 R +E ++ D + I ++ C+ + RM++ + G Sbjct: 222 -----RCVEDYLSSFNDILSIAKNVETDDRDSVVQACYKALNSKKMFGFSRMLSSQYDYG 276 Query: 299 MYKRL 303 +K++ Sbjct: 277 YFKKI 281 >UniRef50_D1QMD5 Glycosyl transferase, group 2 family n=1 Tax=Prevotella oris F0302 RepID=D1QMD5_9BACT Length = 300 Score = 117 bits (293), Expect = 5e-25, Method: Compositional matrix adjust. Identities = 68/211 (32%), Positives = 111/211 (52%), Gaps = 12/211 (5%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 +LSVIIP++N D R C++S+I Q +EII+++DGSTDNS++I YAE + + ++ Sbjct: 2 ELSVIIPVFNGADTLRRCLDSVIGQQIEGMEIILVDDGSTDNSLQICTQYAEKHAAITVI 61 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMT-MALEDDLDVAQCNADWC 125 HQ N G S ARN +EVA YV F+D DD + P Y+ LM + +++D+ + + D Sbjct: 62 HQTNKGLSEARNAALEVAKADYVTFIDCDDYLEPHTYKGLMNRLKKHENIDILEYSVDER 121 Query: 126 FRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHV-VWMGVYRRDVIVKNNIKFI 184 T +P DRL + +W L ++ + H W ++R+ + + +F Sbjct: 122 IGGKHHTRLQLP-DRLFT-------NWQNYWLKTKAYKHAYAWNKIFRKALFARE--QFK 171 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKY 215 A L +DI + + + T Y Y Sbjct: 172 ANLIFEDIELMSRLLKHCKNIMITSHGYYHY 202 >UniRef50_A5ZCV6 Putative uncharacterized protein n=1 Tax=Bacteroides caccae ATCC 43185 RepID=A5ZCV6_9BACE Length = 321 Score = 117 bits (293), Expect = 5e-25, Method: Compositional matrix adjust. Identities = 62/201 (30%), Positives = 110/201 (54%), Gaps = 14/201 (6%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +SVI+P+YN C++S+I Q++ LEII+++DGS D EI Y+ + ++H Sbjct: 2 ISVIVPIYNVERYINKCIDSIIHQSYKNLEIILVDDGSPDKCGEIIDEYSRIDKRIHVIH 61 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 ++N G S ARN G+++A+G ++ FVD+DD + PTMYE +++ ++ LD+ +C + Sbjct: 62 KSNGGLSSARNAGLDIASGDFIGFVDSDDWIEPTMYEEMLSFMEKEQLDLVECGINLVTN 121 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWT------HVVWMGVYRRDVIVKNNI 181 + P + VL+G D L L S+ T VW +++++ N Sbjct: 122 NISNKYIEKP------SIVLSGKDALIRHLDSKNRTVQSLPRTAVWSKLFKKEFWKTN-- 173 Query: 182 KFIAGLHHQDIVWTTEFMFNA 202 +F G H+D + T + ++ A Sbjct: 174 RFPVGEIHEDYLLTCKALYEA 194 >UniRef50_C1I2E9 Glycosyl transferase n=1 Tax=Clostridium sp. 7_2_43FAA RepID=C1I2E9_9CLOT Length = 323 Score = 117 bits (292), Expect = 8e-25, Method: Compositional matrix adjust. Identities = 62/224 (27%), Positives = 123/224 (54%), Gaps = 4/224 (1%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S+IIP++N + R C+ES+I Q ++ E+I+++DGSTD S+EI + Y + + V++L Sbjct: 2 ISLIIPVFNVEEYLRQCLESIIKQRFSNYEVILVDDGSTDKSMEIIREYEKKFKRVKILS 61 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 Q N G S ARN ++ A G+YV ++D+DD + M E ++ A + D+ CN + + Sbjct: 62 QRNKGVSEARNLALKHAKGEYVLYIDSDDFLKFNMLELMVNKANKTKADIVMCNYNLYYG 121 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 + + + + L + + + + M L+ +W +++R ++++NN +F +G Sbjct: 122 VNNKNNRVVSYNVLED-KIYSSSEVIDMILNFNIQGQ-LWNKLFKRSLLLENNFEFESGR 179 Query: 188 HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNK 231 + QDI + + + + + + Y YY S +H++ K Sbjct: 180 YVQDIFPVFKVINKSNKIAFINE--YLYYYRQRETSTIHKKNTK 221 >UniRef50_C3QCN3 Glycoside transferase family 2 n=6 Tax=Bacteroides RepID=C3QCN3_9BACE Length = 322 Score = 116 bits (291), Expect = 1e-24, Method: Compositional matrix adjust. Identities = 67/210 (31%), Positives = 113/210 (53%), Gaps = 4/210 (1%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K+SV+IP+YN R +ES++ Q+ LEII+++DGSTD S+ I + + +++ Sbjct: 7 KVSVVIPVYNTEKYVRQAVESVMYQSLKELEIIVVDDGSTDKSLSIVEKLGDTDKRIQIY 66 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 QAN G S+ARNRGI A G+Y+ F+D+DD + E E+ LD +A F Sbjct: 67 TQANQGQSIARNRGISHAHGEYIYFMDSDDLLEEDALELCYHKCKEEKLDFVFFDALVFF 126 Query: 127 R---ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 E T T++L + TG + L + L ++ +T V + R+VI K+N+ F Sbjct: 127 ENNVENAPTLNYKHTEKLED-KIYTGQEALEIQLQNKEYTPSVCLHFIHRNVIEKHNLSF 185 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLY 213 + H+D ++TT + +A +++ + Sbjct: 186 YPEIVHEDQLFTTLLYLQSTKAACIKRTFF 215 >UniRef50_A7VCQ5 Putative uncharacterized protein n=1 Tax=Clostridium sp. L2-50 RepID=A7VCQ5_9CLOT Length = 329 Score = 116 bits (291), Expect = 1e-24, Method: Compositional matrix adjust. Identities = 69/229 (30%), Positives = 117/229 (51%), Gaps = 19/229 (8%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K+S+IIP+Y C++S++ QT +EII+++DGS DN I YA V ++ Sbjct: 4 KVSIIIPVYKVEKFLTRCLDSVLGQTLKEIEIILVDDGSPDNCGNICDEYAAKDKRVIVI 63 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN----- 121 H+ NAG S ARN G+EVA+G++V FVD+DD V TM+E + A+ D ++A C Sbjct: 64 HKKNAGVSAARNSGLEVASGEFVGFVDSDDYVAATMFEDMYRQAVLADAEMAMCQFAITD 123 Query: 122 ---ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVK 178 R +G+ + + D ++ ++ D+ SR VW +++R++I Sbjct: 124 GATDQLVHRSSGDDFNVLKFDNKKAFELIA--DF------SRPVQVTVWNKLFKRELI-- 173 Query: 179 NNIKF-IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLH 226 N++F + +D+ + +F + Y +LY YY + H Sbjct: 174 QNLRFDTSKRMAEDLEFLMRALFKSRTVVYVPYALYAYYAQREGAATFH 222 >UniRef50_D2BMH0 Glycosyltransferase, family 2 n=2 Tax=Lactococcus lactis subsp. lactis RepID=D2BMH0_LACLK Length = 333 Score = 116 bits (290), Expect = 1e-24, Method: Compositional matrix adjust. Identities = 76/220 (34%), Positives = 115/220 (52%), Gaps = 6/220 (2%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K SVIIP+YNA C+ES+++Q+++ LE+I+INDGSTD S EI + +A V++ Sbjct: 2 KFSVIIPVYNASKFIDYCVESVLSQSYSDLEVILINDGSTDKSGEICEGFARKDNRVKVY 61 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTM-YETLMTMALEDDLDVAQCNADWC 125 HQ N GA+VARN GI ATG Y+ F D+DD ++ +E + E DV ++ Sbjct: 62 HQENQGAAVARNHGIRKATGDYLLFPDSDDYYASSLGFEEINNQLEETKADVLIHDS--- 118 Query: 126 FRETGE-TWQSIPTDRLRSTGV-LTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 F E G T + + LR + L +++ + + T W V R + KNNI F Sbjct: 119 FVERGNGTPDDLHSSLLREEVLKLEKIEFIEFLIKKDKLTRSAWTKVISRSFLSKNNILF 178 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVS 223 +D +T E + + E+ Y Y H+ SV+ Sbjct: 179 PTVRQTEDTGFTAELVRLGNSFDWYEKQFYAYVKHDESVT 218 >UniRef50_D1QQ98 Capsular polysaccharide biosythesis protein CpsI n=1 Tax=Prevotella oris F0302 RepID=D1QQ98_9BACT Length = 324 Score = 116 bits (290), Expect = 1e-24, Method: Compositional matrix adjust. Identities = 76/229 (33%), Positives = 122/229 (53%), Gaps = 10/229 (4%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTA--LEIIIINDGSTDNSVEIAKYYAENYPHVRL 65 LS+IIP+YN+ R C+ESL Q + EII++NDGSTD S + + Y ++RL Sbjct: 7 LSIIIPVYNSELFLRDCVESLYQQNISEDLFEIILVNDGSTDRSYMLCQESLSRYTNIRL 66 Query: 66 LHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWC 125 Q N G SVARN G+E A GKY+ FVD+DD + + ++ A D+++ + + Sbjct: 67 FSQKNQGQSVARNLGLENARGKYILFVDSDDRLPQNVIIPMLQEAEGYDIEILIGSVE-V 125 Query: 126 FRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIA 185 F + GE +S+ +D + ++TG L G T VW ++ R I N+++F+ Sbjct: 126 FNKKGE--KSLNSDFQKYNQIVTGEYILLNG----GITGSVWGRLFLRSFIENNHLRFLP 179 Query: 186 GLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLN 234 G+ H+D+++T + + A R + Y Y ++ S R G LN Sbjct: 180 GIKHEDVLFTIQSLVFAKRVVSLKLCSYVYAWNSGSTDR-SLDGESKLN 227 >UniRef50_B5LBK9 Putative glycosyl transferase n=1 Tax=Streptococcus pneumoniae RepID=B5LBK9_STRPN Length = 322 Score = 116 bits (290), Expect = 1e-24, Method: Compositional matrix adjust. Identities = 65/214 (30%), Positives = 113/214 (52%), Gaps = 12/214 (5%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K+SVI+P+YN C+ES++ QT+T LEI+++NDGSTD S E+ A+ +R++ Sbjct: 5 KISVIVPVYNVEAYLEKCVESILKQTYTNLEILLVNDGSTDKSGELCDQLAQRDQRIRVI 64 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 H+ N G S ARNRGIE A+ + F+D+DD + MYETL LE + D++ C Sbjct: 65 HKENGGLSDARNRGIEEASSDLIGFIDSDDYIDEDMYETLYRQMLESNADLSMCGH---- 120 Query: 127 RETGETWQSIPTDRLRSTGV--LTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 + + IP ++ + LT + ++M + ++ + +Y++++ ++F Sbjct: 121 ---YDVYDQIPEKQVATIQTWELTPQEAIKMVMEAKILSVTAVNKLYKKELF--EQLRFE 175 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLH 218 G +D + + T + Y YY+H Sbjct: 176 IGKIAEDAFIMIALIHQCRKVVATNEKKY-YYVH 208 >UniRef50_D1PWT5 Exopolysaccharide biosynthesis protein n=1 Tax=Prevotella bergensis DSM 17361 RepID=D1PWT5_9BACT Length = 321 Score = 116 bits (290), Expect = 1e-24, Method: Compositional matrix adjust. Identities = 74/255 (29%), Positives = 129/255 (50%), Gaps = 21/255 (8%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M +T K+S+I+P+YNA + C+ES++ Q++T E+++++DGS D+S+ + K +AE Sbjct: 1 MENTPKISIIVPVYNAEKEITRCVESILGQSYTNFELLLVDDGSKDSSLSLCKSFAEKDN 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 + + H+ N GA+ AR G+ A G+Y+ FVDADD + T TL+++A D LD+ QC Sbjct: 61 RINVFHKKNGGANSARAFGVREAGGEYINFVDADDYIPSTSLSTLVSIATTDQLDIVQCA 120 Query: 122 ADWCFRETG---ETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVK 178 F G ET+ P TGV ++ +++ V +YRR++ + Sbjct: 121 RK--FLPVGGEKETFSFFP-----QTGVFDSEQFISFLFAAKCNGGPVG-SLYRREIFGE 172 Query: 179 NNIKFIAGLHHQDIVWTTEFMFN---ALRARYT--EQSLYKYYLHNTSVSRLHRQGNKNL 233 D+V +F N ++A+ SL Y+ N S + + + L Sbjct: 173 ETFHL-----DNDVVLGEDFYMNLCLGIKAKRIGLNNSLVYVYIENESSATHNYEFVSVL 227 Query: 234 NYQRHYIKITRLLEK 248 + + R+LE+ Sbjct: 228 PLKHQLESVRRILEE 242 >UniRef50_A7VPB1 Putative uncharacterized protein n=1 Tax=Clostridium leptum DSM 753 RepID=A7VPB1_9CLOT Length = 330 Score = 116 bits (290), Expect = 1e-24, Method: Compositional matrix adjust. Identities = 66/221 (29%), Positives = 112/221 (50%), Gaps = 4/221 (1%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M +S+IIP+YN C+ES+ QT E+I+++DGSTD+S+EI + +A + Sbjct: 1 MKTVVPNVSLIIPVYNVEQYLPKCLESIAAQTLKGFEVILVDDGSTDHSLEILRGFARRF 60 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 P+ ++HQ N G S ARN GI+ A G+Y+AF+D+DD + P + L A + D+ C Sbjct: 61 PNTCVIHQENGGVSKARNAGIQAARGEYIAFMDSDDYIAPLYLQRLYESAKKYRADMVCC 120 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN 180 + +R T + P ++ G+ T + +R + +W ++RR + +++ Sbjct: 121 SY---YRYNAVTHELRPAQFRKAPGIYTPKEMIRALILDMGIKGFIWNKLWRRTLFTEHH 177 Query: 181 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTS 221 I F + +DI + + R LY Y H S Sbjct: 178 ITF-PTMCFEDISVCQRGFYFSNRIAVIGDPLYFYVQHKDS 217 >UniRef50_C0F023 Putative uncharacterized protein n=1 Tax=Eubacterium hallii DSM 3353 RepID=C0F023_9FIRM Length = 412 Score = 116 bits (290), Expect = 2e-24, Method: Compositional matrix adjust. Identities = 81/272 (29%), Positives = 131/272 (48%), Gaps = 19/272 (6%) Query: 4 STNK----LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAEN 59 ST+K +SVIIP YN G ++S++ Q++ EII +NDGSTD ++EI Y Sbjct: 2 STDKKEYGISVIIPCYNVGQYVEETLKSVLNQSFKNYEIICLNDGSTDGTLEILNKYQSL 61 Query: 60 YPHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQ 119 YP++++ N G + RN+G++ A GKY+ ++D DD + ETL A D LD+ Sbjct: 62 YPYIQVFTGENHGVAYQRNKGVQRARGKYIYYLDGDDLIKENCLETLYQYAEADSLDILY 121 Query: 120 CNADWCFRETGETWQSIPT------DRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRR 173 A+ F E+ E ++ P GV G + + V + RR Sbjct: 122 FEAE-SFYESKEIEEAFPQFLTLYHRHKEYDGVYDGKNLYIEMENKGDIKMSVCLQFVRR 180 Query: 174 DVIVKNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNL 233 ++ NNIKF + +D ++T M A RAR +L YL + + G + + Sbjct: 181 QFLIDNNIKFGEERYFEDNLYTVRTMIKAARARCVRDNL---YLRRVRGNSIMTGGKRKI 237 Query: 234 NYQRHYIKITR----LLEKLNRNYADKIMIYP 261 ++ Y+++ R +LE +N A + IY Sbjct: 238 RFE-SYLEVVRGLMQILEDEKQNSALQEAIYK 268 >UniRef50_C9BVV0 Glycosyl transferase n=3 Tax=Enterococcus faecium RepID=C9BVV0_ENTFC Length = 335 Score = 115 bits (289), Expect = 2e-24, Method: Compositional matrix adjust. Identities = 63/216 (29%), Positives = 114/216 (52%), Gaps = 7/216 (3%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 ++S+I+P+Y R C++S++ QT+T E+I+++DGS DNS +I YAE VR++ Sbjct: 3 EISIIVPVYKVEKYLRKCVDSILAQTFTDFEVILVDDGSPDNSGKICDEYAEKDNRVRVI 62 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 H+ N G S ARN GI+VA GKY+ FVD+DD + MYE L D D++ + Sbjct: 63 HKENGGLSSARNAGIDVARGKYLGFVDSDDYIDEDMYEILYENLKIHDADISSVELIPFY 122 Query: 127 RETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAG 186 G+ ++ ++ + +L + ++ L ++ W +YR+++ N +++ G Sbjct: 123 ---GDRYKK--ANKEKKVIILNKKEAIKSVLEGTQFYAYAWNKLYRKELFKDN--RYLDG 175 Query: 187 LHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSV 222 +D + +F + + + Y Y N S+ Sbjct: 176 KTFEDAYIIIDLLFQTEKIVVSNEEKYFYLQRNDSI 211 >UniRef50_B3JEB6 Putative uncharacterized protein n=1 Tax=Bacteroides coprocola DSM 17136 RepID=B3JEB6_9BACE Length = 344 Score = 115 bits (289), Expect = 2e-24, Method: Compositional matrix adjust. Identities = 63/232 (27%), Positives = 121/232 (52%), Gaps = 17/232 (7%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 + K+S+I+P+YN C+ESLI QT +EII+++D S D+ I YA+ ++ Sbjct: 2 SPKVSIIVPVYNVEKYIHRCVESLINQTLKDIEIILVDDESPDSCPSICDEYAQKDARIK 61 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++H+ N G + RN G+E+A+G+++AF+D+DD V TMYETL A + D C+ + Sbjct: 62 VIHKKNEGLGLTRNAGLEIASGEFIAFIDSDDWVDLTMYETLYNTAKNNQCDTVYCSLQY 121 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGL-----------SSRRWTHVVWMGVYRR 173 + + IP + G + L S ++ V +Y R Sbjct: 122 YYSPN----KIIPFKEVNEETFFQGRKEIDTFLLDMLAPLPTYHSDVKYMVSVCKAIYSR 177 Query: 174 DVIVKNNIKFIAG--LHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVS 223 ++I +++KF++ + +D+V+ T+++ + + + + Y Y+ + S++ Sbjct: 178 NIIRNHHLKFVSERIIASEDMVFHTQYLKLSEKIGFIPKYFYNYFQNENSIT 229 >UniRef50_P71057 Putative glycosyltransferase epsH n=3 Tax=Bacillus subtilis group RepID=EPSH_BACSU Length = 344 Score = 115 bits (289), Expect = 2e-24, Method: Compositional matrix adjust. Identities = 73/221 (33%), Positives = 117/221 (52%), Gaps = 8/221 (3%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 T +S+++ +YN RTC+ESL QT +EIII+NDGS D S +IA+ YA+ + Sbjct: 3 TPAVSLLVAVYNTETYIRTCLESLRNQTMDNIEIIIVNDGSADASPDIAEEYAKMDNRFK 62 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++HQ N G RN+GIE A G+++AF+D+DD + P E ++ A D+ D+ CN Sbjct: 63 VIHQENQGLGAVRNKGIEAARGEFIAFIDSDDWIEPDYCEQMLRTA-GDETDLVICNYAA 121 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 F +TG+T S D ++ +++ L + W +YRR +I + + F Sbjct: 122 EFEDTGKTMDS---DIAQTYQDQPKEHYIK-ALFEGKVRGFSWNKLYRRSMIEAHRLSFP 177 Query: 185 --AGLHH-QDIVWTTEFMFNALRARYTEQSLYKYYLHNTSV 222 L H +D ++ F A Y + LY Y +H +S+ Sbjct: 178 LRGELEHVEDQFFSFRAHFFARSVSYVKTPLYHYRIHLSSI 218 >UniRef50_B0NP43 Putative uncharacterized protein n=1 Tax=Bacteroides stercoris ATCC 43183 RepID=B0NP43_BACSE Length = 319 Score = 115 bits (289), Expect = 2e-24, Method: Compositional matrix adjust. Identities = 68/220 (30%), Positives = 111/220 (50%), Gaps = 5/220 (2%) Query: 7 KLSVIIPLYNAGDDFRTCMESL--ITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 +LS++IP+YN C++S+ I A E+II+NDG+ D S+ I + YA+ YPH++ Sbjct: 2 ELSIVIPIYNVEAYIGRCLDSVSKIFGLGLACEVIIVNDGTPDGSMAIVEDYAKKYPHIK 61 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 +++Q N G ARN G+ A G+YV F+D+DD + TL DV N + Sbjct: 62 IINQDNQGLGEARNVGLRHACGEYVYFLDSDDTIKAENLVTLFRSGYNSLPDVLVGN--Y 119 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 GE +P + V TG D+ + +V G+Y++ ++ NN+ F Sbjct: 120 SNIVEGENRHYLPAIIEEQSRVYTGEDYFNR-YYVKSINILVVQGIYKKSFLLDNNLFFT 178 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSR 224 G+ +D+ W + + +A Y +Y YYL S+ R Sbjct: 179 KGIFFEDVNWMPKMLLHANAIYYKNICIYNYYLRQGSIIR 218 >UniRef50_D0BME7 Capsular polysaccharide biosynthesis protein CpsI n=1 Tax=Granulicatella elegans ATCC 700633 RepID=D0BME7_9LACT Length = 333 Score = 115 bits (288), Expect = 2e-24, Method: Compositional matrix adjust. Identities = 78/251 (31%), Positives = 135/251 (53%), Gaps = 16/251 (6%) Query: 6 NKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRL 65 +K+S+I+P+YN + R C+ES+ QT+T E I+INDGSTD+S EIA+ Y E+ L Sbjct: 5 SKISIIVPIYNVENYLRECLESIKNQTFTNFECIMINDGSTDSSKEIAEQYLED-ERFYL 63 Query: 66 LHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWC 125 ++Q N+G SVARNR I +A G++V FVD+DD ++P + E L+ +++D+D+ + + Sbjct: 64 INQNNSGQSVARNRSINLAKGEFVCFVDSDDMIHPRLLE-LLVAYIQEDVDIVEAD---- 118 Query: 126 FRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIA 185 F + + + + + G + L+ + + TH + R+ +I N+ F Sbjct: 119 FTDNLDDFIEENCSNVHISFEGYGENVLKACIDTWAITHQPVAKLIRKSLI--ENVLFPE 176 Query: 186 GLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRL 245 L ++D+ +T +F +R + + Y YY + + ++ Q I L Sbjct: 177 NLIYEDL-YTGIALFKEIR-KAVKLDFYGYYYRPNPLGTM---KGSSVEKQLDIFPICEL 231 Query: 246 LEKLNRNYADK 256 LEK Y+DK Sbjct: 232 LEKY---YSDK 239 >UniRef50_C7H3W2 Capsular polysaccharide biosynthesis protein n=1 Tax=Faecalibacterium prausnitzii A2-165 RepID=C7H3W2_9FIRM Length = 318 Score = 115 bits (288), Expect = 2e-24, Method: Compositional matrix adjust. Identities = 77/241 (31%), Positives = 115/241 (47%), Gaps = 14/241 (5%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 T +SVI+P+YN C++S++ QT+ LE+++++DGSTD S I YA V Sbjct: 2 ETELISVIVPVYNVERYLCRCVDSILHQTYRNLEVLLVDDGSTDASGAICDEYAAQEERV 61 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 +HQ N G S ARN G+E A G Y+ FVD+DD + M ETL E D DVA Sbjct: 62 TAVHQKNGGLSAARNTGLERAKGTYLCFVDSDDLLDSRMLETLCRDLQEQDADVAVVG-- 119 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 FR + P + +TG + +R L S W +Y+R++ +I++ Sbjct: 120 --FRMFEREEELAPAELAVPVQCMTGREAIRSTLVSDELGDFAWNKLYKRELF--RDIRY 175 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNK------NLNYQR 237 G +D T + Y LY YY S+ LHR+ K ++ YQ+ Sbjct: 176 PLGRMMEDQGTTYRIFQQCSKVVYRPVPLYYYYQRPDSI--LHRRNMKFYEDKLDMGYQK 233 Query: 238 H 238 + Sbjct: 234 Y 234 >UniRef50_B3Z5I4 Glycosyltransferase n=2 Tax=Bacillus cereus group RepID=B3Z5I4_BACCE Length = 342 Score = 115 bits (288), Expect = 2e-24, Method: Compositional matrix adjust. Identities = 73/245 (29%), Positives = 130/245 (53%), Gaps = 14/245 (5%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K+SVI+P+YN+ C+ES+I QT+ +EI+IINDGS+D S I Y + + Sbjct: 4 KVSVIVPVYNSEKFISKCLESIIRQTYKNIEILIINDGSSDESETIINVYRKIDHRITYY 63 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 +Q N+G S ARNRGI ATG+YV F+D+DD V+ L+ + + D+ C Sbjct: 64 YQNNSGPSKARNRGILNATGEYVIFLDSDDTVHENYVMYLLNEMINSNSDLVCCGYKDIS 123 Query: 127 RETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAG 186 R + + ++ +++M + V+W +Y+R++I K N+K Sbjct: 124 R-----YGMLNCSDFNFESSISINSFMKM--VCKGTGGVLWGKIYKREIITKFNLKMDEK 176 Query: 187 LHH-QDIVWTTEFMFNALRARYTEQSLYKYY-LHNTSVSRLHRQGNKNLNYQRHYIKITR 244 + +D+V+ ++ + +Y E LY Y L+ S+S+ N +++Y ++YI + + Sbjct: 177 IFMCEDLVFVLQYASQCKKFKYVEAYLYNYNRLNQYSISK-----NISIHYMQNYILVCK 231 Query: 245 LLEKL 249 +E++ Sbjct: 232 RIEEI 236 >UniRef50_C0YL33 Glycosyltransferase n=1 Tax=Chryseobacterium gleum ATCC 35910 RepID=C0YL33_9FLAO Length = 333 Score = 115 bits (287), Expect = 3e-24, Method: Compositional matrix adjust. Identities = 64/232 (27%), Positives = 123/232 (53%), Gaps = 14/232 (6%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M N +++S+I+P+YN + C++SL+ Q+ +EI+++NDGS DNS +I + YA+ Y Sbjct: 1 MTNVPSRVSIIVPVYNVENYLAKCLDSLVKQSLPDIEILVVNDGSKDNSEKIIEGYAQKY 60 Query: 61 PH-VRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQ 119 P ++ + N G S ARN GI+ ATG Y+ FVD+DD V TM+E ++ +A + + + Sbjct: 61 PEKIKAYTKENGGLSDARNFGIDRATGDYIGFVDSDDYVTETMFEEMLLLAEKHNAKMVI 120 Query: 120 CNADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMG---VYRRDVI 176 N E G+ Q + T + P+ + + + ++ + + ++++++ Sbjct: 121 SNIQKV-DENGKVVQKL-------TQLPNMPEKITLENNFSVFSDISYFACNKLFKKELF 172 Query: 177 VKNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQ 228 N +F G+H +DI + + T+ Y+Y S+++ H + Sbjct: 173 --NQKRFKKGVHFEDIQLIPQLLLECETIAQTQNFHYQYLERTDSITKTHTE 222 >UniRef50_C4ZG45 Glycosyltransferase Family 2 modular protein n=1 Tax=Eubacterium rectale ATCC 33656 RepID=C4ZG45_EUBR3 Length = 723 Score = 115 bits (287), Expect = 3e-24, Method: Compositional matrix adjust. Identities = 72/224 (32%), Positives = 115/224 (51%), Gaps = 7/224 (3%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTW--TALEIIIINDGSTDNSVEIAKYYAEN 59 MN +K+SVIIP YN C +S+ +QT+ LE+I+I+D STDN+ I + Sbjct: 1 MNGKDKISVIIPCYNVQKYIMRCFDSIYSQTYGFENLEVILIDDLSTDNTWSILESLQRQ 60 Query: 60 YPH--VRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDV 117 YP + L Q ARN G+++ TGKY+ FVDADD V+P M L EDD DV Sbjct: 61 YPENVISLKIQKKGKCGGARNLGMDICTGKYITFVDADDYVHPDMLRVLHDRMTEDDYDV 120 Query: 118 AQCNADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHV-VWMGVYRRDVI 176 AQC CF+ + ++ + + + + +T+V W +Y I Sbjct: 121 AQCGVS-CFKADKPNVLEVNDFEIQRLDLDNVDNRKNLIIGLTGFTNVTAWAKLYSTKFI 179 Query: 177 VKNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNT 220 +++N++FI ++++D ++ + A + + +LY YY NT Sbjct: 180 IEHNLRFIEDVYYEDTYFSMLCVLLAKKYCKVQSTLY-YYFENT 222 >UniRef50_C9RJ54 Glycosyl transferase family 2 n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RJ54_FIBSS Length = 321 Score = 115 bits (287), Expect = 3e-24, Method: Compositional matrix adjust. Identities = 80/244 (32%), Positives = 118/244 (48%), Gaps = 22/244 (9%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +SVIIPLYN + C++SL+ QT+T E+II++DGSTD IA YA +L+ Sbjct: 4 VSVIIPLYNTEAFIKDCLDSLVAQTFTDFEVIIVDDGSTDGGARIAASYASADTRFKLIG 63 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC---NADW 124 Q N G S ARN G+++ G+YV F+D+DD V P ETL +A DVA C N D Sbjct: 64 QPNKGPSEARNTGLKIMRGEYVTFIDSDDCVAPNFLETLFFIAQLHKADVACCSIKNIDE 123 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWT-HVVWMGVYRRDVIVKNNIKF 183 ++ G T P + ++ LT + R+ L + W +Y+ D + K F Sbjct: 124 AYKCDGTT----PNTSISAS--LTPEEAARISLYQDSLPDYSAWNKLYKAD-LWKGKF-F 175 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKIT 243 A ++D+ E + A + T+ LY Y ++ L Q KI Sbjct: 176 PANTIYEDLAVIPEILLGANKVVTTKSHLYFY----------RKRAGSELATQTGSTKII 225 Query: 244 RLLE 247 LL+ Sbjct: 226 LLLD 229 >UniRef50_C3Q1B0 Glycosyltransferase family 2 protein n=2 Tax=Bacteroides RepID=C3Q1B0_9BACE Length = 347 Score = 114 bits (286), Expect = 4e-24, Method: Compositional matrix adjust. Identities = 64/236 (27%), Positives = 118/236 (50%), Gaps = 17/236 (7%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M N K+SVI+P+YN C++S++ QT LEII+++DGS DN + YA+ Sbjct: 1 MENDIPKISVIVPVYNVERYLDRCIQSILNQTLKELEIILVDDGSPDNCPALCDEYAQKD 60 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 ++++H+ N G ARN G+ +A+GK+V+F+D+DD V P MYE L +A + D C Sbjct: 61 ARIKVIHKKNEGLGFARNAGLNLASGKFVSFLDSDDWVAPAMYEALYRVAEKMKCDTVYC 120 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGL-----------SSRRWTHVVWMG 169 + + + + + IP + + G + L S ++ V Sbjct: 121 SLQYYYSKD----KIIPFEEVGQEVFFRGRKSVDSFLLDMLAPLPSYRSDVKYMVSVCKA 176 Query: 170 VYRRDVIVKNNIKFIAG--LHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVS 223 +Y R +I N ++F++ + +D+ + ++ A + + Y Y+ + S++ Sbjct: 177 IYLRKIIEDNQLRFVSEKVVASEDMFFHVRYLKLAENVGFIPEYFYNYFQNECSIT 232 >UniRef50_A5KMN1 Putative uncharacterized protein n=1 Tax=Ruminococcus torques ATCC 27756 RepID=A5KMN1_9FIRM Length = 335 Score = 114 bits (286), Expect = 4e-24, Method: Compositional matrix adjust. Identities = 56/111 (50%), Positives = 76/111 (68%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S+I+P YN G +FR C++SLI QT+ LEII+I+DGSTD S EIAK YA+ V L Sbjct: 22 ISIIVPFYNPGGNFRRCLDSLIHQTYGNLEIILIDDGSTDGSYEIAKEYAQIDKRVVLEK 81 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVA 118 Q N+G S ARNRGI++A G Y +F+D+DD + YE L+ + E +DV Sbjct: 82 QKNSGVSHARNRGIQIAHGDYFSFIDSDDYLDLDTYEYLLGILNEKKVDVV 132 >UniRef50_A3KLK7 Beta-1,3-N-acetylglucosaminyl transferase n=3 Tax=Streptococcus agalactiae RepID=A3KLK7_STRAG Length = 323 Score = 114 bits (286), Expect = 4e-24, Method: Compositional matrix adjust. Identities = 68/197 (34%), Positives = 107/197 (54%), Gaps = 13/197 (6%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 +NK+SVI+PLYN C+ SL QT+ +EII+I+DGSTDNS I K A+ + Sbjct: 6 SNKISVIVPLYNVEKYIINCISSLTNQTYQNIEIILIDDGSTDNSGRICKKLAQEDHRII 65 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 L + N G S ARN G++ ATG Y+ FVD+DD + TMYE L+ LE + D+A+ D+ Sbjct: 66 YLRKENGGVSSARNLGLQYATGSYIGFVDSDDYISKTMYENLLKRLLETNADIAE--TDF 123 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 + T + VL + +R LS + + + ++++ VI N+KF Sbjct: 124 ALIDNRFTKKKRKK----IQKVLNKEEAIREFLSGNVVENNLVIKLFKKTVIA--NLKF- 176 Query: 185 AGLHHQDIVWTTEFMFN 201 +D++ + +F+ Sbjct: 177 ----KEDVIVGEDMLFS 189 >UniRef50_C0X9Z9 Glycosyltransferase n=1 Tax=Lactobacillus gasseri JV-V03 RepID=C0X9Z9_9LACO Length = 294 Score = 114 bits (286), Expect = 4e-24, Method: Compositional matrix adjust. Identities = 71/229 (31%), Positives = 125/229 (54%), Gaps = 12/229 (5%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M++ K+SVIIP+YN + C+ES++TQT+T LEII+++DGSTD++ EI + Y E Y Sbjct: 1 MDTNEKVSVIIPVYNDEKYLKQCVESVLTQTYTNLEIILVDDGSTDHTPEICEEYREKYN 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 +R+LH+ N G +RN G+ +ATG+YV FVD DD + E L +A+++ D+A N Sbjct: 61 QIRVLHKKNGGVGSSRNAGLALATGEYVLFVDHDDWLDQHHIEDLYNLAVKNKADIAAGN 120 Query: 122 ADWCF--RETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTH-----VVWMGVYRRD 174 + + + T W + + + + +W + + + V W +Y+R Sbjct: 121 FNIFYDDKSTFAYWLNKDSYFEKDYSI---KEWFSLQYRTDYYNMSLVFTVPWDKLYKRS 177 Query: 175 VIVKNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVS 223 + NI + + +D + T + A + Y +++Y + + +SVS Sbjct: 178 LF--KNIVYPENVPVEDDLTTWKIYLLANKIAYENKAIYTHRILKSSVS 224 >UniRef50_D0R4M4 Glycosyltransferase n=1 Tax=Lactobacillus johnsonii FI9785 RepID=D0R4M4_LACJF Length = 339 Score = 114 bits (286), Expect = 4e-24, Method: Compositional matrix adjust. Identities = 68/218 (31%), Positives = 113/218 (51%), Gaps = 12/218 (5%) Query: 3 NSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPH 62 N T ++S+I+P+YN C+ S+I QT+ E+I++NDGS DNS +I + YA+ Y Sbjct: 5 NGTPEISIIVPVYNVEKWIENCIRSVIAQTFNNWELILVNDGSLDNSQQICEKYAKKYSK 64 Query: 63 VRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNA 122 + L+++ N G S ARN GI+ A GKY+AFVD DD + L T + + +A C Sbjct: 65 IHLINKENGGLSDARNFGIKKAKGKYLAFVDGDDYIDSNYLSKLHTAIISSNASIAVCG- 123 Query: 123 DWCFRETGETWQSIPTDRLRSTGV---LTGPDWLRMGLSSRRWTH--VVWMGVYRRDVIV 177 ++E + + R V ++G +LR+ L+ + T V W +Y++D+ Sbjct: 124 ---YKEVDNKKKILSLKRTNDFFVQKLVSGQTFLRI-LAEKNCTLCVVAWNKLYKKDLFK 179 Query: 178 KNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKY 215 N+ K G H+D +++ + + LY Y Sbjct: 180 NNSYK--KGRLHEDEFIIAPLVYDVSKIALVDDELYNY 215 >UniRef50_C6ID60 Glycosyltransferase n=5 Tax=Bacteroides RepID=C6ID60_9BACE Length = 368 Score = 114 bits (286), Expect = 4e-24, Method: Compositional matrix adjust. Identities = 82/292 (28%), Positives = 130/292 (44%), Gaps = 22/292 (7%) Query: 15 YNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLHQANAGAS 74 YN C+ SL QT T +EII++NDGSTDNS + + YA + Q N G S Sbjct: 53 YNTEPFLAECLHSLEKQTLTDIEIILVNDGSTDNSGRLLREYAGKDARFVYVEQENQGLS 112 Query: 75 VARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFRETG-ETW 133 ARN G+E A+G Y+AF+D+DD + + L +A + D+ N + ++W Sbjct: 113 AARNTGMEHASGHYLAFLDSDDWLAENALQVLCAIAAKTRTDIVSGNTLAVHADGQVQSW 172 Query: 134 QSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGLHHQDIV 193 + + + V++G + + R + +V+ +YRRD I +N +F GL H+D + Sbjct: 173 ERRGRELFATGTVVSGSTYFSRVMDCRCYVPMVYNYLYRRDFIEQNGFRFEPGLVHEDEL 232 Query: 194 WTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQG----------NKNLNYQRHYIKIT 243 WT + + A + + Y Y S+ G K L Y R ++ Sbjct: 233 WTPQVLTTAQKITVADIDFYYYRQREGSIMTATAAGRRIASIQLIIEKLLEYSRKHLFEK 292 Query: 244 RLLEKLNRNYADKIMIYP---EFHQQITY--------EALRVCHAVRKEPDI 284 + E Y + IY H TY E LRVC +R++ + Sbjct: 293 KYREAKEALYVRLLQIYSTACTLHPDGTYTTLYDRAGEMLRVCEELRRQESL 344 >UniRef50_Q815A0 Glycosyltransferase n=6 Tax=Bacillus cereus RepID=Q815A0_BACCR Length = 319 Score = 114 bits (285), Expect = 5e-24, Method: Compositional matrix adjust. Identities = 64/218 (29%), Positives = 107/218 (49%), Gaps = 5/218 (2%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S+I+P+YN C++S+I Q + LEI+++NDGSTD S I YAE + H Sbjct: 3 ISLIVPIYNVEGFLPRCLDSIIKQEYQNLEIVLVNDGSTDQSGLICDQYAEKDERFIVFH 62 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 + N G S ARN G+ A G Y+ FVD DD + P M L + ++ D+A C + Sbjct: 63 KENGGVSSARNVGLNKANGNYIGFVDPDDWIEPNMINKLYELIIQHKADIAMCG--YVKE 120 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 + S P + ST +L D L L+ + + ++ D+ K N+ + Sbjct: 121 DINGLLISRPVE--LSTKILNKEDTLNSILNPNDFRGFLCNKLFSVDLFRKKNLYLDEDI 178 Query: 188 HH-QDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSR 224 H +D+++ + + + + Y Y Y +HN + S+ Sbjct: 179 HFCEDLLFCCQAILESKQIVYDSTPYYHYIIHNNNASK 216 >UniRef50_C3PZ30 Glycosyl transferase n=4 Tax=Bacteroides RepID=C3PZ30_9BACE Length = 336 Score = 114 bits (285), Expect = 5e-24, Method: Compositional matrix adjust. Identities = 61/181 (33%), Positives = 101/181 (55%), Gaps = 14/181 (7%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 T+ +SVI+P++N R C+ES++ T+T L+II++NDGSTD+S EI Y + Sbjct: 7 TDLISVIVPIFNVEAYLRPCIESILASTYTTLQIILVNDGSTDHSGEICDEYTRKDTRIE 66 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++HQ NAG S ARN G++ A GKY++F+D DD ++P MYE L+ E + + Sbjct: 67 VIHQKNAGLSPARNSGMKAAKGKYISFIDGDDYIHPQMYEVLLEALQEGNYSFSMILGKQ 126 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLR---------MGLSSRRWTHVVWMGVYRRDV 175 + + SIP+ +S +LT ++ G+ + VVW +Y+R++ Sbjct: 127 VY--DNDKSYSIPSAYTKS--ILTQEIMIKSLFNHIHPQQGIKEVQ-AQVVWNKLYKREL 181 Query: 176 I 176 + Sbjct: 182 L 182 >UniRef50_Q5LBM4 Putative glycosyltransferase protein n=4 Tax=Bacteroides RepID=Q5LBM4_BACFN Length = 342 Score = 114 bits (285), Expect = 6e-24, Method: Compositional matrix adjust. Identities = 77/259 (29%), Positives = 136/259 (52%), Gaps = 35/259 (13%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K+SVI+P+YN C+++L+ QT + +EII+I+D S DN +I YA YP+++++ Sbjct: 4 KVSVIVPIYNVEKYLDQCVQALLAQTLSDIEIILIDDESPDNCPKICDDYAAQYPNIKVI 63 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 H+ NAG +A N G++VATG+YVAF D+DD V MY T+ +A + C+A + Sbjct: 64 HKKNAGLGMACNSGLDVATGEYVAFCDSDDYVDSDMYMTMYNVAQK-----YTCDAVF-- 116 Query: 127 RETGE---TWQSIPTDRLRS-----------------TGVLTGPDWLRMGLSSRRWTHVV 166 TG T IPT + ++ + R + + VV Sbjct: 117 --TGLKRITMAGIPTGTVTHQKEFKLYKNKNEIHTLLKDLIASDPYAREERAIQVSAKVV 174 Query: 167 WMGVYRRDVIVKNNIKFIAG--LHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSR 224 +YRR++I K +++F++ L +D+++ + + N+ Q+ Y Y + S+S Sbjct: 175 ---LYRRNLIEKKHLRFVSERILPSEDLIFNVDVLANSNIVCVLPQTFYNYRTNPISISH 231 Query: 225 LHRQGNKNLNYQRHYIKIT 243 ++ +L +++ YI+IT Sbjct: 232 TIKKDKFSL-FKQLYIEIT 249 >UniRef50_B4AD18 Glycosyltransferase n=18 Tax=Enterococcus RepID=B4AD18_ENTFA Length = 326 Score = 114 bits (284), Expect = 7e-24, Method: Compositional matrix adjust. Identities = 59/213 (27%), Positives = 123/213 (57%), Gaps = 9/213 (4%) Query: 6 NKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRL 65 +++S+I+P+YN + + C+ES+++QT+T E+++++DGSTD+S E+ +++ Sbjct: 2 SEISIIVPVYNVENYLKKCVESILSQTFTDFELLLVDDGSTDSSGEMCDELKRLDERIKV 61 Query: 66 LHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWC 125 +H+ N G S ARN GI+VA GKY+ FVD+DD + M E L +++D D++ Sbjct: 62 IHKENGGLSSARNAGIDVAKGKYLTFVDSDDYIDTHMLEVLYKNMVQEDADLSIVGVTSV 121 Query: 126 FRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIA 185 + +G+ + P+++ V+T + +M L ++ + +Y+R++ +++++ Sbjct: 122 Y--SGQEPEITPSEKY----VVTQKEATKMILIGKQASVYAVAKLYKRNIF--DDLRYPV 173 Query: 186 GLHHQDIVWTTEFMFNALRARYTEQSLYKYYLH 218 G H+D V+ + N + + + Y +Y+H Sbjct: 174 GKAHED-VYVIMDVLNQCKKVVVDTASYYFYVH 205 >UniRef50_C7X8L4 Glycoside transferase, family 2 n=1 Tax=Parabacteroides sp. D13 RepID=C7X8L4_9PORP Length = 308 Score = 114 bits (284), Expect = 7e-24, Method: Compositional matrix adjust. Identities = 67/220 (30%), Positives = 109/220 (49%), Gaps = 4/220 (1%) Query: 6 NKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRL 65 N +SVIIP+YN + + S++ QT +EII+INDGSTD S+ I + A+ +R Sbjct: 4 NLVSVIIPVYNTEEFVEEAVRSIMNQTLQEIEIIVINDGSTDGSLSIIERLAKEDKRIRT 63 Query: 66 LHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWC 125 Q N G S RN+G E A GKY+ F+D+DD + P ET + LD +AD Sbjct: 64 YTQQNQGLSATRNKGNEYAQGKYIYFMDSDDYLEPETLETCYIKCEQQALDFVLFDADIL 123 Query: 126 FRETGETWQSIPTDRLRSTG---VLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIK 182 +E ++ + R T V GP L++ + + ++ + RR+ + + + Sbjct: 124 NKEKHKSLH-LNYQRKDCTNPELVYQGPHILKLLMDHKAYSPSACLNFIRREYLTEIQLD 182 Query: 183 FIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSV 222 F+ G+ H+D ++T A R + +K L S+ Sbjct: 183 FLPGIIHEDQLFTCLLYLQAQRVSSIHKDFFKRRLREDSI 222 >UniRef50_UPI0001C377F0 glycosyl transferase, group 2 family protein n=1 Tax=Ruminococcus flavefaciens FD-1 RepID=UPI0001C377F0 Length = 321 Score = 114 bits (284), Expect = 7e-24, Method: Compositional matrix adjust. Identities = 73/232 (31%), Positives = 118/232 (50%), Gaps = 14/232 (6%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 L+VI+P+YN C+ SLI+QT++ + II++NDGSTDNS +IA +AENY ++ ++H Sbjct: 4 LTVIVPIYNVEKYLEKCVLSLISQTYSQMCIILVNDGSTDNSGKIADKFAENYDNITVIH 63 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 + N G S ARN GIE +Y+AFVD+DD V MYE L++ ++ +D++ + Sbjct: 64 KTNGGLSDARNVGIESTNTEYIAFVDSDDYVESNMYEVLLSKFSDEQIDISIGTVLY-EN 122 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWT-----HVVWMGVYRRDVIVKNNIK 182 E GE + T + + + ++ L ++ V ++R ++ + Sbjct: 123 EQGEGYCPYKTGI---SAIWNNEEAMKYLLCYSKFNMSFCDKVFKTKLFRTPAYGEDKLL 179 Query: 183 FIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLN 234 F G +D + A R Y LY Y S+SR N N+N Sbjct: 180 FPYGKKCEDQFLMCRVLARASRVAYNSIPLYHYVQRFGSISR-----NTNIN 226 >UniRef50_A3CP06 Cell-wall biogenesis glycosyltransferase, putative n=3 Tax=Streptococcus RepID=A3CP06_STRSV Length = 323 Score = 114 bits (284), Expect = 7e-24, Method: Compositional matrix adjust. Identities = 70/226 (30%), Positives = 115/226 (50%), Gaps = 16/226 (7%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 S +SVIIP+YNA D + C++SL+ Q++ LEII++NDGS DNS+ I K Y Y V Sbjct: 2 SKPAISVIIPVYNAQDGIKRCVDSLLNQSFKNLEIILLNDGSKDNSLNILKEYELKYSFV 61 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 R++ + N G +V RN+GI +A G+Y+ F+D DD V ET E +LD+ Sbjct: 62 RVIDKQNEGVAVTRNKGILLAEGEYIMFMDNDDFVDSDYIETFYQAIHEKNLDLVIGG-- 119 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 ++ + Q I + ++ + +W S+ W VYR + + NN++F Sbjct: 120 --YKRVNQDNQIIFSQDIQQS------EW------SKYIIMAPWAKVYRTEFLKTNNLEF 165 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQG 229 +DI++ + + Y +Y +N S+S ++G Sbjct: 166 FDYGIGEDIIFNLAAYKTTDKIGLLDYKGYNWYYNNQSISNTSQRG 211 >UniRef50_O06035 EpsG n=1 Tax=Lactococcus lactis subsp. cremoris RepID=O06035_LACLC Length = 316 Score = 113 bits (283), Expect = 8e-24, Method: Compositional matrix adjust. Identities = 75/238 (31%), Positives = 123/238 (51%), Gaps = 32/238 (13%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV-RL 65 KLS+IIP+YN C+ S++ QT+ +EII++NDGSTDNS +IA Y E +P+V + Sbjct: 3 KLSIIIPIYNVEKYLSKCLNSILEQTYKEIEIILVNDGSTDNSKDIAVSYCERFPNVFKY 62 Query: 66 LHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWC 125 + N G S ARN G+E +G +V F+D+DD + +YE +M +L+ + + +C+ W Sbjct: 63 FEKDNGGLSSARNFGLEKISGDFVGFLDSDDYIDNDLYE-IMINSLDSSIKIVECDFIWE 121 Query: 126 FRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIA 185 + E G++ +++ + D + G V W +Y + + K NIKF Sbjct: 122 Y-ENGKSVLD------KTSEYNSIKDLMVNG------RVVAWNKIYNVEWLEKINIKFKE 168 Query: 186 GLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKIT 243 GL ++D+ F F + H TS+S + N ++Y +H IT Sbjct: 169 GLLYEDL----NFFFKIVP-------------HLTSISEVSTVKNSFVHYVQHKGTIT 209 >UniRef50_B0ACE9 Putative uncharacterized protein n=2 Tax=Firmicutes RepID=B0ACE9_9CLOT Length = 335 Score = 113 bits (282), Expect = 1e-23, Method: Compositional matrix adjust. Identities = 85/349 (24%), Positives = 169/349 (48%), Gaps = 35/349 (10%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K+S+++P+YN C++S+I QT +EII+++DGS DN ++ YA+ ++++ Sbjct: 3 KVSIVVPIYNVEKYLEQCVDSIINQTLKEIEIILVDDGSPDNCPQMCDEYAKKDSRIKVV 62 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 H+ N G S ARN GIEVATG Y+ FVD+DD + MYE + +A+E+++D +D+ + Sbjct: 63 HKKNGGLSSARNAGIEVATGDYIGFVDSDDYIELDMYEKMYNIAIENNVDFVM--SDY-Y 119 Query: 127 RETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHV-------------VWMGVYRR 173 R + E D++++T + + + + + + VW +Y+R Sbjct: 120 RVSSE-------DKVKATLDMDKGIYDKNKIKKDIFPTLIMGEYINYGRLLAVWHCLYKR 172 Query: 174 DVIVKNNIKFIAGLHH-QDIVWTTEFMFNALRARYTEQS-LYKYYLHNTSVSRLHRQGNK 231 +++ KN++ F + + +D ++++ + Y + S LY Y + S+S+ ++ Sbjct: 173 ELLEKNDLTFDEEVKYSEDNLFSSIIGYKVNNFYYMKGSYLYNYRYNPNSISKTYKPDAW 232 Query: 232 NLNYQRHYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALR-VCHAVRKEPDILTRQRM 290 ++ Y+K+ L+ L N D + + I Y A + H + + + Sbjct: 233 SV-----YLKMNNKLKDLFLNVED-YDFRNQLYAHIIYYAFNCINHVGNSDKSFSYKYKE 286 Query: 291 IAEIFTSGMYKRLITNVRSVKVGYQALLWSFRLWQWRDKTRSHHRITRS 339 I I + K L ++ K G + +W + K ++ I + Sbjct: 287 IKRILNT---KELEDGFKNFKAGNISTKLKISIWMLKHKFIINYMIIKG 332 >UniRef50_C6LFI9 Glycosyl transferase, group 2 family n=1 Tax=Bryantella formatexigens DSM 14469 RepID=C6LFI9_9FIRM Length = 689 Score = 113 bits (282), Expect = 1e-23, Method: Compositional matrix adjust. Identities = 70/227 (30%), Positives = 116/227 (51%), Gaps = 7/227 (3%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M K+SVIIP+YN C++S+ QT +EII +NDGSTD S++I + YAE Sbjct: 1 MADRKKVSVIIPVYNVEKYLEKCLDSVERQTLPEIEIICVNDGSTDGSLQILQSYAEKDD 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 + ++++ N G S ARN G+ VA G+YV F+D+DD + E L +++LD + Sbjct: 61 RIIIINKENGGLSSARNAGLAVAEGEYVYFLDSDDWILAETLEVLYDECKKENLDTILFD 120 Query: 122 ADWCFRETGETWQSIPT-----DRLRSTG-VLTGPDWLRMGLSSRRWTHVVWMGVYRRDV 175 AD F ET E + P+ R G +TG + ++ + + RR Sbjct: 121 ADSIF-ETEEVERLHPSYRDYYHREDCFGNTVTGQRLFAEMMPREQYRSSACLQMNRRAF 179 Query: 176 IVKNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSV 222 + + + F G+ H+D ++T + A RA++ + Y+ + S+ Sbjct: 180 LEQYGLGFREGMIHEDELFTLQVSLRAERAKHLARPFYQRLVREESI 226 Score = 89.0 bits (219), Expect = 2e-16, Method: Compositional matrix adjust. Identities = 65/223 (29%), Positives = 106/223 (47%), Gaps = 12/223 (5%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 ++S+I+P+YN C+ SL Q + +EII ++D STD S+E+ + A + VR+L Sbjct: 376 EISIIMPVYNGEKYLWRCIRSLQKQMFRDIEIICVDDKSTDRSMELLQERARHDKRVRIL 435 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNA---D 123 Q + GA ARN G++ A GKY+ F+D DD M L A + D V A D Sbjct: 436 QQEHLGAGSARNLGMKEARGKYLLFLDCDDLFDKNMCRELYRAAEKHDAQVVLFGAQRMD 495 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRW---THVVWMGVYRRDVIVKNN 180 T + + L V +G + ++ R + ++ W ++RRD I++N Sbjct: 496 MMNNRTERMGWVLRSSELPKDAVFSGQE-----IADRLFQITSNCPWSKMFRRDFILENG 550 Query: 181 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVS 223 ++F + H D + M A R +Q ++ Y N S Sbjct: 551 LEFQSTRHCNDAYFVRMAMALAERMTMVDQ-IFTTYRFNMGAS 592 >UniRef50_Q9F0B5 Beta(1,4)galactosyltransferase EpsM n=1 Tax=Lactobacillus delbrueckii subsp. bulgaricus RepID=Q9F0B5_LACDE Length = 336 Score = 113 bits (282), Expect = 1e-23, Method: Compositional matrix adjust. Identities = 69/214 (32%), Positives = 113/214 (52%), Gaps = 10/214 (4%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S+I+P+YN D C+ SL QT+ +EII+++DGS D S +I A ++++H Sbjct: 6 ISIIVPVYNVEDYLEDCVHSLELQTYKNIEIILVDDGSPDRSGKICDKLALKDARIKVIH 65 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 + N G S ARN G+E + G+Y+AFVD+DD V P + L ++ + + VAQ + Sbjct: 66 KKNGGLSDARNEGLEHSNGEYIAFVDSDDIVEPNFIKVLYSLITQGNYQVAQVGTQF-VN 124 Query: 128 ETGETW-QSIPTD-----RLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNI 181 E+G+ Q I D +L VL + +R GL R+ W +Y+RD N I Sbjct: 125 ESGDPLNQKIVFDGDMGKKLNGNYVLNRDELIR-GLLLRKVHCAAWCNLYKRDFF--NEI 181 Query: 182 KFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKY 215 +F G+ ++DI+ + + + + LY Y Sbjct: 182 RFTKGILNEDILMWVDGVEKVDSLIISNECLYDY 215 >UniRef50_C9K1G9 Putative glycosyltransferase n=1 Tax=Klebsiella pneumoniae RepID=C9K1G9_KLEPN Length = 342 Score = 113 bits (282), Expect = 1e-23, Method: Compositional matrix adjust. Identities = 71/228 (31%), Positives = 123/228 (53%), Gaps = 17/228 (7%) Query: 8 LSVIIPLYNAGDDF-RTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 +SVI+P+YN +++ TC++S+ QT +EII++NDGST+N +E+ ++ + ++ Sbjct: 6 ISVILPIYNIKEEYLETCIQSVCRQTLDDIEIILVNDGSTNNCLEVCTKFSHLDNRIIVI 65 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALED-DLDVAQCNADWC 125 Q N+G SVARN G+++A G+++AFVD DD + P +L A + ++ + CN Sbjct: 66 DQPNSGVSVARNSGLDIARGEWIAFVDPDDWLEPDYLLSLYNNASSNTEIVICDCNVFSN 125 Query: 126 FRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRR---------WTHVVWMGVYRRDVI 176 + G + +P + S GVL + L L S+ V W +++ D I Sbjct: 126 GEKRGNNF--LPAN---SIGVLKNKNLLLGQLVSKVIAGYYPQLIGPGVPWGKLFKHDFI 180 Query: 177 VKNNIKFIAGL-HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVS 223 K N+KF+ G+ QD ++ + A + YT ++ Y Y + N S S Sbjct: 181 KKYNLKFVPGMVRMQDNIFCLYAFYYASQVFYTGENKYNYRMENNSAS 228 >UniRef50_A3F4D9 EpsM n=1 Tax=Lactococcus lactis subsp. cremoris RepID=A3F4D9_LACLC Length = 331 Score = 113 bits (282), Expect = 1e-23, Method: Compositional matrix adjust. Identities = 65/211 (30%), Positives = 114/211 (54%), Gaps = 5/211 (2%) Query: 6 NKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRL 65 N +S+I+P+YN+ R + SL+ QT+ +E+I+INDGSTD S E+ + + ++L Sbjct: 3 NLISIIVPVYNSEKYLRAAIHSLLNQTYQNIEVILINDGSTDGSQELISSFQKKDKRIKL 62 Query: 66 LHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWC 125 + N G S ARN GI+ A+G Y+ F+D DD + ++ + + + DV N C Sbjct: 63 YNTKNLGVSHARNYGIDRASGSYIMFLDPDDTYDKSYCLEMIGLINKFNADVVMSNYYIC 122 Query: 126 FRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIA 185 + + ++ D L G+L+ +R LS + VW ++R++VI NN+KF Sbjct: 123 --KGKNIYPNVNNDLLECEGLLSRDKTMRSILSDTGFKGFVWTRIFRKNVI--NNVKFNE 178 Query: 186 GLHH-QDIVWTTEFMFNALRARYTEQSLYKY 215 +++ +D+++ + NA YT + Y Y Sbjct: 179 SINYLEDMLFNISIVHNARIIAYTNKRHYFY 209 >UniRef50_B5CU57 Putative uncharacterized protein n=1 Tax=Bacteroides plebeius DSM 17135 RepID=B5CU57_9BACE Length = 327 Score = 112 bits (281), Expect = 1e-23, Method: Compositional matrix adjust. Identities = 64/184 (34%), Positives = 99/184 (53%), Gaps = 9/184 (4%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +SVI+P+Y C+ES++ QT+T LEII+++DGS D EI A ++++H Sbjct: 7 ISVIVPVYKVEPYLHRCVESILKQTYTNLEIILVDDGSPDKCGEICDELASTDKRIQVIH 66 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 Q N G S ARN G+ +ATG+Y+AFVD+DD ++P MY+ L ++ +A C A+ C Sbjct: 67 QKNKGLSGARNSGLNIATGEYIAFVDSDDCLHPEMYKRLYEDICLHNVKLAFCQANMCHG 126 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 E + + PT V+ LS W W +Y R ++ NI+F G Sbjct: 127 EVIKVTCNQPTVCKEKNYVMYR------SLSESIWW-AAWTKLYHRSLL--ENIRFPEGR 177 Query: 188 HHQD 191 ++D Sbjct: 178 TNED 181 >UniRef50_B3H2A3 Putative glycosyltransferase n=1 Tax=Actinobacillus pleuropneumoniae serovar 7 str. AP76 RepID=B3H2A3_ACTP7 Length = 331 Score = 112 bits (281), Expect = 1e-23, Method: Compositional matrix adjust. Identities = 77/256 (30%), Positives = 132/256 (51%), Gaps = 22/256 (8%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP----H 62 K+S+++P+YN C SL QT+ +E I IND + DNS+EI K + YP + Sbjct: 4 KVSILVPIYNTSRFIEKCAISLFEQTFQDIEYIFINDCTPDNSIEILKNVIDLYPNRKKY 63 Query: 63 VRLL-HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 V++L HQ N G + +RN GI A G+Y+ VD+DD + E L A++D+ D+ C+ Sbjct: 64 VKILEHQENKGLASSRNTGIANAKGEYILHVDSDDYIDLYAVELLYNKAIQDNADIVMCD 123 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNI 181 A W W +++ G + D L M LS + VW +++R++ + N I Sbjct: 124 ALW-------EWNKTSKLVIQNIGN-SNQDCLEMILSGKS-PPCVWNKLFKRELYINNRI 174 Query: 182 KFIAGLHH-QDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYI 240 K + G++ +D + T + ++ + + ++ LY Y NT+ KN++ + Y Sbjct: 175 KALDGVNFGEDYLVTPQLIYFSNKISKVDKPLYHYIQFNTNAY------TKNISKKSVY- 227 Query: 241 KITRLLEKLNRNYADK 256 I +L +L R + +K Sbjct: 228 DIISVLNELTRFFKEK 243 >UniRef50_B1C0P6 Putative uncharacterized protein n=1 Tax=Clostridium spiroforme DSM 1552 RepID=B1C0P6_9FIRM Length = 319 Score = 112 bits (281), Expect = 1e-23, Method: Compositional matrix adjust. Identities = 72/217 (33%), Positives = 115/217 (52%), Gaps = 12/217 (5%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S+I+P+YN R C++S+I QT+ E I+INDGSTD S EI A+ ++H Sbjct: 6 ISIIVPVYNVQKHLRRCLDSIIQQTYENFECILINDGSTDGSGEICDEIAKQDKRFIVIH 65 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 Q NAG SVARN + +A G YV+FVD+DD + P +L+ + D D+++C+ FR Sbjct: 66 QQNAGLSVARNNALLMAKGDYVSFVDSDDYILPQYLNSLLNAIISTDSDISKCDY---FR 122 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 G+ + T V+ + R L + + +W +Y++ + N+I AG Sbjct: 123 --GKLTKDKDT---YDVSVVNSKKFTREVLLDKVGSQ-LWQYMYKKSLW--NDIVSPAGR 174 Query: 188 HHQDIVWTTEFMFNALRARYTEQSLYKYYLH-NTSVS 223 + QD++ + A + +Q LY YY+ N S S Sbjct: 175 YAQDMMILHKITNRAKKIAVIDQKLYFYYIDRNDSTS 211 >UniRef50_C8NFZ4 Glycosyl transferase CpsJ(V) n=1 Tax=Granulicatella adiacens ATCC 49175 RepID=C8NFZ4_9LACT Length = 328 Score = 112 bits (281), Expect = 2e-23, Method: Compositional matrix adjust. Identities = 82/306 (26%), Positives = 151/306 (49%), Gaps = 26/306 (8%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K+S+I+P+YN C+ESL+ Q++ +EI++INDGSTDNS ++ A+ + + Sbjct: 14 KISIIVPVYNVEQYLERCVESLMNQSYKNIEILLINDGSTDNSGKLCDEIAKRDSRITVY 73 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 H+ N G S ARN G++ AT +V FVD+DD V MYE L++ L+ D +++ C + Sbjct: 74 HKENGGLSDARNYGVDKATADFVGFVDSDDYVDEDMYEVLLSNLLKYDAEISFCRLYDVY 133 Query: 127 RETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAG 186 + Q + + ++T + ++M L ++ ++ +Y++ + N I+F G Sbjct: 134 ND-----QVLKDESKNEPYLMTSEEAIKMVLEAKIFSVTAVNKLYKKSLF--NQIRFEKG 186 Query: 187 LHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRLL 246 +D + + + TE ++KYY +HR+ ++ Q+ K ++ Sbjct: 187 KIAEDAFLMVDLLSRCQKIAATE--VHKYYY-------MHREN--SITTQKFTPKFLNVI 235 Query: 247 EKLNRNYADKIMIYPEFHQQITYEA-LRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLIT 305 E +N I E + + Y+A R+C A D L + + S + +LI Sbjct: 236 EAYEKN----AKIVAEKYPNLQYQADTRICWAYFYVLDRLLKDEAYKD---SALEAQLIQ 288 Query: 306 NVRSVK 311 N++ K Sbjct: 289 NLKKYK 294 >UniRef50_B7CD61 Putative uncharacterized protein n=1 Tax=Eubacterium biforme DSM 3989 RepID=B7CD61_9FIRM Length = 310 Score = 112 bits (280), Expect = 2e-23, Method: Compositional matrix adjust. Identities = 56/136 (41%), Positives = 82/136 (60%), Gaps = 5/136 (3%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K+S++IP+YN C+ES++ QT+ +EIIIINDG+TD+S++IA YA Y + + Sbjct: 3 KVSILIPVYNVEKYLSKCLESILLQTYKNIEIIIINDGTTDSSLDIANLYASKYDFIHVY 62 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 NAG S RNR +E ATG Y FVD+DD + P M E ++ A++ + D+ C +C Sbjct: 63 SYENAGISTTRNRALEKATGDYYLFVDSDDYLDPKMIEKMVDRAIQTNSDIVTCG--YCI 120 Query: 127 RETGETWQSIPTDRLR 142 G SIP +R Sbjct: 121 EYKG---LSIPVSPMR 133 >UniRef50_O87182 CpsIaI n=15 Tax=Streptococcus agalactiae RepID=O87182_STRAG Length = 333 Score = 112 bits (280), Expect = 2e-23, Method: Compositional matrix adjust. Identities = 71/251 (28%), Positives = 134/251 (53%), Gaps = 12/251 (4%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S+IIP+YN+ + C++S++ QT +E+I+I+DGSTDNS EI ++ + + H Sbjct: 7 VSIIIPIYNSEAYLKECVQSVLQQTHPLIEVILIDDGSTDNSGEICDNLSQEDNRILVFH 66 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 + N G S ARN G++ +TG+++ FVD+DD V P M E ++ + ++ D+A+ + D Sbjct: 67 KKNGGVSSARNLGLDKSTGEFITFVDSDDFVAPNMIEIMLKNLITENADIAEVDFD---- 122 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 R V + L+ LS R ++V +Y++ +I N++F L Sbjct: 123 -ISNERDYRKKKRRNFYKVFKNNNSLKEFLSGNRVENIVCTKLYKKSII--GNLRFDENL 179 Query: 188 H-HQDIVWTTEFMFNALRARY-TEQSLYKYYLHNTSVSRLHRQGNKN-LNYQRHYIKITR 244 +D+++ + + R T SLY Y + TS ++++ N+N L++ + +++ Sbjct: 180 KIGEDLLFNCKLLCQEHRIVVDTTSSLYTYRIVKTSA--MNQKFNENSLDFITIFNEVSS 237 Query: 245 LLEKLNRNYAD 255 L+ NY + Sbjct: 238 LVPAKLANYVE 248 >UniRef50_Q03KL3 Glycosyltransferase involved in cell wall biogenesis n=1 Tax=Streptococcus thermophilus LMD-9 RepID=Q03KL3_STRTD Length = 326 Score = 112 bits (280), Expect = 2e-23, Method: Compositional matrix adjust. Identities = 66/225 (29%), Positives = 116/225 (51%), Gaps = 6/225 (2%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 MN + ++S+++P+YN D C++S++ Q+ E+II++DGSTD S + Y+E+ Sbjct: 1 MNDSVEISIVLPVYNVIDYLNECIKSILRQSNKNFELIIVDDGSTDGSQILCDRYSESDS 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 ++++HQ NAG S ARN G+ A GKY+ F+D+DD V E L + +E D++ C+ Sbjct: 61 RIKVIHQKNAGLSAARNTGLNHAKGKYITFIDSDDFVGEKYIEVLYKIIIESCADISMCD 120 Query: 122 ADWCFRET---GETWQS-IPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIV 177 ++T T+Q P D+ G + + SS V W +Y+ ++ Sbjct: 121 YMEVAQDTQISDVTFQEKYPIDKYYYNGKEAIANVYKN--SSHGMEFVSWAKMYKLELFK 178 Query: 178 KNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSV 222 N I F H+D T + + + + + LY Y + + S+ Sbjct: 179 SNGILFPEEKLHEDAFTTYKLFYASNKIAFVNAPLYFYRIRSGSI 223 >UniRef50_B0ABV6 Putative uncharacterized protein n=1 Tax=Clostridium bartlettii DSM 16795 RepID=B0ABV6_9CLOT Length = 329 Score = 112 bits (280), Expect = 2e-23, Method: Compositional matrix adjust. Identities = 72/247 (29%), Positives = 126/247 (51%), Gaps = 13/247 (5%) Query: 2 MNSTN-KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 MN+ N K+SVI+P+YN R C++S++ Q +T+ E+I +NDGSTDNS+EI + Y + Sbjct: 1 MNTINPKISVIVPVYNTSTYLRRCLDSILEQDFTSYEVICVNDGSTDNSLEILREYEKKS 60 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 +++++Q N+G + RN ++ A G Y+AF+D+DD V L A++ D+ C Sbjct: 61 EKIKVINQVNSGVAKTRNTALKHAKGDYLAFLDSDDFVRENYLSRLYDAAIDTRSDIVIC 120 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTH-VVWMGVYRRDVIVKN 179 N +R + + P GV D L+ GL H +W +++R++ Sbjct: 121 NF---YRYYEQINLAKPVFYKFRRGVFNKYDILK-GLIPDNLIHSYLWNKLWKREIF--E 174 Query: 180 NIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHY 239 + + +D+ ++ ++ A + +LY Y + TS+ R N L Q Y Sbjct: 175 DFNSFPDMKFEDLAIMSQLIYKADKIAVINDALYYYRIRKTSIVR-----NICLQTQNDY 229 Query: 240 IKITRLL 246 +K L+ Sbjct: 230 MKAYALI 236 >UniRef50_B7GUL7 Glycosyl transferase, family 2 n=1 Tax=Bifidobacterium longum subsp. infantis ATCC 15697 RepID=B7GUL7_BIFLI Length = 349 Score = 112 bits (279), Expect = 2e-23, Method: Compositional matrix adjust. Identities = 73/246 (29%), Positives = 125/246 (50%), Gaps = 19/246 (7%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 ++V++P+YN TC++S++ QT+ L+II+++DGSTD S I +++A + ++H Sbjct: 17 ITVVLPIYNVAKWLDTCIQSIVKQTYRNLQIILVDDGSTDESPTICEHWAAKDSRINVVH 76 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 Q NAG S ARN G+ + G+Y+ FVD+DD V + ++ A + D+ CN R Sbjct: 77 QRNAGLSAARNTGLRLRKGEYICFVDSDDYVEHDYVKRMLDTAHAEQADMVICNV----R 132 Query: 128 ETGETWQSIP--TDRLRSTGVLTGPDWLRMGLSSRRWTHVV-WMGVYRRDVIVKNNIKFI 184 + E ++ D T LT ++ + S W H+V W +YR ++ +N+ + Sbjct: 133 KEDENGAALAEEADPSFETKTLTSRQYMIYAMQS-SWKHIVAWNKLYRSEIW--DNLTYP 189 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQ-RHYIKIT 243 G H+D + + R + LY Y H+ S+ N NY R+ KI Sbjct: 190 VGKIHEDEFVFAQLVVRCHRVACIDDVLYHYVQHDGSI--------MNENYSIRNLDKIE 241 Query: 244 RLLEKL 249 L ++L Sbjct: 242 ALCQRL 247 >UniRef50_B7HXA1 N-acetylglucosaminyltransferase n=2 Tax=Bacillus cereus RepID=B7HXA1_BACC7 Length = 353 Score = 112 bits (279), Expect = 2e-23, Method: Compositional matrix adjust. Identities = 66/219 (30%), Positives = 116/219 (52%), Gaps = 12/219 (5%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S+I+P+YNA C++SL+ QT+ +EII+INDGSTDNS +I YA +R++H Sbjct: 6 VSIIVPVYNAEKYVHKCIDSLLQQTYKNIEIILINDGSTDNSGKICDEYANESSKIRVIH 65 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 Q N+G S RN GI+VA G Y+ FVD+DD V P M E ++ A+ + + C Sbjct: 66 QKNSGPSATRNVGIDVANGAYIQFVDSDDYVEPNMTEEMLK-AMNNKFQLVICGY----- 119 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDW----LRMGLSSRR-WTHVVWMGVYRRDVIVKNNIK 182 ++ + + + G D+ + G R + + +W +Y +I +N+++ Sbjct: 120 KSLSLFNKDINIKEHKCHISGGYDYSSFMMEFGYLFRDVYINPLWNKLYNTALIKENHLR 179 Query: 183 FIAGLH-HQDIVWTTEFMFNALRARYTEQSLYKYYLHNT 220 F ++ +D+++ E++ + LY Y + N+ Sbjct: 180 FKDNVYMGEDLLFNLEYIKKCNYINVINKYLYNYAVINS 218 >UniRef50_A5ZUY3 Putative uncharacterized protein n=1 Tax=Ruminococcus obeum ATCC 29174 RepID=A5ZUY3_9FIRM Length = 367 Score = 112 bits (279), Expect = 2e-23, Method: Compositional matrix adjust. Identities = 78/245 (31%), Positives = 123/245 (50%), Gaps = 17/245 (6%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K+S+IIP YN C+ES+ QT +EII +NDGSTD+S+EI K YAEN P ++ Sbjct: 3 KVSIIIPTYNVEMFLDECLESIQRQTLQDIEIICVNDGSTDHSLEIIKKYAENDPRFVIV 62 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 + N G A N G++ ATG+Y+ V+ DD V MYE L +A E+DLD+ + + Sbjct: 63 DKENGGYGKAMNVGLDKATGEYIGIVEPDDYVPLAMYEDLYKIAKENDLDMVKADFYRFT 122 Query: 127 RETGETWQSIPTDRLRSTGVLTG--------PDWLRMGLSSRRWTHVVWMGVYRRDVIVK 178 R ++ + L TG G P+ LR +++ W G+Y+R+ + + Sbjct: 123 RNPENGNMNLVYNHLDLTGKWYGKLFDPSEEPETLRFIMNT-------WSGIYKREFLEQ 175 Query: 179 NNIKF--IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQ 236 ++I+ G QD + + A RA ++ Y N + S +R+ N + Sbjct: 176 HHIRHNETPGASFQDNGFYFQTFVYAKRAMIVDKPYYMNRRDNPNSSVNNREKVYCCNVE 235 Query: 237 RHYIK 241 YI+ Sbjct: 236 YDYIR 240 >UniRef50_C2M8C3 Glycosyl transferase, family 2 n=1 Tax=Capnocytophaga gingivalis ATCC 33624 RepID=C2M8C3_CAPGI Length = 321 Score = 112 bits (279), Expect = 2e-23, Method: Compositional matrix adjust. Identities = 64/229 (27%), Positives = 118/229 (51%), Gaps = 10/229 (4%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S+I+P+YN C++SL+ QT+ +EI+++NDGS D + EI Y E YP +++ + Sbjct: 6 ISIIVPVYNGETYLARCLDSLLDQTYAPIEILVVNDGSQDRTAEILSQY-EQYPQIKVFY 64 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 Q N G S ARN G++ G +V FVD+DD V PT E L ++++ D++ C + Sbjct: 65 QENKGLSGARNLGLKHFEGDFVCFVDSDDYVTPTYCEELYKALIDNNADISVCQ----YT 120 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 + IP ++ + + + + L + L + W +++++++ N F G Sbjct: 121 LIDTKGKEIPQEKDKKIQLFSHIEALALLLEDGQLKSYAWGKLFKKELL--ENFSFPMGR 178 Query: 188 HHQDIVWTTEFMFN-ALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNY 235 +D + F+F+ A + LY Y + S+S + G K ++ Sbjct: 179 IFEDYA-SMFFLFDKATKVVMINNCLYNYIQYPNSLSN-KKGGEKFFHF 225 >UniRef50_A1YVC6 Glycosyltransferase n=1 Tax=Lactobacillus johnsonii RepID=A1YVC6_LACJO Length = 348 Score = 112 bits (279), Expect = 2e-23, Method: Compositional matrix adjust. Identities = 72/225 (32%), Positives = 115/225 (51%), Gaps = 17/225 (7%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M +K+S+I+P+YNA C++S+ITQT+ ++II++NDGS DNS EI + + P Sbjct: 1 MTQNSKVSIILPIYNAEKYLEECVKSVITQTYQNIQIILVNDGSKDNSWEICQNLKKKDP 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 + + Q N+G SVARN G+ +A G+++ FVD DD + + E L+ + + D+D+ C Sbjct: 61 RIIAITQKNSGVSVARNAGLNIADGEWIMFVDPDDVLSQDIVEKLL-LKTDSDVDIVACT 119 Query: 122 ADWCFRETGETW-QSIPTDRLRSTGVLTGPDWLRMGLSSRRWTH---------VVWMGVY 171 + F E + DR ++ L + L + R+ V W +Y Sbjct: 120 C-YGFNENEKKLAHFFANDR----AFISDKTDLYLQLLNSRYGQSGSLITAIGVPWGKIY 174 Query: 172 RRDVIVKNNIKFIAGLHH-QDIVWTTEFMFNALRARYTEQSLYKY 215 RR I K +KF L QD V+ A + Y ++ LY Y Sbjct: 175 RRSFIDKYELKFDPNLRRMQDNVFNMYAFDYARKIYYLDKPLYFY 219 >UniRef50_C8WM64 Glycosyl transferase family 2 n=1 Tax=Eggerthella lenta DSM 2243 RepID=C8WM64_EGGLE Length = 333 Score = 112 bits (279), Expect = 2e-23, Method: Compositional matrix adjust. Identities = 82/272 (30%), Positives = 122/272 (44%), Gaps = 14/272 (5%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +SVIIP+YN C+ES+I QT++ +EI+++NDGSTD S YA + V ++H Sbjct: 5 VSVIIPVYNVESYVSECIESVIAQTYSNIEIVVVNDGSTDGSGFSCDQYACSDSRVVVVH 64 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 + N G S ARN GI V G +VAFVD DD V P ETLM D ++A F Sbjct: 65 KENEGLSAARNAGIAVCRGDFVAFVDGDDFVSPVFIETLMHAIEVCDCEIAAIPCGTAFE 124 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 + + V+ ++ L R T V W RR + + F GL Sbjct: 125 DGSSCELVAKAAFIPDARVMDSYTVQKLMLYQRLDTGVPWRLYARR---ILGDAPFAVGL 181 Query: 188 HHQDIVWTTEFMFNALRARYTE-QSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRLL 246 +++D+ +F+ + R + ++LY Y RL + G + Y IK + Sbjct: 182 YYEDLASVYKFIHDVDRVALVDCRALYAY--------RLRKSGIISQAYSP--IKALSAI 231 Query: 247 EKLNRNYADKIMIYPEFHQQITYEALRVCHAV 278 E R +D YP+ +C V Sbjct: 232 EVSRRLSSDMQEWYPDLAVASASRCFSLCRMV 263 >UniRef50_B2UQN8 Glycosyl transferase family 2 n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UQN8_AKKM8 Length = 376 Score = 112 bits (279), Expect = 3e-23, Method: Compositional matrix adjust. Identities = 84/258 (32%), Positives = 125/258 (48%), Gaps = 22/258 (8%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +SVI+P+YN C++SL QT LEII ++DGSTD S I + YA RL+ Sbjct: 8 ISVIVPIYNMEPLLPRCLDSLAAQTLRDLEIICVDDGSTDGSGGIVRKYASGDSRFRLIT 67 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN-ADWCF 126 Q N+G + ARN GI A Y+ F D DD V P MYE L +A E D+ QC+ + + Sbjct: 68 QENSGRAEARNAGIRAAAAPYLGFADPDDYVEPDMYERLYRLAEESGADMVQCSYSPFLP 127 Query: 127 RETGETWQSIPTDRL------RSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRR------- 173 E+GE+ + + ++L GV T + R T VVW ++RR Sbjct: 128 AESGES-RGMAEEKLLHIENTACDGVFTEKGEIFRLFLEDRITGVVWSKLFRRILPGCSA 186 Query: 174 DVIVKNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNT-SVSRLHRQGN-K 231 + V+ F +G +D ++ + + T + LY Y L SVS +R+ + Sbjct: 187 PLEVRLPSSFTSG---EDTLYVSRAIARCRSVALTSEKLYHYGLGGPQSVSSRNRKAETR 243 Query: 232 NLNYQRHYIKITRLLEKL 249 +Y + +TR EKL Sbjct: 244 PASYYAVFEMLTR--EKL 259 >UniRef50_C6IJ34 Glycoside transferase family 2 n=2 Tax=Bacteroides RepID=C6IJ34_9BACE Length = 327 Score = 111 bits (278), Expect = 3e-23, Method: Compositional matrix adjust. Identities = 87/316 (27%), Positives = 147/316 (46%), Gaps = 39/316 (12%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M + +S+I+P+YN+ R C++S++ Q++T E+++INDGS D+S I YA++ Sbjct: 1 MRNETLVSIIVPVYNSAQYLRKCIDSILAQSYTEFELLLINDGSEDHSGLICDDYAQSDN 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 +++ HQ NAG S ARN G+E TG++ F+D+DD + E LM A D QC+ Sbjct: 61 RIQVFHQENAGVSAARNLGLEKHTGEHFLFIDSDDYIQERYLEELMKYA---SCDFVQCS 117 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTH--VVWMGVYRRDVIVKN 179 C G D L G G D ++ L + V + +Y+ + KN Sbjct: 118 C--CSEPVG-------NDYLFVDGNFEGSDEIKQCLLKYIYPEFTVPFGRLYKSSIQRKN 168 Query: 180 NIKFIAGLHH-QDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRH 238 + F L+ +D +W ++++ R + Y Y+H+ K+++Y+ Sbjct: 169 KLFFDTYLYSGEDTLWVSQYLLCVHSLRVSSYIGY-IYVHHVG----EHLSQKSISYEH- 222 Query: 239 YIKITRLLEKLNRNYAD--------------KIMIYPEFHQQITYEALRVCHAVRKEPDI 284 + L KL +Y+D ++IY FH+ I Y A R +RKE + Sbjct: 223 ---LEYTLHKLLISYSDLEKRYDFDLTGVRYSVIIYF-FHRYIVYIANRNYTEIRKELEK 278 Query: 285 LTRQRMIAEIFTSGMY 300 +I +F Y Sbjct: 279 SCANPLIKGVFYDKKY 294 >UniRef50_C0BRK2 Putative uncharacterized protein n=1 Tax=Bifidobacterium pseudocatenulatum DSM 20438 RepID=C0BRK2_9BIFI Length = 350 Score = 111 bits (278), Expect = 3e-23, Method: Compositional matrix adjust. Identities = 78/259 (30%), Positives = 128/259 (49%), Gaps = 29/259 (11%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K+SV++P+Y + C++S+ QT +EII+++DGS D IA YA ++++ Sbjct: 5 KVSVVVPIYKVEPFLQECVDSIQVQTLEDIEIILVDDGSPDRCGAIADEYARKDSRIKVV 64 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW-- 124 HQ N+G ARN GI ATG+YV FVD+DD + PTMY +L A +QC+AD Sbjct: 65 HQKNSGLGPARNTGIANATGEYVGFVDSDDWIAPTMYSSLYDAA-------SQCDADIVV 117 Query: 125 ---CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHV------------VWMG 169 C G +++ L + +L G D + + + +R + H+ VW Sbjct: 118 GGHCDMVNGSV-RAVKAHPL-AGQILRGSDQI-LPVRNRLFGHLPEDTEVESFPMRVWTA 174 Query: 170 VYRRDVIVKNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQG 229 +YR D + N +KF A L +D ++ A +T + Y+Y + + S S + Sbjct: 175 IYRNDFLKVNGLKFQAIL-SEDTIFNLSAYRCANVVAFTGDTDYRYRMDDHS-SIMRSFS 232 Query: 230 NKNLNYQRHYIKITRLLEK 248 +K L ++ + L K Sbjct: 233 SKKLTQYEQFVDTVQFLAK 251 >UniRef50_B1C3W3 Putative uncharacterized protein n=1 Tax=Clostridium spiroforme DSM 1552 RepID=B1C3W3_9FIRM Length = 323 Score = 111 bits (278), Expect = 3e-23, Method: Compositional matrix adjust. Identities = 61/223 (27%), Positives = 112/223 (50%), Gaps = 1/223 (0%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 S+I+P+YN + C++S++ QT+ EII+++DGS D S I A Y ++++H Sbjct: 3 FSIILPIYNVENYLCECVDSILNQTYKDYEIILVDDGSKDKSPHICDELAHKYDCIKVIH 62 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 + N G S ARN G + A GKY+ ++D+DD + + + + + +D+ F Sbjct: 63 KKNGGLSDARNAGTKEAIGKYIVYIDSDDYILDKEFLSKLAQKTKTGVDLIFYKYQKYFN 122 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 ET + T L + L + + + + + + W+ +R +IV+NNI F GL Sbjct: 123 ETKKLEDCTYTYSLAMSETLYA-NKIEALVKADAFYGMAWIKAVKRKLIVENNINFEVGL 181 Query: 188 HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGN 230 +D+ W + +FNA + ++ + Y S++ H N Sbjct: 182 LGEDMDWNYQVIFNASTIEFIDEPMIAYRQREGSITSTHTLKN 224 >UniRef50_D0ANJ0 Putative uncharacterized protein n=1 Tax=Enterococcus faecium C68 RepID=D0ANJ0_ENTFC Length = 351 Score = 111 bits (278), Expect = 3e-23, Method: Compositional matrix adjust. Identities = 52/112 (46%), Positives = 77/112 (68%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K+SVI+P+YN R C++SL++QT +EI++++DGS D S EIA Y + Y +V+ + Sbjct: 4 KVSVIVPIYNVEKYLRKCVDSLLSQTLKEIEIVLVDDGSPDASGEIADEYQKKYSNVKTI 63 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVA 118 H+ N G ARN GIE ATG+YVAF+D+DD V MYE L +A +++ D+ Sbjct: 64 HRENGGLGPARNTGIENATGEYVAFLDSDDWVESDMYEKLYLVASKENADIV 115 >UniRef50_C9BUH6 Glycosyltransferase n=1 Tax=Enterococcus faecium 1,231,408 RepID=C9BUH6_ENTFC Length = 353 Score = 111 bits (278), Expect = 3e-23, Method: Compositional matrix adjust. Identities = 72/237 (30%), Positives = 125/237 (52%), Gaps = 25/237 (10%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 MNS++ +SVIIP+YN +C+ES+ QT+ L+III++DGS D+S +I + YA Sbjct: 1 MNSSDLISVIIPVYNVEKYVGSCLESVCNQTYKNLDIIIVDDGSKDSSNKICEDYALKDS 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 +R++H+ NAG +ARN G+E GKYVAF+D+DD V E ++ + + N Sbjct: 61 RIRIIHKNNAGLGLARNTGLEHVKGKYVAFIDSDDFVAANYIEAMLN-------GIKRSN 113 Query: 122 ADWCFRETGETWQ-----SIPTDRLRSTGVLTGP--DWLRMGLSSRRWTH--------VV 166 AD C+ E + S P R ST + + +G+ ++ V Sbjct: 114 ADTCYAGYYEYYDDNNIISKPA-RYDSTIFTNNAIIEKVLLGMIGTEPSYKVDFILPMSV 172 Query: 167 WMGVYRRDVIVKNNIKFIAGLHH--QDIVWTTEFMFNALRARYTEQSLYKYYLHNTS 221 W +Y ++I ++NIKF + + +DI++ ++ ++ + + +Y Y +N + Sbjct: 173 WHALYSSEIIQQHNIKFPSERDYISEDIIFHISYLQHSKITSFIKDCIYYYRCNNNN 229 >UniRef50_UPI0001C4194D glycosyl transferase GT2 family n=1 Tax=Methanobrevibacter ruminantium M1 RepID=UPI0001C4194D Length = 817 Score = 111 bits (278), Expect = 3e-23, Method: Compositional matrix adjust. Identities = 68/216 (31%), Positives = 108/216 (50%), Gaps = 14/216 (6%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K+SVIIP+YN C++S+I QT +EII I+DGS+DNS+EI Y ++++ Sbjct: 365 KVSVIIPIYNNEKFLSKCLDSVINQTLNEIEIICIDDGSSDNSIEILNQYVLKDSRLKII 424 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 Q N GA+ ARN G+++A G+Y+AF+D+DD + +E L ++ D+ N+ Sbjct: 425 SQENLGAATARNNGLKIAKGEYIAFLDSDDWLELNAFEKLYENITTNNSDLVLFNS-IEH 483 Query: 127 RETGETWQ-------SIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKN 179 +E + SIP + D + G +W +YR + +N Sbjct: 484 KENANLKERIHIKNDSIPDYNYYTFNYNYKKDLVMNGYLD------IWSKMYRTSFLKEN 537 Query: 180 NIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKY 215 NI+F DI + + M NA + Y + LY Y Sbjct: 538 NIQFSNHQIFNDIQFHIKTMLNAKKISYCPEFLYNY 573 Score = 77.8 bits (190), Expect = 5e-13, Method: Compositional matrix adjust. Identities = 56/220 (25%), Positives = 107/220 (48%), Gaps = 26/220 (11%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTW--TALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 K+S+IIP+YN + + S+I+Q+ LE+I+++D STDNS I K Y Y + + Sbjct: 4 KISIIIPVYNVENYIEKSLNSIISQSIGIENLEVILVDDNSTDNSANIIKKYVSKYDNFK 63 Query: 65 LLHQANAGASVA---RNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 ++ + G+ RN G+ AT +Y+ ++D+DD + T E L + ++ D+ C Sbjct: 64 GIY-CDIGSGFCGRPRNIGLSYATSEYIMYLDSDDWLEETACEVLYNTIINENADIV-CG 121 Query: 122 ADWCFRETGE------TWQSIPTD-------RLRSTG-VLTGPDWLRMGLSSRRWTHV-- 165 + G W + TD R+++T ++ P++ + + ++ Sbjct: 122 SQTRLDNEGNRKFYYHLWVTTLTDPNEDYNTRMKTTQEIIDDPNFKLVVTDLDKNPNILG 181 Query: 166 ---VWMGVYRRDVIVKNNIKFIAGLHHQDIVWTTEFMFNA 202 VW ++++D+I +N + F + QD V+ F A Sbjct: 182 HANVWGKIFKKDLITENELSFPEDIVAQDSVFLLNSFFVA 221 >UniRef50_Q5LIR2 Putative glycosyltransferase n=6 Tax=Bacteroides RepID=Q5LIR2_BACFN Length = 356 Score = 111 bits (278), Expect = 4e-23, Method: Compositional matrix adjust. Identities = 65/228 (28%), Positives = 116/228 (50%), Gaps = 7/228 (3%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S+I+P+YN + C++SLI QT+ +EII+++DGSTD+S I +AE ++++H Sbjct: 10 ISIIVPIYNIAEYASECIQSLINQTYKNIEIILVDDGSTDHSPAICDEFAEQDERIKVIH 69 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 + N G S ARN G++VATG+Y+ FVD DD V MYETL + E D++ C Sbjct: 70 KRNGGLSDARNAGLDVATGEYIGFVDGDDWVDEDMYETLYHLIYEHQADISICT-----H 124 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 T ++ + + T + + + + + + +W +++R++ + ++F G Sbjct: 125 YTELPNRTKVKYKSKKTKIFSSQKAIATLIEDKIIQNYIWEKLFKRELFTE--LRFPVGW 182 Query: 188 HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNY 235 +DI + A + + Y Y S++ R K Y Sbjct: 183 SFEDIALCYKVFHKARKIVLLQTPKYHYRTRPGSITNSTRNPLKEFQY 230 >UniRef50_A8U9H0 Putative capsular polysaccharide glycosyltransferase n=1 Tax=Carnobacterium sp. AT7 RepID=A8U9H0_9LACT Length = 310 Score = 111 bits (278), Expect = 4e-23, Method: Compositional matrix adjust. Identities = 63/204 (30%), Positives = 111/204 (54%), Gaps = 5/204 (2%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M+ + +SVI+P+YN C++S+I Q + L+III++DGSTD S + K + + Sbjct: 1 MVQNQPLVSVIVPVYNVELYIEECLKSIIKQNYQMLQIIIVDDGSTDTSNQKVKPFLLD- 59 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 V+L+ Q N G S ARN G++ A GKYV FVD+DD + T + L+ + E +D+ + Sbjct: 60 SRVQLIQQENKGLSSARNSGLKAAIGKYVLFVDSDDHIELTAIKELVNLMEEKQIDLIRF 119 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWT--HVVWMGVYRRDVIVK 178 N + + D S + G +++ + R T V++ + +R+++VK Sbjct: 120 NGKAFLDGLNKPIKQNNYD--FSHRLQEGKKYIQDRFKANRQTLASPVYLYMVKREILVK 177 Query: 179 NNIKFIAGLHHQDIVWTTEFMFNA 202 N + F G+ H+D ++T + N+ Sbjct: 178 NALSFYEGILHEDELFTIQVFLNS 201 >UniRef50_C2M970 Glycosyl transferase family 2 n=1 Tax=Porphyromonas uenonis 60-3 RepID=C2M970_9PORP Length = 347 Score = 111 bits (278), Expect = 4e-23, Method: Compositional matrix adjust. Identities = 76/250 (30%), Positives = 119/250 (47%), Gaps = 23/250 (9%) Query: 3 NSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPH 62 + T K+SVIIP+YN R C++S++ QT E I ++DGS D S I YAE Sbjct: 10 DHTPKVSVIIPVYNVEQYLRRCLDSILGQTMPDWEAICVDDGSPDGSASILAEYAEQDSR 69 Query: 63 VRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNA 122 R+L + N G S ARN G +VA G YV +VD+DD ++P +E +T+A D D+ Sbjct: 70 FRILTKENGGLSSARNAGTQVARGLYVNYVDSDDFIHPQTFELALTLAERDGSDIVSWVP 129 Query: 123 DWCFR-------ETGE------TWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHV---- 165 D ++ + G+ W L + + +R + R + V Sbjct: 130 DMVYKRRLLLRAKLGQPYDDVVPWSMSQRYDLSKVHAVCTTEVVRHATNHRWFDGVSDPI 189 Query: 166 ----VWMGVYRRDVIVKNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTS 221 W + RR++ V+ F+ GL +D W ++ M ++ R T LY YY + +S Sbjct: 190 RYFYAWRHLVRRELAVET--PFVEGLKFEDFPWWSDLMLHSPRVTITHLPLYYYYYNRSS 247 Query: 222 VSRLHRQGNK 231 +S + NK Sbjct: 248 ISSASSECNK 257 >UniRef50_B9YAB2 Putative uncharacterized protein n=1 Tax=Holdemania filiformis DSM 12042 RepID=B9YAB2_9FIRM Length = 320 Score = 111 bits (277), Expect = 5e-23, Method: Compositional matrix adjust. Identities = 61/226 (26%), Positives = 108/226 (47%), Gaps = 14/226 (6%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 LSVI+P+YN C++SL+ QT +EII+INDG+ DNS +I YA YP +R Sbjct: 5 LSVIVPVYNVEKYLPRCLDSLVNQTLKNMEIIVINDGTKDNSQQIIDDYASRYPQIRPFK 64 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 + N G + RN G+ G+Y F+D+DD M+E ++ A +++ DV N +W + Sbjct: 65 KPNGGIAETRNFGLAQVQGEYFGFLDSDDFTELDMFEKMVAAAEKENADVVVSNFNWVYG 124 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 E RL G + + L + +W +YR D + + ++F G Sbjct: 125 E---------QKRLEKEGPYAPGKEMMIHLFA-----TLWNKIYRTDFVRETGLQFPTGY 170 Query: 188 HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNL 233 ++D + + + + +++ Y S++ + K++ Sbjct: 171 RYEDACFLYCLAPHVKKLAFVDEAFVSYIQLGNSITHTNNHEVKDM 216 >UniRef50_C9LBS0 Glycosyl transferase, group 2 family n=1 Tax=Blautia hansenii DSM 20583 RepID=C9LBS0_RUMHA Length = 267 Score = 111 bits (277), Expect = 5e-23, Method: Compositional matrix adjust. Identities = 66/225 (29%), Positives = 119/225 (52%), Gaps = 10/225 (4%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +SVI+P+YNA + C+ S++ Q++ LE+I+INDGS D+S +I K Y E+ +R + Sbjct: 2 VSVIVPVYNAEKNLEKCIMSILNQSYQKLELILINDGSKDSSGKICKKYTEHDKRIRYIE 61 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 + N GAS RN GI +A G+Y+ FVD+DD + M E L+ E D D+ C F Sbjct: 62 RENKGASATRNEGISLALGQYIQFVDSDDYIEKNMVEALVNRIEETDADMVMCGYTEVFP 121 Query: 128 ETGETWQSIP-TDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAG 186 + + +P DR L G ++ + + + +Y+++ + + KF Sbjct: 122 DKNDV--RLPEIDRTIEMAEL-GTEY--PNIFEKFLLNSPCNKLYKKEKLSE---KFPED 173 Query: 187 LH-HQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGN 230 L +D+++ + N + + ++S Y Y + S++R +R+ + Sbjct: 174 LSLGEDLIFNLHNLENVKKISFIKESFYNYIIGQESLNRRYRKNS 218 >UniRef50_Q74L33 Putative uncharacterized protein n=1 Tax=Lactobacillus johnsonii RepID=Q74L33_LACJO Length = 292 Score = 111 bits (277), Expect = 5e-23, Method: Compositional matrix adjust. Identities = 72/241 (29%), Positives = 120/241 (49%), Gaps = 33/241 (13%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K+SVIIP+YN C++S++ QT++ LEII+++DGSTD++ E+ + Y E Y ++R+L Sbjct: 3 KISVIIPVYNDEKYLAQCLDSVLRQTYSNLEIILVDDGSTDSTPELCEKYREKYANIRIL 62 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 H+ N G +RN G+E+ATG+Y+ FVD DD + T E L + ++D D+A N + Sbjct: 63 HKKNGGVGSSRNAGLEMATGEYILFVDHDDLLGETHIEELYKLLKKNDADIAVGNFNHFI 122 Query: 127 RETGETWQSIPTDRLRSTGV-LTGPDWLRMGLSSRRW---------------THVVWMGV 170 E R+ G+ L D+ + W V W + Sbjct: 123 EEK------------RAYGIWLKEDDYFEKTYTPEEWFTVEYETVPYNMSIIFAVPWAKL 170 Query: 171 YRRDVIVKNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGN 230 Y++ + NI + +D + T + A + Y +++Y + + SVS Q N Sbjct: 171 YKKSLF--ENIVYPINARVEDDLTTWKIYLLADKIAYMNKAIYTHRIFENSVS---AQAN 225 Query: 231 K 231 K Sbjct: 226 K 226 >UniRef50_Q15RB7 Glycosyl transferase, family 2 n=1 Tax=Pseudoalteromonas atlantica T6c RepID=Q15RB7_PSEA6 Length = 289 Score = 110 bits (276), Expect = 5e-23, Method: Compositional matrix adjust. Identities = 71/229 (31%), Positives = 120/229 (52%), Gaps = 17/229 (7%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +SV+IP+YN+ C++S++ QT+ EII+I+DGSTD+S++I K Y + ++LL Sbjct: 5 VSVVIPVYNSQSYIADCIDSVLAQTYQDFEIIVIDDGSTDSSLDILKSY-QKKQKIKLLQ 63 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNA----- 122 N+G SVARN GIE ATGKY+ F+D+DD E + +A + LD+ + Sbjct: 64 IENSGQSVARNLGIEQATGKYLIFIDSDDCWENNTLEVTVALAEQKQLDMVLFDGESFVD 123 Query: 123 ----DWCFRETGETW--QSIPTD---RLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRR 173 D RE T+ Q D R V +G ++L +++ ++ + +Y+ Sbjct: 124 DKVTDEHTRECLNTYLIQGFKPDTYVRNAPEDVFSGGEFLVQQMNNNKFIASPVLYLYKL 183 Query: 174 DVIVKNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSV 222 + NI+F+ + H+D ++ E + N R SLY+ + S+ Sbjct: 184 EKF--KNIRFVPNIIHEDNAFSMELLVNDGRVMAIPSSLYRRRIRKNSI 230 >UniRef50_A6BIH4 Putative uncharacterized protein n=1 Tax=Dorea longicatena DSM 13814 RepID=A6BIH4_9FIRM Length = 331 Score = 110 bits (276), Expect = 5e-23, Method: Compositional matrix adjust. Identities = 53/126 (42%), Positives = 77/126 (61%), Gaps = 13/126 (10%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 LS+I+P+YN +TC+ SL+ Q A EII+++DGSTD+S I YA+ + ++++H Sbjct: 13 LSIIVPVYNVEKYLKTCINSLLEQKLDAYEIILVDDGSTDSSGGICDEYAKKHEKIQVIH 72 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD---- 123 + N G S ARN GIE A GKY+ FVD+DD + P MY+ L+ +A CNA Sbjct: 73 KKNGGLSSARNTGIENAVGKYIGFVDSDDYIMPEMYKNLIEVA-------KMCNAQMVMS 125 Query: 124 --WCFR 127 +CF Sbjct: 126 RYFCFE 131 >UniRef50_A7V2P1 Putative uncharacterized protein n=3 Tax=Bacteroidales RepID=A7V2P1_BACUN Length = 348 Score = 110 bits (276), Expect = 6e-23, Method: Compositional matrix adjust. Identities = 79/287 (27%), Positives = 132/287 (45%), Gaps = 28/287 (9%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M + K+S+I+P+YN C+ S++ QTW LE+I++ND + DNS+EIA+ ++P Sbjct: 1 MIESMKVSIIVPVYNVAKYIERCLLSVLDQTWQDLEVILVNDCTPDNSMEIARRVVASHP 60 Query: 62 H---VRLL-HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDV 117 VR L H+ N G S ARN GI + G Y+ F+D+DD + E L A++ D Sbjct: 61 RGTVVRCLEHEENRGLSAARNTGISASVGDYLYFLDSDDYISANAIELLADAAVQKRPDF 120 Query: 118 AQCNADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIV 177 N + TG + P TG G + +W + W + R +++ Sbjct: 121 VIGN----YEVTGARRWAPPLS--LGTGFYEGNALVLSTYVQGKWYVMAWNKLVSRPLVL 174 Query: 178 KNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKN----- 232 ++ + F G+ H+D +W+ + + ++ Y Y + S+ R N Sbjct: 175 QHKLYFQEGIVHEDDLWSFKLACMSQSMYVVNETTYYYSMQPDSIMRAPSMRNLECRVLV 234 Query: 233 LNYQRHYIKITRLL----------EKLNRNYADKIMIYPE---FHQQ 266 L Y +I+ +R L E L Y D+I+ + + FH Q Sbjct: 235 LGYIYDFIRSSRCLQDNRLIYIYFESLKAKYFDRILYFTKDTSFHYQ 281 >UniRef50_Q6ASC1 Related to glycosyltransferase involved capsular polysaccharide biosynthesis n=1 Tax=Desulfotalea psychrophila RepID=Q6ASC1_DESPS Length = 321 Score = 110 bits (276), Expect = 6e-23, Method: Compositional matrix adjust. Identities = 68/222 (30%), Positives = 117/222 (52%), Gaps = 13/222 (5%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTA--LEIIIINDGSTDNSVEIAKYYAENYPHVR 64 KLS IIPLYNA C+ S++ Q EII++NDG+ D+S +I E + +++ Sbjct: 3 KLSFIIPLYNAEKFIEKCLLSILQQNVPIQDYEIIVVNDGTKDDSRKIVSEMQEKFSNIK 62 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETL---MTMALEDDLDVAQCN 121 ++ Q N+G SV RNRG+ VATG+YV F+D+DD + + L +TM D L + Sbjct: 63 IIDQENSGPSVTRNRGLRVATGEYVWFIDSDDYIIDNSLQELLDILTMPSPDILYIGYKK 122 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNI 181 R+ T +R + T +L+ + + +VW+ +YRR ++ +N + Sbjct: 123 VYENKRDEKGT-----GERNGTAQPYTNHQFLK---KFKSFPPMVWLYIYRRTMLTENRL 174 Query: 182 KFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVS 223 F G+ H+D +T + +F A + + +++Y Y S++ Sbjct: 175 SFPEGIMHEDEEFTPKALFFAKKIIHFNRAIYCYLKREASIT 216 >UniRef50_C6Z980 Glycosyltransferase family 2 n=1 Tax=Bacteroides sp. 4_3_47FAA RepID=C6Z980_9BACE Length = 351 Score = 110 bits (275), Expect = 7e-23, Method: Compositional matrix adjust. Identities = 70/238 (29%), Positives = 127/238 (53%), Gaps = 28/238 (11%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K+S+++P+YN C++SL+ QT +EII+++D S DN +I Y Y +V+++ Sbjct: 5 KVSIVVPIYNVEKYLDRCIQSLLAQTLKEIEIILVDDESPDNCPQICDRYVAKYTYVKVV 64 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDV--AQCNADW 124 H+ N G ARN G+EVATG+Y+AFVD+DD + M E L E LDV ++ N D Sbjct: 65 HKKNDGLGQARNSGMEVATGEYIAFVDSDDFIDADMMERLYDECKEHSLDVIYSEFNVDE 124 Query: 125 --CFRETGETWQSIPTDRLRSTGVLTGP---DWLRMGL--------SSRRWTHVVWMGVY 171 FR T L+ + TG + LR+ + SS ++ G+Y Sbjct: 125 YPGFRIT-----------LKPERLYTGREEIEQLRLDIVGAEPNHRSSVKFQCSACKGLY 173 Query: 172 RRDVIVKNNIKFIAGLHH--QDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHR 227 ++ +++++F + + +D+++ + +++A++ + +Y Y L+ S+S +R Sbjct: 174 SMKLLSEHHLQFHSERQYISEDMLFNLDVLYHAMQVKTVPWQMYHYCLNGASLSHTYR 231 >UniRef50_A6NTS0 Putative uncharacterized protein n=1 Tax=Bacteroides capillosus ATCC 29799 RepID=A6NTS0_9BACE Length = 353 Score = 110 bits (275), Expect = 8e-23, Method: Compositional matrix adjust. Identities = 68/224 (30%), Positives = 120/224 (53%), Gaps = 16/224 (7%) Query: 6 NKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRL 65 +++S+I+P+Y R C++S++ QT+T +E+I+++DGS DN I YA+ V + Sbjct: 7 SEISIIVPVYKVEPYLRRCVDSILAQTFTDIEVILVDDGSPDNCPAICDKYAQQDNRVIV 66 Query: 66 LHQANAGASVARNRGIEVA----TGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 +HQ G SVARN G+ A ++++FVD+DD V+P E L A E+++ ++ C Sbjct: 67 IHQEYQGVSVARNTGLNWALLNSNSQWISFVDSDDWVHPRFLEYLHRAATENNVQISVC- 125 Query: 122 ADWCFRET--GETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKN 179 D+C RE+ E ++ D L+ T + L L VVW +Y + + K Sbjct: 126 -DYC-RESIYEEPKSNVEYDTLKVTSL-----QLYQHLEKSLLFTVVWNKLYYKSLFEK- 177 Query: 180 NIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVS 223 I+F G +D + + M+ + + + LY YY++ T ++ Sbjct: 178 -IRFPVGKISEDTFVSYKLMYKCPQIVFIKHPLYFYYINQTGIT 220 >UniRef50_C4Z1X9 Putative uncharacterized protein n=1 Tax=Eubacterium eligens ATCC 27750 RepID=C4Z1X9_EUBE2 Length = 336 Score = 110 bits (275), Expect = 8e-23, Method: Compositional matrix adjust. Identities = 82/261 (31%), Positives = 134/261 (51%), Gaps = 32/261 (12%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +SVIIP +NA D ++S+ITQT + EIII+ +GSTD++ + YAE Y ++ + Sbjct: 11 ISVIIPAHNASDYIERAIKSIITQTDMSWEIIIVENGSTDDTYIKSMKYAEKYSNINIF- 69 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMY-ETLMTMALEDDLDVAQCNADW-- 124 Q+ G S ARN G++ ATGK++ F+DADD +YP + E M E DL + N++ Sbjct: 70 QSEKGVSKARNYGLKKATGKWICFLDADDYIYPDAFNEIYKYMEDESDLILFGHNSETHM 129 Query: 125 ----CFRETG-ETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKN 179 C + G E +Q++ + +++ VW +++ +I KN Sbjct: 130 VKGKCVKFDGNEEYQNLRCEMIKNPT----------------QNMTVWGKLFKHQIIEKN 173 Query: 180 NIKFIAGLH-HQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGN-----KNL 233 I+F L +D +T +M NA +A LY Y N S R ++ G+ K+L Sbjct: 174 KIEFDEQLELAEDSDFTFRYMQNASKALRITSQLYHYSKDNASTVRTYKAGSEEKYIKSL 233 Query: 234 NYQRHY-IKITRLLEKLNRNY 253 N+ + Y IK T+ +++ Y Sbjct: 234 NHTKEYTIKDTKEIQEAYNQY 254 >UniRef50_D0TDE5 Glycosyltransferase family 2 n=1 Tax=Bacteroides sp. 2_1_33B RepID=D0TDE5_9BACE Length = 329 Score = 110 bits (274), Expect = 9e-23, Method: Compositional matrix adjust. Identities = 73/252 (28%), Positives = 126/252 (50%), Gaps = 10/252 (3%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 T+ ++VIIP+YN C+ S+I Q++ L+II+++DGS+D+S I + Sbjct: 2 TDLVTVIIPVYNVEKYLAKCVNSIIKQSYKKLQIILVDDGSSDSSGAICDELLLLDERIE 61 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 + HQ N G S ARN+GI++A G+Y+AFVD DD + P MYE L+ + + + Sbjct: 62 VFHQKNKGVSSARNKGIDLAQGEYIAFVDPDDFLAPNMYEVLIGNIKKYSAQCSSVSMKK 121 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLS-SRRWTHVVWMGVYRRDVIVKNNIKF 183 + G P + + + +G + L +S W ++ +D+I NNI+F Sbjct: 122 IYESDGHEELLTPVGQFK---IQSGEEALDDAMSLDDPWLGYSVNKLFLKDIIDVNNIRF 178 Query: 184 IAG-LHHQDIVWTTEFM-FNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIK 241 + ++D ++ E+M + + RYT LY Y + TS ++ + N L H I Sbjct: 179 NENIIMNEDSIFCYEYMSYCLISVRYT-LPLYNYLIRKTSATQTIFKDNDKL---FHLID 234 Query: 242 ITRLLEKLNRNY 253 + +L + Y Sbjct: 235 SSEILYNFGKLY 246 >UniRef50_D2Q666 Glycosyl transferase, family 2 n=4 Tax=cellular organisms RepID=D2Q666_9BIFI Length = 329 Score = 110 bits (274), Expect = 1e-22, Method: Compositional matrix adjust. Identities = 67/225 (29%), Positives = 120/225 (53%), Gaps = 2/225 (0%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M T +S+I+P+YN G+ C+ SL+ QT+ +++I+++DGSTD S ++ +A+ Sbjct: 1 MNTGTPLVSIIVPVYNTGNYLNVCVNSLLRQTYANIQVILVDDGSTDGSADLCDAWADKD 60 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 V ++HQAN G S ARN G++ A G ++ FVD+DD + E L+ + + +A Sbjct: 61 SRVMVIHQANGGVSTARNAGLDAAEGDWLEFVDSDDWLEREAIEGLLGLVQSEHAQMAIF 120 Query: 121 NADWCFRE-TGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKN 179 N E T T +D S+GVL+ D L + L+ W + R++I + Sbjct: 121 NYRSVLDEDTPYTVCEKTSDYRISSGVLSRKDALDVILAYSGVKGYAWNKFFSRELIERQ 180 Query: 180 NIKFIAGLHH-QDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVS 223 N++F + + +D++++ E+ + T++ LY Y + S S Sbjct: 181 NLRFDSKITMCEDLLFSVEYALLSTVTVGTDRCLYNYRNNPNSAS 225 >UniRef50_A7I1X7 Ss-1,4-galactosyltransferase n=1 Tax=Campylobacter hominis ATCC BAA-381 RepID=A7I1X7_CAMHC Length = 325 Score = 110 bits (274), Expect = 1e-22, Method: Compositional matrix adjust. Identities = 66/255 (25%), Positives = 128/255 (50%), Gaps = 23/255 (9%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +SVI+P++N + C++SL+ Q++ EI++I+DGS D S+EI + +A+ P +R+ Sbjct: 5 ISVIVPVFNVAEFLDDCIKSLLNQSYENFEILLIDDGSGDKSLEICQNFAKKEPKIRVFS 64 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 + N G ARN G++ A G+++ FVD+DD + + + ++++ +A C A F Sbjct: 65 KQNGGQGSARNLGLDNAKGEFICFVDSDDILASGFLDKTFEILKQNNVKMAMC-AYAPFV 123 Query: 128 ETGE--TWQSIPTDRLRSTGVLTGPDWLR-MGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 + E +S+ ++ +L + + + L++ + VW +Y + + K ++F Sbjct: 124 DASEISNIESVKSE----VKILNDKEIFKEIFLANPAFGTAVWRNLYHKSIFEK--LRFP 177 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLH----NTSVSRLHRQGNKNLNYQRHYI 240 ++D+ E NA +A + ++LY Y L S S+ H Q I Sbjct: 178 QNQIYEDVAIAFEIFNNAEKAAFCNETLYFYRLRAGSTTNSFSKRHLQAE---------I 228 Query: 241 KITRLLEKLNRNYAD 255 + EK+ + Y D Sbjct: 229 SVKNFCEKVVKKYPD 243 >UniRef50_B3C8H4 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=B3C8H4_9BACE Length = 359 Score = 110 bits (274), Expect = 1e-22, Method: Compositional matrix adjust. Identities = 68/223 (30%), Positives = 110/223 (49%), Gaps = 4/223 (1%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 +N + K+SVIIP+YN + ++S+ QT LEII+INDGSTDNS I + YA Sbjct: 52 INPSIKVSVIIPVYNTATYLHSALDSICNQTLKDLEIILINDGSTDNSQYIIEKYARKDN 111 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 ++ L Q N VARN G+++ATG+Y+ F+D+DD + M +LD + Sbjct: 112 RIKYLIQPNQRQGVARNNGLQLATGQYIYFMDSDDILDIDALRQCYEMCERKELDFVTFD 171 Query: 122 ADWCFRETGE---TWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVK 178 A ET T+ ++ + G + + L + V W+ + + + Sbjct: 172 AK-TIHETSTSSGTFNYCRKGKIDEQKIWDGIELMHYELEYEIFYVVPWLCFTKHSFLKQ 230 Query: 179 NNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTS 221 F +G+ H+D ++ + M NA R Y Q +K + N+S Sbjct: 231 YFGGFPSGIIHEDTIFVVQIMLNAHRVGYIPQPFFKRRVRNSS 273 >UniRef50_UPI0001968EF9 hypothetical protein BACCELL_04174 n=1 Tax=Bacteroides cellulosilyticus DSM 14838 RepID=UPI0001968EF9 Length = 328 Score = 109 bits (273), Expect = 1e-22, Method: Compositional matrix adjust. Identities = 66/213 (30%), Positives = 110/213 (51%), Gaps = 9/213 (4%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +SVI+P+YN C++S++ QT+ A E+++++DGS D+S I YA+ V ++H Sbjct: 3 VSVIVPVYNVEKFIHQCIDSILCQTYKAFEVVLVDDGSLDSSGIICDEYAKKDCRVSVIH 62 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF- 126 + N G S ARN G+E A G+YV FVD+DD L+ +E+ + C+ Sbjct: 63 KENGGLSDARNVGLECAKGEYVIFVDSDDFWIDNCQLELLIAEIEETPSCDFIGFNCCYY 122 Query: 127 -RETGE--TWQSIPTDRLRSTGVLTGPDWLRMGL-SSRRWTHVVWMGVYRRDVIVKNNIK 182 TGE W S + ++ +T D L L + + + + +R ++ NNI+ Sbjct: 123 NSSTGEFSPWFSYSDEIMK----MTERDVLITNLVCTGVFPMSACLKIIKRSFLMNNNIR 178 Query: 183 FIAGLHHQDIVWTTEFMFNALRARYTEQSLYKY 215 FI G+ +DI W E + + + R Q +Y Y Sbjct: 179 FIKGIVCEDIPWFIELLHYSHKFRVVNQYMYAY 211 >UniRef50_C2C0Z6 Glycosyltransferase n=3 Tax=Listeria RepID=C2C0Z6_LISGR Length = 336 Score = 109 bits (273), Expect = 1e-22, Method: Compositional matrix adjust. Identities = 68/238 (28%), Positives = 115/238 (48%), Gaps = 13/238 (5%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 +LSVIIP YN + C+ S+I QT+ ALEII+INDGSTD+++E A +P +++ Sbjct: 3 RLSVIIPAYNVSEYIEACLYSVIQQTYQALEIIVINDGSTDDTLEKITQVAAEHPQIQVY 62 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 N G S RN+G+ +A+G+Y+ FVD+DD + MYE ++ E D+ D Sbjct: 63 SHENIGLSATRNKGVALASGEYITFVDSDDVLDARMYEKMVGSLYEAPADIVTIGVDRLA 122 Query: 127 RETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAG 186 + +T+ S+ R + + L + W +YR+ K ++F G Sbjct: 123 ED--QTYTSVLHKRFMKETIYNTSIFESTRL---LYDTTSWNKIYRKSFWEKEQLEFPEG 177 Query: 187 LHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 +DI + + + Y + R GNK++ Q++ +K+T+ Sbjct: 178 YLFEDIPVVMKAYLATEQVNLIAEVGYLW--------RSRTSGNKSITQQKNSLKMTK 227 >UniRef50_UPI0001C41E5E glycosyl transferase GT2 family n=1 Tax=Methanobrevibacter ruminantium M1 RepID=UPI0001C41E5E Length = 696 Score = 109 bits (273), Expect = 1e-22, Method: Compositional matrix adjust. Identities = 65/196 (33%), Positives = 108/196 (55%), Gaps = 10/196 (5%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQT-WTALEIIIINDGSTDNSVEIAKYYAENYPHVRL 65 K+SVIIP+YN D + S+I QT + +E+II++D STDNS I + YA +Y ++++ Sbjct: 5 KISVIIPIYNTEDYIEETLLSVINQTIFDEIEVIIVDDESTDNSKYIIEKYALDYSNIQV 64 Query: 66 LHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWC 125 HQ N G ++RN G+ + G+Y+ F+D+DD + PT YETL MAL+++ D+ N Sbjct: 65 FHQKNEGQGISRNYGLSKSKGEYIHFLDSDDYLPPTAYETLYNMALKNESDIVIGNV--L 122 Query: 126 FRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRR---WTHVVWMGVYRRDVIVKNNIK 182 W+ ++ D M L+ R W +V ++ R+ +++ NI+ Sbjct: 123 RFALYNVWEE---SLYKNAYNDFDEDIAIMSLNERPSILWDTLVTNKLFNREFLIRKNIR 179 Query: 183 FI-AGLHHQDIVWTTE 197 F + QDI ++ E Sbjct: 180 FPNKKISFQDIPFSLE 195 >UniRef50_A5UNM1 Glycosyltransferase/CDP-glycerol:poly(Glycerophosphate) glycerophosphotransferase, GT2 family n=3 Tax=Methanobrevibacter smithii RepID=A5UNM1_METS3 Length = 1193 Score = 109 bits (273), Expect = 1e-22, Method: Composition-based stats. Identities = 85/285 (29%), Positives = 141/285 (49%), Gaps = 14/285 (4%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 S K+SVI+P++N D T ++S++ Q+ +EII INDGSTD+S++I + YA+ + Sbjct: 2 SKVKVSVIVPVFNVRDYLSTSLDSILNQSLEDIEIICINDGSTDDSLKILENYAKKDNRI 61 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 +++ + N G ARN G++ A G++++FVDADD + M E L ++ +LD+ C Sbjct: 62 KIISKENEGQGTARNVGLDNACGEFISFVDADDFIKKDMLEKLYNKSVNGNLDLVMCKVS 121 Query: 124 WCFRETGETWQSIPTDRL--RSTGVLTG-PDWLRMGLSSRRWTHVV----WMGVYRRDVI 176 ET I D L S +G + L ++ +T ++ + +YRR I Sbjct: 122 SFDNET-----HIIDDNLWYYSLKCFSGFKKEVFNNLDTKEFTSLISVTPYNKLYRRSFI 176 Query: 177 VKNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQ 236 KNNI+F +D V+ A R +++LY YY N S + + +K+ Sbjct: 177 EKNNIRFPEKYIFEDEVFFYNVYLKAKRISLIDENLY-YYRTNRKGSTVSKGQDKDYIDV 235 Query: 237 RHYIKITR-LLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRK 280 H ++ R LL + N K +Y F I + + RK Sbjct: 236 IHIFRLIRDLLVETNYINVYKKQVYNRFIHLILWRFSQTAPKYRK 280 Score = 82.4 bits (202), Expect = 2e-14, Method: Composition-based stats. Identities = 41/109 (37%), Positives = 70/109 (64%), Gaps = 4/109 (3%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWT---ALEIIIINDGSTDNSVEIAKYYAE 58 ++S SVI+ YN +ESL Q + +E+II++DGS D+S++IAK Y + Sbjct: 331 LDSCKPFSVIMASYNNDKYLEDAVESLTVQNFGFEHNVELIIVDDGSNDDSLKIAKKYQK 390 Query: 59 NYPH-VRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETL 106 NYP+ ++++ + N G + ARN G+E ATG Y+ F+D+DD++ P ++ + Sbjct: 391 NYPYNIQVISKENGGQASARNLGLEYATGDYINFLDSDDKLSPNTFKAV 439 >UniRef50_C5NWF7 Glycosyl transferase, group 2 family n=2 Tax=Firmicutes RepID=C5NWF7_9BACL Length = 320 Score = 109 bits (273), Expect = 1e-22, Method: Compositional matrix adjust. Identities = 70/261 (26%), Positives = 128/261 (49%), Gaps = 22/261 (8%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K+SVI+P+YN C++S+I QT+ LEII++NDGSTDNS ++ A +R++ Sbjct: 4 KISVIVPVYNVEQYLERCVDSIINQTYKNLEIILVNDGSTDNSGQLCDELARKDDRIRVV 63 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 H+ N G S ARN GI+ A + + F+D+DD + MYE LM ++ NAD Sbjct: 64 HKKNGGLSDARNVGIDEAEAELIGFIDSDDYIDEDMYEVLMN-------NLKDANADLSM 116 Query: 127 RETGETWQSIPTDRL--RSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 + + ++P ++ + T L+ + ++M + ++ + +Y++ + + KF Sbjct: 117 CGHYDVYNNVPEAQVADKKTWELSPQEAIKMVMEAKILSVTAVNKLYKKSLF--SEFKFE 174 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 G +D + + + T + Y YY+H + ++ Q+ +K Sbjct: 175 IGKIAEDAFIMIKLLDKCNKIVATNEKKY-YYVHREN----------SITTQKFSLKFLN 223 Query: 245 LLEKLNRNYADKIMIYPEFHQ 265 ++E +N I YPE + Sbjct: 224 VIEAYEQNKEIIIKKYPELKE 244 >UniRef50_A8U9H2 Glycosyl transferase, family 2/glycosyl transferase family 8 n=2 Tax=Carnobacterium sp. AT7 RepID=A8U9H2_9LACT Length = 352 Score = 109 bits (273), Expect = 1e-22, Method: Compositional matrix adjust. Identities = 67/227 (29%), Positives = 112/227 (49%), Gaps = 10/227 (4%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K+S+I+P+YN + ++ ++QT+ EII++NDGSTD S EI Y P + ++ Sbjct: 3 KISIIMPIYNVAEQLEKAIQCALSQTYKNSEIILVNDGSTDGSGEICDRYQLKDPRISVI 62 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 HQ NAG+ ARN G++ ATG+Y+ F D DD + E + A+E + DV C + Sbjct: 63 HQKNAGSGFARNAGLDRATGEYIYFADPDDYFEYNLIEETVAKAIESNADVVVCGY---Y 119 Query: 127 RETGETWQSIPTDRLRST--GVLTGP---DWLRMGLSSRRWTHVVWMGVYRRDVIVKNNI 181 E + +I T S G+LT D R S + +W +Y+ D + ++N Sbjct: 120 NEKVDKEGNITTLEKNSELDGLLTKEAFRDQFRKHYSLS--PYALWNKLYKHDFLKQHNC 177 Query: 182 KFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQ 228 +F QD ++ + N + +++ Y Y S +R+ Sbjct: 178 RFTNQKVGQDALFNQQVGKNVQTIYFHQKAYYHYVFREGSAVNRYRK 224 >UniRef50_C5VL41 Glycosyl transferase, group 2 family n=2 Tax=Prevotella RepID=C5VL41_9BACT Length = 298 Score = 109 bits (272), Expect = 1e-22, Method: Compositional matrix adjust. Identities = 55/117 (47%), Positives = 78/117 (66%), Gaps = 1/117 (0%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 +LS+IIP+YN D + C+ES++TQ +E+I+I+DGSTD+S +IA+ N + RL+ Sbjct: 2 RLSIIIPIYNVEDTLQRCLESVLTQMDERIEVILIDDGSTDSSGKIAEEMTANKGNCRLI 61 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMT-MALEDDLDVAQCNA 122 HQ N G S ARN GIEVATG Y+ FVD+DD V Y+ L+ +A D D+ + A Sbjct: 62 HQKNKGLSAARNAGIEVATGDYLTFVDSDDFVAEGTYDALLAVLAAHPDYDILEYPA 118 >UniRef50_B0N0F0 Putative uncharacterized protein n=1 Tax=Clostridium ramosum DSM 1402 RepID=B0N0F0_9FIRM Length = 309 Score = 109 bits (272), Expect = 2e-22, Method: Compositional matrix adjust. Identities = 62/216 (28%), Positives = 112/216 (51%), Gaps = 9/216 (4%) Query: 10 VIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLHQA 69 +IIP+YN+ R C++S++ Q +T +EII++NDGSTDNS+ I + Y +L++ Sbjct: 1 MIIPIYNSELHLRHCLDSILNQIYTNIEIILVNDGSTDNSLSICEEYKHQDERFKLININ 60 Query: 70 NAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFRET 129 N+GAS ARN G+ A G+YV+F+D+DD + M+ ++ + D+A+C + Sbjct: 61 NSGASHARNIGLSKAKGQYVSFIDSDDMIDKNMFHDMLGLIEIHKADMAECEIKYVRNHK 120 Query: 130 GETWQSIPTDRLRSTGVLTGPDWLRMGL--SSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 +I +L + D L S + TH + + D++ ++ K I G Sbjct: 121 TNKINNIEEIKL-----MNKDDMLNRFFRTSGEKDTHTLCNKLINIDIL--DDFKLIEGY 173 Query: 188 HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVS 223 ++D+ + + + YT Y YY +++ V+ Sbjct: 174 MNEDVKGVYDIIKKCNKCIYTSNKYYLYYRNSSGVT 209 >UniRef50_C2DBB8 Glycosyltransferase n=19 Tax=Bacteria RepID=C2DBB8_ENTFA Length = 322 Score = 109 bits (272), Expect = 2e-22, Method: Compositional matrix adjust. Identities = 62/216 (28%), Positives = 113/216 (52%), Gaps = 7/216 (3%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 ++S+I+P+Y R C++S++ QT+T E+I+++DGS DNS +I YA VR++ Sbjct: 3 EISIIVPVYKVEPYLRKCLDSILAQTFTNFEVILVDDGSPDNSGKICDEYAIKDSRVRVI 62 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 H+ N G S ARN GI++A GKY+ FVD+DD + MYE L D D++ F Sbjct: 63 HKENGGLSSARNAGIDIAQGKYLGFVDSDDYIAEDMYEVLYENLKFYDADISSVEI-IPF 121 Query: 127 RETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAG 186 R+ + +++ T + + +L + ++ L + W +YR+++ N +++ G Sbjct: 122 RD--DKYEN--TSKEKKEILLNKKEAIKSVLEGTDFYAYAWNKLYRKELFYNN--RYLEG 175 Query: 187 LHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSV 222 +D + + + + + Y Y N S+ Sbjct: 176 ETFEDAYIIIDILLETDKIIVSNEEKYFYLQRNDSI 211 >UniRef50_Q8A777 Glycoside transferase family 2 n=1 Tax=Bacteroides thetaiotaomicron RepID=Q8A777_BACTN Length = 330 Score = 109 bits (272), Expect = 2e-22, Method: Compositional matrix adjust. Identities = 69/221 (31%), Positives = 104/221 (47%), Gaps = 15/221 (6%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR-- 64 ++SVI+P+YN C+ SL+ QT +E I +ND + D+S++I Y E YP R Sbjct: 8 RISVIVPIYNVKLYIERCVRSLMEQTLENIEFIFVNDCTPDDSMDILHYVLEEYPKRREQ 67 Query: 65 ---LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 + H+ N G S RN G++ ATG+Y+ + D+DD V MYE L+ ALE D+ C+ Sbjct: 68 IKIIEHETNRGISAVRNTGLKNATGQYIIYCDSDDWVEKNMYEKLLVKALETSADIVGCD 127 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNI 181 F + +I LR L W +V +Y R I Sbjct: 128 ----FFDDYVAHSTIRKQLFPQQSEKCVKKMLRGELHCGTWNKLVIKSLYER-----TKI 178 Query: 182 KFIAGLHH-QDIVWTTEFMFNALRARYTEQSLYKYYLHNTS 221 F G++ +D+ F+A + Y ++LY Y HN S Sbjct: 179 DFPEGINMWEDVSTIIPLCFHATKIDYIPEALYHYIHHNVS 219 >UniRef50_B0ACD9 Putative uncharacterized protein n=1 Tax=Clostridium bartlettii DSM 16795 RepID=B0ACD9_9CLOT Length = 217 Score = 109 bits (272), Expect = 2e-22, Method: Compositional matrix adjust. Identities = 65/192 (33%), Positives = 106/192 (55%), Gaps = 13/192 (6%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 S K+SVI+P+YN CM+SLI QT +EII+I+D S D + EI K Y + + Sbjct: 2 SEPKVSVIVPVYNCEKYIERCMDSLINQTLADIEIIVIDDESKDTTFEILKEYEKKDKRI 61 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 +L+ + N+G S +RN+GIE+ATGKY+AFVD DD + E + +A ++ D+ C+ Sbjct: 62 KLMTKKNSGVSSSRNKGIEIATGKYIAFVDGDDWIDLDCLEKMYEVAEKNQTDITICSYT 121 Query: 124 WCFRETGETWQ-SIPTDRLRSTG-------VLTGPDWLRMGLS---SRRWTHVVWMGVYR 172 F++ + ++ D++ L GP ++ S S VW +Y Sbjct: 122 REFKDHSKPKNINLKEDKVYEDEDLKILHRKLVGP--IKEEFSQPESLDSLGTVWGKLYN 179 Query: 173 RDVIVKNNIKFI 184 R++++K++ KFI Sbjct: 180 RELLIKSDNKFI 191 >UniRef50_C2M966 Glycosyl transferase family protein n=1 Tax=Porphyromonas uenonis 60-3 RepID=C2M966_9PORP Length = 336 Score = 109 bits (272), Expect = 2e-22, Method: Compositional matrix adjust. Identities = 78/247 (31%), Positives = 122/247 (49%), Gaps = 17/247 (6%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESL--ITQTWTALEIIIINDGSTDNSVEIAKYYAEN 59 M+ LSVIIP+YN C++SL I TA+EII+INDGS D ++E+ Y Sbjct: 1 MSVELPLSVIIPIYNGAKTIARCLDSLVAILDAATAIEIIVINDGSQDETLEVVTNYQTQ 60 Query: 60 YPH--VRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDV 117 +P +R++ Q N G S ARN+G+EVA G YV FVDADD V L+T+ ++ D Sbjct: 61 HPEQRIRIISQTNQGQSAARNKGLEVAQGAYVWFVDADDWVDSVAASYLLTLITKESYDK 120 Query: 118 AQCNADWCFRETGETWQSIPTDRLRST----GVLTGPDWLRMGLSSRRWTHVVWMGVYRR 173 + R + + +P++ + ++ L D R L+ R + V ++RR Sbjct: 121 PYDMLCFGVRNVADEGEEVPSEWMDTSVHDFDQLQCTDG-RALLTQRNLSGSVCSFLWRR 179 Query: 174 DVIVKNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYY------LHNTSVSRLHR 227 ++ ++ +F+ GL H+D ++ + A R Y YY +HN LH Sbjct: 180 AMLEEHQARFMEGLLHEDALFYWHYTAYAERVLLVPIVAYCYYQRADSTIHNPDT--LHL 237 Query: 228 QGNKNLN 234 +G L Sbjct: 238 RGISRLE 244 >UniRef50_D1PQG3 Glycosyltransferase n=1 Tax=Subdoligranulum variabile DSM 15176 RepID=D1PQG3_9FIRM Length = 286 Score = 109 bits (272), Expect = 2e-22, Method: Compositional matrix adjust. Identities = 74/239 (30%), Positives = 125/239 (52%), Gaps = 26/239 (10%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S+I+P+YNA + C+ES+ QT+ LEI+++NDGS D S+++ + YA + L+ Sbjct: 8 VSIIVPIYNAQNHIARCVESIRRQTYENLEILLLNDGSQDVSLQVCQMYAGVDKRIVLID 67 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMA--LEDDLDVAQCNADWC 125 +AN+G + RN G++ A G Y+ FVDADD + P E L+ A + DL +A N Sbjct: 68 KANSGVAATRNLGLQFAKGAYLQFVDADDTILPNATELLVDRATRFDADLVIAHYNRITP 127 Query: 126 FRETGETWQS-----IPTDRLRSTG-VLTGP---DWLRMGL---SSRRWTHVVWMGVYRR 173 R E ++ D++++ G +L GP + GL + + V+W +YR Sbjct: 128 PRPRSEEAEARRPRPPRPDKVQTFGFLLEGPMNKEQFAAGLMQEPASFYYGVMWNKLYRA 187 Query: 174 DVIVKNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQ------SLYKYYLHNTSVSRLH 226 D++ ++ A +++ W+ + FN RY E+ +Y YY NT S +H Sbjct: 188 DIVRAHDDILCA----EELDWSEDLYFNLSYIRYAEKFYALSTPIYNYY--NTPGSAVH 240 >UniRef50_C4FMC2 Putative uncharacterized protein n=3 Tax=Firmicutes RepID=C4FMC2_9FIRM Length = 317 Score = 108 bits (271), Expect = 2e-22, Method: Compositional matrix adjust. Identities = 75/240 (31%), Positives = 120/240 (50%), Gaps = 16/240 (6%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 LS+I+P+YN C++S++ QT+ LEII+++DG+TD S IA YA +++ H Sbjct: 5 LSIIVPIYNVEQYVDKCIQSILNQTYQNLEIILVDDGATDCSGSIADSYAAKDKRIKVFH 64 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 + N G S ARN G++ TG Y+ FVD+DD + TM E L +A + D+ CN + +R Sbjct: 65 KENGGLSDARNYGLDHVTGDYILFVDSDDFIENTMCERLFAVANSTNADIVSCNY-YIYR 123 Query: 128 ETGETWQSIPTDRLR-STGVLTGPDWLRMGL--SSRRWTHVVWMGVYRRDVIVKNN-IKF 183 E + SI T ++ T TG D LR L + +VVW +++ ++ ++F Sbjct: 124 EDDDI--SIHTMSVQDDTRTFTGMDMLRYYLLKTEPFDLNVVWNKLFKSELFNGTVLVRF 181 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKIT 243 G +D NA + LY Y + R G+ N+ R ++ T Sbjct: 182 PKGRVQEDNFTIFRLFLNANTIVTVNEPLYYY---------VQRAGSIMANFSRRFMTDT 232 >UniRef50_UPI0001851388 hypothetical protein Bcoam_19734 n=1 Tax=Bacillus coahuilensis m4-4 RepID=UPI0001851388 Length = 201 Score = 108 bits (271), Expect = 2e-22, Method: Compositional matrix adjust. Identities = 50/113 (44%), Positives = 73/113 (64%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +SVIIP+YNA + C+ES++ Q + EIII+NDGSTD S IA YAE + ++++H Sbjct: 12 ISVIIPMYNAETWIKQCLESVLCQKNVSFEIIIVNDGSTDTSPSIADDYAEKFSQIQVIH 71 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 + N G S ARN G+E + G YV F+D DD + ++L+ A E DV++C Sbjct: 72 KKNGGVSSARNAGLENSHGNYVFFLDIDDAISNNALQSLLEFAEEQSADVSEC 124 >UniRef50_D2RJ84 Glycosyl transferase family 2 n=1 Tax=Acidaminococcus fermentans DSM 20731 RepID=D2RJ84_ACIFE Length = 324 Score = 108 bits (271), Expect = 2e-22, Method: Compositional matrix adjust. Identities = 74/251 (29%), Positives = 124/251 (49%), Gaps = 25/251 (9%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 LSVI+P+YN C++S+I QT+ LEII+++DGSTD+S I YA + + ++H Sbjct: 3 LSVIVPVYNVEKYLCKCLDSIIQQTYKNLEIILVDDGSTDSSGLICDNYAAHDKRIHVIH 62 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 + N G S ARN G+E+ +G Y+ FVD+DD + M+E L+ ++LDVA C Sbjct: 63 KKNEGLSEARNSGLEICSGDYIGFVDSDDWIATDMFEKLVKFVKCENLDVAMCG------ 116 Query: 128 ETGETWQS--IPTDRLRSTGVLTGPDWLRMGLSSRRWTHV--VWMGVYRRDVIVKNNIKF 183 + W + T + + D + L +R VW +YR + +KF Sbjct: 117 -VADVWPEHVVKTPSFKKVILTDVNDIISEILVNRHGGTAVPVWCRIYRSSLF--KTLKF 173 Query: 184 IAGLHHQDIVWTTEFMFNALR-ARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKI 242 G +++D + +++ R R+++ Y +N HR+G+ N Q H ++ Sbjct: 174 EKGRYYEDGFYLLKWIERTGRFGRFSDSG----YFYN------HREGSIT-NIQGHSKRV 222 Query: 243 TRLLEKLNRNY 253 + NY Sbjct: 223 DDFRKAYEDNY 233 >UniRef50_B6FJ78 Putative uncharacterized protein n=1 Tax=Clostridium nexile DSM 1787 RepID=B6FJ78_9CLOT Length = 321 Score = 108 bits (271), Expect = 2e-22, Method: Compositional matrix adjust. Identities = 62/212 (29%), Positives = 106/212 (50%), Gaps = 9/212 (4%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K+SVI+P+Y C++S++ QT++ +E+I+++DG+ D EI Y E V+++ Sbjct: 3 KISVIVPVYKVEKYIHKCVDSILNQTFSDIEVILVDDGTPDRCGEICDAYGEQDSRVKVI 62 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 H+ N G S ARN G+ A+G+Y+ F+D+DD + M E + T E D+A C + Sbjct: 63 HKENGGLSDARNAGMPHASGEYIIFIDSDDYIESDMLEYMYTRLTEAGADMATCGLYEVY 122 Query: 127 RETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAG 186 + I + V +G + R L +W + RR I +++F G Sbjct: 123 SD------RIEQQKEEKDFVCSGEEAFRCILQGHTIRGEIWNKLIRRSCI--EDLRFPKG 174 Query: 187 LHHQDIVWTTEFMFNALRARYTEQSLYKYYLH 218 ++DI +T + M + + Y YYLH Sbjct: 175 KLYEDIYFTVDMMQRIKKVAVGTKPKY-YYLH 205 >UniRef50_A6NTT2 Putative uncharacterized protein n=1 Tax=Bacteroides capillosus ATCC 29799 RepID=A6NTT2_9BACE Length = 324 Score = 108 bits (271), Expect = 2e-22, Method: Compositional matrix adjust. Identities = 60/195 (30%), Positives = 106/195 (54%), Gaps = 7/195 (3%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S+I+P+YN C+ S+I QT+T LEII+++DGS DN + +AE ++++H Sbjct: 5 ISIIVPVYNVEPYLDQCVRSIIEQTYTDLEIILVDDGSPDNCPAMCDAWAEKDRRIKVVH 64 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 + N G S ARN G+++A+G+Y+ FVD+DD + P + E+ + ++ + D+ A W + Sbjct: 65 KQNGGLSDARNAGMDIASGEYIGFVDSDDWLEPDILESALKQLVQANADIIAFGAIWEYS 124 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 E E IP L LR S +V +YR++++ + ++F+ G Sbjct: 125 EKHE----IPHP-LEDNIYCGSEQILRAYFSECMTLTIVVNKLYRKNIL--DEVRFLKGK 177 Query: 188 HHQDIVWTTEFMFNA 202 H+D +T + A Sbjct: 178 LHEDEFFTYRVLAKA 192 >UniRef50_D0WAZ5 Glycosyl transferase, group 2 family n=5 Tax=Neisseria RepID=D0WAZ5_NEILA Length = 337 Score = 108 bits (271), Expect = 2e-22, Method: Compositional matrix adjust. Identities = 61/220 (27%), Positives = 106/220 (48%), Gaps = 9/220 (4%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 S+I+P+YN R C++S++ + + E+I+++DGS D +I Y YPH++++H Sbjct: 3 FSIIVPIYNVEKYLRCCVDSVLAENFADYEMILVDDGSPDGCGKICDEYTGKYPHIKVIH 62 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDL-DVAQCNADWCF 126 Q N G S ARN GI A G Y+ F+D+DD T + DL +A D Sbjct: 63 QENGGLSDARNAGIRAAKGDYLIFLDSDDYWADTNRSKNAGGGILFDLQQLADKKVDLIL 122 Query: 127 RETGETWQSIPTDRLRSTGVLTGPDWLR---MGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 + ++ IP + D++R + R + W + RR++I+KNN+ F Sbjct: 123 HPSSFNYRDIP-----KGADFSDNDFVRHFETLVEGRYYIANAWTKIVRREIIIKNNLFF 177 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVS 223 G H+D ++ + + + Y+Y + S+S Sbjct: 178 PKGYIHEDFPYSLQLARFIKTFAFYDNPFYQYRVLGNSIS 217 >UniRef50_D0BKT8 Glycosyl transferase, group 2 n=1 Tax=Granulicatella elegans ATCC 700633 RepID=D0BKT8_9LACT Length = 338 Score = 108 bits (270), Expect = 3e-22, Method: Compositional matrix adjust. Identities = 69/233 (29%), Positives = 119/233 (51%), Gaps = 16/233 (6%) Query: 6 NKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRL 65 N +S+I+P+YN + C+ES+ QT+T +E+I++ND STD S EI + Y E RL Sbjct: 10 NMISIIVPVYNVENYLEECLESIKNQTYTDIEVILVNDESTDGSKEICERYCELDSRFRL 69 Query: 66 LHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWC 125 ++Q N G SVARN G+ + G+ + FVD+DD V Y ++ + +D+D+ +CN Sbjct: 70 INQENQGQSVARNHGVSASIGELLTFVDSDD-VLSIKYLEILNRYMTEDIDLVECN---- 124 Query: 126 FRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIA 185 +R T ++ + + + + ++ +YRR ++ + F+ Sbjct: 125 YRATRSVFEHLKEAENIQIEFEGNSEESVIKACNYGISYSPVCKLYRRKLL--EDFPFLT 182 Query: 186 GLHHQDI---VWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQ-GNKNLN 234 G+ ++DI V +F+ +R Y YY S S +H+Q KNL+ Sbjct: 183 GVIYEDIYHGVGMLKFIRKMVRLDYV-----GYYYRKRSNSTMHKQFSEKNLD 230 >UniRef50_D0BKT9 Capsular polysaccharide biosynthesis protein CpsI n=2 Tax=Granulicatella elegans ATCC 700633 RepID=D0BKT9_9LACT Length = 335 Score = 108 bits (270), Expect = 3e-22, Method: Compositional matrix adjust. Identities = 63/216 (29%), Positives = 120/216 (55%), Gaps = 13/216 (6%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +SVI+P+YN + C+ES+ QT+T +E+I++NDGSTD+S EI + + + +L++ Sbjct: 2 ISVIVPVYNVEEYLEECLESIRQQTFTDIEVILVNDGSTDSSKEICERFCQVDNRFKLIN 61 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 Q N G SVARNRG++ + G+++ FVD+DD + + E L+ ++ D+D+ +C R Sbjct: 62 QENQGQSVARNRGVKESVGQFIMFVDSDDVINTNVLEVLLPY-MKTDVDIVECRMT---R 117 Query: 128 ETGETWQSIPTDRLRSTGVLTG--PDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIA 185 + E + + ST V G + + ++ + + + +YRR+++ K I F+ Sbjct: 118 KKEEFFLN-----KTSTIVFEGNSKEAILNCIAFKEVKYCAFKKLYRREIVQK--IPFLE 170 Query: 186 GLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTS 221 G ++D+ ++ + + + Y Y +H S Sbjct: 171 GYIYEDVFTGINYLKHIRKIVVIDYIGYYYRVHANS 206 >UniRef50_D0TAC7 N-acetylglucosaminyltransferase n=1 Tax=Bacteroides sp. 2_1_33B RepID=D0TAC7_9BACE Length = 334 Score = 108 bits (270), Expect = 3e-22, Method: Compositional matrix adjust. Identities = 51/118 (43%), Positives = 78/118 (66%) Query: 6 NKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRL 65 +K+S+IIP+YNA C++SL+ QT+ +EII++NDGS D+S I + Y + Y + L Sbjct: 2 SKVSIIIPVYNAEKTLCRCLDSLVVQTYEDVEIILVNDGSVDSSRNICETYRDKYSQIIL 61 Query: 66 LHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 ++Q NAG + ARN GI+ A+GKY++FVDADD V M E ++ A + ++ C D Sbjct: 62 INQENAGPATARNVGIDSASGKYISFVDADDYVECGMIEEMVNAAETNHAEMVICGYD 119 >UniRef50_C4ZHK7 Glycosyltransferase Family 2 modular protein n=1 Tax=Eubacterium rectale ATCC 33656 RepID=C4ZHK7_EUBR3 Length = 881 Score = 108 bits (270), Expect = 3e-22, Method: Compositional matrix adjust. Identities = 64/212 (30%), Positives = 110/212 (51%), Gaps = 4/212 (1%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K+S+I+P+YNA + C+ S+ +QT +EII ++DGSTD+S EI +AE +++ Sbjct: 15 KVSIIVPIYNAEKYLKQCLGSIQSQTLKDIEIICVDDGSTDSSPEIMDSFAEKDSRFKVV 74 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 H+ N G + N G+ VATG+Y+ V+ADD + M+E L + + +D+ + N C+ Sbjct: 75 HKPNGGNGHSMNAGLSVATGEYIGCVEADDYIEKNMFEKLYLYSNDGSIDIVKSNFWNCY 134 Query: 127 RETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTH-VVWMGVYRRDVIVKNNIKF-- 183 + + + + L P ++ W H +W +Y+R +I +N+IKF Sbjct: 135 EDESGKITKVVNEERNNMPELALPFTIKE-YPQILWGHPSIWSAIYKRSLIEENHIKFKE 193 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKY 215 G D + E M A +T++ Y Y Sbjct: 194 AKGGGWVDNPFFFETMCCAKSIYWTKRPFYNY 225 >UniRef50_C6IMR5 Glycoside transferase family 2 n=1 Tax=Bacteroides sp. 1_1_6 RepID=C6IMR5_9BACE Length = 328 Score = 108 bits (269), Expect = 3e-22, Method: Compositional matrix adjust. Identities = 74/222 (33%), Positives = 108/222 (48%), Gaps = 19/222 (8%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPH---- 62 K+S+IIP+Y + C SL + +E + +ND + D S+ + + YPH Sbjct: 4 KISIIIPVYRVENYISCCCRSLFGMRFQDVEYLFVNDATPDGSMAVVESILNEYPHRKAH 63 Query: 63 VRLL-HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 V+LL H N G + ARN G+E ATG+Y+AFVDADD + M+E L A D+ C Sbjct: 64 VKLLAHPKNMGVASARNTGLEAATGEYIAFVDADDWIEENMFEKLYQRAEAGHFDIVGC- 122 Query: 122 ADWCFR-ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSR-RWTHVVWMGVYRRDVIVKN 179 DW ET + + P S D L+ LS RW +W + RR + VKN Sbjct: 123 -DWYLEFETSKRYMRQPVYERTS-------DCLKAMLSGEMRW--FLWAFLIRRGIYVKN 172 Query: 180 NIKFIAGLH-HQDIVWTTEFMFNALRARYTEQSLYKYYLHNT 220 NI+F+ G + +D+ A R+ ++LY Y N Sbjct: 173 NIRFLDGANIGEDMAVLIRCFSFARSYRHISEALYHYVKSNA 214 >UniRef50_A6NTT3 Putative uncharacterized protein n=1 Tax=Bacteroides capillosus ATCC 29799 RepID=A6NTT3_9BACE Length = 331 Score = 108 bits (269), Expect = 3e-22, Method: Compositional matrix adjust. Identities = 90/331 (27%), Positives = 147/331 (44%), Gaps = 30/331 (9%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 S K+SVI+P YN C++SL+ Q++ LEII+ +DGSTD + + YA + + Sbjct: 2 SMKKISVIVPAYNIEPYLGRCLDSLLAQSYPGLEIIVTDDGSTDGTAALLDRYAAEHSQI 61 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 +HQ N G + AR ++ ATG Y+ FVD DD M++TL+ A D+A C Sbjct: 62 VAIHQPNGGVTAARLAALKQATGDYIGFVDGDDLAEHEMFQTLIDDAEHYGADIAHCGYQ 121 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHV---VWMGVYRRDVI---- 176 F S DR TG + D L R V +W +YRR++I Sbjct: 122 MVF-------PSGRVDRYYGTGRVVEQDGESGVLDLVRGDFVEPGLWNKLYRRELIEHFL 174 Query: 177 VKNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQ 236 + + + + ++D++ A + + ++ LY Y L S S Q K L+ Q Sbjct: 175 AEGRMDWTVRI-NEDLLMNFLLFSLAKKCVFRDKCLYHYMLRKGSASSAKVQAYKLLDPQ 233 Query: 237 RHYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKE---PDILTRQRM-IA 292 R + R Y + + + HQ +RV + P++ + R+ + Sbjct: 234 R----------VMQRIYEETTGLV-QLHQAAYVRYVRVLLGNTTQSTCPEVSQQARIELK 282 Query: 293 EIFTSGMYKRLITNVRSVKVGYQALLWSFRL 323 ++ YK+L R + +G LL +RL Sbjct: 283 QLICEKEYKQLSWKERYMVLGASKLLPLYRL 313 >UniRef50_A8SZ40 Putative uncharacterized protein n=1 Tax=Coprococcus eutactus ATCC 27759 RepID=A8SZ40_9FIRM Length = 321 Score = 108 bits (269), Expect = 4e-22, Method: Compositional matrix adjust. Identities = 60/216 (27%), Positives = 111/216 (51%), Gaps = 4/216 (1%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K+S+I+P+YN ++ C+ SLI QT+ +EI++++DGS ++ YA+ + ++ Sbjct: 16 KVSIIVPVYNVENELDRCVASLINQTYQNIEILLVDDGSPGACPKMCDEYAKKDKRIIVI 75 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 H+ N G S ARN G++ A+GKYV FVD+DD + E +T ++DD+D+ A Sbjct: 76 HKPNGGLSSARNTGLKRASGKYVLFVDSDDYIETDSCEKFLTF-VKDDVDLV-VGAGRMI 133 Query: 127 RETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAG 186 + E + +R + D+ + W + +Y R+ ++KN++ F G Sbjct: 134 KGKDEIL--LRHSNIREREKYSARDFTIKSIKHSEWNSYACLSLYDREFLIKNDLFFREG 191 Query: 187 LHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSV 222 + +D+ E A Y + + Y Y + + S+ Sbjct: 192 IIFEDMQILPEIYLKAGNIVYMDYAFYNYVVRDGSI 227 >UniRef50_B2V0U1 Glycosyl transferase, group 2 family protein n=1 Tax=Clostridium botulinum E3 str. Alaska E43 RepID=B2V0U1_CLOBA Length = 324 Score = 108 bits (269), Expect = 4e-22, Method: Compositional matrix adjust. Identities = 78/280 (27%), Positives = 134/280 (47%), Gaps = 30/280 (10%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M+N ++ +I+P+Y C++S++ Q+++ LEII++NDGS DN EI + Y N Sbjct: 1 MLNE--RIGIIVPIYKVEKYLCRCLDSIVNQSYSNLEIILVNDGSPDNCHEICENYKRND 58 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 ++++HQ N G S ARN G++ ++ F+D+DD + M E L + E D+ C Sbjct: 59 NRIKVIHQKNMGLSAARNTGLKELQSPFLVFIDSDDYIELDMIEELYKLMKETKSDIVSC 118 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVV----WMGVYRRDVI 176 F S + L+ +T + LS +T+V+ W +Y+R++ Sbjct: 119 GHKQIF-------DSGNIEELKGNTQITIYSK-KEALSKFLFTNVIDVVSWNKLYKRELF 170 Query: 177 VKNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQ 236 +IK+ +G ++D T + + A + YT + LY Y +TS+ GN+ Sbjct: 171 T--DIKYPSGKLYEDHFTTYKIIDRAEKITYTSKVLYNYCKRSTSIG-----GNE----- 218 Query: 237 RHYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCH 276 + K +L E L+ I YP+ I L CH Sbjct: 219 -YNPKTMQLKEALDEECKFIIGEYPDLKDDIN---LAYCH 254 >UniRef50_C4G9K6 Putative uncharacterized protein n=1 Tax=Shuttleworthia satelles DSM 14600 RepID=C4G9K6_9FIRM Length = 445 Score = 108 bits (269), Expect = 4e-22, Method: Compositional matrix adjust. Identities = 67/207 (32%), Positives = 105/207 (50%), Gaps = 22/207 (10%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M K+S+IIP+YNA R C++S++ Q +T E+++++DGS+D S I YA Sbjct: 1 MGPNPKVSIIIPVYNAASALRRCLDSVLKQEFTDFELLLMDDGSSDESPAILDEYAGKDS 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 +RL+H+ N+G S RN+ +++A G YV F+DADD + P + LM A+E Q Sbjct: 61 RIRLVHKNNSGVSDTRNQALDLARGTYVQFLDADDWIAPEATK-LMVRAME------QKK 113 Query: 122 ADWC----FRETGETWQ---SIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRD 174 AD +R GE I D L G M + + V+W +Y R Sbjct: 114 ADLVVTDFYRVVGEHVSHKGDIEEDSLMDLQTFAGH---MMENPADFYYGVIWNKLYSRQ 170 Query: 175 VIVKNNIKFIAGLHHQDIVWTTEFMFN 201 +I +N I+ ++ W +F+FN Sbjct: 171 IIEENQIRM-----DPEVSWCEDFLFN 192 >UniRef50_C0QZN4 Putative glycosyl transferase family 2 protein n=1 Tax=Brachyspira hyodysenteriae WA1 RepID=C0QZN4_BRAHW Length = 350 Score = 108 bits (269), Expect = 4e-22, Method: Compositional matrix adjust. Identities = 69/199 (34%), Positives = 109/199 (54%), Gaps = 11/199 (5%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K+SVIIP+YNA C++S+I QT +EII I+D STDNS I + Y + + +L Sbjct: 3 KVSVIIPVYNAEKYIEKCLDSVINQTLKEIEIICIDDCSTDNSHSILERYKKIDNRIVVL 62 Query: 67 -HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWC 125 ++ N G RN GI+ A G+Y+AF+D DD P L + + + + D++ N Sbjct: 63 RNETNQGIGKNRNIGIKKAVGEYLAFIDNDDFYSPNFLYDLFSTSKKYNSDISYTNT--I 120 Query: 126 FRETGETWQSIPTDRLRSTGVL--TGPDWLRM----GL-SSRRWTHVVWMGVYRRDVIVK 178 FRE E + + ++ V+ T P + + GL S + W VW ++R+D ++K Sbjct: 121 FREK-EDYNIVIDGNIKDDYVIDYTLPSYFYLYSEKGLYSDKLWPWEVWGKIWRKDFLIK 179 Query: 179 NNIKFIAGLHHQDIVWTTE 197 NNI FI + +D+V+ E Sbjct: 180 NNITFINSNYFEDLVFLLE 198 >UniRef50_A6L2Z0 Glycosyltransferase family 2 n=1 Tax=Bacteroides vulgatus ATCC 8482 RepID=A6L2Z0_BACV8 Length = 312 Score = 107 bits (268), Expect = 5e-22, Method: Compositional matrix adjust. Identities = 66/226 (29%), Positives = 121/226 (53%), Gaps = 14/226 (6%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M N T +S+IIP+YNA C++S+I QT++++E+III+DGS+D S I YA Y Sbjct: 1 MKNPT--VSIIIPVYNAEGTITRCLDSIIHQTYSSIEVIIIDDGSSDLSGSICDNYATKY 58 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 +++++H+ N G S+AR G++ + G Y VDADD + E +M +AL+ + D+ C Sbjct: 59 NYIQVIHKKNEGVSIARQIGVKHSKGLYSIHVDADDYIDKCEIERMMDIALKFNADMVVC 118 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGP-DWLRMGLSSRRWTHVVWMGVYRRDVIVKN 179 + +++ +Q D + L + + +G W ++ +Y N Sbjct: 119 DYILEYKKNKFVYQKGMKDACDAFDFLKQIYNGIFIG---TLWNKLIKTSLY-------N 168 Query: 180 NIKFIAGLHH-QDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSR 224 N+ F+ ++ +D+ T+ + N + Y ++LY Y +S++R Sbjct: 169 NVIFLKQCNYCEDVYILTQILSNNISINYINEALYHYCYDASSLTR 214 >UniRef50_Q1Z875 Glycosyltransferase n=1 Tax=Photobacterium profundum 3TCK RepID=Q1Z875_PHOPR Length = 334 Score = 107 bits (268), Expect = 5e-22, Method: Compositional matrix adjust. Identities = 71/229 (31%), Positives = 118/229 (51%), Gaps = 14/229 (6%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 MN K+SVIIP+YN+ + C+ SL+ QT +EII IND S D S+ I Y +N+P Sbjct: 1 MNKNIKISVIIPVYNSVEYLDRCLNSLLNQTLKDIEIICINDASKDESLTILNKYKDNHP 60 Query: 62 HV-RLLH-QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQ 119 V +++ + N ARN G+++A+G+YVAFVD+DD V M+E L + A ++D D+ Sbjct: 61 EVIKVIDLKENKKQGGARNAGLDIASGEYVAFVDSDDWVELHMFEHLYSKARQNDFDIVD 120 Query: 120 CNADWCFRETG-ETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVK 178 CN T E SI +D+ ++ P + + V +Y+ K Sbjct: 121 CNYTIHNGYTAIEDRNSIISDK---DILINSPKDILLNFGG------VCTKIYKIKFFDK 171 Query: 179 N--NIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRL 225 + ++F G+ ++D + ++LYKYY ++ S +++ Sbjct: 172 SRYGLRFPEGMFYEDNYLMPFIAIKVKKIGKVNEALYKYYYNDLSTTKV 220 >UniRef50_UPI00019688E7 hypothetical protein BACCELL_03020 n=1 Tax=Bacteroides cellulosilyticus DSM 14838 RepID=UPI00019688E7 Length = 335 Score = 107 bits (268), Expect = 5e-22, Method: Compositional matrix adjust. Identities = 74/248 (29%), Positives = 119/248 (47%), Gaps = 19/248 (7%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSV----EIAKYYAENYPH 62 K+SVIIP+YN C SL+ Q+ +E I ++D S DNS+ I K Y E P Sbjct: 10 KVSVIIPVYNVEKYIERCARSLLEQSLKDIEYIFVDDCSPDNSITMLENILKEYPERQPL 69 Query: 63 VRLL-HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 V++L H+ N G + R G++ A G+Y+ D+DD V P MYET+ +A++ + D+ CN Sbjct: 70 VKILFHEPNRGLAYTRQEGVDAAKGEYIIHCDSDDWVEPEMYETMYNIAIKQNADIVCCN 129 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNI 181 + E W R+ + + P+ L + + VW + RR + + NNI Sbjct: 130 ----YFEEYNHWHK----RVEYSYLYETPEILLASIPTTL-NSAVWNKLIRRTLYIANNI 180 Query: 182 KFIAGLHHQ-DIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYI 240 ++ G++ Q D+ T +L+ + Y Y N + + LNY I Sbjct: 181 RWFEGINMQEDLGITLRLRVLSLKTIVVPYAFYHYNRQNLFSMAMKPK----LNYIEEQI 236 Query: 241 KITRLLEK 248 +LLE+ Sbjct: 237 LCAKLLEQ 244 >UniRef50_A8RE09 Putative uncharacterized protein n=1 Tax=Eubacterium dolichum DSM 3991 RepID=A8RE09_9FIRM Length = 330 Score = 107 bits (268), Expect = 5e-22, Method: Compositional matrix adjust. Identities = 65/219 (29%), Positives = 115/219 (52%), Gaps = 6/219 (2%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S+I+P+YNA +E+L QT T +EI+++NDGSTD+S+++ + +AE P ++++ Sbjct: 9 VSIIVPIYNAEKYLSRTVETLRRQTLTNIEILLVNDGSTDDSLKLCREFAEIDPRIQVID 68 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 + N G S ARN G+++A G+YV FVDADD V E L+ E+ D+ C FR Sbjct: 69 KQNGGVSSARNIGLKLAKGQYVLFVDADDYVEANYAEILLKTLQENKSDIEMCGY---FR 125 Query: 128 ETGETWQSIPTDRLRSTGVLTG--PDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIA 185 E ++ Q I L P + + + ++ ++ I +NN++F Sbjct: 126 EDKKSEQIITLMEKDHVFDLNKELPSIHKAFVKGNGGIYSIFNKLFSLKKIKENNLQFSE 185 Query: 186 GLHH-QDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVS 223 L H +D ++ +++ Y LY Y ++ S++ Sbjct: 186 KLSHAEDFLFCIQYVNTCSSFGYINIPLYHYVNNDNSLT 224 >UniRef50_Q8ABQ9 Glycoside transferase family 2 n=1 Tax=Bacteroides thetaiotaomicron RepID=Q8ABQ9_BACTN Length = 326 Score = 107 bits (268), Expect = 5e-22, Method: Compositional matrix adjust. Identities = 71/228 (31%), Positives = 119/228 (52%), Gaps = 22/228 (9%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPH----V 63 +S+I PLY D C +SL Q++T +E I ++D S+D SVE+ AE YP + Sbjct: 3 VSIITPLYKVEDFIARCADSLFRQSYTEIEYIFVDDCSSDRSVEVLLQVAERYPQLQQQI 62 Query: 64 RLLH-QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNA 122 R+LH ++N G + AR G+ ATG+YV +VDADD + P E ++ + + D+ C Sbjct: 63 RILHHESNRGVAAARETGLAAATGEYVYWVDADDWIEPDAIEKMVVRSEQGQKDIIACGW 122 Query: 123 DWCFRETGETWQSIPTDRLRST---GVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKN 179 CFR+ E +P T G+L+G + RW +W+ + +RD+ + N Sbjct: 123 YLCFRQ-NERRMPMPCYADAETALRGMLSG---------TMRWN--LWLYMIKRDLYLMN 170 Query: 180 NIKFIAGLH-HQDIVWTTEFMFNALRARYTEQSLYKYYLHN-TSVSRL 225 +I F+ G + +D++ + +A + + + Y Y N S++RL Sbjct: 171 DIHFMEGENVGEDMLVLIKLFSHAKSIGFVKDAFYHYVKQNENSLTRL 218 >UniRef50_Q2NH61 Predicted glycosyltransferase n=1 Tax=Methanosphaera stadtmanae DSM 3091 RepID=Q2NH61_METST Length = 358 Score = 107 bits (267), Expect = 6e-22, Method: Compositional matrix adjust. Identities = 67/212 (31%), Positives = 110/212 (51%), Gaps = 8/212 (3%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K+SV+IP+YN R C+++++ QT +EII +NDGSTD S++I YA +R++ Sbjct: 3 KVSVVIPVYNVEKYLRDCLDTIVNQTLEDIEIICVNDGSTDKSLDILNEYASKDERMRVI 62 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 Q N G +VA N G+ ATG+Y+ +D+DD + E ++A E D+D A Sbjct: 63 SQENGGHAVATNVGMTYATGEYLFLMDSDDILKLNALEDTYSIAKEKDVDFVLFKAINYD 122 Query: 127 RETGETWQSI-----PTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNI 181 E E +++ R+ + + D +G + T W +Y R I +NNI Sbjct: 123 DEKDEYYEAENYSMNKIARIVKDNIFSYED---LGDAIFDITVTPWSKLYNRSFIEENNI 179 Query: 182 KFIAGLHHQDIVWTTEFMFNALRARYTEQSLY 213 F GL D V+ + +F+A R + ++ L+ Sbjct: 180 TFPEGLVFDDNVFFWKVLFSAKRIYFYKEFLF 211 >UniRef50_C2FKV3 Glycosyltransferase n=1 Tax=Lactobacillus plantarum subsp. plantarum ATCC 14917 RepID=C2FKV3_LACPL Length = 304 Score = 107 bits (267), Expect = 6e-22, Method: Compositional matrix adjust. Identities = 71/242 (29%), Positives = 119/242 (49%), Gaps = 25/242 (10%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M++T K+++I+P+YN R +ES+I Q +T E I+++DGSTD S ++AK Y ++ Sbjct: 1 MDNTLKVTIIVPIYNVEGYIRQSIESIIKQKYTNFEAILVDDGSTDMSYQLAKRYTQDDD 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 L+ AN G S ARN G++ ATG +AF+D DD +Y ++ + +DD D+ Sbjct: 61 RFILIQTANFGQSHARNVGLKKATGDLIAFLDPDD-IYDPLFLYDAVLRFDDDTDLIS-- 117 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNI 181 + F P + R +G D + R VVW V+RR + + Sbjct: 118 --YLF----------PKQKKRLSGGDVSIDSFISEMFLGRIGTVVWNKVFRRKTLC--GV 163 Query: 182 KFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIK 241 F G H++I++ N ++ + ++ Y Y R +R GN N+ + IK Sbjct: 164 FFPEGEVHEEILFFMHAFQNIIKCKIIDEDRYFY--------RENRDGNTNMTFDEGRIK 215 Query: 242 IT 243 + Sbjct: 216 LV 217 >UniRef50_Q8XN54 Capsular polysaccharide biosynthsis protein n=1 Tax=Clostridium perfringens RepID=Q8XN54_CLOPE Length = 330 Score = 107 bits (266), Expect = 7e-22, Method: Compositional matrix adjust. Identities = 63/220 (28%), Positives = 101/220 (45%), Gaps = 15/220 (6%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 ++S+I+P+YN C++S++ Q++T E+I++NDGS D S I Y +R++ Sbjct: 4 EISIIVPIYNVEKYLSKCIDSILNQSFTNFELILVNDGSNDRSGIICDEYTLKDERIRVI 63 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN----A 122 H+ N G S ARN GI +A GKY+ FVD DD + MYE L + + D++ C Sbjct: 64 HKKNGGVSSARNSGISLALGKYIGFVDPDDYIKEEMYEILYNLCKKYKTDISICKLGREI 123 Query: 123 DWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIK 182 + E + I D + L R L ++ + + GV Sbjct: 124 NGTLINNEENGEIIELDNVNGLKELFKGILYRFSLCNKLFKRECFRGVI----------- 172 Query: 183 FIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSV 222 F G H+D+ T NA ++ Y + Y Y S+ Sbjct: 173 FPEGRIHEDLSTTYRLFANAKKSIYFDYVGYIYVKRENSI 212 >UniRef50_B2ILQ1 Ss-1,4-galactosyltransferase n=6 Tax=Streptococcus pneumoniae RepID=B2ILQ1_STRPS Length = 333 Score = 107 bits (266), Expect = 7e-22, Method: Compositional matrix adjust. Identities = 66/217 (30%), Positives = 113/217 (52%), Gaps = 8/217 (3%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S+++P+YN + +ES++ QT+ LE+++++DGSTD+S EI + + +R+ H Sbjct: 20 VSIVVPVYNVEKYLKKSIESILNQTYDNLEVLLVDDGSTDSSGEICDSFIKVDSRIRVFH 79 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 + N G S ARN GIE G+YV+F+D DD + L ++ +V+ C+ F Sbjct: 80 KENGGLSDARNFGIEHMKGQYVSFIDGDDYISKDYVWKLYHSLKNNNSEVSICS----FS 135 Query: 128 ETGETWQSIPTDRLRSTGV-LTGPDWLRMGLSSRRWTHVV-WMGVYRRDVIVKNNIKFIA 185 ET + I + L S V L+G L L++ + +VV W +YR + K +KF Sbjct: 136 LVDETGEKIKDELLDSGEVSLSGQQILEKALTADGYRYVVAWNKLYRSTLFEK--LKFKK 193 Query: 186 GLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSV 222 G+ ++D ++ R E+ LY Y S+ Sbjct: 194 GMLYEDEFLNYPLFWDCKRVSIVEEPLYLYVQRKGSI 230 >UniRef50_C6JQJ1 Glycosyl transferase n=1 Tax=Fusobacterium varium ATCC 27725 RepID=C6JQJ1_FUSVA Length = 337 Score = 107 bits (266), Expect = 8e-22, Method: Compositional matrix adjust. Identities = 68/219 (31%), Positives = 117/219 (53%), Gaps = 7/219 (3%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K+S+IIP+YN + C++S+I Q+ +EII INDGSTD+S+EI + Y + + L+ Sbjct: 5 KISIIIPIYNVEKYLKQCLDSIINQSLKEIEIICINDGSTDSSLEIIQEYMKKDSRIILI 64 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 ++ N G S ARN GI+ +TGKY+ VD+DD V L A ++ LD+ N + Sbjct: 65 NKKNEGPSAARNDGIKKSTGKYILQVDSDDWVEEESLSKLYGYAEKNSLDIVSFNY---Y 121 Query: 127 RETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN-IKFIA 185 RE + I + T ++ + L+ R V + + I KNN I + Sbjct: 122 REFFKESIKIQVLFEINKNFFTSIEYFNLFLTEREIPSVCNKLI--KSTIYKNNHILYPV 179 Query: 186 GLH-HQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVS 223 G++ +D+ T+ ++ + + Y ++ LY Y ++ S++ Sbjct: 180 GINLGEDLATITKLIYFSQKIGYLDEYLYHYRYNDKSIT 218 >UniRef50_Q1WU29 Glycosyltransferase n=3 Tax=Lactobacillus RepID=Q1WU29_LACS1 Length = 315 Score = 107 bits (266), Expect = 8e-22, Method: Compositional matrix adjust. Identities = 49/120 (40%), Positives = 79/120 (65%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K+S+I+P+YN + ++SL+ QT +EII+++DGSTDNS +IA+ Y ENY +V L+ Sbjct: 3 KISIIVPIYNVEKYLKRSLDSLVNQTLEDIEIILVDDGSTDNSHKIAEDYKENYSNVLLV 62 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 + N G S ARN G++ A+G+Y+AF+D+DD V M+E + ++ + + N W F Sbjct: 63 TKENGGLSDARNFGLQYASGEYIAFIDSDDYVESQMFERMYNLSENGRKKIIESNFLWEF 122 >UniRef50_C3QVK6 Glycosyl transferase n=4 Tax=Bacteroides RepID=C3QVK6_9BACE Length = 324 Score = 107 bits (266), Expect = 8e-22, Method: Compositional matrix adjust. Identities = 69/217 (31%), Positives = 118/217 (54%), Gaps = 10/217 (4%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +SVIIP+YNA + ++S+I QT+T LEI+++NDGSTD+S++I K YA P + + + Sbjct: 8 VSVIIPVYNAHNSITKTLQSVINQTYTNLEIVVVNDGSTDDSLDIIKTYAAEDPRIVVFN 67 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 + N G AR GI++ATGKY+ ++D+DD ++ L+ A E D+ +C+ Sbjct: 68 KQNEGLVQARKSGIDIATGKYIQYLDSDDIMHEDAITRLVAKAEETQADMVVAPFMFCY- 126 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLR-MGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAG 186 GE +S D + L+G ++L+ + L W VW + R + +N I+ Sbjct: 127 -DGEFHKSTFFDFVE----LSGVEFLKNILLKKAYWC--VWSKFHLRS-LYQNEIERPDI 178 Query: 187 LHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVS 223 +D+V +T+ + + + + Y NTS+S Sbjct: 179 SFGEDVVLSTQLLLYLQKVVSVDYIIIDYNFTNTSMS 215 >UniRef50_A7V9T0 Putative uncharacterized protein n=1 Tax=Bacteroides uniformis ATCC 8492 RepID=A7V9T0_BACUN Length = 337 Score = 107 bits (266), Expect = 8e-22, Method: Compositional matrix adjust. Identities = 75/241 (31%), Positives = 123/241 (51%), Gaps = 24/241 (9%) Query: 6 NKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRL 65 N +S+IIP+YNA +++L+ Q E++++NDGSTDNS+ I YA+ Y +++ Sbjct: 5 NLISIIIPVYNASAFISNTIDNLLAQEVEK-ELLLVNDGSTDNSLNILNDYAQKYSCIKV 63 Query: 66 LHQANAGASVARNRGIEVATGKYVAFVDADDEVYP-TMYETLMTMALEDDLDVAQCNADW 124 L+Q N G S ARN IEVA G+++ FVD+DD + P T+ + L +D+D Sbjct: 64 LNQTNQGVSSARNHAIEVAQGEFLIFVDSDDLLEPGTLQKCLNIYHANNDIDCVVFTYKC 123 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVV-WMG--VYRRDVIVKNNI 181 C T + +G+ T WL M S T ++ +G +YR ++ NI Sbjct: 124 C----KPNLFVFDTKLYKKSGLYTIEQWL-MPFSVLHKTSIINCIGTKLYRTSILKDKNI 178 Query: 182 KFIAGLHH-QDIVWTTEFMFNALRARYTEQSLYKY-----------YLHNTSVSR--LHR 227 +F + + +D+ +TT ++ + Y + LY Y Y+HN +S+ LH Sbjct: 179 RFNEDISYCEDVGFTTIYLKYVRKLFYINEPLYCYMFVNPNSLMTKYMHNYPMSKKYLHM 238 Query: 228 Q 228 Q Sbjct: 239 Q 239 >UniRef50_C5VMR5 Glycosyl transferase, family 2 n=1 Tax=Prevotella melaninogenica ATCC 25845 RepID=C5VMR5_9BACT Length = 329 Score = 107 bits (266), Expect = 8e-22, Method: Compositional matrix adjust. Identities = 75/252 (29%), Positives = 115/252 (45%), Gaps = 7/252 (2%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTW--TALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 KLS+IIP+YNA C++S++ EI+++NDGS D S EIA+ YA Y +V Sbjct: 9 KLSIIIPMYNAEKYIGICLDSILGSNLLKEEYEIVLVNDGSQDKSPEIAQNYAAKYSNVT 68 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 L Q N G S ARN GI+ G+YV VDADD V + L + +D+ Sbjct: 69 YLTQENQGQSTARNYGIKTCKGEYVWCVDADDAVEKDIIGILYLLKENPSIDILGVQLKD 128 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 E S L ++ G D + +G ++ + I+ N + F+ Sbjct: 129 VDEEYKFICYSCIQPSLPHNKIILGRDAILLGYKPSSVCALIT----SKKFIIDNKLFFV 184 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 G+ HQD+ + + +A + +++ + Y Y H SV R K L Y + I I Sbjct: 185 VGITHQDVELSYRLVSHASKVLFSDIAPYIYIQHPLSV-RHTPNPEKRLKYLKDDIVILD 243 Query: 245 LLEKLNRNYADK 256 + L+ Y K Sbjct: 244 SFKTLSEEYNGK 255 >UniRef50_Q93DZ8 EpsG n=2 Tax=Lactobacillales RepID=Q93DZ8_STRTR Length = 322 Score = 107 bits (266), Expect = 9e-22, Method: Compositional matrix adjust. Identities = 62/208 (29%), Positives = 110/208 (52%), Gaps = 8/208 (3%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S++IP+YN D + C++S+++QT++ LEII+++DGSTD S I YY+EN ++++H Sbjct: 3 ISIVIPVYNVQDYIKKCLDSILSQTFSDLEIILVDDGSTDLSGRICDYYSENDKRIKVIH 62 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 AN G S ARN GI+ AT +++ F+D+DD V E L + + D++ + + Sbjct: 63 TANGGQSEARNVGIKNATSEWITFIDSDDYVSSDYIEYLYNLIQVHNADISIASFTYI-- 120 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 T + I ++ +R L + + VW +YR + N KF++G Sbjct: 121 ----TPKKIIKHGNGEVALMDAKTAIRRMLLNEGFDMGVWGKMYRTEYF--NKYKFVSGK 174 Query: 188 HHQDIVWTTEFMFNALRARYTEQSLYKY 215 +D + T + A + + +Y Y Sbjct: 175 LFEDSLITYQIFSEASTIVFGAKDIYFY 202 >UniRef50_C7UWL4 Glycosyl transferase n=6 Tax=Enterococcus RepID=C7UWL4_ENTFA Length = 326 Score = 107 bits (266), Expect = 9e-22, Method: Compositional matrix adjust. Identities = 63/219 (28%), Positives = 114/219 (52%), Gaps = 7/219 (3%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K+S+I+P+YN C+ S++ QT+T E+I+++DGS D+S + +AE V+++ Sbjct: 3 KISIIVPVYNVEKYLEKCVRSILAQTFTDFELILVDDGSPDSSGAMCDQFAEQDQRVKVI 62 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 H+ N G S ARN GIE+ATG+Y+ FVD+DD + MYETL + + ++A Sbjct: 63 HKENGGLSDARNAGIELATGEYLGFVDSDDYIADDMYETLYNQIVTYEAELATVG----M 118 Query: 127 RETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAG 186 + E +S TD+ + +L+ + ++ L S +Y++++ N +++ G Sbjct: 119 IDVYENRESRLTDK-KEIKILSQNEAIQAVLDSTDVYAYAVNKLYKKELF--NKVRYPKG 175 Query: 187 LHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRL 225 +D E + + + Y YY S++ L Sbjct: 176 KIVEDAFIIIELLLQCQKIVTNTEQKYYYYRRADSITGL 214 >UniRef50_C6X0E9 Beta-1,3-glucosyltransferase n=1 Tax=Flavobacteriaceae bacterium 3519-10 RepID=C6X0E9_FLAB3 Length = 339 Score = 107 bits (266), Expect = 9e-22, Method: Compositional matrix adjust. Identities = 67/204 (32%), Positives = 112/204 (54%), Gaps = 17/204 (8%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K+SVI+P+YN D + C+ S+I QT+ LEII+++DGSTDNS +I K A + + Sbjct: 2 KISVIVPVYNVQDYVKQCINSIIAQTYHDLEIILVDDGSTDNSFQICKDAAGKDSRIITV 61 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADD----EVYPTMYETLMTMALEDDL--DVAQC 120 Q NAG S ARN GI+++TG Y+ FVD+DD ++Y +M E L D + D A Sbjct: 62 SQKNAGVSAARNNGIKMSTGDYITFVDSDDWLHHDLYLSMTEILNRYPRTDVVMCDFATV 121 Query: 121 NADWCFRETGETWQSIPTDRLRSTGV----LTGPDWLRMGLSSRRWTHVVWMGVYRRDVI 176 + + T + + + + +G+ L D+ R+ + S +++R + Sbjct: 122 KNNTAQQITSQLREGLYIKKDIISGIYPTLLVTEDFGRLPIVS------ACTCIFKRSFM 175 Query: 177 VKNNIKFIAGL-HHQDIVWTTEFM 199 +KN+I+F GL + +D ++ + M Sbjct: 176 LKNSIQFDEGLRYSEDYLFMADVM 199 >UniRef50_A6L7A5 Glycosyltransferase family 2 n=6 Tax=Bacteroides RepID=A6L7A5_BACV8 Length = 315 Score = 106 bits (265), Expect = 9e-22, Method: Compositional matrix adjust. Identities = 65/224 (29%), Positives = 120/224 (53%), Gaps = 7/224 (3%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +SVIIP+YN C+ S++ QT T +E++++NDGSTD S +I YA ++++H Sbjct: 4 VSVIIPIYNTEKFLPLCINSVLNQTLTDIEVLLVNDGSTDGSGKICDEYACKDQRIQVIH 63 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 N G S ARN+G+E A G+Y+AF+D+DD + M TL + ++ +A C + Sbjct: 64 TLNQGVSHARNQGLETAKGEYIAFMDSDDWIETDMIATLYQLIRTNEAGLATCG----YI 119 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHV--VWMGVYRRDVIVKNNIKFIA 185 E + I +G LT + + + R + + +W +Y +++I K+++KF Sbjct: 120 IENEDGRPIYHINEVKSGKLTQWEAIHSLFNDRHYKYKGNLWDKLYHKEIIDKHHLKFNE 179 Query: 186 GLHH-QDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQ 228 +++ +D ++ +++ + A YT Y Y +S L ++ Sbjct: 180 HIYYNEDRLFIFQYLSHCQSAAYTTSPYYHYVTRKSSAMNLSQK 223 >UniRef50_C6IMR6 Putative uncharacterized protein n=1 Tax=Bacteroides sp. 1_1_6 RepID=C6IMR6_9BACE Length = 341 Score = 106 bits (265), Expect = 1e-21, Method: Compositional matrix adjust. Identities = 64/219 (29%), Positives = 106/219 (48%), Gaps = 9/219 (4%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAE--NYPHVR 64 K+S++IP+YN C+ES+ Q + LE+I+++D TDNS+EI + Y E N+ V+ Sbjct: 4 KVSIVIPIYNVSVYIENCLESVRRQIYQDLEVILVDDCGTDNSMEIVQEYLEYHNFVEVK 63 Query: 65 LLHQA-NAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 ++H N G S ARN G+E ATG YV F+D+DD + L+ D N + Sbjct: 64 IIHHTHNRGLSAARNTGLEAATGDYVYFLDSDDALMEDCIFILVAPVEAQSYDFVIGNYE 123 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 ++P+ LRS + + RW + W + R ++ N + F Sbjct: 124 VKGSNKEYPALTLPSGALRSNKEIL------HSYAEGRWYMMAWNKLCNRKFLLDNKLFF 177 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSV 222 GL H+D++W+ + A + Y+Y + S+ Sbjct: 178 EEGLLHEDVIWSFKLACKARSMYVIYEPTYRYTIRAASI 216 >UniRef50_Q9F0B8 Beta(1,4)galactosyltransferase EpsJ n=1 Tax=Lactobacillus delbrueckii subsp. bulgaricus RepID=Q9F0B8_LACDE Length = 327 Score = 106 bits (265), Expect = 1e-21, Method: Compositional matrix adjust. Identities = 64/226 (28%), Positives = 115/226 (50%), Gaps = 22/226 (9%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 KLS+I+P+Y + + C+ S++ QT+ E+I+++DGS D+ +I Y++ + +V+++ Sbjct: 2 KLSIIVPVYKVEEYLKDCVNSILNQTFHDFELILVDDGSPDSCPKICDEYSQKFNNVKVV 61 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYE---TLMTMALEDDLDVAQC--- 120 H+ N G S ARN G++VATG+Y++F+D+DD + MYE ++M D + V +C Sbjct: 62 HKQNGGLSSARNAGMKVATGEYISFIDSDDYLASNMYEHVFSIMKKECADIVVVGRCYVY 121 Query: 121 -NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSS--RRWTHVVWMGVYRRDVIV 177 N RE ++ V+ GP + +S + W VY+R + Sbjct: 122 PNGSKKLREKQNVYE-----------VMDGPKATAIMNTSLLGYFDAAAWDKVYKRSLF- 169 Query: 178 KNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVS 223 N++ + G +D T + A R Y +Y Y S++ Sbjct: 170 -NDVSYPEGKLSEDWYTTYKVFAKANRIVYDSTPMYYYRQRGGSIT 214 >UniRef50_B0P2L5 Putative uncharacterized protein n=1 Tax=Clostridium sp. SS2/1 RepID=B0P2L5_9CLOT Length = 323 Score = 106 bits (265), Expect = 1e-21, Method: Compositional matrix adjust. Identities = 59/215 (27%), Positives = 111/215 (51%), Gaps = 7/215 (3%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +SVI+P+Y D C++SL+ QT+ LEII+++DGS D+ I + + ++++H Sbjct: 5 ISVIVPIYKVEDYLDRCIQSLVNQTYKNLEIILVDDGSPDDCPRICDEWGKKDQRIKVIH 64 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 + N G S ARN G++V TG YV+++D+DD + MY+ +M + D+ +C F Sbjct: 65 KENGGLSDARNAGMKVMTGAYVSYIDSDDWIEENMYQMMMNAIEVNKADICEC----AFE 120 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 +T + + ++ L M + + VVW +Y+R+V+ ++ F G Sbjct: 121 KTSGSVKKEDQKNDDKVSIMDKRSAL-MAVVEEKIQPVVWNKLYKREVV--KDLWFEVGK 177 Query: 188 HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSV 222 +++D +T + + A + Y Y+ S+ Sbjct: 178 YNEDEFFTYKAIERADKIIQISDIGYYYFFREDSI 212 >UniRef50_C0B702 Putative uncharacterized protein n=1 Tax=Coprococcus comes ATCC 27758 RepID=C0B702_9FIRM Length = 283 Score = 106 bits (265), Expect = 1e-21, Method: Compositional matrix adjust. Identities = 69/254 (27%), Positives = 130/254 (51%), Gaps = 17/254 (6%) Query: 1 MMNSTN---KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYA 57 MM T K SV++P+YN C++S++ QT+ LEII+++DGS DN ++ + Sbjct: 1 MMKQTEEKIKFSVVLPIYNVEKYLNRCLDSVMNQTYKKLEIILVDDGSPDNCPQMCDNWE 60 Query: 58 ENYPHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDV 117 + ++++H+ NAG ARN G++VATG Y+AF D+DD + ++E L T+ + D+ D+ Sbjct: 61 KVDDRIKVVHKKNAGLGEARNSGLDVATGDYIAFFDSDDYIDTRLFEELYTVIISDNPDL 120 Query: 118 AQCNADWCFRETGETWQSIPT---DRLRSTGVLTG--PDWL----RMGLSSRRWTHVVWM 168 + R+ T IP ++ VL+ P+ + + G +S W Sbjct: 121 IEFGHHDVDRQGNITKTFIPKTPLEKYEGEEVLSKFLPELICTDPKTGTASDLLMS-AWS 179 Query: 169 GVYRRDVIVKNNIKFIAGLHH--QDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLH 226 +YRR ++ + N F++ + +D+ + M N ++ Y Y ++ S++ ++ Sbjct: 180 CLYRRQLLAECNFHFVSERKYISEDVYSLMKLMPNVHSVNVVHKAYYYYCENDQSLTHVY 239 Query: 227 RQG--NKNLNYQRH 238 + K + +Q H Sbjct: 240 KPDRFEKLVAFQLH 253 >UniRef50_C3WCW1 Sugar transferase n=1 Tax=Fusobacterium mortiferum ATCC 9817 RepID=C3WCW1_FUSMR Length = 339 Score = 106 bits (265), Expect = 1e-21, Method: Compositional matrix adjust. Identities = 70/262 (26%), Positives = 136/262 (51%), Gaps = 8/262 (3%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K+S++IP+YN C+ S+I Q +EII INDGSTDNS+EI + + ++++ Sbjct: 3 KISILIPVYNTSKYLERCLNSIINQNLKEIEIICINDGSTDNSLEILNSFVQKDSRIKII 62 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 ++ N G + ARN GI+ ATG+Y +D+DD + E + A +++LD+ +D F Sbjct: 63 NKNNGGLTTARNVGIKNATGEYCLNIDSDDWIEQGYLEEIYKKAKKENLDMVI--SDIIF 120 Query: 127 RETGETWQSIPTD-RLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIA 185 + + I D + +LTG ++L++ LS + + W + ++++ K I++ Sbjct: 121 DYEYKKEEIIKKDLNIEENKILTGKEYLKIFLSKNFYGY-TWNKLIKKEIYEKFGIEYDE 179 Query: 186 GLH-HQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 + ++D+ + + R ++ Y Y + SR+ + N L+ +R + +I + Sbjct: 180 EIFLYEDVEVIAKLALHMKRIGKINKAYYHYIQGENNGSRVIKFKN-YLDMERCFNEIKK 238 Query: 245 LLEKLNRNYADKIMIYPEFHQQ 266 L N ++M + EF ++ Sbjct: 239 YY--LKSNIEKEVMNFLEFREE 258 >UniRef50_A2STD8 Glycosyl transferase, family 2 n=1 Tax=Methanocorpusculum labreanum Z RepID=A2STD8_METLZ Length = 346 Score = 106 bits (264), Expect = 1e-21, Method: Compositional matrix adjust. Identities = 74/232 (31%), Positives = 118/232 (50%), Gaps = 15/232 (6%) Query: 2 MNSTNKL-SVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 MNS L SVI+P+YN +TC++S++ QT+ LEII++ S DNSV I Y + Sbjct: 1 MNSEVPLISVIVPVYNVEPYLKTCIDSILKQTYRNLEIILVASTSIDNSVVICDSYLQVD 60 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 +++LH G S ARN G+ +A+G Y+ FVD+DD + ETL ++ + D D+ QC Sbjct: 61 SRLQVLHTEPQGLSAARNIGVSMASGTYLCFVDSDDFISQNFVETLYGLSQKYDADIVQC 120 Query: 121 N----ADWCFRETGETWQSIPTDRLR----STGVLTGPDWLRMGLSSRRWTHVVWMGVYR 172 N ++ E+ SI D L S + + + SS W +YR Sbjct: 121 NFLEISENATYSICESCTSIFNDALEIHIYSNVDMCANLYNDLADSST----ASWTKLYR 176 Query: 173 RDVIVKNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSR 224 +++ + N+K+ G H+D + + + A R T LY Y + S++R Sbjct: 177 KELFM--NVKYPIGKIHEDEGTSYKLFYLAKRVAVTSLKLYYYRSNPLSLTR 226 >UniRef50_B0ZTM8 Putative uncharacterized protein n=1 Tax=Campylobacter jejuni RepID=B0ZTM8_CAMJE Length = 351 Score = 106 bits (264), Expect = 1e-21, Method: Compositional matrix adjust. Identities = 68/234 (29%), Positives = 115/234 (49%), Gaps = 27/234 (11%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +SVIIP+YN R C++S++ QT +EII+INDGSTDN +I YA + ++H Sbjct: 8 VSVIIPIYNVEKYLRECLDSVVNQTLREIEIILINDGSTDNCGKICDEYATKDKRIIVIH 67 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 + NAG A N G+++A G+Y+ FV++DD + MYE L A +D D+ +C F Sbjct: 68 KENAGLGAAYNSGLDIAKGEYIGFVESDDWIEFNMYEELYFKAKNNDSDLVKC----MFY 123 Query: 128 ETGETWQSIPTDRL--------------RSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRR 173 + T + P D+L + D+ ++ + + +W +Y + Sbjct: 124 DYNSTRE--PKDKLYMQKHEDFELFNCAPDNKIFKIEDYPKLLI----YHSSIWASLYHK 177 Query: 174 DVIVKNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYY---LHNTSVSR 224 +I + + G +QD + E + A + ++ Y Y L+ +SV R Sbjct: 178 TLINRLRFQESKGATYQDFPFAIEALVLANKITTLRKAFYHYRQEPLNASSVKR 231 >UniRef50_Q5L9S8 Putative LPS biosynthesis related glycosyltransferase n=1 Tax=Bacteroides fragilis NCTC 9343 RepID=Q5L9S8_BACFN Length = 328 Score = 106 bits (264), Expect = 1e-21, Method: Compositional matrix adjust. Identities = 72/234 (30%), Positives = 119/234 (50%), Gaps = 18/234 (7%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP---HVR 64 +S+IIP+YN D C+ S+ TQT++ +E+I++ND + DNS+ IA E +V+ Sbjct: 2 VSIIIPIYNVQDYINDCLYSVYTQTYSEIEVILVNDCTPDNSMLIAHSNIEKLKKRFNVK 61 Query: 65 LL-HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMA---LEDDLDVAQC 120 ++ H++N G S ARN G++ ATG++V F+D+DDE+ P E + +A L D V Sbjct: 62 IIDHKSNKGLSAARNSGVKEATGEWVYFLDSDDEITPDCIECFVRLAERYLSVDFIVGG- 120 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN 180 R G + RL ++ + G D + S W + W + R++ V NN Sbjct: 121 -----IRVIGRNI----SYRLTTSEYVIGNDNILKCYISGMWYEMAWNKLIRKEYFVSNN 171 Query: 181 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLN 234 + F GL H+D +++ + AL + Y Y + S + G KN + Sbjct: 172 LWFKEGLLHEDQLFSYQLATTALTMATVNKPTYIYKMR-MSGAITSSNGRKNFD 224 >UniRef50_C0BRQ0 Putative uncharacterized protein n=2 Tax=Bifidobacterium RepID=C0BRQ0_9BIFI Length = 331 Score = 106 bits (264), Expect = 1e-21, Method: Compositional matrix adjust. Identities = 69/225 (30%), Positives = 117/225 (52%), Gaps = 8/225 (3%) Query: 1 MMNSTNKL-SVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAEN 59 M ST L SVI+P+YN C+ESLI QT+ LEII+++DGSTD+S E +A Sbjct: 1 MAVSTQPLISVIVPVYNVERYLDQCVESLIGQTYERLEIILVDDGSTDSSCEQCNAWANR 60 Query: 60 YPHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQ 119 +R + Q NAG + ARN G+++A G+Y+ FVD+DD V P M+ TL+ E D D++ Sbjct: 61 DNRIRAVRQCNAGLAAARNTGLDLAKGEYIGFVDSDDYVLPDMFGTLLHNLQESDADLSI 120 Query: 120 CNADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKN 179 + + + + P ++ V+T + + W + ++++ + Sbjct: 121 ISYERENPDGSTYCNAFPDKKI----VMTSQEAFAYVNQHGYFYVTAWDKLAKKELF--D 174 Query: 180 NIKF-IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVS 223 N+++ + ++ +D T + + A R Y LY+Y + S S Sbjct: 175 NLRYPLDAVYAEDSPVTYQLLDKADRIVYDSTPLYRYRMSENSQS 219 >UniRef50_B5JRG3 Glycosyl transferase, group 2 family protein n=1 Tax=Verrucomicrobiae bacterium DG1235 RepID=B5JRG3_9BACT Length = 339 Score = 106 bits (264), Expect = 1e-21, Method: Compositional matrix adjust. Identities = 72/254 (28%), Positives = 113/254 (44%), Gaps = 7/254 (2%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 ++SVI+P++N+ +++L QT ++EII+IND S DN+ EI AE ++++ Sbjct: 6 EVSVIVPVFNSARTIEASLDTLFGQTLESIEIIVINDASNDNTAEILDDIAEKQTKLKVI 65 Query: 67 HQA-NAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWC 125 H N G AR G+ A+ ++ F+DADD TM+ TL DDLD+A C + Sbjct: 66 HMPKNRGVHEARATGLRAASAPWIGFLDADDFAEKTMFATLHEACKNDDLDIAICGTNL- 124 Query: 126 FRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIA 185 Q + L L S T +W +YRR+VI+ K Sbjct: 125 ---VTSNRQFLRPKVLFEKDELITDSCFERYCSFELGTGALWNKLYRREVIIPAATKSFR 181 Query: 186 GLH--HQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKIT 243 ++D + E A R R +Q+L+ Y +H TS ++ R Y Sbjct: 182 WRQDTNEDNLVNLECFLQAKRVRLIQQTLHSYVIHETSATQSTDSAKSYCQILRAYAIAL 241 Query: 244 RLLEKLNRNYADKI 257 ++ R DKI Sbjct: 242 ENSQRHGRQVLDKI 255 >UniRef50_UPI0001BC33C7 N-acetylglucosaminyltransferase n=1 Tax=Butyrivibrio crossotus DSM 2876 RepID=UPI0001BC33C7 Length = 340 Score = 106 bits (264), Expect = 2e-21, Method: Compositional matrix adjust. Identities = 69/215 (32%), Positives = 111/215 (51%), Gaps = 6/215 (2%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S+I+P+YNA + C++SL+ Q+ LEII+++DGSTDNS +I A ++++H Sbjct: 6 ISIIVPVYNAENTIGICVDSLVRQSHNLLEIILVDDGSTDNSGKICDELAIKDNRIKVIH 65 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 + N G SVARN GI A G + FVDADD + ++ L +D D+ C+ + Sbjct: 66 KDNEGVSVARNTGINKAAGYAITFVDADDTLDINAVSLMLNKMLNNDCDIVMCDYSIIYS 125 Query: 128 ETGETWQSIPTD-RLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAG 186 + E D L GV ++L L + +W ++ R+ +++NN+ Sbjct: 126 DHMEKHVYREDDFCLDKEGV---SEFLDDALKPQSGVGFIWGKLFNREFLIRNNLLLNDN 182 Query: 187 L-HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNT 220 L +D + F A R + +SLY YLHNT Sbjct: 183 LTAAEDAEFMLRTAFYADRVSFVAKSLYN-YLHNT 216 >UniRef50_B0NG25 Putative uncharacterized protein n=1 Tax=Clostridium scindens ATCC 35704 RepID=B0NG25_EUBSP Length = 348 Score = 106 bits (264), Expect = 2e-21, Method: Compositional matrix adjust. Identities = 89/309 (28%), Positives = 135/309 (43%), Gaps = 46/309 (14%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP-H 62 ST +S+IIP YNA +ES+++Q+ + EII+INDGS D + EI + Y + P Sbjct: 2 STINISIIIPAYNAEKYISETIESILSQSMQSFEIIVINDGSQDRTQEIVEGYRDKNPDK 61 Query: 63 VRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNA 122 +R+ +Q N G S ARN +E GKY+AF+DADD+V E L + ++D D++ C Sbjct: 62 IRIFYQKNQGQSAARNNALEYVRGKYIAFIDADDKVADWYLEELYNASEKEDADISVC-- 119 Query: 123 DWCFRETGETWQSIPTDRLRSTGVLTGPDW---LRMGLSSRRWTHVVWMGVYRR----DV 175 +Q TD T DW R G + HV R Sbjct: 120 ---------AYQKFDTDTGEITLKRYTEDWNVEFRPG-----YNHVFQYSPCARLCKTSF 165 Query: 176 IVKNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNY 235 + K +F AG +D + + R + Y Y ++ SV R+G K Sbjct: 166 VKKYGFRFSAGEQLEDGPYCMMIDLLSSRTVIINKIGYYYRVYQDSVMGNVRKGKKRP-- 223 Query: 236 QRHYIKITRLLEKLNRNYADKIMI------------------YPEFHQQITYEALRVCHA 277 + Y I ++K+ N DKI++ Y +QQI E + C+ Sbjct: 224 KVPYKGIESAIKKVKENTNDKIVLDMLEFCTVKILTGLVTNMYKNCNQQIRKEICKYCYY 283 Query: 278 V--RKEPDI 284 + + PDI Sbjct: 284 IIHKYFPDI 292 >UniRef50_A4VTV1 Glycosyltransferases involved in cell wall biogenesis n=10 Tax=Streptococcus suis RepID=A4VTV1_STRSY Length = 334 Score = 106 bits (264), Expect = 2e-21, Method: Compositional matrix adjust. Identities = 64/211 (30%), Positives = 110/211 (52%), Gaps = 10/211 (4%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S+I+P+YN C+ S++ QT+ +EI+++NDGSTDNS EI YA+ +R Sbjct: 4 ISIIVPIYNVEQYLSKCINSIVNQTYKHIEILLVNDGSTDNSEEICLAYAKKDSRIRYFK 63 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 + N G S ARN GI A G Y+AF+D+DD ++ + L ++ VA D Sbjct: 64 KENGGLSDARNYGISRAKGDYLAFIDSDDFIHSEFIQRLHEAIERENALVAVAGYDRV-D 122 Query: 128 ETGE--TWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVV-WMGVYRRDVIVKNNIKFI 184 +G T + +PT++ VL+G + + L + VV W +Y++++ + +F Sbjct: 123 ASGHFLTAEPLPTNQ----AVLSGRNVCKKLLEADGHRFVVAWNKLYKKELF--EDFRFE 176 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKY 215 G H+D +T ++ + ++ LY Y Sbjct: 177 KGKIHEDEYFTYRLLYELEKVAIVKECLYYY 207 >UniRef50_Q8RSN9 Putative glycosyl transferase n=1 Tax=Streptococcus thermophilus RepID=Q8RSN9_STRTR Length = 229 Score = 105 bits (263), Expect = 2e-21, Method: Compositional matrix adjust. Identities = 62/208 (29%), Positives = 110/208 (52%), Gaps = 8/208 (3%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S++IP+YN D + C++S+++QT++ LEII+++DGSTD S I YY+EN ++++H Sbjct: 3 ISIVIPVYNVQDYIKKCLDSILSQTFSDLEIILVDDGSTDLSGRICDYYSENDKRIKVIH 62 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 AN G S ARN GI+ AT +++ F+D+DD V E L + + D++ + + Sbjct: 63 TANGGQSEARNVGIKNATSEWITFIDSDDYVSSDYIEYLYNLIQVHNADISIASFTYI-- 120 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 T + I ++ +R L + + VW +YR + N KF++G Sbjct: 121 ----TPKKIIKHGNGEVALMDAKTAIRRMLLNEGFDMGVWGKMYRTEYF--NKYKFVSGK 174 Query: 188 HHQDIVWTTEFMFNALRARYTEQSLYKY 215 +D + T + A + + +Y Y Sbjct: 175 LFEDSLITYQIFSEASTIVFGAKDIYFY 202 >UniRef50_Q8A2D4 Glycoside transferase family 2 n=2 Tax=Bacteroides RepID=Q8A2D4_BACTN Length = 329 Score = 105 bits (263), Expect = 2e-21, Method: Compositional matrix adjust. Identities = 63/236 (26%), Positives = 121/236 (51%), Gaps = 16/236 (6%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M + +S+I+P++N D C+ESL +QT +EII++++ S D S EI YA+ Sbjct: 1 MENKYLVSIIVPVHNTADYLHKCIESLRSQTLQQIEIILVDNLSVDGSSEICDEYAKIDK 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 +++LH + AG S+ARN G+ +A+ Y+ FVD+DD V P M+E ++ ++ D ++A CN Sbjct: 61 RIKVLHLSIAGLSIARNAGMRIASAPYIGFVDSDDYVEPAMFEKMLDAMVQSDAEIAYCN 120 Query: 122 --ADWCFRETGETWQSIPTDRLRSTG--VLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIV 177 ++ F+ + R++G V+ P + + + + +Y+ + Sbjct: 121 FLLEYEFKPNESPY--------RNSGDIVIRDPKNVLQDMMMEKVSCSACTKLYKSGFL- 171 Query: 178 KNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNL 233 +++F G +++D + E++ + + + Y Y TS+ H NL Sbjct: 172 -TSLQFPEGKNYEDRLVMYEWVALCKKVVWVDSPFYHYVERQTSIC--HTMSPMNL 224 >UniRef50_A6BH35 Putative uncharacterized protein n=1 Tax=Dorea longicatena DSM 13814 RepID=A6BH35_9FIRM Length = 329 Score = 105 bits (263), Expect = 2e-21, Method: Compositional matrix adjust. Identities = 65/223 (29%), Positives = 114/223 (51%), Gaps = 11/223 (4%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K+S+IIP+YN + ES++ QT+T +E+II++DGS+D S +I + A+ V + Sbjct: 4 KISIIIPVYNLEKYIKRTAESVLKQTYTNIEVIIVDDGSSDESWKIIQKIADKDNRVIPI 63 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 HQ N G + AR +G+ +TG+++ FVD DDE+ MYE L+ A + D++ C F Sbjct: 64 HQENGGVTSARLKGVRRSTGEWIGFVDGDDEIESDMYEILLNNAKRYNADISHCGYQMIF 123 Query: 127 RET-----GETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNI 181 + T + DR L + GL ++ + ++ + V+ +I Sbjct: 124 EDGRVNYFHNTGCLVQQDRTTGLKDLLDGSLVEPGLCNKLFHKNLFHSLLHSTVM-NEDI 182 Query: 182 KFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSR 224 K ++D++ NA + + ++ LY Y + +TS SR Sbjct: 183 KI-----NEDLLMNYLLFLNAKVSIFEDRCLYHYIVRSTSASR 220 >UniRef50_B0P4B2 Putative uncharacterized protein n=1 Tax=Clostridium sp. SS2/1 RepID=B0P4B2_9CLOT Length = 342 Score = 105 bits (263), Expect = 2e-21, Method: Compositional matrix adjust. Identities = 69/265 (26%), Positives = 135/265 (50%), Gaps = 29/265 (10%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 LS+I+P+YN CM+S++ Q++ EII+++DGSTD+S + Y + V++LH Sbjct: 7 LSIILPIYNVEKYLDRCMKSILKQSFKDYEIILVDDGSTDSSPILCDKYGKQLNKVKVLH 66 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 ++N G S ARN G+E+A GKY+ ++D+DD + +++ + ++D+ + N Sbjct: 67 KSNGGLSSARNAGLELAKGKYIFWLDSDDYISEEALSSIVKTLNKSEVDILKFN------ 120 Query: 128 ETGETWQSIPTDRLRSTGVLTG---PDWLRMGLSSRRWTHV------VWMGVYRRDVIVK 178 ++ P D +R + + G + ++ + HV W +YRR I + Sbjct: 121 -----YRRQPNDIIRKSLLKPGLYNEEEIKKNVIPLALEHVGEFTMSAWSHIYRRKFIEE 175 Query: 179 NNIKFIA--GLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQ 236 ++FI+ + +D ++ + +A E LY Y L S+S +R+ N Sbjct: 176 EKLRFISEREVGSEDYLFNLQAFLSAKTLCAIEDDLYNYDLRLGSLSNKYRE-----NLI 230 Query: 237 RHYIKITRLLEK--LNRNYADKIMI 259 Y K+ L+ K ++ N ++++++ Sbjct: 231 EKYKKLYELMRKALISHNISNELLL 255 >UniRef50_B0P2L7 Putative uncharacterized protein n=1 Tax=Clostridium sp. SS2/1 RepID=B0P2L7_9CLOT Length = 319 Score = 105 bits (263), Expect = 2e-21, Method: Compositional matrix adjust. Identities = 74/263 (28%), Positives = 130/263 (49%), Gaps = 19/263 (7%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S+IIP+Y C+E+++ QT+ LEII+I+D S D EI +A+ ++++H Sbjct: 5 VSMIIPVYGVEAYLGECLETVLNQTYENLEIILIDDESPDYCPEICDQHAQKDARIKVIH 64 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 Q N GA+ ARN G+++ATG+Y+ F+D+DD++ E L+ + DVA C+ ++ Sbjct: 65 QKNGGAANARNHGLDMATGEYICFIDSDDKIENNYVEKLLNAIKKSQSDVAVCSFKQWYK 124 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWT-HVVWMGVYRRDVIVKNNIKFIAG 186 +T SI + D++R LS WT ++W ++ + + ++F G Sbjct: 125 --NKTQDSIGF----KNKEYSSKDYIRKFLSD--WTCGLIWNKMFDKKTL--EGVRFEEG 174 Query: 187 LHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRLL 246 D +T + + A + ++SLY Y + + V Q YQ +K L Sbjct: 175 HKIDDEFFTYQAILKAEKVAIFDESLYWYRMRKSGVMLSGTQ------YQAQMLK--DRL 226 Query: 247 EKLNRNYADKIMIYPEFHQQITY 269 E + + Y I YP+ + Y Sbjct: 227 EYMTKRYEMVIEKYPDLKAEYLY 249 >UniRef50_Q1WRW5 Glycosyltransferase n=1 Tax=Lactobacillus salivarius UCC118 RepID=Q1WRW5_LACS1 Length = 347 Score = 105 bits (263), Expect = 2e-21, Method: Compositional matrix adjust. Identities = 96/336 (28%), Positives = 161/336 (47%), Gaps = 50/336 (14%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S+I+P+YNA + R C++S+I Q ++ +E+I+++DGS D S EI K Y +RL H Sbjct: 2 ISIIVPIYNAENSLRECIDSVIYQDYSNIELILVDDGSVDRSYEICKEYKNKDKRIRLHH 61 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALED-DLDVAQC----NA 122 Q+N G S ARN+GI +A G+YV F+D+DD+ + + M E+ DL V N Sbjct: 62 QSNLGVSSARNKGISMANGEYVFFLDSDDQCENILSYCVSIMKKENIDLLVGGTDYYKNN 121 Query: 123 DWCFRET----GETWQSIPTDRLRST--------GVLTGPDWLRMGLSSRRWTHV--VWM 168 ++ E GE ++ I +D +S + P+ L +SSR+ + V Sbjct: 122 NFLISENANLFGEEYK-IYSDDSKSKLNRWVMERNITDFPELL---VSSRKTIRIGGVAA 177 Query: 169 GVYRRDVIVKNNIKFIAGL-HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHR 227 + R+ + ++FI L + +D+V+ E + R +S+Y+Y S + Sbjct: 178 KLIRKKSM--GGLRFIENLGYSEDLVFLYELLKKIDRLVVLNKSIYRYNYREDSAT---- 231 Query: 228 QGNKNLNYQRHYIKITRLLEKLNRNYADKIMI-YPEFHQQITYEAL------RVCHAVRK 280 N Y I+ + L K Y KI Y E ++ YE + + H + Sbjct: 232 ----NSGYNPKIIRFNKELSK----YIYKISTDYDEVYENSMYEKILEIYWGSIIHGITY 283 Query: 281 EPD--ILTRQRMIAE--IFTSGMYKRL-ITNVRSVK 311 P+ + R I++ + + M K+L I NVR K Sbjct: 284 NPNSKFYEKLRKISDESKYYNSMLKKLKIHNVRKKK 319 >UniRef50_B3CA81 Putative uncharacterized protein n=1 Tax=Bacteroides intestinalis DSM 17393 RepID=B3CA81_9BACE Length = 324 Score = 105 bits (262), Expect = 2e-21, Method: Compositional matrix adjust. Identities = 76/274 (27%), Positives = 128/274 (46%), Gaps = 29/274 (10%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQ--TWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 K+S+IIP YNA R C+ S + Q + EII+INDGSTD + I + + YP ++ Sbjct: 2 KISIIIPAYNAEKQIRKCVLSALQQDISHNEYEIIVINDGSTDQTAGIIQELSREYPTIK 61 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD- 123 + N G S ARN G+++A G+ + F DADD + L++DLD+ + + Sbjct: 62 SITTTNQGVSAARNLGMDIAQGEIILFADADDYFCTNSIAKIYEAILKNDLDILLFDYNH 121 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 W + + G+++G +++ G VWM YRR+ ++K N KF Sbjct: 122 WGNQNKLLKGFDYDARKFCPRGIVSGKSFIKTGY----MPAAVWMLAYRREYMLKMNFKF 177 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQ--------GNKNLN- 234 I + H+D + + A + + Y Y + +S + +++ NLN Sbjct: 178 I-NIRHEDEEFIPRVFYFAEKIMHLPLICYNYVQNESSFMQNYKEYGLFDKIKAMDNLNQ 236 Query: 235 YQRHYIK------------ITRLLEKLNRNYADK 256 + + Y+K RL+E LN ++A K Sbjct: 237 FTQKYVKEEDVAFTLSEHISLRLMENLNNSFAIK 270 >UniRef50_C1QCB7 Glycosyl transferase n=1 Tax=Brachyspira murdochii DSM 12563 RepID=C1QCB7_9SPIR Length = 334 Score = 105 bits (262), Expect = 2e-21, Method: Compositional matrix adjust. Identities = 50/117 (42%), Positives = 75/117 (64%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S+IIP+YN + +TC+ES+I QT+ LEII INDGSTDNS+EI K YA + ++ Sbjct: 2 VSIIIPVYNVSNYLKTCLESIINQTYKDLEIICINDGSTDNSLEILKEYASKDKRIIIID 61 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 + NAG S ARN GIE ++G+Y+ +D+DD + E A +++ D+ + W Sbjct: 62 KKNAGVSAARNDGIEKSSGEYIFCMDSDDYIDNDFIEVFYNNAKKNNSDLVVLSTFW 118 >UniRef50_A1R005 SS-1,4-galactosyltransferase n=23 Tax=Borrelia RepID=A1R005_BORT9 Length = 358 Score = 105 bits (262), Expect = 2e-21, Method: Compositional matrix adjust. Identities = 68/213 (31%), Positives = 110/213 (51%), Gaps = 8/213 (3%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K+SVII +N+ ++S + QT EII++NDGS DNS++IA+ YA+ Y V+++ Sbjct: 9 KVSVIICFFNSDSTLEVMIKSALNQTLRDKEIILVNDGSYDNSLKIAQKYADQYDCVKII 68 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 +Q N G + +RN G+E A G+Y+ + D+DD V TM E L A D+ DV C+ + + Sbjct: 69 NQKNLGIAASRNNGLEKAEGEYIIYWDSDDSVENTMLEVLYNRAKADNSDVV-CSQFYVY 127 Query: 127 --RETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVV----WMGVYRRDVIVKNN 180 + ++P TG + L +S + V W + RRD+IVKN+ Sbjct: 128 FIARNIKRRSALPFPNYPITGEEAFKNLLLTVFASFSKKNFVVGTLWDKLIRRDLIVKND 187 Query: 181 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLY 213 I + + +DIV+ A + + Y Sbjct: 188 I-IVNDIILEDIVFMLHVFLKADKVSFVNNYFY 219 >UniRef50_B2RH99 Probable glycosyltransferase n=2 Tax=Porphyromonas gingivalis RepID=B2RH99_PORG3 Length = 328 Score = 105 bits (262), Expect = 3e-21, Method: Compositional matrix adjust. Identities = 77/282 (27%), Positives = 129/282 (45%), Gaps = 19/282 (6%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 ++SVIIP+YNA + C+++L+ QT+ E+I+I+DGS D S I YA V + Sbjct: 4 RISVIIPVYNAEKFLKHCLDTLLAQTFKDFEVILIDDGSRDRSSNICDAYAYKDTRVHVY 63 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 H+ N G S AR +G+E+A+G Y+ VDADD V P+ E L A+ + D+ C+ + Sbjct: 64 HKENGGVSAARQKGLELASGAYIIHVDADDWVAPSYLEVLYEEAVRTNADITICDYIEVY 123 Query: 127 RETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAG 186 + PT T D + L ++ W + + VI +N I F G Sbjct: 124 PDKKVYKVQKPTS--------TSADDFVLDLCTKLHGS-CWNKLIKASVIRENKIDFPDG 174 Query: 187 LH-HQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRL 245 ++ +D ++ + Y ++LY Y+ + S QG+ + + YI L Sbjct: 175 INLREDKLFNLHVALCSSTISYAPKALYYYFCSHLGGSA-SSQGSLETS-MKSYIYGNNL 232 Query: 246 LEKLNRNYADKIMI-------YPEFHQQITYEALRVCHAVRK 280 N ++ ++ YP + YE V H ++ Sbjct: 233 FAHFLSNKSNSALLTAQRRFKYPTIAYLLIYEQDYVSHVSQE 274 >UniRef50_A8EVS8 Probable glycosyltransferase n=1 Tax=Arcobacter butzleri RM4018 RepID=A8EVS8_ARCB4 Length = 337 Score = 105 bits (261), Expect = 3e-21, Method: Compositional matrix adjust. Identities = 62/224 (27%), Positives = 115/224 (51%), Gaps = 7/224 (3%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTA-LEIIIINDGSTDNSVEIAKYYAEN 59 M N +S+I+P+YNA + C++S++ QT +E+I +NDGS+DNS+EI Y Sbjct: 1 MSNKEVFISIIVPVYNAEEYLDECLQSILKQTIIENIELICVNDGSSDNSLEILNLYKTK 60 Query: 60 YPHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVY-PTMYETLMTMALEDDLDVA 118 +P++ ++ Q NAG+++ARN G+++A G+Y+ FVD DD + L ++A ++ LD+ Sbjct: 61 FPNMLIIDQKNAGSAMARNNGLQIAKGEYIYFVDNDDYLANENCLSELYSVAKKESLDIL 120 Query: 119 QCNADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVK 178 N + SI + + + TG ++L T+ W + +R + + Sbjct: 121 NFNH-LILKNNSLNKFSI---KRENNKIYTGKEYLSTA-QKGNITNTPWDKILKRSYLKE 175 Query: 179 NNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSV 222 N + +G+ D ++A + + + Y Y + SV Sbjct: 176 INFAYTSGVISDDAEALLPLFYDAKKVSFVDNFAYVYRIRPNSV 219 >UniRef50_C2LRT9 Glycosyl transferase, family 2/glycosyl transferase family 8 n=1 Tax=Streptococcus salivarius SK126 RepID=C2LRT9_STRSL Length = 293 Score = 105 bits (261), Expect = 3e-21, Method: Compositional matrix adjust. Identities = 67/218 (30%), Positives = 114/218 (52%), Gaps = 8/218 (3%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M + +S++IP+YN + C+E+++ QT+ LEII++NDGS D S EI K + + Sbjct: 1 MENKEVVSIVIPIYNVEAYLKQCLETIVNQTYPNLEIILVNDGSPDKSEEICKEFFKRDS 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 +R + Q N G S ARN GI++ATG Y+ FVD DD V ETL T + + DV+ N Sbjct: 61 RIRYVRQVNGGLSAARNTGIDLATGDYITFVDPDDWVTDDYVETLYTQLKKYEADVSIAN 120 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVW----MGVYRRDVIV 177 + F E+ + T+ S + G + + + + + W M +Y+ + + Sbjct: 121 YN-LFNESTSKFLIKVTENDYSETLYEGREIIDQD-AIQETRDMAWACAMMKLYK--ISL 176 Query: 178 KNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKY 215 +++F G + +D + + A R +TE+ +Y Y Sbjct: 177 FEDLRFPVGKNVEDNFLMYKLLLKANRVVHTEKCIYWY 214 >UniRef50_C6LFJ2 Glycosyl transferase, family 2/glycosyl transferase family 8 n=1 Tax=Bryantella formatexigens DSM 14469 RepID=C6LFJ2_9FIRM Length = 402 Score = 105 bits (261), Expect = 3e-21, Method: Compositional matrix adjust. Identities = 70/230 (30%), Positives = 107/230 (46%), Gaps = 19/230 (8%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M+ K+SVIIP+YNA R ++ QT +EII ++DGSTD+S EI + YA Sbjct: 1 MSYDRKVSVIIPVYNAEKYLRDTLDDATGQTLREIEIICVDDGSTDSSAEIVEEYARRDA 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 VRL+ Q N A VARN G+ ATG+Y+ F DADD P M E + + D+ CN Sbjct: 61 RVRLIRQKNQYAGVARNHGMSHATGEYLVFWDADDCFEPAMLEKMYARITQAQADICVCN 120 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTH----------VVWMGVY 171 + D +RS L P +L+ + TH V W ++ Sbjct: 121 VN--------VLDETTKDLIRSESYLV-PAYLQGAQEYAKTTHPEYLFNIATNVPWNKMF 171 Query: 172 RRDVIVKNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTS 221 R + + ++ ++F D+ + + A + ++ L Y +N S Sbjct: 172 RAEFVREHGLQFENRSRANDVYFVMMAFYLAEKIAVLDERLVTYRANNES 221 >UniRef50_C7G7U9 Putative capsular polysaccharide biosythesis protein CpsI n=1 Tax=Roseburia intestinalis L1-82 RepID=C7G7U9_9FIRM Length = 346 Score = 105 bits (261), Expect = 3e-21, Method: Compositional matrix adjust. Identities = 62/189 (32%), Positives = 104/189 (55%), Gaps = 18/189 (9%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M T+K+SVI+P+YN + C++S+ QT+ LEI++I+DGS D S EI Sbjct: 19 MTKETDKISVIVPVYNVKNYLEDCVQSITHQTYPDLEILLIDDGSDDGSGEICDRLQMTD 78 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 +R+ H+ + G S ARN+G++ A G Y+AFVD+DD + P MY L+ + + +A C Sbjct: 79 ARIRVFHEKHKGVSGARNKGLKEAAGAYIAFVDSDDVLEPDMYTYLIRLLKTHEAQIAAC 138 Query: 121 NADWCFRE-TGETWQSIPTDRL-----RSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRD 174 + + + G+ ++S P D + + GVL +LR VW +++R+ Sbjct: 139 SGWYKYDAGVGKKFES-PADVVCMNGREALGVLHERSYLRA---------YVWNKLFQRE 188 Query: 175 VIVKNNIKF 183 V+ + I+F Sbjct: 189 VL--DGIRF 195 >UniRef50_C8PLK6 Ss-1,4-galactosyltransferase n=1 Tax=Campylobacter gracilis RM3268 RepID=C8PLK6_9PROT Length = 421 Score = 105 bits (261), Expect = 4e-21, Method: Compositional matrix adjust. Identities = 75/284 (26%), Positives = 137/284 (48%), Gaps = 38/284 (13%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +SVIIP+YN + R C+E + QT+ LEI++++DGS D S I YA VR+LH Sbjct: 7 ISVIIPVYNVKEFLRACVERITAQTYANLEILLVDDGSNDGSEAICDEYAARDLRVRVLH 66 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN----AD 123 ++N G + ARN ++VA G++++FVD+DD + + ETL + + +A C +D Sbjct: 67 KSNGGQASARNAALDVARGEFISFVDSDDLISLDLIETLFRLTQKFCTKIAMCGYAAFSD 126 Query: 124 WC----FRETGETWQSIPTDRLRSTGVLT-----------GPDWL--RMGLSSRRWTHVV 166 F+ + ++ + + + G + P L R ++ V Sbjct: 127 ESEINNFKAALNSKENQNSKEISNPGGVANSNANAGEYKISPQELFRRSCTQDPYFSTAV 186 Query: 167 WMGVYRRDVIVKNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLH 226 W ++ R++ ++F AG ++D+ + +FNA A YT++ LY Y + Sbjct: 187 WRALFAREIFTA--LRFPAGQIYEDVAIFFD-IFNASIAAYTDEILYFYRV--------- 234 Query: 227 RQGNKNLNYQRHYIKITRLLEKLNRNYADKIMI-YPEFHQQITY 269 R G+ ++ R ++ + + R Y + I YP ++ + Sbjct: 235 RAGSTVNSFARRHLAVIAAV----RRYTEAIAANYPSLAREANF 274 >UniRef50_D2Q641 Glycosyl transferase family 2 n=2 Tax=Bifidobacterium dentium RepID=D2Q641_9BIFI Length = 330 Score = 104 bits (260), Expect = 4e-21, Method: Compositional matrix adjust. Identities = 71/228 (31%), Positives = 114/228 (50%), Gaps = 11/228 (4%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M + +SVI+P++ C+ES++ QT L+II+I+DGS D E+ + Sbjct: 1 MADEQPLISVIVPVHGVEASLDACVESIVRQTHRNLDIILIDDGSPDRCPELCDRWMTQD 60 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 +R++HQ N+G S ARN GIE + G+Y+ FVD+DD + MYETL+ L + D++ Sbjct: 61 SRIRVIHQRNSGPSAARNAGIEASQGEYLGFVDSDDSIAEDMYETLLRNLLREQADISII 120 Query: 121 NADWCFRETGETWQSIPTD-RLRSTGVLTGPDWLRMGLSSRRWTHV-VWMGVYRRDVIVK 178 E GE++ T+ R R + + + L + HV VW +YRR++ Sbjct: 121 GTSL-VGEDGESYVPYATECRFRMN---SEQAFKYVNLPG--YFHVAVWDKLYRRELF-- 172 Query: 179 NNIKFIAGLHH-QDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRL 225 ++++F H +D T + A Y Y Y+LH RL Sbjct: 173 DDVRFPVDRHSGEDYPVTFRMIDKAETLIYDSTPKYCYHLHERMTLRL 220 >UniRef50_D1P3V3 Glycosyl transferase, group 2 family protein n=1 Tax=Providencia rustigianii DSM 4541 RepID=D1P3V3_9ENTR Length = 324 Score = 104 bits (260), Expect = 4e-21, Method: Compositional matrix adjust. Identities = 67/190 (35%), Positives = 104/190 (54%), Gaps = 9/190 (4%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNS---VEIAKYYAENYP-HV 63 LSVIIP +N G+ C+ SL +E+I++NDGSTDNS +E KY EN +V Sbjct: 7 LSVIIPCFNNGEYVYDCLNSLPISATDLIEVIVVNDGSTDNSESEIERFKYSHENISSNV 66 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 L++Q N G S+ARN GI+ A GKY+AF+DADD P+++ET+ + L+ D+ NA Sbjct: 67 LLINQKNQGVSIARNNGIKHANGKYIAFLDADDLWTPSLWETIKPVMLDHKPDMIIFNAS 126 Query: 124 WCFRETG-ETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIK 182 + + ET+ T+ ++ L W W VY++++ ++IK Sbjct: 127 RFYNDNPKETYPLSVTNLDNGYHIVNQLTDLSGIFEKNSW--FSWCRVYKKELF--DDIK 182 Query: 183 FIAGLHHQDI 192 F G ++D+ Sbjct: 183 FPCGREYEDL 192 >UniRef50_B2VF64 Predicted glycosyl transferase, family 2 n=3 Tax=Enterobacteriaceae RepID=B2VF64_ERWT9 Length = 326 Score = 104 bits (260), Expect = 4e-21, Method: Compositional matrix adjust. Identities = 65/230 (28%), Positives = 114/230 (49%), Gaps = 14/230 (6%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M+ LS+I+P +N C+ESL Q +E+I+INDGS+DNS +A + Sbjct: 1 MLAPVPLLSIIVPFFNNEAFITPCLESLFAQIDKDIEVILINDGSSDNSAHLASRCVARH 60 Query: 61 PHVRL--LHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVA 118 P R+ + Q N+G + RN G++ A+G+Y+ F+D DD + P E L + L D+ Sbjct: 61 PEARVHFISQQNSGIASTRNVGLQQASGRYITFLDGDDVLSPHYVEILKPVLLSGQYDLL 120 Query: 119 QCNADWCFRETGETWQS----IPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRD 174 + R+ E + + ++ GV L + W +W VYRR+ Sbjct: 121 DFDYQRFTRDLPEVHKDEQVRVHEYDFKNNGVAC----LEPLFTRSMWH--LWNRVYRRE 174 Query: 175 VIVKNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSR 224 ++ NN +F G ++D+++T F + R + + +LY Y ++ ++R Sbjct: 175 LL--NNERFEPGRRYEDVIFTPFIYFKSERIAHLDHTLYFYRDNSLGITR 222 >UniRef50_A6BHE9 Putative uncharacterized protein n=1 Tax=Dorea longicatena DSM 13814 RepID=A6BHE9_9FIRM Length = 363 Score = 104 bits (259), Expect = 5e-21, Method: Compositional matrix adjust. Identities = 70/234 (29%), Positives = 118/234 (50%), Gaps = 12/234 (5%) Query: 7 KLSVIIPLYNAGDDF-RTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRL 65 K+S+I+P+YN ++ + C++SL+ QT +E+I+I+DGST+ + +I YA V++ Sbjct: 16 KISIIVPVYNTKKEYLKKCVDSLLNQTMKEIEVILIDDGSTNQAGKICDEYAGVDNRVKV 75 Query: 66 LHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQ-CNADW 124 LHQ N G +VARN G+++A G+++ FVDADD TM E ++ A+E+ ++ N Sbjct: 76 LHQKNQGVAVARNNGLDIAEGEWITFVDADDWCENTMCEEILRKAVENSSEILIFTNYSV 135 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSR---------RWTHVVWMGVYRRDV 175 +T + Q D + + L+ L + W + R+ Sbjct: 136 QSEKTVKKNQFFDKDIKKFDEKMKEEAELKTMLRTHPSFSFQPPINMMGGTWCKLINRNF 195 Query: 176 IVKNNIKFIAG-LHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQ 228 + K+ +KF L QDI++ A Y + LY Y SVSR +R+ Sbjct: 196 LEKSGVKFEPELLRSQDIIFYLNLFEKADDISYYNRQLYYYRYDENSVSRRYRK 249 >UniRef50_Q8GPA4 Eps7G n=1 Tax=Streptococcus thermophilus RepID=Q8GPA4_STRTR Length = 328 Score = 104 bits (259), Expect = 5e-21, Method: Compositional matrix adjust. Identities = 61/194 (31%), Positives = 99/194 (51%), Gaps = 9/194 (4%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY-PHVRL 65 K S+IIP+YN D C+ S++ Q + EII+INDGS DNS+ I + + Y +++ Sbjct: 3 KFSIIIPVYNLEDYLYRCLYSVLNQDYNDFEIILINDGSDDNSLNIIEEFKNQYCSKIKV 62 Query: 66 LHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWC 125 + Q N G S ARN+G++ A G+Y+ F+D DD + ++ E + + Sbjct: 63 ISQVNQGVSSARNKGLQEAEGEYIIFIDGDDYIDSNHLSNILEYIGESENSFI---LNSL 119 Query: 126 FRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIA 185 F ETGET + IP G L L + R+ ++ ++ VI N++KF Sbjct: 120 FVETGETTRVIPKASKNYDCSFNGT--LMNILDNHRYQGFLFNKIFSNSVIKSNDLKFKE 177 Query: 186 GLHHQDIVWTTEFM 199 L++ + TEF+ Sbjct: 178 NLYYAE---DTEFV 188 >UniRef50_C6I7L6 Glycoside hydrolase n=1 Tax=Bacteroides sp. 3_2_5 RepID=C6I7L6_9BACE Length = 316 Score = 104 bits (259), Expect = 5e-21, Method: Compositional matrix adjust. Identities = 65/214 (30%), Positives = 105/214 (49%), Gaps = 4/214 (1%) Query: 3 NSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPH 62 N K+SVIIP+YN + +ES+ QT T LEII I+DGSTD+S ++ + A Sbjct: 5 NKHPKVSVIIPIYNTCHYVQEALESICRQTLTELEIIAIDDGSTDSSRKVVEQVAATDMR 64 Query: 63 VRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNA 122 +R+ QAN G S+ RN G++ GKYV F+D+DD + ET + LD +A Sbjct: 65 IRVYSQANQGQSITRNNGMQYVQGKYVYFMDSDDRLEADTLETCYSACEAGQLDFVCFDA 124 Query: 123 DWCFRETGETWQSIPTDR---LRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKN 179 D + + DR R V G + L+ LS+R ++ + + + + Sbjct: 125 DI-LNKDHPCARHFNYDRSACARPKQVYKGVELLQRQLSARVFSPSPCLNLISTSYLKTS 183 Query: 180 NIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLY 213 + F + H+D ++T++ A + Y Q + Sbjct: 184 GLTFYPRIIHEDQLFTSQLYLKAEKVGYIPQKFF 217 >UniRef50_C5E891 Eps7G n=1 Tax=Bifidobacterium longum subsp. infantis CCUG 52486 RepID=C5E891_BIFLO Length = 330 Score = 104 bits (259), Expect = 5e-21, Method: Compositional matrix adjust. Identities = 75/267 (28%), Positives = 128/267 (47%), Gaps = 18/267 (6%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M S+ SV+IP+YN+ D + +TQ E+I+++DGS DNS+ I + +E Sbjct: 1 MNESSILFSVVIPVYNSEDYIEQLLRHFVTQNNNRFEVIVVDDGSKDNSLSICQKVSEQN 60 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 + +L Q N GAS ARN G+ GKYV FVD+DD + L + ++ D+ Q Sbjct: 61 DFIVVLSQRNQGASAARNYGLRNVRGKYVVFVDSDDSITDDYVSALTQICETNNSDLIQL 120 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTG--PDWLRMGLSSRRWTHVVWMGVYRRDVIVK 178 N W + E ++ TD + + G + L+ + W VY+R +I Sbjct: 121 N--WLQGSSEEGYKPAYTDLMDCNPSVKGYCCELLQQKFNPP------WNKVYKRSLIDC 172 Query: 179 NNIKFIAGL-HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQR 237 I F + +D+ +T +++ N + LYKY+L+N + +L+Y Sbjct: 173 YGIMFNTNMVMGEDLEFTIKYVNNIQSVSINSRYLYKYFLNNDGLC-----ARASLSYFS 227 Query: 238 HYIKITRLLEKL--NRNYADKIMIYPE 262 KI ++KL NR++ +++Y + Sbjct: 228 DLRKIYDAMQKLVFNRSFGRAVLLYAD 254 >UniRef50_A5ZUJ1 Putative uncharacterized protein n=1 Tax=Ruminococcus obeum ATCC 29174 RepID=A5ZUJ1_9FIRM Length = 329 Score = 104 bits (259), Expect = 5e-21, Method: Compositional matrix adjust. Identities = 53/123 (43%), Positives = 75/123 (60%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M N +SVIIP+YNA + + + SL+ QT +EII ++DGSTD S+ I K + + Sbjct: 1 MSNDIPAVSVIIPVYNAAEFLKDGLNSLLKQTLREIEIICVDDGSTDGSLVILKEFEKAD 60 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 +R++HQ N GA ARN G++VA GKY+AF+DADD M + A E + +V Sbjct: 61 ARIRVIHQENQGAGAARNNGMDVARGKYLAFLDADDFFEKNMLKAAYDRAEETEAEVCVF 120 Query: 121 NAD 123 NAD Sbjct: 121 NAD 123 >UniRef50_A5ZFA7 Putative uncharacterized protein n=1 Tax=Bacteroides caccae ATCC 43185 RepID=A5ZFA7_9BACE Length = 328 Score = 104 bits (259), Expect = 5e-21, Method: Compositional matrix adjust. Identities = 60/191 (31%), Positives = 101/191 (52%), Gaps = 8/191 (4%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M S +SVIIP++N C+ES+ Q+ +EII++ + S DNS EI YA+ Sbjct: 1 MESNYLVSVIIPVHNTAPYLHKCVESVRNQSLKEIEIILVENMSIDNSAEICDEYAKIDS 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 ++++H AG +ARN GI+ AT Y+ FVD+DD + M++ L+ E+D D+ CN Sbjct: 61 RIKVIHLPIAGLCIARNAGIDAATAPYIGFVDSDDYIGTDMFKDLVEAMTENDADLTFCN 120 Query: 122 ADWCFR-ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN 180 +C+ E G P +L + L L R + VW+ +++R++ + Sbjct: 121 --YCYEYEDGHIELLAPNS---GNNILCVKEELLRDLMWERKSCAVWIRLFKRELF--TS 173 Query: 181 IKFIAGLHHQD 191 ++F G ++D Sbjct: 174 LRFPEGFVYED 184 >UniRef50_C0WDB4 Ss-1,4-galactosyltransferase n=1 Tax=Acidaminococcus sp. D21 RepID=C0WDB4_9FIRM Length = 337 Score = 104 bits (259), Expect = 6e-21, Method: Compositional matrix adjust. Identities = 85/334 (25%), Positives = 156/334 (46%), Gaps = 38/334 (11%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S+++P+YN C+ SL+ QT++ LEI++++DGSTD S + + + + ++H Sbjct: 16 ISLLVPIYNVASYVDRCVGSLVNQTYSNLEILLVDDGSTDGSGALCDEWVKKDSRIHVVH 75 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 + N G S ARN GI+ A G Y+AF+D DD + P E L D D++ CN C+ Sbjct: 76 KENGGLSDARNAGIKRAAGAYLAFIDGDDYIAPEYCEKLYQALAAHDADISLCNI--CYE 133 Query: 128 -ETGETWQSIPTDRLRSTGVLTGPDWLRM--GLSSRRWTHVVWMGVYRRDVIVKN-NIKF 183 G P R GV T + L++ G+ V W +Y+R + N + F Sbjct: 134 WADGRRKDYEPPHGTRE-GVCTVKEVLQVLHGVGDLTLC-VAWNKLYKRIIFTGNPPLFF 191 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKIT 243 G +++D + + A R +LY Y+ + S+ +H+ +K H+ + Sbjct: 192 PKGRYYEDEFLNYKLLNRASRISVIPDALYFYFQRDNSI--IHQYTDK------HFTDLC 243 Query: 244 RLLEKLNRNYADKIMIYPEFHQQITYE---ALRVCHAVRKE--PDILTRQ---------- 288 +E D + YP+ ++ ++ +L++C + K+ D+ R Sbjct: 244 CCIE-------DSLTWYPDDEEKRQWKLRHSLKLCFTLYKKCHDDLHFRNRTSEVKGLVF 296 Query: 289 RMIAEIFTSGMYKRLITNVRSVKVGYQALLWSFR 322 R++ E+ + + KR + V + VG L+ + Sbjct: 297 RIMKELKITYLTKRELVKVFLILVGGYGALYQVK 330 >UniRef50_D0T9V5 Glycosyltransferase family 2 n=1 Tax=Bacteroides sp. 2_1_33B RepID=D0T9V5_9BACE Length = 318 Score = 104 bits (259), Expect = 6e-21, Method: Compositional matrix adjust. Identities = 84/269 (31%), Positives = 124/269 (46%), Gaps = 34/269 (12%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M+ +SVI+P+Y A + C++SL+ QT+ EI++I+DGS D+S +I YA Sbjct: 1 MDKQPAVSVIVPVYKAENYLCHCVDSLLAQTFPDFEILLIDDGSPDHSGDICDEYARKDS 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 VR+ H+ N G S R GI+ A G+Y D DD V P M E L A E+D DV C+ Sbjct: 61 RVRVFHKENEGVSSTRQCGIDNAKGEYTIHADPDDWVEPDMLEELYKKAKEEDADVVICD 120 Query: 122 ADWCFRETGETWQSI---PTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVK 178 + + G+ + I P+D L VL L L +V Y DV Sbjct: 121 F---YEDIGDKRKYIRQQPSD-LDHKTVLRE---LFQQLHGSCCNKLVKRACYNGDV--- 170 Query: 179 NNIKFIAGLHHQDIVWTTEFMFNALRAR------YTEQSLYKYYLHNTSVSRLHRQGNKN 232 +F+ D+ + + FNAL R Y ++ Y +Y N + L R + Sbjct: 171 ---RFVP-----DLTFCEDLCFNALILRKEIKIGYLPRAFY-HYEQNVNAGSLTRNSSP- 220 Query: 233 LNYQRHYIKITRLLEKLNRNYADKIMIYP 261 Y+ + +LEK + AD MI P Sbjct: 221 ----EQYVVLMNILEK-GLDTADFQMIRP 244 >UniRef50_UPI0001BC50FC capsular polysaccharide biosynthsis protein n=4 Tax=Fusobacterium RepID=UPI0001BC50FC Length = 321 Score = 103 bits (258), Expect = 6e-21, Method: Compositional matrix adjust. Identities = 62/218 (28%), Positives = 112/218 (51%), Gaps = 8/218 (3%) Query: 15 YNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPH-VRLLHQANAGA 73 YN R C+ES+ E+I++NDGSTD+S++IA+ +A+ Y L+ Q + G Sbjct: 13 YNVESYLRECLESVYAVNLPK-EVILVNDGSTDHSLQIAEEFAKKYSEETILISQEHTGV 71 Query: 74 SV--ARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFRETGE 131 S ARNRG++ A G+Y+ F+D+DD + P+++E DLD+ N E GE Sbjct: 72 SPSEARNRGLKEAKGEYIYFMDSDDFLNPSLFEDFFAKIKGTDLDILSGNG--VRYENGE 129 Query: 132 TWQSIPTDR--LRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGLHH 189 I + + ++G +++ + W + V + +YRRD +++N+ F G+ + Sbjct: 130 FTDPIIHNEHITMTENPISGREFIYRMFEADSWVNYVCLNIYRRDFLLENSFFFQEGITY 189 Query: 190 QDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHR 227 +D +++ F A + + Y Y + S++ R Sbjct: 190 EDNLFSIPVFFEAKKIIHLGIRFYSYRMRENSITHTKR 227 >UniRef50_D1PNS5 Capsular polysaccharide biosythesis protein CpsI n=1 Tax=Subdoligranulum variabile DSM 15176 RepID=D1PNS5_9FIRM Length = 326 Score = 103 bits (258), Expect = 6e-21, Method: Compositional matrix adjust. Identities = 68/219 (31%), Positives = 104/219 (47%), Gaps = 5/219 (2%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWT-ALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 +SVI+P+YN C++S++ Q + E+++++DGSTD +I YA VR+ Sbjct: 6 ISVIVPIYNTKAYLEECLDSILAQDFKEPYEVLLVDDGSTDGCTDICDAYAAKDNRVRVF 65 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 HQ N G S ARN GI+ A G+Y AFVD+DD V TL E D +A C + Sbjct: 66 HQENQGLSAARNTGIDAARGRYYAFVDSDDVVLSAYLRTLYNACEEHDAYMALCAVED-V 124 Query: 127 RETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTH-VVWMGVYRRDVIVKNNIKFIA 185 +E+GE+ + GV G D L + + V W +YR +V + + Sbjct: 125 QESGESCVPPHSTHPAQEGVFCGKDLLNEFYTPNGTVYTVAWNKLYRAEVW--KLLHYPE 182 Query: 186 GLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSR 224 G H+D + + + LY Y L + S+ R Sbjct: 183 GRLHEDDFVAHRLFWRCDKVVCVDTILYHYRLSSGSICR 221 >UniRef50_D0BKT7 Capsular polysaccharide biosynthesis protein CpsI n=1 Tax=Granulicatella elegans ATCC 700633 RepID=D0BKT7_9LACT Length = 328 Score = 103 bits (258), Expect = 6e-21, Method: Compositional matrix adjust. Identities = 60/195 (30%), Positives = 103/195 (52%), Gaps = 14/195 (7%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M +S+I+P+YN + C++S+ QT+ +E+I++NDGSTD S EI + + Sbjct: 1 MEYDKLISIIVPIYNVEEYLEECLDSIQCQTYNNIEVILVNDGSTDASKEICERFCRQDS 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 L++Q N G S ARNRG+ + G+++ FVD+DD + M E L+ DD+D+ +C Sbjct: 61 RFHLINQENKGLSGARNRGMSESHGEFITFVDSDDVLKEDMLEQLIKQVTIDDIDIVEC- 119 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMG----VYRRDVIV 177 W + E P + + G + L S ++V + ++RR+VI+ Sbjct: 120 --WYTNDQKEILIPSPEN---VKIIFQGNT--KEALVSLCRDNIVRLNAVAKLFRREVIL 172 Query: 178 KNNIKFIAGLHHQDI 192 N F+ GL ++D+ Sbjct: 173 --NFPFLEGLFYEDV 185 >UniRef50_D0DSK1 Glycosyl transferase n=1 Tax=Lactobacillus fermentum 28-3-CHN RepID=D0DSK1_LACFE Length = 310 Score = 103 bits (258), Expect = 7e-21, Method: Compositional matrix adjust. Identities = 58/195 (29%), Positives = 106/195 (54%), Gaps = 15/195 (7%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S+IIPLYN G C+ S++ Q++T E+I+++DGSTDNS+E+ + + P +++ Sbjct: 2 ISIIIPLYNKGKYIEGCLLSILDQSYTDYEVIVVDDGSTDNSLEVVRELKKKDPRIKIYT 61 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 A+ G S ARN G+E A G+Y+ FVDADD + E L+ +AD Sbjct: 62 NAHGGVSTARNTGLENANGEYITFVDADDTLVDNYLERLLGFK----------DADLVV- 110 Query: 128 ETGETWQSIPTDRLRS-TGVLTGPDWLRMGLSSRRWT--HVVWMGVYRRDVIVKNNIKFI 184 +G +S +R+ + TG + ++ + + + +V +Y+ ++I+KNNI+F Sbjct: 111 -SGLIEESNERNRINAKTGEIRRLNFSEFMFNYQNFPIFSIVCTKLYKSNIILKNNIRFR 169 Query: 185 AGLHHQDIVWTTEFM 199 + +D + ++ Sbjct: 170 DRQYGEDTFFVMNYV 184 >UniRef50_Q4JZC9 Putative glycosyl transferase n=2 Tax=Streptococcus pneumoniae RepID=Q4JZC9_STRPN Length = 326 Score = 103 bits (258), Expect = 7e-21, Method: Compositional matrix adjust. Identities = 63/222 (28%), Positives = 111/222 (50%), Gaps = 13/222 (5%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K+SVIIP+YN R C++S++ QT+ +E+I++NDGS DNS EI + Y Y +++L+ Sbjct: 2 KVSVIIPVYNVEKYLRRCLDSVVNQTYKDIEVILVNDGSPDNSKEICEEYVAKYSNIQLI 61 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADD--EVYPTMYETLMTMALEDDLDVAQ-CNAD 123 +Q NAG ARN G++ TG V FVD+DD E+ Y + D+ V Q Sbjct: 62 NQKNAGLGAARNTGLQYITGNAVTFVDSDDWLELDAIEYYVASMKKSDADIVVTQMIRKK 121 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLR--MGLSSRRWTHVVWMGVYRRDVIVKNNI 181 F G +I + VL + + + + +Y+R+ + + Sbjct: 122 EYFSNEGTNGTTIKEE------VLNQEQFAKKYFKIDGNNIEYYACAKLYKRE--IAREV 173 Query: 182 KFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVS 223 K+ GL +D+ ++ + + Y+ + Y Y+ ++ S++ Sbjct: 174 KYPVGLFAEDVPAAFGYIIRSQKIFYSTKVTYNYFFNDNSLT 215 >UniRef50_A7B4B8 Putative uncharacterized protein n=1 Tax=Ruminococcus gnavus ATCC 29149 RepID=A7B4B8_RUMGN Length = 336 Score = 103 bits (258), Expect = 7e-21, Method: Compositional matrix adjust. Identities = 69/218 (31%), Positives = 108/218 (49%), Gaps = 7/218 (3%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S+I+P+YN TC++SLI QT +E+I+++DGSTD S E+ YAE +R++H Sbjct: 6 VSIIVPVYNVEPYIETCIQSLIRQTMGNIEVILVDDGSTDRSGELCDQYAEADERIRVIH 65 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 + N G S ARN GI A G+Y+ FVD+DD V ++ E ++ A ++ DV D+ Sbjct: 66 KQNGGLSSARNAGISAAKGEYLLFVDSDDYVSASLVEKTVSCAEQNQADVVV--FDYQEI 123 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 E + T L + V+ ++ L T +Y+R I + G Sbjct: 124 ELCSGGKQTRTSALPAGQVIHAESVPQLLL----ITPSACNKLYKRSFWEMTQIVYPEGR 179 Query: 188 HHQDIVWTTEFMFNALRARYTE-QSLYKYYLHNTSVSR 224 + +D+ + A R Y E LY Y L S+ R Sbjct: 180 NFEDLSVIPRILLQAGRVVYLEGDPLYFYVLRVGSIMR 217 >UniRef50_A6BHF1 Putative uncharacterized protein n=1 Tax=Dorea longicatena DSM 13814 RepID=A6BHF1_9FIRM Length = 352 Score = 103 bits (258), Expect = 8e-21, Method: Compositional matrix adjust. Identities = 73/256 (28%), Positives = 131/256 (51%), Gaps = 26/256 (10%) Query: 7 KLSVIIPLYNAGDDFRT-CMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRL 65 K+SVI+P+YNA + T C++S+I Q + +E+I+++DGST++S E+ YA+ +++ Sbjct: 3 KISVIVPIYNAEKKYLTECVDSIIGQDYKNIEVILVDDGSTNSSGEVCDGYADKDERIQV 62 Query: 66 LHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN---- 121 +HQ N G SVARN GI+ ATG+++AFVD DD + ++ +M E D+ N Sbjct: 63 IHQKNQGVSVARNTGIDQATGEWIAFVDVDDWLEKNIFSEVMGQIKEKKTDLVVWNMYMN 122 Query: 122 ------ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHV--VWMGVYRR 173 ++ET T I + +R + T + + + T + + +Y++ Sbjct: 123 YSNVEKVAQNYKETFYTDSKIEIEEIRLRLLRT----ISIYNDEKNITTINYPFCHLYQK 178 Query: 174 DVIVKNNIKFIAGLHH-QDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKN 232 +I + N++F + +D ++ ++ Y + LY Y H+ SVS Sbjct: 179 KIIQQWNVRFDSEFKQGEDKLFNYQYFTKINSILYINKPLYHYRQHSLSVSHKF------ 232 Query: 233 LNYQRHYIKITRLLEK 248 Y+ H TR+L+K Sbjct: 233 --YKEHEENTTRILKK 246 >UniRef50_A6BH38 Putative uncharacterized protein n=1 Tax=Dorea longicatena DSM 13814 RepID=A6BH38_9FIRM Length = 313 Score = 103 bits (258), Expect = 8e-21, Method: Compositional matrix adjust. Identities = 76/263 (28%), Positives = 133/263 (50%), Gaps = 25/263 (9%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 NK+SVI+P++N + C++S+I QT+ LE+I+++DGSTD S + + ++ Sbjct: 3 ANKISVIVPVFNVENYLEKCVKSIIEQTYKNLEVILVDDGSTDKSGFLCDELKKQDYRIK 62 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNA-- 122 ++H+ N G S ARN GI+V+TGKY++F+D+DD + T E +M L ++A CN Sbjct: 63 VIHKTNGGLSDARNAGIQVSTGKYLSFIDSDDYLERTALEQMMQAILISHSEIAICNIMR 122 Query: 123 ---DWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRR--DVIV 177 D C T E + PTD + VL G RR+ + V + D + Sbjct: 123 FYDDGC---TEEFYN--PTD---NQEVLEGI---------RRFDTLNQPSVCNKLFDAKL 165 Query: 178 KNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNY-Q 236 NI F G ++D E ++ A + T ++ Y Y S+ + N+ ++ + Sbjct: 166 FANISFPYGKFYEDTYVYHELLYKADKVVLTGKTGYWYLSRKGSILGRSQFSNRYFDFIE 225 Query: 237 RHYIKITRLLEKLNRNYADKIMI 259 Y++ L+++ + Y D+ + Sbjct: 226 AVYMRADFLIKRDIQPYGDEAFL 248 >UniRef50_C9RN10 Glycosyl transferase family 2 n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RN10_FIBSS Length = 318 Score = 103 bits (257), Expect = 8e-21, Method: Compositional matrix adjust. Identities = 73/239 (30%), Positives = 114/239 (47%), Gaps = 19/239 (7%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K+SVI+P+YN + C+ESL+ Q+++ E+I+++DGSTD S I +YA +++ Sbjct: 3 KISVIVPVYNVEKYIKECIESLLNQSFSDFELILVDDGSTDKSASICDFYASKDSRIKVF 62 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 H+ N G S ARN GIE + G+++AFVD+DD V E L A DL C + Sbjct: 63 HKQNGGVSSARNLGIEKSVGEWIAFVDSDDYVDAKYLENLYKSAYSCDL--ISCGF-FVT 119 Query: 127 RETGETWQS--IPTDRLRSTGV--LTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIK 182 E GE S P+ + + DW + + W H++ + +I N++K Sbjct: 120 NEFGEKTNSNVNPSGFFYQKDIADVVNNDW----IVTSPWAHLI-----KTSLIKDNDVK 170 Query: 183 FIAGLH-HQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYI 240 F +D ++ + A R + L YY T S LH K Y Y+ Sbjct: 171 FYENRSMGEDTIFVLSCLDKANCIRNIPELL--YYYRTTPNSLLHPDERKYGKYIEDYM 227 >UniRef50_Q14I39 Glycosyl transferase family protein n=18 Tax=Francisella RepID=Q14I39_FRAT1 Length = 335 Score = 103 bits (257), Expect = 8e-21, Method: Compositional matrix adjust. Identities = 57/185 (30%), Positives = 98/185 (52%), Gaps = 6/185 (3%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 +N +S+IIP+YN C+ES+I QT+ LEII+INDG+TDNS+ I + Y Sbjct: 4 LNYKQLISIIIPIYNTQQYLSRCLESVINQTYKNLEIILINDGATDNSLSICQKYKSKDS 63 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 + LL+Q N+G ++ARN +++A G Y+AF+D+DD V + L D+A Sbjct: 64 RIVLLNQQNSGQALARNNALDIAKGDYIAFIDSDDWVSLDYIQALYNHVFSYSADIAISA 123 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNI 181 C ++ + + IP++ + D + S++ + + + +R ++ K Sbjct: 124 TVGCNKQI-KAAEIIPSN----INIFDNNDAIIKAFLSKQLSSMACGSLIKRKLLDKQRF 178 Query: 182 K-FIA 185 + FIA Sbjct: 179 RNFIA 183 >UniRef50_B2UQ58 Glycosyl transferase family 2 n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UQ58_AKKM8 Length = 384 Score = 103 bits (257), Expect = 8e-21, Method: Compositional matrix adjust. Identities = 72/228 (31%), Positives = 111/228 (48%), Gaps = 19/228 (8%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S+I+P YN +C+ESL+ QT +EII INDGSTD + + E + LL+ Sbjct: 5 VSIIVPCYNVAAYVDSCLESLVRQTLRNIEIICINDGSTDETWTHLLRWKEKDSRIILLN 64 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 Q NAG S ARN G++ A G YV F D DD + P MY L + ALE D D+ +C + F Sbjct: 65 QRNAGVSAARNAGLDAARGLYVGFADPDDYMDPEMYSRLFSAALEYDADIVEC-GNHVFE 123 Query: 128 ETG----ETWQSIPT----DRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKN 179 ++ E + P+ + W +M + VW ++R+ ++ + Sbjct: 124 DSSDRIIEAKRRSPSRHFEENASPASFFRDSIWGKMDI-------CVWSKLFRKSMLDAH 176 Query: 180 NIKFIAGLHH--QDIVWTTEFMFNALRARYTEQSLYKYYL-HNTSVSR 224 ++F L +D + + +A R + LY Y L N S+SR Sbjct: 177 RLRFNVHLKSGAEDETFRLMAVPHASRLLFIPDCLYYYRLMRNGSLSR 224 >UniRef50_B0P6V0 Putative uncharacterized protein n=1 Tax=Anaerotruncus colihominis DSM 17241 RepID=B0P6V0_9FIRM Length = 343 Score = 103 bits (257), Expect = 8e-21, Method: Compositional matrix adjust. Identities = 67/210 (31%), Positives = 103/210 (49%), Gaps = 8/210 (3%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +SVI+P+YN +D R MESL Q++ +++DGSTD S + A +L Sbjct: 4 ISVIVPIYNVKNDLRAAMESLRAQSFGDFRAWLVDDGSTDGSGAVCDELAREDARFCVLR 63 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 Q N+GA ARN + G+YV F+DADD + E L+ A + D D+A C +R Sbjct: 64 QKNSGAHAARNTALANCQGEYVYFMDADDLIERETLERLLQCARQTDADIAVCGYQIEYR 123 Query: 128 ETGE--TWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIA 185 G +S+P G++ P L GL++ W +YRR +++ N+ F A Sbjct: 124 RAGHLADCKSVPG----FDGIMAIPGALPDGLTT-ELIFPPWNKLYRRSFLMRYNLHFPA 178 Query: 186 GLHHQDIVWTTEFMFNALRARYTEQSLYKY 215 + D ++ TE A R + S Y+Y Sbjct: 179 -TYWDDTIFNTEAFGYARRLGFIRYSGYRY 207 >UniRef50_A4CKB6 Putative uncharacterized protein n=2 Tax=Flavobacteriales RepID=A4CKB6_9FLAO Length = 344 Score = 103 bits (257), Expect = 9e-21, Method: Compositional matrix adjust. Identities = 74/251 (29%), Positives = 126/251 (50%), Gaps = 10/251 (3%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTA--LEIIIINDGSTDNSVEIAKYYAENYPHVR 64 +LS++IPLYN CM+SL+ Q E+I+++DGS D S IA YA + +VR Sbjct: 2 RLSILIPLYNKEQYIDRCMDSLLAQGLAPGDFEVILVDDGSKDASAAIAGRYAAEHGNVR 61 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNA-D 123 L+ Q NAG S ARNR +E A G YV F+DADD + + L+ + E++LD+ + N+ D Sbjct: 62 LICQENAGPSAARNRALEAAGGDYVYFLDADDLLAGHVAGRLLDICEENNLDILEFNSRD 121 Query: 124 WCFRETGETWQSIPTDRLRS-TGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIK 182 + P D S T V G + +S + + W +R ++++ I+ Sbjct: 122 IENHLVADADLEGPPDGTESVTPVADGLTF----ISRYYFRNQAWRYFIKRAYLLESGIR 177 Query: 183 FIAGLH-HQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIK 241 F + ++D+++T + ++ R E ++Y + S+ + NL + + +K Sbjct: 178 FPEDMRAYEDLIFTASTILHSNRMARVELDAHRYVIVPGSIV-TSKDPEINLGFIQGMVK 236 Query: 242 ITRLLEKLNRN 252 L +L R Sbjct: 237 AVGELNRLIRG 247 >UniRef50_C3JBV0 Capsular polysaccharide biosynthsis protein n=2 Tax=Bacteria RepID=C3JBV0_9PORP Length = 326 Score = 103 bits (257), Expect = 9e-21, Method: Compositional matrix adjust. Identities = 65/227 (28%), Positives = 118/227 (51%), Gaps = 18/227 (7%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTA---LEIIIINDGSTDNSVEIAKYYAENYPHVR 64 S+IIP+YN + C+ES TQ+ A E+I+++DGS D S+ +A+ +P+ R Sbjct: 6 FSLIIPVYNVALYVQECLESCFTQSLIAPEEYEVIVVDDGSKDESISLAQPIIAKHPNAR 65 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALE-DDLDVAQCNAD 123 ++HQ N G S ARN G VATG+YV F+D+DD + P + + L + E ++ D+ ++ Sbjct: 66 IIHQENRGLSEARNAGFRVATGRYVWFIDSDDFITPGILKQLKSFLEEANNPDMVSLSS- 124 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 R GE + + T ++G + L G + V +Y+R+ ++ N + F Sbjct: 125 ---RLFGEGYIRTLELHPKFTEPMSGLELLDSG----QQIIPVHHYLYKRNFLLSNQLTF 177 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKY------YLHNTSVSR 224 ++H+D ++ ++ A + + Y Y +H+T++ R Sbjct: 178 YPRIYHEDEEFSPRALYFAREVAFFQGVAYHYRQRPNSIMHSTNLKR 224 >UniRef50_Q2NH62 Predicted glycosyltransferase n=1 Tax=Methanosphaera stadtmanae DSM 3091 RepID=Q2NH62_METST Length = 394 Score = 103 bits (257), Expect = 9e-21, Method: Compositional matrix adjust. Identities = 79/258 (30%), Positives = 136/258 (52%), Gaps = 36/258 (13%) Query: 7 KLSVIIPLYNAGDDF--RTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 K+SVI+P+YN G+ + +TC+ +L QT T +E+I +NDGSTDNS+ I + A+ Y ++ Sbjct: 3 KISVIMPVYN-GEKYLEKTCL-NLSKQTLTDIELICVNDGSTDNSLNILEKLADKYDFIK 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTM--YETLMTMALEDDLDVAQCNA 122 +++Q N G+ VARN GIE A G+Y+AF+DADD +Y + E + + D D+ N Sbjct: 61 IINQENQGSGVARNNGIENAIGEYIAFLDADD-IYVDVDALEKMYEYGSKHDADMVGANQ 119 Query: 123 -----------DWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVY 171 ++ +++ T+ S GV++ ++ G+ W + ++ Sbjct: 120 KRVSIDGNLEDNFNYKQKNYTYFS-------DYGVISPQEY---GIP---WA--FYKNIF 164 Query: 172 RRDVIVKNNIKFIAGLHHQDIVWTTEFM--FNALRARYTEQSLYKYYLHNTSVSRLHRQG 229 +R + K+NI F QD V+ E + N + T+ Y Y L + S+++ Sbjct: 165 KRSFLNKHNITFPNLKRGQDPVFLAEVLTKVNQIYVVCTDLYGYNYALGGGANSKVNDY- 223 Query: 230 NKNLNYQRHYIKITRLLE 247 NK L+Y HY ++L+ Sbjct: 224 NKKLDYMNHYKMTFKILD 241 >UniRef50_D2TC09 Predicted glycosyl transferase, family 2 n=2 Tax=Erwinia pyrifoliae RepID=D2TC09_ERWPY Length = 329 Score = 103 bits (257), Expect = 9e-21, Method: Compositional matrix adjust. Identities = 67/232 (28%), Positives = 120/232 (51%), Gaps = 18/232 (7%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M+ S LSVI+P +N C+ESL +Q +E+I+INDGS+DNS +A Y Sbjct: 1 MLTSPPLLSVIVPFFNNEAFLIPCLESLFSQIGNDIEVILINDGSSDNSAHLASQCLARY 60 Query: 61 PH--VRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVA 118 P VR + Q N+G + RN G++ A+G+Y+ F+D DD + P + L + L D+ Sbjct: 61 PEARVRYIAQQNSGIASTRNVGMQHASGRYMTFLDGDDVLSPHYVDILKPVLLSGQYDL- 119 Query: 119 QCNADWCFRETGETWQSIPTDRLRSTGV-LTGPDWLRMGLSSRRWTHV-----VWMGVYR 172 + D+ + + P + + GV + D+ + G+S +W VY+ Sbjct: 120 -IDFDY------QRFIHNPPEIHKQEGVRIHEYDFEKKGISCLEPLFTQSMWHLWNRVYK 172 Query: 173 RDVIVKNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSR 224 R+++ K +F +G ++D+++T F + R + + +LY Y ++ ++R Sbjct: 173 RELLDKE--RFESGRRYEDVIFTPFIYFKSERMAHLDHTLYFYRDNSQGITR 222 >UniRef50_D0WW97 Putative uncharacterized protein n=2 Tax=Vibrio alginolyticus 40B RepID=D0WW97_VIBAL Length = 293 Score = 103 bits (257), Expect = 9e-21, Method: Compositional matrix adjust. Identities = 70/242 (28%), Positives = 126/242 (52%), Gaps = 16/242 (6%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M N+ +S+I+P+YN + C++S + QT+ +EII I DGS DNS+ IAK Y + Sbjct: 1 MSNNNPLVSIIVPIYNVEEYLPECLDSAVQQTYRNIEIIAIIDGSQDNSINIAKQYRDED 60 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 P ++++ Q N G S ARN GI+ + G ++ F+D+DD + E L+ + D+ + Sbjct: 61 PRIKVIEQTNQGLSAARNTGIKNSRGDFLFFLDSDDWLEKDAIEKLLLHSTGYDIVSGRV 120 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN 180 F + + ++ R ++ D + S VVW +YR++V+ +N Sbjct: 121 TK---FYQEDKRYED-----YRKRRIIGKLDLDKKFFS---LELVVWNKLYRKEVL--DN 167 Query: 181 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYI 240 I F+ GL H+D + + LR ++ Y + S+ ++ + N ++NYQ +++ Sbjct: 168 IWFVPGLIHEDEEFYWQVFTKKLRIYSICDNVIFYRIRKNSI--MNSKVN-DVNYQNNFV 224 Query: 241 KI 242 KI Sbjct: 225 KI 226 >UniRef50_C9LAR5 Putative beta1,3-glucosyltransferase WaaV n=1 Tax=Blautia hansenii DSM 20583 RepID=C9LAR5_RUMHA Length = 348 Score = 103 bits (257), Expect = 1e-20, Method: Compositional matrix adjust. Identities = 62/204 (30%), Positives = 116/204 (56%), Gaps = 16/204 (7%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M+S K+SVIIP+YNA + ++S++ Q+ ++I++INDGSTD+++ I + YA+ Sbjct: 6 MSSEYKVSVIIPVYNAEKTIKKAIDSVLAQSMNNIQIVVINDGSTDSTLNILQSYAD-LK 64 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 +++L++Q NAG S ARN GI+ A G+++ F+D+DD + P + E ++ A +++LD+ C+ Sbjct: 65 NLKLINQKNAGVSTARNNGIKNADGEFLFFLDSDDWLDPYLLEMMINYAEKNNLDLVACS 124 Query: 122 ADWCFRETGETW----QSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIV 177 E T Q++ T + T G + + ++R+++I Sbjct: 125 H----IERNATLYGGNQNLATKFVARTLEEIGSKVFDI------FPKSACAKIFRKEIIQ 174 Query: 178 KNNIKFIAGLH-HQDIVWTTEFMF 200 KN+I F + +D+ +T F+ Sbjct: 175 KNDIFFPEKMQLGEDLYFTYLFLL 198 >UniRef50_Q9CH23 Teichoic acid biosynthesis protein n=1 Tax=Lactococcus lactis subsp. lactis RepID=Q9CH23_LACLA Length = 963 Score = 103 bits (257), Expect = 1e-20, Method: Compositional matrix adjust. Identities = 61/197 (30%), Positives = 100/197 (50%), Gaps = 16/197 (8%) Query: 3 NSTNKLSVIIPLYNAGDDFRTCMESLITQ--TWTALEIIIINDGSTDNSVEIAKYYAENY 60 N KL++I+P+YN D + C+ SL+ Q L+II++++GSTDNS +IA+ + E Y Sbjct: 7 NGRKKLAIIVPIYNVADYLKDCLNSLLKQGVPEKELQIILVDEGSTDNSAQIAQKFVEKY 66 Query: 61 PHVRLLHQ-ANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQ 119 P+ LHQ NAG ARNRG +A +++ +VD+DD V P YE ++ + + Sbjct: 67 PNFFELHQFENAGLGAARNRGTRLARARFITYVDSDDIVAPNSYEYMLGILERTGSQIIT 126 Query: 120 CNADWCFRETGETWQSIPT----DRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDV 175 N F E +I +T + P+ + W W ++ D Sbjct: 127 GNVH-RFNSKKEWLNNIHARSHLGDFENTSLKEHPELI--------WDSTSWNKIFCLDF 177 Query: 176 IVKNNIKFIAGLHHQDI 192 + +NN+ F G+ ++DI Sbjct: 178 LKENNLYFPEGVLYEDI 194 >UniRef50_A3KLK8 Cps5J protein n=2 Tax=Streptococcus agalactiae RepID=A3KLK8_STRAG Length = 321 Score = 103 bits (257), Expect = 1e-20, Method: Compositional matrix adjust. Identities = 67/221 (30%), Positives = 115/221 (52%), Gaps = 7/221 (3%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K+S+IIP+YN C++S+I QT+ LEII+I+DGSTD S + A+ + + Sbjct: 3 KISIIIPVYNVKKYLNDCIQSVINQTYNNLEIILIDDGSTDGSGDYCDEIAKKDSRIFVY 62 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 H+ N G S ARN GI+++TGKY+ F+D+DD + ETL +E D+A N + Sbjct: 63 HKTNGGLSEARNVGIKISTGKYITFIDSDDYIENLYIETLYNSLIEYKADIAIVNK-YHL 121 Query: 127 RETGET---WQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 ++ + S R+ T ++ + R T + + +YR+++ ++I+F Sbjct: 122 KDNEKIVHLANSKELFRVFDTKMILESMYSRENNFISTLT-MAQLKLYRKELF--SDIEF 178 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSR 224 G ++D T + A R + + LY Y + S++R Sbjct: 179 PIGKKYEDSFTTYKTYIEANRIVFVNRPLYAYRFRSGSITR 219 >UniRef50_Q8XN34 Spore coat polysaccharide biosynthesis protein n=1 Tax=Clostridium perfringens RepID=Q8XN34_CLOPE Length = 348 Score = 103 bits (256), Expect = 1e-20, Method: Compositional matrix adjust. Identities = 64/220 (29%), Positives = 109/220 (49%), Gaps = 3/220 (1%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K+S+I+P YN C+ SLI QT+ +EIII++DGSTDN++ A +++L Sbjct: 5 KVSIIVPAYNIESYVERCLNSLINQTYENIEIIIVDDGSTDNTLVKISDMARKDKRIKVL 64 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 Q N G+S AR +G E++TG+++ FVD DD + E L+ + DD+D+ + F Sbjct: 65 EQENRGSSEARKKGYEMSTGEFILFVDGDDWIRNDTIEVLLEYS-NDDIDIVAFGYNEIF 123 Query: 127 RETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAG 186 E S+ D+ L +LR L++ + +W R+ I KNN+ F Sbjct: 124 NENEIQKNSLLYDKNNEKIELRDTQFLREILTNNISIN-IWNKFIRKKFIDKNNVVFPMK 182 Query: 187 LHH-QDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRL 225 + + +D+ ++ LY Y+ S++ + Sbjct: 183 MSYAEDLALLISLAAKEPNVIVIKEKLYFYFKRENSITSI 222 >UniRef50_C5NWI7 Glycosyl transferase, group 2 family n=1 Tax=Gemella haemolysans ATCC 10379 RepID=C5NWI7_9BACL Length = 328 Score = 103 bits (256), Expect = 1e-20, Method: Compositional matrix adjust. Identities = 69/252 (27%), Positives = 119/252 (47%), Gaps = 12/252 (4%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +SVI+P+YN D M+SL QT+ EII++NDGSTDNS ++ Y+E Y +VR H Sbjct: 2 ISVILPVYNVEDYLHYAMDSLEKQTYKNFEIILVNDGSTDNSGKLCDEYSEKYSNVRAFH 61 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 + N G S ARN GI+ A G+++ F+D DD + E L+ + + D D+ Sbjct: 62 KENGGLSDARNFGIQKAKGEFITFLDPDDYLEAYSLELLVGIQEKHDCDIVSTRVKATES 121 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 + S+ + + +L + T +YR+ ++ I F G Sbjct: 122 YNVYSDYSLVEKDFENVTHMDNDVFLEEAFYDKVVTVSACGKLYRKSIL---EIPFPKGK 178 Query: 188 HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSV--SRLHRQG-------NKNLNYQRH 238 ++D+ ++ + A + +T +Y YY S+ S+ ++ +KN + + Sbjct: 179 IYEDLFVISDHVTRAQKIVHTSLQIYNYYRRPGSIVNSKFSKKQYDFFEAIDKNREFAKK 238 Query: 239 YIKITRLLEKLN 250 + K LL +N Sbjct: 239 FSKEKDLLPAIN 250 >UniRef50_B0NLZ9 Putative uncharacterized protein n=1 Tax=Bacteroides stercoris ATCC 43183 RepID=B0NLZ9_BACSE Length = 305 Score = 103 bits (256), Expect = 1e-20, Method: Compositional matrix adjust. Identities = 72/210 (34%), Positives = 110/210 (52%), Gaps = 5/210 (2%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 ++SVIIP+YN R ++S+ QT LEII+I+DGSTD S I + YA+ +R Sbjct: 3 EVSVIIPIYNTAAYLRKALDSICGQTLKELEIILIDDGSTDGSRGIIEEYAKRDARIRWY 62 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 Q N G SVARNRG+ ATGKY+ F+D+DD + + + +LD +A Sbjct: 63 AQPNQGLSVARNRGLLHATGKYIYFMDSDDILDTQALRRCHDLCEKRELDFIFFDAKP-L 121 Query: 127 RETGETWQSIPT-DRLRST--GVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 E+ ++ Q+IP DR V +G + L LS+R + V++ + R + K F Sbjct: 122 TESADS-QNIPNYDRKGKIDESVHSGMEMLDYELSNRLFLTPVYLCMVNRLFLKKCFTCF 180 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLY 213 G+ H+D +T NA R Y ++ + Sbjct: 181 HPGIIHEDHSFTMMIYLNAQRTCYVPEAFF 210 >UniRef50_D2BMR3 Glycosyltransferase, family 2 n=1 Tax=Lactococcus lactis subsp. lactis KF147 RepID=D2BMR3_LACLK Length = 337 Score = 103 bits (256), Expect = 1e-20, Method: Compositional matrix adjust. Identities = 48/114 (42%), Positives = 71/114 (62%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K+SVI+P+YN R C++S+I Q++ E+++I+DGS DNS +I Y + VR+ Sbjct: 3 KISVIVPVYNVEKLLRRCLDSIINQSFEDFELLLIDDGSKDNSGQICDDYKKQDERVRVW 62 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 H +N G S ARN GI+ G ++AF+D+DD V E L LED++DV C Sbjct: 63 HISNGGQSAARNLGIDNVQGDFIAFIDSDDFVELDYLEKLYQPMLEDEVDVVSC 116 >UniRef50_B6W7G9 Putative uncharacterized protein n=1 Tax=Anaerococcus hydrogenalis DSM 7454 RepID=B6W7G9_9FIRM Length = 351 Score = 102 bits (255), Expect = 1e-20, Method: Compositional matrix adjust. Identities = 65/192 (33%), Positives = 105/192 (54%), Gaps = 5/192 (2%) Query: 5 TNKL-SVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 TN+L S+++P+YNA + +ESLI Q + LEII ++DGSTDNS+ I K Y +N + Sbjct: 2 TNQLVSIVMPIYNAESYLKDSIESLINQKYKNLEIICVDDGSTDNSLRILKNYKKNDDRI 61 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 ++L Q N A VARN G+ A GKY+ F+D+DD M L+ A + + D+ Sbjct: 62 KILKQRNQFAGVARNNGLNNANGKYIMFLDSDDIFEKNMIYNLVNKAEKYNTDIIFF-GF 120 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 + F ET + +S+ ++ V++ D + ++ V W Y R+ I+ + IKF Sbjct: 121 YKFTETIKK-RSVIGIPYKNKNVISPIDIKSVIF--QKAQGVPWNKFYNREFILNSGIKF 177 Query: 184 IAGLHHQDIVWT 195 + DI ++ Sbjct: 178 QNLKSNNDIFFS 189 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P11290 Uncharacterized glycosyltransferase yibD n=58 Ta... 493 e-138 UniRef50_B4SXB8 Glycosyl transferase, group 2 family protein n=3... 465 e-129 UniRef50_A8GFL8 Glycosyl transferase family 2 n=1 Tax=Serratia p... 376 e-103 UniRef50_D1RQX9 Putative glycosyl transferase n=1 Tax=Serratia o... 373 e-102 UniRef50_B5XTH2 Glycosyl transferase, group 2 family protein n=2... 323 5e-87 UniRef50_A8SCX5 Putative uncharacterized protein n=1 Tax=Faecali... 289 8e-77 UniRef50_A9VPZ1 Glycosyl transferase family 2 n=3 Tax=Firmicutes... 287 4e-76 UniRef50_C7H3W2 Capsular polysaccharide biosynthesis protein n=1... 287 4e-76 UniRef50_B0ACE9 Putative uncharacterized protein n=2 Tax=Firmicu... 285 2e-75 UniRef50_C0WDB4 Ss-1,4-galactosyltransferase n=1 Tax=Acidaminoco... 284 2e-75 UniRef50_C4FFR1 Putative uncharacterized protein n=1 Tax=Bifidob... 282 2e-74 UniRef50_Q5LIR2 Putative glycosyltransferase n=6 Tax=Bacteroides... 280 5e-74 UniRef50_D1NJF8 Glycosyl transferase family 2 n=1 Tax=Clostridiu... 280 6e-74 UniRef50_B5CP79 Putative uncharacterized protein n=1 Tax=Ruminoc... 279 9e-74 UniRef50_A6NTT2 Putative uncharacterized protein n=1 Tax=Bactero... 279 1e-73 UniRef50_B6XVK3 Putative uncharacterized protein n=1 Tax=Bifidob... 278 2e-73 UniRef50_A7VPB1 Putative uncharacterized protein n=1 Tax=Clostri... 277 3e-73 UniRef50_B7AUI4 Putative uncharacterized protein n=1 Tax=Bactero... 274 3e-72 UniRef50_B6FJ78 Putative uncharacterized protein n=1 Tax=Clostri... 274 3e-72 UniRef50_C5S6D6 Methyltransferase FkbM family n=1 Tax=Allochroma... 274 3e-72 UniRef50_B4F184 Glycosyl transferase n=2 Tax=Proteus mirabilis R... 274 4e-72 UniRef50_C8NFZ4 Glycosyl transferase CpsJ(V) n=1 Tax=Granulicate... 274 5e-72 UniRef50_P71059 Uncharacterized glycosyltransferase epsJ n=3 Tax... 274 5e-72 UniRef50_D1BQ92 Glycosyl transferase family 2 n=1 Tax=Veillonell... 273 6e-72 UniRef50_A1A383 Putative glycosyltransferase n=1 Tax=Bifidobacte... 273 8e-72 UniRef50_C0BRJ5 Putative uncharacterized protein n=1 Tax=Bifidob... 273 8e-72 UniRef50_A7B4B8 Putative uncharacterized protein n=1 Tax=Ruminoc... 272 1e-71 UniRef50_D1PNS5 Capsular polysaccharide biosythesis protein CpsI... 272 2e-71 UniRef50_A7VCQ5 Putative uncharacterized protein n=1 Tax=Clostri... 271 2e-71 UniRef50_D2EWI1 Glycoside transferase family 2 protein n=3 Tax=B... 270 3e-71 UniRef50_B5LBK9 Putative glycosyl transferase n=1 Tax=Streptococ... 270 4e-71 UniRef50_B7GUL7 Glycosyl transferase, family 2 n=1 Tax=Bifidobac... 269 1e-70 UniRef50_C6ID60 Glycosyltransferase n=5 Tax=Bacteroides RepID=C6... 269 1e-70 UniRef50_C9BVV0 Glycosyl transferase n=3 Tax=Enterococcus faeciu... 269 1e-70 UniRef50_A6NTT3 Putative uncharacterized protein n=1 Tax=Bactero... 269 2e-70 UniRef50_C5NWF7 Glycosyl transferase, group 2 family n=2 Tax=Fir... 268 3e-70 UniRef50_C6LFI9 Glycosyl transferase, group 2 family n=1 Tax=Bry... 267 3e-70 UniRef50_C0C2F7 Putative uncharacterized protein n=1 Tax=Clostri... 266 6e-70 UniRef50_C4FMC2 Putative uncharacterized protein n=3 Tax=Firmicu... 266 9e-70 UniRef50_B5CU57 Putative uncharacterized protein n=1 Tax=Bactero... 266 1e-69 UniRef50_Q67AY0 Cps8J n=6 Tax=Streptococcus RepID=Q67AY0_STRAG 266 1e-69 UniRef50_C7H3W5 Glycosyl transferase CpsJ n=1 Tax=Faecalibacteri... 265 1e-69 UniRef50_D2Q666 Glycosyl transferase, family 2 n=4 Tax=cellular ... 265 2e-69 UniRef50_D2Q658 Glycosyl transferase, family 2 n=3 Tax=Bacteria ... 265 2e-69 UniRef50_UPI0001B4A4FE ss-1,4-galactosyltransferase n=1 Tax=Bact... 265 2e-69 UniRef50_D2Q655 Glycosyl transferase, family 2 n=2 Tax=Bifidobac... 264 2e-69 UniRef50_A3CKM9 Glycosyltransferase, family 2/glycosyltransferas... 264 3e-69 UniRef50_C7UWL4 Glycosyl transferase n=6 Tax=Enterococcus RepID=... 264 3e-69 UniRef50_P71057 Putative glycosyltransferase epsH n=3 Tax=Bacill... 264 4e-69 UniRef50_C0BRK2 Putative uncharacterized protein n=1 Tax=Bifidob... 264 4e-69 UniRef50_B3JEB6 Putative uncharacterized protein n=1 Tax=Bactero... 263 5e-69 UniRef50_B2UQP2 Glycosyl transferase family 2 n=1 Tax=Akkermansi... 263 5e-69 UniRef50_A4VTV1 Glycosyltransferases involved in cell wall bioge... 263 7e-69 UniRef50_C2DBB8 Glycosyltransferase n=19 Tax=Bacteria RepID=C2DB... 262 2e-68 UniRef50_A6L4H5 Glycosyltransferase family 2 n=2 Tax=Bacteroides... 261 2e-68 UniRef50_C0BRQ0 Putative uncharacterized protein n=2 Tax=Bifidob... 261 3e-68 UniRef50_B4AD18 Glycosyltransferase n=18 Tax=Enterococcus RepID=... 261 3e-68 UniRef50_C9RJB8 Glycosyl transferase family 2 n=1 Tax=Fibrobacte... 261 3e-68 UniRef50_D2Q661 Glycosyl transferase family 2 n=2 Tax=Bifidobact... 261 3e-68 UniRef50_D2Q652 Glycosyltransferase, family 2 n=1 Tax=Bifidobact... 260 4e-68 UniRef50_A6L7A5 Glycosyltransferase family 2 n=6 Tax=Bacteroides... 260 5e-68 UniRef50_C0FV15 Putative uncharacterized protein n=1 Tax=Rosebur... 260 5e-68 UniRef50_B2UYP7 Minor teichoic acid biosynthesis protein GgaB n=... 260 7e-68 UniRef50_B1QTB6 Glycosyl transferase, group 2 family protein n=4... 260 7e-68 UniRef50_C3Q1B0 Glycosyltransferase family 2 protein n=2 Tax=Bac... 259 9e-68 UniRef50_A5UNM1 Glycosyltransferase/CDP-glycerol:poly(Glyceropho... 259 1e-67 UniRef50_Q9F0B8 Beta(1,4)galactosyltransferase EpsJ n=1 Tax=Lact... 259 1e-67 UniRef50_A8SZ40 Putative uncharacterized protein n=1 Tax=Coproco... 258 2e-67 UniRef50_C6LCR2 Glycosyl transferase CpsJ n=1 Tax=Bryantella for... 257 3e-67 UniRef50_C2M8C3 Glycosyl transferase, family 2 n=1 Tax=Capnocyto... 257 3e-67 UniRef50_Q03KL3 Glycosyltransferase involved in cell wall biogen... 257 4e-67 UniRef50_B2ISC6 Glycosyl transferase, family 2/glycosyl transfer... 257 4e-67 UniRef50_D1A4K2 CDP-glycerol:poly(Glycerophosphate)glycerophosph... 257 4e-67 UniRef50_A7VPB2 Putative uncharacterized protein n=1 Tax=Clostri... 257 6e-67 UniRef50_A6NTS0 Putative uncharacterized protein n=1 Tax=Bactero... 257 6e-67 UniRef50_C5TEG1 Glycosyl transferase family 2 n=1 Tax=Zymomonas ... 257 6e-67 UniRef50_B2V0U1 Glycosyl transferase, group 2 family protein n=1... 256 1e-66 UniRef50_C3R5C9 Glycosyl transferase WciV n=3 Tax=Bacteroides Re... 255 1e-66 UniRef50_Q8GPB8 Eps6I n=5 Tax=Streptococcus RepID=Q8GPB8_STRTR 255 2e-66 UniRef50_B0P2L7 Putative uncharacterized protein n=1 Tax=Clostri... 255 2e-66 UniRef50_D2RJ84 Glycosyl transferase family 2 n=1 Tax=Acidaminoc... 254 3e-66 UniRef50_A6BH37 Putative uncharacterized protein n=1 Tax=Dorea l... 254 3e-66 UniRef50_C9LAR4 Glycosyl transferase, family 2/glycosyl transfer... 254 3e-66 UniRef50_Q9F0B5 Beta(1,4)galactosyltransferase EpsM n=1 Tax=Lact... 254 3e-66 UniRef50_B1JIU1 Glycosyl transferase family 2 n=28 Tax=Enterobac... 253 5e-66 UniRef50_C0BWY1 Putative uncharacterized protein n=1 Tax=Clostri... 253 8e-66 UniRef50_B3DP51 Glycosyltransferase for cell wall membrane n=10 ... 253 9e-66 UniRef50_C2LRT9 Glycosyl transferase, family 2/glycosyl transfer... 252 1e-65 UniRef50_UPI0001B5362C putative glycosyl transferase n=1 Tax=Str... 252 1e-65 UniRef50_B9AFV3 Putative uncharacterized protein n=1 Tax=Methano... 252 1e-65 UniRef50_B2RH99 Probable glycosyltransferase n=2 Tax=Porphyromon... 252 1e-65 UniRef50_D1JY96 Putative uncharacterized protein n=1 Tax=Bactero... 252 1e-65 UniRef50_D2RIJ6 Glycosyl transferase family 2 n=1 Tax=Acidaminoc... 252 1e-65 UniRef50_D1NRU6 Glycosyl transferase CpsJ n=1 Tax=Bifidobacteriu... 252 1e-65 UniRef50_C4K3K5 Glycosyltransferase group 2 n=3 Tax=Enterobacter... 252 1e-65 UniRef50_Q5LBM4 Putative glycosyltransferase protein n=4 Tax=Bac... 251 2e-65 UniRef50_C0X9Z9 Glycosyltransferase n=1 Tax=Lactobacillus gasser... 251 2e-65 UniRef50_Q815A0 Glycosyltransferase n=6 Tax=Bacillus cereus RepI... 251 2e-65 UniRef50_B0ZTM8 Putative uncharacterized protein n=1 Tax=Campylo... 251 3e-65 UniRef50_C3WCV9 Glycosyl transferase n=1 Tax=Fusobacterium morti... 250 4e-65 UniRef50_D0T9V5 Glycosyltransferase family 2 n=1 Tax=Bacteroides... 250 4e-65 UniRef50_O87182 CpsIaI n=15 Tax=Streptococcus agalactiae RepID=O... 250 5e-65 UniRef50_C0CHF0 Putative uncharacterized protein n=2 Tax=Blautia... 250 6e-65 UniRef50_A0Q780 Glycosyl transferase, family 2 n=16 Tax=Francise... 250 7e-65 UniRef50_D0R4M4 Glycosyltransferase n=1 Tax=Lactobacillus johnso... 249 8e-65 UniRef50_B0ABV6 Putative uncharacterized protein n=1 Tax=Clostri... 249 8e-65 UniRef50_O86893 Putative glycosyl transferase n=1 Tax=Streptococ... 249 9e-65 UniRef50_C7H2S4 Glycosyl transferase, group 2 family n=3 Tax=Clo... 249 1e-64 UniRef50_A5ZCV6 Putative uncharacterized protein n=1 Tax=Bactero... 249 1e-64 UniRef50_B9YAB2 Putative uncharacterized protein n=1 Tax=Holdema... 248 1e-64 UniRef50_C7D258 TagF domain-containing protein n=4 Tax=Enterococ... 248 2e-64 UniRef50_B1R2U2 Glycosyl transferase, group 2 family protein n=4... 248 2e-64 UniRef50_C8WM64 Glycosyl transferase family 2 n=1 Tax=Eggerthell... 248 3e-64 UniRef50_B0P2L5 Putative uncharacterized protein n=1 Tax=Clostri... 248 3e-64 UniRef50_UPI0001968EF9 hypothetical protein BACCELL_04174 n=1 Ta... 247 3e-64 UniRef50_B2ILQ1 Ss-1,4-galactosyltransferase n=6 Tax=Streptococc... 247 3e-64 UniRef50_Q8XN54 Capsular polysaccharide biosynthsis protein n=1 ... 247 5e-64 UniRef50_D0ANJ0 Putative uncharacterized protein n=1 Tax=Enteroc... 247 5e-64 UniRef50_B2UQ58 Glycosyl transferase family 2 n=1 Tax=Akkermansi... 247 5e-64 UniRef50_A7V2P1 Putative uncharacterized protein n=3 Tax=Bactero... 247 5e-64 UniRef50_C4G9K6 Putative uncharacterized protein n=1 Tax=Shuttle... 247 5e-64 UniRef50_A6BH38 Putative uncharacterized protein n=1 Tax=Dorea l... 247 6e-64 UniRef50_A2STD8 Glycosyl transferase, family 2 n=1 Tax=Methanoco... 247 7e-64 UniRef50_Q93DZ7 Eps1I n=1 Tax=Streptococcus thermophilus RepID=Q... 246 7e-64 UniRef50_A8FHL3 Glycosyltransferase n=2 Tax=Bacillus pumilus Rep... 246 8e-64 UniRef50_C0B702 Putative uncharacterized protein n=1 Tax=Coproco... 246 8e-64 UniRef50_B5G7W4 Glycosyltransferase n=3 Tax=Streptomyces RepID=B... 246 8e-64 UniRef50_C3PZ19 Glycosyl transferase n=1 Tax=Bacteroides sp. 9_1... 246 1e-63 UniRef50_Q8A2D4 Glycoside transferase family 2 n=2 Tax=Bacteroid... 245 2e-63 UniRef50_B7BDW0 Putative uncharacterized protein n=1 Tax=Parabac... 245 2e-63 UniRef50_C4ZHK7 Glycosyltransferase Family 2 modular protein n=1... 245 2e-63 UniRef50_C3QCN3 Glycoside transferase family 2 n=6 Tax=Bacteroid... 245 3e-63 UniRef50_A6P2R1 Putative uncharacterized protein n=1 Tax=Bactero... 244 3e-63 UniRef50_Q02W60 Glycosyltransferase n=1 Tax=Lactococcus lactis s... 244 3e-63 UniRef50_A6LG23 Glycosyltransferase family 2 n=6 Tax=Bacteroidal... 244 4e-63 UniRef50_B2DG34 Eps5I n=3 Tax=Streptococcus pneumoniae RepID=B2D... 244 4e-63 UniRef50_C6JQJ1 Glycosyl transferase n=1 Tax=Fusobacterium variu... 244 4e-63 UniRef50_C8WIW0 Glycosyl transferase family 2 n=1 Tax=Eggerthell... 244 4e-63 UniRef50_B7HXA1 N-acetylglucosaminyltransferase n=2 Tax=Bacillus... 244 4e-63 UniRef50_C5NWI7 Glycosyl transferase, group 2 family n=1 Tax=Gem... 243 5e-63 UniRef50_C6LFJ2 Glycosyl transferase, family 2/glycosyl transfer... 243 5e-63 UniRef50_B2IQI2 Glycosyl transferase, family 2 n=31 Tax=Firmicut... 243 5e-63 UniRef50_B2UQN8 Glycosyl transferase family 2 n=1 Tax=Akkermansi... 243 5e-63 UniRef50_Q74L33 Putative uncharacterized protein n=1 Tax=Lactoba... 243 6e-63 UniRef50_C1CFZ0 Glycosyl transferase, family 2/glycosyl transfer... 243 6e-63 UniRef50_UPI0001C4194D glycosyl transferase GT2 family n=1 Tax=M... 243 7e-63 UniRef50_D2Q641 Glycosyl transferase family 2 n=2 Tax=Bifidobact... 243 8e-63 UniRef50_C7X8L4 Glycoside transferase, family 2 n=1 Tax=Parabact... 243 9e-63 UniRef50_C7MN19 Glycosyl transferase n=1 Tax=Cryptobacterium cur... 242 1e-62 UniRef50_A5A8E7 Galactosyltransferase n=8 Tax=Streptococcus RepI... 242 1e-62 UniRef50_A1YVC6 Glycosyltransferase n=1 Tax=Lactobacillus johnso... 242 1e-62 UniRef50_C9RN10 Glycosyl transferase family 2 n=1 Tax=Fibrobacte... 242 1e-62 UniRef50_B0P2L4 Putative uncharacterized protein n=1 Tax=Clostri... 242 1e-62 UniRef50_Q84GX9 Orf34 n=2 Tax=Photorhabdus luminescens RepID=Q84... 242 1e-62 UniRef50_C0YL33 Glycosyltransferase n=1 Tax=Chryseobacterium gle... 242 2e-62 UniRef50_Q9AH91 Putative glycosyl transferase n=3 Tax=Streptococ... 242 2e-62 UniRef50_B0P2K9 Putative uncharacterized protein n=1 Tax=Clostri... 242 2e-62 UniRef50_C7G7U9 Putative capsular polysaccharide biosythesis pro... 242 2e-62 UniRef50_D2BBI0 Putative glycosyl/glycerophosphate transferase i... 242 2e-62 UniRef50_A7I1X7 Ss-1,4-galactosyltransferase n=1 Tax=Campylobact... 241 2e-62 UniRef50_C3ADH9 Putative uncharacterized protein n=1 Tax=Bacillu... 241 2e-62 UniRef50_C6Z980 Glycosyltransferase family 2 n=1 Tax=Bacteroides... 241 3e-62 UniRef50_A1A380 Putative uncharacterized protein n=1 Tax=Bifidob... 241 3e-62 UniRef50_B5CQR8 Putative uncharacterized protein n=1 Tax=Ruminoc... 241 3e-62 UniRef50_B3C8H4 Putative uncharacterized protein n=2 Tax=Bactero... 241 3e-62 UniRef50_A3KLK8 Cps5J protein n=2 Tax=Streptococcus agalactiae R... 241 3e-62 UniRef50_C6I619 Glycosyl transferase n=4 Tax=Bacteroides RepID=C... 240 5e-62 UniRef50_B0MQU6 Putative uncharacterized protein n=1 Tax=Eubacte... 240 5e-62 UniRef50_A3TMK5 Possible glycosyl hydrolase n=1 Tax=Janibacter s... 240 5e-62 UniRef50_C4Z667 Putative uncharacterized protein n=1 Tax=Eubacte... 240 6e-62 UniRef50_D2BBH8 Glycosyltransferase involved in cell wall biogen... 240 6e-62 UniRef50_A8U9H2 Glycosyl transferase, family 2/glycosyl transfer... 240 7e-62 UniRef50_Q5M3K9 Glycosyl transferase n=4 Tax=Streptococcus RepID... 240 7e-62 UniRef50_Q4JZ78 Putative glycosyl transferase n=1 Tax=Streptococ... 240 7e-62 UniRef50_Q8A821 Glycoside transferase family 2 n=1 Tax=Bacteroid... 240 8e-62 UniRef50_D1PNS4 Capsular polysaccharide biosythesis protein CpsI... 239 9e-62 UniRef50_Q8XN37 Capsular polysaccharide biosynthsis protein n=1 ... 239 9e-62 UniRef50_D0BME4 Glycosyl transferase n=10 Tax=Lactobacillales Re... 239 9e-62 UniRef50_B0NLZ9 Putative uncharacterized protein n=1 Tax=Bactero... 239 1e-61 UniRef50_D0BKT7 Capsular polysaccharide biosynthesis protein Cps... 239 1e-61 UniRef50_D2BMR3 Glycosyltransferase, family 2 n=1 Tax=Lactococcu... 239 1e-61 UniRef50_C6A9D3 Glycosyltransferase n=4 Tax=Bifidobacterium anim... 239 1e-61 UniRef50_C1DN10 Glycosyl transferase, family 2 protein n=1 Tax=A... 238 2e-61 UniRef50_C7H3W1 Capsular polysaccharide biosynthesis protein Cps... 238 2e-61 UniRef50_B0NM34 Putative uncharacterized protein n=3 Tax=Bactero... 238 2e-61 UniRef50_D2RJ85 Glycosyl transferase family 2 n=1 Tax=Acidaminoc... 238 2e-61 UniRef50_Q93DZ8 EpsG n=2 Tax=Lactobacillales RepID=Q93DZ8_STRTR 238 2e-61 UniRef50_C2EWJ8 Putative uncharacterized protein n=1 Tax=Lactoba... 238 2e-61 UniRef50_A6L5V6 Glycosyltransferase family 2 n=2 Tax=Bacteroides... 238 2e-61 UniRef50_C9K1G9 Putative glycosyltransferase n=1 Tax=Klebsiella ... 238 2e-61 UniRef50_UPI0001C377F0 glycosyl transferase, group 2 family prot... 238 3e-61 UniRef50_B4BNC6 Glycosyl transferase family 2 n=1 Tax=Geobacillu... 238 3e-61 UniRef50_C9BUH6 Glycosyltransferase n=1 Tax=Enterococcus faecium... 238 3e-61 UniRef50_A6BHF1 Putative uncharacterized protein n=1 Tax=Dorea l... 238 3e-61 UniRef50_C0FV12 Putative uncharacterized protein n=1 Tax=Rosebur... 237 3e-61 UniRef50_D2Q662 Glycosyl transferase, family 2 n=3 Tax=Bifidobac... 237 4e-61 UniRef50_D0DSJ8 Glycosyl transferase CpsJ n=1 Tax=Lactobacillus ... 237 4e-61 UniRef50_C2EYX7 Glycosyltransferase n=2 Tax=Lactobacillus reuter... 237 4e-61 UniRef50_Q83VE8 EpsN n=1 Tax=Lactococcus lactis subsp. cremoris ... 237 4e-61 UniRef50_A6BIH4 Putative uncharacterized protein n=1 Tax=Dorea l... 237 4e-61 UniRef50_Q6ASC1 Related to glycosyltransferase involved capsular... 237 5e-61 UniRef50_D2Q643 Glycosyltransferase EpsI n=2 Tax=Bifidobacterium... 237 6e-61 UniRef50_D0TDE5 Glycosyltransferase family 2 n=1 Tax=Bacteroides... 237 6e-61 UniRef50_C2M970 Glycosyl transferase family 2 n=1 Tax=Porphyromo... 237 6e-61 UniRef50_C9RJ54 Glycosyl transferase family 2 n=1 Tax=Fibrobacte... 237 7e-61 UniRef50_A6ASH7 Cps2K n=1 Tax=Vibrio harveyi HY01 RepID=A6ASH7_V... 237 7e-61 UniRef50_A6NTS7 Putative uncharacterized protein n=1 Tax=Bactero... 236 7e-61 UniRef50_Q047W8 Glycosyltransferase related enzyme n=1 Tax=Lacto... 236 7e-61 UniRef50_D2PVX1 Glycosyl transferase family 2 n=1 Tax=Kribbella ... 236 8e-61 UniRef50_B1C0P6 Putative uncharacterized protein n=1 Tax=Clostri... 236 8e-61 UniRef50_Q8RSN9 Putative glycosyl transferase n=1 Tax=Streptococ... 236 9e-61 UniRef50_B5CU67 Putative uncharacterized protein n=1 Tax=Bactero... 236 9e-61 UniRef50_C7PMN9 Glycosyl transferase family 2 n=1 Tax=Chitinopha... 236 9e-61 UniRef50_C8PLK6 Ss-1,4-galactosyltransferase n=1 Tax=Campylobact... 236 1e-60 UniRef50_C6WZX8 Beta-1,3-glucosyltransferase n=1 Tax=Flavobacter... 236 1e-60 UniRef50_C0FV38 Putative uncharacterized protein n=1 Tax=Rosebur... 235 1e-60 UniRef50_A5ZUY3 Putative uncharacterized protein n=1 Tax=Ruminoc... 235 2e-60 UniRef50_A8RE09 Putative uncharacterized protein n=1 Tax=Eubacte... 235 2e-60 UniRef50_Q2NH61 Predicted glycosyltransferase n=1 Tax=Methanosph... 235 2e-60 UniRef50_B0NM04 Putative uncharacterized protein n=1 Tax=Bactero... 235 2e-60 UniRef50_A3IC99 Predicted glycosyltransferase n=2 Tax=Bacillacea... 235 2e-60 UniRef50_Q8XN57 Beta-1,4-galactosyltransferase n=3 Tax=Clostridi... 234 3e-60 UniRef50_UPI0001C4190B glycosyl transferase GT2 family n=1 Tax=M... 234 3e-60 UniRef50_D1WVE3 CDP-glycerol:poly(Glycerophosphate) glycerophosp... 234 3e-60 UniRef50_C6X0E9 Beta-1,3-glucosyltransferase n=1 Tax=Flavobacter... 234 3e-60 UniRef50_C1YIQ9 Glycosyl/glycerophosphate transferase, teichoic ... 234 3e-60 UniRef50_B1C3W3 Putative uncharacterized protein n=1 Tax=Clostri... 234 4e-60 UniRef50_B9Y890 Putative uncharacterized protein n=1 Tax=Holdema... 234 4e-60 UniRef50_C3R6R9 Glycosyltransferase involved capsular polysaccha... 234 4e-60 UniRef50_D1JJJ8 Putative uncharacterized protein n=1 Tax=Bactero... 233 5e-60 UniRef50_B5G7W6 Glycosyltransferase n=2 Tax=Streptomyces RepID=B... 233 5e-60 UniRef50_A8ZY95 Glycosyl transferase family 2 n=1 Tax=Desulfococ... 233 5e-60 Sequences not found previously or not previously below threshold: UniRef50_D1PIY0 Glycosyl transferase, group 2 family n=1 Tax=Sub... 262 2e-68 UniRef50_D2Q667 Glycosyl transferase, family 2 n=3 Tax=Bifidobac... 250 5e-65 UniRef50_A6NTT8 Putative uncharacterized protein n=1 Tax=Bactero... 250 6e-65 UniRef50_Q5LZN9 Exopolysaccharide biosynthesis protein, sugar tr... 248 2e-64 UniRef50_A8SG33 Putative uncharacterized protein n=1 Tax=Faecali... 246 8e-64 UniRef50_B2UQ55 Glycosyl transferase family 2 n=1 Tax=Akkermansi... 246 1e-63 UniRef50_A6LCC5 Glycosyltransferase family 2 n=2 Tax=Bacteroidal... 245 2e-63 UniRef50_Q93TI5 Putative glycosyltransferase CpsIVJ n=1 Tax=Stre... 240 5e-62 UniRef50_C3X3W6 Putative uncharacterized protein n=1 Tax=Oxaloba... 240 6e-62 UniRef50_Q047X3 Glycosyltransferase related enzyme n=1 Tax=Lacto... 240 8e-62 UniRef50_C3JA21 Glycosyl transferase, family 2 n=3 Tax=Bacteria ... 239 1e-61 UniRef50_UPI0001972E41 putative sugar transferase n=1 Tax=Clostr... 238 2e-61 UniRef50_C7H1H9 Polysaccharide biosynthesis protein, Glycosyl tr... 238 3e-61 UniRef50_A5UMT7 Glycosyltransferase, GT2 family n=2 Tax=Methanob... 237 3e-61 UniRef50_Q4JZB4 Putative glycosyl transferase n=3 Tax=Streptococ... 237 5e-61 UniRef50_B9YA99 Putative uncharacterized protein n=1 Tax=Holdema... 236 8e-61 UniRef50_C6I618 Glycosyltransferase n=5 Tax=Bacteroidales RepID=... 236 9e-61 UniRef50_C9N8V9 Glycosyl transferase family 2 n=1 Tax=Streptomyc... 235 2e-60 >UniRef50_P11290 Uncharacterized glycosyltransferase yibD n=58 Tax=Enterobacteriaceae RepID=YIBD_ECOLI Length = 344 Score = 493 bits (1270), Expect = e-138, Method: Composition-based stats. Identities = 344/344 (100%), Positives = 344/344 (100%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY Sbjct: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC Sbjct: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN 180 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN Sbjct: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN 180 Query: 181 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYI 240 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYI Sbjct: 181 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYI 240 Query: 241 KITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMY 300 KITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMY Sbjct: 241 KITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMY 300 Query: 301 KRLITNVRSVKVGYQALLWSFRLWQWRDKTRSHHRITRSAFNLR 344 KRLITNVRSVKVGYQALLWSFRLWQWRDKTRSHHRITRSAFNLR Sbjct: 301 KRLITNVRSVKVGYQALLWSFRLWQWRDKTRSHHRITRSAFNLR 344 >UniRef50_B4SXB8 Glycosyl transferase, group 2 family protein n=36 Tax=Enterobacteriaceae RepID=B4SXB8_SALNS Length = 344 Score = 465 bits (1196), Expect = e-129, Method: Composition-based stats. Identities = 276/343 (80%), Positives = 303/343 (88%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M NS K+S+I+PLYNAG DF CM SLI QTW+ALEIII+NDGSTD+SVEIAK+YAE+Y Sbjct: 1 MKNSKTKVSIIVPLYNAGADFNACMASLIAQTWSALEIIIVNDGSTDHSVEIAKHYAEHY 60 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 PHVRLLHQANAGASVARN G++ ATG YVAFVDADD+VYP MYETLMTMAL DDLDVAQC Sbjct: 61 PHVRLLHQANAGASVARNLGLQAATGDYVAFVDADDQVYPKMYETLMTMALNDDLDVAQC 120 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN 180 NADWC R+TG WQSIPTDRLRSTGVL+GPDWLRM L+SRRWTHVVWMGVYRR +I NN Sbjct: 121 NADWCVRKTGHAWQSIPTDRLRSTGVLSGPDWLRMALASRRWTHVVWMGVYRRALITDNN 180 Query: 181 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYI 240 I F+ GLHHQDI+W+TE MFNA R RYTEQSLYKY+LH+ SVSRL RQGNKNLNYQRHYI Sbjct: 181 ITFVPGLHHQDILWSTEVMFNATRVRYTEQSLYKYFLHDNSVSRLQRQGNKNLNYQRHYI 240 Query: 241 KITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMY 300 KITRLLEKLNR+YA +I IYPEF QQIT+EALRVCHAVRKEPDILTRQRMIAEIFTSGMY Sbjct: 241 KITRLLEKLNRDYARRIPIYPEFRQQITWEALRVCHAVRKEPDILTRQRMIAEIFTSGMY 300 Query: 301 KRLITNVRSVKVGYQALLWSFRLWQWRDKTRSHHRITRSAFNL 343 +R++ NVRS K YQ LLWSFRLWQWRDKT SH R+ R A NL Sbjct: 301 RRMMANVRSAKAAYQTLLWSFRLWQWRDKTLSHRRMARKALNL 343 >UniRef50_A8GFL8 Glycosyl transferase family 2 n=1 Tax=Serratia proteamaculans 568 RepID=A8GFL8_SERP5 Length = 332 Score = 376 bits (967), Expect = e-103, Method: Composition-based stats. Identities = 156/322 (48%), Positives = 209/322 (64%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 LSVI+PL+N G+ F + SL+ Q LE+II+NDGSTD S +IA YA+ + H+ ++ Sbjct: 2 LSVIVPLHNVGELFEPFLASLLAQNERRLEVIIVNDGSTDGSGDIAHRYAQQHTHITVID 61 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 Q NAG S ARN G+ VA GKYVAF DADD + P MY TL+ +A ++ LDV QCN + F Sbjct: 62 QLNAGVSCARNAGLAVARGKYVAFPDADDLLAPEMYSTLIELAEQNQLDVMQCNGERYFT 121 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 + E IP RL T V++G WL L SR++ HVVW+ VYR D I ++ + F GL Sbjct: 122 DQDELQPIIPLKRLGDTDVISGVQWLERALESRKFIHVVWLAVYRLDFIKQHRLYFEPGL 181 Query: 188 HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRLLE 247 HHQDI WTTE MFNA R +Y ++LY+ +H S+S R G N+ YQRHY+KI +L Sbjct: 182 HHQDIPWTTEVMFNAQRVKYLGKALYRQRVHGQSISNRRRTGKANVEYQRHYMKIVAMLV 241 Query: 248 KLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLITNV 307 LNR YADKI I P FH QIT EAL +CH++R+EP++ + R+ E F G+ + ++ N Sbjct: 242 ALNRRYADKIAIRPAFHWQITREALGICHSIRREPELAAQCRITDEFFQLGIDRAMVDNA 301 Query: 308 RSVKVGYQALLWSFRLWQWRDK 329 R +K G+ +LW RL QWR Sbjct: 302 RGLKQGWHVILWLHRLKQWRHG 323 >UniRef50_D1RQX9 Putative glycosyl transferase n=1 Tax=Serratia odorifera 4Rx13 RepID=D1RQX9_SEROD Length = 335 Score = 373 bits (959), Expect = e-102, Method: Composition-based stats. Identities = 153/321 (47%), Positives = 206/321 (64%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 LSVI+PL+NAG F M SL+ Q LE+II+NDGSTD S IA YA YPH+ ++ Sbjct: 5 LSVIVPLHNAGALFEPFMASLLAQDERRLEVIIVNDGSTDGSGAIAHRYARQYPHITVID 64 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 Q N+G S ARN G+ +A GKYVAF DADD + P MY TL+ +A ++ LDV QCN F Sbjct: 65 QPNSGVSCARNAGLAIAQGKYVAFPDADDLLAPEMYSTLVALAEQEQLDVMQCNGQRYFS 124 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 E + IP RL TGV++G WL L +R++ HVVW+ VYR D I ++ + F GL Sbjct: 125 RENELQEIIPQKRLGDTGVISGAQWLERALKTRKFIHVVWLAVYRLDFIKRHQLYFEPGL 184 Query: 188 HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRLLE 247 HHQDI WTTE MFNA R +Y ++LY+ +H+ S+S R G N+ YQRHY+KI +L Sbjct: 185 HHQDIPWTTEVMFNAQRVKYLSKALYRQRVHDQSISNRRRTGKANVEYQRHYMKIVEMLV 244 Query: 248 KLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLITNV 307 LN+ YA K++I P FH QI EAL +CH++R+EP+ + ++ AE + G+ + + N Sbjct: 245 VLNQRYASKMVIRPAFHWQIAREALGICHSIRREPEPAAQCQITAEFYQRGIDRTMAANA 304 Query: 308 RSVKVGYQALLWSFRLWQWRD 328 R +K + LW RL QWRD Sbjct: 305 RGLKQIWHVTLWLHRLKQWRD 325 >UniRef50_B5XTH2 Glycosyl transferase, group 2 family protein n=21 Tax=Enterobacteriaceae RepID=B5XTH2_KLEP3 Length = 329 Score = 323 bits (829), Expect = 5e-87, Method: Composition-based stats. Identities = 122/327 (37%), Positives = 183/327 (55%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M+ T LS++ +YN E L Q + E+I++NDGSTDNS+ + + + Sbjct: 1 MSQTPLLSIVAAVYNGEKFLAQFFECLEQQQLESYELILVNDGSTDNSLAVIAEWRDRLQ 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 +V++L Q N G SVARN G+ A+GKY+AF D DD++YP MY TL+ MA ++ LD+A CN Sbjct: 61 NVQVLEQENQGVSVARNTGLAAASGKYLAFPDIDDKLYPGMYRTLLEMAEKEHLDIATCN 120 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNI 181 + + + E+ P DRL STGVL G WL+ L SR++ HV W+ +YR D I ++ Sbjct: 121 GTYVYEKRRESHPIFPLDRLPSTGVLPGHVWLKQALDSRKFLHVTWLNIYRHDFIRQHRF 180 Query: 182 KFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIK 241 F GL HQDI WTTE + A R +YT Q Y YY+H+ SVS + + RHY+K Sbjct: 181 HFEPGLRHQDIPWTTEALLAAERVQYTSQQFYDYYIHSESVSHKPDNDDTLMRSARHYMK 240 Query: 242 ITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYK 301 I +LE +N+ Y DK+ QI E L + H D + +I E F G+++ Sbjct: 241 ILEMLEAINQRYPDKVRHIAACRWQIAKEGLGIIHTFDSMKDESKKHVIIKEFFDRGIWR 300 Query: 302 RLITNVRSVKVGYQALLWSFRLWQWRD 328 + N + ++ ++ R+ ++R Sbjct: 301 LIWKNACTFRLRWRLGRRYLRIKRYRH 327 >UniRef50_A8SCX5 Putative uncharacterized protein n=1 Tax=Faecalibacterium prausnitzii M21/2 RepID=A8SCX5_9FIRM Length = 322 Score = 289 bits (741), Expect = 8e-77, Method: Composition-based stats. Identities = 75/288 (26%), Positives = 124/288 (43%), Gaps = 16/288 (5%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 LS+I+P+Y + + C++S++ QT+T E+I++ DGS D + A +R+LH Sbjct: 2 LSIIVPVYKVENYLQKCIDSILAQTFTDFELILVEDGSPDGCPALCDAAAAKDARIRVLH 61 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 Q N G S ARN G++VA G+++ FVD+DD + P MYETL D+A C+ Sbjct: 62 QKNGGLSAARNAGLDVARGEWIGFVDSDDYIAPEMYETLYKAVQSTGADLALCDYAAVDE 121 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 +P + + TG + L+ + W +YRR + +++ G Sbjct: 122 AGTPC---LPPYTGLAQRIFTGRELLKKA--TNTMAQPAWNKLYRRVIFA--QLRYPEGK 174 Query: 188 HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRLLE 247 ++DI E N +A + LY Y +S+ G+K L + R Sbjct: 175 LNEDIFVLPEICLNTKKAVVVPKELYYYVQRGSSIMN----GSKTLRHFDAAEAAQRYWN 230 Query: 248 KLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIF 295 L N +Y + V R+ P L + E+ Sbjct: 231 CLVEN-----EVYDALANAAKFTMGSVSRVYRQLPPALRKAPRSREML 273 >UniRef50_A9VPZ1 Glycosyl transferase family 2 n=3 Tax=Firmicutes RepID=A9VPZ1_BACWK Length = 324 Score = 287 bits (735), Expect = 4e-76, Method: Composition-based stats. Identities = 75/335 (22%), Positives = 151/335 (45%), Gaps = 14/335 (4%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 K+S+I+P+Y C++S++ QT+ E+I+++DGS D EI YA+ ++ Sbjct: 2 NPKISIIVPVYKVERYIHKCIDSILVQTFEDFELILVDDGSPDTCGEICDRYADKDMRIK 61 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++H+ N G S ARN G+E A G Y+ FVD+DD + P MYE L M +E D+A C++ Sbjct: 62 VIHKENGGLSSARNAGLENAKGDYIGFVDSDDWIEPDMYELLYGMCVEQGCDIAICSSQI 121 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 + I T + ++ L + + VVW +++R+++ +IKF Sbjct: 122 HYS------NRIVTSSNHPFIIHDRNTAMKTMLEGKLYDEVVWTKLFKRNLLE--DIKFP 173 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 G+ ++D +T + + + + + Y Y + S+ ++ Y ++ + Sbjct: 174 VGMSYEDTAFTYKVIHKCKKVCFIGEPKYHYIKRDNSMM-DRAVKELKIDSVLIYDEMCK 232 Query: 245 LLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLI 304 +E+ + +++ + + + R + ++I I + YK + Sbjct: 233 FMERYYKELYSLVIL--KLGNCSMVVLNLISMSGRFNEHKVKYYKVIG-ILNNYFYKTMS 289 Query: 305 TNVRSVKVGYQALLWSFRLWQWRDKTRSHHRITRS 339 V + LL++ +L K ++ R Sbjct: 290 LKAYGKSVKF--LLFATKLHPLLYKFLINNFAKRG 322 >UniRef50_C7H3W2 Capsular polysaccharide biosynthesis protein n=1 Tax=Faecalibacterium prausnitzii A2-165 RepID=C7H3W2_9FIRM Length = 318 Score = 287 bits (735), Expect = 4e-76, Method: Composition-based stats. Identities = 81/276 (29%), Positives = 121/276 (43%), Gaps = 15/276 (5%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 T +SVI+P+YN C++S++ QT+ LE+++++DGSTD S I YA V Sbjct: 2 ETELISVIVPVYNVERYLCRCVDSILHQTYRNLEVLLVDDGSTDASGAICDEYAAQEERV 61 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 +HQ N G S ARN G+E A G Y+ FVD+DD + M ETL E D DVA Sbjct: 62 TAVHQKNGGLSAARNTGLERAKGTYLCFVDSDDLLDSRMLETLCRDLQEQDADVAVVGFR 121 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 RE + P + +TG + +R L S W +Y+R++ +I++ Sbjct: 122 MFEREE----ELAPAELAVPVQCMTGREAIRSTLVSDELGDFAWNKLYKRELFR--DIRY 175 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLH----RQGNKNLNYQRHY 239 G +D T + Y LY YY S+ + ++ YQ+ Y Sbjct: 176 PLGRMMEDQGTTYRIFQQCSKVVYRPVPLYYYYQRPDSILHRRNMKFYEDKLDMGYQK-Y 234 Query: 240 IKITRLLEKLNRNYADKIMI----YPEFHQQITYEA 271 I + N A + + YP Q A Sbjct: 235 RAIREAYPGMPENDAAMLSVVEHCYPYLMQDTVRRA 270 >UniRef50_B0ACE9 Putative uncharacterized protein n=2 Tax=Firmicutes RepID=B0ACE9_9CLOT Length = 335 Score = 285 bits (730), Expect = 2e-75, Method: Composition-based stats. Identities = 81/339 (23%), Positives = 155/339 (45%), Gaps = 15/339 (4%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K+S+++P+YN C++S+I QT +EII+++DGS DN ++ YA+ ++++ Sbjct: 3 KVSIVVPIYNVEKYLEQCVDSIINQTLKEIEIILVDDGSPDNCPQMCDEYAKKDSRIKVV 62 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 H+ N G S ARN GIEVATG Y+ FVD+DD + MYE + +A+E+++D + Sbjct: 63 HKKNGGLSSARNAGIEVATGDYIGFVDSDDYIELDMYEKMYNIAIENNVDFVMSDYYRVS 122 Query: 127 RETGETWQSIPTDRLRSTGVLTG---PDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 E + + P + + VW +Y+R+++ KN++ F Sbjct: 123 SEDKVKATLDMDKGIYDKNKIKKDIFPTLIMGEYINYGRLLAVWHCLYKRELLEKNDLTF 182 Query: 184 IAGLHH-QDIVWTTEFMFNALRARYTE-QSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIK 241 + + +D ++++ + Y + LY Y + S+S+ ++ ++ Y+K Sbjct: 183 DEEVKYSEDNLFSSIIGYKVNNFYYMKGSYLYNYRYNPNSISKTYKPDAWSV-----YLK 237 Query: 242 ITRLLEKLNRNYADKIMIYPEFHQQITYEALRVC-HAVRKEPDILTRQRMIAEIFTSGMY 300 + L+ L N D + + I Y A H + + + I I + Sbjct: 238 MNNKLKDLFLNVEDY-DFRNQLYAHIIYYAFNCINHVGNSDKSFSYKYKEIKRILNT--- 293 Query: 301 KRLITNVRSVKVGYQALLWSFRLWQWRDKTRSHHRITRS 339 K L ++ K G + +W + K ++ I + Sbjct: 294 KELEDGFKNFKAGNISTKLKISIWMLKHKFIINYMIIKG 332 >UniRef50_C0WDB4 Ss-1,4-galactosyltransferase n=1 Tax=Acidaminococcus sp. D21 RepID=C0WDB4_9FIRM Length = 337 Score = 284 bits (728), Expect = 2e-75, Method: Composition-based stats. Identities = 80/336 (23%), Positives = 149/336 (44%), Gaps = 34/336 (10%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 +S+++P+YN C+ SL+ QT++ LEI++++DGSTD S + + + + Sbjct: 12 QAPLISLLVPIYNVASYVDRCVGSLVNQTYSNLEILLVDDGSTDGSGALCDEWVKKDSRI 71 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 ++H+ N G S ARN GI+ A G Y+AF+D DD + P E L D D++ CN Sbjct: 72 HVVHKENGGLSDARNAGIKRAAGAYLAFIDGDDYIAPEYCEKLYQALAAHDADISLCN-- 129 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWT-HVVWMGVYRRDVIVKN-NI 181 C+ + GV T + L++ T V W +Y+R + N + Sbjct: 130 ICYEWADGRRKDYEPPHGTREGVCTVKEVLQVLHGVGDLTLCVAWNKLYKRIIFTGNPPL 189 Query: 182 KFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIK 241 F G +++D + + A R +LY Y+ + S+ + +H+ Sbjct: 190 FFPKGRYYEDEFLNYKLLNRASRISVIPDALYFYFQRDNSIIHQYT--------DKHFTD 241 Query: 242 ITRLLEKLNRNYADKIMIYPEFHQQITYEA---LRVCHAVRKE--PDILTRQ-------- 288 + +E D + YP+ ++ ++ L++C + K+ D+ R Sbjct: 242 LCCCIE-------DSLTWYPDDEEKRQWKLRHSLKLCFTLYKKCHDDLHFRNRTSEVKGL 294 Query: 289 --RMIAEIFTSGMYKRLITNVRSVKVGYQALLWSFR 322 R++ E+ + + KR + V + VG L+ + Sbjct: 295 VFRIMKELKITYLTKRELVKVFLILVGGYGALYQVK 330 >UniRef50_C4FFR1 Putative uncharacterized protein n=1 Tax=Bifidobacterium angulatum DSM 20098 RepID=C4FFR1_9BIFI Length = 327 Score = 282 bits (721), Expect = 2e-74, Method: Composition-based stats. Identities = 72/260 (27%), Positives = 128/260 (49%), Gaps = 13/260 (5%) Query: 1 MMNST-NKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAEN 59 M N+ +SV++P+YN C++S++ QT+T +EII+++DGSTD S I Y+ Sbjct: 1 MSNTALPLISVVVPVYNVKPYVAKCLDSILRQTYTNIEIIVVDDGSTDGSELICDDYSNK 60 Query: 60 YPHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQ 119 P + ++HQ N G + ARN GI+ A G+Y+ FVD+DD + P MYE L+ A +++ +A Sbjct: 61 DPRITVIHQRNTGLAAARNTGIDAAHGEYLGFVDSDDFIEPFMYEKLLNAAQKNNCVLAV 120 Query: 120 CNADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKN 179 C ++ F + IP + V PD + + R + W +Y+ ++ Sbjct: 121 CGINYVF----DNGTVIPKANIEPDQVFDFPDAITEMNTYRLFDMGAWSKLYKSELFD-- 174 Query: 180 NIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHY 239 I+F G +D + A + + + Y YY S+++ + + L Sbjct: 175 GIRFPVGKLSEDFFIMFKIFDRAQKVAFVSDACYNYYQRTNSITKSKKINHDFLE----- 229 Query: 240 IKITRLLEKLNRNYADKIMI 259 + + ++RNY +I Sbjct: 230 -AAYKQMRYIDRNYPSLSVI 248 >UniRef50_Q5LIR2 Putative glycosyltransferase n=6 Tax=Bacteroides RepID=Q5LIR2_BACFN Length = 356 Score = 280 bits (717), Expect = 5e-74, Method: Composition-based stats. Identities = 64/232 (27%), Positives = 116/232 (50%), Gaps = 7/232 (3%) Query: 6 NKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRL 65 +S+I+P+YN + C++SLI QT+ +EII+++DGSTD+S I +AE +++ Sbjct: 8 PLISIIVPIYNIAEYASECIQSLINQTYKNIEIILVDDGSTDHSPAICDEFAEQDERIKV 67 Query: 66 LHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWC 125 +H+ N G S ARN G++VATG+Y+ FVD DD V MYETL + E D++ C Sbjct: 68 IHKRNGGLSDARNAGLDVATGEYIGFVDGDDWVDEDMYETLYHLIYEHQADISICTHYTE 127 Query: 126 FRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIA 185 ++ + + T + + + + + + +W +++R++ + ++F Sbjct: 128 LPN-----RTKVKYKSKKTKIFSSQKAIATLIEDKIIQNYIWEKLFKRELFTE--LRFPV 180 Query: 186 GLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQR 237 G +DI + A + + Y Y S++ R K Y + Sbjct: 181 GWSFEDIALCYKVFHKARKIVLLQTPKYHYRTRPGSITNSTRNPLKEFQYLQ 232 >UniRef50_D1NJF8 Glycosyl transferase family 2 n=1 Tax=Clostridium thermocellum JW20 RepID=D1NJF8_CLOTM Length = 322 Score = 280 bits (716), Expect = 6e-74, Method: Composition-based stats. Identities = 82/255 (32%), Positives = 135/255 (52%), Gaps = 12/255 (4%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 +SVI+P+YN C+ES++ QT++ LEII++NDGS D EI YA+ V+ Sbjct: 1 MELISVIVPVYNVEKYLPACIESILCQTYSELEIILVNDGSPDKCPEICDEYAKKDSRVK 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++HQ+N G S ARN GI+ ++GKY+ FVD+DD + PTM E L T ++ + +A C + Sbjct: 61 VIHQSNQGLSSARNTGIDNSSGKYIIFVDSDDTIEPTMVEDLYTCVIKYNCSMAACGRKY 120 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 F E G+ +P + V + D +R + + W +Y+R++ NI+F Sbjct: 121 VF-EDGKVVCKVPDN---IEKVFSFSDAIREMNTYFYFDMSAWAKIYKRELFD--NIRFP 174 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 G +D + + + + Y + LY Y +S+SR NKN+N+ Sbjct: 175 VGRLSEDFFIMYKLIALSQKVAYISKPLYNYLQRKSSISR-----NKNINH-DFITAARE 228 Query: 245 LLEKLNRNYADKIMI 259 +E L++NY + +I Sbjct: 229 QMEFLDKNYPELAII 243 >UniRef50_B5CP79 Putative uncharacterized protein n=1 Tax=Ruminococcus lactaris ATCC 29176 RepID=B5CP79_9FIRM Length = 315 Score = 279 bits (715), Expect = 9e-74, Method: Composition-based stats. Identities = 75/325 (23%), Positives = 147/325 (45%), Gaps = 24/325 (7%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 + +SVI+P+Y + C++S++ QT+ LE+I+++DGS D I YA VR Sbjct: 2 QDLISVIVPVYKVEKYLKRCVDSILAQTYPCLEVILVDDGSPDGCPAICDEYAREDRRVR 61 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNA-D 123 ++H+ N G S ARN GI+ A GK++ FVD+DD V+P YE L+ E+ D+A C+ Sbjct: 62 VIHKENGGLSDARNAGIDAAKGKFLGFVDSDDYVHPRFYELLLQALKEEGADIAGCDVKK 121 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRW--THVVWMGVYRRDVIVKNNI 181 C E E + P R V +G + ++ + + V W +Y++++ + I Sbjct: 122 VCKTEKIEEKEQQPIQR----TVYSGREATANLYDAQMYLKSVVAWNKLYKKELFEE--I 175 Query: 182 KFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIK 241 +F G H+D + + ++ + Y ++ Y Y+ + ++ Sbjct: 176 RFPKGKLHEDEFTSYKLLYQSESVVYINRAYYFYFQREDGIMGKEQRKISP--------A 227 Query: 242 ITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIA-----EIFT 296 + E++ R +++K + Q + Y L + + + I + Sbjct: 228 VLEAYEEMERFFSEKGE--KDLLQMMMYRYLSMVRRYASDAVKSGDEEEIRLGRYYDKKF 285 Query: 297 SGMYKRLITNVRSVKVGYQALLWSF 321 Y++ I ++ K ++ ++ Sbjct: 286 RKDYRKYIWKIKKPKRKFRLWMYYH 310 >UniRef50_A6NTT2 Putative uncharacterized protein n=1 Tax=Bacteroides capillosus ATCC 29799 RepID=A6NTT2_9BACE Length = 324 Score = 279 bits (714), Expect = 1e-73, Method: Composition-based stats. Identities = 71/321 (22%), Positives = 143/321 (44%), Gaps = 26/321 (8%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S+I+P+YN C+ S+I QT+T LEII+++DGS DN + +AE ++++H Sbjct: 5 ISIIVPVYNVEPYLDQCVRSIIEQTYTDLEIILVDDGSPDNCPAMCDAWAEKDRRIKVVH 64 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 + N G S ARN G+++A+G+Y+ FVD+DD + P + E+ + ++ + D+ A W + Sbjct: 65 KQNGGLSDARNAGMDIASGEYIGFVDSDDWLEPDILESALKQLVQANADIIAFGAIWEYS 124 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 E E + + LR S +V +YR++++ + ++F+ G Sbjct: 125 EKHEIPHPLEDNIY-----CGSEQILRAYFSECMTLTIVVNKLYRKNILDE--VRFLKGK 177 Query: 188 HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRLLE 247 H+D +T + A ++ Y Y S+ +++++ +LE Sbjct: 178 LHEDEFFTYRVLAKANTVVVCKKIGYHYRQRENSIMGT-----------QYHLRHLDVLE 226 Query: 248 KLNRNYADKIMIYPEFHQQITYEALRVCHAVRK--------EPDILTRQRMIAEIFTSGM 299 ++ +P Q + C + + +PD + R ++ I G+ Sbjct: 227 AMDEKLTFLEQEFPSLLPQQRRAQMMTCLSHYRSILMNKHVDPDSVGRNKIRRYIKRIGL 286 Query: 300 YKRLITNVRSVKVGYQALLWS 320 + + R + + Sbjct: 287 HWSDVKGCRIKEAARHLGFYL 307 >UniRef50_B6XVK3 Putative uncharacterized protein n=1 Tax=Bifidobacterium catenulatum DSM 16992 RepID=B6XVK3_9BIFI Length = 347 Score = 278 bits (712), Expect = 2e-73, Method: Composition-based stats. Identities = 76/329 (23%), Positives = 141/329 (42%), Gaps = 23/329 (6%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 + + +SVIIP+Y C+ S++ Q++ LEII+++DGSTD + +A + Sbjct: 11 TMDLVSVIIPIYGVEQYLDRCVRSVVNQSYAHLEIILVDDGSTDRCPAMCDAWANKDSRI 70 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 +++H+ N G S ARN G+++ATG+++AFVD+DD V P T++ A ++ D+ C+ Sbjct: 71 QVIHKPNGGLSSARNAGLDMATGEFIAFVDSDDYVEPDYIATMIDAAQKNHADLVMCSVF 130 Query: 124 WCFRETGETWQSIPTDRLRSTGV------LTGPDWLRMGLSSRRWTHVV-WMGVYRRDVI 176 + Q+ P R + +G D +R +VV W +YR + Sbjct: 131 HEDADGNAIEQTAPVRRKEYAPITDTLRTCSGIDCMRQRGEENGVDNVVAWNKLYRCQLW 190 Query: 177 VKNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQ 236 ++++ G H+D T A + LY Y S+ Y Sbjct: 191 K--DLRYPVGKLHEDEFVTYRIFGRTRTAVLLPERLYHYIERTGSIMHT--------KYT 240 Query: 237 RHYIKITRLLEKLNRNYADK--IMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEI 294 + I L R D+ + ++P F Q+ + + + R + + + Sbjct: 241 LQSLDIIEALIGKVRFLLDEGAMDLFPVFFSQLKDSIYKARQLDWSDAQVHKRLKELFRM 300 Query: 295 FTSGMYKRLITNVRSVKVGYQALLWSFRL 323 F + + +++R + Q RL Sbjct: 301 FRTIPW----SSIRYLPFKDQVNYMGTRL 325 >UniRef50_A7VPB1 Putative uncharacterized protein n=1 Tax=Clostridium leptum DSM 753 RepID=A7VPB1_9CLOT Length = 330 Score = 277 bits (710), Expect = 3e-73, Method: Composition-based stats. Identities = 74/300 (24%), Positives = 130/300 (43%), Gaps = 9/300 (3%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M +S+IIP+YN C+ES+ QT E+I+++DGSTD+S+EI + +A + Sbjct: 1 MKTVVPNVSLIIPVYNVEQYLPKCLESIAAQTLKGFEVILVDDGSTDHSLEILRGFARRF 60 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 P+ ++HQ N G S ARN GI+ A G+Y+AF+D+DD + P + L A + D+ C Sbjct: 61 PNTCVIHQENGGVSKARNAGIQAARGEYIAFMDSDDYIAPLYLQRLYESAKKYRADMVCC 120 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN 180 + +R T + P ++ G+ T + +R + +W ++RR + +++ Sbjct: 121 SY---YRYNAVTHELRPAQFRKAPGIYTPKEMIRALILDMGIKGFIWNKLWRRTLFTEHH 177 Query: 181 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYI 240 I F + +DI + + R LY Y H S G N+ Y+ Sbjct: 178 ITFPT-MCFEDISVCQRGFYFSNRIAVIGDPLYFYVQHKDSAI-----GALNIKKLNDYL 231 Query: 241 KITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMY 300 + L D + + A+ C+ + F ++ Sbjct: 232 RALADLRGFLELQRDFPPYRAVYRLHSLFVAITTCYLTLVAQKNAKNLSGLGRNFKEIIH 291 >UniRef50_B7AUI4 Putative uncharacterized protein n=1 Tax=Bacteroides pectinophilus ATCC 43243 RepID=B7AUI4_9BACE Length = 333 Score = 274 bits (702), Expect = 3e-72, Method: Composition-based stats. Identities = 79/306 (25%), Positives = 132/306 (43%), Gaps = 15/306 (4%) Query: 2 MNSTNK--LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAEN 59 MN+ K +SVI+P+Y C++S+I QT+T L+II+++DGS D S I YA Sbjct: 7 MNTPYKDLISVIVPVYKVERYLCRCVDSIIAQTYTNLQIILVDDGSPDGSGAICDDYAAK 66 Query: 60 YPHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQ 119 + ++H+ N G S ARN G+ A G Y+ FVD+DD + MYE L + + D++ Sbjct: 67 DSRITVIHKKNGGLSDARNAGLVAACGSYIGFVDSDDYIAVDMYEKLYKAIVACNADMSV 126 Query: 120 CNADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHV-VWMGVYRRDVIVK 178 CN + E G + V+ G L L + W +V W +Y R ++ Sbjct: 127 CNFRYV-DENGNNIEERNNALPVKDEVIDGITALTRTLDDKGWYYVTAWNRLYSRKLLQS 185 Query: 179 NNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLN---- 234 I F G H+D A + + Y Y S+ + G +NL+ Sbjct: 186 --IYFPDGKLHEDEFTVHRIYSKADKVACVADACYMYVQRGGSIMQTG-YGIRNLDAVEA 242 Query: 235 YQRHYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDI----LTRQRM 290 ++ + L E + IY + + ++ ++ + + R Sbjct: 243 LEQRFEFCVNLPEMSDYILRLSQRIYSMLVEAGIKLDMHNAGCRQRYKELKKTYIKQCRQ 302 Query: 291 IAEIFT 296 I ++FT Sbjct: 303 IKKLFT 308 >UniRef50_B6FJ78 Putative uncharacterized protein n=1 Tax=Clostridium nexile DSM 1787 RepID=B6FJ78_9CLOT Length = 321 Score = 274 bits (702), Expect = 3e-72, Method: Composition-based stats. Identities = 63/263 (23%), Positives = 118/263 (44%), Gaps = 19/263 (7%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 K+SVI+P+Y C++S++ QT++ +E+I+++DG+ D EI Y E V+ Sbjct: 1 MPKISVIVPVYKVEKYIHKCVDSILNQTFSDIEVILVDDGTPDRCGEICDAYGEQDSRVK 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++H+ N G S ARN G+ A+G+Y+ F+D+DD + M E + T E D+A C Sbjct: 61 VIHKENGGLSDARNAGMPHASGEYIIFIDSDDYIESDMLEYMYTRLTEAGADMATCGLYE 120 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 + + I + V +G + R L +W + RR I +++F Sbjct: 121 VYSD------RIEQQKEEKDFVCSGEEAFRCILQGHTIRGEIWNKLIRRSCIE--DLRFP 172 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 G ++DI +T + M + + Y Y S++ + + Sbjct: 173 KGKLYEDIYFTVDMMQRIKKVAVGTKPKYYYLHREDSITGKPYRPQ-----------LYD 221 Query: 245 LLEKLNRNYADKIMIYPEFHQQI 267 +++ +NY +P ++ Sbjct: 222 IIDGYTKNYKVVQKAFPSLTEEA 244 >UniRef50_C5S6D6 Methyltransferase FkbM family n=1 Tax=Allochromatium vinosum DSM 180 RepID=C5S6D6_CHRVI Length = 711 Score = 274 bits (701), Expect = 3e-72, Method: Composition-based stats. Identities = 85/348 (24%), Positives = 142/348 (40%), Gaps = 26/348 (7%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 + +S+ +P+YN R C++S++TQ +T LEII +NDGSTD S I YA P +R Sbjct: 3 SPTVSIALPVYNVAPYLRACLDSIVTQDFTDLEIICVNDGSTDESPAILAEYARRDPRIR 62 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++ Q N G + RN + G+Y+ VD+DD + P L+ D Sbjct: 63 IVDQTNRGLASTRNTALAHVRGRYLLHVDSDDLLRPGAVSRLVEEIERTGADFIVFQHLM 122 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 E + + L L L + H W +R D + + + +F Sbjct: 123 IEDE----REFDNWYYDFDSWSLEPKTRLEQKLEILHY-HYTWSKFFRMDFLNQYDFRFP 177 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVS-----------RLHRQGNKNL 233 GL D V+ + + A + + LYKY + S+ R+ R+G Sbjct: 178 DGLQFDDNVYHWQVLLAAQKVVMLPEKLYKYRMRAGSIMWHRGRHHLDIFRIFREGIALF 237 Query: 234 NYQRHYIKITR--LLEKLNRNYADKIMIYPEFHQQITYEALRVC-----HAVRKEPDILT 286 Q Y + + + K++ Y I P F L VR P +L Sbjct: 238 REQGVYEAVFQDFMQYKMSLQYFWTRDIEPRFRYLAYRGILDGLSADEERFVRDYPHLLD 297 Query: 287 RQ--RMIAEIFTSGMYKRLITNVRSVKVGYQALLWSFRLWQWRDKTRS 332 + R +F S +Y++ ++++ + L S R+ + RDK R+ Sbjct: 298 PEAIRFYERVFASPLYRQGAHRALALQLTARRYLSSGRV-RLRDKARA 344 >UniRef50_B4F184 Glycosyl transferase n=2 Tax=Proteus mirabilis RepID=B4F184_PROMH Length = 327 Score = 274 bits (700), Expect = 4e-72, Method: Composition-based stats. Identities = 89/295 (30%), Positives = 152/295 (51%), Gaps = 3/295 (1%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 T +SVI+P++N +ESL+ QT+ E+IIINDGSTDNSV+IA Y + Sbjct: 2 KTPIISVIVPMFNEASRITRLLESLLCQTFHDFEVIIINDGSTDNSVDIAMLYCQQDNRF 61 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 L HQ N G S ARN G++ A G ++ F D+DD + P + +A E +DV N + Sbjct: 62 HLYHQTNQGLSSARNTGLKYAQGDWIVFFDSDDFIKPQLLAHWHQLACEQHIDVLIGNGE 121 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 + Q+ R V++G +W+ ++ +W H VW+ R ++I K++++F Sbjct: 122 RYDVHNPQKHQTTIHQRQPYQQVISGQEWVIHAVTQHQWPHFVWLQFIRHEIIKKHHLRF 181 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKIT 243 I GL+H+DI+WTT+ R + +Q LY Y + S++R + Q + + Sbjct: 182 IDGLYHEDILWTTQLALVTQRIAFDDQPLYYYCANPNSITRKPNPQKEAKRAQSYLQIVI 241 Query: 244 RLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSG 298 +LL+ ++ + + ++ QQ E L + ++ Q +IA+ F++ Sbjct: 242 QLLQLADKQH--QPLLANALRQQALRE-LGSFFILFRKRCDTKNQAIIAQQFSTY 293 >UniRef50_C8NFZ4 Glycosyl transferase CpsJ(V) n=1 Tax=Granulicatella adiacens ATCC 49175 RepID=C8NFZ4_9LACT Length = 328 Score = 274 bits (700), Expect = 5e-72, Method: Composition-based stats. Identities = 79/307 (25%), Positives = 140/307 (45%), Gaps = 24/307 (7%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 K+S+I+P+YN C+ESL+ Q++ +EI++INDGSTDNS ++ A+ + Sbjct: 12 EPKISIIVPVYNVEQYLERCVESLMNQSYKNIEILLINDGSTDNSGKLCDEIAKRDSRIT 71 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 + H+ N G S ARN G++ AT +V FVD+DD V MYE L++ L+ D +++ C Sbjct: 72 VYHKENGGLSDARNYGVDKATADFVGFVDSDDYVDEDMYEVLLSNLLKYDAEISFCRLYD 131 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 + + Q + + ++T + ++M L ++ ++ +Y++ + N I+F Sbjct: 132 VYND-----QVLKDESKNEPYLMTSEEAIKMVLEAKIFSVTAVNKLYKKSLF--NQIRFE 184 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 G +D + + + TE Y Y S++ Q+ K Sbjct: 185 KGKIAEDAFLMVDLLSRCQKIAATEVHKYYYMHRENSITT-----------QKFTPKFLN 233 Query: 245 LLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLI 304 ++E +N YP Q R+C A D L + S + +LI Sbjct: 234 VIEAYEKNAKIVAEKYPNLQYQAD---TRICWAYFYVLDRLLKDEAYK---DSALEAQLI 287 Query: 305 TNVRSVK 311 N++ K Sbjct: 288 QNLKKYK 294 >UniRef50_P71059 Uncharacterized glycosyltransferase epsJ n=3 Tax=Bacillus subtilis group RepID=EPSJ_BACSU Length = 344 Score = 274 bits (700), Expect = 5e-72, Method: Composition-based stats. Identities = 73/247 (29%), Positives = 126/247 (51%), Gaps = 7/247 (2%) Query: 6 NKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRL 65 +S+I+P+YN C++SL+ QT + +EII++NDG+ D S EIA+ YA+ +R+ Sbjct: 3 PLVSIIVPMYNVEPFIEECIDSLLRQTLSDIEIILVNDGTPDRSGEIAEDYAKRDARIRV 62 Query: 66 LHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWC 125 +HQAN G S ARN GI+ A G Y+ FVD DD V M++ L A ++ LD+ C Sbjct: 63 IHQANGGLSSARNTGIKAARGTYIGFVDGDDYVSSAMFQRLTEEAEQNQLDIVGCGF--- 119 Query: 126 FRETGETWQSIPTDRLRSTGVLTGPDWLRMG--LSSRRWTHVVWMGVYRRDVIVKNNIKF 183 ++++ + +P +L + VLT P+ R+ VW +YRR++ + N+ F Sbjct: 120 YKQSSDRRTYVPP-QLEANRVLTKPEMTEQLKHAHETRFIWYVWRYLYRRELFERANLLF 178 Query: 184 IAGLHH-QDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKI 242 + +D + A R + ++ LY Y + S++ + + + + Q+ Y Sbjct: 179 DEDIRFAEDSPFNLSAFREAERVKMLDEGLYIYRENPNSLTEIPYKPAMDEHLQKQYQAK 238 Query: 243 TRLLEKL 249 Sbjct: 239 IAFYNHY 245 >UniRef50_D1BQ92 Glycosyl transferase family 2 n=1 Tax=Veillonella parvula DSM 2008 RepID=D1BQ92_VEIPT Length = 322 Score = 273 bits (699), Expect = 6e-72, Method: Composition-based stats. Identities = 95/325 (29%), Positives = 150/325 (46%), Gaps = 17/325 (5%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 +SVI+P+YN C+ES+ QT+ LEII++NDGSTD S +I YA ++ Sbjct: 1 MELISVIVPVYNVESYVAECIESIQNQTYMNLEIILVNDGSTDASGDICDKYAAYDERIQ 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++H+ NAG S ARN GIE A G Y+ FVD+DD + PTMYE ++ + E DLD+ +C A Sbjct: 61 VIHKENAGVSAARNTGIESANGDYIGFVDSDDYIAPTMYEDMLKLMAEHDLDIIECTA-- 118 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 FR G+T +I S + + L+M + VW +Y+R VI ++++F Sbjct: 119 -FRNNGDT--NIEGCNDGSLEIFNRDEALKMAMYD--CFVAVWSQLYKRRVI--SDVRFP 171 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 G +D + F+ N R + + LY Y L+ S ++ ++ Y + + Sbjct: 172 VGRKFEDSAVSYLFIANTKRVGHINRCLYYYRLNPNSTTQTSFDAKSRWDFVLGYEERLQ 231 Query: 245 LLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLI 304 Y D + + + L A +P Q + R Sbjct: 232 YAIDHQLPYVD------DCNSLLMKAVLSCLTAYYAKPT--GNQVYYDKCKKMIETYRND 283 Query: 305 TNVRSVKVGYQALLWSFRLWQWRDK 329 + + + Y+ LWSF W K Sbjct: 284 ASYKLLNSKYKLFLWSFGRADWIHK 308 >UniRef50_A1A383 Putative glycosyltransferase n=1 Tax=Bifidobacterium adolescentis ATCC 15703 RepID=A1A383_BIFAA Length = 346 Score = 273 bits (698), Expect = 8e-72, Method: Composition-based stats. Identities = 67/253 (26%), Positives = 123/253 (48%), Gaps = 12/253 (4%) Query: 6 NKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRL 65 +SVI+P+Y C++S++ QT+ LEII+++DGS D+ ++ + +R+ Sbjct: 18 PLISVIVPVYKVERYLDRCVQSIVDQTYRNLEIILVDDGSPDSCPDMCDAWIAKDYRIRV 77 Query: 66 LHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWC 125 +HQ N G+ ARN GIE +TG+ + FVD+DD + PTMYE L +D D++ + Sbjct: 78 IHQENQGSGAARNNGIESSTGELIGFVDSDDWLEPTMYEALFNSLKNNDADISMTASIME 137 Query: 126 FRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIA 185 + + + P L V+T + + T W + +R V+ +I+F Sbjct: 138 YPDGRQICHYQPNVHL----VMTTSESFKYVNLPGYSTISPWSKLIKRSVLG--DIRFPE 191 Query: 186 G-LHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 G ++ +D +T E + + R + LY Y S+S + N Y + Sbjct: 192 GCVNGEDTRFTYEVLAASKRTTFDSVPLYHYRTTEGSLSHTYTNYNWA-----GYEATKQ 246 Query: 245 LLEKLNRNYADKI 257 ++E + R + +++ Sbjct: 247 MVELVRRRFPEQL 259 >UniRef50_C0BRJ5 Putative uncharacterized protein n=1 Tax=Bifidobacterium pseudocatenulatum DSM 20438 RepID=C0BRJ5_9BIFI Length = 338 Score = 273 bits (698), Expect = 8e-72, Method: Composition-based stats. Identities = 63/255 (24%), Positives = 120/255 (47%), Gaps = 11/255 (4%) Query: 3 NSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPH 62 +SVI+P+Y C++S++ QT+ LEII+++DGS D ++ +A Sbjct: 8 EDLPLISVIVPVYKVERYLDRCIQSIVDQTYRNLEIILVDDGSPDACPDMCDAWAARDCR 67 Query: 63 VRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNA 122 +R++HQ N G ARN GI ++TG+ + FVD+DD + PTMYE L +++ D + Sbjct: 68 IRVIHQENRGLGAARNSGIRISTGELIGFVDSDDWLEPTMYEVLYRSLTKENADTSMITG 127 Query: 123 DWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIK 182 + + + P L V+T + + T W + +R + NI+ Sbjct: 128 VLEYPNGRQIYLYHPDLHL----VMTTSESFKYINLPGYSTISAWSKLIKRSTLG--NIR 181 Query: 183 FIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKI 242 F G +D +T + + ++R + Y Y S+S + N N+ Y Sbjct: 182 FPEGCTGEDWQFTYDILAASVRTTFDSTPQYHYRQVAGSLSNT----STNFNFA-GYEAT 236 Query: 243 TRLLEKLNRNYADKI 257 +++ +++ + +++ Sbjct: 237 KNMVKLVHQKFPEQL 251 >UniRef50_A7B4B8 Putative uncharacterized protein n=1 Tax=Ruminococcus gnavus ATCC 29149 RepID=A7B4B8_RUMGN Length = 336 Score = 272 bits (697), Expect = 1e-71, Method: Composition-based stats. Identities = 78/282 (27%), Positives = 127/282 (45%), Gaps = 10/282 (3%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 + +S+I+P+YN TC++SLI QT +E+I+++DGSTD S E+ YAE + Sbjct: 2 DMSLVSIIVPVYNVEPYIETCIQSLIRQTMGNIEVILVDDGSTDRSGELCDQYAEADERI 61 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 R++H+ N G S ARN GI A G+Y+ FVD+DD V ++ E ++ A ++ DV + Sbjct: 62 RVIHKQNGGLSSARNAGISAAKGEYLLFVDSDDYVSASLVEKTVSCAEQNQADVVVFDYQ 121 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 E + T L + V+ ++ L T +Y+R I + Sbjct: 122 EI--ELCSGGKQTRTSALPAGQVIHAESVPQLLL----ITPSACNKLYKRSFWEMTQIVY 175 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTE-QSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKI 242 G + +D+ + A R Y E LY Y L S+ R ++ + Sbjct: 176 PEGRNFEDLSVIPRILLQAGRVVYLEGDPLYFYVLRVGSIMRSGAFEKGWEERRKAISDV 235 Query: 243 TRLLEK--LNRNYADKIMIYPEFHQQITYEALRVCHAVRKEP 282 + LLE+ + + Y K + Y F + V R P Sbjct: 236 SNLLEEKGMEQQY-KKELEYLAFEHLYFVPSKEVVLKDRNNP 276 >UniRef50_D1PNS5 Capsular polysaccharide biosythesis protein CpsI n=1 Tax=Subdoligranulum variabile DSM 15176 RepID=D1PNS5_9FIRM Length = 326 Score = 272 bits (695), Expect = 2e-71, Method: Composition-based stats. Identities = 79/290 (27%), Positives = 123/290 (42%), Gaps = 10/290 (3%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTA-LEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 +SVI+P+YN C++S++ Q + E+++++DGSTD +I YA VR+ Sbjct: 6 ISVIVPIYNTKAYLEECLDSILAQDFKEPYEVLLVDDGSTDGCTDICDAYAAKDNRVRVF 65 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 HQ N G S ARN GI+ A G+Y AFVD+DD V TL E D +A C + Sbjct: 66 HQENQGLSAARNTGIDAARGRYYAFVDSDDVVLSAYLRTLYNACEEHDAYMALCAVEDV- 124 Query: 127 RETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRR-WTHVVWMGVYRRDVIVKNNIKFIA 185 +E+GE+ + GV G D L + V W +YR +V + + Sbjct: 125 QESGESCVPPHSTHPAQEGVFCGKDLLNEFYTPNGTVYTVAWNKLYRAEVWKL--LHYPE 182 Query: 186 GLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRL 245 G H+D + + + LY Y L + S+ R + + Sbjct: 183 GRLHEDDFVAHRLFWRCDKVVCVDTILYHYRLSSGSICRSSIKPEAF----DAVDGLADR 238 Query: 246 LEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIF 295 N AD+ +I + L +C VR+ P + + E F Sbjct: 239 YRFYVENNADRSVI-DSAYAACWRRYLFLCAKVRQNPTPQLVKAISKEQF 287 >UniRef50_A7VCQ5 Putative uncharacterized protein n=1 Tax=Clostridium sp. L2-50 RepID=A7VCQ5_9CLOT Length = 329 Score = 271 bits (694), Expect = 2e-71, Method: Composition-based stats. Identities = 78/319 (24%), Positives = 141/319 (44%), Gaps = 6/319 (1%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 K+S+IIP+Y C++S++ QT +EII+++DGS DN I YA V Sbjct: 2 KPKVSIIIPVYKVEKFLTRCLDSVLGQTLKEIEIILVDDGSPDNCGNICDEYAAKDKRVI 61 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++H+ NAG S ARN G+EVA+G++V FVD+DD V TM+E + A+ D ++A C Sbjct: 62 VIHKKNAGVSAARNSGLEVASGEFVGFVDSDDYVAATMFEDMYRQAVLADAEMAMCQFAI 121 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 T + D + + SR VW +++R++I N++F Sbjct: 122 TDGATDQLVHRSSGDDFNVLKFDNKKAFELIADFSRPVQVTVWNKLFKRELIQ--NLRFD 179 Query: 185 -AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKIT 243 + +D+ + +F + Y +LY YY + H + Y IT Sbjct: 180 TSKRMAEDLEFLMRALFKSRTVVYVPYALYAYYAQREGAATFHADHSIAW-YLEQNSNIT 238 Query: 244 RLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRL 303 ++++ +N A + + L + +A+ + + + + + + + Sbjct: 239 SIMDETAQNCASMKNL--AIGYKCVNGDLSIANAMVRAGKLDSEAVKLVKKDLKSNIREI 296 Query: 304 ITNVRSVKVGYQALLWSFR 322 +T+ Q LL+ Sbjct: 297 MTSELHAVKKVQMLLFIVS 315 >UniRef50_D2EWI1 Glycoside transferase family 2 protein n=3 Tax=Bacteroides RepID=D2EWI1_9BACE Length = 336 Score = 270 bits (692), Expect = 3e-71, Method: Composition-based stats. Identities = 78/343 (22%), Positives = 152/343 (44%), Gaps = 19/343 (5%) Query: 1 MMNST--NKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAE 58 M N+ +S+I+P+YN D C++S+ QT+ LEII+I+DGS+D S E+ +A+ Sbjct: 1 MKNTEALPLISIIVPVYNVKDYLEKCLQSICVQTYKNLEIILIDDGSSDGSGELCDLFAQ 60 Query: 59 NYPHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVA 118 ++++HQ NAG S ARNRG+ VA G+++ FVD+DD + P MYE L + + D++ Sbjct: 61 RDGRIKVIHQTNAGQSAARNRGLAVAQGEFLGFVDSDDWIEPDMYEFLYHLLKANGADIS 120 Query: 119 QCNADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVK 178 C+ S + + +R + +R + +W +Y+R + Sbjct: 121 ICSHYIETAVKTRVKHSS-----GQFSSFSREEAIRTLVEDKRIRNYMWDKLYKRQLFA- 174 Query: 179 NNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRH 238 I F +DI + + + + + Y Y S ++ K NY++ Sbjct: 175 -GIYFPVNRVFEDIAVSYQIFYKTQKVVMQDCPKYHYLKREGSTTQ-----GKLYNYEKE 228 Query: 239 YIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSG 298 Y+ + E++ + + I+ + + + + + P +I ++ Sbjct: 229 YLLFQTVYEQVK--FVREKEIWDKAPYCVHERGIHLIDHLMMLPQSSLIDDVINDVLARM 286 Query: 299 MYKRLITNVRSVKVGYQALLWSFRLWQWRDKTRSHHRITRSAF 341 + NV ++G + ++ R +R+ R+ F Sbjct: 287 ---QEFDNVSWPQLGVMHIFKRRMMYCHLKVYRCIYRLVRTFF 326 >UniRef50_B5LBK9 Putative glycosyl transferase n=1 Tax=Streptococcus pneumoniae RepID=B5LBK9_STRPN Length = 322 Score = 270 bits (692), Expect = 4e-71, Method: Composition-based stats. Identities = 72/263 (27%), Positives = 125/263 (47%), Gaps = 20/263 (7%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 MM K+SVI+P+YN C+ES++ QT+T LEI+++NDGSTD S E+ A+ Sbjct: 1 MMGE--KISVIVPVYNVEAYLEKCVESILKQTYTNLEILLVNDGSTDKSGELCDQLAQRD 58 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 +R++H+ N G S ARNRGIE A+ + F+D+DD + MYETL LE + D++ C Sbjct: 59 QRIRVIHKENGGLSDARNRGIEEASSDLIGFIDSDDYIDEDMYETLYRQMLESNADLSMC 118 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN 180 + + E + T LT + ++M + ++ + +Y++++ Sbjct: 119 GHYDVYDQIPEKQVATI-----QTWELTPQEAIKMVMEAKILSVTAVNKLYKKELFE--Q 171 Query: 181 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYI 240 ++F G +D + + T + Y Y S++ Q+ + Sbjct: 172 LRFEIGKIAEDAFIMIALIHQCRKVVATNEKKYYYVHRENSITT-----------QKFSL 220 Query: 241 KITRLLEKLNRNYADKIMIYPEF 263 K ++E +N YP+ Sbjct: 221 KFLNVIEAYEQNANIIRENYPDL 243 >UniRef50_B7GUL7 Glycosyl transferase, family 2 n=1 Tax=Bifidobacterium longum subsp. infantis ATCC 15697 RepID=B7GUL7_BIFLI Length = 349 Score = 269 bits (688), Expect = 1e-70, Method: Composition-based stats. Identities = 67/246 (27%), Positives = 117/246 (47%), Gaps = 11/246 (4%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 S ++V++P+YN TC++S++ QT+ L+II+++DGSTD S I +++A + Sbjct: 13 SDPLITVVLPIYNVAKWLDTCIQSIVKQTYRNLQIILVDDGSTDESPTICEHWAAKDSRI 72 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 ++HQ NAG S ARN G+ + G+Y+ FVD+DD V + ++ A + D+ CN Sbjct: 73 NVVHQRNAGLSAARNTGLRLRKGEYICFVDSDDYVEHDYVKRMLDTAHAEQADMVICNVR 132 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 + + D T LT ++ + S V W +YR ++ N+ + Sbjct: 133 K--EDENGAALAEEADPSFETKTLTSRQYMIYAMQSSWKHIVAWNKLYRSEIWD--NLTY 188 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKIT 243 G H+D + + R + LY Y H+ S+ + R+ KI Sbjct: 189 PVGKIHEDEFVFAQLVVRCHRVACIDDVLYHYVQHDGSIMNENYS-------IRNLDKIE 241 Query: 244 RLLEKL 249 L ++L Sbjct: 242 ALCQRL 247 >UniRef50_C6ID60 Glycosyltransferase n=5 Tax=Bacteroides RepID=C6ID60_9BACE Length = 368 Score = 269 bits (688), Expect = 1e-70, Method: Composition-based stats. Identities = 82/292 (28%), Positives = 130/292 (44%), Gaps = 22/292 (7%) Query: 15 YNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLHQANAGAS 74 YN C+ SL QT T +EII++NDGSTDNS + + YA + Q N G S Sbjct: 53 YNTEPFLAECLHSLEKQTLTDIEIILVNDGSTDNSGRLLREYAGKDARFVYVEQENQGLS 112 Query: 75 VARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFRETG-ETW 133 ARN G+E A+G Y+AF+D+DD + + L +A + D+ N + ++W Sbjct: 113 AARNTGMEHASGHYLAFLDSDDWLAENALQVLCAIAAKTRTDIVSGNTLAVHADGQVQSW 172 Query: 134 QSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGLHHQDIV 193 + + + V++G + + R + +V+ +YRRD I +N +F GL H+D + Sbjct: 173 ERRGRELFATGTVVSGSTYFSRVMDCRCYVPMVYNYLYRRDFIEQNGFRFEPGLVHEDEL 232 Query: 194 WTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQG----------NKNLNYQRHYIKIT 243 WT + + A + + Y Y S+ G K L Y R ++ Sbjct: 233 WTPQVLTTAQKITVADIDFYYYRQREGSIMTATAAGRRIASIQLIIEKLLEYSRKHLFEK 292 Query: 244 RLLEKLNRNYADKIMIYP---EFHQQITY--------EALRVCHAVRKEPDI 284 + E Y + IY H TY E LRVC +R++ + Sbjct: 293 KYREAKEALYVRLLQIYSTACTLHPDGTYTTLYDRAGEMLRVCEELRRQESL 344 >UniRef50_C9BVV0 Glycosyl transferase n=3 Tax=Enterococcus faecium RepID=C9BVV0_ENTFC Length = 335 Score = 269 bits (688), Expect = 1e-70, Method: Composition-based stats. Identities = 76/328 (23%), Positives = 140/328 (42%), Gaps = 21/328 (6%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 ++S+I+P+Y R C++S++ QT+T E+I+++DGS DNS +I YAE VR++ Sbjct: 3 EISIIVPVYKVEKYLRKCVDSILAQTFTDFEVILVDDGSPDNSGKICDEYAEKDNRVRVI 62 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 H+ N G S ARN GI+VA GKY+ FVD+DD + MYE L D D++ + Sbjct: 63 HKENGGLSSARNAGIDVARGKYLGFVDSDDYIDEDMYEILYENLKIHDADISSVELIPFY 122 Query: 127 RETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAG 186 + + ++ + +L + ++ L ++ W +YR+++ N +++ G Sbjct: 123 GDRYK-----KANKEKKVIILNKKEAIKSVLEGTQFYAYAWNKLYRKELFKDN--RYLDG 175 Query: 187 LHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRLL 246 +D + +F + + + Y Y N S+ N ++ Sbjct: 176 KTFEDAYIIIDLLFQTEKIVVSNEEKYFYLQRNDSIMGKSFSMNN-----------FDVI 224 Query: 247 EKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLITN 306 E N + +P+ H R+C A D + +I + Sbjct: 225 EAWQYNKEKILDAFPDLHDSYYR---RLCWAYFYVLDKMVLSSSYQQIPRKKTVIEFLKK 281 Query: 307 VRSVKVGYQALLWSFRLWQWRDKTRSHH 334 R+ + Y+ S ++ R Sbjct: 282 NRNFILTYKGFTKSRKIAIIALSFRVDW 309 >UniRef50_A6NTT3 Putative uncharacterized protein n=1 Tax=Bacteroides capillosus ATCC 29799 RepID=A6NTT3_9BACE Length = 331 Score = 269 bits (687), Expect = 2e-70, Method: Composition-based stats. Identities = 87/333 (26%), Positives = 146/333 (43%), Gaps = 30/333 (9%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 S K+SVI+P YN C++SL+ Q++ LEII+ +DGSTD + + YA + + Sbjct: 2 SMKKISVIVPAYNIEPYLGRCLDSLLAQSYPGLEIIVTDDGSTDGTAALLDRYAAEHSQI 61 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 +HQ N G + AR ++ ATG Y+ FVD DD M++TL+ A D+A C Sbjct: 62 VAIHQPNGGVTAARLAALKQATGDYIGFVDGDDLAEHEMFQTLIDDAEHYGADIAHCGYQ 121 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 F +G + T R+ G + + +W +YRR++I F Sbjct: 122 MVFP-SGRVDRYYGTGRVVEQ---DGESGVLDLVRGDFVEPGLWNKLYRRELIE----HF 173 Query: 184 IAG-------LHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQ 236 +A ++D++ A + + ++ LY Y L S S Q K L+ Q Sbjct: 174 LAEGRMDWTVRINEDLLMNFLLFSLAKKCVFRDKCLYHYMLRKGSASSAKVQAYKLLDPQ 233 Query: 237 RHYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKE---PDILTRQRM-IA 292 R + R Y + + + HQ +RV + P++ + R+ + Sbjct: 234 R----------VMQRIYEETTGLV-QLHQAAYVRYVRVLLGNTTQSTCPEVSQQARIELK 282 Query: 293 EIFTSGMYKRLITNVRSVKVGYQALLWSFRLWQ 325 ++ YK+L R + +G LL +RL + Sbjct: 283 QLICEKEYKQLSWKERYMVLGASKLLPLYRLVR 315 >UniRef50_C5NWF7 Glycosyl transferase, group 2 family n=2 Tax=Firmicutes RepID=C5NWF7_9BACL Length = 320 Score = 268 bits (685), Expect = 3e-70, Method: Composition-based stats. Identities = 72/297 (24%), Positives = 129/297 (43%), Gaps = 20/297 (6%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 K+SVI+P+YN C++S+I QT+ LEII++NDGSTDNS ++ A +R Sbjct: 2 DEKISVIVPVYNVEQYLERCVDSIINQTYKNLEIILVNDGSTDNSGQLCDELARKDDRIR 61 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++H+ N G S ARN GI+ A + + F+D+DD + MYE LM + + D++ C Sbjct: 62 VVHKKNGGLSDARNVGIDEAEAELIGFIDSDDYIDEDMYEVLMNNLKDANADLSMCGHYD 121 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 + E + + T L+ + ++M + ++ + +Y++ + + KF Sbjct: 122 VYNNVPEAQVA-----DKKTWELSPQEAIKMVMEAKILSVTAVNKLYKKSLFSE--FKFE 174 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 G +D + + + T + Y Y S++ Q+ +K Sbjct: 175 IGKIAEDAFIMIKLLDKCNKIVATNEKKYYYVHRENSITT-----------QKFSLKFLN 223 Query: 245 LLEKLNRNYADKIMIYPEFHQQITYEALRVCHAV--RKEPDILTRQRMIAEIFTSGM 299 ++E +N I YPE + V R D + + S + Sbjct: 224 VIEAYEQNKEIIIKKYPELKEVAQMRMNWAYFYVLDRLLLDKEYNDKKLENKLISYL 280 >UniRef50_C6LFI9 Glycosyl transferase, group 2 family n=1 Tax=Bryantella formatexigens DSM 14469 RepID=C6LFI9_9FIRM Length = 689 Score = 267 bits (684), Expect = 3e-70, Method: Composition-based stats. Identities = 86/352 (24%), Positives = 149/352 (42%), Gaps = 15/352 (4%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M K+SVIIP+YN C++S+ QT +EII +NDGSTD S++I + YAE Sbjct: 1 MADRKKVSVIIPVYNVEKYLEKCLDSVERQTLPEIEIICVNDGSTDGSLQILQSYAEKDD 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 + ++++ N G S ARN G+ VA G+YV F+D+DD + E L +++LD + Sbjct: 61 RIIIINKENGGLSSARNAGLAVAEGEYVYFLDSDDWILAETLEVLYDECKKENLDTILFD 120 Query: 122 ADWCFR-ETGETWQSIPTDRLRSTGVL----TGPDWLRMGLSSRRWTHVVWMGVYRRDVI 176 AD F E E D TG + ++ + + RR + Sbjct: 121 ADSIFETEEVERLHPSYRDYYHREDCFGNTVTGQRLFAEMMPREQYRSSACLQMNRRAFL 180 Query: 177 VKNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQ 236 + + F G+ H+D ++T + A RA++ + Y+ + S+ R Y Sbjct: 181 EQYGLGFREGMIHEDELFTLQVSLRAERAKHLARPFYQRLVREESIMTSVRAIRSAFGYY 240 Query: 237 RHYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRK---EPDILTRQRMIAE 293 I +L + D + IY ++ V++ + R +A Sbjct: 241 VCNQLILAEATELVKE-KDIMDIYVNKISRLRRSIKSALKDVKRGEIDRFTSNRLEDMAL 299 Query: 294 IFTSGMYKRLITNVRSVKVGYQALLWSFRLW------QWRDKTRSHHRITRS 339 + +RL+ + R+++ Q L S K +S +R+ + Sbjct: 300 LEQFKYEQRLVISNRNLRKKIQELRASTTFKVGKTVTFIPRKLKSAYRVLKK 351 Score = 211 bits (537), Expect = 4e-53, Method: Composition-based stats. Identities = 71/297 (23%), Positives = 124/297 (41%), Gaps = 13/297 (4%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 ++S+I+P+YN C+ SL Q + +EII ++D STD S+E+ + A + VR+L Sbjct: 376 EISIIMPVYNGEKYLWRCIRSLQKQMFRDIEIICVDDKSTDRSMELLQERARHDKRVRIL 435 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 Q + GA ARN G++ A GKY+ F+D DD M L A + D V A Sbjct: 436 QQEHLGAGSARNLGMKEARGKYLLFLDCDDLFDKNMCRELYRAAEKHDAQVVLFGAQRMD 495 Query: 127 RETGETWQ---SIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 T + + + L V +G + + ++ W ++RRD I++N ++F Sbjct: 496 MMNNRTERMGWVLRSSELPKDAVFSGQEIADRLF--QITSNCPWSKMFRRDFILENGLEF 553 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKIT 243 + H D + M A R +Q Y + S + L + + + + Sbjct: 554 QSTRHCNDAYFVRMAMALAERMTMVDQIFTTYRFNMG-ASTQGVKHEAPLEFYKAFSAVK 612 Query: 244 RLL--EKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSG 298 L + L Y + + Y ++ E +++M+ E F Sbjct: 613 EALTEKGLFALYKKSFVNWTLTESLFNYRTMKTA-----EAKAAIKEKMLTEGFDYF 664 >UniRef50_C0C2F7 Putative uncharacterized protein n=1 Tax=Clostridium hylemonae DSM 15053 RepID=C0C2F7_9CLOT Length = 326 Score = 266 bits (681), Expect = 6e-70, Method: Composition-based stats. Identities = 77/328 (23%), Positives = 137/328 (41%), Gaps = 30/328 (9%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S+I+P+YN C++S++ QT+ E+I++NDGS D S I + YA+ VR + Sbjct: 4 VSIIVPVYNVEKYLNKCVDSILGQTFEDFELILVNDGSVDRSPSICRDYADKDGRVRYIS 63 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 Q N G ARN GIE A G Y+ FVD+DD + M E L D+A C ++ Sbjct: 64 QENMGPGRARNVGIEAAGGAYILFVDSDDYIAENMLEILYGNITTSGADMATCGLYNVYQ 123 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 + + IP G + + L+ + + + +++ +I + G+ Sbjct: 124 Q-----RCIPQYEGIEQFACDGGEAFGLLLTGEKIPGSSCNKLIKAEILK--DIGYPEGI 176 Query: 188 HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRLLE 247 ++D+ + TE M LY Y S++ + + + + Y R++E Sbjct: 177 VYEDVGFHTELMQKVQSVYVDTTPLYYYVHRENSIT-TRKFDSDAMMFIYAYEDTLRVVE 235 Query: 248 KLNRNYADKIMIYPEFHQQITYEALRVCHAV----------RKEPDILTRQRMIAEIFTS 297 K YP + ++ + A+ R+ P+ R + Sbjct: 236 KK----------YPAILTEARFKLIWAYFAILDRMLQEDGYREIPEYRQVIRFLKRNTVR 285 Query: 298 GMYKRLITNVRSVKVGYQALLWSFRLWQ 325 + R K+G ALL + RL++ Sbjct: 286 IVKNPYFHRAR--KLGALALLLNVRLYR 311 >UniRef50_C4FMC2 Putative uncharacterized protein n=3 Tax=Firmicutes RepID=C4FMC2_9FIRM Length = 317 Score = 266 bits (680), Expect = 9e-70, Method: Composition-based stats. Identities = 72/290 (24%), Positives = 127/290 (43%), Gaps = 6/290 (2%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 LS+I+P+YN C++S++ QT+ LEII+++DG+TD S IA YA ++ Sbjct: 2 EPLLSIIVPIYNVEQYVDKCIQSILNQTYQNLEIILVDDGATDCSGSIADSYAAKDKRIK 61 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 + H+ N G S ARN G++ TG Y+ FVD+DD + TM E L +A + D+ CN Sbjct: 62 VFHKENGGLSDARNYGLDHVTGDYILFVDSDDFIENTMCERLFAVANSTNADIVSCNYYI 121 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWT--HVVWMGVYRRDVIVKNNI- 181 + + ++ T TG D LR L +VVW +++ ++ + Sbjct: 122 YREDDDISIHTMSVQ--DDTRTFTGMDMLRYYLLKTEPFDLNVVWNKLFKSELFNGTVLV 179 Query: 182 KFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIK 241 +F G +D NA + LY Y S+ + + H Sbjct: 180 RFPKGRVQEDNFTIFRLFLNANTIVTVNEPLYYYVQRAGSIMANFSRRFMTDTVESHIYM 239 Query: 242 ITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMI 291 L++ + + + + +++ + V + D+LT+ + Sbjct: 240 NDYLMDHCSSVKNELQLYLLNSYVELSRR-VHVNKCKTEYNDLLTQYKHY 288 >UniRef50_B5CU57 Putative uncharacterized protein n=1 Tax=Bacteroides plebeius DSM 17135 RepID=B5CU57_9BACE Length = 327 Score = 266 bits (680), Expect = 1e-69, Method: Composition-based stats. Identities = 77/311 (24%), Positives = 132/311 (42%), Gaps = 13/311 (4%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M + +SVI+P+Y C+ES++ QT+T LEII+++DGS D EI A Sbjct: 1 MPMSALISVIVPVYKVEPYLHRCVESILKQTYTNLEIILVDDGSPDKCGEICDELASTDK 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 ++++HQ N G S ARN G+ +ATG+Y+AFVD+DD ++P MY+ L ++ +A C Sbjct: 61 RIQVIHQKNKGLSGARNSGLNIATGEYIAFVDSDDCLHPEMYKRLYEDICLHNVKLAFCQ 120 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNI 181 A+ C E + + PT V +++ S W +Y R ++ NI Sbjct: 121 ANMCHGEVIKVTCNQPT-------VCKEKNYVMYRSLSESIWWAAWTKLYHRSLLE--NI 171 Query: 182 KFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIK 241 +F G ++D + LY Y + S+ + N++ I Sbjct: 172 RFPEGRTNEDYAVMMLIYDQCEQIAINFNRLYNYCIRENSICT----SSLNIHKFDQLIS 227 Query: 242 ITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYK 301 +L + + + L H K+ D +++ I ++ Sbjct: 228 AEEVLIYIKERHPEFKKFAEAILMSSCLGLLHALHKEDKKKDFTAQEKKIYQLIQKYFST 287 Query: 302 RLITNVRSVKV 312 + N K Sbjct: 288 AIWNNKILTKQ 298 >UniRef50_Q67AY0 Cps8J n=6 Tax=Streptococcus RepID=Q67AY0_STRAG Length = 325 Score = 266 bits (680), Expect = 1e-69, Method: Composition-based stats. Identities = 78/333 (23%), Positives = 146/333 (43%), Gaps = 17/333 (5%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 N +S+I+P+YN + C+ S+I+Q++ E+II+NDGSTDNS+ + R Sbjct: 2 QNLVSIILPVYNVENYIENCLNSIISQSYFHFEVIIVNDGSTDNSMSYCEKIVCEDLRFR 61 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN-AD 123 ++++ N G S ARN G+E A G Y+ FVD+DD + P + LM + D+A C+ Sbjct: 62 IINKDNGGLSDARNVGLEAAKGDYIIFVDSDDFISPNLVSHLMACLENYEADLAICDPVH 121 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 + ET + + + +LT + + + + W VY++ + NI+F Sbjct: 122 FYLGGNQETHEKNIFKKHSNIHLLTNEEAICELFYQKTFLVSAWGKVYKKSIFD--NIRF 179 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKIT 243 G +D + + Y+ LY Y S++ + + I Sbjct: 180 PKGKLFEDSAVMHLLFEKSNKIVYSNAKLYAYVHRKDSITTS--------QFSERDLDIL 231 Query: 244 RLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAE--IFTSGMYK 301 + +++ + Y IY I+Y+ + P+I ++AE + +K Sbjct: 232 DITDQILKRYKSFPKIYQ---AAISYKISACFRVILNAPNIEKYTPIVAECKKYVYYHWK 288 Query: 302 RLITNVRSVKVGYQALLWSFRLWQWRDKTRSHH 334 ++ N + ++ + L S L++ K H Sbjct: 289 DILFN-KKTRIKTKLALISIVLFKPFIKFIYTH 320 >UniRef50_C7H3W5 Glycosyl transferase CpsJ n=1 Tax=Faecalibacterium prausnitzii A2-165 RepID=C7H3W5_9FIRM Length = 324 Score = 265 bits (679), Expect = 1e-69, Method: Composition-based stats. Identities = 78/307 (25%), Positives = 136/307 (44%), Gaps = 24/307 (7%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 +S+I+P+YN C+ SL+ Q++ +E+++++DGSTD S + YA VR Sbjct: 2 DPLVSIIVPVYNVKPYLNRCVNSLLGQSYQNMELLLVDDGSTDGSETLCGEYAAQDARVR 61 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 +LH+ N G S ARN G++ A G+Y++FVD DD V P E L + D + + Sbjct: 62 VLHKKNGGLSDARNAGVDAAKGEYLSFVDGDDWVSPYYIENLYRALEQAGADFSASCFEE 121 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 F G+ QS+PT+RL + +L+ + LR L +Y+R + +++ Sbjct: 122 VFE--GQPVQSVPTERLEAFEILSREECLRRILYQEGMEVTTPTKLYKRALFE--GVRYP 177 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 G ++DI E A +A + Y Y+ +S+ + K L+ RH Sbjct: 178 VGKLYEDIPVAYEVTKRAHKAAHIANRDYYYFQRGSSIQNMAFNPRK-LDSVRH---CHA 233 Query: 245 LLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLI 304 L+E + R++ VC+ + + I + + K L Sbjct: 234 LMENVKRDFPRLS------RAAECRYLSNVCNILFQ----------IQDRQHEKIEKALW 277 Query: 305 TNVRSVK 311 V+ + Sbjct: 278 QEVKKYR 284 >UniRef50_D2Q666 Glycosyl transferase, family 2 n=4 Tax=cellular organisms RepID=D2Q666_9BIFI Length = 329 Score = 265 bits (678), Expect = 2e-69, Method: Composition-based stats. Identities = 78/337 (23%), Positives = 152/337 (45%), Gaps = 12/337 (3%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M T +S+I+P+YN G+ C+ SL+ QT+ +++I+++DGSTD S ++ +A+ Sbjct: 1 MNTGTPLVSIIVPVYNTGNYLNVCVNSLLRQTYANIQVILVDDGSTDGSADLCDAWADKD 60 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 V ++HQAN G S ARN G++ A G ++ FVD+DD + E L+ + + +A Sbjct: 61 SRVMVIHQANGGVSTARNAGLDAAEGDWLEFVDSDDWLEREAIEGLLGLVQSEHAQMAIF 120 Query: 121 NADWCF-RETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKN 179 N +T T +D S+GVL+ D L + L+ W + R++I + Sbjct: 121 NYRSVLDEDTPYTVCEKTSDYRISSGVLSRKDALDVILAYSGVKGYAWNKFFSRELIERQ 180 Query: 180 NIKFIAGL-HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRH 238 N++F + + +D++++ E+ + T++ LY Y + S S + + H Sbjct: 181 NLRFDSKITMCEDLLFSVEYALLSTVTVGTDRCLYNYRNNPNSASHKLDLESVATCLEAH 240 Query: 239 YIKITRLLEKLNRNYADKIMI-YPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTS 297 R++ + R ++ Y +++ A ++ + I + Sbjct: 241 ----KRMVSIVPRESKSSVLASYAILAEELLLRTYANDDATHRDEYLA-----ILRKYWW 291 Query: 298 GMYKRLITNVRSVKVGYQALLWSFRLWQWRDKTRSHH 334 KRL + +++ S W + H+ Sbjct: 292 HALKRLRPSRYWMRILGGVFCPSLFYPIWNHEKGKHY 328 >UniRef50_D2Q658 Glycosyl transferase, family 2 n=3 Tax=Bacteria RepID=D2Q658_9BIFI Length = 332 Score = 265 bits (678), Expect = 2e-69, Method: Composition-based stats. Identities = 79/333 (23%), Positives = 137/333 (41%), Gaps = 23/333 (6%) Query: 1 MMNSTN---KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYA 57 M N +SVI+P+YN C++S++ Q++ LEII+++DGS D +A Sbjct: 1 MSNEVPDELLISVIVPVYNVERYLDQCVKSIVEQSYRNLEIILVDDGSRDACPGKCDMWA 60 Query: 58 ENYPHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALE-DDLD 116 + ++H+AN G S ARN G++ A G +AFVD+DD + P M T+ E ++D Sbjct: 61 AKDNRISVIHKANGGLSSARNAGLDQARGDCIAFVDSDDWIDPNMLRTMWQWMQEQHEVD 120 Query: 117 VAQCNADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSR-RWTHVVWMGVYRRDV 175 V C + F + D V T LR L R R VW ++ Sbjct: 121 VVMCGTEKNFEDGDTEH----IDGHLPARVFTSDQALRSFLYHRDRMASAVWNKLFDARF 176 Query: 176 IVKNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNY 235 ++ I+F GL+++D + ++LY+Y + S++ + ++ Sbjct: 177 FREDGIRFPEGLNNEDYYVLAHVYRTMKGIYFNPKALYRYRIRKNSITTASLNEH---SF 233 Query: 236 QRHYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIF 295 R I YAD + + A++ + + + L + M AE+ Sbjct: 234 DRAIIADKCCKYLSESGYAD--------RNALAFFAMQGRYDIVYD---LASKHMGAEVL 282 Query: 296 TSGMYKRLITNVRSVKVGYQALLWSFRLWQWRD 328 K + K G A+ ++W Sbjct: 283 AVWREKLAYAAKKVYKDGTLAISHKVKIWMMAH 315 >UniRef50_UPI0001B4A4FE ss-1,4-galactosyltransferase n=1 Tax=Bacteroides sp. 2_1_7 RepID=UPI0001B4A4FE Length = 332 Score = 265 bits (677), Expect = 2e-69, Method: Composition-based stats. Identities = 73/278 (26%), Positives = 130/278 (46%), Gaps = 6/278 (2%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 + +SVI+P+YN D +TC S+++Q++ +E+I++NDGSTD S EIAK + V Sbjct: 2 NMPLVSVIVPIYNVEDYLQTCCNSILSQSFNNIEVILVNDGSTDGSCEIAKQIEKKDSRV 61 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 L+ Q N G S ARN G++VA G YV FVD+DD + + L ++ DV+ C A Sbjct: 62 ILIDQRNGGLSAARNAGLKVAKGDYVLFVDSDDWIEKNCIQELYQDIQRNNSDVSCCLAQ 121 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 + LTG + L L + + W +YRRD + +N + F Sbjct: 122 YI----SAMGTIRRGRSSFGINSLTGDEILPNALLVKTFPTSAWGKLYRRDFLHENGLLF 177 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKIT 243 G+ ++D +++ + A + + ++ L+ S+SR + + + Sbjct: 178 KEGIVNEDTLFSIQIACLAKKISFVDKVLFDIREREGSISRSSFE-KLFQDMNIALLSAR 236 Query: 244 RLLEKLNRNYADKIMIYP-EFHQQITYEALRVCHAVRK 280 +E+ N +Y + + Y L++ ++ Sbjct: 237 SFMEEHNCYNEPLDKLYKARYLKSNLYNLLQIAQRLQY 274 >UniRef50_D2Q655 Glycosyl transferase, family 2 n=2 Tax=Bifidobacterium dentium RepID=D2Q655_9BIFI Length = 340 Score = 264 bits (676), Expect = 2e-69, Method: Composition-based stats. Identities = 65/225 (28%), Positives = 105/225 (46%), Gaps = 5/225 (2%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 + +SVI+P+YN G C+ES+I QT++ +EI++++DGSTD S +A+ + Sbjct: 8 NQPLISVIVPVYNVGGFLDRCVESIIAQTYSRIEILLVDDGSTDGSSVKCDSWADKDDRI 67 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 ++H+ N G S ARN G++V G+Y+AFVD+DD V P E + ++ C+ Sbjct: 68 SVIHKVNGGLSDARNAGLDVCLGEYIAFVDSDDYVDPRFIEGMFDALSRHSSTLSVCDVV 127 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 E+G + R R V++G L + W +Y R + ++F Sbjct: 128 RE-DESGNALRKSRKARPR-ERVMSGRQCLGLTYVDPN-AVTAWSKLYHRSLWE--GLRF 182 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQ 228 G H+D E M+ R LY Y H S+ Sbjct: 183 PKGRLHEDEFVFHEIMYQCDRVVVMADELYHYVQHQGSIMHARYN 227 >UniRef50_A3CKM9 Glycosyltransferase, family 2/glycosyltransferase family 8, putative n=1 Tax=Streptococcus sanguinis SK36 RepID=A3CKM9_STRSV Length = 289 Score = 264 bits (676), Expect = 3e-69, Method: Composition-based stats. Identities = 68/223 (30%), Positives = 105/223 (47%), Gaps = 8/223 (3%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 S +SVIIP+YN C+ S+ QT+ LEII++NDGSTD S +I + A+ + Sbjct: 2 SQELISVIIPVYNVEQYLAECVNSVCRQTYQNLEIILVNDGSTDASGQICQELADQDARI 61 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 RL+HQ N G S ARN GIE ++ Y+ FVD+DD + L E D D+A Sbjct: 62 RLIHQDNQGLSGARNTGIEHSSADYLIFVDSDDWLPEQHVARLYEKLKEYDADIAI-GHH 120 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHV---VWMGVYRRDVIVKNN 180 FR + T+ + + T + + R V W +Y+R++ Sbjct: 121 CSFRAEDSAFLYYSTEHFET--LYTREEIIEEYPRRRMLDGVFLCAWAKLYKRELFDT-- 176 Query: 181 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVS 223 +++ G +D T + + + Y + LY Y L S+S Sbjct: 177 VRYPVGRVAEDAFTTYKLYLQSEKIIYLNEPLYYYRLRPNSIS 219 >UniRef50_C7UWL4 Glycosyl transferase n=6 Tax=Enterococcus RepID=C7UWL4_ENTFA Length = 326 Score = 264 bits (675), Expect = 3e-69, Method: Composition-based stats. Identities = 70/297 (23%), Positives = 132/297 (44%), Gaps = 16/297 (5%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 K+S+I+P+YN C+ S++ QT+T E+I+++DGS D+S + +AE V+ Sbjct: 1 MPKISIIVPVYNVEKYLEKCVRSILAQTFTDFELILVDDGSPDSSGAMCDQFAEQDQRVK 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++H+ N G S ARN GIE+ATG+Y+ FVD+DD + MYETL + + ++A Sbjct: 61 VIHKENGGLSDARNAGIELATGEYLGFVDSDDYIADDMYETLYNQIVTYEAELATVGMID 120 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 + + +L+ + ++ L S +Y++++ N +++ Sbjct: 121 VYENRESRLTD-----KKEIKILSQNEAIQAVLDSTDVYAYAVNKLYKKELF--NKVRYP 173 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 G +D E + + + Y YY S++ L KN + + Sbjct: 174 KGKIVEDAFIIIELLLQCQKIVTNTEQKYYYYRRADSITGLS-FSEKNFDVIEAWEHNGV 232 Query: 245 LLEKLNRNYADKIMIYPEFHQQITYEALRVCH--AVRKEPDILTRQRMIAEIFTSGM 299 L++ +Y + H++I + V A+ E + ++ I S Sbjct: 233 LVDN---SYPELAE---SVHRRICWAYFDVLDKMAISNEKNFNEQRESIKRFLYSNF 283 >UniRef50_P71057 Putative glycosyltransferase epsH n=3 Tax=Bacillus subtilis group RepID=EPSH_BACSU Length = 344 Score = 264 bits (675), Expect = 4e-69, Method: Composition-based stats. Identities = 77/316 (24%), Positives = 134/316 (42%), Gaps = 10/316 (3%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 T +S+++ +YN RTC+ESL QT +EIII+NDGS D S +IA+ YA+ Sbjct: 2 ETPAVSLLVAVYNTETYIRTCLESLRNQTMDNIEIIIVNDGSADASPDIAEEYAKMDNRF 61 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 +++HQ N G RN+GIE A G+++AF+D+DD + P E ++ A D+ D+ CN Sbjct: 62 KVIHQENQGLGAVRNKGIEAARGEFIAFIDSDDWIEPDYCEQMLRTA-GDETDLVICNYA 120 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 F +TG+T S + + L + W +YRR +I + + F Sbjct: 121 AEFEDTGKTMDSDIAQTYQDQ----PKEHYIKALFEGKVRGFSWNKLYRRSMIEAHRLSF 176 Query: 184 I---AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYI 240 H +D ++ F A Y + LY Y +H +S+ + R K Sbjct: 177 PLRGELEHVEDQFFSFRAHFFARSVSYVKTPLYHYRIHLSSI--VQRYQKKLFESGLALY 234 Query: 241 KITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMY 300 + + N + F + + + + R + I ++ Sbjct: 235 ETNAAFLQENNKLEEYRKELDTFIVLHSSICMLNEWKTSGSRRLFEKLRNVGVICADPVF 294 Query: 301 KRLITNVRSVKVGYQA 316 + ++ + + Sbjct: 295 QESLSKTGTAPFDAKR 310 >UniRef50_C0BRK2 Putative uncharacterized protein n=1 Tax=Bifidobacterium pseudocatenulatum DSM 20438 RepID=C0BRK2_9BIFI Length = 350 Score = 264 bits (675), Expect = 4e-69, Method: Composition-based stats. Identities = 84/305 (27%), Positives = 138/305 (45%), Gaps = 11/305 (3%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 MN+ K+SV++P+Y + C++S+ QT +EII+++DGS D IA YA Sbjct: 1 MNN-PKVSVVVPIYKVEPFLQECVDSIQVQTLEDIEIILVDDGSPDRCGAIADEYARKDS 59 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 ++++HQ N+G ARN GI ATG+YV FVD+DD + PTMY +L A + D D+ Sbjct: 60 RIKVVHQKNSGLGPARNTGIANATGEYVGFVDSDDWIAPTMYSSLYDAASQCDADIVVGG 119 Query: 122 ADWCFRETGETWQSIP---TDRLRSTGVLTGPDWLRMGLSSR----RWTHVVWMGVYRRD 174 + ++ P S +L + L L + VW +YR D Sbjct: 120 HCDMVNGSVRAVKAHPLAGQILRGSDQILPVRNRLFGHLPEDTEVESFPMRVWTAIYRND 179 Query: 175 VIVKNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLN 234 + N +KF A + +D ++ A +T + Y+Y + + S S + +K L Sbjct: 180 FLKVNGLKFQA-ILSEDTIFNLSAYRCANVVAFTGDTDYRYRMDDHS-SIMRSFSSKKLT 237 Query: 235 YQRHYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEI 294 ++ + L K D+ ++ + I Y L V V + + R I + Sbjct: 238 QYEQFVDTVQFLAKEEPYQHDECIMRTK-RMAIDYCRLYVGLIVGSGLSVRDKLRNIRRL 296 Query: 295 FTSGM 299 S M Sbjct: 297 SQSHM 301 >UniRef50_B3JEB6 Putative uncharacterized protein n=1 Tax=Bacteroides coprocola DSM 17136 RepID=B3JEB6_9BACE Length = 344 Score = 263 bits (674), Expect = 5e-69, Method: Composition-based stats. Identities = 71/337 (21%), Positives = 154/337 (45%), Gaps = 25/337 (7%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 + K+S+I+P+YN C+ESLI QT +EII+++D S D+ I YA+ ++ Sbjct: 2 SPKVSIIVPVYNVEKYIHRCVESLINQTLKDIEIILVDDESPDSCPSICDEYAQKDARIK 61 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++H+ N G + RN G+E+A+G+++AF+D+DD V TMYETL A + D C+ + Sbjct: 62 VIHKKNEGLGLTRNAGLEIASGEFIAFIDSDDWVDLTMYETLYNTAKNNQCDTVYCSLQY 121 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGL-----------SSRRWTHVVWMGVYRR 173 + + IP + G + L S ++ V +Y R Sbjct: 122 YYSPN----KIIPFKEVNEETFFQGRKEIDTFLLDMLAPLPTYHSDVKYMVSVCKAIYSR 177 Query: 174 DVIVKNNIKFIAGLH--HQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNK 231 ++I +++KF++ +D+V+ T+++ + + + + Y Y+ + S++ + + Sbjct: 178 NIIRNHHLKFVSERIIASEDMVFHTQYLKLSEKIGFIPKYFYNYFQNENSITHTYTETKI 237 Query: 232 NLNYQRHYIKITRLLEKLNRNYADKIMIYPEFHQQITYEA---LRVCHAVRKEPDILTRQ 288 K + ++ Y ++ Y ++ + L + + + K + + Sbjct: 238 -----ERLKKFIQEMDVTFSQYFNQSEYYIRLQRKALHYLRTSLYIKYQITKSHPFIEQI 292 Query: 289 RMIAEIFTSGMYKRLITNVRSVKVGYQALLWSFRLWQ 325 + E+ ++ + + + + + L+ F + Sbjct: 293 KEFKELCDDEIFSKTLVDYPYHLLPLKHRLFYFFIKF 329 >UniRef50_B2UQP2 Glycosyl transferase family 2 n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UQP2_AKKM8 Length = 358 Score = 263 bits (674), Expect = 5e-69, Method: Composition-based stats. Identities = 73/231 (31%), Positives = 108/231 (46%), Gaps = 4/231 (1%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S+I+P YN CMESL Q+ +EII +NDGS+D + EI + + + VR++ Sbjct: 5 VSIIVPCYNVAAYLDQCMESLAGQSMEDIEIICVNDGSSDRTAEILREWRDRDGRVRVID 64 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 + N+G S ARN G+E A GKY+ FVD DD V M+ L A+E D DVA C Sbjct: 65 RKNSGVSAARNSGMEAAAGKYIGFVDPDDVVERNMFRRLFDAAVEKDADVAVCGYHEFCD 124 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 G + R G R VW + RR+++ +N ++F+ GL Sbjct: 125 RGGMDMPESGWSPSAGFFPEEKAERFRRGTPWSRCAGTVWNKLIRRELLEENGVRFVPGL 184 Query: 188 H-HQDIVWTTEFMFNALRARYTEQSLYKYYL-HNTSVS--RLHRQGNKNLN 234 +D+ + + A R LY Y S S R R G+ ++ Sbjct: 185 RQGEDLYFCLMLLTVAPRLLILPDRLYHYRRERPGSASCGRDPRLGDFRMD 235 >UniRef50_A4VTV1 Glycosyltransferases involved in cell wall biogenesis n=10 Tax=Streptococcus suis RepID=A4VTV1_STRSY Length = 334 Score = 263 bits (672), Expect = 7e-69, Method: Composition-based stats. Identities = 75/319 (23%), Positives = 136/319 (42%), Gaps = 17/319 (5%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M+N +S+I+P+YN C+ S++ QT+ +EI+++NDGSTDNS EI YA+ Sbjct: 1 MIN----ISIIVPIYNVEQYLSKCINSIVNQTYKHIEILLVNDGSTDNSEEICLAYAKKD 56 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 +R + N G S ARN GI A G Y+AF+D+DD ++ + L ++ VA Sbjct: 57 SRIRYFKKENGGLSDARNYGISRAKGDYLAFIDSDDFIHSEFIQRLHEAIERENALVAVA 116 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVV-WMGVYRRDVIVKN 179 D +G + P + VL+G + + L + VV W +Y++++ Sbjct: 117 GYDRV-DASGHFLTAEP--LPTNQAVLSGRNVCKKLLEADGHRFVVAWNKLYKKELFE-- 171 Query: 180 NIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHY 239 + +F G H+D +T ++ + ++ LY Y S+ + Sbjct: 172 DFRFEKGKIHEDEYFTYRLLYELEKVAIVKECLYYYVDRENSIITSSMT-------DHRF 224 Query: 240 IKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGM 299 + + Y + ++ A V + + +Q+ + + + Sbjct: 225 HCLLEFQNERMDFYESRGDKELLLECYRSFLAFAVLFLGKYNHWLSKQQKKLLQTLFRIV 284 Query: 300 YKRLITNVRSVKVGYQALL 318 YK+L N R + L Sbjct: 285 YKQLKQNKRLALLMNAYYL 303 >UniRef50_C2DBB8 Glycosyltransferase n=19 Tax=Bacteria RepID=C2DBB8_ENTFA Length = 322 Score = 262 bits (670), Expect = 2e-68, Method: Composition-based stats. Identities = 73/310 (23%), Positives = 134/310 (43%), Gaps = 21/310 (6%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 ++S+I+P+Y R C++S++ QT+T E+I+++DGS DNS +I YA VR++ Sbjct: 3 EISIIVPVYKVEPYLRKCLDSILAQTFTNFEVILVDDGSPDNSGKICDEYAIKDSRVRVI 62 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 H+ N G S ARN GI++A GKY+ FVD+DD + MYE L D D++ Sbjct: 63 HKENGGLSSARNAGIDIAQGKYLGFVDSDDYIAEDMYEVLYENLKFYDADISSVEIIPFR 122 Query: 127 RETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAG 186 + E T + + +L + ++ L + W +YR+++ N +++ G Sbjct: 123 DDKYE-----NTSKEKKEILLNKKEAIKSVLEGTDFYAYAWNKLYRKELFYNN--RYLEG 175 Query: 187 LHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRLL 246 +D + + + + + Y Y N S+ Q+ K ++ Sbjct: 176 ETFEDAYIIIDILLETDKIIVSNEEKYFYLQRNDSIMG-----------QKFSKKTLDVI 224 Query: 247 EKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLITN 306 + N + +P+ R+C A D L +I + + Sbjct: 225 KAWQYNQKKILNHFPDLKDSYNR---RLCWAYFYVLDKLILSSNYKQIPETKEIIEFLKK 281 Query: 307 VRSVKVGYQA 316 R+ + Y+ Sbjct: 282 NRNFILSYKG 291 >UniRef50_D1PIY0 Glycosyl transferase, group 2 family n=1 Tax=Subdoligranulum variabile DSM 15176 RepID=D1PIY0_9FIRM Length = 322 Score = 262 bits (669), Expect = 2e-68, Method: Composition-based stats. Identities = 56/222 (25%), Positives = 97/222 (43%), Gaps = 7/222 (3%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 +SVIIP+Y C+ S+ QT++ LEI++++DGS D + +A+ P VR Sbjct: 1 MPAISVIIPVYQVERYLDACVASVEGQTFSDLEILLVDDGSPDGCPALCDAWAQKDPRVR 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++H+ N G S ARN GIE ATG ++AFVDADD + P + + D+ N Sbjct: 61 VIHRENGGLSAARNTGIEAATGDFLAFVDADDLLEPDTLRRAYQAQRQHNADLVIFN--L 118 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGL---SSRRWTHVVWMGVYRRDVIVKNNI 181 + + +P VL + + + + V W +Y+R++ + Sbjct: 119 VYVDDRNKVLPVPDFTGFRDEVLDADEVWQRYFALAEQKIYYVVAWNKLYKRELFAT--L 176 Query: 182 KFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVS 223 ++ G ++D + Y+Y S+ Sbjct: 177 RYAEGKRYEDQFLLPGLLAQCRTIVCLGYPGYRYVQRGGSIM 218 >UniRef50_A6L4H5 Glycosyltransferase family 2 n=2 Tax=Bacteroides RepID=A6L4H5_BACV8 Length = 321 Score = 261 bits (668), Expect = 2e-68, Method: Composition-based stats. Identities = 69/323 (21%), Positives = 129/323 (39%), Gaps = 20/323 (6%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M + + +S+I+P+YN C++SL+ QT+ +EII+++DGS D S EI YA+ Sbjct: 1 MSRNDSLISIIVPVYNVEQYLSRCVDSLVNQTYHNIEIILVDDGSPDRSGEICDEYAKKD 60 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 V+++HQ+N G S ARN +++A G Y+ FVD+DD + E L +A + D Sbjct: 61 KRVKVIHQSNGGLSDARNTALDIAKGDYLMFVDSDDWIEKKTCEILNRLASDYKADTVIF 120 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN 180 ++ F L + +L ++ + ++ + Sbjct: 121 GLNFVFDSGKVIRSKRGMAGLTDKNMCMK--YLIHNVARGGIFNYACNKMFSSHLFE--G 176 Query: 181 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYI 240 ++F G +D + + + R TEQ LY YY + S++ + ++ Sbjct: 177 LRFPVGRLAEDQDVIYKLIHRSQRIYVTEQPLYNYYQRDGSITAAQYYPRLIRDRHELWL 236 Query: 241 KITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCH-AVRKEPDILTRQRMIAEIFTSGM 299 K + + YP+ + L + + K + ++ Sbjct: 237 KRLDFIREY----------YPDLENYQIAQILGAAYVGIVKLEGNKQNGELKKKLICFSN 286 Query: 300 -----YKRLITNVRSVKVGYQAL 317 K L R +K+ Y Sbjct: 287 KYILKEKELSVLDRKIKLHYYCY 309 >UniRef50_C0BRQ0 Putative uncharacterized protein n=2 Tax=Bifidobacterium RepID=C0BRQ0_9BIFI Length = 331 Score = 261 bits (667), Expect = 3e-68, Method: Composition-based stats. Identities = 65/223 (29%), Positives = 113/223 (50%), Gaps = 7/223 (3%) Query: 3 NSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPH 62 ++ +SVI+P+YN C+ESLI QT+ LEII+++DGSTD+S E +A Sbjct: 4 STQPLISVIVPVYNVERYLDQCVESLIGQTYERLEIILVDDGSTDSSCEQCNAWANRDNR 63 Query: 63 VRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNA 122 +R + Q NAG + ARN G+++A G+Y+ FVD+DD V P M+ TL+ E D D++ + Sbjct: 64 IRAVRQCNAGLAAARNTGLDLAKGEYIGFVDSDDYVLPDMFGTLLHNLQESDADLSIISY 123 Query: 123 DWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIK 182 + + + P ++ V+T + + W + ++++ N++ Sbjct: 124 ERENPDGSTYCNAFPDKKI----VMTSQEAFAYVNQHGYFYVTAWDKLAKKELFD--NLR 177 Query: 183 FIAGLHH-QDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSR 224 + + +D T + + A R Y LY+Y + S S Sbjct: 178 YPLDAVYAEDSPVTYQLLDKADRIVYDSTPLYRYRMSENSQSH 220 >UniRef50_B4AD18 Glycosyltransferase n=18 Tax=Enterococcus RepID=B4AD18_ENTFA Length = 326 Score = 261 bits (667), Expect = 3e-68, Method: Composition-based stats. Identities = 73/337 (21%), Positives = 157/337 (46%), Gaps = 24/337 (7%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 +++S+I+P+YN + + C+ES+++QT+T E+++++DGSTD+S E+ ++ Sbjct: 1 MSEISIIVPVYNVENYLKKCVESILSQTFTDFELLLVDDGSTDSSGEMCDELKRLDERIK 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++H+ N G S ARN GI+VA GKY+ FVD+DD + M E L +++D D++ Sbjct: 61 VIHKENGGLSSARNAGIDVAKGKYLTFVDSDDYIDTHMLEVLYKNMVQEDADLSIVGVTS 120 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 + +G+ + P+++ V+T + +M L ++ + +Y+R++ ++++ Sbjct: 121 VY--SGQEPEITPSEKY----VVTQKEATKMILIGKQASVYAVAKLYKRNIFD--DLRYP 172 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 G H+D+ + + + S Y Y S++ + ++L + + Sbjct: 173 VGKAHEDVYVIMDVLNQCKKVVVDTASYYFYVHREGSIT-ASKFNIRDLESIDSWEQAWH 231 Query: 245 LLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLI 304 ++++ Y+D RVC A D + + +E M KR++ Sbjct: 232 TIQEIMPEYSDLAFK-------------RVCAANFYVLDKMMKSDSESE--YPEMTKRIV 276 Query: 305 TNVRSVKVGYQALLWSFRLWQWRDKTRSHHRITRSAF 341 ++ + A + + + H + F Sbjct: 277 GFLKENQKFIYANPYFTKNRKLSLTFLRIHPMLYKVF 313 >UniRef50_C9RJB8 Glycosyl transferase family 2 n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RJB8_FIBSS Length = 356 Score = 261 bits (667), Expect = 3e-68, Method: Composition-based stats. Identities = 66/318 (20%), Positives = 134/318 (42%), Gaps = 3/318 (0%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 +S+I+P+YN R C++S++ QT+T LEII+++DGS D +I YA + Sbjct: 3 QPLVSIIVPIYNVEPYLRRCLDSIVNQTYTNLEIILVDDGSPDGCPQICDEYAAKDKRIV 62 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++H+ N G S ARN G+++ G+YV+FVD+DD + E+ + D+ + Sbjct: 63 VIHKENGGLSDARNAGLDICKGEYVSFVDSDDWIATNSIESFYKIVQTQKPDIVIGDYIT 122 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 E+G + ++ + + G V W + ++D+ K F Sbjct: 123 A-TESGNYQNQTTSKQIIEMSLFNLLKCHQPGHPLFTQVVVAWNKLIKKDIAKKW--LFP 179 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 G +D T ++ F+A + T + Y Y S+ + NY + R Sbjct: 180 KGAICEDHYTTYKYFFSASKVILTNEITYYYRTREGSIVNNLAKNPIRDNYIQFEYLRER 239 Query: 245 LLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLI 304 + L++ D ++ + + + + + I + ++ + + Sbjct: 240 ISFFLDQARIDIANLFIDQYTRNAIYLYIIESKNKVNKGIHKQDILVFKKHIKTKSVKFF 299 Query: 305 TNVRSVKVGYQALLWSFR 322 N ++ + + ++L F Sbjct: 300 VNHLNICIKFYSILCKFS 317 >UniRef50_D2Q661 Glycosyl transferase family 2 n=2 Tax=Bifidobacterium dentium RepID=D2Q661_9BIFI Length = 337 Score = 261 bits (667), Expect = 3e-68, Method: Composition-based stats. Identities = 82/346 (23%), Positives = 145/346 (41%), Gaps = 24/346 (6%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 ++ +SVIIP+Y C+ S++ QT+ LEII+++DG D + + E VR Sbjct: 2 SDLVSVIIPIYGVEPYLDRCVRSVVNQTYRNLEIILVDDGGKDQCPVMCDAWVERDSRVR 61 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++H+ N G S ARN G++VA G+++AFVD DD V P + + +D+A C+ Sbjct: 62 VIHKPNGGLSSARNAGLDVAVGEFIAFVDGDDYVEPGYIAIMADAMGDVQVDLAMCSVFH 121 Query: 125 CFRETGETWQSIPTDRLRSTGV------LTGPDWLRMGLSSRRWTHVV-WMGVYRRDVIV 177 + Q+ P R V +G D +R +VV W +YRR + Sbjct: 122 EDIDGHAVAQTAPVRREEYAPVTDIRRTCSGLDCMRQRGDENGMDNVVAWNKLYRRSLWD 181 Query: 178 KNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQR 237 I++ G H+D T A A + LY Y + S+ + ++L+ Sbjct: 182 --GIRYPLGKIHEDEFVTYRIFGRARAAVLLPERLYHYIERDGSIVH-SKYTLRSLDIIE 238 Query: 238 HYI-KITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFT 296 I K+ LL++ + + P F Q+ R + + R ++ +F Sbjct: 239 ALIGKVHFLLDE------GAVDLVPGFFSQLKDSIYRARQLDWSDEQVHKRLEVLFRLFR 292 Query: 297 SGMYKRLITNVRSVKVGYQALLWSFRL---WQWRDKTRSHHRITRS 339 + + + R + V + R+ W+ K + + Sbjct: 293 TLPWSTM----RYLSVKDRVNYLGTRICPFLFWQRKLYDNKSAVKG 334 >UniRef50_D2Q652 Glycosyltransferase, family 2 n=1 Tax=Bifidobacterium dentium Bd1 RepID=D2Q652_9BIFI Length = 320 Score = 260 bits (666), Expect = 4e-68, Method: Composition-based stats. Identities = 71/252 (28%), Positives = 115/252 (45%), Gaps = 10/252 (3%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M+ +SVI+P+Y+ CM SL+ QT+ LEII+++DGS D+ + +A Sbjct: 1 MSEQPLVSVIVPVYSVERSLGRCMHSLVGQTYRHLEIILVDDGSPDDCPRLCDAWAGQDD 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 +R++H+ N G S ARN G++ G +VAFVD+DD V P L D D++ C+ Sbjct: 61 RIRVIHKPNGGLSDARNAGLDAMRGDFVAFVDSDDYVEPDYVNRLYAGV--SDADMSMCS 118 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNI 181 C G+ WQ ++ ++LR L R V W +Y + N + Sbjct: 119 I-ICEDSNGKAWQG-NEPITDERIAVSSEEYLRRALLDWRL-VVAWNKLYAASIW--NQL 173 Query: 182 KFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIK 241 +F G H+D + + LY Y ++S++ G N R Sbjct: 174 RFPVGCIHEDEYVLHQVVDRCKSINVLPDELYHYVSTDSSITHS---GFSIRNLDRLEAL 230 Query: 242 ITRLLEKLNRNY 253 + RL +++NY Sbjct: 231 VRRLEYCVSKNY 242 >UniRef50_A6L7A5 Glycosyltransferase family 2 n=6 Tax=Bacteroides RepID=A6L7A5_BACV8 Length = 315 Score = 260 bits (665), Expect = 5e-68, Method: Composition-based stats. Identities = 68/246 (27%), Positives = 125/246 (50%), Gaps = 9/246 (3%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 +SVIIP+YN C+ S++ QT T +E++++NDGSTD S +I YA ++ Sbjct: 1 MPLVSVIIPIYNTEKFLPLCINSVLNQTLTDIEVLLVNDGSTDGSGKICDEYACKDQRIQ 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++H N G S ARN+G+E A G+Y+AF+D+DD + M TL + ++ +A C Sbjct: 61 VIHTLNQGVSHARNQGLETAKGEYIAFMDSDDWIETDMIATLYQLIRTNEAGLATCGYII 120 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSR--RWTHVVWMGVYRRDVIVKNNIK 182 E + I +G LT + + + R ++ +W +Y +++I K+++K Sbjct: 121 ----ENEDGRPIYHINEVKSGKLTQWEAIHSLFNDRHYKYKGNLWDKLYHKEIIDKHHLK 176 Query: 183 FIAGLHH-QDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGN--KNLNYQRHY 239 F +++ +D ++ +++ + A YT Y Y +S L ++ K + + Sbjct: 177 FNEHIYYNEDRLFIFQYLSHCQSAAYTTSPYYHYVTRKSSAMNLSQKNYTPKLCTFMDAF 236 Query: 240 IKITRL 245 +T L Sbjct: 237 DLMTSL 242 >UniRef50_C0FV15 Putative uncharacterized protein n=1 Tax=Roseburia inulinivorans DSM 16841 RepID=C0FV15_9FIRM Length = 278 Score = 260 bits (665), Expect = 5e-68, Method: Composition-based stats. Identities = 59/269 (21%), Positives = 114/269 (42%), Gaps = 3/269 (1%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 +SVI+P+Y C++SL+ Q++ LEII+++DGS D I Y ++ Sbjct: 2 KPLISVIVPIYKVEKYLTICVDSLLRQSYGNLEIILVDDGSPDECPAICDQYQNKDNRIK 61 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++H+ N G S ARN G+++A G+YVAFVD+DD + L ++ +A C Sbjct: 62 VIHKKNGGLSDARNAGLDIAQGEYVAFVDSDDFIDEDYICKLYDALQQNRASIAVCGIQI 121 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 E+ + + I + + + TG + ++ L W +Y + + ++F Sbjct: 122 I-DESNKVTEQICVTQDKKQELYTGREIIKKELQGEWVLVTAWGALYDAKIFKE--LRFP 178 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 G H++D N +++Y Y S+ + + ++Y + + Sbjct: 179 CGRHYEDEYVFVSVFENQECVVCIPENIYYYLQRADSIMGVTYKKQDCMDYFEMWHERID 238 Query: 245 LLEKLNRNYADKIMIYPEFHQQITYEALR 273 + +I + Y A+ Sbjct: 239 YFQTKEDRELLAPVIQSCLAWNVLYLAMN 267 >UniRef50_B2UYP7 Minor teichoic acid biosynthesis protein GgaB n=4 Tax=Clostridium RepID=B2UYP7_CLOBA Length = 335 Score = 260 bits (664), Expect = 7e-68, Method: Composition-based stats. Identities = 78/275 (28%), Positives = 131/275 (47%), Gaps = 10/275 (3%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 NS +S+I+P+Y A D + S++ QT E+I+++DGS DNS +I A N Sbjct: 13 KNSEVLISIIMPIYKAEDYLNNSITSILNQTLENFELILVDDGSPDNSGKICDELALNDN 72 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 ++++H+ N GAS ARN G+++A G+++AFVD+DD + P M+ETL +A E + D++QCN Sbjct: 73 RIKVIHKENGGASTARNAGLDIAQGEFIAFVDSDDWIEPNMFETLFNLAKEHNADISQCN 132 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVV-WMGVYRRDVIVKNN 180 E + I D + L + + VV W +Y++ + N Sbjct: 133 YIKVENED----EKIINDDKEIIISFNNIESLNNLYNEMYVSTVVLWNKIYKKSLF--NK 186 Query: 181 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYI 240 I+F +D + +F + + YT + LY Y S+ + K L + Sbjct: 187 IRFPNMRIFEDEAIMYKLLFESKKLVYTNKKLYYYRNTPDSIMNA-KFDKKRLCMLDVFD 245 Query: 241 KITRLLEKLNRN--YADKIMIYPEFHQQITYEALR 273 + + KLN YA + Y + Y + Sbjct: 246 EKVEFMSKLNTEHLYAKTLKWYMWSIIDLYYGCIN 280 >UniRef50_B1QTB6 Glycosyl transferase, group 2 family protein n=4 Tax=Bacteria RepID=B1QTB6_CLOBU Length = 326 Score = 260 bits (664), Expect = 7e-68, Method: Composition-based stats. Identities = 82/305 (26%), Positives = 137/305 (44%), Gaps = 21/305 (6%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M + +SV+IP+YN C+ES+I QT+T L+II+++DGSTDNS I Y + Sbjct: 1 MINGELISVVIPVYNVEKYLPKCIESIINQTYTNLQIILVDDGSTDNSGSICDEYKKKDT 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 + ++H+ N G S ARN GI+ + GKY+ FVD+DD + MYE +M + + +++ N Sbjct: 61 RIMVIHKKNGGLSDARNVGIKYSKGKYIGFVDSDDYINKKMYEIMMENMVNNKANISIVN 120 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNI 181 + F + E + + ++ V++ + + + + W +Y +++ N+I Sbjct: 121 RYYVFEDGQEFLRYSINESIK---VMSNLEAIEEMNNFSTFDMAAWDKLYEKELF--NDI 175 Query: 182 KFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIK 241 +F G +D R Y LY YY N S+SR N ++ R + Sbjct: 176 EFPVGKLSEDFYIMYLLFEKCKRIVYDSSPLYYYYQRNNSISR---NKKINFDFVRAAKE 232 Query: 242 ITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYK 301 +E YP+ + + V +QRMI E K Sbjct: 233 QMTYIENR----------YPKLKPCMRVAYISANLTVYNS---YLKQRMIPEKCVVKEIK 279 Query: 302 RLITN 306 R I Sbjct: 280 REIGK 284 >UniRef50_C3Q1B0 Glycosyltransferase family 2 protein n=2 Tax=Bacteroides RepID=C3Q1B0_9BACE Length = 347 Score = 259 bits (663), Expect = 9e-68, Method: Composition-based stats. Identities = 76/349 (21%), Positives = 156/349 (44%), Gaps = 32/349 (9%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M N K+SVI+P+YN C++S++ QT LEII+++DGS DN + YA+ Sbjct: 1 MENDIPKISVIVPVYNVERYLDRCIQSILNQTLKELEIILVDDGSPDNCPALCDEYAQKD 60 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 ++++H+ N G ARN G+ +A+GK+V+F+D+DD V P MYE L +A + D C Sbjct: 61 ARIKVIHKKNEGLGFARNAGLNLASGKFVSFLDSDDWVAPAMYEALYRVAEKMKCDTVYC 120 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGL-----------SSRRWTHVVWMG 169 + + + + + IP + + G + L S ++ V Sbjct: 121 SLQYYYSKD----KIIPFEEVGQEVFFRGRKSVDSFLLDMLAPLPSYRSDVKYMVSVCKA 176 Query: 170 VYRRDVIVKNNIKFIAGLH--HQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHR 227 +Y R +I N ++F++ +D+ + ++ A + + Y Y+ + S++ + Sbjct: 177 IYLRKIIEDNQLRFVSEKVVASEDMFFHVRYLKLAENVGFIPEYFYNYFQNECSITHTYT 236 Query: 228 QGNKNLNYQRHYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTR 287 + ++ R +++++ +A+ ++ ++ +AL + + Sbjct: 237 --------EDKIERLKRFIQEMHLVFAEY-FAEEDYWVRLQRKALHYLRNSLYIKYQIVK 287 Query: 288 QRMIAEIFTSGMYKRLITNVRSVKVG----YQALLWSFRLWQWRDKTRS 332 ++ + + K++ + YQ L RL+ K R+ Sbjct: 288 KKSFS--IQNYELKQICKDAFFYHYLNNYPYQLLPMKHRLFYLLIKYRA 334 >UniRef50_A5UNM1 Glycosyltransferase/CDP-glycerol:poly(Glycerophosphate) glycerophosphotransferase, GT2 family n=3 Tax=Methanobrevibacter smithii RepID=A5UNM1_METS3 Length = 1193 Score = 259 bits (662), Expect = 1e-67, Method: Composition-based stats. Identities = 81/316 (25%), Positives = 141/316 (44%), Gaps = 2/316 (0%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 S K+SVI+P++N D T ++S++ Q+ +EII INDGSTD+S++I + YA+ + Sbjct: 2 SKVKVSVIVPVFNVRDYLSTSLDSILNQSLEDIEIICINDGSTDDSLKILENYAKKDNRI 61 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 +++ + N G ARN G++ A G++++FVDADD + M E L ++ +LD+ C Sbjct: 62 KIISKENEGQGTARNVGLDNACGEFISFVDADDFIKKDMLEKLYNKSVNGNLDLVMCKVS 121 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGP--DWLRMGLSSRRWTHVVWMGVYRRDVIVKNNI 181 ET ++ L+ + L + + + +YRR I KNNI Sbjct: 122 SFDNETHIIDDNLWYYSLKCFSGFKKEVFNNLDTKEFTSLISVTPYNKLYRRSFIEKNNI 181 Query: 182 KFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIK 241 +F +D V+ A R +++LY Y + + Q ++ + Sbjct: 182 RFPEKYIFEDEVFFYNVYLKAKRISLIDENLYYYRTNRKGSTVSKGQDKDYIDVIHIFRL 241 Query: 242 ITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYK 301 I LL + N K +Y F I + + RK L ++ + S Sbjct: 242 IRDLLVETNYINVYKKQVYNRFIHLILWRFSQTAPKYRKNFFNLMKKEFDEILDDSSKDF 301 Query: 302 RLITNVRSVKVGYQAL 317 L + K+ + L Sbjct: 302 SLNVDELDFKIKSRTL 317 Score = 155 bits (393), Expect = 2e-36, Method: Composition-based stats. Identities = 72/339 (21%), Positives = 149/339 (43%), Gaps = 21/339 (6%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTW---TALEIIIINDGSTDNSVEIAKYYAE 58 ++S SVI+ YN +ESL Q + +E+II++DGS D+S++IAK Y + Sbjct: 331 LDSCKPFSVIMASYNNDKYLEDAVESLTVQNFGFEHNVELIIVDDGSNDDSLKIAKKYQK 390 Query: 59 NYP-HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDL-D 116 NYP +++++ + N G + ARN G+E ATG Y+ F+D+DD++ P ++ + +++ D Sbjct: 391 NYPYNIQVISKENGGQASARNLGLEYATGDYINFLDSDDKLSPNTFKAVYDFLVKNPAVD 450 Query: 117 VAQCNADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVI 176 V + + G + + +L PD+ ++ SS +R+ I Sbjct: 451 VISIPITFFDNQKGAHILNYKYKNEKVVNLLDEPDYPQLSASS---------AFIKREAI 501 Query: 177 VKNNIKFIAGLHH-QDIVWTTEFMFNALRARYTEQSLYKYYLH-NTSVSRLHRQGNKNLN 234 N++F L + +D ++ + + + + + + Y Y + S + + Q K Sbjct: 502 G--NLRFDTSLVNSEDALFVNKILIDNPKLGLVKNANYLYRKRFDESSTIDNSQKKKGFF 559 Query: 235 YQRHYIKITRLLEKLNRNYADKIMI--YPEFHQQITYEALRVCHAVRKEPDILTRQRMIA 292 R L++ + Y + Y + + + + +I + Sbjct: 560 TDRLKYYFKELIDYSVKKYDKALEFIQYTLIYDLQWMVKVEEIDNILNKDEINEFWEVFL 619 Query: 293 EIFTSGMYKRLITNVRSVKVGYQALLWSFRLWQWRDKTR 331 +I S + + +I N +++ + L + + KT Sbjct: 620 DIL-SYIDEDIIKNYKTLDNNVREFLLTIKQSDLTAKTI 657 >UniRef50_Q9F0B8 Beta(1,4)galactosyltransferase EpsJ n=1 Tax=Lactobacillus delbrueckii subsp. bulgaricus RepID=Q9F0B8_LACDE Length = 327 Score = 259 bits (662), Expect = 1e-67, Method: Composition-based stats. Identities = 69/317 (21%), Positives = 143/317 (45%), Gaps = 16/317 (5%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 KLS+I+P+Y + + C+ S++ QT+ E+I+++DGS D+ +I Y++ + +V+++ Sbjct: 2 KLSIIVPVYKVEEYLKDCVNSILNQTFHDFELILVDDGSPDSCPKICDEYSQKFNNVKVV 61 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 H+ N G S ARN G++VATG+Y++F+D+DD + MYE + ++ ++ D+ + + Sbjct: 62 HKQNGGLSSARNAGMKVATGEYISFIDSDDYLASNMYEHVFSIMKKECADIVVVGRCYVY 121 Query: 127 RETGETWQSIPTDRLRSTGVLTGPDWLRMGLSS--RRWTHVVWMGVYRRDVIVKNNIKFI 184 + + ++ V+ GP + +S + W VY+R + N++ + Sbjct: 122 PNGSKKLR----EKQNVYEVMDGPKATAIMNTSLLGYFDAAAWDKVYKRSLF--NDVSYP 175 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 G +D T + A R Y +Y Y S++ N + Y ++ Sbjct: 176 EGKLSEDWYTTYKVFAKANRIVYDSTPMYYYRQRGGSITHTSSTVNYDAMYASR--EVYD 233 Query: 245 LLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLI 304 ++K Y E + + + V + +P + ++ K+L Sbjct: 234 FVKKRQPEYT------AEANFAYVFSRIGVIDNLTVQPTVDKQKIRKIRNDMKANIKQLK 287 Query: 305 TNVRSVKVGYQALLWSF 321 K+ + + F Sbjct: 288 KTDCFSKLSKKRKIQLF 304 >UniRef50_A8SZ40 Putative uncharacterized protein n=1 Tax=Coprococcus eutactus ATCC 27759 RepID=A8SZ40_9FIRM Length = 321 Score = 258 bits (660), Expect = 2e-67, Method: Composition-based stats. Identities = 70/308 (22%), Positives = 140/308 (45%), Gaps = 15/308 (4%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 + K+S+I+P+YN ++ C+ SLI QT+ +EI++++DGS ++ YA+ + Sbjct: 13 ESVKVSIIVPVYNVENELDRCVASLINQTYQNIEILLVDDGSPGACPKMCDEYAKKDKRI 72 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 ++H+ N G S ARN G++ A+GKYV FVD+DD + E +T ++DD+D+ Sbjct: 73 IVIHKPNGGLSSARNTGLKRASGKYVLFVDSDDYIETDSCEKFLTF-VKDDVDLVVGAGR 131 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 G+ + +R + D+ + W + +Y R+ ++KN++ F Sbjct: 132 MI---KGKDEILLRHSNIREREKYSARDFTIKSIKHSEWNSYACLSLYDREFLIKNDLFF 188 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLN----YQRHY 239 G+ +D+ E A Y + + Y Y + + S+ NK + Y Sbjct: 189 REGIIFEDMQILPEIYLKAGNIVYMDYAFYNYVVRDGSIMTSGNVENKRRTCVEILEEWY 248 Query: 240 IKITRLLEKLNRNYADKIMIYPEFHQ-------QITYEALRVCHAVRKEPDILTRQRMIA 292 I ++ ++++ R Y ++ + + + ++R I + ++I Sbjct: 249 ITMSNVMDRRYRRYLYGALVNFYLWSCRSIGFCKWNIKNVNFVFSIRYAVGIKEKLKVIM 308 Query: 293 EIFTSGMY 300 F +Y Sbjct: 309 FQFLPKLY 316 >UniRef50_C6LCR2 Glycosyl transferase CpsJ n=1 Tax=Bryantella formatexigens DSM 14469 RepID=C6LCR2_9FIRM Length = 314 Score = 257 bits (658), Expect = 3e-67, Method: Composition-based stats. Identities = 62/226 (27%), Positives = 109/226 (48%), Gaps = 4/226 (1%) Query: 3 NSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPH 62 N +SVI+P+Y C+ S++ Q++ +E+I+++DG+ DN ++ +A+ Sbjct: 7 NRYPLISVIVPVYMVEPYLDRCITSIVRQSFRDIEVILVDDGTLDNCGKMCDDWAKKDRR 66 Query: 63 VRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNA 122 +R+ HQ N G S ARN GI A GKY+ FVD+DD + M E L+ LE D+A CN Sbjct: 67 IRVFHQKNYGLSAARNTGINHAQGKYLCFVDSDDYIETDMLEKLLAALLEQQADMAVCNF 126 Query: 123 DWCFRETGETWQSIP-TDRLRSTGVLTGPDWLRMGLSSR-RWTHVVWMGVYRRDVIVKNN 180 + + + ++ P + + +TG + + M + + V W +Y+R + K Sbjct: 127 VYEYERAEQIYRKNPESYQNDIDQTITGREIMLMAEQGKHSYCVVAWNKLYKRSLFEK-- 184 Query: 181 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLH 226 +++ G H+D M R + Y Y S+ + Sbjct: 185 LRYPEGKIHEDEFVFHHIMSACQRVICLRYTGYHYVQREGSIMHRN 230 >UniRef50_C2M8C3 Glycosyl transferase, family 2 n=1 Tax=Capnocytophaga gingivalis ATCC 33624 RepID=C2M8C3_CAPGI Length = 321 Score = 257 bits (658), Expect = 3e-67, Method: Composition-based stats. Identities = 74/323 (22%), Positives = 143/323 (44%), Gaps = 9/323 (2%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 + +S+I+P+YN C++SL+ QT+ +EI+++NDGS D + EI Y E YP + Sbjct: 2 NNPLISIIVPVYNGETYLARCLDSLLDQTYAPIEILVVNDGSQDRTAEILSQY-EQYPQI 60 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 ++ +Q N G S ARN G++ G +V FVD+DD V PT E L ++++ D++ C Sbjct: 61 KVFYQENKGLSGARNLGLKHFEGDFVCFVDSDDYVTPTYCEELYKALIDNNADISVCQYT 120 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 + E IP ++ + + + + L + L + W +++++++ N F Sbjct: 121 LIDTKGKE----IPQEKDKKIQLFSHIEALALLLEDGQLKSYAWGKLFKKELLE--NFSF 174 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKIT 243 G +D A + LY Y + S+S + G K ++ +++ Sbjct: 175 PMGRIFEDYASMFFLFDKATKVVMINNCLYNYIQYPNSLSN-KKGGEKFFHFFLGVMEMY 233 Query: 244 RLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRK-EPDILTRQRMIAEIFTSGMYKR 302 + + + I + +Q + R ++ E I + F S YK+ Sbjct: 234 DFACQYKEKFPNWNRIVEKTAKQSLFAIKRFYLFSKEVEKSIEQAMKERLTSFYSNEYKK 293 Query: 303 LITNVRSVKVGYQALLWSFRLWQ 325 + R + L + + + Sbjct: 294 IPFKYRWKIFSFLKLPFLYNTYI 316 >UniRef50_Q03KL3 Glycosyltransferase involved in cell wall biogenesis n=1 Tax=Streptococcus thermophilus LMD-9 RepID=Q03KL3_STRTD Length = 326 Score = 257 bits (658), Expect = 4e-67, Method: Composition-based stats. Identities = 69/264 (26%), Positives = 124/264 (46%), Gaps = 7/264 (2%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 MN + ++S+++P+YN D C++S++ Q+ E+II++DGSTD S + Y+E+ Sbjct: 1 MNDSVEISIVLPVYNVIDYLNECIKSILRQSNKNFELIIVDDGSTDGSQILCDRYSESDS 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 ++++HQ NAG S ARN G+ A GKY+ F+D+DD V E L + +E D++ C+ Sbjct: 61 RIKVIHQKNAGLSAARNTGLNHAKGKYITFIDSDDFVGEKYIEVLYKIIIESCADISMCD 120 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRR--WTHVVWMGVYRRDVIVKN 179 ++T + + G + + + V W +Y+ ++ N Sbjct: 121 YMEVAQDTQISDVTFQEKYPIDKYYYNGKEAIANVYKNSSHGMEFVSWAKMYKLELFKSN 180 Query: 180 NIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHY 239 I F H+D T + + + + + LY Y + + S+ K L+ Sbjct: 181 GILFPEEKLHEDAFTTYKLFYASNKIAFVNAPLYFYRIRSGSIM-TSEFTEKRLDM---- 235 Query: 240 IKITRLLEKLNRNYADKIMIYPEF 263 I+ TR R D ++Y F Sbjct: 236 IQATREEYLFFRENRDFELMYLAF 259 >UniRef50_B2ISC6 Glycosyl transferase, family 2/glycosyl transferase family 8 n=8 Tax=Streptococcus pneumoniae RepID=B2ISC6_STRPS Length = 696 Score = 257 bits (658), Expect = 4e-67, Method: Composition-based stats. Identities = 68/228 (29%), Positives = 107/228 (46%), Gaps = 7/228 (3%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M +K++VI+P+YN + R C++S+ITQT+ +EI+++NDGSTD S EI K ++E Sbjct: 1 MLVDDKITVIVPVYNVENYLRKCLDSIITQTYKNIEIVVVNDGSTDASGEICKEFSEMDH 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 + + Q NAG S ARN G+ +G YV FVD+DD + ETL E D+A N Sbjct: 61 RILYIEQENAGLSAARNTGLNNMSGNYVTFVDSDDWIELDYVETLYKEITEYQADIAVGN 120 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHV----VWMGVYRRDVIV 177 G + I D V ++ W +Y+ + Sbjct: 121 YYSFNESEGMFYFHISGDSYY-EKVYDNVSIFENLYETQEMRSFALISAWGKLYKARLFE 179 Query: 178 KNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRL 225 ++F G +D + + + Y +SLY Y + S+SR+ Sbjct: 180 --QLRFDMGKLGEDGYLNQKVYLLSEKVIYLNKSLYAYRIRKGSLSRI 225 >UniRef50_D1A4K2 CDP-glycerol:poly(Glycerophosphate)glycerophosph otransferase n=1 Tax=Thermomonospora curvata DSM 43183 RepID=D1A4K2_THECD Length = 1157 Score = 257 bits (657), Expect = 4e-67, Method: Composition-based stats. Identities = 73/271 (26%), Positives = 116/271 (42%), Gaps = 18/271 (6%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 LS+++P+YN + C+ESL QT E+I+++DGSTD S IAK +A R Sbjct: 2 PPLLSIVVPVYNVEPYLQECLESLQNQTMDDFEVIMVDDGSTDGSAAIAKEFASRDGRFR 61 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 L+ Q N G ARN G+ ATG Y+AFVD DD V P Y+ ++ + D A C Sbjct: 62 LIQQENQGPGPARNTGVRHATGTYLAFVDGDDVVPPNAYQRMINSLEKSGSDFA-CGGVR 120 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 F G + D T + R W VYR+ +N +F Sbjct: 121 RFSSKGIYPSKMHADIFDRPAKKTHISRRPELVGDR----TPWNKVYRKSFWDAHNFQFP 176 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNT---SVSRLHRQGNKNLNYQRHYIK 241 GL ++D A + +Y + + S+++ + + + Sbjct: 177 RGL-YEDPTVAIPAHVLAEKVDVLSIPVYHWRVREGGERSITQRRTEPGNLEDRVTAIRR 235 Query: 242 ITRLL---------EKLNRNYADKIMIYPEF 263 +++ L E ++R +ADK+ +Y E Sbjct: 236 VSKFLREHAPELMREYMSRVFADKLPMYIEV 266 >UniRef50_A7VPB2 Putative uncharacterized protein n=1 Tax=Clostridium leptum DSM 753 RepID=A7VPB2_9CLOT Length = 332 Score = 257 bits (656), Expect = 6e-67, Method: Composition-based stats. Identities = 70/303 (23%), Positives = 131/303 (43%), Gaps = 9/303 (2%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 +S++IP+YN R ++S+ QT+ E II++DGSTD EI Y Y + Sbjct: 13 EKPIISIVIPVYNVDMFLRKALQSVADQTFQNYEAIIVDDGSTDRCPEIIAKYVRKYSNF 72 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 + + Q NAG S ARN G+ +A G+++AF+D+DD PT E + + +D D++ C+ Sbjct: 73 KSVRQENAGLSAARNTGLRLAQGEFIAFMDSDDYYQPTFLEDMYNACVANDCDISYCSFY 132 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 + F G IP T V L + + + H W +Y+R + ++N++ + Sbjct: 133 FYFPYNG---LRIPLPFTCRTAVYESRQALNLLIRDKSMHHFAWNKLYKRSLFLENDVTY 189 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKN-----LNYQRH 238 ++ +D+ + + ++A R T + LY Y S+ + N L Y R+ Sbjct: 190 T-QMYFEDVATSPKLYYHAKRVAVTSKPLYNYTRRRGSILKSMNAKKINDFIRTLGYTRN 248 Query: 239 YIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSG 298 +++ +++ + + + + F S Sbjct: 249 FLEEADDFRAFRKSFNKFARRMKMVNWYSVLRLHVITLNFKGMWKNIKNSNRSIRYFMSE 308 Query: 299 MYK 301 Y+ Sbjct: 309 DYR 311 >UniRef50_A6NTS0 Putative uncharacterized protein n=1 Tax=Bacteroides capillosus ATCC 29799 RepID=A6NTS0_9BACE Length = 353 Score = 257 bits (656), Expect = 6e-67, Method: Composition-based stats. Identities = 69/318 (21%), Positives = 137/318 (43%), Gaps = 22/318 (6%) Query: 6 NKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRL 65 +++S+I+P+Y R C++S++ QT+T +E+I+++DGS DN I YA+ V + Sbjct: 7 SEISIIVPVYKVEPYLRRCVDSILAQTFTDIEVILVDDGSPDNCPAICDKYAQQDNRVIV 66 Query: 66 LHQANAGASVARNRGIEVA----TGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 +HQ G SVARN G+ A ++++FVD+DD V+P E L A E+++ ++ C+ Sbjct: 67 IHQEYQGVSVARNTGLNWALLNSNSQWISFVDSDDWVHPRFLEYLHRAATENNVQISVCD 126 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNI 181 E+ P + + L L VVW +Y + + K I Sbjct: 127 YCR------ESIYEEPKSNVEYDTLKVTSLQLYQHLEKSLLFTVVWNKLYYKSLFEK--I 178 Query: 182 KFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIK 241 +F G +D + + M+ + + + LY YY++ T ++ Y + Sbjct: 179 RFPVGKISEDTFVSYKLMYKCPQIVFIKHPLYFYYINQTGITSS--------TYSPVRLV 230 Query: 242 ITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYK 301 L+ + + ++ ++ + +R ++ + + + Sbjct: 231 ELDALKLQQQFFHEQQAT--DWEKSCIIRTIRTYAFHIQKCSEFPLLKQYETKLQQELRR 288 Query: 302 RLITNVRSVKVGYQALLW 319 L +N + K+ W Sbjct: 289 LLKSNKKFYKLSLPRDAW 306 >UniRef50_C5TEG1 Glycosyl transferase family 2 n=1 Tax=Zymomonas mobilis subsp. mobilis ATCC 10988 RepID=C5TEG1_ZYMMO Length = 514 Score = 257 bits (656), Expect = 6e-67, Method: Composition-based stats. Identities = 81/305 (26%), Positives = 126/305 (41%), Gaps = 13/305 (4%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M N K+S I+P YN C+ SL Q+ EII++ D STD ++ A+ YA Sbjct: 1 MNNHEIKISFIVPAYNVKSYLGKCLTSLQQQSLDNFEIIVVEDCSTDGTLSKARQYARKD 60 Query: 61 PHVRLL-HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQ 119 P ++++ H+ N G ARN G+E A GKY+ FVD+DD V P E A D+ Sbjct: 61 PRIQIICHKENQGLGPARNTGMEYARGKYICFVDSDDWVDPGYGEAFYLEAERTSADMVV 120 Query: 120 CNADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKN 179 F + +R + LS VW YRR+ + Sbjct: 121 GTFYAVFSQGARIVSHFVDPAVRYASLPFDAQTCPAVLS---MPTPVWDKCYRREFLKNY 177 Query: 180 NIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHY 239 NI+F A L +DI + E M A R + Y Y + +T + N++L R Sbjct: 178 NIRFPA-LIGEDIPFQWEAMTQAKRISVVGEPFYYYRIRDT-------KKNRSLTAGRDI 229 Query: 240 IKITRLLEKLNRNYADKIMIYPEFHQQITYEALR-VCHAVRKEPDILTRQRMIAEIFTSG 298 L ++ + + Y + ++ + H K D L +A++F + Sbjct: 230 FADVFLAQEKALAFLNDSGHYEKLKSIWWERMIKELLHLTDKSGDTLVSDNFVAKVFHTM 289 Query: 299 MYKRL 303 + K L Sbjct: 290 LRKSL 294 >UniRef50_B2V0U1 Glycosyl transferase, group 2 family protein n=1 Tax=Clostridium botulinum E3 str. Alaska E43 RepID=B2V0U1_CLOBA Length = 324 Score = 256 bits (654), Expect = 1e-66, Method: Composition-based stats. Identities = 76/310 (24%), Positives = 135/310 (43%), Gaps = 22/310 (7%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M+N ++ +I+P+Y C++S++ Q+++ LEII++NDGS DN EI + Y N Sbjct: 1 MLNE--RIGIIVPIYKVEKYLCRCLDSIVNQSYSNLEIILVNDGSPDNCHEICENYKRND 58 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 ++++HQ N G S ARN G++ ++ F+D+DD + M E L + E D+ C Sbjct: 59 NRIKVIHQKNMGLSAARNTGLKELQSPFLVFIDSDDYIELDMIEELYKLMKETKSDIVSC 118 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN 180 F T + + + L L + V W +Y+R++ + Sbjct: 119 GHKQIFDSGNIEELKGNTQ----ITIYSKKEALSKFLFTNVIDVVSWNKLYKRELFT--D 172 Query: 181 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYI 240 IK+ +G ++D T + + A + YT + LY Y +TS+ Sbjct: 173 IKYPSGKLYEDHFTTYKIIDRAEKITYTSKVLYNYCKRSTSIGGNEYNP----------- 221 Query: 241 KITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMY 300 K +L E L+ I YP+ I L CH + + + I F + Sbjct: 222 KTMQLKEALDEECKFIIGEYPDLKDDIN---LAYCHWMFVIYNKMLLVDSIDNNFEKFLR 278 Query: 301 KRLITNVRSV 310 ++ +V + Sbjct: 279 NKIQESVSKL 288 >UniRef50_C3R5C9 Glycosyl transferase WciV n=3 Tax=Bacteroides RepID=C3R5C9_9BACE Length = 344 Score = 255 bits (653), Expect = 1e-66, Method: Composition-based stats. Identities = 82/338 (24%), Positives = 157/338 (46%), Gaps = 25/338 (7%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S+I+P+YN C++SL+ QT+ +EII +NDGSTDNS +I K YAE + ++H Sbjct: 2 ISIIVPVYNTAPYLPQCLDSLVNQTYRDIEIICVNDGSTDNSPDILKAYAERDSRILVIH 61 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNA----- 122 Q N G S ARN+G+E A G++V FVD+DD + +TL++ + DV + Sbjct: 62 QENLGLSGARNKGLESARGEWVMFVDSDDWIGTDCCKTLLSH-TDKQTDVCLFSYIREFA 120 Query: 123 -----DWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIV 177 + F + +Q L + + LR+ W +Y+ +I Sbjct: 121 NQSFPQYLFSDKKMVYQGSSIHWLYARLIAPNDQELRLPGKIDSL-STAWGKLYKTSLIK 179 Query: 178 KNNIKFIA--GLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHN-TSVSRLHRQGNKNLN 234 ++ ++F+ + +D+++ + +A Y Y Y +N TS+++L++ Sbjct: 180 EHGLRFVPVRQIGTEDLLFNVYYFTWVKKAIYLPDLFYHYRKNNITSLTKLYKPHLTE-Q 238 Query: 235 YQRHYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEI 294 + + I + L+R +K + Y ++ + + + + ++ I Sbjct: 239 WNLLFRLIEEWITPLHREDLEKALTY----RRALSLIGQGLNITFSPVSFMKQHGALSAI 294 Query: 295 FTSGMYKRLITNVRSVKVGYQALLWSFRLWQWRDKTRS 332 +S Y+ T++R + + Y + W RL+ + RS Sbjct: 295 LSSHEYR---TSIRQLPMNYFPIHW--RLFFEMARFRS 327 >UniRef50_Q8GPB8 Eps6I n=5 Tax=Streptococcus RepID=Q8GPB8_STRTR Length = 328 Score = 255 bits (652), Expect = 2e-66, Method: Composition-based stats. Identities = 78/338 (23%), Positives = 145/338 (42%), Gaps = 11/338 (3%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M L+V+IP+YN + C+ES++ Q W +I++++DGSTD+S +I YA+ Y Sbjct: 1 MKKDVLLTVVIPVYNVEKYLKRCVESVLVQEWHNYDILLVDDGSTDSSPQICDDYAKVYD 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 + ++H+ N G S ARN GI A G+YV F+D+DD + P + L + D D+ N Sbjct: 61 FISVIHKKNGGLSAARNTGISNAEGEYVYFLDSDDWIEPNTFSDLAEVIESDQYDIISFN 120 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNI 181 ++ ++ I + T LTG + L S T +YR+ + KN I Sbjct: 121 QEFV-----KSEHDIIKSDSKRTKRLTGKEALIDMFSYGFVTGFATDKIYRKALFTKNTI 175 Query: 182 KFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYL-HNTSVSRLHRQGNKNLNYQRHYI 240 +F G +++D+ + A + T Q Y Y + + S+++ K N Y Sbjct: 176 QFPVGKYYEDLGTNYKLFLAAKKVYATNQKYYHYLIDNPDSITK-SWNEQKFENMISFYK 234 Query: 241 KITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMY 300 ++ + ++ + ++ I F L + + + Sbjct: 235 ELYYSDDVRSKLDSSELAIAKIFFVNGMTHILASLFKSNLYKTYPKLTKCVKKELKINKL 294 Query: 301 KRLITNVRSVKVGYQALLWSFRLWQWRDKTRSHHRITR 338 K + +K+ + + +RL + H + + Sbjct: 295 KFQ----QKLKLPNRTKYFLYRLGLLNGAFQLQHILKK 328 >UniRef50_B0P2L7 Putative uncharacterized protein n=1 Tax=Clostridium sp. SS2/1 RepID=B0P2L7_9CLOT Length = 319 Score = 255 bits (651), Expect = 2e-66, Method: Composition-based stats. Identities = 75/315 (23%), Positives = 140/315 (44%), Gaps = 20/315 (6%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 MM +S+IIP+Y C+E+++ QT+ LEII+I+D S D EI +A+ Sbjct: 1 MME---LVSMIIPVYGVEAYLGECLETVLNQTYENLEIILIDDESPDYCPEICDQHAQKD 57 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 ++++HQ N GA+ ARN G+++ATG+Y+ F+D+DD++ E L+ + DVA C Sbjct: 58 ARIKVIHQKNGGAANARNHGLDMATGEYICFIDSDDKIENNYVEKLLNAIKKSQSDVAVC 117 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN 180 + ++ + + D++R LS ++W ++ + + Sbjct: 118 SFKQWYKNKTQDSIGFKNKE------YSSKDYIRKFLSDWT-CGLIWNKMFDKKTLE--G 168 Query: 181 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYI 240 ++F G D +T + + A + ++SLY Y + + V Q YQ + Sbjct: 169 VRFEEGHKIDDEFFTYQAILKAEKVAIFDESLYWYRMRKSGVMLSGTQ------YQAQML 222 Query: 241 KITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMY 300 K LE + + Y I YP+ + Y + ++ + + Sbjct: 223 K--DRLEYMTKRYEMVIEKYPDLKAEYLYNLTDNLISYTRQAQGIKELEKDVRAIKKQYW 280 Query: 301 KRLITNVRSVKVGYQ 315 R++ + VK+ Y Sbjct: 281 NRIMFGLIPVKIKYS 295 >UniRef50_D2RJ84 Glycosyl transferase family 2 n=1 Tax=Acidaminococcus fermentans DSM 20731 RepID=D2RJ84_ACIFE Length = 324 Score = 254 bits (650), Expect = 3e-66, Method: Composition-based stats. Identities = 83/342 (24%), Positives = 148/342 (43%), Gaps = 26/342 (7%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 LSVI+P+YN C++S+I QT+ LEII+++DGSTD+S I YA + + ++ Sbjct: 2 LLSVIVPVYNVEKYLCKCLDSIIQQTYKNLEIILVDDGSTDSSGLICDNYAAHDKRIHVI 61 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 H+ N G S ARN G+E+ +G Y+ FVD+DD + M+E L+ ++LDVA C + Sbjct: 62 HKKNEGLSEARNSGLEICSGDYIGFVDSDDWIATDMFEKLVKFVKCENLDVAMCGVADVW 121 Query: 127 RETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHV--VWMGVYRRDVIVKNNIKFI 184 E + T + + D + L +R VW +YR + +KF Sbjct: 122 PE-----HVVKTPSFKKVILTDVNDIISEILVNRHGGTAVPVWCRIYRSSLFKT--LKFE 174 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 G +++D + +++ R S Y Y S++ N Q H ++ Sbjct: 175 KGRYYEDGFYLLKWIERTGRFGRFSDSGYFYNHREGSIT----------NIQGHSKRVDD 224 Query: 245 LLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLI 304 + NY +P Y R+ R +++ + + +K+ + Sbjct: 225 FRKAYEDNYNYVKQHFPNSLNAAEY---RLFFTYRSILNLVGNTDLPYSHEIANKFKKNL 281 Query: 305 TNVRSVKVGYQALLWSFRL----WQWRDKTRSHHRITRSAFN 342 +++ + + L W R + ++SAF+ Sbjct: 282 SHIWKNPYMNIKIKLIYTLIGIDPVWYLLLRQFGKASKSAFS 323 >UniRef50_A6BH37 Putative uncharacterized protein n=1 Tax=Dorea longicatena DSM 13814 RepID=A6BH37_9FIRM Length = 323 Score = 254 bits (650), Expect = 3e-66, Method: Composition-based stats. Identities = 77/301 (25%), Positives = 143/301 (47%), Gaps = 13/301 (4%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 MN+ +S+IIP+Y C++S+++QT++ LEII+++DGS D S I +A Sbjct: 1 MNTM--ISIIIPVYKVEKYLDKCIKSIVSQTYSNLEIILVDDGSPDKSGLICDKWATLDS 58 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 +R++HQ NAGA ARN +++A G++++FVD+DD + M++ L++ E D+D+ +C Sbjct: 59 RIRVVHQKNAGAGAARNVALKLAQGEFISFVDSDDYLSVNMFKNLLSYFEE-DIDIVEC- 116 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNI 181 F E ++ V + + +R ++ + ++W +YRR I N+ Sbjct: 117 ---EFISVEEDDVLFIDEKSVQARVFSSEEAMREHIADHYFRQLIWNKLYRRSCIK--NV 171 Query: 182 KFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIK 241 F G D W + + A + + + LY Y + SV + + N+ + + Sbjct: 172 YFPEGKKIDDEFWMYKVIGRAKQLVHCNEKLYAYRQQDNSVMHMLKPENRIQAIEAKGYR 231 Query: 242 ITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYK 301 + EK R +M Y Y+ RV K+ D+ ++ +K Sbjct: 232 HQYIKEKFPRL---TLMSYDCLVMSCIYQIQRVYRTGTKD-DVQKVLEYSKKVLRENEFK 287 Query: 302 R 302 + Sbjct: 288 K 288 >UniRef50_C9LAR4 Glycosyl transferase, family 2/glycosyl transferase family 8 n=1 Tax=Blautia hansenii DSM 20583 RepID=C9LAR4_RUMHA Length = 329 Score = 254 bits (650), Expect = 3e-66, Method: Composition-based stats. Identities = 74/278 (26%), Positives = 135/278 (48%), Gaps = 7/278 (2%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M + +S+IIP+YN D C++S+ QT+ LEII++NDGSTD S E+A + + P Sbjct: 1 MLVSELISIIIPIYNVEDYLERCIKSIQQQTYKNLEIILVNDGSTDGSTEVALEFCQMDP 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 ++L+++ N G S ARN GI+V GKY+ FVD+DD V E L + ++ +A C+ Sbjct: 61 RIKLVNKKNGGLSDARNVGIDVCKGKYIVFVDSDDYVEARYVELLYKSLVSNNAQMAVCS 120 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVV-WMGVYRRDVIVKNN 180 + + + +++ ++TG + L + + + +VV W +Y +++ + Sbjct: 121 FNIVNDNGSVIKRKLVSEK--KNEIITGREMLNRVMRADGYKYVVAWNKIYASNLLE--D 176 Query: 181 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYI 240 ++F G ++D + E +N + + + LY Y S+ K + + Sbjct: 177 LRFEKGKLYEDEFFNFEVSYNLTKVSFVAEPLYNYVQRAGSIKLSELTNEKIEMQKEMHE 236 Query: 241 KITRLLE--KLNRNYADKIMIYPEFHQQITYEALRVCH 276 K + K Y I +Y + I EA + + Sbjct: 237 KRISFYKDNKDKELYERAIQMYCNWMVNIVIEARGIIN 274 >UniRef50_Q9F0B5 Beta(1,4)galactosyltransferase EpsM n=1 Tax=Lactobacillus delbrueckii subsp. bulgaricus RepID=Q9F0B5_LACDE Length = 336 Score = 254 bits (649), Expect = 3e-66, Method: Composition-based stats. Identities = 78/334 (23%), Positives = 143/334 (42%), Gaps = 17/334 (5%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 +S+I+P+YN D C+ SL QT+ +EII+++DGS D S +I A ++ Sbjct: 3 DKLISIIVPVYNVEDYLEDCVHSLELQTYKNIEIILVDDGSPDRSGKICDKLALKDARIK 62 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++H+ N G S ARN G+E + G+Y+AFVD+DD V P + L ++ + + VAQ + Sbjct: 63 VIHKKNGGLSDARNEGLEHSNGEYIAFVDSDDIVEPNFIKVLYSLITQGNYQVAQVGTQF 122 Query: 125 CFRETGETWQSIPTD----RLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN 180 Q I D + + + D L GL R+ W +Y+RD N Sbjct: 123 VNESGDPLNQKIVFDGDMGKKLNGNYVLNRDELIRGLLLRKVHCAAWCNLYKRDFF--NE 180 Query: 181 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQ-RHY 239 I+F G+ ++DI+ + + + + LY Y ++ G NL Y Sbjct: 181 IRFTKGILNEDILMWVDGVEKVDSLIISNECLYDYRQRYGGIT-----GGINLKLDSDAY 235 Query: 240 IKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQR-----MIAEI 294 L+++ Y I + ++ ++ R+ P+ ++ + ++ Sbjct: 236 NNSKIWLKRIEEKYPKSIEEANYNYFMCLWQIIKSTGDARQLPEEVSAVKNNIGSIVDNK 295 Query: 295 FTSGMYKRLITNVRSVKVGYQALLWSFRLWQWRD 328 + S +K + +L L + + Sbjct: 296 YLSKGHKMFFRLISVSPHLGIHILHKIWLKKLKS 329 >UniRef50_B1JIU1 Glycosyl transferase family 2 n=28 Tax=Enterobacteriaceae RepID=B1JIU1_YERPY Length = 341 Score = 253 bits (648), Expect = 5e-66, Method: Composition-based stats. Identities = 92/334 (27%), Positives = 149/334 (44%), Gaps = 19/334 (5%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 MNS + +SVI+P+YN + ++SL +QT +LEII +NDGSTD+S+ + + A Sbjct: 1 MNSVD-ISVIVPVYNCANYLTDLLDSLRSQTGVSLEIIAVNDGSTDHSLALLEQVAHIDS 59 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 + +++Q N G S ARN GI A G +VAF D DD + P T + + LDV N Sbjct: 60 RLVIINQQNQGLSEARNVGIAKAKGNWVAFADGDDWLAPHALSTWLQQVKQQKLDVLIGN 119 Query: 122 ADWCFRETGETWQSIP-------------TDRLRSTGVLTGPDWLRMGLSSRRWTHVVWM 168 + F E P + + G W+ + + + W H W+ Sbjct: 120 G-YQFSSNPERKPVTPLLGQQPWGATLNGQQWIILSVKQNGQQWIILSVKQKEWPHYAWL 178 Query: 169 GVYRRDVIVKNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQ 228 + RRDVI + ++FI + H+DI+WTT+ A R + Y Y + S++ Sbjct: 179 QLIRRDVISTHQLQFIRDMLHEDILWTTQLALVAQRIGFCATPFYGYRTNPDSITNSPSM 238 Query: 229 GNKNLNYQRHYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKE-PDILTR 287 L Y+ I L K+ + K + E+ + +RK+ P R Sbjct: 239 -QALLRRASSYVDIINALVKIAAST--KPPVRNALLSHANQESGHLLGLLRKKLPASPQR 295 Query: 288 QRMIAEIFTSGMYKRLITNVRSVKVGYQALLWSF 321 + T G+ L V ++ ++A+ SF Sbjct: 296 TALAIRFITLGLRPALFRGVTNLHELWRAIRCSF 329 >UniRef50_C0BWY1 Putative uncharacterized protein n=1 Tax=Clostridium hylemonae DSM 15053 RepID=C0BWY1_9CLOT Length = 328 Score = 253 bits (646), Expect = 8e-66, Method: Composition-based stats. Identities = 73/289 (25%), Positives = 129/289 (44%), Gaps = 11/289 (3%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S+I+P+YN C+ S++ QT+ E+I+++DGS D S I Y + +R+ H Sbjct: 2 ISIIVPVYNVEKYLEKCICSILDQTYRDFELILVDDGSRDKSGGICDAYKKKDGRIRVFH 61 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 Q N G S ARN G+ A+G+Y+ ++D+DD V P E L A + +V+ C + Sbjct: 62 QENKGLSGARNAGLARASGEYITYIDSDDSVDPHYLEVLFENAQKYGAEVSVCGYRSVWE 121 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGL-SSRRWTHVVWMGVYRRDVIVKNNIKFIAG 186 + G +S P S + TG + + + + R W +Y + K +++ G Sbjct: 122 DRG--RKSRPEKVKTSVRICTGREAVYKIVAENDRKMITAWGKLYHARL--KPLLEYPQG 177 Query: 187 LHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLH---RQGNKNLNYQRHYIKIT 243 H+D T ++ A + + Q LY Y S+ R+ +K + Sbjct: 178 RTHEDEFVTYRVLYIADKVVISPQELYGYLQRGDSIMNSSYSERRLDKIRALEEAVDYFR 237 Query: 244 RLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIA 292 + +K YA+K + I + + V RK+ R+ +A Sbjct: 238 KEGDKELERYAEKRYL---LQLSIAWYRVSVFLPERKDLTARLREEWLA 283 >UniRef50_B3DP51 Glycosyltransferase for cell wall membrane n=10 Tax=Bifidobacterium RepID=B3DP51_BIFLD Length = 391 Score = 253 bits (646), Expect = 9e-66, Method: Composition-based stats. Identities = 58/222 (26%), Positives = 95/222 (42%), Gaps = 1/222 (0%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M +S+IIP+Y C+ S+ QT+ LEI++++DGS DN + +A P Sbjct: 1 MEQDPLVSIIIPVYKVEKFLDECVASVAAQTYANLEILLVDDGSPDNCPAMCDAWAARDP 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 +R++H+ N G S ARN GI A G Y+ F D+DD V PT+ E + E D D+ Sbjct: 61 RIRVIHKPNGGLSDARNSGIAEAIGAYIYFADSDDTVAPTLVEDCLNAMREYDADLVMFQ 120 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNI 181 D + + VLT + ++ + + W + Sbjct: 121 FDTISENNKPLLSNYRHNDFTEVQVLTPVEVIKKQVKA-EIDGYFWAFLAPASTYQGTGF 179 Query: 182 KFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVS 223 F G +D+ + A R ++LY Y + S++ Sbjct: 180 SFPVGRKIEDLSRICNVIGEATRVVRLPKALYHYRMREGSIT 221 >UniRef50_C2LRT9 Glycosyl transferase, family 2/glycosyl transferase family 8 n=1 Tax=Streptococcus salivarius SK126 RepID=C2LRT9_STRSL Length = 293 Score = 252 bits (645), Expect = 1e-65, Method: Composition-based stats. Identities = 79/316 (25%), Positives = 144/316 (45%), Gaps = 28/316 (8%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M + +S++IP+YN + C+E+++ QT+ LEII++NDGS D S EI K + + Sbjct: 1 MENKEVVSIVIPIYNVEAYLKQCLETIVNQTYPNLEIILVNDGSPDKSEEICKEFFKRDS 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 +R + Q N G S ARN GI++ATG Y+ FVD DD V ETL T + + DV+ N Sbjct: 61 RIRYVRQVNGGLSAARNTGIDLATGDYITFVDPDDWVTDDYVETLYTQLKKYEADVSIAN 120 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWM----GVYRRDVIV 177 + F E+ + T+ S + G + + + + + W +Y+ + Sbjct: 121 YNL-FNESTSKFLIKVTENDYSETLYEGREIIDQD-AIQETRDMAWACAMMKLYKISLFE 178 Query: 178 KNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYL-HNTSVSRLHRQGNKNLNYQ 236 +++F G + +D + + A R +TE+ +Y Y + ++S++ + Sbjct: 179 --DLRFPVGKNVEDNFLMYKLLLKANRVVHTEKCIYWYRVGRKDTLSQVWTEK------- 229 Query: 237 RHYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFT 296 ++ +E N A M+ + R+ A+ K + A + Sbjct: 230 ----RVLDEMEAKNEKLALLGMLGYDLTWHRYIYKTRLKRAIEKMEE--------ANLQD 277 Query: 297 SGMYKRLITNVRSVKV 312 + YK N+R ++ Sbjct: 278 TETYKTAQVNLRFLEQ 293 >UniRef50_UPI0001B5362C putative glycosyl transferase n=1 Tax=Streptomyces sp. C RepID=UPI0001B5362C Length = 1187 Score = 252 bits (644), Expect = 1e-65, Method: Composition-based stats. Identities = 76/305 (24%), Positives = 121/305 (39%), Gaps = 33/305 (10%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 +LS+++P+YN C+ES+ QT+ E++I++DGSTD S IAK +AE R Sbjct: 2 PPRLSIVVPVYNVELYLDECLESIAAQTFADFEVVIVDDGSTDTSAVIAKAFAERDKRFR 61 Query: 65 LLHQANAGASVARNRGIEVAT--GKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNA 122 ++ Q NAG ARN G AT +Y+AFVD+DD + Y+ ++ E D A N Sbjct: 62 VVLQENAGLGAARNVGFRHATPGSEYIAFVDSDDTMPTYAYQRMIDSLDETGSDFAAGNV 121 Query: 123 DWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIK 182 FR G + T ++ R W VYRR ++ + Sbjct: 122 TR-FRSVGMQQSWGHRVAFANDRPKTHISKFPALVTDR----TAWNKVYRRSFWEEHGFQ 176 Query: 183 FIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNT---SVSRLH------------- 226 + G+ ++D + + A +Y + + T S+++ Sbjct: 177 YPEGILYEDAPVSIPAHYFAESVDVISDVVYHWRVRETGERSITQRSTDPVSVIDRVTSV 236 Query: 227 ---------RQGNKNLNYQRHYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHA 277 + G K Y R Y L E+L Y P+F E + Sbjct: 237 RLVRQALLAKSGEKYARYLRDYDHNV-LSEELPLIYKYVGEGGPDFRAAFVKEVGGLVRE 295 Query: 278 VRKEP 282 + EP Sbjct: 296 IGTEP 300 >UniRef50_B9AFV3 Putative uncharacterized protein n=1 Tax=Methanobrevibacter smithii DSM 2375 RepID=B9AFV3_METSM Length = 379 Score = 252 bits (644), Expect = 1e-65, Method: Composition-based stats. Identities = 67/276 (24%), Positives = 133/276 (48%), Gaps = 7/276 (2%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 ++SV++P+YN + R C++SL QT+ E+I +NDGS D+S+ I + YA + Sbjct: 1 MPQISVVLPVYNVEEYLRQCLDSLANQTFEDFEVICVNDGSGDSSLSILEEYASEDERFK 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++ Q N G S ARN G++ GKY FVD+DD + E L D D+ + Sbjct: 61 IISQENKGLSGARNTGMDHIKGKYTIFVDSDDWLELNALEKLYNKITALDSDILMYKFRF 120 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSS--RRWTHVVWMGVYRRDVIVKNNIK 182 +++ + +S+ T+ L ++ +S + +H + +Y+ + + IK Sbjct: 121 FNQDSEQYSESVFTNLEVIDASLENKNFNYRDVSDILFKISHAPFNKLYKTSFLREAGIK 180 Query: 183 FIAGLHHQDIVWTTEFMFNALRA-RYTEQSLYKYYLHNTSVSRLHRQGNKN----LNYQR 237 F L+++D+ + F + + ++SLY Y + N S+S + + + L+ + Sbjct: 181 FPENLNYEDLYFFYMVFFKTEKVSVFRDESLYNYRIRNNSISTTGNERSFDIFEILDLTQ 240 Query: 238 HYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALR 273 +K + K+ ++ +++ ++ E LR Sbjct: 241 DNLKNVNIYSKVKDDFLMFVIVNLKYVYLRLNERLR 276 >UniRef50_B2RH99 Probable glycosyltransferase n=2 Tax=Porphyromonas gingivalis RepID=B2RH99_PORG3 Length = 328 Score = 252 bits (644), Expect = 1e-65, Method: Composition-based stats. Identities = 79/331 (23%), Positives = 139/331 (41%), Gaps = 23/331 (6%) Query: 6 NKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRL 65 ++SVIIP+YNA + C+++L+ QT+ E+I+I+DGS D S I YA V + Sbjct: 3 PRISVIIPVYNAEKFLKHCLDTLLAQTFKDFEVILIDDGSRDRSSNICDAYAYKDTRVHV 62 Query: 66 LHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWC 125 H+ N G S AR +G+E+A+G Y+ VDADD V P+ E L A+ + D+ C+ Sbjct: 63 YHKENGGVSAARQKGLELASGAYIIHVDADDWVAPSYLEVLYEEAVRTNADITICDYIEV 122 Query: 126 FRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIA 185 + + PT VL + W + + VI +N I F Sbjct: 123 YPDKKVYKVQKPTSTSADDFVLD---------LCTKLHGSCWNKLIKASVIRENKIDFPD 173 Query: 186 GL-HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 G+ +D ++ + Y ++LY Y+ + S + + + YI Sbjct: 174 GINLREDKLFNLHVALCSSTISYAPKALYYYFCSHLGGSASSQGSLETS--MKSYIYGNN 231 Query: 245 LLEKLNRNYADKIMI-------YPEFHQQITYEALRVCHAVRK--EPDILTRQRMIAEIF 295 L N ++ ++ YP + YE V H ++ + A Sbjct: 232 LFAHFLSNKSNSALLTAQRRFKYPTIAYLLIYEQDYVSHVSQEISSLSFTYIPDIWATSQ 291 Query: 296 TSGMYKRLITNVRSVKVGYQALLWSFRLWQW 326 S +Y+ + + + + L ++++W Sbjct: 292 ISALYRLTLILSTFLPLKF--LYSIIKIYKW 320 >UniRef50_D1JY96 Putative uncharacterized protein n=1 Tax=Bacteroides sp. 3_1_33FAA RepID=D1JY96_9BACE Length = 346 Score = 252 bits (644), Expect = 1e-65, Method: Composition-based stats. Identities = 82/350 (23%), Positives = 157/350 (44%), Gaps = 27/350 (7%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S+I+P+YNA C++SL+ QT+ +EII +NDGSTDNS++I K YAE + ++H Sbjct: 2 ISIIVPVYNAAPYLPQCLDSLVNQTYRDIEIICVNDGSTDNSLDILKAYAERDSRILVIH 61 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 Q N G S ARN+G++ A G++V FVD+DD + + + + DV + + Sbjct: 62 QENQGLSDARNKGLKNARGEWVMFVDSDDWIDISTCQITLETVHAHQADVVMWTYNREYE 121 Query: 128 ETG---------ETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVK 178 +TW + +L V + L + W +W +YRR I Sbjct: 122 GHSLPKYYITSLQTWDANNISQLHRRMVGPINEELNSPDTMDAW-GTIWAKLYRRTYIDH 180 Query: 179 N-NIKFIAGLH---HQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLN 234 ++FI + +D+++ +M + RA Y +LY Y + S + H Sbjct: 181 PVPVRFIDTKNIGSAEDVLFNIGYMAHIGRAVYIPNALYHYRKESCSFTSRH-NPRLPQQ 239 Query: 235 YQRHYIKITRLL--EKLNRNYADKIMIYPEFHQQITYEALRV-CHAVRKEPDILTRQRMI 291 + Y +I LL +++ + + FH +I+ + + + + + I Sbjct: 240 WDNLYQEIKNLLATQQVGNEFQEA------FHNRISLGIIGLGLNELFAQSSFQDIFNRI 293 Query: 292 AEIFTSGMYKRLITNVR--SVKVGYQALLWSFRLWQWRDKTRSHHRITRS 339 + + K+ I++++ + + ++ + + F W + I + Sbjct: 294 DTLLQRPICKKAISSLKIQYMPLHWK-IFFCFAKWHFTTGVTLLLMIIKK 342 >UniRef50_D2RIJ6 Glycosyl transferase family 2 n=1 Tax=Acidaminococcus fermentans DSM 20731 RepID=D2RIJ6_ACIFE Length = 310 Score = 252 bits (644), Expect = 1e-65, Method: Composition-based stats. Identities = 79/318 (24%), Positives = 154/318 (48%), Gaps = 13/318 (4%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 LSVI+P+Y C++SL+ QT+ L+II+++DGS DNS I YYA+ + Sbjct: 3 QKLLSVIVPVYKVEKFLDRCIQSLLKQTYRNLQIILVDDGSPDNSGIICDYYAKKDFRIE 62 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++HQ+N G S ARNRG+ A G+Y+AF+D+DD + P MYETL+ + +++LDVA+C+ D Sbjct: 63 VIHQSNKGLSGARNRGLIEARGEYIAFLDSDDWIEPDMYETLIGLMEKNNLDVARCSID- 121 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 ET ++ + T + + + VVW VY+RD++ I Sbjct: 122 ---ETDGSFHRLKTPSKQYANRVYENNTCLDLYFHEFLCKVVWNAVYKRDIVD--GILSP 176 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 G +D + ++F R T + L+ Y+++ ++R H+ ++ I + Sbjct: 177 EGHQSEDNYVSGRYLFRCKRIWITSKVLHHYWMNPDGITRAHKIKPWDI-----CICTSL 231 Query: 245 LLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLI 304 L++ L ++ + +++ E + ++ ++ + + + Sbjct: 232 LIKDLPKDGIVTEKYKVQLKRKLARELFHYIRSNDSRCQVIAIKK--KQKYDILKWLDFS 289 Query: 305 TNVRSVKVGYQALLWSFR 322 +R + + + + ++ Sbjct: 290 RKIRFLVLLCKKNIKIYK 307 >UniRef50_D1NRU6 Glycosyl transferase CpsJ n=1 Tax=Bifidobacterium gallicum DSM 20093 RepID=D1NRU6_9BIFI Length = 345 Score = 252 bits (644), Expect = 1e-65, Method: Composition-based stats. Identities = 76/341 (22%), Positives = 142/341 (41%), Gaps = 17/341 (4%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 S +SVI+P+Y C+ES++ QT+ LEI++++DGS DN + +A + Sbjct: 14 SNPLVSVIVPIYGVEAFLDQCVESIVNQTYANLEILLVDDGSKDNCHAMVDEWARRDSRI 73 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 LH+ N G S ARN G+E A+G+YV FVD+DD + P + +T + + + D D+ + Sbjct: 74 IALHKPNGGLSDARNYGLERASGEYVYFVDSDDWIEPQLVQTALRTSQDTDSDMVVIAYN 133 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 + + S +++ + G + + L L + + + W +R V I+F Sbjct: 134 SANEDGSHVFPSKDSEKFPAEGRRSSAEAL-TELWNDKIQNFSWSIFAKRSVYD--GIRF 190 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKIT 243 +D+ T + A + LY Y + + S+ L R+ IK Sbjct: 191 PLNQLMEDMGTTYKLYDAANAVYFLPVCLYNYRVRSNSI--LDRKPPAMSADTVKDIKAI 248 Query: 244 RLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQ------RMIAEIFTS 297 + + + Y + ++ E + + Y + + A + R + TS Sbjct: 249 DIFAQ--QRYPE--LLEKELNWSVRYLSAAIIWASESKQQFDRPSSYRAFVRSTRKYMTS 304 Query: 298 GMYKRLITNVRSVKVGYQALLWSFRLWQWR--DKTRSHHRI 336 + K + + + ++ K R H RI Sbjct: 305 RIKKLGVRRMTLTNMLKVTPIYLHCQPALTAVSKLRDHRRI 345 >UniRef50_C4K3K5 Glycosyltransferase group 2 n=3 Tax=Enterobacteriaceae RepID=C4K3K5_HAMD5 Length = 346 Score = 252 bits (644), Expect = 1e-65, Method: Composition-based stats. Identities = 97/336 (28%), Positives = 161/336 (47%), Gaps = 18/336 (5%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 MNS + LSVIIP++NA +D ESL Q E+I INDGSTD S I + A+ Sbjct: 1 MNSID-LSVIIPVFNAENDLMELFESLKNQKNLFYEVIAINDGSTDQSQIILEKIAQIDK 59 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 L+ Q + G SVARN+GI A G+++AF D+DD + P +ET + A + DLD N Sbjct: 60 RWVLISQPHRGVSVARNQGIARARGRWIAFTDSDDWLAPQAFETWVQQAEKGDLDCLVGN 119 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNI 181 + F+ + R +LTG +W+ + W H V++ + +R ++KN + Sbjct: 120 G-FRFKLNPQKPTETLLYRQPWGKILTGREWIIQSVHKNEWPHYVFLQLVKRAFLIKNQL 178 Query: 182 KFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSR---LHRQGNKNLNYQRH 238 +FI + H+DI+WTTE A R + + LY Y ++ S+++ + R ++ ++Y Sbjct: 179 RFIPNMLHEDILWTTELAEVAQRMGFCAEPLYGYRINPESITQSTSIERLAHRAISYLNI 238 Query: 239 YIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCH-----AVRKEPDILTRQRMIAE 293 K+ E + ++ FHQ + +A R +K P R ++ + Sbjct: 239 IEKLLNKAEAVKQD--------KAFHQALLKQAHRETRHFLGLMRKKLPPSQLRTQLAKK 290 Query: 294 IFTSGMYKRLITNVRSVKVGYQALLWSFRLWQWRDK 329 + G+ + + + AL +F R Sbjct: 291 FTSLGLRTAVFKGICHPDEFWHALRINFLWHIMRSG 326 >UniRef50_Q5LBM4 Putative glycosyltransferase protein n=4 Tax=Bacteroides RepID=Q5LBM4_BACFN Length = 342 Score = 251 bits (642), Expect = 2e-65, Method: Composition-based stats. Identities = 81/337 (24%), Positives = 154/337 (45%), Gaps = 17/337 (5%) Query: 6 NKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRL 65 K+SVI+P+YN C+++L+ QT + +EII+I+D S DN +I YA YP++++ Sbjct: 3 PKVSVIVPIYNVEKYLDQCVQALLAQTLSDIEIILIDDESPDNCPKICDDYAAQYPNIKV 62 Query: 66 LHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWC 125 +H+ NAG +A N G++VATG+YVAF D+DD V MY T+ +A + D Sbjct: 63 IHKKNAGLGMACNSGLDVATGEYVAFCDSDDYVDSDMYMTMYNVAQKYTCDAVFTGLKRI 122 Query: 126 FRETGETWQSIPTDRLRSTGVLTGPDWLRMGL--------SSRRWTHVVWMGVYRRDVIV 177 T + L L R + +YRR++I Sbjct: 123 TMAGIPTGTVTHQKEFKLYKNKNEIHTLLKDLIASDPYAREERAIQVSAKVVLYRRNLIE 182 Query: 178 KNNIKFIAGLH--HQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNY 235 K +++F++ +D+++ + + N+ Q+ Y Y + S+S ++ +L + Sbjct: 183 KKHLRFVSERILPSEDLIFNVDVLANSNIVCVLPQTFYNYRTNPISISHTIKKDKFSL-F 241 Query: 236 QRHYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIF 295 ++ YI+IT +L ++ I F + Y +C+ + + ++++ + I Sbjct: 242 KQLYIEITDRCHRLGVEDNVQLRIQRMF---LGYTRNYICNILNSSITNIEKKQITSSIC 298 Query: 296 TSGMYKRLITNVRSVKVGYQALLWSFRLWQWRDKTRS 332 G++K + ++ + L + R S Sbjct: 299 KDGIWKPIW---KTYPLSVMPLPHRIFTFAMRHNFYS 332 >UniRef50_C0X9Z9 Glycosyltransferase n=1 Tax=Lactobacillus gasseri JV-V03 RepID=C0X9Z9_9LACO Length = 294 Score = 251 bits (642), Expect = 2e-65, Method: Composition-based stats. Identities = 75/251 (29%), Positives = 129/251 (51%), Gaps = 9/251 (3%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M++ K+SVIIP+YN + C+ES++TQT+T LEII+++DGSTD++ EI + Y E Y Sbjct: 1 MDTNEKVSVIIPVYNDEKYLKQCVESVLTQTYTNLEIILVDDGSTDHTPEICEEYREKYN 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 +R+LH+ N G +RN G+ +ATG+YV FVD DD + E L +A+++ D+A N Sbjct: 61 QIRVLHKKNGGVGSSRNAGLALATGEYVLFVDHDDWLDQHHIEDLYNLAVKNKADIAAGN 120 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTH-----VVWMGVYRRDVI 176 + + + + D + +W + + + V W +Y+R + Sbjct: 121 FNIFYDDKSTFAYWLNKDSYFEKD-YSIKEWFSLQYRTDYYNMSLVFTVPWDKLYKRSLF 179 Query: 177 VKNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVS-RLHRQGNKNLNY 235 NI + + +D + T + A + Y +++Y + + +SVS + L Sbjct: 180 K--NIVYPENVPVEDDLTTWKIYLLANKIAYENKAIYTHRILKSSVSASVSETDVFPLRS 237 Query: 236 QRHYIKITRLL 246 I I RL+ Sbjct: 238 IEERITILRLI 248 >UniRef50_Q815A0 Glycosyltransferase n=6 Tax=Bacillus cereus RepID=Q815A0_BACCR Length = 319 Score = 251 bits (642), Expect = 2e-65, Method: Composition-based stats. Identities = 82/332 (24%), Positives = 141/332 (42%), Gaps = 19/332 (5%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 +S+I+P+YN C++S+I Q + LEI+++NDGSTD S I YAE + Sbjct: 2 LISLIVPIYNVEGFLPRCLDSIIKQEYQNLEIVLVNDGSTDQSGLICDQYAEKDERFIVF 61 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 H+ N G S ARN G+ A G Y+ FVD DD + P M L + ++ D+A C + Sbjct: 62 HKENGGVSSARNVGLNKANGNYIGFVDPDDWIEPNMINKLYELIIQHKADIAMCG--YVK 119 Query: 127 RETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAG 186 + S P + ST +L D L L+ + + ++ D+ K N+ Sbjct: 120 EDINGLLISRPVEL--STKILNKEDTLNSILNPNDFRGFLCNKLFSVDLFRKKNLYLDED 177 Query: 187 LHH-QDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRL 245 +H +D+++ + + + + Y Y Y +HN + S+ N+ + Sbjct: 178 IHFCEDLLFCCQAILESKQIVYDSTPYYHYIIHNNNASKS--------NFSFKKLTALNS 229 Query: 246 LEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLIT 305 L+ + D+ I E Q Y + +R E +YK I Sbjct: 230 LDNIINLLNDQKGI--ELEQYKNYYMHLNISLLMNGIHEDKIRRQDKERLKKNLYKFKIN 287 Query: 306 NVRSVKVGYQALLWSFRL----WQWRDKTRSH 333 ++ S V + +L L + W++ S+ Sbjct: 288 DLTSKFVKFSCVLARLNLNLYYFIWKNAKPSN 319 >UniRef50_B0ZTM8 Putative uncharacterized protein n=1 Tax=Campylobacter jejuni RepID=B0ZTM8_CAMJE Length = 351 Score = 251 bits (641), Expect = 3e-65, Method: Composition-based stats. Identities = 72/324 (22%), Positives = 141/324 (43%), Gaps = 23/324 (7%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M+ +SVIIP+YN R C++S++ QT +EII+INDGSTDN +I YA Sbjct: 1 MVMQQPLVSVIIPIYNVEKYLRECLDSVVNQTLREIEIILINDGSTDNCGKICDEYATKD 60 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 + ++H+ NAG A N G+++A G+Y+ FV++DD + MYE L A +D D+ +C Sbjct: 61 KRIIVIHKENAGLGAAYNSGLDIAKGEYIGFVESDDWIEFNMYEELYFKAKNNDSDLVKC 120 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLT---GPDWLRMGLSSRR--WTHVVWMGVYRRDV 175 + + T E + + + ++ + + +W +Y + + Sbjct: 121 MF-YDYNSTREPKDKLYMQKHEDFELFNCAPDNKIFKIEDYPKLLIYHSSIWASLYHKTL 179 Query: 176 IVKNNIKFIA--GLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNL 233 I N ++F G +QD + E + A + ++ Y Y + S + R + Sbjct: 180 I--NRLRFQESKGATYQDFPFAIEALVLANKITTLRKAFYHYRQEPLNASSVKRTDQNLI 237 Query: 234 NYQRHYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALR----VCHAVRKEPDILTRQR 289 I L + ++ +F ++ A+ + ++ + L ++ Sbjct: 238 KMIDQCIYARTRL--------ISLGVFEKFKEEFYAHAISPLYGFYYQLQDDLKPLFFEK 289 Query: 290 MIAEIFTSGMYKRLITNVRSVKVG 313 ++ F + + + L + + Sbjct: 290 LVK-FFNNSLSENLSYKYFNQREK 312 >UniRef50_C3WCV9 Glycosyl transferase n=1 Tax=Fusobacterium mortiferum ATCC 9817 RepID=C3WCV9_FUSMR Length = 331 Score = 250 bits (640), Expect = 4e-65, Method: Composition-based stats. Identities = 91/342 (26%), Positives = 170/342 (49%), Gaps = 24/342 (7%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPH- 62 K+SVI+P+YN + C++S++ QT +EII+INDGSTDNS I + Y++ YP Sbjct: 2 QEIKVSVIVPIYNTEKFLKKCIDSIVNQTLEEIEIILINDGSTDNSHNICEEYSKKYPQK 61 Query: 63 VRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNA 122 +R ++ N G S +RN G+ +A G+Y+AFVD+DD + MY+ + A +++LD+ C Sbjct: 62 IRYINNKNIGCSASRNLGMSLAQGEYIAFVDSDDYIDKEMYQEMYEKAKKENLDIVVCGI 121 Query: 123 DWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIK 182 + +ET + SIP++ + + L + + +YR+++I NI Sbjct: 122 TYVNQETKKRISSIPSNISKKSDYLIYKNRMAN----------PVNKLYRKNIIK--NIL 169 Query: 183 FIAGLH-HQDIVWTTEFMFNALRARYTEQSLYKYYLH-NTSVSRLHRQGNKNLNYQRHYI 240 F H +DIV+ + + A + + E+S Y Y H N S+ L ++ + ++ YI Sbjct: 170 FPIDTHSGEDIVFCYKSILKAKKINHIEKSYYYYIYHGNNSIFNLEKRLGIFIAFKELYI 229 Query: 241 KITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMY 300 + +Y I +F++ + A+R + P +++ + E + Y Sbjct: 230 YLKD------NDYLKNKEIIKKFYENFNFYAIRGAFFMLLNPKLVSNEEY--EKYDKLFY 281 Query: 301 KRLITNVRSVKVGYQALLWSFRLWQWRDKTRSHHRITRSAFN 342 K L ++ + + L++ +R W + + + I + N Sbjct: 282 KEL-KKIKFLPFKSKVLVYYYRNIIWGIRKFNLYDILKKIKN 322 >UniRef50_D0T9V5 Glycosyltransferase family 2 n=1 Tax=Bacteroides sp. 2_1_33B RepID=D0T9V5_9BACE Length = 318 Score = 250 bits (640), Expect = 4e-65, Method: Composition-based stats. Identities = 74/274 (27%), Positives = 123/274 (44%), Gaps = 22/274 (8%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M+ +SVI+P+Y A + C++SL+ QT+ EI++I+DGS D+S +I YA Sbjct: 1 MDKQPAVSVIVPVYKAENYLCHCVDSLLAQTFPDFEILLIDDGSPDHSGDICDEYARKDS 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 VR+ H+ N G S R GI+ A G+Y D DD V P M E L A E+D DV C+ Sbjct: 61 RVRVFHKENEGVSSTRQCGIDNAKGEYTIHADPDDWVEPDMLEELYKKAKEEDADVVICD 120 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNI 181 + + + P+D L LR ++ + +R ++ Sbjct: 121 FYEDIGDKRKYIRQQPSD-------LDHKTVLRELF--QQLHGSCCNKLVKRACY-NGDV 170 Query: 182 KFIAGLHH-QDIVWTTEFMFNALRARYTEQSLYKY--YLHNTSVSRLHRQGNKNLNYQRH 238 +F+ L +D+ + + ++ Y ++ Y Y ++ S++R Sbjct: 171 RFVPDLTFCEDLCFNALILRKEIKIGYLPRAFYHYEQNVNAGSLTRNSSP--------EQ 222 Query: 239 YIKITRLLEKLNRNYADKIMIYPEFHQQITYEAL 272 Y+ + +LEK + AD MI P Y+A Sbjct: 223 YVVLMNILEK-GLDTADFQMIRPILIYSWAYKAF 255 >UniRef50_O87182 CpsIaI n=15 Tax=Streptococcus agalactiae RepID=O87182_STRAG Length = 333 Score = 250 bits (640), Expect = 5e-65, Method: Composition-based stats. Identities = 73/304 (24%), Positives = 144/304 (47%), Gaps = 13/304 (4%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M +S+IIP+YN+ + C++S++ QT +E+I+I+DGSTDNS EI ++ Sbjct: 1 MEKRILVSIIIPIYNSEAYLKECVQSVLQQTHPLIEVILIDDGSTDNSGEICDNLSQEDN 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 + + H+ N G S ARN G++ +TG+++ FVD+DD V P M E ++ + ++ D+A+ + Sbjct: 61 RILVFHKKNGGVSSARNLGLDKSTGEFITFVDSDDFVAPNMIEIMLKNLITENADIAEVD 120 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNI 181 D + + V + L+ LS R ++V +Y++ +I N+ Sbjct: 121 FDISNERDYRKKKRRNFYK-----VFKNNNSLKEFLSGNRVENIVCTKLYKKSIIG--NL 173 Query: 182 KFIAGL-HHQDIVWTTEFMFNALRARY-TEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHY 239 +F L +D+++ + + R T SLY Y + TS + +L++ + Sbjct: 174 RFDENLKIGEDLLFNCKLLCQEHRIVVDTTSSLYTYRIVKTSAMN-QKFNENSLDFITIF 232 Query: 240 IKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGM 299 +++ L+ NY + + + + + + R+ +I + Sbjct: 233 NEVSSLVPAKLANYVEAKFLREKIK---CLRKMFELGSNIDNKIKVQREIFFKDIKSYPF 289 Query: 300 YKRL 303 YK + Sbjct: 290 YKAV 293 >UniRef50_D2Q667 Glycosyl transferase, family 2 n=3 Tax=Bifidobacterium RepID=D2Q667_9BIFI Length = 327 Score = 250 bits (639), Expect = 5e-65, Method: Composition-based stats. Identities = 51/234 (21%), Positives = 102/234 (43%), Gaps = 5/234 (2%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M S +SV+IP+Y C+ S++ Q L+I++++DGS D+ ++ +A Sbjct: 1 MTSQTVVSVVIPVYGVESYLDRCIRSVVGQDHERLDIVLVDDGSPDDCPQMCDEWAARDS 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 +R++H+ N G + ARN G++V G V FVD+DD V L+ + D+ C+ Sbjct: 61 RIRVVHKKNGGLASARNAGLDVIAGDMVTFVDSDDYVESNYVSELLHWHEQSGADIVMCS 120 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIV-KNN 180 + + R G++T + + L VW ++ ++ Sbjct: 121 CMHDLEDGS----IVTDARCTPEGIMTSEEAMIQFLYHLSLAGPVWGKMFDARFFKSEHG 176 Query: 181 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLN 234 ++F GL+ +D + + + LY Y + S+ ++ + + + Sbjct: 177 VRFHEGLNSEDYYVLAQVFLQMNKIFVRMEPLYHYRIRQGSIVHMNSFNDHSCD 230 >UniRef50_A6NTT8 Putative uncharacterized protein n=1 Tax=Bacteroides capillosus ATCC 29799 RepID=A6NTT8_9BACE Length = 331 Score = 250 bits (639), Expect = 6e-65, Method: Composition-based stats. Identities = 75/320 (23%), Positives = 130/320 (40%), Gaps = 10/320 (3%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 K+SVIIP+Y ++S+++QT LE+I+++DGS D EI YA V Sbjct: 2 NIKISVIIPIYKVEAFLDKSIQSVLSQTHLNLELILVDDGSPDRCGEICDSYARQDKRVT 61 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPT-MYETLMTMALEDDLDVAQCNAD 123 ++HQ N G S ARN G++ ATG Y+ F+DADD E L+ DV Sbjct: 62 VIHQTNRGLSGARNAGLKAATGDYIVFLDADDYWDDNTALERLVECL---PADVVSFGMK 118 Query: 124 WCFRETGETWQS-IPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIK 182 + E G Q+ T + + +R L + + + +R + + + Sbjct: 119 HFWPENGRVVQNWSDTQNYQDIRGRAKYEQVRYMLENGCYIASACNKLIKRSLFSEQKLT 178 Query: 183 FIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKI 242 F G+ +DI W + A + Y+Y + S+S+ R N + + + Sbjct: 179 FREGVTSEDIDWCARLLIAAGTFSVSPCCFYRYRQRDGSISKSIRIKNI-YDLEENIKSC 237 Query: 243 TRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPD----ILTRQRMIAEIFTSG 298 ++L N + + Q L ++E +L+ R A + Sbjct: 238 VSYSKELEDNKELLELYWNYVSYQYGTLLLNAALVCKEEQKYAVSVLSNMREYAWLLKYH 297 Query: 299 MYKRLITNVRSVKVGYQALL 318 M +++ R +V +L Sbjct: 298 MNRKVNVLWRIYQVAGYGVL 317 >UniRef50_C0CHF0 Putative uncharacterized protein n=2 Tax=Blautia hydrogenotrophica DSM 10507 RepID=C0CHF0_9FIRM Length = 348 Score = 250 bits (639), Expect = 6e-65, Method: Composition-based stats. Identities = 82/343 (23%), Positives = 141/343 (41%), Gaps = 33/343 (9%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP-H 62 T K+S+I+P+Y CM+SL+ QT +EII +NDGS DNS+ I + Y + P Sbjct: 2 DTCKVSIIVPVYKVEQYLGQCMDSLVGQTLEEIEIIAVNDGSPDNSLAILQEYRDRDPEK 61 Query: 63 VRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNA 122 V + N G S ARN G+ A G+Y+ FVD+DD + + L A D D+ Sbjct: 62 VNVFSIENQGVSHARNYGVSKAKGEYLLFVDSDDYLEKNACQKLYEHAKLHDNDLVLFG- 120 Query: 123 DWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIK 182 + RE GET + L + ++ R + W + RR++ + Sbjct: 121 RYNLREGGETEKLY----LPVVEDFSLREYKEEM---ARLSPFPWDKLIRRELFQEIG-G 172 Query: 183 FIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKI 242 F G+ +D+ +A + + LY Y + ++ L H + I Sbjct: 173 FPEGIRFEDLPVAHMLAVSARKIGVVSECLYHYRVQAGFLNTL----------TEHTLDI 222 Query: 243 TRLLEKLNRNYADKIMIYPEFHQQITYEALRVC-------HAVRKEPDILTRQRMIAEIF 295 L+ L DK +Y E+HQ++ Y +R C ++ + + R++ ++ Sbjct: 223 VTALKILVNFLKDK-GLYQEYHQEVEYICVRHCCYRFVNLRKFTEKGKLDLQMRLVQLVY 281 Query: 296 T-----SGMYKRLITNVRSVKVGYQALLWSFRLWQWRDKTRSH 333 ++ R + VG + L+ + K Sbjct: 282 DFLESEFPKWRENPYVRRKMAVGMKHFLFVCERPKLMMKFVQK 324 >UniRef50_A0Q780 Glycosyl transferase, family 2 n=16 Tax=Francisella RepID=A0Q780_FRATN Length = 319 Score = 250 bits (638), Expect = 7e-65, Method: Composition-based stats. Identities = 69/330 (20%), Positives = 144/330 (43%), Gaps = 14/330 (4%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M ++++IP+YN + C++S+I QT+ LEII++NDGSTDNS+EI + YA+ Sbjct: 1 MKYCELITLVIPIYNIENYLGRCLDSVINQTYKDLEIILVNDGSTDNSLEICESYAKKDS 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 ++++++ N G S ARN G++ G YV F+D+DD V E L ++++ D++ N Sbjct: 61 RIKIINKNNGGLSSARNVGLDACKGDYVTFIDSDDWVSLDYIEILYKNIIDNNADISIIN 120 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNI 181 A + + + L + + ++ + +Y+ + N+ Sbjct: 121 AIKVKSQNNDFTLKEQKNLLHTY--FSSIEF----ALDNTLPVMACAKLYKTKLFE--NL 172 Query: 182 KFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIK 241 +F + +D +++A + T+ Y Y+ TS++ + K N + Sbjct: 173 RFTNSIIFEDEDIMYRLLYHANKIVCTDYIGYFYFQRPTSITSSKK---KRQNIIKSSDS 229 Query: 242 ITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEP---DILTRQRMIAEIFTSG 298 + +L K + + K I +F+ R P +++ +++ I Sbjct: 230 LIFVLTKKEQFFKGKTTIPYKFYIDSAGLLSRYYAKSFLYPFDKEMIKQRKKIKLFINKL 289 Query: 299 MYKRLITNVRSVKVGYQALLWSFRLWQWRD 328 + K+ + + + ++D Sbjct: 290 LQNTKSIETFRYKIKRNVIKYFVFFFLFKD 319 >UniRef50_D0R4M4 Glycosyltransferase n=1 Tax=Lactobacillus johnsonii FI9785 RepID=D0R4M4_LACJF Length = 339 Score = 249 bits (637), Expect = 8e-65, Method: Composition-based stats. Identities = 70/325 (21%), Positives = 139/325 (42%), Gaps = 18/325 (5%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 + N T ++S+I+P+YN C+ S+I QT+ E+I++NDGS DNS +I + YA+ Y Sbjct: 3 LNNGTPEISIIVPVYNVEKWIENCIRSVIAQTFNNWELILVNDGSLDNSQQICEKYAKKY 62 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 + L+++ N G S ARN GI+ A GKY+AFVD DD + L T + + +A C Sbjct: 63 SKIHLINKENGGLSDARNFGIKKAKGKYLAFVDGDDYIDSNYLSKLHTAIISSNASIAVC 122 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRR-WTHVVWMGVYRRDVIVKN 179 + T+ +++G +LR+ V W +Y++D+ N Sbjct: 123 GYKEV-DNKKKILSLKRTNDFFVQKLVSGQTFLRILAEKNCTLCVVAWNKLYKKDLFKNN 181 Query: 180 NIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHY 239 + G H+D +++ + + LY Y S+ L+ + Sbjct: 182 --SYKKGRLHEDEFIIAPLVYDVSKIALVDDELYNYVQRAGSIMSSKMTQKNLLDANDAF 239 Query: 240 IKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCH---------AVRKEPDILTRQRM 290 ++ ++ + + HQ + + + + + + + + Sbjct: 240 VERLNFY----KSNKNMKLFDLTVHQYLLWILSVMKYPKKVLTKKGRMILQNNFKQYSKS 295 Query: 291 IAEIFTSGMYK-RLITNVRSVKVGY 314 + ++K ++ R++K+ + Sbjct: 296 QKNKSNTNIFKFQMKLGERNIKLAW 320 >UniRef50_B0ABV6 Putative uncharacterized protein n=1 Tax=Clostridium bartlettii DSM 16795 RepID=B0ABV6_9CLOT Length = 329 Score = 249 bits (637), Expect = 8e-65, Method: Composition-based stats. Identities = 71/275 (25%), Positives = 130/275 (47%), Gaps = 14/275 (5%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M K+SVI+P+YN R C++S++ Q +T+ E+I +NDGSTDNS+EI + Y + Sbjct: 1 MNTINPKISVIVPVYNTSTYLRRCLDSILEQDFTSYEVICVNDGSTDNSLEILREYEKKS 60 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 +++++Q N+G + RN ++ A G Y+AF+D+DD V L A++ D+ C Sbjct: 61 EKIKVINQVNSGVAKTRNTALKHAKGDYLAFLDSDDFVRENYLSRLYDAAIDTRSDIVIC 120 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN 180 N + + + P GV D L+ + +W +++R++ N Sbjct: 121 NFYRYYE---QINLAKPVFYKFRRGVFNKYDILKGLIPDNLIHSYLWNKLWKREIFEDFN 177 Query: 181 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYI 240 F + +D+ ++ ++ A + +LY Y + TS+ R N L Q Y+ Sbjct: 178 -SFPD-MKFEDLAIMSQLIYKADKIAVINDALYYYRIRKTSIVR-----NICLQTQNDYM 230 Query: 241 KITRLLEKLNRNYADKIMIYPEFHQQITYEALRVC 275 K L+ ++ + Y + Y +L+ Sbjct: 231 KAYALIRLFLKDTGE----YSRLKRYYFYFSLKAY 261 >UniRef50_O86893 Putative glycosyl transferase n=1 Tax=Streptococcus pneumoniae RepID=O86893_STRPN Length = 323 Score = 249 bits (637), Expect = 9e-65, Method: Composition-based stats. Identities = 76/326 (23%), Positives = 143/326 (43%), Gaps = 10/326 (3%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +SVI+P+YN D R ++SL+ QT+ E+I++NDGSTDNS EI Y + Y ++ + H Sbjct: 2 ISVIVPVYNVADYLRFALDSLLEQTYKDFEVILVNDGSTDNSGEICDEYGKLYDNIHVFH 61 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 + N G S ARN G+E + G+++ F+D+DD P E L+T+ + D+D+ + Sbjct: 62 KKNGGLSDARNFGLEKSRGEFITFLDSDDYFEPYALELLITIQKKYDVDIVSTKGGITYS 121 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 + + + D L + +LT ++L + T W +Y+RD+ F G Sbjct: 122 HDIYSKKLMAEDYL-TVKILTNKEFLAAVYYNDEMTVSAWGKLYKRDLFKTI---FPKGK 177 Query: 188 HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRLLE 247 ++D+ E + N +T+ +Y YY S+ ++ ++ +++ Sbjct: 178 IYEDLYVVAERLLNIKTVAHTDLPIYHYYQRQGSIVN-STFSDRQYDFFDAIDHNEAIIK 236 Query: 248 KLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLITNV 307 K + + + I L + DI R+I + + + I Sbjct: 237 KFYCGDKELLAALNA-KRVIGSFILSNSAFYNSKNDITKIIRIIKPYYWEVIKNKKIPMK 295 Query: 308 RSVK----VGYQALLWSFRLWQWRDK 329 R V+ + + + + Sbjct: 296 RKVQCVLFLLSPNYYYKIKDKMLQRG 321 >UniRef50_C7H2S4 Glycosyl transferase, group 2 family n=3 Tax=Clostridiales RepID=C7H2S4_9FIRM Length = 572 Score = 249 bits (636), Expect = 1e-64, Method: Composition-based stats. Identities = 65/275 (23%), Positives = 118/275 (42%), Gaps = 15/275 (5%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 K+S+I+P+YNA C++S++ Q + E+I+++DGS D S EI YA V Sbjct: 76 EPKVSIIVPVYNAEKSLARCVDSILNQEFRDFELILMDDGSKDRSGEICDGYARADARVV 135 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++H+ N G S RN+ I A G ++ FVD+DD + + ++ A E D+ + Sbjct: 136 VVHKENTGVSDTRNQAIARARGTFLQFVDSDDWLTADATKLMVRAAEETGCDMVIADFYR 195 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 E I D++ G M + + V+W +YRR ++ + I+ Sbjct: 196 VVGEMVSQKGDIDADQVIGREAFVG---FMMENPADYYYGVLWNKLYRRSLVEAHGIRMD 252 Query: 185 AGLHH-QDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKIT 243 A L +D ++ E++ A +Y Y S+ N+ +++ R Sbjct: 253 AKLSWCEDFLFNLEYVRYATTFYALRTPVYYYVKTKGSLV------NQKISFARTVEMKL 306 Query: 244 RLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAV 278 + E N + + + L+V H + Sbjct: 307 AMFECYNDFFKHVLD-----EDAYERKRLQVYHFL 336 >UniRef50_A5ZCV6 Putative uncharacterized protein n=1 Tax=Bacteroides caccae ATCC 43185 RepID=A5ZCV6_9BACE Length = 321 Score = 249 bits (636), Expect = 1e-64, Method: Composition-based stats. Identities = 77/313 (24%), Positives = 143/313 (45%), Gaps = 27/313 (8%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +SVI+P+YN C++S+I Q++ LEII+++DGS D EI Y+ + ++H Sbjct: 2 ISVIVPIYNVERYINKCIDSIIHQSYKNLEIILVDDGSPDKCGEIIDEYSRIDKRIHVIH 61 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 ++N G S ARN G+++A+G ++ FVD+DD + PTMYE +++ ++ LD+ +C + Sbjct: 62 KSNGGLSSARNAGLDIASGDFIGFVDSDDWIEPTMYEEMLSFMEKEQLDLVECGINLVTN 121 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWT------HVVWMGVYRRDVIVKNNI 181 + P+ VL+G D L L S+ T VW +++++ N Sbjct: 122 NISNKYIEKPS------IVLSGKDALIRHLDSKNRTVQSLPRTAVWSKLFKKEFWKTN-- 173 Query: 182 KFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYL-HNTSVSRLHRQGNKNLNYQRHYI 240 +F G H+D + T + ++ A + L + + S+ ++ Sbjct: 174 RFPVGEIHEDYLLTCKALYEAKSVGLLRRGLLNHLTDNPNSIVNTKFNSKDLYKGKQLEY 233 Query: 241 KITRLLEK---------LNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMI 291 +IT L EK + Y + Q E ++ ++++ I+ + Sbjct: 234 RITYLKEKKEPTLTNLAMISYYGYIVSAIWRCDQNGMPERDKLIAKIKEDKKIIFSLNL- 292 Query: 292 AEIFTSGMYKRLI 304 F + RLI Sbjct: 293 --PFKRKIDARLI 303 >UniRef50_B9YAB2 Putative uncharacterized protein n=1 Tax=Holdemania filiformis DSM 12042 RepID=B9YAB2_9FIRM Length = 320 Score = 248 bits (635), Expect = 1e-64, Method: Composition-based stats. Identities = 64/269 (23%), Positives = 119/269 (44%), Gaps = 24/269 (8%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 LSVI+P+YN C++SL+ QT +EII+INDG+ DNS +I YA YP +R Sbjct: 2 KPILSVIVPVYNVEKYLPRCLDSLVNQTLKNMEIIVINDGTKDNSQQIIDDYASRYPQIR 61 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 + N G + RN G+ G+Y F+D+DD M+E ++ A +++ DV N +W Sbjct: 62 PFKKPNGGIAETRNFGLAQVQGEYFGFLDSDDFTELDMFEKMVAAAEKENADVVVSNFNW 121 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 + E + P G + + + +W +YR D + + ++F Sbjct: 122 VYGEQKRLEKEGPYA--------PGKEMMIHLFA------TLWNKIYRTDFVRETGLQFP 167 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 G ++D + + + + +++ Y S++ + K++ + I Sbjct: 168 TGYRYEDACFLYCLAPHVKKLAFVDEAFVSYIQLGNSITHTNNHEVKDMIHVFEIIT--- 224 Query: 245 LLEKLNRNYADKIMIYPEFHQQITYEALR 273 +Y E+H ++ Y ++ Sbjct: 225 -------DYFKAQGWDQEYHDELEYLHIK 246 >UniRef50_Q5LZN9 Exopolysaccharide biosynthesis protein, sugar transferase n=2 Tax=Streptococcus thermophilus RepID=Q5LZN9_STRT1 Length = 324 Score = 248 bits (634), Expect = 2e-64, Method: Composition-based stats. Identities = 62/310 (20%), Positives = 120/310 (38%), Gaps = 23/310 (7%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M + +S++IP+YN C++S+ QT+ E+I++NDGSTD+S+ I + + Sbjct: 1 MYLKSLISIVIPVYNVEKYLEKCLQSVQNQTYNNFEVILVNDGSTDSSLSICEKFVNQDK 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 + + N G S ARN GI+ A G ++ FVD+DD + L+ + C+ Sbjct: 61 RFSVFSKENGGMSSARNFGIKKAKGSFITFVDSDDYIVKDYLSHLVAGIKSETS--IVCS 118 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNI 181 + E G + +S V + L + VW +Y I Sbjct: 119 KFFLVDEKGSLLTKKEAPKKKSEVVSIEESIKILLLQQNGYDLAVWGKLYPVSFFET--I 176 Query: 182 KFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIK 241 F G ++D+ T + + A + + Y Y S+ N ++ + Sbjct: 177 SFPEGKLYEDMGTTYKLLKLASEVVFLDAYDYAYVQRPNSIM--------NSSFNLKKLD 228 Query: 242 ITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYK 301 I ++ ++ D + +P + A ++ + E K Sbjct: 229 IIEMVHEME---NDILAQFPNLALYVKNRAFAA--------EVKIFLEIPKEKEFEQAQK 277 Query: 302 RLITNVRSVK 311 +L +++ + Sbjct: 278 QLWHDIKKNR 287 >UniRef50_C7D258 TagF domain-containing protein n=4 Tax=Enterococcus RepID=C7D258_ENTFA Length = 332 Score = 248 bits (634), Expect = 2e-64, Method: Composition-based stats. Identities = 75/338 (22%), Positives = 142/338 (42%), Gaps = 20/338 (5%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 +SV+IP+YN C++S+I QT+ LEIII+NDGSTDNS+ + + + ++ Sbjct: 1 MKLVSVVIPVYNVEKYVEKCLDSVINQTYQNLEIIIVNDGSTDNSLSVCQKKKLSDSRIK 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 L+++ N G S ARN GIE A G+++ F+D+DD + E L+ ++D++ Sbjct: 61 LINKENGGLSSARNAGIECAQGEFICFIDSDDWIELDYIEVLLNGMENTNVDISVIQMIK 120 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 +QS + + +R SS + +YR + I++ Sbjct: 121 VKDFNKIAFQS---ESQTKWDIFERETAMRELFSSNLIGYSANNKLYRISLFKS--IRYP 175 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 G+ +D T + ++ + + Y YYL + S+ R KN + + +I Sbjct: 176 EGMLMEDKGTTYRLIDSSTKVAVNGSTKYHYYLRDNSILRT-DFNQKNFDSFIIHEEILN 234 Query: 245 LLEKLNRNYADKIMIYPEFHQQITYEAL----RVCHAVRKEPDILTRQRMIAEIFTSGMY 300 +++ I P+ + YEA+ R+ + T+ I E + S + Sbjct: 235 FMDEH------YPSISPKVKSRYVYEAIRMMMRMIESNYTSKLDYTKCMDIVEKYQSEVL 288 Query: 301 KRLITNVRSVKVGYQALLWSFRLWQWRDKTRSHHRITR 338 +V+ + + + + I + Sbjct: 289 NDNKISVKLKLI----TKLLVKFPSLPSRISKNKLIAK 322 >UniRef50_B1R2U2 Glycosyl transferase, group 2 family protein n=4 Tax=Firmicutes RepID=B1R2U2_CLOBU Length = 353 Score = 248 bits (634), Expect = 2e-64, Method: Composition-based stats. Identities = 67/251 (26%), Positives = 125/251 (49%), Gaps = 9/251 (3%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S+I+P+YN C++S++ QT+ E+I+++DGSTD S +I + + ++++H Sbjct: 4 ISIIVPIYNVEKYVEKCVDSILKQTYINFELILVDDGSTDMSGKICDKFKKEDDRIKVIH 63 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 + N G S ARN G+++ATGKY+ F+D+DD + TMYE + A+EDD D+ C+ + Sbjct: 64 KKNGGLSSARNEGVKLATGKYIGFIDSDDYIDKTMYEKMFNKAIEDDSDIVICDMTYNLN 123 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 + I T GVL D + L + + +Y+R++ + F G Sbjct: 124 G-----KDISTTIFEDFGVLNREDTILKYLENDYFRSHAQNKLYKRELFEY--VSFPEGK 176 Query: 188 HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRLLE 247 ++D+ + + N + + + LY Y N + + NL + ++ Sbjct: 177 LYEDVATFYKLIHNCNKISFVNEKLYYYNQSNYNSITKKKFNINNLQLILNTEEMKDFF- 235 Query: 248 KLNRNYADKIM 258 + NY +I+ Sbjct: 236 -IRNNYTSEII 245 >UniRef50_C8WM64 Glycosyl transferase family 2 n=1 Tax=Eggerthella lenta DSM 2243 RepID=C8WM64_EGGLE Length = 333 Score = 248 bits (633), Expect = 3e-64, Method: Composition-based stats. Identities = 79/276 (28%), Positives = 123/276 (44%), Gaps = 14/276 (5%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 +SVIIP+YN C+ES+I QT++ +EI+++NDGSTD S YA + V Sbjct: 2 NPLVSVIIPVYNVESYVSECIESVIAQTYSNIEIVVVNDGSTDGSGFSCDQYACSDSRVV 61 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++H+ N G S ARN GI V G +VAFVD DD V P ETLM D ++A Sbjct: 62 VVHKENEGLSAARNAGIAVCRGDFVAFVDGDDFVSPVFIETLMHAIEVCDCEIAAIPCGT 121 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 F + + V+ ++ L R T V W +Y R ++ + F Sbjct: 122 AFEDGSSCELVAKAAFIPDARVMDSYTVQKLMLYQRLDTGVPW-RLYARRILG--DAPFA 178 Query: 185 AGLHHQDIVWTTEFMFNALRARYTE-QSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKIT 243 GL+++D+ +F+ + R + ++LY Y L + + ++ IK Sbjct: 179 VGLYYEDLASVYKFIHDVDRVALVDCRALYAYRLRKSGI----------ISQAYSPIKAL 228 Query: 244 RLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVR 279 +E R +D YP+ +C V Sbjct: 229 SAIEVSRRLSSDMQEWYPDLAVASASRCFSLCRMVY 264 >UniRef50_B0P2L5 Putative uncharacterized protein n=1 Tax=Clostridium sp. SS2/1 RepID=B0P2L5_9CLOT Length = 323 Score = 248 bits (633), Expect = 3e-64, Method: Composition-based stats. Identities = 71/322 (22%), Positives = 141/322 (43%), Gaps = 26/322 (8%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 +SVI+P+Y D C++SL+ QT+ LEII+++DGS D+ I + + ++ Sbjct: 2 EELISVIVPIYKVEDYLDRCIQSLVNQTYKNLEIILVDDGSPDDCPRICDEWGKKDQRIK 61 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++H+ N G S ARN G++V TG YV+++D+DD + MY+ +M + D+ +C Sbjct: 62 VIHKENGGLSDARNAGMKVMTGAYVSYIDSDDWIEENMYQMMMNAIEVNKADICEC---- 117 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 F +T + + ++ L + + VVW +Y+R+V+ ++ F Sbjct: 118 AFEKTSGSVKKEDQKNDDKVSIMDKRSALMAVVEEK-IQPVVWNKLYKREVVK--DLWFE 174 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 G +++D +T + + A + Y Y+ S+ R Sbjct: 175 VGKYNEDEFFTYKAIERADKIIQISDIGYYYFFREDSIINET--------------YTIR 220 Query: 245 LLEKLNRNYA--DKIMIYPEFHQQITYEALRVC--HAVRKEPDILTRQRMIAEIFTSGMY 300 L+ L Y + YPE ++ + + +C H + + ++ + +Y Sbjct: 221 RLDGLEARYERMKYLGKYPEIYRVAKRQLIFMCMYHYQKGMRYLSGKELEDLKKRVKEIY 280 Query: 301 KRL-ITNVRSVKVGYQALLWSF 321 + + I K + +W + Sbjct: 281 RDIPIDKTDIRKYSSKEKIWYY 302 >UniRef50_UPI0001968EF9 hypothetical protein BACCELL_04174 n=1 Tax=Bacteroides cellulosilyticus DSM 14838 RepID=UPI0001968EF9 Length = 328 Score = 247 bits (632), Expect = 3e-64, Method: Composition-based stats. Identities = 65/229 (28%), Positives = 112/229 (48%), Gaps = 4/229 (1%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 +SVI+P+YN C++S++ QT+ A E+++++DGS D+S I YA+ V ++ Sbjct: 2 LVSVIVPVYNVEKFIHQCIDSILCQTYKAFEVVLVDDGSLDSSGIICDEYAKKDCRVSVI 61 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADD-EVYPTMYETLMTMALED-DLDVAQCNADW 124 H+ N G S ARN G+E A G+YV FVD+DD + E L+ E D N + Sbjct: 62 HKENGGLSDARNVGLECAKGEYVIFVDSDDFWIDNCQLELLIAEIEETPSCDFIGFNCCY 121 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 TGE + + + + + + + + + +R ++ NNI+FI Sbjct: 122 YNSSTGE-FSPWFSYSDEIMKMTERDVLITNLVCTGVFPMSACLKIIKRSFLMNNNIRFI 180 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLH-NTSVSRLHRQGNKN 232 G+ +DI W E + + + R Q +Y Y S++R + + N Sbjct: 181 KGIVCEDIPWFIELLHYSHKFRVVNQYMYAYRQGVQGSITRNFSEKSFN 229 >UniRef50_B2ILQ1 Ss-1,4-galactosyltransferase n=6 Tax=Streptococcus pneumoniae RepID=B2ILQ1_STRPS Length = 333 Score = 247 bits (632), Expect = 3e-64, Method: Composition-based stats. Identities = 66/251 (26%), Positives = 121/251 (48%), Gaps = 8/251 (3%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 + +S+++P+YN + +ES++ QT+ LE+++++DGSTD+S EI + + +R Sbjct: 17 EDLVSIVVPVYNVEKYLKKSIESILNQTYDNLEVLLVDDGSTDSSGEICDSFIKVDSRIR 76 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 + H+ N G S ARN GIE G+YV+F+D DD + L ++ +V+ C+ Sbjct: 77 VFHKENGGLSDARNFGIEHMKGQYVSFIDGDDYISKDYVWKLYHSLKNNNSEVSICSFSL 136 Query: 125 CFRETGETWQSIPTDRLRSTGV-LTGPDWLRMGLSSRRWTHVV-WMGVYRRDVIVKNNIK 182 ET + I + L S V L+G L L++ + +VV W +YR + K +K Sbjct: 137 V----DETGEKIKDELLDSGEVSLSGQQILEKALTADGYRYVVAWNKLYRSTLFEK--LK 190 Query: 183 FIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKI 242 F G+ ++D ++ R E+ LY Y S+ + + K + + Sbjct: 191 FKKGMLYEDEFLNYPLFWDCKRVSIVEEPLYLYVQRKGSIIQSNMTLEKIKMKDKMHTSR 250 Query: 243 TRLLEKLNRNY 253 + ++ Sbjct: 251 IEFYAEKKNSF 261 >UniRef50_Q8XN54 Capsular polysaccharide biosynthsis protein n=1 Tax=Clostridium perfringens RepID=Q8XN54_CLOPE Length = 330 Score = 247 bits (631), Expect = 5e-64, Method: Composition-based stats. Identities = 64/250 (25%), Positives = 110/250 (44%), Gaps = 11/250 (4%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 ++S+I+P+YN C++S++ Q++T E+I++NDGS D S I Y +R Sbjct: 2 KPEISIIVPIYNVEKYLSKCIDSILNQSFTNFELILVNDGSNDRSGIICDEYTLKDERIR 61 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++H+ N G S ARN GI +A GKY+ FVD DD + MYE L + + D++ C Sbjct: 62 VIHKKNGGVSSARNSGISLALGKYIGFVDPDDYIKEEMYEILYNLCKKYKTDISICKLGR 121 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 T I + L + L+ + + +++R+ + F Sbjct: 122 EINGT-----LINNEENGEIIELDNVNGLKELFKGILYRFSLCNKLFKRECFR--GVIFP 174 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 G H+D+ T NA ++ Y + Y Y S+ + + +I Sbjct: 175 EGRIHEDLSTTYRLFANAKKSIYFDYVGYIYVKRENSILTTTYNEKR----LQAFIGWNE 230 Query: 245 LLEKLNRNYA 254 +LE + Y Sbjct: 231 ILEFMINKYP 240 >UniRef50_D0ANJ0 Putative uncharacterized protein n=1 Tax=Enterococcus faecium C68 RepID=D0ANJ0_ENTFC Length = 351 Score = 247 bits (631), Expect = 5e-64, Method: Composition-based stats. Identities = 75/297 (25%), Positives = 133/297 (44%), Gaps = 15/297 (5%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K+SVI+P+YN R C++SL++QT +EI++++DGS D S EIA Y + Y +V+ + Sbjct: 4 KVSVIVPIYNVEKYLRKCVDSLLSQTLKEIEIVLVDDGSPDASGEIADEYQKKYSNVKTI 63 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 H+ N G ARN GIE ATG+YVAF+D+DD V MYE L +A +++ D+ Sbjct: 64 HRENGGLGPARNTGIENATGEYVAFLDSDDWVESDMYEKLYLVASKENADIVVSGHCDFA 123 Query: 127 RETGETWQSIPTDRLRSTGVLTGPDWLRMGL-------SSRRWTHVVWMGVYRRDVIVKN 179 + + P + + + + V M +Y ++++ K Sbjct: 124 NDKAMVKKQHPLSGNVYDTPNSIKNVRKNLFGHSPKDQEVEAFPMSVCMSLYNKNLLDKY 183 Query: 180 NIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHN-TSVSRLHRQGNKNLNYQRH 238 ++F + +D ++ A + +TE + Y Y S+++ NK ++ Sbjct: 184 RLRF-EKILSEDTIFNLNAYDVANKIAFTEYTDYCYRKEEQESITK-SLSPNKQTEFKEF 241 Query: 239 YIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIF 295 I + + K + + I Y L V I +++ I + Sbjct: 242 LITLKAIASKEDLECQIRAK-----RMAIDYCRLYVGLVDDANLGIKEKRKYIKKFI 293 >UniRef50_B2UQ58 Glycosyl transferase family 2 n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UQ58_AKKM8 Length = 384 Score = 247 bits (631), Expect = 5e-64, Method: Composition-based stats. Identities = 68/222 (30%), Positives = 104/222 (46%), Gaps = 3/222 (1%) Query: 6 NKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRL 65 +S+I+P YN +C+ESL+ QT +EII INDGSTD + + E + L Sbjct: 3 PDVSIIVPCYNVAAYVDSCLESLVRQTLRNIEIICINDGSTDETWTHLLRWKEKDSRIIL 62 Query: 66 LHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWC 125 L+Q NAG S ARN G++ A G YV F D DD + P MY L + ALE D D+ +C Sbjct: 63 LNQRNAGVSAARNAGLDAARGLYVGFADPDDYMDPEMYSRLFSAALEYDADIVECGNHVF 122 Query: 126 FRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIA 185 + ++ R P + VW ++R+ ++ + ++F Sbjct: 123 EDSSDRIIEAKRRSPSRHFEENASPASFFRDSIWGKMDICVWSKLFRKSMLDAHRLRFNV 182 Query: 186 GLH--HQDIVWTTEFMFNALRARYTEQSLYKYY-LHNTSVSR 224 L +D + + +A R + LY Y + N S+SR Sbjct: 183 HLKSGAEDETFRLMAVPHASRLLFIPDCLYYYRLMRNGSLSR 224 >UniRef50_A7V2P1 Putative uncharacterized protein n=3 Tax=Bacteroidales RepID=A7V2P1_BACUN Length = 348 Score = 247 bits (631), Expect = 5e-64, Method: Composition-based stats. Identities = 78/331 (23%), Positives = 144/331 (43%), Gaps = 21/331 (6%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M + K+S+I+P+YN C+ S++ QTW LE+I++ND + DNS+EIA+ ++P Sbjct: 1 MIESMKVSIIVPVYNVAKYIERCLLSVLDQTWQDLEVILVNDCTPDNSMEIARRVVASHP 60 Query: 62 HVRLL----HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDV 117 ++ H+ N G S ARN GI + G Y+ F+D+DD + E L A++ D Sbjct: 61 RGTVVRCLEHEENRGLSAARNTGISASVGDYLYFLDSDDYISANAIELLADAAVQKRPDF 120 Query: 118 AQCNADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIV 177 N + TG + P TG G + +W + W + R +++ Sbjct: 121 VIGN----YEVTGARRWAPPLSL--GTGFYEGNALVLSTYVQGKWYVMAWNKLVSRPLVL 174 Query: 178 KNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKN----- 232 ++ + F G+ H+D +W+ + + ++ Y Y + S+ R N Sbjct: 175 QHKLYFQEGIVHEDDLWSFKLACMSQSMYVVNETTYYYSMQPDSIMRAPSMRNLECRVLV 234 Query: 233 LNYQRHYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIA 292 L Y +I+ +R L+ Y + ++ +I Y + ++ R + A Sbjct: 235 LGYIYDFIRSSRCLQDNRLIYIYFESLKAKYFDRILYFTKDTSFHYQSY--LVFRNKKYA 292 Query: 293 EIFT----SGMYKRLITNVRSVKVGYQALLW 319 + +K ++ N+ V Y L+ Sbjct: 293 SLLEMTGLRPEWKLMLQNIHYVLPTYAGYLY 323 >UniRef50_C4G9K6 Putative uncharacterized protein n=1 Tax=Shuttleworthia satelles DSM 14600 RepID=C4G9K6_9FIRM Length = 445 Score = 247 bits (631), Expect = 5e-64, Method: Composition-based stats. Identities = 67/264 (25%), Positives = 119/264 (45%), Gaps = 8/264 (3%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M K+S+IIP+YNA R C++S++ Q +T E+++++DGS+D S I YA Sbjct: 1 MGPNPKVSIIIPVYNAASALRRCLDSVLKQEFTDFELLLMDDGSSDESPAILDEYAGKDS 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 +RL+H+ N+G S RN+ +++A G YV F+DADD + P + ++ + D+ + Sbjct: 61 RIRLVHKNNSGVSDTRNQALDLARGTYVQFLDADDWIAPEATKLMVRAMEQKKADLVVTD 120 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNI 181 E I D L G M + + V+W +Y R +I +N I Sbjct: 121 FYRVVGEHVSHKGDIEEDSLMDLQTFAGH---MMENPADFYYGVIWNKLYSRQIIEENQI 177 Query: 182 KFIAGLHH-QDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQ---- 236 + + +D ++ E++ + + +Y Y S+ K + + Sbjct: 178 RMDPEVSWCEDFLFNMEYLLHCRGIFALQVPVYYYVKTEGSLVSQGLSITKTVEMKASVF 237 Query: 237 RHYIKITRLLEKLNRNYADKIMIY 260 R+Y + A +I IY Sbjct: 238 RYYNDFFARVYDPEDYEARRIQIY 261 >UniRef50_A6BH38 Putative uncharacterized protein n=1 Tax=Dorea longicatena DSM 13814 RepID=A6BH38_9FIRM Length = 313 Score = 247 bits (630), Expect = 6e-64, Method: Composition-based stats. Identities = 74/306 (24%), Positives = 143/306 (46%), Gaps = 14/306 (4%) Query: 6 NKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRL 65 NK+SVI+P++N + C++S+I QT+ LE+I+++DGSTD S + + +++ Sbjct: 4 NKISVIVPVFNVENYLEKCVKSIIEQTYKNLEVILVDDGSTDKSGFLCDELKKQDYRIKV 63 Query: 66 LHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWC 125 +H+ N G S ARN GI+V+TGKY++F+D+DD + T E +M L ++A CN Sbjct: 64 IHKTNGGLSDARNAGIQVSTGKYLSFIDSDDYLERTALEQMMQAILISHSEIAICNIMRF 123 Query: 126 FRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIA 185 + + PTD + D L V ++ + NI F Sbjct: 124 YDDGCTEEFYNPTDNQEVLEGIRRFDTLNQ--------PSVCNKLFDAKLFA--NISFPY 173 Query: 186 GLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRH-YIKITR 244 G ++D E ++ A + T ++ Y Y S+ + N+ ++ Y++ Sbjct: 174 GKFYEDTYVYHELLYKADKVVLTGKTGYWYLSRKGSILGRSQFSNRYFDFIEAVYMRADF 233 Query: 245 LLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLI 304 L+++ + Y D+ + + + + H + K+ D + +++ YK+ Sbjct: 234 LIKRDIQPYGDEA--FLSLYAALANAEKNIDHKLNKD-DFHKVREYYRQVYNQIDYKKAR 290 Query: 305 TNVRSV 310 N++ + Sbjct: 291 FNIKQI 296 >UniRef50_A2STD8 Glycosyl transferase, family 2 n=1 Tax=Methanocorpusculum labreanum Z RepID=A2STD8_METLZ Length = 346 Score = 247 bits (630), Expect = 7e-64, Method: Composition-based stats. Identities = 79/308 (25%), Positives = 141/308 (45%), Gaps = 12/308 (3%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M + +SVI+P+YN +TC++S++ QT+ LEII++ S DNSV I Y + Sbjct: 1 MNSEVPLISVIVPVYNVEPYLKTCIDSILKQTYRNLEIILVASTSIDNSVVICDSYLQVD 60 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 +++LH G S ARN G+ +A+G Y+ FVD+DD + ETL ++ + D D+ QC Sbjct: 61 SRLQVLHTEPQGLSAARNIGVSMASGTYLCFVDSDDFISQNFVETLYGLSQKYDADIVQC 120 Query: 121 NADWCFRETG----ETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVV-WMGVYRRDV 175 N E+ SI D L + + D + + W +YR+++ Sbjct: 121 NFLEISENATYSICESCTSIFNDALE-IHIYSNVDMCANLYNDLADSSTASWTKLYRKEL 179 Query: 176 IVKNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHR--QGNKNL 233 + N+K+ G H+D + + + A R T LY Y + S++R + +L Sbjct: 180 FM--NVKYPIGKIHEDEGTSYKLFYLAKRVAVTSLKLYYYRSNPLSLTRATYTLKKQDSL 237 Query: 234 NYQRHYIKITRLLEKLNRNYADKIMIYP-EFHQQITYEALRVCHAVRKEPDILTRQRMIA 292 + + R ++ YA + Y + I + + R + ++ + + I Sbjct: 238 QFSEEQMDFFRE-GGYDKLYALALKKYMISLSEHILLTKVYLSKKERNDTELEIKYKAIY 296 Query: 293 EIFTSGMY 300 +I +Y Sbjct: 297 KIAAKNLY 304 >UniRef50_Q93DZ7 Eps1I n=1 Tax=Streptococcus thermophilus RepID=Q93DZ7_STRTR Length = 316 Score = 246 bits (629), Expect = 7e-64, Method: Composition-based stats. Identities = 80/309 (25%), Positives = 145/309 (46%), Gaps = 15/309 (4%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 +S+I+P YN RTC+ES++ QT+ +E+II+NDGSTD S+ + ++ +++ Sbjct: 2 NPLISIIVPTYNVEKYIRTCIESILAQTYRNIEVIIVNDGSTDQSLAVISDLICSHHNIK 61 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 +++Q N G SVARN GI+ ATGKY+AFVDADD++ P +L +A + D+ + + Sbjct: 62 VINQKNQGLSVARNTGIDAATGKYIAFVDADDKIKPDFVSSLYQIADKTGADIVRGSFRD 121 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGP-DWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 +IP + V T + S + VVW +YR D I N+I+F Sbjct: 122 FNG-------NIPKGWVPDFNVPTNYGTIVLDQFLSSNISFVVWSSIYRLDFINSNHIRF 174 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYL-HNTSV--SRLHRQGNK-NLNYQRHY 239 G+ +D +T A + + Y Y + S+ ++ + K +L+ ++ Sbjct: 175 TPGILFEDADFTIRAYMLAKLVATSPEPNYAYRINRPGSILTTKTTKNAQKMSLSEEKII 234 Query: 240 IKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQ---RMIAEIFT 296 + +L+ + +++ + + + V + + + +I EI Sbjct: 235 SQFISMLKHEKSDVLCSLILKSIYACMRDWTGIIVRNNLSLDRKNSCFDTALTLIKEIIN 294 Query: 297 SGMYKRLIT 305 S K I Sbjct: 295 SRPLKEKIK 303 >UniRef50_A8FHL3 Glycosyltransferase n=2 Tax=Bacillus pumilus RepID=A8FHL3_BACP2 Length = 340 Score = 246 bits (629), Expect = 8e-64, Method: Composition-based stats. Identities = 75/323 (23%), Positives = 132/323 (40%), Gaps = 16/323 (4%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 +S+++ +YN C++S++ Q+++ +E+I++NDGSTD S E + +A R Sbjct: 1 MPAISLLVAVYNTSQYLEQCLQSIVDQSFSDIEVILVNDGSTDRSGEYLEAFAAQDQRFR 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++HQ N G RNRGIE A G Y+AF+D+DD + P E L A D+ Sbjct: 61 VIHQTNQGLGAVRNRGIEEAKGTYIAFIDSDDVLAPHYCEALYEKAKMTGADLVISEYWI 120 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 F ++ T IPT L L L T W +YRR I +++I+F Sbjct: 121 QFEQSKRT---IPTTLLAERS--AEKTSLIEALLHGDITGFSWNKLYRRAFIEEHDIRFP 175 Query: 185 ---AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIK 241 + +D T + + + LY Y +H +S+ + +++ Y H + Sbjct: 176 LRGELENIEDQYVTLRCFSLSRGIAFVHEPLYYYRVHLSSIVQRYQK-----QYFHHGLT 230 Query: 242 ITRLLEKLNRNYADKIMIYPEFHQQITYEALRVC---HAVRKEPDILTRQRMIAEIFTSG 298 Y D + I L + + + + E+ T Sbjct: 231 FYHAQRLFLTEYDDISLYEKALDFFIVNHTLHCMLNEWKSQNALSLREKLDHMREMVTHE 290 Query: 299 MYKRLITNVRSVKVGYQALLWSF 321 ++ + V K+ + + F Sbjct: 291 AFQDAVKQVEPSKLTSRKRMILF 313 >UniRef50_C0B702 Putative uncharacterized protein n=1 Tax=Coprococcus comes ATCC 27758 RepID=C0B702_9FIRM Length = 283 Score = 246 bits (629), Expect = 8e-64, Method: Composition-based stats. Identities = 68/261 (26%), Positives = 127/261 (48%), Gaps = 21/261 (8%) Query: 1 MMNSTN---KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYA 57 MM T K SV++P+YN C++S++ QT+ LEII+++DGS DN ++ + Sbjct: 1 MMKQTEEKIKFSVVLPIYNVEKYLNRCLDSVMNQTYKKLEIILVDDGSPDNCPQMCDNWE 60 Query: 58 ENYPHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDV 117 + ++++H+ NAG ARN G++VATG Y+AF D+DD + ++E L T+ + D+ D+ Sbjct: 61 KVDDRIKVVHKKNAGLGEARNSGLDVATGDYIAFFDSDDYIDTRLFEELYTVIISDNPDL 120 Query: 118 AQCNADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGL-----------SSRRWTHVV 166 + R+ T IP L G + L L ++ Sbjct: 121 IEFGHHDVDRQGNITKTFIPKTPLEK---YEGEEVLSKFLPELICTDPKTGTASDLLMSA 177 Query: 167 WMGVYRRDVIVKNNIKFIAGLHH--QDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSR 224 W +YRR ++ + N F++ + +D+ + M N ++ Y Y ++ S++ Sbjct: 178 WSCLYRRQLLAECNFHFVSERKYISEDVYSLMKLMPNVHSVNVVHKAYYYYCENDQSLTH 237 Query: 225 LHRQG--NKNLNYQRHYIKIT 243 +++ K + +Q H ++ Sbjct: 238 VYKPDRFEKLVAFQLHLEELC 258 >UniRef50_A8SG33 Putative uncharacterized protein n=1 Tax=Faecalibacterium prausnitzii M21/2 RepID=A8SG33_9FIRM Length = 328 Score = 246 bits (629), Expect = 8e-64, Method: Composition-based stats. Identities = 62/275 (22%), Positives = 127/275 (46%), Gaps = 18/275 (6%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M + +S+I+P+YN + C+ SL QT+ +E++++NDGSTD S +I + Y+ Sbjct: 1 MSEQYDLISIIVPVYNVKNYLEQCLNSLANQTYKNIEVLLVNDGSTDGSDKICEKYSLKD 60 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 + + N+G + ARN GI+ + GKY+AFVD+DD V L T+ + +++ Sbjct: 61 SRFKKFDKPNSGPADARNLGIKNSHGKYIAFVDSDDYVDHGFILELYTILQKYKANISIA 120 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN 180 + E + +GV++ L L+ + V ++ +++ N Sbjct: 121 GYTVFY----ENTNFRSSFSYDRSGVISAEAVLSSILTDETIANFVCNKLFEKELF--NR 174 Query: 181 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGN---------- 230 +KF +G ++D+ + + + + T + LY Y S++ + Sbjct: 175 VKFPSGKAYEDVAVFYKLVAASKKLCITSKCLYYYRKRFGSITTSVNPKSTLDGLEAATA 234 Query: 231 KNLNYQRHYIKITRLLEKLNRNYADKIMIYPEFHQ 265 +NL H+ + + E +N +++ + Y + + Sbjct: 235 RNLYVSSHFPTLKK--ECINSDFSMLVYSYNQLAK 267 >UniRef50_B5G7W4 Glycosyltransferase n=3 Tax=Streptomyces RepID=B5G7W4_9ACTO Length = 1271 Score = 246 bits (629), Expect = 8e-64, Method: Composition-based stats. Identities = 64/252 (25%), Positives = 110/252 (43%), Gaps = 11/252 (4%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M +LSVI+P+Y+ C++SL QT +EI++++DGS D S IA+ YA Sbjct: 1 MSAVPPRLSVIVPVYDVETYLPACLDSLAAQTLREIEIVVVDDGSPDGSALIAEEYARRD 60 Query: 61 PHVRLLHQANAGASVARNRGIEVAT--GKYVAFVDADDEVYPTMYETLMTMALEDDLDVA 118 P RL+ Q N G ARN G++ A +++AFVD+DD V Y ++ D A Sbjct: 61 PRFRLVRQENQGLGAARNTGVDAAHPAAEFLAFVDSDDMVPADAYRLMLASLDASGSDFA 120 Query: 119 QCNADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVK 178 N ++ WQS L + + L + R V+RRD + Sbjct: 121 TGNVRHI--DSRRVWQSPMHRFLGAGAERRTHVTRKRKLLTDRI---ACNKVFRRDFWTR 175 Query: 179 NNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNT----SVSRLHRQGNKNLN 234 + ++F G+ ++D F A + +Y + + S+++ + + Sbjct: 176 HGLRFPVGVLYEDTPVIVPAHFLAEAVDIVSEPVYYWRMREGEAGPSITQRRTEPKAVRD 235 Query: 235 YQRHYIKITRLL 246 +++R L Sbjct: 236 RAAAVTEVSRFL 247 >UniRef50_B2UQ55 Glycosyl transferase family 2 n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UQ55_AKKM8 Length = 341 Score = 246 bits (628), Expect = 1e-63, Method: Composition-based stats. Identities = 67/324 (20%), Positives = 133/324 (41%), Gaps = 20/324 (6%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M + +SV++P+YN C+ES+ +QT LEI+ ++D STD S+ I + +AE P Sbjct: 1 MTAPVNISVLVPVYNVEPYLAQCLESICSQTLRELEIVCVDDASTDGSLSILREFAERDP 60 Query: 62 HVRLLH-QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 V+++ N G S +RN + A G+Y+ VD+DD + + E + A D D C Sbjct: 61 RVKVVQAPENGGLSRSRNLAMSHAVGEYLFLVDSDDWLETDLLEEMYRRAKALDADRLAC 120 Query: 121 NADWCFRETGETWQS-IPTDRLRSTG--VLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIV 177 + + + +P D + P+ + + H + RR ++ Sbjct: 121 GFRYYYESAPDREDRFLPEDMAPPEKGWLPCTPETI------GKIHHGAGGMMIRRSIVE 174 Query: 178 KNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQR 237 K+ I+F G+ +D+ + R ++ Y Y S++ G+ + Sbjct: 175 KHGIRFPEGVACEDLYFHYAVFPWCRRVCVVSRAAYVYRKRAGSITSGFASGSSLQSL-- 232 Query: 238 HYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKE-PDILTRQ---RMIAE 293 Y+ + L+ K + I E+ + +RK P + ++ ++ Sbjct: 233 DYLTVAELVLKEWKE----AGILEEYRTAFLKMLVMGVRNIRKYAPHAVQKEVTRKVTDM 288 Query: 294 IFTSGMYKRLITNVRSVKVGYQAL 317 + +Y+ + + + L Sbjct: 289 LRQENLYRPAEDDACLSRREGKLL 312 >UniRef50_C3PZ19 Glycosyl transferase n=1 Tax=Bacteroides sp. 9_1_42FAA RepID=C3PZ19_9BACE Length = 326 Score = 246 bits (628), Expect = 1e-63, Method: Composition-based stats. Identities = 64/254 (25%), Positives = 118/254 (46%), Gaps = 4/254 (1%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S+IIP+YN C++S+I Q++T E I+++DGSTD S EI +AE + ++H Sbjct: 2 ISIIIPVYNVKLYLDNCIQSVIQQSYTDFECILVDDGSTDGSSEICDQWAEKDNRIIIVH 61 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 Q N G S ARN+G+E A G+Y+ F+D+DD V ++ E + D+ C F+ Sbjct: 62 QPNGGVSSARNKGLEQAKGEYICFIDSDDWVDVDYLSAMINNLKEKETDLIICGLTQEFK 121 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 + G IP L ++ M + +Y+ ++I + I F + Sbjct: 122 D-GNCIDYIPQQSLNFELGCNHTNYFVMLNEQNLLYGPIV-KLYKNNIIQHHQILFDPQV 179 Query: 188 HH-QDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRLL 246 + +D+++ +++ A +S Y Y + S + + K + KI + Sbjct: 180 SYGEDLLFNYQYLEYAHTIACINESHYHYRI-LGSGTLSSKFRPKQFDTDYQQWKILQSF 238 Query: 247 EKLNRNYADKIMIY 260 + + ++ Y Sbjct: 239 YQRHHLWSSTAQTY 252 >UniRef50_Q8A2D4 Glycoside transferase family 2 n=2 Tax=Bacteroides RepID=Q8A2D4_BACTN Length = 329 Score = 245 bits (626), Expect = 2e-63, Method: Composition-based stats. Identities = 74/343 (21%), Positives = 149/343 (43%), Gaps = 20/343 (5%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M + +S+I+P++N D C+ESL +QT +EII++++ S D S EI YA+ Sbjct: 1 MENKYLVSIIVPVHNTADYLHKCIESLRSQTLQQIEIILVDNLSVDGSSEICDEYAKIDK 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 +++LH + AG S+ARN G+ +A+ Y+ FVD+DD V P M+E ++ ++ D ++A CN Sbjct: 61 RIKVLHLSIAGLSIARNAGMRIASAPYIGFVDSDDYVEPAMFEKMLDAMVQSDAEIAYCN 120 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNI 181 + E + R V+ P + + + + +Y+ + + Sbjct: 121 FLLEY----EFKPNESPYRNSGDIVIRDPKNVLQDMMMEKVSCSACTKLYKSGFLTS--L 174 Query: 182 KFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIK 241 +F G +++D + E++ + + + Y Y TS+ N + Y + Sbjct: 175 QFPEGKNYEDRLVMYEWVALCKKVVWVDSPFYHYVERQTSICHTMSPMNLYHYFLAEYAR 234 Query: 242 ITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYK 301 + R +EK + + + +I + C + KE + R E + Sbjct: 235 M-RFMEKYSLFEEEDLF-------KIRTRLIGTCLTIFKEILLKVNLREFKE--PVKDMR 284 Query: 302 RLITNVRSVKVGYQALLWSFRL----WQWRDKTRSHHRITRSA 340 + + + ++ L + R+ + W +H + Sbjct: 285 QKMKEIAALPEDTYELKYRRRVRKIAYHWHLYYLTHFLFKKGG 327 >UniRef50_B7BDW0 Putative uncharacterized protein n=1 Tax=Parabacteroides johnsonii DSM 18315 RepID=B7BDW0_9PORP Length = 318 Score = 245 bits (626), Expect = 2e-63, Method: Composition-based stats. Identities = 62/255 (24%), Positives = 125/255 (49%), Gaps = 13/255 (5%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 N ++S+I+P Y TC+ES++ Q++ E+I+++DGS D S +I YA+ Sbjct: 3 KNLLPRVSIIVPCYKVEQYLPTCIESVLHQSYDNWELILVDDGSPDKSGKICDEYAKEDN 62 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 ++++H+ N G + ARN I++A G+Y++F+D DD ++ + L+++AL+ + QCN Sbjct: 63 RIKVIHKTNGGVAAARNVAIDLAEGEYISFLDGDDFLHVDYVQDLISLALKHQAGIVQCN 122 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNI 181 + ++ ++ + D ++ +VW +Y+ +++ N Sbjct: 123 YVRGNDRIFPNVTKDLSVKVYNSHSIFTSDAAKI---------IVWGKLYKTNIVK--NF 171 Query: 182 KFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHY-I 240 K G +D T + ++A + T + Y Y + S H++ NL++ Y Sbjct: 172 KIPEGRFFEDDWITWRWYYSAKKIVVTSRPYYYYAYNEMSTMTQHKKRP-NLSFIDAYNE 230 Query: 241 KITRLLEKLNRNYAD 255 +IT + R+ D Sbjct: 231 RITFFKQTAERDLED 245 >UniRef50_C4ZHK7 Glycosyltransferase Family 2 modular protein n=1 Tax=Eubacterium rectale ATCC 33656 RepID=C4ZHK7_EUBR3 Length = 881 Score = 245 bits (626), Expect = 2e-63, Method: Composition-based stats. Identities = 72/306 (23%), Positives = 141/306 (46%), Gaps = 11/306 (3%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M K+S+I+P+YNA + C+ S+ +QT +EII ++DGSTD+S EI +AE Sbjct: 9 MSTQNVKVSIIVPIYNAEKYLKQCLGSIQSQTLKDIEIICVDDGSTDSSPEIMDSFAEKD 68 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 +++H+ N G + N G+ VATG+Y+ V+ADD + M+E L + + +D+ + Sbjct: 69 SRFKVVHKPNGGNGHSMNAGLSVATGEYIGCVEADDYIEKNMFEKLYLYSNDGSIDIVKS 128 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN 180 N C+ + + + + L P ++ +W +Y+R +I +N+ Sbjct: 129 NFWNCYEDESGKITKVVNEERNNMPELALPFTIKEYPQILWGHPSIWSAIYKRSLIEENH 188 Query: 181 IKFIA--GLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRH 238 IKF G D + E M A +T++ Y Y + + +K ++ Sbjct: 189 IKFKEAKGGGWVDNPFFFETMCCAKSIYWTKRPFYNYR------TEVEGSSSKGYDFNIP 242 Query: 239 YIKITRLLEKLNR-NYADKIMIYPEFHQQITYEALRVCHAVRKEPD--ILTRQRMIAEIF 295 + ++ L+ L R N D+ + + + + Y + + + + ++ R + E Sbjct: 243 FDRMQDNLDVLKRYNCQDEETLKLAYARALMYLCGAINESDYEGKEQLFYSKAREMLEKI 302 Query: 296 TSGMYK 301 + + Sbjct: 303 NPMVIR 308 >UniRef50_A6LCC5 Glycosyltransferase family 2 n=2 Tax=Bacteroidales RepID=A6LCC5_PARD8 Length = 312 Score = 245 bits (625), Expect = 2e-63, Method: Composition-based stats. Identities = 69/306 (22%), Positives = 131/306 (42%), Gaps = 17/306 (5%) Query: 6 NKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRL 65 +S+I+ +Y A R C++SL+ QT+ EI++++DGS D S EI YA VR+ Sbjct: 3 PVISIIVAVYKAESCLRRCVDSLLAQTFQDYEILLVDDGSPDRSGEICDEYARKDNRVRV 62 Query: 66 LHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWC 125 H+ N G + AR G+ A G+YV D DD V P M E L A E++ D+ C+ Sbjct: 63 FHKENGGVASARQCGMNNARGEYVIHADPDDWVEPNMLEELYGKAKEENADMIICD---V 119 Query: 126 FRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIA 185 +R+T + + + D ++ + +R K ++ F Sbjct: 120 WRDTSKKSTYVKQRLFILEHCVMLKDVF------QQLHGSCCNKLVKRACYSKYDVNFPE 173 Query: 186 GLHH-QDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 GL + +D + + + ++ Y ++ Y Y + S R + K+ Sbjct: 174 GLSYGEDTYVSVCLLAHPIKMAYVPKAFYHYVQNVNYSSLTRRPVKILV---EQCEKLCD 230 Query: 245 LLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLI 304 LL + + IYP + +A + + + P I+T ++ + + +++ Sbjct: 231 LL-RCKLSENQFKEIYPALRYR---QACVILTSAGENPYIVTFKKDFEGVRKAILHQNFS 286 Query: 305 TNVRSV 310 +++ Sbjct: 287 FKRKTL 292 >UniRef50_C3QCN3 Glycoside transferase family 2 n=6 Tax=Bacteroides RepID=C3QCN3_9BACE Length = 322 Score = 245 bits (625), Expect = 3e-63, Method: Composition-based stats. Identities = 74/303 (24%), Positives = 136/303 (44%), Gaps = 9/303 (2%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M + K+SV+IP+YN R +ES++ Q+ LEII+++DGSTD S+ I + + Sbjct: 1 MKYNEIKVSVVIPVYNTEKYVRQAVESVMYQSLKELEIIVVDDGSTDKSLSIVEKLGDTD 60 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 +++ QAN G S+ARNRGI A G+Y+ F+D+DD + E E+ LD Sbjct: 61 KRIQIYTQANQGQSIARNRGISHAHGEYIYFMDSDDLLEEDALELCYHKCKEEKLDFVFF 120 Query: 121 NADWCFRETGETWQSIPTDRLRS--TGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVK 178 +A F E ++ + TG + L + L ++ +T V + R+VI K Sbjct: 121 DALVFFENNVENAPTLNYKHTEKLEDKIYTGQEALEIQLQNKEYTPSVCLHFIHRNVIEK 180 Query: 179 NNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRH 238 +N+ F + H+D ++TT + +A +++ + + S Y Sbjct: 181 HNLSFYPEIVHEDQLFTTLLYLQSTKAACIKRTFFHRRMRKDSTMTSKFAMRNIKGYLVV 240 Query: 239 YIKITRL-LEKLNRNYADKIMIYP-EFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFT 296 +I + + I +Y + ++A + K P+ + R+ + + Sbjct: 241 TEEILSFRRQTTEDKNKEIIDLYLSQMLDAAMWQAHSL-----KLPERIKLARLCLQKYK 295 Query: 297 SGM 299 + Sbjct: 296 KYV 298 >UniRef50_A6P2R1 Putative uncharacterized protein n=1 Tax=Bacteroides capillosus ATCC 29799 RepID=A6P2R1_9BACE Length = 294 Score = 244 bits (624), Expect = 3e-63, Method: Composition-based stats. Identities = 66/261 (25%), Positives = 118/261 (45%), Gaps = 9/261 (3%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 + +S+I+P+YN C+ S+ QT E+++++DGSTD S +I + Y Sbjct: 2 ESPAVSIIVPVYNVKPYLEKCLSSIRFQTCRDFEVLLVDDGSTDGSADICRSYVALDSRF 61 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 RLL QANAGAS ARN G+ A G+Y+ FVD DD + E L+ A D+ Sbjct: 62 RLLTQANAGASAARNLGMTQAAGRYLQFVDGDDWLPTDSVELLLHTAESTGADLTV---A 118 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLS--SRRWTHVVWMGVYRRDVIVKNNI 181 +R GE + +R V+T ++ + + + V+W +YRR ++ N++ Sbjct: 119 HFYRVAGERCAA--RGHIREERVMTRTEYADYMVKAPANFYYGVMWNKLYRRAIVEANHL 176 Query: 182 KFIAGLHH-QDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYI 240 + A + +D ++ +++ + +Y Y S+ K + +R Sbjct: 177 ECPADVAWCEDFLFNLDYIACCRLIAAVPRPVYYYRKREGSLVSTQATLRKTIQTKRLTF 236 Query: 241 KITRLLEKLNRNYA-DKIMIY 260 + + Y K+ +Y Sbjct: 237 NEYKEFYQQLDLYERRKVRVY 257 >UniRef50_Q02W60 Glycosyltransferase n=1 Tax=Lactococcus lactis subsp. cremoris SK11 RepID=Q02W60_LACLS Length = 301 Score = 244 bits (624), Expect = 3e-63, Method: Composition-based stats. Identities = 76/313 (24%), Positives = 131/313 (41%), Gaps = 20/313 (6%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 K+SVI+P+YN C+ S++ Q++ E+++I+DGS D+S +I YA+ VR Sbjct: 1 MPKVSVIVPVYNVEKLLERCLNSVLNQSFQDFELLLIDDGSKDSSGQICDNYAKKDQRVR 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 + H N G S ARN GI+ G Y+ F+D+DD V E L +E + DV C Sbjct: 61 VWHIPNGGQSAARNLGIDNVYGTYIVFIDSDDFVELDYLEQLYQPMVEYEADVVSCRYIA 120 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRW---THVVW---MGVYRRDVIVK 178 E E + I + + + + + W +Y+ +++ + Sbjct: 121 YSEERDEEVKKIIQENQAKVYHFRNNQDVMQDFLEKEFKDDRNFPWESYANLYKTELLGE 180 Query: 179 NNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRH 238 I+F G +D + ++ +A T Y YY + TS +R+ Sbjct: 181 --IRFEVGREFEDNFFNYQYFKKVQKAIATSYIGYYYYANPTSKTRIT-----------F 227 Query: 239 YIKITRLLEKLNRNYADKIMIYPEFHQ-QITYEALRVCHAVRKEPDILTRQRMIAEIFTS 297 Y K L+++ D + YPEF + AL+ C K + Sbjct: 228 YEKQFDLIQQERLIIEDVKIKYPEFIDLEANNLALKYCWLYGKIFGDIRFSNFSVSKKIF 287 Query: 298 GMYKRLITNVRSV 310 G +K I ++++ Sbjct: 288 GSFKVFIKRIKNI 300 >UniRef50_A6LG23 Glycosyltransferase family 2 n=6 Tax=Bacteroidales RepID=A6LG23_PARD8 Length = 459 Score = 244 bits (623), Expect = 4e-63, Method: Composition-based stats. Identities = 62/235 (26%), Positives = 107/235 (45%) Query: 15 YNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLHQANAGAS 74 YN C++SL Q +EII++NDGSTD S EIA YA+ ++++HQ N GAS Sbjct: 16 YNVEKYLTVCIDSLKNQGGLRMEIILVNDGSTDLSGEIADEYAKKEERIKVIHQENGGAS 75 Query: 75 VARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFRETGETWQ 134 ARN G+++AT +Y+AF+D+DD + L A++ DV N C ++ Sbjct: 76 AARNVGLDIATEEYIAFLDSDDWIKDESLSLLYDEAVKHHADVVMGNMWLCHQDGSMDKP 135 Query: 135 SIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGLHHQDIVW 194 +L+G + + + + + + +Y R + K +F G+ H+D +W Sbjct: 136 FKCISNGSIKELLSGKESFIELVKTCFYLPMPFKYIYNRKYLHKIQARFEEGIMHEDELW 195 Query: 195 TTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRLLEKL 249 + A R T Y Y + SV + + + + + ++ Sbjct: 196 CPIVLCQAERMVITNIEFYYYRQNEESVMHTTSLARRLKSLFQVTEGLIKFSDQF 250 >UniRef50_B2DG34 Eps5I n=3 Tax=Streptococcus pneumoniae RepID=B2DG34_STRPN Length = 333 Score = 244 bits (623), Expect = 4e-63, Method: Composition-based stats. Identities = 75/288 (26%), Positives = 131/288 (45%), Gaps = 8/288 (2%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M+ L+++IP+YN + C++S+I+Q W E+I++NDGSTD S I + YA+ Y Sbjct: 1 MIIQNILLTIVIPVYNVEKYLKRCIDSVISQEWDKYEVILVNDGSTDASPNICEEYAQKY 60 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 + ++H+ N G S RN G+ A GKYV F+D+DD + M+ L M +E + D+ Q Sbjct: 61 HFISVIHKENGGLSETRNTGLSHANGKYVFFLDSDDWITKDMFRNLSKMIMEQNYDILQF 120 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN 180 E E ++ D + LS T +Y+R++ KN Sbjct: 121 GMQMFHSEREELKNVQCKEKN-----FNSSDAFKNMLSVEGITSFATDKIYKRELFEKNG 175 Query: 181 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYI 240 I+F G ++D+ + + +A + T Q Y Y++ N + K + + + Sbjct: 176 IEFPIGYFYEDLGTVYKLILSAKKIYLTTQVYYCYFIGNDAAITKQWSEKKISDVYKFHK 235 Query: 241 KITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQ 288 KI + + +D I + ++ L + +E L R Sbjct: 236 KIFNVSSLM---VSDDIFLSKSYYNNGLVYLLMKLYEENQEDTQLFRL 280 >UniRef50_C6JQJ1 Glycosyl transferase n=1 Tax=Fusobacterium varium ATCC 27725 RepID=C6JQJ1_FUSVA Length = 337 Score = 244 bits (623), Expect = 4e-63, Method: Composition-based stats. Identities = 74/269 (27%), Positives = 131/269 (48%), Gaps = 12/269 (4%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 MM K+S+IIP+YN + C++S+I Q+ +EII INDGSTD+S+EI + Y + Sbjct: 1 MMGK--KISIIIPIYNVEKYLKQCLDSIINQSLKEIEIICINDGSTDSSLEIIQEYMKKD 58 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 + L+++ N G S ARN GI+ +TGKY+ VD+DD V L A ++ LD+ Sbjct: 59 SRIILINKKNEGPSAARNDGIKKSTGKYILQVDSDDWVEEESLSKLYGYAEKNSLDIVSF 118 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN 180 N +RE + I + T ++ + L+ R V + + + N+ Sbjct: 119 NY---YREFFKESIKIQVLFEINKNFFTSIEYFNLFLTEREIP-SVCNKLIKSTIYKNNH 174 Query: 181 IKFIAGL-HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHY 239 I + G+ +D+ T+ ++ + + Y ++ LY Y ++ S++ G+K + Y Sbjct: 175 ILYPVGINLGEDLATITKLIYFSQKIGYLDEYLYHYRYNDKSITSTLT-GSKRAELFKIY 233 Query: 240 IKITRLLEK----LNRNYADKIMIYPEFH 264 + ++ LN Y + F+ Sbjct: 234 ENLKEFFQEKKLDLNLLYFMFLKKVSNFY 262 >UniRef50_C8WIW0 Glycosyl transferase family 2 n=1 Tax=Eggerthella lenta DSM 2243 RepID=C8WIW0_EGGLE Length = 340 Score = 244 bits (623), Expect = 4e-63, Method: Composition-based stats. Identities = 72/223 (32%), Positives = 116/223 (52%), Gaps = 2/223 (0%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S+I+P+YN C+ SL QT++ E I I+DGSTD ++E A+ A + + Sbjct: 4 VSIIVPMYNVRPYIGECLASLKRQTFSDFEAICIDDGSTDGTLEAARNAAGDDERFVFVG 63 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 Q NAG SVARN GI +ATG+ V F+D+DD +TL A D+LDV +A + Sbjct: 64 QVNAGQSVARNVGIGLATGRSVLFLDSDDYYEDRALQTLFDHAERDELDVLFFSARTVYE 123 Query: 128 --ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIA 185 E+ ++ DR G++TG + L +R ++ + + RRD + ++ I F Sbjct: 124 DAESRARYRDDYEDRAAIEGIMTGQELLERFAENRSFSVSPALQLIRRDFLTESGISFFE 183 Query: 186 GLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQ 228 G+ H+D ++T + A RA + + LY + S R+ Sbjct: 184 GIVHEDNLFTGLILAQASRAAFLNEQLYVRRVRAGSTMTSQRE 226 >UniRef50_B7HXA1 N-acetylglucosaminyltransferase n=2 Tax=Bacillus cereus RepID=B7HXA1_BACC7 Length = 353 Score = 244 bits (623), Expect = 4e-63, Method: Composition-based stats. Identities = 77/324 (23%), Positives = 141/324 (43%), Gaps = 9/324 (2%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S+I+P+YNA C++SL+ QT+ +EII+INDGSTDNS +I YA +R++H Sbjct: 6 VSIIVPVYNAEKYVHKCIDSLLQQTYKNIEIILINDGSTDNSGKICDEYANESSKIRVIH 65 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 Q N+G S RN GI+VA G Y+ FVD+DD V P M E ++ A+ + + C Sbjct: 66 QKNSGPSATRNVGIDVANGAYIQFVDSDDYVEPNMTEEMLK-AMNNKFQLVICGYKSLSL 124 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 + + + L + + +W +Y +I +N+++F + Sbjct: 125 FNKDINIKEHKCHISGGYDYSSFMMEFGYLFRDVYINPLWNKLYNTALIKENHLRFKDNV 184 Query: 188 H-HQDIVWTTEFMFNALRARYTEQSLYKY-YLHNTSVSRLHRQGNKNLNYQRHYIKITRL 245 + +D+++ E++ + LY Y +++ S++ ++ Q+ R Sbjct: 185 YMGEDLLFNLEYIKKCNYINVINKYLYNYAVINSESLTISFKKD--FFKTQQMLFTAIRE 242 Query: 246 LEKLNRNYADKIMIYPEF-HQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLI 304 K++ Y D E + L + R+ I+ I + + Sbjct: 243 FLKVDYFYNDSNKKMIEIAYTDTVVSCLEHLFHKDSNLTVNLRKENISSIVNNNCTRE-- 300 Query: 305 TNVRSVKVGYQALLWSFRLWQWRD 328 N+ K G + +L + Sbjct: 301 -NIGYFKEGNIQKRFIGQLISLKS 323 >UniRef50_C5NWI7 Glycosyl transferase, group 2 family n=1 Tax=Gemella haemolysans ATCC 10379 RepID=C5NWI7_9BACL Length = 328 Score = 243 bits (622), Expect = 5e-63, Method: Composition-based stats. Identities = 65/245 (26%), Positives = 113/245 (46%), Gaps = 4/245 (1%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +SVI+P+YN D M+SL QT+ EII++NDGSTDNS ++ Y+E Y +VR H Sbjct: 2 ISVILPVYNVEDYLHYAMDSLEKQTYKNFEIILVNDGSTDNSGKLCDEYSEKYSNVRAFH 61 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 + N G S ARN GI+ A G+++ F+D DD + E L+ + + D D+ Sbjct: 62 KENGGLSDARNFGIQKAKGEFITFLDPDDYLEAYSLELLVGIQEKHDCDIVSTRVKATES 121 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 + S+ + + +L + T +YR+ ++ I F G Sbjct: 122 YNVYSDYSLVEKDFENVTHMDNDVFLEEAFYDKVVTVSACGKLYRKSILE---IPFPKGK 178 Query: 188 HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRLLE 247 ++D+ ++ + A + +T +Y YY S+ + K ++ K + Sbjct: 179 IYEDLFVISDHVTRAQKIVHTSLQIYNYYRRPGSIVN-SKFSKKQYDFFEAIDKNREFAK 237 Query: 248 KLNRN 252 K ++ Sbjct: 238 KFSKE 242 >UniRef50_C6LFJ2 Glycosyl transferase, family 2/glycosyl transferase family 8 n=1 Tax=Bryantella formatexigens DSM 14469 RepID=C6LFJ2_9FIRM Length = 402 Score = 243 bits (622), Expect = 5e-63, Method: Composition-based stats. Identities = 85/357 (23%), Positives = 154/357 (43%), Gaps = 19/357 (5%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M+ K+SVIIP+YNA R ++ QT +EII ++DGSTD+S EI + YA Sbjct: 1 MSYDRKVSVIIPVYNAEKYLRDTLDDATGQTLREIEIICVDDGSTDSSAEIVEEYARRDA 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 VRL+ Q N A VARN G+ ATG+Y+ F DADD P M E + + D+ CN Sbjct: 61 RVRLIRQKNQYAGVARNHGMSHATGEYLVFWDADDCFEPAMLEKMYARITQAQADICVCN 120 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRM---GLSSRRWTHVVWMGVYRRDVIVK 178 + T + +S L + ++ + T+V W ++R + + + Sbjct: 121 VNVLDETTKDLIRS--ESYLVPAYLQGAQEYAKTTHPEYLFNIATNVPWNKMFRAEFVRE 178 Query: 179 NNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTS-----VSRLHRQGNKNL 233 + ++F D+ + + A + ++ L Y +N S +S+ K Sbjct: 179 HGLQFENRSRANDVYFVMMAFYLAEKIAVLDERLVTYRANNESSLTGTLSQTPLCAIKAF 238 Query: 234 NYQRHYIKITRLLE--KLNRNYADKIMIYPEF--HQQITYEALRVCHAVRKEPDI--LTR 287 + R + +E ++ +++ ++++ F H+ + EA + K+ L Sbjct: 239 DAVRKELLRLGAMENPQIRQSFDNRVLQSLLFGLHKCVQGEAFAQMYQYLKKEGFARLGI 298 Query: 288 QRMIAEIFTSGMYKRLITNVRSVKVGYQALLWSFRLWQWRDKTRSHHRITRSAFNLR 344 ++ Y+R + + + Y + + L + K + H + NLR Sbjct: 299 CPQEQYYYSEEAYRRFLRLLDEEPLDY---VLQYALEEKTKKQKRIHELKEKVQNLR 352 >UniRef50_B2IQI2 Glycosyl transferase, family 2 n=31 Tax=Firmicutes RepID=B2IQI2_STRPS Length = 328 Score = 243 bits (622), Expect = 5e-63, Method: Composition-based stats. Identities = 67/267 (25%), Positives = 119/267 (44%), Gaps = 17/267 (6%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 T +SVI+P+YN + S+ QT+ LEII+++DG+TD S + AE V Sbjct: 2 ETALISVIVPVYNVAQYLEKSIASIQKQTYQNLEIILVDDGATDESGRLCDSIAEQDDRV 61 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 +LH+ N G S ARN G++ A G Y+ F+D+DD ++P M ++L +++D DV+ C Sbjct: 62 SVLHKKNEGLSQARNDGMKQAHGDYLIFIDSDDYIHPEMIQSLYEQIMQEDADVSSCGVM 121 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 + + D V +LR L + + + +R++ + F Sbjct: 122 NVYANDESPQSANQDDYF----VCDSQRFLREYLIGEKIPGTICNKLIKREIATA--LSF 175 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKIT 243 GL ++D + + + A + + Y Y+ S++ K+L Y Y K Sbjct: 176 PKGLIYEDAYYHFDLIKLAKKYVVNTKPYYYYFHRGDSIT-TKPYAEKDLAYIDIYQKF- 233 Query: 244 RLLEKLNRNYADKIMIYPEFHQQITYE 270 Y + + YP+ + + Sbjct: 234 ---------YNEVVKNYPDLKEVAFFR 251 >UniRef50_B2UQN8 Glycosyl transferase family 2 n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UQN8_AKKM8 Length = 376 Score = 243 bits (622), Expect = 5e-63, Method: Composition-based stats. Identities = 89/355 (25%), Positives = 144/355 (40%), Gaps = 20/355 (5%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M + +SVI+P+YN C++SL QT LEII ++DGSTD S I + YA Sbjct: 1 MPLPSPLISVIVPIYNMEPLLPRCLDSLAAQTLRDLEIICVDDGSTDGSGGIVRKYASGD 60 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 RL+ Q N+G + ARN GI A Y+ F D DD V P MYE L +A E D+ QC Sbjct: 61 SRFRLITQENSGRAEARNAGIRAAAAPYLGFADPDDYVEPDMYERLYRLAEESGADMVQC 120 Query: 121 NADWCFR-ETGETW-----QSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRD 174 + E+GE+ + + + GV T + R T VVW ++RR Sbjct: 121 SYSPFLPAESGESRGMAEEKLLHIENTACDGVFTEKGEIFRLFLEDRITGVVWSKLFRR- 179 Query: 175 VI------VKNNI--KFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYL-HNTSVSRL 225 ++ ++ + F +G +D ++ + + T + LY Y L SVS Sbjct: 180 ILPGCSAPLEVRLPSSFTSG---EDTLYVSRAIARCRSVALTSEKLYHYGLGGPQSVSSR 236 Query: 226 HRQGNKNLNYQRHYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDIL 285 +R+ ++ + Y + + + + + L Sbjct: 237 NRKAETRPASYYAVFEMLTREKLREGVLGSNRTAYMNYIVPLLFPDNEMPAGRLRHWAEL 296 Query: 286 TRQRMIAEIFTSGMYKRLITNVRSVKVG-YQALLWSFRLWQWRDKTRSHHRITRS 339 R+ I GM + + + G + L W+++ R R+ S Sbjct: 297 WREADITSEHVCGMPREQRAYLEAALAGRWIYLRLLQCFWKFKTARRKMFRLRFS 351 >UniRef50_Q74L33 Putative uncharacterized protein n=1 Tax=Lactobacillus johnsonii RepID=Q74L33_LACJO Length = 292 Score = 243 bits (622), Expect = 6e-63, Method: Composition-based stats. Identities = 81/305 (26%), Positives = 141/305 (46%), Gaps = 19/305 (6%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 K+SVIIP+YN C++S++ QT++ LEII+++DGSTD++ E+ + Y E Y ++R Sbjct: 1 MEKISVIIPVYNDEKYLAQCLDSVLRQTYSNLEIILVDDGSTDSTPELCEKYREKYANIR 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 +LH+ N G +RN G+E+ATG+Y+ FVD DD + T E L + ++D D+A N + Sbjct: 61 ILHKKNGGVGSSRNAGLEMATGEYILFVDHDDLLGETHIEELYKLLKKNDADIAVGNFNH 120 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTH-----VVWMGVYRRDVIVKN 179 E + D T +W + + + V W +Y++ + Sbjct: 121 FIEEKRAYGIWLKEDDYF-EKTYTPEEWFTVEYETVPYNMSIIFAVPWAKLYKKSLFE-- 177 Query: 180 NIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNY-QRH 238 NI + +D + T + A + Y +++Y + + SVS Q NK + Sbjct: 178 NIVYPINARVEDDLTTWKIYLLADKIAYMNKAIYTHRIFENSVS---AQANKTAVFPLEA 234 Query: 239 YIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSG 298 + LL +L + +I Y Y K+ D L + ++ Sbjct: 235 VEERISLLSQLGFDITTEIEAY-------HYRLNICIDTALKDGDYLKYRNAKQKLAILK 287 Query: 299 MYKRL 303 YK++ Sbjct: 288 KYKKI 292 >UniRef50_C1CFZ0 Glycosyl transferase, family 2/glycosyl transferase family 8 n=4 Tax=Streptococcus pneumoniae RepID=C1CFZ0_STRZJ Length = 291 Score = 243 bits (622), Expect = 6e-63, Method: Composition-based stats. Identities = 63/262 (24%), Positives = 123/262 (46%), Gaps = 8/262 (3%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 +S+++P+YN + R C++S+ QT+ E ++INDGS D+S +I + + E + Sbjct: 3 DELISIVVPIYNVENYLRMCLDSIQNQTYQNFECLLINDGSPDHSSKICEEFVEKDSRFK 62 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 +AN G S ARN GIE + G Y+ FVD+DD + + L +++ D++ + Sbjct: 63 YFEKANGGLSSARNLGIECSGGAYITFVDSDDWLEHDALDRLYGALKKENADISIGRYN- 121 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRM----GLSSRRWTHVVWMGVYRRDVIVKNN 180 + ET + + TD S V+ G + + + WT V + +++R+++ + Sbjct: 122 SYDETRYVYMTYVTDPDDSLEVIEGKAIMDREGVEEVRNGNWTVAV-LKLFKRELLQ--D 178 Query: 181 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYI 240 + F G +D WT + + A R Y + +Y Y + + + + Sbjct: 179 LPFPIGKIAEDTYWTWKVLLRASRIVYLNRCVYWYRVGLSDTLSNTWSEKRMYDEIGARE 238 Query: 241 KITRLLEKLNRNYADKIMIYPE 262 + +L + + + I+IY Sbjct: 239 EKIAILASSDYDLTNHILIYKN 260 >UniRef50_UPI0001C4194D glycosyl transferase GT2 family n=1 Tax=Methanobrevibacter ruminantium M1 RepID=UPI0001C4194D Length = 817 Score = 243 bits (621), Expect = 7e-63, Method: Composition-based stats. Identities = 78/314 (24%), Positives = 143/314 (45%), Gaps = 20/314 (6%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 + K+SVIIP+YN C++S+I QT +EII I+DGS+DNS+EI Y Sbjct: 360 SENKIKVSVIIPIYNNEKFLSKCLDSVINQTLNEIEIICIDDGSSDNSIEILNQYVLKDS 419 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 ++++ Q N GA+ ARN G+++A G+Y+AF+D+DD + +E L ++ D+ N Sbjct: 420 RLKIISQENLGAATARNNGLKIAKGEYIAFLDSDDWLELNAFEKLYENITTNNSDLVLFN 479 Query: 122 A-DWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN 180 + + + I D + T + L + +W +YR + +NN Sbjct: 480 SIEHKENANLKERIHIKNDSIPDYNYYTFNYNYKKDLVMNGYLD-IWSKMYRTSFLKENN 538 Query: 181 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKY-YLHNTSVSRLHRQGNKNLNYQRHY 239 I+F DI + + M NA + Y + LY Y +++ S+ GN++ Sbjct: 539 IQFSNHQIFNDIQFHIKTMLNAKKISYCPEFLYNYLRINHPSLQNNLSLGNESFILLDII 598 Query: 240 IKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGM 299 +I L I EF+ ++ +R K ++ + + + + Sbjct: 599 DEIEDYL------------IDNEFYNELKSNFIR-----FKLTELESTLEKLENPYRNEF 641 Query: 300 YKRLITNVRSVKVG 313 +K + N + +++ Sbjct: 642 FKLIKNNFKKMQLT 655 Score = 189 bits (480), Expect = 1e-46, Method: Composition-based stats. Identities = 59/281 (20%), Positives = 120/281 (42%), Gaps = 28/281 (9%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTW--TALEIIIINDGSTDNSVEIAKYYAENYPH 62 K+S+IIP+YN + + S+I+Q+ LE+I+++D STDNS I K Y Y + Sbjct: 2 NYKISIIIPVYNVENYIEKSLNSIISQSIGIENLEVILVDDNSTDNSANIIKKYVSKYDN 61 Query: 63 VR--LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 + + RN G+ AT +Y+ ++D+DD + T E L + ++ D+ Sbjct: 62 FKGIYCDIGSGFCGRPRNIGLSYATSEYIMYLDSDDWLEETACEVLYNTIINENADIVCG 121 Query: 121 NADWCFRETGETW-------------QSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHV-- 165 + E + + T + ++ P++ + + ++ Sbjct: 122 SQTRLDNEGNRKFYYHLWVTTLTDPNEDYNTRMKTTQEIIDDPNFKLVVTDLDKNPNILG 181 Query: 166 ---VWMGVYRRDVIVKNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKY---YLHN 219 VW ++++D+I +N + F + QD V+ F A + + + Y + Sbjct: 182 HANVWGKIFKKDLITENELSFPEDIVAQDSVFLLNSFFVAEKIVFINDIIVHYNNLRCDD 241 Query: 220 TSVSRLHRQGNKNLNYQRHYIKITRLLEKLNRNYADKIMIY 260 S + + KNL IK L++ +++ ++ + Y Sbjct: 242 DDKSASYVKTTKNL---FGRIKAYDLMDNISKKFSKEEFFY 279 >UniRef50_D2Q641 Glycosyl transferase family 2 n=2 Tax=Bifidobacterium dentium RepID=D2Q641_9BIFI Length = 330 Score = 243 bits (620), Expect = 8e-63, Method: Composition-based stats. Identities = 62/220 (28%), Positives = 101/220 (45%), Gaps = 7/220 (3%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M + +SVI+P++ C+ES++ QT L+II+I+DGS D E+ + Sbjct: 1 MADEQPLISVIVPVHGVEASLDACVESIVRQTHRNLDIILIDDGSPDRCPELCDRWMTQD 60 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 +R++HQ N+G S ARN GIE + G+Y+ FVD+DD + MYETL+ L + D++ Sbjct: 61 SRIRVIHQRNSGPSAARNAGIEASQGEYLGFVDSDDSIAEDMYETLLRNLLREQADISII 120 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN 180 + R R + + + VW +YRR++ + Sbjct: 121 GTSLVGEDGESYVPYATECRFR----MNSEQAFKYVNLPGYFHVAVWDKLYRRELFD--D 174 Query: 181 IKFIAGLH-HQDIVWTTEFMFNALRARYTEQSLYKYYLHN 219 ++F H +D T + A Y Y Y+LH Sbjct: 175 VRFPVDRHSGEDYPVTFRMIDKAETLIYDSTPKYCYHLHE 214 >UniRef50_C7X8L4 Glycoside transferase, family 2 n=1 Tax=Parabacteroides sp. D13 RepID=C7X8L4_9PORP Length = 308 Score = 243 bits (620), Expect = 9e-63, Method: Composition-based stats. Identities = 78/308 (25%), Positives = 134/308 (43%), Gaps = 6/308 (1%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 N +SVIIP+YN + + S++ QT +EII+INDGSTD S+ I + A+ + Sbjct: 2 DKNLVSVIIPVYNTEEFVEEAVRSIMNQTLQEIEIIVINDGSTDGSLSIIERLAKEDKRI 61 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 R Q N G S RN+G E A GKY+ F+D+DD + P ET + LD +AD Sbjct: 62 RTYTQQNQGLSATRNKGNEYAQGKYIYFMDSDDYLEPETLETCYIKCEQQALDFVLFDAD 121 Query: 124 WCFRETGET--WQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNI 181 +E ++ D V GP L++ + + ++ + RR+ + + + Sbjct: 122 ILNKEKHKSLHLNYQRKDCTNPELVYQGPHILKLLMDHKAYSPSACLNFIRREYLTEIQL 181 Query: 182 KFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIK 241 F+ G+ H+D ++T A R + +K L S+ ++ Y Sbjct: 182 DFLPGIIHEDQLFTCLLYLQAQRVSSIHKDFFKRRLREDSIMT----SKFSMRNMESYFT 237 Query: 242 ITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYK 301 IT L K ++ I + ++ A+ + + + + +R + E K Sbjct: 238 ITAHLSKFADSHPMFREIIDLYLSRMLNAAVWLSYKTPFKDRLKIARRCLKEYRQYVTTK 297 Query: 302 RLITNVRS 309 LI + Sbjct: 298 NLIVLLFK 305 >UniRef50_C7MN19 Glycosyl transferase n=1 Tax=Cryptobacterium curtum DSM 15641 RepID=C7MN19_CRYCD Length = 325 Score = 242 bits (619), Expect = 1e-62, Method: Composition-based stats. Identities = 64/272 (23%), Positives = 117/272 (43%), Gaps = 19/272 (6%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M N +SVI+P+YN CM+S++ QT+ LEII+++DGS+D S + +A+ Sbjct: 1 MAN--PLISVIVPVYNVEKYLDRCMQSIVDQTYRNLEIILVDDGSSDGSGALCDTWAQQD 58 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 + +LH+ N G S ARN G++ + G ++ FVD+DD + MYE+L ++ + D+A C Sbjct: 59 NRIIVLHKKNGGLSDARNAGVQTSHGDWIGFVDSDDYIDKQMYESLYSLIEYTNSDMALC 118 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN 180 + + +P + VLT + L ++ V +Y ++ + Sbjct: 119 AVADVYAD----HTDVPAMSAENFTVLTTKEVLSDIFLNKTLMVGVPPRLYPAWLVRE-- 172 Query: 181 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYI 240 + AG H+D + Y Y + +++ + N Sbjct: 173 VPSPAGKTHEDAFIVVDLFSRVEHIVIDRVPRYFYCHNEGTITSTPSLRAASDN------ 226 Query: 241 KITRLLEKLNRNYADKIMIYPEFHQQITYEAL 272 +E N +YP+ Q + + Sbjct: 227 -----IEAWEHNRVLVEKLYPDLMQDVLFRCY 253 >UniRef50_A5A8E7 Galactosyltransferase n=8 Tax=Streptococcus RepID=A5A8E7_STROR Length = 327 Score = 242 bits (619), Expect = 1e-62, Method: Composition-based stats. Identities = 71/287 (24%), Positives = 127/287 (44%), Gaps = 8/287 (2%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +SVI+P+YN + R MESL+ QT+ E+I+++DGSTDNS ++ YA Y V H Sbjct: 2 ISVIVPIYNVEEYLRYAMESLVNQTYKDFEVILVDDGSTDNSGQLCDQYASEYDWVSAYH 61 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 + N G S ARN G+ A G ++ F+D DD P E LM + + + Sbjct: 62 KVNGGLSDARNYGVTKAKGDWIVFLDPDDYFEPCALELLMELQKRTGARIVAGKGKDAYD 121 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 I + L +L+ + L L T W ++ RD++ K+ + G Sbjct: 122 HDSYQNYQINEETLAKATLLSSEETLVEMLYGTIATVSAWAKLFPRDIVEKH--PYPKGK 179 Query: 188 HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRLLE 247 ++D+ ++++ + +Y YY S++ N N + Y + L + Sbjct: 180 IYEDLYVISDYLKEVESVCVIDLGIYHYYRRPNSIT----LSNFNSKHYDFYDSMNHLED 235 Query: 248 KLNRNYADKIMIYPEFHQQITYEALRVCHAV--RKEPDILTRQRMIA 292 + ++Y+ + ++ +L + + + +IL QR I Sbjct: 236 VVQKDYSSNTELDDAVIARLFIGSLHIFILISDSSKEEILCIQRKIR 282 >UniRef50_A1YVC6 Glycosyltransferase n=1 Tax=Lactobacillus johnsonii RepID=A1YVC6_LACJO Length = 348 Score = 242 bits (619), Expect = 1e-62, Method: Composition-based stats. Identities = 86/355 (24%), Positives = 160/355 (45%), Gaps = 26/355 (7%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M +K+S+I+P+YNA C++S+ITQT+ ++II++NDGS DNS EI + + P Sbjct: 1 MTQNSKVSIILPIYNAEKYLEECVKSVITQTYQNIQIILVNDGSKDNSWEICQNLKKKDP 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 + + Q N+G SVARN G+ +A G+++ FVD DD + + E L+ + D+D+ C Sbjct: 61 RIIAITQKNSGVSVARNAGLNIADGEWIMFVDPDDVLSQDIVEKLLLK-TDSDVDIVACT 119 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRW---------THVVWMGVYR 172 + DR ++ L + L + R+ V W +YR Sbjct: 120 CYGFNENEKKLAHFFANDR----AFISDKTDLYLQLLNSRYGQSGSLITAIGVPWGKIYR 175 Query: 173 RDVIVKNNIKFIAG-LHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNK 231 R I K +KF QD V+ A + Y ++ LY Y + +++ K Sbjct: 176 RSFIDKYELKFDPNLRRMQDNVFNMYAFDYARKIYYLDKPLYFYR-----IDHVNKYNEK 230 Query: 232 NL-NYQRHYIKITRLLEKLNRNYADKI--MIYPEFHQQITYEALRVCHA-VRKEPDILTR 287 ++ +++ ++ + R K IY E+ +++ L + + KE + Sbjct: 231 HIKEFKKIFLPVVREQYKCLNELGLYANERIYNEYINEVSNIYLNIIKGIILKEKNTEYI 290 Query: 288 QRMIAEIFTSGMYKRLITNVRSVKVGYQALLWSFRLWQWRDKTRSHHRITRSAFN 342 +R I ++ +K + + + K+ + + +L+ K + + A N Sbjct: 291 KREINQLSRKVYFKDIFVSNKKAKIQNKKI--RLKLFLITHKMEKGYILAYRALN 343 >UniRef50_C9RN10 Glycosyl transferase family 2 n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RN10_FIBSS Length = 318 Score = 242 bits (619), Expect = 1e-62, Method: Composition-based stats. Identities = 69/248 (27%), Positives = 115/248 (46%), Gaps = 11/248 (4%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 K+SVI+P+YN + C+ESL+ Q+++ E+I+++DGSTD S I +YA ++ Sbjct: 1 MPKISVIVPVYNVEKYIKECIESLLNQSFSDFELILVDDGSTDKSASICDFYASKDSRIK 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 + H+ N G S ARN GIE + G+++AFVD+DD V E L A D+ C + Sbjct: 61 VFHKQNGGVSSARNLGIEKSVGEWIAFVDSDDYVDAKYLENLYKSA--YSCDLISCGF-F 117 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 E GE S + +G D + +++ W + + +I N++KF Sbjct: 118 VTNEFGEKTNS----NVNPSGFFYQKD-IADVVNNDWIVTSPWAHLIKTSLIKDNDVKFY 172 Query: 185 AGL-HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKIT 243 +D ++ + A R + LY Y S+ LH K Y Y+ Sbjct: 173 ENRSMGEDTIFVLSCLDKANCIRNIPELLYYYRTTPNSL--LHPDERKYGKYIEDYMWSN 230 Query: 244 RLLEKLNR 251 ++ + Sbjct: 231 NMVYSMKN 238 >UniRef50_B0P2L4 Putative uncharacterized protein n=1 Tax=Clostridium sp. SS2/1 RepID=B0P2L4_9CLOT Length = 332 Score = 242 bits (619), Expect = 1e-62, Method: Composition-based stats. Identities = 73/318 (22%), Positives = 152/318 (47%), Gaps = 14/318 (4%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 +SVIIP YN D C++S+I+QT+ LEII+++DGS D++ EI YA+ + Sbjct: 2 DEKLVSVIIPAYNIEDYIGRCLDSIISQTYKNLEIIVVDDGSRDHTGEILDNYAKKDRRI 61 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 +++H+ N G S ARN+GIE A G Y+ F+D DD + MY+TL+ + E++ D+A C Sbjct: 62 KVIHKENGGVSSARNKGIEAAEGDYIGFIDGDDLIESEMYKTLVDLLEEENADIAHCGYQ 121 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 F + + + + ++++T + L+ LS + +Y++++I + Sbjct: 122 MVFPDRIDYYHNTGKKKIQTT-----EEGLKDLLSGEMIEPGLVNKIYKKELIKNCRLDE 176 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKIT 243 ++D++ + + ++ Y + + Y Y + ++S + N + + +++ Sbjct: 177 TV-KINEDLLMNYQLFKLSQKSVYYDITPYSYMIRSSSAT----GANSLITKREDALRVL 231 Query: 244 RLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRL 303 ++ N +IY + Y + +C K+ + Q+ + + + Sbjct: 232 NQIKDDCINNNLLSIIYKRY----IYLLMAICRDDLKDRSYIEYQKKQRKQLKKELKTDI 287 Query: 304 ITNVRSVKVGYQALLWSF 321 + K+ Y +L + Sbjct: 288 FKSCIPKKLKYMSLFSCY 305 >UniRef50_Q84GX9 Orf34 n=2 Tax=Photorhabdus luminescens RepID=Q84GX9_PHOLU Length = 328 Score = 242 bits (619), Expect = 1e-62, Method: Composition-based stats. Identities = 82/314 (26%), Positives = 143/314 (45%), Gaps = 5/314 (1%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +++IIPL+N ++S+ Q E++IINDGSTDNS+ + Y + +R+ + Sbjct: 6 ITIIIPLFNNESYIACLLDSISVQENVNFEVVIINDGSTDNSLGVINEYKKRDNRIRIFN 65 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 Q N G SVARN GI+ A G++V F+D+DD + P + + A+ LDV N Sbjct: 66 QENRGVSVARNHGIKYARGEWVLFIDSDDWLKPQILRKWLDQAIAQKLDVLIGNGVSFTE 125 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 E T R VLTG +W+ + R W H VW+ + ++++I+ N + F Sbjct: 126 EPYLQENKRITRRQPYGRVLTGKEWIEYCVECREWPHFVWLQLIKKELIINNQLSFKDNT 185 Query: 188 HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRLLE 247 H+DI+WT A R + E+ LY Y + S++R ++ K + YI + + Sbjct: 186 LHEDILWTINLSLIANRIGFCEELLYCYRHNLESITR-SKEIKKLYHRADSYITVVDSI- 243 Query: 248 KLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLITNV 307 N + +I + ++ A+R + + + ++ +++ Sbjct: 244 ---INISKEIKGEKSLRRSLSRHAIREVGSFFILYRKRIKDPALKKLLAKRFIEKIALRD 300 Query: 308 RSVKVGYQALLWSF 321 V LW Sbjct: 301 LLPGVHNYHELWFI 314 >UniRef50_C0YL33 Glycosyltransferase n=1 Tax=Chryseobacterium gleum ATCC 35910 RepID=C0YL33_9FLAO Length = 333 Score = 242 bits (618), Expect = 2e-62, Method: Composition-based stats. Identities = 67/301 (22%), Positives = 139/301 (46%), Gaps = 18/301 (5%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M N +++S+I+P+YN + C++SL+ Q+ +EI+++NDGS DNS +I + YA+ Y Sbjct: 1 MTNVPSRVSIIVPVYNVENYLAKCLDSLVKQSLPDIEILVVNDGSKDNSEKIIEGYAQKY 60 Query: 61 P-HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQ 119 P ++ + N G S ARN GI+ ATG Y+ FVD+DD V TM+E ++ +A + + + Sbjct: 61 PEKIKAYTKENGGLSDARNFGIDRATGDYIGFVDSDDYVTETMFEEMLLLAEKHNAKMVI 120 Query: 120 CNADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKN 179 N + + L + +S + ++++++ N Sbjct: 121 SNIQKVDENGKVVQKLTQLPNMPEKITLENNFSVFSDIS-----YFACNKLFKKELF--N 173 Query: 180 NIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHY 239 +F G+H +DI + + T+ Y+Y S+++ H + K L+ + Sbjct: 174 QKRFKKGVHFEDIQLIPQLLLECETIAQTQNFHYQYLERTDSITKTHTE--KGLDMLKAV 231 Query: 240 IKITRLL--------EKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMI 291 + ++ ++ +N+ +Y + + +++ + + ++R I Sbjct: 232 SDVEQVFRESPYSHKKEELKNFQIFEGVYSFLAYLAFVKEEEIFYSMSDQLVLFMKERQI 291 Query: 292 A 292 Sbjct: 292 K 292 >UniRef50_Q9AH91 Putative glycosyl transferase n=3 Tax=Streptococcus pneumoniae RepID=Q9AH91_STRPN Length = 354 Score = 242 bits (617), Expect = 2e-62, Method: Composition-based stats. Identities = 76/320 (23%), Positives = 142/320 (44%), Gaps = 23/320 (7%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 S ++SVI+P+YN R CM+SLI QT+ EII++NDGSTD+S + + +A+ + Sbjct: 6 SCPEISVIVPVYNVERYLRQCMDSLINQTYRDFEIILVNDGSTDSSGVLCEDWAKKDERI 65 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 ++H+ N G ARN G+E A GKY+ FVD+DD V M +TL E D++V Sbjct: 66 HVVHKKNEGLGFARNTGVEHAKGKYITFVDSDDYVSLDMLQTLYNAVQEYDVEVVYSAGY 125 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGL-----------SSRRWTHVVWMGVYR 172 + GE ++ + + G D L + + W +Y Sbjct: 126 YRSFSNGEIKKT--DVETKKPQLFEGGDVASKLLPDVISAPPEYPNDGKVGVSAWKVLYE 183 Query: 173 RDVIVKNNIKFIAGLHH--QDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGN 230 ++ + + F + +D ++ + + A A LY Y + S+S +++ Sbjct: 184 ANLFKEKGLLFHSEREFISEDAIFQIDCLKLATSALVIPDILYYYRENFGSLSMKYKEER 243 Query: 231 KNLNYQRHYIKITRLL----EKLNRNYADKIM---IYPEFHQQITYEALRVCHAVRKEPD 283 L+ + ++ R+ +++ ++I+ I Q+ Y++ R+ H + Sbjct: 244 FELDKILYNEQLKRVEGLPNQEILEERIERILIANIRLCIFQESLYKSSRI-HKRLQRIR 302 Query: 284 ILTRQRMIAEIFTSGMYKRL 303 + + + + RL Sbjct: 303 QICKDPISKSVLRHYPIHRL 322 >UniRef50_B0P2K9 Putative uncharacterized protein n=1 Tax=Clostridium sp. SS2/1 RepID=B0P2K9_9CLOT Length = 327 Score = 242 bits (617), Expect = 2e-62, Method: Composition-based stats. Identities = 65/265 (24%), Positives = 115/265 (43%), Gaps = 8/265 (3%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K+SVIIP+YN + C+ES++ QT+ LEI++INDG+TD+S +I + YA+ ++L+ Sbjct: 2 KISVIIPVYNTEKYLKECVESVLAQTYHNLEILLINDGATDSSPQICESYAKQDARIKLI 61 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEV-YPTMYETLMTMALEDDLDVAQCNADWC 125 H+ N G S RN GI+ +G YV F+D+DD P M E L + +D+ Sbjct: 62 HKENGGLSDTRNMGIKQCSGDYVLFLDSDDYWDDPKMVEKLADQMQQYPVDILNFRYKKY 121 Query: 126 FRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIA 185 +T + + + + L ++ + + + N + F Sbjct: 122 MEDTKQFVPCLKS--VEDIKQQEKDQILERLINEGLYISSACNKLISASFLKTNELYFEK 179 Query: 186 GLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRL 245 G+ +DI W + Y Y Y + S++ N + + K R Sbjct: 180 GITSEDIDWCARMLIQCDSIGYNNTEAYVYRQRSESITHTISYKNI-YDLSNNVKKCVRF 238 Query: 246 LEKLNRNYADKIMIYPEFHQQITYE 270 + + + Y +H + Y+ Sbjct: 239 GQVIPKG----TKFYELYHDFVAYQ 259 >UniRef50_C7G7U9 Putative capsular polysaccharide biosythesis protein CpsI n=1 Tax=Roseburia intestinalis L1-82 RepID=C7G7U9_9FIRM Length = 346 Score = 242 bits (617), Expect = 2e-62, Method: Composition-based stats. Identities = 67/265 (25%), Positives = 117/265 (44%), Gaps = 11/265 (4%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M T+K+SVI+P+YN + C++S+ QT+ LEI++I+DGS D S EI Sbjct: 19 MTKETDKISVIVPVYNVKNYLEDCVQSITHQTYPDLEILLIDDGSDDGSGEICDRLQMTD 78 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 +R+ H+ + G S ARN+G++ A G Y+AFVD+DD + P MY L+ + + +A C Sbjct: 79 ARIRVFHEKHKGVSGARNKGLKEAAGAYIAFVDSDDVLEPDMYTYLIRLLKTHEAQIAAC 138 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN 180 + + + P D + + G + L + VW +++R+V+ Sbjct: 139 SGWYKYDAGVGKKFESPADVV----CMNGREALGVLHERSYLRAYVWNKLFQREVLD--G 192 Query: 181 IKFIAGLHH-QDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHY 239 I+F L +D + ++ Y Y + + G YQ+ Sbjct: 193 IRFSTELSFAEDYEMFCRVLEQCDTIVCGTKAEYYYMQRKSGICNRGYDGI----YQKAM 248 Query: 240 IKITRLLEKLNRNYADKIMIYPEFH 264 + +R Y I+ + Sbjct: 249 MMFEDYCNVYSRRYPQYKKIFENHY 273 >UniRef50_D2BBI0 Putative glycosyl/glycerophosphate transferase involved in teichoic acid biosynthesis TagF/TagB/EpsJ/RodC-like protein n=1 Tax=Streptosporangium roseum DSM 43021 RepID=D2BBI0_STRRD Length = 1173 Score = 242 bits (617), Expect = 2e-62, Method: Composition-based stats. Identities = 70/245 (28%), Positives = 106/245 (43%), Gaps = 9/245 (3%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 + LSVI+P +N F C+ S+ QT LE+I ++DGS D S IAK +A R Sbjct: 2 SPLLSVIVPFFNVEPYFGACLASIARQTLRDLEVICVDDGSLDGSAVIAKEFAGKDGRFR 61 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++HQ NAG ARN G ATG+Y+ F+D+DD + Y L++ E D D A N Sbjct: 62 VIHQENAGLGAARNAGALHATGRYLCFMDSDDVLPADAYGLLVSSLSESDSDFACGNV-- 119 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 + + ETW+S R V R L R W V+R D ++ F Sbjct: 120 LYLNSTETWRSGLHSRPFRETVQRTHVNTRPELLLDR---TAWNKVFRHDFYREHGFSFP 176 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLH---NTSVSRLHRQGNKNLNYQRHYIK 241 GL ++D T A +Y + + S+++ + + + Sbjct: 177 GGL-YEDAPVTIPAHVLARSVDVLSDVIYYWRQREPGDPSITQRRTEPGNLEDRIASITR 235 Query: 242 ITRLL 246 + L Sbjct: 236 VRDFL 240 >UniRef50_A7I1X7 Ss-1,4-galactosyltransferase n=1 Tax=Campylobacter hominis ATCC BAA-381 RepID=A7I1X7_CAMHC Length = 325 Score = 241 bits (616), Expect = 2e-62, Method: Composition-based stats. Identities = 66/330 (20%), Positives = 144/330 (43%), Gaps = 20/330 (6%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 +SVI+P++N + C++SL+ Q++ EI++I+DGS D S+EI + +A+ P +R Sbjct: 2 KPLISVIVPVFNVAEFLDDCIKSLLNQSYENFEILLIDDGSGDKSLEICQNFAKKEPKIR 61 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 + + N G ARN G++ A G+++ FVD+DD + + + ++++ +A C Sbjct: 62 VFSKQNGGQGSARNLGLDNAKGEFICFVDSDDILASGFLDKTFEILKQNNVKMAMCAYAP 121 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRM-GLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 + S +L + + L++ + VW +Y + + K ++F Sbjct: 122 FVDAS---EISNIESVKSEVKILNDKEIFKEIFLANPAFGTAVWRNLYHKSIFEK--LRF 176 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKIT 243 ++D+ E NA +A + ++LY Y L S + + + + I + Sbjct: 177 PQNQIYEDVAIAFEIFNNAEKAAFCNETLYFYRLRAGSTTNSFSK-----RHLQAEISVK 231 Query: 244 RLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKE------PDILTRQRMIAEIFTS 297 EK+ + Y D + I + + K+ P I + E F + Sbjct: 232 NFCEKVVKKYPDLKKMAN---FAICDSMTSISIMILKDGAKEFYPKITEFLPFLRENFMN 288 Query: 298 GMYKRLITNVRSVKVGYQALLWSFRLWQWR 327 + ++ + ++ +++ ++ + Sbjct: 289 ILKQKNYSKIKKIEMILLSISPKMLNLALK 318 >UniRef50_C3ADH9 Putative uncharacterized protein n=1 Tax=Bacillus mycoides DSM 2048 RepID=C3ADH9_BACMY Length = 338 Score = 241 bits (616), Expect = 2e-62, Method: Composition-based stats. Identities = 75/341 (21%), Positives = 142/341 (41%), Gaps = 15/341 (4%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M +SVI+P+YN C+ES++ Q + +E++++NDGSTD EI + Y E Sbjct: 1 MKGLHELISVIVPIYNVEKWLARCIESVLKQPFVNIELVLVNDGSTDKCGEICEEYLEKD 60 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 ++++++ N G S ARN GIE ATGKY+ F+D DDEV + L +A + D C Sbjct: 61 NRIKVIYKENGGVSSARNAGIEAATGKYIVFIDPDDEVSENYFTRLYGIAEDKKCDAVIC 120 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRM--GLSSRRWTHVVWMGVYRRDVIVK 178 + V+ G D++ + S+ W +Y +I + Sbjct: 121 GYKTVPNNNS------IIPNFKLDTVMNGRDFVLSSPNVHSKNDLCFAWRYIYNLSIIKE 174 Query: 179 NNIKFIAGL-HHQDIVWTTEFMFNALRARYTEQSLYKYYLHN-TSVSRLHRQGNKNLNYQ 236 NI+F + +D+++ EF+ ++ R + LY Y ++N S+ R+ + N + Sbjct: 175 ENIRFNEQVFIGEDVIFNLEFLLHSQRVCAVSEVLYFYTINNPNSLMRVPYKPNLENSLV 234 Query: 237 RHYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFT 296 Y +L ++ + + Y + + + Sbjct: 235 LQYQMRRQLSQEFGLLHDKQFK-----KDMANYYINHIYRLLINNLKSAPNVDVNNAFRR 289 Query: 297 SGMYKRLITNVRSVKVGYQALLWSFRLWQWRDKTRSHHRIT 337 Y+ +V+ + Y+ + ++ K + + I Sbjct: 290 IVNYEMFSDSVKGIGFSYRCNNFKEYIYYLAIKFKVYPLIV 330 >UniRef50_C6Z980 Glycosyltransferase family 2 n=1 Tax=Bacteroides sp. 4_3_47FAA RepID=C6Z980_9BACE Length = 351 Score = 241 bits (616), Expect = 3e-62, Method: Composition-based stats. Identities = 74/336 (22%), Positives = 151/336 (44%), Gaps = 29/336 (8%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 + K+S+++P+YN C++SL+ QT +EII+++D S DN +I Y Y +V Sbjct: 2 NKPKVSIVVPIYNVEKYLDRCIQSLLAQTLKEIEIILVDDESPDNCPQICDRYVAKYTYV 61 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 +++H+ N G ARN G+EVATG+Y+AFVD+DD + M E L E LDV Sbjct: 62 KVVHKKNDGLGQARNSGMEVATGEYIAFVDSDDFIDADMMERLYDECKEHSLDVI----- 116 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMG-----------LSSRRWTHVVWMGVYR 172 + E L+ + TG + + SS ++ G+Y Sbjct: 117 --YSEFNVDEYPGFRITLKPERLYTGREEIEQLRLDIVGAEPNHRSSVKFQCSACKGLYS 174 Query: 173 RDVIVKNNIKFIAGLHH--QDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGN 230 ++ +++++F + + +D+++ + +++A++ + +Y Y L+ S+S +R Sbjct: 175 MKLLSEHHLQFHSERQYISEDMLFNLDVLYHAMQVKTVPWQMYHYCLNGASLSHTYRPDR 234 Query: 231 KNLNYQRHYIKITRLLEKLNRNYADKIMIYPEF-HQQITYEALRVCHAVRK-EPDILTRQ 288 + + +L+K + + + I Y + +R+ + + Sbjct: 235 W-----PKLLYMLEILDKKGQQFKHQEEFRLRLARTAIFYTRSGIGQEIRRTDISFKEKL 289 Query: 289 RMIAEIFTSGMYKRLITN--VRSVKVGYQALLWSFR 322 R + EI LI + ++ + ++ + + Sbjct: 290 RRVTEIMKDTTISSLIKDYPFYNLPLTWKIYGYLLK 325 >UniRef50_A1A380 Putative uncharacterized protein n=1 Tax=Bifidobacterium adolescentis ATCC 15703 RepID=A1A380_BIFAA Length = 323 Score = 241 bits (616), Expect = 3e-62, Method: Composition-based stats. Identities = 62/263 (23%), Positives = 109/263 (41%), Gaps = 13/263 (4%) Query: 2 MN--STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAEN 59 MN + +S+I+P+YN C+ SLI QT +EI++I+DGS DN+ I + Sbjct: 1 MNAVTEPLVSIIVPIYNVEKYLDQCVASLIQQTHHNIEILLIDDGSPDNAPAICDAWQHK 60 Query: 60 YPHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQ 119 V +LH+ N+G S +RN G+ A G+YV FVD DD + P E ++ + + + D+ Sbjct: 61 DERVHVLHKTNSGVSASRNAGLHAAKGEYVCFVDGDDWIEPDFVEYMLELQAKRNADIVA 120 Query: 120 CNADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKN 179 + CF T + + + + W +YRR + N Sbjct: 121 --SLCCFTST-----DFHQRTQDEVSTINPQEAMELFFYPTIELG-AWNKLYRRSFLTAN 172 Query: 180 NIKFIAG-LHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRH 238 N++FI + + + T A ++ +Y Y N + + + Sbjct: 173 NLEFITDFCAGEGLQFITLAASKANCVTVGQKKVYHYRTDN--ATSATTKPDVERQGIGA 230 Query: 239 YIKITRLLEKLNRNYADKIMIYP 261 + + E L+ + Y Sbjct: 231 ITTMQYICEHLDLSNPKVRKAYN 253 >UniRef50_B5CQR8 Putative uncharacterized protein n=1 Tax=Ruminococcus lactaris ATCC 29176 RepID=B5CQR8_9FIRM Length = 304 Score = 241 bits (616), Expect = 3e-62, Method: Composition-based stats. Identities = 65/264 (24%), Positives = 127/264 (48%), Gaps = 26/264 (9%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 MM +S+I+P+YN+ C++S++ QT+ LEI+++NDGS D S++I + YA Sbjct: 1 MM-----ISIIVPVYNSEKYIDRCLDSILNQTYKDLEIVLVNDGSNDQSLKILENYALRD 55 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 +++++Q N G + ARN G++ ATG Y+ +VD+DD + M E ++ ++ + D+ C Sbjct: 56 TRIKVVNQENKGVAAARNTGLDNATGDYILYVDSDDWIENNMVERMVELSA--NADIVFC 113 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN 180 + + +S+ + + + L + + ++W + +R + + Sbjct: 114 G-----NDNAVSPESVKKIIGITKEIWNKDKIIYEFLRHKIMSGMLWNKLIKRSL--TDG 166 Query: 181 IKF-IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHY 239 +F + +D + + + N T + LY + S+S L+Y Sbjct: 167 CRFNPKTGYGEDAEFLWQVLQNTNSMIVTNEILYHHVPDENSISH--------LSYSEKK 218 Query: 240 IKITRLLEKLNRNYADKIMIYPEF 263 + EK+N AD IYPE+ Sbjct: 219 YSAISMWEKIN---ADTSAIYPEY 239 >UniRef50_B3C8H4 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=B3C8H4_9BACE Length = 359 Score = 241 bits (615), Expect = 3e-62, Method: Composition-based stats. Identities = 74/298 (24%), Positives = 133/298 (44%), Gaps = 12/298 (4%) Query: 1 MMNS------TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAK 54 MMN+ + K+SVIIP+YN + ++S+ QT LEII+INDGSTDNS I + Sbjct: 45 MMNNCSDINPSIKVSVIIPVYNTATYLHSALDSICNQTLKDLEIILINDGSTDNSQYIIE 104 Query: 55 YYAENYPHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDD 114 YA ++ L Q N VARN G+++ATG+Y+ F+D+DD + M + Sbjct: 105 KYARKDNRIKYLIQPNQRQGVARNNGLQLATGQYIYFMDSDDILDIDALRQCYEMCERKE 164 Query: 115 LDVAQCNADWCFR--ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYR 172 LD +A + T+ ++ + G + + L + V W+ + Sbjct: 165 LDFVTFDAKTIHETSTSSGTFNYCRKGKIDEQKIWDGIELMHYELEYEIFYVVPWLCFTK 224 Query: 173 RDVIVKNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKN 232 + + F +G+ H+D ++ + M NA R Y Q +K + N+S + + Sbjct: 225 HSFLKQYFGGFPSGIIHEDTIFVVQIMLNAHRVGYIPQPFFKRRVRNSSTM----AASFS 280 Query: 233 LNYQRHYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRM 290 ++ Y + + + + + I + ++ + + H ++ I T R Sbjct: 281 MHNIEGYTLVCTQIRSWTQQHVEWSAIIDLYLRKTLNSVIWLGHKMKTLEKIETYCRF 338 >UniRef50_A3KLK8 Cps5J protein n=2 Tax=Streptococcus agalactiae RepID=A3KLK8_STRAG Length = 321 Score = 241 bits (615), Expect = 3e-62, Method: Composition-based stats. Identities = 71/285 (24%), Positives = 130/285 (45%), Gaps = 18/285 (6%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 K+S+IIP+YN C++S+I QT+ LEII+I+DGSTD S + A+ + Sbjct: 1 MEKISIIIPVYNVKKYLNDCIQSVINQTYNNLEIILIDDGSTDGSGDYCDEIAKKDSRIF 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 + H+ N G S ARN GI+++TGKY+ F+D+DD + ETL +E D+A N Sbjct: 61 VYHKTNGGLSEARNVGIKISTGKYITFIDSDDYIENLYIETLYNSLIEYKADIAIVNKYH 120 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWT-HVVWMGVYRRDVIVKNNIKF 183 + + R + + ++ T + + +YR+++ ++I+F Sbjct: 121 LKDNEKIVHLANSKELFRVFDTKMILESMYSRENNFISTLTMAQLKLYRKELF--SDIEF 178 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKIT 243 G ++D T + A R + + LY Y + S++R YQ + + Sbjct: 179 PIGKKYEDSFTTYKTYIEANRIVFVNRPLYAYRFRSGSITRS--------GYQLSNLDVL 230 Query: 244 RLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQ 288 + Y +++ + E + C A+ + ++L + Sbjct: 231 EM-------YEERLEYFRERGYDLLSTKYHYCIAIANQKELLKKH 268 >UniRef50_C6I619 Glycosyl transferase n=4 Tax=Bacteroides RepID=C6I619_9BACE Length = 329 Score = 240 bits (614), Expect = 5e-62, Method: Composition-based stats. Identities = 68/327 (20%), Positives = 135/327 (41%), Gaps = 11/327 (3%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M N+ +S+I+P+Y ++SL+ QT+ LEII+++DGS DN I YA Sbjct: 1 MNNTNPLVSIIVPVYKVERYIHRSIDSLLRQTYNNLEIILVDDGSPDNCPFICDKYAMQD 60 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 +R++H++N G S ARN + V TG+YV FVD+DD + M E + ++ Sbjct: 61 NRIRVIHKSNGGLSDARNAALNVMTGQYVTFVDSDDYIADNMIEKFIETVKLYQCNMVVA 120 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN 180 + + Q R ++ + TG D R L + V ++ + + Sbjct: 121 GMNIIDKNN----QIYDYRRTLTSTLSTGIDITRKLLKDVFPFNFVCAKIFESSLFEE-- 174 Query: 181 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKY-YLHNTSVSRLHRQGNKNLNYQRHY 239 I+F G H++D T + R LY Y +++ +Y Sbjct: 175 IRFPVGRHYEDTATTYKLTHKCERIYCIADCLYFYELEREGNITSELNTSKAIKSYIDGC 234 Query: 240 IKITR--LLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTS 297 + + +++++D + + + Q A++ ++ + + ++ I + Sbjct: 235 LNSREQIFFCQKHKDFSDLLPVLTKRLQLWALLAIQSAISLGYKEYMFYYHQIRIYIRSL 294 Query: 298 GMYKRL--ITNVRSVKVGYQALLWSFR 322 +R+ + + + Y +L Sbjct: 295 SYTQRMTHLQIALTFPIIYYSLYPLLS 321 >UniRef50_B0MQU6 Putative uncharacterized protein n=1 Tax=Eubacterium siraeum DSM 15702 RepID=B0MQU6_9FIRM Length = 325 Score = 240 bits (614), Expect = 5e-62, Method: Composition-based stats. Identities = 72/243 (29%), Positives = 121/243 (49%), Gaps = 5/243 (2%) Query: 18 GDDFRTCMESLITQTWT-ALEIIIINDGSTDNSVEIAKYYAENYPHVRLLHQANAGASVA 76 C++SL+ Q + EIII+NDG+ DNS+ IA +A Y +R++ Q NAG S A Sbjct: 23 EKYIERCLDSLVRQNFGYKYEIIIVNDGTKDNSMTIADRFASKYDFIRIITQQNAGLSAA 82 Query: 77 RNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFRETGETWQSI 136 RN G+ A G+Y+AFVD+DD V P + T+A++++ D+ QC F ++ + + Sbjct: 83 RNTGLANARGEYIAFVDSDDFVSPFYISEMYTLAVKNNADIVQCRYCNYFEKSDRSINVL 142 Query: 137 PTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGLHHQDIVWTT 196 + R GV+ G +S + W +YRR + NNI + G +D++ T Sbjct: 143 LSHR---NGVIGGKKAFEKLISDVSVRNYAWNKMYRRSLFSDNNISYPVGKCFEDVITTP 199 Query: 197 EFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRLLEKLNRNYADK 256 A +T + LY Y S++ L + + +Y Y+ ++ L + Y+ + Sbjct: 200 RLFACAENVAFTRKVLYSYAHRPDSITGLAGRKMVD-SYLEAYLILSEYLAENGIYYSCR 258 Query: 257 IMI 259 I Sbjct: 259 TRI 261 >UniRef50_Q93TI5 Putative glycosyltransferase CpsIVJ n=1 Tax=Streptococcus agalactiae RepID=Q93TI5_STRAG Length = 315 Score = 240 bits (614), Expect = 5e-62, Method: Composition-based stats. Identities = 68/292 (23%), Positives = 130/292 (44%), Gaps = 16/292 (5%) Query: 15 YNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLHQANAGAS 74 YN C++S++ QT+ +EI++++DGSTDNS I Y+ +R+ H+ N G S Sbjct: 13 YNVEKFLEECIDSVLNQTYKNIEILLVDDGSTDNSGIICDNYSLKDKRIRVFHKNNGGLS 72 Query: 75 VARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFRETGETWQ 134 ARN G+ A G+Y++F+D+DD + + +D+ CN + + Sbjct: 73 DARNFGVVNAEGRYISFIDSDDYIDKNYIRKMYYCLSNHKVDMVICNYLSVYNNDLKPIV 132 Query: 135 SIPTDRLRSTGVLTGPDWLRMGL-SSRRWTHVVWMGVYRRDVIVKNNIKFIAGLHHQDIV 193 S RL+ + + + + W + R D+ K + F G H+D Sbjct: 133 S----RLQDCIIYSRVELFSNLYGKYKDPFTTAWGSLIRTDIAKK--VTFPIGKLHEDEF 186 Query: 194 WTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRLLEKLNRNY 253 T ++ + + + + LY Y S+ +NL+ Y + ++LEK N + Sbjct: 187 TTYKYYLYSDKIAFVPEPLYFYRRREASIMSSS-YSKRNLDSLEAYEERIKILEKNNISI 245 Query: 254 ADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGM-YKRLI 304 ++ + IY L + +RK P++ +Q ++ + + YK ++ Sbjct: 246 SETVYIYLSL-------ILYHIYRIRKYPEVCNKQELLDKFNNYYLKYKNIL 290 >UniRef50_A3TMK5 Possible glycosyl hydrolase n=1 Tax=Janibacter sp. HTCC2649 RepID=A3TMK5_9MICO Length = 552 Score = 240 bits (614), Expect = 5e-62, Method: Composition-based stats. Identities = 63/236 (26%), Positives = 107/236 (45%), Gaps = 12/236 (5%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M +++SV+IP+YN D C++S+ QT+ +L++I+I+DGSTD S + A Sbjct: 1 MSPELHRVSVVIPVYNVVDYLEQCLDSVAAQTYRSLDVILIDDGSTDGSGPLCDTLAARD 60 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALED-DLDVAQ 119 P R++HQ N G S ARN GI ATG +V F+D+DD PT +L+ + A Sbjct: 61 PRFRVIHQDNHGLSHARNVGIGAATGTFVTFLDSDDWWEPTFVSSLVKAMDDHPGAGTAM 120 Query: 120 CNADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGL-SSRRWTHVVWMGVYRRDVIVK 178 C+ T + R+ T VL+ + + + ++RR ++ Sbjct: 121 CSLTRVPGATYDP-------RISQTRVLSPAEAIAQFAGPHHSIFVIACAKLFRRTLLDT 173 Query: 179 NNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLN 234 + F G +D T + A + LY Y S++ ++ ++ Sbjct: 174 D--TFPIGRVAEDAFATHRLLMKAP-VVQVPEPLYLYRQRAHSITSRPMTVSRLID 226 >UniRef50_C3X3W6 Putative uncharacterized protein n=1 Tax=Oxalobacter formigenes HOxBLS RepID=C3X3W6_OXAFO Length = 337 Score = 240 bits (613), Expect = 6e-62, Method: Composition-based stats. Identities = 74/334 (22%), Positives = 134/334 (40%), Gaps = 20/334 (5%) Query: 6 NKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRL 65 +SVI+P+Y A C++S++ QT+ LEI++++DGS D I YA +R+ Sbjct: 4 PLVSVIVPVYRAQAYLARCLDSIVGQTYPNLEILLVDDGSPDACGAICDDYAARDARIRV 63 Query: 66 LHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWC 125 H N G + ARN ++ TG+YV FVDADD + P + E + A E D+ + Sbjct: 64 FHLENGGVARARNVALDAMTGEYVIFVDADDYIAPELVEKALQTARERSADIVVFDFWNI 123 Query: 126 FRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIA 185 ++ + + T V G + +R + +W +R + I+ + Sbjct: 124 EKDGLQ----LMTQCYHPGDV--GIEKMRELILKDELLSYMWNKFFRASLFE--GIRIAS 175 Query: 186 GLHHQDIVWTTEFMFNALRARYTEQSLYKY-YLHNTSVSRLHRQGNK-NLNYQRHYIKIT 243 D++ A A Y Y Y L++ S++ + NK N + + Sbjct: 176 VRTFSDLLVMPRIFMQAKSAVYVRTPYYYYNLLNDGSLTSIKSVSNKQNAKNRYGMFRAW 235 Query: 244 RLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSG----- 298 E++ R + ++ Y E + + RV R +R+ + Sbjct: 236 EERERVARECCESVVEYSERRAVNSAISTRVIDLYRPALSEEETERLGRFLEDKKRNRVS 295 Query: 299 ----MYKRLITNVRSVKVGYQAL-LWSFRLWQWR 327 Y+ L + + S +L+QW+ Sbjct: 296 RIDAKYRTLWWLSDHCPLLCRLYGYLSVKLFQWK 329 >UniRef50_C4Z667 Putative uncharacterized protein n=1 Tax=Eubacterium eligens ATCC 27750 RepID=C4Z667_EUBE2 Length = 329 Score = 240 bits (613), Expect = 6e-62, Method: Composition-based stats. Identities = 68/270 (25%), Positives = 122/270 (45%), Gaps = 8/270 (2%) Query: 3 NSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPH 62 LS+I+P+YN D C S+I Q + +EI++I+DGSTDNS EI Y+ E Y + Sbjct: 8 EKLPLLSIIVPVYNVQDYLEECCNSIINQKYLNIEILLIDDGSTDNSGEICDYFGEKYSN 67 Query: 63 VRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNA 122 +R+ H+ N G + RN GI A G+Y+ FVD+DD +Y MY +M + LE + + CN Sbjct: 68 IRVFHRENEGLAATRNFGIAQAQGQYLGFVDSDDWIYENMYSDMMKIVLEKNAQIVCCNF 127 Query: 123 DWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIK 182 D E + I + + + + + + ++++++ N++ Sbjct: 128 DL------EIDKRIIKRGKNGIDIYNKNEAIVNLMYPQFYQFYAPNKIFKKELFQ--NVE 179 Query: 183 FIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKI 242 F G H +DI+ + + + + QS Y Y S++ + Sbjct: 180 FPIGKHFEDIITIYKVFLKSEKVYFIWQSGYVYRQRKDSITNAIFNSKSAEVLDAIGFVM 239 Query: 243 TRLLEKLNRNYADKIMIYPEFHQQITYEAL 272 + N D ++ Y ++ +A+ Sbjct: 240 NDSKRRFPNNINDIMVGYIRYYISFLDKAI 269 >UniRef50_D2BBH8 Glycosyltransferase involved in cell wall biogenesis-like protein n=1 Tax=Streptosporangium roseum DSM 43021 RepID=D2BBH8_STRRD Length = 616 Score = 240 bits (613), Expect = 6e-62, Method: Composition-based stats. Identities = 74/266 (27%), Positives = 113/266 (42%), Gaps = 9/266 (3%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 +LSV++P YN F C+ S+ QT LE+I ++DGS D S IAK YA P R Sbjct: 1 MPRLSVVVPYYNVERYFDACLASIQGQTLGDLEVICVDDGSMDASAVIAKDYASRDPRFR 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++ Q N G ARN G+ A+G Y+AF D+DD V P YE L+ D+A N Sbjct: 61 VVVQENQGLGPARNTGVTHASGHYLAFADSDDIVPPRAYELLVGSLERSGSDIASGNVQR 120 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 E G + R T V T L R VW V+RR + ++F Sbjct: 121 LTSE-GLVQSWAHRNAFRKTQVGTHITRNVHLLFDR----SVWNKVFRRSFWDELGMEFP 175 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 A ++D+ T A + +Y + L S++ + + + R Sbjct: 176 A-RLYEDMPVTVPAHVRARAVDVLSEVVYIWRLREGSITERRFRAENVTDLVISVAETAR 234 Query: 245 LLEKL---NRNYADKIMIYPEFHQQI 267 LE+ R ++ ++ + + Sbjct: 235 FLEEHAPGLRRVYERDTLHNDLRVAV 260 >UniRef50_A8U9H2 Glycosyl transferase, family 2/glycosyl transferase family 8 n=2 Tax=Carnobacterium sp. AT7 RepID=A8U9H2_9LACT Length = 352 Score = 240 bits (612), Expect = 7e-62, Method: Composition-based stats. Identities = 62/232 (26%), Positives = 107/232 (46%), Gaps = 2/232 (0%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 K+S+I+P+YN + ++ ++QT+ EII++NDGSTD S EI Y P + Sbjct: 1 MKKISIIMPIYNVAEQLEKAIQCALSQTYKNSEIILVNDGSTDGSGEICDRYQLKDPRIS 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++HQ NAG+ ARN G++ ATG+Y+ F D DD + E + A+E + DV C + Sbjct: 61 VIHQKNAGSGFARNAGLDRATGEYIYFADPDDYFEYNLIEETVAKAIESNADVVVCGY-Y 119 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVV-WMGVYRRDVIVKNNIKF 183 + E + G+LT + + W +Y+ D + ++N +F Sbjct: 120 NEKVDKEGNITTLEKNSELDGLLTKEAFRDQFRKHYSLSPYALWNKLYKHDFLKQHNCRF 179 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNY 235 QD ++ + N + +++ Y Y S +R+ Y Sbjct: 180 TNQKVGQDALFNQQVGKNVQTIYFHQKAYYHYVFREGSAVNRYRKERFFYEY 231 >UniRef50_Q5M3K9 Glycosyl transferase n=4 Tax=Streptococcus RepID=Q5M3K9_STRT2 Length = 697 Score = 240 bits (612), Expect = 7e-62, Method: Composition-based stats. Identities = 65/216 (30%), Positives = 101/216 (46%), Gaps = 7/216 (3%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 T K+SVIIP+YN + +TC++S+++QT+ LEII+IN GSTDNS I + YA + Sbjct: 2 TEKISVIIPVYNVENQLQTCLDSVLSQTYQNLEIILINYGSTDNSDAICRSYAARDSRIL 61 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 + N G S AR+ GI A G YV +VDA+D V PT E L D+A N Sbjct: 62 YFKKDNGGLSDARHIGIRQAKGTYVTYVDAEDWVEPTYLEELYKALQAHKADIAVANYS- 120 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTH----VVWMGVYRRDVIVKNN 180 ++E+ + + D V + + + +Y+R + N+ Sbjct: 121 VYKESEDLFYFYIKDEDYYEQVYSPAQIIDGLFEETNNINIALISATGKLYKRSLF--ND 178 Query: 181 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYY 216 + F +D + + + R Y + LY Y Sbjct: 179 LLFPKEHAGEDGFFNLKAYLMSERTVYLNKGLYVYR 214 >UniRef50_Q4JZ78 Putative glycosyl transferase n=1 Tax=Streptococcus pneumoniae RepID=Q4JZ78_STRPN Length = 321 Score = 240 bits (612), Expect = 7e-62, Method: Composition-based stats. Identities = 69/300 (23%), Positives = 134/300 (44%), Gaps = 10/300 (3%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K+S+++P+YN D + C++SLI Q++ +EII+++DGSTD+S I YA+ +R+L Sbjct: 3 KVSIVVPVYNVEDYLKYCIDSLINQSYKDIEIILVDDGSTDDSGRICDKYAQEDDRIRVL 62 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 H N G S ARN G++ A+ +++ F+D+DD E L+ + + +D+ + Sbjct: 63 HIENGGLSNARNTGVKFASAEWIIFIDSDDYYDRRTVEYLVQLQKKYAVDLVATSVIEVR 122 Query: 127 RETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAG 186 E + T+ + L L+ +Y++++++K F G Sbjct: 123 DFQSEDFLGSLTNI--DSLKLDRYTALKEMFYGNIVGTHPGGKLYKKEILMK--FPFPEG 178 Query: 187 LHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRLL 246 + ++D+ + E + +LYKYY S+ + Y I Sbjct: 179 MLYEDLAVSFEHIGACNEIAVGCINLYKYYRRPGSIVNSSYSDK----FLDFYKAIEMNR 234 Query: 247 EKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLITN 306 E + R+Y + + + ++ L V HA+ ++ E + + +I N Sbjct: 235 EFVQRDYPNDKEMNKALTVRYVFKGLHVVHAMLSSEMYSDVNKIRKE--YTRYFNDIIPN 292 >UniRef50_Q047X3 Glycosyltransferase related enzyme n=1 Tax=Lactobacillus delbrueckii subsp. bulgaricus ATCC BAA-365 RepID=Q047X3_LACDB Length = 332 Score = 240 bits (612), Expect = 8e-62, Method: Composition-based stats. Identities = 67/262 (25%), Positives = 122/262 (46%), Gaps = 9/262 (3%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M +SVI+P+Y D C++S++ QT+ LEII+I+DGS DN ++ +A+ Sbjct: 1 MNTELPLVSVIVPVYKVEDYLNQCVQSILDQTYHNLEIILIDDGSPDNCGQLIDDWAKKD 60 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 + LHQ N G S ARN G++ G+++AFVD+DD V E ++ A+ D ++ Sbjct: 61 KRIVALHQKNGGLSAARNTGLDNCHGEWLAFVDSDDYVAKDYIEKMLKAAMRDHTNLVIS 120 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRR---WTHVVWMGVYRRDVIV 177 + +GE + V ++ ++ S V W +Y R+V Sbjct: 121 HYYEVDEISGEQINI----KHNPAKVYNIDEFWKLYYSPEGNFVSLVVAWDKLYAREVFE 176 Query: 178 KNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQR 237 + ++ GL H+D + + NA R +LY Y ++ + + + ++ Sbjct: 177 ER--RYAVGLVHEDEEIIFDVVKNAHRISVISDNLYYYRVNRSGSIMANINASSEIDQAI 234 Query: 238 HYIKITRLLEKLNRNYADKIMI 259 IK TR + ++ + D ++ Sbjct: 235 LEIKKTRAADLMHNEFWDGALL 256 >UniRef50_Q8A821 Glycoside transferase family 2 n=1 Tax=Bacteroides thetaiotaomicron RepID=Q8A821_BACTN Length = 345 Score = 240 bits (612), Expect = 8e-62, Method: Composition-based stats. Identities = 72/278 (25%), Positives = 119/278 (42%), Gaps = 18/278 (6%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M +SVI+P+YN R C++SLI+QT +EII +ND S DNS+ I K Y Sbjct: 1 MDRELPLVSVIVPVYNVEKYLRQCLDSLISQTLQNIEIICVNDASPDNSLAILKEYEAKD 60 Query: 61 PHVRLLHQ-ANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQ 119 + ++ N ARNRGIE A Y+AFVD+DD V +YE L AL +D D+ Sbjct: 61 KRIVVVDLYENMRQGGARNRGIEYARANYIAFVDSDDWVSVDIYEKLFACALMNDADIVN 120 Query: 120 CN-ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVK 178 C+ ++ + + + + +++ +YR+ + Sbjct: 121 CDYYEYRGEDNIKKYIKYKELTFNMLKEIANKEFILRL-------PSPCHNLYRKAIFSD 173 Query: 179 NNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRH 238 N I+F + +D + A + + LY Y L+N S SR N + Sbjct: 174 NEIRFPEQIFWEDAAIMSLIFLLADKIVKVNEPLYYYRLNNDSTSRR----KNNYRFFER 229 Query: 239 YIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCH 276 I + L Y ++ ++I Y+ + + + Sbjct: 230 LIGANLFFQNLKAF-----GFYKKYKEEIDYQYINLTY 262 >UniRef50_D1PNS4 Capsular polysaccharide biosythesis protein CpsI n=1 Tax=Subdoligranulum variabile DSM 15176 RepID=D1PNS4_9FIRM Length = 342 Score = 239 bits (611), Expect = 9e-62, Method: Composition-based stats. Identities = 61/241 (25%), Positives = 107/241 (44%), Gaps = 7/241 (2%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M + +S++ +YN + S++ QT+T E+I+++DGS+D+S + A Sbjct: 1 MGKSPTISIVSTVYNTQPFLEQAVRSILAQTFTDFELILVDDGSSDDSGRLCDALAAEDA 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 +R+ HQ N G + ARN+G++ A G+Y+ FVD+DD + PTMYE L + +A C+ Sbjct: 61 RIRVFHQPNGGPASARNKGLDNARGRYIGFVDSDDLIEPTMYEVLYNAIQAPGVQLAACS 120 Query: 122 ADWCFRETGETWQSIPTDR--LRSTGVLTGPDWLRMGLSSRRWTHVV-WMGVYRRDVIVK 178 D ET + IP R +R G+ + L + + + W ++ + Sbjct: 121 GDCI----DETGRPIPGRRVAIREHGIRPAEELLLETFQTGSYYGPLSWNKLFDLRLFRD 176 Query: 179 NNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRH 238 I + + D +F R LY Y ++ + L+ R Sbjct: 177 KGIHYDETMLFGDDASVLHRVFEGERCNCLSDILYHYRTRAGQITSTATFSPRKLDDLRM 236 Query: 239 Y 239 Y Sbjct: 237 Y 237 >UniRef50_Q8XN37 Capsular polysaccharide biosynthsis protein n=1 Tax=Clostridium perfringens RepID=Q8XN37_CLOPE Length = 326 Score = 239 bits (611), Expect = 9e-62, Method: Composition-based stats. Identities = 75/329 (22%), Positives = 155/329 (47%), Gaps = 12/329 (3%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K+S+IIP+YN + C+++ + QT +EII I+DGSTD S +I YA+ Y ++++ Sbjct: 3 KVSIIIPIYNVENYLDKCLDTAVNQTLGCIEIIGIDDGSTDKSRDILNKYAKKYSFIKII 62 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 ++ N G S ARN GI A G+Y+ F+D+DD + + +A ++ LD+ +A+ Sbjct: 63 NKENGGLSSARNTGIINAKGEYIYFLDSDDYIEYDSMKICYEIATKNKLDIVSFDAESFV 122 Query: 127 RETGETWQSIPTDRLR--STGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 G T + DR + G ++ + ++ +T + YRR+ ++ NN+ F Sbjct: 123 DGEGFTKKLKKYDRKDILDGNIKMGCEYFCELIKNKCYTASACLNFYRREFLIDNNLFFY 182 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 G+ H+D ++T + A + RY LY L + S++ +++ + Y ++ Sbjct: 183 DGILHEDELYTVKAYTLANKVRYIPYKLYFRRLRSNSITTQ------SISEKHLYGRLIA 236 Query: 245 LLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKE----PDILTRQRMIAEIFTSGMY 300 E K + + + + + + + K+ I ++ +I + + + Sbjct: 237 ARESYRHYLGYKDVFDNDTNTCLLNHISNMYYFMLKDIYENKRIYNKRYIIEDKIINDIK 296 Query: 301 KRLITNVRSVKVGYQALLWSFRLWQWRDK 329 + + + Y+ L + + ++ K Sbjct: 297 SKKDIKSFKLDLQYRFPLIIYTIKKYLKK 325 >UniRef50_D0BME4 Glycosyl transferase n=10 Tax=Lactobacillales RepID=D0BME4_9LACT Length = 321 Score = 239 bits (611), Expect = 9e-62, Method: Composition-based stats. Identities = 71/306 (23%), Positives = 133/306 (43%), Gaps = 7/306 (2%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 +SVIIP+YN + +ESL QT+ E+I++NDGSTDNS + YA+ Y +V Sbjct: 2 LISVIIPVYNVENYLHYAIESLFRQTYKNFEVILVNDGSTDNSRSLCDEYAQRYKNVYAY 61 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 H+ N G S ARN G+E A ++ F+D DD + +E L+ + + ++ + Sbjct: 62 HKTNGGLSDARNFGVEKANSDWIFFLDPDDYIEAFTFELLVNLQKQYNVPLVSTKVQSTS 121 Query: 127 RETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAG 186 + I L T + L + L ++ T +Y++ ++ + + F G Sbjct: 122 VYNEYSDSEISNLDLTQTKKVNKETALALMLENKIATVSSCAKLYKKKILQE--VPFPVG 179 Query: 187 LHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRLL 246 ++D E + A + Y YY S+ K N+ + Sbjct: 180 KIYEDFFVVAEHLNLAEEIVISPIVTYHYYSRTGSIVN-SAFSEKQYNF---FDAAEHTE 235 Query: 247 EKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEP-DILTRQRMIAEIFTSGMYKRLIT 305 + +N Y ++ + + A + + K+ ++L +++I +F S ++ R +T Sbjct: 236 KVINEKYNNRRELINSVEAKTVIGAFHIINNAEKDKANLLKIRKLIKNVFKSVIFNRQVT 295 Query: 306 NVRSVK 311 +K Sbjct: 296 LKHKLK 301 >UniRef50_B0NLZ9 Putative uncharacterized protein n=1 Tax=Bacteroides stercoris ATCC 43183 RepID=B0NLZ9_BACSE Length = 305 Score = 239 bits (611), Expect = 1e-61, Method: Composition-based stats. Identities = 72/252 (28%), Positives = 118/252 (46%), Gaps = 5/252 (1%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 ++SVIIP+YN R ++S+ QT LEII+I+DGSTD S I + YA+ +R Sbjct: 1 MPEVSVIIPIYNTAAYLRKALDSICGQTLKELEIILIDDGSTDGSRGIIEEYAKRDARIR 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 Q N G SVARNRG+ ATGKY+ F+D+DD + + + +LD +A Sbjct: 61 WYAQPNQGLSVARNRGLLHATGKYIYFMDSDDILDTQALRRCHDLCEKRELDFIFFDAKP 120 Query: 125 CFRE-TGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 + + V +G + L LS+R + V++ + R + K F Sbjct: 121 LTESADSQNIPNYDRKGKIDESVHSGMEMLDYELSNRLFLTPVYLCMVNRLFLKKCFTCF 180 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKIT 243 G+ H+D +T NA R Y ++ + + S+ R G +N+ Y + Sbjct: 181 HPGIIHEDHSFTMMIYLNAQRTCYVPEAFFGRRIRPHSIM-TSRFGMRNI---EGYTTVC 236 Query: 244 RLLEKLNRNYAD 255 ++ L + + + Sbjct: 237 TQMQALGQEHPE 248 >UniRef50_D0BKT7 Capsular polysaccharide biosynthesis protein CpsI n=1 Tax=Granulicatella elegans ATCC 700633 RepID=D0BKT7_9LACT Length = 328 Score = 239 bits (610), Expect = 1e-61, Method: Composition-based stats. Identities = 68/322 (21%), Positives = 141/322 (43%), Gaps = 14/322 (4%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M +S+I+P+YN + C++S+ QT+ +E+I++NDGSTD S EI + + Sbjct: 1 MEYDKLISIIVPIYNVEEYLEECLDSIQCQTYNNIEVILVNDGSTDASKEICERFCRQDS 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 L++Q N G S ARNRG+ + G+++ FVD+DD + M E L+ DD+D+ +C Sbjct: 61 RFHLINQENKGLSGARNRGMSESHGEFITFVDSDDVLKEDMLEQLIKQVTIDDIDIVEC- 119 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNI 181 W + E P + ++ + L ++RR+VI+ N Sbjct: 120 --WYTNDQKEILIPSP-ENVKIIFQGNTKEALVSLCRDNIVRLNAVAKLFRREVIL--NF 174 Query: 182 KFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIK 241 F+ GL ++D+ + R + Y Y + S+ N + + Sbjct: 175 PFLEGLFYEDVYGGMGILKQIRRMVKIDYIGYYYRVRQGSIM--------NREFNLKNLD 226 Query: 242 ITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYK 301 + + +K+ + Y + + P ++++ + L +++ + + Sbjct: 227 LFTICDKVEQLYEGDVELLPYIYRRLFHLVLMHVVDYHIFEGNPYQEKYVEYLNRYAKSS 286 Query: 302 RLITNVRSVKVGYQALLWSFRL 323 +R+ ++ + +++ R+ Sbjct: 287 SPSFLMRAYRLFPKNIVFISRV 308 >UniRef50_D2BMR3 Glycosyltransferase, family 2 n=1 Tax=Lactococcus lactis subsp. lactis KF147 RepID=D2BMR3_LACLK Length = 337 Score = 239 bits (610), Expect = 1e-61, Method: Composition-based stats. Identities = 80/324 (24%), Positives = 137/324 (42%), Gaps = 21/324 (6%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 K+SVI+P+YN R C++S+I Q++ E+++I+DGS DNS +I Y + VR Sbjct: 1 MPKISVIVPVYNVEKLLRRCLDSIINQSFEDFELLLIDDGSKDNSGQICDDYKKQDERVR 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 + H +N G S ARN GI+ G ++AF+D+DD V E L LED++DV C Sbjct: 61 VWHISNGGQSAARNLGIDNVQGDFIAFIDSDDFVELDYLEKLYQPMLEDEVDVVSCRYIA 120 Query: 125 CFRETGETWQSIPTD------RLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVK 178 + E + D RS ++ + R + + +Y+ ++ + Sbjct: 121 YSEDRDEEVKKKIRDNHPEFYHFRSNHIVMKDFLEKEFKEERNFPWEAYANLYKAKLLGE 180 Query: 179 NNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRH 238 I+F G +D + ++ A T Y YY + +S +R+ Sbjct: 181 --IRFEVGREFEDNFFNYQYFKRVQIAVATSYIGYYYYANPSSKTRIT-----------F 227 Query: 239 YIKITRLLEKLNRNYADKIMIYPEFHQ-QITYEALRVCHAVRK-EPDILTRQRMIAEIFT 296 Y K L+++ + YPEF + AL+ C K DI +++ Sbjct: 228 YEKQFDLIQQERLIIENVKKEYPEFLDLEANNLALKYCWLYGKIFGDIQLANFSVSKSHI 287 Query: 297 SGMYKRLITNVRSVKVGYQALLWS 320 + K + + + W Sbjct: 288 KEIQKLYSEDKKYFQNLNIGKKWQ 311 >UniRef50_C3JA21 Glycosyl transferase, family 2 n=3 Tax=Bacteria RepID=C3JA21_9PORP Length = 342 Score = 239 bits (610), Expect = 1e-61, Method: Composition-based stats. Identities = 67/262 (25%), Positives = 114/262 (43%), Gaps = 23/262 (8%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S+IIP+YN C++S++ QT+T E I +NDGS D S I Y ++++ Sbjct: 4 ISLIIPVYNVERYLARCLDSVLNQTYTDWEAICVNDGSPDGSRAILGAYQAKDARIKIVD 63 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 + N G S ARN+G+E A G+YV F+D+DD ++P E + +A + ++ D +R Sbjct: 64 KENGGLSDARNKGLEQARGEYVVFLDSDDFIHPQTLEFSLALAKANGSNIVTWYKDPYYR 123 Query: 128 -------------ETGETWQSIPTDRLRSTGVLTGPDWLRMG-------LSSRRWTHVVW 167 + +L + LT D + VW Sbjct: 124 LTTILRRKLGFDVDKKLPKSYYKRYQLENVNQLTTSDVFAHFTEQSHSNIKMPLKHFYVW 183 Query: 168 MGVYRRDVIVKNNIKFIAGLHHQDIVWTTEFMFNAL-RARYTEQSLYKYYLHNTSVSRLH 226 + RR+++ + +FI G+ +D WT+E M R T Y YY + S+ Sbjct: 184 RFLIRRELLT--DFQFIKGITFEDFPWTSELMLRNKGRVTITSLPFYYYYPNEGSIDLST 241 Query: 227 RQGNKNLNYQRHYIKITRLLEK 248 ++ K ++ +L E Sbjct: 242 KRARKINDWITGLEHAYKLYEA 263 >UniRef50_C6A9D3 Glycosyltransferase n=4 Tax=Bifidobacterium animalis subsp. lactis RepID=C6A9D3_BIFLB Length = 337 Score = 239 bits (610), Expect = 1e-61, Method: Composition-based stats. Identities = 70/257 (27%), Positives = 119/257 (46%), Gaps = 13/257 (5%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 ++SVIIP+YN + CM+SL+ Q++ E++ +NDGSTD SV I + Y + V+ Sbjct: 1 MPEISVIIPVYNQEEYVGACMDSLLDQSFEDFEVLAVNDGSTDASVSILEGYEQRDGRVQ 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 L+ Q N+G SVARN G+ A G++V F+D DD V P +TL+ A E + D+A + Sbjct: 61 LIDQNNSGVSVARNNGLSHAKGEWVCFIDPDDYVAPDYLDTLLK-ATERNTDIAM-STCV 118 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRR--------WTHVVWMGVYRRDVI 176 F + Q + + + R + V W +YR + Sbjct: 119 AFDDQRSARQHFFPESFTARAADEKVELFRQLMDGSYAQPKGFVTAIGVPWGKLYRHAFL 178 Query: 177 VKNNIKFIAGL-HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNY 235 ++ ++F A L QD ++ + A Y + + Y Y + +S + N Y Sbjct: 179 QEHGLQFDAALPRMQDNLFNMQAFQLAHEIVYVDYAGYYYRM--GGLSARTYKNNARGLY 236 Query: 236 QRHYIKITRLLEKLNRN 252 + RL+ + + + Sbjct: 237 HPAIDERARLMREYHLD 253 >UniRef50_C1DN10 Glycosyl transferase, family 2 protein n=1 Tax=Azotobacter vinelandii DJ RepID=C1DN10_AZOVD Length = 328 Score = 238 bits (609), Expect = 2e-61, Method: Composition-based stats. Identities = 69/311 (22%), Positives = 135/311 (43%), Gaps = 10/311 (3%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 K+SVIIP+YN C++S + Q + LEII ++DGSTD S+EI + + + ++ Sbjct: 3 PEKVSVIIPVYNVAPYLAQCLDSAVNQDYDNLEIIAVDDGSTDTSLEILEAFRARHGNLL 62 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 + N G S+ARN G+E+ATG+Y+ FVD+DD + T + + +D+ + Sbjct: 63 VRSTKNQGVSIARNTGLELATGEYILFVDSDDFLERQAISTCVEILGRQQVDMVFFASTV 122 Query: 125 CFRETGET-WQSIPTDRLRSTG--VLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNI 181 F E+ ++ +R+ + + + + + + ++ +YRR + Sbjct: 123 FFDGVDESLRENFRGERVPDLHGAYIPAHTFFIQSIRLKSYLMMPYLYMYRRKEFE--GL 180 Query: 182 KFIAGLHHQDIVWTTEFMFNAL--RARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHY 239 +F + H+D ++TT + R LY + S+ +Q Y Sbjct: 181 RFYPEIVHEDNLFTTRLLLEKEHARIACIPDRLYNRRVRPESIMTQKKQKRHINGYLVVA 240 Query: 240 IKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCH-AVRKEPDILTRQRMIAEIFTSG 298 ++ ++ KL ++ +F Q AL C + R R++ + Sbjct: 241 QELLKI--KLPPENSEAGAALNQFVQLNLLRALETCKLTYAGRLPLHVRGRLVMLFARTK 298 Query: 299 MYKRLITNVRS 309 + +R + + Sbjct: 299 IERRKVKRILF 309 >UniRef50_C7H3W1 Capsular polysaccharide biosynthesis protein CpsI n=1 Tax=Faecalibacterium prausnitzii A2-165 RepID=C7H3W1_9FIRM Length = 327 Score = 238 bits (609), Expect = 2e-61, Method: Composition-based stats. Identities = 73/237 (30%), Positives = 121/237 (51%), Gaps = 10/237 (4%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M+N T +S+I+P+YN C++S++ QT+ LEII+++D STDNS+ I + AE Sbjct: 1 MLNDT--VSIIVPIYNKEKYLEKCLDSILGQTYRDLEIILVDDESTDNSLAICQRCAEKD 58 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 P +++ H++N G S ARN G+E +TG ++F D DD ++ E L + E ++A C Sbjct: 59 PRIKIYHKSNGGVSSARNLGLEKSTGTLISFADPDDSLHAECIERLKRVLDETGAEIAYC 118 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN 180 TG T S + DW +SR+ V W VYRR V + Sbjct: 119 YGLDLLGTTGRTQTKSDETEKVSVMPVRRYDW-----NSRKAHTVCWGAVYRRAV--TQD 171 Query: 181 IKFIAGL-HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQ 236 + F L +D ++ + + A + +++LY YY+++ SV+ H K + Sbjct: 172 VAFDTDLKIGEDTLFLAKCIRKAEKIACIDKALYYYYVNDDSVTSGHYSAGKLTELK 228 >UniRef50_B0NM34 Putative uncharacterized protein n=3 Tax=Bacteroides RepID=B0NM34_BACSE Length = 348 Score = 238 bits (608), Expect = 2e-61, Method: Composition-based stats. Identities = 68/297 (22%), Positives = 133/297 (44%), Gaps = 14/297 (4%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M + ++SVI+P+YN C+ S+++QT+T E+++++DGS D EI YA Sbjct: 1 MERNIPEISVIVPVYNVEQYLAECISSILSQTFTDFELLLVDDGSPDRCGEICDEYAGKD 60 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLD--VA 118 VR+ HQ NAG S ARN+G+E A+G Y+AFVD+DD V T + L D Sbjct: 61 KRVRVFHQENAGLSCARNKGLEHASGTYIAFVDSDDYVTSTYLQELYASLPADKSQRGTV 120 Query: 119 QCNADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVK 178 C D F + S+ T + +L + ++ W +Y +I + Sbjct: 121 ICGFDKLFPDG-----SLHTVHVPQQTILPMDCSRVLAELVGKYVMYAWAKLYDNRLIKE 175 Query: 179 NNIKFIAGLHH-QDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSR---LHRQGNKNLN 234 + I+F+ + +D+++ +++ + + S Y Y + + + + ++ Sbjct: 176 HGIRFVPAVSGLEDMLFMLDYLPYSDYLLIRDTSTYIYRVGYSMATLSTCIKDFRSEYAA 235 Query: 235 YQRHYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMI 291 + + ++ R E + + ++ L + +KE ++R++ Sbjct: 236 FSNYLERVCRYKETYRLEDSSLVGVWDSLTVFFHKILLAIY---KKENHYSRKERIV 289 >UniRef50_D2RJ85 Glycosyl transferase family 2 n=1 Tax=Acidaminococcus fermentans DSM 20731 RepID=D2RJ85_ACIFE Length = 327 Score = 238 bits (608), Expect = 2e-61, Method: Composition-based stats. Identities = 63/294 (21%), Positives = 115/294 (39%), Gaps = 10/294 (3%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 N +S+I+ +YN C++S++ QT+ LEII+++DGS DNS +I +A + Sbjct: 2 EKNLISIIVAVYNVEKYLNKCIKSIVNQTYHNLEIILVDDGSKDNSGKICDEWAGKDSRI 61 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 ++H+ N G S ARN G++ + G+++ FVD DD + MYETL + + C Sbjct: 62 HVIHKQNGGVSDARNYGLKASNGEWICFVDGDDYISLNMYETLYKNRSKQG--ITVCGYY 119 Query: 124 WCFRETGETWQSIPTDRLRSTG---VLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN 180 I LT + + + W +Y R V Sbjct: 120 VVENGKEIPVPGINKKLAPKEAANLYLTNERQSVLNSNFTYFGSYPWNKLYDRSVFQ--G 177 Query: 181 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYI 240 +++ +D+ E + + R+ Y Y S++ Q + ++ + Sbjct: 178 VEYPEHKKFEDMYIMLELLHQSYEIRFIPDCKYYYIQRRGSITH---QKSIQVDSLDARL 234 Query: 241 KITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEI 294 K + LE+ KI + + ++RM A + Sbjct: 235 KQKKELEEFWDIKDKKIDELIALSYIAILRKYAINPYSERIKYTSLKKRMWASL 288 >UniRef50_Q93DZ8 EpsG n=2 Tax=Lactobacillales RepID=Q93DZ8_STRTR Length = 322 Score = 238 bits (608), Expect = 2e-61, Method: Composition-based stats. Identities = 64/241 (26%), Positives = 118/241 (48%), Gaps = 9/241 (3%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S++IP+YN D + C++S+++QT++ LEII+++DGSTD S I YY+EN ++++H Sbjct: 3 ISIVIPVYNVQDYIKKCLDSILSQTFSDLEIILVDDGSTDLSGRICDYYSENDKRIKVIH 62 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 AN G S ARN GI+ AT +++ F+D+DD V E L + + D++ + + Sbjct: 63 TANGGQSEARNVGIKNATSEWITFIDSDDYVSSDYIEYLYNLIQVHNADISIASFTYI-- 120 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 T + I ++ +R L + + VW +YR + N KF++G Sbjct: 121 ----TPKKIIKHGNGEVALMDAKTAIRRMLLNEGFDMGVWGKMYRTEYF--NKYKFVSGK 174 Query: 188 HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRLLE 247 +D + T + A + + +Y Y S + K + + + ++ Sbjct: 175 LFEDSLITYQIFSEASTIVFGAKDIYFYVNRKNS-TVNGTFNIKKFDLIEMNEEANKFIK 233 Query: 248 K 248 Sbjct: 234 H 234 >UniRef50_C2EWJ8 Putative uncharacterized protein n=1 Tax=Lactobacillus vaginalis ATCC 49540 RepID=C2EWJ8_9LACO Length = 499 Score = 238 bits (608), Expect = 2e-61, Method: Composition-based stats. Identities = 67/248 (27%), Positives = 114/248 (45%), Gaps = 11/248 (4%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 ++ VI+ +YN C++S++ QT+ LE+I+++DGSTD S+ I Y + V Sbjct: 11 EEEQILVIVSVYNTEKYLDQCIQSILKQTYHNLELILVDDGSTDRSLAICTRYQQQDSRV 70 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 R+ HQAN G S ARN G++VA G + FVD+DD V P E L+T +A + Sbjct: 71 RVFHQANQGVSKARNYGMDVAAGDLITFVDSDDYVMPNYLEELITALESTGSQIAISDFQ 130 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRR-----WTHVVWMGVYRRDVIVK 178 ++ G+ +P D S T +W +++ V+W +++++ Sbjct: 131 -SYKGNGQLLVHMP-DHEVSQRTFTPREWFDFAYTNKNDFFQLIYTVLWGKLFQQEAF-- 186 Query: 179 NNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRH 238 N I D T + A + Y + Y Y L+ +S HR+ +++ R Sbjct: 187 NQIYCPEDTQIDDEFTTWKTYLQASKIVYLDDEDYIYRLNQDGLS--HRKNQEDIRPLRS 244 Query: 239 YIKITRLL 246 + L Sbjct: 245 LEEEITFL 252 >UniRef50_UPI0001972E41 putative sugar transferase n=1 Tax=Clostridium sp. M62/1 RepID=UPI0001972E41 Length = 323 Score = 238 bits (608), Expect = 2e-61, Method: Composition-based stats. Identities = 81/325 (24%), Positives = 143/325 (44%), Gaps = 8/325 (2%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 +S++IP++N C+ S++ Q++ EII+++DGS D S E+ YYA + ++ Sbjct: 2 LISIVIPVFNVEKYLEECINSVLKQSFQDYEIILVDDGSQDKSGEMCDYYAGICEKITVI 61 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALE--DDLDVAQCNADW 124 H+ N G S ARN G++ A G+Y+ F+D+DD + E + A+ + +DV A Sbjct: 62 HKKNGGLSDARNVGVKAAKGEYILFLDSDDYIASESLEEVAQKAVNSRNKIDVFFLEAVK 121 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 + + E ++ G + ++ ++ + R++I ++N+ F Sbjct: 122 FYPDGREEPLGDGYEKRLIEG-QSKNTVMQHLAYLPKFPGSACSKMVSRELIERHNLYFQ 180 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYY-LHNTSVSRLH-RQGNKNLNYQRHYIKI 242 GL +DI WT + A YTE Y Y S++ + ++L Y Sbjct: 181 EGLLSEDIEWTVRLLKAAQTFGYTETPFYYYRQQREGSITNTFGIKNIESLLYIVEKFSS 240 Query: 243 TRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKR 302 LLE +N + M Y EF + + AL R + L R R I S K Sbjct: 241 HSLLENSFQNSINSFMAY-EFVIILYHYALLDKKERRMVKESLRRNRWILRYGLSRKTK- 298 Query: 303 LITNVRSVKVGYQALLWSFRLWQWR 327 I N+ G + ++++ R Sbjct: 299 -IINLAVSVFGMSVTSYLLKIYKNR 322 >UniRef50_A6L5V6 Glycosyltransferase family 2 n=2 Tax=Bacteroides RepID=A6L5V6_BACV8 Length = 344 Score = 238 bits (608), Expect = 2e-61, Method: Composition-based stats. Identities = 77/321 (23%), Positives = 139/321 (43%), Gaps = 30/321 (9%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPH- 62 K+S+I+P+YN CM+SL+ QT LEII+++DGS DNS + Y + Sbjct: 2 KQPKVSIIVPVYNVEKYLDRCMQSLLGQTLKELEIILVDDGSPDNSPALCDNYESASDNL 61 Query: 63 ----VRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVA 118 ++++H+ N G ARN G+E+ATG+Y+AF+D+DD V MYE L A D+DV Sbjct: 62 HWPIIKVIHKINEGLGFARNSGLEIATGEYIAFLDSDDFVDVNMYEQLYLKAKAKDIDVV 121 Query: 119 QCNADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSR-----------RWTHVVW 167 CN + + P + + G + + L R+ V Sbjct: 122 YCNCNIY----KDKKHIYPRKDVEEEVIFRGREAVDDFLLDMIAPLPEYHHDVRYMMSVG 177 Query: 168 MGVYRRDVIVKNNIKFIAGLHH--QDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRL 225 +YR + +++ ++F++ +DI++ +F+ + Y L+ Y ++ S+S Sbjct: 178 HAIYRHSIFMEHRVQFVSEREFVSEDIIFDIDFLSHVRSVCYLPDCLHYYVVNPYSLSHT 237 Query: 226 HRQGNKNLNYQRHYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDIL 285 + Y K+ L + A + + + + V + Sbjct: 238 YDDAK--------YHKMVNLSRIIKERLAKIYSEEKYYLHYLRSVFFLLRNTVMRMVLAG 289 Query: 286 TRQRMIAEIFTSGMYKRLITN 306 + I I +Y+ L++N Sbjct: 290 ETDQKIKRILEEKLYQELLSN 310 >UniRef50_C9K1G9 Putative glycosyltransferase n=1 Tax=Klebsiella pneumoniae RepID=C9K1G9_KLEPN Length = 342 Score = 238 bits (608), Expect = 2e-61, Method: Composition-based stats. Identities = 81/337 (24%), Positives = 157/337 (46%), Gaps = 23/337 (6%) Query: 4 STNKLSVIIPLYNA-GDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPH 62 + +SVI+P+YN + TC++S+ QT +EII++NDGST+N +E+ ++ Sbjct: 2 NMPIISVILPIYNIKEEYLETCIQSVCRQTLDDIEIILVNDGSTNNCLEVCTKFSHLDNR 61 Query: 63 VRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNA 122 + ++ Q N+G SVARN G+++A G+++AFVD DD + P +L A + ++ C+ Sbjct: 62 IIVIDQPNSGVSVARNSGLDIARGEWIAFVDPDDWLEPDYLLSLYNNA-SSNTEIVICDC 120 Query: 123 DWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRW---------THVVWMGVYRR 173 + +P + + GVL + L L S+ V W +++ Sbjct: 121 NVFSNGEKRGNNFLPANSI---GVLKNKNLLLGQLVSKVIAGYYPQLIGPGVPWGKLFKH 177 Query: 174 DVIVKNNIKFIAGL-HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKN 232 D I K N+KF+ G+ QD ++ + A + YT ++ Y Y + N S S + + + Sbjct: 178 DFIKKYNLKFVPGMVRMQDNIFCLYAFYYASQVFYTGENKYNYRMENNSASFKYNKKIIS 237 Query: 233 LNYQRHYIKITRLLEKL---NRNYADKI--MIYPEFHQQITYEALRVCHAVRKEPDILTR 287 ++++++ ++ ++K N Y + + + Y H + E D L Sbjct: 238 -HFEKYFEEVKAFIDKCMSGNERYQKALDAKTLTSINSYLLYYFCHDNHTIESENDTLGN 296 Query: 288 QRMIAEIFTSGMYKRLITNVRSVKVGYQALLWSFRLW 324 ++ + KR+ + + +L RL+ Sbjct: 297 --LMCRQPYASALKRVDFTYLTTQEKIFVILLKLRLY 331 >UniRef50_UPI0001C377F0 glycosyl transferase, group 2 family protein n=1 Tax=Ruminococcus flavefaciens FD-1 RepID=UPI0001C377F0 Length = 321 Score = 238 bits (607), Expect = 3e-61, Method: Composition-based stats. Identities = 84/328 (25%), Positives = 148/328 (45%), Gaps = 34/328 (10%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 L+VI+P+YN C+ SLI+QT++ + II++NDGSTDNS +IA +AENY ++ ++H Sbjct: 4 LTVIVPIYNVEKYLEKCVLSLISQTYSQMCIILVNDGSTDNSGKIADKFAENYDNITVIH 63 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 + N G S ARN GIE +Y+AFVD+DD V MYE L++ ++ +D++ + Sbjct: 64 KTNGGLSDARNVGIESTNTEYIAFVDSDDYVESNMYEVLLSKFSDEQIDISIGTVLYE-N 122 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIV-----KNNIK 182 E GE + T + + + ++ L ++ V++ + ++ + Sbjct: 123 EQGEGYCPYKTG---ISAIWNNEEAMKYLLCYSKFNMSFCDKVFKTKLFRTPAYGEDKLL 179 Query: 183 FIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKI 242 F G +D + A R Y LY Y S+SR N N+N Sbjct: 180 FPYGKKCEDQFLMCRVLARASRVAYNSIPLYHYVQRFGSISR-----NTNINLAP----- 229 Query: 243 TRLLEKLNRNYADKIMIYPEF----HQQITYEALRVCHAVRKE-----PDILTRQRMIAE 293 LE + +P Y + +C A K +I + + A Sbjct: 230 ---LEAVESQLEFYKKWFPSISYTAETAYLYTYMSICSAYYKHSLDCPKEISNKAKSAAS 286 Query: 294 IFTSGMYKRLITNVRSVKVGYQALLWSF 321 ++ +++ +K +QA+++ + Sbjct: 287 VYLKSVFQ---FKELPLKKKFQAIVFRY 311 >UniRef50_B4BNC6 Glycosyl transferase family 2 n=1 Tax=Geobacillus sp. G11MC16 RepID=B4BNC6_9BACI Length = 516 Score = 238 bits (607), Expect = 3e-61, Method: Composition-based stats. Identities = 62/234 (26%), Positives = 114/234 (48%), Gaps = 8/234 (3%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 ++K+S+IIP+YN C+ S+ Q++ LE+I+INDGSTD S +I + + +N P R Sbjct: 2 SSKVSIIIPVYNIASYLEECLYSVQNQSYRDLEVIMINDGSTDGSEQILERFQKNDPRFR 61 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 L +Q N G RNRGI ++ G+YV F+D DD + + L++ D D A Sbjct: 62 LFNQENRGLGYTRNRGISLSNGQYVFFLDGDDRLPKDAIQHLLSAIETHDADYAVGKVLR 121 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 E L + +T + L + +++++ +++N++ FI Sbjct: 122 FHSERKYVPIRHLEFNLYTQKQVTTIEQNPELLQD----SIACNKLWKKEFLIQNDLSFI 177 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYL----HNTSVSRLHRQGNKNLN 234 G +++D+ +T A T + +Y + + HN S+++ + L+ Sbjct: 178 EGKYYEDLFFTMRAAVLAKTVAVTNKVVYYWRVRTEEHNRSITQEQMKLENTLH 231 >UniRef50_C7H1H9 Polysaccharide biosynthesis protein, Glycosyl transferase Family 2,YveR n=2 Tax=Faecalibacterium prausnitzii RepID=C7H1H9_9FIRM Length = 318 Score = 238 bits (607), Expect = 3e-61, Method: Composition-based stats. Identities = 62/284 (21%), Positives = 127/284 (44%), Gaps = 25/284 (8%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 +S+I+P+YNA + C+ S++ QT+ +E+II+NDGS D+S+ + + + + Sbjct: 7 DQPLVSIILPVYNAQNHLARCIGSILNQTYKNIELIILNDGSKDHSLPVCEEFRAKDSRI 66 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 L+ + N+G S RN G+++A G+YV FVD+DD + P + L+ A D+ Sbjct: 67 VLVDKENSGVSDTRNLGLKLAGGEYVQFVDSDDYIDPDFTQHLVEAAETHHADLVIAPYK 126 Query: 124 WCFRETGETWQSIPTDRLRSTGVLT------------------GPDWLRMGLSSR---RW 162 + + + GV++ D + L + + Sbjct: 127 MVIPAGATKPEQVLEKLEDNLGVMSVARPPEVREYGFLPAGVYDKDTFALRLMDKPASYF 186 Query: 163 THVVWMGVYRRDVIVKNNIKFIAGLHH-QDIVWTTEFMFNALRARYTEQSLYKYYLHNTS 221 V+W +YRR ++ N+I+F + L +D+V+ +++ A +++ Y Y + S Sbjct: 187 YSVLWNKLYRRILLTGNDIQFTSELKWAEDLVFNMQYIQYAEAFVSIDKAGYYYVQNPQS 246 Query: 222 VSRLHRQGNKNLNYQRHYIKITRLLEKLNRNYADKIMIYPEFHQ 265 + + ++ I+ R + L + P+ ++ Sbjct: 247 ICHTQINPA---SIVQNKIQTFRYYKDLYTRLGMYEEVRPQLYK 287 >UniRef50_C9BUH6 Glycosyltransferase n=1 Tax=Enterococcus faecium 1,231,408 RepID=C9BUH6_ENTFC Length = 353 Score = 238 bits (607), Expect = 3e-61, Method: Composition-based stats. Identities = 68/245 (27%), Positives = 120/245 (48%), Gaps = 10/245 (4%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 MNS++ +SVIIP+YN +C+ES+ QT+ L+III++DGS D+S +I + YA Sbjct: 1 MNSSDLISVIIPVYNVEKYVGSCLESVCNQTYKNLDIIIVDDGSKDSSNKICEDYALKDS 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 +R++H+ NAG +ARN G+E GKYVAF+D+DD V E ++ + D Sbjct: 61 RIRIIHKNNAGLGLARNTGLEHVKGKYVAFIDSDDFVAANYIEAMLNGIKRSNADTCYAG 120 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRR-------WTHVVWMGVYRRD 174 + + + D T L + + VW +Y + Sbjct: 121 YYEYYDDNNIISKPARYDSTIFTNNAIIEKVLLGMIGTEPSYKVDFILPMSVWHALYSSE 180 Query: 175 VIVKNNIKFIAGLHH--QDIVWTTEFMFNALRARYTEQSLYKYYL-HNTSVSRLHRQGNK 231 +I ++NIKF + + +DI++ ++ ++ + + +Y Y +N S++ R Sbjct: 181 IIQQHNIKFPSERDYISEDIIFHISYLQHSKITSFIKDCIYYYRCNNNNSLTHNFRDNEF 240 Query: 232 NLNYQ 236 + N + Sbjct: 241 DCNIE 245 >UniRef50_A6BHF1 Putative uncharacterized protein n=1 Tax=Dorea longicatena DSM 13814 RepID=A6BHF1_9FIRM Length = 352 Score = 238 bits (607), Expect = 3e-61, Method: Composition-based stats. Identities = 75/334 (22%), Positives = 149/334 (44%), Gaps = 17/334 (5%) Query: 5 TNKLSVIIPLYNAGD-DFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 K+SVI+P+YNA C++S+I Q + +E+I+++DGST++S E+ YA+ + Sbjct: 1 MEKISVIVPIYNAEKKYLTECVDSIIGQDYKNIEVILVDDGSTNSSGEVCDGYADKDERI 60 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 +++HQ N G SVARN GI+ ATG+++AFVD DD + ++ +M E D+ N Sbjct: 61 QVIHQKNQGVSVARNTGIDQATGEWIAFVDVDDWLEKNIFSEVMGQIKEKKTDLVVWNMY 120 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWT--------HVVWMGVYRRDV 175 + + Q+ + + LR+ + + + + +Y++ + Sbjct: 121 MNYSNVEKVAQNYKETFYTDSKIEIEEIRLRLLRTISIYNDEKNITTINYPFCHLYQKKI 180 Query: 176 IVKNNIKFIAG-LHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRL-HRQGNKNL 233 I + N++F + +D ++ ++ Y + LY Y H+ SVS +++ +N Sbjct: 181 IQQWNVRFDSEFKQGEDKLFNYQYFTKINSILYINKPLYHYRQHSLSVSHKFYKEHEENT 240 Query: 234 NYQRHYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAE 293 L + N+ Y D I + + + +R + + + + + Sbjct: 241 TRILKKYYEIEPLIQSNKKYKDTYNIRTAY---LAWWLIRKYY-LHENSTVKKPLKEFEN 296 Query: 294 IFTSGMYKRLITNVRSVKVGYQALLWSFRLWQWR 327 + S Y I + + + L RL + Sbjct: 297 MLKSSPYSESIKKLNVADMKFS--LTKIRLILLK 328 >UniRef50_C0FV12 Putative uncharacterized protein n=1 Tax=Roseburia inulinivorans DSM 16841 RepID=C0FV12_9FIRM Length = 321 Score = 237 bits (606), Expect = 3e-61, Method: Composition-based stats. Identities = 85/339 (25%), Positives = 145/339 (42%), Gaps = 32/339 (9%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 + +++++P+YN +ES+ QT+T L+I++I+DGSTD+S+ + +A ++ Sbjct: 2 NDLITIVVPVYNVEKYLPHTLESICGQTYTNLQILLIDDGSTDDSLAVCHKWARQDNRIQ 61 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 + Q N G S RN+GI++A GKYV F+DADD V M E+L +A D DVA C Sbjct: 62 VYEQENQGVSRTRNKGIQLAIGKYVMFLDADDWVEADMLESLYRLAEVDHADVACC---- 117 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 + + ++ G + GL+ V +YR+D+I NI+F Sbjct: 118 -LLREENQLEEKDNCTITERTIIHGKNKTESGLALLTVWGPVC-KLYRKDLIE--NIQFE 173 Query: 185 AGLHHQDIVWTTEFMFNAL--RARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKI 242 +D+++ T + + RA + Y Y ++ S + +Q Y++ Sbjct: 174 EYKVAEDLLFNTNVVCSEKFERASLIQYPFYHYMIYAGSAMKQ--------EFQDKYLEA 225 Query: 243 TRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDIL--TRQRMIAEIFTSGMY 300 R+ E Y I PEF I V K + +++ AE + Sbjct: 226 MRVEELC---YEKLTKISPEF-ADINLIGCSVSRVFEKYAALSPEKKKQYKAEFKYCKKF 281 Query: 301 KRLITNV--------RSVKVGYQALLWSFRLWQWRDKTR 331 R N R + + + F LW + R Sbjct: 282 AREHKNALLQTKDRHRKISGWLKVYIPDFYLWTLSRRMR 320 >UniRef50_A5UMT7 Glycosyltransferase, GT2 family n=2 Tax=Methanobrevibacter smithii RepID=A5UMT7_METS3 Length = 377 Score = 237 bits (606), Expect = 3e-61, Method: Composition-based stats. Identities = 61/289 (21%), Positives = 124/289 (42%), Gaps = 1/289 (0%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 ++SV++P+YN R C++SL QT+ E+I +NDGS D+S +I + YA + Sbjct: 1 MPQISVVVPVYNVEKYLRECLDSLANQTFEDFEVICVNDGSDDSSPDILEEYASEDERFK 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++ Q N G S ARN G+ G+Y+ F+D+DD + E L A + ++ + Sbjct: 61 IVSQENKGLSGARNTGMNYIKGRYLLFLDSDDWLELNALELLYNHANALNSEMVIFPYRY 120 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWM-GVYRRDVIVKNNIKF 183 +ET + ++ T + ++ + + +Y + K +KF Sbjct: 121 FNQETKQYEENDFTKLNMFDSSVDNKNFNYKNIPETVFRIPHESIKLYDVKTLKKLAVKF 180 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKIT 243 GL+++D + + F + +Y Y + N S+ + + ++ IK Sbjct: 181 PEGLNYEDAYFFYKIFFKLNKVSIIRTPIYNYRIRNDSICTTGTEKSFDIFKILTSIKNF 240 Query: 244 RLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIA 292 +++ ++ D+ +++ + + Y L R I + Sbjct: 241 LKEDEIYESFKDEFILFTVINLKFVYLRLDERFRDRYLEKIKKNYEFFS 289 >UniRef50_D2Q662 Glycosyl transferase, family 2 n=3 Tax=Bifidobacterium RepID=D2Q662_9BIFI Length = 342 Score = 237 bits (606), Expect = 4e-61, Method: Composition-based stats. Identities = 84/339 (24%), Positives = 141/339 (41%), Gaps = 16/339 (4%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 ++ +S+I+P+ NA + C++SL +Q + L+II+++DGSTD S E+ AE Sbjct: 7 ISPLPSVSIIVPVRNAKNYLDACVDSLRSQIYPKLDIILVDDGSTDGSSELCDAAAERDQ 66 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 +R++HQ G S ARN G++ ATG+YV FVD+DD++ + A E D+ C Sbjct: 67 RIRVIHQTCKGVSEARNAGLDTATGEYVLFVDSDDQLRVDAVMGALRCAHEHSADMVVCG 126 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRM-------GLSSRRWTHVVWMGVYRRD 174 D S + +R V+ + + W +YRR Sbjct: 127 YDVVEELADGGTVSQGSRTVRQEEVIDTRQRHEKDVANRLCAFDNDELLYQCWGKLYRRA 186 Query: 175 VIVKNNIKF-IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNL 233 I +++F +D V+ E + +R ++ Q Y Y H S+ R R G K Sbjct: 187 TIG--DLRFARDAAFGEDTVFVLELLARGIRVVFSPQCSYLYLQHADSLVRGFRSG-KTH 243 Query: 234 NYQRHYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRV--CHAVRKEPDILTRQRMI 291 + + + K + + A++ +Y + E ++L R Sbjct: 244 DLEYSHGKHLAFYRDMPISDANRRGLYLRMANDTLWAVYDAGKAAGSVDETELLRFVRCA 303 Query: 292 AEIFTSGMYKRLITNV---RSVKVGYQALLWSFRLWQWR 327 A G+Y R I V R KV + + WR Sbjct: 304 ASSPYRGVYLREIRRVAVDRRTKVAFVVNMPFLWKLYWR 342 >UniRef50_D0DSJ8 Glycosyl transferase CpsJ n=1 Tax=Lactobacillus fermentum 28-3-CHN RepID=D0DSJ8_LACFE Length = 349 Score = 237 bits (606), Expect = 4e-61, Method: Composition-based stats. Identities = 68/324 (20%), Positives = 130/324 (40%), Gaps = 28/324 (8%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K+S+I+P+YNA + C+ESL+ QT+ +EII+++DGSTD S ++ YA+ + ++ Sbjct: 2 KISIIVPVYNAEEYLSKCLESLVNQTYRNIEIILVDDGSTDASPQMCDEYAQKDDRIIVI 61 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 H+ N G S+ RN G+ TG + FVD+DD + E L+ +D+D+ + + Sbjct: 62 HKENEGLSLTRNVGMAAITGDFFTFVDSDDWLSYNYVENLVVALEANDIDIVFTSYLKEY 121 Query: 127 RETGETWQSIPTDRLRSTGVLTGPDWLRMGL----------SSRRWTHVVWMGVYRRDVI 176 + P D + + LR + + W YR Sbjct: 122 KSKSVGVTFFPEDEIIFKSSEGSNEVLRRLVGEIGDELQHPERIENLNTAWGKCYRTSKF 181 Query: 177 VKNNIKFIAG--LHHQDIVWTTEFMFNALRARYTEQSLYKYY-LHNTSVSRLHR-----Q 228 +KF G + +D+ + + + R+ Y Y Y + S+ + + Sbjct: 182 K--QLKFDDGNEIRSEDLWFNIQAFYLTERSMYINNQGYHYNKQNENSIVSTYTPHLADK 239 Query: 229 GNKNLNYQRHYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQ 288 NK N +I + +L E ++ + + + + ++ R Sbjct: 240 FNKLNNMIEDFIGLHKLGEDY--------ILALNSRRVLMLLTIAINEFSNQKISFWQRT 291 Query: 289 RMIAEIFTSGMYKRLITNVRSVKV 312 + + I Y + K+ Sbjct: 292 KKMKTILNGQEYTKAFRKFEYGKL 315 >UniRef50_C2EYX7 Glycosyltransferase n=2 Tax=Lactobacillus reuteri RepID=C2EYX7_LACRE Length = 326 Score = 237 bits (606), Expect = 4e-61, Method: Composition-based stats. Identities = 67/309 (21%), Positives = 123/309 (39%), Gaps = 7/309 (2%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M + +S+I+P++N C+ S+ QT+ E+++I+DGS DNS EI + + Sbjct: 1 MKEKSYNVSIIVPIFNVEKWLTRCLNSICKQTYGEFEVLLIDDGSIDNSGEICQTFVSKD 60 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 + + N G S RN GI A GKY FVD+DD + P E L + ++++D C Sbjct: 61 NRFKYYKKKNGGLSDTRNYGINRAKGKYFIFVDSDDYLTPDYVEMLYSGIQKNNVDAVIC 120 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTH---VVWMGVYRRDVIV 177 + + + L+G + + + ++ W +Y R + Sbjct: 121 GISRVDEDG--KILNNNCFNVTDKTFLSGKEIIELSFDNKSDGGDLVCAWNKIYDRKLF- 177 Query: 178 KNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQR 237 NN++F G +++D +A + LY Y N S+ K + Sbjct: 178 DNNLRFEKGRYYEDGYIFPNLFLRVNKAMIIHRPLYNYVQRNNSIIHSSMNIKKIKDDDA 237 Query: 238 HYIKITRLLEKLNRN-YADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFT 296 + L + +++ Y I Y + YE ++ + + R+ A+ Sbjct: 238 SMLSWIDLFKNVDKKLYILSIQKYKNWIIYKWYENRKIILNNNMGNYLQKQYRLYAKEKY 297 Query: 297 SGMYKRLIT 305 K I Sbjct: 298 PMSIKSKIK 306 >UniRef50_Q83VE8 EpsN n=1 Tax=Lactococcus lactis subsp. cremoris RepID=Q83VE8_LACLC Length = 351 Score = 237 bits (605), Expect = 4e-61, Method: Composition-based stats. Identities = 67/255 (26%), Positives = 120/255 (47%), Gaps = 19/255 (7%) Query: 6 NKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRL 65 + +SV++P+YN C ES+ QT+T LEII+++DGS D S ++ V + Sbjct: 3 DLISVVVPVYNVEKYLLKCFESISKQTYTNLEIILVDDGSKDTSGKMCDDLKLLDDRVIV 62 Query: 66 LHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWC 125 +H+ NAG ARN G+E+A+GKYV FVD+DD + M L ++ +D + C Sbjct: 63 VHKKNAGLGYARNTGLEMASGKYVLFVDSDDYIEENMVSRLYQRIVKYGVDTSFCGYSIF 122 Query: 126 FRETGETWQSIPTDRLRSTGVLTGPDWLRMGLS-----------SRRWTHVVWMGVYRRD 174 F + ++I + + D + L V + +Y D Sbjct: 123 FEDE----KTIKKCSYYADSIFEEEDVVEKVLLEMIAGLPNAKVDFIIPMSVCLALYSMD 178 Query: 175 VIVKNNIKFIAGLHH--QDIVWTTEFMFNALRARYTEQSLYKYYLH-NTSVSRLHRQGNK 231 +I +I+F + + +DIV+ ++ ++ + Y +LY Y + + S++ + R Sbjct: 179 IIRDYSIRFPSEREYISEDIVFDIAYLQHSNKVSYVSDALYNYRQNSSGSLTHVFRSDEF 238 Query: 232 NLNYQRHYIKITRLL 246 + + Y+KI L Sbjct: 239 D-RQKLMYLKINAEL 252 >UniRef50_A6BIH4 Putative uncharacterized protein n=1 Tax=Dorea longicatena DSM 13814 RepID=A6BIH4_9FIRM Length = 331 Score = 237 bits (605), Expect = 4e-61, Method: Composition-based stats. Identities = 72/330 (21%), Positives = 145/330 (43%), Gaps = 15/330 (4%) Query: 1 MMNSTNK---LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYA 57 M N+ + LS+I+P+YN +TC+ SL+ Q A EII+++DGSTD+S I YA Sbjct: 3 MNNNKRQDTVLSIIVPVYNVEKYLKTCINSLLEQKLDAYEIILVDDGSTDSSGGICDEYA 62 Query: 58 ENYPHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDV 117 + + ++++H+ N G S ARN GIE A GKY+ FVD+DD + P MY+ L+ +A + + Sbjct: 63 KKHEKIQVIHKKNGGLSSARNTGIENAVGKYIGFVDSDDYIMPEMYKNLIEVAKMCNAQM 122 Query: 118 AQCNADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIV 177 ++ SI +++ + + + +R V +Y ++ Sbjct: 123 VMSRYFCFENQSDLEKISINLKEVKNKIRVYNTEDVLKEFFTRNIPESVCSNLYSSELWS 182 Query: 178 KNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQR 237 K ++F+ G ++D + + + + + Y Y S++ N Y Sbjct: 183 K--LRFVEGEINEDTNVIYKLLVGSKKTVVMDAQFYGYRKRYGSIT--------NSGYSE 232 Query: 238 HYIKITRLLEKLNRNYADK-IMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFT 296 + +T + L + K + P + + + +++ + + E++ Sbjct: 233 KFKVVTEHMSYLENDVKRKHPDVMPYLYHFVGTHYFCLLNSILDSENFELYKSEY-ELYR 291 Query: 297 SGMYKRLITNVRSVKVGYQALLWSFRLWQW 326 + + +R K ++ + + L Sbjct: 292 KEFKRLVYYFIRWEKFQWKEYVIALLLILP 321 >UniRef50_Q4JZB4 Putative glycosyl transferase n=3 Tax=Streptococcus pneumoniae RepID=Q4JZB4_STRPN Length = 331 Score = 237 bits (605), Expect = 5e-61, Method: Composition-based stats. Identities = 62/296 (20%), Positives = 124/296 (41%), Gaps = 18/296 (6%) Query: 3 NSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPH 62 ++ +SV++P+YN + C+ S++ QT+ +EIII+NDGS D+S++I YA Sbjct: 8 DNLPLISVVLPVYNVENYLEKCLNSVVRQTYKNIEIIIVNDGSLDSSIDICNKYANKDSR 67 Query: 63 VRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNA 122 +++ + N G S ARN G++ + G+++ FVD+DD V E L ++ + + ++ C+ Sbjct: 68 IKVFTKPNGGLSDARNFGVKKSMGEFITFVDSDDYVTDDYVEYLYSLVKKYNCKMSMCSI 127 Query: 123 DWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIK 182 + + + L ++T + + +Y R + + I Sbjct: 128 FVHYISNDKMINNGTGREL----MMTAEQCIEKMCYHDEVDTCAYAKLYHRSLFSE--IC 181 Query: 183 FIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKI 242 F G +DI F A + + Y Y + S+ ++ + Sbjct: 182 FPKGRIFEDIGTIYRFFGTAGQIACGFNAKYYYEIRENSIVTS--------SFSEKKFDL 233 Query: 243 TRLLEKLNRNYADKIMIYPEFHQQITYEALRV-CHAVRKEPDILTRQRMIAEIFTS 297 + +++ RN+ D+ YP A+ + + D+ EI Sbjct: 234 LFMTDQM-RNFVDQR--YPSLKAATKRRAVYARFSTLNQLLDVQGYNEEKKEIIRF 286 >UniRef50_Q6ASC1 Related to glycosyltransferase involved capsular polysaccharide biosynthesis n=1 Tax=Desulfotalea psychrophila RepID=Q6ASC1_DESPS Length = 321 Score = 237 bits (605), Expect = 5e-61, Method: Composition-based stats. Identities = 75/318 (23%), Positives = 148/318 (46%), Gaps = 13/318 (4%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQT--WTALEIIIINDGSTDNSVEIAKYYAENYPH 62 KLS IIPLYNA C+ S++ Q EII++NDG+ D+S +I E + + Sbjct: 1 MPKLSFIIPLYNAEKFIEKCLLSILQQNVPIQDYEIIVVNDGTKDDSRKIVSEMQEKFSN 60 Query: 63 VRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNA 122 ++++ Q N+G SV RNRG+ VATG+YV F+D+DD + + L+ + D+ Sbjct: 61 IKIIDQENSGPSVTRNRGLRVATGEYVWFIDSDDYIIDNSLQELLDILTMPSPDILYIGY 120 Query: 123 DWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIK 182 + + + +R + T +L+ + + +VW+ +YRR ++ +N + Sbjct: 121 KKVYENKRD--EKGTGERNGTAQPYTNHQFLKKF---KSFPPMVWLYIYRRTMLTENRLS 175 Query: 183 FIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKI 242 F G+ H+D +T + +F A + + +++Y Y S++ ++N + +I Sbjct: 176 FPEGIMHEDEEFTPKALFFAKKIIHFNRAIYCYLKREASITSSPWNPLIDINSKY---EI 232 Query: 243 TRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKR 302 L + ++ ++D I+ + + + R+ V + ++ E R Sbjct: 233 CHSLAEFSKAHSDSRDIWQSY---FSRKTNRLAINVLDLCIAHNKPKLFKEKLAQLKRNR 289 Query: 303 LITNVRSVKVGYQALLWS 320 L +G + L++ Sbjct: 290 LYPLQAKASLGIKDYLYA 307 >UniRef50_D2Q643 Glycosyltransferase EpsI n=2 Tax=Bifidobacterium dentium RepID=D2Q643_9BIFI Length = 343 Score = 237 bits (604), Expect = 6e-61, Method: Composition-based stats. Identities = 78/290 (26%), Positives = 126/290 (43%), Gaps = 16/290 (5%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M +SVI+P+YN R C+ESL TQ + + E+I+++DGS D S + YA Sbjct: 1 MAFDELISVIVPVYNVRRYLRQCLESLATQNYPSFEVILVDDGSADGSDLLCDEYAAKDE 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 +++HQ NAG SVARN G+ A+G+Y++FVD+DD V TLM+ + + A Sbjct: 61 RFKVVHQQNAGLSVARNTGLTHASGQYISFVDSDDWVSGYYLSTLMSAIAQSGMPCASLA 120 Query: 122 ADWCFRETGETWQS----IPTDRLRSTGVLTGPDWL-RMGLSSRRWTHVVWMGVYRRDV- 175 FR+ +P+ R+ V + + L +R + RRDV Sbjct: 121 HLRPFRDGSGCRLYGDHGMPSMRVVGKNVKVADEMTAQKALLRQRINCGAQARLCRRDVL 180 Query: 176 --IVKNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNL 233 I + F GL ++D+ M A + +SLY Y + + + Sbjct: 181 LSIAADGKVFPEGLLYEDLATVYRIMHEARGSALVYESLYAYRRRCSGIMGNYGA----- 235 Query: 234 NYQRHYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPD 283 H K+ + + Y D + +PE ++ L + V D Sbjct: 236 ---LHDEKVKSAIRVSHGLYEDMMRWHPELGKETASRCLALLCTVYSSLD 282 >UniRef50_D0TDE5 Glycosyltransferase family 2 n=1 Tax=Bacteroides sp. 2_1_33B RepID=D0TDE5_9BACE Length = 329 Score = 237 bits (604), Expect = 6e-61, Method: Composition-based stats. Identities = 75/273 (27%), Positives = 127/273 (46%), Gaps = 8/273 (2%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 T+ ++VIIP+YN C+ S+I Q++ L+II+++DGS+D+S I + Sbjct: 2 TDLVTVIIPVYNVEKYLAKCVNSIIKQSYKKLQIILVDDGSSDSSGAICDELLLLDERIE 61 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 + HQ N G S ARN+GI++A G+Y+AFVD DD + P MYE L+ + + + Sbjct: 62 VFHQKNKGVSSARNKGIDLAQGEYIAFVDPDDFLAPNMYEVLIGNIKKYSAQCSSVSMKK 121 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLS-SRRWTHVVWMGVYRRDVIVKNNIKF 183 + G P + + + +G + L +S W ++ +D+I NNI+F Sbjct: 122 IYESDGHEELLTPVGQFK---IQSGEEALDDAMSLDDPWLGYSVNKLFLKDIIDVNNIRF 178 Query: 184 IAGLHH-QDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKI 242 + +D ++ E+M L + LY Y + TS ++ + N L + +I Sbjct: 179 NENIIMNEDSIFCYEYMSYCLISVRYTLPLYNYLIRKTSATQTIFKDNDKLFHLIDSSEI 238 Query: 243 TRLLEKLNR---NYADKIMIYPEFHQQITYEAL 272 KL R + Y H Q+ Y Sbjct: 239 LYNFGKLYRGRKFFTKMQCYYINVHIQLIYYMF 271 >UniRef50_C2M970 Glycosyl transferase family 2 n=1 Tax=Porphyromonas uenonis 60-3 RepID=C2M970_9PORP Length = 347 Score = 237 bits (604), Expect = 6e-61, Method: Composition-based stats. Identities = 70/237 (29%), Positives = 109/237 (45%), Gaps = 23/237 (9%) Query: 3 NSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPH 62 + T K+SVIIP+YN R C++S++ QT E I ++DGS D S I YAE Sbjct: 10 DHTPKVSVIIPVYNVEQYLRRCLDSILGQTMPDWEAICVDDGSPDGSASILAEYAEQDSR 69 Query: 63 VRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNA 122 R+L + N G S ARN G +VA G YV +VD+DD ++P +E +T+A D D+ Sbjct: 70 FRILTKENGGLSSARNAGTQVARGLYVNYVDSDDFIHPQTFELALTLAERDGSDIVSWVP 129 Query: 123 DWCFR-------------ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTH----- 164 D ++ + W L + + +R + R + Sbjct: 130 DMVYKRRLLLRAKLGQPYDDVVPWSMSQRYDLSKVHAVCTTEVVRHATNHRWFDGVSDPI 189 Query: 165 ---VVWMGVYRRDVIVKNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLH 218 W + RR++ V+ F+ GL +D W ++ M ++ R T LY YY + Sbjct: 190 RYFYAWRHLVRRELAVET--PFVEGLKFEDFPWWSDLMLHSPRVTITHLPLYYYYYN 244 >UniRef50_C9RJ54 Glycosyl transferase family 2 n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RJ54_FIBSS Length = 321 Score = 237 bits (604), Expect = 7e-61, Method: Composition-based stats. Identities = 78/275 (28%), Positives = 121/275 (44%), Gaps = 16/275 (5%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 +SVIIPLYN + C++SL+ QT+T E+II++DGSTD IA YA + Sbjct: 1 MPAVSVIIPLYNTEAFIKDCLDSLVAQTFTDFEVIIVDDGSTDGGARIAASYASADTRFK 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 L+ Q N G S ARN G+++ G+YV F+D+DD V P ETL +A DVA C+ Sbjct: 61 LIGQPNKGPSEARNTGLKIMRGEYVTFIDSDDCVAPNFLETLFFIAQLHKADVACCSIKN 120 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWT-HVVWMGVYRRDVIVKNNIKF 183 E + + P + ++ LT + R+ L + W +Y+ D+ F Sbjct: 121 I-DEAYKCDGTTPNTSISAS--LTPEEAARISLYQDSLPDYSAWNKLYKADLWK--GKFF 175 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKIT 243 A ++D+ E + A + T+ LY Y S L Q KI Sbjct: 176 PANTIYEDLAVIPEILLGANKVVTTKSHLYFYRKRAGS----------ELATQTGSTKII 225 Query: 244 RLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAV 278 LL+ + I ++ + ++ Sbjct: 226 LLLDIAEDVFEKMKHISKPLYKAARSMLVSASFSI 260 >UniRef50_A6ASH7 Cps2K n=1 Tax=Vibrio harveyi HY01 RepID=A6ASH7_VIBHA Length = 330 Score = 237 bits (604), Expect = 7e-61, Method: Composition-based stats. Identities = 68/327 (20%), Positives = 142/327 (43%), Gaps = 24/327 (7%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M + +SVI+P+YN C+ SL+ QT++ E+I+++D S D+S++IAK Sbjct: 1 MNSKEPLISVIVPVYNVEKYIIKCLNSLVNQTYSNFEVIVVDDESPDDSIKIAKAAFSED 60 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 + ++N G S ARN G++ + G+Y++F+D+DD T E ++ ++++ D++ C Sbjct: 61 SRFIFVSKSNGGLSCARNFGVKYSKGEYISFLDSDDYFDETFLEVMINTLMDNNADISVC 120 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN 180 E + + R V+TG + R L +Y+R+ I+ N Sbjct: 121 AMSLV----DECYDELEVRGPRENEVITGFEAFRNNLLMHTIASGAQNKLYKREHII--N 174 Query: 181 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYI 240 + GL+++D T + ++ + + L+ Y S+ + + + + H Sbjct: 175 YPYPLGLYYEDRATTYKMFLDSSKVCLINKPLFYYLQREGSIMKGLSEKSVTDRFIVHDS 234 Query: 241 KITRLLE----KLNRNYAD-------------KIMIY-PEFHQQITYEALRVCHAVRKEP 282 ++ L + K N +Y + +I IY + + I+ R+ V Sbjct: 235 IVSELEKRSLLKDNLDYLNVCYILNVIVAGSHQIAIYSKNYKKDISNLFDRIDFNVLTIS 294 Query: 283 DILTRQRMIAEIFTSGMYKRLITNVRS 309 + + + F + + + + Sbjct: 295 SLSLMYKFHKKKFFALIVLLVSKRLFK 321 >UniRef50_A6NTS7 Putative uncharacterized protein n=1 Tax=Bacteroides capillosus ATCC 29799 RepID=A6NTS7_9BACE Length = 334 Score = 236 bits (603), Expect = 7e-61, Method: Composition-based stats. Identities = 76/341 (22%), Positives = 141/341 (41%), Gaps = 24/341 (7%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 LS+I+P+Y A C+ S++ Q E+I+++DGS D S +I +AE ++++H Sbjct: 3 LSIIVPVYRAERCISNCITSILAQVNGDFELILVDDGSPDRSGQICDSWAEQDSRIQVIH 62 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 + N G S ARN G+ + G+Y+ FVD+DD V P +++T A E+ C WC Sbjct: 63 KKNGGVSSARNVGLAASKGQYILFVDSDDYVEPEYASSMLTAAKENP----NCGHIWCGF 118 Query: 128 ETGETWQSIPT--DRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIA 185 +T + + +L M L W +YR D+I ++ I F Sbjct: 119 QTVSDYNHSDAAPNLKTDKDMLQFMRSEIMTLHELWLDTGPWNKLYRSDIIRQHGIHFPE 178 Query: 186 GL-HHQDIVWTTEFMFNALR--ARYTEQSLYKY-YLHNTSVSRLHRQGNKNLNYQRHYIK 241 + +D ++ +++ + + LY Y S+ +R + + Y K Sbjct: 179 DMSLGEDWLFNLQYLDASPSECILVVTKPLYNYVRTGQASLDEGYRP-----DMLQIYRK 233 Query: 242 ITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYK 301 + EK + ++ +F+ Y +V + P+ +++ + Sbjct: 234 LNGECEKYLKKWSVPAEQIQKFYNSRFYTYEKVLFNTFRNPNSTKKEKYRWN-------R 286 Query: 302 RLITNVRSVKVGYQALLWSFRLWQWRDKTRSHHRITRSAFN 342 L+ + V W L+ + S+ I AFN Sbjct: 287 HLMRSEEFRNVLLARDCWVHPLYMLAYRLGSYRLIA--AFN 325 >UniRef50_Q047W8 Glycosyltransferase related enzyme n=1 Tax=Lactobacillus delbrueckii subsp. bulgaricus ATCC BAA-365 RepID=Q047W8_LACDB Length = 315 Score = 236 bits (603), Expect = 7e-61, Method: Composition-based stats. Identities = 65/244 (26%), Positives = 110/244 (45%), Gaps = 10/244 (4%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 K+S+I+P+Y A C+ES++ QT+ +E+++I DGS DNS EI YA P V Sbjct: 2 EDVKVSIIVPVYKAEASIVRCIESILGQTYHNIELLLIEDGSPDNSGEICDRYAG-DPRV 60 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 +++H N G S ARN G+E ATG+Y+ F D+DD P + + A++ D+ Sbjct: 61 KVIHNTNQGVSHARNTGLEYATGEYITFCDSDDYYTPNHVKENIYAAMKHGSDITISGYY 120 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 E + + L+ D ++ W +Y+R+V+ +KF Sbjct: 121 I------EKHGKFVSSVHMTNASLSKDDVVKHFTVDNEVGGYCWNKLYKREVLQ--GVKF 172 Query: 184 IAGL-HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKI 242 L +D + M A + Y + LY Y + S +R + + ++ Sbjct: 173 PKNLEVLEDTYFLCSAMKRARKIYYLAEPLYYYCDNQNSATRNMDMLYSDHDTVKYIDSY 232 Query: 243 TRLL 246 +L Sbjct: 233 KNIL 236 >UniRef50_D2PVX1 Glycosyl transferase family 2 n=1 Tax=Kribbella flavida DSM 17836 RepID=D2PVX1_9ACTO Length = 776 Score = 236 bits (603), Expect = 8e-61, Method: Composition-based stats. Identities = 69/249 (27%), Positives = 110/249 (44%), Gaps = 15/249 (6%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 T +LSV++P YN D C++S+ QTWT E I+++DGS D+S IAK + + R Sbjct: 2 TPRLSVVVPFYNVRDYIGDCLDSIARQTWTDFEAILVDDGSPDDSAAIAKEFCDRDARFR 61 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++ Q N G ARN G+ A G+Y+ FVD+DD V +E L+ A NA Sbjct: 62 IVAQENQGLGPARNTGVRHAQGEYLTFVDSDDLVTRHGFEKLIRSLDRTGSSFAGGNARR 121 Query: 125 CFRETGETWQSIPTDRLR----STGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN 180 G I +T V+ PD + +VW VYRR + Sbjct: 122 FNNSAGVRPSWIHRQPFARDRLATHVIETPDLI--------LDRMVWNKVYRRSFWDEYG 173 Query: 181 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNT--SVSRLHRQGNKNLNYQRH 238 +F A + ++D T + +A+ +Y + + S+++ Q + R Sbjct: 174 YQFPA-IRYEDYPVTLKAHLDAVTVDALAVPVYYWRERESGESITQQKFQLGNLEDRVRS 232 Query: 239 YIKITRLLE 247 + L+E Sbjct: 233 AEMVMDLVE 241 >UniRef50_B1C0P6 Putative uncharacterized protein n=1 Tax=Clostridium spiroforme DSM 1552 RepID=B1C0P6_9FIRM Length = 319 Score = 236 bits (603), Expect = 8e-61, Method: Composition-based stats. Identities = 75/307 (24%), Positives = 139/307 (45%), Gaps = 26/307 (8%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 +T +S+I+P+YN R C++S+I QT+ E I+INDGSTD S EI A+ Sbjct: 2 NTPLISIIVPVYNVQKHLRRCLDSIIQQTYENFECILINDGSTDGSGEICDEIAKQDKRF 61 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 ++HQ NAG SVARN + +A G YV+FVD+DD + P +L+ + D D+++C+ Sbjct: 62 IVIHQQNAGLSVARNNALLMAKGDYVSFVDSDDYILPQYLNSLLNAIISTDSDISKCDYF 121 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 + V+ + R L + +W +Y++ + N+I Sbjct: 122 R--------GKLTKDKDTYDVSVVNSKKFTREVLLDK-VGSQLWQYMYKKSLW--NDIVS 170 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYL-HNTSVSRLHRQGNK-----NLNYQR 237 AG + QD++ + A + +Q LY YY+ N S S ++ K + ++ Sbjct: 171 PAGRYAQDMMILHKITNRAKKIAVIDQKLYFYYIDRNDSTSNDSKKKVKGAFDRAIAFKN 230 Query: 238 HYI---------KITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQ 288 Y+ +L++ + Y + + + + + + +++E + Sbjct: 231 RYLFAKEFGYEDCCNKLIKNVMDFYNNALTLKKHTNDLYLDDMKDLSSFLKQELKFQQQN 290 Query: 289 RMIAEIF 295 + I + Sbjct: 291 KEIGFKY 297 >UniRef50_B9YA99 Putative uncharacterized protein n=1 Tax=Holdemania filiformis DSM 12042 RepID=B9YA99_9FIRM Length = 319 Score = 236 bits (603), Expect = 8e-61, Method: Composition-based stats. Identities = 68/326 (20%), Positives = 127/326 (38%), Gaps = 13/326 (3%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP-HV 63 K++VI+P+YN C++SL QT+ L+++++NDGSTD+S +I + Y +P Sbjct: 2 EPKITVIVPVYNVEKYLPECLDSLAGQTYKNLQVLVVNDGSTDSSPQIIEDYVRRFPDRF 61 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 L++ N G S ARN I A G+++AF+D+DD P Y + + DV + + Sbjct: 62 AQLNKPNGGLSDARNFAIGKAEGEWIAFLDSDDWADPEYYAKMAEVMAR-GADVVVSDIE 120 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 + + E G+ +P +L W +YR D + + ++ Sbjct: 121 YFY-ENGQPSWIMPGLNPHVFDSPVKQAFLS--------PMFAWNKLYRADFFTRKHYRY 171 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKIT 243 G ++DI T + ++ Y S+ + + + Sbjct: 172 PLGTWYEDIPVTMPIFAQSQTIGIQHEARIHYRQREGSIMSVTKSERLKEIFPVMKQVRD 231 Query: 244 RLLE-KLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKR 302 + E L+ Y D++ Y Y R + ++AE + + Sbjct: 232 QFAEMGLSERYHDELE-YLHIEHLRLYGMFRFLRSPLTSELKAMSDAVMAEFYPNWKKNP 290 Query: 303 LITNVRSVKVGYQALLWSFRLWQWRD 328 I N+ + S L ++ Sbjct: 291 YIRNLGWKDQLFLKTFGSLTLPLYKH 316 >UniRef50_C6I618 Glycosyltransferase n=5 Tax=Bacteroidales RepID=C6I618_9BACE Length = 329 Score = 236 bits (603), Expect = 9e-61, Method: Composition-based stats. Identities = 67/298 (22%), Positives = 131/298 (43%), Gaps = 28/298 (9%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY-PH 62 S+ K+S+IIP+Y A C++S+ITQT+T E+++++DGS D+S E+ + YA Y Sbjct: 2 SSPKISIIIPVYMAESYLHRCVDSIITQTFTDWELLLVDDGSPDHSGELCEKYASAYKSR 61 Query: 63 VRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNA 122 ++ H+ N G + AR G++ A G+Y VD DD + E L A+++ D+ C+ Sbjct: 62 IKAFHKPNGGVASAREFGMQQARGEYSIHVDPDDWIDSNTLEELYQQAVKEQADMVICDF 121 Query: 123 DWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIK 182 + + +L ++ L R + + R ++ K + Sbjct: 122 MMEYPNRQ-------IHNCQKPQLLDSSSFMHQLLQQER-HGSLCNKLIRTELYHKYQLH 173 Query: 183 FIAGLH-HQDIVWTTEFMFNALRARYTEQSLYKY--YLHNTSVSRLHRQGNKNLNYQRHY 239 F + +D+ + + + Y +LY Y Y ++ S+ R + ++ + Sbjct: 174 FPEKMICWEDLYICCSILLHGCKLAYVPHALYHYDFYTNDNSMVR-----HTDMRGLQAQ 228 Query: 240 IKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTS 297 I RL++ I PE+ ++ L+ + + L ++ I +F Sbjct: 229 IDFCRLMQA---------KISPEYLPELN--ELKGITLITAFRNQLLNEQAIRSLFPE 275 >UniRef50_Q8RSN9 Putative glycosyl transferase n=1 Tax=Streptococcus thermophilus RepID=Q8RSN9_STRTR Length = 229 Score = 236 bits (603), Expect = 9e-61, Method: Composition-based stats. Identities = 64/224 (28%), Positives = 112/224 (50%), Gaps = 8/224 (3%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S++IP+YN D + C++S+++QT++ LEII+++DGSTD S I YY+EN ++++H Sbjct: 3 ISIVIPVYNVQDYIKKCLDSILSQTFSDLEIILVDDGSTDLSGRICDYYSENDKRIKVIH 62 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 AN G S ARN GI+ AT +++ F+D+DD V E L + + D++ + + Sbjct: 63 TANGGQSEARNVGIKNATSEWITFIDSDDYVSSDYIEYLYNLIQVHNADISIASFTYI-- 120 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 T + I ++ +R L + + VW +YR + N KF++G Sbjct: 121 ----TPKKIIKHGNGEVALMDAKTAIRRMLLNEGFDMGVWGKMYRTEYF--NKYKFVSGK 174 Query: 188 HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNK 231 +D + T + A + + +Y Y S K Sbjct: 175 LFEDSLITYQIFSEASTIVFGAKDIYFYVNRKNSTVNGTFNIKK 218 >UniRef50_B5CU67 Putative uncharacterized protein n=1 Tax=Bacteroides plebeius DSM 17135 RepID=B5CU67_9BACE Length = 331 Score = 236 bits (602), Expect = 9e-61, Method: Composition-based stats. Identities = 80/328 (24%), Positives = 145/328 (44%), Gaps = 16/328 (4%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +SVI+P+YN + C+ES++ QT+ LEII+++DGS+D S EI YY + ++H Sbjct: 2 ISVIVPIYNVAEYLPACIESILNQTYKNLEIILVDDGSSDKSGEICDYYVKRDTRCIVIH 61 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 Q+N G S ARN G+E ATG+Y++F+D DD ++P M TL ++D D + + Sbjct: 62 QSNKGLSEARNAGLECATGEYISFIDGDDYIHPKMLATLYEALQQEDYDFSMIAHTQTWN 121 Query: 128 ETGETWQSIPTDRLRSTGV----LTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 E + + T L ++ L + VW +YRR++I + +F Sbjct: 122 HHIEKEPKMINSIIFETSDLMQGLYNVSSKKISLPECNFQ-AVWNKLYRRNLIGSS--RF 178 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKIT 243 I + +D + + +A + +Q LY + +S++ + N N+ Sbjct: 179 IT-IATEDTEFNNKIFLKTQKAIFLDQCLYYWVQRPSSITH----QSVNKNFIDRAYSYL 233 Query: 244 RLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRL 303 L ++ +N+ + +++ A+ V + R +I AE Sbjct: 234 FCLNQIPKNHKEY---RAYCLEKLYKTAVNVQYRARNT-EIYNYAVKTAEELRVKTINEF 289 Query: 304 ITNVRSVKVGYQALLWSFRLWQWRDKTR 331 I+N +LL + + Sbjct: 290 ISNTHINIFHKISLLVFYYIPFTYHIFI 317 >UniRef50_C7PMN9 Glycosyl transferase family 2 n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PMN9_CHIPD Length = 329 Score = 236 bits (602), Expect = 9e-61, Method: Composition-based stats. Identities = 59/221 (26%), Positives = 105/221 (47%), Gaps = 7/221 (3%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV--R 64 K+SVIIP YN C+ S+I QT + +EII++ND S+D ++ + + ++ V + Sbjct: 2 KVSVIIPAYNVEKYIEECVMSVIEQTLSEIEIIVVNDQSSDGTLSVLEKIKASHSQVALK 61 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++ Q N G S ARN GI+ ATG+Y+ F+D DD + M + L +A + + +V C+ Sbjct: 62 IISQQNQGLSGARNTGIQAATGEYICFLDGDDWLEKEMLQDLYKVASDGNAEVVICDYTK 121 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 + + E D VL + + ++RR +V+ +F Sbjct: 122 VYEDRNEVLSGGRID----GNVLRNRKDILEAFLTGNIVISACNKMFRRTFVVERQFRFP 177 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRL 225 G ++D+ T + +A ++S Y Y + S+ Sbjct: 178 VGYLYEDLP-TVMLIADASVVIKVDKSYYNYRQRSGSIMNS 217 >UniRef50_C8PLK6 Ss-1,4-galactosyltransferase n=1 Tax=Campylobacter gracilis RM3268 RepID=C8PLK6_9PROT Length = 421 Score = 236 bits (602), Expect = 1e-60, Method: Composition-based stats. Identities = 71/290 (24%), Positives = 134/290 (46%), Gaps = 38/290 (13%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M+ +SVIIP+YN + R C+E + QT+ LEI++++DGS D S I YA Sbjct: 1 MSVGPLISVIIPVYNVKEFLRACVERITAQTYANLEILLVDDGSNDGSEAICDEYAARDL 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 VR+LH++N G + ARN ++VA G++++FVD+DD + + ETL + + +A C Sbjct: 61 RVRVLHKSNGGQASARNAALDVARGEFISFVDSDDLISLDLIETLFRLTQKFCTKIAMCG 120 Query: 122 ADWCFRETG--------ETWQSIPTDRLRSTG------------VLTGPDWLRMGLSSR- 160 E+ + ++ + + + G ++ + R + Sbjct: 121 YAAFSDESEINNFKAALNSKENQNSKEISNPGGVANSNANAGEYKISPQELFRRSCTQDP 180 Query: 161 RWTHVVWMGVYRRDVIVKNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNT 220 ++ VW ++ R++ ++F AG ++D+ + NA A YT++ LY Y + Sbjct: 181 YFSTAVWRALFAREIFTA--LRFPAGQIYEDVAIFFDIF-NASIAAYTDEILYFYRVRAG 237 Query: 221 SVSRLHRQGNKNLNYQRHYIKITRLLEKLNRNYADKIMI-YPEFHQQITY 269 S ++ R ++ + + R Y + I YP ++ + Sbjct: 238 STVN---------SFARRHLAVIAAV----RRYTEAIAANYPSLAREANF 274 >UniRef50_C6WZX8 Beta-1,3-glucosyltransferase n=1 Tax=Flavobacteriaceae bacterium 3519-10 RepID=C6WZX8_FLAB3 Length = 361 Score = 236 bits (602), Expect = 1e-60, Method: Composition-based stats. Identities = 75/298 (25%), Positives = 133/298 (44%), Gaps = 18/298 (6%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP-H 62 K+SVI+P+Y + C+ SL+ QT +EII++NDGSTD S EI +++ +P Sbjct: 27 QMVKVSVIVPVYQGEKELSKCLASLVNQTLDDIEIIVVNDGSTDGSGEIITEFSQRFPSK 86 Query: 63 VRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNA 122 +R + N G S ARN G++ ATG+Y+ FVDADDEV + + +A + ++ CN Sbjct: 87 IRAFTKENGGLSDARNFGLDRATGQYIGFVDADDEVSANTFAEMYELAEKHAAEMVICNI 146 Query: 123 DWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIK 182 T + + VL + +S + ++R+D++ +N + Sbjct: 147 QKVDEAGKITQKLTQIPNMPERIVLEDHFSVFADIS-----YFACNKIFRKDIL--HNKR 199 Query: 183 FIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKI 242 F G+H++DI + + T+ Y+Y+ S+++ H + K L+ R + Sbjct: 200 FKKGIHYEDIQLIPQLLLECRVIAQTQNYHYQYFERRGSITKTHTE--KGLDILRAVADV 257 Query: 243 TRLL--------EKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIA 292 + RN+ IY + ++ + E +QR I Sbjct: 258 ENAFATSQYSHQKAGLRNFQILEGIYSFLAYLAFVKDEKISRKLSDELKKFRQQRGIG 315 >UniRef50_C0FV38 Putative uncharacterized protein n=1 Tax=Roseburia inulinivorans DSM 16841 RepID=C0FV38_9FIRM Length = 329 Score = 235 bits (601), Expect = 1e-60, Method: Composition-based stats. Identities = 65/291 (22%), Positives = 133/291 (45%), Gaps = 7/291 (2%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K+S+IIP+YN+ + C+ES++ QT+ EI++++DGSTD+S I YA+ + + Sbjct: 2 KISIIIPVYNSTLYLKQCVESILAQTYHNFEILLVDDGSTDDSPMICDEYAQKDDRIVTI 61 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVY-PTMYETLMTMALEDDLDVAQCNADWC 125 H+ N G S ARN G+E A+G Y+ F+D DD P ++ + E + DV Sbjct: 62 HKQNGGTSDARNVGLEKASGDYITFMDNDDYWSNPDALCDIIKVISETEPDVVMHANMVY 121 Query: 126 FRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIA 185 +++ +T DR +G + + + + + VW + + +I ++++ F Sbjct: 122 WQDKNKTVNPSSFDRTLVSG-KKRKEAVENLIKAGMLSVTVWTKLVKTSLIREHDLYFKK 180 Query: 186 GLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYI----- 240 G+ ++D W + A R + ++ Y Y + + + +N ++ I Sbjct: 181 GIRNEDTDWIARMLICAERFEWYDKPFYVYRKGHETAQTSQKMKYYMVNDLKNIIIEYTQ 240 Query: 241 KITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMI 291 + L++ R + YP + ++ + +I+ + + Sbjct: 241 YAEKYLDQEYRKVFYAYLAYPFAVWMGQAYLIEQNEQIKNDREIMKKYAYL 291 >UniRef50_A5ZUY3 Putative uncharacterized protein n=1 Tax=Ruminococcus obeum ATCC 29174 RepID=A5ZUY3_9FIRM Length = 367 Score = 235 bits (600), Expect = 2e-60, Method: Composition-based stats. Identities = 87/341 (25%), Positives = 150/341 (43%), Gaps = 14/341 (4%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K+S+IIP YN C+ES+ QT +EII +NDGSTD+S+EI K YAEN P ++ Sbjct: 3 KVSIIIPTYNVEMFLDECLESIQRQTLQDIEIICVNDGSTDHSLEIIKKYAENDPRFVIV 62 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 + N G A N G++ ATG+Y+ V+ DD V MYE L +A E+DLD+ + + Sbjct: 63 DKENGGYGKAMNVGLDKATGEYIGIVEPDDYVPLAMYEDLYKIAKENDLDMVKADFYRFT 122 Query: 127 RETGETWQSIPTDRLRSTGVLTGPDW-LRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIA 185 R ++ + L TG G + + R+ W G+Y+R+ + +++I+ Sbjct: 123 RNPENGNMNLVYNHLDLTGKWYGKLFDPSEEPETLRFIMNTWSGIYKREFLEQHHIRHNE 182 Query: 186 --GLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKIT 243 G QD + + A RA ++ Y N + S +R+ N + YI+ Sbjct: 183 TPGASFQDNGFYFQTFVYAKRAMIVDKPYYMNRRDNPNSSVNNREKVYCCNVEYDYIREI 242 Query: 244 RLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRL 303 + + I+ F + + + + ++ + I S +KR Sbjct: 243 LMKD---------PEIWSRFKYMYSLKKFHTYNFTLQRIGEQFKREYVQRI--SQEFKRA 291 Query: 304 ITNVRSVKVGYQALLWSFRLWQWRDKTRSHHRITRSAFNLR 344 N + + + W + +D + +I +R Sbjct: 292 KENGELKQGAFYPVEWDDMMLLIQDPDAYYFKICLKNKQIR 332 >UniRef50_A8RE09 Putative uncharacterized protein n=1 Tax=Eubacterium dolichum DSM 3991 RepID=A8RE09_9FIRM Length = 330 Score = 235 bits (600), Expect = 2e-60, Method: Composition-based stats. Identities = 72/322 (22%), Positives = 150/322 (46%), Gaps = 5/322 (1%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S+I+P+YNA +E+L QT T +EI+++NDGSTD+S+++ + +AE P ++++ Sbjct: 9 VSIIVPIYNAEKYLSRTVETLRRQTLTNIEILLVNDGSTDDSLKLCREFAEIDPRIQVID 68 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 + N G S ARN G+++A G+YV FVDADD V E L+ E+ D+ C + Sbjct: 69 KQNGGVSSARNIGLKLAKGQYVLFVDADDYVEANYAEILLKTLQENKSDIEMCGYFREDK 128 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 ++ + + D + P + + + ++ ++ I +NN++F L Sbjct: 129 KSEQIITLMEKDHVFDLN-KELPSIHKAFVKGNGGIYSIFNKLFSLKKIKENNLQFSEKL 187 Query: 188 HH-QDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRLL 246 H +D ++ +++ Y LY Y ++ S++ + + + ++ Sbjct: 188 SHAEDFLFCIQYVNTCSSFGYINIPLYHYVNNDNSLTTTFNKRQLAYHEDWLMVLKQNIV 247 Query: 247 EKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLITN 306 + + + +I Y + Y + VR LT + E+ + +KR+ + Sbjct: 248 SDVYKEHTQEI-YYTTLTEVNDYCIRAKRNKVRFSYKDLTSYPSVREMIKNIRFKRVDSI 306 Query: 307 VRSVKVGY--QALLWSFRLWQW 326 + + +L+ ++L+ Sbjct: 307 SCYILLIALRFKILFLYKLYIL 328 >UniRef50_Q2NH61 Predicted glycosyltransferase n=1 Tax=Methanosphaera stadtmanae DSM 3091 RepID=Q2NH61_METST Length = 358 Score = 235 bits (600), Expect = 2e-60, Method: Composition-based stats. Identities = 77/332 (23%), Positives = 149/332 (44%), Gaps = 7/332 (2%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K+SV+IP+YN R C+++++ QT +EII +NDGSTD S++I YA +R++ Sbjct: 3 KVSVVIPVYNVEKYLRDCLDTIVNQTLEDIEIICVNDGSTDKSLDILNEYASKDERMRVI 62 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 Q N G +VA N G+ ATG+Y+ +D+DD + E ++A E D+D A Sbjct: 63 SQENGGHAVATNVGMTYATGEYLFLMDSDDILKLNALEDTYSIAKEKDVDFVLFKAINYD 122 Query: 127 RETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRW--THVVWMGVYRRDVIVKNNIKFI 184 E E +++ + ++ + L + T W +Y R I +NNI F Sbjct: 123 DEKDEYYEAENYSMNKIARIVKDNIFSYEDLGDAIFDITVTPWSKLYNRSFIEENNITFP 182 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 GL D V+ + +F+A R + ++ L+ ++TS + + + ++ + + Sbjct: 183 EGLVFDDNVFFWKVLFSAKRIYFYKEFLFTRRWYSTSSTT--KGDLRFIDSIAVFNLVWD 240 Query: 245 LLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLI 304 + + N K ++Y + R+ R+ + ++ +Y+ I Sbjct: 241 VFREFNLFEKYKKILYDKKVFLANMRFDRIKEEYREAYFSAMKDDFTKILYDDVLYQDFI 300 Query: 305 TNVRSVKVGYQALLWSFRLWQWRDKTRSHHRI 336 + + YQ L + ++ + ++ Sbjct: 301 SILSP---KYQKLYEQVLISTSAEEFIASRKV 329 >UniRef50_C9N8V9 Glycosyl transferase family 2 n=1 Tax=Streptomyces flavogriseus ATCC 33331 RepID=C9N8V9_9ACTO Length = 1295 Score = 235 bits (600), Expect = 2e-60, Method: Composition-based stats. Identities = 64/226 (28%), Positives = 102/226 (45%), Gaps = 8/226 (3%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 LSV++P++N D C++SL Q+ A+E+++++DGSTD S IA+ +A R Sbjct: 2 KPLLSVVVPVHNVEDYLAECLDSLAGQSLDAIEVVMVDDGSTDGSRRIAEEFAARDDRFR 61 Query: 65 LLHQANAGASVARNRGIEVAT-G-KYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNA 122 +HQ NAG ARN G+ T G Y+AF D+DD + Y ++ D+ N Sbjct: 62 CVHQPNAGLGAARNTGVARTTPGVPYLAFADSDDVLVRDAYARMLASLESTGSDLVTGN- 120 Query: 123 DWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIK 182 +R TG+ Q R + L L S R V W V+RR +++ Sbjct: 121 --VWRLTGQKRQQAWQYRWLTGDRLRTHITRDARLLSDR---VAWNKVFRRSFWDRHDFS 175 Query: 183 FIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQ 228 F G ++D + A Q +Y + + S++R Sbjct: 176 FPVGRLYEDTPVMIPAHYLAGSVDVLHQHVYHWRVREGSITRRRTD 221 >UniRef50_B0NM04 Putative uncharacterized protein n=1 Tax=Bacteroides stercoris ATCC 43183 RepID=B0NM04_BACSE Length = 346 Score = 235 bits (599), Expect = 2e-60, Method: Composition-based stats. Identities = 74/263 (28%), Positives = 125/263 (47%), Gaps = 9/263 (3%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 MM +SVIIP+Y A C+ SL+ Q++TA E+++I+DGS DNS +I +AE Sbjct: 1 MMEQCV-ISVIIPVYKAEKYLSRCIASLLAQSFTAFELLLIDDGSPDNSGKICHDWAERD 59 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 +R+LHQ NAG S ARN+G++ A G+Y+ FVD+DD V + L +LD Sbjct: 60 CRIRVLHQTNAGVSTARNKGLDEAKGEYIVFVDSDDYVGTDYLKHLYEE-RPLNLDSIVV 118 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTH-VVWMGVYRRDVIVKN 179 E + + V TG D + L+ ++ R +I + Sbjct: 119 QGFTIVSEQADRQYTTSF----IPSVYTGRDVRELFLNDEMRDMRGPVAKLFSRVLIERL 174 Query: 180 NIKFIAGLHH-QDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVS-RLHRQGNKNLNYQR 237 ++F A + +D ++ +++F+ + Y Y S++ RL+ ++ ++ Sbjct: 175 GLRFPAEISFGEDTIFNFQYLFHCRQVVVGSHVDYVYCDVPGSLTKRLNDFASEYATFKL 234 Query: 238 HYIKITRLLEKLNRNYADKIMIY 260 + RL KL+ N AD + Sbjct: 235 YKSITVRLAGKLDLNVADMRTAF 257 >UniRef50_A3IC99 Predicted glycosyltransferase n=2 Tax=Bacillaceae RepID=A3IC99_9BACI Length = 329 Score = 235 bits (599), Expect = 2e-60, Method: Composition-based stats. Identities = 74/309 (23%), Positives = 130/309 (42%), Gaps = 6/309 (1%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 +SVI+P+YNA + C+ S+ Q+ +EII INDGS D S+ + + +A+ + +L Sbjct: 3 LVSVIVPVYNAENYLNKCLTSICHQSLKDIEIITINDGSEDQSLTVLQNFAQVDARITVL 62 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 N G S ARN G+E A G+Y+ FVDADD + TM E L ++ +C+ W Sbjct: 63 DTDNGGVSKARNLGMEQAKGQYITFVDADDWLELTMLEELFLACRLTGSNIVKCDLFWQ- 121 Query: 127 RETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAG 186 E G P+ + + L + + + +Y++ I + +KF Sbjct: 122 EEIGSRQLYYPSKTYFTYSATACLNKLFTEIGEKHFGFASC-KLYKKSFIDLHKLKFDET 180 Query: 187 LHH-QDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRL 245 ++ +D V+ T + Y + LY Y + N + N NY Y K++ Sbjct: 181 MNFAEDTVFLTRAVIKNQSICYVPKQLYYYNVGNQNSLTSRAMSNVRNNYDLLYKKLSEE 240 Query: 246 LEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTR--QRMIAEIFTSGMYKRL 303 L+ Y + + + L C A + + + E+ + R Sbjct: 241 LQS-GEVYEKVKNSFLNYQFEGLLVILHQCPAKPQAIKNALQSFLQAYPEVLKVKLMNRK 299 Query: 304 ITNVRSVKV 312 + +K+ Sbjct: 300 LQQFIFLKL 308 >UniRef50_Q8XN57 Beta-1,4-galactosyltransferase n=3 Tax=Clostridium perfringens RepID=Q8XN57_CLOPE Length = 342 Score = 234 bits (598), Expect = 3e-60, Method: Composition-based stats. Identities = 77/322 (23%), Positives = 154/322 (47%), Gaps = 17/322 (5%) Query: 6 NKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRL 65 K+S+I+P+Y D C++S+I Q++ LEII+++DGSTD EI Y + +++ Sbjct: 3 PKVSLIVPIYKVQDYLNKCIDSIINQSYKNLEIILVDDGSTDRCGEIIDLYEKQDNRIKV 62 Query: 66 LHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWC 125 H+ N G S ARN G+ TG YV FVD+DD + M LM +LE + D+ Q + Sbjct: 63 FHKKNGGLSDARNYGMNFITGDYVIFVDSDDWLSEDMVSDLMKTSLELNADIVQSGFYYA 122 Query: 126 FRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIA 185 + + + + S VL + + + + + + W +Y+ +I +I F Sbjct: 123 YDDYLLYDNRYYLEDM-SPVVLNREELMYELVINEKVKNFAWGKLYKSVLIK--DISFKI 179 Query: 186 GLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRL 245 G+ +D+ W + M + + +Y Y S+ ++ + +NL+ + + + Sbjct: 180 GVLFEDVFWAHKVMDRVDKYVIIHKPMYYYRQRKDSIVAIYTE--RNLDILKGLQERHKF 237 Query: 246 LEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEI--FTSGMYKRL 303 +EK I + E ++ I +L + ++K+ +I R + +I + ++ + Sbjct: 238 IEK------KYISLINESYKIIFKTSLIHYYLLKKKKNIKNRDYYMNDIKEYIKSNFRNI 291 Query: 304 ITNVRSVKVGYQALLWSFRLWQ 325 ++++ + L+W L+Q Sbjct: 292 EKSLKNERE----LIWQLHLFQ 309 >UniRef50_UPI0001C4190B glycosyl transferase GT2 family n=1 Tax=Methanobrevibacter ruminantium M1 RepID=UPI0001C4190B Length = 381 Score = 234 bits (598), Expect = 3e-60, Method: Composition-based stats. Identities = 69/266 (25%), Positives = 124/266 (46%), Gaps = 7/266 (2%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 ++S++IP+YN R C++S + QT+ +EII INDGSTD+S++I K Y E+ + + Sbjct: 3 EISIVIPVYNVEKYLRECLDSAVNQTFKDIEIICINDGSTDSSLDILKEYQESDDRIIIF 62 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 +Q N G ARN GI + GKYV F+D+DD + E L + E LD Sbjct: 63 NQENQGPGAARNLGINKSKGKYVYFLDSDDYLELNALEKLYNICEEKSLDFVLFKLLNFN 122 Query: 127 RETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWT--HVVWMGVYRRDVIVKNNIKFI 184 +TG+T+Q+ + + + L + +Y+R++I +I + Sbjct: 123 DKTGKTFQTKYYNMAFLNDRIGDNVFSYKDLYDCVFNLAVSPPAKLYKRELIT--DIDYP 180 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYI-KIT 243 G+ +D V+ + + A R + ++ LY + S++ ++ +Y+ I Sbjct: 181 EGIIFEDNVFFLKTLLKAKRIYFLDEFLYNRRRRDDSLTSSGSDDYYDIIPSMNYLFDIC 240 Query: 244 RLLEKLNRNYADKIMIYPEFHQQITY 269 R L+ + Y +F + Sbjct: 241 RDLDDFELLKEG--LYYKKFKELYIR 264 >UniRef50_D1WVE3 CDP-glycerol:poly(Glycerophosphate) glycerophosphotransferase n=3 Tax=Streptomyces RepID=D1WVE3_9ACTO Length = 1151 Score = 234 bits (598), Expect = 3e-60, Method: Composition-based stats. Identities = 67/227 (29%), Positives = 108/227 (47%), Gaps = 14/227 (6%) Query: 6 NKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRL 65 +L+V++PLYN + C+ SL QT LE+++++DGSTD+ +A+ +A+ P RL Sbjct: 3 PRLTVVVPLYNVEEYLGACLTSLAEQTMPDLEVVLVDDGSTDDGPSLAQEFADRDPRFRL 62 Query: 66 LHQANAGASVARNRGIEVAT--GKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 + Q NAG ARN GI A +++ FVD+DD V P Y ++ D A N Sbjct: 63 IRQENAGLGAARNAGIREARPGAEFLTFVDSDDVVPPGAYARMLAELDASGSDFATGNVL 122 Query: 124 WCFRETGETWQS----IPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKN 179 R GE QS P ++ R +T DW+ +G + V+RR ++ Sbjct: 123 RL-RSDGELEQSPMFRRPMEKARPATHVT-RDWILLG------DRIACNKVFRRTFWDEH 174 Query: 180 NIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLH 226 F G+ ++DI F A E+ +Y + + S++ Sbjct: 175 AFAFPTGVLYEDIAVVLPAHFLARSVDVIEEPVYHWRDRDGSITTRR 221 >UniRef50_C6X0E9 Beta-1,3-glucosyltransferase n=1 Tax=Flavobacteriaceae bacterium 3519-10 RepID=C6X0E9_FLAB3 Length = 339 Score = 234 bits (598), Expect = 3e-60, Method: Composition-based stats. Identities = 69/228 (30%), Positives = 117/228 (51%), Gaps = 6/228 (2%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K+SVI+P+YN D + C+ S+I QT+ LEII+++DGSTDNS +I K A + + Sbjct: 2 KISVIVPVYNVQDYVKQCINSIIAQTYHDLEIILVDDGSTDNSFQICKDAAGKDSRIITV 61 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDD-LDVAQCNADWC 125 Q NAG S ARN GI+++TG Y+ FVD+DD ++ +Y ++ + DV C+ Sbjct: 62 SQKNAGVSAARNNGIKMSTGDYITFVDSDDWLHHDLYLSMTEILNRYPRTDVVMCDFATV 121 Query: 126 FRETGETWQS-IPTDRLRSTGVLTG--PDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIK 182 T + S + +++G P L R +++R ++KN+I+ Sbjct: 122 KNNTAQQITSQLREGLYIKKDIISGIYPTLLVTEDFGRLPIVSACTCIFKRSFMLKNSIQ 181 Query: 183 FIAGLHH-QDIVWTTEFMFNALRARYTEQSL-YKYYLHNTSVSRLHRQ 228 F GL + +D ++ + M + Y + Y Y + S S+ ++ Sbjct: 182 FDEGLRYSEDYLFMADVMTHCDSYYYLKNFFAYNYRQYEESRSKKYQS 229 >UniRef50_C1YIQ9 Glycosyl/glycerophosphate transferase, teichoic acid biosynthesis n=1 Tax=Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 RepID=C1YIQ9_NOCDA Length = 1164 Score = 234 bits (598), Expect = 3e-60, Method: Composition-based stats. Identities = 65/244 (26%), Positives = 102/244 (41%), Gaps = 8/244 (3%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 LSV++P+YN C+ SL QTW LE+++++DGS D S IA YAE P RL+ Sbjct: 4 LSVVVPIYNVERYLDECLRSLQVQTWDDLEVVMVDDGSPDGSAAIAARYAEQDPRFRLIQ 63 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 Q N G ARN GIE A G ++ FVD+DD + + + D + N Sbjct: 64 QENGGLGAARNTGIEHARGDFLTFVDSDDVIPSYAFGYHLESLRRSGSDFSSGNVHRL-N 122 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 E G + R+ T L R + V+RR ++ + F G+ Sbjct: 123 ELGTRQSPMHRRIFRTAKTGTHITREDTLLVDR----LATNKVWRRSFWDRHGMTFPVGV 178 Query: 188 HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNT---SVSRLHRQGNKNLNYQRHYIKITR 244 H+DI F A +Y + + S+++ + + K++ Sbjct: 179 LHEDIPVAIPLHFLAESVDVLSAPVYLWRERPSDDKSITQDRLRPQALEDRVSAVTKVSD 238 Query: 245 LLEK 248 L + Sbjct: 239 FLRE 242 >UniRef50_B1C3W3 Putative uncharacterized protein n=1 Tax=Clostridium spiroforme DSM 1552 RepID=B1C3W3_9FIRM Length = 323 Score = 234 bits (597), Expect = 4e-60, Method: Composition-based stats. Identities = 66/263 (25%), Positives = 123/263 (46%), Gaps = 4/263 (1%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 S+I+P+YN + C++S++ QT+ EII+++DGS D S I A Y ++++ Sbjct: 2 LFSIILPIYNVENYLCECVDSILNQTYKDYEIILVDDGSKDKSPHICDELAHKYDCIKVI 61 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEV-YPTMYETLMTMALEDDLDVAQCNADWC 125 H+ N G S ARN G + A GKY+ ++D+DD + L + +D+ Sbjct: 62 HKKNGGLSDARNAGTKEAIGKYIVYIDSDDYILDKEFLSKLAQK-TKTGVDLIFYKYQKY 120 Query: 126 FRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIA 185 F ET + T L + L + + + + + W+ +R +IV+NNI F Sbjct: 121 FNETKKLEDCTYTYSLAMSETLYANK-IEALVKADAFYGMAWIKAVKRKLIVENNINFEV 179 Query: 186 GLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRL 245 GL +D+ W + +FNA + ++ + Y S++ H N +++ K + Sbjct: 180 GLLGEDMDWNYQVIFNASTIEFIDEPMIAYRQREGSITSTHTLKN-LVDFVYILEKWSNR 238 Query: 246 LEKLNRNYADKIMIYPEFHQQIT 268 + ++ KI +Y + + Sbjct: 239 ITNDIKDEKLKIALYGSLAKYYS 261 >UniRef50_B9Y890 Putative uncharacterized protein n=1 Tax=Holdemania filiformis DSM 12042 RepID=B9Y890_9FIRM Length = 355 Score = 234 bits (597), Expect = 4e-60, Method: Composition-based stats. Identities = 66/317 (20%), Positives = 133/317 (41%), Gaps = 6/317 (1%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K+SVI P+Y + R C++S++ Q++ E+II++DGS DN IA Y + V+++ Sbjct: 3 KISVITPVYKVENYLRKCIDSILNQSFDDFELIIVDDGSPDNCGAIADDYKDKDSRVKVI 62 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 H+ N GA ARN GI++A GKY+ F D+DD + E L +A+ +D ++ + Sbjct: 63 HKENGGAPSARNEGIKIAKGKYLYFPDSDDWIEENYLEELYKLAVSNDANLVISGYTMEY 122 Query: 127 RETGETWQSIPTDRLRSTGVLTGPDWLR--MGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 E + +S + + L+ D + V W +Y+ I N + F Sbjct: 123 YENNK--KSSYKVSVENKVYLSREDVVNNLHLYFDNMMMAVPWNKLYKTSYIKSNELFFP 180 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYK-YYLHNTSVSRLHRQGNKNLNYQRHYIKIT 243 L D+ + E + N + + S Y + S + G + + I Sbjct: 181 -NLKWDDLHFNMEVIMNINKVVISSNSGYHFFRSREGSETTTVFDGLLYQKRREQFEHIL 239 Query: 244 RLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRL 303 ++ N + + ++ + ++ + +I + F + ++ Sbjct: 240 KVYRIWNVENDNIMGSIYGYYASRLVQCIQEISISNVSDKRSLIRNIIYDQFNNYAIEKG 299 Query: 304 ITNVRSVKVGYQALLWS 320 N + +++ + + Sbjct: 300 KINSKLLRLAMYPMKYK 316 >UniRef50_C3R6R9 Glycosyltransferase involved capsular polysaccharide biosynthesis n=2 Tax=Bacteroides dorei RepID=C3R6R9_9BACE Length = 325 Score = 234 bits (597), Expect = 4e-60, Method: Composition-based stats. Identities = 71/220 (32%), Positives = 116/220 (52%), Gaps = 5/220 (2%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQ--TWTALEIIIINDGSTDNSVEIAKYYAENYPH 62 K+S+I+P+YN D C+ SL+ Q ++ EIII+NDGSTD+S+EI + Y E YP+ Sbjct: 1 MKKISLIVPVYNVEDFVARCLNSLLEQNMSYEDYEIIIVNDGSTDSSLEIIQPYLEKYPN 60 Query: 63 VRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNA 122 +L+ + N G S ARN G+ VA G+Y+ FVD+DD + P + + L +DLDV Sbjct: 61 CKLIDKPNGGLSSARNAGLNVALGEYIWFVDSDDFIMPNLLDDLYKECSSNDLDVLWMEW 120 Query: 123 DWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIK 182 E Q T R + + V+ G ++ S+ + W +Y+R + KN+ Sbjct: 121 RRVSELGIELPQFYCTYRNKISHVINGVQFMEQVCSTYLF---AWAFLYKRCFLQKNSFL 177 Query: 183 FIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSV 222 F+ G+ ++D+ + T R + + Y Y + S+ Sbjct: 178 FMEGILYEDVEFNTRVFPLINRIKIYSRICYNYTCRHGSI 217 >UniRef50_D1JJJ8 Putative uncharacterized protein n=1 Tax=Bacteroides sp. 2_1_16 RepID=D1JJJ8_9BACE Length = 330 Score = 233 bits (596), Expect = 5e-60, Method: Composition-based stats. Identities = 67/270 (24%), Positives = 116/270 (42%), Gaps = 7/270 (2%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPH---V 63 K+S+IIPLYN C+ S+ QT++ +E+I+I+D +D++++I + + Sbjct: 2 KVSIIIPLYNVAPYVEKCLLSVFKQTFSNIEVILIDDCGSDDTMKIVNRLLNVNKNELDI 61 Query: 64 RLL-HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNA 122 ++ H N G S ARN GI+VATG Y+ F+D+DD + E + A + D+ N Sbjct: 62 HVIQHNRNKGLSAARNTGIKVATGDYLFFLDSDDMLSEDCIEKMAFFAEKTCADIVIANY 121 Query: 123 DWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIK 182 D +LT + + + + W + RRD I+ N + Sbjct: 122 DEILPNNSVCNSEC---FYLKDQLLTSNLDILRAFCQKDFFVMAWNKLVRRDFIINNGLL 178 Query: 183 FIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKI 242 F GL H+D +W+ F A +SLY Y + S+++ + + + Sbjct: 179 FKEGLIHEDELWSFMFALKANSCVIRRESLYYYRIRPGSITQRVTLEHVLSLVDILRLMV 238 Query: 243 TRLLEKLNRNYADKIMIYPEFHQQITYEAL 272 T N +K I + + L Sbjct: 239 TAFANIAPPNLLEKKQILSNYIENFKIMIL 268 >UniRef50_B5G7W6 Glycosyltransferase n=2 Tax=Streptomyces RepID=B5G7W6_9ACTO Length = 1200 Score = 233 bits (596), Expect = 5e-60, Method: Composition-based stats. Identities = 67/231 (29%), Positives = 109/231 (47%), Gaps = 17/231 (7%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 +LSVI+P+YN C++SL QT+ LE+I+++DGS D+S IA YAE RL+ Sbjct: 13 RLSVIVPIYNVETYLPACLDSLRAQTFEDLEVIMVDDGSPDDSARIAADYAELDERFRLV 72 Query: 67 HQANAGASVARNRGIEVAT--GKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 +ANAG ARN G+E + +Y+AFVD+DD + P Y +++ E D A N Sbjct: 73 RKANAGLGAARNTGLEHVSPGSEYLAFVDSDDLIPPDAYRLMVSSLDESGSDFATGNVHH 132 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTH---VVWMGVYRRDVIVKNNI 181 E T ++ +L+G R +S R + V+RR + Sbjct: 133 LRGEN--------TWQVPLLKMLSGQAVRRTHISERPALVADRIACNKVWRRTFWDEGTG 184 Query: 182 KFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNT----SVSRLHRQ 228 +F G+ ++D T + A + +Y + L S+++ + Sbjct: 185 RFPEGVWYEDTPVTVTAHYRARAIDIIGEPVYFWRLREGEAGLSITQRRTE 235 >UniRef50_A8ZY95 Glycosyl transferase family 2 n=1 Tax=Desulfococcus oleovorans Hxd3 RepID=A8ZY95_DESOH Length = 343 Score = 233 bits (596), Expect = 5e-60, Method: Composition-based stats. Identities = 58/235 (24%), Positives = 115/235 (48%), Gaps = 5/235 (2%) Query: 18 GDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLHQANAGASVAR 77 C+ S+++QT T +EII INDG+ D+S+ I + +A ++++++ N G S R Sbjct: 17 EAYIADCLSSIVSQTLTDMEIICINDGTPDDSMAIVRVFAGRDDRIKIINKENGGLSDTR 76 Query: 78 NRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR-----ETGET 132 N GI+ A G+Y+ F+D+DD + E L A++++LD+ + F E Sbjct: 77 NVGIDNANGEYIYFIDSDDYLDIRALEMLYAKAVQENLDIVYFDTKPVFETRELKERFRG 136 Query: 133 WQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGLHHQDI 192 ++ + + V+TG + L + V++ ++R I NNIKF G+ H+D Sbjct: 137 CEAYYSRKKAYDDVVTGQEKLVEMAENNDVLVPVFLQFFKRSFIRDNNIKFYKGILHEDN 196 Query: 193 VWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRLLE 247 +++ + A R + + L+ + S+ ++ + Y +++ L+ Sbjct: 197 LFSMQVAALAQRVSHINECLHIRRIRPGSIMTGEKRFANSYGYFICEMELIEFLK 251 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P11290 Uncharacterized glycosyltransferase yibD n=58 Ta... 389 e-107 UniRef50_B4SXB8 Glycosyl transferase, group 2 family protein n=3... 379 e-103 UniRef50_D1RQX9 Putative glycosyl transferase n=1 Tax=Serratia o... 327 5e-88 UniRef50_A8GFL8 Glycosyl transferase family 2 n=1 Tax=Serratia p... 327 6e-88 UniRef50_B6XVK3 Putative uncharacterized protein n=1 Tax=Bifidob... 311 2e-83 UniRef50_C7H3W2 Capsular polysaccharide biosynthesis protein n=1... 310 4e-83 UniRef50_A9VPZ1 Glycosyl transferase family 2 n=3 Tax=Firmicutes... 309 9e-83 UniRef50_A8SCX5 Putative uncharacterized protein n=1 Tax=Faecali... 306 9e-82 UniRef50_B5CP79 Putative uncharacterized protein n=1 Tax=Ruminoc... 304 3e-81 UniRef50_Q5LIR2 Putative glycosyltransferase n=6 Tax=Bacteroides... 302 1e-80 UniRef50_D1NJF8 Glycosyl transferase family 2 n=1 Tax=Clostridiu... 302 2e-80 UniRef50_A7B4B8 Putative uncharacterized protein n=1 Tax=Ruminoc... 300 4e-80 UniRef50_B7AUI4 Putative uncharacterized protein n=1 Tax=Bactero... 300 4e-80 UniRef50_B6FJ78 Putative uncharacterized protein n=1 Tax=Clostri... 299 8e-80 UniRef50_B5XTH2 Glycosyl transferase, group 2 family protein n=2... 299 9e-80 UniRef50_A6NTT2 Putative uncharacterized protein n=1 Tax=Bactero... 299 1e-79 UniRef50_C0WDB4 Ss-1,4-galactosyltransferase n=1 Tax=Acidaminoco... 296 5e-79 UniRef50_A6L4H5 Glycosyltransferase family 2 n=2 Tax=Bacteroides... 296 5e-79 UniRef50_D2EWI1 Glycoside transferase family 2 protein n=3 Tax=B... 296 6e-79 UniRef50_C0BRJ5 Putative uncharacterized protein n=1 Tax=Bifidob... 296 7e-79 UniRef50_B0ACE9 Putative uncharacterized protein n=2 Tax=Firmicu... 295 1e-78 UniRef50_C4FFR1 Putative uncharacterized protein n=1 Tax=Bifidob... 295 1e-78 UniRef50_P71059 Uncharacterized glycosyltransferase epsJ n=3 Tax... 295 2e-78 UniRef50_A1A383 Putative glycosyltransferase n=1 Tax=Bifidobacte... 294 2e-78 UniRef50_B5LBK9 Putative glycosyl transferase n=1 Tax=Streptococ... 293 5e-78 UniRef50_B7GUL7 Glycosyl transferase, family 2 n=1 Tax=Bifidobac... 291 2e-77 UniRef50_D2Q655 Glycosyl transferase, family 2 n=2 Tax=Bifidobac... 291 2e-77 UniRef50_C0FV15 Putative uncharacterized protein n=1 Tax=Rosebur... 290 4e-77 UniRef50_D1BQ92 Glycosyl transferase family 2 n=1 Tax=Veillonell... 290 4e-77 UniRef50_A7VPB1 Putative uncharacterized protein n=1 Tax=Clostri... 290 4e-77 UniRef50_D2Q661 Glycosyl transferase family 2 n=2 Tax=Bifidobact... 290 4e-77 UniRef50_D1PNS5 Capsular polysaccharide biosythesis protein CpsI... 290 5e-77 UniRef50_C4FMC2 Putative uncharacterized protein n=3 Tax=Firmicu... 290 5e-77 UniRef50_A7VCQ5 Putative uncharacterized protein n=1 Tax=Clostri... 290 7e-77 UniRef50_B3JEB6 Putative uncharacterized protein n=1 Tax=Bactero... 289 8e-77 UniRef50_C6LFI9 Glycosyl transferase, group 2 family n=1 Tax=Bry... 289 9e-77 UniRef50_C9RJB8 Glycosyl transferase family 2 n=1 Tax=Fibrobacte... 288 2e-76 UniRef50_A3CKM9 Glycosyltransferase, family 2/glycosyltransferas... 287 4e-76 UniRef50_D1PIY0 Glycosyl transferase, group 2 family n=1 Tax=Sub... 287 4e-76 UniRef50_B5CU57 Putative uncharacterized protein n=1 Tax=Bactero... 286 7e-76 UniRef50_UPI0001B4A4FE ss-1,4-galactosyltransferase n=1 Tax=Bact... 286 7e-76 UniRef50_C0BRK2 Putative uncharacterized protein n=1 Tax=Bifidob... 286 7e-76 UniRef50_D1A4K2 CDP-glycerol:poly(Glycerophosphate)glycerophosph... 286 8e-76 UniRef50_Q9F0B8 Beta(1,4)galactosyltransferase EpsJ n=1 Tax=Lact... 286 8e-76 UniRef50_C9BVV0 Glycosyl transferase n=3 Tax=Enterococcus faeciu... 286 8e-76 UniRef50_C3Q1B0 Glycosyltransferase family 2 protein n=2 Tax=Bac... 286 9e-76 UniRef50_D2Q652 Glycosyltransferase, family 2 n=1 Tax=Bifidobact... 286 1e-75 UniRef50_C7UWL4 Glycosyl transferase n=6 Tax=Enterococcus RepID=... 286 1e-75 UniRef50_D2Q666 Glycosyl transferase, family 2 n=4 Tax=cellular ... 285 1e-75 UniRef50_A4VTV1 Glycosyltransferases involved in cell wall bioge... 285 1e-75 UniRef50_D2Q658 Glycosyl transferase, family 2 n=3 Tax=Bacteria ... 285 1e-75 UniRef50_B4AD18 Glycosyltransferase n=18 Tax=Enterococcus RepID=... 285 1e-75 UniRef50_C0C2F7 Putative uncharacterized protein n=1 Tax=Clostri... 285 1e-75 UniRef50_B3DP51 Glycosyltransferase for cell wall membrane n=10 ... 284 4e-75 UniRef50_C5NWF7 Glycosyl transferase, group 2 family n=2 Tax=Fir... 283 6e-75 UniRef50_B2ISC6 Glycosyl transferase, family 2/glycosyl transfer... 282 1e-74 UniRef50_C8NFZ4 Glycosyl transferase CpsJ(V) n=1 Tax=Granulicate... 281 2e-74 UniRef50_C7H3W5 Glycosyl transferase CpsJ n=1 Tax=Faecalibacteri... 281 4e-74 UniRef50_Q8GPB8 Eps6I n=5 Tax=Streptococcus RepID=Q8GPB8_STRTR 280 4e-74 UniRef50_D2Q667 Glycosyl transferase, family 2 n=3 Tax=Bifidobac... 280 6e-74 UniRef50_A6NTS0 Putative uncharacterized protein n=1 Tax=Bactero... 280 6e-74 UniRef50_C6LCR2 Glycosyl transferase CpsJ n=1 Tax=Bryantella for... 279 8e-74 UniRef50_P71057 Putative glycosyltransferase epsH n=3 Tax=Bacill... 279 9e-74 UniRef50_B4F184 Glycosyl transferase n=2 Tax=Proteus mirabilis R... 279 9e-74 UniRef50_D1NRU6 Glycosyl transferase CpsJ n=1 Tax=Bifidobacteriu... 279 9e-74 UniRef50_C0BRQ0 Putative uncharacterized protein n=2 Tax=Bifidob... 279 1e-73 UniRef50_D2RJ84 Glycosyl transferase family 2 n=1 Tax=Acidaminoc... 278 2e-73 UniRef50_Q03KL3 Glycosyltransferase involved in cell wall biogen... 278 2e-73 UniRef50_A6L7A5 Glycosyltransferase family 2 n=6 Tax=Bacteroides... 278 2e-73 UniRef50_B2UQP2 Glycosyl transferase family 2 n=1 Tax=Akkermansi... 278 3e-73 UniRef50_C0BWY1 Putative uncharacterized protein n=1 Tax=Clostri... 278 3e-73 UniRef50_B1QTB6 Glycosyl transferase, group 2 family protein n=4... 278 3e-73 UniRef50_A8SZ40 Putative uncharacterized protein n=1 Tax=Coproco... 276 5e-73 UniRef50_C2DBB8 Glycosyltransferase n=19 Tax=Bacteria RepID=C2DB... 276 5e-73 UniRef50_Q67AY0 Cps8J n=6 Tax=Streptococcus RepID=Q67AY0_STRAG 275 2e-72 UniRef50_A8FHL3 Glycosyltransferase n=2 Tax=Bacillus pumilus Rep... 275 2e-72 UniRef50_C2M8C3 Glycosyl transferase, family 2 n=1 Tax=Capnocyto... 274 4e-72 UniRef50_D0ANJ0 Putative uncharacterized protein n=1 Tax=Enteroc... 273 6e-72 UniRef50_O86893 Putative glycosyl transferase n=1 Tax=Streptococ... 273 6e-72 UniRef50_C6ID60 Glycosyltransferase n=5 Tax=Bacteroides RepID=C6... 273 7e-72 UniRef50_B2V0U1 Glycosyl transferase, group 2 family protein n=1... 273 7e-72 UniRef50_B2UYP7 Minor teichoic acid biosynthesis protein GgaB n=... 273 8e-72 UniRef50_UPI0001B5362C putative glycosyl transferase n=1 Tax=Str... 272 1e-71 UniRef50_Q5LBM4 Putative glycosyltransferase protein n=4 Tax=Bac... 272 1e-71 UniRef50_A5UNM1 Glycosyltransferase/CDP-glycerol:poly(Glyceropho... 272 1e-71 UniRef50_C3R5C9 Glycosyl transferase WciV n=3 Tax=Bacteroides Re... 271 2e-71 UniRef50_A6NTT8 Putative uncharacterized protein n=1 Tax=Bactero... 271 2e-71 UniRef50_C7MN19 Glycosyl transferase n=1 Tax=Cryptobacterium cur... 271 2e-71 UniRef50_C9LAR4 Glycosyl transferase, family 2/glycosyl transfer... 271 2e-71 UniRef50_B0P2L7 Putative uncharacterized protein n=1 Tax=Clostri... 271 2e-71 UniRef50_A5ZCV6 Putative uncharacterized protein n=1 Tax=Bactero... 271 2e-71 UniRef50_Q815A0 Glycosyltransferase n=6 Tax=Bacillus cereus RepI... 271 4e-71 UniRef50_UPI0001968EF9 hypothetical protein BACCELL_04174 n=1 Ta... 270 4e-71 UniRef50_A6LCC5 Glycosyltransferase family 2 n=2 Tax=Bacteroidal... 270 4e-71 UniRef50_D0R4M4 Glycosyltransferase n=1 Tax=Lactobacillus johnso... 270 4e-71 UniRef50_D1JY96 Putative uncharacterized protein n=1 Tax=Bactero... 270 4e-71 UniRef50_Q9F0B5 Beta(1,4)galactosyltransferase EpsM n=1 Tax=Lact... 270 5e-71 UniRef50_B2ILQ1 Ss-1,4-galactosyltransferase n=6 Tax=Streptococc... 270 5e-71 UniRef50_C7D258 TagF domain-containing protein n=4 Tax=Enterococ... 270 6e-71 UniRef50_B0P2L5 Putative uncharacterized protein n=1 Tax=Clostri... 270 6e-71 UniRef50_A1A380 Putative uncharacterized protein n=1 Tax=Bifidob... 270 6e-71 UniRef50_A6NTT3 Putative uncharacterized protein n=1 Tax=Bactero... 270 7e-71 UniRef50_C7H2S4 Glycosyl transferase, group 2 family n=3 Tax=Clo... 269 9e-71 UniRef50_A6BH37 Putative uncharacterized protein n=1 Tax=Dorea l... 269 1e-70 UniRef50_A7VPB2 Putative uncharacterized protein n=1 Tax=Clostri... 269 1e-70 UniRef50_C6I619 Glycosyl transferase n=4 Tax=Bacteroides RepID=C... 268 1e-70 UniRef50_B0ZTM8 Putative uncharacterized protein n=1 Tax=Campylo... 268 2e-70 UniRef50_A0Q780 Glycosyl transferase, family 2 n=16 Tax=Francise... 268 2e-70 UniRef50_C3X3W6 Putative uncharacterized protein n=1 Tax=Oxaloba... 268 2e-70 UniRef50_C3ADH9 Putative uncharacterized protein n=1 Tax=Bacillu... 268 3e-70 UniRef50_D0T9V5 Glycosyltransferase family 2 n=1 Tax=Bacteroides... 268 3e-70 UniRef50_C0X9Z9 Glycosyltransferase n=1 Tax=Lactobacillus gasser... 267 3e-70 UniRef50_B2RH99 Probable glycosyltransferase n=2 Tax=Porphyromon... 267 4e-70 UniRef50_B9AFV3 Putative uncharacterized protein n=1 Tax=Methano... 267 4e-70 UniRef50_D2RIJ6 Glycosyl transferase family 2 n=1 Tax=Acidaminoc... 267 5e-70 UniRef50_Q5LZN9 Exopolysaccharide biosynthesis protein, sugar tr... 266 5e-70 UniRef50_Q8A2D4 Glycoside transferase family 2 n=2 Tax=Bacteroid... 266 6e-70 UniRef50_Q9AH91 Putative glycosyl transferase n=3 Tax=Streptococ... 266 6e-70 UniRef50_A2STD8 Glycosyl transferase, family 2 n=1 Tax=Methanoco... 266 8e-70 UniRef50_O87182 CpsIaI n=15 Tax=Streptococcus agalactiae RepID=O... 266 9e-70 UniRef50_D0DSJ8 Glycosyl transferase CpsJ n=1 Tax=Lactobacillus ... 266 9e-70 UniRef50_A6P2R1 Putative uncharacterized protein n=1 Tax=Bactero... 266 1e-69 UniRef50_B2UQ55 Glycosyl transferase family 2 n=1 Tax=Akkermansi... 266 1e-69 UniRef50_A6LG23 Glycosyltransferase family 2 n=6 Tax=Bacteroidal... 266 1e-69 UniRef50_C5TEG1 Glycosyl transferase family 2 n=1 Tax=Zymomonas ... 266 1e-69 UniRef50_B1R2U2 Glycosyl transferase, group 2 family protein n=4... 266 1e-69 UniRef50_A8SG33 Putative uncharacterized protein n=1 Tax=Faecali... 265 1e-69 UniRef50_B9YA99 Putative uncharacterized protein n=1 Tax=Holdema... 265 1e-69 UniRef50_C5S6D6 Methyltransferase FkbM family n=1 Tax=Allochroma... 265 2e-69 UniRef50_D2RJ85 Glycosyl transferase family 2 n=1 Tax=Acidaminoc... 265 2e-69 UniRef50_C4ZHK7 Glycosyltransferase Family 2 modular protein n=1... 265 2e-69 UniRef50_B5G7W4 Glycosyltransferase n=3 Tax=Streptomyces RepID=B... 265 2e-69 UniRef50_A6BH38 Putative uncharacterized protein n=1 Tax=Dorea l... 265 2e-69 UniRef50_A8U9H2 Glycosyl transferase, family 2/glycosyl transfer... 265 3e-69 UniRef50_B9YAB2 Putative uncharacterized protein n=1 Tax=Holdema... 264 3e-69 UniRef50_C9RN10 Glycosyl transferase family 2 n=1 Tax=Fibrobacte... 264 4e-69 UniRef50_C2LRT9 Glycosyl transferase, family 2/glycosyl transfer... 264 4e-69 UniRef50_B7BDW0 Putative uncharacterized protein n=1 Tax=Parabac... 264 4e-69 UniRef50_Q8XN54 Capsular polysaccharide biosynthsis protein n=1 ... 263 6e-69 UniRef50_B9Y890 Putative uncharacterized protein n=1 Tax=Holdema... 263 6e-69 UniRef50_Q4JZ78 Putative glycosyl transferase n=1 Tax=Streptococ... 263 6e-69 UniRef50_C3QCN3 Glycoside transferase family 2 n=6 Tax=Bacteroid... 263 7e-69 UniRef50_C3PZ19 Glycosyl transferase n=1 Tax=Bacteroides sp. 9_1... 263 8e-69 UniRef50_C2FKR8 Glycosyltransferase involved in cell wall biogen... 263 9e-69 UniRef50_C2EYX7 Glycosyltransferase n=2 Tax=Lactobacillus reuter... 262 1e-68 UniRef50_C5NWI7 Glycosyl transferase, group 2 family n=1 Tax=Gem... 262 1e-68 UniRef50_B2IQI2 Glycosyl transferase, family 2 n=31 Tax=Firmicut... 262 1e-68 UniRef50_B0P2K9 Putative uncharacterized protein n=1 Tax=Clostri... 262 1e-68 UniRef50_Q047X3 Glycosyltransferase related enzyme n=1 Tax=Lacto... 262 1e-68 UniRef50_Q93DZ7 Eps1I n=1 Tax=Streptococcus thermophilus RepID=Q... 262 1e-68 UniRef50_D2BBH8 Glycosyltransferase involved in cell wall biogen... 262 2e-68 UniRef50_C0YL33 Glycosyltransferase n=1 Tax=Chryseobacterium gle... 262 2e-68 UniRef50_D2Q641 Glycosyl transferase family 2 n=2 Tax=Bifidobact... 261 2e-68 UniRef50_A6NTS7 Putative uncharacterized protein n=1 Tax=Bactero... 261 2e-68 UniRef50_C1CFZ0 Glycosyl transferase, family 2/glycosyl transfer... 261 2e-68 UniRef50_D2BBI0 Putative glycosyl/glycerophosphate transferase i... 261 3e-68 UniRef50_C6Z980 Glycosyltransferase family 2 n=1 Tax=Bacteroides... 261 3e-68 UniRef50_C4G9K6 Putative uncharacterized protein n=1 Tax=Shuttle... 261 3e-68 UniRef50_Q4JZB4 Putative glycosyl transferase n=3 Tax=Streptococ... 261 3e-68 UniRef50_B2UQN8 Glycosyl transferase family 2 n=1 Tax=Akkermansi... 261 4e-68 UniRef50_C0B702 Putative uncharacterized protein n=1 Tax=Coproco... 260 5e-68 UniRef50_B0P2L4 Putative uncharacterized protein n=1 Tax=Clostri... 260 5e-68 UniRef50_C7X8L4 Glycoside transferase, family 2 n=1 Tax=Parabact... 260 5e-68 UniRef50_A7V2P1 Putative uncharacterized protein n=3 Tax=Bactero... 260 6e-68 UniRef50_A7I1X7 Ss-1,4-galactosyltransferase n=1 Tax=Campylobact... 260 7e-68 UniRef50_A5A8E7 Galactosyltransferase n=8 Tax=Streptococcus RepI... 260 7e-68 UniRef50_C9N8V9 Glycosyl transferase family 2 n=1 Tax=Streptomyc... 260 8e-68 UniRef50_Q93TI5 Putative glycosyltransferase CpsIVJ n=1 Tax=Stre... 260 8e-68 UniRef50_C7H1H9 Polysaccharide biosynthesis protein, Glycosyl tr... 260 8e-68 UniRef50_D2BMR3 Glycosyltransferase, family 2 n=1 Tax=Lactococcu... 259 1e-67 UniRef50_B5CQR8 Putative uncharacterized protein n=1 Tax=Ruminoc... 259 1e-67 UniRef50_C6LFJ2 Glycosyl transferase, family 2/glycosyl transfer... 259 1e-67 UniRef50_A8SCX8 Putative uncharacterized protein n=1 Tax=Faecali... 259 1e-67 UniRef50_C3PZ30 Glycosyl transferase n=4 Tax=Bacteroides RepID=C... 259 1e-67 UniRef50_B0NM34 Putative uncharacterized protein n=3 Tax=Bactero... 258 2e-67 UniRef50_B0ABV6 Putative uncharacterized protein n=1 Tax=Clostri... 258 2e-67 UniRef50_A6BIH4 Putative uncharacterized protein n=1 Tax=Dorea l... 258 2e-67 UniRef50_Q93DZ8 EpsG n=2 Tax=Lactobacillales RepID=Q93DZ8_STRTR 258 2e-67 UniRef50_A3KLK8 Cps5J protein n=2 Tax=Streptococcus agalactiae R... 258 2e-67 UniRef50_A6BHF1 Putative uncharacterized protein n=1 Tax=Dorea l... 258 2e-67 UniRef50_A5UMT7 Glycosyltransferase, GT2 family n=2 Tax=Methanob... 258 2e-67 UniRef50_A8RK62 Putative uncharacterized protein n=1 Tax=Clostri... 258 2e-67 UniRef50_D0BKT7 Capsular polysaccharide biosynthesis protein Cps... 258 3e-67 UniRef50_Q74L33 Putative uncharacterized protein n=1 Tax=Lactoba... 258 3e-67 UniRef50_C7H3X5 Putative capsular polysaccharide biosythesis pro... 258 3e-67 UniRef50_Q5M3K9 Glycosyl transferase n=4 Tax=Streptococcus RepID... 257 4e-67 UniRef50_Q8XN37 Capsular polysaccharide biosynthsis protein n=1 ... 257 4e-67 UniRef50_B7HXA1 N-acetylglucosaminyltransferase n=2 Tax=Bacillus... 257 5e-67 UniRef50_C8WM64 Glycosyl transferase family 2 n=1 Tax=Eggerthell... 257 5e-67 UniRef50_A7VUZ8 Putative uncharacterized protein n=1 Tax=Clostri... 257 5e-67 UniRef50_Q02W60 Glycosyltransferase n=1 Tax=Lactococcus lactis s... 257 5e-67 UniRef50_B0P3X3 Putative uncharacterized protein n=1 Tax=Clostri... 257 5e-67 UniRef50_C0CHF0 Putative uncharacterized protein n=2 Tax=Blautia... 257 5e-67 UniRef50_B5CU67 Putative uncharacterized protein n=1 Tax=Bactero... 257 5e-67 UniRef50_UPI0001C4194D glycosyl transferase GT2 family n=1 Tax=M... 256 6e-67 UniRef50_B2DG34 Eps5I n=3 Tax=Streptococcus pneumoniae RepID=B2D... 256 6e-67 UniRef50_Q8A821 Glycoside transferase family 2 n=1 Tax=Bacteroid... 256 7e-67 UniRef50_C6JQJ1 Glycosyl transferase n=1 Tax=Fusobacterium variu... 256 7e-67 UniRef50_D1PNS4 Capsular polysaccharide biosythesis protein CpsI... 256 8e-67 UniRef50_C1YIQ9 Glycosyl/glycerophosphate transferase, teichoic ... 256 8e-67 UniRef50_A1YVC6 Glycosyltransferase n=1 Tax=Lactobacillus johnso... 256 8e-67 UniRef50_C7G7U9 Putative capsular polysaccharide biosythesis pro... 256 1e-66 UniRef50_UPI0001972E41 putative sugar transferase n=1 Tax=Clostr... 256 1e-66 UniRef50_UPI0001C4190B glycosyl transferase GT2 family n=1 Tax=M... 256 1e-66 UniRef50_A6ASH7 Cps2K n=1 Tax=Vibrio harveyi HY01 RepID=A6ASH7_V... 255 1e-66 UniRef50_B0NLZ9 Putative uncharacterized protein n=1 Tax=Bactero... 255 1e-66 UniRef50_C7PMN9 Glycosyl transferase family 2 n=1 Tax=Chitinopha... 255 1e-66 UniRef50_B2UQ58 Glycosyl transferase family 2 n=1 Tax=Akkermansi... 255 2e-66 UniRef50_Q047W8 Glycosyltransferase related enzyme n=1 Tax=Lacto... 255 2e-66 UniRef50_D2PVX1 Glycosyl transferase family 2 n=1 Tax=Kribbella ... 255 3e-66 UniRef50_Q83VE8 EpsN n=1 Tax=Lactococcus lactis subsp. cremoris ... 254 3e-66 UniRef50_A5ZUY3 Putative uncharacterized protein n=1 Tax=Ruminoc... 254 3e-66 UniRef50_B4BNC6 Glycosyl transferase family 2 n=1 Tax=Geobacillu... 254 3e-66 UniRef50_B1JIU1 Glycosyl transferase family 2 n=28 Tax=Enterobac... 254 3e-66 UniRef50_A8RE09 Putative uncharacterized protein n=1 Tax=Eubacte... 254 3e-66 UniRef50_C6A9D3 Glycosyltransferase n=4 Tax=Bifidobacterium anim... 254 3e-66 UniRef50_A3TMK5 Possible glycosyl hydrolase n=1 Tax=Janibacter s... 254 4e-66 UniRef50_C2EBD2 Possible glycosyltransferase n=1 Tax=Lactobacill... 254 4e-66 UniRef50_C9RJ54 Glycosyl transferase family 2 n=1 Tax=Fibrobacte... 254 4e-66 UniRef50_C6WZX8 Beta-1,3-glucosyltransferase n=1 Tax=Flavobacter... 254 4e-66 UniRef50_D0BME4 Glycosyl transferase n=10 Tax=Lactobacillales Re... 254 4e-66 UniRef50_C0CH08 Putative uncharacterized protein n=1 Tax=Blautia... 254 5e-66 UniRef50_Q8XN57 Beta-1,4-galactosyltransferase n=3 Tax=Clostridi... 253 5e-66 UniRef50_C2EWJ8 Putative uncharacterized protein n=1 Tax=Lactoba... 253 5e-66 UniRef50_C6I618 Glycosyltransferase n=5 Tax=Bacteroidales RepID=... 253 5e-66 UniRef50_A5ZFA7 Putative uncharacterized protein n=1 Tax=Bactero... 253 6e-66 UniRef50_B3C8H4 Putative uncharacterized protein n=2 Tax=Bactero... 253 6e-66 UniRef50_A6L5V6 Glycosyltransferase family 2 n=2 Tax=Bacteroides... 253 7e-66 UniRef50_Q2NH61 Predicted glycosyltransferase n=1 Tax=Methanosph... 253 9e-66 UniRef50_UPI0001C377F0 glycosyl transferase, group 2 family prot... 252 1e-65 UniRef50_A7VGT1 Putative uncharacterized protein n=1 Tax=Clostri... 252 1e-65 UniRef50_C4Z667 Putative uncharacterized protein n=1 Tax=Eubacte... 252 2e-65 UniRef50_B5G7W6 Glycosyltransferase n=2 Tax=Streptomyces RepID=B... 251 2e-65 UniRef50_O87183 CpsIaJ n=6 Tax=Streptococcus agalactiae RepID=O8... 251 2e-65 UniRef50_C3AT96 Glycosyltransferase family 2 n=4 Tax=Bacillus Re... 251 2e-65 UniRef50_B0NM04 Putative uncharacterized protein n=1 Tax=Bactero... 251 2e-65 UniRef50_D1WVE3 CDP-glycerol:poly(Glycerophosphate) glycerophosp... 251 3e-65 UniRef50_C9BUH6 Glycosyltransferase n=1 Tax=Enterococcus faecium... 251 3e-65 UniRef50_B7AIU6 Putative uncharacterized protein n=1 Tax=Bactero... 251 4e-65 UniRef50_C1DN10 Glycosyl transferase, family 2 protein n=1 Tax=A... 251 4e-65 UniRef50_B5CU60 Putative uncharacterized protein n=1 Tax=Bactero... 251 4e-65 UniRef50_C7QZJ2 Glycosyl transferase family 2 n=1 Tax=Jonesia de... 250 5e-65 UniRef50_C0FV38 Putative uncharacterized protein n=1 Tax=Rosebur... 250 5e-65 UniRef50_B6FPH2 Putative uncharacterized protein n=1 Tax=Clostri... 250 5e-65 UniRef50_C8PLK6 Ss-1,4-galactosyltransferase n=1 Tax=Campylobact... 250 5e-65 UniRef50_C9K1G9 Putative glycosyltransferase n=1 Tax=Klebsiella ... 250 5e-65 UniRef50_D1XZD8 Glycosyltransferase, group 2 family protein n=1 ... 250 6e-65 UniRef50_B1C3W3 Putative uncharacterized protein n=1 Tax=Clostri... 250 7e-65 Sequences not found previously or not previously below threshold: UniRef50_D1WVE7 CDP-glycerol:poly(Glycerophosphate) glycerophosp... 253 8e-66 UniRef50_D1XQ46 CDP-glycerol:poly(Glycerophosphate) glycerophosp... 252 1e-65 >UniRef50_P11290 Uncharacterized glycosyltransferase yibD n=58 Tax=Enterobacteriaceae RepID=YIBD_ECOLI Length = 344 Score = 389 bits (1000), Expect = e-107, Method: Composition-based stats. Identities = 344/344 (100%), Positives = 344/344 (100%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY Sbjct: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC Sbjct: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN 180 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN Sbjct: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN 180 Query: 181 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYI 240 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYI Sbjct: 181 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYI 240 Query: 241 KITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMY 300 KITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMY Sbjct: 241 KITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMY 300 Query: 301 KRLITNVRSVKVGYQALLWSFRLWQWRDKTRSHHRITRSAFNLR 344 KRLITNVRSVKVGYQALLWSFRLWQWRDKTRSHHRITRSAFNLR Sbjct: 301 KRLITNVRSVKVGYQALLWSFRLWQWRDKTRSHHRITRSAFNLR 344 >UniRef50_B4SXB8 Glycosyl transferase, group 2 family protein n=36 Tax=Enterobacteriaceae RepID=B4SXB8_SALNS Length = 344 Score = 379 bits (973), Expect = e-103, Method: Composition-based stats. Identities = 276/343 (80%), Positives = 303/343 (88%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M NS K+S+I+PLYNAG DF CM SLI QTW+ALEIII+NDGSTD+SVEIAK+YAE+Y Sbjct: 1 MKNSKTKVSIIVPLYNAGADFNACMASLIAQTWSALEIIIVNDGSTDHSVEIAKHYAEHY 60 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 PHVRLLHQANAGASVARN G++ ATG YVAFVDADD+VYP MYETLMTMAL DDLDVAQC Sbjct: 61 PHVRLLHQANAGASVARNLGLQAATGDYVAFVDADDQVYPKMYETLMTMALNDDLDVAQC 120 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN 180 NADWC R+TG WQSIPTDRLRSTGVL+GPDWLRM L+SRRWTHVVWMGVYRR +I NN Sbjct: 121 NADWCVRKTGHAWQSIPTDRLRSTGVLSGPDWLRMALASRRWTHVVWMGVYRRALITDNN 180 Query: 181 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYI 240 I F+ GLHHQDI+W+TE MFNA R RYTEQSLYKY+LH+ SVSRL RQGNKNLNYQRHYI Sbjct: 181 ITFVPGLHHQDILWSTEVMFNATRVRYTEQSLYKYFLHDNSVSRLQRQGNKNLNYQRHYI 240 Query: 241 KITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMY 300 KITRLLEKLNR+YA +I IYPEF QQIT+EALRVCHAVRKEPDILTRQRMIAEIFTSGMY Sbjct: 241 KITRLLEKLNRDYARRIPIYPEFRQQITWEALRVCHAVRKEPDILTRQRMIAEIFTSGMY 300 Query: 301 KRLITNVRSVKVGYQALLWSFRLWQWRDKTRSHHRITRSAFNL 343 +R++ NVRS K YQ LLWSFRLWQWRDKT SH R+ R A NL Sbjct: 301 RRMMANVRSAKAAYQTLLWSFRLWQWRDKTLSHRRMARKALNL 343 >UniRef50_D1RQX9 Putative glycosyl transferase n=1 Tax=Serratia odorifera 4Rx13 RepID=D1RQX9_SEROD Length = 335 Score = 327 bits (838), Expect = 5e-88, Method: Composition-based stats. Identities = 152/325 (46%), Positives = 206/325 (63%), Gaps = 1/325 (0%) Query: 5 TNKL-SVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 + SVI+PL+NAG F M SL+ Q LE+II+NDGSTD S IA YA YPH+ Sbjct: 1 MPLILSVIVPLHNAGALFEPFMASLLAQDERRLEVIIVNDGSTDGSGAIAHRYARQYPHI 60 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 ++ Q N+G S ARN G+ +A GKYVAF DADD + P MY TL+ +A ++ LDV QCN Sbjct: 61 TVIDQPNSGVSCARNAGLAIAQGKYVAFPDADDLLAPEMYSTLVALAEQEQLDVMQCNGQ 120 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 F E + IP RL TGV++G WL L +R++ HVVW+ VYR D I ++ + F Sbjct: 121 RYFSRENELQEIIPQKRLGDTGVISGAQWLERALKTRKFIHVVWLAVYRLDFIKRHQLYF 180 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKIT 243 GLHHQDI WTTE MFNA R +Y ++LY+ +H+ S+S R G N+ YQRHY+KI Sbjct: 181 EPGLHHQDIPWTTEVMFNAQRVKYLSKALYRQRVHDQSISNRRRTGKANVEYQRHYMKIV 240 Query: 244 RLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRL 303 +L LN+ YA K++I P FH QI EAL +CH++R+EP+ + ++ AE + G+ + + Sbjct: 241 EMLVVLNQRYASKMVIRPAFHWQIAREALGICHSIRREPEPAAQCQITAEFYQRGIDRTM 300 Query: 304 ITNVRSVKVGYQALLWSFRLWQWRD 328 N R +K + LW RL QWRD Sbjct: 301 AANARGLKQIWHVTLWLHRLKQWRD 325 >UniRef50_A8GFL8 Glycosyl transferase family 2 n=1 Tax=Serratia proteamaculans 568 RepID=A8GFL8_SERP5 Length = 332 Score = 327 bits (838), Expect = 6e-88, Method: Composition-based stats. Identities = 156/322 (48%), Positives = 209/322 (64%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 LSVI+PL+N G+ F + SL+ Q LE+II+NDGSTD S +IA YA+ + H+ ++ Sbjct: 2 LSVIVPLHNVGELFEPFLASLLAQNERRLEVIIVNDGSTDGSGDIAHRYAQQHTHITVID 61 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 Q NAG S ARN G+ VA GKYVAF DADD + P MY TL+ +A ++ LDV QCN + F Sbjct: 62 QLNAGVSCARNAGLAVARGKYVAFPDADDLLAPEMYSTLIELAEQNQLDVMQCNGERYFT 121 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 + E IP RL T V++G WL L SR++ HVVW+ VYR D I ++ + F GL Sbjct: 122 DQDELQPIIPLKRLGDTDVISGVQWLERALESRKFIHVVWLAVYRLDFIKQHRLYFEPGL 181 Query: 188 HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRLLE 247 HHQDI WTTE MFNA R +Y ++LY+ +H S+S R G N+ YQRHY+KI +L Sbjct: 182 HHQDIPWTTEVMFNAQRVKYLGKALYRQRVHGQSISNRRRTGKANVEYQRHYMKIVAMLV 241 Query: 248 KLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLITNV 307 LNR YADKI I P FH QIT EAL +CH++R+EP++ + R+ E F G+ + ++ N Sbjct: 242 ALNRRYADKIAIRPAFHWQITREALGICHSIRREPELAAQCRITDEFFQLGIDRAMVDNA 301 Query: 308 RSVKVGYQALLWSFRLWQWRDK 329 R +K G+ +LW RL QWR Sbjct: 302 RGLKQGWHVILWLHRLKQWRHG 323 >UniRef50_B6XVK3 Putative uncharacterized protein n=1 Tax=Bifidobacterium catenulatum DSM 16992 RepID=B6XVK3_9BIFI Length = 347 Score = 311 bits (798), Expect = 2e-83, Method: Composition-based stats. Identities = 75/327 (22%), Positives = 140/327 (42%), Gaps = 19/327 (5%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 + + +SVIIP+Y C+ S++ Q++ LEII+++DGSTD + +A + Sbjct: 11 TMDLVSVIIPIYGVEQYLDRCVRSVVNQSYAHLEIILVDDGSTDRCPAMCDAWANKDSRI 70 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 +++H+ N G S ARN G+++ATG+++AFVD+DD V P T++ A ++ D+ C+ Sbjct: 71 QVIHKPNGGLSSARNAGLDMATGEFIAFVDSDDYVEPDYIATMIDAAQKNHADLVMCSVF 130 Query: 124 WCFRETGETWQSIPTDRLRSTGV------LTGPDWLRMGLSSRRWTHVV-WMGVYRRDVI 176 + Q+ P R + +G D +R +VV W +YR + Sbjct: 131 HEDADGNAIEQTAPVRRKEYAPITDTLRTCSGIDCMRQRGEENGVDNVVAWNKLYRCQLW 190 Query: 177 VKNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQ 236 ++++ G H+D T A + LY Y S+ +L+ Sbjct: 191 K--DLRYPVGKLHEDEFVTYRIFGRTRTAVLLPERLYHYIERTGSIMHTKYTLQ-SLDII 247 Query: 237 RHYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFT 296 I R L + ++P F Q+ + + + R + + +F Sbjct: 248 EALIGKVRFLLD-----EGAMDLFPVFFSQLKDSIYKARQLDWSDAQVHKRLKELFRMFR 302 Query: 297 SGMYKRLITNVRSVKVGYQALLWSFRL 323 + + +++R + Q RL Sbjct: 303 TIPW----SSIRYLPFKDQVNYMGTRL 325 >UniRef50_C7H3W2 Capsular polysaccharide biosynthesis protein n=1 Tax=Faecalibacterium prausnitzii A2-165 RepID=C7H3W2_9FIRM Length = 318 Score = 310 bits (796), Expect = 4e-83, Method: Composition-based stats. Identities = 79/317 (24%), Positives = 125/317 (39%), Gaps = 6/317 (1%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 T +SVI+P+YN C++S++ QT+ LE+++++DGSTD S I YA V Sbjct: 2 ETELISVIVPVYNVERYLCRCVDSILHQTYRNLEVLLVDDGSTDASGAICDEYAAQEERV 61 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 +HQ N G S ARN G+E A G Y+ FVD+DD + M ETL E D DVA Sbjct: 62 TAVHQKNGGLSAARNTGLERAKGTYLCFVDSDDLLDSRMLETLCRDLQEQDADVAVVGFR 121 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 RE + P + +TG + +R L S W +Y+R++ +I++ Sbjct: 122 MFEREE----ELAPAELAVPVQCMTGREAIRSTLVSDELGDFAWNKLYKRELFR--DIRY 175 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKIT 243 G +D T + Y LY YY S+ Y K Sbjct: 176 PLGRMMEDQGTTYRIFQQCSKVVYRPVPLYYYYQRPDSILHRRNMKFYEDKLDMGYQKYR 235 Query: 244 RLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRL 303 + E + + Y + E + A++ + ++ Sbjct: 236 AIREAYPGMPENDAAMLSVVEHCYPYLMQDTVRRAKMEAFLQECDPAAAKLLCTSSQRKY 295 Query: 304 ITNVRSVKVGYQALLWS 320 S ++ + L Sbjct: 296 QLLRCSRRLYAKLFLLK 312 >UniRef50_A9VPZ1 Glycosyl transferase family 2 n=3 Tax=Firmicutes RepID=A9VPZ1_BACWK Length = 324 Score = 309 bits (793), Expect = 9e-83, Method: Composition-based stats. Identities = 74/335 (22%), Positives = 154/335 (45%), Gaps = 14/335 (4%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 K+S+I+P+Y C++S++ QT+ E+I+++DGS D EI YA+ ++ Sbjct: 2 NPKISIIVPVYKVERYIHKCIDSILVQTFEDFELILVDDGSPDTCGEICDRYADKDMRIK 61 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++H+ N G S ARN G+E A G Y+ FVD+DD + P MYE L M +E D+A C++ Sbjct: 62 VIHKENGGLSSARNAGLENAKGDYIGFVDSDDWIEPDMYELLYGMCVEQGCDIAICSSQI 121 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 + T + P + ++ L + + VVW +++R+++ +IKF Sbjct: 122 HYSNRIVTSSNHPF------IIHDRNTAMKTMLEGKLYDEVVWTKLFKRNLLE--DIKFP 173 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 G+ ++D +T + + + + + Y Y + S+ ++ Y ++ + Sbjct: 174 VGMSYEDTAFTYKVIHKCKKVCFIGEPKYHYIKRDNSMM-DRAVKELKIDSVLIYDEMCK 232 Query: 245 LLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLI 304 +E+ + ++ + + + R + ++I I + YK + Sbjct: 233 FMERYYKEL--YSLVILKLGNCSMVVLNLISMSGRFNEHKVKYYKVIG-ILNNYFYKTM- 288 Query: 305 TNVRSVKVGYQALLWSFRLWQWRDKTRSHHRITRS 339 ++++ + LL++ +L K ++ R Sbjct: 289 -SLKAYGKSVKFLLFATKLHPLLYKFLINNFAKRG 322 >UniRef50_A8SCX5 Putative uncharacterized protein n=1 Tax=Faecalibacterium prausnitzii M21/2 RepID=A8SCX5_9FIRM Length = 322 Score = 306 bits (784), Expect = 9e-82, Method: Composition-based stats. Identities = 75/289 (25%), Positives = 124/289 (42%), Gaps = 16/289 (5%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 LS+I+P+Y + + C++S++ QT+T E+I++ DGS D + A +R+LH Sbjct: 2 LSIIVPVYKVENYLQKCIDSILAQTFTDFELILVEDGSPDGCPALCDAAAAKDARIRVLH 61 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 Q N G S ARN G++VA G+++ FVD+DD + P MYETL D+A C+ Sbjct: 62 QKNGGLSAARNAGLDVARGEWIGFVDSDDYIAPEMYETLYKAVQSTGADLALCDYAAVDE 121 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 +P + + TG + L+ + W +YRR + +++ G Sbjct: 122 AGTPC---LPPYTGLAQRIFTGRELLKKA--TNTMAQPAWNKLYRRVIFA--QLRYPEGK 174 Query: 188 HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRLLE 247 ++DI E N +A + LY Y +S+ G+K L + R Sbjct: 175 LNEDIFVLPEICLNTKKAVVVPKELYYYVQRGSSIMN----GSKTLRHFDAAEAAQRYWN 230 Query: 248 KLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFT 296 L N +Y + V R+ P L + E+ Sbjct: 231 CLVEN-----EVYDALANAAKFTMGSVSRVYRQLPPALRKAPRSREMLR 274 >UniRef50_B5CP79 Putative uncharacterized protein n=1 Tax=Ruminococcus lactaris ATCC 29176 RepID=B5CP79_9FIRM Length = 315 Score = 304 bits (780), Expect = 3e-81, Method: Composition-based stats. Identities = 72/319 (22%), Positives = 140/319 (43%), Gaps = 12/319 (3%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 + +SVI+P+Y + C++S++ QT+ LE+I+++DGS D I YA VR Sbjct: 2 QDLISVIVPVYKVEKYLKRCVDSILAQTYPCLEVILVDDGSPDGCPAICDEYAREDRRVR 61 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++H+ N G S ARN GI+ A GK++ FVD+DD V+P YE L+ E+ D+A C+ Sbjct: 62 VIHKENGGLSDARNAGIDAAKGKFLGFVDSDDYVHPRFYELLLQALKEEGADIAGCDVKK 121 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRW--THVVWMGVYRRDVIVKNNIK 182 + V +G + ++ + + V W +Y++++ + I+ Sbjct: 122 V---CKTEKIEEKEQQPIQRTVYSGREATANLYDAQMYLKSVVAWNKLYKKELFEE--IR 176 Query: 183 FIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKI 242 F G H+D + + ++ + Y ++ Y Y+ + Q + Y ++ Sbjct: 177 FPKGKLHEDEFTSYKLLYQSESVVYINRAYYFYFQREDGIM-GKEQRKISPAVLEAYEEM 235 Query: 243 TRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKR 302 R + ++M+Y Y AV+ + R + Y++ Sbjct: 236 ERFFSEKGEKDLLQMMMYRYLSMVRRY----ASDAVKSGDEEEIRLGRYYDKKFRKDYRK 291 Query: 303 LITNVRSVKVGYQALLWSF 321 I ++ K ++ ++ Sbjct: 292 YIWKIKKPKRKFRLWMYYH 310 >UniRef50_Q5LIR2 Putative glycosyltransferase n=6 Tax=Bacteroides RepID=Q5LIR2_BACFN Length = 356 Score = 302 bits (774), Expect = 1e-80, Method: Composition-based stats. Identities = 71/334 (21%), Positives = 139/334 (41%), Gaps = 15/334 (4%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 +S+I+P+YN + C++SLI QT+ +EII+++DGSTD+S I +AE Sbjct: 4 NKPLPLISIIVPIYNIAEYASECIQSLINQTYKNIEIILVDDGSTDHSPAICDEFAEQDE 63 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 ++++H+ N G S ARN G++VATG+Y+ FVD DD V MYETL + E D++ C Sbjct: 64 RIKVIHKRNGGLSDARNAGLDVATGEYIGFVDGDDWVDEDMYETLYHLIYEHQADISICT 123 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNI 181 ++ + + T + + + + + + +W +++R++ + + Sbjct: 124 HYTELPN-----RTKVKYKSKKTKIFSSQKAIATLIEDKIIQNYIWEKLFKRELFTE--L 176 Query: 182 KFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIK 241 +F G +DI + A + + Y Y S++ R K Y + + Sbjct: 177 RFPVGWSFEDIALCYKVFHKARKIVLLQTPKYHYRTRPGSITNSTRNPLKEFQYLQALHE 236 Query: 242 ITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYK 301 + + N + + I + + P +++ I + F Sbjct: 237 QFQFAAENNIKVRKPKKLVQKTFHFINHIII--------LPPSSLKKKYINDAFEIAHTY 288 Query: 302 RLITNVRSVKVGYQALLWSFRLWQWRDKTRSHHR 335 + N + + + +R Sbjct: 289 DYLRNWEIGVAATLRRFFVYNYFNAYASVIITYR 322 >UniRef50_D1NJF8 Glycosyl transferase family 2 n=1 Tax=Clostridium thermocellum JW20 RepID=D1NJF8_CLOTM Length = 322 Score = 302 bits (773), Expect = 2e-80, Method: Composition-based stats. Identities = 82/334 (24%), Positives = 145/334 (43%), Gaps = 13/334 (3%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 +SVI+P+YN C+ES++ QT++ LEII++NDGS D EI YA+ V+ Sbjct: 1 MELISVIVPVYNVEKYLPACIESILCQTYSELEIILVNDGSPDKCPEICDEYAKKDSRVK 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++HQ+N G S ARN GI+ ++GKY+ FVD+DD + PTM E L T ++ + +A C + Sbjct: 61 VIHQSNQGLSSARNTGIDNSSGKYIIFVDSDDTIEPTMVEDLYTCVIKYNCSMAACGRKY 120 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 F + + V + D +R + + W +Y+R++ NI+F Sbjct: 121 VFEDGKVVCKVPDN----IEKVFSFSDAIREMNTYFYFDMSAWAKIYKRELFD--NIRFP 174 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 G +D + + + + Y + LY Y +S+SR + Sbjct: 175 VGRLSEDFFIMYKLIALSQKVAYISKPLYNYLQRKSSISRNKNINH------DFITAARE 228 Query: 245 LLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLI 304 +E L++NY + +I + L + + + Y + Sbjct: 229 QMEFLDKNYPELAIIGHTAYASANLTVLDFYIKNGVDCPREKLNEFRKAVRDNMEYIKSN 288 Query: 305 TNVRSVKVGYQALLWSFRLWQWRDKTRSHHRITR 338 + K Q L+ F ++ + ++ R Sbjct: 289 KYLSKSK-RIQFSLFLFSYHIYKLVFSIYRKVIR 321 >UniRef50_A7B4B8 Putative uncharacterized protein n=1 Tax=Ruminococcus gnavus ATCC 29149 RepID=A7B4B8_RUMGN Length = 336 Score = 300 bits (770), Expect = 4e-80, Method: Composition-based stats. Identities = 82/324 (25%), Positives = 138/324 (42%), Gaps = 11/324 (3%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 + +S+I+P+YN TC++SLI QT +E+I+++DGSTD S E+ YAE + Sbjct: 2 DMSLVSIIVPVYNVEPYIETCIQSLIRQTMGNIEVILVDDGSTDRSGELCDQYAEADERI 61 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 R++H+ N G S ARN GI A G+Y+ FVD+DD V ++ E ++ A ++ DV + Sbjct: 62 RVIHKQNGGLSSARNAGISAAKGEYLLFVDSDDYVSASLVEKTVSCAEQNQADVVVFDYQ 121 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 E + T L + V+ ++ L T +Y+R I + Sbjct: 122 EI--ELCSGGKQTRTSALPAGQVIHAESVPQLLL----ITPSACNKLYKRSFWEMTQIVY 175 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTE-QSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKI 242 G + +D+ + A R Y E LY Y L S+ R ++ + Sbjct: 176 PEGRNFEDLSVIPRILLQAGRVVYLEGDPLYFYVLRVGSIMRSGAFEKGWEERRKAISDV 235 Query: 243 TRLLEK--LNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMY 300 + LLE+ + + Y ++ Y F + V R P + + + + Sbjct: 236 SNLLEEKGMEQQYKKELE-YLAFEHLYFVPSKEVVLKDRNNPCLKEWRSYLDLQYPDWKE 294 Query: 301 KRLITNVRSVKVGYQALLWSFRLW 324 I + K LL R + Sbjct: 295 NPYIRQLSG-KDKILLLLLRHRCY 317 >UniRef50_B7AUI4 Putative uncharacterized protein n=1 Tax=Bacteroides pectinophilus ATCC 43243 RepID=B7AUI4_9BACE Length = 333 Score = 300 bits (770), Expect = 4e-80, Method: Composition-based stats. Identities = 73/330 (22%), Positives = 127/330 (38%), Gaps = 7/330 (2%) Query: 2 MNSTNK--LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAEN 59 MN+ K +SVI+P+Y C++S+I QT+T L+II+++DGS D S I YA Sbjct: 7 MNTPYKDLISVIVPVYKVERYLCRCVDSIIAQTYTNLQIILVDDGSPDGSGAICDDYAAK 66 Query: 60 YPHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQ 119 + ++H+ N G S ARN G+ A G Y+ FVD+DD + MYE L + + D++ Sbjct: 67 DSRITVIHKKNGGLSDARNAGLVAACGSYIGFVDSDDYIAVDMYEKLYKAIVACNADMSV 126 Query: 120 CNADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRW-THVVWMGVYRRDVIVK 178 CN + E G + V+ G L L + W W +Y R ++ Sbjct: 127 CNFRYV-DENGNNIEERNNALPVKDEVIDGITALTRTLDDKGWYYVTAWNRLYSRKLLQS 185 Query: 179 NNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRH 238 I F G H+D A + + Y Y S+ + G +NL+ Sbjct: 186 --IYFPDGKLHEDEFTVHRIYSKADKVACVADACYMYVQRGGSIMQTGY-GIRNLDAVEA 242 Query: 239 YIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSG 298 + L + + + + +++ + Sbjct: 243 LEQRFEFCVNLPEMSDYILRLSQRIYSMLVEAGIKLDMHNAGCRQRYKELKKTYIKQCRQ 302 Query: 299 MYKRLITNVRSVKVGYQALLWSFRLWQWRD 328 + K + V + + + ++ Sbjct: 303 IKKLFTLRQWIIDVAVFEIFPAINMAVFKR 332 >UniRef50_B6FJ78 Putative uncharacterized protein n=1 Tax=Clostridium nexile DSM 1787 RepID=B6FJ78_9CLOT Length = 321 Score = 299 bits (767), Expect = 8e-80, Method: Composition-based stats. Identities = 70/321 (21%), Positives = 138/321 (42%), Gaps = 13/321 (4%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 K+SVI+P+Y C++S++ QT++ +E+I+++DG+ D EI Y E V+ Sbjct: 1 MPKISVIVPVYKVEKYIHKCVDSILNQTFSDIEVILVDDGTPDRCGEICDAYGEQDSRVK 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++H+ N G S ARN G+ A+G+Y+ F+D+DD + M E + T E D+A C Sbjct: 61 VIHKENGGLSDARNAGMPHASGEYIIFIDSDDYIESDMLEYMYTRLTEAGADMATCGLYE 120 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 + + E + V +G + R L +W + RR I +++F Sbjct: 121 VYSDRIEQQKE------EKDFVCSGEEAFRCILQGHTIRGEIWNKLIRRSCIE--DLRFP 172 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 G ++DI +T + M + + Y Y S++ + + Y K + Sbjct: 173 KGKLYEDIYFTVDMMQRIKKVAVGTKPKYYYLHREDSITGKPYRPQ-LYDIIDGYTKNYK 231 Query: 245 LLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVR----KEPDILTRQRMIAEIFTSGMY 300 +++K + ++ + + I + + + R ++ +L +R + I + + Sbjct: 232 VVQKAFPSLTEEAECLWLWSRFIVLDKMLLEENYRTIEGRKELVLFIRRHMIRILKNKYF 291 Query: 301 KRLITNVRSVKVGYQALLWSF 321 R + L Sbjct: 292 NRNRKISAMILFLNLELYRKL 312 >UniRef50_B5XTH2 Glycosyl transferase, group 2 family protein n=21 Tax=Enterobacteriaceae RepID=B5XTH2_KLEP3 Length = 329 Score = 299 bits (767), Expect = 9e-80, Method: Composition-based stats. Identities = 122/327 (37%), Positives = 183/327 (55%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M+ T LS++ +YN E L Q + E+I++NDGSTDNS+ + + + Sbjct: 1 MSQTPLLSIVAAVYNGEKFLAQFFECLEQQQLESYELILVNDGSTDNSLAVIAEWRDRLQ 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 +V++L Q N G SVARN G+ A+GKY+AF D DD++YP MY TL+ MA ++ LD+A CN Sbjct: 61 NVQVLEQENQGVSVARNTGLAAASGKYLAFPDIDDKLYPGMYRTLLEMAEKEHLDIATCN 120 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNI 181 + + + E+ P DRL STGVL G WL+ L SR++ HV W+ +YR D I ++ Sbjct: 121 GTYVYEKRRESHPIFPLDRLPSTGVLPGHVWLKQALDSRKFLHVTWLNIYRHDFIRQHRF 180 Query: 182 KFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIK 241 F GL HQDI WTTE + A R +YT Q Y YY+H+ SVS + + RHY+K Sbjct: 181 HFEPGLRHQDIPWTTEALLAAERVQYTSQQFYDYYIHSESVSHKPDNDDTLMRSARHYMK 240 Query: 242 ITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYK 301 I +LE +N+ Y DK+ QI E L + H D + +I E F G+++ Sbjct: 241 ILEMLEAINQRYPDKVRHIAACRWQIAKEGLGIIHTFDSMKDESKKHVIIKEFFDRGIWR 300 Query: 302 RLITNVRSVKVGYQALLWSFRLWQWRD 328 + N + ++ ++ R+ ++R Sbjct: 301 LIWKNACTFRLRWRLGRRYLRIKRYRH 327 >UniRef50_A6NTT2 Putative uncharacterized protein n=1 Tax=Bacteroides capillosus ATCC 29799 RepID=A6NTT2_9BACE Length = 324 Score = 299 bits (766), Expect = 1e-79, Method: Composition-based stats. Identities = 69/287 (24%), Positives = 124/287 (43%), Gaps = 7/287 (2%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S+I+P+YN C+ S+I QT+T LEII+++DGS DN + +AE ++++H Sbjct: 5 ISIIVPVYNVEPYLDQCVRSIIEQTYTDLEIILVDDGSPDNCPAMCDAWAEKDRRIKVVH 64 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 + N G S ARN G+++A+G+Y+ FVD+DD + P + E+ + ++ + D+ A W + Sbjct: 65 KQNGGLSDARNAGMDIASGEYIGFVDSDDWLEPDILESALKQLVQANADIIAFGAIWEYS 124 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 E E L LR S +V +YR++++ + ++F+ G Sbjct: 125 EKHEIPHP-----LEDNIYCGSEQILRAYFSECMTLTIVVNKLYRKNILDE--VRFLKGK 177 Query: 188 HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRLLE 247 H+D +T + A ++ Y Y S+ + K+T L + Sbjct: 178 LHEDEFFTYRVLAKANTVVVCKKIGYHYRQRENSIMGTQYHLRHLDVLEAMDEKLTFLEQ 237 Query: 248 KLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEI 294 + + L H +R I I Sbjct: 238 EFPSLLPQQRRAQMMTCLSHYRSILMNKHVDPDSVGRNKIRRYIKRI 284 >UniRef50_C0WDB4 Ss-1,4-galactosyltransferase n=1 Tax=Acidaminococcus sp. D21 RepID=C0WDB4_9FIRM Length = 337 Score = 296 bits (760), Expect = 5e-79, Method: Composition-based stats. Identities = 75/327 (22%), Positives = 137/327 (41%), Gaps = 20/327 (6%) Query: 6 NKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRL 65 +S+++P+YN C+ SL+ QT++ LEI++++DGSTD S + + + + + Sbjct: 14 PLISLLVPIYNVASYVDRCVGSLVNQTYSNLEILLVDDGSTDGSGALCDEWVKKDSRIHV 73 Query: 66 LHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWC 125 +H+ N G S ARN GI+ A G Y+AF+D DD + P E L D D++ CN + Sbjct: 74 VHKENGGLSDARNAGIKRAAGAYLAFIDGDDYIAPEYCEKLYQALAAHDADISLCNICYE 133 Query: 126 FRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWT-HVVWMGVYRRDVIVKN-NIKF 183 + + P GV T + L++ T V W +Y+R + N + F Sbjct: 134 WADGRRKDYEPPHG--TREGVCTVKEVLQVLHGVGDLTLCVAWNKLYKRIIFTGNPPLFF 191 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKIT 243 G +++D + + A R +LY Y+ + S+ + +H+ + Sbjct: 192 PKGRYYEDEFLNYKLLNRASRISVIPDALYFYFQRDNSIIHQYTD--------KHFTDLC 243 Query: 244 RLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQ--------RMIAEIF 295 +E Y D + + + R R++ E+ Sbjct: 244 CCIEDSLTWYPDDEEKRQWKLRHSLKLCFTLYKKCHDDLHFRNRTSEVKGLVFRIMKELK 303 Query: 296 TSGMYKRLITNVRSVKVGYQALLWSFR 322 + + KR + V + VG L+ + Sbjct: 304 ITYLTKRELVKVFLILVGGYGALYQVK 330 >UniRef50_A6L4H5 Glycosyltransferase family 2 n=2 Tax=Bacteroides RepID=A6L4H5_BACV8 Length = 321 Score = 296 bits (760), Expect = 5e-79, Method: Composition-based stats. Identities = 69/329 (20%), Positives = 132/329 (40%), Gaps = 8/329 (2%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M + + +S+I+P+YN C++SL+ QT+ +EII+++DGS D S EI YA+ Sbjct: 1 MSRNDSLISIIVPVYNVEQYLSRCVDSLVNQTYHNIEIILVDDGSPDRSGEICDEYAKKD 60 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 V+++HQ+N G S ARN +++A G Y+ FVD+DD + E L +A + D Sbjct: 61 KRVKVIHQSNGGLSDARNTALDIAKGDYLMFVDSDDWIEKKTCEILNRLASDYKADTVIF 120 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN 180 ++ F L + +L ++ + ++ + Sbjct: 121 GLNFVFDSGKVIRSKRGMAGLTDKNMCMK--YLIHNVARGGIFNYACNKMFSSHLFE--G 176 Query: 181 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYI 240 ++F G +D + + + R TEQ LY YY + S++ + ++ Sbjct: 177 LRFPVGRLAEDQDVIYKLIHRSQRIYVTEQPLYNYYQRDGSITAAQYYPRLIRDRHELWL 236 Query: 241 KITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMY 300 K + + Y D Y + ++ ++ + + + Sbjct: 237 KRLDFIREY---YPDLENYQIAQILGAAYVGIVKLEGNKQNGELKKKLICFSNKYILKE- 292 Query: 301 KRLITNVRSVKVGYQALLWSFRLWQWRDK 329 K L R +K+ Y + + K Sbjct: 293 KELSVLDRKIKLHYYCYPLFWLYVKLFIK 321 >UniRef50_D2EWI1 Glycoside transferase family 2 protein n=3 Tax=Bacteroides RepID=D2EWI1_9BACE Length = 336 Score = 296 bits (759), Expect = 6e-79, Method: Composition-based stats. Identities = 77/343 (22%), Positives = 153/343 (44%), Gaps = 19/343 (5%) Query: 1 MMNST--NKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAE 58 M N+ +S+I+P+YN D C++S+ QT+ LEII+I+DGS+D S E+ +A+ Sbjct: 1 MKNTEALPLISIIVPVYNVKDYLEKCLQSICVQTYKNLEIILIDDGSSDGSGELCDLFAQ 60 Query: 59 NYPHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVA 118 ++++HQ NAG S ARNRG+ VA G+++ FVD+DD + P MYE L + + D++ Sbjct: 61 RDGRIKVIHQTNAGQSAARNRGLAVAQGEFLGFVDSDDWIEPDMYEFLYHLLKANGADIS 120 Query: 119 QCNADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVK 178 C+ ++ + + +R + +R + +W +Y+R + Sbjct: 121 ICSHYI-----ETAVKTRVKHSSGQFSSFSREEAIRTLVEDKRIRNYMWDKLYKRQLFA- 174 Query: 179 NNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRH 238 I F +DI + + + + + Y Y S ++ K NY++ Sbjct: 175 -GIYFPVNRVFEDIAVSYQIFYKTQKVVMQDCPKYHYLKREGSTTQG-----KLYNYEKE 228 Query: 239 YIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSG 298 Y+ + E++ + + I+ + + + + + P +I ++ Sbjct: 229 YLLFQTVYEQV--KFVREKEIWDKAPYCVHERGIHLIDHLMMLPQSSLIDDVINDVLARM 286 Query: 299 MYKRLITNVRSVKVGYQALLWSFRLWQWRDKTRSHHRITRSAF 341 + NV ++G + ++ R +R+ R+ F Sbjct: 287 ---QEFDNVSWPQLGVMHIFKRRMMYCHLKVYRCIYRLVRTFF 326 >UniRef50_C0BRJ5 Putative uncharacterized protein n=1 Tax=Bifidobacterium pseudocatenulatum DSM 20438 RepID=C0BRJ5_9BIFI Length = 338 Score = 296 bits (759), Expect = 7e-79, Method: Composition-based stats. Identities = 62/260 (23%), Positives = 117/260 (45%), Gaps = 11/260 (4%) Query: 3 NSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPH 62 +SVI+P+Y C++S++ QT+ LEII+++DGS D ++ +A Sbjct: 8 EDLPLISVIVPVYKVERYLDRCIQSIVDQTYRNLEIILVDDGSPDACPDMCDAWAARDCR 67 Query: 63 VRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNA 122 +R++HQ N G ARN GI ++TG+ + FVD+DD + PTMYE L +++ D + Sbjct: 68 IRVIHQENRGLGAARNSGIRISTGELIGFVDSDDWLEPTMYEVLYRSLTKENADTSMITG 127 Query: 123 DWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIK 182 + + + P L V+T + + T W + +R + NI+ Sbjct: 128 VLEYPNGRQIYLYHPDLHL----VMTTSESFKYINLPGYSTISAWSKLIKRSTLG--NIR 181 Query: 183 FIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKI 242 F G +D +T + + ++R + Y Y S+S N Y Sbjct: 182 FPEGCTGEDWQFTYDILAASVRTTFDSTPQYHYRQVAGSLSNTSTNFNFA-----GYEAT 236 Query: 243 TRLLEKLNRNYADKIMIYPE 262 +++ +++ + +++ Sbjct: 237 KNMVKLVHQKFPEQLPFALY 256 >UniRef50_B0ACE9 Putative uncharacterized protein n=2 Tax=Firmicutes RepID=B0ACE9_9CLOT Length = 335 Score = 295 bits (757), Expect = 1e-78, Method: Composition-based stats. Identities = 80/343 (23%), Positives = 154/343 (44%), Gaps = 15/343 (4%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 K+S+++P+YN C++S+I QT +EII+++DGS DN ++ YA+ ++ Sbjct: 1 MAKVSIVVPIYNVEKYLEQCVDSIINQTLKEIEIILVDDGSPDNCPQMCDEYAKKDSRIK 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++H+ N G S ARN GIEVATG Y+ FVD+DD + MYE + +A+E+++D + Sbjct: 61 VVHKKNGGLSSARNAGIEVATGDYIGFVDSDDYIELDMYEKMYNIAIENNVDFVMSDYYR 120 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTH----VVWMGVYRRDVIVKNN 180 E + + D + + VW +Y+R+++ KN+ Sbjct: 121 VSSEDKVKATLDMDKGIYDKNKI-KKDIFPTLIMGEYINYGRLLAVWHCLYKRELLEKND 179 Query: 181 IKFIAGLHH-QDIVWTTEFMFNALRARYTE-QSLYKYYLHNTSVSRLHRQGNKNLNYQRH 238 + F + + +D ++++ + Y + LY Y + S+S+ ++ ++ Sbjct: 180 LTFDEEVKYSEDNLFSSIIGYKVNNFYYMKGSYLYNYRYNPNSISKTYKPDAWSV----- 234 Query: 239 YIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSG 298 Y+K+ L+ L N D + I Y + H + + + I I + Sbjct: 235 YLKMNNKLKDLFLNVEDYDFRNQLYAHIIYYAFNCINHVGNSDKSFSYKYKEIKRILNT- 293 Query: 299 MYKRLITNVRSVKVGYQALLWSFRLWQWRDKTRSHHRITRSAF 341 K L ++ K G + +W + K ++ I + Sbjct: 294 --KELEDGFKNFKAGNISTKLKISIWMLKHKFIINYMIIKGYI 334 >UniRef50_C4FFR1 Putative uncharacterized protein n=1 Tax=Bifidobacterium angulatum DSM 20098 RepID=C4FFR1_9BIFI Length = 327 Score = 295 bits (757), Expect = 1e-78, Method: Composition-based stats. Identities = 77/338 (22%), Positives = 146/338 (43%), Gaps = 17/338 (5%) Query: 1 MMNS-TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAEN 59 M N+ +SV++P+YN C++S++ QT+T +EII+++DGSTD S I Y+ Sbjct: 1 MSNTALPLISVVVPVYNVKPYVAKCLDSILRQTYTNIEIIVVDDGSTDGSELICDDYSNK 60 Query: 60 YPHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQ 119 P + ++HQ N G + ARN GI+ A G+Y+ FVD+DD + P MYE L+ A +++ +A Sbjct: 61 DPRITVIHQRNTGLAAARNTGIDAAHGEYLGFVDSDDFIEPFMYEKLLNAAQKNNCVLAV 120 Query: 120 CNADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKN 179 C ++ F IP + V PD + + R + W +Y+ ++ Sbjct: 121 CGINYVFDNGTV----IPKANIEPDQVFDFPDAITEMNTYRLFDMGAWSKLYKSELFD-- 174 Query: 180 NIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHY 239 I+F G +D + A + + + Y YY S+++ + + L Sbjct: 175 GIRFPVGKLSEDFFIMFKIFDRAQKVAFVSDACYNYYQRTNSITKSKKINHDFL------ 228 Query: 240 IKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQR---MIAEIFT 296 + + ++RNY +I + + R++ +I Sbjct: 229 EAAYKQMRYIDRNYPSLSVIGHTAYASAALTVYDSYLKNQVPCPEAFRKQCVSIIQNQKQ 288 Query: 297 SGMYKRLITNVRSVKVGYQALLW-SFRLWQWRDKTRSH 333 I+ + ++ + +++ K R+ Sbjct: 289 YIAQANYISKAKRIQFHLFLINATLYKVIFMTYKIRNK 326 >UniRef50_P71059 Uncharacterized glycosyltransferase epsJ n=3 Tax=Bacillus subtilis group RepID=EPSJ_BACSU Length = 344 Score = 295 bits (756), Expect = 2e-78, Method: Composition-based stats. Identities = 90/336 (26%), Positives = 145/336 (43%), Gaps = 14/336 (4%) Query: 6 NKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRL 65 +S+I+P+YN C++SL+ QT + +EII++NDG+ D S EIA+ YA+ +R+ Sbjct: 3 PLVSIIVPMYNVEPFIEECIDSLLRQTLSDIEIILVNDGTPDRSGEIAEDYAKRDARIRV 62 Query: 66 LHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWC 125 +HQAN G S ARN GI+ A G Y+ FVD DD V M++ L A ++ LD+ C Sbjct: 63 IHQANGGLSSARNTGIKAARGTYIGFVDGDDYVSSAMFQRLTEEAEQNQLDIVGCGFYKQ 122 Query: 126 FRETGETWQSIPTDRLRSTGVLTGPDWLRMG--LSSRRWTHVVWMGVYRRDVIVKNNIKF 183 + +L + VLT P+ R+ VW +YRR++ + N+ F Sbjct: 123 SSDRRTYVPP----QLEANRVLTKPEMTEQLKHAHETRFIWYVWRYLYRRELFERANLLF 178 Query: 184 IAGLHH-QDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKI 242 + +D + A R + ++ LY Y + S++ + + + + Q+ Y Sbjct: 179 DEDIRFAEDSPFNLSAFREAERVKMLDEGLYIYRENPNSLTEIPYKPAMDEHLQKQYQAK 238 Query: 243 TRLLEKLNRNYADKIMIYPEF-HQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYK 301 A K + Q+ C + DI + R I Y Sbjct: 239 IAFYNHYGLAGACKEDLNVYICRHQLPMLLANACASPNSPKDIKKKIRQILS------YD 292 Query: 302 RLITNVRSVKVGYQALLWSFRLWQWRDKTRSHHRIT 337 + VR ++ LL RL K R I Sbjct: 293 MVRQAVRHTPFQHEKLLRGERLVLALCKWRLTFLIK 328 >UniRef50_A1A383 Putative glycosyltransferase n=1 Tax=Bifidobacterium adolescentis ATCC 15703 RepID=A1A383_BIFAA Length = 346 Score = 294 bits (754), Expect = 2e-78, Method: Composition-based stats. Identities = 69/284 (24%), Positives = 128/284 (45%), Gaps = 12/284 (4%) Query: 6 NKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRL 65 +SVI+P+Y C++S++ QT+ LEII+++DGS D+ ++ + +R+ Sbjct: 18 PLISVIVPVYKVERYLDRCVQSIVDQTYRNLEIILVDDGSPDSCPDMCDAWIAKDYRIRV 77 Query: 66 LHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWC 125 +HQ N G+ ARN GIE +TG+ + FVD+DD + PTMYE L +D D++ + Sbjct: 78 IHQENQGSGAARNNGIESSTGELIGFVDSDDWLEPTMYEALFNSLKNNDADISMTASIME 137 Query: 126 FRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIA 185 + + + P L V+T + + T W + +R V+ +I+F Sbjct: 138 YPDGRQICHYQPNVHL----VMTTSESFKYVNLPGYSTISPWSKLIKRSVLG--DIRFPE 191 Query: 186 G-LHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 G ++ +D +T E + + R + LY Y S+S + N Y + Sbjct: 192 GCVNGEDTRFTYEVLAASKRTTFDSVPLYHYRTTEGSLSHTYTNYNWA-----GYEATKQ 246 Query: 245 LLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQ 288 ++E + R + +++ + R + R+ Sbjct: 247 MVELVRRRFPEQLPFALYGLMRSMISLYDQALVTRSSKEQQWRR 290 >UniRef50_B5LBK9 Putative glycosyl transferase n=1 Tax=Streptococcus pneumoniae RepID=B5LBK9_STRPN Length = 322 Score = 293 bits (752), Expect = 5e-78, Method: Composition-based stats. Identities = 79/332 (23%), Positives = 144/332 (43%), Gaps = 15/332 (4%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 MM K+SVI+P+YN C+ES++ QT+T LEI+++NDGSTD S E+ A+ Sbjct: 1 MMGE--KISVIVPVYNVEAYLEKCVESILKQTYTNLEILLVNDGSTDKSGELCDQLAQRD 58 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 +R++H+ N G S ARNRGIE A+ + F+D+DD + MYETL LE + D++ C Sbjct: 59 QRIRVIHKENGGLSDARNRGIEEASSDLIGFIDSDDYIDEDMYETLYRQMLESNADLSMC 118 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN 180 + T LT + ++M + ++ + +Y++++ Sbjct: 119 GHYDVYD-----QIPEKQVATIQTWELTPQEAIKMVMEAKILSVTAVNKLYKKELFE--Q 171 Query: 181 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYI 240 ++F G +D + + T + Y Y S++ + K LN Y Sbjct: 172 LRFEIGKIAEDAFIMIALIHQCRKVVATNEKKYYYVHRENSIT-TQKFSLKFLNVIEAYE 230 Query: 241 KITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMY 300 + ++ + + D + + + L D + R+IA + + Sbjct: 231 QNANIIRENYPDLTDVATMRLNWAY---FYVLDRLLVDADFKDKVLEDRLIAYLKKN--T 285 Query: 301 KRLITNVRSVKVGYQALLWSFRLWQWRDKTRS 332 K ++++ R + + L + K Sbjct: 286 KNILSDSRFTRARKVSFLALCLSRKLYTKILL 317 >UniRef50_B7GUL7 Glycosyl transferase, family 2 n=1 Tax=Bifidobacterium longum subsp. infantis ATCC 15697 RepID=B7GUL7_BIFLI Length = 349 Score = 291 bits (746), Expect = 2e-77, Method: Composition-based stats. Identities = 78/346 (22%), Positives = 149/346 (43%), Gaps = 15/346 (4%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 S ++V++P+YN TC++S++ QT+ L+II+++DGSTD S I +++A Sbjct: 11 NGSDPLITVVLPIYNVAKWLDTCIQSIVKQTYRNLQIILVDDGSTDESPTICEHWAAKDS 70 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 + ++HQ NAG S ARN G+ + G+Y+ FVD+DD V + ++ A + D+ CN Sbjct: 71 RINVVHQRNAGLSAARNTGLRLRKGEYICFVDSDDYVEHDYVKRMLDTAHAEQADMVICN 130 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNI 181 + D T LT ++ + S V W +YR ++ N+ Sbjct: 131 VRKEDENGAALAEE--ADPSFETKTLTSRQYMIYAMQSSWKHIVAWNKLYRSEIWD--NL 186 Query: 182 KFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIK 241 + G H+D + + R + LY Y H+ S+ N ++ Sbjct: 187 TYPVGKIHEDEFVFAQLVVRCHRVACIDDVLYHYVQHDGSIMN----ENYSIRNLDKIEA 242 Query: 242 ITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEI---FTSG 298 + + L L+ N ++ ++ I + +R + +R + R ++ + Sbjct: 243 LCQRLLFLDENGFNECLL--NTFNGIIADFMRAENNLRSDMQAQQRLHSLSSQAQALSPI 300 Query: 299 MYKRLITNVRSVKVGYQALLWSFRLWQWRDKTRSHHRITRSAFNLR 344 +K ++ KV + + + + R HH I + L+ Sbjct: 301 CFKSPMSWFEKTKVLFCLIFSPLSYAIFSN--RLHHLIKKIKSQLQ 344 >UniRef50_D2Q655 Glycosyl transferase, family 2 n=2 Tax=Bifidobacterium dentium RepID=D2Q655_9BIFI Length = 340 Score = 291 bits (746), Expect = 2e-77, Method: Composition-based stats. Identities = 63/245 (25%), Positives = 105/245 (42%), Gaps = 5/245 (2%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 + +SVI+P+YN G C+ES+I QT++ +EI++++DGSTD S +A+ + Sbjct: 8 NQPLISVIVPVYNVGGFLDRCVESIIAQTYSRIEILLVDDGSTDGSSVKCDSWADKDDRI 67 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 ++H+ N G S ARN G++V G+Y+AFVD+DD V P E + ++ C+ Sbjct: 68 SVIHKVNGGLSDARNAGLDVCLGEYIAFVDSDDYVDPRFIEGMFDALSRHSSTLSVCDVV 127 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 +S V++G L + W +Y R + ++F Sbjct: 128 REDESGNALRKSRKAR--PRERVMSGRQCLGLTYVDPN-AVTAWSKLYHRSLWE--GLRF 182 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKIT 243 G H+D E M+ R LY Y H S+ ++ Sbjct: 183 PKGRLHEDEFVFHEIMYQCDRVVVMADELYHYVQHQGSIMHARYNVRYLDRIDAWLQRLD 242 Query: 244 RLLEK 248 +E Sbjct: 243 AFVEH 247 >UniRef50_C0FV15 Putative uncharacterized protein n=1 Tax=Roseburia inulinivorans DSM 16841 RepID=C0FV15_9FIRM Length = 278 Score = 290 bits (744), Expect = 4e-77, Method: Composition-based stats. Identities = 60/269 (22%), Positives = 111/269 (41%), Gaps = 3/269 (1%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 +SVI+P+Y C++SL+ Q++ LEII+++DGS D I Y ++ Sbjct: 2 KPLISVIVPIYKVEKYLTICVDSLLRQSYGNLEIILVDDGSPDECPAICDQYQNKDNRIK 61 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++H+ N G S ARN G+++A G+YVAFVD+DD + L ++ +A C Sbjct: 62 VIHKKNGGLSDARNAGLDIAQGEYVAFVDSDDFIDEDYICKLYDALQQNRASIAVCGIQI 121 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 T Q T + + TG + ++ L W +Y + + ++F Sbjct: 122 IDESNKVTEQICVTQDKKQE-LYTGREIIKKELQGEWVLVTAWGALYDAKIFKE--LRFP 178 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 G H++D N +++Y Y S+ + + ++Y + + Sbjct: 179 CGRHYEDEYVFVSVFENQECVVCIPENIYYYLQRADSIMGVTYKKQDCMDYFEMWHERID 238 Query: 245 LLEKLNRNYADKIMIYPEFHQQITYEALR 273 + +I + Y A+ Sbjct: 239 YFQTKEDRELLAPVIQSCLAWNVLYLAMN 267 >UniRef50_D1BQ92 Glycosyl transferase family 2 n=1 Tax=Veillonella parvula DSM 2008 RepID=D1BQ92_VEIPT Length = 322 Score = 290 bits (744), Expect = 4e-77, Method: Composition-based stats. Identities = 92/326 (28%), Positives = 146/326 (44%), Gaps = 17/326 (5%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 +SVI+P+YN C+ES+ QT+ LEII++NDGSTD S +I YA ++ Sbjct: 1 MELISVIVPVYNVESYVAECIESIQNQTYMNLEIILVNDGSTDASGDICDKYAAYDERIQ 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++H+ NAG S ARN GIE A G Y+ FVD+DD + PTMYE ++ + E DLD+ +C A Sbjct: 61 VIHKENAGVSAARNTGIESANGDYIGFVDSDDYIAPTMYEDMLKLMAEHDLDIIECTAFR 120 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 +T I S + + L+M + VW +Y+R VI ++++F Sbjct: 121 NNGDTN-----IEGCNDGSLEIFNRDEALKMAMYD--CFVAVWSQLYKRRVI--SDVRFP 171 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 G +D + F+ N R + + LY Y L+ S ++ ++ Y + + Sbjct: 172 VGRKFEDSAVSYLFIANTKRVGHINRCLYYYRLNPNSTTQTSFDAKSRWDFVLGYEERLQ 231 Query: 245 LLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLI 304 Y D + + + L A +P Q + R Sbjct: 232 YAIDHQLPYVD------DCNSLLMKAVLSCLTAYYAKPT--GNQVYYDKCKKMIETYRND 283 Query: 305 TNVRSVKVGYQALLWSFRLWQWRDKT 330 + + + Y+ LWSF W K Sbjct: 284 ASYKLLNSKYKLFLWSFGRADWIHKI 309 >UniRef50_A7VPB1 Putative uncharacterized protein n=1 Tax=Clostridium leptum DSM 753 RepID=A7VPB1_9CLOT Length = 330 Score = 290 bits (744), Expect = 4e-77, Method: Composition-based stats. Identities = 73/300 (24%), Positives = 128/300 (42%), Gaps = 9/300 (3%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M +S+IIP+YN C+ES+ QT E+I+++DGSTD+S+EI + +A + Sbjct: 1 MKTVVPNVSLIIPVYNVEQYLPKCLESIAAQTLKGFEVILVDDGSTDHSLEILRGFARRF 60 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 P+ ++HQ N G S ARN GI+ A G+Y+AF+D+DD + P + L A + D+ C Sbjct: 61 PNTCVIHQENGGVSKARNAGIQAARGEYIAFMDSDDYIAPLYLQRLYESAKKYRADMVCC 120 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN 180 + T + P ++ G+ T + +R + +W ++RR + +++ Sbjct: 121 SYYRY---NAVTHELRPAQFRKAPGIYTPKEMIRALILDMGIKGFIWNKLWRRTLFTEHH 177 Query: 181 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYI 240 I F + +DI + + R LY Y H S G N+ Y+ Sbjct: 178 ITFPT-MCFEDISVCQRGFYFSNRIAVIGDPLYFYVQHKDSAI-----GALNIKKLNDYL 231 Query: 241 KITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMY 300 + L D + + A+ C+ + F ++ Sbjct: 232 RALADLRGFLELQRDFPPYRAVYRLHSLFVAITTCYLTLVAQKNAKNLSGLGRNFKEIIH 291 >UniRef50_D2Q661 Glycosyl transferase family 2 n=2 Tax=Bifidobacterium dentium RepID=D2Q661_9BIFI Length = 337 Score = 290 bits (744), Expect = 4e-77, Method: Composition-based stats. Identities = 80/345 (23%), Positives = 141/345 (40%), Gaps = 22/345 (6%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 ++ +SVIIP+Y C+ S++ QT+ LEII+++DG D + + E VR Sbjct: 2 SDLVSVIIPIYGVEPYLDRCVRSVVNQTYRNLEIILVDDGGKDQCPVMCDAWVERDSRVR 61 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++H+ N G S ARN G++VA G+++AFVD DD V P + + +D+A C+ Sbjct: 62 VIHKPNGGLSSARNAGLDVAVGEFIAFVDGDDYVEPGYIAIMADAMGDVQVDLAMCSVFH 121 Query: 125 CFRETGETWQSIPTDRLRSTGV------LTGPDWLRMGLSSRRWTHVV-WMGVYRRDVIV 177 + Q+ P R V +G D +R +VV W +YRR + Sbjct: 122 EDIDGHAVAQTAPVRREEYAPVTDIRRTCSGLDCMRQRGDENGMDNVVAWNKLYRRSLWD 181 Query: 178 KNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQR 237 I++ G H+D T A A + LY Y + S+ + Sbjct: 182 --GIRYPLGKIHEDEFVTYRIFGRARAAVLLPERLYHYIERDGSIVHSKYTLRSLDIIEA 239 Query: 238 HYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTS 297 K+ LL++ + + P F Q+ R + + R ++ +F + Sbjct: 240 LIGKVHFLLDE------GAVDLVPGFFSQLKDSIYRARQLDWSDEQVHKRLEVLFRLFRT 293 Query: 298 GMYKRLITNVRSVKVGYQALLWSFRL---WQWRDKTRSHHRITRS 339 + + +R + V + R+ W+ K + + Sbjct: 294 LPW----STMRYLSVKDRVNYLGTRICPFLFWQRKLYDNKSAVKG 334 >UniRef50_D1PNS5 Capsular polysaccharide biosythesis protein CpsI n=1 Tax=Subdoligranulum variabile DSM 15176 RepID=D1PNS5_9FIRM Length = 326 Score = 290 bits (743), Expect = 5e-77, Method: Composition-based stats. Identities = 82/311 (26%), Positives = 131/311 (42%), Gaps = 10/311 (3%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTA-LEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 +SVI+P+YN C++S++ Q + E+++++DGSTD +I YA VR+ Sbjct: 6 ISVIVPIYNTKAYLEECLDSILAQDFKEPYEVLLVDDGSTDGCTDICDAYAAKDNRVRVF 65 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 HQ N G S ARN GI+ A G+Y AFVD+DD V TL E D +A C + Sbjct: 66 HQENQGLSAARNTGIDAARGRYYAFVDSDDVVLSAYLRTLYNACEEHDAYMALCAVEDV- 124 Query: 127 RETGETWQSIPTDRLRSTGVLTGPDWLRMGLS-SRRWTHVVWMGVYRRDVIVKNNIKFIA 185 +E+GE+ + GV G D L + + V W +YR +V + + Sbjct: 125 QESGESCVPPHSTHPAQEGVFCGKDLLNEFYTPNGTVYTVAWNKLYRAEVWKL--LHYPE 182 Query: 186 GLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRL 245 G H+D + + + LY Y L + S+ R + + Sbjct: 183 GRLHEDDFVAHRLFWRCDKVVCVDTILYHYRLSSGSICRSSIKPEAF----DAVDGLADR 238 Query: 246 LEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLIT 305 N AD+ +I + L +C VR+ P + + E F + Sbjct: 239 YRFYVENNADRSVIDSAYAACW-RRYLFLCAKVRQNPTPQLVKAISKEQFLMQGLISYLP 297 Query: 306 NVRSVKVGYQA 316 N R +K+ + Sbjct: 298 NCRHMKMTEKL 308 >UniRef50_C4FMC2 Putative uncharacterized protein n=3 Tax=Firmicutes RepID=C4FMC2_9FIRM Length = 317 Score = 290 bits (743), Expect = 5e-77, Method: Composition-based stats. Identities = 75/319 (23%), Positives = 132/319 (41%), Gaps = 6/319 (1%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 LS+I+P+YN C++S++ QT+ LEII+++DG+TD S IA YA ++ Sbjct: 2 EPLLSIIVPIYNVEQYVDKCIQSILNQTYQNLEIILVDDGATDCSGSIADSYAAKDKRIK 61 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 + H+ N G S ARN G++ TG Y+ FVD+DD + TM E L +A + D+ CN Sbjct: 62 VFHKENGGLSDARNYGLDHVTGDYILFVDSDDFIENTMCERLFAVANSTNADIVSCNYYI 121 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWT--HVVWMGVYRRDVIVKNNI- 181 + + ++ T TG D LR L +VVW +++ ++ + Sbjct: 122 YREDDDISIHTM--SVQDDTRTFTGMDMLRYYLLKTEPFDLNVVWNKLFKSELFNGTVLV 179 Query: 182 KFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIK 241 +F G +D NA + LY Y S+ + + +I Sbjct: 180 RFPKGRVQEDNFTIFRLFLNANTIVTVNEPLYYYVQRAGSIMANFSR-RFMTDTVESHIY 238 Query: 242 ITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYK 301 + L + +++ +Y + V + D+LT+ + T+ Sbjct: 239 MNDYLMDHCSSVKNELQLYLLNSYVELSRRVHVNKCKTEYNDLLTQYKHYVLDHTADTSH 298 Query: 302 RLITNVRSVKVGYQALLWS 320 + + L+ Sbjct: 299 NPLWKCKQDIKRVLVKLYY 317 >UniRef50_A7VCQ5 Putative uncharacterized protein n=1 Tax=Clostridium sp. L2-50 RepID=A7VCQ5_9CLOT Length = 329 Score = 290 bits (742), Expect = 7e-77, Method: Composition-based stats. Identities = 76/332 (22%), Positives = 147/332 (44%), Gaps = 12/332 (3%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 K+S+IIP+Y C++S++ QT +EII+++DGS DN I YA V Sbjct: 2 KPKVSIIIPVYKVEKFLTRCLDSVLGQTLKEIEIILVDDGSPDNCGNICDEYAAKDKRVI 61 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++H+ NAG S ARN G+EVA+G++V FVD+DD V TM+E + A+ D ++A C Sbjct: 62 VIHKKNAGVSAARNSGLEVASGEFVGFVDSDDYVAATMFEDMYRQAVLADAEMAMCQFAI 121 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 T + D + + SR VW +++R++I N++F Sbjct: 122 TDGATDQLVHRSSGDDFNVLKFDNKKAFELIADFSRPVQVTVWNKLFKRELIQ--NLRFD 179 Query: 185 -AGLHHQDIVWTTEFMFNALRARYTEQSLY-KYYLHNTSVSRLHRQGNKNLNYQRHYIKI 242 + +D+ + +F + Y +LY Y + + + Y I Sbjct: 180 TSKRMAEDLEFLMRALFKSRTVVYVPYALYAYYAQREGAAT--FHADHSIAWYLEQNSNI 237 Query: 243 TRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDI-LTRQRMIAEIFTSGMYK 301 T ++++ +N A + + L + +A+ + + +++ + S + + Sbjct: 238 TSIMDETAQNCASMKNL--AIGYKCVNGDLSIANAMVRAGKLDSEAVKLVKKDLKSNIRE 295 Query: 302 RL---ITNVRSVKVGYQALLWSFRLWQWRDKT 330 + + V+ V++ + + + + K Sbjct: 296 IMTSELHAVKKVQMLLFIVSPALYMKVMKKKL 327 >UniRef50_B3JEB6 Putative uncharacterized protein n=1 Tax=Bacteroides coprocola DSM 17136 RepID=B3JEB6_9BACE Length = 344 Score = 289 bits (741), Expect = 8e-77, Method: Composition-based stats. Identities = 73/345 (21%), Positives = 155/345 (44%), Gaps = 22/345 (6%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 + K+S+I+P+YN C+ESLI QT +EII+++D S D+ I YA+ ++ Sbjct: 2 SPKVSIIVPVYNVEKYIHRCVESLINQTLKDIEIILVDDESPDSCPSICDEYAQKDARIK 61 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++H+ N G + RN G+E+A+G+++AF+D+DD V TMYETL A + D C+ + Sbjct: 62 VIHKKNEGLGLTRNAGLEIASGEFIAFIDSDDWVDLTMYETLYNTAKNNQCDTVYCSLQY 121 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGL-------SSRRWTHVVWMGVYRRDVIV 177 + + G +L L S ++ V +Y R++I Sbjct: 122 YYSPNKIIPFKEVNEETFFQGRKEIDTFLLDMLAPLPTYHSDVKYMVSVCKAIYSRNIIR 181 Query: 178 KNNIKFIAGLH--HQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNY 235 +++KF++ +D+V+ T+++ + + + + Y Y+ + S++ + + Sbjct: 182 NHHLKFVSERIIASEDMVFHTQYLKLSEKIGFIPKYFYNYFQNENSITHTYTETK----- 236 Query: 236 QRHYIKITRLLEKLNRNYADKIMIYPEFHQQITYEA---LRVCHAVRKEPDILTRQRMIA 292 K + ++ Y ++ Y ++ + L + + + K + + + Sbjct: 237 IERLKKFIQEMDVTFSQYFNQSEYYIRLQRKALHYLRTSLYIKYQITKSHPFIEQIKEFK 296 Query: 293 EIFTSGMYKRLITNVRSVKVGYQALLWSFRLWQWRDKTRSHHRIT 337 E+ ++ + + + Y L RL+ + K + + Sbjct: 297 ELCDDEIFSKTLVDY-----PYHLLPLKHRLFYFFIKFKITSLLI 336 >UniRef50_C6LFI9 Glycosyl transferase, group 2 family n=1 Tax=Bryantella formatexigens DSM 14469 RepID=C6LFI9_9FIRM Length = 689 Score = 289 bits (741), Expect = 9e-77, Method: Composition-based stats. Identities = 83/352 (23%), Positives = 150/352 (42%), Gaps = 15/352 (4%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M K+SVIIP+YN C++S+ QT +EII +NDGSTD S++I + YAE Sbjct: 1 MADRKKVSVIIPVYNVEKYLEKCLDSVERQTLPEIEIICVNDGSTDGSLQILQSYAEKDD 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 + ++++ N G S ARN G+ VA G+YV F+D+DD + E L +++LD + Sbjct: 61 RIIIINKENGGLSSARNAGLAVAEGEYVYFLDSDDWILAETLEVLYDECKKENLDTILFD 120 Query: 122 ADWCFRETGE-----TWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVI 176 AD F +++ +TG + ++ + + RR + Sbjct: 121 ADSIFETEEVERLHPSYRDYYHREDCFGNTVTGQRLFAEMMPREQYRSSACLQMNRRAFL 180 Query: 177 VKNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQ 236 + + F G+ H+D ++T + A RA++ + Y+ + S+ R Y Sbjct: 181 EQYGLGFREGMIHEDELFTLQVSLRAERAKHLARPFYQRLVREESIMTSVRAIRSAFGYY 240 Query: 237 RHYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRK---EPDILTRQRMIAE 293 I +L + D + IY ++ V++ + R +A Sbjct: 241 VCNQLILAEATELVKE-KDIMDIYVNKISRLRRSIKSALKDVKRGEIDRFTSNRLEDMAL 299 Query: 294 IFTSGMYKRLITNVRSVKVGYQALLWSFRLW------QWRDKTRSHHRITRS 339 + +RL+ + R+++ Q L S K +S +R+ + Sbjct: 300 LEQFKYEQRLVISNRNLRKKIQELRASTTFKVGKTVTFIPRKLKSAYRVLKK 351 Score = 223 bits (569), Expect = 9e-57, Method: Composition-based stats. Identities = 72/314 (22%), Positives = 122/314 (38%), Gaps = 9/314 (2%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S+I+P+YN C+ SL Q + +EII ++D STD S+E+ + A + VR+L Sbjct: 377 ISIIMPVYNGEKYLWRCIRSLQKQMFRDIEIICVDDKSTDRSMELLQERARHDKRVRILQ 436 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 Q + GA ARN G++ A GKY+ F+D DD M L A + D V A Sbjct: 437 QEHLGAGSARNLGMKEARGKYLLFLDCDDLFDKNMCRELYRAAEKHDAQVVLFGAQRMDM 496 Query: 128 ETGETWQS---IPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 T + + + L V +G + ++ W ++RRD I++N ++F Sbjct: 497 MNNRTERMGWVLRSSELPKDAVFSGQEIADRLFQIT--SNCPWSKMFRRDFILENGLEFQ 554 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 + H D + M A R +Q Y + S + L + + + + Sbjct: 555 STRHCNDAYFVRMAMALAERMTMVDQIFTTYRFNMG-ASTQGVKHEAPLEFYKAFSAVKE 613 Query: 245 LLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLI 304 L + K + +T E +++M+ E F Sbjct: 614 ALTEKGLFALYKKSF---VNWTLTESLFNYRTMKTAEAKAAIKEKMLTEGFDYFGITECS 670 Query: 305 TNVRSVKVGYQALL 318 Y+ L Sbjct: 671 PEDIYNPQLYEEYL 684 >UniRef50_C9RJB8 Glycosyl transferase family 2 n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RJB8_FIBSS Length = 356 Score = 288 bits (737), Expect = 2e-76, Method: Composition-based stats. Identities = 66/318 (20%), Positives = 134/318 (42%), Gaps = 3/318 (0%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 +S+I+P+YN R C++S++ QT+T LEII+++DGS D +I YA + Sbjct: 3 QPLVSIIVPIYNVEPYLRRCLDSIVNQTYTNLEIILVDDGSPDGCPQICDEYAAKDKRIV 62 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++H+ N G S ARN G+++ G+YV+FVD+DD + E+ + D+ + Sbjct: 63 VIHKENGGLSDARNAGLDICKGEYVSFVDSDDWIATNSIESFYKIVQTQKPDIVIGDYIT 122 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 E+G + ++ + + G V W + ++D+ K F Sbjct: 123 A-TESGNYQNQTTSKQIIEMSLFNLLKCHQPGHPLFTQVVVAWNKLIKKDIAKKW--LFP 179 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 G +D T ++ F+A + T + Y Y S+ + NY + R Sbjct: 180 KGAICEDHYTTYKYFFSASKVILTNEITYYYRTREGSIVNNLAKNPIRDNYIQFEYLRER 239 Query: 245 LLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLI 304 + L++ D ++ + + + + + I + ++ + + Sbjct: 240 ISFFLDQARIDIANLFIDQYTRNAIYLYIIESKNKVNKGIHKQDILVFKKHIKTKSVKFF 299 Query: 305 TNVRSVKVGYQALLWSFR 322 N ++ + + ++L F Sbjct: 300 VNHLNICIKFYSILCKFS 317 >UniRef50_A3CKM9 Glycosyltransferase, family 2/glycosyltransferase family 8, putative n=1 Tax=Streptococcus sanguinis SK36 RepID=A3CKM9_STRSV Length = 289 Score = 287 bits (736), Expect = 4e-76, Method: Composition-based stats. Identities = 71/266 (26%), Positives = 113/266 (42%), Gaps = 9/266 (3%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 S +SVIIP+YN C+ S+ QT+ LEII++NDGSTD S +I + A+ + Sbjct: 2 SQELISVIIPVYNVEQYLAECVNSVCRQTYQNLEIILVNDGSTDASGQICQELADQDARI 61 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 RL+HQ N G S ARN GIE ++ Y+ FVD+DD + L E D D+A + Sbjct: 62 RLIHQDNQGLSGARNTGIEHSSADYLIFVDSDDWLPEQHVARLYEKLKEYDADIAIGHHC 121 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHV---VWMGVYRRDVIVKNN 180 E + + T + + R V W +Y+R++ Sbjct: 122 SFRAEDSA---FLYYSTEHFETLYTREEIIEEYPRRRMLDGVFLCAWAKLYKRELFDT-- 176 Query: 181 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYI 240 +++ G +D T + + + Y + LY Y L S+S + + Sbjct: 177 VRYPVGRVAEDAFTTYKLYLQSEKIIYLNEPLYYYRLRPNSISMT-WNEQWFRDLIVGFE 235 Query: 241 KITRLLEKLNRNYADKIMIYPEFHQQ 266 + +L KL + + Y Q Sbjct: 236 EQLAILGKLGYDLSFYYKYYTFLLQY 261 >UniRef50_D1PIY0 Glycosyl transferase, group 2 family n=1 Tax=Subdoligranulum variabile DSM 15176 RepID=D1PIY0_9FIRM Length = 322 Score = 287 bits (735), Expect = 4e-76, Method: Composition-based stats. Identities = 57/245 (23%), Positives = 101/245 (41%), Gaps = 8/245 (3%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 +SVIIP+Y C+ S+ QT++ LEI++++DGS D + +A+ P VR Sbjct: 1 MPAISVIIPVYQVERYLDACVASVEGQTFSDLEILLVDDGSPDGCPALCDAWAQKDPRVR 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++H+ N G S ARN GIE ATG ++AFVDADD + P + + D+ N Sbjct: 61 VIHRENGGLSAARNTGIEAATGDFLAFVDADDLLEPDTLRRAYQAQRQHNADLVIFN--L 118 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGL---SSRRWTHVVWMGVYRRDVIVKNNI 181 + + +P VL + + + + V W +Y+R++ + Sbjct: 119 VYVDDRNKVLPVPDFTGFRDEVLDADEVWQRYFALAEQKIYYVVAWNKLYKRELFAT--L 176 Query: 182 KFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIK 241 ++ G ++D + Y+Y S+ L+ ++ Sbjct: 177 RYAEGKRYEDQFLLPGLLAQCRTIVCLGYPGYRYVQRGGSIM-AQGSSRNYLDRSEFLLE 235 Query: 242 ITRLL 246 Sbjct: 236 WCDYF 240 >UniRef50_B5CU57 Putative uncharacterized protein n=1 Tax=Bacteroides plebeius DSM 17135 RepID=B5CU57_9BACE Length = 327 Score = 286 bits (733), Expect = 7e-76, Method: Composition-based stats. Identities = 81/328 (24%), Positives = 139/328 (42%), Gaps = 15/328 (4%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M + +SVI+P+Y C+ES++ QT+T LEII+++DGS D EI A Sbjct: 1 MPMSALISVIVPVYKVEPYLHRCVESILKQTYTNLEIILVDDGSPDKCGEICDELASTDK 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 ++++HQ N G S ARN G+ +ATG+Y+AFVD+DD ++P MY+ L ++ +A C Sbjct: 61 RIQVIHQKNKGLSGARNSGLNIATGEYIAFVDSDDCLHPEMYKRLYEDICLHNVKLAFCQ 120 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNI 181 A+ C E + + PT V +++ S W +Y R ++ NI Sbjct: 121 ANMCHGEVIKVTCNQPT-------VCKEKNYVMYRSLSESIWWAAWTKLYHRSLLE--NI 171 Query: 182 KFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIK 241 +F G ++D + LY Y + S+ + N++ I Sbjct: 172 RFPEGRTNEDYAVMMLIYDQCEQIAINFNRLYNYCIRENSICTS----SLNIHKFDQLIS 227 Query: 242 ITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYK 301 +L + + + L H K+ D +++ I ++ Sbjct: 228 AEEVLIYIKERHPEFKKFAEAILMSSCLGLLHALHKEDKKKDFTAQEKKIYQLIQKYFST 287 Query: 302 RLITNVRSVKVGYQALLWSFRLWQWRDK 329 + N K + LL + R+ K Sbjct: 288 AIWNNKILTKQ--RILLLAARIHPTIYK 313 >UniRef50_UPI0001B4A4FE ss-1,4-galactosyltransferase n=1 Tax=Bacteroides sp. 2_1_7 RepID=UPI0001B4A4FE Length = 332 Score = 286 bits (733), Expect = 7e-76, Method: Composition-based stats. Identities = 77/323 (23%), Positives = 140/323 (43%), Gaps = 6/323 (1%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 + +SVI+P+YN D +TC S+++Q++ +E+I++NDGSTD S EIAK + V Sbjct: 2 NMPLVSVIVPIYNVEDYLQTCCNSILSQSFNNIEVILVNDGSTDGSCEIAKQIEKKDSRV 61 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 L+ Q N G S ARN G++VA G YV FVD+DD + + L ++ DV+ C A Sbjct: 62 ILIDQRNGGLSAARNAGLKVAKGDYVLFVDSDDWIEKNCIQELYQDIQRNNSDVSCCLAQ 121 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 + LTG + L L + + W +YRRD + +N + F Sbjct: 122 YI----SAMGTIRRGRSSFGINSLTGDEILPNALLVKTFPTSAWGKLYRRDFLHENGLLF 177 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKIT 243 G+ ++D +++ + A + + ++ L+ S+SR + + + Sbjct: 178 KEGIVNEDTLFSIQIACLAKKISFVDKVLFDIREREGSISRSSFE-KLFQDMNIALLSAR 236 Query: 244 RLLEKLNRNYADKIMIYP-EFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKR 302 +E+ N +Y + + Y L++ ++ ++ Sbjct: 237 SFMEEHNCYNEPLDKLYKARYLKSNLYNLLQIAQRLQYPQYCKMYNFCMSSTCYLEYGDF 296 Query: 303 LITNVRSVKVGYQALLWSFRLWQ 325 + R ++ Y F L+ Sbjct: 297 AVFLPRKHRIMYSISKSPFLLFV 319 >UniRef50_C0BRK2 Putative uncharacterized protein n=1 Tax=Bifidobacterium pseudocatenulatum DSM 20438 RepID=C0BRK2_9BIFI Length = 350 Score = 286 bits (733), Expect = 7e-76, Method: Composition-based stats. Identities = 82/304 (26%), Positives = 132/304 (43%), Gaps = 12/304 (3%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 + K+SV++P+Y + C++S+ QT +EII+++DGS D IA YA + Sbjct: 2 NNPKVSVVVPIYKVEPFLQECVDSIQVQTLEDIEIILVDDGSPDRCGAIADEYARKDSRI 61 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 +++HQ N+G ARN GI ATG+YV FVD+DD + PTMY +L A + D D+ Sbjct: 62 KVVHQKNSGLGPARNTGIANATGEYVGFVDSDDWIAPTMYSSLYDAASQCDADIVVGGHC 121 Query: 124 WCFRETGETWQSIP---TDRLRSTGVLTGPDWLRMGLSSR----RWTHVVWMGVYRRDVI 176 + ++ P S +L + L L + VW +YR D + Sbjct: 122 DMVNGSVRAVKAHPLAGQILRGSDQILPVRNRLFGHLPEDTEVESFPMRVWTAIYRNDFL 181 Query: 177 VKNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHN-TSVSRLHRQGNKNLNY 235 N +KF A + +D ++ A +T + Y+Y + + +S+ R L Sbjct: 182 KVNGLKFQA-ILSEDTIFNLSAYRCANVVAFTGDTDYRYRMDDHSSIMRSFSSKK--LTQ 238 Query: 236 QRHYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIF 295 ++ + L K Y I I Y L V V + + R I + Sbjct: 239 YEQFVDTVQFLAK-EEPYQHDECIMRTKRMAIDYCRLYVGLIVGSGLSVRDKLRNIRRLS 297 Query: 296 TSGM 299 S M Sbjct: 298 QSHM 301 >UniRef50_D1A4K2 CDP-glycerol:poly(Glycerophosphate)glycerophosph otransferase n=1 Tax=Thermomonospora curvata DSM 43183 RepID=D1A4K2_THECD Length = 1157 Score = 286 bits (733), Expect = 8e-76, Method: Composition-based stats. Identities = 77/329 (23%), Positives = 133/329 (40%), Gaps = 18/329 (5%) Query: 6 NKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRL 65 LS+++P+YN + C+ESL QT E+I+++DGSTD S IAK +A RL Sbjct: 3 PLLSIVVPVYNVEPYLQECLESLQNQTMDDFEVIMVDDGSTDGSAAIAKEFASRDGRFRL 62 Query: 66 LHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWC 125 + Q N G ARN G+ ATG Y+AFVD DD V P Y+ ++ + D A C Sbjct: 63 IQQENQGPGPARNTGVRHATGTYLAFVDGDDVVPPNAYQRMINSLEKSGSDFA-CGGVRR 121 Query: 126 FRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIA 185 F G + D T + R W VYR+ +N +F Sbjct: 122 FSSKGIYPSKMHADIFDRPAKKTHISRRPELVGDR----TPWNKVYRKSFWDAHNFQFPR 177 Query: 186 GLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNT---SVSRLHRQGNKNLNYQRHYIKI 242 G ++D A + +Y + + S+++ + + ++ Sbjct: 178 G-LYEDPTVAIPAHVLAEKVDVLSIPVYHWRVREGGERSITQRRTEPGNLEDRVTAIRRV 236 Query: 243 TRLL---------EKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAE 293 ++ L E ++R +ADK+ +Y E + + ++ +E + + E Sbjct: 237 SKFLREHAPELMREYMSRVFADKLPMYIEVLHEGSEHYRQLLQKAVEEFAAEADEEIFQE 296 Query: 294 IFTSGMYKRLITNVRSVKVGYQALLWSFR 322 + K + + + L++ R Sbjct: 297 LPALKRLKYHLLRRGMIDELLEVLVFQQR 325 >UniRef50_Q9F0B8 Beta(1,4)galactosyltransferase EpsJ n=1 Tax=Lactobacillus delbrueckii subsp. bulgaricus RepID=Q9F0B8_LACDE Length = 327 Score = 286 bits (733), Expect = 8e-76, Method: Composition-based stats. Identities = 65/333 (19%), Positives = 139/333 (41%), Gaps = 12/333 (3%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 KLS+I+P+Y + + C+ S++ QT+ E+I+++DGS D+ +I Y++ + +V+++ Sbjct: 2 KLSIIVPVYKVEEYLKDCVNSILNQTFHDFELILVDDGSPDSCPKICDEYSQKFNNVKVV 61 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 H+ N G S ARN G++VATG+Y++F+D+DD + MYE + ++ ++ D+ + + Sbjct: 62 HKQNGGLSSARNAGMKVATGEYISFIDSDDYLASNMYEHVFSIMKKECADIVVVGRCYVY 121 Query: 127 RETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAG 186 + + + + + W VY+R + N++ + G Sbjct: 122 PNGSKKLRE--KQNVYEVMDGPKATAIMNTSLLGYFDAAAWDKVYKRSLF--NDVSYPEG 177 Query: 187 LHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRLL 246 +D T + A R Y +Y Y S++ N + ++ + Sbjct: 178 KLSEDWYTTYKVFAKANRIVYDSTPMYYYRQRGGSITHTSST--VNYDAMYASREVYDFV 235 Query: 247 EKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLITN 306 +K Y + + + V + +P + ++ K+L Sbjct: 236 KKRQPEYTAEANF------AYVFSRIGVIDNLTVQPTVDKQKIRKIRNDMKANIKQLKKT 289 Query: 307 VRSVKVGYQALLWSFRLWQWRDKTRSHHRITRS 339 K+ + + F + + R +S Sbjct: 290 DCFSKLSKKRKIQLFLIEHCLNSYTIIFRRVKS 322 >UniRef50_C9BVV0 Glycosyl transferase n=3 Tax=Enterococcus faecium RepID=C9BVV0_ENTFC Length = 335 Score = 286 bits (733), Expect = 8e-76, Method: Composition-based stats. Identities = 74/330 (22%), Positives = 140/330 (42%), Gaps = 21/330 (6%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 ++S+I+P+Y R C++S++ QT+T E+I+++DGS DNS +I YAE VR Sbjct: 1 MCEISIIVPVYKVEKYLRKCVDSILAQTFTDFEVILVDDGSPDNSGKICDEYAEKDNRVR 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++H+ N G S ARN GI+VA GKY+ FVD+DD + MYE L D D++ Sbjct: 61 VIHKENGGLSSARNAGIDVARGKYLGFVDSDDYIDEDMYEILYENLKIHDADISSVELIP 120 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 + + + ++ + +L + ++ L ++ W +YR+++ N +++ Sbjct: 121 FYGD-----RYKKANKEKKVIILNKKEAIKSVLEGTQFYAYAWNKLYRKELFKDN--RYL 173 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 G +D + +F + + + Y Y N S+ N Sbjct: 174 DGKTFEDAYIIIDLLFQTEKIVVSNEEKYFYLQRNDSIMGKSFSMNNF-----------D 222 Query: 245 LLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLI 304 ++E N + +P+ H V + + + + I T + + Sbjct: 223 VIEAWQYNKEKILDAFPDLHDSYYRRLCWAYFYVLDKMVLSSSYQQIPRKKTVIEF---L 279 Query: 305 TNVRSVKVGYQALLWSFRLWQWRDKTRSHH 334 R+ + Y+ S ++ R Sbjct: 280 KKNRNFILTYKGFTKSRKIAIIALSFRVDW 309 >UniRef50_C3Q1B0 Glycosyltransferase family 2 protein n=2 Tax=Bacteroides RepID=C3Q1B0_9BACE Length = 347 Score = 286 bits (732), Expect = 9e-76, Method: Composition-based stats. Identities = 76/340 (22%), Positives = 143/340 (42%), Gaps = 16/340 (4%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M N K+SVI+P+YN C++S++ QT LEII+++DGS DN + YA+ Sbjct: 1 MENDIPKISVIVPVYNVERYLDRCIQSILNQTLKELEIILVDDGSPDNCPALCDEYAQKD 60 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 ++++H+ N G ARN G+ +A+GK+V+F+D+DD V P MYE L +A + D C Sbjct: 61 ARIKVIHKKNEGLGFARNAGLNLASGKFVSFLDSDDWVAPAMYEALYRVAEKMKCDTVYC 120 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGL-------SSRRWTHVVWMGVYRR 173 + + + + + G + +L L S ++ V +Y R Sbjct: 121 SLQYYYSKDKIIPFEEVGQEVFFRGRKSVDSFLLDMLAPLPSYRSDVKYMVSVCKAIYLR 180 Query: 174 DVIVKNNIKFIAGLH--HQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNK 231 +I N ++F++ +D+ + ++ A + + Y Y+ + S++ + + Sbjct: 181 KIIEDNQLRFVSEKVVASEDMFFHVRYLKLAENVGFIPEYFYNYFQNECSITHTYTEDKI 240 Query: 232 NLNYQRHYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMI 291 + E + H +L + + + K+ + + Sbjct: 241 ERLKRFIQEMHLVFAEYFAEEDYWVRLQRKALH--YLRNSLYIKYQIVKKKSFSIQNYEL 298 Query: 292 AEIFTSGMYKRLITNVRSVKVGYQALLWSFRLWQWRDKTR 331 +I + + N YQ L RL+ K R Sbjct: 299 KQICKDAFFYHYLNNY-----PYQLLPMKHRLFYLLIKYR 333 >UniRef50_D2Q652 Glycosyltransferase, family 2 n=1 Tax=Bifidobacterium dentium Bd1 RepID=D2Q652_9BIFI Length = 320 Score = 286 bits (732), Expect = 1e-75, Method: Composition-based stats. Identities = 73/333 (21%), Positives = 135/333 (40%), Gaps = 15/333 (4%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M+ +SVI+P+Y+ CM SL+ QT+ LEII+++DGS D+ + +A Sbjct: 1 MSEQPLVSVIVPVYSVERSLGRCMHSLVGQTYRHLEIILVDDGSPDDCPRLCDAWAGQDD 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 +R++H+ N G S ARN G++ G +VAFVD+DD V P L D D++ C+ Sbjct: 61 RIRVIHKPNGGLSDARNAGLDAMRGDFVAFVDSDDYVEPDYVNRLYAGV--SDADMSMCS 118 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNI 181 ++ ++ ++LR L R V W +Y + N + Sbjct: 119 IIC--EDSNGKAWQGNEPITDERIAVSSEEYLRRALLDWRL-VVAWNKLYAASIW--NQL 173 Query: 182 KFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIK 241 +F G H+D + + LY Y ++S++ +NL+ ++ Sbjct: 174 RFPVGCIHEDEYVLHQVVDRCKSINVLPDELYHYVSTDSSITHSG-FSIRNLDRLEALVR 232 Query: 242 ITRLLEKLNRNYADKIMI-YPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMY 300 +++NY + + +F + + + + I R I Sbjct: 233 RLEYC--VSKNYRQCAALTFSKFIEDLDF----ASGLDWSDRKIHARLGEIFGQLRRVPL 286 Query: 301 KRLITNVRSVKVGYQALLWSFRLWQWRDKTRSH 333 + ++ + L ++ L +W K H Sbjct: 287 SAGLFLSSKRRMQFFGLRFAPFLTEWILKKAKH 319 >UniRef50_C7UWL4 Glycosyl transferase n=6 Tax=Enterococcus RepID=C7UWL4_ENTFA Length = 326 Score = 286 bits (732), Expect = 1e-75, Method: Composition-based stats. Identities = 71/335 (21%), Positives = 142/335 (42%), Gaps = 10/335 (2%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 K+S+I+P+YN C+ S++ QT+T E+I+++DGS D+S + +AE V+ Sbjct: 1 MPKISIIVPVYNVEKYLEKCVRSILAQTFTDFELILVDDGSPDSSGAMCDQFAEQDQRVK 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++H+ N G S ARN GIE+ATG+Y+ FVD+DD + MYETL + + ++A Sbjct: 61 VIHKENGGLSDARNAGIELATGEYLGFVDSDDYIADDMYETLYNQIVTYEAELATVGMID 120 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 + + +L+ + ++ L S +Y++++ N +++ Sbjct: 121 VYENRESRLT-----DKKEIKILSQNEAIQAVLDSTDVYAYAVNKLYKKELF--NKVRYP 173 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 G +D E + + + Y YY S++ KN + + Sbjct: 174 KGKIVEDAFIIIELLLQCQKIVTNTEQKYYYYRRADSIT-GLSFSEKNFDVIEAWEHNGV 232 Query: 245 LLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLI 304 L++ A+ + + + + + + + +R + FT M + Sbjct: 233 LVDNSYPELAESVHRRICWAYFDVLDKMAISNEKNFNEQRESIKRFLYSNFTFVMKNHVF 292 Query: 305 TNVRSVKVGYQALLWSFRLWQWRDKTRSHHRITRS 339 T R + + + S K + +H ++ Sbjct: 293 TKGRKLSMLALRVNDSCYNAL--AKFKYNHLTKKN 325 >UniRef50_D2Q666 Glycosyl transferase, family 2 n=4 Tax=cellular organisms RepID=D2Q666_9BIFI Length = 329 Score = 285 bits (731), Expect = 1e-75, Method: Composition-based stats. Identities = 75/336 (22%), Positives = 153/336 (45%), Gaps = 10/336 (2%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M T +S+I+P+YN G+ C+ SL+ QT+ +++I+++DGSTD S ++ +A+ Sbjct: 1 MNTGTPLVSIIVPVYNTGNYLNVCVNSLLRQTYANIQVILVDDGSTDGSADLCDAWADKD 60 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 V ++HQAN G S ARN G++ A G ++ FVD+DD + E L+ + + +A Sbjct: 61 SRVMVIHQANGGVSTARNAGLDAAEGDWLEFVDSDDWLEREAIEGLLGLVQSEHAQMAIF 120 Query: 121 NADWC-FRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKN 179 N +T T +D S+GVL+ D L + L+ W + R++I + Sbjct: 121 NYRSVLDEDTPYTVCEKTSDYRISSGVLSRKDALDVILAYSGVKGYAWNKFFSRELIERQ 180 Query: 180 NIKFIAGL-HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRH 238 N++F + + +D++++ E+ + T++ LY Y + S S + Sbjct: 181 NLRFDSKITMCEDLLFSVEYALLSTVTVGTDRCLYNYRNNPNSASHKLDLESVATC-LEA 239 Query: 239 YIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSG 298 + ++ ++ + +++ + Y +++ A ++ + I + Sbjct: 240 HKRMVSIVPRESKS--SVLASYAILAEELLLRTYANDDATHRDEYLA-----ILRKYWWH 292 Query: 299 MYKRLITNVRSVKVGYQALLWSFRLWQWRDKTRSHH 334 KRL + +++ S W + H+ Sbjct: 293 ALKRLRPSRYWMRILGGVFCPSLFYPIWNHEKGKHY 328 >UniRef50_A4VTV1 Glycosyltransferases involved in cell wall biogenesis n=10 Tax=Streptococcus suis RepID=A4VTV1_STRSY Length = 334 Score = 285 bits (731), Expect = 1e-75, Method: Composition-based stats. Identities = 76/346 (21%), Positives = 142/346 (41%), Gaps = 18/346 (5%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 +S+I+P+YN C+ S++ QT+ +EI+++NDGSTDNS EI YA+ +R Sbjct: 1 MINISIIVPIYNVEQYLSKCINSIVNQTYKHIEILLVNDGSTDNSEEICLAYAKKDSRIR 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 + N G S ARN GI A G Y+AF+D+DD ++ + L ++ VA D Sbjct: 61 YFKKENGGLSDARNYGISRAKGDYLAFIDSDDFIHSEFIQRLHEAIERENALVAVAGYDR 120 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRW-THVVWMGVYRRDVIVKNNIKF 183 + VL+G + + L + V W +Y++++ + +F Sbjct: 121 VDASG---HFLTAEPLPTNQAVLSGRNVCKKLLEADGHRFVVAWNKLYKKELFE--DFRF 175 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKIT 243 G H+D +T ++ + ++ LY Y S+ ++ + + Sbjct: 176 EKGKIHEDEYFTYRLLYELEKVAIVKECLYYYVDRENSIITSSMTDHR-------FHCLL 228 Query: 244 RLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRL 303 + Y + ++ A V + + +Q+ + + +YK+L Sbjct: 229 EFQNERMDFYESRGDKELLLECYRSFLAFAVLFLGKYNHWLSKQQKKLLQTLFRIVYKQL 288 Query: 304 ITNVRSVKVGYQALL-----WSFRLWQWRDKTRSHHRITRSAFNLR 344 N R + L +F ++ K + R+ RS + R Sbjct: 289 KQNKRLALLMNAYYLVGCLHLNFSVFLKTGKDKIQERLRRSESSTR 334 >UniRef50_D2Q658 Glycosyl transferase, family 2 n=3 Tax=Bacteria RepID=D2Q658_9BIFI Length = 332 Score = 285 bits (731), Expect = 1e-75, Method: Composition-based stats. Identities = 79/333 (23%), Positives = 139/333 (41%), Gaps = 23/333 (6%) Query: 1 MMNSTN---KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYA 57 M N +SVI+P+YN C++S++ Q++ LEII+++DGS D +A Sbjct: 1 MSNEVPDELLISVIVPVYNVERYLDQCVKSIVEQSYRNLEIILVDDGSRDACPGKCDMWA 60 Query: 58 ENYPHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALE-DDLD 116 + ++H+AN G S ARN G++ A G +AFVD+DD + P M T+ E ++D Sbjct: 61 AKDNRISVIHKANGGLSSARNAGLDQARGDCIAFVDSDDWIDPNMLRTMWQWMQEQHEVD 120 Query: 117 VAQCNADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSR-RWTHVVWMGVYRRDV 175 V C + F + D V T LR L R R VW ++ Sbjct: 121 VVMCGTEKNFEDGDTEHI----DGHLPARVFTSDQALRSFLYHRDRMASAVWNKLFDARF 176 Query: 176 IVKNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNY 235 ++ I+F GL+++D + ++LY+Y + S++ + + + Sbjct: 177 FREDGIRFPEGLNNEDYYVLAHVYRTMKGIYFNPKALYRYRIRKNSITTASLNEH-SFDR 235 Query: 236 QRHYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIF 295 K + L + YAD + + A++ + + + L + M AE+ Sbjct: 236 AIIADKCCKYLSESG--YAD--------RNALAFFAMQGRYDIVYD---LASKHMGAEVL 282 Query: 296 TSGMYKRLITNVRSVKVGYQALLWSFRLWQWRD 328 K + K G A+ ++W Sbjct: 283 AVWREKLAYAAKKVYKDGTLAISHKVKIWMMAH 315 >UniRef50_B4AD18 Glycosyltransferase n=18 Tax=Enterococcus RepID=B4AD18_ENTFA Length = 326 Score = 285 bits (731), Expect = 1e-75, Method: Composition-based stats. Identities = 72/337 (21%), Positives = 150/337 (44%), Gaps = 24/337 (7%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 +++S+I+P+YN + + C+ES+++QT+T E+++++DGSTD+S E+ ++ Sbjct: 1 MSEISIIVPVYNVENYLKKCVESILSQTFTDFELLLVDDGSTDSSGEMCDELKRLDERIK 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++H+ N G S ARN GI+VA GKY+ FVD+DD + M E L +++D D++ Sbjct: 61 VIHKENGGLSSARNAGIDVAKGKYLTFVDSDDYIDTHMLEVLYKNMVQEDADLSIVGVTS 120 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 + P V+T + +M L ++ + +Y+R++ ++++ Sbjct: 121 VYSGQE------PEITPSEKYVVTQKEATKMILIGKQASVYAVAKLYKRNIFD--DLRYP 172 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 G H+D+ + + + S Y Y S++ + ++L + + Sbjct: 173 VGKAHEDVYVIMDVLNQCKKVVVDTASYYFYVHREGSIT-ASKFNIRDLESIDSWEQAWH 231 Query: 245 LLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLI 304 ++++ Y+D RVC A D + + +E M KR++ Sbjct: 232 TIQEIMPEYSDLA-------------FKRVCAANFYVLDKMMKSDSESE--YPEMTKRIV 276 Query: 305 TNVRSVKVGYQALLWSFRLWQWRDKTRSHHRITRSAF 341 ++ + A + + + H + F Sbjct: 277 GFLKENQKFIYANPYFTKNRKLSLTFLRIHPMLYKVF 313 >UniRef50_C0C2F7 Putative uncharacterized protein n=1 Tax=Clostridium hylemonae DSM 15053 RepID=C0C2F7_9CLOT Length = 326 Score = 285 bits (731), Expect = 1e-75, Method: Composition-based stats. Identities = 74/321 (23%), Positives = 132/321 (41%), Gaps = 10/321 (3%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 +S+I+P+YN C++S++ QT+ E+I++NDGS D S I + YA+ VR Sbjct: 1 MTAVSIIVPVYNVEKYLNKCVDSILGQTFEDFELILVNDGSVDRSPSICRDYADKDGRVR 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 + Q N G ARN GIE A G Y+ FVD+DD + M E L D+A C Sbjct: 61 YISQENMGPGRARNVGIEAAGGAYILFVDSDDYIAENMLEILYGNITTSGADMATCGLYN 120 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 ++ + IP G + + L+ + + + +++ +I + Sbjct: 121 VYQ-----QRCIPQYEGIEQFACDGGEAFGLLLTGEKIPGSSCNKLIKAEILK--DIGYP 173 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 G+ ++D+ + TE M LY Y S++ + + + + Y R Sbjct: 174 EGIVYEDVGFHTELMQKVQSVYVDTTPLYYYVHRENSIT-TRKFDSDAMMFIYAYEDTLR 232 Query: 245 LLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLI 304 ++EK + + + + R+ P+ R + + Sbjct: 233 VVEKKYPAILTEARFKLIWAYFAILDRMLQEDGYREIPEYRQVIRFLKRNTVRIVKNPYF 292 Query: 305 TNVRSVKVGYQALLWSFRLWQ 325 R K+G ALL + RL++ Sbjct: 293 HRAR--KLGALALLLNVRLYR 311 >UniRef50_B3DP51 Glycosyltransferase for cell wall membrane n=10 Tax=Bifidobacterium RepID=B3DP51_BIFLD Length = 391 Score = 284 bits (727), Expect = 4e-75, Method: Composition-based stats. Identities = 63/273 (23%), Positives = 107/273 (39%), Gaps = 3/273 (1%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M +S+IIP+Y C+ S+ QT+ LEI++++DGS DN + +A P Sbjct: 1 MEQDPLVSIIIPVYKVEKFLDECVASVAAQTYANLEILLVDDGSPDNCPAMCDAWAARDP 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 +R++H+ N G S ARN GI A G Y+ F D+DD V PT+ E + E D D+ Sbjct: 61 RIRVIHKPNGGLSDARNSGIAEAIGAYIYFADSDDTVAPTLVEDCLNAMREYDADLVMFQ 120 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNI 181 D + + VLT + ++ + + W + Sbjct: 121 FDTISENNKPLLSNYRHNDFTEVQVLTPVEVIKKQVKAE-IDGYFWAFLAPASTYQGTGF 179 Query: 182 KFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIK 241 F G +D+ + A R ++LY Y + S++ ++ R Sbjct: 180 SFPVGRKIEDLSRICNVIGEATRVVRLPKALYHYRMREGSIT-ASWDPQLTRDWTRAADD 238 Query: 242 ITRLLEKLNRNYADKIMIYP-EFHQQITYEALR 273 + + + F + YE +R Sbjct: 239 REEYIVHRFPELKGFMTLQQLNFFANLDYETMR 271 >UniRef50_C5NWF7 Glycosyl transferase, group 2 family n=2 Tax=Firmicutes RepID=C5NWF7_9BACL Length = 320 Score = 283 bits (725), Expect = 6e-75, Method: Composition-based stats. Identities = 70/337 (20%), Positives = 132/337 (39%), Gaps = 18/337 (5%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 K+SVI+P+YN C++S+I QT+ LEII++NDGSTDNS ++ A +R Sbjct: 2 DEKISVIVPVYNVEQYLERCVDSIINQTYKNLEIILVNDGSTDNSGQLCDELARKDDRIR 61 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++H+ N G S ARN GI+ A + + F+D+DD + MYE LM + + D++ C Sbjct: 62 VVHKKNGGLSDARNVGIDEAEAELIGFIDSDDYIDEDMYEVLMNNLKDANADLSMCGHYD 121 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 + E + + T L+ + ++M + ++ + +Y++ + + KF Sbjct: 122 VYNNVPEAQVA-----DKKTWELSPQEAIKMVMEAKILSVTAVNKLYKKSLFSE--FKFE 174 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 G +D + + + T + Y Y S++ +K Sbjct: 175 IGKIAEDAFIMIKLLDKCNKIVATNEKKYYYVHRENSITTQKFS-----------LKFLN 223 Query: 245 LLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLI 304 ++E +N I YPE + V + Y + Sbjct: 224 VIEAYEQNKEIIIKKYPELKEVAQMRMNWAYFYVLDRLLLDKEYNDKKLENKLISYLKDH 283 Query: 305 TNVRSVKVGYQALLWSFRLWQWRDKTRSHHRITRSAF 341 + + + ++ I + + Sbjct: 284 SKGILTDSLFTKGRKIGFVALLLNRNLYRKIIEKRGY 320 >UniRef50_B2ISC6 Glycosyl transferase, family 2/glycosyl transferase family 8 n=8 Tax=Streptococcus pneumoniae RepID=B2ISC6_STRPS Length = 696 Score = 282 bits (723), Expect = 1e-74, Method: Composition-based stats. Identities = 76/340 (22%), Positives = 138/340 (40%), Gaps = 9/340 (2%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M +K++VI+P+YN + R C++S+ITQT+ +EI+++NDGSTD S EI K ++E Sbjct: 1 MLVDDKITVIVPVYNVENYLRKCLDSIITQTYKNIEIVVVNDGSTDASGEICKEFSEMDH 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 + + Q NAG S ARN G+ +G YV FVD+DD + ETL E D+A N Sbjct: 61 RILYIEQENAGLSAARNTGLNNMSGNYVTFVDSDDWIELDYVETLYKEITEYQADIAVGN 120 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTH----VVWMGVYRRDVIV 177 G + I D V ++ W +Y+ + Sbjct: 121 YYSFNESEGMFYFHISGDSY-YEKVYDNVSIFENLYETQEMRSFALISAWGKLYKARLFE 179 Query: 178 KNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQR 237 ++F G +D + + + Y +SLY Y + S+SR Sbjct: 180 --QLRFDMGKLGEDGYLNQKVYLLSEKVIYLNKSLYAYRIRKGSLSR-IWTEKWMHALVD 236 Query: 238 HYIKITRLLEKLNRNYADKIMIYPE-FHQQITYEALRVCHAVRKEPDILTRQRMIAEIFT 296 + LL + + IY + ++ + +QR++ ++ Sbjct: 237 AMSERITLLANMGYPLEKHLAIYRQMLEFSLSNGQASGLSDTATYKEFEMKQRLLNQLSR 296 Query: 297 SGMYKRLITNVRSVKVGYQALLWSFRLWQWRDKTRSHHRI 336 ++ + + +L + + + +++ + I Sbjct: 297 QEESEKKAIVLAANYAYVDQVLTTIKSICYHNRSIRFYLI 336 >UniRef50_C8NFZ4 Glycosyl transferase CpsJ(V) n=1 Tax=Granulicatella adiacens ATCC 49175 RepID=C8NFZ4_9LACT Length = 328 Score = 281 bits (721), Expect = 2e-74, Method: Composition-based stats. Identities = 76/330 (23%), Positives = 141/330 (42%), Gaps = 14/330 (4%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 K+S+I+P+YN C+ESL+ Q++ +EI++INDGSTDNS ++ A+ + Sbjct: 12 EPKISIIVPVYNVEQYLERCVESLMNQSYKNIEILLINDGSTDNSGKLCDEIAKRDSRIT 71 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 + H+ N G S ARN G++ AT +V FVD+DD V MYE L++ L+ D +++ C Sbjct: 72 VYHKENGGLSDARNYGVDKATADFVGFVDSDDYVDEDMYEVLLSNLLKYDAEISFCRLYD 131 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 + + Q + + ++T + ++M L ++ ++ +Y++ + N I+F Sbjct: 132 VYND-----QVLKDESKNEPYLMTSEEAIKMVLEAKIFSVTAVNKLYKKSLF--NQIRFE 184 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 G +D + + + TE Y Y S++ + K LN Y K + Sbjct: 185 KGKIAEDAFLMVDLLSRCQKIAATEVHKYYYMHRENSIT-TQKFTPKFLNVIEAYEKNAK 243 Query: 245 LLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLI 304 ++ + N + +I + V + K+ I YK+ I Sbjct: 244 IVAEKYPNLQYQAD------TRICWAYFYVLDRLLKDEAYKDSALEAQLIQNLKKYKKFI 297 Query: 305 TNVRSVKVGYQALLWSFRLWQWRDKTRSHH 334 + + + K Sbjct: 298 LGNSLFNNKRKLSFLLLLIHKNLYKIIIKR 327 >UniRef50_C7H3W5 Glycosyl transferase CpsJ n=1 Tax=Faecalibacterium prausnitzii A2-165 RepID=C7H3W5_9FIRM Length = 324 Score = 281 bits (719), Expect = 4e-74, Method: Composition-based stats. Identities = 74/297 (24%), Positives = 130/297 (43%), Gaps = 16/297 (5%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 +S+I+P+YN C+ SL+ Q++ +E+++++DGSTD S + YA VR Sbjct: 2 DPLVSIIVPVYNVKPYLNRCVNSLLGQSYQNMELLLVDDGSTDGSETLCGEYAAQDARVR 61 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 +LH+ N G S ARN G++ A G+Y++FVD DD V P E L + D + + Sbjct: 62 VLHKKNGGLSDARNAGVDAAKGEYLSFVDGDDWVSPYYIENLYRALEQAGADFSASCFEE 121 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 F G+ QS+PT+RL + +L+ + LR L +Y+R + +++ Sbjct: 122 VFE--GQPVQSVPTERLEAFEILSREECLRRILYQEGMEVTTPTKLYKRALFE--GVRYP 177 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 G ++DI E A +A + Y Y+ +S+ + K Sbjct: 178 VGKLYEDIPVAYEVTKRAHKAAHIANRDYYYFQRGSSIQNMAFNPRK----LDSVRHCHA 233 Query: 245 LLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYK 301 L+E + R++ VC+ + + D + I + + K Sbjct: 234 LMENVKRDFPRLSRAAE------CRYLSNVCNILFQIQD--RQHEKIEKALWQEVKK 282 >UniRef50_Q8GPB8 Eps6I n=5 Tax=Streptococcus RepID=Q8GPB8_STRTR Length = 328 Score = 280 bits (718), Expect = 4e-74, Method: Composition-based stats. Identities = 79/338 (23%), Positives = 144/338 (42%), Gaps = 11/338 (3%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M L+V+IP+YN + C+ES++ Q W +I++++DGSTD+S +I YA+ Y Sbjct: 1 MKKDVLLTVVIPVYNVEKYLKRCVESVLVQEWHNYDILLVDDGSTDSSPQICDDYAKVYD 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 + ++H+ N G S ARN GI A G+YV F+D+DD + P + L + D D+ N Sbjct: 61 FISVIHKKNGGLSAARNTGISNAEGEYVYFLDSDDWIEPNTFSDLAEVIESDQYDIISFN 120 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNI 181 ++ E I + T LTG + L S T +YR+ + KN I Sbjct: 121 QEFVKSE-----HDIIKSDSKRTKRLTGKEALIDMFSYGFVTGFATDKIYRKALFTKNTI 175 Query: 182 KFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYL-HNTSVSRLHRQGNKNLNYQRHYI 240 +F G +++D+ + A + T Q Y Y + + S+++ K N Y Sbjct: 176 QFPVGKYYEDLGTNYKLFLAAKKVYATNQKYYHYLIDNPDSITKS-WNEQKFENMISFYK 234 Query: 241 KITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMY 300 ++ + ++ + ++ I F L + + + Sbjct: 235 ELYYSDDVRSKLDSSELAIAKIFFVNGMTHILASLFKSNLYKTYPKLTKCVKKELKINKL 294 Query: 301 KRLITNVRSVKVGYQALLWSFRLWQWRDKTRSHHRITR 338 K + +K+ + + +RL + H + + Sbjct: 295 K----FQQKLKLPNRTKYFLYRLGLLNGAFQLQHILKK 328 >UniRef50_D2Q667 Glycosyl transferase, family 2 n=3 Tax=Bifidobacterium RepID=D2Q667_9BIFI Length = 327 Score = 280 bits (716), Expect = 6e-74, Method: Composition-based stats. Identities = 57/300 (19%), Positives = 117/300 (39%), Gaps = 11/300 (3%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M S +SV+IP+Y C+ S++ Q L+I++++DGS D+ ++ +A Sbjct: 1 MTSQTVVSVVIPVYGVESYLDRCIRSVVGQDHERLDIVLVDDGSPDDCPQMCDEWAARDS 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 +R++H+ N G + ARN G++V G V FVD+DD V L+ + D+ C+ Sbjct: 61 RIRVVHKKNGGLASARNAGLDVIAGDMVTFVDSDDYVESNYVSELLHWHEQSGADIVMCS 120 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIV-KNN 180 + + R G++T + + L VW ++ ++ Sbjct: 121 CMHDLEDGSIVTDA----RCTPEGIMTSEEAMIQFLYHLSLAGPVWGKMFDARFFKSEHG 176 Query: 181 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYI 240 ++F GL+ +D + + + LY Y + S+ ++ + + + + Sbjct: 177 VRFHEGLNSEDYYVLAQVFLQMNKIFVRMEPLYHYRIRQGSIVHMNSFNDHSCDEIQIAD 236 Query: 241 KITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMY 300 L + + + Y + V + + R IA ++ Sbjct: 237 LCCDYLARHG-YHNQSALNYFRLQSRA-----DVLFNIFQSDTPHEIYRRIARDMRRYLW 290 >UniRef50_A6NTS0 Putative uncharacterized protein n=1 Tax=Bacteroides capillosus ATCC 29799 RepID=A6NTS0_9BACE Length = 353 Score = 280 bits (716), Expect = 6e-74, Method: Composition-based stats. Identities = 72/338 (21%), Positives = 137/338 (40%), Gaps = 14/338 (4%) Query: 2 MNSTNK--LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAEN 59 M +S+I+P+Y R C++S++ QT+T +E+I+++DGS DN I YA+ Sbjct: 1 MEELKLSEISIIVPVYKVEPYLRRCVDSILAQTFTDIEVILVDDGSPDNCPAICDKYAQQ 60 Query: 60 YPHVRLLHQANAGASVARNRGIEVA----TGKYVAFVDADDEVYPTMYETLMTMALEDDL 115 V ++HQ G SVARN G+ A ++++FVD+DD V+P E L A E+++ Sbjct: 61 DNRVIVIHQEYQGVSVARNTGLNWALLNSNSQWISFVDSDDWVHPRFLEYLHRAATENNV 120 Query: 116 DVAQCNADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDV 175 ++ C+ E+ P + + L L VVW +Y + + Sbjct: 121 QISVCDYCR------ESIYEEPKSNVEYDTLKVTSLQLYQHLEKSLLFTVVWNKLYYKSL 174 Query: 176 IVKNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNY 235 K I+F G +D + + M+ + + + LY YY++ T ++ + + Sbjct: 175 FEK--IRFPVGKISEDTFVSYKLMYKCPQIVFIKHPLYFYYINQTGITSSTYSPVRLVEL 232 Query: 236 QRHYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIF 295 ++ E+ ++ +I + +++ L ++ Sbjct: 233 DALKLQQQFFHEQQATDWEKSCIIRTIRTYAFHIQKCSEFPLLKQYETKLQQELRRLLKS 292 Query: 296 TSGMYKRLITNVRSVKVGYQALLWSFRLWQWRDKTRSH 333 YK + + SF R K + Sbjct: 293 NKKFYKLSLPRDAWISETVYPKRMSFYWKMMRLKVLIY 330 >UniRef50_C6LCR2 Glycosyl transferase CpsJ n=1 Tax=Bryantella formatexigens DSM 14469 RepID=C6LCR2_9FIRM Length = 314 Score = 279 bits (715), Expect = 8e-74, Method: Composition-based stats. Identities = 63/253 (24%), Positives = 114/253 (45%), Gaps = 7/253 (2%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 N +SVI+P+Y C+ S++ Q++ +E+I+++DG+ DN ++ +A+ Sbjct: 6 NNRYPLISVIVPVYMVEPYLDRCITSIVRQSFRDIEVILVDDGTLDNCGKMCDDWAKKDR 65 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 +R+ HQ N G S ARN GI A GKY+ FVD+DD + M E L+ LE D+A CN Sbjct: 66 RIRVFHQKNYGLSAARNTGINHAQGKYLCFVDSDDYIETDMLEKLLAALLEQQADMAVCN 125 Query: 122 ADWCFRETGETWQSIPTDRLRS-TGVLTGPDWLRMGLSSR-RWTHVVWMGVYRRDVIVKN 179 + + + ++ P +TG + + M + + V W +Y+R + K Sbjct: 126 FVYEYERAEQIYRKNPESYQNDIDQTITGREIMLMAEQGKHSYCVVAWNKLYKRSLFEK- 184 Query: 180 NIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHY 239 +++ G H+D M R + Y Y S+ + + + Sbjct: 185 -LRYPEGKIHEDEFVFHHIMSACQRVICLRYTGYHYVQREGSIMHR---NDSMQDAAEAF 240 Query: 240 IKITRLLEKLNRN 252 ++ R + + Sbjct: 241 LERCRFFLERDDR 253 >UniRef50_P71057 Putative glycosyltransferase epsH n=3 Tax=Bacillus subtilis group RepID=EPSH_BACSU Length = 344 Score = 279 bits (715), Expect = 9e-74, Method: Composition-based stats. Identities = 75/325 (23%), Positives = 134/325 (41%), Gaps = 10/325 (3%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 T +S+++ +YN RTC+ESL QT +EIII+NDGS D S +IA+ YA+ Sbjct: 2 ETPAVSLLVAVYNTETYIRTCLESLRNQTMDNIEIIIVNDGSADASPDIAEEYAKMDNRF 61 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 +++HQ N G RN+GIE A G+++AF+D+DD + P E ++ A ++ D+ CN Sbjct: 62 KVIHQENQGLGAVRNKGIEAARGEFIAFIDSDDWIEPDYCEQMLRTAGDE-TDLVICNYA 120 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 F +TG+T S + + L + W +YRR +I + + F Sbjct: 121 AEFEDTGKTMDSDIAQTYQDQ----PKEHYIKALFEGKVRGFSWNKLYRRSMIEAHRLSF 176 Query: 184 I---AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYI 240 H +D ++ F A Y + LY Y +H +S+ + + K Sbjct: 177 PLRGELEHVEDQFFSFRAHFFARSVSYVKTPLYHYRIHLSSIVQRY--QKKLFESGLALY 234 Query: 241 KITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMY 300 + + N + F + + + + R + I ++ Sbjct: 235 ETNAAFLQENNKLEEYRKELDTFIVLHSSICMLNEWKTSGSRRLFEKLRNVGVICADPVF 294 Query: 301 KRLITNVRSVKVGYQALLWSFRLWQ 325 + ++ + + Sbjct: 295 QESLSKTGTAPFDAKRSCLLLMAKY 319 >UniRef50_B4F184 Glycosyl transferase n=2 Tax=Proteus mirabilis RepID=B4F184_PROMH Length = 327 Score = 279 bits (715), Expect = 9e-74, Method: Composition-based stats. Identities = 93/328 (28%), Positives = 158/328 (48%), Gaps = 2/328 (0%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 T +SVI+P++N +ESL+ QT+ E+IIINDGSTDNSV+IA Y + Sbjct: 2 KTPIISVIVPMFNEASRITRLLESLLCQTFHDFEVIIINDGSTDNSVDIAMLYCQQDNRF 61 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 L HQ N G S ARN G++ A G ++ F D+DD + P + +A E +DV N + Sbjct: 62 HLYHQTNQGLSSARNTGLKYAQGDWIVFFDSDDFIKPQLLAHWHQLACEQHIDVLIGNGE 121 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 + Q+ R V++G +W+ ++ +W H VW+ R ++I K++++F Sbjct: 122 RYDVHNPQKHQTTIHQRQPYQQVISGQEWVIHAVTQHQWPHFVWLQFIRHEIIKKHHLRF 181 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKIT 243 I GL+H+DI+WTT+ R + +Q LY Y + S++R + Q + + Sbjct: 182 IDGLYHEDILWTTQLALVTQRIAFDDQPLYYYCANPNSITRKPNPQKEAKRAQSYLQIVI 241 Query: 244 RLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRL 303 +LL+ ++ + + ++ QQ E RK D + + + T ++ L Sbjct: 242 QLLQLADKQH--QPLLANALRQQALRELGSFFILFRKRCDTKNQAIIAQQFSTYHLFPAL 299 Query: 304 ITNVRSVKVGYQALLWSFRLWQWRDKTR 331 ++ + L+ R K R Sbjct: 300 NKGAQNWHQRWFICRVYCVLYWHRWKKR 327 >UniRef50_D1NRU6 Glycosyl transferase CpsJ n=1 Tax=Bifidobacterium gallicum DSM 20093 RepID=D1NRU6_9BIFI Length = 345 Score = 279 bits (715), Expect = 9e-74, Method: Composition-based stats. Identities = 69/335 (20%), Positives = 130/335 (38%), Gaps = 5/335 (1%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 S +SVI+P+Y C+ES++ QT+ LEI++++DGS DN + +A + Sbjct: 14 SNPLVSVIVPIYGVEAFLDQCVESIVNQTYANLEILLVDDGSKDNCHAMVDEWARRDSRI 73 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 LH+ N G S ARN G+E A+G+YV FVD+DD + P + +T + + + D D+ + Sbjct: 74 IALHKPNGGLSDARNYGLERASGEYVYFVDSDDWIEPQLVQTALRTSQDTDSDMVVIAYN 133 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 + + S +++ + G + + L L + + + W +R V I+F Sbjct: 134 SANEDGSHVFPSKDSEKFPAEGRRSSAEALTE-LWNDKIQNFSWSIFAKRSVYD--GIRF 190 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKIT 243 +D+ T + A + LY Y + + S+ + Sbjct: 191 PLNQLMEDMGTTYKLYDAANAVYFLPVCLYNYRVRSNSILDRKPPAMSADTVKDIKAIDI 250 Query: 244 RLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRL 303 ++ ++ + A + R + TS + K Sbjct: 251 FAQQRYPELLEKELNWSVRYLSAAIIWASESKQQFDRPSSYRAFVRSTRKYMTSRIKKLG 310 Query: 304 ITNVRSVKVGYQALLWSFRLWQWR--DKTRSHHRI 336 + + + ++ K R H RI Sbjct: 311 VRRMTLTNMLKVTPIYLHCQPALTAVSKLRDHRRI 345 >UniRef50_C0BRQ0 Putative uncharacterized protein n=2 Tax=Bifidobacterium RepID=C0BRQ0_9BIFI Length = 331 Score = 279 bits (715), Expect = 1e-73, Method: Composition-based stats. Identities = 76/340 (22%), Positives = 139/340 (40%), Gaps = 19/340 (5%) Query: 3 NSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPH 62 ++ +SVI+P+YN C+ESLI QT+ LEII+++DGSTD+S E +A Sbjct: 4 STQPLISVIVPVYNVERYLDQCVESLIGQTYERLEIILVDDGSTDSSCEQCNAWANRDNR 63 Query: 63 VRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNA 122 +R + Q NAG + ARN G+++A G+Y+ FVD+DD V P M+ TL+ E D D++ + Sbjct: 64 IRAVRQCNAGLAAARNTGLDLAKGEYIGFVDSDDYVLPDMFGTLLHNLQESDADLSIISY 123 Query: 123 DWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIK 182 + + + P ++ V+T + + W + ++++ N++ Sbjct: 124 ERENPDGSTYCNAFPDKKI----VMTSQEAFAYVNQHGYFYVTAWDKLAKKELFD--NLR 177 Query: 183 FIAGLHH-QDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIK 241 + + +D T + + A R Y LY+Y + S S + Sbjct: 178 YPLDAVYAEDSPVTYQLLDKADRIVYDSTPLYRYRMSENSQSHGITD--------KFAQS 229 Query: 242 ITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRM----IAEIFTS 297 +L+ + Y H + + +R + E+ Sbjct: 230 TGAMLDLVRTKYPHVEAYAAYGHLESIVGTCNRIMLAHQRKQWAQFERYARTELRELLPM 289 Query: 298 GMYKRLITNVRSVKVGYQALLWSFRLWQWRDKTRSHHRIT 337 K +I + ++ A + + R H I Sbjct: 290 VERKGIIGKSQLLQWKLLATSPTLYGMMYALYKRRHPEIA 329 >UniRef50_D2RJ84 Glycosyl transferase family 2 n=1 Tax=Acidaminococcus fermentans DSM 20731 RepID=D2RJ84_ACIFE Length = 324 Score = 278 bits (712), Expect = 2e-73, Method: Composition-based stats. Identities = 71/318 (22%), Positives = 133/318 (41%), Gaps = 5/318 (1%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 LSVI+P+YN C++S+I QT+ LEII+++DGSTD+S I YA + + ++ Sbjct: 2 LLSVIVPVYNVEKYLCKCLDSIIQQTYKNLEIILVDDGSTDSSGLICDNYAAHDKRIHVI 61 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 H+ N G S ARN G+E+ +G Y+ FVD+DD + M+E L+ ++LDVA C + Sbjct: 62 HKKNEGLSEARNSGLEICSGDYIGFVDSDDWIATDMFEKLVKFVKCENLDVAMCGVADVW 121 Query: 127 RETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAG 186 E S L + + VW +YR + +KF G Sbjct: 122 PEHVVKTPSFKKVILTDVNDIISEILVNR---HGGTAVPVWCRIYRSSLFKT--LKFEKG 176 Query: 187 LHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRLL 246 +++D + +++ R S Y Y S++ + + ++++ Y + Sbjct: 177 RYYEDGFYLLKWIERTGRFGRFSDSGYFYNHREGSITNIQGHSKRVDDFRKAYEDNYNYV 236 Query: 247 EKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLITN 306 ++ N + F + + ++ ++ I+ + I Sbjct: 237 KQHFPNSLNAAEYRLFFTYRSILNLVGNTDLPYSHEIANKFKKNLSHIWKNPYMNIKIKL 296 Query: 307 VRSVKVGYQALLWSFRLW 324 + ++ R + Sbjct: 297 IYTLIGIDPVWYLLLRQF 314 >UniRef50_Q03KL3 Glycosyltransferase involved in cell wall biogenesis n=1 Tax=Streptococcus thermophilus LMD-9 RepID=Q03KL3_STRTD Length = 326 Score = 278 bits (712), Expect = 2e-73, Method: Composition-based stats. Identities = 75/331 (22%), Positives = 139/331 (41%), Gaps = 8/331 (2%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 MN + ++S+++P+YN D C++S++ Q+ E+II++DGSTD S + Y+E+ Sbjct: 1 MNDSVEISIVLPVYNVIDYLNECIKSILRQSNKNFELIIVDDGSTDGSQILCDRYSESDS 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 ++++HQ NAG S ARN G+ A GKY+ F+D+DD V E L + +E D++ C+ Sbjct: 61 RIKVIHQKNAGLSAARNTGLNHAKGKYITFIDSDDFVGEKYIEVLYKIIIESCADISMCD 120 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSS--RRWTHVVWMGVYRRDVIVKN 179 ++T + + G + + + V W +Y+ ++ N Sbjct: 121 YMEVAQDTQISDVTFQEKYPIDKYYYNGKEAIANVYKNSSHGMEFVSWAKMYKLELFKSN 180 Query: 180 NIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHY 239 I F H+D T + + + + + LY Y + + S+ + Sbjct: 181 GILFPEEKLHEDAFTTYKLFYASNKIAFVNAPLYFYRIRSGSIMTSEFTEKRL-----DM 235 Query: 240 IKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMI-AEIFTSG 298 I+ TR R D ++Y F + L + D R + E+ Sbjct: 236 IQATREEYLFFRENRDFELMYLAFVDYLHKVKLILKMIFASAGDNREITRSVCKELDHDI 295 Query: 299 MYKRLITNVRSVKVGYQALLWSFRLWQWRDK 329 + N+ +K Y + F + Sbjct: 296 EECKKSINIPMIKYVYYKGMAKFPRIFTSIR 326 >UniRef50_A6L7A5 Glycosyltransferase family 2 n=6 Tax=Bacteroides RepID=A6L7A5_BACV8 Length = 315 Score = 278 bits (712), Expect = 2e-73, Method: Composition-based stats. Identities = 71/325 (21%), Positives = 142/325 (43%), Gaps = 15/325 (4%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 +SVIIP+YN C+ S++ QT T +E++++NDGSTD S +I YA ++ Sbjct: 1 MPLVSVIIPIYNTEKFLPLCINSVLNQTLTDIEVLLVNDGSTDGSGKICDEYACKDQRIQ 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++H N G S ARN+G+E A G+Y+AF+D+DD + M TL + ++ +A C Sbjct: 61 VIHTLNQGVSHARNQGLETAKGEYIAFMDSDDWIETDMIATLYQLIRTNEAGLATCGYII 120 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSR--RWTHVVWMGVYRRDVIVKNNIK 182 + + I +G LT + + + R ++ +W +Y +++I K+++K Sbjct: 121 ENEDG----RPIYHINEVKSGKLTQWEAIHSLFNDRHYKYKGNLWDKLYHKEIIDKHHLK 176 Query: 183 FIAGLHH-QDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIK 241 F +++ +D ++ +++ + A YT Y Y +S L ++ K Sbjct: 177 FNEHIYYNEDRLFIFQYLSHCQSAAYTTSPYYHYVTRKSSAMNLSQKNYTP--------K 228 Query: 242 ITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYK 301 + ++ + + + ++ + ++ + + + Sbjct: 229 LCTFMDAFDLMTSLSATFPTYILRALSADYIKSSFLFYTQHSREIPFNELWDRMLRIRKN 288 Query: 302 RLITNVRSVKVGYQALLWSFRLWQW 326 I S+K+ Y L W Sbjct: 289 NYIHLPLSLKIKYALYGIRLILRAW 313 >UniRef50_B2UQP2 Glycosyl transferase family 2 n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UQP2_AKKM8 Length = 358 Score = 278 bits (711), Expect = 3e-73, Method: Composition-based stats. Identities = 70/218 (32%), Positives = 102/218 (46%), Gaps = 2/218 (0%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S+I+P YN CMESL Q+ +EII +NDGS+D + EI + + + VR++ Sbjct: 5 VSIIVPCYNVAAYLDQCMESLAGQSMEDIEIICVNDGSSDRTAEILREWRDRDGRVRVID 64 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 + N+G S ARN G+E A GKY+ FVD DD V M+ L A+E D DVA C Sbjct: 65 RKNSGVSAARNSGMEAAAGKYIGFVDPDDVVERNMFRRLFDAAVEKDADVAVCGYHEFCD 124 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 G + R G R VW + RR+++ +N ++F+ GL Sbjct: 125 RGGMDMPESGWSPSAGFFPEEKAERFRRGTPWSRCAGTVWNKLIRRELLEENGVRFVPGL 184 Query: 188 H-HQDIVWTTEFMFNALRARYTEQSLYKYYL-HNTSVS 223 +D+ + + A R LY Y S S Sbjct: 185 RQGEDLYFCLMLLTVAPRLLILPDRLYHYRRERPGSAS 222 >UniRef50_C0BWY1 Putative uncharacterized protein n=1 Tax=Clostridium hylemonae DSM 15053 RepID=C0BWY1_9CLOT Length = 328 Score = 278 bits (711), Expect = 3e-73, Method: Composition-based stats. Identities = 67/286 (23%), Positives = 123/286 (43%), Gaps = 5/286 (1%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S+I+P+YN C+ S++ QT+ E+I+++DGS D S I Y + +R+ H Sbjct: 2 ISIIVPVYNVEKYLEKCICSILDQTYRDFELILVDDGSRDKSGGICDAYKKKDGRIRVFH 61 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 Q N G S ARN G+ A+G+Y+ ++D+DD V P E L A + +V+ C + Sbjct: 62 QENKGLSGARNAGLARASGEYITYIDSDDSVDPHYLEVLFENAQKYGAEVSVCGYRSVWE 121 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGL-SSRRWTHVVWMGVYRRDVIVKNNIKFIAG 186 + G +S P S + TG + + + + R W +Y + K +++ G Sbjct: 122 DRG--RKSRPEKVKTSVRICTGREAVYKIVAENDRKMITAWGKLYHARL--KPLLEYPQG 177 Query: 187 LHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRLL 246 H+D T ++ A + + Q LY Y S+ + + + Sbjct: 178 RTHEDEFVTYRVLYIADKVVISPQELYGYLQRGDSIMNSSYSERRLDKIRALEEAVDYFR 237 Query: 247 EKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIA 292 ++ ++ I + + V RK+ R+ +A Sbjct: 238 KEGDKELERYAEKRYLLQLSIAWYRVSVFLPERKDLTARLREEWLA 283 >UniRef50_B1QTB6 Glycosyl transferase, group 2 family protein n=4 Tax=Bacteria RepID=B1QTB6_CLOBU Length = 326 Score = 278 bits (711), Expect = 3e-73, Method: Composition-based stats. Identities = 77/333 (23%), Positives = 145/333 (43%), Gaps = 8/333 (2%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M + +SV+IP+YN C+ES+I QT+T L+II+++DGSTDNS I Y + Sbjct: 1 MINGELISVVIPVYNVEKYLPKCIESIINQTYTNLQIILVDDGSTDNSGSICDEYKKKDT 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 + ++H+ N G S ARN GI+ + GKY+ FVD+DD + MYE +M + + +++ N Sbjct: 61 RIMVIHKKNGGLSDARNVGIKYSKGKYIGFVDSDDYINKKMYEIMMENMVNNKANISIVN 120 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNI 181 + F + E + + + V++ + + + + W +Y +++ N+I Sbjct: 121 RYYVFEDGQEFLRYSINESI---KVMSNLEAIEEMNNFSTFDMAAWDKLYEKELF--NDI 175 Query: 182 KFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIK 241 +F G +D R Y LY YY N S+SR N ++ R + Sbjct: 176 EFPVGKLSEDFYIMYLLFEKCKRIVYDSSPLYYYYQRNNSISRN---KKINFDFVRAAKE 232 Query: 242 ITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYK 301 +E + + Y + + ++ + +R I + + Sbjct: 233 QMTYIENRYPKLKPCMRVAYISANLTVYNSYLKQRMIPEKCVVKEIKREIGKNISYLKQN 292 Query: 302 RLITNVRSVKVGYQALLWSFRLWQWRDKTRSHH 334 I+ ++ +++ L S ++ Sbjct: 293 TTISKIKKIQIRLFLLNMSLYNLSFQLYKIKKR 325 >UniRef50_A8SZ40 Putative uncharacterized protein n=1 Tax=Coprococcus eutactus ATCC 27759 RepID=A8SZ40_9FIRM Length = 321 Score = 276 bits (708), Expect = 5e-73, Method: Composition-based stats. Identities = 71/313 (22%), Positives = 137/313 (43%), Gaps = 8/313 (2%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 + K+S+I+P+YN ++ C+ SLI QT+ +EI++++DGS ++ YA+ + Sbjct: 13 ESVKVSIIVPVYNVENELDRCVASLINQTYQNIEILLVDDGSPGACPKMCDEYAKKDKRI 72 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 ++H+ N G S ARN G++ A+GKYV FVD+DD + E +T ++DD+D+ Sbjct: 73 IVIHKPNGGLSSARNTGLKRASGKYVLFVDSDDYIETDSCEKFLTF-VKDDVDLVVGAGR 131 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 G+ + +R + D+ + W + +Y R+ ++KN++ F Sbjct: 132 MI---KGKDEILLRHSNIREREKYSARDFTIKSIKHSEWNSYACLSLYDREFLIKNDLFF 188 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKN----LNYQRHY 239 G+ +D+ E A Y + + Y Y + + S+ NK + Y Sbjct: 189 REGIIFEDMQILPEIYLKAGNIVYMDYAFYNYVVRDGSIMTSGNVENKRRTCVEILEEWY 248 Query: 240 IKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGM 299 I ++ ++++ R Y ++ + + + I E M Sbjct: 249 ITMSNVMDRRYRRYLYGALVNFYLWSCRSIGFCKWNIKNVNFVFSIRYAVGIKEKLKVIM 308 Query: 300 YKRLITNVRSVKV 312 ++ L KV Sbjct: 309 FQFLPKLYLKFKV 321 >UniRef50_C2DBB8 Glycosyltransferase n=19 Tax=Bacteria RepID=C2DBB8_ENTFA Length = 322 Score = 276 bits (708), Expect = 5e-73, Method: Composition-based stats. Identities = 73/334 (21%), Positives = 139/334 (41%), Gaps = 14/334 (4%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 ++S+I+P+Y R C++S++ QT+T E+I+++DGS DNS +I YA VR Sbjct: 1 MCEISIIVPVYKVEPYLRKCLDSILAQTFTNFEVILVDDGSPDNSGKICDEYAIKDSRVR 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++H+ N G S ARN GI++A GKY+ FVD+DD + MYE L D D++ Sbjct: 61 VIHKENGGLSSARNAGIDIAQGKYLGFVDSDDYIAEDMYEVLYENLKFYDADISSVEIIP 120 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 + E T + + +L + ++ L + W +YR+++ N +++ Sbjct: 121 FRDDKYE-----NTSKEKKEILLNKKEAIKSVLEGTDFYAYAWNKLYRKELFYNN--RYL 173 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 G +D + + + + + Y Y N S+ + K L+ + + Sbjct: 174 EGETFEDAYIIIDILLETDKIIVSNEEKYFYLQRNDSIM-GQKFSKKTLDVIKAWQYNQ- 231 Query: 245 LLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLI 304 +K+ ++ D Y + L I + +I + + + Sbjct: 232 --KKILNHFPDLKDSYNRRLCWAYFYVLDKLILSSNYKQIPETKEIIEFLKKNRNFILSY 289 Query: 305 TNVRSVKVGYQALLWSFRLWQWRDKTRSHHRITR 338 + L +W K +R+ + Sbjct: 290 KGFTKSRKIAIVALSFGSVWY---KKLVEYRLKK 320 >UniRef50_Q67AY0 Cps8J n=6 Tax=Streptococcus RepID=Q67AY0_STRAG Length = 325 Score = 275 bits (704), Expect = 2e-72, Method: Composition-based stats. Identities = 78/333 (23%), Positives = 145/333 (43%), Gaps = 17/333 (5%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 N +S+I+P+YN + C+ S+I+Q++ E+II+NDGSTDNS+ + R Sbjct: 2 QNLVSIILPVYNVENYIENCLNSIISQSYFHFEVIIVNDGSTDNSMSYCEKIVCEDLRFR 61 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++++ N G S ARN G+E A G Y+ FVD+DD + P + LM + D+A C+ Sbjct: 62 IINKDNGGLSDARNVGLEAAKGDYIIFVDSDDFISPNLVSHLMACLENYEADLAICDPVH 121 Query: 125 CF-RETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 + ET + + + +LT + + + + W VY++ + NI+F Sbjct: 122 FYLGGNQETHEKNIFKKHSNIHLLTNEEAICELFYQKTFLVSAWGKVYKKSIFD--NIRF 179 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKIT 243 G +D + + Y+ LY Y S++ + I Sbjct: 180 PKGKLFEDSAVMHLLFEKSNKIVYSNAKLYAYVHRKDSITTSQFSERD--------LDIL 231 Query: 244 RLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAE--IFTSGMYK 301 + +++ + Y IY I+Y+ + P+I ++AE + +K Sbjct: 232 DITDQILKRYKSFPKIYQA---AISYKISACFRVILNAPNIEKYTPIVAECKKYVYYHWK 288 Query: 302 RLITNVRSVKVGYQALLWSFRLWQWRDKTRSHH 334 ++ N + ++ + L S L++ K H Sbjct: 289 DILFN-KKTRIKTKLALISIVLFKPFIKFIYTH 320 >UniRef50_A8FHL3 Glycosyltransferase n=2 Tax=Bacillus pumilus RepID=A8FHL3_BACP2 Length = 340 Score = 275 bits (703), Expect = 2e-72, Method: Composition-based stats. Identities = 69/320 (21%), Positives = 132/320 (41%), Gaps = 10/320 (3%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 +S+++ +YN C++S++ Q+++ +E+I++NDGSTD S E + +A R Sbjct: 1 MPAISLLVAVYNTSQYLEQCLQSIVDQSFSDIEVILVNDGSTDRSGEYLEAFAAQDQRFR 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++HQ N G RNRGIE A G Y+AF+D+DD + P E L A D+ Sbjct: 61 VIHQTNQGLGAVRNRGIEEAKGTYIAFIDSDDVLAPHYCEALYEKAKMTGADLVISEYWI 120 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 F ++ T + + L L T W +YRR I +++I+F Sbjct: 121 QFEQSKRTIPTTLLAERSAE-----KTSLIEALLHGDITGFSWNKLYRRAFIEEHDIRFP 175 Query: 185 ---AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIK 241 + +D T + + + LY Y +H +S+ + +++ ++ + Sbjct: 176 LRGELENIEDQYVTLRCFSLSRGIAFVHEPLYYYRVHLSSIVQRYQKQ--YFHHGLTFYH 233 Query: 242 ITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYK 301 RL + + F T + + + + + E+ T ++ Sbjct: 234 AQRLFLTEYDDISLYEKALDFFIVNHTLHCMLNEWKSQNALSLREKLDHMREMVTHEAFQ 293 Query: 302 RLITNVRSVKVGYQALLWSF 321 + V K+ + + F Sbjct: 294 DAVKQVEPSKLTSRKRMILF 313 >UniRef50_C2M8C3 Glycosyl transferase, family 2 n=1 Tax=Capnocytophaga gingivalis ATCC 33624 RepID=C2M8C3_CAPGI Length = 321 Score = 274 bits (701), Expect = 4e-72, Method: Composition-based stats. Identities = 74/323 (22%), Positives = 143/323 (44%), Gaps = 9/323 (2%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 + +S+I+P+YN C++SL+ QT+ +EI+++NDGS D + EI Y E YP + Sbjct: 2 NNPLISIIVPVYNGETYLARCLDSLLDQTYAPIEILVVNDGSQDRTAEILSQY-EQYPQI 60 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 ++ +Q N G S ARN G++ G +V FVD+DD V PT E L ++++ D++ C Sbjct: 61 KVFYQENKGLSGARNLGLKHFEGDFVCFVDSDDYVTPTYCEELYKALIDNNADISVCQYT 120 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 + E IP ++ + + + + L + L + W +++++++ N F Sbjct: 121 LIDTKGKE----IPQEKDKKIQLFSHIEALALLLEDGQLKSYAWGKLFKKELLE--NFSF 174 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKIT 243 G +D A + LY Y + S+S + G K ++ +++ Sbjct: 175 PMGRIFEDYASMFFLFDKATKVVMINNCLYNYIQYPNSLSN-KKGGEKFFHFFLGVMEMY 233 Query: 244 RLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRK-EPDILTRQRMIAEIFTSGMYKR 302 + + + I + +Q + R ++ E I + F S YK+ Sbjct: 234 DFACQYKEKFPNWNRIVEKTAKQSLFAIKRFYLFSKEVEKSIEQAMKERLTSFYSNEYKK 293 Query: 303 LITNVRSVKVGYQALLWSFRLWQ 325 + R + L + + + Sbjct: 294 IPFKYRWKIFSFLKLPFLYNTYI 316 >UniRef50_D0ANJ0 Putative uncharacterized protein n=1 Tax=Enterococcus faecium C68 RepID=D0ANJ0_ENTFC Length = 351 Score = 273 bits (699), Expect = 6e-72, Method: Composition-based stats. Identities = 78/351 (22%), Positives = 139/351 (39%), Gaps = 22/351 (6%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K+SVI+P+YN R C++SL++QT +EI++++DGS D S EIA Y + Y +V+ + Sbjct: 4 KVSVIVPIYNVEKYLRKCVDSLLSQTLKEIEIVLVDDGSPDASGEIADEYQKKYSNVKTI 63 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 H+ N G ARN GIE ATG+YVAF+D+DD V MYE L +A +++ D+ Sbjct: 64 HRENGGLGPARNTGIENATGEYVAFLDSDDWVESDMYEKLYLVASKENADIVVSGHCDFA 123 Query: 127 RETGETWQSIPTDRLRSTGVLTGPDWLRMGLSS-------RRWTHVVWMGVYRRDVIVKN 179 + + P + + + + V M +Y ++++ K Sbjct: 124 NDKAMVKKQHPLSGNVYDTPNSIKNVRKNLFGHSPKDQEVEAFPMSVCMSLYNKNLLDKY 183 Query: 180 NIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHY 239 ++F + +D ++ A + +TE + Y Y NK ++ Sbjct: 184 RLRF-EKILSEDTIFNLNAYDVANKIAFTEYTDYCYRKEEQESITKSLSPNKQTEFKEFL 242 Query: 240 IKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGM 299 I + + K D I Y L V I +++ I + Sbjct: 243 ITLKAIASK-----EDLECQIRAKRMAIDYCRLYVGLVDDANLGIKEKRKYIKKFIEDEG 297 Query: 300 YKRLITNV---------RSVKVGYQALLWSFRLWQWRDKTRSHHRITRSAF 341 + + + ++ L R + + + F Sbjct: 298 LIQCWKGYPTNTLPFQQKFFHKMLECKCYTILLLLSRIRKIIKNMTRKKLF 348 >UniRef50_O86893 Putative glycosyl transferase n=1 Tax=Streptococcus pneumoniae RepID=O86893_STRPN Length = 323 Score = 273 bits (699), Expect = 6e-72, Method: Composition-based stats. Identities = 81/329 (24%), Positives = 148/329 (44%), Gaps = 7/329 (2%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +SVI+P+YN D R ++SL+ QT+ E+I++NDGSTDNS EI Y + Y ++ + H Sbjct: 2 ISVIVPVYNVADYLRFALDSLLEQTYKDFEVILVNDGSTDNSGEICDEYGKLYDNIHVFH 61 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 + N G S ARN G+E + G+++ F+D+DD P E L+T+ + D+D+ + Sbjct: 62 KKNGGLSDARNFGLEKSRGEFITFLDSDDYFEPYALELLITIQKKYDVDIVSTKGGITYS 121 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 + + + D L +LT ++L + T W +Y+RD+ F G Sbjct: 122 HDIYSKKLMAEDYLT-VKILTNKEFLAAVYYNDEMTVSAWGKLYKRDLFKTI---FPKGK 177 Query: 188 HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRLLE 247 ++D+ E + N +T+ +Y YY S+ ++ ++ +++ Sbjct: 178 IYEDLYVVAERLLNIKTVAHTDLPIYHYYQRQGSIVNS-TFSDRQYDFFDAIDHNEAIIK 236 Query: 248 KLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLITNV 307 K + + + I L + DI R+I + + + I Sbjct: 237 KFYCGDKELLAALNA-KRVIGSFILSNSAFYNSKNDITKIIRIIKPYYWEVIKNKKIPMK 295 Query: 308 RSVKVGYQALLWSFRLWQWRDKTRSHHRI 336 R V+ L ++ ++ +DK RI Sbjct: 296 RKVQCVLFLLSPNYY-YKIKDKMLQRGRI 323 >UniRef50_C6ID60 Glycosyltransferase n=5 Tax=Bacteroides RepID=C6ID60_9BACE Length = 368 Score = 273 bits (699), Expect = 7e-72, Method: Composition-based stats. Identities = 75/312 (24%), Positives = 130/312 (41%), Gaps = 4/312 (1%) Query: 15 YNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLHQANAGAS 74 YN C+ SL QT T +EII++NDGSTDNS + + YA + Q N G S Sbjct: 53 YNTEPFLAECLHSLEKQTLTDIEIILVNDGSTDNSGRLLREYAGKDARFVYVEQENQGLS 112 Query: 75 VARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFRETG-ETW 133 ARN G+E A+G Y+AF+D+DD + + L +A + D+ N + ++W Sbjct: 113 AARNTGMEHASGHYLAFLDSDDWLAENALQVLCAIAAKTRTDIVSGNTLAVHADGQVQSW 172 Query: 134 QSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGLHHQDIV 193 + + + V++G + + R + +V+ +YRRD I +N +F GL H+D + Sbjct: 173 ERRGRELFATGTVVSGSTYFSRVMDCRCYVPMVYNYLYRRDFIEQNGFRFEPGLVHEDEL 232 Query: 194 WTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRLLEKLNRNY 253 WT + + A + + Y Y S+ G + + Q K+ K Sbjct: 233 WTPQVLTTAQKITVADIDFYYYRQREGSIMTATAAGRRIASIQLIIEKLLEYSRKHL-FE 291 Query: 254 ADKIMIYPEFHQQITYEALRVC--HAVRKEPDILTRQRMIAEIFTSGMYKRLITNVRSVK 311 + ++ C H + R + + + + S + Sbjct: 292 KKYREAKEALYVRLLQIYSTACTLHPDGTYTTLYDRAGEMLRVCEELRRQESLGRWYSEE 351 Query: 312 VGYQALLWSFRL 323 + + L+ RL Sbjct: 352 ILNRMKLYYDRL 363 >UniRef50_B2V0U1 Glycosyl transferase, group 2 family protein n=1 Tax=Clostridium botulinum E3 str. Alaska E43 RepID=B2V0U1_CLOBA Length = 324 Score = 273 bits (699), Expect = 7e-72, Method: Composition-based stats. Identities = 67/331 (20%), Positives = 136/331 (41%), Gaps = 9/331 (2%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 ++ +I+P+Y C++S++ Q+++ LEII++NDGS DN EI + Y N ++ Sbjct: 3 NERIGIIVPIYKVEKYLCRCLDSIVNQSYSNLEIILVNDGSPDNCHEICENYKRNDNRIK 62 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++HQ N G S ARN G++ ++ F+D+DD + M E L + E D+ C Sbjct: 63 VIHQKNMGLSAARNTGLKELQSPFLVFIDSDDYIELDMIEELYKLMKETKSDIVSCGHKQ 122 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 F T + + + L L + V W +Y+R++ +IK+ Sbjct: 123 IFDSGNIEELKGNT----QITIYSKKEALSKFLFTNVIDVVSWNKLYKRELFT--DIKYP 176 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 +G ++D T + + A + YT + LY Y +TS+ K + + + + Sbjct: 177 SGKLYEDHFTTYKIIDRAEKITYTSKVLYNYCKRSTSIGGNEYNP-KTMQLKEALDEECK 235 Query: 245 LLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLI 304 + + D I + + Y + + ++ + R ++ + L Sbjct: 236 FIIGEYPDLKDDINLAYCHWMFVIYNKMLLVDSIDNNFEKFLRNKIQESVSKLFFNNNLS 295 Query: 305 TNVRSVKVGYQALLWSFRLWQWRDKTRSHHR 335 + + + ++ +R Sbjct: 296 KTRKLQYLVFLGNRRIYK--TLYKAFLKKYR 324 >UniRef50_B2UYP7 Minor teichoic acid biosynthesis protein GgaB n=4 Tax=Clostridium RepID=B2UYP7_CLOBA Length = 335 Score = 273 bits (698), Expect = 8e-72, Method: Composition-based stats. Identities = 79/329 (24%), Positives = 147/329 (44%), Gaps = 11/329 (3%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 NS +S+I+P+Y A D + S++ QT E+I+++DGS DNS +I A N Sbjct: 13 KNSEVLISIIMPIYKAEDYLNNSITSILNQTLENFELILVDDGSPDNSGKICDELALNDN 72 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 ++++H+ N GAS ARN G+++A G+++AFVD+DD + P M+ETL +A E + D++QCN Sbjct: 73 RIKVIHKENGGASTARNAGLDIAQGEFIAFVDSDDWIEPNMFETLFNLAKEHNADISQCN 132 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVV-WMGVYRRDVIVKNN 180 E + I D + L + + VV W +Y++ + N Sbjct: 133 YIKVENEDEK----IINDDKEIIISFNNIESLNNLYNEMYVSTVVLWNKIYKKSLF--NK 186 Query: 181 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYI 240 I+F +D + +F + + YT + LY Y S+ + K L + Sbjct: 187 IRFPNMRIFEDEAIMYKLLFESKKLVYTNKKLYYYRNTPDSIMNA-KFDKKRLCMLDVFD 245 Query: 241 KITRLLEKLNRN--YADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRM-IAEIFTS 297 + + KLN YA + Y + Y + + +++ + + +++ +++ Sbjct: 246 EKVEFMSKLNTEHLYAKTLKWYMWSIIDLYYGCINNISDDYETINLIKSKAIDVSKKYSN 305 Query: 298 GMYKRLITNVRSVKVGYQALLWSFRLWQW 326 + L+ + Sbjct: 306 SPSHEFKWIILFSLFNKTPKLYKTVMNIL 334 >UniRef50_UPI0001B5362C putative glycosyl transferase n=1 Tax=Streptomyces sp. C RepID=UPI0001B5362C Length = 1187 Score = 272 bits (697), Expect = 1e-71, Method: Composition-based stats. Identities = 69/294 (23%), Positives = 122/294 (41%), Gaps = 12/294 (4%) Query: 6 NKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRL 65 +LS+++P+YN C+ES+ QT+ E++I++DGSTD S IAK +AE R+ Sbjct: 3 PRLSIVVPVYNVELYLDECLESIAAQTFADFEVVIVDDGSTDTSAVIAKAFAERDKRFRV 62 Query: 66 LHQANAGASVARNRGIEVAT--GKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 + Q NAG ARN G AT +Y+AFVD+DD + Y+ ++ E D A N Sbjct: 63 VLQENAGLGAARNVGFRHATPGSEYIAFVDSDDTMPTYAYQRMIDSLDETGSDFAAGNVT 122 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 FR G + T ++ R W VYRR ++ ++ Sbjct: 123 R-FRSVGMQQSWGHRVAFANDRPKTHISKFPALVTDR----TAWNKVYRRSFWEEHGFQY 177 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNT---SVSRLHRQGNKNLNYQRHYI 240 G+ ++D + + A +Y + + T S+++ ++ Sbjct: 178 PEGILYEDAPVSIPAHYFAESVDVISDVVYHWRVRETGERSITQRSTDPVSVIDRVTSVR 237 Query: 241 KITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEI 294 + + L L ++ ++ + E L + + E R + E+ Sbjct: 238 LVRQAL--LAKSGEKYARYLRDYDHNVLSEELPLIYKYVGEGGPDFRAAFVKEV 289 >UniRef50_Q5LBM4 Putative glycosyltransferase protein n=4 Tax=Bacteroides RepID=Q5LBM4_BACFN Length = 342 Score = 272 bits (697), Expect = 1e-71, Method: Composition-based stats. Identities = 81/346 (23%), Positives = 155/346 (44%), Gaps = 17/346 (4%) Query: 6 NKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRL 65 K+SVI+P+YN C+++L+ QT + +EII+I+D S DN +I YA YP++++ Sbjct: 3 PKVSVIVPIYNVEKYLDQCVQALLAQTLSDIEIILIDDESPDNCPKICDDYAAQYPNIKV 62 Query: 66 LHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWC 125 +H+ NAG +A N G++VATG+YVAF D+DD V MY T+ +A + D Sbjct: 63 IHKKNAGLGMACNSGLDVATGEYVAFCDSDDYVDSDMYMTMYNVAQKYTCDAVFTGLKRI 122 Query: 126 FRETGETWQSIPTDRLRSTGVLTGPDWLRMGL--------SSRRWTHVVWMGVYRRDVIV 177 T + L L R + +YRR++I Sbjct: 123 TMAGIPTGTVTHQKEFKLYKNKNEIHTLLKDLIASDPYAREERAIQVSAKVVLYRRNLIE 182 Query: 178 KNNIKFIAGLH--HQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNY 235 K +++F++ +D+++ + + N+ Q+ Y Y + S+S ++ +L + Sbjct: 183 KKHLRFVSERILPSEDLIFNVDVLANSNIVCVLPQTFYNYRTNPISISHTIKKDKFSL-F 241 Query: 236 QRHYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIF 295 ++ YI+IT +L ++ I F + Y +C+ + + ++++ + I Sbjct: 242 KQLYIEITDRCHRLGVEDNVQLRIQRMF---LGYTRNYICNILNSSITNIEKKQITSSIC 298 Query: 296 TSGMYKRLITNVRSVKVGYQALLWSFRLWQWRDKTRSHHRITRSAF 341 G++K + ++ + L + R S + Sbjct: 299 KDGIWKPIW---KTYPLSVMPLPHRIFTFAMRHNFYSLLLVLAKIK 341 >UniRef50_A5UNM1 Glycosyltransferase/CDP-glycerol:poly(Glycerophosphate) glycerophosphotransferase, GT2 family n=3 Tax=Methanobrevibacter smithii RepID=A5UNM1_METS3 Length = 1193 Score = 272 bits (696), Expect = 1e-71, Method: Composition-based stats. Identities = 81/317 (25%), Positives = 141/317 (44%), Gaps = 2/317 (0%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 S K+SVI+P++N D T ++S++ Q+ +EII INDGSTD+S++I + YA+ + Sbjct: 2 SKVKVSVIVPVFNVRDYLSTSLDSILNQSLEDIEIICINDGSTDDSLKILENYAKKDNRI 61 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 +++ + N G ARN G++ A G++++FVDADD + M E L ++ +LD+ C Sbjct: 62 KIISKENEGQGTARNVGLDNACGEFISFVDADDFIKKDMLEKLYNKSVNGNLDLVMCKVS 121 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDW--LRMGLSSRRWTHVVWMGVYRRDVIVKNNI 181 ET ++ L+ + L + + + +YRR I KNNI Sbjct: 122 SFDNETHIIDDNLWYYSLKCFSGFKKEVFNNLDTKEFTSLISVTPYNKLYRRSFIEKNNI 181 Query: 182 KFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIK 241 +F +D V+ A R +++LY Y + + Q ++ + Sbjct: 182 RFPEKYIFEDEVFFYNVYLKAKRISLIDENLYYYRTNRKGSTVSKGQDKDYIDVIHIFRL 241 Query: 242 ITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYK 301 I LL + N K +Y F I + + RK L ++ + S Sbjct: 242 IRDLLVETNYINVYKKQVYNRFIHLILWRFSQTAPKYRKNFFNLMKKEFDEILDDSSKDF 301 Query: 302 RLITNVRSVKVGYQALL 318 L + K+ + L Sbjct: 302 SLNVDELDFKIKSRTLK 318 Score = 159 bits (402), Expect = 2e-37, Method: Composition-based stats. Identities = 69/332 (20%), Positives = 144/332 (43%), Gaps = 21/332 (6%) Query: 9 SVIIPLYNAGDDFRTCMESLITQTW---TALEIIIINDGSTDNSVEIAKYYAENYP-HVR 64 SVI+ YN +ESL Q + +E+II++DGS D+S++IAK Y +NYP +++ Sbjct: 338 SVIMASYNNDKYLEDAVESLTVQNFGFEHNVELIIVDDGSNDDSLKIAKKYQKNYPYNIQ 397 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDL-DVAQCNAD 123 ++ + N G + ARN G+E ATG Y+ F+D+DD++ P ++ + +++ DV Sbjct: 398 VISKENGGQASARNLGLEYATGDYINFLDSDDKLSPNTFKAVYDFLVKNPAVDVISIPIT 457 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 + + G + + +L PD+ ++ +R+ I N++F Sbjct: 458 FFDNQKGAHILNYKYKNEKVVNLLDEPDYPQL---------SASSAFIKREAIG--NLRF 506 Query: 184 IAGLHH-QDIVWTTEFMFNALRARYTEQSLYKYYLH-NTSVSRLHRQGNKNLNYQRHYIK 241 L + +D ++ + + + + + + Y Y + S + + Q K R Sbjct: 507 DTSLVNSEDALFVNKILIDNPKLGLVKNANYLYRKRFDESSTIDNSQKKKGFFTDRLKYY 566 Query: 242 ITRLLEKLNRNYADKIMI--YPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGM 299 L++ + Y + Y + + + + +I + +I S + Sbjct: 567 FKELIDYSVKKYDKALEFIQYTLIYDLQWMVKVEEIDNILNKDEINEFWEVFLDIL-SYI 625 Query: 300 YKRLITNVRSVKVGYQALLWSFRLWQWRDKTR 331 + +I N +++ + L + + KT Sbjct: 626 DEDIIKNYKTLDNNVREFLLTIKQSDLTAKTI 657 >UniRef50_C3R5C9 Glycosyl transferase WciV n=3 Tax=Bacteroides RepID=C3R5C9_9BACE Length = 344 Score = 271 bits (695), Expect = 2e-71, Method: Composition-based stats. Identities = 77/337 (22%), Positives = 152/337 (45%), Gaps = 23/337 (6%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S+I+P+YN C++SL+ QT+ +EII +NDGSTDNS +I K YAE + ++H Sbjct: 2 ISIIVPVYNTAPYLPQCLDSLVNQTYRDIEIICVNDGSTDNSPDILKAYAERDSRILVIH 61 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 Q N G S ARN+G+E A G++V FVD+DD + +TL++ + DV + F Sbjct: 62 QENLGLSGARNKGLESARGEWVMFVDSDDWIGTDCCKTLLSH-TDKQTDVCLFSYIREFA 120 Query: 128 ETGETWQSIPTDRLRSTG---------VLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVK 178 ++ G ++ D W +Y+ +I + Sbjct: 121 NQSFPQYLFSDKKMVYQGSSIHWLYARLIAPNDQELRLPGKIDSLSTAWGKLYKTSLIKE 180 Query: 179 NNIKFIA--GLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHN-TSVSRLHRQGNKNLNY 235 + ++F+ + +D+++ + +A Y Y Y +N TS+++L++ + Sbjct: 181 HGLRFVPVRQIGTEDLLFNVYYFTWVKKAIYLPDLFYHYRKNNITSLTKLYKP-----HL 235 Query: 236 QRHYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIF 295 + + RL+E+ + + +++ + + + + ++ I Sbjct: 236 TEQWNLLFRLIEEWITPLHREDLEKALTYRRALSLIGQGLNITFSPVSFMKQHGALSAIL 295 Query: 296 TSGMYKRLITNVRSVKVGYQALLWSFRLWQWRDKTRS 332 +S Y+ ++R + + Y + W RL+ + RS Sbjct: 296 SSHEYRT---SIRQLPMNYFPIHW--RLFFEMARFRS 327 >UniRef50_A6NTT8 Putative uncharacterized protein n=1 Tax=Bacteroides capillosus ATCC 29799 RepID=A6NTT8_9BACE Length = 331 Score = 271 bits (695), Expect = 2e-71, Method: Composition-based stats. Identities = 77/333 (23%), Positives = 134/333 (40%), Gaps = 12/333 (3%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 K+SVIIP+Y ++S+++QT LE+I+++DGS D EI YA V Sbjct: 2 NIKISVIIPIYKVEAFLDKSIQSVLSQTHLNLELILVDDGSPDRCGEICDSYARQDKRVT 61 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPT-MYETLMTMALEDDLDVAQCNAD 123 ++HQ N G S ARN G++ ATG Y+ F+DADD E L+ DV Sbjct: 62 VIHQTNRGLSGARNAGLKAATGDYIVFLDADDYWDDNTALERLVECL---PADVVSFGMK 118 Query: 124 WCFRETGETWQS-IPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIK 182 + E G Q+ T + + +R L + + + +R + + + Sbjct: 119 HFWPENGRVVQNWSDTQNYQDIRGRAKYEQVRYMLENGCYIASACNKLIKRSLFSEQKLT 178 Query: 183 FIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKI 242 F G+ +DI W + A + Y+Y + S+S+ R N + + + Sbjct: 179 FREGVTSEDIDWCARLLIAAGTFSVSPCCFYRYRQRDGSISKSIRIKNIY-DLEENIKSC 237 Query: 243 TRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDI----LTRQRMIAEIFTSG 298 ++L N + + Q L ++E L+ R A + Sbjct: 238 VSYSKELEDNKELLELYWNYVSYQYGTLLLNAALVCKEEQKYAVSVLSNMREYAWLLKYH 297 Query: 299 MYKRLITNVRSVKVGYQALLWS-FRLW-QWRDK 329 M +++ R +V +L R++ R + Sbjct: 298 MNRKVNVLWRIYQVAGYGVLCQACRIYALLRHR 330 >UniRef50_C7MN19 Glycosyl transferase n=1 Tax=Cryptobacterium curtum DSM 15641 RepID=C7MN19_CRYCD Length = 325 Score = 271 bits (694), Expect = 2e-71, Method: Composition-based stats. Identities = 68/328 (20%), Positives = 135/328 (41%), Gaps = 23/328 (7%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 +SVI+P+YN CM+S++ QT+ LEII+++DGS+D S + +A+ + Sbjct: 3 NPLISVIVPVYNVEKYLDRCMQSIVDQTYRNLEIILVDDGSSDGSGALCDTWAQQDNRII 62 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 +LH+ N G S ARN G++ + G ++ FVD+DD + MYE+L ++ + D+A C Sbjct: 63 VLHKKNGGLSDARNAGVQTSHGDWIGFVDSDDYIDKQMYESLYSLIEYTNSDMALCAVAD 122 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 + + +P + VLT + L ++ V +Y ++ + + Sbjct: 123 VYAD----HTDVPAMSAENFTVLTTKEVLSDIFLNKTLMVGVPPRLYPAWLVRE--VPSP 176 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 AG H+D + Y Y + +++ + Sbjct: 177 AGKTHEDAFIVVDLFSRVEHIVIDRVPRYFYCHNEGTITSTPS-----------LRAASD 225 Query: 245 LLEKLNRNYADKIMIYPEFHQQITYEALRVCH-----AVRKEPDILTRQRMIAEIFTSGM 299 +E N +YP+ Q + + + ++ +Q++I + + Sbjct: 226 NIEAWEHNRVLVEKLYPDLMQDVLFRCYWAHFDVLDGMMLSNAEMHEQQKVINYLLSHKA 285 Query: 300 YKRLITNVR-SVKVGYQALLWSFRLWQW 326 +V K+ + L S +L++ Sbjct: 286 DILAHPHVSAKRKLALRLLSISKKLYRL 313 >UniRef50_C9LAR4 Glycosyl transferase, family 2/glycosyl transferase family 8 n=1 Tax=Blautia hansenii DSM 20583 RepID=C9LAR4_RUMHA Length = 329 Score = 271 bits (694), Expect = 2e-71, Method: Composition-based stats. Identities = 74/299 (24%), Positives = 140/299 (46%), Gaps = 7/299 (2%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M + +S+IIP+YN D C++S+ QT+ LEII++NDGSTD S E+A + + P Sbjct: 1 MLVSELISIIIPIYNVEDYLERCIKSIQQQTYKNLEIILVNDGSTDGSTEVALEFCQMDP 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 ++L+++ N G S ARN GI+V GKY+ FVD+DD V E L + ++ +A C+ Sbjct: 61 RIKLVNKKNGGLSDARNVGIDVCKGKYIVFVDSDDYVEARYVELLYKSLVSNNAQMAVCS 120 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRW-THVVWMGVYRRDVIVKNN 180 + + + +++ ++TG + L + + + V W +Y +++ + Sbjct: 121 FNIVNDNGSVIKRKLVSEK--KNEIITGREMLNRVMRADGYKYVVAWNKIYASNLLE--D 176 Query: 181 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYI 240 ++F G ++D + E +N + + + LY Y S+ K + + Sbjct: 177 LRFEKGKLYEDEFFNFEVSYNLTKVSFVAEPLYNYVQRAGSIKLSELTNEKIEMQKEMHE 236 Query: 241 KITRLLE--KLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTS 297 K + K Y I +Y + I EA + + + +++ + + T Sbjct: 237 KRISFYKDNKDKELYERAIQMYCNWMVNIVIEARGIINRDLLKSFKRSKKVFLLSMCTR 295 >UniRef50_B0P2L7 Putative uncharacterized protein n=1 Tax=Clostridium sp. SS2/1 RepID=B0P2L7_9CLOT Length = 319 Score = 271 bits (694), Expect = 2e-71, Method: Composition-based stats. Identities = 71/317 (22%), Positives = 136/317 (42%), Gaps = 17/317 (5%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 +S+IIP+Y C+E+++ QT+ LEII+I+D S D EI +A+ ++ Sbjct: 2 MELVSMIIPVYGVEAYLGECLETVLNQTYENLEIILIDDESPDYCPEICDQHAQKDARIK 61 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++HQ N GA+ ARN G+++ATG+Y+ F+D+DD++ E L+ + DVA C+ Sbjct: 62 VIHQKNGGAANARNHGLDMATGEYICFIDSDDKIENNYVEKLLNAIKKSQSDVAVCSFKQ 121 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 ++ + + D++R LS ++W ++ + + ++F Sbjct: 122 WYKNKTQ------DSIGFKNKEYSSKDYIRKFLSDWTC-GLIWNKMFDKKTLE--GVRFE 172 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 G D +T + + A + ++SLY Y + + V Q + Sbjct: 173 EGHKIDDEFFTYQAILKAEKVAIFDESLYWYRMRKSGVMLSGT--------QYQAQMLKD 224 Query: 245 LLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLI 304 LE + + Y I YP+ + Y + ++ + + R++ Sbjct: 225 RLEYMTKRYEMVIEKYPDLKAEYLYNLTDNLISYTRQAQGIKELEKDVRAIKKQYWNRIM 284 Query: 305 TNVRSVKVGYQALLWSF 321 + VK+ Y F Sbjct: 285 FGLIPVKIKYSFFRSFF 301 >UniRef50_A5ZCV6 Putative uncharacterized protein n=1 Tax=Bacteroides caccae ATCC 43185 RepID=A5ZCV6_9BACE Length = 321 Score = 271 bits (694), Expect = 2e-71, Method: Composition-based stats. Identities = 73/328 (22%), Positives = 143/328 (43%), Gaps = 12/328 (3%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +SVI+P+YN C++S+I Q++ LEII+++DGS D EI Y+ + ++H Sbjct: 2 ISVIVPIYNVERYINKCIDSIIHQSYKNLEIILVDDGSPDKCGEIIDEYSRIDKRIHVIH 61 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 ++N G S ARN G+++A+G ++ FVD+DD + PTMYE +++ ++ LD+ +C + Sbjct: 62 KSNGGLSSARNAGLDIASGDFIGFVDSDDWIEPTMYEEMLSFMEKEQLDLVECGINLVTN 121 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 + P+ L L + VW +++++ N +F G Sbjct: 122 NISNKYIEKPSIVLSGKDALIRHLDSKNRTVQSLPRTAVWSKLFKKEFWKTN--RFPVGE 179 Query: 188 HHQDIVWTTEFMFNALRARYTEQSLYKYYL-HNTSVSRLHRQGNKNLNYQRHYIKITRLL 246 H+D + T + ++ A + L + + S+ ++ +IT L Sbjct: 180 IHEDYLLTCKALYEAKSVGLLRRGLLNHLTDNPNSIVNTKFNSKDLYKGKQLEYRITYLK 239 Query: 247 EKLNRNYADKIMI-YPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLIT 305 EK + MI Y + + + + R ++IA+I + Sbjct: 240 EKKEPTLTNLAMISYYGYIVSAIWRC--------DQNGMPERDKLIAKIKEDKKIIFSLN 291 Query: 306 NVRSVKVGYQALLWSFRLWQWRDKTRSH 333 K+ + + + +L+ K + Sbjct: 292 LPFKRKIDARLICYYPKLYLILRKFFNK 319 >UniRef50_Q815A0 Glycosyltransferase n=6 Tax=Bacillus cereus RepID=Q815A0_BACCR Length = 319 Score = 271 bits (693), Expect = 4e-71, Method: Composition-based stats. Identities = 71/323 (21%), Positives = 132/323 (40%), Gaps = 11/323 (3%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 +S+I+P+YN C++S+I Q + LEI+++NDGSTD S I YAE + Sbjct: 2 LISLIVPIYNVEGFLPRCLDSIIKQEYQNLEIVLVNDGSTDQSGLICDQYAEKDERFIVF 61 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 H+ N G S ARN G+ A G Y+ FVD DD + P M L + ++ D+A C Sbjct: 62 HKENGGVSSARNVGLNKANGNYIGFVDPDDWIEPNMINKLYELIIQHKADIAMCGYVKED 121 Query: 127 RETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAG 186 I ST +L D L L+ + + ++ D+ K N+ Sbjct: 122 ING----LLISRPVELSTKILNKEDTLNSILNPNDFRGFLCNKLFSVDLFRKKNLYLDED 177 Query: 187 LHH-QDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRL 245 +H +D+++ + + + + Y Y Y +HN + S+ N + + + Sbjct: 178 IHFCEDLLFCCQAILESKQIVYDSTPYYHYIIHNNNASKS----NFSFKKLTALNSLDNI 233 Query: 246 LEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLIT 305 + LN ++ Y ++ + L + + ++ + + + L + Sbjct: 234 INLLNDQKGIELEQYKNYYMHLNISLLM--NGIHEDKIRRQDKERLKKNLYKFKINDLTS 291 Query: 306 NVRSVKVGYQALLWSFRLWQWRD 328 L + + W++ Sbjct: 292 KFVKFSCVLARLNLNLYYFIWKN 314 >UniRef50_UPI0001968EF9 hypothetical protein BACCELL_04174 n=1 Tax=Bacteroides cellulosilyticus DSM 14838 RepID=UPI0001968EF9 Length = 328 Score = 270 bits (692), Expect = 4e-71, Method: Composition-based stats. Identities = 70/324 (21%), Positives = 133/324 (41%), Gaps = 7/324 (2%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 +SVI+P+YN C++S++ QT+ A E+++++DGS D+S I YA+ V ++ Sbjct: 2 LVSVIVPVYNVEKFIHQCIDSILCQTYKAFEVVLVDDGSLDSSGIICDEYAKKDCRVSVI 61 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADD-EVYPTMYETLMTMALED-DLDVAQCNADW 124 H+ N G S ARN G+E A G+YV FVD+DD + E L+ E D N + Sbjct: 62 HKENGGLSDARNVGLECAKGEYVIFVDSDDFWIDNCQLELLIAEIEETPSCDFIGFNCCY 121 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 TGE + + + + + + + + + +R ++ NNI+FI Sbjct: 122 YNSSTGE-FSPWFSYSDEIMKMTERDVLITNLVCTGVFPMSACLKIIKRSFLMNNNIRFI 180 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLH-NTSVSRLHRQGNKNLNYQRHYIKIT 243 G+ +DI W E + + + R Q +Y Y S++R + + N + + T Sbjct: 181 KGIVCEDIPWFIELLHYSHKFRVVNQYMYAYRQGVQGSITRNFSEKSFN-DLFSILKEQT 239 Query: 244 RLLE--KLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYK 301 ++ + ++ + + I + + + + K Sbjct: 240 NFIQLSDFSLFAKSALLSFMAYEFCILLGYIGTFDNNFQRQKRQELLSYKWLLKYTENPK 299 Query: 302 RLITNVRSVKVGYQALLWSFRLWQ 325 + +G R + Sbjct: 300 VKKVSFCKSLLGIYVTEILLRNFL 323 >UniRef50_A6LCC5 Glycosyltransferase family 2 n=2 Tax=Bacteroidales RepID=A6LCC5_PARD8 Length = 312 Score = 270 bits (692), Expect = 4e-71, Method: Composition-based stats. Identities = 67/323 (20%), Positives = 130/323 (40%), Gaps = 17/323 (5%) Query: 6 NKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRL 65 +S+I+ +Y A R C++SL+ QT+ EI++++DGS D S EI YA VR+ Sbjct: 3 PVISIIVAVYKAESCLRRCVDSLLAQTFQDYEILLVDDGSPDRSGEICDEYARKDNRVRV 62 Query: 66 LHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWC 125 H+ N G + AR G+ A G+YV D DD V P M E L A E++ D+ C+ Sbjct: 63 FHKENGGVASARQCGMNNARGEYVIHADPDDWVEPNMLEELYGKAKEENADMIICD---V 119 Query: 126 FRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIA 185 +R+T + + + D + + +R K ++ F Sbjct: 120 WRDTSKKSTYVKQRLFILEHCVMLKDVFQQL------HGSCCNKLVKRACYSKYDVNFPE 173 Query: 186 GLHH-QDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 GL + +D + + + ++ Y ++ Y Y + S R + + R Sbjct: 174 GLSYGEDTYVSVCLLAHPIKMAYVPKAFYHYVQNVNYSSLTRRPVKILVEQCEKLCDLLR 233 Query: 245 LLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLI 304 + IYP + +A + + + P I+T ++ + + +++ Sbjct: 234 ----CKLSENQFKEIYPALRYR---QACVILTSAGENPYIVTFKKDFEGVRKAILHQNFS 286 Query: 305 TNVRSVKVGYQALLWSFRLWQWR 327 +++ + + + Sbjct: 287 FKRKTLFWIAFYISPYIARFFYT 309 >UniRef50_D0R4M4 Glycosyltransferase n=1 Tax=Lactobacillus johnsonii FI9785 RepID=D0R4M4_LACJF Length = 339 Score = 270 bits (692), Expect = 4e-71, Method: Composition-based stats. Identities = 77/338 (22%), Positives = 136/338 (40%), Gaps = 10/338 (2%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 N T ++S+I+P+YN C+ S+I QT+ E+I++NDGS DNS +I + YA+ Y Sbjct: 4 NNGTPEISIIVPVYNVEKWIENCIRSVIAQTFNNWELILVNDGSLDNSQQICEKYAKKYS 63 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 + L+++ N G S ARN GI+ A GKY+AFVD DD + L T + + +A C Sbjct: 64 KIHLINKENGGLSDARNFGIKKAKGKYLAFVDGDDYIDSNYLSKLHTAIISSNASIAVCG 123 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSR-RWTHVVWMGVYRRDVIVKNN 180 + T+ +++G +LR+ V W +Y++D+ N Sbjct: 124 YKEV-DNKKKILSLKRTNDFFVQKLVSGQTFLRILAEKNCTLCVVAWNKLYKKDLFKNN- 181 Query: 181 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYI 240 + G H+D +++ + + LY Y S+ L+ ++ Sbjct: 182 -SYKKGRLHEDEFIIAPLVYDVSKIALVDDELYNYVQRAGSIMSSKMTQKNLLDANDAFV 240 Query: 241 KITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMY 300 + + + ++ Q L V +K R + Sbjct: 241 ERLNFYKS-----NKNMKLFDLTVHQYLLWILSVMKYPKKVLTKKGRMILQNNFKQYSKS 295 Query: 301 KRLITNVRSVKVGYQALLWSFRLWQWRDKTRSHHRITR 338 ++ +N K + + +L W H I R Sbjct: 296 QKNKSNTNIFKFQMKLGERNIKL-AWVIAYIIPHGINR 332 >UniRef50_D1JY96 Putative uncharacterized protein n=1 Tax=Bacteroides sp. 3_1_33FAA RepID=D1JY96_9BACE Length = 346 Score = 270 bits (692), Expect = 4e-71, Method: Composition-based stats. Identities = 79/352 (22%), Positives = 155/352 (44%), Gaps = 27/352 (7%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S+I+P+YNA C++SL+ QT+ +EII +NDGSTDNS++I K YAE + ++H Sbjct: 2 ISIIVPVYNAAPYLPQCLDSLVNQTYRDIEIICVNDGSTDNSLDILKAYAERDSRILVIH 61 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 Q N G S ARN+G++ A G++V FVD+DD + + + + DV + + Sbjct: 62 QENQGLSDARNKGLKNARGEWVMFVDSDDWIDISTCQITLETVHAHQADVVMWTYNREYE 121 Query: 128 ETGETWQSIPTDRLRSTGVLT---------GPDWLRMGLSSRRWTHVVWMGVYRRDVIVK 178 I + + ++ + L + W +W +YRR I Sbjct: 122 GHSLPKYYITSLQTWDANNISQLHRRMVGPINEELNSPDTMDAW-GTIWAKLYRRTYIDH 180 Query: 179 N-NIKFIAGLH---HQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLN 234 ++FI + +D+++ +M + RA Y +LY Y + S + H Sbjct: 181 PVPVRFIDTKNIGSAEDVLFNIGYMAHIGRAVYIPNALYHYRKESCSFTSRH-NPRLPQQ 239 Query: 235 YQRHYIKITRLL--EKLNRNYADKIMIYPEFHQQITYEALRV-CHAVRKEPDILTRQRMI 291 + Y +I LL +++ + + FH +I+ + + + + + I Sbjct: 240 WDNLYQEIKNLLATQQVGNEFQE------AFHNRISLGIIGLGLNELFAQSSFQDIFNRI 293 Query: 292 AEIFTSGMYKRLITNVR--SVKVGYQALLWSFRLWQWRDKTRSHHRITRSAF 341 + + K+ I++++ + + ++ + + F W + I + Sbjct: 294 DTLLQRPICKKAISSLKIQYMPLHWK-IFFCFAKWHFTTGVTLLLMIIKKII 344 >UniRef50_Q9F0B5 Beta(1,4)galactosyltransferase EpsM n=1 Tax=Lactobacillus delbrueckii subsp. bulgaricus RepID=Q9F0B5_LACDE Length = 336 Score = 270 bits (691), Expect = 5e-71, Method: Composition-based stats. Identities = 77/329 (23%), Positives = 137/329 (41%), Gaps = 7/329 (2%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 +S+I+P+YN D C+ SL QT+ +EII+++DGS D S +I A ++ Sbjct: 3 DKLISIIVPVYNVEDYLEDCVHSLELQTYKNIEIILVDDGSPDRSGKICDKLALKDARIK 62 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++H+ N G S ARN G+E + G+Y+AFVD+DD V P + L ++ + + VAQ + Sbjct: 63 VIHKKNGGLSDARNEGLEHSNGEYIAFVDSDDIVEPNFIKVLYSLITQGNYQVAQVGTQF 122 Query: 125 CFRETGETWQSIPTD----RLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN 180 Q I D + + + D L GL R+ W +Y+RD N Sbjct: 123 VNESGDPLNQKIVFDGDMGKKLNGNYVLNRDELIRGLLLRKVHCAAWCNLYKRDFF--NE 180 Query: 181 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYI 240 I+F G+ ++DI+ + + + + LY Y ++ + Y I Sbjct: 181 IRFTKGILNEDILMWVDGVEKVDSLIISNECLYDYRQRYGGITGGINLKLDSDAYNNSKI 240 Query: 241 KITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQ-RMIAEIFTSGM 299 + R+ EK ++ + Y QI + + ++ + S Sbjct: 241 WLKRIEEKYPKSIEEANYNYFMCLWQIIKSTGDARQLPEEVSAVKNNIGSIVDNKYLSKG 300 Query: 300 YKRLITNVRSVKVGYQALLWSFRLWQWRD 328 +K + +L L + + Sbjct: 301 HKMFFRLISVSPHLGIHILHKIWLKKLKS 329 >UniRef50_B2ILQ1 Ss-1,4-galactosyltransferase n=6 Tax=Streptococcus pneumoniae RepID=B2ILQ1_STRPS Length = 333 Score = 270 bits (691), Expect = 5e-71, Method: Composition-based stats. Identities = 66/308 (21%), Positives = 129/308 (41%), Gaps = 7/308 (2%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 + +S+++P+YN + +ES++ QT+ LE+++++DGSTD+S EI + + +R Sbjct: 17 EDLVSIVVPVYNVEKYLKKSIESILNQTYDNLEVLLVDDGSTDSSGEICDSFIKVDSRIR 76 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 + H+ N G S ARN GIE G+YV+F+D DD + L ++ +V+ C+ Sbjct: 77 VFHKENGGLSDARNFGIEHMKGQYVSFIDGDDYISKDYVWKLYHSLKNNNSEVSICSFSL 136 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRW-THVVWMGVYRRDVIVKNNIKF 183 + + L+G L L++ + V W +YR + K +KF Sbjct: 137 VDETGEKIKDEL---LDSGEVSLSGQQILEKALTADGYRYVVAWNKLYRSTLFEK--LKF 191 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKIT 243 G+ ++D ++ R E+ LY Y S+ + + K + + Sbjct: 192 KKGMLYEDEFLNYPLFWDCKRVSIVEEPLYLYVQRKGSIIQSNMTLEKIKMKDKMHTSRI 251 Query: 244 RLLEKLNRNYADKIMIYPEFHQQITYEALRVC-HAVRKEPDILTRQRMIAEIFTSGMYKR 302 + ++ + + +T V + + R I + + K Sbjct: 252 EFYAEKKNSFLHQRSCQQYCNWIVTITVSHYNVLNVAFLKYLQHQFRRIVKYTQNDDKKL 311 Query: 303 LITNVRSV 310 +I N+ Sbjct: 312 IIQNILGY 319 >UniRef50_C7D258 TagF domain-containing protein n=4 Tax=Enterococcus RepID=C7D258_ENTFA Length = 332 Score = 270 bits (691), Expect = 6e-71, Method: Composition-based stats. Identities = 75/335 (22%), Positives = 147/335 (43%), Gaps = 9/335 (2%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 +SV+IP+YN C++S+I QT+ LEIII+NDGSTDNS+ + + + ++ Sbjct: 1 MKLVSVVIPVYNVEKYVEKCLDSVINQTYQNLEIIIVNDGSTDNSLSVCQKKKLSDSRIK 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 L+++ N G S ARN GIE A G+++ F+D+DD + E L+ ++D++ Sbjct: 61 LINKENGGLSSARNAGIECAQGEFICFIDSDDWIELDYIEVLLNGMENTNVDISVIQMIK 120 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 +QS + + +R SS + +YR + I++ Sbjct: 121 VKDFNKIAFQSESQTKW---DIFERETAMRELFSSNLIGYSANNKLYRISLFKS--IRYP 175 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 G+ +D T + ++ + + Y YYL + S+ R KN + + +I Sbjct: 176 EGMLMEDKGTTYRLIDSSTKVAVNGSTKYHYYLRDNSILRT-DFNQKNFDSFIIHEEILN 234 Query: 245 LLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLI 304 +++ + + K+ + + +R+ + T+ I E + S + Sbjct: 235 FMDEHYPSISPKVK--SRYVYEAIRMMMRMIESNYTSKLDYTKCMDIVEKYQSEVLNDNK 292 Query: 305 TNVRSVKVGYQALLWSFRLWQWRDKTRSHHRITRS 339 +V+ +K+ + L+ L K + +I + Sbjct: 293 ISVK-LKLITKLLVKFPSLPSRISKNKLIAKIFKK 326 >UniRef50_B0P2L5 Putative uncharacterized protein n=1 Tax=Clostridium sp. SS2/1 RepID=B0P2L5_9CLOT Length = 323 Score = 270 bits (691), Expect = 6e-71, Method: Composition-based stats. Identities = 66/324 (20%), Positives = 135/324 (41%), Gaps = 18/324 (5%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 +SVI+P+Y D C++SL+ QT+ LEII+++DGS D+ I + + ++ Sbjct: 2 EELISVIVPIYKVEDYLDRCIQSLVNQTYKNLEIILVDDGSPDDCPRICDEWGKKDQRIK 61 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++H+ N G S ARN G++V TG YV+++D+DD + MY+ +M + D+ +C Sbjct: 62 VIHKENGGLSDARNAGMKVMTGAYVSYIDSDDWIEENMYQMMMNAIEVNKADICECAF-- 119 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 +T + + ++ L + + VVW +Y+R+V+ ++ F Sbjct: 120 --EKTSGSVKKEDQKNDDKVSIMDKRSALMAVVEEK-IQPVVWNKLYKREVVK--DLWFE 174 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 G +++D +T + + A + Y Y+ S+ + + Y ++ Sbjct: 175 VGKYNEDEFFTYKAIERADKIIQISDIGYYYFFREDSIINETYTIRRLDGLEARYERM-- 232 Query: 245 LLEKLNRNYADKIMIYPEFHQQITYEALRVCHA---VRKEPDILTRQRMIAEIFTSGMYK 301 + IY +Q+ + + ++ ++ + EI+ Sbjct: 233 ------KYLGKYPEIYRVAKRQLIFMCMYHYQKGMRYLSGKELEDLKKRVKEIYRDIPID 286 Query: 302 RLITNVRSVKVGYQALLWSFRLWQ 325 + S K + L Sbjct: 287 KTDIRKYSSKEKIWYYISKISLEF 310 >UniRef50_A1A380 Putative uncharacterized protein n=1 Tax=Bifidobacterium adolescentis ATCC 15703 RepID=A1A380_BIFAA Length = 323 Score = 270 bits (691), Expect = 6e-71, Method: Composition-based stats. Identities = 62/283 (21%), Positives = 112/283 (39%), Gaps = 13/283 (4%) Query: 2 MN--STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAEN 59 MN + +S+I+P+YN C+ SLI QT +EI++I+DGS DN+ I + Sbjct: 1 MNAVTEPLVSIIVPIYNVEKYLDQCVASLIQQTHHNIEILLIDDGSPDNAPAICDAWQHK 60 Query: 60 YPHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQ 119 V +LH+ N+G S +RN G+ A G+YV FVD DD + P E ++ + + + D+ Sbjct: 61 DERVHVLHKTNSGVSASRNAGLHAAKGEYVCFVDGDDWIEPDFVEYMLELQAKRNADIVA 120 Query: 120 CNADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKN 179 + + + + + + W +YRR + N Sbjct: 121 SLCCFTSTD-------FHQRTQDEVSTINPQEAMELFFYP-TIELGAWNKLYRRSFLTAN 172 Query: 180 NIKFIAG-LHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRH 238 N++FI + + + T A ++ +Y Y + + + + Sbjct: 173 NLEFITDFCAGEGLQFITLAASKANCVTVGQKKVYHYRT--DNATSATTKPDVERQGIGA 230 Query: 239 YIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKE 281 + + E L+ + Y LR A +E Sbjct: 231 ITTMQYICEHLDLSNPKVRKAYNWRLWSTYGYCLRQIIATHQE 273 >UniRef50_A6NTT3 Putative uncharacterized protein n=1 Tax=Bacteroides capillosus ATCC 29799 RepID=A6NTT3_9BACE Length = 331 Score = 270 bits (690), Expect = 7e-71, Method: Composition-based stats. Identities = 84/331 (25%), Positives = 140/331 (42%), Gaps = 30/331 (9%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 S K+SVI+P YN C++SL+ Q++ LEII+ +DGSTD + + YA + + Sbjct: 2 SMKKISVIVPAYNIEPYLGRCLDSLLAQSYPGLEIIVTDDGSTDGTAALLDRYAAEHSQI 61 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 +HQ N G + AR ++ ATG Y+ FVD DD M++TL+ A D+A C Sbjct: 62 VAIHQPNGGVTAARLAALKQATGDYIGFVDGDDLAEHEMFQTLIDDAEHYGADIAHCGYQ 121 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 F + G + + +W +YRR++I F Sbjct: 122 MVFPSGRVDRYYGTGRVVEQ----DGESGVLDLVRGDFVEPGLWNKLYRRELIE----HF 173 Query: 184 I-AG------LHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQ 236 + G ++D++ A + + ++ LY Y L S S Q K L+ Q Sbjct: 174 LAEGRMDWTVRINEDLLMNFLLFSLAKKCVFRDKCLYHYMLRKGSASSAKVQAYKLLDPQ 233 Query: 237 RHYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKE---PDILTRQRM-IA 292 R +I Y + + + HQ +RV + P++ + R+ + Sbjct: 234 RVMQRI----------YEETTGLV-QLHQAAYVRYVRVLLGNTTQSTCPEVSQQARIELK 282 Query: 293 EIFTSGMYKRLITNVRSVKVGYQALLWSFRL 323 ++ YK+L R + +G LL +RL Sbjct: 283 QLICEKEYKQLSWKERYMVLGASKLLPLYRL 313 >UniRef50_C7H2S4 Glycosyl transferase, group 2 family n=3 Tax=Clostridiales RepID=C7H2S4_9FIRM Length = 572 Score = 269 bits (689), Expect = 9e-71, Method: Composition-based stats. Identities = 60/262 (22%), Positives = 111/262 (42%), Gaps = 5/262 (1%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 K+S+I+P+YNA C++S++ Q + E+I+++DGS D S EI YA V Sbjct: 76 EPKVSIIVPVYNAEKSLARCVDSILNQEFRDFELILMDDGSKDRSGEICDGYARADARVV 135 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++H+ N G S RN+ I A G ++ FVD+DD + + ++ A E D+ + Sbjct: 136 VVHKENTGVSDTRNQAIARARGTFLQFVDSDDWLTADATKLMVRAAEETGCDMVIADFYR 195 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 E I D++ G M + + V+W +YRR ++ + I+ Sbjct: 196 VVGEMVSQKGDIDADQVIGREAFVG---FMMENPADYYYGVLWNKLYRRSLVEAHGIRMD 252 Query: 185 AGLHH-QDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKIT 243 A L +D ++ E++ A +Y Y S+ + + + + Sbjct: 253 AKLSWCEDFLFNLEYVRYATTFYALRTPVYYYVKTKGSLVNQKISFARTVEMKLAMFECY 312 Query: 244 R-LLEKLNRNYADKIMIYPEFH 264 + + A + +H Sbjct: 313 NDFFKHVLDEDAYERKRLQVYH 334 >UniRef50_A6BH37 Putative uncharacterized protein n=1 Tax=Dorea longicatena DSM 13814 RepID=A6BH37_9FIRM Length = 323 Score = 269 bits (689), Expect = 1e-70, Method: Composition-based stats. Identities = 66/301 (21%), Positives = 138/301 (45%), Gaps = 13/301 (4%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 MN+ +S+IIP+Y C++S+++QT++ LEII+++DGS D S I +A Sbjct: 1 MNTM--ISIIIPVYKVEKYLDKCIKSIVSQTYSNLEIILVDDGSPDKSGLICDKWATLDS 58 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 +R++HQ NAGA ARN +++A G++++FVD+DD + M++ L++ E D+D+ +C Sbjct: 59 RIRVVHQKNAGAGAARNVALKLAQGEFISFVDSDDYLSVNMFKNLLSYFEE-DIDIVECE 117 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNI 181 + ++ V + + +R ++ + ++W +YRR I N+ Sbjct: 118 FISVEEDDV----LFIDEKSVQARVFSSEEAMREHIADHYFRQLIWNKLYRRSCIK--NV 171 Query: 182 KFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIK 241 F G D W + + A + + + LY Y + SV + + N+ + Sbjct: 172 YFPEGKKIDDEFWMYKVIGRAKQLVHCNEKLYAYRQQDNSVMHMLKPENR----IQAIEA 227 Query: 242 ITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYK 301 + + + ++ + ++ + + D+ ++ +K Sbjct: 228 KGYRHQYIKEKFPRLTLMSYDCLVMSCIYQIQRVYRTGTKDDVQKVLEYSKKVLRENEFK 287 Query: 302 R 302 + Sbjct: 288 K 288 >UniRef50_A7VPB2 Putative uncharacterized protein n=1 Tax=Clostridium leptum DSM 753 RepID=A7VPB2_9CLOT Length = 332 Score = 269 bits (689), Expect = 1e-70, Method: Composition-based stats. Identities = 70/303 (23%), Positives = 131/303 (43%), Gaps = 9/303 (2%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 +S++IP+YN R ++S+ QT+ E II++DGSTD EI Y Y + Sbjct: 13 EKPIISIVIPVYNVDMFLRKALQSVADQTFQNYEAIIVDDGSTDRCPEIIAKYVRKYSNF 72 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 + + Q NAG S ARN G+ +A G+++AF+D+DD PT E + + +D D++ C+ Sbjct: 73 KSVRQENAGLSAARNTGLRLAQGEFIAFMDSDDYYQPTFLEDMYNACVANDCDISYCSFY 132 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 + F G IP T V L + + + H W +Y+R + ++N++ + Sbjct: 133 FYFPYNG---LRIPLPFTCRTAVYESRQALNLLIRDKSMHHFAWNKLYKRSLFLENDVTY 189 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKN-----LNYQRH 238 ++ +D+ + + ++A R T + LY Y S+ + N L Y R+ Sbjct: 190 -TQMYFEDVATSPKLYYHAKRVAVTSKPLYNYTRRRGSILKSMNAKKINDFIRTLGYTRN 248 Query: 239 YIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSG 298 +++ +++ + + + + F S Sbjct: 249 FLEEADDFRAFRKSFNKFARRMKMVNWYSVLRLHVITLNFKGMWKNIKNSNRSIRYFMSE 308 Query: 299 MYK 301 Y+ Sbjct: 309 DYR 311 >UniRef50_C6I619 Glycosyl transferase n=4 Tax=Bacteroides RepID=C6I619_9BACE Length = 329 Score = 268 bits (687), Expect = 1e-70, Method: Composition-based stats. Identities = 69/334 (20%), Positives = 137/334 (41%), Gaps = 11/334 (3%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M N+ +S+I+P+Y ++SL+ QT+ LEII+++DGS DN I YA Sbjct: 1 MNNTNPLVSIIVPVYKVERYIHRSIDSLLRQTYNNLEIILVDDGSPDNCPFICDKYAMQD 60 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 +R++H++N G S ARN + V TG+YV FVD+DD + M E + ++ Sbjct: 61 NRIRVIHKSNGGLSDARNAALNVMTGQYVTFVDSDDYIADNMIEKFIETVKLYQCNMVVA 120 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN 180 + + Q R ++ + TG D R L + V ++ + + Sbjct: 121 GMNIIDKNN----QIYDYRRTLTSTLSTGIDITRKLLKDVFPFNFVCAKIFESSLFEE-- 174 Query: 181 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKY-YLHNTSVSRLHRQGNKNLNYQRHY 239 I+F G H++D T + R LY Y +++ +Y Sbjct: 175 IRFPVGRHYEDTATTYKLTHKCERIYCIADCLYFYELEREGNITSELNTSKAIKSYIDGC 234 Query: 240 IKITR--LLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTS 297 + + +++++D + + + Q A++ ++ + + ++ I + Sbjct: 235 LNSREQIFFCQKHKDFSDLLPVLTKRLQLWALLAIQSAISLGYKEYMFYYHQIRIYIRSL 294 Query: 298 GMYKRL--ITNVRSVKVGYQALLWSFRLWQWRDK 329 +R+ + + + Y +L + K Sbjct: 295 SYTQRMTHLQIALTFPIIYYSLYPLLSRLKKMYK 328 >UniRef50_B0ZTM8 Putative uncharacterized protein n=1 Tax=Campylobacter jejuni RepID=B0ZTM8_CAMJE Length = 351 Score = 268 bits (687), Expect = 2e-70, Method: Composition-based stats. Identities = 75/358 (20%), Positives = 138/358 (38%), Gaps = 24/358 (6%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M+ +SVIIP+YN R C++S++ QT +EII+INDGSTDN +I YA Sbjct: 1 MVMQQPLVSVIIPIYNVEKYLRECLDSVVNQTLREIEIILINDGSTDNCGKICDEYATKD 60 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 + ++H+ NAG A N G+++A G+Y+ FV++DD + MYE L A +D D+ +C Sbjct: 61 KRIIVIHKENAGLGAAYNSGLDIAKGEYIGFVESDDWIEFNMYEELYFKAKNNDSDLVKC 120 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVL-----TGPDWLRMGLSSRRWTHVVWMGVYRRDV 175 + + T E + + + + + +W +Y + + Sbjct: 121 -MFYDYNSTREPKDKLYMQKHEDFELFNCAPDNKIFKIEDYPKLLIYHSSIWASLYHKTL 179 Query: 176 IVKNNIKFIA--GLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNL 233 I N ++F G +QD + E + A + ++ Y Y + S + R + Sbjct: 180 I--NRLRFQESKGATYQDFPFAIEALVLANKITTLRKAFYHYRQEPLNASSVKRTDQNLI 237 Query: 234 NYQRHYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAE 293 I L L K Y + ++ + L ++++ Sbjct: 238 KMIDQCIYARTRLISLGVFEKFKEEFY----AHAISPLYGFYYQLQDDLKPLFFEKLVKF 293 Query: 294 I---------FTSGMYKRLITNVRSVKVGYQALLWSFRL-WQWRDKTRSHHRITRSAF 341 + + +K + L + ++ K + R Sbjct: 294 FNNSLSENLSYKYFNQREKDFLFCILKKDFSKTLLCNKQSLLYKFKKILKFLLGRRYI 351 >UniRef50_A0Q780 Glycosyl transferase, family 2 n=16 Tax=Francisella RepID=A0Q780_FRATN Length = 319 Score = 268 bits (687), Expect = 2e-70, Method: Composition-based stats. Identities = 62/327 (18%), Positives = 134/327 (40%), Gaps = 8/327 (2%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M ++++IP+YN + C++S+I QT+ LEII++NDGSTDNS+EI + YA+ Sbjct: 1 MKYCELITLVIPIYNIENYLGRCLDSVINQTYKDLEIILVNDGSTDNSLEICESYAKKDS 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 ++++++ N G S ARN G++ G YV F+D+DD V E L ++++ D++ N Sbjct: 61 RIKIINKNNGGLSSARNVGLDACKGDYVTFIDSDDWVSLDYIEILYKNIIDNNADISIIN 120 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNI 181 A + + +L + +Y+ + N+ Sbjct: 121 AIKVKSQNND------FTLKEQKNLLHTYFSSIEFALDNTLPVMACAKLYKTKLFE--NL 172 Query: 182 KFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIK 241 +F + +D +++A + T+ Y Y+ TS++ ++ + I Sbjct: 173 RFTNSIIFEDEDIMYRLLYHANKIVCTDYIGYFYFQRPTSITSSKKKRQNIIKSSDSLIF 232 Query: 242 ITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYK 301 + E+ + + + + +++ +++ I + Sbjct: 233 VLTKKEQFFKGKTTIPYKFYIDSAGLLSRYYAKSFLYPFDKEMIKQRKKIKLFINKLLQN 292 Query: 302 RLITNVRSVKVGYQALLWSFRLWQWRD 328 K+ + + + ++D Sbjct: 293 TKSIETFRYKIKRNVIKYFVFFFLFKD 319 >UniRef50_C3X3W6 Putative uncharacterized protein n=1 Tax=Oxalobacter formigenes HOxBLS RepID=C3X3W6_OXAFO Length = 337 Score = 268 bits (686), Expect = 2e-70, Method: Composition-based stats. Identities = 71/341 (20%), Positives = 134/341 (39%), Gaps = 20/341 (5%) Query: 6 NKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRL 65 +SVI+P+Y A C++S++ QT+ LEI++++DGS D I YA +R+ Sbjct: 4 PLVSVIVPVYRAQAYLARCLDSIVGQTYPNLEILLVDDGSPDACGAICDDYAARDARIRV 63 Query: 66 LHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWC 125 H N G + ARN ++ TG+YV FVDADD + P + E + A E D+ + Sbjct: 64 FHLENGGVARARNVALDAMTGEYVIFVDADDYIAPELVEKALQTARERSADIVVFDFWNI 123 Query: 126 FRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIA 185 ++ + G + +R + +W +R + I+ + Sbjct: 124 EKDGLQLMTQCYHPGDV------GIEKMRELILKDELLSYMWNKFFRASLFE--GIRIAS 175 Query: 186 GLHHQDIVWTTEFMFNALRARYTEQSLYKYY-LHNTSVSRLHRQGNK-NLNYQRHYIKIT 243 D++ A A Y Y Y L++ S++ + NK N + + Sbjct: 176 VRTFSDLLVMPRIFMQAKSAVYVRTPYYYYNLLNDGSLTSIKSVSNKQNAKNRYGMFRAW 235 Query: 244 RLLEKLNRNYADKIMIYPE---FHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSG-- 298 E++ R + ++ Y E + I+ + + E + R + + + Sbjct: 236 EERERVARECCESVVEYSERRAVNSAISTRVIDLYRPALSEEETERLGRFLEDKKRNRVS 295 Query: 299 ----MYKRLITNVRSVKVGYQAL-LWSFRLWQWRDKTRSHH 334 Y+ L + + S +L+QW+ + Sbjct: 296 RIDAKYRTLWWLSDHCPLLCRLYGYLSVKLFQWKLALKERK 336 >UniRef50_C3ADH9 Putative uncharacterized protein n=1 Tax=Bacillus mycoides DSM 2048 RepID=C3ADH9_BACMY Length = 338 Score = 268 bits (685), Expect = 3e-70, Method: Composition-based stats. Identities = 75/341 (21%), Positives = 142/341 (41%), Gaps = 15/341 (4%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M +SVI+P+YN C+ES++ Q + +E++++NDGSTD EI + Y E Sbjct: 1 MKGLHELISVIVPIYNVEKWLARCIESVLKQPFVNIELVLVNDGSTDKCGEICEEYLEKD 60 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 ++++++ N G S ARN GIE ATGKY+ F+D DDEV + L +A + D C Sbjct: 61 NRIKVIYKENGGVSSARNAGIEAATGKYIVFIDPDDEVSENYFTRLYGIAEDKKCDAVIC 120 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMG--LSSRRWTHVVWMGVYRRDVIVK 178 + V+ G D++ + S+ W +Y +I + Sbjct: 121 GYKTVPNNNSIIP------NFKLDTVMNGRDFVLSSPNVHSKNDLCFAWRYIYNLSIIKE 174 Query: 179 NNIKFIAGL-HHQDIVWTTEFMFNALRARYTEQSLYKYYLHN-TSVSRLHRQGNKNLNYQ 236 NI+F + +D+++ EF+ ++ R + LY Y ++N S+ R+ + N + Sbjct: 175 ENIRFNEQVFIGEDVIFNLEFLLHSQRVCAVSEVLYFYTINNPNSLMRVPYKPNLENSLV 234 Query: 237 RHYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFT 296 Y +L ++ + + Y + + + Sbjct: 235 LQYQMRRQLSQEFGLLHDKQFK-----KDMANYYINHIYRLLINNLKSAPNVDVNNAFRR 289 Query: 297 SGMYKRLITNVRSVKVGYQALLWSFRLWQWRDKTRSHHRIT 337 Y+ +V+ + Y+ + ++ K + + I Sbjct: 290 IVNYEMFSDSVKGIGFSYRCNNFKEYIYYLAIKFKVYPLIV 330 >UniRef50_D0T9V5 Glycosyltransferase family 2 n=1 Tax=Bacteroides sp. 2_1_33B RepID=D0T9V5_9BACE Length = 318 Score = 268 bits (685), Expect = 3e-70, Method: Composition-based stats. Identities = 79/333 (23%), Positives = 140/333 (42%), Gaps = 29/333 (8%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M+ +SVI+P+Y A + C++SL+ QT+ EI++I+DGS D+S +I YA Sbjct: 1 MDKQPAVSVIVPVYKAENYLCHCVDSLLAQTFPDFEILLIDDGSPDHSGDICDEYARKDS 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 VR+ H+ N G S R GI+ A G+Y D DD V P M E L A E+D DV C+ Sbjct: 61 RVRVFHKENEGVSSTRQCGIDNAKGEYTIHADPDDWVEPDMLEELYKKAKEEDADVVICD 120 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNI 181 + + + P+D L LR + +R ++ Sbjct: 121 FYEDIGDKRKYIRQQPSD-------LDHKTVLRELFQQ--LHGSCCNKLVKRACY-NGDV 170 Query: 182 KFIAGLHH-QDIVWTTEFMFNALRARYTEQSLYKY--YLHNTSVSRLHRQGNKNLNYQRH 238 +F+ L +D+ + + ++ Y ++ Y Y ++ S++R Sbjct: 171 RFVPDLTFCEDLCFNALILRKEIKIGYLPRAFYHYEQNVNAGSLTRNSSP--------EQ 222 Query: 239 YIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAV---RKEPDILTRQRMIAEIF 295 Y+ + +LEK + AD MI P Y+A + + + I Sbjct: 223 YVVLMNILEK-GLDTADFQMIRPILIYSWAYKAFLMRLYTDTQFRSLFWKYQLDFFRSIL 281 Query: 296 TSGM-YKRLITNVRSVKVGYQALLWSFRLWQWR 327 S + +K++I + W++++++ Sbjct: 282 LSPLCFKKVI---LLFISCWSTQKWAYKIYELY 311 >UniRef50_C0X9Z9 Glycosyltransferase n=1 Tax=Lactobacillus gasseri JV-V03 RepID=C0X9Z9_9LACO Length = 294 Score = 267 bits (684), Expect = 3e-70, Method: Composition-based stats. Identities = 77/300 (25%), Positives = 144/300 (48%), Gaps = 11/300 (3%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M++ K+SVIIP+YN + C+ES++TQT+T LEII+++DGSTD++ EI + Y E Y Sbjct: 1 MDTNEKVSVIIPVYNDEKYLKQCVESVLTQTYTNLEIILVDDGSTDHTPEICEEYREKYN 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 +R+LH+ N G +RN G+ +ATG+YV FVD DD + E L +A+++ D+A N Sbjct: 61 QIRVLHKKNGGVGSSRNAGLALATGEYVLFVDHDDWLDQHHIEDLYNLAVKNKADIAAGN 120 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHV-----VWMGVYRRDVI 176 + + + + D + +W + + + W +Y+R + Sbjct: 121 FNIFYDDKSTFAYWLNKDSYFEKD-YSIKEWFSLQYRTDYYNMSLVFTVPWDKLYKRSLF 179 Query: 177 VKNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQ 236 NI + + +D + T + A + Y +++Y + + +SVS + + Sbjct: 180 K--NIVYPENVPVEDDLTTWKIYLLANKIAYENKAIYTHRILKSSVSASVSETDVFP--L 235 Query: 237 RHYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFT 296 R + +L + + +++ Y ++ AL K D L + ++ + Sbjct: 236 RSIEERITILRLIGFDTSNEESAYRYRLEKWKNHALDK-GEYLKYRDTLQKLAILEKWKN 294 >UniRef50_B2RH99 Probable glycosyltransferase n=2 Tax=Porphyromonas gingivalis RepID=B2RH99_PORG3 Length = 328 Score = 267 bits (684), Expect = 4e-70, Method: Composition-based stats. Identities = 79/329 (24%), Positives = 134/329 (40%), Gaps = 19/329 (5%) Query: 6 NKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRL 65 ++SVIIP+YNA + C+++L+ QT+ E+I+I+DGS D S I YA V + Sbjct: 3 PRISVIIPVYNAEKFLKHCLDTLLAQTFKDFEVILIDDGSRDRSSNICDAYAYKDTRVHV 62 Query: 66 LHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWC 125 H+ N G S AR +G+E+A+G Y+ VDADD V P+ E L A+ + D+ C+ Sbjct: 63 YHKENGGVSAARQKGLELASGAYIIHVDADDWVAPSYLEVLYEEAVRTNADITICDYIEV 122 Query: 126 FRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIA 185 + + PT VL + W + + VI +N I F Sbjct: 123 YPDKKVYKVQKPTSTSADDFVLD---------LCTKLHGSCWNKLIKASVIRENKIDFPD 173 Query: 186 GL-HHQDIVWTTEFMFNALRARYTEQSLYKYY--LHNTSVSRLHRQGNKNLNYQRHYIKI 242 G+ +D ++ + Y ++LY Y+ S S +Y Sbjct: 174 GINLREDKLFNLHVALCSSTISYAPKALYYYFCSHLGGSASSQGSLETSMKSYIYGNNLF 233 Query: 243 TRLLEKLNRN---YADKIMIYPEFHQQITYEALRVCHAVR--KEPDILTRQRMIAEIFTS 297 L + + A + YP + YE V H + + A S Sbjct: 234 AHFLSNKSNSALLTAQRRFKYPTIAYLLIYEQDYVSHVSQEISSLSFTYIPDIWATSQIS 293 Query: 298 GMYKRLITNVRSVKVGYQALLWSFRLWQW 326 +Y+ + + + + L ++++W Sbjct: 294 ALYRLTLILSTFLPLKF--LYSIIKIYKW 320 >UniRef50_B9AFV3 Putative uncharacterized protein n=1 Tax=Methanobrevibacter smithii DSM 2375 RepID=B9AFV3_METSM Length = 379 Score = 267 bits (684), Expect = 4e-70, Method: Composition-based stats. Identities = 72/309 (23%), Positives = 137/309 (44%), Gaps = 7/309 (2%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 ++SV++P+YN + R C++SL QT+ E+I +NDGS D+S+ I + YA + Sbjct: 1 MPQISVVLPVYNVEEYLRQCLDSLANQTFEDFEVICVNDGSGDSSLSILEEYASEDERFK 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++ Q N G S ARN G++ GKY FVD+DD + E L D D+ + Sbjct: 61 IISQENKGLSGARNTGMDHIKGKYTIFVDSDDWLELNALEKLYNKITALDSDILMYKFRF 120 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSS--RRWTHVVWMGVYRRDVIVKNNIK 182 +++ + +S+ T+ L ++ +S + +H + +Y+ + + IK Sbjct: 121 FNQDSEQYSESVFTNLEVIDASLENKNFNYRDVSDILFKISHAPFNKLYKTSFLREAGIK 180 Query: 183 FIAGLHHQDIVWTTEFMFNALRA-RYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIK 241 F L+++D+ + F + + ++SLY Y + N S+S ++ + Sbjct: 181 FPENLNYEDLYFFYMVFFKTEKVSVFRDESLYNYRIRNNSISTTG--NERSFDIFEILDL 238 Query: 242 ITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYK 301 L+ +N K + Y LR+ +R + L + + E ++ K Sbjct: 239 TQDNLKNVNIYSKVKDDFLMFVIVNLKYVYLRLNERLRNQ--YLQKMKEKYEFYSLNQVK 296 Query: 302 RLITNVRSV 310 + N Sbjct: 297 KDHLNTWHF 305 >UniRef50_D2RIJ6 Glycosyl transferase family 2 n=1 Tax=Acidaminococcus fermentans DSM 20731 RepID=D2RIJ6_ACIFE Length = 310 Score = 267 bits (683), Expect = 5e-70, Method: Composition-based stats. Identities = 78/318 (24%), Positives = 148/318 (46%), Gaps = 13/318 (4%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 LSVI+P+Y C++SL+ QT+ L+II+++DGS DNS I YYA+ + Sbjct: 3 QKLLSVIVPVYKVEKFLDRCIQSLLKQTYRNLQIILVDDGSPDNSGIICDYYAKKDFRIE 62 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++HQ+N G S ARNRG+ A G+Y+AF+D+DD + P MYETL+ + +++LDVA+C+ Sbjct: 63 VIHQSNKGLSGARNRGLIEARGEYIAFLDSDDWIEPDMYETLIGLMEKNNLDVARCS--- 119 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 G + + + V L VVW VY+RD++ I Sbjct: 120 IDETDGSFHRLKTPSKQYANRVYENNTCLD-LYFHEFLCKVVWNAVYKRDIVD--GILSP 176 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 G +D + ++F R T + L+ Y+++ ++R H+ I + Sbjct: 177 EGHQSEDNYVSGRYLFRCKRIWITSKVLHHYWMNPDGITRAHKIKPW-----DICICTSL 231 Query: 245 LLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLI 304 L++ L ++ + +++ E + ++ ++ + + + Sbjct: 232 LIKDLPKDGIVTEKYKVQLKRKLARELFHYIRSNDSRCQVIAIKK--KQKYDILKWLDFS 289 Query: 305 TNVRSVKVGYQALLWSFR 322 +R + + + + ++ Sbjct: 290 RKIRFLVLLCKKNIKIYK 307 >UniRef50_Q5LZN9 Exopolysaccharide biosynthesis protein, sugar transferase n=2 Tax=Streptococcus thermophilus RepID=Q5LZN9_STRT1 Length = 324 Score = 266 bits (682), Expect = 5e-70, Method: Composition-based stats. Identities = 62/311 (19%), Positives = 117/311 (37%), Gaps = 23/311 (7%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M + +S++IP+YN C++S+ QT+ E+I++NDGSTD+S+ I + + Sbjct: 1 MYLKSLISIVIPVYNVEKYLEKCLQSVQNQTYNNFEVILVNDGSTDSSLSICEKFVNQDK 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 + + N G S ARN GI+ A G ++ FVD+DD + L+ + C+ Sbjct: 61 RFSVFSKENGGMSSARNFGIKKAKGSFITFVDSDDYIVKDYLSHLVAGIKSETS--IVCS 118 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNI 181 + E G + +S V + L + VW +Y I Sbjct: 119 KFFLVDEKGSLLTKKEAPKKKSEVVSIEESIKILLLQQNGYDLAVWGKLYPVSFFET--I 176 Query: 182 KFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIK 241 F G ++D+ T + + A + + Y Y S+ K Sbjct: 177 SFPEGKLYEDMGTTYKLLKLASEVVFLDAYDYAYVQRPNSIMNSSFNLKK---------- 226 Query: 242 ITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYK 301 ++E ++ D + +P + A ++ + E K Sbjct: 227 -LDIIEMVHEMENDILAQFPNLALYVKNRAFAA--------EVKIFLEIPKEKEFEQAQK 277 Query: 302 RLITNVRSVKV 312 +L +++ + Sbjct: 278 QLWHDIKKNRK 288 >UniRef50_Q8A2D4 Glycoside transferase family 2 n=2 Tax=Bacteroides RepID=Q8A2D4_BACTN Length = 329 Score = 266 bits (682), Expect = 6e-70, Method: Composition-based stats. Identities = 72/343 (20%), Positives = 148/343 (43%), Gaps = 20/343 (5%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M + +S+I+P++N D C+ESL +QT +EII++++ S D S EI YA+ Sbjct: 1 MENKYLVSIIVPVHNTADYLHKCIESLRSQTLQQIEIILVDNLSVDGSSEICDEYAKIDK 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 +++LH + AG S+ARN G+ +A+ Y+ FVD+DD V P M+E ++ ++ D ++A CN Sbjct: 61 RIKVLHLSIAGLSIARNAGMRIASAPYIGFVDSDDYVEPAMFEKMLDAMVQSDAEIAYCN 120 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNI 181 + + + V+ P + + + + +Y+ + + Sbjct: 121 FLLEYEFKPNESPYRNSGDI----VIRDPKNVLQDMMMEKVSCSACTKLYKSGFLTS--L 174 Query: 182 KFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIK 241 +F G +++D + E++ + + + Y Y TS+ N + Y + Sbjct: 175 QFPEGKNYEDRLVMYEWVALCKKVVWVDSPFYHYVERQTSICHTMSPMNLYHYFLAEYAR 234 Query: 242 ITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYK 301 + R +EK + + + +I + C + KE + R E + Sbjct: 235 M-RFMEKYSLFEEEDL-------FKIRTRLIGTCLTIFKEILLKVNLREFKE--PVKDMR 284 Query: 302 RLITNVRSVKVGYQALLWSFRL----WQWRDKTRSHHRITRSA 340 + + + ++ L + R+ + W +H + Sbjct: 285 QKMKEIAALPEDTYELKYRRRVRKIAYHWHLYYLTHFLFKKGG 327 >UniRef50_Q9AH91 Putative glycosyl transferase n=3 Tax=Streptococcus pneumoniae RepID=Q9AH91_STRPN Length = 354 Score = 266 bits (682), Expect = 6e-70, Method: Composition-based stats. Identities = 79/336 (23%), Positives = 143/336 (42%), Gaps = 20/336 (5%) Query: 1 MMNST--NKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAE 58 M+ ++SVI+P+YN R CM+SLI QT+ EII++NDGSTD+S + + +A+ Sbjct: 1 MIKDMSCPEISVIVPVYNVERYLRQCMDSLINQTYRDFEIILVNDGSTDSSGVLCEDWAK 60 Query: 59 NYPHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVA 118 + ++H+ N G ARN G+E A GKY+ FVD+DD V M +TL E D++V Sbjct: 61 KDERIHVVHKKNEGLGFARNTGVEHAKGKYITFVDSDDYVSLDMLQTLYNAVQEYDVEVV 120 Query: 119 QCNADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGL-----------SSRRWTHVVW 167 + GE ++ + + G D L + + W Sbjct: 121 YSAGYYRSFSNGEIKKTDVETKKP--QLFEGGDVASKLLPDVISAPPEYPNDGKVGVSAW 178 Query: 168 MGVYRRDVIVKNNIKFIAGLHH--QDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRL 225 +Y ++ + + F + +D ++ + + A A LY Y + S+S Sbjct: 179 KVLYEANLFKEKGLLFHSEREFISEDAIFQIDCLKLATSALVIPDILYYYRENFGSLSMK 238 Query: 226 HRQGNKNLNYQRHYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDIL 285 +++ L+ Y + + +E L + I I + ++ K I Sbjct: 239 YKEERFELDKIL-YNEQLKRVEGLPNQEILEERIERILIANIRLCIFQ--ESLYKSSRIH 295 Query: 286 TRQRMIAEIFTSGMYKRLITNVRSVKVGYQALLWSF 321 R + I +I + K ++ + ++ + L Sbjct: 296 KRLQRIRQICKDPISKSVLRHYPIHRLPFTKRLICV 331 >UniRef50_A2STD8 Glycosyl transferase, family 2 n=1 Tax=Methanocorpusculum labreanum Z RepID=A2STD8_METLZ Length = 346 Score = 266 bits (681), Expect = 8e-70, Method: Composition-based stats. Identities = 78/324 (24%), Positives = 142/324 (43%), Gaps = 8/324 (2%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M + +SVI+P+YN +TC++S++ QT+ LEII++ S DNSV I Y + Sbjct: 1 MNSEVPLISVIVPVYNVEPYLKTCIDSILKQTYRNLEIILVASTSIDNSVVICDSYLQVD 60 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 +++LH G S ARN G+ +A+G Y+ FVD+DD + ETL ++ + D D+ QC Sbjct: 61 SRLQVLHTEPQGLSAARNIGVSMASGTYLCFVDSDDFISQNFVETLYGLSQKYDADIVQC 120 Query: 121 NADWCFRETGETWQSIPTDRLR---STGVLTGPDWLRMGLSSRRWTHVV-WMGVYRRDVI 176 N + T + + D + + W +YR+++ Sbjct: 121 NFLEISENATYSICESCTSIFNDALEIHIYSNVDMCANLYNDLADSSTASWTKLYRKELF 180 Query: 177 VKNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQ 236 + N+K+ G H+D + + + A R T LY Y + S++R K + Q Sbjct: 181 M--NVKYPIGKIHEDEGTSYKLFYLAKRVAVTSLKLYYYRSNPLSLTRATYTLKKQDSLQ 238 Query: 237 RHYIKITRLLE-KLNRNYADKIMIYP-EFHQQITYEALRVCHAVRKEPDILTRQRMIAEI 294 ++ E ++ YA + Y + I + + R + ++ + + I +I Sbjct: 239 FSEEQMDFFREGGYDKLYALALKKYMISLSEHILLTKVYLSKKERNDTELEIKYKAIYKI 298 Query: 295 FTSGMYKRLITNVRSVKVGYQALL 318 +Y + + + A L Sbjct: 299 AAKNLYLPIYDKWKITLIHTLAKL 322 >UniRef50_O87182 CpsIaI n=15 Tax=Streptococcus agalactiae RepID=O87182_STRAG Length = 333 Score = 266 bits (681), Expect = 9e-70, Method: Composition-based stats. Identities = 74/330 (22%), Positives = 150/330 (45%), Gaps = 13/330 (3%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M +S+IIP+YN+ + C++S++ QT +E+I+I+DGSTDNS EI ++ Sbjct: 1 MEKRILVSIIIPIYNSEAYLKECVQSVLQQTHPLIEVILIDDGSTDNSGEICDNLSQEDN 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 + + H+ N G S ARN G++ +TG+++ FVD+DD V P M E ++ + ++ D+A+ + Sbjct: 61 RILVFHKKNGGVSSARNLGLDKSTGEFITFVDSDDFVAPNMIEIMLKNLITENADIAEVD 120 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNI 181 D + V + L+ LS R ++V +Y++ +I N+ Sbjct: 121 FDISNERDYRKKKRRNFY-----KVFKNNNSLKEFLSGNRVENIVCTKLYKKSIIG--NL 173 Query: 182 KFIAG-LHHQDIVWTTEFMFNALRARY-TEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHY 239 +F +D+++ + + R T SLY Y + TS + +L++ + Sbjct: 174 RFDENLKIGEDLLFNCKLLCQEHRIVVDTTSSLYTYRIVKTSAMN-QKFNENSLDFITIF 232 Query: 240 IKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGM 299 +++ L+ NY + + + + + + R+ +I + Sbjct: 233 NEVSSLVPAKLANYVEAKFLREKIK---CLRKMFELGSNIDNKIKVQREIFFKDIKSYPF 289 Query: 300 YKRLITNVRSVKVGYQALLWSFRLWQWRDK 329 YK + + + + S +L+ + Sbjct: 290 YKAVKYLSLKGLLSFYLMKCSPKLYVMAYR 319 >UniRef50_D0DSJ8 Glycosyl transferase CpsJ n=1 Tax=Lactobacillus fermentum 28-3-CHN RepID=D0DSJ8_LACFE Length = 349 Score = 266 bits (680), Expect = 9e-70, Method: Composition-based stats. Identities = 66/329 (20%), Positives = 127/329 (38%), Gaps = 22/329 (6%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K+S+I+P+YNA + C+ESL+ QT+ +EII+++DGSTD S ++ YA+ + ++ Sbjct: 2 KISIIVPVYNAEEYLSKCLESLVNQTYRNIEIILVDDGSTDASPQMCDEYAQKDDRIIVI 61 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 H+ N G S+ RN G+ TG + FVD+DD + E L+ +D+D+ + + Sbjct: 62 HKENEGLSLTRNVGMAAITGDFFTFVDSDDWLSYNYVENLVVALEANDIDIVFTSYLKEY 121 Query: 127 RETGETWQSIPTDRLRSTGVLTGPDWLRMGL----------SSRRWTHVVWMGVYRRDVI 176 + P D + + LR + + W YR Sbjct: 122 KSKSVGVTFFPEDEIIFKSSEGSNEVLRRLVGEIGDELQHPERIENLNTAWGKCYRTSKF 181 Query: 177 VKNNIKFIAG--LHHQDIVWTTEFMFNALRARYTEQSLYKYY-LHNTSVSRLHRQGNKNL 233 +KF G + +D+ + + + R+ Y Y Y + S+ + Sbjct: 182 K--QLKFDDGNEIRSEDLWFNIQAFYLTERSMYINNQGYHYNKQNENSIVSTYTP----- 234 Query: 234 NYQRHYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILT--RQRMI 291 + + K+ ++E + + + L + I R + + Sbjct: 235 HLADKFNKLNNMIEDFIGLHKLGEDYILALNSRRVLMLLTIAINEFSNQKISFWQRTKKM 294 Query: 292 AEIFTSGMYKRLITNVRSVKVGYQALLWS 320 I Y + K+ ++ Sbjct: 295 KTILNGQEYTKAFRKFEYGKLSCVYRIFF 323 >UniRef50_A6P2R1 Putative uncharacterized protein n=1 Tax=Bacteroides capillosus ATCC 29799 RepID=A6P2R1_9BACE Length = 294 Score = 266 bits (680), Expect = 1e-69, Method: Composition-based stats. Identities = 64/269 (23%), Positives = 119/269 (44%), Gaps = 9/269 (3%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 + +S+I+P+YN C+ S+ QT E+++++DGSTD S +I + Y Sbjct: 2 ESPAVSIIVPVYNVKPYLEKCLSSIRFQTCRDFEVLLVDDGSTDGSADICRSYVALDSRF 61 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 RLL QANAGAS ARN G+ A G+Y+ FVD DD + E L+ A D+ + Sbjct: 62 RLLTQANAGASAARNLGMTQAAGRYLQFVDGDDWLPTDSVELLLHTAESTGADLTVAHFY 121 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLS--SRRWTHVVWMGVYRRDVIVKNNI 181 E + +R V+T ++ + + + V+W +YRR ++ N++ Sbjct: 122 RVAGE-----RCAARGHIREERVMTRTEYADYMVKAPANFYYGVMWNKLYRRAIVEANHL 176 Query: 182 KFIAGLHH-QDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRH-Y 239 + A + +D ++ +++ + +Y Y S+ K + +R + Sbjct: 177 ECPADVAWCEDFLFNLDYIACCRLIAAVPRPVYYYRKREGSLVSTQATLRKTIQTKRLTF 236 Query: 240 IKITRLLEKLNRNYADKIMIYPEFHQQIT 268 + ++L+ K+ +Y Sbjct: 237 NEYKEFYQQLDLYERRKVRVYRYLISSAV 265 >UniRef50_B2UQ55 Glycosyl transferase family 2 n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UQ55_AKKM8 Length = 341 Score = 266 bits (680), Expect = 1e-69, Method: Composition-based stats. Identities = 67/326 (20%), Positives = 132/326 (40%), Gaps = 22/326 (6%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M + +SV++P+YN C+ES+ +QT LEI+ ++D STD S+ I + +AE P Sbjct: 1 MTAPVNISVLVPVYNVEPYLAQCLESICSQTLRELEIVCVDDASTDGSLSILREFAERDP 60 Query: 62 HVRLLHQ--ANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQ 119 V+++ Q N G S +RN + A G+Y+ VD+DD + + E + A D D Sbjct: 61 RVKVV-QAPENGGLSRSRNLAMSHAVGEYLFLVDSDDWLETDLLEEMYRRAKALDADRLA 119 Query: 120 CNADWCFRETGETW-QSIPTDRLRSTGVL--TGPDWLRMGLSSRRWTHVVWMGVYRRDVI 176 C + + + + +P D P+ + + H + RR ++ Sbjct: 120 CGFRYYYESAPDREDRFLPEDMAPPEKGWLPCTPETI------GKIHHGAGGMMIRRSIV 173 Query: 177 VKNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQ 236 K+ I+F G+ +D+ + R ++ Y Y S++ G+ + Sbjct: 174 EKHGIRFPEGVACEDLYFHYAVFPWCRRVCVVSRAAYVYRKRAGSITSGFASGSSLQSL- 232 Query: 237 RHYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAE--- 293 Y+ + L+ K + I E+ + +RK ++ + + Sbjct: 233 -DYLTVAELVLKEWKE----AGILEEYRTAFLKMLVMGVRNIRKYAPHAVQKEVTRKVTD 287 Query: 294 -IFTSGMYKRLITNVRSVKVGYQALL 318 + +Y+ + + + L Sbjct: 288 MLRQENLYRPAEDDACLSRREGKLLK 313 >UniRef50_A6LG23 Glycosyltransferase family 2 n=6 Tax=Bacteroidales RepID=A6LG23_PARD8 Length = 459 Score = 266 bits (680), Expect = 1e-69, Method: Composition-based stats. Identities = 64/306 (20%), Positives = 125/306 (40%), Gaps = 4/306 (1%) Query: 15 YNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLHQANAGAS 74 YN C++SL Q +EII++NDGSTD S EIA YA+ ++++HQ N GAS Sbjct: 16 YNVEKYLTVCIDSLKNQGGLRMEIILVNDGSTDLSGEIADEYAKKEERIKVIHQENGGAS 75 Query: 75 VARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFRETGETWQ 134 ARN G+++AT +Y+AF+D+DD + L A++ DV N C ++ Sbjct: 76 AARNVGLDIATEEYIAFLDSDDWIKDESLSLLYDEAVKHHADVVMGNMWLCHQDGSMDKP 135 Query: 135 SIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGLHHQDIVW 194 +L+G + + + + + + +Y R + K +F G+ H+D +W Sbjct: 136 FKCISNGSIKELLSGKESFIELVKTCFYLPMPFKYIYNRKYLHKIQARFEEGIMHEDELW 195 Query: 195 TTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRLLEKLNRNYA 254 + A R T Y Y + SV + + + + + ++ + Sbjct: 196 CPIVLCQAERMVITNIEFYYYRQNEESVMHTTSLARRLKSLFQVTEGLIKFSDQFVSS-K 254 Query: 255 DKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLITNVRSVKVGY 314 + + ++ I + + D ++ + + ++ + Sbjct: 255 TSVDLRSWWYVNILRLYSKAFTLLPNIKDSSY---IVHKHYLDRFWRDCGQMIPESIQRC 311 Query: 315 QALLWS 320 + + Sbjct: 312 RNYYYK 317 >UniRef50_C5TEG1 Glycosyl transferase family 2 n=1 Tax=Zymomonas mobilis subsp. mobilis ATCC 10988 RepID=C5TEG1_ZYMMO Length = 514 Score = 266 bits (680), Expect = 1e-69, Method: Composition-based stats. Identities = 80/305 (26%), Positives = 125/305 (40%), Gaps = 13/305 (4%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M N K+S I+P YN C+ SL Q+ EII++ D STD ++ A+ YA Sbjct: 1 MNNHEIKISFIVPAYNVKSYLGKCLTSLQQQSLDNFEIIVVEDCSTDGTLSKARQYARKD 60 Query: 61 PHVRLL-HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQ 119 P ++++ H+ N G ARN G+E A GKY+ FVD+DD V P E A D+ Sbjct: 61 PRIQIICHKENQGLGPARNTGMEYARGKYICFVDSDDWVDPGYGEAFYLEAERTSADMVV 120 Query: 120 CNADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKN 179 F + +R + LS VW YRR+ + Sbjct: 121 GTFYAVFSQGARIVSHFVDPAVRYASLPFDAQTCPAVLS---MPTPVWDKCYRREFLKNY 177 Query: 180 NIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHY 239 NI+F A L +DI + E M A R + Y Y + + + N++L R Sbjct: 178 NIRFPA-LIGEDIPFQWEAMTQAKRISVVGEPFYYYRIRD-------TKKNRSLTAGRDI 229 Query: 240 IKITRLLEKLNRNYADKIMIYPEFHQQITYEALR-VCHAVRKEPDILTRQRMIAEIFTSG 298 L ++ + + Y + ++ + H K D L +A++F + Sbjct: 230 FADVFLAQEKALAFLNDSGHYEKLKSIWWERMIKELLHLTDKSGDTLVSDNFVAKVFHTM 289 Query: 299 MYKRL 303 + K L Sbjct: 290 LRKSL 294 >UniRef50_B1R2U2 Glycosyl transferase, group 2 family protein n=4 Tax=Firmicutes RepID=B1R2U2_CLOBU Length = 353 Score = 266 bits (680), Expect = 1e-69, Method: Composition-based stats. Identities = 74/348 (21%), Positives = 148/348 (42%), Gaps = 17/348 (4%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 +S+I+P+YN C++S++ QT+ E+I+++DGSTD S +I + + ++ Sbjct: 1 MIDISIIVPIYNVEKYVEKCVDSILKQTYINFELILVDDGSTDMSGKICDKFKKEDDRIK 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++H+ N G S ARN G+++ATGKY+ F+D+DD + TMYE + A+EDD D+ C+ + Sbjct: 61 VIHKKNGGLSSARNEGVKLATGKYIGFIDSDDYIDKTMYEKMFNKAIEDDSDIVICDMTY 120 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 I T GVL D + L + + +Y+R++ + F Sbjct: 121 NLNGKD-----ISTTIFEDFGVLNREDTILKYLENDYFRSHAQNKLYKRELFEY--VSFP 173 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 G ++D+ + + N + + + LY Y N + + NL + ++ Sbjct: 174 EGKLYEDVATFYKLIHNCNKISFVNEKLYYYNQSNYNSITKKKFNINNLQLILNTEEMKD 233 Query: 245 LL--EKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSG---- 298 K+ + + + L + + L + I Sbjct: 234 FFIRNNYTSEIIRKVDEFYFIQCILVMQMLINSSRLLNIKEFLYLELNIINKIDFKTYNS 293 Query: 299 ----MYKRLITNVRSVKVGYQALLWSFRLWQWRDKTRSHHRITRSAFN 342 YK N ++ +++++ ++++ K S + + N Sbjct: 294 GLYMFYKNKKCNYFRYELLKKSIVFILCIFKYISKKMSINLKIKKGIN 341 >UniRef50_A8SG33 Putative uncharacterized protein n=1 Tax=Faecalibacterium prausnitzii M21/2 RepID=A8SG33_9FIRM Length = 328 Score = 265 bits (679), Expect = 1e-69, Method: Composition-based stats. Identities = 63/343 (18%), Positives = 132/343 (38%), Gaps = 17/343 (4%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M + +S+I+P+YN + C+ SL QT+ +E++++NDGSTD S +I + Y+ Sbjct: 1 MSEQYDLISIIVPVYNVKNYLEQCLNSLANQTYKNIEVLLVNDGSTDGSDKICEKYSLKD 60 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 + + N+G + ARN GI+ + GKY+AFVD+DD V L T+ + +++ Sbjct: 61 SRFKKFDKPNSGPADARNLGIKNSHGKYIAFVDSDDYVDHGFILELYTILQKYKANISIA 120 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN 180 + T + +GV++ L L+ + V ++ +++ N Sbjct: 121 GYTVFYENTN----FRSSFSYDRSGVISAEAVLSSILTDETIANFVCNKLFEKELF--NR 174 Query: 181 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYI 240 +KF +G ++D+ + + + + T + LY Y S++ + Sbjct: 175 VKFPSGKAYEDVAVFYKLVAASKKLCITSKCLYYYRKRFGSITTSVNPKS---------- 224 Query: 241 KITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMY 300 LE +P ++ + + + E + + Sbjct: 225 -TLDGLEAATARNLYVSSHFPTLKKECINSDFSMLVYSYNQLAKSGKPVSDIECIHNRIS 283 Query: 301 KRLITNVRSVKVGYQALLWSFRLWQWRDKTRSHHRITRSAFNL 343 +L + + + + R+ + R Sbjct: 284 YQLKHHFKELHGNKYRPFGIVISIFPKTFYRTVGLVERVKKGW 326 >UniRef50_B9YA99 Putative uncharacterized protein n=1 Tax=Holdemania filiformis DSM 12042 RepID=B9YA99_9FIRM Length = 319 Score = 265 bits (679), Expect = 1e-69, Method: Composition-based stats. Identities = 65/325 (20%), Positives = 122/325 (37%), Gaps = 11/325 (3%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP-HV 63 K++VI+P+YN C++SL QT+ L+++++NDGSTD+S +I + Y +P Sbjct: 2 EPKITVIVPVYNVEKYLPECLDSLAGQTYKNLQVLVVNDGSTDSSPQIIEDYVRRFPDRF 61 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 L++ N G S ARN I A G+++AF+D+DD P Y + + DV + + Sbjct: 62 AQLNKPNGGLSDARNFAIGKAEGEWIAFLDSDDWADPEYYAKMAEVMAR-GADVVVSDIE 120 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 + + Q + V P ++ S W +YR D + + ++ Sbjct: 121 YFYENG----QPSWIMPGLNPHVFDSP--VKQAFLS---PMFAWNKLYRADFFTRKHYRY 171 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKIT 243 G ++DI T + ++ Y S+ + + + Sbjct: 172 PLGTWYEDIPVTMPIFAQSQTIGIQHEARIHYRQREGSIMSVTKSERLKEIFPVMKQVRD 231 Query: 244 RLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRL 303 + E + Y Y R + ++AE + + Sbjct: 232 QFAEMGLSERYHDELEYLHIEHLRLYGMFRFLRSPLTSELKAMSDAVMAEFYPNWKKNPY 291 Query: 304 ITNVRSVKVGYQALLWSFRLWQWRD 328 I N+ + S L ++ Sbjct: 292 IRNLGWKDQLFLKTFGSLTLPLYKH 316 >UniRef50_C5S6D6 Methyltransferase FkbM family n=1 Tax=Allochromatium vinosum DSM 180 RepID=C5S6D6_CHRVI Length = 711 Score = 265 bits (679), Expect = 2e-69, Method: Composition-based stats. Identities = 85/350 (24%), Positives = 142/350 (40%), Gaps = 26/350 (7%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 + +S+ +P+YN R C++S++TQ +T LEII +NDGSTD S I YA P +R Sbjct: 3 SPTVSIALPVYNVAPYLRACLDSIVTQDFTDLEIICVNDGSTDESPAILAEYARRDPRIR 62 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++ Q N G + RN + G+Y+ VD+DD + P L+ D Sbjct: 63 IVDQTNRGLASTRNTALAHVRGRYLLHVDSDDLLRPGAVSRLVEEIERTGADFIVFQHLM 122 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 E + + L L L + H W +R D + + + +F Sbjct: 123 IEDE----REFDNWYYDFDSWSLEPKTRLEQKLEILHY-HYTWSKFFRMDFLNQYDFRFP 177 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVS-----------RLHRQGNKNL 233 GL D V+ + + A + + LYKY + S+ R+ R+G Sbjct: 178 DGLQFDDNVYHWQVLLAAQKVVMLPEKLYKYRMRAGSIMWHRGRHHLDIFRIFREGIALF 237 Query: 234 NYQRHYIKITR--LLEKLNRNYADKIMIYPEFHQQITYEALRVC-----HAVRKEPDILT 286 Q Y + + + K++ Y I P F L VR P +L Sbjct: 238 REQGVYEAVFQDFMQYKMSLQYFWTRDIEPRFRYLAYRGILDGLSADEERFVRDYPHLLD 297 Query: 287 RQ--RMIAEIFTSGMYKRLITNVRSVKVGYQALLWSFRLWQWRDKTRSHH 334 + R +F S +Y++ ++++ + L S R+ + RDK R+ Sbjct: 298 PEAIRFYERVFASPLYRQGAHRALALQLTARRYLSSGRV-RLRDKARALR 346 >UniRef50_D2RJ85 Glycosyl transferase family 2 n=1 Tax=Acidaminococcus fermentans DSM 20731 RepID=D2RJ85_ACIFE Length = 327 Score = 265 bits (678), Expect = 2e-69, Method: Composition-based stats. Identities = 67/330 (20%), Positives = 125/330 (37%), Gaps = 11/330 (3%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 N +S+I+ +YN C++S++ QT+ LEII+++DGS DNS +I +A + Sbjct: 2 EKNLISIIVAVYNVEKYLNKCIKSIVNQTYHNLEIILVDDGSKDNSGKICDEWAGKDSRI 61 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 ++H+ N G S ARN G++ + G+++ FVD DD + MYETL + + C Sbjct: 62 HVIHKQNGGVSDARNYGLKASNGEWICFVDGDDYISLNMYETLYKNRSKQG--ITVCGYY 119 Query: 124 WCFRETGETWQSIPTDRLRSTG---VLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN 180 I LT + + + W +Y R V Sbjct: 120 VVENGKEIPVPGINKKLAPKEAANLYLTNERQSVLNSNFTYFGSYPWNKLYDRSVFQ--G 177 Query: 181 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYI 240 +++ +D+ E + + R+ Y Y S++ Q + ++ + Sbjct: 178 VEYPEHKKFEDMYIMLELLHQSYEIRFIPDCKYYYIQRRGSITH---QKSIQVDSLDARL 234 Query: 241 KITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMY 300 K + LE+ KI + + ++RM A S Y Sbjct: 235 KQKKELEEFWDIKDKKIDELIALSYIAILRKYAINPYSERIKYTSLKKRMWAS-LNSIGY 293 Query: 301 KRLITNVRSVKVGYQALLWSFRLWQWRDKT 330 ++ + + L+ ++K Sbjct: 294 DDFPLKIKLKLLLMHIPDLYYVLYSLKEKL 323 >UniRef50_C4ZHK7 Glycosyltransferase Family 2 modular protein n=1 Tax=Eubacterium rectale ATCC 33656 RepID=C4ZHK7_EUBR3 Length = 881 Score = 265 bits (678), Expect = 2e-69, Method: Composition-based stats. Identities = 75/346 (21%), Positives = 148/346 (42%), Gaps = 10/346 (2%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M K+S+I+P+YNA + C+ S+ +QT +EII ++DGSTD+S EI +AE Sbjct: 9 MSTQNVKVSIIVPIYNAEKYLKQCLGSIQSQTLKDIEIICVDDGSTDSSPEIMDSFAEKD 68 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 +++H+ N G + N G+ VATG+Y+ V+ADD + M+E L + + +D+ + Sbjct: 69 SRFKVVHKPNGGNGHSMNAGLSVATGEYIGCVEADDYIEKNMFEKLYLYSNDGSIDIVKS 128 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN 180 N C+ + + + + L P ++ +W +Y+R +I +N+ Sbjct: 129 NFWNCYEDESGKITKVVNEERNNMPELALPFTIKEYPQILWGHPSIWSAIYKRSLIEENH 188 Query: 181 IKFIA--GLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRH 238 IKF G D + E M A +T++ Y Y S N + + Sbjct: 189 IKFKEAKGGGWVDNPFFFETMCCAKSIYWTKRPFYNYRTEVEGSSSKGYDFNIPFDRMQD 248 Query: 239 YIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPD--ILTRQRMIAEIFT 296 + + + N D+ + + + + Y + + + + ++ R + E Sbjct: 249 NLDVLKRY-----NCQDEETLKLAYARALMYLCGAINESDYEGKEQLFYSKAREMLEKIN 303 Query: 297 SGMYKRLITNVRSVKVGYQALLWSFRLWQWRDKTRSHHRITRSAFN 342 + + + K+ + L++ R K ++ + N Sbjct: 304 PMVIRDDFNDNDK-KIFLKFKSPIKNLYKDRKKVLIYNWVPFDNPN 348 >UniRef50_B5G7W4 Glycosyltransferase n=3 Tax=Streptomyces RepID=B5G7W4_9ACTO Length = 1271 Score = 265 bits (677), Expect = 2e-69, Method: Composition-based stats. Identities = 64/252 (25%), Positives = 110/252 (43%), Gaps = 11/252 (4%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M +LSVI+P+Y+ C++SL QT +EI++++DGS D S IA+ YA Sbjct: 1 MSAVPPRLSVIVPVYDVETYLPACLDSLAAQTLREIEIVVVDDGSPDGSALIAEEYARRD 60 Query: 61 PHVRLLHQANAGASVARNRGIEVAT--GKYVAFVDADDEVYPTMYETLMTMALEDDLDVA 118 P RL+ Q N G ARN G++ A +++AFVD+DD V Y ++ D A Sbjct: 61 PRFRLVRQENQGLGAARNTGVDAAHPAAEFLAFVDSDDMVPADAYRLMLASLDASGSDFA 120 Query: 119 QCNADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVK 178 N ++ WQS L + + L + R V+RRD + Sbjct: 121 TGNVRHI--DSRRVWQSPMHRFLGAGAERRTHVTRKRKLLTDRI---ACNKVFRRDFWTR 175 Query: 179 NNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNT----SVSRLHRQGNKNLN 234 + ++F G+ ++D F A + +Y + + S+++ + + Sbjct: 176 HGLRFPVGVLYEDTPVIVPAHFLAEAVDIVSEPVYYWRMREGEAGPSITQRRTEPKAVRD 235 Query: 235 YQRHYIKITRLL 246 +++R L Sbjct: 236 RAAAVTEVSRFL 247 >UniRef50_A6BH38 Putative uncharacterized protein n=1 Tax=Dorea longicatena DSM 13814 RepID=A6BH38_9FIRM Length = 313 Score = 265 bits (677), Expect = 2e-69, Method: Composition-based stats. Identities = 71/304 (23%), Positives = 134/304 (44%), Gaps = 12/304 (3%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K+SVI+P++N + C++S+I QT+ LE+I+++DGSTD S + + ++++ Sbjct: 5 KISVIVPVFNVENYLEKCVKSIIEQTYKNLEVILVDDGSTDKSGFLCDELKKQDYRIKVI 64 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 H+ N G S ARN GI+V+TGKY++F+D+DD + T E +M L ++A CN + Sbjct: 65 HKTNGGLSDARNAGIQVSTGKYLSFIDSDDYLERTALEQMMQAILISHSEIAICNIMRFY 124 Query: 127 RETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAG 186 + PTD + D L V ++ + NI F G Sbjct: 125 DDGCTEEFYNPTDNQEVLEGIRRFDTLNQ--------PSVCNKLFDAKLFA--NISFPYG 174 Query: 187 LHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRLL 246 ++D E ++ A + T ++ Y Y S+ + N+ ++ L Sbjct: 175 KFYEDTYVYHELLYKADKVVLTGKTGYWYLSRKGSILGRSQFSNRYFDFIEAVYMRADFL 234 Query: 247 EKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLITN 306 K + + + + + H + K+ D + +++ YK+ N Sbjct: 235 IKRDIQ-PYGDEAFLSLYAALANAEKNIDHKLNKD-DFHKVREYYRQVYNQIDYKKARFN 292 Query: 307 VRSV 310 ++ + Sbjct: 293 IKQI 296 >UniRef50_A8U9H2 Glycosyl transferase, family 2/glycosyl transferase family 8 n=2 Tax=Carnobacterium sp. AT7 RepID=A8U9H2_9LACT Length = 352 Score = 265 bits (677), Expect = 3e-69, Method: Composition-based stats. Identities = 74/355 (20%), Positives = 138/355 (38%), Gaps = 20/355 (5%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 K+S+I+P+YN + ++ ++QT+ EII++NDGSTD S EI Y P + Sbjct: 1 MKKISIIMPIYNVAEQLEKAIQCALSQTYKNSEIILVNDGSTDGSGEICDRYQLKDPRIS 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++HQ NAG+ ARN G++ ATG+Y+ F D DD + E + A+E + DV C Sbjct: 61 VIHQKNAGSGFARNAGLDRATGEYIYFADPDDYFEYNLIEETVAKAIESNADVVVCGYYN 120 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVV-WMGVYRRDVIVKNNIKF 183 + ++ G+LT + + W +Y+ D + ++N +F Sbjct: 121 EKVDKEGNITTL-EKNSELDGLLTKEAFRDQFRKHYSLSPYALWNKLYKHDFLKQHNCRF 179 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKIT 243 QD ++ + N + +++ Y Y S +R+ Y Sbjct: 180 TNQKVGQDALFNQQVGKNVQTIYFHQKAYYHYVFREGSAVNRYRKERFFYEYTIA-SHFE 238 Query: 244 RLLEKLNRNYADKIMIYPEFHQQITYEALRVCHA------------VRKEPDILTRQRMI 291 + + ++ ++ + E + + + D + I Sbjct: 239 DWMVHWGKEKEYHDLVNQQYWGALYLELSNLAWEDCPLTKMGKVERIEELMDDSKINQAI 298 Query: 292 AEIFTSGMYKRLITNVRSVKVGYQALLWSFRLWQWR----DKTRSHHRITRSAFN 342 +EI + K V + + Y + L Q R K + + + F Sbjct: 299 SEI-ETEKEKNSFVKVLILLLRYDKYAGALNLMQLRVTVGKKFQKSFNVIKKKFG 352 >UniRef50_B9YAB2 Putative uncharacterized protein n=1 Tax=Holdemania filiformis DSM 12042 RepID=B9YAB2_9FIRM Length = 320 Score = 264 bits (676), Expect = 3e-69, Method: Composition-based stats. Identities = 72/334 (21%), Positives = 136/334 (40%), Gaps = 15/334 (4%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 LSVI+P+YN C++SL+ QT +EII+INDG+ DNS +I YA YP +R Sbjct: 2 KPILSVIVPVYNVEKYLPRCLDSLVNQTLKNMEIIVINDGTKDNSQQIIDDYASRYPQIR 61 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 + N G + RN G+ G+Y F+D+DD M+E ++ A +++ DV N +W Sbjct: 62 PFKKPNGGIAETRNFGLAQVQGEYFGFLDSDDFTELDMFEKMVAAAEKENADVVVSNFNW 121 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 + E + P G + + + +W +YR D + + ++F Sbjct: 122 VYGEQKRLEKEGPYAP--------GKEMMIHLFA------TLWNKIYRTDFVRETGLQFP 167 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 G ++D + + + + +++ Y S++ + K++ + I Sbjct: 168 TGYRYEDACFLYCLAPHVKKLAFVDEAFVSYIQLGNSITHTNNHEVKDMIHVFEIITDYF 227 Query: 245 LLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLI 304 + ++ Y D++ Y + LR ++ T Q + S + Sbjct: 228 KAQGWDQEYHDELE-YLHIKFFLGNSFLRSTKIADRQDRKTTIQMGWERLNQSFPQWKKN 286 Query: 305 TNVRSVKVGYQALLWSFRLWQWRDKTRSHHRITR 338 + S+ + W + HR + Sbjct: 287 PYLTSLGGMKNRYFKTVSNWNIMLYSELFHRFKK 320 >UniRef50_C9RN10 Glycosyl transferase family 2 n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RN10_FIBSS Length = 318 Score = 264 bits (675), Expect = 4e-69, Method: Composition-based stats. Identities = 67/323 (20%), Positives = 128/323 (39%), Gaps = 13/323 (4%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 K+SVI+P+YN + C+ESL+ Q+++ E+I+++DGSTD S I +YA ++ Sbjct: 1 MPKISVIVPVYNVEKYIKECIESLLNQSFSDFELILVDDGSTDKSASICDFYASKDSRIK 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 + H+ N G S ARN GIE + G+++AFVD+DD V E L A D+ C Sbjct: 61 VFHKQNGGVSSARNLGIEKSVGEWIAFVDSDDYVDAKYLENLYKSA--YSCDLISCGFFV 118 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 +T ++ + +++ W + + +I N++KF Sbjct: 119 TNEFGEKTNSNVNPSGFFYQKD------IADVVNNDWIVTSPWAHLIKTSLIKDNDVKFY 172 Query: 185 AGL-HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKIT 243 +D ++ + A R + LY Y S+ K Y Y+ Sbjct: 173 ENRSMGEDTIFVLSCLDKANCIRNIPELLYYYRTTPNSLLH--PDERKYGKYIEDYMWSN 230 Query: 244 RLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRL 303 ++ + Y + + + K+ +++ +I + L Sbjct: 231 NMVYSMKNGTFYF--YYKRKCHALMTILYKKDMPLVKKISNESQKASGKKILCYWLNMSL 288 Query: 304 ITNVRSVKVGYQALLWSFRLWQW 326 + + + + + F + Sbjct: 289 KSKILMSYILFPKFIQFFVMIFL 311 >UniRef50_C2LRT9 Glycosyl transferase, family 2/glycosyl transferase family 8 n=1 Tax=Streptococcus salivarius SK126 RepID=C2LRT9_STRSL Length = 293 Score = 264 bits (675), Expect = 4e-69, Method: Composition-based stats. Identities = 73/297 (24%), Positives = 125/297 (42%), Gaps = 9/297 (3%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M + +S++IP+YN + C+E+++ QT+ LEII++NDGS D S EI K + + Sbjct: 1 MENKEVVSIVIPIYNVEAYLKQCLETIVNQTYPNLEIILVNDGSPDKSEEICKEFFKRDS 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 +R + Q N G S ARN GI++ATG Y+ FVD DD V ETL T + + DV+ N Sbjct: 61 RIRYVRQVNGGLSAARNTGIDLATGDYITFVDPDDWVTDDYVETLYTQLKKYEADVSIAN 120 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWM----GVYRRDVIV 177 + T + + T+ S + G + + + W +Y+ + Sbjct: 121 YNLFNESTSKFLIKV-TENDYSETLYEGREIIDQDAIQETRD-MAWACAMMKLYKISLFE 178 Query: 178 KNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQR 237 +++F G + +D + + A R +TE+ +Y Y + + L+ Sbjct: 179 --DLRFPVGKNVEDNFLMYKLLLKANRVVHTEKCIYWYRVGRKDTLSQVWTEKRVLDEME 236 Query: 238 HYIKITRLLEKLNRNYADKIMIYP-EFHQQITYEALRVCHAVRKEPDILTRQRMIAE 293 + LL L + IY + I R + + Sbjct: 237 AKNEKLALLGMLGYDLTWHRYIYKTRLKRAIEKMEEANLQDTETYKTAQVNLRFLEQ 293 >UniRef50_B7BDW0 Putative uncharacterized protein n=1 Tax=Parabacteroides johnsonii DSM 18315 RepID=B7BDW0_9PORP Length = 318 Score = 264 bits (675), Expect = 4e-69, Method: Composition-based stats. Identities = 65/327 (19%), Positives = 134/327 (40%), Gaps = 14/327 (4%) Query: 2 MNST--NKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAEN 59 MN ++S+I+P Y TC+ES++ Q++ E+I+++DGS D S +I YA+ Sbjct: 1 MNKNLLPRVSIIVPCYKVEQYLPTCIESVLHQSYDNWELILVDDGSPDKSGKICDEYAKE 60 Query: 60 YPHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQ 119 ++++H+ N G + ARN I++A G+Y++F+D DD ++ + L+++AL+ + Q Sbjct: 61 DNRIKVIHKTNGGVAAARNVAIDLAEGEYISFLDGDDFLHVDYVQDLISLALKHQAGIVQ 120 Query: 120 CNADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKN 179 CN G + S V +S +VW +Y+ +++ Sbjct: 121 CNYVR-----GNDRIFPNVTKDLSVKVYNSHSI----FTSDAAKIIVWGKLYKTNIVKNF 171 Query: 180 NIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHY 239 I G +D T + ++A + T + Y Y + S H++ NL++ Y Sbjct: 172 KI--PEGRFFEDDWITWRWYYSAKKIVVTSRPYYYYAYNEMSTMTQHKKRP-NLSFIDAY 228 Query: 240 IKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGM 299 + ++ + + + +K+ + EI S + Sbjct: 229 NERITFFKQTAERDLEDCSYRQLCKALLLSYMNPMATKEQKQMILFKFSESWNEIRHSSV 288 Query: 300 YKRLITNVRSVKVGYQALLWSFRLWQW 326 + + + ++ Sbjct: 289 VSFKLKFSMGLFNIFPNIVTKLANIFL 315 >UniRef50_Q8XN54 Capsular polysaccharide biosynthsis protein n=1 Tax=Clostridium perfringens RepID=Q8XN54_CLOPE Length = 330 Score = 263 bits (674), Expect = 6e-69, Method: Composition-based stats. Identities = 69/335 (20%), Positives = 133/335 (39%), Gaps = 8/335 (2%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 ++S+I+P+YN C++S++ Q++T E+I++NDGS D S I Y +R Sbjct: 2 KPEISIIVPIYNVEKYLSKCIDSILNQSFTNFELILVNDGSNDRSGIICDEYTLKDERIR 61 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++H+ N G S ARN GI +A GKY+ FVD DD + MYE L + + D++ C Sbjct: 62 VIHKKNGGVSSARNSGISLALGKYIGFVDPDDYIKEEMYEILYNLCKKYKTDISICKLGR 121 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 T I + L + L+ + + +++R+ + F Sbjct: 122 EINGT-----LINNEENGEIIELDNVNGLKELFKGILYRFSLCNKLFKRECFR--GVIFP 174 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 G H+D+ T NA ++ Y + Y Y S+ + + + Sbjct: 175 EGRIHEDLSTTYRLFANAKKSIYFDYVGYIYVKRENSILTTTYNEKRLQAFIGWNEILEF 234 Query: 245 LLEKLNRNYADKIMIY-PEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRL 303 ++ K + + + Y +V K + + + + F+ + ++ Sbjct: 235 MINKYPMLEDNVVACFGYSCIDHSNYILNQVIGFNNKIRLLNFIRYEVKQRFSLLVKNKI 294 Query: 304 ITNVRSVKVGYQALLWSFRLWQWRDKTRSHHRITR 338 + N + + L F + + + R Sbjct: 295 LINKLKLILILLMLNSRFLIISKYKGKGVYRKRIR 329 >UniRef50_B9Y890 Putative uncharacterized protein n=1 Tax=Holdemania filiformis DSM 12042 RepID=B9Y890_9FIRM Length = 355 Score = 263 bits (674), Expect = 6e-69, Method: Composition-based stats. Identities = 63/317 (19%), Positives = 127/317 (40%), Gaps = 2/317 (0%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 K+SVI P+Y + R C++S++ Q++ E+II++DGS DN IA Y + V+ Sbjct: 1 MCKISVITPVYKVENYLRKCIDSILNQSFDDFELIIVDDGSPDNCGAIADDYKDKDSRVK 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++H+ N GA ARN GI++A GKY+ F D+DD + E L +A+ +D ++ Sbjct: 61 VIHKENGGAPSARNEGIKIAKGKYLYFPDSDDWIEENYLEELYKLAVSNDANLVISGYTM 120 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 + E + + + V W +Y+ I N + F Sbjct: 121 EYYENNKKSSYKVSVENKVYLSREDVVNNLHLYFDNMMMAVPWNKLYKTSYIKSNELFFP 180 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYK-YYLHNTSVSRLHRQGNKNLNYQRHYIKIT 243 L D+ + E + N + + S Y + S + G + + I Sbjct: 181 -NLKWDDLHFNMEVIMNINKVVISSNSGYHFFRSREGSETTTVFDGLLYQKRREQFEHIL 239 Query: 244 RLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRL 303 ++ N + + ++ + ++ + +I + F + ++ Sbjct: 240 KVYRIWNVENDNIMGSIYGYYASRLVQCIQEISISNVSDKRSLIRNIIYDQFNNYAIEKG 299 Query: 304 ITNVRSVKVGYQALLWS 320 N + +++ + + Sbjct: 300 KINSKLLRLAMYPMKYK 316 >UniRef50_Q4JZ78 Putative glycosyl transferase n=1 Tax=Streptococcus pneumoniae RepID=Q4JZ78_STRPN Length = 321 Score = 263 bits (673), Expect = 6e-69, Method: Composition-based stats. Identities = 72/327 (22%), Positives = 139/327 (42%), Gaps = 10/327 (3%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K+S+++P+YN D + C++SLI Q++ +EII+++DGSTD+S I YA+ +R+L Sbjct: 3 KVSIVVPVYNVEDYLKYCIDSLINQSYKDIEIILVDDGSTDDSGRICDKYAQEDDRIRVL 62 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 H N G S ARN G++ A+ +++ F+D+DD E L+ + + +D+ + Sbjct: 63 HIENGGLSNARNTGVKFASAEWIIFIDSDDYYDRRTVEYLVQLQKKYAVDLVATSVIEVR 122 Query: 127 RETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAG 186 E + T+ + L L+ +Y++++++K F G Sbjct: 123 DFQSEDFLGSLTNI--DSLKLDRYTALKEMFYGNIVGTHPGGKLYKKEILMK--FPFPEG 178 Query: 187 LHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRLL 246 + ++D+ + E + +LYKYY S+ + Y I Sbjct: 179 MLYEDLAVSFEHIGACNEIAVGCINLYKYYRRPGSIVNSSYSDK----FLDFYKAIEMNR 234 Query: 247 EKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLITN 306 E + R+Y + + + ++ L V HA+ ++ E + + +I N Sbjct: 235 EFVQRDYPNDKEMNKALTVRYVFKGLHVVHAMLSSEMYSDVNKIRKE--YTRYFNDIIPN 292 Query: 307 VRSVKVGYQALLWSFRLWQWRDKTRSH 333 L + +K R Sbjct: 293 PNITGKNKVKYLMLLVSPKLYEKIREK 319 >UniRef50_C3QCN3 Glycoside transferase family 2 n=6 Tax=Bacteroides RepID=C3QCN3_9BACE Length = 322 Score = 263 bits (673), Expect = 7e-69, Method: Composition-based stats. Identities = 74/319 (23%), Positives = 136/319 (42%), Gaps = 4/319 (1%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M + K+SV+IP+YN R +ES++ Q+ LEII+++DGSTD S+ I + + Sbjct: 1 MKYNEIKVSVVIPVYNTEKYVRQAVESVMYQSLKELEIIVVDDGSTDKSLSIVEKLGDTD 60 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 +++ QAN G S+ARNRGI A G+Y+ F+D+DD + E E+ LD Sbjct: 61 KRIQIYTQANQGQSIARNRGISHAHGEYIYFMDSDDLLEEDALELCYHKCKEEKLDFVFF 120 Query: 121 NADWCFRETGETWQSIPTDRLR--STGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVK 178 +A F E ++ + TG + L + L ++ +T V + R+VI K Sbjct: 121 DALVFFENNVENAPTLNYKHTEKLEDKIYTGQEALEIQLQNKEYTPSVCLHFIHRNVIEK 180 Query: 179 NNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRH 238 +N+ F + H+D ++TT + +A +++ + + S Y Sbjct: 181 HNLSFYPEIVHEDQLFTTLLYLQSTKAACIKRTFFHRRMRKDSTMTSKFAMRNIKGYLVV 240 Query: 239 YIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSG 298 +I + + I + Q+ A+ H+++ I + + + Sbjct: 241 TEEILSFRRQTTEDKNK--EIIDLYLSQMLDAAMWQAHSLKLPERIKLARLCLQKYKKYV 298 Query: 299 MYKRLITNVRSVKVGYQAL 317 K + + Sbjct: 299 STKTIGALLFKFLFINHKK 317 >UniRef50_C3PZ19 Glycosyl transferase n=1 Tax=Bacteroides sp. 9_1_42FAA RepID=C3PZ19_9BACE Length = 326 Score = 263 bits (673), Expect = 8e-69, Method: Composition-based stats. Identities = 74/327 (22%), Positives = 138/327 (42%), Gaps = 5/327 (1%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S+IIP+YN C++S+I Q++T E I+++DGSTD S EI +AE + ++H Sbjct: 2 ISIIIPVYNVKLYLDNCIQSVIQQSYTDFECILVDDGSTDGSSEICDQWAEKDNRIIIVH 61 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 Q N G S ARN+G+E A G+Y+ F+D+DD V ++ E + D+ C F+ Sbjct: 62 QPNGGVSSARNKGLEQAKGEYICFIDSDDWVDVDYLSAMINNLKEKETDLIICGLTQEFK 121 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 + G IP L ++ M + +Y+ ++I + I F + Sbjct: 122 D-GNCIDYIPQQSLNFELGCNHTNYFVMLNEQNLLYGPIV-KLYKNNIIQHHQILFDPQV 179 Query: 188 HH-QDIVWTTEFMFNALRARYTEQSLYKYYL-HNTSVSRLHRQGNKNLNYQRHYIKITRL 245 + +D+++ +++ A +S Y Y + + ++S R + +YQ+ I + Sbjct: 180 SYGEDLLFNYQYLEYAHTIACINESHYHYRILGSGTLSSKFRPKQFDTDYQQWKILQSFY 239 Query: 246 LEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLIT 305 + + +Y + K+ L I EI + + + Sbjct: 240 QRHHLWSSTAQTYLYKRLWGILYDGIFLYPRLQNKKISYLRTILAIPEIKKLKDFTQCFS 299 Query: 306 NVRSVKVGY-QALLWSFRLWQWRDKTR 331 +K L F ++ R Sbjct: 300 CSNWIKQSILYRLYPIFYIYFSIIAKR 326 >UniRef50_C2FKR8 Glycosyltransferase involved in cell wall biogenesis n=1 Tax=Lactobacillus plantarum subsp. plantarum ATCC 14917 RepID=C2FKR8_LACPL Length = 354 Score = 263 bits (672), Expect = 9e-69, Method: Composition-based stats. Identities = 65/322 (20%), Positives = 127/322 (39%), Gaps = 4/322 (1%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 K++VI P+YN + + S++ QT +E+I+++DGS+D S I +A V Sbjct: 2 KPKITVIAPVYNVEEYLEKSVSSILEQTLKDIEVILVDDGSSDASGAIIGDFAAKDNRVI 61 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 LHQ NAGA+ ARN GI+ A GKY+ F+D DD + TM + + A E L + Sbjct: 62 GLHQVNAGAAAARNTGIKRAQGKYLYFLDPDDWLEKTMLQKMYDFAEEHQLQLTITGFVN 121 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 + E + + R + + V W +Y+ + ++++F Sbjct: 122 EYYEKEKAFSVKNVMSKRIFATQNDFRLHAHEYLNNTFLAVPWNKLYKTSYLKNHHLEFP 181 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYK-YYLHNTSVSRLHRQGNKNLNYQRHYIKIT 243 + D+ + E + N R + + Y + S + N + + I Sbjct: 182 -QVKWDDLHFNMEVIRNIERVGVLDIAEYHFFRSRPGSETTKVFNEKLFANRIQQFKHIL 240 Query: 244 RLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRL 303 + + + +M ++ + + I ++ + +I + + + Sbjct: 241 AVYDDWKISDNVILMYVYNYYALRMIQLTQ--EIADNNMHISKKKSYVKKILSDPLTQSA 298 Query: 304 ITNVRSVKVGYQALLWSFRLWQ 325 +S K + +RL Sbjct: 299 FKKGKSNKGLIGVFMIPYRLRM 320 >UniRef50_C2EYX7 Glycosyltransferase n=2 Tax=Lactobacillus reuteri RepID=C2EYX7_LACRE Length = 326 Score = 262 bits (671), Expect = 1e-68, Method: Composition-based stats. Identities = 69/327 (21%), Positives = 127/327 (38%), Gaps = 7/327 (2%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M + +S+I+P++N C+ S+ QT+ E+++I+DGS DNS EI + + Sbjct: 1 MKEKSYNVSIIVPIFNVEKWLTRCLNSICKQTYGEFEVLLIDDGSIDNSGEICQTFVSKD 60 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 + + N G S RN GI A GKY FVD+DD + P E L + ++++D C Sbjct: 61 NRFKYYKKKNGGLSDTRNYGINRAKGKYFIFVDSDDYLTPDYVEMLYSGIQKNNVDAVIC 120 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTH---VVWMGVYRRDVIV 177 + + + L+G + + + ++ W +Y R + Sbjct: 121 GISRVDEDGK--ILNNNCFNVTDKTFLSGKEIIELSFDNKSDGGDLVCAWNKIYDRKLFD 178 Query: 178 KNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQR 237 NN++F G +++D +A + LY Y N S+ K + Sbjct: 179 -NNLRFEKGRYYEDGYIFPNLFLRVNKAMIIHRPLYNYVQRNNSIIHSSMNIKKIKDDDA 237 Query: 238 HYIKITRLLEKLNRN-YADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFT 296 + L + +++ Y I Y + YE ++ + + R+ A+ Sbjct: 238 SMLSWIDLFKNVDKKLYILSIQKYKNWIIYKWYENRKIILNNNMGNYLQKQYRLYAKEKY 297 Query: 297 SGMYKRLITNVRSVKVGYQALLWSFRL 323 K I + S+ L Sbjct: 298 PMSIKSKIKDTISIINLDIVYYLKIIL 324 >UniRef50_C5NWI7 Glycosyl transferase, group 2 family n=1 Tax=Gemella haemolysans ATCC 10379 RepID=C5NWI7_9BACL Length = 328 Score = 262 bits (671), Expect = 1e-68, Method: Composition-based stats. Identities = 72/326 (22%), Positives = 131/326 (40%), Gaps = 8/326 (2%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +SVI+P+YN D M+SL QT+ EII++NDGSTDNS ++ Y+E Y +VR H Sbjct: 2 ISVILPVYNVEDYLHYAMDSLEKQTYKNFEIILVNDGSTDNSGKLCDEYSEKYSNVRAFH 61 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 + N G S ARN GI+ A G+++ F+D DD + E L+ + + D D+ Sbjct: 62 KENGGLSDARNFGIQKAKGEFITFLDPDDYLEAYSLELLVGIQEKHDCDIVSTRVKATES 121 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 + S+ + + +L + T +YR+ ++ I F G Sbjct: 122 YNVYSDYSLVEKDFENVTHMDNDVFLEEAFYDKVVTVSACGKLYRKSILE---IPFPKGK 178 Query: 188 HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRLLE 247 ++D+ ++ + A + +T +Y YY S+ Q + + Sbjct: 179 IYEDLFVISDHVTRAQKIVHTSLQIYNYYRRPGSIVNSKFSKK-----QYDFFEAIDKNR 233 Query: 248 KLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLITNV 307 + + ++ + + P + + A ++ + K I ++ Sbjct: 234 EFAKKFSKEKDLLPAINIKEVVGAFKLLTTLDKFKKIDIKKIRRIIKMDFKYVVFCKKIN 293 Query: 308 RSVKVGYQALLWSFRLWQWRDKTRSH 333 K Y L S L+ + Sbjct: 294 AKFKFKYILFLLSPGLFFSIKEFYKK 319 >UniRef50_B2IQI2 Glycosyl transferase, family 2 n=31 Tax=Firmicutes RepID=B2IQI2_STRPS Length = 328 Score = 262 bits (670), Expect = 1e-68, Method: Composition-based stats. Identities = 67/330 (20%), Positives = 131/330 (39%), Gaps = 7/330 (2%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 +SVI+P+YN + S+ QT+ LEII+++DG+TD S + AE V +L Sbjct: 5 LISVIVPVYNVAQYLEKSIASIQKQTYQNLEIILVDDGATDESGRLCDSIAEQDDRVSVL 64 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 H+ N G S ARN G++ A G Y+ F+D+DD ++P M ++L +++D DV+ C + Sbjct: 65 HKKNEGLSQARNDGMKQAHGDYLIFIDSDDYIHPEMIQSLYEQIMQEDADVSSCGVMNVY 124 Query: 127 RETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAG 186 + D V +LR L + + + +R++ + F G Sbjct: 125 ANDESPQSANQDDYF----VCDSQRFLREYLIGEKIPGTICNKLIKREIATA--LSFPKG 178 Query: 187 LHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRLL 246 L ++D + + + A + + Y Y+ S++ K+L Y Y K + Sbjct: 179 LIYEDAYYHFDLIKLAKKYVVNTKPYYYYFHRGDSITTKPY-AEKDLAYIDIYQKFYNEV 237 Query: 247 EKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLITN 306 K + + + + + + ++ R + + + Sbjct: 238 VKNYPDLKEVAFFRLAYAHFFILDKMLLDDQYKQFEAYSQIHRFLKGHAFAISRNPIFRK 297 Query: 307 VRSVKVGYQALLWSFRLWQWRDKTRSHHRI 336 R + + S + ++ Sbjct: 298 GRRISALALFINISLYRFLLLKNIEKSKKL 327 >UniRef50_B0P2K9 Putative uncharacterized protein n=1 Tax=Clostridium sp. SS2/1 RepID=B0P2K9_9CLOT Length = 327 Score = 262 bits (670), Expect = 1e-68, Method: Composition-based stats. Identities = 69/333 (20%), Positives = 128/333 (38%), Gaps = 9/333 (2%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K+SVIIP+YN + C+ES++ QT+ LEI++INDG+TD+S +I + YA+ ++L+ Sbjct: 2 KISVIIPVYNTEKYLKECVESVLAQTYHNLEILLINDGATDSSPQICESYAKQDARIKLI 61 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEV-YPTMYETLMTMALEDDLDVAQCNADWC 125 H+ N G S RN GI+ +G YV F+D+DD P M E L + +D+ Sbjct: 62 HKENGGLSDTRNMGIKQCSGDYVLFLDSDDYWDDPKMVEKLADQMQQYPVDILNFRYKKY 121 Query: 126 FRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIA 185 +T + + + + L ++ + + + N + F Sbjct: 122 MEDTKQFVPCLKS--VEDIKQQEKDQILERLINEGLYISSACNKLISASFLKTNELYFEK 179 Query: 186 GLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRL 245 G+ +DI W + Y Y Y + S++ N + Sbjct: 180 GITSEDIDWCARMLIQCDSIGYNNTEAYVYRQRSESITHTISYKN-----IYDLSNNVKK 234 Query: 246 LEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLIT 305 + + Y +H + Y+ + + R + + + + Sbjct: 235 CVRFGQVIPKGTKFYELYHDFVAYQYGTLLLSNTLVDGDKDRIKAVMKQMKDYSWLLTYH 294 Query: 306 NVRSVKVGYQALLWSFRLWQWRDKTRSHHRITR 338 +K+ Y R + + +I R Sbjct: 295 VNPKIKLQYYVKKVVGYNNLIR-FLKLYAKIKR 326 >UniRef50_Q047X3 Glycosyltransferase related enzyme n=1 Tax=Lactobacillus delbrueckii subsp. bulgaricus ATCC BAA-365 RepID=Q047X3_LACDB Length = 332 Score = 262 bits (670), Expect = 1e-68, Method: Composition-based stats. Identities = 70/340 (20%), Positives = 131/340 (38%), Gaps = 14/340 (4%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M +SVI+P+Y D C++S++ QT+ LEII+I+DGS DN ++ +A+ Sbjct: 1 MNTELPLVSVIVPVYKVEDYLNQCVQSILDQTYHNLEIILIDDGSPDNCGQLIDDWAKKD 60 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 + LHQ N G S ARN G++ G+++AFVD+DD V E ++ A+ D ++ Sbjct: 61 KRIVALHQKNGGLSAARNTGLDNCHGEWLAFVDSDDYVAKDYIEKMLKAAMRDHTNLVIS 120 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHV---VWMGVYRRDVIV 177 + +GE V ++ ++ S W +Y R+V Sbjct: 121 HYYEVDEISGEQINIKHN----PAKVYNIDEFWKLYYSPEGNFVSLVVAWDKLYAREVFE 176 Query: 178 KNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYL-HNTSVSRLHRQGNKNLNYQ 236 + ++ GL H+D + + NA R +LY Y + + S+ ++ Sbjct: 177 ER--RYAVGLVHEDEEIIFDVVKNAHRISVISDNLYYYRVNRSGSIMANINASSEIDQAI 234 Query: 237 RHYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRK--EPDILTRQRMIAEI 294 K N + ++ + + ++ + I+ + I Sbjct: 235 LEIKKTRAADLMHNEFWDGALLATEDLLTSVINAKVKNNSEKTRTIYSQIMLIIKQNRRI 294 Query: 295 FTSGMYKRLITNVRSVKVGYQALLWSFRLWQWRDKTRSHH 334 YK + L++ + + K Sbjct: 295 LNQHGYK--FDRKHRIYFAIPGLIYYLKRVRLSAKKILQR 332 >UniRef50_Q93DZ7 Eps1I n=1 Tax=Streptococcus thermophilus RepID=Q93DZ7_STRTR Length = 316 Score = 262 bits (670), Expect = 1e-68, Method: Composition-based stats. Identities = 81/317 (25%), Positives = 144/317 (45%), Gaps = 15/317 (4%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 +S+I+P YN RTC+ES++ QT+ +E+II+NDGSTD S+ + ++ +++ Sbjct: 2 NPLISIIVPTYNVEKYIRTCIESILAQTYRNIEVIIVNDGSTDQSLAVISDLICSHHNIK 61 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 +++Q N G SVARN GI+ ATGKY+AFVDADD++ P +L +A + D+ + + Sbjct: 62 VINQKNQGLSVARNTGIDAATGKYIAFVDADDKIKPDFVSSLYQIADKTGADIVRGSFRD 121 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGP-DWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 IP + V T + S + VVW +YR D I N+I+F Sbjct: 122 FNGN-------IPKGWVPDFNVPTNYGTIVLDQFLSSNISFVVWSSIYRLDFINSNHIRF 174 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYL-HNTSVSRLHRQGN---KNLNYQRHY 239 G+ +D +T A + + Y Y + S+ N +L+ ++ Sbjct: 175 TPGILFEDADFTIRAYMLAKLVATSPEPNYAYRINRPGSILTTKTTKNAQKMSLSEEKII 234 Query: 240 IKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTR---QRMIAEIFT 296 + +L+ + +++ + + + V + + + +I EI Sbjct: 235 SQFISMLKHEKSDVLCSLILKSIYACMRDWTGIIVRNNLSLDRKNSCFDTALTLIKEIIN 294 Query: 297 SGMYKRLITNVRSVKVG 313 S K I + V + Sbjct: 295 SRPLKEKIKFLTKVIII 311 >UniRef50_D2BBH8 Glycosyltransferase involved in cell wall biogenesis-like protein n=1 Tax=Streptosporangium roseum DSM 43021 RepID=D2BBH8_STRRD Length = 616 Score = 262 bits (670), Expect = 2e-68, Method: Composition-based stats. Identities = 74/266 (27%), Positives = 113/266 (42%), Gaps = 9/266 (3%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 +LSV++P YN F C+ S+ QT LE+I ++DGS D S IAK YA P R Sbjct: 1 MPRLSVVVPYYNVERYFDACLASIQGQTLGDLEVICVDDGSMDASAVIAKDYASRDPRFR 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++ Q N G ARN G+ A+G Y+AF D+DD V P YE L+ D+A N Sbjct: 61 VVVQENQGLGPARNTGVTHASGHYLAFADSDDIVPPRAYELLVGSLERSGSDIASGNVQR 120 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 E G + R T V T L R VW V+RR + ++F Sbjct: 121 LTSE-GLVQSWAHRNAFRKTQVGTHITRNVHLLFDR----SVWNKVFRRSFWDELGMEFP 175 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 A ++D+ T A + +Y + L S++ + + + R Sbjct: 176 A-RLYEDMPVTVPAHVRARAVDVLSEVVYIWRLREGSITERRFRAENVTDLVISVAETAR 234 Query: 245 LLEKL---NRNYADKIMIYPEFHQQI 267 LE+ R ++ ++ + + Sbjct: 235 FLEEHAPGLRRVYERDTLHNDLRVAV 260 >UniRef50_C0YL33 Glycosyltransferase n=1 Tax=Chryseobacterium gleum ATCC 35910 RepID=C0YL33_9FLAO Length = 333 Score = 262 bits (670), Expect = 2e-68, Method: Composition-based stats. Identities = 67/342 (19%), Positives = 145/342 (42%), Gaps = 10/342 (2%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M N +++S+I+P+YN + C++SL+ Q+ +EI+++NDGS DNS +I + YA+ Y Sbjct: 1 MTNVPSRVSIIVPVYNVENYLAKCLDSLVKQSLPDIEILVVNDGSKDNSEKIIEGYAQKY 60 Query: 61 P-HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQ 119 P ++ + N G S ARN GI+ ATG Y+ FVD+DD V TM+E ++ +A + + + Sbjct: 61 PEKIKAYTKENGGLSDARNFGIDRATGDYIGFVDSDDYVTETMFEEMLLLAEKHNAKMVI 120 Query: 120 CNADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKN 179 N + + L + ++ ++++++ N Sbjct: 121 SNIQKVDENGKVVQKLTQLPNMPEKITLENNFSVF-----SDISYFACNKLFKKELF--N 173 Query: 180 NIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHY 239 +F G+H +DI + + T+ Y+Y S+++ H + K L+ + Sbjct: 174 QKRFKKGVHFEDIQLIPQLLLECETIAQTQNFHYQYLERTDSITKTHTE--KGLDMLKAV 231 Query: 240 IKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGM 299 + ++ + ++ + + + + + + +E +++ + + Sbjct: 232 SDVEQVFRESPYSHKKEELKNFQIFEGVYSFLAYLAFVKEEEIFYSMSDQLVLFMKERQI 291 Query: 300 YKRLITNVRSVKVGYQALLWSFRLWQWRDKTRSHHRITRSAF 341 + I Y L + + ++ R Sbjct: 292 KIQDILKYNRFGKNYLLSLPLKKKIFYLLFFAGQKKLIRKLI 333 >UniRef50_D2Q641 Glycosyl transferase family 2 n=2 Tax=Bifidobacterium dentium RepID=D2Q641_9BIFI Length = 330 Score = 261 bits (669), Expect = 2e-68, Method: Composition-based stats. Identities = 73/336 (21%), Positives = 124/336 (36%), Gaps = 15/336 (4%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M + +SVI+P++ C+ES++ QT L+II+I+DGS D E+ + Sbjct: 1 MADEQPLISVIVPVHGVEASLDACVESIVRQTHRNLDIILIDDGSPDRCPELCDRWMTQD 60 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 +R++HQ N+G S ARN GIE + G+Y+ FVD+DD + MYETL+ L + D++ Sbjct: 61 SRIRVIHQRNSGPSAARNAGIEASQGEYLGFVDSDDSIAEDMYETLLRNLLREQADISII 120 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN 180 + R R + + + VW +YRR++ + Sbjct: 121 GTSLVGEDGESYVPYATECRFR----MNSEQAFKYVNLPGYFHVAVWDKLYRRELFD--D 174 Query: 181 IKFIAGLH-HQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHY 239 ++F H +D T + A Y Y Y+LH R + Sbjct: 175 VRFPVDRHSGEDYPVTFRMIDKAETLIYDSTPKYCYHLHE-------RMTLRLRITTAQS 227 Query: 240 IKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGM 299 + LE + Y Y + I ++++ A Sbjct: 228 DETRTALELVRTKYPRSYD-YMLYGHLIATIGTYNNIILQRQRGQWADFERHAVRLVRKA 286 Query: 300 YKRLITNVRSVKVGYQALLWSFRLWQWRDKTRSHHR 335 R++ K +L K + Sbjct: 287 LPRVMRLPEIPKAKCAQMLLLCLSPALYRKVLIMFK 322 >UniRef50_A6NTS7 Putative uncharacterized protein n=1 Tax=Bacteroides capillosus ATCC 29799 RepID=A6NTS7_9BACE Length = 334 Score = 261 bits (669), Expect = 2e-68, Method: Composition-based stats. Identities = 67/334 (20%), Positives = 129/334 (38%), Gaps = 18/334 (5%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 LS+I+P+Y A C+ S++ Q E+I+++DGS D S +I +AE ++++H Sbjct: 3 LSIIVPVYRAERCISNCITSILAQVNGDFELILVDDGSPDRSGQICDSWAEQDSRIQVIH 62 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALED-DLDVAQCNADWCF 126 + N G S ARN G+ + G+Y+ FVD+DD V P +++T A E+ + C Sbjct: 63 KKNGGVSSARNVGLAASKGQYILFVDSDDYVEPEYASSMLTAAKENPNCGHIWCGFQTVS 122 Query: 127 RETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAG 186 + M L W +YR D+I ++ I F Sbjct: 123 DYNHSDAAPNLKTDKDMLQFMRSE---IMTLHELWLDTGPWNKLYRSDIIRQHGIHFPED 179 Query: 187 L-HHQDIVWTTEFMFNALR--ARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKIT 243 + +D ++ +++ + + LY Y + + + Y K+ Sbjct: 180 MSLGEDWLFNLQYLDASPSECILVVTKPLYNYVRTG----QASLDEGYRPDMLQIYRKLN 235 Query: 244 RLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRL 303 EK + ++ +F+ Y +V + P+ +++ + L Sbjct: 236 GECEKYLKKWSVPAEQIQKFYNSRFYTYEKVLFNTFRNPNSTKKEKYRWN-------RHL 288 Query: 304 ITNVRSVKVGYQALLWSFRLWQWRDKTRSHHRIT 337 + + V W L+ + S+ I Sbjct: 289 MRSEEFRNVLLARDCWVHPLYMLAYRLGSYRLIA 322 >UniRef50_C1CFZ0 Glycosyl transferase, family 2/glycosyl transferase family 8 n=4 Tax=Streptococcus pneumoniae RepID=C1CFZ0_STRZJ Length = 291 Score = 261 bits (669), Expect = 2e-68, Method: Composition-based stats. Identities = 62/267 (23%), Positives = 122/267 (45%), Gaps = 6/267 (2%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 +S+++P+YN + R C++S+ QT+ E ++INDGS D+S +I + + E + Sbjct: 3 DELISIVVPIYNVENYLRMCLDSIQNQTYQNFECLLINDGSPDHSSKICEEFVEKDSRFK 62 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 +AN G S ARN GIE + G Y+ FVD+DD + + L +++ D++ + Sbjct: 63 YFEKANGGLSSARNLGIECSGGAYITFVDSDDWLEHDALDRLYGALKKENADISIGRYNS 122 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRM---GLSSRRWTHVVWMGVYRRDVIVKNNI 181 + ET + + TD S V+ G + V + +++R+++ ++ Sbjct: 123 -YDETRYVYMTYVTDPDDSLEVIEGKAIMDREGVEEVRNGNWTVAVLKLFKRELLQ--DL 179 Query: 182 KFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIK 241 F G +D WT + + A R Y + +Y Y + + + + + Sbjct: 180 PFPIGKIAEDTYWTWKVLLRASRIVYLNRCVYWYRVGLSDTLSNTWSEKRMYDEIGAREE 239 Query: 242 ITRLLEKLNRNYADKIMIYPEFHQQIT 268 +L + + + I+IY Q++ Sbjct: 240 KIAILASSDYDLTNHILIYKNRLQRVV 266 >UniRef50_D2BBI0 Putative glycosyl/glycerophosphate transferase involved in teichoic acid biosynthesis TagF/TagB/EpsJ/RodC-like protein n=1 Tax=Streptosporangium roseum DSM 43021 RepID=D2BBI0_STRRD Length = 1173 Score = 261 bits (668), Expect = 3e-68, Method: Composition-based stats. Identities = 66/245 (26%), Positives = 101/245 (41%), Gaps = 9/245 (3%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 + LSVI+P +N F C+ S+ QT LE+I ++DGS D S IAK +A R Sbjct: 2 SPLLSVIVPFFNVEPYFGACLASIARQTLRDLEVICVDDGSLDGSAVIAKEFAGKDGRFR 61 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++HQ NAG ARN G ATG+Y+ F+D+DD + Y L++ E D D A C Sbjct: 62 VIHQENAGLGAARNAGALHATGRYLCFMDSDDVLPADAYGLLVSSLSESDSDFA-CGNVL 120 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 T + + R T T + L R W V+R D ++ F Sbjct: 121 YLNSTETWRSGLHSRPFRETVQRTHVNTRPELLLDR----TAWNKVFRHDFYREHGFSFP 176 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHN---TSVSRLHRQGNKNLNYQRHYIK 241 G ++D T A +Y + S+++ + + + Sbjct: 177 -GGLYEDAPVTIPAHVLARSVDVLSDVIYYWRQREPGDPSITQRRTEPGNLEDRIASITR 235 Query: 242 ITRLL 246 + L Sbjct: 236 VRDFL 240 >UniRef50_C6Z980 Glycosyltransferase family 2 n=1 Tax=Bacteroides sp. 4_3_47FAA RepID=C6Z980_9BACE Length = 351 Score = 261 bits (667), Expect = 3e-68, Method: Composition-based stats. Identities = 72/353 (20%), Positives = 151/353 (42%), Gaps = 29/353 (8%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 + K+S+++P+YN C++SL+ QT +EII+++D S DN +I Y Y +V Sbjct: 2 NKPKVSIVVPIYNVEKYLDRCIQSLLAQTLKEIEIILVDDESPDNCPQICDRYVAKYTYV 61 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 +++H+ N G ARN G+EVATG+Y+AFVD+DD + M E L E LDV Sbjct: 62 KVVHKKNDGLGQARNSGMEVATGEYIAFVDSDDFIDADMMERLYDECKEHSLDVI----- 116 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSR-----------RWTHVVWMGVYR 172 + E L+ + TG + + ++ G+Y Sbjct: 117 --YSEFNVDEYPGFRITLKPERLYTGREEIEQLRLDIVGAEPNHRSSVKFQCSACKGLYS 174 Query: 173 RDVIVKNNIKFIAGLHH--QDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGN 230 ++ +++++F + + +D+++ + +++A++ + +Y Y L+ S+S +R Sbjct: 175 MKLLSEHHLQFHSERQYISEDMLFNLDVLYHAMQVKTVPWQMYHYCLNGASLSHTYRPDR 234 Query: 231 KNLNYQRHYIKITRLLEKLNRNYADKIMIYPEF-HQQITYEALRVCHAVRK-EPDILTRQ 288 + + +L+K + + + I Y + +R+ + + Sbjct: 235 WP-----KLLYMLEILDKKGQQFKHQEEFRLRLARTAIFYTRSGIGQEIRRTDISFKEKL 289 Query: 289 RMIAEIFTSGMYKRLITN--VRSVKVGYQALLWSFRLWQWRDKTRSHHRITRS 339 R + EI LI + ++ + ++ + + + + Sbjct: 290 RRVTEIMKDTTISSLIKDYPFYNLPLTWKIYGYLLKCKCKWIMFIVMYINNKK 342 >UniRef50_C4G9K6 Putative uncharacterized protein n=1 Tax=Shuttleworthia satelles DSM 14600 RepID=C4G9K6_9FIRM Length = 445 Score = 261 bits (667), Expect = 3e-68, Method: Composition-based stats. Identities = 67/264 (25%), Positives = 119/264 (45%), Gaps = 8/264 (3%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M K+S+IIP+YNA R C++S++ Q +T E+++++DGS+D S I YA Sbjct: 1 MGPNPKVSIIIPVYNAASALRRCLDSVLKQEFTDFELLLMDDGSSDESPAILDEYAGKDS 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 +RL+H+ N+G S RN+ +++A G YV F+DADD + P + ++ + D+ + Sbjct: 61 RIRLVHKNNSGVSDTRNQALDLARGTYVQFLDADDWIAPEATKLMVRAMEQKKADLVVTD 120 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNI 181 E I D L G M + + V+W +Y R +I +N I Sbjct: 121 FYRVVGEHVSHKGDIEEDSLMDLQTFAGH---MMENPADFYYGVIWNKLYSRQIIEENQI 177 Query: 182 KFIAGLHH-QDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQ---- 236 + + +D ++ E++ + + +Y Y S+ K + + Sbjct: 178 RMDPEVSWCEDFLFNMEYLLHCRGIFALQVPVYYYVKTEGSLVSQGLSITKTVEMKASVF 237 Query: 237 RHYIKITRLLEKLNRNYADKIMIY 260 R+Y + A +I IY Sbjct: 238 RYYNDFFARVYDPEDYEARRIQIY 261 >UniRef50_Q4JZB4 Putative glycosyl transferase n=3 Tax=Streptococcus pneumoniae RepID=Q4JZB4_STRPN Length = 331 Score = 261 bits (667), Expect = 3e-68, Method: Composition-based stats. Identities = 68/330 (20%), Positives = 134/330 (40%), Gaps = 12/330 (3%) Query: 3 NSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPH 62 ++ +SV++P+YN + C+ S++ QT+ +EIII+NDGS D+S++I YA Sbjct: 8 DNLPLISVVLPVYNVENYLEKCLNSVVRQTYKNIEIIIVNDGSLDSSIDICNKYANKDSR 67 Query: 63 VRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNA 122 +++ + N G S ARN G++ + G+++ FVD+DD V E L ++ + + ++ C+ Sbjct: 68 IKVFTKPNGGLSDARNFGVKKSMGEFITFVDSDDYVTDDYVEYLYSLVKKYNCKMSMCSI 127 Query: 123 DWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIK 182 + + + L ++T + + +Y R + + I Sbjct: 128 FVHYISNDKMINNGTGREL----MMTAEQCIEKMCYHDEVDTCAYAKLYHRSLFSE--IC 181 Query: 183 FIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKI 242 F G +DI F A + + Y Y + S+ K + ++ Sbjct: 182 FPKGRIFEDIGTIYRFFGTAGQIACGFNAKYYYEIRENSIVTSSFSEKK-FDLLFMTDQM 240 Query: 243 TRLLEKLNRNYA---DKIMIYPEFHQQITYEALRVCHAVRKEP--DILTRQRMIAEIFTS 297 +++ + + +Y F ++ + +KE I + + E + Sbjct: 241 RNFVDQRYPSLKAATKRRAVYARFSTLNQLLDVQGYNEEKKEIIRFIQDNKEALLENPKT 300 Query: 298 GMYKRLITNVRSVKVGYQALLWSFRLWQWR 327 RL S+ LLW L R Sbjct: 301 PKRDRLAVRALSLGFCIYKLLWKQYLKIQR 330 >UniRef50_B2UQN8 Glycosyl transferase family 2 n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UQN8_AKKM8 Length = 376 Score = 261 bits (667), Expect = 4e-68, Method: Composition-based stats. Identities = 83/354 (23%), Positives = 135/354 (38%), Gaps = 18/354 (5%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M + +SVI+P+YN C++SL QT LEII ++DGSTD S I + YA Sbjct: 1 MPLPSPLISVIVPIYNMEPLLPRCLDSLAAQTLRDLEIICVDDGSTDGSGGIVRKYASGD 60 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 RL+ Q N+G + ARN GI A Y+ F D DD V P MYE L +A E D+ QC Sbjct: 61 SRFRLITQENSGRAEARNAGIRAAAAPYLGFADPDDYVEPDMYERLYRLAEESGADMVQC 120 Query: 121 NADWC------FRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRD 174 + + + + GV T + R T VVW ++RR Sbjct: 121 SYSPFLPAESGESRGMAEEKLLHIENTACDGVFTEKGEIFRLFLEDRITGVVWSKLFRRI 180 Query: 175 V------IVKNNIKFIAG-LHHQDIVWTTEFMFNALRARYTEQSLYKY-YLHNTSVSRLH 226 + + ++ + +D ++ + + T + LY Y SVS + Sbjct: 181 LPGCSAPLE---VRLPSSFTSGEDTLYVSRAIARCRSVALTSEKLYHYGLGGPQSVSSRN 237 Query: 227 RQGNKNLNYQRHYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILT 286 R+ ++ + Y + + + + + L Sbjct: 238 RKAETRPASYYAVFEMLTREKLREGVLGSNRTAYMNYIVPLLFPDNEMPAGRLRHWAELW 297 Query: 287 RQRMIAEIFTSGMYKRLITNVRSVKVG-YQALLWSFRLWQWRDKTRSHHRITRS 339 R+ I GM + + + G + L W+++ R R+ S Sbjct: 298 READITSEHVCGMPREQRAYLEAALAGRWIYLRLLQCFWKFKTARRKMFRLRFS 351 >UniRef50_C0B702 Putative uncharacterized protein n=1 Tax=Coprococcus comes ATCC 27758 RepID=C0B702_9FIRM Length = 283 Score = 260 bits (666), Expect = 5e-68, Method: Composition-based stats. Identities = 67/261 (25%), Positives = 124/261 (47%), Gaps = 21/261 (8%) Query: 1 MM---NSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYA 57 MM K SV++P+YN C++S++ QT+ LEII+++DGS DN ++ + Sbjct: 1 MMKQTEEKIKFSVVLPIYNVEKYLNRCLDSVMNQTYKKLEIILVDDGSPDNCPQMCDNWE 60 Query: 58 ENYPHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDV 117 + ++++H+ NAG ARN G++VATG Y+AF D+DD + ++E L T+ + D+ D+ Sbjct: 61 KVDDRIKVVHKKNAGLGEARNSGLDVATGDYIAFFDSDDYIDTRLFEELYTVIISDNPDL 120 Query: 118 AQCNADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSS-----------RRWTHVV 166 + R+ T IP L G + L L Sbjct: 121 IEFGHHDVDRQGNITKTFIPKTPLEK---YEGEEVLSKFLPELICTDPKTGTASDLLMSA 177 Query: 167 WMGVYRRDVIVKNNIKFIAGLHH--QDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSR 224 W +YRR ++ + N F++ + +D+ + M N ++ Y Y ++ S++ Sbjct: 178 WSCLYRRQLLAECNFHFVSERKYISEDVYSLMKLMPNVHSVNVVHKAYYYYCENDQSLTH 237 Query: 225 LHRQGN--KNLNYQRHYIKIT 243 +++ K + +Q H ++ Sbjct: 238 VYKPDRFEKLVAFQLHLEELC 258 >UniRef50_B0P2L4 Putative uncharacterized protein n=1 Tax=Clostridium sp. SS2/1 RepID=B0P2L4_9CLOT Length = 332 Score = 260 bits (666), Expect = 5e-68, Method: Composition-based stats. Identities = 73/330 (22%), Positives = 152/330 (46%), Gaps = 14/330 (4%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 +SVIIP YN D C++S+I+QT+ LEII+++DGS D++ EI YA+ + Sbjct: 2 DEKLVSVIIPAYNIEDYIGRCLDSIISQTYKNLEIIVVDDGSRDHTGEILDNYAKKDRRI 61 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 +++H+ N G S ARN+GIE A G Y+ F+D DD + MY+TL+ + E++ D+A C Sbjct: 62 KVIHKENGGVSSARNKGIEAAEGDYIGFIDGDDLIESEMYKTLVDLLEEENADIAHCGYQ 121 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 F + + + + + T + L+ LS + +Y++++I + Sbjct: 122 MVFPDRIDYYHNTGKK-----KIQTTEEGLKDLLSGEMIEPGLVNKIYKKELIKNCRLD- 175 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKIT 243 ++D++ + + ++ Y + + Y Y + ++S + N + + +++ Sbjct: 176 ETVKINEDLLMNYQLFKLSQKSVYYDITPYSYMIRSSSAT----GANSLITKREDALRVL 231 Query: 244 RLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRL 303 ++ N +IY + Y + +C K+ + Q+ + + + Sbjct: 232 NQIKDDCINNNLLSIIYKRY----IYLLMAICRDDLKDRSYIEYQKKQRKQLKKELKTDI 287 Query: 304 ITNVRSVKVGYQALLWSFRLWQWRDKTRSH 333 + K+ Y +L + + + + Sbjct: 288 FKSCIPKKLKYMSLFSCYLPHIMKVIYKMY 317 >UniRef50_C7X8L4 Glycoside transferase, family 2 n=1 Tax=Parabacteroides sp. D13 RepID=C7X8L4_9PORP Length = 308 Score = 260 bits (665), Expect = 5e-68, Method: Composition-based stats. Identities = 78/308 (25%), Positives = 134/308 (43%), Gaps = 6/308 (1%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 N +SVIIP+YN + + S++ QT +EII+INDGSTD S+ I + A+ + Sbjct: 2 DKNLVSVIIPVYNTEEFVEEAVRSIMNQTLQEIEIIVINDGSTDGSLSIIERLAKEDKRI 61 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 R Q N G S RN+G E A GKY+ F+D+DD + P ET + LD +AD Sbjct: 62 RTYTQQNQGLSATRNKGNEYAQGKYIYFMDSDDYLEPETLETCYIKCEQQALDFVLFDAD 121 Query: 124 WCFRETGETWQ--SIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNI 181 +E ++ D V GP L++ + + ++ + RR+ + + + Sbjct: 122 ILNKEKHKSLHLNYQRKDCTNPELVYQGPHILKLLMDHKAYSPSACLNFIRREYLTEIQL 181 Query: 182 KFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIK 241 F+ G+ H+D ++T A R + +K L S+ ++ Y Sbjct: 182 DFLPGIIHEDQLFTCLLYLQAQRVSSIHKDFFKRRLREDSIMTS----KFSMRNMESYFT 237 Query: 242 ITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYK 301 IT L K ++ I + ++ A+ + + + + +R + E K Sbjct: 238 ITAHLSKFADSHPMFREIIDLYLSRMLNAAVWLSYKTPFKDRLKIARRCLKEYRQYVTTK 297 Query: 302 RLITNVRS 309 LI + Sbjct: 298 NLIVLLFK 305 >UniRef50_A7V2P1 Putative uncharacterized protein n=3 Tax=Bacteroidales RepID=A7V2P1_BACUN Length = 348 Score = 260 bits (665), Expect = 6e-68, Method: Composition-based stats. Identities = 76/353 (21%), Positives = 145/353 (41%), Gaps = 28/353 (7%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M + K+S+I+P+YN C+ S++ QTW LE+I++ND + DNS+EIA+ ++P Sbjct: 1 MIESMKVSIIVPVYNVAKYIERCLLSVLDQTWQDLEVILVNDCTPDNSMEIARRVVASHP 60 Query: 62 HVRLL----HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDV 117 ++ H+ N G S ARN GI + G Y+ F+D+DD + E L A++ D Sbjct: 61 RGTVVRCLEHEENRGLSAARNTGISASVGDYLYFLDSDDYISANAIELLADAAVQKRPDF 120 Query: 118 AQCNADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIV 177 N + + TG G + +W + W + R +++ Sbjct: 121 VIGNYEV------TGARRWAPPLSLGTGFYEGNALVLSTYVQGKWYVMAWNKLVSRPLVL 174 Query: 178 KNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLN--- 234 ++ + F G+ H+D +W+ + + ++ Y Y + S+ R N Sbjct: 175 QHKLYFQEGIVHEDDLWSFKLACMSQSMYVVNETTYYYSMQPDSIMRAPSMRNLECRVLV 234 Query: 235 --YQRHYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIA 292 Y +I+ +R L+ Y + ++ +I Y + ++ R + A Sbjct: 235 LGYIYDFIRSSRCLQDNRLIYIYFESLKAKYFDRILYFTKDTSFHYQSY--LVFRNKKYA 292 Query: 293 EIFT----SGMYKRLITNVRSVKVGYQALLWS-------FRLWQWRDKTRSHH 334 + +K ++ N+ V Y L+ + L K ++ Sbjct: 293 SLLEMTGLRPEWKLMLQNIHYVLPTYAGYLYFKAFVKSAYYLLVLSIKMKAVF 345 >UniRef50_A7I1X7 Ss-1,4-galactosyltransferase n=1 Tax=Campylobacter hominis ATCC BAA-381 RepID=A7I1X7_CAMHC Length = 325 Score = 260 bits (664), Expect = 7e-68, Method: Composition-based stats. Identities = 66/336 (19%), Positives = 143/336 (42%), Gaps = 20/336 (5%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 +SVI+P++N + C++SL+ Q++ EI++I+DGS D S+EI + +A+ P +R Sbjct: 2 KPLISVIVPVFNVAEFLDDCIKSLLNQSYENFEILLIDDGSGDKSLEICQNFAKKEPKIR 61 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 + + N G ARN G++ A G+++ FVD+DD + + + ++++ +A C Sbjct: 62 VFSKQNGGQGSARNLGLDNAKGEFICFVDSDDILASGFLDKTFEILKQNNVKMAMCAYAP 121 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSR-RWTHVVWMGVYRRDVIVKNNIKF 183 + S +L + + + + VW +Y + + K ++F Sbjct: 122 FVDASE---ISNIESVKSEVKILNDKEIFKEIFLANPAFGTAVWRNLYHKSIFEK--LRF 176 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKIT 243 ++D+ E NA +A + ++LY Y L S + + + + I + Sbjct: 177 PQNQIYEDVAIAFEIFNNAEKAAFCNETLYFYRLRAGSTTNSFSK-----RHLQAEISVK 231 Query: 244 RLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKE------PDILTRQRMIAEIFTS 297 EK+ + Y D + I + + K+ P I + E F + Sbjct: 232 NFCEKVVKKYPDLKKMAN---FAICDSMTSISIMILKDGAKEFYPKITEFLPFLRENFMN 288 Query: 298 GMYKRLITNVRSVKVGYQALLWSFRLWQWRDKTRSH 333 + ++ + ++ +++ ++ + R Sbjct: 289 ILKQKNYSKIKKIEMILLSISPKMLNLALKFYKRMR 324 >UniRef50_A5A8E7 Galactosyltransferase n=8 Tax=Streptococcus RepID=A5A8E7_STROR Length = 327 Score = 260 bits (664), Expect = 7e-68, Method: Composition-based stats. Identities = 70/337 (20%), Positives = 136/337 (40%), Gaps = 12/337 (3%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +SVI+P+YN + R MESL+ QT+ E+I+++DGSTDNS ++ YA Y V H Sbjct: 2 ISVIVPIYNVEEYLRYAMESLVNQTYKDFEVILVDDGSTDNSGQLCDQYASEYDWVSAYH 61 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 + N G S ARN G+ A G ++ F+D DD P E LM + + + Sbjct: 62 KVNGGLSDARNYGVTKAKGDWIVFLDPDDYFEPCALELLMELQKRTGARIVAGKGKDAYD 121 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 I + L +L+ + L L T W ++ RD++ K+ + G Sbjct: 122 HDSYQNYQINEETLAKATLLSSEETLVEMLYGTIATVSAWAKLFPRDIVEKH--PYPKGK 179 Query: 188 HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRLLE 247 ++D+ ++++ + +Y YY S++ N N + Y + L + Sbjct: 180 IYEDLYVISDYLKEVESVCVIDLGIYHYYRRPNSIT----LSNFNSKHYDFYDSMNHLED 235 Query: 248 KLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLITNV 307 + ++Y+ + ++ +L + + + L + Sbjct: 236 VVQKDYSSNTELDDAVIARLFIGSLHIFILISDSSKEEILC------IQRKIRPHLGRVL 289 Query: 308 RSVKVGYQALLWSFRLWQWRDKTRSHHRITRSAFNLR 344 ++ ++ + L ++ +++ N R Sbjct: 290 KNKRINLKNKAIYIMLSLVPGLYYRFKKLKKASKNER 326 >UniRef50_C9N8V9 Glycosyl transferase family 2 n=1 Tax=Streptomyces flavogriseus ATCC 33331 RepID=C9N8V9_9ACTO Length = 1295 Score = 260 bits (664), Expect = 8e-68, Method: Composition-based stats. Identities = 60/244 (24%), Positives = 101/244 (41%), Gaps = 8/244 (3%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 LSV++P++N D C++SL Q+ A+E+++++DGSTD S IA+ +A R Sbjct: 2 KPLLSVVVPVHNVEDYLAECLDSLAGQSLDAIEVVMVDDGSTDGSRRIAEEFAARDDRFR 61 Query: 65 LLHQANAGASVARNRGIEVAT-G-KYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNA 122 +HQ NAG ARN G+ T G Y+AF D+DD + Y ++ D+ N Sbjct: 62 CVHQPNAGLGAARNTGVARTTPGVPYLAFADSDDVLVRDAYARMLASLESTGSDLVTGNV 121 Query: 123 DWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIK 182 + + D LS R V W V+RR +++ Sbjct: 122 WRLTGQKRQQAWQYRWLTGDRLRTHITRDA--RLLSDR----VAWNKVFRRSFWDRHDFS 175 Query: 183 FIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKI 242 F G ++D + A Q +Y + + S++R + ++ Sbjct: 176 FPVGRLYEDTPVMIPAHYLAGSVDVLHQHVYHWRVREGSITRRRTDVRGVRDRIAACEQV 235 Query: 243 TRLL 246 + L Sbjct: 236 SAFL 239 >UniRef50_Q93TI5 Putative glycosyltransferase CpsIVJ n=1 Tax=Streptococcus agalactiae RepID=Q93TI5_STRAG Length = 315 Score = 260 bits (664), Expect = 8e-68, Method: Composition-based stats. Identities = 65/310 (20%), Positives = 125/310 (40%), Gaps = 8/310 (2%) Query: 15 YNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLHQANAGAS 74 YN C++S++ QT+ +EI++++DGSTDNS I Y+ +R+ H+ N G S Sbjct: 13 YNVEKFLEECIDSVLNQTYKNIEILLVDDGSTDNSGIICDNYSLKDKRIRVFHKNNGGLS 72 Query: 75 VARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFRETGETWQ 134 ARN G+ A G+Y++F+D+DD + + +D+ CN + + Sbjct: 73 DARNFGVVNAEGRYISFIDSDDYIDKNYIRKMYYCLSNHKVDMVICNYLSVYNNDLKPIV 132 Query: 135 SIPTDRLRSTGVLTGPDWLRMGL-SSRRWTHVVWMGVYRRDVIVKNNIKFIAGLHHQDIV 193 S D + + + + + W + R D+ K + F G H+D Sbjct: 133 SRLQDCI----IYSRVELFSNLYGKYKDPFTTAWGSLIRTDIAKK--VTFPIGKLHEDEF 186 Query: 194 WTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRLLEKLNRNY 253 T ++ + + + + LY Y S+ +NL+ Y + ++LEK N + Sbjct: 187 TTYKYYLYSDKIAFVPEPLYFYRRREASIMSSSY-SKRNLDSLEAYEERIKILEKNNISI 245 Query: 254 ADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLITNVRSVKVG 313 ++ + IY Y + K+ + + K + + +G Sbjct: 246 SETVYIYLSLILYHIYRIRKYPEVCNKQELLDKFNNYYLKYKNILGLKNRLKIITLKNLG 305 Query: 314 YQALLWSFRL 323 + +L Sbjct: 306 IPYQRYKHKL 315 >UniRef50_C7H1H9 Polysaccharide biosynthesis protein, Glycosyl transferase Family 2,YveR n=2 Tax=Faecalibacterium prausnitzii RepID=C7H1H9_9FIRM Length = 318 Score = 260 bits (664), Expect = 8e-68, Method: Composition-based stats. Identities = 64/312 (20%), Positives = 133/312 (42%), Gaps = 25/312 (8%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 +S+I+P+YNA + C+ S++ QT+ +E+II+NDGS D+S+ + + + + Sbjct: 7 DQPLVSIILPVYNAQNHLARCIGSILNQTYKNIELIILNDGSKDHSLPVCEEFRAKDSRI 66 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 L+ + N+G S RN G+++A G+YV FVD+DD + P + L+ A D+ Sbjct: 67 VLVDKENSGVSDTRNLGLKLAGGEYVQFVDSDDYIDPDFTQHLVEAAETHHADLVIAPYK 126 Query: 124 WCFRETGETWQSIPTDRLRSTGVLT------------------GPDWLRMGLSSR---RW 162 + + + GV++ D + L + + Sbjct: 127 MVIPAGATKPEQVLEKLEDNLGVMSVARPPEVREYGFLPAGVYDKDTFALRLMDKPASYF 186 Query: 163 THVVWMGVYRRDVIVKNNIKFIAGLHH-QDIVWTTEFMFNALRARYTEQSLYKYYLHNTS 221 V+W +YRR ++ N+I+F + L +D+V+ +++ A +++ Y Y + S Sbjct: 187 YSVLWNKLYRRILLTGNDIQFTSELKWAEDLVFNMQYIQYAEAFVSIDKAGYYYVQNPQS 246 Query: 222 VSRLHRQGNKNLNYQRHYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKE 281 + + ++ I+ R + L + P+ ++ + A + + Sbjct: 247 ICHTQINP---ASIVQNKIQTFRYYKDLYTRLGMYEEVRPQLYKFLVDIAESTYPSGPFK 303 Query: 282 PDILTRQRMIAE 293 I + Sbjct: 304 KIIEEAKEYWKN 315 >UniRef50_D2BMR3 Glycosyltransferase, family 2 n=1 Tax=Lactococcus lactis subsp. lactis KF147 RepID=D2BMR3_LACLK Length = 337 Score = 259 bits (663), Expect = 1e-67, Method: Composition-based stats. Identities = 74/339 (21%), Positives = 134/339 (39%), Gaps = 11/339 (3%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 K+SVI+P+YN R C++S+I Q++ E+++I+DGS DNS +I Y + VR Sbjct: 1 MPKISVIVPVYNVEKLLRRCLDSIINQSFEDFELLLIDDGSKDNSGQICDDYKKQDERVR 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 + H +N G S ARN GI+ G ++AF+D+DD V E L LED++DV C Sbjct: 61 VWHISNGGQSAARNLGIDNVQGDFIAFIDSDDFVELDYLEKLYQPMLEDEVDVVSCRYIA 120 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRW---THVVW---MGVYRRDVIVK 178 + E + D + + + + W +Y+ ++ + Sbjct: 121 YSEDRDEEVKKKIRDNHPEFYHFRSNHIVMKDFLEKEFKEERNFPWEAYANLYKAKLLGE 180 Query: 179 NNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRH 238 I+F G +D + ++ A T Y YY + +S +R+ K + + Sbjct: 181 --IRFEVGREFEDNFFNYQYFKRVQIAVATSYIGYYYYANPSSKTRITFY-EKQFDLIQQ 237 Query: 239 YIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIF--T 296 I ++K + D Y + + + + I +++ Sbjct: 238 ERLIIENVKKEYPEFLDLEANNLALKYCWLYGKIFGDIQLANFSVSKSHIKEIQKLYSED 297 Query: 297 SGMYKRLITNVRSVKVGYQALLWSFRLWQWRDKTRSHHR 335 ++ L + Y A + + R R Sbjct: 298 KKYFQNLNIGKKWQTFFYFASHFPYITGLSNHFYRMLRR 336 >UniRef50_B5CQR8 Putative uncharacterized protein n=1 Tax=Ruminococcus lactaris ATCC 29176 RepID=B5CQR8_9FIRM Length = 304 Score = 259 bits (663), Expect = 1e-67, Method: Composition-based stats. Identities = 66/311 (21%), Positives = 130/311 (41%), Gaps = 18/311 (5%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 MM +S+I+P+YN+ C++S++ QT+ LEI+++NDGS D S++I + YA Sbjct: 1 MM-----ISIIVPVYNSEKYIDRCLDSILNQTYKDLEIVLVNDGSNDQSLKILENYALRD 55 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 +++++Q N G + ARN G++ ATG Y+ +VD+DD + M E ++ + + D+ C Sbjct: 56 TRIKVVNQENKGVAAARNTGLDNATGDYILYVDSDDWIENNMVERMVEL--SANADIVFC 113 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN 180 + + +S+ + + + L + + ++W + +R + + Sbjct: 114 G-----NDNAVSPESVKKIIGITKEIWNKDKIIYEFLRHKIMSGMLWNKLIKRSL--TDG 166 Query: 181 IKF-IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHY 239 +F + +D + + + N T + LY + S+S L K + + Sbjct: 167 CRFNPKTGYGEDAEFLWQVLQNTNSMIVTNEILYHHVPDENSISHLSYSEKKY-SAISMW 225 Query: 240 IKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGM 299 KI + Y I + K + L R I G Sbjct: 226 EKINADTSAIYPEYLKYARI--SLTSAAVFGLFEARQCCYKNKNQLKYMRSITRKNIGGF 283 Query: 300 YKRLITNVRSV 310 K + + Sbjct: 284 LKADYISKKFK 294 >UniRef50_C6LFJ2 Glycosyl transferase, family 2/glycosyl transferase family 8 n=1 Tax=Bryantella formatexigens DSM 14469 RepID=C6LFJ2_9FIRM Length = 402 Score = 259 bits (662), Expect = 1e-67, Method: Composition-based stats. Identities = 83/357 (23%), Positives = 154/357 (43%), Gaps = 19/357 (5%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M+ K+SVIIP+YNA R ++ QT +EII ++DGSTD+S EI + YA Sbjct: 1 MSYDRKVSVIIPVYNAEKYLRDTLDDATGQTLREIEIICVDDGSTDSSAEIVEEYARRDA 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 VRL+ Q N A VARN G+ ATG+Y+ F DADD P M E + + D+ CN Sbjct: 61 RVRLIRQKNQYAGVARNHGMSHATGEYLVFWDADDCFEPAMLEKMYARITQAQADICVCN 120 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWT---HVVWMGVYRRDVIVK 178 + T + +S L + ++ + + +V W ++R + + + Sbjct: 121 VNVLDETTKDLIRS--ESYLVPAYLQGAQEYAKTTHPEYLFNIATNVPWNKMFRAEFVRE 178 Query: 179 NNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTS-----VSRLHRQGNKNL 233 + ++F D+ + + A + ++ L Y +N S +S+ K Sbjct: 179 HGLQFENRSRANDVYFVMMAFYLAEKIAVLDERLVTYRANNESSLTGTLSQTPLCAIKAF 238 Query: 234 NYQRHYIKITRLLE--KLNRNYADKIM--IYPEFHQQITYEALRVCHAVRKEPDI--LTR 287 + R + +E ++ +++ ++++ + H+ + EA + K+ L Sbjct: 239 DAVRKELLRLGAMENPQIRQSFDNRVLQSLLFGLHKCVQGEAFAQMYQYLKKEGFARLGI 298 Query: 288 QRMIAEIFTSGMYKRLITNVRSVKVGYQALLWSFRLWQWRDKTRSHHRITRSAFNLR 344 ++ Y+R + + + Y + + L + K + H + NLR Sbjct: 299 CPQEQYYYSEEAYRRFLRLLDEEPLDY---VLQYALEEKTKKQKRIHELKEKVQNLR 352 >UniRef50_A8SCX8 Putative uncharacterized protein n=1 Tax=Faecalibacterium prausnitzii M21/2 RepID=A8SCX8_9FIRM Length = 328 Score = 259 bits (662), Expect = 1e-67, Method: Composition-based stats. Identities = 58/234 (24%), Positives = 94/234 (40%), Gaps = 11/234 (4%) Query: 1 MM-NSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAEN 59 M +SVI+P+Y C++SL+ QT+ E+++++DGS D EI + YA Sbjct: 1 MNAKEQPLISVIVPVYKVEKYLPACLDSLLAQTYQNFELLLVDDGSPDKCWEILQQYAAQ 60 Query: 60 YPHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQ 119 V + + N G S ARN G+E A G+Y+ FVD+DD V P E L +A Sbjct: 61 DARVCIFRKENGGVSSARNFGLEQARGEYICFVDSDDLVLPQYLEWLYDALCSCGTRIAM 120 Query: 120 CNADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKN 179 C ET ++ W++ W + R+V+ Sbjct: 121 CRYRSIREN--ETLWTVTECPAPQKITTENYSWMQE-----WSGGHCWRMLTHREVLQ-- 171 Query: 180 NIKFIAGLHH-QDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKN 232 +++F L + +D ++ A + LY Y +S R Sbjct: 172 DVRFDPALFYGEDALFFVTEFLKAGSLAFLNCPLYAYLERPSSAVRQEPSLRFY 225 >UniRef50_C3PZ30 Glycosyl transferase n=4 Tax=Bacteroides RepID=C3PZ30_9BACE Length = 336 Score = 259 bits (662), Expect = 1e-67, Method: Composition-based stats. Identities = 72/327 (22%), Positives = 129/327 (39%), Gaps = 14/327 (4%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 T+ +SVI+P++N R C+ES++ T+T L+II++NDGSTD+S EI Y + Sbjct: 6 ETDLISVIVPIFNVEAYLRPCIESILASTYTTLQIILVNDGSTDHSGEICDEYTRKDTRI 65 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 ++HQ NAG S ARN G++ A GKY++F+D DD ++P MYE L+ E + + Sbjct: 66 EVIHQKNAGLSPARNSGMKAAKGKYISFIDGDDYIHPQMYEVLLEALQEGNYSFSMILGK 125 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTH----VVWMGVYRRDVIVKN 179 + + T + + VVW +Y+R+++ Sbjct: 126 QVYDNDKSYSIPSAYTKSILTQEIMIKSLFNHIHPQQGIKEVQAQVVWNKLYKRELLDN- 184 Query: 180 NIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHY 239 +F +D + + +A + +Y + TS++ + ++ Y Sbjct: 185 --EFFQETGTEDTEFNCRIYQKSCQAVIIDIPMYYWVQRPTSITHQKVNP-RYIDRADSY 241 Query: 240 IKITRLLEKLNRNY------ADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAE 293 + + K N Y MI + + H K T + Sbjct: 242 YLCLQNIPKENTVYRGCCLEKLFKMIINVRYHASNTPYRHLAHLTAKRLKKQTANEFLKN 301 Query: 294 IFTSGMYKRLITNVRSVKVGYQALLWS 320 K + + + Y + Sbjct: 302 RHIPFHIKLSLLSFYYIPPLYSGFMKL 328 >UniRef50_B0NM34 Putative uncharacterized protein n=3 Tax=Bacteroides RepID=B0NM34_BACSE Length = 348 Score = 258 bits (661), Expect = 2e-67, Method: Composition-based stats. Identities = 68/297 (22%), Positives = 132/297 (44%), Gaps = 14/297 (4%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M + ++SVI+P+YN C+ S+++QT+T E+++++DGS D EI YA Sbjct: 1 MERNIPEISVIVPVYNVEQYLAECISSILSQTFTDFELLLVDDGSPDRCGEICDEYAGKD 60 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLD--VA 118 VR+ HQ NAG S ARN+G+E A+G Y+AFVD+DD V T + L D Sbjct: 61 KRVRVFHQENAGLSCARNKGLEHASGTYIAFVDSDDYVTSTYLQELYASLPADKSQRGTV 120 Query: 119 QCNADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVK 178 C D F + S+ T + +L + ++ W +Y +I + Sbjct: 121 ICGFDKLFPDG-----SLHTVHVPQQTILPMDCSRVLAELVGKYVMYAWAKLYDNRLIKE 175 Query: 179 NNIKFIAGLHH-QDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRL---HRQGNKNLN 234 + I+F+ + +D+++ +++ + + S Y Y + + + ++ Sbjct: 176 HGIRFVPAVSGLEDMLFMLDYLPYSDYLLIRDTSTYIYRVGYSMATLSTCIKDFRSEYAA 235 Query: 235 YQRHYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMI 291 + + ++ R E + + ++ L + +KE ++R++ Sbjct: 236 FSNYLERVCRYKETYRLEDSSLVGVWDSLTVFFHKILLAIY---KKENHYSRKERIV 289 >UniRef50_B0ABV6 Putative uncharacterized protein n=1 Tax=Clostridium bartlettii DSM 16795 RepID=B0ABV6_9CLOT Length = 329 Score = 258 bits (661), Expect = 2e-67, Method: Composition-based stats. Identities = 78/329 (23%), Positives = 143/329 (43%), Gaps = 10/329 (3%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M K+SVI+P+YN R C++S++ Q +T+ E+I +NDGSTDNS+EI + Y + Sbjct: 1 MNTINPKISVIVPVYNTSTYLRRCLDSILEQDFTSYEVICVNDGSTDNSLEILREYEKKS 60 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 +++++Q N+G + RN ++ A G Y+AF+D+DD V L A++ D+ C Sbjct: 61 EKIKVINQVNSGVAKTRNTALKHAKGDYLAFLDSDDFVRENYLSRLYDAAIDTRSDIVIC 120 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN 180 N + + + P GV D L+ + +W +++R++ N Sbjct: 121 NFYRYYE---QINLAKPVFYKFRRGVFNKYDILKGLIPDNLIHSYLWNKLWKREIFEDFN 177 Query: 181 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYI 240 F + +D+ ++ ++ A + +LY Y + TS+ R +Y + Y Sbjct: 178 -SFP-DMKFEDLAIMSQLIYKADKIAVINDALYYYRIRKTSIVRNI-CLQTQNDYMKAY- 233 Query: 241 KITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEP---DILTRQRMIAEIFTS 297 + RL K Y+ Y F + L V + KE + + M + F Sbjct: 234 ALIRLFLKDTGEYSRLKRYYFYFSLKAYIVMLAVNIMLFKEYPSFKLFLQNTMSIKSFMK 293 Query: 298 GMYKRLITNVRSVKVGYQALLWSFRLWQW 326 + + + Y L ++ Sbjct: 294 ACREDNFKMTKEDILNYNVLSSEVKMSIL 322 >UniRef50_A6BIH4 Putative uncharacterized protein n=1 Tax=Dorea longicatena DSM 13814 RepID=A6BIH4_9FIRM Length = 331 Score = 258 bits (661), Expect = 2e-67, Method: Composition-based stats. Identities = 70/328 (21%), Positives = 144/328 (43%), Gaps = 13/328 (3%) Query: 1 MMNST---NKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYA 57 M N+ LS+I+P+YN +TC+ SL+ Q A EII+++DGSTD+S I YA Sbjct: 3 MNNNKRQDTVLSIIVPVYNVEKYLKTCINSLLEQKLDAYEIILVDDGSTDSSGGICDEYA 62 Query: 58 ENYPHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDV 117 + + ++++H+ N G S ARN GIE A GKY+ FVD+DD + P MY+ L+ +A + + Sbjct: 63 KKHEKIQVIHKKNGGLSSARNTGIENAVGKYIGFVDSDDYIMPEMYKNLIEVAKMCNAQM 122 Query: 118 AQCNADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIV 177 ++ SI +++ + + + +R V +Y ++ Sbjct: 123 VMSRYFCFENQSDLEKISINLKEVKNKIRVYNTEDVLKEFFTRNIPESVCSNLYSSELWS 182 Query: 178 KNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQR 237 K ++F+ G ++D + + + + + Y Y S++ ++ Sbjct: 183 K--LRFVEGEINEDTNVIYKLLVGSKKTVVMDAQFYGYRKRYGSITNSGYSEK----FKV 236 Query: 238 HYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTS 297 ++ L + R + D + P + + + +++ + + E++ Sbjct: 237 VTEHMSYLENDVKRKHPD---VMPYLYHFVGTHYFCLLNSILDSENFELYKSEY-ELYRK 292 Query: 298 GMYKRLITNVRSVKVGYQALLWSFRLWQ 325 + + +R K ++ + + L Sbjct: 293 EFKRLVYYFIRWEKFQWKEYVIALLLIL 320 >UniRef50_Q93DZ8 EpsG n=2 Tax=Lactobacillales RepID=Q93DZ8_STRTR Length = 322 Score = 258 bits (661), Expect = 2e-67, Method: Composition-based stats. Identities = 74/335 (22%), Positives = 143/335 (42%), Gaps = 30/335 (8%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S++IP+YN D + C++S+++QT++ LEII+++DGSTD S I YY+EN ++++H Sbjct: 3 ISIVIPVYNVQDYIKKCLDSILSQTFSDLEIILVDDGSTDLSGRICDYYSENDKRIKVIH 62 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 AN G S ARN GI+ AT +++ F+D+DD V E L + + D++ + + Sbjct: 63 TANGGQSEARNVGIKNATSEWITFIDSDDYVSSDYIEYLYNLIQVHNADISIASFTYI-- 120 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 T + I ++ +R L + + VW +YR + N KF++G Sbjct: 121 ----TPKKIIKHGNGEVALMDAKTAIRRMLLNEGFDMGVWGKMYRTEYF--NKYKFVSGK 174 Query: 188 HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRLLE 247 +D + T + A + + +Y Y S K + + + ++ Sbjct: 175 LFEDSLITYQIFSEASTIVFGAKDIYFYVNRKNSTVNGTFNIKK-FDLIEMNEEANKFIK 233 Query: 248 KLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPD--------------ILTRQRMIAE 293 + E H+++ + + V + +L + + I Sbjct: 234 H------KFPDLSSEAHRRMIWAYFSTLNQVLSSTNEHDIDLYAPQLVAYLLKQDKFIKR 287 Query: 294 I-FTSGMYKRLITNVRSVKVGYQALLWSFRLWQWR 327 F K +++ + +W+ L R Sbjct: 288 NTFIPKRDKIAFFILKNFGLKTYRNVWNLYLKMTR 322 >UniRef50_A3KLK8 Cps5J protein n=2 Tax=Streptococcus agalactiae RepID=A3KLK8_STRAG Length = 321 Score = 258 bits (660), Expect = 2e-67, Method: Composition-based stats. Identities = 73/321 (22%), Positives = 133/321 (41%), Gaps = 5/321 (1%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 K+S+IIP+YN C++S+I QT+ LEII+I+DGSTD S + A+ + Sbjct: 1 MEKISIIIPVYNVKKYLNDCIQSVINQTYNNLEIILIDDGSTDGSGDYCDEIAKKDSRIF 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 + H+ N G S ARN GI+++TGKY+ F+D+DD + ETL +E D+A N Sbjct: 61 VYHKTNGGLSEARNVGIKISTGKYITFIDSDDYIENLYIETLYNSLIEYKADIAIVNKYH 120 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLR-MGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 + + R + + + + + +YR+++ ++I+F Sbjct: 121 LKDNEKIVHLANSKELFRVFDTKMILESMYSRENNFISTLTMAQLKLYRKELF--SDIEF 178 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKIT 243 G ++D T + A R + + LY Y + S++R Q + NL+ Y + Sbjct: 179 PIGKKYEDSFTTYKTYIEANRIVFVNRPLYAYRFRSGSITRSGYQLS-NLDVLEMYEERL 237 Query: 244 RLLEKLNRNYADKIMIY-PEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKR 302 + + Y Q + + + + +I +K+ Sbjct: 238 EYFRERGYDLLSTKYHYCIAIANQKELLKKHFPNNPKIYKLDSQFREIYCDILNELPFKK 297 Query: 303 LITNVRSVKVGYQALLWSFRL 323 I + + L + Sbjct: 298 RIKLGIIRFLPFIKRLKYQFI 318 >UniRef50_A6BHF1 Putative uncharacterized protein n=1 Tax=Dorea longicatena DSM 13814 RepID=A6BHF1_9FIRM Length = 352 Score = 258 bits (660), Expect = 2e-67, Method: Composition-based stats. Identities = 71/334 (21%), Positives = 144/334 (43%), Gaps = 13/334 (3%) Query: 5 TNKLSVIIPLYNAGD-DFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 K+SVI+P+YNA C++S+I Q + +E+I+++DGST++S E+ YA+ + Sbjct: 1 MEKISVIVPIYNAEKKYLTECVDSIIGQDYKNIEVILVDDGSTNSSGEVCDGYADKDERI 60 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 +++HQ N G SVARN GI+ ATG+++AFVD DD + ++ +M E D+ N Sbjct: 61 QVIHQKNQGVSVARNTGIDQATGEWIAFVDVDDWLEKNIFSEVMGQIKEKKTDLVVWNMY 120 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWT--------HVVWMGVYRRDV 175 + + Q+ + + LR+ + + + + +Y++ + Sbjct: 121 MNYSNVEKVAQNYKETFYTDSKIEIEEIRLRLLRTISIYNDEKNITTINYPFCHLYQKKI 180 Query: 176 IVKNNIKFI-AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLN 234 I + N++F +D ++ ++ Y + LY Y H+ SVS + ++ Sbjct: 181 IQQWNVRFDSEFKQGEDKLFNYQYFTKINSILYINKPLYHYRQHSLSVSHKFYKEHEENT 240 Query: 235 YQRHYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEI 294 +K +E L ++ Y + + +R + + + + + + Sbjct: 241 --TRILKKYYEIEPLIQSNKKYKDTYNIRTAYLAWWLIRKYY-LHENSTVKKPLKEFENM 297 Query: 295 FTSGMYKRLITNVRSVKVGYQALLWSFRLWQWRD 328 S Y I + + + L + Sbjct: 298 LKSSPYSESIKKLNVADMKFSLTKIRLILLKLHM 331 >UniRef50_A5UMT7 Glycosyltransferase, GT2 family n=2 Tax=Methanobrevibacter smithii RepID=A5UMT7_METS3 Length = 377 Score = 258 bits (660), Expect = 2e-67, Method: Composition-based stats. Identities = 61/297 (20%), Positives = 126/297 (42%), Gaps = 1/297 (0%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 ++SV++P+YN R C++SL QT+ E+I +NDGS D+S +I + YA + Sbjct: 1 MPQISVVVPVYNVEKYLRECLDSLANQTFEDFEVICVNDGSDDSSPDILEEYASEDERFK 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++ Q N G S ARN G+ G+Y+ F+D+DD + E L A + ++ + Sbjct: 61 IVSQENKGLSGARNTGMNYIKGRYLLFLDSDDWLELNALELLYNHANALNSEMVIFPYRY 120 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWM-GVYRRDVIVKNNIKF 183 +ET + ++ T + ++ + + +Y + K +KF Sbjct: 121 FNQETKQYEENDFTKLNMFDSSVDNKNFNYKNIPETVFRIPHESIKLYDVKTLKKLAVKF 180 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKIT 243 GL+++D + + F + +Y Y + N S+ + + ++ IK Sbjct: 181 PEGLNYEDAYFFYKIFFKLNKVSIIRTPIYNYRIRNDSICTTGTEKSFDIFKILTSIKNF 240 Query: 244 RLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMY 300 +++ ++ D+ +++ + + Y L R I + + + Sbjct: 241 LKEDEIYESFKDEFILFTVINLKFVYLRLDERFRDRYLEKIKKNYEFFSLNQVNKNH 297 >UniRef50_A8RK62 Putative uncharacterized protein n=1 Tax=Clostridium bolteae ATCC BAA-613 RepID=A8RK62_9CLOT Length = 364 Score = 258 bits (660), Expect = 2e-67, Method: Composition-based stats. Identities = 66/342 (19%), Positives = 143/342 (41%), Gaps = 13/342 (3%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M+ +S+I+P+Y C+ S++ QT+ +E+I+I+DGS+D EI +A Sbjct: 1 MIKGNPLVSIIVPVYGVEAYLDQCINSIVEQTYQNIELILIDDGSSDRCPEICDAWATRD 60 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 +++LH+ N G S ARN G++ A G+++ FVD+DD + P L+ ++L ++ D++ Sbjct: 61 CRIQVLHKENGGQSSARNIGLDTAKGEWIFFVDSDDWIEPECISVLVEISLVNNADISVI 120 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN 180 + + L+ D + + V +YR D++ + Sbjct: 121 -YPQNHKGNMVIPKPFFLKDKGDVYCLSAEDAVTYFMEQ---AVAVMGKLYRADILQ--D 174 Query: 181 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYI 240 I+F G ++ + + A + ++LY Y + N S S + + + Sbjct: 175 IRFPVGRKAEEYIVQLAALKKAKIIAFCNRNLYDYLVRNDSDSHDIKPKYR----VDNIQ 230 Query: 241 KITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFT---S 297 I+ L+ R++ + + + +E V ++E + A Sbjct: 231 AISEALDICRRDFRCEEEWAFRWLCALLHEFYSVSEFAKEEKKKYNDILVYALSQVGGME 290 Query: 298 GMYKRLITNVRSVKVGYQALLWSFRLWQWRDKTRSHHRITRS 339 +YK++ + + + +++ + + + RS Sbjct: 291 QIYKKMEQPLDRIIYVANQYQKILKEEEYKILQKQYREVYRS 332 >UniRef50_D0BKT7 Capsular polysaccharide biosynthesis protein CpsI n=1 Tax=Granulicatella elegans ATCC 700633 RepID=D0BKT7_9LACT Length = 328 Score = 258 bits (659), Expect = 3e-67, Method: Composition-based stats. Identities = 64/322 (19%), Positives = 137/322 (42%), Gaps = 14/322 (4%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M +S+I+P+YN + C++S+ QT+ +E+I++NDGSTD S EI + + Sbjct: 1 MEYDKLISIIVPIYNVEEYLEECLDSIQCQTYNNIEVILVNDGSTDASKEICERFCRQDS 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 L++Q N G S ARNRG+ + G+++ FVD+DD + M E L+ DD+D+ +C Sbjct: 61 RFHLINQENKGLSGARNRGMSESHGEFITFVDSDDVLKEDMLEQLIKQVTIDDIDIVEC- 119 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNI 181 + + + ++ + L ++RR+VI+ N Sbjct: 120 ---WYTNDQKEILIPSPENVKIIFQGNTKEALVSLCRDNIVRLNAVAKLFRREVIL--NF 174 Query: 182 KFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIK 241 F+ GL ++D+ + R + Y Y + S+ + Sbjct: 175 PFLEGLFYEDVYGGMGILKQIRRMVKIDYIGYYYRVRQGSIMNREFNLKN--------LD 226 Query: 242 ITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYK 301 + + +K+ + Y + + P ++++ + L +++ + + Sbjct: 227 LFTICDKVEQLYEGDVELLPYIYRRLFHLVLMHVVDYHIFEGNPYQEKYVEYLNRYAKSS 286 Query: 302 RLITNVRSVKVGYQALLWSFRL 323 +R+ ++ + +++ R+ Sbjct: 287 SPSFLMRAYRLFPKNIVFISRV 308 >UniRef50_Q74L33 Putative uncharacterized protein n=1 Tax=Lactobacillus johnsonii RepID=Q74L33_LACJO Length = 292 Score = 258 bits (659), Expect = 3e-67, Method: Composition-based stats. Identities = 77/304 (25%), Positives = 137/304 (45%), Gaps = 17/304 (5%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 K+SVIIP+YN C++S++ QT++ LEII+++DGSTD++ E+ + Y E Y ++R Sbjct: 1 MEKISVIIPVYNDEKYLAQCLDSVLRQTYSNLEIILVDDGSTDSTPELCEKYREKYANIR 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 +LH+ N G +RN G+E+ATG+Y+ FVD DD + T E L + ++D D+A N + Sbjct: 61 ILHKKNGGVGSSRNAGLEMATGEYILFVDHDDLLGETHIEELYKLLKKNDADIAVGNFNH 120 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHV-----VWMGVYRRDVIVKN 179 E + D T +W + + + W +Y++ + Sbjct: 121 FIEEKRAYGIWLKEDDYF-EKTYTPEEWFTVEYETVPYNMSIIFAVPWAKLYKKSLFE-- 177 Query: 180 NIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHY 239 NI + +D + T + A + Y +++Y + + SVS + Sbjct: 178 NIVYPINARVEDDLTTWKIYLLADKIAYMNKAIYTHRIFENSVSAQANKTAVFP--LEAV 235 Query: 240 IKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGM 299 + LL +L + +I Y Y K+ D L + ++ Sbjct: 236 EERISLLSQLGFDITTEIEAY-------HYRLNICIDTALKDGDYLKYRNAKQKLAILKK 288 Query: 300 YKRL 303 YK++ Sbjct: 289 YKKI 292 >UniRef50_C7H3X5 Putative capsular polysaccharide biosythesis protein CpsI n=1 Tax=Faecalibacterium prausnitzii A2-165 RepID=C7H3X5_9FIRM Length = 310 Score = 258 bits (659), Expect = 3e-67, Method: Composition-based stats. Identities = 61/306 (19%), Positives = 128/306 (41%), Gaps = 12/306 (3%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 ++SVI+P+Y C+ES++ QT+ E+I+++DGS DN + +AE +R Sbjct: 1 MPEISVIVPVYKVEKYLDRCIESIVNQTYPDFELILVDDGSPDNCPALCDAWAERDGRIR 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALED-DLDVAQCNAD 123 ++H+ N G S ARN G++V GKYV F+D+DD + E L ++ + + C + Sbjct: 61 VIHKKNGGLSSARNAGMDVMCGKYVCFIDSDDWIELNFLEVLRSLFEKYPQAKITACASR 120 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGL--SSRRWTHVVWMGVYRRDVIVKNNI 181 V + VW + + ++ N Sbjct: 121 -----QDAIPDFQVIQPKEYITVYDKRKMFDYFFRVNGEPSNTGVWNKLIKTSILE--NF 173 Query: 182 KFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIK 241 ++ ++D+ EF A T Q L+ Y+++N+ ++R K+L+Y + + Sbjct: 174 RY-VITLNEDVEACYEFYTRADFMVETNQILHHYFVNNSGITRS-TFSEKDLDYLSVWDR 231 Query: 242 ITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYK 301 + E+ +Y ++ + + + +K ++ + + + ++ Sbjct: 232 VVTRTEQEQPDYLRFAILARKRANFTMLSKMLIRGYDKKNKNLKSIYKRLKSEVRKDYFQ 291 Query: 302 RLITNV 307 L + + Sbjct: 292 LLTSKM 297 >UniRef50_Q5M3K9 Glycosyl transferase n=4 Tax=Streptococcus RepID=Q5M3K9_STRT2 Length = 697 Score = 257 bits (658), Expect = 4e-67, Method: Composition-based stats. Identities = 71/340 (20%), Positives = 132/340 (38%), Gaps = 13/340 (3%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 T K+SVIIP+YN + +TC++S+++QT+ LEII+IN GSTDNS I + YA + Sbjct: 2 TEKISVIIPVYNVENQLQTCLDSVLSQTYQNLEIILINYGSTDNSDAICRSYAARDSRIL 61 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 + N G S AR+ GI A G YV +VDA+D V PT E L D+A N Sbjct: 62 YFKKDNGGLSDARHIGIRQAKGTYVTYVDAEDWVEPTYLEELYKALQAHKADIAVANYS- 120 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTH----VVWMGVYRRDVIVKNN 180 ++E+ + + D V + + + +Y+R + N+ Sbjct: 121 VYKESEDLFYFYIKDEDYYEQVYSPAQIIDGLFEETNNINIALISATGKLYKRSLF--ND 178 Query: 181 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNT--SVSRLHRQGNKNLNYQRH 238 + F +D + + + R Y + LY Y S + + + Sbjct: 179 LLFPKEHAGEDGFFNLKAYLMSERTVYLNKGLYVYRESPEMPSAT---WMQDWMMTLVYA 235 Query: 239 YIKITRLLEKLNRNYADKIMIYPE-FHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTS 297 + ++ ++ Y + + + I + ++++ Sbjct: 236 MEERLAIVASHGFPLEKYMVTYRQMLEACLKNVEEQGLRDSDAYRSIQEKFQVLSLAPQR 295 Query: 298 GMYKRLITNVRSVKVGYQALLWSFRLWQWRDKTRSHHRIT 337 K+ + + +L + + + + + I Sbjct: 296 YETKKRAIVLAANYTYVDQVLTTIKSIVFHHRNIRFYLIN 335 >UniRef50_Q8XN37 Capsular polysaccharide biosynthsis protein n=1 Tax=Clostridium perfringens RepID=Q8XN37_CLOPE Length = 326 Score = 257 bits (658), Expect = 4e-67, Method: Composition-based stats. Identities = 70/327 (21%), Positives = 145/327 (44%), Gaps = 4/327 (1%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 K+S+IIP+YN + C+++ + QT +EII I+DGSTD S +I YA+ Y ++ Sbjct: 1 MVKVSIIIPIYNVENYLDKCLDTAVNQTLGCIEIIGIDDGSTDKSRDILNKYAKKYSFIK 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++++ N G S ARN GI A G+Y+ F+D+DD + + +A ++ LD+ +A+ Sbjct: 61 IINKENGGLSSARNTGIINAKGEYIYFLDSDDYIEYDSMKICYEIATKNKLDIVSFDAES 120 Query: 125 CFRETG--ETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIK 182 G + + + + G ++ + ++ +T + YRR+ ++ NN+ Sbjct: 121 FVDGEGFTKKLKKYDRKDILDGNIKMGCEYFCELIKNKCYTASACLNFYRREFLIDNNLF 180 Query: 183 FIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKI 242 F G+ H+D ++T + A + RY LY L + S++ + Sbjct: 181 FYDGILHEDELYTVKAYTLANKVRYIPYKLYFRRLRSNSITTQSISEKHLYGRLIAARES 240 Query: 243 TRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKR 302 R + + I+ + + + I ++ +I + + + + Sbjct: 241 YRHYLGYKDVFDNDTN--TCLLNHISNMYYFMLKDIYENKRIYNKRYIIEDKIINDIKSK 298 Query: 303 LITNVRSVKVGYQALLWSFRLWQWRDK 329 + + Y+ L + + ++ K Sbjct: 299 KDIKSFKLDLQYRFPLIIYTIKKYLKK 325 >UniRef50_B7HXA1 N-acetylglucosaminyltransferase n=2 Tax=Bacillus cereus RepID=B7HXA1_BACC7 Length = 353 Score = 257 bits (657), Expect = 5e-67, Method: Composition-based stats. Identities = 76/324 (23%), Positives = 137/324 (42%), Gaps = 9/324 (2%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S+I+P+YNA C++SL+ QT+ +EII+INDGSTDNS +I YA +R++H Sbjct: 6 VSIIVPVYNAEKYVHKCIDSLLQQTYKNIEIILINDGSTDNSGKICDEYANESSKIRVIH 65 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 Q N+G S RN GI+VA G Y+ FVD+DD V P M E ++ + + C Sbjct: 66 QKNSGPSATRNVGIDVANGAYIQFVDSDDYVEPNMTEEMLKAM-NNKFQLVICGYKSLSL 124 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 + + + L + + +W +Y +I +N+++F + Sbjct: 125 FNKDINIKEHKCHISGGYDYSSFMMEFGYLFRDVYINPLWNKLYNTALIKENHLRFKDNV 184 Query: 188 H-HQDIVWTTEFMFNALRARYTEQSLYKY-YLHNTSVSRLHRQGNKNLNYQRHYIKITRL 245 + +D+++ E++ + LY Y +++ S++ Q+ R Sbjct: 185 YMGEDLLFNLEYIKKCNYINVINKYLYNYAVINSESLT--ISFKKDFFKTQQMLFTAIRE 242 Query: 246 LEKLNRNYADKIMIYPEF-HQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLI 304 K++ Y D E + L + R+ I+ I + + Sbjct: 243 FLKVDYFYNDSNKKMIEIAYTDTVVSCLEHLFHKDSNLTVNLRKENISSIVNNNCTRE-- 300 Query: 305 TNVRSVKVGYQALLWSFRLWQWRD 328 N+ K G + +L + Sbjct: 301 -NIGYFKEGNIQKRFIGQLISLKS 323 >UniRef50_C8WM64 Glycosyl transferase family 2 n=1 Tax=Eggerthella lenta DSM 2243 RepID=C8WM64_EGGLE Length = 333 Score = 257 bits (657), Expect = 5e-67, Method: Composition-based stats. Identities = 79/276 (28%), Positives = 120/276 (43%), Gaps = 14/276 (5%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 +SVIIP+YN C+ES+I QT++ +EI+++NDGSTD S YA + V Sbjct: 2 NPLVSVIIPVYNVESYVSECIESVIAQTYSNIEIVVVNDGSTDGSGFSCDQYACSDSRVV 61 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++H+ N G S ARN GI V G +VAFVD DD V P ETLM D ++A Sbjct: 62 VVHKENEGLSAARNAGIAVCRGDFVAFVDGDDFVSPVFIETLMHAIEVCDCEIAAIPCGT 121 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 F + + V+ ++ L R T V W +Y R ++ F Sbjct: 122 AFEDGSSCELVAKAAFIPDARVMDSYTVQKLMLYQRLDTGVPW-RLYARRILGDA--PFA 178 Query: 185 AGLHHQDIVWTTEFMFNALRARYTE-QSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKIT 243 GL+++D+ +F+ + R + ++LY Y L + + IK Sbjct: 179 VGLYYEDLASVYKFIHDVDRVALVDCRALYAYRLRKSGIISQAYSP----------IKAL 228 Query: 244 RLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVR 279 +E R +D YP+ +C V Sbjct: 229 SAIEVSRRLSSDMQEWYPDLAVASASRCFSLCRMVY 264 >UniRef50_A7VUZ8 Putative uncharacterized protein n=1 Tax=Clostridium leptum DSM 753 RepID=A7VUZ8_9CLOT Length = 343 Score = 257 bits (657), Expect = 5e-67, Method: Composition-based stats. Identities = 72/324 (22%), Positives = 125/324 (38%), Gaps = 15/324 (4%) Query: 3 NSTNKLSVIIPLYNA-GDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 ++ +SV++P+Y + R C+ SL+ Q+ +EII+++DGS D EI AE Sbjct: 5 DNPILVSVVVPVYKIKEEYLRACVGSLLEQSLKEIEIILVDDGSPDRCGEICDKLAETDC 64 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 +R++HQ N G S ARNRG+E A GKY++FVDADD + P E D+ Sbjct: 65 RIRVIHQRNKGVSAARNRGMEQARGKYLSFVDADDWIDPDTLAEAYRNGNERQADIVFWC 124 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNI 181 + P + L G + +Y DVI Sbjct: 125 YQKEYVNRSVPAVPYPQNGLIY-HAGEGIEEFDPFFM--GLLGSSCCKLYAADVIK--GC 179 Query: 182 KFIAG-LHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYI 240 ++ + +D+ + +A Y + Y Y S R + Y Sbjct: 180 RYDERLKNGEDVEFNFRVYRGLKKAVYLNRPFYHYRQLGNSAVRNYNPK-----MMEQYG 234 Query: 241 KITRLLE-KLNRNYADKIMIYPEFHQQITYEAL--RVCHAVRKEPDILTRQRMIAEIFTS 297 +E LN N ++ + + Y L ++ ++R + EI + Sbjct: 235 LTLAAIESDLNENREERFIAAFHTFTGLCYLLLASQLVFVKENPAPFRDKRRQMREITDT 294 Query: 298 GMYKRLITNVRSVKVGYQALLWSF 321 Y+ +I + + + + + F Sbjct: 295 EPYRSVIAGNQGIHLSIEKRVALF 318 >UniRef50_Q02W60 Glycosyltransferase n=1 Tax=Lactococcus lactis subsp. cremoris SK11 RepID=Q02W60_LACLS Length = 301 Score = 257 bits (657), Expect = 5e-67, Method: Composition-based stats. Identities = 76/313 (24%), Positives = 130/313 (41%), Gaps = 20/313 (6%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 K+SVI+P+YN C+ S++ Q++ E+++I+DGS D+S +I YA+ VR Sbjct: 1 MPKVSVIVPVYNVEKLLERCLNSVLNQSFQDFELLLIDDGSKDSSGQICDNYAKKDQRVR 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 + H N G S ARN GI+ G Y+ F+D+DD V E L +E + DV C Sbjct: 61 VWHIPNGGQSAARNLGIDNVYGTYIVFIDSDDFVELDYLEQLYQPMVEYEADVVSCRYIA 120 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRW---THVVW---MGVYRRDVIVK 178 E E + I + + + + + W +Y+ +++ + Sbjct: 121 YSEERDEEVKKIIQENQAKVYHFRNNQDVMQDFLEKEFKDDRNFPWESYANLYKTELLGE 180 Query: 179 NNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRH 238 I+F G +D + ++ +A T Y YY + TS +R Sbjct: 181 --IRFEVGREFEDNFFNYQYFKKVQKAIATSYIGYYYYANPTSKTR-----------ITF 227 Query: 239 YIKITRLLEKLNRNYADKIMIYPEFHQ-QITYEALRVCHAVRKEPDILTRQRMIAEIFTS 297 Y K L+++ D + YPEF + AL+ C K + Sbjct: 228 YEKQFDLIQQERLIIEDVKIKYPEFIDLEANNLALKYCWLYGKIFGDIRFSNFSVSKKIF 287 Query: 298 GMYKRLITNVRSV 310 G +K I ++++ Sbjct: 288 GSFKVFIKRIKNI 300 >UniRef50_B0P3X3 Putative uncharacterized protein n=1 Tax=Clostridium sp. SS2/1 RepID=B0P3X3_9CLOT Length = 1145 Score = 257 bits (657), Expect = 5e-67, Method: Composition-based stats. Identities = 68/343 (19%), Positives = 137/343 (39%), Gaps = 12/343 (3%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 K+SVIIP+YN + C++S++ QT L++I+I+DGS D S +IAK +A Y H Sbjct: 3 EIKVSVIIPIYNVEEYLEECLQSVVDQTLEDLQVIMIDDGSLDGSTDIAKKFASRYDHFE 62 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 + Q N G ARN G++ A GKY+ F+D+DD V YE + A ++ ++ + Sbjct: 63 YVRQVNGGLGNARNTGVKYAKGKYIIFLDSDDIVPDDAYEKMYLAAEKNHSEMVVGSVAR 122 Query: 125 CFRETGET---WQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNI 181 + + + T + D + + W + ++ +N+ Sbjct: 123 FDSKKDHISNLHEIAFRKYIDKTHMTDNTDLI--------YDTTSWNKLILKEFWDRNHF 174 Query: 182 KFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIK 241 +F + ++DI T + A + Y++ + + ++ Q + R I Sbjct: 175 EFPERILYEDIPVTIPMHYLANNVTMVQDVCYRWRIRDG-ANKSITQRADDFTNMRDRIT 233 Query: 242 ITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYK 301 + R+++K + ++ + + Y L + M+ I Sbjct: 234 VLRMVDKFFEENVKEQELWDAKYYKWLYIDLMIYVNNCIYLSDNRTLEMMKIIKDYIEET 293 Query: 302 RLITNVRSVKVGYQALLWSFRLWQWRDKTRSHHRITRSAFNLR 344 + + + V Y+ + + + + NL+ Sbjct: 294 IPLETIDKLPVLYREKYVALMNLDEKRLVKLRQYEVDNYKNLK 336 >UniRef50_C0CHF0 Putative uncharacterized protein n=2 Tax=Blautia hydrogenotrophica DSM 10507 RepID=C0CHF0_9FIRM Length = 348 Score = 257 bits (657), Expect = 5e-67, Method: Composition-based stats. Identities = 77/351 (21%), Positives = 131/351 (37%), Gaps = 36/351 (10%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP-HVRL 65 K+S+I+P+Y CM+SL+ QT +EII +NDGS DNS+ I + Y + P V + Sbjct: 5 KVSIIVPVYKVEQYLGQCMDSLVGQTLEEIEIIAVNDGSPDNSLAILQEYRDRDPEKVNV 64 Query: 66 LHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWC 125 N G S ARN G+ A G+Y+ FVD+DD + + L A D D+ Sbjct: 65 FSIENQGVSHARNYGVSKAKGEYLLFVDSDDYLEKNACQKLYEHAKLHDNDLVLFGRYNL 124 Query: 126 FRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIA 185 +P S R + W + RR++ + F Sbjct: 125 REGGETEKLYLPVVEDFSLREYKEEMA--------RLSPFPWDKLIRRELFQEIG-GFPE 175 Query: 186 GLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRL 245 G+ +D+ +A + + LY Y + H + I Sbjct: 176 GIRFEDLPVAHMLAVSARKIGVVSECLYHYRVQAG----------FLNTLTEHTLDIVTA 225 Query: 246 LEKLNRNYADKIMIYPEFHQQITYEALRVC-------HAVRKEPDILTRQRMIAEIFT-- 296 L K+ N+ +Y E+HQ++ Y +R C ++ + + R++ ++ Sbjct: 226 L-KILVNFLKDKGLYQEYHQEVEYICVRHCCYRFVNLRKFTEKGKLDLQMRLVQLVYDFL 284 Query: 297 ---SGMYKRLITNVRSVKVGYQALLWSFRLWQWRDKTRSH---HRITRSAF 341 ++ R + VG + L+ + K I R Sbjct: 285 ESEFPKWRENPYVRRKMAVGMKHFLFVCERPKLMMKFVQKTDGKSILRKKL 335 >UniRef50_B5CU67 Putative uncharacterized protein n=1 Tax=Bacteroides plebeius DSM 17135 RepID=B5CU67_9BACE Length = 331 Score = 257 bits (657), Expect = 5e-67, Method: Composition-based stats. Identities = 77/333 (23%), Positives = 139/333 (41%), Gaps = 14/333 (4%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +SVI+P+YN + C+ES++ QT+ LEII+++DGS+D S EI YY + ++H Sbjct: 2 ISVIVPIYNVAEYLPACIESILNQTYKNLEIILVDDGSSDKSGEICDYYVKRDTRCIVIH 61 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 Q+N G S ARN G+E ATG+Y++F+D DD ++P M TL ++D D + + Sbjct: 62 QSNKGLSEARNAGLECATGEYISFIDGDDYIHPKMLATLYEALQQEDYDFSMIAHTQTWN 121 Query: 128 ETGETWQSIPTDRLRSTGVLTG---PDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 E + + T L + VW +YRR++I + +FI Sbjct: 122 HHIEKEPKMINSIIFETSDLMQGLYNVSSKKISLPECNFQAVWNKLYRRNLIGSS--RFI 179 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 + +D + + +A + +Q LY + +S++ N+ Sbjct: 180 T-IATEDTEFNNKIFLKTQKAIFLDQCLYYWVQRPSSITHQSVNK----NFIDRAYSYLF 234 Query: 245 LLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLI 304 L ++ +N+ + Y + + Y+ + +I AE I Sbjct: 235 CLNQIPKNHKE----YRAYCLEKLYKTAVNVQYRARNTEIYNYAVKTAEELRVKTINEFI 290 Query: 305 TNVRSVKVGYQALLWSFRLWQWRDKTRSHHRIT 337 +N +LL + + + Sbjct: 291 SNTHINIFHKISLLVFYYIPFTYHIFIRLLELK 323 >UniRef50_UPI0001C4194D glycosyl transferase GT2 family n=1 Tax=Methanobrevibacter ruminantium M1 RepID=UPI0001C4194D Length = 817 Score = 256 bits (656), Expect = 6e-67, Method: Composition-based stats. Identities = 77/337 (22%), Positives = 141/337 (41%), Gaps = 4/337 (1%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 + K+SVIIP+YN C++S+I QT +EII I+DGS+DNS+EI Y Sbjct: 360 SENKIKVSVIIPIYNNEKFLSKCLDSVINQTLNEIEIICIDDGSSDNSIEILNQYVLKDS 419 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 ++++ Q N GA+ ARN G+++A G+Y+AF+D+DD + +E L ++ D+ N Sbjct: 420 RLKIISQENLGAATARNNGLKIAKGEYIAFLDSDDWLELNAFEKLYENITTNNSDLVLFN 479 Query: 122 ADWCFRE-TGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN 180 + + I D + T + L + +W +YR + +NN Sbjct: 480 SIEHKENANLKERIHIKNDSIPDYNYYTFNYNYKKDLVMNGYLD-IWSKMYRTSFLKENN 538 Query: 181 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKY-YLHNTSVSRLHRQGNKNLNYQRHY 239 I+F DI + + M NA + Y + LY Y +++ S+ GN++ Sbjct: 539 IQFSNHQIFNDIQFHIKTMLNAKKISYCPEFLYNYLRINHPSLQNNLSLGNESFILLDII 598 Query: 240 IKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGM 299 +I L N Y + + F L + + ++ + Sbjct: 599 DEIEDYLID-NEFYNELKSNFIRFKLTELESTLEKLENPYRNEFFKLIKNNFKKMQLTEY 657 Query: 300 YKRLITNVRSVKVGYQALLWSFRLWQWRDKTRSHHRI 336 ++ + SF + ++ + ++ Sbjct: 658 QRKELPPENYQFFNDVLTYDSFFEYALKNSEKERQKL 694 Score = 192 bits (488), Expect = 2e-47, Method: Composition-based stats. Identities = 63/339 (18%), Positives = 136/339 (40%), Gaps = 31/339 (9%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTW--TALEIIIINDGSTDNSVEIAKYYAENYPH 62 K+S+IIP+YN + + S+I+Q+ LE+I+++D STDNS I K Y Y + Sbjct: 2 NYKISIIIPVYNVENYIEKSLNSIISQSIGIENLEVILVDDNSTDNSANIIKKYVSKYDN 61 Query: 63 VRLLHQANA-G-ASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 + ++ G RN G+ AT +Y+ ++D+DD + T E L + ++ D+ Sbjct: 62 FKGIYCDIGSGFCGRPRNIGLSYATSEYIMYLDSDDWLEETACEVLYNTIINENADIVCG 121 Query: 121 NADWCFRETGETWQSI-------PTDRLRSTGVLTGPDWLR-----------MGLSSRRW 162 + E + + +T + T + + + Sbjct: 122 SQTRLDNEGNRKFYYHLWVTTLTDPNEDYNTRMKTTQEIIDDPNFKLVVTDLDKNPNILG 181 Query: 163 THVVWMGVYRRDVIVKNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKY---YLHN 219 VW ++++D+I +N + F + QD V+ F A + + + Y + Sbjct: 182 HANVWGKIFKKDLITENELSFPEDIVAQDSVFLLNSFFVAEKIVFINDIIVHYNNLRCDD 241 Query: 220 TSVSRLHRQGNKNLNYQRHYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVR 279 S + + KNL IK L++ +++ ++ + Y + + ++ Sbjct: 242 DDKSASYVKTTKNL---FGRIKAYDLMDNISKKFSKEEFFYRYL---LVGKLNYWFNSFL 295 Query: 280 KEPDILTRQRMIAEIFTSGMYKRLITNVRSVKVGYQALL 318 + +I T + + S ++ +++ + + Sbjct: 296 MDSNISTYEIKLLFKKYSHLFSNCYKFNTNLRKDIKNIF 334 >UniRef50_B2DG34 Eps5I n=3 Tax=Streptococcus pneumoniae RepID=B2DG34_STRPN Length = 333 Score = 256 bits (656), Expect = 6e-67, Method: Composition-based stats. Identities = 77/339 (22%), Positives = 145/339 (42%), Gaps = 16/339 (4%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M+ L+++IP+YN + C++S+I+Q W E+I++NDGSTD S I + YA+ Y Sbjct: 1 MIIQNILLTIVIPVYNVEKYLKRCIDSVISQEWDKYEVILVNDGSTDASPNICEEYAQKY 60 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 + ++H+ N G S RN G+ A GKYV F+D+DD + M+ L M +E + D+ Q Sbjct: 61 HFISVIHKENGGLSETRNTGLSHANGKYVFFLDSDDWITKDMFRNLSKMIMEQNYDILQF 120 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN 180 E E + D + LS T +Y+R++ KN Sbjct: 121 GMQMFHSEREEL-----KNVQCKEKNFNSSDAFKNMLSVEGITSFATDKIYKRELFEKNG 175 Query: 181 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYI 240 I+F G ++D+ + + +A + T Q Y Y++ N + K + + + Sbjct: 176 IEFPIGYFYEDLGTVYKLILSAKKIYLTTQVYYCYFIGNDAAITKQWSEKKISDVYKFHK 235 Query: 241 KITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDIL---------TRQRMI 291 KI + + + Y ++ + Y +++ +++ + + Sbjct: 236 KIFNVSSLMVSDDIFLSKSY--YNNGLVYLLMKLYEENQEDTQLFRLILAELDANIVPIK 293 Query: 292 AEIFTSGMYKRLITNVRSVKVGYQALLWSFRLWQWRDKT 330 K + +R +K+ L ++ +W K Sbjct: 294 YLRNYPNFLKYVFYRIRLLKLIVILKLTLTKIKKWNQKK 332 >UniRef50_Q8A821 Glycoside transferase family 2 n=1 Tax=Bacteroides thetaiotaomicron RepID=Q8A821_BACTN Length = 345 Score = 256 bits (656), Expect = 7e-67, Method: Composition-based stats. Identities = 77/346 (22%), Positives = 132/346 (38%), Gaps = 15/346 (4%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M +SVI+P+YN R C++SLI+QT +EII +ND S DNS+ I K Y Sbjct: 1 MDRELPLVSVIVPVYNVEKYLRQCLDSLISQTLQNIEIICVNDASPDNSLAILKEYEAKD 60 Query: 61 PHVRLLHQ-ANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQ 119 + ++ N ARNRGIE A Y+AFVD+DD V +YE L AL +D D+ Sbjct: 61 KRIVVVDLYENMRQGGARNRGIEYARANYIAFVDSDDWVSVDIYEKLFACALMNDADIVN 120 Query: 120 CNADWCF-RETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVK 178 C+ + + + + +++ +YR+ + Sbjct: 121 CDYYEYRGEDNIKKYIKYKELTFNMLKEIANKEFILRL-------PSPCHNLYRKAIFSD 173 Query: 179 NNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQ-- 236 N I+F + +D + A + + LY Y L+N S SR Sbjct: 174 NEIRFPEQIFWEDAAIMSLIFLLADKIVKVNEPLYYYRLNNDSTSRRKNNYRFFERLIGA 233 Query: 237 RHYIKITRLLEKLNRNYADKIMIYPEF-HQQITYEALRVCHAVRKEPDILTRQRMIAEI- 294 + + + + + Y + I + L + K+ + + +I I Sbjct: 234 NLFFQNLKAFGFYKKYKEEIDYQYINLTYDAIIWGCLNDFRPIEKKRLEMAKIDIIKNIP 293 Query: 295 --FTSGMYKRLITNVRSVKVGYQALLWSFRLWQWRDKTRSHHRITR 338 + YK + + + V + + + H + R Sbjct: 294 DFRNNKYYKSRGFSAKKIIVNCITVNTKLGIVMYILYNFLHFFVRR 339 >UniRef50_C6JQJ1 Glycosyl transferase n=1 Tax=Fusobacterium varium ATCC 27725 RepID=C6JQJ1_FUSVA Length = 337 Score = 256 bits (656), Expect = 7e-67, Method: Composition-based stats. Identities = 77/346 (22%), Positives = 151/346 (43%), Gaps = 17/346 (4%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 MM K+S+IIP+YN + C++S+I Q+ +EII INDGSTD+S+EI + Y + Sbjct: 1 MMGK--KISIIIPIYNVEKYLKQCLDSIINQSLKEIEIICINDGSTDSSLEIIQEYMKKD 58 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 + L+++ N G S ARN GI+ +TGKY+ VD+DD V L A ++ LD+ Sbjct: 59 SRIILINKKNEGPSAARNDGIKKSTGKYILQVDSDDWVEEESLSKLYGYAEKNSLDIVSF 118 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN 180 N F + Q + + T ++ + L+ R V + + + N+ Sbjct: 119 NYYREFFKESIKIQVLFE---INKNFFTSIEYFNLFLTEREIP-SVCNKLIKSTIYKNNH 174 Query: 181 IKFIAGL-HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHY 239 I + G+ +D+ T+ ++ + + Y ++ LY Y ++ S++ + Sbjct: 175 ILYPVGINLGEDLATITKLIYFSQKIGYLDEYLYHYRYNDKSITSTLTGSKRA------- 227 Query: 240 IKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQ--RMIAEIFTS 297 ++ ++ E L + +K + + + + + L + + EI Sbjct: 228 -ELFKIYENLKEFFQEKKLDLNLLYFMFLKKVSNFYTYPFERENELYLEGCKNFIEILQD 286 Query: 298 GMYKRLITNVRSVKVGYQALLWSFRLWQWRDKTRSHHRITRSAFNL 343 +K+ + + ++ + ++ F S + R L Sbjct: 287 KKFKKFFSFLDKKEILFIYIIKIFPFLFTIRCISSIFSLHRKIKLL 332 >UniRef50_D1PNS4 Capsular polysaccharide biosythesis protein CpsI n=1 Tax=Subdoligranulum variabile DSM 15176 RepID=D1PNS4_9FIRM Length = 342 Score = 256 bits (655), Expect = 8e-67, Method: Composition-based stats. Identities = 60/320 (18%), Positives = 125/320 (39%), Gaps = 4/320 (1%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M + +S++ +YN + S++ QT+T E+I+++DGS+D+S + A Sbjct: 1 MGKSPTISIVSTVYNTQPFLEQAVRSILAQTFTDFELILVDDGSSDDSGRLCDALAAEDA 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 +R+ HQ N G + ARN+G++ A G+Y+ FVD+DD + PTMYE L + +A C+ Sbjct: 61 RIRVFHQPNGGPASARNKGLDNARGRYIGFVDSDDLIEPTMYEVLYNAIQAPGVQLAACS 120 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVV-WMGVYRRDVIVKNN 180 D +R G+ + L + + + W ++ + Sbjct: 121 GDCIDETGRPI--PGRRVAIREHGIRPAEELLLETFQTGSYYGPLSWNKLFDLRLFRDKG 178 Query: 181 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYI 240 I + + D +F R LY Y ++ + L+ R Y Sbjct: 179 IHYDETMLFGDDASVLHRVFEGERCNCLSDILYHYRTRAGQITSTATFSPRKLDDLRMYW 238 Query: 241 KITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMY 300 + + ++ ++ Y+ A ++ + + + + Sbjct: 239 DWLQYFASRPGR-EEYRQWATVWYWRVFYQFWCQSGAAGNLTELKPKFMAHKKHLDAVLP 297 Query: 301 KRLITNVRSVKVGYQALLWS 320 + +++ +A+L+ Sbjct: 298 DLVRSSLLPAGEKLRAVLFC 317 >UniRef50_C1YIQ9 Glycosyl/glycerophosphate transferase, teichoic acid biosynthesis n=1 Tax=Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 RepID=C1YIQ9_NOCDA Length = 1164 Score = 256 bits (655), Expect = 8e-67, Method: Composition-based stats. Identities = 65/247 (26%), Positives = 101/247 (40%), Gaps = 8/247 (3%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 LSV++P+YN C+ SL QTW LE+++++DGS D S IA YAE P R Sbjct: 1 MVALSVVVPIYNVERYLDECLRSLQVQTWDDLEVVMVDDGSPDGSAAIAARYAEQDPRFR 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 L+ Q N G ARN GIE A G ++ FVD+DD + + + D + N Sbjct: 61 LIQQENGGLGAARNTGIEHARGDFLTFVDSDDVIPSYAFGYHLESLRRSGSDFSSGNVHR 120 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 E G + R+ T L R V+RR ++ + F Sbjct: 121 L-NELGTRQSPMHRRIFRTAKTGTHITREDTLLVDRL----ATNKVWRRSFWDRHGMTFP 175 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNT---SVSRLHRQGNKNLNYQRHYIK 241 G+ H+DI F A +Y + + S+++ + + K Sbjct: 176 VGVLHEDIPVAIPLHFLAESVDVLSAPVYLWRERPSDDKSITQDRLRPQALEDRVSAVTK 235 Query: 242 ITRLLEK 248 ++ L + Sbjct: 236 VSDFLRE 242 >UniRef50_A1YVC6 Glycosyltransferase n=1 Tax=Lactobacillus johnsonii RepID=A1YVC6_LACJO Length = 348 Score = 256 bits (655), Expect = 8e-67, Method: Composition-based stats. Identities = 85/354 (24%), Positives = 155/354 (43%), Gaps = 24/354 (6%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M +K+S+I+P+YNA C++S+ITQT+ ++II++NDGS DNS EI + + P Sbjct: 1 MTQNSKVSIILPIYNAEKYLEECVKSVITQTYQNIQIILVNDGSKDNSWEICQNLKKKDP 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 + + Q N+G SVARN G+ +A G+++ FVD DD + + E L+ + D+D+ C Sbjct: 61 RIIAITQKNSGVSVARNAGLNIADGEWIMFVDPDDVLSQDIVEKLLLK-TDSDVDIVACT 119 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWT---------HVVWMGVYR 172 + DR ++ L + L + R+ V W +YR Sbjct: 120 CYGFNENEKKLAHFFANDR----AFISDKTDLYLQLLNSRYGQSGSLITAIGVPWGKIYR 175 Query: 173 RDVIVKNNIKFIAG-LHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNK 231 R I K +KF QD V+ A + Y ++ LY Y + + Sbjct: 176 RSFIDKYELKFDPNLRRMQDNVFNMYAFDYARKIYYLDKPLYFYRIDH----VNKYNEKH 231 Query: 232 NLNYQRHYIKITRLLEKLNRNYADK--IMIYPEFHQQITYEALRVCH-AVRKEPDILTRQ 288 +++ ++ + R K IY E+ +++ L + + KE + + Sbjct: 232 IKEFKKIFLPVVREQYKCLNELGLYANERIYNEYINEVSNIYLNIIKGIILKEKNTEYIK 291 Query: 289 RMIAEIFTSGMYKRLITNVRSVKVGYQALLWSFRLWQWRDKTRSHHRITRSAFN 342 R I ++ +K + + + K+ + + +L+ K + + A N Sbjct: 292 REINQLSRKVYFKDIFVSNKKAKIQNKKI--RLKLFLITHKMEKGYILAYRALN 343 >UniRef50_C7G7U9 Putative capsular polysaccharide biosythesis protein CpsI n=1 Tax=Roseburia intestinalis L1-82 RepID=C7G7U9_9FIRM Length = 346 Score = 256 bits (654), Expect = 1e-66, Method: Composition-based stats. Identities = 76/335 (22%), Positives = 135/335 (40%), Gaps = 13/335 (3%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M T+K+SVI+P+YN + C++S+ QT+ LEI++I+DGS D S EI Sbjct: 19 MTKETDKISVIVPVYNVKNYLEDCVQSITHQTYPDLEILLIDDGSDDGSGEICDRLQMTD 78 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 +R+ H+ + G S ARN+G++ A G Y+AFVD+DD + P MY L+ + + +A C Sbjct: 79 ARIRVFHEKHKGVSGARNKGLKEAAGAYIAFVDSDDVLEPDMYTYLIRLLKTHEAQIAAC 138 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN 180 + + + P D + + G + L + VW +++R+V+ Sbjct: 139 SGWYKYDAGVGKKFESPADVV----CMNGREALGVLHERSYLRAYVWNKLFQREVLD--G 192 Query: 181 IKFIAGLHH-QDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHY 239 I+F L +D + ++ Y Y + + G YQ+ Sbjct: 193 IRFSTELSFAEDYEMFCRVLEQCDTIVCGTKAEYYYMQRKSGICNRGYDGI----YQKAM 248 Query: 240 IKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGM 299 + +R Y I+ + R + R I + + Sbjct: 249 MMFEDYCNVYSRRYPQYKKIFEN--HYLFDLMGMAAAMSRNGNYVAADCRKIKKYVRQKL 306 Query: 300 YKRLITNVRSVKVGYQALLWSFRLWQWRDKTRSHH 334 + L S+ + A + S + + R H Sbjct: 307 PQYLTDKKVSLYLKGSACVVSVSVRAFGAVYRMLH 341 >UniRef50_UPI0001972E41 putative sugar transferase n=1 Tax=Clostridium sp. M62/1 RepID=UPI0001972E41 Length = 323 Score = 256 bits (654), Expect = 1e-66, Method: Composition-based stats. Identities = 80/325 (24%), Positives = 138/325 (42%), Gaps = 8/325 (2%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 +S++IP++N C+ S++ Q++ EII+++DGS D S E+ YYA + ++ Sbjct: 2 LISIVIPVFNVEKYLEECINSVLKQSFQDYEIILVDDGSQDKSGEMCDYYAGICEKITVI 61 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMAL--EDDLDVAQCNADW 124 H+ N G S ARN G++ A G+Y+ F+D+DD + E + A+ + +DV A Sbjct: 62 HKKNGGLSDARNVGVKAAKGEYILFLDSDDYIASESLEEVAQKAVNSRNKIDVFFLEAVK 121 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 + + E ++ G + ++ ++ + R++I ++N+ F Sbjct: 122 FYPDGREEPLGDGYEKRLIEG-QSKNTVMQHLAYLPKFPGSACSKMVSRELIERHNLYFQ 180 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYY-LHNTSVSRLHRQGNK-NLNYQRHYIKI 242 GL +DI WT + A YTE Y Y S++ N +L Y Sbjct: 181 EGLLSEDIEWTVRLLKAAQTFGYTETPFYYYRQQREGSITNTFGIKNIESLLYIVEKFSS 240 Query: 243 TRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKR 302 LLE +N + M Y Y L R + L R R I S K Sbjct: 241 HSLLENSFQNSINSFMAYEFVIILYHYALLDK-KERRMVKESLRRNRWILRYGLSRKTK- 298 Query: 303 LITNVRSVKVGYQALLWSFRLWQWR 327 I N+ G + ++++ R Sbjct: 299 -IINLAVSVFGMSVTSYLLKIYKNR 322 >UniRef50_UPI0001C4190B glycosyl transferase GT2 family n=1 Tax=Methanobrevibacter ruminantium M1 RepID=UPI0001C4190B Length = 381 Score = 256 bits (654), Expect = 1e-66, Method: Composition-based stats. Identities = 75/348 (21%), Positives = 147/348 (42%), Gaps = 12/348 (3%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 ++S++IP+YN R C++S + QT+ +EII INDGSTD+S++I K Y E+ + Sbjct: 1 MVEISIVIPVYNVEKYLRECLDSAVNQTFKDIEIICINDGSTDSSLDILKEYQESDDRII 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 + +Q N G ARN GI + GKYV F+D+DD + E L + E LD Sbjct: 61 IFNQENQGPGAARNLGINKSKGKYVYFLDSDDYLELNALEKLYNICEEKSLDFVLFKLLN 120 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWT--HVVWMGVYRRDVIVKNNIK 182 +TG+T+Q+ + + + L + +Y+R++I +I Sbjct: 121 FNDKTGKTFQTKYYNMAFLNDRIGDNVFSYKDLYDCVFNLAVSPPAKLYKRELIT--DID 178 Query: 183 FIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKI 242 + G+ +D V+ + + A R + ++ LY + S++ + + Sbjct: 179 YPEGIIFEDNVFFLKTLLKAKRIYFLDEFLYNRRRRDDSLTSSGSDD--YYDIIPSMNYL 236 Query: 243 TRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQ------RMIAEIFT 296 + L+ K +Y + +++ +V ++E L R+ I E Sbjct: 237 FDICRDLDDFELLKEGLYYKKFKELYIRFSKVNDVHKEEFFNLIREDCLKHKEEIDEDIA 296 Query: 297 SGMYKRLITNVRSVKVGYQALLWSFRLWQWRDKTRSHHRITRSAFNLR 344 + ++ + + + DK + + + + +L+ Sbjct: 297 NDKLRKRSKFIYESVLSSDDYKEFHYRIRLYDKNKEINDLKKENKSLK 344 >UniRef50_A6ASH7 Cps2K n=1 Tax=Vibrio harveyi HY01 RepID=A6ASH7_VIBHA Length = 330 Score = 255 bits (653), Expect = 1e-66, Method: Composition-based stats. Identities = 65/330 (19%), Positives = 132/330 (40%), Gaps = 7/330 (2%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M + +SVI+P+YN C+ SL+ QT++ E+I+++D S D+S++IAK Sbjct: 1 MNSKEPLISVIVPVYNVEKYIIKCLNSLVNQTYSNFEVIVVDDESPDDSIKIAKAAFSED 60 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 + ++N G S ARN G++ + G+Y++F+D+DD T E ++ ++++ D++ C Sbjct: 61 SRFIFVSKSNGGLSCARNFGVKYSKGEYISFLDSDDYFDETFLEVMINTLMDNNADISVC 120 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNN 180 + + R V+TG + R L +Y+R+ I+ Sbjct: 121 AMSLVDE----CYDELEVRGPRENEVITGFEAFRNNLLMHTIASGAQNKLYKREHIINY- 175 Query: 181 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYI 240 + GL+++D T + ++ + + L+ Y S+ + + + + + Sbjct: 176 -PYPLGLYYEDRATTYKMFLDSSKVCLINKPLFYYLQREGSIMKGLSEKSVTDRFI-VHD 233 Query: 241 KITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMY 300 I LEK + + + + + K F Sbjct: 234 SIVSELEKRSLLKDNLDYLNVCYILNVIVAGSHQIAIYSKNYKKDISNLFDRIDFNVLTI 293 Query: 301 KRLITNVRSVKVGYQALLWSFRLWQWRDKT 330 L + K + AL+ + K Sbjct: 294 SSLSLMYKFHKKKFFALIVLLVSKRLFKKF 323 >UniRef50_B0NLZ9 Putative uncharacterized protein n=1 Tax=Bacteroides stercoris ATCC 43183 RepID=B0NLZ9_BACSE Length = 305 Score = 255 bits (653), Expect = 1e-66, Method: Composition-based stats. Identities = 70/275 (25%), Positives = 121/275 (44%), Gaps = 5/275 (1%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 ++SVIIP+YN R ++S+ QT LEII+I+DGSTD S I + YA+ +R Sbjct: 1 MPEVSVIIPIYNTAAYLRKALDSICGQTLKELEIILIDDGSTDGSRGIIEEYAKRDARIR 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 Q N G SVARNRG+ ATGKY+ F+D+DD + + + +LD +A Sbjct: 61 WYAQPNQGLSVARNRGLLHATGKYIYFMDSDDILDTQALRRCHDLCEKRELDFIFFDAKP 120 Query: 125 CFRE-TGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 + + V +G + L LS+R + V++ + R + K F Sbjct: 121 LTESADSQNIPNYDRKGKIDESVHSGMEMLDYELSNRLFLTPVYLCMVNRLFLKKCFTCF 180 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKIT 243 G+ H+D +T NA R Y ++ + + S+ + Y + Sbjct: 181 HPGIIHEDHSFTMMIYLNAQRTCYVPEAFFGRRIRPHSIMTS----RFGMRNIEGYTTVC 236 Query: 244 RLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAV 278 ++ L + + + + ++ + + H + Sbjct: 237 TQMQALGQEHPEWKTLIRKYLSYTLNDVIWAGHRM 271 >UniRef50_C7PMN9 Glycosyl transferase family 2 n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PMN9_CHIPD Length = 329 Score = 255 bits (653), Expect = 1e-66, Method: Composition-based stats. Identities = 69/337 (20%), Positives = 133/337 (39%), Gaps = 13/337 (3%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV--R 64 K+SVIIP YN C+ S+I QT + +EII++ND S+D ++ + + ++ V + Sbjct: 2 KVSVIIPAYNVEKYIEECVMSVIEQTLSEIEIIVVNDQSSDGTLSVLEKIKASHSQVALK 61 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++ Q N G S ARN GI+ ATG+Y+ F+D DD + M + L +A + + +V C+ Sbjct: 62 IISQQNQGLSGARNTGIQAATGEYICFLDGDDWLEKEMLQDLYKVASDGNAEVVICDYTK 121 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 + + E D VL + + ++RR +V+ +F Sbjct: 122 VYEDRNEVLSGGRID----GNVLRNRKDILEAFLTGNIVISACNKMFRRTFVVERQFRFP 177 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITR 244 G ++D+ T + +A ++S Y Y + S+ K + +I Sbjct: 178 VGYLYEDLP-TVMLIADASVVIKVDKSYYNYRQRSGSIMNSISP--KMVEKIVLVKQIID 234 Query: 245 LLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLI 304 ++ ++ + + + ++ A+ D R + I + + Sbjct: 235 YIKAKGEQGLEEALQSFYLNTYVLQLINQL--AIHGNKDTKQRDQYFNTILSKQESRFFF 292 Query: 305 TNVRSVKVGYQALLWSFRLWQWRDKTRSHHRITRSAF 341 Y L+ R + + R Sbjct: 293 KKFYLN--HYINTRDKIGLFLLRMSPTFYAFLYRKYK 327 >UniRef50_B2UQ58 Glycosyl transferase family 2 n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UQ58_AKKM8 Length = 384 Score = 255 bits (653), Expect = 2e-66, Method: Composition-based stats. Identities = 71/248 (28%), Positives = 112/248 (45%), Gaps = 5/248 (2%) Query: 6 NKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRL 65 +S+I+P YN +C+ESL+ QT +EII INDGSTD + + E + L Sbjct: 3 PDVSIIVPCYNVAAYVDSCLESLVRQTLRNIEIICINDGSTDETWTHLLRWKEKDSRIIL 62 Query: 66 LHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWC 125 L+Q NAG S ARN G++ A G YV F D DD + P MY L + ALE D D+ +C Sbjct: 63 LNQRNAGVSAARNAGLDAARGLYVGFADPDDYMDPEMYSRLFSAALEYDADIVECGNHVF 122 Query: 126 FRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIA 185 + ++ R P + VW ++R+ ++ + ++F Sbjct: 123 EDSSDRIIEAKRRSPSRHFEENASPASFFRDSIWGKMDICVWSKLFRKSMLDAHRLRFNV 182 Query: 186 GLH--HQDIVWTTEFMFNALRARYTEQSLYKYY-LHNTSVSRLHRQG--NKNLNYQRHYI 240 L +D + + +A R + LY Y + N S+SR +K + + + Sbjct: 183 HLKSGAEDETFRLMAVPHASRLLFIPDCLYYYRLMRNGSLSRRCNVPTYSKCVQEFQRLL 242 Query: 241 KITRLLEK 248 I +K Sbjct: 243 YIVDYWQK 250 >UniRef50_Q047W8 Glycosyltransferase related enzyme n=1 Tax=Lactobacillus delbrueckii subsp. bulgaricus ATCC BAA-365 RepID=Q047W8_LACDB Length = 315 Score = 255 bits (652), Expect = 2e-66, Method: Composition-based stats. Identities = 73/321 (22%), Positives = 129/321 (40%), Gaps = 12/321 (3%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 K+S+I+P+Y A C+ES++ QT+ +E+++I DGS DNS EI YA P V Sbjct: 2 EDVKVSIIVPVYKAEASIVRCIESILGQTYHNIELLLIEDGSPDNSGEICDRYAG-DPRV 60 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 +++H N G S ARN G+E ATG+Y+ F D+DD P + + A++ D+ Sbjct: 61 KVIHNTNQGVSHARNTGLEYATGEYITFCDSDDYYTPNHVKENIYAAMKHGSDITISGYY 120 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 E + + L+ D ++ W +Y+R+V+ +KF Sbjct: 121 I------EKHGKFVSSVHMTNASLSKDDVVKHFTVDNEVGGYCWNKLYKREVLQ--GVKF 172 Query: 184 IAGL-HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKI 242 L +D + M A + Y + LY Y + S +R + + ++ Sbjct: 173 PKNLEVLEDTYFLCSAMKRARKIYYLAEPLYYYCDNQNSATRNMDMLYSDHDTVKYIDSY 232 Query: 243 TRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVR--KEPDILTRQRMIAEIFTSGMY 300 +L + +I+ + C + I + I + + + Sbjct: 233 KNILANHRLGQNLENIIHGTMFEMAVAYRYYECQGYHHGSQQLIKNLETDIKQFRKAYYH 292 Query: 301 KRLITNVRSVKVGYQALLWSF 321 + +R VK L Sbjct: 293 CSEFSLMRKVKWTLIMLWPRL 313 >UniRef50_D2PVX1 Glycosyl transferase family 2 n=1 Tax=Kribbella flavida DSM 17836 RepID=D2PVX1_9ACTO Length = 776 Score = 255 bits (651), Expect = 3e-66, Method: Composition-based stats. Identities = 67/245 (27%), Positives = 107/245 (43%), Gaps = 7/245 (2%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 T +LSV++P YN D C++S+ QTWT E I+++DGS D+S IAK + + R Sbjct: 2 TPRLSVVVPFYNVRDYIGDCLDSIARQTWTDFEAILVDDGSPDDSAAIAKEFCDRDARFR 61 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++ Q N G ARN G+ A G+Y+ FVD+DD V +E L+ A NA Sbjct: 62 IVAQENQGLGPARNTGVRHAQGEYLTFVDSDDLVTRHGFEKLIRSLDRTGSSFAGGNARR 121 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 G I + T + R +VW VYRR + +F Sbjct: 122 FNNSAGVRPSWIHRQPFARDRLATHVIETPDLILDR----MVWNKVYRRSFWDEYGYQFP 177 Query: 185 AGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNT--SVSRLHRQGNKNLNYQRHYIKI 242 A + ++D T + +A+ +Y + + S+++ Q + R + Sbjct: 178 A-IRYEDYPVTLKAHLDAVTVDALAVPVYYWRERESGESITQQKFQLGNLEDRVRSAEMV 236 Query: 243 TRLLE 247 L+E Sbjct: 237 MDLVE 241 >UniRef50_Q83VE8 EpsN n=1 Tax=Lactococcus lactis subsp. cremoris RepID=Q83VE8_LACLC Length = 351 Score = 254 bits (650), Expect = 3e-66, Method: Composition-based stats. Identities = 79/352 (22%), Positives = 148/352 (42%), Gaps = 19/352 (5%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 +SV++P+YN C ES+ QT+T LEII+++DGS D S ++ V ++ Sbjct: 4 LISVVVPVYNVEKYLLKCFESISKQTYTNLEIILVDDGSKDTSGKMCDDLKLLDDRVIVV 63 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 H+ NAG ARN G+E+A+GKYV FVD+DD + M L ++ +D + C F Sbjct: 64 HKKNAGLGYARNTGLEMASGKYVLFVDSDDYIEENMVSRLYQRIVKYGVDTSFCGYSIFF 123 Query: 127 RETGETWQSIPTDRLRSTGVLTGPDWLRMGLS-----------SRRWTHVVWMGVYRRDV 175 + ++I + + D + L V + +Y D+ Sbjct: 124 EDE----KTIKKCSYYADSIFEEEDVVEKVLLEMIAGLPNAKVDFIIPMSVCLALYSMDI 179 Query: 176 IVKNNIKFIAGLHH--QDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNL 233 I +I+F + + +DIV+ ++ ++ + Y +LY Y +++ + ++ Sbjct: 180 IRDYSIRFPSEREYISEDIVFDIAYLQHSNKVSYVSDALYNYRQNSSGSLTHVFRSDEFD 239 Query: 234 NYQRHYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAE 293 + Y+KI L ++ F ++ + A +P + Sbjct: 240 RQKLMYLKINAELCTFFHKSKYELRTMRYFLGRVRTCVDKAVSANEFDPSFNPIS-YVKH 298 Query: 294 IFTSGMYKRLITNVRSVKVGYQALLWSFRL-WQWRDKTRSHHRITRSAFNLR 344 IF M +R+I K + L++ + ++ ++ RS N R Sbjct: 299 IFGDEMVRRIINVYPYKKNPLKIRLFNACIRYRLTHAAIWLTKLRRSRGNNR 350 >UniRef50_A5ZUY3 Putative uncharacterized protein n=1 Tax=Ruminococcus obeum ATCC 29174 RepID=A5ZUY3_9FIRM Length = 367 Score = 254 bits (650), Expect = 3e-66, Method: Composition-based stats. Identities = 86/344 (25%), Positives = 148/344 (43%), Gaps = 16/344 (4%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 K+S+IIP YN C+ES+ QT +EII +NDGSTD+S+EI K YAEN P Sbjct: 1 MAKVSIIIPTYNVEMFLDECLESIQRQTLQDIEIICVNDGSTDHSLEIIKKYAENDPRFV 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++ + N G A N G++ ATG+Y+ V+ DD V MYE L +A E+DLD+ + + Sbjct: 61 IVDKENGGYGKAMNVGLDKATGEYIGIVEPDDYVPLAMYEDLYKIAKENDLDMVKADFYR 120 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDW-LRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 R ++ + L TG G + + R+ W G+Y+R+ + +++I+ Sbjct: 121 FTRNPENGNMNLVYNHLDLTGKWYGKLFDPSEEPETLRFIMNTWSGIYKREFLEQHHIRH 180 Query: 184 IA--GLHHQDIVWTTEFMFNALRARYTEQSLY-KYYLHNTSVSRLHRQGNKNLNYQRHYI 240 G QD + + A RA ++ Y + S R+ N + YI Sbjct: 181 NETPGASFQDNGFYFQTFVYAKRAMIVDKPYYMNRRDNPNSSVNN-REKVYCCNVEYDYI 239 Query: 241 KITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMY 300 + + + I+ F + + + + ++ + I S + Sbjct: 240 REILMKD---------PEIWSRFKYMYSLKKFHTYNFTLQRIGEQFKREYVQRI--SQEF 288 Query: 301 KRLITNVRSVKVGYQALLWSFRLWQWRDKTRSHHRITRSAFNLR 344 KR N + + + W + +D + +I +R Sbjct: 289 KRAKENGELKQGAFYPVEWDDMMLLIQDPDAYYFKICLKNKQIR 332 >UniRef50_B4BNC6 Glycosyl transferase family 2 n=1 Tax=Geobacillus sp. G11MC16 RepID=B4BNC6_9BACI Length = 516 Score = 254 bits (650), Expect = 3e-66, Method: Composition-based stats. Identities = 71/317 (22%), Positives = 136/317 (42%), Gaps = 11/317 (3%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K+S+IIP+YN C+ S+ Q++ LE+I+INDGSTD S +I + + +N P RL Sbjct: 4 KVSIIIPVYNIASYLEECLYSVQNQSYRDLEVIMINDGSTDGSEQILERFQKNDPRFRLF 63 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 +Q N G RNRGI ++ G+YV F+D DD + + L++ D D A Sbjct: 64 NQENRGLGYTRNRGISLSNGQYVFFLDGDDRLPKDAIQHLLSAIETHDADYAVGKVLRFH 123 Query: 127 RETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAG 186 E L + +T + L + +++++ +++N++ FI G Sbjct: 124 SERKYVPIRHLEFNLYTQKQVTTIEQNPELLQD----SIACNKLWKKEFLIQNDLSFIEG 179 Query: 187 LHHQDIVWTTEFMFNALRARYTEQSLYKYYLH----NTSVSRLHRQGNKNLNYQRHYIKI 242 +++D+ +T A T + +Y + + N S+++ + L+ + Sbjct: 180 KYYEDLFFTMRAAVLAKTVAVTNKVVYYWRVRTEEHNRSITQEQMKLENTLHRLSALKQN 239 Query: 243 TRLLEKLNRNYADKIMIYPEFHQQITYEALR-VCHAVRKEPDILTRQRMIAEIFTSGMYK 301 + L ++ + K++ + I L V A+ + + Q + + Sbjct: 240 RQWL--IDNRVSAKVVEQNDLKCLIDVLRLHAVKFALVDQTERKEWQENVLSFLQEIPRE 297 Query: 302 RLITNVRSVKVGYQALL 318 K+ Y +L Sbjct: 298 TAQQLAPKEKMLYDLIL 314 >UniRef50_B1JIU1 Glycosyl transferase family 2 n=28 Tax=Enterobacteriaceae RepID=B1JIU1_YERPY Length = 341 Score = 254 bits (650), Expect = 3e-66, Method: Composition-based stats. Identities = 90/343 (26%), Positives = 151/343 (44%), Gaps = 17/343 (4%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 MNS + +SVI+P+YN + ++SL +QT +LEII +NDGSTD+S+ + + A Sbjct: 1 MNSVD-ISVIVPVYNCANYLTDLLDSLRSQTGVSLEIIAVNDGSTDHSLALLEQVAHIDS 59 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 + +++Q N G S ARN GI A G +VAF D DD + P T + + LDV N Sbjct: 60 RLVIINQQNQGLSEARNVGIAKAKGNWVAFADGDDWLAPHALSTWLQQVKQQKLDVLIGN 119 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVL------------TGPDWLRMGLSSRRWTHVVWMG 169 + + L G W+ + + + W H W+ Sbjct: 120 GYQFSSNPERKPVTPLLGQQPWGATLNGQQWIILSVKQNGQQWIILSVKQKEWPHYAWLQ 179 Query: 170 VYRRDVIVKNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQG 229 + RRDVI + ++FI + H+DI+WTT+ A R + Y Y + S++ Sbjct: 180 LIRRDVISTHQLQFIRDMLHEDILWTTQLALVAQRIGFCATPFYGYRTNPDSITNSPSM- 238 Query: 230 NKNLNYQRHYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQR 289 L Y+ I L K+ + K + E+ + +RK+ ++ Sbjct: 239 QALLRRASSYVDIINALVKIAAST--KPPVRNALLSHANQESGHLLGLLRKKLPASPQRT 296 Query: 290 MIA-EIFTSGMYKRLITNVRSVKVGYQALLWSFRLWQWRDKTR 331 +A T G+ L V ++ ++A+ SF + + + R Sbjct: 297 ALAIRFITLGLRPALFRGVTNLHELWRAIRCSFLMGSYARQKR 339 >UniRef50_A8RE09 Putative uncharacterized protein n=1 Tax=Eubacterium dolichum DSM 3991 RepID=A8RE09_9FIRM Length = 330 Score = 254 bits (650), Expect = 3e-66, Method: Composition-based stats. Identities = 71/322 (22%), Positives = 150/322 (46%), Gaps = 5/322 (1%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 +S+I+P+YNA +E+L QT T +EI+++NDGSTD+S+++ + +AE P ++++ Sbjct: 9 VSIIVPIYNAEKYLSRTVETLRRQTLTNIEILLVNDGSTDDSLKLCREFAEIDPRIQVID 68 Query: 68 QANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFR 127 + N G S ARN G+++A G+YV FVDADD V E L+ E+ D+ C + Sbjct: 69 KQNGGVSSARNIGLKLAKGQYVLFVDADDYVEANYAEILLKTLQENKSDIEMCGYFREDK 128 Query: 128 ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGL 187 ++ + + D + P + + + ++ ++ I +NN++F L Sbjct: 129 KSEQIITLMEKDHVFDLN-KELPSIHKAFVKGNGGIYSIFNKLFSLKKIKENNLQFSEKL 187 Query: 188 HH-QDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRLL 246 H +D ++ +++ Y LY Y ++ S++ + + + ++ Sbjct: 188 SHAEDFLFCIQYVNTCSSFGYINIPLYHYVNNDNSLTTTFNKRQLAYHEDWLMVLKQNIV 247 Query: 247 EKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLITN 306 + + + + + Y + Y + VR LT + E+ + +KR+ + Sbjct: 248 SDVYKEHT-QEIYYTTLTEVNDYCIRAKRNKVRFSYKDLTSYPSVREMIKNIRFKRVDSI 306 Query: 307 VRSVKVGY--QALLWSFRLWQW 326 + + +L+ ++L+ Sbjct: 307 SCYILLIALRFKILFLYKLYIL 328 >UniRef50_C6A9D3 Glycosyltransferase n=4 Tax=Bifidobacterium animalis subsp. lactis RepID=C6A9D3_BIFLB Length = 337 Score = 254 bits (650), Expect = 3e-66, Method: Composition-based stats. Identities = 70/258 (27%), Positives = 115/258 (44%), Gaps = 15/258 (5%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 ++SVIIP+YN + CM+SL+ Q++ E++ +NDGSTD SV I + Y + V+ Sbjct: 1 MPEISVIIPVYNQEEYVGACMDSLLDQSFEDFEVLAVNDGSTDASVSILEGYEQRDGRVQ 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 L+ Q N+G SVARN G+ A G++V F+D DD V P +TL+ A E + D+A Sbjct: 61 LIDQNNSGVSVARNNGLSHAKGEWVCFIDPDDYVAPDYLDTLLK-ATERNTDIAMSTCVA 119 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRW---------THVVWMGVYRRDV 175 + P L L + V W +YR Sbjct: 120 FDDQRSARQHFFPESFTARAA--DEKVELFRQLMDGSYAQPKGFVTAIGVPWGKLYRHAF 177 Query: 176 IVKNNIKFIAGL-HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLN 234 + ++ ++F A L QD ++ + A Y + + Y Y + +S + N Sbjct: 178 LQEHGLQFDAALPRMQDNLFNMQAFQLAHEIVYVDYAGYYYRM--GGLSARTYKNNARGL 235 Query: 235 YQRHYIKITRLLEKLNRN 252 Y + RL+ + + + Sbjct: 236 YHPAIDERARLMREYHLD 253 >UniRef50_A3TMK5 Possible glycosyl hydrolase n=1 Tax=Janibacter sp. HTCC2649 RepID=A3TMK5_9MICO Length = 552 Score = 254 bits (649), Expect = 4e-66, Method: Composition-based stats. Identities = 62/236 (26%), Positives = 106/236 (44%), Gaps = 12/236 (5%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M +++SV+IP+YN D C++S+ QT+ +L++I+I+DGSTD S + A Sbjct: 1 MSPELHRVSVVIPVYNVVDYLEQCLDSVAAQTYRSLDVILIDDGSTDGSGPLCDTLAARD 60 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALED-DLDVAQ 119 P R++HQ N G S ARN GI ATG +V F+D+DD PT +L+ + A Sbjct: 61 PRFRVIHQDNHGLSHARNVGIGAATGTFVTFLDSDDWWEPTFVSSLVKAMDDHPGAGTAM 120 Query: 120 CNADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGL-SSRRWTHVVWMGVYRRDVIVK 178 C+ + R+ T VL+ + + + ++RR ++ Sbjct: 121 CSLTRV-------PGATYDPRISQTRVLSPAEAIAQFAGPHHSIFVIACAKLFRRTLLDT 173 Query: 179 NNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLN 234 + F G +D T + A + LY Y S++ ++ ++ Sbjct: 174 D--TFPIGRVAEDAFATHRLLMKAP-VVQVPEPLYLYRQRAHSITSRPMTVSRLID 226 >UniRef50_C2EBD2 Possible glycosyltransferase n=1 Tax=Lactobacillus ruminis ATCC 25644 RepID=C2EBD2_9LACO Length = 327 Score = 254 bits (649), Expect = 4e-66, Method: Composition-based stats. Identities = 68/332 (20%), Positives = 122/332 (36%), Gaps = 23/332 (6%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDN-SVEIAKYYAENYPH 62 +SVI+P+YN CM SL QT+ LE ++++DGST+ ++ + + Sbjct: 2 DKPLISVILPVYNVEKYLSECMTSLFDQTYENLEFVMVDDGSTNKECSKMCDEFLKKDKR 61 Query: 63 VRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNA 122 V + H+ N G S ARN GI+ A G+YV VD DD V E L + + ++ C Sbjct: 62 VVVFHKENGGLSDARNYGIKKAKGQYVTCVDPDDIVDKDYVEYLYFLIEKYHCKMSICQH 121 Query: 123 DWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIK 182 + D ++ L L W +YR+D+ +++ Sbjct: 122 RVHYDNGNVKNLGSKGDEK-----MSNKHCLERMLYHDVIDTSAWAKLYRKDLFD--DVE 174 Query: 183 FIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKI 242 + G +DI T M ++ Y Y HN S+ + NK Sbjct: 175 YPKGKLFEDIGTTYALMLKCDDVAVGYEAKYTYNFHNNSIVNGTFKMNK----------- 223 Query: 243 TRLLEKLNRNYADKIMIYPEFHQQIT----YEALRVCHAVRKEPDILTRQRMIAEIFTSG 298 ++E ++ +D + +YP+ + Y L + + Q+ + I Sbjct: 224 LDMIEMTDKMASDVLRVYPDLRGAVLRRQVYSRLSTLNQMLNADGQAVNQKRVEIIKYVK 283 Query: 299 MYKRLITNVRSVKVGYQALLWSFRLWQWRDKT 330 R + + ++ + Sbjct: 284 QNSREVAKNPKTPKRDKIAFILLKIGYPLYRF 315 >UniRef50_C9RJ54 Glycosyl transferase family 2 n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RJ54_FIBSS Length = 321 Score = 254 bits (649), Expect = 4e-66, Method: Composition-based stats. Identities = 80/329 (24%), Positives = 129/329 (39%), Gaps = 13/329 (3%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 +SVIIPLYN + C++SL+ QT+T E+II++DGSTD IA YA + Sbjct: 1 MPAVSVIIPLYNTEAFIKDCLDSLVAQTFTDFEVIIVDDGSTDGGARIAASYASADTRFK 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 L+ Q N G S ARN G+++ G+YV F+D+DD V P ETL +A DVA C+ Sbjct: 61 LIGQPNKGPSEARNTGLKIMRGEYVTFIDSDDCVAPNFLETLFFIAQLHKADVACCSIKN 120 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWT-HVVWMGVYRRDVIVKNNIKF 183 T + LT + R+ L + W +Y+ D+ F Sbjct: 121 IDEA---YKCDGTTPNTSISASLTPEEAARISLYQDSLPDYSAWNKLYKADLWK--GKFF 175 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKIT 243 A ++D+ E + A + T+ LY Y S K + + Sbjct: 176 PANTIYEDLAVIPEILLGANKVVTTKSHLYFYRKRAGSELATQTGSTKIILLLDIAEDVF 235 Query: 244 RLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILT-RQRMIAEIFTSGMYKR 302 ++ +++ +Y + + + + R+R IA I Sbjct: 236 EKMKHISK------PLYKAARSMLVSASFSILMRTSDDEKFAESRKRAIAHIRKYRFGTF 289 Query: 303 LITNVRSVKVGYQALLWSFRLWQWRDKTR 331 +R L + R + + Sbjct: 290 FDLRIRMRNRMAILLSYLPRSIFLKILKK 318 >UniRef50_C6WZX8 Beta-1,3-glucosyltransferase n=1 Tax=Flavobacteriaceae bacterium 3519-10 RepID=C6WZX8_FLAB3 Length = 361 Score = 254 bits (649), Expect = 4e-66, Method: Composition-based stats. Identities = 74/330 (22%), Positives = 139/330 (42%), Gaps = 18/330 (5%) Query: 3 NSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP- 61 K+SVI+P+Y + C+ SL+ QT +EII++NDGSTD S EI +++ +P Sbjct: 26 KQMVKVSVIVPVYQGEKELSKCLASLVNQTLDDIEIIVVNDGSTDGSGEIITEFSQRFPS 85 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 +R + N G S ARN G++ ATG+Y+ FVDADDEV + + +A + ++ CN Sbjct: 86 KIRAFTKENGGLSDARNFGLDRATGQYIGFVDADDEVSANTFAEMYELAEKHAAEMVICN 145 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNI 181 T + + VL + ++ ++R+D++ +N Sbjct: 146 IQKVDEAGKITQKLTQIPNMPERIVLEDHFSVFA-----DISYFACNKIFRKDIL--HNK 198 Query: 182 KFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIK 241 +F G+H++DI + + T+ Y+Y+ S+++ H + K L+ R Sbjct: 199 RFKKGIHYEDIQLIPQLLLECRVIAQTQNYHYQYFERRGSITKTHTE--KGLDILRAVAD 256 Query: 242 ITRLL--------EKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAE 293 + + RN+ IY + ++ + E +QR I Sbjct: 257 VENAFATSQYSHQKAGLRNFQILEGIYSFLAYLAFVKDEKISRKLSDELKKFRQQRGIGW 316 Query: 294 IFTSGMYKRLITNVRSVKVGYQALLWSFRL 323 + + + + + + + Sbjct: 317 LEIIRYQRFGKNYLLYLPTRKKIYYLLYFM 346 >UniRef50_D0BME4 Glycosyl transferase n=10 Tax=Lactobacillales RepID=D0BME4_9LACT Length = 321 Score = 254 bits (649), Expect = 4e-66, Method: Composition-based stats. Identities = 72/315 (22%), Positives = 134/315 (42%), Gaps = 7/315 (2%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 +SVIIP+YN + +ESL QT+ E+I++NDGSTDNS + YA+ Y +V Sbjct: 2 LISVIIPVYNVENYLHYAIESLFRQTYKNFEVILVNDGSTDNSRSLCDEYAQRYKNVYAY 61 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 H+ N G S ARN G+E A ++ F+D DD + +E L+ + + ++ + Sbjct: 62 HKTNGGLSDARNFGVEKANSDWIFFLDPDDYIEAFTFELLVNLQKQYNVPLVSTKVQSTS 121 Query: 127 RETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAG 186 + I L T + L + L ++ T +Y++ ++ + + F G Sbjct: 122 VYNEYSDSEISNLDLTQTKKVNKETALALMLENKIATVSSCAKLYKKKILQE--VPFPVG 179 Query: 187 LHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRLL 246 ++D E + A + Y YY S+ K N+ +++ Sbjct: 180 KIYEDFFVVAEHLNLAEEIVISPIVTYHYYSRTGSIVNS-AFSEKQYNFFDAAEHTEKVI 238 Query: 247 EKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEP-DILTRQRMIAEIFTSGMYKRLIT 305 N Y ++ + + A + + K+ ++L +++I +F S ++ R +T Sbjct: 239 ---NEKYNNRRELINSVEAKTVIGAFHIINNAEKDKANLLKIRKLIKNVFKSVIFNRQVT 295 Query: 306 NVRSVKVGYQALLWS 320 +K L Sbjct: 296 LKHKLKYILFLLFPL 310 >UniRef50_C0CH08 Putative uncharacterized protein n=1 Tax=Blautia hydrogenotrophica DSM 10507 RepID=C0CH08_9FIRM Length = 322 Score = 254 bits (649), Expect = 5e-66, Method: Composition-based stats. Identities = 63/315 (20%), Positives = 129/315 (40%), Gaps = 15/315 (4%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 +S+I+P+YN ++SL QT+ +EI+++NDGS DNS I +A+ ++++ Sbjct: 4 LVSIIVPVYNVEAYVDRSIQSLTNQTYRNIEILLVNDGSIDNSSVICDLWAQKDKRIKVI 63 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 H+ N G ARN+ +++A G+Y+ F+D+DD E L M E LD++ CN + Sbjct: 64 HKKNGGQGSARNKALDIARGEYICFLDSDDTAKKDYIEFLYNMMKEKHLDISACNNEIFD 123 Query: 127 RETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAG 186 + + LTG + ++ + W +Y++++ +I+F Sbjct: 124 ENGTFLRERKTGEGYVE---LTGTEAIKSLWTDGVINIGPWGKLYKKELWE--DIRFKE- 177 Query: 187 LHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRLL 246 +D A R Y+ + +Y + + S R +Q + + + Sbjct: 178 CFSEDWATMHFIYEKADRVGYSYECKIEYLIRSNSSIR---------KFQEKKLIMLDIA 228 Query: 247 EKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLITN 306 + + + P +Q+ + + + ++ I + +++ Sbjct: 229 KDNIQYAQSHPELLPASYQKAVSVYFHILFQLPNDSKYDIVRKQIKLLIKEIRLSIIVSR 288 Query: 307 VRSVKVGYQALLWSF 321 K LL F Sbjct: 289 ECRSKTKIAVLLSYF 303 >UniRef50_Q8XN57 Beta-1,4-galactosyltransferase n=3 Tax=Clostridium perfringens RepID=Q8XN57_CLOPE Length = 342 Score = 253 bits (648), Expect = 5e-66, Method: Composition-based stats. Identities = 78/331 (23%), Positives = 150/331 (45%), Gaps = 13/331 (3%) Query: 6 NKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRL 65 K+S+I+P+Y D C++S+I Q++ LEII+++DGSTD EI Y + +++ Sbjct: 3 PKVSLIVPIYKVQDYLNKCIDSIINQSYKNLEIILVDDGSTDRCGEIIDLYEKQDNRIKV 62 Query: 66 LHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWC 125 H+ N G S ARN G+ TG YV FVD+DD + M LM +LE + D+ Q + Sbjct: 63 FHKKNGGLSDARNYGMNFITGDYVIFVDSDDWLSEDMVSDLMKTSLELNADIVQSGFYYA 122 Query: 126 FRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIA 185 + + + + VL + + + + + + W +Y+ +I +I F Sbjct: 123 YDDYLLYDNRYYLEDMSPV-VLNREELMYELVINEKVKNFAWGKLYKSVLIK--DISFKI 179 Query: 186 GLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRL 245 G+ +D+ W + M + + +Y Y S+ ++ + +NL+ + + + Sbjct: 180 GVLFEDVFWAHKVMDRVDKYVIIHKPMYYYRQRKDSIVAIYTE--RNLDILKGLQERHKF 237 Query: 246 LEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLIT 305 +EK I + E ++ I +L + ++K+ +I R + +I K Sbjct: 238 IEK------KYISLINESYKIIFKTSLIHYYLLKKKKNIKNRDYYMNDI--KEYIKSNFR 289 Query: 306 NVRSVKVGYQALLWSFRLWQWRDKTRSHHRI 336 N+ + L+W L+Q + + Sbjct: 290 NIEKSLKNERELIWQLHLFQINPNLNRIYIL 320 >UniRef50_C2EWJ8 Putative uncharacterized protein n=1 Tax=Lactobacillus vaginalis ATCC 49540 RepID=C2EWJ8_9LACO Length = 499 Score = 253 bits (648), Expect = 5e-66, Method: Composition-based stats. Identities = 68/274 (24%), Positives = 119/274 (43%), Gaps = 13/274 (4%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 M VI+ +YN C++S++ QT+ LE+I+++DGSTD S+ I Y + Sbjct: 10 MEEEQIL--VIVSVYNTEKYLDQCIQSILKQTYHNLELILVDDGSTDRSLAICTRYQQQD 67 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQC 120 VR+ HQAN G S ARN G++VA G + FVD+DD V P E L+T +A Sbjct: 68 SRVRVFHQANQGVSKARNYGMDVAAGDLITFVDSDDYVMPNYLEELITALESTGSQIAIS 127 Query: 121 NADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRR-----WTHVVWMGVYRRDV 175 + ++ G+ +P D S T +W +++ V+W +++++ Sbjct: 128 DFQS-YKGNGQLLVHMP-DHEVSQRTFTPREWFDFAYTNKNDFFQLIYTVLWGKLFQQEA 185 Query: 176 IVKNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNY 235 N I D T + A + Y + Y Y L+ +S R+ +++ Sbjct: 186 F--NQIYCPEDTQIDDEFTTWKTYLQASKIVYLDDEDYIYRLNQDGLSH--RKNQEDIRP 241 Query: 236 QRHYIKITRLLEKLNRNYADKIMIYPEFHQQITY 269 R + L L + ++ + ++ Sbjct: 242 LRSLEEEITFLHILGYDPQLPLLRINQLYRISIK 275 >UniRef50_C6I618 Glycosyltransferase n=5 Tax=Bacteroidales RepID=C6I618_9BACE Length = 329 Score = 253 bits (648), Expect = 5e-66, Method: Composition-based stats. Identities = 64/337 (18%), Positives = 132/337 (39%), Gaps = 13/337 (3%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY-PH 62 S+ K+S+IIP+Y A C++S+ITQT+T E+++++DGS D+S E+ + YA Y Sbjct: 2 SSPKISIIIPVYMAESYLHRCVDSIITQTFTDWELLLVDDGSPDHSGELCEKYASAYKSR 61 Query: 63 VRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNA 122 ++ H+ N G + AR G++ A G+Y VD DD + E L A+++ D+ C+ Sbjct: 62 IKAFHKPNGGVASAREFGMQQARGEYSIHVDPDDWIDSNTLEELYQQAVKEQADMVICDF 121 Query: 123 DWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIK 182 + P +L ++ L R + + R ++ K + Sbjct: 122 MMEYPNRQIHNCQKP-------QLLDSSSFMHQLLQQER-HGSLCNKLIRTELYHKYQLH 173 Query: 183 FIAGLH-HQDIVWTTEFMFNALRARYTEQSLYKY--YLHNTSVSRLHRQGNKNLNYQRHY 239 F + +D+ + + + Y +LY Y Y ++ S+ R Sbjct: 174 FPEKMICWEDLYICCSILLHGCKLAYVPHALYHYDFYTNDNSMVRHTDMRGLQAQIDFCR 233 Query: 240 IKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGM 299 + ++ + + I + + ++ E + R + S Sbjct: 234 LMQAKISPEYLPELNELKGITLITAFRNQLLNEQAIRSLFPEINDWYVTRYGHDYEKSNY 293 Query: 300 YKRLITNVRSVKVGYQALLWSFRLWQWRDKTRSHHRI 336 Y + + ++ + + + K + + Sbjct: 294 YGLTLVLRGYNFKTARRRMFVAK-FLVQIKNKITRML 329 >UniRef50_A5ZFA7 Putative uncharacterized protein n=1 Tax=Bacteroides caccae ATCC 43185 RepID=A5ZFA7_9BACE Length = 328 Score = 253 bits (648), Expect = 6e-66, Method: Composition-based stats. Identities = 68/324 (20%), Positives = 141/324 (43%), Gaps = 7/324 (2%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M S +SVIIP++N C+ES+ Q+ +EII++ + S DNS EI YA+ Sbjct: 1 MESNYLVSVIIPVHNTAPYLHKCVESVRNQSLKEIEIILVENMSIDNSAEICDEYAKIDS 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 ++++H AG +ARN GI+ AT Y+ FVD+DD + M++ L+ E+D D+ CN Sbjct: 61 RIKVIHLPIAGLCIARNAGIDAATAPYIGFVDSDDYIGTDMFKDLVEAMTENDADLTFCN 120 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNI 181 + + + + + +L + L L R + VW+ +++R++ + Sbjct: 121 YCYEYEDGHIELLAPNSG----NNILCVKEELLRDLMWERKSCAVWIRLFKRELFTS--L 174 Query: 182 KFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIK 241 +F G ++D ++ R + +++ Y Y S+ N+ +I Sbjct: 175 RFPEGFVYEDRRNMPRWIMMCDRIVWVDKTYYYYVERANSICHTISPENRYHYILSCFIW 234 Query: 242 ITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYK 301 + + E+ + + + + + + + VR + + + Sbjct: 235 LDFIHEQALFSGRELVDVQTVILRNALMQFKQALQRVRPKNFKNEIDEIRKGFKHLLTFP 294 Query: 302 RLITNVRSVKVGYQALLWSFRLWQ 325 + +VR K + + + + ++ Sbjct: 295 KNEFDVRVYK-RIRNIAYYWPIYY 317 >UniRef50_B3C8H4 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=B3C8H4_9BACE Length = 359 Score = 253 bits (648), Expect = 6e-66, Method: Composition-based stats. Identities = 78/319 (24%), Positives = 139/319 (43%), Gaps = 13/319 (4%) Query: 1 MMNS------TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAK 54 MMN+ + K+SVIIP+YN + ++S+ QT LEII+INDGSTDNS I + Sbjct: 45 MMNNCSDINPSIKVSVIIPVYNTATYLHSALDSICNQTLKDLEIILINDGSTDNSQYIIE 104 Query: 55 YYAENYPHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDD 114 YA ++ L Q N VARN G+++ATG+Y+ F+D+DD + M + Sbjct: 105 KYARKDNRIKYLIQPNQRQGVARNNGLQLATGQYIYFMDSDDILDIDALRQCYEMCERKE 164 Query: 115 LDVAQCNADWCFR--ETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYR 172 LD +A + T+ ++ + G + + L + V W+ + Sbjct: 165 LDFVTFDAKTIHETSTSSGTFNYCRKGKIDEQKIWDGIELMHYELEYEIFYVVPWLCFTK 224 Query: 173 RDVIVKNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKN 232 + + F +G+ H+D ++ + M NA R Y Q +K + N+S + + Sbjct: 225 HSFLKQYFGGFPSGIIHEDTIFVVQIMLNAHRVGYIPQPFFKRRVRNSSTM----AASFS 280 Query: 233 LNYQRHYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIA 292 ++ Y + + + + + I + ++ + + H ++ I T R Sbjct: 281 MHNIEGYTLVCTQIRSWTQQHVEWSAIIDLYLRKTLNSVIWLGHKMKTLEKIETYCR-FQ 339 Query: 293 EIFTSGMYKRLITNVRSVK 311 + S K V +K Sbjct: 340 RLHLSQYVKLKNWLVFWLK 358 >UniRef50_A6L5V6 Glycosyltransferase family 2 n=2 Tax=Bacteroides RepID=A6L5V6_BACV8 Length = 344 Score = 253 bits (647), Expect = 7e-66, Method: Composition-based stats. Identities = 81/352 (23%), Positives = 148/352 (42%), Gaps = 35/352 (9%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPH- 62 K+S+I+P+YN CM+SL+ QT LEII+++DGS DNS + Y + Sbjct: 2 KQPKVSIIVPVYNVEKYLDRCMQSLLGQTLKELEIILVDDGSPDNSPALCDNYESASDNL 61 Query: 63 ----VRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVA 118 ++++H+ N G ARN G+E+ATG+Y+AF+D+DD V MYE L A D+DV Sbjct: 62 HWPIIKVIHKINEGLGFARNSGLEIATGEYIAFLDSDDFVDVNMYEQLYLKAKAKDIDVV 121 Query: 119 QCNADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSR-----------RWTHVVW 167 CN + + P + + G + + L R+ V Sbjct: 122 YCNCNIYKDKK----HIYPRKDVEEEVIFRGREAVDDFLLDMIAPLPEYHHDVRYMMSVG 177 Query: 168 MGVYRRDVIVKNNIKFIAGLHH--QDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRL 225 +YR + +++ ++F++ +DI++ +F+ + Y L+ Y ++ S+S Sbjct: 178 HAIYRHSIFMEHRVQFVSEREFVSEDIIFDIDFLSHVRSVCYLPDCLHYYVVNPYSLSHT 237 Query: 226 HRQGNKNLNYQRHYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDIL 285 + Y K+ L + A + + + + V + Sbjct: 238 YDDAK--------YHKMVNLSRIIKERLAKIYSEEKYYLHYLRSVFFLLRNTVMRMVLAG 289 Query: 286 TRQRMIAEIFTSGMYKRLITNVRSVKVGYQALLWSFRLWQWRDKTRSHHRIT 337 + I I +Y+ L++N YQ + R+ + K + + Sbjct: 290 ETDQKIKRILEEKLYQELLSNY-----PYQRMDLKHRIVFFLLKYKCILLLK 336 >UniRef50_D1WVE7 CDP-glycerol:poly(Glycerophosphate) glycerophosphotransferase n=5 Tax=cellular organisms RepID=D1WVE7_9ACTO Length = 1217 Score = 253 bits (646), Expect = 8e-66, Method: Composition-based stats. Identities = 59/243 (24%), Positives = 94/243 (38%), Gaps = 8/243 (3%) Query: 6 NKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRL 65 LSV++P++N C+ S+ QT A+E+++++DGSTD S IA +A RL Sbjct: 7 PLLSVVVPVHNVEAYLADCLRSVADQTLEAIEVVVVDDGSTDGSARIAAEFAARDSRFRL 66 Query: 66 LHQANAGASVARNRGIEVATGK--YVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 + Q NAG S ARN G+ T Y+AF D+DD V YE + D+ N Sbjct: 67 VRQRNAGLSAARNAGVRHTTPTVPYLAFADSDDIVVHDAYERMTASLESTGSDLVTGNVW 126 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 + D L+ R V W V+RR + F Sbjct: 127 RLTERGRQQAWQYRWLTASRPRTHITRD--PRLLADR----VAWNKVFRRSFWDGHAFSF 180 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKIT 243 G ++D A + +Y + + S++R + +++ Sbjct: 181 PEGKLYEDTPVMIPAHHLARSVDVLHEHVYYWRVREGSITRRRTDVTGVRDRIAACEQVS 240 Query: 244 RLL 246 L Sbjct: 241 AFL 243 >UniRef50_Q2NH61 Predicted glycosyltransferase n=1 Tax=Methanosphaera stadtmanae DSM 3091 RepID=Q2NH61_METST Length = 358 Score = 253 bits (646), Expect = 9e-66, Method: Composition-based stats. Identities = 77/334 (23%), Positives = 148/334 (44%), Gaps = 7/334 (2%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 K+SV+IP+YN R C+++++ QT +EII +NDGSTD S++I YA +R Sbjct: 1 MVKVSVVIPVYNVEKYLRDCLDTIVNQTLEDIEIICVNDGSTDKSLDILNEYASKDERMR 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++ Q N G +VA N G+ ATG+Y+ +D+DD + E ++A E D+D A Sbjct: 61 VISQENGGHAVATNVGMTYATGEYLFLMDSDDILKLNALEDTYSIAKEKDVDFVLFKAIN 120 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSS--RRWTHVVWMGVYRRDVIVKNNIK 182 E E +++ + ++ + L T W +Y R I +NNI Sbjct: 121 YDDEKDEYYEAENYSMNKIARIVKDNIFSYEDLGDAIFDITVTPWSKLYNRSFIEENNIT 180 Query: 183 FIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKI 242 F GL D V+ + +F+A R + ++ L+ ++TS + + + ++ + + Sbjct: 181 FPEGLVFDDNVFFWKVLFSAKRIYFYKEFLFTRRWYSTSSTT--KGDLRFIDSIAVFNLV 238 Query: 243 TRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKR 302 + + N K ++Y + R+ R+ + ++ +Y+ Sbjct: 239 WDVFREFNLFEKYKKILYDKKVFLANMRFDRIKEEYREAYFSAMKDDFTKILYDDVLYQD 298 Query: 303 LITNVRSVKVGYQALLWSFRLWQWRDKTRSHHRI 336 I+ + YQ L + ++ + ++ Sbjct: 299 FISILSP---KYQKLYEQVLISTSAEEFIASRKV 329 >UniRef50_D1XQ46 CDP-glycerol:poly(Glycerophosphate) glycerophosphotransferase n=1 Tax=Streptomyces sp. ACTE RepID=D1XQ46_9ACTO Length = 1116 Score = 252 bits (645), Expect = 1e-65, Method: Composition-based stats. Identities = 63/244 (25%), Positives = 103/244 (42%), Gaps = 8/244 (3%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 LSV++P++N + C+ SL QT A+E+++++DGSTD S IA+ +A R Sbjct: 2 KPLLSVVVPVHNVEEYLEECLTSLARQTLEAIEVVVVDDGSTDGSRRIAEGFAARDGRFR 61 Query: 65 LLHQANAGASVARNRGIEVAT-G-KYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNA 122 +HQ NAG S ARN G+ T G Y+AF D+DD V P Y ++ D+ N Sbjct: 62 CVHQPNAGLSAARNAGVAHTTPGVPYLAFADSDDLVVPDAYARMLASLESTGSDLVTGNV 121 Query: 123 DWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIK 182 + Q+ L + T L+ R V W V+RR ++ Sbjct: 122 WRLTGQG--PRQAWQYRWLTADHPRTHVTRDPRLLADR----VAWNKVFRRSFWDRHAFT 175 Query: 183 FIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKI 242 F G ++D A + +Y + + S++R + ++ Sbjct: 176 FPVGRLYEDAPVMIPAHHLAGSVDVLHEHVYLWRVREGSITRRRTDVRGVRDRIAACEQV 235 Query: 243 TRLL 246 + L Sbjct: 236 SAFL 239 >UniRef50_UPI0001C377F0 glycosyl transferase, group 2 family protein n=1 Tax=Ruminococcus flavefaciens FD-1 RepID=UPI0001C377F0 Length = 321 Score = 252 bits (645), Expect = 1e-65, Method: Composition-based stats. Identities = 78/333 (23%), Positives = 144/333 (43%), Gaps = 26/333 (7%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 L+VI+P+YN C+ SLI+QT++ + II++NDGSTDNS +IA +AENY ++ Sbjct: 1 MYNLTVIVPIYNVEKYLEKCVLSLISQTYSQMCIILVNDGSTDNSGKIADKFAENYDNIT 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 ++H+ N G S ARN GIE +Y+AFVD+DD V MYE L++ ++ +D++ + Sbjct: 61 VIHKTNGGLSDARNVGIESTNTEYIAFVDSDDYVESNMYEVLLSKFSDEQIDISIGTVLY 120 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIV-----KN 179 + + + + + ++ L ++ V++ + ++ Sbjct: 121 ENEQGEGYCP----YKTGISAIWNNEEAMKYLLCYSKFNMSFCDKVFKTKLFRTPAYGED 176 Query: 180 NIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHY 239 + F G +D + A R Y LY Y S+SR N NL Sbjct: 177 KLLFPYGKKCEDQFLMCRVLARASRVAYNSIPLYHYVQRFGSISRNT---NINLAPLEAV 233 Query: 240 IKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKE-----PDILTRQRMIAEI 294 +K + + Y + +C A K +I + + A + Sbjct: 234 ESQLEFYKKWFPSISYTAE------TAYLYTYMSICSAYYKHSLDCPKEISNKAKSAASV 287 Query: 295 FTSGMYKRLITNVRSVKVGYQALLWSFRLWQWR 327 + +++ +K +QA+++ + + Sbjct: 288 YLKSVFQ---FKELPLKKKFQAIVFRYFNRLYY 317 >UniRef50_A7VGT1 Putative uncharacterized protein n=1 Tax=Clostridium sp. L2-50 RepID=A7VGT1_9CLOT Length = 444 Score = 252 bits (644), Expect = 1e-65, Method: Composition-based stats. Identities = 64/307 (20%), Positives = 114/307 (37%), Gaps = 3/307 (0%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 K+SVI+P+YN C++S+ QT +EII ++DGSTD+S +I Y + ++ Sbjct: 1 MIKVSVIVPVYNGEKYLEQCLDSICNQTLQEIEIICVDDGSTDSSWDILNRYKDQDDRIQ 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 L Q N A VARN G ATG+Y+ F D DD E + A D DV C + Sbjct: 61 LYRQQNLYAGVARNTGKSHATGEYLVFWDCDDFFESDALEKMYGKAKAVDADVCVCGGNQ 120 Query: 125 CFRETGETWQSIPTDRLRSTGVL-TGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 + + P +++ T + W +++R I + + F Sbjct: 121 FLESKQKLYPWPPYLSVKNIPETDTFNRFTNPDYYLNFTNAAAWNKIFKRSYIEQLKLDF 180 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTS--VSRLHRQGNKNLNYQRHYIK 241 + D+ +T + A R ++ L Y ++ S V + + + + Sbjct: 181 QPVRNGNDVYFTVNAIGLADRITVLDEKLVNYRVNQASGLVCSISKSPLSPIQAWMDIAE 240 Query: 242 ITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYK 301 + I + ++ + ++M +I G Y Sbjct: 241 NLEKYDGFGERSFANKAIGSMIYMLQNFQKWDAFRQAVEALKEYGLEKMHIQIQEDGYYY 300 Query: 302 RLITNVR 308 + Sbjct: 301 SVWNGEC 307 >UniRef50_C4Z667 Putative uncharacterized protein n=1 Tax=Eubacterium eligens ATCC 27750 RepID=C4Z667_EUBE2 Length = 329 Score = 252 bits (644), Expect = 2e-65, Method: Composition-based stats. Identities = 71/330 (21%), Positives = 135/330 (40%), Gaps = 14/330 (4%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 LS+I+P+YN D C S+I Q + +EI++I+DGSTDNS EI Y+ E Y Sbjct: 7 NEKLPLLSIIVPVYNVQDYLEECCNSIINQKYLNIEILLIDDGSTDNSGEICDYFGEKYS 66 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 ++R+ H+ N G + RN GI A G+Y+ FVD+DD +Y MY +M + LE + + CN Sbjct: 67 NIRVFHRENEGLAATRNFGIAQAQGQYLGFVDSDDWIYENMYSDMMKIVLEKNAQIVCCN 126 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNI 181 D E + I + + + + + + ++++++ N+ Sbjct: 127 FDL------EIDKRIIKRGKNGIDIYNKNEAIVNLMYPQFYQFYAPNKIFKKELFQ--NV 178 Query: 182 KFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIK 241 +F G H +DI+ + + + + QS Y Y S++ Sbjct: 179 EFPIGKHFEDIITIYKVFLKSEKVYFIWQSGYVYRQRKDSITNAIFNSKSAEVLDAIGFV 238 Query: 242 ITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMY- 300 + + N D ++ Y ++ +A+ + + + + + Sbjct: 239 MNDSKRRFPNNINDIMVGYIRYYISFLDKAIIGNSKELRIYQMTIQDVIKKYKKAIIINK 298 Query: 301 -----KRLITNVRSVKVGYQALLWSFRLWQ 325 +++ V LL + + Sbjct: 299 KLSFARKIQFLVCGYFAKLYELLAKIKNFL 328 >UniRef50_B5G7W6 Glycosyltransferase n=2 Tax=Streptomyces RepID=B5G7W6_9ACTO Length = 1200 Score = 251 bits (643), Expect = 2e-65, Method: Composition-based stats. Identities = 68/248 (27%), Positives = 112/248 (45%), Gaps = 17/248 (6%) Query: 8 LSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLH 67 LSVI+P+YN C++SL QT+ LE+I+++DGS D+S IA YAE RL+ Sbjct: 14 LSVIVPIYNVETYLPACLDSLRAQTFEDLEVIMVDDGSPDDSARIAADYAELDERFRLVR 73 Query: 68 QANAGASVARNRGIEVAT--GKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWC 125 +ANAG ARN G+E + +Y+AFVD+DD + P Y +++ E D A N Sbjct: 74 KANAGLGAARNTGLEHVSPGSEYLAFVDSDDLIPPDAYRLMVSSLDESGSDFATGNVHHL 133 Query: 126 FRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHV---VWMGVYRRDVIVKNNIK 182 E T ++ +L+G R +S R V+RR + + Sbjct: 134 RGEN--------TWQVPLLKMLSGQAVRRTHISERPALVADRIACNKVWRRTFWDEGTGR 185 Query: 183 FIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNT----SVSRLHRQGNKNLNYQRH 238 F G+ ++D T + A + +Y + L S+++ + + Sbjct: 186 FPEGVWYEDTPVTVTAHYRARAIDIIGEPVYFWRLREGEAGLSITQRRTEPQGVRDRVDA 245 Query: 239 YIKITRLL 246 + ++ L Sbjct: 246 CLVVSEFL 253 >UniRef50_O87183 CpsIaJ n=6 Tax=Streptococcus agalactiae RepID=O87183_STRAG Length = 315 Score = 251 bits (643), Expect = 2e-65, Method: Composition-based stats. Identities = 74/324 (22%), Positives = 128/324 (39%), Gaps = 17/324 (5%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 + +SVI+P+YN D C+ESLI QT+ +EI++INDGSTDN IAK ++E V Sbjct: 6 EKDLISVIVPIYNVEDYLVECIESLIVQTYRNIEILLINDGSTDNCATIAKEFSERDCRV 65 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 + ++N G S ARN GI + GKY+ FVD+DD+V L + D +A Sbjct: 66 IYIEKSNGGLSEARNYGIYHSKGKYLTFVDSDDKVSSDYIANLYNAIQKHDSSIAIGGYL 125 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTH---VVWMGVYRRDVIVKNN 180 + D V+ + L + + W ++ + + N+ Sbjct: 126 EFYERHNSIRNYEYLD-----KVIPVEEALLNMYDIKTYGSIFITAWGKLFHKSIF--ND 178 Query: 181 IKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYI 240 ++F +H+D + + A Y ++ LY Y + S+ Sbjct: 179 LEFALNKYHEDEFFNYKAYLKANSITYIDKPLYHYRIRVGSIMNNSDNVIIARKKLDVLS 238 Query: 241 KITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMY 300 + ++ + + + E V R + + + M E Y Sbjct: 239 ALDERIKLITSLRKYSVFLQKT-------EIFYVNQYFRTKKFLKQQSVMFKEDNYIDAY 291 Query: 301 KRLITNVRSVKVGYQALLWSFRLW 324 + +R VK+ + L R + Sbjct: 292 RMYGRLLRKVKLVDKLKLIKNRFF 315 >UniRef50_C3AT96 Glycosyltransferase family 2 n=4 Tax=Bacillus RepID=C3AT96_BACMY Length = 343 Score = 251 bits (643), Expect = 2e-65, Method: Composition-based stats. Identities = 85/324 (26%), Positives = 132/324 (40%), Gaps = 4/324 (1%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K+SVIIP+YNAG C+ESL+ QT E I INDGS D+S EI + Y + + L+ Sbjct: 4 KVSVIIPVYNAGKYITRCIESLLNQTLQECEFIFINDGSIDDSREIIEGYKQVDDRIILI 63 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCF 126 +Q N G S+ARN+G+ +A G+YV FVDADD + MYETL + + DV N Sbjct: 64 NQVNQGVSIARNKGLHLAAGEYVGFVDADDCIEKDMYETLYNSVKQSNCDVVISNFKGET 123 Query: 127 RETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAG 186 + + L L + VV +YR ++I +NN+KF +G Sbjct: 124 EGHKVITTYPFPIDIVLEKDYIEQEVLPYFLKADNLNTVV-NKIYRNNLIKENNVKFPSG 182 Query: 187 L-HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRL 245 + +D ++ F A R +Y + Y Y S +R + + Y Sbjct: 183 IALGEDGMFNVAFFSGATRMKYVDYVGYHYKEVAGSATRNISEKDYFQRAVEVYTMELPQ 242 Query: 246 LEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLIT 305 + + + + V K+ R + + + + + + Sbjct: 243 IYTDKIDRVRISKLRSIKFINSVMSYIHVYFTPSKDVSFNKRYKYVKSMIRNKHVREALP 302 Query: 306 NVRSVKVGYQALLWSFRLWQWRDK 329 Y L RL K Sbjct: 303 I--YYSEIYNKLGRYERLIINLIK 324 >UniRef50_B0NM04 Putative uncharacterized protein n=1 Tax=Bacteroides stercoris ATCC 43183 RepID=B0NM04_BACSE Length = 346 Score = 251 bits (643), Expect = 2e-65, Method: Composition-based stats. Identities = 79/339 (23%), Positives = 138/339 (40%), Gaps = 25/339 (7%) Query: 1 MMNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENY 60 MM +SVIIP+Y A C+ SL+ Q++TA E+++I+DGS DNS +I +AE Sbjct: 1 MMEQCV-ISVIIPVYKAEKYLSRCIASLLAQSFTAFELLLIDDGSPDNSGKICHDWAERD 59 Query: 61 PHVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLD-VAQ 119 +R+LHQ NAG S ARN+G++ A G+Y+ FVD+DD V + L +LD + Sbjct: 60 CRIRVLHQTNAGVSTARNKGLDEAKGEYIVFVDSDDYVGTDYLKHLYEE-RPLNLDSIVV 118 Query: 120 CNADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTH-VVWMGVYRRDVIVK 178 + + + V TG D + L+ ++ R +I + Sbjct: 119 QGFTIVSEQADRQYTTS-----FIPSVYTGRDVRELFLNDEMRDMRGPVAKLFSRVLIER 173 Query: 179 NNIKFIAGLHH-QDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQR 237 ++F A + +D ++ +++F+ + Y Y S+++ + Sbjct: 174 LGLRFPAEISFGEDTIFNFQYLFHCRQVVVGSHVDYVYCDVPGSLTKRLNDFASEYATFK 233 Query: 238 HYIKI-TRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDIL----------T 286 Y I RL KL+ N AD + + + K P L Sbjct: 234 LYKSITVRLAGKLDLNVADMRTAFSFVVLFFQRALIADYRSAGKVPFSLRTQHLRRLAAE 293 Query: 287 RQRMIAEIFTSGMYKRLITNVRSVKVGYQALLWSFRLWQ 325 ++ + E + G + + + L+ Sbjct: 294 QRSFMKEFYHPGYKADKWGR----QFLLHGFCFCYNLYM 328 >UniRef50_D1WVE3 CDP-glycerol:poly(Glycerophosphate) glycerophosphotransferase n=3 Tax=Streptomyces RepID=D1WVE3_9ACTO Length = 1151 Score = 251 bits (642), Expect = 3e-65, Method: Composition-based stats. Identities = 61/243 (25%), Positives = 103/243 (42%), Gaps = 6/243 (2%) Query: 6 NKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRL 65 +L+V++PLYN + C+ SL QT LE+++++DGSTD+ +A+ +A+ P RL Sbjct: 3 PRLTVVVPLYNVEEYLGACLTSLAEQTMPDLEVVLVDDGSTDDGPSLAQEFADRDPRFRL 62 Query: 66 LHQANAGASVARNRGIEVAT--GKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 + Q NAG ARN GI A +++ FVD+DD V P Y ++ D A N Sbjct: 63 IRQENAGLGAARNAGIREARPGAEFLTFVDSDDVVPPGAYARMLAELDASGSDFATGNVL 122 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 + + + T + L R V+RR ++ F Sbjct: 123 RLRSDGELEQSPMFRRPMEKARPATHVTRDWILLGDRI----ACNKVFRRTFWDEHAFAF 178 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKIT 243 G+ ++DI F A E+ +Y + + S++ + ++ Sbjct: 179 PTGVLYEDIAVVLPAHFLARSVDVIEEPVYHWRDRDGSITTRRAVARGIRDRVTAVTTVS 238 Query: 244 RLL 246 R L Sbjct: 239 RFL 241 >UniRef50_C9BUH6 Glycosyltransferase n=1 Tax=Enterococcus faecium 1,231,408 RepID=C9BUH6_ENTFC Length = 353 Score = 251 bits (642), Expect = 3e-65, Method: Composition-based stats. Identities = 74/337 (21%), Positives = 141/337 (41%), Gaps = 15/337 (4%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 MNS++ +SVIIP+YN +C+ES+ QT+ L+III++DGS D+S +I + YA Sbjct: 1 MNSSDLISVIIPVYNVEKYVGSCLESVCNQTYKNLDIIIVDDGSKDSSNKICEDYALKDS 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 +R++H+ NAG +ARN G+E GKYVAF+D+DD V E ++ + D Sbjct: 61 RIRIIHKNNAGLGLARNTGLEHVKGKYVAFIDSDDFVAANYIEAMLNGIKRSNADTCYAG 120 Query: 122 ADWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGL-------SSRRWTHVVWMGVYRRD 174 + + + D T L + VW +Y + Sbjct: 121 YYEYYDDNNIISKPARYDSTIFTNNAIIEKVLLGMIGTEPSYKVDFILPMSVWHALYSSE 180 Query: 175 VIVKNNIKFIAGLHH--QDIVWTTEFMFNALRARYTEQSLYKYYL-HNTSVSRLHRQGNK 231 +I ++NIKF + + +DI++ ++ ++ + + +Y Y +N S++ R Sbjct: 181 IIQQHNIKFPSERDYISEDIIFHISYLQHSKITSFIKDCIYYYRCNNNNSLTHNFRDNEF 240 Query: 232 NLNYQRHYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMI 291 + N I++ + +E + + + + + + + Sbjct: 241 DCN-----IELVKKIEGDLSYILENEIFVNKTRRLLLGRCRTCIQKAVRYASFNKEFDLN 295 Query: 292 AEIFTSGMYKRLITNVRSVKVGYQALLWSFRLWQWRD 328 + + M LI + S + + + Sbjct: 296 SYLSKILMNDELIACIHSYPYKQNPIKQRIFNYFLLN 332 >UniRef50_B7AIU6 Putative uncharacterized protein n=1 Tax=Bacteroides eggerthii DSM 20697 RepID=B7AIU6_9BACE Length = 342 Score = 251 bits (641), Expect = 4e-65, Method: Composition-based stats. Identities = 85/340 (25%), Positives = 149/340 (43%), Gaps = 18/340 (5%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 K+S+I+P+Y C+E+LI Q+ +EII+INDGS DNS EI YA + Sbjct: 2 EIPKISIIVPIYKVEKYLERCIETLINQSIKNIEIILINDGSPDNSSEICDKYALTDLRI 61 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 +++H+ N G +ARN G+EVATG Y+AFVD+DD V MY+ L A + D CN Sbjct: 62 KVIHKENEGLGLARNTGLEVATGTYIAFVDSDDYVDINMYKKLAEYADKHHCDAVFCNFY 121 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLS------SRRWTHVVWMGVYRRDVIV 177 S +D L M S R++ VW +Y+ +I Sbjct: 122 TEINNGKWRNSSEVSDYELWEKSKIHSFMLDMIASKPHEKLERKYQMSVWHAIYKHTIID 181 Query: 178 KNNIKFIAGL--HHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLH-RQGNKNLN 234 K NI+F++ +DI + +F+ + R Y + Y Y ++ S+++ + Sbjct: 182 KENIRFLSERVVVSEDIPFQVDFLSASNRIGYIPNAYYYYCKNDNSLTKNFLTEKYDRFK 241 Query: 235 YQRHYIKITRLLEKLNRNYADKIMIY-----PEFHQQITYEALRVCHAVRKEPDILTRQR 289 R+++ + + + + Y E + + + K+P T +R Sbjct: 242 NLRNHLLMKNSTIEYKQRVNKLFIGYCRSYITELINSNQNNKKGILNIIVKDPIWETIKR 301 Query: 290 MIAE----IFTSGMYKRLITNVRSVKVGYQALLWSFRLWQ 325 I++ YK +++ + + Y + + + + Sbjct: 302 EYKSSYLPIYSRIYYKIILSKNSRILLIYSSFINLIKKIR 341 >UniRef50_C1DN10 Glycosyl transferase, family 2 protein n=1 Tax=Azotobacter vinelandii DJ RepID=C1DN10_AZOVD Length = 328 Score = 251 bits (641), Expect = 4e-65, Method: Composition-based stats. Identities = 68/310 (21%), Positives = 131/310 (42%), Gaps = 10/310 (3%) Query: 6 NKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRL 65 K+SVIIP+YN C++S + Q + LEII ++DGSTD S+EI + + + ++ + Sbjct: 4 EKVSVIIPVYNVAPYLAQCLDSAVNQDYDNLEIIAVDDGSTDTSLEILEAFRARHGNLLV 63 Query: 66 LHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWC 125 N G S+ARN G+E+ATG+Y+ FVD+DD + T + + +D+ + Sbjct: 64 RSTKNQGVSIARNTGLELATGEYILFVDSDDFLERQAISTCVEILGRQQVDMVFFASTVF 123 Query: 126 FRETGETWQSIPTDRLRST---GVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIK 182 F E+ + + + + + + + ++ +YRR ++ Sbjct: 124 FDGVDESLRENFRGERVPDLHGAYIPAHTFFIQSIRLKSYLMMPYLYMYRRKEFE--GLR 181 Query: 183 FIAGLHHQDIVWTTEFMFNAL--RARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYI 240 F + H+D ++TT + R LY + S+ +Q Y Sbjct: 182 FYPEIVHEDNLFTTRLLLEKEHARIACIPDRLYNRRVRPESIMTQKKQKRHINGYLVVAQ 241 Query: 241 KITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRK-EPDILTRQRMIAEIFTSGM 299 ++ ++ KL ++ +F Q AL C + R R++ + + Sbjct: 242 ELLKI--KLPPENSEAGAALNQFVQLNLLRALETCKLTYAGRLPLHVRGRLVMLFARTKI 299 Query: 300 YKRLITNVRS 309 +R + + Sbjct: 300 ERRKVKRILF 309 >UniRef50_B5CU60 Putative uncharacterized protein n=1 Tax=Bacteroides plebeius DSM 17135 RepID=B5CU60_9BACE Length = 351 Score = 251 bits (641), Expect = 4e-65, Method: Composition-based stats. Identities = 74/348 (21%), Positives = 141/348 (40%), Gaps = 10/348 (2%) Query: 4 STNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHV 63 K+S+I LYN ++ L+ QT+ LEIIII+DGSTD+SVE + A + + Sbjct: 2 KNYKVSIITCLYNVSRFLPAGLDCLLNQTYPNLEIIIIDDGSTDDSVEKCRKVAAHDARI 61 Query: 64 RLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNAD 123 ++ + N G ARN G++ ATG ++ F D DD + T+ E + + + +LD+ Sbjct: 62 KIYSKENGGLGSARNWGLDYATGDFIYFYDVDDRIELTLVERNVQLMDKHNLDLIIFGFY 121 Query: 124 WCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKF 183 + T + L + ++ L R W YRR I ++++F Sbjct: 122 VYDEKFHTTDSLQFQETLIESNEKLRGIYMDKLLLVRYGNGFAWNKFYRRSFIEAHHLRF 181 Query: 184 IAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKIT 243 QD V+ +F R + LY Y++++T +R ++ Y + + Sbjct: 182 ENQRIQQDEVFNLKFYPKLQRVYLSPDILYHYFIYDTGNTRSRFIPDRYEIYLSIFQHLM 241 Query: 244 RLLEKL-NRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKR 302 ++ ++ K Y F+ IT L + R++ I I + Sbjct: 242 TFSKEWGIQDERFKQYAYSRFYAGITNSILYNAFHPNAPYSNVERKKDILRILQKEEVRE 301 Query: 303 LITNVRSVKVGYQALLWSFR---------LWQWRDKTRSHHRITRSAF 341 ++ ++ + +R + W++ + + + F Sbjct: 302 CLSYIQENQPMGIETYLYWRAFTQMSYRQIVLWKNLFNTLRLLKQKMF 349 >UniRef50_C7QZJ2 Glycosyl transferase family 2 n=1 Tax=Jonesia denitrificans DSM 20603 RepID=C7QZJ2_JONDD Length = 1116 Score = 250 bits (640), Expect = 5e-65, Method: Composition-based stats. Identities = 63/265 (23%), Positives = 105/265 (39%), Gaps = 9/265 (3%) Query: 11 IIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLLHQAN 70 IIP++N D C++S+ TQ+ +EII I+DGSTD S ++ K YAE P +R+ Q N Sbjct: 2 IIPVFNVQDYLDECLDSIATQSLKDIEIIAIDDGSTDTSFDLLKQYAEKEPRLRIFTQEN 61 Query: 71 AGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADWCFRETG 130 +G RNRG+ +A G+++ FVD+DD + Y + + D + Sbjct: 62 SGLGATRNRGLSLARGRFITFVDSDDTIPHNAYSIMAKTLEKTGSDFVIGQIYR-TKGEQ 120 Query: 131 ETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIAGLHHQ 190 E W + + + G D L +Y++ +NN KF G+ + Sbjct: 121 EIWPKWSRNLHDTERLKIGIDDFPEILRDFYSP----NKLYKKSFWDQNNFKFREGVIFE 176 Query: 191 DIVWTTEFMFNALRARYTEQSLYKYYLHNT--SVSRLHRQGNKNLNYQRHYIKITRLL-- 246 D T+ A Y + + S+S + L Sbjct: 177 DQPLITQIFTKAKSFDVITTPTYCWKIREDGTSLSSNLFTEKMLKARIEASKLTVQYLIQ 236 Query: 247 EKLNRNYADKIMIYPEFHQQITYEA 271 E N ++ ++H +A Sbjct: 237 ESTPSNLEAWQTVFLKWHMMNYLKA 261 >UniRef50_C0FV38 Putative uncharacterized protein n=1 Tax=Roseburia inulinivorans DSM 16841 RepID=C0FV38_9FIRM Length = 329 Score = 250 bits (640), Expect = 5e-65, Method: Composition-based stats. Identities = 67/329 (20%), Positives = 141/329 (42%), Gaps = 4/329 (1%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 K+S+IIP+YN+ + C+ES++ QT+ EI++++DGSTD+S I YA+ + + Sbjct: 2 KISIIIPVYNSTLYLKQCVESILAQTYHNFEILLVDDGSTDDSPMICDEYAQKDDRIVTI 61 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEVY-PTMYETLMTMALEDDLDVAQCNADWC 125 H+ N G S ARN G+E A+G Y+ F+D DD P ++ + E + DV Sbjct: 62 HKQNGGTSDARNVGLEKASGDYITFMDNDDYWSNPDALCDIIKVISETEPDVVMHANMVY 121 Query: 126 FRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIA 185 +++ +T DR +G + + + + + VW + + +I ++++ F Sbjct: 122 WQDKNKTVNPSSFDRTLVSG-KKRKEAVENLIKAGMLSVTVWTKLVKTSLIREHDLYFKK 180 Query: 186 GLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRL 245 G+ ++D W + A R + ++ Y Y + + + +N ++ I Sbjct: 181 GIRNEDTDWIARMLICAERFEWYDKPFYVYRKGHETAQTSQKMKYYMVNDLKNIIIEYTQ 240 Query: 246 LEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKRLIT 305 + + + + Y + + ++ + + R I + + + + Sbjct: 241 YAEKYLDQEYRKVFYAYLAYPFAVWMGQAY--LIEQNEQIKNDREIMKKYAYLLTYDINP 298 Query: 306 NVRSVKVGYQALLWSFRLWQWRDKTRSHH 334 V V Y+ L + + + Sbjct: 299 AVHLVHRVYRILGFGLTSRILMLYIKKVY 327 >UniRef50_B6FPH2 Putative uncharacterized protein n=1 Tax=Clostridium nexile DSM 1787 RepID=B6FPH2_9CLOT Length = 344 Score = 250 bits (640), Expect = 5e-65, Method: Composition-based stats. Identities = 69/345 (20%), Positives = 123/345 (35%), Gaps = 10/345 (2%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 +SVI+P+YN C+ESL Q++ EI++I+DGSTD S + + Y V+ Sbjct: 1 MPFISVIVPVYNVEKYITRCVESLTGQSFQDYEILLIDDGSTDKSGILCDEIEKGYDKVK 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 H+ N G + ARN G++ A GKYV FVD+DD V E + E+ +D+ + Sbjct: 61 TYHKRNGGLASARNYGLKCAEGKYVVFVDSDDWVTDDFLEFIYNHLNENPVDILKYGFQR 120 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDW-------LRMGLSSRRWTHVVWMGVYRRDVIV 177 D D +++ + Y + Sbjct: 121 VSDNKLGEVIVPYYDEGIYKKAQIKSDILPGAIGPIQLFNYDKNALMSACTCAYSTSFLK 180 Query: 178 KNNIKFIAG--LHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNY 235 KNNI F++ + ++D ++ + A + T + LY Y S+S+ + Sbjct: 181 KNNISFVSERKILNEDHLFNFHALLCADKVEVTHKILYLYDFREGSLSKCYINQMVERK- 239 Query: 236 QRHYIKITRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIF 295 LL+K + Y + C K + + I Sbjct: 240 IELIKTYKNLLDKYKVFSEFEEEYYSQCVDSFYACITNECGNWGKNRKFKEISKKVYGIL 299 Query: 296 TSGMYKRLITNVRSVKVGYQALLWSFRLWQWRDKTRSHHRITRSA 340 + K + + + + + + + K+ I Sbjct: 300 KNPNCKMALKKCKHNNLTLKGYIIYYLMKFKMSKSIVWLYIKMKG 344 >UniRef50_C8PLK6 Ss-1,4-galactosyltransferase n=1 Tax=Campylobacter gracilis RM3268 RepID=C8PLK6_9PROT Length = 421 Score = 250 bits (639), Expect = 5e-65, Method: Composition-based stats. Identities = 64/250 (25%), Positives = 114/250 (45%), Gaps = 24/250 (9%) Query: 2 MNSTNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYP 61 M+ +SVIIP+YN + R C+E + QT+ LEI++++DGS D S I YA Sbjct: 1 MSVGPLISVIIPVYNVKEFLRACVERITAQTYANLEILLVDDGSNDGSEAICDEYAARDL 60 Query: 62 HVRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 VR+LH++N G + ARN ++VA G++++FVD+DD + + ETL + + +A C Sbjct: 61 RVRVLHKSNGGQASARNAALDVARGEFISFVDSDDLISLDLIETLFRLTQKFCTKIAMCG 120 Query: 122 ADWCFRETGETWQSIPTDRLRSTG--------------------VLTGPDWLRMGLSSRR 161 E+ + + ++ + R + Sbjct: 121 YAAFSDESEINNFKAALNSKENQNSKEISNPGGVANSNANAGEYKISPQELFRRSCTQDP 180 Query: 162 WTHV-VWMGVYRRDVIVKNNIKFIAGLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNT 220 + VW ++ R++ ++F AG ++D+ + NA A YT++ LY Y + Sbjct: 181 YFSTAVWRALFAREIFTA--LRFPAGQIYEDVAIFFDIF-NASIAAYTDEILYFYRVRAG 237 Query: 221 SVSRLHRQGN 230 S + + Sbjct: 238 STVNSFARRH 247 >UniRef50_C9K1G9 Putative glycosyltransferase n=1 Tax=Klebsiella pneumoniae RepID=C9K1G9_KLEPN Length = 342 Score = 250 bits (639), Expect = 5e-65, Method: Composition-based stats. Identities = 79/348 (22%), Positives = 153/348 (43%), Gaps = 24/348 (6%) Query: 4 STNKLSVIIPLYNA-GDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPH 62 + +SVI+P+YN + TC++S+ QT +EII++NDGST+N +E+ ++ Sbjct: 2 NMPIISVILPIYNIKEEYLETCIQSVCRQTLDDIEIILVNDGSTNNCLEVCTKFSHLDNR 61 Query: 63 VRLLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNA 122 + ++ Q N+G SVARN G+++A G+++AFVD DD + P +L A + ++ C+ Sbjct: 62 IIVIDQPNSGVSVARNSGLDIARGEWIAFVDPDDWLEPDYLLSLYNNA-SSNTEIVICDC 120 Query: 123 DWCFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRW---------THVVWMGVYRR 173 + +P + + GVL + L L S+ V W +++ Sbjct: 121 NVFSNGEKRGNNFLPANSI---GVLKNKNLLLGQLVSKVIAGYYPQLIGPGVPWGKLFKH 177 Query: 174 DVIVKNNIKFIAGLH-HQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKN 232 D I K N+KF+ G+ QD ++ + A + YT ++ Y Y + N S S Sbjct: 178 DFIKKYNLKFVPGMVRMQDNIFCLYAFYYASQVFYTGENKYNYRMENNSAS-FKYNKKII 236 Query: 233 LNYQRHYIKITRLLEKLNRNYADKIMIYP-----EFHQQITYEALRVCHAVRKEPDILTR 287 ++++++ ++ ++K + + Y H + E D L Sbjct: 237 SHFEKYFEEVKAFIDKCMSGNERYQKALDAKTLTSINSYLLYYFCHDNHTIESENDTLGN 296 Query: 288 QRMIAEIFTSGMYKRLITNVRSVKVGYQALLWSFRLW-QWRDKTRSHH 334 ++ + KR+ + + +L RL+ + + Sbjct: 297 --LMCRQPYASALKRVDFTYLTTQEKIFVILLKLRLYGVLKLIYKIKR 342 >UniRef50_D1XZD8 Glycosyltransferase, group 2 family protein n=1 Tax=Prevotella bivia JCVIHMP010 RepID=D1XZD8_9BACT Length = 320 Score = 250 bits (639), Expect = 6e-65, Method: Composition-based stats. Identities = 74/328 (22%), Positives = 139/328 (42%), Gaps = 12/328 (3%) Query: 5 TNKLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVR 64 K+S+I+P+YNA C++SL+ Q ++ EI++I+DGS D S +I YA+ V+ Sbjct: 1 MVKVSIIVPIYNAEKYLHRCVDSLLAQDFSDYEILLIDDGSLDASSKICDDYAQKECKVK 60 Query: 65 LLHQANAGASVARNRGIEVATGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCNADW 124 + HQ N+G + AR GIE A G+Y VDADD V M + L + A + D+ C+ Sbjct: 61 VFHQKNSGVAAARQLGIEQAQGEYTIHVDADDWVDAAMLKELWSKAEDTKADMVICDYFE 120 Query: 125 CFRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFI 184 ++ + L++ L + +R L + +R IKF Sbjct: 121 EYQNKTKRI-------LQNCDYLKKSELIRALLQQ--LHGSCCNKLVKRACYNNVGIKFD 171 Query: 185 AGLHH-QDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLH-RQGNKNLNYQRHYIKI 242 + + +D+++ + ++ Y + Y Y ++ S + K ++ + + Sbjct: 172 TMVSYCEDLLFNVSVIQTGIKVVYLPMAFYHYDQYSNGTSLVKGMSIKKLMSRIQVIKSL 231 Query: 243 TRLLEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRMIAEIFTSGMYKR 302 +L++ Y D + + + + + E + IA + Sbjct: 232 EHILKEDLSMYEDLLFLKKAKVKDDVFFGNYYSKELFIELFREINHQYIANYRFPKDKRP 291 Query: 303 LITNVRSVKVGYQALL-WSFRLWQWRDK 329 L + V GY L F+L++W + Sbjct: 292 LQSAVCLALSGYYGLAQALFKLYKWIRR 319 >UniRef50_B1C3W3 Putative uncharacterized protein n=1 Tax=Clostridium spiroforme DSM 1552 RepID=B1C3W3_9FIRM Length = 323 Score = 250 bits (639), Expect = 7e-65, Method: Composition-based stats. Identities = 71/321 (22%), Positives = 145/321 (45%), Gaps = 6/321 (1%) Query: 7 KLSVIIPLYNAGDDFRTCMESLITQTWTALEIIIINDGSTDNSVEIAKYYAENYPHVRLL 66 S+I+P+YN + C++S++ QT+ EII+++DGS D S I A Y ++++ Sbjct: 2 LFSIILPIYNVENYLCECVDSILNQTYKDYEIILVDDGSKDKSPHICDELAHKYDCIKVI 61 Query: 67 HQANAGASVARNRGIEVATGKYVAFVDADDEV-YPTMYETLMTMALEDDLDVAQCNADWC 125 H+ N G S ARN G + A GKY+ ++D+DD + L + +D+ Sbjct: 62 HKKNGGLSDARNAGTKEAIGKYIVYIDSDDYILDKEFLSKLAQK-TKTGVDLIFYKYQKY 120 Query: 126 FRETGETWQSIPTDRLRSTGVLTGPDWLRMGLSSRRWTHVVWMGVYRRDVIVKNNIKFIA 185 F ET + T L + L + + + + + W+ +R +IV+NNI F Sbjct: 121 FNETKKLEDCTYTYSLAMSETLYANK-IEALVKADAFYGMAWIKAVKRKLIVENNINFEV 179 Query: 186 GLHHQDIVWTTEFMFNALRARYTEQSLYKYYLHNTSVSRLHRQGNKNLNYQRHYIKITRL 245 GL +D+ W + +FNA + ++ + Y S++ H N +++ K + Sbjct: 180 GLLGEDMDWNYQVIFNASTIEFIDEPMIAYRQREGSITSTHTLKN-LVDFVYILEKWSNR 238 Query: 246 LEKLNRNYADKIMIYPEFHQQITYEALRVCHAVRKEPDILTRQRM-IAEIFTSGMYKRLI 304 + ++ KI +Y + + + + I ++ + + GM +R Sbjct: 239 ITNDIKDEKLKIALYGSLAKYYSNLLVVYSRLSDSKKKIYIKRIKNLGWLLKYGMSQRPQ 298 Query: 305 TNVRSVKV-GYQALLWSFRLW 324 + ++ G++ + + +++ Sbjct: 299 MIYKIYRIAGFRLTIIALKIF 319 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.315 0.173 0.600 Lambda K H 0.267 0.0529 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 2,383,830,460 Number of Sequences: 3077464 Number of extensions: 149372437 Number of successful extensions: 522251 Number of sequences better than 1.0e-01: 250 Number of HSP's better than 0.1 without gapping: 10591 Number of HSP's successfully gapped in prelim test: 9657 Number of HSP's that attempted gapping in prelim test: 476704 Number of HSP's gapped (non-prelim): 26975 length of query: 344 length of database: 1,040,396,356 effective HSP length: 129 effective length of query: 215 effective length of database: 643,403,500 effective search space: 138331752500 effective search space used: 138331752500 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (20.7 bits) S2: 94 (40.5 bits)