BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (339 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P27128 Lipopolysaccharide 1,3-galactosyltransferase n=4... 705 0.0 UniRef50_D2TIX6 UDP-galactose:(Glucosyl) LPS alpha-1,3-galactosy... 499 e-140 UniRef50_Q9ZIT4 UDP-galactose:(Glucosyl) LPS alpha1,3-galactosyl... 372 e-102 UniRef50_Q9R9D1 UDP-glucose:(Glucosyl) LPS alpha1,3-glucosyltran... 358 2e-97 UniRef50_B6XJW2 Putative uncharacterized protein n=1 Tax=Provide... 316 7e-85 UniRef50_D0KD53 Lipopolysaccharide 3-alpha-galactosyltransferase... 310 7e-83 UniRef50_D2TY85 UDP-D-galactose:(Glucosyl)lipopolysaccharide-alp... 285 2e-75 UniRef50_UPI000197C525 UDP-D-galactose:(glucosyl) lipopolysaccha... 248 3e-64 UniRef50_UPI00019F16C6 hypothetical protein CATC2_20202 n=1 Tax=... 203 6e-51 UniRef50_D2U322 Lipopolysaccharide 1,2-glucosyltransferase n=1 T... 202 1e-50 UniRef50_A8ARL6 Putative uncharacterized protein n=3 Tax=Citroba... 201 2e-50 UniRef50_B6XJW1 Putative uncharacterized protein n=1 Tax=Provide... 201 4e-50 UniRef50_Q9ZIT6 UDP-glucose:(Galactosyl) LPS alpha1,2-glucosyltr... 199 2e-49 UniRef50_Q9ZIS1 UDP-galactose:(Galactosyl) LPS alpha1,2-galactos... 196 7e-49 UniRef50_B2PV91 Putative uncharacterized protein n=2 Tax=Enterob... 196 7e-49 UniRef50_D0KD54 Lipopolysaccharide glucosyltransferase I n=2 Tax... 190 5e-47 UniRef50_D1P7H1 Lipopolysaccharide 1,2-glucosyltransferase n=1 T... 189 1e-46 UniRef50_Q9ZIS6 UDP-galactose:(Glucosyl) LPS alpha1,2-galactosyl... 188 2e-46 UniRef50_P27129 Lipopolysaccharide 1,2-glucosyltransferase n=44 ... 183 6e-45 UniRef50_C1DGU7 Lipopolysaccharide 1,3-galactosyltransferase n=1... 154 4e-36 UniRef50_Q46Y64 Glycosyl transferase, family 8 n=1 Tax=Ralstonia... 128 4e-28 UniRef50_C3X7M2 Lipopolysaccharide 3-alpha-galactosyltransferase... 121 3e-26 UniRef50_Q116W1 Glycosyl transferase, family 8 n=1 Tax=Trichodes... 106 1e-21 UniRef50_UPI0001B4A0A4 putative glucosyltransferase n=1 Tax=Bact... 105 3e-21 UniRef50_C7QL87 Glycosyl transferase family 8 n=2 Tax=Cyanothece... 105 3e-21 UniRef50_Q5LF36 Putative glucosyltransferase n=1 Tax=Bacteroides... 100 7e-20 UniRef50_UPI0001A457E5 hypothetical protein NsubN_08151 n=1 Tax=... 98 3e-19 UniRef50_D1PTN4 Putative uncharacterized protein n=1 Tax=Prevote... 98 4e-19 UniRef50_C0WCJ1 Lipopolysaccharide 1,2-glucosyltransferase n=1 T... 97 8e-19 UniRef50_A1XRC1 Glycosyltransferase n=1 Tax=Haemophilus ducreyi ... 95 3e-18 UniRef50_D2ELM0 Lipopolysaccharide biosynthesis glycosyltransfer... 94 7e-18 UniRef50_C3PWZ8 Lipopolysaccharide 1,2-glucosyltransferase n=1 T... 94 7e-18 UniRef50_A8ARL4 Putative uncharacterized protein n=2 Tax=Citroba... 93 2e-17 UniRef50_A0KQP2 Glycosyl transferase, family 8 n=2 Tax=Aeromonas... 92 4e-17 UniRef50_Q64ZV2 Lipopolysaccharide 1,2-glucosyltransferase n=8 T... 87 8e-16 UniRef50_B7AIV0 Putative uncharacterized protein n=1 Tax=Bactero... 86 2e-15 UniRef50_C0QZN2 Glycosyl transferase, family 8 n=3 Tax=Brachyspi... 85 4e-15 UniRef50_Q9L6B2 Putative glycosyl transferase n=3 Tax=Pasteurell... 84 5e-15 UniRef50_C1QEC6 LPS:glycosyltransferase n=1 Tax=Brachyspira murd... 84 7e-15 UniRef50_Q03HK5 Lipopolysaccharide biosynthesis glycosyltransfer... 83 1e-14 UniRef50_C5WAK3 Ybl156 protein n=2 Tax=Enterobacteriaceae RepID=... 82 2e-14 UniRef50_C1QBZ8 LPS:glycosyltransferase n=1 Tax=Brachyspira murd... 82 3e-14 UniRef50_A5ZFA9 Putative uncharacterized protein n=1 Tax=Bactero... 81 5e-14 UniRef50_A1BHG0 Glycosyl transferase, family 8 n=2 Tax=Chlorobiu... 80 7e-14 UniRef50_A4ECW2 Putative uncharacterized protein n=1 Tax=Collins... 79 2e-13 UniRef50_C6I3U6 Putative uncharacterized protein n=2 Tax=Bactero... 79 2e-13 UniRef50_D2MYR1 Putative uncharacterized protein n=1 Tax=Campylo... 79 2e-13 UniRef50_B0NR59 Putative uncharacterized protein n=1 Tax=Bactero... 78 4e-13 UniRef50_Q38VK7 Putative bifunctional glycosyl transferase, fami... 78 4e-13 UniRef50_B2UPJ4 Glycosyl transferase family 8 n=1 Tax=Akkermansi... 78 5e-13 UniRef50_D2RIJ7 Glycosyl transferase family 8 n=1 Tax=Acidaminoc... 78 5e-13 UniRef50_A6LGX5 Glycosyltransferase family 8 n=1 Tax=Parabactero... 77 8e-13 UniRef50_Q4HGS8 General stress protein A, putative n=2 Tax=Campy... 77 1e-12 UniRef50_B3Z5I6 Glycosyltransferase family 8 n=2 Tax=Bacillus ce... 76 1e-12 UniRef50_C7IBC1 Glycosyl transferase family 8 n=2 Tax=Clostridiu... 76 2e-12 UniRef50_C7IBC8 Glycosyl transferase family 8 n=1 Tax=Clostridiu... 75 3e-12 UniRef50_B3CA80 Putative uncharacterized protein n=1 Tax=Bactero... 75 4e-12 UniRef50_C7HS13 Family 8 glycosyl transferase n=2 Tax=Anaerococc... 75 5e-12 UniRef50_Q3DNA2 Glycosyl transferase, family 8 n=9 Tax=Streptoco... 74 7e-12 UniRef50_D0D9G3 Putative general stress protein A n=1 Tax=Citrei... 74 7e-12 UniRef50_P43974 Putative glycosyltransferase HI0258 n=32 Tax=Hae... 74 1e-11 UniRef50_C9RWX3 Glycosyl transferase family 8 n=8 Tax=Bacillacea... 72 2e-11 UniRef50_B2ISC6 Glycosyl transferase, family 2/glycosyl transfer... 72 2e-11 UniRef50_UPI000196958D hypothetical protein BACCELL_01586 n=1 Ta... 72 4e-11 UniRef50_B4RJG6 LgtC n=29 Tax=Neisseria RepID=B4RJG6_NEIG2 72 4e-11 UniRef50_C6IB51 Glycosyltransferase n=4 Tax=Bacteroides RepID=C6... 72 4e-11 UniRef50_P25148 General stress protein A n=3 Tax=Bacillus subtil... 72 4e-11 UniRef50_C3XGD2 Putative uncharacterized protein n=1 Tax=Helicob... 71 5e-11 UniRef50_B8ISQ5 Glycosyl transferase family 8 n=1 Tax=Methylobac... 71 5e-11 UniRef50_C5ZV11 Glycosyl transferase n=2 Tax=Helicobacter canade... 71 5e-11 UniRef50_C0Z4I4 Putative uncharacterized protein gspA n=1 Tax=Br... 70 8e-11 UniRef50_B1I7M9 Glycosyl transferase, family 8 n=16 Tax=Streptoc... 70 1e-10 UniRef50_A7H2M2 Glycosyl transferase family 8 n=3 Tax=Campylobac... 70 1e-10 UniRef50_C2HBB8 Family 8 glycosyltransferase n=13 Tax=Lactobacil... 69 2e-10 UniRef50_C4VEI8 General stress protein A n=24 Tax=Enterococcus R... 69 2e-10 UniRef50_A3V3C9 Putative uncharacterized protein n=1 Tax=Loktane... 69 3e-10 UniRef50_C2HBB9 Family 8 glycosyltransferase n=35 Tax=Lactobacil... 69 3e-10 UniRef50_Q50FU8 Cj81-079 (Fragment) n=6 Tax=Campylobacter jejuni... 69 4e-10 UniRef50_UPI0001AEC697 glycosyl transferase family protein n=1 T... 68 4e-10 UniRef50_C0EQT1 Putative uncharacterized protein n=1 Tax=Neisser... 68 4e-10 UniRef50_B1MX28 Lipopolysaccharide biosynthesis protein, LPS:gly... 68 5e-10 UniRef50_A7H2X4 Glycosyl transferase family 8 n=2 Tax=Campylobac... 68 5e-10 UniRef50_B9KVD4 Glycosyl transferase, family 8 n=2 Tax=Rhodobact... 67 6e-10 UniRef50_A4UX76 Putative LPS biosynthesis protein n=2 Tax=Lactob... 67 7e-10 UniRef50_B7AUG6 Putative uncharacterized protein n=1 Tax=Bactero... 67 8e-10 UniRef50_A0ZYL4 Putative uncharacterized protein n=1 Tax=Archaea... 67 1e-09 UniRef50_B8G232 Glycosyl transferase family 8 n=2 Tax=Desulfitob... 67 1e-09 UniRef50_Q16CW9 Lipopolysaccharide 1,3-galactosyltransferase, pu... 67 1e-09 UniRef50_B7KM20 Glycosyl transferase family 8 n=1 Tax=Cyanothece... 66 2e-09 UniRef50_B3XL28 Glycosyl transferase family 8 n=1 Tax=Lactobacil... 65 3e-09 UniRef50_D2RIJ4 Glycosyl transferase family 8 n=2 Tax=Acidaminoc... 65 4e-09 UniRef50_Q3D426 Glycosyl transferase, family 8 n=7 Tax=Streptoco... 65 5e-09 UniRef50_A8AY72 Glycosyl transferase, family 8 SP1766 n=7 Tax=St... 64 8e-09 UniRef50_D2LIH7 Glycosyl transferase family 8 n=1 Tax=Rhodomicro... 63 1e-08 UniRef50_Q5WI33 Lipopolysaccharide glycosyltransferase n=6 Tax=F... 63 2e-08 UniRef50_C6LDU2 Glycosyl transferase, family 8 n=3 Tax=Firmicute... 63 2e-08 UniRef50_C6X2V2 Putative glycosyltransferase n=1 Tax=Flavobacter... 62 2e-08 UniRef50_C1CFZ1 Glycosyl transferase, family 8 n=17 Tax=Streptoc... 62 3e-08 UniRef50_Q65VF6 RfaJ protein n=1 Tax=Mannheimia succiniciproduce... 62 3e-08 UniRef50_C3XKY2 Glycosyl transferase n=2 Tax=Campylobacterales R... 62 3e-08 UniRef50_A5LNA9 Glycosyl transferase, family 8 n=2 Tax=Streptoco... 62 3e-08 UniRef50_C5SH34 Glycosyl transferase family 8 n=1 Tax=Asticcacau... 62 4e-08 UniRef50_C3XFW0 Glycosyl transferase n=1 Tax=Helicobacter bilis ... 61 5e-08 UniRef50_D1JY84 Putative uncharacterized protein n=1 Tax=Bactero... 61 6e-08 UniRef50_Q3D427 Glycosyl transferase, family 8 n=8 Tax=Streptoco... 61 6e-08 UniRef50_A8LP95 Lipopolysaccharide 1 n=1 Tax=Dinoroseobacter shi... 61 6e-08 UniRef50_C5EZG9 Glycosyl transferase family protein n=1 Tax=Heli... 61 7e-08 UniRef50_B1Y723 Glycosyl transferase family 8 n=1 Tax=Leptothrix... 60 8e-08 UniRef50_A3YS36 Putative sugar transferase n=2 Tax=Campylobacter... 60 8e-08 UniRef50_B3Q568 Putative glycosyltransferase protein n=4 Tax=Rhi... 60 1e-07 UniRef50_D1Y7U2 Glycosyltransferase, family 8 n=1 Tax=Pyramidoba... 60 1e-07 UniRef50_C5S494 Putative glycosyl transferase n=2 Tax=Actinobaci... 60 1e-07 UniRef50_C9PNX4 Family 8 glycosyl transferase n=1 Tax=Pasteurell... 60 1e-07 UniRef50_C6EQF4 Putative uncharacterized protein n=3 Tax=Campylo... 59 2e-07 UniRef50_UPI000190F79C lipopolysaccharide 1,2-glucosyltransferas... 59 2e-07 UniRef50_A7VNX5 Putative uncharacterized protein n=1 Tax=Clostri... 59 3e-07 UniRef50_B2ISC2 Glycosyl transferase, family 8 n=2 Tax=Streptoco... 59 3e-07 UniRef50_C3XN62 Glycosyl transferase n=1 Tax=Helicobacter wingha... 58 4e-07 UniRef50_C5S3F7 Putative glycosyl transferase n=2 Tax=Actinobaci... 58 5e-07 UniRef50_Q4JZJ9 Putative glycosyl transferase n=8 Tax=Streptococ... 58 5e-07 UniRef50_UPI0001A45357 glycosyl transferase family protein n=1 T... 57 7e-07 UniRef50_C2KV37 Lipopolysaccharide biosynthesis protein, LPS:gly... 57 8e-07 UniRef50_C7RG54 Glycosyl transferase family 8 n=1 Tax=Anaerococc... 57 1e-06 UniRef50_D2QX95 Glycosyl transferase family 8 n=1 Tax=Pirellula ... 57 1e-06 UniRef50_B7C7N8 Putative uncharacterized protein n=2 Tax=Firmicu... 57 1e-06 UniRef50_A4SAB5 Predicted protein (Fragment) n=1 Tax=Ostreococcu... 57 1e-06 UniRef50_C5ZVZ7 Putative glycosyltransferase n=3 Tax=Campylobact... 56 2e-06 UniRef50_A3CM53 Glycosyltransferase, putative n=9 Tax=Bacteria R... 56 2e-06 UniRef50_A4WWT5 Glycosyl transferase, family 8 n=1 Tax=Rhodobact... 56 2e-06 UniRef50_B1I7N1 Glycosyl transferase, family 8 n=8 Tax=Streptoco... 56 2e-06 UniRef50_C0X9Z8 Glycosyltransferase n=1 Tax=Lactobacillus gasser... 55 3e-06 UniRef50_B6VUC8 Putative uncharacterized protein n=1 Tax=Bactero... 55 3e-06 UniRef50_Q9L7A2 Putative glycosyl transferase n=1 Tax=Haemophilu... 55 3e-06 UniRef50_B2UQ54 Glycosyl transferase family 8 n=1 Tax=Akkermansi... 55 3e-06 UniRef50_C1QC03 LPS:glycosyltransferase n=1 Tax=Brachyspira murd... 55 4e-06 UniRef50_Q3DNS6 Glycosyl transferase, family 8 n=5 Tax=Streptoco... 54 5e-06 UniRef50_Q0I2Z7 Lipopolysaccharide biosynthesis protein, glycosy... 54 6e-06 UniRef50_Q2K5X3 Galactosyltransferase protein n=13 Tax=Rhizobium... 54 6e-06 UniRef50_A4VVV8 Lipopolysaccharide biosynthesis proteins, LPS:gl... 54 7e-06 UniRef50_B6GCA0 Putative uncharacterized protein n=1 Tax=Collins... 54 7e-06 UniRef50_C1IBL0 Glycosyl transferase n=1 Tax=Clostridium sp. 7_2... 54 7e-06 UniRef50_B3WD33 WbbM protein n=8 Tax=Lactobacillus RepID=B3WD33_... 54 8e-06 UniRef50_C4Z1V1 Putative uncharacterized protein n=1 Tax=Eubacte... 54 9e-06 UniRef50_C5ELK9 Glycosyl transferase n=3 Tax=Clostridiales RepID... 53 1e-05 UniRef50_C1QC00 LPS:glycosyltransferase n=1 Tax=Brachyspira murd... 53 1e-05 UniRef50_Q5M3K9 Glycosyl transferase n=4 Tax=Streptococcus RepID... 53 2e-05 UniRef50_UPI00016B2258 glycosyl transferase, family 8 n=1 Tax=ca... 52 2e-05 UniRef50_A5UC07 Aspartate-semialdehyde dehydrogenase n=7 Tax=Hae... 52 3e-05 UniRef50_B9KUH7 Lipopolysaccharide 3-alpha-galactosyltransferase... 52 3e-05 UniRef50_B7GNT4 Glycosyl transferase, family 8 n=5 Tax=Bifidobac... 52 3e-05 UniRef50_B9QZ95 Glycosyl transferase family 8 n=1 Tax=Labrenzia ... 52 4e-05 UniRef50_C8N7M8 Putative uncharacterized protein n=1 Tax=Cardiob... 52 4e-05 UniRef50_A5VK24 Glycosyl transferase, family 8 n=18 Tax=Lactobac... 51 5e-05 UniRef50_A7A7B4 Putative uncharacterized protein n=2 Tax=Bifidob... 50 1e-04 UniRef50_C8W7U9 Glycosyl transferase family 8 n=2 Tax=Atopobium ... 50 1e-04 UniRef50_C0AA16 Lipopolysaccharide biosynthesis protein LPS:glyc... 50 1e-04 UniRef50_A8FNA2 Putative uncharacterized protein n=2 Tax=Campylo... 50 1e-04 UniRef50_Q3DM64 Glycosyl transferase, family 8, degenerate n=6 T... 50 1e-04 UniRef50_C7TIE0 Glycosyl transferase, group 8 n=2 Tax=Lactobacil... 50 1e-04 UniRef50_C6IJ37 General stress protein A n=2 Tax=Bacteroides Rep... 50 2e-04 UniRef50_B1LK07 Glycosyl transferase family 8 n=10 Tax=Enterobac... 49 2e-04 UniRef50_B9ADW8 Putative uncharacterized protein n=1 Tax=Methano... 49 3e-04 UniRef50_A2DXT6 Glycosyl transferase family 8 protein n=1 Tax=Tr... 48 5e-04 UniRef50_C4ZG45 Glycosyltransferase Family 2 modular protein n=1... 48 5e-04 UniRef50_UPI00016C0890 glycosyl transferase family 8 protein n=2... 47 7e-04 UniRef50_D1N145 Glycosyl transferase family 8 n=1 Tax=Victivalli... 47 0.001 UniRef50_B3WD32 Glycosyl transferase n=9 Tax=Lactobacillus RepID... 46 0.002 UniRef50_Q92VQ2 Putative lipopolysaccharide 1,3-galactosyltransf... 45 0.003 UniRef50_A3VFX3 Glycosyltransferase n=1 Tax=Rhodobacterales bact... 45 0.003 UniRef50_Q1CSY7 Lipopolysaccharide 1,2-glycosyltransferase n=3 T... 45 0.004 UniRef50_C0X9Z7 Putative uncharacterized protein n=2 Tax=Lactoba... 44 0.006 UniRef50_B9BAZ6 Glycosyl transferase family 8 protein n=1 Tax=Bu... 44 0.006 UniRef50_C6DEN1 Glycosyl transferase family 8 n=1 Tax=Pectobacte... 44 0.008 UniRef50_A9S2B3 Predicted protein (Fragment) n=1 Tax=Physcomitre... 44 0.009 UniRef50_B2UQN6 Glycosyl transferase family 8 n=1 Tax=Akkermansi... 44 0.009 UniRef50_C0BAU3 Putative uncharacterized protein n=1 Tax=Coproco... 44 0.010 UniRef50_Q38VG7 Putative glycosyl transferase, family 8 n=1 Tax=... 43 0.015 UniRef50_C6DEN2 Glycosyl transferase family 8 n=1 Tax=Pectobacte... 43 0.018 UniRef50_Q5UNW1 Uncharacterized protein R707 n=1 Tax=Acanthamoeb... 43 0.019 UniRef50_A4IXE1 Glycosyl transferase, family 8 n=16 Tax=Francise... 42 0.023 UniRef50_A4UX79 LPS biosynthesis protein n=2 Tax=Lactobacillacea... 42 0.023 UniRef50_UPI0001693121 general stress protein n=1 Tax=Paenibacil... 42 0.024 UniRef50_B6JMU4 Lipopolysaccharide biosynthesis protein n=20 Tax... 42 0.028 UniRef50_A4MXF8 Biotin--protein ligase n=6 Tax=Haemophilus influ... 42 0.029 UniRef50_A2RLV8 Putative glycosyltransferase n=1 Tax=Lactococcus... 42 0.034 UniRef50_Q17VR5 Lipopolysaccharide biosynthesis protein n=19 Tax... 41 0.066 UniRef50_Q1RIL1 Lipopolysaccharide 1,2-glucosyltransferase RfaJ ... 41 0.077 UniRef50_Q1CUZ8 Lipopolysaccharide 1,2-glycosyltransferase n=12 ... 40 0.089 >UniRef50_P27128 Lipopolysaccharide 1,3-galactosyltransferase n=43 Tax=Enterobacteriaceae RepID=RFAI_ECOLI Length = 339 Score = 705 bits (1820), Expect = 0.0, Method: Compositional matrix adjust. Identities = 339/339 (100%), Positives = 339/339 (100%) Query: 1 MQQVFFQETEFLNSVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLC 60 MQQVFFQETEFLNSVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLC Sbjct: 1 MQQVFFQETEFLNSVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLC 60 Query: 61 FHIFTDYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYF 120 FHIFTDYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYF Sbjct: 61 FHIFTDYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYF 120 Query: 121 INKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKG 180 INKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKG Sbjct: 121 INKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKG 180 Query: 181 YFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYN 240 YFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYN Sbjct: 181 YFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYN 240 Query: 241 TQFSLNYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTAL 300 TQFSLNYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTAL Sbjct: 241 TQFSLNYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTAL 300 Query: 301 LKPNNSNQLRYSAKHMLKKHRYLKGFSNYLFYFIEKIKH 339 LKPNNSNQLRYSAKHMLKKHRYLKGFSNYLFYFIEKIKH Sbjct: 301 LKPNNSNQLRYSAKHMLKKHRYLKGFSNYLFYFIEKIKH 339 >UniRef50_D2TIX6 UDP-galactose:(Glucosyl) LPS alpha-1,3-galactosyltransferase n=1 Tax=Citrobacter rodentium ICC168 RepID=D2TIX6_CITRO Length = 340 Score = 499 bits (1285), Expect = e-140, Method: Compositional matrix adjust. Identities = 236/337 (70%), Positives = 272/337 (80%) Query: 1 MQQVFFQETEFLNSVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLC 60 MQQV+F+ETEFL S ID++H+ E + LDIAYG D+NFLFGCGISIAS+LK N L Sbjct: 1 MQQVYFKETEFLTSTIDFNHQDTAEKVVLDIAYGVDQNFLFGCGISIASVLKNNTDKTLH 60 Query: 61 FHIFTDYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYF 120 FH+F D F + DR+ FD LA QYKT I IYLIN + LRSLPSTKNWT+AIYFRF IADYF Sbjct: 61 FHVFIDAFNETDRRMFDKLAAQYKTHITIYLINCEHLRSLPSTKNWTYAIYFRFAIADYF 120 Query: 121 INKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKG 180 I K K+LYLDADIICQG I+ L+NFSF DK+A VVTEG+ADWWEKRA SLG GI KG Sbjct: 121 IGKTNKLLYLDADIICQGGIDELVNFSFASDKIAAVVTEGKADWWEKRALSLGTEGITKG 180 Query: 181 YFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYN 240 YFNSG +LIN QWA + +SARAI ML++P+I+ +ITHPDQDVLN+LLADKL F DIK+N Sbjct: 181 YFNSGLILINLNQWAIECISARAIKMLSDPDIVGRITHPDQDVLNILLADKLHFLDIKFN 240 Query: 241 TQFSLNYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTAL 300 TQFSLNYQLK+ FINPV NDTI IHYIGPTKPWH WA DY +S+ F++AK ASPWKNTAL Sbjct: 241 TQFSLNYQLKDKFINPVNNDTILIHYIGPTKPWHSWAGDYLISKPFIDAKQASPWKNTAL 300 Query: 301 LKPNNSNQLRYSAKHMLKKHRYLKGFSNYLFYFIEKI 337 LKP NSNQ RY AKHMLK RY+KG Y YF++KI Sbjct: 301 LKPTNSNQFRYCAKHMLKNKRYIKGMVGYFLYFMKKI 337 >UniRef50_Q9ZIT4 UDP-galactose:(Glucosyl) LPS alpha1,3-galactosyltransferase WaaI n=26 Tax=Enterobacteriaceae RepID=Q9ZIT4_ECOLX Length = 335 Score = 372 bits (956), Expect = e-102, Method: Compositional matrix adjust. Identities = 175/308 (56%), Positives = 227/308 (73%), Gaps = 1/308 (0%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 LDIA+G D+NFLFGCG++IASIL N FH+FTDY D D+ YF LA QY +RI Sbjct: 27 LDIAFGIDRNFLFGCGVAIASILLNNREISCEFHVFTDYISDKDKLYFSDLAKQYNSRIN 86 Query: 89 IYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSF 148 IY+IN D+L+SLPSTKNWT+A YFRF+IADYF +K K+LYLDADI C+G+I+ L+++ F Sbjct: 87 IYVINCDKLKSLPSTKNWTYATYFRFIIADYFYHKHEKILYLDADIACKGSIKELLDYQF 146 Query: 149 PDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLN 208 +++A VV E +WW+ RA L +A GYFN+GFLLIN +W +S++AI ML Sbjct: 147 STNEIAAVVAERDVEWWQNRASVLTTPQLASGYFNAGFLLINIDEWNLNNISSKAIEMLR 206 Query: 209 EPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIG 268 +P+ + KITH DQDVLN+LL K+ F KYNT++S+NY+LK+ NPV +DT+FIHY+G Sbjct: 207 DPDWVSKITHLDQDVLNVLLNGKVKFISEKYNTRYSINYELKDKVDNPVNDDTVFIHYVG 266 Query: 269 PTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKKHRYLKGFSN 328 PTKPWH+WA +YPVS++F+ AK ASPW LLKP NSNQ RY AKH K+ Y+ G N Sbjct: 267 PTKPWHEWA-NYPVSRSFLIAKAASPWSKEDLLKPVNSNQYRYCAKHKFKQKHYMAGIFN 325 Query: 329 YLFYFIEK 336 YL Y+ EK Sbjct: 326 YLKYYKEK 333 >UniRef50_Q9R9D1 UDP-glucose:(Glucosyl) LPS alpha1,3-glucosyltransferase WaaO n=29 Tax=Enterobacteriaceae RepID=Q9R9D1_ECOLX Length = 338 Score = 358 bits (918), Expect = 2e-97, Method: Compositional matrix adjust. Identities = 171/338 (50%), Positives = 234/338 (69%), Gaps = 3/338 (0%) Query: 1 MQQVFFQETEFLNSVIDYDHK-VETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRL 59 M +F E +N I +D + + +AYG DKNFLFGCG+SI S+L +N Sbjct: 1 MSAHYFNPQEMINKTIIFDERPAASVASSFHVAYGIDKNFLFGCGVSITSVLLHNSDVSF 60 Query: 60 CFHIFTDYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADY 119 FH+F D + D + LA Y+T I+I+L+N +RL++LP+TKNW+ A+YFRFVIADY Sbjct: 61 VFHVFIDDIPEADIQRLAQLAKSYRTCIQIHLVNCERLKALPTTKNWSIAMYFRFVIADY 120 Query: 120 FINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAK 179 FI++ K+LYLDADI CQG ++PLI ++ VA VVTE A+WW R SL + K Sbjct: 121 FIDQQDKILYLDADIACQGNLKPLITMDLANN-VAAVVTERDANWWSLRGQSLQCNELEK 179 Query: 180 GYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKY 239 GYFNSG LLINT WA + VSA+A++ML + I+ ++T+ DQD+LN++L K+ F D KY Sbjct: 180 GYFNSGVLLINTLAWAQESVSAKAMSMLADKAIVSRLTYMDQDILNLILLGKVKFIDAKY 239 Query: 240 NTQFSLNYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTA 299 NTQFSLNY+LK+SF+ P+ ++T+ IHY+GPTKPWH WA YP +Q F++AK ASPWKN Sbjct: 240 NTQFSLNYELKKSFVCPINDETVLIHYVGPTKPWHYWA-GYPSAQPFIKAKEASPWKNEP 298 Query: 300 LLKPNNSNQLRYSAKHMLKKHRYLKGFSNYLFYFIEKI 337 L++P NSN RY AKH K+++ + G NY++YF KI Sbjct: 299 LMRPVNSNYARYCAKHNFKQNKPINGIMNYIYYFYLKI 336 >UniRef50_B6XJW2 Putative uncharacterized protein n=1 Tax=Providencia alcalifaciens DSM 30120 RepID=B6XJW2_9ENTR Length = 333 Score = 316 bits (810), Expect = 7e-85, Method: Compositional matrix adjust. Identities = 150/311 (48%), Positives = 208/311 (66%), Gaps = 3/311 (0%) Query: 28 CLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTD-YFGDDDRKYFDALALQYKTR 86 C +AYG D NFL+G G+SI S+L +N + FHIF D D+D F + Y T+ Sbjct: 25 CQHVAYGIDHNFLYGSGVSIVSLLMHNPHIQFAFHIFIDNSMSDEDIAKFAEICHLYNTK 84 Query: 87 IKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINF 146 I IY I+ + ++ LP+TKNWTHAIYFRF+IA+YF +K +LYLDAD++C I+ L++ Sbjct: 85 ITIYFIDSNNVKKLPTTKNWTHAIYFRFIIAEYFKDKIDYLLYLDADVVCNRNIDELLSH 144 Query: 147 SFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAM 206 + +A VV E WW+KRA SLG ++KGYFNSG + IN W V+ +++A+ Sbjct: 145 NLLG-YIAAVVPERDKAWWQKRADSLGFPSVSKGYFNSGVMYINLRTWKTNNVTEKSMAL 203 Query: 207 LNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHY 266 L + E+ ++ +PDQDVLN+LL D ++F +NTQFSLNY+LK+SF PV T+FIHY Sbjct: 204 LMDNEVSHRLVYPDQDVLNILLTDSVLFISSIFNTQFSLNYELKKSFDFPVKRTTVFIHY 263 Query: 267 IGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKKHRYLKGF 326 +GPTKPWH+WA +Y +Q F+EA+ SPW+N LLK +SN LRY AKH + + +Y F Sbjct: 264 VGPTKPWHEWA-NYETAQPFLEARAVSPWRNVPLLKAKSSNHLRYCAKHNINQRKYFFAF 322 Query: 327 SNYLFYFIEKI 337 NY+ YF KI Sbjct: 323 KNYIAYFFSKI 333 >UniRef50_D0KD53 Lipopolysaccharide 3-alpha-galactosyltransferase n=3 Tax=Enterobacteriaceae RepID=D0KD53_PECWW Length = 336 Score = 310 bits (793), Expect = 7e-83, Method: Compositional matrix adjust. Identities = 155/328 (47%), Positives = 209/328 (63%), Gaps = 7/328 (2%) Query: 15 VIDYDHKVETENLC--LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDD 72 VI H C LDIA+GTD+ F++GC I+IASIL N L FH+FTD D D Sbjct: 8 VIKTVHSFSYSKKCAELDIAFGTDEKFIYGCAIAIASILLKNPDYCLSFHVFTDKLSDGD 67 Query: 73 RKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDA 132 + F +A QY T I IY+++ L++LP TK W++AIYFRF+IADYF KVLYLDA Sbjct: 68 KARFQEMAEQYNTTINIYIVDCSWLKTLPETKLWSYAIYFRFIIADYFYKILDKVLYLDA 127 Query: 133 DIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTA 192 DIIC G+++ LI + ++ VV +G ++WW+ RA ++ GYFNSG LLI Sbjct: 128 DIICNGSLQELIKLDL-SNHISAVVLDGDSNWWKNRAQKFQQPELSNGYFNSGVLLIEVN 186 Query: 193 QWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLK-- 250 W V+ ++ +L +PE+ K ITHPDQDVLN+LLA K + KYNTQFS+NY+LK Sbjct: 187 NWHQAAVTENSMRLLTDPEMKKIITHPDQDVLNVLLAGKSCHIESKYNTQFSINYELKYS 246 Query: 251 --ESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQ 308 ES P++N TIFIHYIGPTKPWH WA +Y ++ F++AK SPWKN +LL ++ Sbjct: 247 YGESAPTPISNKTIFIHYIGPTKPWHKWAANYACTKYFLKAKEHSPWKNESLLDAVTASN 306 Query: 309 LRYSAKHMLKKHRYLKGFSNYLFYFIEK 336 +RY AKH ++G ++L Y +K Sbjct: 307 MRYCAKHQFHNGEIIRGTLSFLKYLYKK 334 >UniRef50_D2TY85 UDP-D-galactose:(Glucosyl)lipopolysaccharide-alpha-1, 3-D-galactosyltransferase n=1 Tax=Arsenophonus nasoniae RepID=D2TY85_9ENTR Length = 343 Score = 285 bits (729), Expect = 2e-75, Method: Compositional matrix adjust. Identities = 145/309 (46%), Positives = 195/309 (63%), Gaps = 2/309 (0%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIY 90 IAYG DKNF G ISI S+L +N+ F+IFTD + D K FD L Y T+I I Sbjct: 36 IAYGADKNFSLGTAISICSMLYFNKIYTFHFYIFTDTISECDLKKFDELTSCYNTKITIL 95 Query: 91 LINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPD 150 LI+ +L+ LP+ K W+HAIYFRF+IA+YF NK K+LYLD+DIIC G I L + Sbjct: 96 LIDTLQLKKLPTNKLWSHAIYFRFIIANYFHNKTNKILYLDSDIICSGDISELFDIDLNQ 155 Query: 151 DKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEP 210 +A V Q W+KRA L IA GYFNSG +LI+T +W +++ + I +L + Sbjct: 156 HIIAAVADRDQY-LWKKRAEMLATPEIANGYFNSGVMLIDTDKWHKNKITEKTINILLDD 214 Query: 211 EIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIGPT 270 + K DQD LN+ L ++++F D K+NTQFS+NY+LK + P+ N+ FIHYIGPT Sbjct: 215 KTKAKFVFYDQDALNISLVNQVLFLDKKFNTQFSINYELKNKTLFPIINNVKFIHYIGPT 274 Query: 271 KPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKKHRYLKGFSNYL 330 KPW+ W+ +YP + FM K SPWK T L+ + SNQ RY+AKHM K +Y+ NYL Sbjct: 275 KPWNIWS-EYPSTHLFMTIKKNSPWKTTPLIAASTSNQYRYAAKHMFNKKKYIYWLLNYL 333 Query: 331 FYFIEKIKH 339 +YF+ K H Sbjct: 334 YYFVNKALH 342 >UniRef50_UPI000197C525 UDP-D-galactose:(glucosyl) lipopolysaccharide-alpha-1,3-D-galactosyltransferase n=1 Tax=Providencia rettgeri DSM 1131 RepID=UPI000197C525 Length = 339 Score = 248 bits (632), Expect = 3e-64, Method: Compositional matrix adjust. Identities = 122/310 (39%), Positives = 191/310 (61%), Gaps = 2/310 (0%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 ++AYG DKNFLFG G+SI S+L N+ FH+FTD+ D D + F ++ QYKT + Sbjct: 28 FNVAYGADKNFLFGTGVSIVSVLLNNKDINFHFHVFTDFLSDKDIQLFSQISKQYKTSVT 87 Query: 89 IYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSF 148 ++ +N D L+ LP+ + W+HAIYFR +IADYF K KVLYLD+D++C G+I+ L + + Sbjct: 88 LHTLNMDILKKLPTNQVWSHAIYFRLIIADYFYKKCDKVLYLDSDVVCTGSIQILKSLNL 147 Query: 149 PDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLN 208 +A V+ + E A+ V GI KGYFNSG +LIN +W +Q++ +++++ Sbjct: 148 SSMPIAAVMDISEPHSVE-MANLFNVEGIKKGYFNSGVMLINPDEWNYRQLTEKSMSVFT 206 Query: 209 EPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIG 268 + ++ I + DQD +N+ + + D +N + +LN + K N ++N +F+H+IG Sbjct: 207 DKKLQPVIKYYDQDAINIAVHGDWLKLDNIFNHRINLNDRYKHKKNNDISN-AVFVHFIG 265 Query: 269 PTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKKHRYLKGFSN 328 TKPWH+W+ Y + F+ AK SPWK+ L+ P N +Y++KH K +YL F + Sbjct: 266 STKPWHNWSKYYHEVRCFLNAKEKSPWKDIDLMTPQNITHHKYASKHFRYKEKYLSSFYH 325 Query: 329 YLFYFIEKIK 338 Y+ Y I KIK Sbjct: 326 YVLYTILKIK 335 >UniRef50_UPI00019F16C6 hypothetical protein CATC2_20202 n=1 Tax=Citrobacter youngae ATCC 29220 RepID=UPI00019F16C6 Length = 330 Score = 203 bits (517), Expect = 6e-51, Method: Compositional matrix adjust. Identities = 113/333 (33%), Positives = 182/333 (54%), Gaps = 19/333 (5%) Query: 12 LNSVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDD 71 + +++++ L+IA+G DKNF+FG IS+ S+L +N+ + FH+FTDY D Sbjct: 9 IKKILEFNQAPSEHKTQLNIAWGVDKNFMFGAAISMTSVLLHNKDLNIHFHLFTDYIDAD 68 Query: 72 DRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLD 131 ++ LA Q+ T I IY+++ + L+ LPS W+HA+YFRF+ +Y K +LY+D Sbjct: 69 YQQRVAKLAEQFATNISIYIMDANGLKVLPSGNAWSHAMYFRFIAFEYLGEKVDSLLYID 128 Query: 132 ADIICQGTIEPLINFSFPDDKVAMV--VTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLI 189 AD++C+G++ L + A++ V + A EK K YFNSG + Sbjct: 129 ADVMCKGSLYELTQIDLGEHVAAVITDVDDSPARDIEKN----------KDYFNSGVIFA 178 Query: 190 NTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQL 249 N +W Q A +L + K++ PDQDVLN+L K+IF + ++N + + +L Sbjct: 179 NLKKWKEQNFINSAFDILLDKN--NKLSFPDQDVLNILFLKKVIFLERRFNAIYGIKQEL 236 Query: 250 K----ESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNN 305 K + +T +TI IHYIG TKPW+ WA +YP +Q F+EA +SPW + LL Sbjct: 237 KSKDTSKYKEYITPETILIHYIGVTKPWNSWA-NYPSAQYFVEAWKSSPWADVPLLPART 295 Query: 306 SNQLRYSAKHMLKKHRYLKGFSNYLFYFIEKIK 338 Q + ++H + +Y +Y+ Y K+K Sbjct: 296 PKQYKKKSRHERLQGKYFASAISYIGYLWAKLK 328 >UniRef50_D2U322 Lipopolysaccharide 1,2-glucosyltransferase n=1 Tax=Arsenophonus nasoniae RepID=D2U322_9ENTR Length = 334 Score = 202 bits (515), Expect = 1e-50, Method: Compositional matrix adjust. Identities = 104/312 (33%), Positives = 165/312 (52%), Gaps = 6/312 (1%) Query: 19 DHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDA 78 + K+ N +IAYG DKNFL G ISI S+L N + FH+FTDY D + F Sbjct: 14 EKKLTENNKNFNIAYGVDKNFLLGAAISINSVLINNTDTDFNFHLFTDYIDDGYIQRFQT 73 Query: 79 LALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQG 138 + +Y + I IYL++ L+ L ++ W++A YFR + +Y +LYLDAD+IC+G Sbjct: 74 MIAKYNSNIIIYLLDAAELKQLSTSDFWSYATYFRLIAFEYLSTNIHAILYLDADVICKG 133 Query: 139 TIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQ 198 +++ + + D A+V+ + A L +A + YFN+G + +N +W Sbjct: 134 SLKEIFQLNLADSFAAVVLDVDSMQ--QSSATRLNLADLNGKYFNAGVIYVNLQKWIEND 191 Query: 199 VSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLK----ESFI 254 S +++ ++ K+ + DQD LN+L + I+ YN + L +L + Sbjct: 192 FSKKSLELVRGKTNFGKLKYLDQDALNILFQTQNIYLSRDYNCIYKLKNELAYHDLSKYK 251 Query: 255 NPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAK 314 N +T+ TI IHY G TKPWH W +YP SQ F + SPWK+ L +L+ K Sbjct: 252 NTITDSTILIHYTGVTKPWHTWGINYPASQFFFNSYIHSPWKDQPLKMAEKRTELQEKYK 311 Query: 315 HMLKKHRYLKGF 326 H+ +H+Y++GF Sbjct: 312 HLFLQHKYMQGF 323 >UniRef50_A8ARL6 Putative uncharacterized protein n=3 Tax=Citrobacter RepID=A8ARL6_CITK8 Length = 339 Score = 201 bits (512), Expect = 2e-50, Method: Compositional matrix adjust. Identities = 105/323 (32%), Positives = 181/323 (56%), Gaps = 9/323 (2%) Query: 19 DHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDA 78 D+ ++ L+IAYG D+NFLFG GIS+ S+L N + F++ TDY D+ + + Sbjct: 16 DNATHQKSKKLNIAYGVDRNFLFGSGISMTSVLVNNPDIDIHFYVVTDYVDDEYLESVER 75 Query: 79 LALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQG 138 L Y T + + + + + R LPSTK WT+A+Y+R+ +Y + VLYLDADI+C+ Sbjct: 76 LTQMYGTTVTVLVFDNEAFRKLPSTKAWTYAMYYRYFAFEYLSRELDSVLYLDADIVCKN 135 Query: 139 TIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQ 198 ++ L + F + A+V + K LG+ +A+ YFNSG + N W ++ Sbjct: 136 SLRELTDIHFAGEYAAVVNDIDRVRL--KSGQRLGIPELARDYFNSGVVFANLHVWREKK 193 Query: 199 VSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKES----FI 254 + ++A +L+E + K++ + DQD+LN+L +I +N + ++ +LK + Sbjct: 194 LLSKAFEVLHERQ--KELLYFDQDILNILFVGHVILLRRDFNCIYGVDQELKNKNEYRYQ 251 Query: 255 NPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAK 314 + +T T+ IHY+G TKPWH WA +YPVS+ F+EA S W +LL N + + ++ Sbjct: 252 DFITESTVLIHYVGVTKPWHTWA-NYPVSKYFIEAYKKSAWAEKSLLNANTAKLYKRKSR 310 Query: 315 HMLKKHRYLKGFSNYLFYFIEKI 337 H + +Y++ +++ Y K+ Sbjct: 311 HERIQRKYIRSIFSHIMYIKNKL 333 >UniRef50_B6XJW1 Putative uncharacterized protein n=1 Tax=Providencia alcalifaciens DSM 30120 RepID=B6XJW1_9ENTR Length = 325 Score = 201 bits (510), Expect = 4e-50, Method: Compositional matrix adjust. Identities = 108/307 (35%), Positives = 168/307 (54%), Gaps = 7/307 (2%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 L+IAYG DK FLFG G+S+ SI+ N +L FH+FTDY D+ + L L I Sbjct: 19 LNIAYGVDKGFLFGSGLSMNSIIINNSDIKLKFHLFTDYMNDEFLSKLEKLTLNENVNID 78 Query: 89 IYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSF 148 IY+IN D L+ LP + W++A YFRF I D+ +LYLDAD+ C+G++ I+ +F Sbjct: 79 IYIINADELKKLPISHVWSYATYFRFFIFDHLCETLSSILYLDADVFCKGSLRKYIDIAF 138 Query: 149 PDDKVAMV--VTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAM 206 + A++ V Q ++ L + I YFN+G + +N W + + +A + Sbjct: 139 NGEYAAVIPDVPNMQISCVDR----LSMPQIKDKYFNAGVIFLNLKVWDKNKFTKQAFNL 194 Query: 207 LNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLK-ESFINPVTNDTIFIH 265 + K + + DQD LN++ + I+ YN ++L +L+ E++ + +T++T IH Sbjct: 195 ITNNHTGKTLKYLDQDALNIIFNCQNIYLPRDYNCIYTLKNELEHENYKDYITSETKLIH 254 Query: 266 YIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKKHRYLKG 325 Y G TKPWH WA +YP SQ F A SPWKN L+ + + KH + ++L G Sbjct: 255 YTGATKPWHYWAVNYPASQTFKVAFETSPWKNDELVDAKKKPEYQERYKHEFNQKKFLTG 314 Query: 326 FSNYLFY 332 S+ + Y Sbjct: 315 ISSLIKY 321 >UniRef50_Q9ZIT6 UDP-glucose:(Galactosyl) LPS alpha1,2-glucosyltransferase WaaJ n=26 Tax=Enterobacteriaceae RepID=Q9ZIT6_ECOLX Length = 339 Score = 199 bits (505), Expect = 2e-49, Method: Compositional matrix adjust. Identities = 107/305 (35%), Positives = 172/305 (56%), Gaps = 7/305 (2%) Query: 25 ENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYK 84 E ++++G D+N+ G ISIASIL+ N+ ++ FHI DY + + LA +Y+ Sbjct: 26 ERETFNVSWGIDENYQVGAAISIASILENNKQNKFTFHIIADYLDKEYIELLSQLATKYQ 85 Query: 85 TRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLI 144 T IK+YLI+ + L++LP + W +IY+R + DYF + +LYLDADI+C+G++ LI Sbjct: 86 TVIKLYLIDSEPLKALPQSNIWPVSIYYRLLSFDYFSARLDSLLYLDADIVCKGSLNELI 145 Query: 145 NFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAI 204 F D+ A+V+ K A L YFNSG + IN +W Q+++ + Sbjct: 146 ALEFKDEYGAVVIDVDAMQ--SKSAERLCNEDFNGSYFNSGVMYINLREWLKQRLTEKFF 203 Query: 205 AMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKES----FINPVTND 260 +L++ IIKK+ +PDQD+LN++ KYN +++ + +E + + +D Sbjct: 204 DLLSDESIIKKLKYPDQDILNLMFLHHAKILPRKYNCIYTIKSEFEEKNSEYYTRFINDD 263 Query: 261 TIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKKH 320 T+FIHY G TKPWHDWA +Y + F N SPW+N K ++ + KH+L + Sbjct: 264 TVFIHYTGITKPWHDWA-NYASADYFRNIYNISPWRNIPYKKAVKKHEHKEKYKHLLYQK 322 Query: 321 RYLKG 325 ++L G Sbjct: 323 KFLDG 327 >UniRef50_Q9ZIS1 UDP-galactose:(Galactosyl) LPS alpha1,2-galactosyltransferase WaaW n=29 Tax=Enterobacteriaceae RepID=Q9ZIS1_ECOLX Length = 342 Score = 196 bits (499), Expect = 7e-49, Method: Compositional matrix adjust. Identities = 106/322 (32%), Positives = 172/322 (53%), Gaps = 7/322 (2%) Query: 21 KVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALA 80 ++ + L+IAYG D+NFLFG +S+ S++ +N + FH+FTDY +D + +A Sbjct: 16 EIANTDRVLNIAYGIDRNFLFGAAVSMQSVVMHNPDLAVKFHLFTDYIDEDYLQRVNAFT 75 Query: 81 LQ-YKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGT 139 + ++IY ++ + PS K W++A +FR V Y +LY+DAD+IC+G+ Sbjct: 76 SKNANVEVRIYKVSSAFIDIFPSLKQWSYATFFRLVAFQYLSETIENLLYIDADVICKGS 135 Query: 140 IEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQV 199 + L++ +F DK A V+ + EK A L + G+ YFN+G + + WA Sbjct: 136 LAGLLDINFDGDKFAAVIKDVPF-MQEKPAKRLAIEGLPGNYFNAGVVYLQLEAWAKNDF 194 Query: 200 SARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLK----ESFIN 255 +AIAML K DQD+LN+L IF Y+ + ++Y+LK E + Sbjct: 195 MNKAIAMLASDPQHTKYKCLDQDILNILFFGHCIFISGDYDCFYGIDYELKNKSDEDYKK 254 Query: 256 PVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKH 315 +T+DT IHY+G TKPW+DW +YP + F EA AS W + A + N Q + ++H Sbjct: 255 TITDDTKLIHYVGVTKPWNDWT-NYPCQKYFNEAYQASCWNDVAFIPATNEKQYQVKSRH 313 Query: 316 MLKKHRYLKGFSNYLFYFIEKI 337 + + F ++ Y+ +KI Sbjct: 314 LKRNGNIASSFYYFMLYYSKKI 335 >UniRef50_B2PV91 Putative uncharacterized protein n=2 Tax=Enterobacteriaceae RepID=B2PV91_PROST Length = 342 Score = 196 bits (499), Expect = 7e-49, Method: Compositional matrix adjust. Identities = 108/316 (34%), Positives = 171/316 (54%), Gaps = 12/316 (3%) Query: 28 CLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRI 87 CLD+ YG+D+N+ FG G+S S+L N + FH F D D + +A Q++ Sbjct: 24 CLDVIYGSDENYQFGAGVSAVSLLINNPTTFFRFHYFLDKVSPDFLEKLKVIASQFQVEF 83 Query: 88 KIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 +Y ++ L++LP++ W+ A+YFR V DY + LYLDAD++C G ++ N Sbjct: 84 HVYELDNKLLKTLPASDVWSSAMYFRLVALDYLSSDYDFALYLDADVMCNGILDLTTNLI 143 Query: 148 FPDDKVAMVVTE--GQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIA 205 DKV VV + G E R H+ +AK YFNSG + +N +W +Q++ + Sbjct: 144 --KDKVCGVVADDIGVRTKSETRLHA---PSLAKTYFNSGVMFVNLKKWHEKQITQQCFE 198 Query: 206 MLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKES----FINPVTNDT 261 +L+ ++ +PDQDVLN++L + L ++NT ++L +L +S + +T +T Sbjct: 199 LLSAENAKQRYKYPDQDVLNLILREDLELLSQRFNTVYTLKNELYDSTHQKYQQVITPET 258 Query: 262 IFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKKHR 321 + IHY G +KPWH WA +YP SQ F +A SPW L + + KH+LK+ Sbjct: 259 VLIHYTGVSKPWHTWA-NYPASQPFYKALMQSPWTTNDLKPATKFVERKKEYKHLLKQGN 317 Query: 322 YLKGFSNYLFYFIEKI 337 YL G + + Y EK+ Sbjct: 318 YLAGILSGIRYSFEKL 333 >UniRef50_D0KD54 Lipopolysaccharide glucosyltransferase I n=2 Tax=Pectobacterium RepID=D0KD54_PECWW Length = 336 Score = 190 bits (483), Expect = 5e-47, Method: Compositional matrix adjust. Identities = 125/339 (36%), Positives = 190/339 (56%), Gaps = 15/339 (4%) Query: 4 VFFQETEFLNSVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHI 63 VF + L SV + H+ ++ L++AYG DKN+ GCG+SI SIL N FH+ Sbjct: 2 VFSSHIDVL-SVFEKRHQSIADHDTLNVAYGIDKNYAVGCGVSITSIL-INNSIDFTFHV 59 Query: 64 FTDYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINK 123 F+D F DD K LA ++KT+I +Y IN + L++LP T W+HA+YFR + + +K Sbjct: 60 FSDDFDDDFIKKISILAEKFKTKIILYKINSEMLKTLPCTDIWSHAMYFRLLAFSHLSDK 119 Query: 124 APKVLYLDADIICQGTIEPLINFSFPDDKVAMV--VTEGQADWWEKRAHSLGVAGIAKGY 181 +LYLDAD++C+G++E L + A++ V E Q +K A L +A + Y Sbjct: 120 TSSLLYLDADVMCKGSLEQLHKLNTAPHVAAVIRDVPEMQ----KKSASRLKMAALEGEY 175 Query: 182 FNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNT 241 FNSG L N W ++ + L + E + I +PDQD++N+LL + F +YNT Sbjct: 176 FNSGVLFANLDIWNKLDLTQKIFDKLRDGE--ESIQYPDQDIMNILLNGNVTFLPKEYNT 233 Query: 242 QFSLNYQLKES----FINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKN 297 +S+ +LK+S + + +DTI IHY G TKPWH WA +YP + F A+ SPW Sbjct: 234 IYSIKNELKDSNHQKYKEVIKDDTILIHYTGVTKPWHKWA-NYPSTSYFQHAQENSPWST 292 Query: 298 TALLKPNNSNQLRYSAKHMLKKHRYLKGFSNYLFYFIEK 336 + L + +++ KH+LKK +YL G + Y + K Sbjct: 293 SDLKDADTFVEMKKKYKHLLKKGKYLSGLISAFKYSLNK 331 >UniRef50_D1P7H1 Lipopolysaccharide 1,2-glucosyltransferase n=1 Tax=Providencia rustigianii DSM 4541 RepID=D1P7H1_9ENTR Length = 324 Score = 189 bits (480), Expect = 1e-46, Method: Compositional matrix adjust. Identities = 113/332 (34%), Positives = 173/332 (52%), Gaps = 14/332 (4%) Query: 10 EFLNSVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFG 69 + + S+I + EN I YG D+ FL+G G SIAS++ N+ + FHIF D Sbjct: 5 DMIKSLIKINDNERHENSYFHIGYGVDEKFLYGVGTSIASVMLNNKDTDFHFHIFVDNLP 64 Query: 70 DDDRKYFDALALQYKTRIKIYLINGDRLRSLP-STKNWTHAIYFRFVIADYFINKAPKVL 128 D++ F +I IY I+ ++ + LP +K W+HAIYFR +I Y + +L Sbjct: 65 DEN--LFREAVQGTSHKITIYFIDNEKFKLLPLPSKAWSHAIYFRLLIISYLSSSIDSLL 122 Query: 129 YLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLL 188 YLDADIIC+G + L +F + V + EK+ + ++ YFNSGFL Sbjct: 123 YLDADIICKGDLSELKALTFDEKTFVYAVKDKFCS--EKQNLPIDMSK----YFNSGFLY 176 Query: 189 INTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLN-Y 247 ++ A + + R I ++ + + +HPDQD LN+LL DKLI YN FSL+ Y Sbjct: 177 MSLKHLAQENIPNRVIELVEKNDF----SHPDQDALNVLLNDKLINISENYNYMFSLDWY 232 Query: 248 QLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSN 307 + + + + +FIH++G TKP+H+WA Y + A+ SPWKN LLKP Sbjct: 233 ITSKGHLAKIPDSVVFIHFVGLTKPFHEWASFYEEYKYLESARKNSPWKNIPLLKPEGYK 292 Query: 308 QLRYSAKHMLKKHRYLKGFSNYLFYFIEKIKH 339 QL H+ K +Y++ + Y ++K H Sbjct: 293 QLSRKKSHLRKNGKYVEFIFTTIQYLMKKTFH 324 >UniRef50_Q9ZIS6 UDP-galactose:(Glucosyl) LPS alpha1,2-galactosyltransferase WaaT n=26 Tax=Enterobacteriaceae RepID=Q9ZIS6_ECOLX Length = 331 Score = 188 bits (478), Expect = 2e-46, Method: Compositional matrix adjust. Identities = 105/332 (31%), Positives = 172/332 (51%), Gaps = 8/332 (2%) Query: 10 EFLNSVIDYDHKVETENLC-LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYF 68 EF+ Y + EN L+++YG DKNFL+G G+SI+S+L N FH+FTDY Sbjct: 3 EFIKERFSYLADNKKENAPELNVSYGIDKNFLYGAGVSISSVLINNSDINFVFHVFTDYV 62 Query: 69 GDDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVL 128 DD K F+ A Q+ T I +YLI+ LP+++ W++A YFR + +Y +L Sbjct: 63 DDDYLKSFNETAKQFNTSIIVYLIDPKYFADLPTSQFWSYATYFRVLSFEYLSESISTLL 122 Query: 129 YLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLL 188 YLDAD++C+G+++PL F D+ A++ A L + + YFN+G + Sbjct: 123 YLDADVVCKGSLKPLTEIIFKDEFAAVIPDNDSTQ--AACAKRLNIPEMNGRYFNAGVIY 180 Query: 189 INTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQ 248 +N +W ++ + +L + + DQD LN+ I+ ++T ++L + Sbjct: 181 VNLKKWHEANLTPYLLKLLRGETKYGSLKYLDQDALNIAFNMNNIYLAKDFDTIYTLKNE 240 Query: 249 L----KESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPN 304 L + +T+ T+ IHY G TKPWH WA YP + F A+ SPWK L + Sbjct: 241 LYDRSHRKYQQTITDKTVLIHYTGITKPWHSWA-GYPSASYFNIAREQSPWKKYPLKEAR 299 Query: 305 NSNQLRYSAKHMLKKHRYLKGFSNYLFYFIEK 336 +++ KH+ Y+KG ++ + Y ++K Sbjct: 300 TVAEMQKQYKHLFAHGEYIKGITSLIKYKLKK 331 >UniRef50_P27129 Lipopolysaccharide 1,2-glucosyltransferase n=44 Tax=Enterobacteriaceae RepID=RFAJ_ECOLI Length = 338 Score = 183 bits (465), Expect = 6e-45, Method: Compositional matrix adjust. Identities = 107/313 (34%), Positives = 171/313 (54%), Gaps = 9/313 (2%) Query: 28 CLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRI 87 CL++AYG D N+L G G+SI SI+ N L F+I D + D + LA Q + RI Sbjct: 27 CLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRI 86 Query: 88 KIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 +Y IN D+L+ LP T+ W+ A+YFR ++LYLDAD++C+G I L++ Sbjct: 87 TLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLG 146 Query: 148 FPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAML 207 + VA VV + + EK L + YFNSG + ++ +WA +++ +A+++L Sbjct: 147 L-NGAVAAVVKDVEP-MQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSIL 204 Query: 208 NEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLK----ESFINPVTNDTIF 263 + + K +PDQDV+N+LL +F +YNT +++ +LK +++ +T T+ Sbjct: 205 MSKDNVYK--YPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLL 262 Query: 264 IHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKKHRYL 323 IHY G TKPWH WA YP + + A SPWK+ + + + + KH+L +H Y+ Sbjct: 263 IHYTGATKPWHKWAI-YPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHLLVQHHYI 321 Query: 324 KGFSNYLFYFIEK 336 G + Y K Sbjct: 322 SGIIAGVCYLCRK 334 >UniRef50_C1DGU7 Lipopolysaccharide 1,3-galactosyltransferase n=1 Tax=Azotobacter vinelandii DJ RepID=C1DGU7_AZOVD Length = 326 Score = 154 bits (390), Expect = 4e-36, Method: Compositional matrix adjust. Identities = 96/308 (31%), Positives = 157/308 (50%), Gaps = 15/308 (4%) Query: 28 CLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRI 87 L IA+G D+N+L GI+I SI++ N G L FH+F R D L + + Sbjct: 11 VLHIAFGVDENYLRPMGITIVSIIENNPGLELVFHVFISSISSASRVRLDRLERMFARPV 70 Query: 88 KIYLINGD-RLRSLPSTKNWTH---AIYFRFVIADYFINKAPKVLYLDADIICQGTIEPL 143 ++L++ ++ S K H A Y R +I + + +VLYLDADI+C G I L Sbjct: 71 NLHLVDEMLDVKDPASGKGQAHISKAAYIRLLIPEALRDFTDRVLYLDADILCVGDISGL 130 Query: 144 INFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARA 203 ++ D + A V+ + A+ KRA + YFNSG L I+ +W + V++RA Sbjct: 131 LHLDI-DGRTAAVIRDAGAE--SKRAGLVKKGQTLDNYFNSGVLYIDIPRWIERAVTSRA 187 Query: 204 IAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFIN-PVTNDTI 262 + + +P + + + DQD LN++L + F D +N Q+ L +LK+ + V +DT Sbjct: 188 LEKIADP--VLDLRYSDQDALNLVLDGDVRFIDKGWNHQYGLTGKLKKGRVGMDVPSDTK 245 Query: 263 FIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQL----RYSAKHMLK 318 F+H+IGP KPW W + + F+ + SPW AL + ++ R+ + M + Sbjct: 246 FVHFIGPMKPWRSWN-PHQSKELFLRYQALSPWAGEALDDNFSPREIYVYSRFMYRSMFQ 304 Query: 319 KHRYLKGF 326 + R+L G Sbjct: 305 QGRWLSGL 312 >UniRef50_Q46Y64 Glycosyl transferase, family 8 n=1 Tax=Ralstonia eutropha JMP134 RepID=Q46Y64_RALEJ Length = 331 Score = 128 bits (321), Expect = 4e-28, Method: Compositional matrix adjust. Identities = 81/313 (25%), Positives = 151/313 (48%), Gaps = 12/313 (3%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIY 90 IA+ D N+ G +IASI+ N G FH+ T +++++ L Y +++ Sbjct: 25 IAFCVDDNYFRAMGATIASIIDNNPGQHFTFHVLTFSALEENQRRLKQLEEMYPVSTQLH 84 Query: 91 LINGDRLRSLPSTKNWTH---AIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 L++ +H +I+ R VI + + +VLYLDADI+C ++ L++ Sbjct: 85 LLDLASFTQFSHFLGHSHYSLSIFTRLVIPEVLQGQTDRVLYLDADILCVNRLDELVDMD 144 Query: 148 FPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAML 207 +++A+VV + +R +LG+A YFN G L IN +W A+ ++ + + L Sbjct: 145 I-SNEIAVVVPDAPVT-LRRRVAALGLAHAE--YFNGGVLFINIDKWLAENITPQTLEAL 200 Query: 208 NEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLK-ESFINPVTNDTIFIHY 266 + + DQD LN +L + + ++N + L + L F +FIH+ Sbjct: 201 LDTSTDMRFN--DQDALNKVLNGRAKYISPRWNYLYDLIHDLNVNRFAMRPVGKAVFIHF 258 Query: 267 IGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTAL-LKPNNSNQLRYSAKHMLKKHRYLKG 325 G KPW DW+ + F + SPW++ L +P N+ ++R ++ M ++H+ ++ Sbjct: 259 AGSVKPWADWS-GHEARGLFRKYLALSPWRDMPLDPEPRNTKEMRMHSRFMFRQHKPVES 317 Query: 326 FSNYLFYFIEKIK 338 YL Y ++ + Sbjct: 318 LKWYLRYLRKRAQ 330 >UniRef50_C3X7M2 Lipopolysaccharide 3-alpha-galactosyltransferase n=1 Tax=Oxalobacter formigenes OXCC13 RepID=C3X7M2_OXAFO Length = 307 Score = 121 bits (304), Expect = 3e-26, Method: Compositional matrix adjust. Identities = 82/300 (27%), Positives = 139/300 (46%), Gaps = 13/300 (4%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIY 90 IA+G D + ++IASIL+ N+ S + FH+ + D L + I + Sbjct: 7 IAFGVDTIYAPKMCVTIASILENNKNSNIIFHVIYNDLSDKVIDEIKKSMLTLQAEINFH 66 Query: 91 LINGDRLRSLPSTKNWTH---AIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 I+ D L P N++H + RF I + + LYLDADIIC I L + Sbjct: 67 FIDVD-LSIFPKFSNFSHITSGAFLRFFIPELLQGLTDRALYLDADIICINNISDLFHLE 125 Query: 148 FPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAML 207 ++++ VV + ++ + A K YFNSG L+++ +W V + +++L Sbjct: 126 MDENEILAVVEDIDSETYLNEN-----ASFQKRYFNSGVLMMDIEKWNKNNVYGQLLSVL 180 Query: 208 NEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYI 267 NE DQD LN+++ DK+ + D +N + K+ V + FIH++ Sbjct: 181 NEKG--SGFNLIDQDALNLVMIDKVHYLDNIWNYMINAEQLDKKKEKYSVPENAKFIHFV 238 Query: 268 GPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKKHRYLKGFS 327 GP KPWH + ++ ++ + + W L P N ++R A++ KK YL G + Sbjct: 239 GPVKPWHCYNIFDDITGLYLNYQKKTVW--DGLEMPKNYKEMRRYARYSFKKGNYLTGLN 296 >UniRef50_Q116W1 Glycosyl transferase, family 8 n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q116W1_TRIEI Length = 278 Score = 106 bits (264), Expect = 1e-21, Method: Compositional matrix adjust. Identities = 68/253 (26%), Positives = 125/253 (49%), Gaps = 18/253 (7%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 +++ + D+N+ G++I S+L N S HI T++ + ++ D L+ YK Sbjct: 2 MNLLFCFDQNYQQHFGVAITSVLLNNLSSHFDVHIITNFMEEKLKQKLDTLSKNYKCSFH 61 Query: 89 IYLING-DRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 +Y+IN D++ L + + ++A Y+R ++A+ KVLYLD+D++ +E L N Sbjct: 62 LYIINNLDKISKLKVSDHVSNATYYRLIMAEILPKHIDKVLYLDSDVVVISPLEELYNID 121 Query: 148 FPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAML 207 + + G + G +KG FNSG +++N +W +Q+S + I Sbjct: 122 LEN---YFIAASGFS----------GTLVKSKG-FNSGVMVVNLEKWRNEQISTKVIDFA 167 Query: 208 NEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLN-YQLKESFINPVTNDTIFIHY 266 + K+ + DQ LN ++ + D K+N Q L+ ++++ N + IHY Sbjct: 168 TKNR--DKLPYHDQSALNRVIKQNYLIIDRKWNFQVDLSPRKIQKPDDNIALKNARIIHY 225 Query: 267 IGPTKPWHDWAWD 279 IG +KPW+ W D Sbjct: 226 IGSSKPWYFWISD 238 >UniRef50_UPI0001B4A0A4 putative glucosyltransferase n=1 Tax=Bacteroides sp. 2_1_7 RepID=UPI0001B4A0A4 Length = 301 Score = 105 bits (261), Expect = 3e-21, Method: Compositional matrix adjust. Identities = 69/254 (27%), Positives = 123/254 (48%), Gaps = 18/254 (7%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 +DI TD N++ CG+ + SI N + HI T+ ++++ + +Y +I+ Sbjct: 1 MDIVCCTDNNYVIPCGVLVTSICVNNPKEEITVHILTEGISPENQEVLKKVVAKYGQQIQ 60 Query: 89 IYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSF 148 Y ++ + P +++ T A YFR ++ D KVLYLD D++ + ++ L + Sbjct: 61 FYTVDKKVFANCPISRHITLATYFRLIMTDILPKSVEKVLYLDCDVVVRHSLRSLWDTDI 120 Query: 149 PDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLN 208 A V+ + D + R ++ + GYFN+G LL+N W +S ++N Sbjct: 121 -KSYAAGVIPDMSID--DIRIYNRLQYSPSLGYFNAGVLLVNLRYWRENNLSESFFEIIN 177 Query: 209 E-PEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFIN--------PVTN 259 + PE ++ + DQDVLN++L + + +KYN Q Y K+ I+ Sbjct: 178 KYPE---RLRYHDQDVLNIVLKEIKLTLPMKYNVQHG--YFFKDPLISRTYRDEREQAIT 232 Query: 260 DTIFIHYIGPTKPW 273 D + +HY G +KPW Sbjct: 233 DPVILHYSG-SKPW 245 >UniRef50_C7QL87 Glycosyl transferase family 8 n=2 Tax=Cyanothece RepID=C7QL87_CYAP0 Length = 283 Score = 105 bits (261), Expect = 3e-21, Method: Compositional matrix adjust. Identities = 79/292 (27%), Positives = 128/292 (43%), Gaps = 15/292 (5%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 +DI + DKN+ G++I S++ N H+ T D K D L + + + Sbjct: 1 MDILFCFDKNYEQHFGVAITSLILNNTNKIKTIHLVTKDNSKDFLKKIDKLKSKTQAKFF 60 Query: 89 IYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSF 148 IY + L ++ + + + A Y+R + + K+LYLD+D++ ++E L N Sbjct: 61 IYSPDDKDLSNVKVSAHISTAAYYRLLAPELLPQDLKKILYLDSDLVVNSSLENLYNMDI 120 Query: 149 PDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLN 208 DD +A +KR G YFNSG +LIN W + + + L Sbjct: 121 SDDILAAYAGGKMGPGTKKRLQLTG-----DFYFNSGVMLINLEAWRTENIGNKCFKFLQ 175 Query: 209 E-PEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYI 267 E P++I+ DQD LN ++ K + D +N+ L + VTN +I IH+ Sbjct: 176 ENPDMIRLW---DQDALNKIVDGKFLNIDGIWNSLVDLT-----TGETRVTNQSIIIHFT 227 Query: 268 GPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKK 319 G KPW W P Q + SPW N P N ++ + K + K+ Sbjct: 228 GTLKPWQSWCI-RPEKQIYWYYLRQSPWSNAYPQFPKNFQEMLLAIKSVYKQ 278 >UniRef50_Q5LF36 Putative glucosyltransferase n=1 Tax=Bacteroides fragilis NCTC 9343 RepID=Q5LF36_BACFN Length = 308 Score = 100 bits (249), Expect = 7e-20, Method: Compositional matrix adjust. Identities = 75/286 (26%), Positives = 126/286 (44%), Gaps = 16/286 (5%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 +DI + D +++ CG++I S+ N + FHI T +R+ + +Y+ +I Sbjct: 1 MDIVHCIDNSYVAQCGVTITSVCVNNVNEVILFHILTTNLSIFNREMLKKIVDKYRQKII 60 Query: 89 IYLINGDRLRSLPSTK--NWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINF 146 Y ++ L P + + + A YFR ++ D KVLYLD D++ I+ L + Sbjct: 61 FYNVDEYLLNKCPLREGDHVSLATYFRILMPDILPKSLNKVLYLDCDLVVCKNIKRLWDT 120 Query: 147 SFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAM 206 + V G D R ++ I +GYFN+G LL+N A W +S + + Sbjct: 121 DISTHSLGAVYDGGTDDI---RTYNRLKYDIRQGYFNAGVLLVNLAYWREFHISNKLLKF 177 Query: 207 LNE-PEIIKKITHPDQDVLNMLLADKLIFADIKYNTQ---FSLNYQLKESF---INPVTN 259 + + PE ++ DQD LN +L KYN ++ L+E + I Sbjct: 178 IEQYPE---RLMFWDQDALNSVLIQTTKILPFKYNMLDAFYTKELALREEYLFEIEGALC 234 Query: 260 DTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNN 305 D +H+ P KPW D+P+ F E + W + + P N Sbjct: 235 DPTILHFSSPNKPWLK-TCDHPLKSFFFEYLKRTSWNDKFPIYPFN 279 >UniRef50_UPI0001A457E5 hypothetical protein NsubN_08151 n=1 Tax=Neisseria subflava NJ9703 RepID=UPI0001A457E5 Length = 345 Score = 98.2 bits (243), Expect = 3e-19, Method: Compositional matrix adjust. Identities = 82/319 (25%), Positives = 144/319 (45%), Gaps = 23/319 (7%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKT-RIKI 89 I Y D+N++ G ++ S+L+ N S + FH+ D FD + + I + Sbjct: 26 IVYAADQNYIKHIGTALLSVLQ-NNTSPIHFHLLVSGSEGYDFNIFDQIETSNQNYAISV 84 Query: 90 YLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFP 149 Y +N + +L +T +T A+Y+R I LYLD D++C G I+ L Sbjct: 85 YHLNTEYFSTLQTTHYFTIAMYYRMSIPCLLKGITHTALYLDTDVLCLGNIDDLFEIDIS 144 Query: 150 DDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQW---AAQQVSARAIAM 206 + +A V + K+ + G + YFNSG +L N +W A ++ + + Sbjct: 145 NSLIAAVPDAILYRAYIKQLNQFGFTD-TEPYFNSGVILFNIDKWNDMAIDKILSEKMQA 203 Query: 207 LNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHY 266 + + K++ PDQD+LN+ + + +N +++ K S + N+ +H+ Sbjct: 204 VEKQNF--KLSCPDQDILNLACIGHVHWLSENFNW---IHWHQKYSELIDNPNNIRLVHF 258 Query: 267 IGPTKPWHDWAWDYPVSQAFMEAKNASPWKN--------TALLKPNNSNQLRYSAKHMLK 318 +G KPWH + Q F KN SPW N T L PN + R +AK + K Sbjct: 259 VGHIKPWHQLGFHPAYDQYF---KN-SPWNNGYLEQPLSTWLPFPNPKRKFRQAAKRLWK 314 Query: 319 KHRYLKGFSNYLFYFIEKI 337 + + + ++ Y Y + +I Sbjct: 315 QGQKKQAWAYYREYLLRRI 333 >UniRef50_D1PTN4 Putative uncharacterized protein n=1 Tax=Prevotella bergensis DSM 17361 RepID=D1PTN4_9BACT Length = 305 Score = 98.2 bits (243), Expect = 4e-19, Method: Compositional matrix adjust. Identities = 73/291 (25%), Positives = 127/291 (43%), Gaps = 29/291 (9%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 +DI + D N+L C ++ SIL N+ ++ FH+ ++ ++ R + +A Y ++ Sbjct: 1 MDIVFNIDDNYLMQCCTTMVSILHNNKDGQISFHVISNGLTNESRLKIEQVAEAYHQQVF 60 Query: 89 IYLINGDRLRSLPSTKNWTH---AIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLIN 145 Y++N + + H A Y R +AD + K++Y+D D+I G+++ L N Sbjct: 61 FYVVNPEAMSDYEIFDKQGHISMATYLRLFVADILPERLHKIIYMDCDLIVNGSLDGLWN 120 Query: 146 FSFPDDKVAMVVTEGQADWWEKRAHSLGVAG--IAKGYFNSGFLLINTAQWAAQQVSARA 203 +A V D W +A + G A YFN+G L++N W VS +A Sbjct: 121 TDVEGYALAAV-----EDMWSGKADNYVRLGYDAADTYFNAGVLVVNLDYWREHNVSQQA 175 Query: 204 IAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNY-----------QLKES 252 + ++ DQDVLN L D + ++N Q L +L + Sbjct: 176 AQYVALHA--GQLKFNDQDVLNGLFHDSKLLLPFRWNVQDGLLRKRRKIRPEVMPKLDQE 233 Query: 253 FINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKP 303 NPV IH+ G KPW +++ P F + + + W+ + P Sbjct: 234 LENPV-----IIHFTGHRKPW-NFSCLNPYKNLFFKYVDMTEWRGFRPIVP 278 >UniRef50_C0WCJ1 Lipopolysaccharide 1,2-glucosyltransferase n=1 Tax=Acidaminococcus sp. D21 RepID=C0WCJ1_9FIRM Length = 338 Score = 97.1 bits (240), Expect = 8e-19, Method: Compositional matrix adjust. Identities = 86/333 (25%), Positives = 146/333 (43%), Gaps = 24/333 (7%) Query: 5 FFQETEFLNSVIDYDHKVE-TENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHI 63 +F FL V + E T+ L +AY + + G S+ S+L+ N + FHI Sbjct: 8 YFVPARFLKGVETFSKNAEKTDKAPLHVAYNVNDGYFQIMGASLVSVLENNAHRAVMFHI 67 Query: 64 FTDYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSL-PSTKNWTHAIYFRFVIADYFIN 122 FTD + ++ + + LA +Y IK+Y ++ + + ++ Y R V+ Sbjct: 68 FTDGYSKENAQKMEQLADRYGCVIKLYTLHMEPFADFHVKVERFSRITYGRIVMPLILAA 127 Query: 123 KAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYF 182 + LYLDAD + ++ L ++ K V+E D ++R L + YF Sbjct: 128 ETDHFLYLDADTMVIRPLDELYHWDL-TGKAMGAVSERMPD-AKRRGDYLHLNN--GRYF 183 Query: 183 NSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQ 242 N G +++N +W Q ++ +A ++ EP+ ++ QD+LN++ F YN Sbjct: 184 NDGVMMVNIPEWQKQNITEKAFSLQKEPK--ERFLGQSQDILNIVFDGTNAFLPSIYN-- 239 Query: 243 FSLNYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTA--- 299 + +P TI IH+ G KPW DY A ASPW+ Sbjct: 240 -----EFGGGEDDPQQKGTI-IHWTGRRKPWQMVLSDYDAQWRSYNA--ASPWETLTAIL 291 Query: 300 -LLKPNNSNQLRYSAKHMLKK--HRYLKGFSNY 329 +LKP N + + AK+ K+ Y+KG + Y Sbjct: 292 PILKPENYHDFKEWAKYRRKESFRDYVKGMAYY 324 >UniRef50_A1XRC1 Glycosyltransferase n=1 Tax=Haemophilus ducreyi RepID=A1XRC1_HAEDU Length = 267 Score = 95.1 bits (235), Expect = 3e-18, Method: Compositional matrix adjust. Identities = 67/251 (26%), Positives = 128/251 (50%), Gaps = 15/251 (5%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 ++I + +D+N+ + + SIL +N + F+I ++ + + +L ++ + I+ Sbjct: 1 MNIVFSSDENYAPHLSVCLYSILSHNYN--INFYILDLGIKEESKSFIKSLVEKFNSNIE 58 Query: 89 IYLINGDRLRSLPSTKNW-THAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 I+ D + P ++ + A Y R + DY + + KVLYLD D I G++ L + Sbjct: 59 FIKISVDSFSNFPIYIDYISLATYARLKLTDY-LPQLEKVLYLDIDTIVNGSLIDLWDLD 117 Query: 148 FPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKG-YFNSGFLLINTAQWAAQQVSARAIAM 206 + +A V AD + + + + G+ K YFN+G LLI+ +W + +++ + Sbjct: 118 LNEYYIAAV-----ADPFIESLNYKTILGLDKNIYFNAGVLLIDCIKWKQYNIFDKSVKI 172 Query: 207 LNEPEIIKKITHPDQDVLNMLLADKLIFADIKYN---TQFSLNYQLKESFINPVTNDTIF 263 + + + KK+ + DQD+LN++L DK++ D +YN +Q + K +T + Sbjct: 173 IKD--LSKKLQYQDQDILNLILKDKVLLLDCRYNFMPSQLDFIKRDKVRKGIKITTPIVI 230 Query: 264 IHYIGPTKPWH 274 HY GP KPWH Sbjct: 231 YHYCGPKKPWH 241 >UniRef50_D2ELM0 Lipopolysaccharide biosynthesis glycosyltransferase n=1 Tax=Pediococcus acidilactici 7_4 RepID=D2ELM0_PEDAC Length = 552 Score = 94.0 bits (232), Expect = 7e-18, Method: Compositional matrix adjust. Identities = 67/259 (25%), Positives = 119/259 (45%), Gaps = 19/259 (7%) Query: 29 LDIAYGTDKNFLFGCGISIASILK-YNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRI 87 ++I D+N+ I+I + L+ N +R F + T+ GD R D L + T I Sbjct: 4 INILLAADRNYADQLCITIKTALETLNSATRAHFIVLTNNLGDQTRALLDKLMHNFHT-I 62 Query: 88 KIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKA-PKVLYLDADIICQGTIEPLINF 146 + ++ +R P+ ++ YFR + ++ +++YLD D++ + + L Sbjct: 63 EYLNLDDERFDFCPTNQHINKTAYFRIIAPKLLASRQIDRLIYLDVDVLIRKDLTELAES 122 Query: 147 SFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKG---YFNSGFLLINTAQWAAQQVSARA 203 + + V V+ GQA H LGV + YFNSG ++I+ AQW A +++ + Sbjct: 123 NLNQNTVGAVIDTGQA----FALHRLGVDPVVAASNLYFNSGIMVIDVAQWNAHRITEKT 178 Query: 204 IAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNY-------QLKESFINP 256 +A + +I DQD LN +LA ++ F K+N Q S+ + Q I+ Sbjct: 179 LAFIRNHA--DRIIFHDQDALNAVLAGEVQFLHPKWNLQNSIIFRKHRPINQGYAELIDE 236 Query: 257 VTNDTIFIHYIGPTKPWHD 275 + +H+ KPW D Sbjct: 237 AIKEPSIVHFTTHEKPWKD 255 Score = 84.7 bits (208), Expect = 4e-15, Method: Compositional matrix adjust. Identities = 66/280 (23%), Positives = 124/280 (44%), Gaps = 24/280 (8%) Query: 21 KVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALA 80 ++E +++ + F S SIL+ + + F + D+ D D ++ Sbjct: 271 ELEMHRGVINVISAANSAFTQALATSYVSILENDPDHQYNFFLLPDHLTDRDMMLLGSII 330 Query: 81 LQY-KTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGT 139 +Y IK+ +N + L + + Y+R ++A + + +YLD DII + Sbjct: 331 ARYDNATIKVVEVNEELLANAVESDRIVKTAYYR-ILAPALLPSINRAIYLDCDIIANTS 389 Query: 140 IEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQV 199 + L + + +A V G D EK +G+ + YFNSG +LI+ +W A+ Sbjct: 390 LHELWQTNLEGNVIAAVEDAGFHDRLEK----MGITKENEKYFNSGMMLIDLVRWRARST 445 Query: 200 SARAIAMLNE-PEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVT 258 + + + +N+ PE K+ DQD LN L D + ++N Q ++ + E+ P T Sbjct: 446 TQKVLDYINQNPE---KLRFHDQDALNANLYDDWLHLHPQWNAQSNI---IMETIFPPRT 499 Query: 259 ----------NDTIFIHYIGPTKPWHDWAWDYPVSQAFME 288 D IH+ G KPWH+ ++P + +++ Sbjct: 500 ELLEPYAETREDPKLIHFCGHVKPWHE-GCEHPYADVYLK 538 >UniRef50_C3PWZ8 Lipopolysaccharide 1,2-glucosyltransferase n=1 Tax=Bacteroides sp. 9_1_42FAA RepID=C3PWZ8_9BACE Length = 315 Score = 94.0 bits (232), Expect = 7e-18, Method: Compositional matrix adjust. Identities = 81/322 (25%), Positives = 147/322 (45%), Gaps = 27/322 (8%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 + IA D F+ C ++I SIL+ N+ + HI + + +D +A +Y T I Sbjct: 1 MHIALTIDSKFVRYCAVTIVSILENNDPKDIMLHIVSGHLPKEDVLTLSQVAEKYGTSIA 60 Query: 89 IYLINGDRLRSLP---STKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLIN 145 Y I ++L++ + + +++R V+A + +V+YLD+D + G+++ L + Sbjct: 61 FYYIPHEKLQNYEVKWQKQRLSMVVFYRCVLASILPSTISRVIYLDSDTLVLGSLKELWD 120 Query: 146 FSFPDDKVAMV--VTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARA 203 + +A V ++E+ + + Y N G LL+N A W + + Sbjct: 121 TNLNQLALAGVQDTVSPNPSYFERLQY-----APSYNYINGGVLLLNLAYWRKHNIEQQC 175 Query: 204 IAMLNE-PEIIKKITHPDQDVLNMLLADKLIFADIKYNTQ--FSLNYQLKESFINPVTND 260 I + P+ +I DQD+LN LL D+ + DIK+N Q F N + P D Sbjct: 176 IKYYQQYPD---RIILNDQDILNALLYDQKVLIDIKWNVQDDFYRNNRYTSPAWKPSYTD 232 Query: 261 TIF----IHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHM 316 I +HY G KPW A +P+ F + +P+ ++A K ++ R+ H+ Sbjct: 233 AILHPIILHYSG-RKPWAYHAM-HPLRHLFFHYQRLTPYDDSAKQKKISTRIYRFI--HL 288 Query: 317 LKKHRYLKGFSNYLFYFIEKIK 338 L Y+ G + ++KI+ Sbjct: 289 LP---YILGLKPKKYVNLKKIR 307 >UniRef50_A8ARL4 Putative uncharacterized protein n=2 Tax=Citrobacter RepID=A8ARL4_CITK8 Length = 314 Score = 92.8 bits (229), Expect = 2e-17, Method: Compositional matrix adjust. Identities = 87/322 (27%), Positives = 139/322 (43%), Gaps = 27/322 (8%) Query: 28 CLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRI 87 ++IAY TD N+L +SI S++ N L F +F D+D +A T Sbjct: 7 VINIAYCTDANYLEYVAVSIMSVIMNNPEQSLAFFVFVYDVSDED------IAKLQSTSN 60 Query: 88 KIYLINGDRL-----RSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEP 142 KI +I D+ + + K+ + Y R + +K + +YLDAD +C ++ Sbjct: 61 KIQVITIDKADIEKYNNDFAIKHLNRSTYMRLAVPRLLKDKVARFIYLDADTLCFDSLSE 120 Query: 143 LINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSAR 202 IN D+ V V + K A LG++ YFN+GFL IN A W + + Sbjct: 121 -INSVDIDNVVCAVSHDSLNIHDNKHARRLGLS--IDHYFNAGFLYINVANWIKHDIEHK 177 Query: 203 AIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFS-LNYQLKESFINPVTNDT 261 A +L E K + + DQD LN+ + + F D ++N F+ + KE+F Sbjct: 178 ANTVLFEQG--KSLPYFDQDALNIAMNGNITFIDNRWNFLFNWFTDEQKENFFYHSDTLP 235 Query: 262 IFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTAL------LKPNNSNQLRYSAKH 315 IH+ G KPW+ Q ++ + +PW+N L ++P + R ++ Sbjct: 236 RIIHFTGGRKPWYKEHTGLS-QQLYVFYHHFTPWRNAELRSYAPRMRPTD---YRVYSRQ 291 Query: 316 MLKKHRYLKGFSNYLFYFIEKI 337 KK Y Y Y KI Sbjct: 292 AAKKGNYFTAIKWYAKYLKTKI 313 >UniRef50_A0KQP2 Glycosyl transferase, family 8 n=2 Tax=Aeromonas RepID=A0KQP2_AERHH Length = 366 Score = 91.7 bits (226), Expect = 4e-17, Method: Compositional matrix adjust. Identities = 59/191 (30%), Positives = 104/191 (54%), Gaps = 18/191 (9%) Query: 111 YFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAH 170 Y+RF I + + KVL++D+D+I G I PL + D VA+V +K+ Sbjct: 89 YYRFAIP-HILKSIDKVLFIDSDMIALGDISPLWSIDMGDAIVAVVSDHILGCDKKKQL- 146 Query: 171 SLGVAGIAKG-YFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLA 229 + GI+ G YFN+GF+L+N +W A+ +S +A+ +L E H DQD LN++L Sbjct: 147 ---MRGISSGKYFNAGFMLMNLDKWRAKNISEQALRLLIEN---NGFEHNDQDALNIVLE 200 Query: 230 DKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEA 289 +K ++ D K+N Q N+ + +F+ I +H+ G KPWH ++ ++P +++ + Sbjct: 201 NKTVYIDNKWNAQ--PNHLAQNNFL------PILVHFCGQEKPWHIYS-NHPFKGSYLVS 251 Query: 290 KNASPWKNTAL 300 + + + N L Sbjct: 252 RRETDYANEPL 262 >UniRef50_Q64ZV2 Lipopolysaccharide 1,2-glucosyltransferase n=8 Tax=Bacteroides RepID=Q64ZV2_BACFR Length = 311 Score = 87.0 bits (214), Expect = 8e-16, Method: Compositional matrix adjust. Identities = 70/284 (24%), Positives = 115/284 (40%), Gaps = 26/284 (9%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 + IA D NF C +++ S+ N S C HI + D+K ++A Y +I Sbjct: 2 IHIACNIDSNFTIHCAVTLTSLFANNRNSEFCVHIIASTLPEADQKALSSIAESYGNKIC 61 Query: 89 IYLINGDRLRSLPSTKNWTH---AIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLIN 145 Y D L + K+ A Y+R +++ K+LY+D DI+ I + Sbjct: 62 FYFPEKDLLNNFSIKKSGNRISIATYYRCLLSRILPVNIDKILYIDCDIVVLNDISEFWD 121 Query: 146 FSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIA 205 + + G E+ +S YFN+G LLIN W ++ Sbjct: 122 TDITQYAIGCIEDIGSD---EEEYYSRLQYDKKYSYFNAGVLLINLKYWREHKIDEMCEQ 178 Query: 206 MLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQ---FSLNYQ--------LKESFI 254 +I DQD+LN LL +F ++N Q + Y LKE+ + Sbjct: 179 YFLAHS--DRIRFNDQDLLNALLYKDKLFVPFRWNVQDTFYRRTYSHKVKEHSGLKEALL 236 Query: 255 NPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNT 298 +P +HY KPW ++ +P+ Q + + + +PWK T Sbjct: 237 HPA-----ILHYTNK-KPW-NYDSMHPLKQEYFKYLDMTPWKGT 273 >UniRef50_B7AIV0 Putative uncharacterized protein n=1 Tax=Bacteroides eggerthii DSM 20697 RepID=B7AIV0_9BACE Length = 321 Score = 85.5 bits (210), Expect = 2e-15, Method: Compositional matrix adjust. Identities = 78/317 (24%), Positives = 130/317 (41%), Gaps = 28/317 (8%) Query: 27 LCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTR 86 +C++ AY C +IASI N+ + H+ TDY ++ + +A + + Sbjct: 19 VCINDAYSQ------HCAATIASIFINNKNEVIKIHVITDYISKKNQSRLEKIAFNFNQQ 72 Query: 87 IKIYLINGDRLRSLPSTKNW-----THAIYFRFVIADYFINKAPKVLYLDADIICQGTIE 141 I+ Y N L P K+ T Y+R I K YLD D++ + Sbjct: 73 IQFYTFNNSTLNRWPCFKDGMPPHVTIQTYYRLFIPQILPLNIKKTFYLDCDLLVLHPLR 132 Query: 142 PLINFSFPDDKVAMVVTEGQADWWE---KRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQ 198 N + VA + AD W + A L + YFN+G LL+N Sbjct: 133 EFWNTKMQNKGVAAI-----ADQWTDYIEAATRLKYRN-DREYFNAGVLLLNLEYLRNHN 186 Query: 199 VSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNT-QFSLNYQLKESF---I 254 + AI + + I + DQDVLN L+ + I +K+N F +N ++ + + Sbjct: 187 FTNNAIDFVTKHA--NDIVYHDQDVLNKLIGENRIIMPVKWNVCSFKINDKIPHIYNATM 244 Query: 255 NPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLR-YSA 313 N D IH+ P KPW+ + +P + +PWK+ + N +R + Sbjct: 245 NDARKDPYIIHFFAPIKPWNQDS-SHPYRSYYYYFLQFTPWKHEVKCHYSLKNTIRTFLI 303 Query: 314 KHMLKKHRYLKGFSNYL 330 K L+K +Y +Y+ Sbjct: 304 KIGLRKSQYAIAPQSYM 320 >UniRef50_C0QZN2 Glycosyl transferase, family 8 n=3 Tax=Brachyspira RepID=C0QZN2_BRAHW Length = 339 Score = 84.7 bits (208), Expect = 4e-15, Method: Compositional matrix adjust. Identities = 74/274 (27%), Positives = 122/274 (44%), Gaps = 23/274 (8%) Query: 29 LDIAYGTDKNFLFGCGISIASILK-YNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRI 87 ++I +D N+ G +IASILK +E ++ FH+ ++++ +L + I Sbjct: 1 MNICLASDNNYAPYMGTAIASILKNSSEDEKIIFHLIDGGITKENKEKIISLKNIKECEI 60 Query: 88 KIY-----LINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEP 142 Y + +G +++ A+++R IA + K+LYLD+D+I G+++ Sbjct: 61 NFYTPDIKMYDG-WFEKTSCKAHFSAAMFYRLSIASIIPSNIDKILYLDSDLIATGSLKE 119 Query: 143 LINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSAR 202 L ++ A+V+ + + + GI YFNSG LLIN W + + Sbjct: 120 LFLMDI-ENHYAIVIKHSTNE-----KNKWSIDGI-NDYFNSGVLLINNKLWIKNNIEDQ 172 Query: 203 AIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFI-NPVTNDT 261 N K DQDVLN +L K+ +AD++YN Y E+ I NP Sbjct: 173 FNKFYNNNY---KTCFGDQDVLNNVLIGKVKYADMRYNVYAEKGYYNTENDIENP----- 224 Query: 262 IFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPW 295 I IHY+ P KPW + F +PW Sbjct: 225 IIIHYLSPEKPWKENCRGTLFIDEFWRYYQYTPW 258 >UniRef50_Q9L6B2 Putative glycosyl transferase n=3 Tax=Pasteurella RepID=Q9L6B2_PASMU Length = 302 Score = 84.3 bits (207), Expect = 5e-15, Method: Compositional matrix adjust. Identities = 64/260 (24%), Positives = 122/260 (46%), Gaps = 22/260 (8%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 ++I + +D + ++I SI+ +NE + F+IF D++++ + + Y + + Sbjct: 1 MNILFVSDDVYAKHLVVAIKSIINHNEKG-ISFYIFDLGIKDENKRNINDIVSSYGSEVN 59 Query: 89 IYLINGDRLRSLPSTKNW-THAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 +N S P ++ + A Y R A+Y + K++YLD D++ ++E L N Sbjct: 60 FIAVNEKEFESFPVQISYISLATYARLKAAEYLPDNLNKIIYLDVDVLVFNSLEMLWNVD 119 Query: 148 FPDDKVAMVVTEGQADWW---EKRAHSLGVAGIAKG-YFNSGFLLINTAQWAAQQVSARA 203 V +T D + EK H ++ K YFN+G +L N +W V +RA Sbjct: 120 -----VNNFLTAACYDSFIENEKSEHKKSISMSDKEYYFNAGVMLFNLDEWRKMDVFSRA 174 Query: 204 IAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTN---- 259 + +L ++ + DQD+LN+L +K+ + D ++N + ++K+ ++N Sbjct: 175 LDLL--AMYPNQMIYQDQDILNILFRNKVCYLDCRFNFMPNQLERIKQYHKGKLSNLHSL 232 Query: 260 -----DTIFIHYIGPTKPWH 274 + HY GP K WH Sbjct: 233 EKTTMPVVISHYCGPEKAWH 252 >UniRef50_C1QEC6 LPS:glycosyltransferase n=1 Tax=Brachyspira murdochii DSM 12563 RepID=C1QEC6_9SPIR Length = 242 Score = 84.0 bits (206), Expect = 7e-15, Method: Compositional matrix adjust. Identities = 72/257 (28%), Positives = 116/257 (45%), Gaps = 24/257 (9%) Query: 23 ETENLCLDIAYGTDKNFLFGCGISIASILKYN-EGSRLCFHIFTDYFGDDDRKYFDALAL 81 ET N+C DK F +I SILK + + FH+ T+ D+++ + L Sbjct: 3 ETMNICFT---ANDKYAPFMSA-TIVSILKNSKDDESFSFHVITNDISDENKMMIERLKE 58 Query: 82 QYKTRIKIYLINGDR----LRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQ 137 +IK Y N D+ + +++ +I+FR I + IN KVLYLD DII Sbjct: 59 IKTFKIKYYTPNIDKYNKWFEKINYQRHYAPSIFFRLDIPNLIIN-IDKVLYLDCDIIVN 117 Query: 138 GTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQ 197 ++ L N + V G ++ +K +G+ K YFNSG LL+N + + Sbjct: 118 SSLSELFNIDISEYFALAVEDTGDLNFLKKYKTKIGIEDKHK-YFNSGVLLLNNKLYMEK 176 Query: 198 QVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPV 257 ++ + N+ + I DQD+LN L DK+ F D K+N F + Sbjct: 177 NLNLESENYFNKYYNV--IECVDQDILNYLFRDKIKFIDNKWN-----------DFSSKN 223 Query: 258 TNDTIFIHYIGPTKPWH 274 + + +HY+G K W+ Sbjct: 224 IDKSAIMHYVGKIKSWN 240 >UniRef50_Q03HK5 Lipopolysaccharide biosynthesis glycosyltransferase n=1 Tax=Pediococcus pentosaceus ATCC 25745 RepID=Q03HK5_PEDPA Length = 549 Score = 83.2 bits (204), Expect = 1e-14, Method: Compositional matrix adjust. Identities = 68/281 (24%), Positives = 123/281 (43%), Gaps = 23/281 (8%) Query: 6 FQETEFLNSVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFT 65 E +L+ + +++E +++ + F+ S SIL+ + ++ F++ Sbjct: 256 LSEHPYLDEYHEELNELEINRGVVNVISAANSAFVEALATSYISILENDSENQYNFYLLP 315 Query: 66 DYFGDDDRKYFDALALQY-KTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKA 124 D+ D ++ +Y IKI ++ L + + + Y+R ++A + Sbjct: 316 DHLDQRDMLILGSVISRYDNASIKIVKVDEKLLENAVESDRILKSAYYR-ILAPELLPNI 374 Query: 125 PKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNS 184 + +YLD DII + L S + +A V G D R +G+ YFNS Sbjct: 375 NRAIYLDCDIIANTNLHDLWQTSLEGNVLAAVEDAGFHD----RLEHMGITHDNSKYFNS 430 Query: 185 GFLLINTAQWAAQQVSARAIAMLNE-PEIIKKITHPDQDVLNMLLADKLIFADIKYNTQF 243 G +LI+ W +Q V+ R + +N PE K+ DQD LN +L DK + K+N Q Sbjct: 431 GMMLIDLVSWRSQAVTQRVLDYINHNPE---KLRFHDQDALNAILYDKWLHLHPKWNAQS 487 Query: 244 SLNYQLKESFINPVT----------NDTIFIHYIGPTKPWH 274 ++ + ++ + P T + IH+ G KPWH Sbjct: 488 NI---VLDALVPPRTELLKLYAETRENPKLIHFCGHVKPWH 525 Score = 62.8 bits (151), Expect = 2e-08, Method: Compositional matrix adjust. Identities = 58/275 (21%), Positives = 118/275 (42%), Gaps = 20/275 (7%) Query: 29 LDIAYGTDKNFLFGCGISIASILK-YNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRI 87 +++ D+N+ I+I + L+ N+ +R+ F + ++ + + LA T + Sbjct: 4 INVLLAADENYADQLQITIKTTLENLNKKTRVNFIVLSNNLSNSTKLALKKLAHGLHT-V 62 Query: 88 KIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINK-APKVLYLDADIICQGTIEPLINF 146 + ++ P+ + Y+R + + ++LYLD D++ + + L + Sbjct: 63 EYLDLDPSVFAFCPTNSHINKTAYYRILAPQLLAKRNIDRILYLDVDLLVRHDLTELYDA 122 Query: 147 SFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKG---YFNSGFLLINTAQWAAQQVSARA 203 + V V+ GQA + LGV + YFNSG L+I+ +W ++ + Sbjct: 123 ELNHNIVGAVIDTGQAFALNR----LGVDPVVAANNIYFNSGILVIDIKKWNENHITEKT 178 Query: 204 IAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQ----LKESF---INP 256 + + + I DQD LN +LA + K+N Q S+ ++ + E++ IN Sbjct: 179 LNYIKHQSHL--IIFHDQDALNAVLAGHVQMLHPKWNLQNSIVFRKHRPINEAYDQLINE 236 Query: 257 VTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKN 291 +H+ KPW + ++P + E N Sbjct: 237 AIKSPAIVHFTTHEKPWKTLS-EHPYLDEYHEELN 270 >UniRef50_C5WAK3 Ybl156 protein n=2 Tax=Enterobacteriaceae RepID=C5WAK3_ECOBB Length = 163 Score = 82.4 bits (202), Expect = 2e-14, Method: Compositional matrix adjust. Identities = 45/160 (28%), Positives = 80/160 (50%), Gaps = 5/160 (3%) Query: 181 YFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYN 240 YFN+G + +N +W ++ + +L + + DQD LN+ I+ ++ Sbjct: 5 YFNAGVIYVNLKKWHEANLTPYLLKLLRGETKYGSLKYLDQDALNIAFNMNNIYLAKDFD 64 Query: 241 TQFSLNYQLKE----SFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWK 296 T ++L +L + + +T+ T+ IHY G TKPWH WA YP + F A+ SPWK Sbjct: 65 TIYTLKNELHDRSHRKYQQTITDKTVLIHYTGITKPWHSWA-GYPSASYFNIAREQSPWK 123 Query: 297 NTALLKPNNSNQLRYSAKHMLKKHRYLKGFSNYLFYFIEK 336 L + +++ KH+ Y+KG ++ + Y ++K Sbjct: 124 KYPLKEARTVAEMQKQYKHLFAHGEYIKGITSLIKYKLKK 163 >UniRef50_C1QBZ8 LPS:glycosyltransferase n=1 Tax=Brachyspira murdochii DSM 12563 RepID=C1QBZ8_9SPIR Length = 336 Score = 81.6 bits (200), Expect = 3e-14, Method: Compositional matrix adjust. Identities = 68/249 (27%), Positives = 115/249 (46%), Gaps = 22/249 (8%) Query: 26 NLCLDIAYGTDKNFLFGCGISIASILK-YNEGSRLCFHIFTDYFGDDDRKYFDALALQYK 84 N+CL +D+N+ +++ASILK N+ + FHI D+ + L Sbjct: 5 NICL----CSDENYAKYMAVTMASILKNTNDDENIIFHIIESNIKDETKNKLIYLKKIKN 60 Query: 85 TRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLI 144 IK Y + ++ + A Y R +I + I A KVLYLD+DII G+++ L Sbjct: 61 CEIKFYRVEYNK---------YPLATYLRLLIPE-LIKDADKVLYLDSDIIVNGSLKELF 110 Query: 145 NFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAI 204 + + A+ V + D +++ + + G + YFN+G +L N +S + Sbjct: 111 DIDI-NGYYALAVKDLYVDIYKEHKELIEI-GNNRIYFNAGVVLFNNKSCIDNNISQKFY 168 Query: 205 AMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFI 264 + E + K+ DQD+LN DK+ D K+N +Y K + P +D + I Sbjct: 169 SYFTENK--NKLKFHDQDILNHCFIDKVKIIDRKWNFMPFRDYNTKSHY--PTKDDAVII 224 Query: 265 HYIGPTKPW 273 H++ KPW Sbjct: 225 HFV-EHKPW 232 >UniRef50_A5ZFA9 Putative uncharacterized protein n=1 Tax=Bacteroides caccae ATCC 43185 RepID=A5ZFA9_9BACE Length = 310 Score = 81.3 bits (199), Expect = 5e-14, Method: Compositional matrix adjust. Identities = 71/276 (25%), Positives = 116/276 (42%), Gaps = 22/276 (7%) Query: 30 DIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKI 89 +I G D + CG + S+ + N G+ + ++ + + + L Y+ +I Sbjct: 3 NIICGIDDQYCQHCGAMLLSLFESNPGA-ITIYVLSLELSEKSKNLLKELVDSYQKQIHF 61 Query: 90 YLINGDRLRSLP--STKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 I + + + P ST + A Y R I + K LY+D+DII + I L Sbjct: 62 IDIPSELVLNFPMKSTDYPSLATYLRLFIPQLLPFEVDKALYVDSDIIFKKDISALY--- 118 Query: 148 FPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAML 207 D + G D + A LG + YFN+GF+L+N + +A+A + Sbjct: 119 --DSDITNYALAGMEDAPNQNALRLGFPE-SDLYFNAGFVLLNVKYLRDMDFTNKAMAYI 175 Query: 208 NEPEIIKKITHPDQDVLNMLLADKLIFADIKYNT--------QFSLNYQLKESFINPVTN 259 + +KI DQDVLN LL K++F IK+N F ++E N + Sbjct: 176 RDCR--EKIVLHDQDVLNALLHGKVLFVPIKWNMLDCFYRKPPFIAKKYMRELHEN--LD 231 Query: 260 DTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPW 295 IH+ GP KPWH +P+ + + W Sbjct: 232 SPAVIHFSGPLKPWHH-GCPHPLRKEYFNYSRKLSW 266 >UniRef50_A1BHG0 Glycosyl transferase, family 8 n=2 Tax=Chlorobium/Pelodictyon group RepID=A1BHG0_CHLPD Length = 307 Score = 80.5 bits (197), Expect = 7e-14, Method: Compositional matrix adjust. Identities = 72/285 (25%), Positives = 128/285 (44%), Gaps = 32/285 (11%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 ++I + TDKN++ ++ S+L+ N+ +I + + + + + +K Sbjct: 8 VNIVFATDKNYIQHLSAALVSLLENNKDLSFTVYIISSGMSEKSYRNIEEIIKTGNCTVK 67 Query: 89 IYLINGDRLRSLPSTKN-WTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 ++ + L + + Y+R +I D K+LYLD+DII G+I+ L N Sbjct: 68 HITVSDELFVKLATAHPFYPKGTYYRLLIPDLI--DEEKILYLDSDIIVNGSIKELYNQD 125 Query: 148 FPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAML 207 D V + G + R + I YFNSG +LIN A+W + + + I + Sbjct: 126 VEDYFVCAIEDPG---FDRHRQLQMDKESI---YFNSGMMLINLAKWKSTGLQKKVIDFI 179 Query: 208 -NEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQ---FSLNY----------QLKESF 253 + P+ I PDQ LN ++ + +KYN Q FS ++ +L E+ Sbjct: 180 EHNPD---AIWFPDQCGLNSVINGRWKKVPLKYNQQSSIFSDDFEKKFDCFSVEELAEAK 236 Query: 254 INPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNT 298 NPV IHY G +KPWH + +P + + + +P++N Sbjct: 237 KNPV-----IIHYTGGSKPWH-FKNRHPYKKLYWKYLKMTPYRNA 275 >UniRef50_A4ECW2 Putative uncharacterized protein n=1 Tax=Collinsella aerofaciens ATCC 25986 RepID=A4ECW2_9ACTN Length = 328 Score = 79.3 bits (194), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 62/261 (23%), Positives = 109/261 (41%), Gaps = 22/261 (8%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSR-LCFHIFTDYFGDDDRKYFDALALQYKTRI 87 +++ Y D NF+ +I S++ + G + + FH+F++ +D+++ + +Y + Sbjct: 4 MNLLYTVDNNFVPQLAANICSVVSNHSGIQDITFHVFSNGITEDNQRLLQEMVTEYNQNL 63 Query: 88 KIYLING--DRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLIN 145 Y I+ D L T W + R ++A + N+ +V+YLD D I G I L N Sbjct: 64 VFYDISNFKDALGFDFDTSGWNEIVLARLLMAHFLPNEIERVIYLDGDTIVLGDIALLWN 123 Query: 146 FSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIA 205 V MV R + L + G Y N+G LL++ QW + + + Sbjct: 124 QDLKGCVVGMVPEPTVG---PSRLNDLDLNGCL--YHNAGVLLVDLKQWRSTCCEDQLLD 178 Query: 206 MLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQ------------LKESF 253 ++ DQD LN +L DK+ +N +Y E+ Sbjct: 179 YCERRS--GRLFANDQDALNAVLKDKICSLSPAFNYSNIFDYYPFIFLNSLMPGFSDENS 236 Query: 254 INPVTNDTIFIHYIGPTKPWH 274 N + I +HY+G +PW Sbjct: 237 FNTARSKPIVVHYLGEERPWR 257 >UniRef50_C6I3U6 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=C6I3U6_9BACE Length = 310 Score = 79.3 bits (194), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 75/274 (27%), Positives = 114/274 (41%), Gaps = 23/274 (8%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 ++I D N++ + + S N+ ++ T D + Y + +Y + Sbjct: 1 MNILCCLDDNYVQHTSVMLTSFFINNDFEHHNIYVITMQLNDGNVAYLREVVNKYHSNFY 60 Query: 89 IYLINGDRLRSL--PSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINF 146 +Y +N L T + A Y R KVLY+D DI+ + ++E L Sbjct: 61 LYQVNEAMLSGFVRKETDYVSLAAYLRLFSTQVLPFNCSKVLYIDGDIVVRKSLEELWKM 120 Query: 147 SFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAM 206 + VA V +A+ H+ V GYFNSGF+LIN + W V+ +AI Sbjct: 121 DIENYAVAAVDETIKANCIR---HNYDVT---LGYFNSGFMLINLSFWRENSVAEKAIDY 174 Query: 207 LNE-PEIIKKITHPDQDVLNMLLADKL-IFADIKYN-TQFSLNYQLKES---------FI 254 + PE IK DQD LN +L L D+KYN T L Q E Sbjct: 175 MKRFPERIKSW---DQDALNGILYGGLWKRLDLKYNLTTIFLCKQYVEGQDFPKIYTEEY 231 Query: 255 NPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFME 288 N +D +HY GP KPW D+P + +++ Sbjct: 232 NSAISDPAVVHYTGPDKPWKYTVVDHPFKKDYLQ 265 >UniRef50_D2MYR1 Putative uncharacterized protein n=1 Tax=Campylobacter jejuni subsp. jejuni 414 RepID=D2MYR1_CAMJE Length = 383 Score = 79.3 bits (194), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 61/209 (29%), Positives = 92/209 (44%), Gaps = 22/209 (10%) Query: 47 IASILKYN--EGSRLCFHIFTDYFGDDDRKYFD----ALALQYKTRIKIYLINGDRLRSL 100 I ++ +YN E FHI +D+ D R + LA Y IKIY+IN D R+ Sbjct: 46 ILTLKQYNKSEEEGYVFHILSDFISDKTRMKLEYLKENLAKIYPCDIKIYIINEDNFRNF 105 Query: 101 PSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEG 160 K Y+R ++ K LY+DAD++C I L F +DKV V + Sbjct: 106 LHWKG-NFVAYYRLMVGSILPPDIEKCLYIDADMLCFSDIRKLFLFDL-EDKVLGAVADF 163 Query: 161 QADWWEKRAHSL--------GVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEI 212 W R G ++ YFNSG LLI+ +W Q + + + +L + Sbjct: 164 AT--WNTRFLKFRKLKYLFKGFLKFSREYFNSGLLLIDLKEWRRQNIEKKCLDVLKYYKC 221 Query: 213 IKKITHPDQDVLNMLLADKLIFADIKYNT 241 I PDQD LN+++ + I + +N Sbjct: 222 I----LPDQDALNIVIKENYIKLPLSFNC 246 >UniRef50_B0NR59 Putative uncharacterized protein n=1 Tax=Bacteroides stercoris ATCC 43183 RepID=B0NR59_BACSE Length = 306 Score = 78.2 bits (191), Expect = 4e-13, Method: Compositional matrix adjust. Identities = 74/267 (27%), Positives = 121/267 (45%), Gaps = 32/267 (11%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGD---DDRKYFDALALQYKTRI 87 I + D N++ G+ I S+L N + + I+ D D++ + YK I Sbjct: 6 IVFSIDHNYVMQAGVCILSLLM-NSDEKEYYDIYILSAADITEHDKELLNKTIFAYKADI 64 Query: 88 KIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 I+ DR + +N + A YFR +I D I + K++Y D D+I Q ++ +++ Sbjct: 65 NFIEID-DRFDNAFEIRNISKAAYFRLLIPD-LIPQYDKIIYSDVDVIFQSGLQEVLDTD 122 Query: 148 FPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAML 207 D+ + G A+ + LG+ GY NSGFLLIN +Q+ + L Sbjct: 123 LKDNYFGGIKAIG-AESIKDYIIQLGLN--IHGYINSGFLLINAKLQREKQLFNKIQEYL 179 Query: 208 NEPEIIKKITHPDQDVLNMLLADKLIFADIKYN-TQFSLNYQLKESFINPVTNDTIF--- 263 KK DQD++N++ ++L F +KY TQ S Y+L + NP ++F Sbjct: 180 T-----KKFQFQDQDIINIVCKNRLTFLPLKYCFTQKS--YEL--YYTNPKRLFSVFSPK 230 Query: 264 ----------IHYIGPTKPWHDWAWDY 280 IHY G KPW+ + + Y Sbjct: 231 EVEEAFTEGIIHYEGTNKPWNGFCYRY 257 >UniRef50_Q38VK7 Putative bifunctional glycosyl transferase, family 8 n=2 Tax=Lactobacillus sakei subsp. sakei 23K RepID=Q38VK7_LACSS Length = 569 Score = 78.2 bits (191), Expect = 4e-13, Method: Compositional matrix adjust. Identities = 67/284 (23%), Positives = 122/284 (42%), Gaps = 25/284 (8%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSR-LCFHIFTDYFGDDDRKYFDALALQYKTRI 87 ++I + NF+ I ASIL N+ R F + +D D+ + + + Sbjct: 285 INIVSAANSNFVEPLAILYASILNNNDDDRHYAFFVLSDQLTARDQATLRQITESFNAEL 344 Query: 88 KIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 ++ L ++ Y+R +I + + + +VLYLD D +C + L + Sbjct: 345 TFIEVDEIPLTAVIQDGQVLKTAYYRLLIPN-LLPEIERVLYLDCDTLCLENLARLWDVE 403 Query: 148 FPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAML 207 + VA V G + R + + + YFN+G LL+N W Q+++ + + + Sbjct: 404 LGNIPVAAVEDAG----FHNRLAQMAIDYKSIRYFNAGVLLMNLTIWRQQKITEQILTFI 459 Query: 208 NE-PEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSL--------NYQLKESFINPVT 258 E P+ K+ DQD LN +L D+ I K+N Q S+ ++ F++ Sbjct: 460 KEYPQ---KLRFHDQDALNAILHDRWIHLHPKWNVQTSILMDFIVAPTERINRQFLS-AQ 515 Query: 259 NDTIFIHYIGPTKPW-----HDWAWDYPVSQA-FMEAKNASPWK 296 + IH+ G KPW H + Y ++ F+E N P++ Sbjct: 516 KEPGLIHFCGSEKPWDKSSTHPYTPQYRFYKSRFLENNNPVPFR 559 Score = 64.7 bits (156), Expect = 4e-09, Method: Compositional matrix adjust. Identities = 50/199 (25%), Positives = 90/199 (45%), Gaps = 30/199 (15%) Query: 92 INGDRLRSLPSTKNWTHAIYFRFVIADYFINK-APKVLYLDADIICQGTIEPLINFSFPD 150 IN R+++ P ++ Y+R + + + +VLYLD D + + + PL + Sbjct: 73 INPRRIKNFPGNNHFDQTAYYRILAPQILLARHIERVLYLDLDTLIRTDLTPLYDSDLEG 132 Query: 151 DKVAMVVTEGQADWWEKRAHSLGVAGIAKG-----YFNSGFLLINTAQWAAQQVSARAIA 205 + + V+ G +A +L G+ K YFN+G L+I+T W +S + +A Sbjct: 133 NIIGAVIDPG-------KALTLKRLGVPKSQANNIYFNAGVLIIDTILWETHHISQKILA 185 Query: 206 MLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTND----- 260 ML P +++ QD LN++LA + K+N Q ++ ++ E P+ N+ Sbjct: 186 ML-VPYPGRRVND-IQDALNVVLAGRTKLLAPKWNVQNAILFKTYE----PINNEYSQLF 239 Query: 261 ------TIFIHYIGPTKPW 273 IH+ KPW Sbjct: 240 KQAIMAPKIIHFTTEKKPW 258 >UniRef50_B2UPJ4 Glycosyl transferase family 8 n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UPJ4_AKKM8 Length = 315 Score = 77.8 bits (190), Expect = 5e-13, Method: Compositional matrix adjust. Identities = 82/323 (25%), Positives = 136/323 (42%), Gaps = 37/323 (11%) Query: 29 LDIAYGTDKNFLFGCGISIASILK-YNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRI 87 ++I Y TD N G G+SI S+++ G +I T D+ F +L Y + Sbjct: 1 MNIVYATDDNGALGTGVSIVSLMENLPPGVHADIYIMTGGLSGDNTARFHSLQQGYNLHL 60 Query: 88 KIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 ++ D+ P W+ A Y+R +A + LY+D D I I P+ Sbjct: 61 H-FIDMKDKYTDFPVGSKWSAATYYRLGLAGELPATVERALYVDIDTIFNRDISPMYESE 119 Query: 148 FPDDKVAMVVT------EGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSA 201 F D +A V T E + W KR +LG I Y N+G +L + + + + Sbjct: 120 FGDCLIAGVFTTEDLSEESFSRW--KREMNLGRDSI---YINAGVILYHIGRIREECFES 174 Query: 202 RAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYN----TQFSLNYQLKESFI-NP 256 + ++ I +++ DQD+LN+ +++ +N +S+ ++ SF NP Sbjct: 175 QVLSWAKNN--IHRLSWQDQDILNVCYQQRILLLHPMWNICDGAIWSIRWEGVTSFRNNP 232 Query: 257 VTNDTIF--------IHYIGPTKPWHDWA--WDYPVSQAFMEAKNASPWKNTA--LLKPN 304 + + IHY G KPWH + DY + F + SPWK+ K N Sbjct: 233 LKPADLLEAARRPGIIHYWGHPKPWHPNSIRQDYGL---FYKYWKKSPWKDDIRDFRKQN 289 Query: 305 NSNQLRYSAKHML--KKHRYLKG 325 + ++ S L K R L+G Sbjct: 290 DPGRMFISKMRCLLGKGKRLLQG 312 >UniRef50_D2RIJ7 Glycosyl transferase family 8 n=1 Tax=Acidaminococcus fermentans DSM 20731 RepID=D2RIJ7_ACIFE Length = 330 Score = 77.8 bits (190), Expect = 5e-13, Method: Compositional matrix adjust. Identities = 79/323 (24%), Positives = 134/323 (41%), Gaps = 41/323 (12%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 L I + F G+ + SI + N+ L FH+F D D++++ A +Y Sbjct: 35 LHICCNVNDLFFKPAGVLLTSICENNKDLALNFHVFVDSCSDENKENLRKTAEKYGCNAY 94 Query: 89 IYLINGDRLRSLP-STKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 +Y ++ ++ K ++ Y R V+ N + LYLDAD++C ++ N+ Sbjct: 95 LYKMDMSIYQNFHIKVKRFSRVTYIRIVMPWVLRNVTNRYLYLDADMVCVKSLRVFFNYD 154 Query: 148 FPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAML 207 D V +V + +R L + G YF+ G + IN +W Q+V+ R + Sbjct: 155 LKDKAVGALVYDT-----PERIAFLKMKG--NVYFSDGLMWINVDEWIKQRVTERVFSYQ 207 Query: 208 N-EPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHY 266 +P K T QD++N++L D +L + + + F + D I IHY Sbjct: 208 GADPARFKGQT---QDLMNLVL-------DGNVQPIPALFHHMDKDF----SVDGILIHY 253 Query: 267 IGPTKPWH------DWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHML--- 317 G KPW D W + + + SPW + P +S K + Sbjct: 254 SGRDKPWEIVLDEDDELWRHYL--------DISPWPSMPNPMPPKRPIYYHSFKKLAQVY 305 Query: 318 -KKHRYLKGFSNYLFYFIEKIKH 339 KK +LK +Y I KI++ Sbjct: 306 SKKGNHLKELECLFWYGILKIRY 328 >UniRef50_A6LGX5 Glycosyltransferase family 8 n=1 Tax=Parabacteroides distasonis ATCC 8503 RepID=A6LGX5_PARD8 Length = 325 Score = 77.0 bits (188), Expect = 8e-13, Method: Compositional matrix adjust. Identities = 77/330 (23%), Positives = 138/330 (41%), Gaps = 31/330 (9%) Query: 30 DIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKI 89 DI +D N+L I S+ + N L FH+ ++ D K + Y+ ++ + Sbjct: 3 DIVVASDCNYLHLVSICAVSLFETNSSESLHFHLLSNGIDSADIKNLQTIVEGYRGKLSV 62 Query: 90 YLINGDRLR---SLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINF 146 Y I R R +P T + T Y R KVLY+D DII G+I L N Sbjct: 63 YPIENLRERLMTDVPETISLTS--YARLFAGSILPANLDKVLYIDCDIIFNGSIRDLFNT 120 Query: 147 SFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAM 206 + V ++ + ++K +++ Y N+G L+I +W ++ + + + Sbjct: 121 DLGNCLVGGILDPLISRTYKKEIK----IPMSEPYINAGVLIIPLNRWRSEGMEQKFVDF 176 Query: 207 LNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQF-SLNYQLK------------ESF 253 L K+ H DQ ++N + A + ++N SL Y K E + Sbjct: 177 LVANR--GKVHHHDQGIINAVCAGRKKILPPQFNVMSNSLCYPWKDLYKINTPFYDQEEY 234 Query: 254 INPVTNDTIFIHYIGPT--KPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRY 311 +++ I IH+ G +PW +P + F++ K + +K+ LKPNN + + Sbjct: 235 KKGISSPAI-IHFTGAIHGRPW-IVGCTHPYANKFLQFKAKTAYKDIP-LKPNNQSAALH 291 Query: 312 SAKHMLKKHRYLKGFSNYL--FYFIEKIKH 339 + +L + F Y+ Y++ KH Sbjct: 292 RLEGILYRLLPFSLFKRYMQSVYYLSYFKH 321 >UniRef50_Q4HGS8 General stress protein A, putative n=2 Tax=Campylobacter RepID=Q4HGS8_CAMCO Length = 403 Score = 76.6 bits (187), Expect = 1e-12, Method: Compositional matrix adjust. Identities = 70/275 (25%), Positives = 118/275 (42%), Gaps = 42/275 (15%) Query: 31 IAYGTDKNFLFGCGISIASILK------YNEGSRLCFHIFTDYFGDDDRKYFDALALQ-- 82 I + D+N++ + I SI+K + + FHI +++ ++ R+ + L + Sbjct: 4 IIFSADENYIKYTSVLITSIIKNTNPKNHFQNRPYSFHILSNFVSEETREKLECLKKELN 63 Query: 83 --YKTRIKIYLINGDRLRSLPSTKNWTHAI--YFRFVIADYFINKAPKVLYLDADIICQG 138 Y I I++++ DR + PS+ ++ Y+R F + K LYLD+D++C Sbjct: 64 KIYPCEISIHIMSDDRFENFPSSGAAQNSKLPYYRLKFISLFDDNVDKCLYLDSDMLCMC 123 Query: 139 TIEPLINFSFPDDKVAMVVTEGQADWWEK--RAHSLGVAGIAKGYFNSGFLLINTAQWAA 196 I + + +V G K ++ V + YFNSGFLLIN ++ Sbjct: 124 DIREIFAIDLQGKIIGVVGDPGSKRSKIKFIENNTKKVLKFDENYFNSGFLLINAKEYKK 183 Query: 197 QQVSARAIAMLNEPEIIKK---ITHPDQDVLNMLLA-DKLIFADIKYN------------ 240 V + E+ KK I DQD+LN +++ DK++ YN Sbjct: 184 ANVEKKC------EELAKKCIYIKAADQDLLNAVISKDKILKLSFAYNFNIITLLYVICK 237 Query: 241 --TQFSLNYQLKESFINPVTNDTIFIHYIGPTKPW 273 + LNY +E F N I +HY KPW Sbjct: 238 DEKKNRLNYT-REEFTQSAKNPKI-LHY--GEKPW 268 >UniRef50_B3Z5I6 Glycosyltransferase family 8 n=2 Tax=Bacillus cereus group RepID=B3Z5I6_BACCE Length = 317 Score = 76.3 bits (186), Expect = 1e-12, Method: Compositional matrix adjust. Identities = 69/293 (23%), Positives = 132/293 (45%), Gaps = 28/293 (9%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEG-SRLCFHIFTDYFGDDDRKYFDALALQYKTRI 87 L++ Y +D N+ G+S+ S+L+ N+ + L + + ++K +++ +Y I Sbjct: 3 LNVVYSSDDNYAQHVGVSLLSLLQNNQHFNNLNIFLIENNISSYNKKNLNSVCKKYNKTI 62 Query: 88 KIYLINGDRLRSLPSTKNWTHAI--YFRFVIADYFINKAPKVLYLDADIICQGTIEPLIN 145 + Y+ L L N + AI Y R +A + K++YLD D I ++ L Sbjct: 63 Q-YINFNVLLERLELNINDSIAINSYARLFLAGIIPEELDKIIYLDCDSIINSSLSDLW- 120 Query: 146 FSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIA 205 D V G D + ++GY N+G LLIN +W + + + + Sbjct: 121 ----DTDVTEYFVAGVCDTVSNQTKLRIDMDKSEGYINAGMLLINLKKWREENIEQKFME 176 Query: 206 MLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQ---FSLN-------YQLK----E 251 + + + + H DQ +N +L DK+++ K+N F+++ Y+L+ E Sbjct: 177 FIKKKD--GNVFHHDQGTINGVLKDKILYLHPKFNAMTPFFTMSRKEIMSYYELENYYNE 234 Query: 252 SFINPVTNDTIFIHYIGP--TKPWHDWAWDYPVSQAFMEAKNASPWKNTALLK 302 I+ + +FIHY +PW + +P++ + + +PWK+T L K Sbjct: 235 IEIDEAVKNPVFIHYTPAFVNRPWIE-GCKHPLTSLYKSYLDMTPWKSTDLWK 286 >UniRef50_C7IBC1 Glycosyl transferase family 8 n=2 Tax=Clostridium papyrosolvens DSM 2782 RepID=C7IBC1_9CLOT Length = 452 Score = 76.3 bits (186), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 68/282 (24%), Positives = 118/282 (41%), Gaps = 17/282 (6%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNE-GSRLCFHIFTDYFGDDDRKYFDALALQYKTRI 87 + I D +++ G+ I S+L+ + L F++ D D++ + Y +I Sbjct: 4 VKIVSACDSHYVQHLGVMITSLLENTSMKTSLEFYVIDGGITDADKELLCSCTCLYGCKI 63 Query: 88 KIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 I D + + + A YFR +++ KV+YLD DI+ I L Sbjct: 64 NFITIQADFYARFGESPSASDATYFRIFVSELLDTSVEKVIYLDCDIVVIKDIAELWKTD 123 Query: 148 FPDDKVAMVVTEG---QADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAI 204 + +A V G ++ LG+ YFN+G LLIN +W + +S Sbjct: 124 VSEYFLAAVADCGVEYSGEYAVTLKRKLGMKR-KDCYFNAGVLLINLVKWREESISKSIC 182 Query: 205 AMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFS-LNYQLKESF----INPVTN 259 L E + KI DQD LN +L ++ + D ++N Q + + +E + Sbjct: 183 KFLFENK--GKIDFADQDGLNAVLCNRWLPLDSRWNQQVAHCEFYEQEKVVWENVTRAVR 240 Query: 260 DTIFIH----YIGPTKPWHDWAWDYPVSQAFMEAKNASPWKN 297 + IH Y TKPW ++ +P Q + + +PWK+ Sbjct: 241 EPWIIHYTTSYFSGTKPW-NYLDMHPYRQEYYRYLHMTPWKS 281 >UniRef50_C7IBC8 Glycosyl transferase family 8 n=1 Tax=Clostridium papyrosolvens DSM 2782 RepID=C7IBC8_9CLOT Length = 464 Score = 75.5 bits (184), Expect = 3e-12, Method: Compositional matrix adjust. Identities = 61/236 (25%), Positives = 104/236 (44%), Gaps = 12/236 (5%) Query: 72 DRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLD 131 +++ A +Y +RI+ + + + + + + YFR I + KV+YLD Sbjct: 10 NKECLRACVEKYGSRIRFLELKPELYQDFKTQSYFGYVTYFRIFIPEIVEASVRKVIYLD 69 Query: 132 ADIICQGTIEPLINFSFPDDKVAMVVTEG---QADWWEKRAHSLGVAGIAKGYFNSGFLL 188 DI+ +G I L + VA V G ++ +G+ K YFN+G LL Sbjct: 70 CDIVIKGDIRKLWENDISEYFVAAVEDVGIDIGGNFATMVKKHIGIPRKGK-YFNAGVLL 128 Query: 189 INTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQ 248 IN +W A + + L E +KI DQD LN + D+ + I++N Q + Sbjct: 129 INLDKWRADKTTETIRKYLIENR--EKIYFADQDGLNAVFKDRWLKLPIEWNQQADILEL 186 Query: 249 LKESFIN-----PVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTA 299 LK + I+ + + IHY KPW + +P+ + + +PW +TA Sbjct: 187 LKRNRIDRPDVMKAALNPMIIHYTKQVKPWQ-YKDCHPLKEEYHRYLRLTPWNDTA 241 >UniRef50_B3CA80 Putative uncharacterized protein n=1 Tax=Bacteroides intestinalis DSM 17393 RepID=B3CA80_9BACE Length = 301 Score = 74.7 bits (182), Expect = 4e-12, Method: Compositional matrix adjust. Identities = 73/300 (24%), Positives = 133/300 (44%), Gaps = 24/300 (8%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIF-TDYFGDDDRKYFDALALQYKTRI 87 +DI D+N++ CG+ +AS+ + + HI + +K +++ + Sbjct: 2 IDIVCSIDENYIEYCGVMLASLFVHTPDEKFRVHIICSSKVEKAGKKRLKVFCEKHQAEV 61 Query: 88 KIYLINGDRLRSLPSTK--NWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLIN 145 Y ++ ++ P K + + A Y R +++ + K+LYLD D+I +I+ L Sbjct: 62 YFYDVDYSLIKDFPIRKQDHLSLAAYLRLFMSELIPSNINKILYLDCDLIVVDSIKELWE 121 Query: 146 FSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQ-VSARAI 204 + D +A+ E ++ + + +L + YFNSG +LIN +W ++ V A Sbjct: 122 KNI--DNIAVAAVEERSPFDTESPVTLKYP-VEYSYFNSGVMLINLQKWREKKFVEACKS 178 Query: 205 AMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNY-------QLKESFINPV 257 + + E IK H DQDVLN LL + F I++N Y + K+ + + + Sbjct: 179 YIASNYENIK--LH-DQDVLNALLYKEKQFISIRWNLMDFFLYASPEVQPERKKDWDDAL 235 Query: 258 TNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHML 317 + I IH+ G KPW + D P ++ W NN N + Y + +L Sbjct: 236 KSPAI-IHFTGKRKPWM-YNCDSPFRDQYIRFAKQQGWHVI-----NNKNAIHYFFRKIL 288 >UniRef50_C7HS13 Family 8 glycosyl transferase n=2 Tax=Anaerococcus RepID=C7HS13_9FIRM Length = 276 Score = 74.7 bits (182), Expect = 5e-12, Method: Compositional matrix adjust. Identities = 70/259 (27%), Positives = 120/259 (46%), Gaps = 20/259 (7%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 ++I D+N+L + S+ + N+ + F I+ + D K + K K Sbjct: 1 MNILVSCDENYLNPLKTMLYSLFESNDTN---FEIYLIHKDIRDEKIKEIEKFVIKASSK 57 Query: 89 IYLINGDRLRSLPSTKN----WTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLI 144 +N ++++L S +T +Y+R + Y ++LYLD D++ + E L Sbjct: 58 RAKLNAIKVKNLFSNAKITFYYTEEMYYRLLAYKYLPENLDRILYLDPDVLVLNSCEKLY 117 Query: 145 NFSFPDDKVAM---VVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSA 201 N D+ A + Q+ + + S G I + YFNSG L+IN Q Sbjct: 118 NMDLGDNYFAAATHTIPTVQSANVARLSISSGHKDI-ENYFNSGILMINLKLSRDSQTYE 176 Query: 202 RAIAMLNEPEIIKK--ITHPDQDVLNMLLADKLIFAD-IKYN--TQFSLNYQLKESFINP 256 + + LN + K + PDQD+LN++ +K+I D IKYN + L Y+LK+ N Sbjct: 177 KEV--LNYVKNTKSLGLIMPDQDLLNVVFRNKIIKIDEIKYNYDARRYLTYKLKDKKYNL 234 Query: 257 --VTNDTIFIHYIGPTKPW 273 + ++T F+H+ G KPW Sbjct: 235 SYIISNTCFLHFCGKRKPW 253 >UniRef50_Q3DNA2 Glycosyl transferase, family 8 n=9 Tax=Streptococcus RepID=Q3DNA2_STRAG Length = 272 Score = 73.9 bits (180), Expect = 7e-12, Method: Compositional matrix adjust. Identities = 66/204 (32%), Positives = 96/204 (47%), Gaps = 19/204 (9%) Query: 101 PSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVV--T 158 P+T + IY+R + + ++LYLDAD++C L + D A T Sbjct: 72 PTTDRYPDTIYYRLLAHKFLPETLDRILYLDADMLCLNDFSSLYDMELGDQLYAAASHNT 131 Query: 159 EGQ-ADWWEKRAHSLGVAGIAKGYFNSGFLLIN--TAQWAAQQVSARAIAMLNEPEIIKK 215 +G+ D+ K L + YFN+G LL+N + Q + M N +I Sbjct: 132 DGKFLDYVNKL--RLKNVELESSYFNTGVLLMNLPAIRKVVHQQTILDYMMQNRGRLIL- 188 Query: 216 ITHPDQDVLNMLLAD--KLIFADI-KYNTQFSLNYQLK---ESFINPVTNDTIFIHYIGP 269 PDQD+LN L A+ K I +I Y+ ++SL YQLK E + V N T+F+H+ G Sbjct: 189 ---PDQDILNGLYANLVKPIPDEIYNYDARYSLIYQLKSRNEWDLEWVINHTVFLHFAGR 245 Query: 270 TKPW-HDWAWDYPVSQAFMEAKNA 292 KPW D+ Y FM AK A Sbjct: 246 DKPWKKDYRGRYSGLYKFM-AKEA 268 >UniRef50_D0D9G3 Putative general stress protein A n=1 Tax=Citreicella sp. SE45 RepID=D0D9G3_9RHOB Length = 327 Score = 73.9 bits (180), Expect = 7e-12, Method: Compositional matrix adjust. Identities = 75/294 (25%), Positives = 125/294 (42%), Gaps = 35/294 (11%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYN-EGSRLCFHIFTDYFGDDDRKYFDALALQYKTRI 87 +++ Y D G+SIAS L+ EG+ + H+ + +RK ++A Q+ R Sbjct: 12 INVVYACDNIQALPLGVSIASALENRAEGNPINIHVLSYRISRSNRK---SIASQFDGRD 68 Query: 88 KI---YLINGDRLRSL-----PSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGT 139 + I G+ + L S + + A Y R +I++ N + +YLD DII Sbjct: 69 DTLCWHEITGENRKLLEDLFTSSNRPYPPAAYARLLISEVIPN-IDRAIYLDTDIIVATD 127 Query: 140 IEPLINFSFPDDKVAMVVTEGQADWWEKRAHSL------GVAGIAKG--YFNSGFLLINT 191 + PL N F + + ++ KR +L GI G YF SG L+ + Sbjct: 128 LSPLWNTPFDGAGLLAIQDLPTSNDHIKRLRALLSPEDISRYGIEDGDSYFQSGVLVFDM 187 Query: 192 AQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKE 251 ++ + S + N P++ T PD D LN++ D D ++N S+ ++L Sbjct: 188 KEFTKTRASELIECLRNYPDL----TFPDNDALNIVFHDSFKLVDPRWNQMASV-FKLDA 242 Query: 252 SFINP--------VTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKN 297 + P + D IHY G KPW D +P ++EA S W + Sbjct: 243 ARDTPYSAEVFQALLQDPYIIHYSGRPKPWED-GCTHPYLDRWVEALKDSAWNS 295 >UniRef50_P43974 Putative glycosyltransferase HI0258 n=32 Tax=Haemophilus influenzae RepID=Y258_HAEIN Length = 330 Score = 73.6 bits (179), Expect = 1e-11, Method: Compositional matrix adjust. Identities = 59/247 (23%), Positives = 115/247 (46%), Gaps = 7/247 (2%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 ++I + +D + +SI SI+K N ++ F+I +++ + LA Y ++ Sbjct: 39 MNIIFSSDHYYAPYLAVSIFSIIK-NTPKKINFYILDMKINQENKTIINNLASAYSCKVF 97 Query: 89 IYLINGDRLRSLPSTKNW-THAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 + ++ P T ++ + A Y R + Y I K +Y+D D + +++ L N Sbjct: 98 FLPVCESDFQNFPKTIDYISLATYARLNLTKY-IKNIEKAIYIDVDTLTNSSLQELWNID 156 Query: 148 FPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAML 207 + +A E ++G+ G + YFN+G LLIN +W + + ++I + Sbjct: 157 ITNYYLAACRDTFIDVKNEAYKKTIGLEGYS--YFNAGILLINLNKWKEENIFQKSINWM 214 Query: 208 NEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYI 267 N+ + K + DQD+LN + K+ F + ++N + +K+ + V + HY Sbjct: 215 NKYNNVMK--YQDQDILNGICKGKVKFINNRFNFTPTDRDLIKKKNLLCVKMPIVISHYC 272 Query: 268 GPTKPWH 274 GP K WH Sbjct: 273 GPNKFWH 279 >UniRef50_C9RWX3 Glycosyl transferase family 8 n=8 Tax=Bacillaceae RepID=C9RWX3_GEOSY Length = 276 Score = 72.4 bits (176), Expect = 2e-11, Method: Compositional matrix adjust. Identities = 65/258 (25%), Positives = 108/258 (41%), Gaps = 31/258 (12%) Query: 35 TDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIYLING 94 TD N+L + + S+ N + +++ ++ + + + Q + IY ++ Sbjct: 8 TDANYLPPLRVLMHSLFCNNRRPFTFYLLYSRIAEEEIQALGEFVRRQGHELVPIY-VDP 66 Query: 95 DRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVA 154 P +++T +Y+R + +VLYLD DI+ ++ L + F Sbjct: 67 QLFHDAPVFRHYTVEMYYRLAAHLFLPPDVDRVLYLDPDIVAINPMDELYDMDF------ 120 Query: 155 MVVTEGQADWWEKRAHSLGVAGI----------AKGYFNSGFLLINTAQWAAQQVSARAI 204 EG + HS VA + AKGYFN+G +++N A A Sbjct: 121 ----EGNLFIAAEHTHSTKVANLFNKLRLKTPNAKGYFNTGVMMMNIAMMREHVRLADIY 176 Query: 205 AMLNEPEIIKKITHPDQDVLNMLLADKLIFADI-KYNTQFSLNYQLKESFINP------V 257 + + K+ PDQDVLN L DK+ D +YN + Y + NP + Sbjct: 177 QFIRDNRF--KLVLPDQDVLNGLYWDKIKPVDCYRYNYD-ARYYDFLQLLPNPKHDLAWI 233 Query: 258 TNDTIFIHYIGPTKPWHD 275 +T+FIHY G KPW D Sbjct: 234 EENTVFIHYCGKEKPWKD 251 >UniRef50_B2ISC6 Glycosyl transferase, family 2/glycosyl transferase family 8 n=8 Tax=Streptococcus pneumoniae RepID=B2ISC6_STRPS Length = 696 Score = 72.4 bits (176), Expect = 2e-11, Method: Compositional matrix adjust. Identities = 54/176 (30%), Positives = 80/176 (45%), Gaps = 27/176 (15%) Query: 107 THAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWE 166 ++ ++ R+ IAD+ + K LYLD D++ ++ L D ++A V G Sbjct: 377 SYTVFLRYFIADFV--QEDKALYLDCDLVVTKNLDDLFATDLQDYRLAAVRDFG------ 428 Query: 167 KRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNM 226 G A + FN+G LL+N A W + + + I + NE K+ DQ +LNM Sbjct: 429 ------GRAYFGQEIFNAGVLLVNNAFWKKENMIQKLIDVTNEWH--DKVDQADQSILNM 480 Query: 227 LLADKLIFADIKYN-----TQFSLNYQLKESFINPVTNDTIFIHYIGPTKPWHDWA 277 L K + D YN QF+ +YQL E P IHY+ KPW D A Sbjct: 481 LFEHKWLELDFDYNHIVIHKQFA-DYQLPEGQDYPA-----IIHYLSHRKPWKDLA 530 >UniRef50_UPI000196958D hypothetical protein BACCELL_01586 n=1 Tax=Bacteroides cellulosilyticus DSM 14838 RepID=UPI000196958D Length = 305 Score = 71.6 bits (174), Expect = 4e-11, Method: Compositional matrix adjust. Identities = 57/224 (25%), Positives = 91/224 (40%), Gaps = 12/224 (5%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 ++I D ++ C + + S + N G ++ T+ DD + + Y Sbjct: 1 MNIVCAADSGYVQHCSVMLISFFENNPGEEHAVYLLTEGLDLDDLDFIQKIVHSYNGHFF 60 Query: 89 IYLINGDRLRSLP--STKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINF 146 ++ L P ST + + A Y R +AD KVLYLD DII +I+ L Sbjct: 61 YCQVDFKFLEKCPIKSTDHLSIATYNRLFMADLLPADVNKVLYLDCDIIVNQSIKELWET 120 Query: 147 SFPDDKVAMVVTEGQA---DWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARA 203 D+ V E D +E+ + GYFN+G LL+N W ++ Sbjct: 121 PLRDNFVVAAFEERGCCAEDVYERLDYDSKY-----GYFNAGVLLVNLDYWRTHNMTQAF 175 Query: 204 IAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNY 247 I + +K+ DQDVLN DK + + +N +F Y Sbjct: 176 IEYIEHN--FEKLRAHDQDVLNAFFYDKSVHISLAWNVEFIFYY 217 >UniRef50_B4RJG6 LgtC n=29 Tax=Neisseria RepID=B4RJG6_NEIG2 Length = 307 Score = 71.6 bits (174), Expect = 4e-11, Method: Compositional matrix adjust. Identities = 68/315 (21%), Positives = 126/315 (40%), Gaps = 42/315 (13%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 +DI + D N+ ++ S+ + + + FH+ +++R A I+ Sbjct: 1 MDIVFAADDNYAAYLCVAAKSVEAAHPDTEIRFHVLDAGISEENRAAVAANLRGGGGNIR 60 Query: 89 IYLINGDRLRSLP-STKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 +N + P + ++ + Y R + +Y I KVLYLD D++ + ++PL + Sbjct: 61 FIDVNPEDFAGFPLNIRHISITTYARLKLGEY-IADCDKVLYLDTDVLVRDGLKPLWDTD 119 Query: 148 FPDDKVAMVV---TEGQADWWEKRAHSLGVAGIAKG--YFNSGFLLINTAQWAAQQVSAR 202 + V + E Q + +K G+A G YFN+G LLIN +W + Sbjct: 120 LGGNWVGACIDLFVERQEGYKQK-------IGMADGEYYFNAGVLLINLKKWRRHDIFKM 172 Query: 203 AIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTI 262 + + + + + + + DQD+LN L + +A+ ++N NY + D + Sbjct: 173 SCEWVEQYKDV--MQYQDQDILNGLFKGGVCYANSRFNF-MPTNYAFMANGFASRHTDPL 229 Query: 263 FI------------HYIGPTKPWHD----WAWDYPVSQAFMEAKNASPWKNTALLKPNNS 306 ++ HY G KPWH W + A W+ + P Sbjct: 230 YLDRTNTAMPVAVSHYCGSAKPWHRDCTVWGAERFTELAGSLTTVPEEWRGKLAVPPT-- 287 Query: 307 NQLRYSAKHMLKKHR 321 KHML++ R Sbjct: 288 -------KHMLQRWR 295 >UniRef50_C6IB51 Glycosyltransferase n=4 Tax=Bacteroides RepID=C6IB51_9BACE Length = 417 Score = 71.6 bits (174), Expect = 4e-11, Method: Compositional matrix adjust. Identities = 60/227 (26%), Positives = 102/227 (44%), Gaps = 22/227 (9%) Query: 59 LCFHIFTDYFGDDDRKYFDALALQYK-TRIKIYLINGDRLRSLPSTKNW-THAIYFRFVI 116 + +I TDY + +++ + + I+ +I+ + + L + T +R+ I Sbjct: 3 ISIYILTDYISLESKEFLQEIKNVFTCVTIQWEIIDSESFKQLKKKGGYITEHTLYRYAI 62 Query: 117 ADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAG 176 AD F N K LYLDAD++ G+IEPL A G D + +R + + Sbjct: 63 ADLFPN-LDKALYLDADLVINGSIEPLWELDLEGYYCA-----GVDDIFIRRINYRKILE 116 Query: 177 IAKG--YFNSGFLLINTAQWAAQQVSARAIAMLNEPEI-IKKITHPDQDVLNMLLADKLI 233 +A+ Y N+G LL+N ++ + +L I I + + DQD +N + K+ Sbjct: 117 LAEKDVYINAGVLLLNLKDLRKDKIQEK---LLQHTSIYINRDRYQDQDAINCICKGKIK 173 Query: 234 FADIKYNTQFSLNYQLKESFINP-VTNDTIFIHYIGPTKPWH-DWAW 278 Y N+ E+ P + +D I IHY G KPWH ++ W Sbjct: 174 LIPNIY------NFTTSETLHTPEMLSDIIIIHYTGSIKPWHQEYTW 214 >UniRef50_P25148 General stress protein A n=3 Tax=Bacillus subtilis group RepID=GSPA_BACSU Length = 286 Score = 71.6 bits (174), Expect = 4e-11, Method: Compositional matrix adjust. Identities = 59/255 (23%), Positives = 106/255 (41%), Gaps = 14/255 (5%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSR-LCFHIFTDYFGDDDRKYFDALALQYKTRI 87 + I D N+ G S+L + R + ++ D++K + L++ I Sbjct: 7 MHIVSCADDNYARHLGGMFVSLLTNMDQEREVKLYVIDGGIKPDNKKRLEETTLKFGVPI 66 Query: 88 KIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPK-VLYLDADIICQGTIEPLINF 146 + ++ + + + T A Y+R I D +++ K ++Y+D D + I L + Sbjct: 67 EFLEVDTNMYEHAVESSHITKAAYYRISIPDLIKDESIKRMIYIDCDALVLEDISKLWDL 126 Query: 147 SFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAM 206 VA V GQ +R + V K YFNSG ++I+ W Q ++ + I Sbjct: 127 DIAPYTVAAVEDAGQ----HERLKEMNVTDTGK-YFNSGIMIIDFESWRKQNITEKVINF 181 Query: 207 LNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFI-------NPVTN 259 +NE + DQD LN +L D+ ++N Q + +LK N Sbjct: 182 INEHPDEDFLVLHDQDALNAILYDQWYELHPRWNAQTYIMLKLKTPSTLLGRKQYNETRE 241 Query: 260 DTIFIHYIGPTKPWH 274 + +H+ G KPW+ Sbjct: 242 NPAIVHFCGGEKPWN 256 >UniRef50_C3XGD2 Putative uncharacterized protein n=1 Tax=Helicobacter bilis ATCC 43879 RepID=C3XGD2_9HELI Length = 364 Score = 71.2 bits (173), Expect = 5e-11, Method: Compositional matrix adjust. Identities = 62/240 (25%), Positives = 96/240 (40%), Gaps = 32/240 (13%) Query: 60 CFHIFTDYFGDDDRKYFDALALQ----YKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFV 115 CFHI TD + R+ A ++ Y ++Y ++ + LP N + YFR Sbjct: 49 CFHILTDGLKHETRQKLQAFQIELNKIYPCEFRVYTLSDSIFQGLPKLNN-NYLAYFRLK 107 Query: 116 IADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMV-VTEGQADWWEKRAHSLGV 174 IA LYLD D+IC I + +V V + Q KR ++G Sbjct: 108 IASCLPQDIKTCLYLDVDMICVADIREIFYTDLQGKICGVVLVPDHQQYCVLKRNSAIGD 167 Query: 175 AGI--AKGYFNSGFLLINTAQWAAQQVSARAIAMLNE--PEIIKKITHPDQDVLNMLLAD 230 + A YFNSG +LI+ Q+ V + + + P ++ DQD LN +L D Sbjct: 168 EFVFNASTYFNSGLMLIDVEQYRKYNVEQKCLEWFEQYVPVLL------DQDALNAVLGD 221 Query: 231 KLIFADIKYNTQFSLNYQLKESFINP---------------VTNDTIFIHYIGPT-KPWH 274 + +++N L ++ F V N+ +HY G T KPW Sbjct: 222 HICALPLEWNFFVELLKYKRQDFKGKDNNIVMKITYEEYMQVKNNMKILHYTGWTLKPWQ 281 >UniRef50_B8ISQ5 Glycosyl transferase family 8 n=1 Tax=Methylobacterium nodulans ORS 2060 RepID=B8ISQ5_METNO Length = 328 Score = 71.2 bits (173), Expect = 5e-11, Method: Compositional matrix adjust. Identities = 65/255 (25%), Positives = 106/255 (41%), Gaps = 22/255 (8%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALAL----QYKTR 86 +A D+ F +++AS+L L HIF + + D +A Q + Sbjct: 15 VALCIDRAFFRHALVTVASLLDAGPRQPLDVHIF---YAEADPACMARIAALFADQDRHG 71 Query: 87 IKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINF 146 I+ DR P + + Y R ++ Y + + KVLYLDAD+I + PL Sbjct: 72 CHFQKISLDRFEGFPVSDAISAGTYARLLLP-YLMPRRAKVLYLDADLIVLDDVAPLWRT 130 Query: 147 SFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAM 206 VA V D + ++G + + YFN+G LL+N A W + ++ R A Sbjct: 131 ELGAAPVAAV-----RDPFCDNRPAIGFSP-DEPYFNAGVLLMNLAVWRREGLAERVAAH 184 Query: 207 LNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSL------NYQLKESFINPVTND 260 ++ + + DQD LN++L + F D ++N Q + + + Sbjct: 185 IDAHG--ASLKYFDQDALNVVLRGRARFVDPRWNFQPRMADATPADIACARAEFRRTRAR 242 Query: 261 TIFIHYIGPTKPWHD 275 IHY P KPW D Sbjct: 243 PAIIHYTTPHKPWKD 257 >UniRef50_C5ZV11 Glycosyl transferase n=2 Tax=Helicobacter canadensis MIT 98-5491 RepID=C5ZV11_9HELI Length = 397 Score = 71.2 bits (173), Expect = 5e-11, Method: Compositional matrix adjust. Identities = 62/271 (22%), Positives = 114/271 (42%), Gaps = 33/271 (12%) Query: 30 DIAYGTDKNFLFGCGISIASILKYNE---GSRLCFHIFTDYFGDDDRKYFDALALQ---- 82 ++ ++N++ + I SI++ + G FH+ D ++ K + L + Sbjct: 3 NVVLNLNENYVPYAAVLITSIIQNTQSSGGGGYNFHLLMDSISQENTKNLENLISELSKI 62 Query: 83 YKTRIKIYLINGDRLR--SLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTI 140 Y + IY+++ R S+P T N + Y+R I + +YLD D+I G + Sbjct: 63 YPCTLTIYILDDQLFREYSMP-TLNGNYLAYYRLKIGSALPLSIKRCVYLDVDMIVLGDL 121 Query: 141 EPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVS 200 L +V+ ++ + + I YFNSG LL++ W + + Sbjct: 122 RELFEVDLQGKICGVVMEHHSQKIYKPKNQAYKPINITGSYFNSGMLLVDLDLWRQENIE 181 Query: 201 ARAIAMLNEPEIIKKITHP--DQDVLNMLLADKLIFADIKYNTQFSLNYQ---------- 248 RA EI K + DQD+LN++L+ K I++N + Y+ Sbjct: 182 DRAF------EIGKNYHYSFHDQDILNIVLSGKTHKVGIEWNLMVCVYYRAICKDEKGRD 235 Query: 249 ----LKESFINPVTNDTIFIHYIGPTKPWHD 275 ++ F + + N I +HY TKPW++ Sbjct: 236 KLPYYRKDFNSALRNPKI-LHYFTHTKPWNN 265 >UniRef50_C0Z4I4 Putative uncharacterized protein gspA n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0Z4I4_BREBN Length = 264 Score = 70.5 bits (171), Expect = 8e-11, Method: Compositional matrix adjust. Identities = 59/249 (23%), Positives = 105/249 (42%), Gaps = 14/249 (5%) Query: 28 CLDIAYGTDKNFLFGCGISIASILKYNEGSR--LCFHIFTDYFGDDDRKYFDALALQYKT 85 + I + F + + S+ + N+ S+ + H+ +++ ++ Sbjct: 3 TIHIVTAVNDGFAIHLAVMLYSLFE-NKVSKNPVIVHVIDSQVSGENKSILTKTVKRFHA 61 Query: 86 RIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLIN 145 +IK I+ + T Y R I D + KV+YLD+DI+ + I PL N Sbjct: 62 QIKYVTIDPTLYDGFLVRDHLTQETYHRISIPDLLDKEVEKVIYLDSDIVIKKDITPLWN 121 Query: 146 FSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIA 205 +A V+ Q K H+ YFN+G L++N +W ++ + + Sbjct: 122 TKVDQYYLAAVMDSWQG--LNKLRHADLAIPDDCDYFNAGVLVMNLKKWREHNITKKIMD 179 Query: 206 MLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIH 265 + + + I I +P QD +N +L D + D K+ NYQ K + + + D IH Sbjct: 180 YMKKNQGI--IRYPSQDPMNAILHDNWLQLDTKW------NYQSKHLYKSNLRIDPAIIH 231 Query: 266 YIGP-TKPW 273 Y G +KPW Sbjct: 232 YTGEDSKPW 240 >UniRef50_B1I7M9 Glycosyl transferase, family 8 n=16 Tax=Streptococcus pneumoniae RepID=B1I7M9_STRPI Length = 406 Score = 70.1 bits (170), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 58/211 (27%), Positives = 95/211 (45%), Gaps = 38/211 (18%) Query: 85 TRIKIYL--INGD----RLRSLPSTKNWT-------HAIYFRFVIADYFINKAPKVLYLD 131 ++I+IYL + GD +L NW+ H + R+ I D+ KVLYLD Sbjct: 49 SQIRIYLQEMGGDLIDCKLIGSQFQMNWSNKLPHINHMTFARYFIPDFVTED--KVLYLD 106 Query: 132 ADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINT 191 +D+I G + L ++ +A A S AG+ FN+G LLIN Sbjct: 107 SDLIVTGDLTDLFELDLGENYLAA-------------ARSCFGAGVG---FNAGVLLINN 150 Query: 192 AQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNY---Q 248 +W ++ + + I + + + + DQ +LNML D+ + +YN Q +Y Sbjct: 151 KKWGSETIRQKLIDLTEKEH--ENVEEGDQSILNMLFKDQYSSLEDQYNFQIGYDYGAAA 208 Query: 249 LKESFI--NPVTNDTIFIHYIGPTKPWHDWA 277 K FI P+ + +HYI KPW+ ++ Sbjct: 209 FKHQFIFDIPLEPLPLILHYISQDKPWNQFS 239 >UniRef50_A7H2M2 Glycosyl transferase family 8 n=3 Tax=Campylobacter jejuni RepID=A7H2M2_CAMJD Length = 381 Score = 69.7 bits (169), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 63/258 (24%), Positives = 105/258 (40%), Gaps = 36/258 (13%) Query: 46 SIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQ----YKTRIKIYLINGDRLRSLP 101 S++ ++ FHI +D+ + + L Q Y +I ++++N D + + Sbjct: 32 SMSEFCNFDTDEGYVFHILSDHISESMKVRISNLEKQLNDIYPCKIVLHILNDDEFKGML 91 Query: 102 STKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQ 161 + + Y+R +A LYLD D++C G + L++ + + A+ + Sbjct: 92 KWRG-NYLAYYRIKMASVLPQNLKICLYLDCDMLCFGDLRELLSVDINNYQAAVCLDGNN 150 Query: 162 ADWWEKRAHSLG------VAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKK 215 +K SL + I K YFNSGF+L+N +W + ++I L + K Sbjct: 151 HKKNKKVFFSLKGREKYKFSNIEK-YFNSGFILVNLDRWRRDNIENKSIDFLKKF----K 205 Query: 216 ITHPDQDVLNMLLADKLIFAD-----IKYNTQFSLNYQ---------------LKESFIN 255 +PDQD LN L D L+ + + Y F N Q K F N Sbjct: 206 TLYPDQDALNFALNDTLLLPNRWNFSLGYFVAFLKNSQEILFLNQTKYPHLNYTKTEFEN 265 Query: 256 PVTNDTIFIHYIGPTKPW 273 V N I + P KPW Sbjct: 266 EVKNIKIAHFILDPFKPW 283 >UniRef50_C2HBB8 Family 8 glycosyltransferase n=13 Tax=Lactobacillales RepID=C2HBB8_ENTFC Length = 300 Score = 69.3 bits (168), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 55/223 (24%), Positives = 96/223 (43%), Gaps = 23/223 (10%) Query: 67 YFGDDDRKYFDALALQYKTR-------IKIYLINGDRLRSLPSTKNWTHAIYFRFVIADY 119 Y DDD + L++ + ++ IN + ++ + Y+R I + Sbjct: 40 YVIDDDIDFESKQLLRFSVKNARMNSDVEFLKINKEFFTNVVISDRIPETAYYRIAIPEL 99 Query: 120 FI-NKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIA 178 F + ++LY+D D+I I L F D VA V G + +R + + + Sbjct: 100 FRGTEVERILYMDCDMIALQDISKLWRLDFGDSIVAAVEDAG----FHQRLEKMEIPAKS 155 Query: 179 KGYFNSGFLLINTAQWAAQQVSARAIAML-NEPEIIKKITHPDQDVLNMLLADKLIFADI 237 YFNSG +LIN +W + ++ + + + + PE K+ DQD LN +L D+ + Sbjct: 156 MRYFNSGLMLINVKKWLDENITQKVLDFIEHNPE---KLRFHDQDALNAILHDRWLPLHP 212 Query: 238 KYNTQFSLNYQLK-------ESFINPVTNDTIFIHYIGPTKPW 273 ++N Q + + K E N+ IH+ G KPW Sbjct: 213 RWNAQGYIMAKAKKHPTAAGEREYEETRNNPYIIHFSGHVKPW 255 >UniRef50_C4VEI8 General stress protein A n=24 Tax=Enterococcus RepID=C4VEI8_ENTFA Length = 303 Score = 68.9 bits (167), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 60/260 (23%), Positives = 111/260 (42%), Gaps = 25/260 (9%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYN-EGSRLCFHIFTDYFGDDDRK--YFDALALQYKT 85 L I + NF+ SIL+ + + + F++ D + ++ YF Q Sbjct: 10 LAIVSCCNTNFVPHLAAMFVSILENSPSAAAVHFYVIDDNINFESKQLLYFTIKHTQLNA 69 Query: 86 RIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFI-NKAPKVLYLDADIICQGTIEPLI 144 + + IN +++ +++ Y+R I + F ++ ++LY+D D+I + L Sbjct: 70 ELTFFKINPHFFKNVVTSERIPKTAYYRIAIPELFRGSQIERLLYMDCDMIALDDVAKLW 129 Query: 145 NFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAI 204 ++ +A V G + +R + + + YFNSG LLI+ +W V+ + + Sbjct: 130 TVDLGENIIAAVEDAG----FHQRLEKMAIPAESMCYFNSGLLLIDVKKWLNLDVTTKVL 185 Query: 205 AMLNE-PEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINP------- 256 + E P+ K+ DQD LN +L D+ K+N Q Y L ++ +P Sbjct: 186 RFIEENPD---KLRFHDQDALNAVLHDRWTLLHPKWNAQ---GYILSKAKKHPTIYGEKQ 239 Query: 257 ---VTNDTIFIHYIGPTKPW 273 IH+ G KPW Sbjct: 240 YEETRRAPSIIHFTGHVKPW 259 >UniRef50_A3V3C9 Putative uncharacterized protein n=1 Tax=Loktanella vestfoldensis SKA53 RepID=A3V3C9_9RHOB Length = 324 Score = 68.6 bits (166), Expect = 3e-10, Method: Compositional matrix adjust. Identities = 60/249 (24%), Positives = 102/249 (40%), Gaps = 25/249 (10%) Query: 94 GDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQG-TIEPLINFSFPDDK 152 G+ +P +K ++ A Y R + + F + ++ YLDAD+ G I+ + Sbjct: 75 GNAFDGMPVSKRFSLAAYLRIALPEAFAGQYDRIFYLDADVFVVGDAIDAVFRLDMLSCP 134 Query: 153 VAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARA--IAMLNEP 210 V V + K G+ YFNSG +L + ++ +V R A + Sbjct: 135 VGAVTDITKLKHPNKPTFDQKALGVDGPYFNSGVMLFDVERFITMRVRERCAEAAKFYQG 194 Query: 211 EIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIGPT 270 E I + DQ +LN++L + ++ +N Q+ + L E FI D +H+IG Sbjct: 195 EPI----YFDQTLLNIVLQKEWAQLNLGWNWQWPFSRSLFECFI-----DVQIVHFIGDD 245 Query: 271 KPWHDWAWDYPV----------SQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKKH 320 KPW D P+ + + E P + AL N Y +H+ K H Sbjct: 246 KPWSDHKRRLPLKYRETARRFFQKFYPELAQKIPAADAAL---RNGALYHYFFRHITKIH 302 Query: 321 RYLKGFSNY 329 + K F+ + Sbjct: 303 LFTKCFNRH 311 >UniRef50_C2HBB9 Family 8 glycosyltransferase n=35 Tax=Lactobacillales RepID=C2HBB9_ENTFC Length = 305 Score = 68.6 bits (166), Expect = 3e-10, Method: Compositional matrix adjust. Identities = 60/269 (22%), Positives = 117/269 (43%), Gaps = 19/269 (7%) Query: 31 IAYGTDKNFLFGCGISIASILK-YNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTR--I 87 + +D+N+ + IA+ L+ N+ R+ F++ D + ++ + +Y + I Sbjct: 30 VVTASDENYAPYLSVMIATALENCNKARRIKFYVIDDGLSEYSKQGLEETVNKYSSNASI 89 Query: 88 KIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKA-PKVLYLDADIICQGTIEPLINF 146 + + D + + T Y R + + + KVLYLD+D++ I L + Sbjct: 90 QFLTVEKDIYEDFLVSDHITTTAYLRISLPNLLAKEDYKKVLYLDSDVLVLDDIVKLYDE 149 Query: 147 SFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAM 206 + ++ GQ E+ LG+ YFNSG ++I+ QW ++++ + I Sbjct: 150 PLNGKTIGAIIDPGQVKALER----LGIDS-DDLYFNSGVMVIDIDQWNKKEITEKTIHY 204 Query: 207 LNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLK-------ESFINPVTN 259 L+E +I + DQD LN +L + K+N Q SL ++ E Sbjct: 205 LSENG--DRIIYHDQDALNAVLYEDWEQLHPKWNMQTSLIFERHPAPNEKYERLYKEGNE 262 Query: 260 DTIFIHYIGPTKPWHDWAWDYPVSQAFME 288 +H+ G KPW+ D+P + +++ Sbjct: 263 KPSIVHFTGHDKPWNTLK-DHPYTNLYLK 290 >UniRef50_Q50FU8 Cj81-079 (Fragment) n=6 Tax=Campylobacter jejuni RepID=Q50FU8_CAMJE Length = 333 Score = 68.6 bits (166), Expect = 4e-10, Method: Compositional matrix adjust. Identities = 63/220 (28%), Positives = 96/220 (43%), Gaps = 22/220 (10%) Query: 27 LCLDIAYGTDKNFLFGCGISIASILKYNE-GSRLCFHIFTDYFGDDDRKYFDALALQ--- 82 L +I D N++ + IASI+K + S+L + + Y +D + L L+ Sbjct: 3 LSYNIVISCDNNYVKYVAVVIASIIKNTKINSQLKEYPYKFYILSNDISKNNILKLKKLI 62 Query: 83 -------YKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADII 135 Y + I+ I+ + P + HA Y+RF IAD I + K LYLDAD++ Sbjct: 63 QHLSNSYYNCELIIHKIDDSKFHRFPKAWHVNHATYYRFEIAD--IVEGNKCLYLDADVL 120 Query: 136 CQGTIEPLINFSFPDDKVAMVVTEGQADWWEK-----RAHSLGVAGIAKGYFNSGFLLIN 190 G I L ++KVA VVT+ + W K S + YFN+G +LI+ Sbjct: 121 VCGDIRELFYMEL-NNKVAGVVTDSCSRLWTKLYTKDNKTSSYIEFDPLMYFNAGVILID 179 Query: 191 TAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLAD 230 QW + + I N I DQ LN+ L + Sbjct: 180 LNQWKKHDIKNKCIDAFN---IYDHGGLADQSYLNIALKE 216 >UniRef50_UPI0001AEC697 glycosyl transferase family protein n=1 Tax=Alteromonas macleodii ATCC 27126 RepID=UPI0001AEC697 Length = 361 Score = 68.2 bits (165), Expect = 4e-10, Method: Compositional matrix adjust. Identities = 69/276 (25%), Positives = 123/276 (44%), Gaps = 24/276 (8%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 + +A+ D F +S+ SIL N S + + F + R+ L L+ Sbjct: 2 ISVAFCIDDKFAPYAAVSVISILS-NTKSFVNIY-FIGNLSEGVREKL--LTLKNDRSAM 57 Query: 89 IYLINGDRLRSLPSTKNWTHAI----YFRFVIADYFINKAPKVLYLDADIICQGTIEPLI 144 +++ + L ++P + + + + R+ IA+ + K KV+YLDAD++ G I+ L Sbjct: 58 VFVAHNLPLSTMPLSDRYVERLNKITFVRYAIAE-VLTKLDKVIYLDADVLVCGDIKRLW 116 Query: 145 NFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAI 204 V V+ +KR +L + +K YFN+G LL++ W +++ Sbjct: 117 EQPLKKSYVGAVLDHSLMS--QKRHITLSLK--SKSYFNAGVLLVDLKIWRDRRIFQYLS 172 Query: 205 AMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFI 264 N E + + DQDVLN++L +K+ + N Q Y LK I + + + Sbjct: 173 RTHNTRE---RWEYNDQDVLNVVLDEKVQYLGADMNVQ---TYSLKHINI----KEPLIV 222 Query: 265 HYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTAL 300 H+ G KPWH + +P + + P+KN L Sbjct: 223 HFTGQEKPWHTSSV-HPYKDQYRVLLESVPFKNNKL 257 >UniRef50_C0EQT1 Putative uncharacterized protein n=1 Tax=Neisseria flavescens NRL30031/H210 RepID=C0EQT1_NEIFL Length = 212 Score = 68.2 bits (165), Expect = 4e-10, Method: Compositional matrix adjust. Identities = 57/204 (27%), Positives = 88/204 (43%), Gaps = 26/204 (12%) Query: 127 VLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGI--AKGYFNS 184 VLYLD D++C G I L ++ + + + L V G YFNS Sbjct: 14 VLYLDTDVLCLGDISELF--------TVILAAVPETTLYRAYINKLNVFGFRSTDPYFNS 65 Query: 185 GFLLINTAQWAAQQVSARAIAMLNEPEIIKKI-THPDQDVLNMLLADKLIFADIKYNTQF 243 G LL N W + + E+ K I PDQD+LN+ K+ + YN Sbjct: 66 GVLLFNNKFWNESSAYTVLNEKIRQVELSKFILACPDQDLLNLSCKGKVGWLPESYN--- 122 Query: 244 SLNYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLK- 302 +++ + S +N + +H+IG TKPWH + +PV +F SPW N L + Sbjct: 123 RIHWHHQGSELNTNPKNIRLVHFIGGTKPWHHLGF-HPVYDSFYR---KSPWYNGYLHQK 178 Query: 303 -------PNNSNQLRYSAKHMLKK 319 PN + + +AK + K+ Sbjct: 179 PNIDLPFPNPHKRYKQAAKRLFKQ 202 >UniRef50_B1MX28 Lipopolysaccharide biosynthesis protein, LPS:glycosyltransferase n=2 Tax=Leuconostoc RepID=B1MX28_LEUCK Length = 283 Score = 67.8 bits (164), Expect = 5e-10, Method: Compositional matrix adjust. Identities = 60/253 (23%), Positives = 110/253 (43%), Gaps = 12/253 (4%) Query: 28 CLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRI 87 ++I D+N++ + + S+ + N + + D+ + Q + Sbjct: 11 SVNILITIDENYIKPLRVLLYSLRQTNPRENMTIWLAHDHIEVAQLEKLHQFVAQLGFVL 70 Query: 88 KIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 ++ S P+ K + +YFR + Y +V+YLD DI+ I PL N Sbjct: 71 HTIKVDTSLWASAPTFKQYPPEMYFRLLCGQYLPKTLHRVIYLDPDILVINPIRPLANMP 130 Query: 148 FPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAML 207 +A G + H G + YFNSG +L++ Q+V +AI + Sbjct: 131 LKGQMLAASSHMGLTGISQTINHL--RLGTRQVYFNSGVMLMD-LDMMRQRVDMKAILSV 187 Query: 208 NEPEIIKKITHPDQDVLNMLLADKLIFADIK---YNTQFSLNYQLKESF----INPVTND 260 + + K++ PDQD+LN L D+++ + Y+T+ ++ + K SF + V + Sbjct: 188 IQ-QYGKELILPDQDILNYLYGDEILSLPEEIWNYDTRDNIMHYAK-SFGSVDMRWVMEN 245 Query: 261 TIFIHYIGPTKPW 273 T+ +HY G KPW Sbjct: 246 TVILHYCGRPKPW 258 >UniRef50_A7H2X4 Glycosyl transferase family 8 n=2 Tax=Campylobacter RepID=A7H2X4_CAMJD Length = 497 Score = 67.8 bits (164), Expect = 5e-10, Method: Compositional matrix adjust. Identities = 56/204 (27%), Positives = 93/204 (45%), Gaps = 19/204 (9%) Query: 60 CFHIFTDYFGDDDRK---YFDALALQYKTRIKIYLINGDRLRSLPSTKNWTH--AIYFRF 114 CFHIFT+Y +D K L+ Y T+ I+++N + S W A++++ Sbjct: 51 CFHIFTEYKSEDTEKIALLAHKLSEIYPTKCLIHVMNNQDFQDF-SYPFWCQNAAMFYKI 109 Query: 115 VIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLG- 173 + D + K L++ AD+ G + L D+ +A + D + ++A + Sbjct: 110 KVVD-ILKDVDKCLFIGADLFALGDVRDLFALDLKDNLIAAALDTYNFDGYLRKAKAKNS 168 Query: 174 ----VAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLA 229 V AK Y N+ +LIN +W Q + A+ I LN+ ++ D DV ++ A Sbjct: 169 DEELVFNDAKNYINNDMMLINLKEWRKQNLQAKYIDYLNKYDLA-----GDLDVFPLVCA 223 Query: 230 DKLIFADIKYNTQFSLNYQLKESF 253 K+ KYN F L Y +ESF Sbjct: 224 PKIHILSSKYN--FILGYYTRESF 245 >UniRef50_B9KVD4 Glycosyl transferase, family 8 n=2 Tax=Rhodobacter sphaeroides RepID=B9KVD4_RHOSK Length = 334 Score = 67.4 bits (163), Expect = 6e-10, Method: Compositional matrix adjust. Identities = 61/249 (24%), Positives = 103/249 (41%), Gaps = 13/249 (5%) Query: 59 LCFHIFT-DYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIA 117 L H+ T D +++ ++ ALA I ++ + RL L ++ + A Y RF+ Sbjct: 30 LQVHLLTCDSCPEEEARFRVALAPFAHVGISVHRVPAARLEGLFVDRHLSPAAYLRFLAP 89 Query: 118 DYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWE-KRAHSLGVAG 176 + +VLYLD D+I + L+ VA G D + R +LG+ Sbjct: 90 EVLPEAVQRVLYLDCDLIVLDDVAQLLRLDLQGRAVAAAPDLGWKDAAQAARFRTLGIP- 148 Query: 177 IAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFAD 236 + + Y NSG LL++ +W +S + + + + DQD LN +LAD + D Sbjct: 149 LDRPYVNSGVLLMDLGRWRRDGLSQKLFDYVARHGSL--LLRHDQDALNAVLADDIHLLD 206 Query: 237 IKYNTQFSL-----NYQLKESFINPVT--NDTIFIHYIGPTKPWHDWAWDYPVSQAFMEA 289 ++N Q L L E V D +H+ KPW+ W + + Sbjct: 207 RRWNLQVLLLSPWAKRALPEDRQATVAARRDPAILHFSTADKPWNFRVWTR-RRELYFRF 265 Query: 290 KNASPWKNT 298 + +PW Sbjct: 266 RARTPWSRA 274 >UniRef50_A4UX76 Putative LPS biosynthesis protein n=2 Tax=Lactobacillaceae RepID=A4UX76_9LACO Length = 316 Score = 67.4 bits (163), Expect = 7e-10, Method: Compositional matrix adjust. Identities = 72/264 (27%), Positives = 111/264 (42%), Gaps = 34/264 (12%) Query: 25 ENLCLDIAYGTDKNFLFGCGISIASILKYNEGSR------LCFHIFTDYFGDDDRKYFDA 78 EN + I Y D N+ +S+AS++ + R LC + TD G D Sbjct: 2 ENQTVPIFYAVDDNYAPYLAVSLASLVAHTSPDRHYQVIVLCDDLNTDNQGRLKAFETDN 61 Query: 79 LALQYKTRIKIYLINGDRLRSLPSTKN-------WTHAIYFRFVIADYFINKAPKVLYLD 131 L +Q+ + IN DRL+ + KN +T IYFR IA+ F K K LYLD Sbjct: 62 LKIQFVS------IN-DRLKQEITDKNNKLRSDYFTFTIYFRLFIAELF-PKLDKALYLD 113 Query: 132 ADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGI-AKGYFNSGFLLIN 190 AD + + L + D+ V V E + GI ++ Y SG LL+N Sbjct: 114 ADTVVLKDVGELFDTQLGDNLVGAVPDPFVGHTPETIDYVEQAVGIDSQKYVCSGVLLMN 173 Query: 191 TAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLK 250 A+ + + + +LN+ K PDQD +N + +++ + + ++ Q + Sbjct: 174 LAEMRRLKFAEHFLQLLNKYHF--KCLAPDQDYMNAIARNRIYYLNPSWHIQIT------ 225 Query: 251 ESFINPVTNDTIFIHYIGPTKPWH 274 P D IHY KPW Sbjct: 226 ----TPQDVDPWLIHYNLFAKPWR 245 >UniRef50_B7AUG6 Putative uncharacterized protein n=1 Tax=Bacteroides pectinophilus ATCC 43243 RepID=B7AUG6_9BACE Length = 301 Score = 67.4 bits (163), Expect = 8e-10, Method: Compositional matrix adjust. Identities = 66/261 (25%), Positives = 122/261 (46%), Gaps = 28/261 (10%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHI-FTDYFGDDDRKYFDALALQYKTR- 86 ++I + F+ + + S++K N + H+ +TD RK D++ + + Sbjct: 1 MNILVAMNDAFVKCYQVMLTSLIKNNPDENITVHVPYTDGLS---RKGLDSIKELVRNQS 57 Query: 87 -----IKIYLINGDRLRSLPST--KNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGT 139 ++ Y DRL SL W+ ++FR ++ ++L+LD DII G+ Sbjct: 58 HGSASVREYYFGKDRLGSLDKLPLGMWSVEMFFRIFAQEFIPESEDRILWLDGDIIVNGS 117 Query: 140 IEPLINFSFPDDKVA----MVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWA 195 I+ N F A + ++ G+ ++ +LG + + Y NSG LLIN Sbjct: 118 IKDFYNTDFDSMYYAACEDIAISHGKI---KEEYDNLGWSS-EEIYVNSGVLLINLKALR 173 Query: 196 AQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFAD-IKYNTQFS-LNYQLKESF 253 ++ R A+ E + K+ +PDQ +LN + DK+ FAD +YN Q S +Y+L + Sbjct: 174 NNGIT-RDAAVEYALENMDKLHYPDQYMLNAMFHDKIKFADAFRYNCQVSGYSYKLADM- 231 Query: 254 INPVTNDTIFIHYIGPTKPWH 274 + +++ +H+ G +PW Sbjct: 232 ---ILSESAILHFPG-YRPWQ 248 >UniRef50_A0ZYL4 Putative uncharacterized protein n=1 Tax=Archaeal BJ1 virus RepID=A0ZYL4_9CAUD Length = 286 Score = 67.0 bits (162), Expect = 1e-09, Method: Compositional matrix adjust. Identities = 70/279 (25%), Positives = 125/279 (44%), Gaps = 38/279 (13%) Query: 27 LCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKT- 85 + L++ Y + C IS S+L+ N+ + +I ++ D++ +F+ + Y++ Sbjct: 1 MTLNVCYIAGGDSWVPCYISAYSVLENNQDLDIHMYILSE--EDNNNPFFEHVEYLYESH 58 Query: 86 ---RIKIYLINGDRLRSLPST-KNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIE 141 I+ ++ D+ LP+ K+ + +YF+ I + + VL LDAD IC G++ Sbjct: 59 PSLEIEFIEVDMDQFDDLPAPGKHLSPGVYFKIAI-NRLLPTDGNVLLLDADTICDGSLS 117 Query: 142 PLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSA 201 L++ KV +A+ LG+ + FN+G L +N +WA Q + Sbjct: 118 SLLSLDL-SGKVLAAAPSNKAE-----TVRLGLQN-NRAKFNAGVLYVNLQEWAKQDIEE 170 Query: 202 RAIAML--NEPEIIKKITHPDQDVLNMLL--ADKLIFADIKYNTQFSLNYQLKESFINPV 257 R+ + +EPE+ DQD LN L+ D + + +YN L + + V Sbjct: 171 RSRQYIEEHEPEL------NDQDALNALVNNPDDMEYIHPRYNATKLLVREFEM-----V 219 Query: 258 TNDTIFIHYIGPTKPWH--------DWAWDYPVSQAFME 288 ++ IHY GP KPW D W+Y F + Sbjct: 220 DDEPTIIHYNGPDKPWRFVTERESGDLWWEYASKTPFRD 258 >UniRef50_B8G232 Glycosyl transferase family 8 n=2 Tax=Desulfitobacterium hafniense RepID=B8G232_DESHD Length = 280 Score = 67.0 bits (162), Expect = 1e-09, Method: Compositional matrix adjust. Identities = 54/256 (21%), Positives = 109/256 (42%), Gaps = 13/256 (5%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 ++I + +++ + + S+L N G + ++ +D D + +++ Sbjct: 1 MNILVTLNSSYVKQLMVMLTSLLDSNPGEQFTVYVAHSAMSKEDFARIDQAIDSSRCKVE 60 Query: 89 IYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSF 148 ++ + L P T + +Y+R +Y + ++LYLD D++ ++ L F Sbjct: 61 GIKLSDEGLSKAPITSRYPKEMYYRIFAVNYLPDHLERILYLDPDLVVINPLKELYTIDF 120 Query: 149 PDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLN 208 + A + +K H Y NSG +++N + +Q + Sbjct: 121 QGNFFA--AASHVKELLKKLNHVRLNMAEDSTYVNSGVMMMNLSLLRQEQDVHEVYQYIE 178 Query: 209 EPEIIKKITHPDQDVLNMLLADKLIFADIK-------YNTQFSLNYQLKESFINP--VTN 259 E + ++ PDQDVLN + +D+ + D K Y ++LN + ++ I+ V + Sbjct: 179 EYK--HRLFLPDQDVLNGVYSDRTLTVDAKIYNLSERYYALYNLNPKYWDAKIDLDWVRS 236 Query: 260 DTIFIHYIGPTKPWHD 275 +T IHY G KPW D Sbjct: 237 NTAIIHYCGRNKPWKD 252 >UniRef50_Q16CW9 Lipopolysaccharide 1,3-galactosyltransferase, putative n=7 Tax=Rhodobacteraceae RepID=Q16CW9_ROSDO Length = 329 Score = 66.6 bits (161), Expect = 1e-09, Method: Compositional matrix adjust. Identities = 50/184 (27%), Positives = 80/184 (43%), Gaps = 9/184 (4%) Query: 94 GDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGT-IEPLINFSFPDDK 152 GD L K TH +Y R + F + K+LYLD+DI QG L + Sbjct: 83 GDVFEGLRLDKGKTHDVYLRIALPTAFAGEYDKILYLDSDIFVQGGDFNALFDIDVAPHC 142 Query: 153 VAMVVTEGQADWWEKRAHSLGVAGI-AKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPE 211 +A V Q +++ + GI YFN+G +L++ + Q++ R + Sbjct: 143 IASVRDNVQWRTPKRQNKRNTIKGIPPSAYFNAGVMLMDVQAYTEQELMRRCVEFGRARR 202 Query: 212 IIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIGPTK 271 + + DQ++ N +L + +N Q+S + +L F P IH+IGP K Sbjct: 203 --RDLKRHDQNLYNAVLQNDWAEISPVWNWQYSWSTRLFAVFAYPN-----IIHFIGPAK 255 Query: 272 PWHD 275 PW D Sbjct: 256 PWKD 259 >UniRef50_B7KM20 Glycosyl transferase family 8 n=1 Tax=Cyanothece sp. PCC 7424 RepID=B7KM20_CYAP7 Length = 347 Score = 66.2 bits (160), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 65/270 (24%), Positives = 110/270 (40%), Gaps = 31/270 (11%) Query: 25 ENLCLDIAYGTDKNFLFGCGISIASILKYNEGSR-LCFHIFTDYFGDDDRKYFDALALQY 83 EN + I G D F G +++ S L + R + +I +R + Sbjct: 9 ENEPITIVSGADDKFALGLAVTLYSALANLDTKRKIDIYIVDGGINSKNRDKLTQILNSD 68 Query: 84 KTRIKIYLINGDRLRSLPSTK---NWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTI 140 + I + D L L K + YFR ++ + + +V+YLD+D++ +G + Sbjct: 69 LMPVSIKWVKPD-LTVLEGVKLFGSLNVTTYFRLLLPELLPTQVERVIYLDSDLVVEGNL 127 Query: 141 EPLINFSFPD-------DKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQ 193 L + D V V G K LG+A Y N+G +LIN Q Sbjct: 128 ANLWEQELGNCPAVAVQDYVFPYVCNGL-----KTYQQLGLAS-NTPYCNAGVMLINIKQ 181 Query: 194 WAAQQVSARAIAMLNEPEIIKK----ITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQL 249 W + ++ + + E I+K + DQD +N L+A++ D+K+N Q Y Sbjct: 182 WRIEALNRKIL------EYIRKFYDLVYLADQDGINALIANRFKLLDLKWNVQIFGVYNG 235 Query: 250 KESFI---NPVTNDTIFIHYIGPTKPWHDW 276 K + + D +H+ P KPWH + Sbjct: 236 KIDLLCKPKELIRDAFILHFTTPIKPWHPY 265 >UniRef50_B3XL28 Glycosyl transferase family 8 n=1 Tax=Lactobacillus reuteri 100-23 RepID=B3XL28_LACRE Length = 331 Score = 65.1 bits (157), Expect = 3e-09, Method: Compositional matrix adjust. Identities = 66/293 (22%), Positives = 128/293 (43%), Gaps = 31/293 (10%) Query: 30 DIAYGTDKNFLFGCGISIASILKYN-EGSRLCFHIFTDYFGDDDR----KYFDALALQYK 84 +I Y TD F G S+ S+L+ N E ++ F I +++ K D Sbjct: 6 NIVYATDDTFAPVLGTSLLSLLRNNKEAKKINFFILDSGISKENKFRIEKICDNFVNASL 65 Query: 85 TRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLI 144 IKI I+ + + + + + Y R I D N +VLYLD D + +++ L Sbjct: 66 KWIKIESISKKIGIDVKNDRG-SFSQYSRLFIGDVLDNSVERVLYLDCDTLILSSLKDLW 124 Query: 145 NFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAI 204 N + +A + + + ++ K + + + FNSG +LI+ W ++ +AI Sbjct: 125 NIELKGNIIA-ALKDAFSKYYRKNINLVNDDLM----FNSGVMLIDLKAWRDNKIKEKAI 179 Query: 205 AMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQ---FSLNYQLKESFINPVT--- 258 + + + K+ DQ VLN +L++K D +YN + L+Y+ + + +PV Sbjct: 180 SFIRQRH--GKVQQGDQGVLNSVLSNKTFALDPRYNLVSIFYDLDYREIKLYRSPVNFYS 237 Query: 259 --------NDTIFIHYIG---PTKPWHDWAWDYPVSQAFMEAKNASPWKNTAL 300 + + +H+ +PW + ++ + +++ +PWKN L Sbjct: 238 EKIIVKAKENPVILHFTSSFYSIRPWFKNS-NHQCKKIWLKFYQETPWKNQPL 289 >UniRef50_D2RIJ4 Glycosyl transferase family 8 n=2 Tax=Acidaminococcus RepID=D2RIJ4_ACIFE Length = 309 Score = 64.7 bits (156), Expect = 4e-09, Method: Compositional matrix adjust. Identities = 68/289 (23%), Positives = 115/289 (39%), Gaps = 26/289 (8%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSR-LCFHIFTDYFGDDDRKYFDALALQYKTRI 87 + I +D N+ ++ ASIL + G R + F+ F D ++ + A + I Sbjct: 4 ISIVLASDDNYAQHGAVACASILANHRGERPIHFYYFDDGISEEKQAGIAATVTGLQGSI 63 Query: 88 KIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 G +++ ++ + A Y R +I + +V+YLD D++ I+ L Sbjct: 64 TFIPTAGKEIQAH-TSGHVNRAAYLRLLIPELVPQAVHRVIYLDTDLVVLDDIQELWEMD 122 Query: 148 FPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKG--YFNSGFLLINTAQWAAQQVSARAIA 205 V V G R GI +G YFNSG +++ W +Q + I Sbjct: 123 LQGKPVGAVPDLGILASSRMRRQKEETLGIQEGKLYFNSGVMVMELEAWREKQYGDQVIR 182 Query: 206 MLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNT---QFSLNYQ-LKES-----FINP 256 + E H DQD LN + D +++N F+L + LK+S + Sbjct: 183 CVEE----GNFRHHDQDGLNKVFQDNWQPLPLRWNVIPPVFTLPVKVLKKSRWRNLALEA 238 Query: 257 VTNDTIFIHYIGPTKPW--------HDWAWDYPVSQAFMEAKNASPWKN 297 + +F H+ G KPW ++ + Y AF AK P K+ Sbjct: 239 LERPAVF-HWAGRYKPWEFPPKGHFNEKYYTYLARTAFAGAKMPQPGKD 286 >UniRef50_Q3D426 Glycosyl transferase, family 8 n=7 Tax=Streptococcus agalactiae RepID=Q3D426_STRAG Length = 401 Score = 64.7 bits (156), Expect = 5e-09, Method: Compositional matrix adjust. Identities = 58/253 (22%), Positives = 115/253 (45%), Gaps = 23/253 (9%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIY 90 I G D + +I SI+ +N+ L +I F + + Q+ R+K Sbjct: 5 IVLGADFQYRDQVMTTIKSIVSHNQ--HLTIYIINTDFPVEWFNILNHSLEQFDCRVKNI 62 Query: 91 LINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPD 150 I+ D +P+ + + A +FR+ I + + VLYLD+D+I +G+++PL + + + Sbjct: 63 PISSDVFEGIPTLSHISVAGFFRWFIPIHL--EEEIVLYLDSDVIVRGSLDPLFDINLEE 120 Query: 151 DKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEP 210 + + V AD + +L A FNSG +LIN + W +++ + + ++ Sbjct: 121 NLLGAV-----ADHFS----TLYYGDTAPVSFNSGVMLINNSLWKKEEIYNSLMRIADKG 171 Query: 211 EIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVT------NDTIFI 264 + DQ+ LN+L ++ I +YN Q + + ++ P + + + Sbjct: 172 SAVGV---GDQEYLNILTQNRWIDIGKQYNVQIGQDVNIN-AYGRPDLYHFYDDCEPVIV 227 Query: 265 HYIGPTKPWHDWA 277 HY KPW+ ++ Sbjct: 228 HYNSQDKPWNKYS 240 >UniRef50_A8AY72 Glycosyl transferase, family 8 SP1766 n=7 Tax=Streptococcus RepID=A8AY72_STRGC Length = 435 Score = 63.9 bits (154), Expect = 8e-09, Method: Compositional matrix adjust. Identities = 47/188 (25%), Positives = 82/188 (43%), Gaps = 28/188 (14%) Query: 95 DRLRSLPSTKNWTH---AIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDD 151 D LR ++H Y R+ I +Y KA + LYLD D++ ++ L D Sbjct: 100 DDLRMKWEESTYSHINYMAYARYFIPEYV--KADRALYLDCDLVVTQNLDHLFELDLEDY 157 Query: 152 KVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPE 211 +A V + LG+ FNSG +L+N +W + + + + + + Sbjct: 158 YIAAV----------RATFGLGIG------FNSGVMLLNNKRWREENIPQQLVELTDRE- 200 Query: 212 IIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNY---QLKESFI--NPVTNDTIFIHY 266 I+++ DQ +LNML ++ + + YN Q + Q F+ P++ +HY Sbjct: 201 -IERVLEGDQSILNMLFKEQYLELEDSYNFQIGFDMGAAQYGHDFVFDIPLSPLPAIVHY 259 Query: 267 IGPTKPWH 274 I KPW+ Sbjct: 260 ISALKPWN 267 >UniRef50_D2LIH7 Glycosyl transferase family 8 n=1 Tax=Rhodomicrobium vannielii ATCC 17100 RepID=D2LIH7_RHOVA Length = 391 Score = 63.2 bits (152), Expect = 1e-08, Method: Compositional matrix adjust. Identities = 69/269 (25%), Positives = 112/269 (41%), Gaps = 41/269 (15%) Query: 36 DKNFLFGCGISIASILKYNEGSRLC-FHIFTDYFGDDDRKYFDALALQYKTRIKIYLING 94 ++ ++ G IASI ++ +RL +F D +DR + + ++ + Sbjct: 51 NRRYMPGGAALIASIAEHASPNRLYDLIVFADDLASEDRDMLRNVCDKPNISLRFF---- 106 Query: 95 DRLRSLPSTKNWTHAIYFRFVIADYFINKAP-------KVLYLDADIICQGTIEPLINFS 147 D R TH F F +++ K P KV+Y+DAD I + L + Sbjct: 107 DVSRCFDGINFITH---FHFRKENFYRLKIPDLMRDFDKVVYIDADTITNRDLADLYDID 163 Query: 148 FPD------DKVAMVVTE--------GQADWWEKRAHS-LGVAGIAKGYFNSGFLLINTA 192 AM+ T+ G+ ++E LG+ GI+ YFNSG +L N Sbjct: 164 VDGYYIAAVRDFAMIATQNKKMLDIVGKKIYYETYVKDYLGLIGISN-YFNSGLVLFNIN 222 Query: 193 QWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLN--YQLK 250 + Q+S R IA++ K + DQD+LN++ +K+ D +N Y L Sbjct: 223 KINGSQISERLIALIG----TKLFAYVDQDILNIVFENKVKLIDYSWNMVIDCERLYHLS 278 Query: 251 ESFINPVTNDTI----FIHYIGPTKPWHD 275 E + D +HYIG KPW+D Sbjct: 279 EPDLYARYLDAGAAPHVVHYIGGNKPWND 307 >UniRef50_Q5WI33 Lipopolysaccharide glycosyltransferase n=6 Tax=Firmicutes RepID=Q5WI33_BACSK Length = 274 Score = 62.8 bits (151), Expect = 2e-08, Method: Compositional matrix adjust. Identities = 47/180 (26%), Positives = 80/180 (44%), Gaps = 10/180 (5%) Query: 101 PSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEG 160 P K+++ +Y+R + + + ++LYLD DI+ I PL + D + Sbjct: 73 PVVKHYSSEMYYRLLAYRFLPTELDRILYLDPDILVLNPIRPLYEANI-DSYLYAAAQHS 131 Query: 161 QADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPD 220 + E L + Y+NSG LL+N A+ A + ++ PD Sbjct: 132 FINIQEINKFRLNAYEM-DAYYNSGVLLMNLAKQRETMDINDIFAYVETYR--NRLVLPD 188 Query: 221 QDVLNMLLADKLIFADIK---YNTQFSLNYQLKESF---INPVTNDTIFIHYIGPTKPWH 274 QDVLN L + ++ D + Y+ ++ Y+LK I+ V T+ +H+ G KPWH Sbjct: 189 QDVLNALYSPQIKNVDERLYNYDARYYRYYKLKSGGRFDIDAVLQQTVILHFCGKKKPWH 248 >UniRef50_C6LDU2 Glycosyl transferase, family 8 n=3 Tax=Firmicutes RepID=C6LDU2_9FIRM Length = 270 Score = 62.8 bits (151), Expect = 2e-08, Method: Compositional matrix adjust. Identities = 58/234 (24%), Positives = 97/234 (41%), Gaps = 11/234 (4%) Query: 47 IASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNW 106 I SI+++ +I + D+ A TR+ + S P ++ + Sbjct: 8 IRSIVRFPSEDGYDIYILHSDLQEQDQSDAAAQVEDGDTRLHFRFVEPSVFASFPESERY 67 Query: 107 THAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWE 166 IY+R A + ++LYLD D + ++ L N F + + T + + Sbjct: 68 PRLIYYRIFAASLLPPEMDRILYLDGDTLVINPLDELYNMDF-EGNYFLACTHVRKFLTK 126 Query: 167 KRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNM 226 + LG+ ++ Y NSG LL+N + +Q IA E + +T PDQD++ Sbjct: 127 VNQYRLGMEEVST-YINSGVLLMNLKELREKQ-DFEEIASFVEKR-GRYLTLPDQDIITA 183 Query: 227 LLADKL-IFADIKYNTQFSL----NYQLKESFINP--VTNDTIFIHYIGPTKPW 273 L +K I +KYN + N + IN V + + IHY G KPW Sbjct: 184 LYGNKTGILDTMKYNLSDRMISVYNTEPGHKRINLEWVRENAVVIHYYGKQKPW 237 >UniRef50_C6X2V2 Putative glycosyltransferase n=1 Tax=Flavobacteriaceae bacterium 3519-10 RepID=C6X2V2_FLAB3 Length = 315 Score = 62.4 bits (150), Expect = 2e-08, Method: Compositional matrix adjust. Identities = 78/314 (24%), Positives = 137/314 (43%), Gaps = 48/314 (15%) Query: 29 LDIAYGTDKNFLFGCGISIASIL-KYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRI 87 L I + D ++ + I+SI+ + ++ +I ++Y D+++ + +Q K+ I Sbjct: 9 LPIVFTCDDHYFKYAAVVISSIIHNSSRNTKYEINIVSEYISDENQSLAQKM-VQSKSNI 67 Query: 88 KI--YLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLIN 145 I + I + + + Y+RF I D + +VLYLD+D+I I + Sbjct: 68 SIQFHAIKIENPEVFHLNSYMSLSTYYRFFIFD-LLKDYDRVLYLDSDLIVDNDISFFAD 126 Query: 146 FSFPDDKVAMVV--------TEGQADWWEKRAHSLGVAGIAK--GYFNSGFLLINTAQWA 195 F ++K A+ + D R + + ++ YFN+G +L N Sbjct: 127 IDF-ENKPAICCPSIYVQNSLKNNTDHKFTREYFTQILKMSDVDEYFNAGVILFNIKLIR 185 Query: 196 AQQVSARAIAMLNEPEIIKKITHP---DQDVLNMLLAD----KLIFADIKYNTQFSLNYQ 248 AQ + + E IK I P DQD+LN +L + KLI + YN ++ + Sbjct: 186 AQGIDRKFF------EAIKNIKDPVYQDQDILNSVLRNNGGAKLISNE--YNHTKTMKFS 237 Query: 249 LKESFINPVTND---------TIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTA 299 LK F+N + N TI+ HY+G KPW ++ P S F+ +P+ Sbjct: 238 LKRIFLNALKNKFGKKRNNWFTIY-HYVGKVKPWQNFN---PDSALFLYYAYKTPFVREI 293 Query: 300 LLKPNNSNQLRYSA 313 L SN+L+ S+ Sbjct: 294 L----KSNRLKLSS 303 >UniRef50_C1CFZ1 Glycosyl transferase, family 8 n=17 Tax=Streptococcus pneumoniae RepID=C1CFZ1_STRZJ Length = 404 Score = 62.4 bits (150), Expect = 3e-08, Method: Compositional matrix adjust. Identities = 64/257 (24%), Positives = 112/257 (43%), Gaps = 41/257 (15%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTR---- 86 I + D +++ +I SI YN L F++F D D ++F + + KT Sbjct: 6 IVFNADNDYVDKLETAIKSICCYNNC--LKFYVFND---DIASEWFLMMNKRLKTIQSEI 60 Query: 87 IKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINF 146 + + +++ + KN ++A +FR+ I ++ K + LYLD+DII G+++ L F Sbjct: 61 VNVKIVDHVLKKFHLPLKNLSYATFFRYFIPNFV--KESRALYLDSDIIVTGSLDYL--F 116 Query: 147 SFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAM 206 D A+ E + FNSG LL+N W + ++ + + Sbjct: 117 DIELDGYALAAVED------------SFGDVPSTNFNSGMLLVNVDTWRDEDACSKLLEL 164 Query: 207 LNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSL--------NYQLKESFINPVT 258 N+ + + DQ +LNML D+ D +N + N++ E I+ + Sbjct: 165 TNQ---YHETAYGDQGILNMLFHDRWKRLDRNFNFMVGMDSVAHIEGNHKWYE--ISELK 219 Query: 259 NDTI--FIHYIGPTKPW 273 N + IHY G KPW Sbjct: 220 NGDLPSVIHYTG-VKPW 235 >UniRef50_Q65VF6 RfaJ protein n=1 Tax=Mannheimia succiniciproducens MBEL55E RepID=Q65VF6_MANSM Length = 309 Score = 62.0 bits (149), Expect = 3e-08, Method: Compositional matrix adjust. Identities = 64/262 (24%), Positives = 112/262 (42%), Gaps = 25/262 (9%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALAL------Q 82 ++I + D+N+ + I SIL N F+I ++ + L Sbjct: 1 MNIIFNCDENYAPYLSVVIKSILD-NTTLSTQFYILDFNISEESKSCIKNLIQNINKKNS 59 Query: 83 YKTRIKIYLINGDRLRSLPSTKNW-THAIYFRFVIADYFINKAPKVLYLDADIICQGTIE 141 ++ I I+ + + P T ++ + A Y R +ADY +N+ K +YLD DII + Sbjct: 60 FQHSINFIKIDDNDFQCFPQTISYISSATYARLKVADY-LNELNKAIYLDIDIIVISDLS 118 Query: 142 PLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSA 201 L + D+ V + + +G+ ++ Y N+G LL+N + Sbjct: 119 RLWHIDLADNLVGACLDPYIEYENQDYKRKIGLQD-SQPYINAGVLLLNLKALREFNLYQ 177 Query: 202 RAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQ----------LKE 251 +AI + I DQD+LN +L K++F D +YN F++N++ L Sbjct: 178 KAIDWNKD---YPNIQFQDQDILNGVLKGKVLFLDSRYN--FTVNHRNRIKLAHKGKLLL 232 Query: 252 SFINPVTNDTIFIHYIGPTKPW 273 S + T +HY+G KPW Sbjct: 233 SSLEKATKPICILHYVGSHKPW 254 >UniRef50_C3XKY2 Glycosyl transferase n=2 Tax=Campylobacterales RepID=C3XKY2_9HELI Length = 433 Score = 62.0 bits (149), Expect = 3e-08, Method: Compositional matrix adjust. Identities = 60/242 (24%), Positives = 96/242 (39%), Gaps = 15/242 (6%) Query: 11 FLNSVI-DYDHKVETENLCLDIAYGTDKNFLFGC--GISIASILKYNEGSRLCFHIFTDY 67 + SVI + + K+ ++ C + KN F I + + K FHI +D Sbjct: 19 LITSVIYNTNPKLTFKDFCQKEGFKALKNSYFSAYQNIDFSKLSKQEAQEGYIFHILSDS 78 Query: 68 FGDDDRKYF----DALALQYKTRIKIYLINGDRLRSLP--STKNWTHAIYFRFVIADYFI 121 + + L Y I ++IN + P + H Y+R + Y Sbjct: 79 ISSTTQNQLTELQNTLNTIYPCEILTHIINDKEFENFPISGAAHSNHLPYYRLKLDSYLD 138 Query: 122 NKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEG--QADWWEKRAHSLGVAGIAK 179 + K LYLD+D++C + L D VA + G + K + Sbjct: 139 DSITKCLYLDSDMLCLCDLRELFAIDLKDFVVAAINDPGTKKRKIKYKENGKKMILNFND 198 Query: 180 GYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLL-ADKLIFADIK 238 YFNSGFLLINT + ++ + + + IK DQD+LN + +KL+ I Sbjct: 199 NYFNSGFLLINTQNYKQHKIQEKCENLAKKCYYIKA---ADQDLLNATIPKEKLLKLPIA 255 Query: 239 YN 240 YN Sbjct: 256 YN 257 >UniRef50_A5LNA9 Glycosyl transferase, family 8 n=2 Tax=Streptococcus pneumoniae RepID=A5LNA9_STRPN Length = 402 Score = 62.0 bits (149), Expect = 3e-08, Method: Compositional matrix adjust. Identities = 62/256 (24%), Positives = 108/256 (42%), Gaps = 38/256 (14%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYK---TRI 87 I G D N+ +I SI +N L F+IF + D +++F + + + I Sbjct: 7 IVLGADNNYRDKLETTIKSICYHNRD--LKFYIFNE---DIPKEWFYLMEKRLEKLNCEI 61 Query: 88 KIYLINGDRLRSLPST-KNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINF 146 I+ ++++ + ++ + YFR+ IA++ K + +YLD D++ G I PL Sbjct: 62 LNIEIDAEKVKYFSTPDEHIKYMTYFRYFIAEFV--KEDRAVYLDCDMVIHGNINPLFQK 119 Query: 147 SFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAM 206 F + + V D W K FN+G +++N +W + + + Sbjct: 120 DFEGNYIIAV-----PDGW------------YKNIFNAGMMMVNVHKWKTDNICQNLLEL 162 Query: 207 LNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNY-----QLKESFINPVTND- 260 E + + DQ VLN+L +K YN L+ Q E F+N + Sbjct: 163 TAEKH---QEIYGDQGVLNLLFENKWKKVSPHYNFMVGLDTLGYWAQKPEWFLNSWDENY 219 Query: 261 -TIFIHYIGPTKPWHD 275 IH+ G KPW+D Sbjct: 220 KPAIIHFEGKDKPWND 235 >UniRef50_C5SH34 Glycosyl transferase family 8 n=1 Tax=Asticcacaulis excentricus CB 48 RepID=C5SH34_9CAUL Length = 307 Score = 61.6 bits (148), Expect = 4e-08, Method: Compositional matrix adjust. Identities = 62/260 (23%), Positives = 104/260 (40%), Gaps = 29/260 (11%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIY 90 I Y D N+LF +S AS + N S L I D + + + I++ Sbjct: 6 ICYVVDDNYLFPTLVS-ASQARENAPSSLA-DIVILCLSDASDRVRKVMPVAVALGIELI 63 Query: 91 LINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPD 150 + + +L H +Y R I +VLY+D D ++EPL+N P+ Sbjct: 64 EVPTASIENL-------HPMYGRLFIDKLLPKAYERVLYIDGDTQIAASLEPLLNVDIPE 116 Query: 151 DKVAMVVTEGQ-----ADWWEKRAHSLGV-AGIA----KGYFNSGFLLINTAQWAAQQVS 200 K V +D W R V AG+ + Y N+G L+ N WA +++ Sbjct: 117 GKFLAVRDPAAMFAKLSDKWASRIQGERVEAGLGDNPIEDYLNTGVLVFNMKDWA--ELA 174 Query: 201 ARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPV--- 257 + ++ K DQD +N+ + D+ ++ ++N L +E + PV Sbjct: 175 GETLKLIRARSTPFKFG--DQDPMNLAIGDRCLYISNRWNFPGFLIGSGQEERVKPVIYH 232 Query: 258 --TNDTIFIHYIGPTKP-WH 274 +N ++H P P WH Sbjct: 233 FMSNPRPWVHAGAPWGPKWH 252 >UniRef50_C3XFW0 Glycosyl transferase n=1 Tax=Helicobacter bilis ATCC 43879 RepID=C3XFW0_9HELI Length = 365 Score = 61.2 bits (147), Expect = 5e-08, Method: Compositional matrix adjust. Identities = 59/237 (24%), Positives = 89/237 (37%), Gaps = 31/237 (13%) Query: 61 FHIFTDYFGDDDRKYFDALALQ----YKTRIKIYLINGDRLRSLPS---TKNWTHAIYFR 113 FH+ TD + F L Y +I+ ++I+ + + LP + +A Y+R Sbjct: 36 FHVITDSIAKKTLEQFHILQTTLNDIYPCQIEAHIISDEDFKDLPKWGYEEAQQYAAYYR 95 Query: 114 FVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKR----- 168 + D+ K LYLD D++ + L F+ D + G + R Sbjct: 96 VKLVDFLPKNVDKCLYLDTDMLVLTDLREL--FALNLDGYIAASSSGSPNATISRYGIYR 153 Query: 169 ---AHSLGVAGIAKG-YFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVL 224 V YF SG +LINT +W Q V A+ L E E DQD L Sbjct: 154 KKKGGKKAVKSFETSFYFCSGLMLINTKEWIKQNVDIEAMRFLREYE----TEFADQDAL 209 Query: 225 NMLLADKL--------IFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIGPTKPW 273 N + D++ I A S N + + + N I +H GP K W Sbjct: 210 NFAMCDRVYNLGEQWGILAYQSLEAACSTNIDFSKRYEKAMINAKI-LHCNGPAKAW 265 >UniRef50_D1JY84 Putative uncharacterized protein n=1 Tax=Bacteroides sp. 3_1_33FAA RepID=D1JY84_9BACE Length = 312 Score = 60.8 bits (146), Expect = 6e-08, Method: Compositional matrix adjust. Identities = 63/258 (24%), Positives = 108/258 (41%), Gaps = 29/258 (11%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRI- 87 + IA+ + ++ +SI +L+ N L HI +DY D + L Y I Sbjct: 6 MHIAFCVNDHYAEYILVSIKGLLE-NNSDPLVIHILSDYISDKNTNRLKKLVGLYPNAIL 64 Query: 88 KIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 I +++ +L+ L T WT ++R ++ + +VLYLDAD + IE L + Sbjct: 65 DIVIVDDLKLKDLKDT--WTIYTWYRVLLPEILDASVHRVLYLDADTLVSENIEELFSLD 122 Query: 148 FPDDKVAMVVTEGQADWWEKRAHSLGVAG--IAKGYFNSGFLLINTAQWAAQQVSARAIA 205 +A G D+ K + G K Y +G +++N W ++ + I Sbjct: 123 MTGKAIA-----GTVDFQSKDKSTYQRCGYEAEKEYVCAGVMMMNLDYWREHDIANKIID 177 Query: 206 MLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNT--------QFSLNY--QLKESFIN 255 + +I +PDQD +N + D + +KY+ + NY +L+E + Sbjct: 178 WGRDYN--DRIQYPDQDAINYICRDMKLLLPLKYDIIDGFFQDDYYFQNYPQELRECIES 235 Query: 256 PVTNDTIFIHYIGPTKPW 273 P IHY G PW Sbjct: 236 PA-----IIHYAGQA-PW 247 >UniRef50_Q3D427 Glycosyl transferase, family 8 n=8 Tax=Streptococcus agalactiae RepID=Q3D427_STRAG Length = 413 Score = 60.8 bits (146), Expect = 6e-08, Method: Compositional matrix adjust. Identities = 66/245 (26%), Positives = 99/245 (40%), Gaps = 33/245 (13%) Query: 41 FGCGISIASILKYNEGSRLCFH-IFTDYFGDDDR---KYFDALAL---QYKTRIKIYLIN 93 FG + +I+K +CFH F D++ +D ++F + + I I Sbjct: 13 FGYQEQVKTIIK-----SICFHNQFIDFYILNDDFPVEWFQMMEYHLSKMDCTISNTKIF 67 Query: 94 GDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKV 153 + ++ K + YFR+ I + + KVLYLD D+I + + V Sbjct: 68 NEEIKHFKFQKPMPYPTYFRYFIPE--VIHEDKVLYLDCDMIITSDLTSIFTLDISKYGV 125 Query: 154 AMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEII 213 A V R L + YFNSG LLIN W Q +S R + E + Sbjct: 126 AAV-----------RDDLLEEYDGKEDYFNSGLLLINNIFWREQGISQRLLDYTRENQ-- 172 Query: 214 KKITHPDQDVLNMLLADKLIFADIKYNTQFSLN-----YQLKESFINPVTNDTIFIHYIG 268 + + DQDVLN +L D + D YN + +Q E +N + IHY Sbjct: 173 GALQYHDQDVLNDVLCDNWLELDETYNYHTGADMLYNLFQQSERQLNRRKDLPKVIHYTA 232 Query: 269 PTKPW 273 TKPW Sbjct: 233 -TKPW 236 >UniRef50_A8LP95 Lipopolysaccharide 1 n=1 Tax=Dinoroseobacter shibae DFL 12 RepID=A8LP95_DINSH Length = 342 Score = 60.8 bits (146), Expect = 6e-08, Method: Compositional matrix adjust. Identities = 56/235 (23%), Positives = 107/235 (45%), Gaps = 14/235 (5%) Query: 107 THAIYFRFVIADYFINKAPKVLYLDADIIC-QGTIEPLINFSFPDDKVAMVVTEGQADWW 165 T + Y R ++ + ++LY+D+D+ + + L+ +A V Q Sbjct: 108 TGSTYLRLALSGALGHDYQRILYMDSDVFALRDGLHVLLFTDMRGKPLAAVRDNSQWRTS 167 Query: 166 EKRAHSLGVAGI-AKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVL 224 ++ L + A+ YFN+G LL++TA+ Q + A+A+ + ++ DQ +L Sbjct: 168 GRKPDDLVTLNLPARPYFNAGVLLMDTARLNEQDILAKALDLGTSQA--GRLARHDQTLL 225 Query: 225 NMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYP--V 282 N + + ++N QF+ S+I ++ D +H+IGP KPW D + +P + Sbjct: 226 NAVTSGNWAEMSPRWNWQFTW-----ASWIFALSEDARILHFIGPNKPWADTSGRFPKSI 280 Query: 283 SQAFMEAKNASPWKNTALLKPNNS--NQLRYSAKHMLKKHRYLKGFSNYLFYFIE 335 ++A+ + A + + + NS N R K ++K K S YL F + Sbjct: 281 TRAYGDFL-AEQFPERTVERAANSPINDPRRLIKSLIKHGLSRKKMSAYLARFAD 334 >UniRef50_C5EZG9 Glycosyl transferase family protein n=1 Tax=Helicobacter pullorum MIT 98-5489 RepID=C5EZG9_9HELI Length = 374 Score = 60.8 bits (146), Expect = 7e-08, Method: Compositional matrix adjust. Identities = 59/245 (24%), Positives = 98/245 (40%), Gaps = 33/245 (13%) Query: 56 GSRLCFHIFTDYFGDDDRKYFDALALQ----YKTRIKIYLINGDRLRSLP-STKNWTHAI 110 G FH+ D+ + ++ L L+ Y + I+++ + R+ T N + Sbjct: 6 GGAYNFHLLMDFVSQETKEKLQNLILELSKIYPCTLNIHILEDEIFRTQSLRTLNGNYLA 65 Query: 111 YFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQ----ADWWE 166 Y+R I + +YLD D+I G + L + K+ VV EG+ + E Sbjct: 66 YYRLRIGSALPLSIKRCVYLDVDMIVLGDLRELFKINL-QGKICGVVMEGKDNDTQNILE 124 Query: 167 KRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKK--ITHPDQDVL 224 + I YFNSG LL++ W + + RA EI+KK D+ +L Sbjct: 125 SKNKINKSIAIVSNYFNSGMLLVDLDLWRKENIEDRAF------EIVKKYYCHKHDEHIL 178 Query: 225 NMLLADK-------------LIFADIKYNTQFSLNYQL-KESFINPVTNDTIFIHYIGPT 270 N +L + L + N + +N ++ F N + N I +HY Sbjct: 179 NAVLQGQTFKILPQWNMMVFLYCRAVCLNERGKINMPYNRKDFNNALKNPKI-LHYHTHH 237 Query: 271 KPWHD 275 KPW D Sbjct: 238 KPWED 242 >UniRef50_B1Y723 Glycosyl transferase family 8 n=1 Tax=Leptothrix cholodnii SP-6 RepID=B1Y723_LEPCP Length = 316 Score = 60.5 bits (145), Expect = 8e-08, Method: Compositional matrix adjust. Identities = 58/250 (23%), Positives = 102/250 (40%), Gaps = 20/250 (8%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGS-RLCFHIFTDYFGDDDRKYFDALALQYKTRIKI 89 I D+ +L ++ S+++ N + H+ D R + +I+ Sbjct: 13 IVLACDEAYLMPLATTLRSVVESNAAHWPIECHVLVDDVSLPGRARVERSLPARAAQIRW 72 Query: 90 YLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFP 149 + ++ S + + + R ++AD + +VLYLD DI+ G + PL+ Sbjct: 73 HAVDLTDFSSFETQAAISKMTFARLLMADLLPAELERVLYLDTDILVLGDLLPLMRTEL- 131 Query: 150 DDKVAMVVTEG-QADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLN 208 D + V +G A+ G+ + YFN+G LLI+ A+W A +VSA A L Sbjct: 132 DGAILGAVRDGLDAELKSTSPAPTGMPDVCD-YFNAGVLLIDLARWRAGRVSAAARDHL- 189 Query: 209 EPEIIKKITHP-----DQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIF 263 + HP DQD LN+ + + + F + + + P I Sbjct: 190 -------VAHPQTPFADQDALNVACDGH--WKPLAAHWNFQGHRSTDIAALAPSQRPGI- 239 Query: 264 IHYIGPTKPW 273 +H+I KPW Sbjct: 240 VHFITALKPW 249 >UniRef50_A3YS36 Putative sugar transferase n=2 Tax=Campylobacter jejuni RepID=A3YS36_CAMJE Length = 459 Score = 60.5 bits (145), Expect = 8e-08, Method: Compositional matrix adjust. Identities = 55/215 (25%), Positives = 95/215 (44%), Gaps = 15/215 (6%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGS---RLCFHIFTDYFGDDDRKYFDALALQ----Y 83 I + + ++ + + SI+ S + CFHI + D+ K L + Y Sbjct: 4 IVFNSSNEYIENLSVLMYSIIINTNKSNTKKYCFHILSSNINDNTCKKLTLLEKELSSIY 63 Query: 84 KTRIKIYLINGDRLR--SLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIE 141 + IKIY IN + ++P + +A Y R ++A K LYLD D++ G I Sbjct: 64 PSEIKIYHINDNLFYDYNIPKHEGSYNA-YLRLMLASILSKDIKKCLYLDVDMLVLGDIS 122 Query: 142 PLINFSFPDDKVAMVVTEGQADWWEKRAH-SLGVAGIAKGYFNSGFLLINTAQWAAQQVS 200 L + DKV V + W + S + I +FNSG +LIN W + + Sbjct: 123 ELFDLDLK-DKVFAAVFILKHPWPNLNSKDSSEIFYIYGSHFNSGLMLINLDAWREKNIE 181 Query: 201 ARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFA 235 +R+++ + + + D+ VLN +L+ IF+ Sbjct: 182 SRSLSFIKNYYVPYAV---DEYVLNAILSKDDIFS 213 >UniRef50_B3Q568 Putative glycosyltransferase protein n=4 Tax=Rhizobium etli RepID=B3Q568_RHIE6 Length = 331 Score = 60.1 bits (144), Expect = 1e-07, Method: Compositional matrix adjust. Identities = 54/254 (21%), Positives = 101/254 (39%), Gaps = 27/254 (10%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGS-RLCFHIFTDYFGDDDRKYF------DALALQY 83 I + D + ++ S+ + N+ L H+ + G++ ++ ++ +Q+ Sbjct: 22 IVFAVDAAYAVPLATALRSVAENNQSVWPLDIHVIHEGIGEETKRLILESLPANSAIIQW 81 Query: 84 KTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPL 143 + +G R P T A R ++ + + LYLD DI+ ++E L Sbjct: 82 HPIATLSFASGFSTR--PGVSKMTFA---RILLPQFLPQTCDRALYLDGDILVLTSLEQL 136 Query: 144 INFSFPDDKVAMVVTEGQADWWEKRAHSLGVAG----IAKGYFNSGFLLINTAQWAAQQV 199 N + + V D+W G + K YFN+G LLI+ A+W +++ Sbjct: 137 WNTDLGEAVIGAV-----PDYWLDNPAGSGPGARGGALVKRYFNAGILLIDLAKWRNERI 191 Query: 200 SARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTN 259 S R++ L+ + DQD LN+ K D +N QF + + Sbjct: 192 SERSLDYLDR---FPTTEYSDQDALNVACDGKWKILDRAWNFQFEPRQAIAGIALE---Q 245 Query: 260 DTIFIHYIGPTKPW 273 +H++ KPW Sbjct: 246 KAAIVHFVTNVKPW 259 >UniRef50_D1Y7U2 Glycosyltransferase, family 8 n=1 Tax=Pyramidobacter piscolens W5455 RepID=D1Y7U2_9BACT Length = 617 Score = 60.1 bits (144), Expect = 1e-07, Method: Compositional matrix adjust. Identities = 70/271 (25%), Positives = 118/271 (43%), Gaps = 44/271 (16%) Query: 100 LPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVT- 158 L + ++ T ++RF+I D + KVLYLD D+I Q I L + + + + Sbjct: 353 LRAKEHVTTETFYRFLILD-LLKMYDKVLYLDCDMIIQRDIADLYDLDLGTNLIGAALDP 411 Query: 159 --EGQADWWE--KRAHSLGVAGIAK--GYFNSGFLLINTAQWAAQQVSARAIAMLNEPEI 212 GQ + R + V + YF +G LL+N A+ + V+ R + + E I Sbjct: 412 DFTGQCNGANPATRKYCDAVLKLKDCFTYFQAGVLLMNVAELN-KSVTVRQLLEMAETGI 470 Query: 213 IKKITHPDQDVLNMLLADKLIFADIKYN-------------TQFSLNYQLKESFINPVTN 259 K + DQD+LN++ + ++ D+ +N +F+ +Y L + + N Sbjct: 471 YK---YSDQDILNVVCEGRALYLDMAWNLLSDCDHYRWHHVVKFAPHYIL-DMYENAREK 526 Query: 260 DTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTAL----------LKPNN---- 305 I IHY G KPW D+ F +A +P+ L +P N Sbjct: 527 PYI-IHYAGFLKPWMKLGEDF--GYEFWKAARETPFYEELLYAALVPHGNTTRPQNFLHM 583 Query: 306 -SNQLRYSAKHMLKKHRYLKGFSNYLFYFIE 335 N+L AK +L K L+ F+ +L+Y I+ Sbjct: 584 LINRLVPLAKAVLPKGSRLRYFARHLYYRIK 614 >UniRef50_C5S494 Putative glycosyl transferase n=2 Tax=Actinobacillus minor RepID=C5S494_9PAST Length = 287 Score = 60.1 bits (144), Expect = 1e-07, Method: Compositional matrix adjust. Identities = 71/268 (26%), Positives = 110/268 (41%), Gaps = 43/268 (16%) Query: 28 CLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRI 87 ++IA D+N+ I S+ +++ R + I DY ++F AL Q+ T + Sbjct: 7 TINIALAADRNYAEQVITLIKSVCYHHKNVRF-YLIHQDY----PDEWFMALN-QHLTNV 60 Query: 88 KIYLING---DRLRSLPS-TKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPL 143 +I D R L ++ T A ++R++I + +V+YLD+DI+ G IE + Sbjct: 61 GAEIIPVTVLDSFRFLSKLQEHITQATFYRYIIPEI---PEDRVIYLDSDIVVDGNIEEM 117 Query: 144 INFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARA 203 FS + K + V + + E H K YFN G LLIN W ++ Sbjct: 118 Y-FSDFNGKYVLAVEDMYISYTE---HGYIEFPDLKPYFNGGVLLINNQLWKENDLAEYL 173 Query: 204 IAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSL------------------ 245 I M + + DQD+LN +L DK YN Q + Sbjct: 174 IQMTKQ---YPNVMFGDQDILNFVLKDKWGILSHVYNYQTGIIHAFPRLEENMSDEEIIT 230 Query: 246 NYQLKESFINPVTNDTIFIHYIGPTKPW 273 YQ + + P I IHY KPW Sbjct: 231 KYQKQADEVKP-----IIIHYTTKYKPW 253 >UniRef50_C9PNX4 Family 8 glycosyl transferase n=1 Tax=Pasteurella dagmatis ATCC 43325 RepID=C9PNX4_9PAST Length = 285 Score = 59.7 bits (143), Expect = 1e-07, Method: Compositional matrix adjust. Identities = 48/198 (24%), Positives = 91/198 (45%), Gaps = 18/198 (9%) Query: 92 INGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDD 151 ++ + + + P+ + + A YFR+++ +++ VLYLD D++ G++ + F D+ Sbjct: 73 VDSEVISTFPTLDHISEASYFRYLLGQLPLDR---VLYLDCDVVVTGSLTEIYYTDFGDN 129 Query: 152 KVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPE 211 + V + + HS K YFNSG LLI+ +W Q + + + + + Sbjct: 130 MMYAV----EDAFLNIAPHSYKEFPDMKPYFNSGMLLIDLNKWRDQNIENQLMDLTKQA- 184 Query: 212 IIKKITHPDQDVLNMLLADKLIFADIKYNTQFS-----LNYQLKES---FINPVTNDTIF 263 + + DQD +N++L K D YN Q + +++ E+ + + Sbjct: 185 --VNLYYGDQDAMNIILKGKWQALDKIYNYQTGSLIAFIQHKMPEALEKYKDLQGQQPKV 242 Query: 264 IHYIGPTKPWHDWAWDYP 281 IHYI KPW +D P Sbjct: 243 IHYITRYKPWLLPEYDLP 260 >UniRef50_C6EQF4 Putative uncharacterized protein n=3 Tax=Campylobacter jejuni RepID=C6EQF4_CAMJE Length = 958 Score = 59.3 bits (142), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 49/209 (23%), Positives = 93/209 (44%), Gaps = 34/209 (16%) Query: 106 WTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKV-------AMVVT 158 +T A+Y+R I + F N KV+Y D+D+I + I L + ++ A+ Sbjct: 98 FTTAMYYRIFIPEIFSN-FKKVIYCDSDVIFKADISHLFFIDLNNKEIGACRDIAALYAY 156 Query: 159 EGQADWWEKRAHS----LGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIK 214 + W++ + + I+ YFNSG ++ + + + ++ + ++ I Sbjct: 157 RKRETVWQQNIRNNFDKINFRSIS-DYFNSGVIVFDIVKCIQMKTVSKCLTVIKN---ID 212 Query: 215 KITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFI---NPVTNDTI-------FI 264 + PDQDVLN++ + F +++N ++ + K++F+ + N+ I Sbjct: 213 NLYFPDQDVLNIVFCGHVHFLPLEWNFLWTTYIEYKDNFMYLPKKIINEIYKAKTKPKII 272 Query: 265 HYIGPTKPWHD-------WAWDYPVSQAF 286 HYI TKPW D W W +P F Sbjct: 273 HYISETKPWKDKNSFFVEW-WKFPRKNLF 300 >UniRef50_UPI000190F79C lipopolysaccharide 1,2-glucosyltransferase n=1 Tax=Salmonella enterica subsp. enterica serovar Typhi str. E01-6750 RepID=UPI000190F79C Length = 98 Score = 59.3 bits (142), Expect = 2e-07, Method: Composition-based stats. Identities = 32/84 (38%), Positives = 49/84 (58%), Gaps = 1/84 (1%) Query: 18 YDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFD 77 +D ++ L+I+YG D+N+L G G+SIAS++ N L FHI D + KY + Sbjct: 15 FDESNNNDDNVLNISYGVDENYLDGVGVSIASVV-LNNNIPLAFHIICDSYSPCFVKYIE 73 Query: 78 ALALQYKTRIKIYLINGDRLRSLP 101 LA+Q+ +I +YLI + L LP Sbjct: 74 RLAVQHHIKISLYLIKVESLEVLP 97 >UniRef50_A7VNX5 Putative uncharacterized protein n=1 Tax=Clostridium leptum DSM 753 RepID=A7VNX5_9CLOT Length = 344 Score = 58.5 bits (140), Expect = 3e-07, Method: Compositional matrix adjust. Identities = 60/303 (19%), Positives = 127/303 (41%), Gaps = 36/303 (11%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYN-EGSRLCFHIFTDYFGDDDRKYFDALALQYKTRI 87 ++ + +D NF G ++ S+ + N E + +I + +++ +++ QY+ + Sbjct: 13 MNCVFSSDDNFADILGCALISLFENNREQETIEVYILDGGISEGNKRKLESIFQQYERMV 72 Query: 88 ------KIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIE 141 I + G+ + ++ W + + R +I + +VLYLD DI+ G+++ Sbjct: 73 HFIEVPDISQLTGEAV----TSGRWPISTFARILIDSLLPKEVKRVLYLDCDILVLGSLK 128 Query: 142 PLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSA 201 L DK A V + ++ +R + G+ G Y N+G +LI+ +W Q+ Sbjct: 129 NLWEIDL-KDKTAAGVMDCLSN---QRKQNAGING-EDSYINAGVMLIDMDKWRENQIEK 183 Query: 202 RAIAML---------NEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKES 252 + + + N+ +I K+ H D VL +F D Y + Y+ +S Sbjct: 184 QCMNYIRICNGQVAYNDQGVINKVLHKDLLVLPPEYNAMTLFFDFTYPDM--IKYRKPQS 241 Query: 253 F-----INPVTNDTIFIHYIG---PTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPN 304 + ++ +H+ +PW + ++P + + SPW+ L N Sbjct: 242 YYSAQQVDHARKHPRIVHFTSSFLSLRPWVKGS-EHPYAPLWRNYYKRSPWRAKDLRSDN 300 Query: 305 NSN 307 S+ Sbjct: 301 RSS 303 >UniRef50_B2ISC2 Glycosyl transferase, family 8 n=2 Tax=Streptococcus pneumoniae RepID=B2ISC2_STRPS Length = 401 Score = 58.5 bits (140), Expect = 3e-07, Method: Compositional matrix adjust. Identities = 67/261 (25%), Positives = 103/261 (39%), Gaps = 54/261 (20%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIF-----TDYFGDDDRKYFDALALQYKT 85 I G D +++ +I SI N+ + F++F T++F D++ + Sbjct: 5 IVLGADNHYMDKVETTIKSICSKNKEVK--FYVFNSDLPTEWFQLMDKRLSVLGSEIVNV 62 Query: 86 RIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLIN 145 ++ LIN L T + + A Y R+ I K + LYLD+DII + L Sbjct: 63 KVTESLINQFHL----PTPHLSSATYLRYFIPTIVFEK--RALYLDSDIIVTADLTSL-- 114 Query: 146 FSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIA 205 F FP D + + ++G FNSG LLI+T +W + + + Sbjct: 115 FEFPLDGCPLAAVPD-------------IPNTSEG-FNSGVLLIDTDRWREDDIQNQLLN 160 Query: 206 MLNEPEIIKKITH--PDQDVLNMLLADKLIFADIKYNTQFSLN-----------YQLKES 252 + IK H DQ++LNML D+ + YN Q + Y L E Sbjct: 161 L-----TIKHHEHVYGDQEILNMLFKDRWKKLSLSYNLQVGYDTYRHSLGDNEWYHLFEG 215 Query: 253 FINPVTNDTIFIHYIGPTKPW 273 N IHY KPW Sbjct: 216 IPN-------IIHYTTQNKPW 229 >UniRef50_C3XN62 Glycosyl transferase n=1 Tax=Helicobacter winghamensis ATCC BAA-430 RepID=C3XN62_9HELI Length = 284 Score = 58.2 bits (139), Expect = 4e-07, Method: Compositional matrix adjust. Identities = 40/163 (24%), Positives = 71/163 (43%), Gaps = 21/163 (12%) Query: 128 LYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFL 187 +YLD D++ + + V+ E + +L + ++K YFN+G L Sbjct: 1 MYLDVDMLVLKDLREIFAIDLEGKICGAVLDYKANRILEPKNKALPMLNLSKDYFNAGLL 60 Query: 188 LINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNT------ 241 LI+ +W +Q++ ++ I LN+ DQ LN++L DK+ + +NT Sbjct: 61 LIDLEKWKSQKLESKLIETLNQYH----CKEHDQSALNVVLKDKIKILPLSWNTLVYYYV 116 Query: 242 ---------QFSLNYQLKESFINPVTNDTIFIHYIGPTKPWHD 275 F+L Y K+ +N + +HY KPW+D Sbjct: 117 NAKACDDTKNFNLFYTRKD--LNKALKNPHILHYYLGFKPWND 157 >UniRef50_C5S3F7 Putative glycosyl transferase n=2 Tax=Actinobacillus minor RepID=C5S3F7_9PAST Length = 275 Score = 58.2 bits (139), Expect = 5e-07, Method: Compositional matrix adjust. Identities = 46/184 (25%), Positives = 88/184 (47%), Gaps = 23/184 (12%) Query: 95 DRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVA 154 + ++LP + + +FR+ I F+N KVLYLD D++ G++ + D VA Sbjct: 78 EEYKTLPHIS--SASTFFRYFIPA-FVND-DKVLYLDCDLVVNGSLSIFFDLELNDHYVA 133 Query: 155 MVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIK 214 + + ++ +K+ +FN+G LLIN W Q+++ +A+ + + + + Sbjct: 134 ASLDDIAFNFHQKK------------HFNAGVLLINNKLWRKQEITLKALELTD--RLNE 179 Query: 215 KITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKES----FINPVTNDT-IFIHYIGP 269 K+ DQ+VLN+L +K I + N Y + + +I +D + +H+ Sbjct: 180 KLEEGDQEVLNILFQNKWIELNPYLNYLVGAEYLYRRNGVTQYIRRQEDDVPLILHFNTK 239 Query: 270 TKPW 273 KPW Sbjct: 240 YKPW 243 >UniRef50_Q4JZJ9 Putative glycosyl transferase n=8 Tax=Streptococcus pneumoniae RepID=Q4JZJ9_STRPN Length = 344 Score = 57.8 bits (138), Expect = 5e-07, Method: Compositional matrix adjust. Identities = 66/280 (23%), Positives = 110/280 (39%), Gaps = 38/280 (13%) Query: 19 DHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDA 78 ++K + N ++I Y TD NF+ SI S+ N L I D D +++ + Sbjct: 22 ENKFRSRNF-MNIVYATDNNFVDVLSASIKSLYTTNSDLDLNLWIIADKVSDRNKEKINR 80 Query: 79 LALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQG 138 L+ Q+ R +I I + + + + R + + KVLYLD+DII Sbjct: 81 LSKQFAQR-EINWIENVEIPFKLHLDRGSISSFSRLFLGSVLPSSMSKVLYLDSDIIVMD 139 Query: 139 TIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQ 198 ++ + + F + G D + K + I K FN+G +LIN W Sbjct: 140 SLRSIFDIDFKGK-----ILYGVNDTFNKEYKQVLGIPIDKPMFNAGVMLINLELWRNNN 194 Query: 199 VSARAIAMLNEPEIIKK----ITHPDQDVLNMLLADKL-----------IFADIKYNTQF 243 V R + ++I+K I D VLN +L + IF D+ Y Sbjct: 195 VEERFL------QVIQKFNGTILQGDLGVLNAVLYNSFGVLPPEYNYMTIFEDLTYEEMI 248 Query: 244 ----SLNYQLKESFINPVTNDTIFIHYIGPT----KPWHD 275 +NY KE N + I + + + +PW + Sbjct: 249 VFKKPINYYSKEEIKN--ARERIVLRHFTTSFLSKRPWQE 286 >UniRef50_UPI0001A45357 glycosyl transferase family protein n=1 Tax=Neisseria subflava NJ9703 RepID=UPI0001A45357 Length = 264 Score = 57.4 bits (137), Expect = 7e-07, Method: Compositional matrix adjust. Identities = 49/172 (28%), Positives = 74/172 (43%), Gaps = 19/172 (11%) Query: 109 AIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTE--GQADWWE 166 A +FR ++ +++A LYLD+D++ ++ L N VA V + DW Sbjct: 81 AAFFRLMMQHLPVDRA---LYLDSDMVVTQSLHDLFNLDMRGYPVAAVQDSYLARTDW-- 135 Query: 167 KRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNM 226 H G+ YFNSG LL + QW ++ + + I K + + DQ LN Sbjct: 136 --NHPTGLH--TTPYFNSGMLLADLGQWRKHNIAEQLLQ--TAATIDKTVPYGDQCFLNT 189 Query: 227 LLADKLIFADIKYNTQ-----FSLNYQLKESFINPVTNDTIFIHYIGPTKPW 273 + + + + +N Q F Y L E F P T I IHY KPW Sbjct: 190 VFQENWLQLEESWNYQTGARRFFQTYDLDEMFPLPDTTPPI-IHYTTLAKPW 240 >UniRef50_C2KV37 Lipopolysaccharide biosynthesis protein, LPS:glycosyltransferase n=1 Tax=Oribacterium sinus F0268 RepID=C2KV37_9FIRM Length = 324 Score = 57.0 bits (136), Expect = 8e-07, Method: Compositional matrix adjust. Identities = 60/273 (21%), Positives = 115/273 (42%), Gaps = 28/273 (10%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 + I YG ++ F+ +S++S+L + EG L FHI + ++ ++ +I Sbjct: 1 MHIVYGVNEAFMPILAVSLSSLLLHAEGEALHFHILSLGIEEESKEKLRQYVETEGQKIS 60 Query: 89 IYLINGDRL----RSLPS--TKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEP 142 Y + ++L LP+ T ++ A R I K LYLDAD + +I Sbjct: 61 FYDLE-EKLSEWKEKLPALFTGKFSKATLLRLFIPSTLPETITKALYLDADTVVLQSILS 119 Query: 143 LINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSAR 202 L + D + M + ++K L +A + Y+N+G +L+N + + + + Sbjct: 120 LYHLRLGDKLLGMA---PEPSIYKKHKEFLSLAEESP-YYNAGVMLMNLSLLREEGMEEK 175 Query: 203 AIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYN-------------TQFSLNYQL 249 + E ++ DQD+LNM+ ++ ++N +FS YQ Sbjct: 176 CLRYYQMKE--GQLPFNDQDILNMVCKGRIRSLPQRFNFFSNYAYARYSALCRFSPWYQE 233 Query: 250 KES--FINPVTNDTIFIHYIGPTKPWHDWAWDY 280 ES + + +H+ G +PW + +Y Sbjct: 234 LESKKSYSQAKAHPVIVHFAGDERPWREGNHNY 266 >UniRef50_C7RG54 Glycosyl transferase family 8 n=1 Tax=Anaerococcus prevotii DSM 20548 RepID=C7RG54_ANAPD Length = 273 Score = 57.0 bits (136), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 44/178 (24%), Positives = 83/178 (46%), Gaps = 12/178 (6%) Query: 103 TKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQA 162 T + +Y+R + ++ ++LYLD D++ ++ L+ D +A G+ Sbjct: 77 TDRYPKEMYYRLLAGEFLPENLGEILYLDPDMLVINPLDDLLRTDISDYILAAASHTGKT 136 Query: 163 DWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQD 222 D + + + G Y+NSG LLIN + A +++ I E + + PDQD Sbjct: 137 D-MANNVNRIRL-GTDTDYYNSGLLLINLKR-AREEIDPDEIFSFVEDNHMNLLL-PDQD 192 Query: 223 VLNMLLADKLI-FADIKYNTQFSLNYQL------KESFINPVTNDTIFIHYIGPTKPW 273 +LN + D++ D+ YN + NY K++ + + + T+ +H+ G KPW Sbjct: 193 ILNAMYGDRIYPLDDLIYNYD-ARNYSSYLIRSKKQADLAWLMDHTVVLHFCGRDKPW 249 >UniRef50_D2QX95 Glycosyl transferase family 8 n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2QX95_9PLAN Length = 350 Score = 57.0 bits (136), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 70/302 (23%), Positives = 128/302 (42%), Gaps = 45/302 (14%) Query: 28 CLDIAYGTDKNFLFGCGISIASIL-KYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTR 86 LD+ D F G +I S+L + S+L + ++R D + R Sbjct: 4 VLDVLTSADDRFAIGLAGTIKSVLASLSPSSKLNLWVLDGGISSENRD--DLIHHWNDPR 61 Query: 87 IKIYLINGDR--LRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTI---- 140 + + + DR L + + A Y+R + + + K+LY+DAD++ Q + Sbjct: 62 LSVNWLPVDRALLAEFKVAPHMSDAAYYRLLAPNLLPSSVKKLLYIDADLLVQRDLTDLW 121 Query: 141 -EPL----------INFSFPD---------DKVAMVVTEGQADWWEKRAHSLGVAGIAKG 180 EP I F D D ++ +V +E+ LG+A + Sbjct: 122 DEPFDGHSCIAVHDIGAPFLDSNQILLEKPDALSRIVCRNPIPMFEE----LGLAPETR- 176 Query: 181 YFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYN 240 YFNSG +I+ W ++Q+S + +L ++I H DQ LN++LA++ AD ++N Sbjct: 177 YFNSGVFMIDLETWRSEQLSVQMFDVLCT-HRERQIYH-DQFALNIVLANRWKAADYRWN 234 Query: 241 TQFSLNYQLK---ESFINPVT----NDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNAS 293 Q + ++LK +F+ P + ++ + KPW +P+ + F + S Sbjct: 235 -QLAYIHELKVPQHTFLEPQVFQQYKHSPWVVHFTYRKPWQP-ECQHPLRKRFFDYLAGS 292 Query: 294 PW 295 W Sbjct: 293 KW 294 >UniRef50_B7C7N8 Putative uncharacterized protein n=2 Tax=Firmicutes RepID=B7C7N8_9FIRM Length = 416 Score = 56.6 bits (135), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 63/250 (25%), Positives = 101/250 (40%), Gaps = 32/250 (12%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIY 90 I D +++ +I SI +N+ + F+I + + + + + I Sbjct: 22 IVLACDNSYMDKLETTIKSICAHNKNIK--FYILNEDLPIEWFRLMTKRLSYFNSEILNI 79 Query: 91 LINGDRLRSLPS-TKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFP 149 ++GD + +++ + YFR++I DY KVLYLD DII +++ L N Sbjct: 80 KVSGDSFKKFRCPSEHINYQSYFRYLIPDYV--SEEKVLYLDCDIIVTESLDGLFNLDLK 137 Query: 150 DDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNE 209 + VA V + G FNSG LLIN W + + I + E Sbjct: 138 NYPVAAVP---------------DLPTTNDG-FNSGVLLINNKYWRENDILNKLIKLTVE 181 Query: 210 PEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIF------ 263 + + DQ +LN+L DK + YN Q + Q E I + +F Sbjct: 182 ---YHEKVYGDQGILNILFKDKWYRLPLTYNLQVGSDSQ--EHMIGNMEWYKLFDGIPKV 236 Query: 264 IHYIGPTKPW 273 IHY KPW Sbjct: 237 IHYTYTHKPW 246 >UniRef50_A4SAB5 Predicted protein (Fragment) n=1 Tax=Ostreococcus lucimarinus CCE9901 RepID=A4SAB5_OSTLU Length = 259 Score = 56.6 bits (135), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 68/268 (25%), Positives = 115/268 (42%), Gaps = 40/268 (14%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGS-RLCFHIFT--DYFGDDDRK---YFDALALQYK 84 IA+ D LF G I+S+L R+ FHIFT D D + Y A+ ++ Sbjct: 5 IAFACDPTQLFTLGPVISSVLSATASPHRIRFHIFTARDALTDASVQLNCYSRAIPFIWE 64 Query: 85 TRIKIYLINGDRLR---SLPSTKNWT--HAI-YFRFVIADYFINKAPKVLYLDADIICQG 138 ++ + D +R ++ S K W +A Y RF A+ ++ KV+YLD DII +G Sbjct: 65 ----LHEFSKDMIRANITVHSRKEWRLQNAFNYARFYFAE-ILSDVQKVVYLDTDIIVKG 119 Query: 139 TIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGV-----------AGIAKGY--FNSG 185 I L + + +++ KR+ LG +G+ + FN+G Sbjct: 120 DICRLHDANLRSSSTSVIAAV-------KRSVPLGSLLNFSNAAVKSSGLREKMHSFNAG 172 Query: 186 FLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSL 245 LLI+ W +++++ L + K +H Q L ++ D +N + Sbjct: 173 VLLIDLESWRRKRITSTVETWLKMNSVSKLYSHGSQPPLLLVFGDSFESIPSHWNVD-GV 231 Query: 246 NYQLKESFINPVTNDTIFIHYIGPTKPW 273 Y K+ V N+ +H+ G +KPW Sbjct: 232 GY--KKGLRASVLNEARVLHWSGQSKPW 257 >UniRef50_C5ZVZ7 Putative glycosyltransferase n=3 Tax=Campylobacterales RepID=C5ZVZ7_9HELI Length = 431 Score = 56.2 bits (134), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 60/246 (24%), Positives = 98/246 (39%), Gaps = 30/246 (12%) Query: 52 KYNEGSRLCFHIFTDYFGDDDR----KYFDALALQYKTRIKIYLINGDRLRSLP--STKN 105 K ++ FHI +D D + + L+ Y ++I++IN P + Sbjct: 59 KLDKSEGYVFHILSDSIPKDLQTKLQNFIQELSAFYPCTLQIHIINDIDFAHFPISGAAH 118 Query: 106 WTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWW 165 +H Y+R DY K LYLD+D++ + L D+ ++ G + Sbjct: 119 SSHLPYYRLKWQDYIKPAPQKCLYLDSDMLVLCDLRELFALDLKDNIAGIIGDCGSKNRK 178 Query: 166 EKRAHS--LGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDV 223 K + + YFNSGFLLIN+ Q+ +Q+ + + + IK DQD+ Sbjct: 179 IKYQENNYKKTFYFDENYFNSGFLLINSKQYIKEQIWEKCENLAKKCTYIKA---ADQDL 235 Query: 224 LNMLL---------------ADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIG 268 LN + L++ K + LNY +E+F N I +HY Sbjct: 236 LNFTIPINKRLKLPFAYNFQCITLLYVLCKDECKNRLNYT-REAFNKSFKNPKI-LHY-- 291 Query: 269 PTKPWH 274 KPW Sbjct: 292 GEKPWR 297 >UniRef50_A3CM53 Glycosyltransferase, putative n=9 Tax=Bacteria RepID=A3CM53_STRSV Length = 1074 Score = 55.8 bits (133), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 45/178 (25%), Positives = 75/178 (42%), Gaps = 28/178 (15%) Query: 105 NWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADW 164 N ++++ R+ A + + + LYLD DI+ + + + V G Sbjct: 479 NIHYSVFLRYFTATFV--EEDQALYLDCDIVVTRDLSEIFAVDLGSYPLGAVRDLG---- 532 Query: 165 WEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVL 224 G + FNSG LLIN W ++ + I M + + K+T DQ +L Sbjct: 533 --------GEVYFGEQIFNSGVLLINVNYWRENDIAGQLIEMTD--NLHDKVTQDDQSIL 582 Query: 225 NMLLADKLIFADIKYN-----TQFSLNYQLKESFINPVTNDTIFIHYIGPTKPWHDWA 277 NML ++ + YN T FS +Y+ ++ PV IHY+ KPW ++ Sbjct: 583 NMLFENRWMELPFAYNCITLHTTFS-DYEPEKGLYPPV------IHYLTERKPWKEYT 633 Score = 53.9 bits (128), Expect = 8e-06, Method: Compositional matrix adjust. Identities = 63/254 (24%), Positives = 105/254 (41%), Gaps = 43/254 (16%) Query: 36 DKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIYLINGD 95 D+ + +I SIL YN+ ++ ++F D+ + F+ L Q + ++ I+ D Sbjct: 9 DQAYQEQVSTTIKSILYYNKNVKI--YVFNQGLSDEWFRDFNELVEQLDS--ELVNISLD 64 Query: 96 RLRSLPSTKNWTH---AIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDK 152 ++ P H A Y R+ I + +VLYLD+D++ ++PL + Sbjct: 65 QVTISPEWLTQDHISSATYARYFIPQFVAEG--RVLYLDSDLVVNRDLQPLFDIPLEGKL 122 Query: 153 VAMVVTEGQADWWEKRAHSLGVAGIAKGY-FNSGFLLINTAQWAAQQVSARAIAMLNEPE 211 VA V G A GY FN+G LLI+ W +++ + + E + Sbjct: 123 VAAV-------------------GDAGGYGFNAGVLLIDNRSWKERELQE---SFIKETD 160 Query: 212 IIKKITHP--------DQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFIN---PVTND 260 I + DQ VLN +LA + D YN Q + S N + + Sbjct: 161 RIMGLVQSGQMEDFNGDQTVLNHVLAQDWLPLDKIYNLQVGHDLVAFYSGWNGHFELDQE 220 Query: 261 TIFIHYIGPTKPWH 274 + IHY KPW+ Sbjct: 221 PLIIHYTTFRKPWN 234 >UniRef50_A4WWT5 Glycosyl transferase, family 8 n=1 Tax=Rhodobacter sphaeroides ATCC 17025 RepID=A4WWT5_RHOS5 Length = 319 Score = 55.8 bits (133), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 45/171 (26%), Positives = 73/171 (42%), Gaps = 9/171 (5%) Query: 107 THAIYFRFVIADYFINKAPKVLYLDADIICQGT-IEPLINFSFPDDKVAMVVTEGQADWW 165 T Y R V+ + F ++LYLD+DI QG + LI +A V Q Sbjct: 87 TAVTYLRLVLPEAFSEDYDRILYLDSDIYIQGGDLGALIALPLAGRPLAAVRDNKQWRTP 146 Query: 166 EKRAHSLGVAGIA-KGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVL 224 +R G+ + YFNSG LL + + A + A+ + +++ DQ +L Sbjct: 147 SRRMVDFDRLGLPQRPYFNSGVLLFDVPAFRAANLLQEALRIGRSQG--RQLVRHDQSLL 204 Query: 225 NMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIGPTKPWHD 275 N + +N Q++ + +L + + P IH+IG KPW D Sbjct: 205 NACMLGNWAELSPSWNWQYTWSSRLFAAMLGPN-----IIHFIGRCKPWCD 250 >UniRef50_B1I7N1 Glycosyl transferase, family 8 n=8 Tax=Streptococcus pneumoniae RepID=B1I7N1_STRPI Length = 817 Score = 55.8 bits (133), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 63/258 (24%), Positives = 104/258 (40%), Gaps = 43/258 (16%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCF---HIFTDYFGDDDRKYFDALALQYKTRI 87 I D+N++ +I SIL +N ++ I D+F RK + I Sbjct: 5 IVLAGDRNYIRQLETTIKSILYHNRDVKIYILNQDIMPDWF----RKPRKIARMLGSEII 60 Query: 88 KIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 + L + + + Y R+ IADY + KVLYLD+D+I ++E L + Sbjct: 61 DVKLPEQTVFQDWEKQDHISSITYARYFIADYI--QEDKVLYLDSDLIVNTSLEKLFSIC 118 Query: 148 FPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAI--A 205 + +A V GI FN+G LLIN +W +++ R I + Sbjct: 119 LEEKSLAAVK---------------DTDGIT---FNTGVLLINNKKWRQEKLKERLIEQS 160 Query: 206 MLNEPEIIK-KITH--PDQDVLNMLLADKLIFADIKYNTQFSL-------NYQLKESFIN 255 ++ E+ + + H DQ + N +L D + YN Q N+Q +F Sbjct: 161 IVTMKEVEEGRFEHFNGDQTIFNQVLQDDWLELGRAYNLQVGHDIVALYNNWQEHLAF-- 218 Query: 256 PVTNDTIFIHYIGPTKPW 273 + + IH+ KPW Sbjct: 219 --NDKPVVIHFTTYRKPW 234 >UniRef50_C0X9Z8 Glycosyltransferase n=1 Tax=Lactobacillus gasseri JV-V03 RepID=C0X9Z8_9LACO Length = 675 Score = 55.5 bits (132), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 41/145 (28%), Positives = 67/145 (46%), Gaps = 24/145 (16%) Query: 101 PSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEG 160 PS+ + RF+I D A KVLYLD+D+I ++ + +F DDK+ V + Sbjct: 73 PSSDQIKKISFGRFLIPDLI--SADKVLYLDSDLIVTDNLQSIFQMNF-DDKMLFAVHDY 129 Query: 161 QADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPD 220 Q FNSG +LIN +W ++VS++ I M + + D Sbjct: 130 QN----------------PDQFNSGVMLINNKRWREEKVSSKLIEMSKQQAL-----ASD 168 Query: 221 QDVLNMLLADKLIFADIKYNTQFSL 245 Q V+N + +++ ++ YN Q L Sbjct: 169 QAVINEVFKNQIGELNLSYNYQIGL 193 >UniRef50_B6VUC8 Putative uncharacterized protein n=1 Tax=Bacteroides dorei DSM 17855 RepID=B6VUC8_9BACE Length = 315 Score = 55.5 bits (132), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 56/258 (21%), Positives = 108/258 (41%), Gaps = 18/258 (6%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 + I + + C + + S+ + N+ + ++F+ D++ K + L +Y T+++ Sbjct: 2 ISILCNSSNEYAIHCKVMLTSLFENNKQNDKEVYVFSTSMSDENIKGLELLGQRYGTKVQ 61 Query: 89 IYLINGDRLRSLPSTKNWTH-AIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 I +++ +L+ LP + + A Y R AD + K+LYLD DII ++ L + Sbjct: 62 IIIVDSQKLQFLPIHFAYHNIACYLRLFAAD-LLPGINKLLYLDCDIIVNSDLKALWDID 120 Query: 148 FPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAI--A 205 D A T +L + Y N+G +LIN W V+ + + A Sbjct: 121 ITD--YAFAATHDLTYCEPNFKKNLQLEE-NDTYINTGVMLINCDYWRNNNVAQKVLDYA 177 Query: 206 MLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFIN------PVTN 259 + N ++I DQD LN + ++N Y+ + N + Sbjct: 178 IHNGDKMIA----ADQDALNATMQGSFKLFSEEWNVYPDYFYEKPNLYTNVYPILDEIRR 233 Query: 260 DTIFIHYIGPTKPWHDWA 277 + IH++ KPW ++ Sbjct: 234 NPKIIHFLY-VKPWFNYC 250 >UniRef50_Q9L7A2 Putative glycosyl transferase n=1 Tax=Haemophilus ducreyi RepID=Q9L7A2_HAEDU Length = 269 Score = 55.1 bits (131), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 44/184 (23%), Positives = 83/184 (45%), Gaps = 24/184 (13%) Query: 99 SLPSTKNWTH----AIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVA 154 ++ + K ++H +FR+ I+D+ + KV+YLDADI+ G++ L + +A Sbjct: 73 TIKNFKTYSHISSDTTFFRYFISDFI--EQDKVIYLDADIVVNGSLTELYQTDISNYFLA 130 Query: 155 MVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIK 214 V D ++ + + FN+G LLIN +W ++ +++ + I Sbjct: 131 AV-----KDIISEKIY------VNNHIFNAGMLLINNKKWREHNITQFCLSL--SEKYIN 177 Query: 215 KITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQL----KESFINPVTNDT-IFIHYIGP 269 + DQ +LN++ DK + + YN +Y K ++ + + IHY Sbjct: 178 SLPDADQSILNLIFKDKWLKLNRGYNYLIGTDYLFFKYGKTRYLEDLGETIPLIIHYNTE 237 Query: 270 TKPW 273 KPW Sbjct: 238 AKPW 241 >UniRef50_B2UQ54 Glycosyl transferase family 8 n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UQ54_AKKM8 Length = 328 Score = 55.1 bits (131), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 75/327 (22%), Positives = 134/327 (40%), Gaps = 36/327 (11%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKY--FDALALQYKTRIK 88 + +D + +++ S+L G + I+ G D + + LA + R++ Sbjct: 12 VVLASDNRGILPLSVTVFSLLN-TAGPETFYKIYVLSDGIDGENWASVERLAAPFDCRLE 70 Query: 89 IYLINGDRLR-SLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 ++G + P T+ W + R I + + +LYLD D++ + L + Sbjct: 71 FIDVSGILEKHDFPHTEQWPVPAWGRVFIPELLKEERGNILYLDIDVLVCRDLTELFRTN 130 Query: 148 FPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAI--- 204 D K VV E + L + GYFNSG LL+N + + + RA+ Sbjct: 131 M-DGKAIGVVFENFSRPGSHFNERLEMPLTCTGYFNSGVLLMNVDVFREKNL-VRAVLDY 188 Query: 205 AMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYN-----TQFSLNYQLKESFINPVTN 259 A+ + ++T PDQD LN L + + ++N T+ L +E F VT Sbjct: 189 AVTHR----DRLTCPDQDALNGALCELTVPLHPRWNWHDGLTRRILKNDPREQFWRGVTP 244 Query: 260 --------DTIFIHYIGPTKPW-HDWAWDYPVSQAFMEA----KNASPWKN-TALLKPNN 305 + +HY G KPW ++W ++ + M + P + A+LK + Sbjct: 245 RQAVEAALEPGILHYQGVHKPWRYNWRYEGERYERVMREAGLLRGPLPGRTLPAVLKKHL 304 Query: 306 SNQL-RYSAKHMLKKHRYLKGFSNYLF 331 + R +A+ +L R +GF N L Sbjct: 305 YRPVYRMTARKIL---RLKEGFDNRLL 328 >UniRef50_C1QC03 LPS:glycosyltransferase n=1 Tax=Brachyspira murdochii DSM 12563 RepID=C1QC03_9SPIR Length = 347 Score = 55.1 bits (131), Expect = 4e-06, Method: Compositional matrix adjust. Identities = 48/169 (28%), Positives = 79/169 (46%), Gaps = 16/169 (9%) Query: 107 THAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVA--MVVTEGQADW 164 +++ +FR I I A K++YLD D+I ++ L F DD V E D Sbjct: 89 SNSTWFRLSIPS-LIPNADKIVYLDGDMIINSSLREL----FSDDMSDYYAYVVEDVMDK 143 Query: 165 WEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVL 224 ++ +G + K YFN+GFL+IN W + + N + + + + DQD+L Sbjct: 144 IDEVKAPIGFSKTDK-YFNAGFLMINNKLWIEDNLEEK---FYNAVDTMPILGYKDQDIL 199 Query: 225 NMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIGPTKPW 273 N L +++ F D K++ L+ + I+ N IH +G KPW Sbjct: 200 NYCLKNRVKFIDKKWDF---LDNKSCYKEISADINKINIIHCVG--KPW 243 >UniRef50_Q3DNS6 Glycosyl transferase, family 8 n=5 Tax=Streptococcus agalactiae RepID=Q3DNS6_STRAG Length = 401 Score = 54.3 bits (129), Expect = 5e-06, Method: Compositional matrix adjust. Identities = 59/254 (23%), Positives = 107/254 (42%), Gaps = 37/254 (14%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIF-----TDYFGDDDRKYFDALALQYKT 85 +A D N+L ++I SI YN + F++F ++ + +RK + L + Sbjct: 5 VALAVDSNYLDKALVTIKSICVYNRN--ITFYLFNQDTPVEWVRNINRK-LEPLGSKL-I 60 Query: 86 RIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLIN 145 +KIY + L + + W FR +ADY + +VLYLD+DII ++ L Sbjct: 61 NVKIYNYDIAHLTTFLTVSTW-----FRLFLADYI--PSSRVLYLDSDIIVNTNLDYLFE 113 Query: 146 FSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIA 205 F +A V + + +G FN+G LL N W ++ + Sbjct: 114 LDFKGYYLAAVKDPHKNE---------------EGGFNAGMLLANLELWREDGLTKTLLK 158 Query: 206 MLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQ---FSLNYQLKESFINPVTNDTI 262 E+ + + DQ +LN++ ++ + + +N Q Y + + N T Sbjct: 159 T--AEELHRVVKTGDQSILNIVCHNRWLSLNKTWNFQTYDVVSRYNHRSYLYLNIENRTP 216 Query: 263 -FIHYIGPTKPWHD 275 IH++ KPW++ Sbjct: 217 NIIHFLTSDKPWNE 230 >UniRef50_Q0I2Z7 Lipopolysaccharide biosynthesis protein, glycosyltransferase, family 8 n=1 Tax=Haemophilus somnus 129PT RepID=Q0I2Z7_HAES1 Length = 354 Score = 54.3 bits (129), Expect = 6e-06, Method: Compositional matrix adjust. Identities = 34/105 (32%), Positives = 57/105 (54%), Gaps = 13/105 (12%) Query: 181 YFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKK----ITHPDQDVLNMLLADKLIFAD 236 YFN+G LLIN +W V +++ + E K+ + DQD+LN + A+ + + D Sbjct: 202 YFNAGVLLINVVEWEKCHVFEKSLQWI---EYCKRNNIEFLYQDQDILNAIFANNVKYLD 258 Query: 237 IKYN-TQFSLNY--QLKESFINPVTNDTI---FIHYIGPTKPWHD 275 ++YN T +LN ++ + +N T+ IHY+GP K WH+ Sbjct: 259 LRYNFTANALNRLKRVSKKELNQYEEATMPLAIIHYVGPKKSWHE 303 >UniRef50_Q2K5X3 Galactosyltransferase protein n=13 Tax=Rhizobium RepID=Q2K5X3_RHIEC Length = 333 Score = 54.3 bits (129), Expect = 6e-06, Method: Compositional matrix adjust. Identities = 60/245 (24%), Positives = 98/245 (40%), Gaps = 29/245 (11%) Query: 99 SLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVT 158 L + W+ A R + + ++LYLDAD++ ++ L F +A V Sbjct: 105 GLQARGRWSAATLARLYMDRDIPDHIERLLYLDADVLAVAPVDELFTLDFQGKALAAVDD 164 Query: 159 EGQADWWEKRAHSLGVAGIAKG--YFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKI 216 A + EK G+ +G YFN+G LL + W+A AR + EI K+ Sbjct: 165 YVMA-FPEKSGARQRKIGMGEGGRYFNAGVLLFD---WSA--CRARGL-FPRTREIFKER 217 Query: 217 TH----PDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIGPTKP 272 +H DQD LN+ + D ++NTQ L P + H+ G KP Sbjct: 218 SHLFENNDQDALNVTFDGDWLVLDPRWNTQTGL---------LPFVDRPAIFHFTGRKKP 268 Query: 273 WHDWAWDYPVSQAFMEAKNASPWKNTA----LLKPNNSNQLRYSAKHMLKKHRYLKGFSN 328 W + P M + A +NT +P+ ++++ H+ K+ L + Sbjct: 269 WQA---NVPWVHRRMANRYADDLRNTPWASFCRQPSRTDRVAGFLSHVGKQIGGLTRLAR 325 Query: 329 YLFYF 333 YF Sbjct: 326 MRAYF 330 >UniRef50_A4VVV8 Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases n=7 Tax=Firmicutes RepID=A4VVV8_STRSY Length = 334 Score = 54.3 bits (129), Expect = 7e-06, Method: Compositional matrix adjust. Identities = 58/293 (19%), Positives = 112/293 (38%), Gaps = 37/293 (12%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLC-FHIFTDYFGDDDRKYFDALALQYKTRI 87 ++I + + F+ + SI++ + C F++F+D +++ ++ Sbjct: 6 VNILFTLNDAFVPQVAACMGSIMRTLDEDDTCHFYLFSDGISQQNKENLHQFVTDGGNKL 65 Query: 88 KIYLINGDRLRSLPS-------TKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTI 140 I L +L S T W + R ++ + +++YLD D + I Sbjct: 66 TIV-----ELENLESYFDFEVDTNGWASVVLARLLVDKLLPEEVDRIIYLDGDTLVLENI 120 Query: 141 EPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVS 200 L + KV + E A + +LG Y N+G LLI+ +W ++ + Sbjct: 121 RELWEVDL-EGKVLGMCPEPTASSERREGLNLGTY----TYHNAGVLLIDLKRWRSKSIG 175 Query: 201 ARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSL----NYQLKESFINP 256 E ++ DQD LN L +++ I YN F++ Y+ E P Sbjct: 176 TIIFDYYKEKN--GELFANDQDALNGALKEEIKTLSITYN-YFNIFDVYPYRTLEKLSRP 232 Query: 257 VT-----------NDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNT 298 T +H++G +PW + + ++ A N +PW+ T Sbjct: 233 STFISKEEFVKIRKQPRIVHFLGEERPWR-IGNKHRFREDYVSALNQTPWRGT 284 >UniRef50_B6GCA0 Putative uncharacterized protein n=1 Tax=Collinsella stercoris DSM 13279 RepID=B6GCA0_9ACTN Length = 990 Score = 54.3 bits (129), Expect = 7e-06, Method: Compositional matrix adjust. Identities = 48/182 (26%), Positives = 81/182 (44%), Gaps = 26/182 (14%) Query: 111 YFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMV-----VTEGQADWW 165 Y+RF+I D + KVLYLD+D+I +G + L D +A V Sbjct: 734 YYRFLIQD-LLPYYDKVLYLDSDLIIRGDVSELFATDLGDSLLAAAHDIDFVANVNMKRG 792 Query: 166 EKRAHSLGVAGIAK--GYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDV 223 ++ A++ V G+ YF +G L++NT ++ + ++ I + DQDV Sbjct: 793 DRFAYAKEVLGMKDPYSYFQAGVLVLNTRAMRSRHTMEEWLEFASDDRFI----YNDQDV 848 Query: 224 LNMLLADKLIFADIKYNTQ------------FSLNYQLKESFINPVTNDTIFIHYIGPTK 271 LN ++++ D +N F+ Y ++FI +N+ I +HY G K Sbjct: 849 LNAHCEGEVVYLDYSWNVMIDCFGRINKVFTFAPAYMF-DAFIESRSNEKI-VHYAGFEK 906 Query: 272 PW 273 PW Sbjct: 907 PW 908 >UniRef50_C1IBL0 Glycosyl transferase n=1 Tax=Clostridium sp. 7_2_43FAA RepID=C1IBL0_9CLOT Length = 273 Score = 53.9 bits (128), Expect = 7e-06, Method: Compositional matrix adjust. Identities = 49/206 (23%), Positives = 88/206 (42%), Gaps = 44/206 (21%) Query: 92 INGDR--LRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFP 149 IN +R ++ P +K + +Y+R + K+LYLD DI+ +I PL Sbjct: 67 INVERSVFKNAPVSKRYPQEMYYRLLAPLILPKSIKKILYLDPDILIINSIRPLWETEL- 125 Query: 150 DDKVAMVVTEGQADWWEKRAHSLGVAGIAK-----------GYFNSGFLLINTAQWAAQQ 198 ++ A +GV G+ Y+NSG +L++ + Sbjct: 126 ------------GNYIFAAASHVGVTGVINDINRVRLRVDHDYYNSGVMLMDLTK----- 168 Query: 199 VSARAIAMLNE-----PEIIKKITHPDQDVLNMLLADKLIFADI---KYNTQFSLNYQLK 250 AR+I + E E +++ PDQD+ N L + + D Y+ + NY L+ Sbjct: 169 --ARSIVNVEEIFQCVREHKEELLLPDQDIFNYLYGKQTLPLDDAIWNYDARKYSNYLLR 226 Query: 251 ESF---INPVTNDTIFIHYIGPTKPW 273 ++ +T +T+ +H+ G +KPW Sbjct: 227 SGGNYDMDWITRNTVVLHFCGKSKPW 252 >UniRef50_B3WD33 WbbM protein n=8 Tax=Lactobacillus RepID=B3WD33_LACCB Length = 318 Score = 53.9 bits (128), Expect = 8e-06, Method: Compositional matrix adjust. Identities = 47/169 (27%), Positives = 71/169 (42%), Gaps = 12/169 (7%) Query: 107 THAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWE 166 T IYFR IAD F + K +Y+DAD + G + L D+ VA V + E Sbjct: 90 TLTIYFRLFIADMF-PQYDKAIYIDADTVADGDLAELFTTDLGDNLVAGVADPVMMTYPE 148 Query: 167 KRAHSLGVAGIAKG-YFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLN 225 + G+ G Y NSG L++N AQ + S R + +L + DQD +N Sbjct: 149 TIEYIQRDFGVQPGEYINSGVLILNLAQMRQEHFSDRFLHLLKTYHF--TMIAADQDYIN 206 Query: 226 MLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIGPTKPWH 274 ++ ++ + +N Q + + IHY KPWH Sbjct: 207 VIAQHRIKYLPKTWNMQTGVPTAAESG--------GKLIHYNLFGKPWH 247 >UniRef50_C4Z1V1 Putative uncharacterized protein n=1 Tax=Eubacterium eligens ATCC 27750 RepID=C4Z1V1_EUBE2 Length = 607 Score = 53.9 bits (128), Expect = 9e-06, Method: Compositional matrix adjust. Identities = 56/203 (27%), Positives = 86/203 (42%), Gaps = 25/203 (12%) Query: 95 DRLRSLPSTKNWTHAIYFRFVIADYF--INKAPKVLYLDADIICQGTIEPLINFSFPDDK 152 DR + + +T IYFR IA+ F +NKA +Y+D+D + I L + D Sbjct: 355 DRQENRLYSGEFTLTIYFRLFIAELFPELNKA---VYIDSDTVINDDIAKLYSVDMGDAM 411 Query: 153 VAMVVTEGQADWWEKRAHSL-GVAGIAKG-YFNSGFLLINTAQWAAQQVSARAIAMLNEP 210 V + A AH + V GI + Y NSG LL+N + ++ R + ++ E Sbjct: 412 FG-AVRDTFAGKNTILAHYIENVVGIERNEYVNSGVLLMNLDKIRQAHLADRFLKLMAEY 470 Query: 211 EIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIGPT 270 PDQD +N + A ++ F D ++N + + E P IHY Sbjct: 471 HF--DSVAPDQDYINSMCAKEIYFLDKEWNV---MPNKGGEYIARPK-----LIHYNLFD 520 Query: 271 KPWH-------DWAWDYPVSQAF 286 KPWH ++ W Y F Sbjct: 521 KPWHYSEIPYEEYFWQYAAESGF 543 >UniRef50_C5ELK9 Glycosyl transferase n=3 Tax=Clostridiales RepID=C5ELK9_9FIRM Length = 333 Score = 53.1 bits (126), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 53/204 (25%), Positives = 89/204 (43%), Gaps = 33/204 (16%) Query: 126 KVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSG 185 K LYLD D I +I PL D V MV+ + +++ S+G+ G Y+NSG Sbjct: 108 KALYLDCDTIVCKSIRPLYETELGDAVVGMVM---EPTVYKEMKESIGM-GKDDPYYNSG 163 Query: 186 FLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSL 245 LL+ +W + V + + ++ DQD +N L ++ +KYN + Sbjct: 164 VLLMALDRWRQEDVLQKLLDFYKSCH--GRLFACDQDTINGALKGRIKTLPVKYN--YFT 219 Query: 246 NYQL-----------------KESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFME 288 NY+ +E+++ + I IHY+G +PW A ++ + E Sbjct: 220 NYRYFRYSTLCSMCAAYREIGEEAYLEARRSPAI-IHYLGDERPW--IAGNHNHFKKLYE 276 Query: 289 AKNA-SPWKNTALLKPNNSNQLRY 311 A +PWK+T P + + RY Sbjct: 277 YYLAKTPWKDT----PKQTGKERY 296 >UniRef50_C1QC00 LPS:glycosyltransferase n=1 Tax=Brachyspira murdochii DSM 12563 RepID=C1QC00_9SPIR Length = 332 Score = 53.1 bits (126), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 66/270 (24%), Positives = 105/270 (38%), Gaps = 23/270 (8%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYN-EGSRLCFHIFTDYFGDDDRKYFDALALQYKTRI 87 +DI D N+ G +IASIL + E + FH+ ++++ +L I Sbjct: 1 MDICLSADDNYAKYMGTTIASILSNSKEDEEIYFHLLDGGITEENKNKLLSLKNIKNCDI 60 Query: 88 KIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 Y +N + + +FR + I K+LYLD D I +++ L Sbjct: 61 IFYSVNNMNYK-------YDAPHFFRLNVPS-LIPNVDKLLYLDCDTIVLNSLKELFEID 112 Query: 148 FPDDKVAMVVTEGQADWWE--KRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIA 205 + A+ + + K H L V I YFNSG L+IN W ++ Sbjct: 113 ISN-YYALACEDVFLNCIISFKNMHGLNVNDI---YFNSGMLMINNKLWRDDKLEN---L 165 Query: 206 MLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIH 265 ++ H DQDVLN ++ ++ D K+N K I+ V IH Sbjct: 166 FYDDYSKFGNTGHADQDVLNRIIKGRVKIVDSKWNFLSHKKVYSKAPDISLVN----IIH 221 Query: 266 YIGPTKPWHDWAWDYPVSQAFMEAKNASPW 295 Y G KPW + + F + +PW Sbjct: 222 YAGE-KPWKETSSKAFFIDEFWKYYQLTPW 250 >UniRef50_Q5M3K9 Glycosyl transferase n=4 Tax=Streptococcus RepID=Q5M3K9_STRT2 Length = 697 Score = 52.8 bits (125), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 45/177 (25%), Positives = 76/177 (42%), Gaps = 33/177 (18%) Query: 105 NWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMV---VTEGQ 161 N +A Y R+ +AD+ + + LYLD+D++ G++E L +A V +GQ Sbjct: 373 NSNYASYLRYFVADFVSEE--RALYLDSDMVVTGSLEDLFTLDLQGRPLAAVRDYAVQGQ 430 Query: 162 ADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQ 221 + F++GF++I+TA W + I M +E K+ +Q Sbjct: 431 D---------------RQAMFDAGFMVIDTAYWKQYNMRRHLIDMTSEWH--DKVPFAEQ 473 Query: 222 DVLNMLLADKLIFADIKYNTQFSLNYQLKESFIN----PVTND-TIFIHYIGPTKPW 273 +LNM +F + F NY + +S ++ P D +HY KPW Sbjct: 474 SILNM------VFCNNWLTLSFDNNYAVTKSSLSGYHLPNGQDYPKVLHYTSHRKPW 524 >UniRef50_UPI00016B2258 glycosyl transferase, family 8 n=1 Tax=candidate division TM7 single-cell isolate TM7c RepID=UPI00016B2258 Length = 327 Score = 52.4 bits (124), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 68/304 (22%), Positives = 123/304 (40%), Gaps = 48/304 (15%) Query: 28 CLDIAYGTDKNFLFGCGISIASILKYNEGSRLC--FHIFTDYFGDDDRKYFDALALQYKT 85 L++ Y +D N+ ISI S+++ N+ + F++ D K+ + + Sbjct: 5 ILNVIYQSDDNYAVVSAISIVSLMENNKHLKQINIFYLGHQLKKDSINKFNKMVGNYHNA 64 Query: 86 RIKIYLING--DRLRSLPSTKNWT--HAIYFRFVIADYFINKAPKVLYLDADIICQGTIE 141 I ++ D L+ + K W + +++ + K ++LY++ + G ++ Sbjct: 65 TITFVDVSSYPDELKEI-GVKAWKGLYITWYKMLAFAKLDIKTDRILYINPHTVISGALD 123 Query: 142 PLINFSFPDDKVAMVVTEGQADWWEKRAHS--LGVAGIAKGYFNSGFLLINTAQWAAQQV 199 L+ F D+ +A+ + AH +G+ I GYFN G +LIN +W ++ Sbjct: 124 GLLELDFEDNVMALSYDATMVN-----AHKDVIGLKPI-DGYFNCGIMLINHKKWMKDKI 177 Query: 200 SARAIAML--NEPEIIKKITHPDQDVLNMLLADKLIFADIKYN--TQF---------SLN 246 A+ L N E+ DQD+ N+ + ++YN T F N Sbjct: 178 DAKMREHLRYNHYEV------ADQDLCNVFFKGNIKKVGVEYNFSTVFYGYDIKKYIKAN 231 Query: 247 YQLKESFINPVTNDTIFIHYIGPT----------KPWHDWAWDYPVSQAFMEAKNASPWK 296 L ESF + D I Y P +PW + PV + + N +PWK Sbjct: 232 GFLPESF---YSYDEIMESYYTPKIIHSQFGMNGRPWQQ-GNENPVGILWRKYLNLTPWK 287 Query: 297 NTAL 300 N + Sbjct: 288 NATM 291 >UniRef50_A5UC07 Aspartate-semialdehyde dehydrogenase n=7 Tax=Haemophilus influenzae RepID=A5UC07_HAEIE Length = 300 Score = 52.4 bits (124), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 46/180 (25%), Positives = 70/180 (38%), Gaps = 13/180 (7%) Query: 99 SLPSTKNWTH----AIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVA 154 S P+ + H A FR + +V+YLD D+I I+ L + + D +A Sbjct: 68 SFPTVMSPAHIQSSASLFRLYLHQILPQHIERVIYLDIDLIIHQAIDELWDINLEDSLIA 127 Query: 155 MVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIK 214 V WE + + Y N+G +LIN +W + I + + Sbjct: 128 GVSDFFSEYLWEHPFYE------KQQYINTGVMLINLNKWRENNIEQYFIEY--AAKYGE 179 Query: 215 KITHPDQDVLNMLLADKLI-FADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIGPTKPW 273 + DQDV+N + LI +K+N Q L + IHYIG KPW Sbjct: 180 FFVYGDQDVINFSIPTNLIKLLPVKFNIQVKFIEYLWMEHKEKIKFTPHIIHYIGSNKPW 239 >UniRef50_B9KUH7 Lipopolysaccharide 3-alpha-galactosyltransferase n=1 Tax=Rhodobacter sphaeroides KD131 RepID=B9KUH7_RHOSK Length = 304 Score = 52.0 bits (123), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 47/168 (27%), Positives = 71/168 (42%), Gaps = 23/168 (13%) Query: 126 KVLYLDADIICQGTIEPLINF---SFP-----DDKVAMVVTEGQADWWEKRAHSLGVAGI 177 +VLYLD D+ + PL + FP D V+ + G+ RA A Sbjct: 106 RVLYLDGDVRVVDDLSPLFSLDMRGFPLAGVRDYVVSKRLARGEPVKVRNRARIEEEARC 165 Query: 178 AKG-----YFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKL 232 G YFN+G LL++ + AA A+ L+ K T DQD LN + A ++ Sbjct: 166 MSGADASTYFNAGVLLLDASAIAADHSLCSAMQDLDR---ASKWTLGDQDHLNNVFAGRV 222 Query: 233 IFADIKYNTQFSLNYQLKE--SFINPVTNDTIF-----IHYIGPTKPW 273 D YN+ +S + + + P + + IH+ GP KPW Sbjct: 223 RLIDPAYNSSWSRTPRQRRYVERLGPAPAELTYAPDAIIHFHGPAKPW 270 >UniRef50_B7GNT4 Glycosyl transferase, family 8 n=5 Tax=Bifidobacterium RepID=B7GNT4_BIFLI Length = 1013 Score = 52.0 bits (123), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 57/245 (23%), Positives = 103/245 (42%), Gaps = 26/245 (10%) Query: 111 YFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMV-----VTEGQADWW 165 Y+RF+I N KVLYLD+DII G I L + D+ + V + Sbjct: 755 YYRFLIQQLLPN-YDKVLYLDSDIIIVGDIAKLYDIDLQDNLLGAVRDIDFLGNLNVKHG 813 Query: 166 EKRAHSLGVAGIAK--GYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDV 223 ++ +++ V + YF +G L++NT + + + + P I + DQDV Sbjct: 814 KRMSYAKDVLKMKNPYDYFQAGVLVLNTKGMRNRYSIEQWLTYASNPNYI----YNDQDV 869 Query: 224 LNMLLADKLIFADIKYNTQFSLNYQLK-----------ESFINPVTNDTIFIHYIGPTKP 272 LN K+++ ++N ++ ++++ +N I IHY G KP Sbjct: 870 LNAYCEGKVLYLPWEWNVVHDCGGRVGNLFTQAPNDVYDAYVKSRSNPQI-IHYAGYQKP 928 Query: 273 WHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKKHRYLKGFSNYLFY 332 W D DY S + +P+ + + +N+ + + L KH G N + Sbjct: 929 WVDPDCDY--SSIYWRYARETPFYERLIKRVVLANEPQIPEEVFLPKHERAVGEDNPIRK 986 Query: 333 FIEKI 337 F++ + Sbjct: 987 FVDPL 991 >UniRef50_B9QZ95 Glycosyl transferase family 8 n=1 Tax=Labrenzia alexandrii DFL-11 RepID=B9QZ95_9RHOB Length = 309 Score = 51.6 bits (122), Expect = 4e-05, Method: Compositional matrix adjust. Identities = 64/254 (25%), Positives = 103/254 (40%), Gaps = 27/254 (10%) Query: 30 DIAYGTDKNFLFGCGISIASILKYNEGSRLC-FHIFTDYFGDDDRKYFDALALQYKTR-- 86 +IA D L G ++I S L+++ S C H+ D + D+ L+ +K Sbjct: 4 NIAACADTKVLPGLAVTIRSSLEHS--SIPCRIHVLADRLSEQDKH---KLSNSWKPHPM 58 Query: 87 ---IKIYLINGDRLRSLPSTKNW-THAIYFRFVIADYFINKAPKVLYLDADIICQGTIEP 142 + Y I+ + ST + + Y R+ I+D F+ + K +YLD D++ + Sbjct: 59 CQDVVFYDIDYQNISKFRSTMYLKSKSAYSRYFISD-FLGEESKCIYLDCDLLVLRDLAE 117 Query: 143 LINFSFPDDKVAMV--VTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVS 200 L + V ++ AD L + YFNSG L+I+ +W +++ Sbjct: 118 LNTAKMHGKTIGSVRDISVRTADPHLFIGERLQLTN-PYDYFNSGVLIIDLDRW--RKLD 174 Query: 201 ARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTND 260 AR + E DQD LN+ F D +NT + P T + Sbjct: 175 ARNHLIDLTLERADTFHSQDQDALNVFFDGDTEFLDPVWNTS---------QYERPDTAE 225 Query: 261 TIFIHYIGPTKPWH 274 IH IG KPWH Sbjct: 226 NRIIHLIGTVKPWH 239 >UniRef50_C8N7M8 Putative uncharacterized protein n=1 Tax=Cardiobacterium hominis ATCC 15826 RepID=C8N7M8_9GAMM Length = 618 Score = 51.6 bits (122), Expect = 4e-05, Method: Compositional matrix adjust. Identities = 70/296 (23%), Positives = 116/296 (39%), Gaps = 59/296 (19%) Query: 18 YDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFD 77 Y V+T+ + + +D N+ G I SIL D+F D KY D Sbjct: 269 YAQPVQTDKPVVSVVIASDDNYTPHLGALICSIL--------------DHFPAD--KYLD 312 Query: 78 AL-------ALQYKTRIKI--------YLINGDRLRSLPSTKNWTHAIYFRFVIADYFIN 122 + AL K +++ +L D + L + +++ A ++R ++ D I Sbjct: 313 LIILDGGISALNRKLLMRLLPTHANIQFLELKDEFQQLATHMHFSRATFYRLIL-DKLIP 371 Query: 123 KAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVV----------------TEGQADWWE 166 KVLY+D D I I L + D + V T G Sbjct: 372 GRDKVLYIDCDTIVLDDISTLFDTPLGDHAIGAVFDYIMHHFCLNDVLSIDTTGSLPAKR 431 Query: 167 KRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNM 226 +G+ + YF +G +L N + +S I+ L + K+ DQD+LN Sbjct: 432 YLHDYVGLEDGWQRYFQAGVILFNMEKLRRLDLSEVMISDL----LNKRYWFLDQDILNK 487 Query: 227 LLADKLIFADIKYNTQFSLN--YQ-LKESFI---NPVTNDTIFIHYIG-PTKPWHD 275 +++ D ++N+ S+ YQ L ++I D IHY G TKPW++ Sbjct: 488 YFLGDVVYLDPRWNSVNSVQNIYQGLPATYIAELKTTETDPKIIHYAGFETKPWNN 543 >UniRef50_A5VK24 Glycosyl transferase, family 8 n=18 Tax=Lactobacillales RepID=A5VK24_LACRD Length = 282 Score = 51.2 bits (121), Expect = 5e-05, Method: Compositional matrix adjust. Identities = 51/194 (26%), Positives = 75/194 (38%), Gaps = 22/194 (11%) Query: 92 INGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDD 151 +N P T + IY+R + K+LYLDAD++C + L S Sbjct: 63 VNDQLFNKAPVTDRYPTTIYYRLLAHRLLPQDLHKILYLDADVLCINDLSSLYETSLDGY 122 Query: 152 KVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPE 211 A + + E A GY+NSG LL+N + + + + Sbjct: 123 LYASAIHTNLTNTTEVINKIRLQNFDADGYYNSGVLLMNLDTIRKK---------VKDTD 173 Query: 212 IIKKI-TH----PDQDVLNML-------LADKLIFADIKYNTQFSLNYQLKESFINPVTN 259 I I TH PDQDVLN L + D+L D + + E + V Sbjct: 174 IFNYIRTHTLLLPDQDVLNALYGRYIKSVPDQLYNFDTRKGGIYE-TISFGEWTTDWVMR 232 Query: 260 DTIFIHYIGPTKPW 273 +T+ +HY G KPW Sbjct: 233 NTVILHYCGRDKPW 246 >UniRef50_A7A7B4 Putative uncharacterized protein n=2 Tax=Bifidobacterium adolescentis L2-32 RepID=A7A7B4_BIFAD Length = 1009 Score = 50.4 bits (119), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 55/244 (22%), Positives = 95/244 (38%), Gaps = 24/244 (9%) Query: 111 YFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVV-TEGQADWWEKRA 169 Y+RF+I + KVLYLD+DII G I L N + + + A+ K Sbjct: 751 YYRFLIQK-VLPFYDKVLYLDSDIIINGDIAKLYNIDLQGKMLGAIRDIDFLANLNVKHG 809 Query: 170 HSLGVAGIA------KGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDV 223 +G A YF +G L++NT + + + P+ I + DQDV Sbjct: 810 KRMGYAQTVLKMKNPYDYFQAGVLVLNTKAMREHYTIKQWLTYASNPDFI----YNDQDV 865 Query: 224 LNMLLADKLIFADIKYNTQFSLNYQLKESFINP----------VTNDTIFIHYIGPTKPW 273 LN +++ ++N ++ F+ ND +HY G KPW Sbjct: 866 LNAHCEGNVLYLPWEWNVVHDCGGRVGNLFVQAPNDIYDAYMKSRNDPQIVHYAGFQKPW 925 Query: 274 HDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKKHRYLKGFSNYLFYF 333 D D+ + + + +P+ L + +N+ A + KH G N + Sbjct: 926 TDPDCDF--ASMYWKYARETPFYERLLKRVVKANESEIPAGVLRPKHERAVGEDNPIRKI 983 Query: 334 IEKI 337 ++ + Sbjct: 984 VDPL 987 >UniRef50_C8W7U9 Glycosyl transferase family 8 n=2 Tax=Atopobium RepID=C8W7U9_ATOPD Length = 1014 Score = 50.1 bits (118), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 40/188 (21%), Positives = 84/188 (44%), Gaps = 26/188 (13%) Query: 111 YFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMV-----VTEGQADWW 165 YFRF+ D ++ KV+YLD+D++ G + L + ++ +A + Sbjct: 759 YFRFLAQD-ILSAYDKVVYLDSDLVVNGNVAELYDVRIGNNLIAATLDIDYLANLNIRGG 817 Query: 166 EKRAHSLGVAGIAK--GYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDV 223 ++ +SL V + YF +G ++ NTA+ + + + P I + DQD+ Sbjct: 818 DRMKYSLDVLNLKNPYAYFQAGVMVFNTAELRRYHTVPEWLRIASNPIFI----YNDQDI 873 Query: 224 LNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIF------------IHYIGPTK 271 LN ++++ +N ++ + +E + P+ +++F +H+ G K Sbjct: 874 LNSECQGRVLYLPADWNVTHNIFGRAEELY--PMAPNSVFDDYQAARRAPKIVHFAGAIK 931 Query: 272 PWHDWAWD 279 PW + + D Sbjct: 932 PWQNASCD 939 >UniRef50_C0AA16 Lipopolysaccharide biosynthesis protein LPS:glycosyltransferase-like protein n=1 Tax=Opitutaceae bacterium TAV2 RepID=C0AA16_9BACT Length = 726 Score = 50.1 bits (118), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 66/305 (21%), Positives = 127/305 (41%), Gaps = 44/305 (14%) Query: 28 CLDIAYGTDKNFLFGCGISIASILKY-NEGSRLCFHIFTDYFGDDDRKYFDALALQYKTR 86 C++IA+ D F+ ++I SI+ + + I T+ + K+ D + Sbjct: 404 CINIAFNCDDKFVPYLCVAIKSIVATASTENNYDILILTEGLSPANLKWIDGIKHAKNVS 463 Query: 87 IKI-----YLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIE 141 +++ YL + D + S + Y R + + + K KVLYLD D+I Q + Sbjct: 464 LRVVNVRDYLQDKD-ISSFFMRSMVSRIAYVRLYLGE-LLEKYAKVLYLDCDLIAQSDVA 521 Query: 142 PLINFSF--------PDDKVAMVVTEGQADWWEKRAH---SLGVAGIAKGYFNSGFLLIN 190 L N + PD ++ + A + + + LGV I++ YFNSG ++ + Sbjct: 522 ELFNMNLDGNVCAAVPDLAISTETIKNVAAYRDIDVYLRDVLGVTDISQ-YFNSGVMVFD 580 Query: 191 TAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLK 250 + + IA + DQ+VLN L K++ ++N + SL + Sbjct: 581 LEKIRTDNLQQTFIAAAAKNTKF----FMDQNVLNSALYGKVLLLGFEWNKRVSLAMANR 636 Query: 251 ESFINPVTNDTIFIHYIGPTKPW--------HDWAWDYPVSQAFMEAKNASPWKNTALLK 302 ++ T ++ +H+ KP ++W W+Y F E + ++K Sbjct: 637 DT-----TTESKILHFAAEPKPLQKIHMPEHYNW-WEYARQLPFYEELLSR------VIK 684 Query: 303 PNNSN 307 P+++N Sbjct: 685 PSSTN 689 >UniRef50_A8FNA2 Putative uncharacterized protein n=2 Tax=Campylobacter jejuni subsp. jejuni 81116 RepID=A8FNA2_CAMJ8 Length = 791 Score = 50.1 bits (118), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 50/203 (24%), Positives = 85/203 (41%), Gaps = 27/203 (13%) Query: 106 WTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDK---VAMVVTEGQA 162 ++ A Y+RF I F + K++YLD DII + + L + F DK A + Q Sbjct: 466 FSEATYYRFFIPKIF-KEFKKIIYLDTDIIVKQDLNLLYSIDF--DKPLAAAKCMIFSQV 522 Query: 163 DWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQD 222 + R L + + YF +G ++ N + + + + L E +K DQD Sbjct: 523 KQADHRITKLKMKQ-PENYFQAGVMVYNIQKCLKMDFTQKCLNKLQE---LKDPPLVDQD 578 Query: 223 VLNMLLADKLIFADIKYNTQFSLNYQL-------KESFI---NPVTNDTIFIHYIGPTKP 272 VLN + + + +K+N ++++Y++ + F+ D IHY KP Sbjct: 579 VLNAVFEGDIHYISLKWNCLWNVSYRIPNFKILYSKDFLKDYQEAERDPYIIHYCDYFKP 638 Query: 273 WH-------DWAWDYPVSQAFME 288 W+ D W Y F E Sbjct: 639 WNSPHLPKADIWWHYARQTPFYE 661 >UniRef50_Q3DM64 Glycosyl transferase, family 8, degenerate n=6 Tax=Streptococcus agalactiae RepID=Q3DM64_STRAG Length = 394 Score = 50.1 bits (118), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 47/175 (26%), Positives = 75/175 (42%), Gaps = 26/175 (14%) Query: 107 THAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWE 166 T+ Y R+ I A KVLYLD D + ++ L D +A + Sbjct: 81 TYMAYARYYIPQLI--DAEKVLYLDIDTLVVDNLDKLFEIELGDYPIAAI---------- 128 Query: 167 KRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNM 226 L GI +FNSG +LIN+ W +V+ + + + E E+ I DQ VLN+ Sbjct: 129 -----LDGDGI---HFNSGVMLINSLYWMRYRVTEKLLE-ITERELDNGI-FGDQGVLNL 178 Query: 227 LLADKLIFADIKYNTQFSLN----YQLKESFINPVTNDTIFIHYIGPTKPWHDWA 277 L + + + KYN Q + Y+ + + + IHY KPW+ ++ Sbjct: 179 LFDNNWLKLEDKYNAQVGNDLGAFYENWQGYFDRNFESPTIIHYCTHDKPWNTFS 233 >UniRef50_C7TIE0 Glycosyl transferase, group 8 n=2 Tax=Lactobacillus rhamnosus RepID=C7TIE0_LACRL Length = 286 Score = 49.7 bits (117), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 45/192 (23%), Positives = 83/192 (43%), Gaps = 19/192 (9%) Query: 95 DRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVA 154 D++ + + + + +R + A Y + ++LYLD D++ I P+ + PDDK Sbjct: 76 DQVHTANTNTRYPSVVLWR-LFAPYIFSDTDRLLYLDNDVLICDDISPMFDM-LPDDKAI 133 Query: 155 MVVTEGQADWWEKRAHSLGVAGIA--KGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEI 212 V + Q + I YFNSG LLINT ++ + + +N + Sbjct: 134 GAVNDFQTLLYADTKEGSIWPEIKHFDSYFNSGVLLINTHKYIQAYTQDQLVNTINTSD- 192 Query: 213 IKKITHPDQDVLNMLLADKLIFADIKYNTQ--------FSLNYQLKES-FINPVTNDTIF 263 + DQ +LN L + I ++YN Q ++L+Y LK++ + + Sbjct: 193 ---YSFIDQTILNNLFESQSIHLPLQYNYQKDDEWLNGYALHYNLKQAKKMQAARKKVVI 249 Query: 264 IHYIGPTK--PW 273 H++ + PW Sbjct: 250 RHFVSEIRSLPW 261 >UniRef50_C6IJ37 General stress protein A n=2 Tax=Bacteroides RepID=C6IJ37_9BACE Length = 309 Score = 49.7 bits (117), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 54/214 (25%), Positives = 89/214 (41%), Gaps = 29/214 (13%) Query: 84 KTRIKIYLINGDR-----------LRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDA 132 + ++KI LI R L+ + + +T A +R ++ D + + KV+Y+D Sbjct: 50 RLKLKIQLIGEGRTCYSFVNLQGKLQHIYIDQKYTEAASYRLLLPD-LLPEYKKVIYIDC 108 Query: 133 DIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTA 192 DII + + L + V E D+ ++G Y NSGFL++N Sbjct: 109 DIIVRNDLVQLYHSIDLGMNYLAAVFEASMDFQLDHLKTIGCN--PNEYINSGFLIMNLE 166 Query: 193 QWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNT--QFSLNYQLK 250 + + I E + + PDQDVLN L D+++ YN+ F L Q K Sbjct: 167 LMRKDNMVEKFI----EASKVDYLEFPDQDVLNQLCKDRILALPPYYNSIRTFYLP-QYK 221 Query: 251 ESFINPVTNDTIF-------IHYIGPTKPWHDWA 277 + F+ T +HY G KPW+ + Sbjct: 222 KFFLQKYTEQDWLEVHRHGTVHYTG-AKPWNQFT 254 >UniRef50_B1LK07 Glycosyl transferase family 8 n=10 Tax=Enterobacteriaceae RepID=B1LK07_ECOSM Length = 630 Score = 49.3 bits (116), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 68/338 (20%), Positives = 136/338 (40%), Gaps = 62/338 (18%) Query: 36 DKNFLFGCGISIASILKYNEGSR----LCFHIFTDYFGDDDRKYFDALALQYKTRIKIYL 91 D N+ G I SI+ +++ SR + + + ++ +A ++ + Sbjct: 284 DNNYALSGGALINSIVLHSDASRNYDIVVLENKVSHL--NKQRLIKLVAGHNNISLRFFD 341 Query: 92 ING-DRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPD 150 +N + + + +++ + Y R I F + KV+++D+D + + + L++ Sbjct: 342 VNSFTEMSDVHTRAHFSASTYARLFIPQLF-REYKKVVFIDSDTVVKADLATLLDVEIGT 400 Query: 151 DKVAMV---VTEG---------------QADWWEKRAHSLGVAGIAKGYFNSGFLLINTA 192 + VA V V EG A+ + K+ +LG+ YF +G ++ N Sbjct: 401 NLVAAVKDIVMEGFVKFGTMSESDDGIMPAEQYLKK--TLGMTN-PDEYFQAGIIVFNVE 457 Query: 193 QWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNT----------- 241 Q + A+ ++ L KK DQD++N + ++ F +++N Sbjct: 458 QMVTENTFAQLMSALKA----KKYWFLDQDIMNKVFFGRVKFLPLEWNVYHGNGNTDDFF 513 Query: 242 ---QFSLNYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKN- 297 +FS + ++ NP IHY G KPW+ D+ F+E ++PW+ Sbjct: 514 PNLKFSTYMRFLQARRNPK-----MIHYAGENKPWNTEKVDF--YDDFLENVLSTPWEKE 566 Query: 298 -------TALLKPNNSNQLRYSAKHMLKKHRYLKGFSN 328 A + PN +L+ + K R L + N Sbjct: 567 IYYRQLPVATVVPNQHTELQQTVLLQTKIKRALMPYVN 604 >UniRef50_B9ADW8 Putative uncharacterized protein n=1 Tax=Methanobrevibacter smithii DSM 2375 RepID=B9ADW8_METSM Length = 223 Score = 48.5 bits (114), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 39/149 (26%), Positives = 67/149 (44%), Gaps = 4/149 (2%) Query: 97 LRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMV 156 L + +++ A Y + IA KV+YLD D + + + ++N ++ +A Sbjct: 40 LNKMSVKGDFSLATYSKLFIASLLPETVDKVIYLDCDALVLDSFKEILNLDL-NNYLAAG 98 Query: 157 VTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKI 216 V K+A L + Y N+G LLIN +W + V + + L E + K Sbjct: 99 VLALNCTAEVKKAIDLNEDDL---YINAGMLLINLKRWRQENVENQFLEKLVEFNLRGKH 155 Query: 217 THPDQDVLNMLLADKLIFADIKYNTQFSL 245 DQ V+N + + L+ + KYN + SL Sbjct: 156 FGMDQGVINNVSSKNLLVLNPKYNLEGSL 184 >UniRef50_A2DXT6 Glycosyl transferase family 8 protein n=1 Tax=Trichomonas vaginalis RepID=A2DXT6_TRIVA Length = 334 Score = 48.1 bits (113), Expect = 5e-04, Method: Compositional matrix adjust. Identities = 46/166 (27%), Positives = 73/166 (43%), Gaps = 28/166 (16%) Query: 111 YFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAH 170 Y R V +D + + L LD D + G+ + F++ +D A+VV D W++ Sbjct: 147 YIRIVFSDAH-PELERFLQLDGDTLVTGSFDEFY-FAYFNDTYAVVV----LDIWKE--- 197 Query: 171 SLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLAD 230 G K YFN G ++ N ++ +++ + L E E+ + + DQ VLN + D Sbjct: 198 ---YEGF-KNYFNCGSVVFNCQKFRDDKMADKVRTKLKEYEVTRGEWNNDQTVLNDIFGD 253 Query: 231 KLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIG----PTKP 272 K I A KYN F+ +T T H+ G P KP Sbjct: 254 KKIIAHKKYN-----------EFMPSLTMQTRIFHFYGLKKKPYKP 288 >UniRef50_C4ZG45 Glycosyltransferase Family 2 modular protein n=1 Tax=Eubacterium rectale ATCC 33656 RepID=C4ZG45_EUBR3 Length = 723 Score = 47.8 bits (112), Expect = 5e-04, Method: Compositional matrix adjust. Identities = 54/221 (24%), Positives = 96/221 (43%), Gaps = 17/221 (7%) Query: 27 LCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIF-TDYFGDDDRKYFDALALQYKT 85 +CL I + D N+ G ++ SI++ N + + FHI D + ++ +A Sbjct: 345 ICLGI-HDKDGNYSVWAGTTMQSIVE-NTKAPIVFHILHDDTLNEMNKNKLSLIADNSGN 402 Query: 86 RIKIYLINGDRLRSLPSTKN-WTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLI 144 I+ + N D SL + N +T FR ++ D + K++YLD+D+ IE L Sbjct: 403 GIEFHHFNPDIFGSLADSMNRFTIGTMFRIMLPD-IMPDLKKIIYLDSDLFVNTDIEELW 461 Query: 145 NFSFPDDKVAMVVTEGQA---DWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSA 201 N + D + + + +W A + G + YFN+G L +N Sbjct: 462 NLNI--DNYCLAAAQDCSTIRNWGTPYAVAAGQTSRDR-YFNAGVLCMNLDNIRKNGSLF 518 Query: 202 RAIA--MLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYN 240 + + + + P + PDQD LN + + K + D K+N Sbjct: 519 QQVMDYLSDNP----RTWLPDQDALNAIFSGKTLLIDEKWN 555 >UniRef50_UPI00016C0890 glycosyl transferase family 8 protein n=2 Tax=Epulopiscium sp. 'N.t. morphotype B' RepID=UPI00016C0890 Length = 593 Score = 47.4 bits (111), Expect = 7e-04, Method: Compositional matrix adjust. Identities = 65/283 (22%), Positives = 113/283 (39%), Gaps = 51/283 (18%) Query: 31 IAYGTDKNFLFGCGISIASILKY-NEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKI 89 I TD F+ G ++ S++K N + IF + + + +Q RI Sbjct: 288 IVLTTDDRFIIGAAATLISLVKTSNVNNNYDIIIFHKDLSEKSKTLLRNVVVQ---RINF 344 Query: 90 YLINGDRLRSLPS------TKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPL 143 L D + + NW +YF+ +I ++ K L+LD D+I I L Sbjct: 345 SLRFYDVGYEMSTYNVYKPGNNWQPCVYFKLLIPS-IMHNYKKSLHLDCDLIILEDIANL 403 Query: 144 INFSFPDDKVA------MVVTEGQADWWEKRAHS-LGVAGIAKGYFNSGFLLINTAQWAA 196 ++ + VA + T + W K H L + + + YFN G ++ N ++ Sbjct: 404 LSIDLKGNAVAGCAEMGCITTSIRRTWANKYYHEKLRITNMVE-YFNGGVIVFNINEF-- 460 Query: 197 QQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINP 256 ++++ A +L+E E KK + +QD+L+ + + +N L F+ Sbjct: 461 HKITSLA-QLLHEAE--KKHLNLEQDILSKSFVNHIYLLPQSWN--------LTRDFLGT 509 Query: 257 VTN-------DTIF------------IHYIGPTKPWHDWAWDY 280 V N I+ IHYIGP KPW + +Y Sbjct: 510 VMNLYKQYLPSNIYQKYLDARQKPKIIHYIGPLKPWDNPNLEY 552 >UniRef50_D1N145 Glycosyl transferase family 8 n=1 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1N145_9BACT Length = 311 Score = 46.6 bits (109), Expect = 0.001, Method: Compositional matrix adjust. Identities = 59/251 (23%), Positives = 106/251 (42%), Gaps = 15/251 (5%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 + +A TD+N+L ++ AS+L + G + H+ + + D F+AL R+ Sbjct: 5 IQVAMATDRNYLDYALVAAASLLAQHPGGGITLHLLHEELDESDFARFEALRRIDGFRLV 64 Query: 89 IYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSF 148 I + P + W+ + Y+R ++ + K+LYLD D++ I L N Sbjct: 65 PRKIERGFFQGWPELR-WSTSAYYRLILPS-LLPDLEKILYLDCDLLVLDDIAELWNTEL 122 Query: 149 PDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLN 208 A + +K +G+ A YFNSG +L N + A + R I + + Sbjct: 123 GSRSCAAAAVRVAPEHQKK----IGLPAEAV-YFNSGVMLFNLRKMAHENHEKRFIRLFD 177 Query: 209 EPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQ------LKESFINPVTNDTI 262 E+ +I +PDQD+LN+ + + ++N S+ E+ + Sbjct: 178 --ELGGRIKYPDQDILNLAYWNDYVKLSQRWNLVTSVYRNPPTPALYSEAEVVEALRRPG 235 Query: 263 FIHYIGPTKPW 273 H+ G KPW Sbjct: 236 IAHFTGTHKPW 246 >UniRef50_B3WD32 Glycosyl transferase n=9 Tax=Lactobacillus RepID=B3WD32_LACCB Length = 279 Score = 45.8 bits (107), Expect = 0.002, Method: Compositional matrix adjust. Identities = 44/157 (28%), Positives = 63/157 (40%), Gaps = 24/157 (15%) Query: 117 ADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAG 176 AD +VLYLD DI+C+ + L D +A V+ + WW H L Sbjct: 104 ADLIPELPDRVLYLDTDIVCRRSFSNLYQEPMKDVDIAGVL-DHYGKWWFH--HKLTWF- 159 Query: 177 IAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFAD 236 Y NSG LL+N A + R ++ + + PDQ LN++ K I Sbjct: 160 ---DYINSGVLLMNLASIRQDGLLVRCRRLIRH----RWLFMPDQSALNIIAKSKQILPR 212 Query: 237 IKYNTQFSLNYQLKESFINPVTNDTIFIHYIGPTKPW 273 KYN Q + V DT+F H+ + W Sbjct: 213 -KYNEQ------------HKVETDTVFQHFTTSFRFW 236 >UniRef50_Q92VQ2 Putative lipopolysaccharide 1,3-galactosyltransferase n=1 Tax=Sinorhizobium meliloti RepID=Q92VQ2_RHIME Length = 337 Score = 45.4 bits (106), Expect = 0.003, Method: Compositional matrix adjust. Identities = 63/270 (23%), Positives = 102/270 (37%), Gaps = 47/270 (17%) Query: 35 TDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDD---RKYFDALALQYKTRIKIYL 91 TD+N+ + S ++ +G+ +F G +D R++ +A+A T+IK+ Sbjct: 24 TDQNYALPTFSAALSADQHTKGADTAIRMFV--VGAEDTWARQFDEAVA---GTKIKVI- 77 Query: 92 INGDRLRSLPSTKNWTHAIYF------RFVIADYFINKAPKVLYLDADIICQGTIEPLIN 145 RL L + Y RF I + LY+D D + G ++ L+ Sbjct: 78 --AARLPQLAELSPYHRDHYLPPIALARFWIDSLLDAGVDRFLYIDGDTMVDGELDSLLA 135 Query: 146 FSFPDDKVAMV----------VTEGQADWWEKR--AHSLGVAGIAKGYFNSGFLLINTAQ 193 + P + + V+ G+ KR AH G+ + YFNSG + + Sbjct: 136 STPPAEGLMAAPDFLNIFMDEVSRGK-----KRDLAHLEGIGCRPETYFNSGVIYASREA 190 Query: 194 WAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQL--KE 251 W V M+ PE DQ LN ++ ++YN Q L + Sbjct: 191 W-NDIVPVAMKFMVEHPE---HCPASDQSALNHAARGRVTMLSLRYNYQSEHMMVLDPRR 246 Query: 252 SFINPVTNDTIFIHYIGPTKPWH--DWAWD 279 I P H+ G KPW+ W WD Sbjct: 247 RGIGPA-----IWHFTGGPKPWNTPGWPWD 271 >UniRef50_A3VFX3 Glycosyltransferase n=1 Tax=Rhodobacterales bacterium HTCC2654 RepID=A3VFX3_9RHOB Length = 615 Score = 45.1 bits (105), Expect = 0.003, Method: Compositional matrix adjust. Identities = 60/291 (20%), Positives = 123/291 (42%), Gaps = 39/291 (13%) Query: 15 VIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIF--TDYFGDDD 72 V+ + + +++A+ +D+ +L +AS++++ R +++F + GD D Sbjct: 255 VVPFARGARFNDGAVNVAFTSDRPYLPQTAAMVASLIEHAAPDR-EYNLFYLHENIGDRD 313 Query: 73 RKYFDALALQYKTRIKIYLIN-GDRLRSLPSTKNWT--HAIYFRFVIADYFINKAPKVLY 129 +LA+ + I ++ IN G ++ T +A Y RF++ D + +++Y Sbjct: 314 LDLLRSLAVAHGN-ITLHTINVGTAFSREYRARHHTPSNATYNRFLLFD-LLPDVERLVY 371 Query: 130 LDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHS---------------LGV 174 LD D++ G + L + D +A V R + LG+ Sbjct: 372 LDVDLVLCGDVAELFDTDMNDAPLAAVTDALMTRVLATRVRTRDPEVPDLYAYLSDDLGL 431 Query: 175 AG--IAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKL 232 + I++ YFN+G +++N A +V L E + DQD+LN+ D+ Sbjct: 432 SDDQISR-YFNAGVMVMNFAAMDVAKVGRE----LREMVAGNRYFFRDQDILNVYFRDRF 486 Query: 233 IFADIKYNTQFSLNYQLKESFINPVTNDTI-------FIHYIGP-TKPWHD 275 + ++N S + ++ P+ ND + +H+ KPW + Sbjct: 487 VTLPSRFNVHNS-DRGAYDNVPVPIRNDALAAKADPFIVHFAAAHQKPWRE 536 >UniRef50_Q1CSY7 Lipopolysaccharide 1,2-glycosyltransferase n=3 Tax=Helicobacter pylori RepID=Q1CSY7_HELPH Length = 341 Score = 45.1 bits (105), Expect = 0.004, Method: Compositional matrix adjust. Identities = 41/203 (20%), Positives = 83/203 (40%), Gaps = 33/203 (16%) Query: 102 STKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKV-------- 153 + K ++ I R ++A F ++ K++ D D + G I +F P D V Sbjct: 78 AKKRFSKMILCRLLLASIF-SQYEKIIMFDVDTLFVGDISE--SFFIPMDGVYFGATKED 134 Query: 154 -AMVVTEGQADWWEKR---AHSLGVAGIAKGY------------FNSGFLLINTAQWAAQ 197 +++ D + R + +GV K FN+GF+L+N A W Sbjct: 135 FSLIGIHNANDLFSSRLNWSRGMGVKLNHKSLIFQEVEILYENPFNAGFMLVNLALWREH 194 Query: 198 QVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPV 257 + + I + + + P+QD+ ++ ++ KYN ++ ++ + + P Sbjct: 195 HLEEKLIDFFKTRD--EGLLLPEQDLFVLVCQGCILEMPCKYN----VHPRMVGTRMIPK 248 Query: 258 TNDTIFIHYIGPTKPWHDWAWDY 280 +D +H+ KPW + + Y Sbjct: 249 KSDACMLHFYADEKPWKHFRYPY 271 >UniRef50_C0X9Z7 Putative uncharacterized protein n=2 Tax=Lactobacillus gasseri JV-V03 RepID=C0X9Z7_9LACO Length = 416 Score = 44.3 bits (103), Expect = 0.006, Method: Compositional matrix adjust. Identities = 64/260 (24%), Positives = 103/260 (39%), Gaps = 34/260 (13%) Query: 31 IAYGTDKNFLFGCGISIASIL---KYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRI 87 IA + ++ +I SIL K E L + I ++F + +R A Q +RI Sbjct: 5 IALSANYGYIDKIETTIKSILYNVKNVEIHLLNYDIPQEWFANINR-----YANQIGSRI 59 Query: 88 KIYLINGDRLRSLPST-KNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINF 146 + + L L S K+ Y R +I KA +VLYLD+D++ I+ L + Sbjct: 60 IDEKFDPEELHDLNSGFKHINQMTYARLLIPKLI--KANRVLYLDSDLVVDDEIDELFSR 117 Query: 147 SFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAM 206 F K+ V ++ R + + N+G LLIN + + + Sbjct: 118 KFNGKKILAV-----THIFDVRNKNESRVDLPVPSINAGVLLINNQELRKDHNLSEKLL- 171 Query: 207 LNEPEIIKKITHP--DQDVLNMLLADKLIFADIKYNTQFSL----------NYQLKESFI 254 + +K P DQD +N D++ KYN Q N + + Sbjct: 172 ----DFARKNNFPQDDQDTINNWFKDEIGSLSFKYNYQIGADRFLFWSNNSNTETATEIL 227 Query: 255 NPVTNDTIFIHYIGPTKPWH 274 + V N I IHYI KP++ Sbjct: 228 DKVKNPKI-IHYISDDKPFN 246 >UniRef50_B9BAZ6 Glycosyl transferase family 8 protein n=1 Tax=Burkholderia multivorans CGD1 RepID=B9BAZ6_9BURK Length = 617 Score = 44.3 bits (103), Expect = 0.006, Method: Compositional matrix adjust. Identities = 58/275 (21%), Positives = 106/275 (38%), Gaps = 39/275 (14%) Query: 28 CLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRI 87 + I D NF+ IAS+ + R+ I D D++ + Sbjct: 283 AVSIVTVADGNFVPHLAAFIASVQDNIDPERVLDLIVLDGGIPADQQRLLMKQFHRNGKG 342 Query: 88 KIYLINGDRLRS-LPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINF 146 ++ I L S +P ++ A ++R + + + K +V+Y+D+D I G + L + Sbjct: 343 RLSFIQCAHLFSDIPLHGPFSAATFYRLSMGE-LLAKHRRVVYVDSDTIVLGDLSELFDL 401 Query: 147 SFPDDKVAMV--------VTEGQADWWEKRA--------HSLGVAGIAKGYFNSGFLLIN 190 ++ VA V V+ G E +G+ YF +G ++I+ Sbjct: 402 DLGNNAVAAVPDVIMKSFVSSGVPALREAGGAPAGIYLKERVGMGNRGNEYFQAGLIVID 461 Query: 191 TAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNT--------- 241 ++ ++ A L + ++ DQDVLN L + F D+ +N Sbjct: 462 LDEFRRLRIGEDAYKDL----LARRYWFLDQDVLNKYLLGHVKFLDLSWNVVNASMDVLS 517 Query: 242 --QFSLNYQLKESFINPVTNDTIFIHYIG-PTKPW 273 + + ++KE F P +HY G KPW Sbjct: 518 GLETDIAAKVKEVFAAPS-----MVHYAGHEAKPW 547 >UniRef50_C6DEN1 Glycosyl transferase family 8 n=1 Tax=Pectobacterium carotovorum subsp. carotovorum PC1 RepID=C6DEN1_PECCP Length = 602 Score = 43.9 bits (102), Expect = 0.008, Method: Compositional matrix adjust. Identities = 40/157 (25%), Positives = 67/157 (42%), Gaps = 26/157 (16%) Query: 126 KVLYLDADIICQGTIEPLINF---SFP----DDKVAMVVTEGQADWWEKRAHSLGVAGIA 178 + LYLD+D++ Q + PL+ FP D+V +V H++ + GI Sbjct: 434 RALYLDSDVVIQSSPLPLLYMDMEEFPLAACHDQVGPLVD-----------HAVTLHGIP 482 Query: 179 KG-YFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADI 237 G YFNSG +L++ A AI + + + + DQ LN + + D Sbjct: 483 NGRYFNSGVMLLDFHHPATLPAIEAAITYSEDTDSV--LIFQDQCALNKAIRGLYLTLDG 540 Query: 238 KYNTQFSLNYQLKESFINPVTNDTIFIHYIGPTKPWH 274 KYN Y ++ + + +H++ KPWH Sbjct: 541 KYNC-----YMPPGRPMSAMYENAAIVHFVSTPKPWH 572 >UniRef50_A9S2B3 Predicted protein (Fragment) n=1 Tax=Physcomitrella patens subsp. patens RepID=A9S2B3_PHYPA Length = 275 Score = 43.9 bits (102), Expect = 0.009, Method: Compositional matrix adjust. Identities = 60/256 (23%), Positives = 101/256 (39%), Gaps = 27/256 (10%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNE-GSRLCFHIFTDYFGDDDRKYFDALALQYKTRI 87 + IA D N+L G +I SIL + E S + FH + K F A+ + + Sbjct: 11 VHIAMTLDANYLRGSMAAIYSILLHAECASNVRFHFVATKEKKNKCKSFCRSAMYFYSCE 70 Query: 88 KIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 + LI N Y RF +A + +++YLD D++ G IE L + Sbjct: 71 LLKLIYSSDFVITQEPLN-----YARFYLAHMIDSCVKRIIYLDLDVLVLGRIEELWMTN 125 Query: 148 FPDDKV-------AMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVS 200 + V A + ++W + + A YFNSG +LIN +W + + Sbjct: 126 MGNSTVGTPEYCHANFPSYFTENFWINSSLASTFANKQPCYFNSGMMLINLERWRKTRCT 185 Query: 201 ARA---IAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPV 257 + + + + I + + P L + A + D ++N Q L + V Sbjct: 186 STLEYWMEVQKQQHIYELGSLPP---LLLTFAGSIQAIDNRWN-QHGLGGDI-------V 234 Query: 258 TNDTIFIHYIGPTKPW 273 D +H+ G KPW Sbjct: 235 KGDCRSLHWSGGGKPW 250 >UniRef50_B2UQN6 Glycosyl transferase family 8 n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UQN6_AKKM8 Length = 371 Score = 43.9 bits (102), Expect = 0.009, Method: Compositional matrix adjust. Identities = 55/225 (24%), Positives = 85/225 (37%), Gaps = 56/225 (24%) Query: 81 LQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTI 140 LQ + NG++ R P Y R + F +++YLDAD++ G + Sbjct: 84 LQLPEEFRHLFQNGNKDRYSPLA-------YARLMAGSLFPQYG-RIVYLDADVLLAGDV 135 Query: 141 EPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKG------------------YF 182 L F D + A V G L + I KG Y Sbjct: 136 AELY---FSDLRGASVAAAGDG---------LALWSIEKGTMHPHLEYMGNYLSFPLSYC 183 Query: 183 NSGFLLINTAQWAAQQVSARAIAML-NEPEIIKKITHPDQDVLNMLLADKLIFADIKYNT 241 NSG L+++ Q + + R + L + P+ +PDQD+LN+ L + ++N Sbjct: 184 NSGVLVLDLDQMRRRNLEHRLLQQLRSRPD---PFPYPDQDILNIALHGDMTTLPPEWNF 240 Query: 242 QFSLNYQLKE---------SFINPVT----NDTIFIHYIGPTKPW 273 QF L++ E F N T +H +GP KPW Sbjct: 241 QF-LSWTWDEEKTRLLRGTEFENVPTISCGRSWKLLHMVGPEKPW 284 >UniRef50_C0BAU3 Putative uncharacterized protein n=1 Tax=Coprococcus comes ATCC 27758 RepID=C0BAU3_9FIRM Length = 348 Score = 43.5 bits (101), Expect = 0.010, Method: Compositional matrix adjust. Identities = 54/209 (25%), Positives = 87/209 (41%), Gaps = 42/209 (20%) Query: 111 YFRF----VIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVV---TEGQAD 163 YFR ++ADY K +Y+D+D++ I L +A T G + Sbjct: 98 YFRLLMPQILADY-----DKAVYIDSDLVVNADIAELYATDVDGYLLAAAKDADTAGLYN 152 Query: 164 WWE--KRAHSLGVAGIAK--GYFNSGFLLINTAQWAAQQVSARAI--AMLNEPEIIKKIT 217 +E K+ + + I K YF +G ++ N A++ +A + A E E++ Sbjct: 153 GFEPNKKKYMDTILKIKKPYEYFQAGVIVFNLAEFRKTYTTAEMLKFAASYEWELL---- 208 Query: 218 HPDQDVLNMLLADKLIFADIKYNT-------QFSLNYQLKESFIN----PVTNDTIFIHY 266 DQDVLN L ++ F D+ +N + S L +++ + IHY Sbjct: 209 --DQDVLNYLAQGRVKFVDMAWNVMVDWRGIRLSQIIALAPKYLHDEHMEARKNPKIIHY 266 Query: 267 IGPTKPWHD-WA------WDYPVSQAFME 288 GP KPWH W+ W Y + F E Sbjct: 267 AGPDKPWHQPWSDMAEEFWKYSRNTVFYE 295 >UniRef50_Q38VG7 Putative glycosyl transferase, family 8 n=1 Tax=Lactobacillus sakei subsp. sakei 23K RepID=Q38VG7_LACSS Length = 304 Score = 43.1 bits (100), Expect = 0.015, Method: Compositional matrix adjust. Identities = 43/204 (21%), Positives = 86/204 (42%), Gaps = 13/204 (6%) Query: 43 CGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSLPS 102 ISIA++LK + + I T + K + L K I+ ++ + Sbjct: 1 MSISIATLLKKHMEDEINIFIITSNISEKYIKVIEGLFNNPKH--NIFWVSMPEIDIPLE 58 Query: 103 TKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQA 162 T + A Y R +++YLD D + + + L ++ + + + + Sbjct: 59 TDRGSLAQYGRLFFDRLIPENIQRLIYLDCDTLIEENLRELWVTDLGENTIG-IARDAFS 117 Query: 163 DWWEKRAHSLGVAGIAKG--YFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPD 220 D ++K + G+ K FNSG ++I+ W +++ R I +L E +I+ D Sbjct: 118 DRYKK------LLGLEKDSELFNSGVMIIDRGSWNEKRIEDRIIDLLTEKR--GRISQGD 169 Query: 221 QDVLNMLLADKLIFADIKYNTQFS 244 Q V++++ + D K+N+ S Sbjct: 170 QGVIDIIFQNDAKILDPKWNSMSS 193 >UniRef50_C6DEN2 Glycosyl transferase family 8 n=1 Tax=Pectobacterium carotovorum subsp. carotovorum PC1 RepID=C6DEN2_PECCP Length = 615 Score = 42.7 bits (99), Expect = 0.018, Method: Compositional matrix adjust. Identities = 37/149 (24%), Positives = 66/149 (44%), Gaps = 12/149 (8%) Query: 126 KVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKG-YFNS 184 + LYLD+D++ + + L++ +A + + ++ + GI G YFNS Sbjct: 443 RALYLDSDVVIRRSPLGLLHMDMGGYPLAARTERAH----PRISRAIKLHGIPNGRYFNS 498 Query: 185 GFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFS 244 G LL++ A Q AIA ++ K+ + DQ LN + + D K+N Sbjct: 499 GILLLDFQHPATQSTLNTAIAY--SEQLDNKLLYLDQCALNKSIQGLYLDLDEKFNW--- 553 Query: 245 LNYQLKESFINPVTNDTIFIHYIGPTKPW 273 + + + +P D +H+I KPW Sbjct: 554 --FIVPDDTAHPQDEDAAIMHFISTPKPW 580 >UniRef50_Q5UNW1 Uncharacterized protein R707 n=1 Tax=Acanthamoeba polyphaga mimivirus RepID=YR707_MIMIV Length = 281 Score = 42.7 bits (99), Expect = 0.019, Method: Compositional matrix adjust. Identities = 37/151 (24%), Positives = 66/151 (43%), Gaps = 13/151 (8%) Query: 126 KVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVA-GIAKGYFNS 184 K++ LD D+I I+ L S P A + + +K + + G G N+ Sbjct: 96 KIILLDLDMIIAKNIDHLFKLSAP----AACLKRFHIPYGQKIPPKMICSNGKLVGSINA 151 Query: 185 GFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFS 244 G +L+ + + + + + I K +P+QD L++ +K YN QF Sbjct: 152 GLMLLEPDKREWEDIKKDIV----KENFIGKFKYPEQDYLSLRYCNKWTSITFNYNFQFG 207 Query: 245 LNYQLKESFINPVTNDTIF-IHYIGPTKPWH 274 L +++K+ T D I+ IH+ KPW+ Sbjct: 208 LTHRVKKYH---YTIDNIYVIHFSSSYKPWN 235 >UniRef50_A4IXE1 Glycosyl transferase, family 8 n=16 Tax=Francisella RepID=A4IXE1_FRATW Length = 296 Score = 42.4 bits (98), Expect = 0.023, Method: Compositional matrix adjust. Identities = 59/267 (22%), Positives = 109/267 (40%), Gaps = 30/267 (11%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKY---FDALALQYKT 85 + I + DKN + G ++I S++ + C+ I+ Y + ++K F+++ + K Sbjct: 4 IPIVFTFDKNIILGGAVTIKSLIDH-ANPDTCYDIYV-YHPNINKKSISAFNSMIEKTKH 61 Query: 86 RIKIYLINGDRLRSLP--STKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPL 143 I + ++ + +P + + W ++R +I + + KV+Y D D++ Q + + Sbjct: 62 SISFHNVDESIFKDVPIDTRRGWI-ITFYRLLIPK-LLPQYDKVIYSDVDVLFQSDMSEV 119 Query: 144 INFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARA 203 N + A V+ E H + GF+++NT +R Sbjct: 120 YNTDLTSYEWAGVIAEKHQQ--NMVQHKYFKENNNSYIYWPGFMVMNTKLMRENNFISRC 177 Query: 204 IAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQ-----------LKES 252 ++ E ++ D DVLN L K+ KY T S+ Y LKE Sbjct: 178 FDTMH--EFNTRLKFRDLDVLN-LTCRKIKSLPFKYVTLQSIYYLNTIQEAPEYIFLKEI 234 Query: 253 FIN----PVTNDTIFIHYIG-PTKPWH 274 + + N+ IHY G P KPW Sbjct: 235 YSDNELLDAKNNPAIIHYAGSPGKPWR 261 >UniRef50_A4UX79 LPS biosynthesis protein n=2 Tax=Lactobacillaceae RepID=A4UX79_9LACO Length = 186 Score = 42.4 bits (98), Expect = 0.023, Method: Compositional matrix adjust. Identities = 43/149 (28%), Positives = 62/149 (41%), Gaps = 26/149 (17%) Query: 126 KVLYLDADIICQGTIEPLINFSFPD-DKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNS 184 ++LYLDAD++C+ E + S D V ++ G+ W+ A Y NS Sbjct: 15 RILYLDADVVCRRPFEDFYHQSLAGTDFVGVLDHYGR--WFFHHQQR------AFDYINS 66 Query: 185 GFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFS 244 G LL+N ++ AR + +I PDQ +N L K FA KYN Q Sbjct: 67 GMLLMNLDMIRQDKLLARCRECCRKWPMIM----PDQSAMNKLAKHK-AFAPEKYNEQ-- 119 Query: 245 LNYQLKESFINPVTNDTIFIHYIGPTKPW 273 V +DT+F H+ K W Sbjct: 120 ----------QDVQSDTVFQHFSTRWKLW 138 >UniRef50_UPI0001693121 general stress protein n=1 Tax=Paenibacillus larvae subsp. larvae BRL-230010 RepID=UPI0001693121 Length = 352 Score = 42.4 bits (98), Expect = 0.024, Method: Compositional matrix adjust. Identities = 62/272 (22%), Positives = 103/272 (37%), Gaps = 49/272 (18%) Query: 43 CGISIASILKYNEGSRLCFHIFTD-YFGDDDRKYFDALALQYKTRIKIY--LINGDRLRS 99 G +AS+ N S + HI D + +++ L + I Y I + L++ Sbjct: 19 AGAVLASVF-CNTSSSVNVHILHDETLTEANKQKLIELTSSFNQTIHFYPVTIPDNMLQA 77 Query: 100 LPSTKN---WTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMV 156 + K+ WT A +R +I K++YLD D++ I L D +A V Sbjct: 78 MAGVKSISFWTQASMYRLLIPALI--PVDKIIYLDCDVLVNMNIAELWEVQLGDFYLAAV 135 Query: 157 VTEGQADWWEKR-----AHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARA---IAMLN 208 W++ H + YFNSG +L +A + + MLN Sbjct: 136 --------WDQAIMAAVQHIIPYGLNPDSYFNSGVIL-----FALNNIRKKIDWYEEMLN 182 Query: 209 EPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIF----I 264 + PDQD LN + + + D ++N F N V+ F + Sbjct: 183 FLRRYPDTSMPDQDTLNAVFGENYLQLDRRFN------------FFNMVSPHHDFNNKIV 230 Query: 265 HYIGPTKPWHDWAWDYPVSQAFMEAKNASPWK 296 H+ G K W + P + + E + +PWK Sbjct: 231 HFAGSEKCWDVHS---PGANLYQEYLSLTPWK 259 >UniRef50_B6JMU4 Lipopolysaccharide biosynthesis protein n=20 Tax=Helicobacter RepID=B6JMU4_HELP2 Length = 398 Score = 42.0 bits (97), Expect = 0.028, Method: Compositional matrix adjust. Identities = 62/282 (21%), Positives = 111/282 (39%), Gaps = 44/282 (15%) Query: 20 HKVETENLCLDIAYGTDKNFLFGCGISIASIL----KYNEGSRLCFHIFTDYFGDDDRKY 75 H + ++ + IA+ DKN+L G + S+L K N+ R H ++D+ Sbjct: 7 HSFKEQDFHIPIAFAFDKNYLIPAGACLYSLLESIAKANKKIRYTLHALVVGLNEEDKAK 66 Query: 76 FDALALQYKTRIKIYLINGDR-LRSLPS------TKNWTHAIYFRFVIADYFINKAPKVL 128 + + +K + + + + L ++P+ TK ++ + ++ +AD F K K++ Sbjct: 67 LNQITEPFKEFAVLEVKDIEPFLDTIPNPFDEDFTKRFSKMVLVKYFLADLF-PKYSKMV 125 Query: 129 YLDADII-CQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFL 187 + D D+I C +N + E +++ GV + K + GFL Sbjct: 126 WSDVDVIFCNEFSADFLN-----------IKENDENYF------YGVLEVEKHHMMEGFL 168 Query: 188 LINTAQWAAQQVSARAIAMLNEPEI-----IKKITHPDQDVLNMLLADKLIFADIKYNTQ 242 N + + R +L E K P+ L + + IK + Sbjct: 169 FCNLDYQRKKNFTLRMHDLLKGNEAKGELDFTKWCWPNMKALGIEYCVFPYYYTIKDFSN 228 Query: 243 FSLNYQLKESFINPVTNDTIFIHY---IGPTKPWHDWAWDYP 281 LN K++ + N TI IHY G KP WDYP Sbjct: 229 AYLNENYKKTILEARENPTI-IHYDAWWGAVKP-----WDYP 264 >UniRef50_A4MXF8 Biotin--protein ligase n=6 Tax=Haemophilus influenzae RepID=A4MXF8_HAEIN Length = 172 Score = 42.0 bits (97), Expect = 0.029, Method: Compositional matrix adjust. Identities = 30/94 (31%), Positives = 49/94 (52%), Gaps = 5/94 (5%) Query: 179 KGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIK 238 K YFN+G L +N ++ S + ++ + + + DQD+LN D+ I D + Sbjct: 38 KDYFNAGVLYLNMEKYQLGISSFSKELITLHTQLKESLIYGDQDILNYYFEDRWIPLDKR 97 Query: 239 YNTQFSLNYQLKESFINPVTNDTIFIHYIGPTKP 272 YN F L++ + SF + T+ IF H+ GP KP Sbjct: 98 YN--FQLDHMI--SFDSLDTSPNIF-HFTGPHKP 126 >UniRef50_A2RLV8 Putative glycosyltransferase n=1 Tax=Lactococcus lactis subsp. cremoris MG1363 RepID=A2RLV8_LACLM Length = 397 Score = 42.0 bits (97), Expect = 0.034, Method: Compositional matrix adjust. Identities = 47/220 (21%), Positives = 91/220 (41%), Gaps = 24/220 (10%) Query: 115 VIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGV 174 + Y + + ++LYLD+D + E + P DK+ V+ + ++ S Sbjct: 93 IFMPYSLEEYSQLLYLDSDTLIYEGFEEIFGL-LPQDKILGVIPDFYFFAINEKNSS--- 148 Query: 175 AGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIF 234 +GYFNSG +IN ++ Q +++ + N E +I + DQ LN +L + Sbjct: 149 ---KRGYFNSGVYMINVEKYI--QKNSKEELLKNLMENFSEILYVDQTFLNNTFRGELFY 203 Query: 235 ADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFME-AKNAS 293 +++N Q N+ + + + +FI K H F+E ++ Sbjct: 204 LPLRFNYQKDDNWLNNWAILEAPESSQLFIKERANIKIRH-----------FIEFGSHSM 252 Query: 294 PWKNTALLKPNNS---NQLRYSAKHMLKKHRYLKGFSNYL 330 PW++ + N ++ +KKHR +K +L Sbjct: 253 PWQHIEVRDQFEEYFWNVWNVLKEYRVKKHRPIKSLKMFL 292 >UniRef50_Q17VR5 Lipopolysaccharide biosynthesis protein n=19 Tax=Helicobacter RepID=Q17VR5_HELAH Length = 405 Score = 40.8 bits (94), Expect = 0.066, Method: Compositional matrix adjust. Identities = 52/196 (26%), Positives = 81/196 (41%), Gaps = 35/196 (17%) Query: 102 STKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIE-----PLINFSF----PDDK 152 S K ++ I R ++A F + K++ D D + G I PL F D Sbjct: 123 SQKRFSKMIMCRLLLASLF-PQYDKMIMFDVDTLFVGDISESFFIPLEAHYFGAVREKDL 181 Query: 153 VAMVVTEGQADWWE---KRAHSLGVAG----------IAKGYFNSGFLLINTAQWAAQQV 199 +AM + D +E +RA S+GVA + YFN+GFL +N W + + Sbjct: 182 IAMNRNSAK-DLYELRQRRAKSIGVANAFPNLEEAQILFDNYFNAGFLALNLKLWRKENL 240 Query: 200 SARAIAMLNEPEIIK--KITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPV 257 + I I+K K+ DQD L + +++ YN S + SF P Sbjct: 241 ENQLIGFF----ILKNEKLLFNDQDALCFVCRGRILELPYPYNAHPS--FLDTPSF--PS 292 Query: 258 TNDTIFIHYIGPTKPW 273 + +H+ G KPW Sbjct: 293 IKEVCMLHFWG-DKPW 307 >UniRef50_Q1RIL1 Lipopolysaccharide 1,2-glucosyltransferase RfaJ n=10 Tax=Rickettsia RepID=Q1RIL1_RICBR Length = 530 Score = 40.8 bits (94), Expect = 0.077, Method: Compositional matrix adjust. Identities = 46/206 (22%), Positives = 77/206 (37%), Gaps = 30/206 (14%) Query: 106 WTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQAD-- 163 W + +R F N +LYLDADII + SF ++ + G D Sbjct: 337 WPPLVMYRLYFDQVFPN-LESILYLDADIIVLRDLN-----SFKKLDMSNYIVAGSMDTA 390 Query: 164 --WWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQ 221 + + I Y NSG + +N +Q + ++ + +PDQ Sbjct: 391 LTYCTLKVEEECNRKINNFYKNSGIVFLNLQNMREKQAKNMVLDAMHNSKC--SFAYPDQ 448 Query: 222 DVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFI-HYIGPTKPWH--DWAW 278 D+LN+ + Y S+ + FI+ + FI HY G KPW+ + W Sbjct: 449 DLLNIAFHN--------YIYPLSMRWNFYTYFIDRDNYFSYFIMHYAGKKKPWNNEEIKW 500 Query: 279 DYPVSQAFMEA-------KNASPWKN 297 + + + E + +PW N Sbjct: 501 TKDILEKYQEIEKYYWRYREFTPWGN 526 >UniRef50_Q1CUZ8 Lipopolysaccharide 1,2-glycosyltransferase n=12 Tax=Helicobacter RepID=Q1CUZ8_HELPH Length = 372 Score = 40.4 bits (93), Expect = 0.089, Method: Compositional matrix adjust. Identities = 26/98 (26%), Positives = 49/98 (50%), Gaps = 7/98 (7%) Query: 177 IAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFAD 236 I + ++N GFL++N W A + R + + ++ + + P+QD+L + K++ Sbjct: 202 ICENHYNVGFLIVNLKLWRADHLEERLLNLTHQKG--QCVFCPEQDLLTLACYQKVLQLP 259 Query: 237 IKYNTQ-FSLNYQLKESFINPVTNDTIFIHYIGPTKPW 273 YN F LN ++ FI P + + +H+ KPW Sbjct: 260 YIYNAHPFMLN---QKRFI-PDKKEIVMLHFYFVGKPW 293 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P27128 Lipopolysaccharide 1,3-galactosyltransferase n=4... 431 e-119 UniRef50_D2TIX6 UDP-galactose:(Glucosyl) LPS alpha-1,3-galactosy... 399 e-110 UniRef50_Q9R9D1 UDP-glucose:(Glucosyl) LPS alpha1,3-glucosyltran... 372 e-101 UniRef50_Q9ZIT4 UDP-galactose:(Glucosyl) LPS alpha1,3-galactosyl... 343 5e-93 UniRef50_B6XJW2 Putative uncharacterized protein n=1 Tax=Provide... 337 4e-91 UniRef50_D0KD53 Lipopolysaccharide 3-alpha-galactosyltransferase... 332 9e-90 UniRef50_D2TY85 UDP-D-galactose:(Glucosyl)lipopolysaccharide-alp... 320 6e-86 UniRef50_D2U322 Lipopolysaccharide 1,2-glucosyltransferase n=1 T... 302 9e-81 UniRef50_UPI000197C525 UDP-D-galactose:(glucosyl) lipopolysaccha... 301 2e-80 UniRef50_UPI00019F16C6 hypothetical protein CATC2_20202 n=1 Tax=... 298 2e-79 UniRef50_A8ARL6 Putative uncharacterized protein n=3 Tax=Citroba... 291 3e-77 UniRef50_Q9ZIS1 UDP-galactose:(Galactosyl) LPS alpha1,2-galactos... 290 4e-77 UniRef50_Q9ZIS6 UDP-galactose:(Glucosyl) LPS alpha1,2-galactosyl... 289 7e-77 UniRef50_B2PV91 Putative uncharacterized protein n=2 Tax=Enterob... 287 3e-76 UniRef50_B6XJW1 Putative uncharacterized protein n=1 Tax=Provide... 284 3e-75 UniRef50_Q46Y64 Glycosyl transferase, family 8 n=1 Tax=Ralstonia... 284 4e-75 UniRef50_C1DGU7 Lipopolysaccharide 1,3-galactosyltransferase n=1... 283 5e-75 UniRef50_Q9ZIT6 UDP-glucose:(Galactosyl) LPS alpha1,2-glucosyltr... 279 8e-74 UniRef50_P27129 Lipopolysaccharide 1,2-glucosyltransferase n=44 ... 277 4e-73 UniRef50_D0KD54 Lipopolysaccharide glucosyltransferase I n=2 Tax... 275 2e-72 UniRef50_D1PTN4 Putative uncharacterized protein n=1 Tax=Prevote... 267 5e-70 UniRef50_D1P7H1 Lipopolysaccharide 1,2-glucosyltransferase n=1 T... 263 5e-69 UniRef50_C7QL87 Glycosyl transferase family 8 n=2 Tax=Cyanothece... 261 2e-68 UniRef50_C7IBC1 Glycosyl transferase family 8 n=2 Tax=Clostridiu... 258 2e-67 UniRef50_C3X7M2 Lipopolysaccharide 3-alpha-galactosyltransferase... 257 4e-67 UniRef50_UPI0001B4A0A4 putative glucosyltransferase n=1 Tax=Bact... 257 5e-67 UniRef50_C0WCJ1 Lipopolysaccharide 1,2-glucosyltransferase n=1 T... 255 2e-66 UniRef50_Q5LF36 Putative glucosyltransferase n=1 Tax=Bacteroides... 253 6e-66 UniRef50_D2ELM0 Lipopolysaccharide biosynthesis glycosyltransfer... 253 7e-66 UniRef50_A1BHG0 Glycosyl transferase, family 8 n=2 Tax=Chlorobiu... 251 3e-65 UniRef50_P25148 General stress protein A n=3 Tax=Bacillus subtil... 249 1e-64 UniRef50_A7VNX5 Putative uncharacterized protein n=1 Tax=Clostri... 249 1e-64 UniRef50_D2RIJ4 Glycosyl transferase family 8 n=2 Tax=Acidaminoc... 246 1e-63 UniRef50_Q38VK7 Putative bifunctional glycosyl transferase, fami... 245 2e-63 UniRef50_Q03HK5 Lipopolysaccharide biosynthesis glycosyltransfer... 242 1e-62 UniRef50_Q64ZV2 Lipopolysaccharide 1,2-glucosyltransferase n=8 T... 242 1e-62 UniRef50_A6LGX5 Glycosyltransferase family 8 n=1 Tax=Parabactero... 241 2e-62 UniRef50_B7AIV0 Putative uncharacterized protein n=1 Tax=Bactero... 241 3e-62 UniRef50_UPI0001A457E5 hypothetical protein NsubN_08151 n=1 Tax=... 241 4e-62 UniRef50_A8ARL4 Putative uncharacterized protein n=2 Tax=Citroba... 238 3e-61 UniRef50_C2HBB8 Family 8 glycosyltransferase n=13 Tax=Lactobacil... 236 8e-61 UniRef50_C2HBB9 Family 8 glycosyltransferase n=35 Tax=Lactobacil... 236 8e-61 UniRef50_C3PWZ8 Lipopolysaccharide 1,2-glucosyltransferase n=1 T... 236 9e-61 UniRef50_C0QZN2 Glycosyl transferase, family 8 n=3 Tax=Brachyspi... 234 3e-60 UniRef50_B4RJG6 LgtC n=29 Tax=Neisseria RepID=B4RJG6_NEIG2 232 2e-59 UniRef50_B3Q568 Putative glycosyltransferase protein n=4 Tax=Rhi... 231 3e-59 UniRef50_A7A7B4 Putative uncharacterized protein n=2 Tax=Bifidob... 230 6e-59 UniRef50_B3CA80 Putative uncharacterized protein n=1 Tax=Bactero... 230 8e-59 UniRef50_C0Z4I4 Putative uncharacterized protein gspA n=1 Tax=Br... 228 1e-58 UniRef50_D2RIJ7 Glycosyl transferase family 8 n=1 Tax=Acidaminoc... 228 2e-58 UniRef50_B7GNT4 Glycosyl transferase, family 8 n=5 Tax=Bifidobac... 228 2e-58 UniRef50_C4VEI8 General stress protein A n=24 Tax=Enterococcus R... 228 3e-58 UniRef50_Q116W1 Glycosyl transferase, family 8 n=1 Tax=Trichodes... 227 4e-58 UniRef50_B2UPJ4 Glycosyl transferase family 8 n=1 Tax=Akkermansi... 227 5e-58 UniRef50_UPI000196958D hypothetical protein BACCELL_01586 n=1 Ta... 226 8e-58 UniRef50_A5ZFA9 Putative uncharacterized protein n=1 Tax=Bactero... 225 2e-57 UniRef50_B3Z5I6 Glycosyltransferase family 8 n=2 Tax=Bacillus ce... 224 4e-57 UniRef50_A4ECW2 Putative uncharacterized protein n=1 Tax=Collins... 223 6e-57 UniRef50_C5ZV11 Glycosyl transferase n=2 Tax=Helicobacter canade... 223 9e-57 UniRef50_B9KVD4 Glycosyl transferase, family 8 n=2 Tax=Rhodobact... 222 2e-56 UniRef50_C5ELK9 Glycosyl transferase n=3 Tax=Clostridiales RepID... 221 2e-56 UniRef50_C1QBZ8 LPS:glycosyltransferase n=1 Tax=Brachyspira murd... 221 3e-56 UniRef50_A1XRC1 Glycosyltransferase n=1 Tax=Haemophilus ducreyi ... 221 4e-56 UniRef50_B1Y723 Glycosyl transferase family 8 n=1 Tax=Leptothrix... 219 9e-56 UniRef50_C6I3U6 Putative uncharacterized protein n=2 Tax=Bactero... 219 1e-55 UniRef50_Q9L6B2 Putative glycosyl transferase n=3 Tax=Pasteurell... 218 3e-55 UniRef50_B7KM20 Glycosyl transferase family 8 n=1 Tax=Cyanothece... 217 4e-55 UniRef50_C7IBC8 Glycosyl transferase family 8 n=1 Tax=Clostridiu... 217 4e-55 UniRef50_B6GCA0 Putative uncharacterized protein n=1 Tax=Collins... 217 5e-55 UniRef50_P43974 Putative glycosyltransferase HI0258 n=32 Tax=Hae... 217 5e-55 UniRef50_B8ISQ5 Glycosyl transferase family 8 n=1 Tax=Methylobac... 217 6e-55 UniRef50_B3XL28 Glycosyl transferase family 8 n=1 Tax=Lactobacil... 216 8e-55 UniRef50_C1QC00 LPS:glycosyltransferase n=1 Tax=Brachyspira murd... 210 5e-53 UniRef50_Q65VF6 RfaJ protein n=1 Tax=Mannheimia succiniciproduce... 210 7e-53 UniRef50_D0D9G3 Putative general stress protein A n=1 Tax=Citrei... 210 9e-53 UniRef50_A0KQP2 Glycosyl transferase, family 8 n=2 Tax=Aeromonas... 209 1e-52 UniRef50_C2KV37 Lipopolysaccharide biosynthesis protein, LPS:gly... 209 1e-52 UniRef50_D2QX95 Glycosyl transferase family 8 n=1 Tax=Pirellula ... 208 3e-52 UniRef50_B6VUC8 Putative uncharacterized protein n=1 Tax=Bactero... 208 3e-52 UniRef50_B1MX28 Lipopolysaccharide biosynthesis protein, LPS:gly... 207 4e-52 UniRef50_D1Y7U2 Glycosyltransferase, family 8 n=1 Tax=Pyramidoba... 207 5e-52 UniRef50_B2UQ54 Glycosyl transferase family 8 n=1 Tax=Akkermansi... 206 9e-52 UniRef50_C9PNX4 Family 8 glycosyl transferase n=1 Tax=Pasteurell... 206 1e-51 UniRef50_C8N7M8 Putative uncharacterized protein n=1 Tax=Cardiob... 205 1e-51 UniRef50_B8G232 Glycosyl transferase family 8 n=2 Tax=Desulfitob... 204 3e-51 UniRef50_C1QC03 LPS:glycosyltransferase n=1 Tax=Brachyspira murd... 204 4e-51 UniRef50_C8W7U9 Glycosyl transferase family 8 n=2 Tax=Atopobium ... 203 8e-51 UniRef50_A4VVV8 Lipopolysaccharide biosynthesis proteins, LPS:gl... 203 9e-51 UniRef50_A4UX76 Putative LPS biosynthesis protein n=2 Tax=Lactob... 202 1e-50 UniRef50_B7C7N8 Putative uncharacterized protein n=2 Tax=Firmicu... 201 3e-50 UniRef50_Q2K5X3 Galactosyltransferase protein n=13 Tax=Rhizobium... 201 4e-50 UniRef50_UPI0001AEC697 glycosyl transferase family protein n=1 T... 200 8e-50 UniRef50_Q3D427 Glycosyl transferase, family 8 n=8 Tax=Streptoco... 200 8e-50 UniRef50_C9RWX3 Glycosyl transferase family 8 n=8 Tax=Bacillacea... 199 1e-49 UniRef50_D1JY84 Putative uncharacterized protein n=1 Tax=Bactero... 198 2e-49 UniRef50_A3V3C9 Putative uncharacterized protein n=1 Tax=Loktane... 198 3e-49 UniRef50_UPI0001A45357 glycosyl transferase family protein n=1 T... 197 4e-49 UniRef50_D2LIH7 Glycosyl transferase family 8 n=1 Tax=Rhodomicro... 197 5e-49 UniRef50_A5LNA9 Glycosyl transferase, family 8 n=2 Tax=Streptoco... 197 6e-49 UniRef50_B2ISC6 Glycosyl transferase, family 2/glycosyl transfer... 197 7e-49 UniRef50_Q5WI33 Lipopolysaccharide glycosyltransferase n=6 Tax=F... 196 1e-48 UniRef50_A3CM53 Glycosyltransferase, putative n=9 Tax=Bacteria R... 194 5e-48 UniRef50_Q3D426 Glycosyl transferase, family 8 n=7 Tax=Streptoco... 193 8e-48 UniRef50_Q4JZJ9 Putative glycosyl transferase n=8 Tax=Streptococ... 193 9e-48 UniRef50_C1QEC6 LPS:glycosyltransferase n=1 Tax=Brachyspira murd... 192 1e-47 UniRef50_C5S494 Putative glycosyl transferase n=2 Tax=Actinobaci... 192 2e-47 UniRef50_B1I7N1 Glycosyl transferase, family 8 n=8 Tax=Streptoco... 191 3e-47 UniRef50_Q4HGS8 General stress protein A, putative n=2 Tax=Campy... 191 3e-47 UniRef50_B3WD33 WbbM protein n=8 Tax=Lactobacillus RepID=B3WD33_... 190 6e-47 UniRef50_C1CFZ1 Glycosyl transferase, family 8 n=17 Tax=Streptoc... 189 1e-46 UniRef50_B2ISC2 Glycosyl transferase, family 8 n=2 Tax=Streptoco... 189 1e-46 UniRef50_C1IBL0 Glycosyl transferase n=1 Tax=Clostridium sp. 7_2... 188 2e-46 UniRef50_Q3DNA2 Glycosyl transferase, family 8 n=9 Tax=Streptoco... 188 2e-46 UniRef50_C6LDU2 Glycosyl transferase, family 8 n=3 Tax=Firmicute... 187 5e-46 UniRef50_C7RG54 Glycosyl transferase family 8 n=1 Tax=Anaerococc... 187 6e-46 UniRef50_C6X2V2 Putative glycosyltransferase n=1 Tax=Flavobacter... 186 6e-46 UniRef50_C0AA16 Lipopolysaccharide biosynthesis protein LPS:glyc... 186 1e-45 UniRef50_B9QZ95 Glycosyl transferase family 8 n=1 Tax=Labrenzia ... 185 1e-45 UniRef50_C3XGD2 Putative uncharacterized protein n=1 Tax=Helicob... 185 2e-45 UniRef50_Q3DM64 Glycosyl transferase, family 8, degenerate n=6 T... 185 2e-45 UniRef50_C6IB51 Glycosyltransferase n=4 Tax=Bacteroides RepID=C6... 185 2e-45 UniRef50_B7AUG6 Putative uncharacterized protein n=1 Tax=Bactero... 185 2e-45 UniRef50_B0NR59 Putative uncharacterized protein n=1 Tax=Bactero... 185 2e-45 UniRef50_A5VK24 Glycosyl transferase, family 8 n=18 Tax=Lactobac... 185 3e-45 UniRef50_C7HS13 Family 8 glycosyl transferase n=2 Tax=Anaerococc... 184 4e-45 UniRef50_C4ZG45 Glycosyltransferase Family 2 modular protein n=1... 184 4e-45 UniRef50_A5UC07 Aspartate-semialdehyde dehydrogenase n=7 Tax=Hae... 184 5e-45 UniRef50_C5S3F7 Putative glycosyl transferase n=2 Tax=Actinobaci... 183 6e-45 UniRef50_B1LK07 Glycosyl transferase family 8 n=10 Tax=Enterobac... 182 1e-44 UniRef50_B1I7M9 Glycosyl transferase, family 8 n=16 Tax=Streptoc... 182 2e-44 UniRef50_Q3DNS6 Glycosyl transferase, family 8 n=5 Tax=Streptoco... 182 2e-44 UniRef50_C3XKY2 Glycosyl transferase n=2 Tax=Campylobacterales R... 182 2e-44 UniRef50_A7H2M2 Glycosyl transferase family 8 n=3 Tax=Campylobac... 181 3e-44 UniRef50_C5EZG9 Glycosyl transferase family protein n=1 Tax=Heli... 181 3e-44 UniRef50_Q9L7A2 Putative glycosyl transferase n=1 Tax=Haemophilu... 181 4e-44 UniRef50_A4WWT5 Glycosyl transferase, family 8 n=1 Tax=Rhodobact... 180 6e-44 UniRef50_Q5M3K9 Glycosyl transferase n=4 Tax=Streptococcus RepID... 180 7e-44 UniRef50_Q0I2Z7 Lipopolysaccharide biosynthesis protein, glycosy... 180 8e-44 UniRef50_C5ZVZ7 Putative glycosyltransferase n=3 Tax=Campylobact... 179 1e-43 UniRef50_D2MYR1 Putative uncharacterized protein n=1 Tax=Campylo... 179 1e-43 UniRef50_A8AY72 Glycosyl transferase, family 8 SP1766 n=7 Tax=St... 178 3e-43 UniRef50_A8FNA2 Putative uncharacterized protein n=2 Tax=Campylo... 177 5e-43 UniRef50_A8LP95 Lipopolysaccharide 1 n=1 Tax=Dinoroseobacter shi... 177 6e-43 UniRef50_C4Z1V1 Putative uncharacterized protein n=1 Tax=Eubacte... 176 6e-43 UniRef50_Q50FU8 Cj81-079 (Fragment) n=6 Tax=Campylobacter jejuni... 176 9e-43 UniRef50_A3YS36 Putative sugar transferase n=2 Tax=Campylobacter... 169 1e-40 UniRef50_Q16CW9 Lipopolysaccharide 1,3-galactosyltransferase, pu... 169 1e-40 UniRef50_C6EQF4 Putative uncharacterized protein n=3 Tax=Campylo... 168 3e-40 UniRef50_D1N145 Glycosyl transferase family 8 n=1 Tax=Victivalli... 166 8e-40 UniRef50_C3XFW0 Glycosyl transferase n=1 Tax=Helicobacter bilis ... 161 3e-38 UniRef50_C6IJ37 General stress protein A n=2 Tax=Bacteroides Rep... 158 2e-37 UniRef50_UPI00016B2258 glycosyl transferase, family 8 n=1 Tax=ca... 158 3e-37 UniRef50_C5SH34 Glycosyl transferase family 8 n=1 Tax=Asticcacau... 158 3e-37 UniRef50_C0EQT1 Putative uncharacterized protein n=1 Tax=Neisser... 158 3e-37 UniRef50_A7H2X4 Glycosyl transferase family 8 n=2 Tax=Campylobac... 156 7e-37 UniRef50_A0ZYL4 Putative uncharacterized protein n=1 Tax=Archaea... 155 2e-36 UniRef50_UPI00016C0890 glycosyl transferase family 8 protein n=2... 154 5e-36 UniRef50_A4SAB5 Predicted protein (Fragment) n=1 Tax=Ostreococcu... 149 1e-34 UniRef50_C5WAK3 Ybl156 protein n=2 Tax=Enterobacteriaceae RepID=... 148 2e-34 UniRef50_B9ADW8 Putative uncharacterized protein n=1 Tax=Methano... 148 3e-34 UniRef50_C0X9Z8 Glycosyltransferase n=1 Tax=Lactobacillus gasser... 142 2e-32 UniRef50_C3XN62 Glycosyl transferase n=1 Tax=Helicobacter wingha... 139 1e-31 UniRef50_B9KUH7 Lipopolysaccharide 3-alpha-galactosyltransferase... 131 3e-29 UniRef50_C7TIE0 Glycosyl transferase, group 8 n=2 Tax=Lactobacil... 129 1e-28 UniRef50_A2DXT6 Glycosyl transferase family 8 protein n=1 Tax=Tr... 107 6e-22 Sequences not found previously or not previously below threshold: UniRef50_D2QX94 Glycosyl transferase family 8 n=1 Tax=Pirellula ... 196 1e-48 UniRef50_B9BAZ6 Glycosyl transferase family 8 protein n=1 Tax=Bu... 188 2e-46 UniRef50_C0BAU3 Putative uncharacterized protein n=1 Tax=Coproco... 188 3e-46 UniRef50_B6G807 Putative uncharacterized protein n=2 Tax=Collins... 185 3e-45 UniRef50_A5EVI8 Glycosyl transferase family 8 protein n=1 Tax=Di... 183 8e-45 UniRef50_A3VFX3 Glycosyltransferase n=1 Tax=Rhodobacterales bact... 181 3e-44 UniRef50_C8WAA9 Glycosyl transferase family 8 n=2 Tax=Atopobium ... 181 3e-44 UniRef50_UPI0001693121 general stress protein n=1 Tax=Paenibacil... 175 1e-42 UniRef50_C2LRU0 Glycosyl transferase, family 8 n=1 Tax=Streptoco... 175 2e-42 UniRef50_Q046Z9 Lipopolysaccharide biosynthesis glycosyltransfer... 169 1e-40 UniRef50_B8PIH6 Predicted protein n=2 Tax=Agaricomycetes RepID=B... 163 1e-38 UniRef50_B2UQN6 Glycosyl transferase family 8 n=1 Tax=Akkermansi... 158 2e-37 UniRef50_C7XX93 Glycosyl transferase, family 8 n=1 Tax=Lactobaci... 158 3e-37 UniRef50_A1VG39 Glycosyl transferase, family 8 n=1 Tax=Desulfovi... 155 2e-36 UniRef50_Q1RIL1 Lipopolysaccharide 1,2-glucosyltransferase RfaJ ... 154 4e-36 UniRef50_Q38VG7 Putative glycosyl transferase, family 8 n=1 Tax=... 153 1e-35 UniRef50_C0X9Z7 Putative uncharacterized protein n=2 Tax=Lactoba... 152 2e-35 UniRef50_A4IXE1 Glycosyl transferase, family 8 n=16 Tax=Francise... 143 8e-33 UniRef50_B6ACJ0 Glycosyl transferase family 8 protein, putative ... 141 4e-32 UniRef50_Q17VR5 Lipopolysaccharide biosynthesis protein n=19 Tax... 139 1e-31 UniRef50_UPI000197AD97 hypothetical protein BACCOPRO_03221 n=1 T... 138 3e-31 UniRef50_Q39T65 Glycosyl transferase, family 8 n=1 Tax=Geobacter... 136 8e-31 UniRef50_B3XPR8 Glycosyl transferase family 8 n=4 Tax=Lactobacil... 132 1e-29 UniRef50_A9S2B3 Predicted protein (Fragment) n=1 Tax=Physcomitre... 131 3e-29 UniRef50_C6DEN2 Glycosyl transferase family 8 n=1 Tax=Pectobacte... 131 3e-29 UniRef50_O48684 F3I6.10 protein n=46 Tax=Embryophyta RepID=O4868... 131 4e-29 UniRef50_D1IFB6 Whole genome shotgun sequence of line PN40024, s... 129 1e-28 UniRef50_Q1CUZ8 Lipopolysaccharide 1,2-glycosyltransferase n=12 ... 128 3e-28 UniRef50_B9HMR5 Glycosyltransferase, CAZy family GT8 n=25 Tax=Ma... 126 1e-27 UniRef50_D1IU75 Whole genome shotgun sequence of line PN40024, s... 125 2e-27 UniRef50_Q92VQ2 Putative lipopolysaccharide 1,3-galactosyltransf... 124 5e-27 UniRef50_B3WD32 Glycosyl transferase n=9 Tax=Lactobacillus RepID... 124 5e-27 UniRef50_C6DEN1 Glycosyl transferase family 8 n=1 Tax=Pectobacte... 122 2e-26 UniRef50_Q8LF94 Avr9/Cf-9 rapidly elicited protein 231 n=13 Tax=... 120 6e-26 UniRef50_C7TID9 Glycosyl transferase, group 8 n=2 Tax=Lactobacil... 120 7e-26 UniRef50_A2RLV8 Putative glycosyltransferase n=1 Tax=Lactococcus... 119 1e-25 UniRef50_D0IR33 LPS 1,2-glycosyltransferase n=3 Tax=Helicobacter... 117 7e-25 UniRef50_A9UZX9 Predicted protein (Fragment) n=1 Tax=Monosiga br... 115 2e-24 UniRef50_A9UXT0 Predicted protein (Fragment) n=1 Tax=Monosiga br... 115 2e-24 UniRef50_Q1CSY7 Lipopolysaccharide 1,2-glycosyltransferase n=3 T... 114 4e-24 UniRef50_C6DEN3 Glycosyl transferase family 8 n=1 Tax=Pectobacte... 114 4e-24 UniRef50_A9SH80 Predicted protein n=2 Tax=Physcomitrella patens ... 114 5e-24 UniRef50_C7PRU3 Glycosyl transferase family 8 n=1 Tax=Chitinopha... 114 6e-24 UniRef50_UPI0000587C70 PREDICTED: similar to MGC81998 protein n=... 112 2e-23 UniRef50_UPI0001621115 predicted protein n=1 Tax=Physcomitrella ... 112 2e-23 UniRef50_UPI0001B55E75 hypothetical protein SSPB78_11600 n=1 Tax... 111 3e-23 UniRef50_Q04CN2 Lipopolysaccharide biosynthesis glycosyltransfer... 111 5e-23 UniRef50_Q02ZT7 Lipopolysaccharide biosynthesis glycosyltransfer... 110 9e-23 UniRef50_C3YRN2 Putative uncharacterized protein (Fragment) n=1 ... 109 1e-22 UniRef50_Q04CN3 Lipopolysaccharide biosynthesis glycosyltransfer... 109 2e-22 UniRef50_C5FDY7 Glycogenin n=1 Tax=Microsporum canis CBS 113480 ... 108 5e-22 UniRef50_Q68CQ7 Glycosyltransferase 8 domain-containing protein ... 105 2e-21 UniRef50_A5BZU1 Putative uncharacterized protein n=1 Tax=Vitis v... 105 2e-21 UniRef50_B3XPR6 Putative uncharacterized protein n=3 Tax=Lactoba... 104 5e-21 UniRef50_A2E3L1 Glycosyl transferase family 8 protein n=2 Tax=Tr... 103 7e-21 UniRef50_B2KBT4 Glycosyl transferase family 8 n=1 Tax=Elusimicro... 103 1e-20 UniRef50_B4WN64 Glycosyl transferase family 8 n=1 Tax=Synechococ... 103 1e-20 UniRef50_Q9FH36 Similarity to unknown protein n=28 Tax=Embryophy... 102 2e-20 UniRef50_B9IA47 Glycosyltransferase n=7 Tax=rosids RepID=B9IA47_... 102 2e-20 UniRef50_UPI000180B580 PREDICTED: similar to glycosyltransferase... 102 2e-20 UniRef50_D1HMA0 Whole genome shotgun sequence of line PN40024, s... 101 5e-20 UniRef50_B5ZNF8 Glycosyl transferase family 8 n=7 Tax=Rhizobium ... 100 6e-20 UniRef50_B6HCQ7 Pc18g02120 protein n=2 Tax=mitosporic Trichocoma... 100 7e-20 UniRef50_B4QUA9 GD18236 n=2 Tax=Sophophora RepID=B4QUA9_DROSI 100 9e-20 UniRef50_O95461 Glycosyltransferase-like protein LARGE1 n=84 Tax... 100 9e-20 UniRef50_Q0U987 Putative uncharacterized protein n=1 Tax=Phaeosp... 99 1e-19 UniRef50_Q6Z5D6 Glycosyltransferase family-like n=6 Tax=Poaceae ... 99 1e-19 UniRef50_A2RAV0 Catalytic activity: UDP-glucose + glycogenin <=>... 99 1e-19 UniRef50_Q726Y5 Glycosyl transferase, family 8 n=4 Tax=Desulfovi... 99 2e-19 UniRef50_Q9H1C3 Glycosyltransferase 8 domain-containing protein ... 99 2e-19 UniRef50_Q9VBY3 CG9996 n=6 Tax=Sophophora RepID=Q9VBY3_DROME 98 3e-19 UniRef50_C1QEC8 LPS:glycosyltransferase n=1 Tax=Brachyspira murd... 98 4e-19 UniRef50_Q2R1U9 Glycosyl transferase family 8 protein, expressed... 98 4e-19 UniRef50_B4WFJ6 Glycosyl transferase family 8 n=1 Tax=Synechococ... 98 4e-19 UniRef50_Q9FX71 T6J4.1 protein n=2 Tax=rosids RepID=Q9FX71_ARATH 98 5e-19 UniRef50_B7PNZ0 Glycosyltransferase domain-containing protein, p... 97 7e-19 UniRef50_B6JNQ8 Lipopolysaccharide 1,2-glucosyltransferase n=18 ... 97 8e-19 UniRef50_Q9M9Y5 F4H5.13 protein n=4 Tax=rosids RepID=Q9M9Y5_ARATH 96 1e-18 UniRef50_B2VRF2 Glycogenin-2 n=1 Tax=Pyrenophora tritici-repenti... 96 1e-18 UniRef50_A7S9E5 Predicted protein n=1 Tax=Nematostella vectensis... 96 1e-18 UniRef50_Q2L3C5 Glycosyl transferase-like protein n=3 Tax=Magnol... 96 2e-18 UniRef50_Q02ZT6 Lipopolysaccharide biosynthesis glycosyltransfer... 96 2e-18 UniRef50_A5DLS6 Putative uncharacterized protein n=2 Tax=Pichia ... 96 2e-18 UniRef50_B6JMU4 Lipopolysaccharide biosynthesis protein n=20 Tax... 95 4e-18 UniRef50_Q871S1 Glycogenin n=3 Tax=Sordariaceae RepID=Q871S1_NEUCR 95 4e-18 >UniRef50_P27128 Lipopolysaccharide 1,3-galactosyltransferase n=43 Tax=Enterobacteriaceae RepID=RFAI_ECOLI Length = 339 Score = 431 bits (1109), Expect = e-119, Method: Composition-based stats. Identities = 339/339 (100%), Positives = 339/339 (100%) Query: 1 MQQVFFQETEFLNSVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLC 60 MQQVFFQETEFLNSVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLC Sbjct: 1 MQQVFFQETEFLNSVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLC 60 Query: 61 FHIFTDYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYF 120 FHIFTDYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYF Sbjct: 61 FHIFTDYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYF 120 Query: 121 INKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKG 180 INKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKG Sbjct: 121 INKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKG 180 Query: 181 YFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYN 240 YFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYN Sbjct: 181 YFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYN 240 Query: 241 TQFSLNYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTAL 300 TQFSLNYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTAL Sbjct: 241 TQFSLNYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTAL 300 Query: 301 LKPNNSNQLRYSAKHMLKKHRYLKGFSNYLFYFIEKIKH 339 LKPNNSNQLRYSAKHMLKKHRYLKGFSNYLFYFIEKIKH Sbjct: 301 LKPNNSNQLRYSAKHMLKKHRYLKGFSNYLFYFIEKIKH 339 >UniRef50_D2TIX6 UDP-galactose:(Glucosyl) LPS alpha-1,3-galactosyltransferase n=1 Tax=Citrobacter rodentium ICC168 RepID=D2TIX6_CITRO Length = 340 Score = 399 bits (1025), Expect = e-110, Method: Composition-based stats. Identities = 236/337 (70%), Positives = 272/337 (80%) Query: 1 MQQVFFQETEFLNSVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLC 60 MQQV+F+ETEFL S ID++H+ E + LDIAYG D+NFLFGCGISIAS+LK N L Sbjct: 1 MQQVYFKETEFLTSTIDFNHQDTAEKVVLDIAYGVDQNFLFGCGISIASVLKNNTDKTLH 60 Query: 61 FHIFTDYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYF 120 FH+F D F + DR+ FD LA QYKT I IYLIN + LRSLPSTKNWT+AIYFRF IADYF Sbjct: 61 FHVFIDAFNETDRRMFDKLAAQYKTHITIYLINCEHLRSLPSTKNWTYAIYFRFAIADYF 120 Query: 121 INKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKG 180 I K K+LYLDADIICQG I+ L+NFSF DK+A VVTEG+ADWWEKRA SLG GI KG Sbjct: 121 IGKTNKLLYLDADIICQGGIDELVNFSFASDKIAAVVTEGKADWWEKRALSLGTEGITKG 180 Query: 181 YFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYN 240 YFNSG +LIN QWA + +SARAI ML++P+I+ +ITHPDQDVLN+LLADKL F DIK+N Sbjct: 181 YFNSGLILINLNQWAIECISARAIKMLSDPDIVGRITHPDQDVLNILLADKLHFLDIKFN 240 Query: 241 TQFSLNYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTAL 300 TQFSLNYQLK+ FINPV NDTI IHYIGPTKPWH WA DY +S+ F++AK ASPWKNTAL Sbjct: 241 TQFSLNYQLKDKFINPVNNDTILIHYIGPTKPWHSWAGDYLISKPFIDAKQASPWKNTAL 300 Query: 301 LKPNNSNQLRYSAKHMLKKHRYLKGFSNYLFYFIEKI 337 LKP NSNQ RY AKHMLK RY+KG Y YF++KI Sbjct: 301 LKPTNSNQFRYCAKHMLKNKRYIKGMVGYFLYFMKKI 337 >UniRef50_Q9R9D1 UDP-glucose:(Glucosyl) LPS alpha1,3-glucosyltransferase WaaO n=29 Tax=Enterobacteriaceae RepID=Q9R9D1_ECOLX Length = 338 Score = 372 bits (955), Expect = e-101, Method: Composition-based stats. Identities = 171/338 (50%), Positives = 233/338 (68%), Gaps = 3/338 (0%) Query: 1 MQQVFFQETEFLNSVIDYDH-KVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRL 59 M +F E +N I +D + +AYG DKNFLFGCG+SI S+L +N Sbjct: 1 MSAHYFNPQEMINKTIIFDERPAASVASSFHVAYGIDKNFLFGCGVSITSVLLHNSDVSF 60 Query: 60 CFHIFTDYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADY 119 FH+F D + D + LA Y+T I+I+L+N +RL++LP+TKNW+ A+YFRFVIADY Sbjct: 61 VFHVFIDDIPEADIQRLAQLAKSYRTCIQIHLVNCERLKALPTTKNWSIAMYFRFVIADY 120 Query: 120 FINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAK 179 FI++ K+LYLDADI CQG ++PLI ++ VA VVTE A+WW R SL + K Sbjct: 121 FIDQQDKILYLDADIACQGNLKPLITMDLANN-VAAVVTERDANWWSLRGQSLQCNELEK 179 Query: 180 GYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKY 239 GYFNSG LLINT WA + VSA+A++ML + I+ ++T+ DQD+LN++L K+ F D KY Sbjct: 180 GYFNSGVLLINTLAWAQESVSAKAMSMLADKAIVSRLTYMDQDILNLILLGKVKFIDAKY 239 Query: 240 NTQFSLNYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTA 299 NTQFSLNY+LK+SF+ P+ ++T+ IHY+GPTKPWH WA YP +Q F++AK ASPWKN Sbjct: 240 NTQFSLNYELKKSFVCPINDETVLIHYVGPTKPWHYWA-GYPSAQPFIKAKEASPWKNEP 298 Query: 300 LLKPNNSNQLRYSAKHMLKKHRYLKGFSNYLFYFIEKI 337 L++P NSN RY AKH K+++ + G NY++YF KI Sbjct: 299 LMRPVNSNYARYCAKHNFKQNKPINGIMNYIYYFYLKI 336 >UniRef50_Q9ZIT4 UDP-galactose:(Glucosyl) LPS alpha1,3-galactosyltransferase WaaI n=26 Tax=Enterobacteriaceae RepID=Q9ZIT4_ECOLX Length = 335 Score = 343 bits (880), Expect = 5e-93, Method: Composition-based stats. Identities = 175/331 (52%), Positives = 235/331 (70%), Gaps = 1/331 (0%) Query: 6 FQETEFLNSVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFT 65 +++ + ++ ++ LDIA+G D+NFLFGCG++IASIL N FH+FT Sbjct: 4 LNDSDIILFEYNFHYQNIRSKNTLDIAFGIDRNFLFGCGVAIASILLNNREISCEFHVFT 63 Query: 66 DYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAP 125 DY D D+ YF LA QY +RI IY+IN D+L+SLPSTKNWT+A YFRF+IADYF +K Sbjct: 64 DYISDKDKLYFSDLAKQYNSRINIYVINCDKLKSLPSTKNWTYATYFRFIIADYFYHKHE 123 Query: 126 KVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSG 185 K+LYLDADI C+G+I+ L+++ F +++A VV E +WW+ RA L +A GYFN+G Sbjct: 124 KILYLDADIACKGSIKELLDYQFSTNEIAAVVAERDVEWWQNRASVLTTPQLASGYFNAG 183 Query: 186 FLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSL 245 FLLIN +W +S++AI ML +P+ + KITH DQDVLN+LL K+ F KYNT++S+ Sbjct: 184 FLLINIDEWNLNNISSKAIEMLRDPDWVSKITHLDQDVLNVLLNGKVKFISEKYNTRYSI 243 Query: 246 NYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNN 305 NY+LK+ NPV +DT+FIHY+GPTKPWH+WA +YPVS++F+ AK ASPW LLKP N Sbjct: 244 NYELKDKVDNPVNDDTVFIHYVGPTKPWHEWA-NYPVSRSFLIAKAASPWSKEDLLKPVN 302 Query: 306 SNQLRYSAKHMLKKHRYLKGFSNYLFYFIEK 336 SNQ RY AKH K+ Y+ G NYL Y+ EK Sbjct: 303 SNQYRYCAKHKFKQKHYMAGIFNYLKYYKEK 333 >UniRef50_B6XJW2 Putative uncharacterized protein n=1 Tax=Providencia alcalifaciens DSM 30120 RepID=B6XJW2_9ENTR Length = 333 Score = 337 bits (865), Expect = 4e-91, Method: Composition-based stats. Identities = 151/328 (46%), Positives = 213/328 (64%), Gaps = 3/328 (0%) Query: 11 FLNSVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDY-FG 69 + I + ++ C +AYG D NFL+G G+SI S+L +N + FHIF D Sbjct: 8 MVKKTIPIGNIEIDDSSCQHVAYGIDHNFLYGSGVSIVSLLMHNPHIQFAFHIFIDNSMS 67 Query: 70 DDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLY 129 D+D F + Y T+I IY I+ + ++ LP+TKNWTHAIYFRF+IA+YF +K +LY Sbjct: 68 DEDIAKFAEICHLYNTKITIYFIDSNNVKKLPTTKNWTHAIYFRFIIAEYFKDKIDYLLY 127 Query: 130 LDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLI 189 LDAD++C I+ L++ + +A VV E WW+KRA SLG ++KGYFNSG + I Sbjct: 128 LDADVVCNRNIDELLSHNLLGY-IAAVVPERDKAWWQKRADSLGFPSVSKGYFNSGVMYI 186 Query: 190 NTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQL 249 N W V+ +++A+L + E+ ++ +PDQDVLN+LL D ++F +NTQFSLNY+L Sbjct: 187 NLRTWKTNNVTEKSMALLMDNEVSHRLVYPDQDVLNILLTDSVLFISSIFNTQFSLNYEL 246 Query: 250 KESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQL 309 K+SF PV T+FIHY+GPTKPWH+WA +Y +Q F+EA+ SPW+N LLK +SN L Sbjct: 247 KKSFDFPVKRTTVFIHYVGPTKPWHEWA-NYETAQPFLEARAVSPWRNVPLLKAKSSNHL 305 Query: 310 RYSAKHMLKKHRYLKGFSNYLFYFIEKI 337 RY AKH + + +Y F NY+ YF KI Sbjct: 306 RYCAKHNINQRKYFFAFKNYIAYFFSKI 333 >UniRef50_D0KD53 Lipopolysaccharide 3-alpha-galactosyltransferase n=3 Tax=Enterobacteriaceae RepID=D0KD53_PECWW Length = 336 Score = 332 bits (852), Expect = 9e-90, Method: Composition-based stats. Identities = 154/332 (46%), Positives = 209/332 (62%), Gaps = 7/332 (2%) Query: 13 NSVIDYDHKVETENLC--LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGD 70 VI H C LDIA+GTD+ F++GC I+IASIL N L FH+FTD D Sbjct: 6 EKVIKTVHSFSYSKKCAELDIAFGTDEKFIYGCAIAIASILLKNPDYCLSFHVFTDKLSD 65 Query: 71 DDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYL 130 D+ F +A QY T I IY+++ L++LP TK W++AIYFRF+IADYF KVLYL Sbjct: 66 GDKARFQEMAEQYNTTINIYIVDCSWLKTLPETKLWSYAIYFRFIIADYFYKILDKVLYL 125 Query: 131 DADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLIN 190 DADIIC G+++ LI + ++ VV +G ++WW+ RA ++ GYFNSG LLI Sbjct: 126 DADIICNGSLQELIKLDLSNH-ISAVVLDGDSNWWKNRAQKFQQPELSNGYFNSGVLLIE 184 Query: 191 TAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLK 250 W V+ ++ +L +PE+ K ITHPDQDVLN+LLA K + KYNTQFS+NY+LK Sbjct: 185 VNNWHQAAVTENSMRLLTDPEMKKIITHPDQDVLNVLLAGKSCHIESKYNTQFSINYELK 244 Query: 251 ESF----INPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNS 306 S+ P++N TIFIHYIGPTKPWH WA +Y ++ F++AK SPWKN +LL + Sbjct: 245 YSYGESAPTPISNKTIFIHYIGPTKPWHKWAANYACTKYFLKAKEHSPWKNESLLDAVTA 304 Query: 307 NQLRYSAKHMLKKHRYLKGFSNYLFYFIEKIK 338 + +RY AKH ++G ++L Y +K Sbjct: 305 SNMRYCAKHQFHNGEIIRGTLSFLKYLYKKAF 336 >UniRef50_D2TY85 UDP-D-galactose:(Glucosyl)lipopolysaccharide-alpha-1, 3-D-galactosyltransferase n=1 Tax=Arsenophonus nasoniae RepID=D2TY85_9ENTR Length = 343 Score = 320 bits (820), Expect = 6e-86, Method: Composition-based stats. Identities = 147/336 (43%), Positives = 205/336 (61%), Gaps = 3/336 (0%) Query: 4 VFFQETEFLNSVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHI 63 ++F E + + ++ + + IAYG DKNF G ISI S+L +N+ F+I Sbjct: 10 MYFNSKEIILTSYEFS-SADAKTPQFHIAYGADKNFSLGTAISICSMLYFNKIYTFHFYI 68 Query: 64 FTDYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINK 123 FTD + D K FD L Y T+I I LI+ +L+ LP+ K W+HAIYFRF+IA+YF NK Sbjct: 69 FTDTISECDLKKFDELTSCYNTKITILLIDTLQLKKLPTNKLWSHAIYFRFIIANYFHNK 128 Query: 124 APKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFN 183 K+LYLD+DIIC G I L + +A V Q W+KRA L IA GYFN Sbjct: 129 TNKILYLDSDIICSGDISELFDIDLNQHIIAAVADRDQ-YLWKKRAEMLATPEIANGYFN 187 Query: 184 SGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQF 243 SG +LI+T +W +++ + I +L + + K DQD LN+ L ++++F D K+NTQF Sbjct: 188 SGVMLIDTDKWHKNKITEKTINILLDDKTKAKFVFYDQDALNISLVNQVLFLDKKFNTQF 247 Query: 244 SLNYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKP 303 S+NY+LK + P+ N+ FIHYIGPTKPW+ W+ +YP + FM K SPWK T L+ Sbjct: 248 SINYELKNKTLFPIINNVKFIHYIGPTKPWNIWS-EYPSTHLFMTIKKNSPWKTTPLIAA 306 Query: 304 NNSNQLRYSAKHMLKKHRYLKGFSNYLFYFIEKIKH 339 + SNQ RY+AKHM K +Y+ NYL+YF+ K H Sbjct: 307 STSNQYRYAAKHMFNKKKYIYWLLNYLYYFVNKALH 342 >UniRef50_D2U322 Lipopolysaccharide 1,2-glucosyltransferase n=1 Tax=Arsenophonus nasoniae RepID=D2U322_9ENTR Length = 334 Score = 302 bits (775), Expect = 9e-81, Method: Composition-based stats. Identities = 107/332 (32%), Positives = 171/332 (51%), Gaps = 8/332 (2%) Query: 11 FLNSVIDY--DHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYF 68 F+ + + K+ N +IAYG DKNFL G ISI S+L N + FH+FTDY Sbjct: 4 FIKQKFNIAGEKKLTENNKNFNIAYGVDKNFLLGAAISINSVLINNTDTDFNFHLFTDYI 63 Query: 69 GDDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVL 128 D + F + +Y + I IYL++ L+ L ++ W++A YFR + +Y +L Sbjct: 64 DDGYIQRFQTMIAKYNSNIIIYLLDAAELKQLSTSDFWSYATYFRLIAFEYLSTNIHAIL 123 Query: 129 YLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLL 188 YLDAD+IC+G+++ + + D A+V+ + A L +A + YFN+G + Sbjct: 124 YLDADVICKGSLKEIFQLNLADSFAAVVLDVDS--MQQSSATRLNLADLNGKYFNAGVIY 181 Query: 189 INTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLN-- 246 +N +W S +++ ++ K+ + DQD LN+L + I+ YN + L Sbjct: 182 VNLQKWIENDFSKKSLELVRGKTNFGKLKYLDQDALNILFQTQNIYLSRDYNCIYKLKNE 241 Query: 247 --YQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPN 304 Y + N +T+ TI IHY G TKPWH W +YP SQ F + SPWK+ L Sbjct: 242 LAYHDLSKYKNTITDSTILIHYTGVTKPWHTWGINYPASQFFFNSYIHSPWKDQPLKMAE 301 Query: 305 NSNQLRYSAKHMLKKHRYLKGFSNYLFYFIEK 336 +L+ KH+ +H+Y++GF + Y + K Sbjct: 302 KRTELQEKYKHLFLQHKYMQGFLCLIKYKLLK 333 >UniRef50_UPI000197C525 UDP-D-galactose:(glucosyl) lipopolysaccharide-alpha-1,3-D-galactosyltransferase n=1 Tax=Providencia rettgeri DSM 1131 RepID=UPI000197C525 Length = 339 Score = 301 bits (772), Expect = 2e-80, Method: Composition-based stats. Identities = 124/326 (38%), Positives = 195/326 (59%), Gaps = 2/326 (0%) Query: 13 NSVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDD 72 S + D + ++AYG DKNFLFG G+SI S+L N+ FH+FTD+ D D Sbjct: 12 TSFTNKDVNKDLSKKKFNVAYGADKNFLFGTGVSIVSVLLNNKDINFHFHVFTDFLSDKD 71 Query: 73 RKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDA 132 + F ++ QYKT + ++ +N D L+ LP+ + W+HAIYFR +IADYF K KVLYLD+ Sbjct: 72 IQLFSQISKQYKTSVTLHTLNMDILKKLPTNQVWSHAIYFRLIIADYFYKKCDKVLYLDS 131 Query: 133 DIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTA 192 D++C G+I+ L + + +A V+ + E A+ V GI KGYFNSG +LIN Sbjct: 132 DVVCTGSIQILKSLNLSSMPIAAVMDISEPHSVE-MANLFNVEGIKKGYFNSGVMLINPD 190 Query: 193 QWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKES 252 +W +Q++ +++++ + ++ I + DQD +N+ + + D +N + +LN + K Sbjct: 191 EWNYRQLTEKSMSVFTDKKLQPVIKYYDQDAINIAVHGDWLKLDNIFNHRINLNDRYKHK 250 Query: 253 FINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYS 312 N ++N +F+H+IG TKPWH+W+ Y + F+ AK SPWK+ L+ P N +Y+ Sbjct: 251 KNNDISN-AVFVHFIGSTKPWHNWSKYYHEVRCFLNAKEKSPWKDIDLMTPQNITHHKYA 309 Query: 313 AKHMLKKHRYLKGFSNYLFYFIEKIK 338 +KH K +YL F +Y+ Y I KIK Sbjct: 310 SKHFRYKEKYLSSFYHYVLYTILKIK 335 >UniRef50_UPI00019F16C6 hypothetical protein CATC2_20202 n=1 Tax=Citrobacter youngae ATCC 29220 RepID=UPI00019F16C6 Length = 330 Score = 298 bits (764), Expect = 2e-79, Method: Composition-based stats. Identities = 110/337 (32%), Positives = 181/337 (53%), Gaps = 15/337 (4%) Query: 6 FQETEFLNSVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFT 65 F + + +++++ L+IA+G DKNF+FG IS+ S+L +N+ + FH+FT Sbjct: 3 FDCHQSIKKILEFNQAPSEHKTQLNIAWGVDKNFMFGAAISMTSVLLHNKDLNIHFHLFT 62 Query: 66 DYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAP 125 DY D ++ LA Q+ T I IY+++ + L+ LPS W+HA+YFRF+ +Y K Sbjct: 63 DYIDADYQQRVAKLAEQFATNISIYIMDANGLKVLPSGNAWSHAMYFRFIAFEYLGEKVD 122 Query: 126 KVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSG 185 +LY+DAD++C+G++ L + A++ + + K YFNSG Sbjct: 123 SLLYIDADVMCKGSLYELTQIDLGEHVAAVITDVDDSPARD--------IEKNKDYFNSG 174 Query: 186 FLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSL 245 + N +W Q A +L + K++ PDQDVLN+L K+IF + ++N + + Sbjct: 175 VIFANLKKWKEQNFINSAFDILLDKN--NKLSFPDQDVLNILFLKKVIFLERRFNAIYGI 232 Query: 246 NYQLK----ESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALL 301 +LK + +T +TI IHYIG TKPW+ WA +YP +Q F+EA +SPW + LL Sbjct: 233 KQELKSKDTSKYKEYITPETILIHYIGVTKPWNSWA-NYPSAQYFVEAWKSSPWADVPLL 291 Query: 302 KPNNSNQLRYSAKHMLKKHRYLKGFSNYLFYFIEKIK 338 Q + ++H + +Y +Y+ Y K+K Sbjct: 292 PARTPKQYKKKSRHERLQGKYFASAISYIGYLWAKLK 328 >UniRef50_A8ARL6 Putative uncharacterized protein n=3 Tax=Citrobacter RepID=A8ARL6_CITK8 Length = 339 Score = 291 bits (744), Expect = 3e-77, Method: Composition-based stats. Identities = 105/324 (32%), Positives = 181/324 (55%), Gaps = 9/324 (2%) Query: 18 YDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFD 77 D+ ++ L+IAYG D+NFLFG GIS+ S+L N + F++ TDY D+ + + Sbjct: 15 IDNATHQKSKKLNIAYGVDRNFLFGSGISMTSVLVNNPDIDIHFYVVTDYVDDEYLESVE 74 Query: 78 ALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQ 137 L Y T + + + + + R LPSTK WT+A+Y+R+ +Y + VLYLDADI+C+ Sbjct: 75 RLTQMYGTTVTVLVFDNEAFRKLPSTKAWTYAMYYRYFAFEYLSRELDSVLYLDADIVCK 134 Query: 138 GTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQ 197 ++ L + F + A+V + K LG+ +A+ YFNSG + N W + Sbjct: 135 NSLRELTDIHFAGEYAAVVNDIDRVRL--KSGQRLGIPELARDYFNSGVVFANLHVWREK 192 Query: 198 QVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKES----F 253 ++ ++A +L+E + K++ + DQD+LN+L +I +N + ++ +LK + Sbjct: 193 KLLSKAFEVLHERQ--KELLYFDQDILNILFVGHVILLRRDFNCIYGVDQELKNKNEYRY 250 Query: 254 INPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSA 313 + +T T+ IHY+G TKPWH WA +YPVS+ F+EA S W +LL N + + + Sbjct: 251 QDFITESTVLIHYVGVTKPWHTWA-NYPVSKYFIEAYKKSAWAEKSLLNANTAKLYKRKS 309 Query: 314 KHMLKKHRYLKGFSNYLFYFIEKI 337 +H + +Y++ +++ Y K+ Sbjct: 310 RHERIQRKYIRSIFSHIMYIKNKL 333 >UniRef50_Q9ZIS1 UDP-galactose:(Galactosyl) LPS alpha1,2-galactosyltransferase WaaW n=29 Tax=Enterobacteriaceae RepID=Q9ZIS1_ECOLX Length = 342 Score = 290 bits (743), Expect = 4e-77, Method: Composition-based stats. Identities = 106/322 (32%), Positives = 172/322 (53%), Gaps = 7/322 (2%) Query: 21 KVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALA 80 ++ + L+IAYG D+NFLFG +S+ S++ +N + FH+FTDY +D + +A Sbjct: 16 EIANTDRVLNIAYGIDRNFLFGAAVSMQSVVMHNPDLAVKFHLFTDYIDEDYLQRVNAFT 75 Query: 81 LQY-KTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGT 139 + ++IY ++ + PS K W++A +FR V Y +LY+DAD+IC+G+ Sbjct: 76 SKNANVEVRIYKVSSAFIDIFPSLKQWSYATFFRLVAFQYLSETIENLLYIDADVICKGS 135 Query: 140 IEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQV 199 + L++ +F DK A V+ + EK A L + G+ YFN+G + + WA Sbjct: 136 LAGLLDINFDGDKFAAVIKDV-PFMQEKPAKRLAIEGLPGNYFNAGVVYLQLEAWAKNDF 194 Query: 200 SARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLK----ESFIN 255 +AIAML K DQD+LN+L IF Y+ + ++Y+LK E + Sbjct: 195 MNKAIAMLASDPQHTKYKCLDQDILNILFFGHCIFISGDYDCFYGIDYELKNKSDEDYKK 254 Query: 256 PVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKH 315 +T+DT IHY+G TKPW+DW +YP + F EA AS W + A + N Q + ++H Sbjct: 255 TITDDTKLIHYVGVTKPWNDWT-NYPCQKYFNEAYQASCWNDVAFIPATNEKQYQVKSRH 313 Query: 316 MLKKHRYLKGFSNYLFYFIEKI 337 + + F ++ Y+ +KI Sbjct: 314 LKRNGNIASSFYYFMLYYSKKI 335 >UniRef50_Q9ZIS6 UDP-galactose:(Glucosyl) LPS alpha1,2-galactosyltransferase WaaT n=26 Tax=Enterobacteriaceae RepID=Q9ZIS6_ECOLX Length = 331 Score = 289 bits (741), Expect = 7e-77, Method: Composition-based stats. Identities = 105/332 (31%), Positives = 172/332 (51%), Gaps = 8/332 (2%) Query: 10 EFLNSVIDYDHKVETENLC-LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYF 68 EF+ Y + EN L+++YG DKNFL+G G+SI+S+L N FH+FTDY Sbjct: 3 EFIKERFSYLADNKKENAPELNVSYGIDKNFLYGAGVSISSVLINNSDINFVFHVFTDYV 62 Query: 69 GDDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVL 128 DD K F+ A Q+ T I +YLI+ LP+++ W++A YFR + +Y +L Sbjct: 63 DDDYLKSFNETAKQFNTSIIVYLIDPKYFADLPTSQFWSYATYFRVLSFEYLSESISTLL 122 Query: 129 YLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLL 188 YLDAD++C+G+++PL F D+ A++ A L + + YFN+G + Sbjct: 123 YLDADVVCKGSLKPLTEIIFKDEFAAVIPDNDSTQAAC--AKRLNIPEMNGRYFNAGVIY 180 Query: 189 INTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQ 248 +N +W ++ + +L + + DQD LN+ I+ ++T ++L + Sbjct: 181 VNLKKWHEANLTPYLLKLLRGETKYGSLKYLDQDALNIAFNMNNIYLAKDFDTIYTLKNE 240 Query: 249 L----KESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPN 304 L + +T+ T+ IHY G TKPWH WA YP + F A+ SPWK L + Sbjct: 241 LYDRSHRKYQQTITDKTVLIHYTGITKPWHSWA-GYPSASYFNIAREQSPWKKYPLKEAR 299 Query: 305 NSNQLRYSAKHMLKKHRYLKGFSNYLFYFIEK 336 +++ KH+ Y+KG ++ + Y ++K Sbjct: 300 TVAEMQKQYKHLFAHGEYIKGITSLIKYKLKK 331 >UniRef50_B2PV91 Putative uncharacterized protein n=2 Tax=Enterobacteriaceae RepID=B2PV91_PROST Length = 342 Score = 287 bits (736), Expect = 3e-76, Method: Composition-based stats. Identities = 102/314 (32%), Positives = 165/314 (52%), Gaps = 8/314 (2%) Query: 28 CLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRI 87 CLD+ YG+D+N+ FG G+S S+L N + FH F D D + +A Q++ Sbjct: 24 CLDVIYGSDENYQFGAGVSAVSLLINNPTTFFRFHYFLDKVSPDFLEKLKVIASQFQVEF 83 Query: 88 KIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 +Y ++ L++LP++ W+ A+YFR V DY + LYLDAD++C G ++ N Sbjct: 84 HVYELDNKLLKTLPASDVWSSAMYFRLVALDYLSSDYDFALYLDADVMCNGILDLTTNL- 142 Query: 148 FPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAML 207 D +V + K L +AK YFNSG + +N +W +Q++ + +L Sbjct: 143 IKDKVCGVVADDIGVRT--KSETRLHAPSLAKTYFNSGVMFVNLKKWHEKQITQQCFELL 200 Query: 208 NEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQL----KESFINPVTNDTIF 263 + ++ +PDQDVLN++L + L ++NT ++L +L + + +T +T+ Sbjct: 201 SAENAKQRYKYPDQDVLNLILREDLELLSQRFNTVYTLKNELYDSTHQKYQQVITPETVL 260 Query: 264 IHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKKHRYL 323 IHY G +KPWH WA +YP SQ F +A SPW L + + KH+LK+ YL Sbjct: 261 IHYTGVSKPWHTWA-NYPASQPFYKALMQSPWTTNDLKPATKFVERKKEYKHLLKQGNYL 319 Query: 324 KGFSNYLFYFIEKI 337 G + + Y EK+ Sbjct: 320 AGILSGIRYSFEKL 333 >UniRef50_B6XJW1 Putative uncharacterized protein n=1 Tax=Providencia alcalifaciens DSM 30120 RepID=B6XJW1_9ENTR Length = 325 Score = 284 bits (727), Expect = 3e-75, Method: Composition-based stats. Identities = 107/309 (34%), Positives = 165/309 (53%), Gaps = 3/309 (0%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 L+IAYG DK FLFG G+S+ SI+ N +L FH+FTDY D+ + L L I Sbjct: 19 LNIAYGVDKGFLFGSGLSMNSIIINNSDIKLKFHLFTDYMNDEFLSKLEKLTLNENVNID 78 Query: 89 IYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSF 148 IY+IN D L+ LP + W++A YFRF I D+ +LYLDAD+ C+G++ I+ +F Sbjct: 79 IYIINADELKKLPISHVWSYATYFRFFIFDHLCETLSSILYLDADVFCKGSLRKYIDIAF 138 Query: 149 PDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLN 208 + A++ + L + I YFN+G + +N W + + +A ++ Sbjct: 139 NGEYAAVIPD--VPNMQISCVDRLSMPQIKDKYFNAGVIFLNLKVWDKNKFTKQAFNLIT 196 Query: 209 EPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQL-KESFINPVTNDTIFIHYI 267 K + + DQD LN++ + I+ YN ++L +L E++ + +T++T IHY Sbjct: 197 NNHTGKTLKYLDQDALNIIFNCQNIYLPRDYNCIYTLKNELEHENYKDYITSETKLIHYT 256 Query: 268 GPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKKHRYLKGFS 327 G TKPWH WA +YP SQ F A SPWKN L+ + + KH + ++L G S Sbjct: 257 GATKPWHYWAVNYPASQTFKVAFETSPWKNDELVDAKKKPEYQERYKHEFNQKKFLTGIS 316 Query: 328 NYLFYFIEK 336 + + Y K Sbjct: 317 SLIKYKKFK 325 >UniRef50_Q46Y64 Glycosyl transferase, family 8 n=1 Tax=Ralstonia eutropha JMP134 RepID=Q46Y64_RALEJ Length = 331 Score = 284 bits (726), Expect = 4e-75, Method: Composition-based stats. Identities = 79/318 (24%), Positives = 149/318 (46%), Gaps = 12/318 (3%) Query: 26 NLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKT 85 IA+ D N+ G +IASI+ N G FH+ T +++++ L Y Sbjct: 20 KPSFHIAFCVDDNYFRAMGATIASIIDNNPGQHFTFHVLTFSALEENQRRLKQLEEMYPV 79 Query: 86 RIKIYLING---DRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEP 142 +++L++ + +++ +I+ R VI + + +VLYLDADI+C ++ Sbjct: 80 STQLHLLDLASFTQFSHFLGHSHYSLSIFTRLVIPEVLQGQTDRVLYLDADILCVNRLDE 139 Query: 143 LINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSAR 202 L++ ++ VV +R +LG+A YFN G L IN +W A+ ++ + Sbjct: 140 LVDMDISNEI--AVVVPDAPVTLRRRVAALGLAHAE--YFNGGVLFINIDKWLAENITPQ 195 Query: 203 AIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLK-ESFINPVTNDT 261 + L + + DQD LN +L + + ++N + L + L F Sbjct: 196 TLEALLDTSTDMR--FNDQDALNKVLNGRAKYISPRWNYLYDLIHDLNVNRFAMRPVGKA 253 Query: 262 IFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTAL-LKPNNSNQLRYSAKHMLKKH 320 +FIH+ G KPW DW+ + F + SPW++ L +P N+ ++R ++ M ++H Sbjct: 254 VFIHFAGSVKPWADWS-GHEARGLFRKYLALSPWRDMPLDPEPRNTKEMRMHSRFMFRQH 312 Query: 321 RYLKGFSNYLFYFIEKIK 338 + ++ YL Y ++ + Sbjct: 313 KPVESLKWYLRYLRKRAQ 330 >UniRef50_C1DGU7 Lipopolysaccharide 1,3-galactosyltransferase n=1 Tax=Azotobacter vinelandii DJ RepID=C1DGU7_AZOVD Length = 326 Score = 283 bits (725), Expect = 5e-75, Method: Composition-based stats. Identities = 96/329 (29%), Positives = 159/329 (48%), Gaps = 15/329 (4%) Query: 19 DHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDA 78 + L IA+G D+N+L GI+I SI++ N G L FH+F R D Sbjct: 2 SDTATRNSDVLHIAFGVDENYLRPMGITIVSIIENNPGLELVFHVFISSISSASRVRLDR 61 Query: 79 LALQYKTRIKIYLINGD-RLRSLPSTK---NWTHAIYFRFVIADYFINKAPKVLYLDADI 134 L + + ++L++ ++ S K + + A Y R +I + + +VLYLDADI Sbjct: 62 LERMFARPVNLHLVDEMLDVKDPASGKGQAHISKAAYIRLLIPEALRDFTDRVLYLDADI 121 Query: 135 ICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQW 194 +C G I L++ A++ G KRA + YFNSG L I+ +W Sbjct: 122 LCVGDISGLLHLDIDGRTAAVIRDAGAE---SKRAGLVKKGQTLDNYFNSGVLYIDIPRW 178 Query: 195 AAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFI 254 + V++RA+ + +P + + + DQD LN++L + F D +N Q+ L +LK+ + Sbjct: 179 IERAVTSRALEKIADP--VLDLRYSDQDALNLVLDGDVRFIDKGWNHQYGLTGKLKKGRV 236 Query: 255 N-PVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQL---- 309 V +DT F+H+IGP KPW W + + F+ + SPW AL + ++ Sbjct: 237 GMDVPSDTKFVHFIGPMKPWRSWN-PHQSKELFLRYQALSPWAGEALDDNFSPREIYVYS 295 Query: 310 RYSAKHMLKKHRYLKGFSNYLFYFIEKIK 338 R+ + M ++ R+L G Y + K K Sbjct: 296 RFMYRSMFQQGRWLSGLIWYGKFLHRKHK 324 >UniRef50_Q9ZIT6 UDP-glucose:(Galactosyl) LPS alpha1,2-glucosyltransferase WaaJ n=26 Tax=Enterobacteriaceae RepID=Q9ZIT6_ECOLX Length = 339 Score = 279 bits (715), Expect = 8e-74, Method: Composition-based stats. Identities = 112/337 (33%), Positives = 183/337 (54%), Gaps = 9/337 (2%) Query: 6 FQETEFLNSVIDYDHKVET--ENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHI 63 F+ +I+ D + E ++++G D+N+ G ISIASIL+ N+ ++ FHI Sbjct: 5 FKHLTQFKDIIELDKRPVKLDERETFNVSWGIDENYQVGAAISIASILENNKQNKFTFHI 64 Query: 64 FTDYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINK 123 DY + + LA +Y+T IK+YLI+ + L++LP + W +IY+R + DYF + Sbjct: 65 IADYLDKEYIELLSQLATKYQTVIKLYLIDSEPLKALPQSNIWPVSIYYRLLSFDYFSAR 124 Query: 124 APKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFN 183 +LYLDADI+C+G++ LI F D+ A+V+ K A L YFN Sbjct: 125 LDSLLYLDADIVCKGSLNELIALEFKDEYGAVVIDVDA--MQSKSAERLCNEDFNGSYFN 182 Query: 184 SGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQF 243 SG + IN +W Q+++ + +L++ IIKK+ +PDQD+LN++ KYN + Sbjct: 183 SGVMYINLREWLKQRLTEKFFDLLSDESIIKKLKYPDQDILNLMFLHHAKILPRKYNCIY 242 Query: 244 SLNYQLKES----FINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTA 299 ++ + +E + + +DT+FIHY G TKPWHDWA +Y + F N SPW+N Sbjct: 243 TIKSEFEEKNSEYYTRFINDDTVFIHYTGITKPWHDWA-NYASADYFRNIYNISPWRNIP 301 Query: 300 LLKPNNSNQLRYSAKHMLKKHRYLKGFSNYLFYFIEK 336 K ++ + KH+L + ++L G + Y + K Sbjct: 302 YKKAVKKHEHKEKYKHLLYQKKFLDGVFTAIKYNVMK 338 >UniRef50_P27129 Lipopolysaccharide 1,2-glucosyltransferase n=44 Tax=Enterobacteriaceae RepID=RFAJ_ECOLI Length = 338 Score = 277 bits (709), Expect = 4e-73, Method: Composition-based stats. Identities = 106/324 (32%), Positives = 169/324 (52%), Gaps = 9/324 (2%) Query: 17 DYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYF 76 D+ + CL++AYG D N+L G G+SI SI+ N L F+I D + D + Sbjct: 16 DFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKI 75 Query: 77 DALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIIC 136 LA Q + RI +Y IN D+L+ LP T+ W+ A+YFR ++LYLDAD++C Sbjct: 76 AKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVC 135 Query: 137 QGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAA 196 +G I L++ A+V EK L + YFNSG + ++ +WA Sbjct: 136 KGDISQLLHLGLNGAVAAVVKD--VEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWAD 193 Query: 197 QQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKES---- 252 +++ +A+++L + + K +PDQDV+N+LL +F +YNT +++ +LK+ Sbjct: 194 AKLTEKALSILMSKDNVYK--YPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQN 251 Query: 253 FINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYS 312 + +T T+ IHY G TKPWH WA YP + + A SPWK+ + + + + Sbjct: 252 YKKLITESTLLIHYTGATKPWHKWAI-YPSVKYYKIALENSPWKDDSPRDAKSIIEFKKR 310 Query: 313 AKHMLKKHRYLKGFSNYLFYFIEK 336 KH+L +H Y+ G + Y K Sbjct: 311 YKHLLVQHHYISGIIAGVCYLCRK 334 >UniRef50_D0KD54 Lipopolysaccharide glucosyltransferase I n=2 Tax=Pectobacterium RepID=D0KD54_PECWW Length = 336 Score = 275 bits (703), Expect = 2e-72, Method: Composition-based stats. Identities = 122/337 (36%), Positives = 188/337 (55%), Gaps = 11/337 (3%) Query: 4 VFFQETEFLNSVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHI 63 VF + L SV + H+ ++ L++AYG DKN+ GCG+SI SIL N FH+ Sbjct: 2 VFSSHIDVL-SVFEKRHQSIADHDTLNVAYGIDKNYAVGCGVSITSILINNS-IDFTFHV 59 Query: 64 FTDYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINK 123 F+D F DD K LA ++KT+I +Y IN + L++LP T W+HA+YFR + + +K Sbjct: 60 FSDDFDDDFIKKISILAEKFKTKIILYKINSEMLKTLPCTDIWSHAMYFRLLAFSHLSDK 119 Query: 124 APKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFN 183 +LYLDAD++C+G++E L + A++ + +K A L +A + YFN Sbjct: 120 TSSLLYLDADVMCKGSLEQLHKLNTAPHVAAVIRD--VPEMQKKSASRLKMAALEGEYFN 177 Query: 184 SGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQF 243 SG L N W ++ + L + E + I +PDQD++N+LL + F +YNT + Sbjct: 178 SGVLFANLDIWNKLDLTQKIFDKLRDGE--ESIQYPDQDIMNILLNGNVTFLPKEYNTIY 235 Query: 244 SLNYQLKES----FINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTA 299 S+ +LK+S + + +DTI IHY G TKPWH WA +YP + F A+ SPW + Sbjct: 236 SIKNELKDSNHQKYKEVIKDDTILIHYTGVTKPWHKWA-NYPSTSYFQHAQENSPWSTSD 294 Query: 300 LLKPNNSNQLRYSAKHMLKKHRYLKGFSNYLFYFIEK 336 L + +++ KH+LKK +YL G + Y + K Sbjct: 295 LKDADTFVEMKKKYKHLLKKGKYLSGLISAFKYSLNK 331 >UniRef50_D1PTN4 Putative uncharacterized protein n=1 Tax=Prevotella bergensis DSM 17361 RepID=D1PTN4_9BACT Length = 305 Score = 267 bits (682), Expect = 5e-70, Method: Composition-based stats. Identities = 66/284 (23%), Positives = 124/284 (43%), Gaps = 15/284 (5%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 +DI + D N+L C ++ SIL N+ ++ FH+ ++ ++ R + +A Y ++ Sbjct: 1 MDIVFNIDDNYLMQCCTTMVSILHNNKDGQISFHVISNGLTNESRLKIEQVAEAYHQQVF 60 Query: 89 IYLINGD---RLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLIN 145 Y++N + + + A Y R +AD + K++Y+D D+I G+++ L N Sbjct: 61 FYVVNPEAMSDYEIFDKQGHISMATYLRLFVADILPERLHKIIYMDCDLIVNGSLDGLWN 120 Query: 146 FSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIA 205 +A V LG A YFN+G L++N W VS +A Sbjct: 121 TDVEGYALAAVED--MWSGKADNYVRLGY-DAADTYFNAGVLVVNLDYWREHNVSQQAAQ 177 Query: 206 MLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKE------SFINPVTN 259 + ++ DQDVLN L D + ++N Q L + ++ ++ Sbjct: 178 YVALH--AGQLKFNDQDVLNGLFHDSKLLLPFRWNVQDGLLRKRRKIRPEVMPKLDQELE 235 Query: 260 DTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKP 303 + + IH+ G KPW+ ++ P F + + + W+ + P Sbjct: 236 NPVIIHFTGHRKPWN-FSCLNPYKNLFFKYVDMTEWRGFRPIVP 278 >UniRef50_D1P7H1 Lipopolysaccharide 1,2-glucosyltransferase n=1 Tax=Providencia rustigianii DSM 4541 RepID=D1P7H1_9ENTR Length = 324 Score = 263 bits (673), Expect = 5e-69, Method: Composition-based stats. Identities = 111/332 (33%), Positives = 168/332 (50%), Gaps = 14/332 (4%) Query: 10 EFLNSVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFG 69 + + S+I + EN I YG D+ FL+G G SIAS++ N+ + FHIF D Sbjct: 5 DMIKSLIKINDNERHENSYFHIGYGVDEKFLYGVGTSIASVMLNNKDTDFHFHIFVDNLP 64 Query: 70 DDDRKYFDALALQYKTRIKIYLINGDRLRSLP-STKNWTHAIYFRFVIADYFINKAPKVL 128 D++ F +I IY I+ ++ + LP +K W+HAIYFR +I Y + +L Sbjct: 65 DEN--LFREAVQGTSHKITIYFIDNEKFKLLPLPSKAWSHAIYFRLLIISYLSSSIDSLL 122 Query: 129 YLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLL 188 YLDADIIC+G + L +F + V + + YFNSGFL Sbjct: 123 YLDADIICKGDLSELKALTFDEKTFVYAVKDKFCS------EKQNLPIDMSKYFNSGFLY 176 Query: 189 INTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLN-Y 247 ++ A + + R I ++ + + +HPDQD LN+LL DKLI YN FSL+ Y Sbjct: 177 MSLKHLAQENIPNRVIELVEKND----FSHPDQDALNVLLNDKLINISENYNYMFSLDWY 232 Query: 248 QLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSN 307 + + + + +FIH++G TKP+H+WA Y + A+ SPWKN LLKP Sbjct: 233 ITSKGHLAKIPDSVVFIHFVGLTKPFHEWASFYEEYKYLESARKNSPWKNIPLLKPEGYK 292 Query: 308 QLRYSAKHMLKKHRYLKGFSNYLFYFIEKIKH 339 QL H+ K +Y++ + Y ++K H Sbjct: 293 QLSRKKSHLRKNGKYVEFIFTTIQYLMKKTFH 324 >UniRef50_C7QL87 Glycosyl transferase family 8 n=2 Tax=Cyanothece RepID=C7QL87_CYAP0 Length = 283 Score = 261 bits (668), Expect = 2e-68, Method: Composition-based stats. Identities = 78/296 (26%), Positives = 128/296 (43%), Gaps = 13/296 (4%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 +DI + DKN+ G++I S++ N H+ T D K D L + + + Sbjct: 1 MDILFCFDKNYEQHFGVAITSLILNNTNKIKTIHLVTKDNSKDFLKKIDKLKSKTQAKFF 60 Query: 89 IYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSF 148 IY + L ++ + + + A Y+R + + K+LYLD+D++ ++E L N Sbjct: 61 IYSPDDKDLSNVKVSAHISTAAYYRLLAPELLPQDLKKILYLDSDLVVNSSLENLYNMDI 120 Query: 149 PDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLN 208 DD +A +KR G YFNSG +LIN W + + + L Sbjct: 121 SDDILAAYAGGKMGPGTKKRLQLTG-----DFYFNSGVMLINLEAWRTENIGNKCFKFLQ 175 Query: 209 EPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIG 268 E + ++ DQD LN ++ K + D +N+ L + VTN +I IH+ G Sbjct: 176 ENPDMIRLW--DQDALNKIVDGKFLNIDGIWNSLVDLT-----TGETRVTNQSIIIHFTG 228 Query: 269 PTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKKHRYLK 324 KPW W P Q + SPW N P N ++ + K + K+ + K Sbjct: 229 TLKPWQSWCIR-PEKQIYWYYLRQSPWSNAYPQFPKNFQEMLLAIKSVYKQIKPKK 283 >UniRef50_C7IBC1 Glycosyl transferase family 8 n=2 Tax=Clostridium papyrosolvens DSM 2782 RepID=C7IBC1_9CLOT Length = 452 Score = 258 bits (659), Expect = 2e-67, Method: Composition-based stats. Identities = 67/288 (23%), Positives = 115/288 (39%), Gaps = 15/288 (5%) Query: 27 LCLDIAYGTDKNFLFGCGISIASILKYNE-GSRLCFHIFTDYFGDDDRKYFDALALQYKT 85 + I D +++ G+ I S+L+ + L F++ D D++ + Y Sbjct: 2 ETVKIVSACDSHYVQHLGVMITSLLENTSMKTSLEFYVIDGGITDADKELLCSCTCLYGC 61 Query: 86 RIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLIN 145 +I I D + + + A YFR +++ KV+YLD DI+ I L Sbjct: 62 KINFITIQADFYARFGESPSASDATYFRIFVSELLDTSVEKVIYLDCDIVVIKDIAELWK 121 Query: 146 FSFPDDKVAMVVTEGQADWWEKRAHSLGVAGI--AKGYFNSGFLLINTAQWAAQQVSARA 203 + +A V G E G+ YFN+G LLIN +W + +S Sbjct: 122 TDVSEYFLAAVADCGVEYSGEYAVTLKRKLGMKRKDCYFNAGVLLINLVKWREESISKSI 181 Query: 204 IAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINP-----VT 258 L E + KI DQD LN +L ++ + D ++N Q + ++ + Sbjct: 182 CKFLFENK--GKIDFADQDGLNAVLCNRWLPLDSRWNQQVAHCEFYEQEKVVWENVTRAV 239 Query: 259 NDTIFIHYIGP----TKPWHDWAWDYPVSQAFMEAKNASPWKNTALLK 302 + IHY TKPW+ + +P Q + + +PWK+ Sbjct: 240 REPWIIHYTTSYFSGTKPWN-YLDMHPYRQEYYRYLHMTPWKSFIPPD 286 >UniRef50_C3X7M2 Lipopolysaccharide 3-alpha-galactosyltransferase n=1 Tax=Oxalobacter formigenes OXCC13 RepID=C3X7M2_OXAFO Length = 307 Score = 257 bits (657), Expect = 4e-67, Method: Composition-based stats. Identities = 81/311 (26%), Positives = 140/311 (45%), Gaps = 11/311 (3%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 IA+G D + ++IASIL+ N+ S + FH+ + D L + I Sbjct: 5 FHIAFGVDTIYAPKMCVTIASILENNKNSNIIFHVIYNDLSDKVIDEIKKSMLTLQAEIN 64 Query: 89 IYLINGDR--LRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINF 146 + I+ D + + T + RF I + + LYLDADIIC I L + Sbjct: 65 FHFIDVDLSIFPKFSNFSHITSGAFLRFFIPELLQGLTDRALYLDADIICINNISDLFHL 124 Query: 147 SFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAM 206 ++++ VV + ++ + A K YFNSG L+++ +W V + +++ Sbjct: 125 EMDENEILAVVEDIDSETYLNEN-----ASFQKRYFNSGVLMMDIEKWNKNNVYGQLLSV 179 Query: 207 LNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHY 266 LNE + DQD LN+++ DK+ + D +N + K+ V + FIH+ Sbjct: 180 LNEKGSGFNLI--DQDALNLVMIDKVHYLDNIWNYMINAEQLDKKKEKYSVPENAKFIHF 237 Query: 267 IGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKKHRYLKGF 326 +GP KPWH + ++ ++ + + W L P N ++R A++ KK YL G Sbjct: 238 VGPVKPWHCYNIFDDITGLYLNYQKKTVWDG--LEMPKNYKEMRRYARYSFKKGNYLTGL 295 Query: 327 SNYLFYFIEKI 337 + + Y K Sbjct: 296 NWGMRYIKTKF 306 >UniRef50_UPI0001B4A0A4 putative glucosyltransferase n=1 Tax=Bacteroides sp. 2_1_7 RepID=UPI0001B4A0A4 Length = 301 Score = 257 bits (656), Expect = 5e-67, Method: Composition-based stats. Identities = 67/308 (21%), Positives = 134/308 (43%), Gaps = 13/308 (4%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 +DI TD N++ CG+ + SI N + HI T+ ++++ + +Y +I+ Sbjct: 1 MDIVCCTDNNYVIPCGVLVTSICVNNPKEEITVHILTEGISPENQEVLKKVVAKYGQQIQ 60 Query: 89 IYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSF 148 Y ++ + P +++ T A YFR ++ D KVLYLD D++ + ++ L + Sbjct: 61 FYTVDKKVFANCPISRHITLATYFRLIMTDILPKSVEKVLYLDCDVVVRHSLRSLWDTDI 120 Query: 149 PDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLN 208 ++ D R ++ + GYFN+G LL+N W +S ++N Sbjct: 121 KSYAAGVIPDMSIDDI---RIYNRLQYSPSLGYFNAGVLLVNLRYWRENNLSESFFEIIN 177 Query: 209 EPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNY------QLKESFINPVTNDTI 262 + +++ + DQDVLN++L + + +KYN Q + + D + Sbjct: 178 KY--PERLRYHDQDVLNIVLKEIKLTLPMKYNVQHGYFFKDPLISRTYRDEREQAITDPV 235 Query: 263 FIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKKHRY 322 +HY G KPW ++ P + F + S + + +++ + +L+K Sbjct: 236 ILHYSGS-KPWF-IEFEPPFKKDFAFYLDTSGLDKSFIRHIPMKARIKARFRSLLEKLGL 293 Query: 323 LKGFSNYL 330 + + Sbjct: 294 IAPKDSLF 301 >UniRef50_C0WCJ1 Lipopolysaccharide 1,2-glucosyltransferase n=1 Tax=Acidaminococcus sp. D21 RepID=C0WCJ1_9FIRM Length = 338 Score = 255 bits (652), Expect = 2e-66, Method: Composition-based stats. Identities = 81/333 (24%), Positives = 143/333 (42%), Gaps = 24/333 (7%) Query: 5 FFQETEFLNSVIDYDHKVE-TENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHI 63 +F FL V + E T+ L +AY + + G S+ S+L+ N + FHI Sbjct: 8 YFVPARFLKGVETFSKNAEKTDKAPLHVAYNVNDGYFQIMGASLVSVLENNAHRAVMFHI 67 Query: 64 FTDYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSLP-STKNWTHAIYFRFVIADYFIN 122 FTD + ++ + + LA +Y IK+Y ++ + + ++ Y R V+ Sbjct: 68 FTDGYSKENAQKMEQLADRYGCVIKLYTLHMEPFADFHVKVERFSRITYGRIVMPLILAA 127 Query: 123 KAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYF 182 + LYLDAD + ++ L ++ + V + ++R L + YF Sbjct: 128 ETDHFLYLDADTMVIRPLDELYHWDLTGKAMGAVSE--RMPDAKRRGDYLHLNN--GRYF 183 Query: 183 NSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQ 242 N G +++N +W Q ++ +A ++ EP+ ++ QD+LN++ F YN Sbjct: 184 NDGVMMVNIPEWQKQNITEKAFSLQKEPK--ERFLGQSQDILNIVFDGTNAFLPSIYN-- 239 Query: 243 FSLNYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKN----T 298 + +P T IH+ G KPW DY + ASPW+ Sbjct: 240 -----EFGGGEDDPQQKGT-IIHWTGRRKPWQMVLSDYDAQ--WRSYNAASPWETLTAIL 291 Query: 299 ALLKPNNSNQLRYSAKHMLKK--HRYLKGFSNY 329 +LKP N + + AK+ K+ Y+KG + Y Sbjct: 292 PILKPENYHDFKEWAKYRRKESFRDYVKGMAYY 324 >UniRef50_Q5LF36 Putative glucosyltransferase n=1 Tax=Bacteroides fragilis NCTC 9343 RepID=Q5LF36_BACFN Length = 308 Score = 253 bits (647), Expect = 6e-66, Method: Composition-based stats. Identities = 73/300 (24%), Positives = 128/300 (42%), Gaps = 14/300 (4%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 +DI + D +++ CG++I S+ N + FHI T +R+ + +Y+ +I Sbjct: 1 MDIVHCIDNSYVAQCGVTITSVCVNNVNEVILFHILTTNLSIFNREMLKKIVDKYRQKII 60 Query: 89 IYLINGDRLRS--LPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINF 146 Y ++ L L + + A YFR ++ D KVLYLD D++ I+ L + Sbjct: 61 FYNVDEYLLNKCPLREGDHVSLATYFRILMPDILPKSLNKVLYLDCDLVVCKNIKRLWDT 120 Query: 147 SFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAM 206 + V G D R ++ I +GYFN+G LL+N A W +S + + Sbjct: 121 DISTHSLGAVYDGGTDDI---RTYNRLKYDIRQGYFNAGVLLVNLAYWREFHISNKLLKF 177 Query: 207 LNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQ---LKESFINPVT---ND 260 + + +++ DQD LN +L KYN + + L+E ++ + D Sbjct: 178 IEQY--PERLMFWDQDALNSVLIQTTKILPFKYNMLDAFYTKELALREEYLFEIEGALCD 235 Query: 261 TIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKKH 320 +H+ P KPW D+P+ F E + W + + P N + + K Sbjct: 236 PTILHFSSPNKPWLK-TCDHPLKSFFFEYLKRTSWNDKFPIYPFNMSLKSRLCLFLWNKG 294 >UniRef50_D2ELM0 Lipopolysaccharide biosynthesis glycosyltransferase n=1 Tax=Pediococcus acidilactici 7_4 RepID=D2ELM0_PEDAC Length = 552 Score = 253 bits (646), Expect = 7e-66, Method: Composition-based stats. Identities = 61/292 (20%), Positives = 122/292 (41%), Gaps = 16/292 (5%) Query: 9 TEFLNSVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYF 68 +L+ + ++E +++ + F S SIL+ + + F + D+ Sbjct: 259 HPYLDEYHEELGELEMHRGVINVISAANSAFTQALATSYVSILENDPDHQYNFFLLPDHL 318 Query: 69 GDDDRKYFDALALQY-KTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKV 127 D D ++ +Y IK+ +N + L + + Y+R + + Sbjct: 319 TDRDMMLLGSIIARYDNATIKVVEVNEELLANAVESDRIVKTAYYRILAPALLP-SINRA 377 Query: 128 LYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFL 187 +YLD DII ++ L + + +A V G D R +G+ + YFNSG + Sbjct: 378 IYLDCDIIANTSLHELWQTNLEGNVIAAVEDAGFHD----RLEKMGITKENEKYFNSGMM 433 Query: 188 LINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNY 247 LI+ +W A+ + + + +N+ +K+ DQD LN L D + ++N Q ++ Sbjct: 434 LIDLVRWRARSTTQKVLDYINQN--PEKLRFHDQDALNANLYDDWLHLHPQWNAQSNIIM 491 Query: 248 QL----KESFINPV---TNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNA 292 + + + P D IH+ G KPWH+ ++P + +++ Sbjct: 492 ETIFPPRTELLEPYAETREDPKLIHFCGHVKPWHE-GCEHPYADVYLKYHEM 542 Score = 220 bits (561), Expect = 6e-56, Method: Composition-based stats. Identities = 68/274 (24%), Positives = 124/274 (45%), Gaps = 20/274 (7%) Query: 29 LDIAYGTDKNFLFGCGISIASILKY-NEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRI 87 ++I D+N+ I+I + L+ N +R F + T+ GD R D L + T I Sbjct: 4 INILLAADRNYADQLCITIKTALETLNSATRAHFIVLTNNLGDQTRALLDKLMHNFHT-I 62 Query: 88 KIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFIN-KAPKVLYLDADIICQGTIEPLINF 146 + ++ +R P+ ++ YFR + + + +++YLD D++ + + L Sbjct: 63 EYLNLDDERFDFCPTNQHINKTAYFRIIAPKLLASRQIDRLIYLDVDVLIRKDLTELAES 122 Query: 147 SFPDDKVAMVVTEGQADWWEKRAHSLGVAGI---AKGYFNSGFLLINTAQWAAQQVSARA 203 + + V V+ GQA H LGV + + YFNSG ++I+ AQW A +++ + Sbjct: 123 NLNQNTVGAVIDTGQAFA----LHRLGVDPVVAASNLYFNSGIMVIDVAQWNAHRITEKT 178 Query: 204 IAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKE-------SFINP 256 +A + +I DQD LN +LA ++ F K+N Q S+ ++ I+ Sbjct: 179 LAFIRNH--ADRIIFHDQDALNAVLAGEVQFLHPKWNLQNSIIFRKHRPINQGYAELIDE 236 Query: 257 VTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAK 290 + +H+ KPW D +P + E Sbjct: 237 AIKEPSIVHFTTHEKPWKDLTV-HPYLDEYHEEL 269 >UniRef50_A1BHG0 Glycosyl transferase, family 8 n=2 Tax=Chlorobium/Pelodictyon group RepID=A1BHG0_CHLPD Length = 307 Score = 251 bits (642), Expect = 3e-65, Method: Composition-based stats. Identities = 65/284 (22%), Positives = 117/284 (41%), Gaps = 20/284 (7%) Query: 24 TENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQY 83 ++I + TDKN++ ++ S+L+ N+ +I + + + + + Sbjct: 3 HMKNTVNIVFATDKNYIQHLSAALVSLLENNKDLSFTVYIISSGMSEKSYRNIEEIIKTG 62 Query: 84 KTRIKIYLINGDRLRSLPS-TKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEP 142 +K ++ + L + + Y+R +I D K+LYLD+DII G+I+ Sbjct: 63 NCTVKHITVSDELFVKLATAHPFYPKGTYYRLLIPDLI--DEEKILYLDSDIIVNGSIKE 120 Query: 143 LINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSAR 202 L N D V + G H YFNSG +LIN A+W + + + Sbjct: 121 LYNQDVEDYFVCAIEDPGFDR------HRQLQMDKESIYFNSGMMLINLAKWKSTGLQKK 174 Query: 203 AIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKES--------FI 254 I + I PDQ LN ++ + +KYN Q S+ E + Sbjct: 175 VIDFIE--HNPDAIWFPDQCGLNSVINGRWKKVPLKYNQQSSIFSDDFEKKFDCFSVEEL 232 Query: 255 NPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNT 298 + + IHY G +KPWH + +P + + + +P++N Sbjct: 233 AEAKKNPVIIHYTGGSKPWH-FKNRHPYKKLYWKYLKMTPYRNA 275 >UniRef50_P25148 General stress protein A n=3 Tax=Bacillus subtilis group RepID=GSPA_BACSU Length = 286 Score = 249 bits (635), Expect = 1e-64, Method: Composition-based stats. Identities = 60/283 (21%), Positives = 113/283 (39%), Gaps = 15/283 (5%) Query: 25 ENLCLDIAYGTDKNFLFGCGISIASILKYNEGSR-LCFHIFTDYFGDDDRKYFDALALQY 83 ++ + I D N+ G S+L + R + ++ D++K + L++ Sbjct: 3 KDEIMHIVSCADDNYARHLGGMFVSLLTNMDQEREVKLYVIDGGIKPDNKKRLEETTLKF 62 Query: 84 KTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINK-APKVLYLDADIICQGTIEP 142 I+ ++ + + + T A Y+R I D ++ +++Y+D D + I Sbjct: 63 GVPIEFLEVDTNMYEHAVESSHITKAAYYRISIPDLIKDESIKRMIYIDCDALVLEDISK 122 Query: 143 LINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSAR 202 L + VA V GQ +R + V K YFNSG ++I+ W Q ++ + Sbjct: 123 LWDLDIAPYTVAAVEDAGQ----HERLKEMNVTDTGK-YFNSGIMIIDFESWRKQNITEK 177 Query: 203 AIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLK-------ESFIN 255 I +NE + DQD LN +L D+ ++N Q + +LK N Sbjct: 178 VINFINEHPDEDFLVLHDQDALNAILYDQWYELHPRWNAQTYIMLKLKTPSTLLGRKQYN 237 Query: 256 PVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNT 298 + +H+ G KPW+ +P + + + W Sbjct: 238 ETRENPAIVHFCGGEKPWNS-NTKHPYRDEYFHYMSYTKWNTI 279 >UniRef50_A7VNX5 Putative uncharacterized protein n=1 Tax=Clostridium leptum DSM 753 RepID=A7VNX5_9CLOT Length = 344 Score = 249 bits (635), Expect = 1e-64, Method: Composition-based stats. Identities = 55/306 (17%), Positives = 121/306 (39%), Gaps = 28/306 (9%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEG-SRLCFHIFTDYFGDDDRKYFDALALQYKTRI 87 ++ + +D NF G ++ S+ + N + +I + +++ +++ QY+ + Sbjct: 13 MNCVFSSDDNFADILGCALISLFENNREQETIEVYILDGGISEGNKRKLESIFQQYERMV 72 Query: 88 KIYLING--DRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLIN 145 + ++ W + + R +I + +VLYLD DI+ G+++ L Sbjct: 73 HFIEVPDISQLTGEAVTSGRWPISTFARILIDSLLPKEVKRVLYLDCDILVLGSLKNLWE 132 Query: 146 FSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIA 205 D A V+ +R + G+ G Y N+G +LI+ +W Q+ + + Sbjct: 133 IDLKDKTAAGVMDC----LSNQRKQNAGING-EDSYINAGVMLIDMDKWRENQIEKQCMN 187 Query: 206 MLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFS---------LNYQLKESF--- 253 + ++ + DQ V+N +L L+ +YN + Y+ +S+ Sbjct: 188 YIRICN--GQVAYNDQGVINKVLHKDLLVLPPEYNAMTLFFDFTYPDMIKYRKPQSYYSA 245 Query: 254 --INPVTNDTIFIHYIGPT---KPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQ 308 ++ +H+ +PW ++P + + SPW+ L N S+ Sbjct: 246 QQVDHARKHPRIVHFTSSFLSLRPWVK-GSEHPYAPLWRNYYKRSPWRAKDLRSDNRSSY 304 Query: 309 LRYSAK 314 + K Sbjct: 305 RKIYEK 310 >UniRef50_D2RIJ4 Glycosyl transferase family 8 n=2 Tax=Acidaminococcus RepID=D2RIJ4_ACIFE Length = 309 Score = 246 bits (628), Expect = 1e-63, Method: Composition-based stats. Identities = 57/286 (19%), Positives = 102/286 (35%), Gaps = 17/286 (5%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSR-LCFHIFTDYFGDDDRKYFDALALQYKTRI 87 + I +D N+ ++ ASIL + G R + F+ F D ++ + A + I Sbjct: 4 ISIVLASDDNYAQHGAVACASILANHRGERPIHFYYFDDGISEEKQAGIAATVTGLQGSI 63 Query: 88 KIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 G ++ ++ + A Y R +I + +V+YLD D++ I+ L Sbjct: 64 TFIPTAGKEIQ-AHTSGHVNRAAYLRLLIPELVPQAVHRVIYLDTDLVVLDDIQELWEMD 122 Query: 148 FPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKG--YFNSGFLLINTAQWAAQQVSARAIA 205 V V G R GI +G YFNSG +++ W +Q + I Sbjct: 123 LQGKPVGAVPDLGILASSRMRRQKEETLGIQEGKLYFNSGVMVMELEAWREKQYGDQVIR 182 Query: 206 MLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFS--------LNYQLKESFINPV 257 + E H DQD LN + D +++N L + Sbjct: 183 CVEE----GNFRHHDQDGLNKVFQDNWQPLPLRWNVIPPVFTLPVKVLKKSRWRNLALEA 238 Query: 258 TNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKP 303 H+ G KPW + ++ + + + + +P Sbjct: 239 LERPAVFHWAGRYKPWEFPPKGH-FNEKYYTYLARTAFAGAKMPQP 283 >UniRef50_Q38VK7 Putative bifunctional glycosyl transferase, family 8 n=2 Tax=Lactobacillus sakei subsp. sakei 23K RepID=Q38VK7_LACSS Length = 569 Score = 245 bits (625), Expect = 2e-63, Method: Composition-based stats. Identities = 67/294 (22%), Positives = 121/294 (41%), Gaps = 17/294 (5%) Query: 6 FQETEFLNSVIDYDHKVETENL-CLDIAYGTDKNFLFGCGISIASILKYNEGSR-LCFHI 63 F E +++ Y ++ + ++I + NF+ I ASIL N+ R F + Sbjct: 261 FLEHPYMSEYQVYLSQLPADKRDQINIVSAANSNFVEPLAILYASILNNNDDDRHYAFFV 320 Query: 64 FTDYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINK 123 +D D+ + + + ++ L ++ Y+R +I + + Sbjct: 321 LSDQLTARDQATLRQITESFNAELTFIEVDEIPLTAVIQDGQVLKTAYYRLLIPNLLP-E 379 Query: 124 APKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFN 183 +VLYLD D +C + L + + VA V G + R + + + YFN Sbjct: 380 IERVLYLDCDTLCLENLARLWDVELGNIPVAAVEDAGFHN----RLAQMAIDYKSIRYFN 435 Query: 184 SGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQF 243 +G LL+N W Q+++ + + + E +K+ DQD LN +L D+ I K+N Q Sbjct: 436 AGVLLMNLTIWRQQKITEQILTFIKEY--PQKLRFHDQDALNAILHDRWIHLHPKWNVQT 493 Query: 244 SLNYQL---KESFIN----PVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAK 290 S+ IN + IH+ G KPW D + +P + + K Sbjct: 494 SILMDFIVAPTERINRQFLSAQKEPGLIHFCGSEKPW-DKSSTHPYTPQYRFYK 546 Score = 189 bits (481), Expect = 1e-46, Method: Composition-based stats. Identities = 54/278 (19%), Positives = 110/278 (39%), Gaps = 17/278 (6%) Query: 27 LCLDIAYGTDKNFLFGCGISIASILKY-NEGSRLCFHIFTDYFGDDDRKYFDALALQYKT 85 + I D+ + +++ SI ++ + + + + + + L Sbjct: 8 KTIAIMVAADEQYADQMLLTLKSIREHCTLETAIDLFVLSSDLSHATKSAVNRLMT-LPH 66 Query: 86 RIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFIN-KAPKVLYLDADIICQGTIEPLI 144 + IN R+++ P ++ Y+R + + +VLYLD D + + + PL Sbjct: 67 HVSFIAINPRRIKNFPGNNHFDQTAYYRILAPQILLARHIERVLYLDLDTLIRTDLTPLY 126 Query: 145 NFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAI 204 + + + V+ G+A ++ A YFN+G L+I+T W +S + + Sbjct: 127 DSDLEGNIIGAVIDPGKALTLKRLGVPKSQAN--NIYFNAGVLIIDTILWETHHISQKIL 184 Query: 205 AMLNEPEIIKKITHPD-QDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTN---- 259 AML + D QD LN++LA + K+N Q ++ ++ E N + Sbjct: 185 AMLVPYPGRRV---NDIQDALNVVLAGRTKLLAPKWNVQNAILFKTYEPINNEYSQLFKQ 241 Query: 260 ---DTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASP 294 IH+ KPW + ++P + + P Sbjct: 242 AIMAPKIIHFTTEKKPW-EVFLEHPYMSEYQVYLSQLP 278 >UniRef50_Q03HK5 Lipopolysaccharide biosynthesis glycosyltransferase n=1 Tax=Pediococcus pentosaceus ATCC 25745 RepID=Q03HK5_PEDPA Length = 549 Score = 242 bits (619), Expect = 1e-62, Method: Composition-based stats. Identities = 65/296 (21%), Positives = 125/296 (42%), Gaps = 18/296 (6%) Query: 6 FQETEFLNSVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFT 65 E +L+ + +++E +++ + F+ S SIL+ + ++ F++ Sbjct: 256 LSEHPYLDEYHEELNELEINRGVVNVISAANSAFVEALATSYISILENDSENQYNFYLLP 315 Query: 66 DYFGDDDRKYFDALALQY-KTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKA 124 D+ D ++ +Y IKI ++ L + + + Y+R + + Sbjct: 316 DHLDQRDMLILGSVISRYDNASIKIVKVDEKLLENAVESDRILKSAYYRILAPELLP-NI 374 Query: 125 PKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNS 184 + +YLD DII + L S + +A V G D R +G+ YFNS Sbjct: 375 NRAIYLDCDIIANTNLHDLWQTSLEGNVLAAVEDAGFHD----RLEHMGITHDNSKYFNS 430 Query: 185 GFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFS 244 G +LI+ W +Q V+ R + +N +K+ DQD LN +L DK + K+N Q + Sbjct: 431 GMMLIDLVSWRSQAVTQRVLDYIN--HNPEKLRFHDQDALNAILYDKWLHLHPKWNAQSN 488 Query: 245 L--------NYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNA 292 + +L + + + IH+ G KPWH + +P + +++ Sbjct: 489 IVLDALVPPRTELLKLYAET-RENPKLIHFCGHVKPWHAES-KHPYTNVYLKYNKK 542 Score = 212 bits (541), Expect = 1e-53, Method: Composition-based stats. Identities = 55/273 (20%), Positives = 110/273 (40%), Gaps = 14/273 (5%) Query: 29 LDIAYGTDKNFLFGCGISIASILKY-NEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRI 87 +++ D+N+ I+I + L+ N+ +R+ F + ++ + + LA T + Sbjct: 4 INVLLAADENYADQLQITIKTTLENLNKKTRVNFIVLSNNLSNSTKLALKKLAHGLHT-V 62 Query: 88 KIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFIN-KAPKVLYLDADIICQGTIEPLINF 146 + ++ P+ + Y+R + ++LYLD D++ + + L + Sbjct: 63 EYLDLDPSVFAFCPTNSHINKTAYYRILAPQLLAKRNIDRILYLDVDLLVRHDLTELYDA 122 Query: 147 SFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAM 206 + V V+ GQA + V YFNSG L+I+ +W ++ + + Sbjct: 123 ELNHNIVGAVIDTGQAFALNRLGVD-PVVAANNIYFNSGILVIDIKKWNENHITEKTLNY 181 Query: 207 LNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLK-------ESFINPVTN 259 + I DQD LN +LA + K+N Q S+ ++ + IN Sbjct: 182 IK--HQSHLIIFHDQDALNAVLAGHVQMLHPKWNLQNSIVFRKHRPINEAYDQLINEAIK 239 Query: 260 DTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNA 292 +H+ KPW + ++P + E N Sbjct: 240 SPAIVHFTTHEKPWKTLS-EHPYLDEYHEELNE 271 >UniRef50_Q64ZV2 Lipopolysaccharide 1,2-glucosyltransferase n=8 Tax=Bacteroides RepID=Q64ZV2_BACFR Length = 311 Score = 242 bits (618), Expect = 1e-62, Method: Composition-based stats. Identities = 68/313 (21%), Positives = 119/313 (38%), Gaps = 16/313 (5%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 + IA D NF C +++ S+ N S C HI + D+K ++A Y +I Sbjct: 2 IHIACNIDSNFTIHCAVTLTSLFANNRNSEFCVHIIASTLPEADQKALSSIAESYGNKIC 61 Query: 89 IYLINGDRLRSLPSTK---NWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLIN 145 Y D L + K + A Y+R +++ K+LY+D DI+ I + Sbjct: 62 FYFPEKDLLNNFSIKKSGNRISIATYYRCLLSRILPVNIDKILYIDCDIVVLNDISEFWD 121 Query: 146 FSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIA 205 + + G E+ +S YFN+G LLIN W ++ Sbjct: 122 TDITQYAIGCIEDIGSD---EEEYYSRLQYDKKYSYFNAGVLLINLKYWREHKIDEMCEQ 178 Query: 206 MLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQL------KESFINPVTN 259 +I DQD+LN LL +F ++N Q + + + S + Sbjct: 179 YFLAHS--DRIRFNDQDLLNALLYKDKLFVPFRWNVQDTFYRRTYSHKVKEHSGLKEALL 236 Query: 260 DTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKK 319 +HY KPW+ + +P+ Q + + + +PWK T + + + + + Sbjct: 237 HPAILHYT-NKKPWN-YDSMHPLKQEYFKYLDMTPWKGTRPIIDFQTRVITGFKRLLYIT 294 Query: 320 HRYLKGFSNYLFY 332 + N Y Sbjct: 295 GIKKSKYINLKDY 307 >UniRef50_A6LGX5 Glycosyltransferase family 8 n=1 Tax=Parabacteroides distasonis ATCC 8503 RepID=A6LGX5_PARD8 Length = 325 Score = 241 bits (616), Expect = 2e-62, Method: Composition-based stats. Identities = 72/327 (22%), Positives = 132/327 (40%), Gaps = 25/327 (7%) Query: 30 DIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKI 89 DI +D N+L I S+ + N L FH+ ++ D K + Y+ ++ + Sbjct: 3 DIVVASDCNYLHLVSICAVSLFETNSSESLHFHLLSNGIDSADIKNLQTIVEGYRGKLSV 62 Query: 90 YLINGDRLR-SLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSF 148 Y I R R + + Y R KVLY+D DII G+I L N Sbjct: 63 YPIENLRERLMTDVPETISLTSYARLFAGSILPANLDKVLYIDCDIIFNGSIRDLFNTDL 122 Query: 149 PDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLN 208 + V ++ + ++K +++ Y N+G L+I +W ++ + + + L Sbjct: 123 GNCLVGGILDPLISRTYKKEIK----IPMSEPYINAGVLIIPLNRWRSEGMEQKFVDFLV 178 Query: 209 EPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQ-FSLNYQLKESFI-----------NP 256 K+ H DQ ++N + A + ++N SL Y K+ + Sbjct: 179 ANR--GKVHHHDQGIINAVCAGRKKILPPQFNVMSNSLCYPWKDLYKINTPFYDQEEYKK 236 Query: 257 VTNDTIFIHYIG--PTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAK 314 + IH+ G +PW +P + F++ K + +K+ L KPNN + + + Sbjct: 237 GISSPAIIHFTGAIHGRPW-IVGCTHPYANKFLQFKAKTAYKDIPL-KPNNQSAALHRLE 294 Query: 315 HMLKKHRYLKGFSNYLF--YFIEKIKH 339 +L + F Y+ Y++ KH Sbjct: 295 GILYRLLPFSLFKRYMQSVYYLSYFKH 321 >UniRef50_B7AIV0 Putative uncharacterized protein n=1 Tax=Bacteroides eggerthii DSM 20697 RepID=B7AIV0_9BACE Length = 321 Score = 241 bits (615), Expect = 3e-62, Method: Composition-based stats. Identities = 74/327 (22%), Positives = 131/327 (40%), Gaps = 16/327 (4%) Query: 15 VIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRK 74 + + + + + I + + C +IASI N+ + H+ TDY ++ Sbjct: 1 MYNTTNIPSIKTKAIHIVVCINDAYSQHCAATIASIFINNKNEVIKIHVITDYISKKNQS 60 Query: 75 YFDALALQYKTRIKIYLINGDRLRSLPSTK-----NWTHAIYFRFVIADYFINKAPKVLY 129 + +A + +I+ Y N L P K + T Y+R I K Y Sbjct: 61 RLEKIAFNFNQQIQFYTFNNSTLNRWPCFKDGMPPHVTIQTYYRLFIPQILPLNIKKTFY 120 Query: 130 LDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLI 189 LD D++ + N + VA + Q + + A L + YFN+G LL+ Sbjct: 121 LDCDLLVLHPLREFWNTKMQNKGVAAIAD--QWTDYIEAATRLKYRN-DREYFNAGVLLL 177 Query: 190 NTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYN-TQFSLNYQ 248 N + AI + + I + DQDVLN L+ + I +K+N F +N + Sbjct: 178 NLEYLRNHNFTNNAIDFVTKH--ANDIVYHDQDVLNKLIGENRIIMPVKWNVCSFKINDK 235 Query: 249 LKESF---INPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNN 305 + + +N D IH+ P KPW+ + +P + +PWK+ + Sbjct: 236 IPHIYNATMNDARKDPYIIHFFAPIKPWNQDSS-HPYRSYYYYFLQFTPWKHEVKCHYSL 294 Query: 306 SNQLR-YSAKHMLKKHRYLKGFSNYLF 331 N +R + K L+K +Y +Y+ Sbjct: 295 KNTIRTFLIKIGLRKSQYAIAPQSYMK 321 >UniRef50_UPI0001A457E5 hypothetical protein NsubN_08151 n=1 Tax=Neisseria subflava NJ9703 RepID=UPI0001A457E5 Length = 345 Score = 241 bits (615), Expect = 4e-62, Method: Composition-based stats. Identities = 81/327 (24%), Positives = 142/327 (43%), Gaps = 19/327 (5%) Query: 21 KVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALA 80 K E I Y D+N++ G ++ S+L+ N S + FH+ D FD + Sbjct: 16 KREFIKQPKHIVYAADQNYIKHIGTALLSVLQNNT-SPIHFHLLVSGSEGYDFNIFDQIE 74 Query: 81 LQYKT-RIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGT 139 + I +Y +N + +L +T +T A+Y+R I LYLD D++C G Sbjct: 75 TSNQNYAISVYHLNTEYFSTLQTTHYFTIAMYYRMSIPCLLKGITHTALYLDTDVLCLGN 134 Query: 140 IEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQV 199 I+ L + +A V + K+ + G + YFNSG +L N +W + Sbjct: 135 IDDLFEIDISNSLIAAVPDAILYRAYIKQLNQFGFTDT-EPYFNSGVILFNIDKWNDMAI 193 Query: 200 SARAIAMLNEPEIIK-KITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVT 258 + E K++ PDQD+LN+ + + +N +++ K S + Sbjct: 194 DKILSEKMQAVEKQNFKLSCPDQDILNLACIGHVHWLSENFNW---IHWHQKYSELIDNP 250 Query: 259 NDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKN--------TALLKPNNSNQLR 310 N+ +H++G KPWH + +P + + SPW N T L PN + R Sbjct: 251 NNIRLVHFVGHIKPWHQLGF-HPAYDQYFKN---SPWNNGYLEQPLSTWLPFPNPKRKFR 306 Query: 311 YSAKHMLKKHRYLKGFSNYLFYFIEKI 337 +AK + K+ + + ++ Y Y + +I Sbjct: 307 QAAKRLWKQGQKKQAWAYYREYLLRRI 333 >UniRef50_A8ARL4 Putative uncharacterized protein n=2 Tax=Citrobacter RepID=A8ARL4_CITK8 Length = 314 Score = 238 bits (607), Expect = 3e-61, Method: Composition-based stats. Identities = 80/318 (25%), Positives = 136/318 (42%), Gaps = 11/318 (3%) Query: 24 TENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQY 83 + ++IAY TD N+L +SI S++ N L F +F D+D + + + Sbjct: 3 NKTNVINIAYCTDANYLEYVAVSIMSVIMNNPEQSLAFFVFVYDVSDEDIAKLQSTSNKI 62 Query: 84 KTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPL 143 + I I + ++ + + K+ + Y R + +K + +YLDAD +C ++ + Sbjct: 63 QV-ITIDKADIEKYNNDFAIKHLNRSTYMRLAVPRLLKDKVARFIYLDADTLCFDSLSEI 121 Query: 144 INFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARA 203 + D+ V V + K A LG++ YFN+GFL IN A W + +A Sbjct: 122 NSVDI-DNVVCAVSHDSLNIHDNKHARRLGLS--IDHYFNAGFLYINVANWIKHDIEHKA 178 Query: 204 IAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNY-QLKESFINPVTNDTI 262 +L E K + + DQD LN+ + + F D ++N F+ + KE+F Sbjct: 179 NTVL--FEQGKSLPYFDQDALNIAMNGNITFIDNRWNFLFNWFTDEQKENFFYHSDTLPR 236 Query: 263 FIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPN---NSNQLRYSAKHMLKK 319 IH+ G KPW+ Q ++ + +PW+N L R ++ KK Sbjct: 237 IIHFTGGRKPWYKEHTGL-SQQLYVFYHHFTPWRNAELRSYAPRMRPTDYRVYSRQAAKK 295 Query: 320 HRYLKGFSNYLFYFIEKI 337 Y Y Y KI Sbjct: 296 GNYFTAIKWYAKYLKTKI 313 >UniRef50_C2HBB8 Family 8 glycosyltransferase n=13 Tax=Lactobacillales RepID=C2HBB8_ENTFC Length = 300 Score = 236 bits (603), Expect = 8e-61, Method: Composition-based stats. Identities = 55/284 (19%), Positives = 113/284 (39%), Gaps = 18/284 (6%) Query: 24 TENLCLDIAYGTDKNFLFGCGISIASILKY-NEGSRLCFHIFTDYFGDDDRKYFDALAL- 81 + + + F+ S+L N + F++ D + ++ Sbjct: 1 MNKKEIAVVASCNTKFVPHLAALFVSVLDNCNPSKFVRFYVIDDDIDFESKQLLRFSVKN 60 Query: 82 -QYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFIN-KAPKVLYLDADIICQGT 139 + + ++ IN + ++ + Y+R I + F + ++LY+D D+I Sbjct: 61 ARMNSDVEFLKINKEFFTNVVISDRIPETAYYRIAIPELFRGTEVERILYMDCDMIALQD 120 Query: 140 IEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQV 199 I L F D VA V G + +R + + + YFNSG +LIN +W + + Sbjct: 121 ISKLWRLDFGDSIVAAVEDAG----FHQRLEKMEIPAKSMRYFNSGLMLINVKKWLDENI 176 Query: 200 SARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLK-------ES 252 + + + + +K+ DQD LN +L D+ + ++N Q + + K E Sbjct: 177 TQKVLDFIE--HNPEKLRFHDQDALNAILHDRWLPLHPRWNAQGYIMAKAKKHPTAAGER 234 Query: 253 FINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWK 296 N+ IH+ G KPW ++ P + + + + ++ Sbjct: 235 EYEETRNNPYIIHFSGHVKPWSK-DFEGPTKKYYEKYAGMTAFR 277 >UniRef50_C2HBB9 Family 8 glycosyltransferase n=35 Tax=Lactobacillales RepID=C2HBB9_ENTFC Length = 305 Score = 236 bits (603), Expect = 8e-61, Method: Composition-based stats. Identities = 62/283 (21%), Positives = 118/283 (41%), Gaps = 19/283 (6%) Query: 22 VETENLCLDIAYGTDKNFLFGCGISIASILKY-NEGSRLCFHIFTDYFGDDDRKYFDALA 80 +E + + +D+N+ + IA+ L+ N+ R+ F++ D + ++ + Sbjct: 21 MEKRYGVVPVVTASDENYAPYLSVMIATALENCNKARRIKFYVIDDGLSEYSKQGLEETV 80 Query: 81 LQY--KTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYF-INKAPKVLYLDADIICQ 137 +Y I+ + D + + T Y R + + KVLYLD+D++ Sbjct: 81 NKYSSNASIQFLTVEKDIYEDFLVSDHITTTAYLRISLPNLLAKEDYKKVLYLDSDVLVL 140 Query: 138 GTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQ 197 I L + + ++ GQ K LG+ YFNSG ++I+ QW + Sbjct: 141 DDIVKLYDEPLNGKTIGAIIDPGQV----KALERLGIDS-DDLYFNSGVMVIDIDQWNKK 195 Query: 198 QVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLK------- 250 +++ + I L+E +I + DQD LN +L + K+N Q SL ++ Sbjct: 196 EITEKTIHYLSEN--GDRIIYHDQDALNAVLYEDWEQLHPKWNMQTSLIFERHPAPNEKY 253 Query: 251 ESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNAS 293 E +H+ G KPW+ D+P + +++ S Sbjct: 254 ERLYKEGNEKPSIVHFTGHDKPWNTLK-DHPYTNLYLKKLAHS 295 >UniRef50_C3PWZ8 Lipopolysaccharide 1,2-glucosyltransferase n=1 Tax=Bacteroides sp. 9_1_42FAA RepID=C3PWZ8_9BACE Length = 315 Score = 236 bits (603), Expect = 9e-61, Method: Composition-based stats. Identities = 80/319 (25%), Positives = 143/319 (44%), Gaps = 21/319 (6%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 + IA D F+ C ++I SIL+ N+ + HI + + +D +A +Y T I Sbjct: 1 MHIALTIDSKFVRYCAVTIVSILENNDPKDIMLHIVSGHLPKEDVLTLSQVAEKYGTSIA 60 Query: 89 IYLINGDRLRSL---PSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLIN 145 Y I ++L++ + + +++R V+A + +V+YLD+D + G+++ L + Sbjct: 61 FYYIPHEKLQNYEVKWQKQRLSMVVFYRCVLASILPSTISRVIYLDSDTLVLGSLKELWD 120 Query: 146 FSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIA 205 + +A V + L A + Y N G LL+N A W + + I Sbjct: 121 TNLNQLALAGVQDTVSPNP--SYFERLQYAP-SYNYINGGVLLLNLAYWRKHNIEQQCIK 177 Query: 206 MLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQ--FSLNYQLKESFINPVTND--- 260 + +I DQD+LN LL D+ + DIK+N Q F N + P D Sbjct: 178 YYQQY--PDRIILNDQDILNALLYDQKVLIDIKWNVQDDFYRNNRYTSPAWKPSYTDAIL 235 Query: 261 -TIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKK 319 I +HY G KPW + +P+ F + +P+ ++A K ++ R+ H+L Sbjct: 236 HPIILHYSGR-KPW-AYHAMHPLRHLFFHYQRLTPYDDSAKQKKISTRIYRFI--HLLP- 290 Query: 320 HRYLKGFSNYLFYFIEKIK 338 Y+ G + ++KI+ Sbjct: 291 --YILGLKPKKYVNLKKIR 307 >UniRef50_C0QZN2 Glycosyl transferase, family 8 n=3 Tax=Brachyspira RepID=C0QZN2_BRAHW Length = 339 Score = 234 bits (598), Expect = 3e-60, Method: Composition-based stats. Identities = 72/308 (23%), Positives = 125/308 (40%), Gaps = 21/308 (6%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNE-GSRLCFHIFTDYFGDDDRKYFDALALQYKTRI 87 ++I +D N+ G +IASILK + ++ FH+ ++++ +L + I Sbjct: 1 MNICLASDNNYAPYMGTAIASILKNSSEDEKIIFHLIDGGITKENKEKIISLKNIKECEI 60 Query: 88 KIYLINGD----RLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPL 143 Y + +++ A+++R IA + K+LYLD+D+I G+++ L Sbjct: 61 NFYTPDIKMYDGWFEKTSCKAHFSAAMFYRLSIASIIPSNIDKILYLDSDLIATGSLKEL 120 Query: 144 INFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARA 203 + ++ + + GI YFNSG LLIN W + + Sbjct: 121 FLMDIENHYAIVI------KHSTNEKNKWSIDGIND-YFNSGVLLINNKLWIKNNIEDQF 173 Query: 204 IAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIF 263 N K DQDVLN +L K+ +AD++YN Y E+ I + I Sbjct: 174 NKFYNNNY---KTCFGDQDVLNNVLIGKVKYADMRYNVYAEKGYYNTENDI----ENPII 226 Query: 264 IHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKH--MLKKHR 321 IHY+ P KPW + F +PW + + + + + K Sbjct: 227 IHYLSPEKPWKENCRGTLFIDEFWRYYQYTPWFRDEPITAFQTILKQKFYDYDDVRLKGN 286 Query: 322 YLKGFSNY 329 ++K F Y Sbjct: 287 WIKLFGIY 294 >UniRef50_B4RJG6 LgtC n=29 Tax=Neisseria RepID=B4RJG6_NEIG2 Length = 307 Score = 232 bits (591), Expect = 2e-59, Method: Composition-based stats. Identities = 67/326 (20%), Positives = 126/326 (38%), Gaps = 37/326 (11%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 +DI + D N+ ++ S+ + + + FH+ +++R A I+ Sbjct: 1 MDIVFAADDNYAAYLCVAAKSVEAAHPDTEIRFHVLDAGISEENRAAVAANLRGGGGNIR 60 Query: 89 IYLINGDRLRSLPST-KNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 +N + P ++ + Y R + +Y KVLYLD D++ + ++PL + Sbjct: 61 FIDVNPEDFAGFPLNIRHISITTYARLKLGEYI-ADCDKVLYLDTDVLVRDGLKPLWDTD 119 Query: 148 FPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAML 207 + V + + E +G+A + YFN+G LLIN +W + + + Sbjct: 120 LGGNWVGACID-LFVERQEGYKQKIGMAD-GEYYFNAGVLLINLKKWRRHDIFKMSCEWV 177 Query: 208 NEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFI--- 264 + + + + DQD+LN L + +A+ ++N + NY + D +++ Sbjct: 178 EQYK--DVMQYQDQDILNGLFKGGVCYANSRFNFMPT-NYAFMANGFASRHTDPLYLDRT 234 Query: 265 ---------HYIGPTKPWHD----WAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRY 311 HY G KPWH W + A W+ + P Sbjct: 235 NTAMPVAVSHYCGSAKPWHRDCTVWGAERFTELAGSLTTVPEEWRGKLAVPPT------- 287 Query: 312 SAKHMLKKHRYLKGFSNYLFYFIEKI 337 KHML++ R F+ KI Sbjct: 288 --KHMLQRWRKKLS-----ARFLRKI 306 >UniRef50_B3Q568 Putative glycosyltransferase protein n=4 Tax=Rhizobium etli RepID=B3Q568_RHIE6 Length = 331 Score = 231 bits (589), Expect = 3e-59, Method: Composition-based stats. Identities = 52/271 (19%), Positives = 102/271 (37%), Gaps = 13/271 (4%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGS-RLCFHIFTDYFGDDDRKYFDALALQYKTRIKI 89 I + D + ++ S+ + N+ L H+ + G++ ++ I+ Sbjct: 22 IVFAVDAAYAVPLATALRSVAENNQSVWPLDIHVIHEGIGEETKRLILESLPANSAIIQW 81 Query: 90 YLINGDRL-RSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSF 148 + I + + + R ++ + + LYLD DI+ ++E L N Sbjct: 82 HPIATLSFASGFSTRPGVSKMTFARILLPQFLPQTCDRALYLDGDILVLTSLEQLWNTDL 141 Query: 149 PDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLN 208 + + V + + G + K YFN+G LLI+ A+W +++S R++ L Sbjct: 142 GEAVIGAVPDYWLDNPAGSGPGARG-GALVKRYFNAGILLIDLAKWRNERISERSLDYL- 199 Query: 209 EPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIG 268 + + DQD LN+ K D +N QF + + +H++ Sbjct: 200 --DRFPTTEYSDQDALNVACDGKWKILDRAWNFQFEPRQAIAGI---ALEQKAAIVHFVT 254 Query: 269 PTKPWH--DWAWDYPVSQAFMEA--KNASPW 295 KPW + + AF +PW Sbjct: 255 NVKPWKSGSLSPNVAFYDAFRSRTCFALTPW 285 >UniRef50_A7A7B4 Putative uncharacterized protein n=2 Tax=Bifidobacterium adolescentis L2-32 RepID=A7A7B4_BIFAD Length = 1009 Score = 230 bits (587), Expect = 6e-59, Method: Composition-based stats. Identities = 64/335 (19%), Positives = 123/335 (36%), Gaps = 28/335 (8%) Query: 24 TENLCLDIAYGTDKNFLFGCGISIASILKY-NEGSRLCFHIFTDYFGDDDRKYFDALALQ 82 T+ + + + D N++ ++ S +K + + D ++ Q Sbjct: 660 TDKPIVPVVFAADDNYVPQLTTTVYSAMKNADPHYFYDVTVLQRNIAWDKQERLRGFFKQ 719 Query: 83 Y-KTRIKIYLINGDRLRSLPSTK--NWTHAIYFRFVIADYFINKAPKVLYLDADIICQGT 139 + ++ ++ + ST + + Y+RF+I KVLYLD+DII G Sbjct: 720 FPNMNLRFTNVDRELAGYDLSTNNAHISVETYYRFLIQKVLPF-YDKVLYLDSDIIINGD 778 Query: 140 IEPLINFSFPDDKVAMVVT-EGQADWWEKRAHSLGVAGI------AKGYFNSGFLLINTA 192 I L N + + + A+ K +G A YF +G L++NT Sbjct: 779 IAKLYNIDLQGKMLGAIRDIDFLANLNVKHGKRMGYAQTVLKMKNPYDYFQAGVLVLNTK 838 Query: 193 QWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKES 252 + + + P+ I + DQDVLN +++ ++N ++ Sbjct: 839 AMREHYTIKQWLTYASNPDFI----YNDQDVLNAHCEGNVLYLPWEWNVVHDCGGRVGNL 894 Query: 253 FINP----------VTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLK 302 F+ ND +HY G KPW D D + + + +P+ L + Sbjct: 895 FVQAPNDIYDAYMKSRNDPQIVHYAGFQKPWTDPDCD--FASMYWKYARETPFYERLLKR 952 Query: 303 PNNSNQLRYSAKHMLKKHRYLKGFSNYLFYFIEKI 337 +N+ A + KH G N + ++ + Sbjct: 953 VVKANESEIPAGVLRPKHERAVGEDNPIRKIVDPL 987 >UniRef50_B3CA80 Putative uncharacterized protein n=1 Tax=Bacteroides intestinalis DSM 17393 RepID=B3CA80_9BACE Length = 301 Score = 230 bits (586), Expect = 8e-59, Method: Composition-based stats. Identities = 66/299 (22%), Positives = 116/299 (38%), Gaps = 20/299 (6%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIF-TDYFGDDDRKYFDALALQYKTRI 87 +DI D+N++ CG+ +AS+ + + HI + +K +++ + Sbjct: 2 IDIVCSIDENYIEYCGVMLASLFVHTPDEKFRVHIICSSKVEKAGKKRLKVFCEKHQAEV 61 Query: 88 KIYLINGDRLRSLP--STKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLIN 145 Y ++ ++ P + + A Y R +++ + K+LYLD D+I +I+ L Sbjct: 62 YFYDVDYSLIKDFPIRKQDHLSLAAYLRLFMSELIPSNINKILYLDCDLIVVDSIKELWE 121 Query: 146 FSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIA 205 + + VA V D + YFNSG +LIN +W ++ + Sbjct: 122 KNIDNIAVAAVEERSPFDTESPVTLK---YPVEYSYFNSGVMLINLQKWREKKFVEACKS 178 Query: 206 MLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKE------SFINPVTN 259 + + I DQDVLN LL + F I++N Y E + Sbjct: 179 YIASN--YENIKLHDQDVLNALLYKEKQFISIRWNLMDFFLYASPEVQPERKKDWDDALK 236 Query: 260 DTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLK 318 IH+ G KPW + D P ++ W NN N + Y + +L Sbjct: 237 SPAIIHFTGKRKPWM-YNCDSPFRDQYIRFAKQQGWHVI-----NNKNAIHYFFRKILY 289 >UniRef50_C0Z4I4 Putative uncharacterized protein gspA n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0Z4I4_BREBN Length = 264 Score = 228 bits (583), Expect = 1e-58, Method: Composition-based stats. Identities = 58/268 (21%), Positives = 108/268 (40%), Gaps = 14/268 (5%) Query: 27 LCLDIAYGTDKNFLFGCGISIASILKYN-EGSRLCFHIFTDYFGDDDRKYFDALALQYKT 85 + I + F + + S+ + + + H+ +++ ++ Sbjct: 2 GTIHIVTAVNDGFAIHLAVMLYSLFENKVSKNPVIVHVIDSQVSGENKSILTKTVKRFHA 61 Query: 86 RIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLIN 145 +IK I+ + T Y R I D + KV+YLD+DI+ + I PL N Sbjct: 62 QIKYVTIDPTLYDGFLVRDHLTQETYHRISIPDLLDKEVEKVIYLDSDIVIKKDITPLWN 121 Query: 146 FSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIA 205 +A V+ Q K H+ YFN+G L++N +W ++ + + Sbjct: 122 TKVDQYYLAAVMDSWQG--LNKLRHADLAIPDDCDYFNAGVLVMNLKKWREHNITKKIMD 179 Query: 206 MLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIH 265 + + + I I +P QD +N +L D + D K+N YQ K + + + D IH Sbjct: 180 YMKKNQGI--IRYPSQDPMNAILHDNWLQLDTKWN------YQSKHLYKSNLRIDPAIIH 231 Query: 266 YIGPT-KPWHDWAWDYPVSQAFMEAKNA 292 Y G KPW + +P+ + + + Sbjct: 232 YTGEDSKPW--LSKKHPLREEYFKYLKK 257 >UniRef50_D2RIJ7 Glycosyl transferase family 8 n=1 Tax=Acidaminococcus fermentans DSM 20731 RepID=D2RIJ7_ACIFE Length = 330 Score = 228 bits (582), Expect = 2e-58, Method: Composition-based stats. Identities = 76/342 (22%), Positives = 138/342 (40%), Gaps = 29/342 (8%) Query: 5 FFQETEF--LNSVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFH 62 +FQ + +Y + L I + F G+ + SI + N+ L FH Sbjct: 9 YFQTHKLYLTKDSFEYMTAENKKKDILHICCNVNDLFFKPAGVLLTSICENNKDLALNFH 68 Query: 63 IFTDYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSLP-STKNWTHAIYFRFVIADYFI 121 +F D D++++ A +Y +Y ++ ++ K ++ Y R V+ Sbjct: 69 VFVDSCSDENKENLRKTAEKYGCNAYLYKMDMSIYQNFHIKVKRFSRVTYIRIVMPWVLR 128 Query: 122 NKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGY 181 N + LYLDAD++C ++ N+ D V +V + +R L + G Y Sbjct: 129 NVTNRYLYLDADMVCVKSLRVFFNYDLKDKAVGALVYDTP-----ERIAFLKMKG--NVY 181 Query: 182 FNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNT 241 F+ G + IN +W Q+V+ R + + + QD++N++L + Sbjct: 182 FSDGLMWINVDEWIKQRVTERVFSY--QGADPARFKGQTQDLMNLVLDGNVQPIP----- 234 Query: 242 QFSLNYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALL 301 +L + + + F + D I IHY G KPW + + + + SPW + Sbjct: 235 --ALFHHMDKDF----SVDGILIHYSGRDKPWEIVLDE--DDELWRHYLDISPWPSMPNP 286 Query: 302 KPNNSNQLRYSAKHML----KKHRYLKGFSNYLFYFIEKIKH 339 P +S K + KK +LK +Y I KI++ Sbjct: 287 MPPKRPIYYHSFKKLAQVYSKKGNHLKELECLFWYGILKIRY 328 >UniRef50_B7GNT4 Glycosyl transferase, family 8 n=5 Tax=Bifidobacterium RepID=B7GNT4_BIFLI Length = 1013 Score = 228 bits (582), Expect = 2e-58, Method: Composition-based stats. Identities = 63/335 (18%), Positives = 125/335 (37%), Gaps = 28/335 (8%) Query: 24 TENLCLDIAYGTDKNFLFGCGISIASILKY-NEGSRLCFHIFTDYFGDDDRKYFDALALQ 82 + + + + D N++ ++ S +K + + D ++ Q Sbjct: 664 FDKPIVPVVFAADDNYVPQLTTTVYSAMKNADPSYFYDVVVLQQDIAGDKQERMWRFFEQ 723 Query: 83 Y-KTRIKIYLINGDRLRSLPSTK--NWTHAIYFRFVIADYFINKAPKVLYLDADIICQGT 139 + ++ + + ST + + Y+RF+I KVLYLD+DII G Sbjct: 724 FPNMSLRFLNVKRELSGYDLSTNNAHISIETYYRFLIQQLLP-NYDKVLYLDSDIIIVGD 782 Query: 140 IEPLINFSFPDDKVAMVVT-----EGQADWWEKRAHSLGVAGIAKG--YFNSGFLLINTA 192 I L + D+ + V ++ +++ V + YF +G L++NT Sbjct: 783 IAKLYDIDLQDNLLGAVRDIDFLGNLNVKHGKRMSYAKDVLKMKNPYDYFQAGVLVLNTK 842 Query: 193 QWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKES 252 + + + + P I + DQDVLN K+++ ++N ++ Sbjct: 843 GMRNRYSIEQWLTYASNPNYI----YNDQDVLNAYCEGKVLYLPWEWNVVHDCGGRVGNL 898 Query: 253 FINP----------VTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLK 302 F ++ IHY G KPW D DY S + +P+ + + Sbjct: 899 FTQAPNDVYDAYVKSRSNPQIIHYAGYQKPWVDPDCDY--SSIYWRYARETPFYERLIKR 956 Query: 303 PNNSNQLRYSAKHMLKKHRYLKGFSNYLFYFIEKI 337 +N+ + + L KH G N + F++ + Sbjct: 957 VVLANEPQIPEEVFLPKHERAVGEDNPIRKFVDPL 991 >UniRef50_C4VEI8 General stress protein A n=24 Tax=Enterococcus RepID=C4VEI8_ENTFA Length = 303 Score = 228 bits (581), Expect = 3e-58, Method: Composition-based stats. Identities = 58/284 (20%), Positives = 113/284 (39%), Gaps = 18/284 (6%) Query: 24 TENLCLDIAYGTDKNFLFGCGISIASILKYNEGSR-LCFHIFTDYFGDDDRKYFDALAL- 81 L I + NF+ SIL+ + + + F++ D + ++ Sbjct: 5 ENRKELAIVSCCNTNFVPHLAAMFVSILENSPSAAAVHFYVIDDNINFESKQLLYFTIKH 64 Query: 82 -QYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFI-NKAPKVLYLDADIICQGT 139 Q + + IN +++ +++ Y+R I + F ++ ++LY+D D+I Sbjct: 65 TQLNAELTFFKINPHFFKNVVTSERIPKTAYYRIAIPELFRGSQIERLLYMDCDMIALDD 124 Query: 140 IEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQV 199 + L ++ +A V G + +R + + + YFNSG LLI+ +W V Sbjct: 125 VAKLWTVDLGENIIAAVEDAG----FHQRLEKMAIPAESMCYFNSGLLLIDVKKWLNLDV 180 Query: 200 SARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLK-------ES 252 + + + + E K+ DQD LN +L D+ K+N Q + + K E Sbjct: 181 TTKVLRFIEEN--PDKLRFHDQDALNAVLHDRWTLLHPKWNAQGYILSKAKKHPTIYGEK 238 Query: 253 FINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWK 296 IH+ G KPW Y + + + N + ++ Sbjct: 239 QYEETRRAPSIIHFTGHVKPWTKEFQWY-TKRYYDQYANRTAFR 281 >UniRef50_Q116W1 Glycosyl transferase, family 8 n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q116W1_TRIEI Length = 278 Score = 227 bits (580), Expect = 4e-58, Method: Composition-based stats. Identities = 70/287 (24%), Positives = 128/287 (44%), Gaps = 19/287 (6%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 +++ + D+N+ G++I S+L N S HI T++ + ++ D L+ YK Sbjct: 2 MNLLFCFDQNYQQHFGVAITSVLLNNLSSHFDVHIITNFMEEKLKQKLDTLSKNYKCSFH 61 Query: 89 IYLING-DRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 +Y+IN D++ L + + ++A Y+R ++A+ KVLYLD+D++ +E L N Sbjct: 62 LYIINNLDKISKLKVSDHVSNATYYRLIMAEILPKHIDKVLYLDSDVVVISPLEELYNID 121 Query: 148 FPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAML 207 + +A G + FNSG +++N +W +Q+S + I Sbjct: 122 LENYFIAASGFSGTLVKSKG--------------FNSGVMVVNLEKWRNEQISTKVIDFA 167 Query: 208 NEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLN-YQLKESFINPVTNDTIFIHY 266 + K+ + DQ LN ++ + D K+N Q L+ ++++ N + IHY Sbjct: 168 TKNR--DKLPYHDQSALNRVIKQNYLIIDRKWNFQVDLSPRKIQKPDDNIALKNARIIHY 225 Query: 267 IGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSA 313 IG +KPW+ W D + S W + L A Sbjct: 226 IGSSKPWYFWISD-QRKNIYELYLKKSLWSTSKLQMIFQQTVYFRKA 271 >UniRef50_B2UPJ4 Glycosyl transferase family 8 n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UPJ4_AKKM8 Length = 315 Score = 227 bits (579), Expect = 5e-58, Method: Composition-based stats. Identities = 75/316 (23%), Positives = 125/316 (39%), Gaps = 23/316 (7%) Query: 29 LDIAYGTDKNFLFGCGISIASILKY-NEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRI 87 ++I Y TD N G G+SI S+++ G +I T D+ F +L Y + Sbjct: 1 MNIVYATDDNGALGTGVSIVSLMENLPPGVHADIYIMTGGLSGDNTARFHSLQQGYNLHL 60 Query: 88 KIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 + D+ P W+ A Y+R +A + LY+D D I I P+ Sbjct: 61 HFIDM-KDKYTDFPVGSKWSAATYYRLGLAGELPATVERALYVDIDTIFNRDISPMYESE 119 Query: 148 FPDDKVAMVV-TEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAM 206 F D +A V TE ++ R G Y N+G +L + + + ++ ++ Sbjct: 120 FGDCLIAGVFTTEDLSEESFSRWKREMNLGRDSIYINAGVILYHIGRIREECFESQVLSW 179 Query: 207 LNEPEIIKKITHPDQDVLNMLLADKLIFADIKYN----TQFSLNYQLKESFIN------- 255 I +++ DQD+LN+ +++ +N +S+ ++ SF N Sbjct: 180 AKNN--IHRLSWQDQDILNVCYQQRILLLHPMWNICDGAIWSIRWEGVTSFRNNPLKPAD 237 Query: 256 --PVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNT--ALLKPNNSNQLRY 311 IHY G KPWH + F + SPWK+ K N+ ++ Sbjct: 238 LLEAARRPGIIHYWGHPKPWHPNSIRQDYG-LFYKYWKKSPWKDDIRDFRKQNDPGRMFI 296 Query: 312 SAKHML--KKHRYLKG 325 S L K R L+G Sbjct: 297 SKMRCLLGKGKRLLQG 312 >UniRef50_UPI000196958D hypothetical protein BACCELL_01586 n=1 Tax=Bacteroides cellulosilyticus DSM 14838 RepID=UPI000196958D Length = 305 Score = 226 bits (577), Expect = 8e-58, Method: Composition-based stats. Identities = 66/309 (21%), Positives = 110/309 (35%), Gaps = 15/309 (4%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 ++I D ++ C + + S + N G ++ T+ DD + + Y Sbjct: 1 MNIVCAADSGYVQHCSVMLISFFENNPGEEHAVYLLTEGLDLDDLDFIQKIVHSYNGHFF 60 Query: 89 IYLINGDRLRSLP--STKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINF 146 ++ L P ST + + A Y R +AD KVLYLD DII +I+ L Sbjct: 61 YCQVDFKFLEKCPIKSTDHLSIATYNRLFMADLLPADVNKVLYLDCDIIVNQSIKELWET 120 Query: 147 SFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAM 206 D+ V E + + GYFN+G LL+N W ++ I Sbjct: 121 PLRDNFVVAAFEERGCC--AEDVYERLDYDSKYGYFNAGVLLVNLDYWRTHNMTQAFIEY 178 Query: 207 LNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQ------LKESFINPVTND 260 + +K+ DQDVLN DK + + +N +F Y + + + Sbjct: 179 IE--HNFEKLRAHDQDVLNAFFYDKSVHISLAWNVEFIFYYYGIIKKFGFDRDLRFILRH 236 Query: 261 TIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKKH 320 +H+ KPW + + +P + K L + L Sbjct: 237 PKILHFTWKPKPW-ETSCQHPFRINYYRYLKKI--KKNPLSFRDTLRALWDKYYFCFLIK 293 Query: 321 RYLKGFSNY 329 +KG Y Sbjct: 294 WKIKGHKYY 302 >UniRef50_A5ZFA9 Putative uncharacterized protein n=1 Tax=Bacteroides caccae ATCC 43185 RepID=A5ZFA9_9BACE Length = 310 Score = 225 bits (574), Expect = 2e-57, Method: Composition-based stats. Identities = 68/274 (24%), Positives = 117/274 (42%), Gaps = 18/274 (6%) Query: 30 DIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKI 89 +I G D + CG + S+ + N G + ++ + + + L Y+ +I Sbjct: 3 NIICGIDDQYCQHCGAMLLSLFESNPG-AITIYVLSLELSEKSKNLLKELVDSYQKQIHF 61 Query: 90 YLINGDRLRSLP--STKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 I + + + P ST + A Y R I + K LY+D+DII + I L + Sbjct: 62 IDIPSELVLNFPMKSTDYPSLATYLRLFIPQLLPFEVDKALYVDSDIIFKKDISALYDSD 121 Query: 148 FPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAML 207 + +A G D + A LG + YFN+GF+L+N + +A+A + Sbjct: 122 ITNYALA-----GMEDAPNQNALRLGFPE-SDLYFNAGFVLLNVKYLRDMDFTNKAMAYI 175 Query: 208 NEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLK---ESFINPVTND---T 261 + +KI DQDVLN LL K++F IK+N + + ++ + + Sbjct: 176 RDCR--EKIVLHDQDVLNALLHGKVLFVPIKWNMLDCFYRKPPFIAKKYMRELHENLDSP 233 Query: 262 IFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPW 295 IH+ GP KPWH +P+ + + W Sbjct: 234 AVIHFSGPLKPWH-HGCPHPLRKEYFNYSRKLSW 266 >UniRef50_B3Z5I6 Glycosyltransferase family 8 n=2 Tax=Bacillus cereus group RepID=B3Z5I6_BACCE Length = 317 Score = 224 bits (571), Expect = 4e-57, Method: Composition-based stats. Identities = 66/298 (22%), Positives = 126/298 (42%), Gaps = 28/298 (9%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEG-SRLCFHIFTDYFGDDDRKYFDALALQYKTRI 87 L++ Y +D N+ G+S+ S+L+ N+ + L + + ++K +++ +Y I Sbjct: 3 LNVVYSSDDNYAQHVGVSLLSLLQNNQHFNNLNIFLIENNISSYNKKNLNSVCKKYNKTI 62 Query: 88 KIYLINGDRLRSLPSTKNWTHAI--YFRFVIADYFINKAPKVLYLDADIICQGTIEPLIN 145 + N L L N + AI Y R +A + K++YLD D I ++ L + Sbjct: 63 QYINFN-VLLERLELNINDSIAINSYARLFLAGIIPEELDKIIYLDCDSIINSSLSDLWD 121 Query: 146 FSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIA 205 + VA V + ++GY N+G LLIN +W + + + + Sbjct: 122 TDVTEYFVAGVCDTVSNQTKLRID-----MDKSEGYINAGMLLINLKKWREENIEQKFME 176 Query: 206 MLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQ--------------FSLNYQLKE 251 + + + + H DQ +N +L DK+++ K+N + L E Sbjct: 177 FIKKKD--GNVFHHDQGTINGVLKDKILYLHPKFNAMTPFFTMSRKEIMSYYELENYYNE 234 Query: 252 SFINPVTNDTIFIHYIGP--TKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSN 307 I+ + +FIHY +PW + +P++ + + +PWK+T L K Sbjct: 235 IEIDEAVKNPVFIHYTPAFVNRPWIE-GCKHPLTSLYKSYLDMTPWKSTDLWKDRRGK 291 >UniRef50_A4ECW2 Putative uncharacterized protein n=1 Tax=Collinsella aerofaciens ATCC 25986 RepID=A4ECW2_9ACTN Length = 328 Score = 223 bits (569), Expect = 6e-57, Method: Composition-based stats. Identities = 64/287 (22%), Positives = 117/287 (40%), Gaps = 23/287 (8%) Query: 27 LCLDIAYGTDKNFLFGCGISIASILKYNEG-SRLCFHIFTDYFGDDDRKYFDALALQYKT 85 + +++ Y D NF+ +I S++ + G + FH+F++ +D+++ + +Y Sbjct: 2 VIMNLLYTVDNNFVPQLAANICSVVSNHSGIQDITFHVFSNGITEDNQRLLQEMVTEYNQ 61 Query: 86 RIKIYLING--DRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPL 143 + Y I+ D L T W + R ++A + N+ +V+YLD D I G I L Sbjct: 62 NLVFYDISNFKDALGFDFDTSGWNEIVLARLLMAHFLPNEIERVIYLDGDTIVLGDIALL 121 Query: 144 INFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARA 203 N V MV R + L + G Y N+G LL++ QW + + Sbjct: 122 WNQDLKGCVVGMVPEPTVGPS---RLNDLDLNGCL--YHNAGVLLVDLKQWRSTCCEDQL 176 Query: 204 IAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQ------------LKE 251 + ++ DQD LN +L DK+ +N +Y E Sbjct: 177 LDYCERRS--GRLFANDQDALNAVLKDKICSLSPAFNYSNIFDYYPFIFLNSLMPGFSDE 234 Query: 252 SFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNT 298 + N + I +HY+G +PW + + + + + WK+ Sbjct: 235 NSFNTARSKPIVVHYLGEERPWRR-GNTHRFNNEYHFYLSETFWKDA 280 >UniRef50_C5ZV11 Glycosyl transferase n=2 Tax=Helicobacter canadensis MIT 98-5491 RepID=C5ZV11_9HELI Length = 397 Score = 223 bits (568), Expect = 9e-57, Method: Composition-based stats. Identities = 59/328 (17%), Positives = 118/328 (35%), Gaps = 33/328 (10%) Query: 30 DIAYGTDKNFLFGCGISIASILKYNEGSR---LCFHIFTDYFGDDDRKYFDALA----LQ 82 ++ ++N++ + I SI++ + S FH+ D ++ K + L Sbjct: 3 NVVLNLNENYVPYAAVLITSIIQNTQSSGGGGYNFHLLMDSISQENTKNLENLISELSKI 62 Query: 83 YKTRIKIYLINGDRLRSLPS-TKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIE 141 Y + IY+++ R T N + Y+R I + +YLD D+I G + Sbjct: 63 YPCTLTIYILDDQLFREYSMPTLNGNYLAYYRLKIGSALPLSIKRCVYLDVDMIVLGDLR 122 Query: 142 PLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSA 201 L +V+ ++ + + I YFNSG LL++ W + + Sbjct: 123 ELFEVDLQGKICGVVMEHHSQKIYKPKNQAYKPINITGSYFNSGMLLVDLDLWRQENIED 182 Query: 202 RAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQ------------- 248 RA + + DQD+LN++L+ K I++N + Y+ Sbjct: 183 RAFEIGKNYH----YSFHDQDILNIVLSGKTHKVGIEWNLMVCVYYRAICKDEKGRDKLP 238 Query: 249 LKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQ 308 N + +HY TKPW++ F++ W + + + Sbjct: 239 YYRKDFNSALRNPKILHYFTHTKPWNNAKIYLDYHNKFLDQY----WWD----MVDQTPI 290 Query: 309 LRYSAKHMLKKHRYLKGFSNYLFYFIEK 336 + + + F + Y + + Sbjct: 291 FKEKLLQLKPQADSALAFQCLVGYKLLR 318 >UniRef50_B9KVD4 Glycosyl transferase, family 8 n=2 Tax=Rhodobacter sphaeroides RepID=B9KVD4_RHOSK Length = 334 Score = 222 bits (565), Expect = 2e-56, Method: Composition-based stats. Identities = 62/278 (22%), Positives = 108/278 (38%), Gaps = 12/278 (4%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFT-DYFGDDDRKYFDALALQYKTRI 87 + + + D+ F ++ S G L H+ T D +++ ++ ALA I Sbjct: 1 MHLLFCADRPFFRHAAVAAVSAASATRG-PLQVHLLTCDSCPEEEARFRVALAPFAHVGI 59 Query: 88 KIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 ++ + RL L ++ + A Y RF+ + +VLYLD D+I + L+ Sbjct: 60 SVHRVPAARLEGLFVDRHLSPAAYLRFLAPEVLPEAVQRVLYLDCDLIVLDDVAQLLRLD 119 Query: 148 FPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAML 207 VA G D + + + Y NSG LL++ +W +S + + Sbjct: 120 LQGRAVAAAPDLGWKDAAQAARFRTLGIPLDRPYVNSGVLLMDLGRWRRDGLSQKLFDYV 179 Query: 208 NEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSL-----NYQLKESFINPV--TND 260 + + DQD LN +LAD + D ++N Q L L E V D Sbjct: 180 ARHGSL--LLRHDQDALNAVLADDIHLLDRRWNLQVLLLSPWAKRALPEDRQATVAARRD 237 Query: 261 TIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNT 298 +H+ KPW+ W + + + +PW Sbjct: 238 PAILHFSTADKPWNFRVWTR-RRELYFRFRARTPWSRA 274 >UniRef50_C5ELK9 Glycosyl transferase n=3 Tax=Clostridiales RepID=C5ELK9_9FIRM Length = 333 Score = 221 bits (564), Expect = 2e-56, Method: Composition-based stats. Identities = 61/332 (18%), Positives = 117/332 (35%), Gaps = 30/332 (9%) Query: 22 VETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSR-LCFHIFTDYFGDDDRKYFDALA 80 +E +I Y ++ + S+ S+L N R + +I + + ++ +A Sbjct: 1 MEWNEETANIIYASNDGYAGHLAASMYSLLDNNRNVRNMDIYILSAQMCQEYKERLAGMA 60 Query: 81 LQYKTRIKIYLING--DRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQG 138 + + + + R T+ + + R K LYLD D I Sbjct: 61 EAFHRTLHVVELGDLKQRFDFDIDTRGFDISAMGRLFAPQVLPGTVKKALYLDCDTIVCK 120 Query: 139 TIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQ 198 +I PL D V MV+ + +++ S+G+ G Y+NSG LL+ +W + Sbjct: 121 SIRPLYETELGDAVVGMVM---EPTVYKEMKESIGM-GKDDPYYNSGVLLMALDRWRQED 176 Query: 199 VSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNY----------- 247 V + + ++ DQD +N L ++ +KYN + Y Sbjct: 177 VLQKLLDFYKSCH--GRLFACDQDTINGALKGRIKTLPVKYNYFTNYRYFRYSTLCSMCA 234 Query: 248 ---QLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPN 304 ++ E IHY+G +PW ++ + + +PWK+T Sbjct: 235 AYREIGEEAYLEARRSPAIIHYLGDERPWIAGNHNH-FKKLYEYYLAKTPWKDTP----- 288 Query: 305 NSNQLRYSAKHMLKKHRYLKGFSNYLFYFIEK 336 + HM L ++ + Sbjct: 289 -KQTGKERYMHMWWLFNRLTWLCPPFRLWVSR 319 >UniRef50_C1QBZ8 LPS:glycosyltransferase n=1 Tax=Brachyspira murdochii DSM 12563 RepID=C1QBZ8_9SPIR Length = 336 Score = 221 bits (563), Expect = 3e-56, Method: Composition-based stats. Identities = 74/303 (24%), Positives = 121/303 (39%), Gaps = 20/303 (6%) Query: 30 DIAYGTDKNFLFGCGISIASILKY-NEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 +I +D+N+ +++ASILK N+ + FHI D+ + L IK Sbjct: 5 NICLCSDENYAKYMAVTMASILKNTNDDENIIFHIIESNIKDETKNKLIYLKKIKNCEIK 64 Query: 89 IYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSF 148 Y + ++ + A Y R +I + I A KVLYLD+DII G+++ L + Sbjct: 65 FYRVEYNK---------YPLATYLRLLIPE-LIKDADKVLYLDSDIIVNGSLKELFDIDI 114 Query: 149 PDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLN 208 V + K L G + YFN+G +L N +S + + Sbjct: 115 NGYYALAVKDLYVDIY--KEHKELIEIGNNRIYFNAGVVLFNNKSCIDNNISQKFYSYFT 172 Query: 209 EPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIG 268 E + K+ DQD+LN DK+ D K+N +Y K + P +D + IH++ Sbjct: 173 ENK--NKLKFHDQDILNHCFIDKVKIIDRKWNFMPFRDYNTKSHY--PTKDDAVIIHFV- 227 Query: 269 PTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNS--NQLRYSAKHMLKKHRYLKGF 326 KPW + +PW + + Q Y + + + Y K F Sbjct: 228 EHKPWKTQKDRTYFLDDYWRYYQYTPWFFEEPITAIQTMMQQKMYDYEDIRFRSNYFKFF 287 Query: 327 SNY 329 Y Sbjct: 288 GIY 290 >UniRef50_A1XRC1 Glycosyltransferase n=1 Tax=Haemophilus ducreyi RepID=A1XRC1_HAEDU Length = 267 Score = 221 bits (563), Expect = 4e-56, Method: Composition-based stats. Identities = 65/261 (24%), Positives = 120/261 (45%), Gaps = 13/261 (4%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 ++I + +D+N+ + + SIL +N + F+I ++ + + +L ++ + I+ Sbjct: 1 MNIVFSSDENYAPHLSVCLYSILSHN--YNINFYILDLGIKEESKSFIKSLVEKFNSNIE 58 Query: 89 IYLINGDRLRSLPS-TKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 I+ D + P + A Y R + DY + KVLYLD D I G++ L + Sbjct: 59 FIKISVDSFSNFPIYIDYISLATYARLKLTDYLP-QLEKVLYLDIDTIVNGSLIDLWDLD 117 Query: 148 FPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAML 207 + +A V K L YFN+G LLI+ +W + +++ ++ Sbjct: 118 LNEYYIAAVADPFIESLNYKTILGLD----KNIYFNAGVLLIDCIKWKQYNIFDKSVKII 173 Query: 208 NEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFIN---PVTNDTIFI 264 + + KK+ + DQD+LN++L DK++ D +YN S +K + +T + Sbjct: 174 KD--LSKKLQYQDQDILNLILKDKVLLLDCRYNFMPSQLDFIKRDKVRKGIKITTPIVIY 231 Query: 265 HYIGPTKPWHDWAWDYPVSQA 285 HY GP KPWH ++ Sbjct: 232 HYCGPKKPWHIDCTNFNCELY 252 >UniRef50_B1Y723 Glycosyl transferase family 8 n=1 Tax=Leptothrix cholodnii SP-6 RepID=B1Y723_LEPCP Length = 316 Score = 219 bits (559), Expect = 9e-56, Method: Composition-based stats. Identities = 56/271 (20%), Positives = 102/271 (37%), Gaps = 15/271 (5%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGS-RLCFHIFTDYFGDDDRKYFDALALQYKTRIKI 89 I D+ +L ++ S+++ N + H+ D R + +I+ Sbjct: 13 IVLACDEAYLMPLATTLRSVVESNAAHWPIECHVLVDDVSLPGRARVERSLPARAAQIRW 72 Query: 90 YLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFP 149 + ++ S + + + R ++AD + +VLYLD DI+ G + PL+ Sbjct: 73 HAVDLTDFSSFETQAAISKMTFARLLMADLLPAELERVLYLDTDILVLGDLLPLMRTELD 132 Query: 150 DDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNE 209 + V A+ G+ + YFN+G LLI+ A+W A +VSA A L Sbjct: 133 GAILGAVRDGLDAELKSTSPAPTGMPDVCD-YFNAGVLLIDLARWRAGRVSAAARDHLVA 191 Query: 210 PEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIGP 269 + DQD LN+ +N Q ++ + + +H+I Sbjct: 192 H---PQTPFADQDALNVACDGHWKPLAAHWNFQ---GHRSTDIAALAPSQRPGIVHFITA 245 Query: 270 TKPWHDWAWDYPVSQAFMEAKNASPWKNTAL 300 KPW + A+ W++ L Sbjct: 246 LKPWK-------ADSLSLNARLYDGWRSRTL 269 >UniRef50_C6I3U6 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=C6I3U6_9BACE Length = 310 Score = 219 bits (558), Expect = 1e-55, Method: Composition-based stats. Identities = 66/277 (23%), Positives = 110/277 (39%), Gaps = 21/277 (7%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 ++I D N++ + + S N+ ++ T D + Y + +Y + Sbjct: 1 MNILCCLDDNYVQHTSVMLTSFFINNDFEHHNIYVITMQLNDGNVAYLREVVNKYHSNFY 60 Query: 89 IYLINGDRLRSL--PSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINF 146 +Y +N L T + A Y R KVLY+D DI+ + ++E L Sbjct: 61 LYQVNEAMLSGFVRKETDYVSLAAYLRLFSTQVLPFNCSKVLYIDGDIVVRKSLEELWKM 120 Query: 147 SFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAM 206 + VA V +A+ + GYFNSGF+LIN + W V+ +AI Sbjct: 121 DIENYAVAAVDETIKANCIRHN------YDVTLGYFNSGFMLINLSFWRENSVAEKAIDY 174 Query: 207 LNEPEIIKKITHPDQDVLN-MLLADKLIFADIKYNT-------QFSLNYQLKESF---IN 255 + ++I DQD LN +L D+KYN Q+ + + N Sbjct: 175 MK--RFPERIKSWDQDALNGILYGGLWKRLDLKYNLTTIFLCKQYVEGQDFPKIYTEEYN 232 Query: 256 PVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNA 292 +D +HY GP KPW D+P + +++ Sbjct: 233 SAISDPAVVHYTGPDKPWKYTVVDHPFKKDYLQYARM 269 >UniRef50_Q9L6B2 Putative glycosyl transferase n=3 Tax=Pasteurella RepID=Q9L6B2_PASMU Length = 302 Score = 218 bits (555), Expect = 3e-55, Method: Composition-based stats. Identities = 58/274 (21%), Positives = 119/274 (43%), Gaps = 15/274 (5%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 ++I + +D + ++I SI+ +N + F+IF D++++ + + Y + + Sbjct: 1 MNILFVSDDVYAKHLVVAIKSIINHN-EKGISFYIFDLGIKDENKRNINDIVSSYGSEVN 59 Query: 89 IYLINGDRLRSLPST-KNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 +N S P + A Y R A+Y + K++YLD D++ ++E L N Sbjct: 60 FIAVNEKEFESFPVQISYISLATYARLKAAEYLPDNLNKIIYLDVDVLVFNSLEMLWNVD 119 Query: 148 FPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAML 207 + A + + S+ ++ YFN+G +L N +W V +RA+ +L Sbjct: 120 VNNFLTAACYDSFIENEKSEHKKSISMSDKEY-YFNAGVMLFNLDEWRKMDVFSRALDLL 178 Query: 208 NEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTN-------- 259 ++ + DQD+LN+L +K+ + D ++N + ++K+ ++N Sbjct: 179 AMY--PNQMIYQDQDILNILFRNKVCYLDCRFNFMPNQLERIKQYHKGKLSNLHSLEKTT 236 Query: 260 -DTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNA 292 + HY GP K WH + + + Sbjct: 237 MPVVISHYCGPEKAWHA-DCKHFNVYFYQKILAE 269 >UniRef50_B7KM20 Glycosyl transferase family 8 n=1 Tax=Cyanothece sp. PCC 7424 RepID=B7KM20_CYAP7 Length = 347 Score = 217 bits (554), Expect = 4e-55, Method: Composition-based stats. Identities = 60/282 (21%), Positives = 109/282 (38%), Gaps = 12/282 (4%) Query: 25 ENLCLDIAYGTDKNFLFGCGISIASILKY-NEGSRLCFHIFTDYFGDDDRKYFDALALQY 83 EN + I G D F G +++ S L + ++ +I +R + Sbjct: 9 ENEPITIVSGADDKFALGLAVTLYSALANLDTKRKIDIYIVDGGINSKNRDKLTQILNSD 68 Query: 84 KTRIKIYLINGDR--LRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIE 141 + I + D L + + YFR ++ + + +V+YLD+D++ +G + Sbjct: 69 LMPVSIKWVKPDLTVLEGVKLFGSLNVTTYFRLLLPELLPTQVERVIYLDSDLVVEGNLA 128 Query: 142 PLINFSFPDDKVAMVVTEGQADWWE--KRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQV 199 L + V K LG+A Y N+G +LIN QW + + Sbjct: 129 NLWEQELGNCPAVAVQDYVFPYVCNGLKTYQQLGLAS-NTPYCNAGVMLINIKQWRIEAL 187 Query: 200 SARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFI---NP 256 + + + + + + DQD +N L+A++ D+K+N Q Y K + Sbjct: 188 NRKILEYIRK--FYDLVYLADQDGINALIANRFKLLDLKWNVQIFGVYNGKIDLLCKPKE 245 Query: 257 VTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNT 298 + D +H+ P KPWH + + F S W N Sbjct: 246 LIRDAFILHFTTPIKPWHPY-YRQAGGSRFTHYLRKSKWFND 286 >UniRef50_C7IBC8 Glycosyl transferase family 8 n=1 Tax=Clostridium papyrosolvens DSM 2782 RepID=C7IBC8_9CLOT Length = 464 Score = 217 bits (554), Expect = 4e-55, Method: Composition-based stats. Identities = 61/245 (24%), Positives = 100/245 (40%), Gaps = 10/245 (4%) Query: 64 FTDYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINK 123 +++ A +Y +RI+ + + + + + + YFR I + Sbjct: 2 IDGGISSRNKECLRACVEKYGSRIRFLELKPELYQDFKTQSYFGYVTYFRIFIPEIVEAS 61 Query: 124 APKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAK--GY 181 KV+YLD DI+ +G I L + VA V G GI + Y Sbjct: 62 VRKVIYLDCDIVIKGDIRKLWENDISEYFVAAVEDVGIDIGGNFATMVKKHIGIPRKGKY 121 Query: 182 FNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNT 241 FN+G LLIN +W A + + L E +KI DQD LN + D+ + I++N Sbjct: 122 FNAGVLLINLDKWRADKTTETIRKYLIENR--EKIYFADQDGLNAVFKDRWLKLPIEWNQ 179 Query: 242 QFSLNYQLKESFIN-----PVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWK 296 Q + LK + I+ + + IHY KPW +P+ + + +PW Sbjct: 180 QADILELLKRNRIDRPDVMKAALNPMIIHYTKQVKPWQYKDC-HPLKEEYHRYLRLTPWN 238 Query: 297 NTALL 301 +TA Sbjct: 239 DTAPK 243 >UniRef50_B6GCA0 Putative uncharacterized protein n=1 Tax=Collinsella stercoris DSM 13279 RepID=B6GCA0_9ACTN Length = 990 Score = 217 bits (553), Expect = 5e-55, Method: Composition-based stats. Identities = 61/358 (17%), Positives = 133/358 (37%), Gaps = 31/358 (8%) Query: 1 MQQVFFQETEFLNSVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGS-RL 59 +Q V FQE ++ + +++ + + + +D N++ +I S+L + R Sbjct: 621 LQCVHFQEPDY-KPGLKMPVRLDDLRQIVPVVFASDNNYVPMLTTTIHSMLSNASNNYRY 679 Query: 60 CFHIFTDYFGDDDRKYFDALALQY-KTRIKIYLING--DRLRSLPSTKNWTHAIYFRFVI 116 + ++ Y + ++ ++ + + + Y+RF+I Sbjct: 680 DITVLHRDISGANQAIMREFFSSYDNVNLGFCDVSQVIEKYNLTTNNPHISVETYYRFLI 739 Query: 117 ADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMV-----VTEGQADWWEKRAHS 171 D KVLYLD+D+I +G + L D +A V ++ A++ Sbjct: 740 QDLLP-YYDKVLYLDSDLIIRGDVSELFATDLGDSLLAAAHDIDFVANVNMKRGDRFAYA 798 Query: 172 LGVAGIAKG--YFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLA 229 V G+ YF +G L++NT ++ + ++ + + DQDVLN Sbjct: 799 KEVLGMKDPYSYFQAGVLVLNTRAMRSRHTMEEWLEFASD----DRFIYNDQDVLNAHCE 854 Query: 230 DKLIFADIKYNTQFSLNYQLKESF----------INPVTNDTIFIHYIGPTKPWHDWAWD 279 ++++ D +N ++ + F ++ +HY G KPW D Sbjct: 855 GEVVYLDYSWNVMIDCFGRINKVFTFAPAYMFDAFIESRSNEKIVHYAGFEKPWKLAGCD 914 Query: 280 YPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKKHRYLKGFSNYLFYFIEKI 337 + + +P+ + L N+ +++ H + L ++ I Sbjct: 915 --RGELYWRYARETPFYESLLQHSIAVNRSGRLPDYLI--HEPALSPRSPLRKIVDPI 968 >UniRef50_P43974 Putative glycosyltransferase HI0258 n=32 Tax=Haemophilus influenzae RepID=Y258_HAEIN Length = 330 Score = 217 bits (553), Expect = 5e-55, Method: Composition-based stats. Identities = 58/258 (22%), Positives = 113/258 (43%), Gaps = 7/258 (2%) Query: 23 ETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQ 82 T + ++I + +D + +SI SI+K ++ F+I +++ + LA Sbjct: 33 RTVSQTMNIIFSSDHYYAPYLAVSIFSIIKNTP-KKINFYILDMKINQENKTIINNLASA 91 Query: 83 YKTRIKIYLINGDRLRSLPST-KNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIE 141 Y ++ + ++ P T + A Y R + Y K +Y+D D + +++ Sbjct: 92 YSCKVFFLPVCESDFQNFPKTIDYISLATYARLNLTKYI-KNIEKAIYIDVDTLTNSSLQ 150 Query: 142 PLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSA 201 L N + +A E ++G+ G + YFN+G LLIN +W + + Sbjct: 151 ELWNIDITNYYLAACRDTFIDVKNEAYKKTIGLEGYS--YFNAGILLINLNKWKEENIFQ 208 Query: 202 RAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDT 261 ++I +N+ + K + DQD+LN + K+ F + ++N + +K+ + V Sbjct: 209 KSINWMNKYNNVMK--YQDQDILNGICKGKVKFINNRFNFTPTDRDLIKKKNLLCVKMPI 266 Query: 262 IFIHYIGPTKPWHDWAWD 279 + HY GP K WH Sbjct: 267 VISHYCGPNKFWHKKCSH 284 >UniRef50_B8ISQ5 Glycosyl transferase family 8 n=1 Tax=Methylobacterium nodulans ORS 2060 RepID=B8ISQ5_METNO Length = 328 Score = 217 bits (552), Expect = 6e-55, Method: Composition-based stats. Identities = 65/264 (24%), Positives = 105/264 (39%), Gaps = 16/264 (6%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDAL-ALQYKTRI 87 + +A D+ F +++AS+L L HIF AL A Q + Sbjct: 13 IAVALCIDRAFFRHALVTVASLLDAGPRQPLDVHIFYAEADPACMARIAALFADQDRHGC 72 Query: 88 KIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 I+ DR P + + Y R ++ Y + + KVLYLDAD+I + PL Sbjct: 73 HFQKISLDRFEGFPVSDAISAGTYARLLLP-YLMPRRAKVLYLDADLIVLDDVAPLWRTE 131 Query: 148 FPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAML 207 VA V + ++G + + YFN+G LL+N A W + ++ R A + Sbjct: 132 LGAAPVAAVRDPFCDNRP-----AIGFSP-DEPYFNAGVLLMNLAVWRREGLAERVAAHI 185 Query: 208 NEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSL------NYQLKESFINPVTNDT 261 + + + DQD LN++L + F D ++N Q + + + Sbjct: 186 DAHGA--SLKYFDQDALNVVLRGRARFVDPRWNFQPRMADATPADIACARAEFRRTRARP 243 Query: 262 IFIHYIGPTKPWHDWAWDYPVSQA 285 IHY P KPW D + Sbjct: 244 AIIHYTTPHKPWKDPFAIHYGRHY 267 >UniRef50_B3XL28 Glycosyl transferase family 8 n=1 Tax=Lactobacillus reuteri 100-23 RepID=B3XL28_LACRE Length = 331 Score = 216 bits (551), Expect = 8e-55, Method: Composition-based stats. Identities = 61/298 (20%), Positives = 125/298 (41%), Gaps = 29/298 (9%) Query: 25 ENLCLDIAYGTDKNFLFGCGISIASILKYNEG-SRLCFHIFTDYFGDDDRKYFDALALQY 83 +I Y TD F G S+ S+L+ N+ ++ F I +++ + + + Sbjct: 1 MKTIYNIVYATDDTFAPVLGTSLLSLLRNNKEAKKINFFILDSGISKENKFRIEKICDNF 60 Query: 84 -KTRIKIYLING--DRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTI 140 +K I ++ + + Y R I D N +VLYLD D + ++ Sbjct: 61 VNASLKWIKIESISKKIGIDVKNDRGSFSQYSRLFIGDVLDNSVERVLYLDCDTLILSSL 120 Query: 141 EPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVS 200 + L N + +A + + + ++ K + + FNSG +LI+ W ++ Sbjct: 121 KDLWNIELKGNIIAA-LKDAFSKYYRKNINLVN----DDLMFNSGVMLIDLKAWRDNKIK 175 Query: 201 ARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQ---FSLNYQLKESFINPV 257 +AI+ + + K+ DQ VLN +L++K D +YN + L+Y+ + + +PV Sbjct: 176 EKAISFIRQRH--GKVQQGDQGVLNSVLSNKTFALDPRYNLVSIFYDLDYREIKLYRSPV 233 Query: 258 -----------TNDTIFIHYIG---PTKPWHDWAWDYPVSQAFMEAKNASPWKNTALL 301 + + +H+ +PW + ++ + +++ +PWKN L Sbjct: 234 NFYSEKIIVKAKENPVILHFTSSFYSIRPWFKNS-NHQCKKIWLKFYQETPWKNQPLQ 290 >UniRef50_C1QC00 LPS:glycosyltransferase n=1 Tax=Brachyspira murdochii DSM 12563 RepID=C1QC00_9SPIR Length = 332 Score = 210 bits (536), Expect = 5e-53, Method: Composition-based stats. Identities = 59/278 (21%), Positives = 98/278 (35%), Gaps = 19/278 (6%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNE-GSRLCFHIFTDYFGDDDRKYFDALALQYKTRI 87 +DI D N+ G +IASIL ++ + FH+ ++++ +L I Sbjct: 1 MDICLSADDNYAKYMGTTIASILSNSKEDEEIYFHLLDGGITEENKNKLLSLKNIKNCDI 60 Query: 88 KIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 Y +N + + +FR + K+LYLD D I +++ L Sbjct: 61 IFYSVNNMNYK-------YDAPHFFRLNVPSLIP-NVDKLLYLDCDTIVLNSLKELFEID 112 Query: 148 FPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAML 207 + ++ + YFNSG L+IN W ++ Sbjct: 113 ISNYYALACEDVFLNCIIS--FKNMHGLNVNDIYFNSGMLMINNKLWRDDKLENLFYD-- 168 Query: 208 NEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYI 267 + H DQDVLN ++ ++ D K+N K I+ V IHY Sbjct: 169 -DYSKFGNTGHADQDVLNRIIKGRVKIVDSKWNFLSHKKVYSKAPDISLVN----IIHYA 223 Query: 268 GPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNN 305 G KPW + + F + +PW L Sbjct: 224 G-EKPWKETSSKAFFIDEFWKYYQLTPWCRENTLDAVK 260 >UniRef50_Q65VF6 RfaJ protein n=1 Tax=Mannheimia succiniciproducens MBEL55E RepID=Q65VF6_MANSM Length = 309 Score = 210 bits (535), Expect = 7e-53, Method: Composition-based stats. Identities = 60/260 (23%), Positives = 106/260 (40%), Gaps = 21/260 (8%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDAL------ALQ 82 ++I + D+N+ + I SIL N F+I ++ + L Sbjct: 1 MNIIFNCDENYAPYLSVVIKSILD-NTTLSTQFYILDFNISEESKSCIKNLIQNINKKNS 59 Query: 83 YKTRIKIYLINGDRLRSLP-STKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIE 141 ++ I I+ + + P + + A Y R +ADY N+ K +YLD DII + Sbjct: 60 FQHSINFIKIDDNDFQCFPQTISYISSATYARLKVADYL-NELNKAIYLDIDIIVISDLS 118 Query: 142 PLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSA 201 L + D+ V + + +G+ ++ Y N+G LL+N + Sbjct: 119 RLWHIDLADNLVGACLDPYIEYENQDYKRKIGLQD-SQPYINAGVLLLNLKALREFNLYQ 177 Query: 202 RAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESF-------- 253 +AI + I DQD+LN +L K++F D +YN + ++K + Sbjct: 178 KAIDWNKDY---PNIQFQDQDILNGVLKGKVLFLDSRYNFTVNHRNRIKLAHKGKLLLSS 234 Query: 254 INPVTNDTIFIHYIGPTKPW 273 + T +HY+G KPW Sbjct: 235 LEKATKPICILHYVGSHKPW 254 >UniRef50_D0D9G3 Putative general stress protein A n=1 Tax=Citreicella sp. SE45 RepID=D0D9G3_9RHOB Length = 327 Score = 210 bits (534), Expect = 9e-53, Method: Composition-based stats. Identities = 70/301 (23%), Positives = 119/301 (39%), Gaps = 30/301 (9%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYN-EGSRLCFHIFTDYFGDDDRKYFDALALQYKTRI 87 +++ Y D G+SIAS L+ EG+ + H+ + +RK + + Sbjct: 12 INVVYACDNIQALPLGVSIASALENRAEGNPINIHVLSYRISRSNRKSIASQFDGRDDTL 71 Query: 88 KIYLINGD---RLRSLPSTKN--WTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEP 142 + I G+ L L ++ N + A Y R +I++ + +YLD DII + P Sbjct: 72 CWHEITGENRKLLEDLFTSSNRPYPPAAYARLLISEVIP-NIDRAIYLDTDIIVATDLSP 130 Query: 143 LINFSFPDDKVAMVVTEGQADWWEKRAHSLGVA--------GIAKGYFNSGFLLINTAQW 194 L N F + + ++ KR +L YF SG L+ + ++ Sbjct: 131 LWNTPFDGAGLLAIQDLPTSNDHIKRLRALLSPEDISRYGIEDGDSYFQSGVLVFDMKEF 190 Query: 195 AAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSL-------NY 247 + S + N P+ +T PD D LN++ D D ++N S+ + Sbjct: 191 TKTRASELIECLRNYPD----LTFPDNDALNIVFHDSFKLVDPRWNQMASVFKLDAARDT 246 Query: 248 QLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSN 307 + D IHY G KPW D +P ++EA S W + KP+ N Sbjct: 247 PYSAEVFQALLQDPYIIHYSGRPKPWED-GCTHPYLDRWVEALKDSAWNS---WKPSRLN 302 Query: 308 Q 308 + Sbjct: 303 R 303 >UniRef50_A0KQP2 Glycosyl transferase, family 8 n=2 Tax=Aeromonas RepID=A0KQP2_AERHH Length = 366 Score = 209 bits (533), Expect = 1e-52, Method: Composition-based stats. Identities = 73/295 (24%), Positives = 133/295 (45%), Gaps = 24/295 (8%) Query: 25 ENLCLDIAYGTDKNFLFGCGISIASILKY-NEGSRLCFHIFTDYFGDDDRKYFDALALQY 83 + A+ D +F I S+ K+ + +L H+ + ++ L + Sbjct: 1 MRKIIHSAFCIDDSFAVHLAALIHSLGKHLSHDLQLQCHVLA-RLSETNKFKLSKLESE- 58 Query: 84 KTRIKIYLINGDRLRSLPST---KNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTI 140 IK Y D S Y+RF I KVL++D+D+I G I Sbjct: 59 NLVIKFYDNLPDYKDIPISNLYNNRLNEVTYYRFAIPHIL-KSIDKVLFIDSDMIALGDI 117 Query: 141 EPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKG-YFNSGFLLINTAQWAAQQV 199 PL + D VA+V +K+ + GI+ G YFN+GF+L+N +W A+ + Sbjct: 118 SPLWSIDMGDAIVAVVSDHILGCDKKKQL----MRGISSGKYFNAGFMLMNLDKWRAKNI 173 Query: 200 SARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTN 259 S +A+ +L E H DQD LN++L +K ++ D K+N Q N+ + +F+ Sbjct: 174 SEQALRLLIEN---NGFEHNDQDALNIVLENKTVYIDNKWNAQP--NHLAQNNFL----- 223 Query: 260 DTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAK 314 I +H+ G KPWH ++ ++P +++ ++ + + N L + + + ++ Sbjct: 224 -PILVHFCGQEKPWHIYS-NHPFKGSYLVSRRETDYANEPLQSYLDDHDIEILSR 276 >UniRef50_C2KV37 Lipopolysaccharide biosynthesis protein, LPS:glycosyltransferase n=1 Tax=Oribacterium sinus F0268 RepID=C2KV37_9FIRM Length = 324 Score = 209 bits (532), Expect = 1e-52, Method: Composition-based stats. Identities = 67/328 (20%), Positives = 120/328 (36%), Gaps = 36/328 (10%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 + I YG ++ F+ +S++S+L + EG L FHI + ++ ++ +I Sbjct: 1 MHIVYGVNEAFMPILAVSLSSLLLHAEGEALHFHILSLGIEEESKEKLRQYVETEGQKIS 60 Query: 89 IYLINGDRLRSLPS-----TKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPL 143 Y + T ++ A R I K LYLDAD + +I L Sbjct: 61 FYDLEEKLSEWKEKLPALFTGKFSKATLLRLFIPSTLPETITKALYLDADTVVLQSILSL 120 Query: 144 INFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARA 203 + D + M ++K L +A Y+N+G +L+N + + + + Sbjct: 121 YHLRLGDKLLGMAPEPS---IYKKHKEFLSLAE-ESPYYNAGVMLMNLSLLREEGMEEKC 176 Query: 204 IAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYN-------------TQFSLNYQLK 250 + E ++ DQD+LNM+ ++ ++N +FS YQ Sbjct: 177 LRYYQMKE--GQLPFNDQDILNMVCKGRIRSLPQRFNFFSNYAYARYSALCRFSPWYQEL 234 Query: 251 ESFI--NPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQ 308 ES + + +H+ G +PW + +Y +AF SP L K Sbjct: 235 ESKKSYSQAKAHPVIVHFAGDERPWREGNHNY-YRRAFDYYAEESP---LPLEKEKGKQG 290 Query: 309 LRYSAKHM------LKKHRYLKGFSNYL 330 + + K R G Y Sbjct: 291 YLFCYHVLNLLTFVFPKLREKVGEFYYR 318 >UniRef50_D2QX95 Glycosyl transferase family 8 n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2QX95_9PLAN Length = 350 Score = 208 bits (529), Expect = 3e-52, Method: Composition-based stats. Identities = 63/340 (18%), Positives = 127/340 (37%), Gaps = 35/340 (10%) Query: 28 CLDIAYGTDKNFLFGCGISIASILKY-NEGSRLCFHIFTDYFGDDDRKYFDALALQYKTR 86 LD+ D F G +I S+L + S+L + ++R + Sbjct: 4 VLDVLTSADDRFAIGLAGTIKSVLASLSPSSKLNLWVLDGGISSENRDDLIHHWNDPRLS 63 Query: 87 IKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINF 146 + ++ L + + A Y+R + + + K+LY+DAD++ Q + L + Sbjct: 64 VNWLPVDRALLAEFKVAPHMSDAAYYRLLAPNLLPSSVKKLLYIDADLLVQRDLTDLWDE 123 Query: 147 SFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAK-------------------GYFNSGFL 187 F V G + +++ YFNSG Sbjct: 124 PFDGHSCIAVHDIGAPFLDSNQILLEKPDALSRIVCRNPIPMFEELGLAPETRYFNSGVF 183 Query: 188 LINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNY 247 +I+ W ++Q+S + +L ++ + DQ LN++LA++ AD ++N Q + + Sbjct: 184 MIDLETWRSEQLSVQMFDVLCTHR--ERQIYHDQFALNIVLANRWKAADYRWN-QLAYIH 240 Query: 248 QLK---ESFINPV-----TNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPW-KNT 298 +LK +F+ P + +H+ KPW +P+ + F + S W + Sbjct: 241 ELKVPQHTFLEPQVFQQYKHSPWVVHFT-YRKPWQPE-CQHPLRKRFFDYLAGSKWMQAM 298 Query: 299 ALLKPNNSNQLRYSAKHMLKKHRYLKG-FSNYLFYFIEKI 337 P + A ++ + L G + I+ + Sbjct: 299 PEWHPPQQPIVAPPAPPAARQRQGLLGRMQRSIRKRIDSL 338 >UniRef50_B6VUC8 Putative uncharacterized protein n=1 Tax=Bacteroides dorei DSM 17855 RepID=B6VUC8_9BACE Length = 315 Score = 208 bits (529), Expect = 3e-52, Method: Composition-based stats. Identities = 58/316 (18%), Positives = 121/316 (38%), Gaps = 17/316 (5%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 + I + + C + + S+ + N+ + ++F+ D++ K + L +Y T+++ Sbjct: 2 ISILCNSSNEYAIHCKVMLTSLFENNKQNDKEVYVFSTSMSDENIKGLELLGQRYGTKVQ 61 Query: 89 IYLINGDRLRSLPSTKNW-THAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 I +++ +L+ LP + A Y R AD K+LYLD DII ++ L + Sbjct: 62 IIIVDSQKLQFLPIHFAYHNIACYLRLFAADLLPG-INKLLYLDCDIIVNSDLKALWDID 120 Query: 148 FPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAML 207 D A + K+ L Y N+G +LIN W V+ + + Sbjct: 121 ITDYAFAATHDLTYCEPNFKKNLQL---EENDTYINTGVMLINCDYWRNNNVAQKVLDYA 177 Query: 208 NEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESF------INPVTNDT 261 K+ DQD LN + ++N Y+ + ++ + + Sbjct: 178 I--HNGDKMIAADQDALNATMQGSFKLFSEEWNVYPDYFYEKPNLYTNVYPILDEIRRNP 235 Query: 262 IFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKKHR 321 IH++ KPW ++ ++P+ + + + K + N ++ + Sbjct: 236 KIIHFL-YVKPWFNY-CNHPLRYLYGKYYAIAEGK--PFILKRNKESIKRDIARLKHCLL 291 Query: 322 YLKGFSNYLFYFIEKI 337 G Y + ++ Sbjct: 292 DFMGIKYYYHVYDKRF 307 >UniRef50_B1MX28 Lipopolysaccharide biosynthesis protein, LPS:glycosyltransferase n=2 Tax=Leuconostoc RepID=B1MX28_LEUCK Length = 283 Score = 207 bits (528), Expect = 4e-52, Method: Composition-based stats. Identities = 56/271 (20%), Positives = 106/271 (39%), Gaps = 10/271 (3%) Query: 25 ENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYK 84 + ++I D+N++ + + S+ + N + + D+ + Q Sbjct: 8 NDDSVNILITIDENYIKPLRVLLYSLRQTNPRENMTIWLAHDHIEVAQLEKLHQFVAQLG 67 Query: 85 TRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLI 144 + ++ S P+ K + +YFR + Y +V+YLD DI+ I PL Sbjct: 68 FVLHTIKVDTSLWASAPTFKQYPPEMYFRLLCGQYLPKTLHRVIYLDPDILVINPIRPLA 127 Query: 145 NFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAI 204 N +A G + H G + YFNSG +L++ + + Sbjct: 128 NMPLKGQMLAASSHMGLTGISQTINHL--RLGTRQVYFNSGVMLMDLDMMRQRVDMKAIL 185 Query: 205 AMLNEPEIIKKITHPDQDVLNMLLADKLIFADIK-YNTQFSLN-YQLKESF----INPVT 258 +++ + K++ PDQD+LN L D+++ + +N N +SF + V Sbjct: 186 SVIQQY--GKELILPDQDILNYLYGDEILSLPEEIWNYDTRDNIMHYAKSFGSVDMRWVM 243 Query: 259 NDTIFIHYIGPTKPWHDWAWDYPVSQAFMEA 289 +T+ +HY G KPW P + Sbjct: 244 ENTVILHYCGRPKPWEKSNSINPFIMLYQHY 274 >UniRef50_D1Y7U2 Glycosyltransferase, family 8 n=1 Tax=Pyramidobacter piscolens W5455 RepID=D1Y7U2_9BACT Length = 617 Score = 207 bits (527), Expect = 5e-52, Method: Composition-based stats. Identities = 64/339 (18%), Positives = 121/339 (35%), Gaps = 42/339 (12%) Query: 32 AYGTDKNFLFGCGISIASILKY-NEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIY 90 ++ ++ G + SI + + +IF ++ ++ + Sbjct: 283 VLAANEKYVPILGTCLKSIADHCSSSRSYKLYIFHTDIQEESQRNLKTFLESDNFSLTFV 342 Query: 91 LINGDRLR-SLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFP 149 ++ + L + ++ T ++RF+I D KVLYLD D+I Q I L + Sbjct: 343 NVSLHVGKYRLRAKEHVTTETFYRFLILDLL-KMYDKVLYLDCDMIIQRDIADLYDLDLG 401 Query: 150 DDKVAMVVTE-------GQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSAR 202 + + + G K ++ YF +G LL+N A+ + Sbjct: 402 TNLIGAALDPDFTGQCNGANPATRKYCDAVLKLKDCFTYFQAGVLLMNVAELNKSVTVRQ 461 Query: 203 AIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKE----------- 251 + M + DQD+LN++ + ++ D+ +N ++ Sbjct: 462 LLEMAET----GIYKYSDQDILNVVCEGRALYLDMAWNLLSDCDHYRWHHVVKFAPHYIL 517 Query: 252 SFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTAL----------L 301 IHY G KPW D+ F +A +P+ L Sbjct: 518 DMYENAREKPYIIHYAGFLKPWMKLGEDFGY--EFWKAARETPFYEELLYAALVPHGNTT 575 Query: 302 KPNNS-----NQLRYSAKHMLKKHRYLKGFSNYLFYFIE 335 +P N N+L AK +L K L+ F+ +L+Y I+ Sbjct: 576 RPQNFLHMLINRLVPLAKAVLPKGSRLRYFARHLYYRIK 614 >UniRef50_B2UQ54 Glycosyl transferase family 8 n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UQ54_AKKM8 Length = 328 Score = 206 bits (525), Expect = 9e-52, Method: Composition-based stats. Identities = 64/330 (19%), Positives = 120/330 (36%), Gaps = 22/330 (6%) Query: 21 KVETENLCLDIAYGTDKNFLFGCGISIASILKYN-EGSRLCFHIFTDYFGDDDRKYFDAL 79 + + +D + +++ S+L + ++ +D ++ + L Sbjct: 2 NNPMKKNEFAVVLASDNRGILPLSVTVFSLLNTAGPETFYKIYVLSDGIDGENWASVERL 61 Query: 80 ALQYKTRIKIYLINGDRLRS-LPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQG 138 A + R++ ++G + P T+ W + R I + + +LYLD D++ Sbjct: 62 AAPFDCRLEFIDVSGILEKHDFPHTEQWPVPAWGRVFIPELLKEERGNILYLDIDVLVCR 121 Query: 139 TIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQ 198 + L + + VV E + L + GYFNSG LL+N + + Sbjct: 122 DLTELFRTNMDGKAIG-VVFENFSRPGSHFNERLEMPLTCTGYFNSGVLLMNVDVFREKN 180 Query: 199 VSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQL-----KESF 253 + + ++T PDQD LN L + + ++N L ++ +E F Sbjct: 181 LVRAVLDYAVTHR--DRLTCPDQDALNGALCELTVPLHPRWNWHDGLTRRILKNDPREQF 238 Query: 254 INPVTN--------DTIFIHYIGPTKPWH-DWAWDYPVSQAFMEA--KNASPWKNTALLK 302 VT + +HY G KPW +W ++ + M P L Sbjct: 239 WRGVTPRQAVEAALEPGILHYQGVHKPWRYNWRYEGERYERVMREAGLLRGPLPGRTLPA 298 Query: 303 PNNSNQLRYSAKHMLKKHRYLK-GFSNYLF 331 + R + +K LK GF N L Sbjct: 299 VLKKHLYRPVYRMTARKILRLKEGFDNRLL 328 >UniRef50_C9PNX4 Family 8 glycosyl transferase n=1 Tax=Pasteurella dagmatis ATCC 43325 RepID=C9PNX4_9PAST Length = 285 Score = 206 bits (524), Expect = 1e-51, Method: Composition-based stats. Identities = 57/282 (20%), Positives = 111/282 (39%), Gaps = 21/282 (7%) Query: 25 ENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYK 84 + ++I D F I S+ +N+ + F++ + + + + + Sbjct: 8 RDSNMNIVLSADVQFSEQVKTLIKSVSYHNKN--VHFYLLNKDYPSEWFQILNQYLAYFG 65 Query: 85 TRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLI 144 + I ++ + + + P+ + + A YFR+++ +VLYLD D++ G++ + Sbjct: 66 SNIIDAKVDSEVISTFPTLDHISEASYFRYLLGQL---PLDRVLYLDCDVVVTGSLTEIY 122 Query: 145 NFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAI 204 F D+ + V + HS K YFNSG LLI+ +W Q + + + Sbjct: 123 YTDFGDNMMYAVEDA----FLNIAPHSYKEFPDMKPYFNSGMLLIDLNKWRDQNIENQLM 178 Query: 205 AMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFS--------LNYQLKESFINP 256 + + + + DQD +N++L K D YN Q + E + + Sbjct: 179 DLTKQ---AVNLYYGDQDAMNIILKGKWQALDKIYNYQTGSLIAFIQHKMPEALEKYKDL 235 Query: 257 VTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNT 298 IHYI KPW +D P + W++ Sbjct: 236 QGQQPKVIHYITRYKPWLLPEYDLPFRDQYWAYYQLE-WQDI 276 >UniRef50_C8N7M8 Putative uncharacterized protein n=1 Tax=Cardiobacterium hominis ATCC 15826 RepID=C8N7M8_9GAMM Length = 618 Score = 205 bits (523), Expect = 1e-51, Method: Composition-based stats. Identities = 59/304 (19%), Positives = 112/304 (36%), Gaps = 33/304 (10%) Query: 18 YDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRL-CFHIFTDYFGDDDRKYF 76 Y V+T+ + + +D N+ G I SIL + + I +RK Sbjct: 269 YAQPVQTDKPVVSVVIASDDNYTPHLGALICSILDHFPADKYLDLIILDGGISALNRKLL 328 Query: 77 DALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIIC 136 L I+ + D + L + +++ A ++R ++ + KVLY+D D I Sbjct: 329 MRLLPT-HANIQFLEL-KDEFQQLATHMHFSRATFYRLILDKLIPGR-DKVLYIDCDTIV 385 Query: 137 QGTIEPLINFSFPDDKVAMVVT----------------EGQADWWEKRAHSLGVAGIAKG 180 I L + D + V G +G+ + Sbjct: 386 LDDISTLFDTPLGDHAIGAVFDYIMHHFCLNDVLSIDTTGSLPAKRYLHDYVGLEDGWQR 445 Query: 181 YFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYN 240 YF +G +L N + +S I+ L + K+ DQD+LN +++ D ++N Sbjct: 446 YFQAGVILFNMEKLRRLDLSEVMISDL----LNKRYWFLDQDILNKYFLGDVVYLDPRWN 501 Query: 241 TQFSLNYQLKE------SFINPVTNDTIFIHYIG-PTKPWHDWAWDYPVSQAFMEAKNAS 293 + S+ + + + D IHY G TKPW++ + +++ + + Sbjct: 502 SVNSVQNIYQGLPATYIAELKTTETDPKIIHYAGFETKPWNNRYAE--LAEYYFYYLRQT 559 Query: 294 PWKN 297 W Sbjct: 560 FWYE 563 >UniRef50_B8G232 Glycosyl transferase family 8 n=2 Tax=Desulfitobacterium hafniense RepID=B8G232_DESHD Length = 280 Score = 204 bits (520), Expect = 3e-51, Method: Composition-based stats. Identities = 52/256 (20%), Positives = 108/256 (42%), Gaps = 13/256 (5%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 ++I + +++ + + S+L N G + ++ +D D + +++ Sbjct: 1 MNILVTLNSSYVKQLMVMLTSLLDSNPGEQFTVYVAHSAMSKEDFARIDQAIDSSRCKVE 60 Query: 89 IYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSF 148 ++ + L P T + +Y+R +Y + ++LYLD D++ ++ L F Sbjct: 61 GIKLSDEGLSKAPITSRYPKEMYYRIFAVNYLPDHLERILYLDPDLVVINPLKELYTIDF 120 Query: 149 PDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLN 208 + A + +K H Y NSG +++N + +Q + Sbjct: 121 QGNFFAAASH--VKELLKKLNHVRLNMAEDSTYVNSGVMMMNLSLLRQEQDVHEVYQYIE 178 Query: 209 EPEIIKKITHPDQDVLNMLLAD-------KLIFADIKYNTQFSLNYQLKES--FINPVTN 259 E + ++ PDQDVLN + +D K+ +Y ++LN + ++ ++ V + Sbjct: 179 EYK--HRLFLPDQDVLNGVYSDRTLTVDAKIYNLSERYYALYNLNPKYWDAKIDLDWVRS 236 Query: 260 DTIFIHYIGPTKPWHD 275 +T IHY G KPW D Sbjct: 237 NTAIIHYCGRNKPWKD 252 >UniRef50_C1QC03 LPS:glycosyltransferase n=1 Tax=Brachyspira murdochii DSM 12563 RepID=C1QC03_9SPIR Length = 347 Score = 204 bits (519), Expect = 4e-51, Method: Composition-based stats. Identities = 55/268 (20%), Positives = 113/268 (42%), Gaps = 13/268 (4%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEG-SRLCFHIFTDYFGDDDRKYFDALALQYKTRI 87 +++ + ++ + +IAS+L + + +I ++ + +++ +L + I Sbjct: 10 INVCFASNDAYAPYMSTAIASLLSNAKDDENINIYIISENINNSNKEKILSLKKIRECSI 69 Query: 88 KIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 + + + +++ +FR I A K++YLD D+I ++ L + Sbjct: 70 DFIEPKEEIFKYISKYNMKSNSTWFRLSIPSLIP-NADKIVYLDGDMIINSSLRELFSDD 128 Query: 148 FPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAML 207 D V E D ++ +G + K YFN+GFL+IN W + + Sbjct: 129 MSDYY--AYVVEDVMDKIDEVKAPIGFSKTDK-YFNAGFLMINNKLWIEDNLEEK---FY 182 Query: 208 NEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYI 267 N + + + + DQD+LN L +++ F D K++ L+ + I+ N IH + Sbjct: 183 NAVDTMPILGYKDQDILNYCLKNRVKFIDKKWDF---LDNKSCYKEISADINKINIIHCV 239 Query: 268 GPTKPWHDWAWDYPVSQAFMEAKNASPW 295 G KPW + F + +PW Sbjct: 240 G--KPWKKECNVAFFADEFWKYYQLTPW 265 >UniRef50_C8W7U9 Glycosyl transferase family 8 n=2 Tax=Atopobium RepID=C8W7U9_ATOPD Length = 1014 Score = 203 bits (516), Expect = 8e-51, Method: Composition-based stats. Identities = 53/327 (16%), Positives = 121/327 (37%), Gaps = 29/327 (8%) Query: 4 VFFQETEFLNSVIDYDHKVE-TENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRL-CF 61 F + ++ K E + + + D N++ ++ S+L+ + +R Sbjct: 647 HFTNPEPRQKFIPLFEEKPEIASQNVVPVVFAADNNYVPILTCAMGSMLENADPNRYYDV 706 Query: 62 HIFTDYFGDDDRKYFDALALQY-KTRIKIYLI--NGDRLRSLPSTKNWTHAIYFRFVIAD 118 + G ++ +Y RI Y + + + + + YFRF+ D Sbjct: 707 VVLNTNIGGSKQELVKKFFSRYKNARITFYNVWRMVKDYKLDTNNAHISVETYFRFLAQD 766 Query: 119 YFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVT-----EGQADWWEKRAHSLG 173 KV+YLD+D++ G + L + ++ +A + ++ +SL Sbjct: 767 ILSA-YDKVVYLDSDLVVNGNVAELYDVRIGNNLIAATLDIDYLANLNIRGGDRMKYSLD 825 Query: 174 VAGIAKG--YFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADK 231 V + YF +G ++ NTA+ + + + P + DQD+LN + Sbjct: 826 VLNLKNPYAYFQAGVMVFNTAELRRYHTVPEWLRIASNP----IFIYNDQDILNSECQGR 881 Query: 232 LIFADIKYNTQFSLNYQLKESF----------INPVTNDTIFIHYIGPTKPWHDWAWDYP 281 +++ +N ++ + +E + +H+ G KPW + + D Sbjct: 882 VLYLPADWNVTHNIFGRAEELYPMAPNSVFDDYQAARRAPKIVHFAGAIKPWQNASCD-- 939 Query: 282 VSQAFMEAKNASPWKNTALLKPNNSNQ 308 ++ F + +P+ + S + Sbjct: 940 MASYFWKYARNTPFYEVIIQDMVPSAR 966 >UniRef50_A4VVV8 Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases n=7 Tax=Firmicutes RepID=A4VVV8_STRSY Length = 334 Score = 203 bits (516), Expect = 9e-51, Method: Composition-based stats. Identities = 50/292 (17%), Positives = 103/292 (35%), Gaps = 25/292 (8%) Query: 24 TENLCLDIAYGTDKNFLFGCGISIASILKY-NEGSRLCFHIFTDYFGDDDRKYFDALALQ 82 E ++I + + F+ + SI++ +E F++F+D +++ Sbjct: 1 MEEGNVNILFTLNDAFVPQVAACMGSIMRTLDEDDTCHFYLFSDGISQQNKENLHQFVTD 60 Query: 83 YKTRIKIYLING--DRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTI 140 ++ I + T W + R ++ + +++YLD D + I Sbjct: 61 GGNKLTIVELENLESYFDFEVDTNGWASVVLARLLVDKLLPEEVDRIIYLDGDTLVLENI 120 Query: 141 EPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVS 200 L + M + G+ Y N+G LLI+ +W ++ + Sbjct: 121 RELWEVDLEGKVLGMCPEPTAS-----SERREGLNLGTYTYHNAGVLLIDLKRWRSKSIG 175 Query: 201 ARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLN---YQLKESFINP- 256 E ++ DQD LN L +++ I YN + Y+ E P Sbjct: 176 TIIFDYYKEKN--GELFANDQDALNGALKEEIKTLSITYNYFNIFDVYPYRTLEKLSRPS 233 Query: 257 ----------VTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNT 298 + +H++G +PW + + ++ A N +PW+ T Sbjct: 234 TFISKEEFVKIRKQPRIVHFLGEERPWR-IGNKHRFREDYVSALNQTPWRGT 284 >UniRef50_A4UX76 Putative LPS biosynthesis protein n=2 Tax=Lactobacillaceae RepID=A4UX76_9LACO Length = 316 Score = 202 bits (515), Expect = 1e-50, Method: Composition-based stats. Identities = 70/284 (24%), Positives = 112/284 (39%), Gaps = 26/284 (9%) Query: 24 TENLCLDIAYGTDKNFLFGCGISIASILKYN-EGSRLCFHIFTDYFGDDDRKYFDALALQ 82 EN + I Y D N+ +S+AS++ + + D D++ A Sbjct: 1 MENQTVPIFYAVDDNYAPYLAVSLASLVAHTSPDRHYQVIVLCDDLNTDNQGRLKAF-ET 59 Query: 83 YKTRIKIYLINGDRLRSLPSTKN-------WTHAIYFRFVIADYFINKAPKVLYLDADII 135 +I+ IN DRL+ + KN +T IYFR IA+ F K K LYLDAD + Sbjct: 60 DNLKIQFVSIN-DRLKQEITDKNNKLRSDYFTFTIYFRLFIAELFP-KLDKALYLDADTV 117 Query: 136 CQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIA-KGYFNSGFLLINTAQW 194 + L + D+ V V E + GI + Y SG LL+N A+ Sbjct: 118 VLKDVGELFDTQLGDNLVGAVPDPFVGHTPETIDYVEQAVGIDSQKYVCSGVLLMNLAEM 177 Query: 195 AAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFI 254 + + + +LN+ K PDQD +N + +++ + + ++ Q + Sbjct: 178 RRLKFAEHFLQLLNKYHF--KCLAPDQDYMNAIARNRIYYLNPSWHIQIT---------- 225 Query: 255 NPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNT 298 P D IHY KPW D P F + ++ Sbjct: 226 TPQDVDPWLIHYNLFAKPWRY--DDAPRQSYFWTYAKQTDYETM 267 >UniRef50_B7C7N8 Putative uncharacterized protein n=2 Tax=Firmicutes RepID=B7C7N8_9FIRM Length = 416 Score = 201 bits (512), Expect = 3e-50, Method: Composition-based stats. Identities = 63/279 (22%), Positives = 103/279 (36%), Gaps = 30/279 (10%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIY 90 I D +++ +I SI +N+ + F+I + + + + + I Sbjct: 22 IVLACDNSYMDKLETTIKSICAHNKN--IKFYILNEDLPIEWFRLMTKRLSYFNSEILNI 79 Query: 91 LINGDRLRSLPS-TKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFP 149 ++GD + +++ + YFR++I DY + KVLYLD DII +++ L N Sbjct: 80 KVSGDSFKKFRCPSEHINYQSYFRYLIPDYVSEE--KVLYLDCDIIVTESLDGLFNLDLK 137 Query: 150 DDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNE 209 + VA V FNSG LLIN W + + I + E Sbjct: 138 NYPVAAVPD----------------LPTTNDGFNSGVLLINNKYWRENDILNKLIKLTVE 181 Query: 210 PEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLK----ESFINPVTNDTIFIH 265 + DQ +LN+L DK + YN Q + Q + IH Sbjct: 182 YHEK---VYGDQGILNILFKDKWYRLPLTYNLQVGSDSQEHMIGNMEWYKLFDGIPKVIH 238 Query: 266 YIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPN 304 Y KPW + + + S W L +P Sbjct: 239 YTYTHKPWLMYNMTR-FKEVWWFYHGIS-WDKMILNEPR 275 >UniRef50_Q2K5X3 Galactosyltransferase protein n=13 Tax=Rhizobium RepID=Q2K5X3_RHIEC Length = 333 Score = 201 bits (511), Expect = 4e-50, Method: Composition-based stats. Identities = 58/309 (18%), Positives = 117/309 (37%), Gaps = 18/309 (5%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIY 90 + +D N L ++ S+ + + + F + + + + I++ Sbjct: 38 VIVCSDVNMLPAACCTLLSVKRNLTNADVEFLLLGIDLKPHEVAEVENFGRLHGMAIRVL 97 Query: 91 LINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPD 150 L + W+ A R + + ++LYLDAD++ ++ L F Sbjct: 98 PYETPD-TGLQARGRWSAATLARLYMDRDIPDHIERLLYLDADVLAVAPVDELFTLDFQG 156 Query: 151 DKVAMVVTEGQADWWEKRAHSLGVAGIAKG--YFNSGFLLINTAQWAAQQVSARAIAMLN 208 +A V + + EK G+ +G YFN+G LL + + A+ + R + Sbjct: 157 KALAAV-DDYVMAFPEKSGARQRKIGMGEGGRYFNAGVLLFDWSACRARGLFPRTREIFK 215 Query: 209 EPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIG 268 E + + DQD LN+ + D ++NTQ L P + H+ G Sbjct: 216 ERSHL--FENNDQDALNVTFDGDWLVLDPRWNTQTGL---------LPFVDRPAIFHFTG 264 Query: 269 PTKPWH-DWAWDYPV-SQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKKHRYLKGF 326 KPW + W + + + + +PW + +P+ ++++ H+ K+ L Sbjct: 265 RKKPWQANVPWVHRRMANRYADDLRNTPWAS-FCRQPSRTDRVAGFLSHVGKQIGGLTRL 323 Query: 327 SNYLFYFIE 335 + YF Sbjct: 324 ARMRAYFSN 332 >UniRef50_UPI0001AEC697 glycosyl transferase family protein n=1 Tax=Alteromonas macleodii ATCC 27126 RepID=UPI0001AEC697 Length = 361 Score = 200 bits (508), Expect = 8e-50, Method: Composition-based stats. Identities = 64/276 (23%), Positives = 116/276 (42%), Gaps = 24/276 (8%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 + +A+ D F +S+ SIL + S + + F + R+ L L+ Sbjct: 2 ISVAFCIDDKFAPYAAVSVISILSNTK-SFVNIY-FIGNLSEGVREKL--LTLKNDRSAM 57 Query: 89 IYLINGDRLRSLPSTKNW----THAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLI 144 +++ + L ++P + + + R+ IA+ K KV+YLDAD++ G I+ L Sbjct: 58 VFVAHNLPLSTMPLSDRYVERLNKITFVRYAIAEVL-TKLDKVIYLDADVLVCGDIKRLW 116 Query: 145 NFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAI 204 V V+ SL +K YFN+G LL++ W +++ Sbjct: 117 EQPLKKSYVGAVLDHSLMSQKRHITLSLK----SKSYFNAGVLLVDLKIWRDRRIFQY-- 170 Query: 205 AMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFI 264 + ++ + DQDVLN++L +K+ + N Q Y LK + + + Sbjct: 171 -LSRTHNTRERWEYNDQDVLNVVLDEKVQYLGADMNVQ---TYSLKHI----NIKEPLIV 222 Query: 265 HYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTAL 300 H+ G KPWH + +P + + P+KN L Sbjct: 223 HFTGQEKPWHTSSV-HPYKDQYRVLLESVPFKNNKL 257 >UniRef50_Q3D427 Glycosyl transferase, family 8 n=8 Tax=Streptococcus agalactiae RepID=Q3D427_STRAG Length = 413 Score = 200 bits (508), Expect = 8e-50, Method: Composition-based stats. Identities = 71/298 (23%), Positives = 106/298 (35%), Gaps = 24/298 (8%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIY 90 IA D + I SI +N+ + F+I D F + + + + I Sbjct: 7 IALAADFGYQEQVKTIIKSICFHNQ--FIDFYILNDDFPVEWFQMMEYHLSKMDCTISNT 64 Query: 91 LINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPD 150 I + ++ K + YFR+ I + KVLYLD D+I + + Sbjct: 65 KIFNEEIKHFKFQKPMPYPTYFRYFIPEVI--HEDKVLYLDCDMIITSDLTSIFTLDISK 122 Query: 151 DKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEP 210 VA V + L + YFNSG LLIN W Q +S R + E Sbjct: 123 YGVAAVRDD-----------LLEEYDGKEDYFNSGLLLINNIFWREQGISQRLLDYTREN 171 Query: 211 EIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLN-----YQLKESFINPVTNDTIFIH 265 + + + DQDVLN +L D + D YN + +Q E +N + IH Sbjct: 172 Q--GALQYHDQDVLNDVLCDNWLELDETYNYHTGADMLYNLFQQSERQLNRRKDLPKVIH 229 Query: 266 YIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKKHRYL 323 Y TKPW + E W++ N K + HR Sbjct: 230 YT-ATKPWKYLETSVRWRDIWWEYNRLE-WRDIFTRWQVNDGVDVSLKKVLQPIHRAF 285 >UniRef50_C9RWX3 Glycosyl transferase family 8 n=8 Tax=Bacillaceae RepID=C9RWX3_GEOSY Length = 276 Score = 199 bits (506), Expect = 1e-49, Method: Composition-based stats. Identities = 55/267 (20%), Positives = 100/267 (37%), Gaps = 10/267 (3%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIY 90 + TD N+L + + S+ N F++ +++ + + + Sbjct: 4 VLVTTDANYLPPLRVLMHSLFCNNRR-PFTFYLLYSRIAEEEIQALGEFVRRQGHELVPI 62 Query: 91 LINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPD 150 ++ P +++T +Y+R + +VLYLD DI+ ++ L + F Sbjct: 63 YVDPQLFHDAPVFRHYTVEMYYRLAAHLFLPPDVDRVLYLDPDIVAINPMDELYDMDFEG 122 Query: 151 DKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEP 210 + AKGYFN+G +++N A A + + Sbjct: 123 NLFIAAEHTHSTKVANLFNKLRLKTPNAKGYFNTGVMMMNIAMMREHVRLADIYQFIRDN 182 Query: 211 EIIKKITHPDQDVLNMLLADKLIFADI-KYNTQFSLNYQL-----KESFINPVTNDTIFI 264 K+ PDQDVLN L DK+ D +YN L + + + +T+FI Sbjct: 183 RF--KLVLPDQDVLNGLYWDKIKPVDCYRYNYDARYYDFLQLLPNPKHDLAWIEENTVFI 240 Query: 265 HYIGPTKPWHDWAWDYPVSQAFMEAKN 291 HY G KPW D + + + + Sbjct: 241 HYCGKEKPWKD-NYKGELGRFYKRYSQ 266 >UniRef50_D1JY84 Putative uncharacterized protein n=1 Tax=Bacteroides sp. 3_1_33FAA RepID=D1JY84_9BACE Length = 312 Score = 198 bits (504), Expect = 2e-49, Method: Composition-based stats. Identities = 58/269 (21%), Positives = 101/269 (37%), Gaps = 13/269 (4%) Query: 28 CLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRI 87 + IA+ + ++ +SI +L+ N L HI +DY D + L Y I Sbjct: 5 PMHIAFCVNDHYAEYILVSIKGLLENNSD-PLVIHILSDYISDKNTNRLKKLVGLYPNAI 63 Query: 88 KIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 +I D L+ WT ++R ++ + +VLYLDAD + IE L + Sbjct: 64 LDIVI-VDDLKLKDLKDTWTIYTWYRVLLPEILDASVHRVLYLDADTLVSENIEELFSLD 122 Query: 148 FPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAML 207 +A V D K + K Y +G +++N W ++ + I Sbjct: 123 MTGKAIAGTVDFQSKD---KSTYQRCGYEAEKEYVCAGVMMMNLDYWREHDIANKIIDWG 179 Query: 208 NEPEIIKKITHPDQDVLNMLLADKLIFADIKYN-----TQFSLNYQLKESFINPVTNDTI 262 + +I +PDQD +N + D + +KY+ Q +Q + Sbjct: 180 RDYN--DRIQYPDQDAINYICRDMKLLLPLKYDIIDGFFQDDYYFQNYPQELRECIESPA 237 Query: 263 FIHYIGPTKPWHDWAWDYPVSQAFMEAKN 291 IHY G PW ++ + + Sbjct: 238 IIHYAGQA-PWVVEISNHLLQDEWERYNK 265 >UniRef50_A3V3C9 Putative uncharacterized protein n=1 Tax=Loktanella vestfoldensis SKA53 RepID=A3V3C9_9RHOB Length = 324 Score = 198 bits (503), Expect = 3e-49, Method: Composition-based stats. Identities = 66/327 (20%), Positives = 123/327 (37%), Gaps = 29/327 (8%) Query: 15 VIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRK 74 I+ + + + + D+++L ++I ++L+ N I Sbjct: 2 TIEIKAENRPQKFRQSVIFCADQSYLPFASLAIHTLLRNNPVRDYDICI-------ASVD 54 Query: 75 YFDALALQYKTRIKIYLIN-GDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDAD 133 I+ I+ G+ +P +K ++ A Y R + + F + ++ YLDAD Sbjct: 55 ALVPPTELKDHDIRFCQIDVGNAFDGMPVSKRFSLAAYLRIALPEAFAGQYDRIFYLDAD 114 Query: 134 IICQGT-IEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTA 192 + G I+ + V V + K G+ YFNSG +L + Sbjct: 115 VFVVGDAIDAVFRLDMLSCPVGAVTDITKLKHPNKPTFDQKALGVDGPYFNSGVMLFDVE 174 Query: 193 QWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKES 252 ++ +V R + + + DQ +LN++L + ++ +N Q+ + L E Sbjct: 175 RFITMRVRERCAEAAKFYQ--GEPIYFDQTLLNIVLQKEWAQLNLGWNWQWPFSRSLFEC 232 Query: 253 FINPVTNDTIFIHYIGPTKPWHDWAWDYP----------VSQAFMEAKNASPWKNTALLK 302 FI D +H+IG KPW D P + + E P + AL Sbjct: 233 FI-----DVQIVHFIGDDKPWSDHKRRLPLKYRETARRFFQKFYPELAQKIPAADAALR- 286 Query: 303 PNNSNQLRYSAKHMLKKHRYLKGFSNY 329 N Y +H+ K H + K F+ + Sbjct: 287 --NGALYHYFFRHITKIHLFTKCFNRH 311 >UniRef50_UPI0001A45357 glycosyl transferase family protein n=1 Tax=Neisseria subflava NJ9703 RepID=UPI0001A45357 Length = 264 Score = 197 bits (502), Expect = 4e-49, Method: Composition-based stats. Identities = 57/267 (21%), Positives = 97/267 (36%), Gaps = 18/267 (6%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 + I D + + S+ +N G + F++ + F + Y + +R+ Sbjct: 4 ITIVLAADTGYAEQVHTLMKSVCTHNTG--VNFYLMHNTFRKEWINYTNQKLAASGSRLN 61 Query: 89 IYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSF 148 I D + + A +FR ++ + + LYLD+D++ ++ L N Sbjct: 62 DVKIEMD-FSQYRRLSHISDAAFFRLMM-QHLP--VDRALYLDSDMVVTQSLHDLFNLDM 117 Query: 149 PDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLN 208 VA V A L YFNSG LL + QW ++ + + Sbjct: 118 RGYPVAAVQDSYLARTDWNHPTGLHTT----PYFNSGMLLADLGQWRKHNIAEQLLQ--T 171 Query: 209 EPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQ-----FSLNYQLKESFINPVTNDTIF 263 I K + + DQ LN + + + + +N Q F Y L E F P T Sbjct: 172 AATIDKTVPYGDQCFLNTVFQENWLQLEESWNYQTGARRFFQTYDLDEMFPLPDTTPP-I 230 Query: 264 IHYIGPTKPWHDWAWDYPVSQAFMEAK 290 IHY KPW P + + + Sbjct: 231 IHYTTLAKPWLCDYGKIPFEEIYWQYY 257 >UniRef50_D2LIH7 Glycosyl transferase family 8 n=1 Tax=Rhodomicrobium vannielii ATCC 17100 RepID=D2LIH7_RHOVA Length = 391 Score = 197 bits (501), Expect = 5e-49, Method: Composition-based stats. Identities = 69/336 (20%), Positives = 129/336 (38%), Gaps = 35/336 (10%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSR-LCFHIFTDYFGDDDRKYFDALALQYKTRI 87 + + ++ ++ G IASI ++ +R +F D +DR + + + Sbjct: 44 VPVVMCFNRRYMPGGAALIASIAEHASPNRLYDLIVFADDLASEDRDMLRNVCDKPNISL 103 Query: 88 KIYLIN-GDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINF 146 + + ++ + + ++ ++R I D + KV+Y+DAD I + L + Sbjct: 104 RFFDVSRCFDGINFITHFHFRKENFYRLKIPD-LMRDFDKVVYIDADTITNRDLADLYDI 162 Query: 147 SFPDDKVAMVVTEGQADWWEKR---------------AHSLGVAGIAKGYFNSGFLLINT 191 +A V K+ LG+ GI+ YFNSG +L N Sbjct: 163 DVDGYYIAAVRDFAMIATQNKKMLDIVGKKIYYETYVKDYLGLIGIS-NYFNSGLVLFNI 221 Query: 192 AQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLN--YQL 249 + Q+S R IA++ K + DQD+LN++ +K+ D +N Y L Sbjct: 222 NKINGSQISERLIALIGT----KLFAYVDQDILNIVFENKVKLIDYSWNMVIDCERLYHL 277 Query: 250 KESFINPVTND----TIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALL---- 301 E + D +HYIG KPW+D +++ + +P L Sbjct: 278 SEPDLYARYLDAGAAPHVVHYIGGNKPWNDPTVH--MAEYYWRYAAKTPLYEKLLREIRE 335 Query: 302 KPNNSNQLRYSAKHMLKKHRYLKGFSNYLFYFIEKI 337 + NS + M R ++ + + Y + I Sbjct: 336 RRENSGASSQPERKMHPGLRSIRSSAQIIGYMLFPI 371 >UniRef50_A5LNA9 Glycosyl transferase, family 8 n=2 Tax=Streptococcus pneumoniae RepID=A5LNA9_STRPN Length = 402 Score = 197 bits (500), Expect = 6e-49, Method: Composition-based stats. Identities = 67/317 (21%), Positives = 116/317 (36%), Gaps = 36/317 (11%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIY 90 I G D N+ +I SI +N L F+IF + + + + I Sbjct: 7 IVLGADNNYRDKLETTIKSICYHNRD--LKFYIFNEDIPKEWFYLMEKRLEKLNCEILNI 64 Query: 91 LINGDRLRSLPSTK-NWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFP 149 I+ ++++ + + + YFR+ IA++ K + +YLD D++ G I PL F Sbjct: 65 EIDAEKVKYFSTPDEHIKYMTYFRYFIAEFV--KEDRAVYLDCDMVIHGNINPLFQKDFE 122 Query: 150 DDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNE 209 + + V G K FN+G +++N +W + + + E Sbjct: 123 GNYIIAVPD-----------------GWYKNIFNAGMMMVNVHKWKTDNICQNLLELTAE 165 Query: 210 PEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNY-----QLKESFINPVTND--TI 262 + DQ VLN+L +K YN L+ Q E F+N + Sbjct: 166 KHQE---IYGDQGVLNLLFENKWKKVSPHYNFMVGLDTLGYWAQKPEWFLNSWDENYKPA 222 Query: 263 FIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKKHRY 322 IH+ G KPW+D + + + N W+ N A L Sbjct: 223 IIHFEGKDKPWND-SLKTRYRELWW-FYNGLDWQTILSQVDNKPTTFSEIATVSLFHTAI 280 Query: 323 LKG--FSNYLFYFIEKI 337 ++ Y +EK+ Sbjct: 281 FTDTHELEHIEYLVEKL 297 >UniRef50_B2ISC6 Glycosyl transferase, family 2/glycosyl transferase family 8 n=8 Tax=Streptococcus pneumoniae RepID=B2ISC6_STRPS Length = 696 Score = 197 bits (500), Expect = 7e-49, Method: Composition-based stats. Identities = 61/298 (20%), Positives = 112/298 (37%), Gaps = 27/298 (9%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIY 90 I + ++ +I SI +N + F++ F ++ K + ++ + I Sbjct: 305 IVLAANYAYVDQVLTTIKSICYHNR--SIRFYLIHSDFPNEWIKQLNKRLEKFDSEIINC 362 Query: 91 LINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPD 150 + +++ S ++ ++ R+ IAD+ + K LYLD D++ ++ L D Sbjct: 363 RVTSEQISCYKSD--ISYTVFLRYFIADFV--QEDKALYLDCDLVVTKNLDDLFATDLQD 418 Query: 151 DKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEP 210 ++A V G G A + FN+G LL+N A W + + + I + NE Sbjct: 419 YRLAAVRDFG------------GRAYFGQEIFNAGVLLVNNAFWKKENMIQKLIDVTNEW 466 Query: 211 EIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIGPT 270 K+ DQ +LNML K + D YN ++ Q + + + IHY+ Sbjct: 467 H--DKVDQADQSILNMLFEHKWLELDFDYNHIV-IHKQFADYQLPEGQDYPAIIHYLSHR 523 Query: 271 KPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKKHRYLKGFSN 328 KPW D A + + W N + H+ Sbjct: 524 KPWKDLAAQ--TYREVWWYYHGLEWTELG----QNHHLHPLQRSHIYPIKEPFTCLIY 575 >UniRef50_D2QX94 Glycosyl transferase family 8 n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2QX94_9PLAN Length = 362 Score = 196 bits (498), Expect = 1e-48, Method: Composition-based stats. Identities = 54/309 (17%), Positives = 109/309 (35%), Gaps = 31/309 (10%) Query: 20 HKVETENLCLDIAYGTDKNFLFGCGISIASILKY-NEGSRLCFHIFTDYFGDDDRKYFDA 78 H + + + +D NF G + S L S + + D+++ Sbjct: 6 HPTQNMPTSIQLVTSSDNNFAIGLAGTFKSALTNLAADSSVDLWVLDGGITDENKAEISR 65 Query: 79 LALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQG 138 + + ++ + + + T A Y+R + + K +YLD+D++ +G Sbjct: 66 HLSDPRLTLHFVSVDRKLVSQFVISHHVTDATYYRLLTPEILSRDIGKFIYLDSDLLIRG 125 Query: 139 TIEPLINFSFPDDKVAMVVTEGQA-------------------DWWEKRAHSLGVAGIAK 179 + L N F + G + + Sbjct: 126 DLTKLWNTPFDGAPCVAIQDSGAPFVDSTQLIEQQPSLRGCIANANPIPNYRELGLHPHA 185 Query: 180 GYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKY 239 Y N G ++I+ W +Q++ R + +L++ + +T+ DQ LN++L+ + AD ++ Sbjct: 186 PYLNGGVMMIDLDLWRREQLAERMLKVLSDYR--EHVTYWDQYALNVVLSQRWKQADHRW 243 Query: 240 NTQFSL-------NYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNA 292 N N + + ND H+ KPW +P S+ F + Sbjct: 244 NQIAYPLRFSSHENTIFSKEAFDLYRNDPYISHFT-YRKPWQAE-CIHPRSEEFYQYLEG 301 Query: 293 SPWKNTALL 301 S W NT + Sbjct: 302 SIWANTKPV 310 >UniRef50_Q5WI33 Lipopolysaccharide glycosyltransferase n=6 Tax=Firmicutes RepID=Q5WI33_BACSK Length = 274 Score = 196 bits (498), Expect = 1e-48, Method: Composition-based stats. Identities = 52/270 (19%), Positives = 104/270 (38%), Gaps = 11/270 (4%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 ++I + ++L + + S+ N ++ + + + + Sbjct: 1 MNILVTLNAHYLKPLQVMLTSLFMNNAHEDFTIYLIHSSIPEKQLQLLEQFVCHQGHSLV 60 Query: 89 IYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSF 148 I + + P K+++ +Y+R + + + ++LYLD DI+ I PL + Sbjct: 61 IVETDKTLFANAPVVKHYSSEMYYRLLAYRFLPTELDRILYLDPDILVLNPIRPLYEANI 120 Query: 149 PDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLN 208 A + E L + Y+NSG LL+N A+ A + Sbjct: 121 DSYLYAAAQ-HSFINIQEINKFRLNAYEM-DAYYNSGVLLMNLAKQRETMDINDIFAYVE 178 Query: 209 EPEIIKKITHPDQDVLNMLLADKLIFADIK---YNTQFSLNYQLKES---FINPVTNDTI 262 ++ PDQDVLN L + ++ D + Y+ ++ Y+LK I+ V T+ Sbjct: 179 TYR--NRLVLPDQDVLNALYSPQIKNVDERLYNYDARYYRYYKLKSGGRFDIDAVLQQTV 236 Query: 263 FIHYIGPTKPWHDWAWDYPVSQAFMEAKNA 292 +H+ G KPWH ++ + + Sbjct: 237 ILHFCGKKKPWHK-NYNGKFHSLYKHYEKQ 265 >UniRef50_A3CM53 Glycosyltransferase, putative n=9 Tax=Bacteria RepID=A3CM53_STRSV Length = 1074 Score = 194 bits (493), Expect = 5e-48, Method: Composition-based stats. Identities = 60/274 (21%), Positives = 106/274 (38%), Gaps = 36/274 (13%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIY 90 I D+ + +I SIL YN+ ++ ++F D+ + F+ L Q + + Sbjct: 4 IVLVGDQAYQEQVSTTIKSILYYNKNVKI--YVFNQGLSDEWFRDFNELVEQLDSELVNI 61 Query: 91 LINGDRLR-SLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFP 149 ++ + + + + A Y R+ I + +VLYLD+D++ ++PL + Sbjct: 62 SLDQVTISPEWLTQDHISSATYARYFIPQFVAE--GRVLYLDSDLVVNRDLQPLFDIPLE 119 Query: 150 DDKVAMVVTEGQADWWEKRAHSLGVAGIAKGY-FNSGFLLINTAQWAAQQVSARAIAMLN 208 VA V G GY FN+G LLI+ W +++ I + Sbjct: 120 GKLVAAVGDAG-------------------GYGFNAGVLLIDNRSWKERELQESFIKETD 160 Query: 209 E-----PEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLN----YQLKESFINPVTN 259 + + DQ VLN +LA + D YN Q + Y + Sbjct: 161 RIMGLVQSGQMEDFNGDQTVLNHVLAQDWLPLDKIYNLQVGHDLVAFYSGWNGHFE-LDQ 219 Query: 260 DTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNAS 293 + + IHY KPW+ Y Q + + + S Sbjct: 220 EPLIIHYTTFRKPWNSE-VSYRYRQLWWDFQALS 252 Score = 187 bits (475), Expect = 5e-46, Method: Composition-based stats. Identities = 47/268 (17%), Positives = 95/268 (35%), Gaps = 24/268 (8%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIY 90 + + + +I SI+ +N + F++ F + + +I Sbjct: 409 VVLAANAAYSEQVLTTIKSIVCHNR--FIKFYVINSDFPTEWFVSMRKKLAKLDCQIVNA 466 Query: 91 LINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPD 150 ++G + + +++ ++ R+ A + + + LYLD DI+ + + Sbjct: 467 RVDGSHISQYKTNIHYS--VFLRYFTATFV--EEDQALYLDCDIVVTRDLSEIFAVDLGS 522 Query: 151 DKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEP 210 + V G G + FNSG LLIN W ++ + I M + Sbjct: 523 YPLGAVRDLG------------GEVYFGEQIFNSGVLLINVNYWRENDIAGQLIEMTDN- 569 Query: 211 EIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIGPT 270 + K+T DQ +LNML ++ + YN + + IHY+ Sbjct: 570 -LHDKVTQDDQSILNMLFENRWMELPFAYNCIT--LHTTFSDYEPEKGLYPPVIHYLTER 626 Query: 271 KPWHDWAWDYPVSQAFMEAKNASPWKNT 298 KPW ++ + + W + Sbjct: 627 KPWKEYTQ--SIYREVWWFYQGLDWSDM 652 >UniRef50_Q3D426 Glycosyl transferase, family 8 n=7 Tax=Streptococcus agalactiae RepID=Q3D426_STRAG Length = 401 Score = 193 bits (491), Expect = 8e-48, Method: Composition-based stats. Identities = 60/284 (21%), Positives = 117/284 (41%), Gaps = 23/284 (8%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIY 90 I G D + +I SI+ +N+ L +I F + + Q+ R+K Sbjct: 5 IVLGADFQYRDQVMTTIKSIVSHNQ--HLTIYIINTDFPVEWFNILNHSLEQFDCRVKNI 62 Query: 91 LINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPD 150 I+ D +P+ + + A +FR+ I + + VLYLD+D+I +G+++PL + + + Sbjct: 63 PISSDVFEGIPTLSHISVAGFFRWFIPIHLEEEI--VLYLDSDVIVRGSLDPLFDINLEE 120 Query: 151 DKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEP 210 + + V L A FNSG +LIN + W +++ + + ++ Sbjct: 121 NLLGAVADHFST---------LYYGDTAPVSFNSGVMLINNSLWKKEEIYNSLMRIADKG 171 Query: 211 EIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLN-----YQLKESFINPVTNDTIFIH 265 + DQ+ LN+L ++ I +YN Q + Y + + + + +H Sbjct: 172 SAVGV---GDQEYLNILTQNRWIDIGKQYNVQIGQDVNINAYGRPDLYHFYDDCEPVIVH 228 Query: 266 YIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQL 309 Y KPW+ ++ + W + N N+L Sbjct: 229 YNSQDKPWNKYSQSR-YRSEWWYYFGLE-WSVIYAQQQKNLNRL 270 >UniRef50_Q4JZJ9 Putative glycosyl transferase n=8 Tax=Streptococcus pneumoniae RepID=Q4JZJ9_STRPN Length = 344 Score = 193 bits (490), Expect = 9e-48, Method: Composition-based stats. Identities = 72/340 (21%), Positives = 125/340 (36%), Gaps = 33/340 (9%) Query: 16 IDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKY 75 ++K + N ++I Y TD NF+ SI S+ N L I D D +++ Sbjct: 19 FISENKFRSRNF-MNIVYATDNNFVDVLSASIKSLYTTNSDLDLNLWIIADKVSDRNKEK 77 Query: 76 FDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADII 135 + L+ Q+ R +I I + + + + R + + KVLYLD+DII Sbjct: 78 INRLSKQFAQR-EINWIENVEIPFKLHLDRGSISSFSRLFLGSVLPSSMSKVLYLDSDII 136 Query: 136 CQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWA 195 ++ + + F + V ++ LG+ I K FN+G +LIN W Sbjct: 137 VMDSLRSIFDIDFKGKILYGVNDTFN----KEYKQVLGIP-IDKPMFNAGVMLINLELWR 191 Query: 196 AQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQF------------ 243 V R + ++ + I D VLN +L + +YN Sbjct: 192 NNNVEERFLQVIQK--FNGTILQGDLGVLNAVLYNSFGVLPPEYNYMTIFEDLTYEEMIV 249 Query: 244 ---SLNYQLKESFINPVTNDTIFIHYIG---PTKPWHDWAWDYPVSQAFMEAKNASPWKN 297 +NY KE N + H+ +PW + + + + F + +K Sbjct: 250 FKKPINYYSKEEIKN-ARERIVLRHFTTSFLSKRPWQE-SSEVTHVEIFKKYYRG-AYKQ 306 Query: 298 TALLKPNNSNQLRYSAKHM-LKKHRYLKGFSNYLFYFIEK 336 + K N + K M L +++ Y I K Sbjct: 307 ASPSK--LLNIYKILPKKMSLYLLGFIQSKVRPKLYRITK 344 >UniRef50_C1QEC6 LPS:glycosyltransferase n=1 Tax=Brachyspira murdochii DSM 12563 RepID=C1QEC6_9SPIR Length = 242 Score = 192 bits (489), Expect = 1e-47, Method: Composition-based stats. Identities = 65/256 (25%), Positives = 112/256 (43%), Gaps = 20/256 (7%) Query: 25 ENLCLDIAYGTDKNFLFGCGISIASILKYNEG-SRLCFHIFTDYFGDDDRKYFDALALQY 83 ++I + + + +I SILK ++ FH+ T+ D+++ + L Sbjct: 1 MQETMNICFTANDKYAPFMSATIVSILKNSKDDESFSFHVITNDISDENKMMIERLKEIK 60 Query: 84 KTRIKIYLINGDR----LRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGT 139 +IK Y N D+ + +++ +I+FR I + I KVLYLD DII + Sbjct: 61 TFKIKYYTPNIDKYNKWFEKINYQRHYAPSIFFRLDIPNLII-NIDKVLYLDCDIIVNSS 119 Query: 140 IEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQV 199 + L N + V G ++ +K +G+ K YFNSG LL+N + + + Sbjct: 120 LSELFNIDISEYFALAVEDTGDLNFLKKYKTKIGIEDKHK-YFNSGVLLLNNKLYMEKNL 178 Query: 200 SARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTN 259 + + N+ I DQD+LN L DK+ F D K+N F + + Sbjct: 179 NLESENYFNKY--YNVIECVDQDILNYLFRDKIKFIDNKWN-----------DFSSKNID 225 Query: 260 DTIFIHYIGPTKPWHD 275 + +HY+G K W+ Sbjct: 226 KSAIMHYVGKIKSWNK 241 >UniRef50_C5S494 Putative glycosyl transferase n=2 Tax=Actinobacillus minor RepID=C5S494_9PAST Length = 287 Score = 192 bits (488), Expect = 2e-47, Method: Composition-based stats. Identities = 59/285 (20%), Positives = 101/285 (35%), Gaps = 25/285 (8%) Query: 22 VETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALAL 81 + + ++IA D+N+ I S+ +++ R F++ + D+ + Sbjct: 1 MTNKQQTINIALAADRNYAEQVITLIKSVCYHHKNVR--FYLIHQDYPDEWFMALNQHLT 58 Query: 82 QYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIE 141 I + ++ T A ++R++I + +V+YLD+DI+ G IE Sbjct: 59 NVGAEIIPVTVLDSFRFLSKLQEHITQATFYRYIIPEI---PEDRVIYLDSDIVVDGNIE 115 Query: 142 PLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSA 201 + F V V + H K YFN G LLIN W ++ Sbjct: 116 EMYFSDFNGKYVLAVEDMYISYT----EHGYIEFPDLKPYFNGGVLLINNQLWKENDLAE 171 Query: 202 RAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFIN------ 255 I M + + DQD+LN +L DK YN Q + + N Sbjct: 172 YLIQMTKQY---PNVMFGDQDILNFVLKDKWGILSHVYNYQTGIIHAFPRLEENMSDEEI 228 Query: 256 -------PVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNAS 293 I IHY KPW + + + + + S Sbjct: 229 ITKYQKQADEVKPIIIHYTTKYKPWLNSKYFVLLREKYWFYYQLS 273 >UniRef50_B1I7N1 Glycosyl transferase, family 8 n=8 Tax=Streptococcus pneumoniae RepID=B1I7N1_STRPI Length = 817 Score = 191 bits (486), Expect = 3e-47, Method: Composition-based stats. Identities = 57/271 (21%), Positives = 101/271 (37%), Gaps = 34/271 (12%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRI-KI 89 I D+N++ +I SIL +N ++ +I D + +A + I + Sbjct: 5 IVLAGDRNYIRQLETTIKSILYHNRDVKI--YILNQDIMPDWFRKPRKIARMLGSEIIDV 62 Query: 90 YLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFP 149 L + + + Y R+ IADY + KVLYLD+D+I ++E L + Sbjct: 63 KLPEQTVFQDWEKQDHISSITYARYFIADYI--QEDKVLYLDSDLIVNTSLEKLFSICLE 120 Query: 150 DDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIA---- 205 + +A V FN+G LLIN +W +++ R I Sbjct: 121 EKSLAAVKDTDGIT------------------FNTGVLLINNKKWRQEKLKERLIEQSIV 162 Query: 206 -MLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLN----YQLKESFINPVTND 260 M E + + DQ + N +L D + YN Q + Y + + + Sbjct: 163 TMKEVEEGRFEHFNGDQTIFNQVLQDDWLELGRAYNLQVGHDIVALYNNWQEHL-AFNDK 221 Query: 261 TIFIHYIGPTKPWHDWAWDYPVSQAFMEAKN 291 + IH+ KPW + + + + Sbjct: 222 PVVIHFTTYRKPWTTLTANR-YRDLWWKFHD 251 >UniRef50_Q4HGS8 General stress protein A, putative n=2 Tax=Campylobacter RepID=Q4HGS8_CAMCO Length = 403 Score = 191 bits (486), Expect = 3e-47, Method: Composition-based stats. Identities = 70/317 (22%), Positives = 123/317 (38%), Gaps = 38/317 (11%) Query: 30 DIAYGTDKNFLFGCGISIASILKYN------EGSRLCFHIFTDYFGDDDRKYFD----AL 79 I + D+N++ + I SI+K + FHI +++ ++ R+ + L Sbjct: 3 HIIFSADENYIKYTSVLITSIIKNTNPKNHFQNRPYSFHILSNFVSEETREKLECLKKEL 62 Query: 80 ALQYKTRIKIYLINGDRLRSLPSTKNW--THAIYFRFVIADYFINKAPKVLYLDADIICQ 137 Y I I++++ DR + PS+ + Y+R F + K LYLD+D++C Sbjct: 63 NKIYPCEISIHIMSDDRFENFPSSGAAQNSKLPYYRLKFISLFDDNVDKCLYLDSDMLCM 122 Query: 138 GTIEPLINFSFPDDKVAMVVTEGQADWWEK--RAHSLGVAGIAKGYFNSGFLLINTAQWA 195 I + + +V G K ++ V + YFNSGFLLIN ++ Sbjct: 123 CDIREIFAIDLQGKIIGVVGDPGSKRSKIKFIENNTKKVLKFDENYFNSGFLLINAKEYK 182 Query: 196 AQQVSARAIAMLNEPEIIKKITHPDQDVLNMLL-ADKLIFADIKYNT-QFSLNYQLKESF 253 V + + + IK DQD+LN ++ DK++ YN +L Y + + Sbjct: 183 KANVEKKCEELAKKCIYIK---AADQDLLNAVISKDKILKLSFAYNFNIITLLYVICKDE 239 Query: 254 INPVTN-----------DTIFIHYIGPTKPWHDWAW-----DYPVSQAFMEAKNASP-WK 296 N + +HY KPW + +S + + P +K Sbjct: 240 KKNRLNYTREEFTQSAKNPKILHY--GEKPWKFLKSYVDLQNRNISDYWWDIAKEVPIFK 297 Query: 297 NTALLKPNNSNQLRYSA 313 L + N A Sbjct: 298 EELLRQKENIKDYLLYA 314 >UniRef50_B3WD33 WbbM protein n=8 Tax=Lactobacillus RepID=B3WD33_LACCB Length = 318 Score = 190 bits (483), Expect = 6e-47, Method: Composition-based stats. Identities = 59/282 (20%), Positives = 103/282 (36%), Gaps = 21/282 (7%) Query: 28 CLDIAYGTDKNFLFGCGISIASILKY-NEGSRLCFHIFTDYFGDDDRKYFDALALQYKTR 86 + I + D ++ +++ SI + + +I ++ ALA Sbjct: 6 TVPIFFSVDDGYVPCLAVALTSIRTNKDPQTDFVINILNSGLLQKNQTRLAALAAP-HFT 64 Query: 87 IKIYLINGDRLRSLPST-----KNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIE 141 I ++ + T T IYFR IAD F + K +Y+DAD + G + Sbjct: 65 INFIDMDAVTQQISGDTNKLRGDYVTLTIYFRLFIADMFP-QYDKAIYIDADTVADGDLA 123 Query: 142 PLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGI-AKGYFNSGFLLINTAQWAAQQVS 200 L D+ VA V + E + G+ Y NSG L++N AQ + S Sbjct: 124 ELFTTDLGDNLVAGVADPVMMTYPETIEYIQRDFGVQPGEYINSGVLILNLAQMRQEHFS 183 Query: 201 ARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTND 260 R + +L + DQD +N++ ++ + +N Q + + Sbjct: 184 DRFLHLLKTYHFT--MIAADQDYINVIAQHRIKYLPKTWNMQTGVP--------TAAESG 233 Query: 261 TIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLK 302 IHY KPWH ++ F AS ++ + Sbjct: 234 GKLIHYNLFGKPWHYRDAK--LAANFWHYAPASGFETDLNQQ 273 >UniRef50_C1CFZ1 Glycosyl transferase, family 8 n=17 Tax=Streptococcus pneumoniae RepID=C1CFZ1_STRZJ Length = 404 Score = 189 bits (481), Expect = 1e-46, Method: Composition-based stats. Identities = 60/282 (21%), Positives = 110/282 (39%), Gaps = 33/282 (11%) Query: 30 DIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDA-LALQYKTRIK 88 I + D +++ +I SI YN L F++F D + + L + Sbjct: 5 SIVFNADNDYVDKLETAIKSICCYNNC--LKFYVFNDDIASEWFLMMNKRLKTIQSEIVN 62 Query: 89 IYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSF 148 + +++ + KN ++A +FR+ I ++ + LYLD+DII G+++ L + Sbjct: 63 VKIVDHVLKKFHLPLKNLSYATFFRYFIPNFVKES--RALYLDSDIIVTGSLDYLFDIEL 120 Query: 149 PDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLN 208 +A V + FNSG LL+N W + ++ + + N Sbjct: 121 DGYALAAVEDS--------------FGDVPSTNFNSGMLLVNVDTWRDEDACSKLLELTN 166 Query: 209 EPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSL--------NYQLKESFINPVTND 260 + + + DQ +LNML D+ D +N + N++ E + Sbjct: 167 QYH---ETAYGDQGILNMLFHDRWKRLDRNFNFMVGMDSVAHIEGNHKWYEISELKNGDL 223 Query: 261 TIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLK 302 IHY G KPW + + + + N W + L K Sbjct: 224 PSVIHYTG-VKPWEIISNNR-FREVWW-FYNLLEWSDILLRK 262 >UniRef50_B2ISC2 Glycosyl transferase, family 8 n=2 Tax=Streptococcus pneumoniae RepID=B2ISC2_STRPS Length = 401 Score = 189 bits (481), Expect = 1e-46, Method: Composition-based stats. Identities = 58/289 (20%), Positives = 101/289 (34%), Gaps = 30/289 (10%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIY 90 I G D +++ +I SI N+ + F++F + + D + I Sbjct: 5 IVLGADNHYMDKVETTIKSICSKNKEVK--FYVFNSDLPTEWFQLMDKRLSVLGSEIVNV 62 Query: 91 LINGDRLRSLP-STKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFP 149 + + T + + A Y R+ I K + LYLD+DII + L F Sbjct: 63 KVTESLINQFHLPTPHLSSATYLRYFIPTIVFEK--RALYLDSDIIVTADLTSLFEFPLD 120 Query: 150 DDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNE 209 +A V + ++G FNSG LLI+T +W + + + + + Sbjct: 121 GCPLAAVPD---------------IPNTSEG-FNSGVLLIDTDRWREDDIQNQLLNLTIK 164 Query: 210 PEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINP----VTNDTIFIH 265 + + DQ++LNML D+ + YN Q + N IH Sbjct: 165 HH---EHVYGDQEILNMLFKDRWKKLSLSYNLQVGYDTYRHSLGDNEWYHLFEGIPNIIH 221 Query: 266 YIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAK 314 Y KPW + ++ + + W + L K Sbjct: 222 YTTQNKPWSHYRFNR-FRDIWWFYYGLN-WNDILLDNQILQENFEKLIK 268 >UniRef50_C1IBL0 Glycosyl transferase n=1 Tax=Clostridium sp. 7_2_43FAA RepID=C1IBL0_9CLOT Length = 273 Score = 188 bits (479), Expect = 2e-46, Method: Composition-based stats. Identities = 47/271 (17%), Positives = 102/271 (37%), Gaps = 11/271 (4%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 +D+ DKN++ + S++ N + + + + + + Sbjct: 6 IDLLVTFDKNYIPPFQTMLKSLVLNNPRETFHIWLLHSEIPLEMLQEVEEYCAKQGAAMT 65 Query: 89 IYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSF 148 + ++ P +K + +Y+R + K+LYLD DI+ +I PL Sbjct: 66 SINVERSVFKNAPVSKRYPQEMYYRLLAPLILPKSIKKILYLDPDILIINSIRPLWETEL 125 Query: 149 PDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLN 208 + A G + Y+NSG +L++ + + + Sbjct: 126 GNYIFAAASHVGVTGVINDINRV--RLRVDHDYYNSGVMLMDLTKARSIVNVEEIFQCVR 183 Query: 209 EPEIIKKITHPDQDVLNMLLADKLIFAD---IKYNTQFSLNYQLKES---FINPVTNDTI 262 E + +++ PDQD+ N L + + D Y+ + NY L+ ++ +T +T+ Sbjct: 184 EHK--EELLLPDQDIFNYLYGKQTLPLDDAIWNYDARKYSNYLLRSGGNYDMDWITRNTV 241 Query: 263 FIHYIGPTKPWHDWAWDYPVSQAFMEAKNAS 293 +H+ G +KPW + + + + S Sbjct: 242 VLHFCGKSKPWK-HSQNNRFAMLYKHYMQIS 271 >UniRef50_Q3DNA2 Glycosyl transferase, family 8 n=9 Tax=Streptococcus RepID=Q3DNA2_STRAG Length = 272 Score = 188 bits (479), Expect = 2e-46, Method: Composition-based stats. Identities = 58/270 (21%), Positives = 108/270 (40%), Gaps = 17/270 (6%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFD---ALALQYKT 85 +++ + D ++ + + S+++ + +L ++ + L + Y Sbjct: 1 MNLLFSIDDMYVDHFKVMLYSLVRQTKNRKLEIYVLQKTLLKRHTELIQYTQNLEVGYHP 60 Query: 86 RIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLIN 145 I + + P+T + IY+R + + ++LYLDAD++C L + Sbjct: 61 II----VGTEVFAQAPTTDRYPDTIYYRLLAHKFLPETLDRILYLDADMLCLNDFSSLYD 116 Query: 146 FSFPDDKVAMVVTEGQADWWEKRAH-SLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAI 204 D A + + L + YFN+G LL+N + Sbjct: 117 MELGDQLYAAASHNTDGKFLDYVNKLRLKNVELESSYFNTGVLLMNLPAIRKVVHQQTIL 176 Query: 205 AMLNEPEIIKKITHPDQDVLNMLLADKLIFAD---IKYNTQFSLNYQLK---ESFINPVT 258 + + ++ PDQD+LN L A+ + Y+ ++SL YQLK E + V Sbjct: 177 DYMMQNR--GRLILPDQDILNGLYANLVKPIPDEIYNYDARYSLIYQLKSRNEWDLEWVI 234 Query: 259 NDTIFIHYIGPTKPW-HDWAWDYPVSQAFM 287 N T+F+H+ G KPW D+ Y FM Sbjct: 235 NHTVFLHFAGRDKPWKKDYRGRYSGLYKFM 264 >UniRef50_B9BAZ6 Glycosyl transferase family 8 protein n=1 Tax=Burkholderia multivorans CGD1 RepID=B9BAZ6_9BURK Length = 617 Score = 188 bits (478), Expect = 2e-46, Method: Composition-based stats. Identities = 59/355 (16%), Positives = 119/355 (33%), Gaps = 39/355 (10%) Query: 12 LNSVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSR-LCFHIFTDYFGD 70 + + + I D NF+ IAS+ + R L + Sbjct: 267 VKGSHTHVPPEPLGGNAVSIVTVADGNFVPHLAAFIASVQDNIDPERVLDLIVLDGGIPA 326 Query: 71 DDRKYF-DALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLY 129 D ++ K R+ +P ++ A ++R + + +V+Y Sbjct: 327 DQQRLLMKQFHRNGKGRLSFIQ-CAHLFSDIPLHGPFSAATFYRLSMGELLAKHR-RVVY 384 Query: 130 LDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEK----------------RAHSLG 173 +D+D I G + L + ++ VA V + +G Sbjct: 385 VDSDTIVLGDLSELFDLDLGNNAVAAVPDVIMKSFVSSGVPALREAGGAPAGIYLKERVG 444 Query: 174 VAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLI 233 + YF +G ++I+ ++ ++ A L + ++ DQDVLN L + Sbjct: 445 MGNRGNEYFQAGLIVIDLDEFRRLRIGEDAYKDL----LARRYWFLDQDVLNKYLLGHVK 500 Query: 234 FADIKYNTQFSLNYQLK------ESFINPVTNDTIFIHYIGPT-KPWHDWAWDYPVSQAF 286 F D+ +N + L + + V +HY G KPW+ P++ + Sbjct: 501 FLDLSWNVVNASMDVLSGLETDIAAKVKEVFAAPSMVHYAGHEAKPWNRPTA--PLAHFY 558 Query: 287 MEAKNASPWK----NTALLKPNNSNQLRYS--AKHMLKKHRYLKGFSNYLFYFIE 335 + W + + P +L+ S K + R + GF +++ Sbjct: 559 WYYLRRTYWYESVIDRRPISPTLDVELQRSRLYKRLRAIWRRMPGFVQRRLFWLR 613 >UniRef50_C0BAU3 Putative uncharacterized protein n=1 Tax=Coprococcus comes ATCC 27758 RepID=C0BAU3_9FIRM Length = 348 Score = 188 bits (477), Expect = 3e-46, Method: Composition-based stats. Identities = 48/294 (16%), Positives = 93/294 (31%), Gaps = 29/294 (9%) Query: 31 IAYGTDKNFLFGCGISIASILKY-NEGSRLCFHIFTDYFGDDDRKYFD-ALALQYKTRIK 88 + ++ ++ + SI N+ I + L ++ Sbjct: 14 VVLSANEYYVPYLAAVLESIRANSNDDQNYDLIIMHRDISMGSQDRLKKQLEDHQNITLR 73 Query: 89 IYLIN--GDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINF 146 I + L ++ YFR ++ K +Y+D+D++ I L Sbjct: 74 FLDIRRYEKPFKKLFLRGHFALETYFRLLMPQIL-ADYDKAVYIDSDLVVNADIAELYAT 132 Query: 147 SFPDDKVAMVVT-------EGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQV 199 +A G +K ++ YF +G ++ N A++ Sbjct: 133 DVDGYLLAAAKDADTAGLYNGFEPNKKKYMDTILKIKKPYEYFQAGVIVFNLAEFRKTYT 192 Query: 200 SARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNY-----------Q 248 +A + E DQDVLN L ++ F D+ +N + Sbjct: 193 TAEMLKFAASYE----WELLDQDVLNYLAQGRVKFVDMAWNVMVDWRGIRLSQIIALAPK 248 Query: 249 LKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLK 302 + IHY GP KPWH D +++ F + + + T + + Sbjct: 249 YLHDEHMEARKNPKIIHYAGPDKPWHQPWSD--MAEEFWKYSRNTVFYETIMQR 300 >UniRef50_C6LDU2 Glycosyl transferase, family 8 n=3 Tax=Firmicutes RepID=C6LDU2_9FIRM Length = 270 Score = 187 bits (476), Expect = 5e-46, Method: Composition-based stats. Identities = 53/244 (21%), Positives = 95/244 (38%), Gaps = 11/244 (4%) Query: 40 LFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIYLINGDRLRS 99 + I SI+++ +I + D+ A TR+ + S Sbjct: 1 MEHVLDCIRSIVRFPSEDGYDIYILHSDLQEQDQSDAAAQVEDGDTRLHFRFVEPSVFAS 60 Query: 100 LPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTE 159 P ++ + IY+R A + ++LYLD D + ++ L N F + T Sbjct: 61 FPESERYPRLIYYRIFAASLLPPEMDRILYLDGDTLVINPLDELYNMDFEGNYFLAC-TH 119 Query: 160 GQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHP 219 + + + LG+ ++ Y NSG LL+N + +Q + + + +T P Sbjct: 120 VRKFLTKVNQYRLGMEEVS-TYINSGVLLMNLKELREKQDFEEIASFVEKRGRY--LTLP 176 Query: 220 DQDVLNMLLADKLIFAD-IKYNT------QFSLNYQLKESFINPVTNDTIFIHYIGPTKP 272 DQD++ L +K D +KYN ++ K + V + + IHY G KP Sbjct: 177 DQDIITALYGNKTGILDTMKYNLSDRMISVYNTEPGHKRINLEWVRENAVVIHYYGKQKP 236 Query: 273 WHDW 276 W Sbjct: 237 WKKP 240 >UniRef50_C7RG54 Glycosyl transferase family 8 n=1 Tax=Anaerococcus prevotii DSM 20548 RepID=C7RG54_ANAPD Length = 273 Score = 187 bits (475), Expect = 6e-46, Method: Composition-based stats. Identities = 49/268 (18%), Positives = 99/268 (36%), Gaps = 11/268 (4%) Query: 32 AYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIYL 91 D+N++ + + SI N G ++ +D K ++ + Sbjct: 6 LLTLDENYIPQMKVLMTSIYINNPGRIFDVYLIHSRISEDKLKDLGEDLKKFSYTLYPIR 65 Query: 92 INGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDD 151 D T + +Y+R + ++ ++LYLD D++ ++ L+ D Sbjct: 66 ATDDLFSFAKVTDRYPKEMYYRLLAGEFLPENLGEILYLDPDMLVINPLDDLLRTDISDY 125 Query: 152 KVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPE 211 +A G+ D G Y+NSG LLIN + + + + + Sbjct: 126 ILAAASHTGKTDMANNVNRI--RLGTDTDYYNSGLLLINLKRAREEIDPDEIFSFVEDNH 183 Query: 212 IIKKITHPDQDVLNMLLADKLIFAD---IKY---NTQFSLNYQLKESFINPVTNDTIFIH 265 + + PDQD+LN + D++ D Y N L K++ + + + T+ +H Sbjct: 184 M--NLLLPDQDILNAMYGDRIYPLDDLIYNYDARNYSSYLIRSKKQADLAWLMDHTVVLH 241 Query: 266 YIGPTKPWHDWAWDYPVSQAFMEAKNAS 293 + G KPW + + + + Sbjct: 242 FCGRDKPWKK-NHRNKFTSLYKHYMSLT 268 >UniRef50_C6X2V2 Putative glycosyltransferase n=1 Tax=Flavobacteriaceae bacterium 3519-10 RepID=C6X2V2_FLAB3 Length = 315 Score = 186 bits (474), Expect = 6e-46, Method: Composition-based stats. Identities = 59/294 (20%), Positives = 112/294 (38%), Gaps = 28/294 (9%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYN-EGSRLCFHIFTDYFGDDDRKYFDALAL-QYKTR 86 L I + D ++ + I+SI+ + ++ +I ++Y D+++ + + Sbjct: 9 LPIVFTCDDHYFKYAAVVISSIIHNSSRNTKYEINIVSEYISDENQSLAQKMVQSKSNIS 68 Query: 87 IKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINF 146 I+ + I + + + Y+RF I D +VLYLD+D+I I + Sbjct: 69 IQFHAIKIENPEVFHLNSYMSLSTYYRFFIFDLL-KDYDRVLYLDSDLIVDNDISFFADI 127 Query: 147 SFPDDKVAMVVTEGQADWWEKRAH---------SLGVAGIAKGYFNSGFLLINTAQWAAQ 197 F + + + + + YFN+G +L N AQ Sbjct: 128 DFENKPAICCPSIYVQNSLKNNTDHKFTREYFTQILKMSDVDEYFNAGVILFNIKLIRAQ 187 Query: 198 QVSARAIAMLNEPEIIKKITHPDQDVLNMLLADK--LIFADIKYNTQFSLNYQLKESFIN 255 + + + IK + DQD+LN +L + +YN ++ + LK F+N Sbjct: 188 GIDRKFFEAIKN---IKDPVYQDQDILNSVLRNNGGAKLISNEYNHTKTMKFSLKRIFLN 244 Query: 256 PVTND--------TIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALL 301 + N HY+G KPW ++ P S F+ +P+ L Sbjct: 245 ALKNKFGKKRNNWFTIYHYVGKVKPWQNFN---PDSALFLYYAYKTPFVREILK 295 >UniRef50_C0AA16 Lipopolysaccharide biosynthesis protein LPS:glycosyltransferase-like protein n=1 Tax=Opitutaceae bacterium TAV2 RepID=C0AA16_9BACT Length = 726 Score = 186 bits (473), Expect = 1e-45, Method: Composition-based stats. Identities = 62/299 (20%), Positives = 116/299 (38%), Gaps = 32/299 (10%) Query: 28 CLDIAYGTDKNFLFGCGISIASILKY-NEGSRLCFHIFTDYFGDDDRKYFDALALQYKTR 86 C++IA+ D F+ ++I SI+ + + I T+ + K+ D + Sbjct: 404 CINIAFNCDDKFVPYLCVAIKSIVATASTENNYDILILTEGLSPANLKWIDGIKHAKNVS 463 Query: 87 IKIYLI----NGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEP 142 +++ + + S + Y R + + K KVLYLD D+I Q + Sbjct: 464 LRVVNVRDYLQDKDISSFFMRSMVSRIAYVRLYLGELL-EKYAKVLYLDCDLIAQSDVAE 522 Query: 143 LINFSFPDDKVAMVVTEGQADWWEKRAHS-----------LGVAGIAKGYFNSGFLLINT 191 L N + + A V + K + LGV I++ YFNSG ++ + Sbjct: 523 LFNMNLDGNVCAAVPDLAISTETIKNVAAYRDIDVYLRDVLGVTDISQ-YFNSGVMVFDL 581 Query: 192 AQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKE 251 + + IA + DQ+VLN L K++ ++N + SL ++ Sbjct: 582 EKIRTDNLQQTFIAAAAKNTKF----FMDQNVLNSALYGKVLLLGFEWNKRVSLAMANRD 637 Query: 252 SFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTAL---LKPNNSN 307 + T ++ +H+ KP P + E P+ L +KP+++N Sbjct: 638 T-----TTESKILHFAAEPKPLQKIHM--PEHYNWWEYARQLPFYEELLSRVIKPSSTN 689 >UniRef50_B9QZ95 Glycosyl transferase family 8 n=1 Tax=Labrenzia alexandrii DFL-11 RepID=B9QZ95_9RHOB Length = 309 Score = 185 bits (471), Expect = 1e-45, Method: Composition-based stats. Identities = 65/320 (20%), Positives = 110/320 (34%), Gaps = 28/320 (8%) Query: 30 DIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYK--TRI 87 +IA D L G ++I S L+++ H+ D + D+ + + Sbjct: 4 NIAACADTKVLPGLAVTIRSSLEHSS-IPCRIHVLADRLSEQDKHKLSNSWKPHPMCQDV 62 Query: 88 KIYLINGDRLRSLPSTKNW-THAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINF 146 Y I+ + ST + + Y R+ I+D+ + K +YLD D++ + L Sbjct: 63 VFYDIDYQNISKFRSTMYLKSKSAYSRYFISDFLGEE-SKCIYLDCDLLVLRDLAELNTA 121 Query: 147 SFPDDKVAMVVTEG--QADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAI 204 + V AD L + YFNSG L+I+ +W I Sbjct: 122 KMHGKTIGSVRDISVRTADPHLFIGERLQLTN-PYDYFNSGVLIIDLDRWRKLDARNHLI 180 Query: 205 AMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFI 264 + E DQD LN+ F D +NT + P T + I Sbjct: 181 DLTLER--ADTFHSQDQDALNVFFDGDTEFLDPVWNTS---------QYERPDTAENRII 229 Query: 265 HYIGPTKPWH-----DWAWDYPVS---QAFMEAKNASPWKNTALLKPNNSNQLRYSAKHM 316 H IG KPWH + Y + F + + + P ++ + + Sbjct: 230 HLIGTVKPWHARYKEKLSDSYHRTEIWDRFYGVLDRTAYAGNRPWDPAGLGVVKETIESK 289 Query: 317 LKKHRYLKG-FSNYLFYFIE 335 + K + G L F+ Sbjct: 290 IPKMDMVTGKIRRTLQKFLN 309 >UniRef50_C3XGD2 Putative uncharacterized protein n=1 Tax=Helicobacter bilis ATCC 43879 RepID=C3XGD2_9HELI Length = 364 Score = 185 bits (471), Expect = 2e-45, Method: Composition-based stats. Identities = 73/321 (22%), Positives = 115/321 (35%), Gaps = 43/321 (13%) Query: 57 SRLCFHIFTDYFGDDDRKYFD----ALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYF 112 CFHI TD + R+ L Y ++Y ++ + LP N + YF Sbjct: 46 KPFCFHILTDGLKHETRQKLQAFQIELNKIYPCEFRVYTLSDSIFQGLPKLNN-NYLAYF 104 Query: 113 RFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMV-VTEGQADWWEKRAHS 171 R IA LYLD D+IC I + +V V + Q KR + Sbjct: 105 RLKIASCLPQDIKTCLYLDVDMICVADIREIFYTDLQGKICGVVLVPDHQQYCVLKRNSA 164 Query: 172 LGVAGI--AKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLA 229 +G + A YFNSG +LI+ Q+ V + + + + DQD LN +L Sbjct: 165 IGDEFVFNASTYFNSGLMLIDVEQYRKYNVEQKCLEWFEQYVPV----LLDQDALNAVLG 220 Query: 230 DKLIFADIKYNTQFSLNYQLKESFINP---------------VTNDTIFIHYIGPT-KPW 273 D + +++N L ++ F V N+ +HY G T KPW Sbjct: 221 DHICALPLEWNFFVELLKYKRQDFKGKDNNIVMKITYEEYMQVKNNMKILHYTGWTLKPW 280 Query: 274 HDWAWDYP------VSQAFMEAKNASP--WKNTALLKPNNSNQLRYSA-----KHM--LK 318 + + E + +P +K+ + Y + KH+ K Sbjct: 281 QQPYIENDMIKTCIYKNKWWEIAHDTPVFYKDIYASYMKKQEDMLYESILSLQKHIKSFK 340 Query: 319 KHRYLKGFSNYLFYFIEKIKH 339 LK L +K+ H Sbjct: 341 LRNRLKRLQQSLKRRCKKLFH 361 >UniRef50_Q3DM64 Glycosyl transferase, family 8, degenerate n=6 Tax=Streptococcus agalactiae RepID=Q3DM64_STRAG Length = 394 Score = 185 bits (470), Expect = 2e-45, Method: Composition-based stats. Identities = 58/272 (21%), Positives = 99/272 (36%), Gaps = 31/272 (11%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDA-LALQYKTRIKI 89 I D L I SIL +N+ R+ +I + + L + I Sbjct: 6 ICLAGDNKSLNQIQTVIKSILCHND--RVSIYILNQDIASEWFRNIQRRLLNSHSCIFDI 63 Query: 90 YLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFP 149 L + + T+ Y R+ I A KVLYLD D + ++ L Sbjct: 64 KLFDDTFKEFKTPRAHITYMAYARYYIPQLI--DAEKVLYLDIDTLVVDNLDKLFEIELG 121 Query: 150 DDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNE 209 D +A ++ +FNSG +LIN+ W +V+ + + + E Sbjct: 122 DYPIAAILDG------------------DGIHFNSGVMLINSLYWMRYRVTEKLLE-ITE 162 Query: 210 PEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLN----YQLKESFINPVTNDTIFIH 265 E+ I DQ VLN+L + + + KYN Q + Y+ + + + IH Sbjct: 163 RELDNGI-FGDQGVLNLLFDNNWLKLEDKYNAQVGNDLGAFYENWQGYFDRNFESPTIIH 221 Query: 266 YIGPTKPWHDWAWDYPVSQAFMEAKNASPWKN 297 Y KPW+ ++ + + + W Sbjct: 222 YCTHDKPWNTFSSSR-FRETWWQY-EQLDWNE 251 >UniRef50_C6IB51 Glycosyltransferase n=4 Tax=Bacteroides RepID=C6IB51_9BACE Length = 417 Score = 185 bits (470), Expect = 2e-45, Method: Composition-based stats. Identities = 54/234 (23%), Positives = 94/234 (40%), Gaps = 13/234 (5%) Query: 58 RLCFHIFTDYFGDDDRKYFDALALQYKT-RIKIYLINGDRLRSLPSTKNW-THAIYFRFV 115 + +I TDY + +++ + + I+ +I+ + + L + T +R+ Sbjct: 2 NISIYILTDYISLESKEFLQEIKNVFTCVTIQWEIIDSESFKQLKKKGGYITEHTLYRYA 61 Query: 116 IADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVA 175 IAD F K LYLDAD++ G+IEPL A V ++ L Sbjct: 62 IADLFP-NLDKALYLDADLVINGSIEPLWELDLEGYYCAGVDDIFIRRINYRKILELAEK 120 Query: 176 GIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFA 235 + Y N+G LL+N ++ + + + I + + DQD +N + K+ Sbjct: 121 DV---YINAGVLLLNLKDLRKDKIQEKLLQHTSIY--INRDRYQDQDAINCICKGKIKLI 175 Query: 236 DIKYNTQFSLNYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEA 289 YN S E + +D I IHY G KPWH + + + + Sbjct: 176 PNIYNFTTSETLHTPE-----MLSDIIIIHYTGSIKPWHQEYTWQVLKELYCKY 224 >UniRef50_B7AUG6 Putative uncharacterized protein n=1 Tax=Bacteroides pectinophilus ATCC 43243 RepID=B7AUG6_9BACE Length = 301 Score = 185 bits (470), Expect = 2e-45, Method: Composition-based stats. Identities = 63/274 (22%), Positives = 110/274 (40%), Gaps = 16/274 (5%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHI-FTDYFGDDDRKYFDALALQ---YK 84 ++I + F+ + + S++K N + H+ +TD L Sbjct: 1 MNILVAMNDAFVKCYQVMLTSLIKNNPDENITVHVPYTDGLSRKGLDSIKELVRNQSHGS 60 Query: 85 TRIKIYLINGDRLRSLPST--KNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEP 142 ++ Y DRL SL W+ ++FR ++ ++L+LD DII G+I+ Sbjct: 61 ASVREYYFGKDRLGSLDKLPLGMWSVEMFFRIFAQEFIPESEDRILWLDGDIIVNGSIKD 120 Query: 143 LINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSAR 202 N F A + K + + Y NSG LLIN ++ Sbjct: 121 FYNTDFDSMYYAACEDIAISHGKIKEEYDNLGWSSEEIYVNSGVLLINLKALRNNGITRD 180 Query: 203 AIAMLNEPEIIKKITHPDQDVLNMLLADKLIFAD-IKYNTQFSLNYQLKESFINPVTNDT 261 A A+ E + K+ +PDQ +LN + DK+ FAD +YN Q S Y K + + + +++ Sbjct: 181 A-AVEYALENMDKLHYPDQYMLNAMFHDKIKFADAFRYNCQVS-GYSYKLADM--ILSES 236 Query: 262 IFIHYIGPTKPWHDWAWDYPVS----QAFMEAKN 291 +H+ G +PW + S + Sbjct: 237 AILHFPG-YRPWQTDYQKHYSSAIPGDIWWHYAK 269 >UniRef50_B0NR59 Putative uncharacterized protein n=1 Tax=Bacteroides stercoris ATCC 43183 RepID=B0NR59_BACSE Length = 306 Score = 185 bits (469), Expect = 2e-45, Method: Composition-based stats. Identities = 71/312 (22%), Positives = 130/312 (41%), Gaps = 26/312 (8%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRL-CFHIFTD-YFGDDDRKYFDALALQYKTR 86 + I + D N++ G+ I S+L ++ +I + + D++ + YK Sbjct: 4 IPIVFSIDHNYVMQAGVCILSLLMNSDEKEYYDIYILSAADITEHDKELLNKTIFAYKAD 63 Query: 87 IKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINF 146 I I+ DR + +N + A YFR +I D + K++Y D D+I Q ++ +++ Sbjct: 64 INFIEID-DRFDNAFEIRNISKAAYFRLLIPDLIP-QYDKIIYSDVDVIFQSGLQEVLDT 121 Query: 147 SFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAM 206 D+ + G A+ + LG+ GY NSGFLLIN +Q+ + Sbjct: 122 DLKDNYFGGIKAIG-AESIKDYIIQLGLN--IHGYINSGFLLINAKLQREKQLFNKIQEY 178 Query: 207 LNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNT---QFSLNYQLKESFINPVTNDTI- 262 L KK DQD++N++ ++L F +KY + L Y + + + + Sbjct: 179 LT-----KKFQFQDQDIINIVCKNRLTFLPLKYCFTQKSYELYYTNPKRLFSVFSPKEVE 233 Query: 263 ------FIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHM 316 IHY G KPW+ + + Y + S + + + ++Y + Sbjct: 234 EAFTEGIIHYEGTNKPWNGFCYRY---DNWWRYYKKSVFYSEEMHFQTAYK-IQYPTWTL 289 Query: 317 LKKHRYLKGFSN 328 K R L+ F Sbjct: 290 KKILRLLRNFIR 301 >UniRef50_B6G807 Putative uncharacterized protein n=2 Tax=Collinsella RepID=B6G807_9ACTN Length = 276 Score = 185 bits (469), Expect = 3e-45, Method: Composition-based stats. Identities = 40/267 (14%), Positives = 90/267 (33%), Gaps = 11/267 (4%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 +D+ D+ +L + S+ N+G+++ + + + I+ Sbjct: 6 MDVIVTCDEGYLGPLRTMLYSLRASNQGAQVRIWLLHKGISLPALEELERFCSVLGLAIE 65 Query: 89 IYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSF 148 ++ L ++ + +Y+R + + LYLD DI+ ++ L Sbjct: 66 PVTVDRVLLDGAKCSERYPQEMYYRLLAPSIIKAPIERALYLDPDILVINPLDDLFEIDL 125 Query: 149 PDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLN 208 + A + + YFN+G +L + A+ + + Sbjct: 126 HGNAFAAASHLDAVHPATALNKA--RLSTSSDYFNTGVILFDIARARKSICVDELFSYVK 183 Query: 209 EPEIIKKITHPDQDVLNMLLADKLIFA-DIKYNTQ-----FSLNYQLKESFINPVTNDTI 262 E + + PDQD+ N L + D +N ++ + ++ V T Sbjct: 184 AHEQV--MLFPDQDLFNSLFGAVTLRIPDEIWNYDARKYPDNIIRTWGTATLDWVMEHTA 241 Query: 263 FIHYIGPTKPWHDWAWDYPVSQAFMEA 289 +H+ G KPW + + + Sbjct: 242 ILHFCGKNKPW-APGYRGQFASLYKHY 267 >UniRef50_A5VK24 Glycosyl transferase, family 8 n=18 Tax=Lactobacillales RepID=A5VK24_LACRD Length = 282 Score = 185 bits (469), Expect = 3e-45, Method: Composition-based stats. Identities = 50/271 (18%), Positives = 89/271 (32%), Gaps = 11/271 (4%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 +++ + + F+ + SI + ++ + + Q Sbjct: 1 MNLLFSINDKFVTQLATVLLSIKLNTQAQEFNVYVLQKDKLKRTDD-LERVCKQLGMNYF 59 Query: 89 IYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSF 148 +N P T + IY+R + K+LYLDAD++C + L S Sbjct: 60 PIKVNDQLFNKAPVTDRYPTTIYYRLLAHRLLPQDLHKILYLDADVLCINDLSSLYETSL 119 Query: 149 PDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLN 208 A + + E A GY+NSG LL+N + + Sbjct: 120 DGYLYASAIHTNLTNTTEVINKIRLQNFDADGYYNSGVLLMNLDTIRKKVKDTDIFNYIR 179 Query: 209 EPEIIKKITHPDQDVLNMLLADKLIFAD---IKYNTQFSLNYQ---LKESFINPVTNDTI 262 + PDQDVLN L + ++T+ Y+ E + V +T+ Sbjct: 180 TH----TLLLPDQDVLNALYGRYIKSVPDQLYNFDTRKGGIYETISFGEWTTDWVMRNTV 235 Query: 263 FIHYIGPTKPWHDWAWDYPVSQAFMEAKNAS 293 +HY G KPW + + + Sbjct: 236 ILHYCGRDKPWLPTKNSGRYTALYKNYFQMT 266 >UniRef50_C7HS13 Family 8 glycosyl transferase n=2 Tax=Anaerococcus RepID=C7HS13_9FIRM Length = 276 Score = 184 bits (468), Expect = 4e-45, Method: Composition-based stats. Identities = 56/274 (20%), Positives = 110/274 (40%), Gaps = 12/274 (4%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQ---YKT 85 ++I D+N+L + S+ + N + ++ D+ K + ++ + Sbjct: 1 MNILVSCDENYLNPLKTMLYSLFESN-DTNFEIYLIHKDIRDEKIKEIEKFVIKASSKRA 59 Query: 86 RIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLIN 145 ++ + + + T +T +Y+R + Y ++LYLD D++ + E L N Sbjct: 60 KLNAIKV-KNLFSNAKITFYYTEEMYYRLLAYKYLPENLDRILYLDPDVLVLNSCEKLYN 118 Query: 146 FSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAK--GYFNSGFLLINTAQWAAQQVSARA 203 D+ A A +G YFNSG L+IN Q + Sbjct: 119 MDLGDNYFAAATHTIPTVQSANVARLSISSGHKDIENYFNSGILMINLKLSRDSQTYEKE 178 Query: 204 IAMLNEPEIIKKITHPDQDVLNMLLADKLIFADI---KYNTQFSLNYQLKESFINP--VT 258 + + + PDQD+LN++ +K+I D Y+ + L Y+LK+ N + Sbjct: 179 VLNYVKNTKSLGLIMPDQDLLNVVFRNKIIKIDEIKYNYDARRYLTYKLKDKKYNLSYII 238 Query: 259 NDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNA 292 ++T F+H+ G KPW + + ++ Sbjct: 239 SNTCFLHFCGKRKPWLEENNLGVFTSLYLYFWKK 272 >UniRef50_C4ZG45 Glycosyltransferase Family 2 modular protein n=1 Tax=Eubacterium rectale ATCC 33656 RepID=C4ZG45_EUBR3 Length = 723 Score = 184 bits (467), Expect = 4e-45, Method: Composition-based stats. Identities = 58/276 (21%), Positives = 105/276 (38%), Gaps = 20/276 (7%) Query: 26 NLCLDIAYGT---DKNFLFGCGISIASILKYNEGSRLCFHIFTDY-FGDDDRKYFDALAL 81 + + I G D N+ G ++ SI++ + + FHI D + ++ +A Sbjct: 340 DNAIHICLGIHDKDGNYSVWAGTTMQSIVENTKA-PIVFHILHDDTLNEMNKNKLSLIAD 398 Query: 82 QYKTRIKIYLINGDRLRSLPSTKN-WTHAIYFRFVIADYFINKAPKVLYLDADIICQGTI 140 I+ + N D SL + N +T FR ++ D + K++YLD+D+ I Sbjct: 399 NSGNGIEFHHFNPDIFGSLADSMNRFTIGTMFRIMLPD-IMPDLKKIIYLDSDLFVNTDI 457 Query: 141 EPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQ-V 199 E L N + + +A W YFN+G L +N + Sbjct: 458 EELWNLNIDNYCLAAAQDCSTIRNWGTPYAVAAGQTSRDRYFNAGVLCMNLDNIRKNGSL 517 Query: 200 SARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTN 259 + + L++ + PDQD LN + + K + D K+N Y + E+ N Sbjct: 518 FQQVMDYLSDN---PRTWLPDQDALNAIFSGKTLLIDEKWN------YFIDEARKNNEKA 568 Query: 260 DTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPW 295 + HY + + +A+ +PW Sbjct: 569 EKKIYHYAATL---LMLHTNNEIDRAYYFTILRTPW 601 >UniRef50_A5UC07 Aspartate-semialdehyde dehydrogenase n=7 Tax=Haemophilus influenzae RepID=A5UC07_HAEIE Length = 300 Score = 184 bits (467), Expect = 5e-45, Method: Composition-based stats. Identities = 54/276 (19%), Positives = 101/276 (36%), Gaps = 18/276 (6%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 ++I + D NF + S+ ++ + ++ D + + ++ + Sbjct: 1 MNIVFTLDCNFASHLDTVLKSLCYHHNN--INIYVIHDGIPAESLEKLKMHCAKFDNTLY 58 Query: 89 --IYLINGDRLRSLPSTKNW-THAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLIN 145 + IN ++ S + + A FR + +V+YLD D+I I+ L + Sbjct: 59 DIQFNINQFSFPTVMSPAHIQSSASLFRLYLHQILPQHIERVIYLDIDLIIHQAIDELWD 118 Query: 146 FSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIA 205 + D +A V WE + + Y N+G +LIN +W + I Sbjct: 119 INLEDSLIAGVSDFFSEYLWEHPFYE------KQQYINTGVMLINLNKWRENNIEQYFIE 172 Query: 206 MLNEPEIIKKITHPDQDVLNMLLADK-LIFADIKYNTQFSLNYQLKESFINPVTNDTIFI 264 + + DQDV+N + + +K+N Q L + I Sbjct: 173 YAAKYGEF--FVYGDQDVINFSIPTNLIKLLPVKFNIQVKFIEYLWMEHKEKIKFTPHII 230 Query: 265 HYIGPTKPW---HDWAWDYPVSQAFMEAKNASPWKN 297 HYIG KPW H ++ ++ + S W N Sbjct: 231 HYIGSNKPWLKEHSANSPRFYNEEYLFYHHLS-WDN 265 >UniRef50_C5S3F7 Putative glycosyl transferase n=2 Tax=Actinobacillus minor RepID=C5S3F7_9PAST Length = 275 Score = 183 bits (466), Expect = 6e-45, Method: Composition-based stats. Identities = 59/280 (21%), Positives = 113/280 (40%), Gaps = 26/280 (9%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIY 90 I D F +I SI +N + + F + +Y + Q I Sbjct: 13 IILAADIKFAEQLETTIKSICYHNANLYIV--LLNRDFSKEWFEYLNTYLNQINCEIIDV 70 Query: 91 LINGDRLRSLPSTKNWTHA-IYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFP 149 +N ++L + + + A +FR+ I + KVLYLD D++ G++ + Sbjct: 71 KVNCNQLEEYKTLPHISSASTFFRYFIPAFV--NDDKVLYLDCDLVVNGSLSIFFDLELN 128 Query: 150 DDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNE 209 D VA + + ++ +K+ +FN+G LLIN W Q+++ +A+ + + Sbjct: 129 DHYVAASLDDIAFNFHQKK------------HFNAGVLLINNKLWRKQEITLKALELTD- 175 Query: 210 PEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKES----FINPVTND-TIFI 264 + +K+ DQ+VLN+L +K I + N Y + + +I +D + + Sbjct: 176 -RLNEKLEEGDQEVLNILFQNKWIELNPYLNYLVGAEYLYRRNGVTQYIRRQEDDVPLIL 234 Query: 265 HYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPN 304 H+ KPW P + + + W + N Sbjct: 235 HFNTKYKPWLPID-GVPFREYYWFYYRLN-WADIIARHYN 272 >UniRef50_A5EVI8 Glycosyl transferase family 8 protein n=1 Tax=Dichelobacter nodosus VCS1703A RepID=A5EVI8_DICNV Length = 617 Score = 183 bits (465), Expect = 8e-45, Method: Composition-based stats. Identities = 56/326 (17%), Positives = 109/326 (33%), Gaps = 33/326 (10%) Query: 25 ENLCLDIAYGTDKNFLFGCGISIASILKYNE-GSRLCFHIFTDYFGDDDRKYFDALALQY 83 + + + D++++ G I SI+ + + L I +K L + Sbjct: 276 QKNAVSVVIAADEHYVPHLGALICSIIDHLSCDAFLDLIILDGGIDFISQKQLAHLLGKR 335 Query: 84 KTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPL 143 I+ ++ D +++ A ++R ++ D I +VLY+D D I + L Sbjct: 336 GA-IQFLDLS-DEFTDQKVHMHFSRATFYRLIL-DKLIIDRKRVLYIDCDTIVLADLAEL 392 Query: 144 INFSFPDDKVAMVVTEGQADW----------------WEKRAHSLGVAGIAKGYFNSGFL 187 + V + + +G+ + YF +G + Sbjct: 393 FATDLNGKAIGAVFDYIMHHFCQVGVRSIEFTNYLPAKKYLEDYVGLKENWRHYFQAGVI 452 Query: 188 LINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNT------ 241 L + Q + + IA L E K+ DQD+LN + F + +N Sbjct: 453 LFDLEQLRTLNYADKMIASLTE----KRYWFLDQDILNKYFVGNVHFLNPCWNVVNVGAD 508 Query: 242 QFSLNYQLKESFINPVTNDTIFIHYIGPT-KPWHDWAWDYPVSQAFMEAKNASPWKNTAL 300 + + + IHY G KPW D + ++ + + W + L Sbjct: 509 IYEGLSAELIAELKAAERAPAIIHYAGYEAKPWVDLSAK--FAEFYYYYLRQTFWYESVL 566 Query: 301 LKPNNSNQLRYSAKHMLKKHRYLKGF 326 N + S K K R+ + Sbjct: 567 TSKMLLNVRKKSQKSGEKSWRWKIAY 592 >UniRef50_B1LK07 Glycosyl transferase family 8 n=10 Tax=Enterobacteriaceae RepID=B1LK07_ECOSM Length = 630 Score = 182 bits (463), Expect = 1e-44, Method: Composition-based stats. Identities = 62/340 (18%), Positives = 130/340 (38%), Gaps = 46/340 (13%) Query: 26 NLCLDIAYGTDKNFLFGCGISIASILKYNEGSR-LCFHIFTDYFGDDDRKYFDALALQYK 84 + + + D N+ G I SI+ +++ SR + + +++ L + Sbjct: 274 DESVPVVISFDNNYALSGGALINSIVLHSDASRNYDIVVLENKVSHLNKQRLIKLVAGHN 333 Query: 85 T-RIKIYLINGD-RLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEP 142 ++ + +N + + + +++ + Y R I F + KV+++D+D + + + Sbjct: 334 NISLRFFDVNSFTEMSDVHTRAHFSASTYARLFIPQLFR-EYKKVVFIDSDTVVKADLAT 392 Query: 143 LINFSFPDDKVAMV----------------VTEGQADWWEKRAHSLGVAGIAKGYFNSGF 186 L++ + VA V +G + +LG+ YF +G Sbjct: 393 LLDVEIGTNLVAAVKDIVMEGFVKFGTMSESDDGIMPAEQYLKKTLGMTN-PDEYFQAGI 451 Query: 187 LLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFS-- 244 ++ N Q + A+ ++ L KK DQD++N + ++ F +++N Sbjct: 452 IVFNVEQMVTENTFAQLMSALKA----KKYWFLDQDIMNKVFFGRVKFLPLEWNVYHGNG 507 Query: 245 --------LNYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWK 296 L + F+ + IHY G KPW+ D F+E ++PW+ Sbjct: 508 NTDDFFPNLKFSTYMRFLQ-ARRNPKMIHYAGENKPWNTEKVD--FYDDFLENVLSTPWE 564 Query: 297 NT--------ALLKPNNSNQLRYSAKHMLKKHRYLKGFSN 328 A + PN +L+ + K R L + N Sbjct: 565 KEIYYRQLPVATVVPNQHTELQQTVLLQTKIKRALMPYVN 604 >UniRef50_B1I7M9 Glycosyl transferase, family 8 n=16 Tax=Streptococcus pneumoniae RepID=B1I7M9_STRPI Length = 406 Score = 182 bits (462), Expect = 2e-44, Method: Composition-based stats. Identities = 57/294 (19%), Positives = 114/294 (38%), Gaps = 36/294 (12%) Query: 30 DIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKI 89 + + D ++ ++ S+ ++N L ++ + + + Sbjct: 7 SVVFAGDYAYIRQIETAMKSLCRHNS--HLKIYLLNQDIPQEWFSQIRIYLQEMGGDLID 64 Query: 90 YLINGDRLRSLPSTK--NWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 + G + + S K + H + R+ I D+ KVLYLD+D+I G + L Sbjct: 65 CKLIGSQFQMNWSNKLPHINHMTFARYFIPDFV--TEDKVLYLDSDLIVTGDLTDLFELD 122 Query: 148 FPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAML 207 ++ +A + A FN+G LLIN +W ++ + + I + Sbjct: 123 LGENYLAAARSCFGAGVG----------------FNAGVLLINNKKWGSETIRQKLIDLT 166 Query: 208 NEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNY---QLKESFI--NPVTNDTI 262 + + + DQ +LNML D+ + +YN Q +Y K FI P+ + Sbjct: 167 EKEH--ENVEEGDQSILNMLFKDQYSSLEDQYNFQIGYDYGAAAFKHQFIFDIPLEPLPL 224 Query: 263 FIHYIGPTKPWHDWAWDYPVSQAFMEAKNAS------PWKNTALLKPNNSNQLR 310 +HYI KPW+ ++ + + + E W + ++ P+ S + Sbjct: 225 ILHYISQDKPWNQFSVGR-LREVWWEYSLMDWSVILNEWFSKSVKYPSKSQIFK 277 >UniRef50_Q3DNS6 Glycosyl transferase, family 8 n=5 Tax=Streptococcus agalactiae RepID=Q3DNS6_STRAG Length = 401 Score = 182 bits (462), Expect = 2e-44, Method: Composition-based stats. Identities = 54/265 (20%), Positives = 102/265 (38%), Gaps = 28/265 (10%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIY 90 +A D N+L ++I SI YN + F++F + + + +++ Sbjct: 5 VALAVDSNYLDKALVTIKSICVYNRN--ITFYLFNQDTPVEWVRNINRKLEPLGSKLINV 62 Query: 91 LINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPD 150 I + L + T + +FR +ADY + +VLYLD+DII ++ L F Sbjct: 63 KIYNYDIAHLTTF--LTVSTWFRLFLADYIPSS--RVLYLDSDIIVNTNLDYLFELDFKG 118 Query: 151 DKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEP 210 +A V + +G FN+G LL N W ++ + E Sbjct: 119 YYLAAVKDPHKN---------------EEGGFNAGMLLANLELWREDGLTKTLLKTAEEL 163 Query: 211 EIIKKITHPDQDVLNMLLADKLIFADIKYNTQ----FSLNYQLKESFINPVTNDTIFIHY 266 + K DQ +LN++ ++ + + +N Q S ++N IH+ Sbjct: 164 HRVVKT--GDQSILNIVCHNRWLSLNKTWNFQTYDVVSRYNHRSYLYLNIENRTPNIIHF 221 Query: 267 IGPTKPWHDWAWDYPVSQAFMEAKN 291 + KPW++ + + + Sbjct: 222 LTSDKPWNENSVAR-FRELWWYYFQ 245 >UniRef50_C3XKY2 Glycosyl transferase n=2 Tax=Campylobacterales RepID=C3XKY2_9HELI Length = 433 Score = 182 bits (462), Expect = 2e-44, Method: Composition-based stats. Identities = 66/368 (17%), Positives = 113/368 (30%), Gaps = 70/368 (19%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEG-------------------------------- 56 I D+N++ + I S++ Sbjct: 2 FHIILSADENYIKYASVLITSVIYNTNPKLTFKDFCQKEGFKALKNSYFSAYQNIDFSKL 61 Query: 57 ------SRLCFHIFTDYFGDDDRKYFDALAL----QYKTRIKIYLINGDRLRSLPSTK-- 104 FHI +D + L Y I ++IN + P + Sbjct: 62 SKQEAQEGYIFHILSDSISSTTQNQLTELQNTLNTIYPCEILTHIINDKEFENFPISGAA 121 Query: 105 NWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADW 164 + H Y+R + Y + K LYLD+D++C + L D VA + G Sbjct: 122 HSNHLPYYRLKLDSYLDDSITKCLYLDSDMLCLCDLRELFAIDLKDFVVAAINDPGTKKR 181 Query: 165 WEKRAH--SLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQD 222 K + YFNSGFLLINT + ++ + + + IK DQD Sbjct: 182 KIKYKENGKKMILNFNDNYFNSGFLLINTQNYKQHKIQEKCENLAKKCYYIK---AADQD 238 Query: 223 VLNMLL-ADKLIFADIKYNTQ-----FSLNYQLKESFINPVT-------NDTIFIHYIGP 269 +LN + +KL+ I YN ++ ++ +N + IHY Sbjct: 239 LLNATIPKEKLLKLPIAYNFSSISFCIAICKDEQKHRLNCTRAEFMESYKNPKIIHY--G 296 Query: 270 TKPWHDWAWDYPVS-----QAFMEAKNASP-WKNTALLKPNNSNQLRYSAKHMLKKHRYL 323 KPW + +P + L + + + A + + Sbjct: 297 EKPWKFLQSYVNSKGENINDLWWHYAKITPSFSTQLLESKASIKEYLHFASLGFEVFKLS 356 Query: 324 KGFSNYLF 331 + Y Sbjct: 357 TKLTGYFA 364 >UniRef50_A7H2M2 Glycosyl transferase family 8 n=3 Tax=Campylobacter jejuni RepID=A7H2M2_CAMJD Length = 381 Score = 181 bits (460), Expect = 3e-44, Method: Composition-based stats. Identities = 65/337 (19%), Positives = 125/337 (37%), Gaps = 62/337 (18%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNE-------------GSRLCFHIFTDYFGDDDRKY 75 I ++N++ + + SI++ + FHI +D+ + + Sbjct: 2 FHIVLNANENYIKYAAVLMTSIIQKTDLNKSMSEFCNFDTDEGYVFHILSDHISESMKVR 61 Query: 76 FDALALQ----YKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLD 131 L Q Y +I ++++N D + + + + Y+R +A LYLD Sbjct: 62 ISNLEKQLNDIYPCKIVLHILNDDEFKGMLKWRG-NYLAYYRIKMASVLPQNLKICLYLD 120 Query: 132 ADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLG-----VAGIAKGYFNSGF 186 D++C G + L++ + + A+ + +K SL + YFNSGF Sbjct: 121 CDMLCFGDLRELLSVDINNYQAAVCLDGNNHKKNKKVFFSLKGREKYKFSNIEKYFNSGF 180 Query: 187 LLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLN 246 +L+N +W + ++I L + K +PDQD LN L + + ++N Sbjct: 181 ILVNLDRWRRDNIENKSIDFLKKF----KTLYPDQDALNFAL-NDTLLLPNRWNFSLGYF 235 Query: 247 ---------------------YQLKESFINPVTNDTIFIHYI-GPTKPWHDWAW------ 278 K F N V N H+I P KPW + + Sbjct: 236 VAFLKNSQEILFLNQTKYPHLNYTKTEFENEVKN-IKIAHFILDPFKPWDAFQYSIVNDD 294 Query: 279 ----DYPVSQAFMEAKNASP-WKNTALLKPNNSNQLR 310 +YP + + +P + L++ + N+ + Sbjct: 295 LQLIEYPFYKHYWSVAKNTPEFYLDFLVQKESINEHK 331 >UniRef50_A3VFX3 Glycosyltransferase n=1 Tax=Rhodobacterales bacterium HTCC2654 RepID=A3VFX3_9RHOB Length = 615 Score = 181 bits (460), Expect = 3e-44, Method: Composition-based stats. Identities = 58/323 (17%), Positives = 117/323 (36%), Gaps = 35/323 (10%) Query: 15 VIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYN-EGSRLCFHIFTDYFGDDDR 73 V+ + + +++A+ +D+ +L +AS++++ + GD D Sbjct: 255 VVPFARGARFNDGAVNVAFTSDRPYLPQTAAMVASLIEHAAPDREYNLFYLHENIGDRDL 314 Query: 74 KYFDALALQYKTRIKIYLIN-GDRLRSLPSTKNW--THAIYFRFVIADYFINKAPKVLYL 130 +LA+ + I ++ IN G ++ ++A Y RF++ D +++YL Sbjct: 315 DLLRSLAVAHG-NITLHTINVGTAFSREYRARHHTPSNATYNRFLLFDLLP-DVERLVYL 372 Query: 131 DADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKR------------AHSLGVAGIA 178 D D++ G + L + D +A V R A+ G++ Sbjct: 373 DVDLVLCGDVAELFDTDMNDAPLAAVTDALMTRVLATRVRTRDPEVPDLYAYLSDDLGLS 432 Query: 179 KG----YFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIF 234 YFN+G +++N A +V M+ + DQD+LN+ D+ + Sbjct: 433 DDQISRYFNAGVMVMNFAAMDVAKVGRELREMVA----GNRYFFRDQDILNVYFRDRFVT 488 Query: 235 ADIKYNTQFSLNYQLK------ESFINPVTNDTIFIHY-IGPTKPWHDWAWDYPVSQAFM 287 ++N S + D +H+ KPW + + + F Sbjct: 489 LPSRFNVHNSDRGAYDNVPVPIRNDALAAKADPFIVHFAAAHQKPWREPDVE--FAGLFW 546 Query: 288 EAKNASPWKNTALLKPNNSNQLR 310 +P+ L LR Sbjct: 547 STLARTPFWFEVLEATRRHRSLR 569 >UniRef50_C8WAA9 Glycosyl transferase family 8 n=2 Tax=Atopobium RepID=C8WAA9_ATOPD Length = 358 Score = 181 bits (460), Expect = 3e-44, Method: Composition-based stats. Identities = 48/321 (14%), Positives = 97/321 (30%), Gaps = 29/321 (9%) Query: 32 AYGTDKNFLFGCGISIASILKY-NEGSRLCFHIFTDYFGDDD-RKYFDALALQYKTRIKI 89 + NF+ ++I SI++ N R + T + L + Sbjct: 19 VFACSDNFVPYLSVAIQSIIENVNPERRYDIIVLTRDLSPTNMITLTRQAQLVNNVHVGF 78 Query: 90 YLINGDRLR-SLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSF 148 ++ LP ++ Y+R + K +YLD+D++ I L + Sbjct: 79 LDVDAALGDIELPHHGHFRPETYYRLLAPSLLP-NVNKAIYLDSDLVVNTDIAELYDIDI 137 Query: 149 PDDKVAMVVT-------EGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSA 201 V +G + YF +G +L+N + Q Sbjct: 138 TGYLVGATRDADTIGQIDGYDATVGPYLKNELGMDDPHDYFQAGVILMNLEEIRKQISPE 197 Query: 202 RAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKE---------- 251 + + ++ DQDVLN + + ++K+N + ++ Sbjct: 198 EFLKV----STMRTWRWLDQDVLNRFVNGHYLRINMKWNYLVDWQFLRRDHIVAQAPKDI 253 Query: 252 -SFINPVTNDTIFIHYIGPT-KPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQL 309 + H+ GP +PW D ++ F SP+ + S + Sbjct: 254 REEYEEARKNICIAHFAGPDNRPWLYPNSD--LAGLFWFYARRSPYLEELRSQLEESRRT 311 Query: 310 RYSAKHMLKKHRYLKGFSNYL 330 H ++ +G Sbjct: 312 VRGLSHRVQSGVLFRGLMPLF 332 >UniRef50_C5EZG9 Glycosyl transferase family protein n=1 Tax=Helicobacter pullorum MIT 98-5489 RepID=C5EZG9_9HELI Length = 374 Score = 181 bits (460), Expect = 3e-44, Method: Composition-based stats. Identities = 60/310 (19%), Positives = 109/310 (35%), Gaps = 36/310 (11%) Query: 59 LCFHIFTDYFGDDDRKYFDALA----LQYKTRIKIYLINGDRLRSLPS-TKNWTHAIYFR 113 FH+ D+ + ++ L Y + I+++ + R+ T N + Y+R Sbjct: 9 YNFHLLMDFVSQETKEKLQNLILELSKIYPCTLNIHILEDEIFRTQSLRTLNGNYLAYYR 68 Query: 114 FVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVV---TEGQADWWEKRAH 170 I + +YLD D+I G + L + +V+ + E + Sbjct: 69 LRIGSALPLSIKRCVYLDVDMIVLGDLRELFKINLQGKICGVVMEGKDNDTQNILESKNK 128 Query: 171 SLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLAD 230 I YFNSG LL++ W + + RA ++ + K D+ +LN +L Sbjct: 129 INKSIAIVSNYFNSGMLLVDLDLWRKENIEDRAFEIVKKYYCHK----HDEHILNAVLQG 184 Query: 231 KLIFADIKYNTQFSL-------------NYQLKESFINPVTNDTIFIHYIGPTKPWHD-- 275 + ++N L N N + +HY KPW D Sbjct: 185 QTFKILPQWNMMVFLYCRAVCLNERGKINMPYNRKDFNNALKNPKILHYHTHHKPWEDSK 244 Query: 276 ---WAWDYPVSQAFMEAKNASP-WKNTAL-LKPNNSNQLRYS----AKHMLKKHRYLKGF 326 + + Q + + +P +K L LKP + L + K + + L Sbjct: 245 IYLNYCNKFLGQYWWDMVEQTPIFKEKLLQLKPQADSALAFQCLVGYKLLRYYQKGLFIL 304 Query: 327 SNYLFYFIEK 336 + YF+ K Sbjct: 305 IPFYTYFLIK 314 >UniRef50_Q9L7A2 Putative glycosyl transferase n=1 Tax=Haemophilus ducreyi RepID=Q9L7A2_HAEDU Length = 269 Score = 181 bits (459), Expect = 4e-44, Method: Composition-based stats. Identities = 49/267 (18%), Positives = 106/267 (39%), Gaps = 23/267 (8%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 ++I ++++ +I SI +N+ + F++ + + + + + I Sbjct: 8 MNIVLAANQSYSEYILTTIKSIYLHNKH--IRFYLLNRDYPTEWFDILNNKLRKLNSEII 65 Query: 89 IYLINGDRLRSLPSTKNWTH-AIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 + D +++ + + + +FR+ I+D+ + KV+YLDADI+ G++ L Sbjct: 66 DIKVTNDTIKNFKTYSHISSDTTFFRYFISDFI--EQDKVIYLDADIVVNGSLTELYQTD 123 Query: 148 FPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAML 207 + +A V + + FN+G LLIN +W ++ +++ Sbjct: 124 ISNYFLAAVKDIISEKIY-----------VNNHIFNAGMLLINNKKWREHNITQFCLSLS 172 Query: 208 NEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTND-----TI 262 + I + DQ +LN++ DK + + YN +Y + D + Sbjct: 173 EKY--INSLPDADQSILNLIFKDKWLKLNRGYNYLIGTDYLFFKYGKTRYLEDLGETIPL 230 Query: 263 FIHYIGPTKPWHDWAWDYPVSQAFMEA 289 IHY KPW + + + Sbjct: 231 IIHYNTEAKPWLNIFNTRFRNIYWFYY 257 >UniRef50_A4WWT5 Glycosyl transferase, family 8 n=1 Tax=Rhodobacter sphaeroides ATCC 17025 RepID=A4WWT5_RHOS5 Length = 319 Score = 180 bits (457), Expect = 6e-44, Method: Composition-based stats. Identities = 61/310 (19%), Positives = 107/310 (34%), Gaps = 26/310 (8%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIY 90 + + D+N+ + A I + I + ++ + I Sbjct: 17 VIFCCDRNYYPYAMFAAAQIAGRHPHRGFDICIASLEAIEEPPSLSELAVR--HCTID-- 72 Query: 91 LINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQ-GTIEPLINFSFP 149 T Y R V+ + F ++LYLD+DI Q G + LI Sbjct: 73 --AAHLFADFGLDDRRTAVTYLRLVLPEAFSEDYDRILYLDSDIYIQGGDLGALIALPLA 130 Query: 150 DDKVAMVVTEGQADWWEKRAHSLGVAGIAK-GYFNSGFLLINTAQWAAQQVSARAIAMLN 208 +A V Q +R G+ + YFNSG LL + + A + A+ + Sbjct: 131 GRPLAAVRDNKQWRTPSRRMVDFDRLGLPQRPYFNSGVLLFDVPAFRAANLLQEALRIGR 190 Query: 209 EPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIG 268 +++ DQ +LN + +N Q++ + +L + + P IH+IG Sbjct: 191 SQ--GRQLVRHDQSLLNACMLGNWAELSPSWNWQYTWSSRLFAAMLGPN-----IIHFIG 243 Query: 269 PTKPWHDWAWDYPVSQAF---------MEAKNASPWKNTALLKPNNSNQLRYSAKHMLKK 319 KPW D D +S F + P + P++ R KH+L Sbjct: 244 RCKPWCDP--DNLLSPQFARDLQIFLARHFPDHPPLPLGPGMLPDSLAMRRMLMKHLLSS 301 Query: 320 HRYLKGFSNY 329 R + + Sbjct: 302 GRLARYLERF 311 >UniRef50_Q5M3K9 Glycosyl transferase n=4 Tax=Streptococcus RepID=Q5M3K9_STRT2 Length = 697 Score = 180 bits (457), Expect = 7e-44, Method: Composition-based stats. Identities = 52/298 (17%), Positives = 107/298 (35%), Gaps = 27/298 (9%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIY 90 I + ++ +I SI+ ++ + F++ D F + + + + + + Sbjct: 303 IVLAANYTYVDQVLTTIKSIVFHHRN--IRFYLINDDFSQEWFRGLNRHLAAFGSEVINC 360 Query: 91 LINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPD 150 ++ ++ + N+ A Y R+ +AD+ + + LYLD+D++ G++E L Sbjct: 361 RVDSSHIKQFKTNSNY--ASYLRYFVADFVSEE--RALYLDSDMVVTGSLEDLFTLDLQG 416 Query: 151 DKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEP 210 +A V + F++GF++I+TA W + I M +E Sbjct: 417 RPLAAVRDYAVQ------------GQDRQAMFDAGFMVIDTAYWKQYNMRRHLIDMTSEW 464 Query: 211 EIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIGPT 270 K+ +Q +LNM+ + + N + L + + +HY Sbjct: 465 H--DKVPFAEQSILNMVFCNNWLTLSFDNNYAVT-KSSLSGYHLPNGQDYPKVLHYTSHR 521 Query: 271 KPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKKHRYLKGFSN 328 KPW A + + W N+ L S + K R Sbjct: 522 KPWLPLACQ-AYREVWWFYAQM-DWSGV----AENAALLPLSEDMIYPKGRPFTCLVY 573 >UniRef50_Q0I2Z7 Lipopolysaccharide biosynthesis protein, glycosyltransferase, family 8 n=1 Tax=Haemophilus somnus 129PT RepID=Q0I2Z7_HAES1 Length = 354 Score = 180 bits (456), Expect = 8e-44, Method: Composition-based stats. Identities = 58/303 (19%), Positives = 110/303 (36%), Gaps = 56/303 (18%) Query: 29 LDIAYGTDKNFLFGCGISIASIL----KYNEGSRLCFHIFTDYFGDDDRKYFDALALQYK 84 ++I + D N+ +++ SI+ K NE + F++ + Y LA + Sbjct: 1 MNILFACDDNYAKYLAVTMLSIIHARDKNNECYTIHFYLLDMGISTVAKDYCLELANKNN 60 Query: 85 TRIKIYLINGDRLRSLP-STKNWTHAIYFRFVIADYFIN-KAPKVLYLDADIICQGTIEP 142 + I I+ P + + + + Y R +A+Y K++YLD DI+ ++ P Sbjct: 61 CHLDIVPISISDFEKFPRTIEYISLSTYARLNLANYLKKFNLTKIIYLDIDILVNHSLLP 120 Query: 143 LINFSFPDDKVAMVVTEGQADWWEKR---------------------------------- 168 L N + + + + Sbjct: 121 LWNTDLGNKAIGACYDAFIESQEKSKRMSSQSVSQSVSQSVSQSVSQSVSQSVSQSVSQS 180 Query: 169 ---------AHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIK-KITH 218 + YFN+G LLIN +W V +++ + + + + Sbjct: 181 VSQSVSQSDYKTKLHLPNTHFYFNAGVLLINVVEWEKCHVFEKSLQWIEYCKRNNIEFLY 240 Query: 219 PDQDVLNMLLADKLIFADIKYNTQFSLNYQLKE------SFINPVTNDTIFIHYIGPTKP 272 DQD+LN + A+ + + D++YN + +LK + T IHY+GP K Sbjct: 241 QDQDILNAIFANNVKYLDLRYNFTANALNRLKRVSKKELNQYEEATMPLAIIHYVGPKKS 300 Query: 273 WHD 275 WH+ Sbjct: 301 WHE 303 >UniRef50_C5ZVZ7 Putative glycosyltransferase n=3 Tax=Campylobacterales RepID=C5ZVZ7_9HELI Length = 431 Score = 179 bits (455), Expect = 1e-43, Method: Composition-based stats. Identities = 72/363 (19%), Positives = 122/363 (33%), Gaps = 66/363 (18%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYN---------------------------------- 54 I + DKN++ + I SI+K Sbjct: 2 FHIFFSADKNYIPYTAVLITSIIKNTNPQKSFKDFCTTPSDSLPSLDYPRLQYDNLDKLD 61 Query: 55 EGSRLCFHIFTDYFGDDDRKYFDALALQ----YKTRIKIYLINGDRLRSLPSTK--NWTH 108 + FHI +D D + + Y ++I++IN P + + +H Sbjct: 62 KSEGYVFHILSDSIPKDLQTKLQNFIQELSAFYPCTLQIHIINDIDFAHFPISGAAHSSH 121 Query: 109 AIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKR 168 Y+R DY K LYLD+D++ + L D+ ++ G + K Sbjct: 122 LPYYRLKWQDYIKPAPQKCLYLDSDMLVLCDLRELFALDLKDNIAGIIGDCGSKNRKIKY 181 Query: 169 --AHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNM 226 + + YFNSGFLLIN+ Q+ +Q+ + + + IK DQD+LN Sbjct: 182 QENNYKKTFYFDENYFNSGFLLINSKQYIKEQIWEKCENLAKKCTYIK---AADQDLLNF 238 Query: 227 LLA-DKLIFADIKYNTQ-FSLNYQLKESF-----------INPVTNDTIFIHYIGPTKPW 273 + +K + YN Q +L Y L + N + +HY KPW Sbjct: 239 TIPINKRLKLPFAYNFQCITLLYVLCKDECKNRLNYTREAFNKSFKNPKILHY--GEKPW 296 Query: 274 HDWAWDYPVS-----QAFMEAKNASP-WKNTALLKPNNSNQLRYSAKHMLKKHRYLKGFS 327 + E +P + + L + + + + A Y F Sbjct: 297 RYLQSYQDYKGNNINDIWWEYAQQTPIFGDKLLKQKSQISDYKLFAILGYYALLYTTNFL 356 Query: 328 NYL 330 Y Sbjct: 357 GYF 359 >UniRef50_D2MYR1 Putative uncharacterized protein n=1 Tax=Campylobacter jejuni subsp. jejuni 414 RepID=D2MYR1_CAMJE Length = 383 Score = 179 bits (454), Expect = 1e-43, Method: Composition-based stats. Identities = 69/311 (22%), Positives = 114/311 (36%), Gaps = 70/311 (22%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSR----------------------------LC 60 I + +++ + I+SI+K + S+ Sbjct: 2 FHIILNLNDDYVKYASVLISSIVKNTDTSKTFAKICEENHNLTHILTLKQYNKSEEEGYV 61 Query: 61 FHIFTDYFGDDDRKYFD----ALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVI 116 FHI +D+ D R + LA Y IKIY+IN D R+ K Y+R ++ Sbjct: 62 FHILSDFISDKTRMKLEYLKENLAKIYPCDIKIYIINEDNFRNFLHWKG-NFVAYYRLMV 120 Query: 117 ADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSL---- 172 K LY+DAD++C I L F D + V + + L Sbjct: 121 GSILPPDIEKCLYIDADMLCFSDIRKLFLFDLEDKVLGAVADFATWNTRFLKFRKLKYLF 180 Query: 173 -GVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADK 231 G ++ YFNSG LLI+ +W Q + + + +L K PDQD LN+++ + Sbjct: 181 KGFLKFSREYFNSGLLLIDLKEWRRQNIEKKCLDVLKYY----KCILPDQDALNIVIKEN 236 Query: 232 LIFADIKYNTQ---FSLNYQ-----------------------LKESFINPVTNDTIFIH 265 I + +N ++ NY ++ + N +F+H Sbjct: 237 YIKLPLSFNCPTVCYATNYLNIICKDEISSFSKLDYFKEVGMMYSKNELLEALNKPLFLH 296 Query: 266 YIGPTKPWHDW 276 Y KPW + Sbjct: 297 Y--SEKPWARY 305 >UniRef50_A8AY72 Glycosyl transferase, family 8 SP1766 n=7 Tax=Streptococcus RepID=A8AY72_STRGC Length = 435 Score = 178 bits (451), Expect = 3e-43, Method: Composition-based stats. Identities = 58/304 (19%), Positives = 109/304 (35%), Gaps = 38/304 (12%) Query: 1 MQQVFFQETEFLNSVIDYDHKVE---TENLCLD----IAYGTDKNFLFGCGISIASILKY 53 M+ +F LN I Y+ ++ I D ++ +I S+ Y Sbjct: 1 MKALFTYGLFELNKRIRYNEDTIIRLANRGKMNQMKSIVLAGDYGYIRQIETTIKSLCCY 60 Query: 54 NEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSLP---STKNWTHAI 110 +E L ++F + + + D LR + + + Sbjct: 61 HED--LLIYVFNQDIPQEWFINTRKKVKGTGNNLFDIKLLRDDLRMKWEESTYSHINYMA 118 Query: 111 YFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAH 170 Y R+ I +Y KA + LYLD D++ ++ L D +A V Sbjct: 119 YARYFIPEYV--KADRALYLDCDLVVTQNLDHLFELDLEDYYIAAVRATFGLGIG----- 171 Query: 171 SLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLAD 230 FNSG +L+N +W + + + + + + I+++ DQ +LNML + Sbjct: 172 -----------FNSGVMLLNNKRWREENIPQQLVELTDRE--IERVLEGDQSILNMLFKE 218 Query: 231 KLIFADIKYNTQFSLN-----YQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQA 285 + + + YN Q + Y F P++ +HYI KPW+ + + Sbjct: 219 QYLELEDSYNFQIGFDMGAAQYGHDFVFDIPLSPLPAIVHYISALKPWNLLTNMR-LREV 277 Query: 286 FMEA 289 + Sbjct: 278 WWFY 281 >UniRef50_A8FNA2 Putative uncharacterized protein n=2 Tax=Campylobacter jejuni subsp. jejuni 81116 RepID=A8FNA2_CAMJ8 Length = 791 Score = 177 bits (450), Expect = 5e-43, Method: Composition-based stats. Identities = 55/297 (18%), Positives = 103/297 (34%), Gaps = 22/297 (7%) Query: 29 LDIAYGTDKNFLFGCGISIASILK-YNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRI 87 + I + D N+ + + SI + +E +I + + I Sbjct: 383 IPIVFSCDANYFSYLTVVLQSIKEKSSENYNYDIYILHNKLDKSLTQKLINYIQAENFSI 442 Query: 88 KIYLING-----DRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEP 142 K I+ + ++ A Y+RF I F + K++YLD DII + + Sbjct: 443 KFVDISRILNLLKSQIQFYTALFFSEATYYRFFIPKIF-KEFKKIIYLDTDIIVKQDLNL 501 Query: 143 LINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSAR 202 L + F A + + YF +G ++ N + + + Sbjct: 502 LYSIDFDKPLAAAKCMIFSQVKQADHRITKLKMKQPENYFQAGVMVYNIQKCLKMDFTQK 561 Query: 203 AIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKE----------S 252 + L E +K DQDVLN + + + +K+N ++++Y++ Sbjct: 562 CLNKLQE---LKDPPLVDQDVLNAVFEGDIHYISLKWNCLWNVSYRIPNFKILYSKDFLK 618 Query: 253 FINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQL 309 D IHY KPW+ + P + + +P+ L K N L Sbjct: 619 DYQEAERDPYIIHYCDYFKPWN--SPHLPKADIWWHYARQTPFYEEILFKNITQNSL 673 >UniRef50_A8LP95 Lipopolysaccharide 1 n=1 Tax=Dinoroseobacter shibae DFL 12 RepID=A8LP95_DINSH Length = 342 Score = 177 bits (449), Expect = 6e-43, Method: Composition-based stats. Identities = 62/312 (19%), Positives = 122/312 (39%), Gaps = 22/312 (7%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIY 90 + + +D+ +L + I + + I I+ Sbjct: 38 VCFCSDEGYLPFALFAALQIHRLHPDRCFDLVIAHTG-------PLSVPHGFPGIGIRYV 90 Query: 91 LIN-GDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGT-IEPLINFSF 148 I+ G L T + Y R ++ + ++LY+D+D+ + L+ Sbjct: 91 EIDTGGCFERLALDARRTGSTYLRLALSGALGHDYQRILYMDSDVFALRDGLHVLLFTDM 150 Query: 149 PDDKVAMVVTEGQADWWEKRAHSLGVAGIA-KGYFNSGFLLINTAQWAAQQVSARAIAML 207 +A V Q ++ L + + YFN+G LL++TA+ Q + A+A+ + Sbjct: 151 RGKPLAAVRDNSQWRTSGRKPDDLVTLNLPARPYFNAGVLLMDTARLNEQDILAKALDLG 210 Query: 208 NEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYI 267 ++ DQ +LN + + ++N QF+ S+I ++ D +H+I Sbjct: 211 TSQ--AGRLARHDQTLLNAVTSGNWAEMSPRWNWQFTW-----ASWIFALSEDARILHFI 263 Query: 268 GPTKPWHDWAWDYP--VSQAFMEAKNASPWKNTALLKPNNS--NQLRYSAKHMLKKHRYL 323 GP KPW D + +P +++A+ + + + + NS N R K ++K Sbjct: 264 GPNKPWADTSGRFPKSITRAYGDFLAEQ-FPERTVERAANSPINDPRRLIKSLIKHGLSR 322 Query: 324 KGFSNYLFYFIE 335 K S YL F + Sbjct: 323 KKMSAYLARFAD 334 >UniRef50_C4Z1V1 Putative uncharacterized protein n=1 Tax=Eubacterium eligens ATCC 27750 RepID=C4Z1V1_EUBE2 Length = 607 Score = 176 bits (448), Expect = 6e-43, Method: Composition-based stats. Identities = 59/326 (18%), Positives = 112/326 (34%), Gaps = 32/326 (9%) Query: 6 FQETEFLNSVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNE-GSRLCFHIF 64 E + L + +H+ + + + ++ + + + S+ + R + Sbjct: 250 LHEYDELLDSYNREHEEYMAVSRIPVFFSINEQYAPYLAVCLKSLAVHVACDERYRIIVM 309 Query: 65 TDYFGDDDRKYFDALALQYKTRIKIYLIN----------------GDRLRSLPSTKNWTH 108 D + + Y+ I I ++ DR + + +T Sbjct: 310 CDNVKNITMIQLRNVIKDYE-NIDIEFVDIRKKMYEYSESFGQTVTDRQENRLYSGEFTL 368 Query: 109 AIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKR 168 IYFR IA+ F + K +Y+D+D + I L + D V Sbjct: 369 TIYFRLFIAELFP-ELNKAVYIDSDTVINDDIAKLYSVDMGDAMFGAVRDTFAGKNTILA 427 Query: 169 AHSLGVAGIAKG-YFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNML 227 + V GI + Y NSG LL+N + ++ R + ++ E PDQD +N + Sbjct: 428 HYIENVVGIERNEYVNSGVLLMNLDKIRQAHLADRFLKLMAEYHFDS--VAPDQDYINSM 485 Query: 228 LADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFM 287 A ++ F D ++N + + IHY KPWH + P + F Sbjct: 486 CAKEIYFLDKEWNVMPNKGGEY--------IARPKLIHYNLFDKPWHY--SEIPYEEYFW 535 Query: 288 EAKNASPWKNTALLKPNNSNQLRYSA 313 + S + + + A Sbjct: 536 QYAAESGFYPLLIKQRKQYGDNEKKA 561 Score = 105 bits (263), Expect = 2e-21, Method: Composition-based stats. Identities = 56/295 (18%), Positives = 100/295 (33%), Gaps = 63/295 (21%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDR----------KYFDA 78 ++I Y DK G +S S++K N L +I T +G+ KY + Sbjct: 1 MNILYCGDKTMQKGILLSSMSLIK-NVDEPLNIYILTVDYGEKGINYKPVDKAFAKYLEE 59 Query: 79 LALQYKTRIKIYLING-----DRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDAD 133 + ++ ++L++ + L +T R I +VLYLD D Sbjct: 60 KLNKSDIKVNVFLVDVTRYFVEELPEANMQSRFTACCMLRLFADKTDIK--DRVLYLDTD 117 Query: 134 IICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQ 193 ++C+ + + ++A V G GY NSG +L+N Sbjct: 118 VLCRKGFRDFYHQNMDGIEIAGVSD------------YYGRWLFGDGYINSGVMLMNMRM 165 Query: 194 WAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESF 253 + + E I K++ PDQ +N A ++ K+N Q L+ Sbjct: 166 IRQNGLLEKC----REQCIRKEMFMPDQTAVN-TFATRVNLCGRKFNDQRRLH------- 213 Query: 254 INPVTNDTIFIHYIG-----------PTKPW-----HDWAWDYPVSQAFMEAKNA 292 ++T+F H+ KPW H+ + + Sbjct: 214 -----DNTVFQHFTTTFRVFPVIRTVSVKPWEIDKMHNILGLHEYDELLDSYNRE 263 >UniRef50_Q50FU8 Cj81-079 (Fragment) n=6 Tax=Campylobacter jejuni RepID=Q50FU8_CAMJE Length = 333 Score = 176 bits (447), Expect = 9e-43, Method: Composition-based stats. Identities = 69/339 (20%), Positives = 119/339 (35%), Gaps = 54/339 (15%) Query: 30 DIAYGTDKNFLFGCGISIASILKYNE------GSRLCFHIFTDYFGDDDRKYFDALAL-- 81 +I D N++ + IASI+K + F+I ++ ++ L Sbjct: 6 NIVISCDNNYVKYVAVVIASIIKNTKINSQLKEYPYKFYILSNDISKNNILKLKKLIQHL 65 Query: 82 ---QYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQG 138 Y + I+ I+ + P + HA Y+RF IAD + K LYLDAD++ G Sbjct: 66 SNSYYNCELIIHKIDDSKFHRFPKAWHVNHATYYRFEIADIV--EGNKCLYLDADVLVCG 123 Query: 139 TIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLG-----VAGIAKGYFNSGFLLINTAQ 193 I L ++KVA VVT+ + W K + YFN+G +LI+ Q Sbjct: 124 DIRELFYMEL-NNKVAGVVTDSCSRLWTKLYTKDNKTSSYIEFDPLMYFNAGVILIDLNQ 182 Query: 194 WAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQF---------- 243 W + + I N + DQ LN+ L + + +N Sbjct: 183 WKKHDIKNKCIDAFNIYDHGG---LADQSYLNIALKELTYKLPLNWNLIVPEYILLDGYE 239 Query: 244 ------------SLNYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKN 291 N S + +H+ KPW W+ ++ Sbjct: 240 RHYVVNCLDEISEYNLAYTRSEFEEAMKNKKIVHFC-AAKPW----WNLYYKNNKVDFNE 294 Query: 292 ASPWKNTALLKPNNSNQLRYS-----AKHMLKKHRYLKG 325 + W AL + + +KH+ ++ ++ Sbjct: 295 RNVWWEIALNLEEFKEEFYFLKNSLDSKHLNRQLNTIEW 333 >UniRef50_UPI0001693121 general stress protein n=1 Tax=Paenibacillus larvae subsp. larvae BRL-230010 RepID=UPI0001693121 Length = 352 Score = 175 bits (445), Expect = 1e-42, Method: Composition-based stats. Identities = 55/268 (20%), Positives = 92/268 (34%), Gaps = 27/268 (10%) Query: 36 DKNFLFGCGISIASILKYNEGSRLCFHIFTDY-FGDDDRKYFDALALQYKTRIKIYLING 94 D + G +AS+ S + HI D + +++ L + I Y + Sbjct: 12 DGAYAEHAGAVLASVFCNTS-SSVNVHILHDETLTEANKQKLIELTSSFNQTIHFYPVTI 70 Query: 95 -----DRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFP 149 + + S WT A +R +I K++YLD D++ I L Sbjct: 71 PDNMLQAMAGVKSISFWTQASMYRLLIPALIP--VDKIIYLDCDVLVNMNIAELWEVQLG 128 Query: 150 DDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQ-QVSARAIAMLN 208 D +A V + + H + YFNSG +L + + L Sbjct: 129 DFYLAAVWDQAIMAAVQ---HIIPYGLNPDSYFNSGVILFALNNIRKKIDWYEEMLNFLR 185 Query: 209 EPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIG 268 + PDQD LN + + + D ++N F F N +H+ G Sbjct: 186 RY---PDTSMPDQDTLNAVFGENYLQLDRRFN--FFNMVSPHHDFNN------KIVHFAG 234 Query: 269 PTKPWHDWAWDYPVSQAFMEAKNASPWK 296 K W P + + E + +PWK Sbjct: 235 SEK---CWDVHSPGANLYQEYLSLTPWK 259 >UniRef50_C2LRU0 Glycosyl transferase, family 8 n=1 Tax=Streptococcus salivarius SK126 RepID=C2LRU0_STRSL Length = 402 Score = 175 bits (445), Expect = 2e-42, Method: Composition-based stats. Identities = 44/282 (15%), Positives = 95/282 (33%), Gaps = 30/282 (10%) Query: 30 DIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKI 89 + + + +++ +++ S+ + + ++ + + + + I Sbjct: 5 SVVFVAELSYMEKLEVALKSLCAH--KGQWKIYVLNENLPTEWFTLMNRRLEAIDSEILN 62 Query: 90 YLINGDRLRSLP-STKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSF 148 ++ + + + + +A +FR+ I ++ +VLYLD D+I + PL Sbjct: 63 CRVSAESFKQFSLPSAHIHYATFFRYAIPEFVQEN--RVLYLDCDMIFTQDLSPLFEVDL 120 Query: 149 PDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLN 208 + VV FN+G ++I+T W +V+ + Sbjct: 121 GGLGIGAVVDRPTTTDG----------------FNAGLMVIDTDWWRQHKVTDSLFDLTK 164 Query: 209 EPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLN----YQLKESFINPVTNDTIFI 264 E + DQ +LN+ D YN Q + + + I Sbjct: 165 EHHQN---VYGDQGILNLYFKDAWYQLPWTYNLQVGSDKDQYGYGDLEWYDAFKGVPAVI 221 Query: 265 HYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNS 306 HY KPW ++ + S W+ L KP+ Sbjct: 222 HYTSHNKPWTSKRFNR-FRDIWWFYYALS-WEEILLRKPSLK 261 >UniRef50_A3YS36 Putative sugar transferase n=2 Tax=Campylobacter jejuni RepID=A3YS36_CAMJE Length = 459 Score = 169 bits (429), Expect = 1e-40, Method: Composition-based stats. Identities = 69/342 (20%), Positives = 119/342 (34%), Gaps = 40/342 (11%) Query: 30 DIAYGTDKNFLFGCGISIASILKYNEGS---RLCFHIFTDYFGDDDRKYF----DALALQ 82 I + + ++ + + SI+ S + CFHI + D+ K L+ Sbjct: 3 HIVFNSSNEYIENLSVLMYSIIINTNKSNTKKYCFHILSSNINDNTCKKLTLLEKELSSI 62 Query: 83 YKTRIKIYLINGDRLRSLPSTKN-WTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIE 141 Y + IKIY IN + K+ ++ Y R ++A K LYLD D++ G I Sbjct: 63 YPSEIKIYHINDNLFYDYNIPKHEGSYNAYLRLMLASILSKDIKKCLYLDVDMLVLGDIS 122 Query: 142 PLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSA 201 L + D A V S + I +FNSG +LIN W + + + Sbjct: 123 ELFDLDLKDKVFAAVFILKHPWPNLNSKDSSEIFYIYGSHFNSGLMLINLDAWREKNIES 182 Query: 202 RAIAMLNEPEIIKKITHPDQDVLNMLL-ADKLIFADIKYNTQFSLNYQ------------ 248 R+++ + + + D+ VLN +L D + +++N Sbjct: 183 RSLSFIKNYYVPYAV---DEYVLNAILSKDDIFSLKLEWNFLIGFRRLYLNNDLFFNKEE 239 Query: 249 --------LKESFINPVTNDTIFIHYIGPT--KPWHDWAW--DYPVSQAFMEAKNASPWK 296 + +HY KPW + D + + E +A W Sbjct: 240 GDKYKIICYSKEEFEKAFKKIKILHYTYLYMPKPWENVYSFIDDDYNLVYYEFYDA--WW 297 Query: 297 NTALLKPNNSNQLRYSAKHMLKKHR--YLKGFSNYLFYFIEK 336 + AL P + KK Y + S + +K Sbjct: 298 DMALKTPIYGEHFAKKKREYEKKSLLTYAQAMSKKIKALEKK 339 >UniRef50_Q046Z9 Lipopolysaccharide biosynthesis glycosyltransferase n=32 Tax=Lactobacillus RepID=Q046Z9_LACGA Length = 317 Score = 169 bits (428), Expect = 1e-40, Method: Composition-based stats. Identities = 54/299 (18%), Positives = 117/299 (39%), Gaps = 25/299 (8%) Query: 27 LCLDIAYGTDKNFLFGCGISIASILKY-NEGSRLCFHIFTDYFGDDDRKYFDALALQYKT 85 + + + Y N+ +SI S++ + ++ + D +K + L+++ Sbjct: 2 MTIPVFYTISDNYTPYAAVSIQSLIDHVDQNKDYTITLLVQNISDKHKKDLEDLSIK-NV 60 Query: 86 RIKIYLINGDRL-------RSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQG 138 + I+ I+ + + + + +T +I++R I + F + K +YLDAD I Sbjct: 61 HVNIFHIDDEMVAPIHNSEENYLRAQFFTMSIFYRLFIPNLFP-QYDKAVYLDADTIICT 119 Query: 139 TIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGI--AKGYFNSGFLLINTAQWAA 196 I L N D+ A V + + GI + Y N+G +L N + Sbjct: 120 DIAELYNTEIGDNMFASVPDMSIRFIKPLQVYIKECQGIFPPEKYINNGVILFNMKAFRD 179 Query: 197 QQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINP 256 ++ + +++ + PDQ +N + DK+ ++++ + + Sbjct: 180 KKFVDKFYSLIEKYHFDN--IDPDQAYMNEICEDKIYHLPLEWDAMPNEHMDE------- 230 Query: 257 VTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLK-PNNSNQLRYSAK 314 + +HY KPWH Y + F + SP+ + N +++ R A+ Sbjct: 231 -IKNPKIVHYNLFFKPWHFADVQY--GKYFWDVAKKSPYYGELKEQLANFTDEDRKKAR 286 >UniRef50_Q16CW9 Lipopolysaccharide 1,3-galactosyltransferase, putative n=7 Tax=Rhodobacteraceae RepID=Q16CW9_ROSDO Length = 329 Score = 169 bits (428), Expect = 1e-40, Method: Composition-based stats. Identities = 61/262 (23%), Positives = 105/262 (40%), Gaps = 34/262 (12%) Query: 31 IAYGTDKNFL---FGCGISIASILKYNEGSRLCF---H---IFTDYFGDDDRKYFDALAL 81 I + D+N+L IAS+++ +C H + D Sbjct: 25 IVFCCDQNYLVFAAHAAAQIASLVEK-PEFDICICYGHQAVVLPDSLA------------ 71 Query: 82 QYKTRIKIYLIN-GDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQ-GT 139 I++ ++ GD L K TH +Y R + F + K+LYLD+DI Q G Sbjct: 72 --GLGIRLCHVDVGDVFEGLRLDKGKTHDVYLRIALPTAFAGEYDKILYLDSDIFVQGGD 129 Query: 140 IEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGI-AKGYFNSGFLLINTAQWAAQQ 198 L + +A V Q +++ + GI YFN+G +L++ + Q+ Sbjct: 130 FNALFDIDVAPHCIASVRDNVQWRTPKRQNKRNTIKGIPPSAYFNAGVMLMDVQAYTEQE 189 Query: 199 VSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVT 258 + R + + + DQ++ N +L + +N Q+S + +L F P Sbjct: 190 LMRRCVEFGRARR--RDLKRHDQNLYNAVLQNDWAEISPVWNWQYSWSTRLFAVFAYPN- 246 Query: 259 NDTIFIHYIGPTKPWHDWAWDY 280 IH+IGP KPW D + + Sbjct: 247 ----IIHFIGPAKPWKDESGQF 264 >UniRef50_C6EQF4 Putative uncharacterized protein n=3 Tax=Campylobacter jejuni RepID=C6EQF4_CAMJE Length = 958 Score = 168 bits (426), Expect = 3e-40, Method: Composition-based stats. Identities = 54/293 (18%), Positives = 103/293 (35%), Gaps = 38/293 (12%) Query: 29 LDIAYGTDKNFLFGCGISIASILKY-NEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRI 87 + I + D N+L I++ S++ + + + + I Sbjct: 13 IPIVFAVDDNYLPYMSIALNSLVDRVSNCYKYNIFVMHLNIDLERLNRLKENIRNNNVTI 72 Query: 88 KIY-------LINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTI 140 + I + +T A+Y+R I + F + KV+Y D+D+I + I Sbjct: 73 EFINLNQYLKKIFKEYGNIFYERSYFTTAMYYRIFIPEIF-SNFKKVIYCDSDVIFKADI 131 Query: 141 EPLINFSFPDDKVAMVVTEGQADWWEKR----------AHSLGVAGIAKGYFNSGFLLIN 190 L + ++ + KR YFNSG ++ + Sbjct: 132 SHLFFIDLNNKEIGACRDIAALYAYRKRETVWQQNIRNNFDKINFRSISDYFNSGVIVFD 191 Query: 191 TAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLK 250 + + ++ + ++ I + PDQDVLN++ + F +++N ++ + K Sbjct: 192 IVKCIQMKTVSKCLTVIKN---IDNLYFPDQDVLNIVFCGHVHFLPLEWNFLWTTYIEYK 248 Query: 251 ESF----------INPVTNDTIFIHYIGPTKPWHD------WAWDYPVSQAFM 287 ++F I IHYI TKPW D W +P F Sbjct: 249 DNFMYLPKKIINEIYKAKTKPKIIHYISETKPWKDKNSFFVEWWKFPRKNLFY 301 >UniRef50_D1N145 Glycosyl transferase family 8 n=1 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1N145_9BACT Length = 311 Score = 166 bits (422), Expect = 8e-40, Method: Composition-based stats. Identities = 64/297 (21%), Positives = 114/297 (38%), Gaps = 20/297 (6%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 + +A TD+N+L ++ AS+L + G + H+ + + D F+AL R+ Sbjct: 5 IQVAMATDRNYLDYALVAAASLLAQHPGGGITLHLLHEELDESDFARFEALRRIDGFRLV 64 Query: 89 IYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSF 148 I + P W+ + Y+R ++ K+LYLD D++ I L N Sbjct: 65 PRKIERGFFQGWPEL-RWSTSAYYRLILPSLLP-DLEKILYLDCDLLVLDDIAELWNTEL 122 Query: 149 PDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLN 208 A + +K YFNSG +L N + A + R I + + Sbjct: 123 GSRSCAAAAVRVAPEHQKKIG-----LPAEAVYFNSGVMLFNLRKMAHENHEKRFIRLFD 177 Query: 209 EPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQ------LKESFINPVTNDTI 262 E + +I +PDQD+LN+ + + ++N S+ E+ + Sbjct: 178 E--LGGRIKYPDQDILNLAYWNDYVKLSQRWNLVTSVYRNPPTPALYSEAEVVEALRRPG 235 Query: 263 FIHYIGPTKPWH-DWAWDYPVSQAFMEAKN----ASPWKNTALLKPNNSNQLRYSAK 314 H+ G KPW +P ++ F P++ LK + L+ K Sbjct: 236 IAHFTGTHKPWRLGKTTHHPYARYFRAYAELAGLPLPFRLKLALKSLLTGSLKPPKK 292 >UniRef50_B8PIH6 Predicted protein n=2 Tax=Agaricomycetes RepID=B8PIH6_POSPM Length = 532 Score = 163 bits (412), Expect = 1e-38, Method: Composition-based stats. Identities = 53/293 (18%), Positives = 109/293 (37%), Gaps = 47/293 (16%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFD-ALALQYKTRI 87 ++IA TD + ++I S++ + + SRL ++ GD+DR ++ + + Sbjct: 227 MNIAIATDPAYAMAAAVAIHSVIAHTK-SRLTIYVLDLGLGDNDRNKLRRSMPRRADATM 285 Query: 88 KIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 ++ +++ A + + + D +VLYLDAD++ + I L + Sbjct: 286 VFIPLD-------YASERKEKATWAKIDMIDVLP--VERVLYLDADVLVRADIWGLWSTD 336 Query: 148 FPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAML 207 + + G + + K YFN+G LL++ A + +A+ Sbjct: 337 LRGKPIGAAIDVGFPEGHN--------GTVRKPYFNAGVLLLDLAAVRR---TLQALQGA 385 Query: 208 NEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNY-----QLKESFINPVTNDTI 262 + DQD+LN +K+N Q Y + +++ + + Sbjct: 386 AREYTTSRFR--DQDLLNAYFEANWAEVSLKWNAQGIATYAELPTEARQNIDMGLLKNPY 443 Query: 263 FIHYIGPT-----------------KPW-HDWAWDYPVSQAFMEAKNASPWKN 297 +H+ GP KPW + A +P + + + WK Sbjct: 444 IVHFTGPVNPTLEVVLNPYIQPYTAKPWGYAGAPGHPHGEEWWNVVEQTAWKG 496 >UniRef50_C3XFW0 Glycosyl transferase n=1 Tax=Helicobacter bilis ATCC 43879 RepID=C3XFW0_9HELI Length = 365 Score = 161 bits (408), Expect = 3e-38, Method: Composition-based stats. Identities = 54/239 (22%), Positives = 84/239 (35%), Gaps = 27/239 (11%) Query: 57 SRLCFHIFTDYFGDDDRKYFDAL----ALQYKTRIKIYLINGDRLRSLPSTKN---WTHA 109 FH+ TD + F L Y +I+ ++I+ + + LP +A Sbjct: 32 EGYHFHVITDSIAKKTLEQFHILQTTLNDIYPCQIEAHIISDEDFKDLPKWGYEEAQQYA 91 Query: 110 IYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRA 169 Y+R + D+ K LYLD D++ + L + A A Sbjct: 92 AYYRVKLVDFLPKNVDKCLYLDTDMLVLTDLRELFALNLDGYIAASSSGSPNATISRYGI 151 Query: 170 HSLGVAGI-------AKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQD 222 + G YF SG +LINT +W Q V A+ L E E DQD Sbjct: 152 YRKKKGGKKAVKSFETSFYFCSGLMLINTKEWIKQNVDIEAMRFLREYET----EFADQD 207 Query: 223 VLNMLLADKLIFADIKY--------NTQFSLNYQLKESFINPVTNDTIFIHYIGPTKPW 273 LN + D++ ++ S N + + + N +H GP K W Sbjct: 208 ALNFAMCDRVYNLGEQWGILAYQSLEAACSTNIDFSKRYEKAMIN-AKILHCNGPAKAW 265 >UniRef50_B2UQN6 Glycosyl transferase family 8 n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UQN6_AKKM8 Length = 371 Score = 158 bits (400), Expect = 2e-37, Method: Composition-based stats. Identities = 57/350 (16%), Positives = 117/350 (33%), Gaps = 37/350 (10%) Query: 21 KVETENLCLDIAYGTDKNFLFGCGISIASILKY-NEGSRLCFHIFTDYFGDDDRKYFDAL 79 E + + + + G++I ++ + + G HI D + + + Sbjct: 13 PASPEKSRIPVMFSATGGWGLPLGVAIHTLCLHASSGRFYDIHIVHDGMDARIIQELNQV 72 Query: 80 ALQYKTRIKIYLINGDRLRSLPST---KNWTHAIYFRFVIADYFINKAPKVLYLDADIIC 136 A + +L + R L ++ Y R + F + +++YLDAD++ Sbjct: 73 AAPFPQVSLSFLQLPEEFRHLFQNGNKDRYSPLAYARLMAGSLFP-QYGRIVYLDADVLL 131 Query: 137 QGTIEPLINFSFPDDKVAMVVT------EGQADWWEKRAHSLGVAGIAKGYFNSGFLLIN 190 G + L VA + + Y NSG L+++ Sbjct: 132 AGDVAELYFSDLRGASVAAAGDGLALWSIEKGTMHPHLEYMGNYLSFPLSYCNSGVLVLD 191 Query: 191 TAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQF-SLNYQL 249 Q + + R + L +PDQD+LN+ L + ++N QF S + Sbjct: 192 LDQMRRRNLEHRLLQQLR--SRPDPFPYPDQDILNIALHGDMTTLPPEWNFQFLSWTWDE 249 Query: 250 KESFI---NPVTNDTIF--------IHYIGPTKPWH---------DWAWDYPVSQAFMEA 289 +++ + N +H +GP KPW + W + EA Sbjct: 250 EKTRLLRGTEFENVPTISCGRSWKLLHMVGPEKPWRLPDTPGTMGQFHWILYSFFWWPEA 309 Query: 290 KNASPWKNTALLKPNNSNQLRYSAKHML-KKHRYLKGFSNYLFYFIEKIK 338 K ++ L + +H+ ++ + + +KI+ Sbjct: 310 KRLPVFREE--LDAISQGLAPLLQRHIRGQQWKLFFSRGHIFRKRRDKIR 357 >UniRef50_C6IJ37 General stress protein A n=2 Tax=Bacteroides RepID=C6IJ37_9BACE Length = 309 Score = 158 bits (400), Expect = 2e-37, Method: Composition-based stats. Identities = 59/275 (21%), Positives = 109/275 (39%), Gaps = 27/275 (9%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFD-ALALQYKTRIKI 89 IA+ D + + SI+ + H+ ++ + + L + R Sbjct: 10 IAFTPD--YFIPAATCLYSIITSMQAEG-ELHVICL-LSEELPERLKLKIQLIGEGRTCY 65 Query: 90 YLINGD-RLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLIN-FS 147 +N +L+ + + +T A +R ++ D + KV+Y+D DII + + L + Sbjct: 66 SFVNLQGKLQHIYIDQKYTEAASYRLLLPDLLP-EYKKVIYIDCDIIVRNDLVQLYHSID 124 Query: 148 FPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAML 207 + +A V E D+ ++G Y NSGFL++N + + I Sbjct: 125 LGMNYLAAVF-EASMDFQLDHLKTIGCN--PNEYINSGFLIMNLELMRKDNMVEKFI--- 178 Query: 208 NEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLN-YQLKESFINPVTNDTIF--- 263 E + + PDQDVLN L D+++ YN+ + Q K+ F+ T Sbjct: 179 -EASKVDYLEFPDQDVLNQLCKDRILALPPYYNSIRTFYLPQYKKFFLQKYTEQDWLEVH 237 Query: 264 ----IHYIGPTKPWHDWAWDYPVSQAFMEAKNASP 294 +HY G KPW+ + + Q + + P Sbjct: 238 RHGTVHYTG-AKPWNQFTVQF---QLWWQYYEQLP 268 >UniRef50_UPI00016B2258 glycosyl transferase, family 8 n=1 Tax=candidate division TM7 single-cell isolate TM7c RepID=UPI00016B2258 Length = 327 Score = 158 bits (400), Expect = 3e-37, Method: Composition-based stats. Identities = 61/339 (17%), Positives = 122/339 (35%), Gaps = 36/339 (10%) Query: 24 TENLCLDIAYGTDKNFLFGCGISIASILKYNEGSR-LCFHIFTDYFGDDDRKYFDALALQ 82 L++ Y +D N+ ISI S+++ N+ + + D F+ + Sbjct: 1 MNKGILNVIYQSDDNYAVVSAISIVSLMENNKHLKQINIFYLGHQLKKDSINKFNKMVGN 60 Query: 83 Y-KTRIKIYLIN--GDRLRSLPSTKNWT--HAIYFRFVIADYFINKAPKVLYLDADIICQ 137 Y I ++ D L+ + K W + +++ + K ++LY++ + Sbjct: 61 YHNATITFVDVSSYPDELKEIGV-KAWKGLYITWYKMLAFAKLDIKTDRILYINPHTVIS 119 Query: 138 GTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQ 197 G ++ L+ F D+ +A+ + +G+ I GYFN G +LIN +W Sbjct: 120 GALDGLLELDFEDNVMALSYDATMVNA---HKDVIGLKPI-DGYFNCGIMLINHKKWMKD 175 Query: 198 QVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPV 257 ++ A+ L DQD+ N+ + ++YN + +I Sbjct: 176 KIDAKMREHLRYNH----YEVADQDLCNVFFKGNIKKVGVEYNFSTVFYGYDIKKYIKAN 231 Query: 258 T----------------NDTIFIH--YIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTA 299 IH + +PW + PV + + N +PWKN Sbjct: 232 GFLPESFYSYDEIMESYYTPKIIHSQFGMNGRPWQQ-GNENPVGILWRKYLNLTPWKNAT 290 Query: 300 LLKPNNSNQLRYSAKHMLKKHRYLKGFSNYLFYFIEKIK 338 P + + +L + +K ++ + K K Sbjct: 291 --MPVAKKDMNWLLYDLLPQSIIVKLYAWAVNRKFAKTK 327 >UniRef50_C5SH34 Glycosyl transferase family 8 n=1 Tax=Asticcacaulis excentricus CB 48 RepID=C5SH34_9CAUL Length = 307 Score = 158 bits (400), Expect = 3e-37, Method: Composition-based stats. Identities = 60/304 (19%), Positives = 108/304 (35%), Gaps = 40/304 (13%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIY 90 I Y D N+LF +S AS + N S L I D + + + I++ Sbjct: 6 ICYVVDDNYLFPTLVS-ASQARENAPSSLA-DIVILCLSDASDRVRKVMPVAVALGIELI 63 Query: 91 LINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPD 150 + + +L H +Y R I +VLY+D D ++EPL+N P+ Sbjct: 64 EVPTASIENL-------HPMYGRLFIDKLLPKAYERVLYIDGDTQIAASLEPLLNVDIPE 116 Query: 151 DKVAMVVTEG-----QADWWEKRAHSLGVA-----GIAKGYFNSGFLLINTAQWAAQQVS 200 K V +D W R V + Y N+G L+ N WA ++ Sbjct: 117 GKFLAVRDPAAMFAKLSDKWASRIQGERVEAGLGDNPIEDYLNTGVLVFNMKDWAE--LA 174 Query: 201 ARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTND 260 + ++ DQD +N+ + D+ ++ ++N L +E + Sbjct: 175 GETLKLIRARSTP--FKFGDQDPMNLAIGDRCLYISNRWNFPGFLIGSGQEERVK----- 227 Query: 261 TIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPN-----NSNQLRYSAKH 315 + H++ +PW P + +P+K P + +H Sbjct: 228 PVIYHFMSNPRPWVHAGA--PWGPKWH-----TPYKAFLARFPVLESVAPKTTPVKALRH 280 Query: 316 MLKK 319 L++ Sbjct: 281 HLQQ 284 >UniRef50_C0EQT1 Putative uncharacterized protein n=1 Tax=Neisseria flavescens NRL30031/H210 RepID=C0EQT1_NEIFL Length = 212 Score = 158 bits (399), Expect = 3e-37, Method: Composition-based stats. Identities = 59/223 (26%), Positives = 92/223 (41%), Gaps = 22/223 (9%) Query: 116 IADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVA 175 I + + VLYLD D++C G I L +A V + + + G Sbjct: 3 IPAILGDISDTVLYLDTDVLCLGDISELFTV-----ILAAVPETTLYRAYINKLNVFGFR 57 Query: 176 GIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKIT-HPDQDVLNMLLADKLIF 234 YFNSG LL N W + + E+ K I PDQD+LN+ K+ + Sbjct: 58 ST-DPYFNSGVLLFNNKFWNESSAYTVLNEKIRQVELSKFILACPDQDLLNLSCKGKVGW 116 Query: 235 ADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASP 294 YN +++ + S +N + +H+IG TKPWH + +PV +F SP Sbjct: 117 LPESYN---RIHWHHQGSELNTNPKNIRLVHFIGGTKPWHHLGF-HPVYDSFYR---KSP 169 Query: 295 WK--------NTALLKPNNSNQLRYSAKHMLKKHRYLKGFSNY 329 W N L PN + + +AK + K+ + + Y Sbjct: 170 WYNGYLHQKPNIDLPFPNPHKRYKQAAKRLFKQGNKKQAWLYY 212 >UniRef50_C7XX93 Glycosyl transferase, family 8 n=1 Tax=Lactobacillus coleohominis 101-4-CHN RepID=C7XX93_9LACO Length = 398 Score = 158 bits (399), Expect = 3e-37, Method: Composition-based stats. Identities = 48/270 (17%), Positives = 91/270 (33%), Gaps = 34/270 (12%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIY 90 I D ++ +I SI+ + ++ ++ + + Q + + Sbjct: 7 IVLSGDNHYTAQITTTIKSIVYHLRRVKI--YLINSDIPQEYFFNLNLRLKQLDSELVDL 64 Query: 91 LINGDRLRSLPSTK-NWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFP 149 IN + + S K + + Y R +I + LY+D+D I +I L Sbjct: 65 KINPELFSNAESPKAHISKITYGRLMIPQLV--TEDRALYIDSDAIVDQSISELWTMDLG 122 Query: 150 DDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNE 209 D +A V AD FN+G +L N + + + Sbjct: 123 DYPIAAVHDVFLADI-----------------FNAGIILFNNKKLRED---PDLVDNMLA 162 Query: 210 PEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKES------FINPVTN--DT 261 K I DQ VLN + + ++YN + + + + + N Sbjct: 163 AAQQKGILDADQTVLNQFFNHQYLELGLEYNYVIGYDRDVSLAPRNAPGYFEKMLNCPQP 222 Query: 262 IFIHYIGPTKPWHDWAWDYPVSQAFMEAKN 291 IHY P KPW+ + + + + + N Sbjct: 223 KIIHYASPDKPWNLQSAGR-MREKWWQYHN 251 >UniRef50_A7H2X4 Glycosyl transferase family 8 n=2 Tax=Campylobacter RepID=A7H2X4_CAMJD Length = 497 Score = 156 bits (396), Expect = 7e-37, Method: Composition-based stats. Identities = 70/337 (20%), Positives = 120/337 (35%), Gaps = 72/337 (21%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEG------------------SRLCFHIFTDYFGD 70 L I G ++ + I SI+K + CFHIFT+Y + Sbjct: 2 LHICIGVSAEYVKYSAVLINSIVKATQKPFDLKPYENNLSFTKDLKEGFCFHIFTEYKSE 61 Query: 71 DDRK---YFDALALQYKTRIKIYLINGDRLRSLPSTKNW--THAIYFRFVIADYFINKAP 125 D K L+ Y T+ I+++N + S W A++++ + D Sbjct: 62 DTEKIALLAHKLSEIYPTKCLIHVMNNQDFQDF-SYPFWCQNAAMFYKIKVVDIL-KDVD 119 Query: 126 KVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLG-----VAGIAKG 180 K L++ AD+ G + L D+ +A + D + ++A + V AK Sbjct: 120 KCLFIGADLFALGDVRDLFALDLKDNLIAAALDTYNFDGYLRKAKAKNSDEELVFNDAKN 179 Query: 181 YFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYN 240 Y N+ +LIN +W Q + A+ I LN+ ++ D DV ++ A K+ KYN Sbjct: 180 YINNDMMLINLKEWRKQNLQAKYIDYLNKYDLA-----GDLDVFPLVCAPKIHILSSKYN 234 Query: 241 TQ--------FSLNYQLKESFINP-----------VTNDTIFIHYI-GPTKPWHD-WAWD 279 F L LK+ P + D +H+ KPW + Sbjct: 235 FILGYYTRESFGLENTLKDESDKPVWNFTKVELEQIQKDLRLVHFCHYVYKPWMSAYNCH 294 Query: 280 Y----------------PVSQAFMEAKNASPWKNTAL 300 Y P + + + +P+ Sbjct: 295 YVYFNMGLDNDLKPIKVPYYKEWWDMALKTPFFEEDF 331 >UniRef50_A0ZYL4 Putative uncharacterized protein n=1 Tax=Archaeal BJ1 virus RepID=A0ZYL4_9CAUD Length = 286 Score = 155 bits (393), Expect = 2e-36, Method: Composition-based stats. Identities = 66/300 (22%), Positives = 129/300 (43%), Gaps = 27/300 (9%) Query: 27 LCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTR 86 + L++ Y + C IS S+L+ N+ + +I ++ D++ +F+ + Y++ Sbjct: 1 MTLNVCYIAGGDSWVPCYISAYSVLENNQDLDIHMYILSE--EDNNNPFFEHVEYLYESH 58 Query: 87 ----IKIYLINGDRLRSLPS-TKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIE 141 I+ ++ D+ LP+ K+ + +YF+ I VL LDAD IC G++ Sbjct: 59 PSLEIEFIEVDMDQFDDLPAPGKHLSPGVYFKIAINRLLPTDGN-VLLLDADTICDGSLS 117 Query: 142 PLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSA 201 L++ +A + + LG+ + FN+G L +N +WA Q + Sbjct: 118 SLLSLDLSGKVLAAAPSN------KAETVRLGLQN-NRAKFNAGVLYVNLQEWAKQDIEE 170 Query: 202 RAIAMLNEPEIIKKITHPDQDVLNMLLA--DKLIFADIKYNTQFSLNYQLKESFINPVTN 259 R+ + E E DQD LN L+ D + + +YN L + + V + Sbjct: 171 RSRQYIEEHEP----ELNDQDALNALVNNPDDMEYIHPRYNATKLLVREFEM-----VDD 221 Query: 260 DTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKK 319 + IHY GP KPW + + + E + +P+++ + A+ +++ Sbjct: 222 EPTIIHYNGPDKPWR-FVTERESGDLWWEYASKTPFRDYVPKDKGVKEIIFVRARSAMRR 280 >UniRef50_A1VG39 Glycosyl transferase, family 8 n=1 Tax=Desulfovibrio vulgaris DP4 RepID=A1VG39_DESVV Length = 335 Score = 155 bits (392), Expect = 2e-36, Method: Composition-based stats. Identities = 48/288 (16%), Positives = 111/288 (38%), Gaps = 27/288 (9%) Query: 28 CLDIAYGTDKNFLFGCGISIASILKYNEGSRL-CFHIFTDYFGDDDRKYFDALALQYKTR 86 + I + D N+ +++ S+ + + S ++ + D+ +++ + R Sbjct: 3 TVPIVFTFDANYRLPASVALQSLFENAKDSTYYHVYLVCEGLSRGDKDAIESICPEKNGR 62 Query: 87 IKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINF 146 ++ ++ + S PS++NW +Y R ++ KV+Y D D++ + + Sbjct: 63 VEWIDVDNELFSSAPSSENWPKIVYARILLPLLLP--FDKVIYSDVDVVFCSDLAEIFQI 120 Query: 147 SFPDDKVAMVVTEGQA-DWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIA 205 + A V E A R H++ + + SGF+++N + R + Sbjct: 121 EVDGCEWAGVAAELVAFQEGVARCHNVHCEYQNELIYMSGFMVMNLRLMREKDTVGRCLN 180 Query: 206 MLNEPEIIKKITHPDQDVLNMLLADKLIFADIKY----NTQFSLNYQLKESF-------- 253 +++ ++ D ++LNM +D + D Y N F+ N + + Sbjct: 181 NISK--FGSRLKMYDLEILNMS-SDNIARIDFSYCVLENVFFAKNVSEAKEYPWLRGLYR 237 Query: 254 ---INPVTNDTIFIHYIGPT-KPWHDWAWDYPVSQAFMEAKNASPWKN 297 + + IH+ G K W + Q + + SP+++ Sbjct: 238 VSELEAARSAPRIIHFAGSDTKVWERYCVP----QVYRKYLAVSPFRS 281 >UniRef50_Q1RIL1 Lipopolysaccharide 1,2-glucosyltransferase RfaJ n=10 Tax=Rickettsia RepID=Q1RIL1_RICBR Length = 530 Score = 154 bits (390), Expect = 4e-36, Method: Composition-based stats. Identities = 54/295 (18%), Positives = 103/295 (34%), Gaps = 30/295 (10%) Query: 25 ENLCLDIAYGTDKNFLFGCGISIASILKYNE-GSRLCFHIF---TDYFGDDDRKYFDALA 80 ++ LDIA + F IAS L ++ S FHI D ++ + ++ Sbjct: 246 QDNTLDIALIINDKFARHAATVIASSLINSDINSFYKFHIVMNPNDSLTEESMEKLASMK 305 Query: 81 LQYKTRIKIYLINGDRL------RSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADI 134 I + L + + W + +R F +LYLDADI Sbjct: 306 HIRDYSIDFIPFPENVLDLNLANEKIEFSDMWPPLVMYRLYFDQVFP-NLESILYLDADI 364 Query: 135 ICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQW 194 I + + VA + K I Y NSG + +N Sbjct: 365 IVLRDLNSFKKLDMSNYIVAGSMDTALTYCTLKVEEECNRK-INNFYKNSGIVFLNLQNM 423 Query: 195 AAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFI 254 +Q + ++ + +PDQD+LN+ + + +++N F + ++++ Sbjct: 424 REKQAKNMVLDAMHNSKC--SFAYPDQDLLNIAFHNYIYPLSMRWN--FYTYFIDRDNYF 479 Query: 255 NPVTNDTIFIHYIGPTKPWHDWAWDY---------PVSQAFMEAKNASPWKNTAL 300 + +HY G KPW++ + + + + + +PW N Sbjct: 480 SYF-----IMHYAGKKKPWNNEEIKWTKDILEKYQEIEKYYWRYREFTPWGNKDF 529 >UniRef50_UPI00016C0890 glycosyl transferase family 8 protein n=2 Tax=Epulopiscium sp. 'N.t. morphotype B' RepID=UPI00016C0890 Length = 593 Score = 154 bits (389), Expect = 5e-36, Method: Composition-based stats. Identities = 59/310 (19%), Positives = 106/310 (34%), Gaps = 30/310 (9%) Query: 31 IAYGTDKNFLFGCGISIASILKY-NEGSRLCFHIFTDYFGDDDRKYFDALALQY-KTRIK 88 I TD F+ G ++ S++K N + IF + + + +Q ++ Sbjct: 288 IVLTTDDRFIIGAAATLISLVKTSNVNNNYDIIIFHKDLSEKSKTLLRNVVVQRINFSLR 347 Query: 89 IYLINGDR--LRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINF 146 Y + + NW +YF+ +I ++ K L+LD D+I I L++ Sbjct: 348 FYDVGYEMSTYNVYKPGNNWQPCVYFKLLIPS-IMHNYKKSLHLDCDLIILEDIANLLSI 406 Query: 147 SFPDDKVAMVVTEGQA------DWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVS 200 + VA G W K H YFN G ++ N ++ Sbjct: 407 DLKGNAVAGCAEMGCITTSIRRTWANKYYHEKLRITNMVEYFNGGVIVFNINEFHKITSL 466 Query: 201 ARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFIN----- 255 A+ + E KK + +QD+L+ + + +N + + Sbjct: 467 AQLL-----HEAEKKHLNLEQDILSKSFVNHIYLLPQSWNLTRDFLGTVMNLYKQYLPSN 521 Query: 256 ------PVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQL 309 IHYIGP KPW + +Y + + + + A+ N Sbjct: 522 IYQKYLDARQKPKIIHYIGPLKPWDNPNLEY--ASYWWDTIRGTEIYEMAINSQIQKN-F 578 Query: 310 RYSAKHMLKK 319 + K KK Sbjct: 579 SENIKKTTKK 588 >UniRef50_Q38VG7 Putative glycosyl transferase, family 8 n=1 Tax=Lactobacillus sakei subsp. sakei 23K RepID=Q38VG7_LACSS Length = 304 Score = 153 bits (386), Expect = 1e-35, Method: Composition-based stats. Identities = 44/263 (16%), Positives = 91/263 (34%), Gaps = 27/263 (10%) Query: 43 CGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSLPS 102 ISIA++LK + + I T + K + L + I+ ++ + Sbjct: 1 MSISIATLLKKHMEDEINIFIITSNISEKYIKVIEGLFN--NPKHNIFWVSMPEIDIPLE 58 Query: 103 TKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQA 162 T + A Y R +++YLD D + + + L ++ + + Sbjct: 59 TDRGSLAQYGRLFFDRLIPENIQRLIYLDCDTLIEENLRELWVTDLGENTIGIARDA--- 115 Query: 163 DWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQD 222 + R L FNSG ++I+ W +++ R I +L E +I+ DQ Sbjct: 116 --FSDRYKKLLGLEKDSELFNSGVMIIDRGSWNEKRIEDRIIDLLTEKR--GRISQGDQG 171 Query: 223 VLNMLLADKLIFADIKYNTQFSLNYQLKESFIN--------------PVTNDTIFIHYIG 268 V++++ + D K+N+ S + F+ +H+ Sbjct: 172 VIDIIFQNDAKILDPKWNSMSSYFDFTYDDFLKYRQVKEFYSKQLILEAIQKPAIVHFTS 231 Query: 269 P---TKPWHDWAWDYPVSQAFME 288 +PW + + + Sbjct: 232 SFLNNRPW-IFGSTHRYKNHWRR 253 >UniRef50_C0X9Z7 Putative uncharacterized protein n=2 Tax=Lactobacillus gasseri JV-V03 RepID=C0X9Z7_9LACO Length = 416 Score = 152 bits (384), Expect = 2e-35, Method: Composition-based stats. Identities = 59/272 (21%), Positives = 102/272 (37%), Gaps = 25/272 (9%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIY 90 IA + ++ +I SIL + + H+ + + A Q +RI Sbjct: 5 IALSANYGYIDKIETTIKSILYNVKNVEI--HLLNYDIPQEWFANINRYANQIGSRIIDE 62 Query: 91 LINGDRLRSLPS-TKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFP 149 + + L L S K+ Y R +I KA +VLYLD+D++ I+ L + F Sbjct: 63 KFDPEELHDLNSGFKHINQMTYARLLIPKLI--KANRVLYLDSDLVVDDEIDELFSRKFN 120 Query: 150 DDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAA-QQVSARAIAMLN 208 K+ V + L V I N+G LLIN + +S + + Sbjct: 121 GKKILAVTHIFDVRNKNESRVDLPVPSI-----NAGVLLINNQELRKDHNLSEKLLDFAR 175 Query: 209 EPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQL---------KESFINPVTN 259 + + DQD +N D++ KYN Q + L + I Sbjct: 176 KNNFPQD----DQDTINNWFKDEIGSLSFKYNYQIGADRFLFWSNNSNTETATEILDKVK 231 Query: 260 DTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKN 291 + IHYI KP++ ++ + + + +N Sbjct: 232 NPKIIHYISDDKPFNIFSEGR-MRETWWFYRN 262 >UniRef50_A4SAB5 Predicted protein (Fragment) n=1 Tax=Ostreococcus lucimarinus CCE9901 RepID=A4SAB5_OSTLU Length = 259 Score = 149 bits (377), Expect = 1e-34, Method: Composition-based stats. Identities = 63/260 (24%), Positives = 109/260 (41%), Gaps = 20/260 (7%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYN-EGSRLCFHIFT--DYFGDDDRKYFDALALQYKT 85 + IA+ D LF G I+S+L R+ FHIFT D D + + + Sbjct: 3 VHIAFACDPTQLFTLGPVISSVLSATASPHRIRFHIFTARDALTDASVQ-LNCYSRAIPF 61 Query: 86 RIKIYLINGDRLR---SLPSTKNW---THAIYFRFVIADYFINKAPKVLYLDADIICQGT 139 +++ + D +R ++ S K W Y RF A+ + KV+YLD DII +G Sbjct: 62 IWELHEFSKDMIRANITVHSRKEWRLQNAFNYARFYFAEIL-SDVQKVVYLDTDIIVKGD 120 Query: 140 IEPLINFSFPDD---KVAMV-VTEGQADWWEKRAHSLGVAGIAKGY--FNSGFLLINTAQ 193 I L + + +A V + ++ +G+ + FN+G LLI+ Sbjct: 121 ICRLHDANLRSSSTSVIAAVKRSVPLGSLLNFSNAAVKSSGLREKMHSFNAGVLLIDLES 180 Query: 194 WAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESF 253 W +++++ L + K +H Q L ++ D +N + Y K+ Sbjct: 181 WRRKRITSTVETWLKMNSVSKLYSHGSQPPLLLVFGDSFESIPSHWNVD-GVGY--KKGL 237 Query: 254 INPVTNDTIFIHYIGPTKPW 273 V N+ +H+ G +KPW Sbjct: 238 RASVLNEARVLHWSGQSKPW 257 >UniRef50_C5WAK3 Ybl156 protein n=2 Tax=Enterobacteriaceae RepID=C5WAK3_ECOBB Length = 163 Score = 148 bits (375), Expect = 2e-34, Method: Composition-based stats. Identities = 45/164 (27%), Positives = 81/164 (49%), Gaps = 5/164 (3%) Query: 177 IAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFAD 236 + YFN+G + +N +W ++ + +L + + DQD LN+ I+ Sbjct: 1 MNGRYFNAGVIYVNLKKWHEANLTPYLLKLLRGETKYGSLKYLDQDALNIAFNMNNIYLA 60 Query: 237 IKYNTQFSLNYQLKE----SFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNA 292 ++T ++L +L + + +T+ T+ IHY G TKPWH WA YP + F A+ Sbjct: 61 KDFDTIYTLKNELHDRSHRKYQQTITDKTVLIHYTGITKPWHSWA-GYPSASYFNIAREQ 119 Query: 293 SPWKNTALLKPNNSNQLRYSAKHMLKKHRYLKGFSNYLFYFIEK 336 SPWK L + +++ KH+ Y+KG ++ + Y ++K Sbjct: 120 SPWKKYPLKEARTVAEMQKQYKHLFAHGEYIKGITSLIKYKLKK 163 >UniRef50_B9ADW8 Putative uncharacterized protein n=1 Tax=Methanobrevibacter smithii DSM 2375 RepID=B9ADW8_METSM Length = 223 Score = 148 bits (373), Expect = 3e-34, Method: Composition-based stats. Identities = 42/226 (18%), Positives = 79/226 (34%), Gaps = 26/226 (11%) Query: 64 FTDYFGDDDRKYFDALALQYKTRIKIYLINGDR------LRSLPSTKNWTHAIYFRFVIA 117 +++ +A Y I + L + +++ A Y + IA Sbjct: 1 MDSGIKKINKEKIRKIAHDYGADISFIHVADIEEKYNLTLNKMSVKGDFSLATYSKLFIA 60 Query: 118 DYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGI 177 KV+YLD D + + + ++N + A V+ +K Sbjct: 61 SLLPETVDKVIYLDCDALVLDSFKEILNLDLNNYLAAGVLALNCTAEVKKAID----LNE 116 Query: 178 AKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADI 237 Y N+G LLIN +W + V + + L E + K DQ V+N + + L+ + Sbjct: 117 DDLYINAGMLLINLKRWRQENVENQFLEKLVEFNLRGKHFGMDQGVINNVSSKNLLVLNP 176 Query: 238 KYNTQFSLN----------------YQLKESFINPVTNDTIFIHYI 267 KYN + SL+ ++ + +F H+ Sbjct: 177 KYNLEGSLHNTGYDITFKLNGNIQKNYYSREVLDDAIENPVFQHFC 222 >UniRef50_A4IXE1 Glycosyl transferase, family 8 n=16 Tax=Francisella RepID=A4IXE1_FRATW Length = 296 Score = 143 bits (361), Expect = 8e-33, Method: Composition-based stats. Identities = 55/285 (19%), Positives = 107/285 (37%), Gaps = 30/285 (10%) Query: 29 LDIAYGTDKNFLFGCGISIASILKY-NEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRI 87 + I + DKN + G ++I S++ + N + +++ F+++ + K I Sbjct: 4 IPIVFTFDKNIILGGAVTIKSLIDHANPDTCYDIYVYHPNINKKSISAFNSMIEKTKHSI 63 Query: 88 KIYLINGDRLRSLPS-TKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINF 146 + ++ + +P T+ ++R +I + KV+Y D D++ Q + + N Sbjct: 64 SFHNVDESIFKDVPIDTRRGWIITFYRLLIPKLLP-QYDKVIYSDVDVLFQSDMSEVYNT 122 Query: 147 SFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFN-SGFLLINTAQWAAQQVSARAIA 205 + A V+ E + Y GF+++NT +R Sbjct: 123 DLTSYEWAGVIAE---KHQQNMVQHKYFKENNNSYIYWPGFMVMNTKLMRENNFISRCFD 179 Query: 206 MLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQ---------------LK 250 ++E ++ D DVLN+ K+ KY T S+ Y Sbjct: 180 TMHEFNT--RLKFRDLDVLNLTCR-KIKSLPFKYVTLQSIYYLNTIQEAPEYIFLKEIYS 236 Query: 251 ESFINPVTNDTIFIHYIG-PTKPWHDWAWDYPVSQAFMEAKNASP 294 ++ + N+ IHY G P KPW P ++E + P Sbjct: 237 DNELLDAKNNPAIIHYAGSPGKPWR---MKRPYKN-YLEYISKIP 277 >UniRef50_C0X9Z8 Glycosyltransferase n=1 Tax=Lactobacillus gasseri JV-V03 RepID=C0X9Z8_9LACO Length = 675 Score = 142 bits (358), Expect = 2e-32, Method: Composition-based stats. Identities = 59/275 (21%), Positives = 108/275 (39%), Gaps = 35/275 (12%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIY 90 IA D N L ++ SI +N+ + HI + + Q+ ++I Sbjct: 4 IALDADVNDLNKIETTLKSIFLHNQHVEI--HIINFNIPHEWFINVNQYVNQFGSKIIDE 61 Query: 91 LINGDRLRSLP-STKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFP 149 I+ + L + S+ + RF+I D A KVLYLD+D+I ++ + +F Sbjct: 62 KIDPNFLGDVQPSSDQIKKISFGRFLIPDLI--SADKVLYLDSDLIVTDNLQSIFQMNFD 119 Query: 150 DDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNE 209 D + V D FNSG +LIN +W ++VS++ I M + Sbjct: 120 DKMLFAVHDYQNPDQ-----------------FNSGVMLINNKRWREEKVSSKLIEMSKQ 162 Query: 210 PEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTND------TIF 263 + DQ V+N + +++ ++ YN Q L + V ++ Sbjct: 163 QALAS-----DQAVINEVFKNQIGELNLSYNYQIGLEKNAYWNNKQVVFDNYNRVPIPRI 217 Query: 264 IHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNT 298 I+Y G P++ + + + + N W + Sbjct: 218 INYSGDDNPFNLVSTG-DLRNNWWQYHNLE-WSDI 250 >UniRef50_B6ACJ0 Glycosyl transferase family 8 protein, putative n=1 Tax=Cryptosporidium muris RN66 RepID=B6ACJ0_9CRYT Length = 304 Score = 141 bits (355), Expect = 4e-32, Method: Composition-based stats. Identities = 51/270 (18%), Positives = 101/270 (37%), Gaps = 21/270 (7%) Query: 22 VETENLCLDIAYGTDKNFLFGCGISIASILKY-NEGSRLCFHIFTD-YFGDDDRKYFDAL 79 + + IA+ DK + SI K +E + HI T + D K + Sbjct: 43 LSNPDKVYQIAFSADKEVFQLFPTLLNSIFKNLHEYEKANVHIITMPDISEKDIKILQSF 102 Query: 80 A-LQYKTRIKI-YLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQ 137 + ++ +I + + +L+ + K+ + A R ++ + K+LYLD D+I Sbjct: 103 SMNKFDKKIALLFYPFNYKLKYTRTLKHVSEATMCRLLLPNIIDKSIDKLLYLDTDVIVN 162 Query: 138 GTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAG----IAKGYFNSGFLLINTAQ 193 + L + + + +AD + + FN+G LLI+ + Sbjct: 163 TPLRELFGININSQCGIVARSSTKADLINEWLKKDKIYPHIIYNGTKSFNAGVLLISLNE 222 Query: 194 WAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESF 253 + +A+ + K DQ +LN+ + ++YN + ++ + Sbjct: 223 LRKNHFTDKAMEFVE------KWGLNDQIILNLYCNGEYDELPMQYNF-----WAGRDDY 271 Query: 254 INPVTNDTIFIHYIGPTKPWHDWAWDYPVS 283 N T+ +H+ GP KPW Y Sbjct: 272 RN--TSAHGIVHFAGPNKPWQPNYQPYEEQ 299 >UniRef50_C3XN62 Glycosyl transferase n=1 Tax=Helicobacter winghamensis ATCC BAA-430 RepID=C3XN62_9HELI Length = 284 Score = 139 bits (351), Expect = 1e-31, Method: Composition-based stats. Identities = 47/228 (20%), Positives = 86/228 (37%), Gaps = 26/228 (11%) Query: 129 YLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLL 188 YLD D++ + + V+ E + +L + ++K YFN+G LL Sbjct: 2 YLDVDMLVLKDLREIFAIDLEGKICGAVLDYKANRILEPKNKALPMLNLSKDYFNAGLLL 61 Query: 189 INTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQ 248 I+ +W +Q++ ++ I LN+ + DQ LN++L DK+ + +NT Sbjct: 62 IDLEKWKSQKLESKLIETLNQYHCKE----HDQSALNVVLKDKIKILPLSWNTLVYYYVN 117 Query: 249 LK-------------ESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAF-----MEAK 290 K +N + +HY KPW+D + F Sbjct: 118 AKACDDTKNFNLFYTRKDLNKALKNPHILHYYLGFKPWNDDKIYTDIKGEFLGEHWWNMV 177 Query: 291 NASP-WKNTALLKPNNSNQLRYSAKHMLKKHRYLKGFSNYLFYFIEKI 337 +P +K+ + ++ AK + L F+ Y YF+ Sbjct: 178 EKTPEFKDMIIPLKTKAS---KKAKLQVSLGYTLLTFARYKLYFLIPF 222 >UniRef50_Q17VR5 Lipopolysaccharide biosynthesis protein n=19 Tax=Helicobacter RepID=Q17VR5_HELAH Length = 405 Score = 139 bits (351), Expect = 1e-31, Method: Composition-based stats. Identities = 64/377 (16%), Positives = 131/377 (34%), Gaps = 78/377 (20%) Query: 24 TENLCLDIAYGTDKNFLFGCGISIASILKYNEGSR------LCFHIFTDYFGDDDRKYFD 77 +++ + I D N+ G+S+ S+L + R H D ++ + Sbjct: 2 QDSVIIPIVVAFDNNYCIPAGVSLYSMLANAKTERERVKLFYKIHCLVDGLSAENIEKLK 61 Query: 78 ALALQYK--TRIKIYLIN------------------GDRLRSL----------------- 100 + + ++ I+ D +++ Sbjct: 62 ETLAPFSAFSSVEFLEISTHNTPKENQEIKKNQTIKSDHYQNIDPIIANKIEELFTKLSN 121 Query: 101 PSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVV--- 157 S K ++ I R ++A F + K++ D D + G I V Sbjct: 122 YSQKRFSKMIMCRLLLASLFP-QYDKMIMFDVDTLFVGDISESFFIPLEAHYFGAVREKD 180 Query: 158 --------TEGQADWWEKRAHSLGVAG----------IAKGYFNSGFLLINTAQWAAQQV 199 + + ++RA S+GVA + YFN+GFL +N W + + Sbjct: 181 LIAMNRNSAKDLYELRQRRAKSIGVANAFPNLEEAQILFDNYFNAGFLALNLKLWRKENL 240 Query: 200 SARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTN 259 + I +K+ DQD L + +++ YN S ++ P Sbjct: 241 ENQLIGFFILKN--EKLLFNDQDALCFVCRGRILELPYPYNAHPS----FLDTPSFPSIK 294 Query: 260 DTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKK 319 + +H+ G KPW ++ ++ + E +P+K+ K N+ L + H+ K Sbjct: 295 EVCMLHFWG-DKPWKIFSV--FGAKKWHEVLMQTPFKD----KYFNTPFLDHLFNHIQNK 347 Query: 320 HRYLKGFSNYLFYFIEK 336 + L+ F+ L + ++ Sbjct: 348 NNKLRTFNKALSFVDKR 364 >UniRef50_UPI000197AD97 hypothetical protein BACCOPRO_03221 n=1 Tax=Bacteroides coprophilus DSM 18228 RepID=UPI000197AD97 Length = 313 Score = 138 bits (347), Expect = 3e-31, Method: Composition-based stats. Identities = 55/325 (16%), Positives = 113/325 (34%), Gaps = 28/325 (8%) Query: 27 LCLDIAYGTDKNFLFGCGISIASILKYNEGS-RLCFHIFTDYFGDDDRKYFDALALQYK- 84 + I + D N + + I+S+L + I D ++ D L + Sbjct: 3 KTVPIVFAFDNNLILPACVCISSLLMNAKEETFYDIFILHSSKVDLHKEQLDELPKYFNR 62 Query: 85 TRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPL- 143 RI+ +++ + T Y+R +I + + ++Y D D+I + + + Sbjct: 63 CRIQYRVVDNT-FDQAFEIRGITTPTYYRLLIPELVP-EYDNIIYSDVDVIFRFDLSDIY 120 Query: 144 INFSFPDDKVAMV-VTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSAR 202 + D VA V +K LG I + +G +++N+ + + R Sbjct: 121 FHTDLNDSYVAGVNALVPFIPDMKKYYLKLGNVNIDSIIY-AGNIILNSKKIREDNLVER 179 Query: 203 AIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLN-------YQLKESFIN 255 + K D DVLN+ K+ + + + L++ + + Sbjct: 180 FKELAKN-----KFHFQDLDVLNIACKGKITYLKPVFCLTTYFSELALRHRNLLRDFWSD 234 Query: 256 PVTNDTI---FIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYS 312 ++ + +HY G KPW + S + E SP+ + +L Sbjct: 235 KDIDEALTEGIVHYNGQ-KPWKGICVN---SDIWWEYYRKSPFFDEKFYFEFFYTRLNEL 290 Query: 313 AKHMLKKHRYLKGFSNYLFYFIEKI 337 + L + +K Y Y +I Sbjct: 291 DQLSL--WKRIKILIRYFVYGKREI 313 >UniRef50_Q39T65 Glycosyl transferase, family 8 n=1 Tax=Geobacter metallireducens GS-15 RepID=Q39T65_GEOMG Length = 317 Score = 136 bits (344), Expect = 8e-31, Method: Composition-based stats. Identities = 46/319 (14%), Positives = 110/319 (34%), Gaps = 44/319 (13%) Query: 22 VETENLCLDIAYGTDKNFLFGCGISIASILKY-NEGSRLCFHIFTDYFGDDDRKYFDALA 80 V L + + + D N++ ++ S+L N + F + + +++R + Sbjct: 3 VAINELNIPVFFAFDNNYVIPAAVAFHSLLANVNVSYKYHFIVLHEDISEENRDLLAQVV 62 Query: 81 LQY-KTRIKIYLI---NGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIIC 136 + ++ + + ++ ++T ++ + + K+++ D D++ Sbjct: 63 SLFSNASVEFRDMGESFKNEWENIKGKGHYTKECLYKL-VPMLEFPQYDKIIWSDVDVVF 121 Query: 137 QGTIEPLINFSFPDDKVAMVVTEGQAD-WWEKRAHSLGVAGIAKGYFNSGFLLINTAQWA 195 + I + ++ +A V G+ D ++E + I K +G L+ N + Sbjct: 122 KDDISDVFFMLSEENYIAGVRVCGKLDKYYENMNMPAEIKSILKNGIGAGILVYNLKKMR 181 Query: 196 AQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKY---------------- 239 + + + + + P+QD+LN++L DK+ + ++Y Sbjct: 182 EDNIYDDIM--IALQGMSSIVVQPEQDILNIVLKDKIDYIPLRYCFCTYMYNLFKDRHKM 239 Query: 240 ---------NTQF-------SLNYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVS 283 N F + E + IHY TKPW+ Sbjct: 240 KLKVKGNLFNYLFKGYRKNLGFDTIYSEKELLEAFESPAIIHYATSTKPWNTLFTK--RK 297 Query: 284 QAFMEAKNASP-WKNTALL 301 ++ +P WK Sbjct: 298 SDWLYCLLKTPFWKRYIFR 316 >UniRef50_B3XPR8 Glycosyl transferase family 8 n=4 Tax=Lactobacillus RepID=B3XPR8_LACRE Length = 465 Score = 132 bits (333), Expect = 1e-29, Method: Composition-based stats. Identities = 50/268 (18%), Positives = 94/268 (35%), Gaps = 34/268 (12%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIY 90 IA D ++ ++ SI +N+ + +I + + ++I Sbjct: 4 IALSVDYRWIDQAETTLKSIYAHNKNVKT--YIINHDIPHEWFVNINRYLGVQDSQIIDR 61 Query: 91 LINGDRLRSLPSTK-NWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFP 149 I+ +R + +P + + +Y +F+I + +VLYLD+D+I ++ L Sbjct: 62 KIDEERFKDMPMPEARISPMVYGKFLIPELIPE--DQVLYLDSDVIVDKNLDQLFATKIN 119 Query: 150 DDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNE 209 D + VV FNSG LLIN W + + + + ++ Sbjct: 120 DRPLYTVVDYFNPSQ-----------------FNSGVLLINNLFWRNNNIGNQLLKLGHD 162 Query: 210 PEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSL----NYQLKESFINPVTN--DTIF 263 + Q ++N A D +N Q + K SF D Sbjct: 163 YNLNNT-----QVIMNEGFAQNYGKLDPCFNFQIGYERKSYWNDKSSFYAFFDKVTDPAI 217 Query: 264 IHYIGPTKPWHDWAWDYPVSQAFMEAKN 291 IHY KP++ + + + N Sbjct: 218 IHYTEKDKPFNIEK-TVELREKWWYYHN 244 >UniRef50_A9S2B3 Predicted protein (Fragment) n=1 Tax=Physcomitrella patens subsp. patens RepID=A9S2B3_PHYPA Length = 275 Score = 131 bits (330), Expect = 3e-29, Method: Composition-based stats. Identities = 56/265 (21%), Positives = 99/265 (37%), Gaps = 27/265 (10%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNE-GSRLCFHIFTDYFGDDDRKYFDALALQYKTRI 87 + IA D N+L G +I SIL + E S + FH + + + R Sbjct: 11 VHIAMTLDANYLRGSMAAIYSILLHAECASNVRFHFVA---TKEKKNKCKSF-----CRS 62 Query: 88 KIYLINGDRLRSLPSTKN-WTHAI--YFRFVIADYFINKAPKVLYLDADIICQGTIEPLI 144 +Y + + L+ + S+ T Y RF +A + +++YLD D++ G IE L Sbjct: 63 AMYFYSCELLKLIYSSDFVITQEPLNYARFYLAHMIDSCVKRIIYLDLDVLVLGRIEELW 122 Query: 145 NFSFPDDKV-------AMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQ 197 + + V A + ++W + + A YFNSG +LIN +W Sbjct: 123 MTNMGNSTVGTPEYCHANFPSYFTENFWINSSLASTFANKQPCYFNSGMMLINLERWRKT 182 Query: 198 QVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPV 257 + ++ + + L + A + D ++N Q L + + Sbjct: 183 RCTSTLEYWMEVQKQQHIYELGSLPPLLLTFAGSIQAIDNRWN-QHGLGGDIVKGDCRS- 240 Query: 258 TNDTIFIHYIGPTKPWHDWAWDYPV 282 +H+ G KPW P Sbjct: 241 ------LHWSGGGKPWRRLDMHQPC 259 >UniRef50_B9KUH7 Lipopolysaccharide 3-alpha-galactosyltransferase n=1 Tax=Rhodobacter sphaeroides KD131 RepID=B9KUH7_RHOSK Length = 304 Score = 131 bits (330), Expect = 3e-29, Method: Composition-based stats. Identities = 50/222 (22%), Positives = 81/222 (36%), Gaps = 28/222 (12%) Query: 93 NGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDK 152 D + P + R +I +VLYLD D+ + PL + Sbjct: 76 EADLAGAKPVGTYISETTMGRLLIPRKL---TGRVLYLDGDVRVVDDLSPLFSLDMRGFP 132 Query: 153 VAMVVT--------EGQADWWEKRAHSLGVAGIAKG-----YFNSGFLLINTAQWAAQQV 199 +A V G+ RA A G YFN+G LL++ + AA Sbjct: 133 LAGVRDYVVSKRLARGEPVKVRNRARIEEEARCMSGADASTYFNAGVLLLDASAIAADHS 192 Query: 200 SARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKE--SFINPV 257 A+ + + K T DQD LN + A ++ D YN+ +S + + + P Sbjct: 193 LCSAMQ---DLDRASKWTLGDQDHLNNVFAGRVRLIDPAYNSSWSRTPRQRRYVERLGPA 249 Query: 258 T-----NDTIFIHYIGPTKPWHDWAWDY--PVSQAFMEAKNA 292 IH+ GP KPW +++ P ++A + Sbjct: 250 PAELTYAPDAIIHFHGPAKPWKKAQYNFWSPRARAVFSYRRE 291 >UniRef50_C6DEN2 Glycosyl transferase family 8 n=1 Tax=Pectobacterium carotovorum subsp. carotovorum PC1 RepID=C6DEN2_PECCP Length = 615 Score = 131 bits (330), Expect = 3e-29, Method: Composition-based stats. Identities = 53/274 (19%), Positives = 106/274 (38%), Gaps = 26/274 (9%) Query: 31 IAYGTDKNFLFGCGISIAS----ILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTR 86 I D + +++ S I + N + + + + + A ++ Sbjct: 337 IFLCADTAYNVPALVALTSLAMSIAQANPPPDIYMFVLPET-HEIWSQIAHCFAKKFPLT 395 Query: 87 IKIY---LINGDRLRSLPSTKNW----THAIYFRFVIADYFIN-KAPKVLYLDADIICQG 138 +KI ++ D R+ +N+ + Y R + Y + LYLD+D++ + Sbjct: 396 VKIVSTLQMDLDESRAHYGFQNYGKMLSITAYARLYASRYLQGLGITRALYLDSDVVIRR 455 Query: 139 TIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKG-YFNSGFLLINTAQWAAQ 197 + L++ +A + + ++ + GI G YFNSG LL++ A Q Sbjct: 456 SPLGLLHMDMGGYPLAA----RTERAHPRISRAIKLHGIPNGRYFNSGILLLDFQHPATQ 511 Query: 198 QVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPV 257 AIA + + K+ + DQ LN + + D K+N + + +P Sbjct: 512 STLNTAIAYSEQ--LDNKLLYLDQCALNKSIQGLYLDLDEKFNWFI-----VPDDTAHPQ 564 Query: 258 TNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKN 291 D +H+I KPW D + + + + K+ Sbjct: 565 DEDAAIMHFISTPKPW-DLNYSGRGATLWADYKH 597 >UniRef50_O48684 F3I6.10 protein n=46 Tax=Embryophyta RepID=O48684_ARATH Length = 393 Score = 131 bits (330), Expect = 4e-29, Method: Composition-based stats. Identities = 46/273 (16%), Positives = 89/273 (32%), Gaps = 23/273 (8%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNE-GSRLCFHIFTDYFGDDDRKYFDALALQYKTRI 87 + IA D +L G ++ S+L++ + FH F + L + Sbjct: 85 VHIAMTLDSEYLRGSIAAVHSVLRHASCPENVFFHFIAAEFDSASPRVLSQLVRSTFPSL 144 Query: 88 KIYLINGDRLRSLPSTKNWTHAI---------YFRFVIADYFINKAPKVLYLDADIICQG 138 + R + +I Y R + D +V+YLD+D+I Sbjct: 145 NFKVY---IFREDTVINLISSSIRLALENPLNYARNYLGDILDRSVERVIYLDSDVITVD 201 Query: 139 TIEPLINFSFPDDKVAMVVTEGQADW--------WEKRAHSLGVAGIAKGYFNSGFLLIN 190 I L N +V A++ W A ++G YFN+G ++++ Sbjct: 202 DITKLWNTVLTGSRVIGAPEYCHANFTQYFTSGFWSDPALPGLISGQKPCYFNTGVMVMD 261 Query: 191 TAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLK 250 +W + + + ++ ++ A + D ++N Q L Sbjct: 262 LVRWREGNYREKLEQWMQLQKKMRIYDLGSLPPFLLVFAGNVEAIDHRWN-QHGLGGDNI 320 Query: 251 ESFINPVTNDTI-FIHYIGPTKPWHDWAWDYPV 282 + + +H+ G KPW P Sbjct: 321 RGSCRSLHPGPVSLLHWSGKGKPWVRLDEKRPC 353 >UniRef50_C7TIE0 Glycosyl transferase, group 8 n=2 Tax=Lactobacillus rhamnosus RepID=C7TIE0_LACRL Length = 286 Score = 129 bits (326), Expect = 1e-28, Method: Composition-based stats. Identities = 55/277 (19%), Positives = 106/277 (38%), Gaps = 25/277 (9%) Query: 31 IAYGTDKNFLFGCGISIASILKYNE-GSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKI 89 + + + + G +IAS++ + L + D + + D + ++ Q Sbjct: 6 VLFTVTGSHIQLTGTAIASLVLHWPVNIPLRILVMADDYLNQDIFWLKSIPKQLLRPNIT 65 Query: 90 YLI-----NGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLI 144 + D++ + + + + +R Y + ++LYLD D++ I P+ Sbjct: 66 VDVWQKPSIMDQVHTANTNTRYPSVVLWRLFAP-YIFSDTDRLLYLDNDVLICDDISPMF 124 Query: 145 NFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIA--KGYFNSGFLLINTAQWAAQQVSAR 202 + PDDK V + Q + I YFNSG LLINT ++ + Sbjct: 125 DM-LPDDKAIGAVNDFQTLLYADTKEGSIWPEIKHFDSYFNSGVLLINTHKYIQAYTQDQ 183 Query: 203 AIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQ--------FSLNYQLKESFI 254 + +N + + DQ +LN L + I ++YN Q ++L+Y LK++ Sbjct: 184 LVNTINTSD----YSFIDQTILNNLFESQSIHLPLQYNYQKDDEWLNGYALHYNLKQAKK 239 Query: 255 NPVTNDTIFI-HYIGPTK--PWHDWAWDYPVSQAFME 288 + I H++ + PW Q F Sbjct: 240 MQAARKKVVIRHFVSEIRSLPWEHGYSRDEFEQNFWR 276 >UniRef50_D1IFB6 Whole genome shotgun sequence of line PN40024, scaffold_26.assembly12x (Fragment) n=1 Tax=Vitis vinifera RepID=D1IFB6_VITVI Length = 473 Score = 129 bits (326), Expect = 1e-28, Method: Composition-based stats. Identities = 50/309 (16%), Positives = 102/309 (33%), Gaps = 43/309 (13%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNE-GSRLCFHIFTDYFGDDDRKYFDALALQYKTRI 87 + IA D +L G ++ SIL+++ + FH F + L + Sbjct: 144 VHIAMTLDSEYLRGSIAAVHSILRHSSCPENVFFHFIAAEFDPASPRVLTQLVRSTFPSL 203 Query: 88 KI--YLINGDRLRSLPSTKNWT----HAIYFRFVIADYFINKAPKVLYLDADIICQGTIE 141 Y+ D + +L S+ + Y R + D +V+Y+D+D++ I Sbjct: 204 NFKVYIFREDTVINLISSSIRSALENPLNYARNYLGDILDPCVERVIYIDSDLVVVDDIR 263 Query: 142 PLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSA 201 L N + + YFN+G ++++ +W Sbjct: 264 KLWNITLTEKPC---------------------------YFNTGVMVMDLVRWRKGNYRR 296 Query: 202 RAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDT 261 + + + ++ A + D ++N Q L + P+ Sbjct: 297 KIENWMELQRRRRIYELGSLPPFLLVFAGNVEAIDHRWN-QHGLGGDNVKGSCRPLHPGP 355 Query: 262 I-FIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKKH 320 + +H+ G KPW P + W+ L KP+ +++L + + Sbjct: 356 VSLLHWSGKGKPWSRLDARKPCPVDHL-------WEPYDLYKPHRNHRLNHQQMLLSASS 408 Query: 321 RYLKGFSNY 329 L G + Sbjct: 409 STLVGLQKH 417 >UniRef50_Q1CUZ8 Lipopolysaccharide 1,2-glycosyltransferase n=12 Tax=Helicobacter RepID=Q1CUZ8_HELPH Length = 372 Score = 128 bits (322), Expect = 3e-28, Method: Composition-based stats. Identities = 48/328 (14%), Positives = 107/328 (32%), Gaps = 58/328 (17%) Query: 25 ENLCLDIAYGTDKNFLFGCGISIASILKYNEGSR-----------LCFHIFTDYFGDDDR 73 ++ + IA D ++ G+S+ S+L + H D +++ Sbjct: 1 MSIIIPIAIAFDNHYAIPTGVSLYSMLACAKTEHPQSQNDSEKLFYKIHCLVDNLSLENQ 60 Query: 74 KYFDA---------------LALQYKTRIKIYLINGDRLR------SLPSTKNWTHAIYF 112 + ++ + IKI D++ ++ + ++ + Sbjct: 61 QKLKETLAPFSAFASVDFLDISEPDHSTIKIEPFVIDKIHEAFLQLNIYAKTRFSKMVMC 120 Query: 113 RFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQAD--------- 163 R +A F + K++ DAD + + Sbjct: 121 RLFLASLFP-QYDKIIMFDADTLFLNDVSESFFIPLDSYYFGAAKDFASPKSLKHFQTER 179 Query: 164 ---------WWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIK 214 +E + I + ++N GFL++N W A + R + + ++ Sbjct: 180 EREPRQKFSLYEHYLKEKDMKIICENHYNVGFLIVNLKLWRADHLEERLLNLTHQKGQC- 238 Query: 215 KITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIGPTKPWH 274 + P+QD+L + K++ YN + L + P + + +H+ KPW Sbjct: 239 -VFCPEQDLLTLACYQKVLQLPYIYNAHPFM---LNQKRFIPDKKEIVMLHFYFVGKPWI 294 Query: 275 DWAWDYPVSQAFMEAKNASPWKNTALLK 302 Y S+ + E +P+ +K Sbjct: 295 SPTALY--SKEWHETLLKTPFYAEYSVK 320 >UniRef50_B9HMR5 Glycosyltransferase, CAZy family GT8 n=25 Tax=Magnoliophyta RepID=B9HMR5_POPTR Length = 383 Score = 126 bits (317), Expect = 1e-27, Method: Composition-based stats. Identities = 48/273 (17%), Positives = 90/273 (32%), Gaps = 19/273 (6%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNE-GSRLCFHIFTDYFGDDDRKYFDALALQYKTRI 87 + IA D +L G ++ S+LK+ + FH F + L + Sbjct: 77 VHIAMTLDSEYLRGSIAAVHSVLKHASCPESIFFHFVAAEFDPASPRVLTQLVRSTFPSL 136 Query: 88 KI--YLINGDRLRSLPSTKNW----THAIYFRFVIADYFINKAPKVLYLDADIICQGTIE 141 Y+ D + +L S+ Y R + D +V+YLD+DI+ I Sbjct: 137 NFKVYIFREDTVINLISSSIRQALENPLNYARNYLGDMLDLCVDRVIYLDSDIVVVDDIH 196 Query: 142 PLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKG----------YFNSGFLLINT 191 L N + +V A++ + + G YFN+G ++++ Sbjct: 197 KLWNTALSGSRVIGAPEYCHANFTQYFTSVFWSDQVMSGTFSSARRKPCYFNTGVMVMDL 256 Query: 192 AQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKE 251 +W R + + + ++ A + D ++N Q L Sbjct: 257 VRWREGDYKRRIEKWMEIQKKTRIYELGSLPPFLLVFAGDVEAIDHRWN-QHGLGGDNVR 315 Query: 252 SFINPVTNDTI-FIHYIGPTKPWHDWAWDYPVS 283 + + +H+ G KPW P Sbjct: 316 GSCRSLHPGPVSLLHWSGKGKPWVRLDAKKPCK 348 >UniRef50_D1IU75 Whole genome shotgun sequence of line PN40024, scaffold_5.assembly12x (Fragment) n=7 Tax=Magnoliophyta RepID=D1IU75_VITVI Length = 364 Score = 125 bits (314), Expect = 2e-27, Method: Composition-based stats. Identities = 50/268 (18%), Positives = 97/268 (36%), Gaps = 20/268 (7%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNE-GSRLCFHIFTD--YFGDDDRKYFDALALQ--- 82 + +A D ++L G ++ SIL++++ + FH + R F L + Sbjct: 63 VHVAITLDVHYLRGSMAAVHSILQHSQCPEDIFFHFLVSETHLEILVRSTFPQLKFKVYY 122 Query: 83 YKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEP 142 + I LI+ +L N Y R +AD +V+YLD+D+I I Sbjct: 123 FNPEIVRNLISTSVREALEHPLN-----YARNYLADLLEPCVRRVIYLDSDLIVVDDIYK 177 Query: 143 LINFSFPDDKVAM-------VVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWA 195 L + S + +W ++ + G YFN+G ++I+ A+W Sbjct: 178 LWSTSLGTRTIGAPEYCHANFTRYFTDKFWSEKRYYGTFDGRKPCYFNTGVIVIDLAKWR 237 Query: 196 AQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFIN 255 + R + + + ++ A + + ++N Q L + Sbjct: 238 RFGFTKRIERWMEVQKNNRIYELGSLPPYLLVFAGHVAPIEHRWN-QHGLGGDNVKGSCR 296 Query: 256 PVTNDTI-FIHYIGPTKPWHDWAWDYPV 282 + + +H+ G KPW P Sbjct: 297 ELHPGPVSLLHWSGSGKPWARLDMKAPC 324 >UniRef50_Q92VQ2 Putative lipopolysaccharide 1,3-galactosyltransferase n=1 Tax=Sinorhizobium meliloti RepID=Q92VQ2_RHIME Length = 337 Score = 124 bits (312), Expect = 5e-27, Method: Composition-based stats. Identities = 50/270 (18%), Positives = 92/270 (34%), Gaps = 21/270 (7%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIY 90 I TD+N+ + S ++ +G+ +F D + FD K ++ Sbjct: 20 IVLVTDQNYALPTFSAALSADQHTKGADTAIRMFVVGAEDTWARQFDEAVAGTKIKVIAA 79 Query: 91 LINGDRLRSLPSTKNW-THAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLI-NFSF 148 + S ++ RF I + LY+D D + G ++ L+ + Sbjct: 80 RLPQLAELSPYHRDHYLPPIALARFWIDSLLDAGVDRFLYIDGDTMVDGELDSLLASTPP 139 Query: 149 PDDKVAM------VVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSAR 202 + +A + E AH G+ + YFNSG + + W + Sbjct: 140 AEGLMAAPDFLNIFMDEVSRGKKRDLAHLEGIGCRPETYFNSGVIYASREAW--NDIVPV 197 Query: 203 AIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTN--D 260 A+ + E + DQ LN ++ ++YN Q ++P Sbjct: 198 AMKFMVEH--PEHCPASDQSALNHAARGRVTMLSLRYNYQ-----SEHMMVLDPRRRGIG 250 Query: 261 TIFIHYIGPTKPWHDWAWDYPVSQAFMEAK 290 H+ G KPW+ W P ++F Sbjct: 251 PAIWHFTGGPKPWNTPGW--PWDESFNRYY 278 >UniRef50_B3WD32 Glycosyl transferase n=9 Tax=Lactobacillus RepID=B3WD32_LACCB Length = 279 Score = 124 bits (311), Expect = 5e-27, Method: Composition-based stats. Identities = 60/299 (20%), Positives = 97/299 (32%), Gaps = 57/299 (19%) Query: 27 LCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYF----------GDDDRKYF 76 + ++I + D+ G I+ S++++ L ++ T + Sbjct: 1 MTMNIMFCGDEKMTDGVLIATLSLMRHT-DQPLHIYVLTAKLKVNGHAYQPFSAVTAERM 59 Query: 77 DALALQYK-----TRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLD 131 L Q TRI I + T +T R AD +VLYLD Sbjct: 60 ADLMRQENPQHRLTRIDITDLFMANPPQANMTTMFTPYCMLRLY-ADLIPELPDRVLYLD 118 Query: 132 ADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINT 191 DI+C+ + L D +A V+ W+ + Y NSG LL+N Sbjct: 119 TDIVCRRSFSNLYQEPMKDVDIAGVLDHYGKWWFHHKLTWF-------DYINSGVLLMNL 171 Query: 192 AQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKE 251 A + R ++ + + PDQ LN++ K KYN Q Sbjct: 172 ASIRQDGLLVRCRRLIRH----RWLFMPDQSALNIIAKSK-QILPRKYNEQH-------- 218 Query: 252 SFINPVTNDTIFIHYIGP-----------TKPW-----HDWAWDYPVSQAFMEAKNASP 294 V DT+F H+ KPW H + + + +P Sbjct: 219 ----KVETDTVFQHFTTSFRFWPRFRIVTVKPWQISAVHQQLGLHEYDDLLNDYRQLAP 273 >UniRef50_C6DEN1 Glycosyl transferase family 8 n=1 Tax=Pectobacterium carotovorum subsp. carotovorum PC1 RepID=C6DEN1_PECCP Length = 602 Score = 122 bits (307), Expect = 2e-26, Method: Composition-based stats. Identities = 48/261 (18%), Positives = 101/261 (38%), Gaps = 32/261 (12%) Query: 31 IAYGTDKNFLFGCGISIASI---LKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKT-- 85 I + D + +++ S+ ++ +E +IF + + LA + Sbjct: 327 IFFCADTAYTAPAIVALISLAIAIERSENLP-DIYIFV--LPEAHGLW-GQLASSFNREF 382 Query: 86 -RIKIYLINGDRLRSLPSTKNW---------THAIYFRFVIADYFIN-KAPKVLYLDADI 134 + + +++ +++ S ++ + Y R + Y + LYLD+D+ Sbjct: 383 PSLTLRVVSTLQMQLDQSRAHYGFNSMGDMLSTMAYARLYASRYLSQCGVARALYLDSDV 442 Query: 135 ICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKG-YFNSGFLLINTAQ 193 + Q + PL+ + +A + H++ + GI G YFNSG +L++ Sbjct: 443 VIQSSPLPLLYMDMEEFPLAACHDQVGPLV----DHAVTLHGIPNGRYFNSGVMLLDFHH 498 Query: 194 WAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESF 253 A AI + + + + DQ LN + + D KYN Y Sbjct: 499 PATLPAIEAAITYSEDTDSV--LIFQDQCALNKAIRGLYLTLDGKYNC-----YMPPGRP 551 Query: 254 INPVTNDTIFIHYIGPTKPWH 274 ++ + + +H++ KPWH Sbjct: 552 MSAMYENAAIVHFVSTPKPWH 572 >UniRef50_Q8LF94 Avr9/Cf-9 rapidly elicited protein 231 n=13 Tax=Magnoliophyta RepID=Q8LF94_ARATH Length = 351 Score = 120 bits (302), Expect = 6e-26, Method: Composition-based stats. Identities = 49/272 (18%), Positives = 96/272 (35%), Gaps = 17/272 (6%) Query: 26 NLCLDIAYGTDKNFLFGCGISIASILKYNE-GSRLCFHIFTDYFGDDDRKYFDALALQYK 84 + +A D ++ G ++ S+L+++ + FH F D ++ + Sbjct: 62 RRAVHMAMTLDAAYIRGSVAAVLSVLQHSSCPENIVFH-FVASASADASSLRATISSSFP 120 Query: 85 -TRIKIYLINGDRLRSLPSTKNW----THAIYFRFVIADYFINKAPKVLYLDADIICQGT 139 +Y+ N + L S+ Y R +AD +V+YLD+D+I Sbjct: 121 YLDFTVYVFNISSVSRLISSSIRSALDCPLNYARSYLADLLPPCVRRVVYLDSDLILVDD 180 Query: 140 IEPLINFSFPDDKVAMVVTEGQADW--------WEKRAHSLGVAGIAKGYFNSGFLLINT 191 I L D V A++ W SL A YFN+G ++I+ Sbjct: 181 IAKLAATDLGRDSVLAAPEYCNANFTSYFTSTFWSNPTLSLTFADRKACYFNTGVMVIDL 240 Query: 192 AQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKE 251 ++W ++R + + ++ ++ A + + ++N Q L Sbjct: 241 SRWREGAYTSRIEEWMAMQKRMRIYELGSLPPFLLVFAGLIKPVNHRWN-QHGLGGDNFR 299 Query: 252 SFINPVTNDTI-FIHYIGPTKPWHDWAWDYPV 282 + + +H+ G KPW P Sbjct: 300 GLCRDLHPGPVSLLHWSGKGKPWARLDAGRPC 331 >UniRef50_C7TID9 Glycosyl transferase, group 8 n=2 Tax=Lactobacillus rhamnosus RepID=C7TID9_LACRL Length = 301 Score = 120 bits (302), Expect = 7e-26, Method: Composition-based stats. Identities = 41/277 (14%), Positives = 95/277 (34%), Gaps = 24/277 (8%) Query: 31 IAYGTDKNFLFGCGISIASILK-YNEGSRLCFHIFTDYFGDDDRKYFDALALQYK-TRIK 88 + + + +I S++K Y+ + + + DD + ++ Y +I Sbjct: 6 VVFCVKGLHIMLVATAITSLVKKYHSDREMKILVIIEGGNQDDINFIRSIPSLYGKQQIS 65 Query: 89 I------YLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEP 142 + Y + + + + +R + YF ++ Y+D DI+ I Sbjct: 66 VDFWAPPYPLLDKVSDQFETGTSLPKMVLWRLFLPYYFP-DYDQIAYMDNDILITTDIND 124 Query: 143 LINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSAR 202 L + P+D + V+ + Y N+G + N+ + + + Sbjct: 125 LFDQMLPEDVIGGVLDYEDVTHPDHDRSKEFYLPSTDQYINAGVFVANSNAYRSVVPFEK 184 Query: 203 AIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVT---- 258 I ++N + DQ++LN+ + + ++N Q+ K + P Sbjct: 185 MIEIINRHN----YPYGDQNILNIAFYNHIYLLPWRFNLQYDNRLLDKYESLAPQRIKGI 240 Query: 259 ----NDTIFIHYIG---PTKPWHDWAWDYPVSQAFME 288 N+ IH+ PW+ + + + E Sbjct: 241 REQLNEPGIIHFAANGFVLPPWYVFTPTTRWEKMWWE 277 >UniRef50_A2RLV8 Putative glycosyltransferase n=1 Tax=Lactococcus lactis subsp. cremoris MG1363 RepID=A2RLV8_LACLM Length = 397 Score = 119 bits (300), Expect = 1e-25, Method: Composition-based stats. Identities = 55/313 (17%), Positives = 115/313 (36%), Gaps = 28/313 (8%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGS-RLCFHIFTDYFGDDDRKYFDALALQYKTRIKI 89 I Y + N++ S+ SI+ + + I ++ D++++ + KT++ + Sbjct: 6 IFYTVNGNYIQLVATSLTSIIMNIDEKFPVDIIIVSNDITDENKQTLYEILDMRKTQVNL 65 Query: 90 -YLINGDRLRSL--PSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINF 146 + + D L L + + + + +R + Y + + ++LYLD+D + E + Sbjct: 66 LFRMPPDSLELLLGDVSNIFDNVVCWRIFMP-YSLEEYSQLLYLDSDTLIYEGFEEIFGL 124 Query: 147 SFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAM 206 D + ++ EK + G YFNSG +IN ++ + + Sbjct: 125 LPQDKILGVIPDFYFFAINEKNSSKRG-------YFNSGVYMINVEKYIQKNSKEELLKN 177 Query: 207 LNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNY----------QLKESFINP 256 L E +I + DQ LN +L + +++N Q N+ + + FI Sbjct: 178 LMEN--FSEILYVDQTFLNNTFRGELFYLPLRFNYQKDDNWLNNWAILEAPESSQLFIKE 235 Query: 257 VTNDTIFIHYI---GPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSA 313 + H+I + PW + F N +P S ++ Sbjct: 236 -RANIKIRHFIEFGSHSMPWQHIEVRDQFEEYFWNVWNVLKEYRVKKHRPIKSLKMFLDP 294 Query: 314 KHMLKKHRYLKGF 326 K + L+ Sbjct: 295 KKNEQIINLLERI 307 >UniRef50_D0IR33 LPS 1,2-glycosyltransferase n=3 Tax=Helicobacter pylori RepID=D0IR33_HELP1 Length = 387 Score = 117 bits (293), Expect = 7e-25, Method: Composition-based stats. Identities = 48/343 (13%), Positives = 107/343 (31%), Gaps = 73/343 (21%) Query: 25 ENLCLDIAYGTDKNFLFGCGISIASILKY--------------------------NEGSR 58 ++ + I D ++ G+S+ S+L N+ Sbjct: 1 MSIIIPIVITFDNHYAIPAGVSLYSMLACTKLENPQSQNPQSQNPQSQNPQSQNDNKKLF 60 Query: 59 LCFHIFTDYFGDDDRKYFDALA-----------------LQYKTRIKIYLINGDRLR--- 98 H D +++ Y T I+ +I+ Sbjct: 61 YKIHCLVDNLSLENQCKLKETLAPFSAFMSVDFLDISTPNLYTTPIEPSVIDKINEAFLQ 120 Query: 99 -SLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVV 157 ++ + ++ + R +A F+ + K++ DAD + + D + Sbjct: 121 LNIYAKTRFSKMVMCRLFLASLFL-QYDKIIMFDADTLFLNDVSESFFIPLDDYYFGVAK 179 Query: 158 TEGQAD------------------WWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQV 199 +E + + + ++N GFL++N W A ++ Sbjct: 180 DFSSPKSSKHFQTERERAPRQAFSLYEHYLKEKDIKILYENHYNVGFLVVNLKLWRADRL 239 Query: 200 SARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTN 259 R + + ++ + P+QD+L + K++ YNT + Q P Sbjct: 240 EERLLNLTHQKGQC--VFCPEQDLLTLACYQKVLILPYIYNTHPFMVNQ---KRFIPNRQ 294 Query: 260 DTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLK 302 + + +H+ KPW Y S+ + E + + +K Sbjct: 295 EIVMLHFYFVGKPWVSPTALY--SKEWHETLLKTSFYAEYSVK 335 >UniRef50_A9UZX9 Predicted protein (Fragment) n=1 Tax=Monosiga brevicollis RepID=A9UZX9_MONBE Length = 1116 Score = 115 bits (289), Expect = 2e-24, Method: Composition-based stats. Identities = 48/231 (20%), Positives = 82/231 (35%), Gaps = 17/231 (7%) Query: 32 AYGTDKNFLFGCGISIASILKYN-EGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIY 90 N + I SIL ++ L FH+ TD+ ++ +++ Y Sbjct: 882 VVAVGSNHARRLQVLIKSILFHHLPPQPLRFHVITDHETAASLRHLYRSWRLPAVQVRFY 941 Query: 91 LIN----GDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINF 146 I G L L + + + +AD +V+ LD D++ G I L + Sbjct: 942 SITAALQGVDLHGLETHHYAGRYAFVKLFVADLLPVSLERVMVLDTDLLFLGPIAELWDQ 1001 Query: 147 SFPD---DKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQV---- 199 F + VV + R A N+G L++ A+ Q Sbjct: 1002 -FKGWSASAIFAVVDNFSEWYIPGRLQRQPWPAPAPLGINTGVTLLHLARLRHQNFPKVW 1060 Query: 200 SARAIAMLNEPEI-IKKITHPDQDVLNMLLADK---LIFADIKYNTQFSLN 246 ++ +L +P + I DQDV+N + D L ++N Q S N Sbjct: 1061 TSAVARVLADPRLNITYAPLADQDVMNTVFYDNPTLLHRLPCRFNYQLSEN 1111 >UniRef50_A9UXT0 Predicted protein (Fragment) n=1 Tax=Monosiga brevicollis RepID=A9UXT0_MONBE Length = 191 Score = 115 bits (288), Expect = 2e-24, Method: Composition-based stats. Identities = 36/195 (18%), Positives = 69/195 (35%), Gaps = 17/195 (8%) Query: 107 THAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLI-NFSFPDDKVAMVVTEGQAD-- 163 + A + RF++ + + +VLY+D D + QG + L+ + DD V Sbjct: 1 SSANFGRFMLPELLP-ELNRVLYIDIDTVVQGDLVALLAHMDLGDDDYLAAVPRPNVPLS 59 Query: 164 ----------WWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEII 213 E + +A FN+G + N W + + + + + Sbjct: 60 HFFGADIVRLHAELHPDPGQLLQLAAPSFNAGVAVWNLRAWRQRSLRDEVLYYMTKHHEH 119 Query: 214 KKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIGPTKPW 273 + Q +L ++ A D+++N L Y+ S + +H+ G KPW Sbjct: 120 ALWDYGTQPILLLVCAGHWQPLDVRFNLD-GLGYRTDVS--TEALDGAYVLHWSGRRKPW 176 Query: 274 HDWAWDYPVSQAFME 288 A F+ Sbjct: 177 QHDALYRQRWTRFVN 191 >UniRef50_Q1CSY7 Lipopolysaccharide 1,2-glycosyltransferase n=3 Tax=Helicobacter pylori RepID=Q1CSY7_HELPH Length = 341 Score = 114 bits (286), Expect = 4e-24, Method: Composition-based stats. Identities = 46/292 (15%), Positives = 96/292 (32%), Gaps = 51/292 (17%) Query: 46 SIASILKYNEGSR------LCFHIFTDYFGDDDRKYFDALALQYKT--RIKIYLINGDRL 97 S+ S+L R H D ++ + + T I+ I+ + Sbjct: 2 SLYSMLSSCTQERDGVKLFYQIHCLVDSLSAENVEKLKRTMSPFSTFSGIEFCDISKNDA 61 Query: 98 R------------SLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLIN 145 + + K ++ I R ++A F ++ K++ D D + G I Sbjct: 62 YPFKLVSQLFLRLNPFAKKRFSKMILCRLLLASIF-SQYEKIIMFDVDTLFVGDISESFF 120 Query: 146 FSFPDDKVAMVVTEGQADWWEK-------------------RAHSLGVAGIAKGY---FN 183 + SL + Y FN Sbjct: 121 IPMDGVYFGATKEDFSLIGIHNANDLFSSRLNWSRGMGVKLNHKSLIFQEVEILYENPFN 180 Query: 184 SGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQF 243 +GF+L+N A W + + I + + + P+QD+ ++ ++ KYN Sbjct: 181 AGFMLVNLALWREHHLEEKLIDFFKTRD--EGLLLPEQDLFVLVCQGCILEMPCKYNV-- 236 Query: 244 SLNYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPW 295 + ++ + + P +D +H+ KPW + YP S+ + + + + Sbjct: 237 --HPRMVGTRMIPKKSDACMLHFYADEKPWKHF--RYPYSKEWHQVAFKTSF 284 >UniRef50_C6DEN3 Glycosyl transferase family 8 n=1 Tax=Pectobacterium carotovorum subsp. carotovorum PC1 RepID=C6DEN3_PECCP Length = 610 Score = 114 bits (286), Expect = 4e-24, Method: Composition-based stats. Identities = 49/260 (18%), Positives = 92/260 (35%), Gaps = 30/260 (11%) Query: 31 IAYGTDKNFLFGCGISIASILKY--NEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 I + TD +F +++ S+ + +IF + R ++ +A ++ + Sbjct: 330 IFFCTDADFSLPAVVALTSLAMSIGGANNLPDIYIF---VPPEIRPLWERIAERFTSAFP 386 Query: 89 IYL--------INGDR----LRSLPSTKNWTHAIYFRFVIADYF-INKAPKVLYLDADII 135 I ++ D + + Y RF + Y + LYLD+DI+ Sbjct: 387 IITLRIVSTLQMDLDEVRAQFGFYNVGETLSTTTYTRFYASRYLHYIGVTRALYLDSDIV 446 Query: 136 CQGT-IEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQW 194 + + L +A KRA L + YFN+G +L + Sbjct: 447 ILHSPLSLLYE-DMQGFPLAARTDRNTP--LIKRAIRLHQIA-NERYFNAGVILFDLTHP 502 Query: 195 AAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFI 254 A AI + + DQ LN ++ + D +YN + S Sbjct: 503 AMISTINTAITYSKQGNSP--LLFLDQCALNKAISGLYLALDERYN-----RFIPPSSAT 555 Query: 255 NPVTNDTIFIHYIGPTKPWH 274 + ++T+ +H+I KPW Sbjct: 556 QVIEDNTVIMHFIETPKPWQ 575 >UniRef50_A9SH80 Predicted protein n=2 Tax=Physcomitrella patens subsp. patens RepID=A9SH80_PHYPA Length = 527 Score = 114 bits (285), Expect = 5e-24, Method: Composition-based stats. Identities = 59/338 (17%), Positives = 107/338 (31%), Gaps = 69/338 (20%) Query: 27 LCLDIAYGTDKNFLFGCGISIASILKYNEG-SRLCFHIFTDYFGDDDRKYFDALALQYK- 84 + I TD + + S + RL FH+ LA ++ Sbjct: 193 QVVHIFVSTDGADFRPLAVLVNSTISNAVHPERLHFHLV---LPASHHSRAKHLAAFFQD 249 Query: 85 TRIKIY--LINGDRLRSLPSTKNWTHA------IYFRFVIADYFI---NKAPKVLYLDAD 133 T+I I I+ + + + + A +Y + +YLDAD Sbjct: 250 TKIDIVSENIDFKDMEKHITFRKNSKARPELQSVYN--FAPFLLPLHFKDVGRFIYLDAD 307 Query: 134 IICQGTIEPLINFSFPDDKVAMVVTEGQA-DWWEKRAHSLGVAGIAKG------------ 180 I+ +G IE LI + A V Q + + + + Sbjct: 308 IVVKGNIEELIQIDLGNRAAAAVEDCSQTFETYFDFNELAKIQARPEKPTWVPTEPIKPD 367 Query: 181 --YFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITH---PDQDVLNMLLADKLIFA 235 FN G L+I+T QW QQV+ + ++E + + + + Q + L K + Sbjct: 368 ACVFNRGVLVIDTNQWIKQQVTEAILWWMDEFQSAESVLYKYGLSQPPFLLALYGKYMKL 427 Query: 236 DIKYNTQFSLNYQL-----------------KESFINPVTNDTIFIHYIGPTKPWHDWAW 278 D +N + + ++ FI+ + +H+ G KPW Sbjct: 428 DTPWNVRGLGRNEFSEREREFLESKYGHKPERKPFISLDADTAKILHFNGKFKPWKQTRP 487 Query: 279 DYP--------------VSQAFMEAKNASPWKNTALLK 302 P ++ + E SP + L Sbjct: 488 VGPSSNVVSRCGSKGIECAKLWWEYL--SPVADGILRH 523 >UniRef50_C7PRU3 Glycosyl transferase family 8 n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PRU3_CHIPD Length = 303 Score = 114 bits (285), Expect = 6e-24, Method: Composition-based stats. Identities = 55/327 (16%), Positives = 102/327 (31%), Gaps = 51/327 (15%) Query: 29 LDIAYGTDKNFLFGCGISIASILKY-NEGSRLCFHIFTDYFGDDDRKYFDAL--ALQYKT 85 + IA+ D L G G +I S+++ ++ ++L H + G + L Y Sbjct: 1 MHIAFVIDLPSLEGLGATITSLVRNCSDTAQLDLHFICNNLGTRHKNNLLMLLQTESYHG 60 Query: 86 RIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDAD------------ 133 R + Y + + S + + Y RF+I LDAD Sbjct: 61 RTRFYDFDAQEMFGHLSAVHGSRTSYGRFLIPKL----------LDADYVLCLDPDLLIL 110 Query: 134 --IICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGY-FNSGFLLIN 190 +I I F D +A V + E + ++ F SG LL+N Sbjct: 111 LDVITFDQIR------FEDHFLAAVPGGPFRNTLEAKLLPGQLSVCKDEQSFISGMLLLN 164 Query: 191 TAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLK 250 +W + + + + + D VLN + + ++N ++ Sbjct: 165 LRRWKERDICHEIEKICLRHGMA--LQEADNTVLNTICNGSFYHIEDRFNCIWTPGQAT- 221 Query: 251 ESFINPVTNDTIFIHYIGPTKPWHDWAWD-YPVSQAFMEA--------KNASPWKNTALL 301 P + +H+ G KPW + + Q + + + + Sbjct: 222 -----PSFKENAILHFAGAPKPWDFLGREVHAGYQRWADYDTTFWDRRYKRVAFAGLQRI 276 Query: 302 KPNNSNQLRYSAKHMLKKHRYLKGFSN 328 + RY K G N Sbjct: 277 WKIRKSLFRYYLKSFRAAGSLKLGIKN 303 >UniRef50_UPI0000587C70 PREDICTED: similar to MGC81998 protein n=1 Tax=Strongylocentrotus purpuratus RepID=UPI0000587C70 Length = 344 Score = 112 bits (281), Expect = 2e-23, Method: Composition-based stats. Identities = 49/305 (16%), Positives = 102/305 (33%), Gaps = 43/305 (14%) Query: 9 TEFLNSVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYF 68 E +SV K + N +++ +D + L G ++ SI N + + F++ D Sbjct: 43 HETRHSVQRDLSKNSSSNGTINVLICSDGSTLGGMVAAMNSI-YLNSRTHIKFYLVVDTD 101 Query: 69 GDDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVL 128 D + + + K I + + L Y R F +V+ Sbjct: 102 SLDHLSKWLSQSSLRKLDYAIKVFDESWLN------------YARLYFPKIFPGLTGRVI 149 Query: 129 YLDADIICQGTIEPLINFSFPDDKVAMVVTEGQA----------------DWWEKRAHSL 172 ++D+D I QG I L V + A ++ ++ SL Sbjct: 150 FVDSDTITQGDIAELNAIDIKPGHVVAFSDDCSAVTSRYGVIMNRYASYLNFGNEKLQSL 209 Query: 173 GVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQ-------DVLN 225 G+ + FN G + N +W Q ++A+ + K+ + Q + Sbjct: 210 GINPMECS-FNPGVFVANVDEWRKQNITAKLDYWVTVNS--KEDVYGSQRGGGHSGPPMM 266 Query: 226 MLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQA 285 ++ K +++ + L + + +H+ G KPW + + Sbjct: 267 IVFYMKYSPLPPEWHIRH-LGVTTGARYSDAFLKAAKLLHWNGRFKPW---GHNSQHTLI 322 Query: 286 FMEAK 290 + + Sbjct: 323 WEKYY 327 >UniRef50_UPI0001621115 predicted protein n=1 Tax=Physcomitrella patens subsp. patens RepID=UPI0001621115 Length = 1016 Score = 112 bits (281), Expect = 2e-23, Method: Composition-based stats. Identities = 59/371 (15%), Positives = 106/371 (28%), Gaps = 99/371 (26%) Query: 1 MQQVFFQETEFLNSVIDYDHKVETENLCLDIA--------YGTDKNFLFGCGISIASILK 52 +Q V S ++ ++ E+ +D+ TD+ L + I S + Sbjct: 374 LQAVVLDPILNAESEEGTNYSLKREDEPIDVVKREDIHVFVCTDEADLRPLAVLINSSMA 433 Query: 53 YNEGSRLCFHIFTDYFGDDDR-KYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHA-- 109 F+ + + K L + + I+ + + +N T A Sbjct: 434 NCPHPERLFYHLVMPYSQRNAAKRLKHLFPNARVEMAEKYIDIREVEEHITFRNDTGARK 493 Query: 110 ----IYFRFVIADYFINKAP---KVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQ- 161 Y + Y +++YLD+DI+ +G +E L + VA + Q Sbjct: 494 ELVSPYN--FLPFYLPKTYSEIRRIIYLDSDIVVKGNLEVLNDVDLEGHSVAAIEDCSQR 551 Query: 162 --------------ADWWEKRAHSLGVAGIAKG--YFNSGFLLINTAQWAAQQVSARAIA 205 R L K FN G L+I+T QW Q ++ + Sbjct: 552 FQVYFDFAQLDEIHKRQGPDRPKWLPDEPFNKSACVFNRGVLIIDTNQWIEQNITKAIVW 611 Query: 206 MLNEPEIIKK----------------------------------------ITHP-----D 220 ++E K +P Sbjct: 612 WMDEFRKADKKALYKYALYQKRVHKNYFCASLSLICTSSMHFSQVLIVLWYFYPSRAGMS 671 Query: 221 QDVLNMLLADKLIFADIKYNTQ----------------FSLNYQLKE-SFINPVTNDTIF 263 Q + L K D +N + NY F++P ++ Sbjct: 672 QPPFLLALYGKHKVLDETWNVRGLGRPNLSDMERIYYKKGWNYTFDRIPFMSPFADEANI 731 Query: 264 IHYIGPTKPWH 274 +H+ G KPW Sbjct: 732 LHFNGKYKPWK 742 >UniRef50_UPI0001B55E75 hypothetical protein SSPB78_11600 n=1 Tax=Streptomyces sp. SPB78 RepID=UPI0001B55E75 Length = 792 Score = 111 bits (278), Expect = 3e-23, Method: Composition-based stats. Identities = 50/270 (18%), Positives = 91/270 (33%), Gaps = 42/270 (15%) Query: 31 IAYG--TDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 +A+ D+N+L G + S+ N F + D AL RI+ Sbjct: 37 VAFASFVDENYLPGFLALLRSLALSNPEVCEDFLVLHDGLRPASLARIRAL----HPRIR 92 Query: 89 IYLINGDRLRSLPSTKNWTHAIYFR--FVIADYFI-NKAPKVLYLDADIICQGTIEPLIN 145 ++ R + + + R + + D F ++ LD D++ G + L+ Sbjct: 93 PRRVDAARYDAYAKGDQNNYLV--RKAYFLLDVFRVRDYDTIITLDTDMVVLGDLSELLR 150 Query: 146 FSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIA 205 + +A V NSG L+I + +S Sbjct: 151 LR---EGLAAVPQFFYGTHKL----------------NSGLLVI-----QREFLSDAFCE 186 Query: 206 MLNEPEIIKKITH--PDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIF 263 ++E + DQ +LN +L + +YN + + S PV DT Sbjct: 187 RIDETGLAGAYELDKHDQGILNAVLDGDFVRLPARYNFV-----KRRLSGDKPVPEDTAV 241 Query: 264 IHYIGPTKPWHDWAWDYPVSQAFMEAKNAS 293 +H+ G KPW Y ++A + + Sbjct: 242 LHFTGRHKPWQGGENGYAEAEARWRDFHLT 271 >UniRef50_Q04CN2 Lipopolysaccharide biosynthesis glycosyltransferase n=19 Tax=Lactobacillus RepID=Q04CN2_LACDB Length = 274 Score = 111 bits (277), Expect = 5e-23, Method: Composition-based stats. Identities = 58/294 (19%), Positives = 100/294 (34%), Gaps = 56/294 (19%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDR----------KYFDA 78 ++ + D N G I++ S+LK G + +I T R + + Sbjct: 1 MNFLFCGDHNAERGVLIAVLSLLKAERGEEVHVYILTMRTKSKSRSFKPFSQHAADFIRS 60 Query: 79 LALQYKTRIKIYLIN-GDRLRSLPSTKN----WTHAIYFRFVIADYFINKAPKVLYLDAD 133 L + + LI+ + P T N +T R AD ++LYLD D Sbjct: 61 LIVADNPNSSLELIDCTENFIKEPPTANMGTRFTPYAMLRLF-ADELPQIPDRILYLDDD 119 Query: 134 IICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQ 193 +I + ++ ++ V+ ++ + Y NSG LL+N + Sbjct: 120 VIIRRPVDQFYTQDLTGTELVGVLDYFGRFFFHNQKKIF-------DYLNSGVLLLNMPE 172 Query: 194 WAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESF 253 + R ++ +KK+ PDQ +N L +K A KYN Q++L Sbjct: 173 IKRTGLFKRVRHLMQ----VKKMFLPDQTAINKLAKEK-RIAPRKYNEQYAL-------- 219 Query: 254 INPVTNDTIFIHYIGP-----------TKPW-----HDWAWDYPVSQAFMEAKN 291 +DT+ H+ KPW H + E Sbjct: 220 ----QDDTVIQHFTTSFRFFPYFRTQTVKPWDVKRVHSVLNLHEYDDLLNEYLQ 269 >UniRef50_Q02ZT7 Lipopolysaccharide biosynthesis glycosyltransferase n=1 Tax=Lactococcus lactis subsp. cremoris SK11 RepID=Q02ZT7_LACLS Length = 759 Score = 110 bits (275), Expect = 9e-23, Method: Composition-based stats. Identities = 23/145 (15%), Positives = 55/145 (37%), Gaps = 7/145 (4%) Query: 17 DYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKY-NEGSRLCFHIFTDYFGDDDRKY 75 + K N + + + ++ + + SIL+ N + I + + + Sbjct: 596 KIEPKYSINN--IPVVMACNNGYMKYTSVLLQSILENANSKNNYDISILHNDISVETQNR 653 Query: 76 FDALALQYKTRIKIYLINGD--RLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDAD 133 + ++ ++ + L + + + Y+RF+I + F++ KV+Y+D D Sbjct: 654 TLKHFNKDNFSVRFVDVSAKISQYGELKTNAHISVETYYRFLIPELFVH--DKVVYIDCD 711 Query: 134 IICQGTIEPLINFSFPDDKVAMVVT 158 + + I L D+ V V Sbjct: 712 TVVEEDIAKLFEIDIEDNYVGAVRD 736 >UniRef50_C3YRN2 Putative uncharacterized protein (Fragment) n=1 Tax=Branchiostoma floridae RepID=C3YRN2_BRAFL Length = 305 Score = 109 bits (274), Expect = 1e-22, Method: Composition-based stats. Identities = 51/316 (16%), Positives = 106/316 (33%), Gaps = 43/316 (13%) Query: 28 CLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRI 87 + + TD+ L G +I SI N S + F++ TD D + + + Sbjct: 2 TIPVVISTDEGRLMGAVAAINSI-ATNSKSPVKFYLITDKDTKDHLEQWILKTRLHSINH 60 Query: 88 KIYLINGDRLRSLPSTKNW-----THAIYFRFVIADYFINKAP-KVLYLDADIICQGTIE 141 +I + N + ++ + + + Y RF + K+LYLD D+I QG I Sbjct: 61 EIIVFNEEWVKGKINVRGGRQELASPLNYARFYLPKLLPPDFNGKILYLDDDVIVQGDIT 120 Query: 142 PLINFSFPDDKVAMVVTEGQA----------------DWWEKRAHSLGVAGIAKGYFNSG 185 L N + V + ++ + LG+ FN+G Sbjct: 121 QLYNTKIDETLVMAFSEDCNTVSNRFGLFMNTYANYINFGNENVKKLGMKPGTCS-FNTG 179 Query: 186 FLLINTAQWAAQQVSARAIAML-----NEPEIIKKITHPDQDVLNMLLADKLIFADIKYN 240 + N +W Q+++ + ++ Q + ++ ++ D ++ Sbjct: 180 VFVANMTEWKNQKITTKLEFWTALNTEENVYGAQQGGGGSQPPMMIVFYNQYSKIDPMWH 239 Query: 241 TQFSLNYQLKES--FINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNT 298 + Y + + +H+ G KPW +++ W+ Sbjct: 240 IRHLGLYSWTAGTRYSKQFIMEAKLLHWNGRFKPW------------GRTSQHMDAWERY 287 Query: 299 ALLKPNNSNQLRYSAK 314 + P +QL + Sbjct: 288 YIPDPTGKSQLTRKFR 303 >UniRef50_Q04CN3 Lipopolysaccharide biosynthesis glycosyltransferase n=1 Tax=Lactobacillus delbrueckii subsp. bulgaricus ATCC BAA-365 RepID=Q04CN3_LACDB Length = 200 Score = 109 bits (272), Expect = 2e-22, Method: Composition-based stats. Identities = 31/163 (19%), Positives = 56/163 (34%), Gaps = 13/163 (7%) Query: 137 QGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIA-KGYFNSGFLLINTAQWA 195 I L ++ VA + + + GI Y NSG L++N Sbjct: 2 NADIAGLYQTELGNNLVAACHDQSVHYIEPLQTYIRDCLGIDPDKYVNSGVLVMNCLAMR 61 Query: 196 AQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFIN 255 + + + +L+ + PDQD LN + + ++ D +++ + + Sbjct: 62 DEDFVDKFLHLLSTYQFNS--IAPDQDYLNEICSGRIKLLDPRWDAMPN--------DFD 111 Query: 256 PVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNT 298 P IHY KPWH Y F + +P+ Sbjct: 112 PEMTGPYLIHYNLFYKPWHFEEVKY--GSYFWQVAKETPFYKD 152 >UniRef50_C5FDY7 Glycogenin n=1 Tax=Microsporum canis CBS 113480 RepID=C5FDY7_NANOT Length = 731 Score = 108 bits (269), Expect = 5e-22, Method: Composition-based stats. Identities = 52/290 (17%), Positives = 88/290 (30%), Gaps = 43/290 (14%) Query: 32 AYGT---DKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 Y T N+L G + S+ RL + D D L Y I Sbjct: 8 VYCTILLSDNYLPGAMVLAHSLRDNGTKGRLAVLVTLDNLQPG---IIDELKTVYDDVIP 64 Query: 89 IYLINGDRLRSLPSTKNWT-HAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 I I +L + + + + + +++Y+DAD+I + L+ Sbjct: 65 IPRIENSYPGNLYLMDRPDLISTFSKIALWK--QTQYDRIVYIDADVIALRAPDELLTLD 122 Query: 148 FPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLI--NTAQWAAQQVSARAIA 205 F +A V G D FN+G +++ N + A Sbjct: 123 FKS--IAAVPDIGWPDC-----------------FNTGVIVLRPNLKDY---------YA 154 Query: 206 MLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIH 265 +L + DQ +LNM + YN S +YQ ++ + +H Sbjct: 155 LLAFAQRGISFDGADQGLLNMHFKN-WDRLSFTYNCTPSGHYQYVPAYRY-FESTISLVH 212 Query: 266 YIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKH 315 +IG KPW P + + W +H Sbjct: 213 FIGSLKPWRIGRSSSPQQSPYNQLLAK--WWAVYDRHYRTGPIYIPQPRH 260 >UniRef50_A2DXT6 Glycosyl transferase family 8 protein n=1 Tax=Trichomonas vaginalis RepID=A2DXT6_TRIVA Length = 334 Score = 107 bits (267), Expect = 6e-22, Method: Composition-based stats. Identities = 58/263 (22%), Positives = 99/263 (37%), Gaps = 37/263 (14%) Query: 43 CGISIASILKYNEGSRLCFHIFTDYFGD--DDRKYFDALALQYKTRIKIYLINGDRLRSL 100 G ++ S + L ++ T + + +K D + ++Y T I Y N L + Sbjct: 79 LGPAMYSAINSLSNYSLTIYVCTTHPEEVPIFQKRLDQIKIKYVTFIAEYH-NVSSLTTK 137 Query: 101 PSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEG 160 + Y R V +D + + L LD D + G+ + F D +V+ Sbjct: 138 YHLGIIS-ETYIRIVFSDAHP-ELERFLQLDGDTLVTGSFDEFYFAYFNDTYAVVVLDIW 195 Query: 161 QADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPD 220 + K YFN G ++ N ++ +++ + L E E+ + + D Sbjct: 196 KEYEGFK------------NYFNCGSVVFNCQKFRDDKMADKVRTKLKEYEVTRGEWNND 243 Query: 221 QDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIG----PTKPWHDW 276 Q VLN + DK I A KYN F+ +T T H+ G P KP + Sbjct: 244 QTVLNDIFGDKKIIAHKKYN-----------EFMPSLTMQTRIFHFYGLKKKPYKP-NIK 291 Query: 277 AWDYPVS--QAFMEAKNAS--PW 295 + Y + + N S PW Sbjct: 292 SNKYYFRLWRCYFHYFNNSINPW 314 >UniRef50_Q68CQ7 Glycosyltransferase 8 domain-containing protein 1 n=45 Tax=Euteleostomi RepID=GL8D1_HUMAN Length = 371 Score = 105 bits (263), Expect = 2e-21, Method: Composition-based stats. Identities = 56/325 (17%), Positives = 108/325 (33%), Gaps = 53/325 (16%) Query: 7 QETEFLNSVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFT- 65 Q +F+ + + + E + + IA D+ L G +I SI ++N S + F+I T Sbjct: 46 QPIDFVPNALRHAVDGRQEEIPVVIAASEDR--LGGAIAAINSI-QHNTRSNVIFYIVTL 102 Query: 66 --------DYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIA 117 + D K + + ++ + G ++ + RF + Sbjct: 103 NNTADHLRSWLNSDSLKSIRYKIVNFDPKL----LEGKVKEDPDQGESMKPLTFARFYLP 158 Query: 118 DYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTE------------------ 159 + A K +Y+D D+I QG I L N + A + Sbjct: 159 -ILVPSAKKAIYMDDDVIVQGDILALYNTALKPGHAAAFSEDCDSASTKVVIRGAGNQYN 217 Query: 160 --GQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLN--------E 209 G D+ ++R L + FN G + N +W Q ++ + + Sbjct: 218 YIGYLDYKKERIRKLSMKASTCS-FNPGVFVANLTEWKRQNITNQLEKWMKLNVEEGLYS 276 Query: 210 PEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIGP 269 + IT P L ++ + D +N + L + + +H+ G Sbjct: 277 RTLAGSITTP---PLLIVFYQQHSTIDPMWNVRH-LGSSAGKRYSPQFVKAAKLLHWNGH 332 Query: 270 TKPWHDWAWDYPVSQAFMEAKNASP 294 KPW + + + P Sbjct: 333 LKPW---GRTASYTDVWEKWYIPDP 354 >UniRef50_A5BZU1 Putative uncharacterized protein n=1 Tax=Vitis vinifera RepID=A5BZU1_VITVI Length = 648 Score = 105 bits (263), Expect = 2e-21, Method: Composition-based stats. Identities = 48/288 (16%), Positives = 97/288 (33%), Gaps = 21/288 (7%) Query: 13 NSVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIAS-ILKYNEGSRLCFHIFTDYFGDD 71 + + K+ + A +D + + I S +L +E + FHI TD Sbjct: 362 QKRVVLNKKLLEDPSLYHYAIFSDN--VLATSVVINSTMLXASEPEKHVFHIVTDKLSFA 419 Query: 72 DRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLD 131 K + + ++ I + N D ++ H RF + + + K K+L+LD Sbjct: 420 AMKMW--FLVNSPAKVTIQVENIDDFKNPKYLSMLNH---LRFYLPEVYP-KLEKILFLD 473 Query: 132 ADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYF--NS----- 184 DI+ Q + PL + A V T ++ + + I++ F N+ Sbjct: 474 DDIVVQKDLTPLWSLDMQGMVNAAVETCKESFHRFDKYLNFSHPKISEN-FDPNACGWAF 532 Query: 185 GFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFS 244 G + + +W + ++ + E + + D ++ Sbjct: 533 GMNMFDLKEWRKRNMTGIYHYWQDMNEDRTLWKLGSLPPGLITFYNLTYPLDRSWHVL-G 591 Query: 245 LNYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNA 292 L Y + ++ +HY G KPW + A + Sbjct: 592 LGYDPQ--LNQTEIDNAAVVHYNGNYKPWLELAIA-KYKSYWSRYVMP 636 >UniRef50_B3XPR6 Putative uncharacterized protein n=3 Tax=Lactobacillus RepID=B3XPR6_LACRE Length = 673 Score = 104 bits (260), Expect = 5e-21, Method: Composition-based stats. Identities = 41/248 (16%), Positives = 75/248 (30%), Gaps = 33/248 (13%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIY 90 I + + L + SI N + +I + + + + I Sbjct: 4 IVLCANYDKLDQIETVLKSIYINNNDVKT--YIINSDIAHEWFVNINYFLEKINSEIIDA 61 Query: 91 LINGDRLRSLPSTKNWTHAI--YFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSF 148 I+ +R LP KN A Y +F+I + KVLYL + I ++ L Sbjct: 62 KIDLNRFNELPELKNANMAKIEYGKFLIPELI--NEDKVLYLGNNTIIDQNLDSLFAIDI 119 Query: 149 PDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLN 208 D + V D FN + IN W + + + + Sbjct: 120 EDKPLYATVDFVHPDK-----------------FNMDVMFINNIYWRNNNIGNQFLELGK 162 Query: 209 EPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKE-----SFINPVTNDTIF 263 +++ Q ++N + YN Q + E + +D Sbjct: 163 HYDLVDA-----QAMINDGFRVNIGKLPAIYNYQIGIGDPNFEPVISYRYYEDAIDDPAI 217 Query: 264 IHYIGPTK 271 I Y ++ Sbjct: 218 IQYPTSSR 225 >UniRef50_A2E3L1 Glycosyl transferase family 8 protein n=2 Tax=Trichomonas vaginalis RepID=A2E3L1_TRIVA Length = 319 Score = 103 bits (258), Expect = 7e-21, Method: Composition-based stats. Identities = 45/278 (16%), Positives = 87/278 (31%), Gaps = 34/278 (12%) Query: 28 CLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRI 87 +++ + + G +I + + + + F+ T RKY + Sbjct: 54 TMNVLFSWSGE-IQNLGPAIYAFINAHPNQHITFY-LTTTQSKYIRKYRAYFDKWPIHNV 111 Query: 88 KIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 K + + Y R + + + LYLD D I I + + Sbjct: 112 KFFFEVHQTKYLAKRVIRVPYDAYIRLLFPNAHPG-LERFLYLDGDAIVLHNINDMYYYD 170 Query: 148 FPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAML 207 F + +++ L YFNSG ++ N ++ + +A L Sbjct: 171 FQNKSAIVILDH------------LSECEGFSRYFNSGVMMFNNWKYVQENFLKQAEDYL 218 Query: 208 NEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYI 267 EI + + DQ LN + I YN E + + +T H+ Sbjct: 219 KWLEINRGVWFNDQTPLNKIFEHNRIEFPQDYN----------EWNMTRFSKNTKIAHFY 268 Query: 268 GPT---KPWHDWAWD----YPVSQAFMEAKNASP--WK 296 KP+ + + + F +A P W+ Sbjct: 269 DNDWACKPYKASCNRTDLPFELWRCFYKAFKEQPNLWR 306 >UniRef50_B2KBT4 Glycosyl transferase family 8 n=1 Tax=Elusimicrobium minutum Pei191 RepID=B2KBT4_ELUMP Length = 320 Score = 103 bits (257), Expect = 1e-20, Method: Composition-based stats. Identities = 50/325 (15%), Positives = 113/325 (34%), Gaps = 31/325 (9%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEG--SRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 I + ++ FLF +++ S+ K + +F F + D+ + + + Sbjct: 7 IFFACNQRFLFTLAVALLSLKKNSPKALENSDVLVFYQGFNEQDKALLNKI---LPCKFF 63 Query: 89 IYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSF 148 Y + + K+++ + R+ I D + KVLY+D D++ G + + ++ Sbjct: 64 EYKFAVETNFDHINFKHFSQLTFARYEIFDML-DTYKKVLYIDVDVMIGGELNYIFE-NY 121 Query: 149 PDDK-VAMVVTE--GQADWWEKRAHSLGVAGIAKGYFNSGFLLI-NTAQWAAQQVSARAI 204 D VAM G + + + + +N+G L + + + Sbjct: 122 GDKTGVAMCEDTQKGLTLITKNFVNPMPQYDMTLPCYNAGVTLFCDNIKDRQH---LKMW 178 Query: 205 AMLNEPEIIKKITHPDQDVLNMLLAD---KLIFADIKYNTQFSLNYQLKESFINPVTNDT 261 E + + PDQ V+N++ + + N S +++ D Sbjct: 179 CYERTAEWLDNLVCPDQGVVNVMFQEFGITVEVLPDICNCLPSNP-----KYLDKRRKDI 233 Query: 262 IFIHYI-GPTKPWHDWAWDYPVSQAFMEAKNA--SPWKNTA-----LLKPNNSNQLRYSA 313 + H G + W + W+ P + + E +P + +K N + R+ Sbjct: 234 LIYHCAGGGVRFW-TYTWNAPWQKFYKEYLELGGAPHPDKEHAWLKFIKKYNLQRFRFFD 292 Query: 314 KHMLKKHRYLKGFSNYLFYFIEKIK 338 + + + L Y + Sbjct: 293 RSPDPQMHPARFLKYLLIYPFKYAF 317 >UniRef50_B4WN64 Glycosyl transferase family 8 n=1 Tax=Synechococcus sp. PCC 7335 RepID=B4WN64_9SYNE Length = 289 Score = 103 bits (256), Expect = 1e-20, Method: Composition-based stats. Identities = 54/289 (18%), Positives = 98/289 (33%), Gaps = 30/289 (10%) Query: 27 LCLDIAYGTDKNFLFGCGISIASILKYNEG----SRLCFHIF-----TDYFGDD------ 71 + +DIA ++ + I SIL L F+I + +F ++ Sbjct: 1 MPVDIALSVNRTLQVPLLVVINSILTNTTHRTEEVPLRFNIVVPIGESAFFEEELKQAFS 60 Query: 72 ---DRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAI-YFRFVIADYFINKAPKV 127 D + + ++ + ++ R + + + Y R D F ++ Sbjct: 61 AKYDCERVEFRVKEFTPPSYLKQYLDNKFREKKQERRLSRYMQYARLFFKDVFP-DIARM 119 Query: 128 LYLDADIICQGTIEPLI---NFSFPDDKVAMVVTEGQA-DWWEKRAHSLGVAGIAKGYFN 183 +Y DADII G + L N + +A V A ++ K FN Sbjct: 120 IYFDADIIVLGNVRSLFTQGNILTSQNYLAAVPQFFPAIFYFSNPLKVFSDLRKFKSTFN 179 Query: 184 SGFLLINTAQWAAQ--QVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNT 241 SG LL + + W Q ++ + L+E + D+ V N++ D I ++N Sbjct: 180 SGVLLTDLSFWTDQTYKLLKHYLE-LDEKNNYRLYHLGDETVFNLMFKDTYIPLTKQWNC 238 Query: 242 QFSLNYQLKESFINPVTNDTIFIHYI-GPTKPWHDWAWDYPVSQAFMEA 289 + + IH+ G KPW Y S + Sbjct: 239 CGYGQAHWVAKLLWKNPENMKAIHWSGGHHKPWQSKQVIY--SDLWRSY 285 >UniRef50_Q9FH36 Similarity to unknown protein n=28 Tax=Embryophyta RepID=Q9FH36_ARATH Length = 535 Score = 102 bits (254), Expect = 2e-20, Method: Composition-based stats. Identities = 45/327 (13%), Positives = 98/327 (29%), Gaps = 66/327 (20%) Query: 25 ENLCLDIAYGTDKNFLFGCGISIASILKYN-EGSRLCFHIFTD----------------- 66 +N +D + + S+++ ++ HI TD Sbjct: 205 DNNYFHFVLASDN--ILAASVVAKSLVQNALRPHKIVLHIITDRKTYFPMQAWFSLHPLS 262 Query: 67 -------------YFGD------DDRKYFDALALQYKTRIKIYLINGDR------LRSLP 101 + + + + Q++ + + N + Sbjct: 263 PAIIEVKALHHFDWLSKGKVPVLEAMEKDQRVRSQFRGGSSVIVANNKENPVVVAAKLQA 322 Query: 102 STKNWTHAI-YFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEG 160 + + + + R + + F KV++LD DI+ Q + PL + V T Sbjct: 323 LSPKYNSLMNHIRIHLPELFP-SLNKVVFLDDDIVIQTDLSPLWDIDMNGKVNGAVETCR 381 Query: 161 QADWWEKRAHSLGVAGIAKG----YFNS-------GFLLINTAQWAAQQVSARAIAMLNE 209 D + + FN G + + A W +S+ L+E Sbjct: 382 GEDKFVMSKKFKSYLNFSNPTIAKNFNPEECAWAYGMNVFDLAAWRRTNISSTYYHWLDE 441 Query: 210 PEIIKKITHPDQDVLN---MLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHY 266 + ++ L + + D + L YQ S+ + +H+ Sbjct: 442 N-LKSDLSLWQLGTLPPGLIAFHGHVQTIDPFW-HMLGLGYQETTSYADA--ESAAVVHF 497 Query: 267 IGPTKPWHDWAWDYPVSQAFMEAKNAS 293 G KPW D A+ + + + + ++S Sbjct: 498 NGRAKPWLDIAFPH-LRPLWAKYLDSS 523 >UniRef50_B9IA47 Glycosyltransferase n=7 Tax=rosids RepID=B9IA47_POPTR Length = 531 Score = 102 bits (254), Expect = 2e-20, Method: Composition-based stats. Identities = 46/321 (14%), Positives = 94/321 (29%), Gaps = 65/321 (20%) Query: 30 DIAYGTDKNFLFGCGISIASILKYNEG-SRLCFHIFTDYF-------------------- 68 + TD + + I+S ++++ +L FHI TD Sbjct: 208 HVVLLTDN--VLAASVVISSTVQHSANPEKLVFHIVTDKKTYIPMNAWFAINPIKSAAVE 265 Query: 69 ---------GDDDRKYFDALALQYKTRIKIYLIN--GDRLRSLPSTKN---------WTH 108 + + + ++ Y N + + + Sbjct: 266 VKGLHQYDWSHEVNVHVKEMLEIHRLIWSHYNDNLRNANFQHEGVNRRSLEALTPSCLSL 325 Query: 109 AIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKR 168 + R I + F K+++LD D++ Q + L V VV D Sbjct: 326 LNHLRIYIPELFP-DLNKIVFLDEDVVVQHDMSSLWELDLNKKVVGAVVDSWCGDNCCPG 384 Query: 169 AHSLGVAGIAKGYFNS-----------GFLLINTAQWAAQQVSARAIAMLNEP-----EI 212 + +S G + + W +++ L E+ Sbjct: 385 KKYKDYLNFSYPIISSNFDHDRCVWLYGVNVFDLEAWRRVKITTNYHKWLKHNLNFGMEL 444 Query: 213 IKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIGPTKP 272 + HP L + ++ D ++ L Y+ ++ + D +H+ GP KP Sbjct: 445 WQPGVHP--PAL-LAFEGQVHPIDPSWHV-GGLGYRPPQAHNIKMLGDAAVLHFSGPAKP 500 Query: 273 WHDWAWDYPVSQAFMEAKNAS 293 W D + + + N S Sbjct: 501 WLDIGFP-ELRSLWNRHVNFS 520 >UniRef50_UPI000180B580 PREDICTED: similar to glycosyltransferase 8 domain containing 4 n=1 Tax=Ciona intestinalis RepID=UPI000180B580 Length = 390 Score = 102 bits (254), Expect = 2e-20, Method: Composition-based stats. Identities = 53/246 (21%), Positives = 98/246 (39%), Gaps = 29/246 (11%) Query: 45 ISIASILKYNEGSRLCFHIFTDYFGDDDRKYFD----ALALQYKTRIK--IYLINGDRLR 98 I S + ++ SR+ HIFT+ D+ RK F + ++ I +Y ++ + L+ Sbjct: 74 TMIKSAIIFSR-SRIHVHIFTENLEDEFRKEFATWPLSAKRKFLLTIHPLVYPLDPEELK 132 Query: 99 SLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLIN--FSFPDDKVAMV 156 + + W FR + +K VLY+D D+I E L + F D +VA + Sbjct: 133 VMKNW--WGPCASFRLFLPTVL-DKTDSVLYVDTDVIFLTPPEELWRHFYLFNDRQVAAL 189 Query: 157 VTEGQADW--WEKRAHSLGVAGIAKGYFNSGFLLINTAQWAA-----QQVSARAIAMLNE 209 + A+ + + K NSG L+N + + S + I+ + Sbjct: 190 APRVGWSFQVPNDNANFIRMQDGKKTQVNSGVFLMNLTRMRQPVFATESESRQKISWNKK 249 Query: 210 -----PEIIKKITHPDQDVLNMLLA---DKLIFADIKYNTQFSLNYQ-LKESFINPVTND 260 K+ + DQ+++N++ D + F KYN + +E + +D Sbjct: 250 LLFPLYRKHKEDMYGDQNLINLVFHYNPDLIYFLPCKYNYHHKFCFDAYRERWCTSAESD 309 Query: 261 -TIFIH 265 IH Sbjct: 310 GAAVIH 315 >UniRef50_D1HMA0 Whole genome shotgun sequence of line PN40024, scaffold_108.assembly12x (Fragment) n=9 Tax=rosids RepID=D1HMA0_VITVI Length = 511 Score = 101 bits (251), Expect = 5e-20, Method: Composition-based stats. Identities = 38/212 (17%), Positives = 66/212 (31%), Gaps = 26/212 (12%) Query: 99 SLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVT 158 S K + + R I + F KV++LD D++ Q + PL V T Sbjct: 297 QARSPKYISLLNHLRIYIPELFP-NLNKVVFLDDDVVIQRDLSPLWEIDLEGKVNGAVET 355 Query: 159 EGQADWWEKRAHSLGVAGIAKG--------------YFNSGFLLINTAQWAAQQVSARAI 204 D W + Y G + + + W + Sbjct: 356 CRGEDEWVMSKRFRNYFNFSHPLIAKNLNPDECAWAY---GMNIFDLSAWRKTNIRETYH 412 Query: 205 AMLNEPEIIKKITHPDQDVLN---MLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDT 261 + L E + +T L + + D + L YQ K + + Sbjct: 413 SWLKEN-LKSNLTMWKLGTLPPALIAFKGHIHPIDPSW-HMLGLGYQNKTNIDS--VKKA 468 Query: 262 IFIHYIGPTKPWHDWAWDYPVSQAFMEAKNAS 293 IHY G +KPW +++ + + + N S Sbjct: 469 AVIHYNGQSKPWLQIGFEH-LRPFWTKYVNYS 499 >UniRef50_B5ZNF8 Glycosyl transferase family 8 n=7 Tax=Rhizobium RepID=B5ZNF8_RHILW Length = 303 Score = 100 bits (250), Expect = 6e-20, Method: Composition-based stats. Identities = 54/311 (17%), Positives = 108/311 (34%), Gaps = 39/311 (12%) Query: 24 TENLCLDIAYGTDKNFLFGCGISIASILKY----NEGSRLCFHIFTDYFGDDDRKYFDAL 79 N C + Y TD + F +I S L+ + + +C + +++ D+ +L Sbjct: 1 MNNQC--VVYVTDVEYSFP---TILSALQARKFASPATDVCV-LMSEHL--DNFDELRSL 52 Query: 80 ALQYKTRI----KIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADII 135 + + + +L + + + V+ + +++YLD D Sbjct: 53 LATSGVDLIDATEALQDSLGKLDGSHFQGRISVSTMAKLVLCEILPANYTQIIYLDGDTQ 112 Query: 136 CQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWA 195 + L N P+ + H G YFN+G L + W Sbjct: 113 IVSDLGGLENALVPEGRFFAARDY-------TAIHDFLDTGKNSHYFNAGVLKFHRNGW- 164 Query: 196 AQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFIN 255 + A+ + + H DQ LN + LI ++N + F++ Sbjct: 165 ---IGQEALELFARNPEACEGKH-DQGALNYVCGSSLILVSNRWNF--------PKQFLH 212 Query: 256 PVTNDTI-FIHYIGPTKPWH--DWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYS 312 V + +HY+ KPWH + W SQ +++ + A P N+ + Y Sbjct: 213 LVNMSALSIVHYMAHPKPWHGTFFPWTDRESQVYVDLRKAHPIYNSLYRGITFDRKALYK 272 Query: 313 AKHMLKKHRYL 323 + M + ++ Sbjct: 273 YRSMRARIKHA 283 >UniRef50_B6HCQ7 Pc18g02120 protein n=2 Tax=mitosporic Trichocomaceae RepID=B6HCQ7_PENCW Length = 711 Score = 100 bits (250), Expect = 7e-20, Method: Composition-based stats. Identities = 53/271 (19%), Positives = 88/271 (32%), Gaps = 54/271 (19%) Query: 32 AYGT---DKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 Y T N+L G + S+ +RL D D L Y I Sbjct: 8 VYCTLLLSDNYLPGAMVLAHSLRDNGTKARLVALFTPDRLQSST---IDELRSVYDELIP 64 Query: 89 IYLINGDRLRSLPSTKN------WTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEP 142 + + D +L +T +R + +V+Y+D D++ + Sbjct: 65 VSSMVNDTPANLWLMDRPDLIATFTKIELWRL-------TQYQRVVYIDCDVVALRAPDE 117 Query: 143 LINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLI--NT-AQWAAQQV 199 L++ A G D FNSG +++ N +A + + Sbjct: 118 LLSLE---ADFAAAPDVGWPDC-----------------FNSGMMVLRPNLQDYYALRAL 157 Query: 200 SARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTN 259 + R I+ DQ +LNM D YN S NYQ ++ + Sbjct: 158 AQRGISF----------DGADQGLLNMHFRD-WHRLSFTYNCTPSANYQYIPAY-KHFQS 205 Query: 260 DTIFIHYIGPTKPWHDWAWDYPVSQAFMEAK 290 IH+IG KPW+ P+ + + Sbjct: 206 TISLIHFIGARKPWNMPRQIVPLESPYNQLL 236 >UniRef50_B4QUA9 GD18236 n=2 Tax=Sophophora RepID=B4QUA9_DROSI Length = 511 Score = 100 bits (249), Expect = 9e-20, Method: Composition-based stats. Identities = 44/254 (17%), Positives = 86/254 (33%), Gaps = 14/254 (5%) Query: 21 KVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRL-CFHIFT-DYFGDDDRKYFDA 78 K +T L I + + I S + +N F IFT D GD+ R+ Sbjct: 189 KRQTGKPPLYIVVVCCGQRVQETLVMIKSAILFNYDEEYLKFVIFTEDGKGDEFREKLTD 248 Query: 79 LALQYKTRIKIYLINGDRLRSLPSTKN--WTHAIYFRFVIADYFINKAPKVLYLDADIIC 136 ++ + R + +LY+D DI+ Sbjct: 249 WRDIKPFTFDFEILPLKFPSGNEVEWRNLFKPCAAQRLFLPSLL-THVDSLLYVDTDILF 307 Query: 137 QGTIEPLINF--SFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQW 194 I + F F + +++ + E + + + NSG +L+N + Sbjct: 308 LSPISDIWRFFKKFNETQMSALTPEHENENIGWYNRFARHPFYGRLGVNSGVMLMNLTRM 367 Query: 195 AAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLA---DKLIFADIKYNTQFSLNYQLKE 251 + + +++ E ++ +I DQD++N+L DKL +YN + + Sbjct: 368 REMKWEQQIVSIHKEYKL--RIIWGDQDIINILFYYHPDKLYIMPCEYNYRPDHCMYMSI 425 Query: 252 SFINPVTNDTIFIH 265 ++ IH Sbjct: 426 CNMSHAG--VKVIH 437 >UniRef50_O95461 Glycosyltransferase-like protein LARGE1 n=84 Tax=Metazoa RepID=LARGE_HUMAN Length = 756 Score = 100 bits (249), Expect = 9e-20, Method: Composition-based stats. Identities = 43/264 (16%), Positives = 93/264 (35%), Gaps = 18/264 (6%) Query: 20 HKVETENLCLDIAY-GTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDA 78 V + + +A N + S+L + + L FH+ D + Sbjct: 129 QPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFH-RRNPLHFHLIADSIAEQILATLFQ 187 Query: 79 LALQYKTRIKIYLINGDRLRSLPS-TKNWTHAIY--FRFVIADYFINKAPKVLYLDADII 135 + R+ Y + + K+++ IY + V+ +V+ LD DI Sbjct: 188 TWMVPAVRVDFYNADELKSEVSWIPNKHYS-GIYGLMKLVLTKTLPANLERVIVLDTDIT 246 Query: 136 CQGTIEPLINF--SFPD-DKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTA 192 I L F + +V + + + +GY N+G +L+ Sbjct: 247 FATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY-NTGVILLLLD 305 Query: 193 QWAAQQVSARAIAMLNEPEIIKKI--THPDQDVLNMLLADK---LIFADIKYNTQFSLNY 247 + + + + E E++ + + DQD+ N ++ + +N Q S + Sbjct: 306 KLRKMK-WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHT 364 Query: 248 QLKESFINPVTNDTIFIHYIGPTK 271 + ++ + + +D IH+ P K Sbjct: 365 RSEQCYRD--VSDLKVIHWNSPKK 386 >UniRef50_Q0U987 Putative uncharacterized protein n=1 Tax=Phaeosphaeria nodorum RepID=Q0U987_PHANO Length = 583 Score = 99 bits (248), Expect = 1e-19, Method: Composition-based stats. Identities = 47/255 (18%), Positives = 78/255 (30%), Gaps = 54/255 (21%) Query: 32 AYGT---DKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 Y T ++L G + S+ +L I + D L Y I Sbjct: 8 VYCTLLMSDSYLPGAAVLAHSLRDAGTKKKLAVLITLETLSADT---ITQLKELYDYLIP 64 Query: 89 IYLI------NGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEP 142 + I N + + +T +R + K++YLDAD++ ++ Sbjct: 65 VERIRTPSPANLYLMGRPDLSFAFTKIALWR-------QTQFRKIVYLDADVVALRALDE 117 Query: 143 LINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLI--NT-AQWAAQQV 199 L + A G D FNSG ++I + W Sbjct: 118 LFDIE---APFAAAPDIGWPDA-----------------FNSGVMVISPDMGEYW----- 152 Query: 200 SARAIAMLNEPEIIKKITHPDQDVLNMLLADK-LIFADIKYNTQFSLNYQLKESFINPVT 258 A+ DQ +LN + YN + YQ + ++ Sbjct: 153 -----ALQTMAATGDSFDGADQGLLNQYFEHRPWQRLKFTYNCTPNAEYQWEPAYRY-YK 206 Query: 259 NDTIFIHYIGPTKPW 273 D +H+IG KPW Sbjct: 207 RDISAVHFIGKEKPW 221 >UniRef50_Q6Z5D6 Glycosyltransferase family-like n=6 Tax=Poaceae RepID=Q6Z5D6_ORYSJ Length = 726 Score = 99 bits (248), Expect = 1e-19, Method: Composition-based stats. Identities = 55/321 (17%), Positives = 103/321 (32%), Gaps = 53/321 (16%) Query: 13 NSVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNE-GSRLCFHIFTDYFG-- 69 N Y+ K+E L A +D + G + + S + + + FHI TD Sbjct: 404 NKHFPYEEKLEDPKL-QHYALFSDN--VLGAAVVVNSTIIHAKTPENHVFHIVTDKLNYA 460 Query: 70 --------------------DDDRKYFDA----LALQYKTRIKIYLINGDRLRSLPSTKN 105 +D + ++ + Q +++ I + + Sbjct: 461 AMRMWFLENSQGKAAIEVQNIEDFTWLNSSYSPVLKQLESQFMINYYFKTQQDKRDNNPK 520 Query: 106 WTHAIYF------RFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTE 159 + + Y RF + + F K KVL+LD DI+ Q + L + + T Sbjct: 521 FQNPKYLSILNHLRFYLPEIFP-KLNKVLFLDDDIVVQQDLSALWSIDLKGKVNGAIQTC 579 Query: 160 GQADWWEKRAHSLGVAGIAKG---------YFNSGFLLINTAQWAAQQVSARAIAMLNEP 210 G+ R + IAK Y G + + ++W + ++ + Sbjct: 580 GETFHRFDRYLNFSNPLIAKNFERRACGWAY---GMNMFDLSEWRKRNITDVYHYWQEQN 636 Query: 211 EIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIGPT 270 E + ++ D K++ L Y K + IHY G Sbjct: 637 EHRLLWKLGTLPAGLVTFWNQTFPLDHKWHLL-GLGY--KPNVNQKDIEGAAVIHYNGNR 693 Query: 271 KPWHDWAWDYPVSQAFMEAKN 291 KPW + A + + + N Sbjct: 694 KPWLEIAMA-KYRKYWSKYVN 713 >UniRef50_A2RAV0 Catalytic activity: UDP-glucose + glycogenin <=> UDP + glucosylglycogenin n=2 Tax=Aspergillus RepID=A2RAV0_ASPNC Length = 767 Score = 99 bits (248), Expect = 1e-19, Method: Composition-based stats. Identities = 47/265 (17%), Positives = 87/265 (32%), Gaps = 42/265 (15%) Query: 32 AYGT---DKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 Y T ++L G + S+ ++L D + A+ Y I Sbjct: 7 VYCTLLLSDHYLPGATVLAHSLRDNGSKAKLVALFTPDSLQPATIQELQAV---YDELIP 63 Query: 89 IYLINGDRLRSLPSTKNWT-HAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 ++ + +L A + + + + +++Y+D D++ + L++ Sbjct: 64 VHPLTNITPANLWLMDRPDLIATFTKIELWR--QTQYKRIVYIDCDVVALRAPDELLDLE 121 Query: 148 FPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLI--NTAQWAAQQVSARAIA 205 A V G D FNSG +++ N + +A Sbjct: 122 VD---FAAVPDVGWPDC-----------------FNSGVMVLRPNLQDY---------LA 152 Query: 206 MLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIH 265 + E DQ +LNM D YN S NYQ ++ + IH Sbjct: 153 LRALAERGISFDGADQGLLNMHFRD-WHRLSFSYNCTPSANYQYIPAY-KHFQSTISMIH 210 Query: 266 YIGPTKPWHDWAWDYPVSQAFMEAK 290 +IG KPW+ P+ + + Sbjct: 211 FIGAQKPWNMARQVEPIHSPYNQLL 235 >UniRef50_Q726Y5 Glycosyl transferase, family 8 n=4 Tax=Desulfovibrio vulgaris RepID=Q726Y5_DESVH Length = 303 Score = 99.2 bits (246), Expect = 2e-19, Method: Composition-based stats. Identities = 44/264 (16%), Positives = 94/264 (35%), Gaps = 36/264 (13%) Query: 23 ETENLCLDIAYGTDKN--FLFGCGISIASI-LKYNEGSRLCFHIFTD-YFGDDDRKYFDA 78 + + + + D N +L +++ S+ L + + + +C + D D R+ Sbjct: 19 RRKAPTMKVVFCIDDNPRYLLMLRVAVRSLRLLHPDITCVCVYAGDDTGVMDAVREEGVL 78 Query: 79 LALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQG 138 LA +Y+ + I R + A ++ + VLY D+D++ + Sbjct: 79 LA-RYRPVLDATTIPAAFHRCIGCFLKLELA-----LVPEL--AAESHVLYCDSDVLFRR 130 Query: 139 TIEPLINFSFPDDKVAMVVTEGQA---------DWWEKRAHSLGVAGIAKGYFNSGFLLI 189 ++ L+ + M + W R + + + Y +SG +L Sbjct: 131 PLDDLLA--LRPPYMGMAREDTAPFFHDHAELDYMWRGRRYVVPLPFPIWTY-SSGVVLF 187 Query: 190 NTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQL 249 N + +A +++I + DQ +LN ++ D ++N Sbjct: 188 NLERLRRHDHVHNFLAFCAGN--VQRIGNLDQSLLNYFFGKRITKLDPRWNCPP------ 239 Query: 250 KESFINPVTNDTIFIHYIGPTKPW 273 + + IH+ GP KPW Sbjct: 240 ---YRQAALAEGHIIHFHGP-KPW 259 >UniRef50_Q9H1C3 Glycosyltransferase 8 domain-containing protein 2 n=29 Tax=Euteleostomi RepID=GL8D2_HUMAN Length = 349 Score = 99.2 bits (246), Expect = 2e-19, Method: Composition-based stats. Identities = 37/275 (13%), Positives = 93/275 (33%), Gaps = 38/275 (13%) Query: 45 ISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTK 104 +I SI N + + F++ + + + I ++ + + + Sbjct: 67 AAINSI-YSNTDANILFYVV--GLRNTLTRIRKWIEHSKLREINFKIVEFNPMVLKGKIR 123 Query: 105 NWT-------HAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVV 157 + + RF + I++ KV+YLD D+I QG I+ L + + A Sbjct: 124 PDSSRPELLQPLNFVRFYLP-LLIHQHEKVIYLDDDVIVQGDIQELYDTTLALGHAAAFS 182 Query: 158 TEGQADWWEKRAHSLGVAGIAKGY-------------------FNSGFLLINTAQWAAQQ 198 + + +G+ GY FN G ++ N +W Q+ Sbjct: 183 DDCDLPSAQDINRLVGLQNTYMGYLDYRKKAIKDLGISPSTCSFNPGVIVANMTEWKHQR 242 Query: 199 VSARAIAMLNEPEIIKKITHPDQ------DVLNMLLADKLIFADIKYNTQFSLNYQLKES 252 ++ + + + + + + + ++ K + ++ + L + Sbjct: 243 ITKQLEKWMQKN-VEENLYSSSLGGGVATSPMLIVFHGKYSTINPLWHIRH-LGWNPDAR 300 Query: 253 FINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFM 287 + + +H+ G KPW + + +++ Sbjct: 301 YSEHFLQEAKLLHWNGRHKPWDFPSVHNDLWESWF 335 >UniRef50_Q9VBY3 CG9996 n=6 Tax=Sophophora RepID=Q9VBY3_DROME Length = 362 Score = 98.4 bits (244), Expect = 3e-19, Method: Composition-based stats. Identities = 41/239 (17%), Positives = 84/239 (35%), Gaps = 12/239 (5%) Query: 20 HKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRL-CFHIFT-DYFGDDDRKYFD 77 ++++T L I + + + I S + +N F IFT D GD+ R+ Sbjct: 39 NELQTGKPPLYIVVVSCGQRVQETLVMIKSAILFNYDEEYLKFVIFTEDGKGDEFREKLT 98 Query: 78 ALALQYKTRIKIYLINGDRLRSLPSTKN--WTHAIYFRFVIADYFINKAPKVLYLDADII 135 ++ + R + +LY+D DI+ Sbjct: 99 DWRDIKPFTFDFEILPLKFPSGNEVEWRNLFKPCAAQRLFLPSLL-THVDSLLYVDTDIL 157 Query: 136 CQGTIEPLINF--SFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQ 193 I + F F + +++ + E + + + NSG +L+N + Sbjct: 158 FLSPISDIWRFFKKFNETQMSALTPEHENENIGWYNRFARHPFYGRLGVNSGVMLMNLTR 217 Query: 194 WAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLA---DKLIFADIKYNTQFSLNYQL 249 + +++ E ++ +I DQD++N+L DKL +YN + + Sbjct: 218 MREMKWEQHIVSIHKEYKL--RIIWGDQDIINILFYYHPDKLYIMPCEYNYRPDHCMYM 274 >UniRef50_C1QEC8 LPS:glycosyltransferase n=1 Tax=Brachyspira murdochii DSM 12563 RepID=C1QEC8_9SPIR Length = 333 Score = 98.0 bits (243), Expect = 4e-19, Method: Composition-based stats. Identities = 48/288 (16%), Positives = 96/288 (33%), Gaps = 28/288 (9%) Query: 25 ENLCLDIAYGTDKNFLFGCGISIASILKYNE-GSRLCFHIFTDYFGDDDRKYFDALALQY 83 N ++I +D F+ I SILK + CFHI D ++++ L Sbjct: 1 MNKVMNICLISDDKFVEYIATLIVSILKNSSENDNFCFHIIEDGIREENKNKLLMLKEIK 60 Query: 84 KTRIKIYLINGDRLRSLPSTKN---------WTHAIYFRFVIADYFINKAPKVLYLDADI 134 IK Y N + + + W ++++ + I + VL++DAD Sbjct: 61 DFEIKFYKPNYNNIEKYKKWQEIFKKNSYPVWHYSVFIKLDIP-IILKDLDNVLFIDADS 119 Query: 135 ICQGTIEPLINFSFPDDKVAMVVTEGQA-----DWWEKRAHSLGVAGIAKGYFNSGFLLI 189 I G I+ + + + ++ K +G Y +S L+ Sbjct: 120 IVLGDIDFIYDVDISNYFFVSQFFYYKSLKNLYPNLYKYILDIGYKNPEYNYVSSQVLIF 179 Query: 190 NTAQWAAQQVSARAIAMLNEPEIIKKI--THPDQDVLNMLLADKLIFADIKYNTQFSLNY 247 N + + I K I ++ + + D + F D+K + + Sbjct: 180 NIKKIKEIFKKEEYYYNKIDECIDKYINSIFTEEHIFLYVFRDSIAFLDLKVDCNENTEK 239 Query: 248 QLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPW 295 + + + G KP +++ F E + +P+ Sbjct: 240 IIISDYFASSGKPLKY----GFDKPINEY------YYKFWEYFSLTPF 277 >UniRef50_Q2R1U9 Glycosyl transferase family 8 protein, expressed n=3 Tax=Poaceae RepID=Q2R1U9_ORYSJ Length = 548 Score = 98.0 bits (243), Expect = 4e-19, Method: Composition-based stats. Identities = 38/268 (14%), Positives = 82/268 (30%), Gaps = 25/268 (9%) Query: 44 GISIASILKYNEG-SRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIYLING-----DRL 97 + + S + ++ R+ FHI TD + I+I ++ Sbjct: 273 AVVVNSTISASKDPKRIMFHIVTDALNFPAMMMWFLTNPPNPATIQIKSLDNLKWLPADF 332 Query: 98 RSLPSTKNWTHAIY------FRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDD 151 K Y RF + + F K++ LD DI+ Q + L Sbjct: 333 SFRFKQKGIRDPRYTSALNHLRFYLPEVFP-SLNKLVLLDHDIVVQRDLSGLWQIDLNGK 391 Query: 152 KVAMVVTEGQADWWEKRAHSLGVAGIA-KGYFNS-------GFLLINTAQWAAQQVSARA 203 V T D + + + + + + F++ G + + +W Q ++ Sbjct: 392 VNGAVETCTSGDGYHRLENLVNFSDPSIINKFDAKACIHAFGMNIFDLKEWRRQGLTTAY 451 Query: 204 IAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIF 263 + + + ++ ++ + D +++ L S Sbjct: 452 NKWFQAGKRRRLWKAGSLPLGQIVFYNQTVPLDHRWHV---LGLGHDRSIGRDAIERAAV 508 Query: 264 IHYIGPTKPWHDWAWDYPVSQAFMEAKN 291 IHY G KPW + + + + Sbjct: 509 IHYSGKLKPWLEISIP-KYRDYWNNFLD 535 >UniRef50_B4WFJ6 Glycosyl transferase family 8 n=1 Tax=Synechococcus sp. PCC 7335 RepID=B4WFJ6_9SYNE Length = 298 Score = 98.0 bits (243), Expect = 4e-19, Method: Composition-based stats. Identities = 51/286 (17%), Positives = 99/286 (34%), Gaps = 28/286 (9%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSR-LCFHIFT-----DYFGDDDRKYFDALALQYK 84 I + ++ +++ SI+ + F++ +F R+ +LA Q++ Sbjct: 10 IVFSLNRKIWLSLIVAMNSIVSNASNPDTIRFNVLVPPGEEQFFEKKIREALPSLAAQWR 69 Query: 85 TRIKI------YLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQG 138 + + ++ +N + Y RF D F +V+YLD D+I G Sbjct: 70 VKSYLPPAFMQEYLDKRFKEKTEDRRNSRYIQYSRFFFRDAF-EDLERVIYLDTDLIVLG 128 Query: 139 TIEPLINF--SFPDDKVAMVVTEGQADWWEKRAHSLGVAGIA--KGYFNSGFLLINTAQW 194 I L + + + + + I K FN+G N + W Sbjct: 129 DIAELYAYTKALDEHCYFGSIPHFYPCIFYFSNFMKMREEIPKFKQTFNAGVWFTNLSFW 188 Query: 195 AAQQVSARAIAM--LNEPEIIKKITHPDQDVLNMLLADKLIFADIKYN-TQFSLNYQLKE 251 + R L+ K T D+ V N++ D + AD +N + + + Sbjct: 189 NE-KTYERLNYYLSLDAKSNYKLYTLGDEPVFNLMFKD-YLQADKNWNRCGYGTHPAVTN 246 Query: 252 SFI---NPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASP 294 F+ ++ IH+ GP KPW + + +P Sbjct: 247 LFLASGEKFLSEAKLIHWSGPFKPWSSPKIR--FADLWRTYL-PTP 289 >UniRef50_Q9FX71 T6J4.1 protein n=2 Tax=rosids RepID=Q9FX71_ARATH Length = 363 Score = 97.6 bits (242), Expect = 5e-19, Method: Composition-based stats. Identities = 42/255 (16%), Positives = 90/255 (35%), Gaps = 16/255 (6%) Query: 23 ETENLCLDIAYGTDKNFLFGCGISIASILKYNE-GSRLCFHIFTDYFGDDDRKYFDALAL 81 E + IA D +L G + S+L++ + FH + D + + Sbjct: 54 EHNPSIIHIAMTLDAIYLRGSVAGVFSVLQHASCPENIVFHFIATHRRSADLRRIISSTF 113 Query: 82 QYKTRIKIYLINGDRLRSLPSTKNWTHAI-----YFRFVIADYFINKAPKVLYLDADIIC 136 Y T IY + + +RS S+ A+ Y R +AD +V+Y D+D++ Sbjct: 114 PYLTY-HIYHFDPNLVRSKISSS-IRRALDQPLNYARIYLADLLPIAVRRVIYFDSDLVV 171 Query: 137 QGTIEPLINFSFPDDKVAM-------VVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLI 189 + L V + +W + + + YFN+G ++I Sbjct: 172 VDDVAKLWRIDLRRHVVGAPEYCHANFTNYFTSRFWSSQGYKSALKDRKPCYFNTGVMVI 231 Query: 190 NTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQL 249 + +W ++V+ + + + + ++ A + + ++N Q L Sbjct: 232 DLGKWRERRVTVKLETWMRIQKRHRIYELGSLPPFLLVFAGDVEPVEHRWN-QHGLGGDN 290 Query: 250 KESFINPVTNDTIFI 264 E + + + Sbjct: 291 LEGLCRNLHPGPLIV 305 >UniRef50_B7PNZ0 Glycosyltransferase domain-containing protein, putative (Fragment) n=1 Tax=Ixodes scapularis RepID=B7PNZ0_IXOSC Length = 269 Score = 97.2 bits (241), Expect = 7e-19, Method: Composition-based stats. Identities = 29/134 (21%), Positives = 60/134 (44%), Gaps = 8/134 (5%) Query: 113 RFVIADYFINKAPKVLYLDADIICQGTIEPLINF--SFPDDKVAMVVTEGQADWWEKRAH 170 R + + VLY+D+DI+ +E L + + D ++A + + + H Sbjct: 95 RLFLPSMLSEE-DAVLYVDSDIVFFRPVEELWSIFDNMDDMQLAGLAPDVEDYNGSVYMH 153 Query: 171 SLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLAD 230 + + NSG +L+N + A + + ++N+ + K+ PDQD+LNM+ D Sbjct: 154 NWKTRYYGRYGLNSGVILMNLTRMRAYGLESIVTNLMNKYHSVMKL--PDQDLLNMVFHD 211 Query: 231 ---KLIFADIKYNT 241 +L +++ Sbjct: 212 DPGRLYELPCRWDV 225 >UniRef50_B6JNQ8 Lipopolysaccharide 1,2-glucosyltransferase n=18 Tax=Helicobacter RepID=B6JNQ8_HELP2 Length = 369 Score = 96.9 bits (240), Expect = 8e-19, Method: Composition-based stats. Identities = 53/331 (16%), Positives = 106/331 (32%), Gaps = 64/331 (19%) Query: 25 ENLCLDIAYGTDKNFLFGCGISIASILKYNE--------------------GSRLCFHIF 64 +N + I DKN+ G G+S+ S+L + H Sbjct: 7 QNQIIPIFMSFDKNYALGAGVSLYSLLSHASRHTSAIDFSPLSQNNQLLGTNIVYKIHCL 66 Query: 65 TDYFGDDDRKYFDALALQYKT--RIKIYLING-----DRLRSLPSTKNWTHAIYFRFVIA 117 + + +KT ++ IN + + +K + + ++ Sbjct: 67 IKGVTLEQQNKLLKTLDPFKTFASLEFIDINSLDHSIESYLNESCSKRYGGLLVLCRLLL 126 Query: 118 DYFINKAPKVLYLDADIICQGTI-EPLINFSFP-DDKVAMVVTEGQADWWE--------- 166 K++ +D D + G + + MV +E Sbjct: 127 ASLFPNYSKIISIDVDTVFLGDVASAYFALDNEPTKLLGMVRDTFSHLSFEAFCHFIERA 186 Query: 167 --------KRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITH 218 R + I +G FN GFL+ + +W A+ L K + + Sbjct: 187 CKNFKIDFSRFSPNELKRIHQG-FNMGFLVAHLDRWRQDGFEKIALEFLKTR--GKDLFY 243 Query: 219 PDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIGPTKPWHDWA- 277 P+Q ++NM+ ++++ I YN + F + I +H+I KPW + Sbjct: 244 PEQCLVNMVFWERILELPIYYNC-------YSDFFKEHYPKNIIMLHFI-KYKPWRSVSS 295 Query: 278 ------WDYPVSQAFMEAKNASPWKNTALLK 302 + ++ +P+KN L + Sbjct: 296 LNGRLICYEAEASFWLANLFCTPFKNDFLKE 326 >UniRef50_Q9M9Y5 F4H5.13 protein n=4 Tax=rosids RepID=Q9M9Y5_ARATH Length = 589 Score = 96.5 bits (239), Expect = 1e-18, Method: Composition-based stats. Identities = 52/298 (17%), Positives = 103/298 (34%), Gaps = 40/298 (13%) Query: 36 DKNFLFGC----GISIASILKYN------EGSRLCFHIFTDYFGDDDRKYFDALALQYKT 85 D NF + +S++ + E R+ FH+ TD + L +Q K Sbjct: 296 DANFNHYVVFSDNVLASSVVVNSTISSSKEPERIVFHVVTDSLNYPAISMWFLLNIQSKA 355 Query: 86 RIKIYLINGDRL-----------RSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADI 134 I+I I+ + ++ + + + RF + D F K++ LD D+ Sbjct: 356 TIQILNIDDMDVLPRDYDQLLMKQNSNDPRFISTLNHARFYLPDIFPG-LNKMVLLDHDV 414 Query: 135 ICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIA--KGYFNS-------G 185 + Q + L + V V T + + + + G F+ G Sbjct: 415 VVQRDLSRLWSIDMKGKVVGAVETCLEGESSFRSMSTFINFSDTWVAGKFSPRACTWAFG 474 Query: 186 FLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSL 245 LI+ +W +++++ I N + + + + D +++ Sbjct: 475 MNLIDLEEWRIRKLTSTYIKYFNLGTKRPLWKAGSLPIGWLTFYRQTLALDKRWHVM--- 531 Query: 246 NYQLKESFINPV-TNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLK 302 +ES + V IHY G KPW D + + + P+ +T L + Sbjct: 532 -GLGRESGVKAVDIEQAAVIHYDGVMKPWLDIGKEN--YKRYWNI--HVPYHHTYLQQ 584 >UniRef50_B2VRF2 Glycogenin-2 n=1 Tax=Pyrenophora tritici-repentis Pt-1C-BFP RepID=B2VRF2_PYRTR Length = 622 Score = 96.5 bits (239), Expect = 1e-18, Method: Composition-based stats. Identities = 45/244 (18%), Positives = 72/244 (29%), Gaps = 45/244 (18%) Query: 37 KNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIYLINGDR 96 ++L G + S+ +L + D D L Y I + I Sbjct: 16 DSYLPGAVVLANSLRDAGTKKKLAVLVTMDTLSADT---IGELKTLYDYLIPVQRIRSSN 72 Query: 97 LRSLPSTKN------WTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPD 150 +L +T +R + K++YLDAD++ ++ L + Sbjct: 73 TANLYLMGRPDLAFAFTKIALWR-------QTQFRKLVYLDADVVALRALDELFDIE--- 122 Query: 151 DKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEP 210 A G D FNSG ++I + A+ Sbjct: 123 ASFAAAPDIGWPDA-----------------FNSGVMVI------KPDMGEYW-ALQTMA 158 Query: 211 EIIKKITHPDQDVLNMLLADK-LIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIGP 269 DQ +LN + YN + YQ E D +H+IG Sbjct: 159 AAGDSFDGADQGLLNQYFEHRPWQRLKFTYNCTPNAEYQ-WEPAYRHYKRDIAAVHFIGK 217 Query: 270 TKPW 273 KPW Sbjct: 218 NKPW 221 >UniRef50_A7S9E5 Predicted protein n=1 Tax=Nematostella vectensis RepID=A7S9E5_NEMVE Length = 389 Score = 96.5 bits (239), Expect = 1e-18, Method: Composition-based stats. Identities = 43/247 (17%), Positives = 82/247 (33%), Gaps = 18/247 (7%) Query: 29 LDIAYGTDKNFLFGCGISIAS-ILKYNEGSRLCFHIFTD-YFGDDDRKYFDA--LALQYK 84 + I + + S IL N L FHI + D + D + Sbjct: 85 ISIVICGSRK--DEALTMLKSSILFTNRT--LVFHILAESGLHDGLKSVLDHWPCVEGKQ 140 Query: 85 TRIKIYLINGDRLRSLPSTKN-WTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPL 143 KI+ + + K + R + D + ++Y+D D + ++ L Sbjct: 141 VSYKIHPLKFPEGQKPDEWKKLFKPCAAQRLFLPDIL-TEVDSLIYMDIDTLFLSPVQWL 199 Query: 144 INF--SFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSA 201 + +F ++A + EG+ K NSG +L+N + + Sbjct: 200 WDQFSNFNSSQLASMTPEGEVSATGWYNRFARHPYYGKLGLNSGVMLMNLTRMRSFGWQE 259 Query: 202 RAIAMLNEPEIIKKITHPDQDVLNMLLADK---LIFADIKYNTQFSLNYQLKESFINPVT 258 + + + N+ IT DQD+LN+L + ++N + T Sbjct: 260 KILPIYNKYRF--DITWGDQDILNILFHYHPELVYVLSCEWNYRNDHCIYGNNC-KTADT 316 Query: 259 NDTIFIH 265 N +H Sbjct: 317 NGIYILH 323 >UniRef50_Q2L3C5 Glycosyl transferase-like protein n=3 Tax=Magnoliophyta RepID=Q2L3C5_BRASY Length = 689 Score = 96.1 bits (238), Expect = 2e-18, Method: Composition-based stats. Identities = 45/287 (15%), Positives = 84/287 (29%), Gaps = 49/287 (17%) Query: 43 CGISIASILKYNEGSRLCFHIFTDYFG-------------------DDDRKYFDALALQY 83 + + S L + + FHI TD + + F L Y Sbjct: 397 AAVVVNSTLVH--ATNHVFHIVTDRLNYAAMKMWFLANPLGKAAVQVQNIQEFTWLNSSY 454 Query: 84 ---------KTRIKIYLIN----GDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYL 130 ++ I Y + D + K + + RF + + F K KVL+L Sbjct: 455 SPVLKQLGSRSTIDYYFRSGTARPDENPKFRNPKYLSILNHLRFYLPEIFP-KLNKVLFL 513 Query: 131 DADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGY--------F 182 D D + Q + L + V T G+ + + +A + F Sbjct: 514 DDDTVVQQDLSALWSIDLKGKVNGAVETCGETFHRFDKYLNFSNPIVANNFHPQACGWAF 573 Query: 183 NSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQ 242 G + + ++W Q ++ E + ++ D ++ Sbjct: 574 --GMNMFDLSEWRKQNITDVYHTWQKLNEDRLLWKLGTLPAGLVTFWNRTFPLDRSWHLL 631 Query: 243 FSLNYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEA 289 L Y + + IHY G KPW + + + Sbjct: 632 -GLGYNPNVNERD--IRRASVIHYNGNLKPWLEIGLS-KYRKYWSRY 674 >UniRef50_Q02ZT6 Lipopolysaccharide biosynthesis glycosyltransferase n=1 Tax=Lactococcus lactis subsp. cremoris SK11 RepID=Q02ZT6_LACLS Length = 281 Score = 95.7 bits (237), Expect = 2e-18, Method: Composition-based stats. Identities = 30/144 (20%), Positives = 62/144 (43%), Gaps = 21/144 (14%) Query: 179 KGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIK 238 + YF +G L++N + + I ++ + I + DQD+LN+ +K+ + Sbjct: 10 EDYFQAGVLVLNLQAIRKDFTTEKFINLVQKRNWI----YMDQDILNLCFKNKVFYLPES 65 Query: 239 YNT--------------QFSLNYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQ 284 +N Q L YQ+ +S+ N +HY G KPW+ D +++ Sbjct: 66 WNVITLMEKNSVRGQIIQERLPYQISDSY-NKSRKTPNIVHYAGSYKPWYYKESD--MAE 122 Query: 285 AFMEAKNASPWKNTALLKPNNSNQ 308 F + + + LL+ ++ ++ Sbjct: 123 IFWQYAQNTSYYTELLLEASSPSK 146 >UniRef50_A5DLS6 Putative uncharacterized protein n=2 Tax=Pichia guilliermondii RepID=A5DLS6_PICGU Length = 390 Score = 95.7 bits (237), Expect = 2e-18, Method: Composition-based stats. Identities = 50/270 (18%), Positives = 89/270 (32%), Gaps = 46/270 (17%) Query: 35 TDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIYLING 94 T++++L G ++ + + D + +A Y I I Sbjct: 9 TNESYLPGALTLAHTLRSLGTQYPVVVLLDETQVSDRSLQLLEA---AYDRIIPI----S 61 Query: 95 DRLRSLPSTKNWTH----AIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLIN--FSF 148 DRL + P + + ++ + ++LYLD D++ ++ L + + Sbjct: 62 DRLVTSPVDDRLGRPELAVTFSKLLLWN---ESYDQILYLDTDVLPLANVDHLFDEGAAL 118 Query: 149 PDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLN 208 ++A G D FNSG LL QV + + + Sbjct: 119 TPRQIAASPDSGWPDI-----------------FNSGVLLFKPDP----QVYSDLVEFAS 157 Query: 209 EPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIG 268 + DQ +LN A YN + +YQ +F D +HYIG Sbjct: 158 GSD--SSFDGADQGLLNEFFAGNWHRLPFLYNVTPTESYQYVPAFHR-FFKDIKILHYIG 214 Query: 269 PTKPWH------DWAWDYPVSQAFMEAKNA 292 KPWH + + + F E + Sbjct: 215 QIKPWHSSTNIDHFRFHHLWWDRFSEFFDK 244 >UniRef50_B6JMU4 Lipopolysaccharide biosynthesis protein n=20 Tax=Helicobacter RepID=B6JMU4_HELP2 Length = 398 Score = 94.5 bits (234), Expect = 4e-18, Method: Composition-based stats. Identities = 55/312 (17%), Positives = 106/312 (33%), Gaps = 57/312 (18%) Query: 19 DHKVETENLCLDIAYGTDKNFLFGCGISIASILKY----NEGSRLCFHIFTDYFGDDDRK 74 H + ++ + IA+ DKN+L G + S+L+ N+ R H ++D+ Sbjct: 6 SHSFKEQDFHIPIAFAFDKNYLIPAGACLYSLLESIAKANKKIRYTLHALVVGLNEEDKA 65 Query: 75 YFDALALQYKTRIKIYLINGDRLRSLPS-------TKNWTHAIYFRFVIADYFINKAPKV 127 + + +K + + + + TK ++ + ++ +AD F K K+ Sbjct: 66 KLNQITEPFKEFAVLEVKDIEPFLDTIPNPFDEDFTKRFSKMVLVKYFLADLFP-KYSKM 124 Query: 128 LYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFL 187 ++ D D+I + F + + GV + K + GFL Sbjct: 125 VWSDVDVIFCNE----FSADFLN------------IKENDENYFYGVLEVEKHHMMEGFL 168 Query: 188 LINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNY 247 N + + R +L E ++ + I+Y F Y Sbjct: 169 FCNLDYQRKKNFTLRMHDLLKGNEAKGELDFT------KWCWPNMKALGIEY-CVFPYYY 221 Query: 248 QLK-----------ESFINPVTNDTIFIHY---IGPTKPWHDWAWDYPV---SQAFMEAK 290 +K + I + IHY G KPW DYP + ++ A Sbjct: 222 TIKDFSNAYLNENYKKTILEARENPTIIHYDAWWGAVKPW-----DYPFGLKADLWLNAL 276 Query: 291 NASPWKNTALLK 302 +P+ + K Sbjct: 277 AKTPFMSDYTKK 288 >UniRef50_Q871S1 Glycogenin n=3 Tax=Sordariaceae RepID=Q871S1_NEUCR Length = 686 Score = 94.5 bits (234), Expect = 4e-18, Method: Composition-based stats. Identities = 46/255 (18%), Positives = 83/255 (32%), Gaps = 55/255 (21%) Query: 32 AYGT---DKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDAL--------A 80 Y + + +L G + S+ +L I + ++ + + Sbjct: 9 VYASLLLNDAYLPGALVLAHSLRDSGTHKKLAILITPENISNEVVEQLQTVYDYVIPVET 68 Query: 81 LQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTI 140 +Q ++L+N L S + N FR K++Y+DAD++ Sbjct: 69 IQNDRPANLFLMNRPDLHSAFTKINLWKQTQFR------------KIVYIDADVVAYRAP 116 Query: 141 EPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLI--NTAQWAAQQ 198 + L + + G D FN+G +++ N + Sbjct: 117 DELFDLP---HAFSAAPDIGWPDL-----------------FNTGVMVLSPNMGDY---- 152 Query: 199 VSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVT 258 AML E DQ +LNM + YN S +YQ ++ Sbjct: 153 -----YAMLAMAERGISFDGADQGLLNMHFRNTYNRLSFTYNVTPSAHYQYIPAY-KHFQ 206 Query: 259 NDTIFIHYIGPTKPW 273 + +H+IG KPW Sbjct: 207 SSINLLHFIGSEKPW 221 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P27128 Lipopolysaccharide 1,3-galactosyltransferase n=4... 306 9e-82 UniRef50_D2TIX6 UDP-galactose:(Glucosyl) LPS alpha-1,3-galactosy... 299 1e-79 UniRef50_Q9R9D1 UDP-glucose:(Glucosyl) LPS alpha1,3-glucosyltran... 284 3e-75 UniRef50_Q9ZIT4 UDP-galactose:(Glucosyl) LPS alpha1,3-galactosyl... 259 1e-67 UniRef50_D0KD53 Lipopolysaccharide 3-alpha-galactosyltransferase... 256 9e-67 UniRef50_B6XJW2 Putative uncharacterized protein n=1 Tax=Provide... 255 2e-66 UniRef50_D2TY85 UDP-D-galactose:(Glucosyl)lipopolysaccharide-alp... 245 1e-63 UniRef50_D2U322 Lipopolysaccharide 1,2-glucosyltransferase n=1 T... 245 2e-63 UniRef50_UPI00019F16C6 hypothetical protein CATC2_20202 n=1 Tax=... 238 2e-61 UniRef50_UPI000197C525 UDP-D-galactose:(glucosyl) lipopolysaccha... 236 9e-61 UniRef50_B6XJW1 Putative uncharacterized protein n=1 Tax=Provide... 233 6e-60 UniRef50_C1DGU7 Lipopolysaccharide 1,3-galactosyltransferase n=1... 232 1e-59 UniRef50_Q9ZIS1 UDP-galactose:(Galactosyl) LPS alpha1,2-galactos... 232 2e-59 UniRef50_Q46Y64 Glycosyl transferase, family 8 n=1 Tax=Ralstonia... 230 5e-59 UniRef50_Q9ZIS6 UDP-galactose:(Glucosyl) LPS alpha1,2-galactosyl... 229 8e-59 UniRef50_B2PV91 Putative uncharacterized protein n=2 Tax=Enterob... 228 2e-58 UniRef50_A8ARL6 Putative uncharacterized protein n=3 Tax=Citroba... 227 4e-58 UniRef50_Q9ZIT6 UDP-glucose:(Galactosyl) LPS alpha1,2-glucosyltr... 226 8e-58 UniRef50_P27129 Lipopolysaccharide 1,2-glucosyltransferase n=44 ... 224 5e-57 UniRef50_UPI0001B4A0A4 putative glucosyltransferase n=1 Tax=Bact... 221 2e-56 UniRef50_D1PTN4 Putative uncharacterized protein n=1 Tax=Prevote... 221 3e-56 UniRef50_B6GCA0 Putative uncharacterized protein n=1 Tax=Collins... 217 6e-55 UniRef50_D0KD54 Lipopolysaccharide glucosyltransferase I n=2 Tax... 216 1e-54 UniRef50_C3X7M2 Lipopolysaccharide 3-alpha-galactosyltransferase... 216 1e-54 UniRef50_P25148 General stress protein A n=3 Tax=Bacillus subtil... 213 8e-54 UniRef50_C7IBC1 Glycosyl transferase family 8 n=2 Tax=Clostridiu... 212 1e-53 UniRef50_D2RIJ4 Glycosyl transferase family 8 n=2 Tax=Acidaminoc... 212 1e-53 UniRef50_C7QL87 Glycosyl transferase family 8 n=2 Tax=Cyanothece... 211 3e-53 UniRef50_D2ELM0 Lipopolysaccharide biosynthesis glycosyltransfer... 211 3e-53 UniRef50_B7GNT4 Glycosyl transferase, family 8 n=5 Tax=Bifidobac... 210 5e-53 UniRef50_A7A7B4 Putative uncharacterized protein n=2 Tax=Bifidob... 210 7e-53 UniRef50_Q5LF36 Putative glucosyltransferase n=1 Tax=Bacteroides... 209 9e-53 UniRef50_C0WCJ1 Lipopolysaccharide 1,2-glucosyltransferase n=1 T... 207 4e-52 UniRef50_Q38VK7 Putative bifunctional glycosyl transferase, fami... 206 1e-51 UniRef50_A1BHG0 Glycosyl transferase, family 8 n=2 Tax=Chlorobiu... 205 2e-51 UniRef50_D1P7H1 Lipopolysaccharide 1,2-glucosyltransferase n=1 T... 204 4e-51 UniRef50_Q03HK5 Lipopolysaccharide biosynthesis glycosyltransfer... 204 5e-51 UniRef50_A6LGX5 Glycosyltransferase family 8 n=1 Tax=Parabactero... 202 2e-50 UniRef50_C2HBB9 Family 8 glycosyltransferase n=35 Tax=Lactobacil... 202 2e-50 UniRef50_Q64ZV2 Lipopolysaccharide 1,2-glucosyltransferase n=8 T... 200 5e-50 UniRef50_C2HBB8 Family 8 glycosyltransferase n=13 Tax=Lactobacil... 198 2e-49 UniRef50_A8ARL4 Putative uncharacterized protein n=2 Tax=Citroba... 197 5e-49 UniRef50_A7VNX5 Putative uncharacterized protein n=1 Tax=Clostri... 197 6e-49 UniRef50_C5ZV11 Glycosyl transferase n=2 Tax=Helicobacter canade... 196 7e-49 UniRef50_B7AIV0 Putative uncharacterized protein n=1 Tax=Bactero... 196 7e-49 UniRef50_C0QZN2 Glycosyl transferase, family 8 n=3 Tax=Brachyspi... 195 2e-48 UniRef50_C5ELK9 Glycosyl transferase n=3 Tax=Clostridiales RepID... 194 4e-48 UniRef50_B3Q568 Putative glycosyltransferase protein n=4 Tax=Rhi... 193 6e-48 UniRef50_UPI0001A457E5 hypothetical protein NsubN_08151 n=1 Tax=... 193 7e-48 UniRef50_Q2K5X3 Galactosyltransferase protein n=13 Tax=Rhizobium... 193 8e-48 UniRef50_C3PWZ8 Lipopolysaccharide 1,2-glucosyltransferase n=1 T... 192 1e-47 UniRef50_D2RIJ7 Glycosyl transferase family 8 n=1 Tax=Acidaminoc... 191 2e-47 UniRef50_C0Z4I4 Putative uncharacterized protein gspA n=1 Tax=Br... 191 2e-47 UniRef50_D2LIH7 Glycosyl transferase family 8 n=1 Tax=Rhodomicro... 191 2e-47 UniRef50_B7KM20 Glycosyl transferase family 8 n=1 Tax=Cyanothece... 190 6e-47 UniRef50_B8ISQ5 Glycosyl transferase family 8 n=1 Tax=Methylobac... 188 2e-46 UniRef50_B1MX28 Lipopolysaccharide biosynthesis protein, LPS:gly... 187 3e-46 UniRef50_B2UPJ4 Glycosyl transferase family 8 n=1 Tax=Akkermansi... 187 4e-46 UniRef50_C8W7U9 Glycosyl transferase family 8 n=2 Tax=Atopobium ... 187 4e-46 UniRef50_C4VEI8 General stress protein A n=24 Tax=Enterococcus R... 187 6e-46 UniRef50_B4RJG6 LgtC n=29 Tax=Neisseria RepID=B4RJG6_NEIG2 186 9e-46 UniRef50_A4ECW2 Putative uncharacterized protein n=1 Tax=Collins... 185 1e-45 UniRef50_B9KVD4 Glycosyl transferase, family 8 n=2 Tax=Rhodobact... 185 2e-45 UniRef50_C4Z1V1 Putative uncharacterized protein n=1 Tax=Eubacte... 185 3e-45 UniRef50_Q116W1 Glycosyl transferase, family 8 n=1 Tax=Trichodes... 184 4e-45 UniRef50_UPI000196958D hypothetical protein BACCELL_01586 n=1 Ta... 184 6e-45 UniRef50_A5ZFA9 Putative uncharacterized protein n=1 Tax=Bactero... 183 7e-45 UniRef50_B1Y723 Glycosyl transferase family 8 n=1 Tax=Leptothrix... 183 9e-45 UniRef50_C6I3U6 Putative uncharacterized protein n=2 Tax=Bactero... 182 1e-44 UniRef50_B3CA80 Putative uncharacterized protein n=1 Tax=Bactero... 182 2e-44 UniRef50_B2ISC6 Glycosyl transferase, family 2/glycosyl transfer... 182 2e-44 UniRef50_B3Z5I6 Glycosyltransferase family 8 n=2 Tax=Bacillus ce... 182 2e-44 UniRef50_C1QBZ8 LPS:glycosyltransferase n=1 Tax=Brachyspira murd... 182 2e-44 UniRef50_C9PNX4 Family 8 glycosyl transferase n=1 Tax=Pasteurell... 181 3e-44 UniRef50_A1XRC1 Glycosyltransferase n=1 Tax=Haemophilus ducreyi ... 181 4e-44 UniRef50_D1Y7U2 Glycosyltransferase, family 8 n=1 Tax=Pyramidoba... 180 6e-44 UniRef50_D2QX95 Glycosyl transferase family 8 n=1 Tax=Pirellula ... 179 9e-44 UniRef50_C7IBC8 Glycosyl transferase family 8 n=1 Tax=Clostridiu... 179 1e-43 UniRef50_B2UQ54 Glycosyl transferase family 8 n=1 Tax=Akkermansi... 179 1e-43 UniRef50_P43974 Putative glycosyltransferase HI0258 n=32 Tax=Hae... 178 2e-43 UniRef50_Q3D427 Glycosyl transferase, family 8 n=8 Tax=Streptoco... 178 2e-43 UniRef50_C8WAA9 Glycosyl transferase family 8 n=2 Tax=Atopobium ... 178 2e-43 UniRef50_C2KV37 Lipopolysaccharide biosynthesis protein, LPS:gly... 178 2e-43 UniRef50_B3XL28 Glycosyl transferase family 8 n=1 Tax=Lactobacil... 178 3e-43 UniRef50_B6G807 Putative uncharacterized protein n=2 Tax=Collins... 177 4e-43 UniRef50_Q9L6B2 Putative glycosyl transferase n=3 Tax=Pasteurell... 177 5e-43 UniRef50_B3WD33 WbbM protein n=8 Tax=Lactobacillus RepID=B3WD33_... 177 6e-43 UniRef50_C0BAU3 Putative uncharacterized protein n=1 Tax=Coproco... 177 6e-43 UniRef50_B7C7N8 Putative uncharacterized protein n=2 Tax=Firmicu... 177 6e-43 UniRef50_Q65VF6 RfaJ protein n=1 Tax=Mannheimia succiniciproduce... 177 7e-43 UniRef50_C1QC00 LPS:glycosyltransferase n=1 Tax=Brachyspira murd... 176 8e-43 UniRef50_C9RWX3 Glycosyl transferase family 8 n=8 Tax=Bacillacea... 176 8e-43 UniRef50_D0D9G3 Putative general stress protein A n=1 Tax=Citrei... 176 9e-43 UniRef50_C1QC03 LPS:glycosyltransferase n=1 Tax=Brachyspira murd... 175 2e-42 UniRef50_A3VFX3 Glycosyltransferase n=1 Tax=Rhodobacterales bact... 175 2e-42 UniRef50_A3V3C9 Putative uncharacterized protein n=1 Tax=Loktane... 175 2e-42 UniRef50_A4UX76 Putative LPS biosynthesis protein n=2 Tax=Lactob... 175 2e-42 UniRef50_C5FDY7 Glycogenin n=1 Tax=Microsporum canis CBS 113480 ... 175 2e-42 UniRef50_Q3D426 Glycosyl transferase, family 8 n=7 Tax=Streptoco... 175 3e-42 UniRef50_C5S494 Putative glycosyl transferase n=2 Tax=Actinobaci... 175 3e-42 UniRef50_D2QX94 Glycosyl transferase family 8 n=1 Tax=Pirellula ... 174 3e-42 UniRef50_A4VVV8 Lipopolysaccharide biosynthesis proteins, LPS:gl... 174 3e-42 UniRef50_D1JY84 Putative uncharacterized protein n=1 Tax=Bactero... 174 4e-42 UniRef50_A0KQP2 Glycosyl transferase, family 8 n=2 Tax=Aeromonas... 174 4e-42 UniRef50_B9BAZ6 Glycosyl transferase family 8 protein n=1 Tax=Bu... 173 7e-42 UniRef50_B6VUC8 Putative uncharacterized protein n=1 Tax=Bactero... 172 2e-41 UniRef50_B8G232 Glycosyl transferase family 8 n=2 Tax=Desulfitob... 172 2e-41 UniRef50_Q4HGS8 General stress protein A, putative n=2 Tax=Campy... 171 3e-41 UniRef50_C8N7M8 Putative uncharacterized protein n=1 Tax=Cardiob... 171 4e-41 UniRef50_Q5M3K9 Glycosyl transferase n=4 Tax=Streptococcus RepID... 170 4e-41 UniRef50_UPI0001A45357 glycosyl transferase family protein n=1 T... 170 5e-41 UniRef50_C2LRU0 Glycosyl transferase, family 8 n=1 Tax=Streptoco... 170 6e-41 UniRef50_B2ISC2 Glycosyl transferase, family 8 n=2 Tax=Streptoco... 170 7e-41 UniRef50_A8FNA2 Putative uncharacterized protein n=2 Tax=Campylo... 169 2e-40 UniRef50_A5LNA9 Glycosyl transferase, family 8 n=2 Tax=Streptoco... 169 2e-40 UniRef50_B6HCQ7 Pc18g02120 protein n=2 Tax=mitosporic Trichocoma... 168 3e-40 UniRef50_UPI0001AEC697 glycosyl transferase family protein n=1 T... 168 3e-40 UniRef50_C1IBL0 Glycosyl transferase n=1 Tax=Clostridium sp. 7_2... 168 3e-40 UniRef50_C7RG54 Glycosyl transferase family 8 n=1 Tax=Anaerococc... 167 3e-40 UniRef50_A3CM53 Glycosyltransferase, putative n=9 Tax=Bacteria R... 166 9e-40 UniRef50_A5EVI8 Glycosyl transferase family 8 protein n=1 Tax=Di... 166 1e-39 UniRef50_B1I7N1 Glycosyl transferase, family 8 n=8 Tax=Streptoco... 165 1e-39 UniRef50_C5ZVZ7 Putative glycosyltransferase n=3 Tax=Campylobact... 165 2e-39 UniRef50_B2UQN6 Glycosyl transferase family 8 n=1 Tax=Akkermansi... 165 2e-39 UniRef50_A2RAV0 Catalytic activity: UDP-glucose + glycogenin <=>... 165 3e-39 UniRef50_B1I7M9 Glycosyl transferase, family 8 n=16 Tax=Streptoc... 164 3e-39 UniRef50_A8AY72 Glycosyl transferase, family 8 SP1766 n=7 Tax=St... 164 3e-39 UniRef50_C0AA16 Lipopolysaccharide biosynthesis protein LPS:glyc... 164 3e-39 UniRef50_C1CFZ1 Glycosyl transferase, family 8 n=17 Tax=Streptoc... 164 4e-39 UniRef50_Q3DNS6 Glycosyl transferase, family 8 n=5 Tax=Streptoco... 164 5e-39 UniRef50_B9QZ95 Glycosyl transferase family 8 n=1 Tax=Labrenzia ... 164 5e-39 UniRef50_O48684 F3I6.10 protein n=46 Tax=Embryophyta RepID=O4868... 164 5e-39 UniRef50_C4ZG45 Glycosyltransferase Family 2 modular protein n=1... 164 5e-39 UniRef50_B0NR59 Putative uncharacterized protein n=1 Tax=Bactero... 163 7e-39 UniRef50_Q871S1 Glycogenin n=3 Tax=Sordariaceae RepID=Q871S1_NEUCR 163 1e-38 UniRef50_Q01GT2 UDP-glucose:glycoprotein glucosyltransferase, pu... 163 1e-38 UniRef50_A5UC07 Aspartate-semialdehyde dehydrogenase n=7 Tax=Hae... 162 2e-38 UniRef50_Q4JZJ9 Putative glycosyl transferase n=8 Tax=Streptococ... 162 2e-38 UniRef50_C1MLJ1 Glycosyltransferase family 24 protein n=1 Tax=Mi... 162 2e-38 UniRef50_Q09332 UDP-glucose:glycoprotein glucosyltransferase n=1... 161 3e-38 UniRef50_UPI000175831B PREDICTED: similar to UDP-glucose glycopr... 161 3e-38 UniRef50_C3XGD2 Putative uncharacterized protein n=1 Tax=Helicob... 161 4e-38 UniRef50_C4JK72 Putative uncharacterized protein n=1 Tax=Uncinoc... 160 6e-38 UniRef50_Q5WI33 Lipopolysaccharide glycosyltransferase n=6 Tax=F... 160 6e-38 UniRef50_A8XPN2 Putative uncharacterized protein n=2 Tax=Caenorh... 160 7e-38 UniRef50_A4WWT5 Glycosyl transferase, family 8 n=1 Tax=Rhodobact... 160 7e-38 UniRef50_Q3DM64 Glycosyl transferase, family 8, degenerate n=6 T... 159 1e-37 UniRef50_Q046Z9 Lipopolysaccharide biosynthesis glycosyltransfer... 159 1e-37 UniRef50_UPI0001693121 general stress protein n=1 Tax=Paenibacil... 159 1e-37 UniRef50_C1QEC6 LPS:glycosyltransferase n=1 Tax=Brachyspira murd... 159 1e-37 UniRef50_B1LK07 Glycosyl transferase family 8 n=10 Tax=Enterobac... 159 1e-37 UniRef50_C3XKY2 Glycosyl transferase n=2 Tax=Campylobacterales R... 159 1e-37 UniRef50_UPI0000E47484 PREDICTED: similar to UDP-glucose ceramid... 159 1e-37 UniRef50_C3ZE29 Putative uncharacterized protein n=1 Tax=Branchi... 159 1e-37 UniRef50_Q5KMJ4 Putative uncharacterized protein n=1 Tax=Filobas... 158 2e-37 UniRef50_Q0U987 Putative uncharacterized protein n=1 Tax=Phaeosp... 158 3e-37 UniRef50_A8LP95 Lipopolysaccharide 1 n=1 Tax=Dinoroseobacter shi... 158 3e-37 UniRef50_C5EZG9 Glycosyl transferase family protein n=1 Tax=Heli... 158 3e-37 UniRef50_C5JPW4 Glycosyl transferase family 8 protein n=2 Tax=Aj... 158 3e-37 UniRef50_B9HMR5 Glycosyltransferase, CAZy family GT8 n=25 Tax=Ma... 157 5e-37 UniRef50_A5VK24 Glycosyl transferase, family 8 n=18 Tax=Lactobac... 157 5e-37 UniRef50_UPI000180C254 PREDICTED: similar to UDP-glucose ceramid... 157 6e-37 UniRef50_C5S3F7 Putative glycosyl transferase n=2 Tax=Actinobaci... 157 7e-37 UniRef50_A7EPR4 Putative uncharacterized protein n=1 Tax=Sclerot... 157 7e-37 UniRef50_A8PS15 UDP-glucose:Glycoprotein Glucosyltransferase con... 157 8e-37 UniRef50_A2QNN6 Contig An07c0170, complete genome n=10 Tax=Leoti... 156 8e-37 UniRef50_Q3DNA2 Glycosyl transferase, family 8 n=9 Tax=Streptoco... 156 8e-37 UniRef50_C6IB51 Glycosyltransferase n=4 Tax=Bacteroides RepID=C6... 156 1e-36 UniRef50_C7HS13 Family 8 glycosyl transferase n=2 Tax=Anaerococc... 156 1e-36 UniRef50_UPI000023DC59 hypothetical protein FG01882.1 n=1 Tax=Gi... 155 1e-36 UniRef50_D1IFB6 Whole genome shotgun sequence of line PN40024, s... 155 2e-36 UniRef50_Q9L7A2 Putative glycosyl transferase n=1 Tax=Haemophilu... 155 2e-36 UniRef50_C7Z1L1 Putative uncharacterized protein n=1 Tax=Nectria... 155 2e-36 UniRef50_B8PIH6 Predicted protein n=2 Tax=Agaricomycetes RepID=B... 155 3e-36 UniRef50_C0S309 Glycogenin n=5 Tax=Onygenales RepID=C0S309_PARBP 155 3e-36 UniRef50_Q1RIL1 Lipopolysaccharide 1,2-glucosyltransferase RfaJ ... 154 4e-36 UniRef50_B2B5U2 Predicted CDS Pa_2_5770 n=1 Tax=Podospora anseri... 154 4e-36 UniRef50_Q9NYU2 UDP-glucose:glycoprotein glucosyltransferase 1 n... 154 4e-36 UniRef50_P91854 Protein F26H9.8, partially confirmed by transcri... 154 4e-36 UniRef50_Q2HHC6 Putative uncharacterized protein n=1 Tax=Chaetom... 154 5e-36 UniRef50_B6Q6I5 Glycogenin n=3 Tax=Trichocomaceae RepID=B6Q6I5_P... 154 6e-36 UniRef50_Q50FU8 Cj81-079 (Fragment) n=6 Tax=Campylobacter jejuni... 154 7e-36 UniRef50_D0N7I0 UDP-glucose:glycoprotein glucosyltransferase, pu... 153 7e-36 UniRef50_D1HRJ7 Whole genome shotgun sequence of line PN40024, s... 153 7e-36 UniRef50_C5XV64 Putative uncharacterized protein Sb04g036540 n=1... 153 8e-36 UniRef50_C6LDU2 Glycosyl transferase, family 8 n=3 Tax=Firmicute... 153 9e-36 UniRef50_A1VG39 Glycosyl transferase, family 8 n=1 Tax=Desulfovi... 153 1e-35 UniRef50_D1Z8I8 Whole genome shotgun sequence assembly, scaffold... 153 1e-35 UniRef50_Q2GW94 Putative uncharacterized protein n=1 Tax=Chaetom... 152 1e-35 UniRef50_B2VRF2 Glycogenin-2 n=1 Tax=Pyrenophora tritici-repenti... 151 3e-35 UniRef50_UPI00016C0890 glycosyl transferase family 8 protein n=2... 151 3e-35 UniRef50_B6K765 UDP-glucose:glycoprotein glucosyltransferase n=1... 151 3e-35 UniRef50_A7H2M2 Glycosyl transferase family 8 n=3 Tax=Campylobac... 151 3e-35 UniRef50_A4R9Z3 Putative uncharacterized protein n=1 Tax=Magnapo... 151 3e-35 UniRef50_C6EQF4 Putative uncharacterized protein n=3 Tax=Campylo... 151 3e-35 UniRef50_C5P955 Glycosyl transferase family 8 protein n=2 Tax=Co... 151 4e-35 UniRef50_B7AUG6 Putative uncharacterized protein n=1 Tax=Bactero... 150 4e-35 UniRef50_C1FE59 Glycosyltransferase family 24 protein n=1 Tax=Mi... 150 5e-35 UniRef50_D1N145 Glycosyl transferase family 8 n=1 Tax=Victivalli... 150 5e-35 UniRef50_Q6ESI8 Putative UDP-glucose:glycoprotein glucosyltransf... 150 5e-35 UniRef50_C6X2V2 Putative glycosyltransferase n=1 Tax=Flavobacter... 150 5e-35 UniRef50_Q4PEF1 Putative uncharacterized protein n=1 Tax=Ustilag... 150 6e-35 UniRef50_UPI0001792D56 PREDICTED: similar to UDP-glucose glycopr... 150 7e-35 UniRef50_C4Q2X6 Udp-glucose glycoprotein:glucosyltransferase, pu... 150 7e-35 UniRef50_B2VVG3 UDP-glucose:glycoprotein glucosyltransferase n=9... 150 7e-35 UniRef50_Q8LF94 Avr9/Cf-9 rapidly elicited protein 231 n=13 Tax=... 150 8e-35 UniRef50_UPI000180BB9E PREDICTED: similar to Glycogenin 1 n=1 Ta... 150 9e-35 UniRef50_UPI0001757CC2 PREDICTED: similar to glycogenin n=1 Tax=... 149 1e-34 UniRef50_Q4E3K0 UDP-glucose:glycoprotein glucosyltransferase n=2... 149 1e-34 UniRef50_C7XX93 Glycosyl transferase, family 8 n=1 Tax=Lactobaci... 149 1e-34 UniRef50_Q6BJN0 DEHA2G01232p n=3 Tax=Saccharomycetaceae RepID=Q6... 149 1e-34 UniRef50_Q09140 UDP-glucose:glycoprotein glucosyltransferase n=1... 149 1e-34 UniRef50_Q8T191 Probable UDP-glucose:glycoprotein glucosyltransf... 149 1e-34 UniRef50_C6H742 UDP-glucose:glycoprotein glucosyltransferase n=1... 149 2e-34 UniRef50_C4R603 Protein required for beta-1,6 glucan biosynthesi... 149 2e-34 UniRef50_B0CRB8 Glycosyltransferase family 8 protein n=3 Tax=Fun... 148 2e-34 UniRef50_A5BZU1 Putative uncharacterized protein n=1 Tax=Vitis v... 148 3e-34 UniRef50_A5DLS6 Putative uncharacterized protein n=2 Tax=Pichia ... 148 3e-34 UniRef50_Q17VR5 Lipopolysaccharide biosynthesis protein n=19 Tax... 148 3e-34 UniRef50_Q873M5 UDP-Glc:glycoprotein glucosyltransferase n=2 Tax... 148 3e-34 UniRef50_Q0I2Z7 Lipopolysaccharide biosynthesis protein, glycosy... 147 4e-34 UniRef50_Q582S2 UDP-glucose:glycoprotein glucosyltransferase, pu... 147 5e-34 UniRef50_A8NCT1 Putative uncharacterized protein n=1 Tax=Coprino... 147 5e-34 UniRef50_C4Y414 Putative uncharacterized protein n=1 Tax=Clavisp... 147 5e-34 UniRef50_A1D472 Glycosyl transferase family 8 protein n=4 Tax=Tr... 147 7e-34 UniRef50_D1IU75 Whole genome shotgun sequence of line PN40024, s... 146 1e-33 UniRef50_D2MYR1 Putative uncharacterized protein n=1 Tax=Campylo... 145 1e-33 UniRef50_UPI00016E26D6 UPI00016E26D6 related cluster n=3 Tax=Tak... 145 1e-33 UniRef50_UPI0000587C70 PREDICTED: similar to MGC81998 protein n=... 145 2e-33 UniRef50_O15488 Glycogenin-2 n=43 Tax=Fungi/Metazoa group RepID=... 145 2e-33 UniRef50_A9S2B3 Predicted protein (Fragment) n=1 Tax=Physcomitre... 145 3e-33 UniRef50_B9WDQ8 Killer toxin-resistance protein, putative n=5 Ta... 144 3e-33 UniRef50_C0X9Z7 Putative uncharacterized protein n=2 Tax=Lactoba... 144 5e-33 UniRef50_Q6Z5D6 Glycosyltransferase family-like n=6 Tax=Poaceae ... 144 5e-33 UniRef50_Q5M7A1 Hypothetical LOC496877 n=2 Tax=Xenopus (Silurana... 144 6e-33 UniRef50_A3YS36 Putative sugar transferase n=2 Tax=Campylobacter... 144 7e-33 UniRef50_A0ZYL4 Putative uncharacterized protein n=1 Tax=Archaea... 143 7e-33 UniRef50_Q39T65 Glycosyl transferase, family 8 n=1 Tax=Geobacter... 143 1e-32 UniRef50_UPI000197AD97 hypothetical protein BACCOPRO_03221 n=1 T... 143 1e-32 UniRef50_Q2L3C5 Glycosyl transferase-like protein n=3 Tax=Magnol... 142 1e-32 UniRef50_Q22997 Unidentified vitellogenin-linked transcript prot... 142 2e-32 UniRef50_UPI00016B2258 glycosyl transferase, family 8 n=1 Tax=ca... 142 3e-32 UniRef50_A8P591 Glycogenin-1, putative n=2 Tax=Brugia malayi Rep... 141 4e-32 UniRef50_UPI0000F2E03D PREDICTED: similar to glycogenin 2, n=2 T... 140 5e-32 UniRef50_Q9R062 Glycogenin-1 n=22 Tax=Euteleostomi RepID=GLYG_MOUSE 140 6e-32 UniRef50_Q68SS4 Putative glycogenin protein n=1 Tax=Pleurotus dj... 140 8e-32 UniRef50_A4SAB5 Predicted protein (Fragment) n=1 Tax=Ostreococcu... 140 8e-32 UniRef50_B3RM47 Putative uncharacterized protein n=1 Tax=Trichop... 140 9e-32 UniRef50_A4IXE1 Glycosyl transferase, family 8 n=16 Tax=Francise... 140 9e-32 UniRef50_UPI000194B82A PREDICTED: similar to glycogenin 2 n=1 Ta... 139 1e-31 Sequences not found previously or not previously below threshold: >UniRef50_P27128 Lipopolysaccharide 1,3-galactosyltransferase n=43 Tax=Enterobacteriaceae RepID=RFAI_ECOLI Length = 339 Score = 306 bits (783), Expect = 9e-82, Method: Composition-based stats. Identities = 339/339 (100%), Positives = 339/339 (100%) Query: 1 MQQVFFQETEFLNSVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLC 60 MQQVFFQETEFLNSVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLC Sbjct: 1 MQQVFFQETEFLNSVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLC 60 Query: 61 FHIFTDYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYF 120 FHIFTDYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYF Sbjct: 61 FHIFTDYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYF 120 Query: 121 INKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKG 180 INKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKG Sbjct: 121 INKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKG 180 Query: 181 YFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYN 240 YFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYN Sbjct: 181 YFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYN 240 Query: 241 TQFSLNYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTAL 300 TQFSLNYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTAL Sbjct: 241 TQFSLNYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTAL 300 Query: 301 LKPNNSNQLRYSAKHMLKKHRYLKGFSNYLFYFIEKIKH 339 LKPNNSNQLRYSAKHMLKKHRYLKGFSNYLFYFIEKIKH Sbjct: 301 LKPNNSNQLRYSAKHMLKKHRYLKGFSNYLFYFIEKIKH 339 >UniRef50_D2TIX6 UDP-galactose:(Glucosyl) LPS alpha-1,3-galactosyltransferase n=1 Tax=Citrobacter rodentium ICC168 RepID=D2TIX6_CITRO Length = 340 Score = 299 bits (765), Expect = 1e-79, Method: Composition-based stats. Identities = 236/337 (70%), Positives = 272/337 (80%) Query: 1 MQQVFFQETEFLNSVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLC 60 MQQV+F+ETEFL S ID++H+ E + LDIAYG D+NFLFGCGISIAS+LK N L Sbjct: 1 MQQVYFKETEFLTSTIDFNHQDTAEKVVLDIAYGVDQNFLFGCGISIASVLKNNTDKTLH 60 Query: 61 FHIFTDYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYF 120 FH+F D F + DR+ FD LA QYKT I IYLIN + LRSLPSTKNWT+AIYFRF IADYF Sbjct: 61 FHVFIDAFNETDRRMFDKLAAQYKTHITIYLINCEHLRSLPSTKNWTYAIYFRFAIADYF 120 Query: 121 INKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKG 180 I K K+LYLDADIICQG I+ L+NFSF DK+A VVTEG+ADWWEKRA SLG GI KG Sbjct: 121 IGKTNKLLYLDADIICQGGIDELVNFSFASDKIAAVVTEGKADWWEKRALSLGTEGITKG 180 Query: 181 YFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYN 240 YFNSG +LIN QWA + +SARAI ML++P+I+ +ITHPDQDVLN+LLADKL F DIK+N Sbjct: 181 YFNSGLILINLNQWAIECISARAIKMLSDPDIVGRITHPDQDVLNILLADKLHFLDIKFN 240 Query: 241 TQFSLNYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTAL 300 TQFSLNYQLK+ FINPV NDTI IHYIGPTKPWH WA DY +S+ F++AK ASPWKNTAL Sbjct: 241 TQFSLNYQLKDKFINPVNNDTILIHYIGPTKPWHSWAGDYLISKPFIDAKQASPWKNTAL 300 Query: 301 LKPNNSNQLRYSAKHMLKKHRYLKGFSNYLFYFIEKI 337 LKP NSNQ RY AKHMLK RY+KG Y YF++KI Sbjct: 301 LKPTNSNQFRYCAKHMLKNKRYIKGMVGYFLYFMKKI 337 >UniRef50_Q9R9D1 UDP-glucose:(Glucosyl) LPS alpha1,3-glucosyltransferase WaaO n=29 Tax=Enterobacteriaceae RepID=Q9R9D1_ECOLX Length = 338 Score = 284 bits (727), Expect = 3e-75, Method: Composition-based stats. Identities = 171/338 (50%), Positives = 233/338 (68%), Gaps = 3/338 (0%) Query: 1 MQQVFFQETEFLNSVIDYDH-KVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRL 59 M +F E +N I +D + +AYG DKNFLFGCG+SI S+L +N Sbjct: 1 MSAHYFNPQEMINKTIIFDERPAASVASSFHVAYGIDKNFLFGCGVSITSVLLHNSDVSF 60 Query: 60 CFHIFTDYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADY 119 FH+F D + D + LA Y+T I+I+L+N +RL++LP+TKNW+ A+YFRFVIADY Sbjct: 61 VFHVFIDDIPEADIQRLAQLAKSYRTCIQIHLVNCERLKALPTTKNWSIAMYFRFVIADY 120 Query: 120 FINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAK 179 FI++ K+LYLDADI CQG ++PLI ++ VA VVTE A+WW R SL + K Sbjct: 121 FIDQQDKILYLDADIACQGNLKPLITMDLANN-VAAVVTERDANWWSLRGQSLQCNELEK 179 Query: 180 GYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKY 239 GYFNSG LLINT WA + VSA+A++ML + I+ ++T+ DQD+LN++L K+ F D KY Sbjct: 180 GYFNSGVLLINTLAWAQESVSAKAMSMLADKAIVSRLTYMDQDILNLILLGKVKFIDAKY 239 Query: 240 NTQFSLNYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTA 299 NTQFSLNY+LK+SF+ P+ ++T+ IHY+GPTKPWH WA YP +Q F++AK ASPWKN Sbjct: 240 NTQFSLNYELKKSFVCPINDETVLIHYVGPTKPWHYWA-GYPSAQPFIKAKEASPWKNEP 298 Query: 300 LLKPNNSNQLRYSAKHMLKKHRYLKGFSNYLFYFIEKI 337 L++P NSN RY AKH K+++ + G NY++YF KI Sbjct: 299 LMRPVNSNYARYCAKHNFKQNKPINGIMNYIYYFYLKI 336 >UniRef50_Q9ZIT4 UDP-galactose:(Glucosyl) LPS alpha1,3-galactosyltransferase WaaI n=26 Tax=Enterobacteriaceae RepID=Q9ZIT4_ECOLX Length = 335 Score = 259 bits (661), Expect = 1e-67, Method: Composition-based stats. Identities = 175/333 (52%), Positives = 235/333 (70%), Gaps = 1/333 (0%) Query: 6 FQETEFLNSVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFT 65 +++ + ++ ++ LDIA+G D+NFLFGCG++IASIL N FH+FT Sbjct: 4 LNDSDIILFEYNFHYQNIRSKNTLDIAFGIDRNFLFGCGVAIASILLNNREISCEFHVFT 63 Query: 66 DYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAP 125 DY D D+ YF LA QY +RI IY+IN D+L+SLPSTKNWT+A YFRF+IADYF +K Sbjct: 64 DYISDKDKLYFSDLAKQYNSRINIYVINCDKLKSLPSTKNWTYATYFRFIIADYFYHKHE 123 Query: 126 KVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSG 185 K+LYLDADI C+G+I+ L+++ F +++A VV E +WW+ RA L +A GYFN+G Sbjct: 124 KILYLDADIACKGSIKELLDYQFSTNEIAAVVAERDVEWWQNRASVLTTPQLASGYFNAG 183 Query: 186 FLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSL 245 FLLIN +W +S++AI ML +P+ + KITH DQDVLN+LL K+ F KYNT++S+ Sbjct: 184 FLLINIDEWNLNNISSKAIEMLRDPDWVSKITHLDQDVLNVLLNGKVKFISEKYNTRYSI 243 Query: 246 NYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNN 305 NY+LK+ NPV +DT+FIHY+GPTKPWH+WA +YPVS++F+ AK ASPW LLKP N Sbjct: 244 NYELKDKVDNPVNDDTVFIHYVGPTKPWHEWA-NYPVSRSFLIAKAASPWSKEDLLKPVN 302 Query: 306 SNQLRYSAKHMLKKHRYLKGFSNYLFYFIEKIK 338 SNQ RY AKH K+ Y+ G NYL Y+ EK Sbjct: 303 SNQYRYCAKHKFKQKHYMAGIFNYLKYYKEKCF 335 >UniRef50_D0KD53 Lipopolysaccharide 3-alpha-galactosyltransferase n=3 Tax=Enterobacteriaceae RepID=D0KD53_PECWW Length = 336 Score = 256 bits (654), Expect = 9e-67, Method: Composition-based stats. Identities = 152/339 (44%), Positives = 216/339 (63%), Gaps = 7/339 (2%) Query: 4 VFFQETEFLNSVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHI 63 ++F + + + +V + + + LDIA+GTD+ F++GC I+IASIL N L FH+ Sbjct: 1 MYFDKEKVIKTVHSFSYSKKCAE--LDIAFGTDEKFIYGCAIAIASILLKNPDYCLSFHV 58 Query: 64 FTDYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINK 123 FTD D D+ F +A QY T I IY+++ L++LP TK W++AIYFRF+IADYF Sbjct: 59 FTDKLSDGDKARFQEMAEQYNTTINIYIVDCSWLKTLPETKLWSYAIYFRFIIADYFYKI 118 Query: 124 APKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFN 183 KVLYLDADIIC G+++ LI + ++ VV +G ++WW+ RA ++ GYFN Sbjct: 119 LDKVLYLDADIICNGSLQELIKLDLSNH-ISAVVLDGDSNWWKNRAQKFQQPELSNGYFN 177 Query: 184 SGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQF 243 SG LLI W V+ ++ +L +PE+ K ITHPDQDVLN+LLA K + KYNTQF Sbjct: 178 SGVLLIEVNNWHQAAVTENSMRLLTDPEMKKIITHPDQDVLNVLLAGKSCHIESKYNTQF 237 Query: 244 SLNYQLKESFIN----PVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTA 299 S+NY+LK S+ P++N TIFIHYIGPTKPWH WA +Y ++ F++AK SPWKN + Sbjct: 238 SINYELKYSYGESAPTPISNKTIFIHYIGPTKPWHKWAANYACTKYFLKAKEHSPWKNES 297 Query: 300 LLKPNNSNQLRYSAKHMLKKHRYLKGFSNYLFYFIEKIK 338 LL ++ +RY AKH ++G ++L Y +K Sbjct: 298 LLDAVTASNMRYCAKHQFHNGEIIRGTLSFLKYLYKKAF 336 >UniRef50_B6XJW2 Putative uncharacterized protein n=1 Tax=Providencia alcalifaciens DSM 30120 RepID=B6XJW2_9ENTR Length = 333 Score = 255 bits (651), Expect = 2e-66, Method: Composition-based stats. Identities = 151/328 (46%), Positives = 213/328 (64%), Gaps = 3/328 (0%) Query: 11 FLNSVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDY-FG 69 + I + ++ C +AYG D NFL+G G+SI S+L +N + FHIF D Sbjct: 8 MVKKTIPIGNIEIDDSSCQHVAYGIDHNFLYGSGVSIVSLLMHNPHIQFAFHIFIDNSMS 67 Query: 70 DDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLY 129 D+D F + Y T+I IY I+ + ++ LP+TKNWTHAIYFRF+IA+YF +K +LY Sbjct: 68 DEDIAKFAEICHLYNTKITIYFIDSNNVKKLPTTKNWTHAIYFRFIIAEYFKDKIDYLLY 127 Query: 130 LDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLI 189 LDAD++C I+ L++ + +A VV E WW+KRA SLG ++KGYFNSG + I Sbjct: 128 LDADVVCNRNIDELLSHNLLGY-IAAVVPERDKAWWQKRADSLGFPSVSKGYFNSGVMYI 186 Query: 190 NTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQL 249 N W V+ +++A+L + E+ ++ +PDQDVLN+LL D ++F +NTQFSLNY+L Sbjct: 187 NLRTWKTNNVTEKSMALLMDNEVSHRLVYPDQDVLNILLTDSVLFISSIFNTQFSLNYEL 246 Query: 250 KESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQL 309 K+SF PV T+FIHY+GPTKPWH+WA +Y +Q F+EA+ SPW+N LLK +SN L Sbjct: 247 KKSFDFPVKRTTVFIHYVGPTKPWHEWA-NYETAQPFLEARAVSPWRNVPLLKAKSSNHL 305 Query: 310 RYSAKHMLKKHRYLKGFSNYLFYFIEKI 337 RY AKH + + +Y F NY+ YF KI Sbjct: 306 RYCAKHNINQRKYFFAFKNYIAYFFSKI 333 >UniRef50_D2TY85 UDP-D-galactose:(Glucosyl)lipopolysaccharide-alpha-1, 3-D-galactosyltransferase n=1 Tax=Arsenophonus nasoniae RepID=D2TY85_9ENTR Length = 343 Score = 245 bits (626), Expect = 1e-63, Method: Composition-based stats. Identities = 147/336 (43%), Positives = 204/336 (60%), Gaps = 3/336 (0%) Query: 4 VFFQETEFLNSVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHI 63 ++F E + + ++ + IAYG DKNF G ISI S+L +N+ F+I Sbjct: 10 MYFNSKEIILTSYEFSSAD-AKTPQFHIAYGADKNFSLGTAISICSMLYFNKIYTFHFYI 68 Query: 64 FTDYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINK 123 FTD + D K FD L Y T+I I LI+ +L+ LP+ K W+HAIYFRF+IA+YF NK Sbjct: 69 FTDTISECDLKKFDELTSCYNTKITILLIDTLQLKKLPTNKLWSHAIYFRFIIANYFHNK 128 Query: 124 APKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFN 183 K+LYLD+DIIC G I L + +A V Q W+KRA L IA GYFN Sbjct: 129 TNKILYLDSDIICSGDISELFDIDLNQHIIAAVADRDQ-YLWKKRAEMLATPEIANGYFN 187 Query: 184 SGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQF 243 SG +LI+T +W +++ + I +L + + K DQD LN+ L ++++F D K+NTQF Sbjct: 188 SGVMLIDTDKWHKNKITEKTINILLDDKTKAKFVFYDQDALNISLVNQVLFLDKKFNTQF 247 Query: 244 SLNYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKP 303 S+NY+LK + P+ N+ FIHYIGPTKPW+ W+ +YP + FM K SPWK T L+ Sbjct: 248 SINYELKNKTLFPIINNVKFIHYIGPTKPWNIWS-EYPSTHLFMTIKKNSPWKTTPLIAA 306 Query: 304 NNSNQLRYSAKHMLKKHRYLKGFSNYLFYFIEKIKH 339 + SNQ RY+AKHM K +Y+ NYL+YF+ K H Sbjct: 307 STSNQYRYAAKHMFNKKKYIYWLLNYLYYFVNKALH 342 >UniRef50_D2U322 Lipopolysaccharide 1,2-glucosyltransferase n=1 Tax=Arsenophonus nasoniae RepID=D2U322_9ENTR Length = 334 Score = 245 bits (625), Expect = 2e-63, Method: Composition-based stats. Identities = 107/332 (32%), Positives = 171/332 (51%), Gaps = 8/332 (2%) Query: 11 FLNSVIDY--DHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYF 68 F+ + + K+ N +IAYG DKNFL G ISI S+L N + FH+FTDY Sbjct: 4 FIKQKFNIAGEKKLTENNKNFNIAYGVDKNFLLGAAISINSVLINNTDTDFNFHLFTDYI 63 Query: 69 GDDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVL 128 D + F + +Y + I IYL++ L+ L ++ W++A YFR + +Y +L Sbjct: 64 DDGYIQRFQTMIAKYNSNIIIYLLDAAELKQLSTSDFWSYATYFRLIAFEYLSTNIHAIL 123 Query: 129 YLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLL 188 YLDAD+IC+G+++ + + D A+V+ + A L +A + YFN+G + Sbjct: 124 YLDADVICKGSLKEIFQLNLADSFAAVVLDVDS--MQQSSATRLNLADLNGKYFNAGVIY 181 Query: 189 INTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLN-- 246 +N +W S +++ ++ K+ + DQD LN+L + I+ YN + L Sbjct: 182 VNLQKWIENDFSKKSLELVRGKTNFGKLKYLDQDALNILFQTQNIYLSRDYNCIYKLKNE 241 Query: 247 --YQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPN 304 Y + N +T+ TI IHY G TKPWH W +YP SQ F + SPWK+ L Sbjct: 242 LAYHDLSKYKNTITDSTILIHYTGVTKPWHTWGINYPASQFFFNSYIHSPWKDQPLKMAE 301 Query: 305 NSNQLRYSAKHMLKKHRYLKGFSNYLFYFIEK 336 +L+ KH+ +H+Y++GF + Y + K Sbjct: 302 KRTELQEKYKHLFLQHKYMQGFLCLIKYKLLK 333 >UniRef50_UPI00019F16C6 hypothetical protein CATC2_20202 n=1 Tax=Citrobacter youngae ATCC 29220 RepID=UPI00019F16C6 Length = 330 Score = 238 bits (608), Expect = 2e-61, Method: Composition-based stats. Identities = 110/337 (32%), Positives = 181/337 (53%), Gaps = 15/337 (4%) Query: 6 FQETEFLNSVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFT 65 F + + +++++ L+IA+G DKNF+FG IS+ S+L +N+ + FH+FT Sbjct: 3 FDCHQSIKKILEFNQAPSEHKTQLNIAWGVDKNFMFGAAISMTSVLLHNKDLNIHFHLFT 62 Query: 66 DYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAP 125 DY D ++ LA Q+ T I IY+++ + L+ LPS W+HA+YFRF+ +Y K Sbjct: 63 DYIDADYQQRVAKLAEQFATNISIYIMDANGLKVLPSGNAWSHAMYFRFIAFEYLGEKVD 122 Query: 126 KVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSG 185 +LY+DAD++C+G++ L + A++ + + K YFNSG Sbjct: 123 SLLYIDADVMCKGSLYELTQIDLGEHVAAVITDVDDSPARD--------IEKNKDYFNSG 174 Query: 186 FLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSL 245 + N +W Q A +L + K++ PDQDVLN+L K+IF + ++N + + Sbjct: 175 VIFANLKKWKEQNFINSAFDILLDKN--NKLSFPDQDVLNILFLKKVIFLERRFNAIYGI 232 Query: 246 NYQLK----ESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALL 301 +LK + +T +TI IHYIG TKPW+ WA +YP +Q F+EA +SPW + LL Sbjct: 233 KQELKSKDTSKYKEYITPETILIHYIGVTKPWNSWA-NYPSAQYFVEAWKSSPWADVPLL 291 Query: 302 KPNNSNQLRYSAKHMLKKHRYLKGFSNYLFYFIEKIK 338 Q + ++H + +Y +Y+ Y K+K Sbjct: 292 PARTPKQYKKKSRHERLQGKYFASAISYIGYLWAKLK 328 >UniRef50_UPI000197C525 UDP-D-galactose:(glucosyl) lipopolysaccharide-alpha-1,3-D-galactosyltransferase n=1 Tax=Providencia rettgeri DSM 1131 RepID=UPI000197C525 Length = 339 Score = 236 bits (602), Expect = 9e-61, Method: Composition-based stats. Identities = 122/326 (37%), Positives = 192/326 (58%), Gaps = 2/326 (0%) Query: 13 NSVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDD 72 S + D + ++AYG DKNFLFG G+SI S+L N+ FH+FTD+ D D Sbjct: 12 TSFTNKDVNKDLSKKKFNVAYGADKNFLFGTGVSIVSVLLNNKDINFHFHVFTDFLSDKD 71 Query: 73 RKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDA 132 + F ++ QYKT + ++ +N D L+ LP+ + W+HAIYFR +IADYF K KVLYLD+ Sbjct: 72 IQLFSQISKQYKTSVTLHTLNMDILKKLPTNQVWSHAIYFRLIIADYFYKKCDKVLYLDS 131 Query: 133 DIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTA 192 D++C G+I+ L + + +A V+ + E V GI KGYFNSG +LIN Sbjct: 132 DVVCTGSIQILKSLNLSSMPIAAVMDISEPHSVEMANL-FNVEGIKKGYFNSGVMLINPD 190 Query: 193 QWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKES 252 +W +Q++ +++++ + ++ I + DQD +N+ + + D +N + +LN + K Sbjct: 191 EWNYRQLTEKSMSVFTDKKLQPVIKYYDQDAINIAVHGDWLKLDNIFNHRINLNDRYKHK 250 Query: 253 FINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYS 312 N ++ +F+H+IG TKPWH+W+ Y + F+ AK SPWK+ L+ P N +Y+ Sbjct: 251 -KNNDISNAVFVHFIGSTKPWHNWSKYYHEVRCFLNAKEKSPWKDIDLMTPQNITHHKYA 309 Query: 313 AKHMLKKHRYLKGFSNYLFYFIEKIK 338 +KH K +YL F +Y+ Y I KIK Sbjct: 310 SKHFRYKEKYLSSFYHYVLYTILKIK 335 >UniRef50_B6XJW1 Putative uncharacterized protein n=1 Tax=Providencia alcalifaciens DSM 30120 RepID=B6XJW1_9ENTR Length = 325 Score = 233 bits (595), Expect = 6e-60, Method: Composition-based stats. Identities = 107/322 (33%), Positives = 167/322 (51%), Gaps = 3/322 (0%) Query: 16 IDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKY 75 + + L+IAYG DK FLFG G+S+ SI+ N +L FH+FTDY D+ Sbjct: 6 EKIELGAQNGAAELNIAYGVDKGFLFGSGLSMNSIIINNSDIKLKFHLFTDYMNDEFLSK 65 Query: 76 FDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADII 135 + L L I IY+IN D L+ LP + W++A YFRF I D+ +LYLDAD+ Sbjct: 66 LEKLTLNENVNIDIYIINADELKKLPISHVWSYATYFRFFIFDHLCETLSSILYLDADVF 125 Query: 136 CQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWA 195 C+G++ I+ +F + A++ + L + I YFN+G + +N W Sbjct: 126 CKGSLRKYIDIAFNGEYAAVIPD--VPNMQISCVDRLSMPQIKDKYFNAGVIFLNLKVWD 183 Query: 196 AQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQL-KESFI 254 + + +A ++ K + + DQD LN++ + I+ YN ++L +L E++ Sbjct: 184 KNKFTKQAFNLITNNHTGKTLKYLDQDALNIIFNCQNIYLPRDYNCIYTLKNELEHENYK 243 Query: 255 NPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAK 314 + +T++T IHY G TKPWH WA +YP SQ F A SPWKN L+ + + K Sbjct: 244 DYITSETKLIHYTGATKPWHYWAVNYPASQTFKVAFETSPWKNDELVDAKKKPEYQERYK 303 Query: 315 HMLKKHRYLKGFSNYLFYFIEK 336 H + ++L G S+ + Y K Sbjct: 304 HEFNQKKFLTGISSLIKYKKFK 325 >UniRef50_C1DGU7 Lipopolysaccharide 1,3-galactosyltransferase n=1 Tax=Azotobacter vinelandii DJ RepID=C1DGU7_AZOVD Length = 326 Score = 232 bits (593), Expect = 1e-59, Method: Composition-based stats. Identities = 95/326 (29%), Positives = 156/326 (47%), Gaps = 15/326 (4%) Query: 22 VETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALAL 81 + L IA+G D+N+L GI+I SI++ N G L FH+F R D L Sbjct: 5 ATRNSDVLHIAFGVDENYLRPMGITIVSIIENNPGLELVFHVFISSISSASRVRLDRLER 64 Query: 82 QYKTRIKIYLING----DRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQ 137 + + ++L++ S + + A Y R +I + + +VLYLDADI+C Sbjct: 65 MFARPVNLHLVDEMLDVKDPASGKGQAHISKAAYIRLLIPEALRDFTDRVLYLDADILCV 124 Query: 138 GTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQ 197 G I L++ A++ G KRA + YFNSG L I+ +W + Sbjct: 125 GDISGLLHLDIDGRTAAVIRDAG---AESKRAGLVKKGQTLDNYFNSGVLYIDIPRWIER 181 Query: 198 QVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFI-NP 256 V++RA+ + +P + + + DQD LN++L + F D +N Q+ L +LK+ + Sbjct: 182 AVTSRALEKIADPVLD--LRYSDQDALNLVLDGDVRFIDKGWNHQYGLTGKLKKGRVGMD 239 Query: 257 VTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQL----RYS 312 V +DT F+H+IGP KPW W + + F+ + SPW AL + ++ R+ Sbjct: 240 VPSDTKFVHFIGPMKPWRSWN-PHQSKELFLRYQALSPWAGEALDDNFSPREIYVYSRFM 298 Query: 313 AKHMLKKHRYLKGFSNYLFYFIEKIK 338 + M ++ R+L G Y + K K Sbjct: 299 YRSMFQQGRWLSGLIWYGKFLHRKHK 324 >UniRef50_Q9ZIS1 UDP-galactose:(Galactosyl) LPS alpha1,2-galactosyltransferase WaaW n=29 Tax=Enterobacteriaceae RepID=Q9ZIS1_ECOLX Length = 342 Score = 232 bits (591), Expect = 2e-59, Method: Composition-based stats. Identities = 106/319 (33%), Positives = 170/319 (53%), Gaps = 7/319 (2%) Query: 24 TENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQY 83 + L+IAYG D+NFLFG +S+ S++ +N + FH+FTDY +D + +A + Sbjct: 19 NTDRVLNIAYGIDRNFLFGAAVSMQSVVMHNPDLAVKFHLFTDYIDEDYLQRVNAFTSKN 78 Query: 84 -KTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEP 142 ++IY ++ + PS K W++A +FR V Y +LY+DAD+IC+G++ Sbjct: 79 ANVEVRIYKVSSAFIDIFPSLKQWSYATFFRLVAFQYLSETIENLLYIDADVICKGSLAG 138 Query: 143 LINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSAR 202 L++ +F DK A V+ + EK A L + G+ YFN+G + + WA + Sbjct: 139 LLDINFDGDKFAAVIKD-VPFMQEKPAKRLAIEGLPGNYFNAGVVYLQLEAWAKNDFMNK 197 Query: 203 AIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLK----ESFINPVT 258 AIAML K DQD+LN+L IF Y+ + ++Y+LK E + +T Sbjct: 198 AIAMLASDPQHTKYKCLDQDILNILFFGHCIFISGDYDCFYGIDYELKNKSDEDYKKTIT 257 Query: 259 NDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLK 318 +DT IHY+G TKPW+DW +YP + F EA AS W + A + N Q + ++H+ + Sbjct: 258 DDTKLIHYVGVTKPWNDWT-NYPCQKYFNEAYQASCWNDVAFIPATNEKQYQVKSRHLKR 316 Query: 319 KHRYLKGFSNYLFYFIEKI 337 F ++ Y+ +KI Sbjct: 317 NGNIASSFYYFMLYYSKKI 335 >UniRef50_Q46Y64 Glycosyl transferase, family 8 n=1 Tax=Ralstonia eutropha JMP134 RepID=Q46Y64_RALEJ Length = 331 Score = 230 bits (587), Expect = 5e-59, Method: Composition-based stats. Identities = 77/321 (23%), Positives = 148/321 (46%), Gaps = 12/321 (3%) Query: 23 ETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQ 82 IA+ D N+ G +IASI+ N G FH+ T +++++ L Sbjct: 17 SNGKPSFHIAFCVDDNYFRAMGATIASIIDNNPGQHFTFHVLTFSALEENQRRLKQLEEM 76 Query: 83 YKTRIKIYLIN---GDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGT 139 Y +++L++ + +++ +I+ R VI + + +VLYLDADI+C Sbjct: 77 YPVSTQLHLLDLASFTQFSHFLGHSHYSLSIFTRLVIPEVLQGQTDRVLYLDADILCVNR 136 Query: 140 IEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQV 199 ++ L++ ++ +V +R +LG+A YFN G L IN +W A+ + Sbjct: 137 LDELVDMDISNEIAVVVPDA--PVTLRRRVAALGLAHAE--YFNGGVLFINIDKWLAENI 192 Query: 200 SARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLK-ESFINPVT 258 + + + + + DQD LN +L + + ++N + L + L F Sbjct: 193 TPQTLE--ALLDTSTDMRFNDQDALNKVLNGRAKYISPRWNYLYDLIHDLNVNRFAMRPV 250 Query: 259 NDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTAL-LKPNNSNQLRYSAKHML 317 +FIH+ G KPW DW+ + F + SPW++ L +P N+ ++R ++ M Sbjct: 251 GKAVFIHFAGSVKPWADWS-GHEARGLFRKYLALSPWRDMPLDPEPRNTKEMRMHSRFMF 309 Query: 318 KKHRYLKGFSNYLFYFIEKIK 338 ++H+ ++ YL Y ++ + Sbjct: 310 RQHKPVESLKWYLRYLRKRAQ 330 >UniRef50_Q9ZIS6 UDP-galactose:(Glucosyl) LPS alpha1,2-galactosyltransferase WaaT n=26 Tax=Enterobacteriaceae RepID=Q9ZIS6_ECOLX Length = 331 Score = 229 bits (585), Expect = 8e-59, Method: Composition-based stats. Identities = 105/333 (31%), Positives = 169/333 (50%), Gaps = 8/333 (2%) Query: 9 TEFLNSVIDYDHKVETENLC-LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDY 67 EF+ Y + EN L+++YG DKNFL+G G+SI+S+L N FH+FTDY Sbjct: 2 NEFIKERFSYLADNKKENAPELNVSYGIDKNFLYGAGVSISSVLINNSDINFVFHVFTDY 61 Query: 68 FGDDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKV 127 DD K F+ A Q+ T I +YLI+ LP+++ W++A YFR + +Y + Sbjct: 62 VDDDYLKSFNETAKQFNTSIIVYLIDPKYFADLPTSQFWSYATYFRVLSFEYLSESISTL 121 Query: 128 LYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFL 187 LYLDAD++C+G+++PL F D+ A++ A L + + YFN+G + Sbjct: 122 LYLDADVVCKGSLKPLTEIIFKDEFAAVIPDNDST--QAACAKRLNIPEMNGRYFNAGVI 179 Query: 188 LINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFA----DIKYNTQF 243 +N +W ++ + +L + + DQD LN+ I+ D Y + Sbjct: 180 YVNLKKWHEANLTPYLLKLLRGETKYGSLKYLDQDALNIAFNMNNIYLAKDFDTIYTLKN 239 Query: 244 SLNYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKP 303 L + + +T+ T+ IHY G TKPWH WA YP + F A+ SPWK L + Sbjct: 240 ELYDRSHRKYQQTITDKTVLIHYTGITKPWHSWA-GYPSASYFNIAREQSPWKKYPLKEA 298 Query: 304 NNSNQLRYSAKHMLKKHRYLKGFSNYLFYFIEK 336 +++ KH+ Y+KG ++ + Y ++K Sbjct: 299 RTVAEMQKQYKHLFAHGEYIKGITSLIKYKLKK 331 >UniRef50_B2PV91 Putative uncharacterized protein n=2 Tax=Enterobacteriaceae RepID=B2PV91_PROST Length = 342 Score = 228 bits (581), Expect = 2e-58, Method: Composition-based stats. Identities = 102/315 (32%), Positives = 165/315 (52%), Gaps = 8/315 (2%) Query: 27 LCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTR 86 CLD+ YG+D+N+ FG G+S S+L N + FH F D D + +A Q++ Sbjct: 23 SCLDVIYGSDENYQFGAGVSAVSLLINNPTTFFRFHYFLDKVSPDFLEKLKVIASQFQVE 82 Query: 87 IKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINF 146 +Y ++ L++LP++ W+ A+YFR V DY + LYLDAD++C G ++ N Sbjct: 83 FHVYELDNKLLKTLPASDVWSSAMYFRLVALDYLSSDYDFALYLDADVMCNGILDLTTNL 142 Query: 147 SFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAM 206 D +V + K L +AK YFNSG + +N +W +Q++ + + Sbjct: 143 -IKDKVCGVVADDIGVRT--KSETRLHAPSLAKTYFNSGVMFVNLKKWHEKQITQQCFEL 199 Query: 207 LNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQL----KESFINPVTNDTI 262 L+ ++ +PDQDVLN++L + L ++NT ++L +L + + +T +T+ Sbjct: 200 LSAENAKQRYKYPDQDVLNLILREDLELLSQRFNTVYTLKNELYDSTHQKYQQVITPETV 259 Query: 263 FIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKKHRY 322 IHY G +KPWH WA +YP SQ F +A SPW L + + KH+LK+ Y Sbjct: 260 LIHYTGVSKPWHTWA-NYPASQPFYKALMQSPWTTNDLKPATKFVERKKEYKHLLKQGNY 318 Query: 323 LKGFSNYLFYFIEKI 337 L G + + Y EK+ Sbjct: 319 LAGILSGIRYSFEKL 333 >UniRef50_A8ARL6 Putative uncharacterized protein n=3 Tax=Citrobacter RepID=A8ARL6_CITK8 Length = 339 Score = 227 bits (579), Expect = 4e-58, Method: Composition-based stats. Identities = 105/324 (32%), Positives = 179/324 (55%), Gaps = 9/324 (2%) Query: 18 YDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFD 77 D+ ++ L+IAYG D+NFLFG GIS+ S+L N + F++ TDY D+ + + Sbjct: 15 IDNATHQKSKKLNIAYGVDRNFLFGSGISMTSVLVNNPDIDIHFYVVTDYVDDEYLESVE 74 Query: 78 ALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQ 137 L Y T + + + + + R LPSTK WT+A+Y+R+ +Y + VLYLDADI+C+ Sbjct: 75 RLTQMYGTTVTVLVFDNEAFRKLPSTKAWTYAMYYRYFAFEYLSRELDSVLYLDADIVCK 134 Query: 138 GTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQ 197 ++ L + F + A+V + K LG+ +A+ YFNSG + N W + Sbjct: 135 NSLRELTDIHFAGEYAAVVNDIDRVRL--KSGQRLGIPELARDYFNSGVVFANLHVWREK 192 Query: 198 QVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKES----F 253 ++ ++A +L E K++ + DQD+LN+L +I +N + ++ +LK + Sbjct: 193 KLLSKAFEVL--HERQKELLYFDQDILNILFVGHVILLRRDFNCIYGVDQELKNKNEYRY 250 Query: 254 INPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSA 313 + +T T+ IHY+G TKPWH WA +YPVS+ F+EA S W +LL N + + + Sbjct: 251 QDFITESTVLIHYVGVTKPWHTWA-NYPVSKYFIEAYKKSAWAEKSLLNANTAKLYKRKS 309 Query: 314 KHMLKKHRYLKGFSNYLFYFIEKI 337 +H + +Y++ +++ Y K+ Sbjct: 310 RHERIQRKYIRSIFSHIMYIKNKL 333 >UniRef50_Q9ZIT6 UDP-glucose:(Galactosyl) LPS alpha1,2-glucosyltransferase WaaJ n=26 Tax=Enterobacteriaceae RepID=Q9ZIT6_ECOLX Length = 339 Score = 226 bits (577), Expect = 8e-58, Method: Composition-based stats. Identities = 112/337 (33%), Positives = 183/337 (54%), Gaps = 9/337 (2%) Query: 6 FQETEFLNSVIDYDHKVET--ENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHI 63 F+ +I+ D + E ++++G D+N+ G ISIASIL+ N+ ++ FHI Sbjct: 5 FKHLTQFKDIIELDKRPVKLDERETFNVSWGIDENYQVGAAISIASILENNKQNKFTFHI 64 Query: 64 FTDYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINK 123 DY + + LA +Y+T IK+YLI+ + L++LP + W +IY+R + DYF + Sbjct: 65 IADYLDKEYIELLSQLATKYQTVIKLYLIDSEPLKALPQSNIWPVSIYYRLLSFDYFSAR 124 Query: 124 APKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFN 183 +LYLDADI+C+G++ LI F D+ A+V+ K A L YFN Sbjct: 125 LDSLLYLDADIVCKGSLNELIALEFKDEYGAVVIDVDA--MQSKSAERLCNEDFNGSYFN 182 Query: 184 SGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQF 243 SG + IN +W Q+++ + +L++ IIKK+ +PDQD+LN++ KYN + Sbjct: 183 SGVMYINLREWLKQRLTEKFFDLLSDESIIKKLKYPDQDILNLMFLHHAKILPRKYNCIY 242 Query: 244 SLNYQLKES----FINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTA 299 ++ + +E + + +DT+FIHY G TKPWHDWA +Y + F N SPW+N Sbjct: 243 TIKSEFEEKNSEYYTRFINDDTVFIHYTGITKPWHDWA-NYASADYFRNIYNISPWRNIP 301 Query: 300 LLKPNNSNQLRYSAKHMLKKHRYLKGFSNYLFYFIEK 336 K ++ + KH+L + ++L G + Y + K Sbjct: 302 YKKAVKKHEHKEKYKHLLYQKKFLDGVFTAIKYNVMK 338 >UniRef50_P27129 Lipopolysaccharide 1,2-glucosyltransferase n=44 Tax=Enterobacteriaceae RepID=RFAJ_ECOLI Length = 338 Score = 224 bits (570), Expect = 5e-57, Method: Composition-based stats. Identities = 105/327 (32%), Positives = 169/327 (51%), Gaps = 9/327 (2%) Query: 14 SVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDR 73 D+ + CL++AYG D N+L G G+SI SI+ N L F+I D + D Sbjct: 13 KAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFF 72 Query: 74 KYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDAD 133 + LA Q + RI +Y IN D+L+ LP T+ W+ A+YFR ++LYLDAD Sbjct: 73 QKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDAD 132 Query: 134 IICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQ 193 ++C+G I L++ A+V EK L + YFNSG + ++ + Sbjct: 133 VVCKGDISQLLHLGLNGAVAAVVKD--VEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKK 190 Query: 194 WAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKE-- 251 WA +++ +A+++L + + +PDQDV+N+LL +F +YNT +++ +LK+ Sbjct: 191 WADAKLTEKALSILMSKDNV--YKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKT 248 Query: 252 --SFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQL 309 ++ +T T+ IHY G TKPWH WA YP + + A SPWK+ + + + Sbjct: 249 HQNYKKLITESTLLIHYTGATKPWHKWAI-YPSVKYYKIALENSPWKDDSPRDAKSIIEF 307 Query: 310 RYSAKHMLKKHRYLKGFSNYLFYFIEK 336 + KH+L +H Y+ G + Y K Sbjct: 308 KKRYKHLLVQHHYISGIIAGVCYLCRK 334 >UniRef50_UPI0001B4A0A4 putative glucosyltransferase n=1 Tax=Bacteroides sp. 2_1_7 RepID=UPI0001B4A0A4 Length = 301 Score = 221 bits (564), Expect = 2e-56, Method: Composition-based stats. Identities = 67/308 (21%), Positives = 134/308 (43%), Gaps = 13/308 (4%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 +DI TD N++ CG+ + SI N + HI T+ ++++ + +Y +I+ Sbjct: 1 MDIVCCTDNNYVIPCGVLVTSICVNNPKEEITVHILTEGISPENQEVLKKVVAKYGQQIQ 60 Query: 89 IYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSF 148 Y ++ + P +++ T A YFR ++ D KVLYLD D++ + ++ L + Sbjct: 61 FYTVDKKVFANCPISRHITLATYFRLIMTDILPKSVEKVLYLDCDVVVRHSLRSLWDTDI 120 Query: 149 PDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLN 208 ++ D R ++ + GYFN+G LL+N W +S ++N Sbjct: 121 KSYAAGVIPDMSIDDI---RIYNRLQYSPSLGYFNAGVLLVNLRYWRENNLSESFFEIIN 177 Query: 209 EPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNY------QLKESFINPVTNDTI 262 + +++ + DQDVLN++L + + +KYN Q + + D + Sbjct: 178 KY--PERLRYHDQDVLNIVLKEIKLTLPMKYNVQHGYFFKDPLISRTYRDEREQAITDPV 235 Query: 263 FIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKKHRY 322 +HY G KPW ++ P + F + S + + +++ + +L+K Sbjct: 236 ILHYSGS-KPW-FIEFEPPFKKDFAFYLDTSGLDKSFIRHIPMKARIKARFRSLLEKLGL 293 Query: 323 LKGFSNYL 330 + + Sbjct: 294 IAPKDSLF 301 >UniRef50_D1PTN4 Putative uncharacterized protein n=1 Tax=Prevotella bergensis DSM 17361 RepID=D1PTN4_9BACT Length = 305 Score = 221 bits (563), Expect = 3e-56, Method: Composition-based stats. Identities = 64/289 (22%), Positives = 125/289 (43%), Gaps = 15/289 (5%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 +DI + D N+L C ++ SIL N+ ++ FH+ ++ ++ R + +A Y ++ Sbjct: 1 MDIVFNIDDNYLMQCCTTMVSILHNNKDGQISFHVISNGLTNESRLKIEQVAEAYHQQVF 60 Query: 89 IYLINGD---RLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLIN 145 Y++N + + + A Y R +AD + K++Y+D D+I G+++ L N Sbjct: 61 FYVVNPEAMSDYEIFDKQGHISMATYLRLFVADILPERLHKIIYMDCDLIVNGSLDGLWN 120 Query: 146 FSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIA 205 +A V + + A YFN+G L++N W VS +A Sbjct: 121 TDVEGYALAAVED---MWSGKADNYVRLGYDAADTYFNAGVLVVNLDYWREHNVSQQAAQ 177 Query: 206 MLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKE------SFINPVTN 259 + ++ DQDVLN L D + ++N Q L + ++ ++ Sbjct: 178 YVALH--AGQLKFNDQDVLNGLFHDSKLLLPFRWNVQDGLLRKRRKIRPEVMPKLDQELE 235 Query: 260 DTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQ 308 + + IH+ G KPW+ ++ P F + + + W+ + P + Sbjct: 236 NPVIIHFTGHRKPWN-FSCLNPYKNLFFKYVDMTEWRGFRPIVPLSWKL 283 >UniRef50_B6GCA0 Putative uncharacterized protein n=1 Tax=Collinsella stercoris DSM 13279 RepID=B6GCA0_9ACTN Length = 990 Score = 217 bits (552), Expect = 6e-55, Method: Composition-based stats. Identities = 60/358 (16%), Positives = 130/358 (36%), Gaps = 31/358 (8%) Query: 1 MQQVFFQETEFLNSVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEG-SRL 59 +Q V FQE ++ + +++ + + + +D N++ +I S+L R Sbjct: 621 LQCVHFQEPDY-KPGLKMPVRLDDLRQIVPVVFASDNNYVPMLTTTIHSMLSNASNNYRY 679 Query: 60 CFHIFTDYFGDDDRKYFDALALQY-KTRIKIYLIN--GDRLRSLPSTKNWTHAIYFRFVI 116 + ++ Y + ++ ++ + + + Y+RF+I Sbjct: 680 DITVLHRDISGANQAIMREFFSSYDNVNLGFCDVSQVIEKYNLTTNNPHISVETYYRFLI 739 Query: 117 ADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEG-----QADWWEKRAHS 171 D KVLYLD+D+I +G + L D +A ++ A++ Sbjct: 740 QDLLP-YYDKVLYLDSDLIIRGDVSELFATDLGDSLLAAAHDIDFVANVNMKRGDRFAYA 798 Query: 172 LGVAGIAKGY--FNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLA 229 V G+ Y F +G L++NT ++ + ++ + + DQDVLN Sbjct: 799 KEVLGMKDPYSYFQAGVLVLNTRAMRSRHTMEEWLEFASD----DRFIYNDQDVLNAHCE 854 Query: 230 DKLIFADIKYNTQFSLNYQLKESF----------INPVTNDTIFIHYIGPTKPWHDWAWD 279 ++++ D +N ++ + F ++ +HY G KPW D Sbjct: 855 GEVVYLDYSWNVMIDCFGRINKVFTFAPAYMFDAFIESRSNEKIVHYAGFEKPWKLAGCD 914 Query: 280 YPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKKHRYLKGFSNYLFYFIEKI 337 + + +P+ + L N+ ++ H + L ++ I Sbjct: 915 R--GELYWRYARETPFYESLLQHSIAVNRSGRLPDYL--IHEPALSPRSPLRKIVDPI 968 >UniRef50_D0KD54 Lipopolysaccharide glucosyltransferase I n=2 Tax=Pectobacterium RepID=D0KD54_PECWW Length = 336 Score = 216 bits (550), Expect = 1e-54, Method: Composition-based stats. Identities = 119/337 (35%), Positives = 186/337 (55%), Gaps = 10/337 (2%) Query: 4 VFFQETEFLNSVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHI 63 + F + SV + H+ ++ L++AYG DKN+ GCG+SI SIL N FH+ Sbjct: 1 MVFSSHIDVLSVFEKRHQSIADHDTLNVAYGIDKNYAVGCGVSITSILINNS-IDFTFHV 59 Query: 64 FTDYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINK 123 F+D F DD K LA ++KT+I +Y IN + L++LP T W+HA+YFR + + +K Sbjct: 60 FSDDFDDDFIKKISILAEKFKTKIILYKINSEMLKTLPCTDIWSHAMYFRLLAFSHLSDK 119 Query: 124 APKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFN 183 +LYLDAD++C+G++E L + A++ + +K A L +A + YFN Sbjct: 120 TSSLLYLDADVMCKGSLEQLHKLNTAPHVAAVIRD--VPEMQKKSASRLKMAALEGEYFN 177 Query: 184 SGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQF 243 SG L N W ++ + L + E + I +PDQD++N+LL + F +YNT + Sbjct: 178 SGVLFANLDIWNKLDLTQKIFDKLRDGE--ESIQYPDQDIMNILLNGNVTFLPKEYNTIY 235 Query: 244 SLNYQLKE----SFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTA 299 S+ +LK+ + + +DTI IHY G TKPWH WA +YP + F A+ SPW + Sbjct: 236 SIKNELKDSNHQKYKEVIKDDTILIHYTGVTKPWHKWA-NYPSTSYFQHAQENSPWSTSD 294 Query: 300 LLKPNNSNQLRYSAKHMLKKHRYLKGFSNYLFYFIEK 336 L + +++ KH+LKK +YL G + Y + K Sbjct: 295 LKDADTFVEMKKKYKHLLKKGKYLSGLISAFKYSLNK 331 >UniRef50_C3X7M2 Lipopolysaccharide 3-alpha-galactosyltransferase n=1 Tax=Oxalobacter formigenes OXCC13 RepID=C3X7M2_OXAFO Length = 307 Score = 216 bits (550), Expect = 1e-54, Method: Composition-based stats. Identities = 80/315 (25%), Positives = 138/315 (43%), Gaps = 11/315 (3%) Query: 25 ENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYK 84 IA+G D + ++IASIL+ N+ S + FH+ + D L + Sbjct: 1 MKNEFHIAFGVDTIYAPKMCVTIASILENNKNSNIIFHVIYNDLSDKVIDEIKKSMLTLQ 60 Query: 85 TRIKIYLINGDR--LRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEP 142 I + I+ D + + T + RF I + + LYLDADIIC I Sbjct: 61 AEINFHFIDVDLSIFPKFSNFSHITSGAFLRFFIPELLQGLTDRALYLDADIICINNISD 120 Query: 143 LINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSAR 202 L + ++++ VV + ++ + S K YFNSG L+++ +W V + Sbjct: 121 LFHLEMDENEILAVVEDIDSETYLNENASFQ-----KRYFNSGVLMMDIEKWNKNNVYGQ 175 Query: 203 AIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTI 262 +++L E DQD LN+++ DK+ + D +N + K+ V + Sbjct: 176 LLSVL--NEKGSGFNLIDQDALNLVMIDKVHYLDNIWNYMINAEQLDKKKEKYSVPENAK 233 Query: 263 FIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKKHRY 322 FIH++GP KPWH + ++ ++ + + W L P N ++R A++ KK Y Sbjct: 234 FIHFVGPVKPWHCYNIFDDITGLYLNYQKKTVWDG--LEMPKNYKEMRRYARYSFKKGNY 291 Query: 323 LKGFSNYLFYFIEKI 337 L G + + Y K Sbjct: 292 LTGLNWGMRYIKTKF 306 >UniRef50_P25148 General stress protein A n=3 Tax=Bacillus subtilis group RepID=GSPA_BACSU Length = 286 Score = 213 bits (542), Expect = 8e-54, Method: Composition-based stats. Identities = 60/284 (21%), Positives = 113/284 (39%), Gaps = 15/284 (5%) Query: 24 TENLCLDIAYGTDKNFLFGCGISIASILKYNEGSR-LCFHIFTDYFGDDDRKYFDALALQ 82 ++ + I D N+ G S+L + R + ++ D++K + L+ Sbjct: 2 RKDEIMHIVSCADDNYARHLGGMFVSLLTNMDQEREVKLYVIDGGIKPDNKKRLEETTLK 61 Query: 83 YKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINK-APKVLYLDADIICQGTIE 141 + I+ ++ + + + T A Y+R I D ++ +++Y+D D + I Sbjct: 62 FGVPIEFLEVDTNMYEHAVESSHITKAAYYRISIPDLIKDESIKRMIYIDCDALVLEDIS 121 Query: 142 PLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSA 201 L + VA V GQ +R + V K YFNSG ++I+ W Q ++ Sbjct: 122 KLWDLDIAPYTVAAVEDAGQ----HERLKEMNVTDTGK-YFNSGIMIIDFESWRKQNITE 176 Query: 202 RAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLK-------ESFI 254 + I +NE + DQD LN +L D+ ++N Q + +LK Sbjct: 177 KVINFINEHPDEDFLVLHDQDALNAILYDQWYELHPRWNAQTYIMLKLKTPSTLLGRKQY 236 Query: 255 NPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNT 298 N + +H+ G KPW+ +P + + + W Sbjct: 237 NETRENPAIVHFCGGEKPWNS-NTKHPYRDEYFHYMSYTKWNTI 279 >UniRef50_C7IBC1 Glycosyl transferase family 8 n=2 Tax=Clostridium papyrosolvens DSM 2782 RepID=C7IBC1_9CLOT Length = 452 Score = 212 bits (541), Expect = 1e-53, Method: Composition-based stats. Identities = 69/317 (21%), Positives = 120/317 (37%), Gaps = 15/317 (4%) Query: 27 LCLDIAYGTDKNFLFGCGISIASILKYNE-GSRLCFHIFTDYFGDDDRKYFDALALQYKT 85 + I D +++ G+ I S+L+ + L F++ D D++ + Y Sbjct: 2 ETVKIVSACDSHYVQHLGVMITSLLENTSMKTSLEFYVIDGGITDADKELLCSCTCLYGC 61 Query: 86 RIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLIN 145 +I I D + + + A YFR +++ KV+YLD DI+ I L Sbjct: 62 KINFITIQADFYARFGESPSASDATYFRIFVSELLDTSVEKVIYLDCDIVVIKDIAELWK 121 Query: 146 FSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAK--GYFNSGFLLINTAQWAAQQVSARA 203 + +A V G E G+ + YFN+G LLIN +W + +S Sbjct: 122 TDVSEYFLAAVADCGVEYSGEYAVTLKRKLGMKRKDCYFNAGVLLINLVKWREESISKSI 181 Query: 204 IAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINP-----VT 258 L E + KI DQD LN +L ++ + D ++N Q + ++ + Sbjct: 182 CKFLFENK--GKIDFADQDGLNAVLCNRWLPLDSRWNQQVAHCEFYEQEKVVWENVTRAV 239 Query: 259 NDTIFIHYIGP----TKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAK 314 + IHY TKPW+ +P Q + + +PWK+ N L Sbjct: 240 REPWIIHYTTSYFSGTKPWNYLDM-HPYRQEYYRYLHMTPWKSFIPPDRTIWNILLKIIY 298 Query: 315 HMLKKHRYLKGFSNYLF 331 + + + Sbjct: 299 EAYAGRLLINYYRRSIK 315 >UniRef50_D2RIJ4 Glycosyl transferase family 8 n=2 Tax=Acidaminococcus RepID=D2RIJ4_ACIFE Length = 309 Score = 212 bits (540), Expect = 1e-53, Method: Composition-based stats. Identities = 56/315 (17%), Positives = 106/315 (33%), Gaps = 20/315 (6%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNE-GSRLCFHIFTDYFGDDDRKYFDALALQYKTRI 87 + I +D N+ ++ ASIL + + F+ F D ++ + A + I Sbjct: 4 ISIVLASDDNYAQHGAVACASILANHRGERPIHFYYFDDGISEEKQAGIAATVTGLQGSI 63 Query: 88 KIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 G ++ ++ + A Y R +I + +V+YLD D++ I+ L Sbjct: 64 TFIPTAGKEIQ-AHTSGHVNRAAYLRLLIPELVPQAVHRVIYLDTDLVVLDDIQELWEMD 122 Query: 148 FPDDKVAMVVTEGQADWWEKRAHSLGVAGI--AKGYFNSGFLLINTAQWAAQQVSARAIA 205 V V G R GI K YFNSG +++ W +Q + I Sbjct: 123 LQGKPVGAVPDLGILASSRMRRQKEETLGIQEGKLYFNSGVMVMELEAWREKQYGDQVIR 182 Query: 206 MLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFS--------LNYQLKESFINPV 257 + E H DQD LN + D +++N L + Sbjct: 183 CVEE----GNFRHHDQDGLNKVFQDNWQPLPLRWNVIPPVFTLPVKVLKKSRWRNLALEA 238 Query: 258 TNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHML 317 H+ G KPW + ++ + + + + +P + + + Sbjct: 239 LERPAVFHWAGRYKPWEFPPKGH-FNEKYYTYLARTAFAGAKMPQPGKDMKGKSFTR--- 294 Query: 318 KKHRYLKGFSNYLFY 332 ++ R + Sbjct: 295 QEWRLKLAELWKKLF 309 >UniRef50_C7QL87 Glycosyl transferase family 8 n=2 Tax=Cyanothece RepID=C7QL87_CYAP0 Length = 283 Score = 211 bits (537), Expect = 3e-53, Method: Composition-based stats. Identities = 78/296 (26%), Positives = 127/296 (42%), Gaps = 13/296 (4%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 +DI + DKN+ G++I S++ N H+ T D K D L + + + Sbjct: 1 MDILFCFDKNYEQHFGVAITSLILNNTNKIKTIHLVTKDNSKDFLKKIDKLKSKTQAKFF 60 Query: 89 IYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSF 148 IY + L ++ + + + A Y+R + + K+LYLD+D++ ++E L N Sbjct: 61 IYSPDDKDLSNVKVSAHISTAAYYRLLAPELLPQDLKKILYLDSDLVVNSSLENLYNMDI 120 Query: 149 PDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLN 208 DD +A +KR G YFNSG +LIN W + + + L Sbjct: 121 SDDILAAYAGGKMGPGTKKRLQLTG-----DFYFNSGVMLINLEAWRTENIGNKCFKFLQ 175 Query: 209 EPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIG 268 E + ++ DQD LN ++ K + D +N+ L VTN +I IH+ G Sbjct: 176 ENPDMIRLW--DQDALNKIVDGKFLNIDGIWNSLVDLTT-----GETRVTNQSIIIHFTG 228 Query: 269 PTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKKHRYLK 324 KPW W P Q + SPW N P N ++ + K + K+ + K Sbjct: 229 TLKPWQSW-CIRPEKQIYWYYLRQSPWSNAYPQFPKNFQEMLLAIKSVYKQIKPKK 283 >UniRef50_D2ELM0 Lipopolysaccharide biosynthesis glycosyltransferase n=1 Tax=Pediococcus acidilactici 7_4 RepID=D2ELM0_PEDAC Length = 552 Score = 211 bits (537), Expect = 3e-53, Method: Composition-based stats. Identities = 59/292 (20%), Positives = 119/292 (40%), Gaps = 16/292 (5%) Query: 9 TEFLNSVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYF 68 +L+ + ++E +++ + F S SIL+ + + F + D+ Sbjct: 259 HPYLDEYHEELGELEMHRGVINVISAANSAFTQALATSYVSILENDPDHQYNFFLLPDHL 318 Query: 69 GDDDRKYFDALALQY-KTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKV 127 D D ++ +Y IK+ +N + L + + Y+R + + Sbjct: 319 TDRDMMLLGSIIARYDNATIKVVEVNEELLANAVESDRIVKTAYYRILAPALLP-SINRA 377 Query: 128 LYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFL 187 +YLD DII ++ L + + +A V G + R +G+ + YFNSG + Sbjct: 378 IYLDCDIIANTSLHELWQTNLEGNVIAAVEDAG----FHDRLEKMGITKENEKYFNSGMM 433 Query: 188 LINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNY 247 LI+ +W A+ + + + +N+ +K+ DQD LN L D + ++N Q ++ Sbjct: 434 LIDLVRWRARSTTQKVLDYINQN--PEKLRFHDQDALNANLYDDWLHLHPQWNAQSNIIM 491 Query: 248 QL-------KESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNA 292 + D IH+ G KPWH+ ++P + +++ Sbjct: 492 ETIFPPRTELLEPYAETREDPKLIHFCGHVKPWHE-GCEHPYADVYLKYHEM 542 Score = 180 bits (458), Expect = 4e-44, Method: Composition-based stats. Identities = 65/273 (23%), Positives = 120/273 (43%), Gaps = 14/273 (5%) Query: 27 LCLDIAYGTDKNFLFGCGISIASILKY-NEGSRLCFHIFTDYFGDDDRKYFDALALQYKT 85 ++I D+N+ I+I + L+ N +R F + T+ GD R D L + Sbjct: 2 KKINILLAADRNYADQLCITIKTALETLNSATRAHFIVLTNNLGDQTRALLDKLMHNFH- 60 Query: 86 RIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYF-INKAPKVLYLDADIICQGTIEPLI 144 I+ ++ +R P+ ++ YFR + + +++YLD D++ + + L Sbjct: 61 TIEYLNLDDERFDFCPTNQHINKTAYFRIIAPKLLASRQIDRLIYLDVDVLIRKDLTELA 120 Query: 145 NFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAI 204 + + V V+ GQA + R V + YFNSG ++I+ AQW A +++ + + Sbjct: 121 ESNLNQNTVGAVIDTGQA-FALHRLGVDPVVAASNLYFNSGIMVIDVAQWNAHRITEKTL 179 Query: 205 AMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKE-------SFINPV 257 A + +I DQD LN +LA ++ F K+N Q S+ ++ I+ Sbjct: 180 AFIRNH--ADRIIFHDQDALNAVLAGEVQFLHPKWNLQNSIIFRKHRPINQGYAELIDEA 237 Query: 258 TNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAK 290 + +H+ KPW D +P + E Sbjct: 238 IKEPSIVHFTTHEKPWKDLTV-HPYLDEYHEEL 269 >UniRef50_B7GNT4 Glycosyl transferase, family 8 n=5 Tax=Bifidobacterium RepID=B7GNT4_BIFLI Length = 1013 Score = 210 bits (535), Expect = 5e-53, Method: Composition-based stats. Identities = 65/358 (18%), Positives = 128/358 (35%), Gaps = 31/358 (8%) Query: 1 MQQVFFQETEFLNSVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYN-EGSRL 59 +Q V F E + + + + + D N++ ++ S +K Sbjct: 644 LQCVHFTNPE---PAEELKPLDVFDKPIVPVVFAADDNYVPQLTTTVYSAMKNADPSYFY 700 Query: 60 CFHIFTDYFGDDDRKYFDALALQY-KTRIKIYLINGDRLRSLPSTK--NWTHAIYFRFVI 116 + D ++ Q+ ++ + + ST + + Y+RF+I Sbjct: 701 DVVVLQQDIAGDKQERMWRFFEQFPNMSLRFLNVKRELSGYDLSTNNAHISIETYYRFLI 760 Query: 117 ADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEG-----QADWWEKRAHS 171 KVLYLD+DII G I L + D+ + V ++ +++ Sbjct: 761 QQLLP-NYDKVLYLDSDIIIVGDIAKLYDIDLQDNLLGAVRDIDFLGNLNVKHGKRMSYA 819 Query: 172 LGVAGIAKGY--FNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLA 229 V + Y F +G L++NT + + + + P + DQDVLN Sbjct: 820 KDVLKMKNPYDYFQAGVLVLNTKGMRNRYSIEQWLTYASNPN----YIYNDQDVLNAYCE 875 Query: 230 DKLIFADIKYNTQFSLNYQLKESFIN----------PVTNDTIFIHYIGPTKPWHDWAWD 279 K+++ ++N ++ F ++ IHY G KPW D D Sbjct: 876 GKVLYLPWEWNVVHDCGGRVGNLFTQAPNDVYDAYVKSRSNPQIIHYAGYQKPWVDPDCD 935 Query: 280 YPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKKHRYLKGFSNYLFYFIEKI 337 S + +P+ + + +N+ + + L KH G N + F++ + Sbjct: 936 --YSSIYWRYARETPFYERLIKRVVLANEPQIPEEVFLPKHERAVGEDNPIRKFVDPL 991 >UniRef50_A7A7B4 Putative uncharacterized protein n=2 Tax=Bifidobacterium adolescentis L2-32 RepID=A7A7B4_BIFAD Length = 1009 Score = 210 bits (534), Expect = 7e-53, Method: Composition-based stats. Identities = 67/358 (18%), Positives = 127/358 (35%), Gaps = 31/358 (8%) Query: 1 MQQVFFQETEFLNSVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYN-EGSRL 59 +Q V F E + D T+ + + + D N++ ++ S +K Sbjct: 640 LQCVHFTNPEPAEQLKPLD---ITDKPIVPVVFAADDNYVPQLTTTVYSAMKNADPHYFY 696 Query: 60 CFHIFTDYFGDDDRKYFDALALQY-KTRIKIYLINGDRLRSLPSTK--NWTHAIYFRFVI 116 + D ++ Q+ ++ ++ + ST + + Y+RF+I Sbjct: 697 DVTVLQRNIAWDKQERLRGFFKQFPNMNLRFTNVDRELAGYDLSTNNAHISVETYYRFLI 756 Query: 117 ADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVT-EGQADWWEKRAHSLGV- 174 KVLYLD+DII G I L N + + + A+ K +G Sbjct: 757 QKVLPF-YDKVLYLDSDIIINGDIAKLYNIDLQGKMLGAIRDIDFLANLNVKHGKRMGYA 815 Query: 175 -----AGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLA 229 YF +G L++NT + + + P+ + DQDVLN Sbjct: 816 QTVLKMKNPYDYFQAGVLVLNTKAMREHYTIKQWLTYASNPD----FIYNDQDVLNAHCE 871 Query: 230 DKLIFADIKYNTQFSLNYQLKESFIN----------PVTNDTIFIHYIGPTKPWHDWAWD 279 +++ ++N ++ F+ ND +HY G KPW D D Sbjct: 872 GNVLYLPWEWNVVHDCGGRVGNLFVQAPNDIYDAYMKSRNDPQIVHYAGFQKPWTDPDCD 931 Query: 280 YPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKKHRYLKGFSNYLFYFIEKI 337 + + + +P+ L + +N+ A + KH G N + ++ + Sbjct: 932 --FASMYWKYARETPFYERLLKRVVKANESEIPAGVLRPKHERAVGEDNPIRKIVDPL 987 >UniRef50_Q5LF36 Putative glucosyltransferase n=1 Tax=Bacteroides fragilis NCTC 9343 RepID=Q5LF36_BACFN Length = 308 Score = 209 bits (533), Expect = 9e-53, Method: Composition-based stats. Identities = 73/300 (24%), Positives = 127/300 (42%), Gaps = 14/300 (4%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 +DI + D +++ CG++I S+ N + FHI T +R+ + +Y+ +I Sbjct: 1 MDIVHCIDNSYVAQCGVTITSVCVNNVNEVILFHILTTNLSIFNREMLKKIVDKYRQKII 60 Query: 89 IYLINGDRLRSLPST--KNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINF 146 Y ++ L P + + A YFR ++ D KVLYLD D++ I+ L + Sbjct: 61 FYNVDEYLLNKCPLREGDHVSLATYFRILMPDILPKSLNKVLYLDCDLVVCKNIKRLWDT 120 Query: 147 SFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAM 206 + V G D R ++ I +GYFN+G LL+N A W +S + + Sbjct: 121 DISTHSLGAVYDGGTDDI---RTYNRLKYDIRQGYFNAGVLLVNLAYWREFHISNKLLKF 177 Query: 207 LNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQ---LKESFI---NPVTND 260 + + +++ DQD LN +L KYN + + L+E ++ D Sbjct: 178 IEQY--PERLMFWDQDALNSVLIQTTKILPFKYNMLDAFYTKELALREEYLFEIEGALCD 235 Query: 261 TIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKKH 320 +H+ P KPW D+P+ F E + W + + P N + + K Sbjct: 236 PTILHFSSPNKPWLK-TCDHPLKSFFFEYLKRTSWNDKFPIYPFNMSLKSRLCLFLWNKG 294 >UniRef50_C0WCJ1 Lipopolysaccharide 1,2-glucosyltransferase n=1 Tax=Acidaminococcus sp. D21 RepID=C0WCJ1_9FIRM Length = 338 Score = 207 bits (527), Expect = 4e-52, Method: Composition-based stats. Identities = 80/333 (24%), Positives = 144/333 (43%), Gaps = 24/333 (7%) Query: 5 FFQETEFLNSVIDYD-HKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHI 63 +F FL V + + +T+ L +AY + + G S+ S+L+ N + FHI Sbjct: 8 YFVPARFLKGVETFSKNAEKTDKAPLHVAYNVNDGYFQIMGASLVSVLENNAHRAVMFHI 67 Query: 64 FTDYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSLPST-KNWTHAIYFRFVIADYFIN 122 FTD + ++ + + LA +Y IK+Y ++ + + ++ Y R V+ Sbjct: 68 FTDGYSKENAQKMEQLADRYGCVIKLYTLHMEPFADFHVKVERFSRITYGRIVMPLILAA 127 Query: 123 KAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYF 182 + LYLDAD + ++ L ++ + V + ++R L + YF Sbjct: 128 ETDHFLYLDADTMVIRPLDELYHWDLTGKAMGAVSE--RMPDAKRRGDYLHL--NNGRYF 183 Query: 183 NSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQ 242 N G +++N +W Q ++ +A ++ EP+ ++ QD+LN++ F YN Sbjct: 184 NDGVMMVNIPEWQKQNITEKAFSLQKEPK--ERFLGQSQDILNIVFDGTNAFLPSIYN-- 239 Query: 243 FSLNYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKN----T 298 + +P T IH+ G KPW DY + ASPW+ Sbjct: 240 -----EFGGGEDDPQQKGT-IIHWTGRRKPWQMVLSDYDAQ--WRSYNAASPWETLTAIL 291 Query: 299 ALLKPNNSNQLRYSAKHMLKKH--RYLKGFSNY 329 +LKP N + + AK+ K+ Y+KG + Y Sbjct: 292 PILKPENYHDFKEWAKYRRKESFRDYVKGMAYY 324 >UniRef50_Q38VK7 Putative bifunctional glycosyl transferase, family 8 n=2 Tax=Lactobacillus sakei subsp. sakei 23K RepID=Q38VK7_LACSS Length = 569 Score = 206 bits (523), Expect = 1e-51, Method: Composition-based stats. Identities = 64/297 (21%), Positives = 119/297 (40%), Gaps = 17/297 (5%) Query: 5 FFQETEFLNSVIDYDHKVETEN-LCLDIAYGTDKNFLFGCGISIASILKYN-EGSRLCFH 62 F E +++ Y ++ + ++I + NF+ I ASIL N + F Sbjct: 260 VFLEHPYMSEYQVYLSQLPADKRDQINIVSAANSNFVEPLAILYASILNNNDDDRHYAFF 319 Query: 63 IFTDYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFIN 122 + +D D+ + + + ++ L ++ Y+R +I + Sbjct: 320 VLSDQLTARDQATLRQITESFNAELTFIEVDEIPLTAVIQDGQVLKTAYYRLLIPNLLP- 378 Query: 123 KAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYF 182 + +VLYLD D +C + L + + VA V G + R + + + YF Sbjct: 379 EIERVLYLDCDTLCLENLARLWDVELGNIPVAAVEDAG----FHNRLAQMAIDYKSIRYF 434 Query: 183 NSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQ 242 N+G LL+N W Q+++ + + + E +K+ DQD LN +L D+ I K+N Q Sbjct: 435 NAGVLLMNLTIWRQQKITEQILTFIKEY--PQKLRFHDQDALNAILHDRWIHLHPKWNVQ 492 Query: 243 FSLNYQL-------KESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNA 292 S+ + IH+ G KPW D + +P + + K+ Sbjct: 493 TSILMDFIVAPTERINRQFLSAQKEPGLIHFCGSEKPW-DKSSTHPYTPQYRFYKSR 548 Score = 154 bits (390), Expect = 4e-36, Method: Composition-based stats. Identities = 53/279 (18%), Positives = 107/279 (38%), Gaps = 19/279 (6%) Query: 27 LCLDIAYGTDKNFLFGCGISIASILKYNE-GSRLCFHIFTDYFGDDDRKYFDALALQYKT 85 + I D+ + +++ SI ++ + + + + + + L Sbjct: 8 KTIAIMVAADEQYADQMLLTLKSIREHCTLETAIDLFVLSSDLSHATKSAVNRLMTLPH- 66 Query: 86 RIKIYLINGDRLRSLPSTKNWTHAIYFRFVIAD-YFINKAPKVLYLDADIICQGTIEPLI 144 + IN R+++ P ++ Y+R + +VLYLD D + + + PL Sbjct: 67 HVSFIAINPRRIKNFPGNNHFDQTAYYRILAPQILLARHIERVLYLDLDTLIRTDLTPLY 126 Query: 145 NFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAI 204 + + + V+ G+A ++ YFN+G L+I+T W +S + + Sbjct: 127 DSDLEGNIIGAVIDPGKALTLKRLGVPKS--QANNIYFNAGVLIIDTILWETHHISQKIL 184 Query: 205 AMLNEPEIIKKITHPD-QDVLNMLLADKLIFADIKYNTQF--------SLNYQLKESFIN 255 AML + D QD LN++LA + K+N Q +N + + F Sbjct: 185 AMLVPYPGRRV---NDIQDALNVVLAGRTKLLAPKWNVQNAILFKTYEPINNEYSQLFKQ 241 Query: 256 PVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASP 294 + IH+ KPW + ++P + + P Sbjct: 242 AIMA-PKIIHFTTEKKPW-EVFLEHPYMSEYQVYLSQLP 278 >UniRef50_A1BHG0 Glycosyl transferase, family 8 n=2 Tax=Chlorobium/Pelodictyon group RepID=A1BHG0_CHLPD Length = 307 Score = 205 bits (521), Expect = 2e-51, Method: Composition-based stats. Identities = 65/285 (22%), Positives = 116/285 (40%), Gaps = 20/285 (7%) Query: 24 TENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQY 83 ++I + TDKN++ ++ S+L+ N+ +I + + + + + Sbjct: 3 HMKNTVNIVFATDKNYIQHLSAALVSLLENNKDLSFTVYIISSGMSEKSYRNIEEIIKTG 62 Query: 84 KTRIKIYLINGDRLRSLPSTK-NWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEP 142 +K ++ + L + + Y+R +I D K+LYLD+DII G+I+ Sbjct: 63 NCTVKHITVSDELFVKLATAHPFYPKGTYYRLLIPDLI--DEEKILYLDSDIIVNGSIKE 120 Query: 143 LINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSAR 202 L N D V + G H YFNSG +LIN A+W + + + Sbjct: 121 LYNQDVEDYFVCAIEDPGFDR------HRQLQMDKESIYFNSGMMLINLAKWKSTGLQKK 174 Query: 203 AIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFIN------- 255 I + I PDQ LN ++ + +KYN Q S+ E + Sbjct: 175 VIDFIEHN--PDAIWFPDQCGLNSVINGRWKKVPLKYNQQSSIFSDDFEKKFDCFSVEEL 232 Query: 256 -PVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTA 299 + + IHY G +KPWH +P + + + +P++N Sbjct: 233 AEAKKNPVIIHYTGGSKPWHFKN-RHPYKKLYWKYLKMTPYRNAI 276 >UniRef50_D1P7H1 Lipopolysaccharide 1,2-glucosyltransferase n=1 Tax=Providencia rustigianii DSM 4541 RepID=D1P7H1_9ENTR Length = 324 Score = 204 bits (519), Expect = 4e-51, Method: Composition-based stats. Identities = 110/334 (32%), Positives = 171/334 (51%), Gaps = 14/334 (4%) Query: 8 ETEFLNSVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDY 67 + + S+I + EN I YG D+ FL+G G SIAS++ N+ + FHIF D Sbjct: 3 NNDMIKSLIKINDNERHENSYFHIGYGVDEKFLYGVGTSIASVMLNNKDTDFHFHIFVDN 62 Query: 68 FGDDDRKYFDALALQYKTRIKIYLINGDRLRSLP-STKNWTHAIYFRFVIADYFINKAPK 126 D++ F +I IY I+ ++ + LP +K W+HAIYFR +I Y + Sbjct: 63 LPDEN--LFREAVQGTSHKITIYFIDNEKFKLLPLPSKAWSHAIYFRLLIISYLSSSIDS 120 Query: 127 VLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGF 186 +LYLDADIIC+G + L +F + V + + + + YFNSGF Sbjct: 121 LLYLDADIICKGDLSELKALTFDEKTFVYAVKD------KFCSEKQNLPIDMSKYFNSGF 174 Query: 187 LLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLN 246 L ++ A + + R I ++ + + +HPDQD LN+LL DKLI YN FSL+ Sbjct: 175 LYMSLKHLAQENIPNRVIELVEKND----FSHPDQDALNVLLNDKLINISENYNYMFSLD 230 Query: 247 YQL-KESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNN 305 + + + + + + +FIH++G TKP+H+WA Y + A+ SPWKN LLKP Sbjct: 231 WYITSKGHLAKIPDSVVFIHFVGLTKPFHEWASFYEEYKYLESARKNSPWKNIPLLKPEG 290 Query: 306 SNQLRYSAKHMLKKHRYLKGFSNYLFYFIEKIKH 339 QL H+ K +Y++ + Y ++K H Sbjct: 291 YKQLSRKKSHLRKNGKYVEFIFTTIQYLMKKTFH 324 >UniRef50_Q03HK5 Lipopolysaccharide biosynthesis glycosyltransferase n=1 Tax=Pediococcus pentosaceus ATCC 25745 RepID=Q03HK5_PEDPA Length = 549 Score = 204 bits (518), Expect = 5e-51, Method: Composition-based stats. Identities = 62/295 (21%), Positives = 120/295 (40%), Gaps = 16/295 (5%) Query: 6 FQETEFLNSVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFT 65 E +L+ + +++E +++ + F+ S SIL+ + ++ F++ Sbjct: 256 LSEHPYLDEYHEELNELEINRGVVNVISAANSAFVEALATSYISILENDSENQYNFYLLP 315 Query: 66 DYFGDDDRKYFDALALQY-KTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKA 124 D+ D ++ +Y IKI ++ L + + + Y+R + + Sbjct: 316 DHLDQRDMLILGSVISRYDNASIKIVKVDEKLLENAVESDRILKSAYYRILAPELLP-NI 374 Query: 125 PKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNS 184 + +YLD DII + L S + +A V G + R +G+ YFNS Sbjct: 375 NRAIYLDCDIIANTNLHDLWQTSLEGNVLAAVEDAG----FHDRLEHMGITHDNSKYFNS 430 Query: 185 GFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFS 244 G +LI+ W +Q V+ R + + +K+ DQD LN +L DK + K+N Q + Sbjct: 431 GMMLIDLVSWRSQAVTQRVLDYI--NHNPEKLRFHDQDALNAILYDKWLHLHPKWNAQSN 488 Query: 245 L-------NYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNA 292 + + IH+ G KPWH + +P + +++ Sbjct: 489 IVLDALVPPRTELLKLYAETRENPKLIHFCGHVKPWHAES-KHPYTNVYLKYNKK 542 Score = 178 bits (451), Expect = 3e-43, Method: Composition-based stats. Identities = 55/275 (20%), Positives = 110/275 (40%), Gaps = 14/275 (5%) Query: 27 LCLDIAYGTDKNFLFGCGISIASILKY-NEGSRLCFHIFTDYFGDDDRKYFDALALQYKT 85 +++ D+N+ I+I + L+ N+ +R+ F + ++ + + LA Sbjct: 2 KNINVLLAADENYADQLQITIKTTLENLNKKTRVNFIVLSNNLSNSTKLALKKLAHGLH- 60 Query: 86 RIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFIN-KAPKVLYLDADIICQGTIEPLI 144 ++ ++ P+ + Y+R + ++LYLD D++ + + L Sbjct: 61 TVEYLDLDPSVFAFCPTNSHINKTAYYRILAPQLLAKRNIDRILYLDVDLLVRHDLTELY 120 Query: 145 NFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAI 204 + + V V+ GQA + R V YFNSG L+I+ +W ++ + + Sbjct: 121 DAELNHNIVGAVIDTGQA-FALNRLGVDPVVAANNIYFNSGILVIDIKKWNENHITEKTL 179 Query: 205 AMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLK-------ESFINPV 257 + I DQD LN +LA + K+N Q S+ ++ + IN Sbjct: 180 NYIK--HQSHLIIFHDQDALNAVLAGHVQMLHPKWNLQNSIVFRKHRPINEAYDQLINEA 237 Query: 258 TNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNA 292 +H+ KPW + ++P + E N Sbjct: 238 IKSPAIVHFTTHEKPWKTLS-EHPYLDEYHEELNE 271 >UniRef50_A6LGX5 Glycosyltransferase family 8 n=1 Tax=Parabacteroides distasonis ATCC 8503 RepID=A6LGX5_PARD8 Length = 325 Score = 202 bits (513), Expect = 2e-50, Method: Composition-based stats. Identities = 72/327 (22%), Positives = 133/327 (40%), Gaps = 25/327 (7%) Query: 30 DIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKI 89 DI +D N+L I S+ + N L FH+ ++ D K + Y+ ++ + Sbjct: 3 DIVVASDCNYLHLVSICAVSLFETNSSESLHFHLLSNGIDSADIKNLQTIVEGYRGKLSV 62 Query: 90 YLINGDRLRSL-PSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSF 148 Y I R R + + + Y R KVLY+D DII G+I L N Sbjct: 63 YPIENLRERLMTDVPETISLTSYARLFAGSILPANLDKVLYIDCDIIFNGSIRDLFNTDL 122 Query: 149 PDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLN 208 + V ++ + ++K +++ Y N+G L+I +W ++ + + + L Sbjct: 123 GNCLVGGILDPLISRTYKKEIK----IPMSEPYINAGVLIIPLNRWRSEGMEQKFVDFLV 178 Query: 209 EPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQF-SLNYQLKESF-----------INP 256 K+ H DQ ++N + A + ++N SL Y K+ + Sbjct: 179 ANR--GKVHHHDQGIINAVCAGRKKILPPQFNVMSNSLCYPWKDLYKINTPFYDQEEYKK 236 Query: 257 VTNDTIFIHYIG--PTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAK 314 + IH+ G +PW +P + F++ K + +K+ L KPNN + + + Sbjct: 237 GISSPAIIHFTGAIHGRPW-IVGCTHPYANKFLQFKAKTAYKDIPL-KPNNQSAALHRLE 294 Query: 315 HMLKKHRYLKGFSNYLF--YFIEKIKH 339 +L + F Y+ Y++ KH Sbjct: 295 GILYRLLPFSLFKRYMQSVYYLSYFKH 321 >UniRef50_C2HBB9 Family 8 glycosyltransferase n=35 Tax=Lactobacillales RepID=C2HBB9_ENTFC Length = 305 Score = 202 bits (513), Expect = 2e-50, Method: Composition-based stats. Identities = 59/283 (20%), Positives = 116/283 (40%), Gaps = 19/283 (6%) Query: 22 VETENLCLDIAYGTDKNFLFGCGISIASILKY-NEGSRLCFHIFTDYFGDDDRKYFDALA 80 +E + + +D+N+ + IA+ L+ N+ R+ F++ D + ++ + Sbjct: 21 MEKRYGVVPVVTASDENYAPYLSVMIATALENCNKARRIKFYVIDDGLSEYSKQGLEETV 80 Query: 81 LQY--KTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYF-INKAPKVLYLDADIICQ 137 +Y I+ + D + + T Y R + + KVLYLD+D++ Sbjct: 81 NKYSSNASIQFLTVEKDIYEDFLVSDHITTTAYLRISLPNLLAKEDYKKVLYLDSDVLVL 140 Query: 138 GTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQ 197 I L + + ++ GQ E+ YFNSG ++I+ QW + Sbjct: 141 DDIVKLYDEPLNGKTIGAIIDPGQVKALERLG-----IDSDDLYFNSGVMVIDIDQWNKK 195 Query: 198 QVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKES----- 252 +++ + I L+E +I + DQD LN +L + K+N Q SL ++ + Sbjct: 196 EITEKTIHYLSEN--GDRIIYHDQDALNAVLYEDWEQLHPKWNMQTSLIFERHPAPNEKY 253 Query: 253 --FINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNAS 293 +H+ G KPW+ D+P + +++ S Sbjct: 254 ERLYKEGNEKPSIVHFTGHDKPWNTLK-DHPYTNLYLKKLAHS 295 >UniRef50_Q64ZV2 Lipopolysaccharide 1,2-glucosyltransferase n=8 Tax=Bacteroides RepID=Q64ZV2_BACFR Length = 311 Score = 200 bits (509), Expect = 5e-50, Method: Composition-based stats. Identities = 68/313 (21%), Positives = 118/313 (37%), Gaps = 16/313 (5%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 + IA D NF C +++ S+ N S C HI + D+K ++A Y +I Sbjct: 2 IHIACNIDSNFTIHCAVTLTSLFANNRNSEFCVHIIASTLPEADQKALSSIAESYGNKIC 61 Query: 89 IYLINGDRLRSLPSTK---NWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLIN 145 Y D L + K + A Y+R +++ K+LY+D DI+ I + Sbjct: 62 FYFPEKDLLNNFSIKKSGNRISIATYYRCLLSRILPVNIDKILYIDCDIVVLNDISEFWD 121 Query: 146 FSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIA 205 + + G E+ +S YFN+G LLIN W ++ Sbjct: 122 TDITQYAIGCIEDIGSD---EEEYYSRLQYDKKYSYFNAGVLLINLKYWREHKIDEMCEQ 178 Query: 206 MLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLK------ESFINPVTN 259 +I DQD+LN LL +F ++N Q + + S + Sbjct: 179 YFLAHS--DRIRFNDQDLLNALLYKDKLFVPFRWNVQDTFYRRTYSHKVKEHSGLKEALL 236 Query: 260 DTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKK 319 +HY KPW+ + +P+ Q + + + +PWK T + + + + + Sbjct: 237 HPAILHYT-NKKPWN-YDSMHPLKQEYFKYLDMTPWKGTRPIIDFQTRVITGFKRLLYIT 294 Query: 320 HRYLKGFSNYLFY 332 + N Y Sbjct: 295 GIKKSKYINLKDY 307 >UniRef50_C2HBB8 Family 8 glycosyltransferase n=13 Tax=Lactobacillales RepID=C2HBB8_ENTFC Length = 300 Score = 198 bits (504), Expect = 2e-49, Method: Composition-based stats. Identities = 55/284 (19%), Positives = 113/284 (39%), Gaps = 18/284 (6%) Query: 24 TENLCLDIAYGTDKNFLFGCGISIASILKY-NEGSRLCFHIFTDYFGDDDRKYFDALAL- 81 + + + F+ S+L N + F++ D + ++ Sbjct: 1 MNKKEIAVVASCNTKFVPHLAALFVSVLDNCNPSKFVRFYVIDDDIDFESKQLLRFSVKN 60 Query: 82 -QYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFIN-KAPKVLYLDADIICQGT 139 + + ++ IN + ++ + Y+R I + F + ++LY+D D+I Sbjct: 61 ARMNSDVEFLKINKEFFTNVVISDRIPETAYYRIAIPELFRGTEVERILYMDCDMIALQD 120 Query: 140 IEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQV 199 I L F D VA V G + +R + + + YFNSG +LIN +W + + Sbjct: 121 ISKLWRLDFGDSIVAAVEDAG----FHQRLEKMEIPAKSMRYFNSGLMLINVKKWLDENI 176 Query: 200 SARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLK-------ES 252 + + + + +K+ DQD LN +L D+ + ++N Q + + K E Sbjct: 177 TQKVLDFIEHN--PEKLRFHDQDALNAILHDRWLPLHPRWNAQGYIMAKAKKHPTAAGER 234 Query: 253 FINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWK 296 N+ IH+ G KPW ++ P + + + + ++ Sbjct: 235 EYEETRNNPYIIHFSGHVKPW-SKDFEGPTKKYYEKYAGMTAFR 277 >UniRef50_A8ARL4 Putative uncharacterized protein n=2 Tax=Citrobacter RepID=A8ARL4_CITK8 Length = 314 Score = 197 bits (500), Expect = 5e-49, Method: Composition-based stats. Identities = 80/320 (25%), Positives = 138/320 (43%), Gaps = 11/320 (3%) Query: 22 VETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALAL 81 ++ + ++IAY TD N+L +SI S++ N L F +F D+D + + Sbjct: 1 MDNKTNVINIAYCTDANYLEYVAVSIMSVIMNNPEQSLAFFVFVYDVSDEDIAKLQSTSN 60 Query: 82 QYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIE 141 + + I I + ++ + + K+ + Y R + +K + +YLDAD +C ++ Sbjct: 61 KIQV-ITIDKADIEKYNNDFAIKHLNRSTYMRLAVPRLLKDKVARFIYLDADTLCFDSLS 119 Query: 142 PLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSA 201 + + D+ V V + K A LG++ YFN+GFL IN A W + Sbjct: 120 EINSVDI-DNVVCAVSHDSLNIHDNKHARRLGLS--IDHYFNAGFLYINVANWIKHDIEH 176 Query: 202 RAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNY-QLKESFINPVTND 260 +A +L E K + + DQD LN+ + + F D ++N F+ + KE+F Sbjct: 177 KANTVL--FEQGKSLPYFDQDALNIAMNGNITFIDNRWNFLFNWFTDEQKENFFYHSDTL 234 Query: 261 TIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKP---NNSNQLRYSAKHML 317 IH+ G KPW+ Q ++ + +PW+N L R ++ Sbjct: 235 PRIIHFTGGRKPWYKEHTGL-SQQLYVFYHHFTPWRNAELRSYAPRMRPTDYRVYSRQAA 293 Query: 318 KKHRYLKGFSNYLFYFIEKI 337 KK Y Y Y KI Sbjct: 294 KKGNYFTAIKWYAKYLKTKI 313 >UniRef50_A7VNX5 Putative uncharacterized protein n=1 Tax=Clostridium leptum DSM 753 RepID=A7VNX5_9CLOT Length = 344 Score = 197 bits (500), Expect = 6e-49, Method: Composition-based stats. Identities = 55/309 (17%), Positives = 119/309 (38%), Gaps = 28/309 (9%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEG-SRLCFHIFTDYFGDDDRKYFDALALQYKTRI 87 ++ + +D NF G ++ S+ + N + +I + +++ +++ QY+ + Sbjct: 13 MNCVFSSDDNFADILGCALISLFENNREQETIEVYILDGGISEGNKRKLESIFQQYERMV 72 Query: 88 KIYLING--DRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLIN 145 + ++ W + + R +I + +VLYLD DI+ G+++ L Sbjct: 73 HFIEVPDISQLTGEAVTSGRWPISTFARILIDSLLPKEVKRVLYLDCDILVLGSLKNLWE 132 Query: 146 FSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIA 205 D A V+ +R + G+ G Y N+G +LI+ +W Q+ + + Sbjct: 133 IDLKDKTAAGVMDCLS----NQRKQNAGING-EDSYINAGVMLIDMDKWRENQIEKQCMN 187 Query: 206 MLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSL---------NYQLKESFI-- 254 + ++ + DQ V+N +L L+ +YN Y+ +S+ Sbjct: 188 YIRICN--GQVAYNDQGVINKVLHKDLLVLPPEYNAMTLFFDFTYPDMIKYRKPQSYYSA 245 Query: 255 ---NPVTNDTIFIHYIGPT---KPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQ 308 + +H+ +PW ++P + + SPW+ L N S+ Sbjct: 246 QQVDHARKHPRIVHFTSSFLSLRPWVK-GSEHPYAPLWRNYYKRSPWRAKDLRSDNRSSY 304 Query: 309 LRYSAKHML 317 + K Sbjct: 305 RKIYEKFYR 313 >UniRef50_C5ZV11 Glycosyl transferase n=2 Tax=Helicobacter canadensis MIT 98-5491 RepID=C5ZV11_9HELI Length = 397 Score = 196 bits (499), Expect = 7e-49, Method: Composition-based stats. Identities = 66/339 (19%), Positives = 126/339 (37%), Gaps = 36/339 (10%) Query: 30 DIAYGTDKNFLFGCGISIASILKYNEGSR---LCFHIFTDYFGDDDRKYFD----ALALQ 82 ++ ++N++ + I SI++ + S FH+ D ++ K + L+ Sbjct: 3 NVVLNLNENYVPYAAVLITSIIQNTQSSGGGGYNFHLLMDSISQENTKNLENLISELSKI 62 Query: 83 YKTRIKIYLINGDRLRSL-PSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIE 141 Y + IY+++ R T N + Y+R I + +YLD D+I G + Sbjct: 63 YPCTLTIYILDDQLFREYSMPTLNGNYLAYYRLKIGSALPLSIKRCVYLDVDMIVLGDLR 122 Query: 142 PLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSA 201 L +V+ ++ + + I YFNSG LL++ W + + Sbjct: 123 ELFEVDLQGKICGVVMEHHSQKIYKPKNQAYKPINITGSYFNSGMLLVDLDLWRQENIED 182 Query: 202 RAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQ------------- 248 RA + + DQD+LN++L+ K I++N + Y+ Sbjct: 183 RAFEIGKNYH----YSFHDQDILNIVLSGKTHKVGIEWNLMVCVYYRAICKDEKGRDKLP 238 Query: 249 LKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVS-----QAFMEAKNASP-WKNTALL- 301 N + +HY TKPW++ Q + + + +P +K L Sbjct: 239 YYRKDFNSALRNPKILHYFTHTKPWNNAKIYLDYHNKFLDQYWWDMVDQTPIFKEKLLQL 298 Query: 302 KPNNSNQLRYS----AKHMLKKHRYLKGFSNYLFYFIEK 336 KP + L + K + + L + Y + K Sbjct: 299 KPQADSALAFQCLVGYKLLRYYQKGLFALIPFYTYSLIK 337 >UniRef50_B7AIV0 Putative uncharacterized protein n=1 Tax=Bacteroides eggerthii DSM 20697 RepID=B7AIV0_9BACE Length = 321 Score = 196 bits (499), Expect = 7e-49, Method: Composition-based stats. Identities = 70/327 (21%), Positives = 125/327 (38%), Gaps = 16/327 (4%) Query: 15 VIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRK 74 + + + + + I + + C +IASI N+ + H+ TDY ++ Sbjct: 1 MYNTTNIPSIKTKAIHIVVCINDAYSQHCAATIASIFINNKNEVIKIHVITDYISKKNQS 60 Query: 75 YFDALALQYKTRIKIYLINGDRLRSLPSTK-----NWTHAIYFRFVIADYFINKAPKVLY 129 + +A + +I+ Y N L P K + T Y+R I K Y Sbjct: 61 RLEKIAFNFNQQIQFYTFNNSTLNRWPCFKDGMPPHVTIQTYYRLFIPQILPLNIKKTFY 120 Query: 130 LDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLI 189 LD D++ + N + VA + + A + + YFN+G LL+ Sbjct: 121 LDCDLLVLHPLREFWNTKMQNKGVAAIADQWTDYIE---AATRLKYRNDREYFNAGVLLL 177 Query: 190 NTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNT----QFSL 245 N + AI + + I + DQDVLN L+ + I +K+N Sbjct: 178 NLEYLRNHNFTNNAIDFVTKH--ANDIVYHDQDVLNKLIGENRIIMPVKWNVCSFKINDK 235 Query: 246 NYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNN 305 + + +N D IH+ P KPW+ + +P + +PWK+ + Sbjct: 236 IPHIYNATMNDARKDPYIIHFFAPIKPWNQDSS-HPYRSYYYYFLQFTPWKHEVKCHYSL 294 Query: 306 SNQLR-YSAKHMLKKHRYLKGFSNYLF 331 N +R + K L+K +Y +Y+ Sbjct: 295 KNTIRTFLIKIGLRKSQYAIAPQSYMK 321 >UniRef50_C0QZN2 Glycosyl transferase, family 8 n=3 Tax=Brachyspira RepID=C0QZN2_BRAHW Length = 339 Score = 195 bits (495), Expect = 2e-48, Method: Composition-based stats. Identities = 69/308 (22%), Positives = 121/308 (39%), Gaps = 21/308 (6%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNE-GSRLCFHIFTDYFGDDDRKYFDALALQYKTRI 87 ++I +D N+ G +IASILK + ++ FH+ ++++ +L + I Sbjct: 1 MNICLASDNNYAPYMGTAIASILKNSSEDEKIIFHLIDGGITKENKEKIISLKNIKECEI 60 Query: 88 KIYLINGD----RLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPL 143 Y + +++ A+++R IA + K+LYLD+D+I G+++ L Sbjct: 61 NFYTPDIKMYDGWFEKTSCKAHFSAAMFYRLSIASIIPSNIDKILYLDSDLIATGSLKEL 120 Query: 144 INFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARA 203 + ++ + YFNSG LLIN W + + Sbjct: 121 FLMDIENHYAIVIKH-------STNEKNKWSIDGINDYFNSGVLLINNKLWIKNNIEDQF 173 Query: 204 IAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIF 263 N K DQDVLN +L K+ +AD++YN Y E+ + I Sbjct: 174 NKFYNNNY---KTCFGDQDVLNNVLIGKVKYADMRYNVYAEKGYYNTEND----IENPII 226 Query: 264 IHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKH--MLKKHR 321 IHY+ P KPW + F +PW + + + + + K Sbjct: 227 IHYLSPEKPWKENCRGTLFIDEFWRYYQYTPWFRDEPITAFQTILKQKFYDYDDVRLKGN 286 Query: 322 YLKGFSNY 329 ++K F Y Sbjct: 287 WIKLFGIY 294 >UniRef50_C5ELK9 Glycosyl transferase n=3 Tax=Clostridiales RepID=C5ELK9_9FIRM Length = 333 Score = 194 bits (493), Expect = 4e-48, Method: Composition-based stats. Identities = 60/332 (18%), Positives = 115/332 (34%), Gaps = 30/332 (9%) Query: 22 VETENLCLDIAYGTDKNFLFGCGISIASILKYNEG-SRLCFHIFTDYFGDDDRKYFDALA 80 +E +I Y ++ + S+ S+L N + +I + + ++ +A Sbjct: 1 MEWNEETANIIYASNDGYAGHLAASMYSLLDNNRNVRNMDIYILSAQMCQEYKERLAGMA 60 Query: 81 LQYKTRIKIYLING--DRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQG 138 + + + + R T+ + + R K LYLD D I Sbjct: 61 EAFHRTLHVVELGDLKQRFDFDIDTRGFDISAMGRLFAPQVLPGTVKKALYLDCDTIVCK 120 Query: 139 TIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQ 198 +I PL D V MV + +++ S+G+ G Y+NSG LL+ +W + Sbjct: 121 SIRPLYETELGDAVVGMV---MEPTVYKEMKESIGM-GKDDPYYNSGVLLMALDRWRQED 176 Query: 199 VSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNY----------- 247 V + + ++ DQD +N L ++ +KYN + Y Sbjct: 177 VLQKLLDFYKSCH--GRLFACDQDTINGALKGRIKTLPVKYNYFTNYRYFRYSTLCSMCA 234 Query: 248 ---QLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPN 304 ++ E IHY+G +PW ++ + + +PWK+T Sbjct: 235 AYREIGEEAYLEARRSPAIIHYLGDERPWIAGNHNH-FKKLYEYYLAKTPWKDTP----- 288 Query: 305 NSNQLRYSAKHMLKKHRYLKGFSNYLFYFIEK 336 + HM L ++ + Sbjct: 289 -KQTGKERYMHMWWLFNRLTWLCPPFRLWVSR 319 >UniRef50_B3Q568 Putative glycosyltransferase protein n=4 Tax=Rhizobium etli RepID=B3Q568_RHIE6 Length = 331 Score = 193 bits (491), Expect = 6e-48, Method: Composition-based stats. Identities = 55/307 (17%), Positives = 112/307 (36%), Gaps = 13/307 (4%) Query: 30 DIAYGTDKNFLFGCGISIASILKYNEGS-RLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 I + D + ++ S+ + N+ L H+ + G++ ++ I+ Sbjct: 21 PIVFAVDAAYAVPLATALRSVAENNQSVWPLDIHVIHEGIGEETKRLILESLPANSAIIQ 80 Query: 89 IYLINGDRL-RSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 + I + + + R ++ + + LYLD DI+ ++E L N Sbjct: 81 WHPIATLSFASGFSTRPGVSKMTFARILLPQFLPQTCDRALYLDGDILVLTSLEQLWNTD 140 Query: 148 FPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAML 207 + + V + + G + K YFN+G LLI+ A+W +++S R++ L Sbjct: 141 LGEAVIGAVPDYWLDNPAGSGPGARGG-ALVKRYFNAGILLIDLAKWRNERISERSLDYL 199 Query: 208 NEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYI 267 + + + DQD LN+ K D +N Q + + I + +H++ Sbjct: 200 DRFPTTE---YSDQDALNVACDGKWKILDRAWNFQ--FEPRQAIAGI-ALEQKAAIVHFV 253 Query: 268 GPTKPWH--DWAWDYPVSQAFME--AKNASPWKNTALLKPNNSNQLRYSAKHMLKKHRYL 323 KPW + + AF +PW ++L + + Y Sbjct: 254 TNVKPWKSGSLSPNVAFYDAFRSRTCFALTPWGRVRSGLKRTGSRLLARSALLRTAWSYT 313 Query: 324 KGFSNYL 330 K + Sbjct: 314 KSAVRAI 320 >UniRef50_UPI0001A457E5 hypothetical protein NsubN_08151 n=1 Tax=Neisseria subflava NJ9703 RepID=UPI0001A457E5 Length = 345 Score = 193 bits (491), Expect = 7e-48, Method: Composition-based stats. Identities = 80/328 (24%), Positives = 139/328 (42%), Gaps = 19/328 (5%) Query: 20 HKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDAL 79 K E I Y D+N++ G ++ S+L+ N S + FH+ D FD + Sbjct: 15 RKREFIKQPKHIVYAADQNYIKHIGTALLSVLQNNT-SPIHFHLLVSGSEGYDFNIFDQI 73 Query: 80 -ALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQG 138 I +Y +N + +L +T +T A+Y+R I LYLD D++C G Sbjct: 74 ETSNQNYAISVYHLNTEYFSTLQTTHYFTIAMYYRMSIPCLLKGITHTALYLDTDVLCLG 133 Query: 139 TIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQ 198 I+ L + +A V + K+ + G + YFNSG +L N +W Sbjct: 134 NIDDLFEIDISNSLIAAVPDAILYRAYIKQLNQFGFTD-TEPYFNSGVILFNIDKWNDMA 192 Query: 199 VSARAIAMLNEPEIIK-KITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPV 257 + + E K++ PDQD+LN+ + + +N +++ K S + Sbjct: 193 IDKILSEKMQAVEKQNFKLSCPDQDILNLACIGHVHWLSENFNW---IHWHQKYSELIDN 249 Query: 258 TNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLK--------PNNSNQL 309 N+ +H++G KPWH A+ + SPW N L + PN + Sbjct: 250 PNNIRLVHFVGHIKPWHQLG----FHPAYDQYFKNSPWNNGYLEQPLSTWLPFPNPKRKF 305 Query: 310 RYSAKHMLKKHRYLKGFSNYLFYFIEKI 337 R +AK + K+ + + ++ Y Y + +I Sbjct: 306 RQAAKRLWKQGQKKQAWAYYREYLLRRI 333 >UniRef50_Q2K5X3 Galactosyltransferase protein n=13 Tax=Rhizobium RepID=Q2K5X3_RHIEC Length = 333 Score = 193 bits (490), Expect = 8e-48, Method: Composition-based stats. Identities = 56/308 (18%), Positives = 113/308 (36%), Gaps = 16/308 (5%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIY 90 + +D N L ++ S+ + + + F + + + + I++ Sbjct: 38 VIVCSDVNMLPAACCTLLSVKRNLTNADVEFLLLGIDLKPHEVAEVENFGRLHGMAIRVL 97 Query: 91 LINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPD 150 L + W+ A R + + ++LYLDAD++ ++ L F Sbjct: 98 PYETPD-TGLQARGRWSAATLARLYMDRDIPDHIERLLYLDADVLAVAPVDELFTLDFQG 156 Query: 151 DKVAMVVTEGQADWWEKRAHSLGV-AGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNE 209 +A V A + A + G YFN+G LL + + A+ + R + E Sbjct: 157 KALAAVDDYVMAFPEKSGARQRKIGMGEGGRYFNAGVLLFDWSACRARGLFPRTREIFKE 216 Query: 210 PEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIGP 269 + + DQD LN+ + D ++NTQ L P + H+ G Sbjct: 217 RSHL--FENNDQDALNVTFDGDWLVLDPRWNTQTGLL---------PFVDRPAIFHFTGR 265 Query: 270 TKPW--HDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKKHRYLKGFS 327 KPW + ++ + + +PW + +P+ ++++ H+ K+ L + Sbjct: 266 KKPWQANVPWVHRRMANRYADDLRNTPWAS-FCRQPSRTDRVAGFLSHVGKQIGGLTRLA 324 Query: 328 NYLFYFIE 335 YF Sbjct: 325 RMRAYFSN 332 >UniRef50_C3PWZ8 Lipopolysaccharide 1,2-glucosyltransferase n=1 Tax=Bacteroides sp. 9_1_42FAA RepID=C3PWZ8_9BACE Length = 315 Score = 192 bits (489), Expect = 1e-47, Method: Composition-based stats. Identities = 74/319 (23%), Positives = 138/319 (43%), Gaps = 21/319 (6%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 + IA D F+ C ++I SIL+ N+ + HI + + +D +A +Y T I Sbjct: 1 MHIALTIDSKFVRYCAVTIVSILENNDPKDIMLHIVSGHLPKEDVLTLSQVAEKYGTSIA 60 Query: 89 IYLINGDRLRSL---PSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLIN 145 Y I ++L++ + + +++R V+A + +V+YLD+D + G+++ L + Sbjct: 61 FYYIPHEKLQNYEVKWQKQRLSMVVFYRCVLASILPSTISRVIYLDSDTLVLGSLKELWD 120 Query: 146 FSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIA 205 + +A V + + Y N G LL+N A W + + I Sbjct: 121 TNLNQLALAGVQDTVSPNP---SYFERLQYAPSYNYINGGVLLLNLAYWRKHNIEQQCIK 177 Query: 206 MLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLK------ESFINPVTN 259 + +I DQD+LN LL D+ + DIK+N Q + + Sbjct: 178 YYQQY--PDRIILNDQDILNALLYDQKVLIDIKWNVQDDFYRNNRYTSPAWKPSYTDAIL 235 Query: 260 DTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKK 319 I +HY G KPW + +P+ F + +P+ ++A K ++ R+ H+L Sbjct: 236 HPIILHYSGR-KPW-AYHAMHPLRHLFFHYQRLTPYDDSAKQKKISTRIYRFI--HLLP- 290 Query: 320 HRYLKGFSNYLFYFIEKIK 338 Y+ G + ++KI+ Sbjct: 291 --YILGLKPKKYVNLKKIR 307 >UniRef50_D2RIJ7 Glycosyl transferase family 8 n=1 Tax=Acidaminococcus fermentans DSM 20731 RepID=D2RIJ7_ACIFE Length = 330 Score = 191 bits (486), Expect = 2e-47, Method: Composition-based stats. Identities = 71/342 (20%), Positives = 130/342 (38%), Gaps = 29/342 (8%) Query: 5 FFQETEF--LNSVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFH 62 +FQ + +Y + L I + F G+ + SI + N+ L FH Sbjct: 9 YFQTHKLYLTKDSFEYMTAENKKKDILHICCNVNDLFFKPAGVLLTSICENNKDLALNFH 68 Query: 63 IFTDYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSLPST-KNWTHAIYFRFVIADYFI 121 +F D D++++ A +Y +Y ++ ++ K ++ Y R V+ Sbjct: 69 VFVDSCSDENKENLRKTAEKYGCNAYLYKMDMSIYQNFHIKVKRFSRVTYIRIVMPWVLR 128 Query: 122 NKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGY 181 N + LYLDAD++C ++ N+ D V +V + + Y Sbjct: 129 NVTNRYLYLDADMVCVKSLRVFFNYDLKDKAVGALVYDTPERIAFLKMK-------GNVY 181 Query: 182 FNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNT 241 F+ G + IN +W Q+V+ R + + + QD++N++L + ++ Sbjct: 182 FSDGLMWINVDEWIKQRVTERVFSY--QGADPARFKGQTQDLMNLVLDGNVQPIPALFHH 239 Query: 242 QFSLNYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALL 301 + D I IHY G KPW + + + + SPW + Sbjct: 240 M-----------DKDFSVDGILIHYSGRDKPWEIVLDEDD--ELWRHYLDISPWPSMPNP 286 Query: 302 KPNNSNQLRYSAKHM----LKKHRYLKGFSNYLFYFIEKIKH 339 P +S K + KK +LK +Y I KI++ Sbjct: 287 MPPKRPIYYHSFKKLAQVYSKKGNHLKELECLFWYGILKIRY 328 >UniRef50_C0Z4I4 Putative uncharacterized protein gspA n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0Z4I4_BREBN Length = 264 Score = 191 bits (486), Expect = 2e-47, Method: Composition-based stats. Identities = 56/268 (20%), Positives = 106/268 (39%), Gaps = 14/268 (5%) Query: 27 LCLDIAYGTDKNFLFGCGISIASILKYN-EGSRLCFHIFTDYFGDDDRKYFDALALQYKT 85 + I + F + + S+ + + + H+ +++ ++ Sbjct: 2 GTIHIVTAVNDGFAIHLAVMLYSLFENKVSKNPVIVHVIDSQVSGENKSILTKTVKRFHA 61 Query: 86 RIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLIN 145 +IK I+ + T Y R I D + KV+YLD+DI+ + I PL N Sbjct: 62 QIKYVTIDPTLYDGFLVRDHLTQETYHRISIPDLLDKEVEKVIYLDSDIVIKKDITPLWN 121 Query: 146 FSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIA 205 +A V+ K H+ YFN+G L++N +W ++ + + Sbjct: 122 TKVDQYYLAAVMD--SWQGLNKLRHADLAIPDDCDYFNAGVLVMNLKKWREHNITKKIMD 179 Query: 206 MLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIH 265 + + + I +P QD +N +L D + D K+N YQ K + + + D IH Sbjct: 180 YMKKNQ--GIIRYPSQDPMNAILHDNWLQLDTKWN------YQSKHLYKSNLRIDPAIIH 231 Query: 266 YIGPT-KPWHDWAWDYPVSQAFMEAKNA 292 Y G KPW + +P+ + + + Sbjct: 232 YTGEDSKPW--LSKKHPLREEYFKYLKK 257 >UniRef50_D2LIH7 Glycosyl transferase family 8 n=1 Tax=Rhodomicrobium vannielii ATCC 17100 RepID=D2LIH7_RHOVA Length = 391 Score = 191 bits (486), Expect = 2e-47, Method: Composition-based stats. Identities = 62/347 (17%), Positives = 123/347 (35%), Gaps = 33/347 (9%) Query: 17 DYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSR-LCFHIFTDYFGDDDRKY 75 + + + ++ ++ G IASI ++ +R +F D +DR Sbjct: 32 TIEPAFPGAATAVPVVMCFNRRYMPGGAALIASIAEHASPNRLYDLIVFADDLASEDRDM 91 Query: 76 FDALALQYKTRIKIYLIN-GDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADI 134 + + ++ + ++ + + ++ ++R I D + KV+Y+DAD Sbjct: 92 LRNVCDKPNISLRFFDVSRCFDGINFITHFHFRKENFYRLKIPD-LMRDFDKVVYIDADT 150 Query: 135 ICQGTIEPLINFSFPDDKVAMVVTE--------------GQADWWEKRAHSLGVAGIAKG 180 I + L + +A V G+ ++E Sbjct: 151 ITNRDLADLYDIDVDGYYIAAVRDFAMIATQNKKMLDIVGKKIYYETYVKDYLGLIGISN 210 Query: 181 YFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYN 240 YFNSG +L N + Q+S R IA++ K + DQD+LN++ +K+ D +N Sbjct: 211 YFNSGLVLFNINKINGSQISERLIALI----GTKLFAYVDQDILNIVFENKVKLIDYSWN 266 Query: 241 TQFSLNYQLK------ESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASP 294 + +HYIG KPW+D +++ + +P Sbjct: 267 MVIDCERLYHLSEPDLYARYLDAGAAPHVVHYIGGNKPWNDPTVH--MAEYYWRYAAKTP 324 Query: 295 WKNTALLKPN----NSNQLRYSAKHMLKKHRYLKGFSNYLFYFIEKI 337 L + NS + M R ++ + + Y + I Sbjct: 325 LYEKLLREIRERRENSGASSQPERKMHPGLRSIRSSAQIIGYMLFPI 371 >UniRef50_B7KM20 Glycosyl transferase family 8 n=1 Tax=Cyanothece sp. PCC 7424 RepID=B7KM20_CYAP7 Length = 347 Score = 190 bits (483), Expect = 6e-47, Method: Composition-based stats. Identities = 56/282 (19%), Positives = 105/282 (37%), Gaps = 10/282 (3%) Query: 25 ENLCLDIAYGTDKNFLFGCGISIASILKY-NEGSRLCFHIFTDYFGDDDRKYFDALALQ- 82 EN + I G D F G +++ S L + ++ +I +R + Sbjct: 9 ENEPITIVSGADDKFALGLAVTLYSALANLDTKRKIDIYIVDGGINSKNRDKLTQILNSD 68 Query: 83 -YKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIE 141 IK + L + + YFR ++ + + +V+YLD+D++ +G + Sbjct: 69 LMPVSIKWVKPDLTVLEGVKLFGSLNVTTYFRLLLPELLPTQVERVIYLDSDLVVEGNLA 128 Query: 142 PLINFSFPDDKVAMVVTEGQAD-WWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVS 200 L + V + + Y N+G +LIN QW + ++ Sbjct: 129 NLWEQELGNCPAVAVQDYVFPYVCNGLKTYQQLGLASNTPYCNAGVMLINIKQWRIEALN 188 Query: 201 ARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFI---NPV 257 + + + + + DQD +N L+A++ D+K+N Q Y K + + Sbjct: 189 RKILEYIRK--FYDLVYLADQDGINALIANRFKLLDLKWNVQIFGVYNGKIDLLCKPKEL 246 Query: 258 TNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTA 299 D +H+ P KPWH + + F S W N Sbjct: 247 IRDAFILHFTTPIKPWHPY-YRQAGGSRFTHYLRKSKWFNDL 287 >UniRef50_B8ISQ5 Glycosyl transferase family 8 n=1 Tax=Methylobacterium nodulans ORS 2060 RepID=B8ISQ5_METNO Length = 328 Score = 188 bits (478), Expect = 2e-46, Method: Composition-based stats. Identities = 71/324 (21%), Positives = 118/324 (36%), Gaps = 17/324 (5%) Query: 21 KVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDAL- 79 E + +A D+ F +++AS+L L HIF AL Sbjct: 5 HETDEIDRIAVALCIDRAFFRHALVTVASLLDAGPRQPLDVHIFYAEADPACMARIAALF 64 Query: 80 ALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGT 139 A Q + I+ DR P + + Y R ++ Y + + KVLYLDAD+I Sbjct: 65 ADQDRHGCHFQKISLDRFEGFPVSDAISAGTYARLLLP-YLMPRRAKVLYLDADLIVLDD 123 Query: 140 IEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQV 199 + PL VA V + + YFN+G LL+N A W + + Sbjct: 124 VAPLWRTELGAAPVAAVRDPFCDN------RPAIGFSPDEPYFNAGVLLMNLAVWRREGL 177 Query: 200 SARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLK------ESF 253 + R A ++ + + DQD LN++L + F D ++N Q + + Sbjct: 178 AERVAAHIDAHGA--SLKYFDQDALNVVLRGRARFVDPRWNFQPRMADATPADIACARAE 235 Query: 254 INPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSA 313 IHY P KPW D + + +++ A + Q R A Sbjct: 236 FRRTRARPAIIHYTTPHKPWKDPFAIH-YGRHYLDCLMRLEPDLRARYFADVPQQPRLRA 294 Query: 314 KHMLKKHRYLKGFSNYLFYFIEKI 337 H+ + R+ + + + Sbjct: 295 SHLKARMRWRFPEAYRAARTLFRA 318 >UniRef50_B1MX28 Lipopolysaccharide biosynthesis protein, LPS:glycosyltransferase n=2 Tax=Leuconostoc RepID=B1MX28_LEUCK Length = 283 Score = 187 bits (476), Expect = 3e-46, Method: Composition-based stats. Identities = 55/278 (19%), Positives = 106/278 (38%), Gaps = 10/278 (3%) Query: 18 YDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFD 77 + + ++I D+N++ + + S+ + N + + D+ + Sbjct: 1 MEKTKIINDDSVNILITIDENYIKPLRVLLYSLRQTNPRENMTIWLAHDHIEVAQLEKLH 60 Query: 78 ALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQ 137 Q + ++ S P+ K + +YFR + Y +V+YLD DI+ Sbjct: 61 QFVAQLGFVLHTIKVDTSLWASAPTFKQYPPEMYFRLLCGQYLPKTLHRVIYLDPDILVI 120 Query: 138 GTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQ 197 I PL N +A G + H G + YFNSG +L++ + Sbjct: 121 NPIRPLANMPLKGQMLAASSHMGLTGISQTINHLR--LGTRQVYFNSGVMLMDLDMMRQR 178 Query: 198 QVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIK-YNT-QFSLNYQLKESF-- 253 ++++ + K++ PDQD+LN L D+++ + +N +SF Sbjct: 179 VDMKAILSVIQQY--GKELILPDQDILNYLYGDEILSLPEEIWNYDTRDNIMHYAKSFGS 236 Query: 254 --INPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEA 289 + V +T+ +HY G KPW P + Sbjct: 237 VDMRWVMENTVILHYCGRPKPWEKSNSINPFIMLYQHY 274 >UniRef50_B2UPJ4 Glycosyl transferase family 8 n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UPJ4_AKKM8 Length = 315 Score = 187 bits (476), Expect = 4e-46, Method: Composition-based stats. Identities = 70/314 (22%), Positives = 120/314 (38%), Gaps = 22/314 (7%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEG-SRLCFHIFTDYFGDDDRKYFDALALQYKTRI 87 ++I Y TD N G G+SI S+++ +I T D+ F +L Y + Sbjct: 1 MNIVYATDDNGALGTGVSIVSLMENLPPGVHADIYIMTGGLSGDNTARFHSLQQGYNLHL 60 Query: 88 KIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 + D+ P W+ A Y+R +A + LY+D D I I P+ Sbjct: 61 HFIDM-KDKYTDFPVGSKWSAATYYRLGLAGELPATVERALYVDIDTIFNRDISPMYESE 119 Query: 148 FPDDKVAMV-VTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAM 206 F D +A V TE ++ R G Y N+G +L + + + ++ ++ Sbjct: 120 FGDCLIAGVFTTEDLSEESFSRWKREMNLGRDSIYINAGVILYHIGRIREECFESQVLSW 179 Query: 207 LNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQ-------------LKESF 253 I +++ DQD+LN+ +++ +N + LK + Sbjct: 180 AKNN--IHRLSWQDQDILNVCYQQRILLLHPMWNICDGAIWSIRWEGVTSFRNNPLKPAD 237 Query: 254 INPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNT--ALLKPNNSNQLRY 311 + IHY G KPWH + F + SPWK+ K N+ ++ Sbjct: 238 LLEAARRPGIIHYWGHPKPWHPNSIRQDYG-LFYKYWKKSPWKDDIRDFRKQNDPGRMFI 296 Query: 312 SAKH-MLKKHRYLK 324 S +L K + L Sbjct: 297 SKMRCLLGKGKRLL 310 >UniRef50_C8W7U9 Glycosyl transferase family 8 n=2 Tax=Atopobium RepID=C8W7U9_ATOPD Length = 1014 Score = 187 bits (475), Expect = 4e-46, Method: Composition-based stats. Identities = 52/331 (15%), Positives = 115/331 (34%), Gaps = 30/331 (9%) Query: 1 MQQVFFQETEFLNSVID--YDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYN-EGS 57 +Q V F E I + + + + D N++ ++ S+L+ Sbjct: 643 LQCVHFTNPEPRQKFIPLFEEKPEIASQNVVPVVFAADNNYVPILTCAMGSMLENADPNR 702 Query: 58 RLCFHIFTDYFGDDDRKYFDALALQY-KTRIKIYLI--NGDRLRSLPSTKNWTHAIYFRF 114 + G ++ +Y RI Y + + + + + YFRF Sbjct: 703 YYDVVVLNTNIGGSKQELVKKFFSRYKNARITFYNVWRMVKDYKLDTNNAHISVETYFRF 762 Query: 115 VIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWE-------K 167 + D KV+YLD+D++ G + L + ++ +A + K Sbjct: 763 LAQDILSA-YDKVVYLDSDLVVNGNVAELYDVRIGNNLIAATLDIDYLANLNIRGGDRMK 821 Query: 168 RAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNML 227 + + YF +G ++ NTA+ + + + P + DQD+LN Sbjct: 822 YSLDVLNLKNPYAYFQAGVMVFNTAELRRYHTVPEWLRIASNP----IFIYNDQDILNSE 877 Query: 228 LADKLIFADIKYNTQFSLNYQLKESF----------INPVTNDTIFIHYIGPTKPWHDWA 277 ++++ +N ++ + +E + +H+ G KPW + + Sbjct: 878 CQGRVLYLPADWNVTHNIFGRAEELYPMAPNSVFDDYQAARRAPKIVHFAGAIKPWQNAS 937 Query: 278 WDYPVSQAFMEAKNASPWKNTALLKPNNSNQ 308 D ++ F + +P+ + S + Sbjct: 938 CD--MASYFWKYARNTPFYEVIIQDMVPSAR 966 >UniRef50_C4VEI8 General stress protein A n=24 Tax=Enterococcus RepID=C4VEI8_ENTFA Length = 303 Score = 187 bits (474), Expect = 6e-46, Method: Composition-based stats. Identities = 57/284 (20%), Positives = 109/284 (38%), Gaps = 18/284 (6%) Query: 24 TENLCLDIAYGTDKNFLFGCGISIASILKYNEGSR-LCFHIFTDYFGDDDRKYFDALAL- 81 L I + NF+ SIL+ + + + F++ D + ++ Sbjct: 5 ENRKELAIVSCCNTNFVPHLAAMFVSILENSPSAAAVHFYVIDDNINFESKQLLYFTIKH 64 Query: 82 -QYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFIN-KAPKVLYLDADIICQGT 139 Q + + IN +++ +++ Y+R I + F + ++LY+D D+I Sbjct: 65 TQLNAELTFFKINPHFFKNVVTSERIPKTAYYRIAIPELFRGSQIERLLYMDCDMIALDD 124 Query: 140 IEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQV 199 + L ++ +A V G + +R + + + YFNSG LLI+ +W V Sbjct: 125 VAKLWTVDLGENIIAAVEDAG----FHQRLEKMAIPAESMCYFNSGLLLIDVKKWLNLDV 180 Query: 200 SARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFS-------LNYQLKES 252 + + + + E K+ DQD LN +L D+ K+N Q E Sbjct: 181 TTKVLRFIEEN--PDKLRFHDQDALNAVLHDRWTLLHPKWNAQGYILSKAKKHPTIYGEK 238 Query: 253 FINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWK 296 IH+ G KPW Y + + + N + ++ Sbjct: 239 QYEETRRAPSIIHFTGHVKPWTKEFQWY-TKRYYDQYANRTAFR 281 >UniRef50_B4RJG6 LgtC n=29 Tax=Neisseria RepID=B4RJG6_NEIG2 Length = 307 Score = 186 bits (473), Expect = 9e-46, Method: Composition-based stats. Identities = 64/327 (19%), Positives = 120/327 (36%), Gaps = 39/327 (11%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 +DI + D N+ ++ S+ + + + FH+ +++R A I+ Sbjct: 1 MDIVFAADDNYAAYLCVAAKSVEAAHPDTEIRFHVLDAGISEENRAAVAANLRGGGGNIR 60 Query: 89 IYLINGDRLRSLPST-KNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 +N + P ++ + Y R + +Y KVLYLD D++ + ++PL + Sbjct: 61 FIDVNPEDFAGFPLNIRHISITTYARLKLGEYI-ADCDKVLYLDTDVLVRDGLKPLWDTD 119 Query: 148 FPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAML 207 + V + + YFN+G LLIN +W + + + Sbjct: 120 LGGNWVGACIDLFVERQEG--YKQKIGMADGEYYFNAGVLLINLKKWRRHDIFKMSCEWV 177 Query: 208 NEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDT------ 261 + + + + + DQD+LN L + +A+ ++N + NY + D Sbjct: 178 EQYKDV--MQYQDQDILNGLFKGGVCYANSRFNFMPT-NYAFMANGFASRHTDPLYLDRT 234 Query: 262 ------IFIHYIGPTKPWHDWAWDYPVSQAFMEAKNAS-----PWKNTALLKPNNSNQLR 310 HY G KPWH ++ F E + W+ + P Sbjct: 235 NTAMPVAVSHYCGSAKPWHR-DCTVWGAERFTELAGSLTTVPEEWRGKLAVPPT------ 287 Query: 311 YSAKHMLKKHRYLKGFSNYLFYFIEKI 337 KHML++ R F+ KI Sbjct: 288 ---KHMLQRWRKKLS-----ARFLRKI 306 >UniRef50_A4ECW2 Putative uncharacterized protein n=1 Tax=Collinsella aerofaciens ATCC 25986 RepID=A4ECW2_9ACTN Length = 328 Score = 185 bits (471), Expect = 1e-45, Method: Composition-based stats. Identities = 62/285 (21%), Positives = 112/285 (39%), Gaps = 23/285 (8%) Query: 28 CLDIAYGTDKNFLFGCGISIASILKYNEG-SRLCFHIFTDYFGDDDRKYFDALALQYKTR 86 +++ Y D NF+ +I S++ + G + FH+F++ +D+++ + +Y Sbjct: 3 IMNLLYTVDNNFVPQLAANICSVVSNHSGIQDITFHVFSNGITEDNQRLLQEMVTEYNQN 62 Query: 87 IKIYLING--DRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLI 144 + Y I+ D L T W + R ++A + N+ +V+YLD D I G I L Sbjct: 63 LVFYDISNFKDALGFDFDTSGWNEIVLARLLMAHFLPNEIERVIYLDGDTIVLGDIALLW 122 Query: 145 NFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAI 204 N V MV G Y N+G LL++ QW + + + Sbjct: 123 NQDLKGCVVGMVPEPTVGPSRLNDLDLNGCL-----YHNAGVLLVDLKQWRSTCCEDQLL 177 Query: 205 AMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQ------------LKES 252 ++ DQD LN +L DK+ +N +Y E+ Sbjct: 178 DYCE--RRSGRLFANDQDALNAVLKDKICSLSPAFNYSNIFDYYPFIFLNSLMPGFSDEN 235 Query: 253 FINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKN 297 N + I +HY+G +PW + + + + + WK+ Sbjct: 236 SFNTARSKPIVVHYLGEERPWRR-GNTHRFNNEYHFYLSETFWKD 279 >UniRef50_B9KVD4 Glycosyl transferase, family 8 n=2 Tax=Rhodobacter sphaeroides RepID=B9KVD4_RHOSK Length = 334 Score = 185 bits (469), Expect = 2e-45, Method: Composition-based stats. Identities = 57/281 (20%), Positives = 105/281 (37%), Gaps = 12/281 (4%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFT-DYFGDDDRKYFDALALQYKTRI 87 + + + D+ F ++ L H+ T D +++ ++ ALA I Sbjct: 1 MHLLFCADRPFFRHAAVAAV-SAASATRGPLQVHLLTCDSCPEEEARFRVALAPFAHVGI 59 Query: 88 KIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 ++ + RL L ++ + A Y RF+ + +VLYLD D+I + L+ Sbjct: 60 SVHRVPAARLEGLFVDRHLSPAAYLRFLAPEVLPEAVQRVLYLDCDLIVLDDVAQLLRLD 119 Query: 148 FPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAML 207 VA G D + + + Y NSG LL++ +W +S + + Sbjct: 120 LQGRAVAAAPDLGWKDAAQAARFRTLGIPLDRPYVNSGVLLMDLGRWRRDGLSQKLFDYV 179 Query: 208 NEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINP-------VTND 260 + + DQD LN +LAD + D ++N Q L + + D Sbjct: 180 ARHGSL--LLRHDQDALNAVLADDIHLLDRRWNLQVLLLSPWAKRALPEDRQATVAARRD 237 Query: 261 TIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALL 301 +H+ KPW+ W + + + +PW Sbjct: 238 PAILHFSTADKPWNFRVWTRR-RELYFRFRARTPWSRAVPE 277 >UniRef50_C4Z1V1 Putative uncharacterized protein n=1 Tax=Eubacterium eligens ATCC 27750 RepID=C4Z1V1_EUBE2 Length = 607 Score = 185 bits (469), Expect = 3e-45, Method: Composition-based stats. Identities = 60/337 (17%), Positives = 112/337 (33%), Gaps = 30/337 (8%) Query: 6 FQETEFLNSVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKY-NEGSRLCFHIF 64 E + L + +H+ + + + ++ + + + S+ + R + Sbjct: 250 LHEYDELLDSYNREHEEYMAVSRIPVFFSINEQYAPYLAVCLKSLAVHVACDERYRIIVM 309 Query: 65 TDYFGDDDRKYFDALALQY-KTRIKIYLING--------------DRLRSLPSTKNWTHA 109 D + + Y I+ I DR + + +T Sbjct: 310 CDNVKNITMIQLRNVIKDYENIDIEFVDIRKKMYEYSESFGQTVTDRQENRLYSGEFTLT 369 Query: 110 IYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRA 169 IYFR IA+ F + K +Y+D+D + I L + D V Sbjct: 370 IYFRLFIAELFP-ELNKAVYIDSDTVINDDIAKLYSVDMGDAMFGAVRDTFAGKNTILAH 428 Query: 170 HSLGVAGIAKG-YFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLL 228 + V GI + Y NSG LL+N + ++ R + ++ E PDQD +N + Sbjct: 429 YIENVVGIERNEYVNSGVLLMNLDKIRQAHLADRFLKLMAEYHFDS--VAPDQDYINSMC 486 Query: 229 ADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFME 288 A ++ F D ++N + + IHY KPWH + P + F + Sbjct: 487 AKEIYFLDKEWNVMPNKGGEY--------IARPKLIHYNLFDKPWHY--SEIPYEEYFWQ 536 Query: 289 AKNASPWKNTALLKPNNSNQLRYSAKHMLKKHRYLKG 325 S + + + A K + Sbjct: 537 YAAESGFYPLLIKQRKQYGDNEKKADRENLKKLLARA 573 Score = 83.1 bits (204), Expect = 1e-14, Method: Composition-based stats. Identities = 53/272 (19%), Positives = 92/272 (33%), Gaps = 58/272 (21%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDY----------FGDDDRKYFDA 78 ++I Y DK G +S S++K L +I T KY + Sbjct: 1 MNILYCGDKTMQKGILLSSMSLIKN-VDEPLNIYILTVDYGEKGINYKPVDKAFAKYLEE 59 Query: 79 LALQYKTRIKIYLING-----DRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDAD 133 + ++ ++L++ + L +T R I +VLYLD D Sbjct: 60 KLNKSDIKVNVFLVDVTRYFVEELPEANMQSRFTACCMLRLFADKTDIK--DRVLYLDTD 117 Query: 134 IICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQ 193 ++C+ + + ++A V G GY NSG +L+N Sbjct: 118 VLCRKGFRDFYHQNMDGIEIAGVSD------------YYGRWLFGDGYINSGVMLMNMRM 165 Query: 194 WAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESF 253 + + E I K++ PDQ +N A ++ K+N Q L+ Sbjct: 166 IRQNGLLEKC----REQCIRKEMFMPDQTAVN-TFATRVNLCGRKFNDQRRLH------- 213 Query: 254 INPVTNDTIFIHYIG-----------PTKPWH 274 ++T+F H+ KPW Sbjct: 214 -----DNTVFQHFTTTFRVFPVIRTVSVKPWE 240 >UniRef50_Q116W1 Glycosyl transferase, family 8 n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q116W1_TRIEI Length = 278 Score = 184 bits (467), Expect = 4e-45, Method: Composition-based stats. Identities = 71/294 (24%), Positives = 129/294 (43%), Gaps = 19/294 (6%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 +++ + D+N+ G++I S+L N S HI T++ + ++ D L+ YK Sbjct: 2 MNLLFCFDQNYQQHFGVAITSVLLNNLSSHFDVHIITNFMEEKLKQKLDTLSKNYKCSFH 61 Query: 89 IYLING-DRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 +Y+IN D++ L + + ++A Y+R ++A+ KVLYLD+D++ +E L N Sbjct: 62 LYIINNLDKISKLKVSDHVSNATYYRLIMAEILPKHIDKVLYLDSDVVVISPLEELYNID 121 Query: 148 FPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAML 207 + +A G + FNSG +++N +W +Q+S + I Sbjct: 122 LENYFIAASGFSGTL--------------VKSKGFNSGVMVVNLEKWRNEQISTKVIDFA 167 Query: 208 NEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQ-LKESFINPVTNDTIFIHY 266 + K+ + DQ LN ++ + D K+N Q L+ + +++ N + IHY Sbjct: 168 TKNR--DKLPYHDQSALNRVIKQNYLIIDRKWNFQVDLSPRKIQKPDDNIALKNARIIHY 225 Query: 267 IGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKKH 320 IG +KPW+ W D + S W + L A K Sbjct: 226 IGSSKPWYFWISD-QRKNIYELYLKKSLWSTSKLQMIFQQTVYFRKALQRKLKK 278 >UniRef50_UPI000196958D hypothetical protein BACCELL_01586 n=1 Tax=Bacteroides cellulosilyticus DSM 14838 RepID=UPI000196958D Length = 305 Score = 184 bits (466), Expect = 6e-45, Method: Composition-based stats. Identities = 66/309 (21%), Positives = 109/309 (35%), Gaps = 15/309 (4%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 ++I D ++ C + + S + N G ++ T+ DD + + Y Sbjct: 1 MNIVCAADSGYVQHCSVMLISFFENNPGEEHAVYLLTEGLDLDDLDFIQKIVHSYNGHFF 60 Query: 89 IYLINGDRLRSLP--STKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINF 146 ++ L P ST + + A Y R +AD KVLYLD DII +I+ L Sbjct: 61 YCQVDFKFLEKCPIKSTDHLSIATYNRLFMADLLPADVNKVLYLDCDIIVNQSIKELWET 120 Query: 147 SFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAM 206 D+ V E + + GYFN+G LL+N W ++ I Sbjct: 121 PLRDNFVVAAFEERGCCAED--VYERLDYDSKYGYFNAGVLLVNLDYWRTHNMTQAFIEY 178 Query: 207 LNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQ------LKESFINPVTND 260 + +K+ DQDVLN DK + + +N +F Y + + + Sbjct: 179 IEHN--FEKLRAHDQDVLNAFFYDKSVHISLAWNVEFIFYYYGIIKKFGFDRDLRFILRH 236 Query: 261 TIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKKH 320 +H+ KPW + +P + K L + L Sbjct: 237 PKILHFTWKPKPWET-SCQHPFRINYYRYLKKI--KKNPLSFRDTLRALWDKYYFCFLIK 293 Query: 321 RYLKGFSNY 329 +KG Y Sbjct: 294 WKIKGHKYY 302 >UniRef50_A5ZFA9 Putative uncharacterized protein n=1 Tax=Bacteroides caccae ATCC 43185 RepID=A5ZFA9_9BACE Length = 310 Score = 183 bits (465), Expect = 7e-45, Method: Composition-based stats. Identities = 66/318 (20%), Positives = 127/318 (39%), Gaps = 21/318 (6%) Query: 30 DIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKI 89 +I G D + CG + S+ + N G+ + ++ + + + L Y+ +I Sbjct: 3 NIICGIDDQYCQHCGAMLLSLFESNPGA-ITIYVLSLELSEKSKNLLKELVDSYQKQIHF 61 Query: 90 YLINGDRLRSLP--STKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 I + + + P ST + A Y R I + K LY+D+DII + I L + Sbjct: 62 IDIPSELVLNFPMKSTDYPSLATYLRLFIPQLLPFEVDKALYVDSDIIFKKDISALYDSD 121 Query: 148 FPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAML 207 + +A + + A LG + YFN+GF+L+N + +A+A + Sbjct: 122 ITNYALAGMED-----APNQNALRLGFPE-SDLYFNAGFVLLNVKYLRDMDFTNKAMAYI 175 Query: 208 NEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLK---ESFINPVTND---T 261 + +KI DQDVLN LL K++F IK+N + + ++ + + Sbjct: 176 RDCR--EKIVLHDQDVLNALLHGKVLFVPIKWNMLDCFYRKPPFIAKKYMRELHENLDSP 233 Query: 262 IFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKKH- 320 IH+ GP KPWH +P+ + + W + + ++ + Sbjct: 234 AVIHFSGPLKPWHH-GCPHPLRKEYFNYSRKLSWGCQSPDYYYVFSAFKFPVSLFIWLGC 292 Query: 321 --RYLKGFSNYLFYFIEK 336 + + + ++ Sbjct: 293 TLEKAERLDRKIRFKKKR 310 >UniRef50_B1Y723 Glycosyl transferase family 8 n=1 Tax=Leptothrix cholodnii SP-6 RepID=B1Y723_LEPCP Length = 316 Score = 183 bits (464), Expect = 9e-45, Method: Composition-based stats. Identities = 58/304 (19%), Positives = 112/304 (36%), Gaps = 21/304 (6%) Query: 30 DIAYGTDKNFLFGCGISIASILKYNEGS-RLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 I D+ +L ++ S+++ N + H+ D R + +I+ Sbjct: 12 PIVLACDEAYLMPLATTLRSVVESNAAHWPIECHVLVDDVSLPGRARVERSLPARAAQIR 71 Query: 89 IYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSF 148 + ++ S + + + R ++AD + +VLYLD DI+ G + PL+ Sbjct: 72 WHAVDLTDFSSFETQAAISKMTFARLLMADLLPAELERVLYLDTDILVLGDLLPLMRTEL 131 Query: 149 PDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLN 208 + V A+ G+ + YFN+G LLI+ A+W A +VSA A L Sbjct: 132 DGAILGAVRDGLDAELKSTSPAPTGMPDVCD-YFNAGVLLIDLARWRAGRVSAAARDHLV 190 Query: 209 EPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIG 268 DQD LN+ +N Q ++ + + +H+I Sbjct: 191 AHPQTP---FADQDALNVACDGHWKPLAAHWNFQ---GHRSTDIAALAPSQRPGIVHFIT 244 Query: 269 PTKPWH--DWAWDYPVSQAFME--AKNASP---WKNTA------LLKPNNSNQLRYSAKH 315 KPW + + + + P W + + + ++++ KH Sbjct: 245 ALKPWKADSLSLNARLYDGWRSRTLFARHPVMRWTDAIRALVSRMNRALSAHESTRRLKH 304 Query: 316 MLKK 319 L++ Sbjct: 305 QLRQ 308 >UniRef50_C6I3U6 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=C6I3U6_9BACE Length = 310 Score = 182 bits (463), Expect = 1e-44, Method: Composition-based stats. Identities = 66/298 (22%), Positives = 112/298 (37%), Gaps = 21/298 (7%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 ++I D N++ + + S N+ ++ T D + Y + +Y + Sbjct: 1 MNILCCLDDNYVQHTSVMLTSFFINNDFEHHNIYVITMQLNDGNVAYLREVVNKYHSNFY 60 Query: 89 IYLINGDRLRSL--PSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINF 146 +Y +N L T + A Y R KVLY+D DI+ + ++E L Sbjct: 61 LYQVNEAMLSGFVRKETDYVSLAAYLRLFSTQVLPFNCSKVLYIDGDIVVRKSLEELWKM 120 Query: 147 SFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAM 206 + VA V +A+ + GYFNSGF+LIN + W V+ +AI Sbjct: 121 DIENYAVAAVDETIKANCIRHNY------DVTLGYFNSGFMLINLSFWRENSVAEKAIDY 174 Query: 207 LNEPEIIKKITHPDQDVLN-MLLADKLIFADIKYNTQFSL----------NYQLKESFIN 255 + ++I DQD LN +L D+KYN ++ N Sbjct: 175 MK--RFPERIKSWDQDALNGILYGGLWKRLDLKYNLTTIFLCKQYVEGQDFPKIYTEEYN 232 Query: 256 PVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSA 313 +D +HY GP KPW D+P + +++ + + +R Sbjct: 233 SAISDPAVVHYTGPDKPWKYTVVDHPFKKDYLQYARMLGINHDFNISIFFKRIVRKLL 290 >UniRef50_B3CA80 Putative uncharacterized protein n=1 Tax=Bacteroides intestinalis DSM 17393 RepID=B3CA80_9BACE Length = 301 Score = 182 bits (462), Expect = 2e-44, Method: Composition-based stats. Identities = 67/299 (22%), Positives = 117/299 (39%), Gaps = 20/299 (6%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIF-TDYFGDDDRKYFDALALQYKTRI 87 +DI D+N++ CG+ +AS+ + + HI + +K +++ + Sbjct: 2 IDIVCSIDENYIEYCGVMLASLFVHTPDEKFRVHIICSSKVEKAGKKRLKVFCEKHQAEV 61 Query: 88 KIYLINGDRLRSLPSTK--NWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLIN 145 Y ++ ++ P K + + A Y R +++ + K+LYLD D+I +I+ L Sbjct: 62 YFYDVDYSLIKDFPIRKQDHLSLAAYLRLFMSELIPSNINKILYLDCDLIVVDSIKELWE 121 Query: 146 FSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIA 205 + + VA V D + YFNSG +LIN +W ++ + Sbjct: 122 KNIDNIAVAAVEERSPFDTESPVTLK---YPVEYSYFNSGVMLINLQKWREKKFVEACKS 178 Query: 206 MLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKE------SFINPVTN 259 + + I DQDVLN LL + F I++N Y E + Sbjct: 179 YIASN--YENIKLHDQDVLNALLYKEKQFISIRWNLMDFFLYASPEVQPERKKDWDDALK 236 Query: 260 DTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLK 318 IH+ G KPW + D P ++ W NN N + Y + +L Sbjct: 237 SPAIIHFTGKRKPW-MYNCDSPFRDQYIRFAKQQGWHVI-----NNKNAIHYFFRKILY 289 >UniRef50_B2ISC6 Glycosyl transferase, family 2/glycosyl transferase family 8 n=8 Tax=Streptococcus pneumoniae RepID=B2ISC6_STRPS Length = 696 Score = 182 bits (461), Expect = 2e-44, Method: Composition-based stats. Identities = 62/328 (18%), Positives = 120/328 (36%), Gaps = 27/328 (8%) Query: 1 MQQVFFQETEFLNSVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLC 60 + + + + + E+ I + ++ +I SI +N + Sbjct: 275 LSDTATYKEFEMKQRLLNQLSRQEESEKKAIVLAANYAYVDQVLTTIKSICYHN--RSIR 332 Query: 61 FHIFTDYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYF 120 F++ F ++ K + ++ + I + +++ S ++ ++ R+ IAD+ Sbjct: 333 FYLIHSDFPNEWIKQLNKRLEKFDSEIINCRVTSEQISCYKSD--ISYTVFLRYFIADFV 390 Query: 121 INKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKG 180 + K LYLD D++ ++ L D ++A V G G A + Sbjct: 391 --QEDKALYLDCDLVVTKNLDDLFATDLQDYRLAAVRDFG------------GRAYFGQE 436 Query: 181 YFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYN 240 FN+G LL+N A W + + + I + NE K+ DQ +LNML K + D YN Sbjct: 437 IFNAGVLLVNNAFWKKENMIQKLIDVTNEWH--DKVDQADQSILNMLFEHKWLELDFDYN 494 Query: 241 TQFSLNYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTAL 300 ++ Q + + + IHY+ KPW D A + + + W Sbjct: 495 HIV-IHKQFADYQLPEGQDYPAIIHYLSHRKPWKDLAAQT-YREVWW-YYHGLEWTELG- 550 Query: 301 LKPNNSNQLRYSAKHMLKKHRYLKGFSN 328 N + H+ Sbjct: 551 ---QNHHLHPLQRSHIYPIKEPFTCLIY 575 >UniRef50_B3Z5I6 Glycosyltransferase family 8 n=2 Tax=Bacillus cereus group RepID=B3Z5I6_BACCE Length = 317 Score = 182 bits (461), Expect = 2e-44, Method: Composition-based stats. Identities = 63/321 (19%), Positives = 127/321 (39%), Gaps = 30/321 (9%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEG-SRLCFHIFTDYFGDDDRKYFDALALQYKTRI 87 L++ Y +D N+ G+S+ S+L+ N+ + L + + ++K +++ +Y I Sbjct: 3 LNVVYSSDDNYAQHVGVSLLSLLQNNQHFNNLNIFLIENNISSYNKKNLNSVCKKYNKTI 62 Query: 88 KIYLINGDRLR-SLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINF 146 + N R L + Y R +A + K++YLD D I ++ L + Sbjct: 63 QYINFNVLLERLELNINDSIAINSYARLFLAGIIPEELDKIIYLDCDSIINSSLSDLWDT 122 Query: 147 SFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAM 206 + VA V + ++GY N+G LLIN +W + + + + Sbjct: 123 DVTEYFVAGVCDTVS-----NQTKLRIDMDKSEGYINAGMLLINLKKWREENIEQKFMEF 177 Query: 207 LNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQ--------------FSLNYQLKES 252 + + + + H DQ +N +L DK+++ K+N + L E Sbjct: 178 IKKKD--GNVFHHDQGTINGVLKDKILYLHPKFNAMTPFFTMSRKEIMSYYELENYYNEI 235 Query: 253 FINPVTNDTIFIHYIGP--TKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNN----S 306 I+ + +FIHY +PW +P++ + + +PWK+T L K Sbjct: 236 EIDEAVKNPVFIHYTPAFVNRPW-IEGCKHPLTSLYKSYLDMTPWKSTDLWKDRRGKVEK 294 Query: 307 NQLRYSAKHMLKKHRYLKGFS 327 + + +++ Sbjct: 295 TIALLYTRLPFRIAHHIRNLI 315 >UniRef50_C1QBZ8 LPS:glycosyltransferase n=1 Tax=Brachyspira murdochii DSM 12563 RepID=C1QBZ8_9SPIR Length = 336 Score = 182 bits (461), Expect = 2e-44, Method: Composition-based stats. Identities = 73/303 (24%), Positives = 120/303 (39%), Gaps = 20/303 (6%) Query: 30 DIAYGTDKNFLFGCGISIASILKY-NEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 +I +D+N+ +++ASILK N+ + FHI D+ + L IK Sbjct: 5 NICLCSDENYAKYMAVTMASILKNTNDDENIIFHIIESNIKDETKNKLIYLKKIKNCEIK 64 Query: 89 IYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSF 148 Y + ++ + A Y R +I + A KVLYLD+DII G+++ L + Sbjct: 65 FYRVEYNK---------YPLATYLRLLIPELI-KDADKVLYLDSDIIVNGSLKELFDIDI 114 Query: 149 PDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLN 208 V + E L G + YFN+G +L N +S + + Sbjct: 115 NGYYALAVKDLYVDIYKEH--KELIEIGNNRIYFNAGVVLFNNKSCIDNNISQKFYSYFT 172 Query: 209 EPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIG 268 E + K+ DQD+LN DK+ D K+N +Y K + P +D + IH++ Sbjct: 173 ENK--NKLKFHDQDILNHCFIDKVKIIDRKWNFMPFRDYNTKSHY--PTKDDAVIIHFV- 227 Query: 269 PTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNS--NQLRYSAKHMLKKHRYLKGF 326 KPW + +PW + + Q Y + + + Y K F Sbjct: 228 EHKPWKTQKDRTYFLDDYWRYYQYTPWFFEEPITAIQTMMQQKMYDYEDIRFRSNYFKFF 287 Query: 327 SNY 329 Y Sbjct: 288 GIY 290 >UniRef50_C9PNX4 Family 8 glycosyl transferase n=1 Tax=Pasteurella dagmatis ATCC 43325 RepID=C9PNX4_9PAST Length = 285 Score = 181 bits (459), Expect = 3e-44, Method: Composition-based stats. Identities = 55/285 (19%), Positives = 108/285 (37%), Gaps = 21/285 (7%) Query: 26 NLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKT 85 + ++I D F I S+ +N+ + F++ + + + + + + Sbjct: 9 DSNMNIVLSADVQFSEQVKTLIKSVSYHNKN--VHFYLLNKDYPSEWFQILNQYLAYFGS 66 Query: 86 RIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLIN 145 I ++ + + + P+ + + A YFR+++ +VLYLD D++ G++ + Sbjct: 67 NIIDAKVDSEVISTFPTLDHISEASYFRYLLGQL---PLDRVLYLDCDVVVTGSLTEIYY 123 Query: 146 FSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIA 205 F D+ + V K YFNSG LLI+ +W Q + + + Sbjct: 124 TDFGDNMMYAVEDAFLNIAPHSYKEF----PDMKPYFNSGMLLIDLNKWRDQNIENQLMD 179 Query: 206 MLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSL--------NYQLKESFINPV 257 + + + + DQD +N++L K D YN Q + E + + Sbjct: 180 LTKQAV---NLYYGDQDAMNIILKGKWQALDKIYNYQTGSLIAFIQHKMPEALEKYKDLQ 236 Query: 258 TNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLK 302 IHYI KPW +D P + W++ Sbjct: 237 GQQPKVIHYITRYKPWLLPEYDLPFRDQYWAYYQL-EWQDIIRKW 280 >UniRef50_A1XRC1 Glycosyltransferase n=1 Tax=Haemophilus ducreyi RepID=A1XRC1_HAEDU Length = 267 Score = 181 bits (459), Expect = 4e-44, Method: Composition-based stats. Identities = 63/261 (24%), Positives = 119/261 (45%), Gaps = 13/261 (4%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 ++I + +D+N+ + + SIL +N + F+I ++ + + +L ++ + I+ Sbjct: 1 MNIVFSSDENYAPHLSVCLYSILSHN--YNINFYILDLGIKEESKSFIKSLVEKFNSNIE 58 Query: 89 IYLINGDRLRSLPST-KNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 I+ D + P + A Y R + DY + KVLYLD D I G++ L + Sbjct: 59 FIKISVDSFSNFPIYIDYISLATYARLKLTDYLP-QLEKVLYLDIDTIVNGSLIDLWDLD 117 Query: 148 FPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAML 207 + +A V E + + YFN+G LLI+ +W + +++ ++ Sbjct: 118 LNEYYIAAVADPFI----ESLNYKTILGLDKNIYFNAGVLLIDCIKWKQYNIFDKSVKII 173 Query: 208 NEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTN---DTIFI 264 + + KK+ + DQD+LN++L DK++ D +YN S +K + + Sbjct: 174 KD--LSKKLQYQDQDILNLILKDKVLLLDCRYNFMPSQLDFIKRDKVRKGIKITTPIVIY 231 Query: 265 HYIGPTKPWHDWAWDYPVSQA 285 HY GP KPWH ++ Sbjct: 232 HYCGPKKPWHIDCTNFNCELY 252 >UniRef50_D1Y7U2 Glycosyltransferase, family 8 n=1 Tax=Pyramidobacter piscolens W5455 RepID=D1Y7U2_9BACT Length = 617 Score = 180 bits (457), Expect = 6e-44, Method: Composition-based stats. Identities = 65/339 (19%), Positives = 120/339 (35%), Gaps = 42/339 (12%) Query: 32 AYGTDKNFLFGCGISIASILKYNEGSR-LCFHIFTDYFGDDDRKYFDALALQYKTRIKIY 90 ++ ++ G + SI + SR +IF ++ ++ + Sbjct: 283 VLAANEKYVPILGTCLKSIADHCSSSRSYKLYIFHTDIQEESQRNLKTFLESDNFSLTFV 342 Query: 91 LINGDRLR-SLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFP 149 ++ + L + ++ T ++RF+I D KVLYLD D+I Q I L + Sbjct: 343 NVSLHVGKYRLRAKEHVTTETFYRFLILDLLKM-YDKVLYLDCDMIIQRDIADLYDLDLG 401 Query: 150 DDKVAMVVTEGQA-------DWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSAR 202 + + + K ++ YF +G LL+N A+ + Sbjct: 402 TNLIGAALDPDFTGQCNGANPATRKYCDAVLKLKDCFTYFQAGVLLMNVAELNKSVTVRQ 461 Query: 203 AIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKE----------- 251 + M + DQD+LN++ + ++ D+ +N ++ Sbjct: 462 LLEMAET----GIYKYSDQDILNVVCEGRALYLDMAWNLLSDCDHYRWHHVVKFAPHYIL 517 Query: 252 SFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALL---------- 301 IHY G KPW D F +A +P+ L Sbjct: 518 DMYENAREKPYIIHYAGFLKPWMKLGED--FGYEFWKAARETPFYEELLYAALVPHGNTT 575 Query: 302 KPNNS-----NQLRYSAKHMLKKHRYLKGFSNYLFYFIE 335 +P N N+L AK +L K L+ F+ +L+Y I+ Sbjct: 576 RPQNFLHMLINRLVPLAKAVLPKGSRLRYFARHLYYRIK 614 >UniRef50_D2QX95 Glycosyl transferase family 8 n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2QX95_9PLAN Length = 350 Score = 179 bits (455), Expect = 9e-44, Method: Composition-based stats. Identities = 59/342 (17%), Positives = 118/342 (34%), Gaps = 33/342 (9%) Query: 25 ENLCLDIAYGTDKNFLFGCGISIASILKY-NEGSRLCFHIFTDYFGDDDRKYFDALALQY 83 LD+ D F G +I S+L + S+L + ++R Sbjct: 1 MQRVLDVLTSADDRFAIGLAGTIKSVLASLSPSSKLNLWVLDGGISSENRDDLIHHWNDP 60 Query: 84 KTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPL 143 + + ++ L + + A Y+R + + + K+LY+DAD++ Q + L Sbjct: 61 RLSVNWLPVDRALLAEFKVAPHMSDAAYYRLLAPNLLPSSVKKLLYIDADLLVQRDLTDL 120 Query: 144 INFSFPDDKVAMVVTEGQADWWEKRAHSLGV-------------------AGIAKGYFNS 184 + F V G + YFNS Sbjct: 121 WDEPFDGHSCIAVHDIGAPFLDSNQILLEKPDALSRIVCRNPIPMFEELGLAPETRYFNS 180 Query: 185 GFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFS 244 G +I+ W ++Q+S + +L ++ + DQ LN++LA++ AD ++N Sbjct: 181 GVFMIDLETWRSEQLSVQMFDVLCTHR--ERQIYHDQFALNIVLANRWKAADYRWNQLAY 238 Query: 245 L-------NYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPW-K 296 + + L+ + +H+ KPW +P+ + F + S W + Sbjct: 239 IHELKVPQHTFLEPQVFQQYKHSPWVVHFT-YRKPWQ-PECQHPLRKRFFDYLAGSKWMQ 296 Query: 297 NTALLKPNNSNQLRYSAKHMLKKHRYLKG-FSNYLFYFIEKI 337 P + A ++ + L G + I+ + Sbjct: 297 AMPEWHPPQQPIVAPPAPPAARQRQGLLGRMQRSIRKRIDSL 338 >UniRef50_C7IBC8 Glycosyl transferase family 8 n=1 Tax=Clostridium papyrosolvens DSM 2782 RepID=C7IBC8_9CLOT Length = 464 Score = 179 bits (454), Expect = 1e-43, Method: Composition-based stats. Identities = 68/287 (23%), Positives = 112/287 (39%), Gaps = 16/287 (5%) Query: 64 FTDYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINK 123 +++ A +Y +RI+ + + + + + + YFR I + Sbjct: 2 IDGGISSRNKECLRACVEKYGSRIRFLELKPELYQDFKTQSYFGYVTYFRIFIPEIVEAS 61 Query: 124 APKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAK--GY 181 KV+YLD DI+ +G I L + VA V G GI + Y Sbjct: 62 VRKVIYLDCDIVIKGDIRKLWENDISEYFVAAVEDVGIDIGGNFATMVKKHIGIPRKGKY 121 Query: 182 FNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNT 241 FN+G LLIN +W A + + L E +KI DQD LN + D+ + I++N Sbjct: 122 FNAGVLLINLDKWRADKTTETIRKYLIENR--EKIYFADQDGLNAVFKDRWLKLPIEWNQ 179 Query: 242 QFSLNYQLKESFIN-----PVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWK 296 Q + LK + I+ + + IHY KPW +P+ + + +PW Sbjct: 180 QADILELLKRNRIDRPDVMKAALNPMIIHYTKQVKPWQYKDC-HPLKEEYHRYLRLTPWN 238 Query: 297 NTALLKPNNSNQLRYSAK------HMLKKHRYLKGFSNYLFYFIEKI 337 +TA ++ K L K + F YF +K+ Sbjct: 239 DTAPKVTIVDVLGKFLGKTPIGRGFYLYKKKIRDYFIIDKSYFSDKL 285 >UniRef50_B2UQ54 Glycosyl transferase family 8 n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UQ54_AKKM8 Length = 328 Score = 179 bits (454), Expect = 1e-43, Method: Composition-based stats. Identities = 57/332 (17%), Positives = 114/332 (34%), Gaps = 26/332 (7%) Query: 21 KVETENLCLDIAYGTDKNFLFGCGISIASILK-YNEGSRLCFHIFTDYFGDDDRKYFDAL 79 + + +D + +++ S+L + ++ +D ++ + L Sbjct: 2 NNPMKKNEFAVVLASDNRGILPLSVTVFSLLNTAGPETFYKIYVLSDGIDGENWASVERL 61 Query: 80 ALQYKTRIKIYLINGDRLRS-LPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQG 138 A + R++ ++G + P T+ W + R I + + +LYLD D++ Sbjct: 62 AAPFDCRLEFIDVSGILEKHDFPHTEQWPVPAWGRVFIPELLKEERGNILYLDIDVLVCR 121 Query: 139 TIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQ 198 + L + + +V L + GYFNSG LL+N + + Sbjct: 122 DLTELFRTNMDGKAIGVVFENFSRP-GSHFNERLEMPLTCTGYFNSGVLLMNVDVFREKN 180 Query: 199 VSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFIN--- 255 + + ++T PDQD LN L + + ++N L ++ ++ Sbjct: 181 LVRAVLDYAVTHR--DRLTCPDQDALNGALCELTVPLHPRWNWHDGLTRRILKNDPREQF 238 Query: 256 ----------PVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNAS-----PWKNTAL 300 + +HY G KPW + W Y + + + P L Sbjct: 239 WRGVTPRQAVEAALEPGILHYQGVHKPWR-YNWRYE-GERYERVMREAGLLRGPLPGRTL 296 Query: 301 LKPNNSNQLRYSAKH-MLKKHRYLKGFSNYLF 331 + R + K R +GF N L Sbjct: 297 PAVLKKHLYRPVYRMTARKILRLKEGFDNRLL 328 >UniRef50_P43974 Putative glycosyltransferase HI0258 n=32 Tax=Haemophilus influenzae RepID=Y258_HAEIN Length = 330 Score = 178 bits (452), Expect = 2e-43, Method: Composition-based stats. Identities = 62/300 (20%), Positives = 120/300 (40%), Gaps = 14/300 (4%) Query: 23 ETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQ 82 T + ++I + +D + +SI SI+K ++ F+I +++ + LA Sbjct: 33 RTVSQTMNIIFSSDHYYAPYLAVSIFSIIKNTPK-KINFYILDMKINQENKTIINNLASA 91 Query: 83 YKTRIKIYLINGDRLRSLPST-KNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIE 141 Y ++ + ++ P T + A Y R + Y K +Y+D D + +++ Sbjct: 92 YSCKVFFLPVCESDFQNFPKTIDYISLATYARLNLTKYI-KNIEKAIYIDVDTLTNSSLQ 150 Query: 142 PLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSA 201 L N + +A E ++G+ G YFN+G LLIN +W + + Sbjct: 151 ELWNIDITNYYLAACRDTFIDVKNEAYKKTIGLEGY--SYFNAGILLINLNKWKEENIFQ 208 Query: 202 RAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDT 261 ++I +N+ + K DQD+LN + K+ F + ++N + +K+ + V Sbjct: 209 KSINWMNKYNNVMKYQ--DQDILNGICKGKVKFINNRFNFTPTDRDLIKKKNLLCVKMPI 266 Query: 262 IFIHYIGPTKPWHDWAWDYPVS--QAFMEAKNA-----SPWKNTALLKPNNSNQLRYSAK 314 + HY GP K WH ++ + S W + P R + Sbjct: 267 VISHYCGPNKFWHKKCSHLNCHIGNLLLKEMDKIIDIPSSWYDHFEKIPFLIKIKRLRKR 326 >UniRef50_Q3D427 Glycosyl transferase, family 8 n=8 Tax=Streptococcus agalactiae RepID=Q3D427_STRAG Length = 413 Score = 178 bits (452), Expect = 2e-43, Method: Composition-based stats. Identities = 71/298 (23%), Positives = 105/298 (35%), Gaps = 24/298 (8%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIY 90 IA D + I SI +N+ + F+I D F + + + + I Sbjct: 7 IALAADFGYQEQVKTIIKSICFHNQ--FIDFYILNDDFPVEWFQMMEYHLSKMDCTISNT 64 Query: 91 LINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPD 150 I + ++ K + YFR+ I + KVLYLD D+I + + Sbjct: 65 KIFNEEIKHFKFQKPMPYPTYFRYFIPEVIHE--DKVLYLDCDMIITSDLTSIFTLDISK 122 Query: 151 DKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEP 210 VA V + L + YFNSG LLIN W Q +S R + E Sbjct: 123 YGVAAVRDD-----------LLEEYDGKEDYFNSGLLLINNIFWREQGISQRLLDYTREN 171 Query: 211 EIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQL-----KESFINPVTNDTIFIH 265 + + + DQDVLN +L D + D YN + E +N + IH Sbjct: 172 Q--GALQYHDQDVLNDVLCDNWLELDETYNYHTGADMLYNLFQQSERQLNRRKDLPKVIH 229 Query: 266 YIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKKHRYL 323 Y TKPW + E N W++ N K + HR Sbjct: 230 YT-ATKPWKYLETSVRWRDIWWEY-NRLEWRDIFTRWQVNDGVDVSLKKVLQPIHRAF 285 >UniRef50_C8WAA9 Glycosyl transferase family 8 n=2 Tax=Atopobium RepID=C8WAA9_ATOPD Length = 358 Score = 178 bits (452), Expect = 2e-43, Method: Composition-based stats. Identities = 48/321 (14%), Positives = 97/321 (30%), Gaps = 29/321 (9%) Query: 32 AYGTDKNFLFGCGISIASILKY-NEGSRLCFHIFTDYFGDDDRKYFDALAL-QYKTRIKI 89 + NF+ ++I SI++ N R + T + A + Sbjct: 19 VFACSDNFVPYLSVAIQSIIENVNPERRYDIIVLTRDLSPTNMITLTRQAQLVNNVHVGF 78 Query: 90 YLINGDRLR-SLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSF 148 ++ LP ++ Y+R + K +YLD+D++ I L + Sbjct: 79 LDVDAALGDIELPHHGHFRPETYYRLLAPSLLP-NVNKAIYLDSDLVVNTDIAELYDIDI 137 Query: 149 PDDKVAMVVT-------EGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSA 201 V +G + YF +G +L+N + Q Sbjct: 138 TGYLVGATRDADTIGQIDGYDATVGPYLKNELGMDDPHDYFQAGVILMNLEEIRKQISPE 197 Query: 202 RAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKE---------- 251 + + ++ DQDVLN + + ++K+N + ++ Sbjct: 198 EFLKV----STMRTWRWLDQDVLNRFVNGHYLRINMKWNYLVDWQFLRRDHIVAQAPKDI 253 Query: 252 -SFINPVTNDTIFIHYIGP-TKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQL 309 + H+ GP +PW D ++ F SP+ + S + Sbjct: 254 REEYEEARKNICIAHFAGPDNRPWLYPNSD--LAGLFWFYARRSPYLEELRSQLEESRRT 311 Query: 310 RYSAKHMLKKHRYLKGFSNYL 330 H ++ +G Sbjct: 312 VRGLSHRVQSGVLFRGLMPLF 332 >UniRef50_C2KV37 Lipopolysaccharide biosynthesis protein, LPS:glycosyltransferase n=1 Tax=Oribacterium sinus F0268 RepID=C2KV37_9FIRM Length = 324 Score = 178 bits (452), Expect = 2e-43, Method: Composition-based stats. Identities = 60/328 (18%), Positives = 113/328 (34%), Gaps = 36/328 (10%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 + I YG ++ F+ +S++S+L + EG L FHI + ++ ++ +I Sbjct: 1 MHIVYGVNEAFMPILAVSLSSLLLHAEGEALHFHILSLGIEEESKEKLRQYVETEGQKIS 60 Query: 89 IYLINGDRLRSLPS-----TKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPL 143 Y + T ++ A R I K LYLDAD + +I L Sbjct: 61 FYDLEEKLSEWKEKLPALFTGKFSKATLLRLFIPSTLPETITKALYLDADTVVLQSILSL 120 Query: 144 INFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARA 203 + D + M K+ Y+N+G +L+N + + + + Sbjct: 121 YHLRLGDKLLGMAPEPSI----YKKHKEFLSLAEESPYYNAGVMLMNLSLLREEGMEEKC 176 Query: 204 IAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNY-------QLKESF--- 253 + E ++ DQD+LNM+ ++ ++N + Y + + Sbjct: 177 LRYYQMKE--GQLPFNDQDILNMVCKGRIRSLPQRFNFFSNYAYARYSALCRFSPWYQEL 234 Query: 254 -----INPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQ 308 + + +H+ G +PW + +Y +AF SP L K Sbjct: 235 ESKKSYSQAKAHPVIVHFAGDERPWREGNHNY-YRRAFDYYAEESP---LPLEKEKGKQG 290 Query: 309 LRYSAKHM------LKKHRYLKGFSNYL 330 + + K R G Y Sbjct: 291 YLFCYHVLNLLTFVFPKLREKVGEFYYR 318 >UniRef50_B3XL28 Glycosyl transferase family 8 n=1 Tax=Lactobacillus reuteri 100-23 RepID=B3XL28_LACRE Length = 331 Score = 178 bits (451), Expect = 3e-43, Method: Composition-based stats. Identities = 63/317 (19%), Positives = 120/317 (37%), Gaps = 31/317 (9%) Query: 25 ENLCLDIAYGTDKNFLFGCGISIASILKYNEG-SRLCFHIFTDYFGDDDRKYFDALALQY 83 +I Y TD F G S+ S+L+ N+ ++ F I +++ + + + Sbjct: 1 MKTIYNIVYATDDTFAPVLGTSLLSLLRNNKEAKKINFFILDSGISKENKFRIEKICDNF 60 Query: 84 -KTRIKIYLING--DRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTI 140 +K I ++ + + Y R I D N +VLYLD D + ++ Sbjct: 61 VNASLKWIKIESISKKIGIDVKNDRGSFSQYSRLFIGDVLDNSVERVLYLDCDTLILSSL 120 Query: 141 EPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVS 200 + L N + +A + K FNSG +LI+ W ++ Sbjct: 121 KDLWNIELKGNIIAALKDAFS-----KYYRKNINLVNDDLMFNSGVMLIDLKAWRDNKIK 175 Query: 201 ARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQ---FSLNYQ--------- 248 +AI+ + + K+ DQ VLN +L++K D +YN + L+Y+ Sbjct: 176 EKAISFIRQRH--GKVQQGDQGVLNSVLSNKTFALDPRYNLVSIFYDLDYREIKLYRSPV 233 Query: 249 --LKESFINPVTNDTIFIHYIG---PTKPWHDWAWDYPVSQAFMEAKNASPWKNTALL-- 301 E I + + +H+ +PW ++ + +++ +PWKN L Sbjct: 234 NFYSEKIIVKAKENPVILHFTSSFYSIRPW-FKNSNHQCKKIWLKFYQETPWKNQPLQIE 292 Query: 302 KPNNSNQLRYSAKHMLK 318 + ++ LK Sbjct: 293 MSKKKKLINILFEYGLK 309 >UniRef50_B6G807 Putative uncharacterized protein n=2 Tax=Collinsella RepID=B6G807_9ACTN Length = 276 Score = 177 bits (450), Expect = 4e-43, Method: Composition-based stats. Identities = 39/272 (14%), Positives = 91/272 (33%), Gaps = 11/272 (4%) Query: 24 TENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQY 83 + +D+ D+ +L + S+ N+G+++ + + + Sbjct: 1 MKQHAMDVIVTCDEGYLGPLRTMLYSLRASNQGAQVRIWLLHKGISLPALEELERFCSVL 60 Query: 84 KTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPL 143 I+ ++ L ++ + +Y+R + + LYLD DI+ ++ L Sbjct: 61 GLAIEPVTVDRVLLDGAKCSERYPQEMYYRLLAPSIIKAPIERALYLDPDILVINPLDDL 120 Query: 144 INFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARA 203 + A + + YFN+G +L + A+ Sbjct: 121 FEIDLHGNAFAAASHLDAVHPATALNKAR--LSTSSDYFNTGVILFDIARARKSICVDEL 178 Query: 204 IAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIK-YNTQ-----FSLNYQLKESFINPV 257 + + E + + PDQD+ N L + + +N ++ + ++ V Sbjct: 179 FSYVKAHEQV--MLFPDQDLFNSLFGAVTLRIPDEIWNYDARKYPDNIIRTWGTATLDWV 236 Query: 258 TNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEA 289 T +H+ G KPW + + + Sbjct: 237 MEHTAILHFCGKNKPW-APGYRGQFASLYKHY 267 >UniRef50_Q9L6B2 Putative glycosyl transferase n=3 Tax=Pasteurella RepID=Q9L6B2_PASMU Length = 302 Score = 177 bits (449), Expect = 5e-43, Method: Composition-based stats. Identities = 61/304 (20%), Positives = 127/304 (41%), Gaps = 15/304 (4%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 ++I + +D + ++I SI+ +N + F+IF D++++ + + Y + + Sbjct: 1 MNILFVSDDVYAKHLVVAIKSIINHN-EKGISFYIFDLGIKDENKRNINDIVSSYGSEVN 59 Query: 89 IYLINGDRLRSLPST-KNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 +N S P + A Y R A+Y + K++YLD D++ ++E L N Sbjct: 60 FIAVNEKEFESFPVQISYISLATYARLKAAEYLPDNLNKIIYLDVDVLVFNSLEMLWNVD 119 Query: 148 FPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAML 207 + A + + S+ ++ YFN+G +L N +W V +RA+ +L Sbjct: 120 VNNFLTAACYDSFIENEKSEHKKSISMSDKEY-YFNAGVMLFNLDEWRKMDVFSRALDLL 178 Query: 208 NEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTN-------- 259 ++ + DQD+LN+L +K+ + D ++N + ++K+ ++N Sbjct: 179 AMY--PNQMIYQDQDILNILFRNKVCYLDCRFNFMPNQLERIKQYHKGKLSNLHSLEKTT 236 Query: 260 -DTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLK 318 + HY GP K WH + + + + L+ + + Sbjct: 237 MPVVISHYCGPEKAWHA-DCKHFNVYFYQKILAEITRGTDKERVLSIKTYLKALIRRIRY 295 Query: 319 KHRY 322 K +Y Sbjct: 296 KFKY 299 >UniRef50_B3WD33 WbbM protein n=8 Tax=Lactobacillus RepID=B3WD33_LACCB Length = 318 Score = 177 bits (449), Expect = 6e-43, Method: Composition-based stats. Identities = 58/286 (20%), Positives = 103/286 (36%), Gaps = 21/286 (7%) Query: 24 TENLCLDIAYGTDKNFLFGCGISIASILKY-NEGSRLCFHIFTDYFGDDDRKYFDALALQ 82 + + I + D ++ +++ SI + + +I ++ ALA Sbjct: 2 PQQTTVPIFFSVDDGYVPCLAVALTSIRTNKDPQTDFVINILNSGLLQKNQTRLAALAAP 61 Query: 83 YKTRIKIYLINGDRLR-----SLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQ 137 I ++ + + T IYFR IAD F + K +Y+DAD + Sbjct: 62 -HFTINFIDMDAVTQQISGDTNKLRGDYVTLTIYFRLFIADMFP-QYDKAIYIDADTVAD 119 Query: 138 GTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAG-IAKGYFNSGFLLINTAQWAA 196 G + L D+ VA V + E + G Y NSG L++N AQ Sbjct: 120 GDLAELFTTDLGDNLVAGVADPVMMTYPETIEYIQRDFGVQPGEYINSGVLILNLAQMRQ 179 Query: 197 QQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINP 256 + S R + +L + DQD +N++ ++ + +N Q + Sbjct: 180 EHFSDRFLHLLKTYHFT--MIAADQDYINVIAQHRIKYLPKTWNMQTGVPT--------A 229 Query: 257 VTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLK 302 + IHY KPWH ++ F AS ++ + Sbjct: 230 AESGGKLIHYNLFGKPWHYRDAK--LAANFWHYAPASGFETDLNQQ 273 >UniRef50_C0BAU3 Putative uncharacterized protein n=1 Tax=Coprococcus comes ATCC 27758 RepID=C0BAU3_9FIRM Length = 348 Score = 177 bits (448), Expect = 6e-43, Method: Composition-based stats. Identities = 48/294 (16%), Positives = 93/294 (31%), Gaps = 29/294 (9%) Query: 31 IAYGTDKNFLFGCGISIASILKY-NEGSRLCFHIFTDYFGDDDRKYFD-ALALQYKTRIK 88 + ++ ++ + SI N+ I + L ++ Sbjct: 14 VVLSANEYYVPYLAAVLESIRANSNDDQNYDLIIMHRDISMGSQDRLKKQLEDHQNITLR 73 Query: 89 IYLIN--GDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINF 146 I + L ++ YFR ++ K +Y+D+D++ I L Sbjct: 74 FLDIRRYEKPFKKLFLRGHFALETYFRLLMPQIL-ADYDKAVYIDSDLVVNADIAELYAT 132 Query: 147 SFPDDKVAMVVT-------EGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQV 199 +A G +K ++ YF +G ++ N A++ Sbjct: 133 DVDGYLLAAAKDADTAGLYNGFEPNKKKYMDTILKIKKPYEYFQAGVIVFNLAEFRKTYT 192 Query: 200 SARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNY-----------Q 248 +A + E DQDVLN L ++ F D+ +N + Sbjct: 193 TAEMLKFAASYE----WELLDQDVLNYLAQGRVKFVDMAWNVMVDWRGIRLSQIIALAPK 248 Query: 249 LKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLK 302 + IHY GP KPWH D +++ F + + + T + + Sbjct: 249 YLHDEHMEARKNPKIIHYAGPDKPWHQPWSD--MAEEFWKYSRNTVFYETIMQR 300 >UniRef50_B7C7N8 Putative uncharacterized protein n=2 Tax=Firmicutes RepID=B7C7N8_9FIRM Length = 416 Score = 177 bits (448), Expect = 6e-43, Method: Composition-based stats. Identities = 62/284 (21%), Positives = 101/284 (35%), Gaps = 30/284 (10%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIY 90 I D +++ +I SI +N+ + F+I + + + + + I Sbjct: 22 IVLACDNSYMDKLETTIKSICAHNKN--IKFYILNEDLPIEWFRLMTKRLSYFNSEILNI 79 Query: 91 LINGDRLRSL-PSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFP 149 ++GD + +++ + YFR++I DY KVLYLD DII +++ L N Sbjct: 80 KVSGDSFKKFRCPSEHINYQSYFRYLIPDYVSE--EKVLYLDCDIIVTESLDGLFNLDLK 137 Query: 150 DDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNE 209 + VA V FNSG LLIN W + + I + E Sbjct: 138 NYPVAAVPD----------------LPTTNDGFNSGVLLINNKYWRENDILNKLIKLTVE 181 Query: 210 PEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSL----NYQLKESFINPVTNDTIFIH 265 + DQ +LN+L DK + YN Q + + IH Sbjct: 182 YHEK---VYGDQGILNILFKDKWYRLPLTYNLQVGSDSQEHMIGNMEWYKLFDGIPKVIH 238 Query: 266 YIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQL 309 Y KPW + + + S W L +P Sbjct: 239 YTYTHKPWLMYNMTR-FKEVWWFYHGIS-WDKMILNEPRVYESF 280 >UniRef50_Q65VF6 RfaJ protein n=1 Tax=Mannheimia succiniciproducens MBEL55E RepID=Q65VF6_MANSM Length = 309 Score = 177 bits (448), Expect = 7e-43, Method: Composition-based stats. Identities = 68/313 (21%), Positives = 118/313 (37%), Gaps = 24/313 (7%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALA------LQ 82 ++I + D+N+ + I SIL F+I ++ + L Sbjct: 1 MNIIFNCDENYAPYLSVVIKSILDNTT-LSTQFYILDFNISEESKSCIKNLIQNINKKNS 59 Query: 83 YKTRIKIYLINGDRLRSLPST-KNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIE 141 ++ I I+ + + P T + A Y R +ADY N+ K +YLD DII + Sbjct: 60 FQHSINFIKIDDNDFQCFPQTISYISSATYARLKVADYL-NELNKAIYLDIDIIVISDLS 118 Query: 142 PLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSA 201 L + D+ V + + +G+ ++ Y N+G LL+N + Sbjct: 119 RLWHIDLADNLVGACLDPYIEYENQDYKRKIGLQD-SQPYINAGVLLLNLKALREFNLYQ 177 Query: 202 RAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESF-------- 253 +AI + I DQD+LN +L K++F D +YN + ++K + Sbjct: 178 KAIDW---NKDYPNIQFQDQDILNGVLKGKVLFLDSRYNFTVNHRNRIKLAHKGKLLLSS 234 Query: 254 INPVTNDTIFIHYIGPTKPWHDWAWDYP---VSQAFMEAKNASPWKNTALLKPNNSNQLR 310 + T +HY+G KPW Q + +N P N QL+ Sbjct: 235 LEKATKPICILHYVGSHKPWLPTTTMVKSCLFDQIYNSIRNKPPHWNKKYQSVPLKFQLK 294 Query: 311 YSAKHMLKKHRYL 323 + + K Y Sbjct: 295 RILREIEDKLVYK 307 >UniRef50_C1QC00 LPS:glycosyltransferase n=1 Tax=Brachyspira murdochii DSM 12563 RepID=C1QC00_9SPIR Length = 332 Score = 176 bits (447), Expect = 8e-43, Method: Composition-based stats. Identities = 58/284 (20%), Positives = 98/284 (34%), Gaps = 19/284 (6%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNE-GSRLCFHIFTDYFGDDDRKYFDALALQYKTRI 87 +DI D N+ G +IASIL ++ + FH+ ++++ +L I Sbjct: 1 MDICLSADDNYAKYMGTTIASILSNSKEDEEIYFHLLDGGITEENKNKLLSLKNIKNCDI 60 Query: 88 KIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 Y +N + + +FR + K+LYLD D I +++ L Sbjct: 61 IFYSVNNMNYK-------YDAPHFFRLNVPSLIP-NVDKLLYLDCDTIVLNSLKELFEID 112 Query: 148 FPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAML 207 + ++ + YFNSG L+IN W ++ Sbjct: 113 ISNYYALACEDVFLNCIISF--KNMHGLNVNDIYFNSGMLMINNKLWRDDKLENLFYD-- 168 Query: 208 NEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYI 267 + H DQDVLN ++ ++ D K+N K P + IHY Sbjct: 169 -DYSKFGNTGHADQDVLNRIIKGRVKIVDSKWNFLSHKKVYSKA----PDISLVNIIHYA 223 Query: 268 GPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRY 311 G KPW + + F + +PW L + Sbjct: 224 G-EKPWKETSSKAFFIDEFWKYYQLTPWCRENTLDAVKIMISQK 266 >UniRef50_C9RWX3 Glycosyl transferase family 8 n=8 Tax=Bacillaceae RepID=C9RWX3_GEOSY Length = 276 Score = 176 bits (447), Expect = 8e-43, Method: Composition-based stats. Identities = 55/269 (20%), Positives = 100/269 (37%), Gaps = 10/269 (3%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 + TD N+L + + S+ N F++ +++ + + + Sbjct: 2 FQVLVTTDANYLPPLRVLMHSLFCNNR-RPFTFYLLYSRIAEEEIQALGEFVRRQGHELV 60 Query: 89 IYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSF 148 ++ P +++T +Y+R + +VLYLD DI+ ++ L + F Sbjct: 61 PIYVDPQLFHDAPVFRHYTVEMYYRLAAHLFLPPDVDRVLYLDPDIVAINPMDELYDMDF 120 Query: 149 PDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLN 208 + AKGYFN+G +++N A A + Sbjct: 121 EGNLFIAAEHTHSTKVANLFNKLRLKTPNAKGYFNTGVMMMNIAMMREHVRLADIYQFIR 180 Query: 209 EPEIIKKITHPDQDVLNMLLADKLIFADI-KYNTQFSLNYQL-----KESFINPVTNDTI 262 + K+ PDQDVLN L DK+ D +YN L + + + +T+ Sbjct: 181 DNRF--KLVLPDQDVLNGLYWDKIKPVDCYRYNYDARYYDFLQLLPNPKHDLAWIEENTV 238 Query: 263 FIHYIGPTKPWHDWAWDYPVSQAFMEAKN 291 FIHY G KPW D + + + + Sbjct: 239 FIHYCGKEKPWKD-NYKGELGRFYKRYSQ 266 >UniRef50_D0D9G3 Putative general stress protein A n=1 Tax=Citreicella sp. SE45 RepID=D0D9G3_9RHOB Length = 327 Score = 176 bits (447), Expect = 9e-43, Method: Composition-based stats. Identities = 68/327 (20%), Positives = 119/327 (36%), Gaps = 28/327 (8%) Query: 21 KVETENLCLDIAYGTDKNFLFGCGISIASILKY-NEGSRLCFHIFTDYFGDDDRKYFDAL 79 + +++ Y D G+SIAS L+ EG+ + H+ + +RK + Sbjct: 4 STHQPDKRINVVYACDNIQALPLGVSIASALENRAEGNPINIHVLSYRISRSNRKSIASQ 63 Query: 80 ALQYKTRIKIYLI---NGDRLRSLPSTKNWTH--AIYFRFVIADYFINKAPKVLYLDADI 134 + + I N L L ++ N + A Y R +I++ + +YLD DI Sbjct: 64 FDGRDDTLCWHEITGENRKLLEDLFTSSNRPYPPAAYARLLISEVIP-NIDRAIYLDTDI 122 Query: 135 ICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVA--------GIAKGYFNSGF 186 I + PL N F + + ++ KR +L YF SG Sbjct: 123 IVATDLSPLWNTPFDGAGLLAIQDLPTSNDHIKRLRALLSPEDISRYGIEDGDSYFQSGV 182 Query: 187 LLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSL- 245 L+ + ++ RA ++ +T PD D LN++ D D ++N S+ Sbjct: 183 LVFDMKEFTKT----RASELIECLRNYPDLTFPDNDALNIVFHDSFKLVDPRWNQMASVF 238 Query: 246 ------NYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWK-NT 298 + + D IHY G KPW D +P ++EA S W Sbjct: 239 KLDAARDTPYSAEVFQALLQDPYIIHYSGRPKPWED-GCTHPYLDRWVEALKDSAWNSWK 297 Query: 299 ALLKPNNSNQLRYSAKHMLKKHRYLKG 325 +++ + + K+ R Sbjct: 298 PSRLNRAIDRIPRIQRVLAKRFRRFVS 324 >UniRef50_C1QC03 LPS:glycosyltransferase n=1 Tax=Brachyspira murdochii DSM 12563 RepID=C1QC03_9SPIR Length = 347 Score = 175 bits (445), Expect = 2e-42, Method: Composition-based stats. Identities = 51/285 (17%), Positives = 110/285 (38%), Gaps = 13/285 (4%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEG-SRLCFHIFTDYFGDDDRKYFDALALQYKTRI 87 +++ + ++ + +IAS+L + + +I ++ + +++ +L + I Sbjct: 10 INVCFASNDAYAPYMSTAIASLLSNAKDDENINIYIISENINNSNKEKILSLKKIRECSI 69 Query: 88 KIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 + + + +++ +FR I A K++YLD D+I ++ L + Sbjct: 70 DFIEPKEEIFKYISKYNMKSNSTWFRLSIPSLIP-NADKIVYLDGDMIINSSLRELFSDD 128 Query: 148 FPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAML 207 D +V + YFN+GFL+IN W + + Sbjct: 129 MSDYYAYVVEDVMD---KIDEVKAPIGFSKTDKYFNAGFLMINNKLWIEDNLEEK---FY 182 Query: 208 NEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYI 267 N + + + + DQD+LN L +++ F D K++ L+ + I+ N IH + Sbjct: 183 NAVDTMPILGYKDQDILNYCLKNRVKFIDKKWDF---LDNKSCYKEISADINKINIIHCV 239 Query: 268 GPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYS 312 G KPW + F + +PW + + + Sbjct: 240 G--KPWKKECNVAFFADEFWKYYQLTPWFLERPIDAIQTILAQKY 282 >UniRef50_A3VFX3 Glycosyltransferase n=1 Tax=Rhodobacterales bacterium HTCC2654 RepID=A3VFX3_9RHOB Length = 615 Score = 175 bits (444), Expect = 2e-42, Method: Composition-based stats. Identities = 56/323 (17%), Positives = 110/323 (34%), Gaps = 35/323 (10%) Query: 15 VIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYN-EGSRLCFHIFTDYFGDDDR 73 V+ + + +++A+ +D+ +L +AS++++ + GD D Sbjct: 255 VVPFARGARFNDGAVNVAFTSDRPYLPQTAAMVASLIEHAAPDREYNLFYLHENIGDRDL 314 Query: 74 KYFDALALQYKTRIKIYLINGDR-LRSLPSTKNWTH--AIYFRFVIADYFINKAPKVLYL 130 +LA+ + I ++ IN ++ T A Y RF++ D +++YL Sbjct: 315 DLLRSLAVAHG-NITLHTINVGTAFSREYRARHHTPSNATYNRFLLFDLLP-DVERLVYL 372 Query: 131 DADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVA--------------- 175 D D++ G + L + D +A V R + Sbjct: 373 DVDLVLCGDVAELFDTDMNDAPLAAVTDALMTRVLATRVRTRDPEVPDLYAYLSDDLGLS 432 Query: 176 -GIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIF 234 YFN+G +++N A +V M+ + DQD+LN+ D+ + Sbjct: 433 DDQISRYFNAGVMVMNFAAMDVAKVGRELREMVA----GNRYFFRDQDILNVYFRDRFVT 488 Query: 235 ADIKYNTQFSLNYQLK------ESFINPVTNDTIFIHY-IGPTKPWHDWAWDYPVSQAFM 287 ++N S + D +H+ KPW + + F Sbjct: 489 LPSRFNVHNSDRGAYDNVPVPIRNDALAAKADPFIVHFAAAHQKPWREPDV--EFAGLFW 546 Query: 288 EAKNASPWKNTALLKPNNSNQLR 310 +P+ L LR Sbjct: 547 STLARTPFWFEVLEATRRHRSLR 569 >UniRef50_A3V3C9 Putative uncharacterized protein n=1 Tax=Loktanella vestfoldensis SKA53 RepID=A3V3C9_9RHOB Length = 324 Score = 175 bits (443), Expect = 2e-42, Method: Composition-based stats. Identities = 66/327 (20%), Positives = 123/327 (37%), Gaps = 29/327 (8%) Query: 15 VIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRK 74 I+ + + + + D+++L ++I ++L+ N I Sbjct: 2 TIEIKAENRPQKFRQSVIFCADQSYLPFASLAIHTLLRNNPVRDYDICI-------ASVD 54 Query: 75 YFDALALQYKTRIKIYLIN-GDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDAD 133 I+ I+ G+ +P +K ++ A Y R + + F + ++ YLDAD Sbjct: 55 ALVPPTELKDHDIRFCQIDVGNAFDGMPVSKRFSLAAYLRIALPEAFAGQYDRIFYLDAD 114 Query: 134 IICQGT-IEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTA 192 + G I+ + V V + K G+ YFNSG +L + Sbjct: 115 VFVVGDAIDAVFRLDMLSCPVGAVTDITKLKHPNKPTFDQKALGVDGPYFNSGVMLFDVE 174 Query: 193 QWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKES 252 ++ +V R + + + DQ +LN++L + ++ +N Q+ + L E Sbjct: 175 RFITMRVRERCAEAAKFYQ--GEPIYFDQTLLNIVLQKEWAQLNLGWNWQWPFSRSLFEC 232 Query: 253 FINPVTNDTIFIHYIGPTKPWHDWAWDYP----------VSQAFMEAKNASPWKNTALLK 302 FI D +H+IG KPW D P + + E P + AL Sbjct: 233 FI-----DVQIVHFIGDDKPWSDHKRRLPLKYRETARRFFQKFYPELAQKIPAADAALR- 286 Query: 303 PNNSNQLRYSAKHMLKKHRYLKGFSNY 329 N Y +H+ K H + K F+ + Sbjct: 287 --NGALYHYFFRHITKIHLFTKCFNRH 311 >UniRef50_A4UX76 Putative LPS biosynthesis protein n=2 Tax=Lactobacillaceae RepID=A4UX76_9LACO Length = 316 Score = 175 bits (443), Expect = 2e-42, Method: Composition-based stats. Identities = 66/309 (21%), Positives = 114/309 (36%), Gaps = 25/309 (8%) Query: 24 TENLCLDIAYGTDKNFLFGCGISIASILKYN-EGSRLCFHIFTDYFGDDDRKYFDALALQ 82 EN + I Y D N+ +S+AS++ + + D D++ A Sbjct: 1 MENQTVPIFYAVDDNYAPYLAVSLASLVAHTSPDRHYQVIVLCDDLNTDNQGRLKAF-ET 59 Query: 83 YKTRIKIYLING------DRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIIC 136 +I+ IN + + +T IYFR IA+ F K K LYLDAD + Sbjct: 60 DNLKIQFVSINDRLKQEITDKNNKLRSDYFTFTIYFRLFIAELFP-KLDKALYLDADTVV 118 Query: 137 QGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIA-KGYFNSGFLLINTAQWA 195 + L + D+ V V E + GI + Y SG LL+N A+ Sbjct: 119 LKDVGELFDTQLGDNLVGAVPDPFVGHTPETIDYVEQAVGIDSQKYVCSGVLLMNLAEMR 178 Query: 196 AQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFIN 255 + + + +LN+ K PDQD +N + +++ + + ++ Q + Sbjct: 179 RLKFAEHFLQLLNKYHF--KCLAPDQDYMNAIARNRIYYLNPSWHIQIT----------T 226 Query: 256 PVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKH 315 P D IHY KPW P F + ++ L + + A+ Sbjct: 227 PQDVDPWLIHYNLFAKPWRYDDA--PRQSYFWTYAKQTDYETM-LKQQLADMNPKEVARD 283 Query: 316 MLKKHRYLK 324 + ++ Sbjct: 284 QKNQSDLIQ 292 >UniRef50_C5FDY7 Glycogenin n=1 Tax=Microsporum canis CBS 113480 RepID=C5FDY7_NANOT Length = 731 Score = 175 bits (443), Expect = 2e-42, Method: Composition-based stats. Identities = 49/292 (16%), Positives = 86/292 (29%), Gaps = 39/292 (13%) Query: 32 AYGT---DKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 Y T N+L G + S+ RL + D + Y I Sbjct: 8 VYCTILLSDNYLPGAMVLAHSLRDNGTKGRLAVLVTLDNLQPGIIDELKTV---YDDVIP 64 Query: 89 IYLINGDRLRSLPSTKNWT-HAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 I I +L + + + + + +++Y+DAD+I + L+ Sbjct: 65 IPRIENSYPGNLYLMDRPDLISTFSKIALWK--QTQYDRIVYIDADVIALRAPDELLTLD 122 Query: 148 FPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAML 207 F +A V G D FN+G +++ A+L Sbjct: 123 FKS--IAAVPDIGWPDC-----------------FNTGVIVLRPN-------LKDYYALL 156 Query: 208 NEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYI 267 + DQ +LNM + YN S +YQ ++ + +H+I Sbjct: 157 AFAQRGISFDGADQGLLNMHFKN-WDRLSFTYNCTPSGHYQYVPAY-RYFESTISLVHFI 214 Query: 268 GPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKK 319 G KPW P + + W +H + Sbjct: 215 GSLKPWRIGRSSSPQQSPYNQLLAK--WWAVYDRHYRTGPIYIPQPRHYQSQ 264 >UniRef50_Q3D426 Glycosyl transferase, family 8 n=7 Tax=Streptococcus agalactiae RepID=Q3D426_STRAG Length = 401 Score = 175 bits (443), Expect = 3e-42, Method: Composition-based stats. Identities = 59/284 (20%), Positives = 118/284 (41%), Gaps = 23/284 (8%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIY 90 I G D + +I SI+ +N+ L +I F + + Q+ R+K Sbjct: 5 IVLGADFQYRDQVMTTIKSIVSHNQH--LTIYIINTDFPVEWFNILNHSLEQFDCRVKNI 62 Query: 91 LINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPD 150 I+ D +P+ + + A +FR+ I + + VLYLD+D+I +G+++PL + + + Sbjct: 63 PISSDVFEGIPTLSHISVAGFFRWFIPIHLEEEI--VLYLDSDVIVRGSLDPLFDINLEE 120 Query: 151 DKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEP 210 + + V ++ A FNSG +LIN + W +++ + + ++ Sbjct: 121 NLLGAVADHFSTLYYG---------DTAPVSFNSGVMLINNSLWKKEEIYNSLMRIADKG 171 Query: 211 EIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLN-----YQLKESFINPVTNDTIFIH 265 + DQ+ LN+L ++ I +YN Q + Y + + + + +H Sbjct: 172 SAVGV---GDQEYLNILTQNRWIDIGKQYNVQIGQDVNINAYGRPDLYHFYDDCEPVIVH 228 Query: 266 YIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQL 309 Y KPW+ ++ + W + N N+L Sbjct: 229 YNSQDKPWNKYSQSR-YRSEWWYYFGL-EWSVIYAQQQKNLNRL 270 >UniRef50_C5S494 Putative glycosyl transferase n=2 Tax=Actinobacillus minor RepID=C5S494_9PAST Length = 287 Score = 175 bits (443), Expect = 3e-42, Method: Composition-based stats. Identities = 58/298 (19%), Positives = 104/298 (34%), Gaps = 26/298 (8%) Query: 22 VETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALAL 81 + + ++IA D+N+ I S+ +++ + F++ + D+ + Sbjct: 1 MTNKQQTINIALAADRNYAEQVITLIKSVCYHHKN--VRFYLIHQDYPDEWFMALNQHLT 58 Query: 82 QYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIE 141 I + ++ T A ++R++I + +V+YLD+DI+ G IE Sbjct: 59 NVGAEIIPVTVLDSFRFLSKLQEHITQATFYRYIIPEI---PEDRVIYLDSDIVVDGNIE 115 Query: 142 PLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSA 201 + F V V + K YFN G LLIN W ++ Sbjct: 116 EMYFSDFNGKYVLAVEDMYISYTEHGYIEF----PDLKPYFNGGVLLINNQLWKENDLAE 171 Query: 202 RAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTND- 260 I M + + DQD+LN +L DK YN Q + + N + Sbjct: 172 YLIQMTKQY---PNVMFGDQDILNFVLKDKWGILSHVYNYQTGIIHAFPRLEENMSDEEI 228 Query: 261 ------------TIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNS 306 I IHY KPW + + + + + S W+ + Sbjct: 229 ITKYQKQADEVKPIIIHYTTKYKPWLNSKYFVLLREKYWFYYQLS-WEEIKKHQQELF 285 >UniRef50_D2QX94 Glycosyl transferase family 8 n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2QX94_9PLAN Length = 362 Score = 174 bits (442), Expect = 3e-42, Method: Composition-based stats. Identities = 54/312 (17%), Positives = 108/312 (34%), Gaps = 31/312 (9%) Query: 20 HKVETENLCLDIAYGTDKNFLFGCGISIASILKY-NEGSRLCFHIFTDYFGDDDRKYFDA 78 H + + + +D NF G + S L S + + D+++ Sbjct: 6 HPTQNMPTSIQLVTSSDNNFAIGLAGTFKSALTNLAADSSVDLWVLDGGITDENKAEISR 65 Query: 79 LALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQG 138 + + ++ + + + T A Y+R + + K +YLD+D++ +G Sbjct: 66 HLSDPRLTLHFVSVDRKLVSQFVISHHVTDATYYRLLTPEILSRDIGKFIYLDSDLLIRG 125 Query: 139 TIEPLINFSFPDDKVAMVVTEGQA-------------------DWWEKRAHSLGVAGIAK 179 + L N F + G + + Sbjct: 126 DLTKLWNTPFDGAPCVAIQDSGAPFVDSTQLIEQQPSLRGCIANANPIPNYRELGLHPHA 185 Query: 180 GYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKY 239 Y N G ++I+ W +Q++ R + +L++ +T+ DQ LN++L+ + AD ++ Sbjct: 186 PYLNGGVMMIDLDLWRREQLAERMLKVLSDYREH--VTYWDQYALNVVLSQRWKQADHRW 243 Query: 240 N-------TQFSLNYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNA 292 N N + + ND H+ KPW +P S+ F + Sbjct: 244 NQIAYPLRFSSHENTIFSKEAFDLYRNDPYISHFT-YRKPWQAE-CIHPRSEEFYQYLEG 301 Query: 293 SPWKNTALLKPN 304 S W NT + Sbjct: 302 SIWANTKPVWQE 313 >UniRef50_A4VVV8 Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases n=7 Tax=Firmicutes RepID=A4VVV8_STRSY Length = 334 Score = 174 bits (442), Expect = 3e-42, Method: Composition-based stats. Identities = 46/295 (15%), Positives = 100/295 (33%), Gaps = 25/295 (8%) Query: 24 TENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRL-CFHIFTDYFGDDDRKYFDALALQ 82 E ++I + + F+ + SI++ + F++F+D +++ Sbjct: 1 MEEGNVNILFTLNDAFVPQVAACMGSIMRTLDEDDTCHFYLFSDGISQQNKENLHQFVTD 60 Query: 83 YKTRIKIYLING--DRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTI 140 ++ I + T W + R ++ + +++YLD D + I Sbjct: 61 GGNKLTIVELENLESYFDFEVDTNGWASVVLARLLVDKLLPEEVDRIIYLDGDTLVLENI 120 Query: 141 EPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVS 200 L + M + G+ Y N+G LLI+ +W ++ + Sbjct: 121 RELWEVDLEGKVLGMCPEPTAS-----SERREGLNLGTYTYHNAGVLLIDLKRWRSKSIG 175 Query: 201 ARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQ------------ 248 E ++ DQD LN L +++ I YN + Sbjct: 176 TIIFDYYKEKN--GELFANDQDALNGALKEEIKTLSITYNYFNIFDVYPYRTLEKLSRPS 233 Query: 249 --LKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALL 301 + + + +H++G +PW + + ++ A N +PW+ T Sbjct: 234 TFISKEEFVKIRKQPRIVHFLGEERPWR-IGNKHRFREDYVSALNQTPWRGTQFE 287 >UniRef50_D1JY84 Putative uncharacterized protein n=1 Tax=Bacteroides sp. 3_1_33FAA RepID=D1JY84_9BACE Length = 312 Score = 174 bits (442), Expect = 4e-42, Method: Composition-based stats. Identities = 58/274 (21%), Positives = 102/274 (37%), Gaps = 13/274 (4%) Query: 24 TENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQY 83 + + IA+ + ++ +SI +L+ N L HI +DY D + L Y Sbjct: 1 MISSPMHIAFCVNDHYAEYILVSIKGLLENNSD-PLVIHILSDYISDKNTNRLKKLVGLY 59 Query: 84 KTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPL 143 I +I D L+ WT ++R ++ + +VLYLDAD + IE L Sbjct: 60 PNAILDIVI-VDDLKLKDLKDTWTIYTWYRVLLPEILDASVHRVLYLDADTLVSENIEEL 118 Query: 144 INFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARA 203 + +A V D K + K Y +G +++N W ++ + Sbjct: 119 FSLDMTGKAIAGTVDFQSKD---KSTYQRCGYEAEKEYVCAGVMMMNLDYWREHDIANKI 175 Query: 204 IAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYN-----TQFSLNYQLKESFINPVT 258 I + +I +PDQD +N + D + +KY+ Q +Q + Sbjct: 176 IDWGRDYN--DRIQYPDQDAINYICRDMKLLLPLKYDIIDGFFQDDYYFQNYPQELRECI 233 Query: 259 NDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNA 292 IHY G PW ++ + + Sbjct: 234 ESPAIIHYAGQA-PWVVEISNHLLQDEWERYNKL 266 >UniRef50_A0KQP2 Glycosyl transferase, family 8 n=2 Tax=Aeromonas RepID=A0KQP2_AERHH Length = 366 Score = 174 bits (441), Expect = 4e-42, Method: Composition-based stats. Identities = 71/298 (23%), Positives = 132/298 (44%), Gaps = 22/298 (7%) Query: 25 ENLCLDIAYGTDKNFLFGCGISIASILKY-NEGSRLCFHIFTDYFGDDDRKYFDALALQY 83 + A+ D +F I S+ K+ + +L H+ + ++ L + Sbjct: 1 MRKIIHSAFCIDDSFAVHLAALIHSLGKHLSHDLQLQCHVLA-RLSETNKFKLSKLESE- 58 Query: 84 KTRIKIYL--INGDRLRSLPSTKNW-THAIYFRFVIADYFINKAPKVLYLDADIICQGTI 140 IK Y + + N Y+RF I KVL++D+D+I G I Sbjct: 59 NLVIKFYDNLPDYKDIPISNLYNNRLNEVTYYRFAIPHIL-KSIDKVLFIDSDMIALGDI 117 Query: 141 EPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVS 200 PL + D VA+V +K+ +G YFN+GF+L+N +W A+ +S Sbjct: 118 SPLWSIDMGDAIVAVVSDHILGCDKKKQLMRGISSG---KYFNAGFMLMNLDKWRAKNIS 174 Query: 201 ARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTND 260 +A+ +L E H DQD LN++L +K ++ D K+N Q N+ + +F+ Sbjct: 175 EQALRLLIEN---NGFEHNDQDALNIVLENKTVYIDNKWNAQP--NHLAQNNFL------ 223 Query: 261 TIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLK 318 I +H+ G KPWH ++ ++P +++ ++ + + N L + + + ++ L Sbjct: 224 PILVHFCGQEKPWHIYS-NHPFKGSYLVSRRETDYANEPLQSYLDDHDIEILSRLRLS 280 >UniRef50_B9BAZ6 Glycosyl transferase family 8 protein n=1 Tax=Burkholderia multivorans CGD1 RepID=B9BAZ6_9BURK Length = 617 Score = 173 bits (439), Expect = 7e-42, Method: Composition-based stats. Identities = 60/356 (16%), Positives = 117/356 (32%), Gaps = 41/356 (11%) Query: 12 LNSVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSR-LCFHIFTDYFGD 70 + + + I D NF+ IAS+ + R L + Sbjct: 267 VKGSHTHVPPEPLGGNAVSIVTVADGNFVPHLAAFIASVQDNIDPERVLDLIVLDGGIPA 326 Query: 71 DDRKYF-DALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLY 129 D ++ K R+ +P ++ A ++R + + K +V+Y Sbjct: 327 DQQRLLMKQFHRNGKGRLSFIQ-CAHLFSDIPLHGPFSAATFYRLSMGELL-AKHRRVVY 384 Query: 130 LDADIICQGTIEPLINFSFPDDKVAMVVTE----------------GQADWWEKRAHSLG 173 +D+D I G + L + ++ VA V G A +G Sbjct: 385 VDSDTIVLGDLSELFDLDLGNNAVAAVPDVIMKSFVSSGVPALREAGGAPAGIYLKERVG 444 Query: 174 VAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLI 233 + YF +G ++I+ ++ ++ A L + ++ DQDVLN L + Sbjct: 445 MGNRGNEYFQAGLIVIDLDEFRRLRIGEDAYKDL----LARRYWFLDQDVLNKYLLGHVK 500 Query: 234 FADIKYN-------TQFSLNYQLKESFINPVTNDTIFIHYIGPT-KPWHDWAWDYPVSQA 285 F D+ +N L + +HY G KPW+ P++ Sbjct: 501 FLDLSWNVVNASMDVLSGLETDIAAKVKEVFAA-PSMVHYAGHEAKPWNRPTA--PLAHF 557 Query: 286 FMEAKNASPWKNTAL-LKPNNSN-----QLRYSAKHMLKKHRYLKGFSNYLFYFIE 335 + + W + + +P + Q K + R + GF +++ Sbjct: 558 YWYYLRRTYWYESVIDRRPISPTLDVELQRSRLYKRLRAIWRRMPGFVQRRLFWLR 613 >UniRef50_B6VUC8 Putative uncharacterized protein n=1 Tax=Bacteroides dorei DSM 17855 RepID=B6VUC8_9BACE Length = 315 Score = 172 bits (436), Expect = 2e-41, Method: Composition-based stats. Identities = 57/316 (18%), Positives = 120/316 (37%), Gaps = 17/316 (5%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 + I + + C + + S+ + N+ + ++F+ D++ K + L +Y T+++ Sbjct: 2 ISILCNSSNEYAIHCKVMLTSLFENNKQNDKEVYVFSTSMSDENIKGLELLGQRYGTKVQ 61 Query: 89 IYLINGDRLRSLPSTK-NWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 I +++ +L+ LP A Y R AD K+LYLD DII ++ L + Sbjct: 62 IIIVDSQKLQFLPIHFAYHNIACYLRLFAADLLPG-INKLLYLDCDIIVNSDLKALWDID 120 Query: 148 FPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAML 207 D A + K+ L Y N+G +LIN W V+ + + Sbjct: 121 ITDYAFAATHDLTYCEPNFKKNLQL---EENDTYINTGVMLINCDYWRNNNVAQKVLDYA 177 Query: 208 NEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESF------INPVTNDT 261 K+ DQD LN + ++N Y+ + ++ + + Sbjct: 178 IHN--GDKMIAADQDALNATMQGSFKLFSEEWNVYPDYFYEKPNLYTNVYPILDEIRRNP 235 Query: 262 IFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKKHR 321 IH++ KPW ++ ++P+ + + + + + N ++ + Sbjct: 236 KIIHFL-YVKPWFNY-CNHPLRYLYGKYYAIA--EGKPFILKRNKESIKRDIARLKHCLL 291 Query: 322 YLKGFSNYLFYFIEKI 337 G Y + ++ Sbjct: 292 DFMGIKYYYHVYDKRF 307 >UniRef50_B8G232 Glycosyl transferase family 8 n=2 Tax=Desulfitobacterium hafniense RepID=B8G232_DESHD Length = 280 Score = 172 bits (435), Expect = 2e-41, Method: Composition-based stats. Identities = 52/260 (20%), Positives = 108/260 (41%), Gaps = 13/260 (5%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 ++I + +++ + + S+L N G + ++ +D D + +++ Sbjct: 1 MNILVTLNSSYVKQLMVMLTSLLDSNPGEQFTVYVAHSAMSKEDFARIDQAIDSSRCKVE 60 Query: 89 IYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSF 148 ++ + L P T + +Y+R +Y + ++LYLD D++ ++ L F Sbjct: 61 GIKLSDEGLSKAPITSRYPKEMYYRIFAVNYLPDHLERILYLDPDLVVINPLKELYTIDF 120 Query: 149 PDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLN 208 + A + +K H Y NSG +++N + +Q + Sbjct: 121 QGNFFAAASH--VKELLKKLNHVRLNMAEDSTYVNSGVMMMNLSLLRQEQDVHEVYQYIE 178 Query: 209 EPEIIKKITHPDQDVLNMLLAD-------KLIFADIKYNTQFSLNYQLKES--FINPVTN 259 E + ++ PDQDVLN + +D K+ +Y ++LN + ++ ++ V + Sbjct: 179 EYKH--RLFLPDQDVLNGVYSDRTLTVDAKIYNLSERYYALYNLNPKYWDAKIDLDWVRS 236 Query: 260 DTIFIHYIGPTKPWHDWAWD 279 +T IHY G KPW D Sbjct: 237 NTAIIHYCGRNKPWKDNYIG 256 >UniRef50_Q4HGS8 General stress protein A, putative n=2 Tax=Campylobacter RepID=Q4HGS8_CAMCO Length = 403 Score = 171 bits (434), Expect = 3e-41, Method: Composition-based stats. Identities = 68/317 (21%), Positives = 122/317 (38%), Gaps = 38/317 (11%) Query: 30 DIAYGTDKNFLFGCGISIASILKYN------EGSRLCFHIFTDYFGDDDRKYF----DAL 79 I + D+N++ + I SI+K + FHI +++ ++ R+ L Sbjct: 3 HIIFSADENYIKYTSVLITSIIKNTNPKNHFQNRPYSFHILSNFVSEETREKLECLKKEL 62 Query: 80 ALQYKTRIKIYLINGDRLRSLPSTK--NWTHAIYFRFVIADYFINKAPKVLYLDADIICQ 137 Y I I++++ DR + PS+ + Y+R F + K LYLD+D++C Sbjct: 63 NKIYPCEISIHIMSDDRFENFPSSGAAQNSKLPYYRLKFISLFDDNVDKCLYLDSDMLCM 122 Query: 138 GTIEPLINFSFPDDKVAMVVTEGQADWWEKR--AHSLGVAGIAKGYFNSGFLLINTAQWA 195 I + + +V G K ++ V + YFNSGFLLIN ++ Sbjct: 123 CDIREIFAIDLQGKIIGVVGDPGSKRSKIKFIENNTKKVLKFDENYFNSGFLLINAKEYK 182 Query: 196 AQQVSARAIAMLNEPEIIKKITHPDQDVLNMLL-ADKLIFADIKYNT-----QFSLNYQL 249 V + + + I DQD+LN ++ DK++ YN + + Sbjct: 183 KANVEKKCEELAKKC---IYIKAADQDLLNAVISKDKILKLSFAYNFNIITLLYVICKDE 239 Query: 250 KESFINPVT-------NDTIFIHYIGPTKPWHDWAW-----DYPVSQAFMEAKNASP-WK 296 K++ +N + +HY KPW + +S + + P +K Sbjct: 240 KKNRLNYTREEFTQSAKNPKILHY--GEKPWKFLKSYVDLQNRNISDYWWDIAKEVPIFK 297 Query: 297 NTALLKPNNSNQLRYSA 313 L + N A Sbjct: 298 EELLRQKENIKDYLLYA 314 >UniRef50_C8N7M8 Putative uncharacterized protein n=1 Tax=Cardiobacterium hominis ATCC 15826 RepID=C8N7M8_9GAMM Length = 618 Score = 171 bits (433), Expect = 4e-41, Method: Composition-based stats. Identities = 60/304 (19%), Positives = 109/304 (35%), Gaps = 33/304 (10%) Query: 18 YDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNE-GSRLCFHIFTDYFGDDDRKYF 76 Y V+T+ + + +D N+ G I SIL + L I +RK Sbjct: 269 YAQPVQTDKPVVSVVIASDDNYTPHLGALICSILDHFPADKYLDLIILDGGISALNRKLL 328 Query: 77 DALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIIC 136 L I+ + D + L + +++ A ++R ++ + KVLY+D D I Sbjct: 329 MRLLPT-HANIQFLEL-KDEFQQLATHMHFSRATFYRLILDKLIPGR-DKVLYIDCDTIV 385 Query: 137 QGTIEPLINFSFPDDKVAMVVTE----------------GQADWWEKRAHSLGVAGIAKG 180 I L + D + V G +G+ + Sbjct: 386 LDDISTLFDTPLGDHAIGAVFDYIMHHFCLNDVLSIDTTGSLPAKRYLHDYVGLEDGWQR 445 Query: 181 YFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYN 240 YF +G +L N + +S I+ L + K+ DQD+LN +++ D ++N Sbjct: 446 YFQAGVILFNMEKLRRLDLSEVMISDL----LNKRYWFLDQDILNKYFLGDVVYLDPRWN 501 Query: 241 TQFSLNYQL------KESFINPVTNDTIFIHYIGP-TKPWHDWAWDYPVSQAFMEAKNAS 293 + S+ + + D IHY G TKPW+ +++ + + Sbjct: 502 SVNSVQNIYQGLPATYIAELKTTETDPKIIHYAGFETKPWN--NRYAELAEYYFYYLRQT 559 Query: 294 PWKN 297 W Sbjct: 560 FWYE 563 >UniRef50_Q5M3K9 Glycosyl transferase n=4 Tax=Streptococcus RepID=Q5M3K9_STRT2 Length = 697 Score = 170 bits (432), Expect = 4e-41, Method: Composition-based stats. Identities = 52/298 (17%), Positives = 106/298 (35%), Gaps = 27/298 (9%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIY 90 I + ++ +I SI+ ++ + F++ D F + + + + + + Sbjct: 303 IVLAANYTYVDQVLTTIKSIVFHHRN--IRFYLINDDFSQEWFRGLNRHLAAFGSEVINC 360 Query: 91 LINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPD 150 ++ ++ + N +A Y R+ +AD+ + LYLD+D++ G++E L Sbjct: 361 RVDSSHIKQFKTNSN--YASYLRYFVADFVSE--ERALYLDSDMVVTGSLEDLFTLDLQG 416 Query: 151 DKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEP 210 +A V + F++GF++I+TA W + I M +E Sbjct: 417 RPLAAVRDYAVQ------------GQDRQAMFDAGFMVIDTAYWKQYNMRRHLIDMTSEW 464 Query: 211 EIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIGPT 270 K+ +Q +LNM+ + + N + L + + +HY Sbjct: 465 H--DKVPFAEQSILNMVFCNNWLTLSFDNNYAVT-KSSLSGYHLPNGQDYPKVLHYTSHR 521 Query: 271 KPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKKHRYLKGFSN 328 KPW A + + W N+ L S + K R Sbjct: 522 KPWLPLACQ-AYREVWWFYAQM-DWSGV----AENAALLPLSEDMIYPKGRPFTCLVY 573 >UniRef50_UPI0001A45357 glycosyl transferase family protein n=1 Tax=Neisseria subflava NJ9703 RepID=UPI0001A45357 Length = 264 Score = 170 bits (432), Expect = 5e-41, Method: Composition-based stats. Identities = 56/267 (20%), Positives = 95/267 (35%), Gaps = 18/267 (6%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 + I D + + S+ +N G + F++ + F + Y + +R+ Sbjct: 4 ITIVLAADTGYAEQVHTLMKSVCTHNTG--VNFYLMHNTFRKEWINYTNQKLAASGSRLN 61 Query: 89 IYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSF 148 I D + + A +FR ++ + LYLD+D++ ++ L N Sbjct: 62 DVKIEMD-FSQYRRLSHISDAAFFRLMMQHL---PVDRALYLDSDMVVTQSLHDLFNLDM 117 Query: 149 PDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLN 208 VA V A L YFNSG LL + QW ++ + + Sbjct: 118 RGYPVAAVQDSYLARTDWNHPTGL----HTTPYFNSGMLLADLGQWRKHNIAEQLLQ--T 171 Query: 209 EPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLN-----YQLKESFINPVTNDTIF 263 I K + + DQ LN + + + + +N Q Y L E F P T Sbjct: 172 AATIDKTVPYGDQCFLNTVFQENWLQLEESWNYQTGARRFFQTYDLDEMFPLPDTTPP-I 230 Query: 264 IHYIGPTKPWHDWAWDYPVSQAFMEAK 290 IHY KPW P + + + Sbjct: 231 IHYTTLAKPWLCDYGKIPFEEIYWQYY 257 >UniRef50_C2LRU0 Glycosyl transferase, family 8 n=1 Tax=Streptococcus salivarius SK126 RepID=C2LRU0_STRSL Length = 402 Score = 170 bits (431), Expect = 6e-41, Method: Composition-based stats. Identities = 44/285 (15%), Positives = 94/285 (32%), Gaps = 30/285 (10%) Query: 30 DIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKI 89 + + + +++ +++ S+ + + ++ + + + + I Sbjct: 5 SVVFVAELSYMEKLEVALKSLCAH--KGQWKIYVLNENLPTEWFTLMNRRLEAIDSEILN 62 Query: 90 YLINGDRLRSLP-STKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSF 148 ++ + + + + +A +FR+ I ++ +VLYLD D+I + PL Sbjct: 63 CRVSAESFKQFSLPSAHIHYATFFRYAIPEFVQEN--RVLYLDCDMIFTQDLSPLFEVDL 120 Query: 149 PDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLN 208 + VV FN+G ++I+T W +V+ + Sbjct: 121 GGLGIGAVVD----------------RPTTTDGFNAGLMVIDTDWWRQHKVTDSLFDLTK 164 Query: 209 EPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSL----NYQLKESFINPVTNDTIFI 264 E + DQ +LN+ D YN Q + + I Sbjct: 165 EHHQN---VYGDQGILNLYFKDAWYQLPWTYNLQVGSDKDQYGYGDLEWYDAFKGVPAVI 221 Query: 265 HYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQL 309 HY KPW ++ + S W+ L KP+ Sbjct: 222 HYTSHNKPWTSKRFNR-FRDIWWFYYALS-WEEILLRKPSLKISF 264 >UniRef50_B2ISC2 Glycosyl transferase, family 8 n=2 Tax=Streptococcus pneumoniae RepID=B2ISC2_STRPS Length = 401 Score = 170 bits (430), Expect = 7e-41, Method: Composition-based stats. Identities = 55/289 (19%), Positives = 98/289 (33%), Gaps = 30/289 (10%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIY 90 I G D +++ +I SI ++ + F++F + + D + I Sbjct: 5 IVLGADNHYMDKVETTIKSIC--SKNKEVKFYVFNSDLPTEWFQLMDKRLSVLGSEIVNV 62 Query: 91 LINGDRLRSLP-STKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFP 149 + + T + + A Y R+ I K + LYLD+DII + L F Sbjct: 63 KVTESLINQFHLPTPHLSSATYLRYFIPTIVFEK--RALYLDSDIIVTADLTSLFEFPLD 120 Query: 150 DDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNE 209 +A V FNSG LLI+T +W + + + + + Sbjct: 121 GCPLAAVPD----------------IPNTSEGFNSGVLLIDTDRWREDDIQNQLLNLTIK 164 Query: 210 PEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLK----ESFINPVTNDTIFIH 265 + + DQ++LNML D+ + YN Q + + + IH Sbjct: 165 HH---EHVYGDQEILNMLFKDRWKKLSLSYNLQVGYDTYRHSLGDNEWYHLFEGIPNIIH 221 Query: 266 YIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAK 314 Y KPW + ++ + + W + L K Sbjct: 222 YTTQNKPWSHYRFNR-FRDIWWFYYGLN-WNDILLDNQILQENFEKLIK 268 >UniRef50_A8FNA2 Putative uncharacterized protein n=2 Tax=Campylobacter jejuni subsp. jejuni 81116 RepID=A8FNA2_CAMJ8 Length = 791 Score = 169 bits (428), Expect = 2e-40, Method: Composition-based stats. Identities = 55/325 (16%), Positives = 107/325 (32%), Gaps = 22/325 (6%) Query: 21 KVETENLCLDIAYGTDKNFLFGCGISIASILKYNE-GSRLCFHIFTDYFGDDDRKYFDAL 79 + + + I + D N+ + + SI + + +I + + Sbjct: 375 PPQDKLSHIPIVFSCDANYFSYLTVVLQSIKEKSSENYNYDIYILHNKLDKSLTQKLINY 434 Query: 80 ALQYKTRIKIYLING-----DRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADI 134 IK I+ + ++ A Y+RF I F + K++YLD DI Sbjct: 435 IQAENFSIKFVDISRILNLLKSQIQFYTALFFSEATYYRFFIPKIF-KEFKKIIYLDTDI 493 Query: 135 ICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQW 194 I + + L + F A + + YF +G ++ N + Sbjct: 494 IVKQDLNLLYSIDFDKPLAAAKCMIFSQVKQADHRITKLKMKQPENYFQAGVMVYNIQKC 553 Query: 195 AAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKE--- 251 + + + L E + DQDVLN + + + +K+N ++++Y++ Sbjct: 554 LKMDFTQKCLNKLQELKDPP---LVDQDVLNAVFEGDIHYISLKWNCLWNVSYRIPNFKI 610 Query: 252 -------SFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPN 304 D IHY KPW+ P + + +P+ L K Sbjct: 611 LYSKDFLKDYQEAERDPYIIHYCDYFKPWNSP--HLPKADIWWHYARQTPFYEEILFKNI 668 Query: 305 NSNQLRYSAKHMLKKHRYLKGFSNY 329 N L + +K +Y Sbjct: 669 TQNSLNIIQNSIQGAVERVKAHLSY 693 >UniRef50_A5LNA9 Glycosyl transferase, family 8 n=2 Tax=Streptococcus pneumoniae RepID=A5LNA9_STRPN Length = 402 Score = 169 bits (427), Expect = 2e-40, Method: Composition-based stats. Identities = 66/317 (20%), Positives = 115/317 (36%), Gaps = 36/317 (11%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIY 90 I G D N+ +I SI +N L F+IF + + + + I Sbjct: 7 IVLGADNNYRDKLETTIKSICYHNRD--LKFYIFNEDIPKEWFYLMEKRLEKLNCEILNI 64 Query: 91 LINGDRLRSLPSTK-NWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFP 149 I+ ++++ + + + YFR+ IA++ + +YLD D++ G I PL F Sbjct: 65 EIDAEKVKYFSTPDEHIKYMTYFRYFIAEFVKE--DRAVYLDCDMVIHGNINPLFQKDFE 122 Query: 150 DDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNE 209 + + V G K FN+G +++N +W + + + E Sbjct: 123 GNYIIAVPD-----------------GWYKNIFNAGMMMVNVHKWKTDNICQNLLELTAE 165 Query: 210 PEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNY-----QLKESFINPVTND--TI 262 + DQ VLN+L +K YN L+ Q E F+N + Sbjct: 166 KHQE---IYGDQGVLNLLFENKWKKVSPHYNFMVGLDTLGYWAQKPEWFLNSWDENYKPA 222 Query: 263 FIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKKHRY 322 IH+ G KPW+D + + + N W+ N A L Sbjct: 223 IIHFEGKDKPWND-SLKTRYRELWWFY-NGLDWQTILSQVDNKPTTFSEIATVSLFHTAI 280 Query: 323 LKG--FSNYLFYFIEKI 337 ++ Y +EK+ Sbjct: 281 FTDTHELEHIEYLVEKL 297 >UniRef50_B6HCQ7 Pc18g02120 protein n=2 Tax=mitosporic Trichocomaceae RepID=B6HCQ7_PENCW Length = 711 Score = 168 bits (426), Expect = 3e-40, Method: Composition-based stats. Identities = 48/280 (17%), Positives = 85/280 (30%), Gaps = 40/280 (14%) Query: 32 AYGT---DKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 Y T N+L G + S+ +RL D ++ Y I Sbjct: 8 VYCTLLLSDNYLPGAMVLAHSLRDNGTKARLVALFTPDRLQSSTIDELRSV---YDELIP 64 Query: 89 IYLINGDRLRSLPSTKNWT-HAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 + + D +L A + + + + +V+Y+D D++ + L++ Sbjct: 65 VSSMVNDTPANLWLMDRPDLIATFTKIELWRL--TQYQRVVYIDCDVVALRAPDELLSL- 121 Query: 148 FPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAML 207 + A G D FNSG +++ A+ Sbjct: 122 --EADFAAAPDVGWPDC-----------------FNSGMMVLRPN-------LQDYYALR 155 Query: 208 NEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYI 267 + DQ +LNM D YN S NYQ ++ + IH+I Sbjct: 156 ALAQRGISFDGADQGLLNMHFRD-WHRLSFTYNCTPSANYQYIPAY-KHFQSTISLIHFI 213 Query: 268 GPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSN 307 G KPW+ P+ + + W + Sbjct: 214 GARKPWNMPRQIVPLESPYNQLLGR--WWAVYDRHYRLPS 251 >UniRef50_UPI0001AEC697 glycosyl transferase family protein n=1 Tax=Alteromonas macleodii ATCC 27126 RepID=UPI0001AEC697 Length = 361 Score = 168 bits (425), Expect = 3e-40, Method: Composition-based stats. Identities = 62/290 (21%), Positives = 117/290 (40%), Gaps = 24/290 (8%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 + +A+ D F +S+ SIL + S + + + R+ L L+ Sbjct: 2 ISVAFCIDDKFAPYAAVSVISILSNTK-SFVNIYFI-GNLSEGVREKL--LTLKNDRSAM 57 Query: 89 IYLINGDRLRSLPSTKNW----THAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLI 144 +++ + L ++P + + + R+ IA+ K KV+YLDAD++ G I+ L Sbjct: 58 VFVAHNLPLSTMPLSDRYVERLNKITFVRYAIAEVL-TKLDKVIYLDADVLVCGDIKRLW 116 Query: 145 NFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAI 204 V V+ SL +K YFN+G LL++ W +++ Sbjct: 117 EQPLKKSYVGAVLDHSLMSQKRHITLSLK----SKSYFNAGVLLVDLKIWRDRRIFQ--- 169 Query: 205 AMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFI 264 + ++ + DQDVLN++L +K+ + N Q S + + + + Sbjct: 170 YLSRTHNTRERWEYNDQDVLNVVLDEKVQYLGADMNVQTY-------SLKHINIKEPLIV 222 Query: 265 HYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAK 314 H+ G KPWH + +P + + P+KN L + A+ Sbjct: 223 HFTGQEKPWHT-SSVHPYKDQYRVLLESVPFKNNKLSLYLDKEDRTILAR 271 >UniRef50_C1IBL0 Glycosyl transferase n=1 Tax=Clostridium sp. 7_2_43FAA RepID=C1IBL0_9CLOT Length = 273 Score = 168 bits (425), Expect = 3e-40, Method: Composition-based stats. Identities = 47/276 (17%), Positives = 102/276 (36%), Gaps = 11/276 (3%) Query: 24 TENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQY 83 + +D+ DKN++ + S++ N + + + + + Sbjct: 1 MSSNRIDLLVTFDKNYIPPFQTMLKSLVLNNPRETFHIWLLHSEIPLEMLQEVEEYCAKQ 60 Query: 84 KTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPL 143 + + ++ P +K + +Y+R + K+LYLD DI+ +I PL Sbjct: 61 GAAMTSINVERSVFKNAPVSKRYPQEMYYRLLAPLILPKSIKKILYLDPDILIINSIRPL 120 Query: 144 INFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARA 203 + A G Y+NSG +L++ + + Sbjct: 121 WETELGNYIFAAASHVGVTGVINDINRVRLRVDHD--YYNSGVMLMDLTKARSIVNVEEI 178 Query: 204 IAMLNEPEIIKKITHPDQDVLNMLLADKLIFADI-KYNT--QFSLNYQLKES---FINPV 257 + E + +++ PDQD+ N L + + D +N + NY L+ ++ + Sbjct: 179 FQCVREHK--EELLLPDQDIFNYLYGKQTLPLDDAIWNYDARKYSNYLLRSGGNYDMDWI 236 Query: 258 TNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNAS 293 T +T+ +H+ G +KPW + + + + S Sbjct: 237 TRNTVVLHFCGKSKPWKH-SQNNRFAMLYKHYMQIS 271 >UniRef50_C7RG54 Glycosyl transferase family 8 n=1 Tax=Anaerococcus prevotii DSM 20548 RepID=C7RG54_ANAPD Length = 273 Score = 167 bits (424), Expect = 3e-40, Method: Composition-based stats. Identities = 49/268 (18%), Positives = 99/268 (36%), Gaps = 11/268 (4%) Query: 32 AYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIYL 91 D+N++ + + SI N G ++ +D K ++ + Sbjct: 6 LLTLDENYIPQMKVLMTSIYINNPGRIFDVYLIHSRISEDKLKDLGEDLKKFSYTLYPIR 65 Query: 92 INGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDD 151 D T + +Y+R + ++ ++LYLD D++ ++ L+ D Sbjct: 66 ATDDLFSFAKVTDRYPKEMYYRLLAGEFLPENLGEILYLDPDMLVINPLDDLLRTDISDY 125 Query: 152 KVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPE 211 +A G+ D G Y+NSG LLIN + + + + + Sbjct: 126 ILAAASHTGKTDMANNVNRIR--LGTDTDYYNSGLLLINLKRAREEIDPDEIFSFVEDNH 183 Query: 212 IIKKITHPDQDVLNMLLADKLIFAD---IKY---NTQFSLNYQLKESFINPVTNDTIFIH 265 + + PDQD+LN + D++ D Y N L K++ + + + T+ +H Sbjct: 184 M--NLLLPDQDILNAMYGDRIYPLDDLIYNYDARNYSSYLIRSKKQADLAWLMDHTVVLH 241 Query: 266 YIGPTKPWHDWAWDYPVSQAFMEAKNAS 293 + G KPW + + + + Sbjct: 242 FCGRDKPWKK-NHRNKFTSLYKHYMSLT 268 >UniRef50_A3CM53 Glycosyltransferase, putative n=9 Tax=Bacteria RepID=A3CM53_STRSV Length = 1074 Score = 166 bits (421), Expect = 9e-40, Method: Composition-based stats. Identities = 58/273 (21%), Positives = 103/273 (37%), Gaps = 34/273 (12%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIY 90 I D+ + +I SIL YN+ + ++F D+ + F+ L Q + + Sbjct: 4 IVLVGDQAYQEQVSTTIKSILYYNKN--VKIYVFNQGLSDEWFRDFNELVEQLDSELVNI 61 Query: 91 LINGDRLR-SLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFP 149 ++ + + + + A Y R+ I + +VLYLD+D++ ++PL + Sbjct: 62 SLDQVTISPEWLTQDHISSATYARYFIPQFVAE--GRVLYLDSDLVVNRDLQPLFDIPLE 119 Query: 150 DDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNE 209 VA V G FN+G LLI+ W +++ I + Sbjct: 120 GKLVAAVGDAG------------------GYGFNAGVLLIDNRSWKERELQESFIKETDR 161 Query: 210 -----PEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLN----YQLKESFINPVTND 260 + + DQ VLN +LA + D YN Q + Y + + Sbjct: 162 IMGLVQSGQMEDFNGDQTVLNHVLAQDWLPLDKIYNLQVGHDLVAFYSGWNGHF-ELDQE 220 Query: 261 TIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNAS 293 + IHY KPW+ Y Q + + + S Sbjct: 221 PLIIHYTTFRKPWNSE-VSYRYRQLWWDFQALS 252 Score = 163 bits (412), Expect = 9e-39, Method: Composition-based stats. Identities = 51/294 (17%), Positives = 103/294 (35%), Gaps = 25/294 (8%) Query: 5 FFQETEFLNSVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIF 64 E + + K + + + + + +I SI+ +N + F++ Sbjct: 384 HEHPQEMVRKLRSLMKKEKPQAFR-AVVLAANAAYSEQVLTTIKSIVCHN--RFIKFYVI 440 Query: 65 TDYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKA 124 F + + +I ++G + + N ++++ R+ A + + Sbjct: 441 NSDFPTEWFVSMRKKLAKLDCQIVNARVDGSHISQYKT--NIHYSVFLRYFTATFV--EE 496 Query: 125 PKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNS 184 + LYLD DI+ + + + V G G + FNS Sbjct: 497 DQALYLDCDIVVTRDLSEIFAVDLGSYPLGAVRDLG------------GEVYFGEQIFNS 544 Query: 185 GFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFS 244 G LLIN W ++ + I M + + K+T DQ +LNML ++ + YN + Sbjct: 545 GVLLINVNYWRENDIAGQLIEMTD--NLHDKVTQDDQSILNMLFENRWMELPFAYNC-IT 601 Query: 245 LNYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNT 298 L+ + IHY+ KPW ++ + + W + Sbjct: 602 LHTTFSDYEPEKGLYPP-VIHYLTERKPWKEYTQSI-YREVWWFY-QGLDWSDM 652 >UniRef50_A5EVI8 Glycosyl transferase family 8 protein n=1 Tax=Dichelobacter nodosus VCS1703A RepID=A5EVI8_DICNV Length = 617 Score = 166 bits (420), Expect = 1e-39, Method: Composition-based stats. Identities = 56/331 (16%), Positives = 111/331 (33%), Gaps = 33/331 (9%) Query: 25 ENLCLDIAYGTDKNFLFGCGISIASILKY-NEGSRLCFHIFTDYFGDDDRKYFDALALQY 83 + + + D++++ G I SI+ + + + L I +K L + Sbjct: 276 QKNAVSVVIAADEHYVPHLGALICSIIDHLSCDAFLDLIILDGGIDFISQKQLAHLLGKR 335 Query: 84 KTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPL 143 I+ ++ D +++ A ++R ++ I +VLY+D D I + L Sbjct: 336 GA-IQFLDLS-DEFTDQKVHMHFSRATFYRLILDKLII-DRKRVLYIDCDTIVLADLAEL 392 Query: 144 INFSFPDDKVAMVVTEGQADW----------------WEKRAHSLGVAGIAKGYFNSGFL 187 + V + + +G+ + YF +G + Sbjct: 393 FATDLNGKAIGAVFDYIMHHFCQVGVRSIEFTNYLPAKKYLEDYVGLKENWRHYFQAGVI 452 Query: 188 LINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNY 247 L + Q + + IA L E K+ DQD+LN + F + +N Sbjct: 453 LFDLEQLRTLNYADKMIASLTE----KRYWFLDQDILNKYFVGNVHFLNPCWNVVNVGAD 508 Query: 248 QLK------ESFINPVTNDTIFIHYIGPT-KPWHDWAWDYPVSQAFMEAKNASPWKNTAL 300 + + + IHY G KPW D + ++ + + W + L Sbjct: 509 IYEGLSAELIAELKAAERAPAIIHYAGYEAKPWVDLSAK--FAEFYYYYLRQTFWYESVL 566 Query: 301 LKPNNSNQLRYSAKHMLKKHRYLKGFSNYLF 331 N + S K K R+ + +L Sbjct: 567 TSKMLLNVRKKSQKSGEKSWRWKIAYRIWLR 597 >UniRef50_B1I7N1 Glycosyl transferase, family 8 n=8 Tax=Streptococcus pneumoniae RepID=B1I7N1_STRPI Length = 817 Score = 165 bits (419), Expect = 1e-39, Method: Composition-based stats. Identities = 56/272 (20%), Positives = 99/272 (36%), Gaps = 34/272 (12%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIY 90 I D+N++ +I SIL +N + +I D + +A + I Sbjct: 5 IVLAGDRNYIRQLETTIKSILYHNRD--VKIYILNQDIMPDWFRKPRKIARMLGSEIIDV 62 Query: 91 LINGDR-LRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFP 149 + + + + Y R+ IADY + KVLYLD+D+I ++E L + Sbjct: 63 KLPEQTVFQDWEKQDHISSITYARYFIADYI--QEDKVLYLDSDLIVNTSLEKLFSICLE 120 Query: 150 DDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIA---- 205 + +A V FN+G LLIN +W +++ R I Sbjct: 121 EKSLAAVKDT------------------DGITFNTGVLLINNKKWRQEKLKERLIEQSIV 162 Query: 206 -MLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLN----YQLKESFINPVTND 260 M E + + DQ + N +L D + YN Q + Y + + + Sbjct: 163 TMKEVEEGRFEHFNGDQTIFNQVLQDDWLELGRAYNLQVGHDIVALYNNWQEHL-AFNDK 221 Query: 261 TIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNA 292 + IH+ KPW + + + + Sbjct: 222 PVVIHFTTYRKPWTTLTANR-YRDLWWKFHDL 252 >UniRef50_C5ZVZ7 Putative glycosyltransferase n=3 Tax=Campylobacterales RepID=C5ZVZ7_9HELI Length = 431 Score = 165 bits (417), Expect = 2e-39, Method: Composition-based stats. Identities = 71/363 (19%), Positives = 120/363 (33%), Gaps = 66/363 (18%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYN---------------------------------- 54 I + DKN++ + I SI+K Sbjct: 2 FHIFFSADKNYIPYTAVLITSIIKNTNPQKSFKDFCTTPSDSLPSLDYPRLQYDNLDKLD 61 Query: 55 EGSRLCFHIFTDYFGDDDRKYFD----ALALQYKTRIKIYLINGDRLRSLPSTK--NWTH 108 + FHI +D D + L+ Y ++I++IN P + + +H Sbjct: 62 KSEGYVFHILSDSIPKDLQTKLQNFIQELSAFYPCTLQIHIINDIDFAHFPISGAAHSSH 121 Query: 109 AIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKR 168 Y+R DY K LYLD+D++ + L D+ ++ G + K Sbjct: 122 LPYYRLKWQDYIKPAPQKCLYLDSDMLVLCDLRELFALDLKDNIAGIIGDCGSKNRKIKY 181 Query: 169 --AHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNM 226 + + YFNSGFLLIN+ Q+ +Q+ + + + I DQD+LN Sbjct: 182 QENNYKKTFYFDENYFNSGFLLINSKQYIKEQIWEKCENLAKKCT---YIKAADQDLLNF 238 Query: 227 LLA-DKLIFADIKYNTQ-FSLNY-----------QLKESFINPVTNDTIFIHYIGPTKPW 273 + +K + YN Q +L Y N + +HY KPW Sbjct: 239 TIPINKRLKLPFAYNFQCITLLYVLCKDECKNRLNYTREAFNKSFKNPKILHY--GEKPW 296 Query: 274 HDWAWDYPVS-----QAFMEAKNASP-WKNTALLKPNNSNQLRYSAKHMLKKHRYLKGFS 327 + E +P + + L + + + + A Y F Sbjct: 297 RYLQSYQDYKGNNINDIWWEYAQQTPIFGDKLLKQKSQISDYKLFAILGYYALLYTTNFL 356 Query: 328 NYL 330 Y Sbjct: 357 GYF 359 >UniRef50_B2UQN6 Glycosyl transferase family 8 n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UQN6_AKKM8 Length = 371 Score = 165 bits (417), Expect = 2e-39, Method: Composition-based stats. Identities = 51/349 (14%), Positives = 110/349 (31%), Gaps = 35/349 (10%) Query: 21 KVETENLCLDIAYGTDKNFLFGCGISIASILKY-NEGSRLCFHIFTDYFGDDDRKYFDAL 79 E + + + + G++I ++ + + G HI D + + + Sbjct: 13 PASPEKSRIPVMFSATGGWGLPLGVAIHTLCLHASSGRFYDIHIVHDGMDARIIQELNQV 72 Query: 80 ALQYK-TRIKIYLINGDRLRSLPS--TKNWTHAIYFRFVIADYFINKAPKVLYLDADIIC 136 A + + + + + ++ Y R + F + +++YLDAD++ Sbjct: 73 AAPFPQVSLSFLQLPEEFRHLFQNGNKDRYSPLAYARLMAGSLFP-QYGRIVYLDADVLL 131 Query: 137 QGTIEPLINFSFPDDKVAMVVT------EGQADWWEKRAHSLGVAGIAKGYFNSGFLLIN 190 G + L VA + + Y NSG L+++ Sbjct: 132 AGDVAELYFSDLRGASVAAAGDGLALWSIEKGTMHPHLEYMGNYLSFPLSYCNSGVLVLD 191 Query: 191 TAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQF-SLNYQL 249 Q + + R + +PDQD+LN+ L + ++N QF S + Sbjct: 192 LDQMRRRNLEHRLLQ--QLRSRPDPFPYPDQDILNIALHGDMTTLPPEWNFQFLSWTWDE 249 Query: 250 KESFINPVTN-----------DTIFIHYIGPTKPWHDWAWDY--------PVSQAFMEAK 290 +++ + T +H +GP KPW S + Sbjct: 250 EKTRLLRGTEFENVPTISCGRSWKLLHMVGPEKPWRLPDTPGTMGQFHWILYSFFWWPEA 309 Query: 291 NASPWKNTALLKPNNSNQLRYSAKHML-KKHRYLKGFSNYLFYFIEKIK 338 P L + +H+ ++ + + +KI+ Sbjct: 310 KRLPVFREEL-DAISQGLAPLLQRHIRGQQWKLFFSRGHIFRKRRDKIR 357 >UniRef50_A2RAV0 Catalytic activity: UDP-glucose + glycogenin <=> UDP + glucosylglycogenin n=2 Tax=Aspergillus RepID=A2RAV0_ASPNC Length = 767 Score = 165 bits (417), Expect = 3e-39, Method: Composition-based stats. Identities = 47/277 (16%), Positives = 86/277 (31%), Gaps = 40/277 (14%) Query: 32 AYGT---DKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 Y T ++L G + S+ ++L D + A+ Y I Sbjct: 7 VYCTLLLSDHYLPGATVLAHSLRDNGSKAKLVALFTPDSLQPATIQELQAV---YDELIP 63 Query: 89 IYLINGDRLRSLPSTKNWT-HAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 ++ + +L A + + + + +++Y+D D++ + L++ Sbjct: 64 VHPLTNITPANLWLMDRPDLIATFTKIELWR--QTQYKRIVYIDCDVVALRAPDELLDLE 121 Query: 148 FPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAML 207 A V G D FNSG +++ +A+ Sbjct: 122 VD---FAAVPDVGWPDC-----------------FNSGVMVLRPN-------LQDYLALR 154 Query: 208 NEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYI 267 E DQ +LNM D YN S NYQ ++ + IH+I Sbjct: 155 ALAERGISFDGADQGLLNMHFRD-WHRLSFSYNCTPSANYQYIPAY-KHFQSTISMIHFI 212 Query: 268 GPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPN 304 G KPW+ P+ + + W Sbjct: 213 GAQKPWNMARQVEPIHSPYNQLLGR--WWAVYDRHYR 247 >UniRef50_B1I7M9 Glycosyl transferase, family 8 n=16 Tax=Streptococcus pneumoniae RepID=B1I7M9_STRPI Length = 406 Score = 164 bits (416), Expect = 3e-39, Method: Composition-based stats. Identities = 55/296 (18%), Positives = 114/296 (38%), Gaps = 36/296 (12%) Query: 30 DIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKI 89 + + D ++ ++ S+ ++N L ++ + + + Sbjct: 7 SVVFAGDYAYIRQIETAMKSLCRHNSH--LKIYLLNQDIPQEWFSQIRIYLQEMGGDLID 64 Query: 90 YLINGDRLRSLPSTK--NWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 + G + + S K + H + R+ I D+ KVLYLD+D+I G + L Sbjct: 65 CKLIGSQFQMNWSNKLPHINHMTFARYFIPDFV--TEDKVLYLDSDLIVTGDLTDLFELD 122 Query: 148 FPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAML 207 ++ +A + A FN+G LLIN +W ++ + + I + Sbjct: 123 LGENYLAAARSCF----------------GAGVGFNAGVLLINNKKWGSETIRQKLIDLT 166 Query: 208 NEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNY-----QLKESFINPVTNDTI 262 + + + DQ +LNML D+ + +YN Q +Y + + F P+ + Sbjct: 167 EKEH--ENVEEGDQSILNMLFKDQYSSLEDQYNFQIGYDYGAAAFKHQFIFDIPLEPLPL 224 Query: 263 FIHYIGPTKPWHDWAWDYPVSQAFMEAKNASP------WKNTALLKPNNSNQLRYS 312 +HYI KPW+ ++ + + + E W + ++ P+ S + Sbjct: 225 ILHYISQDKPWNQFSVGR-LREVWWEYSLMDWSVILNEWFSKSVKYPSKSQIFKLQ 279 >UniRef50_A8AY72 Glycosyl transferase, family 8 SP1766 n=7 Tax=Streptococcus RepID=A8AY72_STRGC Length = 435 Score = 164 bits (416), Expect = 3e-39, Method: Composition-based stats. Identities = 62/339 (18%), Positives = 116/339 (34%), Gaps = 39/339 (11%) Query: 1 MQQVFFQETEFLNSVIDYDHKVET---ENLCLD----IAYGTDKNFLFGCGISIASILKY 53 M+ +F LN I Y+ ++ I D ++ +I S+ Y Sbjct: 1 MKALFTYGLFELNKRIRYNEDTIIRLANRGKMNQMKSIVLAGDYGYIRQIETTIKSLCCY 60 Query: 54 NEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSLP---STKNWTHAI 110 + L ++F + + + D LR + + + Sbjct: 61 H--EDLLIYVFNQDIPQEWFINTRKKVKGTGNNLFDIKLLRDDLRMKWEESTYSHINYMA 118 Query: 111 YFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAH 170 Y R+ I +Y KA + LYLD D++ ++ L D +A V Sbjct: 119 YARYFIPEYV--KADRALYLDCDLVVTQNLDHLFELDLEDYYIAAVRATF---------- 166 Query: 171 SLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLAD 230 FNSG +L+N +W + + + + + + I+++ DQ +LNML + Sbjct: 167 ------GLGIGFNSGVMLLNNKRWREENIPQQLVELTD--REIERVLEGDQSILNMLFKE 218 Query: 231 KLIFADIKYNTQFSLN-----YQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQA 285 + + + YN Q + Y F P++ +HYI KPW+ + + + Sbjct: 219 QYLELEDSYNFQIGFDMGAAQYGHDFVFDIPLSPLPAIVHYISALKPWNLLT-NMRLREV 277 Query: 286 FMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKKHRYLK 324 + N W K + + K L Sbjct: 278 WWFY-NDLDWTAIIASKALKGVEKHGQDLDQVYKKELLT 315 >UniRef50_C0AA16 Lipopolysaccharide biosynthesis protein LPS:glycosyltransferase-like protein n=1 Tax=Opitutaceae bacterium TAV2 RepID=C0AA16_9BACT Length = 726 Score = 164 bits (416), Expect = 3e-39, Method: Composition-based stats. Identities = 64/343 (18%), Positives = 118/343 (34%), Gaps = 37/343 (10%) Query: 17 DYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSR-LCFHIFTDYFGDDDRKY 75 D C++IA+ D F+ ++I SI+ I T+ + K+ Sbjct: 393 KIDIHPAFAGNCINIAFNCDDKFVPYLCVAIKSIVATASTENNYDILILTEGLSPANLKW 452 Query: 76 FDALALQYKTRIKIYLI----NGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLD 131 D + +++ + + S + Y R + + K KVLYLD Sbjct: 453 IDGIKHAKNVSLRVVNVRDYLQDKDISSFFMRSMVSRIAYVRLYLGELL-EKYAKVLYLD 511 Query: 132 ADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAG----------IAKGY 181 D+I Q + L N + + A V + K + Y Sbjct: 512 CDLIAQSDVAELFNMNLDGNVCAAVPDLAISTETIKNVAAYRDIDVYLRDVLGVTDISQY 571 Query: 182 FNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNT 241 FNSG ++ + + + IA + DQ+VLN L K++ ++N Sbjct: 572 FNSGVMVFDLEKIRTDNLQQTFIAAAAKNTKF----FMDQNVLNSALYGKVLLLGFEWNK 627 Query: 242 QFSLNYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTAL- 300 + SL +++ T ++ +H+ KP P + E P+ L Sbjct: 628 RVSLAMANRDT-----TTESKILHFAAEPKP--LQKIHMPEHYNWWEYARQLPFYEELLS 680 Query: 301 --LKPNNSNQLRYSAKH-------MLKKHRYLKGFSNYLFYFI 334 +KP+++N S K K + + +N L + Sbjct: 681 RVIKPSSTNFSSTSQKLPSLNKFIYRKYIKPITALNNLLKFLK 723 >UniRef50_C1CFZ1 Glycosyl transferase, family 8 n=17 Tax=Streptococcus pneumoniae RepID=C1CFZ1_STRZJ Length = 404 Score = 164 bits (415), Expect = 4e-39, Method: Composition-based stats. Identities = 66/317 (20%), Positives = 115/317 (36%), Gaps = 35/317 (11%) Query: 30 DIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKI 89 I + D +++ +I SI YN + L F++F D + + ++ I Sbjct: 5 SIVFNADNDYVDKLETAIKSICCYN--NCLKFYVFNDDIASEWFLMMNKRLKTIQSEIVN 62 Query: 90 YLINGDRLRSLPST-KNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSF 148 I L+ KN ++A +FR+ I ++ + LYLD+DII G+++ L + Sbjct: 63 VKIVDHVLKKFHLPLKNLSYATFFRYFIPNFVKE--SRALYLDSDIIVTGSLDYLFDIEL 120 Query: 149 PDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLN 208 +A V + FNSG LL+N W + ++ + + N Sbjct: 121 DGYALAAVEDSFG--------------DVPSTNFNSGMLLVNVDTWRDEDACSKLLELTN 166 Query: 209 EPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSL--------NYQLKESFINPVTND 260 + + DQ +LNML D+ D +N + N++ E + Sbjct: 167 QYHET---AYGDQGILNMLFHDRWKRLDRNFNFMVGMDSVAHIEGNHKWYEISELKNGDL 223 Query: 261 TIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKKH 320 IHY G KPW + + + + N W + L K S Sbjct: 224 PSVIHYTG-VKPWEIIS-NNRFREVWWFY-NLLEWSDILLRKDIISRSFEELVYSPKAHT 280 Query: 321 RYLK--GFSNYLFYFIE 335 ++ Y IE Sbjct: 281 AIFTASCEMEHVEYLIE 297 >UniRef50_Q3DNS6 Glycosyl transferase, family 8 n=5 Tax=Streptococcus agalactiae RepID=Q3DNS6_STRAG Length = 401 Score = 164 bits (415), Expect = 5e-39, Method: Composition-based stats. Identities = 53/266 (19%), Positives = 100/266 (37%), Gaps = 28/266 (10%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIY 90 +A D N+L ++I SI YN + F++F + + + +++ Sbjct: 5 VALAVDSNYLDKALVTIKSICVYNRN--ITFYLFNQDTPVEWVRNINRKLEPLGSKLINV 62 Query: 91 LINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPD 150 I + L + T + +FR +ADY + +VLYLD+DII ++ L F Sbjct: 63 KIYNYDIAHLTTF--LTVSTWFRLFLADYIPS--SRVLYLDSDIIVNTNLDYLFELDFKG 118 Query: 151 DKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEP 210 +A V +G FN+G LL N W ++ + E Sbjct: 119 YYLAAVKDPH---------------KNEEGGFNAGMLLANLELWREDGLTKTLLKTAEEL 163 Query: 211 EIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSL----NYQLKESFINPVTNDTIFIHY 266 + K DQ +LN++ ++ + + +N Q ++N IH+ Sbjct: 164 HRVVKT--GDQSILNIVCHNRWLSLNKTWNFQTYDVVSRYNHRSYLYLNIENRTPNIIHF 221 Query: 267 IGPTKPWHDWAWDYPVSQAFMEAKNA 292 + KPW++ + + + Sbjct: 222 LTSDKPWNENSV-ARFRELWWYYFQL 246 >UniRef50_B9QZ95 Glycosyl transferase family 8 n=1 Tax=Labrenzia alexandrii DFL-11 RepID=B9QZ95_9RHOB Length = 309 Score = 164 bits (415), Expect = 5e-39, Method: Composition-based stats. Identities = 64/320 (20%), Positives = 108/320 (33%), Gaps = 28/320 (8%) Query: 30 DIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKT--RI 87 +IA D L G ++I S L+++ H+ D + D+ + + Sbjct: 4 NIAACADTKVLPGLAVTIRSSLEHSS-IPCRIHVLADRLSEQDKHKLSNSWKPHPMCQDV 62 Query: 88 KIYLINGDRLRSLPSTKN-WTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINF 146 Y I+ + ST + + Y R+ I+D+ + K +YLD D++ + L Sbjct: 63 VFYDIDYQNISKFRSTMYLKSKSAYSRYFISDFL-GEESKCIYLDCDLLVLRDLAELNTA 121 Query: 147 SFPDDKVAMVVTEGQA--DWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAI 204 + V D L + YFNSG L+I+ +W I Sbjct: 122 KMHGKTIGSVRDISVRTADPHLFIGERLQLTN-PYDYFNSGVLIIDLDRWRKLDARNHLI 180 Query: 205 AMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFI 264 + E DQD LN+ F D +NT + P T + I Sbjct: 181 DLT--LERADTFHSQDQDALNVFFDGDTEFLDPVWNTS---------QYERPDTAENRII 229 Query: 265 HYIGPTKPWH--------DWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHM 316 H IG KPWH D + F + + + P ++ + + Sbjct: 230 HLIGTVKPWHARYKEKLSDSYHRTEIWDRFYGVLDRTAYAGNRPWDPAGLGVVKETIESK 289 Query: 317 LKKHRYLKG-FSNYLFYFIE 335 + K + G L F+ Sbjct: 290 IPKMDMVTGKIRRTLQKFLN 309 >UniRef50_O48684 F3I6.10 protein n=46 Tax=Embryophyta RepID=O48684_ARATH Length = 393 Score = 164 bits (415), Expect = 5e-39, Method: Composition-based stats. Identities = 48/284 (16%), Positives = 94/284 (33%), Gaps = 19/284 (6%) Query: 24 TENLCLDIAYGTDKNFLFGCGISIASILKYNE-GSRLCFHIFTDYFGDDDRKYFDALALQ 82 + + IA D +L G ++ S+L++ + FH F + L Sbjct: 80 NDPSLVHIAMTLDSEYLRGSIAAVHSVLRHASCPENVFFHFIAAEFDSASPRVLSQLVRS 139 Query: 83 YKTRIKI--YLINGDRLRSLPSTKNW----THAIYFRFVIADYFINKAPKVLYLDADIIC 136 + Y+ D + +L S+ Y R + D +V+YLD+D+I Sbjct: 140 TFPSLNFKVYIFREDTVINLISSSIRLALENPLNYARNYLGDILDRSVERVIYLDSDVIT 199 Query: 137 QGTIEPLINFSFPDDKVAMVVTEGQADW--------WEKRAHSLGVAGIAKGYFNSGFLL 188 I L N +V A++ W A ++G YFN+G ++ Sbjct: 200 VDDITKLWNTVLTGSRVIGAPEYCHANFTQYFTSGFWSDPALPGLISGQKPCYFNTGVMV 259 Query: 189 INTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQ 248 ++ +W + + + ++ ++ A + D ++N Q L Sbjct: 260 MDLVRWREGNYREKLEQWMQLQKKMRIYDLGSLPPFLLVFAGNVEAIDHRWN-QHGLGGD 318 Query: 249 LKESFINPVTNDTI-FIHYIGPTKPWHDWAWDYPV--SQAFMEA 289 + + +H+ G KPW P + Sbjct: 319 NIRGSCRSLHPGPVSLLHWSGKGKPWVRLDEKRPCPLDHLWEPY 362 >UniRef50_C4ZG45 Glycosyltransferase Family 2 modular protein n=1 Tax=Eubacterium rectale ATCC 33656 RepID=C4ZG45_EUBR3 Length = 723 Score = 164 bits (415), Expect = 5e-39, Method: Composition-based stats. Identities = 58/276 (21%), Positives = 104/276 (37%), Gaps = 20/276 (7%) Query: 26 NLCLDIAYGT---DKNFLFGCGISIASILKYNEGSRLCFHIFTDY-FGDDDRKYFDALAL 81 + + I G D N+ G ++ SI++ + + + FHI D + ++ +A Sbjct: 340 DNAIHICLGIHDKDGNYSVWAGTTMQSIVENTK-APIVFHILHDDTLNEMNKNKLSLIAD 398 Query: 82 QYKTRIKIYLINGDRLRSL-PSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTI 140 I+ + N D SL S +T FR ++ D K++YLD+D+ I Sbjct: 399 NSGNGIEFHHFNPDIFGSLADSMNRFTIGTMFRIMLPDIMP-DLKKIIYLDSDLFVNTDI 457 Query: 141 EPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQ-V 199 E L N + + +A W YFN+G L +N + Sbjct: 458 EELWNLNIDNYCLAAAQDCSTIRNWGTPYAVAAGQTSRDRYFNAGVLCMNLDNIRKNGSL 517 Query: 200 SARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTN 259 + + L++ + PDQD LN + + K + D K+N Y + E+ N Sbjct: 518 FQQVMDYLSDN---PRTWLPDQDALNAIFSGKTLLIDEKWN------YFIDEARKNNEKA 568 Query: 260 DTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPW 295 + HY + + +A+ +PW Sbjct: 569 EKKIYHYAATL---LMLHTNNEIDRAYYFTILRTPW 601 >UniRef50_B0NR59 Putative uncharacterized protein n=1 Tax=Bacteroides stercoris ATCC 43183 RepID=B0NR59_BACSE Length = 306 Score = 163 bits (413), Expect = 7e-39, Method: Composition-based stats. Identities = 67/314 (21%), Positives = 127/314 (40%), Gaps = 26/314 (8%) Query: 27 LCLDIAYGTDKNFLFGCGISIASILKYNEGSR-LCFHIFT-DYFGDDDRKYFDALALQYK 84 + I + D N++ G+ I S+L ++ +I + + D++ + YK Sbjct: 2 KKIPIVFSIDHNYVMQAGVCILSLLMNSDEKEYYDIYILSAADITEHDKELLNKTIFAYK 61 Query: 85 TRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLI 144 I I+ DR + +N + A YFR +I D + K++Y D D+I Q ++ ++ Sbjct: 62 ADINFIEID-DRFDNAFEIRNISKAAYFRLLIPDLIP-QYDKIIYSDVDVIFQSGLQEVL 119 Query: 145 NFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAI 204 + D+ + G + + + + GY NSGFLLIN +Q+ + Sbjct: 120 DTDLKDNYFGGIKAIGAESIKD---YIIQLGLNIHGYINSGFLLINAKLQREKQLFNKIQ 176 Query: 205 AMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQ---FSLNYQLKESFINPVTNDT 261 L + K DQD++N++ ++L F +KY + L Y + + + Sbjct: 177 EYLTK-----KFQFQDQDIINIVCKNRLTFLPLKYCFTQKSYELYYTNPKRLFSVFSPKE 231 Query: 262 I-------FIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAK 314 + IHY G KPW+ + + Y + S + + + +Y Sbjct: 232 VEEAFTEGIIHYEGTNKPWNGFCYRY---DNWWRYYKKSVFYSEEMHFQTAYKI-QYPTW 287 Query: 315 HMLKKHRYLKGFSN 328 + K R L+ F Sbjct: 288 TLKKILRLLRNFIR 301 >UniRef50_Q871S1 Glycogenin n=3 Tax=Sordariaceae RepID=Q871S1_NEUCR Length = 686 Score = 163 bits (412), Expect = 1e-38, Method: Composition-based stats. Identities = 44/272 (16%), Positives = 87/272 (31%), Gaps = 44/272 (16%) Query: 32 AYGT---DKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 Y + + +L G + S+ +L I + ++ + + Y I Sbjct: 9 VYASLLLNDAYLPGALVLAHSLRDSGTHKKLAILITPENISNEVVEQLQTV---YDYVIP 65 Query: 89 IYLINGDRLRSLPSTKNWT-HAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 + I DR +L H+ + + + + K++Y+DAD++ + L + Sbjct: 66 VETIQNDRPANLFLMNRPDLHSAFTKINLWK--QTQFRKIVYIDADVVAYRAPDELFDLP 123 Query: 148 FPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAML 207 + G D FN+G ++++ AML Sbjct: 124 ---HAFSAAPDIGWPDL-----------------FNTGVMVLSPN-------MGDYYAML 156 Query: 208 NEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYI 267 E DQ +LNM + YN S +YQ ++ + +H+I Sbjct: 157 AMAERGISFDGADQGLLNMHFRNTYNRLSFTYNVTPSAHYQYIPAY-KHFQSSINLLHFI 215 Query: 268 GPTKPWHD-------WAWDYPVSQAFMEAKNA 292 G KPW + + + + Sbjct: 216 GSEKPWVQGRTQTTGSSTYDEMIGRWWAVYDR 247 >UniRef50_Q01GT2 UDP-glucose:glycoprotein glucosyltransferase, putative (ISS) n=1 Tax=Ostreococcus tauri RepID=Q01GT2_OSTTA Length = 1339 Score = 163 bits (412), Expect = 1e-38, Method: Composition-based stats. Identities = 38/272 (13%), Positives = 85/272 (31%), Gaps = 20/272 (7%) Query: 6 FQETEFLNSVIDYDHKVETENLCLDIA-YGTDKNFLFGCGISIASILKYNEGSRLCFHIF 64 + +L+ V+ + N + I + + I +AS+ + + + F Sbjct: 870 TSPSGWLSKVLK-----KKSNERIHIFSVASGHLYERFLKIMMASVKRSTKN-PVKFWFI 923 Query: 65 TDYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKA 124 ++ + + +A +Y ++ + + K Y + F + Sbjct: 924 KNWLSPSFKDFLPHMAEKYDFEYELVSYKWPTWLNKQTEKQRIIWAYKILFLDVLFPLEL 983 Query: 125 PKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQAD-WWEKRAHSLGVAG---IAKG 180 KV+++DAD I + + L N + R G K Sbjct: 984 NKVIFVDADQIVRADMSELWNMNLHGAPYGYTPMCDNNKEMEGFRFWKQGFWQTHLRGKP 1043 Query: 181 YFNSGFLLINTAQWAAQQVSARAIAMLN-EPEIIKKITHPDQDVLNMLLAD-KLIFADIK 238 Y S +++ ++ A R M + + + DQD+ N D + + Sbjct: 1044 YHISALYVVDLDRFRAVAAGDRLRVMYDSLSRDPGSLANLDQDLPNYAQHDVPIFSLPMP 1103 Query: 239 YNTQFSLNYQLKESF-------INPVTNDTIF 263 + S ++ NP+T + Sbjct: 1104 WLWCESWCGNETKAAAKTIDLCNNPLTKEPKL 1135 >UniRef50_A5UC07 Aspartate-semialdehyde dehydrogenase n=7 Tax=Haemophilus influenzae RepID=A5UC07_HAEIE Length = 300 Score = 162 bits (410), Expect = 2e-38, Method: Composition-based stats. Identities = 50/273 (18%), Positives = 98/273 (35%), Gaps = 17/273 (6%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 ++I + D NF + S+ ++ + + ++ D + + ++ + Sbjct: 1 MNIVFTLDCNFASHLDTVLKSLCYHH--NNINIYVIHDGIPAESLEKLKMHCAKFDNTLY 58 Query: 89 IYLINGDRLR---SLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLIN 145 N ++ + + A FR + +V+YLD D+I I+ L + Sbjct: 59 DIQFNINQFSFPTVMSPAHIQSSASLFRLYLHQILPQHIERVIYLDIDLIIHQAIDELWD 118 Query: 146 FSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIA 205 + D +A V WE + + Y N+G +LIN +W + I Sbjct: 119 INLEDSLIAGVSDFFSEYLWEHPFYEK------QQYINTGVMLINLNKWRENNIEQYFIE 172 Query: 206 MLNEPEIIKKITHPDQDVLNMLLA-DKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFI 264 + + DQDV+N + + + +K+N Q L + I Sbjct: 173 YAAKYGEF--FVYGDQDVINFSIPTNLIKLLPVKFNIQVKFIEYLWMEHKEKIKFTPHII 230 Query: 265 HYIGPTKPW---HDWAWDYPVSQAFMEAKNASP 294 HYIG KPW H ++ ++ + S Sbjct: 231 HYIGSNKPWLKEHSANSPRFYNEEYLFYHHLSW 263 >UniRef50_Q4JZJ9 Putative glycosyl transferase n=8 Tax=Streptococcus pneumoniae RepID=Q4JZJ9_STRPN Length = 344 Score = 162 bits (409), Expect = 2e-38, Method: Composition-based stats. Identities = 66/342 (19%), Positives = 116/342 (33%), Gaps = 30/342 (8%) Query: 13 NSVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDD 72 +S+ ++I Y TD NF+ SI S+ N L I D D + Sbjct: 15 HSIFFISENKFRSRNFMNIVYATDNNFVDVLSASIKSLYTTNSDLDLNLWIIADKVSDRN 74 Query: 73 RKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDA 132 ++ + L+ Q+ R +I I + + + + R + + KVLYLD+ Sbjct: 75 KEKINRLSKQFAQR-EINWIENVEIPFKLHLDRGSISSFSRLFLGSVLPSSMSKVLYLDS 133 Query: 133 DIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTA 192 DII ++ + + F + V K + I K FN+G +LIN Sbjct: 134 DIIVMDSLRSIFDIDFKGKILYGVNDTF-----NKEYKQVLGIPIDKPMFNAGVMLINLE 188 Query: 193 QWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLN------ 246 W V R + ++ + I D VLN +L + +YN Sbjct: 189 LWRNNNVEERFLQVIQKFN--GTILQGDLGVLNAVLYNSFGVLPPEYNYMTIFEDLTYEE 246 Query: 247 --------YQLKESFINPVTNDTIFIHYIG---PTKPWHDWAWDYPVSQAFMEAKNASPW 295 + I + H+ +PW + + + + F + + Sbjct: 247 MIVFKKPINYYSKEEIKNARERIVLRHFTTSFLSKRPWQE-SSEVTHVEIFKKYYRG-AY 304 Query: 296 KNTALLKPNNSNQLRYSAKHM-LKKHRYLKGFSNYLFYFIEK 336 K + K N + K M L +++ Y I K Sbjct: 305 KQASPSK--LLNIYKILPKKMSLYLLGFIQSKVRPKLYRITK 344 >UniRef50_C1MLJ1 Glycosyltransferase family 24 protein n=1 Tax=Micromonas pusilla CCMP1545 RepID=C1MLJ1_9CHLO Length = 1657 Score = 162 bits (409), Expect = 2e-38, Method: Composition-based stats. Identities = 29/262 (11%), Positives = 76/262 (29%), Gaps = 15/262 (5%) Query: 17 DYDHKVETENLCLDIA-YGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKY 75 + + + +++ + + I + S+ + + + F ++ + Sbjct: 1355 KWRNAKRSRLETINVFSVASGHLYERFLKIMMLSVRRN-TNNPVKFWFIKNWLSPQFKDI 1413 Query: 76 FDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADII 135 +A +Y ++ + K Y + F KV+++DAD + Sbjct: 1414 LPHIAAKYGFEYELVTYKWPTWLHKQTEKQRIIWAYKLLFLDVLFPLTLNKVIFVDADQV 1473 Query: 136 CQGTIEPLINFSFPDDKVAMVVTEG-QADWWEKRAHSLGVAGI---AKGYFNSGFLLINT 191 + ++ L A + R G K Y S +++ Sbjct: 1474 VRSNLKELWEMDLRGAPYAYTPFCDNNPEMEGYRFWKHGFWQTHLAGKPYHISALYVVDL 1533 Query: 192 AQWAAQQVSARAIAMLNE-PEIIKKITHPDQDVLNMLLAD-KLIFADIKYNTQFSLNYQL 249 + + + + + + DQD+ N + ++ S Sbjct: 1534 ETFRHTAAGDKLRLIYETLSKDPSSLANLDQDLPNYAQHQVPIFTLPQQWLWCESWCGND 1593 Query: 250 KESF-------INPVTNDTIFI 264 ++ NP+T + I Sbjct: 1594 TKTAAKTIDLCNNPMTKEPKLI 1615 >UniRef50_Q09332 UDP-glucose:glycoprotein glucosyltransferase n=15 Tax=Neoptera RepID=UGGG_DROME Length = 1548 Score = 161 bits (408), Expect = 3e-38, Method: Composition-based stats. Identities = 32/225 (14%), Positives = 77/225 (34%), Gaps = 12/225 (5%) Query: 24 TENLCLDIA-YGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQ 82 + ++I + + I + S+LK+ + S + F +Y + +A + Sbjct: 1235 EDTETINIFSVASGHLYERLLRIMMVSLLKHTK-SPVKFWFLKNYLSPQFTDFLPHMASE 1293 Query: 83 YKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEP 142 Y + ++ R + K T Y + F K++++DAD I + I+ Sbjct: 1294 YNFQYELVQYKWPRWLHQQTEKQRTIWGYKILFLDVLFPLNVRKIIFVDADAIVRTDIKE 1353 Query: 143 LINFSFPDDKVAMVVTEGQA------DWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAA 196 L + A +W++ + G + Y S +++ ++ Sbjct: 1354 LYDMDLGGAPYAYTPFCDSRKEMEGFRFWKQGYWRSHLMG--RRYHISALYVVDLKRFRK 1411 Query: 197 QQVSARAI-AMLNEPEIIKKITHPDQDVLNMLLADK-LIFADIKY 239 R + +++ DQD+ N ++ + + Sbjct: 1412 IAAGDRLRGQYQALSQDPNSLSNLDQDLPNNMIHQVAIKSLPDDW 1456 >UniRef50_UPI000175831B PREDICTED: similar to UDP-glucose glycoprotein:glucosyltransferase n=3 Tax=Endopterygota RepID=UPI000175831B Length = 1506 Score = 161 bits (407), Expect = 3e-38, Method: Composition-based stats. Identities = 30/247 (12%), Positives = 77/247 (31%), Gaps = 8/247 (3%) Query: 18 YDHKVETENLCLDIA-YGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYF 76 + E + L+I + + I + S+LK+ + + F +Y + + Sbjct: 1201 FSKNEEEPDDKLNIFSVASGHLYERFLRIMMLSVLKHTKT-PVKFWFLKNYLSPQIKDFL 1259 Query: 77 DALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIIC 136 +A +Y ++ R + K Y + F K++++DAD + Sbjct: 1260 PYMAKEYGFEYELVQYKWPRWLHQQTEKQRIIWGYKILFLDVLFPLDVKKIIFVDADQVV 1319 Query: 137 QGTIEPLINFSFPDDKVAMVVTEGQAD-WWEKRAHSLGVAG---IAKGYFNSGFLLINTA 192 + ++ L R LG + Y S +++ Sbjct: 1320 RADLKELQELDLGGAPYGYTPFCDSRKEMDGFRFWKLGYWRNHLQGRKYHISALYVVDLK 1379 Query: 193 QWAAQQVSARAI-AMLNEPEIIKKITHPDQDVLNMLLADK-LIFADIKYNTQFSLNYQLK 250 ++ R + +++ DQD+ N ++ + ++ + Sbjct: 1380 RFRRIAAGDRLRGQYQALSQDPNSLSNLDQDLPNNMIHQVGIKSLPQEWLWCETWCDDES 1439 Query: 251 ESFINPV 257 ++ + Sbjct: 1440 KARAKTI 1446 >UniRef50_C3XGD2 Putative uncharacterized protein n=1 Tax=Helicobacter bilis ATCC 43879 RepID=C3XGD2_9HELI Length = 364 Score = 161 bits (407), Expect = 4e-38, Method: Composition-based stats. Identities = 72/321 (22%), Positives = 112/321 (34%), Gaps = 43/321 (13%) Query: 57 SRLCFHIFTDYFGDDDRKYFD----ALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYF 112 CFHI TD + R+ L Y ++Y ++ + LP N + YF Sbjct: 46 KPFCFHILTDGLKHETRQKLQAFQIELNKIYPCEFRVYTLSDSIFQGLPKLNN-NYLAYF 104 Query: 113 RFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMV-VTEGQADWWEKRAHS 171 R IA LYLD D+IC I + +V V + Q KR + Sbjct: 105 RLKIASCLPQDIKTCLYLDVDMICVADIREIFYTDLQGKICGVVLVPDHQQYCVLKRNSA 164 Query: 172 LG--VAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLA 229 +G A YFNSG +LI+ Q+ V + + + + DQD LN +L Sbjct: 165 IGDEFVFNASTYFNSGLMLIDVEQYRKYNVEQKCLEWFEQYVPV----LLDQDALNAVLG 220 Query: 230 DKLIFADIKYNTQFSLNYQLKESFINP---------------VTNDTIFIHYIG-PTKPW 273 D + +++N L ++ F V N+ +HY G KPW Sbjct: 221 DHICALPLEWNFFVELLKYKRQDFKGKDNNIVMKITYEEYMQVKNNMKILHYTGWTLKPW 280 Query: 274 HDWAWDYP------VSQAFMEAKNASP--WKNTALLKPNNSNQLRY-----SAKHM--LK 318 + + E + +P +K+ + Y KH+ K Sbjct: 281 QQPYIENDMIKTCIYKNKWWEIAHDTPVFYKDIYASYMKKQEDMLYESILSLQKHIKSFK 340 Query: 319 KHRYLKGFSNYLFYFIEKIKH 339 LK L +K+ H Sbjct: 341 LRNRLKRLQQSLKRRCKKLFH 361 >UniRef50_C4JK72 Putative uncharacterized protein n=1 Tax=Uncinocarpus reesii 1704 RepID=C4JK72_UNCRE Length = 696 Score = 160 bits (405), Expect = 6e-38, Method: Composition-based stats. Identities = 40/280 (14%), Positives = 86/280 (30%), Gaps = 37/280 (13%) Query: 26 NLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKT 85 + ++L G + S+ + +++ I + + + Y Sbjct: 4 REAIYCTLLMSDSYLPGAMVLARSLRDHGTQAKIVALITPESLQAQTIEELKCV---YDE 60 Query: 86 RIKIYLINGDRLRSLPSTKNWT-HAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLI 144 I + + +L + + + + + +++Y+DAD++ + L+ Sbjct: 61 VIPVSRVINVSPANLYLMDRPDLISTFTKIELWR--QVQYKQIVYIDADVVALRAPDELL 118 Query: 145 NFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAI 204 A G D FNSG +++ + Sbjct: 119 TLD---THFAAAPDIGWPDC-----------------FNSGVMVLRPS-------LQEYY 151 Query: 205 AMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFI 264 ++L + DQ +LNM YN S +YQ +F + + Sbjct: 152 SLLAFAQRGISFDGADQGLLNMHFT-TWQRLSFAYNCTPSGHYQYIPAF-RHFQSTISLV 209 Query: 265 HYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPN 304 HYIG KPW+ +P+ + + W + Sbjct: 210 HYIGQNKPWNLPRQTFPIEGPYNQLLAR--WWSVYDRHYR 247 >UniRef50_Q5WI33 Lipopolysaccharide glycosyltransferase n=6 Tax=Firmicutes RepID=Q5WI33_BACSK Length = 274 Score = 160 bits (405), Expect = 6e-38, Method: Composition-based stats. Identities = 50/270 (18%), Positives = 98/270 (36%), Gaps = 11/270 (4%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 ++I + ++L + + S+ N ++ + + + + Sbjct: 1 MNILVTLNAHYLKPLQVMLTSLFMNNAHEDFTIYLIHSSIPEKQLQLLEQFVCHQGHSLV 60 Query: 89 IYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSF 148 I + + P K+++ +Y+R + + + ++LYLD DI+ I PL + Sbjct: 61 IVETDKTLFANAPVVKHYSSEMYYRLLAYRFLPTELDRILYLDPDILVLNPIRPLYEANI 120 Query: 149 PDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLN 208 A ++ A Y+NSG LL+N A+ A + Sbjct: 121 DSYLYAAAQHSFI--NIQEINKFRLNAYEMDAYYNSGVLLMNLAKQRETMDINDIFAYVE 178 Query: 209 EPEIIKKITHPDQDVLNMLLADKLIFADIK-YNTQFSLNYQLKES-----FINPVTNDTI 262 ++ PDQDVLN L + ++ D + YN K I+ V T+ Sbjct: 179 TYR--NRLVLPDQDVLNALYSPQIKNVDERLYNYDARYYRYYKLKSGGRFDIDAVLQQTV 236 Query: 263 FIHYIGPTKPWHDWAWDYPVSQAFMEAKNA 292 +H+ G KPWH ++ + + Sbjct: 237 ILHFCGKKKPWHK-NYNGKFHSLYKHYEKQ 265 >UniRef50_A8XPN2 Putative uncharacterized protein n=2 Tax=Caenorhabditis briggsae RepID=A8XPN2_CAEBR Length = 1495 Score = 160 bits (405), Expect = 7e-38, Method: Composition-based stats. Identities = 37/271 (13%), Positives = 82/271 (30%), Gaps = 15/271 (5%) Query: 7 QETEFLNSVIDYDHKVETENLCLDIA-YGTDKNFLFGCGISIASILKYNEGSRLCFHIFT 65 E +S+ + E +++ + + I I S++K + + F + Sbjct: 1184 DEEGVWSSLSNLVSSKEKPQEVINVFSLASGHLYERFMRIMIVSVMKNTKH-PVKFWLLK 1242 Query: 66 DYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAP 125 +Y ++ LA Y ++ R K + + F Sbjct: 1243 NYLSPQFKETLPTLAKHYDFEYELVEYKWPRWLHQQKEKQRIMWGFKILFLDVLFPLDVG 1302 Query: 126 KVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQAD-WWEKRAHSLGVAGI---AKGY 181 KV+++DAD + + + L+ F + V R G + Y Sbjct: 1303 KVIFVDADQVVRADLMELMKFDLGNAPYGYVPFCESRKEMDGFRFWKQGYWANHLAGRRY 1362 Query: 182 FNSGFLLINTAQWAAQQVSARAI-AMLNEPEIIKKITHPDQDVLNMLLAD-KLIFADIKY 239 S +I+ ++ R + + DQD+ N ++ K+ ++ Sbjct: 1363 HISALYVIDLQKFRQIAAGDRLRGQYQGLSGDPNSLANLDQDLPNNMIHQVKIKSLPQEW 1422 Query: 240 NTQFSLNYQLKESF-------INPVTNDTIF 263 + + NP+T + Sbjct: 1423 LWCETWCDDASKKNAKTIDLCNNPLTKEPKL 1453 >UniRef50_A4WWT5 Glycosyl transferase, family 8 n=1 Tax=Rhodobacter sphaeroides ATCC 17025 RepID=A4WWT5_RHOS5 Length = 319 Score = 160 bits (405), Expect = 7e-38, Method: Composition-based stats. Identities = 58/320 (18%), Positives = 105/320 (32%), Gaps = 24/320 (7%) Query: 20 HKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDAL 79 + + D+N+ + A I + I + + Sbjct: 6 EASRPARARQAVIFCCDRNYYPYAMFAAAQIAGRHPHRGFDICI-------ASLEAIEEP 58 Query: 80 ALQYKTRIKIYLING-DRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQ- 137 + ++ I+ T Y R V+ + F ++LYLD+DI Q Sbjct: 59 PSLSELAVRHCTIDAAHLFADFGLDDRRTAVTYLRLVLPEAFSEDYDRILYLDSDIYIQG 118 Query: 138 GTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIA-KGYFNSGFLLINTAQWAA 196 G + LI +A V Q +R G+ + YFNSG LL + + A Sbjct: 119 GDLGALIALPLAGRPLAAVRDNKQWRTPSRRMVDFDRLGLPQRPYFNSGVLLFDVPAFRA 178 Query: 197 QQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINP 256 + A+ + +++ DQ +LN + +N Q++ + +L + + Sbjct: 179 ANLLQEALRIGR--SQGRQLVRHDQSLLNACMLGNWAELSPSWNWQYTWSSRLFAAML-- 234 Query: 257 VTNDTIFIHYIGPTKPWHDWA---WDYPVSQA--F--MEAKNASPWKNTALLKPNNSNQL 309 IH+IG KPW D F + P + P++ Sbjct: 235 ---GPNIIHFIGRCKPWCDPDNLLSPQFARDLQIFLARHFPDHPPLPLGPGMLPDSLAMR 291 Query: 310 RYSAKHMLKKHRYLKGFSNY 329 R KH+L R + + Sbjct: 292 RMLMKHLLSSGRLARYLERF 311 >UniRef50_Q3DM64 Glycosyl transferase, family 8, degenerate n=6 Tax=Streptococcus agalactiae RepID=Q3DM64_STRAG Length = 394 Score = 159 bits (403), Expect = 1e-37, Method: Composition-based stats. Identities = 56/292 (19%), Positives = 104/292 (35%), Gaps = 35/292 (11%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIY 90 I D L I SIL +N+ R+ +I + + L + I Sbjct: 6 ICLAGDNKSLNQIQTVIKSILCHND--RVSIYILNQDIASEWFRNIQRRLLNSHSCIFDI 63 Query: 91 LINGDRLRSLPSTK-NWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFP 149 + D + + + + T+ Y R+ I A KVLYLD D + ++ L Sbjct: 64 KLFDDTFKEFKTPRAHITYMAYARYYIPQLI--DAEKVLYLDIDTLVVDNLDKLFEIELG 121 Query: 150 DDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNE 209 D +A ++ +FNSG +LIN+ W +V+ + + + Sbjct: 122 DYPIAAILD------------------GDGIHFNSGVMLINSLYWMRYRVTEKLLEITE- 162 Query: 210 PEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLN----YQLKESFINPVTNDTIFIH 265 + DQ VLN+L + + + KYN Q + Y+ + + + IH Sbjct: 163 -RELDNGIFGDQGVLNLLFDNNWLKLEDKYNAQVGNDLGAFYENWQGYFDRNFESPTIIH 221 Query: 266 YIGPTKPWHDWAWDYPVSQAFMEAKNASP-----WKNTALLKPNNSNQLRYS 312 Y KPW+ ++ + + + + ++ L +P Sbjct: 222 YCTHDKPWNTFSSSR-FRETWWQYEQLDWNEVFNFETYLLPEPTFEKHFFTF 272 >UniRef50_Q046Z9 Lipopolysaccharide biosynthesis glycosyltransferase n=32 Tax=Lactobacillus RepID=Q046Z9_LACGA Length = 317 Score = 159 bits (403), Expect = 1e-37, Method: Composition-based stats. Identities = 53/299 (17%), Positives = 113/299 (37%), Gaps = 25/299 (8%) Query: 27 LCLDIAYGTDKNFLFGCGISIASILKY-NEGSRLCFHIFTDYFGDDDRKYFDALALQYKT 85 + + + Y N+ +SI S++ + ++ + D +K + L Sbjct: 2 MTIPVFYTISDNYTPYAAVSIQSLIDHVDQNKDYTITLLVQNISDKHKKDLEDL-SIKNV 60 Query: 86 RIKIYLINGDRL-------RSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQG 138 + I+ I+ + + + + +T +I++R I + F + K +YLDAD I Sbjct: 61 HVNIFHIDDEMVAPIHNSEENYLRAQFFTMSIFYRLFIPNLFP-QYDKAVYLDADTIICT 119 Query: 139 TIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAG--IAKGYFNSGFLLINTAQWAA 196 I L N D+ A V + + G + Y N+G +L N + Sbjct: 120 DIAELYNTEIGDNMFASVPDMSIRFIKPLQVYIKECQGIFPPEKYINNGVILFNMKAFRD 179 Query: 197 QQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINP 256 ++ + +++ + PDQ +N + DK+ ++++ + + Sbjct: 180 KKFVDKFYSLIEKYHFDNID--PDQAYMNEICEDKIYHLPLEWDAMPNEHMD-------- 229 Query: 257 VTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLK-PNNSNQLRYSAK 314 + +HY KPWH D + F + SP+ + N +++ R A+ Sbjct: 230 EIKNPKIVHYNLFFKPWHF--ADVQYGKYFWDVAKKSPYYGELKEQLANFTDEDRKKAR 286 >UniRef50_UPI0001693121 general stress protein n=1 Tax=Paenibacillus larvae subsp. larvae BRL-230010 RepID=UPI0001693121 Length = 352 Score = 159 bits (403), Expect = 1e-37, Method: Composition-based stats. Identities = 58/308 (18%), Positives = 99/308 (32%), Gaps = 32/308 (10%) Query: 36 DKNFLFGCGISIASILKYNEGSRLCFHIFTDY-FGDDDRKYFDALALQYKTRIKIYLING 94 D + G +AS+ S + HI D + +++ L + I Y + Sbjct: 12 DGAYAEHAGAVLASVFCNTS-SSVNVHILHDETLTEANKQKLIELTSSFNQTIHFYPVTI 70 Query: 95 DR-----LRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFP 149 + + S WT A +R +I K++YLD D++ I L Sbjct: 71 PDNMLQAMAGVKSISFWTQASMYRLLIPALIP--VDKIIYLDCDVLVNMNIAELWEVQLG 128 Query: 150 DDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQ-QVSARAIAMLN 208 D +A V + + YFNSG +L + + L Sbjct: 129 DFYLAAVWDQAIMAAVQHIIPYGLNPD---SYFNSGVILFALNNIRKKIDWYEEMLNFLR 185 Query: 209 EPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIG 268 + PDQD LN + + + D ++N N N +H+ G Sbjct: 186 RYPDT---SMPDQDTLNAVFGENYLQLDRRFNF---FNMVSPHHDFNN-----KIVHFAG 234 Query: 269 PTKPWHDWAWDYPVSQAFMEAKNASPWKNTA-----LLKPNNSNQLRYSAKHMLKKHRYL 323 K W P + + E + +PWK + P + + S K R L Sbjct: 235 SEK---CWDVHSPGANLYQEYLSLTPWKKHTDETSMGVHPLDGQRDSKSKNLKAPKLRPL 291 Query: 324 KGFSNYLF 331 + + Sbjct: 292 RRVRKSIR 299 >UniRef50_C1QEC6 LPS:glycosyltransferase n=1 Tax=Brachyspira murdochii DSM 12563 RepID=C1QEC6_9SPIR Length = 242 Score = 159 bits (403), Expect = 1e-37, Method: Composition-based stats. Identities = 65/256 (25%), Positives = 111/256 (43%), Gaps = 20/256 (7%) Query: 25 ENLCLDIAYGTDKNFLFGCGISIASILKYNEG-SRLCFHIFTDYFGDDDRKYFDALALQY 83 ++I + + + +I SILK ++ FH+ T+ D+++ + L Sbjct: 1 MQETMNICFTANDKYAPFMSATIVSILKNSKDDESFSFHVITNDISDENKMMIERLKEIK 60 Query: 84 KTRIKIYLINGD----RLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGT 139 +IK Y N D + +++ +I+FR I + I KVLYLD DII + Sbjct: 61 TFKIKYYTPNIDKYNKWFEKINYQRHYAPSIFFRLDIPNLII-NIDKVLYLDCDIIVNSS 119 Query: 140 IEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQV 199 + L N + V G ++ +K +G+ K YFNSG LL+N + + + Sbjct: 120 LSELFNIDISEYFALAVEDTGDLNFLKKYKTKIGIEDKHK-YFNSGVLLLNNKLYMEKNL 178 Query: 200 SARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTN 259 + + N+ I DQD+LN L DK+ F D K+N F + + Sbjct: 179 NLESENYFNKY--YNVIECVDQDILNYLFRDKIKFIDNKWN-----------DFSSKNID 225 Query: 260 DTIFIHYIGPTKPWHD 275 + +HY+G K W+ Sbjct: 226 KSAIMHYVGKIKSWNK 241 >UniRef50_B1LK07 Glycosyl transferase family 8 n=10 Tax=Enterobacteriaceae RepID=B1LK07_ECOSM Length = 630 Score = 159 bits (402), Expect = 1e-37, Method: Composition-based stats. Identities = 51/340 (15%), Positives = 118/340 (34%), Gaps = 39/340 (11%) Query: 26 NLCLDIAYGTDKNFLFGCGISIASILKYNE-GSRLCFHIFTDYFGDDDRKYFDALALQYK 84 + + + D N+ G I SI+ +++ + + +++ L + Sbjct: 274 DESVPVVISFDNNYALSGGALINSIVLHSDASRNYDIVVLENKVSHLNKQRLIKLVAGHN 333 Query: 85 -TRIKIYLINGD-RLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEP 142 ++ + +N + + + +++ + Y R I F + KV+++D+D + + + Sbjct: 334 NISLRFFDVNSFTEMSDVHTRAHFSASTYARLFIPQLF-REYKKVVFIDSDTVVKADLAT 392 Query: 143 LINFSFPDDKVAMVVTEGQADWWEK---------------RAHSLGVAGIAKGYFNSGFL 187 L++ + VA V + + YF +G + Sbjct: 393 LLDVEIGTNLVAAVKDIVMEGFVKFGTMSESDDGIMPAEQYLKKTLGMTNPDEYFQAGII 452 Query: 188 LINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNY 247 + N Q + A+ ++ L KK DQD++N + ++ F +++N Sbjct: 453 VFNVEQMVTENTFAQLMSALK----AKKYWFLDQDIMNKVFFGRVKFLPLEWNVYHGNGN 508 Query: 248 QLK---------ESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNT 298 + IHY G KPW+ D F+E ++PW+ Sbjct: 509 TDDFFPNLKFSTYMRFLQARRNPKMIHYAGENKPWNTEKVD--FYDDFLENVLSTPWEKE 566 Query: 299 ALLKPNN-----SNQLRYSAKHMLKKHRYLKGFSNYLFYF 333 + NQ + +L + + + Y+ + Sbjct: 567 IYYRQLPVATVVPNQHTELQQTVLLQTKIKRALMPYVNKY 606 >UniRef50_C3XKY2 Glycosyl transferase n=2 Tax=Campylobacterales RepID=C3XKY2_9HELI Length = 433 Score = 159 bits (402), Expect = 1e-37, Method: Composition-based stats. Identities = 64/368 (17%), Positives = 108/368 (29%), Gaps = 70/368 (19%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEG-------------------------------- 56 I D+N++ + I S++ Sbjct: 2 FHIILSADENYIKYASVLITSVIYNTNPKLTFKDFCQKEGFKALKNSYFSAYQNIDFSKL 61 Query: 57 ------SRLCFHIFTDYFGDDDRKYF----DALALQYKTRIKIYLINGDRLRSLPSTK-- 104 FHI +D + + L Y I ++IN + P + Sbjct: 62 SKQEAQEGYIFHILSDSISSTTQNQLTELQNTLNTIYPCEILTHIINDKEFENFPISGAA 121 Query: 105 NWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADW 164 + H Y+R + Y + K LYLD+D++C + L D VA + G Sbjct: 122 HSNHLPYYRLKLDSYLDDSITKCLYLDSDMLCLCDLRELFAIDLKDFVVAAINDPGTKKR 181 Query: 165 WEKRAH--SLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQD 222 K + YFNSGFLLINT + ++ + + + I DQD Sbjct: 182 KIKYKENGKKMILNFNDNYFNSGFLLINTQNYKQHKIQEKCENLAKKCY---YIKAADQD 238 Query: 223 VLNMLL-ADKLIFADIKYNTQF------------SLNYQLKESFINPVTNDTIFIHYIGP 269 +LN + +KL+ I YN + + IHY Sbjct: 239 LLNATIPKEKLLKLPIAYNFSSISFCIAICKDEQKHRLNCTRAEFMESYKNPKIIHY--G 296 Query: 270 TKPWHDWAWDYPVS-----QAFMEAKNASP-WKNTALLKPNNSNQLRYSAKHMLKKHRYL 323 KPW + +P + L + + + A + + Sbjct: 297 EKPWKFLQSYVNSKGENINDLWWHYAKITPSFSTQLLESKASIKEYLHFASLGFEVFKLS 356 Query: 324 KGFSNYLF 331 + Y Sbjct: 357 TKLTGYFA 364 >UniRef50_UPI0000E47484 PREDICTED: similar to UDP-glucose ceramide glucosyltransferase-like 1 n=3 Tax=Strongylocentrotus purpuratus RepID=UPI0000E47484 Length = 1470 Score = 159 bits (402), Expect = 1e-37, Method: Composition-based stats. Identities = 36/251 (14%), Positives = 84/251 (33%), Gaps = 15/251 (5%) Query: 27 LCLDIA-YGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKT 85 L+I + + I + S+LK+ + S + F +Y ++ +A +Y Sbjct: 1166 EQLNIFSLASGHLYERLLRIMMLSVLKHTK-SPVKFWFLKNYLSPSFKEIIPEMAKEYDF 1224 Query: 86 RIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLIN 145 ++ R + K Y + F K++++DAD I + ++ L + Sbjct: 1225 EYELIQYKWPRWLHQQTEKQRMIWGYKILFLDVLFPLNIKKIIFVDADQIVRADMQELAD 1284 Query: 146 FSFPDDKVAMVVTEGQAD-WWEKRAHSLGVAGI---AKGYFNSGFLLINTAQWAAQQVSA 201 F V R G + Y S +++ ++ Sbjct: 1285 FDLKGAPYGYVPFCDSRKEMDGFRFWKSGYWASHLAGRKYHISALYVVDLVKFRRIAAGD 1344 Query: 202 RAI-AMLNEPEIIKKITHPDQDVLNMLLADK-LIFADIKYNTQFSLNYQLKESF------ 253 R + +++ DQD+ N ++ + ++ + ++ ++S Sbjct: 1345 RLRGQYQALSQDPNSLSNLDQDLPNNMIHQVAIRSLPQEWLYCETWCHESEKSRAKTIDL 1404 Query: 254 -INPVTNDTIF 263 NP+T + Sbjct: 1405 CNNPLTKEPKL 1415 >UniRef50_C3ZE29 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3ZE29_BRAFL Length = 1647 Score = 159 bits (402), Expect = 1e-37, Method: Composition-based stats. Identities = 33/253 (13%), Positives = 79/253 (31%), Gaps = 15/253 (5%) Query: 25 ENLCLDIA-YGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQY 83 E ++I + + I + S+LK+ + + F +Y + +A +Y Sbjct: 1342 EEDVINIFSVASGHLYERLLRIMMLSVLKHTKT-PVKFWFLKNYLSPAVMDFLPHMAKEY 1400 Query: 84 KTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPL 143 + ++ R + K Y + F K++++DAD I + I+ L Sbjct: 1401 GFQYELVQYKWPRWLHQQTEKQRIIWGYKILFLDVLFPLSVKKIIFVDADQIVRTDIKEL 1460 Query: 144 INFSFPDDKVAMVVTEGQAD-WWEKRAHSLGVAGI---AKGYFNSGFLLINTAQWAAQQV 199 + R G + Y S +++ ++ Sbjct: 1461 RDLDLGGAPYGYTPFCDSRKEMNGFRFWKSGYWASHLGGRKYHISALYVVDLKKFRRIAA 1520 Query: 200 SARAI-AMLNEPEIIKKITHPDQDVLNMLLADK-LIFADIKYNTQFSLNYQLKESF---- 253 R + +++ DQD+ N ++ + ++ + ++ Sbjct: 1521 GDRLRGQYQGLSQDPNSLSNLDQDLPNNMIHQVAIKSLPQEWLWCETWCDDASKATAKTI 1580 Query: 254 ---INPVTNDTIF 263 NP+T + Sbjct: 1581 DLCNNPLTKEPKL 1593 >UniRef50_Q5KMJ4 Putative uncharacterized protein n=1 Tax=Filobasidiella neoformans RepID=Q5KMJ4_CRYNE Length = 1543 Score = 158 bits (400), Expect = 2e-37, Method: Composition-based stats. Identities = 38/277 (13%), Positives = 87/277 (31%), Gaps = 16/277 (5%) Query: 2 QQVFFQETEFLNSVIDYDHKVETENLCLDIAYGTDKN-FLFGCGISIASILKYNEGSRLC 60 +QV+ + + + + +TE+ ++I + I I S++K+ S + Sbjct: 1202 KQVY-SKMKSIVGLSTKATPAKTEHADINIFTVASGLLYERFASIMILSVMKH-TNSSVK 1259 Query: 61 FHIFTDYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYF 120 F ++ + LA +Y + + + K Y + F Sbjct: 1260 FWFIENFLSPTFIAFIPKLAEEYGFQYEFVTYKWPHWLRAQTEKQRIIWAYKILFLDVLF 1319 Query: 121 INKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVT-EGQADWWEKRAHSLGVAG--- 176 KV+++DAD I + ++ L++ + + R G Sbjct: 1320 PMSLDKVIFVDADQIVRTDMKELMDVDLHGRVYGYAPMGNSRKEMEGFRFWKSGYWKEAL 1379 Query: 177 IAKGYFNSGFLLINTAQWAAQQVSARAI-AMLNEPEIIKKITHPDQDVLNMLLADKLI-F 234 + Y S +++ ++ R + + DQD+ N + I Sbjct: 1380 RGRPYHISALYVVDLKKFRQLATGDRLRGQYHALSADPNSLANLDQDLPNSMQDQIPIWT 1439 Query: 235 ADIKYNTQFSLNYQLKESF-------INPVTNDTIFI 264 D + + + NP+T + + Sbjct: 1440 LDQDWLWCQTWCSDESLATAKTIDLCQNPLTKEPKLV 1476 >UniRef50_Q0U987 Putative uncharacterized protein n=1 Tax=Phaeosphaeria nodorum RepID=Q0U987_PHANO Length = 583 Score = 158 bits (400), Expect = 3e-37, Method: Composition-based stats. Identities = 44/271 (16%), Positives = 81/271 (29%), Gaps = 43/271 (15%) Query: 32 AYGT---DKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 Y T ++L G + S+ +L I + D L Y I Sbjct: 8 VYCTLLMSDSYLPGAAVLAHSLRDAGTKKKLAVLITLETLSADTITQLKEL---YDYLIP 64 Query: 89 IYLINGDRLRSLPSTKNWTHA-IYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 + I +L + + + + + K++YLDAD++ ++ L + Sbjct: 65 VERIRTPSPANLYLMGRPDLSFAFTKIALWR--QTQFRKIVYLDADVVALRALDELFDI- 121 Query: 148 FPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAML 207 + A G D FNSG ++I+ A+ Sbjct: 122 --EAPFAAAPDIGWPDA-----------------FNSGVMVISPD-------MGEYWALQ 155 Query: 208 NEPEIIKKITHPDQDVLNMLLADK-LIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHY 266 DQ +LN + YN + YQ + ++ D +H+ Sbjct: 156 TMAATGDSFDGADQGLLNQYFEHRPWQRLKFTYNCTPNAEYQWEPAY-RYYKRDISAVHF 214 Query: 267 IGPTKPWHD-----WAWDYPVSQAFMEAKNA 292 IG KPW + + + + Sbjct: 215 IGKEKPWSSSRTSGPGVYGELLSRWWQVHDR 245 >UniRef50_A8LP95 Lipopolysaccharide 1 n=1 Tax=Dinoroseobacter shibae DFL 12 RepID=A8LP95_DINSH Length = 342 Score = 158 bits (399), Expect = 3e-37, Method: Composition-based stats. Identities = 62/312 (19%), Positives = 121/312 (38%), Gaps = 22/312 (7%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIY 90 + + +D+ +L + I + + I I+ Sbjct: 38 VCFCSDEGYLPFALFAALQIHRLHPDRCFDLVIAHTG-------PLSVPHGFPGIGIRYV 90 Query: 91 LINGD-RLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGT-IEPLINFSF 148 I+ L T + Y R ++ + ++LY+D+D+ + L+ Sbjct: 91 EIDTGGCFERLALDARRTGSTYLRLALSGALGHDYQRILYMDSDVFALRDGLHVLLFTDM 150 Query: 149 PDDKVAMVVTEGQADWWEKRAHSLGVAG-IAKGYFNSGFLLINTAQWAAQQVSARAIAML 207 +A V Q ++ L A+ YFN+G LL++TA+ Q + A+A+ + Sbjct: 151 RGKPLAAVRDNSQWRTSGRKPDDLVTLNLPARPYFNAGVLLMDTARLNEQDILAKALDL- 209 Query: 208 NEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYI 267 ++ DQ +LN + + ++N QF+ S+I ++ D +H+I Sbjct: 210 -GTSQAGRLARHDQTLLNAVTSGNWAEMSPRWNWQFTW-----ASWIFALSEDARILHFI 263 Query: 268 GPTKPWHDWAWDYP--VSQAFMEAKNASPWKNTALLKPNNS--NQLRYSAKHMLKKHRYL 323 GP KPW D + +P +++A+ + + + + NS N R K ++K Sbjct: 264 GPNKPWADTSGRFPKSITRAYGDFLAEQ-FPERTVERAANSPINDPRRLIKSLIKHGLSR 322 Query: 324 KGFSNYLFYFIE 335 K S YL F + Sbjct: 323 KKMSAYLARFAD 334 >UniRef50_C5EZG9 Glycosyl transferase family protein n=1 Tax=Helicobacter pullorum MIT 98-5489 RepID=C5EZG9_9HELI Length = 374 Score = 158 bits (399), Expect = 3e-37, Method: Composition-based stats. Identities = 58/310 (18%), Positives = 108/310 (34%), Gaps = 36/310 (11%) Query: 59 LCFHIFTDYFGDDDRKYFD----ALALQYKTRIKIYLINGDRLRSLPSTK-NWTHAIYFR 113 FH+ D+ + ++ L+ Y + I+++ + R+ N + Y+R Sbjct: 9 YNFHLLMDFVSQETKEKLQNLILELSKIYPCTLNIHILEDEIFRTQSLRTLNGNYLAYYR 68 Query: 114 FVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVV---TEGQADWWEKRAH 170 I + +YLD D+I G + L + +V+ + E + Sbjct: 69 LRIGSALPLSIKRCVYLDVDMIVLGDLRELFKINLQGKICGVVMEGKDNDTQNILESKNK 128 Query: 171 SLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLAD 230 I YFNSG LL++ W + + RA ++ + K D+ +LN +L Sbjct: 129 INKSIAIVSNYFNSGMLLVDLDLWRKENIEDRAFEIVKKYYCHK----HDEHILNAVLQG 184 Query: 231 KLIFADIKYNTQFSL-------------NYQLKESFINPVTNDTIFIHYIGPTKPWHD-- 275 + ++N L N N + +HY KPW D Sbjct: 185 QTFKILPQWNMMVFLYCRAVCLNERGKINMPYNRKDFNNALKNPKILHYHTHHKPWEDSK 244 Query: 276 ---WAWDYPVSQAFMEAKNASP-WKNTALL-KPNNSNQLRYS----AKHMLKKHRYLKGF 326 + + Q + + +P +K L KP + L + K + + L Sbjct: 245 IYLNYCNKFLGQYWWDMVEQTPIFKEKLLQLKPQADSALAFQCLVGYKLLRYYQKGLFIL 304 Query: 327 SNYLFYFIEK 336 + YF+ K Sbjct: 305 IPFYTYFLIK 314 >UniRef50_C5JPW4 Glycosyl transferase family 8 protein n=2 Tax=Ajellomyces RepID=C5JPW4_AJEDS Length = 723 Score = 158 bits (399), Expect = 3e-37, Method: Composition-based stats. Identities = 45/270 (16%), Positives = 83/270 (30%), Gaps = 37/270 (13%) Query: 36 DKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIYLINGD 95 ++L G + S+ ++L + D + Y I I Sbjct: 4 SDSYLPGAMVLAHSLRDTGSKAKLVVLVTLDSLKSSTIDELKTI---YNDIIPITQFVNR 60 Query: 96 RLRSLPSTKNWT-HAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVA 154 +L + + + + + K++Y+DAD++ L+ + A Sbjct: 61 NPANLYLMDRPDLISTFSKIELWR--QTQYSKIVYIDADVVSLRAPNELLKL---VSRFA 115 Query: 155 MVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIK 214 V G D FN+G +++ ++L E Sbjct: 116 AVPDIGWPDC-----------------FNTGLMVLTPN-------MQDYYSLLALAERGI 151 Query: 215 KITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIGPTKPWH 274 DQ +LNM K YN S +YQ +F ++ +HYIG KPW+ Sbjct: 152 SFDGADQGLLNMHFK-KWDRLSFAYNCTPSGHYQYIPAF-RHFGSNISLVHYIGRRKPWN 209 Query: 275 DWAWDYPVSQAFMEAKNASPWKNTALLKPN 304 +P+ + + W Sbjct: 210 LPRQAFPLESPYNQLLGR--WWAMYDRHYR 237 >UniRef50_B9HMR5 Glycosyltransferase, CAZy family GT8 n=25 Tax=Magnoliophyta RepID=B9HMR5_POPTR Length = 383 Score = 157 bits (397), Expect = 5e-37, Method: Composition-based stats. Identities = 47/288 (16%), Positives = 90/288 (31%), Gaps = 21/288 (7%) Query: 22 VETENLCLDIAYGTDKNFLFGCGISIASILKYNE-GSRLCFHIFTDYFGDDDRKYFDALA 80 + + IA D +L G ++ S+LK+ + FH F + L Sbjct: 70 SSCDPSLVHIAMTLDSEYLRGSIAAVHSVLKHASCPESIFFHFVAAEFDPASPRVLTQLV 129 Query: 81 LQYKTRIKI--YLINGDRLRSLPSTKNWT----HAIYFRFVIADYFINKAPKVLYLDADI 134 + Y+ D + +L S+ Y R + D +V+YLD+DI Sbjct: 130 RSTFPSLNFKVYIFREDTVINLISSSIRQALENPLNYARNYLGDMLDLCVDRVIYLDSDI 189 Query: 135 ICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAG----------IAKGYFNS 184 + I L N + +V A++ + YFN+ Sbjct: 190 VVVDDIHKLWNTALSGSRVIGAPEYCHANFTQYFTSVFWSDQVMSGTFSSARRKPCYFNT 249 Query: 185 GFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFS 244 G ++++ +W R + + + ++ A + D ++N Q Sbjct: 250 GVMVMDLVRWREGDYKRRIEKWMEIQKKTRIYELGSLPPFLLVFAGDVEAIDHRWN-QHG 308 Query: 245 LNYQLKESFINPVTNDTI-FIHYIGPTKPWHDWAWDYPVS--QAFMEA 289 L + + +H+ G KPW P + Sbjct: 309 LGGDNVRGSCRSLHPGPVSLLHWSGKGKPWVRLDAKKPCKLDHLWEPY 356 >UniRef50_A5VK24 Glycosyl transferase, family 8 n=18 Tax=Lactobacillales RepID=A5VK24_LACRD Length = 282 Score = 157 bits (397), Expect = 5e-37, Method: Composition-based stats. Identities = 51/271 (18%), Positives = 89/271 (32%), Gaps = 11/271 (4%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 +++ + + F+ + SI + ++ + + Q Sbjct: 1 MNLLFSINDKFVTQLATVLLSIKLNTQAQEFNVYVLQKDKLKRT-DDLERVCKQLGMNYF 59 Query: 89 IYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSF 148 +N P T + IY+R + K+LYLDAD++C + L S Sbjct: 60 PIKVNDQLFNKAPVTDRYPTTIYYRLLAHRLLPQDLHKILYLDADVLCINDLSSLYETSL 119 Query: 149 PDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLN 208 A + + E A GY+NSG LL+N + + Sbjct: 120 DGYLYASAIHTNLTNTTEVINKIRLQNFDADGYYNSGVLLMNLDTIRKKVKDTDIFNYIR 179 Query: 209 EPEIIKKITHPDQDVLNMLLADKLIFADIK-YNT--QFSLNYQ---LKESFINPVTNDTI 262 + PDQDVLN L + + YN + Y+ E + V +T+ Sbjct: 180 THT----LLLPDQDVLNALYGRYIKSVPDQLYNFDTRKGGIYETISFGEWTTDWVMRNTV 235 Query: 263 FIHYIGPTKPWHDWAWDYPVSQAFMEAKNAS 293 +HY G KPW + + + Sbjct: 236 ILHYCGRDKPWLPTKNSGRYTALYKNYFQMT 266 >UniRef50_UPI000180C254 PREDICTED: similar to UDP-glucose ceramide glucosyltransferase-like 1 n=1 Tax=Ciona intestinalis RepID=UPI000180C254 Length = 1548 Score = 157 bits (397), Expect = 6e-37, Method: Composition-based stats. Identities = 31/265 (11%), Positives = 85/265 (32%), Gaps = 15/265 (5%) Query: 13 NSVIDYDHKVETENLCLDIA-YGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDD 71 + +++ + +++ + + I + S++++ S + F + +Y Sbjct: 1236 SESKEWEEGASNSSDVINVFSLASGHLYERLMRIMMLSVMRHTT-SNVKFWVLKNYLSPQ 1294 Query: 72 DRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLD 131 + + +A +Y ++ R + K T Y + F K++++D Sbjct: 1295 FKDFIPHMAEEYGFEYELVQYKWPRWLRQQTEKQRTMWGYKILFLDVLFPLNVEKIIFVD 1354 Query: 132 ADIICQGTIEPLINFSFPDDKVAMVVTEGQA-DWWEKRAHSLGVAGI---AKGYFNSGFL 187 AD I + ++ L + + + R G + Y S Sbjct: 1355 ADQIVRANLKELRDLDLEGNPYGYTPFCSDRTEMDGFRFWKGGYWAQHLAGRKYHISAIY 1414 Query: 188 LINTAQWAAQQVSARAI-AMLNEPEIIKKITHPDQDVLNMLLADK-LIFADIKYNTQFSL 245 +++ ++ R + + + DQD+ N ++ + ++ + Sbjct: 1415 VVDLKKFRQIAAGDRLRGQYQGLSQDPNSLANLDQDLPNNMIHQVGIKSLPQEWLWCSTW 1474 Query: 246 NYQLKESF-------INPVTNDTIF 263 S NP+T + Sbjct: 1475 CSDDSLSRAKTIDLCNNPLTKEPKL 1499 >UniRef50_C5S3F7 Putative glycosyl transferase n=2 Tax=Actinobacillus minor RepID=C5S3F7_9PAST Length = 275 Score = 157 bits (396), Expect = 7e-37, Method: Composition-based stats. Identities = 58/291 (19%), Positives = 113/291 (38%), Gaps = 26/291 (8%) Query: 20 HKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDAL 79 + + I D F +I SI +N + L + F + +Y + Sbjct: 2 KQTNKQTNKQTIILAADIKFAEQLETTIKSICYHN--ANLYIVLLNRDFSKEWFEYLNTY 59 Query: 80 ALQYKTRIKIYLINGDRLRSLPSTKNWTHA-IYFRFVIADYFINKAPKVLYLDADIICQG 138 Q I +N ++L + + + A +FR+ I + KVLYLD D++ G Sbjct: 60 LNQINCEIIDVKVNCNQLEEYKTLPHISSASTFFRYFIPAFV--NDDKVLYLDCDLVVNG 117 Query: 139 TIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQ 198 ++ + D VA + + ++ +K+ +FN+G LLIN W Q+ Sbjct: 118 SLSIFFDLELNDHYVAASLDDIAFNFHQKK------------HFNAGVLLINNKLWRKQE 165 Query: 199 VSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVT 258 ++ +A+ + + + +K+ DQ+VLN+L +K I + N Y + + + Sbjct: 166 ITLKALELTD--RLNEKLEEGDQEVLNILFQNKWIELNPYLNYLVGAEYLYRRNGVTQYI 223 Query: 259 ND-----TIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPN 304 + +H+ KPW P + + + W + N Sbjct: 224 RRQEDDVPLILHFNTKYKPWLPID-GVPFREYYWFYYRLN-WADIIARHYN 272 >UniRef50_A7EPR4 Putative uncharacterized protein n=1 Tax=Sclerotinia sclerotiorum 1980 UF-70 RepID=A7EPR4_SCLS1 Length = 643 Score = 157 bits (396), Expect = 7e-37, Method: Composition-based stats. Identities = 37/264 (14%), Positives = 79/264 (29%), Gaps = 41/264 (15%) Query: 37 KNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIYLINGDR 96 +L G + S+ ++ + TD + + I + + + Sbjct: 16 DTYLPGALVLAHSLRDAGTTKKIAVLVTTDSVTFESMAELQR---NFDFVIPVDRVVNES 72 Query: 97 LRSLPSTKNWT-HAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAM 155 +L H+ + + + + +++Y+DAD++ + L D + Sbjct: 73 PANLDLMGRPDLHSTFTKITLWK--QTQFRRIVYMDADMVALRAPDELFALP---DPFSA 127 Query: 156 VVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKK 215 G D FN+G ++++ A+ Sbjct: 128 APDIGWPDI-----------------FNTGLMVLDPN-------MGDYYALEAMARRGIS 163 Query: 216 ITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIGPTKPWHD 275 DQ +LNM + YN S +YQ +F + H+IG KPW Sbjct: 164 FDGADQGLLNMHFKNTFNRLSFTYNVTPSAHYQYLPAF-QHFQSSISAAHFIGTDKPWKV 222 Query: 276 WAW-------DYPVSQAFMEAKNA 292 + ++ + + Sbjct: 223 GRQASIGATPYHQMTGRWWAVYDK 246 >UniRef50_A8PS15 UDP-glucose:Glycoprotein Glucosyltransferase containing protein n=1 Tax=Brugia malayi RepID=A8PS15_BRUMA Length = 1534 Score = 157 bits (396), Expect = 8e-37, Method: Composition-based stats. Identities = 32/242 (13%), Positives = 81/242 (33%), Gaps = 12/242 (4%) Query: 7 QETEFLNSVIDYDHKVETENLCLDIA-YGTDKNFLFGCGISIASILKYNEGSRLCFHIFT 65 +S+ + ++ ++I + + I I S++K+ + + F + Sbjct: 1220 DHHSIWSSISTTSISGDEKHDAINIFSLASGHLYERFLRIMILSVMKHTKH-PVNFWLLK 1278 Query: 66 DYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAP 125 +Y + ++ +A Y + R + K Y + F Sbjct: 1279 NYLSPNFKETLPQMAKHYGFNYEFIEYRWPRWLHQQTEKQRVMWGYKILFLDVLFPLGVR 1338 Query: 126 KVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQA------DWWEKRAHSLGVAGIAK 179 K++++DAD I + + L+ +W+K + +AG + Sbjct: 1339 KIIFVDADQIVRTDLMELMELDLGGAPYGFTPFCDSRTSMDGFRFWKKGYWANHLAG--R 1396 Query: 180 GYFNSGFLLINTAQWAAQQVSARAI-AMLNEPEIIKKITHPDQDVLNMLLAD-KLIFADI 237 Y S +I+ ++ R +++ DQD+ N ++ ++ Sbjct: 1397 KYHISALYVIDLVKFRQVAAGDRLRGQYQGLSADPNSLSNLDQDLPNNMIHQVRIKSLPQ 1456 Query: 238 KY 239 ++ Sbjct: 1457 EW 1458 >UniRef50_A2QNN6 Contig An07c0170, complete genome n=10 Tax=Leotiomyceta RepID=A2QNN6_ASPNC Length = 1495 Score = 156 bits (395), Expect = 8e-37, Method: Composition-based stats. Identities = 31/249 (12%), Positives = 73/249 (29%), Gaps = 15/249 (6%) Query: 29 LDIA-YGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRI 87 ++I + + I + S+++ + F + + + LA +Y Sbjct: 1188 INIFSVASGHLYERMLNIMMVSVMRN-TNHSVKFWFIEQFLSPSFKSFLPHLAKEYNFSY 1246 Query: 88 KIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 ++ K Y + F KV+++DAD I + + L++ Sbjct: 1247 EMVTYKWPHWLRAQKEKQREIWGYKILFLDVLFPLDLDKVIFVDADQIVRTDMYDLVSLD 1306 Query: 148 FPDDKVAMVVTEGQAD-WWEKRAHSLGVAGI---AKGYFNSGFLLINTAQWAAQQVSARA 203 R G + Y S +++ ++ A R Sbjct: 1307 LEGAPYGFTPMCDSRHEMEGFRFWKQGYWKNFLRGQPYHISALYVVDLNRFRAIAAGDRL 1366 Query: 204 I-AMLNEPEIIKKITHPDQDVLNMLLADK-LIFADIKYNTQFSLNYQLKESF-------I 254 + +++ DQD+ N + + ++ + +S Sbjct: 1367 RGQYQMLSADPESLSNLDQDLPNHMQHHIPIKSLPQEWLWCETWCSDESQSQARTIDLCN 1426 Query: 255 NPVTNDTIF 263 NP+T + Sbjct: 1427 NPMTKEPKL 1435 >UniRef50_Q3DNA2 Glycosyl transferase, family 8 n=9 Tax=Streptococcus RepID=Q3DNA2_STRAG Length = 272 Score = 156 bits (395), Expect = 8e-37, Method: Composition-based stats. Identities = 52/271 (19%), Positives = 104/271 (38%), Gaps = 11/271 (4%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 +++ + D ++ + + S+++ + +L ++ + + Sbjct: 1 MNLLFSIDDMYVDHFKVMLYSLVRQTKNRKLEIYVLQKTLLKRHTELI-QYTQNLEVGYH 59 Query: 89 IYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSF 148 ++ + P+T + IY+R + + ++LYLDAD++C L + Sbjct: 60 PIIVGTEVFAQAPTTDRYPDTIYYRLLAHKFLPETLDRILYLDADMLCLNDFSSLYDMEL 119 Query: 149 PDDKVAMVVTEGQADWWEKRAH-SLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAML 207 D A + + L + YFN+G LL+N + + Sbjct: 120 GDQLYAAASHNTDGKFLDYVNKLRLKNVELESSYFNTGVLLMNLPAIRKVVHQQTILDYM 179 Query: 208 NEPEIIKKITHPDQDVLNMLLADKLIFAD---IKYNTQFSLNYQLK---ESFINPVTNDT 261 + ++ PDQD+LN L A+ + Y+ ++SL YQLK E + V N T Sbjct: 180 MQNR--GRLILPDQDILNGLYANLVKPIPDEIYNYDARYSLIYQLKSRNEWDLEWVINHT 237 Query: 262 IFIHYIGPTKPWHDWAWDYPVSQAFMEAKNA 292 +F+H+ G KPW + S + Sbjct: 238 VFLHFAGRDKPWKK-DYRGRYSGLYKFMAKE 267 >UniRef50_C6IB51 Glycosyltransferase n=4 Tax=Bacteroides RepID=C6IB51_9BACE Length = 417 Score = 156 bits (395), Expect = 1e-36, Method: Composition-based stats. Identities = 52/234 (22%), Positives = 90/234 (38%), Gaps = 13/234 (5%) Query: 58 RLCFHIFTDYFGDDDRKYFDALALQYKT-RIKIYLINGDRLRSLPSTK-NWTHAIYFRFV 115 + +I TDY + +++ + + I+ +I+ + + L T +R+ Sbjct: 2 NISIYILTDYISLESKEFLQEIKNVFTCVTIQWEIIDSESFKQLKKKGGYITEHTLYRYA 61 Query: 116 IADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVA 175 IAD F K LYLDAD++ G+IEPL A V + Sbjct: 62 IADLFP-NLDKALYLDADLVINGSIEPLWELDLEGYYCAGVDDIFIRRI---NYRKILEL 117 Query: 176 GIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFA 235 Y N+G LL+N ++ + + + I + + DQD +N + K+ Sbjct: 118 AEKDVYINAGVLLLNLKDLRKDKIQEKLLQHTSIY--INRDRYQDQDAINCICKGKIKLI 175 Query: 236 DIKYNTQFSLNYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEA 289 YN S + + +D I IHY G KPWH + + + + Sbjct: 176 PNIYNFTTS-----ETLHTPEMLSDIIIIHYTGSIKPWHQEYTWQVLKELYCKY 224 >UniRef50_C7HS13 Family 8 glycosyl transferase n=2 Tax=Anaerococcus RepID=C7HS13_9FIRM Length = 276 Score = 156 bits (394), Expect = 1e-36, Method: Composition-based stats. Identities = 57/274 (20%), Positives = 109/274 (39%), Gaps = 12/274 (4%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDAL---ALQYKT 85 ++I D+N+L + S+ + N + ++ D+ K + A + Sbjct: 1 MNILVSCDENYLNPLKTMLYSLFESN-DTNFEIYLIHKDIRDEKIKEIEKFVIKASSKRA 59 Query: 86 RIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLIN 145 ++ + + + T +T +Y+R + Y ++LYLD D++ + E L N Sbjct: 60 KLNAIKV-KNLFSNAKITFYYTEEMYYRLLAYKYLPENLDRILYLDPDVLVLNSCEKLYN 118 Query: 146 FSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAK--GYFNSGFLLINTAQWAAQQVSARA 203 D+ A A +G YFNSG L+IN Q + Sbjct: 119 MDLGDNYFAAATHTIPTVQSANVARLSISSGHKDIENYFNSGILMINLKLSRDSQTYEKE 178 Query: 204 IAMLNEPEIIKKITHPDQDVLNMLLADKLIFAD---IKYNTQFSLNYQLKESFIN--PVT 258 + + + PDQD+LN++ +K+I D Y+ + L Y+LK+ N + Sbjct: 179 VLNYVKNTKSLGLIMPDQDLLNVVFRNKIIKIDEIKYNYDARRYLTYKLKDKKYNLSYII 238 Query: 259 NDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNA 292 ++T F+H+ G KPW + + ++ Sbjct: 239 SNTCFLHFCGKRKPWLEENNLGVFTSLYLYFWKK 272 >UniRef50_UPI000023DC59 hypothetical protein FG01882.1 n=1 Tax=Gibberella zeae PH-1 RepID=UPI000023DC59 Length = 704 Score = 155 bits (393), Expect = 1e-36, Method: Composition-based stats. Identities = 45/265 (16%), Positives = 82/265 (30%), Gaps = 41/265 (15%) Query: 36 DKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIYLINGD 95 ++L G + S+ +L + D D + Y + I D Sbjct: 18 SDSYLPGALVLAHSLRDAGANHKLAVLVTLDSVSGDSITQLKEV---YDYIFPVPRIRND 74 Query: 96 RLRSLPSTKNWT-HAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVA 154 +L H+ + + + K++Y+DAD++ E L N A Sbjct: 75 HPANLQLMNRGDLHSAFTKINLWRL--TDFSKIVYIDADVVAYRAPEELFNL---SQPFA 129 Query: 155 MVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIK 214 G D FN+G ++++ AM+ E Sbjct: 130 AAPDIGWPDL-----------------FNTGVMVLDPN-------MGDFYAMMAMAERGI 165 Query: 215 KITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIGPTKPW- 273 DQ ++NM + YN S +YQ ++ + +H+IG KPW Sbjct: 166 SFDGADQGLINMHFGQQYHRLSFTYNVTPSAHYQYVPAY-RHFQSSINMVHFIGANKPWF 224 Query: 274 ---HDWAWDYPVSQ---AFMEAKNA 292 A P ++ + + Sbjct: 225 TGRDAPAGSGPFTEMIGRWWAVYDR 249 >UniRef50_D1IFB6 Whole genome shotgun sequence of line PN40024, scaffold_26.assembly12x (Fragment) n=1 Tax=Vitis vinifera RepID=D1IFB6_VITVI Length = 473 Score = 155 bits (392), Expect = 2e-36, Method: Composition-based stats. Identities = 50/316 (15%), Positives = 102/316 (32%), Gaps = 43/316 (13%) Query: 22 VETENLCLDIAYGTDKNFLFGCGISIASILKYNE-GSRLCFHIFTDYFGDDDRKYFDALA 80 + + IA D +L G ++ SIL+++ + FH F + L Sbjct: 137 SSCDPSLVHIAMTLDSEYLRGSIAAVHSILRHSSCPENVFFHFIAAEFDPASPRVLTQLV 196 Query: 81 LQYKTRIKI--YLINGDRLRSLPSTKNW----THAIYFRFVIADYFINKAPKVLYLDADI 134 + Y+ D + +L S+ Y R + D +V+Y+D+D+ Sbjct: 197 RSTFPSLNFKVYIFREDTVINLISSSIRSALENPLNYARNYLGDILDPCVERVIYIDSDL 256 Query: 135 ICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQW 194 + I L N + + YFN+G ++++ +W Sbjct: 257 VVVDDIRKLWNITLTEKP---------------------------CYFNTGVMVMDLVRW 289 Query: 195 AAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFI 254 + + + ++ A + D ++N Q L + Sbjct: 290 RKGNYRRKIENWMELQRRRRIYELGSLPPFLLVFAGNVEAIDHRWN-QHGLGGDNVKGSC 348 Query: 255 NPVTNDTI-FIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSA 313 P+ + +H+ G KPW P + W+ L KP+ +++L + Sbjct: 349 RPLHPGPVSLLHWSGKGKPWSRLDARKPCPVDHL-------WEPYDLYKPHRNHRLNHQQ 401 Query: 314 KHMLKKHRYLKGFSNY 329 + L G + Sbjct: 402 MLLSASSSTLVGLQKH 417 >UniRef50_Q9L7A2 Putative glycosyl transferase n=1 Tax=Haemophilus ducreyi RepID=Q9L7A2_HAEDU Length = 269 Score = 155 bits (392), Expect = 2e-36, Method: Composition-based stats. Identities = 50/282 (17%), Positives = 111/282 (39%), Gaps = 25/282 (8%) Query: 27 LCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTR 86 ++I ++++ +I SI +N+ + F++ + + + + + Sbjct: 6 EKMNIVLAANQSYSEYILTTIKSIYLHNKH--IRFYLLNRDYPTEWFDILNNKLRKLNSE 63 Query: 87 IKIYLINGDRLRSLPSTKNWTH-AIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLIN 145 I + D +++ + + + +FR+ I+D+ + KV+YLDADI+ G++ L Sbjct: 64 IIDIKVTNDTIKNFKTYSHISSDTTFFRYFISDFI--EQDKVIYLDADIVVNGSLTELYQ 121 Query: 146 FSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIA 205 + +A V + + FN+G LLIN +W ++ ++ Sbjct: 122 TDISNYFLAAVKDIISEKIY-----------VNNHIFNAGMLLINNKKWREHNITQFCLS 170 Query: 206 MLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTND----- 260 + + I + DQ +LN++ DK + + YN +Y + D Sbjct: 171 LSEKY--INSLPDADQSILNLIFKDKWLKLNRGYNYLIGTDYLFFKYGKTRYLEDLGETI 228 Query: 261 TIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLK 302 + IHY KPW + ++ + + W++ Sbjct: 229 PLIIHYNTEAKPWLNI-FNTRFRNIYWFYYELN-WQDIYAKH 268 >UniRef50_C7Z1L1 Putative uncharacterized protein n=1 Tax=Nectria haematococca mpVI 77-13-4 RepID=C7Z1L1_NECH7 Length = 762 Score = 155 bits (391), Expect = 2e-36, Method: Composition-based stats. Identities = 46/288 (15%), Positives = 86/288 (29%), Gaps = 42/288 (14%) Query: 36 DKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIYLINGD 95 ++L G + S+ +L + D D A+ Y + I D Sbjct: 18 SDSYLPGALVLAHSLRDAGTHRKLAVLVTLDSVSADSITQLKAV---YDYIFPVPRIRND 74 Query: 96 RLRSLPSTKNWT-HAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVA 154 +L H+ + + + + K++Y+DADI+ + L + + Sbjct: 75 NPANLYLMNRGDLHSAFTKINLWKL--TQFSKIVYIDADIVAYRAPDELFDI---THPFS 129 Query: 155 MVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIK 214 G D FN+G +++ AM+ E Sbjct: 130 AAPDIGWPDL-----------------FNTGVMVLTPN-------MGDFYAMIAMAERGI 165 Query: 215 KITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIGPTKPWH 274 DQ ++NM ++ YN S +YQ ++ + +H+IG KPW Sbjct: 166 SFDGADQGLINMHFGNQYNRISFTYNVTPSAHYQYVPAY-RHFQSSINMVHFIGAKKPWF 224 Query: 275 DWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKKHRY 322 + F + W R+ K + Sbjct: 225 TGRDAPRGADPFNDMVGR--WWAVY------DRHYRHQYVQYFTKGEW 264 >UniRef50_B8PIH6 Predicted protein n=2 Tax=Agaricomycetes RepID=B8PIH6_POSPM Length = 532 Score = 155 bits (391), Expect = 3e-36, Method: Composition-based stats. Identities = 53/293 (18%), Positives = 105/293 (35%), Gaps = 47/293 (16%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDA-LALQYKTRI 87 ++IA TD + ++I S++ + + SRL ++ GD+DR + + + Sbjct: 227 MNIAIATDPAYAMAAAVAIHSVIAHTK-SRLTIYVLDLGLGDNDRNKLRRSMPRRADATM 285 Query: 88 KIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 ++ +++ A + + + D +VLYLDAD++ + I L + Sbjct: 286 VFIPLDY-------ASERKEKATWAKIDMIDVLP--VERVLYLDADVLVRADIWGLWSTD 336 Query: 148 FPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAML 207 + + G + + K YFN+G LL++ A A+ Sbjct: 337 LRGKPIGAAIDVGFPEG--------HNGTVRKPYFNAGVLLLDLAAVRR-----TLQALQ 383 Query: 208 NEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNY-----QLKESFINPVTNDTI 262 DQD+LN +K+N Q Y + +++ + + Sbjct: 384 GAAREYTTSRFRDQDLLNAYFEANWAEVSLKWNAQGIATYAELPTEARQNIDMGLLKNPY 443 Query: 263 FIHYIGPT-----------------KPW-HDWAWDYPVSQAFMEAKNASPWKN 297 +H+ GP KPW + A +P + + + WK Sbjct: 444 IVHFTGPVNPTLEVVLNPYIQPYTAKPWGYAGAPGHPHGEEWWNVVEQTAWKG 496 >UniRef50_C0S309 Glycogenin n=5 Tax=Onygenales RepID=C0S309_PARBP Length = 785 Score = 155 bits (391), Expect = 3e-36, Method: Composition-based stats. Identities = 51/301 (16%), Positives = 92/301 (30%), Gaps = 47/301 (15%) Query: 32 AYGT---DKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 Y T N+L G + S+ ++L + D + Y + Sbjct: 8 VYCTMLLSDNYLPGAMVLAHSLRDNGCKAKLVVLVTLDSLKASTIDELKTI---YDDVVP 64 Query: 89 IYLINGDRLRSLPSTKNWTHA-IYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 I I +L A + + + + +++Y+DAD++ + L+ + Sbjct: 65 INRIVNHCPANLYLMDRPDLASTFSKIELWR--QTQYRQLVYIDADVVSLRAPDELLTIN 122 Query: 148 FPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAML 207 A V G D FN+G +++ ++L Sbjct: 123 TN---FAAVPDTGWPDC-----------------FNTGLMVLRPN-------MHDYYSLL 155 Query: 208 NEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYI 267 + DQ +LN+ K YN S +YQ +F + +HYI Sbjct: 156 ALAQQGVSFDGADQGLLNIHFK-KWDRLSFVYNCTPSGHYQYIPAF-RHFGSTISLVHYI 213 Query: 268 GPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKP-------NNSNQLRYSAKHMLKKH 320 G KPW+ +P + + W T + AKH L + Sbjct: 214 GSQKPWNLPRQLFPSGSPYNQLLGR--WWATYYRHYRPVVKPDTKLSSRADIAKHGLGQL 271 Query: 321 R 321 Sbjct: 272 H 272 >UniRef50_Q1RIL1 Lipopolysaccharide 1,2-glucosyltransferase RfaJ n=10 Tax=Rickettsia RepID=Q1RIL1_RICBR Length = 530 Score = 154 bits (390), Expect = 4e-36, Method: Composition-based stats. Identities = 53/295 (17%), Positives = 98/295 (33%), Gaps = 30/295 (10%) Query: 25 ENLCLDIAYGTDKNFLFGCGISIASILKYNE-GSRLCFHIF---TDYFGDDDRKYFDALA 80 ++ LDIA + F IAS L ++ S FHI D ++ + ++ Sbjct: 246 QDNTLDIALIINDKFARHAATVIASSLINSDINSFYKFHIVMNPNDSLTEESMEKLASMK 305 Query: 81 LQYKTRIKIYLINGDRL------RSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADI 134 I + L + + W + +R F +LYLDADI Sbjct: 306 HIRDYSIDFIPFPENVLDLNLANEKIEFSDMWPPLVMYRLYFDQVFP-NLESILYLDADI 364 Query: 135 ICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQW 194 I + + VA + K I Y NSG + +N Sbjct: 365 IVLRDLNSFKKLDMSNYIVAGSMDTALTYCTLKVEEECNR-KINNFYKNSGIVFLNLQNM 423 Query: 195 AAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFI 254 +Q + +PDQD+LN+ + + +++N + ++++ Sbjct: 424 REKQAKNMVLD--AMHNSKCSFAYPDQDLLNIAFHNYIYPLSMRWNF--YTYFIDRDNYF 479 Query: 255 NPVTNDTIFIHYIGPTKPWHDWAWD---------YPVSQAFMEAKNASPWKNTAL 300 + +HY G KPW++ + + + + +PW N Sbjct: 480 SYF-----IMHYAGKKKPWNNEEIKWTKDILEKYQEIEKYYWRYREFTPWGNKDF 529 >UniRef50_B2B5U2 Predicted CDS Pa_2_5770 n=1 Tax=Podospora anserina RepID=B2B5U2_PODAN Length = 576 Score = 154 bits (390), Expect = 4e-36, Method: Composition-based stats. Identities = 43/298 (14%), Positives = 92/298 (30%), Gaps = 45/298 (15%) Query: 37 KNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIYLINGDR 96 +L G + S+ +L + D + + Y I + I + Sbjct: 18 DTYLPGALVLAHSLRDAGTTKKLAILVTPDTVSTEVIATLKTV---YDYVIYVDRIRNGK 74 Query: 97 LRSLPSTKNWT-HAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAM 155 +L H+ + + + + K++Y+DAD++ ++ L + + Sbjct: 75 PANLFLMNRPDLHSAFTKINLWK--QTQFRKIVYIDADVVAYRAVDELFDLP---HAFSA 129 Query: 156 VVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKK 215 G D FN+G + + AM+ E Sbjct: 130 APDIGWPDL-----------------FNTGVMALTPN-------MGDYYAMMAMAERGIS 165 Query: 216 ITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIGPTKPWH- 274 DQ +LNM + YN S +YQ ++ +H+IG KPW Sbjct: 166 FDGADQGLLNMHFGNTYNRLSFTYNVTPSAHYQYVPAY-RHFQGSINMVHFIGADKPWRQ 224 Query: 275 ------DWAWDYPVSQAFMEAKNASPWKNT----ALLKPNNSNQLRYSAKHMLKKHRY 322 D ++ + + K ++++ + +A++++ Sbjct: 225 GRESTTDAGPFDEMTGRWWAVYDRHYHKEAGHAPSIVQHFVKGEYNPTARYVVPTGEP 282 >UniRef50_Q9NYU2 UDP-glucose:glycoprotein glucosyltransferase 1 n=77 Tax=Eumetazoa RepID=UGGG1_HUMAN Length = 1555 Score = 154 bits (390), Expect = 4e-36, Method: Composition-based stats. Identities = 33/261 (12%), Positives = 83/261 (31%), Gaps = 15/261 (5%) Query: 17 DYDHKVETENLCLDIA-YGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKY 75 + + ++ ++I + + I + S+LK + + F +Y +++ Sbjct: 1244 KTEEVKQDKDDIINIFSVASGHLYERFLRIMMLSVLKNTKT-PVKFWFLKNYLSPTFKEF 1302 Query: 76 FDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADII 135 +A +Y + ++ R + K Y + F K L++DAD I Sbjct: 1303 IPYMANEYNFQYELVQYKWPRWLHQQTEKQRIIWGYKILFLDVLFPLVVDKFLFVDADQI 1362 Query: 136 CQGTIEPLINFSFPDDKVAMVVTEGQAD-WWEKRAHSLGVAGI---AKGYFNSGFLLINT 191 + ++ L +F+ R G + Y S +++ Sbjct: 1363 VRTDLKELRDFNLDGAPYGYTPFCDSRREMDGYRFWKSGYWASHLAGRKYHISALYVVDL 1422 Query: 192 AQWAAQQVSARAI-AMLNEPEIIKKITHPDQDVLNMLLAD-KLIFADIKYNTQFSLNYQL 249 ++ R + +++ DQD+ N ++ + ++ + Sbjct: 1423 KKFRKIAAGDRLRGQYQGLSQDPNSLSNLDQDLPNNMIHQVPIKSLPQEWLWCETWCDDA 1482 Query: 250 KESF-------INPVTNDTIF 263 + NP+T + Sbjct: 1483 SKKRAKTIDLCNNPMTKEPKL 1503 >UniRef50_P91854 Protein F26H9.8, partially confirmed by transcript evidence n=2 Tax=Caenorhabditis RepID=P91854_CAEEL Length = 1381 Score = 154 bits (389), Expect = 4e-36, Method: Composition-based stats. Identities = 35/272 (12%), Positives = 90/272 (33%), Gaps = 15/272 (5%) Query: 6 FQETEFLNSVIDYDHKVETENLCLDIA-YGTDKNFLFGCGISIASILKYNEGSRLCFHIF 64 LNS +Y E + +++ + + I + S+L + ++ F + Sbjct: 1073 LSIESLLNSAKNYFASPEP-SEVINVFSLASGHLYERFMRIMMTSVLNNTKTQKVKFWLL 1131 Query: 65 TDYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKA 124 +Y ++ LA YK ++ + + K Y + F Sbjct: 1132 KNYLSPKFKETIPKLAEFYKFEFELVEYKWPKWLHKQTEKQRVMWGYKILFLDVLFPLNV 1191 Query: 125 PKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQAD----WWEKRAHSLGVAGIAKG 180 K++++DAD + + ++ L++F+ V + ++ + + Sbjct: 1192 DKIIFVDADQVVRADLQELMDFNLNGAPYGYVPFCESRTEMDGFRFWKSGYWKNHLMGRK 1251 Query: 181 YFNSGFLLINTAQWAAQQVSARAI-AMLNEPEIIKKITHPDQDVLNMLLAD-KLIFADIK 238 Y S +++ + R + +++ DQD+ N +L + + + Sbjct: 1252 YHISALYVVDLKAFREFSAGDRLRGRYDSLSADPNSLSNLDQDLPNNMLHEVPIKSLPQE 1311 Query: 239 YNTQFSLNYQLKESF-------INPVTNDTIF 263 + + + NP+T + Sbjct: 1312 WLWCETWCDDGSKEKAKTIDLCNNPLTKEPKL 1343 >UniRef50_Q2HHC6 Putative uncharacterized protein n=1 Tax=Chaetomium globosum RepID=Q2HHC6_CHAGB Length = 1406 Score = 154 bits (389), Expect = 5e-36, Method: Composition-based stats. Identities = 27/230 (11%), Positives = 62/230 (26%), Gaps = 8/230 (3%) Query: 16 IDYDHKVETENLCLDIA-YGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRK 74 + T++ ++I + + I + S++++ + F + + Sbjct: 1112 ETTNSLATTQHAEINIFSVASGHLYERMLNIMMVSVMRH-TNHTVKFWFIEQFLSPSFKD 1170 Query: 75 YFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADI 134 + LA +Y ++ K Y + F KV+++DAD Sbjct: 1171 FIPHLAAEYNFSYEMVTYKWPHWLRQQKEKQREIWGYKILFLDVLFPLSLDKVIFVDADQ 1230 Query: 135 ICQGTIEPLINFSFPDDKVAMVVTEGQA-DWWEKRAHSLGVAGI---AKGYFNSGFLLIN 190 I + + L + R G Y S ++ Sbjct: 1231 IVRTDMHELATLDLEGAPYGFTPMCDSRTEMEGFRFWKTGYWANYLKGHPYHISALYAVD 1290 Query: 191 TAQWAAQQVSARAI-AMLNEPEIIKKITHPDQDVLNML-LADKLIFADIK 238 ++ R + + DQD+ N + + Sbjct: 1291 LRRFRELAAGDRLRQQYHALSADPNSLANLDQDLPNHMQFQIPIHSLPQS 1340 >UniRef50_B6Q6I5 Glycogenin n=3 Tax=Trichocomaceae RepID=B6Q6I5_PENMQ Length = 775 Score = 154 bits (388), Expect = 6e-36, Method: Composition-based stats. Identities = 44/279 (15%), Positives = 88/279 (31%), Gaps = 42/279 (15%) Query: 22 VETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALAL 81 + T + T ++L G + S+ +++ + + + + Sbjct: 1 MATPGEAVYCTLLTSDHYLPGAVVLAHSLRDNGTRAKIVALFTPETLKEATIRELQTV-- 58 Query: 82 QYKTRIKIYLINGDRLRSLPSTKNWT-HAIYFRFVIADYFINKAPKVLYLDADIICQGTI 140 Y I + L + +L + + + + + +++Y+DAD++ Sbjct: 59 -YDEIIPVQLRSNGTPANLLLMGRLDLISTFTKIELWR--QTQYSRIVYMDADVLALRAP 115 Query: 141 EPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVS 200 + L++ + A G D FNSG +++ Sbjct: 116 DELLSL---QEDFAAAPDIGWPDI-----------------FNSGVMVLRPN-------L 148 Query: 201 ARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTND 260 A+ E DQ +LN + YN S NYQ ++ + Sbjct: 149 QDYYALRAFAERGTSFDGGDQGLLNTYFK-RWYRLSFTYNCTPSGNYQYMPAY-RHFEST 206 Query: 261 TIFIHYIGPTKPW----HDWAWDYPVSQA---FMEAKNA 292 IH+IG KPW H +A P Q + + Sbjct: 207 ISLIHFIGSQKPWTQSRHAFASGTPYYQLLGRWWAQYDR 245 >UniRef50_Q50FU8 Cj81-079 (Fragment) n=6 Tax=Campylobacter jejuni RepID=Q50FU8_CAMJE Length = 333 Score = 154 bits (388), Expect = 7e-36, Method: Composition-based stats. Identities = 63/328 (19%), Positives = 106/328 (32%), Gaps = 47/328 (14%) Query: 30 DIAYGTDKNFLFGCGISIASILKYNE------GSRLCFHIFTDYFGDDDRKYFDALALQ- 82 +I D N++ + IASI+K + F+I ++ ++ L Sbjct: 6 NIVISCDNNYVKYVAVVIASIIKNTKINSQLKEYPYKFYILSNDISKNNILKLKKLIQHL 65 Query: 83 ----YKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQG 138 Y + I+ I+ + P + HA Y+RF IAD K LYLDAD++ G Sbjct: 66 SNSYYNCELIIHKIDDSKFHRFPKAWHVNHATYYRFEIADIVEGN--KCLYLDADVLVCG 123 Query: 139 TIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGI----AKGYFNSGFLLINTAQW 194 I L + +V W + + YFN+G +LI+ QW Sbjct: 124 DIRELFYMELNNKVAGVVTDSCSRLWTKLYTKDNKTSSYIEFDPLMYFNAGVILIDLNQW 183 Query: 195 AAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFS---------- 244 + + I N + DQ LN+ L + + +N Sbjct: 184 KKHDIKNKCIDAFNIYDHGG---LADQSYLNIALKELTYKLPLNWNLIVPEYILLDGYER 240 Query: 245 ------------LNYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNA 292 N S + +H+ KPW W+ ++ Sbjct: 241 HYVVNCLDEISEYNLAYTRSEFEEAMKNKKIVHFC-AAKPW----WNLYYKNNKVDFNER 295 Query: 293 SPWKNTALLKPNNSNQLRYSAKHMLKKH 320 + W AL + + + KH Sbjct: 296 NVWWEIALNLEEFKEEFYFLKNSLDSKH 323 >UniRef50_D0N7I0 UDP-glucose:glycoprotein glucosyltransferase, putative n=1 Tax=Phytophthora infestans T30-4 RepID=D0N7I0_PHYIN Length = 1632 Score = 153 bits (387), Expect = 7e-36, Method: Composition-based stats. Identities = 29/218 (13%), Positives = 67/218 (30%), Gaps = 6/218 (2%) Query: 27 LCLDIA-YGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKT 85 + + + + I ++S+LK + + F + ++ D +K L Q+ Sbjct: 1351 ETIHVFSVASGYLYERFVKIMMSSVLKR-TNNPVTFWLLENFLSPDFKKSIPVLREQFGM 1409 Query: 86 RIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLIN 145 I++ + K Y + F K++Y+DAD + + ++ L Sbjct: 1410 DIRLVTYKWPNWLRQQTEKQRIIWGYKILFLDVLFPLGVQKIIYVDADQVVRADLKELWE 1469 Query: 146 FSFPDDKVAMVVTEGQAD--WWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARA 203 + + R K Y S +++ A + Sbjct: 1470 LDLDGKPYGYTPFCDSRNVGFQFWRQGYWKDHLRGKPYHISALYVVDLALFRQMAAGDML 1529 Query: 204 IA-MLNEPEIIKKITHPDQDVLNMLLADK-LIFADIKY 239 A + + + DQD+ N + ++ Sbjct: 1530 RAVYSHLSADPNSLANLDQDLPNYAQHQIPIFSLPQEW 1567 >UniRef50_D1HRJ7 Whole genome shotgun sequence of line PN40024, scaffold_34.assembly12x (Fragment) n=2 Tax=Vitis vinifera RepID=D1HRJ7_VITVI Length = 1715 Score = 153 bits (387), Expect = 7e-36, Method: Composition-based stats. Identities = 34/250 (13%), Positives = 71/250 (28%), Gaps = 15/250 (6%) Query: 28 CLDIA-YGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTR 86 ++I + + I I S+LK + F +Y + +A +Y Sbjct: 1407 TINIFSIASGHLYERFLKIMILSVLKN-SNRPVKFWFIKNYLSPQFKDVIPHMAQEYGFE 1465 Query: 87 IKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINF 146 ++ K Y + F KV+++DAD I + + L + Sbjct: 1466 YELITYKWPTWLHKQKEKQRIIWAYKILFLDVIFPLSLEKVIFVDADQIVRADMGELYDM 1525 Query: 147 SFPDDKVAMVVTEGQAD----WWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSAR 202 +A + R K Y S +++ ++ Sbjct: 1526 DIKGRPLAYTPFCDNNKDMDGYRFWRQGFWKDHLRGKPYHISALYVVDLVKFRETAAGDN 1585 Query: 203 AIAMLNE-PEIIKKITHPDQDVLNMLLAD-KLIFADIKYNTQFSLNYQLKESF------- 253 + +++ DQD+ N + ++ S +S Sbjct: 1586 LRVFYETLSKDPNSLSNLDQDLPNFAQHTVPIFSLPQEWLWCESWCGNATKSKAKTIDLC 1645 Query: 254 INPVTNDTIF 263 NP+T + Sbjct: 1646 NNPMTKEPKL 1655 >UniRef50_C5XV64 Putative uncharacterized protein Sb04g036540 n=1 Tax=Sorghum bicolor RepID=C5XV64_SORBI Length = 1568 Score = 153 bits (387), Expect = 8e-36, Method: Composition-based stats. Identities = 37/260 (14%), Positives = 78/260 (30%), Gaps = 15/260 (5%) Query: 18 YDHKVETENLCLDIA-YGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYF 76 D K + ++I + + I I S+LK + F +Y + Sbjct: 1253 TDRKDARQGETINIFSVASGHLYERFLKIMILSVLK-ETQRPVKFWFIKNYLSPQFKDVI 1311 Query: 77 DALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIIC 136 +A +Y ++ K Y + F KV+++DAD I Sbjct: 1312 PHMAREYGFEYELITYKWPTWLHKQKEKQRIIWAYKILFLDVIFPLSLRKVIFVDADQIV 1371 Query: 137 QGTIEPLINFSFPDDKVAMVVTEGQA-DWWEKRAHSLGVAG---IAKGYFNSGFLLINTA 192 + + L + + +A D R G + Y S +++ A Sbjct: 1372 RADMGELYDMNLKGRPLAYTPFCDNNKDMDGYRFWKQGFWKDHLRGRPYHISALYVVDLA 1431 Query: 193 QWAAQQVSARAIAMLNE-PEIIKKITHPDQDVLNMLLAD-KLIFADIKYNTQFSLNYQLK 250 ++ + + +++ DQD+ N + ++ S Sbjct: 1432 KFRQTASGDTLRVFYEQLSKDPNSLSNLDQDLPNYAQHTVPIFSLPQEWLWCESWCGNAT 1491 Query: 251 ESF-------INPVTNDTIF 263 ++ NP+T + Sbjct: 1492 KARAKTIDLCNNPMTKEPKL 1511 >UniRef50_C6LDU2 Glycosyl transferase, family 8 n=3 Tax=Firmicutes RepID=C6LDU2_9FIRM Length = 270 Score = 153 bits (386), Expect = 9e-36, Method: Composition-based stats. Identities = 51/244 (20%), Positives = 89/244 (36%), Gaps = 11/244 (4%) Query: 40 LFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIYLINGDRLRS 99 + I SI+++ +I + D+ A TR+ + S Sbjct: 1 MEHVLDCIRSIVRFPSEDGYDIYILHSDLQEQDQSDAAAQVEDGDTRLHFRFVEPSVFAS 60 Query: 100 LPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTE 159 P ++ + IY+R A + ++LYLD D + ++ L N F + Sbjct: 61 FPESERYPRLIYYRIFAASLLPPEMDRILYLDGDTLVINPLDELYNMDFEGNYFLACTH- 119 Query: 160 GQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHP 219 + K Y NSG LL+N + +Q + + + + +T P Sbjct: 120 -VRKFLTKVNQYRLGMEEVSTYINSGVLLMNLKELREKQDFEEIASFVEKR--GRYLTLP 176 Query: 220 DQDVLNMLLADKLIFAD-IKYNTQ------FSLNYQLKESFINPVTNDTIFIHYIGPTKP 272 DQD++ L +K D +KYN ++ K + V + + IHY G KP Sbjct: 177 DQDIITALYGNKTGILDTMKYNLSDRMISVYNTEPGHKRINLEWVRENAVVIHYYGKQKP 236 Query: 273 WHDW 276 W Sbjct: 237 WKKP 240 >UniRef50_A1VG39 Glycosyl transferase, family 8 n=1 Tax=Desulfovibrio vulgaris DP4 RepID=A1VG39_DESVV Length = 335 Score = 153 bits (386), Expect = 1e-35, Method: Composition-based stats. Identities = 47/289 (16%), Positives = 108/289 (37%), Gaps = 27/289 (9%) Query: 27 LCLDIAYGTDKNFLFGCGISIASILKYNEGSRL-CFHIFTDYFGDDDRKYFDALALQYKT 85 + I + D N+ +++ S+ + + S ++ + D+ +++ + Sbjct: 2 NTVPIVFTFDANYRLPASVALQSLFENAKDSTYYHVYLVCEGLSRGDKDAIESICPEKNG 61 Query: 86 RIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLIN 145 R++ ++ + S PS++NW +Y R ++ KV+Y D D++ + + Sbjct: 62 RVEWIDVDNELFSSAPSSENWPKIVYARILLPLLLP--FDKVIYSDVDVVFCSDLAEIFQ 119 Query: 146 FSFPDDKVAMVVTEGQADWWE-KRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAI 204 + A V E A R H++ + + SGF+++N + R + Sbjct: 120 IEVDGCEWAGVAAELVAFQEGVARCHNVHCEYQNELIYMSGFMVMNLRLMREKDTVGRCL 179 Query: 205 AMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQ---------------FSLNYQL 249 +++ ++ D ++LNM +D + D Y L Sbjct: 180 NNISK--FGSRLKMYDLEILNMS-SDNIARIDFSYCVLENVFFAKNVSEAKEYPWLRGLY 236 Query: 250 KESFINPVTNDTIFIHYIGPT-KPWHDWAWDYPVSQAFMEAKNASPWKN 297 + S + + IH+ G K W + Q + + SP+++ Sbjct: 237 RVSELEAARSAPRIIHFAGSDTKVWERYCVP----QVYRKYLAVSPFRS 281 >UniRef50_D1Z8I8 Whole genome shotgun sequence assembly, scaffold_9 n=1 Tax=Sordaria macrospora RepID=D1Z8I8_SORMA Length = 1298 Score = 153 bits (386), Expect = 1e-35, Method: Composition-based stats. Identities = 30/234 (12%), Positives = 69/234 (29%), Gaps = 8/234 (3%) Query: 13 NSVIDYDHKVETENLCLDIA-YGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDD 71 D ++ ++I + + I I S++K+ + + F + Sbjct: 976 KPATDKSVSETAQHAEINIFSVASGHLYERMLSIMILSVMKHTTHT-VKFWFIEQFLSPS 1034 Query: 72 DRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLD 131 + + LA +Y + ++ S K Y + F KV+++D Sbjct: 1035 FKSFLPFLAAEYGFQYEMVAYKWPHWLRHQSEKQREIWGYKILFLDVLFPLSLEKVIFVD 1094 Query: 132 ADIICQGTIEPLINFSFPDDKVAMVVTEGQA-DWWEKRAHSLGVAG---IAKGYFNSGFL 187 AD I + + L+ + R G + Y S Sbjct: 1095 ADQIVRTDMYDLVQLDLEGAPYGFTPMCDSRTEMEGFRFWKTGYWATYLRGQPYHISALY 1154 Query: 188 LINTAQWAAQQVSARAI-AMLNEPEIIKKITHPDQDVLNML-LADKLIFADIKY 239 +++ ++ R + + DQD+ N + + ++ Sbjct: 1155 VVDLRRFRELAAGDRLRQQYHTLSADPNSLANLDQDLPNHMQFQIPIKSLPQEW 1208 >UniRef50_Q2GW94 Putative uncharacterized protein n=1 Tax=Chaetomium globosum RepID=Q2GW94_CHAGB Length = 774 Score = 152 bits (385), Expect = 1e-35, Method: Composition-based stats. Identities = 50/306 (16%), Positives = 90/306 (29%), Gaps = 50/306 (16%) Query: 37 KNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIYLINGDR 96 +L G + S+ +L + D D A+ Y I + I + Sbjct: 17 DTYLPGALVLAHSLRDAGTTKKLAVLVTLDTVSADVVTQLKAV---YDYVIPVSRIQNEH 73 Query: 97 LRSLPSTKNWT-HAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAM 155 +L H+ + + + + K++Y+DADI+ + L N + Sbjct: 74 TANLDLMNRRDLHSAFTKINLWR--QTQFRKIVYVDADIVAYRAPDELFNLP---HPFSA 128 Query: 156 VVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKK 215 G D FN+G +++ A+ Sbjct: 129 APDIGWPDL-----------------FNTGLMVLTPN-------MGDYYALTAMARRGIS 164 Query: 216 ITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIGPTKPWHD 275 DQ +LNM + YN S +YQ ++ + +H+IGP KPW Sbjct: 165 FDGADQGLLNMYFKNSFNRLSFSYNVTPSAHYQYVPAY-KHFQSGINMVHFIGPEKPWLQ 223 Query: 276 ----WAWDYPVSQ---AFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKKHRYLKGFSN 328 P Q + + +P+ Q + K F Sbjct: 224 GRDITTGSSPFDQMVGRWWAVYDR-----HYRKEPSQPEQEVPAIVQYFVKGE----FQP 274 Query: 329 YLFYFI 334 + Y + Sbjct: 275 TIRYVV 280 >UniRef50_B2VRF2 Glycogenin-2 n=1 Tax=Pyrenophora tritici-repentis Pt-1C-BFP RepID=B2VRF2_PYRTR Length = 622 Score = 151 bits (382), Expect = 3e-35, Method: Composition-based stats. Identities = 45/280 (16%), Positives = 80/280 (28%), Gaps = 40/280 (14%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIY 90 I ++L G + S+ +L + D D L Y I + Sbjct: 10 ITLLMSDSYLPGAVVLANSLRDAGTKKKLAVLVTMDTLSADTIGELKTL---YDYLIPVQ 66 Query: 91 LINGDRLRSLPSTKNWTHA-IYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFP 149 I +L A + + + + K++YLDAD++ ++ L + Sbjct: 67 RIRSSNTANLYLMGRPDLAFAFTKIALWR--QTQFRKLVYLDADVVALRALDELFDI--- 121 Query: 150 DDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNE 209 + A G D FNSG ++I A+ Sbjct: 122 EASFAAAPDIGWPDA-----------------FNSGVMVIKPD-------MGEYWALQTM 157 Query: 210 PEIIKKITHPDQDVLNMLLADK-LIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIG 268 DQ +LN + YN + YQ + ++ D +H+IG Sbjct: 158 AAAGDSFDGADQGLLNQYFEHRPWQRLKFTYNCTPNAEYQWEPAY-RHYKRDIAAVHFIG 216 Query: 269 PTKPW--HDWAWDYPVSQA---FMEAKNASPWKNTALLKP 303 KPW + + + A + Sbjct: 217 KNKPWSSQHSGGTGVYGELLARWWAVHQRHLHREKAAKEA 256 >UniRef50_UPI00016C0890 glycosyl transferase family 8 protein n=2 Tax=Epulopiscium sp. 'N.t. morphotype B' RepID=UPI00016C0890 Length = 593 Score = 151 bits (382), Expect = 3e-35, Method: Composition-based stats. Identities = 54/298 (18%), Positives = 102/298 (34%), Gaps = 29/298 (9%) Query: 31 IAYGTDKNFLFGCGISIASILK-YNEGSRLCFHIFTDYFGDDDRKYFDALA-LQYKTRIK 88 I TD F+ G ++ S++K N + IF + + + + ++ Sbjct: 288 IVLTTDDRFIIGAAATLISLVKTSNVNNNYDIIIFHKDLSEKSKTLLRNVVVQRINFSLR 347 Query: 89 IYLI--NGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINF 146 Y + NW +YF+ +I ++ K L+LD D+I I L++ Sbjct: 348 FYDVGYEMSTYNVYKPGNNWQPCVYFKLLIPSI-MHNYKKSLHLDCDLIILEDIANLLSI 406 Query: 147 SFPDDKVAMVVTEGQ------ADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVS 200 + VA G W K H YFN G ++ N ++ Sbjct: 407 DLKGNAVAGCAEMGCITTSIRRTWANKYYHEKLRITNMVEYFNGGVIVFNINEFHKITSL 466 Query: 201 ARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTND 260 A+ + E KK + +QD+L+ + + +N + + + ++ Sbjct: 467 AQLL-----HEAEKKHLNLEQDILSKSFVNHIYLLPQSWNLTRDFLGTVMNLYKQYLPSN 521 Query: 261 -----------TIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSN 307 IHYIGP KPW + + + + + + A+ N Sbjct: 522 IYQKYLDARQKPKIIHYIGPLKPWDNP--NLEYASYWWDTIRGTEIYEMAINSQIQKN 577 >UniRef50_B6K765 UDP-glucose:glycoprotein glucosyltransferase n=1 Tax=Schizosaccharomyces japonicus yFS275 RepID=B6K765_SCHJY Length = 1444 Score = 151 bits (382), Expect = 3e-35, Method: Composition-based stats. Identities = 33/242 (13%), Positives = 85/242 (35%), Gaps = 13/242 (5%) Query: 8 ETEFLNSVI-DYDHKVETENLCLDIA-YGTDKNFLFGCGISIASILKYNEGSRLCFHIFT 65 +FLN ++ + + E+ ++I + + I S++++ + + + F Sbjct: 1113 SHKFLNKLLRPFRGAQKDEHAEINIFSLASGHLYERFIYIMTRSVMEHTKHT-VKFWFIE 1171 Query: 66 DYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAP 125 ++ ++ LA +YK + + N + K Y + F Sbjct: 1172 NFLSPSFKRDIAILAEKYKFKYEFVTYNWPHWLRKQTEKQREIWGYKILFLDVLFPLDLE 1231 Query: 126 KVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQA------DWWEKRAHSLGVAGIAK 179 KV+++DAD I + ++ L++ A +W++ + G+ Sbjct: 1232 KVIFVDADQIVRADLKELMDLDLKGAPYAYTPMCDSRTEMEGFRFWKQGYWKKYLRGMK- 1290 Query: 180 GYFNSGFLLINTAQWAAQQVSARA-IAMLNEPEIIKKITHPDQDVLNMLLADK-LIFADI 237 Y S +++ ++ + +++ DQD+ N L + Sbjct: 1291 -YHISALYVVDLDRFRHMGAGDLLRRQYQLLSADPESLSNLDQDLPNHLQRMIPIYSLPQ 1349 Query: 238 KY 239 ++ Sbjct: 1350 EW 1351 >UniRef50_A7H2M2 Glycosyl transferase family 8 n=3 Tax=Campylobacter jejuni RepID=A7H2M2_CAMJD Length = 381 Score = 151 bits (382), Expect = 3e-35, Method: Composition-based stats. Identities = 59/336 (17%), Positives = 121/336 (36%), Gaps = 60/336 (17%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYN-------------EGSRLCFHIFTDYFGDDDRKY 75 I ++N++ + + SI++ FHI +D+ + + Sbjct: 2 FHIVLNANENYIKYAAVLMTSIIQKTDLNKSMSEFCNFDTDEGYVFHILSDHISESMKVR 61 Query: 76 F----DALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLD 131 L Y +I ++++N D + + + Y+R +A LYLD Sbjct: 62 ISNLEKQLNDIYPCKIVLHILNDDEFKGMLKW-RGNYLAYYRIKMASVLPQNLKICLYLD 120 Query: 132 ADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLG-----VAGIAKGYFNSGF 186 D++C G + L++ + + A+ + +K SL + YFNSGF Sbjct: 121 CDMLCFGDLRELLSVDINNYQAAVCLDGNNHKKNKKVFFSLKGREKYKFSNIEKYFNSGF 180 Query: 187 LLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSL- 245 +L+N +W + ++I L + + +PDQD LN L D + ++N Sbjct: 181 ILVNLDRWRRDNIENKSIDFLKKFKT----LYPDQDALNFALND-TLLLPNRWNFSLGYF 235 Query: 246 -------------------NYQLKESFINPVTNDTIFIHYIG-PTKPWHDWAW------- 278 + ++ + H+I P KPW + + Sbjct: 236 VAFLKNSQEILFLNQTKYPHLNYTKTEFENEVKNIKIAHFILDPFKPWDAFQYSIVNDDL 295 Query: 279 ---DYPVSQAFMEAKNASP-WKNTALLKPNNSNQLR 310 +YP + + +P + L++ + N+ + Sbjct: 296 QLIEYPFYKHYWSVAKNTPEFYLDFLVQKESINEHK 331 >UniRef50_A4R9Z3 Putative uncharacterized protein n=1 Tax=Magnaporthe grisea RepID=A4R9Z3_MAGGR Length = 866 Score = 151 bits (382), Expect = 3e-35, Method: Composition-based stats. Identities = 46/287 (16%), Positives = 86/287 (29%), Gaps = 46/287 (16%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIY 90 I N+L G + S+ +L + D K L Y I + Sbjct: 11 ITLLLSDNYLPGALVLAHSLRDAGTTRKLAIMVTLDTVAA---KVITQLKAVYDYVIPVP 67 Query: 91 LINGDRLRSLPSTKNWT-HAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFP 149 I +R +L H+ + + + + K++Y+DAD++ + L Sbjct: 68 RIRNERPANLYLMNRPDLHSAFTKVNLWK--QTQFSKLVYIDADVVAYRAPDELFAI--- 122 Query: 150 DDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNE 209 + G D FN+G +++ AM+ Sbjct: 123 AHPFSAAPDIGWPDL-----------------FNTGVMVLTPN-------MGDYYAMMAM 158 Query: 210 PEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIGP 269 E DQ ++NM YN S +YQ ++ + +H+IG Sbjct: 159 AERGISFDGADQGLINMHFRHTYNRISFTYNVTPSAHYQYVPAY-RHFQSSINMVHFIGS 217 Query: 270 TKPW----HDWAWDYPVSQ---AFM-----EAKNASPWKNTALLKPN 304 KPW + A + + + + ++ P Sbjct: 218 EKPWIQGRNSTAGGGAFDEMVGRWWAVYDRHYRAPTVYEPQVQRPPE 264 >UniRef50_C6EQF4 Putative uncharacterized protein n=3 Tax=Campylobacter jejuni RepID=C6EQF4_CAMJE Length = 958 Score = 151 bits (382), Expect = 3e-35, Method: Composition-based stats. Identities = 52/304 (17%), Positives = 105/304 (34%), Gaps = 35/304 (11%) Query: 28 CLDIAYGTDKNFLFGCGISIASILKYNEG-SRLCFHIFTDYFGDDDRKYFDALALQYKTR 86 + I + D N+L I++ S++ + + + Sbjct: 12 HIPIVFAVDDNYLPYMSIALNSLVDRVSNCYKYNIFVMHLNIDLERLNRLKENIRNNNVT 71 Query: 87 IKIYLIN-------GDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGT 139 I+ +N + +T A+Y+R I + F + KV+Y D+D+I + Sbjct: 72 IEFINLNQYLKKIFKEYGNIFYERSYFTTAMYYRIFIPEIF-SNFKKVIYCDSDVIFKAD 130 Query: 140 IEPLINFSFPDDKVAMVVTEGQADWWEKR----------AHSLGVAGIAKGYFNSGFLLI 189 I L + ++ + KR YFNSG ++ Sbjct: 131 ISHLFFIDLNNKEIGACRDIAALYAYRKRETVWQQNIRNNFDKINFRSISDYFNSGVIVF 190 Query: 190 NTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQL 249 + + + ++ + ++ I + PDQDVLN++ + F +++N ++ + Sbjct: 191 DIVKCIQMKTVSKCLTVIK---NIDNLYFPDQDVLNIVFCGHVHFLPLEWNFLWTTYIEY 247 Query: 250 KESFI----------NPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTA 299 K++F+ IHYI TKPW D + + + + Sbjct: 248 KDNFMYLPKKIINEIYKAKTKPKIIHYISETKPWKDKNS---FFVEWWKFPRKNLFYGEI 304 Query: 300 LLKP 303 L K Sbjct: 305 LCKK 308 >UniRef50_C5P955 Glycosyl transferase family 8 protein n=2 Tax=Coccidioides RepID=C5P955_COCP7 Length = 823 Score = 151 bits (381), Expect = 4e-35, Method: Composition-based stats. Identities = 41/262 (15%), Positives = 83/262 (31%), Gaps = 37/262 (14%) Query: 44 GISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSLPST 103 + S+ +++ + D + +L Y I + + +L Sbjct: 1 MVLAHSLRDNGTRAKIVVLVTPDSLQASTIEELKSL---YDEVIPVSRVVNVSPANLYLM 57 Query: 104 KNWT-HAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQA 162 + + + + + +++Y+DAD++ + L+ ++A V G Sbjct: 58 DRPDLISTFTKIELWR--QIQYRQIVYIDADVVALRAPDELLTLD---TQLAAVPDIGWP 112 Query: 163 DWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQD 222 D FNSG L++ + +++ + DQ Sbjct: 113 DC-----------------FNSGVLVLRPS-------LQTYYSLVAFAQRGISFDGADQG 148 Query: 223 VLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPV 282 +LNM + YN S +YQ +F + +HYIG KPW +PV Sbjct: 149 LLNMHFRN-WDRLSFAYNCTPSGHYQYIPAF-RHFQSSISLVHYIGQKKPWSLPRQTFPV 206 Query: 283 SQAFMEAKNASPWKNTALLKPN 304 + + W Sbjct: 207 EGPYNQLLAR--WWAVYDRHYR 226 >UniRef50_B7AUG6 Putative uncharacterized protein n=1 Tax=Bacteroides pectinophilus ATCC 43243 RepID=B7AUG6_9BACE Length = 301 Score = 150 bits (380), Expect = 4e-35, Method: Composition-based stats. Identities = 62/276 (22%), Positives = 109/276 (39%), Gaps = 18/276 (6%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHI-FTDYFGDDDRKYFDALALQYKT-- 85 ++I + F+ + + S++K N + H+ +TD L Sbjct: 1 MNILVAMNDAFVKCYQVMLTSLIKNNPDENITVHVPYTDGLSRKGLDSIKELVRNQSHGS 60 Query: 86 -RIKIYLINGDRLRSLPSTK--NWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEP 142 ++ Y DRL SL W+ ++FR ++ ++L+LD DII G+I+ Sbjct: 61 ASVREYYFGKDRLGSLDKLPLGMWSVEMFFRIFAQEFIPESEDRILWLDGDIIVNGSIKD 120 Query: 143 LINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVS-A 201 N F A + K + + Y NSG LLIN ++ Sbjct: 121 FYNTDFDSMYYAACEDIAISHGKIKEEYDNLGWSSEEIYVNSGVLLINLKALRNNGITRD 180 Query: 202 RAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFAD-IKYNTQFSLNYQLKESFINPVTND 260 A+ E + K+ +PDQ +LN + DK+ FAD +YN Q S Y K + + + ++ Sbjct: 181 AAVEYALEN--MDKLHYPDQYMLNAMFHDKIKFADAFRYNCQVS-GYSYKLADM--ILSE 235 Query: 261 TIFIHYIGPTKPWHDWAWDYPVS----QAFMEAKNA 292 + +H+ G +PW + S + Sbjct: 236 SAILHFPG-YRPWQTDYQKHYSSAIPGDIWWHYAKL 270 >UniRef50_C1FE59 Glycosyltransferase family 24 protein n=1 Tax=Micromonas sp. RCC299 RepID=C1FE59_9CHLO Length = 1662 Score = 150 bits (380), Expect = 5e-35, Method: Composition-based stats. Identities = 26/220 (11%), Positives = 64/220 (29%), Gaps = 8/220 (3%) Query: 27 LCLDIA-YGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKT 85 + I + + + + S+ + + + F ++ ++Y A +Y+ Sbjct: 1374 EKIHIFSVASGYLYERLIKVMMLSVRRNTKN-PIKFWFVKNWLSPRFKQYLPHFASRYRF 1432 Query: 86 RIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLIN 145 ++ + K Y + F K++++DAD + + I+ L Sbjct: 1433 EYELVTYKWPTWLQKQTDKQRIIWAYKLLFLDVIFPLSLEKIIFVDADQVVRADIKELWE 1492 Query: 146 FSFPDDKVAMVVTEGQAD-WWEKRAHSLGVAGI---AKGYFNSGFLLINTAQWAAQQVSA 201 A R G K Y S +++ ++ Sbjct: 1493 VDLHGAPYAYTPFCDDNKVMDGFRFWKQGFWERHLDGKPYHISALYVVDLKRFRQLAAGD 1552 Query: 202 RAIA-MLNEPEIIKKITHPDQDVLNMLLAD-KLIFADIKY 239 N + + + DQD+ N + ++ Sbjct: 1553 TLRVIYENLSKDPNSLANLDQDLPNYAQHQVPIFSLPQQW 1592 >UniRef50_D1N145 Glycosyl transferase family 8 n=1 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1N145_9BACT Length = 311 Score = 150 bits (380), Expect = 5e-35, Method: Composition-based stats. Identities = 64/297 (21%), Positives = 114/297 (38%), Gaps = 20/297 (6%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIK 88 + +A TD+N+L ++ AS+L + G + H+ + + D F+AL R+ Sbjct: 5 IQVAMATDRNYLDYALVAAASLLAQHPGGGITLHLLHEELDESDFARFEALRRIDGFRLV 64 Query: 89 IYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSF 148 I + P W+ + Y+R ++ K+LYLD D++ I L N Sbjct: 65 PRKIERGFFQGWPEL-RWSTSAYYRLILPSLLP-DLEKILYLDCDLLVLDDIAELWNTEL 122 Query: 149 PDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLN 208 A + +K YFNSG +L N + A + R I + + Sbjct: 123 GSRSCAAAAVRVAPEHQKKIG-----LPAEAVYFNSGVMLFNLRKMAHENHEKRFIRLFD 177 Query: 209 EPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQ------LKESFINPVTNDTI 262 E + +I +PDQD+LN+ + + ++N S+ E+ + Sbjct: 178 E--LGGRIKYPDQDILNLAYWNDYVKLSQRWNLVTSVYRNPPTPALYSEAEVVEALRRPG 235 Query: 263 FIHYIGPTKPWHD-WAWDYPVSQAFMEAKNA----SPWKNTALLKPNNSNQLRYSAK 314 H+ G KPW +P ++ F P++ LK + L+ K Sbjct: 236 IAHFTGTHKPWRLGKTTHHPYARYFRAYAELAGLPLPFRLKLALKSLLTGSLKPPKK 292 >UniRef50_Q6ESI8 Putative UDP-glucose:glycoprotein glucosyltransferase n=3 Tax=Magnoliophyta RepID=Q6ESI8_ORYSJ Length = 1626 Score = 150 bits (380), Expect = 5e-35, Method: Composition-based stats. Identities = 36/266 (13%), Positives = 81/266 (30%), Gaps = 15/266 (5%) Query: 12 LNSVIDYDHKVETENLCLDIA-YGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGD 70 LN+ + D K + ++I + + I I S+LK + + F +Y Sbjct: 1308 LNACLMMDLKATRQGETINIFSVASGHLYERFLKIMILSVLKQTQ-RPVKFWFIKNYLSP 1366 Query: 71 DDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYL 130 + +A +Y ++ K Y + F KV+++ Sbjct: 1367 QFKDVIPHMAQEYGFEYELVTYKWPTWLHKQKEKQRIIWAYKILFLDVIFPLSLRKVIFV 1426 Query: 131 DADIICQGTIEPLINFSFPDDKVAMVVTEGQAD----WWEKRAHSLGVAGIAKGYFNSGF 186 DAD I + + L + + +A + + + Y S Sbjct: 1427 DADQIVRADMGELYDMNLKGRPLAYTPFCDNNKEMDGYRFWKQGFWKDHLRGRPYHISAL 1486 Query: 187 LLINTAQWAAQQVSARAIAMLNE-PEIIKKITHPDQDVLNMLLAD-KLIFADIKYNTQFS 244 +++ A++ + +++ DQD+ N + ++ S Sbjct: 1487 YVVDLAKFRQTASGDTLRVFYETLSKDPNSLSNLDQDLPNYAQHTVPIFSLPQEWLWCES 1546 Query: 245 LNYQLKESF-------INPVTNDTIF 263 ++ NP+T + Sbjct: 1547 WCGNATKARAKTIDLCNNPMTKEPKL 1572 >UniRef50_C6X2V2 Putative glycosyltransferase n=1 Tax=Flavobacteriaceae bacterium 3519-10 RepID=C6X2V2_FLAB3 Length = 315 Score = 150 bits (380), Expect = 5e-35, Method: Composition-based stats. Identities = 57/299 (19%), Positives = 112/299 (37%), Gaps = 28/299 (9%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNE-GSRLCFHIFTDYFGDDDRKYFDALAL-QYKTR 86 L I + D ++ + I+SI+ + ++ +I ++Y D+++ + + Sbjct: 9 LPIVFTCDDHYFKYAAVVISSIIHNSSRNTKYEINIVSEYISDENQSLAQKMVQSKSNIS 68 Query: 87 IKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINF 146 I+ + I + + + Y+RF I D +VLYLD+D+I I + Sbjct: 69 IQFHAIKIENPEVFHLNSYMSLSTYYRFFIFDLL-KDYDRVLYLDSDLIVDNDISFFADI 127 Query: 147 SFPDDKVAMVVTEGQAD---------WWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQ 197 F + + + + + + YFN+G +L N AQ Sbjct: 128 DFENKPAICCPSIYVQNSLKNNTDHKFTREYFTQILKMSDVDEYFNAGVILFNIKLIRAQ 187 Query: 198 QVSARAIAMLNEPEIIKKITHPDQDVLNMLLADK--LIFADIKYNTQFSLNYQLKESFIN 255 + + + + + DQD+LN +L + +YN ++ + LK F+N Sbjct: 188 GIDRKFFEAIKNIKDP---VYQDQDILNSVLRNNGGAKLISNEYNHTKTMKFSLKRIFLN 244 Query: 256 PVTND--------TIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNS 306 + N HY+G KPW ++ P S F+ +P+ L Sbjct: 245 ALKNKFGKKRNNWFTIYHYVGKVKPWQNFN---PDSALFLYYAYKTPFVREILKSNRLK 300 >UniRef50_Q4PEF1 Putative uncharacterized protein n=1 Tax=Ustilago maydis RepID=Q4PEF1_USTMA Length = 1678 Score = 150 bits (379), Expect = 6e-35, Method: Composition-based stats. Identities = 33/248 (13%), Positives = 83/248 (33%), Gaps = 10/248 (4%) Query: 24 TENLCLDIA-YGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQ 82 ++ ++I + + I I S+LK+ S + F ++ +++ LA + Sbjct: 1348 RKHADINIFTVASGHLYERMTYIMILSVLKHTS-SSVKFWFIENFLSPSFKEFIPHLAAE 1406 Query: 83 YKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEP 142 Y ++ K T Y + F KV+++DAD + + ++ Sbjct: 1407 YGFEYELVTYAWPHWLRAQKEKQRTIWGYKILFLDTLFPLDLGKVIFVDADQVVRTDMQE 1466 Query: 143 LINFSFPDDKVAMVVT-EGQADWWEKRAHSLGVAG---IAKGYFNSGFLLINTAQWAAQQ 198 L++ + D R G + Y S +++ ++ Sbjct: 1467 LVDLDLEGKVYGYPPMGDDSEDMDGFRFWKQGYWKDYLRGRPYHISALYVVDLQKFRLFA 1526 Query: 199 VSARAI-AMLNEPEIIKKITHPDQDVLNMLL-ADKLIFADIKYNTQFSLNYQ--LKESFI 254 R +++ DQD+ N + + + + ++ + LK++ Sbjct: 1527 AGDRLRGQYQALSADPNSLSNLDQDLPNNMQTSIPIHTLEKEWLWCETWCSHDWLKDAKT 1586 Query: 255 NPVTNDTI 262 + ++ Sbjct: 1587 IDLCSNPK 1594 >UniRef50_UPI0001792D56 PREDICTED: similar to UDP-glucose glycoprotein:glucosyltransferase n=1 Tax=Acyrthosiphon pisum RepID=UPI0001792D56 Length = 1536 Score = 150 bits (379), Expect = 7e-35, Method: Composition-based stats. Identities = 29/239 (12%), Positives = 80/239 (33%), Gaps = 8/239 (3%) Query: 21 KVETENLCLDIA-YGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDAL 79 ++ + ++I + + I + S+LK + S + F +Y + + + Sbjct: 1234 PDKSTDETINIFSVASGHLYERFLRIMMLSVLKNTK-SPVKFWFLKNYLSPTVKNFLPIM 1292 Query: 80 ALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGT 139 A +YK + ++ R + K T Y + F K++++DAD + + Sbjct: 1293 AQEYKFQYELVEYKWPRWLHQQTEKQRTIWGYKILFLDVLFPLDVKKIIFVDADQVVRAD 1352 Query: 140 IEPLINFSFPDDKVAMVVTEGQAD----WWEKRAHSLGVAGIAKGYFNSGFLLINTAQWA 195 ++ L++ A + + + Y S +++ ++ Sbjct: 1353 MKELVDLDLGGAPYAYTPFCESRKEMDGFRFWKQGYWKTHLQGRRYHISALYVVDLKRFR 1412 Query: 196 AQQVSARAI-AMLNEPEIIKKITHPDQDVLNMLLAD-KLIFADIKYNTQFSLNYQLKES 252 R + +++ DQD+ N ++ + ++ + + Sbjct: 1413 KVAAGDRLRGQYQALSQDPNSLSNLDQDLPNNMIHQVSIKSLPQEWLWCETWCDDASKK 1471 >UniRef50_C4Q2X6 Udp-glucose glycoprotein:glucosyltransferase, putative n=2 Tax=Schistosoma mansoni RepID=C4Q2X6_SCHMA Length = 1673 Score = 150 bits (379), Expect = 7e-35, Method: Composition-based stats. Identities = 28/227 (12%), Positives = 72/227 (31%), Gaps = 8/227 (3%) Query: 20 HKVETENLCLDIA-YGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDA 78 HK + ++I + + I + +++++ S + F +Y + + Sbjct: 1343 HKCASNQETINIFSVASGHLYERLLRIMMLTVIRH-TNSPVKFWFLKNYLSPTFKDFIPY 1401 Query: 79 LALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQG 138 +A +Y + R + K Y + F K++++DAD I + Sbjct: 1402 MATEYGFEYEFVQYKWPRWLHAQTEKQRIIWGYKILFLDVLFPLNVTKIIFVDADQIVRA 1461 Query: 139 TIEPLINFSFPDDKVAMVVTEGQAD-WWEKRAHSLGVAGI---AKGYFNSGFLLINTAQW 194 ++ L + R G + Y S +++ ++ Sbjct: 1462 DLKELADLDLDGAPYGYTPFCDSRKEMDGFRFWKQGYWANHLAGRPYHISALYVVDLTRF 1521 Query: 195 AAQQVSARAI-AMLNEPEIIKKITHPDQDVLNMLLAD-KLIFADIKY 239 R + +++ DQD+ N ++ + ++ Sbjct: 1522 RRLAAGDRLRGQYHGLSQDPNSLSNLDQDLPNNMIHQVPIKSLPQEW 1568 >UniRef50_B2VVG3 UDP-glucose:glycoprotein glucosyltransferase n=9 Tax=Leotiomyceta RepID=B2VVG3_PYRTR Length = 1508 Score = 150 bits (379), Expect = 7e-35, Method: Composition-based stats. Identities = 26/218 (11%), Positives = 68/218 (31%), Gaps = 8/218 (3%) Query: 29 LDIA-YGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRI 87 ++I + + I + S++K+ + F + + + +A +Y Sbjct: 1202 INIFSVASGHLYERMLNIMMVSVMKH-TNHTVKFWFIEQFLSPSFKSFLPHIAAEYGFEY 1260 Query: 88 KIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 ++ + K Y + F KV+++DAD I + + L+ Sbjct: 1261 EMVTYKWPHWLRGQTEKQREIWGYKILFLDVLFPLDLKKVIFVDADQIVRTDMYELVQHD 1320 Query: 148 FPDDKVAMVVT-EGQADWWEKRAHSLGVAGI---AKGYFNSGFLLINTAQWAAQQVSARA 203 + + + R G + Y S +++ ++ R Sbjct: 1321 LQGAPYGFTPMGDSRTEMEGFRFWKTGYWANFLRGRPYHISALYVVDLVRFRQLAAGDRL 1380 Query: 204 I-AMLNEPEIIKKITHPDQDVLNML-LADKLIFADIKY 239 + +++ DQD+ N + + ++ Sbjct: 1381 RQQYHSLSADPNSLSNLDQDLPNNMQFNLPIHSLPQEW 1418 >UniRef50_Q8LF94 Avr9/Cf-9 rapidly elicited protein 231 n=13 Tax=Magnoliophyta RepID=Q8LF94_ARATH Length = 351 Score = 150 bits (378), Expect = 8e-35, Method: Composition-based stats. Identities = 48/288 (16%), Positives = 98/288 (34%), Gaps = 18/288 (6%) Query: 26 NLCLDIAYGTDKNFLFGCGISIASILKYNE-GSRLCFHIFTDYFGDDD--RKYFDALALQ 82 + +A D ++ G ++ S+L+++ + FH D R + Sbjct: 62 RRAVHMAMTLDAAYIRGSVAAVLSVLQHSSCPENIVFHFVASASADASSLRATISSSFPY 121 Query: 83 YKTRIKIYLINGDRLRSLPSTKN--WTHAIYFRFVIADYFINKAPKVLYLDADIICQGTI 140 + ++ I+ S ++ Y R +AD +V+YLD+D+I I Sbjct: 122 LDFTVYVFNISSVSRLISSSIRSALDCPLNYARSYLADLLPPCVRRVVYLDSDLILVDDI 181 Query: 141 EPLINFSFPDDKVAMVVTEGQADW--------WEKRAHSLGVAGIAKGYFNSGFLLINTA 192 L D V A++ W SL A YFN+G ++I+ + Sbjct: 182 AKLAATDLGRDSVLAAPEYCNANFTSYFTSTFWSNPTLSLTFADRKACYFNTGVMVIDLS 241 Query: 193 QWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKES 252 +W ++R + + ++ ++ A + + ++N Q L Sbjct: 242 RWREGAYTSRIEEWMAMQKRMRIYELGSLPPFLLVFAGLIKPVNHRWN-QHGLGGDNFRG 300 Query: 253 FINPVTNDTI-FIHYIGPTKPWHDWAWDYPV--SQAFMEA-KNASPWK 296 + + +H+ G KPW P + +P+ Sbjct: 301 LCRDLHPGPVSLLHWSGKGKPWARLDAGRPCPLDALWAPYDLLQTPFA 348 >UniRef50_UPI000180BB9E PREDICTED: similar to Glycogenin 1 n=1 Tax=Ciona intestinalis RepID=UPI000180BB9E Length = 497 Score = 150 bits (378), Expect = 9e-35, Method: Composition-based stats. Identities = 37/251 (14%), Positives = 76/251 (30%), Gaps = 41/251 (16%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIY 90 + T+ + G + S+ ++ + + T R L + I + Sbjct: 7 VTLATNDRYCEGALVVAQSLRRHKTRREIVV-LITPQVSTICRSRLSVL---FDHVIVVD 62 Query: 91 LINGDRLRSLPSTKNWTH-AIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFP 149 +++ + L + + + K ++LDAD + ++ L Sbjct: 63 VLDSNDEAHLALLHRPELGVTFTKLHCWRLV--QYTKCVFLDADTLVLTNVDELFE---- 116 Query: 150 DDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNE 209 ++++ G D FNSG + + ++ Sbjct: 117 RNELSASPDAGWPDM-----------------FNSGVFVFTPS-------METYNDLIKL 152 Query: 210 PEIIKKITHPDQDVLNMLL-----ADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFI 264 + DQ +LN +D YN + Y +F DT + Sbjct: 153 ADTDGSFDGGDQGLLNSYFSEWSTSDTSKRLPFLYNMHSTATYTYSPAFAQ-YGKDTKIV 211 Query: 265 HYIGPTKPWHD 275 H+IG KPW+ Sbjct: 212 HFIGFVKPWNH 222 >UniRef50_UPI0001757CC2 PREDICTED: similar to glycogenin n=1 Tax=Tribolium castaneum RepID=UPI0001757CC2 Length = 512 Score = 149 bits (377), Expect = 1e-34, Method: Composition-based stats. Identities = 39/252 (15%), Positives = 79/252 (31%), Gaps = 41/252 (16%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIY 90 + T+ ++ G + S+ + +L + T + R A + ++ Sbjct: 7 VTLATNDSYSLGALVLAHSLKQVGSKHQLAV-LVTPGVTNPMRAKL---ATVFDLVQEVN 62 Query: 91 LINGDRLRSLPSTKNWTH-AIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFP 149 +++ +L K + + + K ++LDAD + + L Sbjct: 63 ILDSKDESNLRLLKRPELGVTFTKLHCWRL--TQFDKCVFLDADTLVLQNCDELFERE-- 118 Query: 150 DDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNE 209 +++ G D FNSG + + + + + E Sbjct: 119 --ELSAAPDVGWPDC-----------------FNSGVFVFRPS----NETYDKLVQFAVE 155 Query: 210 PEIIKKITHPDQDVLNMLLADK-----LIFADIKYNTQFSLNYQLKESFINPVTNDTIFI 264 DQ +LN+ +D YN + Y +F D I Sbjct: 156 K---GSFDGGDQGLLNLYFSDWATKDISKHLPFIYNLCSTACYSYLPAFKQ-FGADAKII 211 Query: 265 HYIGPTKPWHDW 276 H+IG +KPW + Sbjct: 212 HFIGSSKPWLQY 223 >UniRef50_Q4E3K0 UDP-glucose:glycoprotein glucosyltransferase n=2 Tax=Trypanosoma cruzi RepID=Q4E3K0_TRYCR Length = 1668 Score = 149 bits (377), Expect = 1e-34, Method: Composition-based stats. Identities = 31/252 (12%), Positives = 74/252 (29%), Gaps = 15/252 (5%) Query: 21 KVETENLCLDIA-YGTDKNFLFGCGISIASILKYN------EGSRLCFHIFTDYFGDDDR 73 K + + L+I + + + I S+++ + +R+ F + ++ + Sbjct: 1371 KSKPDRPTLNIFSVASGHLYERFLRMMIHSVMRTSFDVHGANTTRIKFWLIENFLSPQFK 1430 Query: 74 KYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDAD 133 LA Y + + K T Y + F +V+++DAD Sbjct: 1431 TLVPLLAKHYGFDVGFVTYRWPWWLHKQTEKQRTIWAYKVLFLDVLFPLDVDRVIFVDAD 1490 Query: 134 IICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAG------IAKGYFNSGFL 187 + L N + A + + G K Y S Sbjct: 1491 QTVLADLHELYNMDIGNAPTAYTPFCRKHPNPATKNFRFWDHGYWLEHLHGKPYHISAIY 1550 Query: 188 LINTAQWAAQQVSARA-IAMLNEPEIIKKITHPDQDVLNMLLAD-KLIFADIKYNTQFSL 245 L++ + A + + + + DQD+ N + + ++ + Sbjct: 1551 LVDLRRLRAIAGGDKYRLVYSRLSSDPNSLANLDQDLPNFIQDQVPIYSLPEEWLWCETW 1610 Query: 246 NYQLKESFINPV 257 ++ + Sbjct: 1611 CGAESKARAKTI 1622 >UniRef50_C7XX93 Glycosyl transferase, family 8 n=1 Tax=Lactobacillus coleohominis 101-4-CHN RepID=C7XX93_9LACO Length = 398 Score = 149 bits (377), Expect = 1e-34, Method: Composition-based stats. Identities = 49/271 (18%), Positives = 90/271 (33%), Gaps = 34/271 (12%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIY 90 I D ++ +I SI+ + R+ ++ + + Q + + Sbjct: 7 IVLSGDNHYTAQITTTIKSIVYHL--RRVKIYLINSDIPQEYFFNLNLRLKQLDSELVDL 64 Query: 91 LINGDRLRSLPSTK-NWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFP 149 IN + + S K + + Y R +I + LY+D+D I +I L Sbjct: 65 KINPELFSNAESPKAHISKITYGRLMIPQLV--TEDRALYIDSDAIVDQSISELWTMDLG 122 Query: 150 DDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNE 209 D +A V AD FN+G +L N + + + Sbjct: 123 DYPIAAVHDVFLADI-----------------FNAGIILFNNKKLREDP---DLVDNMLA 162 Query: 210 PEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNY------QLKESFINPVTN--DT 261 K I DQ VLN + + ++YN + + + + N Sbjct: 163 AAQQKGILDADQTVLNQFFNHQYLELGLEYNYVIGYDRDVSLAPRNAPGYFEKMLNCPQP 222 Query: 262 IFIHYIGPTKPWHDWAWDYPVSQAFMEAKNA 292 IHY P KPW+ + + + + + N Sbjct: 223 KIIHYASPDKPWNLQSAGR-MREKWWQYHNL 252 >UniRef50_Q6BJN0 DEHA2G01232p n=3 Tax=Saccharomycetaceae RepID=Q6BJN0_DEBHA Length = 1532 Score = 149 bits (377), Expect = 1e-34, Method: Composition-based stats. Identities = 36/238 (15%), Positives = 79/238 (33%), Gaps = 9/238 (3%) Query: 10 EFLNSVIDYDHKVETENLCLDIA-YGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYF 68 F+ S++ ++ ++I + + I AS++ + + F I +Y Sbjct: 1202 SFMKSLLKSKAPTTKKHADINIFTIASGHLYERFLSIMTASVMAH-TDKSVKFWIIENYI 1260 Query: 69 GDDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVL 128 +K LA +Y ++ K T Y + F KV+ Sbjct: 1261 SSHFKKLLPLLAQEYNFEYELITYKWPNWLRFQREKQRTIWGYKILFLDVLFPQDLKKVI 1320 Query: 129 YLDADIICQGTIEPLINFSFPDDKVAMVVTEGQAD-----WWEKRAHSLGVAGIAKGYFN 183 ++DAD I + ++ L++ + K+ + V Y Sbjct: 1321 FVDADQIARTDMKELVDLDLEGAPYGFTPMCDSRKDMEGFRFWKQGYWAHVLKDGLKYHI 1380 Query: 184 SGFLLINTAQWAAQQVSARAIA-MLNEPEIIKKITHPDQDVLNMLLAD-KLIFADIKY 239 S +++ ++ A R A +++ DQD+ N + K+ ++ Sbjct: 1381 SALYVVDLDKFRALSAGDRLRAHYQKLSSDPNSLSNLDQDLPNNMQNKIKIHSLPQEW 1438 >UniRef50_Q09140 UDP-glucose:glycoprotein glucosyltransferase n=1 Tax=Schizosaccharomyces pombe RepID=UGGG_SCHPO Length = 1448 Score = 149 bits (377), Expect = 1e-34, Method: Composition-based stats. Identities = 27/243 (11%), Positives = 71/243 (29%), Gaps = 10/243 (4%) Query: 6 FQETEFLNSVIDYDH--KVETENLCLDIA-YGTDKNFLFGCGISIASILKYNEGSRLCFH 62 +F + + + + ++I + + I S++++ ++ F Sbjct: 1132 LSSHKFFDKIKKSLSFFNFKRKEASINIFSVASGHLYERFLYIMTKSVIEH-TDKKVKFW 1190 Query: 63 IFTDYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFIN 122 ++ + A+A +Y + N K Y + F Sbjct: 1191 FIENFLSPSFKSSIPAIAKKYNFEYEYITYNWPHWLRKQEEKQREIWGYKILFLDVLFPL 1250 Query: 123 KAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQAD----WWEKRAHSLGVAGIA 178 + KV+Y+DAD I + ++ L++ + + + Sbjct: 1251 ELHKVIYVDADQIVRADLQELMDMDLHGAPYGYTPMCDSREEMEGFRFWKKGYWKKFLRG 1310 Query: 179 KGYFNSGFLLINTAQWAAQQVSARA-IAMLNEPEIIKKITHPDQDVLNMLLA-DKLIFAD 236 Y S +++ ++ +++ DQD+ N L + Sbjct: 1311 LKYHISALYVVDLDRFRKMGAGDLLRRQYQLLSADPNSLSNLDQDLPNHLQHLIPIYSLP 1370 Query: 237 IKY 239 + Sbjct: 1371 QDW 1373 >UniRef50_Q8T191 Probable UDP-glucose:glycoprotein glucosyltransferase A n=2 Tax=Dictyostelium discoideum RepID=UGGG_DICDI Length = 1681 Score = 149 bits (377), Expect = 1e-34, Method: Composition-based stats. Identities = 35/260 (13%), Positives = 88/260 (33%), Gaps = 14/260 (5%) Query: 2 QQVFFQETEFLNSVIDYDHKVETENLCLDIA-YGTDKNFLFGCGISIASILKYNEGSRLC 60 +F + + +SV + K + + I + + I + S++K S + Sbjct: 1344 SNLFSSKNDATDSVATHQKKSNLD--TIHIFSVASGHLYERFLKIMMLSVVKN-TESPIK 1400 Query: 61 FHIFTDYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYF 120 F +Y +++ +A +Y + ++ + K Y + F Sbjct: 1401 FWFLKNYLSPAFKEFIPEMAKEYGFQYELVTYKWPWWLRKQTEKQRIIWSYKILFLDVLF 1460 Query: 121 INKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQA------DWWEKRAHSLGV 174 PK++++DAD + + ++ L + + +W+ + Sbjct: 1461 PLDVPKIIFVDADQVVRTDLKELWDMDLHGASLGYTPFCDSNKDTEGFRFWKSGYWRQHL 1520 Query: 175 AGIAKGYFNSGFLLINTAQWAAQQVSARAIA-MLNEPEIIKKITHPDQDVLNMLLAD-KL 232 AG + Y S +++ ++ + A + + DQD+ N L ++ Sbjct: 1521 AG--RSYHISALYVVDLVRFRRLAAGDQLRATYDQLSRDPNSLANLDQDLPNYLQHYVRI 1578 Query: 233 IFADIKYNTQFSLNYQLKES 252 ++ + Q +S Sbjct: 1579 HSLPQEWLWCETWCDQESKS 1598 >UniRef50_C6H742 UDP-glucose:glycoprotein glucosyltransferase n=1 Tax=Ajellomyces capsulatus H143 RepID=C6H742_AJECH Length = 1728 Score = 149 bits (376), Expect = 2e-34, Method: Composition-based stats. Identities = 29/232 (12%), Positives = 69/232 (29%), Gaps = 8/232 (3%) Query: 15 VIDYDHKVETENLCLDIA-YGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDR 73 + + + ++I + + I + S++K+ + S + F + + Sbjct: 1403 LHRLAGPAQGTHADINIFSVASGHLYERMLNIMMVSVMKHTKHS-VKFWFIEQFLSPSFK 1461 Query: 74 KYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDAD 133 + LA +Y ++ + K Y + F KV+++DAD Sbjct: 1462 SFLPHLAAEYGFSYEMVTYKWPHWLRAQTEKQRIIWGYKILFLDVLFPLSLDKVIFVDAD 1521 Query: 134 IICQGTIEPLINFSFPDDKVAMVVTEGQAD-WWEKRAHSLGVAGI---AKGYFNSGFLLI 189 I + + L+ R G Y S ++ Sbjct: 1522 QIVRTDMYELVTLDLEGAPYGFTPMCDSRTSMEGFRFWKQGYWKNFLRGLPYHISALYVV 1581 Query: 190 NTAQWAAQQVSARAI-AMLNEPEIIKKITHPDQDVLNMLLADK-LIFADIKY 239 + ++ A R + +++ DQD+ N + + + Sbjct: 1582 DLKRFRAIAAGDRLRGQYHTLSADPQSLSNLDQDLPNNMQRMLPIKSLPQDW 1633 >UniRef50_C4R603 Protein required for beta-1,6 glucan biosynthesis n=2 Tax=Pichia pastoris GS115 RepID=C4R603_PICPG Length = 1450 Score = 149 bits (375), Expect = 2e-34, Method: Composition-based stats. Identities = 33/235 (14%), Positives = 80/235 (34%), Gaps = 9/235 (3%) Query: 13 NSVIDYDHKVETENLCLDIA-YGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDD 71 + + + + +N ++I + + I S++K+ + + + F + +Y Sbjct: 1148 KFLSSWRKQEQPKNADINIFTVASGHLYERFLSIMTNSVMKHTKHT-VKFWLIENYMSPT 1206 Query: 72 DRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLD 131 +K LA ++ ++ K T Y + F KV+++D Sbjct: 1207 FKKNLPFLAREFGFDYELVNYKWPAWLRGQREKQRTIWGYKILFLDVLFPQSLDKVIFVD 1266 Query: 132 ADIICQGTIEPLINFSFPDDKVAMVVTEGQAD-----WWEKRAHSLGVAGIAKGYFNSGF 186 AD I + ++ L++ + + K+ + + G Y S Sbjct: 1267 ADQIVRTDLKELVDLDLEGAPYGYTPMCNDREEMEGFRFWKQGYWQKLLGDTLKYHISAL 1326 Query: 187 LLINTAQWAAQQVSARAIA-MLNEPEIIKKITHPDQDVLNMLLAD-KLIFADIKY 239 +I+ + R + +++ DQD+ N L K+ ++ Sbjct: 1327 YVIDLKTFRQIAAGDRLRQHYQQLSQDPNSLSNLDQDLPNNLQHQIKIFSLPQEW 1381 >UniRef50_B0CRB8 Glycosyltransferase family 8 protein n=3 Tax=Fungi/Metazoa group RepID=B0CRB8_LACBS Length = 1027 Score = 148 bits (374), Expect = 2e-34, Method: Composition-based stats. Identities = 44/300 (14%), Positives = 98/300 (32%), Gaps = 51/300 (17%) Query: 31 IAYGTDKNFLFGCGISIASILK-YNEGS---RLCF----HIFTDYFGDDDRKYFDALALQ 82 + + ++L G +A++ +N + F + + K + Sbjct: 8 VTLVSSDSYLPGALTLVAALKDLHNSPPVEPEVDFQTICLVTPESVDVSTIKLLRS---A 64 Query: 83 YKTRIKIYLINGDRLRSLPSTKNWTHAI-YFRFVIADYFINKAPKVLYLDADIICQGTIE 141 + I + ++ + + L + I + K+++LDAD++ +I Sbjct: 65 FDVVIGVEILEHENTKGLKLLGRPDLTTVLTKLHIFRL--TQYQKIIFLDADVLPIRSIS 122 Query: 142 PLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSA 201 L N + + V G D FNSG L+++ + Sbjct: 123 HLFNLP---HEFSAVPDVGWPDI-----------------FNSGVLVLSPGE-------D 155 Query: 202 RAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDT 261 + + + DQ +LN YNT + Y ++ + Sbjct: 156 KFNQLNELLKSKGSWDGGDQGILNEWRGGDWNRLSFTYNTTPTAAYTYAPAY-ERYGSQI 214 Query: 262 IFIHYIGPTKPWHDWAWDYPVSQA--------FMEAKNASPWKNTALLKPNNSNQLRYSA 313 IH+IG KPW+ + P + + + +++ ++ ++ RYS+ Sbjct: 215 SAIHFIGKNKPWNSISSHSPQQSYDYESLVDKWFDVYDK-HYRSEPIIPQSSFALQRYSS 273 >UniRef50_A5BZU1 Putative uncharacterized protein n=1 Tax=Vitis vinifera RepID=A5BZU1_VITVI Length = 648 Score = 148 bits (374), Expect = 3e-34, Method: Composition-based stats. Identities = 45/284 (15%), Positives = 92/284 (32%), Gaps = 19/284 (6%) Query: 13 NSVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIAS-ILKYNEGSRLCFHIFTDYFGDD 71 + + K+ + A +D + + I S +L +E + FHI TD Sbjct: 362 QKRVVLNKKLLEDPSLYHYAIFSDN--VLATSVVINSTMLXASEPEKHVFHIVTDKLSFA 419 Query: 72 DRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLD 131 K + + K I++ I+ + K + + RF + + + K K+L+LD Sbjct: 420 AMKMWFLVNSPAKVTIQVENID-----DFKNPKYLSMLNHLRFYLPEVYP-KLEKILFLD 473 Query: 132 ADIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVA------GIAKGYFNSG 185 DI+ Q + PL + A V T ++ + + + G Sbjct: 474 DDIVVQKDLTPLWSLDMQGMVNAAVETCKESFHRFDKYLNFSHPKISENFDPNACGWAFG 533 Query: 186 FLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSL 245 + + +W + ++ + E + + D ++ L Sbjct: 534 MNMFDLKEWRKRNMTGIYHYWQDMNEDRTLWKLGSLPPGLITFYNLTYPLDRSWHVL-GL 592 Query: 246 NYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEA 289 Y ++ +HY G KPW + A + Sbjct: 593 GYD--PQLNQTEIDNAAVVHYNGNYKPWLELAI-AKYKSYWSRY 633 >UniRef50_A5DLS6 Putative uncharacterized protein n=2 Tax=Pichia guilliermondii RepID=A5DLS6_PICGU Length = 390 Score = 148 bits (374), Expect = 3e-34, Method: Composition-based stats. Identities = 44/274 (16%), Positives = 83/274 (30%), Gaps = 36/274 (13%) Query: 32 AYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIYL 91 T++++L G ++ + + D + + Y I I Sbjct: 6 TLLTNESYLPGALTLAHTLRSLGTQYPVVVLLDETQVSDRSLQLLE---AAYDRIIPISD 62 Query: 92 INGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLIN--FSFP 149 + + + ++ + ++LYLD D++ ++ L + + Sbjct: 63 RLVTSPVDDRLGRPELAVTFSKLLLWN---ESYDQILYLDTDVLPLANVDHLFDEGAALT 119 Query: 150 DDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNE 209 ++A G D FNSG LL QV + + + Sbjct: 120 PRQIAASPDSGWPDI-----------------FNSGVLLFKPDP----QVYSDLVEFAS- 157 Query: 210 PEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIGP 269 DQ +LN A YN + +YQ +F D +HYIG Sbjct: 158 -GSDSSFDGADQGLLNEFFAGNWHRLPFLYNVTPTESYQYVPAFHRFF-KDIKILHYIGQ 215 Query: 270 TKPWHDWAW--DYPVSQAFMEAKNASPWKNTALL 301 KPWH + + + S + + Sbjct: 216 IKPWHSSTNIDHFRFHHLWWD--RFSEFFDKETK 247 >UniRef50_Q17VR5 Lipopolysaccharide biosynthesis protein n=19 Tax=Helicobacter RepID=Q17VR5_HELAH Length = 405 Score = 148 bits (373), Expect = 3e-34, Method: Composition-based stats. Identities = 57/376 (15%), Positives = 114/376 (30%), Gaps = 78/376 (20%) Query: 25 ENLCLDIAYGTDKNFLFGCGISIASILKYNEGSR------LCFHIFTDYFGDDDRKYFDA 78 +++ + I D N+ G+S+ S+L + R H D ++ + Sbjct: 3 DSVIIPIVVAFDNNYCIPAGVSLYSMLANAKTERERVKLFYKIHCLVDGLSAENIEKLKE 62 Query: 79 LALQYK--TRIKIYLINGDRLRS-----------------------------------LP 101 + + ++ I+ Sbjct: 63 TLAPFSAFSSVEFLEISTHNTPKENQEIKKNQTIKSDHYQNIDPIIANKIEELFTKLSNY 122 Query: 102 STKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQ 161 S K ++ I R ++A F + K++ D D + G I V + Sbjct: 123 SQKRFSKMIMCRLLLASLFP-QYDKMIMFDVDTLFVGDISESFFIPLEAHYFGAVREKDL 181 Query: 162 ADWWEKRAHSLGVAGIAK---------------------GYFNSGFLLINTAQWAAQQVS 200 A L + YFN+GFL +N W + + Sbjct: 182 IAMNRNSAKDLYELRQRRAKSIGVANAFPNLEEAQILFDNYFNAGFLALNLKLWRKENLE 241 Query: 201 ARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTND 260 + I +K+ DQD L + +++ YN S P + Sbjct: 242 NQLIGFFILKN--EKLLFNDQDALCFVCRGRILELPYPYNAHPSFLDTPS----FPSIKE 295 Query: 261 TIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKKH 320 +H+ G KPW ++ ++ + E +P+K+ P + H+ K+ Sbjct: 296 VCMLHFWG-DKPWKIFSV--FGAKKWHEVLMQTPFKDKYFNTPFLDHLFN----HIQNKN 348 Query: 321 RYLKGFSNYLFYFIEK 336 L+ F+ L + ++ Sbjct: 349 NKLRTFNKALSFVDKR 364 >UniRef50_Q873M5 UDP-Glc:glycoprotein glucosyltransferase n=2 Tax=Yarrowia lipolytica RepID=Q873M5_YARLI Length = 1470 Score = 148 bits (373), Expect = 3e-34, Method: Composition-based stats. Identities = 29/241 (12%), Positives = 75/241 (31%), Gaps = 9/241 (3%) Query: 7 QETEFLNSVIDYDHKVETENLCLDIA-YGTDKNFLFGCGISIASILKYNEGSRLCFHIFT 65 + + + + + ++I + + I AS++ + + + + F + Sbjct: 1142 DKAKLWSKLKKSTGVSTKKQADINIFTVASGHLYERFLSIMTASVMAHTDHT-VKFWLIE 1200 Query: 66 DYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAP 125 ++ + + LA Y ++ + K Y + F Sbjct: 1201 NFLSASFKAFLPHLAAHYGFEYELVTYQWPHWLRGQTEKQRQIWGYKILFLDVLFPQDLE 1260 Query: 126 KVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQAD-----WWEKRAHSLGVAGIAKG 180 +V+++D+D I + + L+ + K+ + G Sbjct: 1261 RVIFIDSDQIVRTDLYELVEMDLEGAPYGFTPMCDSRKEMDGFRFWKQGYWDTFLGDDLV 1320 Query: 181 YFNSGFLLINTAQWAAQQVSARAI-AMLNEPEIIKKITHPDQDVLNMLLAD-KLIFADIK 238 Y S +++ + AQQ+ R +++ DQD+ N L + Sbjct: 1321 YHISALFVVDLKVFRAQQIGDRLRVHYHQLSADPASLSNLDQDLPNNLQRQVPIFSLPQD 1380 Query: 239 Y 239 + Sbjct: 1381 W 1381 >UniRef50_Q0I2Z7 Lipopolysaccharide biosynthesis protein, glycosyltransferase, family 8 n=1 Tax=Haemophilus somnus 129PT RepID=Q0I2Z7_HAES1 Length = 354 Score = 147 bits (372), Expect = 4e-34, Method: Composition-based stats. Identities = 62/348 (17%), Positives = 118/348 (33%), Gaps = 62/348 (17%) Query: 29 LDIAYGTDKNFLFGCGISIASIL----KYNEGSRLCFHIFTDYFGDDDRKYFDALALQYK 84 ++I + D N+ +++ SI+ K NE + F++ + Y LA + Sbjct: 1 MNILFACDDNYAKYLAVTMLSIIHARDKNNECYTIHFYLLDMGISTVAKDYCLELANKNN 60 Query: 85 TRIKIYLINGDRLRSLP-STKNWTHAIYFRFVIADYFIN-KAPKVLYLDADIICQGTIEP 142 + I I+ P + + + + Y R +A+Y K++YLD DI+ ++ P Sbjct: 61 CHLDIVPISISDFEKFPRTIEYISLSTYARLNLANYLKKFNLTKIIYLDIDILVNHSLLP 120 Query: 143 LINFSFPDDKVAMVVTEGQADWWEKR---------------------------------- 168 L N + + + + Sbjct: 121 LWNTDLGNKAIGACYDAFIESQEKSKRMSSQSVSQSVSQSVSQSVSQSVSQSVSQSVSQS 180 Query: 169 ---------AHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIK-KITH 218 + YFN+G LLIN +W V +++ + + + + Sbjct: 181 VSQSVSQSDYKTKLHLPNTHFYFNAGVLLINVVEWEKCHVFEKSLQWIEYCKRNNIEFLY 240 Query: 219 PDQDVLNMLLADKLIFADIKYNTQFSLNYQLKE------SFINPVTNDTIFIHYIGPTKP 272 DQD+LN + A+ + + D++YN + +LK + T IHY+GP K Sbjct: 241 QDQDILNAIFANNVKYLDLRYNFTANALNRLKRVSKKELNQYEEATMPLAIIHYVGPKKS 300 Query: 273 WHDWAWDYPVSQAFMEAKNAS-----PWKNTALLKPNNSNQLRYSAKH 315 WH+ + F WK + + +H Sbjct: 301 WHEKCSMLK-ANLFCHLFQQLENPPKEWKIENVPFIRKLKRFAKDLRH 347 >UniRef50_Q582S2 UDP-glucose:glycoprotein glucosyltransferase, putative n=2 Tax=Trypanosoma brucei RepID=Q582S2_9TRYP Length = 1675 Score = 147 bits (372), Expect = 5e-34, Method: Composition-based stats. Identities = 34/235 (14%), Positives = 74/235 (31%), Gaps = 15/235 (6%) Query: 20 HKVETENLCLDIA-YGTDKNFLFGCGISIASILKYNEG------SRLCFHIFTDYFGDDD 72 + L+I + + I + ++++ + +R+ F + ++ Sbjct: 1355 RSERPKFPTLNIFTVASGHLYERFLRIMMHTVMRTSSDVHGANTTRIKFWLIENFLSPQF 1414 Query: 73 RKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDA 132 ++ LA Y + + + K T Y + F +V+++DA Sbjct: 1415 KELVPLLAEHYGFDVGFVTYRWPWWLNKQTEKQRTIWAYKILFLDVLFPLNVDRVIFVDA 1474 Query: 133 DIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGI------AKGYFNSGF 186 D I Q + L N + +A + G K Y S Sbjct: 1475 DQIVQADLHELYNMNIGAAAMAYTPFCREYPNDATTNFRFWDQGFWLSHLRGKPYHISAL 1534 Query: 187 LLINTAQWAAQQVSARAIA-MLNEPEIIKKITHPDQDVLNMLLADK-LIFADIKY 239 L+N + A + A E + + DQD+ N + + + ++ Sbjct: 1535 YLVNVQRLRAALGGDKYRATYARLSEDPGSLANLDQDLPNFMQDEMPIFSLPEEW 1589 >UniRef50_A8NCT1 Putative uncharacterized protein n=1 Tax=Coprinopsis cinerea okayama7#130 RepID=A8NCT1_COPC7 Length = 1624 Score = 147 bits (372), Expect = 5e-34, Method: Composition-based stats. Identities = 32/222 (14%), Positives = 67/222 (30%), Gaps = 8/222 (3%) Query: 25 ENLCLDIAYGTDKN-FLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQY 83 E ++I + I I S+LK + + + F ++ ++ A +Y Sbjct: 1261 EQAEINIFTVASGLLYERFASIMILSVLKNTKST-VKFWFIENFLSPSFLEFIPHFAKEY 1319 Query: 84 KTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPL 143 ++ + K Y + F KV+++DAD I + ++ L Sbjct: 1320 NFDYELVTYRWPSWLRAQTEKQRIIWAYKILFLDVLFPMDLKKVIFVDADQIVRADLKEL 1379 Query: 144 INFSFPDDKVAMVVT-EGQADWWEKRAHSLGVAG---IAKGYFNSGFLLINTAQWAAQQV 199 ++ + + R G K Y S +I+ ++ Sbjct: 1380 VDLDLQGAPYGYTPMGDDNKEMEGFRFWKTGYWKDFLQGKPYHISALYVIDLVRFRHMAA 1439 Query: 200 SARAI-AMLNEPEIIKKITHPDQDVLNMLLAD-KLIFADIKY 239 + + DQD+ N L + D + Sbjct: 1440 GDILRGQYQALSADPGSLANLDQDLPNNLQRQVPIFSLDEDW 1481 >UniRef50_C4Y414 Putative uncharacterized protein n=1 Tax=Clavispora lusitaniae ATCC 42720 RepID=C4Y414_CLAL4 Length = 1428 Score = 147 bits (371), Expect = 5e-34, Method: Composition-based stats. Identities = 30/260 (11%), Positives = 75/260 (28%), Gaps = 9/260 (3%) Query: 1 MQQVFFQETEFLNSVIDYDHKVETENLCLDIA-YGTDKNFLFGCGISIASILKYNEGSRL 59 +++ + E +++ + + + S++K G + Sbjct: 1122 LKRHHIYPRMKRTETHTSHLRAAKEQADINVFSIASGHLYEQLMSTMMLSVVKN-TGKSV 1180 Query: 60 CFHIFTDYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADY 119 F + ++ R+ LA +Y + + Y + Sbjct: 1181 KFWLIENFLSHGFRERVPGLAEKYGFEYEYVGYQWPAWLRQQKQLHRKVWGYKMLFLDTL 1240 Query: 120 FINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQA-----DWWEKRAHSLGV 174 F KV+++DAD I + ++ L+N + K + V Sbjct: 1241 FPADLDKVIFVDADQIARTDLKELVNIDLEGAPYGFAPMCDSRKEMEGYQFWKNGYWPTV 1300 Query: 175 AGIAKGYFNSGFLLINTAQWAAQQVSARAIA-MLNEPEIIKKITHPDQDVLNMLLAD-KL 232 Y S +++ + V + + +++ DQD+ N L + Sbjct: 1301 LKDDLKYHISALYVVDLRRLRETLVGDKLRSHYQKLSADPNSLSNLDQDLPNNLQRQVPI 1360 Query: 233 IFADIKYNTQFSLNYQLKES 252 ++ + +S Sbjct: 1361 HTLPQEWLWCETWCSDESKS 1380 >UniRef50_A1D472 Glycosyl transferase family 8 protein n=4 Tax=Trichocomaceae RepID=A1D472_NEOFI Length = 739 Score = 147 bits (370), Expect = 7e-34, Method: Composition-based stats. Identities = 44/264 (16%), Positives = 77/264 (29%), Gaps = 37/264 (14%) Query: 42 GCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIYLINGDRLRSLP 101 G + S+ ++L D K + Y I + +L Sbjct: 30 GAVVLAHSLRDNGTKAKLVALYTPDTLQYVTIKELQTV---YDEIIPVQTATNHTPANLW 86 Query: 102 STKNWT-HAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEG 160 A + + + + K++Y+D D++ + L+ ++ A G Sbjct: 87 LMDRPDLIATFTKIELWR--QTQFKKIVYIDCDVVAVRAPDELLTL---EEDFAAAPDVG 141 Query: 161 QADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPD 220 D FNSG +++ A+ E D Sbjct: 142 WPDI-----------------FNSGVMVLRPN-------LQDYYALKALAERGISFDGAD 177 Query: 221 QDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDY 280 Q +LNM + YN S NYQ ++ + IH+IG KPW+ Sbjct: 178 QGLLNMHFRN-WHRLSFTYNCTPSANYQYIPAY-KHFQSTISLIHFIGAQKPWNLPRQVL 235 Query: 281 PVSQAFMEAKNASPWKNTALLKPN 304 PV + + W Sbjct: 236 PVESPYNQLLGR--WWAIYDRHYR 257 >UniRef50_D1IU75 Whole genome shotgun sequence of line PN40024, scaffold_5.assembly12x (Fragment) n=7 Tax=Magnoliophyta RepID=D1IU75_VITVI Length = 364 Score = 146 bits (368), Expect = 1e-33, Method: Composition-based stats. Identities = 49/267 (18%), Positives = 100/267 (37%), Gaps = 18/267 (6%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNE-GSRLCFHIFTDYFGDDDRKYFDALALQYKTRI 87 + +A D ++L G ++ SIL++++ + FH + + + + + Sbjct: 63 VHVAITLDVHYLRGSMAAVHSILQHSQCPEDIFFHFL---VSETHLEILVR-STFPQLKF 118 Query: 88 KIYLINGDRLRSLPSTKNW----THAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPL 143 K+Y N + +R+L ST Y R +AD +V+YLD+D+I I L Sbjct: 119 KVYYFNPEIVRNLISTSVREALEHPLNYARNYLADLLEPCVRRVIYLDSDLIVVDDIYKL 178 Query: 144 INFSFPDDKVAM-------VVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAA 196 + S + +W ++ + G YFN+G ++I+ A+W Sbjct: 179 WSTSLGTRTIGAPEYCHANFTRYFTDKFWSEKRYYGTFDGRKPCYFNTGVIVIDLAKWRR 238 Query: 197 QQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINP 256 + R + + + ++ A + + ++N Q L + Sbjct: 239 FGFTKRIERWMEVQKNNRIYELGSLPPYLLVFAGHVAPIEHRWN-QHGLGGDNVKGSCRE 297 Query: 257 VTNDTI-FIHYIGPTKPWHDWAWDYPV 282 + + +H+ G KPW P Sbjct: 298 LHPGPVSLLHWSGSGKPWARLDMKAPC 324 >UniRef50_D2MYR1 Putative uncharacterized protein n=1 Tax=Campylobacter jejuni subsp. jejuni 414 RepID=D2MYR1_CAMJE Length = 383 Score = 145 bits (367), Expect = 1e-33, Method: Composition-based stats. Identities = 74/372 (19%), Positives = 125/372 (33%), Gaps = 71/372 (19%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYNEGSR----------------------------LC 60 I + +++ + I+SI+K + S+ Sbjct: 2 FHIILNLNDDYVKYASVLISSIVKNTDTSKTFAKICEENHNLTHILTLKQYNKSEEEGYV 61 Query: 61 FHIFTDYFGDDDRKYFD----ALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVI 116 FHI +D+ D R + LA Y IKIY+IN D R+ K Y+R ++ Sbjct: 62 FHILSDFISDKTRMKLEYLKENLAKIYPCDIKIYIINEDNFRNFLHWKG-NFVAYYRLMV 120 Query: 117 ADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQAD-----WWEKRAHS 171 K LY+DAD++C I L F D + V + + + + Sbjct: 121 GSILPPDIEKCLYIDADMLCFSDIRKLFLFDLEDKVLGAVADFATWNTRFLKFRKLKYLF 180 Query: 172 LGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADK 231 G ++ YFNSG LLI+ +W Q + + + +L K PDQD LN+++ + Sbjct: 181 KGFLKFSREYFNSGLLLIDLKEWRRQNIEKKCLDVLKYY----KCILPDQDALNIVIKEN 236 Query: 232 LIFADIKYNT--------------------------QFSLNYQLKESFINPVTNDTIFIH 265 I + +N + ++ + N +F+H Sbjct: 237 YIKLPLSFNCPTVCYATNYLNIICKDEISSFSKLDYFKEVGMMYSKNELLEALNKPLFLH 296 Query: 266 YIGPTKPW-HDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKKHRYLK 324 Y KPW + + S W+ A L + Y+K Sbjct: 297 Y--SEKPWARYYFLNSNYQPIIYSKDVFSMWQKEAFEVLCFKEDLILLKNKDFFQMNYIK 354 Query: 325 GFSNYLFYFIEK 336 + L K Sbjct: 355 FLAEKLSVIERK 366 >UniRef50_UPI00016E26D6 UPI00016E26D6 related cluster n=3 Tax=Takifugu rubripes RepID=UPI00016E26D6 Length = 421 Score = 145 bits (367), Expect = 1e-33, Method: Composition-based stats. Identities = 47/277 (16%), Positives = 83/277 (29%), Gaps = 46/277 (16%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIY 90 + T ++ G + S+ ++ + + T + AL + I + Sbjct: 40 VTLVTSDSYCMGAVVVARSLRRHGTTRGVVVMV-TPNVSEQS-STRGALHSVFDEVIMVD 97 Query: 91 LINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPD 150 I L S I F I + + + K ++LDAD + ++ L Sbjct: 98 RIESGDRLHLSSLGRPELGITF-TKIHCWTLTQYSKCVFLDADTLVLDNVDELFQRD--- 153 Query: 151 DKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEP 210 ++++ G D FNSG + + + A A+ Sbjct: 154 -ELSVAPDPGWPDC-----------------FNSGVFVFQPSLQTHASLRAHALQ----- 190 Query: 211 EIIKKITHPDQDVLNMLL-----ADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIH 265 DQ +LN AD YN S Y +F + H Sbjct: 191 --HGSFDGGDQGLLNSFFSSWPVADITKHLPFVYNLSSSCVYSYLPAF-QQFGHSAKIFH 247 Query: 266 YIGPTKPW---------HDWAWDYPVSQAFMEAKNAS 293 + G KPW D+ VS + E + + Sbjct: 248 FTGAVKPWSSSSFKKEGQPPCMDHFVSLWWKEYLSHT 284 >UniRef50_UPI0000587C70 PREDICTED: similar to MGC81998 protein n=1 Tax=Strongylocentrotus purpuratus RepID=UPI0000587C70 Length = 344 Score = 145 bits (367), Expect = 2e-33, Method: Composition-based stats. Identities = 47/306 (15%), Positives = 99/306 (32%), Gaps = 43/306 (14%) Query: 8 ETEFLNSVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDY 67 E +SV K + N +++ +D + L G ++ SI + + F++ D Sbjct: 42 NHETRHSVQRDLSKNSSSNGTINVLICSDGSTLGGMVAAMNSIYLNSRTH-IKFYLVVDT 100 Query: 68 FGDDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKV 127 D + + + K I + + L Y R F +V Sbjct: 101 DSLDHLSKWLSQSSLRKLDYAIKVFDESWLN------------YARLYFPKIFPGLTGRV 148 Query: 128 LYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQA----------------DWWEKRAHS 171 +++D+D I QG I L V + A ++ ++ S Sbjct: 149 IFVDSDTITQGDIAELNAIDIKPGHVVAFSDDCSAVTSRYGVIMNRYASYLNFGNEKLQS 208 Query: 172 LGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIKKITHPDQ-------DVL 224 LG+ + FN G + N +W Q ++A+ + + Q + Sbjct: 209 LGINPME-CSFNPGVFVANVDEWRKQNITAKLDYWVTVNSKED--VYGSQRGGGHSGPPM 265 Query: 225 NMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQ 284 ++ K +++ + L + + +H+ G KPW + + Sbjct: 266 MIVFYMKYSPLPPEWHIRH-LGVTTGARYSDAFLKAAKLLHWNGRFKPW---GHNSQHTL 321 Query: 285 AFMEAK 290 + + Sbjct: 322 IWEKYY 327 >UniRef50_O15488 Glycogenin-2 n=43 Tax=Fungi/Metazoa group RepID=GLYG2_HUMAN Length = 501 Score = 145 bits (365), Expect = 2e-33, Method: Composition-based stats. Identities = 37/255 (14%), Positives = 74/255 (29%), Gaps = 41/255 (16%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIY 90 + T+ + G + S+ ++ +L + T R + + I++ Sbjct: 40 VTLATNDIYCQGALVLGQSLRRHRLTRKLVV-LITPQVSSLLRVILSKV---FDEVIEVN 95 Query: 91 LINGDRLRSLPSTKNWTH-AIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFP 149 LI+ L K + K ++LDAD + ++ L + Sbjct: 96 LIDSADYIHLAFLKRPELGLTLTKLHCWTL--THYSKCVFLDADTLVLSNVDELFD---- 149 Query: 150 DDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNE 209 + + G D FNSG + + + + +L Sbjct: 150 RGEFSAAPDPGWPDC-----------------FNSGVFVFQPSL-----HTHKL--LLQH 185 Query: 210 PEIIKKITHPDQDVLNMLLADK-----LIFADIKYNTQFSLNYQLKESFINPVTNDTIFI 264 DQ +LN + YN + Y +F + + Sbjct: 186 AMEHGSFDGADQGLLNSFFRNWSTTDIHKHLPFIYNLSSNTMYTYSPAFKQ-FGSSAKVV 244 Query: 265 HYIGPTKPWHDWAWD 279 H++G KPW+ Sbjct: 245 HFLGSMKPWNYKYNP 259 >UniRef50_A9S2B3 Predicted protein (Fragment) n=1 Tax=Physcomitrella patens subsp. patens RepID=A9S2B3_PHYPA Length = 275 Score = 145 bits (365), Expect = 3e-33, Method: Composition-based stats. Identities = 55/269 (20%), Positives = 98/269 (36%), Gaps = 27/269 (10%) Query: 25 ENLCLDIAYGTDKNFLFGCGISIASILKYNE-GSRLCFHIFTDYFGDDDRKYFDALALQY 83 + IA D N+L G +I SIL + E S + FH + + + Sbjct: 7 NESLVHIAMTLDANYLRGSMAAIYSILLHAECASNVRFHFVA---TKEKKNKCKSF---- 59 Query: 84 KTRIKIYLINGDRLRSLPSTKN---WTHAIYFRFVIADYFINKAPKVLYLDADIICQGTI 140 R +Y + + L+ + S+ Y RF +A + +++YLD D++ G I Sbjct: 60 -CRSAMYFYSCELLKLIYSSDFVITQEPLNYARFYLAHMIDSCVKRIIYLDLDVLVLGRI 118 Query: 141 EPLINFSFPDDKV-------AMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQ 193 E L + + V A + ++W + + A YFNSG +LIN + Sbjct: 119 EELWMTNMGNSTVGTPEYCHANFPSYFTENFWINSSLASTFANKQPCYFNSGMMLINLER 178 Query: 194 WAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESF 253 W + ++ + + L + A + D ++N Q L + + Sbjct: 179 WRKTRCTSTLEYWMEVQKQQHIYELGSLPPLLLTFAGSIQAIDNRWN-QHGLGGDIVKGD 237 Query: 254 INPVTNDTIFIHYIGPTKPWHDWAWDYPV 282 +H+ G KPW P Sbjct: 238 CRS-------LHWSGGGKPWRRLDMHQPC 259 >UniRef50_B9WDQ8 Killer toxin-resistance protein, putative n=5 Tax=Candida RepID=B9WDQ8_CANDC Length = 1453 Score = 144 bits (364), Expect = 3e-33, Method: Composition-based stats. Identities = 35/239 (14%), Positives = 76/239 (31%), Gaps = 8/239 (3%) Query: 9 TEFLNSVIDYDHKVETENLCLDIA-YGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDY 67 + + + ++I + + I IAS+ K+N S + F I D+ Sbjct: 1117 YPRVKKSDNKKAMPMRRHAEINIFTIAGGQLYEKLTSIMIASVRKHNHRSTIKFWILEDF 1176 Query: 68 FGDDDRKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKV 127 + ++++Y + +K Y + F K+ Sbjct: 1177 VSPQFKHLMKLISIKYNVEYEFISYKWPNFLRRQKSKERIIWGYKILFLDVLFPQDLDKI 1236 Query: 128 LYLDADIICQGTIEPLINFSFPDDKVAMVVTEGQA-----DWWEKRAHSLGVAGIAKGYF 182 +++DAD IC+ + LIN + K + V Y Sbjct: 1237 IFIDADQICRADLTELINMDLEGAPYGFTPMCDSREEMEGYRFWKEGYWSDVLKDDLKYH 1296 Query: 183 NSGFLLINTAQWAAQQVSARAIA-MLNEPEIIKKITHPDQDVLNMLLAD-KLIFADIKY 239 S +++ ++ + + R A +++ DQD+ N + K+ + Sbjct: 1297 ISALFVVDLQKFRSIKAGDRLRAHYQKLSSDPNSLSNLDQDLPNNMQRSIKIFSLPQSW 1355 >UniRef50_C0X9Z7 Putative uncharacterized protein n=2 Tax=Lactobacillus gasseri JV-V03 RepID=C0X9Z7_9LACO Length = 416 Score = 144 bits (363), Expect = 5e-33, Method: Composition-based stats. Identities = 57/273 (20%), Positives = 101/273 (36%), Gaps = 25/273 (9%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIY 90 IA + ++ +I SIL + + H+ + + A Q +RI Sbjct: 5 IALSANYGYIDKIETTIKSILYNVKN--VEIHLLNYDIPQEWFANINRYANQIGSRIIDE 62 Query: 91 LINGDRLRSLPS-TKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFP 149 + + L L S K+ Y R +I KA +VLYLD+D++ I+ L + F Sbjct: 63 KFDPEELHDLNSGFKHINQMTYARLLIPKLI--KANRVLYLDSDLVVDDEIDELFSRKFN 120 Query: 150 DDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAA-QQVSARAIAMLN 208 K+ V R + + N+G LLIN + +S + + Sbjct: 121 GKKILAVTHIFDV-----RNKNESRVDLPVPSINAGVLLINNQELRKDHNLSEKLLDFAR 175 Query: 209 EPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQL---------KESFINPVTN 259 + + DQD +N D++ KYN Q + L + I Sbjct: 176 KNNFPQD----DQDTINNWFKDEIGSLSFKYNYQIGADRFLFWSNNSNTETATEILDKVK 231 Query: 260 DTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNA 292 + IHYI KP++ ++ + + + +N Sbjct: 232 NPKIIHYISDDKPFNIFSEGR-MRETWWFYRNL 263 >UniRef50_Q6Z5D6 Glycosyltransferase family-like n=6 Tax=Poaceae RepID=Q6Z5D6_ORYSJ Length = 726 Score = 144 bits (363), Expect = 5e-33, Method: Composition-based stats. Identities = 50/318 (15%), Positives = 93/318 (29%), Gaps = 47/318 (14%) Query: 13 NSVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNE-GSRLCFHIFTDYFGDD 71 N Y+ K+E + A +D + G + + S + + + FHI TD Sbjct: 404 NKHFPYEEKLE-DPKLQHYALFSDN--VLGAAVVVNSTIIHAKTPENHVFHIVTDKLNYA 460 Query: 72 DRKY------------------------------FDALALQYKTRIKIYL--INGDRLRS 99 + L Q+ D Sbjct: 461 AMRMWFLENSQGKAAIEVQNIEDFTWLNSSYSPVLKQLESQFMINYYFKTQQDKRDNNPK 520 Query: 100 LPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTE 159 + K + + RF + + F K KVL+LD DI+ Q + L + + T Sbjct: 521 FQNPKYLSILNHLRFYLPEIFP-KLNKVLFLDDDIVVQQDLSALWSIDLKGKVNGAIQTC 579 Query: 160 GQADWWEKRAHSLGVA------GIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEII 213 G+ R + + G + + ++W + ++ + E Sbjct: 580 GETFHRFDRYLNFSNPLIAKNFERRACGWAYGMNMFDLSEWRKRNITDVYHYWQEQNEHR 639 Query: 214 KKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIGPTKPW 273 + ++ D K++ L Y K + IHY G KPW Sbjct: 640 LLWKLGTLPAGLVTFWNQTFPLDHKWHLL-GLGY--KPNVNQKDIEGAAVIHYNGNRKPW 696 Query: 274 HDWAWDYPVSQAFMEAKN 291 + A + + + N Sbjct: 697 LEIAM-AKYRKYWSKYVN 713 >UniRef50_Q5M7A1 Hypothetical LOC496877 n=2 Tax=Xenopus (Silurana) tropicalis RepID=Q5M7A1_XENTR Length = 395 Score = 144 bits (362), Expect = 6e-33, Method: Composition-based stats. Identities = 42/250 (16%), Positives = 82/250 (32%), Gaps = 41/250 (16%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIY 90 + GT+ + G + S+ + +L I T R + + +++ Sbjct: 9 VTLGTNDIYCQGALVLGKSLRNHKTSRQLVVMI-TSQVTSRMRDVLSNI---FDEVVEVD 64 Query: 91 LINGDRLRSLPSTKNWTH-AIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFP 149 +++ L K + +F + K +Y+DAD I I+ L + Sbjct: 65 ILDSADSVHLSLMKRPELGITFTKFQCWTL--TQYTKCVYMDADTIVLCNIDELFDRD-- 120 Query: 150 DDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNE 209 + + G D FNSG + + +L+ Sbjct: 121 --EFSAAPDSGWPDC-----------------FNSGVFVFRPS-------VETFHKLLHF 154 Query: 210 PEIIKKITHPDQDVLNMLLADK-----LIFADIKYNTQFSLNYQLKESFINPVTNDTIFI 264 E+ DQ +LN ++ YN S Y K +F+ ++ + Sbjct: 155 AEVHGSFDGGDQGLLNSFFSNWATADISKHLPFIYNLSISSVYTYKPAFLQ-FGSEAKVV 213 Query: 265 HYIGPTKPWH 274 H++G KPW+ Sbjct: 214 HFLGTPKPWN 223 >UniRef50_A3YS36 Putative sugar transferase n=2 Tax=Campylobacter jejuni RepID=A3YS36_CAMJE Length = 459 Score = 144 bits (362), Expect = 7e-33, Method: Composition-based stats. Identities = 66/341 (19%), Positives = 115/341 (33%), Gaps = 36/341 (10%) Query: 30 DIAYGTDKNFLFGCGISIASILKYNEGS---RLCFHIFTDYFGDDDRKYF----DALALQ 82 I + + ++ + + SI+ S + CFHI + D+ K L+ Sbjct: 3 HIVFNSSNEYIENLSVLMYSIIINTNKSNTKKYCFHILSSNINDNTCKKLTLLEKELSSI 62 Query: 83 YKTRIKIYLINGDRLRSLPSTKN-WTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIE 141 Y + IKIY IN + K+ ++ Y R ++A K LYLD D++ G I Sbjct: 63 YPSEIKIYHINDNLFYDYNIPKHEGSYNAYLRLMLASILSKDIKKCLYLDVDMLVLGDIS 122 Query: 142 PLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSA 201 L + D A V S + I +FNSG +LIN W + + + Sbjct: 123 ELFDLDLKDKVFAAVFILKHPWPNLNSKDSSEIFYIYGSHFNSGLMLINLDAWREKNIES 182 Query: 202 RAIAMLNEPEIIKKITHPDQDVLNMLL-ADKLIFADIKYNTQFSLNYQ------------ 248 R+++ + + + D+ VLN +L D + +++N Sbjct: 183 RSLSFIKNYYVPYAV---DEYVLNAILSKDDIFSLKLEWNFLIGFRRLYLNNDLFFNKEE 239 Query: 249 --------LKESFINPVTNDTIFIHYIGP--TKPWHDWAWDYPVSQAFMEAKNASPWKNT 298 + +HY KPW + + + W + Sbjct: 240 GDKYKIICYSKEEFEKAFKKIKILHYTYLYMPKPWENVYSFIDDDYNLVYYEFYDAWWDM 299 Query: 299 ALLKPNNSNQLRYSAKHMLKKHR--YLKGFSNYLFYFIEKI 337 AL P + KK Y + S + +K Sbjct: 300 ALKTPIYGEHFAKKKREYEKKSLLTYAQAMSKKIKALEKKT 340 >UniRef50_A0ZYL4 Putative uncharacterized protein n=1 Tax=Archaeal BJ1 virus RepID=A0ZYL4_9CAUD Length = 286 Score = 143 bits (361), Expect = 7e-33, Method: Composition-based stats. Identities = 63/298 (21%), Positives = 120/298 (40%), Gaps = 23/298 (7%) Query: 27 LCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTD-YFGDDDRKYFDALALQYKT 85 + L++ Y + C IS S+L+ N+ + +I ++ + ++ + L + + Sbjct: 1 MTLNVCYIAGGDSWVPCYISAYSVLENNQDLDIHMYILSEEDNNNPFFEHVEYLYESHPS 60 Query: 86 -RIKIYLINGDRLRSLP-STKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPL 143 I+ ++ D+ LP K+ + +YF+ I VL LDAD IC G++ L Sbjct: 61 LEIEFIEVDMDQFDDLPAPGKHLSPGVYFKIAINRLLPTD-GNVLLLDADTICDGSLSSL 119 Query: 144 INFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARA 203 ++ +A + K + FN+G L +N +WA Q + R+ Sbjct: 120 LSLDLSGKVLAAAPS-------NKAETVRLGLQNNRAKFNAGVLYVNLQEWAKQDIEERS 172 Query: 204 IAMLNEPEIIKKITHPDQDVLNMLLA--DKLIFADIKYNTQFSLNYQLKESFINPVTNDT 261 + E E DQD LN L+ D + + +YN L + V ++ Sbjct: 173 RQYIEEHEP----ELNDQDALNALVNNPDDMEYIHPRYNATKLLVREF-----EMVDDEP 223 Query: 262 IFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKK 319 IHY GP KPW + + E + +P+++ + A+ +++ Sbjct: 224 TIIHYNGPDKPWRFVT-ERESGDLWWEYASKTPFRDYVPKDKGVKEIIFVRARSAMRR 280 >UniRef50_Q39T65 Glycosyl transferase, family 8 n=1 Tax=Geobacter metallireducens GS-15 RepID=Q39T65_GEOMG Length = 317 Score = 143 bits (360), Expect = 1e-32, Method: Composition-based stats. Identities = 42/310 (13%), Positives = 107/310 (34%), Gaps = 43/310 (13%) Query: 29 LDIAYGTDKNFLFGCGISIASILKY-NEGSRLCFHIFTDYFGDDDRKYFDALALQY-KTR 86 + + + D N++ ++ S+L N + F + + +++R + + Sbjct: 10 IPVFFAFDNNYVIPAAVAFHSLLANVNVSYKYHFIVLHEDISEENRDLLAQVVSLFSNAS 69 Query: 87 IKIYLI---NGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPL 143 ++ + + ++ ++T ++ + + K+++ D D++ + I + Sbjct: 70 VEFRDMGESFKNEWENIKGKGHYTKECLYKL-VPMLEFPQYDKIIWSDVDVVFKDDISDV 128 Query: 144 INFSFPDDKVAMVVTEGQ-ADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSAR 202 ++ +A V G+ ++E + I K +G L+ N + + Sbjct: 129 FFMLSEENYIAGVRVCGKLDKYYENMNMPAEIKSILKNGIGAGILVYNLKKMREDNIYDD 188 Query: 203 AIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKY----------------------- 239 M+ + + P+QD+LN++L DK+ + ++Y Sbjct: 189 I--MIALQGMSSIVVQPEQDILNIVLKDKIDYIPLRYCFCTYMYNLFKDRHKMKLKVKGN 246 Query: 240 --NTQF-------SLNYQLKESFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAK 290 N F + E + IHY TKPW+ ++ Sbjct: 247 LFNYLFKGYRKNLGFDTIYSEKELLEAFESPAIIHYATSTKPWNTLFTKR--KSDWLYCL 304 Query: 291 NASPWKNTAL 300 +P+ + Sbjct: 305 LKTPFWKRYI 314 >UniRef50_UPI000197AD97 hypothetical protein BACCOPRO_03221 n=1 Tax=Bacteroides coprophilus DSM 18228 RepID=UPI000197AD97 Length = 313 Score = 143 bits (360), Expect = 1e-32, Method: Composition-based stats. Identities = 51/323 (15%), Positives = 106/323 (32%), Gaps = 24/323 (7%) Query: 27 LCLDIAYGTDKNFLFGCGISIASILKYNEGS-RLCFHIFTDYFGDDDRKYFDALALQYKT 85 + I + D N + + I+S+L + I D ++ D L + Sbjct: 3 KTVPIVFAFDNNLILPACVCISSLLMNAKEETFYDIFILHSSKVDLHKEQLDELPKYFNR 62 Query: 86 RIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTI-EPLI 144 Y + + + T Y+R +I + + ++Y D D+I + + + Sbjct: 63 CRIQYRVVDNTFDQAFEIRGITTPTYYRLLIPELVP-EYDNIIYSDVDVIFRFDLSDIYF 121 Query: 145 NFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAI 204 + D VA V K+ + +G +++N+ + + R Sbjct: 122 HTDLNDSYVAGVNALVPFIPDMKKYYLKLGNVNIDSIIYAGNIILNSKKIREDNLVERFK 181 Query: 205 AMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSL-------NYQLKESFINPV 257 + K D DVLN+ K+ + + L++ + + Sbjct: 182 ELAK-----NKFHFQDLDVLNIACKGKITYLKPVFCLTTYFSELALRHRNLLRDFWSDKD 236 Query: 258 TNDTI---FIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAK 314 ++ + +HY G KPW + S + E SP+ + +L + Sbjct: 237 IDEALTEGIVHYNGQ-KPWKGICVN---SDIWWEYYRKSPFFDEKFYFEFFYTRLNELDQ 292 Query: 315 HMLKKHRYLKGFSNYLFYFIEKI 337 L + +K Y Y +I Sbjct: 293 --LSLWKRIKILIRYFVYGKREI 313 >UniRef50_Q2L3C5 Glycosyl transferase-like protein n=3 Tax=Magnoliophyta RepID=Q2L3C5_BRASY Length = 689 Score = 142 bits (359), Expect = 1e-32, Method: Composition-based stats. Identities = 47/317 (14%), Positives = 91/317 (28%), Gaps = 46/317 (14%) Query: 13 NSVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFG--- 69 ++ D+ + + E+ L N + + + S L + FHI TD Sbjct: 368 SNNKDFPNTEKLEDPKLHHYAVFSDN-VLAAAVVVNSTLVHATNH--VFHIVTDRLNYAA 424 Query: 70 ----------------DDDRKYFDALALQY---------KTRIKIYLIN----GDRLRSL 100 + + F L Y ++ I Y + D Sbjct: 425 MKMWFLANPLGKAAVQVQNIQEFTWLNSSYSPVLKQLGSRSTIDYYFRSGTARPDENPKF 484 Query: 101 PSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPDDKVAMVVTEG 160 + K + + RF + + F K KVL+LD D + Q + L + V T G Sbjct: 485 RNPKYLSILNHLRFYLPEIFP-KLNKVLFLDDDTVVQQDLSALWSIDLKGKVNGAVETCG 543 Query: 161 QADWWEKRAHSLGVA------GIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEPEIIK 214 + + + + G + + ++W Q ++ E Sbjct: 544 ETFHRFDKYLNFSNPIVANNFHPQACGWAFGMNMFDLSEWRKQNITDVYHTWQKLNEDRL 603 Query: 215 KITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIHYIGPTKPWH 274 + ++ D ++ L Y + IHY G KPW Sbjct: 604 LWKLGTLPAGLVTFWNRTFPLDRSWHLL-GLGYN--PNVNERDIRRASVIHYNGNLKPWL 660 Query: 275 DWAWDYPVSQAFMEAKN 291 + + + + Sbjct: 661 EIGLS-KYRKYWSRYVD 676 >UniRef50_Q22997 Unidentified vitellogenin-linked transcript protein 5, isoform a n=7 Tax=Caenorhabditis RepID=Q22997_CAEEL Length = 429 Score = 142 bits (357), Expect = 2e-32, Method: Composition-based stats. Identities = 38/268 (14%), Positives = 75/268 (27%), Gaps = 41/268 (15%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIY 90 I T+ N+ G + + S+ ++ + ++ RK + I Sbjct: 6 ITLATNDNYAQGALVLVHSLRTAGTTRKIH-CLISNEVSAPVRKQLEEHFD--DVSIVDV 62 Query: 91 LINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPD 150 + D + + + + K ++LDAD + + L Sbjct: 63 FNSNDSDNLRLIERPDLGVTFTKLHCWRL--TQYTKCVFLDADTLVLRNADELFTRP--- 117 Query: 151 DKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEP 210 + G D FNSG + +++ Sbjct: 118 -DFSAASDIGWPD-----------------SFNSGVFVYVPNN-------ETYRQLVDFA 152 Query: 211 EIIKKITHPDQDVLNMLLADKL-----IFADIKYNTQFSLNYQLKESFINPVTNDTIFIH 265 DQ +LN ++ YN Y ++ +T +H Sbjct: 153 VTHGSYDGGDQGLLNDFFSNWRDLPSEHRLPFIYNMTAGAFYTYAAAYKR-YGANTKIVH 211 Query: 266 YIGPTKPWHDWAWDY--PVSQAFMEAKN 291 +IG KPWH A + Q + + + Sbjct: 212 FIGSVKPWHGSAAVHTGEHFQQWQKIYH 239 >UniRef50_UPI00016B2258 glycosyl transferase, family 8 n=1 Tax=candidate division TM7 single-cell isolate TM7c RepID=UPI00016B2258 Length = 327 Score = 142 bits (357), Expect = 3e-32, Method: Composition-based stats. Identities = 55/338 (16%), Positives = 115/338 (34%), Gaps = 34/338 (10%) Query: 24 TENLCLDIAYGTDKNFLFGCGISIASILKYNEGSR-LCFHIFTDYFGDDDRKYFDALALQ 82 L++ Y +D N+ ISI S+++ N+ + + D F+ + Sbjct: 1 MNKGILNVIYQSDDNYAVVSAISIVSLMENNKHLKQINIFYLGHQLKKDSINKFNKMVGN 60 Query: 83 Y-KTRIKIYLING---DRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQG 138 Y I ++ + + +++ + K ++LY++ + G Sbjct: 61 YHNATITFVDVSSYPDELKEIGVKAWKGLYITWYKMLAFAKLDIKTDRILYINPHTVISG 120 Query: 139 TIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQ 198 ++ L+ F D+ +A+ + + + GYFN G +LIN +W + Sbjct: 121 ALDGLLELDFEDNVMALSYDATMVNAHKD----VIGLKPIDGYFNCGIMLINHKKWMKDK 176 Query: 199 VSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVT 258 + A+ L DQD+ N+ + ++YN + +I Sbjct: 177 IDAKMREHLRYNH----YEVADQDLCNVFFKGNIKKVGVEYNFSTVFYGYDIKKYIKANG 232 Query: 259 ----------------NDTIFIH--YIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTAL 300 IH + +PW + PV + + N +PWKN Sbjct: 233 FLPESFYSYDEIMESYYTPKIIHSQFGMNGRPWQQ-GNENPVGILWRKYLNLTPWKNAT- 290 Query: 301 LKPNNSNQLRYSAKHMLKKHRYLKGFSNYLFYFIEKIK 338 P + + +L + +K ++ + K K Sbjct: 291 -MPVAKKDMNWLLYDLLPQSIIVKLYAWAVNRKFAKTK 327 >UniRef50_A8P591 Glycogenin-1, putative n=2 Tax=Brugia malayi RepID=A8P591_BRUMA Length = 412 Score = 141 bits (355), Expect = 4e-32, Method: Composition-based stats. Identities = 41/269 (15%), Positives = 77/269 (28%), Gaps = 42/269 (15%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIY 90 + T + G + S+ +L I T R A + + + Sbjct: 6 VTLATSDGYAIGALVLAHSLKIQQTTKKLHCMITT-GVSQQLRDEL---AATFDSINLVN 61 Query: 91 LINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPD 150 +++ + +L + F I + + + K ++LDAD + + L + Sbjct: 62 ILDSNDTANLHLIGRPDLGVTF-TKIHCWRLTQYTKCIFLDADCLVIQNADELFDHD--- 117 Query: 151 DKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEP 210 +++ V G D FNSG + ++ + +LN Sbjct: 118 -ELSAVADIGWPDC-----------------FNSGVFVYKPSE-------QTYLDILNFA 152 Query: 211 EIIKKITHPDQDVLNMLLADK-----LIFADIKYNTQFSLNYQLKESFINPVTNDTIFIH 265 DQ +LN YN Y +F +H Sbjct: 153 LEHGSFDGGDQGLLNQFFKGWRDKPPAFRLPFIYNMTSGAIYTYAAAFKK-YGAQVKIVH 211 Query: 266 YIGPTKPWHDWAWDYPVS---QAFMEAKN 291 ++GP KPW S + Sbjct: 212 FLGPVKPWQQSTDSVHYSEHLDYWWSLFK 240 >UniRef50_UPI0000F2E03D PREDICTED: similar to glycogenin 2, n=2 Tax=Amniota RepID=UPI0000F2E03D Length = 585 Score = 140 bits (354), Expect = 5e-32, Method: Composition-based stats. Identities = 46/315 (14%), Positives = 94/315 (29%), Gaps = 60/315 (19%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIY 90 + T+ + G + S+ + +L + T R + + I++ Sbjct: 141 VTLATNDVYCQGALVLGHSLKNHKITRKLVI-LITPQVSSLLRTVLYKV---FDEVIEVS 196 Query: 91 LINGDRLRSLPSTKNWTH-AIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFP 149 L + L K + + K +++DAD + I+ L + Sbjct: 197 LEDSTDYVHLALLKRPELGITFTKLHCWTL--THYSKCVFMDADTMVLCNIDELFDRE-- 252 Query: 150 DDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNE 209 +++ G D FNSG + + + ++ Sbjct: 253 --ELSAAPDSGWPDC-----------------FNSGVFVFRPS-LETHNL------LMQH 286 Query: 210 PEIIKKITHPDQDVLNMLLADK-----LIFADIKYNTQFSLNYQLKESFINPVTNDTIFI 264 DQ +LN ++ YN S Y + +F D + Sbjct: 287 AVKHGSFDGADQGLLNSFFSNWATSDIHKHLPFLYNLSSSSMYTYRPAFKR-FGWDAKVV 345 Query: 265 HYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKKHRYLK 324 H++GP+KPWH + + + + L S H + K Sbjct: 346 HFLGPSKPWH------------YKYNRETG-------SVISESSLSESQHHASFLGLWWK 386 Query: 325 GFSNYLFYFIEKIKH 339 + + F +K++H Sbjct: 387 IYDENIVPFFDKLQH 401 >UniRef50_Q9R062 Glycogenin-1 n=22 Tax=Euteleostomi RepID=GLYG_MOUSE Length = 333 Score = 140 bits (354), Expect = 6e-32, Method: Composition-based stats. Identities = 40/279 (14%), Positives = 85/279 (30%), Gaps = 55/279 (19%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIY 90 + T+ + G + +S+ ++ R+ + + D RK + + + I + Sbjct: 7 VTLTTNDAYAKGALVLGSSLKQHRTTRRMVV-LTSPQVSDSMRKVLETV---FDDVIMVD 62 Query: 91 LINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPD 150 +++ L K I + + + + K +++DAD + I+ L Sbjct: 63 VLDSGDSAHLTLMKRPELGITL-TKLHCWSLTQYSKCVFMDADTLVLSNIDDLFERE--- 118 Query: 151 DKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEP 210 +++ G D FNSG + + +L+ Sbjct: 119 -ELSAAPDPGWPDC-----------------FNSGVFVYQPS-------IETYNQLLHLA 153 Query: 211 EIIKKITHPDQDVLNMLLADK-----LIFADIKYNTQFSLNYQLKESFINPVTNDTIFIH 265 DQ +LN + YN Y +F + +H Sbjct: 154 SEQGSFDGGDQGLLNTYFSGWATTDITKHLPFVYNLSSISIYSYLPAF-KAFGKNAKVVH 212 Query: 266 YIGPTKPWH---------------DWAWDYP-VSQAFME 288 ++G TKPW+ D +P + + Sbjct: 213 FLGRTKPWNYTYNPQTKSVNCDSQDPTVSHPEFLNLWWD 251 >UniRef50_Q68SS4 Putative glycogenin protein n=1 Tax=Pleurotus djamor RepID=Q68SS4_PLEDJ Length = 1190 Score = 140 bits (353), Expect = 8e-32, Method: Composition-based stats. Identities = 45/287 (15%), Positives = 93/287 (32%), Gaps = 41/287 (14%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCF----HIFTDYFGDDDRKYFDALALQYKTR 86 + T +L G +A++ ++ S + F + + K ++ Sbjct: 8 VTLVTSDPYLPGALALVAALNDVHKASDIPFDTVCLVTPETVDVASIKLLRK---AFRLV 64 Query: 87 IKIYLINGDRLRSLPSTKNWTHAI-YFRFVIADYFINKAPKVLYLDADIICQGTIEPLIN 145 + I LI L + + + K+++LDAD++ + L + Sbjct: 65 VGIELIVQPDPSGLNLLGRPDLDTVLTKLHVFRLV--QYSKIIFLDADVLPIRPLSHLFS 122 Query: 146 FSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIA 205 + + V G D FNSG L+++ + + Sbjct: 123 LP---HEFSAVPDVGWPDI-----------------FNSGVLVLSPGE-------DKFTQ 155 Query: 206 MLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESFINPVTNDTIFIH 265 + + DQ +LN D YNT + Y ++ + IH Sbjct: 156 LNQLLKSKGSWDGGDQGILNEWRGDDWNRLSFTYNTTPTAAYTYAPAY-ERYGSQISAIH 214 Query: 266 YIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYS 312 +IGP KPW + ++ + + + ++ P ++RY Sbjct: 215 FIGPNKPWKAYDYN-SLVDRWFSVYDKH--YRAPIVVPKTPFEVRYY 258 >UniRef50_A4SAB5 Predicted protein (Fragment) n=1 Tax=Ostreococcus lucimarinus CCE9901 RepID=A4SAB5_OSTLU Length = 259 Score = 140 bits (352), Expect = 8e-32, Method: Composition-based stats. Identities = 58/260 (22%), Positives = 101/260 (38%), Gaps = 20/260 (7%) Query: 29 LDIAYGTDKNFLFGCGISIASILKYN-EGSRLCFHIFT--DYFGDDDRKYFDALALQYKT 85 + IA+ D LF G I+S+L R+ FHIFT D D + + + Sbjct: 3 VHIAFACDPTQLFTLGPVISSVLSATASPHRIRFHIFTARDALTDASVQ-LNCYSRAIPF 61 Query: 86 RIKIYLINGDRLRSLPSTKNWTHA------IYFRFVIADYFINKAPKVLYLDADIICQGT 139 +++ + D +R+ + + Y RF A+ + KV+YLD DII +G Sbjct: 62 IWELHEFSKDMIRANITVHSRKEWRLQNAFNYARFYFAEIL-SDVQKVVYLDTDIIVKGD 120 Query: 140 IEPLINFSFPDDKVAMVV------TEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQ 193 I L + + +++ G + A FN+G LLI+ Sbjct: 121 ICRLHDANLRSSSTSVIAAVKRSVPLGSLLNFSNAAVKSSGLREKMHSFNAGVLLIDLES 180 Query: 194 WAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKESF 253 W +++++ L + K +H Q L ++ D +N K+ Sbjct: 181 WRRKRITSTVETWLKMNSVSKLYSHGSQPPLLLVFGDSFESIPSHWNV---DGVGYKKGL 237 Query: 254 INPVTNDTIFIHYIGPTKPW 273 V N+ +H+ G +KPW Sbjct: 238 RASVLNEARVLHWSGQSKPW 257 >UniRef50_B3RM47 Putative uncharacterized protein n=1 Tax=Trichoplax adhaerens RepID=B3RM47_TRIAD Length = 1504 Score = 140 bits (352), Expect = 9e-32, Method: Composition-based stats. Identities = 30/248 (12%), Positives = 74/248 (29%), Gaps = 26/248 (10%) Query: 26 NLCLDIA-YGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYK 84 N ++I + + I + S+LK+ + + F ++ + + +A Y Sbjct: 1227 NETINIFTVASGHLYERFLRIMMLSVLKHTKN-PVKFWFLKNFLSPNFKDSIPVMAKNYN 1285 Query: 85 TRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLI 144 + R + K Y + F K++++DAD I + ++ L+ Sbjct: 1286 FGYEYVQYKWPRWLRQQTEKQRVIWGYKILFLDVLFPLGIKKIIFVDADQIVRTDLKELM 1345 Query: 145 NFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAI 204 + A + G S +++ ++ R Sbjct: 1346 DLDLEGAPYAYTPFCDS---------RKEMDGF------SALYVVDLKRFRLLAAGDRLR 1390 Query: 205 -AMLNEPEIIKKITHPDQDVLNMLLAD-KLIFADIKYNTQFSLNYQLKESF-------IN 255 + + DQD+ N ++ + + + ++ N Sbjct: 1391 GQYQGLSADPNSLANLDQDLPNNMIHQVPIKSLPQDWLWCETWCSDGSKATAKTIDMCNN 1450 Query: 256 PVTNDTIF 263 P+T + Sbjct: 1451 PLTKEPKL 1458 >UniRef50_A4IXE1 Glycosyl transferase, family 8 n=16 Tax=Francisella RepID=A4IXE1_FRATW Length = 296 Score = 140 bits (352), Expect = 9e-32, Method: Composition-based stats. Identities = 56/307 (18%), Positives = 111/307 (36%), Gaps = 31/307 (10%) Query: 27 LCLDIAYGTDKNFLFGCGISIASILKY-NEGSRLCFHIFTDYFGDDDRKYFDALALQYKT 85 + I + DKN + G ++I S++ + N + +++ F+++ + K Sbjct: 2 NKIPIVFTFDKNIILGGAVTIKSLIDHANPDTCYDIYVYHPNINKKSISAFNSMIEKTKH 61 Query: 86 RIKIYLINGDRLRSLPS-TKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLI 144 I + ++ + +P T+ ++R +I + KV+Y D D++ Q + + Sbjct: 62 SISFHNVDESIFKDVPIDTRRGWIITFYRLLIPKLLP-QYDKVIYSDVDVLFQSDMSEVY 120 Query: 145 NFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAI 204 N + A V+ E + + + GF+++NT +R Sbjct: 121 NTDLTSYEWAGVIAEKHQQNMVQHKYFK--ENNNSYIYWPGFMVMNTKLMRENNFISRCF 178 Query: 205 AMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNY---------------QL 249 + E ++ D DVLN+ K+ KY T S+ Y Sbjct: 179 DTM--HEFNTRLKFRDLDVLNLTCR-KIKSLPFKYVTLQSIYYLNTIQEAPEYIFLKEIY 235 Query: 250 KESFINPVTNDTIFIHYIG-PTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQ 308 ++ + N+ IHY G P KPW P ++E + P L K + Sbjct: 236 SDNELLDAKNNPAIIHYAGSPGKPWR---MKRPYKN-YLEYISKIP---KELRKYTFRDI 288 Query: 309 LRYSAKH 315 + Sbjct: 289 KKKLLSK 295 >UniRef50_UPI000194B82A PREDICTED: similar to glycogenin 2 n=1 Tax=Taeniopygia guttata RepID=UPI000194B82A Length = 441 Score = 139 bits (351), Expect = 1e-31, Method: Composition-based stats. Identities = 37/249 (14%), Positives = 74/249 (29%), Gaps = 39/249 (15%) Query: 31 IAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRIKIY 90 + TD + G + S+ + +L + T R ++ + + Sbjct: 118 VTLATDDVYCQGALVLGQSLRNHKTSRKLAV-LITPEVSSGMRSVLSSVFDEVVEVDVLD 176 Query: 91 LINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFSFPD 150 + L + + + + K +++DAD + ++ L + Sbjct: 177 SADSVHLALMQRPE--LGVTFTKLHCWTL--THYSKCVFMDADTLVLCNVDELFDRE--- 229 Query: 151 DKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAMLNEP 210 + + G D FNSG + + + + E Sbjct: 230 -EFSAAPDSGWPDC-----------------FNSGVFVFQPSL----KTYNLLLQFAAEH 267 Query: 211 EIIKKITHPDQDVLNMLLADKL-----IFADIKYNTQFSLNYQLKESFINPVTNDTIFIH 265 DQ +LN ++ YN S Y +F N D +H Sbjct: 268 ---GSFDGGDQGLLNSFFSNWATADIGKHLPFLYNLSSSSVYTYVPAF-NHFGRDAKVVH 323 Query: 266 YIGPTKPWH 274 ++G TKPW+ Sbjct: 324 FLGATKPWN 332 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.313 0.138 0.386 Lambda K H 0.267 0.0426 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,937,627,617 Number of Sequences: 3077464 Number of extensions: 81073484 Number of successful extensions: 259686 Number of sequences better than 1.0e-01: 250 Number of HSP's better than 0.1 without gapping: 804 Number of HSP's successfully gapped in prelim test: 992 Number of HSP's that attempted gapping in prelim test: 254255 Number of HSP's gapped (non-prelim): 2131 length of query: 339 length of database: 1,040,396,356 effective HSP length: 129 effective length of query: 210 effective length of database: 643,403,500 effective search space: 135114735000 effective search space used: 135114735000 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.4 bits) S2: 93 (40.4 bits)