BLASTP 2.2.22 [Sep-27-2009]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Reference for composition-based statistics starting in round 2:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin,
John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Query= batch____
         (338 letters)

Database: uniref50.fasta
           3,077,464 sequences; 1,040,396,356 total letters

Searching..................................................done

Results from round 1

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

UniRef50_P27129 Lipopolysaccharide 1,2-glucosyltransferase n=44 ...   698   0.0
UniRef50_D0KD54 Lipopolysaccharide glucosyltransferase I n=2 Tax...   310   6e-83
UniRef50_B6XJW1 Putative uncharacterized protein n=1 Tax=Provide...   286   5e-76
UniRef50_Q9ZIS6 UDP-galactose:(Glucosyl) LPS alpha1,2-galactosyl...   278   2e-73
UniRef50_UPI00019F16C6 hypothetical protein CATC2_20202 n=1 Tax=...   265   2e-69
UniRef50_A8ARL6 Putative uncharacterized protein n=3 Tax=Citroba...   263   7e-69
UniRef50_D2U322 Lipopolysaccharide 1,2-glucosyltransferase n=1 T...   263   8e-69
UniRef50_B2PV91 Putative uncharacterized protein n=2 Tax=Enterob...   256   1e-66
UniRef50_Q9ZIT6 UDP-glucose:(Galactosyl) LPS alpha1,2-glucosyltr...   253   8e-66
UniRef50_Q9ZIS1 UDP-galactose:(Galactosyl) LPS alpha1,2-galactos...
238 3e-61 UniRef50_Q9ZIT4 UDP-galactose:(Glucosyl) LPS alpha1,3-galactosyl... 222 1e-56 UniRef50_B6XJW2 Putative uncharacterized protein n=1 Tax=Provide... 208 2e-52 UniRef50_D0KD53 Lipopolysaccharide 3-alpha-galactosyltransferase... 207 4e-52 UniRef50_Q9R9D1 UDP-glucose:(Glucosyl) LPS alpha1,3-glucosyltran... 196 9e-49 UniRef50_D2TIX6 UDP-galactose:(Glucosyl) LPS alpha-1,3-galactosy... 195 2e-48 UniRef50_D2TY85 UDP-D-galactose:(Glucosyl)lipopolysaccharide-alp... 192 2e-47 UniRef50_UPI000197C525 UDP-D-galactose:(glucosyl) lipopolysaccha... 189 2e-46 UniRef50_P27128 Lipopolysaccharide 1,3-galactosyltransferase n=4... 186 9e-46 UniRef50_D1P7H1 Lipopolysaccharide 1,2-glucosyltransferase n=1 T... 174 5e-42 UniRef50_C5WAK3 Ybl156 protein n=2 Tax=Enterobacteriaceae RepID=... 159 9e-38 UniRef50_C1DGU7 Lipopolysaccharide 1,3-galactosyltransferase n=1... 152 2e-35 UniRef50_C3X7M2 Lipopolysaccharide 3-alpha-galactosyltransferase... 128 4e-28 UniRef50_Q64ZV2 Lipopolysaccharide 1,2-glucosyltransferase n=8 T... 122 2e-26 UniRef50_C7QL87 Glycosyl transferase family 8 n=2 Tax=Cyanothece... 121 3e-26 UniRef50_Q46Y64 Glycosyl transferase, family 8 n=1 Tax=Ralstonia... 120 5e-26 UniRef50_A8ARL4 Putative uncharacterized protein n=2 Tax=Citroba... 120 6e-26 UniRef50_UPI0001B4A0A4 putative glucosyltransferase n=1 Tax=Bact... 113 9e-24 UniRef50_UPI0001A457E5 hypothetical protein NsubN_08151 n=1 Tax=... 113 1e-23 UniRef50_Q116W1 Glycosyl transferase, family 8 n=1 Tax=Trichodes... 112 2e-23 UniRef50_C0WCJ1 Lipopolysaccharide 1,2-glucosyltransferase n=1 T... 110 7e-23 UniRef50_Q03HK5 Lipopolysaccharide biosynthesis glycosyltransfer... 106 1e-21 UniRef50_D2ELM0 Lipopolysaccharide biosynthesis glycosyltransfer... 105 2e-21 UniRef50_C6EQF4 Putative uncharacterized protein n=3 Tax=Campylo... 101 4e-20 UniRef50_UPI0001AEC697 glycosyl transferase family protein n=1 T... 100 8e-20 UniRef50_D1PTN4 Putative uncharacterized protein n=1 Tax=Prevote... 
100 1e-19 UniRef50_A5ZFA9 Putative uncharacterized protein n=1 Tax=Bactero... 97 6e-19 UniRef50_A1BHG0 Glycosyl transferase, family 8 n=2 Tax=Chlorobiu... 96 1e-18 UniRef50_B2UPJ4 Glycosyl transferase family 8 n=1 Tax=Akkermansi... 96 2e-18 UniRef50_A1XRC1 Glycosyltransferase n=1 Tax=Haemophilus ducreyi ... 96 2e-18 UniRef50_UPI000190F79C lipopolysaccharide 1,2-glucosyltransferas... 95 3e-18 UniRef50_Q3DNA2 Glycosyl transferase, family 8 n=9 Tax=Streptoco... 95 3e-18 UniRef50_B3CA80 Putative uncharacterized protein n=1 Tax=Bactero... 95 3e-18 UniRef50_C7IBC1 Glycosyl transferase family 8 n=2 Tax=Clostridiu... 95 3e-18 UniRef50_C9PNX4 Family 8 glycosyl transferase n=1 Tax=Pasteurell... 95 4e-18 UniRef50_B7AIV0 Putative uncharacterized protein n=1 Tax=Bactero... 94 6e-18 UniRef50_Q5LF36 Putative glucosyltransferase n=1 Tax=Bacteroides... 94 7e-18 UniRef50_C6I3U6 Putative uncharacterized protein n=2 Tax=Bactero... 94 8e-18 UniRef50_B3Z5I6 Glycosyltransferase family 8 n=2 Tax=Bacillus ce... 92 3e-17 UniRef50_C5ELK9 Glycosyl transferase n=3 Tax=Clostridiales RepID... 91 5e-17 UniRef50_A4ECW2 Putative uncharacterized protein n=1 Tax=Collins... 91 7e-17 UniRef50_A6LGX5 Glycosyltransferase family 8 n=1 Tax=Parabactero... 91 9e-17 UniRef50_C7IBC8 Glycosyl transferase family 8 n=1 Tax=Clostridiu... 90 1e-16 UniRef50_B1MX28 Lipopolysaccharide biosynthesis protein, LPS:gly... 89 2e-16 UniRef50_A7VNX5 Putative uncharacterized protein n=1 Tax=Clostri... 89 2e-16 UniRef50_D2RIJ7 Glycosyl transferase family 8 n=1 Tax=Acidaminoc... 89 2e-16 UniRef50_B3Q568 Putative glycosyltransferase protein n=4 Tax=Rhi... 89 3e-16 UniRef50_Q5WI33 Lipopolysaccharide glycosyltransferase n=6 Tax=F... 88 4e-16 UniRef50_C4VEI8 General stress protein A n=24 Tax=Enterococcus R... 88 4e-16 UniRef50_Q65VF6 RfaJ protein n=1 Tax=Mannheimia succiniciproduce... 88 4e-16 UniRef50_A0KQP2 Glycosyl transferase, family 8 n=2 Tax=Aeromonas... 
88 5e-16 UniRef50_C2HBB8 Family 8 glycosyltransferase n=13 Tax=Lactobacil... 87 6e-16 UniRef50_B3XL28 Glycosyl transferase family 8 n=1 Tax=Lactobacil... 87 7e-16 UniRef50_Q38VK7 Putative bifunctional glycosyl transferase, fami... 87 8e-16 UniRef50_B8ISQ5 Glycosyl transferase family 8 n=1 Tax=Methylobac... 86 2e-15 UniRef50_C5ZV11 Glycosyl transferase n=2 Tax=Helicobacter canade... 85 5e-15 UniRef50_C7HS13 Family 8 glycosyl transferase n=2 Tax=Anaerococc... 84 9e-15 UniRef50_D2RIJ4 Glycosyl transferase family 8 n=2 Tax=Acidaminoc... 82 2e-14 UniRef50_C2HBB9 Family 8 glycosyltransferase n=35 Tax=Lactobacil... 82 2e-14 UniRef50_B6G807 Putative uncharacterized protein n=2 Tax=Collins... 82 2e-14 UniRef50_Q9L6B2 Putative glycosyl transferase n=3 Tax=Pasteurell... 82 3e-14 UniRef50_C6LDU2 Glycosyl transferase, family 8 n=3 Tax=Firmicute... 82 3e-14 UniRef50_D1JY84 Putative uncharacterized protein n=1 Tax=Bactero... 81 5e-14 UniRef50_C0AA16 Lipopolysaccharide biosynthesis protein LPS:glyc... 81 5e-14 UniRef50_B9QZ95 Glycosyl transferase family 8 n=1 Tax=Labrenzia ... 81 7e-14 UniRef50_Q9L7A2 Putative glycosyl transferase n=1 Tax=Haemophilu... 80 1e-13 UniRef50_P43974 Putative glycosyltransferase HI0258 n=32 Tax=Hae... 80 1e-13 UniRef50_C0EQT1 Putative uncharacterized protein n=1 Tax=Neisser... 80 1e-13 UniRef50_Q4JZJ9 Putative glycosyl transferase n=8 Tax=Streptococ... 79 2e-13 UniRef50_B1LK07 Glycosyl transferase family 8 n=10 Tax=Enterobac... 79 2e-13 UniRef50_Q2K5X3 Galactosyltransferase protein n=13 Tax=Rhizobium... 79 2e-13 UniRef50_C0QZN2 Glycosyl transferase, family 8 n=3 Tax=Brachyspi... 79 3e-13 UniRef50_A5UC07 Aspartate-semialdehyde dehydrogenase n=7 Tax=Hae... 78 4e-13 UniRef50_B2UQN6 Glycosyl transferase family 8 n=1 Tax=Akkermansi... 78 4e-13 UniRef50_C6IB51 Glycosyltransferase n=4 Tax=Bacteroides RepID=C6... 78 5e-13 UniRef50_C1QBZ8 LPS:glycosyltransferase n=1 Tax=Brachyspira murd... 
78 5e-13 UniRef50_Q50FU8 Cj81-079 (Fragment) n=6 Tax=Campylobacter jejuni... 78 5e-13 UniRef50_B9KVD4 Glycosyl transferase, family 8 n=2 Tax=Rhodobact... 78 5e-13 UniRef50_D0D9G3 Putative general stress protein A n=1 Tax=Citrei... 77 6e-13 UniRef50_C1QEC6 LPS:glycosyltransferase n=1 Tax=Brachyspira murd... 77 1e-12 UniRef50_Q3D427 Glycosyl transferase, family 8 n=8 Tax=Streptoco... 76 2e-12 UniRef50_B6VUC8 Putative uncharacterized protein n=1 Tax=Bactero... 76 2e-12 UniRef50_C5S494 Putative glycosyl transferase n=2 Tax=Actinobaci... 75 3e-12 UniRef50_B2ISC2 Glycosyl transferase, family 8 n=2 Tax=Streptoco... 75 4e-12 UniRef50_B8G232 Glycosyl transferase family 8 n=2 Tax=Desulfitob... 75 4e-12 UniRef50_B7KM20 Glycosyl transferase family 8 n=1 Tax=Cyanothece... 75 4e-12 UniRef50_Q3DNS6 Glycosyl transferase, family 8 n=5 Tax=Streptoco... 74 6e-12 UniRef50_A0ZYL4 Putative uncharacterized protein n=1 Tax=Archaea... 74 8e-12 UniRef50_C0Z4I4 Putative uncharacterized protein gspA n=1 Tax=Br... 74 9e-12 UniRef50_B1Y723 Glycosyl transferase family 8 n=1 Tax=Leptothrix... 74 1e-11 UniRef50_C3PWZ8 Lipopolysaccharide 1,2-glucosyltransferase n=1 T... 74 1e-11 UniRef50_UPI0001A45357 glycosyl transferase family protein n=1 T... 73 1e-11 UniRef50_C6IJ37 General stress protein A n=2 Tax=Bacteroides Rep... 73 2e-11 UniRef50_C2KV37 Lipopolysaccharide biosynthesis protein, LPS:gly... 73 2e-11 UniRef50_C6X2V2 Putative glycosyltransferase n=1 Tax=Flavobacter... 72 3e-11 UniRef50_Q4HGS8 General stress protein A, putative n=2 Tax=Campy... 72 3e-11 UniRef50_C9RWX3 Glycosyl transferase family 8 n=8 Tax=Bacillacea... 72 3e-11 UniRef50_Q3D426 Glycosyl transferase, family 8 n=7 Tax=Streptoco... 72 4e-11 UniRef50_P25148 General stress protein A n=3 Tax=Bacillus subtil... 71 7e-11 UniRef50_C1IBL0 Glycosyl transferase n=1 Tax=Clostridium sp. 7_2... 71 7e-11 UniRef50_B2UQ54 Glycosyl transferase family 8 n=1 Tax=Akkermansi... 
71 7e-11 UniRef50_UPI00016B2258 glycosyl transferase, family 8 n=1 Tax=ca... 71 7e-11 UniRef50_D2QX94 Glycosyl transferase family 8 n=1 Tax=Pirellula ... 69 2e-10 UniRef50_A5VK24 Glycosyl transferase, family 8 n=18 Tax=Lactobac... 69 2e-10 UniRef50_C6DEN1 Glycosyl transferase family 8 n=1 Tax=Pectobacte... 69 2e-10 UniRef50_B3WD33 WbbM protein n=8 Tax=Lactobacillus RepID=B3WD33_... 69 3e-10 UniRef50_B7GNT4 Glycosyl transferase, family 8 n=5 Tax=Bifidobac... 68 4e-10 UniRef50_A7A7B4 Putative uncharacterized protein n=2 Tax=Bifidob... 68 5e-10 UniRef50_C6DEN2 Glycosyl transferase family 8 n=1 Tax=Pectobacte... 68 5e-10 UniRef50_A7H2M2 Glycosyl transferase family 8 n=3 Tax=Campylobac... 67 6e-10 UniRef50_A5LNA9 Glycosyl transferase, family 8 n=2 Tax=Streptoco... 67 9e-10 UniRef50_A4WWT5 Glycosyl transferase, family 8 n=1 Tax=Rhodobact... 67 9e-10 UniRef50_A3VFX3 Glycosyltransferase n=1 Tax=Rhodobacterales bact... 67 9e-10 UniRef50_C7RG54 Glycosyl transferase family 8 n=1 Tax=Anaerococc... 67 1e-09 UniRef50_D1N145 Glycosyl transferase family 8 n=1 Tax=Victivalli... 67 1e-09 UniRef50_B4RJG6 LgtC n=29 Tax=Neisseria RepID=B4RJG6_NEIG2 67 1e-09 UniRef50_UPI000196958D hypothetical protein BACCELL_01586 n=1 Ta... 66 2e-09 UniRef50_D2MYR1 Putative uncharacterized protein n=1 Tax=Campylo... 66 2e-09 UniRef50_D2QX95 Glycosyl transferase family 8 n=1 Tax=Pirellula ... 66 2e-09 UniRef50_C0BAU3 Putative uncharacterized protein n=1 Tax=Coproco... 65 2e-09 UniRef50_C1QC03 LPS:glycosyltransferase n=1 Tax=Brachyspira murd... 65 2e-09 UniRef50_D1Y7U2 Glycosyltransferase, family 8 n=1 Tax=Pyramidoba... 65 3e-09 UniRef50_A3CM53 Glycosyltransferase, putative n=9 Tax=Bacteria R... 65 3e-09 UniRef50_A8FNA2 Putative uncharacterized protein n=2 Tax=Campylo... 65 3e-09 UniRef50_C5EZG9 Glycosyl transferase family protein n=1 Tax=Heli... 65 3e-09 UniRef50_C3XN62 Glycosyl transferase n=1 Tax=Helicobacter wingha... 65 3e-09 UniRef50_B9BAZ6 Glycosyl transferase family 8 protein n=1 Tax=Bu... 
65 5e-09 UniRef50_C7TIE0 Glycosyl transferase, group 8 n=2 Tax=Lactobacil... 64 5e-09 UniRef50_Q046Z9 Lipopolysaccharide biosynthesis glycosyltransfer... 64 5e-09 UniRef50_B7AUG6 Putative uncharacterized protein n=1 Tax=Bactero... 64 7e-09 UniRef50_B7C7N8 Putative uncharacterized protein n=2 Tax=Firmicu... 64 7e-09 UniRef50_C4Z1V1 Putative uncharacterized protein n=1 Tax=Eubacte... 64 7e-09 UniRef50_C1CFZ1 Glycosyl transferase, family 8 n=17 Tax=Streptoc... 64 8e-09 UniRef50_A2RLV8 Putative glycosyltransferase n=1 Tax=Lactococcus... 64 9e-09 UniRef50_C5SH34 Glycosyl transferase family 8 n=1 Tax=Asticcacau... 63 1e-08 UniRef50_C8W7U9 Glycosyl transferase family 8 n=2 Tax=Atopobium ... 63 2e-08 UniRef50_A4UX76 Putative LPS biosynthesis protein n=2 Tax=Lactob... 62 2e-08 UniRef50_C8N7M8 Putative uncharacterized protein n=1 Tax=Cardiob... 62 2e-08 UniRef50_B1I7M9 Glycosyl transferase, family 8 n=16 Tax=Streptoc... 62 2e-08 UniRef50_C3XGD2 Putative uncharacterized protein n=1 Tax=Helicob... 62 3e-08 UniRef50_Q5M3K9 Glycosyl transferase n=4 Tax=Streptococcus RepID... 62 3e-08 UniRef50_A4VVV8 Lipopolysaccharide biosynthesis proteins, LPS:gl... 61 4e-08 UniRef50_Q38VG7 Putative glycosyl transferase, family 8 n=1 Tax=... 61 5e-08 UniRef50_C1QC00 LPS:glycosyltransferase n=1 Tax=Brachyspira murd... 61 6e-08 UniRef50_B2ISC6 Glycosyl transferase, family 2/glycosyl transfer... 61 6e-08 UniRef50_C5S3F7 Putative glycosyl transferase n=2 Tax=Actinobaci... 60 2e-07 UniRef50_Q0I2Z7 Lipopolysaccharide biosynthesis protein, glycosy... 59 2e-07 UniRef50_D2LIH7 Glycosyl transferase family 8 n=1 Tax=Rhodomicro... 59 2e-07 UniRef50_A3V3C9 Putative uncharacterized protein n=1 Tax=Loktane... 59 3e-07 UniRef50_A3YS36 Putative sugar transferase n=2 Tax=Campylobacter... 59 3e-07 UniRef50_B0NR59 Putative uncharacterized protein n=1 Tax=Bactero... 59 3e-07 UniRef50_B6GCA0 Putative uncharacterized protein n=1 Tax=Collins... 
58 4e-07 UniRef50_Q1RIL1 Lipopolysaccharide 1,2-glucosyltransferase RfaJ ... 58 4e-07 UniRef50_UPI0001B55E75 hypothetical protein SSPB78_11600 n=1 Tax... 58 6e-07 UniRef50_C7XX93 Glycosyl transferase, family 8 n=1 Tax=Lactobaci... 57 8e-07 UniRef50_A5EVI8 Glycosyl transferase family 8 protein n=1 Tax=Di... 57 1e-06 UniRef50_B8PIH6 Predicted protein n=2 Tax=Agaricomycetes RepID=B... 57 1e-06 UniRef50_C4ZG45 Glycosyltransferase Family 2 modular protein n=1... 57 1e-06 UniRef50_Q062P6 DNA mismatch repair protein n=1 Tax=Synechococcu... 57 1e-06 UniRef50_Q17VR5 Lipopolysaccharide biosynthesis protein n=19 Tax... 56 2e-06 UniRef50_UPI0001693121 general stress protein n=1 Tax=Paenibacil... 56 2e-06 UniRef50_B1I7N1 Glycosyl transferase, family 8 n=8 Tax=Streptoco... 56 2e-06 UniRef50_O48684 F3I6.10 protein n=46 Tax=Embryophyta RepID=O4868... 54 8e-06 UniRef50_C0X9Z8 Glycosyltransferase n=1 Tax=Lactobacillus gasser... 54 8e-06 UniRef50_B9KUH7 Lipopolysaccharide 3-alpha-galactosyltransferase... 53 2e-05 UniRef50_C3XKY2 Glycosyl transferase n=2 Tax=Campylobacterales R... 53 2e-05 UniRef50_B9ADW8 Putative uncharacterized protein n=1 Tax=Methano... 52 2e-05 UniRef50_C0X9Z7 Putative uncharacterized protein n=2 Tax=Lactoba... 52 3e-05 UniRef50_Q2RB54 Glycosyl transferase family 8 protein, expressed... 52 3e-05 UniRef50_A8LP95 Lipopolysaccharide 1 n=1 Tax=Dinoroseobacter shi... 52 4e-05 UniRef50_D0IR33 LPS 1,2-glycosyltransferase n=3 Tax=Helicobacter... 52 4e-05 UniRef50_C6DEN3 Glycosyl transferase family 8 n=1 Tax=Pectobacte... 52 4e-05 UniRef50_A9SH80 Predicted protein n=2 Tax=Physcomitrella patens ... 52 4e-05 UniRef50_Q3DM64 Glycosyl transferase, family 8, degenerate n=6 T... 51 6e-05 UniRef50_C2LRU0 Glycosyl transferase, family 8 n=1 Tax=Streptoco... 50 8e-05 UniRef50_B6ACJ0 Glycosyl transferase family 8 protein, putative ... 50 9e-05 UniRef50_A4SAB5 Predicted protein (Fragment) n=1 Tax=Ostreococcu... 
50 1e-04 UniRef50_C8WAA9 Glycosyl transferase family 8 n=2 Tax=Atopobium ... 50 1e-04 UniRef50_B3WD32 Glycosyl transferase n=9 Tax=Lactobacillus RepID... 50 1e-04 UniRef50_A8AY72 Glycosyl transferase, family 8 SP1766 n=7 Tax=St... 50 1e-04 UniRef50_B3JN71 Putative uncharacterized protein n=4 Tax=Bactero... 49 2e-04 UniRef50_Q92VQ2 Putative lipopolysaccharide 1,3-galactosyltransf... 49 2e-04 UniRef50_B4WN64 Glycosyl transferase family 8 n=1 Tax=Synechococ... 48 4e-04 UniRef50_UPI0000587C70 PREDICTED: similar to MGC81998 protein n=... 48 4e-04 UniRef50_Q68CQ7 Glycosyltransferase 8 domain-containing protein ... 48 5e-04 UniRef50_B9HMR5 Glycosyltransferase, CAZy family GT8 n=25 Tax=Ma... 47 7e-04 UniRef50_Q02ZT6 Lipopolysaccharide biosynthesis glycosyltransfer... 47 9e-04 UniRef50_Q1CSY7 Lipopolysaccharide 1,2-glycosyltransferase n=3 T... 47 0.001 UniRef50_A1VG39 Glycosyl transferase, family 8 n=1 Tax=Desulfovi... 47 0.001 UniRef50_A7H2X4 Glycosyl transferase family 8 n=2 Tax=Campylobac... 47 0.001 UniRef50_A2DBB6 Putative uncharacterized protein n=1 Tax=Trichom... 47 0.001 UniRef50_Q16CW9 Lipopolysaccharide 1,3-galactosyltransferase, pu... 46 0.001 UniRef50_A9UXT0 Predicted protein (Fragment) n=1 Tax=Monosiga br... 46 0.002 UniRef50_A5DLS6 Putative uncharacterized protein n=2 Tax=Pichia ... 46 0.002 UniRef50_P91854 Protein F26H9.8, partially confirmed by transcri... 46 0.002 UniRef50_UPI000197AD97 hypothetical protein BACCOPRO_03221 n=1 T... 46 0.002 UniRef50_Q39T65 Glycosyl transferase, family 8 n=1 Tax=Geobacter... 46 0.002 UniRef50_C7PRU3 Glycosyl transferase family 8 n=1 Tax=Chitinopha... 45 0.004 UniRef50_B7PBG6 Glycosyltransferase domain-containing protein, p... 45 0.004 UniRef50_C3XFW0 Glycosyl transferase n=1 Tax=Helicobacter bilis ... 45 0.005 UniRef50_C5ZVZ7 Putative glycosyltransferase n=3 Tax=Campylobact... 45 0.005 UniRef50_A4UX79 LPS biosynthesis protein n=2 Tax=Lactobacillacea... 
44 0.006 UniRef50_C2ETF1 Putative uncharacterized protein n=1 Tax=Lactoba... 43 0.013 UniRef50_Q04CN2 Lipopolysaccharide biosynthesis glycosyltransfer... 43 0.014 UniRef50_A2E3L1 Glycosyl transferase family 8 protein n=2 Tax=Tr... 43 0.014 UniRef50_Q04CN3 Lipopolysaccharide biosynthesis glycosyltransfer... 43 0.017 UniRef50_Q1CUZ8 Lipopolysaccharide 1,2-glycosyltransferase n=12 ... 42 0.024 UniRef50_A4IXE1 Glycosyl transferase, family 8 n=16 Tax=Francise... 42 0.031 UniRef50_A8W7G8 Glycosyl transferase family protein (Fragment) n... 42 0.033 UniRef50_UPI0001925360 PREDICTED: similar to glycosyltransferase... 42 0.039 UniRef50_UPI00016C0890 glycosyl transferase family 8 protein n=2... 42 0.039 UniRef50_B3XPR8 Glycosyl transferase family 8 n=4 Tax=Lactobacil... 42 0.041 UniRef50_Q8LF94 Avr9/Cf-9 rapidly elicited protein 231 n=13 Tax=... 42 0.042 UniRef50_Q9H1C3 Glycosyltransferase 8 domain-containing protein ... 42 0.046 UniRef50_A3LQ29 Glycogenin glucosyltransferase n=3 Tax=Saccharom... 41 0.048 UniRef50_A2DXT6 Glycosyl transferase family 8 protein n=1 Tax=Tr... 41 0.058 UniRef50_Q5Z7P2 Os06g0727300 protein n=5 Tax=Magnoliophyta RepID... 41 0.065 UniRef50_C8Q828 Glycosyl transferase family 8 n=4 Tax=Enterobact... 41 0.069 UniRef50_A5BZU1 Putative uncharacterized protein n=1 Tax=Vitis v... 41 0.072 UniRef50_Q9LE59 Like glycosyl transferase 1 n=35 Tax=Embryophyta... 41 0.078 >UniRef50_P27129 Lipopolysaccharide 1,2-glucosyltransferase n=44 Tax=Enterobacteriaceae RepID=RFAJ_ECOLI Length = 338 Score = 698 bits (1801), Expect = 0.0, Method: Compositional matrix adjust. 
Identities = 338/338 (100%), Positives = 338/338 (100%) Query: 1 MDSFPAIEIDKVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDF 60 MDSFPAIEIDKVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDF Sbjct: 1 MDSFPAIEIDKVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDF 60 Query: 61 YIIADVYNDGFFQKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLG 120 YIIADVYNDGFFQKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLG Sbjct: 61 YIIADVYNDGFFQKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLG 120 Query: 121 LTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFN 180 LTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFN Sbjct: 121 LTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFN 180 Query: 181 SGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTI 240 SGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTI Sbjct: 181 SGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTI 240 Query: 241 KSELKDKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRD 300 KSELKDKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRD Sbjct: 241 KSELKDKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRD 300 Query: 301 AKSIIEFKKRYKHLLVQHHYISGIIAGVCYLCRKYYRK 338 AKSIIEFKKRYKHLLVQHHYISGIIAGVCYLCRKYYRK Sbjct: 301 AKSIIEFKKRYKHLLVQHHYISGIIAGVCYLCRKYYRK 338 >UniRef50_D0KD54 Lipopolysaccharide glucosyltransferase I n=2 Tax=Pectobacterium RepID=D0KD54_PECWW Length = 336 Score = 310 bits (793), Expect = 6e-83, Method: Compositional matrix adjust. 
Identities = 167/331 (50%), Positives = 224/331 (67%), Gaps = 1/331 (0%) Query: 8 EIDKVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVY 67 ID + ++ R +I + LNVAYG+D NY G GVSITSI++NN I+ F++ +D + Sbjct: 6 HIDVLSVFEKRHQSIADHDTLNVAYGIDKNYAVGCGVSITSILINNS-IDFTFHVFSDDF 64 Query: 68 NDGFFQKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLL 127 +D F +KI+ LAE+ + +I LY+IN++ L+ LPCT +WS AMYFRL AF L LL Sbjct: 65 DDDFIKKISILAEKFKTKIILYKINSEMLKTLPCTDIWSHAMYFRLLAFSHLSDKTSSLL 124 Query: 128 YLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLD 187 YLDADV+CKG + QL L VAAV++DV MQ+K+ SRL L G+YFNSGV++ + Sbjct: 125 YLDADVMCKGSLEQLHKLNTAPHVAAVIRDVPEMQKKSASRLKMAALEGEYFNSGVLFAN 184 Query: 188 LKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDK 247 L W LT+K L + +YPDQD+MN+LL G FLP+EYNTIY+IK+ELKD Sbjct: 185 LDIWNKLDLTQKIFDKLRDGEESIQYPDQDIMNILLNGNVTFLPKEYNTIYSIKNELKDS 244 Query: 248 THQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEF 307 HQ YK++I + T+LIHYTG TKPWHKWA YPS Y++ A ENSPW +DA + +E Sbjct: 245 NHQKYKEVIKDDTILIHYTGVTKPWHKWANYPSTSYFQHAQENSPWSTSDLKDADTFVEM 304 Query: 308 KKRYKHLLVQHHYISGIIAGVCYLCRKYYRK 338 KK+YKHLL + Y+SG+I+ Y KY +K Sbjct: 305 KKKYKHLLKKGKYLSGLISAFKYSLNKYIKK 335 >UniRef50_B6XJW1 Putative uncharacterized protein n=1 Tax=Providencia alcalifaciens DSM 30120 RepID=B6XJW1_9ENTR Length = 325 Score = 286 bits (733), Expect = 5e-76, Method: Compositional matrix adjust. 
Identities = 143/316 (45%), Positives = 201/316 (63%), Gaps = 6/316 (1%) Query: 18 RLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAK 77 L N + LN+AYGVD +L G G+S+ SI++NN I L F++ D ND F K+ K Sbjct: 9 ELGAQNGAAELNIAYGVDKGFLFGSGLSMNSIIINNSDIKLKFHLFTDYMNDEFLSKLEK 68 Query: 78 LAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKG 137 L + I +Y IN D+L+ LP + VWS A YFR F F L TL +LYLDADV CKG Sbjct: 69 LTLNENVNIDIYIINADELKKLPISHVWSYATYFRFFIFDHLCETLSSILYLDADVFCKG 128 Query: 138 DISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLT 197 + + + + NG AAV+ DV MQ V RLS P++ +YFN+GV++L+LK W K T Sbjct: 129 SLRKYIDIAFNGEYAAVIPDVPNMQISCVDRLSMPQIKDKYFNAGVIFLNLKVWDKNKFT 188 Query: 198 EKALSILMSKDN--VYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKL 255 ++A +++ + KY DQD +N++ ++LPR+YN IYT+K+EL+ H+NYK Sbjct: 189 KQAFNLITNNHTGKTLKYLDQDALNIIFNCQNIYLPRDYNCIYTLKNELE---HENYKDY 245 Query: 256 ITESTLLIHYTGATKPWHKWAI-YPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHL 314 IT T LIHYTGATKPWH WA+ YP+ + +K+A E SPWK+D DAK E+++RYKH Sbjct: 246 ITSETKLIHYTGATKPWHYWAVNYPASQTFKVAFETSPWKNDELVDAKKKPEYQERYKHE 305 Query: 315 LVQHHYISGIIAGVCY 330 Q +++GI + + Y Sbjct: 306 FNQKKFLTGISSLIKY 321 >UniRef50_Q9ZIS6 UDP-galactose:(Glucosyl) LPS alpha1,2-galactosyltransferase WaaT n=26 Tax=Enterobacteriaceae RepID=Q9ZIS6_ECOLX Length = 331 Score = 278 bits (711), Expect = 2e-73, Method: Compositional matrix adjust. 
Identities = 133/309 (43%), Positives = 197/309 (63%), Gaps = 2/309 (0%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 LNV+YG+D N+L G GVSI+S+++NN IN F++ D +D + + + A+Q I Sbjct: 23 LNVSYGIDKNFLYGAGVSISSVLINNSDINFVFHVFTDYVDDDYLKSFNETAKQFNTSII 82 Query: 88 LYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGL 147 +Y I+ LP +Q WS A YFR+ +F+ L ++ LLYLDADVVCKG + L + Sbjct: 83 VYLIDPKYFADLPTSQFWSYATYFRVLSFEYLSESISTLLYLDADVVCKGSLKPLTEIIF 142 Query: 148 NGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILM-- 205 AAV+ D + Q RL+ PE+ G+YFN+GV+Y++LKKW +A LT L +L Sbjct: 143 KDEFAAVIPDNDSTQAACAKRLNIPEMNGRYFNAGVIYVNLKKWHEANLTPYLLKLLRGE 202 Query: 206 SKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHY 265 +K KY DQD +N+ ++L ++++TIYT+K+EL D++H+ Y++ IT+ T+LIHY Sbjct: 203 TKYGSLKYLDQDALNIAFNMNNIYLAKDFDTIYTLKNELYDRSHRKYQQTITDKTVLIHY 262 Query: 266 TGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHLLVQHHYISGII 325 TG TKPWH WA YPS Y+ IA E SPWK ++A+++ E +K+YKHL YI GI Sbjct: 263 TGITKPWHSWAGYPSASYFNIAREQSPWKKYPLKEARTVAEMQKQYKHLFAHGEYIKGIT 322 Query: 326 AGVCYLCRK 334 + + Y +K Sbjct: 323 SLIKYKLKK 331 >UniRef50_UPI00019F16C6 hypothetical protein CATC2_20202 n=1 Tax=Citrobacter youngae ATCC 29220 RepID=UPI00019F16C6 Length = 330 Score = 265 bits (677), Expect = 2e-69, Method: Compositional matrix adjust. 
Identities = 130/319 (40%), Positives = 195/319 (61%), Gaps = 6/319 (1%) Query: 16 DFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKI 75 +F A LN+A+GVD N++ G +S+TS++L+N+ +N+ F++ D + + Q++ Sbjct: 14 EFNQAPSEHKTQLNIAWGVDKNFMFGAAISMTSVLLHNKDLNIHFHLFTDYIDADYQQRV 73 Query: 76 AKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVC 135 AKLAEQ I++Y ++ + L+ LP WS AMYFR AF+ LG +D LLY+DADV+C Sbjct: 74 AKLAEQFATNISIYIMDANGLKVLPSGNAWSHAMYFRFIAFEYLGEKVDSLLYIDADVMC 133 Query: 136 KGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAK 195 KG + +L + L VAAV+ DV+ + + + D YFNSGV++ +LKKW + Sbjct: 134 KGSLYELTQIDLGEHVAAVITDVDDSPARDIEKNKD------YFNSGVIFANLKKWKEQN 187 Query: 196 LTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKL 255 A IL+ K+N +PDQDV+N+L +FL R +N IY IK ELK K YK+ Sbjct: 188 FINSAFDILLDKNNKLSFPDQDVLNILFLKKVIFLERRFNAIYGIKQELKSKDTSKYKEY 247 Query: 256 ITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHLL 315 IT T+LIHY G TKPW+ WA YPS +Y+ A ++SPW D A++ ++KK+ +H Sbjct: 248 ITPETILIHYIGVTKPWNSWANYPSAQYFVEAWKSSPWADVPLLPARTPKQYKKKSRHER 307 Query: 316 VQHHYISGIIAGVCYLCRK 334 +Q Y + I+ + YL K Sbjct: 308 LQGKYFASAISYIGYLWAK 326 >UniRef50_A8ARL6 Putative uncharacterized protein n=3 Tax=Citrobacter RepID=A8ARL6_CITK8 Length = 339 Score = 263 bits (672), Expect = 7e-69, Method: Compositional matrix adjust. 
Identities = 125/310 (40%), Positives = 190/310 (61%) Query: 25 SECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQL 84 S+ LN+AYGVD N+L G G+S+TS+++NN I++ FY++ D +D + + + +L + Sbjct: 23 SKKLNIAYGVDRNFLFGSGISMTSVLVNNPDIDIHFYVVTDYVDDEYLESVERLTQMYGT 82 Query: 85 RITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLH 144 +T+ + + + LP T+ W+ AMY+R FAF+ L LD +LYLDAD+VCK + +L Sbjct: 83 TVTVLVFDNEAFRKLPSTKAWTYAMYYRYFAFEYLSRELDSVLYLDADIVCKNSLRELTD 142 Query: 145 LGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSIL 204 + G AAVV D++ ++ K+ RL PEL YFNSGVV+ +L W + KL KA +L Sbjct: 143 IHFAGEYAAVVNDIDRVRLKSGQRLGIPELARDYFNSGVVFANLHVWREKKLLSKAFEVL 202 Query: 205 MSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIH 264 + Y DQD++N+L G + L R++N IY + ELK+K Y+ ITEST+LIH Sbjct: 203 HERQKELLYFDQDILNILFVGHVILLRRDFNCIYGVDQELKNKNEYRYQDFITESTVLIH 262 Query: 265 YTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHLLVQHHYISGI 324 Y G TKPWH WA YP KY+ A + S W + S +A + +K++ +H +Q YI I Sbjct: 263 YVGVTKPWHTWANYPVSKYFIEAYKKSAWAEKSLLNANTAKLYKRKSRHERIQRKYIRSI 322 Query: 325 IAGVCYLCRK 334 + + Y+ K Sbjct: 323 FSHIMYIKNK 332 >UniRef50_D2U322 Lipopolysaccharide 1,2-glucosyltransferase n=1 Tax=Arsenophonus nasoniae RepID=D2U322_9ENTR Length = 334 Score = 263 bits (671), Expect = 8e-69, Method: Compositional matrix adjust. 
Identities = 123/306 (40%), Positives = 195/306 (63%), Gaps = 3/306 (0%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 N+AYGVD N+L G +SI S+++NN + +F++ D +DG+ Q+ + + I Sbjct: 24 FNIAYGVDKNFLLGAAISINSVLINNTDTDFNFHLFTDYIDDGYIQRFQTMIAKYNSNII 83 Query: 88 LYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGL 147 +Y ++ +L+ L + WS A YFRL AF+ L + +LYLDADV+CKG + ++ L L Sbjct: 84 IYLLDAAELKQLSTSDFWSYATYFRLIAFEYLSTNIHAILYLDADVICKGSLKEIFQLNL 143 Query: 148 NGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSK 207 + AAVV DV+ MQ+ + +RL+ +L G+YFN+GV+Y++L+KW + ++K+L ++ K Sbjct: 144 ADSFAAVVLDVDSMQQSSATRLNLADLNGKYFNAGVIYVNLQKWIENDFSKKSLELVRGK 203 Query: 208 DNV--YKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHY 265 N KY DQD +N+L + ++L R+YN IY +K+EL YK IT+ST+LIHY Sbjct: 204 TNFGKLKYLDQDALNILFQTQNIYLSRDYNCIYKLKNELAYHDLSKYKNTITDSTILIHY 263 Query: 266 TGATKPWHKWAI-YPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHLLVQHHYISGI 324 TG TKPWH W I YP+ +++ + +SPWKD + A+ E +++YKHL +QH Y+ G Sbjct: 264 TGVTKPWHTWGINYPASQFFFNSYIHSPWKDQPLKMAEKRTELQEKYKHLFLQHKYMQGF 323 Query: 325 IAGVCY 330 + + Y Sbjct: 324 LCLIKY 329 >UniRef50_B2PV91 Putative uncharacterized protein n=2 Tax=Enterobacteriaceae RepID=B2PV91_PROST Length = 342 Score = 256 bits (653), Expect = 1e-66, Method: Compositional matrix adjust. 
Identities = 130/310 (41%), Positives = 192/310 (61%), Gaps = 3/310 (0%) Query: 27 CLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRI 86 CL+V YG D NY G GVS S+++NN F+ D + F +K+ +A Q Q+ Sbjct: 24 CLDVIYGSDENYQFGAGVSAVSLLINNPTTFFRFHYFLDKVSPDFLEKLKVIASQFQVEF 83 Query: 87 TLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLG 146 +Y ++ L+ LP + VWS AMYFRL A L D LYLDADV+C G + +L Sbjct: 84 HVYELDNKLLKTLPASDVWSSAMYFRLVALDYLSSDYDFALYLDADVMCNGILDLTTNL- 142 Query: 147 LNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMS 206 + V VV D ++ K+ +RL P L YFNSGV++++LKKW + ++T++ +L + Sbjct: 143 IKDKVCGVVADDIGVRTKSETRLHAPSLAKTYFNSGVMFVNLKKWHEKQITQQCFELLSA 202 Query: 207 KD--NVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIH 264 ++ YKYPDQDV+N++L+ L + +NT+YT+K+EL D THQ Y+++IT T+LIH Sbjct: 203 ENAKQRYKYPDQDVLNLILREDLELLSQRFNTVYTLKNELYDSTHQKYQQVITPETVLIH 262 Query: 265 YTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHLLVQHHYISGI 324 YTG +KPWH WA YP+ + + AL SPW + + A +E KK YKHLL Q +Y++GI Sbjct: 263 YTGVSKPWHTWANYPASQPFYKALMQSPWTTNDLKPATKFVERKKEYKHLLKQGNYLAGI 322 Query: 325 IAGVCYLCRK 334 ++G+ Y K Sbjct: 323 LSGIRYSFEK 332 >UniRef50_Q9ZIT6 UDP-glucose:(Galactosyl) LPS alpha1,2-glucosyltransferase WaaJ n=26 Tax=Enterobacteriaceae RepID=Q9ZIT6_ECOLX Length = 339 Score = 253 bits (645), Expect = 8e-66, Method: Compositional matrix adjust. 
Identities = 131/326 (40%), Positives = 198/326 (60%), Gaps = 8/326 (2%) Query: 7 IEIDKVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADV 66 IE+DK R ++ E NV++G+D NY G +SI SI+ NN+ F+IIAD Sbjct: 15 IELDK------RPVKLDERETFNVSWGIDENYQVGAAISIASILENNKQNKFTFHIIADY 68 Query: 67 YNDGFFQKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRL 126 + + + +++LA + Q I LY I+++ L+ LP + +W ++Y+RL +F LD L Sbjct: 69 LDKEYIELLSQLATKYQTVIKLYLIDSEPLKALPQSNIWPVSIYYRLLSFDYFSARLDSL 128 Query: 127 LYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYL 186 LYLDAD+VCKG +++L+ L AVV DV+ MQ K+ RL + + G YFNSGV+Y+ Sbjct: 129 LYLDADIVCKGSLNELIALEFKDEYGAVVIDVDAMQSKSAERLCNEDFNGSYFNSGVMYI 188 Query: 187 DLKKWADAKLTEKALSILMSKDNV--YKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSEL 244 +L++W +LTEK +L + + KYPDQD++N++ LPR+YN IYTIKSE Sbjct: 189 NLREWLKQRLTEKFFDLLSDESIIKKLKYPDQDILNLMFLHHAKILPRKYNCIYTIKSEF 248 Query: 245 KDKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSI 304 ++K + Y + I + T+ IHYTG TKPWH WA Y S Y++ SPW++ + A Sbjct: 249 EEKNSEYYTRFINDDTVFIHYTGITKPWHDWANYASADYFRNIYNISPWRNIPYKKAVKK 308 Query: 305 IEFKKRYKHLLVQHHYISGIIAGVCY 330 E K++YKHLL Q ++ G+ + Y Sbjct: 309 HEHKEKYKHLLYQKKFLDGVFTAIKY 334 >UniRef50_Q9ZIS1 UDP-galactose:(Galactosyl) LPS alpha1,2-galactosyltransferase WaaW n=29 Tax=Enterobacteriaceae RepID=Q9ZIS1_ECOLX Length = 342 Score = 238 bits (606), Expect = 3e-61, Method: Compositional matrix adjust. 
Identities = 123/320 (38%), Positives = 191/320 (59%), Gaps = 4/320 (1%) Query: 23 NTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQN 82 NT LN+AYG+D N+L G VS+ S+V++N + + F++ D ++ + Q++ +N Sbjct: 19 NTDRVLNIAYGIDRNFLFGAAVSMQSVVMHNPDLAVKFHLFTDYIDEDYLQRVNAFTSKN 78 Query: 83 -QLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQ 141 + + +Y++++ + P + WS A +FRL AFQ L T++ LLY+DADV+CKG ++ Sbjct: 79 ANVEVRIYKVSSAFIDIFPSLKQWSYATFFRLVAFQYLSETIENLLYIDADVICKGSLAG 138 Query: 142 LLHLGLNG-AVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKA 200 LL + +G AAV+KDV MQEK RL+ L G YFN+GVVYL L+ WA KA Sbjct: 139 LLDINFDGDKFAAVIKDVPFMQEKPAKRLAIEGLPGNYFNAGVVYLQLEAWAKNDFMNKA 198 Query: 201 LSILMS--KDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITE 258 +++L S + YK DQD++N+L G +F+ +Y+ Y I ELK+K+ ++YKK IT+ Sbjct: 199 IAMLASDPQHTKYKCLDQDILNILFFGHCIFISGDYDCFYGIDYELKNKSDEDYKKTITD 258 Query: 259 STLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHLLVQH 318 T LIHY G TKPW+ W YP KY+ A + S W D + A + +++ + +HL Sbjct: 259 DTKLIHYVGVTKPWNDWTNYPCQKYFNEAYQASCWNDVAFIPATNEKQYQVKSRHLKRNG 318 Query: 319 HYISGIIAGVCYLCRKYYRK 338 + S + Y +K RK Sbjct: 319 NIASSFYYFMLYYSKKIARK 338 >UniRef50_Q9ZIT4 UDP-galactose:(Glucosyl) LPS alpha1,3-galactosyltransferase WaaI n=26 Tax=Enterobacteriaceae RepID=Q9ZIT4_ECOLX Length = 335 Score = 222 bits (566), Expect = 1e-56, Method: Compositional matrix adjust. 
Identities = 129/329 (39%), Positives = 191/329 (58%), Gaps = 15/329 (4%) Query: 15 WDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQK 74 ++F NI + L++A+G+D N+L G GV+I SI+LNNR I+ +F++ D +D Sbjct: 14 YNFHYQNIRSKNTLDIAFGIDRNFLFGCGVAIASILLNNREISCEFHVFTDYISDKDKLY 73 Query: 75 IAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVV 134 + LA+Q RI +Y IN DKL+ LP T+ W+ A YFR +++LYLDAD+ Sbjct: 74 FSDLAKQYNSRINIYVINCDKLKSLPSTKNWTYATYFRFIIADYFYHKHEKILYLDADIA 133 Query: 135 CKGDISQLLHLGLN-GAVAAVV--KDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKW 191 CKG I +LL + +AAVV +DVE Q +A S L+ P+L YFN+G + +++ +W Sbjct: 134 CKGSIKELLDYQFSTNEIAAVVAERDVEWWQNRA-SVLTTPQLASGYFNAGFLLINIDEW 192 Query: 192 ADAKLTEKALSILMSKDNVYK--YPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTH 249 ++ KA+ +L D V K + DQDV+NVLL G F+ +YNT Y+I ELKDK Sbjct: 193 NLNNISSKAIEMLRDPDWVSKITHLDQDVLNVLLNGKVKFISEKYNTRYSINYELKDKV- 251 Query: 250 QNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKK 309 + + T+ IHY G TKPWH+WA YP + + IA SPW + + +++ Sbjct: 252 ---DNPVNDDTVFIHYVGPTKPWHEWANYPVSRSFLIAKAASPWSKEDLLKPVNSNQYRY 308 Query: 310 RYKHLLVQHHYISGIIAGVCYLCRKYYRK 338 KH Q HY++GI YL KYY++ Sbjct: 309 CAKHKFKQKHYMAGIFN---YL--KYYKE 332 >UniRef50_B6XJW2 Putative uncharacterized protein n=1 Tax=Providencia alcalifaciens DSM 30120 RepID=B6XJW2_9ENTR Length = 333 Score = 208 bits (530), Expect = 2e-52, Method: Compositional matrix adjust. 
Identities = 123/320 (38%), Positives = 186/320 (58%), Gaps = 14/320 (4%) Query: 22 INTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQ 81 I+ S C +VAYG+D N+L G GVSI S++++N HI F+I D N + IAK AE Sbjct: 20 IDDSSCQHVAYGIDHNFLYGSGVSIVSLLMHNPHIQFAFHIFID--NSMSDEDIAKFAEI 77 Query: 82 NQL---RITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGD 138 L +IT+Y I+++ ++ LP T+ W+ A+YFR + +D LLYLDADVVC + Sbjct: 78 CHLYNTKITIYFIDSNNVKKLPTTKNWTHAIYFRFIIAEYFKDKIDYLLYLDADVVCNRN 137 Query: 139 ISQLLHLGLNGAVAAVV--KDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKL 196 I +LL L G +AAVV +D Q++A S L P + YFNSGV+Y++L+ W + Sbjct: 138 IDELLSHNLLGYIAAVVPERDKAWWQKRADS-LGFPSVSKGYFNSGVMYINLRTWKTNNV 196 Query: 197 TEKALSILMSKDNVYK--YPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKK 254 TEK++++LM + ++ YPDQDV+N+LL LF+ +NT +++ ELK +++ Sbjct: 197 TEKSMALLMDNEVSHRLVYPDQDVLNILLTDSVLFISSIFNTQFSLNYELK----KSFDF 252 Query: 255 LITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHL 314 + +T+ IHY G TKPWH+WA Y + + + A SPW++ AKS + KH Sbjct: 253 PVKRTTVFIHYVGPTKPWHEWANYETAQPFLEARAVSPWRNVPLLKAKSSNHLRYCAKHN 312 Query: 315 LVQHHYISGIIAGVCYLCRK 334 + Q Y + Y K Sbjct: 313 INQRKYFFAFKNYIAYFFSK 332 >UniRef50_D0KD53 Lipopolysaccharide 3-alpha-galactosyltransferase n=3 Tax=Enterobacteriaceae RepID=D0KD53_PECWW Length = 336 Score = 207 bits (527), Expect = 4e-52, Method: Compositional matrix adjust. 
Identities = 112/313 (35%), Positives = 177/313 (56%), Gaps = 4/313 (1%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 L++A+G D ++ G ++I SI+L N L F++ D +DG + ++AEQ I Sbjct: 24 LDIAFGTDEKFIYGCAIAIASILLKNPDYCLSFHVFTDKLSDGDKARFQEMAEQYNTTIN 83 Query: 88 LYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGL 147 +Y ++ L+ LP T++WS A+YFR LD++LYLDAD++C G + +L+ L L Sbjct: 84 IYIVDCSWLKTLPETKLWSYAIYFRFIIADYFYKILDKVLYLDADIICNGSLQELIKLDL 143 Query: 148 NGAVAAVVKDVEPMQEK-AVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMS 206 + ++AVV D + K + PEL YFNSGV+ +++ W A +TE ++ +L Sbjct: 144 SNHISAVVLDGDSNWWKNRAQKFQQPELSNGYFNSGVLLIEVNNWHQAAVTENSMRLLTD 203 Query: 207 KD--NVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIH 264 + + +PDQDV+NVLL G + + +YNT ++I ELK ++ I+ T+ IH Sbjct: 204 PEMKKIITHPDQDVLNVLLAGKSCHIESKYNTQFSINYELKYSYGESAPTPISNKTIFIH 263 Query: 265 YTGATKPWHKWAI-YPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHLLVQHHYISG 323 Y G TKPWHKWA Y KY+ A E+SPWK++S DA + + KH I G Sbjct: 264 YIGPTKPWHKWAANYACTKYFLKAKEHSPWKNESLLDAVTASNMRYCAKHQFHNGEIIRG 323 Query: 324 IIAGVCYLCRKYY 336 ++ + YL +K + Sbjct: 324 TLSFLKYLYKKAF 336 >UniRef50_Q9R9D1 UDP-glucose:(Glucosyl) LPS alpha1,3-glucosyltransferase WaaO n=29 Tax=Enterobacteriaceae RepID=Q9R9D1_ECOLX Length = 338 Score = 196 bits (498), Expect = 9e-49, Method: Compositional matrix adjust. 
Identities = 122/333 (36%), Positives = 191/333 (57%), Gaps = 14/333 (4%) Query: 5 PAIEIDKVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIA 64 P I+K +D R A + + +VAYG+D N+L G GVSITS++L+N ++ F++ Sbjct: 8 PQEMINKTIIFDERPA-ASVASSFHVAYGIDKNFLFGCGVSITSVLLHNSDVSFVFHVFI 66 Query: 65 DVYNDGFFQKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLD 124 D + Q++A+LA+ + I ++ +N ++L+ LP T+ WS AMYFR D Sbjct: 67 DDIPEADIQRLAQLAKSYRTCIQIHLVNCERLKALPTTKNWSIAMYFRFVIADYFIDQQD 126 Query: 125 RLLYLDADVVCKGDISQLLHLGLNGAVAAVV--KDVEPMQEKAVSRLSDPELLGQYFNSG 182 ++LYLDAD+ C+G++ L+ + L VAAVV +D + S L EL YFNSG Sbjct: 127 KILYLDADIACQGNLKPLITMDLANNVAAVVTERDANWWSLRGQS-LQCNELEKGYFNSG 185 Query: 183 VVYLDLKKWADAKLTEKALSILMSKDNVYK--YPDQDVMNVLLKGMTLFLPREYNTIYTI 240 V+ ++ WA ++ KA+S+L K V + Y DQD++N++L G F+ +YNT +++ Sbjct: 186 VLLINTLAWAQESVSAKAMSMLADKAIVSRLTYMDQDILNLILLGKVKFIDAKYNTQFSL 245 Query: 241 KSELKDKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRD 300 ELK +++ I + T+LIHY G TKPWH WA YPS + + A E SPWK++ Sbjct: 246 NYELK----KSFVCPINDETVLIHYVGPTKPWHYWAGYPSAQPFIKAKEASPWKNEPLM- 300 Query: 301 AKSIIEFKKRY--KHLLVQHHYISGIIAGVCYL 331 + + RY KH Q+ I+GI+ + Y Sbjct: 301 -RPVNSNYARYCAKHNFKQNKPINGIMNYIYYF 332 >UniRef50_D2TIX6 UDP-galactose:(Glucosyl) LPS alpha-1,3-galactosyltransferase n=1 Tax=Citrobacter rodentium ICC168 RepID=D2TIX6_CITRO Length = 340 Score = 195 bits (496), Expect = 2e-48, Method: Compositional matrix adjust. 
Identities = 117/325 (36%), Positives = 184/325 (56%), Gaps = 11/325 (3%) Query: 16 DFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKI 75 DF + L++AYGVD N+L G G+SI S++ NN L F++ D +N+ + Sbjct: 17 DFNHQDTAEKVVLDIAYGVDQNFLFGCGISIASVLKNNTDKTLHFHVFIDAFNETDRRMF 76 Query: 76 AKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRL-FAFQLLGLTLDRLLYLDADVV 134 KLA Q + IT+Y IN + L+ LP T+ W+ A+YFR A +G T ++LLYLDAD++ Sbjct: 77 DKLAAQYKTHITIYLINCEHLRSLPSTKNWTYAIYFRFAIADYFIGKT-NKLLYLDADII 135 Query: 135 CKGDISQLLHLGL-NGAVAAVVKDVEP-MQEKAVSRLSDPELLGQYFNSGVVYLDLKKWA 192 C+G I +L++ + +AAVV + + EK L + YFNSG++ ++L +WA Sbjct: 136 CQGGIDELVNFSFASDKIAAVVTEGKADWWEKRALSLGTEGITKGYFNSGLILINLNQWA 195 Query: 193 DAKLTEKALSILMSKDNVYK--YPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQ 250 ++ +A+ +L D V + +PDQDV+N+LL FL ++NT +++ +LKDK Sbjct: 196 IECISARAIKMLSDPDIVGRITHPDQDVLNILLADKLHFLDIKFNTQFSLNYQLKDK--- 252 Query: 251 NYKKLITESTLLIHYTGATKPWHKWAI-YPSVKYYKIALENSPWKDDSPRDAKSIIEFKK 309 + + T+LIHY G TKPWH WA Y K + A + SPWK+ + + +F+ Sbjct: 253 -FINPVNNDTILIHYIGPTKPWHSWAGDYLISKPFIDAKQASPWKNTALLKPTNSNQFRY 311 Query: 310 RYKHLLVQHHYISGIIAGVCYLCRK 334 KH+L YI G++ Y +K Sbjct: 312 CAKHMLKNKRYIKGMVGYFLYFMKK 336 >UniRef50_D2TY85 UDP-D-galactose:(Glucosyl)lipopolysaccharide-alpha-1, 3-D-galactosyltransferase n=1 Tax=Arsenophonus nasoniae RepID=D2TY85_9ENTR Length = 343 Score = 192 bits (487), Expect = 2e-47, Method: Compositional matrix adjust. 
Identities = 109/326 (33%), Positives = 177/326 (54%), Gaps = 8/326 (2%) Query: 12 VKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGF 71 + +++F A+ T + ++AYG D N+ G +SI S++ N+ FYI D ++ Sbjct: 19 LTSYEFSSADAKTPQ-FHIAYGADKNFSLGTAISICSMLYFNKIYTFHFYIFTDTISECD 77 Query: 72 FQKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDA 131 +K +L +IT+ I+T +L+ LP ++WS A+YFR +++LYLD+ Sbjct: 78 LKKFDELTSCYNTKITILLIDTLQLKKLPTNKLWSHAIYFRFIIANYFHNKTNKILYLDS 137 Query: 132 DVVCKGDISQLLHLGLNGAVAAVVKDVEP-MQEKAVSRLSDPELLGQYFNSGVVYLDLKK 190 D++C GDIS+L + LN + A V D + + +K L+ PE+ YFNSGV+ +D K Sbjct: 138 DIICSGDISELFDIDLNQHIIAAVADRDQYLWKKRAEMLATPEIANGYFNSGVMLIDTDK 197 Query: 191 WADAKLTEKALSILMSKDNVYKYP--DQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKT 248 W K+TEK ++IL+ K+ DQD +N+ L LFL +++NT ++I ELK+KT Sbjct: 198 WHKNKITEKTINILLDDKTKAKFVFYDQDALNISLVNQVLFLDKKFNTQFSINYELKNKT 257 Query: 249 HQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFK 308 I + IHY G TKPW+ W+ YPS + +NSPWK A + +++ Sbjct: 258 LFP----IINNVKFIHYIGPTKPWNIWSEYPSTHLFMTIKKNSPWKTTPLIAASTSNQYR 313 Query: 309 KRYKHLLVQHHYISGIIAGVCYLCRK 334 KH+ + YI ++ + Y K Sbjct: 314 YAAKHMFNKKKYIYWLLNYLYYFVNK 339 >UniRef50_UPI000197C525 UDP-D-galactose:(glucosyl) lipopolysaccharide-alpha-1,3-D-galactosyltransferase n=1 Tax=Providencia rettgeri DSM 1131 RepID=UPI000197C525 Length = 339 Score = 189 bits (479), Expect = 2e-46, Method: Compositional matrix adjust. 
Identities = 111/311 (35%), Positives = 166/311 (53%), Gaps = 9/311 (2%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 NVAYG D N+L G GVSI S++LNN+ IN F++ D +D Q +++++Q + +T Sbjct: 28 FNVAYGADKNFLFGTGVSIVSVLLNNKDINFHFHVFTDFLSDKDIQLFSQISKQYKTSVT 87 Query: 88 LYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGL 147 L+ +N D L+ LP QVWS A+YFRL D++LYLD+DVVC G I L L L Sbjct: 88 LHTLNMDILKKLPTNQVWSHAIYFRLIIADYFYKKCDKVLYLDSDVVCTGSIQILKSLNL 147 Query: 148 NGAVAAVVKDVEPMQEKAVSRLSDPELLGQ-YFNSGVVYLDLKKWADAKLTEKALSILMS 206 + A V D+ ++ L + E + + YFNSGV+ ++ +W +LTEK++S+ Sbjct: 148 SSMPIAAVMDISEPHSVEMANLFNVEGIKKGYFNSGVMLINPDEWNYRQLTEKSMSVFTD 207 Query: 207 K--DNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIH 264 K V KY DQD +N+ + G L L +N + K K + + + + +H Sbjct: 208 KKLQPVIKYYDQDAINIAVHGDWLKLDNIFNHRINLNDRYKHKKNND-----ISNAVFVH 262 Query: 265 YTGATKPWHKWA-IYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHLLVQHHYISG 323 + G+TKPWH W+ Y V+ + A E SPWKD ++I K KH + Y+S Sbjct: 263 FIGSTKPWHNWSKYYHEVRCFLNAKEKSPWKDIDLMTPQNITHHKYASKHFRYKEKYLSS 322 Query: 324 IIAGVCYLCRK 334 V Y K Sbjct: 323 FYHYVLYTILK 333 >UniRef50_P27128 Lipopolysaccharide 1,3-galactosyltransferase n=43 Tax=Enterobacteriaceae RepID=RFAI_ECOLI Length = 339 Score = 186 bits (472), Expect = 9e-46, Method: Compositional matrix adjust. 
Identities = 107/313 (34%), Positives = 171/313 (54%), Gaps = 9/313 (2%) Query: 27 CLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRI 86 CL++AYG D N+L G G+SI SI+ N L F+I D + D + LA Q + RI Sbjct: 28 CLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDRKYFDALALQYKTRI 87 Query: 87 TLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLG 146 +Y IN D+L+ LP T+ W+ A+YFR ++LYLDAD++C+G I L++ Sbjct: 88 KIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDADIICQGTIEPLINFS 147 Query: 147 L-NGAVAAVVKDVEP-MQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSIL 204 + VA VV + + EK L + YFNSG + ++ +WA +++ +A+++L Sbjct: 148 FPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQWAAQQVSARAIAML 207 Query: 205 MSKDNVYK--YPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLL 262 + + K +PDQDV+N+LL +F +YNT +++ +LK +++ +T T+ Sbjct: 208 NEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLK----ESFINPVTNDTIF 263 Query: 263 IHYTGATKPWHKWAI-YPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHLLVQHHYI 321 IHY G TKPWH WA YP + + A SPWK+ + + + + KH+L +H Y+ Sbjct: 264 IHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQLRYSAKHMLKKHRYL 323 Query: 322 SGIIAGVCYLCRK 334 G + Y K Sbjct: 324 KGFSNYLFYFIEK 336 >UniRef50_D1P7H1 Lipopolysaccharide 1,2-glucosyltransferase n=1 Tax=Providencia rustigianii DSM 4541 RepID=D1P7H1_9ENTR Length = 324 Score = 174 bits (440), Expect = 5e-42, Method: Compositional matrix adjust. 
Identities = 101/314 (32%), Positives = 169/314 (53%), Gaps = 17/314 (5%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIAD-VYNDGFFQKIAKLAEQNQLRI 86 ++ YGVD +L GVG SI S++LNN+ + F+I D + ++ F++ + +I Sbjct: 24 FHIGYGVDEKFLYGVGTSIASVMLNNKDTDFHFHIFVDNLPDENLFREAVQGTSH---KI 80 Query: 87 TLYRINTDKLQCLPC-TQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHL 145 T+Y I+ +K + LP ++ WS A+YFRL L ++D LLYLDAD++CKGD+S+L L Sbjct: 81 TIYFIDNEKFKLLPLPSKAWSHAIYFRLLIISYLSSSIDSLLYLDADIICKGDLSELKAL 140 Query: 146 GLNGAVAAVVKDVEPMQEKAVSRLSD-PELLGQYFNSGVVYLDLKKWADAKLTEKALSIL 204 + V +++K S + P + +YFNSG +Y+ LK A + + + ++ Sbjct: 141 TFDEKTF-----VYAVKDKFCSEKQNLPIDMSKYFNSGFLYMSLKHLAQENIPNRVIELV 195 Query: 205 MSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIH 264 D + +PDQD +NVLL + + YN ++++ + K H I +S + IH Sbjct: 196 EKND--FSHPDQDALNVLLNDKLINISENYNYMFSLDWYITSKGHL---AKIPDSVVFIH 250 Query: 265 YTGATKPWHKWA-IYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHLLVQHHYISG 323 + G TKP+H+WA Y KY + A +NSPWK+ + + ++ HL Y+ Sbjct: 251 FVGLTKPFHEWASFYEEYKYLESARKNSPWKNIPLLKPEGYKQLSRKKSHLRKNGKYVEF 310 Query: 324 IIAGVCYLCRKYYR 337 I + YL +K + Sbjct: 311 IFTTIQYLMKKTFH 324 >UniRef50_C5WAK3 Ybl156 protein n=2 Tax=Enterobacteriaceae RepID=C5WAK3_ECOBB Length = 163 Score = 159 bits (403), Expect = 9e-38, Method: Compositional matrix adjust. 
Identities = 74/161 (45%), Positives = 110/161 (68%), Gaps = 2/161 (1%) Query: 176 GQYFNSGVVYLDLKKWADAKLTEKALSILM--SKDNVYKYPDQDVMNVLLKGMTLFLPRE 233 G+YFN+GV+Y++LKKW +A LT L +L +K KY DQD +N+ ++L ++ Sbjct: 3 GRYFNAGVIYVNLKKWHEANLTPYLLKLLRGETKYGSLKYLDQDALNIAFNMNNIYLAKD 62 Query: 234 YNTIYTIKSELKDKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPW 293 ++TIYT+K+EL D++H+ Y++ IT+ T+LIHYTG TKPWH WA YPS Y+ IA E SPW Sbjct: 63 FDTIYTLKNELHDRSHRKYQQTITDKTVLIHYTGITKPWHSWAGYPSASYFNIAREQSPW 122 Query: 294 KDDSPRDAKSIIEFKKRYKHLLVQHHYISGIIAGVCYLCRK 334 K ++A+++ E +K+YKHL YI GI + + Y +K Sbjct: 123 KKYPLKEARTVAEMQKQYKHLFAHGEYIKGITSLIKYKLKK 163 >UniRef50_C1DGU7 Lipopolysaccharide 1,3-galactosyltransferase n=1 Tax=Azotobacter vinelandii DJ RepID=C1DGU7_AZOVD Length = 326 Score = 152 bits (384), Expect = 2e-35, Method: Compositional matrix adjust. Identities = 97/323 (30%), Positives = 161/323 (49%), Gaps = 18/323 (5%) Query: 24 TSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQ 83 S+ L++A+GVD NYL +G++I SI+ NN + L F++ + ++ +L Sbjct: 8 NSDVLHIAFGVDENYLRPMGITIVSIIENNPGLELVFHVFISSISSASRVRLDRLERMFA 67 Query: 84 LRITLYRINT-----DKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGD 138 + L+ ++ D + S+A Y RL + L DR+LYLDAD++C GD Sbjct: 68 RPVNLHLVDEMLDVKDPASGKGQAHI-SKAAYIRLLIPEALRDFTDRVLYLDADILCVGD 126 Query: 139 ISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTE 198 IS LLHL ++G AAV++D + K + + L YFNSGV+Y+D+ +W + +T Sbjct: 127 ISGLLHLDIDGRTAAVIRDAG-AESKRAGLVKKGQTLDNYFNSGVLYIDIPRWIERAVTS 185 Query: 199 KALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITE 258 +AL + +Y DQD +N++L G F+ + +N Y + +LK + Sbjct: 186 RALEKIADPVLDLRYSDQDALNLVLDGDVRFIDKGWNHQYGLTGKLK---KGRVGMDVPS 242 Query: 259 STLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDD------SPRDAKSIIEFKKRYK 312 T +H+ G KPW W + S + + SPW + SPR+ F Y+ Sbjct: 243 DTKFVHFIGPMKPWRSWNPHQSKELFLRYQALSPWAGEALDDNFSPREIYVYSRFM--YR 300 Query: 313 HLLVQHHYISGIIAGVCYLCRKY 335 + Q ++SG+I +L RK+ Sbjct: 301 SMFQQGRWLSGLIWYGKFLHRKH 323 >UniRef50_C3X7M2 
Lipopolysaccharide 3-alpha-galactosyltransferase n=1 Tax=Oxalobacter formigenes OXCC13 RepID=C3X7M2_OXAFO Length = 307 Score = 128 bits (321), Expect = 4e-28, Method: Compositional matrix adjust. Identities = 91/313 (29%), Positives = 161/313 (51%), Gaps = 16/313 (5%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 ++A+GVD Y + V+I SI+ NN++ N+ F++I + +D +I K Q I Sbjct: 5 FHIAFGVDTIYAPKMCVTIASILENNKNSNIIFHVIYNDLSDKVIDEIKKSMLTLQAEIN 64 Query: 88 LYRINTDKLQCLPCTQVWSR---AMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLH 144 + I+ D L P +S + R F +LL DR LYLDAD++C +IS L H Sbjct: 65 FHFIDVD-LSIFPKFSNFSHITSGAFLRFFIPELLQGLTDRALYLDADIICINNISDLFH 123 Query: 145 LGLN-GAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSI 203 L ++ + AVV+D++ E ++ + +YFNSGV+ +D++KW + + LS+ Sbjct: 124 LEMDENEILAVVEDID--SETYLNE--NASFQKRYFNSGVLMMDIEKWNKNNVYGQLLSV 179 Query: 204 LMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLI 263 L K + + DQD +N+++ +L +N Y I +E DK + Y + E+ I Sbjct: 180 LNEKGSGFNLIDQDALNLVMIDKVHYLDNIWN--YMINAEQLDKKKEKYS--VPENAKFI 235 Query: 264 HYTGATKPWHKWAIYPSVK-YYKIALENSPWKDDSPRDAKSIIEFKKRYKHLLVQHHYIS 322 H+ G KPWH + I+ + Y + + W D K+ E ++ ++ + +Y++ Sbjct: 236 HFVGPVKPWHCYNIFDDITGLYLNYQKKTVW--DGLEMPKNYKEMRRYARYSFKKGNYLT 293 Query: 323 GIIAGVCYLCRKY 335 G+ G+ Y+ K+ Sbjct: 294 GLNWGMRYIKTKF 306 >UniRef50_Q64ZV2 Lipopolysaccharide 1,2-glucosyltransferase n=8 Tax=Bacteroides RepID=Q64ZV2_BACFR Length = 311 Score = 122 bits (305), Expect = 2e-26, Method: Compositional matrix adjust. 
Identities = 88/307 (28%), Positives = 153/307 (49%), Gaps = 22/307 (7%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 +++A +D+N+ V++TS+ NNR+ +IIA + + ++ +AE +I Sbjct: 2 IHIACNIDSNFTIHCAVTLTSLFANNRNSEFCVHIIASTLPEADQKALSSIAESYGNKIC 61 Query: 88 LYRINTDKLQCLPCTQVWSR---AMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLH 144 Y D L + +R A Y+R ++L + +D++LY+D D+V DIS+ Sbjct: 62 FYFPEKDLLNNFSIKKSGNRISIATYYRCLLSRILPVNIDKILYIDCDIVVLNDISEFWD 121 Query: 145 LGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSIL 204 + ++D+ +E+ SRL + YFN+GV+ ++LK W + K+ E Sbjct: 122 TDITQYAIGCIEDIGSDEEEYYSRLQYDKKYS-YFNAGVLLINLKYWREHKIDEMCEQYF 180 Query: 205 MSKDNVYKYPDQDVMNVLLKGMTLFLPREYN---TIYTIKSELKDKTHQNYKKLITESTL 261 ++ + ++ DQD++N LL LF+P +N T Y K K H K+ + + Sbjct: 181 LAHSDRIRFNDQDLLNALLYKDKLFVPFRWNVQDTFYRRTYSHKVKEHSGLKEALLHPAI 240 Query: 262 LIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKR----YKHLLVQ 317 L HYT KPW+ +++P + Y L+ +PWK P II+F+ R +K LL Sbjct: 241 L-HYTNK-KPWNYDSMHPLKQEYFKYLDMTPWKGTRP-----IIDFQTRVITGFKRLL-- 291 Query: 318 HHYISGI 324 YI+GI Sbjct: 292 --YITGI 296 >UniRef50_C7QL87 Glycosyl transferase family 8 n=2 Tax=Cyanothece RepID=C7QL87_CYAP0 Length = 283 Score = 121 bits (304), Expect = 3e-26, Method: Compositional matrix adjust. 
Identities = 78/290 (26%), Positives = 141/290 (48%), Gaps = 21/290 (7%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 +++ + D NY GV+ITS++LNN + +++ + F +KI KL + Q + Sbjct: 1 MDILFCFDKNYEQHFGVAITSLILNNTNKIKTIHLVTKDNSKDFLKKIDKLKSKTQAKFF 60 Query: 88 LYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGL 147 +Y + L + + S A Y+RL A +LL L ++LYLD+D+V + L ++ + Sbjct: 61 IYSPDDKDLSNVKVSAHISTAAYYRLLAPELLPQDLKKILYLDSDLVVNSSLENLYNMDI 120 Query: 148 NGAVAAVVKDVEPMQEKAVSRLSDPELLGQ-YFNSGVVYLDLKKWADAKLTEKALSILMS 206 + + A + M RL +L G YFNSGV+ ++L+ W + K L Sbjct: 121 SDDILAAYAGGK-MGPGTKKRL---QLTGDFYFNSGVMLINLEAWRTENIGNKCFKFLQE 176 Query: 207 KDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYT 266 ++ + DQD +N ++ G L + +N++ + + + +T +++IH+T Sbjct: 177 NPDMIRLWDQDALNKIVDGKFLNIDGIWNSLVDLTTG---------ETRVTNQSIIIHFT 227 Query: 267 GATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHLLV 316 G KPW W I P + Y L SPW + P +F K ++ +L+ Sbjct: 228 GTLKPWQSWCIRPEKQIYWYYLRQSPWSNAYP-------QFPKNFQEMLL 270 >UniRef50_Q46Y64 Glycosyl transferase, family 8 n=1 Tax=Ralstonia eutropha JMP134 RepID=Q46Y64_RALEJ Length = 331 Score = 120 bits (302), Expect = 5e-26, Method: Compositional matrix adjust. 
Identities = 83/333 (24%), Positives = 155/333 (46%), Gaps = 37/333 (11%) Query: 23 NTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQN 82 N ++A+ VD NY +G +I SI+ NN + F+++ F + E+N Sbjct: 18 NGKPSFHIAFCVDDNYFRAMGATIASIIDNNPGQHFTFHVLT-------FSAL----EEN 66 Query: 83 QLRI----TLYRINTDK--LQCLPCTQV--------WSRAMYFRLFAFQLLGLTLDRLLY 128 Q R+ +Y ++T L TQ +S +++ RL ++L DR+LY Sbjct: 67 QRRLKQLEEMYPVSTQLHLLDLASFTQFSHFLGHSHYSLSIFTRLVIPEVLQGQTDRVLY 126 Query: 129 LDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDL 188 LDAD++C + +L+ + ++ +A VV D + V+ L +YFN GV+++++ Sbjct: 127 LDADILCVNRLDELVDMDISNEIAVVVPDAPVTLRRRVAALGLAH--AEYFNGGVLFINI 184 Query: 189 KKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKT 248 KW +T + L L+ ++ DQD +N +L G ++ +N +Y + + D Sbjct: 185 DKWLAENITPQTLEALLDTSTDMRFNDQDALNKVLNGRAKYISPRWNYLYDL---IHDLN 241 Query: 249 HQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWK----DDSPRDAKSI 304 + + IH+ G+ KPW W+ + + ++ L SPW+ D PR+ K Sbjct: 242 VNRFAMRPVGKAVFIHFAGSVKPWADWSGHEARGLFRKYLALSPWRDMPLDPEPRNTK-- 299 Query: 305 IEFKKRYKHLLVQHHYISGIIAGVCYLCRKYYR 337 E + + + QH + + + YL ++ R Sbjct: 300 -EMRMHSRFMFRQHKPVESLKWYLRYLRKRAQR 331 >UniRef50_A8ARL4 Putative uncharacterized protein n=2 Tax=Citrobacter RepID=A8ARL4_CITK8 Length = 314 Score = 120 bits (302), Expect = 6e-26, Method: Compositional matrix adjust. 
Identities = 80/280 (28%), Positives = 143/280 (51%), Gaps = 11/280 (3%) Query: 23 NTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKL-AEQ 81 N + +N+AY DANYL+ V VSI S+++NN +L F++ +D + IAKL + Sbjct: 3 NKTNVINIAYCTDANYLEYVAVSIMSVIMNNPEQSLAFFVFVYDVSD---EDIAKLQSTS 59 Query: 82 NQLR-ITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDIS 140 N+++ IT+ + + +K + +R+ Y RL +LL + R +YLDAD +C +S Sbjct: 60 NKIQVITIDKADIEKYNNDFAIKHLNRSTYMRLAVPRLLKDKVARFIYLDADTLCFDSLS 119 Query: 141 QLLHLGLNGAVAAVVKDVEPMQE-KAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEK 199 ++ + ++ V AV D + + K RL + YFN+G +Y+++ W + K Sbjct: 120 EINSVDIDNVVCAVSHDSLNIHDNKHARRLGLS--IDHYFNAGFLYINVANWIKHDIEHK 177 Query: 200 ALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITES 259 A ++L + Y DQD +N+ + G F+ +N ++ + D+ +N+ Sbjct: 178 ANTVLFEQGKSLPYFDQDALNIAMNGNITFIDNRWNFLF---NWFTDEQKENFFYHSDTL 234 Query: 260 TLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPR 299 +IH+TG KPW+K S + Y +PW++ R Sbjct: 235 PRIIHFTGGRKPWYKEHTGLSQQLYVFYHHFTPWRNAELR 274 >UniRef50_UPI0001B4A0A4 putative glucosyltransferase n=1 Tax=Bacteroides sp. 2_1_7 RepID=UPI0001B4A0A4 Length = 301 Score = 113 bits (283), Expect = 9e-24, Method: Compositional matrix adjust. 
Identities = 78/290 (26%), Positives = 141/290 (48%), Gaps = 4/290 (1%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 +++ D NY+ GV +TSI +NN + +I+ + + + + K+ + +I Sbjct: 1 MDIVCCTDNNYVIPCGVLVTSICVNNPKEEITVHILTEGISPENQEVLKKVVAKYGQQIQ 60 Query: 88 LYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGL 147 Y ++ P ++ + A YFRL +L +++++LYLD DVV + + L + Sbjct: 61 FYTVDKKVFANCPISRHITLATYFRLIMTDILPKSVEKVLYLDCDVVVRHSLRSLWDTDI 120 Query: 148 NGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSK 207 A V+ D+ + +RL LG YFN+GV+ ++L+ W + L+E I+ Sbjct: 121 KSYAAGVIPDMSIDDIRIYNRLQYSPSLG-YFNAGVLLVNLRYWRENNLSESFFEIINKY 179 Query: 208 DNVYKYPDQDVMNVLLKGMTLFLPREYNTI--YTIKSELKDKTHQNYKKLITESTLLIHY 265 +Y DQDV+N++LK + L LP +YN Y K L +T+++ ++ +++HY Sbjct: 180 PERLRYHDQDVLNIVLKEIKLTLPMKYNVQHGYFFKDPLISRTYRDEREQAITDPVILHY 239 Query: 266 TGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHLL 315 +G+ KPW P K + L+ S R K R++ LL Sbjct: 240 SGS-KPWFIEFEPPFKKDFAFYLDTSGLDKSFIRHIPMKARIKARFRSLL 288 >UniRef50_UPI0001A457E5 hypothetical protein NsubN_08151 n=1 Tax=Neisseria subflava NJ9703 RepID=UPI0001A457E5 Length = 345 Score = 113 bits (282), Expect = 1e-23, Method: Compositional matrix adjust. 
Identities = 78/279 (27%), Positives = 139/279 (49%), Gaps = 30/279 (10%) Query: 29 NVAYGVDANYLDGVGVSITSIVLNNRHINLDFYII---ADVYNDGFFQKIAKLAEQNQLR 85 ++ Y D NY+ +G ++ S++ NN + F+++ ++ Y+ F +I + + QN Sbjct: 25 HIVYAADQNYIKHIGTALLSVLQNNTS-PIHFHLLVSGSEGYDFNIFDQI-ETSNQN-YA 81 Query: 86 ITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHL 145 I++Y +NT+ L T ++ AMY+R+ LL LYLD DV+C G+I L + Sbjct: 82 ISVYHLNTEYFSTLQTTHYFTIAMYYRMSIPCLLKGITHTALYLDTDVLCLGNIDDLFEI 141 Query: 146 GLNGAVAAVVKDV----EPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADA---KLTE 198 ++ ++ A V D +++ +D E YFNSGV+ ++ KW D K+ Sbjct: 142 DISNSLIAAVPDAILYRAYIKQLNQFGFTDTE---PYFNSGVILFNIDKWNDMAIDKILS 198 Query: 199 KALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLI-- 256 + + + ++ PDQD++N+ G +L +N I+ HQ Y +LI Sbjct: 199 EKMQAVEKQNFKLSCPDQDILNLACIGHVHWLSENFNWIH---------WHQKYSELIDN 249 Query: 257 TESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKD 295 + L+H+ G KPWH+ +P+ Y +NSPW + Sbjct: 250 PNNIRLVHFVGHIKPWHQLGFHPAYDQY---FKNSPWNN 285 >UniRef50_Q116W1 Glycosyl transferase, family 8 n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q116W1_TRIEI Length = 278 Score = 112 bits (281), Expect = 2e-23, Method: Compositional matrix adjust. 
Identities = 75/267 (28%), Positives = 131/267 (49%), Gaps = 16/267 (5%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 +N+ + D NY GV+ITS++LNN + D +II + + QK+ L++ + Sbjct: 2 MNLLFCFDQNYQQHFGVAITSVLLNNLSSHFDVHIITNFMEEKLKQKLDTLSKNYKCSFH 61 Query: 88 LYRINT-DKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLG 146 LY IN DK+ L + S A Y+RL ++L +D++LYLD+DVV + +L ++ Sbjct: 62 LYIINNLDKISKLKVSDHVSNATYYRLIMAEILPKHIDKVLYLDSDVVVISPLEELYNID 121 Query: 147 LNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMS 206 L A S S + + FNSGV+ ++L+KW + +++ K + Sbjct: 122 LENYFI------------AASGFSGTLVKSKGFNSGVMVVNLEKWRNEQISTKVIDFATK 169 Query: 207 KDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYT 266 + Y DQ +N ++K L + R++N + K N + ++ +IHY Sbjct: 170 NRDKLPYHDQSALNRVIKQNYLIIDRKWNFQVDLSPRKIQKPDDN---IALKNARIIHYI 226 Query: 267 GATKPWHKWAIYPSVKYYKIALENSPW 293 G++KPW+ W Y++ L+ S W Sbjct: 227 GSSKPWYFWISDQRKNIYELYLKKSLW 253 >UniRef50_C0WCJ1 Lipopolysaccharide 1,2-glucosyltransferase n=1 Tax=Acidaminococcus sp. D21 RepID=C0WCJ1_9FIRM Length = 338 Score = 110 bits (275), Expect = 7e-23, Method: Compositional matrix adjust. 
Identities = 71/247 (28%), Positives = 115/247 (46%), Gaps = 15/247 (6%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 L+VAY V+ Y +G S+ S++ NN H + F+I D Y+ QK+ +LA++ I Sbjct: 33 LHVAYNVNDGYFQIMGASLVSVLENNAHRAVMFHIFTDGYSKENAQKMEQLADRYGCVIK 92 Query: 88 LYRINTDKLQCLPC-TQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLG 146 LY ++ + + +SR Y R+ +L D LYLDAD + + +L H Sbjct: 93 LYTLHMEPFADFHVKVERFSRITYGRIVMPLILAAETDHFLYLDADTMVIRPLDELYHWD 152 Query: 147 LNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMS 206 L G V + P ++ L G+YFN GV+ +++ +W +TEKA S+ Sbjct: 153 LTGKAMGAVSERMPDAKRRGDYLHLNN--GRYFNDGVMMVNIPEWQKQNITEKAFSLQKE 210 Query: 207 KDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYT 266 + QD++N++ G FLP YN + + + K +IH+T Sbjct: 211 PKERFLGQSQDILNIVFDGTNAFLPSIYNEFGGGEDDPQQK------------GTIIHWT 258 Query: 267 GATKPWH 273 G KPW Sbjct: 259 GRRKPWQ 265 >UniRef50_Q03HK5 Lipopolysaccharide biosynthesis glycosyltransferase n=1 Tax=Pediococcus pentosaceus ATCC 25745 RepID=Q03HK5_PEDPA Length = 549 Score = 106 bits (264), Expect = 1e-21, Method: Compositional matrix adjust. 
Identities = 73/265 (27%), Positives = 134/265 (50%), Gaps = 11/265 (4%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRI 86 +NV D NY D + ++I + + N N+ ++F ++++ ++ + KLA + Sbjct: 4 INVLLAADENYADQLQITIKTTLENLNKKTRVNFIVLSNNLSNSTKLALKKLA-HGLHTV 62 Query: 87 TLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLG-LTLDRLLYLDADVVCKGDISQLLHL 145 ++ P ++ Y+R+ A QLL +DR+LYLD D++ + D+++L Sbjct: 63 EYLDLDPSVFAFCPTNSHINKTAYYRILAPQLLAKRNIDRILYLDVDLLVRHDLTELYDA 122 Query: 146 GLNGAVAAVVKDVEPMQEKAVSRLS-DPELLGQ--YFNSGVVYLDLKKWADAKLTEKALS 202 LN + V D Q A++RL DP + YFNSG++ +D+KKW + +TEK L+ Sbjct: 123 ELNHNIVGAVIDTG--QAFALNRLGVDPVVAANNIYFNSGILVIDIKKWNENHITEKTLN 180 Query: 203 ILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITE---S 259 + + ++ + DQD +N +L G L ++N +I ++ Y +LI E S Sbjct: 181 YIKHQSHLIIFHDQDALNAVLAGHVQMLHPKWNLQNSIVFRKHRPINEAYDQLINEAIKS 240 Query: 260 TLLIHYTGATKPWHKWAIYPSVKYY 284 ++H+T KPW + +P + Y Sbjct: 241 PAIVHFTTHEKPWKTLSEHPYLDEY 265 Score = 81.3 bits (199), Expect = 5e-14, Method: Compositional matrix adjust. 
Identities = 60/265 (22%), Positives = 121/265 (45%), Gaps = 13/265 (4%) Query: 27 CLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAK-LAEQNQLR 85 +NV ++ +++ + S SI+ N+ +FY++ D + + ++ + Sbjct: 278 VVNVISAANSAFVEALATSYISILENDSENQYNFYLLPDHLDQRDMLILGSVISRYDNAS 337 Query: 86 ITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHL 145 I + +++ L+ + ++ Y+R+ A +LL ++R +YLD D++ ++ L Sbjct: 338 IKIVKVDEKLLENAVESDRILKSAYYRILAPELLP-NINRAIYLDCDIIANTNLHDLWQT 396 Query: 146 GLNGAVAAVVKDV---EPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALS 202 L G V A V+D + ++ ++ + +YFNSG++ +DL W +T++ L Sbjct: 397 SLEGNVLAAVEDAGFHDRLEHMGITHDN-----SKYFNSGMMLIDLVSWRSQAVTQRVLD 451 Query: 203 ILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITES--- 259 + ++ DQD +N +L L L ++N I + KL E+ Sbjct: 452 YINHNPEKLRFHDQDALNAILYDKWLHLHPKWNAQSNIVLDALVPPRTELLKLYAETREN 511 Query: 260 TLLIHYTGATKPWHKWAIYPSVKYY 284 LIH+ G KPWH + +P Y Sbjct: 512 PKLIHFCGHVKPWHAESKHPYTNVY 536 >UniRef50_D2ELM0 Lipopolysaccharide biosynthesis glycosyltransferase n=1 Tax=Pediococcus acidilactici 7_4 RepID=D2ELM0_PEDAC Length = 552 Score = 105 bits (263), Expect = 2e-21, Method: Compositional matrix adjust. 
Identities = 75/265 (28%), Positives = 129/265 (48%), Gaps = 11/265 (4%) Query: 28 LNVAYGVDANYLDGVGVSI-TSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRI 86 +N+ D NY D + ++I T++ N F ++ + D + KL N I Sbjct: 4 INILLAADRNYADQLCITIKTALETLNSATRAHFIVLTNNLGDQTRALLDKLM-HNFHTI 62 Query: 87 TLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLG-LTLDRLLYLDADVVCKGDISQLLHL 145 ++ ++ P Q ++ YFR+ A +LL +DRL+YLD DV+ + D+++L Sbjct: 63 EYLNLDDERFDFCPTNQHINKTAYFRIIAPKLLASRQIDRLIYLDVDVLIRKDLTELAES 122 Query: 146 GLNGAVAAVVKDVEPMQEKAVSRLS-DPELLGQ--YFNSGVVYLDLKKWADAKLTEKALS 202 LN V D Q A+ RL DP + YFNSG++ +D+ +W ++TEK L+ Sbjct: 123 NLNQNTVGAVIDTG--QAFALHRLGVDPVVAASNLYFNSGIMVIDVAQWNAHRITEKTLA 180 Query: 203 ILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITES--- 259 + + + + DQD +N +L G FL ++N +I +Q Y +LI E+ Sbjct: 181 FIRNHADRIIFHDQDALNAVLAGEVQFLHPKWNLQNSIIFRKHRPINQGYAELIDEAIKE 240 Query: 260 TLLIHYTGATKPWHKWAIYPSVKYY 284 ++H+T KPW ++P + Y Sbjct: 241 PSIVHFTTHEKPWKDLTVHPYLDEY 265 Score = 85.9 bits (211), Expect = 2e-15, Method: Compositional matrix adjust. 
Identities = 66/277 (23%), Positives = 129/277 (46%), Gaps = 27/277 (9%) Query: 27 CLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKL-AEQNQLR 85 +NV ++ + + S SI+ N+ +F+++ D D + + A + Sbjct: 278 VINVISAANSAFTQALATSYVSILENDPDHQYNFFLLPDHLTDRDMMLLGSIIARYDNAT 337 Query: 86 ITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHL 145 I + +N + L + + Y+R+ A LL +++R +YLD D++ + +L Sbjct: 338 IKVVEVNEELLANAVESDRIVKTAYYRILAPALLP-SINRAIYLDCDIIANTSLHELWQT 396 Query: 146 GLNGAVAAVVKDV---EPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALS 202 L G V A V+D + +++ +++ ++ +YFNSG++ +DL +W T+K L Sbjct: 397 NLEGNVIAAVEDAGFHDRLEKMGITKENE-----KYFNSGMMLIDLVRWRARSTTQKVLD 451 Query: 203 ILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYN--------TIYTIKSELKDKTHQNYKK 254 + ++ DQD +N L L L ++N TI+ ++EL + + Sbjct: 452 YINQNPEKLRFHDQDALNANLYDDWLHLHPQWNAQSNIIMETIFPPRTELLEPYAET--- 508 Query: 255 LITESTLLIHYTGATKPWHKWAIYP----SVKYYKIA 287 E LIH+ G KPWH+ +P +KY+++A Sbjct: 509 --REDPKLIHFCGHVKPWHEGCEHPYADVYLKYHEMA 543 >UniRef50_C6EQF4 Putative uncharacterized protein n=3 Tax=Campylobacter jejuni RepID=C6EQF4_CAMJE Length = 958 Score = 101 bits (251), Expect = 4e-20, Method: Compositional matrix adjust. 
Identities = 76/271 (28%), Positives = 129/271 (47%), Gaps = 32/271 (11%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQ---NQLRI 86 + + VD NYL + +++ S+V +R N Y I ++ + +++ +L E N + I Sbjct: 15 IVFAVDDNYLPYMSIALNSLV--DRVSNCYKYNIFVMHLNIDLERLNRLKENIRNNNVTI 72 Query: 87 TLYRINT-------DKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDI 139 +N + ++ AMY+R+F ++ +++Y D+DV+ K DI Sbjct: 73 EFINLNQYLKKIFKEYGNIFYERSYFTTAMYYRIFIPEIFS-NFKKVIYCDSDVIFKADI 131 Query: 140 SQLLHLGLNGAVAAVVKDV---------EPMQEKAVSRLSDP---ELLGQYFNSGVVYLD 187 S L + LN +D+ E + ++ + D + YFNSGV+ D Sbjct: 132 SHLFFIDLNNKEIGACRDIAALYAYRKRETVWQQNIRNNFDKINFRSISDYFNSGVIVFD 191 Query: 188 LKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDK 247 + K K K L+++ + DN+Y +PDQDV+N++ G FLP E+N ++T E KD Sbjct: 192 IVKCIQMKTVSKCLTVIKNIDNLY-FPDQDVLNIVFCGHVHFLPLEWNFLWTTYIEYKDN 250 Query: 248 THQNYKKLITE------STLLIHYTGATKPW 272 KK+I E +IHY TKPW Sbjct: 251 FMYLPKKIINEIYKAKTKPKIIHYISETKPW 281 >UniRef50_UPI0001AEC697 glycosyl transferase family protein n=1 Tax=Alteromonas macleodii ATCC 27126 RepID=UPI0001AEC697 Length = 361 Score = 100 bits (249), Expect = 8e-20, Method: Compositional matrix adjust. 
Identities = 78/273 (28%), Positives = 139/273 (50%), Gaps = 23/273 (8%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 ++VA+ +D + VS+ SI+ N + ++ Y I ++ ++G +K+ L +N Sbjct: 2 ISVAFCIDDKFAPYAAVSVISILSNTKSF-VNIYFIGNL-SEGVREKLLTL--KNDRSAM 57 Query: 88 LYRINTDKLQCLPCTQVWSRAM---YFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLH 144 ++ + L +P + + + F +A + LD+++YLDADV+ GDI +L Sbjct: 58 VFVAHNLPLSTMPLSDRYVERLNKITFVRYAIAEVLTKLDKVIYLDADVLVCGDIKRLWE 117 Query: 145 LGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSIL 204 L + V D M +K LS YFN+GV+ +DLK W D ++ + LS Sbjct: 118 QPLKKSYVGAVLDHSLMSQKRHITLSLKS--KSYFNAGVLLVDLKIWRDRRIFQ-YLSRT 174 Query: 205 MSKDNVYKYPDQDVMNVLLKGMTLFLPREYNT-IYTIKSELKDKTHQNYKKLITESTLLI 263 + ++Y DQDV+NV+L +L + N Y++K H N K+ L++ Sbjct: 175 HNTRERWEYNDQDVLNVVLDEKVQYLGADMNVQTYSLK-------HINIKE-----PLIV 222 Query: 264 HYTGATKPWHKWAIYPSVKYYKIALENSPWKDD 296 H+TG KPWH +++P Y++ LE+ P+K++ Sbjct: 223 HFTGQEKPWHTSSVHPYKDQYRVLLESVPFKNN 255 >UniRef50_D1PTN4 Putative uncharacterized protein n=1 Tax=Prevotella bergensis DSM 17361 RepID=D1PTN4_9BACT Length = 305 Score = 99.8 bits (247), Expect = 1e-19, Method: Compositional matrix adjust. 
Identities = 69/281 (24%), Positives = 132/281 (46%), Gaps = 16/281 (5%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 +++ + +D NYL ++ SI+ NN+ + F++I++ + KI ++AE ++ Sbjct: 1 MDIVFNIDDNYLMQCCTTMVSILHNNKDGQISFHVISNGLTNESRLKIEQVAEAYHQQVF 60 Query: 88 LYRINTDKL---QCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLH 144 Y +N + + + S A Y RLF +L L +++Y+D D++ G + L + Sbjct: 61 FYVVNPEAMSDYEIFDKQGHISMATYLRLFVADILPERLHKIIYMDCDLIVNGSLDGLWN 120 Query: 145 LGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSIL 204 + G A V+D+ + RL + YFN+GV+ ++L W + ++++A + Sbjct: 121 TDVEGYALAAVEDMWSGKADNYVRLG-YDAADTYFNAGVLVVNLDYWREHNVSQQAAQYV 179 Query: 205 MSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTI-------YTIKSELKDKTHQNYKKLIT 257 K+ DQDV+N L L LP +N I+ E+ K Q Sbjct: 180 ALHAGQLKFNDQDVLNGLFHDSKLLLPFRWNVQDGLLRKRRKIRPEVMPKLDQE-----L 234 Query: 258 ESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSP 298 E+ ++IH+TG KPW+ + P + ++ + W+ P Sbjct: 235 ENPVIIHFTGHRKPWNFSCLNPYKNLFFKYVDMTEWRGFRP 275 >UniRef50_A5ZFA9 Putative uncharacterized protein n=1 Tax=Bacteroides caccae ATCC 43185 RepID=A5ZFA9_9BACE Length = 310 Score = 97.4 bits (241), Expect = 6e-19, Method: Compositional matrix adjust. 
Identities = 80/274 (29%), Positives = 125/274 (45%), Gaps = 9/274 (3%) Query: 29 NVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITL 88 N+ G+D Y G + S+ +N + Y+++ ++ + +L + Q +I Sbjct: 3 NIICGIDDQYCQHCGAMLLSLFESNPGA-ITIYVLSLELSEKSKNLLKELVDSYQKQIHF 61 Query: 89 YRINTDKLQCLP--CTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLG 146 I ++ + P T S A Y RLF QLL +D+ LY+D+D++ K DIS L Sbjct: 62 IDIPSELVLNFPMKSTDYPSLATYLRLFIPQLLPFEVDKALYVDSDIIFKKDISALYDSD 121 Query: 147 LNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMS 206 + A ++D P Q RL PE YFN+G V L++K D T KA++ + Sbjct: 122 ITNYALAGMEDA-PNQNAL--RLGFPES-DLYFNAGFVLLNVKYLRDMDFTNKAMAYIRD 177 Query: 207 KDNVYKYPDQDVMNVLLKGMTLFLPREYNTI--YTIKSELKDKTHQNYKKLITESTLLIH 264 DQDV+N LL G LF+P ++N + + K K + +S +IH Sbjct: 178 CREKIVLHDQDVLNALLHGKVLFVPIKWNMLDCFYRKPPFIAKKYMRELHENLDSPAVIH 237 Query: 265 YTGATKPWHKWAIYPSVKYYKIALENSPWKDDSP 298 ++G KPWH +P K Y W SP Sbjct: 238 FSGPLKPWHHGCPHPLRKEYFNYSRKLSWGCQSP 271 >UniRef50_A1BHG0 Glycosyl transferase, family 8 n=2 Tax=Chlorobium/Pelodictyon group RepID=A1BHG0_CHLPD Length = 307 Score = 96.3 bits (238), Expect = 1e-18, Method: Compositional matrix adjust. 
Identities = 63/279 (22%), Positives = 136/279 (48%), Gaps = 11/279 (3%) Query: 22 INTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQ 81 ++ +N+ + D NY+ + ++ S++ NN+ ++ YII+ ++ ++ I ++ + Sbjct: 2 LHMKNTVNIVFATDKNYIQHLSAALVSLLENNKDLSFTVYIISSGMSEKSYRNIEEIIKT 61 Query: 82 NQLRITLYRINTDKLQCLPCTQ-VWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDIS 140 + ++ + L + + Y+RL L+ +++LYLD+D++ G I Sbjct: 62 GNCTVKHITVSDELFVKLATAHPFYPKGTYYRLLIPDLIDE--EKILYLDSDIIVNGSIK 119 Query: 141 QLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKA 200 +L + + ++D P ++ D E + YFNSG++ ++L KW L +K Sbjct: 120 ELYNQDVEDYFVCAIED--PGFDRHRQLQMDKESI--YFNSGMMLINLAKWKSTGLQKKV 175 Query: 201 LSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYK----KLI 256 + + + +PDQ +N ++ G +P +YN +I S+ +K + Sbjct: 176 IDFIEHNPDAIWFPDQCGLNSVINGRWKKVPLKYNQQSSIFSDDFEKKFDCFSVEELAEA 235 Query: 257 TESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKD 295 ++ ++IHYTG +KPWH +P K Y L+ +P+++ Sbjct: 236 KKNPVIIHYTGGSKPWHFKNRHPYKKLYWKYLKMTPYRN 274 >UniRef50_B2UPJ4 Glycosyl transferase family 8 n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UPJ4_AKKM8 Length = 315 Score = 95.9 bits (237), Expect = 2e-18, Method: Compositional matrix adjust. 
Identities = 79/284 (27%), Positives = 125/284 (44%), Gaps = 18/284 (6%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRI 86 +N+ Y D N G GVSI S++ N ++ D YI+ + + L + L + Sbjct: 1 MNIVYATDDNGALGTGVSIVSLMENLPPGVHADIYIMTGGLSGDNTARFHSLQQGYNLHL 60 Query: 87 TLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLG 146 + DK P WS A Y+RL L T++R LY+D D + DIS + Sbjct: 61 HFIDMK-DKYTDFPVGSKWSAATYYRLGLAGELPATVERALYVDIDTIFNRDISPMYESE 119 Query: 147 LNGAVAAVVKDVEPMQEKAVSRLSDPELLGQ---YFNSGVVYLDLKKWADAKLTEKALSI 203 + A V E + E++ SR LG+ Y N+GV+ + + + + LS Sbjct: 120 FGDCLIAGVFTTEDLSEESFSRWKREMNLGRDSIYINAGVILYHIGRIREECFESQVLS- 178 Query: 204 LMSKDNVYK--YPDQDVMNVLLKGMTLFLPREYN----TIYTIKSELKDKTHQNYKKL-- 255 +K+N+++ + DQD++NV + L L +N I++I+ E N K Sbjct: 179 -WAKNNIHRLSWQDQDILNVCYQQRILLLHPMWNICDGAIWSIRWEGVTSFRNNPLKPAD 237 Query: 256 ---ITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDD 296 +IHY G KPWH +I + + SPWKDD Sbjct: 238 LLEAARRPGIIHYWGHPKPWHPNSIRQDYGLFYKYWKKSPWKDD 281 >UniRef50_A1XRC1 Glycosyltransferase n=1 Tax=Haemophilus ducreyi RepID=A1XRC1_HAEDU Length = 267 Score = 95.9 bits (237), Expect = 2e-18, Method: Compositional matrix adjust. 
Identities = 77/250 (30%), Positives = 117/250 (46%), Gaps = 13/250 (5%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 +N+ + D NY + V + SI+ +N +IN FYI+ + I L E+ I Sbjct: 1 MNIVFSSDENYAPHLSVCLYSILSHNYNIN--FYILDLGIKEESKSFIKSLVEKFNSNIE 58 Query: 88 LYRINTDKLQCLPC-TQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLG 146 +I+ D P S A Y RL L L+++LYLD D + G + L L Sbjct: 59 FIKISVDSFSNFPIYIDYISLATYARLKLTDYLP-QLEKVLYLDIDTIVNGSLIDLWDLD 117 Query: 147 LNGAVAAVVKD--VEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSIL 204 LN A V D +E + K + L YFN+GV+ +D KW + +K++ I+ Sbjct: 118 LNEYYIAAVADPFIESLNYKTILGLDK----NIYFNAGVLLIDCIKWKQYNIFDKSVKII 173 Query: 205 MSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELK-DKTHQNYKKLITESTLLI 263 +Y DQD++N++LK L L YN + + +K DK + K IT ++ Sbjct: 174 KDLSKKLQYQDQDILNLILKDKVLLLDCRYNFMPSQLDFIKRDKVRKGIK--ITTPIVIY 231 Query: 264 HYTGATKPWH 273 HY G KPWH Sbjct: 232 HYCGPKKPWH 241 >UniRef50_UPI000190F79C lipopolysaccharide 1,2-glucosyltransferase n=1 Tax=Salmonella enterica subsp. enterica serovar Typhi str. E01-6750 RepID=UPI000190F79C Length = 98 Score = 95.1 bits (235), Expect = 3e-18, Method: Composition-based stats. Identities = 52/100 (52%), Positives = 69/100 (69%), Gaps = 3/100 (3%) Query: 1 MDSFPAIEIDKVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDF 60 MDSFP IEI + K +D +N N LN++YGVD NYLDGVGVSI S+VLNN +I L F Sbjct: 1 MDSFPEIEIAEYKVFD--ESNNNDDNVLNISYGVDENYLDGVGVSIASVVLNN-NIPLAF 57 Query: 61 YIIADVYNDGFFQKIAKLAEQNQLRITLYRINTDKLQCLP 100 +II D Y+ F + I +LA Q+ ++I+LY I + L+ LP Sbjct: 58 HIICDSYSPCFVKYIERLAVQHHIKISLYLIKVESLEVLP 97 >UniRef50_Q3DNA2 Glycosyl transferase, family 8 n=9 Tax=Streptococcus RepID=Q3DNA2_STRAG Length = 272 Score = 95.1 bits (235), Expect = 3e-18, Method: Compositional matrix adjust. 
Identities = 71/256 (27%), Positives = 123/256 (48%), Gaps = 14/256 (5%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQ-LRI 86 +N+ + +D Y+D V + S+V ++ L+ Y++ ++ +L + Q L + Sbjct: 1 MNLLFSIDDMYVDHFKVMLYSLVRQTKNRKLEIYVLQKT----LLKRHTELIQYTQNLEV 56 Query: 87 TLYRI--NTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLH 144 + I T+ P T + +Y+RL A + L TLDR+LYLDAD++C D S L Sbjct: 57 GYHPIIVGTEVFAQAPTTDRYPDTIYYRLLAHKFLPETLDRILYLDADMLCLNDFSSLYD 116 Query: 145 LGLNG---AVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKAL 201 + L A A+ D + + RL + EL YFN+GV+ ++L + L Sbjct: 117 MELGDQLYAAASHNTDGKFLDYVNKLRLKNVELESSYFNTGVLLMNLPAIRKVVHQQTIL 176 Query: 202 SILMSKDNVYKYPDQDVMNVLLKGMTLFLPRE---YNTIYTIKSELKDKTHQNYKKLITE 258 +M PDQD++N L + +P E Y+ Y++ +LK + + + +I Sbjct: 177 DYMMQNRGRLILPDQDILNGLYANLVKPIPDEIYNYDARYSLIYQLKSRNEWDLEWVINH 236 Query: 259 STLLIHYTGATKPWHK 274 T+ +H+ G KPW K Sbjct: 237 -TVFLHFAGRDKPWKK 251 >UniRef50_B3CA80 Putative uncharacterized protein n=1 Tax=Bacteroides intestinalis DSM 17393 RepID=B3CA80_9BACE Length = 301 Score = 95.1 bits (235), Expect = 3e-18, Method: Compositional matrix adjust. 
Identities = 65/251 (25%), Positives = 123/251 (49%), Gaps = 8/251 (3%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKL-AEQNQLRI 86 +++ +D NY++ GV + S+ ++ +II + +K K+ E++Q + Sbjct: 2 IDIVCSIDENYIEYCGVMLASLFVHTPDEKFRVHIICSSKVEKAGKKRLKVFCEKHQAEV 61 Query: 87 TLYRINTDKLQCLPCTQV--WSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLH 144 Y ++ ++ P + S A Y RLF +L+ ++++LYLD D++ I +L Sbjct: 62 YFYDVDYSLIKDFPIRKQDHLSLAAYLRLFMSELIPSNINKILYLDCDLIVVDSIKELWE 121 Query: 145 LGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSIL 204 ++ A V++ P ++ L P + YFNSGV+ ++L+KW + K E S + Sbjct: 122 KNIDNIAVAAVEERSPFDTESPVTLKYP-VEYSYFNSGVMLINLQKWREKKFVEACKSYI 180 Query: 205 MSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTI---YTIKSELKDKTHQNYKKLITESTL 261 S K DQDV+N LL F+ +N + E++ + +++ + +S Sbjct: 181 ASNYENIKLHDQDVLNALLYKEKQFISIRWNLMDFFLYASPEVQPERKKDWDDAL-KSPA 239 Query: 262 LIHYTGATKPW 272 +IH+TG KPW Sbjct: 240 IIHFTGKRKPW 250 >UniRef50_C7IBC1 Glycosyl transferase family 8 n=2 Tax=Clostridium papyrosolvens DSM 2782 RepID=C7IBC1_9CLOT Length = 452 Score = 95.1 bits (235), Expect = 3e-18, Method: Compositional matrix adjust. 
Identities = 78/286 (27%), Positives = 131/286 (45%), Gaps = 12/286 (4%) Query: 26 ECLNVAYGVDANYLDGVGVSITSIVLNNR-HINLDFYIIADVYNDGFFQKIAKLAEQNQL 84 E + + D++Y+ +GV ITS++ N +L+FY+I D + + Sbjct: 2 ETVKIVSACDSHYVQHLGVMITSLLENTSMKTSLEFYVIDGGITDADKELLCSCTCLYGC 61 Query: 85 RITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLH 144 +I I D + S A YFR+F +LL ++++++YLD D+V DI++L Sbjct: 62 KINFITIQADFYARFGESPSASDATYFRIFVSELLDTSVEKVIYLDCDIVVIKDIAELWK 121 Query: 145 LGLNGAVAAVVKD--VEPMQEKAVS--RLSDPELLGQYFNSGVVYLDLKKWADAKLTEKA 200 ++ A V D VE E AV+ R + YFN+GV+ ++L KW + +++ Sbjct: 122 TDVSEYFLAAVADCGVEYSGEYAVTLKRKLGMKRKDCYFNAGVLLINLVKWREESISKSI 181 Query: 201 LSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYN--TIYTIKSELKDKTHQNYKKLITE 258 L + DQD +N +L L L +N + E + +N + + E Sbjct: 182 CKFLFENKGKIDFADQDGLNAVLCNRWLPLDSRWNQQVAHCEFYEQEKVVWENVTRAVRE 241 Query: 259 STLLIHYT----GATKPWHKWAIYPSVKYYKIALENSPWKDDSPRD 300 +IHYT TKPW+ ++P + Y L +PWK P D Sbjct: 242 P-WIIHYTTSYFSGTKPWNYLDMHPYRQEYYRYLHMTPWKSFIPPD 286 >UniRef50_C9PNX4 Family 8 glycosyl transferase n=1 Tax=Pasteurella dagmatis ATCC 43325 RepID=C9PNX4_9PAST Length = 285 Score = 94.7 bits (234), Expect = 4e-18, Method: Compositional matrix adjust. 
Identities = 73/255 (28%), Positives = 120/255 (47%), Gaps = 24/255 (9%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 +N+ D + + V I S+ +N+ N+ FY++ Y +FQ + + I Sbjct: 12 MNIVLSADVQFSEQVKTLIKSVSYHNK--NVHFYLLNKDYPSEWFQILNQYLAYFGSNII 69 Query: 88 LYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLG-LTLDRLLYLDADVVCKGDISQLLHLG 146 +++++ + P S A YFR LLG L LDR+LYLD DVV G ++++ + Sbjct: 70 DAKVDSEVISTFPTLDHISEASYFRY----LLGQLPLDRVLYLDCDVVVTGSLTEIYYTD 125 Query: 147 LNGAVAAVVKD----VEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALS 202 + V+D + P K + YFNSG++ +DL KW D + + + Sbjct: 126 FGDNMMYAVEDAFLNIAPHSYKEFPDMK------PYFNSGMLLIDLNKWRDQNIENQLMD 179 Query: 203 ILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYN-----TIYTIKSELKDKTHQNYKKLIT 257 + N+Y Y DQD MN++LKG L + YN I I+ ++ + + YK L Sbjct: 180 LTKQAVNLY-YGDQDAMNIILKGKWQALDKIYNYQTGSLIAFIQHKMPEAL-EKYKDLQG 237 Query: 258 ESTLLIHYTGATKPW 272 + +IHY KPW Sbjct: 238 QQPKVIHYITRYKPW 252 >UniRef50_B7AIV0 Putative uncharacterized protein n=1 Tax=Bacteroides eggerthii DSM 20697 RepID=B7AIV0_9BACE Length = 321 Score = 94.4 bits (233), Expect = 6e-18, Method: Compositional matrix adjust. 
Identities = 68/283 (24%), Positives = 130/283 (45%), Gaps = 18/283 (6%) Query: 25 SECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQL 84 ++ +++ ++ Y +I SI +NN++ + ++I D + ++ K+A Sbjct: 12 TKAIHIVVCINDAYSQHCAATIASIFINNKNEVIKIHVITDYISKKNQSRLEKIAFNFNQ 71 Query: 85 RITLYRINTDKLQCLPCTQVW-----SRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDI 139 +I Y N L PC + + Y+RLF Q+L L + + YLD D++ + Sbjct: 72 QIQFYTFNNSTLNRWPCFKDGMPPHVTIQTYYRLFIPQILPLNIKKTFYLDCDLLVLHPL 131 Query: 140 SQLLHLGLNGAVAAVVKDVEPMQEKAVSRL---SDPELLGQYFNSGVVYLDLKKWADAKL 196 + + + A + D +A +RL +D E YFN+GV+ L+L+ + Sbjct: 132 REFWNTKMQNKGVAAIADQWTDYIEAATRLKYRNDRE----YFNAGVLLLNLEYLRNHNF 187 Query: 197 TEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLI 256 T A+ + N Y DQDV+N L+ + +P ++N ++ DK Y + Sbjct: 188 TNNAIDFVTKHANDIVYHDQDVLNKLIGENRIIMPVKWN---VCSFKINDKIPHIYNATM 244 Query: 257 TES---TLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDD 296 ++ +IH+ KPW++ + +P YY L+ +PWK + Sbjct: 245 NDARKDPYIIHFFAPIKPWNQDSSHPYRSYYYYFLQFTPWKHE 287 >UniRef50_Q5LF36 Putative glucosyltransferase n=1 Tax=Bacteroides fragilis NCTC 9343 RepID=Q5LF36_BACFN Length = 308 Score = 94.0 bits (232), Expect = 7e-18, Method: Compositional matrix adjust. 
Identities = 72/279 (25%), Positives = 136/279 (48%), Gaps = 13/279 (4%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIA---DVYNDGFFQKIAKLAEQNQL 84 +++ + +D +Y+ GV+ITS+ +NN + + F+I+ ++N +KI Q Sbjct: 1 MDIVHCIDNSYVAQCGVTITSVCVNNVNEVILFHILTTNLSIFNREMLKKIVDKYRQ--- 57 Query: 85 RITLYRINTDKLQCLPCTQV--WSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQL 142 +I Y ++ L P + S A YFR+ +L +L+++LYLD D+V +I +L Sbjct: 58 KIIFYNVDEYLLNKCPLREGDHVSLATYFRILMPDILPKSLNKVLYLDCDLVVCKNIKRL 117 Query: 143 LHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALS 202 ++ V D + +RL ++ YFN+GV+ ++L W + ++ K L Sbjct: 118 WDTDISTHSLGAVYDGGTDDIRTYNRLK-YDIRQGYFNAGVLLVNLAYWREFHISNKLLK 176 Query: 203 ILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTI---YTIKSELKDKTHQNYKKLITES 259 + + DQD +N +L T LP +YN + YT + L+++ + + + Sbjct: 177 FIEQYPERLMFWDQDALNSVLIQTTKILPFKYNMLDAFYTKELALREEYLFEIEGALCDP 236 Query: 260 TLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSP 298 T+L H++ KPW K +P ++ L+ + W D P Sbjct: 237 TIL-HFSSPNKPWLKTCDHPLKSFFFEYLKRTSWNDKFP 274 >UniRef50_C6I3U6 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=C6I3U6_9BACE Length = 310 Score = 93.6 bits (231), Expect = 8e-18, Method: Compositional matrix adjust. 
Identities = 76/258 (29%), Positives = 124/258 (48%), Gaps = 21/258 (8%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 +N+ +D NY+ V +TS +NN + + Y+I NDG + ++ + Sbjct: 1 MNILCCLDDNYVQHTSVMLTSFFINNDFEHHNIYVITMQLNDGNVAYLREVVNKYHSNFY 60 Query: 88 LYRINTDKLQCL--PCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHL 145 LY++N L T S A Y RLF+ Q+L ++LY+D D+V + + +L + Sbjct: 61 LYQVNEAMLSGFVRKETDYVSLAAYLRLFSTQVLPFNCSKVLYIDGDIVVRKSLEELWKM 120 Query: 146 GL-NGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSIL 204 + N AVAAV E ++ + D L YFNSG + ++L W + + EKA+ + Sbjct: 121 DIENYAVAAV---DETIKANCIRHNYDVTL--GYFNSGFMLINLSFWRENSVAEKAIDYM 175 Query: 205 MSKDNVYKYPDQDVMN-VLLKGMTLFLPREYN--TIYTIKSELKDKTHQNYKKLITES-- 259 K DQD +N +L G+ L +YN TI+ K ++ Q++ K+ TE Sbjct: 176 KRFPERIKSWDQDALNGILYGGLWKRLDLKYNLTTIFLCKQYVEG---QDFPKIYTEEYN 232 Query: 260 -----TLLIHYTGATKPW 272 ++HYTG KPW Sbjct: 233 SAISDPAVVHYTGPDKPW 250 >UniRef50_B3Z5I6 Glycosyltransferase family 8 n=2 Tax=Bacillus cereus group RepID=B3Z5I6_BACCE Length = 317 Score = 92.0 bits (227), Expect = 3e-17, Method: Compositional matrix adjust. 
Identities = 80/296 (27%), Positives = 139/296 (46%), Gaps = 22/296 (7%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHIN-LDFYIIAD---VYNDGFFQKIAKLAEQNQ 83 LNV Y D NY VGVS+ S++ NN+H N L+ ++I + YN + K + Sbjct: 3 LNVVYSSDDNYAQHVGVSLLSLLQNNQHFNNLNIFLIENNISSYNKKNLNSVCKKYNKTI 62 Query: 84 LRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLL 143 I + + ++L+ L + Y RLF ++ LD+++YLD D + +S L Sbjct: 63 QYIN-FNVLLERLE-LNINDSIAINSYARLFLAGIIPEELDKIIYLDCDSIINSSLSDLW 120 Query: 144 HLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSI 203 + A V D Q K + E Y N+G++ ++LKKW + + +K + Sbjct: 121 DTDVTEYFVAGVCDTVSNQTKLRIDMDKSE---GYINAGMLLINLKKWREENIEQKFMEF 177 Query: 204 LMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTI---YTIKS-------ELKDKTHQNYK 253 + KD + DQ +N +LK L+L ++N + +T+ EL++ ++ Sbjct: 178 IKKKDGNVFHHDQGTINGVLKDKILYLHPKFNAMTPFFTMSRKEIMSYYELENYYNEIEI 237 Query: 254 KLITESTLLIHYTGA--TKPWHKWAIYPSVKYYKIALENSPWKD-DSPRDAKSIIE 306 ++ + IHYT A +PW + +P YK L+ +PWK D +D + +E Sbjct: 238 DEAVKNPVFIHYTPAFVNRPWIEGCKHPLTSLYKSYLDMTPWKSTDLWKDRRGKVE 293 >UniRef50_C5ELK9 Glycosyl transferase n=3 Tax=Clostridiales RepID=C5ELK9_9FIRM Length = 333 Score = 91.3 bits (225), Expect = 5e-17, Method: Compositional matrix adjust. 
Identities = 86/312 (27%), Positives = 138/312 (44%), Gaps = 37/312 (11%) Query: 24 TSECLNVAYGVDANYLDGVGVSITSIVLNNRHI-NLDFYIIADVYNDGFFQKIAKLAEQN 82 E N+ Y + Y + S+ S++ NNR++ N+D YI++ + +++A +AE Sbjct: 4 NEETANIIYASNDGYAGHLAASMYSLLDNNRNVRNMDIYILSAQMCQEYKERLAGMAEA- 62 Query: 83 QLRITLYRINTDKLQCLPCTQVWSRAM----YFRLFAFQLLGLTLDRLLYLDADVVCKGD 138 TL+ + L+ + +R RLFA Q+L T+ + LYLD D + Sbjct: 63 -FHRTLHVVELGDLKQRFDFDIDTRGFDISAMGRLFAPQVLPGTVKKALYLDCDTIVCKS 121 Query: 139 ISQLLHLGLNGAVAAVVKDVEP-----MQEKAVSRLSDPELLGQYFNSGVVYLDLKKWAD 193 I L L AV +V +EP M+E DP Y+NSGV+ + L +W Sbjct: 122 IRPLYETELGDAVVGMV--MEPTVYKEMKESIGMGKDDP-----YYNSGVLLMALDRWRQ 174 Query: 194 AKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYN-----------TIYTIKS 242 + +K L S DQD +N LKG LP +YN T+ ++ + Sbjct: 175 EDVLQKLLDFYKSCHGRLFACDQDTINGALKGRIKTLPVKYNYFTNYRYFRYSTLCSMCA 234 Query: 243 ELKDKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAK 302 ++ + Y + S +IHY G +PW K Y+ L +PWK D+P+ Sbjct: 235 AYREIGEEAYLE-ARRSPAIIHYLGDERPWIAGNHNHFKKLYEYYLAKTPWK-DTPKQTG 292 Query: 303 SIIEFKKRYKHL 314 K+RY H+ Sbjct: 293 -----KERYMHM 299 >UniRef50_A4ECW2 Putative uncharacterized protein n=1 Tax=Collinsella aerofaciens ATCC 25986 RepID=A4ECW2_9ACTN Length = 328 Score = 90.5 bits (223), Expect = 7e-17, Method: Compositional matrix adjust. 
Identities = 76/318 (23%), Positives = 136/318 (42%), Gaps = 22/318 (6%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHI-NLDFYIIADVYNDGFFQKIAKLAEQNQLRI 86 +N+ Y VD N++ + +I S+V N+ I ++ F++ ++ + + + ++ + + Sbjct: 4 MNLLYTVDNNFVPQLAANICSVVSNHSGIQDITFHVFSNGITEDNQRLLQEMVTEYNQNL 63 Query: 87 TLYRINT--DKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLH 144 Y I+ D L T W+ + RL L ++R++YLD D + GDI+ L + Sbjct: 64 VFYDISNFKDALGFDFDTSGWNEIVLARLLMAHFLPNEIERVIYLDGDTIVLGDIALLWN 123 Query: 145 LGLNGAVAAVVKDVEPMQEKAVSRLSDPELLG-QYFNSGVVYLDLKKWADAKLTEKALSI 203 L G V +V P SRL+D +L G Y N+GV+ +DLK+W ++ L Sbjct: 124 QDLKGCVVGMV----PEPTVGPSRLNDLDLNGCLYHNAGVLLVDLKQWRSTCCEDQLLDY 179 Query: 204 LMSKDNVYKYPDQDVMNVLLKGMTLFLPREYN--------TIYTIKSELKDKTHQNYKKL 255 + DQD +N +LK L +N + S + + +N Sbjct: 180 CERRSGRLFANDQDALNAVLKDKICSLSPAFNYSNIFDYYPFIFLNSLMPGFSDENSFNT 239 Query: 256 ITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHLL 315 +++HY G +PW + + Y L + WKD D R + L Sbjct: 240 ARSKPIVVHYLGEERPWRRGNTHRFNNEYHFYLSETFWKDAKDEDGWGAYFLAWRTFNFL 299 Query: 316 ------VQHHYISGIIAG 327 +++ ISG+I Sbjct: 300 TRPFPQLRYKVISGLIPA 317 >UniRef50_A6LGX5 Glycosyltransferase family 8 n=1 Tax=Parabacteroides distasonis ATCC 8503 RepID=A6LGX5_PARD8 Length = 325 Score = 90.5 bits (223), Expect = 9e-17, Method: Compositional matrix adjust. 
Identities = 62/265 (23%), Positives = 122/265 (46%), Gaps = 19/265 (7%) Query: 29 NVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITL 88 ++ D NYL V + S+ N +L F+++++ + + + + E + ++++ Sbjct: 3 DIVVASDCNYLHLVSICAVSLFETNSSESLHFHLLSNGIDSADIKNLQTIVEGYRGKLSV 62 Query: 89 YRINTDKLQCLP-CTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGL 147 Y I + + + + S Y RLFA +L LD++LY+D D++ G I L + L Sbjct: 63 YPIENLRERLMTDVPETISLTSYARLFAGSILPANLDKVLYIDCDIIFNGSIRDLFNTDL 122 Query: 148 NGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSK 207 + + D P+ + + + Y N+GV+ + L +W + +K + L++ Sbjct: 123 GNCLVGGILD--PLISRTYKKEIKIPMSEPYINAGVLIIPLNRWRSEGMEQKFVDFLVAN 180 Query: 208 DNVYKYPDQDVMNVLLKGMTLFLPREYNT-----------IYTIKSELKDKTHQNYKKLI 256 + DQ ++N + G LP ++N +Y I + D+ + YKK I Sbjct: 181 RGKVHHHDQGIINAVCAGRKKILPPQFNVMSNSLCYPWKDLYKINTPFYDQ--EEYKKGI 238 Query: 257 TESTLLIHYTGAT--KPWHKWAIYP 279 + S +IH+TGA +PW +P Sbjct: 239 S-SPAIIHFTGAIHGRPWIVGCTHP 262 >UniRef50_C7IBC8 Glycosyl transferase family 8 n=1 Tax=Clostridium papyrosolvens DSM 2782 RepID=C7IBC8_9CLOT Length = 464 Score = 89.7 bits (221), Expect = 1e-16, Method: Compositional matrix adjust. 
Identities = 64/229 (27%), Positives = 105/229 (45%), Gaps = 13/229 (5%) Query: 80 EQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDI 139 E+ RI + + Q + YFR+F +++ ++ +++YLD D+V KGDI Sbjct: 19 EKYGSRIRFLELKPELYQDFKTQSYFGYVTYFRIFIPEIVEASVRKVIYLDCDIVIKGDI 78 Query: 140 SQLLHLGLNGAVAAVVKDV--------EPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKW 191 +L ++ A V+DV M +K + G+YFN+GV+ ++L KW Sbjct: 79 RKLWENDISEYFVAAVEDVGIDIGGNFATMVKKHIGIPRK----GKYFNAGVLLINLDKW 134 Query: 192 ADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELK-DKTHQ 250 K TE L+ + DQD +N + K L LP E+N I LK ++ + Sbjct: 135 RADKTTETIRKYLIENREKIYFADQDGLNAVFKDRWLKLPIEWNQQADILELLKRNRIDR 194 Query: 251 NYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPR 299 + ++IHYT KPW +P + Y L +PW D +P+ Sbjct: 195 PDVMKAALNPMIIHYTKQVKPWQYKDCHPLKEEYHRYLRLTPWNDTAPK 243 >UniRef50_B1MX28 Lipopolysaccharide biosynthesis protein, LPS:glycosyltransferase n=2 Tax=Leuconostoc RepID=B1MX28_LEUCK Length = 283 Score = 89.4 bits (220), Expect = 2e-16, Method: Compositional matrix adjust. 
Identities = 73/271 (26%), Positives = 121/271 (44%), Gaps = 11/271 (4%) Query: 22 INTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQ 81 I + +N+ +D NY+ + V + S+ N N+ ++ D +K+ + Q Sbjct: 6 IINDDSVNILITIDENYIKPLRVLLYSLRQTNPRENMTIWLAHDHIEVAQLEKLHQFVAQ 65 Query: 82 NQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQ 141 + +++T P + + MYFRL Q L TL R++YLD D++ I Sbjct: 66 LGFVLHTIKVDTSLWASAPTFKQYPPEMYFRLLCGQYLPKTLHRVIYLDPDILVINPIRP 125 Query: 142 LLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQ---YFNSGVVYLDLKKWADAKLTE 198 L ++ L G + A M +S+ + LG YFNSGV+ +DL + Sbjct: 126 LANMPLKGQMLAASSH---MGLTGISQTINHLRLGTRQVYFNSGVMLMDLDMMRQRVDMK 182 Query: 199 KALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPRE---YNTIYTIKSELKDKTHQNYKKL 255 LS++ PDQD++N L L LP E Y+T I K + + Sbjct: 183 AILSVIQQYGKELILPDQDILNYLYGDEILSLPEEIWNYDTRDNIMHYAKSFGSVD-MRW 241 Query: 256 ITESTLLIHYTGATKPWHKW-AIYPSVKYYK 285 + E+T+++HY G KPW K +I P + Y+ Sbjct: 242 VMENTVILHYCGRPKPWEKSNSINPFIMLYQ 272 >UniRef50_A7VNX5 Putative uncharacterized protein n=1 Tax=Clostridium leptum DSM 753 RepID=A7VNX5_9CLOT Length = 344 Score = 89.4 bits (220), Expect = 2e-16, Method: Compositional matrix adjust. 
Identities = 67/283 (23%), Positives = 133/283 (46%), Gaps = 19/283 (6%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNR-HINLDFYIIADVYNDGFFQKIAKLAEQNQLRI 86 +N + D N+ D +G ++ S+ NNR ++ YI+ ++G +K+ + +Q + + Sbjct: 13 MNCVFSSDDNFADILGCALISLFENNREQETIEVYILDGGISEGNKRKLESIFQQYERMV 72 Query: 87 TLYRI-NTDKLQCLPCTQ-VWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLH 144 + + +L T W + + R+ LL + R+LYLD D++ G + L Sbjct: 73 HFIEVPDISQLTGEAVTSGRWPISTFARILIDSLLPKEVKRVLYLDCDILVLGSLKNLWE 132 Query: 145 LGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSIL 204 + L AA V D Q K + ++ + Y N+GV+ +D+ KW + ++ ++ ++ + Sbjct: 133 IDLKDKTAAGVMDCLSNQRKQNAGINGED---SYINAGVMLIDMDKWRENQIEKQCMNYI 189 Query: 205 MSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTI-----YTIKSELKDKTHQNYKKL---- 255 + Y DQ V+N +L L LP EYN + +T +K + Q+Y Sbjct: 190 RICNGQVAYNDQGVINKVLHKDLLVLPPEYNAMTLFFDFTYPDMIKYRKPQSYYSAQQVD 249 Query: 256 -ITESTLLIHYTG---ATKPWHKWAIYPSVKYYKIALENSPWK 294 + ++H+T + +PW K + +P ++ + SPW+ Sbjct: 250 HARKHPRIVHFTSSFLSLRPWVKGSEHPYAPLWRNYYKRSPWR 292 >UniRef50_D2RIJ7 Glycosyl transferase family 8 n=1 Tax=Acidaminococcus fermentans DSM 20731 RepID=D2RIJ7_ACIFE Length = 330 Score = 89.4 bits (220), Expect = 2e-16, Method: Compositional matrix adjust. 
Identities = 70/283 (24%), Positives = 135/283 (47%), Gaps = 26/283 (9%) Query: 14 AWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQ 73 ++++ A + L++ V+ + GV +TSI NN+ + L+F++ D +D + Sbjct: 21 SFEYMTAENKKKDILHICCNVNDLFFKPAGVLLTSICENNKDLALNFHVFVDSCSDENKE 80 Query: 74 KIAKLAEQNQLRITLYRINTDKLQCLPC-TQVWSRAMYFRLFAFQLLGLTLDRLLYLDAD 132 + K AE+ LY+++ Q + +SR Y R+ +L +R LYLDAD Sbjct: 81 NLRKTAEKYGCNAYLYKMDMSIYQNFHIKVKRFSRVTYIRIVMPWVLRNVTNRYLYLDAD 140 Query: 133 VVCKGDISQLLHLGL-NGAVAAVVKDVEPMQEKAVSRLSDPELLGQ-YFNSGVVYLDLKK 190 +VC + + L + AV A+V D R++ ++ G YF+ G++++++ + Sbjct: 141 MVCVKSLRVFFNYDLKDKAVGALVYDTP-------ERIAFLKMKGNVYFSDGLMWINVDE 193 Query: 191 WADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQ 250 W ++TE+ S + +K QD+MN++L G +P ++ + Sbjct: 194 WIKQRVTERVFSYQGADPARFKGQTQDLMNLVLDGNVQPIPALFHHM------------- 240 Query: 251 NYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPW 293 K + +LIHY+G KPW + + + ++ L+ SPW Sbjct: 241 --DKDFSVDGILIHYSGRDKPW-EIVLDEDDELWRHYLDISPW 280 >UniRef50_B3Q568 Putative glycosyltransferase protein n=4 Tax=Rhizobium etli RepID=B3Q568_RHIE6 Length = 331 Score = 88.6 bits (218), Expect = 3e-16, Method: Compositional matrix adjust. 
Identities = 69/274 (25%), Positives = 121/274 (44%), Gaps = 20/274 (7%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHI-NLDFYIIADVYNDGFFQKIAKLAEQNQLRITL 88 + + VDA Y + ++ S+ NN+ + LD ++I + + + I + N I Sbjct: 22 IVFAVDAAYAVPLATALRSVAENNQSVWPLDIHVIHEGIGEETKRLILESLPANSAIIQW 81 Query: 89 YRINTDKLQCLPCTQVWSRAMYF-RLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGL 147 + I T T+ M F R+ Q L T DR LYLD D++ + QL + L Sbjct: 82 HPIATLSFASGFSTRPGVSKMTFARILLPQFLPQTCDRALYLDGDILVLTSLEQLWNTDL 141 Query: 148 NGAVAAVVKDV---EPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSIL 204 AV V D P +R L+ +YFN+G++ +DL KW + +++E++L L Sbjct: 142 GEAVIGAVPDYWLDNPAGSGPGARGG--ALVKRYFNAGILLIDLAKWRNERISERSLDYL 199 Query: 205 MSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIH 264 + +Y DQD +NV G L R +N + + + + + ++H Sbjct: 200 -DRFPTTEYSDQDALNVACDGKWKILDRAWNFQFEPRQAIAGIA-------LEQKAAIVH 251 Query: 265 YTGATKPWHKWAIYPSVKYY-----KIALENSPW 293 + KPW ++ P+V +Y + +PW Sbjct: 252 FVTNVKPWKSGSLSPNVAFYDAFRSRTCFALTPW 285 >UniRef50_Q5WI33 Lipopolysaccharide glycosyltransferase n=6 Tax=Firmicutes RepID=Q5WI33_BACSK Length = 274 Score = 88.2 bits (217), Expect = 4e-16, Method: Compositional matrix adjust. 
Identities = 68/259 (26%), Positives = 116/259 (44%), Gaps = 22/259 (8%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 +N+ ++A+YL + V +TS+ +NN H + Y+I + Q + + + Sbjct: 1 MNILVTLNAHYLKPLQVMLTSLFMNNAHEDFTIYLIHSSIPEKQLQLLEQFVCHQGHSLV 60 Query: 88 LYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGL 147 + + P + +S MY+RL A++ L LDR+LYLD D++ I L + Sbjct: 61 IVETDKTLFANAPVVKHYSSEMYYRLLAYRFLPTELDRILYLDPDILVLNPIRPLYEANI 120 Query: 148 NGAV-AAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMS 206 + + AA +QE RL+ E + Y+NSGV+ ++L K + + + + Sbjct: 121 DSYLYAAAQHSFINIQEINKFRLNAYE-MDAYYNSGVLLMNLAKQRETMDINDIFAYVET 179 Query: 207 KDNVYKYPDQDVMNVLLKGMTLFL-PREYNTIYTIKSELKDKTHQNYKKL---------- 255 N PDQDV+N L + R YN D + Y KL Sbjct: 180 YRNRLVLPDQDVLNALYSPQIKNVDERLYNY---------DARYYRYYKLKSGGRFDIDA 230 Query: 256 ITESTLLIHYTGATKPWHK 274 + + T+++H+ G KPWHK Sbjct: 231 VLQQTVILHFCGKKKPWHK 249 >UniRef50_C4VEI8 General stress protein A n=24 Tax=Enterococcus RepID=C4VEI8_ENTFA Length = 303 Score = 88.2 bits (217), Expect = 4e-16, Method: Compositional matrix adjust. 
Identities = 68/264 (25%), Positives = 127/264 (48%), Gaps = 9/264 (3%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNN-RHINLDFYIIADVYNDGFFQKIAKLAEQNQL-- 84 L + + N++ + SI+ N+ + FY+I D N Q + + QL Sbjct: 10 LAIVSCCNTNFVPHLAAMFVSILENSPSAAAVHFYVIDDNINFESKQLLYFTIKHTQLNA 69 Query: 85 RITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLL-GLTLDRLLYLDADVVCKGDISQLL 143 +T ++IN + + ++ + Y+R+ +L G ++RLLY+D D++ D+++L Sbjct: 70 ELTFFKINPHFFKNVVTSERIPKTAYYRIAIPELFRGSQIERLLYMDCDMIALDDVAKLW 129 Query: 144 HLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSI 203 + L + A V+D Q + +++ P YFNSG++ +D+KKW + +T K L Sbjct: 130 TVDLGENIIAAVEDAGFHQR--LEKMAIPAESMCYFNSGLLLIDVKKWLNLDVTTKVLRF 187 Query: 204 LMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTL-- 261 + + ++ DQD +N +L L ++N I S+ K +K E+ Sbjct: 188 IEENPDKLRFHDQDALNAVLHDRWTLLHPKWNAQGYILSKAKKHPTIYGEKQYEETRRAP 247 Query: 262 -LIHYTGATKPWHKWAIYPSVKYY 284 +IH+TG KPW K + + +YY Sbjct: 248 SIIHFTGHVKPWTKEFQWYTKRYY 271 >UniRef50_Q65VF6 RfaJ protein n=1 Tax=Mannheimia succiniciproducens MBEL55E RepID=Q65VF6_MANSM Length = 309 Score = 87.8 bits (216), Expect = 4e-16, Method: Compositional matrix adjust. 
Identities = 76/259 (29%), Positives = 125/259 (48%), Gaps = 19/259 (7%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYI----IADVYNDGFFQKIAKLAEQN- 82 +N+ + D NY + V I SI L+N ++ FYI I++ I + ++N Sbjct: 1 MNIIFNCDENYAPYLSVVIKSI-LDNTTLSTQFYILDFNISEESKSCIKNLIQNINKKNS 59 Query: 83 -QLRITLYRINTDKLQCLPCTQVW-SRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDIS 140 Q I +I+ + QC P T + S A Y RL L L++ +YLD D++ D+S Sbjct: 60 FQHSINFIKIDDNDFQCFPQTISYISSATYARLKVADYLN-ELNKAIYLDIDIIVISDLS 118 Query: 141 QLLHLGL-NGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEK 199 +L H+ L + V A + + + R + Y N+GV+ L+LK + L +K Sbjct: 119 RLWHIDLADNLVGACLDPYIEYENQDYKRKIGLQDSQPYINAGVLLLNLKALREFNLYQK 178 Query: 200 ALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLI--- 256 A+ N+ ++ DQD++N +LKG LFL YN +T+ + K K L+ Sbjct: 179 AIDWNKDYPNI-QFQDQDILNGVLKGKVLFLDSRYN--FTVNHRNRIKLAHKGKLLLSSL 235 Query: 257 ---TESTLLIHYTGATKPW 272 T+ ++HY G+ KPW Sbjct: 236 EKATKPICILHYVGSHKPW 254 >UniRef50_A0KQP2 Glycosyl transferase, family 8 n=2 Tax=Aeromonas RepID=A0KQP2_AERHH Length = 366 Score = 87.8 bits (216), Expect = 5e-16, Method: Compositional matrix adjust. 
Identities = 67/276 (24%), Positives = 146/276 (52%), Gaps = 27/276 (9%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRH-INLDFYIIADVYNDGFFQKIAKLAEQNQLRI 86 ++ A+ +D ++ + I S+ + H + L +++A + F K++KL +N L I Sbjct: 5 IHSAFCIDDSFAVHLAALIHSLGKHLSHDLQLQCHVLARLSETNKF-KLSKLESEN-LVI 62 Query: 87 TLYRINTDKLQCLPCTQVWSRAM----YFRLFAFQLLGLTLDRLLYLDADVVCKGDISQL 142 Y N + +P + +++ + Y+R FA + ++D++L++D+D++ GDIS L Sbjct: 63 KFYD-NLPDYKDIPISNLYNNRLNEVTYYR-FAIPHILKSIDKVLFIDSDMIALGDISPL 120 Query: 143 LHLGLNGAVAAVVKD--VEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKA 200 + + A+ AVV D + ++K + R G+YFN+G + ++L KW ++E+A Sbjct: 121 WSIDMGDAIVAVVSDHILGCDKKKQLMRGISS---GKYFNAGFMLMNLDKWRAKNISEQA 177 Query: 201 LSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITEST 260 L +L+ ++N +++ DQD +N++L+ T+++ ++N N+ Sbjct: 178 LRLLI-ENNGFEHNDQDALNIVLENKTVYIDNKWN------------AQPNHLAQNNFLP 224 Query: 261 LLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDD 296 +L+H+ G KPWH ++ +P Y ++ + + ++ Sbjct: 225 ILVHFCGQEKPWHIYSNHPFKGSYLVSRRETDYANE 260 >UniRef50_C2HBB8 Family 8 glycosyltransferase n=13 Tax=Lactobacillales RepID=C2HBB8_ENTFC Length = 300 Score = 87.4 bits (215), Expect = 6e-16, Method: Compositional matrix adjust. 
Identities = 55/204 (26%), Positives = 100/204 (49%), Gaps = 6/204 (2%) Query: 86 ITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLL-GLTLDRLLYLDADVVCKGDISQLLH 144 + +IN + + + Y+R+ +L G ++R+LY+D D++ DIS+L Sbjct: 67 VEFLKINKEFFTNVVISDRIPETAYYRIAIPELFRGTEVERILYMDCDMIALQDISKLWR 126 Query: 145 LGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSIL 204 L ++ A V+D Q + ++ P +YFNSG++ +++KKW D +T+K L + Sbjct: 127 LDFGDSIVAAVEDAGFHQR--LEKMEIPAKSMRYFNSGLMLINVKKWLDENITQKVLDFI 184 Query: 205 MSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITES---TL 261 ++ DQD +N +L L L +N I ++ K ++ E+ Sbjct: 185 EHNPEKLRFHDQDALNAILHDRWLPLHPRWNAQGYIMAKAKKHPTAAGEREYEETRNNPY 244 Query: 262 LIHYTGATKPWHKWAIYPSVKYYK 285 +IH++G KPW K P+ KYY+ Sbjct: 245 IIHFSGHVKPWSKDFEGPTKKYYE 268 >UniRef50_B3XL28 Glycosyl transferase family 8 n=1 Tax=Lactobacillus reuteri 100-23 RepID=B3XL28_LACRE Length = 331 Score = 87.4 bits (215), Expect = 7e-16, Method: Compositional matrix adjust. Identities = 79/312 (25%), Positives = 149/312 (47%), Gaps = 31/312 (9%) Query: 29 NVAYGVDANYLDGVGVSITSIVLNNRHIN-LDFYII-ADVYNDGFFQKIAKLAEQNQLRI 86 N+ Y D + +G S+ S++ NN+ ++F+I+ + + + F +I K+ + N + Sbjct: 6 NIVYATDDTFAPVLGTSLLSLLRNNKEAKKINFFILDSGISKENKF-RIEKICD-NFVNA 63 Query: 87 TLYRINTDKLQCLPCTQV----WSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQL 142 +L I + + V S + Y RLF +L +++R+LYLD D + + L Sbjct: 64 SLKWIKIESISKKIGIDVKNDRGSFSQYSRLFIGDVLDNSVERVLYLDCDTLILSSLKDL 123 Query: 143 LHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALS 202 ++ L G + A +KD + L + +L+ FNSGV+ +DLK W D K+ EKA+S Sbjct: 124 WNIELKGNIIAALKDAFSKYYRKNINLVNDDLM---FNSGVMLIDLKAWRDNKIKEKAIS 180 Query: 203 ILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLIT----- 257 + + + DQ V+N +L T L YN + +I +L + + Y+ + Sbjct: 181 FIRQRHGKVQQGDQGVLNSVLSNKTFALDPRYNLV-SIFYDLDYREIKLYRSPVNFYSEK 239 Query: 258 ------ESTLLIHYTG---ATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFK 308 E+ +++H+T + +PW K + + K + + +PWK+ + IE Sbjct: 240 IIVKAKENPVILHFTSSFYSIRPWFKNSNHQCKKIWLKFYQETPWKNQPLQ-----IEMS 294 Query: 309 
KRYKHLLVQHHY 320 K+ K + + Y Sbjct: 295 KKKKLINILFEY 306 >UniRef50_Q38VK7 Putative bifunctional glycosyl transferase, family 8 n=2 Tax=Lactobacillus sakei subsp. sakei 23K RepID=Q38VK7_LACSS Length = 569 Score = 87.0 bits (214), Expect = 8e-16, Method: Compositional matrix adjust. Identities = 56/213 (26%), Positives = 105/213 (49%), Gaps = 8/213 (3%) Query: 86 ITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQ-LLGLTLDRLLYLDADVVCKGDISQLLH 144 ++ IN +++ P + + Y+R+ A Q LL ++R+LYLD D + + D++ L Sbjct: 68 VSFIAINPRRIKNFPGNNHFDQTAYYRILAPQILLARHIERVLYLDLDTLIRTDLTPLYD 127 Query: 145 LGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQ--YFNSGVVYLDLKKWADAKLTEKALS 202 L G + V ++P + + RL P+ YFN+GV+ +D W +++K L+ Sbjct: 128 SDLEGNIIGAV--IDPGKALTLKRLGVPKSQANNIYFNAGVLIIDTILWETHHISQKILA 185 Query: 203 ILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTL- 261 +L+ QD +NV+L G T L ++N I + + + Y +L ++ + Sbjct: 186 MLVPYPGRRVNDIQDALNVVLAGRTKLLAPKWNVQNAILFKTYEPINNEYSQLFKQAIMA 245 Query: 262 --LIHYTGATKPWHKWAIYPSVKYYKIALENSP 292 +IH+T KPW + +P + Y++ L P Sbjct: 246 PKIIHFTTEKKPWEVFLEHPYMSEYQVYLSQLP 278 Score = 80.9 bits (198), Expect = 7e-14, Method: Compositional matrix adjust. 
Identities = 61/265 (23%), Positives = 124/265 (46%), Gaps = 13/265 (4%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNN---RHINLDFYIIADVYNDGFFQKIAKLAEQNQL 84 +N+ ++N+++ + + SI+ NN RH F++++D + ++ E Sbjct: 285 INIVSAANSNFVEPLAILYASILNNNDDDRH--YAFFVLSDQLTARDQATLRQITESFNA 342 Query: 85 RITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLH 144 +T ++ L + + Y+RL LL ++R+LYLD D +C ++++L Sbjct: 343 ELTFIEVDEIPLTAVIQDGQVLKTAYYRLLIPNLLP-EIERVLYLDCDTLCLENLARLWD 401 Query: 145 LGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSIL 204 + L A V+D A + + +YFN+GV+ ++L W K+TE+ L+ + Sbjct: 402 VELGNIPVAAVEDAGFHNRLAQMAIDYKSI--RYFNAGVLLMNLTIWRQQKITEQILTFI 459 Query: 205 MSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSEL----KDKTHQNYKKLITEST 260 ++ DQD +N +L + L ++N +I + ++ ++ + E Sbjct: 460 KEYPQKLRFHDQDALNAILHDRWIHLHPKWNVQTSILMDFIVAPTERINRQFLSAQKEPG 519 Query: 261 LLIHYTGATKPWHKWAIYPSVKYYK 285 LIH+ G+ KPW K + +P Y+ Sbjct: 520 -LIHFCGSEKPWDKSSTHPYTPQYR 543 >UniRef50_B8ISQ5 Glycosyl transferase family 8 n=1 Tax=Methylobacterium nodulans ORS 2060 RepID=B8ISQ5_METNO Length = 328 Score = 85.9 bits (211), Expect = 2e-15, Method: Compositional matrix adjust. 
Identities = 66/252 (26%), Positives = 110/252 (43%), Gaps = 16/252 (6%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKL-AEQNQLRI 86 + VA +D + V++ S++ LD +I + +IA L A+Q++ Sbjct: 13 IAVALCIDRAFFRHALVTVASLLDAGPRQPLDVHIFYAEADPACMARIAALFADQDRHGC 72 Query: 87 TLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLG 146 +I+ D+ + P + S Y RL L+ ++LYLDAD++ D++ L Sbjct: 73 HFQKISLDRFEGFPVSDAISAGTYARLLLPYLMPRRA-KVLYLDADLIVLDDVAPLWRTE 131 Query: 147 LNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMS 206 L A A V+D A+ D YFN+GV+ ++L W L E+ + + + Sbjct: 132 LGAAPVAAVRDPFCDNRPAIGFSPD----EPYFNAGVLLMNLAVWRREGLAERVAAHIDA 187 Query: 207 KDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITE------ST 260 KY DQD +NV+L+G F+ +N + + D T + E Sbjct: 188 HGASLKYFDQDALNVVLRGRARFVDPRWN----FQPRMADATPADIACARAEFRRTRARP 243 Query: 261 LLIHYTGATKPW 272 +IHYT KPW Sbjct: 244 AIIHYTTPHKPW 255 >UniRef50_C5ZV11 Glycosyl transferase n=2 Tax=Helicobacter canadensis MIT 98-5491 RepID=C5ZV11_9HELI Length = 397 Score = 84.7 bits (208), Expect = 5e-15, Method: Compositional matrix adjust. 
Identities = 77/304 (25%), Positives = 148/304 (48%), Gaps = 27/304 (8%) Query: 29 NVAYGVDANYLDGVGVSITSIVLNNRHIN---LDFYIIADVYND----GFFQKIAKLAEQ 81 NV ++ NY+ V ITSI+ N + +F+++ D + I++L++ Sbjct: 3 NVVLNLNENYVPYAAVLITSIIQNTQSSGGGGYNFHLLMDSISQENTKNLENLISELSKI 62 Query: 82 NQLRITLYRINTDKLQ--CLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDI 139 +T+Y ++ + +P T + Y+RL L L++ R +YLD D++ GD+ Sbjct: 63 YPCTLTIYILDDQLFREYSMP-TLNGNYLAYYRLKIGSALPLSIKRCVYLDVDMIVLGDL 121 Query: 140 SQLLHLGLNGAVAAVVKDVEPMQ-EKAVSRLSDP-ELLGQYFNSGVVYLDLKKWADAKLT 197 +L + L G + VV + + K ++ P + G YFNSG++ +DL W + Sbjct: 122 RELFEVDLQGKICGVVMEHHSQKIYKPKNQAYKPINITGSYFNSGMLLVDLDLWRQENIE 181 Query: 198 EKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTI--KSELKDKTHQN---- 251 ++A I K+ Y + DQD++N++L G T + E+N + + ++ KD+ ++ Sbjct: 182 DRAFEI--GKNYHYSFHDQDILNIVLSGKTHKVGIEWNLMVCVYYRAICKDEKGRDKLPY 239 Query: 252 YKKLITES---TLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWK--DDSPRDAKSIIE 306 Y+K + ++HY TKPW+ IY + Y+ L+ W D +P + +++ Sbjct: 240 YRKDFNSALRNPKILHYFTHTKPWNNAKIY--LDYHNKFLDQYWWDMVDQTPIFKEKLLQ 297 Query: 307 FKKR 310 K + Sbjct: 298 LKPQ 301 >UniRef50_C7HS13 Family 8 glycosyl transferase n=2 Tax=Anaerococcus RepID=C7HS13_9FIRM Length = 276 Score = 83.6 bits (205), Expect = 9e-15, Method: Compositional matrix adjust. 
Identities = 72/264 (27%), Positives = 123/264 (46%), Gaps = 30/264 (11%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLA-EQNQLRI 86 +N+ D NYL+ + + S+ +N N + Y+I D ++I K + + R Sbjct: 1 MNILVSCDENYLNPLKTMLYSLFESN-DTNFEIYLIHKDIRDEKIKEIEKFVIKASSKRA 59 Query: 87 TLYRINTDKL-QCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHL 145 L I L T ++ MY+RL A++ L LDR+LYLD DV+ +L ++ Sbjct: 60 KLNAIKVKNLFSNAKITFYYTEEMYYRLLAYKYLPENLDRILYLDPDVLVLNSCEKLYNM 119 Query: 146 GL-NGAVAAVVKDVEPMQEKAVSRL---SDPELLGQYFNSGVVYLDLKKWADAKLTE--- 198 L + AA + +Q V+RL S + + YFNSG++ ++LK D++ E Sbjct: 120 DLGDNYFAAATHTIPTVQSANVARLSISSGHKDIENYFNSGILMINLKLSRDSQTYEKEV 179 Query: 199 -------KALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPR---EYNTIYTIKSELKDKT 248 K+L ++M PDQD++NV+ + + + Y+ + +LKDK Sbjct: 180 LNYVKNTKSLGLIM--------PDQDLLNVVFRNKIIKIDEIKYNYDARRYLTYKLKDKK 231 Query: 249 HQNYKKLITESTLLIHYTGATKPW 272 + I +T +H+ G KPW Sbjct: 232 YN--LSYIISNTCFLHFCGKRKPW 253 >UniRef50_D2RIJ4 Glycosyl transferase family 8 n=2 Tax=Acidaminococcus RepID=D2RIJ4_ACIFE Length = 309 Score = 82.4 bits (202), Expect = 2e-14, Method: Compositional matrix adjust. 
Identities = 66/254 (25%), Positives = 115/254 (45%), Gaps = 12/254 (4%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNR-HINLDFYIIADVYNDGFFQKIAKLAEQNQLRI 86 +++ D NY V+ SI+ N+R + FY D ++ IA Q I Sbjct: 4 ISIVLASDDNYAQHGAVACASILANHRGERPIHFYYFDDGISEEKQAGIAATVTGLQGSI 63 Query: 87 TLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLG 146 T ++Q V +RA Y RL +L+ + R++YLD D+V DI +L + Sbjct: 64 TFIPTAGKEIQAHTSGHV-NRAAYLRLLIPELVPQAVHRVIYLDTDLVVLDDIQELWEMD 122 Query: 147 LNGAVAAVVKDVEPMQEKAVSRLSDPELLGQ----YFNSGVVYLDLKKWADAKLTEKALS 202 L G V D+ + + R + L Q YFNSGV+ ++L+ W + + ++ + Sbjct: 123 LQGKPVGAVPDLGILASSRMRRQKEETLGIQEGKLYFNSGVMVMELEAWREKQYGDQVIR 182 Query: 203 ILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTI---YTIKSE-LKDKTHQNYKKLITE 258 + ++ +++ DQD +N + + LP +N I +T+ + LK +N E Sbjct: 183 CV--EEGNFRHHDQDGLNKVFQDNWQPLPLRWNVIPPVFTLPVKVLKKSRWRNLALEALE 240 Query: 259 STLLIHYTGATKPW 272 + H+ G KPW Sbjct: 241 RPAVFHWAGRYKPW 254 >UniRef50_C2HBB9 Family 8 glycosyltransferase n=35 Tax=Lactobacillales RepID=C2HBB9_ENTFC Length = 305 Score = 82.4 bits (202), Expect = 2e-14, Method: Compositional matrix adjust. 
Identities = 73/270 (27%), Positives = 124/270 (45%), Gaps = 12/270 (4%) Query: 30 VAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKIAKLAEQ--NQLRI 86 V D NY + V I + + N N+ + FY+I D ++ Q + + + + I Sbjct: 30 VVTASDENYAPYLSVMIATALENCNKARRIKFYVIDDGLSEYSKQGLEETVNKYSSNASI 89 Query: 87 TLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLT-LDRLLYLDADVVCKGDISQLLHL 145 + D + + + Y R+ LL ++LYLD+DV+ DI +L Sbjct: 90 QFLTVEKDIYEDFLVSDHITTTAYLRISLPNLLAKEDYKKVLYLDSDVLVLDDIVKLYDE 149 Query: 146 GLNGAVAAVVKDVEPMQEKAVSRLS-DPELLGQYFNSGVVYLDLKKWADAKLTEKALSIL 204 LNG + ++P Q KA+ RL D + L YFNSGV+ +D+ +W ++TEK + L Sbjct: 150 PLNGKTIGAI--IDPGQVKALERLGIDSDDL--YFNSGVMVIDIDQWNKKEITEKTIHYL 205 Query: 205 MSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITEST---L 261 + Y DQD +N +L L ++N ++ E ++ Y++L E Sbjct: 206 SENGDRIIYHDQDALNAVLYEDWEQLHPKWNMQTSLIFERHPAPNEKYERLYKEGNEKPS 265 Query: 262 LIHYTGATKPWHKWAIYPSVKYYKIALENS 291 ++H+TG KPW+ +P Y L +S Sbjct: 266 IVHFTGHDKPWNTLKDHPYTNLYLKKLAHS 295 >UniRef50_B6G807 Putative uncharacterized protein n=2 Tax=Collinsella RepID=B6G807_9ACTN Length = 276 Score = 82.4 bits (202), Expect = 2e-14, Method: Compositional matrix adjust. 
Identities = 60/259 (23%), Positives = 114/259 (44%), Gaps = 24/259 (9%) Query: 27 CLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRI 86 ++V D YL + + S+ +N+ + +++ + +++ + L I Sbjct: 5 AMDVIVTCDEGYLGPLRTMLYSLRASNQGAQVRIWLLHKGISLPALEELERFCSVLGLAI 64 Query: 87 TLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLG 146 ++ L C++ + + MY+RL A ++ ++R LYLD D++ + L + Sbjct: 65 EPVTVDRVLLDGAKCSERYPQEMYYRLLAPSIIKAPIERALYLDPDILVINPLDDLFEID 124 Query: 147 LNG---AVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSI 203 L+G A A+ + V P +RLS YFN+GV+ D+ + + ++ S Sbjct: 125 LHGNAFAAASHLDAVHPATALNKARLSTS---SDYFNTGVILFDIARARKSICVDELFSY 181 Query: 204 LMSKDNVYKYPDQDVMNVLLKGMTLFLPREY----------NTIYTIKSELKDKTHQNYK 253 + + + V +PDQD+ N L +TL +P E N I T + D Sbjct: 182 VKAHEQVMLFPDQDLFNSLFGAVTLRIPDEIWNYDARKYPDNIIRTWGTATLD------- 234 Query: 254 KLITESTLLIHYTGATKPW 272 + E T ++H+ G KPW Sbjct: 235 -WVMEHTAILHFCGKNKPW 252 >UniRef50_Q9L6B2 Putative glycosyl transferase n=3 Tax=Pasteurella RepID=Q9L6B2_PASMU Length = 302 Score = 81.6 bits (200), Expect = 3e-14, Method: Compositional matrix adjust. 
Identities = 72/282 (25%), Positives = 124/282 (43%), Gaps = 14/282 (4%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 +N+ + D Y + V+I SI+ N+ + FYI D + I + + Sbjct: 1 MNILFVSDDVYAKHLVVAIKSII-NHNEKGISFYIFDLGIKDENKRNINDIVSSYGSEVN 59 Query: 88 LYRINTDKLQCLPCTQVW-SRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLG 146 +N + + P + S A Y RL A + L L++++YLD DV+ + L ++ Sbjct: 60 FIAVNEKEFESFPVQISYISLATYARLKAAEYLPDNLNKIIYLDVDVLVFNSLEMLWNVD 119 Query: 147 LNGAVAAVVKDV----EPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALS 202 +N + A D E + K +SD E YFN+GV+ +L +W + +AL Sbjct: 120 VNNFLTAACYDSFIENEKSEHKKSISMSDKEY---YFNAGVMLFNLDEWRKMDVFSRALD 176 Query: 203 ILMSKDNVYKYPDQDVMNVLLKGMTLFLPREY----NTIYTIKSELKDK-THQNYKKLIT 257 +L N Y DQD++N+L + +L + N + IK K K ++ + + T Sbjct: 177 LLAMYPNQMIYQDQDILNILFRNKVCYLDCRFNFMPNQLERIKQYHKGKLSNLHSLEKTT 236 Query: 258 ESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPR 299 ++ HY G K WH + +V +Y+ L D R Sbjct: 237 MPVVISHYCGPEKAWHADCKHFNVYFYQKILAEITRGTDKER 278 >UniRef50_C6LDU2 Glycosyl transferase, family 8 n=3 Tax=Firmicutes RepID=C6LDU2_9FIRM Length = 270 Score = 81.6 bits (200), Expect = 3e-14, Method: Compositional matrix adjust. 
Identities = 65/259 (25%), Positives = 106/259 (40%), Gaps = 17/259 (6%) Query: 46 ITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLYRINTDKLQCLPCTQVW 105 I SIV D YI+ + A E R+ + P ++ + Sbjct: 8 IRSIVRFPSEDGYDIYILHSDLQEQDQSDAAAQVEDGDTRLHFRFVEPSVFASFPESERY 67 Query: 106 SRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKA 165 R +Y+R+FA LL +DR+LYLD D + + +L ++ G V K Sbjct: 68 PRLIYYRIFAASLLPPEMDRILYLDGDTLVINPLDELYNMDFEGNYFLACTHVRKFLTKV 127 Query: 166 VSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKG 225 E + Y NSGV+ ++LK+ + + E+ S + + PDQD++ L Sbjct: 128 NQYRLGMEEVSTYINSGVLLMNLKELREKQDFEEIASFVEKRGRYLTLPDQDIITALYGN 187 Query: 226 MTLFLPREYNTIYTIKSELKDK------THQNYKKL----ITESTLLIHYTGATKPWHKW 275 T L T+K L D+ T +K++ + E+ ++IHY G KPW K Sbjct: 188 KTGILD-------TMKYNLSDRMISVYNTEPGHKRINLEWVRENAVVIHYYGKQKPWKKP 240 Query: 276 AIYPSVKYYKIALENSPWK 294 + +Y+ E P K Sbjct: 241 YLGMLDVFYRELKEEEPGK 259 >UniRef50_D1JY84 Putative uncharacterized protein n=1 Tax=Bacteroides sp. 3_1_33FAA RepID=D1JY84_9BACE Length = 312 Score = 81.3 bits (199), Expect = 5e-14, Method: Compositional matrix adjust. 
Identities = 67/251 (26%), Positives = 123/251 (49%), Gaps = 9/251 (3%) Query: 25 SECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQL 84 S +++A+ V+ +Y + + VSI ++ NN L +I++D +D ++ KL Sbjct: 3 SSPMHIAFCVNDHYAEYILVSIKGLLENNSD-PLVIHILSDYISDKNTNRLKKLVGLYPN 61 Query: 85 RITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLH 144 I L + D L+ W+ ++R+ ++L ++ R+LYLDAD + +I +L Sbjct: 62 AI-LDIVIVDDLKLKDLKDTWTIYTWYRVLLPEILDASVHRVLYLDADTLVSENIEELFS 120 Query: 145 LGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSIL 204 L + G A D + ++K+ + E +Y +GV+ ++L W + + K + Sbjct: 121 LDMTGKAIAGTVDFQS-KDKSTYQRCGYEAEKEYVCAGVMMMNLDYWREHDIANKIIDWG 179 Query: 205 MSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLI---TESTL 261 ++ +YPDQD +N + + M L LP +Y+ I + D QNY + + ES Sbjct: 180 RDYNDRIQYPDQDAINYICRDMKLLLPLKYDIIDGFFQD--DYYFQNYPQELRECIESPA 237 Query: 262 LIHYTGATKPW 272 +IHY G PW Sbjct: 238 IIHYAGQA-PW 247 >UniRef50_C0AA16 Lipopolysaccharide biosynthesis protein LPS:glycosyltransferase-like protein n=1 Tax=Opitutaceae bacterium TAV2 RepID=C0AA16_9BACT Length = 726 Score = 80.9 bits (198), Expect = 5e-14, Method: Compositional matrix adjust. 
Identities = 78/270 (28%), Positives = 128/270 (47%), Gaps = 33/270 (12%) Query: 24 TSECLNVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKI--AKLAE 80 C+N+A+ D ++ + V+I SIV + N D I+ + + + I K A+ Sbjct: 401 AGNCINIAFNCDDKFVPYLCVAIKSIVATASTENNYDILILTEGLSPANLKWIDGIKHAK 460 Query: 81 QNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQL-LGLTLDR---LLYLDADVVCK 136 LR+ R D LQ + + R+M R+ +L LG L++ +LYLD D++ + Sbjct: 461 NVSLRVVNVR---DYLQDKDISSFFMRSMVSRIAYVRLYLGELLEKYAKVLYLDCDLIAQ 517 Query: 137 GDISQLLHLGLNGAVAAVVKDV----EPMQEKAVSRLSDPEL--------LGQYFNSGVV 184 D+++L ++ L+G V A V D+ E ++ A R D L + QYFNSGV+ Sbjct: 518 SDVAELFNMNLDGNVCAAVPDLAISTETIKNVAAYRDIDVYLRDVLGVTDISQYFNSGVM 577 Query: 185 YLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSEL 244 DL+K L + ++ + N + DQ+V+N L G L L E+N ++ Sbjct: 578 VFDLEKIRTDNLQQTFIA--AAAKNTKFFMDQNVLNSALYGKVLLLGFEWNKRVSLAMAN 635 Query: 245 KDKTHQNYKKLITESTLLIHYTGATKPWHK 274 +D T TES +L H+ KP K Sbjct: 636 RDTT--------TESKIL-HFAAEPKPLQK 656 >UniRef50_B9QZ95 Glycosyl transferase family 8 n=1 Tax=Labrenzia alexandrii DFL-11 RepID=B9QZ95_9RHOB Length = 309 Score = 80.9 bits (198), Expect = 7e-14, Method: Compositional matrix adjust. 
Identities = 65/258 (25%), Positives = 113/258 (43%), Gaps = 35/258 (13%) Query: 29 NVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQL--RI 86 N+A D L G+ V+I S L + I +++AD ++ K++ + + + + Sbjct: 4 NIAACADTKVLPGLAVTIRS-SLEHSSIPCRIHVLADRLSEQDKHKLSNSWKPHPMCQDV 62 Query: 87 TLYRINTDKLQCLPCTQ-VWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHL 145 Y I+ + T + S++ Y R F LG + +YLD D++ D+++L Sbjct: 63 VFYDIDYQNISKFRSTMYLKSKSAYSRYFISDFLGEE-SKCIYLDCDLLVLRDLAELNTA 121 Query: 146 GLNGAVAAVVKDVEPMQEKAVSRLSDPEL-LGQ---------YFNSGVVYLDLKKWADAK 195 ++G V+D+ R +DP L +G+ YFNSGV+ +DL +W Sbjct: 122 KMHGKTIGSVRDIS-------VRTADPHLFIGERLQLTNPYDYFNSGVLIIDLDRWRKLD 174 Query: 196 LTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKL 255 + + + + + + DQD +NV G T FL +NT Y++ Sbjct: 175 ARNHLIDLTLERADTFHSQDQDALNVFFDGDTEFLDPVWNT-------------SQYERP 221 Query: 256 ITESTLLIHYTGATKPWH 273 T +IH G KPWH Sbjct: 222 DTAENRIIHLIGTVKPWH 239 >UniRef50_Q9L7A2 Putative glycosyl transferase n=1 Tax=Haemophilus ducreyi RepID=Q9L7A2_HAEDU Length = 269 Score = 80.1 bits (196), Expect = 1e-13, Method: Compositional matrix adjust. 
Identities = 64/253 (25%), Positives = 116/253 (45%), Gaps = 15/253 (5%) Query: 22 INTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQ 81 +N E +N+ + +Y + + +I SI L+N+HI FY++ Y +F + + Sbjct: 2 LNPLEKMNIVLAANQSYSEYILTTIKSIYLHNKHIR--FYLLNRDYPTEWFDILNNKLRK 59 Query: 82 NQLRITLYRINTDKLQCLPC-TQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDIS 140 I ++ D ++ + + S +FR F + D+++YLDAD+V G ++ Sbjct: 60 LNSEIIDIKVTNDTIKNFKTYSHISSDTTFFRYFISDFI--EQDKVIYLDADIVVNGSLT 117 Query: 141 QLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKA 200 +L ++ A VKD+ + EK FN+G++ ++ KKW + +T+ Sbjct: 118 ELYQTDISNYFLAAVKDI--ISEKIYVN-------NHIFNAGMLLINNKKWREHNITQFC 168 Query: 201 LSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITEST 260 LS+ N DQ ++N++ K L L R YN + Y + + E+ Sbjct: 169 LSLSEKYINSLPDADQSILNLIFKDKWLKLNRGYNYLIGTDYLFFKYGKTRYLEDLGETI 228 Query: 261 -LLIHYTGATKPW 272 L+IHY KPW Sbjct: 229 PLIIHYNTEAKPW 241 >UniRef50_P43974 Putative glycosyltransferase HI0258 n=32 Tax=Haemophilus influenzae RepID=Y258_HAEIN Length = 330 Score = 80.1 bits (196), Expect = 1e-13, Method: Compositional matrix adjust. 
Identities = 65/252 (25%), Positives = 118/252 (46%), Gaps = 9/252 (3%) Query: 25 SECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQL 84 S+ +N+ + D Y + VSI SI+ N ++FYI+ N I LA Sbjct: 36 SQTMNIIFSSDHYYAPYLAVSIFSIIKNTPK-KINFYILDMKINQENKTIINNLASAYSC 94 Query: 85 RITLYRINTDKLQCLPCTQVW-SRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLL 143 ++ + Q P T + S A Y RL + + +++ +Y+D D + + +L Sbjct: 95 KVFFLPVCESDFQNFPKTIDYISLATYARLNLTKYIK-NIEKAIYIDVDTLTNSSLQELW 153 Query: 144 HLGLNGAVAAVVKDVE-PMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALS 202 ++ + A +D ++ +A + E YFN+G++ ++L KW + + +K+++ Sbjct: 154 NIDITNYYLAACRDTFIDVKNEAYKKTIGLEGYS-YFNAGILLINLNKWKEENIFQKSIN 212 Query: 203 ILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLL 262 + +NV KY DQD++N + KG F+ +N T + +K K K I ++ Sbjct: 213 WMNKYNNVMKYQDQDILNGICKGKVKFINNRFNFTPTDRDLIKKKNLLCVKMPI----VI 268 Query: 263 IHYTGATKPWHK 274 HY G K WHK Sbjct: 269 SHYCGPNKFWHK 280 >UniRef50_C0EQT1 Putative uncharacterized protein n=1 Tax=Neisseria flavescens NRL30031/H210 RepID=C0EQT1_NEIFL Length = 212 Score = 79.7 bits (195), Expect = 1e-13, Method: Compositional matrix adjust. Identities = 60/183 (32%), Positives = 86/183 (46%), Gaps = 25/183 (13%) Query: 118 LLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVS-RLSDPELLG 176 +LG D +LYLD DV+C GDIS+L + L + + R +DP Sbjct: 6 ILGDISDTVLYLDTDVLCLGDISELFTVILAAVPETTLYRAYINKLNVFGFRSTDP---- 61 Query: 177 QYFNSGVVYLDLKKWADAK----LTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPR 232 YFNSGV+ + K W ++ L EK + +SK + PDQD++N+ KG +LP Sbjct: 62 -YFNSGVLLFNNKFWNESSAYTVLNEKIRQVELSKF-ILACPDQDLLNLSCKGKVGWLPE 119 Query: 233 EYNTIYTIKSELKDKTHQNYKKLIT--ESTLLIHYTGATKPWHKWAIYPSVKYYKIALEN 290 YN I+ H +L T ++ L+H+ G TKPWH +P Y Sbjct: 120 SYNRIH---------WHHQGSELNTNPKNIRLVHFIGGTKPWHHLGFHPV---YDSFYRK 167 Query: 291 SPW 293 SPW Sbjct: 168 SPW 170 >UniRef50_Q4JZJ9 Putative glycosyl transferase n=8 Tax=Streptococcus pneumoniae RepID=Q4JZJ9_STRPN Length = 344 Score = 79.3 bits (194), Expect = 2e-13, Method: Compositional matrix adjust. 
Identities = 71/275 (25%), Positives = 129/275 (46%), Gaps = 25/275 (9%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 +N+ Y D N++D + SI S+ N ++L+ +IIAD +D +KI +L++Q R Sbjct: 31 MNIVYATDNNFVDVLSASIKSLYTTNSDLDLNLWIIADKVSDRNKEKINRLSKQFAQR-- 88 Query: 88 LYRINTDKLQCLPCTQVWSR---AMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLH 144 IN + +P R + + RLF +L ++ ++LYLD+D++ + + Sbjct: 89 --EINWIENVEIPFKLHLDRGSISSFSRLFLGSVLPSSMSKVLYLDSDIIVMDSLRSIFD 146 Query: 145 LGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSIL 204 + G + V D + K V L P + FN+GV+ ++L+ W + + E+ L ++ Sbjct: 147 IDFKGKILYGVNDTFNKEYKQV--LGIP-IDKPMFNAGVMLINLELWRNNNVEERFLQVI 203 Query: 205 MSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLIT------- 257 + D V+N +L LP EYN + TI +L + +KK I Sbjct: 204 QKFNGTILQGDLGVLNAVLYNSFGVLPPEYNYM-TIFEDLTYEEMIVFKKPINYYSKEEI 262 Query: 258 ----ESTLLIHYTG---ATKPWHKWAIYPSVKYYK 285 E +L H+T + +PW + + V+ +K Sbjct: 263 KNARERIVLRHFTTSFLSKRPWQESSEVTHVEIFK 297 >UniRef50_B1LK07 Glycosyl transferase family 8 n=10 Tax=Enterobacteriaceae RepID=B1LK07_ECOSM Length = 630 Score = 79.3 bits (194), Expect = 2e-13, Method: Compositional matrix adjust. 
Identities = 78/305 (25%), Positives = 141/305 (46%), Gaps = 47/305 (15%) Query: 26 ECLNVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKIAKL-AEQNQ 83 E + V D NY G I SIVL+ + N D ++ + + Q++ KL A N Sbjct: 275 ESVPVVISFDNNYALSGGALINSIVLHSDASRNYDIVVLENKVSHLNKQRLIKLVAGHNN 334 Query: 84 LRITLYRINT-DKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQL 142 + + + +N+ ++ + +S + Y RLF QL +++++D+D V K D++ L Sbjct: 335 ISLRFFDVNSFTEMSDVHTRAHFSASTYARLFIPQLF-REYKKVVFIDSDTVVKADLATL 393 Query: 143 LHLGLNGAVAAVVKD------------------VEPMQE--KAVSRLSDPELLGQYFNSG 182 L + + + A VKD + P ++ K +++P+ +YF +G Sbjct: 394 LDVEIGTNLVAAVKDIVMEGFVKFGTMSESDDGIMPAEQYLKKTLGMTNPD---EYFQAG 450 Query: 183 VVYLDLKKWADAKLTEKALSILMS--KDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTI 240 ++ ++++ +TE + LMS K Y + DQD+MN + G FLP E+N + Sbjct: 451 IIVFNVEQM----VTENTFAQLMSALKAKKYWFLDQDIMNKVFFGRVKFLPLEWNVYHGN 506 Query: 241 KS------ELKDKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALEN---S 291 + LK T+ + + + +IHY G KPW+ V +Y LEN + Sbjct: 507 GNTDDFFPNLKFSTYMRFLQ-ARRNPKMIHYAGENKPWNT----EKVDFYDDFLENVLST 561 Query: 292 PWKDD 296 PW+ + Sbjct: 562 PWEKE 566 >UniRef50_Q2K5X3 Galactosyltransferase protein n=13 Tax=Rhizobium RepID=Q2K5X3_RHIEC Length = 333 Score = 79.3 bits (194), Expect = 2e-13, Method: Compositional matrix adjust. 
Identities = 71/274 (25%), Positives = 110/274 (40%), Gaps = 38/274 (13%) Query: 35 DANYLDGVGVSITSIVLNNRHINLDFYI---------IADVYNDGFFQKIAKLAEQNQLR 85 D N L ++ S+ N + +++F + +A+V N G +A +R Sbjct: 43 DVNMLPAACCTLLSVKRNLTNADVEFLLLGIDLKPHEVAEVENFGRLHGMA-------IR 95 Query: 86 ITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHL 145 + Y LQ WS A RL+ + + ++RLLYLDADV+ + +L L Sbjct: 96 VLPYETPDTGLQA---RGRWSAATLARLYMDRDIPDHIERLLYLDADVLAVAPVDELFTL 152 Query: 146 GLNGAVAAVVKD-VEPMQEKAVSRLSDPEL--LGQYFNSGVVYLDLKKWADAKLTEKALS 202 G A V D V EK+ +R + G+YFN+GV+ D L + Sbjct: 153 DFQGKALAAVDDYVMAFPEKSGARQRKIGMGEGGRYFNAGVLLFDWSACRARGLFPRTRE 212 Query: 203 ILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLL 262 I + ++++ DQD +NV G L L +NT Q + + Sbjct: 213 IFKERSHLFENNDQDALNVTFDGDWLVLDPRWNT-------------QTGLLPFVDRPAI 259 Query: 263 IHYTGATKPWH---KWAIYPSVKYYKIALENSPW 293 H+TG KPW W Y L N+PW Sbjct: 260 FHFTGRKKPWQANVPWVHRRMANRYADDLRNTPW 293 >UniRef50_C0QZN2 Glycosyl transferase, family 8 n=3 Tax=Brachyspira RepID=C0QZN2_BRAHW Length = 339 Score = 78.6 bits (192), Expect = 3e-13, Method: Compositional matrix adjust. 
Identities = 78/302 (25%), Positives = 135/302 (44%), Gaps = 40/302 (13%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFF----QKIAKLAEQNQ 83 +N+ D NY +G +I SI+ N+ D II + + G +KI L + Sbjct: 1 MNICLASDNNYAPYMGTAIASILKNSSE---DEKIIFHLIDGGITKENKEKIISLKNIKE 57 Query: 84 LRITLY----RINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDI 139 I Y ++ + C +S AM++RL ++ +D++LYLD+D++ G + Sbjct: 58 CEINFYTPDIKMYDGWFEKTSCKAHFSAAMFYRLSIASIIPSNIDKILYLDSDLIATGSL 117 Query: 140 SQLLHLGLNGAVAAVVKDVEPMQEK-AVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTE 198 +L + + A V+K + K ++ ++D YFNSGV+ ++ K W + + Sbjct: 118 KELFLMDIENHYAIVIKHSTNEKNKWSIDGIND------YFNSGVLLINNKLWIKNNIED 171 Query: 199 KALSILMSKDNVYK--YPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLI 256 + +N YK + DQDV+N +L G + YN +K + N + I Sbjct: 172 QFNKFY---NNNYKTCFGDQDVLNNVLIGKVKYADMRYNV-------YAEKGYYNTENDI 221 Query: 257 TESTLLIHYTGATKPWHKWA-----IYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRY 311 E+ ++IHY KPW + I +YY+ +PW D P A I +K Y Sbjct: 222 -ENPIIIHYLSPEKPWKENCRGTLFIDEFWRYYQY----TPWFRDEPITAFQTILKQKFY 276 Query: 312 KH 313 + Sbjct: 277 DY 278 >UniRef50_A5UC07 Aspartate-semialdehyde dehydrogenase n=7 Tax=Haemophilus influenzae RepID=A5UC07_HAEIE Length = 300 Score = 78.2 bits (191), Expect = 4e-13, Method: Compositional matrix adjust. 
Identities = 65/268 (24%), Positives = 120/268 (44%), Gaps = 28/268 (10%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQ--NQLR 85 +N+ + +D N+ + + S+ ++ +IN+ Y+I D +K+ + N L Sbjct: 1 MNIVFTLDCNFASHLDTVLKSLCYHHNNINI--YVIHDGIPAESLEKLKMHCAKFDNTLY 58 Query: 86 ITLYRINTDKL-QCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLH 144 + IN + + S A FRL+ Q+L ++R++YLD D++ I +L Sbjct: 59 DIQFNINQFSFPTVMSPAHIQSSASLFRLYLHQILPQHIERVIYLDIDLIIHQAIDELWD 118 Query: 145 LGLNGAVAAVVKDV-------EPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLT 197 + L ++ A V D P EK QY N+GV+ ++L KW + + Sbjct: 119 INLEDSLIAGVSDFFSEYLWEHPFYEKQ-----------QYINTGVMLINLNKWRENNIE 167 Query: 198 EKALSILMSKDNVYKYPDQDVMNVLL-KGMTLFLPREYNTIYTIKSELKDKTHQNYKKLI 256 + + + Y DQDV+N + + LP ++N I+ + + +K+ I Sbjct: 168 QYFIEYAAKYGEFFVYGDQDVINFSIPTNLIKLLPVKFN----IQVKFIEYLWMEHKEKI 223 Query: 257 TESTLLIHYTGATKPWHKWAIYPSVKYY 284 + +IHY G+ KPW K S ++Y Sbjct: 224 KFTPHIIHYIGSNKPWLKEHSANSPRFY 251 >UniRef50_B2UQN6 Glycosyl transferase family 8 n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UQN6_AKKM8 Length = 371 Score = 78.2 bits (191), Expect = 4e-13, Method: Compositional matrix adjust. 
Identities = 76/274 (27%), Positives = 120/274 (43%), Gaps = 39/274 (14%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 + V + + +GV+I ++ L H + + + +DG +I + E NQ+ Sbjct: 21 IPVMFSATGGWGLPLGVAIHTLCL---HASSGRFYDIHIVHDGMDARI--IQELNQVAAP 75 Query: 88 LYRINTDKLQCLP----------CTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKG 137 +++ LQ LP +S Y RL A L R++YLDADV+ G Sbjct: 76 FPQVSLSFLQ-LPEEFRHLFQNGNKDRYSPLAYARLMAGSLFP-QYGRIVYLDADVLLAG 133 Query: 138 DISQLLHLGLNGAVAAVVKD-----------VEPMQEKAVSRLSDPELLGQYFNSGVVYL 186 D+++L L GA A D + P E + LS P Y NSGV+ L Sbjct: 134 DVAELYFSDLRGASVAAAGDGLALWSIEKGTMHPHLEYMGNYLSFPL---SYCNSGVLVL 190 Query: 187 DLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKD 246 DL + L + L L S+ + + YPDQD++N+ L G LP E+N + + ++ Sbjct: 191 DLDQMRRRNLEHRLLQQLRSRPDPFPYPDQDILNIALHGDMTTLPPEWNFQFLSWTWDEE 250 Query: 247 KTH----QNYKKLIT----ESTLLIHYTGATKPW 272 KT ++ + T S L+H G KPW Sbjct: 251 KTRLLRGTEFENVPTISCGRSWKLLHMVGPEKPW 284 >UniRef50_C6IB51 Glycosyltransferase n=4 Tax=Bacteroides RepID=C6IB51_9BACE Length = 417 Score = 77.8 bits (190), Expect = 5e-13, Method: Compositional matrix adjust. Identities = 56/166 (33%), Positives = 85/166 (51%), Gaps = 20/166 (12%) Query: 114 FAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDV--EPMQEKAVSRLSD 171 +A L LD+ LYLDAD+V G I L L L G A V D+ + + + L++ Sbjct: 60 YAIADLFPNLDKALYLDADLVINGSIEPLWELDLEGYYCAGVDDIFIRRINYRKILELAE 119 Query: 172 PELLGQYFNSGVVYLDLKKWADAKLTEKAL---SILMSKDNVYKYPDQDVMNVLLKGMTL 228 ++ Y N+GV+ L+LK K+ EK L SI +++D +Y DQD +N + KG Sbjct: 120 KDV---YINAGVLLLNLKDLRKDKIQEKLLQHTSIYINRD---RYQDQDAINCICKGKIK 173 Query: 229 FLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTGATKPWHK 274 +P YN +T L + + ++IHYTG+ KPWH+ Sbjct: 174 LIPNIYN--FTTSETL-------HTPEMLSDIIIIHYTGSIKPWHQ 210 >UniRef50_C1QBZ8 LPS:glycosyltransferase n=1 Tax=Brachyspira murdochii DSM 12563 RepID=C1QBZ8_9SPIR Length = 336 Score = 77.8 bits (190), Expect = 5e-13, Method: Compositional matrix adjust. 
Identities = 68/249 (27%), Positives = 109/249 (43%), Gaps = 26/249 (10%) Query: 29 NVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 N+ D NY + V++ SI+ N N N+ F+II D K+ L + I Sbjct: 5 NICLCSDENYAKYMAVTMASILKNTNDDENIIFHIIESNIKDETKNKLIYLKKIKNCEIK 64 Query: 88 LYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGL 147 YR+ +K + A Y RL +L+ D++LYLD+D++ G + +L + + Sbjct: 65 FYRVEYNK---------YPLATYLRLLIPELIK-DADKVLYLDSDIIVNGSLKELFDIDI 114 Query: 148 NGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSK 207 NG A VKD+ K L + YFN+GVV + K D +++K S Sbjct: 115 NGYYALAVKDLYVDIYKEHKELIEIGNNRIYFNAGVVLFNNKSCIDNNISQKFYSYFTEN 174 Query: 208 DNVYKYPDQDVMNVLLKGMTLFLPREYNTI----YTIKSELKDKTHQNYKKLITESTLLI 263 N K+ DQD++N + R++N + Y KS K + ++I Sbjct: 175 KNKLKFHDQDILNHCFIDKVKIIDRKWNFMPFRDYNTKSHYPTK----------DDAVII 224 Query: 264 HYTGATKPW 272 H+ KPW Sbjct: 225 HFV-EHKPW 232 >UniRef50_Q50FU8 Cj81-079 (Fragment) n=6 Tax=Campylobacter jejuni RepID=Q50FU8_CAMJE Length = 333 Score = 77.8 bits (190), Expect = 5e-13, Method: Compositional matrix adjust. 
Identities = 67/233 (28%), Positives = 108/233 (46%), Gaps = 34/233 (14%) Query: 29 NVAYGVDANYLDGVGVSITSIVLNNRHIN-------LDFYIIADVYNDGFFQKIAKLAEQ 81 N+ D NY+ V V I SI+ N + IN FYI+++ + K+ KL + Sbjct: 6 NIVISCDNNYVKYVAVVIASIIKNTK-INSQLKEYPYKFYILSNDISKNNILKLKKLIQH 64 Query: 82 -----NQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTL-DRLLYLDADVVC 135 + +++I+ K P + A Y+R F++ + ++ LYLDADV+ Sbjct: 65 LSNSYYNCELIIHKIDDSKFHRFPKAWHVNHATYYR---FEIADIVEGNKCLYLDADVLV 121 Query: 136 KGDISQLLHLGLNGAVAAVVKD--------VEPMQEKAVSRLSDPELLGQYFNSGVVYLD 187 GDI +L ++ LN VA VV D + K S + L+ YFN+GV+ +D Sbjct: 122 CGDIRELFYMELNNKVAGVVTDSCSRLWTKLYTKDNKTSSYIEFDPLM--YFNAGVILID 179 Query: 188 LKKWADAKLTEKALSILMSKDNVYKY---PDQDVMNVLLKGMTLFLPREYNTI 237 L +W + K + N+Y + DQ +N+ LK +T LP +N I Sbjct: 180 LNQWKKHDIKNKCIDAF----NIYDHGGLADQSYLNIALKELTYKLPLNWNLI 228 >UniRef50_B9KVD4 Glycosyl transferase, family 8 n=2 Tax=Rhodobacter sphaeroides RepID=B9KVD4_RHOSK Length = 334 Score = 77.8 bits (190), Expect = 5e-13, Method: Compositional matrix adjust. Identities = 55/202 (27%), Positives = 94/202 (46%), Gaps = 7/202 (3%) Query: 78 LAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKG 137 LA + I+++R+ +L+ L + S A Y R A ++L + R+LYLD D++ Sbjct: 51 LAPFAHVGISVHRVPAARLEGLFVDRHLSPAAYLRFLAPEVLPEAVQRVLYLDCDLIVLD 110 Query: 138 DISQLLHLGLNGAVAAVVKDV---EPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADA 194 D++QLL L L G A D+ + Q L P L Y NSGV+ +DL +W Sbjct: 111 DVAQLLRLDLQGRAVAAAPDLGWKDAAQAARFRTLGIP-LDRPYVNSGVLLMDLGRWRRD 169 Query: 195 KLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKK 254 L++K + ++ DQD +N +L L R +N + S + ++ Sbjct: 170 GLSQKLFDYVARHGSLLLRHDQDALNAVLADDIHLLDRRWNLQVLLLSPWAKRALPEDRQ 229 Query: 255 LITES---TLLIHYTGATKPWH 273 + ++H++ A KPW+ Sbjct: 230 ATVAARRDPAILHFSTADKPWN 251 >UniRef50_D0D9G3 Putative general stress protein A n=1 Tax=Citreicella sp. SE45 RepID=D0D9G3_9RHOB Length = 327 Score = 77.4 bits (189), Expect = 6e-13, Method: Compositional matrix adjust. 
Identities = 70/319 (21%), Positives = 135/319 (42%), Gaps = 32/319 (10%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHIN-LDFYIIADVYNDGFFQKIAKLAEQNQLRI 86 +NV Y D +GVSI S + N N ++ ++++ + + IA + + Sbjct: 12 INVVYACDNIQALPLGVSIASALENRAEGNPINIHVLSYRISRSNRKSIASQFDGRDDTL 71 Query: 87 TLYRINTDKLQCL-----PCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQ 141 + I + + L + + A Y RL +++ +DR +YLD D++ D+S Sbjct: 72 CWHEITGENRKLLEDLFTSSNRPYPPAAYARLLISEVIP-NIDRAIYLDTDIIVATDLSP 130 Query: 142 LLHLGLNGAVAAVVKDVEPMQEKAVSRLS---DPELLGQY--------FNSGVVYLDLKK 190 L + +GA ++D+ P + RL PE + +Y F SGV+ D+K+ Sbjct: 131 LWNTPFDGAGLLAIQDL-PTSNDHIKRLRALLSPEDISRYGIEDGDSYFQSGVLVFDMKE 189 Query: 191 WADAKLTEKALSILMSKDNVYKYPDQDVMNVLL-KGMTLFLPR--EYNTIYTIKSELKDK 247 + + +E + D +PD D +N++ L PR + +++ + + Sbjct: 190 FTKTRASELIECLRNYPD--LTFPDNDALNIVFHDSFKLVDPRWNQMASVFKLDAARDTP 247 Query: 248 THQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIE- 306 + + + +IHY+G KPW +P + + AL++S W P I+ Sbjct: 248 YSAEVFQALLQDPYIIHYSGRPKPWEDGCTHPYLDRWVEALKDSAWNSWKPSRLNRAIDR 307 Query: 307 -------FKKRYKHLLVQH 318 KR++ + QH Sbjct: 308 IPRIQRVLAKRFRRFVSQH 326 >UniRef50_C1QEC6 LPS:glycosyltransferase n=1 Tax=Brachyspira murdochii DSM 12563 RepID=C1QEC6_9SPIR Length = 242 Score = 77.0 bits (188), Expect = 1e-12, Method: Compositional matrix adjust. 
Identities = 62/256 (24%), Positives = 126/256 (49%), Gaps = 24/256 (9%) Query: 26 ECLNVAYGVDANYLDGVGVSITSIVLNNRH-INLDFYIIADVYNDGFFQKIAKLAEQNQL 84 E +N+ + + Y + +I SI+ N++ + F++I + +D I +L E Sbjct: 3 ETMNICFTANDKYAPFMSATIVSILKNSKDDESFSFHVITNDISDENKMMIERLKEIKTF 62 Query: 85 RITLYRINTDK----LQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDIS 140 +I Y N DK + + + ++ +++FRL L+ + +D++LYLD D++ +S Sbjct: 63 KIKYYTPNIDKYNKWFEKINYQRHYAPSIFFRLDIPNLI-INIDKVLYLDCDIIVNSSLS 121 Query: 141 QLLHLGLNGAVAAVVKDVEPMQ--EKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTE 198 +L ++ ++ A V+D + +K +++ E +YFNSGV+ L+ K + + L Sbjct: 122 ELFNIDISEYFALAVEDTGDLNFLKKYKTKIG-IEDKHKYFNSGVLLLNNKLYMEKNLNL 180 Query: 199 KALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITE 258 ++ + NV + DQD++N L + F+ ++N D + +N K Sbjct: 181 ESENYFNKYYNVIECVDQDILNYLFRDKIKFIDNKWN----------DFSSKNIDK---- 226 Query: 259 STLLIHYTGATKPWHK 274 + ++HY G K W+K Sbjct: 227 -SAIMHYVGKIKSWNK 241 >UniRef50_Q3D427 Glycosyl transferase, family 8 n=8 Tax=Streptococcus agalactiae RepID=Q3D427_STRAG Length = 413 Score = 76.3 bits (186), Expect = 2e-12, Method: Compositional matrix adjust. 
Identities = 72/270 (26%), Positives = 126/270 (46%), Gaps = 21/270 (7%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLY 89 +A D Y + V I SI +N+ I DFYI+ D + +FQ + + I+ Sbjct: 7 IALAADFGYQEQVKTIIKSICFHNQFI--DFYILNDDFPVEWFQMMEYHLSKMDCTISNT 64 Query: 90 RINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNG 149 +I ++++ + YFR F +++ D++LYLD D++ D++ + L ++ Sbjct: 65 KIFNEEIKHFKFQKPMPYPTYFRYFIPEVIHE--DKVLYLDCDMIITSDLTSIFTLDISK 122 Query: 150 AVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDN 209 A V+D L + + YFNSG++ ++ W + ++++ L Sbjct: 123 YGVAAVRD---------DLLEEYDGKEDYFNSGLLLINNIFWREQGISQRLLDYTRENQG 173 Query: 210 VYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTL--LIHYTG 267 +Y DQDV+N +L L L YN +T L + Q+ ++L L +IHYT Sbjct: 174 ALQYHDQDVLNDVLCDNWLELDETYN-YHTGADMLYNLFQQSERQLNRRKDLPKVIHYT- 231 Query: 268 ATKPWHKWAIYPSVKYYKIALENS--PWKD 295 ATKPW + SV++ I E + W+D Sbjct: 232 ATKPWK--YLETSVRWRDIWWEYNRLEWRD 259 >UniRef50_B6VUC8 Putative uncharacterized protein n=1 Tax=Bacteroides dorei DSM 17855 RepID=B6VUC8_9BACE Length = 315 Score = 75.9 bits (185), Expect = 2e-12, Method: Compositional matrix adjust. 
Identities = 73/284 (25%), Positives = 129/284 (45%), Gaps = 21/284 (7%) Query: 44 VSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLYRINTDKLQCLPCTQ 103 V +TS+ NN+ + + Y+ + +D + + L ++ ++ + +++ KLQ LP Sbjct: 18 VMLTSLFENNKQNDKEVYVFSTSMSDENIKGLELLGQRYGTKVQIIIVDSQKLQFLPIHF 77 Query: 104 VWSR-AMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDV---E 159 + A Y RLFA LL +++LLYLD D++ D+ L + + A D+ E Sbjct: 78 AYHNIACYLRLFAADLLP-GINKLLYLDCDIIVNSDLKALWDIDITDYAFAATHDLTYCE 136 Query: 160 PMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVM 219 P +K + E Y N+GV+ ++ W + + +K L + + DQD + Sbjct: 137 PNFKKNLQL----EENDTYINTGVMLINCDYWRNNNVAQKVLDYAIHNGDKMIAADQDAL 192 Query: 220 NVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITE---STLLIHYTGATKPWHKWA 276 N ++G E+N +Y K + N ++ E + +IH+ KPW + Sbjct: 193 NATMQGSFKLFSEEWN-VYPDYFYEKPNLYTNVYPILDEIRRNPKIIHFL-YVKPWFNYC 250 Query: 277 IYP----SVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHLLV 316 +P KYY IA E P+ R+ +SI R KH L+ Sbjct: 251 NHPLRYLYGKYYAIA-EGKPFI--LKRNKESIKRDIARLKHCLL 291 >UniRef50_C5S494 Putative glycosyl transferase n=2 Tax=Actinobacillus minor RepID=C5S494_9PAST Length = 287 Score = 75.1 bits (183), Expect = 3e-12, Method: Compositional matrix adjust. 
Identities = 69/261 (26%), Positives = 116/261 (44%), Gaps = 21/261 (8%) Query: 23 NTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQN 82 N + +N+A D NY + V I S+ + H N+ FY+I Y D +F + + Sbjct: 3 NKQQTINIALAADRNYAEQVITLIKSVCYH--HKNVRFYLIHQDYPDEWFMALNQHLTNV 60 Query: 83 QLRITLYRINTDKLQCLPCTQVW-SRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQ 141 I + D + L Q ++A ++R + + + DR++YLD+D+V G+I + Sbjct: 61 GAEIIPVTV-LDSFRFLSKLQEHITQATFYR---YIIPEIPEDRVIYLDSDIVVDGNIEE 116 Query: 142 LLHLGLNGAVAAVVKDVE-PMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKA 200 + NG V+D+ E D L YFN GV+ ++ + W + L E Sbjct: 117 MYFSDFNGKYVLAVEDMYISYTEHGYIEFPD---LKPYFNGGVLLINNQLWKENDLAEYL 173 Query: 201 LSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYN-------TIYTIKSELKDKT-HQNY 252 + + NV + DQD++N +LK L YN ++ + D+ Y Sbjct: 174 IQMTKQYPNV-MFGDQDILNFVLKDKWGILSHVYNYQTGIIHAFPRLEENMSDEEIITKY 232 Query: 253 KKLITE-STLLIHYTGATKPW 272 +K E ++IHYT KPW Sbjct: 233 QKQADEVKPIIIHYTTKYKPW 253 >UniRef50_B2ISC2 Glycosyl transferase, family 8 n=2 Tax=Streptococcus pneumoniae RepID=B2ISC2_STRPS Length = 401 Score = 75.1 bits (183), Expect = 4e-12, Method: Compositional matrix adjust. 
Identities = 77/319 (24%), Positives = 130/319 (40%), Gaps = 45/319 (14%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKL-----AEQNQL 84 + G D +Y+D V +I SI N+ + FY+ +FQ + K +E + Sbjct: 5 IVLGADNHYMDKVETTIKSICSKNKEVK--FYVFNSDLPTEWFQLMDKRLSVLGSEIVNV 62 Query: 85 RITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLH 144 ++T IN L P + S A Y R F ++ R LYLD+D++ D++ L Sbjct: 63 KVTESLINQFHL---PTPHL-SSATYLRYFIPTIVFEK--RALYLDSDIIVTADLTSLFE 116 Query: 145 LGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSIL 204 L+G A V D+ E FNSGV+ +D +W + + + L++ Sbjct: 117 FPLDGCPLAAVPDIPNTSEG--------------FNSGVLLIDTDRWREDDIQNQLLNLT 162 Query: 205 MSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIH 264 + K + + Y DQ+++N+L K L YN + + L +IH Sbjct: 163 I-KHHEHVYGDQEILNMLFKDRWKKLSLSYNLQVGYDTYRHSLGDNEWYHLFEGIPNIIH 221 Query: 265 YTGATKPWHK---------WAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHLL 315 YT KPW W Y + + I L+N +++ + K I + + Sbjct: 222 YTTQNKPWSHYRFNRFRDIWWFYYGLNWNDILLDNQILQENFEKLIKPITCHASIFTN-- 279 Query: 316 VQHHYISGIIAGVCYLCRK 334 +G I G+ YL + Sbjct: 280 ------TGDIEGLPYLLEQ 292 >UniRef50_B8G232 Glycosyl transferase family 8 n=2 Tax=Desulfitobacterium hafniense RepID=B8G232_DESHD Length = 280 Score = 74.7 bits (182), Expect = 4e-12, Method: Compositional matrix adjust. 
Identities = 61/254 (24%), Positives = 117/254 (46%), Gaps = 13/254 (5%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 +N+ ++++Y+ + V +TS++ +N Y+ + F +I + + ++ ++ Sbjct: 1 MNILVTLNSSYVKQLMVMLTSLLDSNPGEQFTVYVAHSAMSKEDFARIDQAIDSSRCKVE 60 Query: 88 LYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGL 147 +++ + L P T + + MY+R+FA L L+R+LYLD D+V + +L + Sbjct: 61 GIKLSDEGLSKAPITSRYPKEMYYRIFAVNYLPDHLERILYLDPDLVVINPLKELYTIDF 120 Query: 148 NGAVAAVVKDVEPMQEKAVS-RLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMS 206 G A V+ + +K RL+ E Y NSGV+ ++L + + + Sbjct: 121 QGNFFAAASHVKELLKKLNHVRLNMAE-DSTYVNSGVMMMNLSLLRQEQDVHEVYQYIEE 179 Query: 207 KDNVYKYPDQDVMNVLLKGMTLF-------LPREYNTIYTIKSELKD-KTHQNYKKLITE 258 + PDQDV+N + TL L Y +Y + + D K ++ + Sbjct: 180 YKHRLFLPDQDVLNGVYSDRTLTVDAKIYNLSERYYALYNLNPKYWDAKIDLDW---VRS 236 Query: 259 STLLIHYTGATKPW 272 +T +IHY G KPW Sbjct: 237 NTAIIHYCGRNKPW 250 >UniRef50_B7KM20 Glycosyl transferase family 8 n=1 Tax=Cyanothece sp. PCC 7424 RepID=B7KM20_CYAP7 Length = 347 Score = 74.7 bits (182), Expect = 4e-12, Method: Compositional matrix adjust. 
Identities = 71/286 (24%), Positives = 132/286 (46%), Gaps = 23/286 (8%) Query: 25 SECLNVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKIAKLAEQNQ 83 +E + + G D + G+ V++ S + N + +D YI+ N K+ ++ + Sbjct: 10 NEPITIVSGADDKFALGLAVTLYSALANLDTKRKIDIYIVDGGINSKNRDKLTQILNSDL 69 Query: 84 LRITLYRINTDKLQCLPCTQVWSR---AMYFRLFAFQLLGLTLDRLLYLDADVVCKGDIS 140 + +++ + D L L +++ YFRL +LL ++R++YLD+D+V +G+++ Sbjct: 70 MPVSIKWVKPD-LTVLEGVKLFGSLNVTTYFRLLLPELLPTQVERVIYLDSDLVVEGNLA 128 Query: 141 QLLHLGLNGAVAAVVKD-VEPMQEKAVSRLSDPELLG-----QYFNSGVVYLDLKKWADA 194 L L A V+D V P + L + LG Y N+GV+ +++K+W Sbjct: 129 NLWEQELGNCPAVAVQDYVFPY---VCNGLKTYQQLGLASNTPYCNAGVMLINIKQWRIE 185 Query: 195 KLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNT----IYTIKSELKDKTHQ 250 L K L + ++ DQD +N L+ L ++N +Y K +L K Sbjct: 186 ALNRKILEYIRKFYDLVYLADQDGINALIANRFKLLDLKWNVQIFGVYNGKIDLLCKP-- 243 Query: 251 NYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDD 296 K+LI ++ ++H+T KPWH + + L S W +D Sbjct: 244 --KELIRDA-FILHFTTPIKPWHPYYRQAGGSRFTHYLRKSKWFND 286 >UniRef50_Q3DNS6 Glycosyl transferase, family 8 n=5 Tax=Streptococcus agalactiae RepID=Q3DNS6_STRAG Length = 401 Score = 74.3 bits (181), Expect = 6e-12, Method: Compositional matrix adjust. 
Identities = 64/252 (25%), Positives = 111/252 (44%), Gaps = 27/252 (10%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAE---QNQLRI 86 VA VD+NYLD V+I SI + NR N+ FY+ + + I + E + + Sbjct: 5 VALAVDSNYLDKALVTIKSICVYNR--NITFYLFNQDTPVEWVRNINRKLEPLGSKLINV 62 Query: 87 TLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLG 146 +Y + L W FRLF + + R+LYLD+D++ ++ L L Sbjct: 63 KIYNYDIAHLTTFLTVSTW-----FRLFLADYIPSS--RVLYLDSDIIVNTNLDYLFELD 115 Query: 147 LNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMS 206 G A VKD +E FN+G++ +L+ W + LT+ L Sbjct: 116 FKGYYLAAVKDPHKNEEGG-------------FNAGMLLANLELWREDGLTKTLLKTAEE 162 Query: 207 KDNVYKYPDQDVMNVLLKGMTLFLPREYN-TIYTIKSELKDKTHQNYKKLITESTLLIHY 265 V K DQ ++N++ L L + +N Y + S +++ Y + + +IH+ Sbjct: 163 LHRVVKTGDQSILNIVCHNRWLSLNKTWNFQTYDVVSRYNHRSYL-YLNIENRTPNIIHF 221 Query: 266 TGATKPWHKWAI 277 + KPW++ ++ Sbjct: 222 LTSDKPWNENSV 233 >UniRef50_A0ZYL4 Putative uncharacterized protein n=1 Tax=Archaeal BJ1 virus RepID=A0ZYL4_9CAUD Length = 286 Score = 73.9 bits (180), Expect = 8e-12, Method: Compositional matrix adjust. 
Identities = 80/292 (27%), Positives = 137/292 (46%), Gaps = 26/292 (8%) Query: 27 CLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDG-FFQKIAKLAEQN-QL 84 LNV Y + +S S++ NN+ +++ YI+++ N+ FF+ + L E + L Sbjct: 2 TLNVCYIAGGDSWVPCYISAYSVLENNQDLDIHMYILSEEDNNNPFFEHVEYLYESHPSL 61 Query: 85 RITLYRINTDKLQCLPCT-QVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLL 143 I ++ D+ LP + S +YF++ +LL T +L LDAD +C G +S LL Sbjct: 62 EIEFIEVDMDQFDDLPAPGKHLSPGVYFKIAINRLLP-TDGNVLLLDADTICDGSLSSLL 120 Query: 144 HLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSI 203 L L+G V A P + RL + FN+GV+Y++L++WA + E++ Sbjct: 121 SLDLSGKVLAAA----PSNKAETVRLGLQNNRAK-FNAGVLYVNLQEWAKQDIEERSRQY 175 Query: 204 LMSKDNVYKYPDQDVMNVLLKG---MTLFLPREYNTIYTIKSELKDKTHQNYKKLITEST 260 + +++ + DQD +N L+ M PR YN + E +++ + Sbjct: 176 I--EEHEPELNDQDALNALVNNPDDMEYIHPR-YNATKLLVREF---------EMVDDEP 223 Query: 261 LLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRD--AKSIIEFKKR 310 +IHY G KPW S + +P++D P+D K II + R Sbjct: 224 TIIHYNGPDKPWRFVTERESGDLWWEYASKTPFRDYVPKDKGVKEIIFVRAR 275 >UniRef50_C0Z4I4 Putative uncharacterized protein gspA n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0Z4I4_BREBN Length = 264 Score = 73.6 bits (179), Expect = 9e-12, Method: Compositional matrix adjust. 
Identities = 49/169 (28%), Positives = 90/169 (53%), Gaps = 13/169 (7%) Query: 106 SRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGA-VAAVVKDVEPMQEK 164 ++ Y R+ LL +++++YLD+D+V K DI+ L + ++ +AAV+ + + + Sbjct: 83 TQETYHRISIPDLLDKEVEKVIYLDSDIVIKKDITPLWNTKVDQYYLAAVMDSWQGLNKL 142 Query: 165 AVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLK 224 + L+ P+ YFN+GV+ ++LKKW + +T+K + + + +YP QD MN +L Sbjct: 143 RHADLAIPDDC-DYFNAGVLVMNLKKWREHNITKKIMDYMKKNQGIIRYPSQDPMNAILH 201 Query: 225 GMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTGA-TKPW 272 L L ++N ++ YK + +IHYTG +KPW Sbjct: 202 DNWLQLDTKWNY----------QSKHLYKSNLRIDPAIIHYTGEDSKPW 240 >UniRef50_B1Y723 Glycosyl transferase family 8 n=1 Tax=Leptothrix cholodnii SP-6 RepID=B1Y723_LEPCP Length = 316 Score = 73.6 bits (179), Expect = 1e-11, Method: Compositional matrix adjust. Identities = 57/250 (22%), Positives = 108/250 (43%), Gaps = 20/250 (8%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNR-HINLDFYIIADVYNDGFFQKIAKLAEQNQLRITL 88 + D YL + ++ S+V +N H ++ +++ D + ++ + +I Sbjct: 13 IVLACDEAYLMPLATTLRSVVESNAAHWPIECHVLVDDVSLPGRARVERSLPARAAQIRW 72 Query: 89 YRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLN 148 + ++ S+ + RL LL L+R+LYLD D++ GD+ L+ L+ Sbjct: 73 HAVDLTDFSSFETQAAISKMTFARLLMADLLPAELERVLYLDTDILVLGDLLPLMRTELD 132 Query: 149 GAVAAVVKDVEPMQEKAVSRLSDPELLGQ-----YFNSGVVYLDLKKWADAKLTEKALSI 203 GA+ V+D + K+ S P G YFN+GV+ +DL +W +++ A Sbjct: 133 GAILGAVRDGLDAELKSTS----PAPTGMPDVCDYFNAGVLLIDLARWRAGRVSAAARDH 188 Query: 204 LMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLI-TESTLL 262 L++ + DQD +NV G L +N + + L ++ + Sbjct: 189 LVAHPQT-PFADQDALNVACDGHWKPLAAHWN--------FQGHRSTDIAALAPSQRPGI 239 Query: 263 IHYTGATKPW 272 +H+ A KPW Sbjct: 240 VHFITALKPW 249 >UniRef50_C3PWZ8 Lipopolysaccharide 1,2-glucosyltransferase n=1 Tax=Bacteroides sp. 9_1_42FAA RepID=C3PWZ8_9BACE Length = 315 Score = 73.6 bits (179), Expect = 1e-11, Method: Compositional matrix adjust. 
Identities = 78/311 (25%), Positives = 139/311 (44%), Gaps = 30/311 (9%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 +++A +D+ ++ V+I SI+ NN ++ +I++ ++++AE+ I Sbjct: 1 MHIALTIDSKFVRYCAVTIVSILENNDPKDIMLHIVSGHLPKEDVLTLSQVAEKYGTSIA 60 Query: 88 LYRINTDKLQCLPCT---QVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLH 144 Y I +KLQ Q S +++R +L T+ R++YLD+D + G + +L Sbjct: 61 FYYIPHEKLQNYEVKWQKQRLSMVVFYRCVLASILPSTISRVIYLDSDTLVLGSLKELWD 120 Query: 145 LGLNGAVAAVVKDVEPMQEKAVSRLS-DPELLGQYFNSGVVYLDLKKWADAKLTEKALSI 203 LN A V+D RL P Y N GV+ L+L W + ++ + Sbjct: 121 TNLNQLALAGVQDTVSPNPSYFERLQYAPSY--NYINGGVLLLNLAYWRKHNIEQQCIKY 178 Query: 204 LMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQN-------YKKLI 256 + DQD++N LL + + IK ++D ++N +K Sbjct: 179 YQQYPDRIILNDQDILNALLYDQKVLI--------DIKWNVQDDFYRNNRYTSPAWKPSY 230 Query: 257 TESTL---LIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKH 313 T++ L ++HY+G KPW A++P + +P+ DDS + K I R+ H Sbjct: 231 TDAILHPIILHYSG-RKPWAYHAMHPLRHLFFHYQRLTPY-DDSAKQ-KKISTRIYRFIH 287 Query: 314 LLVQHHYISGI 324 LL YI G+ Sbjct: 288 LLP---YILGL 295 >UniRef50_UPI0001A45357 glycosyl transferase family protein n=1 Tax=Neisseria subflava NJ9703 RepID=UPI0001A45357 Length = 264 Score = 73.2 bits (178), Expect = 1e-11, Method: Compositional matrix adjust. 
Identities = 66/269 (24%), Positives = 110/269 (40%), Gaps = 19/269 (7%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 + + D Y + V + S+ +N +N FY++ + + + + + R+ Sbjct: 4 ITIVLAADTGYAEQVHTLMKSVCTHNTGVN--FYLMHNTFRKEWINYTNQKLAASGSRLN 61 Query: 88 LYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGL 147 +I D Q + + S A +FRL + L +DR LYLD+D+V + L +L + Sbjct: 62 DVKIEMDFSQYRRLSHI-SDAAFFRLM---MQHLPVDRALYLDSDMVVTQSLHDLFNLDM 117 Query: 148 NGAVAAVVKDVEPMQEKAVSRLSDPELL--GQYFNSGVVYLDLKKWADAKLTEKALSILM 205 G A V+D A + + P L YFNSG++ DL +W + E+ L Sbjct: 118 RGYPVAAVQD----SYLARTDWNHPTGLHTTPYFNSGMLLADLGQWRKHNIAEQLLQTAA 173 Query: 206 SKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHY 265 + D Y DQ +N + + L L +N + + L + +IHY Sbjct: 174 TIDKTVPYGDQCFLNTVFQENWLQLEESWNYQTGARRFFQTYDLDEMFPLPDTTPPIIHY 233 Query: 266 TGATKPWHKWAIYPSVKYYKIALENSPWK 294 T KPW Y KI E W+ Sbjct: 234 TTLAKPW-------LCDYGKIPFEEIYWQ 255 >UniRef50_C6IJ37 General stress protein A n=2 Tax=Bacteroides RepID=C6IJ37_9BACE Length = 309 Score = 72.8 bits (177), Expect = 2e-11, Method: Compositional matrix adjust. 
Identities = 58/192 (30%), Positives = 98/192 (51%), Gaps = 18/192 (9%) Query: 95 KLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLH---LGLNGAV 151 KLQ + Q ++ A +RL LL +++Y+D D++ + D+ QL H LG+N Sbjct: 73 KLQHIYIDQKYTEAASYRLLLPDLLP-EYKKVIYIDCDIIVRNDLVQLYHSIDLGMNYLA 131 Query: 152 AAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVY 211 A ++ + + +P +Y NSG + ++L+ + EK + SK + Sbjct: 132 AVFEASMDFQLDHLKTIGCNP---NEYINSGFLIMNLELMRKDNMVEKFIE--ASKVDYL 186 Query: 212 KYPDQDVMNVLLKGMTLFLPREYNTIYTI------KSELKDKTHQNYKKLITESTLLIHY 265 ++PDQDV+N L K L LP YN+I T K L+ T Q++ ++ T +HY Sbjct: 187 EFPDQDVLNQLCKDRILALPPYYNSIRTFYLPQYKKFFLQKYTEQDWLEVHRHGT--VHY 244 Query: 266 TGATKPWHKWAI 277 TGA KPW+++ + Sbjct: 245 TGA-KPWNQFTV 255 >UniRef50_C2KV37 Lipopolysaccharide biosynthesis protein, LPS:glycosyltransferase n=1 Tax=Oribacterium sinus F0268 RepID=C2KV37_9FIRM Length = 324 Score = 72.8 bits (177), Expect = 2e-11, Method: Compositional matrix adjust. Identities = 71/291 (24%), Positives = 129/291 (44%), Gaps = 32/291 (10%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 +++ YGV+ ++ + VS++S++L+ L F+I++ + +K+ + E +I+ Sbjct: 1 MHIVYGVNEAFMPILAVSLSSLLLHAEGEALHFHILSLGIEEESKEKLRQYVETEGQKIS 60 Query: 88 LYRINT------DKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQ 141 Y + +KL L T +S+A RLF L T+ + LYLDAD V I Sbjct: 61 FYDLEEKLSEWKEKLPAL-FTGKFSKATLLRLFIPSTLPETITKALYLDADTVVLQSILS 119 Query: 142 LLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKAL 201 L HL L + + EP K Y+N+GV+ ++L + + EK L Sbjct: 120 LYHLRLGDKLLGMAP--EPSIYKKHKEFLSLAEESPYYNAGVMLMNLSLLREEGMEEKCL 177 Query: 202 SILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIK--------------SELKDK 247 K+ + DQD++N++ KG LP+ +N EL+ K Sbjct: 178 RYYQMKEGQLPFNDQDILNMVCKGRIRSLPQRFNFFSNYAYARYSALCRFSPWYQELESK 237 Query: 248 THQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSP 298 + K +++H+ G +PW + + YY+ A + + ++SP Sbjct: 238 KSYSQAKA---HPVIVHFAGDERPWREG----NHNYYRRAFDY--YAEESP 279 >UniRef50_C6X2V2 Putative glycosyltransferase n=1 
Tax=Flavobacteriaceae bacterium 3519-10 RepID=C6X2V2_FLAB3 Length = 315 Score = 72.0 bits (175), Expect = 3e-11, Method: Compositional matrix adjust. Identities = 70/272 (25%), Positives = 122/272 (44%), Gaps = 31/272 (11%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNN-RHINLDFYIIADVYNDGFFQKIAK--LAEQNQL 84 L + + D +Y V I+SI+ N+ R+ + I+++ +D Q +A+ + ++ + Sbjct: 9 LPIVFTCDDHYFKYAAVVISSIIHNSSRNTKYEINIVSEYISDEN-QSLAQKMVQSKSNI 67 Query: 85 RITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLH 144 I + I + + S + Y+R F F LL DR+LYLD+D++ DIS Sbjct: 68 SIQFHAIKIENPEVFHLNSYMSLSTYYRFFIFDLLK-DYDRVLYLDSDLIVDNDISFFAD 126 Query: 145 LGLNGAVAA------VVKDVEPMQEKAVSRLSDPELL-----GQYFNSGVVYLDLK---- 189 + A V ++ + +R ++L +YFN+GV+ ++K Sbjct: 127 IDFENKPAICCPSIYVQNSLKNNTDHKFTREYFTQILKMSDVDEYFNAGVILFNIKLIRA 186 Query: 190 KWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLK--GMTLFLPREYNTIYTIKSELK-- 245 + D K E +I KD VY+ DQD++N +L+ G + EYN T+K LK Sbjct: 187 QGIDRKFFEAIKNI---KDPVYQ--DQDILNSVLRNNGGAKLISNEYNHTKTMKFSLKRI 241 Query: 246 --DKTHQNYKKLITESTLLIHYTGATKPWHKW 275 + + K + HY G KPW + Sbjct: 242 FLNALKNKFGKKRNNWFTIYHYVGKVKPWQNF 273 >UniRef50_Q4HGS8 General stress protein A, putative n=2 Tax=Campylobacter RepID=Q4HGS8_CAMCO Length = 403 Score = 72.0 bits (175), Expect = 3e-11, Method: Compositional matrix adjust. 
Identities = 84/338 (24%), Positives = 158/338 (46%), Gaps = 44/338 (13%) Query: 29 NVAYGVDANYLDGVGVSITSIVLNNRHIN------LDFYIIADVYNDGFFQKIAKLAEQ- 81 ++ + D NY+ V ITSI+ N N F+I+++ ++ +K+ L ++ Sbjct: 3 HIIFSADENYIKYTSVLITSIIKNTNPKNHFQNRPYSFHILSNFVSEETREKLECLKKEL 62 Query: 82 NQL---RITLYRINTDKLQCLPCTQVW--SRAMYFRLFAFQLLGLTLDRLLYLDADVVCK 136 N++ I+++ ++ D+ + P + S+ Y+RL L +D+ LYLD+D++C Sbjct: 63 NKIYPCEISIHIMSDDRFENFPSSGAAQNSKLPYYRLKFISLFDDNVDKCLYLDSDMLCM 122 Query: 137 GDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDP------ELLGQYFNSGVVYLDLKK 190 DI ++ + L G + VV D P +++ + + + YFNSG + ++ K+ Sbjct: 123 CDIREIFAIDLQGKIIGVVGD--PGSKRSKIKFIENNTKKVLKFDENYFNSGFLLINAKE 180 Query: 191 WADAKLTEKALSILMSKDNVY-KYPDQDVMNVLL---KGMTLFLPREYNTIYTIKSELKD 246 + A + +K ++K +Y K DQD++N ++ K + L +N I + KD Sbjct: 181 YKKANVEKKCEE--LAKKCIYIKAADQDLLNAVISKDKILKLSFAYNFNIITLLYVICKD 238 Query: 247 --KTHQNY-KKLITEST---LLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRD 300 K NY ++ T+S ++HY KPW Y + L+N D Sbjct: 239 EKKNRLNYTREEFTQSAKNPKILHY--GEKPWKFLKSY-------VDLQNRNISDYWWDI 289 Query: 301 AKSIIEFKK---RYKHLLVQHHYISGIIAGVCYLCRKY 335 AK + FK+ R K + + +G+ + LC+KY Sbjct: 290 AKEVPIFKEELLRQKENIKDYLLYAGLGFTLYNLCKKY 327 >UniRef50_C9RWX3 Glycosyl transferase family 8 n=8 Tax=Bacillaceae RepID=C9RWX3_GEOSY Length = 276 Score = 72.0 bits (175), Expect = 3e-11, Method: Compositional matrix adjust. 
Identities = 64/254 (25%), Positives = 106/254 (41%), Gaps = 15/254 (5%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 V DANYL + V + S+ NNR FY++ + Q + + + + Sbjct: 2 FQVLVTTDANYLPPLRVLMHSLFCNNRR-PFTFYLLYSRIAEEEIQALGEFVRRQGHELV 60 Query: 88 LYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGL 147 ++ P + ++ MY+RL A L +DR+LYLD D+V + +L + Sbjct: 61 PIYVDPQLFHDAPVFRHYTVEMYYRLAAHLFLPPDVDRVLYLDPDIVAINPMDELYDMDF 120 Query: 148 NGAVAAVVKDVEPMQEKAVS---RLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSIL 204 G + + + + RL P G YFN+GV+ +++ + + Sbjct: 121 EGNLFIAAEHTHSTKVANLFNKLRLKTPNAKG-YFNTGVMMMNIAMMREHVRLADIYQFI 179 Query: 205 MSKDNVYK--YPDQDVMNVL----LKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITE 258 +DN +K PDQDV+N L +K + + Y L + H I E Sbjct: 180 --RDNRFKLVLPDQDVLNGLYWDKIKPVDCYRYNYDARYYDFLQLLPNPKHD--LAWIEE 235 Query: 259 STLLIHYTGATKPW 272 +T+ IHY G KPW Sbjct: 236 NTVFIHYCGKEKPW 249 >UniRef50_Q3D426 Glycosyl transferase, family 8 n=7 Tax=Streptococcus agalactiae RepID=Q3D426_STRAG Length = 401 Score = 71.6 bits (174), Expect = 4e-11, Method: Compositional matrix adjust. 
Identities = 68/266 (25%), Positives = 116/266 (43%), Gaps = 21/266 (7%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLY 89 + G D Y D V +I SIV +N+H L YII + +F + EQ R+ Sbjct: 5 IVLGADFQYRDQVMTTIKSIVSHNQH--LTIYIINTDFPVEWFNILNHSLEQFDCRVKNI 62 Query: 90 RINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNG 149 I++D + +P S A +FR F + L + +LYLD+DV+ +G + L + L Sbjct: 63 PISSDVFEGIPTLSHISVAGFFRWFI--PIHLEEEIVLYLDSDVIVRGSLDPLFDINLEE 120 Query: 150 AVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDN 209 + V D S L + FNSGV+ ++ W ++ + I K + Sbjct: 121 NLLGAVAD-------HFSTLYYGDTAPVSFNSGVMLINNSLWKKEEIYNSLMRI-ADKGS 172 Query: 210 VYKYPDQDVMNVLLKGMTLFLPREYNTI----YTIKSELKDKTHQNYKKLITESTLLIHY 265 DQ+ +N+L + + + ++YN I + + + Y +++HY Sbjct: 173 AVGVGDQEYLNILTQNRWIDIGKQYNVQIGQDVNINAYGRPDLYHFYDDC---EPVIVHY 229 Query: 266 TGATKPWHKWAI--YPSVKYYKIALE 289 KPW+K++ Y S +Y LE Sbjct: 230 NSQDKPWNKYSQSRYRSEWWYYFGLE 255 >UniRef50_P25148 General stress protein A n=3 Tax=Bacillus subtilis group RepID=GSPA_BACSU Length = 286 Score = 70.9 bits (172), Expect = 7e-11, Method: Compositional matrix adjust. 
Identities = 61/266 (22%), Positives = 120/266 (45%), Gaps = 20/266 (7%) Query: 26 ECLNVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKIAKLAEQNQL 84 E +++ D NY +G S++ N ++ + Y+I + G K E+ L Sbjct: 5 EIMHIVSCADDNYARHLGGMFVSLLTNMDQEREVKLYVI----DGGIKPDNKKRLEETTL 60 Query: 85 R----ITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLL-GLTLDRLLYLDADVVCKGDI 139 + I ++T+ + + ++A Y+R+ L+ ++ R++Y+D D + DI Sbjct: 61 KFGVPIEFLEVDTNMYEHAVESSHITKAAYYRISIPDLIKDESIKRMIYIDCDALVLEDI 120 Query: 140 SQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEK 199 S+L L + A V+D + ++D G+YFNSG++ +D + W +TEK Sbjct: 121 SKLWDLDIAPYTVAAVEDAGQHERLKEMNVTD---TGKYFNSGIMIIDFESWRKQNITEK 177 Query: 200 ALSIL--MSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKT----HQNYK 253 ++ + ++ DQD +N +L L +N I +LK + + Y Sbjct: 178 VINFINEHPDEDFLVLHDQDALNAILYDQWYELHPRWNAQTYIMLKLKTPSTLLGRKQYN 237 Query: 254 KLITESTLLIHYTGATKPWHKWAIYP 279 + E+ ++H+ G KPW+ +P Sbjct: 238 E-TRENPAIVHFCGGEKPWNSNTKHP 262 >UniRef50_C1IBL0 Glycosyl transferase n=1 Tax=Clostridium sp. 7_2_43FAA RepID=C1IBL0_9CLOT Length = 273 Score = 70.9 bits (172), Expect = 7e-11, Method: Compositional matrix adjust. 
Identities = 61/267 (22%), Positives = 111/267 (41%), Gaps = 34/267 (12%) Query: 24 TSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQ 83 +S +++ D NY+ + S+VLNN +++ Q++ + + Sbjct: 2 SSNRIDLLVTFDKNYIPPFQTMLKSLVLNNPRETFHIWLLHSEIPLEMLQEVEEYCAKQG 61 Query: 84 LRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLL 143 +T + + P ++ + + MY+RL A +L ++ ++LYLD D++ I L Sbjct: 62 AAMTSINVERSVFKNAPVSKRYPQEMYYRLLAPLILPKSIKKILYLDPDILIINSIRPLW 121 Query: 144 HLGLNG---------AVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADA 194 L V V+ D+ ++ + Y+NSGV+ +DL K Sbjct: 122 ETELGNYIFAAASHVGVTGVINDINRVRLRVDH---------DYYNSGVMLMDLTKARSI 172 Query: 195 KLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFL--------PREYNTIYTIKSELKD 246 E+ + PDQD+ N L TL L R+Y+ Y ++S Sbjct: 173 VNVEEIFQCVREHKEELLLPDQDIFNYLYGKQTLPLDDAIWNYDARKYSN-YLLRSG--- 228 Query: 247 KTHQNYK-KLITESTLLIHYTGATKPW 272 NY IT +T+++H+ G +KPW Sbjct: 229 ---GNYDMDWITRNTVVLHFCGKSKPW 252 >UniRef50_B2UQ54 Glycosyl transferase family 8 n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UQ54_AKKM8 Length = 328 Score = 70.9 bits (172), Expect = 7e-11, Method: Compositional matrix adjust. 
Identities = 57/240 (23%), Positives = 107/240 (44%), Gaps = 11/240 (4%) Query: 61 YIIADVYNDGFFQKIAKLAEQNQLRITLYRINTD-KLQCLPCTQVWSRAMYFRLFAFQLL 119 Y+++D + + + +LA R+ ++ + P T+ W + R+F +LL Sbjct: 44 YVLSDGIDGENWASVERLAAPFDCRLEFIDVSGILEKHDFPHTEQWPVPAWGRVFIPELL 103 Query: 120 GLTLDRLLYLDADVVCKGDISQLLHLGLNG-AVAAVVKDVEPMQEKAVSRLSDPELLGQY 178 +LYLD DV+ D+++L ++G A+ V ++ RL P Y Sbjct: 104 KEERGNILYLDIDVLVCRDLTELFRTNMDGKAIGVVFENFSRPGSHFNERLEMPLTCTGY 163 Query: 179 FNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIY 238 FNSGV+ +++ + + L L ++ + PDQD +N L +T+ L +N Sbjct: 164 FNSGVLLMNVDVFREKNLVRAVLDYAVTHRDRLTCPDQDALNGALCELTVPLHPRWNWHD 223 Query: 239 TIKSE-LKDKTHQNYKKLIT-----ESTL---LIHYTGATKPWHKWAIYPSVKYYKIALE 289 + LK+ + + + +T E+ L ++HY G KPW Y +Y ++ E Sbjct: 224 GLTRRILKNDPREQFWRGVTPRQAVEAALEPGILHYQGVHKPWRYNWRYEGERYERVMRE 283 >UniRef50_UPI00016B2258 glycosyl transferase, family 8 n=1 Tax=candidate division TM7 single-cell isolate TM7c RepID=UPI00016B2258 Length = 327 Score = 70.9 bits (172), Expect = 7e-11, Method: Compositional matrix adjust. 
Identities = 79/297 (26%), Positives = 138/297 (46%), Gaps = 29/297 (9%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRH---INLDFYIIADVYNDGFFQKIAKLAEQNQL 84 LNV Y D NY +SI S++ NN+H IN+ FY+ + D + + + Sbjct: 6 LNVIYQSDDNYAVVSAISIVSLMENNKHLKQINI-FYLGHQLKKDSINKFNKMVGNYHNA 64 Query: 85 RITLYRINT--DKLQCLPCTQVWSRAMY---FRLFAFQLLGLTLDRLLYLDADVVCKGDI 139 IT +++ D+L+ + + W + +Y +++ AF L + DR+LY++ V G + Sbjct: 65 TITFVDVSSYPDELKEI-GVKAW-KGLYITWYKMLAFAKLDIKTDRILYINPHTVISGAL 122 Query: 140 SQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEK 199 LL L V A+ D + A + + + YFN G++ ++ KKW K+ K Sbjct: 123 DGLLELDFEDNVMALSYDATMVN--AHKDVIGLKPIDGYFNCGIMLINHKKWMKDKIDAK 180 Query: 200 ALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYN--TI---YTIKSELK-----DKTH 249 L + N Y+ DQD+ NV KG + EYN T+ Y IK +K ++ Sbjct: 181 MREHL--RYNHYEVADQDLCNVFFKGNIKKVGVEYNFSTVFYGYDIKKYIKANGFLPESF 238 Query: 250 QNYKKLITE--STLLIH--YTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAK 302 +Y +++ + +IH + +PW + P ++ L +PWK+ + AK Sbjct: 239 YSYDEIMESYYTPKIIHSQFGMNGRPWQQGNENPVGILWRKYLNLTPWKNATMPVAK 295 >UniRef50_D2QX94 Glycosyl transferase family 8 n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2QX94_9PLAN Length = 362 Score = 68.9 bits (167), Expect = 2e-10, Method: Compositional matrix adjust. 
Identities = 68/306 (22%), Positives = 127/306 (41%), Gaps = 36/306 (11%) Query: 23 NTSECLNVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKIAKLAEQ 81 N + + D N+ G+ + S + N ++D +++ D +I++ Sbjct: 10 NMPTSIQLVTSSDNNFAIGLAGTFKSALTNLAADSSVDLWVLDGGITDENKAEISR--HL 67 Query: 82 NQLRITLYRINTDK--LQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDI 139 + R+TL+ ++ D+ + + + A Y+RL ++L + + +YLD+D++ +GD+ Sbjct: 68 SDPRLTLHFVSVDRKLVSQFVISHHVTDATYYRLLTPEILSRDIGKFIYLDSDLLIRGDL 127 Query: 140 SQLLHLGLNGAVAAVVKDV-EPMQEKAVSRLSDPELLG--------------------QY 178 ++L + +GA ++D P + P L G Y Sbjct: 128 TKLWNTPFDGAPCVAIQDSGAPFVDSTQLIEQQPSLRGCIANANPIPNYRELGLHPHAPY 187 Query: 179 FNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIY 238 N GV+ +DL W +L E+ L +L Y DQ +NV+L +N Sbjct: 188 LNGGVMMIDLDLWRREQLAERMLKVLSDYREHVTYWDQYALNVVLSQRWKQADHRWN--- 244 Query: 239 TIKSELKDKTHQN--YKK----LITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSP 292 I L+ +H+N + K L + H+T KPW I+P + + LE S Sbjct: 245 QIAYPLRFSSHENTIFSKEAFDLYRNDPYISHFT-YRKPWQAECIHPRSEEFYQYLEGSI 303 Query: 293 WKDDSP 298 W + P Sbjct: 304 WANTKP 309 >UniRef50_A5VK24 Glycosyl transferase, family 8 n=18 Tax=Lactobacillales RepID=A5VK24_LACRD Length = 282 Score = 68.9 bits (167), Expect = 2e-10, Method: Compositional matrix adjust. 
Identities = 64/258 (24%), Positives = 115/258 (44%), Gaps = 25/258 (9%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIAD--VYNDGFFQKIAKLAEQNQLR 85 +N+ + ++ ++ + + SI LN + + Y++ + +++ K N Sbjct: 1 MNLLFSINDKFVTQLATVLLSIKLNTQAQEFNVYVLQKDKLKRTDDLERVCKQLGMNYFP 60 Query: 86 ITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHL 145 I ++N P T + +Y+RL A +LL L ++LYLDADV+C D+S L Sbjct: 61 I---KVNDQLFNKAPVTDRYPTTIYYRLLAHRLLPQDLHKILYLDADVLCINDLSSLYET 117 Query: 146 GLNGAVAAVVKDVEPMQEKAV---SRLSDPELLGQYFNSGVVYLDL----KKWADAKLTE 198 L+G + A V RL + + G Y+NSGV+ ++L KK D + Sbjct: 118 SLDGYLYASAIHTNLTNTTEVINKIRLQNFDADG-YYNSGVLLMNLDTIRKKVKDTDIFN 176 Query: 199 --KALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTH--QNYKK 254 + ++L+ PDQDV+N L +P + T K + + + Sbjct: 177 YIRTHTLLL--------PDQDVLNALYGRYIKSVPDQLYNFDTRKGGIYETISFGEWTTD 228 Query: 255 LITESTLLIHYTGATKPW 272 + +T+++HY G KPW Sbjct: 229 WVMRNTVILHYCGRDKPW 246 >UniRef50_C6DEN1 Glycosyl transferase family 8 n=1 Tax=Pectobacterium carotovorum subsp. carotovorum PC1 RepID=C6DEN1_PECCP Length = 602 Score = 68.9 bits (167), Expect = 2e-10, Method: Compositional matrix adjust. 
Identities = 67/257 (26%), Positives = 112/257 (43%), Gaps = 24/257 (9%) Query: 30 VAYGVDANYLDGVGVSITSIVLN-NRHINL-DFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 + + D Y V++ S+ + R NL D YI G + ++A + +T Sbjct: 327 IFFCADTAYTAPAIVALISLAIAIERSENLPDIYIFVLPEAHGLWGQLASSFNREFPSLT 386 Query: 88 LYRINTDKLQC---------LPCTQVWSRAMYFRLFAFQLLG-LTLDRLLYLDADVVCKG 137 L ++T ++Q + S Y RL+A + L + R LYLD+DVV + Sbjct: 387 LRVVSTLQMQLDQSRAHYGFNSMGDMLSTMAYARLYASRYLSQCGVARALYLDSDVVIQS 446 Query: 138 DISQLLHLGLNG-AVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKL 196 LL++ + +AA V P+ + AV+ P G+YFNSGV+ LD A Sbjct: 447 SPLPLLYMDMEEFPLAACHDQVGPLVDHAVTLHGIPN--GRYFNSGVMLLDFHHPATLPA 504 Query: 197 TEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLI 256 E A++ D+V + DQ +N ++G+ L L +YN + + Sbjct: 505 IEAAITYSEDTDSVLIFQDQCALNKAIRGLYLTLDGKYNCYMPPGRPM---------SAM 555 Query: 257 TESTLLIHYTGATKPWH 273 E+ ++H+ KPWH Sbjct: 556 YENAAIVHFVSTPKPWH 572 >UniRef50_B3WD33 WbbM protein n=8 Tax=Lactobacillus RepID=B3WD33_LACCB Length = 318 Score = 68.6 bits (166), Expect = 3e-10, Method: Compositional matrix adjust. 
Identities = 69/263 (26%), Positives = 114/263 (43%), Gaps = 43/263 (16%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQK-IAKLAEQNQLRITL 88 + + VD Y+ + V++TSI N+ DF I ++ N G QK +LA T+ Sbjct: 9 IFFSVDDGYVPCLAVALTSI-RTNKDPQTDFVI--NILNSGLLQKNQTRLAALAAPHFTI 65 Query: 89 YRINTDKL-QCLPCTQVWSRA------MYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQ 141 I+ D + Q + R +YFRLF + D+ +Y+DAD V GD+++ Sbjct: 66 NFIDMDAVTQQISGDTNKLRGDYVTLTIYFRLFIADMFP-QYDKAIYIDADTVADGDLAE 124 Query: 142 LLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELL-----------GQYFNSGVVYLDLKK 190 L L + A V D P+ ++ PE + G+Y NSGV+ L+L + Sbjct: 125 LFTTDLGDNLVAGVAD--PVM------MTYPETIEYIQRDFGVQPGEYINSGVLILNLAQ 176 Query: 191 WADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQ 250 +++ L +L + DQD +NV+ + +LP+ +N + + + Sbjct: 177 MRQEHFSDRFLHLLKTYHFTMIAADQDYINVIAQHRIKYLPKTWNMQTGVPTAAE----- 231 Query: 251 NYKKLITESTLLIHYTGATKPWH 273 LIHY KPWH Sbjct: 232 -------SGGKLIHYNLFGKPWH 247 >UniRef50_B7GNT4 Glycosyl transferase, family 8 n=5 Tax=Bifidobacterium RepID=B7GNT4_BIFLI Length = 1013 Score = 68.2 bits (165), Expect = 4e-10, Method: Compositional matrix adjust. 
Identities = 51/182 (28%), Positives = 90/182 (49%), Gaps = 26/182 (14%) Query: 110 YFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQE------ 163 Y+R QLL D++LYLD+D++ GDI++L + L + V+D++ + Sbjct: 755 YYRFLIQQLLP-NYDKVLYLDSDIIIVGDIAKLYDIDLQDNLLGAVRDIDFLGNLNVKHG 813 Query: 164 ------KAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQD 217 K V ++ +P YF +GV+ L+ K + E+ L+ + + Y Y DQD Sbjct: 814 KRMSYAKDVLKMKNPY---DYFQAGVLVLNTKGMRNRYSIEQWLTYASNPN--YIYNDQD 868 Query: 218 VMNVLLKGMTLFLPREYNTIY-------TIKSELKDKTHQNYKKLITESTLLIHYTGATK 270 V+N +G L+LP E+N ++ + ++ + + Y K + + +IHY G K Sbjct: 869 VLNAYCEGKVLYLPWEWNVVHDCGGRVGNLFTQAPNDVYDAYVKSRS-NPQIIHYAGYQK 927 Query: 271 PW 272 PW Sbjct: 928 PW 929 >UniRef50_A7A7B4 Putative uncharacterized protein n=2 Tax=Bifidobacterium adolescentis L2-32 RepID=A7A7B4_BIFAD Length = 1009 Score = 68.2 bits (165), Expect = 5e-10, Method: Compositional matrix adjust. Identities = 51/203 (25%), Positives = 96/203 (47%), Gaps = 26/203 (12%) Query: 106 SRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQE-- 163 S Y+R ++L D++LYLD+D++ GDI++L ++ L G + ++D++ + Sbjct: 747 SVETYYRFLIQKVLPF-YDKVLYLDSDIIINGDIAKLYNIDLQGKMLGAIRDIDFLANLN 805 Query: 164 ----------KAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKY 213 + V ++ +P YF +GV+ L+ K + ++ L+ + D +Y Sbjct: 806 VKHGKRMGYAQTVLKMKNPY---DYFQAGVLVLNTKAMREHYTIKQWLTYASNPDFIYN- 861 Query: 214 PDQDVMNVLLKGMTLFLPREYNTIYTIKSEL-------KDKTHQNYKKLITESTLLIHYT 266 DQDV+N +G L+LP E+N ++ + + + Y K + ++HY Sbjct: 862 -DQDVLNAHCEGNVLYLPWEWNVVHDCGGRVGNLFVQAPNDIYDAYMKSRNDPQ-IVHYA 919 Query: 267 GATKPWHKWAIYPSVKYYKIALE 289 G KPW + Y+K A E Sbjct: 920 GFQKPWTDPDCDFASMYWKYARE 942 >UniRef50_C6DEN2 Glycosyl transferase family 8 n=1 Tax=Pectobacterium carotovorum subsp. carotovorum PC1 RepID=C6DEN2_PECCP Length = 615 Score = 67.8 bits (164), Expect = 5e-10, Method: Compositional matrix adjust. 
Identities = 52/172 (30%), Positives = 80/172 (46%), Gaps = 13/172 (7%) Query: 103 QVWSRAMYFRLFAFQLL-GLTLDRLLYLDADVVCKGDISQLLHLGLNG-AVAAVVKDVEP 160 ++ S Y RL+A + L GL + R LYLD+DVV + LLH+ + G +AA + P Sbjct: 420 KMLSITAYARLYASRYLQGLGITRALYLDSDVVIRRSPLGLLHMDMGGYPLAARTERAHP 479 Query: 161 MQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMN 220 +A+ P G+YFNSG++ LD + A A++ DN Y DQ +N Sbjct: 480 RISRAIKLHGIPN--GRYFNSGILLLDFQHPATQSTLNTAIAYSEQLDNKLLYLDQCALN 537 Query: 221 VLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTGATKPW 272 ++G+ L L ++N D H E ++H+ KPW Sbjct: 538 KSIQGLYLDLDEKFNWFIVP----DDTAHPQ-----DEDAAIMHFISTPKPW 580 >UniRef50_A7H2M2 Glycosyl transferase family 8 n=3 Tax=Campylobacter jejuni RepID=A7H2M2_CAMJD Length = 381 Score = 67.4 bits (163), Expect = 6e-10, Method: Compositional matrix adjust. Identities = 64/232 (27%), Positives = 110/232 (47%), Gaps = 28/232 (12%) Query: 28 LNVAYGVDANYLDGVGVSITSIV----LN---NRHINLD------FYIIADVYNDGFFQK 74 ++ + NY+ V +TSI+ LN + N D F+I++D ++ + Sbjct: 2 FHIVLNANENYIKYAAVLMTSIIQKTDLNKSMSEFCNFDTDEGYVFHILSDHISESMKVR 61 Query: 75 IAKLAEQ-NQL---RITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLD 130 I+ L +Q N + +I L+ +N D+ + + + Y+R+ +L L LYLD Sbjct: 62 ISNLEKQLNDIYPCKIVLHILNDDEFKGM-LKWRGNYLAYYRIKMASVLPQNLKICLYLD 120 Query: 131 ADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQ--EKAVSRLSDPEL-----LGQYFNSGV 183 D++C GD+ +LL + +N AAV D + +K L E + +YFNSG Sbjct: 121 CDMLCFGDLRELLSVDINNYQAAVCLDGNNHKKNKKVFFSLKGREKYKFSNIEKYFNSGF 180 Query: 184 VYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYN 235 + ++L +W + K++ L + YPDQD +N L TL LP +N Sbjct: 181 ILVNLDRWRRDNIENKSIDFLKKFKTL--YPDQDALNFALND-TLLLPNRWN 229 >UniRef50_A5LNA9 Glycosyl transferase, family 8 n=2 Tax=Streptococcus pneumoniae RepID=A5LNA9_STRPN Length = 402 Score = 67.0 bits (162), Expect = 9e-10, Method: Compositional matrix adjust. 
Identities = 67/256 (26%), Positives = 105/256 (41%), Gaps = 42/256 (16%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLY 89 + G D NY D + +I SI +NR +L FYI + +F + K E+ I Sbjct: 7 IVLGADNNYRDKLETTIKSICYHNR--DLKFYIFNEDIPKEWFYLMEKRLEKLNCEILNI 64 Query: 90 RINTDKLQCLPCTQVWSRAM-YFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLN 148 I+ +K++ + M YFR F + + DR +YLD D+V G+I+ L Sbjct: 65 EIDAEKVKYFSTPDEHIKYMTYFRYFIAEFV--KEDRAVYLDCDMVIHGNINPLFQKDFE 122 Query: 149 GAVAAVVKDVEPMQEKAVSRLSDPELLGQY---FNSGVVYLDLKKWADAKLTEKALSILM 205 G V D G Y FN+G++ +++ KW + + L + Sbjct: 123 GNYIIAVPD------------------GWYKNIFNAGMMMVNVHKWKTDNICQNLLELTA 164 Query: 206 SKDNVYKYPDQDVMNVLLKGMTLFLPREYNTI--------YTIKSE-LKDKTHQNYKKLI 256 K Y DQ V+N+L + + YN + + K E + +NYK I Sbjct: 165 EKHQEI-YGDQGVLNLLFENKWKKVSPHYNFMVGLDTLGYWAQKPEWFLNSWDENYKPAI 223 Query: 257 TESTLLIHYTGATKPW 272 IH+ G KPW Sbjct: 224 ------IHFEGKDKPW 233 >UniRef50_A4WWT5 Glycosyl transferase, family 8 n=1 Tax=Rhodobacter sphaeroides ATCC 17025 RepID=A4WWT5_RHOS5 Length = 319 Score = 67.0 bits (162), Expect = 9e-10, Method: Compositional matrix adjust. Identities = 52/168 (30%), Positives = 81/168 (48%), Gaps = 15/168 (8%) Query: 110 YFRLFAFQLLGLTLDRLLYLDADV-VCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSR 168 Y RL + DR+LYLD+D+ + GD+ L+ L L G A V+D + + + R Sbjct: 91 YLRLVLPEAFSEDYDRILYLDSDIYIQGGDLGALIALPLAGRPLAAVRDNKQWRTPS-RR 149 Query: 169 LSDPELLG----QYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLK 224 + D + LG YFNSGV+ D+ + A L ++AL I S+ DQ ++N + Sbjct: 150 MVDFDRLGLPQRPYFNSGVLLFDVPAFRAANLLQEALRIGRSQGRQLVRHDQSLLNACML 209 Query: 225 GMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTGATKPW 272 G L +N YT S L + ++ + +IH+ G KPW Sbjct: 210 GNWAELSPSWNWQYTWSSRL-------FAAMLGPN--IIHFIGRCKPW 248 >UniRef50_A3VFX3 Glycosyltransferase n=1 Tax=Rhodobacterales bacterium HTCC2654 RepID=A3VFX3_9RHOB Length = 615 Score = 67.0 bits (162), Expect = 9e-10, Method: Compositional matrix adjust. 
Identities = 83/343 (24%), Positives = 142/343 (41%), Gaps = 49/343 (14%) Query: 27 CLNVAYGVDANYLDGVGVSITSIVLN---NRHINLDFYIIADVYNDGFFQKIAKLAEQNQ 83 +NVA+ D YL + S++ + +R NL FY+ ++ D + LA + Sbjct: 268 AVNVAFTSDRPYLPQTAAMVASLIEHAAPDREYNL-FYLHENI-GDRDLDLLRSLAVAHG 325 Query: 84 LRITLYRINTDKL---QCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDIS 140 ITL+ IN + S A Y R F LL ++RL+YLD D+V GD++ Sbjct: 326 -NITLHTINVGTAFSREYRARHHTPSNATYNRFLLFDLLP-DVERLVYLDVDLVLCGDVA 383 Query: 141 QLLHLGLNGAVAAVVKDVEPMQEKAVS-RLSDPEL-----------------LGQYFNSG 182 +L +N A A V D + A R DPE+ + +YFN+G Sbjct: 384 ELFDTDMNDAPLAAVTDALMTRVLATRVRTRDPEVPDLYAYLSDDLGLSDDQISRYFNAG 443 Query: 183 VVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKS 242 V+ ++ AK+ + M N Y + DQD++NV + + LP +N + + Sbjct: 444 VMVMNFAAMDVAKVGRELRE--MVAGNRYFFRDQDILNVYFRDRFVTLPSRFNVHNSDRG 501 Query: 243 ELKDKTH--QNYKKLITESTLLIHYTGA-TKPWHKWAIYPSVKYYKI---ALENSP-WKD 295 + +N ++H+ A KPW + P V++ + L +P W + Sbjct: 502 AYDNVPVPIRNDALAAKADPFIVHFAAAHQKPWRE----PDVEFAGLFWSTLARTPFWFE 557 Query: 296 DSPRDAKSIIEFKKRYKHLLVQHHYISGIIAGVCYLCRKYYRK 338 ++E +R++ L + GV R+ R+ Sbjct: 558 --------VLEATRRHRSLRARLSRPDTWKHGVVIAGRRLGRR 592 >UniRef50_C7RG54 Glycosyl transferase family 8 n=1 Tax=Anaerococcus prevotii DSM 20548 RepID=C7RG54_ANAPD Length = 273 Score = 67.0 bits (162), Expect = 1e-09, Method: Compositional matrix adjust. 
Identities = 65/260 (25%), Positives = 112/260 (43%), Gaps = 36/260 (13%) Query: 34 VDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQ-NQLRITLY--R 90 +D NY+ + V +TSI +NN D Y+I ++ K+ L E + TLY R Sbjct: 9 LDENYIPQMKVLMTSIYINNPGRIFDVYLIHSRISE---DKLKDLGEDLKKFSYTLYPIR 65 Query: 91 INTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGA 150 D T + + MY+RL A + L L +LYLD D++ + LL ++ Sbjct: 66 ATDDLFSFAKVTDRYPKEMYYRLLAGEFLPENLGEILYLDPDMLVINPLDDLLRTDISDY 125 Query: 151 V-AAVVKDVEPMQEKAVSRLSDPELLG---QYFNSGVVYLDLKKWADAKLTEKALSILMS 206 + AA + V+R+ LG Y+NSG++ ++LK+ + ++ S + Sbjct: 126 ILAAASHTGKTDMANNVNRIR----LGTDTDYYNSGLLLINLKRAREEIDPDEIFSFVED 181 Query: 207 KDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITES------- 259 PDQD++N + + IY + + + +NY + S Sbjct: 182 NHMNLLLPDQDILNAMYG----------DRIYPLDDLIYNYDARNYSSYLIRSKKQADLA 231 Query: 260 -----TLLIHYTGATKPWHK 274 T+++H+ G KPW K Sbjct: 232 WLMDHTVVLHFCGRDKPWKK 251 >UniRef50_D1N145 Glycosyl transferase family 8 n=1 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1N145_9BACT Length = 311 Score = 67.0 bits (162), Expect = 1e-09, Method: Compositional matrix adjust. 
Identities = 68/271 (25%), Positives = 126/271 (46%), Gaps = 13/271 (4%) Query: 25 SECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQL 84 +E + VA D NYLD V+ S++ + + +++ + ++ F + L + Sbjct: 2 AEDIQVAMATDRNYLDYALVAAASLLAQHPGGGITLHLLHEELDESDFARFEALRRIDGF 61 Query: 85 RITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQL-- 142 R+ +I Q P + WS + Y+RL LL L+++LYLD D++ DI++L Sbjct: 62 RLVPRKIERGFFQGWPELR-WSTSAYYRLILPSLLP-DLEKILYLDCDLLVLDDIAELWN 119 Query: 143 LHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALS 202 LG AA V+ V P +K + ++ YFNSGV+ +L+K A ++ + Sbjct: 120 TELGSRSCAAAAVR-VAPEHQKKIGLPAE----AVYFNSGVMLFNLRKMAHENHEKRFIR 174 Query: 203 ILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLIT--EST 260 + KYPDQD++N+ + L + +N + ++ + +++ Sbjct: 175 LFDELGGRIKYPDQDILNLAYWNDYVKLSQRWNLVTSVYRNPPTPALYSEAEVVEALRRP 234 Query: 261 LLIHYTGATKPWH--KWAIYPSVKYYKIALE 289 + H+TG KPW K +P +Y++ E Sbjct: 235 GIAHFTGTHKPWRLGKTTHHPYARYFRAYAE 265 >UniRef50_B4RJG6 LgtC n=29 Tax=Neisseria RepID=B4RJG6_NEIG2 Length = 307 Score = 66.6 bits (161), Expect = 1e-09, Method: Compositional matrix adjust. 
Identities = 62/258 (24%), Positives = 110/258 (42%), Gaps = 15/258 (5%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 +++ + D NY + V+ S+ + + F+++ ++ +A I Sbjct: 1 MDIVFAADDNYAAYLCVAAKSVEAAHPDTEIRFHVLDAGISEENRAAVAANLRGGGGNIR 60 Query: 88 LYRINTDKLQCLPCT-QVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLG 146 +N + P + S Y RL + + D++LYLD DV+ + + L Sbjct: 61 FIDVNPEDFAGFPLNIRHISITTYARLKLGEYIA-DCDKVLYLDTDVLVRDGLKPLWDTD 119 Query: 147 LNGA-VAAVVKDVEPMQE--KAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSI 203 L G V A + QE K ++D E YFN+GV+ ++LKKW + + + Sbjct: 120 LGGNWVGACIDLFVERQEGYKQKIGMADGEY---YFNAGVLLINLKKWRRHDIFKMSCEW 176 Query: 204 LMSKDNVYKYPDQDVMNVLLKGMTLFLPREYN---TIYTIKSELKDKTHQNYKKLITEST 260 + +V +Y DQD++N L KG + +N T Y + H + L +T Sbjct: 177 VEQYKDVMQYQDQDILNGLFKGGVCYANSRFNFMPTNYAFMANGFASRHTDPLYLDRTNT 236 Query: 261 LL----IHYTGATKPWHK 274 + HY G+ KPWH+ Sbjct: 237 AMPVAVSHYCGSAKPWHR 254 >UniRef50_UPI000196958D hypothetical protein BACCELL_01586 n=1 Tax=Bacteroides cellulosilyticus DSM 14838 RepID=UPI000196958D Length = 305 Score = 66.2 bits (160), Expect = 2e-09, Method: Compositional matrix adjust. 
Identities = 60/264 (22%), Positives = 107/264 (40%), Gaps = 7/264 (2%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 +N+ D+ Y+ V + S NN Y++ + + I K+ Sbjct: 1 MNIVCAADSGYVQHCSVMLISFFENNPGEEHAVYLLTEGLDLDDLDFIQKIVHSYNGHFF 60 Query: 88 LYRINTDKLQCLP--CTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHL 145 +++ L+ P T S A Y RLF LL ++++LYLD D++ I +L Sbjct: 61 YCQVDFKFLEKCPIKSTDHLSIATYNRLFMADLLPADVNKVLYLDCDIIVNQSIKELWET 120 Query: 146 GL-NGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSIL 204 L + V A ++ E RL G YFN+GV+ ++L W +T+ + + Sbjct: 121 PLRDNFVVAAFEERGCCAEDVYERLDYDSKYG-YFNAGVLLVNLDYWRTHNMTQAFIEYI 179 Query: 205 MSKDNVYKYPDQDVMNVLLKGMTLFLPREYNT--IYTIKSELKDKTHQNYKKLITESTLL 262 + DQDV+N ++ + +N I+ +K + I + Sbjct: 180 EHNFEKLRAHDQDVLNAFFYDKSVHISLAWNVEFIFYYYGIIKKFGFDRDLRFILRHPKI 239 Query: 263 IHYTGATKPWHKWAIYP-SVKYYK 285 +H+T KPW +P + YY+ Sbjct: 240 LHFTWKPKPWETSCQHPFRINYYR 263 >UniRef50_D2MYR1 Putative uncharacterized protein n=1 Tax=Campylobacter jejuni subsp. jejuni 414 RepID=D2MYR1_CAMJE Length = 383 Score = 66.2 bits (160), Expect = 2e-09, Method: Compositional matrix adjust. 
Identities = 51/187 (27%), Positives = 86/187 (45%), Gaps = 14/187 (7%) Query: 60 FYIIADVYNDGFFQKIAKLAEQ----NQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFA 115 F+I++D +D K+ L E I +Y IN D + + + Y+RL Sbjct: 62 FHILSDFISDKTRMKLEYLKENLAKIYPCDIKIYIINEDNFRNFLHWK-GNFVAYYRLMV 120 Query: 116 FQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELL 175 +L +++ LY+DAD++C DI +L L V V D + + L Sbjct: 121 GSILPPDIEKCLYIDADMLCFSDIRKLFLFDLEDKVLGAVADFATWNTRFLKFRKLKYLF 180 Query: 176 G-------QYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTL 228 +YFNSG++ +DLK+W + +K L +L + PDQD +N+++K + Sbjct: 181 KGFLKFSREYFNSGLLLIDLKEWRRQNIEKKCLDVLKYYKCI--LPDQDALNIVIKENYI 238 Query: 229 FLPREYN 235 LP +N Sbjct: 239 KLPLSFN 245 >UniRef50_D2QX95 Glycosyl transferase family 8 n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2QX95_9PLAN Length = 350 Score = 65.9 bits (159), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 65/222 (29%), Positives = 100/222 (45%), Gaps = 35/222 (15%) Query: 106 SRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDV------- 158 S A Y+RL A LL ++ +LLY+DAD++ + D++ L +G V D+ Sbjct: 84 SDAAYYRLLAPNLLPSSVKKLLYIDADLLVQRDLTDLWDEPFDGHSCIAVHDIGAPFLDS 143 Query: 159 -EPMQEK--AVSRL--SDP----ELLG-----QYFNSGVVYLDLKKWADAKLTEKALSIL 204 + + EK A+SR+ +P E LG +YFNSGV +DL+ W +L+ + +L Sbjct: 144 NQILLEKPDALSRIVCRNPIPMFEELGLAPETRYFNSGVFMIDLETWRSEQLSVQMFDVL 203 Query: 205 MSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTH--------QNYKKLI 256 + Y DQ +N++L +N + I ELK H Q YK Sbjct: 204 CTHRERQIYHDQFALNIVLANRWKAADYRWNQLAYIH-ELKVPQHTFLEPQVFQQYK--- 259 Query: 257 TESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSP 298 S ++H+T KPW +P K + L S W P Sbjct: 260 -HSPWVVHFT-YRKPWQPECQHPLRKRFFDYLAGSKWMQAMP 299 >UniRef50_C0BAU3 Putative uncharacterized protein n=1 Tax=Coprococcus comes ATCC 27758 RepID=C0BAU3_9FIRM Length = 348 Score = 65.5 bits (158), Expect = 2e-09, Method: Compositional matrix adjust. 
Identities = 65/259 (25%), Positives = 117/259 (45%), Gaps = 27/259 (10%) Query: 38 YLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLYRINTDKL 96 Y+ + + SI N N N D I+ + G ++ K E +Q ITL ++ + Sbjct: 22 YVPYLAAVLESIRANSNDDQNYDLIIMHRDISMGSQDRLKKQLEDHQ-NITLRFLDIRRY 80 Query: 97 QCLPCTQVWSRA-----MYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAV 151 + P +++ R YFRL Q+L D+ +Y+D+D+V DI++L ++G + Sbjct: 81 EK-PFKKLFLRGHFALETYFRLLMPQILA-DYDKAVYIDSDLVVNADIAELYATDVDGYL 138 Query: 152 AAVVKDV---------EPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALS 202 A KD EP ++K + + + +YF +GV+ +L ++ T + L Sbjct: 139 LAAAKDADTAGLYNGFEPNKKKYMDTILKIKKPYEYFQAGVIVFNLAEFRKTYTTAEMLK 198 Query: 203 ILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKS-------ELKDKTHQNYKKL 255 S + ++ DQDV+N L +G F+ +N + + L K + Sbjct: 199 FAASYE--WELLDQDVLNYLAQGRVKFVDMAWNVMVDWRGIRLSQIIALAPKYLHDEHME 256 Query: 256 ITESTLLIHYTGATKPWHK 274 ++ +IHY G KPWH+ Sbjct: 257 ARKNPKIIHYAGPDKPWHQ 275 >UniRef50_C1QC03 LPS:glycosyltransferase n=1 Tax=Brachyspira murdochii DSM 12563 RepID=C1QC03_9SPIR Length = 347 Score = 65.5 bits (158), Expect = 2e-09, Method: Compositional matrix adjust. 
Identities = 66/282 (23%), Positives = 131/282 (46%), Gaps = 26/282 (9%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRH-INLDFYIIADVYNDGFFQKIAKLAEQNQLRI 86 +NV + + Y + +I S++ N + N++ YII++ N+ +KI L + + I Sbjct: 10 INVCFASNDAYAPYMSTAIASLLSNAKDDENINIYIISENINNSNKEKILSLKKIRECSI 69 Query: 87 TLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLG 146 + + + + S + +FRL L+ D+++YLD D++ + +L Sbjct: 70 DFIEPKEEIFKYISKYNMKSNSTWFRLSIPSLIP-NADKIVYLDGDMIINSSLRELFSDD 128 Query: 147 LNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMS 206 ++ A VV+DV ++ + + + +YFN+G + ++ K W + L EK + + + Sbjct: 129 MSDYYAYVVEDVMDKIDEVKAPIGFSK-TDKYFNAGFLMINNKLWIEDNLEEKFYNAVDT 187 Query: 207 KDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITE--STLLIH 264 + Y DQD++N LK F+ ++++ + YK++ + +IH Sbjct: 188 MP-ILGYKDQDILNYCLKNRVKFIDKKWDFL---------DNKSCYKEISADINKINIIH 237 Query: 265 YTGATKPWHKW---AIYPSV--KYYKIALENSPWKDDSPRDA 301 G KPW K A + KYY++ +PW + P DA Sbjct: 238 CVG--KPWKKECNVAFFADEFWKYYQL----TPWFLERPIDA 273 >UniRef50_D1Y7U2 Glycosyltransferase, family 8 n=1 Tax=Pyramidobacter piscolens W5455 RepID=D1Y7U2_9BACT Length = 617 Score = 65.5 bits (158), Expect = 3e-09, Method: Compositional matrix adjust. 
Identities = 58/212 (27%), Positives = 96/212 (45%), Gaps = 39/212 (18%) Query: 110 YFRLFAFQLLGLTLDRLLYLDADVVCKGDISQL--LHLGLNGAVAAVVKD-------VEP 160 ++R LL + D++LYLD D++ + DI+ L L LG N AA+ D P Sbjct: 364 FYRFLILDLLKM-YDKVLYLDCDMIIQRDIADLYDLDLGTNLIGAALDPDFTGQCNGANP 422 Query: 161 MQEK---AVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQD 217 K AV +L D YF +GV+ +++ + + + L M++ +YKY DQD Sbjct: 423 ATRKYCDAVLKLKD---CFTYFQAGVLLMNVAELNKSVTVRQLLE--MAETGIYKYSDQD 477 Query: 218 VMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLIT--------------ESTLLI 263 ++NV+ +G L+L +N L D H + ++ E +I Sbjct: 478 ILNVVCEGRALYLDMAWNL-------LSDCDHYRWHHVVKFAPHYILDMYENAREKPYII 530 Query: 264 HYTGATKPWHKWAIYPSVKYYKIALENSPWKD 295 HY G KPW K +++K A E +++ Sbjct: 531 HYAGFLKPWMKLGEDFGYEFWKAARETPFYEE 562 >UniRef50_A3CM53 Glycosyltransferase, putative n=9 Tax=Bacteria RepID=A3CM53_STRSV Length = 1074 Score = 65.5 bits (158), Expect = 3e-09, Method: Compositional matrix adjust. Identities = 70/254 (27%), Positives = 111/254 (43%), Gaps = 33/254 (12%) Query: 35 DANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLYRINTD 94 D Y + V +I SI+ N+ N+ Y+ +D +F+ +L EQ L L I+ D Sbjct: 9 DQAYQEQVSTTIKSILYYNK--NVKIYVFNQGLSDEWFRDFNELVEQ--LDSELVNISLD 64 Query: 95 KLQCLP--CTQVW-SRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAV 151 ++ P TQ S A Y R F Q + R+LYLD+D+V D+ L + L G + Sbjct: 65 QVTISPEWLTQDHISSATYARYFIPQFVAEG--RVLYLDSDLVVNRDLQPLFDIPLEGKL 122 Query: 152 AAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKL-------TEKALSIL 204 A V D G FN+GV+ +D + W + +L T++ + ++ Sbjct: 123 VAAVGDAG----------------GYGFNAGVLLIDNRSWKERELQESFIKETDRIMGLV 166 Query: 205 MSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIH 264 S DQ V+N +L L L + YN + + + N + + L+IH Sbjct: 167 QSGQMEDFNGDQTVLNHVLAQDWLPLDKIYN-LQVGHDLVAFYSGWNGHFELDQEPLIIH 225 Query: 265 YTGATKPWHKWAIY 278 YT KPW+ Y Sbjct: 226 YTTFRKPWNSEVSY 239 Score = 64.3 bits (155), Expect = 7e-09, Method: Compositional matrix adjust. 
Identities = 67/269 (24%), Positives = 118/269 (43%), Gaps = 29/269 (10%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLY 89 V +A Y + V +I SIV +NR I FY+I + +F + K + +I Sbjct: 409 VVLAANAAYSEQVLTTIKSIVCHNRFIK--FYVINSDFPTEWFVSMRKKLAKLDCQIVNA 466 Query: 90 RINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNG 149 R++ + +S ++ R F + D+ LYLD D+V D+S++ + L Sbjct: 467 RVDGSHISQYKTNIHYS--VFLRYFTATFV--EEDQALYLDCDIVVTRDLSEIFAVDLGS 522 Query: 150 AVAAVVKDVEPMQEKAVSRLSDPELLG-QYFNSGVVYLDLKKWADAKLTEKALSILMSKD 208 V+D L G Q FNSGV+ +++ W + + + + + + Sbjct: 523 YPLGAVRD-----------LGGEVYFGEQIFNSGVLLINVNYWRENDIAGQLIEMTDNLH 571 Query: 209 NVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTGA 268 + DQ ++N+L + + LP YN I T+ + D ++ K L +IHY Sbjct: 572 DKVTQDDQSILNMLFENRWMELPFAYNCI-TLHTTFSD--YEPEKGLYPP---VIHYLTE 625 Query: 269 TKPWHKW--AIYPSVKYYKIALENSPWKD 295 KPW ++ +IY V ++ L+ W D Sbjct: 626 RKPWKEYTQSIYREVWWFYQGLD---WSD 651 >UniRef50_A8FNA2 Putative uncharacterized protein n=2 Tax=Campylobacter jejuni subsp. jejuni 81116 RepID=A8FNA2_CAMJ8 Length = 791 Score = 65.1 bits (157), Expect = 3e-09, Method: Compositional matrix adjust. 
Identities = 69/293 (23%), Positives = 131/293 (44%), Gaps = 38/293 (12%) Query: 6 AIEIDKVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSI-VLNNRHINLDFYIIA 64 A EIDK F L + + + + DANY + V + SI ++ + N D YI+ Sbjct: 361 AGEIDKEIDNFFILPPQDKLSHIPIVFSCDANYFSYLTVVLQSIKEKSSENYNYDIYILH 420 Query: 65 DVYNDGFFQKIAKLAEQNQLRITLYRINTDKLQCLPCTQV-------WSRAMYFRLFAFQ 117 + + QK+ + I ++ ++ L +Q+ +S A Y+R F + Sbjct: 421 NKLDKSLTQKLINYIQAENFSIKF--VDISRILNLLKSQIQFYTALFFSEATYYRFFIPK 478 Query: 118 LLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLS-----DP 172 + +++YLD D++ K D++ L + + +AA + ++A R++ P Sbjct: 479 IFK-EFKKIIYLDTDIIVKQDLNLLYSIDFDKPLAAAKCMIFSQVKQADHRITKLKMKQP 537 Query: 173 ELLGQYFNSGVVYLDLKKWADAKLTEKALSILMS-KDNVYKYPDQDVMNVLLKGMTLFLP 231 E YF +GV+ +++K T+K L+ L KD DQDV+N + +G ++ Sbjct: 538 E---NYFQAGVMVYNIQKCLKMDFTQKCLNKLQELKDP--PLVDQDVLNAVFEGDIHYIS 592 Query: 232 REYNTIYTIKSELKDKTHQNYKKLITESTL-----------LIHYTGATKPWH 273 ++N ++ + + N+K L ++ L +IHY KPW+ Sbjct: 593 LKWNCLWNVSYRIP-----NFKILYSKDFLKDYQEAERDPYIIHYCDYFKPWN 640 >UniRef50_C5EZG9 Glycosyl transferase family protein n=1 Tax=Helicobacter pullorum MIT 98-5489 RepID=C5EZG9_9HELI Length = 374 Score = 65.1 bits (157), Expect = 3e-09, Method: Compositional matrix adjust. 
Identities = 47/183 (25%), Positives = 86/183 (46%), Gaps = 16/183 (8%) Query: 110 YFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVV-----KDVEPMQEK 164 Y+RL L L++ R +YLD D++ GD+ +L + L G + VV D + + E Sbjct: 66 YYRLRIGSALPLSIKRCVYLDVDMIVLGDLRELFKINLQGKICGVVMEGKDNDTQNILES 125 Query: 165 AVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLK 224 ++ YFNSG++ +DL W + ++A I+ K +K+ D+ ++N +L+ Sbjct: 126 KNKINKSIAIVSNYFNSGMLLVDLDLWRKENIEDRAFEIV-KKYYCHKH-DEHILNAVLQ 183 Query: 225 GMTLFLPREYNTIYTIKSEL-----KDKTHQNYKKLITESTL----LIHYTGATKPWHKW 275 G T + ++N + + + K + Y + + L ++HY KPW Sbjct: 184 GQTFKILPQWNMMVFLYCRAVCLNERGKINMPYNRKDFNNALKNPKILHYHTHHKPWEDS 243 Query: 276 AIY 278 IY Sbjct: 244 KIY 246 >UniRef50_C3XN62 Glycosyl transferase n=1 Tax=Helicobacter winghamensis ATCC BAA-430 RepID=C3XN62_9HELI Length = 284 Score = 65.1 bits (157), Expect = 3e-09, Method: Compositional matrix adjust. Identities = 54/174 (31%), Positives = 85/174 (48%), Gaps = 27/174 (15%) Query: 127 LYLDADVVCKGDISQLLHLGLNGAVAAVVKD------VEPMQEKAVSRLSDPELLGQYFN 180 +YLD D++ D+ ++ + L G + V D +EP + KA+ L+ L YFN Sbjct: 1 MYLDVDMLVLKDLREIFAIDLEGKICGAVLDYKANRILEP-KNKALPMLN---LSKDYFN 56 Query: 181 SGVVYLDLKKWADAKLTEKALSILMSKDNVY--KYPDQDVMNVLLKGMTLFLPREYNTI- 237 +G++ +DL+KW KL K + L N Y K DQ +NV+LK LP +NT+ Sbjct: 57 AGLLLIDLEKWKSQKLESKLIETL----NQYHCKEHDQSALNVVLKDKIKILPLSWNTLV 112 Query: 238 -YTIKSELKDKTHQNYKKLITESTL--------LIHYTGATKPWHKWAIYPSVK 282 Y + ++ D T +N+ T L ++HY KPW+ IY +K Sbjct: 113 YYYVNAKACDDT-KNFNLFYTRKDLNKALKNPHILHYYLGFKPWNDDKIYTDIK 165 >UniRef50_B9BAZ6 Glycosyl transferase family 8 protein n=1 Tax=Burkholderia multivorans CGD1 RepID=B9BAZ6_9BURK Length = 617 Score = 64.7 bits (156), Expect = 5e-09, Method: Compositional matrix adjust. 
Identities = 74/295 (25%), Positives = 130/295 (44%), Gaps = 46/295 (15%) Query: 39 LDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKL--------AEQNQLRITLYR 90 L G VSI ++ N +L +I A V ++ +++ L A+Q +L + + Sbjct: 279 LGGNAVSIVTVADGNFVPHLAAFI-ASVQDNIDPERVLDLIVLDGGIPADQQRLLMKQFH 337 Query: 91 INTDK----LQC------LPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDIS 140 N +QC +P +S A ++RL +LL R++Y+D+D + GD+S Sbjct: 338 RNGKGRLSFIQCAHLFSDIPLHGPFSAATFYRLSMGELLA-KHRRVVYVDSDTIVLGDLS 396 Query: 141 QLLHLGLNGAVAAVVKDV--EPMQEKAVSRLSDP----------ELLG------QYFNSG 182 +L L L A V DV + V L + E +G +YF +G Sbjct: 397 ELFDLDLGNNAVAAVPDVIMKSFVSSGVPALREAGGAPAGIYLKERVGMGNRGNEYFQAG 456 Query: 183 VVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYT--- 239 ++ +DL ++ ++ E A L+++ Y + DQDV+N L G FL +N + Sbjct: 457 LIVIDLDEFRRLRIGEDAYKDLLARR--YWFLDQDVLNKYLLGHVKFLDLSWNVVNASMD 514 Query: 240 IKSELKDKTHQNYKKLITESTLLIHYTG-ATKPWHKWAIYPSVKYYKIALENSPW 293 + S L+ K++ + ++HY G KPW++ P +Y L + W Sbjct: 515 VLSGLETDIAAKVKEVFAAPS-MVHYAGHEAKPWNR-PTAPLAHFYWYYLRRTYW 567 >UniRef50_C7TIE0 Glycosyl transferase, group 8 n=2 Tax=Lactobacillus rhamnosus RepID=C7TIE0_LACRL Length = 286 Score = 64.3 bits (155), Expect = 5e-09, Method: Compositional matrix adjust. 
Identities = 57/215 (26%), Positives = 106/215 (49%), Gaps = 12/215 (5%) Query: 30 VAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVY--NDGFFQK-IAKLAEQNQLR 85 V + V +++ G +I S+VL+ +I L ++AD Y D F+ K I K + + Sbjct: 6 VLFTVTGSHIQLTGTAIASLVLHWPVNIPLRILVMADDYLNQDIFWLKSIPKQLLRPNIT 65 Query: 86 ITLYRINT--DKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLL 143 + +++ + D++ + + +RLFA + T DRLLYLD DV+ DIS + Sbjct: 66 VDVWQKPSIMDQVHTANTNTRYPSVVLWRLFAPYIFSDT-DRLLYLDNDVLICDDISPMF 124 Query: 144 HLGLNGAVAAVVKDVEPM---QEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKA 200 + + V D + + K S + + YFNSGV+ ++ K+ A ++ Sbjct: 125 DMLPDDKAIGAVNDFQTLLYADTKEGSIWPEIKHFDSYFNSGVLLINTHKYIQAYTQDQL 184 Query: 201 LSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYN 235 ++ + + D Y + DQ ++N L + ++ LP +YN Sbjct: 185 VNTINTSD--YSFIDQTILNNLFESQSIHLPLQYN 217 >UniRef50_Q046Z9 Lipopolysaccharide biosynthesis glycosyltransferase n=32 Tax=Lactobacillus RepID=Q046Z9_LACGA Length = 317 Score = 64.3 bits (155), Expect = 5e-09, Method: Compositional matrix adjust. Identities = 71/285 (24%), Positives = 134/285 (47%), Gaps = 39/285 (13%) Query: 27 CLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYN--DGFFQKIAKLAEQNQL 84 + V Y + NY VSI S++ ++ N D+ I V N D + + L+ +N + Sbjct: 3 TIPVFYTISDNYTPYAAVSIQSLI-DHVDQNKDYTITLLVQNISDKHKKDLEDLSIKN-V 60 Query: 85 RITLYRINTDKLQCLPCT-------QVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKG 137 + ++ I+ + + + + Q ++ ++++RLF L D+ +YLDAD + Sbjct: 61 HVNIFHIDDEMVAPIHNSEENYLRAQFFTMSIFYRLFIPNLFP-QYDKAVYLDADTIICT 119 Query: 138 DISQLLHLGLNGAVAAVVKD-----VEPMQE--KAVSRLSDPELLGQYFNSGVVYLDLKK 190 DI++L + + + A V D ++P+Q K + PE +Y N+GV+ ++K Sbjct: 120 DIAELYNTEIGDNMFASVPDMSIRFIKPLQVYIKECQGIFPPE---KYINNGVILFNMKA 176 Query: 191 WADAKLTEKALSILMSK--DNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKT 248 + D K +K S++ DN+ PDQ MN + + LP E++ + +E D+ Sbjct: 177 FRDKKFVDKFYSLIEKYHFDNI--DPDQAYMNEICEDKIYHLPLEWD---AMPNEHMDE- 230 Query: 249 HQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPW 293 ++ ++HY KPWH +A KY+ + SP+ Sbjct: 231 --------IKNPKIVHYNLFFKPWH-FADVQYGKYFWDVAKKSPY 266 
>UniRef50_B7AUG6 Putative uncharacterized protein n=1 Tax=Bacteroides pectinophilus ATCC 43243 RepID=B7AUG6_9BACE Length = 301 Score = 63.9 bits (154), Expect = 7e-09, Method: Compositional matrix adjust. Identities = 61/279 (21%), Positives = 121/279 (43%), Gaps = 28/279 (10%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQK-------IAKLAE 80 +N+ ++ ++ V +TS++ NN N+ ++ Y DG +K + + Sbjct: 1 MNILVAMNDAFVKCYQVMLTSLIKNNPDENITVHV---PYTDGLSRKGLDSIKELVRNQS 57 Query: 81 QNQLRITLYRINTDKLQCLP--CTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGD 138 + Y D+L L +WS M+FR+FA + + + DR+L+LD D++ G Sbjct: 58 HGSASVREYYFGKDRLGSLDKLPLGMWSVEMFFRIFAQEFIPESEDRILWLDGDIIVNGS 117 Query: 139 ISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQ--YFNSGVVYLDLKKWADAKL 196 I + + A +D+ K + + Y NSGV+ ++LK + + Sbjct: 118 IKDFYNTDFDSMYYAACEDIAISHGKIKEEYDNLGWSSEEIYVNSGVLLINLKALRNNGI 177 Query: 197 TEKALSILMSKDNVYK--YPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKK 254 T A ++ + +N+ K YPDQ ++N + F + ++ +++ Sbjct: 178 TRDA-AVEYALENMDKLHYPDQYMLNAMFHDKIKFA-----DAFRYNCQVSGYSYKLADM 231 Query: 255 LITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPW 293 +++ES +L H+ G +PW K+Y A+ W Sbjct: 232 ILSESAIL-HFPG-YRPWQT----DYQKHYSSAIPGDIW 264 >UniRef50_B7C7N8 Putative uncharacterized protein n=2 Tax=Firmicutes RepID=B7C7N8_9FIRM Length = 416 Score = 63.9 bits (154), Expect = 7e-09, Method: Compositional matrix adjust. 
Identities = 65/274 (23%), Positives = 115/274 (41%), Gaps = 32/274 (11%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLY 89 + D +Y+D + +I SI +N+ N+ FYI+ + +F+ + K I Sbjct: 22 IVLACDNSYMDKLETTIKSICAHNK--NIKFYILNEDLPIEWFRLMTKRLSYFNSEILNI 79 Query: 90 RINTDKLQCLPC-TQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLN 148 +++ D + C ++ + YFR + + +++LYLD D++ + L +L L Sbjct: 80 KVSGDSFKKFRCPSEHINYQSYFRYLIPDYV--SEEKVLYLDCDIIVTESLDGLFNLDLK 137 Query: 149 GAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMS-K 207 A V D+ + FNSGV+ ++ K W + + K + + + Sbjct: 138 NYPVAAVPDLPTTNDG--------------FNSGVLLINNKYWRENDILNKLIKLTVEYH 183 Query: 208 DNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTG 267 + VY DQ ++N+L K LP YN S+ + + KL +IHYT Sbjct: 184 EKVYG--DQGILNILFKDKWYRLPLTYNLQVGSDSQEHMIGNMEWYKLFDGIPKVIHYTY 241 Query: 268 ATKPW---------HKWAIYPSVKYYKIALENSP 292 KPW W Y + + K+ L N P Sbjct: 242 THKPWLMYNMTRFKEVWWFYHGISWDKMIL-NEP 274 >UniRef50_C4Z1V1 Putative uncharacterized protein n=1 Tax=Eubacterium eligens ATCC 27750 RepID=C4Z1V1_EUBE2 Length = 607 Score = 63.9 bits (154), Expect = 7e-09, Method: Compositional matrix adjust. Identities = 51/196 (26%), Positives = 91/196 (46%), Gaps = 19/196 (9%) Query: 93 TDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVA 152 TD+ + + ++ +YFRLF +L L++ +Y+D+D V DI++L + + A+ Sbjct: 354 TDRQENRLYSGEFTLTIYFRLFIAELFP-ELNKAVYIDSDTVINDDIAKLYSVDMGDAMF 412 Query: 153 AVVKDVEPMQEKAVSRLSDPELLG----QYFNSGVVYLDLKKWADAKLTEKALSILMSKD 208 V+D + ++ + ++G +Y NSGV+ ++L K A L ++ L ++ Sbjct: 413 GAVRDTFAGKNTILAHYIE-NVVGIERNEYVNSGVLLMNLDKIRQAHLADRFLKLMAEYH 471 Query: 209 NVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTGA 268 PDQD +N + FL +E+N + E + LIHY Sbjct: 472 FDSVAPDQDYINSMCAKEIYFLDKEWNVMPNKGGEYIARPK------------LIHYNLF 519 Query: 269 TKPWHKWAIYPSVKYY 284 KPWH I P +Y+ Sbjct: 520 DKPWHYSEI-PYEEYF 534 Score = 42.7 bits (99), Expect = 0.017, Method: Compositional matrix adjust. 
Identities = 61/263 (23%), Positives = 108/263 (41%), Gaps = 40/263 (15%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYND----------GFFQKIAK 77 +N+ Y D G+ +S S++ N L+ YI+ Y + F + + + Sbjct: 1 MNILYCGDKTMQKGILLSSMSLIKNVDE-PLNIYILTVDYGEKGINYKPVDKAFAKYLEE 59 Query: 78 LAEQNQLRITLYRINTDK--LQCLPCTQVWSR---AMYFRLFAFQLLGLTLDRLLYLDAD 132 ++ +++ ++ ++ + ++ LP + SR RLFA + DR+LYLD D Sbjct: 60 KLNKSDIKVNVFLVDVTRYFVEELPEANMQSRFTACCMLRLFADKTD--IKDRVLYLDTD 117 Query: 133 VVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQ-YFNSGVVYLDLKKW 191 V+C+ H ++G A VS L G Y NSGV+ ++++ Sbjct: 118 VLCRKGFRDFYHQNMDGIEIA-----------GVSDYYGRWLFGDGYINSGVMLMNMRMI 166 Query: 192 ADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKT-HQ 250 L EK + K+ PDQ +N + L R++N + L D T Q Sbjct: 167 RQNGLLEKCREQCIRKEMF--MPDQTAVNTFATRVNL-CGRKFND----QRRLHDNTVFQ 219 Query: 251 NYKKLITESTLLIHYTGATKPWH 273 ++ T + T + KPW Sbjct: 220 HFT--TTFRVFPVIRTVSVKPWE 240 >UniRef50_C1CFZ1 Glycosyl transferase, family 8 n=17 Tax=Streptococcus pneumoniae RepID=C1CFZ1_STRZJ Length = 404 Score = 63.9 bits (154), Expect = 8e-09, Method: Compositional matrix adjust. 
Identities = 71/255 (27%), Positives = 114/255 (44%), Gaps = 35/255 (13%) Query: 29 NVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITL 88 ++ + D +Y+D + +I SI N L FY+ D +F + K + Q I Sbjct: 5 SIVFNADNDYVDKLETAIKSICCYNN--CLKFYVFNDDIASEWFLMMNKRLKTIQSEIVN 62 Query: 89 YRINTDKLQ--CLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLG 146 +I L+ LP + S A +FR F + + R LYLD+D++ G + L + Sbjct: 63 VKIVDHVLKKFHLPLKNL-SYATFFRYFIPNFVKES--RALYLDSDIIVTGSLDYLFDIE 119 Query: 147 LNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMS 206 L+G A V+D S ++ FNSG++ +++ W D E A S L+ Sbjct: 120 LDGYALAAVED------------SFGDVPSTNFNSGMLLVNVDTWRD----EDACSKLLE 163 Query: 207 KDNVYK---YPDQDVMNVLLKGMTLFLPREYNTIYTIKSELK-DKTHQNYKKLITE---- 258 N Y Y DQ ++N+L L R +N + + S + H+ Y+ I+E Sbjct: 164 LTNQYHETAYGDQGILNMLFHDRWKRLDRNFNFMVGMDSVAHIEGNHKWYE--ISELKNG 221 Query: 259 -STLLIHYTGATKPW 272 +IHYTG KPW Sbjct: 222 DLPSVIHYTGV-KPW 235 >UniRef50_A2RLV8 Putative glycosyltransferase n=1 Tax=Lactococcus lactis subsp. cremoris MG1363 RepID=A2RLV8_LACLM Length = 397 Score = 63.5 bits (153), Expect = 9e-09, Method: Compositional matrix adjust. 
Identities = 53/212 (25%), Positives = 100/212 (47%), Gaps = 14/212 (6%) Query: 30 VAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITL 88 + Y V+ NY+ V S+TSI++N + +D I+++ D Q + ++ + + ++ L Sbjct: 6 IFYTVNGNYIQLVATSLTSIIMNIDEKFPVDIIIVSNDITDENKQTLYEILDMRKTQVNL 65 Query: 89 -YRINTDKLQCL--PCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHL 145 +R+ D L+ L + ++ + +R+F L +LLYLD+D + ++ L Sbjct: 66 LFRMPPDSLELLLGDVSNIFDNVVCWRIFMPYSLE-EYSQLLYLDSDTLIYEGFEEIFGL 124 Query: 146 GLNGAVAAVVKDVE--PMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSI 203 + V+ D + EK S+ YFNSGV ++++K+ E+ L Sbjct: 125 LPQDKILGVIPDFYFFAINEKNSSKRG-------YFNSGVYMINVEKYIQKNSKEELLKN 177 Query: 204 LMSKDNVYKYPDQDVMNVLLKGMTLFLPREYN 235 LM + Y DQ +N +G +LP +N Sbjct: 178 LMENFSEILYVDQTFLNNTFRGELFYLPLRFN 209 >UniRef50_C5SH34 Glycosyl transferase family 8 n=1 Tax=Asticcacaulis excentricus CB 48 RepID=C5SH34_9CAUL Length = 307 Score = 63.2 bits (152), Expect = 1e-08, Method: Compositional matrix adjust. Identities = 67/266 (25%), Positives = 112/266 (42%), Gaps = 30/266 (11%) Query: 24 TSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQ 83 T C + Y VD NYL VS + N D I+ +K+ +A Sbjct: 2 TRHC--ICYVVDDNYLFPTLVSASQARENAPSSLADIVILCLSDASDRVRKVMPVAVA-- 57 Query: 84 LRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLL 143 L I L + T ++ L MY RLF +LL +R+LY+D D + LL Sbjct: 58 LGIELIEVPTASIENL-------HPMYGRLFIDKLLPKAYERVLYIDGDTQIAASLEPLL 110 Query: 144 HLGLNGAVAAVVKDVEPM----QEKAVSRLSDPEL---LG-----QYFNSGVVYLDLKKW 191 ++ + V+D M +K SR+ + LG Y N+GV+ ++K W Sbjct: 111 NVDIPEGKFLAVRDPAAMFAKLSDKWASRIQGERVEAGLGDNPIEDYLNTGVLVFNMKDW 170 Query: 192 ADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYN-TIYTIKSELKDKTHQ 250 A+ L + L ++ ++ +K+ DQD MN+ + L++ +N + I S +++ Sbjct: 171 AE--LAGETLKLIRARSTPFKFGDQDPMNLAIGDRCLYISNRWNFPGFLIGSGQEERVKP 228 Query: 251 NYKKLITESTLLIHYTGATKPW-HKW 275 ++ +H A PW KW Sbjct: 229 VIYHFMSNPRPWVH---AGAPWGPKW 251 >UniRef50_C8W7U9 Glycosyl transferase family 8 n=2 Tax=Atopobium RepID=C8W7U9_ATOPD Length = 1014 Score = 
62.8 bits (151), Expect = 2e-08, Method: Compositional matrix adjust. Identities = 68/329 (20%), Positives = 142/329 (43%), Gaps = 58/329 (17%) Query: 22 INTSECLNVAYGVDANYLDGVGVSITSIVLN---NRHINLDFYIIADVYNDGFFQKIAK- 77 I + + V + D NY+ + ++ S++ N NR+ ++ ++ + G Q++ K Sbjct: 667 IASQNVVPVVFAADNNYVPILTCAMGSMLENADPNRYYDV---VVLNTNIGGSKQELVKK 723 Query: 78 -LAEQNQLRITLY---------RINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLL 127 + RIT Y +++T+ S YFR A +L D+++ Sbjct: 724 FFSRYKNARITFYNVWRMVKDYKLDTNNAHI-------SVETYFRFLAQDILS-AYDKVV 775 Query: 128 YLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVS------------RLSDPELL 175 YLD+D+V G++++L + + + A D++ + + L +P Sbjct: 776 YLDSDLVVNGNVAELYDVRIGNNLIAATLDIDYLANLNIRGGDRMKYSLDVLNLKNPY-- 833 Query: 176 GQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYN 235 YF +GV+ + + + L I + + ++ Y DQD++N +G L+LP ++N Sbjct: 834 -AYFQAGVMVFNTAELRRYHTVPEWLRI--ASNPIFIYNDQDILNSECQGRVLYLPADWN 890 Query: 236 TIYTIKSELKD-------KTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIAL 288 + I ++ +Y+ + ++H+ GA KPW + + ++K A Sbjct: 891 VTHNIFGRAEELYPMAPNSVFDDYQA-ARRAPKIVHFAGAIKPWQNASCDMASYFWKYA- 948 Query: 289 ENSPWKD-------DSPRDAKSIIEFKKR 310 N+P+ + S R+ + EF +R Sbjct: 949 RNTPFYEVIIQDMVPSARNDADVTEFHER 977 >UniRef50_A4UX76 Putative LPS biosynthesis protein n=2 Tax=Lactobacillaceae RepID=A4UX76_9LACO Length = 316 Score = 62.4 bits (150), Expect = 2e-08, Method: Compositional matrix adjust. 
Identities = 75/270 (27%), Positives = 114/270 (42%), Gaps = 50/270 (18%) Query: 25 SECLNVAYGVDANYLDGVGVSITSIVLN---NRHINLDFYIIADVYNDGFFQKIAKLAEQ 81 ++ + + Y VD NY + VS+ S+V + +RH + + D+ D Q K E Sbjct: 3 NQTVPIFYAVDDNYAPYLAVSLASLVAHTSPDRHYQV-IVLCDDLNTDN--QGRLKAFET 59 Query: 82 NQLRITLYRIN-------TDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVV 134 + L+I IN TDK L + ++ +YFRLF +L LD+ LYLDAD V Sbjct: 60 DNLKIQFVSINDRLKQEITDKNNKL-RSDYFTFTIYFRLFIAELFP-KLDKALYLDADTV 117 Query: 135 CKGDISQLLHLGLNGAVAAVVKD------VEPMQ--EKAVSRLSDPELLGQYFNSGVVYL 186 D+ +L L + V D E + E+AV S +Y SGV+ + Sbjct: 118 VLKDVGELFDTQLGDNLVGAVPDPFVGHTPETIDYVEQAVGIDS-----QKYVCSGVLLM 172 Query: 187 DLKKWADAKLTEKALSILMSKDNVYKY----PDQDVMNVLLKGMTLFLPREYNTIYTIKS 242 +L + K E L +L N Y + PDQD MN + + +L ++ T Sbjct: 173 NLAEMRRLKFAEHFLQLL----NKYHFKCLAPDQDYMNAIARNRIYYLNPSWHIQITTPQ 228 Query: 243 ELKDKTHQNYKKLITESTLLIHYTGATKPW 272 ++ LIHY KPW Sbjct: 229 DV--------------DPWLIHYNLFAKPW 244 >UniRef50_C8N7M8 Putative uncharacterized protein n=1 Tax=Cardiobacterium hominis ATCC 15826 RepID=C8N7M8_9GAMM Length = 618 Score = 62.4 bits (150), Expect = 2e-08, Method: Compositional matrix adjust. 
Identities = 67/274 (24%), Positives = 119/274 (43%), Gaps = 37/274 (13%) Query: 27 CLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRI 86 ++V D NY +G I SI+ H D Y+ + + G I+ L + +R+ Sbjct: 279 VVSVVIASDDNYTPHLGALICSIL---DHFPADKYLDLIILDGG----ISALNRKLLMRL 331 Query: 87 TLYRINT------DKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDIS 140 N D+ Q L +SRA ++RL +L+ D++LY+D D + DIS Sbjct: 332 LPTHANIQFLELKDEFQQLATHMHFSRATFYRLILDKLIP-GRDKVLYIDCDTIVLDDIS 390 Query: 141 QLL------------------HLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSG 182 L H LN ++ P + + + +YF +G Sbjct: 391 TLFDTPLGDHAIGAVFDYIMHHFCLNDVLSIDTTGSLPAKRYLHDYVGLEDGWQRYFQAG 450 Query: 183 VVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKS 242 V+ +++K L+E +S L++K Y + DQD++N G ++L +N++ ++++ Sbjct: 451 VILFNMEKLRRLDLSEVMISDLLNKR--YWFLDQDILNKYFLGDVVYLDPRWNSVNSVQN 508 Query: 243 ELKDKTHQNYKKLITEST--LLIHYTG-ATKPWH 273 + +L T T +IHY G TKPW+ Sbjct: 509 IYQGLPATYIAELKTTETDPKIIHYAGFETKPWN 542 >UniRef50_B1I7M9 Glycosyl transferase, family 8 n=16 Tax=Streptococcus pneumoniae RepID=B1I7M9_STRPI Length = 406 Score = 62.4 bits (150), Expect = 2e-08, Method: Compositional matrix adjust. 
Identities = 59/261 (22%), Positives = 111/261 (42%), Gaps = 39/261 (14%) Query: 29 NVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITL 88 +V + D Y+ + ++ S+ +N H+ + + + D+ + F +Q+RI L Sbjct: 7 SVVFAGDYAYIRQIETAMKSLCRHNSHLKI-YLLNQDIPQEWF----------SQIRIYL 55 Query: 89 YRINTDKLQC-LPCTQV---WSRAM-------YFRLFAFQLLGLTLDRLLYLDADVVCKG 137 + D + C L +Q WS + + R F + T D++LYLD+D++ G Sbjct: 56 QEMGGDLIDCKLIGSQFQMNWSNKLPHINHMTFARYFIPDFV--TEDKVLYLDSDLIVTG 113 Query: 138 DISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLT 197 D++ L L L A + G FN+GV+ ++ KKW + Sbjct: 114 DLTDLFELDLGENYLAAARSCFGA--------------GVGFNAGVLLINNKKWGSETIR 159 Query: 198 EKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLIT 257 +K + + + + DQ ++N+L K L +YN HQ + Sbjct: 160 QKLIDLTEKEHENVEEGDQSILNMLFKDQYSSLEDQYNFQIGYDYGAAAFKHQFIFDIPL 219 Query: 258 ES-TLLIHYTGATKPWHKWAI 277 E L++HY KPW+++++ Sbjct: 220 EPLPLILHYISQDKPWNQFSV 240 >UniRef50_C3XGD2 Putative uncharacterized protein n=1 Tax=Helicobacter bilis ATCC 43879 RepID=C3XGD2_9HELI Length = 364 Score = 62.4 bits (150), Expect = 3e-08, Method: Compositional matrix adjust. Identities = 50/182 (27%), Positives = 83/182 (45%), Gaps = 19/182 (10%) Query: 110 YFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSR- 168 YFRL L + LYLD D++C DI ++ + L G + VV + Q + R Sbjct: 103 YFRLKIASCLPQDIKTCLYLDVDMICVADIREIFYTDLQGKICGVVLVPDHQQYCVLKRN 162 Query: 169 --LSDPELL--GQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLK 224 + D + YFNSG++ +D++++ + +K L + V DQD +N +L Sbjct: 163 SAIGDEFVFNASTYFNSGLMLIDVEQYRKYNVEQKCLEWF--EQYVPVLLDQDALNAVLG 220 Query: 225 GMTLFLPREYNTIYTI----KSELKDKTHQNYKKLITESTL-------LIHYTGAT-KPW 272 LP E+N + + + K K + K+ E + ++HYTG T KPW Sbjct: 221 DHICALPLEWNFFVELLKYKRQDFKGKDNNIVMKITYEEYMQVKNNMKILHYTGWTLKPW 280 Query: 273 HK 274 + Sbjct: 281 QQ 282 >UniRef50_Q5M3K9 Glycosyl transferase n=4 Tax=Streptococcus RepID=Q5M3K9_STRT2 Length = 697 Score = 62.0 bits (149), Expect = 3e-08, Method: Compositional matrix adjust. 
Identities = 55/247 (22%), Positives = 105/247 (42%), Gaps = 29/247 (11%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLY 89 + + Y+D V +I SIV ++R N+ FY+I D ++ +F+ + + + Sbjct: 303 IVLAANYTYVDQVLTTIKSIVFHHR--NIRFYLINDDFSQEWFRGLNRHLAAFGSEVINC 360 Query: 90 RINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNG 149 R+++ ++ + A Y R F + + +R LYLD+D+V G + L L L G Sbjct: 361 RVDSSHIKQFKTNSNY--ASYLRYFVADFV--SEERALYLDSDMVVTGSLEDLFTLDLQG 416 Query: 150 AVAAVVKD--VEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSK 207 A V+D V+ +A+ F++G + +D W + + + Sbjct: 417 RPLAAVRDYAVQGQDRQAM------------FDAGFMVIDTAYWKQYNMRRHLIDMTSEW 464 Query: 208 DNVYKYPDQDVMNVLL--KGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHY 265 + + +Q ++N++ +TL Y + S Q+Y K ++HY Sbjct: 465 HDKVPFAEQSILNMVFCNNWLTLSFDNNYAVTKSSLSGYHLPNGQDYPK-------VLHY 517 Query: 266 TGATKPW 272 T KPW Sbjct: 518 TSHRKPW 524 >UniRef50_A4VVV8 Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases n=7 Tax=Firmicutes RepID=A4VVV8_STRSY Length = 334 Score = 61.2 bits (147), Expect = 4e-08, Method: Compositional matrix adjust. Identities = 54/205 (26%), Positives = 90/205 (43%), Gaps = 17/205 (8%) Query: 102 TQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPM 161 T W+ + RL +LL +DR++YLD D + +I +L + L G V + EP Sbjct: 83 TNGWASVVLARLLVDKLLPEEVDRIIYLDGDTLVLENIRELWEVDLEGKVLGMCP--EPT 140 Query: 162 QEKAVSRLSDPELLG--QYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVM 219 A S + LG Y N+GV+ +DLK+W + K+ DQD + Sbjct: 141 ---ASSERREGLNLGTYTYHNAGVLLIDLKRWRSKSIGTIIFDYYKEKNGELFANDQDAL 197 Query: 220 NVLLKG--MTLFLPREYNTIY------TIKSELKDKTHQNYKKL--ITESTLLIHYTGAT 269 N LK TL + Y I+ T++ + T + ++ I + ++H+ G Sbjct: 198 NGALKEEIKTLSITYNYFNIFDVYPYRTLEKLSRPSTFISKEEFVKIRKQPRIVHFLGEE 257 Query: 270 KPWHKWAIYPSVKYYKIALENSPWK 294 +PW + + Y AL +PW+ Sbjct: 258 RPWRIGNKHRFREDYVSALNQTPWR 282 >UniRef50_Q38VG7 Putative glycosyl transferase, family 8 n=1 Tax=Lactobacillus sakei subsp. 
sakei 23K RepID=Q38VG7_LACSS Length = 304 Score = 61.2 bits (147), Expect = 5e-08, Method: Compositional matrix adjust. Identities = 58/258 (22%), Positives = 121/258 (46%), Gaps = 24/258 (9%) Query: 42 VGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLYRINTDKLQCLPC 101 + +SI +++ + ++ +II ++ + + I L N + ++ ++ ++ Sbjct: 1 MSISIATLLKKHMEDEINIFIITSNISEKYIKVIEGLF--NNPKHNIFWVSMPEIDIPLE 58 Query: 102 TQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPM 161 T S A Y RLF +L+ + RL+YLD D + + ++ +L L + +D Sbjct: 59 TDRGSLAQYGRLFFDRLIPENIQRLIYLDCDTLIEENLRELWVTDLGENTIGIARDAFSD 118 Query: 162 QEKAVSRLS-DPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMN 220 + K + L D EL FNSGV+ +D W + ++ ++ + +L K DQ V++ Sbjct: 119 RYKKLLGLEKDSEL----FNSGVMIIDRGSWNEKRIEDRIIDLLTEKRGRISQGDQGVID 174 Query: 221 VLLKGMTLFLPREYNTI-----YTIKSELKDKTHQNY--KKLITES---TLLIHYTGA-- 268 ++ + L ++N++ +T LK + + + K+LI E+ ++H+T + Sbjct: 175 IIFQNDAKILDPKWNSMSSYFDFTYDDFLKYRQVKEFYSKQLILEAIQKPAIVHFTSSFL 234 Query: 269 -TKPWHKWAIYPSVKYYK 285 +PW I+ S YK Sbjct: 235 NNRPW----IFGSTHRYK 248 >UniRef50_C1QC00 LPS:glycosyltransferase n=1 Tax=Brachyspira murdochii DSM 12563 RepID=C1QC00_9SPIR Length = 332 Score = 60.8 bits (146), Expect = 6e-08, Method: Compositional matrix adjust. 
Identities = 68/286 (23%), Positives = 124/286 (43%), Gaps = 32/286 (11%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRH-INLDFYIIADVYNDGFFQKIAKLAEQNQLRI 86 +++ D NY +G +I SI+ N++ + F+++ + K+ L I Sbjct: 1 MDICLSADDNYAKYMGTTIASILSNSKEDEEIYFHLLDGGITEENKNKLLSLKNIKNCDI 60 Query: 87 TLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLG 146 Y +N + + +FRL L+ +D+LLYLD D + + +L + Sbjct: 61 IFYSVNNMNYK-------YDAPHFFRLNVPSLIP-NVDKLLYLDCDTIVLNSLKELFEID 112 Query: 147 LNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMS 206 ++ A +DV + + + YFNSG++ ++ K W D KL E S Sbjct: 113 ISNYYALACEDVFLNCIISFKNMHGLNVNDIYFNSGMLMINNKLWRDDKL-ENLFYDDYS 171 Query: 207 KDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTL--LIH 264 K + DQDV+N ++KG + ++N + +H+ + +L +IH Sbjct: 172 KFGNTGHADQDVLNRIIKGRVKIVDSKWNFL----------SHKKVYSKAPDISLVNIIH 221 Query: 265 YTGATKPWHK-----WAIYPSVKYYKIALENSPWKDDSPRDAKSII 305 Y G KPW + + I KYY++ +PW ++ DA I+ Sbjct: 222 YAGE-KPWKETSSKAFFIDEFWKYYQL----TPWCRENTLDAVKIM 262 >UniRef50_B2ISC6 Glycosyl transferase, family 2/glycosyl transferase family 8 n=8 Tax=Streptococcus pneumoniae RepID=B2ISC6_STRPS Length = 696 Score = 60.8 bits (146), Expect = 6e-08, Method: Compositional matrix adjust. 
Identities = 68/278 (24%), Positives = 114/278 (41%), Gaps = 31/278 (11%) Query: 18 RLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAK 77 +L+ SE + + Y+D V +I SI +NR I FY+I + + + +++ K Sbjct: 293 QLSRQEESEKKAIVLAANYAYVDQVLTTIKSICYHNRSIR--FYLIHSDFPNEWIKQLNK 350 Query: 78 LAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKG 137 E+ I R+ ++++ C S ++ R F + D+ LYLD D+V Sbjct: 351 RLEKFDSEIINCRVTSEQISCYKSD--ISYTVFLRYFIADFV--QEDKALYLDCDLVVTK 406 Query: 138 DISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQ-YFNSGVVYLDLKKWADAKL 196 ++ L L A V+D GQ FN+GV+ ++ W + Sbjct: 407 NLDDLFATDLQDYRLAAVRD-----------FGGRAYFGQEIFNAGVLLVNNAFWKKENM 455 Query: 197 TEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKD---KTHQNYK 253 +K + + + DQ ++N+L + L L +YN I I + D Q+Y Sbjct: 456 IQKLIDVTNEWHDKVDQADQSILNMLFEHKWLELDFDYNHI-VIHKQFADYQLPEGQDYP 514 Query: 254 KLITESTLLIHYTGATKPWHKWA--IYPSVKYYKIALE 289 +IHY KPW A Y V +Y LE Sbjct: 515 A-------IIHYLSHRKPWKDLAAQTYREVWWYYHGLE 545 >UniRef50_C5S3F7 Putative glycosyl transferase n=2 Tax=Actinobacillus minor RepID=C5S3F7_9PAST Length = 275 Score = 59.7 bits (143), Expect = 2e-07, Method: Compositional matrix adjust. 
Identities = 65/262 (24%), Positives = 120/262 (45%), Gaps = 32/262 (12%) Query: 33 GVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLYRIN 92 D + + + +I SI +N NL ++ ++ +F+ + Q I ++N Sbjct: 16 AADIKFAEQLETTIKSICYHNA--NLYIVLLNRDFSKEWFEYLNTYLNQINCEIIDVKVN 73 Query: 93 TDKLQ---CLPCTQVWSRAMYFRLF--AFQLLGLTLDRLLYLDADVVCKGDISQLLHLGL 147 ++L+ LP + S + +FR F AF + D++LYLD D+V G +S L L Sbjct: 74 CNQLEEYKTLP--HISSASTFFRYFIPAF----VNDDKVLYLDCDLVVNGSLSIFFDLEL 127 Query: 148 NGA-VAAVVKDVE-PMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILM 205 N VAA + D+ +K ++FN+GV+ ++ K W ++T KAL + Sbjct: 128 NDHYVAASLDDIAFNFHQK------------KHFNAGVLLINNKLWRKQEITLKALELTD 175 Query: 206 SKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSEL---KDKTHQNYKKLITESTLL 262 + + DQ+V+N+L + + L N Y + +E ++ Q ++ + L+ Sbjct: 176 RLNEKLEEGDQEVLNILFQNKWIELNPYLN--YLVGAEYLYRRNGVTQYIRRQEDDVPLI 233 Query: 263 IHYTGATKPWHKWAIYPSVKYY 284 +H+ KPW P +YY Sbjct: 234 LHFNTKYKPWLPIDGVPFREYY 255 >UniRef50_Q0I2Z7 Lipopolysaccharide biosynthesis protein, glycosyltransferase, family 8 n=1 Tax=Haemophilus somnus 129PT RepID=Q0I2Z7_HAES1 Length = 354 Score = 59.3 bits (142), Expect = 2e-07, Method: Compositional matrix adjust. 
Identities = 81/356 (22%), Positives = 142/356 (39%), Gaps = 73/356 (20%) Query: 28 LNVAYGVDANYLDGVGVSITSIVL----NNRHINLDFYIIADVYNDGFFQKIAK-----L 78 +N+ + D NY + V++ SI+ NN + FY++ D +AK L Sbjct: 1 MNILFACDDNYAKYLAVTMLSIIHARDKNNECYTIHFYLL-----DMGISTVAKDYCLEL 55 Query: 79 AEQNQLRITLYRINTDKLQCLPCT-QVWSRAMYFRL-FAFQLLGLTLDRLLYLDADVVCK 136 A +N + + I+ + P T + S + Y RL A L L +++YLD D++ Sbjct: 56 ANKNNCHLDIVPISISDFEKFPRTIEYISLSTYARLNLANYLKKFNLTKIIYLDIDILVN 115 Query: 137 GDISQLLHLGL-NGAVAAVVKDVEPMQEKAV----------------------------- 166 + L + L N A+ A QEK+ Sbjct: 116 HSLLPLWNTDLGNKAIGACYDAFIESQEKSKRMSSQSVSQSVSQSVSQSVSQSVSQSVSQ 175 Query: 167 ----------------SRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSIL--MSKD 208 ++L P YFN+GV+ +++ +W + EK+L + ++ Sbjct: 176 SVSQSVSQSVSQSDYKTKLHLPNT-HFYFNAGVLLINVVEWEKCHVFEKSLQWIEYCKRN 234 Query: 209 NV-YKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELK--DKTHQNYKKLITESTLLIHY 265 N+ + Y DQD++N + +L YN + LK K N + T +IHY Sbjct: 235 NIEFLYQDQDILNAIFANNVKYLDLRYNFTANALNRLKRVSKKELNQYEEATMPLAIIHY 294 Query: 266 TGATKPWH-KWAIYPSVKYYKI--ALENSP--WKDDSPRDAKSIIEFKKRYKHLLV 316 G K WH K ++ + + + LEN P WK ++ + + F K +H ++ Sbjct: 295 VGPKKSWHEKCSMLKANLFCHLFQQLENPPKEWKIENVPFIRKLKRFAKDLRHKII 350 >UniRef50_D2LIH7 Glycosyl transferase family 8 n=1 Tax=Rhodomicrobium vannielii ATCC 17100 RepID=D2LIH7_RHOVA Length = 391 Score = 59.3 bits (142), Expect = 2e-07, Method: Compositional matrix adjust. 
Identities = 37/183 (20%), Positives = 87/183 (47%), Gaps = 20/183 (10%) Query: 123 LDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVE--PMQEKAVSRLSDPEL------ 174 D+++Y+DAD + D++ L + ++G A V+D Q K + + ++ Sbjct: 140 FDKVVYIDADTITNRDLADLYDIDVDGYYIAAVRDFAMIATQNKKMLDIVGKKIYYETYV 199 Query: 175 --------LGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGM 226 + YFNSG+V ++ K ++++E+ ++++ +K ++ Y DQD++N++ + Sbjct: 200 KDYLGLIGISNYFNSGLVLFNINKINGSQISERLIALIGTK--LFAYVDQDILNIVFENK 257 Query: 227 TLFLPREYNTIYTIKSELKDKTHQNYKKLITESTL--LIHYTGATKPWHKWAIYPSVKYY 284 + +N + + Y + + ++HY G KPW+ ++ + Y+ Sbjct: 258 VKLIDYSWNMVIDCERLYHLSEPDLYARYLDAGAAPHVVHYIGGNKPWNDPTVHMAEYYW 317 Query: 285 KIA 287 + A Sbjct: 318 RYA 320 >UniRef50_A3V3C9 Putative uncharacterized protein n=1 Tax=Loktanella vestfoldensis SKA53 RepID=A3V3C9_9RHOB Length = 324 Score = 58.5 bits (140), Expect = 3e-07, Method: Compositional matrix adjust. Identities = 59/266 (22%), Positives = 109/266 (40%), Gaps = 26/266 (9%) Query: 29 NVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLR--- 85 +V + D +YL ++I +++ NN + D I + + L +L+ Sbjct: 17 SVIFCADQSYLPFASLAIHTLLRNNPVRDYDICIAS----------VDALVPPTELKDHD 66 Query: 86 ITLYRINT-DKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGD-ISQLL 143 I +I+ + +P ++ +S A Y R+ + DR+ YLDADV GD I + Sbjct: 67 IRFCQIDVGNAFDGMPVSKRFSLAAYLRIALPEAFAGQYDRIFYLDADVFVVGDAIDAVF 126 Query: 144 HLGLNGAVAAVVKDVEPMQEKAVSRLSDPELL--GQYFNSGVVYLDLKKWADAKLTEKAL 201 L + V D+ ++ L G YFNSGV+ D++++ ++ E+ Sbjct: 127 RLDMLSCPVGAVTDITKLKHPNKPTFDQKALGVDGPYFNSGVMLFDVERFITMRVRERCA 186 Query: 202 SILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTL 261 Y DQ ++N++L+ L +N + L ++ I Sbjct: 187 EAAKFYQGEPIYFDQTLLNIVLQKEWAQLNLGWNWQWPFSRSL-------FECFI--DVQ 237 Query: 262 LIHYTGATKPWHKWAIYPSVKYYKIA 287 ++H+ G KPW +KY + A Sbjct: 238 IVHFIGDDKPWSDHKRRLPLKYRETA 263 >UniRef50_A3YS36 Putative sugar transferase n=2 Tax=Campylobacter jejuni RepID=A3YS36_CAMJE Length = 459 Score = 58.5 bits (140), Expect = 3e-07, Method: Compositional matrix adjust. 
Identities = 58/222 (26%), Positives = 104/222 (46%), Gaps = 20/222 (9%) Query: 29 NVAYGVDANYLDGVGVSITSIVLNNRHINLD---FYIIADVYNDGFFQKIAKLAEQ---- 81 ++ + Y++ + V + SI++N N F+I++ ND +K+ L ++ Sbjct: 3 HIVFNSSNEYIENLSVLMYSIIINTNKSNTKKYCFHILSSNINDNTCKKLTLLEKELSSI 62 Query: 82 NQLRITLYRINTDKLQ--CLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDI 139 I +Y IN + +P + S Y RL +L + + LYLD D++ GDI Sbjct: 63 YPSEIKIYHINDNLFYDYNIPKHE-GSYNAYLRLMLASILSKDIKKCLYLDVDMLVLGDI 121 Query: 140 SQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDP--ELLGQYFNSGVVYLDLKKWADAKLT 197 S+L L L V A V ++ S+ S + G +FNSG++ ++L W + + Sbjct: 122 SELFDLDLKDKVFAAVFILKHPWPNLNSKDSSEIFYIYGSHFNSGLMLINLDAWREKNIE 181 Query: 198 EKALSILMSKDNVYKYP---DQDVMNVLLKGMTLF-LPREYN 235 ++LS + + Y P D+ V+N +L +F L E+N Sbjct: 182 SRSLSFIKN----YYVPYAVDEYVLNAILSKDDIFSLKLEWN 219 >UniRef50_B0NR59 Putative uncharacterized protein n=1 Tax=Bacteroides stercoris ATCC 43183 RepID=B0NR59_BACSE Length = 306 Score = 58.5 bits (140), Expect = 3e-07, Method: Compositional matrix adjust. Identities = 66/262 (25%), Positives = 115/262 (43%), Gaps = 28/262 (10%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHINL-DFYII--ADV--YNDGFFQK--IAKLAEQN 82 + + +D NY+ GV I S+++N+ D YI+ AD+ ++ K A A+ N Sbjct: 6 IVFSIDHNYVMQAGVCILSLLMNSDEKEYYDIYILSAADITEHDKELLNKTIFAYKADIN 65 Query: 83 QLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQL 142 + I D+ + S+A YFRL L+ D+++Y D DV+ + + ++ Sbjct: 66 FIEID------DRFDNAFEIRNISKAAYFRLLIPDLIP-QYDKIIYSDVDVIFQSGLQEV 118 Query: 143 LHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALS 202 L L +K + K + G Y NSG + ++ K + +L K Sbjct: 119 LDTDLKDNYFGGIKAIGAESIKDYIIQLGLNIHG-YINSGFLLINAKLQREKQLFNKIQE 177 Query: 203 ILMSKDNVYKYPDQDVMNVLLKGMTLFLP-------REYNTIYTIKSELKDK-THQNYKK 254 L K +++ DQD++N++ K FLP + Y YT L + + ++ Sbjct: 178 YLTKK---FQFQDQDIINIVCKNRLTFLPLKYCFTQKSYELYYTNPKRLFSVFSPKEVEE 234 Query: 255 LITESTLLIHYTGATKPWHKWA 276 TE +IHY G KPW+ + Sbjct: 235 AFTEG--IIHYEGTNKPWNGFC 254 >UniRef50_B6GCA0 Putative uncharacterized protein n=1 Tax=Collinsella 
stercoris DSM 13279 RepID=B6GCA0_9ACTN Length = 990 Score = 58.2 bits (139), Expect = 4e-07, Method: Compositional matrix adjust. Identities = 49/205 (23%), Positives = 90/205 (43%), Gaps = 38/205 (18%) Query: 110 YFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRL 169 Y+R F Q L D++LYLD+D++ +GD+S+L L ++ A D++ + + R Sbjct: 734 YYR-FLIQDLLPYYDKVLYLDSDLIIRGDVSELFATDLGDSLLAAAHDIDFVANVNMKRG 792 Query: 170 SD----PELLG-----QYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMN 220 E+LG YF +GV+ L+ + E+ L + D+ + Y DQDV+N Sbjct: 793 DRFAYAKEVLGMKDPYSYFQAGVLVLNTRAMRSRHTMEEWLE--FASDDRFIYNDQDVLN 850 Query: 221 VLLKGMTLFLPREYNTI----------------YTIKSELKDKTHQNYKKLITESTLLIH 264 +G ++L +N + Y + ++ ++++ ++H Sbjct: 851 AHCEGEVVYLDYSWNVMIDCFGRINKVFTFAPAYMFDAFIESRSNEK----------IVH 900 Query: 265 YTGATKPWHKWAIYPSVKYYKIALE 289 Y G KPW Y++ A E Sbjct: 901 YAGFEKPWKLAGCDRGELYWRYARE 925 >UniRef50_Q1RIL1 Lipopolysaccharide 1,2-glucosyltransferase RfaJ n=10 Tax=Rickettsia RepID=Q1RIL1_RICBR Length = 530 Score = 58.2 bits (139), Expect = 4e-07, Method: Compositional matrix adjust. Identities = 52/203 (25%), Positives = 89/203 (43%), Gaps = 23/203 (11%) Query: 102 TQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVE-P 160 + +W + +RL+ Q+ L+ +LYLDAD++ D++ L ++ + A D Sbjct: 334 SDMWPPLVMYRLYFDQVFP-NLESILYLDADIIVLRDLNSFKKLDMSNYIVAGSMDTALT 392 Query: 161 MQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMN 220 V + ++ Y NSG+V+L+L+ + + L + + + YPDQD++N Sbjct: 393 YCTLKVEEECNRKINNFYKNSGIVFLNLQNMREKQAKNMVLDAMHNSKCSFAYPDQDLLN 452 Query: 221 VLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTGATKPWH----KWA 276 + L +N YT + NY S ++HY G KPW+ KW Sbjct: 453 IAFHNYIYPLSMRWN-FYTYFID-----RDNYF-----SYFIMHYAGKKKPWNNEEIKWT 501 Query: 277 -----IYPSV-KYYKIALENSPW 293 Y + KYY E +PW Sbjct: 502 KDILEKYQEIEKYYWRYREFTPW 524 >UniRef50_UPI0001B55E75 hypothetical protein SSPB78_11600 n=1 Tax=Streptomyces sp. 
SPB78 RepID=UPI0001B55E75 Length = 792 Score = 57.8 bits (138), Expect = 6e-07, Method: Compositional matrix adjust. Identities = 71/276 (25%), Positives = 114/276 (41%), Gaps = 48/276 (17%) Query: 5 PAIEIDKVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIA 64 PA + + A D R ++ + A VD NYL G + S+ L+N + DF ++ Sbjct: 16 PAAPVREAAADDVR--DLTGKRRVAFASFVDENYLPGFLALLRSLALSNPEVCEDFLVLH 73 Query: 65 DVYNDGFFQKIAKLAEQNQLRITLYRINTDKLQCLP---CTQVWSRAMYFRLFAFQLLGL 121 D +I L RI R++ + R YF L F++ Sbjct: 74 DGLRPASLARIRAL----HPRIRPRRVDAARYDAYAKGDQNNYLVRKAYFLLDVFRVR-- 127 Query: 122 TLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELL-GQY-F 179 D ++ LD D+V GD+S+LL L +AAV P+ G + Sbjct: 128 DYDTIITLDTDMVVLGDLSELLRL--REGLAAV-----------------PQFFYGTHKL 168 Query: 180 NSGVVYLDLKKWADA---KLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNT 236 NSG++ + + +DA ++ E L+ D DQ ++N +L G + LP YN Sbjct: 169 NSGLLVIQREFLSDAFCERIDETGLAGAYELDKH----DQGILNAVLDGDFVRLPARYNF 224 Query: 237 IYTIKSELKDKTHQNYKKLITESTLLIHYTGATKPW 272 + K + K + E T ++H+TG KPW Sbjct: 225 V---------KRRLSGDKPVPEDTAVLHFTGRHKPW 251 >UniRef50_C7XX93 Glycosyl transferase, family 8 n=1 Tax=Lactobacillus coleohominis 101-4-CHN RepID=C7XX93_9LACO Length = 398 Score = 57.4 bits (137), Expect = 8e-07, Method: Compositional matrix adjust. 
Identities = 66/245 (26%), Positives = 102/245 (41%), Gaps = 27/245 (11%) Query: 35 DANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLYRINTD 94 D +Y + +I SIV + R + + + I +D+ + FF +L + + + L +IN + Sbjct: 12 DNHYTAQITTTIKSIVYHLRRVKI-YLINSDIPQEYFFNLNLRLKQLDSELVDL-KINPE 69 Query: 95 KLQCLPCTQVW-SRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAA 153 + S+ Y RL QL+ T DR LY+D+D + IS+L + L A Sbjct: 70 LFSNAESPKAHISKITYGRLMIPQLV--TEDRALYIDSDAIVDQSISELWTMDLGDYPIA 127 Query: 154 VVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKW-ADAKLTEKALSILMSKDNVYK 212 V DV L FN+G++ + KK D L + L+ K + Sbjct: 128 AVHDV---------------FLADIFNAGIILFNNKKLREDPDLVDNMLAAAQQKGILDA 172 Query: 213 YPDQDVMNVLLKGMTLFLPREYNTI--YTIKSELKDKTHQNY--KKLITESTLLIHYTGA 268 DQ V+N L L EYN + Y L + Y K L +IHY Sbjct: 173 --DQTVLNQFFNHQYLELGLEYNYVIGYDRDVSLAPRNAPGYFEKMLNCPQPKIIHYASP 230 Query: 269 TKPWH 273 KPW+ Sbjct: 231 DKPWN 235 >UniRef50_A5EVI8 Glycosyl transferase family 8 protein n=1 Tax=Dichelobacter nodosus VCS1703A RepID=A5EVI8_DICNV Length = 617 Score = 56.6 bits (135), Expect = 1e-06, Method: Compositional matrix adjust. 
Identities = 66/273 (24%), Positives = 122/273 (44%), Gaps = 37/273 (13%) Query: 27 CLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGF-FQKIAKLAEQNQLR 85 ++V D +Y+ +G I SI+ H++ D ++ + + G F +LA R Sbjct: 279 AVSVVIAADEHYVPHLGALICSII---DHLSCDAFLDLIILDGGIDFISQKQLAHLLGKR 335 Query: 86 ITLYRIN-TDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLH 144 + ++ +D+ +SRA ++RL +L+ + R+LY+D D + D+++L Sbjct: 336 GAIQFLDLSDEFTDQKVHMHFSRATFYRLILDKLI-IDRKRVLYIDCDTIVLADLAELFA 394 Query: 145 LGLNG-AVAAV------------VKDVE-----PMQEKAVSRLSDPELLGQYFNSGVVYL 186 LNG A+ AV V+ +E P ++ + E YF +GV+ Sbjct: 395 TDLNGKAIGAVFDYIMHHFCQVGVRSIEFTNYLPAKKYLEDYVGLKENWRHYFQAGVILF 454 Query: 187 DLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKD 246 DL++ +K ++ L K Y + DQD++N G FL +N + + +++ + Sbjct: 455 DLEQLRTLNYADKMIASLTEKR--YWFLDQDILNKYFVGNVHFLNPCWNVV-NVGADIYE 511 Query: 247 KTHQNYKKLITE------STLLIHYTG-ATKPW 272 +LI E + +IHY G KPW Sbjct: 512 GLS---AELIAELKAAERAPAIIHYAGYEAKPW 541 >UniRef50_B8PIH6 Predicted protein n=2 Tax=Agaricomycetes RepID=B8PIH6_POSPM Length = 532 Score = 56.6 bits (135), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 47/174 (27%), Positives = 78/174 (44%), Gaps = 14/174 (8%) Query: 121 LTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFN 180 L ++R+LYLDADV+ + DI L L G DV E + P YFN Sbjct: 311 LPVERVLYLDADVLVRADIWGLWSTDLRGKPIGAAIDVG-FPEGHNGTVRKP-----YFN 364 Query: 181 SGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTI-YT 239 +GV+ LDL A + T +AL + ++ DQD++N + + ++N Sbjct: 365 AGVLLLDL---AAVRRTLQALQGAAREYTTSRFRDQDLLNAYFEANWAEVSLKWNAQGIA 421 Query: 240 IKSELKDKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPW 293 +EL + QN + ++ ++H+TG P + + P ++ Y PW Sbjct: 422 TYAELPTEARQNIDMGLLKNPYIVHFTGPVNPTLEVVLNPYIQPYTA----KPW 471 >UniRef50_C4ZG45 Glycosyltransferase Family 2 modular protein n=1 Tax=Eubacterium rectale ATCC 33656 RepID=C4ZG45_EUBR3 Length = 723 Score = 56.6 bits (135), Expect = 1e-06, Method: Compositional matrix adjust. 
Identities = 58/246 (23%), Positives = 111/246 (45%), Gaps = 11/246 (4%) Query: 27 CLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIAD-VYNDGFFQKIAKLAEQNQLR 85 CL + + D NY G ++ SIV N + + F+I+ D N+ K++ +A+ + Sbjct: 346 CLGI-HDKDGNYSVWAGTTMQSIVENTK-APIVFHILHDDTLNEMNKNKLSLIADNSGNG 403 Query: 86 ITLYRINTDKLQCLP-CTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLH 144 I + N D L ++ FR+ ++ L +++YLD+D+ DI +L + Sbjct: 404 IEFHHFNPDIFGSLADSMNRFTIGTMFRIMLPDIMP-DLKKIIYLDSDLFVNTDIEELWN 462 Query: 145 LGLNGAVAAVVKDVEPMQEKAV--SRLSDPELLGQYFNSGVVYLDLKK-WADAKLTEKAL 201 L ++ A +D ++ + + +YFN+GV+ ++L + L ++ + Sbjct: 463 LNIDNYCLAAAQDCSTIRNWGTPYAVAAGQTSRDRYFNAGVLCMNLDNIRKNGSLFQQVM 522 Query: 202 SILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTL 261 L + PDQD +N + G TL + ++N Y I K+ K +TL Sbjct: 523 DYLSDNPRTW-LPDQDALNAIFSGKTLLIDEKWN--YFIDEARKNNEKAEKKIYHYAATL 579 Query: 262 LIHYTG 267 L+ +T Sbjct: 580 LMLHTN 585 >UniRef50_Q062P6 DNA mismatch repair protein n=1 Tax=Synechococcus sp. BL107 RepID=Q062P6_9SYNE Length = 281 Score = 56.6 bits (135), Expect = 1e-06, Method: Compositional matrix adjust. 
Identities = 52/218 (23%), Positives = 95/218 (43%), Gaps = 15/218 (6%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 +++ +D + V++TS +L++R ++ AD ++ +A + Sbjct: 1 MHLLLALDQGFEPLAAVALTSYLLHHRFSSVVLVTPADQR----MHQLEGIAASFECPCR 56 Query: 88 LYRINTDK-LQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLG 146 RI T+ L LP + +F + A Q R LY+DAD +C + L L Sbjct: 57 HQRIATESALHRLPAD---LQPYFFCIEALQ--QREPGRYLYVDADTLCVAGLETLEQLP 111 Query: 147 LNGAVA-AVVKDVEPMQEKA-VSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSIL 204 L G A PM ++ V L P YFN+G++ D + L E+ + Sbjct: 112 LGGTTPLAACSHGRPMPDRTLVLGLEGPY---HYFNAGILLFDSVSLNEVLLPEQVVDYY 168 Query: 205 MSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKS 242 + + + ++ +Q +N LL G FLP +YN + +++ Sbjct: 169 LQHEALCRFREQCSLNALLSGQVQFLPGQYNVLSWMRA 206 >UniRef50_Q17VR5 Lipopolysaccharide biosynthesis protein n=19 Tax=Helicobacter RepID=Q17VR5_HELAH Length = 405 Score = 56.2 bits (134), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 47/195 (24%), Positives = 92/195 (47%), Gaps = 33/195 (16%) Query: 124 DRLLYLDADVVCKGDISQLLHLGLN----GAVAA---------VVKDVEPMQEK------ 164 D+++ D D + GDIS+ + L GAV KD+ ++++ Sbjct: 145 DKMIMFDVDTLFVGDISESFFIPLEAHYFGAVREKDLIAMNRNSAKDLYELRQRRAKSIG 204 Query: 165 ---AVSRLSDPELL-GQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMN 220 A L + ++L YFN+G + L+LK W L + + + K+ + DQD + Sbjct: 205 VANAFPNLEEAQILFDNYFNAGFLALNLKLWRKENLENQLIGFFILKNEKLLFNDQDALC 264 Query: 221 VLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPS 280 + +G L LP YN S L + + K++ ++H+ G KPW ++++ + Sbjct: 265 FVCRGRILELPYPYN---AHPSFLDTPSFPSIKEVC-----MLHFWG-DKPWKIFSVFGA 315 Query: 281 VKYYKIALENSPWKD 295 K++++ ++ +P+KD Sbjct: 316 KKWHEVLMQ-TPFKD 329 >UniRef50_UPI0001693121 general stress protein n=1 Tax=Paenibacillus larvae subsp. larvae BRL-230010 RepID=UPI0001693121 Length = 352 Score = 56.2 bits (134), Expect = 2e-06, Method: Compositional matrix adjust. 
Identities = 71/292 (24%), Positives = 122/292 (41%), Gaps = 46/292 (15%) Query: 35 DANYLDGVGVSITSIVLNNRHINLDFYIIAD-VYNDGFFQKIAKLAEQNQLRITLY--RI 91 D Y + G + S+ N +++ +I+ D + QK+ +L I Y I Sbjct: 12 DGAYAEHAGAVLASVFCNTSS-SVNVHILHDETLTEANKQKLIELTSSFNQTIHFYPVTI 70 Query: 92 NTDKLQCLPCTQ---VWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLN 148 + LQ + + W++A +RL L+ +D+++YLD DV+ +I++L + L Sbjct: 71 PDNMLQAMAGVKSISFWTQASMYRLLIPALI--PVDKIIYLDCDVLVNMNIAELWEVQLG 128 Query: 149 GAVAAVVKDVEPMQ--EKAVSRLSDPELLGQYFNSGVVYL---DLKKWADAKLTEKALSI 203 A V D M + + +P+ YFNSGV+ +++K D E+ L+ Sbjct: 129 DFYLAAVWDQAIMAAVQHIIPYGLNPD---SYFNSGVILFALNNIRKKID--WYEEMLNF 183 Query: 204 LMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLI 263 L + PDQD +N + L L R +N + H ++ I + Sbjct: 184 LRRYPDT-SMPDQDTLNAVFGENYLQLDRRFNFFNMVSP------HHDFNNKI------V 230 Query: 264 HYTGATKPWHKWAIYPSVKYYKIALENSPWKD------------DSPRDAKS 303 H+ G+ K W + P Y+ L +PWK D RD+KS Sbjct: 231 HFAGSEKCWDVHS--PGANLYQEYLSLTPWKKHTDETSMGVHPLDGQRDSKS 280 >UniRef50_B1I7N1 Glycosyl transferase, family 8 n=8 Tax=Streptococcus pneumoniae RepID=B1I7N1_STRPI Length = 817 Score = 55.8 bits (133), Expect = 2e-06, Method: Compositional matrix adjust. 
Identities = 68/256 (26%), Positives = 111/256 (43%), Gaps = 49/256 (19%) Query: 35 DANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLYRINTD 94 D NY+ + +I SI+ +NR + + YI+ +F+K K+A R+ I Sbjct: 10 DRNYIRQLETTIKSILYHNRDVKI--YILNQDIMPDWFRKPRKIA-----RMLGSEIIDV 62 Query: 95 KLQCLPCTQVW------SRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLN 148 KL Q W S Y R F + D++LYLD+D++ + +L + L Sbjct: 63 KLPEQTVFQDWEKQDHISSITYARYFIADYI--QEDKVLYLDSDLIVNTSLEKLFSICLE 120 Query: 149 GAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKAL--SILMS 206 A VKD + G FN+GV+ ++ KKW KL E+ + SI+ Sbjct: 121 EKSLAAVKDTD----------------GITFNTGVLLINNKKWRQEKLKERLIEQSIVTM 164 Query: 207 K---DNVYKY--PDQDVMNVLLKGMTLFLPREYN-----TIYTIKSELKDKTHQNYKKLI 256 K + +++ DQ + N +L+ L L R YN I + + ++ N K ++ Sbjct: 165 KEVEEGRFEHFNGDQTIFNQVLQDDWLELGRAYNLQVGHDIVALYNNWQEHLAFNDKPVV 224 Query: 257 TESTLLIHYTGATKPW 272 IH+T KPW Sbjct: 225 ------IHFTTYRKPW 234 >UniRef50_O48684 F3I6.10 protein n=46 Tax=Embryophyta RepID=O48684_ARATH Length = 393 Score = 53.9 bits (128), Expect = 8e-06, Method: Compositional matrix adjust. 
Identities = 61/272 (22%), Positives = 112/272 (41%), Gaps = 29/272 (10%) Query: 23 NTSECLNVAYGVDANYLDGVGVSITSIVLNNRHI----NLDFYIIADVYNDGFFQKIAKL 78 N +++A +D+ YL G ++ S++ RH N+ F+ IA ++ + +++L Sbjct: 80 NDPSLVHIAMTLDSEYLRGSIAAVHSVL---RHASCPENVFFHFIAAEFDSASPRVLSQL 136 Query: 79 AEQN--QLRITLYRINTDKLQCLPCTQVW----SRAMYFRLFAFQLLGLTLDRLLYLDAD 132 L +Y D + L + + + Y R + +L +++R++YLD+D Sbjct: 137 VRSTFPSLNFKVYIFREDTVINLISSSIRLALENPLNYARNYLGDILDRSVERVIYLDSD 196 Query: 133 VVCKGDISQLLHLGLNGAVAAVVKD---VEPMQEKAVSRLSDPELLGQ-------YFNSG 182 V+ DI++L + L G+ + Q SDP L G YFN+G Sbjct: 197 VITVDDITKLWNTVLTGSRVIGAPEYCHANFTQYFTSGFWSDPALPGLISGQKPCYFNTG 256 Query: 183 VVYLDLKKWADAKLTEK--ALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTI 240 V+ +DL +W + EK L K +Y ++ G + +N Sbjct: 257 VMVMDLVRWREGNYREKLEQWMQLQKKMRIYDLGSLPPFLLVFAGNVEAIDHRWNQ---- 312 Query: 241 KSELKDKTHQNYKKLITESTLLIHYTGATKPW 272 D + + L L+H++G KPW Sbjct: 313 HGLGGDNIRGSCRSLHPGPVSLLHWSGKGKPW 344 >UniRef50_C0X9Z8 Glycosyltransferase n=1 Tax=Lactobacillus gasseri JV-V03 RepID=C0X9Z8_9LACO Length = 675 Score = 53.9 bits (128), Expect = 8e-06, Method: Compositional matrix adjust. 
Identities = 55/250 (22%), Positives = 115/250 (46%), Gaps = 31/250 (12%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLY 89 +A D N L+ + ++ SI L+N+H+ + +II +F + + Q +I Sbjct: 4 IALDADVNDLNKIETTLKSIFLHNQHVEI--HIINFNIPHEWFINVNQYVNQFGSKIIDE 61 Query: 90 RINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNG 149 +I+ + L + + + + F F L ++ D++LYLD+D++ ++ + + + Sbjct: 62 KIDPNFLGDVQPSSDQIKKISFGRFLIPDL-ISADKVLYLDSDLIVTDNLQSIFQMNFDD 120 Query: 150 AVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDN 209 + V D + +P+ FNSGV+ ++ K+W + K++ K + MSK Sbjct: 121 KMLFAVHDYQ-----------NPD----QFNSGVMLINNKRWREEKVSSKLIE--MSKQQ 163 Query: 210 VYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITES------TLLI 263 DQ V+N + K E N Y + L+ + N K+++ ++ +I Sbjct: 164 ALA-SDQAVINEVFKNQI----GELNLSYNYQIGLEKNAYWNNKQVVFDNYNRVPIPRII 218 Query: 264 HYTGATKPWH 273 +Y+G P++ Sbjct: 219 NYSGDDNPFN 228 >UniRef50_B9KUH7 Lipopolysaccharide 3-alpha-galactosyltransferase n=1 Tax=Rhodobacter sphaeroides KD131 RepID=B9KUH7_RHOSK Length = 304 Score = 52.8 bits (125), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 46/171 (26%), Positives = 74/171 (43%), Gaps = 25/171 (14%) Query: 125 RLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDV---------EPMQEKAVSRLSDPELL 175 R+LYLD DV D+S L L + G A V+D EP++ + +R+ + Sbjct: 106 RVLYLDGDVRVVDDLSPLFSLDMRGFPLAGVRDYVVSKRLARGEPVKVRNRARIEEEARC 165 Query: 176 ------GQYFNSGVVYLDLKKWADAKLTEKAL-SILMSKDNVYKYP--DQDVMNVLLKGM 226 YFN+GV+ LD A A + +L S + D K+ DQD +N + G Sbjct: 166 MSGADASTYFNAGVLLLD----ASAIAADHSLCSAMQDLDRASKWTLGDQDHLNNVFAGR 221 Query: 227 TLFLPREYNTIYTIKSELK---DKTHQNYKKLITESTLLIHYTGATKPWHK 274 + YN+ ++ + ++ +L +IH+ G KPW K Sbjct: 222 VRLIDPAYNSSWSRTPRQRRYVERLGPAPAELTYAPDAIIHFHGPAKPWKK 272 >UniRef50_C3XKY2 Glycosyl transferase n=2 Tax=Campylobacterales RepID=C3XKY2_9HELI Length = 433 Score = 52.8 bits (125), Expect = 2e-05, Method: Compositional matrix adjust. 
Identities = 59/234 (25%), Positives = 105/234 (44%), Gaps = 26/234 (11%) Query: 60 FYIIADVYNDGFFQKIAKLAEQNQL------RITLYRINTDKLQCLPCTQVW--SRAMYF 111 F+I++D + ++ +L QN L I + IN + + P + + Y+ Sbjct: 72 FHILSDSISSTTQNQLTEL--QNTLNTIYPCEILTHIINDKEFENFPISGAAHSNHLPYY 129 Query: 112 RLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSD 171 RL L ++ + LYLD+D++C D+ +L + L V A + D + K + + Sbjct: 130 RLKLDSYLDDSITKCLYLDSDMLCLCDLRELFAIDLKDFVVAAINDPGTKKRKIKYKENG 189 Query: 172 PELL----GQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLL-KGM 226 +++ YFNSG + ++ + + K+ EK + L K K DQD++N + K Sbjct: 190 KKMILNFNDNYFNSGFLLINTQNYKQHKIQEKCEN-LAKKCYYIKAADQDLLNATIPKEK 248 Query: 227 TLFLPREYN------TIYTIKSELKDKTHQNYKKLIT--ESTLLIHYTGATKPW 272 L LP YN I K E K + + + + ++ +IHY KPW Sbjct: 249 LLKLPIAYNFSSISFCIAICKDEQKHRLNCTRAEFMESYKNPKIIHY--GEKPW 300 >UniRef50_B9ADW8 Putative uncharacterized protein n=1 Tax=Methanobrevibacter smithii DSM 2375 RepID=B9ADW8_METSM Length = 223 Score = 52.4 bits (124), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 47/158 (29%), Positives = 78/158 (49%), Gaps = 13/158 (8%) Query: 105 WSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQE- 163 +S A Y +LF LL T+D+++YLD D + ++L+L LN +AA V + E Sbjct: 49 FSLATYSKLFIASLLPETVDKVIYLDCDALVLDSFKEILNLDLNNYLAAGVLALNCTAEV 108 Query: 164 KAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKY--PDQDVMNV 221 K L++ +L Y N+G++ ++LK+W + + L L+ + K+ DQ V+N Sbjct: 109 KKAIDLNEDDL---YINAGMLLINLKRWRQENVENQFLEKLVEFNLRGKHFGMDQGVINN 165 Query: 222 LLKGMTLFLPREY-------NTIYTIKSELKDKTHQNY 252 + L L +Y NT Y I +L +NY Sbjct: 166 VSSKNLLVLNPKYNLEGSLHNTGYDITFKLNGNIQKNY 203 >UniRef50_C0X9Z7 Putative uncharacterized protein n=2 Tax=Lactobacillus gasseri JV-V03 RepID=C0X9Z7_9LACO Length = 416 Score = 52.0 bits (123), Expect = 3e-05, Method: Compositional matrix adjust. 
Identities = 62/260 (23%), Positives = 114/260 (43%), Gaps = 34/260 (13%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLY 89 +A + Y+D + +I SI+ N + N++ +++ +F I + A Q RI Sbjct: 5 IALSANYGYIDKIETTIKSILYNVK--NVEIHLLNYDIPQEWFANINRYANQIGSRIIDE 62 Query: 90 RINTDKLQCLPCT-QVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLN 148 + + ++L L + ++ Y RL +L+ +R+LYLD+D+V +I +L N Sbjct: 63 KFDPEELHDLNSGFKHINQMTYARLLIPKLI--KANRVLYLDSDLVVDDEIDELFSRKFN 120 Query: 149 GAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKW-ADAKLTEKALSILMSK 207 G V + ++ K SR+ P N+GV+ ++ ++ D L+EK L ++ Sbjct: 121 GKKILAVTHIFDVRNKNESRVDLP---VPSINAGVLLINNQELRKDHNLSEKLLD--FAR 175 Query: 208 DNVYKYPDQDVMNVLLK--------------GMTLFLPREYNTIYTIKSELKDKTHQNYK 253 N + DQD +N K G FL N+ +E+ DK Sbjct: 176 KNNFPQDDQDTINNWFKDEIGSLSFKYNYQIGADRFLFWSNNSNTETATEILDK------ 229 Query: 254 KLITESTLLIHYTGATKPWH 273 ++ +IHY KP++ Sbjct: 230 ---VKNPKIIHYISDDKPFN 246 >UniRef50_Q2RB54 Glycosyl transferase family 8 protein, expressed n=11 Tax=Poaceae RepID=Q2RB54_ORYSJ Length = 642 Score = 51.6 bits (122), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 44/174 (25%), Positives = 84/174 (48%), Gaps = 9/174 (5%) Query: 122 TLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVK--DVEPMQEKAVSRLSDPELLGQYF 179 +L+R++ LD D++ + D+S L +L + G V ++ +V+ Q KA + + + Sbjct: 464 SLNRVVVLDDDLIVQKDLSSLWNLNMGGKVVGAIQFCEVKLGQLKAYTEERNFGTNSCVW 523 Query: 180 NSGVVYLDLKKWADAKLTEKALSIL--MSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTI 237 SG+ ++LKKW D +T + +L + KD+V +P + + LL L P E + + Sbjct: 524 LSGLNVVELKKWRDLHITSRYDQLLQKLQKDSVTSFPLKVLPISLLVFQDLIYPLEDSWV 583 Query: 238 YTIKSELKDKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENS 291 + + + K+ +T +HY G KPW I+ Y++ + N Sbjct: 584 QSGLGHDYGVSQTDIKRSVT-----LHYNGVMKPWLDLGIHDYKGYWRKYMTNG 632 >UniRef50_A8LP95 Lipopolysaccharide 1 n=1 Tax=Dinoroseobacter shibae DFL 12 RepID=A8LP95_DINSH Length = 342 Score = 51.6 bits (122), Expect = 4e-05, Method: Compositional matrix adjust. 
Identities = 45/172 (26%), Positives = 75/172 (43%), Gaps = 19/172 (11%) Query: 108 AMYFRLFAFQLLGLTLDRLLYLDADVVCKGD-ISQLLHLGLNGAVAAVVKDVEPMQEKAV 166 + Y RL LG R+LY+D+DV D + LL + G A V+D Q + Sbjct: 110 STYLRLALSGALGHDYQRILYMDSDVFALRDGLHVLLFTDMRGKPLAAVRDNS--QWRTS 167 Query: 167 SRLSDPELLG------QYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMN 220 R D +L+ YFN+GV+ +D + + + KAL + S+ DQ ++N Sbjct: 168 GRKPD-DLVTLNLPARPYFNAGVLLMDTARLNEQDILAKALDLGTSQAGRLARHDQTLLN 226 Query: 221 VLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTGATKPW 272 + G + +N +T S + ++E ++H+ G KPW Sbjct: 227 AVTSGNWAEMSPRWNWQFTWAS---------WIFALSEDARILHFIGPNKPW 269 >UniRef50_D0IR33 LPS 1,2-glycosyltransferase n=3 Tax=Helicobacter pylori RepID=D0IR33_HELP1 Length = 387 Score = 51.6 bits (122), Expect = 4e-05, Method: Compositional matrix adjust. Identities = 53/225 (23%), Positives = 92/225 (40%), Gaps = 32/225 (14%) Query: 105 WSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVE-PMQE 163 +S+ + RLF L L D+++ DAD + D+S+ + L+ V KD P Sbjct: 129 FSKMVMCRLFLASLF-LQYDKIIMFDADTLFLNDVSESFFIPLDDYYFGVAKDFSSPKSS 187 Query: 164 K--AVSRLSDPE-----------------LLGQYFNSGVVYLDLKKWADAKLTEKALSIL 204 K R P L ++N G + ++LK W +L E+ L++ Sbjct: 188 KHFQTERERAPRQAFSLYEHYLKEKDIKILYENHYNVGFLVVNLKLWRADRLEERLLNLT 247 Query: 205 MSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLIT--ESTLL 262 K P+QD++ + L LP YNT N K+ I + ++ Sbjct: 248 HQKGQCVFCPEQDLLTLACYQKVLILPYIYNT---------HPFMVNQKRFIPNRQEIVM 298 Query: 263 IHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEF 307 +H+ KPW S ++++ L+ S + + S + K + EF Sbjct: 299 LHFYFVGKPWVSPTALYSKEWHETLLKTSFYAEYSVKFLKQMTEF 343 >UniRef50_C6DEN3 Glycosyl transferase family 8 n=1 Tax=Pectobacterium carotovorum subsp. carotovorum PC1 RepID=C6DEN3_PECCP Length = 610 Score = 51.6 bits (122), Expect = 4e-05, Method: Compositional matrix adjust. 
Identities = 59/259 (22%), Positives = 114/259 (44%), Gaps = 28/259 (10%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHIN--LDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 + + DA++ V++TS+ ++ N D YI +++IA+ IT Sbjct: 330 IFFCTDADFSLPAVVALTSLAMSIGGANNLPDIYIFVPPEIRPLWERIAERFTSAFPIIT 389 Query: 88 LYRINT-----DKLQC----LPCTQVWSRAMYFRLFAFQLLG-LTLDRLLYLDADVVCKG 137 L ++T D+++ + S Y R +A + L + + R LYLD+D+V Sbjct: 390 LRIVSTLQMDLDEVRAQFGFYNVGETLSTTTYTRFYASRYLHYIGVTRALYLDSDIVILH 449 Query: 138 DISQLLHLGLNG-AVAAVVKDVEPMQEKAVS--RLSDPELLGQYFNSGVVYLDLKKWADA 194 LL+ + G +AA P+ ++A+ ++++ +YFN+GV+ DL A Sbjct: 450 SPLSLLYEDMQGFPLAARTDRNTPLIKRAIRLHQIAN----ERYFNAGVILFDLTHPAMI 505 Query: 195 KLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKK 254 A++ ++ + DQ +N + G+ L L YN S + Sbjct: 506 STINTAITYSKQGNSPLLFLDQCALNKAISGLYLALDERYNRFIPPSSA---------TQ 556 Query: 255 LITESTLLIHYTGATKPWH 273 +I ++T+++H+ KPW Sbjct: 557 VIEDNTVIMHFIETPKPWQ 575 >UniRef50_A9SH80 Predicted protein n=2 Tax=Physcomitrella patens subsp. patens RepID=A9SH80_PHYPA Length = 527 Score = 51.6 bits (122), Expect = 4e-05, Method: Compositional matrix adjust. 
Identities = 49/197 (24%), Positives = 81/197 (41%), Gaps = 35/197 (17%) Query: 111 FRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQE------- 163 F F L + R +YLDAD+V KG+I +L+ + L AA V+D E Sbjct: 286 FAPFLLPLHFKDVGRFIYLDADIVVKGNIEELIQIDLGNRAAAAVEDCSQTFETYFDFNE 345 Query: 164 --KAVSRLSDPELLGQ--------YFNSGVVYLDLKKWADAKLTEKAL----SILMSKDN 209 K +R P + FN GV+ +D +W ++TE L ++ Sbjct: 346 LAKIQARPEKPTWVPTEPIKPDACVFNRGVLVIDTNQWIKQQVTEAILWWMDEFQSAESV 405 Query: 210 VYKYP-DQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKT---------HQNYKK----L 255 +YKY Q + L G + L +N ++E ++ H+ +K L Sbjct: 406 LYKYGLSQPPFLLALYGKYMKLDTPWNVRGLGRNEFSEREREFLESKYGHKPERKPFISL 465 Query: 256 ITESTLLIHYTGATKPW 272 ++ ++H+ G KPW Sbjct: 466 DADTAKILHFNGKFKPW 482 >UniRef50_Q3DM64 Glycosyl transferase, family 8, degenerate n=6 Tax=Streptococcus agalactiae RepID=Q3DM64_STRAG Length = 394 Score = 51.2 bits (121), Expect = 6e-05, Method: Compositional matrix adjust. Identities = 41/155 (26%), Positives = 71/155 (45%), Gaps = 20/155 (12%) Query: 124 DRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGV 183 +++LYLD D + ++ +L + L A + D + G +FNSGV Sbjct: 97 EKVLYLDIDTLVVDNLDKLFEIELGDYPIAAILDGD----------------GIHFNSGV 140 Query: 184 VYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSE 243 + ++ W ++TEK L I + + + DQ V+N+L L L +YN + ++ Sbjct: 141 MLINSLYWMRYRVTEKLLEITERELDNGIFGDQGVLNLLFDNNWLKLEDKYNA--QVGND 198 Query: 244 LKD--KTHQNYKKLITESTLLIHYTGATKPWHKWA 276 L + Q Y ES +IHY KPW+ ++ Sbjct: 199 LGAFYENWQGYFDRNFESPTIIHYCTHDKPWNTFS 233 >UniRef50_C2LRU0 Glycosyl transferase, family 8 n=1 Tax=Streptococcus salivarius SK126 RepID=C2LRU0_STRSL Length = 402 Score = 50.4 bits (119), Expect = 8e-05, Method: Compositional matrix adjust. 
Identities = 59/252 (23%), Positives = 106/252 (42%), Gaps = 34/252 (13%) Query: 29 NVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITL 88 +V + + +Y++ + V++ S+ + + Y++ + +F + + E I Sbjct: 5 SVVFVAELSYMEKLEVALKSLCAHKGQWKI--YVLNENLPTEWFTLMNRRLEAIDSEILN 62 Query: 89 YRINTDKLQ--CLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLG 146 R++ + + LP + A +FR + + +R+LYLD D++ D+S L + Sbjct: 63 CRVSAESFKQFSLPSAHI-HYATFFRYAIPEFV--QENRVLYLDCDMIFTQDLSPLFEVD 119 Query: 147 LNG-AVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILM 205 L G + AVV D FN+G++ +D W K+T+ +L L Sbjct: 120 LGGLGIGAVV---------------DRPTTTDGFNAGLMVIDTDWWRQHKVTD-SLFDLT 163 Query: 206 SKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKL-----ITEST 260 + + Y DQ ++N+ K LP YN + S DK Y L Sbjct: 164 KEHHQNVYGDQGILNLYFKDAWYQLPWTYNL--QVGS---DKDQYGYGDLEWYDAFKGVP 218 Query: 261 LLIHYTGATKPW 272 +IHYT KPW Sbjct: 219 AVIHYTSHNKPW 230 >UniRef50_B6ACJ0 Glycosyl transferase family 8 protein, putative n=1 Tax=Cryptosporidium muris RN66 RepID=B6ACJ0_9CRYT Length = 304 Score = 50.4 bits (119), Expect = 9e-05, Method: Compositional matrix adjust. Identities = 41/174 (23%), Positives = 77/174 (44%), Gaps = 21/174 (12%) Query: 106 SRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLN---GAVAAVVKDVEPMQ 162 S A RL ++ ++D+LLYLD DV+ + +L + +N G VA + + Sbjct: 132 SEATMCRLLLPNIIDKSIDKLLYLDTDVIVNTPLRELFGININSQCGIVARSSTKADLIN 191 Query: 163 EKAVSRLSDPELL---GQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVM 219 E P ++ + FN+GV+ + L + T+KA+ + + DQ ++ Sbjct: 192 EWLKKDKIYPHIIYNGTKSFNAGVLLISLNELRKNHFTDKAMEFVEK----WGLNDQIIL 247 Query: 220 NVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTGATKPWH 273 N+ G LP +YN + + + ++ T + ++H+ G KPW Sbjct: 248 NLYCNGEYDELPMQYN-FWAGRDDYRN----------TSAHGIVHFAGPNKPWQ 290 >UniRef50_A4SAB5 Predicted protein (Fragment) n=1 Tax=Ostreococcus lucimarinus CCE9901 RepID=A4SAB5_OSTLU Length = 259 Score = 50.1 bits (118), Expect = 1e-04, Method: Compositional matrix adjust. 
Identities = 59/270 (21%), Positives = 107/270 (39%), Gaps = 40/270 (14%) Query: 28 LNVAYGVDANYLDGVGVSITSIV---LNNRHINLDFYIIADVYNDGFFQ------KIAKL 78 +++A+ D L +G I+S++ + I + D D Q I + Sbjct: 3 VHIAFACDPTQLFTLGPVISSVLSATASPHRIRFHIFTARDALTDASVQLNCYSRAIPFI 62 Query: 79 AEQNQLRITLYRINTDKLQCLPCTQVW--SRAMYFRLFAFQLLGLTLDRLLYLDADVVCK 136 E ++ + R N + + W A + F F + + +++YLD D++ K Sbjct: 63 WELHEFSKDMIRANI----TVHSRKEWRLQNAFNYARFYFAEILSDVQKVVYLDTDIIVK 118 Query: 137 GDISQLLHLGLNG---AVAAVVKDVEPMQ-----EKAVSRLSDPELLGQYFNSGVVYLDL 188 GDI +L L +V A VK P+ A + S FN+GV+ +DL Sbjct: 119 GDICRLHDANLRSSSTSVIAAVKRSVPLGSLLNFSNAAVKSSGLREKMHSFNAGVLLIDL 178 Query: 189 KKWADAKLTEKALSILM--SKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKD 246 + W ++T + L S +Y + Q + ++ +P +N Sbjct: 179 ESWRRKRITSTVETWLKMNSVSKLYSHGSQPPLLLVFGDSFESIPSHWNV---------- 228 Query: 247 KTHQNYKKLITESTL----LIHYTGATKPW 272 YKK + S L ++H++G +KPW Sbjct: 229 -DGVGYKKGLRASVLNEARVLHWSGQSKPW 257 >UniRef50_C8WAA9 Glycosyl transferase family 8 n=2 Tax=Atopobium RepID=C8WAA9_ATOPD Length = 358 Score = 50.1 bits (118), Expect = 1e-04, Method: Compositional matrix adjust. 
Identities = 47/202 (23%), Positives = 90/202 (44%), Gaps = 30/202 (14%) Query: 99 LPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDV 158 LP + Y+RL A LL +++ +YLD+D+V DI++L + + G + +D Sbjct: 90 LPHHGHFRPETYYRLLAPSLLP-NVNKAIYLDSDLVVNTDIAELYDIDITGYLVGATRDA 148 Query: 159 EPMQE------------KAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMS 206 + + + K + DP YF +GV+ ++L++ E+ L + S Sbjct: 149 DTIGQIDGYDATVGPYLKNELGMDDPH---DYFQAGVILMNLEEIRKQISPEEFLKV--S 203 Query: 207 KDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITE-------S 259 +++ DQDV+N + G L + ++N + + +D K I E + Sbjct: 204 TMRTWRWLDQDVLNRFVNGHYLRINMKWNYLVDWQFLRRDHIVAQAPKDIREEYEEARKN 263 Query: 260 TLLIHYTGA-TKPWHKWAIYPS 280 + H+ G +PW +YP+ Sbjct: 264 ICIAHFAGPDNRPW----LYPN 281 >UniRef50_B3WD32 Glycosyl transferase n=9 Tax=Lactobacillus RepID=B3WD32_LACCB Length = 279 Score = 49.7 bits (117), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 65/289 (22%), Positives = 125/289 (43%), Gaps = 70/289 (24%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHIN--LDFYIIA----------DVYNDGFFQKI 75 +N+ + D DGV ++ S++ RH + L Y++ ++ +++ Sbjct: 3 MNIMFCGDEKMTDGVLIATLSLM---RHTDQPLHIYVLTAKLKVNGHAYQPFSAVTAERM 59 Query: 76 AKLAEQNQLRITLYRIN-TDKLQCLP----CTQVWSRAMYFRLFAFQLLGLTLDRLLYLD 130 A L Q + L RI+ TD P T +++ RL+A L+ DR+LYLD Sbjct: 60 ADLMRQENPQHRLTRIDITDLFMANPPQANMTTMFTPYCMLRLYA-DLIPELPDRVLYLD 118 Query: 131 ADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSD---------PELLGQYFNS 181 D+VC+ S L EPM++ ++ + D Y NS Sbjct: 119 TDIVCRRSFSNLYQ--------------EPMKDVDIAGVLDHYGKWWFHHKLTWFDYINS 164 Query: 182 GVVYLDLKKWADAKLTEKALSILMSKDNVYKY---PDQDVMNVLLKGMTLFLPREYNTIY 238 GV+ ++L A + + L + + +++ PDQ +N++ K + LPR+YN + Sbjct: 165 GVLLMNL-----ASIRQDGLLVRCRRLIRHRWLFMPDQSALNIIAKSKQI-LPRKYNEQH 218 Query: 239 TIKSELKDKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIA 287 ++++ T+ H+T + + W ++ I +VK ++I+ Sbjct: 219 KVETD----------------TVFQHFTTSFRFWPRFRIV-TVKPWQIS 250 >UniRef50_A8AY72 Glycosyl transferase, family 8 SP1766 n=7 Tax=Streptococcus RepID=A8AY72_STRGC Length 
= 435 Score = 49.7 bits (117), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 39/153 (25%), Positives = 67/153 (43%), Gaps = 19/153 (12%) Query: 124 DRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGV 183 DR LYLD D+V ++ L L L A V+ LG FNSGV Sbjct: 131 DRALYLDCDLVVTQNLDHLFELDLEDYYIAAVRATFG--------------LGIGFNSGV 176 Query: 184 VYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSE 243 + L+ K+W + + ++ + + + DQ ++N+L K L L YN + I + Sbjct: 177 MLLNNKRWREENIPQQLVELTDREIERVLEGDQSILNMLFKEQYLELEDSYN--FQIGFD 234 Query: 244 LKDKTHQN---YKKLITESTLLIHYTGATKPWH 273 + + + + ++ ++HY A KPW+ Sbjct: 235 MGAAQYGHDFVFDIPLSPLPAIVHYISALKPWN 267 >UniRef50_B3JN71 Putative uncharacterized protein n=4 Tax=Bacteroides coprocola DSM 17136 RepID=B3JN71_9BACE Length = 157 Score = 49.3 bits (116), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 31/97 (31%), Positives = 56/97 (57%), Gaps = 6/97 (6%) Query: 204 LMSKDNVYKYPDQDVMNVLLKGMTLFLPREYN---TIYTIKSELKDKTHQNYKKLITEST 260 L+ K+ Y Y DQD++N++ KG FLP +YN +Y++ E + + K++ S Sbjct: 16 LLHKNKKYTYQDQDIINIVCKGKIEFLPLKYNYTSLLYSLSIENEKFKNIKAKEIAECSN 75 Query: 261 LLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDS 297 +IHYTG KPW+ + + +Y+ ++S +KD++ Sbjct: 76 CIIHYTG-NKPWNNFCL--RAEYWWYYYKHSIFKDEN 109 >UniRef50_Q92VQ2 Putative lipopolysaccharide 1,3-galactosyltransferase n=1 Tax=Sinorhizobium meliloti RepID=Q92VQ2_RHIME Length = 337 Score = 49.3 bits (116), Expect = 2e-04, Method: Compositional matrix adjust. 
Identities = 49/189 (25%), Positives = 72/189 (38%), Gaps = 20/189 (10%) Query: 112 RLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSR--- 168 R + LL +DR LY+D D + G++ LL D + VSR Sbjct: 103 RFWIDSLLDAGVDRFLYIDGDTMVDGELDSLLASTPPAEGLMAAPDFLNIFMDEVSRGKK 162 Query: 169 --LSDPELLG----QYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVL 222 L+ E +G YFNSGV+Y + W D + A+ ++ DQ +N Sbjct: 163 RDLAHLEGIGCRPETYFNSGVIYASREAWND--IVPVAMKFMVEHPEHCPASDQSALNHA 220 Query: 223 LKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTGATKPWHK--WAIYPS 280 +G L YN +SE + + + H+TG KPW+ W S Sbjct: 221 ARGRVTMLSLRYN----YQSEHMMVLDPRRRGI---GPAIWHFTGGPKPWNTPGWPWDES 273 Query: 281 VKYYKIALE 289 Y A E Sbjct: 274 FNRYYCAAE 282 >UniRef50_B4WN64 Glycosyl transferase family 8 n=1 Tax=Synechococcus sp. PCC 7335 RepID=B4WN64_9SYNE Length = 289 Score = 48.1 bits (113), Expect = 4e-04, Method: Compositional matrix adjust. Identities = 71/280 (25%), Positives = 118/280 (42%), Gaps = 44/280 (15%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRH----INLDFYIIADVYNDGFFQKIAKLA---- 79 +++A V+ + V I SI+ N H + L F I+ + FF++ K A Sbjct: 3 VDIALSVNRTLQVPLLVVINSILTNTTHRTEEVPLRFNIVVPIGESAFFEEELKQAFSAK 62 Query: 80 ---EQNQLRITLYRINT-------DKLQCLPCTQVWSRAM-YFRLFAFQLLGLTLDRLLY 128 E+ + R+ + + +K + + SR M Y RLF F+ + + R++Y Sbjct: 63 YDCERVEFRVKEFTPPSYLKQYLDNKFREKKQERRLSRYMQYARLF-FKDVFPDIARMIY 121 Query: 129 LDADVVCKGDISQLLHLG---LNGAVAAVVKDVEPM---QEKAVSRLSDPELLGQYFNSG 182 DAD++ G++ L G + A V P + SD FNSG Sbjct: 122 FDADIIVLGNVRSLFTQGNILTSQNYLAAVPQFFPAIFYFSNPLKVFSDLRKFKSTFNSG 181 Query: 183 VVYLDLKKWADA--KLTEKALSILMSKDN--VYKYPDQDVMNVLLKGMTLFLPREYNTIY 238 V+ DL W D KL + L L K+N +Y D+ V N++ K + L +++N Sbjct: 182 VLLTDLSFWTDQTYKLLKHYLE-LDEKNNYRLYHLGDETVFNLMFKDTYIPLTKQWNCCG 240 Query: 239 TIK----SELKDKTHQNYKKLITESTLLIHYTGA-TKPWH 273 + ++L K +N K IH++G KPW Sbjct: 241 YGQAHWVAKLLWKNPENMKA--------IHWSGGHHKPWQ 272 >UniRef50_UPI0000587C70 PREDICTED: similar to MGC81998 protein n=1 Tax=Strongylocentrotus purpuratus RepID=UPI0000587C70 Length = 344 Score = 48.1 
bits (113), Expect = 4e-04, Method: Compositional matrix adjust. Identities = 62/277 (22%), Positives = 114/277 (41%), Gaps = 44/277 (15%) Query: 21 NINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAE 80 N +++ +NV D + L G+ ++ SI LN+R ++ FY++ D + ++K Sbjct: 56 NSSSNGTINVLICSDGSTLGGMVAAMNSIYLNSR-THIKFYLVVDTDS---LDHLSKWLS 111 Query: 81 QNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDIS 140 Q+ LR Y I L Y RL+ ++ R++++D+D + +GDI+ Sbjct: 112 QSSLRKLDYAIKVFDESWL---------NYARLYFPKIFPGLTGRVIFVDSDTITQGDIA 162 Query: 141 QLLHLGLN-GAVAAVVKDVEPMQEK---AVSRLSDPELLGQ-------------YFNSGV 183 +L + + G V A D + + ++R + G FN GV Sbjct: 163 ELNAIDIKPGHVVAFSDDCSAVTSRYGVIMNRYASYLNFGNEKLQSLGINPMECSFNPGV 222 Query: 184 VYLDLKKWADAKLTEKA--LSILMSKDNVY------KYPDQDVMNVLLKGMTLFLPREYN 235 ++ +W +T K + SK++VY + +M V + LP E++ Sbjct: 223 FVANVDEWRKQNITAKLDYWVTVNSKEDVYGSQRGGGHSGPPMMIVFYMKYSP-LPPEWH 281 Query: 236 TIYTIKSELKDKTHQNYKKLITESTLLIHYTGATKPW 272 L T Y ++ L+H+ G KPW Sbjct: 282 I-----RHLGVTTGARYSDAFLKAAKLLHWNGRFKPW 313 >UniRef50_Q68CQ7 Glycosyltransferase 8 domain-containing protein 1 n=45 Tax=Euteleostomi RepID=GL8D1_HUMAN Length = 371 Score = 47.8 bits (112), Expect = 5e-04, Method: Compositional matrix adjust. 
Identities = 70/292 (23%), Positives = 118/292 (40%), Gaps = 46/292 (15%) Query: 26 ECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLR 85 E + V + L G +I SI N R N+ FYI+ + + + L+ Sbjct: 64 EEIPVVIAASEDRLGGAIAAINSIQHNTRS-NVIFYIVTL---NNTADHLRSWLNSDSLK 119 Query: 86 ITLYRI-NTD------KLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGD 138 Y+I N D K++ P + + F F +L + + +Y+D DV+ +GD Sbjct: 120 SIRYKIVNFDPKLLEGKVKEDPDQGESMKPLTFARFYLPILVPSAKKAIYMDDDVIVQGD 179 Query: 139 ISQLLHLGLN-GAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLT 197 I L + L G AA +D + K V R + G +N + YLD KK KL+ Sbjct: 180 ILALYNTALKPGHAAAFSEDCDSASTKVVIRGA-----GNQYNY-IGYLDYKKERIRKLS 233 Query: 198 EKALSILMSKD----NVYKYPDQDVMNVLLKGMTLFLP---------------------- 231 KA + + N+ ++ Q++ N L K M L + Sbjct: 234 MKASTCSFNPGVFVANLTEWKRQNITNQLEKWMKLNVEEGLYSRTLAGSITTPPLLIVFY 293 Query: 232 REYNTIYTIKS--ELKDKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSV 281 ++++TI + + L + Y ++ L+H+ G KPW + A Y V Sbjct: 294 QQHSTIDPMWNVRHLGSSAGKRYSPQFVKAAKLLHWNGHLKPWGRTASYTDV 345 >UniRef50_B9HMR5 Glycosyltransferase, CAZy family GT8 n=25 Tax=Magnoliophyta RepID=B9HMR5_POPTR Length = 383 Score = 47.4 bits (111), Expect = 7e-04, Method: Compositional matrix adjust. 
Identities = 59/273 (21%), Positives = 114/273 (41%), Gaps = 39/273 (14%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLD----FYIIADVYNDGFFQKIAKLAEQN- 82 +++A +D+ YL G ++ S++ +H + F+ +A ++ + + +L Sbjct: 77 VHIAMTLDSEYLRGSIAAVHSVL---KHASCPESIFFHFVAAEFDPASPRVLTQLVRSTF 133 Query: 83 -QLRITLYRINTDKLQCLPCTQVWSRAM-----YFRLFAFQLLGLTLDRLLYLDADVVCK 136 L +Y D + L + + +A+ Y R + +L L +DR++YLD+D+V Sbjct: 134 PSLNFKVYIFREDTVINLISSSI-RQALENPLNYARNYLGDMLDLCVDRVIYLDSDIVVV 192 Query: 137 GDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRL-----SDPELLGQ---------YFNSG 182 DI +L + L+G + V+ E SD + G YFN+G Sbjct: 193 DDIHKLWNTALSG--SRVIGAPEYCHANFTQYFTSVFWSDQVMSGTFSSARRKPCYFNTG 250 Query: 183 VVYLDLKKWADA---KLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYT 239 V+ +DL +W + + EK + I K +Y+ ++ G + +N Sbjct: 251 VMVMDLVRWREGDYKRRIEKWMEI-QKKTRIYELGSLPPFLLVFAGDVEAIDHRWNQ--- 306 Query: 240 IKSELKDKTHQNYKKLITESTLLIHYTGATKPW 272 D + + L L+H++G KPW Sbjct: 307 -HGLGGDNVRGSCRSLHPGPVSLLHWSGKGKPW 338 >UniRef50_Q02ZT6 Lipopolysaccharide biosynthesis glycosyltransferase n=1 Tax=Lactococcus lactis subsp. cremoris SK11 RepID=Q02ZT6_LACLS Length = 281 Score = 47.0 bits (110), Expect = 9e-04, Method: Compositional matrix adjust. Identities = 29/106 (27%), Positives = 54/106 (50%), Gaps = 13/106 (12%) Query: 178 YFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTI 237 YF +GV+ L+L+ TEK ++++ ++ +Y DQD++N+ K +LP +N I Sbjct: 12 YFQAGVLVLNLQAIRKDFTTEKFINLVQKRNWIYM--DQDILNLCFKNKVFYLPESWNVI 69 Query: 238 ----------YTIKSELKDKTHQNYKKLITESTLLIHYTGATKPWH 273 I+ L + +Y K ++ ++HY G+ KPW+ Sbjct: 70 TLMEKNSVRGQIIQERLPYQISDSYNK-SRKTPNIVHYAGSYKPWY 114 >UniRef50_Q1CSY7 Lipopolysaccharide 1,2-glycosyltransferase n=3 Tax=Helicobacter pylori RepID=Q1CSY7_HELPH Length = 341 Score = 47.0 bits (110), Expect = 0.001, Method: Compositional matrix adjust. 
Identities = 39/192 (20%), Positives = 79/192 (41%), Gaps = 32/192 (16%) Query: 124 DRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKD------VEPMQEKAVSRLSDPELLG- 176 ++++ D D + GDIS+ + ++G K+ + + SRL+ +G Sbjct: 100 EKIIMFDVDTLFVGDISESFFIPMDGVYFGATKEDFSLIGIHNANDLFSSRLNWSRGMGV 159 Query: 177 -----------------QYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVM 219 FN+G + ++L W + L EK + ++D P+QD+ Sbjct: 160 KLNHKSLIFQEVEILYENPFNAGFMLVNLALWREHHLEEKLIDFFKTRDEGLLLPEQDLF 219 Query: 220 NVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTGATKPWHKWAIYP 279 ++ +G L +P +YN + T KK ++H+ KPW + Sbjct: 220 VLVCQGCILEMPCKYNVHPRMVG-----TRMIPKK---SDACMLHFYADEKPWKHFRYPY 271 Query: 280 SVKYYKIALENS 291 S +++++A + S Sbjct: 272 SKEWHQVAFKTS 283 >UniRef50_A1VG39 Glycosyl transferase, family 8 n=1 Tax=Desulfovibrio vulgaris DP4 RepID=A1VG39_DESVV Length = 335 Score = 47.0 bits (110), Expect = 0.001, Method: Compositional matrix adjust. Identities = 57/281 (20%), Positives = 122/281 (43%), Gaps = 22/281 (7%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHINL-DFYIIADVYNDGFFQKIAKLAEQNQLRITL 88 + + DANY V++ S+ N + Y++ + + G I + + R+ Sbjct: 6 IVFTFDANYRLPASVALQSLFENAKDSTYYHVYLVCEGLSRGDKDAIESICPEKNGRVEW 65 Query: 89 YRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLN 148 ++ + P ++ W + +Y R+ L L D+++Y D DVV D++++ + ++ Sbjct: 66 IDVDNELFSSAPSSENWPKIVYARILL--PLLLPFDKVIYSDVDVVFCSDLAEIFQIEVD 123 Query: 149 GAV-AAVVKDVEPMQEKAVSRLSDPELLGQ---YFNSGVVYLDLKKWADAKLTEKALSIL 204 G A V ++ QE V+R + Q + SG + ++L+ + + L+ + Sbjct: 124 GCEWAGVAAELVAFQE-GVARCHNVHCEYQNELIYMSGFMVMNLRLMREKDTVGRCLNNI 182 Query: 205 MSKDNVYKYPDQDVMNVLLKGMTL--FLPREYNTIYTIKSELKDKTHQNYKKLITESTL- 261 + K D +++N+ + F ++ K+ + K + + L S L Sbjct: 183 SKFGSRLKMYDLEILNMSSDNIARIDFSYCVLENVFFAKNVSEAKEYPWLRGLYRVSELE 242 Query: 262 -------LIHYTGA-TKPWHKWAIYPSVKYYKIALENSPWK 294 +IH+ G+ TK W ++ + P V Y+ L SP++ Sbjct: 243 AARSAPRIIHFAGSDTKVWERYCV-PQV--YRKYLAVSPFR 280 >UniRef50_A7H2X4 Glycosyl transferase family 8 n=2 Tax=Campylobacter RepID=A7H2X4_CAMJD Length = 497 Score = 46.6 bits 
(109), Expect = 0.001, Method: Compositional matrix adjust. Identities = 76/309 (24%), Positives = 128/309 (41%), Gaps = 52/309 (16%) Query: 28 LNVAYGVDANYLDGVGVSITSIV--------LNNRHINLDF---------YIIADVYNDG 70 L++ GV A Y+ V I SIV L NL F + I Y Sbjct: 2 LHICIGVSAEYVKYSAVLINSIVKATQKPFDLKPYENNLSFTKDLKEGFCFHIFTEYKSE 61 Query: 71 FFQKIA----KLAEQNQLRITLYRINTDKLQCLPCTQVWSR--AMYFRLFAFQLLGLTLD 124 +KIA KL+E + ++ +N Q W + AM++++ +L +D Sbjct: 62 DTEKIALLAHKLSEIYPTKCLIHVMNNQDFQDFS-YPFWCQNAAMFYKIKVVDILK-DVD 119 Query: 125 RLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDV---EPMQEKAVSRLSDPELL----GQ 177 + L++ AD+ GD+ L L L + A D + KA ++ SD EL+ Sbjct: 120 KCLFIGADLFALGDVRDLFALDLKDNLIAAALDTYNFDGYLRKAKAKNSDEELVFNDAKN 179 Query: 178 YFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTI 237 Y N+ ++ ++LK+W L K + L D D DV ++ L +YN I Sbjct: 180 YINNDMMLINLKEWRKQNLQAKYIDYLNKYDLA---GDLDVFPLVCAPKIHILSSKYNFI 236 Query: 238 --------YTIKSELKDKTHQ---NYKKL----ITESTLLIHYTG-ATKPWHKWAIYPSV 281 + +++ LKD++ + N+ K+ I + L+H+ KPW A Sbjct: 237 LGYYTRESFGLENTLKDESDKPVWNFTKVELEQIQKDLRLVHFCHYVYKPWMS-AYNCHY 295 Query: 282 KYYKIALEN 290 Y+ + L+N Sbjct: 296 VYFNMGLDN 304 >UniRef50_A2DBB6 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2DBB6_TRIVA Length = 1378 Score = 46.6 bits (109), Expect = 0.001, Method: Compositional matrix adjust. 
Identities = 35/144 (24%), Positives = 65/144 (45%), Gaps = 8/144 (5%) Query: 16 DFRLANINTSECLNVAYGVDANYLDGVGVSITSI-VLNNRHINLDFYIIADVYNDGFFQK 74 + +L+ N +E +NV + V + YL V I I + N + F+ + + + F Sbjct: 1047 NLKLSMSNDTETVNV-FAVVSGYLYEHLVKIMMISAIKNTKNPIHFWFLKNFISSQFMND 1105 Query: 75 IAKLAEQNQLRITLYRINTDKL---QCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDA 131 + K A++ + + N Q +W + LF L + + R++Y+DA Sbjct: 1106 LPKFAKKYNFKYSFVEYNWPSFVVHQSERQRIIWGNKI---LFFDALFPMNISRMIYIDA 1162 Query: 132 DVVCKGDISQLLHLGLNGAVAAVV 155 D V +GD+S+L+ + L G V Sbjct: 1163 DAVVRGDLSELMKIDLKGCPYGFV 1186 >UniRef50_Q16CW9 Lipopolysaccharide 1,3-galactosyltransferase, putative n=7 Tax=Rhodobacteraceae RepID=Q16CW9_ROSDO Length = 329 Score = 46.2 bits (108), Expect = 0.001, Method: Compositional matrix adjust. Identities = 38/170 (22%), Positives = 75/170 (44%), Gaps = 17/170 (10%) Query: 109 MYFRLFAFQLLGLTLDRLLYLDADV-VCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVS 167 +Y R+ D++LYLD+D+ V GD + L + + A V+D +Q + Sbjct: 99 VYLRIALPTAFAGEYDKILYLDSDIFVQGGDFNALFDIDVAPHCIASVRD--NVQWRTPK 156 Query: 168 RLSDPELL-----GQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVL 222 R + + YFN+GV+ +D++ + + +L + + ++ K DQ++ N + Sbjct: 157 RQNKRNTIKGIPPSAYFNAGVMLMDVQAYTEQELMRRCVEFGRARRRDLKRHDQNLYNAV 216 Query: 223 LKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTGATKPW 272 L+ + +N Y+ + L + +IH+ G KPW Sbjct: 217 LQNDWAEISPVWNWQYSWSTRL---------FAVFAYPNIIHFIGPAKPW 257 >UniRef50_A9UXT0 Predicted protein (Fragment) n=1 Tax=Monosiga brevicollis RepID=A9UXT0_MONBE Length = 191 Score = 46.2 bits (108), Expect = 0.002, Method: Compositional matrix adjust. 
Identities = 50/190 (26%), Positives = 86/190 (45%), Gaps = 25/190 (13%) Query: 106 SRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLL---HLGLNGAVAAVVKDVEPMQ 162 S A + R +LL L+R+LY+D D V +GD+ LL LG + +AAV + P+ Sbjct: 1 SSANFGRFMLPELLP-ELNRVLYIDIDTVVQGDLVALLAHMDLGDDDYLAAVPRPNVPLS 59 Query: 163 E---KAVSRL-----SDP----ELLGQYFNSGVVYLDLKKWADAKLTEKALSILMS--KD 208 + RL DP +L FN+GV +L+ W L ++ L + + Sbjct: 60 HFFGADIVRLHAELHPDPGQLLQLAAPSFNAGVAVWNLRAWRQRSLRDEVLYYMTKHHEH 119 Query: 209 NVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTGA 268 ++ Y Q ++ ++ G L +N L +T + + L + ++H++G Sbjct: 120 ALWDYGTQPILLLVCAGHWQPLDVRFNL-----DGLGYRTDVSTEAL--DGAYVLHWSGR 172 Query: 269 TKPWHKWAIY 278 KPW A+Y Sbjct: 173 RKPWQHDALY 182 >UniRef50_A5DLS6 Putative uncharacterized protein n=2 Tax=Pichia guilliermondii RepID=A5DLS6_PICGU Length = 390 Score = 46.2 bits (108), Expect = 0.002, Method: Compositional matrix adjust. Identities = 47/183 (25%), Positives = 73/183 (39%), Gaps = 23/183 (12%) Query: 93 TDKLQCLPCTQVWSRAMYFRLFAFQLL-GLTLDRLLYLDADVVCKGDISQLLHLGLNGAV 151 +D+L P R F+ LL + D++LYLD DV+ ++ L G Sbjct: 61 SDRLVTSPVDDRLGRPELAVTFSKLLLWNESYDQILYLDTDVLPLANVDHLFDEG----- 115 Query: 152 AAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVY 211 AA+ P Q A P++ FNSGV+ D ++ + D+ + Sbjct: 116 AALT----PRQIAASPDSGWPDI----FNSGVLLFK----PDPQVYSDLVEFASGSDSSF 163 Query: 212 KYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTGATKP 271 DQ ++N G LP YN T + H+ +K + ++HY G KP Sbjct: 164 DGADQGLLNEFFAGNWHRLPFLYNVTPTESYQYVPAFHRFFKDIK-----ILHYIGQIKP 218 Query: 272 WHK 274 WH Sbjct: 219 WHS 221 >UniRef50_P91854 Protein F26H9.8, partially confirmed by transcript evidence n=2 Tax=Caenorhabditis RepID=P91854_CAEEL Length = 1381 Score = 45.8 bits (107), Expect = 0.002, Method: Compositional matrix adjust. 
Identities = 56/230 (24%), Positives = 101/230 (43%), Gaps = 17/230 (7%) Query: 19 LANINTSECLNVAYGVDAN-YLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAK 77 A+ SE +NV + Y + + +TS++ N + + F+++ + + F + I K Sbjct: 1086 FASPEPSEVINVFSLASGHLYERFMRIMMTSVLNNTKTQKVKFWLLKNYLSPKFKETIPK 1145 Query: 78 LAEQNQLRITLYRINTDKL---QCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVV 134 LAE + L K Q +W Y LF L L +D+++++DAD V Sbjct: 1146 LAEFYKFEFELVEYKWPKWLHKQTEKQRVMWG---YKILFLDVLFPLNVDKIIFVDADQV 1202 Query: 135 CKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDP-----ELLGQYFNSGVVY-LDL 188 + D+ +L+ LNGA V E E R L+G+ ++ +Y +DL Sbjct: 1203 VRADLQELMDFNLNGAPYGYVPFCESRTEMDGFRFWKSGYWKNHLMGRKYHISALYVVDL 1262 Query: 189 KKWADAKLTEK---ALSILMSKDNVYKYPDQDVMNVLLKGMTL-FLPREY 234 K + + ++ L + N DQD+ N +L + + LP+E+ Sbjct: 1263 KAFREFSAGDRLRGRYDSLSADPNSLSNLDQDLPNNMLHEVPIKSLPQEW 1312 >UniRef50_UPI000197AD97 hypothetical protein BACCOPRO_03221 n=1 Tax=Bacteroides coprophilus DSM 18228 RepID=UPI000197AD97 Length = 313 Score = 45.8 bits (107), Expect = 0.002, Method: Compositional matrix adjust. 
Identities = 65/283 (22%), Positives = 128/283 (45%), Gaps = 29/283 (10%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHINL-DFYIIADVYNDGFFQKIAKLAEQ-NQLRIT 87 + + D N + V I+S+++N + D +I+ D +++ +L + N+ RI Sbjct: 7 IVFAFDNNLILPACVCISSLLMNAKEETFYDIFILHSSKVDLHKEQLDELPKYFNRCRIQ 66 Query: 88 LYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQL-LHLG 146 YR+ + + + Y+RL +L+ D ++Y D DV+ + D+S + H Sbjct: 67 -YRVVDNTFDQAFEIRGITTPTYYRLLIPELVP-EYDNIIYSDVDVIFRFDLSDIYFHTD 124 Query: 147 LNGAVAAVVKDVEPM---QEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSI 203 LN + A V + P +K +L + + +G + L+ KK + L E+ + Sbjct: 125 LNDSYVAGVNALVPFIPDMKKYYLKLGNVN-IDSIIYAGNIILNSKKIREDNLVERFKEL 183 Query: 204 LMSKDNVYKYPDQDVMNVLLKGMTLFL-PREYNTIYTIKSELKDK-------THQNYKKL 255 +K + + D DV+N+ KG +L P T Y + L+ + + ++ + Sbjct: 184 AKNK---FHFQDLDVLNIACKGKITYLKPVFCLTTYFSELALRHRNLLRDFWSDKDIDEA 240 Query: 256 ITESTLLIHYTGATKPWHKWAIYPSV--KYYKIALENSPWKDD 296 +TE ++HY G KPW + + +YY+ SP+ D+ Sbjct: 241 LTEG--IVHYNGQ-KPWKGICVNSDIWWEYYR----KSPFFDE 276 >UniRef50_Q39T65 Glycosyl transferase, family 8 n=1 Tax=Geobacter metallireducens GS-15 RepID=Q39T65_GEOMG Length = 317 Score = 45.8 bits (107), Expect = 0.002, Method: Compositional matrix adjust. 
Identities = 41/182 (22%), Positives = 79/182 (43%), Gaps = 33/182 (18%) Query: 124 DRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDP----ELLGQYF 179 D++++ D DVV K DIS + + A V+ + +K ++ P +L Sbjct: 110 DKIIWSDVDVVFKDDISDVFFMLSEENYIAGVRVCGKL-DKYYENMNMPAEIKSILKNGI 168 Query: 180 NSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREY---NT 236 +G++ +LKK + + + + L ++ P+QD++N++LK ++P Y Sbjct: 169 GAGILVYNLKKMREDNIYDDIMIALQGMSSIVVQPEQDILNIVLKDKIDYIPLRYCFCTY 228 Query: 237 IYT-----------IKSELKDKTHQNYKKLIT--------------ESTLLIHYTGATKP 271 +Y +K L + + Y+K + ES +IHY +TKP Sbjct: 229 MYNLFKDRHKMKLKVKGNLFNYLFKGYRKNLGFDTIYSEKELLEAFESPAIIHYATSTKP 288 Query: 272 WH 273 W+ Sbjct: 289 WN 290 >UniRef50_C7PRU3 Glycosyl transferase family 8 n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PRU3_CHIPD Length = 303 Score = 45.1 bits (105), Expect = 0.004, Method: Compositional matrix adjust. Identities = 60/298 (20%), Positives = 109/298 (36%), Gaps = 65/298 (21%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKIAKL--AEQNQL 84 +++A+ +D L+G+G +ITS+V N + LD + I + + L E Sbjct: 1 MHIAFVIDLPSLEGLGATITSLVRNCSDTAQLDLHFICNNLGTRHKNNLLMLLQTESYHG 60 Query: 85 RITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLG-----------------LTLDRLL 127 R Y + ++ SR Y R +LL +T D++ Sbjct: 61 RTRFYDFDAQEMFGHLSAVHGSRTSYGRFLIPKLLDADYVLCLDPDLLILLDVITFDQIR 120 Query: 128 YLDA--DVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVY 185 + D V G L L +V KD Q F SG++ Sbjct: 121 FEDHFLAAVPGGPFRNTLEAKLLPGQLSVCKDE------------------QSFISGMLL 162 Query: 186 LDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELK 245 L+L++W + + + I + + D V+N + G + +N I+T Sbjct: 163 LNLRRWKERDICHEIEKICLRHGMALQEADNTVLNTICNGSFYHIEDRFNCIWT-----P 217 Query: 246 DKTHQNYKKLITESTLLIHYTGATKPW-----------HKWAIYPSV----KYYKIAL 288 + ++K+ ++H+ GA KPW +WA Y + +Y ++A Sbjct: 218 GQATPSFKE-----NAILHFAGAPKPWDFLGREVHAGYQRWADYDTTFWDRRYKRVAF 270 >UniRef50_B7PBG6 Glycosyltransferase domain-containing protein, putative (Fragment) n=1 Tax=Ixodes scapularis RepID=B7PBG6_IXOSC 
Length = 304 Score = 45.1 bits (105), Expect = 0.004, Method: Compositional matrix adjust. Identities = 51/193 (26%), Positives = 83/193 (43%), Gaps = 33/193 (17%) Query: 110 YFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGL-NGAVAAVVKDVEPMQEK---A 165 + +L+ +LL L+ LD DV+ +GD+++L L L GAV +D + + A Sbjct: 92 FAKLYLARLLPSVAGTLVVLDDDVIVQGDVAELAALPLPKGAVGLFSRDCDTFSRRYNTA 151 Query: 166 VSRLSD------PEL--LG-----QYFNSGVVYLDLKKWADAKLTEKALSI--LMSKDNV 210 SR P L LG N GV +DL +W+ +TE A + L K+ + Sbjct: 152 GSRYEQYVEARRPSLQALGISATDCVLNLGVFVVDLAEWSRLNVTESAEAWMRLNIKEKL 211 Query: 211 YKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSE-----LKDKTHQNYKKLITESTLLIHY 265 +K + + LL + +N T+ + L Y +L S L+H+ Sbjct: 212 FK--QEGPVPALLLAL-------HNKTATLDPQWHVRNLGVTAGTQYSRLFVSSAKLLHW 262 Query: 266 TGATKPWHKWAIY 278 +G KPW + Y Sbjct: 263 SGRFKPWSSRSPY 275 >UniRef50_C3XFW0 Glycosyl transferase n=1 Tax=Helicobacter bilis ATCC 43879 RepID=C3XFW0_9HELI Length = 365 Score = 44.7 bits (104), Expect = 0.005, Method: Compositional matrix adjust. Identities = 38/177 (21%), Positives = 74/177 (41%), Gaps = 14/177 (7%) Query: 108 AMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVE--PMQEKA 165 A Y+R+ L +D+ LYLD D++ D+ +L L L+G +AA + Sbjct: 91 AAYYRVKLVDFLPKNVDKCLYLDTDMLVLTDLRELFALNLDGYIAASSSGSPNATISRYG 150 Query: 166 VSRLSDPELLGQ-------YFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDV 218 + R YF SG++ ++ K+W + +A+ L + ++ DQD Sbjct: 151 IYRKKKGGKKAVKSFETSFYFCSGLMLINTKEWIKQNVDIEAMRFLREYET--EFADQDA 208 Query: 219 MNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTL---LIHYTGATKPW 272 +N + L ++ + E T+ ++ K ++ + ++H G K W Sbjct: 209 LNFAMCDRVYNLGEQWGILAYQSLEAACSTNIDFSKRYEKAMINAKILHCNGPAKAW 265 >UniRef50_C5ZVZ7 Putative glycosyltransferase n=3 Tax=Campylobacterales RepID=C5ZVZ7_9HELI Length = 431 Score = 44.7 bits (104), Expect = 0.005, Method: Compositional matrix adjust. 
Identities = 50/191 (26%), Positives = 83/191 (43%), Gaps = 18/191 (9%) Query: 106 SRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKA 165 S Y+RL + + LYLD+D++ D+ +L L L +A ++ D K Sbjct: 120 SHLPYYRLKWQDYIKPAPQKCLYLDSDMLVLCDLRELFALDLKDNIAGIIGDCGSKNRKI 179 Query: 166 VSRLSDPE----LLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNV 221 + ++ + YFNSG + ++ K++ ++ EK + L K K DQD++N Sbjct: 180 KYQENNYKKTFYFDENYFNSGFLLINSKQYIKEQIWEKCEN-LAKKCTYIKAADQDLLNF 238 Query: 222 LLK-GMTLFLPREYN-----TIYTI-KSELKDK---THQNYKKLITESTLLIHYTGATKP 271 + L LP YN +Y + K E K++ T + + K +L HY KP Sbjct: 239 TIPINKRLKLPFAYNFQCITLLYVLCKDECKNRLNYTREAFNKSFKNPKIL-HY--GEKP 295 Query: 272 WHKWAIYPSVK 282 W Y K Sbjct: 296 WRYLQSYQDYK 306 >UniRef50_A4UX79 LPS biosynthesis protein n=2 Tax=Lactobacillaceae RepID=A4UX79_9LACO Length = 186 Score = 44.3 bits (103), Expect = 0.006, Method: Compositional matrix adjust. Identities = 44/154 (28%), Positives = 61/154 (39%), Gaps = 23/154 (14%) Query: 124 DRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKD-----VEPMQEKAVSRLSDPELLGQY 178 DR+LYLDADVVC+ H L G V D Q++A Y Sbjct: 14 DRILYLDADVVCRRPFEDFYHQSLAGTDFVGVLDHYGRWFFHHQQRAFD----------Y 63 Query: 179 FNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIY 238 NSG++ ++L KL + + PDQ MN L K F P +YN Sbjct: 64 INSGMLLMNLDMIRQDKLLARCRECCRKWPMI--MPDQSAMNKLAKHKA-FAPEKYNEQQ 120 Query: 239 TIKSELKDKTHQNYKKLITESTLLIHYTGATKPW 272 ++S+ + KL I +T + KPW Sbjct: 121 DVQSDTVFQHFSTRWKLWP-----IVHTVSVKPW 149 >UniRef50_C2ETF1 Putative uncharacterized protein n=1 Tax=Lactobacillus vaginalis ATCC 49540 RepID=C2ETF1_9LACO Length = 346 Score = 43.1 bits (100), Expect = 0.013, Method: Compositional matrix adjust. 
Identities = 46/155 (29%), Positives = 70/155 (45%), Gaps = 26/155 (16%) Query: 125 RLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVV 184 ++LYLD+DV+ I+ LL+L + +AA VKD L+ P+ N+GVV Sbjct: 47 KVLYLDSDVIINHSITALLNLDFSEPLAA-VKD-----------LNSPD---SEINAGVV 91 Query: 185 YLDLKKWAD-AKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSE 243 Y + K+ ++ L SK K DQ V++ FLPR YN Y + E Sbjct: 92 YFNNPVINQHPKIVDQLLP--ASKQPGLKNADQSVLSNFFYHQAKFLPRTYN--YEVGVE 147 Query: 244 LKDKTHQNYKKLITE-----STLLIHYTGATKPWH 273 H ++I+E +IH+ KPW+ Sbjct: 148 GYAVYHH-IDRIISELARISDPAIIHFDSDDKPWN 181 >UniRef50_Q04CN2 Lipopolysaccharide biosynthesis glycosyltransferase n=19 Tax=Lactobacillus RepID=Q04CN2_LACDB Length = 274 Score = 43.1 bits (100), Expect = 0.014, Method: Compositional matrix adjust. Identities = 47/165 (28%), Positives = 71/165 (43%), Gaps = 22/165 (13%) Query: 112 RLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSR--L 169 RLFA +L + DR+LYLD DV+ + + Q L G V D R Sbjct: 100 RLFADELPQIP-DRILYLDDDVIIRRPVDQFYTQDLTGTELVGVLDY-------FGRFFF 151 Query: 170 SDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLF 229 + + + Y NSGV+ L++ + L ++ ++ K PDQ +N L K + Sbjct: 152 HNQKKIFDYLNSGVLLLNMPEIKRTGLFKRVRHLMQVKKMF--LPDQTAINKLAKEKRI- 208 Query: 230 LPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHY--TGATKPW 272 PR+YN Y ++ D Q++ T S Y T KPW Sbjct: 209 APRKYNEQYALQD---DTVIQHF----TTSFRFFPYFRTQTVKPW 246 >UniRef50_A2E3L1 Glycosyl transferase family 8 protein n=2 Tax=Trichomonas vaginalis RepID=A2E3L1_TRIVA Length = 319 Score = 43.1 bits (100), Expect = 0.014, Method: Compositional matrix adjust. 
Identities = 31/116 (26%), Positives = 53/116 (45%), Gaps = 14/116 (12%) Query: 123 LDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSG 182 L+R LYLD D + +I+ + + A V+ D LS+ E +YFNSG Sbjct: 147 LERFLYLDGDAIVLHNINDMYYYDFQNKSAIVILD----------HLSECEGFSRYFNSG 196 Query: 183 VVYLDLKKWADAKLTEKA---LSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYN 235 V+ + K+ ++A L L V+ + DQ +N + + + P++YN Sbjct: 197 VMMFNNWKYVQENFLKQAEDYLKWLEINRGVW-FNDQTPLNKIFEHNRIEFPQDYN 251 >UniRef50_Q04CN3 Lipopolysaccharide biosynthesis glycosyltransferase n=1 Tax=Lactobacillus delbrueckii subsp. bulgaricus ATCC BAA-365 RepID=Q04CN3_LACDB Length = 200 Score = 42.7 bits (99), Expect = 0.017, Method: Compositional matrix adjust. Identities = 44/166 (26%), Positives = 68/166 (40%), Gaps = 23/166 (13%) Query: 137 GDISQLLHLGLNGAVAAVVKD-----VEPMQEKAVSRLS-DPELLGQYFNSGVVYLDLKK 190 DI+ L L + A D +EP+Q L DP+ +Y NSGV+ ++ Sbjct: 3 ADIAGLYQTELGNNLVAACHDQSVHYIEPLQTYIRDCLGIDPD---KYVNSGVLVMNCLA 59 Query: 191 WADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKG-MTLFLPREYNTIYTIKSELKDKTH 249 D +K L +L + PDQD +N + G + L PR D Sbjct: 60 MRDEDFVDKFLHLLSTYQFNSIAPDQDYLNEICSGRIKLLDPR------------WDAMP 107 Query: 250 QNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKD 295 ++ +T LIHY KPWH + ++++A E +KD Sbjct: 108 NDFDPEMT-GPYLIHYNLFYKPWHFEEVKYGSYFWQVAKETPFYKD 152 >UniRef50_Q1CUZ8 Lipopolysaccharide 1,2-glycosyltransferase n=12 Tax=Helicobacter RepID=Q1CUZ8_HELPH Length = 372 Score = 42.4 bits (98), Expect = 0.024, Method: Compositional matrix adjust. 
Identities = 46/203 (22%), Positives = 84/203 (41%), Gaps = 36/203 (17%) Query: 105 WSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDV------ 158 +S+ + RLF L D+++ DAD + D+S+ + L+ KD Sbjct: 114 FSKMVMCRLFLASLFP-QYDKIIMFDADTLFLNDVSESFFIPLDSYYFGAAKDFASPKSL 172 Query: 159 ---------EPMQEKAVS----RLSDPELLGQ-YFNSGVVYLDLKKWADAKLTEKALSIL 204 EP Q+ ++ + D +++ + ++N G + ++LK W L E+ L++ Sbjct: 173 KHFQTEREREPRQKFSLYEHYLKEKDMKIICENHYNVGFLIVNLKLWRADHLEERLLNLT 232 Query: 205 MSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITE--STLL 262 K P+QD++ + L LP YN N K+ I + ++ Sbjct: 233 HQKGQCVFCPEQDLLTLACYQKVLQLPYIYNA---------HPFMLNQKRFIPDKKEIVM 283 Query: 263 IHYTGATKPWHKWAIYPSVKYYK 285 +H+ KPW I P+ Y K Sbjct: 284 LHFYFVGKPW----ISPTALYSK 302 >UniRef50_A4IXE1 Glycosyl transferase, family 8 n=16 Tax=Francisella RepID=A4IXE1_FRATW Length = 296 Score = 42.0 bits (97), Expect = 0.031, Method: Compositional matrix adjust. Identities = 56/259 (21%), Positives = 112/259 (43%), Gaps = 20/259 (7%) Query: 30 VAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITL 88 + + D N + G V+I S++ + N D Y+ N + E+ + I+ Sbjct: 6 IVFTFDKNIILGGAVTIKSLIDHANPDTCYDIYVYHPNINKKSISAFNSMIEKTKHSISF 65 Query: 89 YRINTDKLQCLPC-TQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGL 147 + ++ + +P T+ ++RL +LL D+++Y D DV+ + D+S++ + L Sbjct: 66 HNVDESIFKDVPIDTRRGWIITFYRLLIPKLLP-QYDKVIYSDVDVLFQSDMSEVYNTDL 124 Query: 148 NG-AVAAVVKDVEPMQEKAVSRLSDPELLGQY-FNSGVVYLDLKKWADAKLTEKALSILM 205 A V+ E Q+ V E Y + G + ++ K + + + Sbjct: 125 TSYEWAGVI--AEKHQQNMVQHKYFKENNNSYIYWPGFMVMNTKLMRENNFISRCFDTMH 182 Query: 206 SKDNVYKYPDQDVMNVLLKGMTLFLPREYNT---IYTIKSELKDKTHQNYKKLITESTLL 262 + K+ D DV+N+ + + LP +Y T IY + + + + K++ +++ LL Sbjct: 183 EFNTRLKFRDLDVLNLTCRKIKS-LPFKYVTLQSIYYLNTIQEAPEYIFLKEIYSDNELL 241 Query: 263 --------IHYTGAT-KPW 272 IHY G+ KPW Sbjct: 242 DAKNNPAIIHYAGSPGKPW 260 >UniRef50_A8W7G8 Glycosyl transferase family protein (Fragment) n=1 Tax=Mucor racemosus RepID=A8W7G8_RHIRA Length = 241 Score = 42.0 bits (97), Expect = 0.033, 
Method: Compositional matrix adjust. Identities = 37/142 (26%), Positives = 66/142 (46%), Gaps = 28/142 (19%) Query: 124 DRLLYLDADVVCKGDISQLLHLGLN-----GAVAAVVKDVEPMQEKAVSRL--------- 169 DRL+ LDAD++ ++ +L+H+ L A A V + + ++ S + Sbjct: 92 DRLVLLDADMLPLQNMDELIHMHLPNKDWVAAAYACVCNPQKIKHYPASWIPENCAYTGR 151 Query: 170 -----SDPELLGQ---YFNSGVVYLDLKKWADAKLTEKALSIL--MSKDNVYKYPDQDVM 219 +DP +G YFNSG++ L D + ++ L +S N+Y +PDQD + Sbjct: 152 DTMACTDPTPIGNKADYFNSGLIVLT----PDTSKFDAMVTYLNSISDLNIYPFPDQDFL 207 Query: 220 NVLLKGMTLFLPREYNTIYTIK 241 N + K + YN + T++ Sbjct: 208 NEIFKTKWKPISYVYNALKTLQ 229 >UniRef50_UPI0001925360 PREDICTED: similar to glycosyltransferase-like 1B n=1 Tax=Hydra magnipapillata RepID=UPI0001925360 Length = 730 Score = 41.6 bits (96), Expect = 0.039, Method: Compositional matrix adjust. Identities = 67/245 (27%), Positives = 103/245 (42%), Gaps = 35/245 (14%) Query: 46 ITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLYRINTDKLQC----LPC 101 I SI+ RH L F+ I+D+ Q + K Q+ ++ Y + +KL+ +P Sbjct: 141 IKSILFYRRH-PLHFHFISDISGRHVLQVLFKTWVLKQVGVSFY--DAEKLKADVDWIPN 197 Query: 102 TQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPM 161 T +L + L L +++ LD DV D+++L N + VE Sbjct: 198 THYSGVYGLMKLTLTRALPEFLSKVIVLDTDVFFLTDLAELWAFFNNFTEDQAIGLVENQ 257 Query: 162 QEKAVSRLSDP----ELLGQYFNSGVVYLDLKK-----WADA-KLT-EKALSILMSKDNV 210 + +L +G+ FN+GV+ DL+K WA +LT EK L L+S Sbjct: 258 SQWYTGKLWKKYKIWPAIGRGFNTGVMLFDLQKLRKFQWAHLWRLTAEKQLLNLLST--- 314 Query: 211 YKYPDQDVMNVLLKG---MTLFLPREYNTIYTIKSELKDKTHQN--YKKLITESTLLIHY 265 DQDV+N LK + LP ++N +L D T Y KLI IH+ Sbjct: 315 -VLADQDVINAALKDNPQIVYKLPCQWNI------QLSDNTESEYCYNKLIELKA--IHW 365 Query: 266 TGATK 270 K Sbjct: 366 NSPNK 370 >UniRef50_UPI00016C0890 glycosyl transferase family 8 protein n=2 Tax=Epulopiscium sp. 'N.t. morphotype B' RepID=UPI00016C0890 Length = 593 Score = 41.6 bits (96), Expect = 0.039, Method: Compositional matrix adjust. 
Identities = 49/228 (21%), Positives = 96/228 (42%), Gaps = 30/228 (13%) Query: 105 WSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDV----EP 160 W +YF+L ++ + L+LD D++ DI+ LL + L G A ++ Sbjct: 367 WQPCVYFKLLIPSIMH-NYKKSLHLDCDLIILEDIANLLSIDLKGNAVAGCAEMGCITTS 425 Query: 161 MQEKAVSRLSDPEL----LGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQ 216 ++ ++ +L + +YFN GV+ ++ ++ K+T A +L + + +Q Sbjct: 426 IRRTWANKYYHEKLRITNMVEYFNGGVIVFNINEF--HKITSLA-QLLHEAEKKHLNLEQ 482 Query: 217 DVMNVLLKGMTLFLPREYN-------TIYTI-KSELKDKTHQNYKKLITESTLLIHYTGA 268 D+++ LP+ +N T+ + K L +Q Y + +IHY G Sbjct: 483 DILSKSFVNHIYLLPQSWNLTRDFLGTVMNLYKQYLPSNIYQKYLD-ARQKPKIIHYIGP 541 Query: 269 TKPWHK---------WAIYPSVKYYKIALENSPWKDDSPRDAKSIIEF 307 KPW W + Y++A+ + K+ S K+ +F Sbjct: 542 LKPWDNPNLEYASYWWDTIRGTEIYEMAINSQIQKNFSENIKKTTKKF 589 >UniRef50_B3XPR8 Glycosyl transferase family 8 n=4 Tax=Lactobacillus RepID=B3XPR8_LACRE Length = 465 Score = 41.6 bits (96), Expect = 0.041, Method: Compositional matrix adjust. Identities = 40/175 (22%), Positives = 75/175 (42%), Gaps = 20/175 (11%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLY 89 +A VD ++D ++ SI +N+ N+ YII +F I + +I Sbjct: 4 IALSVDYRWIDQAETTLKSIYAHNK--NVKTYIINHDIPHEWFVNINRYLGVQDSQIIDR 61 Query: 90 RINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNG 149 +I+ ++ + +P + M + F L + D++LYLD+DV+ ++ QL +N Sbjct: 62 KIDEERFKDMPMPEARISPMVYGKFLIPEL-IPEDQVLYLDSDVIVDKNLDQLFATKIND 120 Query: 150 -AVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSI 203 + VV P Q FNSGV+ ++ W + + + L + Sbjct: 121 RPLYTVVDYFNPSQ----------------FNSGVLLINNLFWRNNNIGNQLLKL 159 >UniRef50_Q8LF94 Avr9/Cf-9 rapidly elicited protein 231 n=13 Tax=Magnoliophyta RepID=Q8LF94_ARATH Length = 351 Score = 41.6 bits (96), Expect = 0.042, Method: Compositional matrix adjust. 
Identities = 43/175 (24%), Positives = 71/175 (40%), Gaps = 12/175 (6%) Query: 110 YFRLFAFQLLGLTLDRLLYLDADVVCKGDISQL--LHLGLNGAVAAVVKDVEPMQEKAVS 167 Y R + LL + R++YLD+D++ DI++L LG + +AA S Sbjct: 152 YARSYLADLLPPCVRRVVYLDSDLILVDDIAKLAATDLGRDSVLAAPEYCNANFTSYFTS 211 Query: 168 RL-SDPELL-------GQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVM 219 S+P L YFN+GV+ +DL +W + T + + + + Y + Sbjct: 212 TFWSNPTLSLTFADRKACYFNTGVMVIDLSRWREGAYTSRIEEWMAMQKRMRIYELGSLP 271 Query: 220 NVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTGATKPWHK 274 LL L P N + D + L L+H++G KPW + Sbjct: 272 PFLLVFAGLIKP--VNHRWNQHGLGGDNFRGLCRDLHPGPVSLLHWSGKGKPWAR 324 >UniRef50_Q9H1C3 Glycosyltransferase 8 domain-containing protein 2 n=29 Tax=Euteleostomi RepID=GL8D2_HUMAN Length = 349 Score = 41.6 bits (96), Expect = 0.046, Method: Compositional matrix adjust. Identities = 64/275 (23%), Positives = 109/275 (39%), Gaps = 45/275 (16%) Query: 41 GVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLYRINT------- 93 G ++ + + +N N+ FY++ + N +I K E ++LR ++I Sbjct: 63 GATMAAINSIYSNTDANILFYVVG-LRNT--LTRIRKWIEHSKLREINFKIVEFNPMVLK 119 Query: 94 DKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLN-GAVA 152 K++ + + F F LL ++++YLD DV+ +GDI +L L G A Sbjct: 120 GKIRPDSSRPELLQPLNFVRFYLPLLIHQHEKVIYLDDDVIVQGDIQELYDTTLALGHAA 179 Query: 153 AVVKDVEPMQEKAVSRLSDPELLGQY----------------------FNSGVVYLDLKK 190 A D + + ++RL L Y FN GV+ ++ + Sbjct: 180 AFSDDCDLPSAQDINRLVG--LQNTYMGYLDYRKKAIKDLGISPSTCSFNPGVIVANMTE 237 Query: 191 WADAKLTEKALSILMSKDNVYK--YPDQDVMNVLLKGMTLFLPREYNTIYTI--KSELKD 246 W ++T K L M K NV + Y V M + +Y+TI + L Sbjct: 238 WKHQRIT-KQLEKWMQK-NVEENLYSSSLGGGVATSPMLIVFHGKYSTINPLWHIRHLGW 295 Query: 247 KTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSV 281 Y + + L+H+ G KPW +PSV Sbjct: 296 NPDARYSEHFLQEAKLLHWNGRHKPWD----FPSV 326 >UniRef50_A3LQ29 Glycogenin glucosyltransferase n=3 Tax=Saccharomycetales RepID=A3LQ29_PICST Length = 411 Score = 41.2 bits (95), Expect = 0.048, Method: Compositional matrix adjust. 
Identities = 43/157 (27%), Positives = 63/157 (40%), Gaps = 29/157 (18%) Query: 124 DRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGV 183 D L+YLD+D + D+ L KD+ Q A P++ FNSGV Sbjct: 99 DTLIYLDSDTLPLADLDHLFE---------EYKDLTAEQIAASPDAGWPDI----FNSGV 145 Query: 184 VYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMT-----LFLPREYNTI- 237 + L DA + K L +N + DQ ++N + + LP YN Sbjct: 146 LVLK----PDADVFSKLLEFTTVDNNTFDGADQGLLNEFFNVASAGKNWVRLPYVYNVTP 201 Query: 238 -YTIKSELKDKTHQNYKKLITESTLLIHYTGATKPWH 273 Y+ + H+ + S L+HY G TKPWH Sbjct: 202 NYSGAYQYLPALHRFFS-----SIKLLHYIGQTKPWH 233 >UniRef50_A2DXT6 Glycosyl transferase family 8 protein n=1 Tax=Trichomonas vaginalis RepID=A2DXT6_TRIVA Length = 334 Score = 41.2 bits (95), Expect = 0.058, Method: Compositional matrix adjust. Identities = 46/188 (24%), Positives = 72/188 (38%), Gaps = 33/188 (17%) Query: 123 LDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSG 182 L+R L LD D + G + N A VV D+ + E YFN G Sbjct: 159 LERFLQLDGDTLVTGSFDEFYFAYFNDTYAVVVLDI----------WKEYEGFKNYFNCG 208 Query: 183 VVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKS 242 V + +K+ D K+ +K + L + + D VL N I+ K Sbjct: 209 SVVFNCQKFRDDKMADKVRTKLKEYEVTRGEWNND-QTVL------------NDIFGDKK 255 Query: 243 ELKDKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKI------ALENS--PWK 294 + K + + +T T + H+ G K +K I + Y+++ NS PW+ Sbjct: 256 IIAHKKYNEFMPSLTMQTRIFHFYGLKKKPYKPNIKSNKYYFRLWRCYFHYFNNSINPWE 315 Query: 295 DDSPRDAK 302 D RD K Sbjct: 316 FD--RDTK 321 >UniRef50_Q5Z7P2 Os06g0727300 protein n=5 Tax=Magnoliophyta RepID=Q5Z7P2_ORYSJ Length = 601 Score = 40.8 bits (94), Expect = 0.065, Method: Compositional matrix adjust. 
Identities = 50/189 (26%), Positives = 74/189 (39%), Gaps = 35/189 (18%) Query: 123 LDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEP----------MQEKAVSRLSDP 172 LD++L+LD DVV + D++ L + L G V V+ + +S DP Sbjct: 418 LDKILFLDDDVVVQKDLTPLWDVDLKGIVNGAVETCKESFHRFNTYLNFSHPKISENFDP 477 Query: 173 ELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKY-PDQDVMNVLLKGMTLFLP 231 G F G+ DLK+W +T +Y Y D + L K T LP Sbjct: 478 HACGWAF--GMNMFDLKEWKKQNIT-----------GIYHYWQDLNEDRKLWKLDT--LP 522 Query: 232 REYNTIYTIKSELKDKTH-------QNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYY 284 T Y + L H + + E+ ++HY G KPW AI Y+ Sbjct: 523 PGLITFYNLTYPLNRTWHVLGLGYDPSVDLVEIENAAVVHYNGNYKPWLDLAISKYKPYW 582 Query: 285 K--IALENS 291 + L+NS Sbjct: 583 SKYVDLDNS 591 >UniRef50_C8Q828 Glycosyl transferase family 8 n=4 Tax=Enterobacteriaceae RepID=C8Q828_9ENTR Length = 278 Score = 40.8 bits (94), Expect = 0.069, Method: Compositional matrix adjust. Identities = 58/219 (26%), Positives = 93/219 (42%), Gaps = 43/219 (19%) Query: 103 QVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQL--LHLGLNGAVAAVVKDVEP 160 +VWS+ L ++L G +R+++LDAD++ ++ +L L LG A P Sbjct: 79 EVWSK-----LRVWELTGC--ERVVFLDADMLVLRNMDELFTLDLGDYALAACHACRCNP 131 Query: 161 MQEKAVSRLSDPEL---------------LGQYFNSGVVYLDLKKWADAKLTEKALSILM 205 Q + PE L Y N G + L + +L EK +I Sbjct: 132 NQIASYPASWQPEHCHYTWQERQQPAPANLDLYLNGGFLVLKPDEAVFRQLQEKVTAI-- 189 Query: 206 SKDNVYKYP--DQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLI 263 D++ +YP +QD++N + G L LP YN + T+ + H + K + Sbjct: 190 --DDLRRYPFSEQDLLNEVFAGRWLPLPYIYNALKTLPFQHPQMWHADEVK-------NL 240 Query: 264 HYTGATKPWHKWAIYPSV---KYYKIALENSPWKDDSPR 299 HY A KPW + P + +YY AL+ W+ S R Sbjct: 241 HYILA-KPWKRDLCQPEMERDRYY--ALDKLWWQMASSR 276 >UniRef50_A5BZU1 Putative uncharacterized protein n=1 Tax=Vitis vinifera RepID=A5BZU1_VITVI Length = 648 Score = 40.8 bits (94), Expect = 0.072, Method: Compositional matrix adjust. 
Identities = 44/173 (25%), Positives = 70/173 (40%), Gaps = 33/173 (19%) Query: 123 LDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEP----------MQEKAVSRLSDP 172 L+++L+LD D+V + D++ L L + G V A V+ + +S DP Sbjct: 466 LEKILFLDDDIVVQKDLTPLWSLDMQGMVNAAVETCKESFHRFDKYLNFSHPKISENFDP 525 Query: 173 ELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKY-PDQDVMNVLLKGMTLFLP 231 G F G+ DLK+W +T +Y Y D + L K + LP Sbjct: 526 NACGWAF--GMNMFDLKEWRKRNMT-----------GIYHYWQDMNEDRTLWKLGS--LP 570 Query: 232 REYNTIYTIKSELKDKTHQ---NYKKLITESTL----LIHYTGATKPWHKWAI 277 T Y + L H Y + ++ + ++HY G KPW + AI Sbjct: 571 PGLITFYNLTYPLDRSWHVLGLGYDPQLNQTEIDNAAVVHYNGNYKPWLELAI 623 >UniRef50_Q9LE59 Like glycosyl transferase 1 n=35 Tax=Embryophyta RepID=Q9LE59_ARATH Length = 673 Score = 40.8 bits (94), Expect = 0.078, Method: Compositional matrix adjust. Identities = 41/173 (23%), Positives = 70/173 (40%), Gaps = 33/173 (19%) Query: 123 LDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEP----------MQEKAVSRLSDP 172 L+++L+LD D++ + D++ L + LNG V V+ ++R +P Sbjct: 491 LNKILFLDDDIIVQKDLTPLWEVNLNGKVNGAVETCGESFHRFDKYLNFSNPHIARNFNP 550 Query: 173 ELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVY-KYPDQDVMNVLLKGMTLFLP 231 G + G+ DLK+W +T +Y K+ + + L K T LP Sbjct: 551 NACGWAY--GMNMFDLKEWKKRDIT-----------GIYHKWQNMNENRTLWKLGT--LP 595 Query: 232 REYNTIYTIKSELKDKTH-------QNYKKLITESTLLIHYTGATKPWHKWAI 277 T Y + L H + K E+ ++HY G KPW + A+ Sbjct: 596 PGLITFYGLTHPLNKAWHVLGLGYNPSIDKKDIENAAVVHYNGNMKPWLELAM 648 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P27129 Lipopolysaccharide 1,2-glucosyltransferase n=44 ... 436 e-121 UniRef50_D0KD54 Lipopolysaccharide glucosyltransferase I n=2 Tax... 317 4e-85 UniRef50_UPI00019F16C6 hypothetical protein CATC2_20202 n=1 Tax=... 313 6e-84 UniRef50_Q9R9D1 UDP-glucose:(Glucosyl) LPS alpha1,3-glucosyltran... 313 7e-84 UniRef50_A8ARL6 Putative uncharacterized protein n=3 Tax=Citroba... 
309 1e-82 UniRef50_B2PV91 Putative uncharacterized protein n=2 Tax=Enterob... 306 7e-82 UniRef50_Q9ZIS6 UDP-galactose:(Glucosyl) LPS alpha1,2-galactosyl... 306 8e-82 UniRef50_Q9ZIT4 UDP-galactose:(Glucosyl) LPS alpha1,3-galactosyl... 301 2e-80 UniRef50_D2U322 Lipopolysaccharide 1,2-glucosyltransferase n=1 T... 296 1e-78 UniRef50_D0KD53 Lipopolysaccharide 3-alpha-galactosyltransferase... 295 1e-78 UniRef50_Q9ZIT6 UDP-glucose:(Galactosyl) LPS alpha1,2-glucosyltr... 292 1e-77 UniRef50_B6XJW1 Putative uncharacterized protein n=1 Tax=Provide... 292 2e-77 UniRef50_D2TY85 UDP-D-galactose:(Glucosyl)lipopolysaccharide-alp... 289 1e-76 UniRef50_B6XJW2 Putative uncharacterized protein n=1 Tax=Provide... 289 1e-76 UniRef50_D2TIX6 UDP-galactose:(Glucosyl) LPS alpha-1,3-galactosy... 288 2e-76 UniRef50_Q9ZIS1 UDP-galactose:(Galactosyl) LPS alpha1,2-galactos... 287 3e-76 UniRef50_Q46Y64 Glycosyl transferase, family 8 n=1 Tax=Ralstonia... 279 7e-74 UniRef50_C1DGU7 Lipopolysaccharide 1,3-galactosyltransferase n=1... 279 1e-73 UniRef50_D1PTN4 Putative uncharacterized protein n=1 Tax=Prevote... 274 3e-72 UniRef50_UPI0001B4A0A4 putative glucosyltransferase n=1 Tax=Bact... 274 4e-72 UniRef50_P27128 Lipopolysaccharide 1,3-galactosyltransferase n=4... 272 1e-71 UniRef50_UPI000197C525 UDP-D-galactose:(glucosyl) lipopolysaccha... 267 3e-70 UniRef50_C7IBC1 Glycosyl transferase family 8 n=2 Tax=Clostridiu... 262 1e-68 UniRef50_A1BHG0 Glycosyl transferase, family 8 n=2 Tax=Chlorobiu... 257 4e-67 UniRef50_Q64ZV2 Lipopolysaccharide 1,2-glucosyltransferase n=8 T... 257 5e-67 UniRef50_C7QL87 Glycosyl transferase family 8 n=2 Tax=Cyanothece... 253 8e-66 UniRef50_Q5LF36 Putative glucosyltransferase n=1 Tax=Bacteroides... 250 4e-65 UniRef50_A7VNX5 Putative uncharacterized protein n=1 Tax=Clostri... 248 2e-64 UniRef50_C3X7M2 Lipopolysaccharide 3-alpha-galactosyltransferase... 248 3e-64 UniRef50_P25148 General stress protein A n=3 Tax=Bacillus subtil... 
247 3e-64 UniRef50_B3Q568 Putative glycosyltransferase protein n=4 Tax=Rhi... 244 3e-63 UniRef50_D2RIJ4 Glycosyl transferase family 8 n=2 Tax=Acidaminoc... 244 4e-63 UniRef50_C2HBB9 Family 8 glycosyltransferase n=35 Tax=Lactobacil... 243 7e-63 UniRef50_D1P7H1 Lipopolysaccharide 1,2-glucosyltransferase n=1 T... 243 9e-63 UniRef50_C2HBB8 Family 8 glycosyltransferase n=13 Tax=Lactobacil... 242 1e-62 UniRef50_Q38VK7 Putative bifunctional glycosyl transferase, fami... 240 6e-62 UniRef50_B7AIV0 Putative uncharacterized protein n=1 Tax=Bactero... 239 1e-61 UniRef50_C5ELK9 Glycosyl transferase n=3 Tax=Clostridiales RepID... 238 2e-61 UniRef50_A8ARL4 Putative uncharacterized protein n=2 Tax=Citroba... 238 3e-61 UniRef50_A4ECW2 Putative uncharacterized protein n=1 Tax=Collins... 238 3e-61 UniRef50_Q116W1 Glycosyl transferase, family 8 n=1 Tax=Trichodes... 237 4e-61 UniRef50_C3PWZ8 Lipopolysaccharide 1,2-glucosyltransferase n=1 T... 236 7e-61 UniRef50_C4VEI8 General stress protein A n=24 Tax=Enterococcus R... 235 1e-60 UniRef50_D2ELM0 Lipopolysaccharide biosynthesis glycosyltransfer... 235 2e-60 UniRef50_A5ZFA9 Putative uncharacterized protein n=1 Tax=Bactero... 234 3e-60 UniRef50_B4RJG6 LgtC n=29 Tax=Neisseria RepID=B4RJG6_NEIG2 232 2e-59 UniRef50_Q9L6B2 Putative glycosyl transferase n=3 Tax=Pasteurell... 229 1e-58 UniRef50_C0WCJ1 Lipopolysaccharide 1,2-glucosyltransferase n=1 T... 229 2e-58 UniRef50_B3CA80 Putative uncharacterized protein n=1 Tax=Bactero... 228 2e-58 UniRef50_B8ISQ5 Glycosyl transferase family 8 n=1 Tax=Methylobac... 227 3e-58 UniRef50_B3Z5I6 Glycosyltransferase family 8 n=2 Tax=Bacillus ce... 227 3e-58 UniRef50_UPI000196958D hypothetical protein BACCELL_01586 n=1 Ta... 227 3e-58 UniRef50_B2UPJ4 Glycosyl transferase family 8 n=1 Tax=Akkermansi... 226 1e-57 UniRef50_UPI0001A457E5 hypothetical protein NsubN_08151 n=1 Tax=... 224 4e-57 UniRef50_C0QZN2 Glycosyl transferase, family 8 n=3 Tax=Brachyspi... 
223 6e-57 UniRef50_Q03HK5 Lipopolysaccharide biosynthesis glycosyltransfer... 223 8e-57 UniRef50_B3XL28 Glycosyl transferase family 8 n=1 Tax=Lactobacil... 223 9e-57 UniRef50_C0Z4I4 Putative uncharacterized protein gspA n=1 Tax=Br... 222 1e-56 UniRef50_A6LGX5 Glycosyltransferase family 8 n=1 Tax=Parabactero... 221 2e-56 UniRef50_D2QX94 Glycosyl transferase family 8 n=1 Tax=Pirellula ... 221 3e-56 UniRef50_C7IBC8 Glycosyl transferase family 8 n=1 Tax=Clostridiu... 220 4e-56 UniRef50_D0D9G3 Putative general stress protein A n=1 Tax=Citrei... 220 4e-56 UniRef50_A1XRC1 Glycosyltransferase n=1 Tax=Haemophilus ducreyi ... 220 8e-56 UniRef50_B1Y723 Glycosyl transferase family 8 n=1 Tax=Leptothrix... 219 8e-56 UniRef50_P43974 Putative glycosyltransferase HI0258 n=32 Tax=Hae... 218 2e-55 UniRef50_B7KM20 Glycosyl transferase family 8 n=1 Tax=Cyanothece... 218 2e-55 UniRef50_B9KVD4 Glycosyl transferase, family 8 n=2 Tax=Rhodobact... 217 3e-55 UniRef50_C1IBL0 Glycosyl transferase n=1 Tax=Clostridium sp. 7_2... 217 3e-55 UniRef50_C6I3U6 Putative uncharacterized protein n=2 Tax=Bactero... 217 4e-55 UniRef50_B1MX28 Lipopolysaccharide biosynthesis protein, LPS:gly... 217 6e-55 UniRef50_C1QBZ8 LPS:glycosyltransferase n=1 Tax=Brachyspira murd... 216 1e-54 UniRef50_D2QX95 Glycosyl transferase family 8 n=1 Tax=Pirellula ... 216 1e-54 UniRef50_C2KV37 Lipopolysaccharide biosynthesis protein, LPS:gly... 215 1e-54 UniRef50_A7A7B4 Putative uncharacterized protein n=2 Tax=Bifidob... 215 2e-54 UniRef50_B7GNT4 Glycosyl transferase, family 8 n=5 Tax=Bifidobac... 214 3e-54 UniRef50_B8G232 Glycosyl transferase family 8 n=2 Tax=Desulfitob... 213 5e-54 UniRef50_B2ISC2 Glycosyl transferase, family 8 n=2 Tax=Streptoco... 213 9e-54 UniRef50_C5ZV11 Glycosyl transferase n=2 Tax=Helicobacter canade... 213 1e-53 UniRef50_B6G807 Putative uncharacterized protein n=2 Tax=Collins... 212 1e-53 UniRef50_B7C7N8 Putative uncharacterized protein n=2 Tax=Firmicu... 
212 2e-53 UniRef50_B6GCA0 Putative uncharacterized protein n=1 Tax=Collins... 212 2e-53 UniRef50_Q5WI33 Lipopolysaccharide glycosyltransferase n=6 Tax=F... 211 2e-53 UniRef50_A4VVV8 Lipopolysaccharide biosynthesis proteins, LPS:gl... 211 3e-53 UniRef50_B2UQ54 Glycosyl transferase family 8 n=1 Tax=Akkermansi... 210 7e-53 UniRef50_A0KQP2 Glycosyl transferase, family 8 n=2 Tax=Aeromonas... 209 1e-52 UniRef50_C8W7U9 Glycosyl transferase family 8 n=2 Tax=Atopobium ... 208 2e-52 UniRef50_UPI0001A45357 glycosyl transferase family protein n=1 T... 208 3e-52 UniRef50_C9PNX4 Family 8 glycosyl transferase n=1 Tax=Pasteurell... 207 4e-52 UniRef50_B2ISC6 Glycosyl transferase, family 2/glycosyl transfer... 206 1e-51 UniRef50_B6VUC8 Putative uncharacterized protein n=1 Tax=Bactero... 206 1e-51 UniRef50_C9RWX3 Glycosyl transferase family 8 n=8 Tax=Bacillacea... 206 1e-51 UniRef50_C1QC00 LPS:glycosyltransferase n=1 Tax=Brachyspira murd... 205 2e-51 UniRef50_D1JY84 Putative uncharacterized protein n=1 Tax=Bactero... 205 3e-51 UniRef50_UPI0001AEC697 glycosyl transferase family protein n=1 T... 204 3e-51 UniRef50_C1QC03 LPS:glycosyltransferase n=1 Tax=Brachyspira murd... 204 4e-51 UniRef50_Q65VF6 RfaJ protein n=1 Tax=Mannheimia succiniciproduce... 203 8e-51 UniRef50_Q3DNS6 Glycosyl transferase, family 8 n=5 Tax=Streptoco... 203 8e-51 UniRef50_Q2K5X3 Galactosyltransferase protein n=13 Tax=Rhizobium... 203 9e-51 UniRef50_A3CM53 Glycosyltransferase, putative n=9 Tax=Bacteria R... 202 1e-50 UniRef50_D2RIJ7 Glycosyl transferase family 8 n=1 Tax=Acidaminoc... 202 2e-50 UniRef50_C6LDU2 Glycosyl transferase, family 8 n=3 Tax=Firmicute... 202 2e-50 UniRef50_C8N7M8 Putative uncharacterized protein n=1 Tax=Cardiob... 201 3e-50 UniRef50_Q3D426 Glycosyl transferase, family 8 n=7 Tax=Streptoco... 201 3e-50 UniRef50_Q3D427 Glycosyl transferase, family 8 n=8 Tax=Streptoco... 201 4e-50 UniRef50_B1I7N1 Glycosyl transferase, family 8 n=8 Tax=Streptoco... 
200 5e-50 UniRef50_A5LNA9 Glycosyl transferase, family 8 n=2 Tax=Streptoco... 200 6e-50 UniRef50_Q9L7A2 Putative glycosyl transferase n=1 Tax=Haemophilu... 200 7e-50 UniRef50_C5S494 Putative glycosyl transferase n=2 Tax=Actinobaci... 200 7e-50 UniRef50_C7RG54 Glycosyl transferase family 8 n=1 Tax=Anaerococc... 200 8e-50 UniRef50_A3VFX3 Glycosyltransferase n=1 Tax=Rhodobacterales bact... 198 2e-49 UniRef50_C0BAU3 Putative uncharacterized protein n=1 Tax=Coproco... 198 2e-49 UniRef50_Q3DNA2 Glycosyl transferase, family 8 n=9 Tax=Streptoco... 198 3e-49 UniRef50_A8FNA2 Putative uncharacterized protein n=2 Tax=Campylo... 197 5e-49 UniRef50_Q4HGS8 General stress protein A, putative n=2 Tax=Campy... 197 5e-49 UniRef50_D1Y7U2 Glycosyltransferase, family 8 n=1 Tax=Pyramidoba... 196 9e-49 UniRef50_Q0I2Z7 Lipopolysaccharide biosynthesis protein, glycosy... 196 1e-48 UniRef50_C5S3F7 Putative glycosyl transferase n=2 Tax=Actinobaci... 196 1e-48 UniRef50_Q4JZJ9 Putative glycosyl transferase n=8 Tax=Streptococ... 195 1e-48 UniRef50_C8WAA9 Glycosyl transferase family 8 n=2 Tax=Atopobium ... 195 2e-48 UniRef50_D2LIH7 Glycosyl transferase family 8 n=1 Tax=Rhodomicro... 195 2e-48 UniRef50_Q3DM64 Glycosyl transferase, family 8, degenerate n=6 T... 194 4e-48 UniRef50_C1CFZ1 Glycosyl transferase, family 8 n=17 Tax=Streptoc... 193 5e-48 UniRef50_C2LRU0 Glycosyl transferase, family 8 n=1 Tax=Streptoco... 193 7e-48 UniRef50_B1I7M9 Glycosyl transferase, family 8 n=16 Tax=Streptoc... 193 8e-48 UniRef50_B9BAZ6 Glycosyl transferase family 8 protein n=1 Tax=Bu... 192 1e-47 UniRef50_A4UX76 Putative LPS biosynthesis protein n=2 Tax=Lactob... 192 2e-47 UniRef50_A5UC07 Aspartate-semialdehyde dehydrogenase n=7 Tax=Hae... 192 2e-47 UniRef50_B3WD33 WbbM protein n=8 Tax=Lactobacillus RepID=B3WD33_... 192 2e-47 UniRef50_A5EVI8 Glycosyl transferase family 8 protein n=1 Tax=Di... 190 8e-47 UniRef50_Q5M3K9 Glycosyl transferase n=4 Tax=Streptococcus RepID... 
189 1e-46 UniRef50_UPI0001693121 general stress protein n=1 Tax=Paenibacil... 188 2e-46 UniRef50_B0NR59 Putative uncharacterized protein n=1 Tax=Bactero... 186 8e-46 UniRef50_B9QZ95 Glycosyl transferase family 8 n=1 Tax=Labrenzia ... 186 9e-46 UniRef50_C1QEC6 LPS:glycosyltransferase n=1 Tax=Brachyspira murd... 186 9e-46 UniRef50_B1LK07 Glycosyl transferase family 8 n=10 Tax=Enterobac... 186 1e-45 UniRef50_B2UQN6 Glycosyl transferase family 8 n=1 Tax=Akkermansi... 185 2e-45 UniRef50_A8AY72 Glycosyl transferase, family 8 SP1766 n=7 Tax=St... 184 3e-45 UniRef50_A5VK24 Glycosyl transferase, family 8 n=18 Tax=Lactobac... 183 5e-45 UniRef50_C6EQF4 Putative uncharacterized protein n=3 Tax=Campylo... 183 6e-45 UniRef50_C7HS13 Family 8 glycosyl transferase n=2 Tax=Anaerococc... 183 9e-45 UniRef50_C4Z1V1 Putative uncharacterized protein n=1 Tax=Eubacte... 183 1e-44 UniRef50_C0AA16 Lipopolysaccharide biosynthesis protein LPS:glyc... 182 2e-44 UniRef50_A3V3C9 Putative uncharacterized protein n=1 Tax=Loktane... 181 2e-44 UniRef50_A1VG39 Glycosyl transferase, family 8 n=1 Tax=Desulfovi... 180 9e-44 UniRef50_C0X9Z7 Putative uncharacterized protein n=2 Tax=Lactoba... 179 1e-43 UniRef50_Q50FU8 Cj81-079 (Fragment) n=6 Tax=Campylobacter jejuni... 179 1e-43 UniRef50_C4ZG45 Glycosyltransferase Family 2 modular protein n=1... 177 6e-43 UniRef50_Q046Z9 Lipopolysaccharide biosynthesis glycosyltransfer... 177 6e-43 UniRef50_Q38VG7 Putative glycosyl transferase, family 8 n=1 Tax=... 176 9e-43 UniRef50_C6IB51 Glycosyltransferase n=4 Tax=Bacteroides RepID=C6... 176 1e-42 UniRef50_A4WWT5 Glycosyl transferase, family 8 n=1 Tax=Rhodobact... 176 1e-42 UniRef50_C3XKY2 Glycosyl transferase n=2 Tax=Campylobacterales R... 175 2e-42 UniRef50_C6X2V2 Putative glycosyltransferase n=1 Tax=Flavobacter... 175 2e-42 UniRef50_C5EZG9 Glycosyl transferase family protein n=1 Tax=Heli... 175 2e-42 UniRef50_C7XX93 Glycosyl transferase, family 8 n=1 Tax=Lactobaci... 
174 3e-42 UniRef50_D1N145 Glycosyl transferase family 8 n=1 Tax=Victivalli... 173 7e-42 UniRef50_B7AUG6 Putative uncharacterized protein n=1 Tax=Bactero... 172 1e-41 UniRef50_A7H2M2 Glycosyl transferase family 8 n=3 Tax=Campylobac... 171 3e-41 UniRef50_UPI00016B2258 glycosyl transferase, family 8 n=1 Tax=ca... 170 5e-41 UniRef50_A0ZYL4 Putative uncharacterized protein n=1 Tax=Archaea... 170 5e-41 UniRef50_B8PIH6 Predicted protein n=2 Tax=Agaricomycetes RepID=B... 170 7e-41 UniRef50_C5WAK3 Ybl156 protein n=2 Tax=Enterobacteriaceae RepID=... 170 1e-40 UniRef50_Q1RIL1 Lipopolysaccharide 1,2-glucosyltransferase RfaJ ... 166 9e-40 UniRef50_A8LP95 Lipopolysaccharide 1 n=1 Tax=Dinoroseobacter shi... 166 1e-39 UniRef50_D2MYR1 Putative uncharacterized protein n=1 Tax=Campylo... 165 2e-39 UniRef50_C3XGD2 Putative uncharacterized protein n=1 Tax=Helicob... 165 2e-39 UniRef50_C6DEN1 Glycosyl transferase family 8 n=1 Tax=Pectobacte... 162 1e-38 UniRef50_A3YS36 Putative sugar transferase n=2 Tax=Campylobacter... 162 1e-38 UniRef50_Q16CW9 Lipopolysaccharide 1,3-galactosyltransferase, pu... 161 2e-38 UniRef50_O48684 F3I6.10 protein n=46 Tax=Embryophyta RepID=O4868... 160 5e-38 UniRef50_C0X9Z8 Glycosyltransferase n=1 Tax=Lactobacillus gasser... 159 1e-37 UniRef50_C6IJ37 General stress protein A n=2 Tax=Bacteroides Rep... 158 4e-37 UniRef50_Q17VR5 Lipopolysaccharide biosynthesis protein n=19 Tax... 157 5e-37 UniRef50_C5SH34 Glycosyl transferase family 8 n=1 Tax=Asticcacau... 152 1e-35 UniRef50_C6DEN3 Glycosyl transferase family 8 n=1 Tax=Pectobacte... 152 2e-35 UniRef50_C6DEN2 Glycosyl transferase family 8 n=1 Tax=Pectobacte... 151 3e-35 UniRef50_B9HMR5 Glycosyltransferase, CAZy family GT8 n=25 Tax=Ma... 151 3e-35 UniRef50_D0IR33 LPS 1,2-glycosyltransferase n=3 Tax=Helicobacter... 150 8e-35 UniRef50_UPI0000587C70 PREDICTED: similar to MGC81998 protein n=... 148 3e-34 UniRef50_B6ACJ0 Glycosyl transferase family 8 protein, putative ... 
143 1e-32 UniRef50_B9ADW8 Putative uncharacterized protein n=1 Tax=Methano... 141 4e-32 UniRef50_B3WD32 Glycosyl transferase n=9 Tax=Lactobacillus RepID... 140 7e-32 UniRef50_C0EQT1 Putative uncharacterized protein n=1 Tax=Neisser... 140 1e-31 UniRef50_Q92VQ2 Putative lipopolysaccharide 1,3-galactosyltransf... 139 1e-31 UniRef50_B9KUH7 Lipopolysaccharide 3-alpha-galactosyltransferase... 139 1e-31 UniRef50_A9SH80 Predicted protein n=2 Tax=Physcomitrella patens ... 139 2e-31 UniRef50_Q1CSY7 Lipopolysaccharide 1,2-glycosyltransferase n=3 T... 138 2e-31 UniRef50_C7TIE0 Glycosyl transferase, group 8 n=2 Tax=Lactobacil... 138 3e-31 UniRef50_A7H2X4 Glycosyl transferase family 8 n=2 Tax=Campylobac... 138 3e-31 UniRef50_A2RLV8 Putative glycosyltransferase n=1 Tax=Lactococcus... 138 4e-31 UniRef50_A4SAB5 Predicted protein (Fragment) n=1 Tax=Ostreococcu... 136 8e-31 UniRef50_UPI0001B55E75 hypothetical protein SSPB78_11600 n=1 Tax... 134 6e-30 UniRef50_C3XN62 Glycosyl transferase n=1 Tax=Helicobacter wingha... 133 1e-29 UniRef50_Q68CQ7 Glycosyltransferase 8 domain-containing protein ... 129 1e-28 UniRef50_Q2RB54 Glycosyl transferase family 8 protein, expressed... 129 2e-28 UniRef50_Q062P6 DNA mismatch repair protein n=1 Tax=Synechococcu... 124 6e-27 UniRef50_B4WN64 Glycosyl transferase family 8 n=1 Tax=Synechococ... 123 1e-26 UniRef50_A9UXT0 Predicted protein (Fragment) n=1 Tax=Monosiga br... 120 1e-25 UniRef50_A5DLS6 Putative uncharacterized protein n=2 Tax=Pichia ... 113 9e-24 UniRef50_A2DBB6 Putative uncharacterized protein n=1 Tax=Trichom... 104 4e-21 Sequences not found previously or not previously below threshold: UniRef50_C5ZVZ7 Putative glycosyltransferase n=3 Tax=Campylobact... 165 2e-39 UniRef50_A4IXE1 Glycosyl transferase, family 8 n=16 Tax=Francise... 156 9e-37 UniRef50_B3XPR8 Glycosyl transferase family 8 n=4 Tax=Lactobacil... 156 1e-36 UniRef50_Q1CUZ8 Lipopolysaccharide 1,2-glycosyltransferase n=12 ... 
152 2e-35 UniRef50_D1IFB6 Whole genome shotgun sequence of line PN40024, s... 151 4e-35 UniRef50_C3XFW0 Glycosyl transferase n=1 Tax=Helicobacter bilis ... 146 1e-33 UniRef50_Q39T65 Glycosyl transferase, family 8 n=1 Tax=Geobacter... 145 2e-33 UniRef50_UPI00016C0890 glycosyl transferase family 8 protein n=2... 143 6e-33 UniRef50_UPI000197AD97 hypothetical protein BACCOPRO_03221 n=1 T... 143 1e-32 UniRef50_A9S2B3 Predicted protein (Fragment) n=1 Tax=Physcomitre... 137 6e-31 UniRef50_C3YRN2 Putative uncharacterized protein (Fragment) n=1 ... 133 1e-29 UniRef50_D1IU75 Whole genome shotgun sequence of line PN40024, s... 132 1e-29 UniRef50_Q8LF94 Avr9/Cf-9 rapidly elicited protein 231 n=13 Tax=... 124 5e-27 UniRef50_UPI0001621115 predicted protein n=1 Tax=Physcomitrella ... 122 2e-26 UniRef50_C7TID9 Glycosyl transferase, group 8 n=2 Tax=Lactobacil... 120 8e-26 UniRef50_C7PRU3 Glycosyl transferase family 8 n=1 Tax=Chitinopha... 119 1e-25 UniRef50_Q02ZT7 Lipopolysaccharide biosynthesis glycosyltransfer... 115 2e-24 UniRef50_Q9H1C3 Glycosyltransferase 8 domain-containing protein ... 114 3e-24 UniRef50_A9UZX9 Predicted protein (Fragment) n=1 Tax=Monosiga br... 114 4e-24 UniRef50_B3XPR6 Putative uncharacterized protein n=3 Tax=Lactoba... 114 6e-24 UniRef50_A5BZU1 Putative uncharacterized protein n=1 Tax=Vitis v... 113 1e-23 UniRef50_D1HWZ1 Whole genome shotgun sequence of line PN40024, s... 111 3e-23 UniRef50_Q2L3C5 Glycosyl transferase-like protein n=3 Tax=Magnol... 110 8e-23 UniRef50_Q04CN2 Lipopolysaccharide biosynthesis glycosyltransfer... 109 1e-22 UniRef50_C5FDY7 Glycogenin n=1 Tax=Microsporum canis CBS 113480 ... 109 1e-22 UniRef50_B6JNQ8 Lipopolysaccharide 1,2-glucosyltransferase n=18 ... 109 1e-22 UniRef50_Q6Z5D6 Glycosyltransferase family-like n=6 Tax=Poaceae ... 109 2e-22 UniRef50_Q9FX71 T6J4.1 protein n=2 Tax=rosids RepID=Q9FX71_ARATH 108 2e-22 UniRef50_B2KBT4 Glycosyl transferase family 8 n=1 Tax=Elusimicro... 
108 3e-22 UniRef50_B4WFJ6 Glycosyl transferase family 8 n=1 Tax=Synechococ... 108 4e-22 UniRef50_UPI000180D0CC PREDICTED: similar to like-glycosyltransf... 107 6e-22 UniRef50_B5RUI6 DEHA2F17138p n=2 Tax=Debaryomyces hansenii RepID... 106 1e-21 UniRef50_Q04CN3 Lipopolysaccharide biosynthesis glycosyltransfer... 106 1e-21 UniRef50_A7S9E5 Predicted protein n=1 Tax=Nematostella vectensis... 105 2e-21 UniRef50_D1HMA0 Whole genome shotgun sequence of line PN40024, s... 104 4e-21 UniRef50_A2RAV0 Catalytic activity: UDP-glucose + glycogenin <=>... 104 5e-21 UniRef50_Q2R1U9 Glycosyl transferase family 8 protein, expressed... 104 5e-21 UniRef50_O95461 Glycosyltransferase-like protein LARGE1 n=84 Tax... 104 6e-21 UniRef50_B7PBG6 Glycosyltransferase domain-containing protein, p... 103 7e-21 UniRef50_Q9LE59 Like glycosyl transferase 1 n=35 Tax=Embryophyta... 103 7e-21 UniRef50_B9FGA7 Putative uncharacterized protein n=3 Tax=Poaceae... 103 8e-21 UniRef50_Q0U987 Putative uncharacterized protein n=1 Tax=Phaeosp... 103 9e-21 UniRef50_Q9FH36 Similarity to unknown protein n=28 Tax=Embryophy... 103 9e-21 UniRef50_P91854 Protein F26H9.8, partially confirmed by transcri... 103 1e-20 UniRef50_B4QUA9 GD18236 n=2 Tax=Sophophora RepID=B4QUA9_DROSI 102 2e-20 UniRef50_A9RI23 Predicted protein n=1 Tax=Physcomitrella patens ... 102 2e-20 UniRef50_UPI0001925360 PREDICTED: similar to glycosyltransferase... 102 2e-20 UniRef50_Q9M9Y5 F4H5.13 protein n=4 Tax=rosids RepID=Q9M9Y5_ARATH 102 2e-20 UniRef50_B5ZNF8 Glycosyl transferase family 8 n=7 Tax=Rhizobium ... 102 2e-20 UniRef50_Q31QV9 Lipopolysaccharide biosynthesis proteins LPS n=2... 102 2e-20 >UniRef50_P27129 Lipopolysaccharide 1,2-glucosyltransferase n=44 Tax=Enterobacteriaceae RepID=RFAJ_ECOLI Length = 338 Score = 436 bits (1121), Expect = e-121, Method: Composition-based stats. 
 Identities = 338/338 (100%), Positives = 338/338 (100%)

Query: 1   MDSFPAIEIDKVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDF 60
           MDSFPAIEIDKVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDF
Sbjct: 1   MDSFPAIEIDKVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDF 60

Query: 61  YIIADVYNDGFFQKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLG 120
           YIIADVYNDGFFQKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLG
Sbjct: 61  YIIADVYNDGFFQKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLG 120

Query: 121 LTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFN 180
           LTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFN
Sbjct: 121 LTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFN 180

Query: 181 SGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTI 240
           SGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTI
Sbjct: 181 SGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTI 240

Query: 241 KSELKDKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRD 300
           KSELKDKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRD
Sbjct: 241 KSELKDKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRD 300

Query: 301 AKSIIEFKKRYKHLLVQHHYISGIIAGVCYLCRKYYRK 338
           AKSIIEFKKRYKHLLVQHHYISGIIAGVCYLCRKYYRK
Sbjct: 301 AKSIIEFKKRYKHLLVQHHYISGIIAGVCYLCRKYYRK 338

>UniRef50_D0KD54 Lipopolysaccharide glucosyltransferase I n=2 Tax=Pectobacterium RepID=D0KD54_PECWW
          Length = 336

 Score = 317 bits (812), Expect = 4e-85, Method: Composition-based stats.
Identities = 167/331 (50%), Positives = 224/331 (67%), Gaps = 1/331 (0%) Query: 8 EIDKVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVY 67 ID + ++ R +I + LNVAYG+D NY G GVSITSI++NN I+ F++ +D + Sbjct: 6 HIDVLSVFEKRHQSIADHDTLNVAYGIDKNYAVGCGVSITSILINNS-IDFTFHVFSDDF 64 Query: 68 NDGFFQKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLL 127 +D F +KI+ LAE+ + +I LY+IN++ L+ LPCT +WS AMYFRL AF L LL Sbjct: 65 DDDFIKKISILAEKFKTKIILYKINSEMLKTLPCTDIWSHAMYFRLLAFSHLSDKTSSLL 124 Query: 128 YLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLD 187 YLDADV+CKG + QL L VAAV++DV MQ+K+ SRL L G+YFNSGV++ + Sbjct: 125 YLDADVMCKGSLEQLHKLNTAPHVAAVIRDVPEMQKKSASRLKMAALEGEYFNSGVLFAN 184 Query: 188 LKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDK 247 L W LT+K L + +YPDQD+MN+LL G FLP+EYNTIY+IK+ELKD Sbjct: 185 LDIWNKLDLTQKIFDKLRDGEESIQYPDQDIMNILLNGNVTFLPKEYNTIYSIKNELKDS 244 Query: 248 THQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEF 307 HQ YK++I + T+LIHYTG TKPWHKWA YPS Y++ A ENSPW +DA + +E Sbjct: 245 NHQKYKEVIKDDTILIHYTGVTKPWHKWANYPSTSYFQHAQENSPWSTSDLKDADTFVEM 304 Query: 308 KKRYKHLLVQHHYISGIIAGVCYLCRKYYRK 338 KK+YKHLL + Y+SG+I+ Y KY +K Sbjct: 305 KKKYKHLLKKGKYLSGLISAFKYSLNKYIKK 335 >UniRef50_UPI00019F16C6 hypothetical protein CATC2_20202 n=1 Tax=Citrobacter youngae ATCC 29220 RepID=UPI00019F16C6 Length = 330 Score = 313 bits (802), Expect = 6e-84, Method: Composition-based stats. 
Identities = 133/326 (40%), Positives = 196/326 (60%), Gaps = 6/326 (1%) Query: 13 KAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFF 72 K +F A LN+A+GVD N++ G +S+TS++L+N+ +N+ F++ D + + Sbjct: 11 KILEFNQAPSEHKTQLNIAWGVDKNFMFGAAISMTSVLLHNKDLNIHFHLFTDYIDADYQ 70 Query: 73 QKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDAD 132 Q++AKLAEQ I++Y ++ + L+ LP WS AMYFR AF+ LG +D LLY+DAD Sbjct: 71 QRVAKLAEQFATNISIYIMDANGLKVLPSGNAWSHAMYFRFIAFEYLGEKVDSLLYIDAD 130 Query: 133 VVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWA 192 V+CKG + +L + L VAAV+ DV+ + D E YFNSGV++ +LKKW Sbjct: 131 VMCKGSLYELTQIDLGEHVAAVITDVDDSPAR------DIEKNKDYFNSGVIFANLKKWK 184 Query: 193 DAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNY 252 + A IL+ K+N +PDQDV+N+L +FL R +N IY IK ELK K Y Sbjct: 185 EQNFINSAFDILLDKNNKLSFPDQDVLNILFLKKVIFLERRFNAIYGIKQELKSKDTSKY 244 Query: 253 KKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYK 312 K+ IT T+LIHY G TKPW+ WA YPS +Y+ A ++SPW D A++ ++KK+ + Sbjct: 245 KEYITPETILIHYIGVTKPWNSWANYPSAQYFVEAWKSSPWADVPLLPARTPKQYKKKSR 304 Query: 313 HLLVQHHYISGIIAGVCYLCRKYYRK 338 H +Q Y + I+ + YL K K Sbjct: 305 HERLQGKYFASAISYIGYLWAKLKSK 330 >UniRef50_Q9R9D1 UDP-glucose:(Glucosyl) LPS alpha1,3-glucosyltransferase WaaO n=29 Tax=Enterobacteriaceae RepID=Q9R9D1_ECOLX Length = 338 Score = 313 bits (802), Expect = 7e-84, Method: Composition-based stats. 
Identities = 118/336 (35%), Positives = 187/336 (55%), Gaps = 8/336 (2%) Query: 5 PAIEIDKVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIA 64 P I+K +D R A + +VAYG+D N+L G GVSITS++L+N ++ F++ Sbjct: 8 PQEMINKTIIFDERPAAS-VASSFHVAYGIDKNFLFGCGVSITSVLLHNSDVSFVFHVFI 66 Query: 65 DVYNDGFFQKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLD 124 D + Q++A+LA+ + I ++ +N ++L+ LP T+ WS AMYFR D Sbjct: 67 DDIPEADIQRLAQLAKSYRTCIQIHLVNCERLKALPTTKNWSIAMYFRFVIADYFIDQQD 126 Query: 125 RLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVE-PMQEKAVSRLSDPELLGQYFNSGV 183 ++LYLDAD+ C+G++ L+ + L VAAVV + + L EL YFNSGV Sbjct: 127 KILYLDADIACQGNLKPLITMDLANNVAAVVTERDANWWSLRGQSLQCNELEKGYFNSGV 186 Query: 184 VYLDLKKWADAKLTEKALSILMSKD--NVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIK 241 + ++ WA ++ KA+S+L K + Y DQD++N++L G F+ +YNT +++ Sbjct: 187 LLINTLAWAQESVSAKAMSMLADKAIVSRLTYMDQDILNLILLGKVKFIDAKYNTQFSLN 246 Query: 242 SELKDKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDA 301 ELK +++ I + T+LIHY G TKPWH WA YPS + + A E SPWK++ Sbjct: 247 YELK----KSFVCPINDETVLIHYVGPTKPWHYWAGYPSAQPFIKAKEASPWKNEPLMRP 302 Query: 302 KSIIEFKKRYKHLLVQHHYISGIIAGVCYLCRKYYR 337 + + KH Q+ I+GI+ + Y K + Sbjct: 303 VNSNYARYCAKHNFKQNKPINGIMNYIYYFYLKIIK 338 >UniRef50_A8ARL6 Putative uncharacterized protein n=3 Tax=Citrobacter RepID=A8ARL6_CITK8 Length = 339 Score = 309 bits (791), Expect = 1e-82, Method: Composition-based stats. 
Identities = 127/332 (38%), Positives = 193/332 (58%), Gaps = 3/332 (0%) Query: 4 FPAIEIDKVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYII 63 F + I K + LN+AYGVD N+L G G+S+TS+++NN I++ FY++ Sbjct: 5 FENVIIQKKVIDNATHQKSKK---LNIAYGVDRNFLFGSGISMTSVLVNNPDIDIHFYVV 61 Query: 64 ADVYNDGFFQKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTL 123 D +D + + + +L + +T+ + + + LP T+ W+ AMY+R FAF+ L L Sbjct: 62 TDYVDDEYLESVERLTQMYGTTVTVLVFDNEAFRKLPSTKAWTYAMYYRYFAFEYLSREL 121 Query: 124 DRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGV 183 D +LYLDAD+VCK + +L + G AAVV D++ ++ K+ RL PEL YFNSGV Sbjct: 122 DSVLYLDADIVCKNSLRELTDIHFAGEYAAVVNDIDRVRLKSGQRLGIPELARDYFNSGV 181 Query: 184 VYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSE 243 V+ +L W + KL KA +L + Y DQD++N+L G + L R++N IY + E Sbjct: 182 VFANLHVWREKKLLSKAFEVLHERQKELLYFDQDILNILFVGHVILLRRDFNCIYGVDQE 241 Query: 244 LKDKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKS 303 LK+K Y+ ITEST+LIHY G TKPWH WA YP KY+ A + S W + S +A + Sbjct: 242 LKNKNEYRYQDFITESTVLIHYVGVTKPWHTWANYPVSKYFIEAYKKSAWAEKSLLNANT 301 Query: 304 IIEFKKRYKHLLVQHHYISGIIAGVCYLCRKY 335 +K++ +H +Q YI I + + Y+ K Sbjct: 302 AKLYKRKSRHERIQRKYIRSIFSHIMYIKNKL 333 >UniRef50_B2PV91 Putative uncharacterized protein n=2 Tax=Enterobacteriaceae RepID=B2PV91_PROST Length = 342 Score = 306 bits (784), Expect = 7e-82, Method: Composition-based stats. 
Identities = 133/331 (40%), Positives = 198/331 (59%), Gaps = 3/331 (0%) Query: 10 DKVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYND 69 +K + A+ CL+V YG D NY G GVS S+++NN F+ D + Sbjct: 7 NKYVLGEVCKADNTLLSCLDVIYGSDENYQFGAGVSAVSLLINNPTTFFRFHYFLDKVSP 66 Query: 70 GFFQKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYL 129 F +K+ +A Q Q+ +Y ++ L+ LP + VWS AMYFRL A L D LYL Sbjct: 67 DFLEKLKVIASQFQVEFHVYELDNKLLKTLPASDVWSSAMYFRLVALDYLSSDYDFALYL 126 Query: 130 DADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLK 189 DADV+C G + +L + V VV D ++ K+ +RL P L YFNSGV++++LK Sbjct: 127 DADVMCNGILDLTTNL-IKDKVCGVVADDIGVRTKSETRLHAPSLAKTYFNSGVMFVNLK 185 Query: 190 KWADAKLTEKALSILMSKD--NVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDK 247 KW + ++T++ +L +++ YKYPDQDV+N++L+ L + +NT+YT+K+EL D Sbjct: 186 KWHEKQITQQCFELLSAENAKQRYKYPDQDVLNLILREDLELLSQRFNTVYTLKNELYDS 245 Query: 248 THQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEF 307 THQ Y+++IT T+LIHYTG +KPWH WA YP+ + + AL SPW + + A +E Sbjct: 246 THQKYQQVITPETVLIHYTGVSKPWHTWANYPASQPFYKALMQSPWTTNDLKPATKFVER 305 Query: 308 KKRYKHLLVQHHYISGIIAGVCYLCRKYYRK 338 KK YKHLL Q +Y++GI++G+ Y K K Sbjct: 306 KKEYKHLLKQGNYLAGILSGIRYSFEKLMGK 336 >UniRef50_Q9ZIS6 UDP-galactose:(Glucosyl) LPS alpha1,2-galactosyltransferase WaaT n=26 Tax=Enterobacteriaceae RepID=Q9ZIS6_ECOLX Length = 331 Score = 306 bits (784), Expect = 8e-82, Method: Composition-based stats. 
Identities = 136/331 (41%), Positives = 203/331 (61%), Gaps = 5/331 (1%) Query: 9 IDKVKAWDFRLANINTSE---CLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIAD 65 +++ F N E LNV+YG+D N+L G GVSI+S+++NN IN F++ D Sbjct: 1 MNEFIKERFSYLADNKKENAPELNVSYGIDKNFLYGAGVSISSVLINNSDINFVFHVFTD 60 Query: 66 VYNDGFFQKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDR 125 +D + + + A+Q I +Y I+ LP +Q WS A YFR+ +F+ L ++ Sbjct: 61 YVDDDYLKSFNETAKQFNTSIIVYLIDPKYFADLPTSQFWSYATYFRVLSFEYLSESIST 120 Query: 126 LLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVY 185 LLYLDADVVCKG + L + AAV+ D + Q RL+ PE+ G+YFN+GV+Y Sbjct: 121 LLYLDADVVCKGSLKPLTEIIFKDEFAAVIPDNDSTQAACAKRLNIPEMNGRYFNAGVIY 180 Query: 186 LDLKKWADAKLTEKALSIL--MSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSE 243 ++LKKW +A LT L +L +K KY DQD +N+ ++L ++++TIYT+K+E Sbjct: 181 VNLKKWHEANLTPYLLKLLRGETKYGSLKYLDQDALNIAFNMNNIYLAKDFDTIYTLKNE 240 Query: 244 LKDKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKS 303 L D++H+ Y++ IT+ T+LIHYTG TKPWH WA YPS Y+ IA E SPWK ++A++ Sbjct: 241 LYDRSHRKYQQTITDKTVLIHYTGITKPWHSWAGYPSASYFNIAREQSPWKKYPLKEART 300 Query: 304 IIEFKKRYKHLLVQHHYISGIIAGVCYLCRK 334 + E +K+YKHL YI GI + + Y +K Sbjct: 301 VAEMQKQYKHLFAHGEYIKGITSLIKYKLKK 331 >UniRef50_Q9ZIT4 UDP-galactose:(Glucosyl) LPS alpha1,3-galactosyltransferase WaaI n=26 Tax=Enterobacteriaceae RepID=Q9ZIT4_ECOLX Length = 335 Score = 301 bits (772), Expect = 2e-80, Method: Composition-based stats. 
Identities = 121/327 (37%), Positives = 183/327 (55%), Gaps = 10/327 (3%) Query: 15 WDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQK 74 ++F NI + L++A+G+D N+L G GV+I SI+LNNR I+ +F++ D +D Sbjct: 14 YNFHYQNIRSKNTLDIAFGIDRNFLFGCGVAIASILLNNREISCEFHVFTDYISDKDKLY 73 Query: 75 IAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVV 134 + LA+Q RI +Y IN DKL+ LP T+ W+ A YFR +++LYLDAD+ Sbjct: 74 FSDLAKQYNSRINIYVINCDKLKSLPSTKNWTYATYFRFIIADYFYHKHEKILYLDADIA 133 Query: 135 CKGDISQLLHLGLNGAVAAVV---KDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKW 191 CKG I +LL + A V +DVE Q +A L+ P+L YFN+G + +++ +W Sbjct: 134 CKGSIKELLDYQFSTNEIAAVVAERDVEWWQNRASV-LTTPQLASGYFNAGFLLINIDEW 192 Query: 192 ADAKLTEKALSILMSKD--NVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTH 249 ++ KA+ +L D + + DQDV+NVLL G F+ +YNT Y+I ELKDK Sbjct: 193 NLNNISSKAIEMLRDPDWVSKITHLDQDVLNVLLNGKVKFISEKYNTRYSINYELKDK-- 250 Query: 250 QNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKK 309 + + T+ IHY G TKPWH+WA YP + + IA SPW + + +++ Sbjct: 251 --VDNPVNDDTVFIHYVGPTKPWHEWANYPVSRSFLIAKAASPWSKEDLLKPVNSNQYRY 308 Query: 310 RYKHLLVQHHYISGIIAGVCYLCRKYY 336 KH Q HY++GI + Y K + Sbjct: 309 CAKHKFKQKHYMAGIFNYLKYYKEKCF 335 >UniRef50_D2U322 Lipopolysaccharide 1,2-glucosyltransferase n=1 Tax=Arsenophonus nasoniae RepID=D2U322_9ENTR Length = 334 Score = 296 bits (757), Expect = 1e-78, Method: Composition-based stats. 
Identities = 127/325 (39%), Positives = 202/325 (62%), Gaps = 5/325 (1%) Query: 13 KAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFF 72 A + +L N + N+AYGVD N+L G +SI S+++NN + +F++ D +DG+ Sbjct: 11 IAGEKKLTENNKN--FNIAYGVDKNFLLGAAISINSVLINNTDTDFNFHLFTDYIDDGYI 68 Query: 73 QKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDAD 132 Q+ + + I +Y ++ +L+ L + WS A YFRL AF+ L + +LYLDAD Sbjct: 69 QRFQTMIAKYNSNIIIYLLDAAELKQLSTSDFWSYATYFRLIAFEYLSTNIHAILYLDAD 128 Query: 133 VVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWA 192 V+CKG + ++ L L + AAVV DV+ MQ+ + +RL+ +L G+YFN+GV+Y++L+KW Sbjct: 129 VICKGSLKEIFQLNLADSFAAVVLDVDSMQQSSATRLNLADLNGKYFNAGVIYVNLQKWI 188 Query: 193 DAKLTEKALSILMSKDN--VYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQ 250 + ++K+L ++ K N KY DQD +N+L + ++L R+YN IY +K+EL Sbjct: 189 ENDFSKKSLELVRGKTNFGKLKYLDQDALNILFQTQNIYLSRDYNCIYKLKNELAYHDLS 248 Query: 251 NYKKLITESTLLIHYTGATKPWHKWAI-YPSVKYYKIALENSPWKDDSPRDAKSIIEFKK 309 YK IT+ST+LIHYTG TKPWH W I YP+ +++ + +SPWKD + A+ E ++ Sbjct: 249 KYKNTITDSTILIHYTGVTKPWHTWGINYPASQFFFNSYIHSPWKDQPLKMAEKRTELQE 308 Query: 310 RYKHLLVQHHYISGIIAGVCYLCRK 334 +YKHL +QH Y+ G + + Y K Sbjct: 309 KYKHLFLQHKYMQGFLCLIKYKLLK 333 >UniRef50_D0KD53 Lipopolysaccharide 3-alpha-galactosyltransferase n=3 Tax=Enterobacteriaceae RepID=D0KD53_PECWW Length = 336 Score = 295 bits (756), Expect = 1e-78, Method: Composition-based stats. 
Identities = 116/333 (34%), Positives = 184/333 (55%), Gaps = 6/333 (1%) Query: 10 DKVKAWDFRLANINTSEC--LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVY 67 DK K + + +C L++A+G D ++ G ++I SI+L N L F++ D Sbjct: 4 DKEKVIKTVHSFSYSKKCAELDIAFGTDEKFIYGCAIAIASILLKNPDYCLSFHVFTDKL 63 Query: 68 NDGFFQKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLL 127 +DG + ++AEQ I +Y ++ L+ LP T++WS A+YFR LD++L Sbjct: 64 SDGDKARFQEMAEQYNTTINIYIVDCSWLKTLPETKLWSYAIYFRFIIADYFYKILDKVL 123 Query: 128 YLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKA-VSRLSDPELLGQYFNSGVVYL 186 YLDAD++C G + +L+ L L+ ++AVV D + K + PEL YFNSGV+ + Sbjct: 124 YLDADIICNGSLQELIKLDLSNHISAVVLDGDSNWWKNRAQKFQQPELSNGYFNSGVLLI 183 Query: 187 DLKKWADAKLTEKALSILMSKD--NVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSEL 244 ++ W A +TE ++ +L + + +PDQDV+NVLL G + + +YNT ++I EL Sbjct: 184 EVNNWHQAAVTENSMRLLTDPEMKKIITHPDQDVLNVLLAGKSCHIESKYNTQFSINYEL 243 Query: 245 KDKTHQNYKKLITESTLLIHYTGATKPWHKW-AIYPSVKYYKIALENSPWKDDSPRDAKS 303 K ++ I+ T+ IHY G TKPWHKW A Y KY+ A E+SPWK++S DA + Sbjct: 244 KYSYGESAPTPISNKTIFIHYIGPTKPWHKWAANYACTKYFLKAKEHSPWKNESLLDAVT 303 Query: 304 IIEFKKRYKHLLVQHHYISGIIAGVCYLCRKYY 336 + KH I G ++ + YL +K + Sbjct: 304 ASNMRYCAKHQFHNGEIIRGTLSFLKYLYKKAF 336 >UniRef50_Q9ZIT6 UDP-glucose:(Galactosyl) LPS alpha1,2-glucosyltransferase WaaJ n=26 Tax=Enterobacteriaceae RepID=Q9ZIT6_ECOLX Length = 339 Score = 292 bits (748), Expect = 1e-77, Method: Composition-based stats. 
Identities = 129/324 (39%), Positives = 194/324 (59%), Gaps = 2/324 (0%) Query: 13 KAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFF 72 D R ++ E NV++G+D NY G +SI SI+ NN+ F+IIAD + + Sbjct: 15 IELDKRPVKLDERETFNVSWGIDENYQVGAAISIASILENNKQNKFTFHIIADYLDKEYI 74 Query: 73 QKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDAD 132 + +++LA + Q I LY I+++ L+ LP + +W ++Y+RL +F LD LLYLDAD Sbjct: 75 ELLSQLATKYQTVIKLYLIDSEPLKALPQSNIWPVSIYYRLLSFDYFSARLDSLLYLDAD 134 Query: 133 VVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWA 192 +VCKG +++L+ L AVV DV+ MQ K+ RL + + G YFNSGV+Y++L++W Sbjct: 135 IVCKGSLNELIALEFKDEYGAVVIDVDAMQSKSAERLCNEDFNGSYFNSGVMYINLREWL 194 Query: 193 DAKLTEKALSILMSKD--NVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQ 250 +LTEK +L + KYPDQD++N++ LPR+YN IYTIKSE ++K + Sbjct: 195 KQRLTEKFFDLLSDESIIKKLKYPDQDILNLMFLHHAKILPRKYNCIYTIKSEFEEKNSE 254 Query: 251 NYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKR 310 Y + I + T+ IHYTG TKPWH WA Y S Y++ SPW++ + A E K++ Sbjct: 255 YYTRFINDDTVFIHYTGITKPWHDWANYASADYFRNIYNISPWRNIPYKKAVKKHEHKEK 314 Query: 311 YKHLLVQHHYISGIIAGVCYLCRK 334 YKHLL Q ++ G+ + Y K Sbjct: 315 YKHLLYQKKFLDGVFTAIKYNVMK 338 >UniRef50_B6XJW1 Putative uncharacterized protein n=1 Tax=Providencia alcalifaciens DSM 30120 RepID=B6XJW1_9ENTR Length = 325 Score = 292 bits (747), Expect = 2e-77, Method: Composition-based stats. 
Identities = 144/320 (45%), Positives = 202/320 (63%), Gaps = 6/320 (1%) Query: 18 RLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAK 77 L N + LN+AYGVD +L G G+S+ SI++NN I L F++ D ND F K+ K Sbjct: 9 ELGAQNGAAELNIAYGVDKGFLFGSGLSMNSIIINNSDIKLKFHLFTDYMNDEFLSKLEK 68 Query: 78 LAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKG 137 L + I +Y IN D+L+ LP + VWS A YFR F F L TL +LYLDADV CKG Sbjct: 69 LTLNENVNIDIYIINADELKKLPISHVWSYATYFRFFIFDHLCETLSSILYLDADVFCKG 128 Query: 138 DISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLT 197 + + + + NG AAV+ DV MQ V RLS P++ +YFN+GV++L+LK W K T Sbjct: 129 SLRKYIDIAFNGEYAAVIPDVPNMQISCVDRLSMPQIKDKYFNAGVIFLNLKVWDKNKFT 188 Query: 198 EKALSILMSKD--NVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKL 255 ++A +++ + KY DQD +N++ ++LPR+YN IYT+K+EL+ H+NYK Sbjct: 189 KQAFNLITNNHTGKTLKYLDQDALNIIFNCQNIYLPRDYNCIYTLKNELE---HENYKDY 245 Query: 256 ITESTLLIHYTGATKPWHKWAI-YPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHL 314 IT T LIHYTGATKPWH WA+ YP+ + +K+A E SPWK+D DAK E+++RYKH Sbjct: 246 ITSETKLIHYTGATKPWHYWAVNYPASQTFKVAFETSPWKNDELVDAKKKPEYQERYKHE 305 Query: 315 LVQHHYISGIIAGVCYLCRK 334 Q +++GI + + Y K Sbjct: 306 FNQKKFLTGISSLIKYKKFK 325 >UniRef50_D2TY85 UDP-D-galactose:(Glucosyl)lipopolysaccharide-alpha-1, 3-D-galactosyltransferase n=1 Tax=Arsenophonus nasoniae RepID=D2TY85_9ENTR Length = 343 Score = 289 bits (739), Expect = 1e-76, Method: Composition-based stats. 
Identities = 109/330 (33%), Positives = 178/330 (53%), Gaps = 8/330 (2%) Query: 12 VKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGF 71 + +++F A+ T + ++AYG D N+ G +SI S++ N+ FYI D ++ Sbjct: 19 LTSYEFSSADAKTPQ-FHIAYGADKNFSLGTAISICSMLYFNKIYTFHFYIFTDTISECD 77 Query: 72 FQKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDA 131 +K +L +IT+ I+T +L+ LP ++WS A+YFR +++LYLD+ Sbjct: 78 LKKFDELTSCYNTKITILLIDTLQLKKLPTNKLWSHAIYFRFIIANYFHNKTNKILYLDS 137 Query: 132 DVVCKGDISQLLHLGLNGAVAAVVKDVE-PMQEKAVSRLSDPELLGQYFNSGVVYLDLKK 190 D++C GDIS+L + LN + A V D + + +K L+ PE+ YFNSGV+ +D K Sbjct: 138 DIICSGDISELFDIDLNQHIIAAVADRDQYLWKKRAEMLATPEIANGYFNSGVMLIDTDK 197 Query: 191 WADAKLTEKALSILM--SKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKT 248 W K+TEK ++IL+ + + DQD +N+ L LFL +++NT ++I ELK+KT Sbjct: 198 WHKNKITEKTINILLDDKTKAKFVFYDQDALNISLVNQVLFLDKKFNTQFSINYELKNKT 257 Query: 249 HQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFK 308 I + IHY G TKPW+ W+ YPS + +NSPWK A + +++ Sbjct: 258 ----LFPIINNVKFIHYIGPTKPWNIWSEYPSTHLFMTIKKNSPWKTTPLIAASTSNQYR 313 Query: 309 KRYKHLLVQHHYISGIIAGVCYLCRKYYRK 338 KH+ + YI ++ + Y K K Sbjct: 314 YAAKHMFNKKKYIYWLLNYLYYFVNKALHK 343 >UniRef50_B6XJW2 Putative uncharacterized protein n=1 Tax=Providencia alcalifaciens DSM 30120 RepID=B6XJW2_9ENTR Length = 333 Score = 289 bits (739), Expect = 1e-76, Method: Composition-based stats. 
Identities = 116/319 (36%), Positives = 180/319 (56%), Gaps = 8/319 (2%) Query: 21 NINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADV-YNDGFFQKIAKLA 79 I+ S C +VAYG+D N+L G GVSI S++++N HI F+I D +D K A++ Sbjct: 19 EIDDSSCQHVAYGIDHNFLYGSGVSIVSLLMHNPHIQFAFHIFIDNSMSDEDIAKFAEIC 78 Query: 80 EQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDI 139 +IT+Y I+++ ++ LP T+ W+ A+YFR + +D LLYLDADVVC +I Sbjct: 79 HLYNTKITIYFIDSNNVKKLPTTKNWTHAIYFRFIIAEYFKDKIDYLLYLDADVVCNRNI 138 Query: 140 SQLLHLGLNGAVAAVVKDVE-PMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTE 198 +LL L G +AAVV + + +K L P + YFNSGV+Y++L+ W +TE Sbjct: 139 DELLSHNLLGYIAAVVPERDKAWWQKRADSLGFPSVSKGYFNSGVMYINLRTWKTNNVTE 198 Query: 199 KALSILMSKD--NVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLI 256 K++++LM + + YPDQDV+N+LL LF+ +NT +++ ELK +++ + Sbjct: 199 KSMALLMDNEVSHRLVYPDQDVLNILLTDSVLFISSIFNTQFSLNYELK----KSFDFPV 254 Query: 257 TESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHLLV 316 +T+ IHY G TKPWH+WA Y + + + A SPW++ AKS + KH + Sbjct: 255 KRTTVFIHYVGPTKPWHEWANYETAQPFLEARAVSPWRNVPLLKAKSSNHLRYCAKHNIN 314 Query: 317 QHHYISGIIAGVCYLCRKY 335 Q Y + Y K Sbjct: 315 QRKYFFAFKNYIAYFFSKI 333 >UniRef50_D2TIX6 UDP-galactose:(Glucosyl) LPS alpha-1,3-galactosyltransferase n=1 Tax=Citrobacter rodentium ICC168 RepID=D2TIX6_CITRO Length = 340 Score = 288 bits (737), Expect = 2e-76, Method: Composition-based stats. 
Identities = 112/328 (34%), Positives = 179/328 (54%), Gaps = 9/328 (2%) Query: 16 DFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKI 75 DF + L++AYGVD N+L G G+SI S++ NN L F++ D +N+ + Sbjct: 17 DFNHQDTAEKVVLDIAYGVDQNFLFGCGISIASVLKNNTDKTLHFHVFIDAFNETDRRMF 76 Query: 76 AKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVC 135 KLA Q + IT+Y IN + L+ LP T+ W+ A+YFR ++LLYLDAD++C Sbjct: 77 DKLAAQYKTHITIYLINCEHLRSLPSTKNWTYAIYFRFAIADYFIGKTNKLLYLDADIIC 136 Query: 136 KGDISQLLHLGL-NGAVAAVVKDVEP-MQEKAVSRLSDPELLGQYFNSGVVYLDLKKWAD 193 +G I +L++ + +AAVV + + EK L + YFNSG++ ++L +WA Sbjct: 137 QGGIDELVNFSFASDKIAAVVTEGKADWWEKRALSLGTEGITKGYFNSGLILINLNQWAI 196 Query: 194 AKLTEKALSILMSKD--NVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQN 251 ++ +A+ +L D +PDQDV+N+LL FL ++NT +++ +LKDK Sbjct: 197 ECISARAIKMLSDPDIVGRITHPDQDVLNILLADKLHFLDIKFNTQFSLNYQLKDK---- 252 Query: 252 YKKLITESTLLIHYTGATKPWHKWAIYPSV-KYYKIALENSPWKDDSPRDAKSIIEFKKR 310 + + T+LIHY G TKPWH WA + K + A + SPWK+ + + +F+ Sbjct: 253 FINPVNNDTILIHYIGPTKPWHSWAGDYLISKPFIDAKQASPWKNTALLKPTNSNQFRYC 312 Query: 311 YKHLLVQHHYISGIIAGVCYLCRKYYRK 338 KH+L YI G++ Y +K + Sbjct: 313 AKHMLKNKRYIKGMVGYFLYFMKKITNR 340 >UniRef50_Q9ZIS1 UDP-galactose:(Galactosyl) LPS alpha1,2-galactosyltransferase WaaW n=29 Tax=Enterobacteriaceae RepID=Q9ZIS1_ECOLX Length = 342 Score = 287 bits (735), Expect = 3e-76, Method: Composition-based stats. 
Identities = 123/321 (38%), Positives = 191/321 (59%), Gaps = 4/321 (1%) Query: 22 INTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQ 81 NT LN+AYG+D N+L G VS+ S+V++N + + F++ D ++ + Q++ + Sbjct: 18 ANTDRVLNIAYGIDRNFLFGAAVSMQSVVMHNPDLAVKFHLFTDYIDEDYLQRVNAFTSK 77 Query: 82 N-QLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDIS 140 N + + +Y++++ + P + WS A +FRL AFQ L T++ LLY+DADV+CKG ++ Sbjct: 78 NANVEVRIYKVSSAFIDIFPSLKQWSYATFFRLVAFQYLSETIENLLYIDADVICKGSLA 137 Query: 141 QLLHLGLNG-AVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEK 199 LL + +G AAV+KDV MQEK RL+ L G YFN+GVVYL L+ WA K Sbjct: 138 GLLDINFDGDKFAAVIKDVPFMQEKPAKRLAIEGLPGNYFNAGVVYLQLEAWAKNDFMNK 197 Query: 200 ALSILMS--KDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLIT 257 A+++L S + YK DQD++N+L G +F+ +Y+ Y I ELK+K+ ++YKK IT Sbjct: 198 AIAMLASDPQHTKYKCLDQDILNILFFGHCIFISGDYDCFYGIDYELKNKSDEDYKKTIT 257 Query: 258 ESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHLLVQ 317 + T LIHY G TKPW+ W YP KY+ A + S W D + A + +++ + +HL Sbjct: 258 DDTKLIHYVGVTKPWNDWTNYPCQKYFNEAYQASCWNDVAFIPATNEKQYQVKSRHLKRN 317 Query: 318 HHYISGIIAGVCYLCRKYYRK 338 + S + Y +K RK Sbjct: 318 GNIASSFYYFMLYYSKKIARK 338 >UniRef50_Q46Y64 Glycosyl transferase, family 8 n=1 Tax=Ralstonia eutropha JMP134 RepID=Q46Y64_RALEJ Length = 331 Score = 279 bits (715), Expect = 7e-74, Method: Composition-based stats. 
Identities = 73/320 (22%), Positives = 151/320 (47%), Gaps = 9/320 (2%) Query: 22 INTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQ 81 N ++A+ VD NY +G +I SI+ NN + F+++ + +++ +L E Sbjct: 17 SNGKPSFHIAFCVDDNYFRAMGATIASIIDNNPGQHFTFHVLTFSALEENQRRLKQLEEM 76 Query: 82 NQLRITLYRINT---DKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGD 138 + L+ ++ + +S +++ RL ++L DR+LYLDAD++C Sbjct: 77 YPVSTQLHLLDLASFTQFSHFLGHSHYSLSIFTRLVIPEVLQGQTDRVLYLDADILCVNR 136 Query: 139 ISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTE 198 + +L+ + ++ +A VV D + V+ L +YFN GV+++++ KW +T Sbjct: 137 LDELVDMDISNEIAVVVPDAPVTLRRRVAALGLAH--AEYFNGGVLFINIDKWLAENITP 194 Query: 199 KALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITE 258 + L L+ ++ DQD +N +L G ++ +N +Y + + D + Sbjct: 195 QTLEALLDTSTDMRFNDQDALNKVLNGRAKYISPRWNYLYDL---IHDLNVNRFAMRPVG 251 Query: 259 STLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSP-RDAKSIIEFKKRYKHLLVQ 317 + IH+ G+ KPW W+ + + ++ L SPW+D + ++ E + + + Q Sbjct: 252 KAVFIHFAGSVKPWADWSGHEARGLFRKYLALSPWRDMPLDPEPRNTKEMRMHSRFMFRQ 311 Query: 318 HHYISGIIAGVCYLCRKYYR 337 H + + + YL ++ R Sbjct: 312 HKPVESLKWYLRYLRKRAQR 331 >UniRef50_C1DGU7 Lipopolysaccharide 1,3-galactosyltransferase n=1 Tax=Azotobacter vinelandii DJ RepID=C1DGU7_AZOVD Length = 326 Score = 279 bits (714), Expect = 1e-73, Method: Composition-based stats. 
Identities = 97/322 (30%), Positives = 162/322 (50%), Gaps = 16/322 (4%) Query: 24 TSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQ 83 S+ L++A+GVD NYL +G++I SI+ NN + L F++ + ++ +L Sbjct: 8 NSDVLHIAFGVDENYLRPMGITIVSIIENNPGLELVFHVFISSISSASRVRLDRLERMFA 67 Query: 84 LRITLYRINTDKLQCLPCTQ----VWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDI 139 + L+ ++ P + S+A Y RL + L DR+LYLDAD++C GDI Sbjct: 68 RPVNLHLVDEMLDVKDPASGKGQAHISKAAYIRLLIPEALRDFTDRVLYLDADILCVGDI 127 Query: 140 SQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEK 199 S LLHL ++G AAV++D ++A + + L YFNSGV+Y+D+ +W + +T + Sbjct: 128 SGLLHLDIDGRTAAVIRDAGAESKRAGL-VKKGQTLDNYFNSGVLYIDIPRWIERAVTSR 186 Query: 200 ALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITES 259 AL + +Y DQD +N++L G F+ + +N Y + +LK + Sbjct: 187 ALEKIADPVLDLRYSDQDALNLVLDGDVRFIDKGWNHQYGLTGKLK---KGRVGMDVPSD 243 Query: 260 TLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWK------DDSPRDAKSIIEFKKRYKH 313 T +H+ G KPW W + S + + SPW + SPR+ F Y+ Sbjct: 244 TKFVHFIGPMKPWRSWNPHQSKELFLRYQALSPWAGEALDDNFSPREIYVYSRFM--YRS 301 Query: 314 LLVQHHYISGIIAGVCYLCRKY 335 + Q ++SG+I +L RK+ Sbjct: 302 MFQQGRWLSGLIWYGKFLHRKH 323 >UniRef50_D1PTN4 Putative uncharacterized protein n=1 Tax=Prevotella bergensis DSM 17361 RepID=D1PTN4_9BACT Length = 305 Score = 274 bits (701), Expect = 3e-72, Method: Composition-based stats. 
Identities = 67/279 (24%), Positives = 131/279 (46%), Gaps = 6/279 (2%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 +++ + +D NYL ++ SI+ NN+ + F++I++ + KI ++AE ++ Sbjct: 1 MDIVFNIDDNYLMQCCTTMVSILHNNKDGQISFHVISNGLTNESRLKIEQVAEAYHQQVF 60 Query: 88 LYRINTD---KLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLH 144 Y +N + + S A Y RLF +L L +++Y+D D++ G + L + Sbjct: 61 FYVVNPEAMSDYEIFDKQGHISMATYLRLFVADILPERLHKIIYMDCDLIVNGSLDGLWN 120 Query: 145 LGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSIL 204 + G A V+D+ + RL + YFN+GV+ ++L W + ++++A + Sbjct: 121 TDVEGYALAAVEDMWSGKADNYVRLG-YDAADTYFNAGVLVVNLDYWREHNVSQQAAQYV 179 Query: 205 MSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLI--TESTLL 262 K+ DQDV+N L L LP +N + + + + KL E+ ++ Sbjct: 180 ALHAGQLKFNDQDVLNGLFHDSKLLLPFRWNVQDGLLRKRRKIRPEVMPKLDQELENPVI 239 Query: 263 IHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDA 301 IH+TG KPW+ + P + ++ + W+ P Sbjct: 240 IHFTGHRKPWNFSCLNPYKNLFFKYVDMTEWRGFRPIVP 278 >UniRef50_UPI0001B4A0A4 putative glucosyltransferase n=1 Tax=Bacteroides sp. 2_1_7 RepID=UPI0001B4A0A4 Length = 301 Score = 274 bits (700), Expect = 4e-72, Method: Composition-based stats. 
Identities = 79/297 (26%), Positives = 144/297 (48%), Gaps = 4/297 (1%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 +++ D NY+ GV +TSI +NN + +I+ + + + + K+ + +I Sbjct: 1 MDIVCCTDNNYVIPCGVLVTSICVNNPKEEITVHILTEGISPENQEVLKKVVAKYGQQIQ 60 Query: 88 LYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGL 147 Y ++ P ++ + A YFRL +L +++++LYLD DVV + + L + Sbjct: 61 FYTVDKKVFANCPISRHITLATYFRLIMTDILPKSVEKVLYLDCDVVVRHSLRSLWDTDI 120 Query: 148 NGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSK 207 A V+ D+ + +RL LG YFN+GV+ ++L+ W + L+E I+ Sbjct: 121 KSYAAGVIPDMSIDDIRIYNRLQYSPSLG-YFNAGVLLVNLRYWRENNLSESFFEIINKY 179 Query: 208 DNVYKYPDQDVMNVLLKGMTLFLPREYNTI--YTIKSELKDKTHQNYKKLITESTLLIHY 265 +Y DQDV+N++LK + L LP +YN Y K L +T+++ ++ +++HY Sbjct: 180 PERLRYHDQDVLNIVLKEIKLTLPMKYNVQHGYFFKDPLISRTYRDEREQAITDPVILHY 239 Query: 266 TGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHLLVQHHYIS 322 +G+ KPW P K + L+ S R K R++ LL + I+ Sbjct: 240 SGS-KPWFIEFEPPFKKDFAFYLDTSGLDKSFIRHIPMKARIKARFRSLLEKLGLIA 295 >UniRef50_P27128 Lipopolysaccharide 1,3-galactosyltransferase n=43 Tax=Enterobacteriaceae RepID=RFAI_ECOLI Length = 339 Score = 272 bits (696), Expect = 1e-71, Method: Composition-based stats. 
Identities = 107/330 (32%), Positives = 172/330 (52%), Gaps = 9/330 (2%) Query: 13 KAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFF 72 D+ + CL++AYG D N+L G G+SI SI+ N L F+I D + D Sbjct: 14 SVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDDR 73 Query: 73 QKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDAD 132 + LA Q + RI +Y IN D+L+ LP T+ W+ A+YFR ++LYLDAD Sbjct: 74 KYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDAD 133 Query: 133 VVCKGDISQLLHLGL-NGAVAAVVKDVEP-MQEKAVSRLSDPELLGQYFNSGVVYLDLKK 190 ++C+G I L++ + VA VV + + EK L + YFNSG + ++ + Sbjct: 134 IICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTAQ 193 Query: 191 WADAKLTEKALSILMSKD--NVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKT 248 WA +++ +A+++L + +PDQDV+N+LL +F +YNT +++ +LK+ Sbjct: 194 WAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKE-- 251 Query: 249 HQNYKKLITESTLLIHYTGATKPWHKWAI-YPSVKYYKIALENSPWKDDSPRDAKSIIEF 307 ++ +T T+ IHY G TKPWH WA YP + + A SPWK+ + + + Sbjct: 252 --SFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQL 309 Query: 308 KKRYKHLLVQHHYISGIIAGVCYLCRKYYR 337 + KH+L +H Y+ G + Y K Sbjct: 310 RYSAKHMLKKHRYLKGFSNYLFYFIEKIKH 339 >UniRef50_UPI000197C525 UDP-D-galactose:(glucosyl) lipopolysaccharide-alpha-1,3-D-galactosyltransferase n=1 Tax=Providencia rettgeri DSM 1131 RepID=UPI000197C525 Length = 339 Score = 267 bits (684), Expect = 3e-70, Method: Composition-based stats. 
Identities = 111/316 (35%), Positives = 168/316 (53%), Gaps = 9/316 (2%) Query: 24 TSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQ 83 + + NVAYG D N+L G GVSI S++LNN+ IN F++ D +D Q +++++Q + Sbjct: 24 SKKKFNVAYGADKNFLFGTGVSIVSVLLNNKDINFHFHVFTDFLSDKDIQLFSQISKQYK 83 Query: 84 LRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLL 143 +TL+ +N D L+ LP QVWS A+YFRL D++LYLD+DVVC G I L Sbjct: 84 TSVTLHTLNMDILKKLPTNQVWSHAIYFRLIIADYFYKKCDKVLYLDSDVVCTGSIQILK 143 Query: 144 HLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQ-YFNSGVVYLDLKKWADAKLTEKALS 202 L L+ A V D+ ++ L + E + + YFNSGV+ ++ +W +LTEK++S Sbjct: 144 SLNLSSMPIAAVMDISEPHSVEMANLFNVEGIKKGYFNSGVMLINPDEWNYRQLTEKSMS 203 Query: 203 ILMSKD--NVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITEST 260 + K V KY DQD +N+ + G L L +N + K K + + + Sbjct: 204 VFTDKKLQPVIKYYDQDAINIAVHGDWLKLDNIFNHRINLNDRYKHKKNNDI-----SNA 258 Query: 261 LLIHYTGATKPWHKWAIYPS-VKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHLLVQHH 319 + +H+ G+TKPWH W+ Y V+ + A E SPWKD ++I K KH + Sbjct: 259 VFVHFIGSTKPWHNWSKYYHEVRCFLNAKEKSPWKDIDLMTPQNITHHKYASKHFRYKEK 318 Query: 320 YISGIIAGVCYLCRKY 335 Y+S V Y K Sbjct: 319 YLSSFYHYVLYTILKI 334 >UniRef50_C7IBC1 Glycosyl transferase family 8 n=2 Tax=Clostridium papyrosolvens DSM 2782 RepID=C7IBC1_9CLOT Length = 452 Score = 262 bits (670), Expect = 1e-68, Method: Composition-based stats. 
Identities = 80/329 (24%), Positives = 139/329 (42%), Gaps = 18/329 (5%) Query: 26 ECLNVAYGVDANYLDGVGVSITSIVLNNR-HINLDFYIIADVYNDGFFQKIAKLAEQNQL 84 E + + D++Y+ +GV ITS++ N +L+FY+I D + + Sbjct: 2 ETVKIVSACDSHYVQHLGVMITSLLENTSMKTSLEFYVIDGGITDADKELLCSCTCLYGC 61 Query: 85 RITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLH 144 +I I D + S A YFR+F +LL ++++++YLD D+V DI++L Sbjct: 62 KINFITIQADFYARFGESPSASDATYFRIFVSELLDTSVEKVIYLDCDIVVIKDIAELWK 121 Query: 145 LGLNGAVAAVVKDVE----PMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKA 200 ++ A V D + R + YFN+GV+ ++L KW + +++ Sbjct: 122 TDVSEYFLAAVADCGVEYSGEYAVTLKRKLGMKRKDCYFNAGVLLINLVKWREESISKSI 181 Query: 201 LSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTI--KSELKDKTHQNYKKLITE 258 L + DQD +N +L L L +N E + +N + + E Sbjct: 182 CKFLFENKGKIDFADQDGLNAVLCNRWLPLDSRWNQQVAHCEFYEQEKVVWENVTRAVRE 241 Query: 259 STLLIHYTGA----TKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKS-----IIEFKK 309 +IHYT + TKPW+ ++P + Y L +PWK P D I ++ Sbjct: 242 -PWIIHYTTSYFSGTKPWNYLDMHPYRQEYYRYLHMTPWKSFIPPDRTIWNILLKIIYEA 300 Query: 310 RYKHLLVQHHYISGIIAGVCYLCRKYYRK 338 LL+ + Y I Y + +K Sbjct: 301 YAGRLLINY-YRRSIKPTYRYETARLPKK 328 >UniRef50_A1BHG0 Glycosyl transferase, family 8 n=2 Tax=Chlorobium/Pelodictyon group RepID=A1BHG0_CHLPD Length = 307 Score = 257 bits (657), Expect = 4e-67, Method: Composition-based stats. 
Identities = 63/293 (21%), Positives = 136/293 (46%), Gaps = 11/293 (3%) Query: 21 NINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAE 80 ++ +N+ + D NY+ + ++ S++ NN+ ++ YII+ ++ ++ I ++ + Sbjct: 1 MLHMKNTVNIVFATDKNYIQHLSAALVSLLENNKDLSFTVYIISSGMSEKSYRNIEEIIK 60 Query: 81 QNQLRITLYRINTDKLQCLPCTQ-VWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDI 139 + ++ + L + + Y+RL L+ +++LYLD+D++ G I Sbjct: 61 TGNCTVKHITVSDELFVKLATAHPFYPKGTYYRLLIPDLIDE--EKILYLDSDIIVNGSI 118 Query: 140 SQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEK 199 +L + + ++D + R + YFNSG++ ++L KW L +K Sbjct: 119 KELYNQDVEDYFVCAIEDPGFDRH----RQLQMDKESIYFNSGMMLINLAKWKSTGLQKK 174 Query: 200 ALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYK----KL 255 + + + +PDQ +N ++ G +P +YN +I S+ +K + Sbjct: 175 VIDFIEHNPDAIWFPDQCGLNSVINGRWKKVPLKYNQQSSIFSDDFEKKFDCFSVEELAE 234 Query: 256 ITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFK 308 ++ ++IHYTG +KPWH +P K Y L+ +P+++ D + K Sbjct: 235 AKKNPVIIHYTGGSKPWHFKNRHPYKKLYWKYLKMTPYRNAIYSDLMPMYLLK 287 >UniRef50_Q64ZV2 Lipopolysaccharide 1,2-glucosyltransferase n=8 Tax=Bacteroides RepID=Q64ZV2_BACFR Length = 311 Score = 257 bits (656), Expect = 5e-67, Method: Composition-based stats. 
Identities = 81/313 (25%), Positives = 147/313 (46%), Gaps = 12/313 (3%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 +++A +D+N+ V++TS+ NNR+ +IIA + + ++ +AE +I Sbjct: 2 IHIACNIDSNFTIHCAVTLTSLFANNRNSEFCVHIIASTLPEADQKALSSIAESYGNKIC 61 Query: 88 LYRINTDKLQCLPCTQ---VWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLH 144 Y D L + S A Y+R ++L + +D++LY+D D+V DIS+ Sbjct: 62 FYFPEKDLLNNFSIKKSGNRISIATYYRCLLSRILPVNIDKILYIDCDIVVLNDISEFWD 121 Query: 145 LGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSIL 204 + ++D+ +E+ SRL + YFN+GV+ ++LK W + K+ E Sbjct: 122 TDITQYAIGCIEDIGSDEEEYYSRLQ-YDKKYSYFNAGVLLINLKYWREHKIDEMCEQYF 180 Query: 205 MSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYK--KLITESTLL 262 ++ + ++ DQD++N LL LF+P +N T + + K + Sbjct: 181 LAHSDRIRFNDQDLLNALLYKDKLFVPFRWNVQDTFYRRTYSHKVKEHSGLKEALLHPAI 240 Query: 263 IHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHLLVQHHYIS 322 +HYT KPW+ +++P + Y L+ +PWK P II+F+ R + YI+ Sbjct: 241 LHYTNK-KPWNYDSMHPLKQEYFKYLDMTPWKGTRP-----IIDFQTRVITGFKRLLYIT 294 Query: 323 GIIAGVCYLCRKY 335 GI + Y Sbjct: 295 GIKKSKYINLKDY 307 >UniRef50_C7QL87 Glycosyl transferase family 8 n=2 Tax=Cyanothece RepID=C7QL87_CYAP0 Length = 283 Score = 253 bits (646), Expect = 8e-66, Method: Composition-based stats. 
Identities = 74/293 (25%), Positives = 136/293 (46%), Gaps = 12/293 (4%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 +++ + D NY GV+ITS++LNN + +++ + F +KI KL + Q + Sbjct: 1 MDILFCFDKNYEQHFGVAITSLILNNTNKIKTIHLVTKDNSKDFLKKIDKLKSKTQAKFF 60 Query: 88 LYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGL 147 +Y + L + + S A Y+RL A +LL L ++LYLD+D+V + L ++ + Sbjct: 61 IYSPDDKDLSNVKVSAHISTAAYYRLLAPELLPQDLKKILYLDSDLVVNSSLENLYNMDI 120 Query: 148 NGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSK 207 + + A + ++ YFNSGV+ ++L+ W + K L Sbjct: 121 SDDILAA---YAGGKMGPGTKKRLQLTGDFYFNSGVMLINLEAWRTENIGNKCFKFLQEN 177 Query: 208 DNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTG 267 ++ + DQD +N ++ G L + +N++ + + + +T +++IH+TG Sbjct: 178 PDMIRLWDQDALNKIVDGKFLNIDGIWNSLVDLTTG---------ETRVTNQSIIIHFTG 228 Query: 268 ATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHLLVQHHY 320 KPW W I P + Y L SPW + P+ K+ E K + Q Sbjct: 229 TLKPWQSWCIRPEKQIYWYYLRQSPWSNAYPQFPKNFQEMLLAIKSVYKQIKP 281 >UniRef50_Q5LF36 Putative glucosyltransferase n=1 Tax=Bacteroides fragilis NCTC 9343 RepID=Q5LF36_BACFN Length = 308 Score = 250 bits (640), Expect = 4e-65, Method: Composition-based stats. 
Identities = 65/282 (23%), Positives = 132/282 (46%), Gaps = 5/282 (1%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 +++ + +D +Y+ GV+ITS+ +NN + + F+I+ + + + K+ ++ + +I Sbjct: 1 MDIVHCIDNSYVAQCGVTITSVCVNNVNEVILFHILTTNLSIFNREMLKKIVDKYRQKII 60 Query: 88 LYRINTDKLQCLPCT--QVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHL 145 Y ++ L P S A YFR+ +L +L+++LYLD D+V +I +L Sbjct: 61 FYNVDEYLLNKCPLREGDHVSLATYFRILMPDILPKSLNKVLYLDCDLVVCKNIKRLWDT 120 Query: 146 GLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILM 205 ++ V D + +RL ++ YFN+GV+ ++L W + ++ K L + Sbjct: 121 DISTHSLGAVYDGGTDDIRTYNRLK-YDIRQGYFNAGVLLVNLAYWREFHISNKLLKFIE 179 Query: 206 SKDNVYKYPDQDVMNVLLKGMTLFLPREYNTI--YTIKSELKDKTHQNYKKLITESTLLI 263 + DQD +N +L T LP +YN + + K + + + ++ Sbjct: 180 QYPERLMFWDQDALNSVLIQTTKILPFKYNMLDAFYTKELALREEYLFEIEGALCDPTIL 239 Query: 264 HYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSII 305 H++ KPW K +P ++ L+ + W D P ++ Sbjct: 240 HFSSPNKPWLKTCDHPLKSFFFEYLKRTSWNDKFPIYPFNMS 281 >UniRef50_A7VNX5 Putative uncharacterized protein n=1 Tax=Clostridium leptum DSM 753 RepID=A7VNX5_9CLOT Length = 344 Score = 248 bits (634), Expect = 2e-64, Method: Composition-based stats. 
Identities = 73/322 (22%), Positives = 146/322 (45%), Gaps = 23/322 (7%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHIN-LDFYIIADVYNDGFFQKIAKLAEQNQLRI 86 +N + D N+ D +G ++ S+ NNR ++ YI+ ++G +K+ + +Q + + Sbjct: 13 MNCVFSSDDNFADILGCALISLFENNREQETIEVYILDGGISEGNKRKLESIFQQYERMV 72 Query: 87 TLYRI-NTDKLQCLPCTQ-VWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLH 144 + + +L T W + + R+ LL + R+LYLD D++ G + L Sbjct: 73 HFIEVPDISQLTGEAVTSGRWPISTFARILIDSLLPKEVKRVLYLDCDILVLGSLKNLWE 132 Query: 145 LGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSIL 204 + L AA V D Q K + ++ + Y N+GV+ +D+ KW + ++ ++ ++ + Sbjct: 133 IDLKDKTAAGVMDCLSNQRKQNAGINGED---SYINAGVMLIDMDKWRENQIEKQCMNYI 189 Query: 205 MSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTI-----YTIKSELKDKTHQNYKKL---- 255 + Y DQ V+N +L L LP EYN + +T +K + Q+Y Sbjct: 190 RICNGQVAYNDQGVINKVLHKDLLVLPPEYNAMTLFFDFTYPDMIKYRKPQSYYSAQQVD 249 Query: 256 -ITESTLLIHYTGAT---KPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSI---IEFK 308 + ++H+T + +PW K + +P ++ + SPW+ R ++ Sbjct: 250 HARKHPRIVHFTSSFLSLRPWVKGSEHPYAPLWRNYYKRSPWRAKDLRSDNRSSYRKIYE 309 Query: 309 KRYKHLLVQHHY-ISGIIAGVC 329 K Y+ + + +SG + V Sbjct: 310 KFYRLMPLPFSVSLSGFLHSVL 331 >UniRef50_C3X7M2 Lipopolysaccharide 3-alpha-galactosyltransferase n=1 Tax=Oxalobacter formigenes OXCC13 RepID=C3X7M2_OXAFO Length = 307 Score = 248 bits (633), Expect = 3e-64, Method: Composition-based stats. 
Identities = 87/316 (27%), Positives = 155/316 (49%), Gaps = 14/316 (4%) Query: 24 TSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQ 83 ++A+GVD Y + V+I SI+ NN++ N+ F++I + +D +I K Q Sbjct: 1 MKNEFHIAFGVDTIYAPKMCVTIASILENNKNSNIIFHVIYNDLSDKVIDEIKKSMLTLQ 60 Query: 84 LRITLYRINTDK--LQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQ 141 I + I+ D + + R F +LL DR LYLDAD++C +IS Sbjct: 61 AEINFHFIDVDLSIFPKFSNFSHITSGAFLRFFIPELLQGLTDRALYLDADIICINNISD 120 Query: 142 LLHLGLNGA-VAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKA 200 L HL ++ + AVV+D++ + +YFNSGV+ +D++KW + + Sbjct: 121 LFHLEMDENEILAVVEDIDSETYLN----ENASFQKRYFNSGVLMMDIEKWNKNNVYGQL 176 Query: 201 LSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITEST 260 LS+L K + + DQD +N+++ +L +N Y I +E DK + Y + E+ Sbjct: 177 LSVLNEKGSGFNLIDQDALNLVMIDKVHYLDNIWN--YMINAEQLDKKKEKYS--VPENA 232 Query: 261 LLIHYTGATKPWHKWAIYPSV-KYYKIALENSPWKDDSPRDAKSIIEFKKRYKHLLVQHH 319 IH+ G KPWH + I+ + Y + + W D K+ E ++ ++ + + Sbjct: 233 KFIHFVGPVKPWHCYNIFDDITGLYLNYQKKTVW--DGLEMPKNYKEMRRYARYSFKKGN 290 Query: 320 YISGIIAGVCYLCRKY 335 Y++G+ G+ Y+ K+ Sbjct: 291 YLTGLNWGMRYIKTKF 306 >UniRef50_P25148 General stress protein A n=3 Tax=Bacillus subtilis group RepID=GSPA_BACSU Length = 286 Score = 247 bits (632), Expect = 3e-64, Method: Composition-based stats. 
Identities = 58/278 (20%), Positives = 120/278 (43%), Gaps = 10/278 (3%) Query: 24 TSECLNVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKIAKLAEQN 82 E +++ D NY +G S++ N ++ + Y+I +++ + + Sbjct: 3 KDEIMHIVSCADDNYARHLGGMFVSLLTNMDQEREVKLYVIDGGIKPDNKKRLEETTLKF 62 Query: 83 QLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLL-GLTLDRLLYLDADVVCKGDISQ 141 + I ++T+ + + ++A Y+R+ L+ ++ R++Y+D D + DIS+ Sbjct: 63 GVPIEFLEVDTNMYEHAVESSHITKAAYYRISIPDLIKDESIKRMIYIDCDALVLEDISK 122 Query: 142 LLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKAL 201 L L + A V+D + ++D G+YFNSG++ +D + W +TEK + Sbjct: 123 LWDLDIAPYTVAAVEDAGQHERLKEMNVTD---TGKYFNSGIMIIDFESWRKQNITEKVI 179 Query: 202 SILMSKDNV--YKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKT---HQNYKKLI 256 + + + DQD +N +L L +N I +LK + + Sbjct: 180 NFINEHPDEDFLVLHDQDALNAILYDQWYELHPRWNAQTYIMLKLKTPSTLLGRKQYNET 239 Query: 257 TESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWK 294 E+ ++H+ G KPW+ +P Y + + W Sbjct: 240 RENPAIVHFCGGEKPWNSNTKHPYRDEYFHYMSYTKWN 277 >UniRef50_B3Q568 Putative glycosyltransferase protein n=4 Tax=Rhizobium etli RepID=B3Q568_RHIE6 Length = 331 Score = 244 bits (624), Expect = 3e-63, Method: Composition-based stats. 
Identities = 69/308 (22%), Positives = 127/308 (41%), Gaps = 16/308 (5%) Query: 29 NVAYGVDANYLDGVGVSITSIVLNNRHI-NLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 + + VDA Y + ++ S+ NN+ + LD ++I + + + I + N I Sbjct: 21 PIVFAVDAAYAVPLATALRSVAENNQSVWPLDIHVIHEGIGEETKRLILESLPANSAIIQ 80 Query: 88 LYRINTDKLQCLPCTQVW-SRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLG 146 + I T T+ S+ + R+ Q L T DR LYLD D++ + QL + Sbjct: 81 WHPIATLSFASGFSTRPGVSKMTFARILLPQFLPQTCDRALYLDGDILVLTSLEQLWNTD 140 Query: 147 LNGAVAAVVKDVEPMQEK-AVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILM 205 L AV V D + L+ +YFN+G++ +DL KW + +++E++L L Sbjct: 141 LGEAVIGAVPDYWLDNPAGSGPGARGGALVKRYFNAGILLIDLAKWRNERISERSLDYL- 199 Query: 206 SKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHY 265 + +Y DQD +NV G L R +N + + + + + ++H+ Sbjct: 200 DRFPTTEYSDQDALNVACDGKWKILDRAWNFQFEPRQAIAG-------IALEQKAAIVHF 252 Query: 266 TGATKPWHKWAIYPSVKYY-----KIALENSPWKDDSPRDAKSIIEFKKRYKHLLVQHHY 320 KPW ++ P+V +Y + +PW ++ R L Y Sbjct: 253 VTNVKPWKSGSLSPNVAFYDAFRSRTCFALTPWGRVRSGLKRTGSRLLARSALLRTAWSY 312 Query: 321 ISGIIAGV 328 + + Sbjct: 313 TKSAVRAI 320 >UniRef50_D2RIJ4 Glycosyl transferase family 8 n=2 Tax=Acidaminococcus RepID=D2RIJ4_ACIFE Length = 309 Score = 244 bits (623), Expect = 4e-63, Method: Composition-based stats. 
Identities = 64/287 (22%), Positives = 116/287 (40%), Gaps = 12/287 (4%) Query: 26 ECLNVAYGVDANYLDGVGVSITSIVLNNR-HINLDFYIIADVYNDGFFQKIAKLAEQNQL 84 + +++ D NY V+ SI+ N+R + FY D ++ IA Q Sbjct: 2 DEISIVLASDDNYAQHGAVACASILANHRGERPIHFYYFDDGISEEKQAGIAATVTGLQG 61 Query: 85 RITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLH 144 IT ++Q + +RA Y RL +L+ + R++YLD D+V DI +L Sbjct: 62 SITFIPTAGKEIQA-HTSGHVNRAAYLRLLIPELVPQAVHRVIYLDTDLVVLDDIQELWE 120 Query: 145 LGLNGAVAAVVKDVEPMQEKAVSRLSDP----ELLGQYFNSGVVYLDLKKWADAKLTEKA 200 + L G V D+ + + R + + YFNSGV+ ++L+ W + + ++ Sbjct: 121 MDLQGKPVGAVPDLGILASSRMRRQKEETLGIQEGKLYFNSGVMVMELEAWREKQYGDQV 180 Query: 201 LSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSE----LKDKTHQNYKKLI 256 + + +++ DQD +N + + LP +N I + + LK +N Sbjct: 181 IRCVEE--GNFRHHDQDGLNKVFQDNWQPLPLRWNVIPPVFTLPVKVLKKSRWRNLALEA 238 Query: 257 TESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKS 303 E + H+ G KPW + Y L + + Sbjct: 239 LERPAVFHWAGRYKPWEFPPKGHFNEKYYTYLARTAFAGAKMPQPGK 285 >UniRef50_C2HBB9 Family 8 glycosyltransferase n=35 Tax=Lactobacillales RepID=C2HBB9_ENTFC Length = 305 Score = 243 bits (620), Expect = 7e-63, Method: Composition-based stats. 
Identities = 72/272 (26%), Positives = 121/272 (44%), Gaps = 10/272 (3%) Query: 27 CLNVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKIAKLAEQN--Q 83 + V D NY + V I + + N N+ + FY+I D ++ Q + + + Sbjct: 27 VVPVVTASDENYAPYLSVMIATALENCNKARRIKFYVIDDGLSEYSKQGLEETVNKYSSN 86 Query: 84 LRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLL-GLTLDRLLYLDADVVCKGDISQL 142 I + D + + + Y R+ LL ++LYLD+DV+ DI +L Sbjct: 87 ASIQFLTVEKDIYEDFLVSDHITTTAYLRISLPNLLAKEDYKKVLYLDSDVLVLDDIVKL 146 Query: 143 LHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALS 202 LNG + D P Q KA+ RL + YFNSGV+ +D+ +W ++TEK + Sbjct: 147 YDEPLNGKTIGAIID--PGQVKALERLGI-DSDDLYFNSGVMVIDIDQWNKKEITEKTIH 203 Query: 203 ILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLI---TES 259 L + Y DQD +N +L L ++N ++ E ++ Y++L E Sbjct: 204 YLSENGDRIIYHDQDALNAVLYEDWEQLHPKWNMQTSLIFERHPAPNEKYERLYKEGNEK 263 Query: 260 TLLIHYTGATKPWHKWAIYPSVKYYKIALENS 291 ++H+TG KPW+ +P Y L +S Sbjct: 264 PSIVHFTGHDKPWNTLKDHPYTNLYLKKLAHS 295 >UniRef50_D1P7H1 Lipopolysaccharide 1,2-glucosyltransferase n=1 Tax=Providencia rustigianii DSM 4541 RepID=D1P7H1_9ENTR Length = 324 Score = 243 bits (620), Expect = 9e-63, Method: Composition-based stats. 
Identities = 100/318 (31%), Positives = 167/318 (52%), Gaps = 15/318 (4%) Query: 23 NTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQN 82 + + ++ YGVD +L GVG SI S++LNN+ + F+I D D + + Sbjct: 19 HENSYFHIGYGVDEKFLYGVGTSIASVMLNNKDTDFHFHIFVDNLPDENL--FREAVQGT 76 Query: 83 QLRITLYRINTDKLQCLPC-TQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQ 141 +IT+Y I+ +K + LP ++ WS A+YFRL L ++D LLYLDAD++CKGD+S+ Sbjct: 77 SHKITIYFIDNEKFKLLPLPSKAWSHAIYFRLLIISYLSSSIDSLLYLDADIICKGDLSE 136 Query: 142 LLHLGLNGA-VAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKA 200 L L + VKD +++ + P + +YFNSG +Y+ LK A + + Sbjct: 137 LKALTFDEKTFVYAVKDKFCSEKQNL-----PIDMSKYFNSGFLYMSLKHLAQENIPNRV 191 Query: 201 LSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITEST 260 + ++ D + +PDQD +NVLL + + YN ++++ + K H I +S Sbjct: 192 IELVEKND--FSHPDQDALNVLLNDKLINISENYNYMFSLDWYITSKGH---LAKIPDSV 246 Query: 261 LLIHYTGATKPWHKW-AIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHLLVQHH 319 + IH+ G TKP+H+W + Y KY + A +NSPWK+ + + ++ HL Sbjct: 247 VFIHFVGLTKPFHEWASFYEEYKYLESARKNSPWKNIPLLKPEGYKQLSRKKSHLRKNGK 306 Query: 320 YISGIIAGVCYLCRKYYR 337 Y+ I + YL +K + Sbjct: 307 YVEFIFTTIQYLMKKTFH 324 >UniRef50_C2HBB8 Family 8 glycosyltransferase n=13 Tax=Lactobacillales RepID=C2HBB8_ENTFC Length = 300 Score = 242 bits (619), Expect = 1e-62, Method: Composition-based stats. 
Identities = 66/295 (22%), Positives = 129/295 (43%), Gaps = 11/295 (3%) Query: 24 TSECLNVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKIAKLAEQ- 81 + + V + ++ + S++ N N + FY+I D + Q + + Sbjct: 2 NKKEIAVVASCNTKFVPHLAALFVSVLDNCNPSKFVRFYVIDDDIDFESKQLLRFSVKNA 61 Query: 82 -NQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLL-GLTLDRLLYLDADVVCKGDI 139 + +IN + + + Y+R+ +L G ++R+LY+D D++ DI Sbjct: 62 RMNSDVEFLKINKEFFTNVVISDRIPETAYYRIAIPELFRGTEVERILYMDCDMIALQDI 121 Query: 140 SQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEK 199 S+L L ++ A V+D Q + ++ P +YFNSG++ +++KKW D +T+K Sbjct: 122 SKLWRLDFGDSIVAAVEDAGFHQR--LEKMEIPAKSMRYFNSGLMLINVKKWLDENITQK 179 Query: 200 ALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKT---HQNYKKLI 256 L + ++ DQD +N +L L L +N I ++ K + + Sbjct: 180 VLDFIEHNPEKLRFHDQDALNAILHDRWLPLHPRWNAQGYIMAKAKKHPTAAGEREYEET 239 Query: 257 TESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKD--DSPRDAKSIIEFKK 309 + +IH++G KPW K P+ KYY+ + ++ P+ K +K Sbjct: 240 RNNPYIIHFSGHVKPWSKDFEGPTKKYYEKYAGMTAFRCVAKFPKYPKYAKIQQK 294 >UniRef50_Q38VK7 Putative bifunctional glycosyl transferase, family 8 n=2 Tax=Lactobacillus sakei subsp. sakei 23K RepID=Q38VK7_LACSS Length = 569 Score = 240 bits (613), Expect = 6e-62, Method: Composition-based stats. 
Identities = 58/274 (21%), Positives = 122/274 (44%), Gaps = 7/274 (2%) Query: 21 NINTSECLNVAYGVDANYLDGVGVSITSIVLNNRH-INLDFYIIADVYNDGFFQKIAKLA 79 + + +N+ ++N+++ + + SI+ NN + F++++D + ++ Sbjct: 278 PADKRDQINIVSAANSNFVEPLAILYASILNNNDDDRHYAFFVLSDQLTARDQATLRQIT 337 Query: 80 EQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDI 139 E +T ++ L + + Y+RL LL ++R+LYLD D +C ++ Sbjct: 338 ESFNAELTFIEVDEIPLTAVIQDGQVLKTAYYRLLIPNLLP-EIERVLYLDCDTLCLENL 396 Query: 140 SQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEK 199 ++L + L A V+D A + + +YFN+GV+ ++L W K+TE+ Sbjct: 397 ARLWDVELGNIPVAAVEDAGFHNRLAQMAIDYKSI--RYFNAGVLLMNLTIWRQQKITEQ 454 Query: 200 ALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLIT-- 257 L+ + ++ DQD +N +L + L ++N +I + + + Sbjct: 455 ILTFIKEYPQKLRFHDQDALNAILHDRWIHLHPKWNVQTSILMDFIVAPTERINRQFLSA 514 Query: 258 -ESTLLIHYTGATKPWHKWAIYPSVKYYKIALEN 290 + LIH+ G+ KPW K + +P Y+ Sbjct: 515 QKEPGLIHFCGSEKPWDKSSTHPYTPQYRFYKSR 548 Score = 217 bits (552), Expect = 5e-55, Method: Composition-based stats. 
Identities = 60/272 (22%), Positives = 121/272 (44%), Gaps = 6/272 (2%) Query: 26 ECLNVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKIAKLAEQNQL 84 + + + D Y D + +++ SI + +D ++++ + + +L Sbjct: 8 KTIAIMVAADEQYADQMLLTLKSIREHCTLETAIDLFVLSSDLSHATKSAVNRLMTL-PH 66 Query: 85 RITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQ-LLGLTLDRLLYLDADVVCKGDISQLL 143 ++ IN +++ P + + Y+R+ A Q LL ++R+LYLD D + + D++ L Sbjct: 67 HVSFIAINPRRIKNFPGNNHFDQTAYYRILAPQILLARHIERVLYLDLDTLIRTDLTPLY 126 Query: 144 HLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSI 203 L G + V D + + YFN+GV+ +D W +++K L++ Sbjct: 127 DSDLEGNIIGAVIDPGKALTLKRLGVPKSQANNIYFNAGVLIIDTILWETHHISQKILAM 186 Query: 204 LMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITE---ST 260 L+ QD +NV+L G T L ++N I + + + Y +L + + Sbjct: 187 LVPYPGRRVNDIQDALNVVLAGRTKLLAPKWNVQNAILFKTYEPINNEYSQLFKQAIMAP 246 Query: 261 LLIHYTGATKPWHKWAIYPSVKYYKIALENSP 292 +IH+T KPW + +P + Y++ L P Sbjct: 247 KIIHFTTEKKPWEVFLEHPYMSEYQVYLSQLP 278 >UniRef50_B7AIV0 Putative uncharacterized protein n=1 Tax=Bacteroides eggerthii DSM 20697 RepID=B7AIV0_9BACE Length = 321 Score = 239 bits (611), Expect = 1e-61, Method: Composition-based stats. 
Identities = 63/287 (21%), Positives = 126/287 (43%), Gaps = 6/287 (2%) Query: 15 WDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQK 74 ++ ++ +++ ++ Y +I SI +NN++ + ++I D + + Sbjct: 2 YNTTNIPSIKTKAIHIVVCINDAYSQHCAATIASIFINNKNEVIKIHVITDYISKKNQSR 61 Query: 75 IAKLAEQNQLRITLYRINTDKLQCLPCTQ-----VWSRAMYFRLFAFQLLGLTLDRLLYL 129 + K+A +I Y N L PC + + Y+RLF Q+L L + + YL Sbjct: 62 LEKIAFNFNQQIQFYTFNNSTLNRWPCFKDGMPPHVTIQTYYRLFIPQILPLNIKKTFYL 121 Query: 130 DADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLK 189 D D++ + + + + A + D +A +RL +YFN+GV+ L+L+ Sbjct: 122 DCDLLVLHPLREFWNTKMQNKGVAAIADQWTDYIEAATRLKYRNDR-EYFNAGVLLLNLE 180 Query: 190 KWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTH 249 + T A+ + N Y DQDV+N L+ + +P ++N ++ + Sbjct: 181 YLRNHNFTNNAIDFVTKHANDIVYHDQDVLNKLIGENRIIMPVKWNVCSFKINDKIPHIY 240 Query: 250 QNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDD 296 + +IH+ KPW++ + +P YY L+ +PWK + Sbjct: 241 NATMNDARKDPYIIHFFAPIKPWNQDSSHPYRSYYYYFLQFTPWKHE 287 >UniRef50_C5ELK9 Glycosyl transferase n=3 Tax=Clostridiales RepID=C5ELK9_9FIRM Length = 333 Score = 238 bits (607), Expect = 2e-61, Method: Composition-based stats. 
Identities = 77/314 (24%), Positives = 129/314 (41%), Gaps = 21/314 (6%) Query: 24 TSECLNVAYGVDANYLDGVGVSITSIVLNNRHI-NLDFYIIADVYNDGFFQKIAKLAEQN 82 E N+ Y + Y + S+ S++ NNR++ N+D YI++ + +++A +AE Sbjct: 4 NEETANIIYASNDGYAGHLAASMYSLLDNNRNVRNMDIYILSAQMCQEYKERLAGMAEAF 63 Query: 83 QLRITLYRINT--DKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDIS 140 + + + + T+ + + RLFA Q+L T+ + LYLD D + I Sbjct: 64 HRTLHVVELGDLKQRFDFDIDTRGFDISAMGRLFAPQVLPGTVKKALYLDCDTIVCKSIR 123 Query: 141 QLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKA 200 L L AV +V +EP K + Y+NSGV+ + L +W + +K Sbjct: 124 PLYETELGDAVVGMV--MEPTVYKEMKESIGMGKDDPYYNSGVLLMALDRWRQEDVLQKL 181 Query: 201 LSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKD----------KTHQ 250 L S DQD +N LKG LP +YN + + + Sbjct: 182 LDFYKSCHGRLFACDQDTINGALKGRIKTLPVKYNYFTNYRYFRYSTLCSMCAAYREIGE 241 Query: 251 NYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKR 310 S +IHY G +PW K Y+ L +PWKD + K +R Sbjct: 242 EAYLEARRSPAIIHYLGDERPWIAGNHNHFKKLYEYYLAKTPWKDTPKQTGK------ER 295 Query: 311 YKHLLVQHHYISGI 324 Y H+ + ++ + Sbjct: 296 YMHMWWLFNRLTWL 309 >UniRef50_A8ARL4 Putative uncharacterized protein n=2 Tax=Citrobacter RepID=A8ARL4_CITK8 Length = 314 Score = 238 bits (607), Expect = 3e-61, Method: Composition-based stats. 
Identities = 80/316 (25%), Positives = 150/316 (47%), Gaps = 8/316 (2%) Query: 23 NTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQN 82 N + +N+AY DANYL+ V VSI S+++NN +L F++ +D K+ + + Sbjct: 3 NKTNVINIAYCTDANYLEYVAVSIMSVIMNNPEQSLAFFVFVYDVSDEDIAKLQSTSNKI 62 Query: 83 QLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQL 142 Q+ IT+ + + +K + +R+ Y RL +LL + R +YLDAD +C +S++ Sbjct: 63 QV-ITIDKADIEKYNNDFAIKHLNRSTYMRLAVPRLLKDKVARFIYLDADTLCFDSLSEI 121 Query: 143 LHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALS 202 + ++ V AV D + + +R + YFN+G +Y+++ W + KA + Sbjct: 122 NSVDIDNVVCAVSHDSLNIHDNKHARRLGLSI-DHYFNAGFLYINVANWIKHDIEHKANT 180 Query: 203 ILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLL 262 +L + Y DQD +N+ + G F+ +N ++ + D+ +N+ + Sbjct: 181 VLFEQGKSLPYFDQDALNIAMNGNITFIDNRWNFLFNWFT---DEQKENFFYHSDTLPRI 237 Query: 263 IHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPR---DAKSIIEFKKRYKHLLVQHH 319 IH+TG KPW+K S + Y +PW++ R +++ + + + Sbjct: 238 IHFTGGRKPWYKEHTGLSQQLYVFYHHFTPWRNAELRSYAPRMRPTDYRVYSRQAAKKGN 297 Query: 320 YISGIIAGVCYLCRKY 335 Y + I YL K Sbjct: 298 YFTAIKWYAKYLKTKI 313 >UniRef50_A4ECW2 Putative uncharacterized protein n=1 Tax=Collinsella aerofaciens ATCC 25986 RepID=A4ECW2_9ACTN Length = 328 Score = 238 bits (607), Expect = 3e-61, Method: Composition-based stats. 
Identities = 77/322 (23%), Positives = 138/322 (42%), Gaps = 23/322 (7%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHI-NLDFYIIADVYNDGFFQKIAKLAEQNQLRI 86 +N+ Y VD N++ + +I S+V N+ I ++ F++ ++ + + + ++ + + Sbjct: 4 MNLLYTVDNNFVPQLAANICSVVSNHSGIQDITFHVFSNGITEDNQRLLQEMVTEYNQNL 63 Query: 87 TLYRINT--DKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLH 144 Y I+ D L T W+ + RL L ++R++YLD D + GDI+ L + Sbjct: 64 VFYDISNFKDALGFDFDTSGWNEIVLARLLMAHFLPNEIERVIYLDGDTIVLGDIALLWN 123 Query: 145 LGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQ-YFNSGVVYLDLKKWADAKLTEKALSI 203 L G V +V P SRL+D +L G Y N+GV+ +DLK+W ++ L Sbjct: 124 QDLKGCVVGMV----PEPTVGPSRLNDLDLNGCLYHNAGVLLVDLKQWRSTCCEDQLLDY 179 Query: 204 LMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNT--------IYTIKSELKDKTHQNYKKL 255 + DQD +N +LK L +N + S + + +N Sbjct: 180 CERRSGRLFANDQDALNAVLKDKICSLSPAFNYSNIFDYYPFIFLNSLMPGFSDENSFNT 239 Query: 256 ITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHLL 315 +++HY G +PW + + Y L + WKD D R + L Sbjct: 240 ARSKPIVVHYLGEERPWRRGNTHRFNNEYHFYLSETFWKDAKDEDGWGAYFLAWRTFNFL 299 Query: 316 ------VQHHYISGIIA-GVCY 330 +++ ISG+I + Y Sbjct: 300 TRPFPQLRYKVISGLIPAFLKY 321 >UniRef50_Q116W1 Glycosyl transferase, family 8 n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q116W1_TRIEI Length = 278 Score = 237 bits (606), Expect = 4e-61, Method: Composition-based stats. 
Identities = 77/292 (26%), Positives = 140/292 (47%), Gaps = 17/292 (5%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 +N+ + D NY GV+ITS++LNN + D +II + + QK+ L++ + Sbjct: 2 MNLLFCFDQNYQQHFGVAITSVLLNNLSSHFDVHIITNFMEEKLKQKLDTLSKNYKCSFH 61 Query: 88 LYRINT-DKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLG 146 LY IN DK+ L + S A Y+RL ++L +D++LYLD+DVV + +L ++ Sbjct: 62 LYIINNLDKISKLKVSDHVSNATYYRLIMAEILPKHIDKVLYLDSDVVVISPLEELYNID 121 Query: 147 LNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMS 206 L A S S + + FNSGV+ ++L+KW + +++ K + Sbjct: 122 LENYFIAA------------SGFSGTLVKSKGFNSGVMVVNLEKWRNEQISTKVIDFATK 169 Query: 207 KDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYT 266 + Y DQ +N ++K L + R++N + K + + ++ +IHY Sbjct: 170 NRDKLPYHDQSALNRVIKQNYLIIDRKWNFQVDLSPR---KIQKPDDNIALKNARIIHYI 226 Query: 267 GATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDA-KSIIEFKKRYKHLLVQ 317 G++KPW+ W Y++ L+ S W + + + F+K + L + Sbjct: 227 GSSKPWYFWISDQRKNIYELYLKKSLWSTSKLQMIFQQTVYFRKALQRKLKK 278 >UniRef50_C3PWZ8 Lipopolysaccharide 1,2-glucosyltransferase n=1 Tax=Bacteroides sp. 9_1_42FAA RepID=C3PWZ8_9BACE Length = 315 Score = 236 bits (603), Expect = 7e-61, Method: Composition-based stats. 
Identities = 70/313 (22%), Positives = 131/313 (41%), Gaps = 12/313 (3%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 +++A +D+ ++ V+I SI+ NN ++ +I++ ++++AE+ I Sbjct: 1 MHIALTIDSKFVRYCAVTIVSILENNDPKDIMLHIVSGHLPKEDVLTLSQVAEKYGTSIA 60 Query: 88 LYRINTDKLQCLPCT---QVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLH 144 Y I +KLQ Q S +++R +L T+ R++YLD+D + G + +L Sbjct: 61 FYYIPHEKLQNYEVKWQKQRLSMVVFYRCVLASILPSTISRVIYLDSDTLVLGSLKELWD 120 Query: 145 LGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSIL 204 LN A V+D RL Y N GV+ L+L W + ++ + Sbjct: 121 TNLNQLALAGVQDTVSPNPSYFERLQ-YAPSYNYINGGVLLLNLAYWRKHNIEQQCIKYY 179 Query: 205 MSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTI--YTIKSELKDKTHQNYKKLITESTLL 262 + DQD++N LL + + ++N + + + ++ Sbjct: 180 QQYPDRIILNDQDILNALLYDQKVLIDIKWNVQDDFYRNNRYTSPAWKPSYTDAILHPII 239 Query: 263 IHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHLLVQHHYIS 322 +HY+G KPW A++P + +P+ D + K I R+ HLL YI Sbjct: 240 LHYSGR-KPWAYHAMHPLRHLFFHYQRLTPYDDSA--KQKKISTRIYRFIHLLP---YIL 293 Query: 323 GIIAGVCYLCRKY 335 G+ +K Sbjct: 294 GLKPKKYVNLKKI 306 >UniRef50_C4VEI8 General stress protein A n=24 Tax=Enterococcus RepID=C4VEI8_ENTFA Length = 303 Score = 235 bits (601), Expect = 1e-60, Method: Composition-based stats. 
Identities = 65/283 (22%), Positives = 131/283 (46%), Gaps = 9/283 (3%) Query: 19 LANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHIN-LDFYIIADVYNDGFFQKIAK 77 + + + L + + N++ + SI+ N+ + FY+I D N Q + Sbjct: 1 MQEMENRKELAIVSCCNTNFVPHLAAMFVSILENSPSAAAVHFYVIDDNINFESKQLLYF 60 Query: 78 LAE--QNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLL-GLTLDRLLYLDADVV 134 + Q +T ++IN + + ++ + Y+R+ +L G ++RLLY+D D++ Sbjct: 61 TIKHTQLNAELTFFKINPHFFKNVVTSERIPKTAYYRIAIPELFRGSQIERLLYMDCDMI 120 Query: 135 CKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADA 194 D+++L + L + A V+D Q + +++ P YFNSG++ +D+KKW + Sbjct: 121 ALDDVAKLWTVDLGENIIAAVEDAGFHQR--LEKMAIPAESMCYFNSGLLLIDVKKWLNL 178 Query: 195 KLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDK---THQN 251 +T K L + + ++ DQD +N +L L ++N I S+ K + Sbjct: 179 DVTTKVLRFIEENPDKLRFHDQDALNAVLHDRWTLLHPKWNAQGYILSKAKKHPTIYGEK 238 Query: 252 YKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWK 294 + + +IH+TG KPW K + + +YY + ++ Sbjct: 239 QYEETRRAPSIIHFTGHVKPWTKEFQWYTKRYYDQYANRTAFR 281 >UniRef50_D2ELM0 Lipopolysaccharide biosynthesis glycosyltransferase n=1 Tax=Pediococcus acidilactici 7_4 RepID=D2ELM0_PEDAC Length = 552 Score = 235 bits (600), Expect = 2e-60, Method: Composition-based stats. 
Identities = 62/268 (23%), Positives = 112/268 (41%), Gaps = 7/268 (2%) Query: 27 CLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQN-QLR 85 +NV ++ + + S SI+ N+ +F+++ D D + + + Sbjct: 278 VINVISAANSAFTQALATSYVSILENDPDHQYNFFLLPDHLTDRDMMLLGSIIARYDNAT 337 Query: 86 ITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHL 145 I + +N + L + + Y+R+ A LL +++R +YLD D++ + +L Sbjct: 338 IKVVEVNEELLANAVESDRIVKTAYYRILAPALLP-SINRAIYLDCDIIANTSLHELWQT 396 Query: 146 GLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILM 205 L G V A V+D + ++ + +YFNSG++ +DL +W T+K L + Sbjct: 397 NLEGNVIAAVEDAGFHDR--LEKMGITKENEKYFNSGMMLIDLVRWRARSTTQKVLDYIN 454 Query: 206 SKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLI---TESTLL 262 ++ DQD +N L L L ++N I E + E L Sbjct: 455 QNPEKLRFHDQDALNANLYDDWLHLHPQWNAQSNIIMETIFPPRTELLEPYAETREDPKL 514 Query: 263 IHYTGATKPWHKWAIYPSVKYYKIALEN 290 IH+ G KPWH+ +P Y E Sbjct: 515 IHFCGHVKPWHEGCEHPYADVYLKYHEM 542 Score = 224 bits (572), Expect = 3e-57, Method: Composition-based stats. 
Identities = 68/265 (25%), Positives = 121/265 (45%), Gaps = 7/265 (2%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRI 86 +N+ D NY D + ++I + + N F ++ + D + KL N I Sbjct: 4 INILLAADRNYADQLCITIKTALETLNSATRAHFIVLTNNLGDQTRALLDKLM-HNFHTI 62 Query: 87 TLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLL-GLTLDRLLYLDADVVCKGDISQLLHL 145 ++ ++ P Q ++ YFR+ A +LL +DRL+YLD DV+ + D+++L Sbjct: 63 EYLNLDDERFDFCPTNQHINKTAYFRIIAPKLLASRQIDRLIYLDVDVLIRKDLTELAES 122 Query: 146 GLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQ-YFNSGVVYLDLKKWADAKLTEKALSIL 204 LN V D + YFNSG++ +D+ +W ++TEK L+ + Sbjct: 123 NLNQNTVGAVIDTGQAFALHRLGVDPVVAASNLYFNSGIMVIDVAQWNAHRITEKTLAFI 182 Query: 205 MSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITES---TL 261 + + + DQD +N +L G FL ++N +I +Q Y +LI E+ Sbjct: 183 RNHADRIIFHDQDALNAVLAGEVQFLHPKWNLQNSIIFRKHRPINQGYAELIDEAIKEPS 242 Query: 262 LIHYTGATKPWHKWAIYPSVKYYKI 286 ++H+T KPW ++P + Y Sbjct: 243 IVHFTTHEKPWKDLTVHPYLDEYHE 267 >UniRef50_A5ZFA9 Putative uncharacterized protein n=1 Tax=Bacteroides caccae ATCC 43185 RepID=A5ZFA9_9BACE Length = 310 Score = 234 bits (598), Expect = 3e-60, Method: Composition-based stats. 
Identities = 78/274 (28%), Positives = 124/274 (45%), Gaps = 9/274 (3%) Query: 29 NVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITL 88 N+ G+D Y G + S+ +N + Y+++ ++ + +L + Q +I Sbjct: 3 NIICGIDDQYCQHCGAMLLSLFESNPGA-ITIYVLSLELSEKSKNLLKELVDSYQKQIHF 61 Query: 89 YRINTDKLQCLP--CTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLG 146 I ++ + P T S A Y RLF QLL +D+ LY+D+D++ K DIS L Sbjct: 62 IDIPSELVLNFPMKSTDYPSLATYLRLFIPQLLPFEVDKALYVDSDIIFKKDISALYDSD 121 Query: 147 LNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMS 206 + A ++D + RL PE YFN+G V L++K D T KA++ + Sbjct: 122 ITNYALAGMEDAP---NQNALRLGFPE-SDLYFNAGFVLLNVKYLRDMDFTNKAMAYIRD 177 Query: 207 KDNVYKYPDQDVMNVLLKGMTLFLPREYNTI--YTIKSELKDKTHQNYKKLITESTLLIH 264 DQDV+N LL G LF+P ++N + + K K + +S +IH Sbjct: 178 CREKIVLHDQDVLNALLHGKVLFVPIKWNMLDCFYRKPPFIAKKYMRELHENLDSPAVIH 237 Query: 265 YTGATKPWHKWAIYPSVKYYKIALENSPWKDDSP 298 ++G KPWH +P K Y W SP Sbjct: 238 FSGPLKPWHHGCPHPLRKEYFNYSRKLSWGCQSP 271 >UniRef50_B4RJG6 LgtC n=29 Tax=Neisseria RepID=B4RJG6_NEIG2 Length = 307 Score = 232 bits (591), Expect = 2e-59, Method: Composition-based stats. 
Identities = 60/298 (20%), Positives = 118/298 (39%), Gaps = 14/298 (4%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 +++ + D NY + V+ S+ + + F+++ ++ +A I Sbjct: 1 MDIVFAADDNYAAYLCVAAKSVEAAHPDTEIRFHVLDAGISEENRAAVAANLRGGGGNIR 60 Query: 88 LYRINTDKLQCLPCT-QVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLG 146 +N + P + S Y RL + + D++LYLD DV+ + + L Sbjct: 61 FIDVNPEDFAGFPLNIRHISITTYARLKLGEYI-ADCDKVLYLDTDVLVRDGLKPLWDTD 119 Query: 147 LNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMS 206 L G D+ +++ + YFN+GV+ ++LKKW + + + + Sbjct: 120 LGGNWVGACIDLFVERQEGYKQKIGMADGEYYFNAGVLLINLKKWRRHDIFKMSCEWVEQ 179 Query: 207 KDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKD---KTHQNYKKLITESTLL- 262 +V +Y DQD++N L KG + +N + T + + + H + L +T + Sbjct: 180 YKDVMQYQDQDILNGLFKGGVCYANSRFNFMPTNYAFMANGFASRHTDPLYLDRTNTAMP 239 Query: 263 ---IHYTGATKPWHKWAIYPSVKYYKIA---LENSP--WKDDSPRDAKSIIEFKKRYK 312 HY G+ KPWH+ + + L P W+ + + R K Sbjct: 240 VAVSHYCGSAKPWHRDCTVWGAERFTELAGSLTTVPEEWRGKLAVPPTKHMLQRWRKK 297 >UniRef50_Q9L6B2 Putative glycosyl transferase n=3 Tax=Pasteurella RepID=Q9L6B2_PASMU Length = 302 Score = 229 bits (585), Expect = 1e-58, Method: Composition-based stats. 
Identities = 69/300 (23%), Positives = 125/300 (41%), Gaps = 8/300 (2%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 +N+ + D Y + V+I SI+ +N + FYI D + I + + Sbjct: 1 MNILFVSDDVYAKHLVVAIKSIINHN-EKGISFYIFDLGIKDENKRNINDIVSSYGSEVN 59 Query: 88 LYRINTDKLQCLPCT-QVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLG 146 +N + + P S A Y RL A + L L++++YLD DV+ + L ++ Sbjct: 60 FIAVNEKEFESFPVQISYISLATYARLKAAEYLPDNLNKIIYLDVDVLVFNSLEMLWNVD 119 Query: 147 LNGAVAAVVKDVEPMQEKAVSRLSDPELLGQY-FNSGVVYLDLKKWADAKLTEKALSILM 205 +N + A D EK+ + S +Y FN+GV+ +L +W + +AL +L Sbjct: 120 VNNFLTAACYDSFIENEKSEHKKSISMSDKEYYFNAGVMLFNLDEWRKMDVFSRALDLLA 179 Query: 206 SKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDK-----THQNYKKLITEST 260 N Y DQD++N+L + +L +N + +K ++ + + T Sbjct: 180 MYPNQMIYQDQDILNILFRNKVCYLDCRFNFMPNQLERIKQYHKGKLSNLHSLEKTTMPV 239 Query: 261 LLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHLLVQHHY 320 ++ HY G K WH + +V +Y+ L D R K + + + Y Sbjct: 240 VISHYCGPEKAWHADCKHFNVYFYQKILAEITRGTDKERVLSIKTYLKALIRRIRYKFKY 299 >UniRef50_C0WCJ1 Lipopolysaccharide 1,2-glucosyltransferase n=1 Tax=Acidaminococcus sp. D21 RepID=C0WCJ1_9FIRM Length = 338 Score = 229 bits (583), Expect = 2e-58, Method: Composition-based stats. 
Identities = 81/324 (25%), Positives = 138/324 (42%), Gaps = 22/324 (6%) Query: 11 KVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDG 70 K + A L+VAY V+ Y +G S+ S++ NN H + F+I D Y+ Sbjct: 16 KGVETFSKNAEKTDKAPLHVAYNVNDGYFQIMGASLVSVLENNAHRAVMFHIFTDGYSKE 75 Query: 71 FFQKIAKLAEQNQLRITLYRINTDKLQCLPCT-QVWSRAMYFRLFAFQLLGLTLDRLLYL 129 QK+ +LA++ I LY ++ + + +SR Y R+ +L D LYL Sbjct: 76 NAQKMEQLADRYGCVIKLYTLHMEPFADFHVKVERFSRITYGRIVMPLILAAETDHFLYL 135 Query: 130 DADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLK 189 DAD + + +L H L G V + P ++ L G+YFN GV+ +++ Sbjct: 136 DADTMVIRPLDELYHWDLTGKAMGAVSERMPDAKRRGDYLHL--NNGRYFNDGVMMVNIP 193 Query: 190 KWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTH 249 +W +TEKA S+ + QD++N++ G FLP YN + + + K Sbjct: 194 EWQKQNITEKAFSLQKEPKERFLGQSQDILNIVFDGTNAFLPSIYNEFGGGEDDPQQKG- 252 Query: 250 QNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKD----DSPRDAKSII 305 +IH+TG KPW + ++ SPW+ ++ Sbjct: 253 -----------TIIHWTGRRKPWQMVLSDYDAQ-WRSYNAASPWETLTAILPILKPENYH 300 Query: 306 EFKK--RYKHLLVQHHYISGIIAG 327 +FK+ +Y+ Y+ G+ Sbjct: 301 DFKEWAKYRRKESFRDYVKGMAYY 324 >UniRef50_B3CA80 Putative uncharacterized protein n=1 Tax=Bacteroides intestinalis DSM 17393 RepID=B3CA80_9BACE Length = 301 Score = 228 bits (582), Expect = 2e-58, Method: Composition-based stats. 
Identities = 66/271 (24%), Positives = 121/271 (44%), Gaps = 6/271 (2%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYII-ADVYNDGFFQKIAKLAEQNQLRI 86 +++ +D NY++ GV + S+ ++ +II + +++ E++Q + Sbjct: 2 IDIVCSIDENYIEYCGVMLASLFVHTPDEKFRVHIICSSKVEKAGKKRLKVFCEKHQAEV 61 Query: 87 TLYRINTDKLQCLPCTQ--VWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLH 144 Y ++ ++ P + S A Y RLF +L+ ++++LYLD D++ I +L Sbjct: 62 YFYDVDYSLIKDFPIRKQDHLSLAAYLRLFMSELIPSNINKILYLDCDLIVVDSIKELWE 121 Query: 145 LGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSIL 204 ++ A V++ P ++ L P + YFNSGV+ ++L+KW + K E S + Sbjct: 122 KNIDNIAVAAVEERSPFDTESPVTLKYP-VEYSYFNSGVMLINLQKWREKKFVEACKSYI 180 Query: 205 MSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKL--ITESTLL 262 S K DQDV+N LL F+ +N + + + K +S + Sbjct: 181 ASNYENIKLHDQDVLNALLYKEKQFISIRWNLMDFFLYASPEVQPERKKDWDDALKSPAI 240 Query: 263 IHYTGATKPWHKWAIYPSVKYYKIALENSPW 293 IH+TG KPW P Y + W Sbjct: 241 IHFTGKRKPWMYNCDSPFRDQYIRFAKQQGW 271 >UniRef50_B8ISQ5 Glycosyl transferase family 8 n=1 Tax=Methylobacterium nodulans ORS 2060 RepID=B8ISQ5_METNO Length = 328 Score = 227 bits (580), Expect = 3e-58, Method: Composition-based stats. 
Identities = 71/304 (23%), Positives = 123/304 (40%), Gaps = 15/304 (4%) Query: 19 LANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKL 78 L + + + VA +D + V++ S++ LD +I + +IA L Sbjct: 4 LHETDEIDRIAVALCIDRAFFRHALVTVASLLDAGPRQPLDVHIFYAEADPACMARIAAL 63 Query: 79 -AEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKG 137 A+Q++ +I+ D+ + P + S Y RL L+ ++LYLDAD++ Sbjct: 64 FADQDRHGCHFQKISLDRFEGFPVSDAISAGTYARLLLPYLMPRRA-KVLYLDADLIVLD 122 Query: 138 DISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLT 197 D++ L L A A V+D A+ D YFN+GV+ ++L W L Sbjct: 123 DVAPLWRTELGAAPVAAVRDPFCDNRPAIGFSPDE----PYFNAGVLLMNLAVWRREGLA 178 Query: 198 EKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKT--HQNYKKL 255 E+ + + + KY DQD +NV+L+G F+ +N + + + Sbjct: 179 ERVAAHIDAHGASLKYFDQDALNVVLRGRARFVDPRWNFQPRMADATPADIACARAEFRR 238 Query: 256 ITESTLLIHYTGATKPWHK-WAIYPSVKYYKIALENSP------WKDDSPRDAKSIIEFK 308 +IHYT KPW +AI+ Y + P + D + K Sbjct: 239 TRARPAIIHYTTPHKPWKDPFAIHYGRHYLDCLMRLEPDLRARYFADVPQQPRLRASHLK 298 Query: 309 KRYK 312 R + Sbjct: 299 ARMR 302 >UniRef50_B3Z5I6 Glycosyltransferase family 8 n=2 Tax=Bacillus cereus group RepID=B3Z5I6_BACCE Length = 317 Score = 227 bits (580), Expect = 3e-58, Method: Composition-based stats. 
Identities = 75/294 (25%), Positives = 132/294 (44%), Gaps = 18/294 (6%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRH-INLDFYIIADVYNDGFFQKIAKLAEQNQLRI 86 LNV Y D NY VGVS+ S++ NN+H NL+ ++I + + + + + ++ I Sbjct: 3 LNVVYSSDDNYAQHVGVSLLSLLQNNQHFNNLNIFLIENNISSYNKKNLNSVCKKYNKTI 62 Query: 87 TLYRINTDKLQ-CLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHL 145 N + L + Y RLF ++ LD+++YLD D + +S L Sbjct: 63 QYINFNVLLERLELNINDSIAINSYARLFLAGIIPEELDKIIYLDCDSIINSSLSDLWDT 122 Query: 146 GLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILM 205 + A V D Q K + E Y N+G++ ++LKKW + + +K + + Sbjct: 123 DVTEYFVAGVCDTVSNQTKLRIDMDKSE---GYINAGMLLINLKKWREENIEQKFMEFIK 179 Query: 206 SKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTI----------KSELKDKTHQNYKKL 255 KD + DQ +N +LK L+L ++N + EL++ ++ Sbjct: 180 KKDGNVFHHDQGTINGVLKDKILYLHPKFNAMTPFFTMSRKEIMSYYELENYYNEIEIDE 239 Query: 256 ITESTLLIHYT--GATKPWHKWAIYPSVKYYKIALENSPWKD-DSPRDAKSIIE 306 ++ + IHYT +PW + +P YK L+ +PWK D +D + +E Sbjct: 240 AVKNPVFIHYTPAFVNRPWIEGCKHPLTSLYKSYLDMTPWKSTDLWKDRRGKVE 293 >UniRef50_UPI000196958D hypothetical protein BACCELL_01586 n=1 Tax=Bacteroides cellulosilyticus DSM 14838 RepID=UPI000196958D Length = 305 Score = 227 bits (580), Expect = 3e-58, Method: Composition-based stats. 
Identities = 60/304 (19%), Positives = 111/304 (36%), Gaps = 6/304 (1%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 +N+ D+ Y+ V + S NN Y++ + + I K+ Sbjct: 1 MNIVCAADSGYVQHCSVMLISFFENNPGEEHAVYLLTEGLDLDDLDFIQKIVHSYNGHFF 60 Query: 88 LYRINTDKLQCLP--CTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHL 145 +++ L+ P T S A Y RLF LL ++++LYLD D++ I +L Sbjct: 61 YCQVDFKFLEKCPIKSTDHLSIATYNRLFMADLLPADVNKVLYLDCDIIVNQSIKELWET 120 Query: 146 GLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILM 205 L + + V D + YFN+GV+ ++L W +T+ + + Sbjct: 121 PLRDNFVVAAFEERGCCAEDVYERLDYDSKYGYFNAGVLLVNLDYWRTHNMTQAFIEYIE 180 Query: 206 SKDNVYKYPDQDVMNVLLKGMTLFLPREYNT--IYTIKSELKDKTHQNYKKLITESTLLI 263 + DQDV+N ++ + +N I+ +K + I ++ Sbjct: 181 HNFEKLRAHDQDVLNAFFYDKSVHISLAWNVEFIFYYYGIIKKFGFDRDLRFILRHPKIL 240 Query: 264 HYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHLLVQHHYISG 323 H+T KPW +P Y L+ K + ++ +Y + I G Sbjct: 241 HFTWKPKPWETSCQHPFRINYYRYLKKI--KKNPLSFRDTLRALWDKYYFCFLIKWKIKG 298 Query: 324 IIAG 327 Sbjct: 299 HKYY 302 >UniRef50_B2UPJ4 Glycosyl transferase family 8 n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UPJ4_AKKM8 Length = 315 Score = 226 bits (576), Expect = 1e-57, Method: Composition-based stats. 
Identities = 77/282 (27%), Positives = 120/282 (42%), Gaps = 14/282 (4%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRI 86 +N+ Y D N G GVSI S++ N ++ D YI+ + + L + L + Sbjct: 1 MNIVYATDDNGALGTGVSIVSLMENLPPGVHADIYIMTGGLSGDNTARFHSLQQGYNLHL 60 Query: 87 TLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLG 146 + DK P WS A Y+RL L T++R LY+D D + DIS + Sbjct: 61 HFIDMK-DKYTDFPVGSKWSAATYYRLGLAGELPATVERALYVDIDTIFNRDISPMYESE 119 Query: 147 LNGAVAAVVKDVEPMQEKAVSRLSDPELLGQ---YFNSGVVYLDLKKWADAKLTEKALSI 203 + A V E + E++ SR LG+ Y N+GV+ + + + + LS Sbjct: 120 FGDCLIAGVFTTEDLSEESFSRWKREMNLGRDSIYINAGVILYHIGRIREECFESQVLSW 179 Query: 204 LMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNT----IYTIKSELKDKTHQNYKKLIT-- 257 + + + DQD++NV + L L +N I++I+ E N K Sbjct: 180 AKNNIHRLSWQDQDILNVCYQQRILLLHPMWNICDGAIWSIRWEGVTSFRNNPLKPADLL 239 Query: 258 ---ESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDD 296 +IHY G KPWH +I + + SPWKDD Sbjct: 240 EAARRPGIIHYWGHPKPWHPNSIRQDYGLFYKYWKKSPWKDD 281 >UniRef50_UPI0001A457E5 hypothetical protein NsubN_08151 n=1 Tax=Neisseria subflava NJ9703 RepID=UPI0001A457E5 Length = 345 Score = 224 bits (571), Expect = 4e-57, Method: Composition-based stats. 
Identities = 80/329 (24%), Positives = 141/329 (42%), Gaps = 28/329 (8%) Query: 25 SECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQN-Q 83 + ++ Y D NY+ +G ++ S++ NN + F+++ F ++ N Sbjct: 21 KQPKHIVYAADQNYIKHIGTALLSVLQNNTS-PIHFHLLVSGSEGYDFNIFDQIETSNQN 79 Query: 84 LRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLL 143 I++Y +NT+ L T ++ AMY+R+ LL LYLD DV+C G+I L Sbjct: 80 YAISVYHLNTEYFSTLQTTHYFTIAMYYRMSIPCLLKGITHTALYLDTDVLCLGNIDDLF 139 Query: 144 HLGLNGAVAAVVKDVEPMQEKAVS-RLSDPELLGQYFNSGVVYLDLKKWAD---AKLTEK 199 + ++ ++ A V D + YFNSGV+ ++ KW D K+ + Sbjct: 140 EIDISNSLIAAVPDAILYRAYIKQLNQFGFTDTEPYFNSGVILFNIDKWNDMAIDKILSE 199 Query: 200 ALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLIT-- 257 + + ++ PDQD++N+ G +L +N I+ HQ Y +LI Sbjct: 200 KMQAVEKQNFKLSCPDQDILNLACIGHVHWLSENFNWIH---------WHQKYSELIDNP 250 Query: 258 ESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKD--------DSPRDAKSIIEFKK 309 + L+H+ G KPWH+ +P+ Y +NSPW + +F++ Sbjct: 251 NNIRLVHFVGHIKPWHQLGFHPA---YDQYFKNSPWNNGYLEQPLSTWLPFPNPKRKFRQ 307 Query: 310 RYKHLLVQHHYISGIIAGVCYLCRKYYRK 338 K L Q YL R+ ++ Sbjct: 308 AAKRLWKQGQKKQAWAYYREYLLRRINKR 336 >UniRef50_C0QZN2 Glycosyl transferase, family 8 n=3 Tax=Brachyspira RepID=C0QZN2_BRAHW Length = 339 Score = 223 bits (569), Expect = 6e-57, Method: Composition-based stats. 
Identities = 70/303 (23%), Positives = 131/303 (43%), Gaps = 22/303 (7%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNR-HINLDFYIIADVYNDGFFQKIAKLAEQNQLRI 86 +N+ D NY +G +I SI+ N+ + F++I +KI L + I Sbjct: 1 MNICLASDNNYAPYMGTAIASILKNSSEDEKIIFHLIDGGITKENKEKIISLKNIKECEI 60 Query: 87 TLYRINTD----KLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQL 142 Y + + C +S AM++RL ++ +D++LYLD+D++ G + +L Sbjct: 61 NFYTPDIKMYDGWFEKTSCKAHFSAAMFYRLSIASIIPSNIDKILYLDSDLIATGSLKEL 120 Query: 143 LHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALS 202 + + A V+K + K + + YFNSGV+ ++ K W + ++ Sbjct: 121 FLMDIENHYAIVIKHSTNEKNKWSI-----DGINDYFNSGVLLINNKLWIKNNIEDQFNK 175 Query: 203 ILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLL 262 + + DQDV+N +L G + YN +K + N + I E+ ++ Sbjct: 176 FYNNNYKTC-FGDQDVLNNVLIGKVKYADMRYNV-------YAEKGYYNTENDI-ENPII 226 Query: 263 IHYTGATKPWHKWAIYP-SVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKH--LLVQHH 319 IHY KPW + + + + +PW D P A I +K Y + + ++ + Sbjct: 227 IHYLSPEKPWKENCRGTLFIDEFWRYYQYTPWFRDEPITAFQTILKQKFYDYDDVRLKGN 286 Query: 320 YIS 322 +I Sbjct: 287 WIK 289 >UniRef50_Q03HK5 Lipopolysaccharide biosynthesis glycosyltransferase n=1 Tax=Pediococcus pentosaceus ATCC 25745 RepID=Q03HK5_PEDPA Length = 549 Score = 223 bits (568), Expect = 8e-57, Method: Composition-based stats. 
Identities = 60/268 (22%), Positives = 115/268 (42%), Gaps = 7/268 (2%) Query: 24 TSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQN- 82 +NV ++ +++ + S SI+ N+ +FY++ D + + + + Sbjct: 275 NRGVVNVISAANSAFVEALATSYISILENDSENQYNFYLLPDHLDQRDMLILGSVISRYD 334 Query: 83 QLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQL 142 I + +++ L+ + ++ Y+R+ A +LL ++R +YLD D++ ++ L Sbjct: 335 NASIKIVKVDEKLLENAVESDRILKSAYYRILAPELLP-NINRAIYLDCDIIANTNLHDL 393 Query: 143 LHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALS 202 L G V A V+D + +YFNSG++ +DL W +T++ L Sbjct: 394 WQTSLEGNVLAAVEDAGFHDRLEH--MGITHDNSKYFNSGMMLIDLVSWRSQAVTQRVLD 451 Query: 203 ILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLI---TES 259 + ++ DQD +N +L L L ++N I + KL E+ Sbjct: 452 YINHNPEKLRFHDQDALNAILYDKWLHLHPKWNAQSNIVLDALVPPRTELLKLYAETREN 511 Query: 260 TLLIHYTGATKPWHKWAIYPSVKYYKIA 287 LIH+ G KPWH + +P Y Sbjct: 512 PKLIHFCGHVKPWHAESKHPYTNVYLKY 539 Score = 222 bits (567), Expect = 1e-56, Method: Composition-based stats. 
Identities = 68/269 (25%), Positives = 127/269 (47%), Gaps = 7/269 (2%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRI 86 +NV D NY D + ++I + + N N+ ++F ++++ ++ + KLA + Sbjct: 4 INVLLAADENYADQLQITIKTTLENLNKKTRVNFIVLSNNLSNSTKLALKKLAHGLH-TV 62 Query: 87 TLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLL-GLTLDRLLYLDADVVCKGDISQLLHL 145 ++ P ++ Y+R+ A QLL +DR+LYLD D++ + D+++L Sbjct: 63 EYLDLDPSVFAFCPTNSHINKTAYYRILAPQLLAKRNIDRILYLDVDLLVRHDLTELYDA 122 Query: 146 GLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQ-YFNSGVVYLDLKKWADAKLTEKALSIL 204 LN + V D + YFNSG++ +D+KKW + +TEK L+ + Sbjct: 123 ELNHNIVGAVIDTGQAFALNRLGVDPVVAANNIYFNSGILVIDIKKWNENHITEKTLNYI 182 Query: 205 MSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITE---STL 261 + ++ + DQD +N +L G L ++N +I ++ Y +LI E S Sbjct: 183 KHQSHLIIFHDQDALNAVLAGHVQMLHPKWNLQNSIVFRKHRPINEAYDQLINEAIKSPA 242 Query: 262 LIHYTGATKPWHKWAIYPSVKYYKIALEN 290 ++H+T KPW + +P + Y L Sbjct: 243 IVHFTTHEKPWKTLSEHPYLDEYHEELNE 271 >UniRef50_B3XL28 Glycosyl transferase family 8 n=1 Tax=Lactobacillus reuteri 100-23 RepID=B3XL28_LACRE Length = 331 Score = 223 bits (568), Expect = 9e-57, Method: Composition-based stats. 
Identities = 76/310 (24%), Positives = 144/310 (46%), Gaps = 27/310 (8%) Query: 29 NVAYGVDANYLDGVGVSITSIVLNNRHIN-LDFYIIADVYNDGFFQKIAKLAEQN-QLRI 86 N+ Y D + +G S+ S++ NN+ ++F+I+ + +I K+ + + Sbjct: 6 NIVYATDDTFAPVLGTSLLSLLRNNKEAKKINFFILDSGISKENKFRIEKICDNFVNASL 65 Query: 87 TLYRINT--DKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLH 144 +I + K+ S + Y RLF +L +++R+LYLD D + + L + Sbjct: 66 KWIKIESISKKIGIDVKNDRGSFSQYSRLFIGDVLDNSVERVLYLDCDTLILSSLKDLWN 125 Query: 145 LGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSIL 204 + L G + A +KD + L + +L+ FNSGV+ +DLK W D K+ EKA+S + Sbjct: 126 IELKGNIIAALKDAFSKYYRKNINLVNDDLM---FNSGVMLIDLKAWRDNKIKEKAISFI 182 Query: 205 MSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLIT------- 257 + + DQ V+N +L T L YN + +I +L + + Y+ + Sbjct: 183 RQRHGKVQQGDQGVLNSVLSNKTFALDPRYNLV-SIFYDLDYREIKLYRSPVNFYSEKII 241 Query: 258 ----ESTLLIHYTGAT---KPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKR 310 E+ +++H+T + +PW K + + K + + +PWK+ + IE K+ Sbjct: 242 VKAKENPVILHFTSSFYSIRPWFKNSNHQCKKIWLKFYQETPWKNQPLQ-----IEMSKK 296 Query: 311 YKHLLVQHHY 320 K + + Y Sbjct: 297 KKLINILFEY 306 >UniRef50_C0Z4I4 Putative uncharacterized protein gspA n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0Z4I4_BREBN Length = 264 Score = 222 bits (567), Expect = 1e-56, Method: Composition-based stats. 
Identities = 59/266 (22%), Positives = 110/266 (41%), Gaps = 13/266 (4%) Query: 27 CLNVAYGVDANYLDGVGVSITSIVLNN-RHINLDFYIIADVYNDGFFQKIAKLAEQNQLR 85 +++ V+ + + V + S+ N + ++I + + K ++ + Sbjct: 3 TIHIVTAVNDGFAIHLAVMLYSLFENKVSKNPVIVHVIDSQVSGENKSILTKTVKRFHAQ 62 Query: 86 ITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHL 145 I I+ ++ Y R+ LL +++++YLD+D+V K DI+ L + Sbjct: 63 IKYVTIDPTLYDGFLVRDHLTQETYHRISIPDLLDKEVEKVIYLDSDIVIKKDITPLWNT 122 Query: 146 GLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILM 205 ++ A V D K YFN+GV+ ++LKKW + +T+K + + Sbjct: 123 KVDQYYLAAVMDSWQGLNKLRHADLAIPDDCDYFNAGVLVMNLKKWREHNITKKIMDYMK 182 Query: 206 SKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHY 265 + +YP QD MN +L L L ++N + YK + +IHY Sbjct: 183 KNQGIIRYPSQDPMNAILHDNWLQLDTKWNYQ----------SKHLYKSNLRIDPAIIHY 232 Query: 266 TGAT-KPWHKWAIYPSVKYYKIALEN 290 TG KPW +P + Y L+ Sbjct: 233 TGEDSKPWLS-KKHPLREEYFKYLKK 257 >UniRef50_A6LGX5 Glycosyltransferase family 8 n=1 Tax=Parabacteroides distasonis ATCC 8503 RepID=A6LGX5_PARD8 Length = 325 Score = 221 bits (564), Expect = 2e-56, Method: Composition-based stats. 
Identities = 66/326 (20%), Positives = 134/326 (41%), Gaps = 23/326 (7%) Query: 29 NVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITL 88 ++ D NYL V + S+ N +L F+++++ + + + + E + ++++ Sbjct: 3 DIVVASDCNYLHLVSICAVSLFETNSSESLHFHLLSNGIDSADIKNLQTIVEGYRGKLSV 62 Query: 89 YRINTDKLQCLP-CTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGL 147 Y I + + + + S Y RLFA +L LD++LY+D D++ G I L + L Sbjct: 63 YPIENLRERLMTDVPETISLTSYARLFAGSILPANLDKVLYIDCDIIFNGSIRDLFNTDL 122 Query: 148 NGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSK 207 + + D P+ + + + Y N+GV+ + L +W + +K + L++ Sbjct: 123 GNCLVGGILD--PLISRTYKKEIKIPMSEPYINAGVLIIPLNRWRSEGMEQKFVDFLVAN 180 Query: 208 DNVYKYPDQDVMNVLLKGMTLFLPREYNTI-----YTIKSELK----DKTHQNYKKLITE 258 + DQ ++N + G LP ++N + Y K K + YKK I+ Sbjct: 181 RGKVHHHDQGIINAVCAGRKKILPPQFNVMSNSLCYPWKDLYKINTPFYDQEEYKKGIS- 239 Query: 259 STLLIHYTG--ATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFK------KR 310 S +IH+TG +PW +P + + +KD + R Sbjct: 240 SPAIIHFTGAIHGRPWIVGCTHPYANKFLQFKAKTAYKDIPLKPNNQSAALHRLEGILYR 299 Query: 311 YKHLLVQHHYISGII--AGVCYLCRK 334 + Y+ + + + +K Sbjct: 300 LLPFSLFKRYMQSVYYLSYFKHSIKK 325 >UniRef50_D2QX94 Glycosyl transferase family 8 n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2QX94_9PLAN Length = 362 Score = 221 bits (564), Expect = 3e-56, Method: Composition-based stats. 
Identities = 62/336 (18%), Positives = 125/336 (37%), Gaps = 40/336 (11%) Query: 19 LANINTSECLNVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKIAK 77 N + + D N+ G+ + S + N ++D +++ D +I++ Sbjct: 6 HPTQNMPTSIQLVTSSDNNFAIGLAGTFKSALTNLAADSSVDLWVLDGGITDENKAEISR 65 Query: 78 LAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKG 137 +L + ++ + + + A Y+RL ++L + + +YLD+D++ +G Sbjct: 66 HLSDPRLTLHFVSVDRKLVSQFVISHHVTDATYYRLLTPEILSRDIGKFIYLDSDLLIRG 125 Query: 138 DISQLLHLGLNGAVAAVVKD---------------------VEPMQEKAVSRLSDPELLG 176 D+++L + +GA ++D + R Sbjct: 126 DLTKLWNTPFDGAPCVAIQDSGAPFVDSTQLIEQQPSLRGCIANANPIPNYRELGLHPHA 185 Query: 177 QYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNT 236 Y N GV+ +DL W +L E+ L +L Y DQ +NV+L +N Sbjct: 186 PYLNGGVMMIDLDLWRREQLAERMLKVLSDYREHVTYWDQYALNVVLSQRWKQADHRWNQ 245 Query: 237 I-YTIKSELKDKT--HQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPW 293 I Y ++ + T + L + H+T KPW I+P + + LE S W Sbjct: 246 IAYPLRFSSHENTIFSKEAFDLYRNDPYISHFT-YRKPWQAECIHPRSEEFYQYLEGSIW 304 Query: 294 KDDSP--------------RDAKSIIEFKKRYKHLL 315 + P K ++++++ L Sbjct: 305 ANTKPVWQEYEPVAGVVHVPPKKPKPYYRRKFRELR 340 >UniRef50_C7IBC8 Glycosyl transferase family 8 n=1 Tax=Clostridium papyrosolvens DSM 2782 RepID=C7IBC8_9CLOT Length = 464 Score = 220 bits (562), Expect = 4e-56, Method: Composition-based stats. 
Identities = 63/242 (26%), Positives = 108/242 (44%), Gaps = 5/242 (2%) Query: 63 IADVYNDGFFQKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLT 122 I + + + E+ RI + + Q + YFR+F +++ + Sbjct: 2 IDGGISSRNKECLRACVEKYGSRIRFLELKPELYQDFKTQSYFGYVTYFRIFIPEIVEAS 61 Query: 123 LDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPEL----LGQY 178 + +++YLD D+V KGDI +L ++ A V+DV + + + G+Y Sbjct: 62 VRKVIYLDCDIVIKGDIRKLWENDISEYFVAAVEDVGIDIGGNFATMVKKHIGIPRKGKY 121 Query: 179 FNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIY 238 FN+GV+ ++L KW K TE L+ + DQD +N + K L LP E+N Sbjct: 122 FNAGVLLINLDKWRADKTTETIRKYLIENREKIYFADQDGLNAVFKDRWLKLPIEWNQQA 181 Query: 239 TIKSELK-DKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDS 297 I LK ++ + + ++IHYT KPW +P + Y L +PW D + Sbjct: 182 DILELLKRNRIDRPDVMKAALNPMIIHYTKQVKPWQYKDCHPLKEEYHRYLRLTPWNDTA 241 Query: 298 PR 299 P+ Sbjct: 242 PK 243 >UniRef50_D0D9G3 Putative general stress protein A n=1 Tax=Citreicella sp. SE45 RepID=D0D9G3_9RHOB Length = 327 Score = 220 bits (562), Expect = 4e-56, Method: Composition-based stats. 
Identities = 69/322 (21%), Positives = 134/322 (41%), Gaps = 32/322 (9%) Query: 25 SECLNVAYGVDANYLDGVGVSITSIVLNNRHIN-LDFYIIADVYNDGFFQKIAKLAEQNQ 83 + +NV Y D +GVSI S + N N ++ ++++ + + IA + Sbjct: 9 DKRINVVYACDNIQALPLGVSIASALENRAEGNPINIHVLSYRISRSNRKSIASQFDGRD 68 Query: 84 LRITLYRI---NTDKLQCLPCTQV--WSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGD 138 + + I N L+ L + + A Y RL +++ +DR +YLD D++ D Sbjct: 69 DTLCWHEITGENRKLLEDLFTSSNRPYPPAAYARLLISEVIP-NIDRAIYLDTDIIVATD 127 Query: 139 ISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLS-----------DPELLGQYFNSGVVYLD 187 +S L + +GA ++D+ P + RL E YF SGV+ D Sbjct: 128 LSPLWNTPFDGAGLLAIQDL-PTSNDHIKRLRALLSPEDISRYGIEDGDSYFQSGVLVFD 186 Query: 188 LKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTI---YTIKSEL 244 +K++ + +E + L + + +PD D +N++ + +N + + + + Sbjct: 187 MKEFTKTRASE-LIECLRNYPD-LTFPDNDALNIVFHDSFKLVDPRWNQMASVFKLDAAR 244 Query: 245 KDKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSI 304 + + + +IHY+G KPW +P + + AL++S W P Sbjct: 245 DTPYSAEVFQALLQDPYIIHYSGRPKPWEDGCTHPYLDRWVEALKDSAWNSWKPSRLNRA 304 Query: 305 IE--------FKKRYKHLLVQH 318 I+ KR++ + QH Sbjct: 305 IDRIPRIQRVLAKRFRRFVSQH 326 >UniRef50_A1XRC1 Glycosyltransferase n=1 Tax=Haemophilus ducreyi RepID=A1XRC1_HAEDU Length = 267 Score = 220 bits (560), Expect = 8e-56, Method: Composition-based stats. 
Identities = 73/260 (28%), Positives = 116/260 (44%), Gaps = 7/260 (2%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 +N+ + D NY + V + SI+ +N N++FYI+ + I L E+ I Sbjct: 1 MNIVFSSDENYAPHLSVCLYSILSHN--YNINFYILDLGIKEESKSFIKSLVEKFNSNIE 58 Query: 88 LYRINTDKLQCLPCT-QVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLG 146 +I+ D P S A Y RL L L+++LYLD D + G + L L Sbjct: 59 FIKISVDSFSNFPIYIDYISLATYARLKLTDYLP-QLEKVLYLDIDTIVNGSLIDLWDLD 117 Query: 147 LNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMS 206 LN A V D + L + + YFN+GV+ +D KW + +K++ I+ Sbjct: 118 LNEYYIAAVADPFIESLNYKTILGLDKNI--YFNAGVLLIDCIKWKQYNIFDKSVKIIKD 175 Query: 207 KDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYT 266 +Y DQD++N++LK L L YN + + +K + K IT ++ HY Sbjct: 176 LSKKLQYQDQDILNLILKDKVLLLDCRYNFMPSQLDFIKRDKVRKGIK-ITTPIVIYHYC 234 Query: 267 GATKPWHKWAIYPSVKYYKI 286 G KPWH + + Y Sbjct: 235 GPKKPWHIDCTNFNCELYAY 254 >UniRef50_B1Y723 Glycosyl transferase family 8 n=1 Tax=Leptothrix cholodnii SP-6 RepID=B1Y723_LEPCP Length = 316 Score = 219 bits (559), Expect = 8e-56, Method: Composition-based stats. 
Identities = 67/306 (21%), Positives = 126/306 (41%), Gaps = 26/306 (8%) Query: 29 NVAYGVDANYLDGVGVSITSIVLNNR-HINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 + D YL + ++ S+V +N H ++ +++ D + ++ + +I Sbjct: 12 PIVLACDEAYLMPLATTLRSVVESNAAHWPIECHVLVDDVSLPGRARVERSLPARAAQIR 71 Query: 88 LYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGL 147 + ++ S+ + RL LL L+R+LYLD D++ GD+ L+ L Sbjct: 72 WHAVDLTDFSSFETQAAISKMTFARLLMADLLPAELERVLYLDTDILVLGDLLPLMRTEL 131 Query: 148 NGAVAAVVKDVEPMQEKAVS--RLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILM 205 +GA+ V+D + K+ S P++ YFN+GV+ +DL +W +++ A L+ Sbjct: 132 DGAILGAVRDGLDAELKSTSPAPTGMPDVC-DYFNAGVLLIDLARWRAGRVSAAARDHLV 190 Query: 206 SKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHY 265 + + DQD +NV G L +N ++ ++ ++H+ Sbjct: 191 AHPQ-TPFADQDALNVACDGHWKPLAAHWNFQ-------GHRSTDIAALAPSQRPGIVHF 242 Query: 266 TGATKPWHKWAIYPSVKYYKIALENS-----P---WKDD------SPRDAKSIIEFKKRY 311 A KPW ++ + + Y + P W D A S E +R Sbjct: 243 ITALKPWKADSLSLNARLYDGWRSRTLFARHPVMRWTDAIRALVSRMNRALSAHESTRRL 302 Query: 312 KHLLVQ 317 KH L Q Sbjct: 303 KHQLRQ 308 >UniRef50_P43974 Putative glycosyltransferase HI0258 n=32 Tax=Haemophilus influenzae RepID=Y258_HAEIN Length = 330 Score = 218 bits (557), Expect = 2e-55, Method: Composition-based stats. 
Identities = 71/299 (23%), Positives = 128/299 (42%), Gaps = 17/299 (5%) Query: 25 SECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQL 84 S+ +N+ + D Y + VSI SI+ N ++FYI+ N I LA Sbjct: 36 SQTMNIIFSSDHYYAPYLAVSIFSIIKNTPK-KINFYILDMKINQENKTIINNLASAYSC 94 Query: 85 RITLYRINTDKLQCLPCT-QVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLL 143 ++ + Q P T S A Y RL + + +++ +Y+D D + + +L Sbjct: 95 KVFFLPVCESDFQNFPKTIDYISLATYARLNLTKYIK-NIEKAIYIDVDTLTNSSLQELW 153 Query: 144 HLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSI 203 ++ + A +D + + + YFN+G++ ++L KW + + +K+++ Sbjct: 154 NIDITNYYLAACRDTFIDVKNEAYKKTIGLEGYSYFNAGILLINLNKWKEENIFQKSINW 213 Query: 204 LMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLI 263 + +NV KY DQD++N + KG F+ +N T + +K K K I ++ Sbjct: 214 MNKYNNVMKYQDQDILNGICKGKVKFINNRFNFTPTDRDLIKKKNLLCVKMPI----VIS 269 Query: 264 HYTGATKPWHKWAIYPSVKYYKIALEN--------SPWKD--DSPRDAKSIIEFKKRYK 312 HY G K WHK + + + L+ S W D + I +KR K Sbjct: 270 HYCGPNKFWHKKCSHLNCHIGNLLLKEMDKIIDIPSSWYDHFEKIPFLIKIKRLRKRIK 328 >UniRef50_B7KM20 Glycosyl transferase family 8 n=1 Tax=Cyanothece sp. PCC 7424 RepID=B7KM20_CYAP7 Length = 347 Score = 218 bits (556), Expect = 2e-55, Method: Composition-based stats. 
Identities = 65/279 (23%), Positives = 122/279 (43%), Gaps = 7/279 (2%) Query: 24 TSECLNVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKIAKLAEQN 82 +E + + G D + G+ V++ S + N + +D YI+ N K+ ++ + Sbjct: 9 ENEPITIVSGADDKFALGLAVTLYSALANLDTKRKIDIYIVDGGINSKNRDKLTQILNSD 68 Query: 83 QLRITLYRINTDK--LQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDIS 140 + +++ + D L+ + + YFRL +LL ++R++YLD+D+V +G+++ Sbjct: 69 LMPVSIKWVKPDLTVLEGVKLFGSLNVTTYFRLLLPELLPTQVERVIYLDSDLVVEGNLA 128 Query: 141 QLLHLGLNGAVAAVVKD-VEPMQEKAVSRLSDPELLGQ--YFNSGVVYLDLKKWADAKLT 197 L L A V+D V P + L Y N+GV+ +++K+W L Sbjct: 129 NLWEQELGNCPAVAVQDYVFPYVCNGLKTYQQLGLASNTPYCNAGVMLINIKQWRIEALN 188 Query: 198 EKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLIT 257 K L + ++ DQD +N L+ L ++N K K+LI Sbjct: 189 RKILEYIRKFYDLVYLADQDGINALIANRFKLLDLKWNVQIFGVYNGKIDLLCKPKELIR 248 Query: 258 ESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDD 296 + ++H+T KPWH + + L S W +D Sbjct: 249 D-AFILHFTTPIKPWHPYYRQAGGSRFTHYLRKSKWFND 286 >UniRef50_B9KVD4 Glycosyl transferase, family 8 n=2 Tax=Rhodobacter sphaeroides RepID=B9KVD4_RHOSK Length = 334 Score = 217 bits (554), Expect = 3e-55, Method: Composition-based stats. 
Identities = 63/278 (22%), Positives = 114/278 (41%), Gaps = 7/278 (2%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIA-DVYNDGFFQKIAKLAEQNQLRI 86 +++ + D + V+ S R L +++ D + + LA + I Sbjct: 1 MHLLFCADRPFFRHAAVAAVSAASATRG-PLQVHLLTCDSCPEEEARFRVALAPFAHVGI 59 Query: 87 TLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLG 146 +++R+ +L+ L + S A Y R A ++L + R+LYLD D++ D++QLL L Sbjct: 60 SVHRVPAARLEGLFVDRHLSPAAYLRFLAPEVLPEAVQRVLYLDCDLIVLDDVAQLLRLD 119 Query: 147 LNGAVAAVVKDVEPMQEKAVSRLS--DPELLGQYFNSGVVYLDLKKWADAKLTEKALSIL 204 L G A D+ +R L Y NSGV+ +DL +W L++K + Sbjct: 120 LQGRAVAAAPDLGWKDAAQAARFRTLGIPLDRPYVNSGVLLMDLGRWRRDGLSQKLFDYV 179 Query: 205 MSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKK---LITESTL 261 ++ DQD +N +L L R +N + S + ++ Sbjct: 180 ARHGSLLLRHDQDALNAVLADDIHLLDRRWNLQVLLLSPWAKRALPEDRQATVAARRDPA 239 Query: 262 LIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPR 299 ++H++ A KPW+ + Y +PW P Sbjct: 240 ILHFSTADKPWNFRVWTRRRELYFRFRARTPWSRAVPE 277 >UniRef50_C1IBL0 Glycosyl transferase n=1 Tax=Clostridium sp. 7_2_43FAA RepID=C1IBL0_9CLOT Length = 273 Score = 217 bits (554), Expect = 3e-55, Method: Composition-based stats. 
Identities = 61/270 (22%), Positives = 109/270 (40%), Gaps = 2/270 (0%) Query: 24 TSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQ 83 +S +++ D NY+ + S+VLNN +++ Q++ + + Sbjct: 2 SSNRIDLLVTFDKNYIPPFQTMLKSLVLNNPRETFHIWLLHSEIPLEMLQEVEEYCAKQG 61 Query: 84 LRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLL 143 +T + + P ++ + + MY+RL A +L ++ ++LYLD D++ I L Sbjct: 62 AAMTSINVERSVFKNAPVSKRYPQEMYYRLLAPLILPKSIKKILYLDPDILIINSIRPLW 121 Query: 144 HLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSI 203 L + A V + Y+NSGV+ +DL K E+ Sbjct: 122 ETELGNYIFAAASHVGVTGVINDINRVRLRVDHDYYNSGVMLMDLTKARSIVNVEEIFQC 181 Query: 204 LMSKDNVYKYPDQDVMNVLLKGMTLFLPR-EYNTIYTIKSELKDKTHQNYKK-LITESTL 261 + PDQD+ N L TL L +N S ++ NY IT +T+ Sbjct: 182 VREHKEELLLPDQDIFNYLYGKQTLPLDDAIWNYDARKYSNYLLRSGGNYDMDWITRNTV 241 Query: 262 LIHYTGATKPWHKWAIYPSVKYYKIALENS 291 ++H+ G +KPW YK ++ S Sbjct: 242 VLHFCGKSKPWKHSQNNRFAMLYKHYMQIS 271 >UniRef50_C6I3U6 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=C6I3U6_9BACE Length = 310 Score = 217 bits (554), Expect = 4e-55, Method: Composition-based stats. 
Identities = 68/273 (24%), Positives = 117/273 (42%), Gaps = 14/273 (5%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 +N+ +D NY+ V +TS +NN + + Y+I NDG + ++ + Sbjct: 1 MNILCCLDDNYVQHTSVMLTSFFINNDFEHHNIYVITMQLNDGNVAYLREVVNKYHSNFY 60 Query: 88 LYRINTDKLQCL--PCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHL 145 LY++N L T S A Y RLF+ Q+L ++LY+D D+V + + +L + Sbjct: 61 LYQVNEAMLSGFVRKETDYVSLAAYLRLFSTQVLPFNCSKVLYIDGDIVVRKSLEELWKM 120 Query: 146 GLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILM 205 + A V + KA + ++ YFNSG + ++L W + + EKA+ + Sbjct: 121 DIENYAVAAVDET----IKANCIRHNYDVTLGYFNSGFMLINLSFWRENSVAEKAIDYMK 176 Query: 206 SKDNVYKYPDQDVMN-VLLKGMTLFLPREYNTI------YTIKSELKDKTHQNYKKLITE 258 K DQD +N +L G+ L +YN ++ + K + Sbjct: 177 RFPERIKSWDQDALNGILYGGLWKRLDLKYNLTTIFLCKQYVEGQDFPKIYTEEYNSAIS 236 Query: 259 STLLIHYTGATKPWHKWAI-YPSVKYYKIALEN 290 ++HYTG KPW + +P K Y Sbjct: 237 DPAVVHYTGPDKPWKYTVVDHPFKKDYLQYARM 269 >UniRef50_B1MX28 Lipopolysaccharide biosynthesis protein, LPS:glycosyltransferase n=2 Tax=Leuconostoc RepID=B1MX28_LEUCK Length = 283 Score = 217 bits (552), Expect = 6e-55, Method: Composition-based stats. 
Identities = 67/270 (24%), Positives = 114/270 (42%), Gaps = 3/270 (1%) Query: 21 NINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAE 80 I + +N+ +D NY+ + V + S+ N N+ ++ D +K+ + Sbjct: 5 KIINDDSVNILITIDENYIKPLRVLLYSLRQTNPRENMTIWLAHDHIEVAQLEKLHQFVA 64 Query: 81 QNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDIS 140 Q + +++T P + + MYFRL Q L TL R++YLD D++ I Sbjct: 65 QLGFVLHTIKVDTSLWASAPTFKQYPPEMYFRLLCGQYLPKTLHRVIYLDPDILVINPIR 124 Query: 141 QLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKA 200 L ++ L G + A + YFNSGV+ +DL + Sbjct: 125 PLANMPLKGQMLAASSHMGLTGISQTINHLRLGTRQVYFNSGVMLMDLDMMRQRVDMKAI 184 Query: 201 LSILMSKDNVYKYPDQDVMNVLLKGMTLFLPRE-YNTIYTIKSELKDKTHQNYK-KLITE 258 LS++ PDQD++N L L LP E +N K+ + + + E Sbjct: 185 LSVIQQYGKELILPDQDILNYLYGDEILSLPEEIWNYDTRDNIMHYAKSFGSVDMRWVME 244 Query: 259 STLLIHYTGATKPWHK-WAIYPSVKYYKIA 287 +T+++HY G KPW K +I P + Y+ Sbjct: 245 NTVILHYCGRPKPWEKSNSINPFIMLYQHY 274 >UniRef50_C1QBZ8 LPS:glycosyltransferase n=1 Tax=Brachyspira murdochii DSM 12563 RepID=C1QBZ8_9SPIR Length = 336 Score = 216 bits (550), Expect = 1e-54, Method: Composition-based stats. 
Identities = 74/310 (23%), Positives = 125/310 (40%), Gaps = 19/310 (6%) Query: 29 NVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 N+ D NY + V++ SI+ N N N+ F+II D K+ L + I Sbjct: 5 NICLCSDENYAKYMAVTMASILKNTNDDENIIFHIIESNIKDETKNKLIYLKKIKNCEIK 64 Query: 88 LYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGL 147 YR+ +K + A Y RL +L+ D++LYLD+D++ G + +L + + Sbjct: 65 FYRVEYNK---------YPLATYLRLLIPELIK-DADKVLYLDSDIIVNGSLKELFDIDI 114 Query: 148 NGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSK 207 NG A VKD+ K L + YFN+GVV + K D +++K S Sbjct: 115 NGYYALAVKDLYVDIYKEHKELIEIGNNRIYFNAGVVLFNNKSCIDNNISQKFYSYFTEN 174 Query: 208 DNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTG 267 N K+ DQD++N + R++N + K + ++IH+ Sbjct: 175 KNKLKFHDQDILNHCFIDKVKIIDRKWNFMPFRDYNTKSHYPTK------DDAVIIHFV- 227 Query: 268 ATKPWHKWAIYPS-VKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHLLVQHHYISGIIA 326 KPW + Y + +PW + P A + +K Y + ++ Sbjct: 228 EHKPWKTQKDRTYFLDDYWRYYQYTPWFFEEPITAIQTMMQQKMYDYEDIRFRSNYFKFF 287 Query: 327 GVCYLCRKYY 336 G+ K Sbjct: 288 GIYANSSKLQ 297 >UniRef50_D2QX95 Glycosyl transferase family 8 n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2QX95_9PLAN Length = 350 Score = 216 bits (550), Expect = 1e-54, Method: Composition-based stats. 
Identities = 63/301 (20%), Positives = 115/301 (38%), Gaps = 26/301 (8%) Query: 24 TSECLNVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKIAKLAEQN 82 L+V D + G+ +I S++ + + L+ +++ + + Sbjct: 1 MQRVLDVLTSADDRFAIGLAGTIKSVLASLSPSSKLNLWVLDGGISSENRDDLIHHWNDP 60 Query: 83 QLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQL 142 +L + ++ L S A Y+RL A LL ++ +LLY+DAD++ + D++ L Sbjct: 61 RLSVNWLPVDRALLAEFKVAPHMSDAAYYRLLAPNLLPSSVKKLLYIDADLLVQRDLTDL 120 Query: 143 LHLGLNGAVAAVVKDVE-PMQEKAVSRLSDPELLG--------------------QYFNS 181 +G V D+ P + L P+ L +YFNS Sbjct: 121 WDEPFDGHSCIAVHDIGAPFLDSNQILLEKPDALSRIVCRNPIPMFEELGLAPETRYFNS 180 Query: 182 GVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYN---TIY 238 GV +DL+ W +L+ + +L + Y DQ +N++L +N I+ Sbjct: 181 GVFMIDLETWRSEQLSVQMFDVLCTHRERQIYHDQFALNIVLANRWKAADYRWNQLAYIH 240 Query: 239 TIKSELKDKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSP 298 +K + S ++H+T KPW +P K + L S W P Sbjct: 241 ELKVPQHTFLEPQVFQQYKHSPWVVHFT-YRKPWQPECQHPLRKRFFDYLAGSKWMQAMP 299 Query: 299 R 299 Sbjct: 300 E 300 >UniRef50_C2KV37 Lipopolysaccharide biosynthesis protein, LPS:glycosyltransferase n=1 Tax=Oribacterium sinus F0268 RepID=C2KV37_9FIRM Length = 324 Score = 215 bits (549), Expect = 1e-54, Method: Composition-based stats. 
Identities = 64/281 (22%), Positives = 119/281 (42%), Gaps = 18/281 (6%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 +++ YGV+ ++ + VS++S++L+ L F+I++ + +K+ + E +I+ Sbjct: 1 MHIVYGVNEAFMPILAVSLSSLLLHAEGEALHFHILSLGIEEESKEKLRQYVETEGQKIS 60 Query: 88 LYRINTDKLQCLPC-----TQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQL 142 Y + + T +S+A RLF L T+ + LYLDAD V I L Sbjct: 61 FYDLEEKLSEWKEKLPALFTGKFSKATLLRLFIPSTLPETITKALYLDADTVVLQSILSL 120 Query: 143 LHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALS 202 HL L + + EP K Y+N+GV+ ++L + + EK L Sbjct: 121 YHLRLGDKLLGMAP--EPSIYKKHKEFLSLAEESPYYNAGVMLMNLSLLREEGMEEKCLR 178 Query: 203 ILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKD---KTHQNYKKL---- 255 K+ + DQD++N++ KG LP+ +N + Y++L Sbjct: 179 YYQMKEGQLPFNDQDILNMVCKGRIRSLPQRFNFFSNYAYARYSALCRFSPWYQELESKK 238 Query: 256 ----ITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSP 292 +++H+ G +PW + + + E SP Sbjct: 239 SYSQAKAHPVIVHFAGDERPWREGNHNYYRRAFDYYAEESP 279 >UniRef50_A7A7B4 Putative uncharacterized protein n=2 Tax=Bifidobacterium adolescentis L2-32 RepID=A7A7B4_BIFAD Length = 1009 Score = 215 bits (548), Expect = 2e-54, Method: Composition-based stats. 
Identities = 64/316 (20%), Positives = 133/316 (42%), Gaps = 23/316 (7%) Query: 10 DKVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYN 68 + A + +I + V + D NY+ + ++ S + N + H D ++ Sbjct: 647 NPEPAEQLKPLDITDKPIVPVVFAADDNYVPQLTTTVYSAMKNADPHYFYDVTVLQRNIA 706 Query: 69 DGFFQKIAKLAEQN-QLRITLYRINTDKLQCLPCTQ--VWSRAMYFRLFAFQLLGLTLDR 125 +++ +Q + + ++ + T S Y+R ++L D+ Sbjct: 707 WDKQERLRGFFKQFPNMNLRFTNVDRELAGYDLSTNNAHISVETYYRFLIQKVLPF-YDK 765 Query: 126 LLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVE---PMQEKAVSRLSDPELL------G 176 +LYLD+D++ GDI++L ++ L G + ++D++ + K R+ + + Sbjct: 766 VLYLDSDIIINGDIAKLYNIDLQGKMLGAIRDIDFLANLNVKHGKRMGYAQTVLKMKNPY 825 Query: 177 QYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNT 236 YF +GV+ L+ K + ++ L+ + D + Y DQDV+N +G L+LP E+N Sbjct: 826 DYFQAGVLVLNTKAMREHYTIKQWLTYASNPD--FIYNDQDVLNAHCEGNVLYLPWEWNV 883 Query: 237 IYTIKSELKDKTHQNYKKLI------TESTLLIHYTGATKPWHKWAIYPSVKYYKIALEN 290 ++ + + Q + ++HY G KPW + Y+K A E Sbjct: 884 VHDCGGRVGNLFVQAPNDIYDAYMKSRNDPQIVHYAGFQKPWTDPDCDFASMYWKYARE- 942 Query: 291 SPWKDDSPRDAKSIIE 306 +P+ + + E Sbjct: 943 TPFYERLLKRVVKANE 958 >UniRef50_B7GNT4 Glycosyl transferase, family 8 n=5 Tax=Bifidobacterium RepID=B7GNT4_BIFLI Length = 1013 Score = 214 bits (546), Expect = 3e-54, Method: Composition-based stats. 
Identities = 70/333 (21%), Positives = 138/333 (41%), Gaps = 23/333 (6%) Query: 10 DKVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYN 68 + A + + ++ + V + D NY+ + ++ S + N + D ++ Sbjct: 651 NPEPAEELKPLDVFDKPIVPVVFAADDNYVPQLTTTVYSAMKNADPSYFYDVVVLQQDIA 710 Query: 69 DGFFQKIAKLAEQN-QLRITLYRINTDK--LQCLPCTQVWSRAMYFRLFAFQLLGLTLDR 125 +++ + EQ + + + + S Y+R QLL D+ Sbjct: 711 GDKQERMWRFFEQFPNMSLRFLNVKRELSGYDLSTNNAHISIETYYRFLIQQLLP-NYDK 769 Query: 126 LLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVE---PMQEKAVSRLSDPELL------G 176 +LYLD+D++ GDI++L + L + V+D++ + K R+S + + Sbjct: 770 VLYLDSDIIIVGDIAKLYDIDLQDNLLGAVRDIDFLGNLNVKHGKRMSYAKDVLKMKNPY 829 Query: 177 QYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNT 236 YF +GV+ L+ K + E+ L+ + + Y Y DQDV+N +G L+LP E+N Sbjct: 830 DYFQAGVLVLNTKGMRNRYSIEQWLTYASNPN--YIYNDQDVLNAYCEGKVLYLPWEWNV 887 Query: 237 IYTIKSELKDKTHQNYKKLI------TESTLLIHYTGATKPWHKWAIYPSVKYYKIALEN 290 ++ + + Q + + +IHY G KPW S Y++ A E Sbjct: 888 VHDCGGRVGNLFTQAPNDVYDAYVKSRSNPQIIHYAGYQKPWVDPDCDYSSIYWRYARE- 946 Query: 291 SPWKDDSPRDAKSIIEFKKRYKHLLVQHHYISG 323 +P+ + + E + + L +H G Sbjct: 947 TPFYERLIKRVVLANEPQIPEEVFLPKHERAVG 979 >UniRef50_B8G232 Glycosyl transferase family 8 n=2 Tax=Desulfitobacterium hafniense RepID=B8G232_DESHD Length = 280 Score = 213 bits (544), Expect = 5e-54, Method: Composition-based stats. 
Identities = 59/265 (22%), Positives = 115/265 (43%), Gaps = 5/265 (1%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 +N+ ++++Y+ + V +TS++ +N Y+ + F +I + + ++ ++ Sbjct: 1 MNILVTLNSSYVKQLMVMLTSLLDSNPGEQFTVYVAHSAMSKEDFARIDQAIDSSRCKVE 60 Query: 88 LYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGL 147 +++ + L P T + + MY+R+FA L L+R+LYLD D+V + +L + Sbjct: 61 GIKLSDEGLSKAPITSRYPKEMYYRIFAVNYLPDHLERILYLDPDLVVINPLKELYTIDF 120 Query: 148 NGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSK 207 G A V+ + +K + Y NSGV+ ++L + + + Sbjct: 121 QGNFFAAASHVKELLKKLNHVRLNMAEDSTYVNSGVMMMNLSLLRQEQDVHEVYQYIEEY 180 Query: 208 DNVYKYPDQDVMNVLLKGMTLFLPRE-YNT---IYTIKSELKDKTHQNYK-KLITESTLL 262 + PDQDV+N + TL + + YN Y + + + +T + Sbjct: 181 KHRLFLPDQDVLNGVYSDRTLTVDAKIYNLSERYYALYNLNPKYWDAKIDLDWVRSNTAI 240 Query: 263 IHYTGATKPWHKWAIYPSVKYYKIA 287 IHY G KPW I +YK Sbjct: 241 IHYCGRNKPWKDNYIGDLNVFYKNY 265 >UniRef50_B2ISC2 Glycosyl transferase, family 8 n=2 Tax=Streptococcus pneumoniae RepID=B2ISC2_STRPS Length = 401 Score = 213 bits (542), Expect = 9e-54, Method: Composition-based stats. 
Identities = 69/316 (21%), Positives = 118/316 (37%), Gaps = 37/316 (11%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLY 89 + G D +Y+D V +I SI N+ + FY+ +FQ + K I Sbjct: 5 IVLGADNHYMDKVETTIKSICSKNKE--VKFYVFNSDLPTEWFQLMDKRLSVLGSEIVNV 62 Query: 90 RINTDKLQCLPC-TQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLN 148 ++ + T S A Y R F ++ R LYLD+D++ D++ L L+ Sbjct: 63 KVTESLINQFHLPTPHLSSATYLRYFIPTIVFEK--RALYLDSDIIVTADLTSLFEFPLD 120 Query: 149 GAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKD 208 G A V D+ E FNSGV+ +D +W + + + L++ + Sbjct: 121 GCPLAAVPDIPNTSEG--------------FNSGVLLIDTDRWREDDIQNQLLNLTIKHH 166 Query: 209 NVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTGA 268 Y DQ+++N+L K L YN + + L +IHYT Sbjct: 167 EHV-YGDQEILNMLFKDRWKKLSLSYNLQVGYDTYRHSLGDNEWYHLFEGIPNIIHYTTQ 225 Query: 269 TKPWHKWAIYPSVKYYKIA---------LENSPWKDDSPRDAKSIIEFKKRYKHLLVQHH 319 KPW + + L+N +++ + K I + + Sbjct: 226 NKPWSHYRFNRFRDIWWFYYGLNWNDILLDNQILQENFEKLIKPITCHASIFTN------ 279 Query: 320 YISGIIAGVCYLCRKY 335 +G I G+ YL + Sbjct: 280 --TGDIEGLPYLLEQL 293 >UniRef50_C5ZV11 Glycosyl transferase n=2 Tax=Helicobacter canadensis MIT 98-5491 RepID=C5ZV11_9HELI Length = 397 Score = 213 bits (542), Expect = 1e-53, Method: Composition-based stats. 
Identities = 75/331 (22%), Positives = 143/331 (43%), Gaps = 32/331 (9%) Query: 29 NVAYGVDANYLDGVGVSITSIVLNNRHIN---LDFYIIADVYNDGFFQKIA----KLAEQ 81 NV ++ NY+ V ITSI+ N + +F+++ D + + + +L++ Sbjct: 3 NVVLNLNENYVPYAAVLITSIIQNTQSSGGGGYNFHLLMDSISQENTKNLENLISELSKI 62 Query: 82 NQLRITLYRINTDKLQCLPC-TQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDIS 140 +T+Y ++ + T + Y+RL L L++ R +YLD D++ GD+ Sbjct: 63 YPCTLTIYILDDQLFREYSMPTLNGNYLAYYRLKIGSALPLSIKRCVYLDVDMIVLGDLR 122 Query: 141 QLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDP--ELLGQYFNSGVVYLDLKKWADAKLTE 198 +L + L G + VV + + + + G YFNSG++ +DL W + + Sbjct: 123 ELFEVDLQGKICGVVMEHHSQKIYKPKNQAYKPINITGSYFNSGMLLVDLDLWRQENIED 182 Query: 199 KALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSEL---------KDKTH 249 +A I + Y + DQD++N++L G T + E+N + + K + Sbjct: 183 RAFEIGKNYH--YSFHDQDILNIVLSGKTHKVGIEWNLMVCVYYRAICKDEKGRDKLPYY 240 Query: 250 QNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWK--DDSPRDAKSIIEF 307 + + ++HY TKPW+ IY + Y+ L+ W D +P + +++ Sbjct: 241 RKDFNSALRNPKILHYFTHTKPWNNAKIY--LDYHNKFLDQYWWDMVDQTPIFKEKLLQL 298 Query: 308 KKRYKHLLVQHHYISGIIAGVCYLCRKYYRK 338 K + L V Y +YY+K Sbjct: 299 KPQADSALA-------FQCLVGYKLLRYYQK 322 >UniRef50_B6G807 Putative uncharacterized protein n=2 Tax=Collinsella RepID=B6G807_9ACTN Length = 276 Score = 212 bits (540), Expect = 1e-53, Method: Composition-based stats. 
Identities = 53/266 (19%), Positives = 109/266 (40%), Gaps = 2/266 (0%) Query: 24 TSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQ 83 ++V D YL + + S+ +N+ + +++ + +++ + Sbjct: 2 KQHAMDVIVTCDEGYLGPLRTMLYSLRASNQGAQVRIWLLHKGISLPALEELERFCSVLG 61 Query: 84 LRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLL 143 L I ++ L C++ + + MY+RL A ++ ++R LYLD D++ + L Sbjct: 62 LAIEPVTVDRVLLDGAKCSERYPQEMYYRLLAPSIIKAPIERALYLDPDILVINPLDDLF 121 Query: 144 HLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSI 203 + L+G A ++ + + YFN+GV+ D+ + + ++ S Sbjct: 122 EIDLHGNAFAAASHLDAVHPATALNKARLSTSSDYFNTGVILFDIARARKSICVDELFSY 181 Query: 204 LMSKDNVYKYPDQDVMNVLLKGMTLFLPRE-YNT-IYTIKSELKDKTHQNYKKLITESTL 261 + + + V +PDQD+ N L +TL +P E +N + + E T Sbjct: 182 VKAHEQVMLFPDQDLFNSLFGAVTLRIPDEIWNYDARKYPDNIIRTWGTATLDWVMEHTA 241 Query: 262 LIHYTGATKPWHKWAIYPSVKYYKIA 287 ++H+ G KPW YK Sbjct: 242 ILHFCGKNKPWAPGYRGQFASLYKHY 267 >UniRef50_B7C7N8 Putative uncharacterized protein n=2 Tax=Firmicutes RepID=B7C7N8_9FIRM Length = 416 Score = 212 bits (539), Expect = 2e-53, Method: Composition-based stats. 
Identities = 58/259 (22%), Positives = 107/259 (41%), Gaps = 20/259 (7%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLY 89 + D +Y+D + +I SI +N+ N+ FYI+ + +F+ + K I Sbjct: 22 IVLACDNSYMDKLETTIKSICAHNK--NIKFYILNEDLPIEWFRLMTKRLSYFNSEILNI 79 Query: 90 RINTDKLQCLPC-TQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLN 148 +++ D + C ++ + YFR + +++LYLD D++ + L +L L Sbjct: 80 KVSGDSFKKFRCPSEHINYQSYFRYLIPDYVSE--EKVLYLDCDIIVTESLDGLFNLDLK 137 Query: 149 GAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKD 208 A V D+ + FNSGV+ ++ K W + + K + + + Sbjct: 138 NYPVAAVPDLPTTNDG--------------FNSGVLLINNKYWRENDILNKLIKLTVEYH 183 Query: 209 NVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTGA 268 Y DQ ++N+L K LP YN S+ + + KL +IHYT Sbjct: 184 EKV-YGDQGILNILFKDKWYRLPLTYNLQVGSDSQEHMIGNMEWYKLFDGIPKVIHYTYT 242 Query: 269 TKPWHKWAIYPSVKYYKIA 287 KPW + + + + Sbjct: 243 HKPWLMYNMTRFKEVWWFY 261 >UniRef50_B6GCA0 Putative uncharacterized protein n=1 Tax=Collinsella stercoris DSM 13279 RepID=B6GCA0_9ACTN Length = 990 Score = 212 bits (539), Expect = 2e-53, Method: Composition-based stats. 
Identities = 55/301 (18%), Positives = 118/301 (39%), Gaps = 23/301 (7%) Query: 25 SECLNVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKIAKLAEQN- 82 + + V + D NY+ + +I S++ N + + D ++ + + + Sbjct: 645 RQIVPVVFASDNNYVPMLTTTIHSMLSNASNNYRYDITVLHRDISGANQAIMREFFSSYD 704 Query: 83 QLRITLYRIN--TDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDIS 140 + + ++ +K S Y+R LL D++LYLD+D++ +GD+S Sbjct: 705 NVNLGFCDVSQVIEKYNLTTNNPHISVETYYRFLIQDLLPY-YDKVLYLDSDLIIRGDVS 763 Query: 141 QLLHLGLNGAVAAVVKDVEPMQEKAVSR---------LSDPELLGQYFNSGVVYLDLKKW 191 +L L ++ A D++ + + R + + YF +GV+ L+ + Sbjct: 764 ELFATDLGDSLLAAAHDIDFVANVNMKRGDRFAYAKEVLGMKDPYSYFQAGVLVLNTRAM 823 Query: 192 ADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSE------LK 245 E+ L D+ + Y DQDV+N +G ++L +N + Sbjct: 824 RSRHTMEEWLEFAS--DDRFIYNDQDVLNAHCEGEVVYLDYSWNVMIDCFGRINKVFTFA 881 Query: 246 DKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSII 305 + + ++HY G KPW Y++ A E +P+ + + + ++ Sbjct: 882 PAYMFDAFIESRSNEKIVHYAGFEKPWKLAGCDRGELYWRYARE-TPFYESLLQHSIAVN 940 Query: 306 E 306 Sbjct: 941 R 941 >UniRef50_Q5WI33 Lipopolysaccharide glycosyltransferase n=6 Tax=Firmicutes RepID=Q5WI33_BACSK Length = 274 Score = 211 bits (538), Expect = 2e-53, Method: Composition-based stats. 
Identities = 61/265 (23%), Positives = 113/265 (42%), Gaps = 2/265 (0%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 +N+ ++A+YL + V +TS+ +NN H + Y+I + Q + + + Sbjct: 1 MNILVTLNAHYLKPLQVMLTSLFMNNAHEDFTIYLIHSSIPEKQLQLLEQFVCHQGHSLV 60 Query: 88 LYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGL 147 + + P + +S MY+RL A++ L LDR+LYLD D++ I L + Sbjct: 61 IVETDKTLFANAPVVKHYSSEMYYRLLAYRFLPTELDRILYLDPDILVLNPIRPLYEANI 120 Query: 148 NGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSK 207 + + A + ++ + + Y+NSGV+ ++L K + + + + Sbjct: 121 DSYLYAAAQHSFINIQEINKFRLNAYEMDAYYNSGVLLMNLAKQRETMDINDIFAYVETY 180 Query: 208 DNVYKYPDQDVMNVLLKGMTLFLPRE-YNTIYTIKSELKDKTHQNYK-KLITESTLLIHY 265 N PDQDV+N L + YN K K+ + + + T+++H+ Sbjct: 181 RNRLVLPDQDVLNALYSPQIKNVDERLYNYDARYYRYYKLKSGGRFDIDAVLQQTVILHF 240 Query: 266 TGATKPWHKWAIYPSVKYYKIALEN 290 G KPWHK YK + Sbjct: 241 CGKKKPWHKNYNGKFHSLYKHYEKQ 265 >UniRef50_A4VVV8 Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases n=7 Tax=Firmicutes RepID=A4VVV8_STRSY Length = 334 Score = 211 bits (537), Expect = 3e-53, Method: Composition-based stats. 
Identities = 61/331 (18%), Positives = 125/331 (37%), Gaps = 25/331 (7%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRI 86 +N+ + ++ ++ V + SI+ + FY+ +D + + + + ++ Sbjct: 6 VNILFTLNDAFVPQVAACMGSIMRTLDEDDTCHFYLFSDGISQQNKENLHQFVTDGGNKL 65 Query: 87 TLYRINT--DKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLH 144 T+ + T W+ + RL +LL +DR++YLD D + +I +L Sbjct: 66 TIVELENLESYFDFEVDTNGWASVVLARLLVDKLLPEEVDRIIYLDGDTLVLENIRELWE 125 Query: 145 LGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSIL 204 + L G V + + E+ R Y N+GV+ +DLK+W + Sbjct: 126 VDLEGKVLGMCPEPTASSER---REGLNLGTYTYHNAGVLLIDLKRWRSKSIGTIIFDYY 182 Query: 205 MSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKL--------- 255 K+ DQD +N LK L YN + I +T + + Sbjct: 183 KEKNGELFANDQDALNGALKEEIKTLSITYNY-FNIFDVYPYRTLEKLSRPSTFISKEEF 241 Query: 256 --ITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKH 313 I + ++H+ G +PW + + Y AL +PW+ F + Sbjct: 242 VKIRKQPRIVHFLGEERPWRIGNKHRFREDYVSALNQTPWRGTQFESGWQFYFFCFNLFN 301 Query: 314 LL------VQHHYISGIIA-GVCYLCRKYYR 337 ++ +++ I+ +I + Y + + Sbjct: 302 MVMKPFPMLRYKIITVLIPVFMKYRKIRLQK 332 >UniRef50_B2UQ54 Glycosyl transferase family 8 n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UQ54_AKKM8 Length = 328 Score = 210 bits (534), Expect = 7e-53, Method: Composition-based stats. 
Identities = 59/278 (21%), Positives = 112/278 (40%), Gaps = 12/278 (4%) Query: 24 TSECLNVAYGVDANYLDGVGVSITSIVLNN-RHINLDFYIIADVYNDGFFQKIAKLAEQN 82 V D + + V++ S++ Y+++D + + + +LA Sbjct: 6 KKNEFAVVLASDNRGILPLSVTVFSLLNTAGPETFYKIYVLSDGIDGENWASVERLAAPF 65 Query: 83 QLRITLYRIN-TDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQ 141 R+ ++ + P T+ W + R+F +LL +LYLD DV+ D+++ Sbjct: 66 DCRLEFIDVSGILEKHDFPHTEQWPVPAWGRVFIPELLKEERGNILYLDIDVLVCRDLTE 125 Query: 142 LLHLGLNGAVAAVVKDVEPMQ-EKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKA 200 L ++G VV + RL P YFNSGV+ +++ + + L Sbjct: 126 LFRTNMDGKAIGVVFENFSRPGSHFNERLEMPLTCTGYFNSGVLLMNVDVFREKNLVRAV 185 Query: 201 LSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSE-LKDKTHQNYKKLITE- 258 L ++ + PDQD +N L +T+ L +N + LK+ + + + +T Sbjct: 186 LDYAVTHRDRLTCPDQDALNGALCELTVPLHPRWNWHDGLTRRILKNDPREQFWRGVTPR 245 Query: 259 -------STLLIHYTGATKPWHKWAIYPSVKYYKIALE 289 ++HY G KPW Y +Y ++ E Sbjct: 246 QAVEAALEPGILHYQGVHKPWRYNWRYEGERYERVMRE 283 >UniRef50_A0KQP2 Glycosyl transferase, family 8 n=2 Tax=Aeromonas RepID=A0KQP2_AERHH Length = 366 Score = 209 bits (532), Expect = 1e-52, Method: Composition-based stats. 
Identities = 63/280 (22%), Positives = 139/280 (49%), Gaps = 23/280 (8%) Query: 24 TSECLNVAYGVDANYLDGVGVSITSIVLNNRH-INLDFYIIADVYNDGFFQKIAKLAEQN 82 + ++ A+ +D ++ + I S+ + H + L +++A ++ K++KL E Sbjct: 1 MRKIIHSAFCIDDSFAVHLAALIHSLGKHLSHDLQLQCHVLA-RLSETNKFKLSKL-ESE 58 Query: 83 QLRITLYRINTDKLQCLPCTQVWSR----AMYFRLFAFQLLGLTLDRLLYLDADVVCKGD 138 L I Y N + +P + +++ Y+R +L ++D++L++D+D++ GD Sbjct: 59 NLVIKFYD-NLPDYKDIPISNLYNNRLNEVTYYRFAIPHILK-SIDKVLFIDSDMIALGD 116 Query: 139 ISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTE 198 IS L + + A+ AVV D +K + G+YFN+G + ++L KW ++E Sbjct: 117 ISPLWSIDMGDAIVAVVSDHILGCDKKKQLMRGI-SSGKYFNAGFMLMNLDKWRAKNISE 175 Query: 199 KALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITE 258 +AL +L+ + +++ DQD +N++L+ T+++ ++N N+ Sbjct: 176 QALRLLIENNG-FEHNDQDALNIVLENKTVYIDNKWNAQ------------PNHLAQNNF 222 Query: 259 STLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSP 298 +L+H+ G KPWH ++ +P Y ++ + + ++ Sbjct: 223 LPILVHFCGQEKPWHIYSNHPFKGSYLVSRRETDYANEPL 262 >UniRef50_C8W7U9 Glycosyl transferase family 8 n=2 Tax=Atopobium RepID=C8W7U9_ATOPD Length = 1014 Score = 208 bits (531), Expect = 2e-52, Method: Composition-based stats. 
Identities = 62/331 (18%), Positives = 130/331 (39%), Gaps = 30/331 (9%) Query: 7 IEIDKVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIAD 65 K I + + V + D NY+ + ++ S++ N + + D ++ Sbjct: 652 EPRQKFIPLFEEKPEIASQNVVPVVFAADNNYVPILTCAMGSMLENADPNRYYDVVVLNT 711 Query: 66 VYNDGFFQKIAKLAEQN-QLRITLYRI--NTDKLQCLPCTQVWSRAMYFRLFAFQLLGLT 122 + + K + RIT Y + + S YFR A +L Sbjct: 712 NIGGSKQELVKKFFSRYKNARITFYNVWRMVKDYKLDTNNAHISVETYFRFLAQDILSA- 770 Query: 123 LDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQE---------KAVSRLSDPE 173 D+++YLD+D+V G++++L + + + A D++ + K + + + Sbjct: 771 YDKVVYLDSDLVVNGNVAELYDVRIGNNLIAATLDIDYLANLNIRGGDRMKYSLDVLNLK 830 Query: 174 LLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPRE 233 YF +GV+ + + + L I + ++ Y DQD++N +G L+LP + Sbjct: 831 NPYAYFQAGVMVFNTAELRRYHTVPEWLRIASNP--IFIYNDQDILNSECQGRVLYLPAD 888 Query: 234 YNTIYTIKSELKDKTHQ------NYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIA 287 +N + I ++ + + + ++H+ GA KPW + Y+ Sbjct: 889 WNVTHNIFGRAEELYPMAPNSVFDDYQAARRAPKIVHFAGAIKPWQNASCDM-ASYFWKY 947 Query: 288 LENSPWKD-------DSPRDAKSIIEFKKRY 311 N+P+ + S R+ + EF +R Sbjct: 948 ARNTPFYEVIIQDMVPSARNDADVTEFHERA 978 >UniRef50_UPI0001A45357 glycosyl transferase family protein n=1 Tax=Neisseria subflava NJ9703 RepID=UPI0001A45357 Length = 264 Score = 208 bits (529), Expect = 3e-52, Method: Composition-based stats. 
Identities = 63/264 (23%), Positives = 105/264 (39%), Gaps = 13/264 (4%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 + + D Y + V + S+ +N +N FY++ + + + + + R+ Sbjct: 4 ITIVLAADTGYAEQVHTLMKSVCTHNTGVN--FYLMHNTFRKEWINYTNQKLAASGSRLN 61 Query: 88 LYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGL 147 +I D S A +FRL Q L +DR LYLD+D+V + L +L + Sbjct: 62 DVKIEMD-FSQYRRLSHISDAAFFRLMM-QHLP--VDRALYLDSDMVVTQSLHDLFNLDM 117 Query: 148 NGAVAAVVKDVEPMQEKAVSRLSDPELL--GQYFNSGVVYLDLKKWADAKLTEKALSILM 205 G A V+D A + + P L YFNSG++ DL +W + E+ L Sbjct: 118 RGYPVAAVQD----SYLARTDWNHPTGLHTTPYFNSGMLLADLGQWRKHNIAEQLLQTAA 173 Query: 206 SKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHY 265 + D Y DQ +N + + L L +N + + L + +IHY Sbjct: 174 TIDKTVPYGDQCFLNTVFQENWLQLEESWNYQTGARRFFQTYDLDEMFPLPDTTPPIIHY 233 Query: 266 TGATKPWH-KWAIYPSVKYYKIAL 288 T KPW + P + Y Sbjct: 234 TTLAKPWLCDYGKIPFEEIYWQYY 257 >UniRef50_C9PNX4 Family 8 glycosyl transferase n=1 Tax=Pasteurella dagmatis ATCC 43325 RepID=C9PNX4_9PAST Length = 285 Score = 207 bits (528), Expect = 4e-52, Method: Composition-based stats. 
Identities = 73/273 (26%), Positives = 120/273 (43%), Gaps = 14/273 (5%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 +N+ D + + V I S+ +N+ N+ FY++ Y +FQ + + I Sbjct: 12 MNIVLSADVQFSEQVKTLIKSVSYHNK--NVHFYLLNKDYPSEWFQILNQYLAYFGSNII 69 Query: 88 LYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGL 147 +++++ + P S A YFR QL LDR+LYLD DVV G ++++ + Sbjct: 70 DAKVDSEVISTFPTLDHISEASYFRYLLGQL---PLDRVLYLDCDVVVTGSLTEIYYTDF 126 Query: 148 NGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSK 207 + V+D + S P++ YFNSG++ +DL KW D + + + + Sbjct: 127 GDNMMYAVEDAF-LNIAPHSYKEFPDM-KPYFNSGMLLIDLNKWRDQNIENQLMDLTKQA 184 Query: 208 DNVYKYPDQDVMNVLLKGMTLFLPREYNTIYT----IKSELKDKTHQNYKKLITESTLLI 263 N+Y Y DQD MN++LKG L + YN + + YK L + +I Sbjct: 185 VNLY-YGDQDAMNIILKGKWQALDKIYNYQTGSLIAFIQHKMPEALEKYKDLQGQQPKVI 243 Query: 264 HYTGATKPW-HKWAIYPSVKYYKIALENSPWKD 295 HY KPW P Y + W+D Sbjct: 244 HYITRYKPWLLPEYDLPFRDQYWAYYQLE-WQD 275 >UniRef50_B2ISC6 Glycosyl transferase, family 2/glycosyl transferase family 8 n=8 Tax=Streptococcus pneumoniae RepID=B2ISC6_STRPS Length = 696 Score = 206 bits (524), Expect = 1e-51, Method: Composition-based stats. 
Identities = 63/278 (22%), Positives = 116/278 (41%), Gaps = 27/278 (9%) Query: 18 RLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAK 77 +L+ SE + + Y+D V +I SI +NR ++ FY+I + + + +++ K Sbjct: 293 QLSRQEESEKKAIVLAANYAYVDQVLTTIKSICYHNR--SIRFYLIHSDFPNEWIKQLNK 350 Query: 78 LAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKG 137 E+ I R+ ++++ C S ++ R F + D+ LYLD D+V Sbjct: 351 RLEKFDSEIINCRVTSEQISCYKSD--ISYTVFLRYFIADFVQE--DKALYLDCDLVVTK 406 Query: 138 DISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLT 197 ++ L L A V+D + FN+GV+ ++ W + Sbjct: 407 NLDDLFATDLQDYRLAAVRDFG----------GRAYFGQEIFNAGVLLVNNAFWKKENMI 456 Query: 198 EKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYN--TIYTIKSELKDKTHQNYKKL 255 +K + + + DQ ++N+L + L L +YN I+ ++ + Q+Y Sbjct: 457 QKLIDVTNEWHDKVDQADQSILNMLFEHKWLELDFDYNHIVIHKQFADYQLPEGQDY--- 513 Query: 256 ITESTLLIHYTGATKPWHKWA--IYPSVKYYKIALENS 291 +IHY KPW A Y V +Y LE + Sbjct: 514 ----PAIIHYLSHRKPWKDLAAQTYREVWWYYHGLEWT 547 >UniRef50_B6VUC8 Putative uncharacterized protein n=1 Tax=Bacteroides dorei DSM 17855 RepID=B6VUC8_9BACE Length = 315 Score = 206 bits (524), Expect = 1e-51, Method: Composition-based stats. 
Identities = 68/315 (21%), Positives = 128/315 (40%), Gaps = 14/315 (4%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 +++ Y V +TS+ NN+ + + Y+ + +D + + L ++ ++ Sbjct: 2 ISILCNSSNEYAIHCKVMLTSLFENNKQNDKEVYVFSTSMSDENIKGLELLGQRYGTKVQ 61 Query: 88 LYRINTDKLQCLPCTQVW-SRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLG 146 + +++ KLQ LP + + A Y RLFA LL +++LLYLD D++ D+ L + Sbjct: 62 IIIVDSQKLQFLPIHFAYHNIACYLRLFAADLLPG-INKLLYLDCDIIVNSDLKALWDID 120 Query: 147 LNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMS 206 + A D+ E + E Y N+GV+ ++ W + + +K L + Sbjct: 121 ITDYAFAATHDL-TYCEPNFKKNLQLEENDTYINTGVMLINCDYWRNNNVAQKVLDYAIH 179 Query: 207 KDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNY--KKLITESTLLIH 264 + DQD +N ++G E+N E + Y I + +IH Sbjct: 180 NGDKMIAADQDALNATMQGSFKLFSEEWNVYPDYFYEKPNLYTNVYPILDEIRRNPKIIH 239 Query: 265 YTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPR-DAKSIIEFKKRYKHLLVQHHYISG 323 + KPW + +P Y + K + + +SI R KH L+ Sbjct: 240 FL-YVKPWFNYCNHPLRYLYGKYYAIAEGKPFILKRNKESIKRDIARLKHCLLD------ 292 Query: 324 IIAGVCYLCRKYYRK 338 G+ Y Y ++ Sbjct: 293 -FMGIKYYYHVYDKR 306 >UniRef50_C9RWX3 Glycosyl transferase family 8 n=8 Tax=Bacillaceae RepID=C9RWX3_GEOSY Length = 276 Score = 206 bits (524), Expect = 1e-51, Method: Composition-based stats. 
Identities = 64/267 (23%), Positives = 103/267 (38%), Gaps = 11/267 (4%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 V DANYL + V + S+ NNR FY++ + Q + + + + Sbjct: 2 FQVLVTTDANYLPPLRVLMHSLFCNNR-RPFTFYLLYSRIAEEEIQALGEFVRRQGHELV 60 Query: 88 LYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGL 147 ++ P + ++ MY+RL A L +DR+LYLD D+V + +L + Sbjct: 61 PIYVDPQLFHDAPVFRHYTVEMYYRLAAHLFLPPDVDRVLYLDPDIVAINPMDELYDMDF 120 Query: 148 NGAVAAVVKDVEPMQEK---AVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSIL 204 G + + + RL P G YFN+GV+ +++ + + Sbjct: 121 EGNLFIAAEHTHSTKVANLFNKLRLKTPNAKG-YFNTGVMMMNIAMMREHVRLADIYQFI 179 Query: 205 MSKDNVYKYPDQDVMNVLLKGMTLFLPR-EYNT---IYTIKSELKDKTHQNYKKLITEST 260 PDQDV+N L + YN Y L + H I E+T Sbjct: 180 RDNRFKLVLPDQDVLNGLYWDKIKPVDCYRYNYDARYYDFLQLLPNPKHD--LAWIEENT 237 Query: 261 LLIHYTGATKPWHKWAIYPSVKYYKIA 287 + IHY G KPW ++YK Sbjct: 238 VFIHYCGKEKPWKDNYKGELGRFYKRY 264 >UniRef50_C1QC00 LPS:glycosyltransferase n=1 Tax=Brachyspira murdochii DSM 12563 RepID=C1QC00_9SPIR Length = 332 Score = 205 bits (522), Expect = 2e-51, Method: Composition-based stats. 
Identities = 64/304 (21%), Positives = 123/304 (40%), Gaps = 22/304 (7%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNR-HINLDFYIIADVYNDGFFQKIAKLAEQNQLRI 86 +++ D NY +G +I SI+ N++ + F+++ + K+ L I Sbjct: 1 MDICLSADDNYAKYMGTTIASILSNSKEDEEIYFHLLDGGITEENKNKLLSLKNIKNCDI 60 Query: 87 TLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLG 146 Y +N + + +FRL L+ +D+LLYLD D + + +L + Sbjct: 61 IFYSVNNMNYK-------YDAPHFFRLNVPSLIP-NVDKLLYLDCDTIVLNSLKELFEID 112 Query: 147 LNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMS 206 ++ A +DV + + + YFNSG++ ++ K W D KL Sbjct: 113 ISNYYALACEDVFLNCIISFKNMHGLNVNDIYFNSGMLMINNKLWRDDKLENLFYDDYSK 172 Query: 207 KDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYT 266 N + DQDV+N ++KG + ++N + K +IHY Sbjct: 173 FGN-TGHADQDVLNRIIKGRVKIVDSKWNFL--------SHKKVYSKAPDISLVNIIHYA 223 Query: 267 GATKPWHK-WAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKK--RYKHLLVQHHYISG 323 G KPW + + + + + +PW ++ DA I+ +K Y+ L + + + Sbjct: 224 G-EKPWKETSSKAFFIDEFWKYYQLTPWCRENTLDAVKIMISQKVNDYEELKLNVNRVKF 282 Query: 324 IIAG 327 + Sbjct: 283 LGFY 286 >UniRef50_D1JY84 Putative uncharacterized protein n=1 Tax=Bacteroides sp. 3_1_33FAA RepID=D1JY84_9BACE Length = 312 Score = 205 bits (521), Expect = 3e-51, Method: Composition-based stats. 
Identities = 63/265 (23%), Positives = 122/265 (46%), Gaps = 6/265 (2%) Query: 25 SECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQL 84 S +++A+ V+ +Y + + VSI ++ NN L +I++D +D ++ KL Sbjct: 3 SSPMHIAFCVNDHYAEYILVSIKGLLENNSD-PLVIHILSDYISDKNTNRLKKLVGLYPN 61 Query: 85 RITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLH 144 I I D L+ W+ ++R+ ++L ++ R+LYLDAD + +I +L Sbjct: 62 AILDIVI-VDDLKLKDLKDTWTIYTWYRVLLPEILDASVHRVLYLDADTLVSENIEELFS 120 Query: 145 LGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSIL 204 L + G A D + + R E +Y +GV+ ++L W + + K + Sbjct: 121 LDMTGKAIAGTVDFQSKDKSTYQR-CGYEAEKEYVCAGVMMMNLDYWREHDIANKIIDWG 179 Query: 205 MSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYT-IKSELKDKTHQNYKKLITESTLLI 263 ++ +YPDQD +N + + M L LP +Y+ I + + + + + ES +I Sbjct: 180 RDYNDRIQYPDQDAINYICRDMKLLLPLKYDIIDGFFQDDYYFQNYPQELRECIESPAII 239 Query: 264 HYTGATKPW-HKWAIYPSVKYYKIA 287 HY G PW + + + ++ Sbjct: 240 HYAGQA-PWVVEISNHLLQDEWERY 263 >UniRef50_UPI0001AEC697 glycosyl transferase family protein n=1 Tax=Alteromonas macleodii ATCC 27126 RepID=UPI0001AEC697 Length = 361 Score = 204 bits (520), Expect = 3e-51, Method: Composition-based stats. 
Identities = 74/275 (26%), Positives = 136/275 (49%), Gaps = 23/275 (8%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 ++VA+ +D + VS+ SI+ N + ++ Y I + ++G +K+ L +N Sbjct: 2 ISVAFCIDDKFAPYAAVSVISILSNTKSF-VNIYFIGN-LSEGVREKLLTL--KNDRSAM 57 Query: 88 LYRINTDKLQCLPCTQVW----SRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLL 143 ++ + L +P + + ++ + R ++L LD+++YLDADV+ GDI +L Sbjct: 58 VFVAHNLPLSTMPLSDRYVERLNKITFVRYAIAEVL-TKLDKVIYLDADVLVCGDIKRLW 116 Query: 144 HLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSI 203 L + V D M +K LS YFN+GV+ +DLK W D ++ + LS Sbjct: 117 EQPLKKSYVGAVLDHSLMSQKRHITLSLK--SKSYFNAGVLLVDLKIWRDRRIFQ-YLSR 173 Query: 204 LMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLI 263 + ++Y DQDV+NV+L +L + N + K + + L++ Sbjct: 174 THNTRERWEYNDQDVLNVVLDEKVQYLGADMNVQTY-----------SLKHINIKEPLIV 222 Query: 264 HYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSP 298 H+TG KPWH +++P Y++ LE+ P+K++ Sbjct: 223 HFTGQEKPWHTSSVHPYKDQYRVLLESVPFKNNKL 257 >UniRef50_C1QC03 LPS:glycosyltransferase n=1 Tax=Brachyspira murdochii DSM 12563 RepID=C1QC03_9SPIR Length = 347 Score = 204 bits (519), Expect = 4e-51, Method: Composition-based stats. 
Identities = 64/284 (22%), Positives = 127/284 (44%), Gaps = 14/284 (4%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRH-INLDFYIIADVYNDGFFQKIAKLAEQNQLRI 86 +NV + + Y + +I S++ N + N++ YII++ N+ +KI L + + I Sbjct: 10 INVCFASNDAYAPYMSTAIASLLSNAKDDENINIYIISENINNSNKEKILSLKKIRECSI 69 Query: 87 TLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLG 146 + + + + S + +FRL L+ D+++YLD D++ + +L Sbjct: 70 DFIEPKEEIFKYISKYNMKSNSTWFRLSIPSLIP-NADKIVYLDGDMIINSSLRELFSDD 128 Query: 147 LNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMS 206 ++ A VV+DV ++ + + + +YFN+G + ++ K W + L EK + + + Sbjct: 129 MSDYYAYVVEDVMDKIDEVKAPIGFSKT-DKYFNAGFLMINNKLWIEDNLEEKFYNAVDT 187 Query: 207 KDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYT 266 + Y DQD++N LK F+ ++++ L +K+ +IH Sbjct: 188 MP-ILGYKDQDILNYCLKNRVKFIDKKWDF-------LDNKSCYKEISADINKINIIHCV 239 Query: 267 GATKPWHKWAIY-PSVKYYKIALENSPWKDDSPRDAKSIIEFKK 309 G KPW K + + +PW + P DA I +K Sbjct: 240 G--KPWKKECNVAFFADEFWKYYQLTPWFLERPIDAIQTILAQK 281 >UniRef50_Q65VF6 RfaJ protein n=1 Tax=Mannheimia succiniciproducens MBEL55E RepID=Q65VF6_MANSM Length = 309 Score = 203 bits (516), Expect = 8e-51, Method: Composition-based stats. 
Identities = 71/257 (27%), Positives = 116/257 (45%), Gaps = 15/257 (5%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKL------AEQ 81 +N+ + D NY + V I SI+ N ++ FYI+ ++ I L Sbjct: 1 MNIIFNCDENYAPYLSVVIKSILDNTT-LSTQFYILDFNISEESKSCIKNLIQNINKKNS 59 Query: 82 NQLRITLYRINTDKLQCLPCT-QVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDIS 140 Q I +I+ + QC P T S A Y RL L L++ +YLD D++ D+S Sbjct: 60 FQHSINFIKIDDNDFQCFPQTISYISSATYARLKVADYL-NELNKAIYLDIDIIVISDLS 118 Query: 141 QLLHLGLNGAVAAVVKDVE-PMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEK 199 +L H+ L + D + + R + Y N+GV+ L+LK + L +K Sbjct: 119 RLWHIDLADNLVGACLDPYIEYENQDYKRKIGLQDSQPYINAGVLLLNLKALREFNLYQK 178 Query: 200 ALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSEL----KDKTHQNYKKL 255 A+ N ++ DQD++N +LKG LFL YN ++ + K K + + Sbjct: 179 AIDWNKDYPN-IQFQDQDILNGVLKGKVLFLDSRYNFTVNHRNRIKLAHKGKLLLSSLEK 237 Query: 256 ITESTLLIHYTGATKPW 272 T+ ++HY G+ KPW Sbjct: 238 ATKPICILHYVGSHKPW 254 >UniRef50_Q3DNS6 Glycosyl transferase, family 8 n=5 Tax=Streptococcus agalactiae RepID=Q3DNS6_STRAG Length = 401 Score = 203 bits (516), Expect = 8e-51, Method: Composition-based stats. 
Identities = 61/260 (23%), Positives = 110/260 (42%), Gaps = 19/260 (7%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLY 89 VA VD+NYLD V+I SI + NR N+ FY+ + + I + E ++ Sbjct: 5 VALAVDSNYLDKALVTIKSICVYNR--NITFYLFNQDTPVEWVRNINRKLEPLGSKLINV 62 Query: 90 RINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNG 149 +I T + + +FRLF + + R+LYLD+D++ ++ L L G Sbjct: 63 KIYN--YDIAHLTTFLTVSTWFRLFLADYIPSS--RVLYLDSDIIVNTNLDYLFELDFKG 118 Query: 150 AVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDN 209 A VKD +E FN+G++ +L+ W + LT+ L Sbjct: 119 YYLAAVKDPHKNEEGG-------------FNAGMLLANLELWREDGLTKTLLKTAEELHR 165 Query: 210 VYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTGAT 269 V K DQ ++N++ L L + +N + Y + + +IH+ + Sbjct: 166 VVKTGDQSILNIVCHNRWLSLNKTWNFQTYDVVSRYNHRSYLYLNIENRTPNIIHFLTSD 225 Query: 270 KPWHKWAIYPSVKYYKIALE 289 KPW++ ++ + + + Sbjct: 226 KPWNENSVARFRELWWYYFQ 245 >UniRef50_Q2K5X3 Galactosyltransferase protein n=13 Tax=Rhizobium RepID=Q2K5X3_RHIEC Length = 333 Score = 203 bits (516), Expect = 9e-51, Method: Composition-based stats. 
Identities = 64/272 (23%), Positives = 103/272 (37%), Gaps = 20/272 (7%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLY 89 V D N L ++ S+ N + +++F ++ ++ + + I + Sbjct: 38 VIVCSDVNMLPAACCTLLSVKRNLTNADVEFLLLGIDLKPHEVAEVENFGRLHGMAIRVL 97 Query: 90 RINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNG 149 T L WS A RL+ + + ++RLLYLDADV+ + +L L G Sbjct: 98 PYETPD-TGLQARGRWSAATLARLYMDRDIPDHIERLLYLDADVLAVAPVDELFTLDFQG 156 Query: 150 AVAAVVKD---VEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMS 206 A V D P + A R G+YFN+GV+ D L + I Sbjct: 157 KALAAVDDYVMAFPEKSGARQRKIGMGEGGRYFNAGVLLFDWSACRARGLFPRTREIFKE 216 Query: 207 KDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYT 266 + ++++ DQD +NV G L L +NT + + + H+T Sbjct: 217 RSHLFENNDQDALNVTFDGDWLVLDPRWNTQTGLL-------------PFVDRPAIFHFT 263 Query: 267 GATKPWH---KWAIYPSVKYYKIALENSPWKD 295 G KPW W Y L N+PW Sbjct: 264 GRKKPWQANVPWVHRRMANRYADDLRNTPWAS 295 >UniRef50_A3CM53 Glycosyltransferase, putative n=9 Tax=Bacteria RepID=A3CM53_STRSV Length = 1074 Score = 202 bits (514), Expect = 1e-50, Method: Composition-based stats. 
Identities = 64/270 (23%), Positives = 108/270 (40%), Gaps = 29/270 (10%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLY 89 + D Y + V +I SI+ N+ N+ Y+ +D +F+ +L EQ + Sbjct: 4 IVLVGDQAYQEQVSTTIKSILYYNK--NVKIYVFNQGLSDEWFRDFNELVEQLDSELVNI 61 Query: 90 RINTDKLQ-CLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLN 148 ++ + S A Y R F Q + R+LYLD+D+V D+ L + L Sbjct: 62 SLDQVTISPEWLTQDHISSATYARYFIPQFVAE--GRVLYLDSDLVVNRDLQPLFDIPLE 119 Query: 149 GAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKL-------TEKAL 201 G + A V D G FN+GV+ +D + W + +L T++ + Sbjct: 120 GKLVAAVGDAG----------------GYGFNAGVLLIDNRSWKERELQESFIKETDRIM 163 Query: 202 SILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTL 261 ++ S DQ V+N +L L L + YN + + N + + L Sbjct: 164 GLVQSGQMEDFNGDQTVLNHVLAQDWLPLDKIYNLQVG-HDLVAFYSGWNGHFELDQEPL 222 Query: 262 LIHYTGATKPWHKWAIYPSVKYYKIALENS 291 +IHYT KPW+ Y + + S Sbjct: 223 IIHYTTFRKPWNSEVSYRYRQLWWDFQALS 252 Score = 197 bits (500), Expect = 6e-49, Method: Composition-based stats. 
Identities = 64/293 (21%), Positives = 121/293 (41%), Gaps = 30/293 (10%) Query: 5 PAIEIDKVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIA 64 P + K+++ + V +A Y + V +I SIV +NR + FY+I Sbjct: 387 PQEMVRKLRSLMKKEKPQAFRA---VVLAANAAYSEQVLTTIKSIVCHNRF--IKFYVIN 441 Query: 65 DVYNDGFFQKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLD 124 + +F + K + +I R++ + +S ++ R F + D Sbjct: 442 SDFPTEWFVSMRKKLAKLDCQIVNARVDGSHISQYKTNIHYS--VFLRYFTATFV--EED 497 Query: 125 RLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVV 184 + LYLD D+V D+S++ + L V+D+ + Q FNSGV+ Sbjct: 498 QALYLDCDIVVTRDLSEIFAVDLGSYPLGAVRDLG----------GEVYFGEQIFNSGVL 547 Query: 185 YLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSEL 244 +++ W + + + + + + + DQ ++N+L + + LP YN I Sbjct: 548 LINVNYWRENDIAGQLIEMTDNLHDKVTQDDQSILNMLFENRWMELPFAYNCITL----- 602 Query: 245 KDKTHQNYKKLITESTLLIHYTGATKPWHKW--AIYPSVKYYKIALENSPWKD 295 T +Y+ +IHY KPW ++ +IY V ++ L+ W D Sbjct: 603 -HTTFSDYEPEKGLYPPVIHYLTERKPWKEYTQSIYREVWWFYQGLD---WSD 651 >UniRef50_D2RIJ7 Glycosyl transferase family 8 n=1 Tax=Acidaminococcus fermentans DSM 20731 RepID=D2RIJ7_ACIFE Length = 330 Score = 202 bits (514), Expect = 2e-50, Method: Composition-based stats. 
Identities = 70/307 (22%), Positives = 136/307 (44%), Gaps = 30/307 (9%) Query: 14 AWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQ 73 ++++ A + L++ V+ + GV +TSI NN+ + L+F++ D +D + Sbjct: 21 SFEYMTAENKKKDILHICCNVNDLFFKPAGVLLTSICENNKDLALNFHVFVDSCSDENKE 80 Query: 74 KIAKLAEQNQLRITLYRINTDKLQCLPCT-QVWSRAMYFRLFAFQLLGLTLDRLLYLDAD 132 + K AE+ LY+++ Q + +SR Y R+ +L +R LYLDAD Sbjct: 81 NLRKTAEKYGCNAYLYKMDMSIYQNFHIKVKRFSRVTYIRIVMPWVLRNVTNRYLYLDAD 140 Query: 133 VVCKGDISQLLHLGLNGAVAAV-VKDVEPMQEKAVSRLSDPELLGQ-YFNSGVVYLDLKK 190 +VC + + L V D R++ ++ G YF+ G++++++ + Sbjct: 141 MVCVKSLRVFFNYDLKDKAVGALVYDTP-------ERIAFLKMKGNVYFSDGLMWINVDE 193 Query: 191 WADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQ 250 W ++TE+ S + +K QD+MN++L G +P ++ + Sbjct: 194 WIKQRVTERVFSYQGADPARFKGQTQDLMNLVLDGNVQPIPALFHHM------------- 240 Query: 251 NYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKR 310 K + +LIHY+G KPW + + + ++ L+ SPW P + + Sbjct: 241 --DKDFSVDGILIHYSGRDKPW-EIVLDEDDELWRHYLDISPW----PSMPNPMPPKRPI 293 Query: 311 YKHLLVQ 317 Y H + Sbjct: 294 YYHSFKK 300 >UniRef50_C6LDU2 Glycosyl transferase, family 8 n=3 Tax=Firmicutes RepID=C6LDU2_9FIRM Length = 270 Score = 202 bits (513), Expect = 2e-50, Method: Composition-based stats. 
Identities = 63/259 (24%), Positives = 105/259 (40%), Gaps = 3/259 (1%) Query: 39 LDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLYRINTDKLQC 98 ++ V I SIV D YI+ + A E R+ + Sbjct: 1 MEHVLDCIRSIVRFPSEDGYDIYILHSDLQEQDQSDAAAQVEDGDTRLHFRFVEPSVFAS 60 Query: 99 LPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDV 158 P ++ + R +Y+R+FA LL +DR+LYLD D + + +L ++ G V Sbjct: 61 FPESERYPRLIYYRIFAASLLPPEMDRILYLDGDTLVINPLDELYNMDFEGNYFLACTHV 120 Query: 159 EPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDV 218 K E + Y NSGV+ ++LK+ + + E+ S + + PDQD+ Sbjct: 121 RKFLTKVNQYRLGMEEVSTYINSGVLLMNLKELREKQDFEEIASFVEKRGRYLTLPDQDI 180 Query: 219 MNVLLKGMTLFLP-REYNTIYTIKSELKDKTHQNY--KKLITESTLLIHYTGATKPWHKW 275 + L T L +YN + S + + + E+ ++IHY G KPW K Sbjct: 181 ITALYGNKTGILDTMKYNLSDRMISVYNTEPGHKRINLEWVRENAVVIHYYGKQKPWKKP 240 Query: 276 AIYPSVKYYKIALENSPWK 294 + +Y+ E P K Sbjct: 241 YLGMLDVFYRELKEEEPGK 259 >UniRef50_C8N7M8 Putative uncharacterized protein n=1 Tax=Cardiobacterium hominis ATCC 15826 RepID=C8N7M8_9GAMM Length = 618 Score = 201 bits (512), Expect = 3e-50, Method: Composition-based stats. 
Identities = 68/308 (22%), Positives = 123/308 (39%), Gaps = 28/308 (9%) Query: 10 DKVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYN 68 D + ++V D NY +G I SI+ + LD I+ + Sbjct: 262 DTTAKSWYAQPVQTDKPVVSVVIASDDNYTPHLGALICSILDHFPADKYLDLIILDGGIS 321 Query: 69 DGFFQKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLY 128 + + +L + I + D+ Q L +SRA ++RL +L+ D++LY Sbjct: 322 ALNRKLLMRLLPTH-ANIQFLELK-DEFQQLATHMHFSRATFYRLILDKLIPGR-DKVLY 378 Query: 129 LDADVVCKGDISQLLHLGLNGAVAAVVKDVE------------------PMQEKAVSRLS 170 +D D + DIS L L V D P + + Sbjct: 379 IDCDTIVLDDISTLFDTPLGDHAIGAVFDYIMHHFCLNDVLSIDTTGSLPAKRYLHDYVG 438 Query: 171 DPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFL 230 + +YF +GV+ +++K L+E +S L++K Y + DQD++N G ++L Sbjct: 439 LEDGWQRYFQAGVILFNMEKLRRLDLSEVMISDLLNK--RYWFLDQDILNKYFLGDVVYL 496 Query: 231 PREYNTIYTIKSELKDKTHQNYKKLIT--ESTLLIHYTG-ATKPWHKWAIYPSVKYYKIA 287 +N++ ++++ + +L T +IHY G TKPW+ +YY Sbjct: 497 DPRWNSVNSVQNIYQGLPATYIAELKTTETDPKIIHYAGFETKPWNN-RYAELAEYYFYY 555 Query: 288 LENSPWKD 295 L + W + Sbjct: 556 LRQTFWYE 563 >UniRef50_Q3D426 Glycosyl transferase, family 8 n=7 Tax=Streptococcus agalactiae RepID=Q3D426_STRAG Length = 401 Score = 201 bits (511), Expect = 3e-50, Method: Composition-based stats. 
Identities = 63/287 (21%), Positives = 116/287 (40%), Gaps = 14/287 (4%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLY 89 + G D Y D V +I SIV +N+H L YII + +F + EQ R+ Sbjct: 5 IVLGADFQYRDQVMTTIKSIVSHNQH--LTIYIINTDFPVEWFNILNHSLEQFDCRVKNI 62 Query: 90 RINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNG 149 I++D + +P S A +FR F L + +LYLD+DV+ +G + L + L Sbjct: 63 PISSDVFEGIPTLSHISVAGFFRWFIPIHLEEEI--VLYLDSDVIVRGSLDPLFDINLEE 120 Query: 150 AVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDN 209 + V D S L + FNSGV+ ++ W ++ + + K + Sbjct: 121 NLLGAVAD-------HFSTLYYGDTAPVSFNSGVMLINNSLWKKEEIYNSLMR-IADKGS 172 Query: 210 VYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITE-STLLIHYTGA 268 DQ+ +N+L + + + ++YN + + + +++HY Sbjct: 173 AVGVGDQEYLNILTQNRWIDIGKQYNVQIGQDVNINAYGRPDLYHFYDDCEPVIVHYNSQ 232 Query: 269 TKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHLL 315 KPW+K++ + W + K++ + +L Sbjct: 233 DKPWNKYSQSRYRSEWWYYFGLE-WSVIYAQQQKNLNRLTGKTLNLF 278 >UniRef50_Q3D427 Glycosyl transferase, family 8 n=8 Tax=Streptococcus agalactiae RepID=Q3D427_STRAG Length = 413 Score = 201 bits (511), Expect = 4e-50, Method: Composition-based stats. 
Identities = 67/279 (24%), Positives = 119/279 (42%), Gaps = 19/279 (6%) Query: 24 TSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQ 83 +A D Y + V I SI +N+ +DFYI+ D + +FQ + + Sbjct: 1 MKNRKAIALAADFGYQEQVKTIIKSICFHNQF--IDFYILNDDFPVEWFQMMEYHLSKMD 58 Query: 84 LRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLL 143 I+ +I ++++ + YFR F +++ D++LYLD D++ D++ + Sbjct: 59 CTISNTKIFNEEIKHFKFQKPMPYPTYFRYFIPEVIHE--DKVLYLDCDMIITSDLTSIF 116 Query: 144 HLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSI 203 L ++ A V+D L + + YFNSG++ ++ W + ++++ L Sbjct: 117 TLDISKYGVAAVRD---------DLLEEYDGKEDYFNSGLLLINNIFWREQGISQRLLDY 167 Query: 204 LMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITES--TL 261 +Y DQDV+N +L L L YN +T L + Q+ ++L Sbjct: 168 TRENQGALQYHDQDVLNDVLCDNWLELDETYNY-HTGADMLYNLFQQSERQLNRRKDLPK 226 Query: 262 LIHYTGATKPWHK-WAIYPSVKYYKIALENSPWKDDSPR 299 +IHYT ATKPW + W+D R Sbjct: 227 VIHYT-ATKPWKYLETSVRWRDIWWEYNRLE-WRDIFTR 263 >UniRef50_B1I7N1 Glycosyl transferase, family 8 n=8 Tax=Streptococcus pneumoniae RepID=B1I7N1_STRPI Length = 817 Score = 200 bits (509), Expect = 5e-50, Method: Composition-based stats. 
Identities = 58/268 (21%), Positives = 100/268 (37%), Gaps = 29/268 (10%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLY 89 + D NY+ + +I SI+ +NR + YI+ +F+K K+A I Sbjct: 5 IVLAGDRNYIRQLETTIKSILYHNRD--VKIYILNQDIMPDWFRKPRKIARMLGSEIIDV 62 Query: 90 RINTDK-LQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLN 148 ++ Q S Y R F + D++LYLD+D++ + +L + L Sbjct: 63 KLPEQTVFQDWEKQDHISSITYARYFIADYIQE--DKVLYLDSDLIVNTSLEKLFSICLE 120 Query: 149 GAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSI----- 203 A VKD + + FN+GV+ ++ KKW KL E+ + Sbjct: 121 EKSLAAVKDTDGIT----------------FNTGVLLINNKKWRQEKLKERLIEQSIVTM 164 Query: 204 --LMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTL 261 + + DQ + N +L+ L L R YN Q + + + Sbjct: 165 KEVEEGRFEHFNGDQTIFNQVLQDDWLELGRAYNLQVGHDIVALYNNWQEHL-AFNDKPV 223 Query: 262 LIHYTGATKPWHKWAIYPSVKYYKIALE 289 +IH+T KPW + + Sbjct: 224 VIHFTTYRKPWTTLTANRYRDLWWKFHD 251 >UniRef50_A5LNA9 Glycosyl transferase, family 8 n=2 Tax=Streptococcus pneumoniae RepID=A5LNA9_STRPN Length = 402 Score = 200 bits (509), Expect = 6e-50, Method: Composition-based stats. 
Identities = 58/262 (22%), Positives = 99/262 (37%), Gaps = 24/262 (9%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLY 89 + G D NY D + +I SI +NR L FYI + +F + K E+ I Sbjct: 7 IVLGADNNYRDKLETTIKSICYHNRD--LKFYIFNEDIPKEWFYLMEKRLEKLNCEILNI 64 Query: 90 RINTDKLQCLPCTQ-VWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLN 148 I+ +K++ YFR F + + DR +YLD D+V G+I+ L Sbjct: 65 EIDAEKVKYFSTPDEHIKYMTYFRYFIAEFVKE--DRAVYLDCDMVIHGNINPLFQKDFE 122 Query: 149 GAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKD 208 G V D FN+G++ +++ KW + + L + K Sbjct: 123 GNYIIAVPD---------------GWYKNIFNAGMMMVNVHKWKTDNICQNLLELTAEKH 167 Query: 209 NVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITES---TLLIHY 265 Y DQ V+N+L + + YN + + + + + +IH+ Sbjct: 168 QEI-YGDQGVLNLLFENKWKKVSPHYNFMVGLDTLGYWAQKPEWFLNSWDENYKPAIIHF 226 Query: 266 TGATKPWHKWAIYPSVKYYKIA 287 G KPW+ + + Sbjct: 227 EGKDKPWNDSLKTRYRELWWFY 248 >UniRef50_Q9L7A2 Putative glycosyl transferase n=1 Tax=Haemophilus ducreyi RepID=Q9L7A2_HAEDU Length = 269 Score = 200 bits (509), Expect = 7e-50, Method: Composition-based stats. 
Identities = 65/282 (23%), Positives = 120/282 (42%), Gaps = 16/282 (5%) Query: 21 NINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAE 80 +N E +N+ + +Y + + +I SI L+N+H + FY++ Y +F + Sbjct: 1 MLNPLEKMNIVLAANQSYSEYILTTIKSIYLHNKH--IRFYLLNRDYPTEWFDILNNKLR 58 Query: 81 QNQLRITLYRINTDKLQCLPCTQVWSR-AMYFRLFAFQLLGLTLDRLLYLDADVVCKGDI 139 + I ++ D ++ S +FR F + D+++YLDAD+V G + Sbjct: 59 KLNSEIIDIKVTNDTIKNFKTYSHISSDTTFFRYFISDFI--EQDKVIYLDADIVVNGSL 116 Query: 140 SQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEK 199 ++L ++ A VKD+ + + FN+G++ ++ KKW + +T+ Sbjct: 117 TELYQTDISNYFLAAVKDIISEKI---------YVNNHIFNAGMLLINNKKWREHNITQF 167 Query: 200 ALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITES 259 LS+ N DQ ++N++ K L L R YN + Y + + E+ Sbjct: 168 CLSLSEKYINSLPDADQSILNLIFKDKWLKLNRGYNYLIGTDYLFFKYGKTRYLEDLGET 227 Query: 260 -TLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRD 300 L+IHY KPW Y E + W+D + Sbjct: 228 IPLIIHYNTEAKPWLNIFNTRFRNIYWFYYELN-WQDIYAKH 268 >UniRef50_C5S494 Putative glycosyl transferase n=2 Tax=Actinobacillus minor RepID=C5S494_9PAST Length = 287 Score = 200 bits (509), Expect = 7e-50, Method: Composition-based stats. 
Identities = 66/281 (23%), Positives = 118/281 (41%), Gaps = 18/281 (6%) Query: 21 NINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAE 80 N + +N+A D NY + V I S+ +++ N+ FY+I Y D +F + + Sbjct: 1 MTNKQQTINIALAADRNYAEQVITLIKSVCYHHK--NVRFYLIHQDYPDEWFMALNQHLT 58 Query: 81 QNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDIS 140 I + + ++A ++R ++ DR++YLD+D+V G+I Sbjct: 59 NVGAEIIPVTVLDSFRFLSKLQEHITQATFYRYIIPEI---PEDRVIYLDSDIVVDGNIE 115 Query: 141 QLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKA 200 ++ NG V+D+ + + P+L YFN GV+ ++ + W + L E Sbjct: 116 EMYFSDFNGKYVLAVEDMY-ISYTEHGYIEFPDL-KPYFNGGVLLINNQLWKENDLAEYL 173 Query: 201 LSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELK--------DKTHQNY 252 + + NV + DQD++N +LK L YN I ++ Y Sbjct: 174 IQMTKQYPNVM-FGDQDILNFVLKDKWGILSHVYNYQTGIIHAFPRLEENMSDEEIITKY 232 Query: 253 KKLITE-STLLIHYTGATKPW-HKWAIYPSVKYYKIALENS 291 +K E ++IHYT KPW + + Y + S Sbjct: 233 QKQADEVKPIIIHYTTKYKPWLNSKYFVLLREKYWFYYQLS 273 >UniRef50_C7RG54 Glycosyl transferase family 8 n=1 Tax=Anaerococcus prevotii DSM 20548 RepID=C7RG54_ANAPD Length = 273 Score = 200 bits (508), Expect = 8e-50, Method: Composition-based stats. 
Identities = 57/263 (21%), Positives = 103/263 (39%), Gaps = 2/263 (0%) Query: 31 AYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLYR 90 +D NY+ + V +TSI +NN D Y+I ++ + + + ++ + R Sbjct: 6 LLTLDENYIPQMKVLMTSIYINNPGRIFDVYLIHSRISEDKLKDLGEDLKKFSYTLYPIR 65 Query: 91 INTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGA 150 D T + + MY+RL A + L L +LYLD D++ + LL ++ Sbjct: 66 ATDDLFSFAKVTDRYPKEMYYRLLAGEFLPENLGEILYLDPDMLVINPLDDLLRTDISDY 125 Query: 151 VAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNV 210 + A Y+NSG++ ++LK+ + ++ S + Sbjct: 126 ILAAASHTGKTDMANNVNRIRLGTDTDYYNSGLLLINLKRAREEIDPDEIFSFVEDNHMN 185 Query: 211 YKYPDQDVMNVLLKGMTLFL-PREYNT-IYTIKSELKDKTHQNYKKLITESTLLIHYTGA 268 PDQD++N + L YN S L Q + + T+++H+ G Sbjct: 186 LLLPDQDILNAMYGDRIYPLDDLIYNYDARNYSSYLIRSKKQADLAWLMDHTVVLHFCGR 245 Query: 269 TKPWHKWAIYPSVKYYKIALENS 291 KPW K YK + + Sbjct: 246 DKPWKKNHRNKFTSLYKHYMSLT 268 >UniRef50_A3VFX3 Glycosyltransferase n=1 Tax=Rhodobacterales bacterium HTCC2654 RepID=A3VFX3_9RHOB Length = 615 Score = 198 bits (505), Expect = 2e-49, Method: Composition-based stats. 
Identities = 74/341 (21%), Positives = 133/341 (39%), Gaps = 39/341 (11%) Query: 24 TSECLNVAYGVDANYLDGVGVSITSIVLNN-RHINLDFYIIADVYNDGFFQKIAKLAEQN 82 +NVA+ D YL + S++ + + + + + D + LA + Sbjct: 265 NDGAVNVAFTSDRPYLPQTAAMVASLIEHAAPDREYNLFYLHENIGDRDLDLLRSLAVAH 324 Query: 83 QLRITLYRINT---DKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDI 139 ITL+ IN + S A Y R F LL ++RL+YLD D+V GD+ Sbjct: 325 G-NITLHTINVGTAFSREYRARHHTPSNATYNRFLLFDLLP-DVERLVYLDVDLVLCGDV 382 Query: 140 SQLLHLGLNGAVAAVVKDVEPMQEKAVS-RLSDPEL-----------------LGQYFNS 181 ++L +N A A V D + A R DPE+ + +YFN+ Sbjct: 383 AELFDTDMNDAPLAAVTDALMTRVLATRVRTRDPEVPDLYAYLSDDLGLSDDQISRYFNA 442 Query: 182 GVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIK 241 GV+ ++ AK+ + ++ N Y + DQD++NV + + LP +N + + Sbjct: 443 GVMVMNFAAMDVAKVGRELREMVA--GNRYFFRDQDILNVYFRDRFVTLPSRFNVHNSDR 500 Query: 242 SELKDKTHQ--NYKKLITESTLLIHY-TGATKPWHKWAIYPSVKYYKIALENSP-WKDDS 297 + N ++H+ KPW + + + L +P W + Sbjct: 501 GAYDNVPVPIRNDALAAKADPFIVHFAAAHQKPWREPDV-EFAGLFWSTLARTPFWFE-- 557 Query: 298 PRDAKSIIEFKKRYKHLLVQHHYISGIIAGVCYLCRKYYRK 338 ++E +R++ L + GV R+ R+ Sbjct: 558 ------VLEATRRHRSLRARLSRPDTWKHGVVIAGRRLGRR 592 >UniRef50_C0BAU3 Putative uncharacterized protein n=1 Tax=Coprococcus comes ATCC 27758 RepID=C0BAU3_9FIRM Length = 348 Score = 198 bits (504), Expect = 2e-49, Method: Composition-based stats. 
Identities = 63/291 (21%), Positives = 121/291 (41%), Gaps = 23/291 (7%) Query: 30 VAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKIAKLAE-QNQLRIT 87 V + Y+ + + SI N N N D I+ + G ++ K E + + Sbjct: 14 VVLSANEYYVPYLAAVLESIRANSNDDQNYDLIIMHRDISMGSQDRLKKQLEDHQNITLR 73 Query: 88 LYRIN--TDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHL 145 I + L ++ YFRL Q+L D+ +Y+D+D+V DI++L Sbjct: 74 FLDIRRYEKPFKKLFLRGHFALETYFRLLMPQIL-ADYDKAVYIDSDLVVNADIAELYAT 132 Query: 146 GLNGAVAAVVKDVE---------PMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKL 196 ++G + A KD + P ++K + + + +YF +GV+ +L ++ Sbjct: 133 DVDGYLLAAAKDADTAGLYNGFEPNKKKYMDTILKIKKPYEYFQAGVIVFNLAEFRKTYT 192 Query: 197 TEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKS-------ELKDKTH 249 T + L S + ++ DQDV+N L +G F+ +N + + L K Sbjct: 193 TAEMLKFAASYE--WELLDQDVLNYLAQGRVKFVDMAWNVMVDWRGIRLSQIIALAPKYL 250 Query: 250 QNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRD 300 + ++ +IHY G KPWH+ + +++K + ++ R Sbjct: 251 HDEHMEARKNPKIIHYAGPDKPWHQPWSDMAEEFWKYSRNTVFYETIMQRM 301 >UniRef50_Q3DNA2 Glycosyl transferase, family 8 n=9 Tax=Streptococcus RepID=Q3DNA2_STRAG Length = 272 Score = 198 bits (503), Expect = 3e-49, Method: Composition-based stats. 
Identities = 68/269 (25%), Positives = 121/269 (44%), Gaps = 8/269 (2%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 +N+ + +D Y+D V + S+V ++ L+ Y++ + I + + ++ Sbjct: 1 MNLLFSIDDMYVDHFKVMLYSLVRQTKNRKLEIYVLQKTLLKRHTELI-QYTQNLEVGYH 59 Query: 88 LYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGL 147 + T+ P T + +Y+RL A + L TLDR+LYLDAD++C D S L + L Sbjct: 60 PIIVGTEVFAQAPTTDRYPDTIYYRLLAHKFLPETLDRILYLDADMLCLNDFSSLYDMEL 119 Query: 148 NGAVAAVV---KDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSIL 204 + A D + + RL + EL YFN+GV+ ++L + L + Sbjct: 120 GDQLYAAASHNTDGKFLDYVNKLRLKNVELESSYFNTGVLLMNLPAIRKVVHQQTILDYM 179 Query: 205 MSKDNVYKYPDQDVMNVLLKGMTLFLPRE---YNTIYTIKSELKDKTHQNYKKLITESTL 261 M PDQD++N L + +P E Y+ Y++ +LK + + + + T+ Sbjct: 180 MQNRGRLILPDQDILNGLYANLVKPIPDEIYNYDARYSLIYQLKSRNEWD-LEWVINHTV 238 Query: 262 LIHYTGATKPWHKWAIYPSVKYYKIALEN 290 +H+ G KPW K YK + Sbjct: 239 FLHFAGRDKPWKKDYRGRYSGLYKFMAKE 267 >UniRef50_A8FNA2 Putative uncharacterized protein n=2 Tax=Campylobacter jejuni subsp. jejuni 81116 RepID=A8FNA2_CAMJ8 Length = 791 Score = 197 bits (501), Expect = 5e-49, Method: Composition-based stats. 
Identities = 64/307 (20%), Positives = 126/307 (41%), Gaps = 17/307 (5%) Query: 6 AIEIDKVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNR-HINLDFYIIA 64 A EIDK F L + + + + DANY + V + SI + + N D YI+ Sbjct: 361 AGEIDKEIDNFFILPPQDKLSHIPIVFSCDANYFSYLTVVLQSIKEKSSENYNYDIYILH 420 Query: 65 DVYNDGFFQKIAKLAEQNQLRITLYRIN-----TDKLQCLPCTQVWSRAMYFRLFAFQLL 119 + + QK+ + I I+ +S A Y+R F ++ Sbjct: 421 NKLDKSLTQKLINYIQAENFSIKFVDISRILNLLKSQIQFYTALFFSEATYYRFFIPKIF 480 Query: 120 GLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELL--GQ 177 +++YLD D++ K D++ L + + +AA + ++A R++ ++ Sbjct: 481 -KEFKKIIYLDTDIIVKQDLNLLYSIDFDKPLAAAKCMIFSQVKQADHRITKLKMKQPEN 539 Query: 178 YFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTI 237 YF +GV+ +++K T+K L+ L + DQDV+N + +G ++ ++N + Sbjct: 540 YFQAGVMVYNIQKCLKMDFTQKCLNKLQELKDP-PLVDQDVLNAVFEGDIHYISLKWNCL 598 Query: 238 YTIKSE------LKDKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENS 291 + + L K + +IHY KPW+ + P + + Sbjct: 599 WNVSYRIPNFKILYSKDFLKDYQEAERDPYIIHYCDYFKPWNSPHL-PKADIWWHYARQT 657 Query: 292 PWKDDSP 298 P+ ++ Sbjct: 658 PFYEEIL 664 >UniRef50_Q4HGS8 General stress protein A, putative n=2 Tax=Campylobacter RepID=Q4HGS8_CAMCO Length = 403 Score = 197 bits (501), Expect = 5e-49, Method: Composition-based stats. 
Identities = 79/336 (23%), Positives = 148/336 (44%), Gaps = 38/336 (11%) Query: 29 NVAYGVDANYLDGVGVSITSIVLNN------RHINLDFYIIADVYNDGFFQKIA----KL 78 ++ + D NY+ V ITSI+ N ++ F+I+++ ++ +K+ +L Sbjct: 3 HIIFSADENYIKYTSVLITSIIKNTNPKNHFQNRPYSFHILSNFVSEETREKLECLKKEL 62 Query: 79 AEQNQLRITLYRINTDKLQCLPCTQVW--SRAMYFRLFAFQLLGLTLDRLLYLDADVVCK 136 + I+++ ++ D+ + P + S+ Y+RL L +D+ LYLD+D++C Sbjct: 63 NKIYPCEISIHIMSDDRFENFPSSGAAQNSKLPYYRLKFISLFDDNVDKCLYLDSDMLCM 122 Query: 137 GDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSD----PELLGQYFNSGVVYLDLKKWA 192 DI ++ + L G + VV D + K ++ + YFNSG + ++ K++ Sbjct: 123 CDIREIFAIDLQGKIIGVVGDPGSKRSKIKFIENNTKKVLKFDENYFNSGFLLINAKEYK 182 Query: 193 DAKLTEKALSILMSKDNVYKYPDQDVMNVLL-KGMTLFLPREYNTIY----TIKSELKDK 247 A + +K L K K DQD++N ++ K L L YN + + + K Sbjct: 183 KANVEKKCEE-LAKKCIYIKAADQDLLNAVISKDKILKLSFAYNFNIITLLYVICKDEKK 241 Query: 248 THQNYKK----LITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKS 303 NY + ++ ++HY KPW Y + L+N D AK Sbjct: 242 NRLNYTREEFTQSAKNPKILHY--GEKPWKFLKSY-------VDLQNRNISDYWWDIAKE 292 Query: 304 IIEFKK---RYKHLLVQHHYISGIIAGVCYLCRKYY 336 + FK+ R K + + +G+ + LC+KY Sbjct: 293 VPIFKEELLRQKENIKDYLLYAGLGFTLYNLCKKYQ 328 >UniRef50_D1Y7U2 Glycosyltransferase, family 8 n=1 Tax=Pyramidobacter piscolens W5455 RepID=D1Y7U2_9BACT Length = 617 Score = 196 bits (499), Expect = 9e-49, Method: Composition-based stats. 
Identities = 60/334 (17%), Positives = 127/334 (38%), Gaps = 25/334 (7%) Query: 6 AIEIDKVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIA 64 I + K + + + + Y+ +G + SI + + + YI Sbjct: 258 QIALVKNPQTEQEIVVNARENDVPAVLAANEKYVPILGTCLKSIADHCSSSRSYKLYIFH 317 Query: 65 DVYNDGFFQKIAKLAEQNQLRITLYRINTDKLQC-LPCTQVWSRAMYFRLFAFQLLGLTL 123 + + + E + +T ++ + L + + ++R LL + Sbjct: 318 TDIQEESQRNLKTFLESDNFSLTFVNVSLHVGKYRLRAKEHVTTETFYRFLILDLLKM-Y 376 Query: 124 DRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVE---------PMQEKAVSRLSDPEL 174 D++LYLD D++ + DI+ L L L + D + P K + + Sbjct: 377 DKVLYLDCDMIIQRDIADLYDLDLGTNLIGAALDPDFTGQCNGANPATRKYCDAVLKLKD 436 Query: 175 LGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREY 234 YF +GV+ +++ + + + L + + +YKY DQD++NV+ +G L+L + Sbjct: 437 CFTYFQAGVLLMNVAELNKSVTVRQLLEMAET--GIYKYSDQDILNVVCEGRALYLDMAW 494 Query: 235 NTIYTIKS-------ELKDKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIA 287 N + + + + E +IHY G KPW K +++K A Sbjct: 495 NLLSDCDHYRWHHVVKFAPHYILDMYENAREKPYIIHYAGFLKPWMKLGEDFGYEFWKAA 554 Query: 288 LENSPWKDDSP---RDAKSIIEFKKRYKHLLVQH 318 E +P+ ++ + + H+L+ Sbjct: 555 RE-TPFYEELLYAALVPHGNTTRPQNFLHMLINR 587 >UniRef50_Q0I2Z7 Lipopolysaccharide biosynthesis protein, glycosyltransferase, family 8 n=1 Tax=Haemophilus somnus 129PT RepID=Q0I2Z7_HAES1 Length = 354 Score = 196 bits (499), Expect = 1e-48, Method: Composition-based stats. 
Identities = 71/354 (20%), Positives = 128/354 (36%), Gaps = 61/354 (17%) Query: 28 LNVAYGVDANYLDGVGVSITSIV----LNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQ 83 +N+ + D NY + V++ SI+ NN + FY++ + +LA +N Sbjct: 1 MNILFACDDNYAKYLAVTMLSIIHARDKNNECYTIHFYLLDMGISTVAKDYCLELANKNN 60 Query: 84 LRITLYRINTDKLQCLPCT-QVWSRAMYFRLFAFQLLGL-TLDRLLYLDADVVCKGDISQ 141 + + I+ + P T + S + Y RL L L +++YLD D++ + Sbjct: 61 CHLDIVPISISDFEKFPRTIEYISLSTYARLNLANYLKKFNLTKIIYLDIDILVNHSLLP 120 Query: 142 LLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQ------------------------ 177 L + L D ++ R+S + Sbjct: 121 LWNTDLGNKAIGACYDAFIESQEKSKRMSSQSVSQSVSQSVSQSVSQSVSQSVSQSVSQS 180 Query: 178 ---------------------YFNSGVVYLDLKKWADAKLTEKALSIL---MSKDNVYKY 213 YFN+GV+ +++ +W + EK+L + + + Y Sbjct: 181 VSQSVSQSDYKTKLHLPNTHFYFNAGVLLINVVEWEKCHVFEKSLQWIEYCKRNNIEFLY 240 Query: 214 PDQDVMNVLLKGMTLFLPREYNTIYTIKSELK--DKTHQNYKKLITESTLLIHYTGATKP 271 DQD++N + +L YN + LK K N + T +IHY G K Sbjct: 241 QDQDILNAIFANNVKYLDLRYNFTANALNRLKRVSKKELNQYEEATMPLAIIHYVGPKKS 300 Query: 272 WHKWAIYPSVKYY---KIALENSP--WKDDSPRDAKSIIEFKKRYKHLLVQHHY 320 WH+ + LEN P WK ++ + + F K +H ++ Y Sbjct: 301 WHEKCSMLKANLFCHLFQQLENPPKEWKIENVPFIRKLKRFAKDLRHKIIYKIY 354 >UniRef50_C5S3F7 Putative glycosyl transferase n=2 Tax=Actinobacillus minor RepID=C5S3F7_9PAST Length = 275 Score = 196 bits (498), Expect = 1e-48, Method: Composition-based stats. 
Identities = 59/278 (21%), Positives = 112/278 (40%), Gaps = 17/278 (6%) Query: 20 ANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLA 79 + + D + + + +I SI +N NL ++ ++ +F+ + Sbjct: 3 QTNKQTNKQTIILAADIKFAEQLETTIKSICYHN--ANLYIVLLNRDFSKEWFEYLNTYL 60 Query: 80 EQNQLRITLYRINTDKLQCLPCTQVWSRA-MYFRLFAFQLLGLTLDRLLYLDADVVCKGD 138 Q I ++N ++L+ S A +FR F + D++LYLD D+V G Sbjct: 61 NQINCEIIDVKVNCNQLEEYKTLPHISSASTFFRYFIPAFV--NDDKVLYLDCDLVVNGS 118 Query: 139 ISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTE 198 +S L LN A D ++ ++FN+GV+ ++ K W ++T Sbjct: 119 LSIFFDLELNDHYVAASLD----------DIAFNFHQKKHFNAGVLLINNKLWRKQEITL 168 Query: 199 KALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITE 258 KAL + + + DQ+V+N+L + + L N + + + Y + + Sbjct: 169 KALELTDRLNEKLEEGDQEVLNILFQNKWIELNPYLNYLVGAEYLYRRNGVTQYIRRQED 228 Query: 259 S-TLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKD 295 L++H+ KPW P +YY + W D Sbjct: 229 DVPLILHFNTKYKPWLPIDGVPFREYYWFYYRLN-WAD 265 >UniRef50_Q4JZJ9 Putative glycosyl transferase n=8 Tax=Streptococcus pneumoniae RepID=Q4JZJ9_STRPN Length = 344 Score = 195 bits (497), Expect = 1e-48, Method: Composition-based stats. 
Identities = 67/319 (21%), Positives = 134/319 (42%), Gaps = 18/319 (5%) Query: 13 KAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFF 72 + + +N+ Y D N++D + SI S+ N ++L+ +IIAD +D Sbjct: 16 SIFFISENKFRSRNFMNIVYATDNNFVDVLSASIKSLYTTNSDLDLNLWIIADKVSDRNK 75 Query: 73 QKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDAD 132 +KI +L++Q R + I ++ S + + RLF +L ++ ++LYLD+D Sbjct: 76 EKINRLSKQFAQR-EINWIENVEIPFKLHLDRGSISSFSRLFLGSVLPSSMSKVLYLDSD 134 Query: 133 VVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWA 192 ++ + + + G + V D + K V + + FN+GV+ ++L+ W Sbjct: 135 IIVMDSLRSIFDIDFKGKILYGVNDTFNKEYKQVLGIP---IDKPMFNAGVMLINLELWR 191 Query: 193 DAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELK------- 245 + + E+ L ++ + D V+N +L LP EYN + + Sbjct: 192 NNNVEERFLQVIQKFNGTILQGDLGVLNAVLYNSFGVLPPEYNYMTIFEDLTYEEMIVFK 251 Query: 246 ---DKTHQNYKKLITESTLLIHYTGAT---KPWHKWAIYPSVKYYKIALENSPWKDDSPR 299 + + K E +L H+T + +PW + + V+ +K +K SP Sbjct: 252 KPINYYSKEEIKNARERIVLRHFTTSFLSKRPWQESSEVTHVEIFKKYYRG-AYKQASPS 310 Query: 300 DAKSIIEFKKRYKHLLVQH 318 +I + + L + Sbjct: 311 KLLNIYKILPKKMSLYLLG 329 >UniRef50_C8WAA9 Glycosyl transferase family 8 n=2 Tax=Atopobium RepID=C8WAA9_ATOPD Length = 358 Score = 195 bits (495), Expect = 2e-48, Method: Composition-based stats. 
Identities = 56/321 (17%), Positives = 126/321 (39%), Gaps = 30/321 (9%) Query: 31 AYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKIAKLAEQ-NQLRITL 88 + N++ + V+I SI+ N N D ++ + + + A+ N + + Sbjct: 19 VFACSDNFVPYLSVAIQSIIENVNPERRYDIIVLTRDLSPTNMITLTRQAQLVNNVHVGF 78 Query: 89 YRINTDKLQC-LPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGL 147 ++ LP + Y+RL A LL +++ +YLD+D+V DI++L + + Sbjct: 79 LDVDAALGDIELPHHGHFRPETYYRLLAPSLLP-NVNKAIYLDSDLVVNTDIAELYDIDI 137 Query: 148 NGAVAAVVKDVEPMQE------------KAVSRLSDPELLGQYFNSGVVYLDLKKWADAK 195 G + +D + + + K + DP YF +GV+ ++L++ Sbjct: 138 TGYLVGATRDADTIGQIDGYDATVGPYLKNELGMDDPH---DYFQAGVILMNLEEIRKQI 194 Query: 196 LTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKD-------KT 248 E+ L + + +++ DQDV+N + G L + ++N + + +D K Sbjct: 195 SPEEFLKVSTMR--TWRWLDQDVLNRFVNGHYLRINMKWNYLVDWQFLRRDHIVAQAPKD 252 Query: 249 HQNYKKLITESTLLIHYTGAT-KPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEF 307 + + ++ + H+ G +PW + SP+ ++ + Sbjct: 253 IREEYEEARKNICIAHFAGPDNRPWLYPNSD-LAGLFWFYARRSPYLEELRSQLEESRRT 311 Query: 308 KKRYKHLLVQHHYISGIIAGV 328 + H + G++ Sbjct: 312 VRGLSHRVQSGVLFRGLMPLF 332 >UniRef50_D2LIH7 Glycosyl transferase family 8 n=1 Tax=Rhodomicrobium vannielii ATCC 17100 RepID=D2LIH7_RHOVA Length = 391 Score = 195 bits (495), Expect = 2e-48, Method: Composition-based stats. 
Identities = 51/300 (17%), Positives = 123/300 (41%), Gaps = 24/300 (8%) Query: 27 CLNVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKIAKLAEQNQLR 85 + V + Y+ G I SI + + + D + AD + + ++ + Sbjct: 43 AVPVVMCFNRRYMPGGAALIASIAEHASPNRLYDLIVFADDLASEDRDMLRNVCDKPNIS 102 Query: 86 ITLYRIN-TDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLH 144 + + ++ + + ++RL L+ D+++Y+DAD + D++ L Sbjct: 103 LRFFDVSRCFDGINFITHFHFRKENFYRLKIPDLM-RDFDKVVYIDADTITNRDLADLYD 161 Query: 145 LGLNGAVAAVVKDVEPMQEKAVSRLS----------------DPELLGQYFNSGVVYLDL 188 + ++G A V+D + + L + YFNSG+V ++ Sbjct: 162 IDVDGYYIAAVRDFAMIATQNKKMLDIVGKKIYYETYVKDYLGLIGISNYFNSGLVLFNI 221 Query: 189 KKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKT 248 K ++++E+ ++++ +K ++ Y DQD++N++ + + +N + + Sbjct: 222 NKINGSQISERLIALIGTK--LFAYVDQDILNIVFENKVKLIDYSWNMVIDCERLYHLSE 279 Query: 249 HQNYKKLIT--ESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIE 306 Y + + + ++HY G KPW+ ++ +YY +P + R+ + E Sbjct: 280 PDLYARYLDAGAAPHVVHYIGGNKPWNDPTVHM-AEYYWRYAAKTPLYEKLLREIRERRE 338 >UniRef50_Q3DM64 Glycosyl transferase, family 8, degenerate n=6 Tax=Streptococcus agalactiae RepID=Q3DM64_STRAG Length = 394 Score = 194 bits (493), Expect = 4e-48, Method: Composition-based stats. 
Identities = 58/267 (21%), Positives = 107/267 (40%), Gaps = 22/267 (8%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLY 89 + D L+ + I SI+ +N ++ YI+ +F+ I + + I Sbjct: 6 ICLAGDNKSLNQIQTVIKSILCHNDRVS--IYILNQDIASEWFRNIQRRLLNSHSCIFDI 63 Query: 90 RINTDKLQCLPCTQ-VWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLN 148 ++ D + + + Y R + QL+ +++LYLD D + ++ +L + L Sbjct: 64 KLFDDTFKEFKTPRAHITYMAYARYYIPQLI--DAEKVLYLDIDTLVVDNLDKLFEIELG 121 Query: 149 GAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKD 208 A + D + G +FNSGV+ ++ W ++TEK L I + Sbjct: 122 DYPIAAILDGD----------------GIHFNSGVMLINSLYWMRYRVTEKLLEITEREL 165 Query: 209 NVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTGA 268 + + DQ V+N+L L L +YN + Q Y ES +IHY Sbjct: 166 DNGIFGDQGVLNLLFDNNWLKLEDKYNAQVGNDLGAFYENWQGYFDRNFESPTIIHYCTH 225 Query: 269 TKPWHKWAIYPSVKYYKIALENSPWKD 295 KPW+ ++ + + E W + Sbjct: 226 DKPWNTFSSSRFRETWWQY-EQLDWNE 251 >UniRef50_C1CFZ1 Glycosyl transferase, family 8 n=17 Tax=Streptococcus pneumoniae RepID=C1CFZ1_STRZJ Length = 404 Score = 193 bits (492), Expect = 5e-48, Method: Composition-based stats. 
Identities = 64/280 (22%), Positives = 112/280 (40%), Gaps = 24/280 (8%) Query: 26 ECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLR 85 ++ + D +Y+D + +I SI N L FY+ D +F + K + Q Sbjct: 2 NTKSIVFNADNDYVDKLETAIKSICCYN--NCLKFYVFNDDIASEWFLMMNKRLKTIQSE 59 Query: 86 ITLYRINTDKLQCLPCT-QVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLH 144 I +I L+ + S A +FR F + + R LYLD+D++ G + L Sbjct: 60 IVNVKIVDHVLKKFHLPLKNLSYATFFRYFIPNFVKES--RALYLDSDIIVTGSLDYLFD 117 Query: 145 LGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSIL 204 + L+G A V+D S ++ FNSG++ +++ W D K L + Sbjct: 118 IELDGYALAAVED------------SFGDVPSTNFNSGMLLVNVDTWRDEDACSKLLELT 165 Query: 205 MSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLIT----EST 260 Y DQ ++N+L L R +N + + S + + + ++ + Sbjct: 166 NQYHE-TAYGDQGILNMLFHDRWKRLDRNFNFMVGMDSVAHIEGNHKWYEISELKNGDLP 224 Query: 261 LLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRD 300 +IHYTG KPW + + + W D R Sbjct: 225 SVIHYTGV-KPWEIISNNRFREVWWFY-NLLEWSDILLRK 262 >UniRef50_C2LRU0 Glycosyl transferase, family 8 n=1 Tax=Streptococcus salivarius SK126 RepID=C2LRU0_STRSL Length = 402 Score = 193 bits (491), Expect = 7e-48, Method: Composition-based stats. 
Identities = 53/280 (18%), Positives = 100/280 (35%), Gaps = 21/280 (7%) Query: 29 NVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITL 88 +V + + +Y++ + V++ S+ + Y++ + +F + + E I Sbjct: 5 SVVFVAELSYMEKLEVALKSLCAH--KGQWKIYVLNENLPTEWFTLMNRRLEAIDSEILN 62 Query: 89 YRINTDKLQCLPC-TQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGL 147 R++ + + + A +FR + + R+LYLD D++ D+S L + L Sbjct: 63 CRVSAESFKQFSLPSAHIHYATFFRYAIPEFVQEN--RVLYLDCDMIFTQDLSPLFEVDL 120 Query: 148 NGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSK 207 G V D + FN+G++ +D W K+T+ + Sbjct: 121 GGLGIGAVVDRPTTTDG--------------FNAGLMVIDTDWWRQHKVTDSLFDLTKEH 166 Query: 208 DNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTG 267 Y DQ ++N+ K LP YN + + +IHYT Sbjct: 167 HQNV-YGDQGILNLYFKDAWYQLPWTYNLQVGSDKDQYGYGDLEWYDAFKGVPAVIHYTS 225 Query: 268 ATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEF 307 KPW + S W++ R I F Sbjct: 226 HNKPWTSKRFNRFRDIWWFYYALS-WEEILLRKPSLKISF 264 >UniRef50_B1I7M9 Glycosyl transferase, family 8 n=16 Tax=Streptococcus pneumoniae RepID=B1I7M9_STRPI Length = 406 Score = 193 bits (491), Expect = 8e-48, Method: Composition-based stats. 
Identities = 52/262 (19%), Positives = 106/262 (40%), Gaps = 21/262 (8%) Query: 29 NVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITL 88 +V + D Y+ + ++ S+ +N H L Y++ +F +I ++ + Sbjct: 7 SVVFAGDYAYIRQIETAMKSLCRHNSH--LKIYLLNQDIPQEWFSQIRIYLQEMGGDLID 64 Query: 89 YRINTDKLQCLPCTQ--VWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLG 146 ++ + Q + + + R F + T D++LYLD+D++ GD++ L L Sbjct: 65 CKLIGSQFQMNWSNKLPHINHMTFARYFIPDFV--TEDKVLYLDSDLIVTGDLTDLFELD 122 Query: 147 LNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMS 206 L A + G FN+GV+ ++ KKW + +K + + Sbjct: 123 LGENYLAAARSCF--------------GAGVGFNAGVLLINNKKWGSETIRQKLIDLTEK 168 Query: 207 KDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITES-TLLIHY 265 + + DQ ++N+L K L +YN HQ + E L++HY Sbjct: 169 EHENVEEGDQSILNMLFKDQYSSLEDQYNFQIGYDYGAAAFKHQFIFDIPLEPLPLILHY 228 Query: 266 TGATKPWHKWAIYPSVKYYKIA 287 KPW+++++ + + Sbjct: 229 ISQDKPWNQFSVGRLREVWWEY 250 >UniRef50_B9BAZ6 Glycosyl transferase family 8 protein n=1 Tax=Burkholderia multivorans CGD1 RepID=B9BAZ6_9BURK Length = 617 Score = 192 bits (489), Expect = 1e-47, Method: Composition-based stats. 
Identities = 72/330 (21%), Positives = 133/330 (40%), Gaps = 36/330 (10%) Query: 26 ECLNVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKIAKLAEQNQL 84 +++ D N++ + I S+ N + LD ++ + + K +N Sbjct: 282 NAVSIVTVADGNFVPHLAAFIASVQDNIDPERVLDLIVLDGGIPADQQRLLMKQFHRNGK 341 Query: 85 -RITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLL 143 R++ + +P +S A ++RL +LL R++Y+D+D + GD+S+L Sbjct: 342 GRLSFIQ-CAHLFSDIPLHGPFSAATFYRLSMGELLAKHR-RVVYVDSDTIVLGDLSELF 399 Query: 144 HLGLNGAVAAVVKDV------------------EPMQEKAVSRLSDPELLGQYFNSGVVY 185 L L A V DV P R+ +YF +G++ Sbjct: 400 DLDLGNNAVAAVPDVIMKSFVSSGVPALREAGGAPAGIYLKERVGMGNRGNEYFQAGLIV 459 Query: 186 LDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIY---TIKS 242 +DL ++ ++ E A L+++ Y + DQDV+N L G FL +N + + S Sbjct: 460 IDLDEFRRLRIGEDAYKDLLAR--RYWFLDQDVLNKYLLGHVKFLDLSWNVVNASMDVLS 517 Query: 243 ELKDKTHQNYKKLITESTLLIHYTGAT-KPWHKWAIYPSVKYYKIALENSPWK----DDS 297 L+ K++ + ++HY G KPW++ P +Y L + W D Sbjct: 518 GLETDIAAKVKEVF-AAPSMVHYAGHEAKPWNRPTA-PLAHFYWYYLRRTYWYESVIDRR 575 Query: 298 PRDAKSIIEFKKR--YKHLLVQHHYISGII 325 P +E ++ YK L + G + Sbjct: 576 PISPTLDVELQRSRLYKRLRAIWRRMPGFV 605 >UniRef50_A4UX76 Putative LPS biosynthesis protein n=2 Tax=Lactobacillaceae RepID=A4UX76_9LACO Length = 316 Score = 192 bits (488), Expect = 2e-47, Method: Composition-based stats. 
Identities = 64/278 (23%), Positives = 106/278 (38%), Gaps = 27/278 (9%) Query: 24 TSECLNVAYGVDANYLDGVGVSITSIVLNN-RHINLDFYIIADVYNDGFFQKIAKLAEQN 82 ++ + + Y VD NY + VS+ S+V + + ++ D N ++ K E + Sbjct: 2 ENQTVPIFYAVDDNYAPYLAVSLASLVAHTSPDRHYQVIVLCDDLNTDNQGRL-KAFETD 60 Query: 83 QLRITLYRINTDKLQ------CLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCK 136 L+I IN Q + ++ +YFRLF +L LD+ LYLDAD V Sbjct: 61 NLKIQFVSINDRLKQEITDKNNKLRSDYFTFTIYFRLFIAELFP-KLDKALYLDADTVVL 119 Query: 137 GDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPEL---LGQYFNSGVVYLDLKKWAD 193 D+ +L L + V D + + +Y SGV+ ++L + Sbjct: 120 KDVGELFDTQLGDNLVGAVPDPFVGHTPETIDYVEQAVGIDSQKYVCSGVLLMNLAEMRR 179 Query: 194 AKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYK 253 K E L +L PDQD MN + + +L ++ T ++ Sbjct: 180 LKFAEHFLQLLNKYHFKCLAPDQDYMNAIARNRIYYLNPSWHIQITTPQDV--------- 230 Query: 254 KLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENS 291 LIHY KPW P Y+ + + Sbjct: 231 -----DPWLIHYNLFAKPWRYDDA-PRQSYFWTYAKQT 262 >UniRef50_A5UC07 Aspartate-semialdehyde dehydrogenase n=7 Tax=Haemophilus influenzae RepID=A5UC07_HAEIE Length = 300 Score = 192 bits (488), Expect = 2e-47, Method: Composition-based stats. 
Identities = 59/276 (21%), Positives = 116/276 (42%), Gaps = 19/276 (6%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 +N+ + +D N+ + + S+ + H N++ Y+I D +K+ + + Sbjct: 1 MNIVFTLDCNFASHLDTVLKSLCYH--HNNINIYVIHDGIPAESLEKLKMHCAKFDNTLY 58 Query: 88 LYRINTDKLQC---LPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLH 144 + N ++ + + S A FRL+ Q+L ++R++YLD D++ I +L Sbjct: 59 DIQFNINQFSFPTVMSPAHIQSSASLFRLYLHQILPQHIERVIYLDIDLIIHQAIDELWD 118 Query: 145 LGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSIL 204 + L ++ A V D QY N+GV+ ++L KW + + + + Sbjct: 119 INLEDSLIAGVSDFFSEYLWEHP----FYEKQQYINTGVMLINLNKWRENNIEQYFIEYA 174 Query: 205 MSKDNVYKYPDQDVMNVLL-KGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLI 263 + Y DQDV+N + + LP ++N + + +K+ I + +I Sbjct: 175 AKYGEFFVYGDQDVINFSIPTNLIKLLPVKFNIQV----KFIEYLWMEHKEKIKFTPHII 230 Query: 264 HYTGATKPWHKW----AIYPSVKYYKIALENSPWKD 295 HY G+ KPW K + + Y S W + Sbjct: 231 HYIGSNKPWLKEHSANSPRFYNEEYLFYHHLS-WDN 265 >UniRef50_B3WD33 WbbM protein n=8 Tax=Lactobacillus RepID=B3WD33_LACCB Length = 318 Score = 192 bits (487), Expect = 2e-47, Method: Composition-based stats. 
Identities = 60/272 (22%), Positives = 106/272 (38%), Gaps = 23/272 (8%) Query: 25 SECLNVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKIAKLAEQNQ 83 + + + VD Y+ + V++TSI N + + I+ ++A LA + Sbjct: 4 QTTVPIFFSVDDGYVPCLAVALTSIRTNKDPQTDFVINILNSGLLQKNQTRLAALAAPH- 62 Query: 84 LRITLYRINTDKLQ-----CLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGD 138 I ++ Q + +YFRLF + D+ +Y+DAD V GD Sbjct: 63 FTINFIDMDAVTQQISGDTNKLRGDYVTLTIYFRLFIADMFP-QYDKAIYIDADTVADGD 121 Query: 139 ISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPEL---LGQYFNSGVVYLDLKKWADAK 195 +++L L + A V D M + G+Y NSGV+ L+L + Sbjct: 122 LAELFTTDLGDNLVAGVADPVMMTYPETIEYIQRDFGVQPGEYINSGVLILNLAQMRQEH 181 Query: 196 LTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKL 255 +++ L +L + DQD +NV+ + +LP+ +N + + Sbjct: 182 FSDRFLHLLKTYHFTMIAADQDYINVIAQHRIKYLPKTWNMQTGVPT------------A 229 Query: 256 ITESTLLIHYTGATKPWHKWAIYPSVKYYKIA 287 LIHY KPWH + ++ A Sbjct: 230 AESGGKLIHYNLFGKPWHYRDAKLAANFWHYA 261 >UniRef50_A5EVI8 Glycosyl transferase family 8 protein n=1 Tax=Dichelobacter nodosus VCS1703A RepID=A5EVI8_DICNV Length = 617 Score = 190 bits (482), Expect = 8e-47, Method: Composition-based stats. 
Identities = 67/335 (20%), Positives = 128/335 (38%), Gaps = 28/335 (8%) Query: 1 MDSFPAIEIDKVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNR-HINLD 59 ++ P + + + + ++V D +Y+ +G I SI+ + LD Sbjct: 253 VNYLPRVFMKNTAEKHWIAQKVCQKNAVSVVIAADEHYVPHLGALICSIIDHLSCDAFLD 312 Query: 60 FYIIADVYNDGFFQKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLL 119 I+ + +++A L + I ++ D+ +SRA ++RL +L+ Sbjct: 313 LIILDGGIDFISQKQLAHLLGKRGA-IQFLDLS-DEFTDQKVHMHFSRATFYRLILDKLI 370 Query: 120 GLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKD------------------VEPM 161 + R+LY+D D + D+++L LNG V D P Sbjct: 371 -IDRKRVLYIDCDTIVLADLAELFATDLNGKAIGAVFDYIMHHFCQVGVRSIEFTNYLPA 429 Query: 162 QEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNV 221 ++ + E YF +GV+ DL++ +K ++ L K Y + DQD++N Sbjct: 430 KKYLEDYVGLKENWRHYFQAGVILFDLEQLRTLNYADKMIASLTEK--RYWFLDQDILNK 487 Query: 222 LLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKL--ITESTLLIHYTGAT-KPWHKWAIY 278 G FL +N + + + + +L + +IHY G KPW + Sbjct: 488 YFVGNVHFLNPCWNVVNVGADIYEGLSAELIAELKAAERAPAIIHYAGYEAKPWVDLSA- 546 Query: 279 PSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKH 313 ++Y L + W + + KK K Sbjct: 547 KFAEFYYYYLRQTFWYESVLTSKMLLNVRKKSQKS 581 >UniRef50_Q5M3K9 Glycosyl transferase n=4 Tax=Streptococcus RepID=Q5M3K9_STRT2 Length = 697 Score = 189 bits (481), Expect = 1e-46, Method: Composition-based stats. 
Identities = 53/263 (20%), Positives = 103/263 (39%), Gaps = 25/263 (9%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLY 89 + + Y+D V +I SIV ++R N+ FY+I D ++ +F+ + + + Sbjct: 303 IVLAANYTYVDQVLTTIKSIVFHHR--NIRFYLINDDFSQEWFRGLNRHLAAFGSEVINC 360 Query: 90 RINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNG 149 R+++ ++ + A Y R F + +R LYLD+D+V G + L L L G Sbjct: 361 RVDSSHIKQFKTNSNY--ASYLRYFVADFVSE--ERALYLDSDMVVTGSLEDLFTLDLQG 416 Query: 150 AVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDN 209 A V+D + + F++G + +D W + + + + Sbjct: 417 RPLAAVRDYAVQGQDRQAM----------FDAGFMVIDTAYWKQYNMRRHLIDMTSEWHD 466 Query: 210 VYKYPDQDVMNVLLKGMTLFL--PREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTG 267 + +Q ++N++ L L Y + S Q+Y ++HYT Sbjct: 467 KVPFAEQSILNMVFCNNWLTLSFDNNYAVTKSSLSGYHLPNGQDY-------PKVLHYTS 519 Query: 268 ATKPWHKWAIYPSVKYYKIALEN 290 KPW A + + + Sbjct: 520 HRKPWLPLACQAYREVWWFYAQM 542 >UniRef50_UPI0001693121 general stress protein n=1 Tax=Paenibacillus larvae subsp. larvae BRL-230010 RepID=UPI0001693121 Length = 352 Score = 188 bits (479), Expect = 2e-46, Method: Composition-based stats. 
Identities = 62/268 (23%), Positives = 105/268 (39%), Gaps = 26/268 (9%) Query: 35 DANYLDGVGVSITSIVLNNRHINLDFYIIADV-YNDGFFQKIAKLAEQNQLRITLYR--I 91 D Y + G + S+ N +++ +I+ D + QK+ +L I Y I Sbjct: 12 DGAYAEHAGAVLASVFCNTSS-SVNVHILHDETLTEANKQKLIELTSSFNQTIHFYPVTI 70 Query: 92 NTDKLQCLPCTQ---VWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLN 148 + LQ + + W++A +RL L+ +D+++YLD DV+ +I++L + L Sbjct: 71 PDNMLQAMAGVKSISFWTQASMYRLLIPALIP--VDKIIYLDCDVLVNMNIAELWEVQLG 128 Query: 149 GAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADA-KLTEKALSILMSK 207 A V D M YFNSGV+ L E+ L+ L Sbjct: 129 DFYLAAVWDQAIMAAVQHIIPYGLN-PDSYFNSGVILFALNNIRKKIDWYEEMLNFLRRY 187 Query: 208 DNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTG 267 + PDQD +N + L L R +N + + ++H+ G Sbjct: 188 PD-TSMPDQDTLNAVFGENYLQLDRRFNFFNMVSPH------------HDFNNKIVHFAG 234 Query: 268 ATKPWHKWAIYPSVKYYKIALENSPWKD 295 + K W + P Y+ L +PWK Sbjct: 235 SEKCWDVHS--PGANLYQEYLSLTPWKK 260 >UniRef50_B0NR59 Putative uncharacterized protein n=1 Tax=Bacteroides stercoris ATCC 43183 RepID=B0NR59_BACSE Length = 306 Score = 186 bits (473), Expect = 8e-46, Method: Composition-based stats. 
Identities = 60/278 (21%), Positives = 119/278 (42%), Gaps = 17/278 (6%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINL-DFYIIAD-VYNDGFFQKIAKLAEQNQLR 85 + + + +D NY+ GV I S+++N+ D YI++ + + + K + Sbjct: 4 IPIVFSIDHNYVMQAGVCILSLLMNSDEKEYYDIYILSAADITEHDKELLNKTIFAYKAD 63 Query: 86 ITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHL 145 I I+ D+ + S+A YFRL L+ D+++Y D DV+ + + ++L Sbjct: 64 INFIEID-DRFDNAFEIRNISKAAYFRLLIPDLIP-QYDKIIYSDVDVIFQSGLQEVLDT 121 Query: 146 GLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILM 205 L +K + K + G Y NSG + ++ K + +L K L Sbjct: 122 DLKDNYFGGIKAIGAESIKDYIIQLGLNIHG-YINSGFLLINAKLQREKQLFNKIQEYLT 180 Query: 206 SKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTL---- 261 K +++ DQD++N++ K FLP +Y EL + + + + Sbjct: 181 KK---FQFQDQDIINIVCKNRLTFLPLKY-CFTQKSYELYYTNPKRLFSVFSPKEVEEAF 236 Query: 262 ---LIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDD 296 +IHY G KPW+ + Y +++ ++ + ++ Sbjct: 237 TEGIIHYEGTNKPWNGFC-YRYDNWWRYYKKSVFYSEE 273 >UniRef50_B9QZ95 Glycosyl transferase family 8 n=1 Tax=Labrenzia alexandrii DFL-11 RepID=B9QZ95_9RHOB Length = 309 Score = 186 bits (473), Expect = 9e-46, Method: Composition-based stats. 
Identities = 65/310 (20%), Positives = 127/310 (40%), Gaps = 30/310 (9%) Query: 29 NVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQ--LRI 86 N+A D L G+ V+I S + + I +++AD ++ K++ + + + Sbjct: 4 NIAACADTKVLPGLAVTIRSSLEH-SSIPCRIHVLADRLSEQDKHKLSNSWKPHPMCQDV 62 Query: 87 TLYRINTDKLQCLPCTQVW-SRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHL 145 Y I+ + T S++ Y R F LG + +YLD D++ D+++L Sbjct: 63 VFYDIDYQNISKFRSTMYLKSKSAYSRYFISDFLGEE-SKCIYLDCDLLVLRDLAELNTA 121 Query: 146 GLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQ---YFNSGVVYLDLKKWADAKLTEKALS 202 ++G V+D+ + + L YFNSGV+ +DL +W + Sbjct: 122 KMHGKTIGSVRDISVRTADPHLFIGERLQLTNPYDYFNSGVLIIDLDRWRKLDARNHLID 181 Query: 203 ILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLL 262 + + + + + DQD +NV G T FL +NT Y++ T + Sbjct: 182 LTLERADTFHSQDQDALNVFFDGDTEFLDPVWNT-------------SQYERPDTAENRI 228 Query: 263 IHYTGATKPWH-------KWAIYPSVKYYKIA--LENSPWKDDSPRDAKSIIEFKKRYKH 313 IH G KPWH + + + + + L+ + + + P D + K+ + Sbjct: 229 IHLIGTVKPWHARYKEKLSDSYHRTEIWDRFYGVLDRTAYAGNRPWDPAGLGVVKETIES 288 Query: 314 LLVQHHYISG 323 + + ++G Sbjct: 289 KIPKMDMVTG 298 >UniRef50_C1QEC6 LPS:glycosyltransferase n=1 Tax=Brachyspira murdochii DSM 12563 RepID=C1QEC6_9SPIR Length = 242 Score = 186 bits (473), Expect = 9e-46, Method: Composition-based stats. 
Identities = 59/257 (22%), Positives = 120/257 (46%), Gaps = 22/257 (8%) Query: 24 TSECLNVAYGVDANYLDGVGVSITSIVLNNRH-INLDFYIIADVYNDGFFQKIAKLAEQN 82 E +N+ + + Y + +I SI+ N++ + F++I + +D I +L E Sbjct: 1 MQETMNICFTANDKYAPFMSATIVSILKNSKDDESFSFHVITNDISDENKMMIERLKEIK 60 Query: 83 QLRITLYRINTDK----LQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGD 138 +I Y N DK + + + ++ +++FRL L+ + +D++LYLD D++ Sbjct: 61 TFKIKYYTPNIDKYNKWFEKINYQRHYAPSIFFRLDIPNLI-INIDKVLYLDCDIIVNSS 119 Query: 139 ISQLLHLGLNGAVAAVVKDVEPMQ-EKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLT 197 +S+L ++ ++ A V+D + K E +YFNSGV+ L+ K + + L Sbjct: 120 LSELFNIDISEYFALAVEDTGDLNFLKKYKTKIGIEDKHKYFNSGVLLLNNKLYMEKNLN 179 Query: 198 EKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLIT 257 ++ + NV + DQD++N L + F+ ++N ++ Sbjct: 180 LESENYFNKYYNVIECVDQDILNYLFRDKIKFIDNKWN---------------DFSSKNI 224 Query: 258 ESTLLIHYTGATKPWHK 274 + + ++HY G K W+K Sbjct: 225 DKSAIMHYVGKIKSWNK 241 >UniRef50_B1LK07 Glycosyl transferase family 8 n=10 Tax=Enterobacteriaceae RepID=B1LK07_ECOSM Length = 630 Score = 186 bits (473), Expect = 1e-45, Method: Composition-based stats. 
Identities = 72/335 (21%), Positives = 141/335 (42%), Gaps = 36/335 (10%) Query: 25 SECLNVAYGVDANYLDGVGVSITSIVLNNR-HINLDFYIIADVYNDGFFQKIAKLAE-QN 82 E + V D NY G I SIVL++ N D ++ + + Q++ KL N Sbjct: 274 DESVPVVISFDNNYALSGGALINSIVLHSDASRNYDIVVLENKVSHLNKQRLIKLVAGHN 333 Query: 83 QLRITLYRINTD-KLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQ 141 + + + +N+ ++ + +S + Y RLF QL +++++D+D V K D++ Sbjct: 334 NISLRFFDVNSFTEMSDVHTRAHFSASTYARLFIPQLF-REYKKVVFIDSDTVVKADLAT 392 Query: 142 LLHLGLNGAVAAVVKDVE-----------------PMQEKAVSRLSDPELLGQYFNSGVV 184 LL + + + A VKD+ E+ + + +YF +G++ Sbjct: 393 LLDVEIGTNLVAAVKDIVMEGFVKFGTMSESDDGIMPAEQYLKKTLGMTNPDEYFQAGII 452 Query: 185 YLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIY------ 238 ++++ + +S L +K Y + DQD+MN + G FLP E+N + Sbjct: 453 VFNVEQMVTENTFAQLMSALKAK--KYWFLDQDIMNKVFFGRVKFLPLEWNVYHGNGNTD 510 Query: 239 TIKSELKDKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDD-- 296 LK T+ + + + +IHY G KPW+ + + + L +PW+ + Sbjct: 511 DFFPNLKFSTYMRFLQA-RRNPKMIHYAGENKPWNTEKVDFYDDFLENVLS-TPWEKEIY 568 Query: 297 ---SPRDAKSIIEFKKRYKHLLVQHHYISGIIAGV 328 P + + + +L+Q ++ V Sbjct: 569 YRQLPVATVVPNQHTELQQTVLLQTKIKRALMPYV 603 >UniRef50_B2UQN6 Glycosyl transferase family 8 n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UQN6_AKKM8 Length = 371 Score = 185 bits (470), Expect = 2e-45, Method: Composition-based stats. 
Identities = 67/295 (22%), Positives = 117/295 (39%), Gaps = 22/295 (7%) Query: 20 ANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINL-DFYIIADVYNDGFFQKIAKL 78 + V + + +GV+I ++ L+ D +I+ D + Q++ ++ Sbjct: 13 PASPEKSRIPVMFSATGGWGLPLGVAIHTLCLHASSGRFYDIHIVHDGMDARIIQELNQV 72 Query: 79 AEQN-QLRITLYRINTDKLQCLPC--TQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVC 135 A Q+ ++ ++ + +S Y RL A L R++YLDADV+ Sbjct: 73 AAPFPQVSLSFLQLPEEFRHLFQNGNKDRYSPLAYARLMAGSLFP-QYGRIVYLDADVLL 131 Query: 136 KGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLS-DPELLGQY-------FNSGVVYLD 187 GD+++L L GA A D + + E +G Y NSGV+ LD Sbjct: 132 AGDVAELYFSDLRGASVAAAGDGLALWSIEKGTMHPHLEYMGNYLSFPLSYCNSGVLVLD 191 Query: 188 LKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDK 247 L + L + L L S+ + + YPDQD++N+ L G LP E+N + + ++K Sbjct: 192 LDQMRRRNLEHRLLQQLRSRPDPFPYPDQDILNIALHGDMTTLPPEWNFQFLSWTWDEEK 251 Query: 248 THQNYKKLITESTLL--------IHYTGATKPWH-KWAIYPSVKYYKIALENSPW 293 T + +H G KPW +++ I W Sbjct: 252 TRLLRGTEFENVPTISCGRSWKLLHMVGPEKPWRLPDTPGTMGQFHWILYSFFWW 306 >UniRef50_A8AY72 Glycosyl transferase, family 8 SP1766 n=7 Tax=Streptococcus RepID=A8AY72_STRGC Length = 435 Score = 184 bits (468), Expect = 3e-45, Method: Composition-based stats. 
Identities = 59/289 (20%), Positives = 104/289 (35%), Gaps = 27/289 (9%) Query: 8 EIDKVKAWD----FRLANINTSECL-NVAYGVDANYLDGVGVSITSIVLNNRHINLDFYI 62 E++K ++ RLAN + ++ D Y+ + +I S+ H +L Y+ Sbjct: 11 ELNKRIRYNEDTIIRLANRGKMNQMKSIVLAGDYGYIRQIETTIKSLCCY--HEDLLIYV 68 Query: 63 IADVYNDGFFQKIAKLAEQNQLRITLYRINTDKLQ---CLPCTQVWSRAMYFRLFAFQLL 119 +F K + + ++ D L+ + Y R F + + Sbjct: 69 FNQDIPQEWFINTRKKVKGTGNNLFDIKLLRDDLRMKWEESTYSHINYMAYARYFIPEYV 128 Query: 120 GLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYF 179 DR LYLD D+V ++ L L L A V+ LG F Sbjct: 129 KA--DRALYLDCDLVVTQNLDHLFELDLEDYYIAAVRATF--------------GLGIGF 172 Query: 180 NSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYT 239 NSGV+ L+ K+W + + ++ + + + DQ ++N+L K L L YN Sbjct: 173 NSGVMLLNNKRWREENIPQQLVELTDREIERVLEGDQSILNMLFKEQYLELEDSYNFQIG 232 Query: 240 IKSELKDKTHQNYKKLITES-TLLIHYTGATKPWHKWAIYPSVKYYKIA 287 H + ++HY A KPW+ + + Sbjct: 233 FDMGAAQYGHDFVFDIPLSPLPAIVHYISALKPWNLLTNMRLREVWWFY 281 >UniRef50_A5VK24 Glycosyl transferase, family 8 n=18 Tax=Lactobacillales RepID=A5VK24_LACRD Length = 282 Score = 183 bits (466), Expect = 5e-45, Method: Composition-based stats. 
Identities = 59/270 (21%), Positives = 111/270 (41%), Gaps = 10/270 (3%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 +N+ + ++ ++ + + SI LN + + Y++ + ++ +Q + Sbjct: 1 MNLLFSINDKFVTQLATVLLSIKLNTQAQEFNVYVLQKD-KLKRTDDLERVCKQLGMNYF 59 Query: 88 LYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGL 147 ++N P T + +Y+RL A +LL L ++LYLDADV+C D+S L L Sbjct: 60 PIKVNDQLFNKAPVTDRYPTTIYYRLLAHRLLPQDLHKILYLDADVLCINDLSSLYETSL 119 Query: 148 NGAVAAVVKDVEPMQEK---AVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSIL 204 +G + A RL + + G Y+NSGV+ ++L + + Sbjct: 120 DGYLYASAIHTNLTNTTEVINKIRLQNFDADG-YYNSGVLLMNLDTIRKKVKDTDIFNYI 178 Query: 205 MSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKD--KTHQNYKKLITESTLL 262 + PDQDV+N L +P + T K + + + + +T++ Sbjct: 179 RTH--TLLLPDQDVLNALYGRYIKSVPDQLYNFDTRKGGIYETISFGEWTTDWVMRNTVI 236 Query: 263 IHYTGATKPWHKW-AIYPSVKYYKIALENS 291 +HY G KPW YK + + Sbjct: 237 LHYCGRDKPWLPTKNSGRYTALYKNYFQMT 266 >UniRef50_C6EQF4 Putative uncharacterized protein n=3 Tax=Campylobacter jejuni RepID=C6EQF4_CAMJE Length = 958 Score = 183 bits (466), Expect = 6e-45, Method: Composition-based stats. 
Identities = 71/294 (24%), Positives = 132/294 (44%), Gaps = 29/294 (9%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRH-INLDFYIIADVYNDGFFQKIAKLAEQNQLRI 86 + + + VD NYL + +++ S+V + + +++ + ++ + N + I Sbjct: 13 IPIVFAVDDNYLPYMSIALNSLVDRVSNCYKYNIFVMHLNIDLERLNRLKENIRNNNVTI 72 Query: 87 TLYRINT-------DKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDI 139 +N + ++ AMY+R+F ++ +++Y D+DV+ K DI Sbjct: 73 EFINLNQYLKKIFKEYGNIFYERSYFTTAMYYRIFIPEIFS-NFKKVIYCDSDVIFKADI 131 Query: 140 SQLLHLGLNGAVAAVVKDVEPM------------QEKAVSRLSDPELLGQYFNSGVVYLD 187 S L + LN +D+ + + + + YFNSGV+ D Sbjct: 132 SHLFFIDLNNKEIGACRDIAALYAYRKRETVWQQNIRNNFDKINFRSISDYFNSGVIVFD 191 Query: 188 LKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDK 247 + K K K L+++ + DN+Y +PDQDV+N++ G FLP E+N ++T E KD Sbjct: 192 IVKCIQMKTVSKCLTVIKNIDNLY-FPDQDVLNIVFCGHVHFLPLEWNFLWTTYIEYKDN 250 Query: 248 THQNYKKLITE------STLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKD 295 KK+I E +IHY TKPW + V+++K +N + + Sbjct: 251 FMYLPKKIINEIYKAKTKPKIIHYISETKPWKDKNSF-FVEWWKFPRKNLFYGE 303 >UniRef50_C7HS13 Family 8 glycosyl transferase n=2 Tax=Anaerococcus RepID=C7HS13_9FIRM Length = 276 Score = 183 bits (464), Expect = 9e-45, Method: Composition-based stats. 
Identities = 68/274 (24%), Positives = 124/274 (45%), Gaps = 13/274 (4%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKL---AEQNQL 84 +N+ D NYL+ + + S+ +N N + Y+I D ++I K A + Sbjct: 1 MNILVSCDENYLNPLKTMLYSLFESN-DTNFEIYLIHKDIRDEKIKEIEKFVIKASSKRA 59 Query: 85 RITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLH 144 ++ ++ + T ++ MY+RL A++ L LDR+LYLD DV+ +L + Sbjct: 60 KLNAIKVK-NLFSNAKITFYYTEEMYYRLLAYKYLPENLDRILYLDPDVLVLNSCEKLYN 118 Query: 145 LGLNGAVAAVVKDVEP-MQEKAVSRL---SDPELLGQYFNSGVVYLDLKKWADAKLTEK- 199 + L A P +Q V+RL S + + YFNSG++ ++LK D++ EK Sbjct: 119 MDLGDNYFAAATHTIPTVQSANVARLSISSGHKDIENYFNSGILMINLKLSRDSQTYEKE 178 Query: 200 ALSILMSKDNV-YKYPDQDVMNVLLKGMTLFLPR-EYNTIYTIKSELKDKTHQNYKKLIT 257 L+ + + ++ PDQD++NV+ + + + +YN K K + I Sbjct: 179 VLNYVKNTKSLGLIMPDQDLLNVVFRNKIIKIDEIKYNYDARRYLTYKLKDKKYNLSYII 238 Query: 258 ESTLLIHYTGATKPW-HKWAIYPSVKYYKIALEN 290 +T +H+ G KPW + + Y + Sbjct: 239 SNTCFLHFCGKRKPWLEENNLGVFTSLYLYFWKK 272 >UniRef50_C4Z1V1 Putative uncharacterized protein n=1 Tax=Eubacterium eligens ATCC 27750 RepID=C4Z1V1_EUBE2 Length = 607 Score = 183 bits (464), Expect = 1e-44, Method: Composition-based stats. 
Identities = 59/324 (18%), Positives = 127/324 (39%), Gaps = 33/324 (10%) Query: 14 AWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNR-HINLDFYIIADVYNDGFF 72 +++ + V + ++ Y + V + S+ ++ ++ D + Sbjct: 259 SYNREHEEYMAVSRIPVFFSINEQYAPYLAVCLKSLAVHVACDERYRIIVMCDNVKNITM 318 Query: 73 QKIAKLAEQN-QLRITLYRIN--------------TDKLQCLPCTQVWSRAMYFRLFAFQ 117 ++ + + + I I TD+ + + ++ +YFRLF + Sbjct: 319 IQLRNVIKDYENIDIEFVDIRKKMYEYSESFGQTVTDRQENRLYSGEFTLTIYFRLFIAE 378 Query: 118 LLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPEL--- 174 L L++ +Y+D+D V DI++L + + A+ V+D + ++ + + Sbjct: 379 LFP-ELNKAVYIDSDTVINDDIAKLYSVDMGDAMFGAVRDTFAGKNTILAHYIENVVGIE 437 Query: 175 LGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREY 234 +Y NSGV+ ++L K A L ++ L ++ PDQD +N + FL +E+ Sbjct: 438 RNEYVNSGVLLMNLDKIRQAHLADRFLKLMAEYHFDSVAPDQDYINSMCAKEIYFLDKEW 497 Query: 235 NTIYTIKSELKDKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWK 294 N + E + LIHY KPWH ++ P +Y+ S + Sbjct: 498 NVMPNKGGEYIAR------------PKLIHYNLFDKPWH-YSEIPYEEYFWQYAAESGFY 544 Query: 295 DDSPRDAKSIIEFKKRYKHLLVQH 318 + K + +K+ ++ Sbjct: 545 PLLIKQRKQYGDNEKKADRENLKK 568 Score = 106 bits (266), Expect = 9e-22, Method: Composition-based stats. 
Identities = 52/277 (18%), Positives = 101/277 (36%), Gaps = 58/277 (20%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVY----------NDGFFQKIAK 77 +N+ Y D G+ +S S++ N L+ YI+ Y + F + + + Sbjct: 1 MNILYCGDKTMQKGILLSSMSLIK-NVDEPLNIYILTVDYGEKGINYKPVDKAFAKYLEE 59 Query: 78 LAEQNQLRITLYRINT-----DKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDAD 132 ++ +++ ++ ++ ++L ++ RLFA + DR+LYLD D Sbjct: 60 KLNKSDIKVNVFLVDVTRYFVEELPEANMQSRFTACCMLRLFADK--TDIKDRVLYLDTD 117 Query: 133 VVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWA 192 V+C+ H ++G A V D Y NSGV+ ++++ Sbjct: 118 VLCRKGFRDFYHQNMDGIEIAGVSDYYGRW----------LFGDGYINSGVMLMNMRMIR 167 Query: 193 DAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNY 252 L EK + K+ PDQ +N R++N + Sbjct: 168 QNGLLEKCREQCIRKE--MFMPDQTAVN-TFATRVNLCGRKFNDQRRLH----------- 213 Query: 253 KKLITESTLLIHYTG-----------ATKPWHKWAIY 278 ++T+ H+T + KPW ++ Sbjct: 214 -----DNTVFQHFTTTFRVFPVIRTVSVKPWEIDKMH 245 >UniRef50_C0AA16 Lipopolysaccharide biosynthesis protein LPS:glycosyltransferase-like protein n=1 Tax=Opitutaceae bacterium TAV2 RepID=C0AA16_9BACT Length = 726 Score = 182 bits (462), Expect = 2e-44, Method: Composition-based stats. 
Identities = 69/297 (23%), Positives = 129/297 (43%), Gaps = 29/297 (9%) Query: 26 ECLNVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKIAKLAEQNQL 84 C+N+A+ D ++ + V+I SIV + N D I+ + + + I + + Sbjct: 403 NCINIAFNCDDKFVPYLCVAIKSIVATASTENNYDILILTEGLSPANLKWIDGIKHAKNV 462 Query: 85 RITLYRINT----DKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDIS 140 + + + + + SR Y RL+ +LL ++LYLD D++ + D++ Sbjct: 463 SLRVVNVRDYLQDKDISSFFMRSMVSRIAYVRLYLGELL-EKYAKVLYLDCDLIAQSDVA 521 Query: 141 QLLHLGLNGAVAAVVKD----VEPMQEKAVSRLSDPEL--------LGQYFNSGVVYLDL 188 +L ++ L+G V A V D E ++ A R D L + QYFNSGV+ DL Sbjct: 522 ELFNMNLDGNVCAAVPDLAISTETIKNVAAYRDIDVYLRDVLGVTDISQYFNSGVMVFDL 581 Query: 189 KKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKT 248 +K L + ++ N + DQ+V+N L G L L E+N ++ +D T Sbjct: 582 EKIRTDNLQQTFIAAAAK--NTKFFMDQNVLNSALYGKVLLLGFEWNKRVSLAMANRDTT 639 Query: 249 HQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSII 305 ++ ++H+ KP K + +++ A + +++ R K Sbjct: 640 TES---------KILHFAAEPKPLQKIHMPEHYNWWEYARQLPFYEELLSRVIKPSS 687 >UniRef50_A3V3C9 Putative uncharacterized protein n=1 Tax=Loktanella vestfoldensis SKA53 RepID=A3V3C9_9RHOB Length = 324 Score = 181 bits (460), Expect = 2e-44, Method: Composition-based stats. 
Identities = 62/306 (20%), Positives = 117/306 (38%), Gaps = 32/306 (10%) Query: 29 NVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITL 88 +V + D +YL ++I +++ NN + D I + E I Sbjct: 17 SVIFCADQSYLPFASLAIHTLLRNNPVRDYDICI-------ASVDALVPPTELKDHDIRF 69 Query: 89 YRINT-DKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGD-ISQLLHLG 146 +I+ + +P ++ +S A Y R+ + DR+ YLDADV GD I + L Sbjct: 70 CQIDVGNAFDGMPVSKRFSLAAYLRIALPEAFAGQYDRIFYLDADVFVVGDAIDAVFRLD 129 Query: 147 LNGAVAAVVKDVEPMQ--EKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSIL 204 + V D+ ++ K + G YFNSGV+ D++++ ++ E+ Sbjct: 130 MLSCPVGAVTDITKLKHPNKPTFDQKALGVDGPYFNSGVMLFDVERFITMRVRERCAEAA 189 Query: 205 MSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIH 264 Y DQ ++N++L+ L +N + L + ++H Sbjct: 190 KFYQGEPIYFDQTLLNIVLQKEWAQLNLGWNWQWPFSRSLFECFI---------DVQIVH 240 Query: 265 YTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSII----------EFKKRYKHL 314 + G KPW +KY + A ++ P A+ I + ++H+ Sbjct: 241 FIGDDKPWSDHKRRLPLKYRETARR--FFQKFYPELAQKIPAADAALRNGALYHYFFRHI 298 Query: 315 LVQHHY 320 H + Sbjct: 299 TKIHLF 304 >UniRef50_A1VG39 Glycosyl transferase, family 8 n=1 Tax=Desulfovibrio vulgaris DP4 RepID=A1VG39_DESVV Length = 335 Score = 180 bits (456), Expect = 9e-44, Method: Composition-based stats. 
Identities = 49/286 (17%), Positives = 114/286 (39%), Gaps = 22/286 (7%) Query: 26 ECLNVAYGVDANYLDGVGVSITSIVLNNRHINL-DFYIIADVYNDGFFQKIAKLAEQNQL 84 + + + DANY V++ S+ N + Y++ + + G I + + Sbjct: 2 NTVPIVFTFDANYRLPASVALQSLFENAKDSTYYHVYLVCEGLSRGDKDAIESICPEKNG 61 Query: 85 RITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLH 144 R+ ++ + P ++ W + +Y R+ LL D+++Y D DVV D++++ Sbjct: 62 RVEWIDVDNELFSSAPSSENWPKIVYARILLPLLLP--FDKVIYSDVDVVFCSDLAEIFQ 119 Query: 145 LGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQ---YFNSGVVYLDLKKWADAKLTEKAL 201 + ++G A V ++ V+R + Q + SG + ++L+ + + L Sbjct: 120 IEVDGCEWAGVAAELVAFQEGVARCHNVHCEYQNELIYMSGFMVMNLRLMREKDTVGRCL 179 Query: 202 SILMSKDNVYKYPDQDVMNVLLKGMTLFLPREY----NTIYTIK-------SELKDKTHQ 250 + + + K D +++N+ + Y N + L+ Sbjct: 180 NNISKFGSRLKMYDLEILNMS-SDNIARIDFSYCVLENVFFAKNVSEAKEYPWLRGLYRV 238 Query: 251 NYKKLITESTLLIHYTGAT-KPWHKWAIYPSVKYYKIALENSPWKD 295 + + + +IH+ G+ K W ++ + Y+ L SP++ Sbjct: 239 SELEAARSAPRIIHFAGSDTKVWERYCVPQV---YRKYLAVSPFRS 281 >UniRef50_C0X9Z7 Putative uncharacterized protein n=2 Tax=Lactobacillus gasseri JV-V03 RepID=C0X9Z7_9LACO Length = 416 Score = 179 bits (454), Expect = 1e-43, Method: Composition-based stats. 
Identities = 59/267 (22%), Positives = 114/267 (42%), Gaps = 16/267 (5%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLY 89 +A + Y+D + +I SI+ N + N++ +++ +F I + A Q RI Sbjct: 5 IALSANYGYIDKIETTIKSILYNVK--NVEIHLLNYDIPQEWFANINRYANQIGSRIIDE 62 Query: 90 RINTDKLQCLPC-TQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLN 148 + + ++L L + ++ Y RL +L+ R+LYLD+D+V +I +L N Sbjct: 63 KFDPEELHDLNSGFKHINQMTYARLLIPKLIKAN--RVLYLDSDLVVDDEIDELFSRKFN 120 Query: 149 GAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWAD-AKLTEKALSILMSK 207 G V + ++ K SR+ P N+GV+ ++ ++ L+EK L ++ Sbjct: 121 GKKILAVTHIFDVRNKNESRVDLPV---PSINAGVLLINNQELRKDHNLSEKLLDF--AR 175 Query: 208 DNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLIT-----ESTLL 262 N + DQD +N K L +YN L + N + ++ + Sbjct: 176 KNNFPQDDQDTINNWFKDEIGSLSFKYNYQIGADRFLFWSNNSNTETATEILDKVKNPKI 235 Query: 263 IHYTGATKPWHKWAIYPSVKYYKIALE 289 IHY KP++ ++ + + Sbjct: 236 IHYISDDKPFNIFSEGRMRETWWFYRN 262 >UniRef50_Q50FU8 Cj81-079 (Fragment) n=6 Tax=Campylobacter jejuni RepID=Q50FU8_CAMJE Length = 333 Score = 179 bits (454), Expect = 1e-43, Method: Composition-based stats. 
Identities = 78/337 (23%), Positives = 134/337 (39%), Gaps = 51/337 (15%) Query: 29 NVAYGVDANYLDGVGVSITSIVLNNR------HINLDFYIIADVYNDGFFQKIAKLAEQ- 81 N+ D NY+ V V I SI+ N + FYI+++ + K+ KL + Sbjct: 6 NIVISCDNNYVKYVAVVIASIIKNTKINSQLKEYPYKFYILSNDISKNNILKLKKLIQHL 65 Query: 82 ----NQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKG 137 + +++I+ K P + A Y+R ++ + LYLDADV+ G Sbjct: 66 SNSYYNCELIIHKIDDSKFHRFPKAWHVNHATYYRFEIADIVEGN--KCLYLDADVLVCG 123 Query: 138 DISQLLHLGLNGAVAAVVKD-VEPMQEKAVSRLSDPELLGQ-----YFNSGVVYLDLKKW 191 DI +L ++ LN VA VV D + K ++ + + YFN+GV+ +DL +W Sbjct: 124 DIRELFYMELNNKVAGVVTDSCSRLWTKLYTKDNKTSSYIEFDPLMYFNAGVILIDLNQW 183 Query: 192 ADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQ- 250 + K + D+ DQ +N+ LK +T LP +N I L Sbjct: 184 KKHDIKNKCIDAFNIYDHG-GLADQSYLNIALKELTYKLPLNWNLIVPEYILLDGYERHY 242 Query: 251 -----------------NYKKLITESTLLIHYTGATKPW-------HKWAIYPSVKYYKI 286 + + ++ ++H+ A KPW +K +++I Sbjct: 243 VVNCLDEISEYNLAYTRSEFEEAMKNKKIVHFC-AAKPWWNLYYKNNKVDFNERNVWWEI 301 Query: 287 ALENSPWKDDSPRDAKSIIEFKKRYKHLLVQHHYISG 323 AL +K++ S+ KHL Q + I Sbjct: 302 ALNLEEFKEEFYFLKNSL-----DSKHLNRQLNTIEW 333 >UniRef50_C4ZG45 Glycosyltransferase Family 2 modular protein n=1 Tax=Eubacterium rectale ATCC 33656 RepID=C4ZG45_EUBR3 Length = 723 Score = 177 bits (449), Expect = 6e-43, Method: Composition-based stats. 
Identities = 57/280 (20%), Positives = 114/280 (40%), Gaps = 23/280 (8%) Query: 25 SECLNVAYGV---DANYLDGVGVSITSIVLNNRHINLDFYIIADV-YNDGFFQKIAKLAE 80 +++ G+ D NY G ++ SIV N + + F+I+ D N+ K++ +A+ Sbjct: 340 DNAIHICLGIHDKDGNYSVWAGTTMQSIVENTK-APIVFHILHDDTLNEMNKNKLSLIAD 398 Query: 81 QNQLRITLYRINTDKLQCLPCT-QVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDI 139 + I + N D L + ++ FR+ ++ L +++YLD+D+ DI Sbjct: 399 NSGNGIEFHHFNPDIFGSLADSMNRFTIGTMFRIMLPDIMP-DLKKIIYLDSDLFVNTDI 457 Query: 140 SQLLHLGLNGAVAAVVKDVEPMQEKAVSR--LSDPELLGQYFNSGVVYLDLKKWADA-KL 196 +L +L ++ A +D ++ + +YFN+GV+ ++L L Sbjct: 458 EELWNLNIDNYCLAAAQDCSTIRNWGTPYAVAAGQTSRDRYFNAGVLCMNLDNIRKNGSL 517 Query: 197 TEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLI 256 ++ + L PDQD +N + G TL + ++N + +K + Sbjct: 518 FQQVMDYLSDNP-RTWLPDQDALNAIFSGKTLLIDEKWNYFIDEARKNNEKAEKKIYHYA 576 Query: 257 TESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDD 296 + L++H YY L +PW + Sbjct: 577 -ATLLMLH----------TNNEIDRAYYFTILR-TPWGEQ 604 >UniRef50_Q046Z9 Lipopolysaccharide biosynthesis glycosyltransferase n=32 Tax=Lactobacillus RepID=Q046Z9_LACGA Length = 317 Score = 177 bits (448), Expect = 6e-43, Method: Composition-based stats. 
Identities = 60/299 (20%), Positives = 125/299 (41%), Gaps = 28/299 (9%) Query: 27 CLNVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKIAKLAEQNQLR 85 + V Y + NY VSI S++ + +++ + ++ +D + + L + Sbjct: 3 TIPVFYTISDNYTPYAAVSIQSLIDHVDQNKDYTITLLVQNISDKHKKDLEDL-SIKNVH 61 Query: 86 ITLYRINTDKL-------QCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGD 138 + ++ I+ + + + Q ++ ++++RLF L D+ +YLDAD + D Sbjct: 62 VNIFHIDDEMVAPIHNSEENYLRAQFFTMSIFYRLFIPNLFP-QYDKAVYLDADTIICTD 120 Query: 139 ISQLLHLGLNGAVAAVVKDVEPMQEK----AVSRLSDPELLGQYFNSGVVYLDLKKWADA 194 I++L + + + A V D+ K + +Y N+GV+ ++K + D Sbjct: 121 IAELYNTEIGDNMFASVPDMSIRFIKPLQVYIKECQGIFPPEKYINNGVILFNMKAFRDK 180 Query: 195 KLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKK 254 K +K S++ PDQ MN + + LP E++ + N Sbjct: 181 KFVDKFYSLIEKYHFDNIDPDQAYMNEICEDKIYHLPLEWDAM------------PNEHM 228 Query: 255 LITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSII-EFKKRYK 312 ++ ++HY KPWH +A KY+ + SP+ + + E +K+ + Sbjct: 229 DEIKNPKIVHYNLFFKPWH-FADVQYGKYFWDVAKKSPYYGELKEQLANFTDEDRKKAR 286 >UniRef50_Q38VG7 Putative glycosyl transferase, family 8 n=1 Tax=Lactobacillus sakei subsp. sakei 23K RepID=Q38VG7_LACSS Length = 304 Score = 176 bits (447), Expect = 9e-43, Method: Composition-based stats. 
Identities = 48/258 (18%), Positives = 115/258 (44%), Gaps = 18/258 (6%) Query: 42 VGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLYRINTDKLQCLPC 101 + +SI +++ + ++ +II ++ + + I L N + ++ ++ ++ Sbjct: 1 MSISIATLLKKHMEDEINIFIITSNISEKYIKVIEGLF--NNPKHNIFWVSMPEIDIPLE 58 Query: 102 TQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPM 161 T S A Y RLF +L+ + RL+YLD D + + ++ +L L + +D Sbjct: 59 TDRGSLAQYGRLFFDRLIPENIQRLIYLDCDTLIEENLRELWVTDLGENTIGIARDAFSD 118 Query: 162 QEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNV 221 + K +L E + FNSGV+ +D W + ++ ++ + +L K DQ V+++ Sbjct: 119 RYK---KLLGLEKDSELFNSGVMIIDRGSWNEKRIEDRIIDLLTEKRGRISQGDQGVIDI 175 Query: 222 LLKGMTLFLPREYNTI-----YTIKSELKDKTHQNYKK-----LITESTLLIHYTGA--- 268 + + L ++N++ +T LK + + + + ++H+T + Sbjct: 176 IFQNDAKILDPKWNSMSSYFDFTYDDFLKYRQVKEFYSKQLILEAIQKPAIVHFTSSFLN 235 Query: 269 TKPWHKWAIYPSVKYYKI 286 +PW + + +++ Sbjct: 236 NRPWIFGSTHRYKNHWRR 253 >UniRef50_C6IB51 Glycosyltransferase n=4 Tax=Bacteroides RepID=C6IB51_9BACE Length = 417 Score = 176 bits (447), Expect = 1e-42, Method: Composition-based stats. 
Identities = 59/234 (25%), Positives = 97/234 (41%), Gaps = 14/234 (5%) Query: 57 NLDFYIIADVYNDGFFQKIAKLAEQNQL-RITLYRINTDKLQCLPCTQ-VWSRAMYFRLF 114 N+ YI+ D + + + ++ I I+++ + L + +R Sbjct: 2 NISIYILTDYISLESKEFLQEIKNVFTCVTIQWEIIDSESFKQLKKKGGYITEHTLYRYA 61 Query: 115 AFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPEL 174 L LD+ LYLDAD+V G I L L L G A V D+ ++ ++ + Sbjct: 62 IADLFP-NLDKALYLDADLVINGSIEPLWELDLEGYYCAGVDDIF-IRRINYRKILELAE 119 Query: 175 LGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREY 234 Y N+GV+ L+LK K+ EK L N +Y DQD +N + KG +P Y Sbjct: 120 KDVYINAGVLLLNLKDLRKDKIQEKLLQHTSIYINRDRYQDQDAINCICKGKIKLIPNIY 179 Query: 235 NTIYTIKSELKDKTHQNYKKLITESTLLIHYTGATKPWHKWAIYP-SVKYYKIA 287 N + + + ++IHYTG+ KPWH+ + + Y Sbjct: 180 NFTTS---------ETLHTPEMLSDIIIIHYTGSIKPWHQEYTWQVLKELYCKY 224 >UniRef50_A4WWT5 Glycosyl transferase, family 8 n=1 Tax=Rhodobacter sphaeroides ATCC 17025 RepID=A4WWT5_RHOS5 Length = 319 Score = 176 bits (446), Expect = 1e-42, Method: Composition-based stats. 
Identities = 63/249 (25%), Positives = 98/249 (39%), Gaps = 23/249 (9%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLY 89 V + D NY + I + H D I + I + ++L + Sbjct: 17 VIFCCDRNYYPYAMFAAAQIAGRHPHRGFDICI-------ASLEAIEEPPSLSELAVRHC 69 Query: 90 RINT-DKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKG-DISQLLHLGL 147 I+ + Y RL + DR+LYLD+D+ +G D+ L+ L L Sbjct: 70 TIDAAHLFADFGLDDRRTAVTYLRLVLPEAFSEDYDRILYLDSDIYIQGGDLGALIALPL 129 Query: 148 NGAVAAVVKDVEPMQEKAVSRLSDPELLG----QYFNSGVVYLDLKKWADAKLTEKALSI 203 G A V+D + + + R+ D + LG YFNSGV+ D+ + A L ++AL I Sbjct: 130 AGRPLAAVRDNKQWRTPS-RRMVDFDRLGLPQRPYFNSGVLLFDVPAFRAANLLQEALRI 188 Query: 204 LMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLI 263 S+ DQ ++N + G L +N YT S L +I Sbjct: 189 GRSQGRQLVRHDQSLLNACMLGNWAELSPSWNWQYTWSSRLF---------AAMLGPNII 239 Query: 264 HYTGATKPW 272 H+ G KPW Sbjct: 240 HFIGRCKPW 248 >UniRef50_C3XKY2 Glycosyl transferase n=2 Tax=Campylobacterales RepID=C3XKY2_9HELI Length = 433 Score = 175 bits (445), Expect = 2e-42, Method: Composition-based stats. 
Identities = 71/344 (20%), Positives = 128/344 (37%), Gaps = 67/344 (19%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLN-NRHINLD--------------------------- 59 ++ D NY+ V ITS++ N N + Sbjct: 2 FHIILSADENYIKYASVLITSVIYNTNPKLTFKDFCQKEGFKALKNSYFSAYQNIDFSKL 61 Query: 60 ----------FYIIADVYNDGFFQKIAKLAE----QNQLRITLYRINTDKLQCLPCTQ-- 103 F+I++D + ++ +L I + IN + + P + Sbjct: 62 SKQEAQEGYIFHILSDSISSTTQNQLTELQNTLNTIYPCEILTHIINDKEFENFPISGAA 121 Query: 104 VWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQE 163 + Y+RL L ++ + LYLD+D++C D+ +L + L V A + D + Sbjct: 122 HSNHLPYYRLKLDSYLDDSITKCLYLDSDMLCLCDLRELFAIDLKDFVVAAINDPGTKKR 181 Query: 164 KAVSRLSDPELL----GQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVM 219 K + + +++ YFNSG + ++ + + K+ EK L K K DQD++ Sbjct: 182 KIKYKENGKKMILNFNDNYFNSGFLLINTQNYKQHKIQEKC-ENLAKKCYYIKAADQDLL 240 Query: 220 NVLL-KGMTLFLPREYN------TIYTIKSELKDKTHQNYKKLIT--ESTLLIHYTGATK 270 N + K L LP YN I K E K + + + + ++ +IHY K Sbjct: 241 NATIPKEKLLKLPIAYNFSSISFCIAICKDEQKHRLNCTRAEFMESYKNPKIIHY--GEK 298 Query: 271 PWHKWAIYPSVK------YYKIALENSP-WKDDSPRDAKSIIEF 307 PW Y + K + + +P + SI E+ Sbjct: 299 PWKFLQSYVNSKGENINDLWWHYAKITPSFSTQLLESKASIKEY 342 >UniRef50_C6X2V2 Putative glycosyltransferase n=1 Tax=Flavobacteriaceae bacterium 3519-10 RepID=C6X2V2_FLAB3 Length = 315 Score = 175 bits (445), Expect = 2e-42, Method: Composition-based stats. 
Identities = 61/293 (20%), Positives = 118/293 (40%), Gaps = 23/293 (7%) Query: 26 ECLNVAYGVDANYLDGVGVSITSIVLNN-RHINLDFYIIADVYNDGFFQKIAKLAE-QNQ 83 L + + D +Y V I+SI+ N+ R+ + I+++ +D K+ + ++ Sbjct: 7 NLLPIVFTCDDHYFKYAAVVISSIIHNSSRNTKYEINIVSEYISDENQSLAQKMVQSKSN 66 Query: 84 LRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLL 143 + I + I + + S + Y+R F F LL DR+LYLD+D++ DIS Sbjct: 67 ISIQFHAIKIENPEVFHLNSYMSLSTYYRFFIFDLLK-DYDRVLYLDSDLIVDNDISFFA 125 Query: 144 HLGLNGAVAAVVKDV-----------EPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWA 192 + A + + +++ + +YFN+GV+ ++K Sbjct: 126 DIDFENKPAICCPSIYVQNSLKNNTDHKFTREYFTQILKMSDVDEYFNAGVILFNIKLIR 185 Query: 193 DAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGM--TLFLPREYNTIYTIKSELK----D 246 + K + + + Y DQD++N +L+ + EYN T+K LK + Sbjct: 186 AQGIDRKFFEAIKNIKDPV-YQDQDILNSVLRNNGGAKLISNEYNHTKTMKFSLKRIFLN 244 Query: 247 KTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPR 299 + K + HY G KPW + P + +P+ + + Sbjct: 245 ALKNKFGKKRNNWFTIYHYVGKVKPWQNF--NPDSALFLYYAYKTPFVREILK 295 >UniRef50_C5EZG9 Glycosyl transferase family protein n=1 Tax=Helicobacter pullorum MIT 98-5489 RepID=C5EZG9_9HELI Length = 374 Score = 175 bits (444), Expect = 2e-42, Method: Composition-based stats. 
Identities = 59/308 (19%), Positives = 122/308 (39%), Gaps = 33/308 (10%) Query: 58 LDFYIIADVYNDGFFQKIA----KLAEQNQLRITLYRINTDKLQCLPC-TQVWSRAMYFR 112 +F+++ D + +K+ +L++ + ++ + + + T + Y+R Sbjct: 9 YNFHLLMDFVSQETKEKLQNLILELSKIYPCTLNIHILEDEIFRTQSLRTLNGNYLAYYR 68 Query: 113 LFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVV-----KDVEPMQEKAVS 167 L L L++ R +YLD D++ GD+ +L + L G + VV D + + E Sbjct: 69 LRIGSALPLSIKRCVYLDVDMIVLGDLRELFKINLQGKICGVVMEGKDNDTQNILESKNK 128 Query: 168 RLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMT 227 ++ YFNSG++ +DL W + ++A I+ D+ ++N +L+G T Sbjct: 129 INKSIAIVSNYFNSGMLLVDLDLWRKENIEDRAFEIVKKYYCHK--HDEHILNAVLQGQT 186 Query: 228 LFLPREYNTIYTIKSEL-----KDKTHQNYKKL----ITESTLLIHYTGATKPWHKWAIY 278 + ++N + + + K + Y + ++ ++HY KPW IY Sbjct: 187 FKILPQWNMMVFLYCRAVCLNERGKINMPYNRKDFNNALKNPKILHYHTHHKPWEDSKIY 246 Query: 279 ------PSVKYYKIALENSPWKDDSPRDAKSIIEFKKR------YKHLLVQHHYISGIIA 326 +Y+ +E +P + K + YK L + +I Sbjct: 247 LNYCNKFLGQYWWDMVEQTPIFKEKLLQLKPQADSALAFQCLVGYKLLRYYQKGLFILIP 306 Query: 327 GVCYLCRK 334 Y K Sbjct: 307 FYTYFLIK 314 >UniRef50_C7XX93 Glycosyl transferase, family 8 n=1 Tax=Lactobacillus coleohominis 101-4-CHN RepID=C7XX93_9LACO Length = 398 Score = 174 bits (442), Expect = 3e-42, Method: Composition-based stats. 
Identities = 62/264 (23%), Positives = 102/264 (38%), Gaps = 28/264 (10%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLY 89 + D +Y + +I SIV + R + Y+I +F + +Q + Sbjct: 7 IVLSGDNHYTAQITTTIKSIVYHLRR--VKIYLINSDIPQEYFFNLNLRLKQLDSELVDL 64 Query: 90 RINTDKLQCLPCTQ-VWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLN 148 +IN + + S+ Y RL QL+ T DR LY+D+D + IS+L + L Sbjct: 65 KINPELFSNAESPKAHISKITYGRLMIPQLV--TEDRALYIDSDAIVDQSISELWTMDLG 122 Query: 149 GAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADA-KLTEKALSILMSK 207 A V DV L FN+G++ + KK + L + L+ K Sbjct: 123 DYPIAAVHDVF---------------LADIFNAGIILFNNKKLREDPDLVDNMLAAAQQK 167 Query: 208 DNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSE--LKDKTHQNYKKLITE--STLLI 263 DQ V+N L L EYN + + L + Y + + +I Sbjct: 168 G--ILDADQTVLNQFFNHQYLELGLEYNYVIGYDRDVSLAPRNAPGYFEKMLNCPQPKII 225 Query: 264 HYTGATKPWH-KWAIYPSVKYYKI 286 HY KPW+ + A K+++ Sbjct: 226 HYASPDKPWNLQSAGRMREKWWQY 249 >UniRef50_D1N145 Glycosyl transferase family 8 n=1 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1N145_9BACT Length = 311 Score = 173 bits (439), Expect = 7e-42, Method: Composition-based stats. 
Identities = 66/272 (24%), Positives = 121/272 (44%), Gaps = 13/272 (4%) Query: 24 TSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQ 83 +E + VA D NYLD V+ S++ + + +++ + ++ F + L + Sbjct: 1 MAEDIQVAMATDRNYLDYALVAAASLLAQHPGGGITLHLLHEELDESDFARFEALRRIDG 60 Query: 84 LRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLL 143 R+ +I Q P WS + Y+RL LL L+++LYLD D++ DI++L Sbjct: 61 FRLVPRKIERGFFQGWPEL-RWSTSAYYRLILPSLLP-DLEKILYLDCDLLVLDDIAELW 118 Query: 144 HLGLNG--AVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKAL 201 + L AA V+ Q+K YFNSGV+ +L+K A ++ + Sbjct: 119 NTELGSRSCAAAAVRVAPEHQKKI-----GLPAEAVYFNSGVMLFNLRKMAHENHEKRFI 173 Query: 202 SILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLIT--ES 259 + KYPDQD++N+ + L + +N + ++ + +++ Sbjct: 174 RLFDELGGRIKYPDQDILNLAYWNDYVKLSQRWNLVTSVYRNPPTPALYSEAEVVEALRR 233 Query: 260 TLLIHYTGATKPWH--KWAIYPSVKYYKIALE 289 + H+TG KPW K +P +Y++ E Sbjct: 234 PGIAHFTGTHKPWRLGKTTHHPYARYFRAYAE 265 >UniRef50_B7AUG6 Putative uncharacterized protein n=1 Tax=Bacteroides pectinophilus ATCC 43243 RepID=B7AUG6_9BACE Length = 301 Score = 172 bits (437), Expect = 1e-41, Method: Composition-based stats. 
Identities = 59/276 (21%), Positives = 112/276 (40%), Gaps = 22/276 (7%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYII-ADVYNDGFFQKIAKLAEQNQL-- 84 +N+ ++ ++ V +TS++ NN N+ ++ D + I +L Sbjct: 1 MNILVAMNDAFVKCYQVMLTSLIKNNPDENITVHVPYTDGLSRKGLDSIKELVRNQSHGS 60 Query: 85 -RITLYRINTDKLQCLPCT--QVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQ 141 + Y D+L L +WS M+FR+FA + + + DR+L+LD D++ G I Sbjct: 61 ASVREYYFGKDRLGSLDKLPLGMWSVEMFFRIFAQEFIPESEDRILWLDGDIIVNGSIKD 120 Query: 142 LLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQ--YFNSGVVYLDLKKWADAKLT-E 198 + + A +D+ K + + Y NSGV+ ++LK + +T + Sbjct: 121 FYNTDFDSMYYAACEDIAISHGKIKEEYDNLGWSSEEIYVNSGVLLINLKALRNNGITRD 180 Query: 199 KALSILMSKDNVYKYPDQDVMNVLLKGMTLFLP-REYNTIYTIKSELKDKTHQNYKKLIT 257 A+ + + YPDQ ++N + F YN + S +I Sbjct: 181 AAVEYALENMDKLHYPDQYMLNAMFHDKIKFADAFRYNCQVSGYS-------YKLADMIL 233 Query: 258 ESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPW 293 + ++H+ G +PW K+Y A+ W Sbjct: 234 SESAILHFPG-YRPWQTD----YQKHYSSAIPGDIW 264 >UniRef50_A7H2M2 Glycosyl transferase family 8 n=3 Tax=Campylobacter jejuni RepID=A7H2M2_CAMJD Length = 381 Score = 171 bits (434), Expect = 3e-41, Method: Composition-based stats. 
Identities = 77/335 (22%), Positives = 130/335 (38%), Gaps = 57/335 (17%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNN-------------RHINLDFYIIADVYNDGFFQK 74 ++ + NY+ V +TSI+ F+I++D ++ + Sbjct: 2 FHIVLNANENYIKYAAVLMTSIIQKTDLNKSMSEFCNFDTDEGYVFHILSDHISESMKVR 61 Query: 75 IAKLAEQ----NQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLD 130 I+ L +Q +I L+ +N D+ + + + Y+R+ +L L LYLD Sbjct: 62 ISNLEKQLNDIYPCKIVLHILNDDEFKGM-LKWRGNYLAYYRIKMASVLPQNLKICLYLD 120 Query: 131 ADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPE-------LLGQYFNSGV 183 D++C GD+ +LL + +N AAV D ++ S + +YFNSG Sbjct: 121 CDMLCFGDLRELLSVDINNYQAAVCLDGNNHKKNKKVFFSLKGREKYKFSNIEKYFNSGF 180 Query: 184 VYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNT------- 236 + ++L +W + K++ L YPDQD +N L TL LP +N Sbjct: 181 ILVNLDRWRRDNIENKSIDFLKKF--KTLYPDQDALNFAL-NDTLLLPNRWNFSLGYFVA 237 Query: 237 -----IYTIKSELKDKTHQNYKKLITE----STLLIHYTG-ATKPW-----------HKW 275 + H NY K E + + H+ KPW + Sbjct: 238 FLKNSQEILFLNQTKYPHLNYTKTEFENEVKNIKIAHFILDPFKPWDAFQYSIVNDDLQL 297 Query: 276 AIYPSVKYYKIALENSP-WKDDSPRDAKSIIEFKK 309 YP K+Y +N+P + D +SI E K Sbjct: 298 IEYPFYKHYWSVAKNTPEFYLDFLVQKESINEHKA 332 >UniRef50_UPI00016B2258 glycosyl transferase, family 8 n=1 Tax=candidate division TM7 single-cell isolate TM7c RepID=UPI00016B2258 Length = 327 Score = 170 bits (432), Expect = 5e-41, Method: Composition-based stats. 
Identities = 68/299 (22%), Positives = 123/299 (41%), Gaps = 23/299 (7%) Query: 24 TSECLNVAYGVDANYLDGVGVSITSIVLNNRHIN-LDFYIIADVYNDGFFQKIAKLA-EQ 81 LNV Y D NY +SI S++ NN+H+ ++ + + K K+ Sbjct: 2 NKGILNVIYQSDDNYAVVSAISIVSLMENNKHLKQINIFYLGHQLKKDSINKFNKMVGNY 61 Query: 82 NQLRITLYRIN---TDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGD 138 + IT ++ + + ++++ AF L + DR+LY++ V G Sbjct: 62 HNATITFVDVSSYPDELKEIGVKAWKGLYITWYKMLAFAKLDIKTDRILYINPHTVISGA 121 Query: 139 ISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTE 198 + LL L V A+ D + A + + + YFN G++ ++ KKW K+ Sbjct: 122 LDGLLELDFEDNVMALSYDATMVN--AHKDVIGLKPIDGYFNCGIMLINHKKWMKDKIDA 179 Query: 199 KALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTI----------YTIKSELKDKT 248 K L + N Y+ DQD+ NV KG + EYN Y + ++ Sbjct: 180 KMREHL--RYNHYEVADQDLCNVFFKGNIKKVGVEYNFSTVFYGYDIKKYIKANGFLPES 237 Query: 249 HQNYKKLITE--STLLIH--YTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKS 303 +Y +++ + +IH + +PW + P ++ L +PWK+ + AK Sbjct: 238 FYSYDEIMESYYTPKIIHSQFGMNGRPWQQGNENPVGILWRKYLNLTPWKNATMPVAKK 296 >UniRef50_A0ZYL4 Putative uncharacterized protein n=1 Tax=Archaeal BJ1 virus RepID=A0ZYL4_9CAUD Length = 286 Score = 170 bits (432), Expect = 5e-41, Method: Composition-based stats. 
Identities = 76/292 (26%), Positives = 132/292 (45%), Gaps = 24/292 (8%) Query: 27 CLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIAD-VYNDGFFQKIAKLAEQN-QL 84 LNV Y + +S S++ NN+ +++ YI+++ N+ FF+ + L E + L Sbjct: 2 TLNVCYIAGGDSWVPCYISAYSVLENNQDLDIHMYILSEEDNNNPFFEHVEYLYESHPSL 61 Query: 85 RITLYRINTDKLQCLPCT-QVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLL 143 I ++ D+ LP + S +YF++ +LL +L LDAD +C G +S LL Sbjct: 62 EIEFIEVDMDQFDDLPAPGKHLSPGVYFKIAINRLLPTD-GNVLLLDADTICDGSLSSLL 120 Query: 144 HLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSI 203 L L+G V A P + RL + FN+GV+Y++L++WA + E++ Sbjct: 121 SLDLSGKVLAAA----PSNKAETVRLGLQNNRAK-FNAGVLYVNLQEWAKQDIEERSRQY 175 Query: 204 LMSKDNVYKYPDQDVMNVLLKG--MTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTL 261 + + DQD +N L+ ++ YN + E +++ + Sbjct: 176 IEEHEPEL--NDQDALNALVNNPDDMEYIHPRYNATKLLVREF---------EMVDDEPT 224 Query: 262 LIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDA--KSIIEFKKRY 311 +IHY G KPW S + +P++D P+D K II + R Sbjct: 225 IIHYNGPDKPWRFVTERESGDLWWEYASKTPFRDYVPKDKGVKEIIFVRARS 276 >UniRef50_B8PIH6 Predicted protein n=2 Tax=Agaricomycetes RepID=B8PIH6_POSPM Length = 532 Score = 170 bits (431), Expect = 7e-41, Method: Composition-based stats. 
Identities = 58/306 (18%), Positives = 118/306 (38%), Gaps = 40/306 (13%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLA-EQNQLRI 86 +N+A D Y V+I S++ + + L Y++ D K+ + + + Sbjct: 227 MNIAIATDPAYAMAAAVAIHSVIAHTKSR-LTIYVLDLGLGDNDRNKLRRSMPRRADATM 285 Query: 87 TLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLG 146 ++ ++ +A + ++ +L ++R+LYLDADV+ + DI L Sbjct: 286 VFIPLD-------YASERKEKATWAKIDMIDVLP--VERVLYLDADVLVRADIWGLWSTD 336 Query: 147 LNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMS 206 L G DV + + YFN+GV+ LDL T +AL Sbjct: 337 LRGKPIGAAIDVG------FPEGHNGTVRKPYFNAGVLLLDLAAVRR---TLQALQGAAR 387 Query: 207 KDNVYKYPDQDVMNVLLKGMTLFLPREYNTI-YTIKSELKDKTHQNYKKLITESTLLIHY 265 + ++ DQD++N + + ++N +EL + QN + ++ ++H+ Sbjct: 388 EYTTSRFRDQDLLNAYFEANWAEVSLKWNAQGIATYAELPTEARQNIDMGLLKNPYIVHF 447 Query: 266 TGAT-----------------KPWHKWAI--YPSVKYYKIALENSPWKDDSPRDAKSIIE 306 TG KPW +P + + +E + WK + ++ Sbjct: 448 TGPVNPTLEVVLNPYIQPYTAKPWGYAGAPGHPHGEEWWNVVEQTAWKGWRASEEYRMLC 507 Query: 307 FKKRYK 312 ++ + Sbjct: 508 ASEKER 513 >UniRef50_C5WAK3 Ybl156 protein n=2 Tax=Enterobacteriaceae RepID=C5WAK3_ECOBB Length = 163 Score = 170 bits (430), Expect = 1e-40, Method: Composition-based stats. 
Identities = 74/163 (45%), Positives = 111/163 (68%), Gaps = 2/163 (1%) Query: 174 LLGQYFNSGVVYLDLKKWADAKLTEKALSIL--MSKDNVYKYPDQDVMNVLLKGMTLFLP 231 + G+YFN+GV+Y++LKKW +A LT L +L +K KY DQD +N+ ++L Sbjct: 1 MNGRYFNAGVIYVNLKKWHEANLTPYLLKLLRGETKYGSLKYLDQDALNIAFNMNNIYLA 60 Query: 232 REYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENS 291 ++++TIYT+K+EL D++H+ Y++ IT+ T+LIHYTG TKPWH WA YPS Y+ IA E S Sbjct: 61 KDFDTIYTLKNELHDRSHRKYQQTITDKTVLIHYTGITKPWHSWAGYPSASYFNIAREQS 120 Query: 292 PWKDDSPRDAKSIIEFKKRYKHLLVQHHYISGIIAGVCYLCRK 334 PWK ++A+++ E +K+YKHL YI GI + + Y +K Sbjct: 121 PWKKYPLKEARTVAEMQKQYKHLFAHGEYIKGITSLIKYKLKK 163 >UniRef50_Q1RIL1 Lipopolysaccharide 1,2-glucosyltransferase RfaJ n=10 Tax=Rickettsia RepID=Q1RIL1_RICBR Length = 530 Score = 166 bits (421), Expect = 9e-40, Method: Composition-based stats. Identities = 60/306 (19%), Positives = 118/306 (38%), Gaps = 33/306 (10%) Query: 11 KVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNR-HINLDFYII---ADV 66 + +L I L++A ++ + I S ++N+ + F+I+ D Sbjct: 233 EEIQSVIKLTGIKQDNTLDIALIINDKFARHAATVIASSLINSDINSFYKFHIVMNPNDS 292 Query: 67 YNDGFFQKIAKLAEQNQLRITLYRINTDKL------QCLPCTQVWSRAMYFRLFAFQLLG 120 + +K+A + I + L + + + +W + +RL+ Q+ Sbjct: 293 LTEESMEKLASMKHIRDYSIDFIPFPENVLDLNLANEKIEFSDMWPPLVMYRLYFDQVFP 352 Query: 121 LTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVE-PMQEKAVSRLSDPELLGQYF 179 L+ +LYLDAD++ D++ L ++ + A D V + ++ Y Sbjct: 353 -NLESILYLDADIIVLRDLNSFKKLDMSNYIVAGSMDTALTYCTLKVEEECNRKINNFYK 411 Query: 180 NSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYT 239 NSG+V+L+L+ + + L + + + YPDQD++N+ L +N Sbjct: 412 NSGIVFLNLQNMREKQAKNMVLDAMHNSKCSFAYPDQDLLNIAFHNYIYPLSMRWNFYTY 471 Query: 240 IKSELKDKTHQNYKKLITESTLLIHYTGATKPWH----KWAIY------PSVKYYKIALE 289 ++ ++HY G KPW+ KW KYY E Sbjct: 472 FIDRDNYFSY-----------FIMHYAGKKKPWNNEEIKWTKDILEKYQEIEKYYWRYRE 520 Query: 290 NSPWKD 295 +PW + Sbjct: 521 FTPWGN 526 >UniRef50_A8LP95 Lipopolysaccharide 1 n=1 Tax=Dinoroseobacter shibae DFL 12 RepID=A8LP95_DINSH Length = 342 Score = 166 bits 
(420), Expect = 1e-39, Method: Composition-based stats. Identities = 54/251 (21%), Positives = 92/251 (36%), Gaps = 27/251 (10%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADV---YNDGFFQKIAKLAEQNQLRI 86 V + D YL + I + D I GF + I Sbjct: 38 VCFCSDEGYLPFALFAALQIHRLHPDRCFDLVIAHTGPLSVPHGFP----------GIGI 87 Query: 87 TLYRINTDK-LQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGD-ISQLLH 144 I+T + L + + Y RL LG R+LY+D+DV D + LL Sbjct: 88 RYVEIDTGGCFERLALDARRTGSTYLRLALSGALGHDYQRILYMDSDVFALRDGLHVLLF 147 Query: 145 LGLNGAVAAVVKDVEPMQE---KAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKAL 201 + G A V+D + K ++ YFN+GV+ +D + + + KAL Sbjct: 148 TDMRGKPLAAVRDNSQWRTSGRKPDDLVTLNLPARPYFNAGVLLMDTARLNEQDILAKAL 207 Query: 202 SILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTL 261 + S+ DQ ++N + G + +N +T ++ ++E Sbjct: 208 DLGTSQAGRLARHDQTLLNAVTSGNWAEMSPRWNWQFTW---------ASWIFALSEDAR 258 Query: 262 LIHYTGATKPW 272 ++H+ G KPW Sbjct: 259 ILHFIGPNKPW 269 >UniRef50_D2MYR1 Putative uncharacterized protein n=1 Tax=Campylobacter jejuni subsp. jejuni 414 RepID=D2MYR1_CAMJE Length = 383 Score = 165 bits (419), Expect = 2e-39, Method: Composition-based stats. 
Identities = 68/306 (22%), Positives = 117/306 (38%), Gaps = 66/306 (21%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNR----------------------------HINLD 59 ++ ++ +Y+ V I+SIV N Sbjct: 2 FHIILNLNDDYVKYASVLISSIVKNTDTSKTFAKICEENHNLTHILTLKQYNKSEEEGYV 61 Query: 60 FYIIADVYNDGFFQKIA----KLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFA 115 F+I++D +D K+ LA+ I +Y IN D + + + Y+RL Sbjct: 62 FHILSDFISDKTRMKLEYLKENLAKIYPCDIKIYIINEDNFRNFLHWK-GNFVAYYRLMV 120 Query: 116 FQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELL 175 +L +++ LY+DAD++C DI +L L V V D + + L Sbjct: 121 GSILPPDIEKCLYIDADMLCFSDIRKLFLFDLEDKVLGAVADFATWNTRFLKFRKLKYLF 180 Query: 176 G-------QYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTL 228 +YFNSG++ +DLK+W + +K L +L + PDQD +N+++K + Sbjct: 181 KGFLKFSREYFNSGLLLIDLKEWRRQNIEKKCLDVLKYYKCIL--PDQDALNIVIKENYI 238 Query: 229 FLPREYNT----------IYTIKSELKDKTHQNYKKLI------------TESTLLIHYT 266 LP +N K E+ + +Y K + L +HY+ Sbjct: 239 KLPLSFNCPTVCYATNYLNIICKDEISSFSKLDYFKEVGMMYSKNELLEALNKPLFLHYS 298 Query: 267 GATKPW 272 KPW Sbjct: 299 --EKPW 302 >UniRef50_C5ZVZ7 Putative glycosyltransferase n=3 Tax=Campylobacterales RepID=C5ZVZ7_9HELI Length = 431 Score = 165 bits (418), Expect = 2e-39, Method: Composition-based stats. 
Identities = 70/343 (20%), Positives = 122/343 (35%), Gaps = 62/343 (18%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNN---------------------------------- 53 ++ + D NY+ V ITSI+ N Sbjct: 2 FHIFFSADKNYIPYTAVLITSIIKNTNPQKSFKDFCTTPSDSLPSLDYPRLQYDNLDKLD 61 Query: 54 RHINLDFYIIADVYNDGFFQK----IAKLAEQNQLRITLYRINTDKLQCLPCTQ--VWSR 107 + F+I++D K I +L+ + ++ IN P + S Sbjct: 62 KSEGYVFHILSDSIPKDLQTKLQNFIQELSAFYPCTLQIHIINDIDFAHFPISGAAHSSH 121 Query: 108 AMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVS 167 Y+RL + + LYLD+D++ D+ +L L L +A ++ D K Sbjct: 122 LPYYRLKWQDYIKPAPQKCLYLDSDMLVLCDLRELFALDLKDNIAGIIGDCGSKNRKIKY 181 Query: 168 RLSDPE----LLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLL 223 + ++ + YFNSG + ++ K++ ++ EK L K K DQD++N + Sbjct: 182 QENNYKKTFYFDENYFNSGFLLINSKQYIKEQIWEKC-ENLAKKCTYIKAADQDLLNFTI 240 Query: 224 -KGMTLFLPREYNTIY----TIKSELKDKTHQNYKKLI----TESTLLIHYTGATKPWHK 274 L LP YN + + + K NY + ++ ++HY KPW Sbjct: 241 PINKRLKLPFAYNFQCITLLYVLCKDECKNRLNYTREAFNKSFKNPKILHY--GEKPWRY 298 Query: 275 WAIYPSVK------YYKIALENSPWKDDSPRDAKSIIEFKKRY 311 Y K + + +P D KS I K + Sbjct: 299 LQSYQDYKGNNINDIWWEYAQQTPIFGDKLLKQKSQISDYKLF 341 >UniRef50_C3XGD2 Putative uncharacterized protein n=1 Tax=Helicobacter bilis ATCC 43879 RepID=C3XGD2_9HELI Length = 364 Score = 165 bits (418), Expect = 2e-39, Method: Composition-based stats. 
Identities = 66/318 (20%), Positives = 116/318 (36%), Gaps = 40/318 (12%) Query: 57 NLDFYIIADVYNDGFFQKIA----KLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFR 112 F+I+ D QK+ +L + +Y ++ Q LP + YFR Sbjct: 47 PFCFHILTDGLKHETRQKLQAFQIELNKIYPCEFRVYTLSDSIFQGLPKLNN-NYLAYFR 105 Query: 113 LFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDP 172 L L + LYLD D++C DI ++ + L G + VV + Q + R S Sbjct: 106 LKIASCLPQDIKTCLYLDVDMICVADIREIFYTDLQGKICGVVLVPDHQQYCVLKRNSAI 165 Query: 173 E-----LLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMT 227 YFNSG++ +D++++ + +K L V DQD +N +L Sbjct: 166 GDEFVFNASTYFNSGLMLIDVEQYRKYNVEQKCLEWFEQYVPVLL--DQDALNAVLGDHI 223 Query: 228 LFLPREYNTIYTI-----------KSELKDKTHQNYKKLITESTLLIHYTGAT-KPWHKW 275 LP E+N + + + K + + ++HYTG T KPW + Sbjct: 224 CALPLEWNFFVELLKYKRQDFKGKDNNIVMKITYEEYMQVKNNMKILHYTGWTLKPWQQP 283 Query: 276 AIYP-------SVKYYKIALENSP--WKDDSPRDAKSIIEFKKRY-----KHL--LVQHH 319 I + ++P +KD K + KH+ + Sbjct: 284 YIENDMIKTCIYKNKWWEIAHDTPVFYKDIYASYMKKQEDMLYESILSLQKHIKSFKLRN 343 Query: 320 YISGIIAGVCYLCRKYYR 337 + + + C+K + Sbjct: 344 RLKRLQQSLKRRCKKLFH 361 >UniRef50_C6DEN1 Glycosyl transferase family 8 n=1 Tax=Pectobacterium carotovorum subsp. carotovorum PC1 RepID=C6DEN1_PECCP Length = 602 Score = 162 bits (411), Expect = 1e-38, Method: Composition-based stats. 
Identities = 67/274 (24%), Positives = 115/274 (41%), Gaps = 24/274 (8%) Query: 30 VAYGVDANYLDGVGVSITSIVLN-NRHINL-DFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 + + D Y V++ S+ + R NL D YI G + ++A + +T Sbjct: 327 IFFCADTAYTAPAIVALISLAIAIERSENLPDIYIFVLPEAHGLWGQLASSFNREFPSLT 386 Query: 88 LYRINTDKLQCLPCTQVW---------SRAMYFRLFAFQLLGL-TLDRLLYLDADVVCKG 137 L ++T ++Q + S Y RL+A + L + R LYLD+DVV + Sbjct: 387 LRVVSTLQMQLDQSRAHYGFNSMGDMLSTMAYARLYASRYLSQCGVARALYLDSDVVIQS 446 Query: 138 DISQLLHLGLNGAVAAVVKD-VEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKL 196 LL++ + A D V P+ + AV+ P G+YFNSGV+ LD A Sbjct: 447 SPLPLLYMDMEEFPLAACHDQVGPLVDHAVTLHGIPN--GRYFNSGVMLLDFHHPATLPA 504 Query: 197 TEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLI 256 E A++ D+V + DQ +N ++G+ L L +YN + + Sbjct: 505 IEAAITYSEDTDSVLIFQDQCALNKAIRGLYLTLDGKYNCYMPPGRPM---------SAM 555 Query: 257 TESTLLIHYTGATKPWHKWAIYPSVKYYKIALEN 290 E+ ++H+ KPWH + + ++ Sbjct: 556 YENAAIVHFVSTPKPWHLAYLGQGTALWSRFYDH 589 >UniRef50_A3YS36 Putative sugar transferase n=2 Tax=Campylobacter jejuni RepID=A3YS36_CAMJE Length = 459 Score = 162 bits (411), Expect = 1e-38, Method: Composition-based stats. 
Identities = 68/319 (21%), Positives = 127/319 (39%), Gaps = 31/319 (9%) Query: 29 NVAYGVDANYLDGVGVSITSIVLNNRHIN---LDFYIIADVYNDGFFQKI----AKLAEQ 81 ++ + Y++ + V + SI++N N F+I++ ND +K+ +L+ Sbjct: 3 HIVFNSSNEYIENLSVLMYSIIINTNKSNTKKYCFHILSSNINDNTCKKLTLLEKELSSI 62 Query: 82 NQLRITLYRINTDKLQCLPCTQV-WSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDIS 140 I +Y IN + + S Y RL +L + + LYLD D++ GDIS Sbjct: 63 YPSEIKIYHINDNLFYDYNIPKHEGSYNAYLRLMLASILSKDIKKCLYLDVDMLVLGDIS 122 Query: 141 QLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDP--ELLGQYFNSGVVYLDLKKWADAKLTE 198 +L L L V A V ++ S+ S + G +FNSG++ ++L W + + Sbjct: 123 ELFDLDLKDKVFAAVFILKHPWPNLNSKDSSEIFYIYGSHFNSGLMLINLDAWREKNIES 182 Query: 199 KALSILMSKDNVYKYPDQDVMNVLL-KGMTLFLPREYNTIYTIKSELKDK---------- 247 ++LS + + Y D+ V+N +L K L E+N + + + Sbjct: 183 RSLSFIKNYYVPYAV-DEYVLNAILSKDDIFSLKLEWNFLIGFRRLYLNNDLFFNKEEGD 241 Query: 248 ------THQNYKKLITESTLLIHYTGA--TKPWHKWAIYPSVKYYKIALEN-SPWKDDSP 298 + + + ++HYT KPW + Y + E W D + Sbjct: 242 KYKIICYSKEEFEKAFKKIKILHYTYLYMPKPWENVYSFIDDDYNLVYYEFYDAWWDMAL 301 Query: 299 RDAKSIIEFKKRYKHLLVQ 317 + F K+ + + Sbjct: 302 KTPIYGEHFAKKKREYEKK 320 >UniRef50_Q16CW9 Lipopolysaccharide 1,3-galactosyltransferase, putative n=7 Tax=Rhodobacteraceae RepID=Q16CW9_ROSDO Length = 329 Score = 161 bits (409), Expect = 2e-38, Method: Composition-based stats. 
Identities = 52/257 (20%), Positives = 100/257 (38%), Gaps = 30/257 (11%) Query: 30 VAYGVDANYL---DGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRI 86 + + D NYL I S+V D I +A L I Sbjct: 25 IVFCCDQNYLVFAAHAAAQIASLVE---KPEFDICICYGHQAVVLPDSLA------GLGI 75 Query: 87 TLYRINT-DKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKG-DISQLLH 144 L ++ D + L + + +Y R+ D++LYLD+D+ +G D + L Sbjct: 76 RLCHVDVGDVFEGLRLDKGKTHDVYLRIALPTAFAGEYDKILYLDSDIFVQGGDFNALFD 135 Query: 145 LGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQ-----YFNSGVVYLDLKKWADAKLTEK 199 + + A V+D +Q + R + + YFN+GV+ +D++ + + +L + Sbjct: 136 IDVAPHCIASVRD--NVQWRTPKRQNKRNTIKGIPPSAYFNAGVMLMDVQAYTEQELMRR 193 Query: 200 ALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITES 259 + ++ K DQ++ N +L+ + +N Y+ + L + Sbjct: 194 CVEFGRARRRDLKRHDQNLYNAVLQNDWAEISPVWNWQYSWSTRLF---------AVFAY 244 Query: 260 TLLIHYTGATKPWHKWA 276 +IH+ G KPW + Sbjct: 245 PNIIHFIGPAKPWKDES 261 >UniRef50_O48684 F3I6.10 protein n=46 Tax=Embryophyta RepID=O48684_ARATH Length = 393 Score = 160 bits (406), Expect = 5e-38, Method: Composition-based stats. 
Identities = 58/269 (21%), Positives = 109/269 (40%), Gaps = 23/269 (8%) Query: 23 NTSECLNVAYGVDANYLDGVGVSITSIVLNNR-HINLDFYIIADVYNDGFFQKIAKLAEQ 81 N +++A +D+ YL G ++ S++ + N+ F+ IA ++ + +++L Sbjct: 80 NDPSLVHIAMTLDSEYLRGSIAAVHSVLRHASCPENVFFHFIAAEFDSASPRVLSQLVRS 139 Query: 82 NQLRITL--YRINTDKLQCLPCTQVW----SRAMYFRLFAFQLLGLTLDRLLYLDADVVC 135 + Y D + L + + + Y R + +L +++R++YLD+DV+ Sbjct: 140 TFPSLNFKVYIFREDTVINLISSSIRLALENPLNYARNYLGDILDRSVERVIYLDSDVIT 199 Query: 136 KGDISQLLHLGLNGAVAAVVK---DVEPMQEKAVSRLSDPELLG-------QYFNSGVVY 185 DI++L + L G+ Q SDP L G YFN+GV+ Sbjct: 200 VDDITKLWNTVLTGSRVIGAPEYCHANFTQYFTSGFWSDPALPGLISGQKPCYFNTGVMV 259 Query: 186 LDLKKWADAKLTEKALSI--LMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSE 243 +DL +W + EK L K +Y ++ G + +N Sbjct: 260 MDLVRWREGNYREKLEQWMQLQKKMRIYDLGSLPPFLLVFAGNVEAIDHRWNQ----HGL 315 Query: 244 LKDKTHQNYKKLITESTLLIHYTGATKPW 272 D + + L L+H++G KPW Sbjct: 316 GGDNIRGSCRSLHPGPVSLLHWSGKGKPW 344 >UniRef50_C0X9Z8 Glycosyltransferase n=1 Tax=Lactobacillus gasseri JV-V03 RepID=C0X9Z8_9LACO Length = 675 Score = 159 bits (402), Expect = 1e-37, Method: Composition-based stats. 
Identities = 54/276 (19%), Positives = 110/276 (39%), Gaps = 26/276 (9%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLY 89 +A D N L+ + ++ SI L+N+H ++ +II +F + + Q +I Sbjct: 4 IALDADVNDLNKIETTLKSIFLHNQH--VEIHIINFNIPHEWFINVNQYVNQFGSKIIDE 61 Query: 90 RINTDKLQCL-PCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLN 148 +I+ + L + P + + + R L+ D++LYLD+D++ ++ + + + Sbjct: 62 KIDPNFLGDVQPSSDQIKKISFGRFLIPDLISA--DKVLYLDSDLIVTDNLQSIFQMNFD 119 Query: 149 GAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKD 208 + V D + FNSGV+ ++ K+W + K++ K + + + Sbjct: 120 DKMLFAVHDYQNP---------------DQFNSGVMLINNKRWREEKVSSKLIEMSKQQA 164 Query: 209 NVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTL--LIHYT 266 DQ V+N + K L YN ++ Q + +I+Y+ Sbjct: 165 ---LASDQAVINEVFKNQIGELNLSYNYQIGLEKNAYWNNKQVVFDNYNRVPIPRIINYS 221 Query: 267 GATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAK 302 G P++ + + N W D R K Sbjct: 222 GDDNPFNLVSTGDLRNNWWQY-HNLEWSDIVKRYGK 256 >UniRef50_C6IJ37 General stress protein A n=2 Tax=Bacteroides RepID=C6IJ37_9BACE Length = 309 Score = 158 bits (399), Expect = 4e-37, Method: Composition-based stats. 
Identities = 66/298 (22%), Positives = 124/298 (41%), Gaps = 19/298 (6%) Query: 24 TSECLNVAYGVDANYLDGVGVSITSIVLN--NRHINLDFYIIADVYNDGFFQKIAKLAEQ 81 + + +Y + SI+ + ++++ + KI + E Sbjct: 2 KKTSVPLVIAFTPDYFIPAATCLYSIITSMQAEGELHVICLLSEELPERLKLKIQLIGEG 61 Query: 82 NQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQ 141 + + KLQ + Q ++ A +RL LL +++Y+D D++ + D+ Q Sbjct: 62 RTCY-SFVNL-QGKLQHIYIDQKYTEAASYRLLLPDLLP-EYKKVIYIDCDIIVRNDLVQ 118 Query: 142 LLH-LGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKA 200 L H + L A V + + + +Y NSG + ++L+ + EK Sbjct: 119 LYHSIDLGMNYLAAVFEASMDFQLDHLKTIGCNPN-EYINSGFLIMNLELMRKDNMVEKF 177 Query: 201 LSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTI------KSELKDKTHQNYKK 254 + SK + ++PDQDV+N L K L LP YN+I T K L+ T Q++ + Sbjct: 178 IE--ASKVDYLEFPDQDVLNQLCKDRILALPPYYNSIRTFYLPQYKKFFLQKYTEQDWLE 235 Query: 255 LITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYK 312 + T +HYTGA KPW+++ + +++ + + K I K Y+ Sbjct: 236 VHRHGT--VHYTGA-KPWNQFTV-QFQLWWQYYEQLPEIIKKEWQVDKKIYFLSKLYR 289 >UniRef50_Q17VR5 Lipopolysaccharide biosynthesis protein n=19 Tax=Helicobacter RepID=Q17VR5_HELAH Length = 405 Score = 157 bits (398), Expect = 5e-37, Method: Composition-based stats. 
Identities = 64/377 (16%), Positives = 135/377 (35%), Gaps = 81/377 (21%) Query: 24 TSECLNVAYGVDANYLDGVGVSITSIVLNNRHIN------LDFYIIADVYNDGFFQKIAK 77 S + + D NY GVS+ S++ N + + + D + +K+ + Sbjct: 3 DSVIIPIVVAFDNNYCIPAGVSLYSMLANAKTERERVKLFYKIHCLVDGLSAENIEKLKE 62 Query: 78 LAE-------------------QNQLRIT-LYRINTDKLQCL-----------------P 100 + I I +D Q + Sbjct: 63 TLAPFSAFSSVEFLEISTHNTPKENQEIKKNQTIKSDHYQNIDPIIANKIEELFTKLSNY 122 Query: 101 CTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVK--DV 158 + +S+ + RL L D+++ D D + GDIS+ + L V+ D+ Sbjct: 123 SQKRFSKMIMCRLLLASLFP-QYDKMIMFDVDTLFVGDISESFFIPLEAHYFGAVREKDL 181 Query: 159 EPMQEKAVSRLSDPE---------------------LLGQYFNSGVVYLDLKKWADAKLT 197 M + L + L YFN+G + L+LK W L Sbjct: 182 IAMNRNSAKDLYELRQRRAKSIGVANAFPNLEEAQILFDNYFNAGFLALNLKLWRKENLE 241 Query: 198 EKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLIT 257 + + + K+ + DQD + + +G L LP YN + L + + K++ Sbjct: 242 NQLIGFFILKNEKLLFNDQDALCFVCRGRILELPYPYNAHPSF---LDTPSFPSIKEV-- 296 Query: 258 ESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHLLVQ 317 ++H+ G KPW ++++ + K++++ L +P+KD + + H+ + Sbjct: 297 ---CMLHFWG-DKPWKIFSVFGAKKWHEV-LMQTPFKDKY----FNTPFLDHLFNHIQNK 347 Query: 318 HHYISGIIAGVCYLCRK 334 ++ + + ++ ++ Sbjct: 348 NNKLRTFNKALSFVDKR 364 >UniRef50_A4IXE1 Glycosyl transferase, family 8 n=16 Tax=Francisella RepID=A4IXE1_FRATW Length = 296 Score = 156 bits (395), Expect = 9e-37, Method: Composition-based stats. 
Identities = 60/307 (19%), Positives = 119/307 (38%), Gaps = 26/307 (8%) Query: 26 ECLNVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKIAKLAEQNQL 84 + + + D N + G V+I S++ + N D Y+ N + E+ + Sbjct: 2 NKIPIVFTFDKNIILGGAVTIKSLIDHANPDTCYDIYVYHPNINKKSISAFNSMIEKTKH 61 Query: 85 RITLYRINTDKLQCLPC-TQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLL 143 I+ + ++ + +P T+ ++RL +LL D+++Y D DV+ + D+S++ Sbjct: 62 SISFHNVDESIFKDVPIDTRRGWIITFYRLLIPKLLP-QYDKVIYSDVDVLFQSDMSEVY 120 Query: 144 HLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSI 203 + L A V + Q + + G + ++ K + + Sbjct: 121 NTDLTSYEWAGVIAEKHQQNMVQHKYFKENNNSYIYWPGFMVMNTKLMRENNFISRCFDT 180 Query: 204 LMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKS-----------ELKDKTHQNY 252 + + K+ D DV+N+ + LP +Y T+ +I LK+ N Sbjct: 181 MHEFNTRLKFRDLDVLNLTCR-KIKSLPFKYVTLQSIYYLNTIQEAPEYIFLKEIYSDNE 239 Query: 253 KKLITESTLLIHYTG-ATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRY 311 + +IHY G KPW + YK LE + P++ + F+ Sbjct: 240 LLDAKNNPAIIHYAGSPGKPW------RMKRPYKNYLE---YISKIPKELRKYT-FRDIK 289 Query: 312 KHLLVQH 318 K LL ++ Sbjct: 290 KKLLSKY 296 >UniRef50_B3XPR8 Glycosyl transferase family 8 n=4 Tax=Lactobacillus RepID=B3XPR8_LACRE Length = 465 Score = 156 bits (395), Expect = 1e-36, Method: Composition-based stats. 
Identities = 49/261 (18%), Positives = 99/261 (37%), Gaps = 25/261 (9%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLY 89 +A VD ++D ++ SI +N+ N+ YII +F I + +I Sbjct: 4 IALSVDYRWIDQAETTLKSIYAHNK--NVKTYIINHDIPHEWFVNINRYLGVQDSQIIDR 61 Query: 90 RINTDKLQCLPCTQ-VWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLN 148 +I+ ++ + +P + S +Y + +L+ D++LYLD+DV+ ++ QL +N Sbjct: 62 KIDEERFKDMPMPEARISPMVYGKFLIPELIPE--DQVLYLDSDVIVDKNLDQLFATKIN 119 Query: 149 GAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKD 208 V D + FNSGV+ ++ W + + + L + + Sbjct: 120 DRPLYTVVDYFNPSQ---------------FNSGVLLINNLFWRNNNIGNQLLKLGHDYN 164 Query: 209 NVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITE--STLLIHYT 266 Q +MN L +N + + ++ + +IHYT Sbjct: 165 LN---NTQVIMNEGFAQNYGKLDPCFNFQIGYERKSYWNDKSSFYAFFDKVTDPAIIHYT 221 Query: 267 GATKPWHKWAIYPSVKYYKIA 287 KP++ + + Sbjct: 222 EKDKPFNIEKTVELREKWWYY 242 >UniRef50_C5SH34 Glycosyl transferase family 8 n=1 Tax=Asticcacaulis excentricus CB 48 RepID=C5SH34_9CAUL Length = 307 Score = 152 bits (385), Expect = 1e-35, Method: Composition-based stats. 
Identities = 64/315 (20%), Positives = 117/315 (37%), Gaps = 45/315 (14%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLY 89 + Y VD NYL VS + N D I+ +K+ +A I L Sbjct: 6 ICYVVDDNYLFPTLVSASQARENAPSSLADIVILCLSDASDRVRKVMPVAVALG--IELI 63 Query: 90 RINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNG 149 + T ++ L MY RLF +LL +R+LY+D D + LL++ + Sbjct: 64 EVPTASIENL-------HPMYGRLFIDKLLPKAYERVLYIDGDTQIAASLEPLLNVDIPE 116 Query: 150 AVAAVVKDVEPMQEKAVSRLS------------DPELLGQYFNSGVVYLDLKKWADAKLT 197 V+D M K + + + Y N+GV+ ++K WA+ L Sbjct: 117 GKFLAVRDPAAMFAKLSDKWASRIQGERVEAGLGDNPIEDYLNTGVLVFNMKDWAE--LA 174 Query: 198 EKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLIT 257 + L ++ ++ +K+ DQD MN+ + L++ +N + +++ Sbjct: 175 GETLKLIRARSTPFKFGDQDPMNLAIGDRCLYISNRWNFPGFLIGSGQEE---------R 225 Query: 258 ESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAK-----SIIEFKKRYK 312 ++ H+ +PW K++ +P+K R K + Sbjct: 226 VKPVIYHFMSNPRPWVHAGAPWGPKWH------TPYKAFLARFPVLESVAPKTTPVKALR 279 Query: 313 HLLVQHHYISGIIAG 327 H L Q + G+ Sbjct: 280 HHLQQA--LKGVTEY 292 >UniRef50_C6DEN3 Glycosyl transferase family 8 n=1 Tax=Pectobacterium carotovorum subsp. carotovorum PC1 RepID=C6DEN3_PECCP Length = 610 Score = 152 bits (384), Expect = 2e-35, Method: Composition-based stats. 
Identities = 61/287 (21%), Positives = 108/287 (37%), Gaps = 22/287 (7%) Query: 16 DFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHIN--LDFYIIADVYNDGFFQ 73 D L + + DA++ V++TS+ ++ N D YI ++ Sbjct: 316 DAALQPTKPLSEYAIFFCTDADFSLPAVVALTSLAMSIGGANNLPDIYIFVPPEIRPLWE 375 Query: 74 KIAKLAEQNQLRITLYRINT---------DKLQCLPCTQVWSRAMYFRLFAFQLLG-LTL 123 +IA+ ITL ++T + + S Y R +A + L + + Sbjct: 376 RIAERFTSAFPIITLRIVSTLQMDLDEVRAQFGFYNVGETLSTTTYTRFYASRYLHYIGV 435 Query: 124 DRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGV 183 R LYLD+D+V LL+ + G A D K RL +YFN+GV Sbjct: 436 TRALYLDSDIVILHSPLSLLYEDMQGFPLAARTDRNTPLIKRAIRLHQIA-NERYFNAGV 494 Query: 184 VYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSE 243 + DL A A++ ++ + DQ +N + G+ L L YN S Sbjct: 495 ILFDLTHPAMISTINTAITYSKQGNSPLLFLDQCALNKAISGLYLALDERYNRFIPPSSA 554 Query: 244 LKDKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALEN 290 ++I ++T+++H+ KPW + + + Sbjct: 555 ---------TQVIEDNTVIMHFIETPKPWQAGYAGQGLTIWGEYQRH 592 >UniRef50_Q1CUZ8 Lipopolysaccharide 1,2-glycosyltransferase n=12 Tax=Helicobacter RepID=Q1CUZ8_HELPH Length = 372 Score = 152 bits (384), Expect = 2e-35, Method: Composition-based stats. 
Identities = 60/368 (16%), Positives = 130/368 (35%), Gaps = 65/368 (17%) Query: 28 LNVAYGVDANYLDGVGVSITSIV-----LNNRHIN------LDFYIIADVYNDGFFQKIA 76 + +A D +Y GVS+ S++ + + N + + D + QK+ Sbjct: 5 IPIAIAFDNHYAIPTGVSLYSMLACAKTEHPQSQNDSEKLFYKIHCLVDNLSLENQQKLK 64 Query: 77 K---------------LAEQNQLRITLYRINTDKLQ------CLPCTQVWSRAMYFRLFA 115 + ++E + I + DK+ + +S+ + RLF Sbjct: 65 ETLAPFSAFASVDFLDISEPDHSTIKIEPFVIDKIHEAFLQLNIYAKTRFSKMVMCRLFL 124 Query: 116 FQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQ------------- 162 L D+++ DAD + D+S+ + L+ KD + Sbjct: 125 ASLFP-QYDKIIMFDADTLFLNDVSESFFIPLDSYYFGAAKDFASPKSLKHFQTEREREP 183 Query: 163 -------EKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPD 215 E + + ++N G + ++LK W L E+ L++ K P+ Sbjct: 184 RQKFSLYEHYLKEKDMKIICENHYNVGFLIVNLKLWRADHLEERLLNLTHQKGQCVFCPE 243 Query: 216 QDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTGATKPWHKW 275 QD++ + L LP YN + + Q + +++H+ KPW Sbjct: 244 QDLLTLACYQKVLQLPYIYNAHPFMLN-------QKRFIPDKKEIVMLHFYFVGKPWISP 296 Query: 276 AIYPSVKYYKIALENSPWKDDSPRDAKSIIE-----FKKRYKHLLVQHHYISGIIAGVCY 330 S ++++ L+ + + S + K + E K++ L ++ V + Sbjct: 297 TALYSKEWHETLLKTPFYAEYSVKFLKQMTECLSLKDKQKTFEFLAPLLNKKTLLEYVFF 356 Query: 331 LCRKYYRK 338 + +++ Sbjct: 357 RLNRIFKR 364 >UniRef50_C6DEN2 Glycosyl transferase family 8 n=1 Tax=Pectobacterium carotovorum subsp. carotovorum PC1 RepID=C6DEN2_PECCP Length = 615 Score = 151 bits (383), Expect = 3e-35, Method: Composition-based stats. 
Identities = 63/273 (23%), Positives = 110/273 (40%), Gaps = 23/273 (8%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHINL--DFYIIADVYNDGFFQKIAK-LAEQNQLRI 86 + D Y V++TS+ ++ N D Y+ + +IA A++ L + Sbjct: 337 IFLCADTAYNVPALVALTSLAMSIAQANPPPDIYMFVLPETHEIWSQIAHCFAKKFPLTV 396 Query: 87 TLY---RINTDKLQCLPCTQVW----SRAMYFRLFAFQLL-GLTLDRLLYLDADVVCKGD 138 + +++ D+ + Q + S Y RL+A + L GL + R LYLD+DVV + Sbjct: 397 KIVSTLQMDLDESRAHYGFQNYGKMLSITAYARLYASRYLQGLGITRALYLDSDVVIRRS 456 Query: 139 ISQLLHLGLNGAVAAV-VKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLT 197 LLH+ + G A + P +A+ P G+YFNSG++ LD + A Sbjct: 457 PLGLLHMDMGGYPLAARTERAHPRISRAIKLHGIPN--GRYFNSGILLLDFQHPATQSTL 514 Query: 198 EKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLIT 257 A++ DN Y DQ +N ++G+ L L ++N + Sbjct: 515 NTAIAYSEQLDNKLLYLDQCALNKSIQGLYLDLDEKFNWFI---------VPDDTAHPQD 565 Query: 258 ESTLLIHYTGATKPWHKWAIYPSVKYYKIALEN 290 E ++H+ KPW + + Sbjct: 566 EDAAIMHFISTPKPWDLNYSGRGATLWADYKHH 598 >UniRef50_B9HMR5 Glycosyltransferase, CAZy family GT8 n=25 Tax=Magnoliophyta RepID=B9HMR5_POPTR Length = 383 Score = 151 bits (382), Expect = 3e-35, Method: Composition-based stats. 
Identities = 53/276 (19%), Positives = 108/276 (39%), Gaps = 25/276 (9%) Query: 18 RLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNR-HINLDFYIIADVYNDGFFQKIA 76 R + +++A +D+ YL G ++ S++ + ++ F+ +A ++ + + Sbjct: 67 RSVSSCDPSLVHIAMTLDSEYLRGSIAAVHSVLKHASCPESIFFHFVAAEFDPASPRVLT 126 Query: 77 KLAEQNQLRITL--YRINTDKLQCLPCTQVW----SRAMYFRLFAFQLLGLTLDRLLYLD 130 +L + Y D + L + + + Y R + +L L +DR++YLD Sbjct: 127 QLVRSTFPSLNFKVYIFREDTVINLISSSIRQALENPLNYARNYLGDMLDLCVDRVIYLD 186 Query: 131 ADVVCKGDISQLLHLGLNGAVAAVVK---DVEPMQEKAVSRLSDPELLG---------QY 178 +D+V DI +L + L+G+ Q SD + G Y Sbjct: 187 SDIVVVDDIHKLWNTALSGSRVIGAPEYCHANFTQYFTSVFWSDQVMSGTFSSARRKPCY 246 Query: 179 FNSGVVYLDLKKWADAKLTEKALSI--LMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNT 236 FN+GV+ +DL +W + + + K +Y+ ++ G + +N Sbjct: 247 FNTGVMVMDLVRWREGDYKRRIEKWMEIQKKTRIYELGSLPPFLLVFAGDVEAIDHRWNQ 306 Query: 237 IYTIKSELKDKTHQNYKKLITESTLLIHYTGATKPW 272 D + + L L+H++G KPW Sbjct: 307 ----HGLGGDNVRGSCRSLHPGPVSLLHWSGKGKPW 338 >UniRef50_D1IFB6 Whole genome shotgun sequence of line PN40024, scaffold_26.assembly12x (Fragment) n=1 Tax=Vitis vinifera RepID=D1IFB6_VITVI Length = 473 Score = 151 bits (381), Expect = 4e-35, Method: Composition-based stats. 
Identities = 50/276 (18%), Positives = 104/276 (37%), Gaps = 41/276 (14%) Query: 24 TSECLNVAYGVDANYLDGVGVSITSIVLNNR-HINLDFYIIADVYNDGFFQKIAKLAEQN 82 +++A +D+ YL G ++ SI+ ++ N+ F+ IA ++ + + +L Sbjct: 140 DPSLVHIAMTLDSEYLRGSIAAVHSILRHSSCPENVFFHFIAAEFDPASPRVLTQLVRST 199 Query: 83 QLRITL--YRINTDKLQCLPCTQVWS----RAMYFRLFAFQLLGLTLDRLLYLDADVVCK 136 + Y D + L + + S Y R + +L ++R++Y+D+D+V Sbjct: 200 FPSLNFKVYIFREDTVINLISSSIRSALENPLNYARNYLGDILDPCVERVIYIDSDLVVV 259 Query: 137 GDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKL 196 DI +L ++ L YFN+GV+ +DL +W Sbjct: 260 DDIRKLWNITLTEKPC-------------------------YFNTGVMVMDLVRWRKGNY 294 Query: 197 TEKALSI--LMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKK 254 K + L + +Y+ ++ G + +N +K + + Sbjct: 295 RRKIENWMELQRRRRIYELGSLPPFLLVFAGNVEAIDHRWNQHGLGGDNVKG----SCRP 350 Query: 255 LITESTLLIHYTGATKPWHKWAIY---PSVKYYKIA 287 L L+H++G KPW + P ++ Sbjct: 351 LHPGPVSLLHWSGKGKPWSRLDARKPCPVDHLWEPY 386 >UniRef50_D0IR33 LPS 1,2-glycosyltransferase n=3 Tax=Helicobacter pylori RepID=D0IR33_HELP1 Length = 387 Score = 150 bits (378), Expect = 8e-35, Method: Composition-based stats. 
Identities = 64/383 (16%), Positives = 132/383 (34%), Gaps = 80/383 (20%) Query: 28 LNVAYGVDANYLDGVGVSITSIVL--------------------------NNRHINLDFY 61 + + D +Y GVS+ S++ +N+ + + Sbjct: 5 IPIVITFDNHYAIPAGVSLYSMLACTKLENPQSQNPQSQNPQSQNPQSQNDNKKLFYKIH 64 Query: 62 IIADVYNDGFFQKIAKLAEQNQ--LRITLYRINTDKLQCLPC------------------ 101 + D + K+ + + + I+T L P Sbjct: 65 CLVDNLSLENQCKLKETLAPFSAFMSVDFLDISTPNLYTTPIEPSVIDKINEAFLQLNIY 124 Query: 102 -TQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEP 160 +S+ + RLF L L D+++ DAD + D+S+ + L+ V KD Sbjct: 125 AKTRFSKMVMCRLFLASLF-LQYDKIIMFDADTLFLNDVSESFFIPLDDYYFGVAKDFSS 183 Query: 161 MQ--------------------EKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKA 200 + E + L ++N G + ++LK W +L E+ Sbjct: 184 PKSSKHFQTERERAPRQAFSLYEHYLKEKDIKILYENHYNVGFLVVNLKLWRADRLEERL 243 Query: 201 LSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITEST 260 L++ K P+QD++ + L LP YNT + +Q + Sbjct: 244 LNLTHQKGQCVFCPEQDLLTLACYQKVLILPYIYNTHPFM-------VNQKRFIPNRQEI 296 Query: 261 LLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEF-----KKRYKHLL 315 +++H+ KPW S ++++ L+ S + + S + K + EF K++ L Sbjct: 297 VMLHFYFVGKPWVSPTALYSKEWHETLLKTSFYAEYSVKFLKQMTEFLSLKDKQKTFEFL 356 Query: 316 VQHHYISGIIAGVCYLCRKYYRK 338 ++ V + + +++ Sbjct: 357 APLLNPKILLEYVFFRLNRIFKR 379 >UniRef50_UPI0000587C70 PREDICTED: similar to MGC81998 protein n=1 Tax=Strongylocentrotus purpuratus RepID=UPI0000587C70 Length = 344 Score = 148 bits (373), Expect = 3e-34, Method: Composition-based stats. 
Identities = 57/292 (19%), Positives = 110/292 (37%), Gaps = 44/292 (15%) Query: 21 NINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAE 80 N +++ +NV D + L G+ ++ SI LN+R ++ FY++ D ++K Sbjct: 56 NSSSNGTINVLICSDGSTLGGMVAAMNSIYLNSR-THIKFYLVVDT---DSLDHLSKWLS 111 Query: 81 QNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDIS 140 Q+ LR Y I L Y RL+ ++ R++++D+D + +GDI+ Sbjct: 112 QSSLRKLDYAIKVFDESWLN---------YARLYFPKIFPGLTGRVIFVDSDTITQGDIA 162 Query: 141 QLLHLGLN-GAVAAVVKDVEPMQEKAVSRLS----------------DPELLGQYFNSGV 183 +L + + G V A D + + ++ + FN GV Sbjct: 163 ELNAIDIKPGHVVAFSDDCSAVTSRYGVIMNRYASYLNFGNEKLQSLGINPMECSFNPGV 222 Query: 184 VYLDLKKWADAKLTEKALSILMSKDNVYKYPDQ-------DVMNVLLKGMTLFLPREYNT 236 ++ +W +T K + Y Q M ++ LP E++ Sbjct: 223 FVANVDEWRKQNITAKLDYWVTVNSKEDVYGSQRGGGHSGPPMMIVFYMKYSPLPPEWHI 282 Query: 237 IYTIKSELKDKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIAL 288 L T Y ++ L+H+ G KPW + + + ++ Sbjct: 283 -----RHLGVTTGARYSDAFLKAAKLLHWNGRFKPWGHNSQHTLI--WEKYY 327 >UniRef50_C3XFW0 Glycosyl transferase n=1 Tax=Helicobacter bilis ATCC 43879 RepID=C3XFW0_9HELI Length = 365 Score = 146 bits (369), Expect = 1e-33, Method: Composition-based stats. 
Identities = 45/234 (19%), Positives = 89/234 (38%), Gaps = 21/234 (8%) Query: 58 LDFYIIADVYNDGFFQKI----AKLAEQNQLRITLYRINTDKLQCLPCTQV---WSRAMY 110 F++I D ++ L + +I + I+ + + LP A Y Sbjct: 34 YHFHVITDSIAKKTLEQFHILQTTLNDIYPCQIEAHIISDEDFKDLPKWGYEEAQQYAAY 93 Query: 111 FRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLS 170 +R+ L +D+ LYLD D++ D+ +L L L+G +AA Sbjct: 94 YRVKLVDFLPKNVDKCLYLDTDMLVLTDLRELFALNLDGYIAASSSGSPNATISRYGIYR 153 Query: 171 DPEL---------LGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNV 221 + YF SG++ ++ K+W + +A+ L + ++ DQD +N Sbjct: 154 KKKGGKKAVKSFETSFYFCSGLMLINTKEWIKQNVDIEAMRFLREYE--TEFADQDALNF 211 Query: 222 LLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITE---STLLIHYTGATKPW 272 + L ++ + E T+ ++ K + + ++H G K W Sbjct: 212 AMCDRVYNLGEQWGILAYQSLEAACSTNIDFSKRYEKAMINAKILHCNGPAKAW 265 >UniRef50_Q39T65 Glycosyl transferase, family 8 n=1 Tax=Geobacter metallireducens GS-15 RepID=Q39T65_GEOMG Length = 317 Score = 145 bits (367), Expect = 2e-33, Method: Composition-based stats. Identities = 55/309 (17%), Positives = 117/309 (37%), Gaps = 39/309 (12%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKIAKLAEQN-QLR 85 + V + D NY+ V+ S++ N N F ++ + ++ +A++ Sbjct: 10 IPVFFAFDNNYVIPAAVAFHSLLANVNVSYKYHFIVLHEDISEENRDLLAQVVSLFSNAS 69 Query: 86 ITLYRINT---DKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQL 142 + + ++ + + +++ ++L L D++++ D DVV K DIS + Sbjct: 70 VEFRDMGESFKNEWENIKGKGHYTKECLYKL-VPMLEFPQYDKIIWSDVDVVFKDDISDV 128 Query: 143 LHLGLNGAVAAVVKDVEPMQEKAVSRLSDPE----LLGQYFNSGVVYLDLKKWADAKLTE 198 + A V+ + +K ++ P +L +G++ +LKK + + + Sbjct: 129 FFMLSEENYIAGVRVCGKL-DKYYENMNMPAEIKSILKNGIGAGILVYNLKKMREDNIYD 187 Query: 199 KALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYK----- 253 + L ++ P+QD++N++LK ++P Y + + KD+ K Sbjct: 188 DIMIALQGMSSIVVQPEQDILNIVLKDKIDYIPLRYCFCTYMYNLFKDRHKMKLKVKGNL 247 Query: 254 -----------------------KLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALEN 290 ES +IHY +TKPW+ + L+ Sbjct: 248 FNYLFKGYRKNLGFDTIYSEKELLEAFESPAIIHYATSTKPWNTLFTKRKSDWLYCLLKT 307 Query: 
291 SPWKDDSPR 299 WK R Sbjct: 308 PFWKRYIFR 316 >UniRef50_UPI00016C0890 glycosyl transferase family 8 protein n=2 Tax=Epulopiscium sp. 'N.t. morphotype B' RepID=UPI00016C0890 Length = 593 Score = 143 bits (362), Expect = 6e-33, Method: Composition-based stats. Identities = 53/296 (17%), Positives = 106/296 (35%), Gaps = 28/296 (9%) Query: 16 DFRLANINTSECLNVAYGVDANYLDGVGVSITSIVL-NNRHINLDFYIIADVYNDGFFQK 74 + A +TS+C + D ++ G ++ S+V +N + N D I ++ Sbjct: 276 EVMPAFADTSDC--IVLTTDDRFIIGAAATLISLVKTSNVNNNYDIIIFHKDLSEKSKTL 333 Query: 75 IAKLAEQN-QLRITLYRI--NTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDA 131 + + Q + Y + W +YF+L ++ + L+LD Sbjct: 334 LRNVVVQRINFSLRFYDVGYEMSTYNVYKPGNNWQPCVYFKLLIPSIMH-NYKKSLHLDC 392 Query: 132 DVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQ--------YFNSGV 183 D++ DI+ LL + L G A ++ + ++ + YFN GV Sbjct: 393 DLIILEDIANLLSIDLKGNAVAGCAEMGCITTSIRRTWANKYYHEKLRITNMVEYFNGGV 452 Query: 184 VYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYN-------- 235 + ++ ++ + L K + QD+++ LP+ +N Sbjct: 453 IVFNINEFHKITSLAQLLHEAEKKHLNLE---QDILSKSFVNHIYLLPQSWNLTRDFLGT 509 Query: 236 TIYTIKSELKDKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENS 291 + K L +Q Y + +IHY G KPW + Y+ + + Sbjct: 510 VMNLYKQYLPSNIYQKYLDA-RQKPKIIHYIGPLKPWDNPNL-EYASYWWDTIRGT 563 >UniRef50_B6ACJ0 Glycosyl transferase family 8 protein, putative n=1 Tax=Cryptosporidium muris RN66 RepID=B6ACJ0_9CRYT Length = 304 Score = 143 bits (360), Expect = 1e-32, Method: Composition-based stats. 
Identities = 52/276 (18%), Positives = 106/276 (38%), Gaps = 32/276 (11%) Query: 22 INTSECLNVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIAD-VYNDGFFQKIAKL- 78 N + +A+ D + SI N + + + +II ++ + + Sbjct: 44 SNPDKVYQIAFSADKEVFQLFPTLLNSIFKNLHEYEKANVHIITMPDISEKDIKILQSFS 103 Query: 79 AEQNQLRI--TLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCK 136 + +I Y N KL+ + S A RL ++ ++D+LLYLD DV+ Sbjct: 104 MNKFDKKIALLFYPFNY-KLKYTRTLKHVSEATMCRLLLPNIIDKSIDKLLYLDTDVIVN 162 Query: 137 GDISQLLHLGLNGAVAAVVKD------VEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKK 190 + +L + +N V + + +K + FN+GV+ + L + Sbjct: 163 TPLRELFGININSQCGIVARSSTKADLINEWLKKDKIYPHIIYNGTKSFNAGVLLISLNE 222 Query: 191 WADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQ 250 T+KA+ + + DQ ++N+ G LP +YN + + + ++ + Sbjct: 223 LRKNHFTDKAMEFVEK----WGLNDQIILNLYCNGEYDELPMQYNF-WAGRDDYRNTSAH 277 Query: 251 NYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKI 286 ++H+ G KPW P+ + Y+ Sbjct: 278 G----------IVHFAGPNKPWQ-----PNYQPYEE 298 >UniRef50_UPI000197AD97 hypothetical protein BACCOPRO_03221 n=1 Tax=Bacteroides coprophilus DSM 18228 RepID=UPI000197AD97 Length = 313 Score = 143 bits (360), Expect = 1e-32, Method: Composition-based stats. 
Identities = 65/318 (20%), Positives = 129/318 (40%), Gaps = 26/318 (8%) Query: 26 ECLNVAYGVDANYLDGVGVSITSIVLNNRHINL-DFYIIADVYNDGFFQKIAKLAEQNQL 84 + + + + D N + V I+S+++N + D +I+ D +++ +L + Sbjct: 3 KTVPIVFAFDNNLILPACVCISSLLMNAKEETFYDIFILHSSKVDLHKEQLDELPKYFNR 62 Query: 85 RITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQL-L 143 YR+ + + + Y+RL +L+ D ++Y D DV+ + D+S + Sbjct: 63 CRIQYRVVDNTFDQAFEIRGITTPTYYRLLIPELVP-EYDNIIYSDVDVIFRFDLSDIYF 121 Query: 144 HLGLNGAVAAVVKDVEPM---QEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKA 200 H LN + A V + P +K +L + + + +G + L+ KK + L E+ Sbjct: 122 HTDLNDSYVAGVNALVPFIPDMKKYYLKLGNVNIDSIIY-AGNIILNSKKIREDNLVERF 180 Query: 201 LSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITEST 260 + +K + + D DV+N+ KG +L + + T SEL + + ++ Sbjct: 181 KELAKNK---FHFQDLDVLNIACKGKITYLKPVF-CLTTYFSELALRHRNLLRDFWSDKD 236 Query: 261 L-------LIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKH 313 + ++HY G KPW + + SP+ D+ K EF + Sbjct: 237 IDEALTEGIVHYNGQ-KPWKGICVN--SDIWWEYYRKSPFFDE-----KFYFEFFYTRLN 288 Query: 314 LLVQHHYISGIIAGVCYL 331 L Q I + Y Sbjct: 289 ELDQLSLWKRIKILIRYF 306 >UniRef50_B9ADW8 Putative uncharacterized protein n=1 Tax=Methanobrevibacter smithii DSM 2375 RepID=B9ADW8_METSM Length = 223 Score = 141 bits (355), Expect = 4e-32, Method: Composition-based stats. 
Identities = 54/224 (24%), Positives = 93/224 (41%), Gaps = 22/224 (9%) Query: 63 IADVYNDGFFQKIAKLAEQNQLRITLYRINTDK------LQCLPCTQVWSRAMYFRLFAF 116 + +KI K+A I+ + + L + +S A Y +LF Sbjct: 1 MDSGIKKINKEKIRKIAHDYGADISFIHVADIEEKYNLTLNKMSVKGDFSLATYSKLFIA 60 Query: 117 QLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLG 176 LL T+D+++YLD D + ++L+L LN +AA V + E V + D Sbjct: 61 SLLPETVDKVIYLDCDALVLDSFKEILNLDLNNYLAAGVLALNCTAE--VKKAIDLNEDD 118 Query: 177 QYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKY--PDQDVMNVLLKGMTLFLPREY 234 Y N+G++ ++LK+W + + L L+ + K+ DQ V+N + L L +Y Sbjct: 119 LYINAGMLLINLKRWRQENVENQFLEKLVEFNLRGKHFGMDQGVINNVSSKNLLVLNPKY 178 Query: 235 -------NTIYTIKSELKDKTHQNYK-----KLITESTLLIHYT 266 NT Y I +L +NY E+ + H+ Sbjct: 179 NLEGSLHNTGYDITFKLNGNIQKNYYSREVLDDAIENPVFQHFC 222 >UniRef50_B3WD32 Glycosyl transferase n=9 Tax=Lactobacillus RepID=B3WD32_LACCB Length = 279 Score = 140 bits (353), Expect = 7e-32, Method: Composition-based stats. Identities = 60/286 (20%), Positives = 107/286 (37%), Gaps = 52/286 (18%) Query: 27 CLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIA----------DVYNDGFFQKIA 76 +N+ + D DGV ++ S++ + L Y++ ++ +++A Sbjct: 2 TMNIMFCGDEKMTDGVLIATLSLMRHT-DQPLHIYVLTAKLKVNGHAYQPFSAVTAERMA 60 Query: 77 KLAEQNQLRITLYRIN-TDKLQCLP----CTQVWSRAMYFRLFAFQLLGLTLDRLLYLDA 131 L Q + L RI+ TD P T +++ RL+A L+ DR+LYLD Sbjct: 61 DLMRQENPQHRLTRIDITDLFMANPPQANMTTMFTPYCMLRLYA-DLIPELPDRVLYLDT 119 Query: 132 DVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKW 191 D+VC+ S L + A V D Y NSGV+ ++L Sbjct: 120 DIVCRRSFSNLYQEPMKDVDIAGVLDHYGKWW-----FHHKLTWFDYINSGVLLMNLASI 174 Query: 192 ADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQN 251 L + ++ + PDQ +N++ K LPR+YN + Sbjct: 175 RQDGLLVRCRRLI--RHRWLFMPDQSALNIIAKSK-QILPRKYNEQH------------- 218 Query: 252 YKKLITESTLLIHYTGAT-----------KPWHKWAIYPSVKYYKI 286 + T+ H+T + KPW A++ + ++ Sbjct: 219 ---KVETDTVFQHFTTSFRFWPRFRIVTVKPWQISAVHQQLGLHEY 261 >UniRef50_C0EQT1 Putative uncharacterized protein n=1 Tax=Neisseria flavescens NRL30031/H210 
RepID=C0EQT1_NEIFL Length = 212 Score = 140 bits (352), Expect = 1e-31, Method: Composition-based stats. Identities = 60/226 (26%), Positives = 93/226 (41%), Gaps = 29/226 (12%) Query: 115 AFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEK-AVSRLSDPE 173 +LG D +LYLD DV+C GDIS+L + + A V + + + Sbjct: 3 IPAILGDISDTVLYLDTDVLCLGDISELFTV-----ILAAVPETTLYRAYINKLNVFGFR 57 Query: 174 LLGQYFNSGVVYLDLKKWADAK----LTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLF 229 YFNSGV+ + K W ++ L EK + +SK + PDQD++N+ KG + Sbjct: 58 STDPYFNSGVLLFNNKFWNESSAYTVLNEKIRQVELSKF-ILACPDQDLLNLSCKGKVGW 116 Query: 230 LPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALE 289 LP YN I+ + T+ ++ L+H+ G TKPWH +P Y Sbjct: 117 LPESYNRIHWHHQGSELNTNP-------KNIRLVHFIGGTKPWHHLGFHPV---YDSFYR 166 Query: 290 NSPWK--------DDSPRDAKSIIEFKKRYKHLLVQHHYISGIIAG 327 SPW + +K+ K L Q + + Sbjct: 167 KSPWYNGYLHQKPNIDLPFPNPHKRYKQAAKRLFKQGNKKQAWLYY 212 >UniRef50_Q92VQ2 Putative lipopolysaccharide 1,3-galactosyltransferase n=1 Tax=Sinorhizobium meliloti RepID=Q92VQ2_RHIME Length = 337 Score = 139 bits (351), Expect = 1e-31, Method: Composition-based stats. 
Identities = 61/342 (17%), Positives = 107/342 (31%), Gaps = 28/342 (8%) Query: 9 IDKVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYN 68 +DK + + + S + D NY + S + + + + Sbjct: 1 MDKGAVFPSNWQSSSGSAA--IVLVTDQNYALPTFSAALSADQHTKGADTAIRMFVVGAE 58 Query: 69 DGFFQKIAKLAEQNQLRITLYRINT-DKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLL 127 D + ++ + ++++ R+ +L R + LL +DR L Sbjct: 59 DTWARQFDEAVAGTKIKVIAARLPQLAELSPYHRDHYLPPIALARFWIDSLLDAGVDRFL 118 Query: 128 YLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELL---------GQY 178 Y+D D + G++ LL D + VSR +L Y Sbjct: 119 YIDGDTMVDGELDSLLASTPPAEGLMAAPDFLNIFMDEVSRGKKRDLAHLEGIGCRPETY 178 Query: 179 FNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIY 238 FNSGV+Y + W D + A+ ++ DQ +N +G L YN Sbjct: 179 FNSGVIYASREAWND--IVPVAMKFMVEHPEHCPASDQSALNHAARGRVTMLSLRYNYQS 236 Query: 239 TIKSELKDKTHQNYKKLITESTLLIHYTGATKPWHK--WAIYPSVKYYKIALENSPWKDD 296 L + + H+TG KPW+ W S Y A E Sbjct: 237 EHMMVLDPRRRGI-------GPAIWHFTGGPKPWNTPGWPWDESFNRYYCAAEMRLHGST 289 Query: 297 SPRDAKSIIEFKKRYKHLLVQHHYISGIIAGVCYLCRKYYRK 338 + + H ++ + Y RK R+ Sbjct: 290 IVTPVPPEAQTRAGIAHRRRSRSRMTWV-----YPWRKITRR 326 >UniRef50_B9KUH7 Lipopolysaccharide 3-alpha-galactosyltransferase n=1 Tax=Rhodobacter sphaeroides KD131 RepID=B9KUH7_RHOSK Length = 304 Score = 139 bits (351), Expect = 1e-31, Method: Composition-based stats. 
Identities = 56/267 (20%), Positives = 95/267 (35%), Gaps = 29/267 (10%) Query: 30 VAYGVDANYLDGVG-VSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITL 88 + D L + T + + L F+ +D + + + L Sbjct: 13 IVLITDDRMLKPTLFTAWTMLRRFRGNAELHFW--GSALDDWHWSMVE-HVVSCNANVVL 69 Query: 89 YRI---NTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHL 145 + D P S RL + L R+LYLD DV D+S L L Sbjct: 70 CPLRLEEADLAGAKPVGTYISETTMGRLLIPRKL---TGRVLYLDGDVRVVDDLSPLFSL 126 Query: 146 GLNGAVAAVVKDV---------EPMQEKAVSRLSDPELL------GQYFNSGVVYLDLKK 190 + G A V+D EP++ + +R+ + YFN+GV+ LD Sbjct: 127 DMRGFPLAGVRDYVVSKRLARGEPVKVRNRARIEEEARCMSGADASTYFNAGVLLLDASA 186 Query: 191 WADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYT---IKSELKDK 247 A A+ L + + + DQD +N + G + YN+ ++ + ++ Sbjct: 187 IAADHSLCSAMQDL-DRASKWTLGDQDHLNNVFAGRVRLIDPAYNSSWSRTPRQRRYVER 245 Query: 248 THQNYKKLITESTLLIHYTGATKPWHK 274 +L +IH+ G KPW K Sbjct: 246 LGPAPAELTYAPDAIIHFHGPAKPWKK 272 >UniRef50_A9SH80 Predicted protein n=2 Tax=Physcomitrella patens subsp. patens RepID=A9SH80_PHYPA Length = 527 Score = 139 bits (350), Expect = 2e-31, Method: Composition-based stats. 
Identities = 64/312 (20%), Positives = 122/312 (39%), Gaps = 49/312 (15%) Query: 10 DKVKAWDFRLANINTS--ECLNVAYGVDANYLDGVGVSITSIVLNNRH-INLDFYIIADV 66 + K ++ LA + + + +++ D + V + S + N H L F+++ Sbjct: 175 NDEKHDEYTLAFLKKASEQVVHIFVSTDGADFRPLAVLVNSTISNAVHPERLHFHLVLPA 234 Query: 67 YNDGFFQKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRA------MYFRLFAFQLLG 120 + + +A + ++ I I+ ++ + S+A +Y FA LL Sbjct: 235 SHHSRAKHLAAFFQDTKIDIVSENIDFKDMEKHITFRKNSKARPELQSVY--NFAPFLLP 292 Query: 121 ---LTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQE---------KAVSR 168 + R +YLDAD+V KG+I +L+ + L AA V+D E K +R Sbjct: 293 LHFKDVGRFIYLDADIVVKGNIEELIQIDLGNRAAAAVEDCSQTFETYFDFNELAKIQAR 352 Query: 169 LSDP--------ELLGQYFNSGVVYLDLKKWADAKLTEKAL----SILMSKDNVYKYP-D 215 P + FN GV+ +D +W ++TE L ++ +YKY Sbjct: 353 PEKPTWVPTEPIKPDACVFNRGVLVIDTNQWIKQQVTEAILWWMDEFQSAESVLYKYGLS 412 Query: 216 QDVMNVLLKGMTLFLPREYNTIYTIKSEL-------------KDKTHQNYKKLITESTLL 262 Q + L G + L +N ++E + + L ++ + Sbjct: 413 QPPFLLALYGKYMKLDTPWNVRGLGRNEFSEREREFLESKYGHKPERKPFISLDADTAKI 472 Query: 263 IHYTGATKPWHK 274 +H+ G KPW + Sbjct: 473 LHFNGKFKPWKQ 484 >UniRef50_Q1CSY7 Lipopolysaccharide 1,2-glycosyltransferase n=3 Tax=Helicobacter pylori RepID=Q1CSY7_HELPH Length = 341 Score = 138 bits (349), Expect = 2e-31, Method: Composition-based stats. 
Identities = 43/272 (15%), Positives = 96/272 (35%), Gaps = 47/272 (17%) Query: 58 LDFYIIADVYNDGFFQKIAKLAEQNQL--RITLYRINTDKLQ------------CLPCTQ 103 + + D + +K+ + I I+ + + Sbjct: 21 YQIHCLVDSLSAENVEKLKRTMSPFSTFSGIEFCDISKNDAYPFKLVSQLFLRLNPFAKK 80 Query: 104 VWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKD------ 157 +S+ + RL + ++++ D D + GDIS+ + ++G K+ Sbjct: 81 RFSKMILCRLLLASIFS-QYEKIIMFDVDTLFVGDISESFFIPMDGVYFGATKEDFSLIG 139 Query: 158 VEPMQEKAVSRLSDPELLG------------------QYFNSGVVYLDLKKWADAKLTEK 199 + + SRL+ +G FN+G + ++L W + L EK Sbjct: 140 IHNANDLFSSRLNWSRGMGVKLNHKSLIFQEVEILYENPFNAGFMLVNLALWREHHLEEK 199 Query: 200 ALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITES 259 + ++D P+QD+ ++ +G L +P +YN + Sbjct: 200 LIDFFKTRDEGLLLPEQDLFVLVCQGCILEMPCKYNVHPRMVGTRMIPKK--------SD 251 Query: 260 TLLIHYTGATKPWHKWAIYPSVKYYKIALENS 291 ++H+ KPW + S +++++A + S Sbjct: 252 ACMLHFYADEKPWKHFRYPYSKEWHQVAFKTS 283 >UniRef50_C7TIE0 Glycosyl transferase, group 8 n=2 Tax=Lactobacillus rhamnosus RepID=C7TIE0_LACRL Length = 286 Score = 138 bits (348), Expect = 3e-31, Method: Composition-based stats. 
Identities = 60/274 (21%), Positives = 121/274 (44%), Gaps = 20/274 (7%) Query: 30 VAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDG---FFQKIAKLAEQNQLR 85 V + V +++ G +I S+VL+ +I L ++AD Y + + + I K + + Sbjct: 6 VLFTVTGSHIQLTGTAIASLVLHWPVNIPLRILVMADDYLNQDIFWLKSIPKQLLRPNIT 65 Query: 86 ITLYRIN--TDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLL 143 + +++ D++ + + +RLFA + DRLLYLD DV+ DIS + Sbjct: 66 VDVWQKPSIMDQVHTANTNTRYPSVVLWRLFAPYIFS-DTDRLLYLDNDVLICDDISPMF 124 Query: 144 HLGLNGAVAAVVKDVEPM---QEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKA 200 + + V D + + K S + + YFNSGV+ ++ K+ A ++ Sbjct: 125 DMLPDDKAIGAVNDFQTLLYADTKEGSIWPEIKHFDSYFNSGVLLINTHKYIQAYTQDQL 184 Query: 201 LSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYK-----KL 255 ++ + + D Y + DQ ++N L + ++ LP +YN + H N K + Sbjct: 185 VNTINTSD--YSFIDQTILNNLFESQSIHLPLQYNYQKDDEWLNGYALHYNLKQAKKMQA 242 Query: 256 ITESTLLIHYTGATK--PW-HKWAIYPSVKYYKI 286 + ++ H+ + PW H ++ + + Sbjct: 243 ARKKVVIRHFVSEIRSLPWEHGYSRDEFEQNFWR 276 >UniRef50_A7H2X4 Glycosyl transferase family 8 n=2 Tax=Campylobacter RepID=A7H2X4_CAMJD Length = 497 Score = 138 bits (348), Expect = 3e-31, Method: Composition-based stats. 
Identities = 68/339 (20%), Positives = 125/339 (36%), Gaps = 58/339 (17%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRH------------------INLDFYIIADVYND 69 L++ GV A Y+ V I SIV + F+I + + Sbjct: 2 LHICIGVSAEYVKYSAVLINSIVKATQKPFDLKPYENNLSFTKDLKEGFCFHIFTEYKS- 60 Query: 70 GFFQKIA----KLAEQNQLRITLYRINTDKLQCLP-CTQVWSRAMYFRLFAFQLLGLTLD 124 +KIA KL+E + ++ +N Q + AM++++ +L +D Sbjct: 61 EDTEKIALLAHKLSEIYPTKCLIHVMNNQDFQDFSYPFWCQNAAMFYKIKVVDILK-DVD 119 Query: 125 RLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPM-------QEKAVSRLSDPELLGQ 177 + L++ AD+ GD+ L L L + A D + K Sbjct: 120 KCLFIGADLFALGDVRDLFALDLKDNLIAAALDTYNFDGYLRKAKAKNSDEELVFNDAKN 179 Query: 178 YFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTI 237 Y N+ ++ ++LK+W L K + L D D DV ++ L +YN I Sbjct: 180 YINNDMMLINLKEWRKQNLQAKYIDYLNKYD---LAGDLDVFPLVCAPKIHILSSKYNFI 236 Query: 238 --------YTIKSELKDKTHQNY-------KKLITESTLLIHYTGA-TKPWHKWAIYPSV 281 + +++ LKD++ + + I + L+H+ KPW A Sbjct: 237 LGYYTRESFGLENTLKDESDKPVWNFTKVELEQIQKDLRLVHFCHYVYKPWMS-AYNCHY 295 Query: 282 KYYKIALENS------PWKDDSPRDAKSIIEFKKRYKHL 314 Y+ + L+N P+ + A F++ + +L Sbjct: 296 VYFNMGLDNDLKPIKVPYYKEWWDMALKTPFFEEDFANL 334 >UniRef50_A2RLV8 Putative glycosyltransferase n=1 Tax=Lactococcus lactis subsp. cremoris MG1363 RepID=A2RLV8_LACLM Length = 397 Score = 138 bits (347), Expect = 4e-31, Method: Composition-based stats. 
Identities = 59/292 (20%), Positives = 115/292 (39%), Gaps = 21/292 (7%) Query: 30 VAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITL 88 + Y V+ NY+ V S+TSI++N + +D I+++ D Q + ++ + + ++ L Sbjct: 6 IFYTVNGNYIQLVATSLTSIIMNIDEKFPVDIIIVSNDITDENKQTLYEILDMRKTQVNL 65 Query: 89 -YRINTDKLQCL--PCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHL 145 +R+ D L+ L + ++ + +R+F L +LLYLD+D + ++ L Sbjct: 66 LFRMPPDSLELLLGDVSNIFDNVVCWRIFMPYSL-EEYSQLLYLDSDTLIYEGFEEIFGL 124 Query: 146 GLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILM 205 + V+ D + YFNSGV ++++K+ E+ L LM Sbjct: 125 LPQDKILGVIPDFYFFAINEKN-----SSKRGYFNSGVYMINVEKYIQKNSKEELLKNLM 179 Query: 206 SKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSE-----LKDKTHQNYKKLITEST 260 + Y DQ +N +G +LP +N L+ + Sbjct: 180 ENFSEILYVDQTFLNNTFRGELFYLPLRFNYQKDDNWLNNWAILEAPESSQLFIKERANI 239 Query: 261 LLIHYT---GATKPW-HKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFK 308 + H+ + PW H +Y+ K+ + + I K Sbjct: 240 KIRHFIEFGSHSMPWQHIEVRDQFEEYFWNVWNVL--KEYRVKKHRPIKSLK 289 >UniRef50_A9S2B3 Predicted protein (Fragment) n=1 Tax=Physcomitrella patens subsp. patens RepID=A9S2B3_PHYPA Length = 275 Score = 137 bits (345), Expect = 6e-31, Method: Composition-based stats. 
Identities = 47/271 (17%), Positives = 101/271 (37%), Gaps = 35/271 (12%) Query: 24 TSECLNVAYGVDANYLDGVGVSITSIVLNNR-HINLDFYIIADVYNDGFFQKIAKLAEQN 82 +++A +DANYL G +I SI+L+ N+ F+ +A K ++ Sbjct: 7 NESLVHIAMTLDANYLRGSMAAIYSILLHAECASNVRFHFVA---TKEKKNK-----CKS 58 Query: 83 QLRITLYRINTDKLQCLPCTQVWS---RAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDI 139 R +Y + + L+ + + Y R + ++ + R++YLD DV+ G I Sbjct: 59 FCRSAMYFYSCELLKLIYSSDFVITQEPLNYARFYLAHMIDSCVKRIIYLDLDVLVLGRI 118 Query: 140 SQLLHLGLNGAVAAVVK--DVEPMQEKAVSRLSDPELLG-------QYFNSGVVYLDLKK 190 +L + + + + + L YFNSG++ ++L++ Sbjct: 119 EELWMTNMGNSTVGTPEYCHANFPSYFTENFWINSSLASTFANKQPCYFNSGMMLINLER 178 Query: 191 WADAKLTEKALSILM--SKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKT 248 W + T + + ++Y+ + + G + +N +K Sbjct: 179 WRKTRCTSTLEYWMEVQKQQHIYELGSLPPLLLTFAGSIQAIDNRWNQHGLGGDIVKGDC 238 Query: 249 HQNYKKLITESTLLIHYTGATKPWHKWAIYP 279 +H++G KPW + ++ Sbjct: 239 RS------------LHWSGGGKPWRRLDMHQ 257 >UniRef50_A4SAB5 Predicted protein (Fragment) n=1 Tax=Ostreococcus lucimarinus CCE9901 RepID=A4SAB5_OSTLU Length = 259 Score = 136 bits (344), Expect = 8e-31, Method: Composition-based stats. 
Identities = 53/264 (20%), Positives = 107/264 (40%), Gaps = 28/264 (10%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHIN-LDFYIIA--DVYNDGFFQK------IAKL 78 +++A+ D L +G I+S++ + + F+I D D Q I + Sbjct: 3 VHIAFACDPTQLFTLGPVISSVLSATASPHRIRFHIFTARDALTDASVQLNCYSRAIPFI 62 Query: 79 AEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGD 138 E ++ + R N + ++ + Y R + ++L + +++YLD D++ KGD Sbjct: 63 WELHEFSKDMIRANI-TVHSRKEWRLQNAFNYARFYFAEILS-DVQKVVYLDTDIIVKGD 120 Query: 139 ISQLLHLGLNGA---VAAVVKDVEPM-----QEKAVSRLSDPELLGQYFNSGVVYLDLKK 190 I +L L + V A VK P+ A + S FN+GV+ +DL+ Sbjct: 121 ICRLHDANLRSSSTSVIAAVKRSVPLGSLLNFSNAAVKSSGLREKMHSFNAGVLLIDLES 180 Query: 191 WADAKLTEKALSILMSK--DNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKT 248 W ++T + L +Y + Q + ++ +P +N Sbjct: 181 WRRKRITSTVETWLKMNSVSKLYSHGSQPPLLLVFGDSFESIPSHWNV-------DGVGY 233 Query: 249 HQNYKKLITESTLLIHYTGATKPW 272 + + + ++H++G +KPW Sbjct: 234 KKGLRASVLNEARVLHWSGQSKPW 257 >UniRef50_UPI0001B55E75 hypothetical protein SSPB78_11600 n=1 Tax=Streptomyces sp. SPB78 RepID=UPI0001B55E75 Length = 792 Score = 134 bits (337), Expect = 6e-30, Method: Composition-based stats. 
Identities = 68/272 (25%), Positives = 107/272 (39%), Gaps = 38/272 (13%) Query: 5 PAIEIDKVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIA 64 PA + + A D R ++ + A VD NYL G + S+ L+N + DF ++ Sbjct: 16 PAAPVREAAADDVR--DLTGKRRVAFASFVDENYLPGFLALLRSLALSNPEVCEDFLVLH 73 Query: 65 DVYNDGFFQKIAKLAEQNQLRITLYRIN---TDKLQCLPCTQVWSRAMYFRLFAFQLLGL 121 D +I L RI R++ D R YF L F++ Sbjct: 74 DGLRPASLARIRAL----HPRIRPRRVDAARYDAYAKGDQNNYLVRKAYFLLDVFRV--R 127 Query: 122 TLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNS 181 D ++ LD D+V GD+S+LL L A V K NS Sbjct: 128 DYDTIITLDTDMVVLGDLSELLRLR---EGLAAVPQFFYGTHKL--------------NS 170 Query: 182 GVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIK 241 G++ + + +DA E+ ++ DQ ++N +L G + LP YN + Sbjct: 171 GLLVIQREFLSDA-FCERIDETGLAGAYELDKHDQGILNAVLDGDFVRLPARYNFV---- 225 Query: 242 SELKDKTHQNYKKLITESTLLIHYTGATKPWH 273 K + K + E T ++H+TG KPW Sbjct: 226 -----KRRLSGDKPVPEDTAVLHFTGRHKPWQ 252 >UniRef50_C3XN62 Glycosyl transferase n=1 Tax=Helicobacter winghamensis ATCC BAA-430 RepID=C3XN62_9HELI Length = 284 Score = 133 bits (334), Expect = 1e-29, Method: Composition-based stats. 
Identities = 52/227 (22%), Positives = 89/227 (39%), Gaps = 21/227 (9%) Query: 127 LYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDP--ELLGQYFNSGVV 184 +YLD D++ D+ ++ + L G + V D + + + P L YFN+G++ Sbjct: 1 MYLDVDMLVLKDLREIFAIDLEGKICGAVLDYKANRILEPKNKALPMLNLSKDYFNAGLL 60 Query: 185 YLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSEL 244 +DL+KW KL K + L K DQ +NV+LK LP +NT+ Sbjct: 61 LIDLEKWKSQKLESKLIETLNQYH--CKEHDQSALNVVLKDKIKILPLSWNTLVYYYVNA 118 Query: 245 KDKTHQNYKKLIT---------ESTLLIHYTGATKPWHKWAIYPSVK------YYKIALE 289 K L ++ ++HY KPW+ IY +K ++ +E Sbjct: 119 KACDDTKNFNLFYTRKDLNKALKNPHILHYYLGFKPWNDDKIYTDIKGEFLGEHWWNMVE 178 Query: 290 NSPWKDDSPRDAKSIIEFKKRYKHLLVQHHYISGIIAGVCYLCRKYY 336 +P D K+ K+ K + + + Y +Y Sbjct: 179 KTPEFKDMIIPLKTKA--SKKAKLQVSLGYTLLTFARYKLYFLIPFY 223 >UniRef50_C3YRN2 Putative uncharacterized protein (Fragment) n=1 Tax=Branchiostoma floridae RepID=C3YRN2_BRAFL Length = 305 Score = 133 bits (334), Expect = 1e-29, Method: Composition-based stats. Identities = 56/317 (17%), Positives = 116/317 (36%), Gaps = 44/317 (13%) Query: 26 ECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLR 85 + + V D L G +I SI N++ + FY+I D ++ + + Sbjct: 1 DTIPVVISTDEGRLMGAVAAINSIATNSKS-PVKFYLITDKDTKDHLEQWILKTRLHSIN 59 Query: 86 ITLYRINTDKLQCLP-----CTQVWSRAMYFRLFAFQLLGLTLD-RLLYLDADVVCKGDI 139 + N + ++ ++ S Y R + +LL + ++LYLD DV+ +GDI Sbjct: 60 HEIIVFNEEWVKGKINVRGGRQELASPLNYARFYLPKLLPPDFNGKILYLDDDVIVQGDI 119 Query: 140 SQLLHLGLNGA-VAAVVKDVEPMQEK---------AVSRLSDPELLG-------QYFNSG 182 +QL + ++ V A +D + + + + FN+G Sbjct: 120 TQLYNTKIDETLVMAFSEDCNTVSNRFGLFMNTYANYINFGNENVKKLGMKPGTCSFNTG 179 Query: 183 VVYLDLKKWADAKLTEKA--LSILMSKDNVY-----KYPDQDVMNVLLKGMTLFLPREYN 235 V ++ +W + K+T K + L +++NVY Q M ++ + ++ Sbjct: 180 VFVANMTEWKNQKITTKLEFWTALNTEENVYGAQQGGGGSQPPMMIVFYNQYSKIDPMWH 239 Query: 236 TIYTIKSELKDKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKD 295 + Y K L+H+ G KPW + + + W+ Sbjct: 240 IRH--LGLYSWTAGTRYSKQFIMEAKLLHWNGRFKPWGRTSQHMDA-----------WER 286 Query: 296 
DSPRDAKSIIEFKKRYK 312 D + ++++ Sbjct: 287 YYIPDPTGKSQLTRKFR 303 >UniRef50_D1IU75 Whole genome shotgun sequence of line PN40024, scaffold_5.assembly12x (Fragment) n=7 Tax=Magnoliophyta RepID=D1IU75_VITVI Length = 364 Score = 132 bits (333), Expect = 1e-29, Method: Composition-based stats. Identities = 51/261 (19%), Positives = 108/261 (41%), Gaps = 24/261 (9%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNR-HINLDFYIIADVYNDGFFQKIAKLAEQNQLRI 86 ++VA +D +YL G ++ SI+ +++ ++ F+ + ++ + + + + QL+ Sbjct: 63 VHVAITLDVHYLRGSMAAVHSILQHSQCPEDIFFHFL---VSETHLEILVR-STFPQLKF 118 Query: 87 TLYRINTDKLQCLPCTQVW----SRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQL 142 +Y N + ++ L T V Y R + LL + R++YLD+D++ DI +L Sbjct: 119 KVYYFNPEIVRNLISTSVREALEHPLNYARNYLADLLEPCVRRVIYLDSDLIVVDDIYKL 178 Query: 143 LHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLG---------QYFNSGVVYLDLKKWAD 193 L + + + E YFN+GV+ +DL KW Sbjct: 179 WSTSLGTRTIGAPEYCHANFTRYFTDKFWSEKRYYGTFDGRKPCYFNTGVIVIDLAKWRR 238 Query: 194 AKLTEKALSILM--SKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQN 251 T++ + + +Y+ ++ G + +N +K + Sbjct: 239 FGFTKRIERWMEVQKNNRIYELGSLPPYLLVFAGHVAPIEHRWNQHGLGGDNVKG----S 294 Query: 252 YKKLITESTLLIHYTGATKPW 272 ++L L+H++G+ KPW Sbjct: 295 CRELHPGPVSLLHWSGSGKPW 315 >UniRef50_Q68CQ7 Glycosyltransferase 8 domain-containing protein 1 n=45 Tax=Euteleostomi RepID=GL8D1_HUMAN Length = 371 Score = 129 bits (325), Expect = 1e-28, Method: Composition-based stats. 
Identities = 58/331 (17%), Positives = 103/331 (31%), Gaps = 51/331 (15%) Query: 17 FRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIA 76 R A E + V + L G +I SI +N N+ FYI+ + Sbjct: 55 LRHAVDGRQEEIPVVIAASEDRLGGAIAAINSI-QHNTRSNVIFYIVTLNNTADHLRSWL 113 Query: 77 KLAEQNQLRITLYRINTDKLQ-----CLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDA 131 +R + + L+ + + R + L+ + + +Y+D Sbjct: 114 NSDSLKSIRYKIVNFDPKLLEGKVKEDPDQGESMKPLTFARFYLPILVP-SAKKAIYMDD 172 Query: 132 DVVCKGDISQLLHLGLN-GAVAAVVKDVEPMQEKAVSRLSDPELL--------------- 175 DV+ +GDI L + L G AA +D + K V R + + Sbjct: 173 DVIVQGDILALYNTALKPGHAAAFSEDCDSASTKVVIRGAGNQYNYIGYLDYKKERIRKL 232 Query: 176 -----GQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPD-------QDVMNVLL 223 FN GV +L +W +T + + Y + ++ Sbjct: 233 SMKASTCSFNPGVFVANLTEWKRQNITNQLEKWMKLNVEEGLYSRTLAGSITTPPLLIVF 292 Query: 224 KGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKY 283 + +N L + Y ++ L+H+ G KPW + A Y V Sbjct: 293 YQQHSTIDPMWNV-----RHLGSSAGKRYSPQFVKAAKLLHWNGHLKPWGRTASYTDV-- 345 Query: 284 YKIALENSPWKDDSPRDAKSIIEFKKRYKHL 314 W+ D +RY + Sbjct: 346 ---------WEKWYIPDPTGKFNLIRRYTEI 367 >UniRef50_Q2RB54 Glycosyl transferase family 8 protein, expressed n=11 Tax=Poaceae RepID=Q2RB54_ORYSJ Length = 642 Score = 129 bits (324), Expect = 2e-28, Method: Composition-based stats. 
Identities = 62/285 (21%), Positives = 115/285 (40%), Gaps = 43/285 (15%) Query: 42 VGVSITSIVLNNRHI-NLDFYIIADVYN-------------------------------D 69 V +I S V+N++ ++ F++ D N D Sbjct: 354 VSTTINSTVMNSKDSGSIVFHLFTDSQNFYAMKHWFDRNMYLEATVHVTDIEDHQKLSKD 413 Query: 70 GFFQKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYL 129 F + L + R+T + Q T+ S + LL +L+R++ L Sbjct: 414 VDFHDMKLLRPAEEFRVTFRN-HYQSFQKQMKTEYISTFGHSHFLLPDLLP-SLNRVVVL 471 Query: 130 DADVVCKGDISQLLHLGLNGAVAAVVK--DVEPMQEKAVSRLSDPELLGQYFNSGVVYLD 187 D D++ + D+S L +L + G V ++ +V+ Q KA + + + SG+ ++ Sbjct: 472 DDDLIVQKDLSSLWNLNMGGKVVGAIQFCEVKLGQLKAYTEERNFGTNSCVWLSGLNVVE 531 Query: 188 LKKWADAKLTEKALSILMS--KDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELK 245 LKKW D +T + +L KD+V +P + + LL L P E + + + Sbjct: 532 LKKWRDLHITSRYDQLLQKLQKDSVTSFPLKVLPISLLVFQDLIYPLEDSWVQSGLGHDY 591 Query: 246 DKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALEN 290 + + K+ +T +HY G KPW I+ Y++ + N Sbjct: 592 GVSQTDIKRSVT-----LHYNGVMKPWLDLGIHDYKGYWRKYMTN 631 >UniRef50_Q8LF94 Avr9/Cf-9 rapidly elicited protein 231 n=13 Tax=Magnoliophyta RepID=Q8LF94_ARATH Length = 351 Score = 124 bits (311), Expect = 5e-27, Method: Composition-based stats. 
Identities = 49/266 (18%), Positives = 101/266 (37%), Gaps = 21/266 (7%) Query: 24 TSECLNVAYGVDANYLDGVGVSITSIVLNNR-HINLDFYIIADVYNDGFFQKIAKLAEQN 82 + +++A +DA Y+ G ++ S++ ++ N+ F+ +A D + + Sbjct: 61 SRRAVHMAMTLDAAYIRGSVAAVLSVLQHSSCPENIVFHFVASASADASSLRATISSSFP 120 Query: 83 QLRITLYRINTDKLQCLPCTQVW----SRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGD 138 L T+Y N + L + + Y R + LL + R++YLD+D++ D Sbjct: 121 YLDFTVYVFNISSVSRLISSSIRSALDCPLNYARSYLADLLPPCVRRVVYLDSDLILVDD 180 Query: 139 ISQLLHLGLNGAVAAVVKD----------VEPMQEKAVSRLSDPELLGQYFNSGVVYLDL 188 I++L L + L+ + YFN+GV+ +DL Sbjct: 181 IAKLAATDLGRDSVLAAPEYCNANFTSYFTSTFWSNPTLSLTFADRKACYFNTGVMVIDL 240 Query: 189 KKWADAKLTEKALSI--LMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKD 246 +W + T + + + +Y+ ++ G+ + +N D Sbjct: 241 SRWREGAYTSRIEEWMAMQKRMRIYELGSLPPFLLVFAGLIKPVNHRWNQ----HGLGGD 296 Query: 247 KTHQNYKKLITESTLLIHYTGATKPW 272 + L L+H++G KPW Sbjct: 297 NFRGLCRDLHPGPVSLLHWSGKGKPW 322 >UniRef50_Q062P6 DNA mismatch repair protein n=1 Tax=Synechococcus sp. BL107 RepID=Q062P6_9SYNE Length = 281 Score = 124 bits (311), Expect = 6e-27, Method: Composition-based stats. 
Identities = 47/246 (19%), Positives = 98/246 (39%), Gaps = 19/246 (7%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 +++ +D + V++TS +L++R + ++ D ++ +A + Sbjct: 1 MHLLLALDQGFEPLAAVALTSYLLHHRFSS----VVLVTPADQRMHQLEGIAASFECPCR 56 Query: 88 LYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGL 147 RI T+ + +F + A Q R LY+DAD +C + L L L Sbjct: 57 HQRIATESALHRLPADLQPY--FFCIEALQ--QREPGRYLYVDADTLCVAGLETLEQLPL 112 Query: 148 NGA-VAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMS 206 G A PM ++ + + E YFN+G++ D + L E+ + + Sbjct: 113 GGTTPLAACSHGRPMPDRTL--VLGLEGPYHYFNAGILLFDSVSLNEVLLPEQVVDYYLQ 170 Query: 207 KDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYK--------KLITE 258 + + ++ +Q +N LL G FLP +YN + +++ + + + Sbjct: 171 HEALCRFREQCSLNALLSGQVQFLPGQYNVLSWMRARQSSSPWHDVASNPMAYCLPDVRD 230 Query: 259 STLLIH 264 ++H Sbjct: 231 KKAIVH 236 >UniRef50_B4WN64 Glycosyl transferase family 8 n=1 Tax=Synechococcus sp. PCC 7335 RepID=B4WN64_9SYNE Length = 289 Score = 123 bits (309), Expect = 1e-26, Method: Composition-based stats. 
Identities = 62/289 (21%), Positives = 109/289 (37%), Gaps = 35/289 (12%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHI----NLDFYIIADVYNDGFFQKIAKLA--EQ 81 +++A V+ + V I SI+ N H L F I+ + FF++ K A + Sbjct: 3 VDIALSVNRTLQVPLLVVINSILTNTTHRTEEVPLRFNIVVPIGESAFFEEELKQAFSAK 62 Query: 82 NQLRITLYRIN------------TDKLQCLPCTQVWSRAM-YFRLFAFQLLGLTLDRLLY 128 +R+ +K + + SR M Y RLF + + R++Y Sbjct: 63 YDCERVEFRVKEFTPPSYLKQYLDNKFREKKQERRLSRYMQYARLFFKDVFP-DIARMIY 121 Query: 129 LDADVVCKGDISQLL---HLGLNGAVAAVVKDVEP---MQEKAVSRLSDPELLGQYFNSG 182 DAD++ G++ L ++ + A V P + SD FNSG Sbjct: 122 FDADIIVLGNVRSLFTQGNILTSQNYLAAVPQFFPAIFYFSNPLKVFSDLRKFKSTFNSG 181 Query: 183 VVYLDLKKWADA--KLTEKALSILMSKDNV-YKYPDQDVMNVLLKGMTLFLPREYNTIYT 239 V+ DL W D KL + L + + Y D+ V N++ K + L +++N Sbjct: 182 VLLTDLSFWTDQTYKLLKHYLELDEKNNYRLYHLGDETVFNLMFKDTYIPLTKQWNCCGY 241 Query: 240 IKSELKDKTHQNYKKLITESTLLIHYT-GATKPWHKWAIYPSVKYYKIA 287 ++ K + + IH++ G KPW ++ Sbjct: 242 GQAHWVAKLLWKNPENMKA----IHWSGGHHKPWQS-KQVIYSDLWRSY 285 >UniRef50_UPI0001621115 predicted protein n=1 Tax=Physcomitrella patens subsp. patens RepID=UPI0001621115 Length = 1016 Score = 122 bits (307), Expect = 2e-26, Method: Composition-based stats. 
Identities = 60/347 (17%), Positives = 110/347 (31%), Gaps = 91/347 (26%) Query: 16 DFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFF-QK 74 + ++ E ++V D L + V I S + N H FY + Y+ ++ Sbjct: 398 EDEPIDVVKREDIHVFVCTDEADLRPLAVLINSSMANCPHPERLFYHLVMPYSQRNAAKR 457 Query: 75 IAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRA------MYFRLFAFQLLGLTLD---R 125 + L ++ + I+ +++ + + A Y F L T R Sbjct: 458 LKHLFPNARVEMAEKYIDIREVEEHITFRNDTGARKELVSPY--NFLPFYLPKTYSEIRR 515 Query: 126 LLYLDADVVCKGDISQLLHLGLNGAVAAVVKDV--------EPMQEKAVSRLSDPELLG- 176 ++YLD+D+V KG++ L + L G A ++D + Q + + P+ Sbjct: 516 IIYLDSDIVVKGNLEVLNDVDLEGHSVAAIEDCSQRFQVYFDFAQLDEIHKRQGPDRPKW 575 Query: 177 ----------QYFNSGVVYLDLKKWADAKLTEKALSILMS-----KDNVYKYP------- 214 FN GV+ +D +W + +T+ + + K +YKY Sbjct: 576 LPDEPFNKSACVFNRGVLIIDTNQWIEQNITKAIVWWMDEFRKADKKALYKYALYQKRVH 635 Query: 215 -----------------------------------DQDVMNVLLKGMTLFLPREYNTIYT 239 Q + L G L +N Sbjct: 636 KNYFCASLSLICTSSMHFSQVLIVLWYFYPSRAGMSQPPFLLALYGKHKVLDETWNVRGL 695 Query: 240 IKSELKDKTHQNYKK-------------LITESTLLIHYTGATKPWH 273 + L D YKK + ++H+ G KPW Sbjct: 696 GRPNLSDMERIYYKKGWNYTFDRIPFMSPFADEANILHFNGKYKPWK 742 >UniRef50_C7TID9 Glycosyl transferase, group 8 n=2 Tax=Lactobacillus rhamnosus RepID=C7TID9_LACRL Length = 301 Score = 120 bits (301), Expect = 8e-26, Method: Composition-based stats. 
Identities = 50/251 (19%), Positives = 92/251 (36%), Gaps = 17/251 (6%) Query: 30 VAYGVDANYLDGVGVSITSIVL-NNRHINLDFYIIADVYNDGFF---QKIAKLAEQNQLR 85 V + V ++ V +ITS+V + + +I + N + I L + Q+ Sbjct: 6 VVFCVKGLHIMLVATAITSLVKKYHSDREMKILVIIEGGNQDDINFIRSIPSLYGKQQIS 65 Query: 86 ITLYRINTDKLQC----LPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQ 141 + + L + + +RLF D++ Y+D D++ DI+ Sbjct: 66 VDFWAPPYPLLDKVSDQFETGTSLPKMVLWRLFLPYYFP-DYDQIAYMDNDILITTDIND 124 Query: 142 LLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQ--YFNSGVVYLDLKKWADAKLTEK 199 L L V V D E + R + L Y N+GV + + EK Sbjct: 125 LFDQMLPEDVIGGVLDYEDVTHPDHDRSKEFYLPSTDQYINAGVFVANSNAYRSVVPFEK 184 Query: 200 ALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYT--IKSELKDKTHQNYK--KL 255 + I+ + Y Y DQ+++N+ LP +N Y + + + Q K + Sbjct: 185 MIEIINRHN--YPYGDQNILNIAFYNHIYLLPWRFNLQYDNRLLDKYESLAPQRIKGIRE 242 Query: 256 ITESTLLIHYT 266 +IH+ Sbjct: 243 QLNEPGIIHFA 253 >UniRef50_A9UXT0 Predicted protein (Fragment) n=1 Tax=Monosiga brevicollis RepID=A9UXT0_MONBE Length = 191 Score = 120 bits (300), Expect = 1e-25, Method: Composition-based stats. Identities = 43/194 (22%), Positives = 77/194 (39%), Gaps = 25/194 (12%) Query: 106 SRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLL-HLGL--NGAVAAVVKDVEPMQ 162 S A + R +LL L+R+LY+D D V +GD+ LL H+ L + +AAV + P+ Sbjct: 1 SSANFGRFMLPELLP-ELNRVLYIDIDTVVQGDLVALLAHMDLGDDDYLAAVPRPNVPLS 59 Query: 163 EKAVSRL------------SDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNV 210 + + +L FN+GV +L+ W L ++ L + Sbjct: 60 HFFGADIVRLHAELHPDPGQLLQLAAPSFNAGVAVWNLRAWRQRSLRDEVLYYMTKHHEH 119 Query: 211 --YKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTGA 268 + Y Q ++ ++ G L +N + + ++H++G Sbjct: 120 ALWDYGTQPILLLVCAGHWQPLDVRFNLDGLGY-------RTDVSTEALDGAYVLHWSGR 172 Query: 269 TKPWHKWAIYPSVK 282 KPW A+Y Sbjct: 173 RKPWQHDALYRQRW 186 >UniRef50_C7PRU3 Glycosyl transferase family 8 n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PRU3_CHIPD Length = 303 Score = 119 bits (299), Expect = 1e-25, Method: Composition-based stats. 
Identities = 53/316 (16%), Positives = 106/316 (33%), Gaps = 44/316 (13%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHI-NLDFYIIADVYNDGFFQKIAKLA--EQNQL 84 +++A+ +D L+G+G +ITS+V N LD + I + + L E Sbjct: 1 MHIAFVIDLPSLEGLGATITSLVRNCSDTAQLDLHFICNNLGTRHKNNLLMLLQTESYHG 60 Query: 85 RITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTL-----------DRLLYLDADV 133 R Y + ++ SR Y R +LL ++ D Sbjct: 61 RTRFYDFDAQEMFGHLSAVHGSRTSYGRFLIPKLLDADYVLCLDPDLLILLDVITFD--- 117 Query: 134 VCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELL---GQYFNSGVVYLDLKK 190 + A V L + Q F SG++ L+L++ Sbjct: 118 ----------QIRFEDHFLAAVPGGPFRNTLEAKLLPGQLSVCKDEQSFISGMLLLNLRR 167 Query: 191 WADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQ 250 W + + + I + + D V+N + G + +N I+T Sbjct: 168 WKERDICHEIEKICLRHGMALQEADNTVLNTICNGSFYHIEDRFNCIWTPGQATPS---- 223 Query: 251 NYKKLITESTLLIHYTGATKPWHKWA--IYPSVKYYKIALENSPWKDDSPRDAKSIIEFK 308 + ++H+ GA KPW ++ + + + + W R A + ++ Sbjct: 224 ------FKENAILHFAGAPKPWDFLGREVHAGYQRWADY-DTTFWDRRYKRVAFAGLQRI 276 Query: 309 KRYKHLLVQHHYISGI 324 + + L ++ Y+ Sbjct: 277 WKIRKSLFRY-YLKSF 291 >UniRef50_Q02ZT7 Lipopolysaccharide biosynthesis glycosyltransferase n=1 Tax=Lactococcus lactis subsp. cremoris SK11 RepID=Q02ZT7_LACLS Length = 759 Score = 115 bits (289), Expect = 2e-24, Method: Composition-based stats. 
Identities = 26/139 (18%), Positives = 55/139 (39%), Gaps = 5/139 (3%) Query: 26 ECLNVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKIAKLAEQNQL 84 + V + Y+ V + SI+ N N N D I+ + + + K ++ Sbjct: 604 NNIPVVMACNNGYMKYTSVLLQSILENANSKNNYDISILHNDISVETQNRTLKHFNKDNF 663 Query: 85 RITLYRIN--TDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQL 142 + ++ + L S Y+R +L D+++Y+D D V + DI++L Sbjct: 664 SVRFVDVSAKISQYGELKTNAHISVETYYRFLIPELFVH--DKVVYIDCDTVVEEDIAKL 721 Query: 143 LHLGLNGAVAAVVKDVEPM 161 + + V+D + + Sbjct: 722 FEIDIEDNYVGAVRDFDFI 740 >UniRef50_Q9H1C3 Glycosyltransferase 8 domain-containing protein 2 n=29 Tax=Euteleostomi RepID=GL8D2_HUMAN Length = 349 Score = 114 bits (287), Expect = 3e-24, Method: Composition-based stats. Identities = 52/305 (17%), Positives = 103/305 (33%), Gaps = 40/305 (13%) Query: 7 IEIDKVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADV 66 + K A D E + V A + +I SI +N N+ FY++ Sbjct: 30 GTVPKNDADDESETPEELEEEIPVVICAAAGRMGATMAAINSI-YSNTDANILFYVVGLR 88 Query: 67 YNDGFFQKIAKLAEQNQLRITLYRINTDKLQCLPCTQV-----WSRAMYFRLFAFQLLGL 121 +K + ++ ++ + N L+ + R + LL Sbjct: 89 NTLTRIRKWIEHSKLREINFKIVEFNPMVLKGKIRPDSSRPELLQPLNFVRFYLP-LLIH 147 Query: 122 TLDRLLYLDADVVCKGDISQLLHLGLN-GAVAAVVKDVEPMQEKAVSRLSDPELL----- 175 ++++YLD DV+ +GDI +L L G AA D + + ++RL + Sbjct: 148 QHEKVIYLDDDVIVQGDIQELYDTTLALGHAAAFSDDCDLPSAQDINRLVGLQNTYMGYL 207 Query: 176 ---------------GQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQ---- 216 FN GV+ ++ +W ++T++ + Y Sbjct: 208 DYRKKAIKDLGISPSTCSFNPGVIVANMTEWKHQRITKQLEKWMQKNVEENLYSSSLGGG 267 Query: 217 ---DVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTGATKPWH 273 M ++ G + ++ L Y + + L+H+ G KPW Sbjct: 268 VATSPMLIVFHGKYSTINPLWHI-----RHLGWNPDARYSEHFLQEAKLLHWNGRHKPWD 322 Query: 274 KWAIY 278 +++ Sbjct: 323 FPSVH 327 >UniRef50_A9UZX9 Predicted protein (Fragment) n=1 Tax=Monosiga brevicollis RepID=A9UZX9_MONBE Length = 1116 Score = 114 bits (287), Expect = 4e-24, Method: Composition-based stats. 
Identities = 49/227 (21%), Positives = 81/227 (35%), Gaps = 21/227 (9%) Query: 31 AYGVDANYLDGVGVSITSIVLNN-RHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLY 89 V +N+ + V I SI+ ++ L F++I D + + + +++ Y Sbjct: 882 VVAVGSNHARRLQVLIKSILFHHLPPQPLRFHVITDHETAASLRHLYRSWRLPAVQVRFY 941 Query: 90 RINTD----KLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHL 145 I L L R + +LF LL ++L+R++ LD D++ G I++L Sbjct: 942 SITAALQGVDLHGLETHHYAGRYAFVKLFVADLLPVSLERVMVLDTDLLFLGPIAELWD- 1000 Query: 146 GLNG---AVAAVVKDVEPMQE--KAVSRLSDPELLGQYFNSGVVYLDLKKWADAKL---- 196 G + V D + R P N+GV L L + Sbjct: 1001 QFKGWSASAIFAVVDNFSEWYIPGRLQRQPWPAPAPLGINTGVTLLHLARLRHQNFPKVW 1060 Query: 197 TEKALSILMSKDNVYKY---PDQDVMNVLLKGM---TLFLPREYNTI 237 T +L Y DQDVMN + LP +N Sbjct: 1061 TSAVARVLADPRLNITYAPLADQDVMNTVFYDNPTLLHRLPCRFNYQ 1107 >UniRef50_B3XPR6 Putative uncharacterized protein n=3 Tax=Lactobacillus RepID=B3XPR6_LACRE Length = 673 Score = 114 bits (285), Expect = 6e-24, Method: Composition-based stats. Identities = 42/239 (17%), Positives = 84/239 (35%), Gaps = 25/239 (10%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLY 89 + + + LD + + SI +NN + YII +F I E+ I Sbjct: 4 IVLCANYDKLDQIETVLKSIYINNNDVKT--YIINSDIAHEWFVNINYFLEKINSEIIDA 61 Query: 90 RINTDKLQCLPCTQVWSRA--MYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGL 147 +I+ ++ LP + + A Y + +L+ D++LYL + + ++ L + + Sbjct: 62 KIDLNRFNELPELKNANMAKIEYGKFLIPELI--NEDKVLYLGNNTIIDQNLDSLFAIDI 119 Query: 148 NGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSK 207 D + FN V++++ W + + + L + Sbjct: 120 EDKPLYATVDFVHPDK---------------FNMDVMFINNIYWRNNNIGNQFLELGKHY 164 Query: 208 DNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTI-KSELKDKTHQNYKKLITESTLLIHY 265 D V Q ++N + LP YN I + Y + + +I Y Sbjct: 165 DLVDA---QAMINDGFRVNIGKLPAIYNYQIGIGDPNFEPVISYRYYEDAIDDPAIIQY 220 >UniRef50_A5DLS6 Putative uncharacterized protein n=2 Tax=Pichia guilliermondii RepID=A5DLS6_PICGU Length = 390 Score = 113 bits (283), Expect = 9e-24, Method: Composition-based stats. 
Identities = 52/271 (19%), Positives = 92/271 (33%), Gaps = 35/271 (12%) Query: 34 VDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLYRINT 93 + +YL G ++ + + +D Q + + + I+ Sbjct: 9 TNESYLPGALTLAHTLRSLGTQYPVVVLLDETQVSDRSLQLLEAAYD------RIIPIS- 61 Query: 94 DKLQCLPCTQVWSRAMYFRLFAFQLL-GLTLDRLLYLDADVVCKGDISQLLHLG--LNGA 150 D+L P R F+ LL + D++LYLD DV+ ++ L G L Sbjct: 62 DRLVTSPVDDRLGRPELAVTFSKLLLWNESYDQILYLDTDVLPLANVDHLFDEGAALTPR 121 Query: 151 VAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNV 210 A D FNSGV+ D ++ + D+ Sbjct: 122 QIAASPDSG---------------WPDIFNSGVLLFK----PDPQVYSDLVEFASGSDSS 162 Query: 211 YKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTGATK 270 + DQ ++N G LP YN T + H+ +K ++HY G K Sbjct: 163 FDGADQGLLNEFFAGNWHRLPFLYNVTPTESYQYVPAFHRFFK-----DIKILHYIGQIK 217 Query: 271 PWHKWAIYPSVKYYKIALEN-SPWKDDSPRD 300 PWH +++ + + S + D ++ Sbjct: 218 PWHSSTNIDHFRFHHLWWDRFSEFFDKETKN 248 >UniRef50_A5BZU1 Putative uncharacterized protein n=1 Tax=Vitis vinifera RepID=A5BZU1_VITVI Length = 648 Score = 113 bits (283), Expect = 1e-23, Method: Composition-based stats. 
Identities = 49/258 (18%), Positives = 95/258 (36%), Gaps = 28/258 (10%) Query: 43 GVSITSIVLNNRHINLD-FYIIADVYNDGFFQKIAKLAEQNQLRITLYRINTDKLQCLPC 101 V I S +L F+I+ D + F + N ++ + + Sbjct: 391 SVVINSTMLXASEPEKHVFHIVTDKLS---FAAMKMWFLVNSPAKVTIQV--ENIDDFKN 445 Query: 102 TQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPM 161 + S + R + ++ L+++L+LD D+V + D++ L L + G V A V+ + Sbjct: 446 PKYLSMLNHLRFYLPEVYP-KLEKILFLDDDIVVQKDLTPLWSLDMQGMVNAAVETCKES 504 Query: 162 QEKAVSRLS----------DPELLGQYFNSGVVYLDLKKWADAKLTE--KALSILMSKDN 209 + L+ DP G F G+ DLK+W +T + Sbjct: 505 FHRFDKYLNFSHPKISENFDPNACGWAF--GMNMFDLKEWRKRNMTGIYHYWQDMNEDRT 562 Query: 210 VYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTGAT 269 ++K + +T L R ++ + ++T ++ ++HY G Sbjct: 563 LWKLGSLPPGLITFYNLTYPLDRSWHVLGLGYDPQLNQTE-------IDNAAVVHYNGNY 615 Query: 270 KPWHKWAIYPSVKYYKIA 287 KPW + AI Y+ Sbjct: 616 KPWLELAIAKYKSYWSRY 633 >UniRef50_D1HWZ1 Whole genome shotgun sequence of line PN40024, scaffold_216.assembly12x (Fragment) n=3 Tax=Magnoliophyta RepID=D1HWZ1_VITVI Length = 503 Score = 111 bits (279), Expect = 3e-23, Method: Composition-based stats. 
Identities = 51/288 (17%), Positives = 103/288 (35%), Gaps = 30/288 (10%) Query: 19 LANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRH-INLDFYIIADVYNDGFFQKIAK 77 + + + A D + V + S V N + F+++ D N G Q + K Sbjct: 210 PPELEDPKLYHYAIFSDN--VIAASVVVNSAVKNAKEPWKHVFHVVTDKMNLGAMQVMFK 267 Query: 78 LAEQNQLRITLYRINTDKL---------QCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLY 128 + + N I + + K + L + S + R + ++ L R+L+ Sbjct: 268 MRDYNGSHIEVKAVEDYKFLNSSYVPVLRQLENPKYLSMLNHLRFYLPEMYP-KLHRILF 326 Query: 129 LDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPE-LLGQYFN------- 180 LD DVV + D++ L + ++G V V+ + ++ L+ + FN Sbjct: 327 LDDDVVVQRDLTGLWKIDMDGKVNGAVETCFGSFHRYAQYMNFSHPLIKEKFNPKACGWA 386 Query: 181 SGVVYLDLKKWADAKLTEK--ALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIY 238 G+ + DL W K TE+ L ++K + T L + ++ + Sbjct: 387 YGMNFFDLDAWRKEKCTEQYHYWQNLNENRTLWKLGTLPPGLITFYSTTKPLDKSWHVLG 446 Query: 239 TIKSELKDKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKI 286 + + + ++H+ G KPW A+ + Sbjct: 447 LGYN-------PSISMDEIHNAAVVHFNGNMKPWLDIAMNQFRPLWTK 487 >UniRef50_Q2L3C5 Glycosyl transferase-like protein n=3 Tax=Magnoliophyta RepID=Q2L3C5_BRASY Length = 689 Score = 110 bits (275), Expect = 8e-23, Method: Composition-based stats. 
Identities = 41/207 (19%), Positives = 73/207 (35%), Gaps = 22/207 (10%) Query: 93 TDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVA 152 D+ + S + R + ++ L+++L+LD D V + D+S L + L G V Sbjct: 478 PDENPKFRNPKYLSILNHLRFYLPEIFP-KLNKVLFLDDDTVVQQDLSALWSIDLKGKVN 536 Query: 153 AVVKDVEPMQEKAVSRLSD----------PELLGQYFNSGVVYLDLKKWADAKLTE--KA 200 V+ + L+ P+ G F G+ DL +W +T+ Sbjct: 537 GAVETCGETFHRFDKYLNFSNPIVANNFHPQACGWAF--GMNMFDLSEWRKQNITDVYHT 594 Query: 201 LSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITEST 260 L ++K V T L R ++ + + N + Sbjct: 595 WQKLNEDRLLWKLGTLPAGLVTFWNRTFPLDRSWHLLGLGYN-------PNVNERDIRRA 647 Query: 261 LLIHYTGATKPWHKWAIYPSVKYYKIA 287 +IHY G KPW + + KY+ Sbjct: 648 SVIHYNGNLKPWLEIGLSKYRKYWSRY 674 >UniRef50_Q04CN2 Lipopolysaccharide biosynthesis glycosyltransferase n=19 Tax=Lactobacillus RepID=Q04CN2_LACDB Length = 274 Score = 109 bits (274), Expect = 1e-22, Method: Composition-based stats. Identities = 59/285 (20%), Positives = 104/285 (36%), Gaps = 51/285 (17%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFF----------QKIAK 77 +N + D N GV +++ S++ R + YI+ I Sbjct: 1 MNFLFCGDHNAERGVLIAVLSLLKAERGEEVHVYILTMRTKSKSRSFKPFSQHAADFIRS 60 Query: 78 LAEQNQLRITLYRIN-TDKLQCLPCT----QVWSRAMYFRLFAFQLLGLTLDRLLYLDAD 132 L + +L I+ T+ P T ++ RLFA + L DR+LYLD D Sbjct: 61 LIVADNPNSSLELIDCTENFIKEPPTANMGTRFTPYAMLRLFADE-LPQIPDRILYLDDD 119 Query: 133 VVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWA 192 V+ + + Q L G V D + + + Y NSGV+ L++ + Sbjct: 120 VIIRRPVDQFYTQDLTGTELVGVLDYF-----GRFFFHNQKKIFDYLNSGVLLLNMPEIK 174 Query: 193 DAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNY 252 L ++ ++ K PDQ +N L K PR+YN Y ++ Sbjct: 175 RTGLFKRVRHLMQVK--KMFLPDQTAINKLAKEK-RIAPRKYNEQYALQ----------- 220 Query: 253 KKLITESTLLIHYTGAT-----------KPWHKWAIYPSVKYYKI 286 + T++ H+T + KPW ++ + ++ Sbjct: 221 -----DDTVIQHFTTSFRFFPYFRTQTVKPWDVKRVHSVLNLHEY 260 >UniRef50_C5FDY7 Glycogenin n=1 Tax=Microsporum canis CBS 113480 RepID=C5FDY7_NANOT Length = 731 Score = 109 
bits (273), Expect = 1e-22, Method: Composition-based stats. Identities = 60/294 (20%), Positives = 99/294 (33%), Gaps = 44/294 (14%) Query: 31 AYGV---DANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 Y NYL G V S+ N L + D G I +L I Sbjct: 8 VYCTILLSDNYLPGAMVLAHSLRDNGTKGRLAVLVTLDNLQPGI---IDELKTVYDDVIP 64 Query: 88 LYRINTDKLQCLPCTQVWS-RAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLG 146 + RI L + + ++ ++ DR++Y+DADV+ +LL L Sbjct: 65 IPRIENSYPGNLYLMDRPDLISTFSKIALWK--QTQYDRIVYIDADVIALRAPDELLTLD 122 Query: 147 LNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYL--DLKKWADAKLTEKALSIL 204 A V D+ P+ FN+GV+ L +LK + AL Sbjct: 123 F--KSIAAVPDI-----------GWPDC----FNTGVIVLRPNLKDY-------YALLAF 158 Query: 205 MSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIH 264 + + DQ ++N+ K L YN + + + + L+H Sbjct: 159 AQRGISFDGADQGLLNMHFKN-WDRLSFTYNCTPSGHYQYVPA-----YRYFESTISLVH 212 Query: 265 YTGATKPWHKW-AIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHLLVQ 317 + G+ KPW + P Y L W R ++ + + +H Q Sbjct: 213 FIGSLKPWRIGRSSSPQQSPYNQLLAK--WWAVYDRHYRTGPIYIPQPRHYQSQ 264 >UniRef50_B6JNQ8 Lipopolysaccharide 1,2-glucosyltransferase n=18 Tax=Helicobacter RepID=B6JNQ8_HELP2 Length = 369 Score = 109 bits (273), Expect = 1e-22, Method: Composition-based stats. 
Identities = 54/351 (15%), Positives = 117/351 (33%), Gaps = 70/351 (19%) Query: 25 SECLNVAYGVDANYLDGVGVSITSIVLNNR--------------------HINLDFYIIA 64 ++ + + D NY G GVS+ S++ + +I + + Sbjct: 8 NQIIPIFMSFDKNYALGAGVSLYSLLSHASRHTSAIDFSPLSQNNQLLGTNIVYKIHCLI 67 Query: 65 DVYNDGFFQKIAKLAE--QNQLRITLYRIN-----TDKLQCLPCTQVWSRAMYFRLFAFQ 117 K+ K + + + IN + C++ + + Sbjct: 68 KGVTLEQQNKLLKTLDPFKTFASLEFIDINSLDHSIESYLNESCSKRYGGLLVLCRLLLA 127 Query: 118 LLGLTLDRLLYLDADVVCKGDI-SQLLHLGLNG-AVAAVVKDVEP---------MQEKAV 166 L +++ +D D V GD+ S L + +V+D E+A Sbjct: 128 SLFPNYSKIISIDVDTVFLGDVASAYFALDNEPTKLLGMVRDTFSHLSFEAFCHFIERAC 187 Query: 167 SRL---------SDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQD 217 ++ + + Q FN G + L +W + AL L ++ YP+Q Sbjct: 188 KNFKIDFSRFSPNELKRIHQGFNMGFLVAHLDRWRQDGFEKIALEFLKTRGKDLFYPEQC 247 Query: 218 VMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTGATKPWHK--- 274 ++N++ L LP YN +K+ ++ +++H+ KPW Sbjct: 248 LVNMVFWERILELPIYYNCYSDF-----------FKEHYPKNIIMLHFI-KYKPWRSVSS 295 Query: 275 -----WAIYPSVKYYKIALENSPWKDDSPRDAKSII---EFKKRYKHLLVQ 317 ++ L +P+K+D ++ + + + H+ + Sbjct: 296 LNGRLICYEAEASFWLANLFCTPFKNDFLKERLEMAKDQQMQSFKTHIRSK 346 >UniRef50_Q6Z5D6 Glycosyltransferase family-like n=6 Tax=Poaceae RepID=Q6Z5D6_ORYSJ Length = 726 Score = 109 bits (272), Expect = 2e-22, Method: Composition-based stats. 
Identities = 52/310 (16%), Positives = 98/310 (31%), Gaps = 53/310 (17%) Query: 21 NINTSECLNVAYGVDANYLDGVGVSITSIVLNNRH-INLDFYIIADVYN----------- 68 + + + A D + G V + S +++ + N F+I+ D N Sbjct: 412 KLEDPKLQHYALFSDN--VLGAAVVVNSTIIHAKTPENHVFHIVTDKLNYAAMRMWFLEN 469 Query: 69 --------DGFFQKIAKLAEQN---------QLRITLYRI----NTDKLQCLPCTQVWSR 107 + L Q I Y D + S Sbjct: 470 SQGKAAIEVQNIEDFTWLNSSYSPVLKQLESQFMINYYFKTQQDKRDNNPKFQNPKYLSI 529 Query: 108 AMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVS 167 + R + ++ L+++L+LD D+V + D+S L + L G V ++ + Sbjct: 530 LNHLRFYLPEIFP-KLNKVLFLDDDIVVQQDLSALWSIDLKGKVNGAIQTCGETFHRFDR 588 Query: 168 RLS--------DPELLGQYFNSGVVYLDLKKWADAKLTE--KALSILMSKDNVYKYPDQD 217 L+ + E + G+ DL +W +T+ ++K Sbjct: 589 YLNFSNPLIAKNFERRACGWAYGMNMFDLSEWRKRNITDVYHYWQEQNEHRLLWKLGTLP 648 Query: 218 VMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTGATKPWHKWAI 277 V T L +++ + N + E +IHY G KPW + A+ Sbjct: 649 AGLVTFWNQTFPLDHKWHLLGLGY-------KPNVNQKDIEGAAVIHYNGNRKPWLEIAM 701 Query: 278 YPSVKYYKIA 287 KY+ Sbjct: 702 AKYRKYWSKY 711 >UniRef50_Q9FX71 T6J4.1 protein n=2 Tax=rosids RepID=Q9FX71_ARATH Length = 363 Score = 108 bits (271), Expect = 2e-22, Method: Composition-based stats. 
Identities = 42/240 (17%), Positives = 95/240 (39%), Gaps = 17/240 (7%) Query: 23 NTSECLNVAYGVDANYLDGVGVSITSIVLNNR-HINLDFYIIADVYNDGFFQKIAKLAEQ 81 + +++A +DA YL G + S++ + N+ F+ IA ++I Sbjct: 55 HNPSIIHIAMTLDAIYLRGSVAGVFSVLQHASCPENIVFHFIATHRRSADLRRIISSTFP 114 Query: 82 NQLRITLYRINTDKLQCLPCTQVW----SRAMYFRLFAFQLLGLTLDRLLYLDADVVCKG 137 L +Y + + ++ + + Y R++ LL + + R++Y D+D+V Sbjct: 115 Y-LTYHIYHFDPNLVRSKISSSIRRALDQPLNYARIYLADLLPIAVRRVIYFDSDLVVVD 173 Query: 138 DISQLLHLGLNGAVAAVVKDVEPMQEKAVS---------RLSDPELLGQYFNSGVVYLDL 188 D+++L + L V + + + + + YFN+GV+ +DL Sbjct: 174 DVAKLWRIDLRRHVVGAPEYCHANFTNYFTSRFWSSQGYKSALKDRKPCYFNTGVMVIDL 233 Query: 189 KKWADAKLTEKALSI--LMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKD 246 KW + ++T K + + + +Y+ ++ G + +N L+ Sbjct: 234 GKWRERRVTVKLETWMRIQKRHRIYELGSLPPFLLVFAGDVEPVEHRWNQHGLGGDNLEG 293 >UniRef50_B2KBT4 Glycosyl transferase family 8 n=1 Tax=Elusimicrobium minutum Pei191 RepID=B2KBT4_ELUMP Length = 320 Score = 108 bits (270), Expect = 3e-22, Method: Composition-based stats. 
Identities = 57/328 (17%), Positives = 116/328 (35%), Gaps = 34/328 (10%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRH--INLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 + + + +L + V++ S+ N+ N D + +N+ + K+ + Sbjct: 7 IFFACNQRFLFTLAVALLSLKKNSPKALENSDVLVFYQGFNEQDKALLNKILP---CKFF 63 Query: 88 LYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGL 147 Y+ + + +S+ + R F +L T ++LY+D DV+ G+++ + Sbjct: 64 EYKFAVETNFDHINFKHFSQLTFARYEIFDMLD-TYKKVLYIDVDVMIGGELNYIFENYG 122 Query: 148 NGAVAAVVKDVEPMQEKAVSRLSDPELLGQY----FNSGVVYL--DLKKWADAKLTEKAL 201 + A+ +D + +P +N+GV ++K L Sbjct: 123 DKTGVAMCEDTQKGLTLITKNFVNPMPQYDMTLPCYNAGVTLFCDNIKD--RQHLKMWCY 180 Query: 202 SILMSKDNVYKYPDQDVMNVLLKGM---TLFLPREYNTIYTIKSELKDKTHQNYKKLITE 258 + PDQ V+NV+ + LP N + ++ Y + Sbjct: 181 ERTAEWLDNLVCPDQGVVNVMFQEFGITVEVLPDICNCL---------PSNPKYLDKRRK 231 Query: 259 STLLIHYT-GATKPWHKWAIYPSVKYYKIALENS----PWKD-DSPRDAKSIIEFKKRY- 311 L+ H G + W P K+YK LE P K+ + K + R+ Sbjct: 232 DILIYHCAGGGVRFWTYTWNAPWQKFYKEYLELGGAPHPDKEHAWLKFIKKYNLQRFRFF 291 Query: 312 -KHLLVQHHYISGIIAGVCYLCRKYYRK 338 + Q H + + Y + +RK Sbjct: 292 DRSPDPQMHPARFLKYLLIYPFKYAFRK 319 >UniRef50_B4WFJ6 Glycosyl transferase family 8 n=1 Tax=Synechococcus sp. PCC 7335 RepID=B4WFJ6_9SYNE Length = 298 Score = 108 bits (269), Expect = 4e-22, Method: Composition-based stats. 
Identities = 57/283 (20%), Positives = 102/283 (36%), Gaps = 30/283 (10%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHIN-LDFYIIADVYNDGFFQK-----IAKLAEQ-- 81 + + ++ + V++ SIV N + + + F ++ + FF+K + LA Q Sbjct: 10 IVFSLNRKIWLSLIVAMNSIVSNASNPDTIRFNVLVPPGEEQFFEKKIREALPSLAAQWR 69 Query: 82 ---NQLRITLYRINTDKLQCLPCTQVWSRAM-YFRLFAFQLLGLTLDRLLYLDADVVCKG 137 + + + + SR + Y R F L+R++YLD D++ G Sbjct: 70 VKSYLPPAFMQEYLDKRFKEKTEDRRNSRYIQYSRFFFRDAF-EDLERVIYLDTDLIVLG 128 Query: 138 DISQLL----HLGLNGAVAAVVKDVEP----MQEKAVSRLSDPELLGQYFNSGVVYLDLK 189 DI++L L + P R P+ FN+GV + +L Sbjct: 129 DIAELYAYTKALD-EHCYFGSIPHFYPCIFYFSNFMKMREEIPKFKQT-FNAGVWFTNLS 186 Query: 190 KWADAKLTEKALSIL----MSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELK 245 W + K E+ L S +Y D+ V N++ K L + +N Sbjct: 187 FWNE-KTYERLNYYLSLDAKSNYKLYTLGDEPVFNLMFKD-YLQADKNWNRCGYGTHPAV 244 Query: 246 DKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIAL 288 + LIH++G KPW I ++ L Sbjct: 245 TNLFLASGEKFLSEAKLIHWSGPFKPWSSPKI-RFADLWRTYL 286 >UniRef50_UPI000180D0CC PREDICTED: similar to like-glycosyltransferase n=1 Tax=Ciona intestinalis RepID=UPI000180D0CC Length = 671 Score = 107 bits (267), Expect = 6e-22, Method: Composition-based stats. 
Identities = 51/290 (17%), Positives = 108/290 (37%), Gaps = 27/290 (9%) Query: 1 MDSFPAIEIDKVKAWDFRLANINTS------ECLNVAYGVDANY--LDGVGVSITSIVLN 52 M F E ++ ++ L E ++V Y + + V + SI+ + Sbjct: 29 MKRFLQEEKTTLRKYESDLQIQTGRQKPSQCETIHVFLVC-TGYKTVKDMVVLVKSILFH 87 Query: 53 NRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLYR--INTDKLQCLPCTQVWSRAMY 110 + +L F+ + D + +++ + + L++++Y D + LP T Sbjct: 88 RKD-SLHFHYLVDNVSKPILKELFRSWDIPDLKVSMYEDIAVLDDVSWLPFTHHSGINSV 146 Query: 111 FRLFAFQLLGLTLDRLLYLDADVVCKGDISQLL----HLGLNGAVAAVVKDVEPMQEKAV 166 ++L ++L L +D+++ LD+D+V DI++L H+ A V + Sbjct: 147 YKLAILKVLPLYIDKVIVLDSDMVFATDIAELWLQFRHMDQQQAFGMVENQSDWYLGTLK 206 Query: 167 SRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILM---SKDNVYKYPDQDVMNVLL 223 +G+ NSG++ LD +K A + + DQDV+N+ L Sbjct: 207 FEYVVWPAIGRGLNSGMMLLDCEKLRRANWDSEWKETAQQGIKRFKTVPLADQDVINLFL 266 Query: 224 KGMTL---FLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTGATK 270 L +N ++ + + ++H+ K Sbjct: 267 VNNKHMLYKLECNWNFQLP-----YERKMELCYNDHKHNVKIVHWNNVRK 311 >UniRef50_B5RUI6 DEHA2F17138p n=2 Tax=Debaryomyces hansenii RepID=B5RUI6_DEBHA Length = 403 Score = 106 bits (265), Expect = 1e-21, Method: Composition-based stats. 
Identities = 44/275 (16%), Positives = 92/275 (33%), Gaps = 48/275 (17%) Query: 30 VAYGVDANYLDGVGVSITSIVLNN--RHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 + V+ YL G I+ N+ L + ++ + I + + Sbjct: 6 ITLLVNEVYLPGALTVAK-ILKNDYKTSHPLVILLDTSQISEKSTKLIEDVYD------E 58 Query: 88 LYRINTDKLQCLPCTQVWSR-------AMYFRLFAFQLLGLTLDRLLYLDADVVCKGDIS 140 + I+ + P ++ S+ + ++ ++ + +L+YLD D++ I Sbjct: 59 IIPIDGGLITS-PIDKLVSQLNRLELAVTFTKILLWKQI--QYTKLVYLDCDILPMQGID 115 Query: 141 QLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKA 200 L + ++ A D FNSGV+ L + K Sbjct: 116 DLFEIEISSNQVAASPDSG---------------WPDIFNSGVMVLK----PSMIVYNKL 156 Query: 201 LSILMSKDNVYKYPDQDVMNVLLKG-----MTLFLPREYNTIYTIKSELKDKTHQNYKKL 255 + ++DN + DQ + N + LP YN ++ + + +K Sbjct: 157 SEFVETEDNTFDGADQGLFNEFFNIASKGLNWVRLPFLYNVTFSQSYQYLPAFDRFFK-- 214 Query: 256 ITESTLLIHYTGATKPWHKWAIYPSVKYYKIALEN 290 ++H+ G+ KPW +Y+ A Sbjct: 215 ---DIRILHFIGSQKPWMFGGYDKFKEYWWSAFNK 246 >UniRef50_Q04CN3 Lipopolysaccharide biosynthesis glycosyltransferase n=1 Tax=Lactobacillus delbrueckii subsp. bulgaricus ATCC BAA-365 RepID=Q04CN3_LACDB Length = 200 Score = 106 bits (265), Expect = 1e-21, Method: Composition-based stats. Identities = 36/163 (22%), Positives = 57/163 (34%), Gaps = 15/163 (9%) Query: 136 KGDISQLLHLGLNGAVAAVVKD---VEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWA 192 DI+ L L + A D + R +Y NSGV+ ++ Sbjct: 2 NADIAGLYQTELGNNLVAACHDQSVHYIEPLQTYIRDCLGIDPDKYVNSGVLVMNCLAMR 61 Query: 193 DAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNY 252 D +K L +L + PDQD +N + G L ++ + N Sbjct: 62 DEDFVDKFLHLLSTYQFNSIAPDQDYLNEICSGRIKLLDPRWDAM------------PND 109 Query: 253 KKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKD 295 LIHY KPWH + ++++A E +KD Sbjct: 110 FDPEMTGPYLIHYNLFYKPWHFEEVKYGSYFWQVAKETPFYKD 152 >UniRef50_A7S9E5 Predicted protein n=1 Tax=Nematostella vectensis RepID=A7S9E5_NEMVE Length = 389 Score = 105 bits (263), Expect = 2e-21, Method: Composition-based stats. 
Identities = 49/252 (19%), Positives = 88/252 (34%), Gaps = 28/252 (11%) Query: 28 LNVAYGVDANYLDGVGVSITS-IVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRI 86 +++ D + S I+ NR L F+I+A+ + + + + Sbjct: 85 ISIVICGSRK--DEALTMLKSSILFTNR--TLVFHILAESGLHDGLKSVLDHWPCVEGKQ 140 Query: 87 TLYRINTDKLQCLPCTQVWSRA----MYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQL 142 Y+I+ K W + RLF +L +D L+Y+D D + + L Sbjct: 141 VSYKIHPLKFPEGQKPDEWKKLFKPCAAQRLFLPDIL-TEVDSLIYMDIDTLFLSPVQWL 199 Query: 143 LH--LGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYF-----NSGVVYLDLKKWADAK 195 N + A + P E + + + Y+ NSGV+ ++L + Sbjct: 200 WDQFSNFNSSQLAS---MTPEGEVSATGWYNRFARHPYYGKLGLNSGVMLMNLTRMRSFG 256 Query: 196 LTEKALSILMSKDNVYKYPDQDVMNVLLK---GMTLFLPREYNTIYTIKSELKDKTHQNY 252 EK L I + DQD++N+L + L E+N N Sbjct: 257 WQEKILPIYNKYRFDITWGDQDILNILFHYHPELVYVLSCEWNYRND-----HCIYGNNC 311 Query: 253 KKLITESTLLIH 264 K T ++H Sbjct: 312 KTADTNGIYILH 323 >UniRef50_D1HMA0 Whole genome shotgun sequence of line PN40024, scaffold_108.assembly12x (Fragment) n=9 Tax=rosids RepID=D1HMA0_VITVI Length = 511 Score = 104 bits (260), Expect = 4e-21, Method: Composition-based stats. 
Identities = 41/213 (19%), Positives = 75/213 (35%), Gaps = 35/213 (16%) Query: 101 CTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEP 160 + S + R++ +L L+++++LD DVV + D+S L + L G V V+ Sbjct: 300 SPKYISLLNHLRIYIPELFP-NLNKVVFLDDDVVIQRDLSPLWEIDLEGKVNGAVETCRG 358 Query: 161 MQEKAVSRLSDPELLGQYFN------------------SGVVYLDLKKWADAKLTEKALS 202 E +S+ YFN G+ DL W + E S Sbjct: 359 EDEWVMSK-----RFRNYFNFSHPLIAKNLNPDECAWAYGMNIFDLSAWRKTNIRETYHS 413 Query: 203 ILMSKDN----VYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITE 258 L ++K + KG + ++ + L + N + + Sbjct: 414 WLKENLKSNLTMWKLGTLPPALIAFKGHIHPIDPSWHML-----GLGYQNKTNIDSV--K 466 Query: 259 STLLIHYTGATKPWHKWAIYPSVKYYKIALENS 291 +IHY G +KPW + ++ + S Sbjct: 467 KAAVIHYNGQSKPWLQIGFEHLRPFWTKYVNYS 499 >UniRef50_A2DBB6 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2DBB6_TRIVA Length = 1378 Score = 104 bits (260), Expect = 4e-21, Method: Composition-based stats. Identities = 43/219 (19%), Positives = 85/219 (38%), Gaps = 19/219 (8%) Query: 16 DFRLANINTSECLNVAYGVDANYLDGVGV--SITSIVLNNRHINLDFYIIADVYNDGFFQ 73 + +L+ N +E +N + V + YL V + S + N ++ + F+ + + + F Sbjct: 1047 NLKLSMSNDTETVN-VFAVVSGYLYEHLVKIMMISAIKNTKN-PIHFWFLKNFISSQFMN 1104 Query: 74 KIAKLAEQNQLRITLYRINTDKL---QCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLD 130 + K A++ + + N Q +W + LF L + + R++Y+D Sbjct: 1105 DLPKFAKKYNFKYSFVEYNWPSFVVHQSERQRIIWGNKI---LFFDALFPMNISRMIYID 1161 Query: 131 ADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQE------KAVSRLSDPELLGQYFNSGVV 184 AD V +GD+S+L+ + L G V +E + +Y S + Sbjct: 1162 ADAVVRGDLSELMKIDLKGCPYGFVPMGMSRKEMKKYHFWTTGYWKNHLRGKKYHISAMF 1221 Query: 185 YLDLKKWADAKLTEKALSILMS---KDNVYKYPDQDVMN 220 +DL ++ +K DQD+ N Sbjct: 1222 VVDLDRFRRMGGGDKLRKHYSQIVGNTKSLANLDQDLPN 1260 >UniRef50_A2RAV0 Catalytic activity: UDP-glucose + glycogenin <=> UDP + glucosylglycogenin n=2 Tax=Aspergillus RepID=A2RAV0_ASPNC Length = 767 Score = 104 bits (260), Expect = 5e-21, Method: Composition-based stats. 
Identities = 51/247 (20%), Positives = 86/247 (34%), Gaps = 38/247 (15%) Query: 31 AYGV---DANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 Y +YL G V S+ N L D Q++ + ++ Sbjct: 7 VYCTLLLSDHYLPGATVLAHSLRDNGSKAKLVALFTPDSLQPATIQELQAVYDELIPVHP 66 Query: 88 LYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGL 147 L I L + + A + ++ ++ R++Y+D DVV +LL L + Sbjct: 67 LTNITPANLWLMDRPDLI--ATFTKIELWR--QTQYKRIVYIDCDVVALRAPDELLDLEV 122 Query: 148 NGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKA-LSILMS 206 + A V DV P+ FNSGV+ L L + L L Sbjct: 123 D---FAAVPDV-----------GWPDC----FNSGVMVL------RPNLQDYLALRALAE 158 Query: 207 KDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYT 266 + + DQ ++N+ + L YN + + K + +IH+ Sbjct: 159 RGISFDGADQGLLNMHFRD-WHRLSFSYNCTPSANYQYIPA-----YKHFQSTISMIHFI 212 Query: 267 GATKPWH 273 GA KPW+ Sbjct: 213 GAQKPWN 219 >UniRef50_Q2R1U9 Glycosyl transferase family 8 protein, expressed n=3 Tax=Poaceae RepID=Q2R1U9_ORYSJ Length = 548 Score = 104 bits (259), Expect = 5e-21, Method: Composition-based stats. 
Identities = 53/274 (19%), Positives = 106/274 (38%), Gaps = 38/274 (13%) Query: 43 GVSITSIVLNNRHIN-LDFYIIADVYNDGFFQKIAKLAEQNQLRITLYRINTDKLQCLPC 101 V + S + ++ + F+I+ D N F + T+ + D L+ LP Sbjct: 273 AVVVNSTISASKDPKRIMFHIVTDALN--FPAMMMWFLTNPPNPATIQIKSLDNLKWLPA 330 Query: 102 TQVW-------------SRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLN 148 + S + R + ++ +L++L+ LD D+V + D+S L + LN Sbjct: 331 DFSFRFKQKGIRDPRYTSALNHLRFYLPEVFP-SLNKLVLLDHDIVVQRDLSGLWQIDLN 389 Query: 149 GAVAAVVKDVEP----MQEKAVSRLSDPELLGQYFNS-------GVVYLDLKKWADAKLT 197 G V V+ + + + SDP ++ + F++ G+ DLK+W LT Sbjct: 390 GKVNGAVETCTSGDGYHRLENLVNFSDPSIINK-FDAKACIHAFGMNIFDLKEWRRQGLT 448 Query: 198 EKALSILM--SKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKL 255 + ++K + ++ T+ L ++ + + + Sbjct: 449 TAYNKWFQAGKRRRLWKAGSLPLGQIVFYNQTVPLDHRWHVLGLGHDR-------SIGRD 501 Query: 256 ITESTLLIHYTGATKPWHKWAIYPSVKYYKIALE 289 E +IHY+G KPW + +I Y+ L+ Sbjct: 502 AIERAAVIHYSGKLKPWLEISIPKYRDYWNNFLD 535 >UniRef50_O95461 Glycosyltransferase-like protein LARGE1 n=84 Tax=Metazoa RepID=LARGE_HUMAN Length = 756 Score = 104 bits (259), Expect = 6e-21, Method: Composition-based stats. 
Identities = 52/267 (19%), Positives = 105/267 (39%), Gaps = 22/267 (8%) Query: 18 RLANINTSECLNVAY-GVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIA 76 + + E ++VA N V + S++ + R L F++IAD + + Sbjct: 128 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFH-RRNPLHFHLIADSIAEQILATLF 186 Query: 77 KLAEQNQLRITLYRINT--DKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVV 134 + +R+ Y + ++ +P +L + L L+R++ LD D+ Sbjct: 187 QTWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDIT 246 Query: 135 CKGDISQLLHL--GLNG-AVAAVVKDVEPMQEKAVSRLSDP-ELLGQYFNSGVVYLDLKK 190 DI++L + G V +V++ + + P LG+ +N+GV+ L L K Sbjct: 247 FATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDK 306 Query: 191 WADAKLTEKALSILMSKDNV----YKYPDQDVMNVLLKGM---TLFLPREYNTIYTIKSE 243 K E+ + ++ + DQD+ N ++K LP +N + Sbjct: 307 LRKMKW-EQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLS---- 361 Query: 244 LKDKTHQNYKKLITESTLLIHYTGATK 270 ++ Q Y+ + +IH+ K Sbjct: 362 DHTRSEQCYRDV--SDLKVIHWNSPKK 386 >UniRef50_B7PBG6 Glycosyltransferase domain-containing protein, putative (Fragment) n=1 Tax=Ixodes scapularis RepID=B7PBG6_IXOSC Length = 304 Score = 103 bits (258), Expect = 7e-21, Method: Composition-based stats. 
Identities = 59/317 (18%), Positives = 112/317 (35%), Gaps = 49/317 (15%) Query: 25 SECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYN----DGFFQKIAKLAE 80 E ++VA L G + S+ +N + F+++ D + +L+ Sbjct: 4 REHVHVAVVTSNAKLGGAVALMASV-AHNTARPVSFHLVTDNATQYHVHAWMHD-PRLSG 61 Query: 81 QNQLRITLYR---INTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKG 137 + +T + ++ D + L ++ + +L+ +LL L+ LD DV+ +G Sbjct: 62 LSYEVVTFPQTALVSPDLVGLLQVSR--GPLPFAKLYLARLLPSVAGTLVVLDDDVIVQG 119 Query: 138 DISQLLHLGL-NGAVAAVVKDVEPMQEKAVSRLSDPE----------------LLGQYFN 180 D+++L L L GAV +D + + + S E N Sbjct: 120 DVAELAALPLPKGAVGLFSRDCDTFSRRYNTAGSRYEQYVEARRPSLQALGISATDCVLN 179 Query: 181 SGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQ----DVMNVLLKGMTLFLPREYNT 236 GV +DL +W+ +TE A + + + Q + + L T L +++ Sbjct: 180 LGVFVVDLAEWSRLNVTESAEAWMRLNIKEKLFK-QEGPVPALLLALHNKTATLDPQWHV 238 Query: 237 IYTIKSELKDKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDD 296 L Y +L S L+H++G KPW + P + Sbjct: 239 -----RNLGVTAGTQYSRLFVSSAKLLHWSGRFKPW--SSRSPYADIWHRYF-------- 283 Query: 297 SPRDAKSIIEFKKRYKH 313 D + KH Sbjct: 284 -VPDPTGRFRPASKSKH 299 >UniRef50_Q9LE59 Like glycosyl transferase 1 n=35 Tax=Embryophyta RepID=Q9LE59_ARATH Length = 673 Score = 103 bits (258), Expect = 7e-21, Method: Composition-based stats. 
Identities = 53/316 (16%), Positives = 109/316 (34%), Gaps = 54/316 (17%) Query: 18 RLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLD-FYIIADVYNDGFFQKIA 76 R N+ + A D + V + S ++N + + F+++ D N G Sbjct: 355 RSENLENPNLYHYALFSDN--VLAASVVVNSTIMNAKDPSKHVFHLVTDKLNFGAMNMWF 412 Query: 77 KLAEQNQLRITLYRINTDK----------------------LQCLPCTQVWSRAMY---- 110 L + I + ++ K + T S Y Sbjct: 413 LLNPPGKATIHVENVDEFKWLNSSYCPVLRQLESAAMREYYFKADHPTSGSSNLKYRNPK 472 Query: 111 -------FRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQE 163 R + ++ L+++L+LD D++ + D++ L + LNG V V+ Sbjct: 473 YLSMLNHLRFYLPEVYP-KLNKILFLDDDIIVQKDLTPLWEVNLNGKVNGAVETCGESFH 531 Query: 164 KAVSRLSDPEL-LGQYFN-------SGVVYLDLKKWADAKLTE--KALSILMSKDNVYKY 213 + L+ + + FN G+ DLK+W +T + ++K Sbjct: 532 RFDKYLNFSNPHIARNFNPNACGWAYGMNMFDLKEWKKRDITGIYHKWQNMNENRTLWKL 591 Query: 214 PDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTGATKPWH 273 + G+T L + ++ + + + K E+ ++HY G KPW Sbjct: 592 GTLPPGLITFYGLTHPLNKAWHVLGLGYN-------PSIDKKDIENAAVVHYNGNMKPWL 644 Query: 274 KWAIYPSVKYYKIALE 289 + A+ Y+ ++ Sbjct: 645 ELAMSKYRPYWTKYIK 660 >UniRef50_B9FGA7 Putative uncharacterized protein n=3 Tax=Poaceae RepID=B9FGA7_ORYSJ Length = 316 Score = 103 bits (258), Expect = 8e-21, Method: Composition-based stats. 
Identities = 46/295 (15%), Positives = 102/295 (34%), Gaps = 44/295 (14%) Query: 23 NTSECLNVAYGVDANYLDGVGVSITSIVLNNR-HINLDFYIIADVYNDGF---------- 71 + +++A +D YL G + S++ + ++ F+ +A + Sbjct: 38 AGAPTIHIAMTLDTTYLRGSLAGVLSVLRHAACPESIAFHFVASSASPARRLAALRRALA 97 Query: 72 --FQKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYL 129 F + + R+ +I+T + L Y R++ LL ++ R+LYL Sbjct: 98 AAFPTLPATVHRFDARLVRGKISTSVRRALDQ-----PLNYARIYLADLLPRSVSRVLYL 152 Query: 130 DADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLK 189 D+D++ D++ LL + P + +++S+ + F Sbjct: 153 DSDLLVVDDVAGLLATDFG-------PEGGPWRPQSISKANFNSYFTDAF------WSHP 199 Query: 190 KWADAKLT---EKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKD 246 +W T E + + + +Y+ ++ G + +N D Sbjct: 200 EWRAGGYTVKLEYWMEVQKQEARIYELGSLPPFLLVFAGEVKAVEHRWNQ----HGLGGD 255 Query: 247 KTHQNYKKLITESTLLIHYTGATKPWHKWA------IYPSVKYYKIALENSPWKD 295 ++L L+H++G KPW + + Y + W+D Sbjct: 256 NVAGQCRELHPGPVSLLHWSGKGKPWLRLDAGRPCPLDALWAPYDLLRRRGAWED 310 >UniRef50_Q0U987 Putative uncharacterized protein n=1 Tax=Phaeosphaeria nodorum RepID=Q0U987_PHANO Length = 583 Score = 103 bits (257), Expect = 9e-21, Method: Composition-based stats. 
Identities = 51/255 (20%), Positives = 88/255 (34%), Gaps = 54/255 (21%) Query: 31 AYGV---DANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 Y +YL G V S+ L I + + I +L E I Sbjct: 8 VYCTLLMSDSYLPGAAVLAHSLRDAGTKKKLAVLITLETLSADT---ITQLKELYDYLIP 64 Query: 88 LYRINTDKLQCLPCTQV------WSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQ 141 + RI T L +++ +R +++YLDADVV + + Sbjct: 65 VERIRTPSPANLYLMGRPDLSFAFTKIALWR-------QTQFRKIVYLDADVVALRALDE 117 Query: 142 LLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYL--DL-KKWADAKLTE 198 L + A A D+ FNSGV+ + D+ + W Sbjct: 118 LFDIE---APFAAAPDIGWPDA---------------FNSGVMVISPDMGEYW------- 152 Query: 199 KALSILMSKDNVYKYPDQDVMNVLLKGM-TLFLPREYNTIYTIKSELKDKTHQNYKKLIT 257 AL + + + + DQ ++N + L YN + + + ++ YK+ I+ Sbjct: 153 -ALQTMAATGDSFDGADQGLLNQYFEHRPWQRLKFTYNCTPNAEYQW-EPAYRYYKRDIS 210 Query: 258 ESTLLIHYTGATKPW 272 +H+ G KPW Sbjct: 211 A----VHFIGKEKPW 221 >UniRef50_Q9FH36 Similarity to unknown protein n=28 Tax=Embryophyta RepID=Q9FH36_ARATH Length = 535 Score = 103 bits (257), Expect = 9e-21, Method: Composition-based stats. Identities = 38/197 (19%), Positives = 72/197 (36%), Gaps = 25/197 (12%) Query: 112 RLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVV-----KDVEPMQEKAV 166 R+ +L +L+++++LD D+V + D+S L + +NG V V +D M +K Sbjct: 335 RIHLPELFP-SLNKVVFLDDDIVIQTDLSPLWDIDMNGKVNGAVETCRGEDKFVMSKKFK 393 Query: 167 SRLSDPE-LLGQYFN-------SGVVYLDLKKWADAKLTEKALSILMSK-DNVYKYPDQD 217 S L+ + + FN G+ DL W ++ L + Sbjct: 394 SYLNFSNPTIAKNFNPEECAWAYGMNVFDLAAWRRTNISSTYYHWLDENLKSDLSLWQLG 453 Query: 218 VM---NVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTGATKPWHK 274 + + G + ++ + E ES ++H+ G KPW Sbjct: 454 TLPPGLIAFHGHVQTIDPFWHMLGLGYQETTSYADA-------ESAAVVHFNGRAKPWLD 506 Query: 275 WAIYPSVKYYKIALENS 291 A + L++S Sbjct: 507 IAFPHLRPLWAKYLDSS 523 >UniRef50_P91854 Protein F26H9.8, partially confirmed by transcript evidence n=2 Tax=Caenorhabditis RepID=P91854_CAEEL Length = 1381 Score = 103 bits (256), Expect = 1e-20, Method: Composition-based stats. 
Identities = 49/223 (21%), Positives = 94/223 (42%), Gaps = 11/223 (4%) Query: 23 NTSECLNVA-YGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQ 81 SE +NV Y + + +TS++ N + + F+++ + + F + I KLAE Sbjct: 1090 EPSEVINVFSLASGHLYERFMRIMMTSVLNNTKTQKVKFWLLKNYLSPKFKETIPKLAEF 1149 Query: 82 NQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQ 141 + L K + Y LF L L +D+++++DAD V + D+ + Sbjct: 1150 YKFEFELVEYKWPKWLHKQTEKQRVMWGYKILFLDVLFPLNVDKIIFVDADQVVRADLQE 1209 Query: 142 LLHLGLNGAVAAVVK------DVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAK 195 L+ LNGA V +++ + + + +Y S + +DLK + + Sbjct: 1210 LMDFNLNGAPYGYVPFCESRTEMDGFRFWKSGYWKNHLMGRKYHISALYVVDLKAFREFS 1269 Query: 196 LTEKA---LSILMSKDNVYKYPDQDVMNVLLKGM-TLFLPREY 234 ++ L + N DQD+ N +L + LP+E+ Sbjct: 1270 AGDRLRGRYDSLSADPNSLSNLDQDLPNNMLHEVPIKSLPQEW 1312 >UniRef50_B4QUA9 GD18236 n=2 Tax=Sophophora RepID=B4QUA9_DROSI Length = 511 Score = 102 bits (255), Expect = 2e-20, Method: Composition-based stats. Identities = 49/231 (21%), Positives = 82/231 (35%), Gaps = 12/231 (5%) Query: 17 FRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINL-DFYIIA-DVYNDGFFQK 74 F L L + + V I S +L N F I D D F +K Sbjct: 186 FCLKRQTGKPPLYIVVVCCGQRVQETLVMIKSAILFNYDEEYLKFVIFTEDGKGDEFREK 245 Query: 75 IAKLAEQNQLRITLYRINTDKLQCLPCTQV--WSRAMYFRLFAFQLLGLTLDRLLYLDAD 132 + + + + RLF LL +D LLY+D D Sbjct: 246 LTDWRDIKPFTFDFEILPLKFPSGNEVEWRNLFKPCAAQRLFLPSLL-THVDSLLYVDTD 304 Query: 133 VVCKGDISQLLHL--GLNG-AVAAVVKDVEPMQEKAVSRLSDPELLGQ-YFNSGVVYLDL 188 ++ IS + N ++A+ + E +R + G+ NSGV+ ++L Sbjct: 305 ILFLSPISDIWRFFKKFNETQMSALTPEHENENIGWYNRFARHPFYGRLGVNSGVMLMNL 364 Query: 189 KKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLK---GMTLFLPREYNT 236 + + K ++ +SI + DQD++N+L +P EYN Sbjct: 365 TRMREMKWEQQIVSIHKEYKLRIIWGDQDIINILFYYHPDKLYIMPCEYNY 415 >UniRef50_A9RI23 Predicted protein n=1 Tax=Physcomitrella patens subsp. patens RepID=A9RI23_PHYPA Length = 565 Score = 102 bits (255), Expect = 2e-20, Method: Composition-based stats. 
Identities = 65/367 (17%), Positives = 117/367 (31%), Gaps = 90/367 (24%) Query: 18 RLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGF-FQKIA 76 +T E ++V D L + V I S + N H FY + +N +++ Sbjct: 197 ETVEASTLEDIHVFVCTDEADLRPLAVLINSSMANCPHPERLFYHLVMPHNQRNAAKRLK 256 Query: 77 KLAEQNQLRITLYRINTDKLQCLPCTQVWSRA------MYFRLFAFQLLGLTLDRLLYLD 130 L + ++ + I+ +++ + + A Y F L T+ +L L Sbjct: 257 HLLPKARIEMAEKYIDIREVEEHITFRNDTGARKELVSPY--NFLPFYLPKTIFKL--LR 312 Query: 131 ADVVC----------------------KGDISQLLHLGLNGAVAAVVKDV--------EP 160 A V+C +G++ L + L G A ++D + Sbjct: 313 ATVICSFCLAIGQRFIQLISSTPLIVLQGNLEVLNDVDLEGHSVAAIEDCSQRFQVYFDF 372 Query: 161 MQEKAVSRLSDPELLG-----------QYFNSGVVYLDLKKWADAKLTEKALSILMS--- 206 Q + + P+ FN GV+ +D K+W D +T+ + + Sbjct: 373 AQLDEIQKRQGPDRPSWLPDEPFNKSACVFNRGVLVIDTKEWIDQNITKAIVWWMDEFRK 432 Query: 207 --KDNVYKYP-DQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKK--------- 254 K +YK Q + L G L +N + L D YKK Sbjct: 433 ADKKALYKAGMSQPPFLLALYGKHKVLDETWNVRGLGRPNLSDMERIYYKKGWNYTFERI 492 Query: 255 ----LITESTLLIHYTGATKPWH------------KWAIYPSV-----KYYKIALENSPW 293 + ++H+ G KPW P+ + L SP Sbjct: 493 PFMSPFADEANILHFNGKYKPWKGKRHRGENDEIISICGDPAKGQECAGLWWEYL--SPE 550 Query: 294 KDDSPRD 300 +D + Sbjct: 551 SNDFLKK 557 >UniRef50_UPI0001925360 PREDICTED: similar to glycosyltransferase-like 1B n=1 Tax=Hydra magnipapillata RepID=UPI0001925360 Length = 730 Score = 102 bits (255), Expect = 2e-20, Method: Composition-based stats. 
Identities = 63/306 (20%), Positives = 114/306 (37%), Gaps = 25/306 (8%) Query: 10 DKVKAWDFRLANINTSECLNVAY-GVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYN 68 K D + + + + +++A V I SI+ RH L F+ I+D+ Sbjct: 104 AKFLKKDCKTSQLPDCQVIHIAIICAGYKETQRVVTLIKSILFYRRH-PLHFHFISDISG 162 Query: 69 DGFFQKIAKLAEQNQLRITLYRINTDK--LQCLPCTQVWSRAMYFRLFAFQLLGLTLDRL 126 Q + K Q+ ++ Y K + +P T +L + L L ++ Sbjct: 163 RHVLQVLFKTWVLKQVGVSFYDAEKLKADVDWIPNTHYSGVYGLMKLTLTRALPEFLSKV 222 Query: 127 LYLDADVVCKGDISQLL---HLGLNGAVAAVVKDVEPMQE-KAVSRLSDPELLGQYFNSG 182 + LD DV D+++L + +V++ K + +G+ FN+G Sbjct: 223 IVLDTDVFFLTDLAELWAFFNNFTEDQAIGLVENQSQWYTGKLWKKYKIWPAIGRGFNTG 282 Query: 183 VVYLDLKKWADAKLTEKALSILMSKDNV----YKYPDQDVMNVLLKGM---TLFLPREYN 235 V+ DL+K + + K + DQDV+N LK LP ++N Sbjct: 283 VMLFDLQKLRKFQW-AHLWRLTAEKQLLNLLSTVLADQDVINAALKDNPQIVYKLPCQWN 341 Query: 236 TIYTIKSELKDKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYK-IALENSPWK 294 + +E + Y KLI IH+ K H V+Y++ + L + Sbjct: 342 IQLSDNTE----SEYCYNKLIELKA--IHWNSPNK--HTGNKLKHVEYFRNMYLTFLEYN 393 Query: 295 DDSPRD 300 + R Sbjct: 394 GNLLRK 399 >UniRef50_Q9M9Y5 F4H5.13 protein n=4 Tax=rosids RepID=Q9M9Y5_ARATH Length = 589 Score = 102 bits (255), Expect = 2e-20, Method: Composition-based stats. 
Identities = 52/271 (19%), Positives = 100/271 (36%), Gaps = 37/271 (13%) Query: 43 GVSITSIVLNNRH-INLDFYIIADV-----YNDGFFQKIAKLAEQNQLRITLYRI---NT 93 V + S + +++ + F+++ D + F I A L I + + Sbjct: 313 SVVVNSTISSSKEPERIVFHVVTDSLNYPAISMWFLLNIQSKATIQILNIDDMDVLPRDY 372 Query: 94 DKL---QCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGA 150 D+L Q + S + R + + L++++ LD DVV + D+S+L + + G Sbjct: 373 DQLLMKQNSNDPRFISTLNHARFYLPDIFPG-LNKMVLLDHDVVVQRDLSRLWSIDMKGK 431 Query: 151 VAAVVKDV-------------EPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLT 197 V V+ + V+ P F G+ +DL++W KLT Sbjct: 432 VVGAVETCLEGESSFRSMSTFINFSDTWVAGKFSPRACTWAF--GMNLIDLEEWRIRKLT 489 Query: 198 EKALSILM--SKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKL 255 + +K ++K + + TL L + ++ + + K + Sbjct: 490 STYIKYFNLGTKRPLWKAGSLPIGWLTFYRQTLALDKRWHVMGLGR-------ESGVKAV 542 Query: 256 ITESTLLIHYTGATKPWHKWAIYPSVKYYKI 286 E +IHY G KPW +Y+ I Sbjct: 543 DIEQAAVIHYDGVMKPWLDIGKENYKRYWNI 573 >UniRef50_B5ZNF8 Glycosyl transferase family 8 n=7 Tax=Rhizobium RepID=B5ZNF8_RHILW Length = 303 Score = 102 bits (254), Expect = 2e-20, Method: Composition-based stats. 
Identities = 49/301 (16%), Positives = 100/301 (33%), Gaps = 36/301 (11%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHIN--LDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 V Y D Y +I S + + + D ++ + D F ++ L + + + Sbjct: 6 VVYVTDVEYSFP---TILSALQARKFASPATDVCVLMSEHLD-NFDELRSLLATSGVDLI 61 Query: 88 L----YRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLL 143 + + KL S + +L ++L +++YLD D D+ L Sbjct: 62 DATEALQDSLGKLDGSHFQGRISVSTMAKLVLCEILPANYTQIIYLDGDTQIVSDLGGLE 121 Query: 144 HLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSI 203 + + +D A+ D YFN+GV+ W + ++AL + Sbjct: 122 NALVPEGRFFAARD-----YTAIHDFLDTGKNSHYFNAGVLKFHRNGW----IGQEALEL 172 Query: 204 LMSKDNVYKY-PDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTL- 261 + DQ +N + + + +N + + L+ S L Sbjct: 173 FARNPEACEGKHDQGALNYVCGSSLILVSNRWNF------------PKQFLHLVNMSALS 220 Query: 262 LIHYTGATKPWH---KWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHLLVQH 318 ++HY KPWH + Y + P + R + +Y+ + + Sbjct: 221 IVHYMAHPKPWHGTFFPWTDRESQVYVDLRKAHPIYNSLYRGITFDRKALYKYRSMRARI 280 Query: 319 H 319 Sbjct: 281 K 281 >UniRef50_Q31QV9 Lipopolysaccharide biosynthesis proteins LPS n=2 Tax=Synechococcus elongatus RepID=Q31QV9_SYNE7 Length = 329 Score = 102 bits (254), Expect = 2e-20, Method: Composition-based stats. 
Identities = 62/302 (20%), Positives = 115/302 (38%), Gaps = 33/302 (10%) Query: 10 DKVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNR-HINLDFYIIADVYN 68 D+ + + L++ G++ + V I S V N+R L F I+ Sbjct: 25 DESGVTPIAPPDETATMTLDIVLGLNQPIDFTLPVVINSAVQNSRQRETLRFNIVVPTGQ 84 Query: 69 DGFFQKIAKL---AEQNQLRITLYRINTD---KLQCLPCTQVWSRAM-----YFRLFAFQ 117 FQ + + + Q Q R+ ++ + D L R + + R++ Q Sbjct: 85 TEHFQALLETTFPSPQFQWRLGTFQPSADLADYLAHKYSRDRGERLLGRFMQFSRVWLPQ 144 Query: 118 LLGLTLDRLLYLDADVVCKGDISQLLHL--GLNGAVA-AVVKDVEPMQ---EKAVSRLSD 171 + L R+LY D DVV D + L N + A V P +K S Sbjct: 145 VFP-DLTRILYFDTDVVLLEDPAILDQQAGDFNDQIFFAAVPHSRPAWLYFKKPWRAHSY 203 Query: 172 PELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKY-----PDQDVMNVLLKGM 226 + +G FNSGV+ DL+ W +A + ++ + + +D ++Y D+ ++N Sbjct: 204 IKAMGTTFNSGVMVTDLRFWTEA-VYQR-IQAALDRDRQFRYRFLEPGDEALLNACF-PN 260 Query: 227 TLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTG-ATKPWHKWAIYPSVKYYK 285 LP+ +N + + + +IH++G KPW+ I ++ Sbjct: 261 YRALPKRWNRCGYGNARFVARL----LACDPQEAAIIHWSGGHHKPWNTHDII-YGDLWR 315 Query: 286 IA 287 Sbjct: 316 RY 317 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P27129 Lipopolysaccharide 1,2-glucosyltransferase n=44 ... 321 2e-86 UniRef50_Q9R9D1 UDP-glucose:(Glucosyl) LPS alpha1,3-glucosyltran... 264 4e-69 UniRef50_A8ARL6 Putative uncharacterized protein n=3 Tax=Citroba... 259 1e-67 UniRef50_B2PV91 Putative uncharacterized protein n=2 Tax=Enterob... 252 1e-65 UniRef50_D0KD54 Lipopolysaccharide glucosyltransferase I n=2 Tax... 252 2e-65 UniRef50_UPI00019F16C6 hypothetical protein CATC2_20202 n=1 Tax=... 251 3e-65 UniRef50_Q9ZIS6 UDP-galactose:(Glucosyl) LPS alpha1,2-galactosyl... 248 3e-64 UniRef50_D2U322 Lipopolysaccharide 1,2-glucosyltransferase n=1 T... 246 9e-64 UniRef50_D2TIX6 UDP-galactose:(Glucosyl) LPS alpha-1,3-galactosy... 
243 6e-63 UniRef50_Q46Y64 Glycosyl transferase, family 8 n=1 Tax=Ralstonia... 243 6e-63 UniRef50_Q9ZIS1 UDP-galactose:(Galactosyl) LPS alpha1,2-galactos... 242 1e-62 UniRef50_D0KD53 Lipopolysaccharide 3-alpha-galactosyltransferase... 241 3e-62 UniRef50_B6XJW1 Putative uncharacterized protein n=1 Tax=Provide... 240 5e-62 UniRef50_Q9ZIT4 UDP-galactose:(Glucosyl) LPS alpha1,3-galactosyl... 239 1e-61 UniRef50_Q9ZIT6 UDP-glucose:(Galactosyl) LPS alpha1,2-glucosyltr... 238 2e-61 UniRef50_C1DGU7 Lipopolysaccharide 1,3-galactosyltransferase n=1... 238 2e-61 UniRef50_UPI0001B4A0A4 putative glucosyltransferase n=1 Tax=Bact... 238 2e-61 UniRef50_D1PTN4 Putative uncharacterized protein n=1 Tax=Prevote... 238 2e-61 UniRef50_C7IBC1 Glycosyl transferase family 8 n=2 Tax=Clostridiu... 234 3e-60 UniRef50_B6XJW2 Putative uncharacterized protein n=1 Tax=Provide... 234 5e-60 UniRef50_D2TY85 UDP-D-galactose:(Glucosyl)lipopolysaccharide-alp... 232 2e-59 UniRef50_P27128 Lipopolysaccharide 1,3-galactosyltransferase n=4... 228 2e-58 UniRef50_A1BHG0 Glycosyl transferase, family 8 n=2 Tax=Chlorobiu... 224 5e-57 UniRef50_UPI000197C525 UDP-D-galactose:(glucosyl) lipopolysaccha... 224 5e-57 UniRef50_D2RIJ4 Glycosyl transferase family 8 n=2 Tax=Acidaminoc... 222 1e-56 UniRef50_P25148 General stress protein A n=3 Tax=Bacillus subtil... 221 4e-56 UniRef50_Q38VK7 Putative bifunctional glycosyl transferase, fami... 219 9e-56 UniRef50_Q64ZV2 Lipopolysaccharide 1,2-glucosyltransferase n=8 T... 219 1e-55 UniRef50_C7QL87 Glycosyl transferase family 8 n=2 Tax=Cyanothece... 218 2e-55 UniRef50_Q5LF36 Putative glucosyltransferase n=1 Tax=Bacteroides... 217 3e-55 UniRef50_C2HBB8 Family 8 glycosyltransferase n=13 Tax=Lactobacil... 214 3e-54 UniRef50_C3X7M2 Lipopolysaccharide 3-alpha-galactosyltransferase... 214 3e-54 UniRef50_D2ELM0 Lipopolysaccharide biosynthesis glycosyltransfer... 214 3e-54 UniRef50_C2HBB9 Family 8 glycosyltransferase n=35 Tax=Lactobacil... 
214 3e-54 UniRef50_B7AIV0 Putative uncharacterized protein n=1 Tax=Bactero... 212 2e-53 UniRef50_B3Q568 Putative glycosyltransferase protein n=4 Tax=Rhi... 211 3e-53 UniRef50_A7VNX5 Putative uncharacterized protein n=1 Tax=Clostri... 211 4e-53 UniRef50_C3PWZ8 Lipopolysaccharide 1,2-glucosyltransferase n=1 T... 209 1e-52 UniRef50_A4ECW2 Putative uncharacterized protein n=1 Tax=Collins... 209 1e-52 UniRef50_C0WCJ1 Lipopolysaccharide 1,2-glucosyltransferase n=1 T... 208 2e-52 UniRef50_C5ELK9 Glycosyl transferase n=3 Tax=Clostridiales RepID... 206 7e-52 UniRef50_B6GCA0 Putative uncharacterized protein n=1 Tax=Collins... 206 8e-52 UniRef50_C4VEI8 General stress protein A n=24 Tax=Enterococcus R... 206 1e-51 UniRef50_B8ISQ5 Glycosyl transferase family 8 n=1 Tax=Methylobac... 204 4e-51 UniRef50_B4RJG6 LgtC n=29 Tax=Neisseria RepID=B4RJG6_NEIG2 204 4e-51 UniRef50_B7GNT4 Glycosyl transferase, family 8 n=5 Tax=Bifidobac... 203 6e-51 UniRef50_A8ARL4 Putative uncharacterized protein n=2 Tax=Citroba... 202 1e-50 UniRef50_Q03HK5 Lipopolysaccharide biosynthesis glycosyltransfer... 202 1e-50 UniRef50_A7A7B4 Putative uncharacterized protein n=2 Tax=Bifidob... 202 1e-50 UniRef50_UPI0001A457E5 hypothetical protein NsubN_08151 n=1 Tax=... 202 1e-50 UniRef50_UPI000196958D hypothetical protein BACCELL_01586 n=1 Ta... 202 2e-50 UniRef50_A6LGX5 Glycosyltransferase family 8 n=1 Tax=Parabactero... 202 2e-50 UniRef50_B1MX28 Lipopolysaccharide biosynthesis protein, LPS:gly... 199 1e-49 UniRef50_Q9L6B2 Putative glycosyl transferase n=3 Tax=Pasteurell... 198 2e-49 UniRef50_C0Z4I4 Putative uncharacterized protein gspA n=1 Tax=Br... 198 2e-49 UniRef50_D2LIH7 Glycosyl transferase family 8 n=1 Tax=Rhodomicro... 198 3e-49 UniRef50_A5ZFA9 Putative uncharacterized protein n=1 Tax=Bactero... 197 5e-49 UniRef50_B7KM20 Glycosyl transferase family 8 n=1 Tax=Cyanothece... 197 5e-49 UniRef50_Q116W1 Glycosyl transferase, family 8 n=1 Tax=Trichodes... 
196 1e-48 UniRef50_B2UPJ4 Glycosyl transferase family 8 n=1 Tax=Akkermansi... 196 1e-48 UniRef50_C7IBC8 Glycosyl transferase family 8 n=1 Tax=Clostridiu... 196 1e-48 UniRef50_B3CA80 Putative uncharacterized protein n=1 Tax=Bactero... 195 2e-48 UniRef50_B9KVD4 Glycosyl transferase, family 8 n=2 Tax=Rhodobact... 194 3e-48 UniRef50_B3Z5I6 Glycosyltransferase family 8 n=2 Tax=Bacillus ce... 194 3e-48 UniRef50_C5ZV11 Glycosyl transferase n=2 Tax=Helicobacter canade... 194 3e-48 UniRef50_B6G807 Putative uncharacterized protein n=2 Tax=Collins... 194 4e-48 UniRef50_D1P7H1 Lipopolysaccharide 1,2-glucosyltransferase n=1 T... 194 5e-48 UniRef50_C2KV37 Lipopolysaccharide biosynthesis protein, LPS:gly... 194 5e-48 UniRef50_C8W7U9 Glycosyl transferase family 8 n=2 Tax=Atopobium ... 193 7e-48 UniRef50_D2QX94 Glycosyl transferase family 8 n=1 Tax=Pirellula ... 193 7e-48 UniRef50_B1Y723 Glycosyl transferase family 8 n=1 Tax=Leptothrix... 192 1e-47 UniRef50_B3XL28 Glycosyl transferase family 8 n=1 Tax=Lactobacil... 192 1e-47 UniRef50_B7C7N8 Putative uncharacterized protein n=2 Tax=Firmicu... 192 2e-47 UniRef50_C1IBL0 Glycosyl transferase n=1 Tax=Clostridium sp. 7_2... 192 2e-47 UniRef50_A4VVV8 Lipopolysaccharide biosynthesis proteins, LPS:gl... 191 3e-47 UniRef50_Q2K5X3 Galactosyltransferase protein n=13 Tax=Rhizobium... 191 3e-47 UniRef50_A1XRC1 Glycosyltransferase n=1 Tax=Haemophilus ducreyi ... 190 7e-47 UniRef50_C6I3U6 Putative uncharacterized protein n=2 Tax=Bactero... 190 7e-47 UniRef50_B2ISC2 Glycosyl transferase, family 8 n=2 Tax=Streptoco... 189 1e-46 UniRef50_D2QX95 Glycosyl transferase family 8 n=1 Tax=Pirellula ... 189 1e-46 UniRef50_C0QZN2 Glycosyl transferase, family 8 n=3 Tax=Brachyspi... 189 1e-46 UniRef50_P43974 Putative glycosyltransferase HI0258 n=32 Tax=Hae... 189 2e-46 UniRef50_C1MLJ1 Glycosyltransferase family 24 protein n=1 Tax=Mi... 188 3e-46 UniRef50_C2LRU0 Glycosyl transferase, family 8 n=1 Tax=Streptoco... 
188 3e-46 UniRef50_C1QBZ8 LPS:glycosyltransferase n=1 Tax=Brachyspira murd... 187 4e-46 UniRef50_C9RWX3 Glycosyl transferase family 8 n=8 Tax=Bacillacea... 187 4e-46 UniRef50_Q3D426 Glycosyl transferase, family 8 n=7 Tax=Streptoco... 186 9e-46 UniRef50_A5EVI8 Glycosyl transferase family 8 protein n=1 Tax=Di... 186 1e-45 UniRef50_D1Y7U2 Glycosyltransferase, family 8 n=1 Tax=Pyramidoba... 186 1e-45 UniRef50_B8G232 Glycosyl transferase family 8 n=2 Tax=Desulfitob... 185 1e-45 UniRef50_B2ISC6 Glycosyl transferase, family 2/glycosyl transfer... 185 2e-45 UniRef50_C8WAA9 Glycosyl transferase family 8 n=2 Tax=Atopobium ... 185 2e-45 UniRef50_Q5WI33 Lipopolysaccharide glycosyltransferase n=6 Tax=F... 184 3e-45 UniRef50_Q3DNS6 Glycosyl transferase, family 8 n=5 Tax=Streptoco... 184 4e-45 UniRef50_D2RIJ7 Glycosyl transferase family 8 n=1 Tax=Acidaminoc... 183 7e-45 UniRef50_C8N7M8 Putative uncharacterized protein n=1 Tax=Cardiob... 183 7e-45 UniRef50_C0BAU3 Putative uncharacterized protein n=1 Tax=Coproco... 183 8e-45 UniRef50_B2UQ54 Glycosyl transferase family 8 n=1 Tax=Akkermansi... 183 1e-44 UniRef50_C1CFZ1 Glycosyl transferase, family 8 n=17 Tax=Streptoc... 182 1e-44 UniRef50_D1JY84 Putative uncharacterized protein n=1 Tax=Bactero... 182 1e-44 UniRef50_D0D9G3 Putative general stress protein A n=1 Tax=Citrei... 182 1e-44 UniRef50_B1I7M9 Glycosyl transferase, family 8 n=16 Tax=Streptoc... 182 1e-44 UniRef50_UPI000175831B PREDICTED: similar to UDP-glucose glycopr... 181 3e-44 UniRef50_C5S494 Putative glycosyl transferase n=2 Tax=Actinobaci... 181 3e-44 UniRef50_B6VUC8 Putative uncharacterized protein n=1 Tax=Bactero... 181 3e-44 UniRef50_A3CM53 Glycosyltransferase, putative n=9 Tax=Bacteria R... 181 3e-44 UniRef50_UPI0001A45357 glycosyl transferase family protein n=1 T... 181 4e-44 UniRef50_C1QC00 LPS:glycosyltransferase n=1 Tax=Brachyspira murd... 181 4e-44 UniRef50_C7RG54 Glycosyl transferase family 8 n=1 Tax=Anaerococc... 
180 5e-44 UniRef50_C9PNX4 Family 8 glycosyl transferase n=1 Tax=Pasteurell... 180 5e-44 UniRef50_A0KQP2 Glycosyl transferase, family 8 n=2 Tax=Aeromonas... 180 5e-44 UniRef50_C4Z1V1 Putative uncharacterized protein n=1 Tax=Eubacte... 180 5e-44 UniRef50_C1FE59 Glycosyltransferase family 24 protein n=1 Tax=Mi... 179 9e-44 UniRef50_B1I7N1 Glycosyl transferase, family 8 n=8 Tax=Streptoco... 179 1e-43 UniRef50_Q09332 UDP-glucose:glycoprotein glucosyltransferase n=1... 179 1e-43 UniRef50_A4UX76 Putative LPS biosynthesis protein n=2 Tax=Lactob... 179 1e-43 UniRef50_A8FNA2 Putative uncharacterized protein n=2 Tax=Campylo... 179 1e-43 UniRef50_A5LNA9 Glycosyl transferase, family 8 n=2 Tax=Streptoco... 179 1e-43 UniRef50_B3WD33 WbbM protein n=8 Tax=Lactobacillus RepID=B3WD33_... 179 1e-43 UniRef50_A8XPN2 Putative uncharacterized protein n=2 Tax=Caenorh... 179 1e-43 UniRef50_B9BAZ6 Glycosyl transferase family 8 protein n=1 Tax=Bu... 178 3e-43 UniRef50_B2UQN6 Glycosyl transferase family 8 n=1 Tax=Akkermansi... 177 4e-43 UniRef50_Q3DM64 Glycosyl transferase, family 8, degenerate n=6 T... 177 5e-43 UniRef50_Q01GT2 UDP-glucose:glycoprotein glucosyltransferase, pu... 177 5e-43 UniRef50_Q4JZJ9 Putative glycosyl transferase n=8 Tax=Streptococ... 177 5e-43 UniRef50_Q5M3K9 Glycosyl transferase n=4 Tax=Streptococcus RepID... 176 8e-43 UniRef50_A3VFX3 Glycosyltransferase n=1 Tax=Rhodobacterales bact... 176 9e-43 UniRef50_B1LK07 Glycosyl transferase family 8 n=10 Tax=Enterobac... 176 9e-43 UniRef50_Q65VF6 RfaJ protein n=1 Tax=Mannheimia succiniciproduce... 176 1e-42 UniRef50_C3ZE29 Putative uncharacterized protein n=1 Tax=Branchi... 176 1e-42 UniRef50_UPI0001AEC697 glycosyl transferase family protein n=1 T... 176 1e-42 UniRef50_UPI000180C254 PREDICTED: similar to UDP-glucose ceramid... 175 1e-42 UniRef50_Q9L7A2 Putative glycosyl transferase n=1 Tax=Haemophilu... 175 1e-42 UniRef50_D1Z8I8 Whole genome shotgun sequence assembly, scaffold... 
175 2e-42 UniRef50_A2QNN6 Contig An07c0170, complete genome n=10 Tax=Leoti... 175 3e-42 UniRef50_C1QC03 LPS:glycosyltransferase n=1 Tax=Brachyspira murd... 174 4e-42 UniRef50_A8PS15 UDP-glucose:Glycoprotein Glucosyltransferase con... 174 4e-42 UniRef50_Q3D427 Glycosyl transferase, family 8 n=8 Tax=Streptoco... 174 5e-42 UniRef50_C6LDU2 Glycosyl transferase, family 8 n=3 Tax=Firmicute... 173 8e-42 UniRef50_D1HRJ7 Whole genome shotgun sequence of line PN40024, s... 173 1e-41 UniRef50_D0N7I0 UDP-glucose:glycoprotein glucosyltransferase, pu... 172 1e-41 UniRef50_UPI0000E47484 PREDICTED: similar to UDP-glucose ceramid... 172 2e-41 UniRef50_Q4HGS8 General stress protein A, putative n=2 Tax=Campy... 171 2e-41 UniRef50_Q9NYU2 UDP-glucose:glycoprotein glucosyltransferase 1 n... 171 4e-41 UniRef50_A8AY72 Glycosyl transferase, family 8 SP1766 n=7 Tax=St... 171 4e-41 UniRef50_P91854 Protein F26H9.8, partially confirmed by transcri... 170 5e-41 UniRef50_Q5KMJ4 Putative uncharacterized protein n=1 Tax=Filobas... 170 8e-41 UniRef50_Q4PEF1 Putative uncharacterized protein n=1 Tax=Ustilag... 170 9e-41 UniRef50_UPI0001792D56 PREDICTED: similar to UDP-glucose glycopr... 169 9e-41 UniRef50_Q3DNA2 Glycosyl transferase, family 8 n=9 Tax=Streptoco... 169 1e-40 UniRef50_UPI0001693121 general stress protein n=1 Tax=Paenibacil... 169 1e-40 UniRef50_C5S3F7 Putative glycosyl transferase n=2 Tax=Actinobaci... 169 2e-40 UniRef50_C5XV64 Putative uncharacterized protein Sb04g036540 n=1... 168 2e-40 UniRef50_Q046Z9 Lipopolysaccharide biosynthesis glycosyltransfer... 168 2e-40 UniRef50_C6H742 UDP-glucose:glycoprotein glucosyltransferase n=1... 167 4e-40 UniRef50_C5FDY7 Glycogenin n=1 Tax=Microsporum canis CBS 113480 ... 167 4e-40 UniRef50_A5UC07 Aspartate-semialdehyde dehydrogenase n=7 Tax=Hae... 167 5e-40 UniRef50_C4Q2X6 Udp-glucose glycoprotein:glucosyltransferase, pu... 167 5e-40 UniRef50_B9QZ95 Glycosyl transferase family 8 n=1 Tax=Labrenzia ... 
167 6e-40 UniRef50_A8NCT1 Putative uncharacterized protein n=1 Tax=Coprino... 167 7e-40 UniRef50_Q6ESI8 Putative UDP-glucose:glycoprotein glucosyltransf... 166 9e-40 UniRef50_B2VVG3 UDP-glucose:glycoprotein glucosyltransferase n=9... 166 9e-40 UniRef50_A1VG39 Glycosyl transferase, family 8 n=1 Tax=Desulfovi... 166 1e-39 UniRef50_B0NR59 Putative uncharacterized protein n=1 Tax=Bactero... 165 2e-39 UniRef50_Q0I2Z7 Lipopolysaccharide biosynthesis protein, glycosy... 165 3e-39 UniRef50_C4R603 Protein required for beta-1,6 glucan biosynthesi... 164 4e-39 UniRef50_B8PIH6 Predicted protein n=2 Tax=Agaricomycetes RepID=B... 164 4e-39 UniRef50_A2RAV0 Catalytic activity: UDP-glucose + glycogenin <=>... 164 4e-39 UniRef50_C4ZG45 Glycosyltransferase Family 2 modular protein n=1... 163 7e-39 UniRef50_Q873M5 UDP-Glc:glycoprotein glucosyltransferase n=2 Tax... 163 7e-39 UniRef50_B6K765 UDP-glucose:glycoprotein glucosyltransferase n=1... 163 9e-39 UniRef50_Q8T191 Probable UDP-glucose:glycoprotein glucosyltransf... 163 1e-38 UniRef50_B6HCQ7 Pc18g02120 protein n=2 Tax=mitosporic Trichocoma... 162 1e-38 UniRef50_Q2HHC6 Putative uncharacterized protein n=1 Tax=Chaetom... 162 1e-38 UniRef50_C0AA16 Lipopolysaccharide biosynthesis protein LPS:glyc... 162 2e-38 UniRef50_A3V3C9 Putative uncharacterized protein n=1 Tax=Loktane... 162 2e-38 UniRef50_O48684 F3I6.10 protein n=46 Tax=Embryophyta RepID=O4868... 162 2e-38 UniRef50_D1HWZ1 Whole genome shotgun sequence of line PN40024, s... 161 3e-38 UniRef50_Q2L3C5 Glycosyl transferase-like protein n=3 Tax=Magnol... 161 3e-38 UniRef50_C6EQF4 Putative uncharacterized protein n=3 Tax=Campylo... 161 4e-38 UniRef50_A5BZU1 Putative uncharacterized protein n=1 Tax=Vitis v... 161 4e-38 UniRef50_C7XX93 Glycosyl transferase, family 8 n=1 Tax=Lactobaci... 161 4e-38 UniRef50_C3XGD2 Putative uncharacterized protein n=1 Tax=Helicob... 161 4e-38 UniRef50_C0X9Z7 Putative uncharacterized protein n=2 Tax=Lactoba... 
160 5e-38 UniRef50_Q6BJN0 DEHA2G01232p n=3 Tax=Saccharomycetaceae RepID=Q6... 160 7e-38 UniRef50_Q1RIL1 Lipopolysaccharide 1,2-glucosyltransferase RfaJ ... 160 9e-38 UniRef50_Q4E3K0 UDP-glucose:glycoprotein glucosyltransferase n=2... 159 1e-37 UniRef50_A4WWT5 Glycosyl transferase, family 8 n=1 Tax=Rhodobact... 159 2e-37 UniRef50_C5EZG9 Glycosyl transferase family protein n=1 Tax=Heli... 159 2e-37 UniRef50_C6IB51 Glycosyltransferase n=4 Tax=Bacteroides RepID=C6... 159 2e-37 UniRef50_Q09140 UDP-glucose:glycoprotein glucosyltransferase n=1... 158 2e-37 UniRef50_B9HMR5 Glycosyltransferase, CAZy family GT8 n=25 Tax=Ma... 158 2e-37 UniRef50_Q50FU8 Cj81-079 (Fragment) n=6 Tax=Campylobacter jejuni... 158 3e-37 UniRef50_Q582S2 UDP-glucose:glycoprotein glucosyltransferase, pu... 157 4e-37 UniRef50_A4IXE1 Glycosyl transferase, family 8 n=16 Tax=Francise... 157 5e-37 UniRef50_C5ZVZ7 Putative glycosyltransferase n=3 Tax=Campylobact... 157 5e-37 UniRef50_C1QEC6 LPS:glycosyltransferase n=1 Tax=Brachyspira murd... 157 6e-37 UniRef50_B9WDQ8 Killer toxin-resistance protein, putative n=5 Ta... 157 7e-37 UniRef50_C7HS13 Family 8 glycosyl transferase n=2 Tax=Anaerococc... 157 8e-37 UniRef50_C3XKY2 Glycosyl transferase n=2 Tax=Campylobacterales R... 156 8e-37 UniRef50_C4JK72 Putative uncharacterized protein n=1 Tax=Uncinoc... 156 1e-36 UniRef50_UPI000023DC59 hypothetical protein FG01882.1 n=1 Tax=Gi... 156 1e-36 UniRef50_Q1CUZ8 Lipopolysaccharide 1,2-glycosyltransferase n=12 ... 155 2e-36 UniRef50_Q871S1 Glycogenin n=3 Tax=Sordariaceae RepID=Q871S1_NEUCR 155 2e-36 UniRef50_B3RM47 Putative uncharacterized protein n=1 Tax=Trichop... 155 2e-36 UniRef50_A5VK24 Glycosyl transferase, family 8 n=18 Tax=Lactobac... 155 2e-36 UniRef50_Q9LE59 Like glycosyl transferase 1 n=35 Tax=Embryophyta... 155 3e-36 UniRef50_A7H2M2 Glycosyl transferase family 8 n=3 Tax=Campylobac... 154 4e-36 UniRef50_C5JPW4 Glycosyl transferase family 8 protein n=2 Tax=Aj... 
154 4e-36 UniRef50_Q6Z5D6 Glycosyltransferase family-like n=6 Tax=Poaceae ... 154 5e-36 UniRef50_A8LP95 Lipopolysaccharide 1 n=1 Tax=Dinoroseobacter shi... 154 5e-36 UniRef50_Q0U987 Putative uncharacterized protein n=1 Tax=Phaeosp... 154 6e-36 UniRef50_C0S309 Glycogenin n=5 Tax=Onygenales RepID=C0S309_PARBP 154 7e-36 UniRef50_B2VRF2 Glycogenin-2 n=1 Tax=Pyrenophora tritici-repenti... 153 8e-36 UniRef50_UPI00016E26D6 UPI00016E26D6 related cluster n=3 Tax=Tak... 153 8e-36 UniRef50_D1N145 Glycosyl transferase family 8 n=1 Tax=Victivalli... 153 9e-36 UniRef50_B7AUG6 Putative uncharacterized protein n=1 Tax=Bactero... 152 1e-35 UniRef50_Q38VG7 Putative glycosyl transferase, family 8 n=1 Tax=... 152 1e-35 UniRef50_D0IR33 LPS 1,2-glycosyltransferase n=3 Tax=Helicobacter... 152 2e-35 UniRef50_Q39T65 Glycosyl transferase, family 8 n=1 Tax=Geobacter... 152 2e-35 UniRef50_B2B5U2 Predicted CDS Pa_2_5770 n=1 Tax=Podospora anseri... 151 3e-35 UniRef50_A5DLS6 Putative uncharacterized protein n=2 Tax=Pichia ... 151 3e-35 UniRef50_A2Q5F4 Glycosyl transferase, family 8 n=1 Tax=Medicago ... 151 4e-35 UniRef50_A7EPR4 Putative uncharacterized protein n=1 Tax=Sclerot... 151 4e-35 UniRef50_A9SH80 Predicted protein n=2 Tax=Physcomitrella patens ... 150 5e-35 UniRef50_A0ZYL4 Putative uncharacterized protein n=1 Tax=Archaea... 150 5e-35 UniRef50_D1IFB6 Whole genome shotgun sequence of line PN40024, s... 150 6e-35 UniRef50_C5P955 Glycosyl transferase family 8 protein n=2 Tax=Co... 150 6e-35 UniRef50_C6X2V2 Putative glycosyltransferase n=1 Tax=Flavobacter... 150 7e-35 UniRef50_B6Q6I5 Glycogenin n=3 Tax=Trichocomaceae RepID=B6Q6I5_P... 150 7e-35 UniRef50_B9SU65 UDP-glucose glycoprotein:glucosyltransferase, pu... 149 1e-34 UniRef50_Q2GW94 Putative uncharacterized protein n=1 Tax=Chaetom... 149 1e-34 UniRef50_C7Z1L1 Putative uncharacterized protein n=1 Tax=Nectria... 149 2e-34 UniRef50_UPI0001757CC2 PREDICTED: similar to glycogenin n=1 Tax=... 
149 2e-34 UniRef50_C4Y414 Putative uncharacterized protein n=1 Tax=Clavisp... 148 3e-34 UniRef50_UPI00016C0890 glycosyl transferase family 8 protein n=2... 148 3e-34 UniRef50_D2VE03 UDP-glucose-glycoprotein glucosyltransferase n=1... 148 3e-34 UniRef50_Q17VR5 Lipopolysaccharide biosynthesis protein n=19 Tax... 147 4e-34 UniRef50_B3XPR8 Glycosyl transferase family 8 n=4 Tax=Lactobacil... 147 5e-34 UniRef50_A4R9Z3 Putative uncharacterized protein n=1 Tax=Magnapo... 147 7e-34 UniRef50_Q68CQ7 Glycosyltransferase 8 domain-containing protein ... 146 1e-33 UniRef50_Q9FIK3 Emb|CAB71043.1 n=39 Tax=Embryophyta RepID=Q9FIK3... 146 1e-33 UniRef50_A3YS36 Putative sugar transferase n=2 Tax=Campylobacter... 146 1e-33 UniRef50_A2EY94 Glycosyl transferase family 8 protein n=1 Tax=Tr... 146 1e-33 UniRef50_UPI000197AD97 hypothetical protein BACCOPRO_03221 n=1 T... 145 2e-33 UniRef50_A5DMZ6 Putative uncharacterized protein n=2 Tax=Pichia ... 145 2e-33 UniRef50_Q92VQ2 Putative lipopolysaccharide 1,3-galactosyltransf... 145 3e-33 UniRef50_A2FK31 Putative uncharacterized protein n=1 Tax=Trichom... 144 4e-33 UniRef50_Q9FH36 Similarity to unknown protein n=28 Tax=Embryophy... 144 5e-33 UniRef50_C5SH34 Glycosyl transferase family 8 n=1 Tax=Asticcacau... 143 5e-33 Sequences not found previously or not previously below threshold: >UniRef50_P27129 Lipopolysaccharide 1,2-glucosyltransferase n=44 Tax=Enterobacteriaceae RepID=RFAJ_ECOLI Length = 338 Score = 321 bits (824), Expect = 2e-86, Method: Composition-based stats. 
 Identities = 338/338 (100%), Positives = 338/338 (100%)

Query: 1   MDSFPAIEIDKVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDF 60
           MDSFPAIEIDKVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDF
Sbjct: 1   MDSFPAIEIDKVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDF 60

Query: 61  YIIADVYNDGFFQKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLG 120
           YIIADVYNDGFFQKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLG
Sbjct: 61  YIIADVYNDGFFQKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLG 120

Query: 121 LTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFN 180
           LTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFN
Sbjct: 121 LTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFN 180

Query: 181 SGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTI 240
           SGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTI
Sbjct: 181 SGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTI 240

Query: 241 KSELKDKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRD 300
           KSELKDKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRD
Sbjct: 241 KSELKDKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRD 300

Query: 301 AKSIIEFKKRYKHLLVQHHYISGIIAGVCYLCRKYYRK 338
           AKSIIEFKKRYKHLLVQHHYISGIIAGVCYLCRKYYRK
Sbjct: 301 AKSIIEFKKRYKHLLVQHHYISGIIAGVCYLCRKYYRK 338

>UniRef50_Q9R9D1 UDP-glucose:(Glucosyl) LPS alpha1,3-glucosyltransferase WaaO n=29 Tax=Enterobacteriaceae RepID=Q9R9D1_ECOLX
          Length = 338

 Score = 264 bits (674), Expect = 4e-69, Method: Composition-based stats.
Identities = 119/336 (35%), Positives = 187/336 (55%), Gaps = 8/336 (2%) Query: 5 PAIEIDKVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIA 64 P I+K +D R A S +VAYG+D N+L G GVSITS++L+N ++ F++ Sbjct: 8 PQEMINKTIIFDERPAASVASS-FHVAYGIDKNFLFGCGVSITSVLLHNSDVSFVFHVFI 66 Query: 65 DVYNDGFFQKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLD 124 D + Q++A+LA+ + I ++ +N ++L+ LP T+ WS AMYFR D Sbjct: 67 DDIPEADIQRLAQLAKSYRTCIQIHLVNCERLKALPTTKNWSIAMYFRFVIADYFIDQQD 126 Query: 125 RLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVE-PMQEKAVSRLSDPELLGQYFNSGV 183 ++LYLDAD+ C+G++ L+ + L VAAVV + + L EL YFNSGV Sbjct: 127 KILYLDADIACQGNLKPLITMDLANNVAAVVTERDANWWSLRGQSLQCNELEKGYFNSGV 186 Query: 184 VYLDLKKWADAKLTEKALSILMSKD--NVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIK 241 + ++ WA ++ KA+S+L K + Y DQD++N++L G F+ +YNT +++ Sbjct: 187 LLINTLAWAQESVSAKAMSMLADKAIVSRLTYMDQDILNLILLGKVKFIDAKYNTQFSLN 246 Query: 242 SELKDKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDA 301 ELK +++ I + T+LIHY G TKPWH WA YPS + + A E SPWK++ Sbjct: 247 YELK----KSFVCPINDETVLIHYVGPTKPWHYWAGYPSAQPFIKAKEASPWKNEPLMRP 302 Query: 302 KSIIEFKKRYKHLLVQHHYISGIIAGVCYLCRKYYR 337 + + KH Q+ I+GI+ + Y K + Sbjct: 303 VNSNYARYCAKHNFKQNKPINGIMNYIYYFYLKIIK 338 >UniRef50_A8ARL6 Putative uncharacterized protein n=3 Tax=Citrobacter RepID=A8ARL6_CITK8 Length = 339 Score = 259 bits (661), Expect = 1e-67, Method: Composition-based stats. 
Identities = 127/332 (38%), Positives = 193/332 (58%), Gaps = 3/332 (0%) Query: 4 FPAIEIDKVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYII 63 F + I K + LN+AYGVD N+L G G+S+TS+++NN I++ FY++ Sbjct: 5 FENVIIQKKVIDNATHQKSKK---LNIAYGVDRNFLFGSGISMTSVLVNNPDIDIHFYVV 61 Query: 64 ADVYNDGFFQKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTL 123 D +D + + + +L + +T+ + + + LP T+ W+ AMY+R FAF+ L L Sbjct: 62 TDYVDDEYLESVERLTQMYGTTVTVLVFDNEAFRKLPSTKAWTYAMYYRYFAFEYLSREL 121 Query: 124 DRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGV 183 D +LYLDAD+VCK + +L + G AAVV D++ ++ K+ RL PEL YFNSGV Sbjct: 122 DSVLYLDADIVCKNSLRELTDIHFAGEYAAVVNDIDRVRLKSGQRLGIPELARDYFNSGV 181 Query: 184 VYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSE 243 V+ +L W + KL KA +L + Y DQD++N+L G + L R++N IY + E Sbjct: 182 VFANLHVWREKKLLSKAFEVLHERQKELLYFDQDILNILFVGHVILLRRDFNCIYGVDQE 241 Query: 244 LKDKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKS 303 LK+K Y+ ITEST+LIHY G TKPWH WA YP KY+ A + S W + S +A + Sbjct: 242 LKNKNEYRYQDFITESTVLIHYVGVTKPWHTWANYPVSKYFIEAYKKSAWAEKSLLNANT 301 Query: 304 IIEFKKRYKHLLVQHHYISGIIAGVCYLCRKY 335 +K++ +H +Q YI I + + Y+ K Sbjct: 302 AKLYKRKSRHERIQRKYIRSIFSHIMYIKNKL 333 >UniRef50_B2PV91 Putative uncharacterized protein n=2 Tax=Enterobacteriaceae RepID=B2PV91_PROST Length = 342 Score = 252 bits (644), Expect = 1e-65, Method: Composition-based stats. 
Identities = 133/331 (40%), Positives = 198/331 (59%), Gaps = 3/331 (0%) Query: 10 DKVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYND 69 +K + A+ CL+V YG D NY G GVS S+++NN F+ D + Sbjct: 7 NKYVLGEVCKADNTLLSCLDVIYGSDENYQFGAGVSAVSLLINNPTTFFRFHYFLDKVSP 66 Query: 70 GFFQKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYL 129 F +K+ +A Q Q+ +Y ++ L+ LP + VWS AMYFRL A L D LYL Sbjct: 67 DFLEKLKVIASQFQVEFHVYELDNKLLKTLPASDVWSSAMYFRLVALDYLSSDYDFALYL 126 Query: 130 DADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLK 189 DADV+C G + +L + V VV D ++ K+ +RL P L YFNSGV++++LK Sbjct: 127 DADVMCNGILDLTTNL-IKDKVCGVVADDIGVRTKSETRLHAPSLAKTYFNSGVMFVNLK 185 Query: 190 KWADAKLTEKALSILMSKD--NVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDK 247 KW + ++T++ +L +++ YKYPDQDV+N++L+ L + +NT+YT+K+EL D Sbjct: 186 KWHEKQITQQCFELLSAENAKQRYKYPDQDVLNLILREDLELLSQRFNTVYTLKNELYDS 245 Query: 248 THQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEF 307 THQ Y+++IT T+LIHYTG +KPWH WA YP+ + + AL SPW + + A +E Sbjct: 246 THQKYQQVITPETVLIHYTGVSKPWHTWANYPASQPFYKALMQSPWTTNDLKPATKFVER 305 Query: 308 KKRYKHLLVQHHYISGIIAGVCYLCRKYYRK 338 KK YKHLL Q +Y++GI++G+ Y K K Sbjct: 306 KKEYKHLLKQGNYLAGILSGIRYSFEKLMGK 336 >UniRef50_D0KD54 Lipopolysaccharide glucosyltransferase I n=2 Tax=Pectobacterium RepID=D0KD54_PECWW Length = 336 Score = 252 bits (643), Expect = 2e-65, Method: Composition-based stats. 
Identities = 167/333 (50%), Positives = 225/333 (67%), Gaps = 1/333 (0%) Query: 6 AIEIDKVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIAD 65 + ID + ++ R +I + LNVAYG+D NY G GVSITSI++NN I+ F++ +D Sbjct: 4 SSHIDVLSVFEKRHQSIADHDTLNVAYGIDKNYAVGCGVSITSILINNS-IDFTFHVFSD 62 Query: 66 VYNDGFFQKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDR 125 ++D F +KI+ LAE+ + +I LY+IN++ L+ LPCT +WS AMYFRL AF L Sbjct: 63 DFDDDFIKKISILAEKFKTKIILYKINSEMLKTLPCTDIWSHAMYFRLLAFSHLSDKTSS 122 Query: 126 LLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVY 185 LLYLDADV+CKG + QL L VAAV++DV MQ+K+ SRL L G+YFNSGV++ Sbjct: 123 LLYLDADVMCKGSLEQLHKLNTAPHVAAVIRDVPEMQKKSASRLKMAALEGEYFNSGVLF 182 Query: 186 LDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELK 245 +L W LT+K L + +YPDQD+MN+LL G FLP+EYNTIY+IK+ELK Sbjct: 183 ANLDIWNKLDLTQKIFDKLRDGEESIQYPDQDIMNILLNGNVTFLPKEYNTIYSIKNELK 242 Query: 246 DKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSII 305 D HQ YK++I + T+LIHYTG TKPWHKWA YPS Y++ A ENSPW +DA + + Sbjct: 243 DSNHQKYKEVIKDDTILIHYTGVTKPWHKWANYPSTSYFQHAQENSPWSTSDLKDADTFV 302 Query: 306 EFKKRYKHLLVQHHYISGIIAGVCYLCRKYYRK 338 E KK+YKHLL + Y+SG+I+ Y KY +K Sbjct: 303 EMKKKYKHLLKKGKYLSGLISAFKYSLNKYIKK 335 >UniRef50_UPI00019F16C6 hypothetical protein CATC2_20202 n=1 Tax=Citrobacter youngae ATCC 29220 RepID=UPI00019F16C6 Length = 330 Score = 251 bits (641), Expect = 3e-65, Method: Composition-based stats. 
Identities = 133/327 (40%), Positives = 196/327 (59%), Gaps = 6/327 (1%) Query: 12 VKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGF 71 K +F A LN+A+GVD N++ G +S+TS++L+N+ +N+ F++ D + + Sbjct: 10 KKILEFNQAPSEHKTQLNIAWGVDKNFMFGAAISMTSVLLHNKDLNIHFHLFTDYIDADY 69 Query: 72 FQKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDA 131 Q++AKLAEQ I++Y ++ + L+ LP WS AMYFR AF+ LG +D LLY+DA Sbjct: 70 QQRVAKLAEQFATNISIYIMDANGLKVLPSGNAWSHAMYFRFIAFEYLGEKVDSLLYIDA 129 Query: 132 DVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKW 191 DV+CKG + +L + L VAAV+ DV+ + D E YFNSGV++ +LKKW Sbjct: 130 DVMCKGSLYELTQIDLGEHVAAVITDVDDSPAR------DIEKNKDYFNSGVIFANLKKW 183 Query: 192 ADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQN 251 + A IL+ K+N +PDQDV+N+L +FL R +N IY IK ELK K Sbjct: 184 KEQNFINSAFDILLDKNNKLSFPDQDVLNILFLKKVIFLERRFNAIYGIKQELKSKDTSK 243 Query: 252 YKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRY 311 YK+ IT T+LIHY G TKPW+ WA YPS +Y+ A ++SPW D A++ ++KK+ Sbjct: 244 YKEYITPETILIHYIGVTKPWNSWANYPSAQYFVEAWKSSPWADVPLLPARTPKQYKKKS 303 Query: 312 KHLLVQHHYISGIIAGVCYLCRKYYRK 338 +H +Q Y + I+ + YL K K Sbjct: 304 RHERLQGKYFASAISYIGYLWAKLKSK 330 >UniRef50_Q9ZIS6 UDP-galactose:(Glucosyl) LPS alpha1,2-galactosyltransferase WaaT n=26 Tax=Enterobacteriaceae RepID=Q9ZIS6_ECOLX Length = 331 Score = 248 bits (633), Expect = 3e-64, Method: Composition-based stats. 
Identities = 136/331 (41%), Positives = 203/331 (61%), Gaps = 5/331 (1%) Query: 9 IDKVKAWDFRLANINTSE---CLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIAD 65 +++ F N E LNV+YG+D N+L G GVSI+S+++NN IN F++ D Sbjct: 1 MNEFIKERFSYLADNKKENAPELNVSYGIDKNFLYGAGVSISSVLINNSDINFVFHVFTD 60 Query: 66 VYNDGFFQKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDR 125 +D + + + A+Q I +Y I+ LP +Q WS A YFR+ +F+ L ++ Sbjct: 61 YVDDDYLKSFNETAKQFNTSIIVYLIDPKYFADLPTSQFWSYATYFRVLSFEYLSESIST 120 Query: 126 LLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVY 185 LLYLDADVVCKG + L + AAV+ D + Q RL+ PE+ G+YFN+GV+Y Sbjct: 121 LLYLDADVVCKGSLKPLTEIIFKDEFAAVIPDNDSTQAACAKRLNIPEMNGRYFNAGVIY 180 Query: 186 LDLKKWADAKLTEKALSIL--MSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSE 243 ++LKKW +A LT L +L +K KY DQD +N+ ++L ++++TIYT+K+E Sbjct: 181 VNLKKWHEANLTPYLLKLLRGETKYGSLKYLDQDALNIAFNMNNIYLAKDFDTIYTLKNE 240 Query: 244 LKDKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKS 303 L D++H+ Y++ IT+ T+LIHYTG TKPWH WA YPS Y+ IA E SPWK ++A++ Sbjct: 241 LYDRSHRKYQQTITDKTVLIHYTGITKPWHSWAGYPSASYFNIAREQSPWKKYPLKEART 300 Query: 304 IIEFKKRYKHLLVQHHYISGIIAGVCYLCRK 334 + E +K+YKHL YI GI + + Y +K Sbjct: 301 VAEMQKQYKHLFAHGEYIKGITSLIKYKLKK 331 >UniRef50_D2U322 Lipopolysaccharide 1,2-glucosyltransferase n=1 Tax=Arsenophonus nasoniae RepID=D2U322_9ENTR Length = 334 Score = 246 bits (628), Expect = 9e-64, Method: Composition-based stats. 
Identities = 122/316 (38%), Positives = 197/316 (62%), Gaps = 3/316 (0%) Query: 22 INTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQ 81 ++ N+AYGVD N+L G +SI S+++NN + +F++ D +DG+ Q+ + + Sbjct: 18 TENNKNFNIAYGVDKNFLLGAAISINSVLINNTDTDFNFHLFTDYIDDGYIQRFQTMIAK 77 Query: 82 NQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQ 141 I +Y ++ +L+ L + WS A YFRL AF+ L + +LYLDADV+CKG + + Sbjct: 78 YNSNIIIYLLDAAELKQLSTSDFWSYATYFRLIAFEYLSTNIHAILYLDADVICKGSLKE 137 Query: 142 LLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKAL 201 + L L + AAVV DV+ MQ+ + +RL+ +L G+YFN+GV+Y++L+KW + ++K+L Sbjct: 138 IFQLNLADSFAAVVLDVDSMQQSSATRLNLADLNGKYFNAGVIYVNLQKWIENDFSKKSL 197 Query: 202 SIL--MSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITES 259 ++ + KY DQD +N+L + ++L R+YN IY +K+EL YK IT+S Sbjct: 198 ELVRGKTNFGKLKYLDQDALNILFQTQNIYLSRDYNCIYKLKNELAYHDLSKYKNTITDS 257 Query: 260 TLLIHYTGATKPWHKWAI-YPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHLLVQH 318 T+LIHYTG TKPWH W I YP+ +++ + +SPWKD + A+ E +++YKHL +QH Sbjct: 258 TILIHYTGVTKPWHTWGINYPASQFFFNSYIHSPWKDQPLKMAEKRTELQEKYKHLFLQH 317 Query: 319 HYISGIIAGVCYLCRK 334 Y+ G + + Y K Sbjct: 318 KYMQGFLCLIKYKLLK 333 >UniRef50_D2TIX6 UDP-galactose:(Glucosyl) LPS alpha-1,3-galactosyltransferase n=1 Tax=Citrobacter rodentium ICC168 RepID=D2TIX6_CITRO Length = 340 Score = 243 bits (621), Expect = 6e-63, Method: Composition-based stats. 
Identities = 110/332 (33%), Positives = 173/332 (52%), Gaps = 9/332 (2%) Query: 12 VKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGF 71 DF + L++AYGVD N+L G G+SI S++ NN L F++ D +N+ Sbjct: 13 TSTIDFNHQDTAEKVVLDIAYGVDQNFLFGCGISIASVLKNNTDKTLHFHVFIDAFNETD 72 Query: 72 FQKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDA 131 + KLA Q + IT+Y IN + L+ LP T+ W+ A+YFR ++LLYLDA Sbjct: 73 RRMFDKLAAQYKTHITIYLINCEHLRSLPSTKNWTYAIYFRFAIADYFIGKTNKLLYLDA 132 Query: 132 DVVCKGDISQLLHLGLNGAVAAVV--KDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLK 189 D++C+G I +L++ A V + EK L + YFNSG++ ++L Sbjct: 133 DIICQGGIDELVNFSFASDKIAAVVTEGKADWWEKRALSLGTEGITKGYFNSGLILINLN 192 Query: 190 KWADAKLTEKALSILMSKD--NVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDK 247 +WA ++ +A+ +L D +PDQDV+N+LL FL ++NT +++ +LKDK Sbjct: 193 QWAIECISARAIKMLSDPDIVGRITHPDQDVLNILLADKLHFLDIKFNTQFSLNYQLKDK 252 Query: 248 THQNYKKLITESTLLIHYTGATKPWHKWAIYPS-VKYYKIALENSPWKDDSPRDAKSIIE 306 + + T+LIHY G TKPWH WA K + A + SPWK+ + + + Sbjct: 253 ----FINPVNNDTILIHYIGPTKPWHSWAGDYLISKPFIDAKQASPWKNTALLKPTNSNQ 308 Query: 307 FKKRYKHLLVQHHYISGIIAGVCYLCRKYYRK 338 F+ KH+L YI G++ Y +K + Sbjct: 309 FRYCAKHMLKNKRYIKGMVGYFLYFMKKITNR 340 >UniRef50_Q46Y64 Glycosyl transferase, family 8 n=1 Tax=Ralstonia eutropha JMP134 RepID=Q46Y64_RALEJ Length = 331 Score = 243 bits (621), Expect = 6e-63, Method: Composition-based stats. 
Identities = 72/320 (22%), Positives = 148/320 (46%), Gaps = 9/320 (2%) Query: 22 INTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQ 81 N ++A+ VD NY +G +I SI+ NN + F+++ + +++ +L E Sbjct: 17 SNGKPSFHIAFCVDDNYFRAMGATIASIIDNNPGQHFTFHVLTFSALEENQRRLKQLEEM 76 Query: 82 NQLRITLYRINTD---KLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGD 138 + L+ ++ + +S +++ RL ++L DR+LYLDAD++C Sbjct: 77 YPVSTQLHLLDLASFTQFSHFLGHSHYSLSIFTRLVIPEVLQGQTDRVLYLDADILCVNR 136 Query: 139 ISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTE 198 + +L+ + ++ +A VV D + V+ L YFN GV+++++ KW +T Sbjct: 137 LDELVDMDISNEIAVVVPDAPVTLRRRVAALGLAHAE--YFNGGVLFINIDKWLAENITP 194 Query: 199 KALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITE 258 + L L+ ++ DQD +N +L G ++ +N + + D + Sbjct: 195 QTLEALLDTSTDMRFNDQDALNKVLNGRAKYISPRWNYL---YDLIHDLNVNRFAMRPVG 251 Query: 259 STLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSP-RDAKSIIEFKKRYKHLLVQ 317 + IH+ G+ KPW W+ + + ++ L SPW+D + ++ E + + + Q Sbjct: 252 KAVFIHFAGSVKPWADWSGHEARGLFRKYLALSPWRDMPLDPEPRNTKEMRMHSRFMFRQ 311 Query: 318 HHYISGIIAGVCYLCRKYYR 337 H + + + YL ++ R Sbjct: 312 HKPVESLKWYLRYLRKRAQR 331 >UniRef50_Q9ZIS1 UDP-galactose:(Galactosyl) LPS alpha1,2-galactosyltransferase WaaW n=29 Tax=Enterobacteriaceae RepID=Q9ZIS1_ECOLX Length = 342 Score = 242 bits (618), Expect = 1e-62, Method: Composition-based stats. 
Identities = 122/321 (38%), Positives = 188/321 (58%), Gaps = 4/321 (1%) Query: 22 INTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQ 81 NT LN+AYG+D N+L G VS+ S+V++N + + F++ D ++ + Q++ + Sbjct: 18 ANTDRVLNIAYGIDRNFLFGAAVSMQSVVMHNPDLAVKFHLFTDYIDEDYLQRVNAFTSK 77 Query: 82 N-QLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDIS 140 N + + +Y++++ + P + WS A +FRL AFQ L T++ LLY+DADV+CKG ++ Sbjct: 78 NANVEVRIYKVSSAFIDIFPSLKQWSYATFFRLVAFQYLSETIENLLYIDADVICKGSLA 137 Query: 141 QLLHLGLNGAVAAVV-KDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEK 199 LL + +G A V KDV MQEK RL+ L G YFN+GVVYL L+ WA K Sbjct: 138 GLLDINFDGDKFAAVIKDVPFMQEKPAKRLAIEGLPGNYFNAGVVYLQLEAWAKNDFMNK 197 Query: 200 ALSILMSK--DNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLIT 257 A+++L S YK DQD++N+L G +F+ +Y+ Y I ELK+K+ ++YKK IT Sbjct: 198 AIAMLASDPQHTKYKCLDQDILNILFFGHCIFISGDYDCFYGIDYELKNKSDEDYKKTIT 257 Query: 258 ESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHLLVQ 317 + T LIHY G TKPW+ W YP KY+ A + S W D + A + +++ + +HL Sbjct: 258 DDTKLIHYVGVTKPWNDWTNYPCQKYFNEAYQASCWNDVAFIPATNEKQYQVKSRHLKRN 317 Query: 318 HHYISGIIAGVCYLCRKYYRK 338 + S + Y +K RK Sbjct: 318 GNIASSFYYFMLYYSKKIARK 338 >UniRef50_D0KD53 Lipopolysaccharide 3-alpha-galactosyltransferase n=3 Tax=Enterobacteriaceae RepID=D0KD53_PECWW Length = 336 Score = 241 bits (615), Expect = 3e-62, Method: Composition-based stats. 
Identities = 112/319 (35%), Positives = 176/319 (55%), Gaps = 4/319 (1%) Query: 22 INTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQ 81 L++A+G D ++ G ++I SI+L N L F++ D +DG + ++AEQ Sbjct: 18 SKKCAELDIAFGTDEKFIYGCAIAIASILLKNPDYCLSFHVFTDKLSDGDKARFQEMAEQ 77 Query: 82 NQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQ 141 I +Y ++ L+ LP T++WS A+YFR LD++LYLDAD++C G + + Sbjct: 78 YNTTINIYIVDCSWLKTLPETKLWSYAIYFRFIIADYFYKILDKVLYLDADIICNGSLQE 137 Query: 142 LLHLGLNGAVAAVVKDVEPMQEKA-VSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKA 200 L+ L L+ ++AVV D + K + PEL YFNSGV+ +++ W A +TE + Sbjct: 138 LIKLDLSNHISAVVLDGDSNWWKNRAQKFQQPELSNGYFNSGVLLIEVNNWHQAAVTENS 197 Query: 201 LSILMSKDNV--YKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITE 258 + +L + +PDQDV+NVLL G + + +YNT ++I ELK ++ I+ Sbjct: 198 MRLLTDPEMKKIITHPDQDVLNVLLAGKSCHIESKYNTQFSINYELKYSYGESAPTPISN 257 Query: 259 STLLIHYTGATKPWHKW-AIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHLLVQ 317 T+ IHY G TKPWHKW A Y KY+ A E+SPWK++S DA + + KH Sbjct: 258 KTIFIHYIGPTKPWHKWAANYACTKYFLKAKEHSPWKNESLLDAVTASNMRYCAKHQFHN 317 Query: 318 HHYISGIIAGVCYLCRKYY 336 I G ++ + YL +K + Sbjct: 318 GEIIRGTLSFLKYLYKKAF 336 >UniRef50_B6XJW1 Putative uncharacterized protein n=1 Tax=Providencia alcalifaciens DSM 30120 RepID=B6XJW1_9ENTR Length = 325 Score = 240 bits (613), Expect = 5e-62, Method: Composition-based stats. 
Identities = 144/327 (44%), Positives = 202/327 (61%), Gaps = 6/327 (1%) Query: 11 KVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDG 70 + L N + LN+AYGVD +L G G+S+ SI++NN I L F++ D ND Sbjct: 2 NLIKEKIELGAQNGAAELNIAYGVDKGFLFGSGLSMNSIIINNSDIKLKFHLFTDYMNDE 61 Query: 71 FFQKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLD 130 F K+ KL + I +Y IN D+L+ LP + VWS A YFR F F L TL +LYLD Sbjct: 62 FLSKLEKLTLNENVNIDIYIINADELKKLPISHVWSYATYFRFFIFDHLCETLSSILYLD 121 Query: 131 ADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKK 190 ADV CKG + + + + NG AAV+ DV MQ V RLS P++ +YFN+GV++L+LK Sbjct: 122 ADVFCKGSLRKYIDIAFNGEYAAVIPDVPNMQISCVDRLSMPQIKDKYFNAGVIFLNLKV 181 Query: 191 WADAKLTEKALSILMSKD--NVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKT 248 W K T++A +++ + KY DQD +N++ ++LPR+YN IYT+K+EL+ Sbjct: 182 WDKNKFTKQAFNLITNNHTGKTLKYLDQDALNIIFNCQNIYLPRDYNCIYTLKNELE--- 238 Query: 249 HQNYKKLITESTLLIHYTGATKPWHKWA-IYPSVKYYKIALENSPWKDDSPRDAKSIIEF 307 H+NYK IT T LIHYTGATKPWH WA YP+ + +K+A E SPWK+D DAK E+ Sbjct: 239 HENYKDYITSETKLIHYTGATKPWHYWAVNYPASQTFKVAFETSPWKNDELVDAKKKPEY 298 Query: 308 KKRYKHLLVQHHYISGIIAGVCYLCRK 334 ++RYKH Q +++GI + + Y K Sbjct: 299 QERYKHEFNQKKFLTGISSLIKYKKFK 325 >UniRef50_Q9ZIT4 UDP-galactose:(Glucosyl) LPS alpha1,3-galactosyltransferase WaaI n=26 Tax=Enterobacteriaceae RepID=Q9ZIT4_ECOLX Length = 335 Score = 239 bits (609), Expect = 1e-61, Method: Composition-based stats. 
Identities = 122/327 (37%), Positives = 184/327 (56%), Gaps = 10/327 (3%) Query: 15 WDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQK 74 ++F NI + L++A+G+D N+L G GV+I SI+LNNR I+ +F++ D +D Sbjct: 14 YNFHYQNIRSKNTLDIAFGIDRNFLFGCGVAIASILLNNREISCEFHVFTDYISDKDKLY 73 Query: 75 IAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVV 134 + LA+Q RI +Y IN DKL+ LP T+ W+ A YFR +++LYLDAD+ Sbjct: 74 FSDLAKQYNSRINIYVINCDKLKSLPSTKNWTYATYFRFIIADYFYHKHEKILYLDADIA 133 Query: 135 CKGDISQLLHLGLNGAVAAVV---KDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKW 191 CKG I +LL + A V +DVE Q +A S L+ P+L YFN+G + +++ +W Sbjct: 134 CKGSIKELLDYQFSTNEIAAVVAERDVEWWQNRA-SVLTTPQLASGYFNAGFLLINIDEW 192 Query: 192 ADAKLTEKALSILMSKD--NVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTH 249 ++ KA+ +L D + + DQDV+NVLL G F+ +YNT Y+I ELKDK Sbjct: 193 NLNNISSKAIEMLRDPDWVSKITHLDQDVLNVLLNGKVKFISEKYNTRYSINYELKDK-- 250 Query: 250 QNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKK 309 + + T+ IHY G TKPWH+WA YP + + IA SPW + + +++ Sbjct: 251 --VDNPVNDDTVFIHYVGPTKPWHEWANYPVSRSFLIAKAASPWSKEDLLKPVNSNQYRY 308 Query: 310 RYKHLLVQHHYISGIIAGVCYLCRKYY 336 KH Q HY++GI + Y K + Sbjct: 309 CAKHKFKQKHYMAGIFNYLKYYKEKCF 335 >UniRef50_Q9ZIT6 UDP-glucose:(Galactosyl) LPS alpha1,2-glucosyltransferase WaaJ n=26 Tax=Enterobacteriaceae RepID=Q9ZIT6_ECOLX Length = 339 Score = 238 bits (608), Expect = 2e-61, Method: Composition-based stats. 
Identities = 129/324 (39%), Positives = 194/324 (59%), Gaps = 2/324 (0%) Query: 13 KAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFF 72 D R ++ E NV++G+D NY G +SI SI+ NN+ F+IIAD + + Sbjct: 15 IELDKRPVKLDERETFNVSWGIDENYQVGAAISIASILENNKQNKFTFHIIADYLDKEYI 74 Query: 73 QKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDAD 132 + +++LA + Q I LY I+++ L+ LP + +W ++Y+RL +F LD LLYLDAD Sbjct: 75 ELLSQLATKYQTVIKLYLIDSEPLKALPQSNIWPVSIYYRLLSFDYFSARLDSLLYLDAD 134 Query: 133 VVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWA 192 +VCKG +++L+ L AVV DV+ MQ K+ RL + + G YFNSGV+Y++L++W Sbjct: 135 IVCKGSLNELIALEFKDEYGAVVIDVDAMQSKSAERLCNEDFNGSYFNSGVMYINLREWL 194 Query: 193 DAKLTEKALSILMSKD--NVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQ 250 +LTEK +L + KYPDQD++N++ LPR+YN IYTIKSE ++K + Sbjct: 195 KQRLTEKFFDLLSDESIIKKLKYPDQDILNLMFLHHAKILPRKYNCIYTIKSEFEEKNSE 254 Query: 251 NYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKR 310 Y + I + T+ IHYTG TKPWH WA Y S Y++ SPW++ + A E K++ Sbjct: 255 YYTRFINDDTVFIHYTGITKPWHDWANYASADYFRNIYNISPWRNIPYKKAVKKHEHKEK 314 Query: 311 YKHLLVQHHYISGIIAGVCYLCRK 334 YKHLL Q ++ G+ + Y K Sbjct: 315 YKHLLYQKKFLDGVFTAIKYNVMK 338 >UniRef50_C1DGU7 Lipopolysaccharide 1,3-galactosyltransferase n=1 Tax=Azotobacter vinelandii DJ RepID=C1DGU7_AZOVD Length = 326 Score = 238 bits (608), Expect = 2e-61, Method: Composition-based stats. 
Identities = 95/322 (29%), Positives = 162/322 (50%), Gaps = 12/322 (3%) Query: 22 INTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQ 81 S+ L++A+GVD NYL +G++I SI+ NN + L F++ + ++ +L Sbjct: 6 TRNSDVLHIAFGVDENYLRPMGITIVSIIENNPGLELVFHVFISSISSASRVRLDRLERM 65 Query: 82 NQLRITLYRINTDKLQCLPCTQ----VWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKG 137 + L+ ++ P + S+A Y RL + L DR+LYLDAD++C G Sbjct: 66 FARPVNLHLVDEMLDVKDPASGKGQAHISKAAYIRLLIPEALRDFTDRVLYLDADILCVG 125 Query: 138 DISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLT 197 DIS LLHL ++G AAV++D ++A + + L YFNSGV+Y+D+ +W + +T Sbjct: 126 DISGLLHLDIDGRTAAVIRDAGAESKRAGL-VKKGQTLDNYFNSGVLYIDIPRWIERAVT 184 Query: 198 EKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLIT 257 +AL + +Y DQD +N++L G F+ + +N Y + +LK + Sbjct: 185 SRALEKIADPVLDLRYSDQDALNLVLDGDVRFIDKGWNHQYGLTGKLKK---GRVGMDVP 241 Query: 258 ESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWK----DDSPRDAKSIIEFKKRYKH 313 T +H+ G KPW W + S + + SPW DD+ + + + Y+ Sbjct: 242 SDTKFVHFIGPMKPWRSWNPHQSKELFLRYQALSPWAGEALDDNFSPREIYVYSRFMYRS 301 Query: 314 LLVQHHYISGIIAGVCYLCRKY 335 + Q ++SG+I +L RK+ Sbjct: 302 MFQQGRWLSGLIWYGKFLHRKH 323 >UniRef50_UPI0001B4A0A4 putative glucosyltransferase n=1 Tax=Bacteroides sp. 2_1_7 RepID=UPI0001B4A0A4 Length = 301 Score = 238 bits (608), Expect = 2e-61, Method: Composition-based stats. 
Identities = 78/297 (26%), Positives = 144/297 (48%), Gaps = 4/297 (1%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 +++ D NY+ GV +TSI +NN + +I+ + + + + K+ + +I Sbjct: 1 MDIVCCTDNNYVIPCGVLVTSICVNNPKEEITVHILTEGISPENQEVLKKVVAKYGQQIQ 60 Query: 88 LYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGL 147 Y ++ P ++ + A YFRL +L +++++LYLD DVV + + L + Sbjct: 61 FYTVDKKVFANCPISRHITLATYFRLIMTDILPKSVEKVLYLDCDVVVRHSLRSLWDTDI 120 Query: 148 NGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSK 207 A V+ D+ + +RL LG YFN+GV+ ++L+ W + L+E I+ Sbjct: 121 KSYAAGVIPDMSIDDIRIYNRLQYSPSLG-YFNAGVLLVNLRYWRENNLSESFFEIINKY 179 Query: 208 DNVYKYPDQDVMNVLLKGMTLFLPREYNTIYT--IKSELKDKTHQNYKKLITESTLLIHY 265 +Y DQDV+N++LK + L LP +YN + K L +T+++ ++ +++HY Sbjct: 180 PERLRYHDQDVLNIVLKEIKLTLPMKYNVQHGYFFKDPLISRTYRDEREQAITDPVILHY 239 Query: 266 TGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHLLVQHHYIS 322 +G+ KPW P K + L+ S R K R++ LL + I+ Sbjct: 240 SGS-KPWFIEFEPPFKKDFAFYLDTSGLDKSFIRHIPMKARIKARFRSLLEKLGLIA 295 >UniRef50_D1PTN4 Putative uncharacterized protein n=1 Tax=Prevotella bergensis DSM 17361 RepID=D1PTN4_9BACT Length = 305 Score = 238 bits (608), Expect = 2e-61, Method: Composition-based stats. 
Identities = 66/283 (23%), Positives = 129/283 (45%), Gaps = 6/283 (2%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 +++ + +D NYL ++ SI+ NN+ + F++I++ + KI ++AE ++ Sbjct: 1 MDIVFNIDDNYLMQCCTTMVSILHNNKDGQISFHVISNGLTNESRLKIEQVAEAYHQQVF 60 Query: 88 LYRINTD---KLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLH 144 Y +N + + S A Y RLF +L L +++Y+D D++ G + L + Sbjct: 61 FYVVNPEAMSDYEIFDKQGHISMATYLRLFVADILPERLHKIIYMDCDLIVNGSLDGLWN 120 Query: 145 LGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSIL 204 + G A V+D+ + RL + YFN+GV+ ++L W + ++++A + Sbjct: 121 TDVEGYALAAVEDMWSGKADNYVRLGY-DAADTYFNAGVLVVNLDYWREHNVSQQAAQYV 179 Query: 205 MSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKD--KTHQNYKKLITESTLL 262 K+ DQDV+N L L LP +N + + + E+ ++ Sbjct: 180 ALHAGQLKFNDQDVLNGLFHDSKLLLPFRWNVQDGLLRKRRKIRPEVMPKLDQELENPVI 239 Query: 263 IHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSII 305 IH+TG KPW+ + P + ++ + W+ P S Sbjct: 240 IHFTGHRKPWNFSCLNPYKNLFFKYVDMTEWRGFRPIVPLSWK 282 >UniRef50_C7IBC1 Glycosyl transferase family 8 n=2 Tax=Clostridium papyrosolvens DSM 2782 RepID=C7IBC1_9CLOT Length = 452 Score = 234 bits (598), Expect = 3e-60, Method: Composition-based stats. 
Identities = 77/327 (23%), Positives = 132/327 (40%), Gaps = 14/327 (4%) Query: 26 ECLNVAYGVDANYLDGVGVSITSIVLNNR-HINLDFYIIADVYNDGFFQKIAKLAEQNQL 84 E + + D++Y+ +GV ITS++ N +L+FY+I D + + Sbjct: 2 ETVKIVSACDSHYVQHLGVMITSLLENTSMKTSLEFYVIDGGITDADKELLCSCTCLYGC 61 Query: 85 RITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLH 144 +I I D + S A YFR+F +LL ++++++YLD D+V DI++L Sbjct: 62 KINFITIQADFYARFGESPSASDATYFRIFVSELLDTSVEKVIYLDCDIVVIKDIAELWK 121 Query: 145 LGLNGAVAAVVKDVEPMQEKAV----SRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKA 200 ++ A V D R + YFN+GV+ ++L KW + +++ Sbjct: 122 TDVSEYFLAAVADCGVEYSGEYAVTLKRKLGMKRKDCYFNAGVLLINLVKWREESISKSI 181 Query: 201 LSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELK-DKTHQNYKKLITES 259 L + DQD +N +L L L +N + +K Sbjct: 182 CKFLFENKGKIDFADQDGLNAVLCNRWLPLDSRWNQQVAHCEFYEQEKVVWENVTRAVRE 241 Query: 260 TLLIHYTGA----TKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKH-- 313 +IHYT + TKPW+ ++P + Y L +PWK P D K Sbjct: 242 PWIIHYTTSYFSGTKPWNYLDMHPYRQEYYRYLHMTPWKSFIPPDRTIWNILLKIIYEAY 301 Query: 314 --LLVQHHYISGIIAGVCYLCRKYYRK 338 L+ ++Y I Y + +K Sbjct: 302 AGRLLINYYRRSIKPTYRYETARLPKK 328 >UniRef50_B6XJW2 Putative uncharacterized protein n=1 Tax=Providencia alcalifaciens DSM 30120 RepID=B6XJW2_9ENTR Length = 333 Score = 234 bits (596), Expect = 5e-60, Method: Composition-based stats. 
Identities = 116/319 (36%), Positives = 180/319 (56%), Gaps = 8/319 (2%) Query: 21 NINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADV-YNDGFFQKIAKLA 79 I+ S C +VAYG+D N+L G GVSI S++++N HI F+I D +D K A++ Sbjct: 19 EIDDSSCQHVAYGIDHNFLYGSGVSIVSLLMHNPHIQFAFHIFIDNSMSDEDIAKFAEIC 78 Query: 80 EQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDI 139 +IT+Y I+++ ++ LP T+ W+ A+YFR + +D LLYLDADVVC +I Sbjct: 79 HLYNTKITIYFIDSNNVKKLPTTKNWTHAIYFRFIIAEYFKDKIDYLLYLDADVVCNRNI 138 Query: 140 SQLLHLGLNGAVAAVVKDVE-PMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTE 198 +LL L G +AAVV + + +K L P + YFNSGV+Y++L+ W +TE Sbjct: 139 DELLSHNLLGYIAAVVPERDKAWWQKRADSLGFPSVSKGYFNSGVMYINLRTWKTNNVTE 198 Query: 199 KALSILMSKD--NVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLI 256 K++++LM + + YPDQDV+N+LL LF+ +NT +++ ELK +++ + Sbjct: 199 KSMALLMDNEVSHRLVYPDQDVLNILLTDSVLFISSIFNTQFSLNYELK----KSFDFPV 254 Query: 257 TESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHLLV 316 +T+ IHY G TKPWH+WA Y + + + A SPW++ AKS + KH + Sbjct: 255 KRTTVFIHYVGPTKPWHEWANYETAQPFLEARAVSPWRNVPLLKAKSSNHLRYCAKHNIN 314 Query: 317 QHHYISGIIAGVCYLCRKY 335 Q Y + Y K Sbjct: 315 QRKYFFAFKNYIAYFFSKI 333 >UniRef50_D2TY85 UDP-D-galactose:(Glucosyl)lipopolysaccharide-alpha-1, 3-D-galactosyltransferase n=1 Tax=Arsenophonus nasoniae RepID=D2TY85_9ENTR Length = 343 Score = 232 bits (591), Expect = 2e-59, Method: Composition-based stats. 
Identities = 109/328 (33%), Positives = 176/328 (53%), Gaps = 8/328 (2%) Query: 14 AWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQ 73 +++F A+ T + ++AYG D N+ G +SI S++ N+ FYI D ++ + Sbjct: 21 SYEFSSADAKTPQ-FHIAYGADKNFSLGTAISICSMLYFNKIYTFHFYIFTDTISECDLK 79 Query: 74 KIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADV 133 K +L +IT+ I+T +L+ LP ++WS A+YFR +++LYLD+D+ Sbjct: 80 KFDELTSCYNTKITILLIDTLQLKKLPTNKLWSHAIYFRFIIANYFHNKTNKILYLDSDI 139 Query: 134 VCKGDISQLLHLGLNGAVAAVVKDVEPM-QEKAVSRLSDPELLGQYFNSGVVYLDLKKWA 192 +C GDIS+L + LN + A V D + +K L+ PE+ YFNSGV+ +D KW Sbjct: 140 ICSGDISELFDIDLNQHIIAAVADRDQYLWKKRAEMLATPEIANGYFNSGVMLIDTDKWH 199 Query: 193 DAKLTEKALSILMSKDN--VYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQ 250 K+TEK ++IL+ + + DQD +N+ L LFL +++NT ++I ELK+KT Sbjct: 200 KNKITEKTINILLDDKTKAKFVFYDQDALNISLVNQVLFLDKKFNTQFSINYELKNKT-- 257 Query: 251 NYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKR 310 I + IHY G TKPW+ W+ YPS + +NSPWK A + +++ Sbjct: 258 --LFPIINNVKFIHYIGPTKPWNIWSEYPSTHLFMTIKKNSPWKTTPLIAASTSNQYRYA 315 Query: 311 YKHLLVQHHYISGIIAGVCYLCRKYYRK 338 KH+ + YI ++ + Y K K Sbjct: 316 AKHMFNKKKYIYWLLNYLYYFVNKALHK 343 >UniRef50_P27128 Lipopolysaccharide 1,3-galactosyltransferase n=43 Tax=Enterobacteriaceae RepID=RFAI_ECOLI Length = 339 Score = 228 bits (581), Expect = 2e-58, Method: Composition-based stats. 
Identities = 104/331 (31%), Positives = 168/331 (50%), Gaps = 9/331 (2%) Query: 12 VKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGF 71 D+ + CL++AYG D N+L G G+SI SI+ N L F+I D + D Sbjct: 13 NSVIDYDHKVETENLCLDIAYGTDKNFLFGCGISIASILKYNEGSRLCFHIFTDYFGDDD 72 Query: 72 FQKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDA 131 + LA Q + RI +Y IN D+L+ LP T+ W+ A+YFR ++LYLDA Sbjct: 73 RKYFDALALQYKTRIKIYLINGDRLRSLPSTKNWTHAIYFRFVIADYFINKAPKVLYLDA 132 Query: 132 DVVCKGDISQLLHLGLNGAVAAVV--KDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLK 189 D++C+G I L++ A+V + EK L + YFNSG + ++ Sbjct: 133 DIICQGTIEPLINFSFPDDKVAMVVTEGQADWWEKRAHSLGVAGIAKGYFNSGFLLINTA 192 Query: 190 KWADAKLTEKALSILMSKD--NVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDK 247 +WA +++ +A+++L + +PDQDV+N+LL +F +YNT +++ +LK+ Sbjct: 193 QWAAQQVSARAIAMLNEPEIIKKITHPDQDVLNMLLADKLIFADIKYNTQFSLNYQLKE- 251 Query: 248 THQNYKKLITESTLLIHYTGATKPWHKW-AIYPSVKYYKIALENSPWKDDSPRDAKSIIE 306 ++ +T T+ IHY G TKPWH W YP + + A SPWK+ + + + Sbjct: 252 ---SFINPVTNDTIFIHYIGPTKPWHDWAWDYPVSQAFMEAKNASPWKNTALLKPNNSNQ 308 Query: 307 FKKRYKHLLVQHHYISGIIAGVCYLCRKYYR 337 + KH+L +H Y+ G + Y K Sbjct: 309 LRYSAKHMLKKHRYLKGFSNYLFYFIEKIKH 339 >UniRef50_A1BHG0 Glycosyl transferase, family 8 n=2 Tax=Chlorobium/Pelodictyon group RepID=A1BHG0_CHLPD Length = 307 Score = 224 bits (570), Expect = 5e-57, Method: Composition-based stats. 
Identities = 63/294 (21%), Positives = 136/294 (46%), Gaps = 11/294 (3%) Query: 21 NINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAE 80 ++ +N+ + D NY+ + ++ S++ NN+ ++ YII+ ++ ++ I ++ + Sbjct: 1 MLHMKNTVNIVFATDKNYIQHLSAALVSLLENNKDLSFTVYIISSGMSEKSYRNIEEIIK 60 Query: 81 QNQLRITLYRINTDKLQCLPCTQ-VWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDI 139 + ++ + L + + Y+RL L+ +++LYLD+D++ G I Sbjct: 61 TGNCTVKHITVSDELFVKLATAHPFYPKGTYYRLLIPDLIDE--EKILYLDSDIIVNGSI 118 Query: 140 SQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEK 199 +L + + ++D + R + YFNSG++ ++L KW L +K Sbjct: 119 KELYNQDVEDYFVCAIEDPGFDRH----RQLQMDKESIYFNSGMMLINLAKWKSTGLQKK 174 Query: 200 ALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNY----KKL 255 + + + +PDQ +N ++ G +P +YN +I S+ +K + Sbjct: 175 VIDFIEHNPDAIWFPDQCGLNSVINGRWKKVPLKYNQQSSIFSDDFEKKFDCFSVEELAE 234 Query: 256 ITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKK 309 ++ ++IHYTG +KPWH +P K Y L+ +P+++ D + K Sbjct: 235 AKKNPVIIHYTGGSKPWHFKNRHPYKKLYWKYLKMTPYRNAIYSDLMPMYLLKS 288 >UniRef50_UPI000197C525 UDP-D-galactose:(glucosyl) lipopolysaccharide-alpha-1,3-D-galactosyltransferase n=1 Tax=Providencia rettgeri DSM 1131 RepID=UPI000197C525 Length = 339 Score = 224 bits (570), Expect = 5e-57, Method: Composition-based stats. 
Identities = 112/336 (33%), Positives = 169/336 (50%), Gaps = 9/336 (2%) Query: 4 FPAIEIDKVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYII 63 F K + + + + NVAYG D N+L G GVSI S++LNN+ IN F++ Sbjct: 4 FENAIQGKTSFTNKDVNKDLSKKKFNVAYGADKNFLFGTGVSIVSVLLNNKDINFHFHVF 63 Query: 64 ADVYNDGFFQKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTL 123 D +D Q +++++Q + +TL+ +N D L+ LP QVWS A+YFRL Sbjct: 64 TDFLSDKDIQLFSQISKQYKTSVTLHTLNMDILKKLPTNQVWSHAIYFRLIIADYFYKKC 123 Query: 124 DRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQ-YFNSG 182 D++LYLD+DVVC G I L L L+ A V D+ ++ L + E + + YFNSG Sbjct: 124 DKVLYLDSDVVCTGSIQILKSLNLSSMPIAAVMDISEPHSVEMANLFNVEGIKKGYFNSG 183 Query: 183 VVYLDLKKWADAKLTEKALSILMSKDNV--YKYPDQDVMNVLLKGMTLFLPREYNTIYTI 240 V+ ++ +W +LTEK++S+ K KY DQD +N+ + G L L +N + Sbjct: 184 VMLINPDEWNYRQLTEKSMSVFTDKKLQPVIKYYDQDAINIAVHGDWLKLDNIFNHRINL 243 Query: 241 KSELKDKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSV-KYYKIALENSPWKDDSPR 299 K K K + + +H+ G+TKPWH W+ Y + + A E SPWKD Sbjct: 244 NDRYKHK-----KNNDISNAVFVHFIGSTKPWHNWSKYYHEVRCFLNAKEKSPWKDIDLM 298 Query: 300 DAKSIIEFKKRYKHLLVQHHYISGIIAGVCYLCRKY 335 ++I K KH + Y+S V Y K Sbjct: 299 TPQNITHHKYASKHFRYKEKYLSSFYHYVLYTILKI 334 >UniRef50_D2RIJ4 Glycosyl transferase family 8 n=2 Tax=Acidaminococcus RepID=D2RIJ4_ACIFE Length = 309 Score = 222 bits (567), Expect = 1e-56, Method: Composition-based stats. 
Identities = 68/302 (22%), Positives = 120/302 (39%), Gaps = 18/302 (5%) Query: 26 ECLNVAYGVDANYLDGVGVSITSIVLNNR-HINLDFYIIADVYNDGFFQKIAKLAEQNQL 84 + +++ D NY V+ SI+ N+R + FY D ++ IA Q Sbjct: 2 DEISIVLASDDNYAQHGAVACASILANHRGERPIHFYYFDDGISEEKQAGIAATVTGLQG 61 Query: 85 RITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLH 144 IT ++Q + +RA Y RL +L+ + R++YLD D+V DI +L Sbjct: 62 SITFIPTAGKEIQAH-TSGHVNRAAYLRLLIPELVPQAVHRVIYLDTDLVVLDDIQELWE 120 Query: 145 LGLNGAVAAVVKDVEPMQEKAVSRLS----DPELLGQYFNSGVVYLDLKKWADAKLTEKA 200 + L G V D+ + + R + YFNSGV+ ++L+ W + + ++ Sbjct: 121 MDLQGKPVGAVPDLGILASSRMRRQKEETLGIQEGKLYFNSGVMVMELEAWREKQYGDQV 180 Query: 201 LSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSE----LKDKTHQNYKKLI 256 + + +++ DQD +N + + LP +N I + + LK +N Sbjct: 181 IRCVEE--GNFRHHDQDGLNKVFQDNWQPLPLRWNVIPPVFTLPVKVLKKSRWRNLALEA 238 Query: 257 TESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDA------KSIIEFKKR 310 E + H+ G KPW + Y L + + KS + R Sbjct: 239 LERPAVFHWAGRYKPWEFPPKGHFNEKYYTYLARTAFAGAKMPQPGKDMKGKSFTRQEWR 298 Query: 311 YK 312 K Sbjct: 299 LK 300 >UniRef50_P25148 General stress protein A n=3 Tax=Bacillus subtilis group RepID=GSPA_BACSU Length = 286 Score = 221 bits (562), Expect = 4e-56, Method: Composition-based stats. 
Identities = 58/280 (20%), Positives = 120/280 (42%), Gaps = 10/280 (3%) Query: 24 TSECLNVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKIAKLAEQN 82 E +++ D NY +G S++ N ++ + Y+I +++ + + Sbjct: 3 KDEIMHIVSCADDNYARHLGGMFVSLLTNMDQEREVKLYVIDGGIKPDNKKRLEETTLKF 62 Query: 83 QLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLL-GLTLDRLLYLDADVVCKGDISQ 141 + I ++T+ + + ++A Y+R+ L+ ++ R++Y+D D + DIS+ Sbjct: 63 GVPIEFLEVDTNMYEHAVESSHITKAAYYRISIPDLIKDESIKRMIYIDCDALVLEDISK 122 Query: 142 LLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKAL 201 L L + A V+D + ++D G+YFNSG++ +D + W +TEK + Sbjct: 123 LWDLDIAPYTVAAVEDAGQHERLKEMNVTD---TGKYFNSGIMIIDFESWRKQNITEKVI 179 Query: 202 SILMSKDNV--YKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKT---HQNYKKLI 256 + + + DQD +N +L L +N I +LK + + Sbjct: 180 NFINEHPDEDFLVLHDQDALNAILYDQWYELHPRWNAQTYIMLKLKTPSTLLGRKQYNET 239 Query: 257 TESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDD 296 E+ ++H+ G KPW+ +P Y + + W Sbjct: 240 RENPAIVHFCGGEKPWNSNTKHPYRDEYFHYMSYTKWNTI 279 >UniRef50_Q38VK7 Putative bifunctional glycosyl transferase, family 8 n=2 Tax=Lactobacillus sakei subsp. sakei 23K RepID=Q38VK7_LACSS Length = 569 Score = 219 bits (559), Expect = 9e-56, Method: Composition-based stats. 
Identities = 58/293 (19%), Positives = 126/293 (43%), Gaps = 7/293 (2%) Query: 21 NINTSECLNVAYGVDANYLDGVGVSITSIVLNNRH-INLDFYIIADVYNDGFFQKIAKLA 79 + + +N+ ++N+++ + + SI+ NN + F++++D + ++ Sbjct: 278 PADKRDQINIVSAANSNFVEPLAILYASILNNNDDDRHYAFFVLSDQLTARDQATLRQIT 337 Query: 80 EQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDI 139 E +T ++ L + + Y+RL LL ++R+LYLD D +C ++ Sbjct: 338 ESFNAELTFIEVDEIPLTAVIQDGQVLKTAYYRLLIPNLLP-EIERVLYLDCDTLCLENL 396 Query: 140 SQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEK 199 ++L + L A V+D A + + +YFN+GV+ ++L W K+TE+ Sbjct: 397 ARLWDVELGNIPVAAVEDAGFHNRLAQMAIDYKSI--RYFNAGVLLMNLTIWRQQKITEQ 454 Query: 200 ALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLI--- 256 L+ + ++ DQD +N +L + L ++N +I + + + Sbjct: 455 ILTFIKEYPQKLRFHDQDALNAILHDRWIHLHPKWNVQTSILMDFIVAPTERINRQFLSA 514 Query: 257 TESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKK 309 + LIH+ G+ KPW K + +P Y+ ++ + + Sbjct: 515 QKEPGLIHFCGSEKPWDKSSTHPYTPQYRFYKSRFLENNNPVPFRANTTFDEY 567 Score = 177 bits (450), Expect = 3e-43, Method: Composition-based stats. 
Identities = 60/272 (22%), Positives = 119/272 (43%), Gaps = 6/272 (2%) Query: 26 ECLNVAYGVDANYLDGVGVSITSIVLNNR-HINLDFYIIADVYNDGFFQKIAKLAEQNQL 84 + + + D Y D + +++ SI + +D ++++ + + +L Sbjct: 8 KTIAIMVAADEQYADQMLLTLKSIREHCTLETAIDLFVLSSDLSHATKSAVNRLMTLPH- 66 Query: 85 RITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQ-LLGLTLDRLLYLDADVVCKGDISQLL 143 ++ IN +++ P + + Y+R+ A Q LL ++R+LYLD D + + D++ L Sbjct: 67 HVSFIAINPRRIKNFPGNNHFDQTAYYRILAPQILLARHIERVLYLDLDTLIRTDLTPLY 126 Query: 144 HLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSI 203 L G + V D + + YFN+GV+ +D W +++K L++ Sbjct: 127 DSDLEGNIIGAVIDPGKALTLKRLGVPKSQANNIYFNAGVLIIDTILWETHHISQKILAM 186 Query: 204 LMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNY---KKLITEST 260 L+ QD +NV+L G T L ++N I + + + Y K + Sbjct: 187 LVPYPGRRVNDIQDALNVVLAGRTKLLAPKWNVQNAILFKTYEPINNEYSQLFKQAIMAP 246 Query: 261 LLIHYTGATKPWHKWAIYPSVKYYKIALENSP 292 +IH+T KPW + +P + Y++ L P Sbjct: 247 KIIHFTTEKKPWEVFLEHPYMSEYQVYLSQLP 278 >UniRef50_Q64ZV2 Lipopolysaccharide 1,2-glucosyltransferase n=8 Tax=Bacteroides RepID=Q64ZV2_BACFR Length = 311 Score = 219 bits (558), Expect = 1e-55, Method: Composition-based stats. 
Identities = 80/313 (25%), Positives = 147/313 (46%), Gaps = 12/313 (3%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 +++A +D+N+ V++TS+ NNR+ +IIA + + ++ +AE +I Sbjct: 2 IHIACNIDSNFTIHCAVTLTSLFANNRNSEFCVHIIASTLPEADQKALSSIAESYGNKIC 61 Query: 88 LYRINTDKLQCLPCTQ---VWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLH 144 Y D L + S A Y+R ++L + +D++LY+D D+V DIS+ Sbjct: 62 FYFPEKDLLNNFSIKKSGNRISIATYYRCLLSRILPVNIDKILYIDCDIVVLNDISEFWD 121 Query: 145 LGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSIL 204 + ++D+ +E+ SRL + YFN+GV+ ++LK W + K+ E Sbjct: 122 TDITQYAIGCIEDIGSDEEEYYSRLQY-DKKYSYFNAGVLLINLKYWREHKIDEMCEQYF 180 Query: 205 MSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNY--KKLITESTLL 262 ++ + ++ DQD++N LL LF+P +N T + + K + Sbjct: 181 LAHSDRIRFNDQDLLNALLYKDKLFVPFRWNVQDTFYRRTYSHKVKEHSGLKEALLHPAI 240 Query: 263 IHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHLLVQHHYIS 322 +HYT KPW+ +++P + Y L+ +PWK + II+F+ R + YI+ Sbjct: 241 LHYT-NKKPWNYDSMHPLKQEYFKYLDMTPWKGT-----RPIIDFQTRVITGFKRLLYIT 294 Query: 323 GIIAGVCYLCRKY 335 GI + Y Sbjct: 295 GIKKSKYINLKDY 307 >UniRef50_C7QL87 Glycosyl transferase family 8 n=2 Tax=Cyanothece RepID=C7QL87_CYAP0 Length = 283 Score = 218 bits (555), Expect = 2e-55, Method: Composition-based stats. 
Identities = 75/295 (25%), Positives = 135/295 (45%), Gaps = 12/295 (4%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 +++ + D NY GV+ITS++LNN + +++ + F +KI KL + Q + Sbjct: 1 MDILFCFDKNYEQHFGVAITSLILNNTNKIKTIHLVTKDNSKDFLKKIDKLKSKTQAKFF 60 Query: 88 LYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGL 147 +Y + L + + S A Y+RL A +LL L ++LYLD+D+V + L ++ + Sbjct: 61 IYSPDDKDLSNVKVSAHISTAAYYRLLAPELLPQDLKKILYLDSDLVVNSSLENLYNMDI 120 Query: 148 NGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSK 207 + + A + ++ YFNSGV+ ++L+ W + K L Sbjct: 121 SDDILAA---YAGGKMGPGTKKRLQLTGDFYFNSGVMLINLEAWRTENIGNKCFKFLQEN 177 Query: 208 DNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTG 267 ++ + DQD +N ++ G L + +N++ + + T+Q+ +IH+TG Sbjct: 178 PDMIRLWDQDALNKIVDGKFLNIDGIWNSLVDLTTGETRVTNQSI---------IIHFTG 228 Query: 268 ATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHLLVQHHYIS 322 KPW W I P + Y L SPW + P+ K+ E K + Q Sbjct: 229 TLKPWQSWCIRPEKQIYWYYLRQSPWSNAYPQFPKNFQEMLLAIKSVYKQIKPKK 283 >UniRef50_Q5LF36 Putative glucosyltransferase n=1 Tax=Bacteroides fragilis NCTC 9343 RepID=Q5LF36_BACFN Length = 308 Score = 217 bits (554), Expect = 3e-55, Method: Composition-based stats. 
Identities = 64/282 (22%), Positives = 132/282 (46%), Gaps = 5/282 (1%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 +++ + +D +Y+ GV+ITS+ +NN + + F+I+ + + + K+ ++ + +I Sbjct: 1 MDIVHCIDNSYVAQCGVTITSVCVNNVNEVILFHILTTNLSIFNREMLKKIVDKYRQKII 60 Query: 88 LYRINTDKLQCLPCT--QVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHL 145 Y ++ L P S A YFR+ +L +L+++LYLD D+V +I +L Sbjct: 61 FYNVDEYLLNKCPLREGDHVSLATYFRILMPDILPKSLNKVLYLDCDLVVCKNIKRLWDT 120 Query: 146 GLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILM 205 ++ V D + +RL ++ YFN+GV+ ++L W + ++ K L + Sbjct: 121 DISTHSLGAVYDGGTDDIRTYNRLKY-DIRQGYFNAGVLLVNLAYWREFHISNKLLKFIE 179 Query: 206 SKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKL--ITESTLLI 263 + DQD +N +L T LP +YN + ++ + ++ ++ Sbjct: 180 QYPERLMFWDQDALNSVLIQTTKILPFKYNMLDAFYTKELALREEYLFEIEGALCDPTIL 239 Query: 264 HYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSII 305 H++ KPW K +P ++ L+ + W D P ++ Sbjct: 240 HFSSPNKPWLKTCDHPLKSFFFEYLKRTSWNDKFPIYPFNMS 281 >UniRef50_C2HBB8 Family 8 glycosyltransferase n=13 Tax=Lactobacillales RepID=C2HBB8_ENTFC Length = 300 Score = 214 bits (546), Expect = 3e-54, Method: Composition-based stats. 
Identities = 65/295 (22%), Positives = 129/295 (43%), Gaps = 11/295 (3%) Query: 24 TSECLNVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKIAKLAE-- 80 + + V + ++ + S++ N N + FY+I D + Q + + Sbjct: 2 NKKEIAVVASCNTKFVPHLAALFVSVLDNCNPSKFVRFYVIDDDIDFESKQLLRFSVKNA 61 Query: 81 QNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLG-LTLDRLLYLDADVVCKGDI 139 + + +IN + + + Y+R+ +L ++R+LY+D D++ DI Sbjct: 62 RMNSDVEFLKINKEFFTNVVISDRIPETAYYRIAIPELFRGTEVERILYMDCDMIALQDI 121 Query: 140 SQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEK 199 S+L L ++ A V+D Q + ++ P +YFNSG++ +++KKW D +T+K Sbjct: 122 SKLWRLDFGDSIVAAVEDAGFHQR--LEKMEIPAKSMRYFNSGLMLINVKKWLDENITQK 179 Query: 200 ALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDK---THQNYKKLI 256 L + ++ DQD +N +L L L +N I ++ K + + Sbjct: 180 VLDFIEHNPEKLRFHDQDALNAILHDRWLPLHPRWNAQGYIMAKAKKHPTAAGEREYEET 239 Query: 257 TESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWK--DDSPRDAKSIIEFKK 309 + +IH++G KPW K P+ KYY+ + ++ P+ K +K Sbjct: 240 RNNPYIIHFSGHVKPWSKDFEGPTKKYYEKYAGMTAFRCVAKFPKYPKYAKIQQK 294 >UniRef50_C3X7M2 Lipopolysaccharide 3-alpha-galactosyltransferase n=1 Tax=Oxalobacter formigenes OXCC13 RepID=C3X7M2_OXAFO Length = 307 Score = 214 bits (546), Expect = 3e-54, Method: Composition-based stats. 
Identities = 83/316 (26%), Positives = 150/316 (47%), Gaps = 14/316 (4%) Query: 24 TSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQ 83 ++A+GVD Y + V+I SI+ NN++ N+ F++I + +D +I K Q Sbjct: 1 MKNEFHIAFGVDTIYAPKMCVTIASILENNKNSNIIFHVIYNDLSDKVIDEIKKSMLTLQ 60 Query: 84 LRITLYRINTDK--LQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQ 141 I + I+ D + + R F +LL DR LYLDAD++C +IS Sbjct: 61 AEINFHFIDVDLSIFPKFSNFSHITSGAFLRFFIPELLQGLTDRALYLDADIICINNISD 120 Query: 142 LLHLGLNGA-VAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKA 200 L HL ++ + AVV+D++ + +YFNSGV+ +D++KW + + Sbjct: 121 LFHLEMDENEILAVVEDIDSETYLN----ENASFQKRYFNSGVLMMDIEKWNKNNVYGQL 176 Query: 201 LSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITEST 260 LS+L K + + DQD +N+++ +L +N + + K K + + E+ Sbjct: 177 LSVLNEKGSGFNLIDQDALNLVMIDKVHYLDNIWNYMINAEQLDKKKEKYS----VPENA 232 Query: 261 LLIHYTGATKPWHKWAI-YPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHLLVQHH 319 IH+ G KPWH + I Y + + W D K+ E ++ ++ + + Sbjct: 233 KFIHFVGPVKPWHCYNIFDDITGLYLNYQKKTVW--DGLEMPKNYKEMRRYARYSFKKGN 290 Query: 320 YISGIIAGVCYLCRKY 335 Y++G+ G+ Y+ K+ Sbjct: 291 YLTGLNWGMRYIKTKF 306 >UniRef50_D2ELM0 Lipopolysaccharide biosynthesis glycosyltransferase n=1 Tax=Pediococcus acidilactici 7_4 RepID=D2ELM0_PEDAC Length = 552 Score = 214 bits (546), Expect = 3e-54, Method: Composition-based stats. 
Identities = 62/268 (23%), Positives = 112/268 (41%), Gaps = 7/268 (2%) Query: 27 CLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQN-QLR 85 +NV ++ + + S SI+ N+ +F+++ D D + + + Sbjct: 278 VINVISAANSAFTQALATSYVSILENDPDHQYNFFLLPDHLTDRDMMLLGSIIARYDNAT 337 Query: 86 ITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHL 145 I + +N + L + + Y+R+ A LL +++R +YLD D++ + +L Sbjct: 338 IKVVEVNEELLANAVESDRIVKTAYYRILAPALLP-SINRAIYLDCDIIANTSLHELWQT 396 Query: 146 GLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILM 205 L G V A V+D + ++ + +YFNSG++ +DL +W T+K L + Sbjct: 397 NLEGNVIAAVEDAG--FHDRLEKMGITKENEKYFNSGMMLIDLVRWRARSTTQKVLDYIN 454 Query: 206 SKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLI---TESTLL 262 ++ DQD +N L L L ++N I E + E L Sbjct: 455 QNPEKLRFHDQDALNANLYDDWLHLHPQWNAQSNIIMETIFPPRTELLEPYAETREDPKL 514 Query: 263 IHYTGATKPWHKWAIYPSVKYYKIALEN 290 IH+ G KPWH+ +P Y E Sbjct: 515 IHFCGHVKPWHEGCEHPYADVYLKYHEM 542 Score = 191 bits (486), Expect = 3e-47, Method: Composition-based stats. 
Identities = 66/267 (24%), Positives = 120/267 (44%), Gaps = 7/267 (2%) Query: 26 ECLNVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKIAKLAEQNQL 84 + +N+ D NY D + ++I + + N F ++ + D + KL Sbjct: 2 KKINILLAADRNYADQLCITIKTALETLNSATRAHFIVLTNNLGDQTRALLDKLMHNFH- 60 Query: 85 RITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLL-GLTLDRLLYLDADVVCKGDISQLL 143 I ++ ++ P Q ++ YFR+ A +LL +DRL+YLD DV+ + D+++L Sbjct: 61 TIEYLNLDDERFDFCPTNQHINKTAYFRIIAPKLLASRQIDRLIYLDVDVLIRKDLTELA 120 Query: 144 HLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQ-YFNSGVVYLDLKKWADAKLTEKALS 202 LN V D + YFNSG++ +D+ +W ++TEK L+ Sbjct: 121 ESNLNQNTVGAVIDTGQAFALHRLGVDPVVAASNLYFNSGIMVIDVAQWNAHRITEKTLA 180 Query: 203 ILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLI---TES 259 + + + + DQD +N +L G FL ++N +I +Q Y +LI + Sbjct: 181 FIRNHADRIIFHDQDALNAVLAGEVQFLHPKWNLQNSIIFRKHRPINQGYAELIDEAIKE 240 Query: 260 TLLIHYTGATKPWHKWAIYPSVKYYKI 286 ++H+T KPW ++P + Y Sbjct: 241 PSIVHFTTHEKPWKDLTVHPYLDEYHE 267 >UniRef50_C2HBB9 Family 8 glycosyltransferase n=35 Tax=Lactobacillales RepID=C2HBB9_ENTFC Length = 305 Score = 214 bits (545), Expect = 3e-54, Method: Composition-based stats. 
Identities = 66/272 (24%), Positives = 116/272 (42%), Gaps = 10/272 (3%) Query: 27 CLNVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKIAKLAEQN--Q 83 + V D NY + V I + + N N+ + FY+I D ++ Q + + + Sbjct: 27 VVPVVTASDENYAPYLSVMIATALENCNKARRIKFYVIDDGLSEYSKQGLEETVNKYSSN 86 Query: 84 LRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLL-GLTLDRLLYLDADVVCKGDISQL 142 I + D + + + Y R+ LL ++LYLD+DV+ DI +L Sbjct: 87 ASIQFLTVEKDIYEDFLVSDHITTTAYLRISLPNLLAKEDYKKVLYLDSDVLVLDDIVKL 146 Query: 143 LHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALS 202 LNG + D ++ + YFNSGV+ +D+ +W ++TEK + Sbjct: 147 YDEPLNGKTIGAIIDPGQVK---ALERLGIDSDDLYFNSGVMVIDIDQWNKKEITEKTIH 203 Query: 203 ILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLIT---ES 259 L + Y DQD +N +L L ++N ++ E ++ Y++L E Sbjct: 204 YLSENGDRIIYHDQDALNAVLYEDWEQLHPKWNMQTSLIFERHPAPNEKYERLYKEGNEK 263 Query: 260 TLLIHYTGATKPWHKWAIYPSVKYYKIALENS 291 ++H+TG KPW+ +P Y L +S Sbjct: 264 PSIVHFTGHDKPWNTLKDHPYTNLYLKKLAHS 295 >UniRef50_B7AIV0 Putative uncharacterized protein n=1 Tax=Bacteroides eggerthii DSM 20697 RepID=B7AIV0_9BACE Length = 321 Score = 212 bits (539), Expect = 2e-53, Method: Composition-based stats. 
Identities = 63/287 (21%), Positives = 126/287 (43%), Gaps = 6/287 (2%) Query: 15 WDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQK 74 ++ ++ +++ ++ Y +I SI +NN++ + ++I D + + Sbjct: 2 YNTTNIPSIKTKAIHIVVCINDAYSQHCAATIASIFINNKNEVIKIHVITDYISKKNQSR 61 Query: 75 IAKLAEQNQLRITLYRINTDKLQCLPCTQ-----VWSRAMYFRLFAFQLLGLTLDRLLYL 129 + K+A +I Y N L PC + + Y+RLF Q+L L + + YL Sbjct: 62 LEKIAFNFNQQIQFYTFNNSTLNRWPCFKDGMPPHVTIQTYYRLFIPQILPLNIKKTFYL 121 Query: 130 DADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLK 189 D D++ + + + + A + D +A +RL +YFN+GV+ L+L+ Sbjct: 122 DCDLLVLHPLREFWNTKMQNKGVAAIADQWTDYIEAATRLKY-RNDREYFNAGVLLLNLE 180 Query: 190 KWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTH 249 + T A+ + N Y DQDV+N L+ + +P ++N ++ + Sbjct: 181 YLRNHNFTNNAIDFVTKHANDIVYHDQDVLNKLIGENRIIMPVKWNVCSFKINDKIPHIY 240 Query: 250 QNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDD 296 + +IH+ KPW++ + +P YY L+ +PWK + Sbjct: 241 NATMNDARKDPYIIHFFAPIKPWNQDSSHPYRSYYYYFLQFTPWKHE 287 >UniRef50_B3Q568 Putative glycosyltransferase protein n=4 Tax=Rhizobium etli RepID=B3Q568_RHIE6 Length = 331 Score = 211 bits (537), Expect = 3e-53, Method: Composition-based stats. 
Identities = 69/308 (22%), Positives = 123/308 (39%), Gaps = 16/308 (5%) Query: 29 NVAYGVDANYLDGVGVSITSIVLNNRH-INLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 + + VDA Y + ++ S+ NN+ LD ++I + + + I + N I Sbjct: 21 PIVFAVDAAYAVPLATALRSVAENNQSVWPLDIHVIHEGIGEETKRLILESLPANSAIIQ 80 Query: 88 LYRINTDKLQCLPCTQVW-SRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLG 146 + I T T+ S+ + R+ Q L T DR LYLD D++ + QL + Sbjct: 81 WHPIATLSFASGFSTRPGVSKMTFARILLPQFLPQTCDRALYLDGDILVLTSLEQLWNTD 140 Query: 147 LNGAVAAVVKDVE-PMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILM 205 L AV V D + L+ +YFN+G++ +DL KW + +++E++L L Sbjct: 141 LGEAVIGAVPDYWLDNPAGSGPGARGGALVKRYFNAGILLIDLAKWRNERISERSLDYLD 200 Query: 206 SKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHY 265 +Y DQD +NV G L R +N + + + + ++H+ Sbjct: 201 RFP-TTEYSDQDALNVACDGKWKILDRAWNFQF-------EPRQAIAGIALEQKAAIVHF 252 Query: 266 TGATKPWHKWAIYPSVKYYKIA-----LENSPWKDDSPRDAKSIIEFKKRYKHLLVQHHY 320 KPW ++ P+V +Y +PW ++ R L Y Sbjct: 253 VTNVKPWKSGSLSPNVAFYDAFRSRTCFALTPWGRVRSGLKRTGSRLLARSALLRTAWSY 312 Query: 321 ISGIIAGV 328 + + Sbjct: 313 TKSAVRAI 320 >UniRef50_A7VNX5 Putative uncharacterized protein n=1 Tax=Clostridium leptum DSM 753 RepID=A7VNX5_9CLOT Length = 344 Score = 211 bits (536), Expect = 4e-53, Method: Composition-based stats. 
Identities = 65/306 (21%), Positives = 128/306 (41%), Gaps = 19/306 (6%) Query: 27 CLNVAYGVDANYLDGVGVSITSIVLNNRH-INLDFYIIADVYNDGFFQKIAKLAEQNQLR 85 +N + D N+ D +G ++ S+ NNR ++ YI+ ++G +K+ + +Q + Sbjct: 12 RMNCVFSSDDNFADILGCALISLFENNREQETIEVYILDGGISEGNKRKLESIFQQYERM 71 Query: 86 ITLYRINT--DKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLL 143 + + + W + + R+ LL + R+LYLD D++ G + L Sbjct: 72 VHFIEVPDISQLTGEAVTSGRWPISTFARILIDSLLPKEVKRVLYLDCDILVLGSLKNLW 131 Query: 144 HLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSI 203 + L AA V D Q K + ++ + Y N+GV+ +D+ KW + ++ ++ ++ Sbjct: 132 EIDLKDKTAAGVMDCLSNQRKQNAGINGEDS---YINAGVMLIDMDKWRENQIEKQCMNY 188 Query: 204 LMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYK---------- 253 + + Y DQ V+N +L L LP EYN + + K Sbjct: 189 IRICNGQVAYNDQGVINKVLHKDLLVLPPEYNAMTLFFDFTYPDMIKYRKPQSYYSAQQV 248 Query: 254 KLITESTLLIHYTGAT---KPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKR 310 + ++H+T + +PW K + +P ++ + SPW+ R K Sbjct: 249 DHARKHPRIVHFTSSFLSLRPWVKGSEHPYAPLWRNYYKRSPWRAKDLRSDNRSSYRKIY 308 Query: 311 YKHLLV 316 K + Sbjct: 309 EKFYRL 314 >UniRef50_C3PWZ8 Lipopolysaccharide 1,2-glucosyltransferase n=1 Tax=Bacteroides sp. 9_1_42FAA RepID=C3PWZ8_9BACE Length = 315 Score = 209 bits (533), Expect = 1e-52, Method: Composition-based stats. 
Identities = 70/313 (22%), Positives = 130/313 (41%), Gaps = 12/313 (3%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 +++A +D+ ++ V+I SI+ NN ++ +I++ ++++AE+ I Sbjct: 1 MHIALTIDSKFVRYCAVTIVSILENNDPKDIMLHIVSGHLPKEDVLTLSQVAEKYGTSIA 60 Query: 88 LYRINTDKLQCL---PCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLH 144 Y I +KLQ Q S +++R +L T+ R++YLD+D + G + +L Sbjct: 61 FYYIPHEKLQNYEVKWQKQRLSMVVFYRCVLASILPSTISRVIYLDSDTLVLGSLKELWD 120 Query: 145 LGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSIL 204 LN A V+D RL Y N GV+ L+L W + ++ + Sbjct: 121 TNLNQLALAGVQDTVSPNPSYFERLQY-APSYNYINGGVLLLNLAYWRKHNIEQQCIKYY 179 Query: 205 MSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIK--SELKDKTHQNYKKLITESTLL 262 + DQD++N LL + + ++N + + ++ Sbjct: 180 QQYPDRIILNDQDILNALLYDQKVLIDIKWNVQDDFYRNNRYTSPAWKPSYTDAILHPII 239 Query: 263 IHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHLLVQHHYIS 322 +HY+G KPW A++P + +P+ D + K I R+ HLL YI Sbjct: 240 LHYSGR-KPWAYHAMHPLRHLFFHYQRLTPYDDSA--KQKKISTRIYRFIHLLP---YIL 293 Query: 323 GIIAGVCYLCRKY 335 G+ +K Sbjct: 294 GLKPKKYVNLKKI 306 >UniRef50_A4ECW2 Putative uncharacterized protein n=1 Tax=Collinsella aerofaciens ATCC 25986 RepID=A4ECW2_9ACTN Length = 328 Score = 209 bits (531), Expect = 1e-52, Method: Composition-based stats. 
Identities = 74/324 (22%), Positives = 135/324 (41%), Gaps = 23/324 (7%) Query: 27 CLNVAYGVDANYLDGVGVSITSIVLNNRH-INLDFYIIADVYNDGFFQKIAKLAEQNQLR 85 +N+ Y VD N++ + +I S+V N+ ++ F++ ++ + + + ++ + Sbjct: 3 IMNLLYTVDNNFVPQLAANICSVVSNHSGIQDITFHVFSNGITEDNQRLLQEMVTEYNQN 62 Query: 86 ITLYRINT--DKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLL 143 + Y I+ D L T W+ + RL L ++R++YLD D + GDI+ L Sbjct: 63 LVFYDISNFKDALGFDFDTSGWNEIVLARLLMAHFLPNEIERVIYLDGDTIVLGDIALLW 122 Query: 144 HLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQ-YFNSGVVYLDLKKWADAKLTEKALS 202 + L G V +V + SRL+D +L G Y N+GV+ +DLK+W ++ L Sbjct: 123 NQDLKGCVVGMVPEPTVGP----SRLNDLDLNGCLYHNAGVLLVDLKQWRSTCCEDQLLD 178 Query: 203 ILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSE--------LKDKTHQNYKK 254 + DQD +N +LK L +N + + +N Sbjct: 179 YCERRSGRLFANDQDALNAVLKDKICSLSPAFNYSNIFDYYPFIFLNSLMPGFSDENSFN 238 Query: 255 LITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHL 314 +++HY G +PW + + Y L + WKD D R + Sbjct: 239 TARSKPIVVHYLGEERPWRRGNTHRFNNEYHFYLSETFWKDAKDEDGWGAYFLAWRTFNF 298 Query: 315 L------VQHHYISGIIA-GVCYL 331 L +++ ISG+I + Y Sbjct: 299 LTRPFPQLRYKVISGLIPAFLKYR 322 >UniRef50_C0WCJ1 Lipopolysaccharide 1,2-glucosyltransferase n=1 Tax=Acidaminococcus sp. D21 RepID=C0WCJ1_9FIRM Length = 338 Score = 208 bits (530), Expect = 2e-52, Method: Composition-based stats. 
Identities = 81/324 (25%), Positives = 138/324 (42%), Gaps = 22/324 (6%) Query: 11 KVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDG 70 K + A L+VAY V+ Y +G S+ S++ NN H + F+I D Y+ Sbjct: 16 KGVETFSKNAEKTDKAPLHVAYNVNDGYFQIMGASLVSVLENNAHRAVMFHIFTDGYSKE 75 Query: 71 FFQKIAKLAEQNQLRITLYRINTDKLQCLPCT-QVWSRAMYFRLFAFQLLGLTLDRLLYL 129 QK+ +LA++ I LY ++ + + +SR Y R+ +L D LYL Sbjct: 76 NAQKMEQLADRYGCVIKLYTLHMEPFADFHVKVERFSRITYGRIVMPLILAAETDHFLYL 135 Query: 130 DADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLK 189 DAD + + +L H L G V + P ++ L G+YFN GV+ +++ Sbjct: 136 DADTMVIRPLDELYHWDLTGKAMGAVSERMPDAKRRGDYL--HLNNGRYFNDGVMMVNIP 193 Query: 190 KWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTH 249 +W +TEKA S+ + QD++N++ G FLP YN + + + K Sbjct: 194 EWQKQNITEKAFSLQKEPKERFLGQSQDILNIVFDGTNAFLPSIYNEFGGGEDDPQQKGT 253 Query: 250 QNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKD----DSPRDAKSII 305 +IH+TG KPW + ++ SPW+ ++ Sbjct: 254 ------------IIHWTGRRKPWQMVLSDYDAQ-WRSYNAASPWETLTAILPILKPENYH 300 Query: 306 EFKK--RYKHLLVQHHYISGIIAG 327 +FK+ +Y+ Y+ G+ Sbjct: 301 DFKEWAKYRRKESFRDYVKGMAYY 324 >UniRef50_C5ELK9 Glycosyl transferase n=3 Tax=Clostridiales RepID=C5ELK9_9FIRM Length = 333 Score = 206 bits (525), Expect = 7e-52, Method: Composition-based stats. 
Identities = 77/316 (24%), Positives = 128/316 (40%), Gaps = 21/316 (6%) Query: 24 TSECLNVAYGVDANYLDGVGVSITSIVLNNRH-INLDFYIIADVYNDGFFQKIAKLAEQN 82 E N+ Y + Y + S+ S++ NNR+ N+D YI++ + +++A +AE Sbjct: 4 NEETANIIYASNDGYAGHLAASMYSLLDNNRNVRNMDIYILSAQMCQEYKERLAGMAEAF 63 Query: 83 QLRITLYRINT--DKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDIS 140 + + + + T+ + + RLFA Q+L T+ + LYLD D + I Sbjct: 64 HRTLHVVELGDLKQRFDFDIDTRGFDISAMGRLFAPQVLPGTVKKALYLDCDTIVCKSIR 123 Query: 141 QLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKA 200 L L AV +V +EP K + Y+NSGV+ + L +W + +K Sbjct: 124 PLYETELGDAVVGMV--MEPTVYKEMKESIGMGKDDPYYNSGVLLMALDRWRQEDVLQKL 181 Query: 201 LSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKD----------KTHQ 250 L S DQD +N LKG LP +YN + + + Sbjct: 182 LDFYKSCHGRLFACDQDTINGALKGRIKTLPVKYNYFTNYRYFRYSTLCSMCAAYREIGE 241 Query: 251 NYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKR 310 S +IHY G +PW K Y+ L +PWKD + K +R Sbjct: 242 EAYLEARRSPAIIHYLGDERPWIAGNHNHFKKLYEYYLAKTPWKDTPKQTGK------ER 295 Query: 311 YKHLLVQHHYISGIIA 326 Y H+ + ++ + Sbjct: 296 YMHMWWLFNRLTWLCP 311 >UniRef50_B6GCA0 Putative uncharacterized protein n=1 Tax=Collinsella stercoris DSM 13279 RepID=B6GCA0_9ACTN Length = 990 Score = 206 bits (525), Expect = 8e-52, Method: Composition-based stats. 
Identities = 52/295 (17%), Positives = 110/295 (37%), Gaps = 23/295 (7%) Query: 25 SECLNVAYGVDANYLDGVGVSITSIVLNNRH-INLDFYIIADVYNDGFFQKIAKLAEQN- 82 + + V + D NY+ + +I S++ N + D ++ + + + Sbjct: 645 RQIVPVVFASDNNYVPMLTTTIHSMLSNASNNYRYDITVLHRDISGANQAIMREFFSSYD 704 Query: 83 QLRITLYRIN--TDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDIS 140 + + ++ +K S Y+R LL D++LYLD+D++ +GD+S Sbjct: 705 NVNLGFCDVSQVIEKYNLTTNNPHISVETYYRFLIQDLLPY-YDKVLYLDSDLIIRGDVS 763 Query: 141 QLLHLGLNGAVAAVVKDVEPMQEKAVSR---------LSDPELLGQYFNSGVVYLDLKKW 191 +L L ++ A D++ + + R + + YF +GV+ L+ + Sbjct: 764 ELFATDLGDSLLAAAHDIDFVANVNMKRGDRFAYAKEVLGMKDPYSYFQAGVLVLNTRAM 823 Query: 192 ADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSE------LK 245 E+ L + + Y DQDV+N +G ++L +N + Sbjct: 824 RSRHTMEEWLEFASD--DRFIYNDQDVLNAHCEGEVVYLDYSWNVMIDCFGRINKVFTFA 881 Query: 246 DKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRD 300 + + ++HY G KPW + Y +P+ + + Sbjct: 882 PAYMFDAFIESRSNEKIVHYAGFEKPWKLAGCDRG-ELYWRYARETPFYESLLQH 935 >UniRef50_C4VEI8 General stress protein A n=24 Tax=Enterococcus RepID=C4VEI8_ENTFA Length = 303 Score = 206 bits (523), Expect = 1e-51, Method: Composition-based stats. 
Identities = 63/282 (22%), Positives = 128/282 (45%), Gaps = 9/282 (3%) Query: 20 ANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHIN-LDFYIIADVYNDGFFQKIAKL 78 + + L + + N++ + SI+ N+ + FY+I D N Q + Sbjct: 2 QEMENRKELAIVSCCNTNFVPHLAAMFVSILENSPSAAAVHFYVIDDNINFESKQLLYFT 61 Query: 79 AEQN--QLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLG-LTLDRLLYLDADVVC 135 + +T ++IN + + ++ + Y+R+ +L ++RLLY+D D++ Sbjct: 62 IKHTQLNAELTFFKINPHFFKNVVTSERIPKTAYYRIAIPELFRGSQIERLLYMDCDMIA 121 Query: 136 KGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAK 195 D+++L + L + A V+D Q + +++ P YFNSG++ +D+KKW + Sbjct: 122 LDDVAKLWTVDLGENIIAAVEDAGFHQR--LEKMAIPAESMCYFNSGLLLIDVKKWLNLD 179 Query: 196 LTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDK---THQNY 252 +T K L + + ++ DQD +N +L L ++N I S+ K + Sbjct: 180 VTTKVLRFIEENPDKLRFHDQDALNAVLHDRWTLLHPKWNAQGYILSKAKKHPTIYGEKQ 239 Query: 253 KKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWK 294 + + +IH+TG KPW K + + +YY + ++ Sbjct: 240 YEETRRAPSIIHFTGHVKPWTKEFQWYTKRYYDQYANRTAFR 281 >UniRef50_B8ISQ5 Glycosyl transferase family 8 n=1 Tax=Methylobacterium nodulans ORS 2060 RepID=B8ISQ5_METNO Length = 328 Score = 204 bits (519), Expect = 4e-51, Method: Composition-based stats. 
Identities = 68/304 (22%), Positives = 116/304 (38%), Gaps = 15/304 (4%) Query: 19 LANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKL 78 L + + + VA +D + V++ S++ LD +I + +IA L Sbjct: 4 LHETDEIDRIAVALCIDRAFFRHALVTVASLLDAGPRQPLDVHIFYAEADPACMARIAAL 63 Query: 79 AEQNQLR-ITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKG 137 +I+ D+ + P + S Y RL L+ ++LYLDAD++ Sbjct: 64 FADQDRHGCHFQKISLDRFEGFPVSDAISAGTYARLLLPYLMPRR-AKVLYLDADLIVLD 122 Query: 138 DISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLT 197 D++ L L A A V+D A YFN+GV+ ++L W L Sbjct: 123 DVAPLWRTELGAAPVAAVRDPFCDNRPA----IGFSPDEPYFNAGVLLMNLAVWRREGLA 178 Query: 198 EKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKT--HQNYKKL 255 E+ + + + KY DQD +NV+L+G F+ +N + + + Sbjct: 179 ERVAAHIDAHGASLKYFDQDALNVVLRGRARFVDPRWNFQPRMADATPADIACARAEFRR 238 Query: 256 ITESTLLIHYTGATKPWHK-WAIYPSVKYYKIALENSP------WKDDSPRDAKSIIEFK 308 +IHYT KPW +AI+ Y + P + D + K Sbjct: 239 TRARPAIIHYTTPHKPWKDPFAIHYGRHYLDCLMRLEPDLRARYFADVPQQPRLRASHLK 298 Query: 309 KRYK 312 R + Sbjct: 299 ARMR 302 >UniRef50_B4RJG6 LgtC n=29 Tax=Neisseria RepID=B4RJG6_NEIG2 Length = 307 Score = 204 bits (519), Expect = 4e-51, Method: Composition-based stats. 
Identities = 57/298 (19%), Positives = 115/298 (38%), Gaps = 14/298 (4%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 +++ + D NY + V+ S+ + + F+++ ++ +A I Sbjct: 1 MDIVFAADDNYAAYLCVAAKSVEAAHPDTEIRFHVLDAGISEENRAAVAANLRGGGGNIR 60 Query: 88 LYRINTDKLQCLPCT-QVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLG 146 +N + P + S Y RL + + D++LYLD DV+ + + L Sbjct: 61 FIDVNPEDFAGFPLNIRHISITTYARLKLGEYI-ADCDKVLYLDTDVLVRDGLKPLWDTD 119 Query: 147 LNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMS 206 L G D+ +++ + YFN+GV+ ++LKKW + + + + Sbjct: 120 LGGNWVGACIDLFVERQEGYKQKIGMADGEYYFNAGVLLINLKKWRRHDIFKMSCEWVEQ 179 Query: 207 KDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITES------- 259 +V +Y DQD++N L KG + +N + T + + + + + Sbjct: 180 YKDVMQYQDQDILNGLFKGGVCYANSRFNFMPTNYAFMANGFASRHTDPLYLDRTNTAMP 239 Query: 260 TLLIHYTGATKPWHKWAIYPSVKYYKIA---LENSP--WKDDSPRDAKSIIEFKKRYK 312 + HY G+ KPWH+ + + L P W+ + + R K Sbjct: 240 VAVSHYCGSAKPWHRDCTVWGAERFTELAGSLTTVPEEWRGKLAVPPTKHMLQRWRKK 297 >UniRef50_B7GNT4 Glycosyl transferase, family 8 n=5 Tax=Bifidobacterium RepID=B7GNT4_BIFLI Length = 1013 Score = 203 bits (517), Expect = 6e-51, Method: Composition-based stats. 
Identities = 65/333 (19%), Positives = 130/333 (39%), Gaps = 23/333 (6%) Query: 10 DKVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNN-RHINLDFYIIADVYN 68 + A + + ++ + V + D NY+ + ++ S + N D ++ Sbjct: 651 NPEPAEELKPLDVFDKPIVPVVFAADDNYVPQLTTTVYSAMKNADPSYFYDVVVLQQDIA 710 Query: 69 DGFFQKIAKLAEQN-QLRITLYRINTDKLQCLPCTQ--VWSRAMYFRLFAFQLLGLTLDR 125 +++ + EQ + + + + T S Y+R QLL D+ Sbjct: 711 GDKQERMWRFFEQFPNMSLRFLNVKRELSGYDLSTNNAHISIETYYRFLIQQLLP-NYDK 769 Query: 126 LLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPM---------QEKAVSRLSDPELLG 176 +LYLD+D++ GDI++L + L + V+D++ + + + + Sbjct: 770 VLYLDSDIIIVGDIAKLYDIDLQDNLLGAVRDIDFLGNLNVKHGKRMSYAKDVLKMKNPY 829 Query: 177 QYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNT 236 YF +GV+ L+ K + E+ L+ + + Y Y DQDV+N +G L+LP E+N Sbjct: 830 DYFQAGVLVLNTKGMRNRYSIEQWLTYASNPN--YIYNDQDVLNAYCEGKVLYLPWEWNV 887 Query: 237 IYTIKSELKDKTHQNYKKLI------TESTLLIHYTGATKPWHKWAIYPSVKYYKIALEN 290 ++ + + Q + + +IHY G KPW Y Sbjct: 888 VHDCGGRVGNLFTQAPNDVYDAYVKSRSNPQIIHYAGYQKPWVDPDCD-YSSIYWRYARE 946 Query: 291 SPWKDDSPRDAKSIIEFKKRYKHLLVQHHYISG 323 +P+ + + E + + L +H G Sbjct: 947 TPFYERLIKRVVLANEPQIPEEVFLPKHERAVG 979 >UniRef50_A8ARL4 Putative uncharacterized protein n=2 Tax=Citrobacter RepID=A8ARL4_CITK8 Length = 314 Score = 202 bits (515), Expect = 1e-50, Method: Composition-based stats. 
Identities = 80/316 (25%), Positives = 149/316 (47%), Gaps = 8/316 (2%) Query: 23 NTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQN 82 N + +N+AY DANYL+ V VSI S+++NN +L F++ +D K+ + + Sbjct: 3 NKTNVINIAYCTDANYLEYVAVSIMSVIMNNPEQSLAFFVFVYDVSDEDIAKLQSTSNKI 62 Query: 83 QLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQL 142 Q+ IT+ + + +K + +R+ Y RL +LL + R +YLDAD +C +S++ Sbjct: 63 QV-ITIDKADIEKYNNDFAIKHLNRSTYMRLAVPRLLKDKVARFIYLDADTLCFDSLSEI 121 Query: 143 LHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALS 202 + ++ V AV D + + +R + YFN+G +Y+++ W + KA + Sbjct: 122 NSVDIDNVVCAVSHDSLNIHDNKHARRLGL-SIDHYFNAGFLYINVANWIKHDIEHKANT 180 Query: 203 ILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLL 262 +L + Y DQD +N+ + G F+ +N ++ D+ +N+ + Sbjct: 181 VLFEQGKSLPYFDQDALNIAMNGNITFIDNRWNFLFNW---FTDEQKENFFYHSDTLPRI 237 Query: 263 IHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPR---DAKSIIEFKKRYKHLLVQHH 319 IH+TG KPW+K S + Y +PW++ R +++ + + + Sbjct: 238 IHFTGGRKPWYKEHTGLSQQLYVFYHHFTPWRNAELRSYAPRMRPTDYRVYSRQAAKKGN 297 Query: 320 YISGIIAGVCYLCRKY 335 Y + I YL K Sbjct: 298 YFTAIKWYAKYLKTKI 313 >UniRef50_Q03HK5 Lipopolysaccharide biosynthesis glycosyltransferase n=1 Tax=Pediococcus pentosaceus ATCC 25745 RepID=Q03HK5_PEDPA Length = 549 Score = 202 bits (515), Expect = 1e-50, Method: Composition-based stats. 
Identities = 64/294 (21%), Positives = 123/294 (41%), Gaps = 11/294 (3%) Query: 5 PAIEIDKVKAWDFRLANINTSE----CLNVAYGVDANYLDGVGVSITSIVLNNRHINLDF 60 P + + D +N E +NV ++ +++ + S SI+ N+ +F Sbjct: 252 PWKTLSEHPYLDEYHEELNELEINRGVVNVISAANSAFVEALATSYISILENDSENQYNF 311 Query: 61 YIIADVYNDGFFQKIAKLAEQN-QLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLL 119 Y++ D + + + + I + +++ L+ + ++ Y+R+ A +LL Sbjct: 312 YLLPDHLDQRDMLILGSVISRYDNASIKIVKVDEKLLENAVESDRILKSAYYRILAPELL 371 Query: 120 GLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYF 179 ++R +YLD D++ ++ L L G V A V+D + +YF Sbjct: 372 P-NINRAIYLDCDIIANTNLHDLWQTSLEGNVLAAVEDAGFHDRLEH--MGITHDNSKYF 428 Query: 180 NSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYT 239 NSG++ +DL W +T++ L + ++ DQD +N +L L L ++N Sbjct: 429 NSGMMLIDLVSWRSQAVTQRVLDYINHNPEKLRFHDQDALNAILYDKWLHLHPKWNAQSN 488 Query: 240 IKSELKDKTHQNYKKLI---TESTLLIHYTGATKPWHKWAIYPSVKYYKIALEN 290 I + KL E+ LIH+ G KPWH + +P Y + Sbjct: 489 IVLDALVPPRTELLKLYAETRENPKLIHFCGHVKPWHAESKHPYTNVYLKYNKK 542 Score = 186 bits (472), Expect = 1e-45, Method: Composition-based stats. 
Identities = 65/269 (24%), Positives = 124/269 (46%), Gaps = 7/269 (2%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRI 86 +NV D NY D + ++I + + N N+ ++F ++++ ++ + KLA + Sbjct: 4 INVLLAADENYADQLQITIKTTLENLNKKTRVNFIVLSNNLSNSTKLALKKLAHGLH-TV 62 Query: 87 TLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLL-GLTLDRLLYLDADVVCKGDISQLLHL 145 ++ P ++ Y+R+ A QLL +DR+LYLD D++ + D+++L Sbjct: 63 EYLDLDPSVFAFCPTNSHINKTAYYRILAPQLLAKRNIDRILYLDVDLLVRHDLTELYDA 122 Query: 146 GLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQ-YFNSGVVYLDLKKWADAKLTEKALSIL 204 LN + V D + YFNSG++ +D+KKW + +TEK L+ + Sbjct: 123 ELNHNIVGAVIDTGQAFALNRLGVDPVVAANNIYFNSGILVIDIKKWNENHITEKTLNYI 182 Query: 205 MSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYK---KLITESTL 261 + ++ + DQD +N +L G L ++N +I ++ Y +S Sbjct: 183 KHQSHLIIFHDQDALNAVLAGHVQMLHPKWNLQNSIVFRKHRPINEAYDQLINEAIKSPA 242 Query: 262 LIHYTGATKPWHKWAIYPSVKYYKIALEN 290 ++H+T KPW + +P + Y L Sbjct: 243 IVHFTTHEKPWKTLSEHPYLDEYHEELNE 271 >UniRef50_A7A7B4 Putative uncharacterized protein n=2 Tax=Bifidobacterium adolescentis L2-32 RepID=A7A7B4_BIFAD Length = 1009 Score = 202 bits (514), Expect = 1e-50, Method: Composition-based stats. 
Identities = 63/333 (18%), Positives = 128/333 (38%), Gaps = 23/333 (6%) Query: 10 DKVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNN-RHINLDFYIIADVYN 68 + A + +I + V + D NY+ + ++ S + N H D ++ Sbjct: 647 NPEPAEQLKPLDITDKPIVPVVFAADDNYVPQLTTTVYSAMKNADPHYFYDVTVLQRNIA 706 Query: 69 DGFFQKIAKLAEQN-QLRITLYRINTDKLQCLPCTQ--VWSRAMYFRLFAFQLLGLTLDR 125 +++ +Q + + ++ + T S Y+R ++L D+ Sbjct: 707 WDKQERLRGFFKQFPNMNLRFTNVDRELAGYDLSTNNAHISVETYYRFLIQKVLPF-YDK 765 Query: 126 LLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVE---PMQEKAVSRLSD------PELLG 176 +LYLD+D++ GDI++L ++ L G + ++D++ + K R+ + Sbjct: 766 VLYLDSDIIINGDIAKLYNIDLQGKMLGAIRDIDFLANLNVKHGKRMGYAQTVLKMKNPY 825 Query: 177 QYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNT 236 YF +GV+ L+ K + ++ L+ + D + Y DQDV+N +G L+LP E+N Sbjct: 826 DYFQAGVLVLNTKAMREHYTIKQWLTYASNPD--FIYNDQDVLNAHCEGNVLYLPWEWNV 883 Query: 237 IYTIKSE------LKDKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALEN 290 ++ + ++HY G KPW Y Sbjct: 884 VHDCGGRVGNLFVQAPNDIYDAYMKSRNDPQIVHYAGFQKPWTDPDCD-FASMYWKYARE 942 Query: 291 SPWKDDSPRDAKSIIEFKKRYKHLLVQHHYISG 323 +P+ + + E + L +H G Sbjct: 943 TPFYERLLKRVVKANESEIPAGVLRPKHERAVG 975 >UniRef50_UPI0001A457E5 hypothetical protein NsubN_08151 n=1 Tax=Neisseria subflava NJ9703 RepID=UPI0001A457E5 Length = 345 Score = 202 bits (514), Expect = 1e-50, Method: Composition-based stats. 
Identities = 74/327 (22%), Positives = 135/327 (41%), Gaps = 24/327 (7%) Query: 25 SECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKL-AEQNQ 83 + ++ Y D NY+ +G ++ S++ NN + F+++ F ++ Sbjct: 21 KQPKHIVYAADQNYIKHIGTALLSVLQNNTS-PIHFHLLVSGSEGYDFNIFDQIETSNQN 79 Query: 84 LRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLL 143 I++Y +NT+ L T ++ AMY+R+ LL LYLD DV+C G+I L Sbjct: 80 YAISVYHLNTEYFSTLQTTHYFTIAMYYRMSIPCLLKGITHTALYLDTDVLCLGNIDDLF 139 Query: 144 HLGLNGAVAAVVKDVEPMQ-EKAVSRLSDPELLGQYFNSGVVYLDLKKWADA---KLTEK 199 + ++ ++ A V D + YFNSGV+ ++ KW D K+ + Sbjct: 140 EIDISNSLIAAVPDAILYRAYIKQLNQFGFTDTEPYFNSGVILFNIDKWNDMAIDKILSE 199 Query: 200 ALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITES 259 + + ++ PDQD++N+ G +L +N I+ + + Sbjct: 200 KMQAVEKQNFKLSCPDQDILNLACIGHVHWLSENFNWIH-------WHQKYSELIDNPNN 252 Query: 260 TLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKD--------DSPRDAKSIIEFKKRY 311 L+H+ G KPWH+ +P+ Y +NSPW + +F++ Sbjct: 253 IRLVHFVGHIKPWHQLGFHPA---YDQYFKNSPWNNGYLEQPLSTWLPFPNPKRKFRQAA 309 Query: 312 KHLLVQHHYISGIIAGVCYLCRKYYRK 338 K L Q YL R+ ++ Sbjct: 310 KRLWKQGQKKQAWAYYREYLLRRINKR 336 >UniRef50_UPI000196958D hypothetical protein BACCELL_01586 n=1 Tax=Bacteroides cellulosilyticus DSM 14838 RepID=UPI000196958D Length = 305 Score = 202 bits (514), Expect = 2e-50, Method: Composition-based stats. 
Identities = 60/304 (19%), Positives = 111/304 (36%), Gaps = 6/304 (1%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 +N+ D+ Y+ V + S NN Y++ + + I K+ Sbjct: 1 MNIVCAADSGYVQHCSVMLISFFENNPGEEHAVYLLTEGLDLDDLDFIQKIVHSYNGHFF 60 Query: 88 LYRINTDKLQCLP--CTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHL 145 +++ L+ P T S A Y RLF LL ++++LYLD D++ I +L Sbjct: 61 YCQVDFKFLEKCPIKSTDHLSIATYNRLFMADLLPADVNKVLYLDCDIIVNQSIKELWET 120 Query: 146 GLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILM 205 L + + V D + YFN+GV+ ++L W +T+ + + Sbjct: 121 PLRDNFVVAAFEERGCCAEDVYERLDYDSKYGYFNAGVLLVNLDYWRTHNMTQAFIEYIE 180 Query: 206 SKDNVYKYPDQDVMNVLLKGMTLFLPREYN--TIYTIKSELKDKTHQNYKKLITESTLLI 263 + DQDV+N ++ + +N I+ +K + I ++ Sbjct: 181 HNFEKLRAHDQDVLNAFFYDKSVHISLAWNVEFIFYYYGIIKKFGFDRDLRFILRHPKIL 240 Query: 264 HYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHLLVQHHYISG 323 H+T KPW +P Y L+ K + ++ +Y + I G Sbjct: 241 HFTWKPKPWETSCQHPFRINYYRYLKK--IKKNPLSFRDTLRALWDKYYFCFLIKWKIKG 298 Query: 324 IIAG 327 Sbjct: 299 HKYY 302 >UniRef50_A6LGX5 Glycosyltransferase family 8 n=1 Tax=Parabacteroides distasonis ATCC 8503 RepID=A6LGX5_PARD8 Length = 325 Score = 202 bits (513), Expect = 2e-50, Method: Composition-based stats. 
Identities = 64/326 (19%), Positives = 129/326 (39%), Gaps = 20/326 (6%) Query: 29 NVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITL 88 ++ D NYL V + S+ N +L F+++++ + + + + E + ++++ Sbjct: 3 DIVVASDCNYLHLVSICAVSLFETNSSESLHFHLLSNGIDSADIKNLQTIVEGYRGKLSV 62 Query: 89 YRINTDKLQCL-PCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGL 147 Y I + + + + S Y RLFA +L LD++LY+D D++ G I L + L Sbjct: 63 YPIENLRERLMTDVPETISLTSYARLFAGSILPANLDKVLYIDCDIIFNGSIRDLFNTDL 122 Query: 148 NGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSK 207 + + D P+ + + + Y N+GV+ + L +W + +K + L++ Sbjct: 123 GNCLVGGILD--PLISRTYKKEIKIPMSEPYINAGVLIIPLNRWRSEGMEQKFVDFLVAN 180 Query: 208 DNVYKYPDQDVMNVLLKGMTLFLPREYNTI-----YTIKSELK---DKTHQNYKKLITES 259 + DQ ++N + G LP ++N + Y K K Q K S Sbjct: 181 RGKVHHHDQGIINAVCAGRKKILPPQFNVMSNSLCYPWKDLYKINTPFYDQEEYKKGISS 240 Query: 260 TLLIHYTG--ATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFK------KRY 311 +IH+TG +PW +P + + +KD + R Sbjct: 241 PAIIHFTGAIHGRPWIVGCTHPYANKFLQFKAKTAYKDIPLKPNNQSAALHRLEGILYRL 300 Query: 312 KHLLVQHHYISGIIAGVCYLCRKYYR 337 + Y+ + + Y + Sbjct: 301 LPFSLFKRYMQSV-YYLSYFKHSIKK 325 >UniRef50_B1MX28 Lipopolysaccharide biosynthesis protein, LPS:glycosyltransferase n=2 Tax=Leuconostoc RepID=B1MX28_LEUCK Length = 283 Score = 199 bits (505), Expect = 1e-49, Method: Composition-based stats. 
Identities = 65/273 (23%), Positives = 110/273 (40%), Gaps = 3/273 (1%) Query: 18 RLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAK 77 I + +N+ +D NY+ + V + S+ N N+ ++ D +K+ + Sbjct: 2 EKTKIINDDSVNILITIDENYIKPLRVLLYSLRQTNPRENMTIWLAHDHIEVAQLEKLHQ 61 Query: 78 LAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKG 137 Q + +++T P + + MYFRL Q L TL R++YLD D++ Sbjct: 62 FVAQLGFVLHTIKVDTSLWASAPTFKQYPPEMYFRLLCGQYLPKTLHRVIYLDPDILVIN 121 Query: 138 DISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLT 197 I L ++ L G + A + YFNSGV+ +DL Sbjct: 122 PIRPLANMPLKGQMLAASSHMGLTGISQTINHLRLGTRQVYFNSGVMLMDLDMMRQRVDM 181 Query: 198 EKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPRE-YNT-IYTIKSELKDKTHQNYKKL 255 + LS++ PDQD++N L L LP E +N + Sbjct: 182 KAILSVIQQYGKELILPDQDILNYLYGDEILSLPEEIWNYDTRDNIMHYAKSFGSVDMRW 241 Query: 256 ITESTLLIHYTGATKPW-HKWAIYPSVKYYKIA 287 + E+T+++HY G KPW +I P + Y+ Sbjct: 242 VMENTVILHYCGRPKPWEKSNSINPFIMLYQHY 274 >UniRef50_Q9L6B2 Putative glycosyl transferase n=3 Tax=Pasteurella RepID=Q9L6B2_PASMU Length = 302 Score = 198 bits (504), Expect = 2e-49, Method: Composition-based stats. 
Identities = 69/300 (23%), Positives = 125/300 (41%), Gaps = 8/300 (2%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 +N+ + D Y + V+I SI+ +N + FYI D + I + + Sbjct: 1 MNILFVSDDVYAKHLVVAIKSIINHN-EKGISFYIFDLGIKDENKRNINDIVSSYGSEVN 59 Query: 88 LYRINTDKLQCLPCT-QVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLG 146 +N + + P S A Y RL A + L L++++YLD DV+ + L ++ Sbjct: 60 FIAVNEKEFESFPVQISYISLATYARLKAAEYLPDNLNKIIYLDVDVLVFNSLEMLWNVD 119 Query: 147 LNGAVAAVVKDVEPMQEKAVSRLSDPELLGQY-FNSGVVYLDLKKWADAKLTEKALSILM 205 +N + A D EK+ + S +Y FN+GV+ +L +W + +AL +L Sbjct: 120 VNNFLTAACYDSFIENEKSEHKKSISMSDKEYYFNAGVMLFNLDEWRKMDVFSRALDLLA 179 Query: 206 SKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDK-----THQNYKKLITEST 260 N Y DQD++N+L + +L +N + +K ++ + + T Sbjct: 180 MYPNQMIYQDQDILNILFRNKVCYLDCRFNFMPNQLERIKQYHKGKLSNLHSLEKTTMPV 239 Query: 261 LLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHLLVQHHY 320 ++ HY G K WH + +V +Y+ L D R K + + + Y Sbjct: 240 VISHYCGPEKAWHADCKHFNVYFYQKILAEITRGTDKERVLSIKTYLKALIRRIRYKFKY 299 >UniRef50_C0Z4I4 Putative uncharacterized protein gspA n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0Z4I4_BREBN Length = 264 Score = 198 bits (503), Expect = 2e-49, Method: Composition-based stats. 
Identities = 59/266 (22%), Positives = 110/266 (41%), Gaps = 13/266 (4%) Query: 27 CLNVAYGVDANYLDGVGVSITSIVLNN-RHINLDFYIIADVYNDGFFQKIAKLAEQNQLR 85 +++ V+ + + V + S+ N + ++I + + K ++ + Sbjct: 3 TIHIVTAVNDGFAIHLAVMLYSLFENKVSKNPVIVHVIDSQVSGENKSILTKTVKRFHAQ 62 Query: 86 ITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHL 145 I I+ ++ Y R+ LL +++++YLD+D+V K DI+ L + Sbjct: 63 IKYVTIDPTLYDGFLVRDHLTQETYHRISIPDLLDKEVEKVIYLDSDIVIKKDITPLWNT 122 Query: 146 GLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILM 205 ++ A V D K YFN+GV+ ++LKKW + +T+K + + Sbjct: 123 KVDQYYLAAVMDSWQGLNKLRHADLAIPDDCDYFNAGVLVMNLKKWREHNITKKIMDYMK 182 Query: 206 SKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHY 265 + +YP QD MN +L L L ++N + YK + +IHY Sbjct: 183 KNQGIIRYPSQDPMNAILHDNWLQLDTKWNYQ----------SKHLYKSNLRIDPAIIHY 232 Query: 266 TGAT-KPWHKWAIYPSVKYYKIALEN 290 TG KPW +P + Y L+ Sbjct: 233 TGEDSKPWLS-KKHPLREEYFKYLKK 257 >UniRef50_D2LIH7 Glycosyl transferase family 8 n=1 Tax=Rhodomicrobium vannielii ATCC 17100 RepID=D2LIH7_RHOVA Length = 391 Score = 198 bits (503), Expect = 3e-49, Method: Composition-based stats. 
Identities = 52/300 (17%), Positives = 121/300 (40%), Gaps = 24/300 (8%) Query: 27 CLNVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKIAKLAEQNQLR 85 + V + Y+ G I SI + + + D + AD + + ++ + Sbjct: 43 AVPVVMCFNRRYMPGGAALIASIAEHASPNRLYDLIVFADDLASEDRDMLRNVCDKPNIS 102 Query: 86 ITLYRIN-TDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLH 144 + + ++ + + ++RL L+ D+++Y+DAD + D++ L Sbjct: 103 LRFFDVSRCFDGINFITHFHFRKENFYRLKIPDLM-RDFDKVVYIDADTITNRDLADLYD 161 Query: 145 LGLNGAVAAVVKDVEP----------------MQEKAVSRLSDPELLGQYFNSGVVYLDL 188 + ++G A V+D E V + YFNSG+V ++ Sbjct: 162 IDVDGYYIAAVRDFAMIATQNKKMLDIVGKKIYYETYVKDYLGLIGISNYFNSGLVLFNI 221 Query: 189 KKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKT 248 K ++++E+ ++++ +K ++ Y DQD++N++ + + +N + + Sbjct: 222 NKINGSQISERLIALIGTK--LFAYVDQDILNIVFENKVKLIDYSWNMVIDCERLYHLSE 279 Query: 249 HQNYKKLITES--TLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIE 306 Y + + ++HY G KPW+ ++ +YY +P + R+ + E Sbjct: 280 PDLYARYLDAGAAPHVVHYIGGNKPWNDPTVHM-AEYYWRYAAKTPLYEKLLREIRERRE 338 >UniRef50_A5ZFA9 Putative uncharacterized protein n=1 Tax=Bacteroides caccae ATCC 43185 RepID=A5ZFA9_9BACE Length = 310 Score = 197 bits (501), Expect = 5e-49, Method: Composition-based stats. 
Identities = 75/284 (26%), Positives = 121/284 (42%), Gaps = 9/284 (3%) Query: 29 NVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITL 88 N+ G+D Y G + S+ +N + Y+++ ++ + +L + Q +I Sbjct: 3 NIICGIDDQYCQHCGAMLLSLFESNPGA-ITIYVLSLELSEKSKNLLKELVDSYQKQIHF 61 Query: 89 YRINTDKLQCLPC--TQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLG 146 I ++ + P T S A Y RLF QLL +D+ LY+D+D++ K DIS L Sbjct: 62 IDIPSELVLNFPMKSTDYPSLATYLRLFIPQLLPFEVDKALYVDSDIIFKKDISALYDSD 121 Query: 147 LNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMS 206 + A ++D + YFN+G V L++K D T KA++ + Sbjct: 122 ITNYALAGMEDAPNQN----ALRLGFPESDLYFNAGFVLLNVKYLRDMDFTNKAMAYIRD 177 Query: 207 KDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSE--LKDKTHQNYKKLITESTLLIH 264 DQDV+N LL G LF+P ++N + + K + +S +IH Sbjct: 178 CREKIVLHDQDVLNALLHGKVLFVPIKWNMLDCFYRKPPFIAKKYMRELHENLDSPAVIH 237 Query: 265 YTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFK 308 ++G KPWH +P K Y W SP FK Sbjct: 238 FSGPLKPWHHGCPHPLRKEYFNYSRKLSWGCQSPDYYYVFSAFK 281 >UniRef50_B7KM20 Glycosyl transferase family 8 n=1 Tax=Cyanothece sp. PCC 7424 RepID=B7KM20_CYAP7 Length = 347 Score = 197 bits (501), Expect = 5e-49, Method: Composition-based stats. 
Identities = 62/280 (22%), Positives = 116/280 (41%), Gaps = 7/280 (2%) Query: 24 TSECLNVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKIAKLA--E 80 +E + + G D + G+ V++ S + N + +D YI+ N K+ ++ + Sbjct: 9 ENEPITIVSGADDKFALGLAVTLYSALANLDTKRKIDIYIVDGGINSKNRDKLTQILNSD 68 Query: 81 QNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDIS 140 + I + + L+ + + YFRL +LL ++R++YLD+D+V +G+++ Sbjct: 69 LMPVSIKWVKPDLTVLEGVKLFGSLNVTTYFRLLLPELLPTQVERVIYLDSDLVVEGNLA 128 Query: 141 QLLHLGLNGAVAAVVKDVEPMQEKAVSR---LSDPELLGQYFNSGVVYLDLKKWADAKLT 197 L L A V+D + Y N+GV+ +++K+W L Sbjct: 129 NLWEQELGNCPAVAVQDYVFPYVCNGLKTYQQLGLASNTPYCNAGVMLINIKQWRIEALN 188 Query: 198 EKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLIT 257 K L + ++ DQD +N L+ L ++N K K+LI Sbjct: 189 RKILEYIRKFYDLVYLADQDGINALIANRFKLLDLKWNVQIFGVYNGKIDLLCKPKELIR 248 Query: 258 ESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDS 297 ++H+T KPWH + + L S W +D Sbjct: 249 -DAFILHFTTPIKPWHPYYRQAGGSRFTHYLRKSKWFNDL 287 >UniRef50_Q116W1 Glycosyl transferase, family 8 n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q116W1_TRIEI Length = 278 Score = 196 bits (498), Expect = 1e-48, Method: Composition-based stats. 
Identities = 78/292 (26%), Positives = 139/292 (47%), Gaps = 17/292 (5%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 +N+ + D NY GV+ITS++LNN + D +II + + QK+ L++ + Sbjct: 2 MNLLFCFDQNYQQHFGVAITSVLLNNLSSHFDVHIITNFMEEKLKQKLDTLSKNYKCSFH 61 Query: 88 LYRINT-DKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLG 146 LY IN DK+ L + S A Y+RL ++L +D++LYLD+DVV + +L ++ Sbjct: 62 LYIINNLDKISKLKVSDHVSNATYYRLIMAEILPKHIDKVLYLDSDVVVISPLEELYNID 121 Query: 147 LNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMS 206 L A S S + + FNSGV+ ++L+KW + +++ K + Sbjct: 122 LENYFIAA------------SGFSGTLVKSKGFNSGVMVVNLEKWRNEQISTKVIDFATK 169 Query: 207 KDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYT 266 + Y DQ +N ++K L + R++N + K N ++ +IHY Sbjct: 170 NRDKLPYHDQSALNRVIKQNYLIIDRKWNFQVDLSPRKIQKPDDNI---ALKNARIIHYI 226 Query: 267 GATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDA-KSIIEFKKRYKHLLVQ 317 G++KPW+ W Y++ L+ S W + + + F+K + L + Sbjct: 227 GSSKPWYFWISDQRKNIYELYLKKSLWSTSKLQMIFQQTVYFRKALQRKLKK 278 >UniRef50_B2UPJ4 Glycosyl transferase family 8 n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UPJ4_AKKM8 Length = 315 Score = 196 bits (498), Expect = 1e-48, Method: Composition-based stats. 
Identities = 74/310 (23%), Positives = 124/310 (40%), Gaps = 17/310 (5%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRI 86 +N+ Y D N G GVSI S++ N ++ D YI+ + + L + L + Sbjct: 1 MNIVYATDDNGALGTGVSIVSLMENLPPGVHADIYIMTGGLSGDNTARFHSLQQGYNLHL 60 Query: 87 TLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLG 146 + DK P WS A Y+RL L T++R LY+D D + DIS + Sbjct: 61 HFIDMK-DKYTDFPVGSKWSAATYYRLGLAGELPATVERALYVDIDTIFNRDISPMYESE 119 Query: 147 LNGAVAAVVKDVEPMQEKAVSRLS---DPELLGQYFNSGVVYLDLKKWADAKLTEKALSI 203 + A V E + E++ SR + Y N+GV+ + + + + LS Sbjct: 120 FGDCLIAGVFTTEDLSEESFSRWKREMNLGRDSIYINAGVILYHIGRIREECFESQVLSW 179 Query: 204 LMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKK--------- 254 + + + DQD++NV + L L +N ++ + +++ Sbjct: 180 AKNNIHRLSWQDQDILNVCYQQRILLLHPMWNICDGAIWSIRWEGVTSFRNNPLKPADLL 239 Query: 255 LITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAK---SIIEFKKRY 311 +IHY G KPWH +I + + SPWKDD K F + Sbjct: 240 EAARRPGIIHYWGHPKPWHPNSIRQDYGLFYKYWKKSPWKDDIRDFRKQNDPGRMFISKM 299 Query: 312 KHLLVQHHYI 321 + LL + + Sbjct: 300 RCLLGKGKRL 309 >UniRef50_C7IBC8 Glycosyl transferase family 8 n=1 Tax=Clostridium papyrosolvens DSM 2782 RepID=C7IBC8_9CLOT Length = 464 Score = 196 bits (498), Expect = 1e-48, Method: Composition-based stats. 
Identities = 67/310 (21%), Positives = 118/310 (38%), Gaps = 34/310 (10%) Query: 63 IADVYNDGFFQKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLT 122 I + + + E+ RI + + Q + YFR+F +++ + Sbjct: 2 IDGGISSRNKECLRACVEKYGSRIRFLELKPELYQDFKTQSYFGYVTYFRIFIPEIVEAS 61 Query: 123 LDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVK----DVEPMQEKAVSRLSDPELLGQY 178 + +++YLD D+V KGDI +L ++ A V+ D+ V + G+Y Sbjct: 62 VRKVIYLDCDIVIKGDIRKLWENDISEYFVAAVEDVGIDIGGNFATMVKKHIGIPRKGKY 121 Query: 179 FNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIY 238 FN+GV+ ++L KW K TE L+ + DQD +N + K L LP E+N Sbjct: 122 FNAGVLLINLDKWRADKTTETIRKYLIENREKIYFADQDGLNAVFKDRWLKLPIEWNQQA 181 Query: 239 TIKSELKDKTHQNYK-KLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDS 297 I LK + ++IHYT KPW +P + Y L +PW D + Sbjct: 182 DILELLKRNRIDRPDVMKAALNPMIIHYTKQVKPWQYKDCHPLKEEYHRYLRLTPWNDTA 241 Query: 298 PRD---------------AKSIIEFKKRYKHLL--------------VQHHYISGIIAGV 328 P+ + +KK+ + I ++ + Sbjct: 242 PKVTIVDVLGKFLGKTPIGRGFYLYKKKIRDYFIIDKSYFSDKLVESKLFKLIYSLLFPL 301 Query: 329 CYLCRKYYRK 338 Y + + + Sbjct: 302 IYTYLRLFSR 311 >UniRef50_B3CA80 Putative uncharacterized protein n=1 Tax=Bacteroides intestinalis DSM 17393 RepID=B3CA80_9BACE Length = 301 Score = 195 bits (496), Expect = 2e-48, Method: Composition-based stats. 
Identities = 65/271 (23%), Positives = 119/271 (43%), Gaps = 6/271 (2%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADV-YNDGFFQKIAKLAEQNQLRI 86 +++ +D NY++ GV + S+ ++ +II +++ E++Q + Sbjct: 2 IDIVCSIDENYIEYCGVMLASLFVHTPDEKFRVHIICSSKVEKAGKKRLKVFCEKHQAEV 61 Query: 87 TLYRINTDKLQCLPCTQ--VWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLH 144 Y ++ ++ P + S A Y RLF +L+ ++++LYLD D++ I +L Sbjct: 62 YFYDVDYSLIKDFPIRKQDHLSLAAYLRLFMSELIPSNINKILYLDCDLIVVDSIKELWE 121 Query: 145 LGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSIL 204 ++ A V++ P ++ L P + YFNSGV+ ++L+KW + K E S + Sbjct: 122 KNIDNIAVAAVEERSPFDTESPVTLKYP-VEYSYFNSGVMLINLQKWREKKFVEACKSYI 180 Query: 205 MSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKD--KTHQNYKKLITESTLL 262 S K DQDV+N LL F+ +N + + + +S + Sbjct: 181 ASNYENIKLHDQDVLNALLYKEKQFISIRWNLMDFFLYASPEVQPERKKDWDDALKSPAI 240 Query: 263 IHYTGATKPWHKWAIYPSVKYYKIALENSPW 293 IH+TG KPW P Y + W Sbjct: 241 IHFTGKRKPWMYNCDSPFRDQYIRFAKQQGW 271 >UniRef50_B9KVD4 Glycosyl transferase, family 8 n=2 Tax=Rhodobacter sphaeroides RepID=B9KVD4_RHOSK Length = 334 Score = 194 bits (494), Expect = 3e-48, Method: Composition-based stats. 
Identities = 61/278 (21%), Positives = 113/278 (40%), Gaps = 7/278 (2%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIA-DVYNDGFFQKIAKLAEQNQLRI 86 +++ + D + V+ + L +++ D + + LA + I Sbjct: 1 MHLLFCADRPFFRHAAVAAV-SAASATRGPLQVHLLTCDSCPEEEARFRVALAPFAHVGI 59 Query: 87 TLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLG 146 +++R+ +L+ L + S A Y R A ++L + R+LYLD D++ D++QLL L Sbjct: 60 SVHRVPAARLEGLFVDRHLSPAAYLRFLAPEVLPEAVQRVLYLDCDLIVLDDVAQLLRLD 119 Query: 147 LNGAVAAVVKDVEPMQEKAVSRLS--DPELLGQYFNSGVVYLDLKKWADAKLTEKALSIL 204 L G A D+ +R L Y NSGV+ +DL +W L++K + Sbjct: 120 LQGRAVAAAPDLGWKDAAQAARFRTLGIPLDRPYVNSGVLLMDLGRWRRDGLSQKLFDYV 179 Query: 205 MSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKK---LITESTL 261 ++ DQD +N +L L R +N + S + ++ Sbjct: 180 ARHGSLLLRHDQDALNAVLADDIHLLDRRWNLQVLLLSPWAKRALPEDRQATVAARRDPA 239 Query: 262 LIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPR 299 ++H++ A KPW+ + Y +PW P Sbjct: 240 ILHFSTADKPWNFRVWTRRRELYFRFRARTPWSRAVPE 277 >UniRef50_B3Z5I6 Glycosyltransferase family 8 n=2 Tax=Bacillus cereus group RepID=B3Z5I6_BACCE Length = 317 Score = 194 bits (494), Expect = 3e-48, Method: Composition-based stats. 
Identities = 77/321 (23%), Positives = 134/321 (41%), Gaps = 20/321 (6%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRH-INLDFYIIADVYNDGFFQKIAKLAEQNQLRI 86 LNV Y D NY VGVS+ S++ NN+H NL+ ++I + + + + + ++ I Sbjct: 3 LNVVYSSDDNYAQHVGVSLLSLLQNNQHFNNLNIFLIENNISSYNKKNLNSVCKKYNKTI 62 Query: 87 TLYRINTDKLQ-CLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHL 145 N + L + Y RLF ++ LD+++YLD D + +S L Sbjct: 63 QYINFNVLLERLELNINDSIAINSYARLFLAGIIPEELDKIIYLDCDSIINSSLSDLWDT 122 Query: 146 GLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILM 205 + A V D Q K D + Y N+G++ ++LKKW + + +K + + Sbjct: 123 DVTEYFVAGVCDTVSNQTKL---RIDMDKSEGYINAGMLLINLKKWREENIEQKFMEFIK 179 Query: 206 SKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTI----------KSELKDKTHQNYKKL 255 KD + DQ +N +LK L+L ++N + EL++ ++ Sbjct: 180 KKDGNVFHHDQGTINGVLKDKILYLHPKFNAMTPFFTMSRKEIMSYYELENYYNEIEIDE 239 Query: 256 ITESTLLIHYT--GATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKH 313 ++ + IHYT +PW + +P YK L+ +PWK + K Sbjct: 240 AVKNPVFIHYTPAFVNRPWIEGCKHPLTSLYKSYLDMTPWKSTDLWKDRRGKVEKTIA-- 297 Query: 314 LLVQHHYISGIIAGVCYLCRK 334 L+ I + L K Sbjct: 298 -LLYTRLPFRIAHHIRNLIFK 317 >UniRef50_C5ZV11 Glycosyl transferase n=2 Tax=Helicobacter canadensis MIT 98-5491 RepID=C5ZV11_9HELI Length = 397 Score = 194 bits (493), Expect = 3e-48, Method: Composition-based stats. 
Identities = 71/338 (21%), Positives = 137/338 (40%), Gaps = 33/338 (9%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHIN---LDFYIIADVYNDGFFQKIA----KLAE 80 NV ++ NY+ V ITSI+ N + +F+++ D + + + +L++ Sbjct: 2 YNVVLNLNENYVPYAAVLITSIIQNTQSSGGGGYNFHLLMDSISQENTKNLENLISELSK 61 Query: 81 QNQLRITLYRINTDKLQCLPCTQ-VWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDI 139 +T+Y ++ + + Y+RL L L++ R +YLD D++ GD+ Sbjct: 62 IYPCTLTIYILDDQLFREYSMPTLNGNYLAYYRLKIGSALPLSIKRCVYLDVDMIVLGDL 121 Query: 140 SQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELL--GQYFNSGVVYLDLKKWADAKLT 197 +L + L G + VV + + + + G YFNSG++ +DL W + Sbjct: 122 RELFEVDLQGKICGVVMEHHSQKIYKPKNQAYKPINITGSYFNSGMLLVDLDLWRQENIE 181 Query: 198 EKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKS---------ELKDKT 248 ++A I + Y + DQD++N++L G T + E+N + + K Sbjct: 182 DRAFEIGKNYH--YSFHDQDILNIVLSGKTHKVGIEWNLMVCVYYRAICKDEKGRDKLPY 239 Query: 249 HQNYKKLITESTLLIHYTGATKPWHKWAIY------PSVKYYKIALENSPWKDDSPRDAK 302 ++ + ++HY TKPW+ IY +Y+ ++ +P + K Sbjct: 240 YRKDFNSALRNPKILHYFTHTKPWNNAKIYLDYHNKFLDQYWWDMVDQTPIFKEKLLQLK 299 Query: 303 SIIEFKKR------YKHLLVQHHYISGIIAGVCYLCRK 334 + YK L + +I Y K Sbjct: 300 PQADSALAFQCLVGYKLLRYYQKGLFALIPFYTYSLIK 337 >UniRef50_B6G807 Putative uncharacterized protein n=2 Tax=Collinsella RepID=B6G807_9ACTN Length = 276 Score = 194 bits (493), Expect = 4e-48, Method: Composition-based stats. 
Identities = 53/266 (19%), Positives = 109/266 (40%), Gaps = 2/266 (0%) Query: 24 TSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQ 83 ++V D YL + + S+ +N+ + +++ + +++ + Sbjct: 2 KQHAMDVIVTCDEGYLGPLRTMLYSLRASNQGAQVRIWLLHKGISLPALEELERFCSVLG 61 Query: 84 LRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLL 143 L I ++ L C++ + + MY+RL A ++ ++R LYLD D++ + L Sbjct: 62 LAIEPVTVDRVLLDGAKCSERYPQEMYYRLLAPSIIKAPIERALYLDPDILVINPLDDLF 121 Query: 144 HLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSI 203 + L+G A ++ + + YFN+GV+ D+ + + ++ S Sbjct: 122 EIDLHGNAFAAASHLDAVHPATALNKARLSTSSDYFNTGVILFDIARARKSICVDELFSY 181 Query: 204 LMSKDNVYKYPDQDVMNVLLKGMTLFLPRE-YNTIYTIK-SELKDKTHQNYKKLITESTL 261 + + + V +PDQD+ N L +TL +P E +N + + E T Sbjct: 182 VKAHEQVMLFPDQDLFNSLFGAVTLRIPDEIWNYDARKYPDNIIRTWGTATLDWVMEHTA 241 Query: 262 LIHYTGATKPWHKWAIYPSVKYYKIA 287 ++H+ G KPW YK Sbjct: 242 ILHFCGKNKPWAPGYRGQFASLYKHY 267 >UniRef50_D1P7H1 Lipopolysaccharide 1,2-glucosyltransferase n=1 Tax=Providencia rustigianii DSM 4541 RepID=D1P7H1_9ENTR Length = 324 Score = 194 bits (492), Expect = 5e-48, Method: Composition-based stats. 
Identities = 100/320 (31%), Positives = 165/320 (51%), Gaps = 15/320 (4%) Query: 21 NINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAE 80 + + ++ YGVD +L GVG SI S++LNN+ + F+I D D + + Sbjct: 17 ERHENSYFHIGYGVDEKFLYGVGTSIASVMLNNKDTDFHFHIFVDNLPDENL--FREAVQ 74 Query: 81 QNQLRITLYRINTDKLQCLPC-TQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDI 139 +IT+Y I+ +K + LP ++ WS A+YFRL L ++D LLYLDAD++CKGD+ Sbjct: 75 GTSHKITIYFIDNEKFKLLPLPSKAWSHAIYFRLLIISYLSSSIDSLLYLDADIICKGDL 134 Query: 140 SQLLHLGLNGA-VAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTE 198 S+L L + VKD + + P + +YFNSG +Y+ LK A + Sbjct: 135 SELKALTFDEKTFVYAVKDKFCS-----EKQNLPIDMSKYFNSGFLYMSLKHLAQENIPN 189 Query: 199 KALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITE 258 + + ++ D + +PDQD +NVLL + + YN ++++ + K H I + Sbjct: 190 RVIELVEKND--FSHPDQDALNVLLNDKLINISENYNYMFSLDWYITSKGH---LAKIPD 244 Query: 259 STLLIHYTGATKPWHKW-AIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHLLVQ 317 S + IH+ G TKP+H+W + Y KY + A +NSPWK+ + + ++ HL Sbjct: 245 SVVFIHFVGLTKPFHEWASFYEEYKYLESARKNSPWKNIPLLKPEGYKQLSRKKSHLRKN 304 Query: 318 HHYISGIIAGVCYLCRKYYR 337 Y+ I + YL +K + Sbjct: 305 GKYVEFIFTTIQYLMKKTFH 324 >UniRef50_C2KV37 Lipopolysaccharide biosynthesis protein, LPS:glycosyltransferase n=1 Tax=Oribacterium sinus F0268 RepID=C2KV37_9FIRM Length = 324 Score = 194 bits (492), Expect = 5e-48, Method: Composition-based stats. 
Identities = 61/281 (21%), Positives = 117/281 (41%), Gaps = 18/281 (6%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 +++ YGV+ ++ + VS++S++L+ L F+I++ + +K+ + E +I+ Sbjct: 1 MHIVYGVNEAFMPILAVSLSSLLLHAEGEALHFHILSLGIEEESKEKLRQYVETEGQKIS 60 Query: 88 LYRINTDKLQCLPC-----TQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQL 142 Y + + T +S+A RLF L T+ + LYLDAD V I L Sbjct: 61 FYDLEEKLSEWKEKLPALFTGKFSKATLLRLFIPSTLPETITKALYLDADTVVLQSILSL 120 Query: 143 LHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALS 202 HL L + + + P K Y+N+GV+ ++L + + EK L Sbjct: 121 YHLRLGDKLLGMAPE--PSIYKKHKEFLSLAEESPYYNAGVMLMNLSLLREEGMEEKCLR 178 Query: 203 ILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKS-----------ELKDKTHQN 251 K+ + DQD++N++ KG LP+ +N ++ + Sbjct: 179 YYQMKEGQLPFNDQDILNMVCKGRIRSLPQRFNFFSNYAYARYSALCRFSPWYQELESKK 238 Query: 252 YKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSP 292 +++H+ G +PW + + + E SP Sbjct: 239 SYSQAKAHPVIVHFAGDERPWREGNHNYYRRAFDYYAEESP 279 >UniRef50_C8W7U9 Glycosyl transferase family 8 n=2 Tax=Atopobium RepID=C8W7U9_ATOPD Length = 1014 Score = 193 bits (491), Expect = 7e-48, Method: Composition-based stats. 
Identities = 58/320 (18%), Positives = 123/320 (38%), Gaps = 23/320 (7%) Query: 6 AIEIDKVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNN-RHINLDFYIIA 64 K I + + V + D NY+ + ++ S++ N + D ++ Sbjct: 651 PEPRQKFIPLFEEKPEIASQNVVPVVFAADNNYVPILTCAMGSMLENADPNRYYDVVVLN 710 Query: 65 DVYNDGFFQKIAKLAEQN-QLRITLYRI--NTDKLQCLPCTQVWSRAMYFRLFAFQLLGL 121 + + K + RIT Y + + S YFR A +L Sbjct: 711 TNIGGSKQELVKKFFSRYKNARITFYNVWRMVKDYKLDTNNAHISVETYFRFLAQDILSA 770 Query: 122 TLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVE---------PMQEKAVSRLSDP 172 D+++YLD+D+V G++++L + + + A D++ + K + + Sbjct: 771 -YDKVVYLDSDLVVNGNVAELYDVRIGNNLIAATLDIDYLANLNIRGGDRMKYSLDVLNL 829 Query: 173 ELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPR 232 + YF +GV+ + + + L I + ++ Y DQD++N +G L+LP Sbjct: 830 KNPYAYFQAGVMVFNTAELRRYHTVPEWLRIASNP--IFIYNDQDILNSECQGRVLYLPA 887 Query: 233 EYNTIYTIK------SELKDKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKI 286 ++N + I + + + + + ++H+ GA KPW + Y+ Sbjct: 888 DWNVTHNIFGRAEELYPMAPNSVFDDYQAARRAPKIVHFAGAIKPWQNASCDM-ASYFWK 946 Query: 287 ALENSPWKDDSPRDAKSIIE 306 N+P+ + +D Sbjct: 947 YARNTPFYEVIIQDMVPSAR 966 >UniRef50_D2QX94 Glycosyl transferase family 8 n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2QX94_9PLAN Length = 362 Score = 193 bits (491), Expect = 7e-48, Method: Composition-based stats. 
Identities = 61/340 (17%), Positives = 120/340 (35%), Gaps = 40/340 (11%) Query: 15 WDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQ 73 N + + D N+ G+ + S + N ++D +++ D Sbjct: 2 DRSTHPTQNMPTSIQLVTSSDNNFAIGLAGTFKSALTNLAADSSVDLWVLDGGITDENKA 61 Query: 74 KIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADV 133 +I++ +L + ++ + + + A Y+RL ++L + + +YLD+D+ Sbjct: 62 EISRHLSDPRLTLHFVSVDRKLVSQFVISHHVTDATYYRLLTPEILSRDIGKFIYLDSDL 121 Query: 134 VCKGDISQLLHLGLNGAVAAVVKDVEPMQEK---------------------AVSRLSDP 172 + +GD+++L + +GA ++D R Sbjct: 122 LIRGDLTKLWNTPFDGAPCVAIQDSGAPFVDSTQLIEQQPSLRGCIANANPIPNYRELGL 181 Query: 173 ELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPR 232 Y N GV+ +DL W +L E+ L +L Y DQ +NV+L Sbjct: 182 HPHAPYLNGGVMMIDLDLWRREQLAERMLKVLSDYREHVTYWDQYALNVVLSQRWKQADH 241 Query: 233 EYNTIYT---IKSELKDKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALE 289 +N I S + L + H+T KPW I+P + + LE Sbjct: 242 RWNQIAYPLRFSSHENTIFSKEAFDLYRNDPYISHFT-YRKPWQAECIHPRSEEFYQYLE 300 Query: 290 NSPWKDDSP--------------RDAKSIIEFKKRYKHLL 315 S W + P K ++++++ L Sbjct: 301 GSIWANTKPVWQEYEPVAGVVHVPPKKPKPYYRRKFRELR 340 >UniRef50_B1Y723 Glycosyl transferase family 8 n=1 Tax=Leptothrix cholodnii SP-6 RepID=B1Y723_LEPCP Length = 316 Score = 192 bits (488), Expect = 1e-47, Method: Composition-based stats. 
Identities = 63/305 (20%), Positives = 123/305 (40%), Gaps = 24/305 (7%) Query: 29 NVAYGVDANYLDGVGVSITSIVL-NNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 + D YL + ++ S+V N H ++ +++ D + ++ + +I Sbjct: 12 PIVLACDEAYLMPLATTLRSVVESNAAHWPIECHVLVDDVSLPGRARVERSLPARAAQIR 71 Query: 88 LYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGL 147 + ++ S+ + RL LL L+R+LYLD D++ GD+ L+ L Sbjct: 72 WHAVDLTDFSSFETQAAISKMTFARLLMADLLPAELERVLYLDTDILVLGDLLPLMRTEL 131 Query: 148 NGAVAAVVKDVEPMQEKAVSR-LSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMS 206 +GA+ V+D + K+ S + + YFN+GV+ +DL +W +++ A L++ Sbjct: 132 DGAILGAVRDGLDAELKSTSPAPTGMPDVCDYFNAGVLLIDLARWRAGRVSAAARDHLVA 191 Query: 207 KDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYT 266 + DQD +NV G L +N +++ ++ ++H+ Sbjct: 192 HPQ-TPFADQDALNVACDGHWKPLAAHWNFQGHRSTDIAALAP-------SQRPGIVHFI 243 Query: 267 GATKPWHKWAIYPSVKYYKIALENSPWKDDSPR--------------DAKSIIEFKKRYK 312 A KPW ++ + + Y + + A S E +R K Sbjct: 244 TALKPWKADSLSLNARLYDGWRSRTLFARHPVMRWTDAIRALVSRMNRALSAHESTRRLK 303 Query: 313 HLLVQ 317 H L Q Sbjct: 304 HQLRQ 308 >UniRef50_B3XL28 Glycosyl transferase family 8 n=1 Tax=Lactobacillus reuteri 100-23 RepID=B3XL28_LACRE Length = 331 Score = 192 bits (488), Expect = 1e-47, Method: Composition-based stats. 
Identities = 73/314 (23%), Positives = 137/314 (43%), Gaps = 25/314 (7%) Query: 24 TSECLNVAYGVDANYLDGVGVSITSIVLNNRH-INLDFYIIADVYNDGFFQKIAKLAEQN 82 N+ Y D + +G S+ S++ NN+ ++F+I+ + +I K+ + Sbjct: 1 MKTIYNIVYATDDTFAPVLGTSLLSLLRNNKEAKKINFFILDSGISKENKFRIEKICDNF 60 Query: 83 -QLRITLYRINT--DKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDI 139 + +I + K+ S + Y RLF +L +++R+LYLD D + + Sbjct: 61 VNASLKWIKIESISKKIGIDVKNDRGSFSQYSRLFIGDVLDNSVERVLYLDCDTLILSSL 120 Query: 140 SQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEK 199 L ++ L G + A +KD + L + +L+ FNSGV+ +DLK W D K+ EK Sbjct: 121 KDLWNIELKGNIIAALKDAFSKYYRKNINLVNDDLM---FNSGVMLIDLKAWRDNKIKEK 177 Query: 200 ALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELK----------DKTH 249 A+S + + + DQ V+N +L T L YN + + Sbjct: 178 AISFIRQRHGKVQQGDQGVLNSVLSNKTFALDPRYNLVSIFYDLDYREIKLYRSPVNFYS 237 Query: 250 QNYKKLITESTLLIHYTGAT---KPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIE 306 + E+ +++H+T + +PW K + + K + + +PWK+ + IE Sbjct: 238 EKIIVKAKENPVILHFTSSFYSIRPWFKNSNHQCKKIWLKFYQETPWKNQPLQ-----IE 292 Query: 307 FKKRYKHLLVQHHY 320 K+ K + + Y Sbjct: 293 MSKKKKLINILFEY 306 >UniRef50_B7C7N8 Putative uncharacterized protein n=2 Tax=Firmicutes RepID=B7C7N8_9FIRM Length = 416 Score = 192 bits (488), Expect = 2e-47, Method: Composition-based stats. 
Identities = 61/279 (21%), Positives = 110/279 (39%), Gaps = 21/279 (7%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLY 89 + D +Y+D + +I SI +N+ N+ FYI+ + +F+ + K I Sbjct: 22 IVLACDNSYMDKLETTIKSICAHNK--NIKFYILNEDLPIEWFRLMTKRLSYFNSEILNI 79 Query: 90 RINTDKLQCLPC-TQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLN 148 +++ D + C ++ + YFR + +++LYLD D++ + L +L L Sbjct: 80 KVSGDSFKKFRCPSEHINYQSYFRYLIPDYVSE--EKVLYLDCDIIVTESLDGLFNLDLK 137 Query: 149 GAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKD 208 A V D FNSGV+ ++ K W + + K + + + Sbjct: 138 NYPVAAVPD--------------LPTTNDGFNSGVLLINNKYWRENDILNKLIKLTVEYH 183 Query: 209 NVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTGA 268 Y DQ ++N+L K LP YN S+ + + KL +IHYT Sbjct: 184 EKV-YGDQGILNILFKDKWYRLPLTYNLQVGSDSQEHMIGNMEWYKLFDGIPKVIHYTYT 242 Query: 269 TKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEF 307 KPW + + + + S W + + F Sbjct: 243 HKPWLMYNMTRFKEVWWFYHGIS-WDKMILNEPRVYESF 280 >UniRef50_C1IBL0 Glycosyl transferase n=1 Tax=Clostridium sp. 7_2_43FAA RepID=C1IBL0_9CLOT Length = 273 Score = 192 bits (487), Expect = 2e-47, Method: Composition-based stats. 
Identities = 61/271 (22%), Positives = 109/271 (40%), Gaps = 2/271 (0%) Query: 24 TSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQ 83 +S +++ D NY+ + S+VLNN +++ Q++ + + Sbjct: 2 SSNRIDLLVTFDKNYIPPFQTMLKSLVLNNPRETFHIWLLHSEIPLEMLQEVEEYCAKQG 61 Query: 84 LRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLL 143 +T + + P ++ + + MY+RL A +L ++ ++LYLD D++ I L Sbjct: 62 AAMTSINVERSVFKNAPVSKRYPQEMYYRLLAPLILPKSIKKILYLDPDILIINSIRPLW 121 Query: 144 HLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSI 203 L + A V + Y+NSGV+ +DL K E+ Sbjct: 122 ETELGNYIFAAASHVGVTGVINDINRVRLRVDHDYYNSGVMLMDLTKARSIVNVEEIFQC 181 Query: 204 LMSKDNVYKYPDQDVMNVLLKGMTLFLPR-EYNTIYTIKSELKDKTHQNYK-KLITESTL 261 + PDQD+ N L TL L +N S ++ NY IT +T+ Sbjct: 182 VREHKEELLLPDQDIFNYLYGKQTLPLDDAIWNYDARKYSNYLLRSGGNYDMDWITRNTV 241 Query: 262 LIHYTGATKPWHKWAIYPSVKYYKIALENSP 292 ++H+ G +KPW YK ++ S Sbjct: 242 VLHFCGKSKPWKHSQNNRFAMLYKHYMQISI 272 >UniRef50_A4VVV8 Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases n=7 Tax=Firmicutes RepID=A4VVV8_STRSY Length = 334 Score = 191 bits (486), Expect = 3e-47, Method: Composition-based stats. 
Identities = 59/330 (17%), Positives = 120/330 (36%), Gaps = 23/330 (6%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRI 86 +N+ + ++ ++ V + SI+ + FY+ +D + + + + ++ Sbjct: 6 VNILFTLNDAFVPQVAACMGSIMRTLDEDDTCHFYLFSDGISQQNKENLHQFVTDGGNKL 65 Query: 87 TLYRIN--TDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLH 144 T+ + T W+ + RL +LL +DR++YLD D + +I +L Sbjct: 66 TIVELENLESYFDFEVDTNGWASVVLARLLVDKLLPEEVDRIIYLDGDTLVLENIRELWE 125 Query: 145 LGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSIL 204 + L G V + + E+ R Y N+GV+ +DLK+W + Sbjct: 126 VDLEGKVLGMCPEPTASSER---REGLNLGTYTYHNAGVLLIDLKRWRSKSIGTIIFDYY 182 Query: 205 MSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELK----------DKTHQNYKK 254 K+ DQD +N LK L YN + Sbjct: 183 KEKNGELFANDQDALNGALKEEIKTLSITYNYFNIFDVYPYRTLEKLSRPSTFISKEEFV 242 Query: 255 LITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHL 314 I + ++H+ G +PW + + Y AL +PW+ F ++ Sbjct: 243 KIRKQPRIVHFLGEERPWRIGNKHRFREDYVSALNQTPWRGTQFESGWQFYFFCFNLFNM 302 Query: 315 L------VQHHYISGIIA-GVCYLCRKYYR 337 + +++ I+ +I + Y + + Sbjct: 303 VMKPFPMLRYKIITVLIPVFMKYRKIRLQK 332 >UniRef50_Q2K5X3 Galactosyltransferase protein n=13 Tax=Rhizobium RepID=Q2K5X3_RHIEC Length = 333 Score = 191 bits (485), Expect = 3e-47, Method: Composition-based stats. 
Identities = 68/308 (22%), Positives = 111/308 (36%), Gaps = 21/308 (6%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLY 89 V D N L ++ S+ N + +++F ++ ++ + + I + Sbjct: 38 VIVCSDVNMLPAACCTLLSVKRNLTNADVEFLLLGIDLKPHEVAEVENFGRLHGMAIRVL 97 Query: 90 RINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNG 149 T L WS A RL+ + + ++RLLYLDADV+ + +L L G Sbjct: 98 PYETPD-TGLQARGRWSAATLARLYMDRDIPDHIERLLYLDADVLAVAPVDELFTLDFQG 156 Query: 150 AVAAVVKDV---EPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMS 206 A V D P + A R G+YFN+GV+ D L + I Sbjct: 157 KALAAVDDYVMAFPEKSGARQRKIGMGEGGRYFNAGVLLFDWSACRARGLFPRTREIFKE 216 Query: 207 KDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYT 266 + ++++ DQD +NV G L L +NT + + + H+T Sbjct: 217 RSHLFENNDQDALNVTFDGDWLVLDPRWNTQTGLL-------------PFVDRPAIFHFT 263 Query: 267 GATKPWH---KWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHLLVQHHYISG 323 G KPW W Y L N+PW R H+ Q ++ Sbjct: 264 GRKKPWQANVPWVHRRMANRYADDLRNTPWASFC-RQPSRTDRVAGFLSHVGKQIGGLTR 322 Query: 324 IIAGVCYL 331 + Y Sbjct: 323 LARMRAYF 330 >UniRef50_A1XRC1 Glycosyltransferase n=1 Tax=Haemophilus ducreyi RepID=A1XRC1_HAEDU Length = 267 Score = 190 bits (482), Expect = 7e-47, Method: Composition-based stats. 
Identities = 72/264 (27%), Positives = 115/264 (43%), Gaps = 7/264 (2%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 +N+ + D NY + V + SI+ +N N++FYI+ + I L E+ I Sbjct: 1 MNIVFSSDENYAPHLSVCLYSILSHN--YNINFYILDLGIKEESKSFIKSLVEKFNSNIE 58 Query: 88 LYRINTDKLQCLPCT-QVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLG 146 +I+ D P S A Y RL L L+++LYLD D + G + L L Sbjct: 59 FIKISVDSFSNFPIYIDYISLATYARLKLTDYLP-QLEKVLYLDIDTIVNGSLIDLWDLD 117 Query: 147 LNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMS 206 LN A V D + L + + YFN+GV+ +D KW + +K++ I+ Sbjct: 118 LNEYYIAAVADPFIESLNYKTILGLDKNI--YFNAGVLLIDCIKWKQYNIFDKSVKIIKD 175 Query: 207 KDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYT 266 +Y DQD++N++LK L L YN + + +K + IT ++ HY Sbjct: 176 LSKKLQYQDQDILNLILKDKVLLLDCRYNFMPSQLDFIKRDKVRK-GIKITTPIVIYHYC 234 Query: 267 GATKPWHKWAIYPSVKYYKIALEN 290 G KPWH + + Y Sbjct: 235 GPKKPWHIDCTNFNCELYAYLSNK 258 >UniRef50_C6I3U6 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=C6I3U6_9BACE Length = 310 Score = 190 bits (482), Expect = 7e-47, Method: Composition-based stats. 
Identities = 68/273 (24%), Positives = 116/273 (42%), Gaps = 14/273 (5%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 +N+ +D NY+ V +TS +NN + + Y+I NDG + ++ + Sbjct: 1 MNILCCLDDNYVQHTSVMLTSFFINNDFEHHNIYVITMQLNDGNVAYLREVVNKYHSNFY 60 Query: 88 LYRINTDKLQCL--PCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHL 145 LY++N L T S A Y RLF+ Q+L ++LY+D D+V + + +L + Sbjct: 61 LYQVNEAMLSGFVRKETDYVSLAAYLRLFSTQVLPFNCSKVLYIDGDIVVRKSLEELWKM 120 Query: 146 GLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILM 205 + A V + KA + ++ YFNSG + ++L W + + EKA+ + Sbjct: 121 DIENYAVAAVDET----IKANCIRHNYDVTLGYFNSGFMLINLSFWRENSVAEKAIDYMK 176 Query: 206 SKDNVYKYPDQDVMN-VLLKGMTLFLPREYNTIY------TIKSELKDKTHQNYKKLITE 258 K DQD +N +L G+ L +YN ++ + K + Sbjct: 177 RFPERIKSWDQDALNGILYGGLWKRLDLKYNLTTIFLCKQYVEGQDFPKIYTEEYNSAIS 236 Query: 259 STLLIHYTGATKPWHKW-AIYPSVKYYKIALEN 290 ++HYTG KPW +P K Y Sbjct: 237 DPAVVHYTGPDKPWKYTVVDHPFKKDYLQYARM 269 >UniRef50_B2ISC2 Glycosyl transferase, family 8 n=2 Tax=Streptococcus pneumoniae RepID=B2ISC2_STRPS Length = 401 Score = 189 bits (481), Expect = 1e-46, Method: Composition-based stats. 
Identities = 70/309 (22%), Positives = 115/309 (37%), Gaps = 23/309 (7%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLY 89 + G D +Y+D V +I SI N+ + FY+ +FQ + K I Sbjct: 5 IVLGADNHYMDKVETTIKSICSKNKE--VKFYVFNSDLPTEWFQLMDKRLSVLGSEIVNV 62 Query: 90 RINTDKLQCLPC-TQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLN 148 ++ + T S A Y R F ++ R LYLD+D++ D++ L L+ Sbjct: 63 KVTESLINQFHLPTPHLSSATYLRYFIPTIVFEK--RALYLDSDIIVTADLTSLFEFPLD 120 Query: 149 GAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKD 208 G A V D+ E FNSGV+ +D +W + + + L++ + Sbjct: 121 GCPLAAVPDIPNTSE--------------GFNSGVLLIDTDRWREDDIQNQLLNLTIKHH 166 Query: 209 NVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTGA 268 Y DQ+++N+L K L YN + + L +IHYT Sbjct: 167 EHV-YGDQEILNMLFKDRWKKLSLSYNLQVGYDTYRHSLGDNEWYHLFEGIPNIIHYTTQ 225 Query: 269 TKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHLLVQHHYI--SGIIA 326 KPW + + W D + F+K K + +G I Sbjct: 226 NKPWSHYRFNRFRDIWWFYYGL-NWNDILLDNQILQENFEKLIKPITCHASIFTNTGDIE 284 Query: 327 GVCYLCRKY 335 G+ YL + Sbjct: 285 GLPYLLEQL 293 >UniRef50_D2QX95 Glycosyl transferase family 8 n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2QX95_9PLAN Length = 350 Score = 189 bits (480), Expect = 1e-46, Method: Composition-based stats. 
Identities = 60/301 (19%), Positives = 111/301 (36%), Gaps = 26/301 (8%) Query: 24 TSECLNVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKIAKLAEQN 82 L+V D + G+ +I S++ + + L+ +++ + + Sbjct: 1 MQRVLDVLTSADDRFAIGLAGTIKSVLASLSPSSKLNLWVLDGGISSENRDDLIHHWNDP 60 Query: 83 QLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQL 142 +L + ++ L S A Y+RL A LL ++ +LLY+DAD++ + D++ L Sbjct: 61 RLSVNWLPVDRALLAEFKVAPHMSDAAYYRLLAPNLLPSSVKKLLYIDADLLVQRDLTDL 120 Query: 143 LHLGLNGAVAAVVKDVEPMQEKAVSRL---------------------SDPELLGQYFNS 181 +G V D+ + L +YFNS Sbjct: 121 WDEPFDGHSCIAVHDIGAPFLDSNQILLEKPDALSRIVCRNPIPMFEELGLAPETRYFNS 180 Query: 182 GVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYN---TIY 238 GV +DL+ W +L+ + +L + Y DQ +N++L +N I+ Sbjct: 181 GVFMIDLETWRSEQLSVQMFDVLCTHRERQIYHDQFALNIVLANRWKAADYRWNQLAYIH 240 Query: 239 TIKSELKDKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSP 298 +K + S ++H+T KPW +P K + L S W P Sbjct: 241 ELKVPQHTFLEPQVFQQYKHSPWVVHFT-YRKPWQPECQHPLRKRFFDYLAGSKWMQAMP 299 Query: 299 R 299 Sbjct: 300 E 300 >UniRef50_C0QZN2 Glycosyl transferase, family 8 n=3 Tax=Brachyspira RepID=C0QZN2_BRAHW Length = 339 Score = 189 bits (480), Expect = 1e-46, Method: Composition-based stats. 
Identities = 67/308 (21%), Positives = 127/308 (41%), Gaps = 22/308 (7%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNR-HINLDFYIIADVYNDGFFQKIAKLAEQNQLRI 86 +N+ D NY +G +I SI+ N+ + F++I +KI L + I Sbjct: 1 MNICLASDNNYAPYMGTAIASILKNSSEDEKIIFHLIDGGITKENKEKIISLKNIKECEI 60 Query: 87 TLYRINTD----KLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQL 142 Y + + C +S AM++RL ++ +D++LYLD+D++ G + +L Sbjct: 61 NFYTPDIKMYDGWFEKTSCKAHFSAAMFYRLSIASIIPSNIDKILYLDSDLIATGSLKEL 120 Query: 143 LHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALS 202 + + A V+K + K + + YFNSGV+ ++ K W + ++ Sbjct: 121 FLMDIENHYAIVIKHSTNEKNK-----WSIDGINDYFNSGVLLINNKLWIKNNIEDQFNK 175 Query: 203 ILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLL 262 + + DQDV+N +L G + YN + + E+ ++ Sbjct: 176 FYNNNY-KTCFGDQDVLNNVLIGKVKYADMRYNV--------YAEKGYYNTENDIENPII 226 Query: 263 IHYTGATKPWHKWAIYP-SVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKH--LLVQHH 319 IHY KPW + + + + +PW D P A I +K Y + + ++ + Sbjct: 227 IHYLSPEKPWKENCRGTLFIDEFWRYYQYTPWFRDEPITAFQTILKQKFYDYDDVRLKGN 286 Query: 320 YISGIIAG 327 +I Sbjct: 287 WIKLFGIY 294 >UniRef50_P43974 Putative glycosyltransferase HI0258 n=32 Tax=Haemophilus influenzae RepID=Y258_HAEIN Length = 330 Score = 189 bits (479), Expect = 2e-46, Method: Composition-based stats. 
Identities = 70/299 (23%), Positives = 127/299 (42%), Gaps = 17/299 (5%) Query: 25 SECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQL 84 S+ +N+ + D Y + VSI SI+ N ++FYI+ N I LA Sbjct: 36 SQTMNIIFSSDHYYAPYLAVSIFSIIKNTPK-KINFYILDMKINQENKTIINNLASAYSC 94 Query: 85 RITLYRINTDKLQCLPCT-QVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLL 143 ++ + Q P T S A Y RL + + +++ +Y+D D + + +L Sbjct: 95 KVFFLPVCESDFQNFPKTIDYISLATYARLNLTKYI-KNIEKAIYIDVDTLTNSSLQELW 153 Query: 144 HLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSI 203 ++ + A +D + + + YFN+G++ ++L KW + + +K+++ Sbjct: 154 NIDITNYYLAACRDTFIDVKNEAYKKTIGLEGYSYFNAGILLINLNKWKEENIFQKSINW 213 Query: 204 LMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLI 263 + +NV KY DQD++N + KG F+ +N T + +K K K ++ Sbjct: 214 MNKYNNVMKYQDQDILNGICKGKVKFINNRFNFTPTDRDLIKKKNLLCVKMP----IVIS 269 Query: 264 HYTGATKPWHKWAIYPSVKYYKIALEN--------SPWKD--DSPRDAKSIIEFKKRYK 312 HY G K WHK + + + L+ S W D + I +KR K Sbjct: 270 HYCGPNKFWHKKCSHLNCHIGNLLLKEMDKIIDIPSSWYDHFEKIPFLIKIKRLRKRIK 328 >UniRef50_C1MLJ1 Glycosyltransferase family 24 protein n=1 Tax=Micromonas pusilla CCMP1545 RepID=C1MLJ1_9CHLO Length = 1657 Score = 188 bits (477), Expect = 3e-46, Method: Composition-based stats. 
Identities = 48/269 (17%), Positives = 92/269 (34%), Gaps = 15/269 (5%) Query: 9 IDKVKAWDFRLANINTSECLNVAYGVDAN-YLDGVGVSITSIVLNNRHINLDFYIIADVY 67 + +R A + E +NV + Y + + + S+ N + + F+ I + Sbjct: 1348 LWNKIISKWRNAKRSRLETINVFSVASGHLYERFLKIMMLSVRRN-TNNPVKFWFIKNWL 1406 Query: 68 NDGFFQKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLL 127 + F + +A + L + Y LF L LTL++++ Sbjct: 1407 SPQFKDILPHIAAKYGFEYELVTYKWPTWLHKQTEKQRIIWAYKLLFLDVLFPLTLNKVI 1466 Query: 128 YLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELL------GQYFNS 181 ++DAD V + ++ +L + L GA A + E R Y S Sbjct: 1467 FVDADQVVRSNLKELWEMDLRGAPYAYTPFCDNNPEMEGYRFWKHGFWQTHLAGKPYHIS 1526 Query: 182 GVVYLDLKKWADAKLTEKA---LSILMSKDNVYKYPDQDVMNVLLKGM-TLFLPREYNTI 237 + +DL+ + +K L + DQD+ N + LP+++ Sbjct: 1527 ALYVVDLETFRHTAAGDKLRLIYETLSKDPSSLANLDQDLPNYAQHQVPIFTLPQQWLWC 1586 Query: 238 YTIKSE---LKDKTHQNYKKLITESTLLI 263 + KT +T+ LI Sbjct: 1587 ESWCGNDTKTAAKTIDLCNNPMTKEPKLI 1615 >UniRef50_C2LRU0 Glycosyl transferase, family 8 n=1 Tax=Streptococcus salivarius SK126 RepID=C2LRU0_STRSL Length = 402 Score = 188 bits (477), Expect = 3e-46, Method: Composition-based stats. 
Identities = 53/280 (18%), Positives = 97/280 (34%), Gaps = 21/280 (7%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLY 89 V + + +Y++ + V++ S+ + Y++ + +F + + E I Sbjct: 6 VVFVAELSYMEKLEVALKSLCAH--KGQWKIYVLNENLPTEWFTLMNRRLEAIDSEILNC 63 Query: 90 RINTDKLQCLPCTQ-VWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLN 148 R++ + + A +FR + + R+LYLD D++ D+S L + L Sbjct: 64 RVSAESFKQFSLPSAHIHYATFFRYAIPEFVQEN--RVLYLDCDMIFTQDLSPLFEVDLG 121 Query: 149 GAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKD 208 G V D FN+G++ +D W K+T+ + Sbjct: 122 GLGIGAVVD--------------RPTTTDGFNAGLMVIDTDWWRQHKVTDSLFDLTKEHH 167 Query: 209 NVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTGA 268 Y DQ ++N+ K LP YN + + +IHYT Sbjct: 168 QNV-YGDQGILNLYFKDAWYQLPWTYNLQVGSDKDQYGYGDLEWYDAFKGVPAVIHYTSH 226 Query: 269 TKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFK 308 KPW + S W++ R I F Sbjct: 227 NKPWTSKRFNRFRDIWWFYYALS-WEEILLRKPSLKISFS 265 >UniRef50_C1QBZ8 LPS:glycosyltransferase n=1 Tax=Brachyspira murdochii DSM 12563 RepID=C1QBZ8_9SPIR Length = 336 Score = 187 bits (476), Expect = 4e-46, Method: Composition-based stats. 
Identities = 74/313 (23%), Positives = 125/313 (39%), Gaps = 19/313 (6%) Query: 26 ECLNVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKIAKLAEQNQL 84 + N+ D NY + V++ SI+ N N N+ F+II D K+ L + Sbjct: 2 QDYNICLCSDENYAKYMAVTMASILKNTNDDENIIFHIIESNIKDETKNKLIYLKKIKNC 61 Query: 85 RITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLH 144 I YR+ +K + A Y RL +L+ D++LYLD+D++ G + +L Sbjct: 62 EIKFYRVEYNK---------YPLATYLRLLIPELI-KDADKVLYLDSDIIVNGSLKELFD 111 Query: 145 LGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSIL 204 + +NG A VKD+ K L + YFN+GVV + K D +++K S Sbjct: 112 IDINGYYALAVKDLYVDIYKEHKELIEIGNNRIYFNAGVVLFNNKSCIDNNISQKFYSYF 171 Query: 205 MSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIH 264 N K+ DQD++N + R++N + K + ++IH Sbjct: 172 TENKNKLKFHDQDILNHCFIDKVKIIDRKWNFMPFRDYNTKSHYPTK------DDAVIIH 225 Query: 265 YTGATKPWHKWAIYPSV-KYYKIALENSPWKDDSPRDAKSIIEFKKRYKHLLVQHHYISG 323 + KPW Y + +PW + P A + +K Y + ++ Sbjct: 226 FV-EHKPWKTQKDRTYFLDDYWRYYQYTPWFFEEPITAIQTMMQQKMYDYEDIRFRSNYF 284 Query: 324 IIAGVCYLCRKYY 336 G+ K Sbjct: 285 KFFGIYANSSKLQ 297 >UniRef50_C9RWX3 Glycosyl transferase family 8 n=8 Tax=Bacillaceae RepID=C9RWX3_GEOSY Length = 276 Score = 187 bits (476), Expect = 4e-46, Method: Composition-based stats. 
Identities = 58/262 (22%), Positives = 101/262 (38%), Gaps = 5/262 (1%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLY 89 V DANYL + V + S+ NNR FY++ + Q + + + + Sbjct: 4 VLVTTDANYLPPLRVLMHSLFCNNR-RPFTFYLLYSRIAEEEIQALGEFVRRQGHELVPI 62 Query: 90 RINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNG 149 ++ P + ++ MY+RL A L +DR+LYLD D+V + +L + G Sbjct: 63 YVDPQLFHDAPVFRHYTVEMYYRLAAHLFLPPDVDRVLYLDPDIVAINPMDELYDMDFEG 122 Query: 150 AVAAVVKDVEPMQEKAVSRLSDPELL--GQYFNSGVVYLDLKKWADAKLTEKALSILMSK 207 + + + + + YFN+GV+ +++ + + Sbjct: 123 NLFIAAEHTHSTKVANLFNKLRLKTPNAKGYFNTGVMMMNIAMMREHVRLADIYQFIRDN 182 Query: 208 DNVYKYPDQDVMNVLLKGMTLFLPR-EYNTIYTIKSELKDKTHQNYK-KLITESTLLIHY 265 PDQDV+N L + YN L+ + + I E+T+ IHY Sbjct: 183 RFKLVLPDQDVLNGLYWDKIKPVDCYRYNYDARYYDFLQLLPNPKHDLAWIEENTVFIHY 242 Query: 266 TGATKPWHKWAIYPSVKYYKIA 287 G KPW ++YK Sbjct: 243 CGKEKPWKDNYKGELGRFYKRY 264 >UniRef50_Q3D426 Glycosyl transferase, family 8 n=7 Tax=Streptococcus agalactiae RepID=Q3D426_STRAG Length = 401 Score = 186 bits (473), Expect = 9e-46, Method: Composition-based stats. 
Identities = 62/287 (21%), Positives = 115/287 (40%), Gaps = 14/287 (4%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLY 89 + G D Y D V +I SIV +N+H L YII + +F + EQ R+ Sbjct: 5 IVLGADFQYRDQVMTTIKSIVSHNQH--LTIYIINTDFPVEWFNILNHSLEQFDCRVKNI 62 Query: 90 RINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNG 149 I++D + +P S A +FR F L + +LYLD+DV+ +G + L + L Sbjct: 63 PISSDVFEGIPTLSHISVAGFFRWFIPIHLEEEI--VLYLDSDVIVRGSLDPLFDINLEE 120 Query: 150 AVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDN 209 + V D L + FNSGV+ ++ W ++ + + K + Sbjct: 121 NLLGAVADHFST-------LYYGDTAPVSFNSGVMLINNSLWKKEEIYNSLMR-IADKGS 172 Query: 210 VYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITE-STLLIHYTGA 268 DQ+ +N+L + + + ++YN + + + +++HY Sbjct: 173 AVGVGDQEYLNILTQNRWIDIGKQYNVQIGQDVNINAYGRPDLYHFYDDCEPVIVHYNSQ 232 Query: 269 TKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHLL 315 KPW+K++ + W + K++ + +L Sbjct: 233 DKPWNKYSQSRYRSEWWYYFGL-EWSVIYAQQQKNLNRLTGKTLNLF 278 >UniRef50_A5EVI8 Glycosyl transferase family 8 protein n=1 Tax=Dichelobacter nodosus VCS1703A RepID=A5EVI8_DICNV Length = 617 Score = 186 bits (472), Expect = 1e-45, Method: Composition-based stats. 
Identities = 66/342 (19%), Positives = 127/342 (37%), Gaps = 28/342 (8%) Query: 1 MDSFPAIEIDKVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNR-HINLD 59 ++ P + + + + ++V D +Y+ +G I SI+ + LD Sbjct: 253 VNYLPRVFMKNTAEKHWIAQKVCQKNAVSVVIAADEHYVPHLGALICSIIDHLSCDAFLD 312 Query: 60 FYIIADVYNDGFFQKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLL 119 I+ + +++A L + I ++ D+ +SRA ++RL +L+ Sbjct: 313 LIILDGGIDFISQKQLAHLLGKRGA-IQFLDLS-DEFTDQKVHMHFSRATFYRLILDKLI 370 Query: 120 GLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPM------------------ 161 + R+LY+D D + D+++L LNG V D Sbjct: 371 -IDRKRVLYIDCDTIVLADLAELFATDLNGKAIGAVFDYIMHHFCQVGVRSIEFTNYLPA 429 Query: 162 QEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNV 221 ++ + E YF +GV+ DL++ +K ++ L K Y + DQD++N Sbjct: 430 KKYLEDYVGLKENWRHYFQAGVILFDLEQLRTLNYADKMIASLTEK--RYWFLDQDILNK 487 Query: 222 LLKGMTLFLPREYNTIYTIKSELKDKTHQNY--KKLITESTLLIHYTGAT-KPWHKWAIY 278 G FL +N + + + + K + +IHY G KPW + Sbjct: 488 YFVGNVHFLNPCWNVVNVGADIYEGLSAELIAELKAAERAPAIIHYAGYEAKPWVDLSA- 546 Query: 279 PSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHLLVQHHY 320 ++Y L + W + + KK K + Sbjct: 547 KFAEFYYYYLRQTFWYESVLTSKMLLNVRKKSQKSGEKSWRW 588 >UniRef50_D1Y7U2 Glycosyltransferase, family 8 n=1 Tax=Pyramidobacter piscolens W5455 RepID=D1Y7U2_9BACT Length = 617 Score = 186 bits (472), Expect = 1e-45, Method: Composition-based stats. 
Identities = 58/361 (16%), Positives = 126/361 (34%), Gaps = 37/361 (10%) Query: 6 AIEIDKVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIA 64 I + K + + + + Y+ +G + SI + + + YI Sbjct: 258 QIALVKNPQTEQEIVVNARENDVPAVLAANEKYVPILGTCLKSIADHCSSSRSYKLYIFH 317 Query: 65 DVYNDGFFQKIAKLAEQNQLRITLYRINTDKLQCLPCTQ-VWSRAMYFRLFAFQLLGLTL 123 + + + E + +T ++ + + + ++R LL + Sbjct: 318 TDIQEESQRNLKTFLESDNFSLTFVNVSLHVGKYRLRAKEHVTTETFYRFLILDLLKM-Y 376 Query: 124 DRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVE---------PMQEKAVSRLSDPEL 174 D++LYLD D++ + DI+ L L L + D + P K + + Sbjct: 377 DKVLYLDCDMIIQRDIADLYDLDLGTNLIGAALDPDFTGQCNGANPATRKYCDAVLKLKD 436 Query: 175 LGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREY 234 YF +GV+ +++ + + + L + + +YKY DQD++NV+ +G L+L + Sbjct: 437 CFTYFQAGVLLMNVAELNKSVTVRQLLEMAET--GIYKYSDQDILNVVCEGRALYLDMAW 494 Query: 235 NTIYTIKSE-------LKDKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIA 287 N + + + E +IHY G KPW K + A Sbjct: 495 NLLSDCDHYRWHHVVKFAPHYILDMYENAREKPYIIHYAGFLKPWMKLGED-FGYEFWKA 553 Query: 288 LENSPWKDDSP---RDAKSIIEFKKRYKHLLV------------QHHYISGIIAGVCYLC 332 +P+ ++ + + H+L+ + + + Y Sbjct: 554 ARETPFYEELLYAALVPHGNTTRPQNFLHMLINRLVPLAKAVLPKGSRLRYFARHLYYRI 613 Query: 333 R 333 + Sbjct: 614 K 614 >UniRef50_B8G232 Glycosyl transferase family 8 n=2 Tax=Desulfitobacterium hafniense RepID=B8G232_DESHD Length = 280 Score = 185 bits (471), Expect = 1e-45, Method: Composition-based stats. 
Identities = 61/268 (22%), Positives = 116/268 (43%), Gaps = 5/268 (1%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 +N+ ++++Y+ + V +TS++ +N Y+ + F +I + + ++ ++ Sbjct: 1 MNILVTLNSSYVKQLMVMLTSLLDSNPGEQFTVYVAHSAMSKEDFARIDQAIDSSRCKVE 60 Query: 88 LYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGL 147 +++ + L P T + + MY+R+FA L L+R+LYLD D+V + +L + Sbjct: 61 GIKLSDEGLSKAPITSRYPKEMYYRIFAVNYLPDHLERILYLDPDLVVINPLKELYTIDF 120 Query: 148 NGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSK 207 G A V+ + +K + Y NSGV+ ++L + + + Sbjct: 121 QGNFFAAASHVKELLKKLNHVRLNMAEDSTYVNSGVMMMNLSLLRQEQDVHEVYQYIEEY 180 Query: 208 DNVYKYPDQDVMNVLLKGMTLFLP-REYNT--IYTIKSELKDKTHQN--YKKLITESTLL 262 + PDQDV+N + TL + + YN Y L K + +T + Sbjct: 181 KHRLFLPDQDVLNGVYSDRTLTVDAKIYNLSERYYALYNLNPKYWDAKIDLDWVRSNTAI 240 Query: 263 IHYTGATKPWHKWAIYPSVKYYKIALEN 290 IHY G KPW I +YK + Sbjct: 241 IHYCGRNKPWKDNYIGDLNVFYKNYEQK 268 >UniRef50_B2ISC6 Glycosyl transferase, family 2/glycosyl transferase family 8 n=8 Tax=Streptococcus pneumoniae RepID=B2ISC6_STRPS Length = 696 Score = 185 bits (470), Expect = 2e-45, Method: Composition-based stats. 
Identities = 55/270 (20%), Positives = 106/270 (39%), Gaps = 21/270 (7%) Query: 18 RLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAK 77 +L+ SE + + Y+D V +I SI +N ++ FY+I + + + +++ K Sbjct: 293 QLSRQEESEKKAIVLAANYAYVDQVLTTIKSICYHN--RSIRFYLIHSDFPNEWIKQLNK 350 Query: 78 LAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKG 137 E+ I R+ ++++ C S ++ R F + D+ LYLD D+V Sbjct: 351 RLEKFDSEIINCRVTSEQISCYKSD--ISYTVFLRYFIADFVQE--DKALYLDCDLVVTK 406 Query: 138 DISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLT 197 ++ L L A V+D + FN+GV+ ++ W + Sbjct: 407 NLDDLFATDLQDYRLAAVRDFGGRAY----------FGQEIFNAGVLLVNNAFWKKENMI 456 Query: 198 EKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLIT 257 +K + + + DQ ++N+L + L L +YN I + + Sbjct: 457 QKLIDVTNEWHDKVDQADQSILNMLFEHKWLELDFDYNHIV-----IHKQFADYQLPEGQ 511 Query: 258 ESTLLIHYTGATKPWHKWAIYPSVKYYKIA 287 + +IHY KPW A + + Sbjct: 512 DYPAIIHYLSHRKPWKDLAAQTYREVWWYY 541 >UniRef50_C8WAA9 Glycosyl transferase family 8 n=2 Tax=Atopobium RepID=C8WAA9_ATOPD Length = 358 Score = 185 bits (470), Expect = 2e-45, Method: Composition-based stats. 
Identities = 54/318 (16%), Positives = 124/318 (38%), Gaps = 24/318 (7%) Query: 31 AYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKIAKLAE-QNQLRITL 88 + N++ + V+I SI+ N N D ++ + + + A+ N + + Sbjct: 19 VFACSDNFVPYLSVAIQSIIENVNPERRYDIIVLTRDLSPTNMITLTRQAQLVNNVHVGF 78 Query: 89 YRINTDKLQ-CLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGL 147 ++ LP + Y+RL A LL +++ +YLD+D+V DI++L + + Sbjct: 79 LDVDAALGDIELPHHGHFRPETYYRLLAPSLLP-NVNKAIYLDSDLVVNTDIAELYDIDI 137 Query: 148 NGAVAAVVKDVEPMQE---------KAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTE 198 G + +D + + + + + YF +GV+ ++L++ E Sbjct: 138 TGYLVGATRDADTIGQIDGYDATVGPYLKNELGMDDPHDYFQAGVILMNLEEIRKQISPE 197 Query: 199 KALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKD-------KTHQN 251 + L + S +++ DQDV+N + G L + ++N + + +D K + Sbjct: 198 EFLKV--STMRTWRWLDQDVLNRFVNGHYLRINMKWNYLVDWQFLRRDHIVAQAPKDIRE 255 Query: 252 YKKLITESTLLIHYTGA-TKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKR 310 + ++ + H+ G +PW + SP+ ++ + + Sbjct: 256 EYEEARKNICIAHFAGPDNRPWLYPNSD-LAGLFWFYARRSPYLEELRSQLEESRRTVRG 314 Query: 311 YKHLLVQHHYISGIIAGV 328 H + G++ Sbjct: 315 LSHRVQSGVLFRGLMPLF 332 >UniRef50_Q5WI33 Lipopolysaccharide glycosyltransferase n=6 Tax=Firmicutes RepID=Q5WI33_BACSK Length = 274 Score = 184 bits (468), Expect = 3e-45, Method: Composition-based stats. 
Identities = 61/265 (23%), Positives = 113/265 (42%), Gaps = 2/265 (0%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 +N+ ++A+YL + V +TS+ +NN H + Y+I + Q + + + Sbjct: 1 MNILVTLNAHYLKPLQVMLTSLFMNNAHEDFTIYLIHSSIPEKQLQLLEQFVCHQGHSLV 60 Query: 88 LYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGL 147 + + P + +S MY+RL A++ L LDR+LYLD D++ I L + Sbjct: 61 IVETDKTLFANAPVVKHYSSEMYYRLLAYRFLPTELDRILYLDPDILVLNPIRPLYEANI 120 Query: 148 NGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSK 207 + + A + ++ + + Y+NSGV+ ++L K + + + + Sbjct: 121 DSYLYAAAQHSFINIQEINKFRLNAYEMDAYYNSGVLLMNLAKQRETMDINDIFAYVETY 180 Query: 208 DNVYKYPDQDVMNVLLKGMTLFLPRE-YNTIYTIKSELKDKTHQNYK-KLITESTLLIHY 265 N PDQDV+N L + YN K K+ + + + T+++H+ Sbjct: 181 RNRLVLPDQDVLNALYSPQIKNVDERLYNYDARYYRYYKLKSGGRFDIDAVLQQTVILHF 240 Query: 266 TGATKPWHKWAIYPSVKYYKIALEN 290 G KPWHK YK + Sbjct: 241 CGKKKPWHKNYNGKFHSLYKHYEKQ 265 >UniRef50_Q3DNS6 Glycosyl transferase, family 8 n=5 Tax=Streptococcus agalactiae RepID=Q3DNS6_STRAG Length = 401 Score = 184 bits (467), Expect = 4e-45, Method: Composition-based stats. 
Identities = 59/263 (22%), Positives = 109/263 (41%), Gaps = 19/263 (7%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLY 89 VA VD+NYLD V+I SI + N N+ FY+ + + I + E ++ Sbjct: 5 VALAVDSNYLDKALVTIKSICVYN--RNITFYLFNQDTPVEWVRNINRKLEPLGSKLINV 62 Query: 90 RINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNG 149 +I + L + + +FRLF + + R+LYLD+D++ ++ L L G Sbjct: 63 KIYNYDIAHLTT--FLTVSTWFRLFLADYIPSS--RVLYLDSDIIVNTNLDYLFELDFKG 118 Query: 150 AVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDN 209 A VKD + FN+G++ +L+ W + LT+ L Sbjct: 119 YYLAAVKDP-------------HKNEEGGFNAGMLLANLELWREDGLTKTLLKTAEELHR 165 Query: 210 VYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTGAT 269 V K DQ ++N++ L L + +N + Y + + +IH+ + Sbjct: 166 VVKTGDQSILNIVCHNRWLSLNKTWNFQTYDVVSRYNHRSYLYLNIENRTPNIIHFLTSD 225 Query: 270 KPWHKWAIYPSVKYYKIALENSP 292 KPW++ ++ + + + Sbjct: 226 KPWNENSVARFRELWWYYFQLDF 248 >UniRef50_D2RIJ7 Glycosyl transferase family 8 n=1 Tax=Acidaminococcus fermentans DSM 20731 RepID=D2RIJ7_ACIFE Length = 330 Score = 183 bits (465), Expect = 7e-45, Method: Composition-based stats. 
Identities = 71/330 (21%), Positives = 141/330 (42%), Gaps = 26/330 (7%) Query: 14 AWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQ 73 ++++ A + L++ V+ + GV +TSI NN+ + L+F++ D +D + Sbjct: 21 SFEYMTAENKKKDILHICCNVNDLFFKPAGVLLTSICENNKDLALNFHVFVDSCSDENKE 80 Query: 74 KIAKLAEQNQLRITLYRINTDKLQCLPCT-QVWSRAMYFRLFAFQLLGLTLDRLLYLDAD 132 + K AE+ LY+++ Q + +SR Y R+ +L +R LYLDAD Sbjct: 81 NLRKTAEKYGCNAYLYKMDMSIYQNFHIKVKRFSRVTYIRIVMPWVLRNVTNRYLYLDAD 140 Query: 133 VVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWA 192 +VC + + L + P + + + YF+ G++++++ +W Sbjct: 141 MVCVKSLRVFFNYDLKDKAVGALVYDTPERIAFLKMKGNV-----YFSDGLMWINVDEWI 195 Query: 193 DAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNY 252 ++TE+ S + +K QD+MN++L G +P ++ + Sbjct: 196 KQRVTERVFSYQGADPARFKGQTQDLMNLVLDGNVQPIPALFHHM--------------- 240 Query: 253 KKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPW----KDDSPRDAKSIIEFK 308 K + +LIHY+G KPW + + + ++ L+ SPW P+ FK Sbjct: 241 DKDFSVDGILIHYSGRDKPW-EIVLDEDDELWRHYLDISPWPSMPNPMPPKRPIYYHSFK 299 Query: 309 KRYKHLLVQHHYISGIIAGVCYLCRKYYRK 338 K + + +++ + Y K K Sbjct: 300 KLAQVYSKKGNHLKELECLFWYGILKIRYK 329 >UniRef50_C8N7M8 Putative uncharacterized protein n=1 Tax=Cardiobacterium hominis ATCC 15826 RepID=C8N7M8_9GAMM Length = 618 Score = 183 bits (465), Expect = 7e-45, Method: Composition-based stats. 
Identities = 67/308 (21%), Positives = 120/308 (38%), Gaps = 28/308 (9%) Query: 10 DKVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNR-HINLDFYIIADVYN 68 D + ++V D NY +G I SI+ + LD I+ + Sbjct: 262 DTTAKSWYAQPVQTDKPVVSVVIASDDNYTPHLGALICSILDHFPADKYLDLIILDGGIS 321 Query: 69 DGFFQKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLY 128 + + +L I + D+ Q L +SRA ++RL +L+ D++LY Sbjct: 322 ALNRKLLMRLLPT-HANIQFLELK-DEFQQLATHMHFSRATFYRLILDKLIPGR-DKVLY 378 Query: 129 LDADVVCKGDISQLLHLGLNGAVAAVVKDVEPM------------------QEKAVSRLS 170 +D D + DIS L L V D + + Sbjct: 379 IDCDTIVLDDISTLFDTPLGDHAIGAVFDYIMHHFCLNDVLSIDTTGSLPAKRYLHDYVG 438 Query: 171 DPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFL 230 + +YF +GV+ +++K L+E +S L++K Y + DQD++N G ++L Sbjct: 439 LEDGWQRYFQAGVILFNMEKLRRLDLSEVMISDLLNK--RYWFLDQDILNKYFLGDVVYL 496 Query: 231 PREYNTIYTIK--SELKDKTHQNYKKLITESTLLIHYTG-ATKPWHKWAIYPSVKYYKIA 287 +N++ +++ + T+ K +IHY G TKPW+ +YY Sbjct: 497 DPRWNSVNSVQNIYQGLPATYIAELKTTETDPKIIHYAGFETKPWN-NRYAELAEYYFYY 555 Query: 288 LENSPWKD 295 L + W + Sbjct: 556 LRQTFWYE 563 >UniRef50_C0BAU3 Putative uncharacterized protein n=1 Tax=Coprococcus comes ATCC 27758 RepID=C0BAU3_9FIRM Length = 348 Score = 183 bits (464), Expect = 8e-45, Method: Composition-based stats. 
Identities = 61/292 (20%), Positives = 119/292 (40%), Gaps = 24/292 (8%) Query: 30 VAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKIA-KLAEQNQLRIT 87 V + Y+ + + SI N N N D I+ + G ++ +L + + + Sbjct: 14 VVLSANEYYVPYLAAVLESIRANSNDDQNYDLIIMHRDISMGSQDRLKKQLEDHQNITLR 73 Query: 88 LYRIN--TDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHL 145 I + L ++ YFRL Q+L D+ +Y+D+D+V DI++L Sbjct: 74 FLDIRRYEKPFKKLFLRGHFALETYFRLLMPQIL-ADYDKAVYIDSDLVVNADIAELYAT 132 Query: 146 GLNGAVAAVVKDVE---------PMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKL 196 ++G + A KD + P ++K + + + +YF +GV+ +L ++ Sbjct: 133 DVDGYLLAAAKDADTAGLYNGFEPNKKKYMDTILKIKKPYEYFQAGVIVFNLAEFRKTYT 192 Query: 197 TEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKS-------ELKDKTH 249 T + L S ++ DQDV+N L +G F+ +N + + L K Sbjct: 193 TAEMLKFAASY--EWELLDQDVLNYLAQGRVKFVDMAWNVMVDWRGIRLSQIIALAPKYL 250 Query: 250 QNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDA 301 + ++ +IHY G KPWH+ + + N+ + + + Sbjct: 251 HDEHMEARKNPKIIHYAGPDKPWHQPWSDM-AEEFWKYSRNTVFYETIMQRM 301 >UniRef50_B2UQ54 Glycosyl transferase family 8 n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UQ54_AKKM8 Length = 328 Score = 183 bits (464), Expect = 1e-44, Method: Composition-based stats. 
Identities = 58/272 (21%), Positives = 109/272 (40%), Gaps = 12/272 (4%) Query: 24 TSECLNVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKIAKLAEQN 82 V D + + V++ S++ Y+++D + + + +LA Sbjct: 6 KKNEFAVVLASDNRGILPLSVTVFSLLNTAGPETFYKIYVLSDGIDGENWASVERLAAPF 65 Query: 83 QLRITLYRIN-TDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQ 141 R+ ++ + P T+ W + R+F +LL +LYLD DV+ D+++ Sbjct: 66 DCRLEFIDVSGILEKHDFPHTEQWPVPAWGRVFIPELLKEERGNILYLDIDVLVCRDLTE 125 Query: 142 LLHLGLNGAVAAVVKDVEPMQ-EKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKA 200 L ++G VV + RL P YFNSGV+ +++ + + L Sbjct: 126 LFRTNMDGKAIGVVFENFSRPGSHFNERLEMPLTCTGYFNSGVLLMNVDVFREKNLVRAV 185 Query: 201 LSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSE-LKDKTHQNYKKLITE- 258 L ++ + PDQD +N L +T+ L +N + LK+ + + + +T Sbjct: 186 LDYAVTHRDRLTCPDQDALNGALCELTVPLHPRWNWHDGLTRRILKNDPREQFWRGVTPR 245 Query: 259 -------STLLIHYTGATKPWHKWAIYPSVKY 283 ++HY G KPW Y +Y Sbjct: 246 QAVEAALEPGILHYQGVHKPWRYNWRYEGERY 277 >UniRef50_C1CFZ1 Glycosyl transferase, family 8 n=17 Tax=Streptococcus pneumoniae RepID=C1CFZ1_STRZJ Length = 404 Score = 182 bits (463), Expect = 1e-44, Method: Composition-based stats. 
Identities = 64/289 (22%), Positives = 113/289 (39%), Gaps = 24/289 (8%) Query: 26 ECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLR 85 ++ + D +Y+D + +I SI N L FY+ D +F + K + Q Sbjct: 2 NTKSIVFNADNDYVDKLETAIKSICCYN--NCLKFYVFNDDIASEWFLMMNKRLKTIQSE 59 Query: 86 ITLYRINTDKLQCLPCT-QVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLH 144 I +I L+ + S A +FR F + + R LYLD+D++ G + L Sbjct: 60 IVNVKIVDHVLKKFHLPLKNLSYATFFRYFIPNFVKES--RALYLDSDIIVTGSLDYLFD 117 Query: 145 LGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSIL 204 + L+G A V+D ++ FNSG++ +++ W D K L + Sbjct: 118 IELDGYALAAVEDSFG------------DVPSTNFNSGMLLVNVDTWRDEDACSKLLELT 165 Query: 205 MSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKK----LITEST 260 Y DQ ++N+L L R +N + + S + + + + + Sbjct: 166 NQYHE-TAYGDQGILNMLFHDRWKRLDRNFNFMVGMDSVAHIEGNHKWYEISELKNGDLP 224 Query: 261 LLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKK 309 +IHYTG KPW + + + W D R F++ Sbjct: 225 SVIHYTG-VKPWEIISNNRFREVWWFY-NLLEWSDILLRKDIISRSFEE 271 >UniRef50_D1JY84 Putative uncharacterized protein n=1 Tax=Bacteroides sp. 3_1_33FAA RepID=D1JY84_9BACE Length = 312 Score = 182 bits (463), Expect = 1e-44, Method: Composition-based stats. 
Identities = 63/268 (23%), Positives = 123/268 (45%), Gaps = 6/268 (2%) Query: 25 SECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQL 84 S +++A+ V+ +Y + + VSI ++ NN L +I++D +D ++ KL Sbjct: 3 SSPMHIAFCVNDHYAEYILVSIKGLLENNSD-PLVIHILSDYISDKNTNRLKKLVGLYPN 61 Query: 85 RITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLH 144 I I D L+ W+ ++R+ ++L ++ R+LYLDAD + +I +L Sbjct: 62 AILDIVI-VDDLKLKDLKDTWTIYTWYRVLLPEILDASVHRVLYLDADTLVSENIEELFS 120 Query: 145 LGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSIL 204 L + G A D + + R E +Y +GV+ ++L W + + K + Sbjct: 121 LDMTGKAIAGTVDFQSKDKSTYQRCGY-EAEKEYVCAGVMMMNLDYWREHDIANKIIDWG 179 Query: 205 MSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTI-YTIKSELKDKTHQNYKKLITESTLLI 263 ++ +YPDQD +N + + M L LP +Y+ I + + + + + ES +I Sbjct: 180 RDYNDRIQYPDQDAINYICRDMKLLLPLKYDIIDGFFQDDYYFQNYPQELRECIESPAII 239 Query: 264 HYTGATKPW-HKWAIYPSVKYYKIALEN 290 HY G PW + + + ++ + Sbjct: 240 HYAGQA-PWVVEISNHLLQDEWERYNKL 266 >UniRef50_D0D9G3 Putative general stress protein A n=1 Tax=Citreicella sp. SE45 RepID=D0D9G3_9RHOB Length = 327 Score = 182 bits (463), Expect = 1e-44, Method: Composition-based stats. 
Identities = 63/313 (20%), Positives = 128/313 (40%), Gaps = 22/313 (7%) Query: 25 SECLNVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKIAKLAEQNQ 83 + +NV Y D +GVSI S + N ++ ++++ + + IA + Sbjct: 9 DKRINVVYACDNIQALPLGVSIASALENRAEGNPINIHVLSYRISRSNRKSIASQFDGRD 68 Query: 84 LRITLYRI---NTDKLQCLPCTQV--WSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGD 138 + + I N L+ L + + A Y RL +++ +DR +YLD D++ D Sbjct: 69 DTLCWHEITGENRKLLEDLFTSSNRPYPPAAYARLLISEVIP-NIDRAIYLDTDIIVATD 127 Query: 139 ISQLLHLGLNGAVAAVVKDVEPMQEKAVSRL----------SDPELLGQYFNSGVVYLDL 188 +S L + +GA ++D+ + E YF SGV+ D+ Sbjct: 128 LSPLWNTPFDGAGLLAIQDLPTSNDHIKRLRALLSPEDISRYGIEDGDSYFQSGVLVFDM 187 Query: 189 KKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYT---IKSELK 245 K++ + +E + L + + +PD D +N++ + +N + + + + Sbjct: 188 KEFTKTRASE-LIECLRNYPD-LTFPDNDALNIVFHDSFKLVDPRWNQMASVFKLDAARD 245 Query: 246 DKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSII 305 + + + +IHY+G KPW +P + + AL++S W P I Sbjct: 246 TPYSAEVFQALLQDPYIIHYSGRPKPWEDGCTHPYLDRWVEALKDSAWNSWKPSRLNRAI 305 Query: 306 EFKKRYKHLLVQH 318 + R + +L + Sbjct: 306 DRIPRIQRVLAKR 318 >UniRef50_B1I7M9 Glycosyl transferase, family 8 n=16 Tax=Streptococcus pneumoniae RepID=B1I7M9_STRPI Length = 406 Score = 182 bits (462), Expect = 1e-44, Method: Composition-based stats. 
Identities = 55/293 (18%), Positives = 111/293 (37%), Gaps = 27/293 (9%) Query: 25 SECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQL 84 + +V + D Y+ + ++ S+ +N H L Y++ +F +I ++ Sbjct: 3 QDKKSVVFAGDYAYIRQIETAMKSLCRHNSH--LKIYLLNQDIPQEWFSQIRIYLQEMGG 60 Query: 85 RITLYRINTDKLQCLPCTQ--VWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQL 142 + ++ + Q + + + R F + D++LYLD+D++ GD++ L Sbjct: 61 DLIDCKLIGSQFQMNWSNKLPHINHMTFARYFIPDFVTE--DKVLYLDSDLIVTGDLTDL 118 Query: 143 LHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALS 202 L L A + G FN+GV+ ++ KKW + +K + Sbjct: 119 FELDLGENYLAAARSCFG--------------AGVGFNAGVLLINNKKWGSETIRQKLID 164 Query: 203 ILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITES-TL 261 + + + DQ ++N+L K L +YN HQ + E L Sbjct: 165 LTEKEHENVEEGDQSILNMLFKDQYSSLEDQYNFQIGYDYGAAAFKHQFIFDIPLEPLPL 224 Query: 262 LIHYTGATKPWHKWAIYPSVKYYKIALENSP------WKDDSPRDAKSIIEFK 308 ++HY KPW+++++ + + W S + FK Sbjct: 225 ILHYISQDKPWNQFSVGRLREVWWEYSLMDWSVILNEWFSKSVKYPSKSQIFK 277 >UniRef50_UPI000175831B PREDICTED: similar to UDP-glucose glycoprotein:glucosyltransferase n=3 Tax=Endopterygota RepID=UPI000175831B Length = 1506 Score = 181 bits (460), Expect = 3e-44, Method: Composition-based stats. 
Identities = 46/272 (16%), Positives = 95/272 (34%), Gaps = 15/272 (5%) Query: 5 PAIEIDKVKAWDFRLANINTSECLNVAYGVDAN-YLDGVGVSITSIVLNNRHINLDFYII 63 P I F + LN+ + Y + + + S++ + + + F+ + Sbjct: 1189 PNSGIWNSITSSFSKNEEEPDDKLNIFSVASGHLYERFLRIMMLSVLKHTKT-PVKFWFL 1247 Query: 64 ADVYNDGFFQKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTL 123 + + + +A++ L + + + Y LF L L + Sbjct: 1248 KNYLSPQIKDFLPYMAKEYGFEYELVQYKWPRWLHQQTEKQRIIWGYKILFLDVLFPLDV 1307 Query: 124 DRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQ------ 177 +++++DAD V + D+ +L L L GA + +E R Sbjct: 1308 KKIIFVDADQVVRADLKELQELDLGGAPYGYTPFCDSRKEMDGFRFWKLGYWRNHLQGRK 1367 Query: 178 YFNSGVVYLDLKKWADAKLTEKA---LSILMSKDNVYKYPDQDVMNVLLKGM-TLFLPRE 233 Y S + +DLK++ ++ L N DQD+ N ++ + LP+E Sbjct: 1368 YHISALYVVDLKRFRRIAAGDRLRGQYQALSQDPNSLSNLDQDLPNNMIHQVGIKSLPQE 1427 Query: 234 YNTIYTIKSE---LKDKTHQNYKKLITESTLL 262 + T + + KT +T+ L Sbjct: 1428 WLWCETWCDDESKARAKTIDLCNNPMTKEAKL 1459 >UniRef50_C5S494 Putative glycosyl transferase n=2 Tax=Actinobacillus minor RepID=C5S494_9PAST Length = 287 Score = 181 bits (460), Expect = 3e-44, Method: Composition-based stats. 
Identities = 65/282 (23%), Positives = 116/282 (41%), Gaps = 18/282 (6%) Query: 21 NINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAE 80 N + +N+A D NY + V I S+ +++ N+ FY+I Y D +F + + Sbjct: 1 MTNKQQTINIALAADRNYAEQVITLIKSVCYHHK--NVRFYLIHQDYPDEWFMALNQHLT 58 Query: 81 QNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDIS 140 I + + ++A ++R ++ DR++YLD+D+V G+I Sbjct: 59 NVGAEIIPVTVLDSFRFLSKLQEHITQATFYRYIIPEIPE---DRVIYLDSDIVVDGNIE 115 Query: 141 QLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKA 200 ++ NG V+D+ + + L YFN GV+ ++ + W + L E Sbjct: 116 EMYFSDFNGKYVLAVEDMYISYTE--HGYIEFPDLKPYFNGGVLLINNQLWKENDLAEYL 173 Query: 201 LSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELK--------DKTHQNY 252 + + NV + DQD++N +LK L YN I ++ Y Sbjct: 174 IQMTKQYPNVM-FGDQDILNFVLKDKWGILSHVYNYQTGIIHAFPRLEENMSDEEIITKY 232 Query: 253 KKLITE-STLLIHYTGATKPW-HKWAIYPSVKYYKIALENSP 292 +K E ++IHYT KPW + + Y + S Sbjct: 233 QKQADEVKPIIIHYTTKYKPWLNSKYFVLLREKYWFYYQLSW 274 >UniRef50_B6VUC8 Putative uncharacterized protein n=1 Tax=Bacteroides dorei DSM 17855 RepID=B6VUC8_9BACE Length = 315 Score = 181 bits (460), Expect = 3e-44, Method: Composition-based stats. 
Identities = 69/315 (21%), Positives = 126/315 (40%), Gaps = 14/315 (4%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 +++ Y V +TS+ NN+ + + Y+ + +D + + L ++ ++ Sbjct: 2 ISILCNSSNEYAIHCKVMLTSLFENNKQNDKEVYVFSTSMSDENIKGLELLGQRYGTKVQ 61 Query: 88 LYRINTDKLQCLPCTQ-VWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLG 146 + +++ KLQ LP + A Y RLFA LL +++LLYLD D++ D+ L + Sbjct: 62 IIIVDSQKLQFLPIHFAYHNIACYLRLFAADLLP-GINKLLYLDCDIIVNSDLKALWDID 120 Query: 147 LNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMS 206 + A D+ E + E Y N+GV+ ++ W + + +K L + Sbjct: 121 ITDYAFAATHDL-TYCEPNFKKNLQLEENDTYINTGVMLINCDYWRNNNVAQKVLDYAIH 179 Query: 207 KDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNY--KKLITESTLLIH 264 + DQD +N ++G E+N E + Y I + +IH Sbjct: 180 NGDKMIAADQDALNATMQGSFKLFSEEWNVYPDYFYEKPNLYTNVYPILDEIRRNPKIIH 239 Query: 265 YTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAK-SIIEFKKRYKHLLVQHHYISG 323 + KPW + +P Y + K + K SI R KH L+ Sbjct: 240 FL-YVKPWFNYCNHPLRYLYGKYYAIAEGKPFILKRNKESIKRDIARLKHCLLD------ 292 Query: 324 IIAGVCYLCRKYYRK 338 G+ Y Y ++ Sbjct: 293 -FMGIKYYYHVYDKR 306 >UniRef50_A3CM53 Glycosyltransferase, putative n=9 Tax=Bacteria RepID=A3CM53_STRSV Length = 1074 Score = 181 bits (459), Expect = 3e-44, Method: Composition-based stats. 
Identities = 64/270 (23%), Positives = 108/270 (40%), Gaps = 29/270 (10%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLY 89 + D Y + V +I SI+ N+ N+ Y+ +D +F+ +L EQ + Sbjct: 4 IVLVGDQAYQEQVSTTIKSILYYNK--NVKIYVFNQGLSDEWFRDFNELVEQLDSELVNI 61 Query: 90 RINTDKLQCLP-CTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLN 148 ++ + S A Y R F Q + R+LYLD+D+V D+ L + L Sbjct: 62 SLDQVTISPEWLTQDHISSATYARYFIPQFVAE--GRVLYLDSDLVVNRDLQPLFDIPLE 119 Query: 149 GAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKL-------TEKAL 201 G + A V D G FN+GV+ +D + W + +L T++ + Sbjct: 120 GKLVAAVGDAG----------------GYGFNAGVLLIDNRSWKERELQESFIKETDRIM 163 Query: 202 SILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTL 261 ++ S DQ V+N +L L L + YN + + N + + L Sbjct: 164 GLVQSGQMEDFNGDQTVLNHVLAQDWLPLDKIYNLQV-GHDLVAFYSGWNGHFELDQEPL 222 Query: 262 LIHYTGATKPWHKWAIYPSVKYYKIALENS 291 +IHYT KPW+ Y + + S Sbjct: 223 IIHYTTFRKPWNSEVSYRYRQLWWDFQALS 252 Score = 180 bits (457), Expect = 6e-44, Method: Composition-based stats. 
Identities = 55/283 (19%), Positives = 110/283 (38%), Gaps = 25/283 (8%) Query: 5 PAIEIDKVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIA 64 P + K+++ + V +A Y + V +I SIV +N + FY+I Sbjct: 387 PQEMVRKLRSLMKKEKPQAFRA---VVLAANAAYSEQVLTTIKSIVCHN--RFIKFYVIN 441 Query: 65 DVYNDGFFQKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLD 124 + +F + K + +I R++ + +S ++ R F + D Sbjct: 442 SDFPTEWFVSMRKKLAKLDCQIVNARVDGSHISQYKTNIHYS--VFLRYFTATFVEE--D 497 Query: 125 RLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVV 184 + LYLD D+V D+S++ + L V+D+ Q FNSGV+ Sbjct: 498 QALYLDCDIVVTRDLSEIFAVDLGSYPLGAVRDLGGEVY----------FGEQIFNSGVL 547 Query: 185 YLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSEL 244 +++ W + + + + + + + DQ ++N+L + + LP YN I + Sbjct: 548 LINVNYWRENDIAGQLIEMTDNLHDKVTQDDQSILNMLFENRWMELPFAYNCITLHTTFS 607 Query: 245 KDKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIA 287 + + +IHY KPW ++ + + Sbjct: 608 DYEPEKGLY------PPVIHYLTERKPWKEYTQSIYREVWWFY 644 >UniRef50_UPI0001A45357 glycosyl transferase family protein n=1 Tax=Neisseria subflava NJ9703 RepID=UPI0001A45357 Length = 264 Score = 181 bits (459), Expect = 4e-44, Method: Composition-based stats. 
Identities = 58/262 (22%), Positives = 100/262 (38%), Gaps = 9/262 (3%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 + + D Y + V + S+ +N ++FY++ + + + + + R+ Sbjct: 4 ITIVLAADTGYAEQVHTLMKSVCTHNTG--VNFYLMHNTFRKEWINYTNQKLAASGSRLN 61 Query: 88 LYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGL 147 +I D S A +FRL L +DR LYLD+D+V + L +L + Sbjct: 62 DVKIEMD-FSQYRRLSHISDAAFFRLMMQHL---PVDRALYLDSDMVVTQSLHDLFNLDM 117 Query: 148 NGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSK 207 G A V+D + YFNSG++ DL +W + E+ L + Sbjct: 118 RGYPVAAVQDSYLARTDWNHPTGLHTT--PYFNSGMLLADLGQWRKHNIAEQLLQTAATI 175 Query: 208 DNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTG 267 D Y DQ +N + + L L +N + + L + +IHYT Sbjct: 176 DKTVPYGDQCFLNTVFQENWLQLEESWNYQTGARRFFQTYDLDEMFPLPDTTPPIIHYTT 235 Query: 268 ATKPWH-KWAIYPSVKYYKIAL 288 KPW + P + Y Sbjct: 236 LAKPWLCDYGKIPFEEIYWQYY 257 >UniRef50_C1QC00 LPS:glycosyltransferase n=1 Tax=Brachyspira murdochii DSM 12563 RepID=C1QC00_9SPIR Length = 332 Score = 181 bits (459), Expect = 4e-44, Method: Composition-based stats. 
Identities = 64/304 (21%), Positives = 123/304 (40%), Gaps = 22/304 (7%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNR-HINLDFYIIADVYNDGFFQKIAKLAEQNQLRI 86 +++ D NY +G +I SI+ N++ + F+++ + K+ L I Sbjct: 1 MDICLSADDNYAKYMGTTIASILSNSKEDEEIYFHLLDGGITEENKNKLLSLKNIKNCDI 60 Query: 87 TLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLG 146 Y +N + + +FRL L+ +D+LLYLD D + + +L + Sbjct: 61 IFYSVNNMNYK-------YDAPHFFRLNVPSLIP-NVDKLLYLDCDTIVLNSLKELFEID 112 Query: 147 LNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMS 206 ++ A +DV + + + YFNSG++ ++ K W D KL Sbjct: 113 ISNYYALACEDVFLNCIISFKNMHGLNVNDIYFNSGMLMINNKLWRDDKLENLFYDDYSK 172 Query: 207 KDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYT 266 N + DQDV+N ++KG + ++N + K +IHY Sbjct: 173 FGN-TGHADQDVLNRIIKGRVKIVDSKWNFL--------SHKKVYSKAPDISLVNIIHYA 223 Query: 267 GATKPWHKWAI-YPSVKYYKIALENSPWKDDSPRDAKSIIEFKK--RYKHLLVQHHYISG 323 G KPW + + + + + +PW ++ DA I+ +K Y+ L + + + Sbjct: 224 G-EKPWKETSSKAFFIDEFWKYYQLTPWCRENTLDAVKIMISQKVNDYEELKLNVNRVKF 282 Query: 324 IIAG 327 + Sbjct: 283 LGFY 286 >UniRef50_C7RG54 Glycosyl transferase family 8 n=1 Tax=Anaerococcus prevotii DSM 20548 RepID=C7RG54_ANAPD Length = 273 Score = 180 bits (458), Expect = 5e-44, Method: Composition-based stats. 
Identities = 55/263 (20%), Positives = 104/263 (39%), Gaps = 2/263 (0%) Query: 31 AYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLYR 90 +D NY+ + V +TSI +NN D Y+I ++ + + + ++ + R Sbjct: 6 LLTLDENYIPQMKVLMTSIYINNPGRIFDVYLIHSRISEDKLKDLGEDLKKFSYTLYPIR 65 Query: 91 INTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGA 150 D T + + MY+RL A + L L +LYLD D++ + LL ++ Sbjct: 66 ATDDLFSFAKVTDRYPKEMYYRLLAGEFLPENLGEILYLDPDMLVINPLDDLLRTDISDY 125 Query: 151 VAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNV 210 + A Y+NSG++ ++LK+ + ++ S + Sbjct: 126 ILAAASHTGKTDMANNVNRIRLGTDTDYYNSGLLLINLKRAREEIDPDEIFSFVEDNHMN 185 Query: 211 YKYPDQDVMNVLLKGMTLFL-PREYNTIYTIKSELKDKTHQNYK-KLITESTLLIHYTGA 268 PDQD++N + L YN S ++ + + + T+++H+ G Sbjct: 186 LLLPDQDILNAMYGDRIYPLDDLIYNYDARNYSSYLIRSKKQADLAWLMDHTVVLHFCGR 245 Query: 269 TKPWHKWAIYPSVKYYKIALENS 291 KPW K YK + + Sbjct: 246 DKPWKKNHRNKFTSLYKHYMSLT 268 >UniRef50_C9PNX4 Family 8 glycosyl transferase n=1 Tax=Pasteurella dagmatis ATCC 43325 RepID=C9PNX4_9PAST Length = 285 Score = 180 bits (458), Expect = 5e-44, Method: Composition-based stats. 
Identities = 72/284 (25%), Positives = 120/284 (42%), Gaps = 14/284 (4%) Query: 21 NINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAE 80 + +N+ D + + V I S+ +N+ N+ FY++ Y +FQ + + Sbjct: 5 SQGRDSNMNIVLSADVQFSEQVKTLIKSVSYHNK--NVHFYLLNKDYPSEWFQILNQYLA 62 Query: 81 QNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDIS 140 I +++++ + P S A YFR QL LDR+LYLD DVV G ++ Sbjct: 63 YFGSNIIDAKVDSEVISTFPTLDHISEASYFRYLLGQL---PLDRVLYLDCDVVVTGSLT 119 Query: 141 QLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKA 200 ++ + + V+D A + + YFNSG++ +DL KW D + + Sbjct: 120 EIYYTDFGDNMMYAVEDAFLNI--APHSYKEFPDMKPYFNSGMLLIDLNKWRDQNIENQL 177 Query: 201 LSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYT----IKSELKDKTHQNYKKLI 256 + + N+Y Y DQD MN++LKG L + YN + + YK L Sbjct: 178 MDLTKQAVNLY-YGDQDAMNIILKGKWQALDKIYNYQTGSLIAFIQHKMPEALEKYKDLQ 236 Query: 257 TESTLLIHYTGATKPWHKWAIY-PSVKYYKIALENSPWKDDSPR 299 + +IHY KPW P Y + W+D + Sbjct: 237 GQQPKVIHYITRYKPWLLPEYDLPFRDQYWAYYQL-EWQDIIRK 279 >UniRef50_A0KQP2 Glycosyl transferase, family 8 n=2 Tax=Aeromonas RepID=A0KQP2_AERHH Length = 366 Score = 180 bits (458), Expect = 5e-44, Method: Composition-based stats. 
Identities = 63/281 (22%), Positives = 138/281 (49%), Gaps = 23/281 (8%) Query: 24 TSECLNVAYGVDANYLDGVGVSITSIVLNNRH-INLDFYIIADVYNDGFFQKIAKLAEQN 82 + ++ A+ +D ++ + I S+ + H + L +++A ++ K++KL E Sbjct: 1 MRKIIHSAFCIDDSFAVHLAALIHSLGKHLSHDLQLQCHVLA-RLSETNKFKLSKL-ESE 58 Query: 83 QLRITLYRINTDKLQCLPCTQ----VWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGD 138 L I Y N + +P + + Y+R +L ++D++L++D+D++ GD Sbjct: 59 NLVIKFYD-NLPDYKDIPISNLYNNRLNEVTYYRFAIPHIL-KSIDKVLFIDSDMIALGD 116 Query: 139 ISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTE 198 IS L + + A+ AVV D +K + G+YFN+G + ++L KW ++E Sbjct: 117 ISPLWSIDMGDAIVAVVSDHILGCDKKKQLMRGI-SSGKYFNAGFMLMNLDKWRAKNISE 175 Query: 199 KALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITE 258 +AL +L+ + +++ DQD +N++L+ T+++ ++N N+ Sbjct: 176 QALRLLIENNG-FEHNDQDALNIVLENKTVYIDNKWNAQP------------NHLAQNNF 222 Query: 259 STLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPR 299 +L+H+ G KPWH ++ +P Y ++ + + ++ + Sbjct: 223 LPILVHFCGQEKPWHIYSNHPFKGSYLVSRRETDYANEPLQ 263 >UniRef50_C4Z1V1 Putative uncharacterized protein n=1 Tax=Eubacterium eligens ATCC 27750 RepID=C4Z1V1_EUBE2 Length = 607 Score = 180 bits (457), Expect = 5e-44, Method: Composition-based stats. 
Identities = 59/329 (17%), Positives = 124/329 (37%), Gaps = 33/329 (10%) Query: 14 AWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFF 72 +++ + V + ++ Y + V + S+ ++ ++ D + Sbjct: 259 SYNREHEEYMAVSRIPVFFSINEQYAPYLAVCLKSLAVHVACDERYRIIVMCDNVKNITM 318 Query: 73 QKIAKLAEQN-QLRITLYRIN--------------TDKLQCLPCTQVWSRAMYFRLFAFQ 117 ++ + + + I I TD+ + + ++ +YFRLF + Sbjct: 319 IQLRNVIKDYENIDIEFVDIRKKMYEYSESFGQTVTDRQENRLYSGEFTLTIYFRLFIAE 378 Query: 118 LLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDP---EL 174 L L++ +Y+D+D V DI++L + + A+ V+D + ++ + Sbjct: 379 LFP-ELNKAVYIDSDTVINDDIAKLYSVDMGDAMFGAVRDTFAGKNTILAHYIENVVGIE 437 Query: 175 LGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREY 234 +Y NSGV+ ++L K A L ++ L ++ PDQD +N + FL +E+ Sbjct: 438 RNEYVNSGVLLMNLDKIRQAHLADRFLKLMAEYHFDSVAPDQDYINSMCAKEIYFLDKEW 497 Query: 235 NTIYTIKSELKDKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWK 294 N + E LIHY KPWH + P +Y+ S + Sbjct: 498 NVMPNKGGEY------------IARPKLIHYNLFDKPWHY-SEIPYEEYFWQYAAESGFY 544 Query: 295 DDSPRDAKSIIEFKKRYKHLLVQHHYISG 323 + K + +K+ ++ Sbjct: 545 PLLIKQRKQYGDNEKKADRENLKKLLARA 573 Score = 91.6 bits (226), Expect = 3e-17, Method: Composition-based stats. 
Identities = 49/274 (17%), Positives = 101/274 (36%), Gaps = 36/274 (13%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADV----------YNDGFFQKIAK 77 +N+ Y D G+ +S S++ N L+ YI+ + F + + + Sbjct: 1 MNILYCGDKTMQKGILLSSMSLIKNV-DEPLNIYILTVDYGEKGINYKPVDKAFAKYLEE 59 Query: 78 LAEQNQLRITLYRINT-----DKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDAD 132 ++ +++ ++ ++ ++L ++ RLFA + DR+LYLD D Sbjct: 60 KLNKSDIKVNVFLVDVTRYFVEELPEANMQSRFTACCMLRLFADKTDIK--DRVLYLDTD 117 Query: 133 VVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWA 192 V+C+ H ++G A V D Y NSGV+ ++++ Sbjct: 118 VLCRKGFRDFYHQNMDGIEIAGVSD----------YYGRWLFGDGYINSGVMLMNMRMIR 167 Query: 193 DAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNY 252 L EK + K PDQ +N R++N + + Sbjct: 168 QNGLLEKCREQCIRK--EMFMPDQTAVN-TFATRVNLCGRKFNDQRRLHDNTVFQHFTTT 224 Query: 253 KKLITESTLLIHYTGATKPWHKWAIYPSVKYYKI 286 ++ ++ T + KPW ++ + ++ Sbjct: 225 FRVF---PVIR--TVSVKPWEIDKMHNILGLHEY 253 >UniRef50_C1FE59 Glycosyltransferase family 24 protein n=1 Tax=Micromonas sp. RCC299 RepID=C1FE59_9CHLO Length = 1662 Score = 179 bits (455), Expect = 9e-44, Method: Composition-based stats. 
Identities = 50/272 (18%), Positives = 96/272 (35%), Gaps = 15/272 (5%) Query: 5 PAIEIDKVKAWDFRLANINTSECLNVAYGVDA-NYLDGVGVSITSIVLNNRHINLDFYII 63 P I + + D LA E +++ Y + V + S+ N ++ + F+ + Sbjct: 1353 PRSRIAENASRDLTLAEACHGEKIHIFSVASGYLYERLIKVMMLSVRRNTKN-PIKFWFV 1411 Query: 64 ADVYNDGFFQKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTL 123 + + F Q + A + + L + Y LF + L+L Sbjct: 1412 KNWLSPRFKQYLPHFASRYRFEYELVTYKWPTWLQKQTDKQRIIWAYKLLFLDVIFPLSL 1471 Query: 124 DRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELL------GQ 177 ++++++DAD V + DI +L + L+GA A + + R Sbjct: 1472 EKIIFVDADQVVRADIKELWEVDLHGAPYAYTPFCDDNKVMDGFRFWKQGFWERHLDGKP 1531 Query: 178 YFNSGVVYLDLKKWADAKLTEKA---LSILMSKDNVYKYPDQDVMNVLLKGM-TLFLPRE 233 Y S + +DLK++ + L N DQD+ N + LP++ Sbjct: 1532 YHISALYVVDLKRFRQLAAGDTLRVIYENLSKDPNSLANLDQDLPNYAQHQVPIFSLPQQ 1591 Query: 234 YNTIYTIKSE---LKDKTHQNYKKLITESTLL 262 + + L KT +T+ L Sbjct: 1592 WLWCESWCGNQTKLSAKTIDLCNNPMTKEPKL 1623 >UniRef50_B1I7N1 Glycosyl transferase, family 8 n=8 Tax=Streptococcus pneumoniae RepID=B1I7N1_STRPI Length = 817 Score = 179 bits (455), Expect = 1e-43, Method: Composition-based stats. 
Identities = 57/271 (21%), Positives = 100/271 (36%), Gaps = 29/271 (10%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLY 89 + D NY+ + +I SI+ +NR + YI+ +F+K K+A I Sbjct: 5 IVLAGDRNYIRQLETTIKSILYHNRD--VKIYILNQDIMPDWFRKPRKIARMLGSEIIDV 62 Query: 90 RINTDK-LQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLN 148 ++ Q S Y R F + D++LYLD+D++ + +L + L Sbjct: 63 KLPEQTVFQDWEKQDHISSITYARYFIADYIQE--DKVLYLDSDLIVNTSLEKLFSICLE 120 Query: 149 GAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSI----- 203 A VKD + + FN+GV+ ++ KKW KL E+ + Sbjct: 121 EKSLAAVKDTDGIT----------------FNTGVLLINNKKWRQEKLKERLIEQSIVTM 164 Query: 204 --LMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTL 261 + + DQ + N +L+ L L R YN + + + + Sbjct: 165 KEVEEGRFEHFNGDQTIFNQVLQDDWLELGRAYNLQV-GHDIVALYNNWQEHLAFNDKPV 223 Query: 262 LIHYTGATKPWHKWAIYPSVKYYKIALENSP 292 +IH+T KPW + + Sbjct: 224 VIHFTTYRKPWTTLTANRYRDLWWKFHDLEW 254 >UniRef50_Q09332 UDP-glucose:glycoprotein glucosyltransferase n=15 Tax=Neoptera RepID=UGGG_DROME Length = 1548 Score = 179 bits (455), Expect = 1e-43, Method: Composition-based stats. 
Identities = 44/255 (17%), Positives = 96/255 (37%), Gaps = 12/255 (4%) Query: 18 RLANINTSECLNVAYGVDAN-YLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIA 76 + A +E +N+ + Y + + + S++ + + + F+ + + + F + Sbjct: 1230 QAATDEDTETINIFSVASGHLYERLLRIMMVSLLKHTKS-PVKFWFLKNYLSPQFTDFLP 1288 Query: 77 KLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCK 136 +A + + L + + + + Y LF L L + +++++DAD + + Sbjct: 1289 HMASEYNFQYELVQYKWPRWLHQQTEKQRTIWGYKILFLDVLFPLNVRKIIFVDADAIVR 1348 Query: 137 GDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELL------GQYFNSGVVYLDLKK 190 DI +L + L GA A + +E R +Y S + +DLK+ Sbjct: 1349 TDIKELYDMDLGGAPYAYTPFCDSRKEMEGFRFWKQGYWRSHLMGRRYHISALYVVDLKR 1408 Query: 191 WADAKLTEKA---LSILMSKDNVYKYPDQDVMNVLLKGM-TLFLPREYNTIYTIKSELKD 246 + ++ L N DQD+ N ++ + LP ++ T S+ Sbjct: 1409 FRKIAAGDRLRGQYQALSQDPNSLSNLDQDLPNNMIHQVAIKSLPDDWLWCQTWCSDSNF 1468 Query: 247 KTHQNYKKLITESTL 261 KT + T Sbjct: 1469 KTAKVIDLCNNPQTK 1483 >UniRef50_A4UX76 Putative LPS biosynthesis protein n=2 Tax=Lactobacillaceae RepID=A4UX76_9LACO Length = 316 Score = 179 bits (455), Expect = 1e-43, Method: Composition-based stats. 
Identities = 63/281 (22%), Positives = 107/281 (38%), Gaps = 27/281 (9%) Query: 24 TSECLNVAYGVDANYLDGVGVSITSIVLNN-RHINLDFYIIADVYNDGFFQKIAKLAEQN 82 ++ + + Y VD NY + VS+ S+V + + ++ D N ++ E + Sbjct: 2 ENQTVPIFYAVDDNYAPYLAVSLASLVAHTSPDRHYQVIVLCDDLNTDNQGRLKA-FETD 60 Query: 83 QLRITLYRINTDKLQ------CLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCK 136 L+I IN Q + ++ +YFRLF +L LD+ LYLDAD V Sbjct: 61 NLKIQFVSINDRLKQEITDKNNKLRSDYFTFTIYFRLFIAELFP-KLDKALYLDADTVVL 119 Query: 137 GDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELL---GQYFNSGVVYLDLKKWAD 193 D+ +L L + V D + + +Y SGV+ ++L + Sbjct: 120 KDVGELFDTQLGDNLVGAVPDPFVGHTPETIDYVEQAVGIDSQKYVCSGVLLMNLAEMRR 179 Query: 194 AKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYK 253 K E L +L PDQD MN + + +L ++ T ++ Sbjct: 180 LKFAEHFLQLLNKYHFKCLAPDQDYMNAIARNRIYYLNPSWHIQITTPQDV--------- 230 Query: 254 KLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWK 294 LIHY KPW P Y+ + + ++ Sbjct: 231 -----DPWLIHYNLFAKPWRYDDA-PRQSYFWTYAKQTDYE 265 >UniRef50_A8FNA2 Putative uncharacterized protein n=2 Tax=Campylobacter jejuni subsp. jejuni 81116 RepID=A8FNA2_CAMJ8 Length = 791 Score = 179 bits (455), Expect = 1e-43, Method: Composition-based stats. 
Identities = 64/336 (19%), Positives = 131/336 (38%), Gaps = 17/336 (5%) Query: 6 AIEIDKVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNR-HINLDFYIIA 64 A EIDK F L + + + + DANY + V + SI + + N D YI+ Sbjct: 361 AGEIDKEIDNFFILPPQDKLSHIPIVFSCDANYFSYLTVVLQSIKEKSSENYNYDIYILH 420 Query: 65 DVYNDGFFQKIAKLAEQNQLRITLYRINTDKLQ-----CLPCTQVWSRAMYFRLFAFQLL 119 + + QK+ + I I+ +S A Y+R F ++ Sbjct: 421 NKLDKSLTQKLINYIQAENFSIKFVDISRILNLLKSQIQFYTALFFSEATYYRFFIPKIF 480 Query: 120 GLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPEL--LGQ 177 +++YLD D++ K D++ L + + +AA + ++A R++ ++ Sbjct: 481 -KEFKKIIYLDTDIIVKQDLNLLYSIDFDKPLAAAKCMIFSQVKQADHRITKLKMKQPEN 539 Query: 178 YFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTI 237 YF +GV+ +++K T+K L+ L + DQDV+N + +G ++ ++N + Sbjct: 540 YFQAGVMVYNIQKCLKMDFTQKCLNKLQELKDP-PLVDQDVLNAVFEGDIHYISLKWNCL 598 Query: 238 YTIKSE------LKDKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENS 291 + + L K + +IHY KPW+ + P + + Sbjct: 599 WNVSYRIPNFKILYSKDFLKDYQEAERDPYIIHYCDYFKPWNSPHL-PKADIWWHYARQT 657 Query: 292 PWKDDSPRDAKSIIEFKKRYKHLLVQHHYISGIIAG 327 P+ ++ + + + ++ Sbjct: 658 PFYEEILFKNITQNSLNIIQNSIQGAVERVKAHLSY 693 >UniRef50_A5LNA9 Glycosyl transferase, family 8 n=2 Tax=Streptococcus pneumoniae RepID=A5LNA9_STRPN Length = 402 Score = 179 bits (455), Expect = 1e-43, Method: Composition-based stats. 
Identities = 64/312 (20%), Positives = 111/312 (35%), Gaps = 27/312 (8%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLY 89 + G D NY D + +I SI +NR L FYI + +F + K E+ I Sbjct: 7 IVLGADNNYRDKLETTIKSICYHNRD--LKFYIFNEDIPKEWFYLMEKRLEKLNCEILNI 64 Query: 90 RINTDKLQCLPCTQ-VWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLN 148 I+ +K++ YFR F + + DR +YLD D+V G+I+ L Sbjct: 65 EIDAEKVKYFSTPDEHIKYMTYFRYFIAEFVKE--DRAVYLDCDMVIHGNINPLFQKDFE 122 Query: 149 GAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKD 208 G V D FN+G++ +++ KW + + L + K Sbjct: 123 GNYIIAVPDGW---------------YKNIFNAGMMMVNVHKWKTDNICQNLLELTAEKH 167 Query: 209 NVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITES---TLLIHY 265 Y DQ V+N+L + + YN + + + + + +IH+ Sbjct: 168 QEI-YGDQGVLNLLFENKWKKVSPHYNFMVGLDTLGYWAQKPEWFLNSWDENYKPAIIHF 226 Query: 266 TGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHLLVQHHYISGI- 324 G KPW+ + + W+ + F + L + Sbjct: 227 EGKDKPWNDSLKTRYRELWWFY-NGLDWQTILSQVDNKPTTFSEIATVSLFHTAIFTDTH 285 Query: 325 -IAGVCYLCRKY 335 + + YL K Sbjct: 286 ELEHIEYLVEKL 297 >UniRef50_B3WD33 WbbM protein n=8 Tax=Lactobacillus RepID=B3WD33_LACCB Length = 318 Score = 179 bits (454), Expect = 1e-43, Method: Composition-based stats. 
Identities = 60/274 (21%), Positives = 105/274 (38%), Gaps = 23/274 (8%) Query: 23 NTSECLNVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKIAKLAEQ 81 + + + VD Y+ + V++TSI N + + I+ ++A LA Sbjct: 2 PQQTTVPIFFSVDDGYVPCLAVALTSIRTNKDPQTDFVINILNSGLLQKNQTRLAALAAP 61 Query: 82 NQLRITLYRINTDKLQ-----CLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCK 136 I ++ Q + +YFRLF + D+ +Y+DAD V Sbjct: 62 -HFTINFIDMDAVTQQISGDTNKLRGDYVTLTIYFRLFIADMFP-QYDKAIYIDADTVAD 119 Query: 137 GDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPE---LLGQYFNSGVVYLDLKKWAD 193 GD+++L L + A V D M + G+Y NSGV+ L+L + Sbjct: 120 GDLAELFTTDLGDNLVAGVADPVMMTYPETIEYIQRDFGVQPGEYINSGVLILNLAQMRQ 179 Query: 194 AKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYK 253 +++ L +L + DQD +NV+ + +LP+ +N + + Sbjct: 180 EHFSDRFLHLLKTYHFTMIAADQDYINVIAQHRIKYLPKTWNMQTGVPT----------- 228 Query: 254 KLITESTLLIHYTGATKPWHKWAIYPSVKYYKIA 287 LIHY KPWH + ++ A Sbjct: 229 -AAESGGKLIHYNLFGKPWHYRDAKLAANFWHYA 261 >UniRef50_A8XPN2 Putative uncharacterized protein n=2 Tax=Caenorhabditis briggsae RepID=A8XPN2_CAEBR Length = 1495 Score = 179 bits (454), Expect = 1e-43, Method: Composition-based stats. 
Identities = 46/261 (17%), Positives = 95/261 (36%), Gaps = 15/261 (5%) Query: 16 DFRLANINTSECLNVA-YGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQK 74 + + E +NV Y + + I S++ N +H + F+++ + + F + Sbjct: 1194 NLVSSKEKPQEVINVFSLASGHLYERFMRIMIVSVMKNTKH-PVKFWLLKNYLSPQFKET 1252 Query: 75 IAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVV 134 + LA+ L + + + LF L L + +++++DAD V Sbjct: 1253 LPTLAKHYDFEYELVEYKWPRWLHQQKEKQRIMWGFKILFLDVLFPLDVGKVIFVDADQV 1312 Query: 135 CKGDISQLLHLGLNGAVAAVVKDVEPMQE------KAVSRLSDPELLGQYFNSGVVYLDL 188 + D+ +L+ L A V E +E ++ +Y S + +DL Sbjct: 1313 VRADLMELMKFDLGNAPYGYVPFCESRKEMDGFRFWKQGYWANHLAGRRYHISALYVIDL 1372 Query: 189 KKWADAKLTEKA---LSILMSKDNVYKYPDQDVMNVLLKG-MTLFLPREYNTIYTIKSEL 244 +K+ ++ L N DQD+ N ++ LP+E+ T + Sbjct: 1373 QKFRQIAAGDRLRGQYQGLSGDPNSLANLDQDLPNNMIHQVKIKSLPQEWLWCETWCDDA 1432 Query: 245 KDKTHQ---NYKKLITESTLL 262 K + +T+ L Sbjct: 1433 SKKNAKTIDLCNNPLTKEPKL 1453 >UniRef50_B9BAZ6 Glycosyl transferase family 8 protein n=1 Tax=Burkholderia multivorans CGD1 RepID=B9BAZ6_9BURK Length = 617 Score = 178 bits (451), Expect = 3e-43, Method: Composition-based stats. 
Identities = 72/337 (21%), Positives = 132/337 (39%), Gaps = 36/337 (10%) Query: 19 LANINTSECLNVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKIAK 77 +++ D N++ + I S+ N + LD ++ + + K Sbjct: 275 PPEPLGGNAVSIVTVADGNFVPHLAAFIASVQDNIDPERVLDLIVLDGGIPADQQRLLMK 334 Query: 78 LAEQNQL-RITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCK 136 +N R++ + +P +S A ++RL +LL R++Y+D+D + Sbjct: 335 QFHRNGKGRLSFIQ-CAHLFSDIPLHGPFSAATFYRLSMGELL-AKHRRVVYVDSDTIVL 392 Query: 137 GDISQLLHLGLNGAVAAVVKDV------------------EPMQEKAVSRLSDPELLGQY 178 GD+S+L L L A V DV P R+ +Y Sbjct: 393 GDLSELFDLDLGNNAVAAVPDVIMKSFVSSGVPALREAGGAPAGIYLKERVGMGNRGNEY 452 Query: 179 FNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIY 238 F +G++ +DL ++ ++ E A L+++ Y + DQDV+N L G FL +N + Sbjct: 453 FQAGLIVIDLDEFRRLRIGEDAYKDLLAR--RYWFLDQDVLNKYLLGHVKFLDLSWNVVN 510 Query: 239 ---TIKSELKDKTHQNYKKLITESTLLIHYTGAT-KPWHKWAIYPSVKYYKIALENSPWK 294 + S L+ K++ ++HY G KPW++ P +Y L + W Sbjct: 511 ASMDVLSGLETDIAAKVKEVFAA-PSMVHYAGHEAKPWNRPTA-PLAHFYWYYLRRTYWY 568 Query: 295 ----DDSPRDAKSIIEFKKR--YKHLLVQHHYISGII 325 D P +E ++ YK L + G + Sbjct: 569 ESVIDRRPISPTLDVELQRSRLYKRLRAIWRRMPGFV 605 >UniRef50_B2UQN6 Glycosyl transferase family 8 n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UQN6_AKKM8 Length = 371 Score = 177 bits (450), Expect = 4e-43, Method: Composition-based stats. 
Identities = 75/343 (21%), Positives = 129/343 (37%), Gaps = 28/343 (8%) Query: 20 ANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRH-INLDFYIIADVYNDGFFQKIAKL 78 + V + + +GV+I ++ L+ D +I+ D + Q++ ++ Sbjct: 13 PASPEKSRIPVMFSATGGWGLPLGVAIHTLCLHASSGRFYDIHIVHDGMDARIIQELNQV 72 Query: 79 AEQN-QLRITLYRINTDKLQCLPC--TQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVC 135 A Q+ ++ ++ + +S Y RL A L R++YLDADV+ Sbjct: 73 AAPFPQVSLSFLQLPEEFRHLFQNGNKDRYSPLAYARLMAGSLFP-QYGRIVYLDADVLL 131 Query: 136 KGDISQLLHLGLNGAVAAVVKD-----------VEPMQEKAVSRLSDPELLGQYFNSGVV 184 GD+++L L GA A D + P E + LS P Y NSGV+ Sbjct: 132 AGDVAELYFSDLRGASVAAAGDGLALWSIEKGTMHPHLEYMGNYLSFPLS---YCNSGVL 188 Query: 185 YLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSEL 244 LDL + L + L L S+ + + YPDQD++N+ L G LP E+N + + Sbjct: 189 VLDLDQMRRRNLEHRLLQQLRSRPDPFPYPDQDILNIALHGDMTTLPPEWNFQFLSWTWD 248 Query: 245 KDKTHQNYKKLITEST--------LLIHYTGATKPW-HKWAIYPSVKYYKIALENSPWKD 295 ++KT L+H G KPW +++ I W + Sbjct: 249 EEKTRLLRGTEFENVPTISCGRSWKLLHMVGPEKPWRLPDTPGTMGQFHWILYSFFWWPE 308 Query: 296 DSPRDAKSIIEFKKRYKHLLVQHHYISGIIAGVCYLCRKYYRK 338 + +I G + + +RK Sbjct: 309 AKRLPVFREELDAISQGLAPLLQRHIRGQQWKLFFSRGHIFRK 351 >UniRef50_Q3DM64 Glycosyl transferase, family 8, degenerate n=6 Tax=Streptococcus agalactiae RepID=Q3DM64_STRAG Length = 394 Score = 177 bits (449), Expect = 5e-43, Method: Composition-based stats. 
Identities = 57/284 (20%), Positives = 108/284 (38%), Gaps = 26/284 (9%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLY 89 + D L+ + I SI+ +N + YI+ +F+ I + + I Sbjct: 6 ICLAGDNKSLNQIQTVIKSILCHND--RVSIYILNQDIASEWFRNIQRRLLNSHSCIFDI 63 Query: 90 RINTDKLQCLPCTQ-VWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLN 148 ++ D + + + Y R + QL+ +++LYLD D + ++ +L + L Sbjct: 64 KLFDDTFKEFKTPRAHITYMAYARYYIPQLIDA--EKVLYLDIDTLVVDNLDKLFEIELG 121 Query: 149 GAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKD 208 A + D + G +FNSGV+ ++ W ++TEK L I + Sbjct: 122 DYPIAAILDGD----------------GIHFNSGVMLINSLYWMRYRVTEKLLEITEREL 165 Query: 209 NVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTGA 268 + + DQ V+N+L L L +YN + Q Y ES +IHY Sbjct: 166 DNGIFGDQGVLNLLFDNNWLKLEDKYNAQVGNDLGAFYENWQGYFDRNFESPTIIHYCTH 225 Query: 269 TKPWHKWAIYPSVKYYKIALENSP-----WKDDSPRDAKSIIEF 307 KPW+ ++ + + + ++ + F Sbjct: 226 DKPWNTFSSSRFRETWWQYEQLDWNEVFNFETYLLPEPTFEKHF 269 >UniRef50_Q01GT2 UDP-glucose:glycoprotein glucosyltransferase, putative (ISS) n=1 Tax=Ostreococcus tauri RepID=Q01GT2_OSTTA Length = 1339 Score = 177 bits (449), Expect = 5e-43, Method: Composition-based stats. 
Identities = 41/256 (16%), Positives = 89/256 (34%), Gaps = 15/256 (5%) Query: 21 NINTSECLNVAYGVDAN-YLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLA 79 ++E +++ + Y + + + S+ + ++ + F+ I + + F + +A Sbjct: 881 KKKSNERIHIFSVASGHLYERFLKIMMASVKRSTKN-PVKFWFIKNWLSPSFKDFLPHMA 939 Query: 80 EQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDI 139 E+ L + Y LF L L L++++++DAD + + D+ Sbjct: 940 EKYDFEYELVSYKWPTWLNKQTEKQRIIWAYKILFLDVLFPLELNKVIFVDADQIVRADM 999 Query: 140 SQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELL------GQYFNSGVVYLDLKKWAD 193 S+L ++ L+GA + +E R Y S + +DL ++ Sbjct: 1000 SELWNMNLHGAPYGYTPMCDNNKEMEGFRFWKQGFWQTHLRGKPYHISALYVVDLDRFRA 1059 Query: 194 AKLTEKA---LSILMSKDNVYKYPDQDVMNVLLKGM-TLFLPREYNTIYTIKSE---LKD 246 ++ L DQD+ N + LP + + Sbjct: 1060 VAAGDRLRVMYDSLSRDPGSLANLDQDLPNYAQHDVPIFSLPMPWLWCESWCGNETKAAA 1119 Query: 247 KTHQNYKKLITESTLL 262 KT +T+ L Sbjct: 1120 KTIDLCNNPLTKEPKL 1135 >UniRef50_Q4JZJ9 Putative glycosyl transferase n=8 Tax=Streptococcus pneumoniae RepID=Q4JZJ9_STRPN Length = 344 Score = 177 bits (449), Expect = 5e-43, Method: Composition-based stats. 
Identities = 67/334 (20%), Positives = 138/334 (41%), Gaps = 18/334 (5%) Query: 13 KAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFF 72 + + +N+ Y D N++D + SI S+ N ++L+ +IIAD +D Sbjct: 16 SIFFISENKFRSRNFMNIVYATDNNFVDVLSASIKSLYTTNSDLDLNLWIIADKVSDRNK 75 Query: 73 QKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDAD 132 +KI +L++Q R + I ++ S + + RLF +L ++ ++LYLD+D Sbjct: 76 EKINRLSKQFAQR-EINWIENVEIPFKLHLDRGSISSFSRLFLGSVLPSSMSKVLYLDSD 134 Query: 133 VVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWA 192 ++ + + + G + V D K ++ + FN+GV+ ++L+ W Sbjct: 135 IIVMDSLRSIFDIDFKGKILYGVNDTFN---KEYKQVLGIPIDKPMFNAGVMLINLELWR 191 Query: 193 DAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELK------- 245 + + E+ L ++ + D V+N +L LP EYN + + Sbjct: 192 NNNVEERFLQVIQKFNGTILQGDLGVLNAVLYNSFGVLPPEYNYMTIFEDLTYEEMIVFK 251 Query: 246 ---DKTHQNYKKLITESTLLIHYTGAT---KPWHKWAIYPSVKYYKIALENSPWKDDSPR 299 + + K E +L H+T + +PW + + V+ +K +K SP Sbjct: 252 KPINYYSKEEIKNARERIVLRHFTTSFLSKRPWQESSEVTHVEIFKKYYRG-AYKQASPS 310 Query: 300 DAKSIIEFKKRYKHLLVQHHYISGIIAGVCYLCR 333 +I + + L + S + + + + Sbjct: 311 KLLNIYKILPKKMSLYLLGFIQSKVRPKLYRITK 344 >UniRef50_Q5M3K9 Glycosyl transferase n=4 Tax=Streptococcus RepID=Q5M3K9_STRT2 Length = 697 Score = 176 bits (447), Expect = 8e-43, Method: Composition-based stats. 
Identities = 55/285 (19%), Positives = 108/285 (37%), Gaps = 26/285 (9%) Query: 10 DKVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYND 69 +K + T + + + Y+D V +I SIV + H N+ FY+I D ++ Sbjct: 284 EKFQVLSLAPQRYETKKR-AIVLAANYTYVDQVLTTIKSIVFH--HRNIRFYLINDDFSQ 340 Query: 70 GFFQKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYL 129 +F+ + + + R+++ ++ + A Y R F + +R LYL Sbjct: 341 EWFRGLNRHLAAFGSEVINCRVDSSHIKQFKTNSNY--ASYLRYFVADFVSE--ERALYL 396 Query: 130 DADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLK 189 D+D+V G + L L L G A V+D + + F++G + +D Sbjct: 397 DSDMVVTGSLEDLFTLDLQGRPLAAVRDYAVQGQDRQAM----------FDAGFMVIDTA 446 Query: 190 KWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPRE--YNTIYTIKSELKDK 247 W + + + + + +Q ++N++ L L + Y + S Sbjct: 447 YWKQYNMRRHLIDMTSEWHDKVPFAEQSILNMVFCNNWLTLSFDNNYAVTKSSLSGYHLP 506 Query: 248 THQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSP 292 Q+Y ++HYT KPW A + + + Sbjct: 507 NGQDY-------PKVLHYTSHRKPWLPLACQAYREVWWFYAQMDW 544 >UniRef50_A3VFX3 Glycosyltransferase n=1 Tax=Rhodobacterales bacterium HTCC2654 RepID=A3VFX3_9RHOB Length = 615 Score = 176 bits (447), Expect = 9e-43, Method: Composition-based stats. 
Identities = 72/340 (21%), Positives = 131/340 (38%), Gaps = 37/340 (10%) Query: 24 TSECLNVAYGVDANYLDGVGVSITSIVLNN-RHINLDFYIIADVYNDGFFQKIAKLAEQN 82 +NVA+ D YL + S++ + + + + + D + LA + Sbjct: 265 NDGAVNVAFTSDRPYLPQTAAMVASLIEHAAPDREYNLFYLHENIGDRDLDLLRSLAVAH 324 Query: 83 QLRITLYRIN---TDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDI 139 ITL+ IN + S A Y R F LL ++RL+YLD D+V GD+ Sbjct: 325 G-NITLHTINVGTAFSREYRARHHTPSNATYNRFLLFDLLP-DVERLVYLDVDLVLCGDV 382 Query: 140 SQLLHLGLNGAVAAVVKDV-------------EPMQEKAVSRLSD-----PELLGQYFNS 181 ++L +N A A V D +P + LSD + + +YFN+ Sbjct: 383 AELFDTDMNDAPLAAVTDALMTRVLATRVRTRDPEVPDLYAYLSDDLGLSDDQISRYFNA 442 Query: 182 GVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYT-- 239 GV+ ++ AK+ + ++ N Y + DQD++NV + + LP +N + Sbjct: 443 GVMVMNFAAMDVAKVGRELREMVA--GNRYFFRDQDILNVYFRDRFVTLPSRFNVHNSDR 500 Query: 240 IKSELKDKTHQNYKKLITESTLLIHY-TGATKPWHKWAIYPSVKYYKIALENSPWKDDSP 298 + +N ++H+ KPW + + + L +P+ + Sbjct: 501 GAYDNVPVPIRNDALAAKADPFIVHFAAAHQKPWREPDV-EFAGLFWSTLARTPFWFEVL 559 Query: 299 RDAKSIIEFKKRYKHLLVQHHYISGIIAGVCYLCRKYYRK 338 E +R++ L + GV R+ R+ Sbjct: 560 -------EATRRHRSLRARLSRPDTWKHGVVIAGRRLGRR 592 >UniRef50_B1LK07 Glycosyl transferase family 8 n=10 Tax=Enterobacteriaceae RepID=B1LK07_ECOSM Length = 630 Score = 176 bits (447), Expect = 9e-43, Method: Composition-based stats. 
Identities = 72/354 (20%), Positives = 142/354 (40%), Gaps = 44/354 (12%) Query: 25 SECLNVAYGVDANYLDGVGVSITSIVLNNR-HINLDFYIIADVYNDGFFQKIAKLAE-QN 82 E + V D NY G I SIVL++ N D ++ + + Q++ KL N Sbjct: 274 DESVPVVISFDNNYALSGGALINSIVLHSDASRNYDIVVLENKVSHLNKQRLIKLVAGHN 333 Query: 83 QLRITLYRINTD-KLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQ 141 + + + +N+ ++ + +S + Y RLF QL +++++D+D V K D++ Sbjct: 334 NISLRFFDVNSFTEMSDVHTRAHFSASTYARLFIPQLF-REYKKVVFIDSDTVVKADLAT 392 Query: 142 LLHLGLNGAVAAVVKDV-----------------EPMQEKAVSRLSDPELLGQYFNSGVV 184 LL + + + A VKD+ E+ + + +YF +G++ Sbjct: 393 LLDVEIGTNLVAAVKDIVMEGFVKFGTMSESDDGIMPAEQYLKKTLGMTNPDEYFQAGII 452 Query: 185 YLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSEL 244 ++++ + +S L +K Y + DQD+MN + G FLP E+N + + Sbjct: 453 VFNVEQMVTENTFAQLMSALKAK--KYWFLDQDIMNKVFFGRVKFLPLEWNVYHGNGNTD 510 Query: 245 KDKTHQNY-----KKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPR 299 + + + +IHY G KPW+ + + + L +PW+ + Sbjct: 511 DFFPNLKFSTYMRFLQARRNPKMIHYAGENKPWNTEKVDFYDDFLENVLS-TPWEKEIYY 569 Query: 300 DAKSI-----IEFKKRYKHLLVQHHYISGIIAGV----------CYLCRKYYRK 338 + + + + +L+Q ++ V KYY K Sbjct: 570 RQLPVATVVPNQHTELQQTVLLQTKIKRALMPYVNKYAPVGSPRRNKLIKYYYK 623 >UniRef50_Q65VF6 RfaJ protein n=1 Tax=Mannheimia succiniciproducens MBEL55E RepID=Q65VF6_MANSM Length = 309 Score = 176 bits (446), Expect = 1e-42, Method: Composition-based stats. 
Identities = 75/309 (24%), Positives = 127/309 (41%), Gaps = 19/309 (6%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLR-- 85 +N+ + D NY + V I SI+ N ++ FYI+ ++ I L + + Sbjct: 1 MNIIFNCDENYAPYLSVVIKSILDNTT-LSTQFYILDFNISEESKSCIKNLIQNINKKNS 59 Query: 86 ----ITLYRINTDKLQCLPCT-QVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDIS 140 I +I+ + QC P T S A Y RL L L++ +YLD D++ D+S Sbjct: 60 FQHSINFIKIDDNDFQCFPQTISYISSATYARLKVADYL-NELNKAIYLDIDIIVISDLS 118 Query: 141 QLLHLGLNGAVAAVVKDVEPMQEK-AVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEK 199 +L H+ L + D E R + Y N+GV+ L+LK + L +K Sbjct: 119 RLWHIDLADNLVGACLDPYIEYENQDYKRKIGLQDSQPYINAGVLLLNLKALREFNLYQK 178 Query: 200 ALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSE----LKDKTHQNYKKL 255 A+ N ++ DQD++N +LKG LFL YN ++ K K + + Sbjct: 179 AIDWNKDYPN-IQFQDQDILNGVLKGKVLFLDSRYNFTVNHRNRIKLAHKGKLLLSSLEK 237 Query: 256 ITESTLLIHYTGATKPWHKWAIY----PSVKYYKIALENSPWKDDSPRDAKSIIEFKKRY 311 T+ ++HY G+ KPW + Y P + + + K+ Sbjct: 238 ATKPICILHYVGSHKPWLPTTTMVKSCLFDQIYNSIRNKPPHWNKKYQSVPLKFQLKRIL 297 Query: 312 KHLLVQHHY 320 + + + Y Sbjct: 298 REIEDKLVY 306 >UniRef50_C3ZE29 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3ZE29_BRAFL Length = 1647 Score = 176 bits (446), Expect = 1e-42, Method: Composition-based stats. 
Identities = 43/256 (16%), Positives = 96/256 (37%), Gaps = 15/256 (5%) Query: 21 NINTSECLNVAYGVDAN-YLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLA 79 + + + +N+ + Y + + + S++ + + + F+ + + + + +A Sbjct: 1339 DADEEDVINIFSVASGHLYERLLRIMMLSVLKHTKT-PVKFWFLKNYLSPAVMDFLPHMA 1397 Query: 80 EQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDI 139 ++ + L + + + Y LF L L++ +++++DAD + + DI Sbjct: 1398 KEYGFQYELVQYKWPRWLHQQTEKQRIIWGYKILFLDVLFPLSVKKIIFVDADQIVRTDI 1457 Query: 140 SQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELL------GQYFNSGVVYLDLKKWAD 193 +L L L GA + +E R +Y S + +DLKK+ Sbjct: 1458 KELRDLDLGGAPYGYTPFCDSRKEMNGFRFWKSGYWASHLGGRKYHISALYVVDLKKFRR 1517 Query: 194 AKLTEKA---LSILMSKDNVYKYPDQDVMNVLLKGM-TLFLPREYNTIYTIKSELKD--- 246 ++ L N DQD+ N ++ + LP+E+ T + Sbjct: 1518 IAAGDRLRGQYQGLSQDPNSLSNLDQDLPNNMIHQVAIKSLPQEWLWCETWCDDASKATA 1577 Query: 247 KTHQNYKKLITESTLL 262 KT +T+ L Sbjct: 1578 KTIDLCNNPLTKEPKL 1593 >UniRef50_UPI0001AEC697 glycosyl transferase family protein n=1 Tax=Alteromonas macleodii ATCC 27126 RepID=UPI0001AEC697 Length = 361 Score = 176 bits (446), Expect = 1e-42, Method: Composition-based stats. 
Identities = 74/275 (26%), Positives = 135/275 (49%), Gaps = 23/275 (8%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 ++VA+ +D + VS+ SI+ N + ++ Y I ++G +K+ L +N Sbjct: 2 ISVAFCIDDKFAPYAAVSVISILSNTKS-FVNIYFI-GNLSEGVREKLLTL--KNDRSAM 57 Query: 88 LYRINTDKLQCLPCTQVW----SRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLL 143 ++ + L +P + + ++ + R ++L LD+++YLDADV+ GDI +L Sbjct: 58 VFVAHNLPLSTMPLSDRYVERLNKITFVRYAIAEVL-TKLDKVIYLDADVLVCGDIKRLW 116 Query: 144 HLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSI 203 L + V D M +K LS YFN+GV+ +DLK W D ++ + LS Sbjct: 117 EQPLKKSYVGAVLDHSLMSQKRHITLS--LKSKSYFNAGVLLVDLKIWRDRRIF-QYLSR 173 Query: 204 LMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLI 263 + ++Y DQDV+NV+L +L + N + K + + L++ Sbjct: 174 THNTRERWEYNDQDVLNVVLDEKVQYLGADMNVQTY-----------SLKHINIKEPLIV 222 Query: 264 HYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSP 298 H+TG KPWH +++P Y++ LE+ P+K++ Sbjct: 223 HFTGQEKPWHTSSVHPYKDQYRVLLESVPFKNNKL 257 >UniRef50_UPI000180C254 PREDICTED: similar to UDP-glucose ceramide glucosyltransferase-like 1 n=1 Tax=Ciona intestinalis RepID=UPI000180C254 Length = 1548 Score = 175 bits (445), Expect = 1e-42, Method: Composition-based stats. 
Identities = 47/266 (17%), Positives = 103/266 (38%), Gaps = 15/266 (5%) Query: 11 KVKAWDFRLANINTSECLNVA-YGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYND 69 K ++ ++ N+S+ +NV Y + + + S++ + N+ F+++ + + Sbjct: 1235 KSESKEWEEGASNSSDVINVFSLASGHLYERLMRIMMLSVMRHTTS-NVKFWVLKNYLSP 1293 Query: 70 GFFQKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYL 129 F I +AE+ L + + + + Y LF L L +++++++ Sbjct: 1294 QFKDFIPHMAEEYGFEYELVQYKWPRWLRQQTEKQRTMWGYKILFLDVLFPLNVEKIIFV 1353 Query: 130 DADVVCKGDISQLLHLGLNGAVAAVVKDV------EPMQEKAVSRLSDPELLGQYFNSGV 183 DAD + + ++ +L L L G + + + +Y S + Sbjct: 1354 DADQIVRANLKELRDLDLEGNPYGYTPFCSDRTEMDGFRFWKGGYWAQHLAGRKYHISAI 1413 Query: 184 VYLDLKKWADAKLTEKA---LSILMSKDNVYKYPDQDVMNVLLKGM-TLFLPREYNTIYT 239 +DLKK+ ++ L N DQD+ N ++ + LP+E+ T Sbjct: 1414 YVVDLKKFRQIAAGDRLRGQYQGLSQDPNSLANLDQDLPNNMIHQVGIKSLPQEWLWCST 1473 Query: 240 IKSEL---KDKTHQNYKKLITESTLL 262 S+ + KT +T+ L Sbjct: 1474 WCSDDSLSRAKTIDLCNNPLTKEPKL 1499 >UniRef50_Q9L7A2 Putative glycosyl transferase n=1 Tax=Haemophilus ducreyi RepID=Q9L7A2_HAEDU Length = 269 Score = 175 bits (445), Expect = 1e-42, Method: Composition-based stats. 
Identities = 65/282 (23%), Positives = 119/282 (42%), Gaps = 16/282 (5%) Query: 21 NINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAE 80 +N E +N+ + +Y + + +I SI L+N+H + FY++ Y +F + Sbjct: 1 MLNPLEKMNIVLAANQSYSEYILTTIKSIYLHNKH--IRFYLLNRDYPTEWFDILNNKLR 58 Query: 81 QNQLRITLYRINTDKLQCLPCTQVWSR-AMYFRLFAFQLLGLTLDRLLYLDADVVCKGDI 139 + I ++ D ++ S +FR F + D+++YLDAD+V G + Sbjct: 59 KLNSEIIDIKVTNDTIKNFKTYSHISSDTTFFRYFISDFI--EQDKVIYLDADIVVNGSL 116 Query: 140 SQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEK 199 ++L ++ A VKD+ + + FN+G++ ++ KKW + +T+ Sbjct: 117 TELYQTDISNYFLAAVKDIISEKI---------YVNNHIFNAGMLLINNKKWREHNITQF 167 Query: 200 ALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITES 259 LS+ N DQ ++N++ K L L R YN + Y + + E+ Sbjct: 168 CLSLSEKYINSLPDADQSILNLIFKDKWLKLNRGYNYLIGTDYLFFKYGKTRYLEDLGET 227 Query: 260 -TLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRD 300 L+IHY KPW Y E W+D + Sbjct: 228 IPLIIHYNTEAKPWLNIFNTRFRNIYWFYYEL-NWQDIYAKH 268 >UniRef50_D1Z8I8 Whole genome shotgun sequence assembly, scaffold_9 n=1 Tax=Sordaria macrospora RepID=D1Z8I8_SORMA Length = 1298 Score = 175 bits (443), Expect = 2e-42, Method: Composition-based stats. 
Identities = 49/276 (17%), Positives = 101/276 (36%), Gaps = 15/276 (5%) Query: 1 MDSFPAIEIDKVKAWDFRLANINTSECLNVAYGVDAN-YLDGVGVSITSIVLNNRHINLD 59 ++S P+ + A D ++ +N+ + Y + + I S++ + H + Sbjct: 965 LESAPSFDESGKPATDKSVSETAQHAEINIFSVASGHLYERMLSIMILSVMKHTTHT-VK 1023 Query: 60 FYIIADVYNDGFFQKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLL 119 F+ I + F + LA + + + + Y LF L Sbjct: 1024 FWFIEQFLSPSFKSFLPFLAAEYGFQYEMVAYKWPHWLRHQSEKQREIWGYKILFLDVLF 1083 Query: 120 GLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELL---- 175 L+L++++++DAD + + D+ L+ L L GA + E R Sbjct: 1084 PLSLEKVIFVDADQIVRTDMYDLVQLDLEGAPYGFTPMCDSRTEMEGFRFWKTGYWATYL 1143 Query: 176 --GQYFNSGVVYLDLKKWADAKLTEKALS---ILMSKDNVYKYPDQDVMNVL-LKGMTLF 229 Y S + +DL+++ + ++ L + N DQD+ N + + Sbjct: 1144 RGQPYHISALYVVDLRRFRELAAGDRLRQQYHTLSADPNSLANLDQDLPNHMQFQIPIKS 1203 Query: 230 LPREYNTIYTIKSE---LKDKTHQNYKKLITESTLL 262 LP+E+ T ++ K +T T+ L Sbjct: 1204 LPQEWLWCETWCNDETLGKARTIDLCNNPQTKEPKL 1239 >UniRef50_A2QNN6 Contig An07c0170, complete genome n=10 Tax=Leotiomyceta RepID=A2QNN6_ASPNC Length = 1495 Score = 175 bits (443), Expect = 3e-42, Method: Composition-based stats. 
Identities = 44/257 (17%), Positives = 91/257 (35%), Gaps = 15/257 (5%) Query: 20 ANINTSECLNVAYGVDAN-YLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKL 78 + +N+ + Y + + + S++ N + ++ F+ I + F + L Sbjct: 1180 STSGKQADINIFSVASGHLYERMLNIMMVSVMRN-TNHSVKFWFIEQFLSPSFKSFLPHL 1238 Query: 79 AEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGD 138 A++ + + Y LF L L LD+++++DAD + + D Sbjct: 1239 AKEYNFSYEMVTYKWPHWLRAQKEKQREIWGYKILFLDVLFPLDLDKVIFVDADQIVRTD 1298 Query: 139 ISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQ------YFNSGVVYLDLKKWA 192 + L+ L L GA + E R Y S + +DL ++ Sbjct: 1299 MYDLVSLDLEGAPYGFTPMCDSRHEMEGFRFWKQGYWKNFLRGQPYHISALYVVDLNRFR 1358 Query: 193 DAKLTEKA---LSILMSKDNVYKYPDQDVMNVLLKGM-TLFLPREYNTIYTIK---SELK 245 ++ +L + DQD+ N + + LP+E+ T S+ + Sbjct: 1359 AIAAGDRLRGQYQMLSADPESLSNLDQDLPNHMQHHIPIKSLPQEWLWCETWCSDESQSQ 1418 Query: 246 DKTHQNYKKLITESTLL 262 +T +T+ L Sbjct: 1419 ARTIDLCNNPMTKEPKL 1435 >UniRef50_C1QC03 LPS:glycosyltransferase n=1 Tax=Brachyspira murdochii DSM 12563 RepID=C1QC03_9SPIR Length = 347 Score = 174 bits (441), Expect = 4e-42, Method: Composition-based stats. 
Identities = 65/295 (22%), Positives = 126/295 (42%), Gaps = 14/295 (4%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRH-INLDFYIIADVYNDGFFQKIAKLAEQNQLRI 86 +NV + + Y + +I S++ N + N++ YII++ N+ +KI L + + I Sbjct: 10 INVCFASNDAYAPYMSTAIASLLSNAKDDENINIYIISENINNSNKEKILSLKKIRECSI 69 Query: 87 TLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLG 146 + + + + S + +FRL L+ D+++YLD D++ + +L Sbjct: 70 DFIEPKEEIFKYISKYNMKSNSTWFRLSIPSLIP-NADKIVYLDGDMIINSSLRELFSDD 128 Query: 147 LNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMS 206 ++ A VV+DV + V +YFN+G + ++ K W + L EK + + + Sbjct: 129 MSDYYAYVVEDV-MDKIDEVKAPIGFSKTDKYFNAGFLMINNKLWIEDNLEEKFYNAVDT 187 Query: 207 KDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYT 266 + Y DQD++N LK F+ ++++ L +K+ +IH Sbjct: 188 MP-ILGYKDQDILNYCLKNRVKFIDKKWDF-------LDNKSCYKEISADINKINIIHCV 239 Query: 267 GATKPWHKWAIY-PSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHLLVQHHY 320 G KPW K + + +PW + P DA I +K + + Sbjct: 240 G--KPWKKECNVAFFADEFWKYYQLTPWFLERPIDAIQTILAQKYGDYEETKLKI 292 >UniRef50_A8PS15 UDP-glucose:Glycoprotein Glucosyltransferase containing protein n=1 Tax=Brugia malayi RepID=A8PS15_BRUMA Length = 1534 Score = 174 bits (441), Expect = 4e-42, Method: Composition-based stats. 
Identities = 42/254 (16%), Positives = 94/254 (37%), Gaps = 15/254 (5%) Query: 23 NTSECLNVA-YGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQ 81 + +N+ Y + + I S++ + +H ++F+++ + + F + + ++A+ Sbjct: 1237 EKHDAINIFSLASGHLYERFLRIMILSVMKHTKH-PVNFWLLKNYLSPNFKETLPQMAKH 1295 Query: 82 NQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQ 141 + + Y LF L L + +++++DAD + + D+ + Sbjct: 1296 YGFNYEFIEYRWPRWLHQQTEKQRVMWGYKILFLDVLFPLGVRKIIFVDADQIVRTDLME 1355 Query: 142 LLHLGLNGAVAAVVKDVEP------MQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAK 195 L+ L L GA + + ++ +Y S + +DL K+ Sbjct: 1356 LMELDLGGAPYGFTPFCDSRTSMDGFRFWKKGYWANHLAGRKYHISALYVIDLVKFRQVA 1415 Query: 196 LTEKA---LSILMSKDNVYKYPDQDVMNVLLKG-MTLFLPREYNTIYTIKSEL---KDKT 248 ++ L + N DQD+ N ++ LP+E+ T + K KT Sbjct: 1416 AGDRLRGQYQGLSADPNSLSNLDQDLPNNMIHQVRIKSLPQEWLWCETWCDDASKEKAKT 1475 Query: 249 HQNYKKLITESTLL 262 T+ L Sbjct: 1476 IDLCNNPQTKEPKL 1489 >UniRef50_Q3D427 Glycosyl transferase, family 8 n=8 Tax=Streptococcus agalactiae RepID=Q3D427_STRAG Length = 413 Score = 174 bits (440), Expect = 5e-42, Method: Composition-based stats. 
Identities = 63/278 (22%), Positives = 112/278 (40%), Gaps = 17/278 (6%) Query: 24 TSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQ 83 +A D Y + V I SI +N+ +DFYI+ D + +FQ + + Sbjct: 1 MKNRKAIALAADFGYQEQVKTIIKSICFHNQ--FIDFYILNDDFPVEWFQMMEYHLSKMD 58 Query: 84 LRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLL 143 I+ +I ++++ + YFR F +++ D++LYLD D++ D++ + Sbjct: 59 CTISNTKIFNEEIKHFKFQKPMPYPTYFRYFIPEVIHE--DKVLYLDCDMIITSDLTSIF 116 Query: 144 HLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSI 203 L ++ A V+D L + + YFNSG++ ++ W + ++++ L Sbjct: 117 TLDISKYGVAAVRD---------DLLEEYDGKEDYFNSGLLLINNIFWREQGISQRLLDY 167 Query: 204 LMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITES-TLL 262 +Y DQDV+N +L L L YN + + + Sbjct: 168 TRENQGALQYHDQDVLNDVLCDNWLELDETYNYHTGADMLYNLFQQSERQLNRRKDLPKV 227 Query: 263 IHYTGATKPWHKW-AIYPSVKYYKIALENSPWKDDSPR 299 IHYT ATKPW + W+D R Sbjct: 228 IHYT-ATKPWKYLETSVRWRDIWWEY-NRLEWRDIFTR 263 >UniRef50_C6LDU2 Glycosyl transferase, family 8 n=3 Tax=Firmicutes RepID=C6LDU2_9FIRM Length = 270 Score = 173 bits (439), Expect = 8e-42, Method: Composition-based stats. 
Identities = 62/257 (24%), Positives = 104/257 (40%), Gaps = 3/257 (1%) Query: 39 LDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLYRINTDKLQC 98 ++ V I SIV D YI+ + A E R+ + Sbjct: 1 MEHVLDCIRSIVRFPSEDGYDIYILHSDLQEQDQSDAAAQVEDGDTRLHFRFVEPSVFAS 60 Query: 99 LPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDV 158 P ++ + R +Y+R+FA LL +DR+LYLD D + + +L ++ G V Sbjct: 61 FPESERYPRLIYYRIFAASLLPPEMDRILYLDGDTLVINPLDELYNMDFEGNYFLACTHV 120 Query: 159 EPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDV 218 K E + Y NSGV+ ++LK+ + + E+ S + + PDQD+ Sbjct: 121 RKFLTKVNQYRLGMEEVSTYINSGVLLMNLKELREKQDFEEIASFVEKRGRYLTLPDQDI 180 Query: 219 MNVLLKGMTLFLP-REYNTIYTIKSELKDKTHQN--YKKLITESTLLIHYTGATKPWHKW 275 + L T L +YN + S + + + E+ ++IHY G KPW K Sbjct: 181 ITALYGNKTGILDTMKYNLSDRMISVYNTEPGHKRINLEWVRENAVVIHYYGKQKPWKKP 240 Query: 276 AIYPSVKYYKIALENSP 292 + +Y+ E P Sbjct: 241 YLGMLDVFYRELKEEEP 257 >UniRef50_D1HRJ7 Whole genome shotgun sequence of line PN40024, scaffold_34.assembly12x (Fragment) n=2 Tax=Vitis vinifera RepID=D1HRJ7_VITVI Length = 1715 Score = 173 bits (438), Expect = 1e-41, Method: Composition-based stats. 
Identities = 44/252 (17%), Positives = 88/252 (34%), Gaps = 19/252 (7%) Query: 27 CLNVAYGVDAN-YLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLR 85 +N+ + Y + + I S++ N + + F+ I + + F I +A++ Sbjct: 1407 TINIFSIASGHLYERFLKIMILSVLKN-SNRPVKFWFIKNYLSPQFKDVIPHMAQEYGFE 1465 Query: 86 ITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHL 145 L + Y LF + L+L++++++DAD + + D+ +L + Sbjct: 1466 YELITYKWPTWLHKQKEKQRIIWAYKILFLDVIFPLSLEKVIFVDADQIVRADMGELYDM 1525 Query: 146 GLNGAVAAVVK------DVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEK 199 + G A D++ + D Y S + +DL K+ + + Sbjct: 1526 DIKGRPLAYTPFCDNNKDMDGYRFWRQGFWKDHLRGKPYHISALYVVDLVKFRETAAGDN 1585 Query: 200 A---LSILMSKDNVYKYPDQDVMNVLLKGM---TLFLPREYNTIYTIKSEL---KDKTHQ 250 L N DQD+ N LP+E+ + K KT Sbjct: 1586 LRVFYETLSKDPNSLSNLDQDLPN--FAQHTVPIFSLPQEWLWCESWCGNATKSKAKTID 1643 Query: 251 NYKKLITESTLL 262 +T+ L Sbjct: 1644 LCNNPMTKEPKL 1655 >UniRef50_D0N7I0 UDP-glucose:glycoprotein glucosyltransferase, putative n=1 Tax=Phytophthora infestans T30-4 RepID=D0N7I0_PHYIN Length = 1632 Score = 172 bits (437), Expect = 1e-41, Method: Composition-based stats. 
Identities = 53/273 (19%), Positives = 96/273 (35%), Gaps = 15/273 (5%) Query: 2 DSFPAIEIDKVKAWDFRLANINTSECLNVAYGVDA-NYLDGVGVSITSIVLNNRHINLDF 60 D+ A ++D A T E ++V Y V + ++S++ + + F Sbjct: 1329 DTVEADQVDNDSA--VVAQKQRTGETIHVFSVASGYLYERFVKIMMSSVLK-RTNNPVTF 1385 Query: 61 YIIADVYNDGFFQKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLG 120 +++ + + F + I L EQ + I L + Y LF L Sbjct: 1386 WLLENFLSPDFKKSIPVLREQFGMDIRLVTYKWPNWLRQQTEKQRIIWGYKILFLDVLFP 1445 Query: 121 LTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEP----MQEKAVSRLSDPELLG 176 L + +++Y+DAD V + D+ +L L L+G + Q D Sbjct: 1446 LGVQKIIYVDADQVVRADLKELWELDLDGKPYGYTPFCDSRNVGFQFWRQGYWKDHLRGK 1505 Query: 177 QYFNSGVVYLDLKKWADA---KLTEKALSILMSKDNVYKYPDQDVMNVLLKGM-TLFLPR 232 Y S + +DL + + S L + N DQD+ N + LP+ Sbjct: 1506 PYHISALYVVDLALFRQMAAGDMLRAVYSHLSADPNSLANLDQDLPNYAQHQIPIFSLPQ 1565 Query: 233 EYNTIYTIKSE---LKDKTHQNYKKLITESTLL 262 E+ + S+ + KT + L Sbjct: 1566 EWLWCESWCSDETKVAAKTIDLCNNPKHKEPKL 1598 >UniRef50_UPI0000E47484 PREDICTED: similar to UDP-glucose ceramide glucosyltransferase-like 1 n=3 Tax=Strongylocentrotus purpuratus RepID=UPI0000E47484 Length = 1470 Score = 172 bits (435), Expect = 2e-41, Method: Composition-based stats. 
Identities = 46/253 (18%), Positives = 94/253 (37%), Gaps = 15/253 (5%) Query: 24 TSECLNVA-YGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQN 82 E LN+ Y + + + S++ + + + F+ + + + F + I ++A++ Sbjct: 1164 DMEQLNIFSLASGHLYERLLRIMMLSVLKHTKS-PVKFWFLKNYLSPSFKEIIPEMAKEY 1222 Query: 83 QLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQL 142 L + + + Y LF L L + +++++DAD + + D+ +L Sbjct: 1223 DFEYELIQYKWPRWLHQQTEKQRMIWGYKILFLDVLFPLNIKKIIFVDADQIVRADMQEL 1282 Query: 143 LHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELL------GQYFNSGVVYLDLKKWADAKL 196 L GA V + +E R +Y S + +DL K+ Sbjct: 1283 ADFDLKGAPYGYVPFCDSRKEMDGFRFWKSGYWASHLAGRKYHISALYVVDLVKFRRIAA 1342 Query: 197 TEKA---LSILMSKDNVYKYPDQDVMNVLLKGM-TLFLPREYNTIYTIKSE---LKDKTH 249 ++ L N DQD+ N ++ + LP+E+ T E + KT Sbjct: 1343 GDRLRGQYQALSQDPNSLSNLDQDLPNNMIHQVAIRSLPQEWLYCETWCHESEKSRAKTI 1402 Query: 250 QNYKKLITESTLL 262 +T+ L Sbjct: 1403 DLCNNPLTKEPKL 1415 >UniRef50_Q4HGS8 General stress protein A, putative n=2 Tax=Campylobacter RepID=Q4HGS8_CAMCO Length = 403 Score = 171 bits (434), Expect = 2e-41, Method: Composition-based stats. 
Identities = 74/340 (21%), Positives = 143/340 (42%), Gaps = 44/340 (12%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNN------RHINLDFYIIADVYNDGFFQKI----AK 77 ++ + D NY+ V ITSI+ N ++ F+I+++ ++ +K+ + Sbjct: 2 YHIIFSADENYIKYTSVLITSIIKNTNPKNHFQNRPYSFHILSNFVSEETREKLECLKKE 61 Query: 78 LAEQNQLRITLYRINTDKLQCLPCTQ--VWSRAMYFRLFAFQLLGLTLDRLLYLDADVVC 135 L + I+++ ++ D+ + P + S+ Y+RL L +D+ LYLD+D++C Sbjct: 62 LNKIYPCEISIHIMSDDRFENFPSSGAAQNSKLPYYRLKFISLFDDNVDKCLYLDSDMLC 121 Query: 136 KGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSD----PELLGQYFNSGVVYLDLKKW 191 DI ++ + L G + VV D + K ++ + YFNSG + ++ K++ Sbjct: 122 MCDIREIFAIDLQGKIIGVVGDPGSKRSKIKFIENNTKKVLKFDENYFNSGFLLINAKEY 181 Query: 192 ADAKLTEKALSILMSKDNVYKYPDQDVMNVLL-KGMTLFLPREYNT----IYTIKSELKD 246 A + +K L K K DQD++N ++ K L L YN + + + + Sbjct: 182 KKANVEKKCEE-LAKKCIYIKAADQDLLNAVISKDKILKLSFAYNFNIITLLYVICKDEK 240 Query: 247 KTHQNYKKLIT----ESTLLIHYTGATKPWHKWAIYPS------VKYYKIALENSPWKDD 296 K NY + ++ ++HY KPW Y Y+ + P + Sbjct: 241 KNRLNYTREEFTQSAKNPKILHY--GEKPWKFLKSYVDLQNRNISDYWWDIAKEVPIFKE 298 Query: 297 SPRDAKSIIEFKKRYKHLLVQHHYISGIIAGVCYLCRKYY 336 R K + + +G+ + LC+KY Sbjct: 299 ELL----------RQKENIKDYLLYAGLGFTLYNLCKKYQ 328 >UniRef50_Q9NYU2 UDP-glucose:glycoprotein glucosyltransferase 1 n=77 Tax=Eumetazoa RepID=UGGG1_HUMAN Length = 1555 Score = 171 bits (433), Expect = 4e-41, Method: Composition-based stats. 
Identities = 45/259 (17%), Positives = 95/259 (36%), Gaps = 15/259 (5%) Query: 18 RLANINTSECLNVAYGVDAN-YLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIA 76 + + +N+ + Y + + + S++ N + + F+ + + + F + I Sbjct: 1246 EEVKQDKDDIINIFSVASGHLYERFLRIMMLSVLKNTKT-PVKFWFLKNYLSPTFKEFIP 1304 Query: 77 KLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCK 136 +A + + L + + + Y LF L L +D+ L++DAD + + Sbjct: 1305 YMANEYNFQYELVQYKWPRWLHQQTEKQRIIWGYKILFLDVLFPLVVDKFLFVDADQIVR 1364 Query: 137 GDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELL------GQYFNSGVVYLDLKK 190 D+ +L L+GA + +E R +Y S + +DLKK Sbjct: 1365 TDLKELRDFNLDGAPYGYTPFCDSRREMDGYRFWKSGYWASHLAGRKYHISALYVVDLKK 1424 Query: 191 WADAKLTEKA---LSILMSKDNVYKYPDQDVMNVLLKGM-TLFLPREYNTIYTIKSELKD 246 + ++ L N DQD+ N ++ + LP+E+ T + Sbjct: 1425 FRKIAAGDRLRGQYQGLSQDPNSLSNLDQDLPNNMIHQVPIKSLPQEWLWCETWCDDASK 1484 Query: 247 KTHQ---NYKKLITESTLL 262 K + +T+ L Sbjct: 1485 KRAKTIDLCNNPMTKEPKL 1503 >UniRef50_A8AY72 Glycosyl transferase, family 8 SP1766 n=7 Tax=Streptococcus RepID=A8AY72_STRGC Length = 435 Score = 171 bits (433), Expect = 4e-41, Method: Composition-based stats. 
Identities = 58/294 (19%), Positives = 105/294 (35%), Gaps = 27/294 (9%) Query: 8 EIDKVKAWD----FRLANINTSECL-NVAYGVDANYLDGVGVSITSIVLNNRHINLDFYI 62 E++K ++ RLAN + ++ D Y+ + +I S+ H +L Y+ Sbjct: 11 ELNKRIRYNEDTIIRLANRGKMNQMKSIVLAGDYGYIRQIETTIKSLCCY--HEDLLIYV 68 Query: 63 IADVYNDGFFQKIAKLAEQNQLRITLYRINTDKLQCLP---CTQVWSRAMYFRLFAFQLL 119 +F K + + ++ D L+ + Y R F + + Sbjct: 69 FNQDIPQEWFINTRKKVKGTGNNLFDIKLLRDDLRMKWEESTYSHINYMAYARYFIPEYV 128 Query: 120 GLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYF 179 DR LYLD D+V ++ L L L A V+ + G F Sbjct: 129 KA--DRALYLDCDLVVTQNLDHLFELDLEDYYIAAVRATFGL--------------GIGF 172 Query: 180 NSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYT 239 NSGV+ L+ K+W + + ++ + + + DQ ++N+L K L L YN Sbjct: 173 NSGVMLLNNKRWREENIPQQLVELTDREIERVLEGDQSILNMLFKEQYLELEDSYNFQIG 232 Query: 240 IKSELKDKTHQNYKKLITES-TLLIHYTGATKPWHKWAIYPSVKYYKIALENSP 292 H + ++HY A KPW+ + + + Sbjct: 233 FDMGAAQYGHDFVFDIPLSPLPAIVHYISALKPWNLLTNMRLREVWWFYNDLDW 286 >UniRef50_P91854 Protein F26H9.8, partially confirmed by transcript evidence n=2 Tax=Caenorhabditis RepID=P91854_CAEEL Length = 1381 Score = 170 bits (432), Expect = 5e-41, Method: Composition-based stats. 
Identities = 55/268 (20%), Positives = 104/268 (38%), Gaps = 17/268 (6%) Query: 9 IDKVKAWDFRLANINTSECLNVA-YGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVY 67 ++ K + E +NV Y + + +TS++ N + + F+++ + Sbjct: 1079 LNSAKNYFASPEPS---EVINVFSLASGHLYERFMRIMMTSVLNNTKTQKVKFWLLKNYL 1135 Query: 68 NDGFFQKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLL 127 + F + I KLAE + L K + Y LF L L +D+++ Sbjct: 1136 SPKFKETIPKLAEFYKFEFELVEYKWPKWLHKQTEKQRVMWGYKILFLDVLFPLNVDKII 1195 Query: 128 YLDADVVCKGDISQLLHLGLNGAVAAVVKDV------EPMQEKAVSRLSDPELLGQYFNS 181 ++DAD V + D+ +L+ LNGA V + + + + +Y S Sbjct: 1196 FVDADQVVRADLQELMDFNLNGAPYGYVPFCESRTEMDGFRFWKSGYWKNHLMGRKYHIS 1255 Query: 182 GVVYLDLKKWADAKLTEKA---LSILMSKDNVYKYPDQDVMNVLLKGM-TLFLPREYNTI 237 + +DLK + + ++ L + N DQD+ N +L + LP+E+ Sbjct: 1256 ALYVVDLKAFREFSAGDRLRGRYDSLSADPNSLSNLDQDLPNNMLHEVPIKSLPQEWLWC 1315 Query: 238 YTIKSEL---KDKTHQNYKKLITESTLL 262 T + K KT +T+ L Sbjct: 1316 ETWCDDGSKEKAKTIDLCNNPLTKEPKL 1343 >UniRef50_Q5KMJ4 Putative uncharacterized protein n=1 Tax=Filobasidiella neoformans RepID=Q5KMJ4_CRYNE Length = 1543 Score = 170 bits (430), Expect = 8e-41, Method: Composition-based stats. 
Identities = 47/258 (18%), Positives = 94/258 (36%), Gaps = 15/258 (5%) Query: 20 ANINTSECLNVAYGVDAN-YLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKL 78 +N+ Y + I S++ + + ++ F+ I + + F I KL Sbjct: 1220 PAKTEHADINIFTVASGLLYERFASIMILSVMKH-TNSSVKFWFIENFLSPTFIAFIPKL 1278 Query: 79 AEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGD 138 AE+ + + Y LF L ++LD+++++DAD + + D Sbjct: 1279 AEEYGFQYEFVTYKWPHWLRAQTEKQRIIWAYKILFLDVLFPMSLDKVIFVDADQIVRTD 1338 Query: 139 ISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLG------QYFNSGVVYLDLKKWA 192 + +L+ + L+G V +E R Y S + +DLKK+ Sbjct: 1339 MKELMDVDLHGRVYGYAPMGNSRKEMEGFRFWKSGYWKEALRGRPYHISALYVVDLKKFR 1398 Query: 193 DAKLTEKA---LSILMSKDNVYKYPDQDVMNVLLKGM-TLFLPREYNTIYTIKSE---LK 245 ++ L + N DQD+ N + + L +++ T S+ Sbjct: 1399 QLATGDRLRGQYHALSADPNSLANLDQDLPNSMQDQIPIWTLDQDWLWCQTWCSDESLAT 1458 Query: 246 DKTHQNYKKLITESTLLI 263 KT + +T+ L+ Sbjct: 1459 AKTIDLCQNPLTKEPKLV 1476 >UniRef50_Q4PEF1 Putative uncharacterized protein n=1 Tax=Ustilago maydis RepID=Q4PEF1_USTMA Length = 1678 Score = 170 bits (430), Expect = 9e-41, Method: Composition-based stats. 
Identities = 49/257 (19%), Positives = 90/257 (35%), Gaps = 15/257 (5%) Query: 20 ANINTSECLNVAYGVDAN-YLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKL 78 A +N+ + Y + I S++ + ++ F+ I + + F + I L Sbjct: 1345 ATARKHADINIFTVASGHLYERMTYIMILSVLKHTSS-SVKFWFIENFLSPSFKEFIPHL 1403 Query: 79 AEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGD 138 A + L + + Y LF L L L +++++DAD V + D Sbjct: 1404 AAEYGFEYELVTYAWPHWLRAQKEKQRTIWGYKILFLDTLFPLDLGKVIFVDADQVVRTD 1463 Query: 139 ISQLLHLGLNGAVAAVVK------DVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWA 192 + +L+ L L G V D++ + D Y S + +DL+K+ Sbjct: 1464 MQELVDLDLEGKVYGYPPMGDDSEDMDGFRFWKQGYWKDYLRGRPYHISALYVVDLQKFR 1523 Query: 193 DAKLTEKA---LSILMSKDNVYKYPDQDVMNVLLKGM-TLFLPREYNTIYTIKSEL---K 245 ++ L + N DQD+ N + + L +E+ T S Sbjct: 1524 LFAAGDRLRGQYQALSADPNSLSNLDQDLPNNMQTSIPIHTLEKEWLWCETWCSHDWLKD 1583 Query: 246 DKTHQNYKKLITESTLL 262 KT T+ L Sbjct: 1584 AKTIDLCSNPKTKEPKL 1600 >UniRef50_UPI0001792D56 PREDICTED: similar to UDP-glucose glycoprotein:glucosyltransferase n=1 Tax=Acyrthosiphon pisum RepID=UPI0001792D56 Length = 1536 Score = 169 bits (429), Expect = 9e-41, Method: Composition-based stats. 
Identities = 46/257 (17%), Positives = 99/257 (38%), Gaps = 15/257 (5%) Query: 20 ANINTSECLNVAYGVDAN-YLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKL 78 + +T E +N+ + Y + + + S++ N + + F+ + + + + + Sbjct: 1234 PDKSTDETINIFSVASGHLYERFLRIMMLSVLKNTKS-PVKFWFLKNYLSPTVKNFLPIM 1292 Query: 79 AEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGD 138 A++ + + L + + + Y LF L L + +++++DAD V + D Sbjct: 1293 AQEYKFQYELVEYKWPRWLHQQTEKQRTIWGYKILFLDVLFPLDVKKIIFVDADQVVRAD 1352 Query: 139 ISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLG------QYFNSGVVYLDLKKWA 192 + +L+ L L GA A E +E R +Y S + +DLK++ Sbjct: 1353 MKELVDLDLGGAPYAYTPFCESRKEMDGFRFWKQGYWKTHLQGRRYHISALYVVDLKRFR 1412 Query: 193 DAKLTEKA---LSILMSKDNVYKYPDQDVMNVLLKG-MTLFLPREYNTIYTIKSELKDKT 248 ++ L N DQD+ N ++ LP+E+ T + K+ Sbjct: 1413 KVAAGDRLRGQYQALSQDPNSLSNLDQDLPNNMIHQVSIKSLPQEWLWCETWCDDASKKS 1472 Query: 249 HQ---NYKKLITESTLL 262 + +T+ L Sbjct: 1473 AKTIDLCNNPLTKEAKL 1489 >UniRef50_Q3DNA2 Glycosyl transferase, family 8 n=9 Tax=Streptococcus RepID=Q3DNA2_STRAG Length = 272 Score = 169 bits (429), Expect = 1e-40, Method: Composition-based stats. 
Identities = 69/269 (25%), Positives = 121/269 (44%), Gaps = 8/269 (2%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 +N+ + +D Y+D V + S+V ++ L+ Y++ + I + + ++ Sbjct: 1 MNLLFSIDDMYVDHFKVMLYSLVRQTKNRKLEIYVLQKTLLKRHTELI-QYTQNLEVGYH 59 Query: 88 LYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGL 147 + T+ P T + +Y+RL A + L TLDR+LYLDAD++C D S L + L Sbjct: 60 PIIVGTEVFAQAPTTDRYPDTIYYRLLAHKFLPETLDRILYLDADMLCLNDFSSLYDMEL 119 Query: 148 NGAVAAVV---KDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSIL 204 + A D + + RL + EL YFN+GV+ ++L + L + Sbjct: 120 GDQLYAAASHNTDGKFLDYVNKLRLKNVELESSYFNTGVLLMNLPAIRKVVHQQTILDYM 179 Query: 205 MSKDNVYKYPDQDVMNVLLKGMTLFLPRE-YNT--IYTIKSELKDKTHQNYKKLITESTL 261 M PDQD++N L + +P E YN Y++ +LK + + + + T+ Sbjct: 180 MQNRGRLILPDQDILNGLYANLVKPIPDEIYNYDARYSLIYQLKSRNEWD-LEWVINHTV 238 Query: 262 LIHYTGATKPWHKWAIYPSVKYYKIALEN 290 +H+ G KPW K YK + Sbjct: 239 FLHFAGRDKPWKKDYRGRYSGLYKFMAKE 267 >UniRef50_UPI0001693121 general stress protein n=1 Tax=Paenibacillus larvae subsp. larvae BRL-230010 RepID=UPI0001693121 Length = 352 Score = 169 bits (428), Expect = 1e-40, Method: Composition-based stats. 
Identities = 59/268 (22%), Positives = 102/268 (38%), Gaps = 26/268 (9%) Query: 35 DANYLDGVGVSITSIVLNNRHINLDFYIIADV-YNDGFFQKIAKLAEQNQLRITLYRINT 93 D Y + G + S+ N +++ +I+ D + QK+ +L I Y + Sbjct: 12 DGAYAEHAGAVLASVFCNTSS-SVNVHILHDETLTEANKQKLIELTSSFNQTIHFYPVTI 70 Query: 94 DK-----LQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLN 148 + + W++A +RL L+ +D+++YLD DV+ +I++L + L Sbjct: 71 PDNMLQAMAGVKSISFWTQASMYRLLIPALIP--VDKIIYLDCDVLVNMNIAELWEVQLG 128 Query: 149 GAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADA-KLTEKALSILMSK 207 A V D M YFNSGV+ L E+ L+ L Sbjct: 129 DFYLAAVWDQAIMAAVQHIIPYGLNPDS-YFNSGVILFALNNIRKKIDWYEEMLNFLRRY 187 Query: 208 DNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTG 267 + PDQD +N + L L R +N + + ++H+ G Sbjct: 188 PD-TSMPDQDTLNAVFGENYLQLDRRFNFFNMVSPHHDF------------NNKIVHFAG 234 Query: 268 ATKPWHKWAIYPSVKYYKIALENSPWKD 295 + K W + P Y+ L +PWK Sbjct: 235 SEKCWDVHS--PGANLYQEYLSLTPWKK 260 >UniRef50_C5S3F7 Putative glycosyl transferase n=2 Tax=Actinobacillus minor RepID=C5S3F7_9PAST Length = 275 Score = 169 bits (427), Expect = 2e-40, Method: Composition-based stats. 
Identities = 60/285 (21%), Positives = 110/285 (38%), Gaps = 17/285 (5%) Query: 20 ANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLA 79 + + D + + + +I SI +N NL ++ ++ +F+ + Sbjct: 3 QTNKQTNKQTIILAADIKFAEQLETTIKSICYHN--ANLYIVLLNRDFSKEWFEYLNTYL 60 Query: 80 EQNQLRITLYRINTDKLQCLPCTQVWSRA-MYFRLFAFQLLGLTLDRLLYLDADVVCKGD 138 Q I ++N ++L+ S A +FR F + D++LYLD D+V G Sbjct: 61 NQINCEIIDVKVNCNQLEEYKTLPHISSASTFFRYFIPAFV--NDDKVLYLDCDLVVNGS 118 Query: 139 ISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTE 198 +S L LN A D ++FN+GV+ ++ K W ++T Sbjct: 119 LSIFFDLELNDHYVAASLDDIAFNFH----------QKKHFNAGVLLINNKLWRKQEITL 168 Query: 199 KALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITE 258 KAL + + + DQ+V+N+L + + L N + + + Y + + Sbjct: 169 KALELTDRLNEKLEEGDQEVLNILFQNKWIELNPYLNYLVGAEYLYRRNGVTQYIRRQED 228 Query: 259 S-TLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAK 302 L++H+ KPW P +YY W D R Sbjct: 229 DVPLILHFNTKYKPWLPIDGVPFREYYWFYYRL-NWADIIARHYN 272 >UniRef50_C5XV64 Putative uncharacterized protein Sb04g036540 n=1 Tax=Sorghum bicolor RepID=C5XV64_SORBI Length = 1568 Score = 168 bits (426), Expect = 2e-40, Method: Composition-based stats. 
Identities = 44/256 (17%), Positives = 86/256 (33%), Gaps = 15/256 (5%) Query: 21 NINTSECLNVAYGVDAN-YLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLA 79 + E +N+ + Y + + I S++ + F+ I + + F I +A Sbjct: 1257 DARQGETINIFSVASGHLYERFLKIMILSVLK-ETQRPVKFWFIKNYLSPQFKDVIPHMA 1315 Query: 80 EQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDI 139 + L + Y LF + L+L +++++DAD + + D+ Sbjct: 1316 REYGFEYELITYKWPTWLHKQKEKQRIIWAYKILFLDVIFPLSLRKVIFVDADQIVRADM 1375 Query: 140 SQLLHLGLNGAVAAVVK------DVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWAD 193 +L + L G A D++ + D Y S + +DL K+ Sbjct: 1376 GELYDMNLKGRPLAYTPFCDNNKDMDGYRFWKQGFWKDHLRGRPYHISALYVVDLAKFRQ 1435 Query: 194 AKLTEKA---LSILMSKDNVYKYPDQDVMNVLLKGM-TLFLPREYNTIYTIKSE---LKD 246 + L N DQD+ N + LP+E+ + + Sbjct: 1436 TASGDTLRVFYEQLSKDPNSLSNLDQDLPNYAQHTVPIFSLPQEWLWCESWCGNATKARA 1495 Query: 247 KTHQNYKKLITESTLL 262 KT +T+ L Sbjct: 1496 KTIDLCNNPMTKEPKL 1511 >UniRef50_Q046Z9 Lipopolysaccharide biosynthesis glycosyltransferase n=32 Tax=Lactobacillus RepID=Q046Z9_LACGA Length = 317 Score = 168 bits (426), Expect = 2e-40, Method: Composition-based stats. 
Identities = 60/299 (20%), Positives = 125/299 (41%), Gaps = 28/299 (9%) Query: 27 CLNVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKIAKLAEQNQLR 85 + V Y + NY VSI S++ + +++ + ++ +D + + L + Sbjct: 3 TIPVFYTISDNYTPYAAVSIQSLIDHVDQNKDYTITLLVQNISDKHKKDLEDL-SIKNVH 61 Query: 86 ITLYRINTDKL-------QCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGD 138 + ++ I+ + + + Q ++ ++++RLF L D+ +YLDAD + D Sbjct: 62 VNIFHIDDEMVAPIHNSEENYLRAQFFTMSIFYRLFIPNLFP-QYDKAVYLDADTIICTD 120 Query: 139 ISQLLHLGLNGAVAAVVKDVEPMQEK----AVSRLSDPELLGQYFNSGVVYLDLKKWADA 194 I++L + + + A V D+ K + +Y N+GV+ ++K + D Sbjct: 121 IAELYNTEIGDNMFASVPDMSIRFIKPLQVYIKECQGIFPPEKYINNGVILFNMKAFRDK 180 Query: 195 KLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKK 254 K +K S++ PDQ MN + + LP E++ + N Sbjct: 181 KFVDKFYSLIEKYHFDNIDPDQAYMNEICEDKIYHLPLEWDAMP------------NEHM 228 Query: 255 LITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSII-EFKKRYK 312 ++ ++HY KPWH +A KY+ + SP+ + + E +K+ + Sbjct: 229 DEIKNPKIVHYNLFFKPWH-FADVQYGKYFWDVAKKSPYYGELKEQLANFTDEDRKKAR 286 >UniRef50_C6H742 UDP-glucose:glycoprotein glucosyltransferase n=1 Tax=Ajellomyces capsulatus H143 RepID=C6H742_AJECH Length = 1728 Score = 167 bits (424), Expect = 4e-40, Method: Composition-based stats. 
Identities = 45/257 (17%), Positives = 92/257 (35%), Gaps = 15/257 (5%) Query: 20 ANINTSECLNVAYGVDAN-YLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKL 78 T +N+ + Y + + + S++ + +H ++ F+ I + F + L Sbjct: 1409 PAQGTHADINIFSVASGHLYERMLNIMMVSVMKHTKH-SVKFWFIEQFLSPSFKSFLPHL 1467 Query: 79 AEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGD 138 A + + + Y LF L L+LD+++++DAD + + D Sbjct: 1468 AAEYGFSYEMVTYKWPHWLRAQTEKQRIIWGYKILFLDVLFPLSLDKVIFVDADQIVRTD 1527 Query: 139 ISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQ------YFNSGVVYLDLKKWA 192 + +L+ L L GA + R Y S + +DLK++ Sbjct: 1528 MYELVTLDLEGAPYGFTPMCDSRTSMEGFRFWKQGYWKNFLRGLPYHISALYVVDLKRFR 1587 Query: 193 DAKLTEKA---LSILMSKDNVYKYPDQDVMNVLLKGM-TLFLPREYNTIYTIKSE---LK 245 ++ L + DQD+ N + + + LP+++ T S+ Sbjct: 1588 AIAAGDRLRGQYHTLSADPQSLSNLDQDLPNNMQRMLPIKSLPQDWLWCETWCSDESLAT 1647 Query: 246 DKTHQNYKKLITESTLL 262 KT +T+ L Sbjct: 1648 AKTIDLCNNPLTKEPKL 1664 >UniRef50_C5FDY7 Glycogenin n=1 Tax=Microsporum canis CBS 113480 RepID=C5FDY7_NANOT Length = 731 Score = 167 bits (424), Expect = 4e-40, Method: Composition-based stats. 
Identities = 55/292 (18%), Positives = 92/292 (31%), Gaps = 40/292 (13%) Query: 31 AYGV---DANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 Y NYL G V S+ N L + D G ++ + I Sbjct: 8 VYCTILLSDNYLPGAMVLAHSLRDNGTKGRLAVLVTLDNLQPGIIDELKTV---YDDVIP 64 Query: 88 LYRINTDKLQCLPCTQVWS-RAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLG 146 + RI L + + ++ ++ DR++Y+DADV+ +LL L Sbjct: 65 IPRIENSYPGNLYLMDRPDLISTFSKIALWK--QTQYDRIVYIDADVIALRAPDELLTLD 122 Query: 147 LNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMS 206 A V D+ FN+GV+ L AL Sbjct: 123 FKS--IAAVPDIGWP---------------DCFNTGVIVL-----RPNLKDYYALLAFAQ 160 Query: 207 KDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYT 266 + + DQ ++N+ K L YN + + + L+H+ Sbjct: 161 RGISFDGADQGLLNMHFKN-WDRLSFTYNCTPSGHYQYVPAYRY-----FESTISLVHFI 214 Query: 267 GATKPWHKW-AIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHLLVQ 317 G+ KPW + P Y L W R ++ + + +H Q Sbjct: 215 GSLKPWRIGRSSSPQQSPYNQLLAK--WWAVYDRHYRTGPIYIPQPRHYQSQ 264 >UniRef50_A5UC07 Aspartate-semialdehyde dehydrogenase n=7 Tax=Haemophilus influenzae RepID=A5UC07_HAEIE Length = 300 Score = 167 bits (423), Expect = 5e-40, Method: Composition-based stats. 
Identities = 57/273 (20%), Positives = 114/273 (41%), Gaps = 18/273 (6%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 +N+ + +D N+ + + S+ + H N++ Y+I D +K+ + + Sbjct: 1 MNIVFTLDCNFASHLDTVLKSLCYH--HNNINIYVIHDGIPAESLEKLKMHCAKFDNTLY 58 Query: 88 LYRINTDKLQC---LPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLH 144 + N ++ + + S A FRL+ Q+L ++R++YLD D++ I +L Sbjct: 59 DIQFNINQFSFPTVMSPAHIQSSASLFRLYLHQILPQHIERVIYLDIDLIIHQAIDELWD 118 Query: 145 LGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSIL 204 + L ++ A V D + Y N+GV+ ++L KW + + + + Sbjct: 119 INLEDSLIAGVSDFFSEYLWEHPFYEKQQ----YINTGVMLINLNKWRENNIEQYFIEYA 174 Query: 205 MSKDNVYKYPDQDVMNVLLK-GMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLI 263 + Y DQDV+N + + LP ++N + + +K+ I + +I Sbjct: 175 AKYGEFFVYGDQDVINFSIPTNLIKLLPVKFNIQV----KFIEYLWMEHKEKIKFTPHII 230 Query: 264 HYTGATKPWHK----WAIYPSVKYYKIALENSP 292 HY G+ KPW K + + Y S Sbjct: 231 HYIGSNKPWLKEHSANSPRFYNEEYLFYHHLSW 263 >UniRef50_C4Q2X6 Udp-glucose glycoprotein:glucosyltransferase, putative n=2 Tax=Schistosoma mansoni RepID=C4Q2X6_SCHMA Length = 1673 Score = 167 bits (423), Expect = 5e-40, Method: Composition-based stats. 
Identities = 42/255 (16%), Positives = 93/255 (36%), Gaps = 15/255 (5%) Query: 22 INTSECLNVAYGVDAN-YLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAE 80 + E +N+ + Y + + + +++ + + + F+ + + + F I +A Sbjct: 1346 ASNQETINIFSVASGHLYERLLRIMMLTVIRH-TNSPVKFWFLKNYLSPTFKDFIPYMAT 1404 Query: 81 QNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDIS 140 + + + + Y LF L L + +++++DAD + + D+ Sbjct: 1405 EYGFEYEFVQYKWPRWLHAQTEKQRIIWGYKILFLDVLFPLNVTKIIFVDADQIVRADLK 1464 Query: 141 QLLHLGLNGAVAAVVKDVEPMQE------KAVSRLSDPELLGQYFNSGVVYLDLKKWADA 194 +L L L+GA + +E ++ Y S + +DL ++ Sbjct: 1465 ELADLDLDGAPYGYTPFCDSRKEMDGFRFWKQGYWANHLAGRPYHISALYVVDLTRFRRL 1524 Query: 195 KLTEKA---LSILMSKDNVYKYPDQDVMNVLLKGM-TLFLPREYNTIYTIKSE---LKDK 247 ++ L N DQD+ N ++ + LP+E+ T S+ K K Sbjct: 1525 AAGDRLRGQYHGLSQDPNSLSNLDQDLPNNMIHQVPIKSLPQEWLWCETWCSDESLAKAK 1584 Query: 248 THQNYKKLITESTLL 262 T T+ L Sbjct: 1585 TIDLCNNPRTKEPKL 1599 >UniRef50_B9QZ95 Glycosyl transferase family 8 n=1 Tax=Labrenzia alexandrii DFL-11 RepID=B9QZ95_9RHOB Length = 309 Score = 167 bits (422), Expect = 6e-40, Method: Composition-based stats. 
Identities = 65/312 (20%), Positives = 129/312 (41%), Gaps = 30/312 (9%) Query: 29 NVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQL--RI 86 N+A D L G+ V+I S + ++ I +++AD ++ K++ + + + + Sbjct: 4 NIAACADTKVLPGLAVTIRSSLEHSS-IPCRIHVLADRLSEQDKHKLSNSWKPHPMCQDV 62 Query: 87 TLYRINTDKLQCLPCTQVW-SRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHL 145 Y I+ + T S++ Y R F LG + +YLD D++ D+++L Sbjct: 63 VFYDIDYQNISKFRSTMYLKSKSAYSRYFISDFLGEE-SKCIYLDCDLLVLRDLAELNTA 121 Query: 146 GLNGAVAAVVKDVEPMQEKAVSRLSD---PELLGQYFNSGVVYLDLKKWADAKLTEKALS 202 ++G V+D+ + + YFNSGV+ +DL +W + Sbjct: 122 KMHGKTIGSVRDISVRTADPHLFIGERLQLTNPYDYFNSGVLIIDLDRWRKLDARNHLID 181 Query: 203 ILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLL 262 + + + + + DQD +NV G T FL +NT Y++ T + Sbjct: 182 LTLERADTFHSQDQDALNVFFDGDTEFLDPVWNT-------------SQYERPDTAENRI 228 Query: 263 IHYTGATKPWH-------KWAIYPSVKYYKIA--LENSPWKDDSPRDAKSIIEFKKRYKH 313 IH G KPWH + + + + + L+ + + + P D + K+ + Sbjct: 229 IHLIGTVKPWHARYKEKLSDSYHRTEIWDRFYGVLDRTAYAGNRPWDPAGLGVVKETIES 288 Query: 314 LLVQHHYISGII 325 + + ++G I Sbjct: 289 KIPKMDMVTGKI 300 >UniRef50_A8NCT1 Putative uncharacterized protein n=1 Tax=Coprinopsis cinerea okayama7#130 RepID=A8NCT1_COPC7 Length = 1624 Score = 167 bits (422), Expect = 7e-40, Method: Composition-based stats. 
Identities = 44/256 (17%), Positives = 89/256 (34%), Gaps = 15/256 (5%) Query: 21 NINTSECLNVAYGVDAN-YLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLA 79 + +N+ Y + I S++ N + + F+ I + + F + I A Sbjct: 1258 PVKEQAEINIFTVASGLLYERFASIMILSVLKNTKST-VKFWFIENFLSPSFLEFIPHFA 1316 Query: 80 EQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDI 139 ++ L + Y LF L + L +++++DAD + + D+ Sbjct: 1317 KEYNFDYELVTYRWPSWLRAQTEKQRIIWAYKILFLDVLFPMDLKKVIFVDADQIVRADL 1376 Query: 140 SQLLHLGLNGAVAAVVK------DVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWAD 193 +L+ L L GA ++E + D Y S + +DL ++ Sbjct: 1377 KELVDLDLQGAPYGYTPMGDDNKEMEGFRFWKTGYWKDFLQGKPYHISALYVIDLVRFRH 1436 Query: 194 AKLTEKA---LSILMSKDNVYKYPDQDVMNVLLKGM-TLFLPREYNTIYTIKSE---LKD 246 + L + DQD+ N L + + L ++ T S+ + Sbjct: 1437 MAAGDILRGQYQALSADPGSLANLDQDLPNNLQRQVPIFSLDEDWLWCETWCSKDRLHRA 1496 Query: 247 KTHQNYKKLITESTLL 262 KT + +T+ L Sbjct: 1497 KTIDLCQNPLTKEPKL 1512 >UniRef50_Q6ESI8 Putative UDP-glucose:glycoprotein glucosyltransferase n=3 Tax=Magnoliophyta RepID=Q6ESI8_ORYSJ Length = 1626 Score = 166 bits (421), Expect = 9e-40, Method: Composition-based stats. 
Identities = 44/251 (17%), Positives = 86/251 (34%), Gaps = 15/251 (5%) Query: 26 ECLNVAYGVDAN-YLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQL 84 E +N+ + Y + + I S++ + + F+ I + + F I +A++ Sbjct: 1323 ETINIFSVASGHLYERFLKIMILSVLKQTQ-RPVKFWFIKNYLSPQFKDVIPHMAQEYGF 1381 Query: 85 RITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLH 144 L + Y LF + L+L +++++DAD + + D+ +L Sbjct: 1382 EYELVTYKWPTWLHKQKEKQRIIWAYKILFLDVIFPLSLRKVIFVDADQIVRADMGELYD 1441 Query: 145 LGLNGAVAAVVKDVEPMQEKAVSRLSDPELLG------QYFNSGVVYLDLKKWADAKLTE 198 + L G A + +E R Y S + +DL K+ + Sbjct: 1442 MNLKGRPLAYTPFCDNNKEMDGYRFWKQGFWKDHLRGRPYHISALYVVDLAKFRQTASGD 1501 Query: 199 KA---LSILMSKDNVYKYPDQDVMNVLLKGM-TLFLPREYNTIYTIKSE---LKDKTHQN 251 L N DQD+ N + LP+E+ + + KT Sbjct: 1502 TLRVFYETLSKDPNSLSNLDQDLPNYAQHTVPIFSLPQEWLWCESWCGNATKARAKTIDL 1561 Query: 252 YKKLITESTLL 262 +T+ L Sbjct: 1562 CNNPMTKEPKL 1572 >UniRef50_B2VVG3 UDP-glucose:glycoprotein glucosyltransferase n=9 Tax=Leotiomyceta RepID=B2VVG3_PYRTR Length = 1508 Score = 166 bits (421), Expect = 9e-40, Method: Composition-based stats. 
Identities = 43/259 (16%), Positives = 89/259 (34%), Gaps = 15/259 (5%) Query: 18 RLANINTSECLNVAYGVDAN-YLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIA 76 + T +N+ + Y + + + S++ + H + F+ I + F + Sbjct: 1192 KAVKKGTQADINIFSVASGHLYERMLNIMMVSVMKHTNHT-VKFWFIEQFLSPSFKSFLP 1250 Query: 77 KLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCK 136 +A + + + Y LF L L L +++++DAD + + Sbjct: 1251 HIAAEYGFEYEMVTYKWPHWLRGQTEKQREIWGYKILFLDVLFPLDLKKVIFVDADQIVR 1310 Query: 137 GDISQLLHLGLNGAVAAVVK------DVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKK 190 D+ +L+ L GA ++E + ++ Y S + +DL + Sbjct: 1311 TDMYELVQHDLQGAPYGFTPMGDSRTEMEGFRFWKTGYWANFLRGRPYHISALYVVDLVR 1370 Query: 191 WADAKLTEKALSI---LMSKDNVYKYPDQDVMNVLLKGM-TLFLPREYNTIYTIK---SE 243 + ++ L + N DQD+ N + + LP+E+ T Sbjct: 1371 FRQLAAGDRLRQQYHSLSADPNSLSNLDQDLPNNMQFNLPIHSLPQEWLWCETWCSDEDL 1430 Query: 244 LKDKTHQNYKKLITESTLL 262 K KT T+ L Sbjct: 1431 AKAKTIDLCNNPQTKEPKL 1449 >UniRef50_A1VG39 Glycosyl transferase, family 8 n=1 Tax=Desulfovibrio vulgaris DP4 RepID=A1VG39_DESVV Length = 335 Score = 166 bits (420), Expect = 1e-39, Method: Composition-based stats. 
Identities = 48/286 (16%), Positives = 114/286 (39%), Gaps = 22/286 (7%) Query: 26 ECLNVAYGVDANYLDGVGVSITSIVLNNRHIN-LDFYIIADVYNDGFFQKIAKLAEQNQL 84 + + + DANY V++ S+ N + Y++ + + G I + + Sbjct: 2 NTVPIVFTFDANYRLPASVALQSLFENAKDSTYYHVYLVCEGLSRGDKDAIESICPEKNG 61 Query: 85 RITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLH 144 R+ ++ + P ++ W + +Y R+ LL D+++Y D DVV D++++ Sbjct: 62 RVEWIDVDNELFSSAPSSENWPKIVYARILLPLLLP--FDKVIYSDVDVVFCSDLAEIFQ 119 Query: 145 LGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQ---YFNSGVVYLDLKKWADAKLTEKAL 201 + ++G A V ++ V+R + Q + SG + ++L+ + + L Sbjct: 120 IEVDGCEWAGVAAELVAFQEGVARCHNVHCEYQNELIYMSGFMVMNLRLMREKDTVGRCL 179 Query: 202 SILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTI-----------KSELKDKTHQ 250 + + + K D +++N+ + Y + + L+ Sbjct: 180 NNISKFGSRLKMYDLEILNMS-SDNIARIDFSYCVLENVFFAKNVSEAKEYPWLRGLYRV 238 Query: 251 NYKKLITESTLLIHYTGAT-KPWHKWAIYPSVKYYKIALENSPWKD 295 + + + +IH+ G+ K W ++ + Y+ L SP++ Sbjct: 239 SELEAARSAPRIIHFAGSDTKVWERYCVPQV---YRKYLAVSPFRS 281 >UniRef50_B0NR59 Putative uncharacterized protein n=1 Tax=Bacteroides stercoris ATCC 43183 RepID=B0NR59_BACSE Length = 306 Score = 165 bits (417), Expect = 2e-39, Method: Composition-based stats. 
Identities = 58/280 (20%), Positives = 117/280 (41%), Gaps = 15/280 (5%) Query: 26 ECLNVAYGVDANYLDGVGVSITSIVLNNRHINL-DFYIIAD-VYNDGFFQKIAKLAEQNQ 83 + + + + +D NY+ GV I S+++N+ D YI++ + + + K + Sbjct: 2 KKIPIVFSIDHNYVMQAGVCILSLLMNSDEKEYYDIYILSAADITEHDKELLNKTIFAYK 61 Query: 84 LRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLL 143 I I+ D+ + S+A YFRL L+ D+++Y D DV+ + + ++L Sbjct: 62 ADINFIEID-DRFDNAFEIRNISKAAYFRLLIPDLIP-QYDKIIYSDVDVIFQSGLQEVL 119 Query: 144 HLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSI 203 L +K + K + G Y NSG + ++ K + +L K Sbjct: 120 DTDLKDNYFGGIKAIGAESIKDYIIQLGLNIHG-YINSGFLLINAKLQREKQLFNKIQEY 178 Query: 204 LMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITEST--- 260 L K +++ DQD++N++ K FLP +Y + + + Sbjct: 179 LTKK---FQFQDQDIINIVCKNRLTFLPLKYCFTQKSYELYYTNPKRLFSVFSPKEVEEA 235 Query: 261 ---LLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDS 297 +IHY G KPW+ + Y +++ ++ + ++ Sbjct: 236 FTEGIIHYEGTNKPWNGFC-YRYDNWWRYYKKSVFYSEEM 274 >UniRef50_Q0I2Z7 Lipopolysaccharide biosynthesis protein, glycosyltransferase, family 8 n=1 Tax=Haemophilus somnus 129PT RepID=Q0I2Z7_HAES1 Length = 354 Score = 165 bits (417), Expect = 3e-39, Method: Composition-based stats. 
Identities = 70/354 (19%), Positives = 125/354 (35%), Gaps = 61/354 (17%) Query: 28 LNVAYGVDANYLDGVGVSITSIV----LNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQ 83 +N+ + D NY + V++ SI+ NN + FY++ + +LA +N Sbjct: 1 MNILFACDDNYAKYLAVTMLSIIHARDKNNECYTIHFYLLDMGISTVAKDYCLELANKNN 60 Query: 84 LRITLYRINTDKLQCLPCT-QVWSRAMYFRLFAFQLLGL-TLDRLLYLDADVVCKGDISQ 141 + + I+ + P T + S + Y RL L L +++YLD D++ + Sbjct: 61 CHLDIVPISISDFEKFPRTIEYISLSTYARLNLANYLKKFNLTKIIYLDIDILVNHSLLP 120 Query: 142 LLHLGLNGAVAAVVKDVEPMQEKAVSRLS------------------------------- 170 L + L D ++ R+S Sbjct: 121 LWNTDLGNKAIGACYDAFIESQEKSKRMSSQSVSQSVSQSVSQSVSQSVSQSVSQSVSQS 180 Query: 171 --------------DPELLGQYFNSGVVYLDLKKWADAKLTEK---ALSILMSKDNVYKY 213 YFN+GV+ +++ +W + EK + + + Y Sbjct: 181 VSQSVSQSDYKTKLHLPNTHFYFNAGVLLINVVEWEKCHVFEKSLQWIEYCKRNNIEFLY 240 Query: 214 PDQDVMNVLLKGMTLFLPREYNTIYTIKSELK--DKTHQNYKKLITESTLLIHYTGATKP 271 DQD++N + +L YN + LK K N + T +IHY G K Sbjct: 241 QDQDILNAIFANNVKYLDLRYNFTANALNRLKRVSKKELNQYEEATMPLAIIHYVGPKKS 300 Query: 272 WHKWAIYPSVKYY---KIALENSP--WKDDSPRDAKSIIEFKKRYKHLLVQHHY 320 WH+ + LEN P WK ++ + + F K +H ++ Y Sbjct: 301 WHEKCSMLKANLFCHLFQQLENPPKEWKIENVPFIRKLKRFAKDLRHKIIYKIY 354 >UniRef50_C4R603 Protein required for beta-1,6 glucan biosynthesis n=2 Tax=Pichia pastoris GS115 RepID=C4R603_PICPG Length = 1450 Score = 164 bits (415), Expect = 4e-39, Method: Composition-based stats. 
Identities = 50/261 (19%), Positives = 95/261 (36%), Gaps = 16/261 (6%) Query: 17 FRLANINTSECLNVAYGVDAN-YLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKI 75 +R + +N+ + Y + + S++ + +H + F++I + + F + + Sbjct: 1153 WRKQEQPKNADINIFTVASGHLYERFLSIMTNSVMKHTKHT-VKFWLIENYMSPTFKKNL 1211 Query: 76 AKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVC 135 LA + L + + Y LF L +LD+++++DAD + Sbjct: 1212 PFLAREFGFDYELVNYKWPAWLRGQREKQRTIWGYKILFLDVLFPQSLDKVIFVDADQIV 1271 Query: 136 KGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLG-------QYFNSGVVYLDL 188 + D+ +L+ L L GA +E R +Y S + +DL Sbjct: 1272 RTDLKELVDLDLEGAPYGYTPMCNDREEMEGFRFWKQGYWQKLLGDTLKYHISALYVIDL 1331 Query: 189 KKWADAKLTEKA---LSILMSKDNVYKYPDQDVMNVLLKG-MTLFLPREYNTIYTIKSE- 243 K + ++ L N DQD+ N L LP+E+ T S+ Sbjct: 1332 KTFRQIAAGDRLRQHYQQLSQDPNSLSNLDQDLPNNLQHQIKIFSLPQEWLWCETWCSDE 1391 Query: 244 --LKDKTHQNYKKLITESTLL 262 K KT +T+ L Sbjct: 1392 SLKKAKTIDLCNNPLTKEPKL 1412 >UniRef50_B8PIH6 Predicted protein n=2 Tax=Agaricomycetes RepID=B8PIH6_POSPM Length = 532 Score = 164 bits (415), Expect = 4e-39, Method: Composition-based stats. 
Identities = 58/306 (18%), Positives = 119/306 (38%), Gaps = 40/306 (13%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAK-LAEQNQLRI 86 +N+A D Y V+I S++ + + L Y++ D K+ + + + + Sbjct: 227 MNIAIATDPAYAMAAAVAIHSVIAHTKSR-LTIYVLDLGLGDNDRNKLRRSMPRRADATM 285 Query: 87 TLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLG 146 ++ ++ +A + ++ +L ++R+LYLDADV+ + DI L Sbjct: 286 VFIPLDY-------ASERKEKATWAKIDMIDVLP--VERVLYLDADVLVRADIWGLWSTD 336 Query: 147 LNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMS 206 L G DV + + + YFN+GV+ LDL T +AL Sbjct: 337 LRGKPIGAAIDVGFPEGHNGT------VRKPYFNAGVLLLDLAAVRR---TLQALQGAAR 387 Query: 207 KDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKS-ELKDKTHQNYKKLITESTLLIHY 265 + ++ DQD++N + + ++N EL + QN + ++ ++H+ Sbjct: 388 EYTTSRFRDQDLLNAYFEANWAEVSLKWNAQGIATYAELPTEARQNIDMGLLKNPYIVHF 447 Query: 266 TGAT-----------------KPWHKWAI--YPSVKYYKIALENSPWKDDSPRDAKSIIE 306 TG KPW +P + + +E + WK + ++ Sbjct: 448 TGPVNPTLEVVLNPYIQPYTAKPWGYAGAPGHPHGEEWWNVVEQTAWKGWRASEEYRMLC 507 Query: 307 FKKRYK 312 ++ + Sbjct: 508 ASEKER 513 >UniRef50_A2RAV0 Catalytic activity: UDP-glucose + glycogenin <=> UDP + glucosylglycogenin n=2 Tax=Aspergillus RepID=A2RAV0_ASPNC Length = 767 Score = 164 bits (415), Expect = 4e-39, Method: Composition-based stats. 
Identities = 50/272 (18%), Positives = 87/272 (31%), Gaps = 46/272 (16%) Query: 31 AYGV---DANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 Y +YL G V S+ N L D Q++ + ++ Sbjct: 7 VYCTLLLSDHYLPGATVLAHSLRDNGSKAKLVALFTPDSLQPATIQELQAVYDELIPVHP 66 Query: 88 LYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGL 147 L I L + + A + ++ ++ R++Y+D DVV +LL L + Sbjct: 67 LTNITPANLWLMDRPDL--IATFTKIELWR--QTQYKRIVYIDCDVVALRAPDELLDLEV 122 Query: 148 NGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKA-LSILMS 206 + A V DV FNSGV+ L L + L L Sbjct: 123 D---FAAVPDVGWP---------------DCFNSGVMVLRP------NLQDYLALRALAE 158 Query: 207 KDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYT 266 + + DQ ++N+ + L YN + + K + +IH+ Sbjct: 159 RGISFDGADQGLLNMHFRD-WHRLSFSYNCTPSANYQYIPA-----YKHFQSTISMIHFI 212 Query: 267 GATKPWH--------KWAIYPSVKYYKIALEN 290 GA KPW+ + + + Sbjct: 213 GAQKPWNMARQVEPIHSPYNQLLGRWWAVYDR 244 >UniRef50_C4ZG45 Glycosyltransferase Family 2 modular protein n=1 Tax=Eubacterium rectale ATCC 33656 RepID=C4ZG45_EUBR3 Length = 723 Score = 163 bits (413), Expect = 7e-39, Method: Composition-based stats. 
Identities = 56/281 (19%), Positives = 113/281 (40%), Gaps = 23/281 (8%) Query: 25 SECLNVAYGV---DANYLDGVGVSITSIVLNNRHINLDFYIIADV-YNDGFFQKIAKLAE 80 +++ G+ D NY G ++ SIV N + + F+I+ D N+ K++ +A+ Sbjct: 340 DNAIHICLGIHDKDGNYSVWAGTTMQSIVENTK-APIVFHILHDDTLNEMNKNKLSLIAD 398 Query: 81 QNQLRITLYRINTDKL-QCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDI 139 + I + N D ++ FR+ ++ L +++YLD+D+ DI Sbjct: 399 NSGNGIEFHHFNPDIFGSLADSMNRFTIGTMFRIMLPDIMP-DLKKIIYLDSDLFVNTDI 457 Query: 140 SQLLHLGLNGAVAAVVKDVEPMQEKAVSR--LSDPELLGQYFNSGVVYLDLKKWADAK-L 196 +L +L ++ A +D ++ + +YFN+GV+ ++L L Sbjct: 458 EELWNLNIDNYCLAAAQDCSTIRNWGTPYAVAAGQTSRDRYFNAGVLCMNLDNIRKNGSL 517 Query: 197 TEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLI 256 ++ + L + PDQD +N + G TL + ++N + +K + Sbjct: 518 FQQVMDYLSDNPRTW-LPDQDALNAIFSGKTLLIDEKWNYFIDEARKNNEKAEKKIYHYA 576 Query: 257 TESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDS 297 + L++H YY L +PW + Sbjct: 577 -ATLLMLH----------TNNEIDRAYYFTILR-TPWGEQM 605 >UniRef50_Q873M5 UDP-Glc:glycoprotein glucosyltransferase n=2 Tax=Yarrowia lipolytica RepID=Q873M5_YARLI Length = 1470 Score = 163 bits (413), Expect = 7e-39, Method: Composition-based stats. 
Identities = 44/257 (17%), Positives = 92/257 (35%), Gaps = 16/257 (6%) Query: 21 NINTSECLNVAYGVDAN-YLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLA 79 + +N+ + Y + + S++ + H + F++I + + F + LA Sbjct: 1157 STKKQADINIFTVASGHLYERFLSIMTASVMAHTDHT-VKFWLIENFLSASFKAFLPHLA 1215 Query: 80 EQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDI 139 L + Y LF L L+R++++D+D + + D+ Sbjct: 1216 AHYGFEYELVTYQWPHWLRGQTEKQRQIWGYKILFLDVLFPQDLERVIFIDSDQIVRTDL 1275 Query: 140 SQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQ-------YFNSGVVYLDLKKWA 192 +L+ + L GA + +E R Y S + +DLK + Sbjct: 1276 YELVEMDLEGAPYGFTPMCDSRKEMDGFRFWKQGYWDTFLGDDLVYHISALFVVDLKVFR 1335 Query: 193 DAKLTEKA---LSILMSKDNVYKYPDQDVMNVLLKGM-TLFLPREYNTIYTIKSELKDKT 248 ++ ++ L + DQD+ N L + + LP+++ T S+ KT Sbjct: 1336 AQQIGDRLRVHYHQLSADPASLSNLDQDLPNNLQRQVPIFSLPQDWLWCETWCSDESLKT 1395 Query: 249 HQN---YKKLITESTLL 262 + +T+ L Sbjct: 1396 AKTIDMCNNPLTKEPKL 1412 >UniRef50_B6K765 UDP-glucose:glycoprotein glucosyltransferase n=1 Tax=Schizosaccharomyces japonicus yFS275 RepID=B6K765_SCHJY Length = 1444 Score = 163 bits (412), Expect = 9e-39, Method: Composition-based stats. 
Identities = 50/239 (20%), Positives = 94/239 (39%), Gaps = 12/239 (5%)

Query: 16   DFRLANINTSECLNVA-YGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQK 74
            FR A + +N+ Y + + S++ + +H + F+ I + + F +
Sbjct: 1123 PFRGAQKDEHAEINIFSLASGHLYERFIYIMTRSVMEHTKHT-VKFWFIENFLSPSFKRD 1181

Query: 75   IAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVV 134
            IA LAE+ + + N + Y LF L L L++++++DAD +
Sbjct: 1182 IAILAEKYKFKYEFVTYNWPHWLRKQTEKQREIWGYKILFLDVLFPLDLEKVIFVDADQI 1241

Query: 135  CKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLG------QYFNSGVVYLDL 188
            + D+ +L+ L L GA A + E R +Y S + +DL
Sbjct: 1242 VRADLKELMDLDLKGAPYAYTPMCDSRTEMEGFRFWKQGYWKKYLRGMKYHISALYVVDL 1301

Query: 189  KKWADA---KLTEKALSILMSKDNVYKYPDQDVMNVLLKGM-TLFLPREYNTIYTIKSE 243
            ++ L + +L + DQD+ N L + + LP+E+ T S+
Sbjct: 1302 DRFRHMGAGDLLRRQYQLLSADPESLSNLDQDLPNHLQRMIPIYSLPQEWLWCETWCSD 1360

>UniRef50_Q8T191 Probable UDP-glucose:glycoprotein glucosyltransferase A n=2 Tax=Dictyostelium discoideum RepID=UGGG_DICDI
          Length = 1681

 Score = 163 bits (412), Expect = 1e-38, Method: Composition-based stats.
Identities = 43/259 (16%), Positives = 91/259 (35%), Gaps = 15/259 (5%) Query: 18 RLANINTSECLNVAYGVDAN-YLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIA 76 + + +++ + Y + + + S+V N + F+ + + + F + I Sbjct: 1359 THQKKSNLDTIHIFSVASGHLYERFLKIMMLSVVKN-TESPIKFWFLKNYLSPAFKEFIP 1417 Query: 77 KLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCK 136 ++A++ + L + Y LF L L + +++++DAD V + Sbjct: 1418 EMAKEYGFQYELVTYKWPWWLRKQTEKQRIIWSYKILFLDVLFPLDVPKIIFVDADQVVR 1477 Query: 137 GDISQLLHLGLNGAVAAVVKDVEPMQ------EKAVSRLSDPELLGQYFNSGVVYLDLKK 190 D+ +L + L+GA + + Y S + +DL + Sbjct: 1478 TDLKELWDMDLHGASLGYTPFCDSNKDTEGFRFWKSGYWRQHLAGRSYHISALYVVDLVR 1537 Query: 191 WADAKLTEKA---LSILMSKDNVYKYPDQDVMNVLLKG-MTLFLPREYNTIYTIKSE--- 243 + ++ L N DQD+ N L LP+E+ T + Sbjct: 1538 FRRLAAGDQLRATYDQLSRDPNSLANLDQDLPNYLQHYVRIHSLPQEWLWCETWCDQESK 1597 Query: 244 LKDKTHQNYKKLITESTLL 262 K KT +T++ L Sbjct: 1598 SKAKTIDLCNNPLTKTPKL 1616 >UniRef50_B6HCQ7 Pc18g02120 protein n=2 Tax=mitosporic Trichocomaceae RepID=B6HCQ7_PENCW Length = 711 Score = 162 bits (411), Expect = 1e-38, Method: Composition-based stats. 
Identities = 49/271 (18%), Positives = 85/271 (31%), Gaps = 44/271 (16%) Query: 31 AYGV---DANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 Y NYL G V S+ N L D ++ + ++ + Sbjct: 8 VYCTLLLSDNYLPGAMVLAHSLRDNGTKARLVALFTPDRLQSSTIDELRSVYDELIPVSS 67 Query: 88 LYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGL 147 + L + + A + ++ ++L R++Y+D DVV +LL L Sbjct: 68 MVNDTPANLWLMDRPDL--IATFTKIELWRL--TQYQRVVYIDCDVVALRAPDELLSLEA 123 Query: 148 NGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSK 207 + A DV FNSG++ L AL L + Sbjct: 124 D---FAAAPDVGWP---------------DCFNSGMMVL-----RPNLQDYYALRALAQR 160 Query: 208 DNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTG 267 + DQ ++N+ + L YN + + K + LIH+ G Sbjct: 161 GISFDGADQGLLNMHFRD-WHRLSFTYNCTPSANYQYIPA-----YKHFQSTISLIHFIG 214 Query: 268 ATKPWHKWAI-----YPSVKY---YKIALEN 290 A KPW+ P + + + Sbjct: 215 ARKPWNMPRQIVPLESPYNQLLGRWWAVYDR 245 >UniRef50_Q2HHC6 Putative uncharacterized protein n=1 Tax=Chaetomium globosum RepID=Q2HHC6_CHAGB Length = 1406 Score = 162 bits (411), Expect = 1e-38, Method: Composition-based stats. 
Identities = 38/227 (16%), Positives = 80/227 (35%), Gaps = 12/227 (5%)

Query: 18   RLANINTSECLNVAYGVDAN-YLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIA 76
            +N+ + Y + + + S++ + H + F+ I + F I
Sbjct: 1115 NSLATTQHAEINIFSVASGHLYERMLNIMMVSVMRHTNHT-VKFWFIEQFLSPSFKDFIP 1173

Query: 77   KLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCK 136
            LA + + + Y LF L L+LD+++++DAD + +
Sbjct: 1174 HLAAEYNFSYEMVTYKWPHWLRQQKEKQREIWGYKILFLDVLFPLSLDKVIFVDADQIVR 1233

Query: 137  GDISQLLHLGLNGAVAAVVKDVEP------MQEKAVSRLSDPELLGQYFNSGVVYLDLKK 190
            D+ +L L L GA + + ++ Y S + +DL++
Sbjct: 1234 TDMHELATLDLEGAPYGFTPMCDSRTEMEGFRFWKTGYWANYLKGHPYHISALYAVDLRR 1293

Query: 191  WADAKLTEKALS---ILMSKDNVYKYPDQDVMNVL-LKGMTLFLPRE 233
            + + ++ L + N DQD+ N + + LP+
Sbjct: 1294 FRELAAGDRLRQQYHALSADPNSLANLDQDLPNHMQFQIPIHSLPQS 1340

>UniRef50_C0AA16 Lipopolysaccharide biosynthesis protein LPS:glycosyltransferase-like protein n=1 Tax=Opitutaceae bacterium TAV2 RepID=C0AA16_9BACT
          Length = 726

 Score = 162 bits (410), Expect = 2e-38, Method: Composition-based stats.
Identities = 68/335 (20%), Positives = 130/335 (38%), Gaps = 40/335 (11%) Query: 26 ECLNVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKIAKLAEQNQL 84 C+N+A+ D ++ + V+I SIV + N D I+ + + + I + + Sbjct: 403 NCINIAFNCDDKFVPYLCVAIKSIVATASTENNYDILILTEGLSPANLKWIDGIKHAKNV 462 Query: 85 RITLYRINT----DKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDIS 140 + + + + + SR Y RL+ +LL ++LYLD D++ + D++ Sbjct: 463 SLRVVNVRDYLQDKDISSFFMRSMVSRIAYVRLYLGELL-EKYAKVLYLDCDLIAQSDVA 521 Query: 141 QLLHLGLNGAVAAVVKD--VEPMQEKAVSRLSDPE----------LLGQYFNSGVVYLDL 188 +L ++ L+G V A V D + K V+ D + + QYFNSGV+ DL Sbjct: 522 ELFNMNLDGNVCAAVPDLAISTETIKNVAAYRDIDVYLRDVLGVTDISQYFNSGVMVFDL 581 Query: 189 KKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKT 248 +K L + ++ + DQ+V+N L G L L E+N ++ +D T Sbjct: 582 EKIRTDNLQQTFIAAAAKNTK--FFMDQNVLNSALYGKVLLLGFEWNKRVSLAMANRDTT 639 Query: 249 HQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPR---DAKSII 305 ++ ++H+ KP K + + P+ ++ S Sbjct: 640 TES---------KILHFAAEPKPLQKIHMPEHYN-WWEYARQLPFYEELLSRVIKPSSTN 689 Query: 306 EFKKRYKH-------LLVQHHYISGIIAGVCYLCR 333 K I+ + + +L Sbjct: 690 FSSTSQKLPSLNKFIYRKYIKPITALNNLLKFLKI 724 >UniRef50_A3V3C9 Putative uncharacterized protein n=1 Tax=Loktanella vestfoldensis SKA53 RepID=A3V3C9_9RHOB Length = 324 Score = 162 bits (410), Expect = 2e-38, Method: Composition-based stats. 
Identities = 65/327 (19%), Positives = 125/327 (38%), Gaps = 24/327 (7%) Query: 16 DFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKI 75 + + N +V + D +YL ++I +++ NN + D I + Sbjct: 4 EIKAENRPQKFRQSVIFCADQSYLPFASLAIHTLLRNNPVRDYDICI-------ASVDAL 56 Query: 76 AKLAEQNQLRITLYRINT-DKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVV 134 E I +I+ + +P ++ +S A Y R+ + DR+ YLDADV Sbjct: 57 VPPTELKDHDIRFCQIDVGNAFDGMPVSKRFSLAAYLRIALPEAFAGQYDRIFYLDADVF 116 Query: 135 CKGD-ISQLLHLGLNGAVAAVVKDVEPMQE--KAVSRLSDPELLGQYFNSGVVYLDLKKW 191 GD I + L + V D+ ++ K + G YFNSGV+ D++++ Sbjct: 117 VVGDAIDAVFRLDMLSCPVGAVTDITKLKHPNKPTFDQKALGVDGPYFNSGVMLFDVERF 176 Query: 192 ADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQN 251 ++ E+ Y DQ ++N++L+ L +N + L + Sbjct: 177 ITMRVRERCAEAAKFYQGEPIYFDQTLLNIVLQKEWAQLNLGWNWQWPFSRSLFECFI-- 234 Query: 252 YKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRY 311 ++H+ G KPW +KY + A ++ P A+ I Sbjct: 235 -------DVQIVHFIGDDKPWSDHKRRLPLKYRETARR--FFQKFYPELAQKIPAADAAL 285 Query: 312 KHLLVQHHYISGIIAGVCYLCRKYYRK 338 ++ + H++ I +L K + + Sbjct: 286 RNGALYHYFFRHI--TKIHLFTKCFNR 310 >UniRef50_O48684 F3I6.10 protein n=46 Tax=Embryophyta RepID=O48684_ARATH Length = 393 Score = 162 bits (409), Expect = 2e-38, Method: Composition-based stats. 
Identities = 58/287 (20%), Positives = 113/287 (39%), Gaps = 26/287 (9%) Query: 23 NTSECLNVAYGVDANYLDGVGVSITSIVLNNR-HINLDFYIIADVYNDGFFQKIAKLAEQ 81 N +++A +D+ YL G ++ S++ + N+ F+ IA ++ + +++L Sbjct: 80 NDPSLVHIAMTLDSEYLRGSIAAVHSVLRHASCPENVFFHFIAAEFDSASPRVLSQLVRS 139 Query: 82 NQLRITL--YRINTDKLQCLPCTQVW----SRAMYFRLFAFQLLGLTLDRLLYLDADVVC 135 + Y D + L + + + Y R + +L +++R++YLD+DV+ Sbjct: 140 TFPSLNFKVYIFREDTVINLISSSIRLALENPLNYARNYLGDILDRSVERVIYLDSDVIT 199 Query: 136 KGDISQLLHLGLNGAVAAVVK---DVEPMQEKAVSRLSDPELLG-------QYFNSGVVY 185 DI++L + L G+ Q SDP L G YFN+GV+ Sbjct: 200 VDDITKLWNTVLTGSRVIGAPEYCHANFTQYFTSGFWSDPALPGLISGQKPCYFNTGVMV 259 Query: 186 LDLKKWADAKLTEKALSIL--MSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSE 243 +DL +W + EK + K +Y ++ G + +N Sbjct: 260 MDLVRWREGNYREKLEQWMQLQKKMRIYDLGSLPPFLLVFAGNVEAIDHRWNQ----HGL 315 Query: 244 LKDKTHQNYKKLITESTLLIHYTGATKPWHKWAIY---PSVKYYKIA 287 D + + L L+H++G KPW + P ++ Sbjct: 316 GGDNIRGSCRSLHPGPVSLLHWSGKGKPWVRLDEKRPCPLDHLWEPY 362 >UniRef50_D1HWZ1 Whole genome shotgun sequence of line PN40024, scaffold_216.assembly12x (Fragment) n=3 Tax=Magnoliophyta RepID=D1HWZ1_VITVI Length = 503 Score = 161 bits (408), Expect = 3e-38, Method: Composition-based stats. 
Identities = 48/292 (16%), Positives = 100/292 (34%), Gaps = 30/292 (10%) Query: 18 RLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHI-NLDFYIIADVYNDGFFQKIA 76 + + + A D + V + S V N + F+++ D N G Q + Sbjct: 209 TPPELEDPKLYHYAIFSDN--VIAASVVVNSAVKNAKEPWKHVFHVVTDKMNLGAMQVMF 266 Query: 77 KLAEQNQLRITLYRINTDKLQC---------LPCTQVWSRAMYFRLFAFQLLGLTLDRLL 127 K+ + N I + + K L + S + R + ++ L R+L Sbjct: 267 KMRDYNGSHIEVKAVEDYKFLNSSYVPVLRQLENPKYLSMLNHLRFYLPEMYP-KLHRIL 325 Query: 128 YLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLS--------DPELLGQYF 179 +LD DVV + D++ L + ++G V V+ + ++ + Sbjct: 326 FLDDDVVVQRDLTGLWKIDMDGKVNGAVETCFGSFHRYAQYMNFSHPLIKEKFNPKACGW 385 Query: 180 NSGVVYLDLKKWADAKLTEK--ALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTI 237 G+ + DL W K TE+ L ++K + T L + ++ + Sbjct: 386 AYGMNFFDLDAWRKEKCTEQYHYWQNLNENRTLWKLGTLPPGLITFYSTTKPLDKSWHVL 445 Query: 238 YTIKSELKDKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALE 289 + + + ++H+ G KPW A+ + ++ Sbjct: 446 GLGYN-------PSISMDEIHNAAVVHFNGNMKPWLDIAMNQFRPLWTKHVD 490 >UniRef50_Q2L3C5 Glycosyl transferase-like protein n=3 Tax=Magnoliophyta RepID=Q2L3C5_BRASY Length = 689 Score = 161 bits (407), Expect = 3e-38, Method: Composition-based stats. 
Identities = 50/322 (15%), Positives = 98/322 (30%), Gaps = 54/322 (16%) Query: 10 DKVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYN- 68 + + + + A D + V + S +++ + F+I+ D N Sbjct: 367 NSNNKDFPNTEKLEDPKLHHYAVFSDN--VLAAAVVVNSTLVHATNH--VFHIVTDRLNY 422 Query: 69 ------------------DGFFQKIAKLAEQNQL---------RITLY----RINTDKLQ 97 Q+ L I Y D+ Sbjct: 423 AAMKMWFLANPLGKAAVQVQNIQEFTWLNSSYSPVLKQLGSRSTIDYYFRSGTARPDENP 482 Query: 98 CLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKD 157 + S + R + ++ L+++L+LD D V + D+S L + L G V V+ Sbjct: 483 KFRNPKYLSILNHLRFYLPEIFP-KLNKVLFLDDDTVVQQDLSALWSIDLKGKVNGAVET 541 Query: 158 VEPMQEKAVSRL--------SDPELLGQYFNSGVVYLDLKKWADAKLTE--KALSILMSK 207 + L ++ + G+ DL +W +T+ L Sbjct: 542 CGETFHRFDKYLNFSNPIVANNFHPQACGWAFGMNMFDLSEWRKQNITDVYHTWQKLNED 601 Query: 208 DNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTG 267 ++K V T L R ++ + + N + +IHY G Sbjct: 602 RLLWKLGTLPAGLVTFWNRTFPLDRSWHLLGLGYN-------PNVNERDIRRASVIHYNG 654 Query: 268 ATKPWHKWAIYPSVKYYKIALE 289 KPW + + KY+ ++ Sbjct: 655 NLKPWLEIGLSKYRKYWSRYVD 676 >UniRef50_C6EQF4 Putative uncharacterized protein n=3 Tax=Campylobacter jejuni RepID=C6EQF4_CAMJE Length = 958 Score = 161 bits (407), Expect = 4e-38, Method: Composition-based stats. 
Identities = 68/306 (22%), Positives = 128/306 (41%), Gaps = 29/306 (9%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRH-INLDFYIIADVYNDGFFQKIAKLAEQNQLRI 86 + + + VD NYL + +++ S+V + + +++ + ++ + N + I Sbjct: 13 IPIVFAVDDNYLPYMSIALNSLVDRVSNCYKYNIFVMHLNIDLERLNRLKENIRNNNVTI 72 Query: 87 TLYRINT-------DKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDI 139 +N + ++ AMY+R+F ++ +++Y D+DV+ K DI Sbjct: 73 EFINLNQYLKKIFKEYGNIFYERSYFTTAMYYRIFIPEIF-SNFKKVIYCDSDVIFKADI 131 Query: 140 SQLLHLGLNGAVAAVVKDV------------EPMQEKAVSRLSDPELLGQYFNSGVVYLD 187 S L + LN +D+ + + + YFNSGV+ D Sbjct: 132 SHLFFIDLNNKEIGACRDIAALYAYRKRETVWQQNIRNNFDKINFRSISDYFNSGVIVFD 191 Query: 188 LKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDK 247 + K K K L+++ + +PDQDV+N++ G FLP E+N ++T E KD Sbjct: 192 IVKCIQMKTVSKCLTVI-KNIDNLYFPDQDVLNIVFCGHVHFLPLEWNFLWTTYIEYKDN 250 Query: 248 THQNYKKLITE------STLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDA 301 KK+I E +IHY TKPW V+++K +N + + + Sbjct: 251 FMYLPKKIINEIYKAKTKPKIIHYISETKPWKDKNS-FFVEWWKFPRKNLFYGEILCKKL 309 Query: 302 KSIIEF 307 + Sbjct: 310 MIQNSY 315 >UniRef50_A5BZU1 Putative uncharacterized protein n=1 Tax=Vitis vinifera RepID=A5BZU1_VITVI Length = 648 Score = 161 bits (407), Expect = 4e-38, Method: Composition-based stats. 
Identities = 50/297 (16%), Positives = 104/297 (35%), Gaps = 29/297 (9%) Query: 2 DSFPAIEIDKVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRH-INLDF 60 D F K + +L + + A D + V I S +L F Sbjct: 355 DYFLQGX-QKRVVLNKKL--LEDPSLYHYAIFSDN--VLATSVVINSTMLXASEPEKHVF 409 Query: 61 YIIADVYNDGFFQKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLG 120 +I+ D + + + ++ I + I+ + S + R + ++ Sbjct: 410 HIVTDKLSFAAMKMWFLVNSPAKVTIQVENIDD-----FKNPKYLSMLNHLRFYLPEVYP 464 Query: 121 LTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLS--------DP 172 L+++L+LD D+V + D++ L L + G V A V+ + + L+ + Sbjct: 465 -KLEKILFLDDDIVVQKDLTPLWSLDMQGMVNAAVETCKESFHRFDKYLNFSHPKISENF 523 Query: 173 ELLGQYFNSGVVYLDLKKWADAKLT--EKALSILMSKDNVYKYPDQDVMNVLLKGMTLFL 230 + + G+ DLK+W +T + ++K + +T L Sbjct: 524 DPNACGWAFGMNMFDLKEWRKRNMTGIYHYWQDMNEDRTLWKLGSLPPGLITFYNLTYPL 583 Query: 231 PREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIA 287 R ++ + + ++ ++HY G KPW + AI Y+ Sbjct: 584 DRSWHVLGLGYD-------PQLNQTEIDNAAVVHYNGNYKPWLELAIAKYKSYWSRY 633 >UniRef50_C7XX93 Glycosyl transferase, family 8 n=1 Tax=Lactobacillus coleohominis 101-4-CHN RepID=C7XX93_9LACO Length = 398 Score = 161 bits (407), Expect = 4e-38, Method: Composition-based stats. 
Identities = 58/269 (21%), Positives = 97/269 (36%), Gaps = 27/269 (10%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLY 89 + D +Y + +I SIV + + Y+I +F + +Q + Sbjct: 7 IVLSGDNHYTAQITTTIKSIVYHL--RRVKIYLINSDIPQEYFFNLNLRLKQLDSELVDL 64 Query: 90 RINTDKLQCLPCTQ-VWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLN 148 +IN + + S+ Y RL QL+ DR LY+D+D + IS+L + L Sbjct: 65 KINPELFSNAESPKAHISKITYGRLMIPQLVTE--DRALYIDSDAIVDQSISELWTMDLG 122 Query: 149 GAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADA-KLTEKALSILMSK 207 A V DV L FN+G++ + KK + L + L+ K Sbjct: 123 DYPIAAVHDVF---------------LADIFNAGIILFNNKKLREDPDLVDNMLAAAQQK 167 Query: 208 DNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSE--LKDKTHQNYKKLITE--STLLI 263 DQ V+N L L EYN + + L + Y + + +I Sbjct: 168 G--ILDADQTVLNQFFNHQYLELGLEYNYVIGYDRDVSLAPRNAPGYFEKMLNCPQPKII 225 Query: 264 HYTGATKPWHKWAIYPSVKYYKIALENSP 292 HY KPW+ + + + Sbjct: 226 HYASPDKPWNLQSAGRMREKWWQYHNLDW 254 >UniRef50_C3XGD2 Putative uncharacterized protein n=1 Tax=Helicobacter bilis ATCC 43879 RepID=C3XGD2_9HELI Length = 364 Score = 161 bits (407), Expect = 4e-38, Method: Composition-based stats. 
Identities = 66/319 (20%), Positives = 116/319 (36%), Gaps = 40/319 (12%) Query: 56 INLDFYIIADVYNDGFFQKIA----KLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYF 111 F+I+ D QK+ +L + +Y ++ Q LP + YF Sbjct: 46 KPFCFHILTDGLKHETRQKLQAFQIELNKIYPCEFRVYTLSDSIFQGLPKLNN-NYLAYF 104 Query: 112 RLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSD 171 RL L + LYLD D++C DI ++ + L G + VV + Q + R S Sbjct: 105 RLKIASCLPQDIKTCLYLDVDMICVADIREIFYTDLQGKICGVVLVPDHQQYCVLKRNSA 164 Query: 172 PELL-----GQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGM 226 YFNSG++ +D++++ + +K L V DQD +N +L Sbjct: 165 IGDEFVFNASTYFNSGLMLIDVEQYRKYNVEQKCLEWFEQYVPVL--LDQDALNAVLGDH 222 Query: 227 TLFLPREYNTIYTI-----------KSELKDKTHQNYKKLITESTLLIHYTGAT-KPWHK 274 LP E+N + + + K + + ++HYTG T KPW + Sbjct: 223 ICALPLEWNFFVELLKYKRQDFKGKDNNIVMKITYEEYMQVKNNMKILHYTGWTLKPWQQ 282 Query: 275 WAIY-------PSVKYYKIALENSP--WKDDSPRDAKSIIEFKKRY-----KHL--LVQH 318 I + ++P +KD K + KH+ Sbjct: 283 PYIENDMIKTCIYKNKWWEIAHDTPVFYKDIYASYMKKQEDMLYESILSLQKHIKSFKLR 342 Query: 319 HYISGIIAGVCYLCRKYYR 337 + + + + C+K + Sbjct: 343 NRLKRLQQSLKRRCKKLFH 361 >UniRef50_C0X9Z7 Putative uncharacterized protein n=2 Tax=Lactobacillus gasseri JV-V03 RepID=C0X9Z7_9LACO Length = 416 Score = 160 bits (406), Expect = 5e-38, Method: Composition-based stats. 
Identities = 61/292 (20%), Positives = 118/292 (40%), Gaps = 18/292 (6%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLY 89 +A + Y+D + +I SI+ N + N++ +++ +F I + A Q RI Sbjct: 5 IALSANYGYIDKIETTIKSILYNVK--NVEIHLLNYDIPQEWFANINRYANQIGSRIIDE 62 Query: 90 RINTDKLQCLPC-TQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLN 148 + + ++L L + ++ Y RL +L+ R+LYLD+D+V +I +L N Sbjct: 63 KFDPEELHDLNSGFKHINQMTYARLLIPKLIKAN--RVLYLDSDLVVDDEIDELFSRKFN 120 Query: 149 GAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWAD-AKLTEKALSILMSK 207 G V + ++ K SR+ P N+GV+ ++ ++ L+EK L Sbjct: 121 GKKILAVTHIFDVRNKNESRVDLPVPS---INAGVLLINNQELRKDHNLSEKLLDFARKN 177 Query: 208 DNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLIT-----ESTLL 262 + + DQD +N K L +YN L + N + ++ + Sbjct: 178 N--FPQDDQDTINNWFKDEIGSLSFKYNYQIGADRFLFWSNNSNTETATEILDKVKNPKI 235 Query: 263 IHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHL 314 IHY KP++ ++ + + K + K + +H Sbjct: 236 IHYISDDKPFNIFSEGRMRETWWFYRNLEL--SQVVEKYKPLNLDKLKERHF 285 >UniRef50_Q6BJN0 DEHA2G01232p n=3 Tax=Saccharomycetaceae RepID=Q6BJN0_DEBHA Length = 1532 Score = 160 bits (405), Expect = 7e-38, Method: Composition-based stats. 
Identities = 45/269 (16%), Positives = 97/269 (36%), Gaps = 18/269 (6%) Query: 2 DSFPAIEIDKVKAWDFRL-----ANINTSECLNVAYGVDAN-YLDGVGVSITSIVLNNRH 55 D+ + + +K+ ++ L +N+ + Y + + S++ + Sbjct: 1190 DNIESEDDNKIGSFMKSLLKSKAPTTKKHADINIFTIASGHLYERFLSIMTASVMAH-TD 1248 Query: 56 INLDFYIIADVYNDGFFQKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFA 115 ++ F+II + + F + + LA++ L + + Y LF Sbjct: 1249 KSVKFWIIENYISSHFKKLLPLLAQEYNFEYELITYKWPNWLRFQREKQRTIWGYKILFL 1308 Query: 116 FQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELL 175 L L +++++DAD + + D+ +L+ L L GA + ++ R Sbjct: 1309 DVLFPQDLKKVIFVDADQIARTDMKELVDLDLEGAPYGFTPMCDSRKDMEGFRFWKQGYW 1368 Query: 176 GQ-------YFNSGVVYLDLKKWADAKLTEKA---LSILMSKDNVYKYPDQDVMNVLLKG 225 Y S + +DL K+ ++ L S N DQD+ N + Sbjct: 1369 AHVLKDGLKYHISALYVVDLDKFRALSAGDRLRAHYQKLSSDPNSLSNLDQDLPNNMQNK 1428 Query: 226 -MTLFLPREYNTIYTIKSELKDKTHQNYK 253 LP+E+ T S+ + + + Sbjct: 1429 IKIHSLPQEWLWCETWCSDSEFRNAKTID 1457 >UniRef50_Q1RIL1 Lipopolysaccharide 1,2-glucosyltransferase RfaJ n=10 Tax=Rickettsia RepID=Q1RIL1_RICBR Length = 530 Score = 160 bits (404), Expect = 9e-38, Method: Composition-based stats. 
Identities = 58/307 (18%), Positives = 116/307 (37%), Gaps = 33/307 (10%) Query: 11 KVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNR-HINLDFYII---ADV 66 + +L I L++A ++ + I S ++N+ + F+I+ D Sbjct: 233 EEIQSVIKLTGIKQDNTLDIALIINDKFARHAATVIASSLINSDINSFYKFHIVMNPNDS 292 Query: 67 YNDGFFQKIAKLAEQNQLRITLYRINTDKL------QCLPCTQVWSRAMYFRLFAFQLLG 120 + +K+A + I + L + + + +W + +RL+ Q+ Sbjct: 293 LTEESMEKLASMKHIRDYSIDFIPFPENVLDLNLANEKIEFSDMWPPLVMYRLYFDQVFP 352 Query: 121 LTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVE-PMQEKAVSRLSDPELLGQYF 179 L+ +LYLDAD++ D++ L ++ + A D V + ++ Y Sbjct: 353 -NLESILYLDADIIVLRDLNSFKKLDMSNYIVAGSMDTALTYCTLKVEEECNRKINNFYK 411 Query: 180 NSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYT 239 NSG+V+L+L+ + + L + + + YPDQD++N+ L +N Sbjct: 412 NSGIVFLNLQNMREKQAKNMVLDAMHNSKCSFAYPDQDLLNIAFHNYIYPLSMRWNFYTY 471 Query: 240 IKSELKDKTHQNYKKLITESTLLIHYTGATKPWHK----------WAIYPSVKYYKIALE 289 ++ ++HY G KPW+ KYY E Sbjct: 472 FIDRDNYFSYF-----------IMHYAGKKKPWNNEEIKWTKDILEKYQEIEKYYWRYRE 520 Query: 290 NSPWKDD 296 +PW + Sbjct: 521 FTPWGNK 527 >UniRef50_Q4E3K0 UDP-glucose:glycoprotein glucosyltransferase n=2 Tax=Trypanosoma cruzi RepID=Q4E3K0_TRYCR Length = 1668 Score = 159 bits (403), Expect = 1e-37, Method: Composition-based stats. 
Identities = 46/285 (16%), Positives = 93/285 (32%), Gaps = 30/285 (10%) Query: 7 IEIDKVKAWDFRLANI------NTSECLNVAYGVDAN-YLDGVGVSITSIVLNN------ 53 ++++ A LN+ + Y + + I S++ + Sbjct: 1352 EDVNEASALHVDWPPKGPIKSKPDRPTLNIFSVASGHLYERFLRMMIHSVMRTSFDVHGA 1411 Query: 54 RHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRL 113 + F++I + + F + LA+ + + + Y L Sbjct: 1412 NTTRIKFWLIENFLSPQFKTLVPLLAKHYGFDVGFVTYRWPWWLHKQTEKQRTIWAYKVL 1471 Query: 114 FAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDV--------EPMQEKA 165 F L L +DR++++DAD D+ +L ++ + A A + + Sbjct: 1472 FLDVLFPLDVDRVIFVDADQTVLADLHELYNMDIGNAPTAYTPFCRKHPNPATKNFRFWD 1531 Query: 166 VSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKA---LSILMSKDNVYKYPDQDVMNVL 222 + Y S + +DL++ +K S L S N DQD+ N Sbjct: 1532 HGYWLEHLHGKPYHISAIYLVDLRRLRAIAGGDKYRLVYSRLSSDPNSLANLDQDLPN-F 1590 Query: 223 LKGM--TLFLPREYNTIYTIKS---ELKDKTHQNYKKLITESTLL 262 ++ LP E+ T + + KT +T+ L Sbjct: 1591 IQDQVPIYSLPEEWLWCETWCGAESKARAKTIDLCNNPLTKMPKL 1635 >UniRef50_A4WWT5 Glycosyl transferase, family 8 n=1 Tax=Rhodobacter sphaeroides ATCC 17025 RepID=A4WWT5_RHOS5 Length = 319 Score = 159 bits (401), Expect = 2e-37, Method: Composition-based stats. 
Identities = 60/248 (24%), Positives = 93/248 (37%), Gaps = 21/248 (8%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLY 89 V + D NY + I + H D I + I + ++L + Sbjct: 17 VIFCCDRNYYPYAMFAAAQIAGRHPHRGFDICI-------ASLEAIEEPPSLSELAVRHC 69 Query: 90 RINT-DKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKG-DISQLLHLGL 147 I+ + Y RL + DR+LYLD+D+ +G D+ L+ L L Sbjct: 70 TIDAAHLFADFGLDDRRTAVTYLRLVLPEAFSEDYDRILYLDSDIYIQGGDLGALIALPL 129 Query: 148 NGAVAAVVKDVEPMQ---EKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSIL 204 G A V+D + + + V YFNSGV+ D+ + A L ++AL I Sbjct: 130 AGRPLAAVRDNKQWRTPSRRMVDFDRLGLPQRPYFNSGVLLFDVPAFRAANLLQEALRIG 189 Query: 205 MSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIH 264 S+ DQ ++N + G L +N YT S L +IH Sbjct: 190 RSQGRQLVRHDQSLLNACMLGNWAELSPSWNWQYTWSSRLF---------AAMLGPNIIH 240 Query: 265 YTGATKPW 272 + G KPW Sbjct: 241 FIGRCKPW 248 >UniRef50_C5EZG9 Glycosyl transferase family protein n=1 Tax=Helicobacter pullorum MIT 98-5489 RepID=C5EZG9_9HELI Length = 374 Score = 159 bits (401), Expect = 2e-37, Method: Composition-based stats. 
Identities = 56/308 (18%), Positives = 120/308 (38%), Gaps = 33/308 (10%) Query: 58 LDFYIIADVYNDGFFQKIA----KLAEQNQLRITLYRINTDKLQCLPCTQ-VWSRAMYFR 112 +F+++ D + +K+ +L++ + ++ + + + + Y+R Sbjct: 9 YNFHLLMDFVSQETKEKLQNLILELSKIYPCTLNIHILEDEIFRTQSLRTLNGNYLAYYR 68 Query: 113 LFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVV-----KDVEPMQEKAVS 167 L L L++ R +YLD D++ GD+ +L + L G + VV D + + E Sbjct: 69 LRIGSALPLSIKRCVYLDVDMIVLGDLRELFKINLQGKICGVVMEGKDNDTQNILESKNK 128 Query: 168 RLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMT 227 ++ YFNSG++ +DL W + ++A I+ D+ ++N +L+G T Sbjct: 129 INKSIAIVSNYFNSGMLLVDLDLWRKENIEDRAFEIVKKYY--CHKHDEHILNAVLQGQT 186 Query: 228 LFLPREYNTIYTIK---------SELKDKTHQNYKKLITESTLLIHYTGATKPWHKWAIY 278 + ++N + + ++ ++ ++ ++HY KPW IY Sbjct: 187 FKILPQWNMMVFLYCRAVCLNERGKINMPYNRKDFNNALKNPKILHYHTHHKPWEDSKIY 246 Query: 279 ------PSVKYYKIALENSPWKDDSPRDAKSIIEFKKR------YKHLLVQHHYISGIIA 326 +Y+ +E +P + K + YK L + +I Sbjct: 247 LNYCNKFLGQYWWDMVEQTPIFKEKLLQLKPQADSALAFQCLVGYKLLRYYQKGLFILIP 306 Query: 327 GVCYLCRK 334 Y K Sbjct: 307 FYTYFLIK 314 >UniRef50_C6IB51 Glycosyltransferase n=4 Tax=Bacteroides RepID=C6IB51_9BACE Length = 417 Score = 159 bits (401), Expect = 2e-37, Method: Composition-based stats. 
Identities = 59/234 (25%), Positives = 96/234 (41%), Gaps = 14/234 (5%) Query: 57 NLDFYIIADVYNDGFFQKIAKLAEQNQL-RITLYRINTDKLQCLPCTQ-VWSRAMYFRLF 114 N+ YI+ D + + + ++ I I+++ + L + +R Sbjct: 2 NISIYILTDYISLESKEFLQEIKNVFTCVTIQWEIIDSESFKQLKKKGGYITEHTLYRYA 61 Query: 115 AFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPEL 174 L LD+ LYLDAD+V G I L L L G A V D+ + ++ + Sbjct: 62 IADLFP-NLDKALYLDADLVINGSIEPLWELDLEGYYCAGVDDIFIRRI-NYRKILELAE 119 Query: 175 LGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREY 234 Y N+GV+ L+LK K+ EK L N +Y DQD +N + KG +P Y Sbjct: 120 KDVYINAGVLLLNLKDLRKDKIQEKLLQHTSIYINRDRYQDQDAINCICKGKIKLIPNIY 179 Query: 235 NTIYTIKSELKDKTHQNYKKLITESTLLIHYTGATKPWHKWAIYP-SVKYYKIA 287 N + + + ++IHYTG+ KPWH+ + + Y Sbjct: 180 NFTTS---------ETLHTPEMLSDIIIIHYTGSIKPWHQEYTWQVLKELYCKY 224 >UniRef50_Q09140 UDP-glucose:glycoprotein glucosyltransferase n=1 Tax=Schizosaccharomyces pombe RepID=UGGG_SCHPO Length = 1448 Score = 158 bits (400), Expect = 2e-37, Method: Composition-based stats. Identities = 42/231 (18%), Positives = 85/231 (36%), Gaps = 12/231 (5%) Query: 24 TSECLNVAYGVDAN-YLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQN 82 +N+ + Y + + S++ + + F+ I + + F I +A++ Sbjct: 1153 KEASINIFSVASGHLYERFLYIMTKSVIEH-TDKKVKFWFIENFLSPSFKSSIPAIAKKY 1211 Query: 83 QLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQL 142 N + Y LF L L L +++Y+DAD + + D+ +L Sbjct: 1212 NFEYEYITYNWPHWLRKQEEKQREIWGYKILFLDVLFPLELHKVIYVDADQIVRADLQEL 1271 Query: 143 LHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLG------QYFNSGVVYLDLKKWADA-- 194 + + L+GA + +E R +Y S + +DL ++ Sbjct: 1272 MDMDLHGAPYGYTPMCDSREEMEGFRFWKKGYWKKFLRGLKYHISALYVVDLDRFRKMGA 1331 Query: 195 -KLTEKALSILMSKDNVYKYPDQDVMNVLLKGM-TLFLPREYNTIYTIKSE 243 L + +L + N DQD+ N L + LP+++ T S+ Sbjct: 1332 GDLLRRQYQLLSADPNSLSNLDQDLPNHLQHLIPIYSLPQDWLWCETWCSD 1382 >UniRef50_B9HMR5 Glycosyltransferase, CAZy family GT8 n=25 Tax=Magnoliophyta RepID=B9HMR5_POPTR Length = 383 Score = 158 bits (400), Expect = 2e-37, Method: Composition-based stats. 
Identities = 53/301 (17%), Positives = 115/301 (38%), Gaps = 28/301 (9%) Query: 11 KVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNR-HINLDFYIIADVYND 69 V + R + +++A +D+ YL G ++ S++ + ++ F+ +A ++ Sbjct: 60 PVSTTNGRSVSSCDPSLVHIAMTLDSEYLRGSIAAVHSVLKHASCPESIFFHFVAAEFDP 119 Query: 70 GFFQKIAKLAEQNQLRITL--YRINTDKLQCLPCTQVW----SRAMYFRLFAFQLLGLTL 123 + + +L + Y D + L + + + Y R + +L L + Sbjct: 120 ASPRVLTQLVRSTFPSLNFKVYIFREDTVINLISSSIRQALENPLNYARNYLGDMLDLCV 179 Query: 124 DRLLYLDADVVCKGDISQLLHLGLNG-AVAAVVKDVEPMQEKAVSRLSDPE--------- 173 DR++YLD+D+V DI +L + L+G V + + + + + Sbjct: 180 DRVIYLDSDIVVVDDIHKLWNTALSGSRVIGAPEYCHANFTQYFTSVFWSDQVMSGTFSS 239 Query: 174 --LLGQYFNSGVVYLDLKKWADAKLTEKALSILM--SKDNVYKYPDQDVMNVLLKGMTLF 229 YFN+GV+ +DL +W + + + K +Y+ ++ G Sbjct: 240 ARRKPCYFNTGVMVMDLVRWREGDYKRRIEKWMEIQKKTRIYELGSLPPFLLVFAGDVEA 299 Query: 230 LPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTGATKPWHK-WAIYPSV--KYYKI 286 + +N D + + L L+H++G KPW + A P ++ Sbjct: 300 IDHRWNQ----HGLGGDNVRGSCRSLHPGPVSLLHWSGKGKPWVRLDAKKPCKLDHLWEP 355 Query: 287 A 287 Sbjct: 356 Y 356 >UniRef50_Q50FU8 Cj81-079 (Fragment) n=6 Tax=Campylobacter jejuni RepID=Q50FU8_CAMJE Length = 333 Score = 158 bits (399), Expect = 3e-37, Method: Composition-based stats. 
Identities = 78/338 (23%), Positives = 135/338 (39%), Gaps = 51/338 (15%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNR------HINLDFYIIADVYNDGFFQKIAKLAEQ 81 N+ D NY+ V V I SI+ N + FYI+++ + K+ KL + Sbjct: 5 YNIVISCDNNYVKYVAVVIASIIKNTKINSQLKEYPYKFYILSNDISKNNILKLKKLIQH 64 Query: 82 -----NQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCK 136 + +++I+ K P + A Y+R ++ + LYLDADV+ Sbjct: 65 LSNSYYNCELIIHKIDDSKFHRFPKAWHVNHATYYRFEIADIVEGN--KCLYLDADVLVC 122 Query: 137 GDISQLLHLGLNGAVAAVVKD-VEPMQEKAVSRLSDPELLGQ-----YFNSGVVYLDLKK 190 GDI +L ++ LN VA VV D + K ++ + + YFN+GV+ +DL + Sbjct: 123 GDIRELFYMELNNKVAGVVTDSCSRLWTKLYTKDNKTSSYIEFDPLMYFNAGVILIDLNQ 182 Query: 191 WADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIK--------- 241 W + K + + DQ +N+ LK +T LP +N I Sbjct: 183 WKKHDIKNKCIDAFNIY-DHGGLADQSYLNIALKELTYKLPLNWNLIVPEYILLDGYERH 241 Query: 242 ---------SELKDKTHQNYKKLITESTLLIHYTGATKPW-------HKWAIYPSVKYYK 285 SE ++ + ++ ++H+ A KPW +K +++ Sbjct: 242 YVVNCLDEISEYNLAYTRSEFEEAMKNKKIVHFC-AAKPWWNLYYKNNKVDFNERNVWWE 300 Query: 286 IALENSPWKDDSPRDAKSIIEFKKRYKHLLVQHHYISG 323 IAL +K++ S+ KHL Q + I Sbjct: 301 IALNLEEFKEEFYFLKNSL-----DSKHLNRQLNTIEW 333 >UniRef50_Q582S2 UDP-glucose:glycoprotein glucosyltransferase, putative n=2 Tax=Trypanosoma brucei RepID=Q582S2_9TRYP Length = 1675 Score = 157 bits (398), Expect = 4e-37, Method: Composition-based stats. 
Identities = 38/262 (14%), Positives = 85/262 (32%), Gaps = 19/262 (7%) Query: 20 ANINTSECLNVAYGVDAN-YLDGVGVSITSIVLNNRH------INLDFYIIADVYNDGFF 72 + LN+ + Y + + + +++ + + F++I + + F Sbjct: 1356 SERPKFPTLNIFTVASGHLYERFLRIMMHTVMRTSSDVHGANTTRIKFWLIENFLSPQFK 1415 Query: 73 QKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDAD 132 + + LAE + + + Y LF L L +DR++++DAD Sbjct: 1416 ELVPLLAEHYGFDVGFVTYRWPWWLNKQTEKQRTIWAYKILFLDVLFPLNVDRVIFVDAD 1475 Query: 133 VVCKGDISQLLHLGLNGAVAAVVKDV--------EPMQEKAVSRLSDPELLGQYFNSGVV 184 + + D+ +L ++ + A A + Y S + Sbjct: 1476 QIVQADLHELYNMNIGAAAMAYTPFCREYPNDATTNFRFWDQGFWLSHLRGKPYHISALY 1535 Query: 185 YLDLKKWADAKLTEKA---LSILMSKDNVYKYPDQDVMNVLLKGM-TLFLPREYNTIYTI 240 +++++ A +K + L DQD+ N + M LP E+ T Sbjct: 1536 LVNVQRLRAALGGDKYRATYARLSEDPGSLANLDQDLPNFMQDEMPIFSLPEEWLWCETW 1595 Query: 241 KSELKDKTHQNYKKLITESTLL 262 + + T + Sbjct: 1596 CAGESKARAKTIDLCNNPLTKI 1617 >UniRef50_A4IXE1 Glycosyl transferase, family 8 n=16 Tax=Francisella RepID=A4IXE1_FRATW Length = 296 Score = 157 bits (398), Expect = 5e-37, Method: Composition-based stats. 
Identities = 60/307 (19%), Positives = 119/307 (38%), Gaps = 26/307 (8%) Query: 26 ECLNVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKIAKLAEQNQL 84 + + + D N + G V+I S++ + N D Y+ N + E+ + Sbjct: 2 NKIPIVFTFDKNIILGGAVTIKSLIDHANPDTCYDIYVYHPNINKKSISAFNSMIEKTKH 61 Query: 85 RITLYRINTDKLQCLPC-TQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLL 143 I+ + ++ + +P T+ ++RL +LL D+++Y D DV+ + D+S++ Sbjct: 62 SISFHNVDESIFKDVPIDTRRGWIITFYRLLIPKLLP-QYDKVIYSDVDVLFQSDMSEVY 120 Query: 144 HLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSI 203 + L A V + Q + + G + ++ K + + Sbjct: 121 NTDLTSYEWAGVIAEKHQQNMVQHKYFKENNNSYIYWPGFMVMNTKLMRENNFISRCFDT 180 Query: 204 LMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSE-----------LKDKTHQNY 252 + + K+ D DV+N+ + LP +Y T+ +I LK+ N Sbjct: 181 MHEFNTRLKFRDLDVLNLTCR-KIKSLPFKYVTLQSIYYLNTIQEAPEYIFLKEIYSDNE 239 Query: 253 KKLITESTLLIHYTG-ATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRY 311 + +IHY G KPW + YK LE + P++ + F+ Sbjct: 240 LLDAKNNPAIIHYAGSPGKPW------RMKRPYKNYLE---YISKIPKELRKYT-FRDIK 289 Query: 312 KHLLVQH 318 K LL ++ Sbjct: 290 KKLLSKY 296 >UniRef50_C5ZVZ7 Putative glycosyltransferase n=3 Tax=Campylobacterales RepID=C5ZVZ7_9HELI Length = 431 Score = 157 bits (397), Expect = 5e-37, Method: Composition-based stats. 
Identities = 70/369 (18%), Positives = 125/369 (33%), Gaps = 63/369 (17%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNN---------------------------------- 53 ++ + D NY+ V ITSI+ N Sbjct: 2 FHIFFSADKNYIPYTAVLITSIIKNTNPQKSFKDFCTTPSDSLPSLDYPRLQYDNLDKLD 61 Query: 54 RHINLDFYIIADVYNDGFFQKIA----KLAEQNQLRITLYRINTDKLQCLPCTQ--VWSR 107 + F+I++D K+ +L+ + ++ IN P + S Sbjct: 62 KSEGYVFHILSDSIPKDLQTKLQNFIQELSAFYPCTLQIHIINDIDFAHFPISGAAHSSH 121 Query: 108 AMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEK--- 164 Y+RL + + LYLD+D++ D+ +L L L +A ++ D K Sbjct: 122 LPYYRLKWQDYIKPAPQKCLYLDSDMLVLCDLRELFALDLKDNIAGIIGDCGSKNRKIKY 181 Query: 165 -AVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLL 223 + YFNSG + ++ K++ ++ EK L K K DQD++N + Sbjct: 182 QENNYKKTFYFDENYFNSGFLLINSKQYIKEQIWEKC-ENLAKKCTYIKAADQDLLNFTI 240 Query: 224 K-GMTLFLPREYNTI----YTIKSELKDKTHQNYKKLI----TESTLLIHYTGATKPWHK 274 L LP YN + + + K NY + ++ ++HY KPW Sbjct: 241 PINKRLKLPFAYNFQCITLLYVLCKDECKNRLNYTREAFNKSFKNPKILHY--GEKPWRY 298 Query: 275 WAIYPS------VKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHLLVQHHYISGIIAGV 328 Y + + +P D KS I K + +L + + Sbjct: 299 LQSYQDYKGNNINDIWWEYAQQTPIFGDKLLKQKSQISDYKLFA-ILGYYALLYTTNFLG 357 Query: 329 CYLCRKYYR 337 + K + Sbjct: 358 YFNLSKLLK 366 >UniRef50_C1QEC6 LPS:glycosyltransferase n=1 Tax=Brachyspira murdochii DSM 12563 RepID=C1QEC6_9SPIR Length = 242 Score = 157 bits (397), Expect = 6e-37, Method: Composition-based stats. 
Identities = 59/258 (22%), Positives = 117/258 (45%), Gaps = 22/258 (8%) Query: 24 TSECLNVAYGVDANYLDGVGVSITSIVLNNRH-INLDFYIIADVYNDGFFQKIAKLAEQN 82 E +N+ + + Y + +I SI+ N++ + F++I + +D I +L E Sbjct: 1 MQETMNICFTANDKYAPFMSATIVSILKNSKDDESFSFHVITNDISDENKMMIERLKEIK 60 Query: 83 QLRITLYRINTDKLQCLPCT----QVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGD 138 +I Y N DK + ++ +++FRL L+ + +D++LYLD D++ Sbjct: 61 TFKIKYYTPNIDKYNKWFEKINYQRHYAPSIFFRLDIPNLI-INIDKVLYLDCDIIVNSS 119 Query: 139 ISQLLHLGLNGAVAAVVKDVEP-MQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLT 197 +S+L ++ ++ A V+D K E +YFNSGV+ L+ K + + L Sbjct: 120 LSELFNIDISEYFALAVEDTGDLNFLKKYKTKIGIEDKHKYFNSGVLLLNNKLYMEKNLN 179 Query: 198 EKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLIT 257 ++ + NV + DQD++N L + F+ ++N ++ Sbjct: 180 LESENYFNKYYNVIECVDQDILNYLFRDKIKFIDNKWN---------------DFSSKNI 224 Query: 258 ESTLLIHYTGATKPWHKW 275 + + ++HY G K W+K Sbjct: 225 DKSAIMHYVGKIKSWNKN 242 >UniRef50_B9WDQ8 Killer toxin-resistance protein, putative n=5 Tax=Candida RepID=B9WDQ8_CANDC Length = 1453 Score = 157 bits (396), Expect = 7e-37, Method: Composition-based stats. 
Identities = 44/246 (17%), Positives = 92/246 (37%), Gaps = 12/246 (4%) Query: 10 DKVKAWDFRLANINTSECLNVA-YGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYN 68 K+ + + + +N+ Y + I S+ +N + F+I+ D + Sbjct: 1119 RVKKSDNKKAMPMRRHAEINIFTIAGGQLYEKLTSIMIASVRKHNHRSTIKFWILEDFVS 1178 Query: 69 DGFFQKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLY 128 F + ++ + + ++ Y LF L LD++++ Sbjct: 1179 PQFKHLMKLISIKYNVEYEFISYKWPNFLRRQKSKERIIWGYKILFLDVLFPQDLDKIIF 1238 Query: 129 LDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLG-------QYFNS 181 +DAD +C+ D+++L+++ L GA + +E R +Y S Sbjct: 1239 IDADQICRADLTELINMDLEGAPYGFTPMCDSREEMEGYRFWKEGYWSDVLKDDLKYHIS 1298 Query: 182 GVVYLDLKKWADAKLTEKA---LSILMSKDNVYKYPDQDVMNVLLKG-MTLFLPREYNTI 237 + +DL+K+ K ++ L S N DQD+ N + + LP+ + Sbjct: 1299 ALFVVDLQKFRSIKAGDRLRAHYQKLSSDPNSLSNLDQDLPNNMQRSIKIFSLPQSWLWC 1358 Query: 238 YTIKSE 243 T S+ Sbjct: 1359 ETWCSD 1364 >UniRef50_C7HS13 Family 8 glycosyl transferase n=2 Tax=Anaerococcus RepID=C7HS13_9FIRM Length = 276 Score = 157 bits (396), Expect = 8e-37, Method: Composition-based stats. 
Identities = 65/274 (23%), Positives = 119/274 (43%), Gaps = 13/274 (4%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKL---AEQNQL 84 +N+ D NYL+ + + S+ +N N + Y+I D ++I K A + Sbjct: 1 MNILVSCDENYLNPLKTMLYSLFESN-DTNFEIYLIHKDIRDEKIKEIEKFVIKASSKRA 59 Query: 85 RITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLH 144 ++ ++ + T ++ MY+RL A++ L LDR+LYLD DV+ +L + Sbjct: 60 KLNAIKVK-NLFSNAKITFYYTEEMYYRLLAYKYLPENLDRILYLDPDVLVLNSCEKLYN 118 Query: 145 LGLNGAVAAVVKDVEPM----QEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEK- 199 + L A P +S S + + YFNSG++ ++LK D++ EK Sbjct: 119 MDLGDNYFAAATHTIPTVQSANVARLSISSGHKDIENYFNSGILMINLKLSRDSQTYEKE 178 Query: 200 ALSILMSKDNV-YKYPDQDVMNVLLKGMTLFLPR-EYNTIYTIKSELKDKTHQNYKKLIT 257 L+ + + ++ PDQD++NV+ + + + +YN K K + I Sbjct: 179 VLNYVKNTKSLGLIMPDQDLLNVVFRNKIIKIDEIKYNYDARRYLTYKLKDKKYNLSYII 238 Query: 258 ESTLLIHYTGATKPWHKWAI-YPSVKYYKIALEN 290 +T +H+ G KPW + Y + Sbjct: 239 SNTCFLHFCGKRKPWLEENNLGVFTSLYLYFWKK 272 >UniRef50_C3XKY2 Glycosyl transferase n=2 Tax=Campylobacterales RepID=C3XKY2_9HELI Length = 433 Score = 156 bits (395), Expect = 8e-37, Method: Composition-based stats. 
Identities = 67/365 (18%), Positives = 119/365 (32%), Gaps = 67/365 (18%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNN---------------------------------- 53 ++ D NY+ V ITS++ N Sbjct: 2 FHIILSADENYIKYASVLITSVIYNTNPKLTFKDFCQKEGFKALKNSYFSAYQNIDFSKL 61 Query: 54 ----RHINLDFYIIADVYNDGFFQKIAKLAE----QNQLRITLYRINTDKLQCLPCTQ-- 103 F+I++D + ++ +L I + IN + + P + Sbjct: 62 SKQEAQEGYIFHILSDSISSTTQNQLTELQNTLNTIYPCEILTHIINDKEFENFPISGAA 121 Query: 104 VWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQE 163 + Y+RL L ++ + LYLD+D++C D+ +L + L V A + D + Sbjct: 122 HSNHLPYYRLKLDSYLDDSITKCLYLDSDMLCLCDLRELFAIDLKDFVVAAINDPGTKKR 181 Query: 164 KAVSRLSD----PELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVM 219 K + + YFNSG + ++ + + K+ EK L K K DQD++ Sbjct: 182 KIKYKENGKKMILNFNDNYFNSGFLLINTQNYKQHKIQEKC-ENLAKKCYYIKAADQDLL 240 Query: 220 NVLL-KGMTLFLPREYNTIYTIKSELKDKTHQNY--------KKLITESTLLIHYTGATK 270 N + K L LP YN K Q + ++ +IHY K Sbjct: 241 NATIPKEKLLKLPIAYNFSSISFCIAICKDEQKHRLNCTRAEFMESYKNPKIIHY--GEK 298 Query: 271 PWHKWAIYPS------VKYYKIALENSP-WKDDSPRDAKSIIEFKKRYKHLLVQHHYISG 323 PW Y + + + +P + SI E+ + Sbjct: 299 PWKFLQSYVNSKGENINDLWWHYAKITPSFSTQLLESKASIKEYLHFASLGFEVFKLSTK 358 Query: 324 IIAGV 328 + Sbjct: 359 LTGYF 363 >UniRef50_C4JK72 Putative uncharacterized protein n=1 Tax=Uncinocarpus reesii 1704 RepID=C4JK72_UNCRE Length = 696 Score = 156 bits (395), Expect = 1e-36, Method: Composition-based stats. 
Identities = 42/276 (15%), Positives = 91/276 (32%), Gaps = 45/276 (16%) Query: 25 SECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQL 84 E + + +YL G V S+ + + I + +++ + ++ Sbjct: 4 REAIYCTLLMSDSYLPGAMVLARSLRDHGTQAKIVALITPESLQAQTIEELKCVYDEVIP 63 Query: 85 RITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLH 144 + ++ L + + S + ++ ++ + +++Y+DADVV +LL Sbjct: 64 VSRVINVSPANLYLMDRPDLIS--TFTKIELWR--QVQYKQIVYIDADVVALRAPDELLT 119 Query: 145 LGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSIL 204 L + A D+ FNSGV+ L ++ S+L Sbjct: 120 LDTH---FAAAPDIGWP---------------DCFNSGVMVLRPS-------LQEYYSLL 154 Query: 205 --MSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLL 262 + + DQ ++N+ L YN + + + + L Sbjct: 155 AFAQRGISFDGADQGLLNMHFT-TWQRLSFAYNCTPSGHYQYIPA-----FRHFQSTISL 208 Query: 263 IHYTGATKPWHKWAI-----YPSVKY---YKIALEN 290 +HY G KPW+ P + + + Sbjct: 209 VHYIGQNKPWNLPRQTFPIEGPYNQLLARWWSVYDR 244 >UniRef50_UPI000023DC59 hypothetical protein FG01882.1 n=1 Tax=Gibberella zeae PH-1 RepID=UPI000023DC59 Length = 704 Score = 156 bits (394), Expect = 1e-36, Method: Composition-based stats. 
Identities = 44/280 (15%), Positives = 91/280 (32%), Gaps = 42/280 (15%) Query: 20 ANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLA 79 A E + + +YL G V S+ + L + D + ++ ++ Sbjct: 3 AQNAKGEQIYATLLLSDSYLPGALVLAHSLRDAGANHKLAVLVTLDSVSGDSITQLKEVY 62 Query: 80 EQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDI 139 + + + LQ + + S + ++ ++L +++Y+DADVV Sbjct: 63 DYIFPVPRIRNDHPANLQLMNRGDLHS--AFTKINLWRL--TDFSKIVYIDADVVAYRAP 118 Query: 140 SQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTE- 198 +L +L A D+ FN+GV+ LD + + Sbjct: 119 EELFNL---SQPFAAAPDIGWPDL---------------FNTGVMVLDP------NMGDF 154 Query: 199 KALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITE 258 A+ + + + DQ ++N+ L YN + + Sbjct: 155 YAMMAMAERGISFDGADQGLINMHFGQQYHRLSFTYNVTPSAHYQYVPAYRH-----FQS 209 Query: 259 STLLIHYTGATKPWHKWAIYPSVK--------YYKIALEN 290 S ++H+ GA KPW P+ + + Sbjct: 210 SINMVHFIGANKPWFTGRDAPAGSGPFTEMIGRWWAVYDR 249 >UniRef50_Q1CUZ8 Lipopolysaccharide 1,2-glycosyltransferase n=12 Tax=Helicobacter RepID=Q1CUZ8_HELPH Length = 372 Score = 155 bits (393), Expect = 2e-36, Method: Composition-based stats. 
Identities = 56/369 (15%), Positives = 123/369 (33%), Gaps = 65/369 (17%) Query: 27 CLNVAYGVDANYLDGVGVSITSIVLNNRHIN-----------LDFYIIADVYNDGFFQKI 75 + +A D +Y GVS+ S++ + + + + D + QK+ Sbjct: 4 IIPIAIAFDNHYAIPTGVSLYSMLACAKTEHPQSQNDSEKLFYKIHCLVDNLSLENQQKL 63 Query: 76 AKLAEQN--QLRITLYRINTDKLQ-------------------CLPCTQVWSRAMYFRLF 114 + + I+ + +S+ + RLF Sbjct: 64 KETLAPFSAFASVDFLDISEPDHSTIKIEPFVIDKIHEAFLQLNIYAKTRFSKMVMCRLF 123 Query: 115 AFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDV---------------- 158 L D+++ DAD + D+S+ + L+ KD Sbjct: 124 LASLFP-QYDKIIMFDADTLFLNDVSESFFIPLDSYYFGAAKDFASPKSLKHFQTERERE 182 Query: 159 ----EPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYP 214 + E + + ++N G + ++LK W L E+ L++ K P Sbjct: 183 PRQKFSLYEHYLKEKDMKIICENHYNVGFLIVNLKLWRADHLEERLLNLTHQKGQCVFCP 242 Query: 215 DQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTGATKPWHK 274 +QD++ + L LP YN + + Q + +++H+ KPW Sbjct: 243 EQDLLTLACYQKVLQLPYIYNAHPFMLN-------QKRFIPDKKEIVMLHFYFVGKPWIS 295 Query: 275 WAIYPSVKYYKIALENSPWKDDSPRDAKSIIE-----FKKRYKHLLVQHHYISGIIAGVC 329 S ++++ L+ + + S + K + E K++ L ++ V Sbjct: 296 PTALYSKEWHETLLKTPFYAEYSVKFLKQMTECLSLKDKQKTFEFLAPLLNKKTLLEYVF 355 Query: 330 YLCRKYYRK 338 + + +++ Sbjct: 356 FRLNRIFKR 364 >UniRef50_Q871S1 Glycogenin n=3 Tax=Sordariaceae RepID=Q871S1_NEUCR Length = 686 Score = 155 bits (392), Expect = 2e-36, Method: Composition-based stats. 
Identities = 41/273 (15%), Positives = 87/273 (31%), Gaps = 47/273 (17%) Query: 31 AYGV---DANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 Y + YL G V S+ + H L I + ++ +++ + I Sbjct: 9 VYASLLLNDAYLPGALVLAHSLRDSGTHKKLAILITPENISNEVVEQLQTV---YDYVIP 65 Query: 88 LYRINTDKLQCLPCTQVWS-RAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLG 146 + I D+ L + + ++ ++ +++Y+DADVV +L L Sbjct: 66 VETIQNDRPANLFLMNRPDLHSAFTKINLWK--QTQFRKIVYIDADVVAYRAPDELFDLP 123 Query: 147 LNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKA-LSILM 205 + D+ FN+GV+ L + + + + Sbjct: 124 ---HAFSAAPDIGWPDL---------------FNTGVMVLSP------NMGDYYAMLAMA 159 Query: 206 SKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHY 265 + + DQ ++N+ + L YN + + K S L+H+ Sbjct: 160 ERGISFDGADQGLLNMHFRNTYNRLSFTYNVTPSAHYQYIPA-----YKHFQSSINLLHF 214 Query: 266 TGATKPWHKWAI--------YPSVKYYKIALEN 290 G+ KPW + + + + Sbjct: 215 IGSEKPWVQGRTQTTGSSTYDEMIGRWWAVYDR 247 >UniRef50_B3RM47 Putative uncharacterized protein n=1 Tax=Trichoplax adhaerens RepID=B3RM47_TRIAD Length = 1504 Score = 155 bits (392), Expect = 2e-36, Method: Composition-based stats. 
Identities = 41/250 (16%), Positives = 94/250 (37%), Gaps = 22/250 (8%) Query: 21 NINTSECLNVAYGVDAN-YLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLA 79 +E +N+ + Y + + + S++ + ++ + F+ + + + F I +A Sbjct: 1223 ETEFNETINIFTVASGHLYERFLRIMMLSVLKHTKN-PVKFWFLKNFLSPNFKDSIPVMA 1281 Query: 80 EQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDI 139 + + + + Y LF L L + +++++DAD + + D+ Sbjct: 1282 KNYNFGYEYVQYKWPRWLRQQTEKQRVIWGYKILFLDVLFPLGIKKIIFVDADQIVRTDL 1341 Query: 140 SQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEK 199 +L+ L L GA A + +E S + +DLK++ ++ Sbjct: 1342 KELMDLDLEGAPYAYTPFCDSRKEMDGF-------------SALYVVDLKRFRLLAAGDR 1388 Query: 200 A---LSILMSKDNVYKYPDQDVMNVLLKGM-TLFLPREYNTIYTIKSELKDKTHQN---Y 252 L + N DQD+ N ++ + LP+++ T S+ T + Sbjct: 1389 LRGQYQGLSADPNSLANLDQDLPNNMIHQVPIKSLPQDWLWCETWCSDGSKATAKTIDMC 1448 Query: 253 KKLITESTLL 262 +T+ L Sbjct: 1449 NNPLTKEPKL 1458 >UniRef50_A5VK24 Glycosyl transferase, family 8 n=18 Tax=Lactobacillales RepID=A5VK24_LACRD Length = 282 Score = 155 bits (391), Expect = 2e-36, Method: Composition-based stats. 
Identities = 57/269 (21%), Positives = 108/269 (40%), Gaps = 8/269 (2%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 +N+ + ++ ++ + + SI LN + + Y++ + ++ +Q + Sbjct: 1 MNLLFSINDKFVTQLATVLLSIKLNTQAQEFNVYVLQKDKLKRT-DDLERVCKQLGMNYF 59 Query: 88 LYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGL 147 ++N P T + +Y+RL A +LL L ++LYLDADV+C D+S L L Sbjct: 60 PIKVNDQLFNKAPVTDRYPTTIYYRLLAHRLLPQDLHKILYLDADVLCINDLSSLYETSL 119 Query: 148 NGAVAAVVKDVEPMQEKAVSRLSDPELL--GQYFNSGVVYLDLKKWADAKLTEKALSILM 205 +G + A V + Y+NSGV+ ++L + + Sbjct: 120 DGYLYASAIHTNLTNTTEVINKIRLQNFDADGYYNSGVLLMNLDTIRKKVKDTDIFNYIR 179 Query: 206 SKDNVYKYPDQDVMNVLLKGMTLFLPRE-YNTIYTIKSELKDKTHQNY-KKLITESTLLI 263 + PDQDV+N L +P + YN + + + + +T+++ Sbjct: 180 TH--TLLLPDQDVLNALYGRYIKSVPDQLYNFDTRKGGIYETISFGEWTTDWVMRNTVIL 237 Query: 264 HYTGATKPWH-KWAIYPSVKYYKIALENS 291 HY G KPW YK + + Sbjct: 238 HYCGRDKPWLPTKNSGRYTALYKNYFQMT 266 >UniRef50_Q9LE59 Like glycosyl transferase 1 n=35 Tax=Embryophyta RepID=Q9LE59_ARATH Length = 673 Score = 155 bits (391), Expect = 3e-36, Method: Composition-based stats. 
Identities = 50/325 (15%), Positives = 107/325 (32%), Gaps = 54/325 (16%) Query: 9 IDKVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRH-INLDFYIIADVY 67 + K R N+ + A D + V + S ++N + F+++ D Sbjct: 346 LSPEKRKFPRSENLENPNLYHYALFSDN--VLAASVVVNSTIMNAKDPSKHVFHLVTDKL 403 Query: 68 NDGFFQKIAKLAEQNQLRITLYRINTDKLQC----------------------------- 98 N G L + I + ++ K Sbjct: 404 NFGAMNMWFLLNPPGKATIHVENVDEFKWLNSSYCPVLRQLESAAMREYYFKADHPTSGS 463 Query: 99 ----LPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAV 154 + S + R + ++ L+++L+LD D++ + D++ L + LNG V Sbjct: 464 SNLKYRNPKYLSMLNHLRFYLPEVYP-KLNKILFLDDDIIVQKDLTPLWEVNLNGKVNGA 522 Query: 155 VKDVEPMQEKAVSRLS--------DPELLGQYFNSGVVYLDLKKWADAKLTEKALSI--L 204 V+ + L+ + + G+ DLK+W +T + Sbjct: 523 VETCGESFHRFDKYLNFSNPHIARNFNPNACGWAYGMNMFDLKEWKKRDITGIYHKWQNM 582 Query: 205 MSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIH 264 ++K + G+T L + ++ + + + K E+ ++H Sbjct: 583 NENRTLWKLGTLPPGLITFYGLTHPLNKAWHVLGLGYN-------PSIDKKDIENAAVVH 635 Query: 265 YTGATKPWHKWAIYPSVKYYKIALE 289 Y G KPW + A+ Y+ ++ Sbjct: 636 YNGNMKPWLELAMSKYRPYWTKYIK 660 >UniRef50_A7H2M2 Glycosyl transferase family 8 n=3 Tax=Campylobacter jejuni RepID=A7H2M2_CAMJD Length = 381 Score = 154 bits (390), Expect = 4e-36, Method: Composition-based stats. 
Identities = 74/363 (20%), Positives = 132/363 (36%), Gaps = 58/363 (15%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNN-------------RHINLDFYIIADVYNDGFFQK 74 ++ + NY+ V +TSI+ F+I++D ++ + Sbjct: 2 FHIVLNANENYIKYAAVLMTSIIQKTDLNKSMSEFCNFDTDEGYVFHILSDHISESMKVR 61 Query: 75 I----AKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLD 130 I +L + +I L+ +N D+ + + + Y+R+ +L L LYLD Sbjct: 62 ISNLEKQLNDIYPCKIVLHILNDDEFKGM-LKWRGNYLAYYRIKMASVLPQNLKICLYLD 120 Query: 131 ADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLG-------QYFNSGV 183 D++C GD+ +LL + +N AAV D ++ S +YFNSG Sbjct: 121 CDMLCFGDLRELLSVDINNYQAAVCLDGNNHKKNKKVFFSLKGREKYKFSNIEKYFNSGF 180 Query: 184 VYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNT------- 236 + ++L +W + K++ L YPDQD +N L TL LP +N Sbjct: 181 ILVNLDRWRRDNIENKSIDFLKK--FKTLYPDQDALNFALND-TLLLPNRWNFSLGYFVA 237 Query: 237 -----IYTIKSELKDKTHQNY----KKLITESTLLIHYTG-ATKPW-----------HKW 275 + H NY + ++ + H+ KPW + Sbjct: 238 FLKNSQEILFLNQTKYPHLNYTKTEFENEVKNIKIAHFILDPFKPWDAFQYSIVNDDLQL 297 Query: 276 AIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHLLVQHHYISGIIAGVCYLCRKY 335 YP K+Y +N+P + + L+ + + Y R Sbjct: 298 IEYPFYKHYWSVAKNTP--EFYLDFLVQKESINEHKAENLINELGKAVVKEMRRYTSRAS 355 Query: 336 YRK 338 YR+ Sbjct: 356 YRR 358 >UniRef50_C5JPW4 Glycosyl transferase family 8 protein n=2 Tax=Ajellomyces RepID=C5JPW4_AJEDS Length = 723 Score = 154 bits (389), Expect = 4e-36, Method: Composition-based stats. 
Identities = 43/264 (16%), Positives = 80/264 (30%), Gaps = 41/264 (15%) Query: 35 DANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLYRINTD 94 +YL G V S+ L + D ++ + N Sbjct: 4 SDSYLPGAMVLAHSLRDTGSKAKLVVLVTLDSLKSSTIDELKTIYNDIIPITQFVNRNPA 63 Query: 95 KLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAV 154 L + + S + ++ ++ +++Y+DADVV ++LL L A Sbjct: 64 NLYLMDRPDLIS--TFSKIELWR--QTQYSKIVYIDADVVSLRAPNELLKLVSR---FAA 116 Query: 155 VKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYP 214 V D+ FN+G++ L L L + + Sbjct: 117 VPDIGWP---------------DCFNTGLMVLTPNMQDYYSLL-----ALAERGISFDGA 156 Query: 215 DQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTGATKPWHK 274 DQ ++N+ K L YN + + + + L+HY G KPW+ Sbjct: 157 DQGLLNMHFK-KWDRLSFAYNCTPSGHYQYIPA-----FRHFGSNISLVHYIGRRKPWNL 210 Query: 275 WAI-----YPSVKY---YKIALEN 290 P + + + Sbjct: 211 PRQAFPLESPYNQLLGRWWAMYDR 234 >UniRef50_Q6Z5D6 Glycosyltransferase family-like n=6 Tax=Poaceae RepID=Q6Z5D6_ORYSJ Length = 726 Score = 154 bits (389), Expect = 5e-36, Method: Composition-based stats. 
Identities = 50/314 (15%), Positives = 101/314 (32%), Gaps = 53/314 (16%) Query: 19 LANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRH-INLDFYIIADVYNDGFFQK--- 74 + + + A D + G V + S +++ + N F+I+ D N + Sbjct: 410 EEKLEDPKLQHYALFSDN--VLGAAVVVNSTIIHAKTPENHVFHIVTDKLNYAAMRMWFL 467 Query: 75 ---------------------------IAKLAEQNQLRITL--YRINTDKLQCLPCTQVW 105 + +L Q + + D + Sbjct: 468 ENSQGKAAIEVQNIEDFTWLNSSYSPVLKQLESQFMINYYFKTQQDKRDNNPKFQNPKYL 527 Query: 106 SRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKA 165 S + R + ++ L+++L+LD D+V + D+S L + L G V ++ + Sbjct: 528 SILNHLRFYLPEIFP-KLNKVLFLDDDIVVQQDLSALWSIDLKGKVNGAIQTCGETFHRF 586 Query: 166 VSRLS--------DPELLGQYFNSGVVYLDLKKWADAKLTE--KALSILMSKDNVYKYPD 215 L+ + E + G+ DL +W +T+ ++K Sbjct: 587 DRYLNFSNPLIAKNFERRACGWAYGMNMFDLSEWRKRNITDVYHYWQEQNEHRLLWKLGT 646 Query: 216 QDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTGATKPWHKW 275 V T L +++ + N + E +IHY G KPW + Sbjct: 647 LPAGLVTFWNQTFPLDHKWHLLGLGY-------KPNVNQKDIEGAAVIHYNGNRKPWLEI 699 Query: 276 AIYPSVKYYKIALE 289 A+ KY+ + Sbjct: 700 AMAKYRKYWSKYVN 713 >UniRef50_A8LP95 Lipopolysaccharide 1 n=1 Tax=Dinoroseobacter shibae DFL 12 RepID=A8LP95_DINSH Length = 342 Score = 154 bits (389), Expect = 5e-36, Method: Composition-based stats. 
Identities = 61/339 (17%), Positives = 113/339 (33%), Gaps = 31/339 (9%) Query: 5 PAIEIDKVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIA 64 P ++ + ++ V + D YL + I + D I Sbjct: 13 PLRGMNAIPGPRVTASSQCARHEHAVCFCSDEGYLPFALFAALQIHRLHPDRCFDLVIAH 72 Query: 65 DV---YNDGFFQKIAKLAEQNQLRITLYRINTD-KLQCLPCTQVWSRAMYFRLFAFQLLG 120 GF + I I+T + L + + Y RL LG Sbjct: 73 TGPLSVPHGF----------PGIGIRYVEIDTGGCFERLALDARRTGSTYLRLALSGALG 122 Query: 121 LTLDRLLYLDADVVCKGD-ISQLLHLGLNGAVAAVVKDVEPMQ---EKAVSRLSDPELLG 176 R+LY+D+DV D + LL + G A V+D + K ++ Sbjct: 123 HDYQRILYMDSDVFALRDGLHVLLFTDMRGKPLAAVRDNSQWRTSGRKPDDLVTLNLPAR 182 Query: 177 QYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNT 236 YFN+GV+ +D + + + KAL + S+ DQ ++N + G + +N Sbjct: 183 PYFNAGVLLMDTARLNEQDILAKALDLGTSQAGRLARHDQTLLNAVTSGNWAEMSPRWNW 242 Query: 237 IYTIKSELKDKTHQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIA----LENSP 292 +T ++ ++E ++H+ G KPW + + E P Sbjct: 243 QFTW---------ASWIFALSEDARILHFIGPNKPWADTSGRFPKSITRAYGDFLAEQFP 293 Query: 293 WKDDSPRDAKSIIEFKKRYKHLLVQHHYISGIIAGVCYL 331 + I + ++ K L+ + A + Sbjct: 294 ERTVERAANSPINDPRRLIKSLIKHGLSRKKMSAYLARF 332 >UniRef50_Q0U987 Putative uncharacterized protein n=1 Tax=Phaeosphaeria nodorum RepID=Q0U987_PHANO Length = 583 Score = 154 bits (388), Expect = 6e-36, Method: Composition-based stats. 
Identities = 39/271 (14%), Positives = 76/271 (28%), Gaps = 44/271 (16%) Query: 31 AYGV---DANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 Y +YL G V S+ L I + + ++ +L + Sbjct: 8 VYCTLLMSDSYLPGAAVLAHSLRDAGTKKKLAVLITLETLSADTITQLKELYDYLIPVER 67 Query: 88 LYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGL 147 + + L + + +++YLDADVV + +L + Sbjct: 68 IRTPSPANLYLMGRPD----LSFAFTKIALWRQTQFRKIVYLDADVVALRALDELFDI-- 121 Query: 148 NGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKA-LSILMS 206 A A D+ FNSGV+ + + E L + + Sbjct: 122 -EAPFAAAPDIGWP---------------DAFNSGVMVISPD------MGEYWALQTMAA 159 Query: 207 KDNVYKYPDQDVMNVLLKGM-TLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHY 265 + + DQ ++N + L YN + + + +H+ Sbjct: 160 TGDSFDGADQGLLNQYFEHRPWQRLKFTYNCTPNAEYQWEPAYRY-----YKRDISAVHF 214 Query: 266 TGATKPWHKWAI------YPSVKYYKIALEN 290 G KPW + + + Sbjct: 215 IGKEKPWSSSRTSGPGVYGELLSRWWQVHDR 245 >UniRef50_C0S309 Glycogenin n=5 Tax=Onygenales RepID=C0S309_PARBP Length = 785 Score = 154 bits (388), Expect = 7e-36, Method: Composition-based stats. 
Identities = 51/301 (16%), Positives = 93/301 (30%), Gaps = 48/301 (15%) Query: 31 AYGV---DANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRIT 87 Y NYL G V S+ N L + D ++ + + Sbjct: 8 VYCTMLLSDNYLPGAMVLAHSLRDNGCKAKLVVLVTLDSLKASTIDELKTIYDDVVPINR 67 Query: 88 LYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGL 147 + L + + S + ++ ++ +L+Y+DADVV +LL + Sbjct: 68 IVNHCPANLYLMDRPDLAS--TFSKIELWR--QTQYRQLVYIDADVVSLRAPDELLTINT 123 Query: 148 NGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSK 207 N A V D FN+G++ L +L L + Sbjct: 124 N---FAAVPDTGWP---------------DCFNTGLMVL-----RPNMHDYYSLLALAQQ 160 Query: 208 DNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTG 267 + DQ ++N+ K L YN + + + + L+HY G Sbjct: 161 GVSFDGADQGLLNIHFK-KWDRLSFVYNCTPSGHYQYIPA-----FRHFGSTISLVHYIG 214 Query: 268 ATKPWH-----KWAIYPSVK----YYKIALENSPWKDDSPRDAKSIIEFKKRYKHLLVQH 318 + KPW+ + P + ++ + + + KH L Q Sbjct: 215 SQKPWNLPRQLFPSGSPYNQLLGRWWATYYRH---YRPVVKPDTKLSSRADIAKHGLGQL 271 Query: 319 H 319 H Sbjct: 272 H 272 >UniRef50_B2VRF2 Glycogenin-2 n=1 Tax=Pyrenophora tritici-repentis Pt-1C-BFP RepID=B2VRF2_PYRTR Length = 622 Score = 153 bits (387), Expect = 8e-36, Method: Composition-based stats. 
Identities = 43/290 (14%), Positives = 86/290 (29%), Gaps = 41/290 (14%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLY 89 + + +YL G V S+ L + D + ++ L + + Sbjct: 10 ITLLMSDSYLPGAVVLANSLRDAGTKKKLAVLVTMDTLSADTIGELKTLYDYLIPVQRIR 69 Query: 90 RINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNG 149 NT L + + +L+YLDADVV + +L + Sbjct: 70 SSNTANLYLMGRPD----LAFAFTKIALWRQTQFRKLVYLDADVVALRALDELFDI---E 122 Query: 150 AVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKA-LSILMSKD 208 A A D+ FNSGV+ + + E L + + Sbjct: 123 ASFAAAPDIGWP---------------DAFNSGVMVI------KPDMGEYWALQTMAAAG 161 Query: 209 NVYKYPDQDVMNVLLKGM-TLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTG 267 + + DQ ++N + L YN + + + +H+ G Sbjct: 162 DSFDGADQGLLNQYFEHRPWQRLKFTYNCTPNAEYQWEPAYRH-----YKRDIAAVHFIG 216 Query: 268 ATKPW---HKWAIYPSVKY---YKIALENSPWKDDSPRDAKSIIEFKKRY 311 KPW H + + + ++ + ++A + + Sbjct: 217 KNKPWSSQHSGGTGVYGELLARWWAVHQRHLHREKAAKEAGEAHSTQSDF 266 >UniRef50_UPI00016E26D6 UPI00016E26D6 related cluster n=3 Tax=Takifugu rubripes RepID=UPI00016E26D6 Length = 421 Score = 153 bits (387), Expect = 8e-36, Method: Composition-based stats. 
Identities = 53/338 (15%), Positives = 99/338 (29%), Gaps = 58/338 (17%) Query: 5 PAIEIDKVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIA 64 P E+ A L+ E V +Y G V S+ + + ++ Sbjct: 16 PREELTSTVANSQFLSVGEAGEAF-VTLVTSDSYCMGAVVVARSLRRHGTTRGVVV-MVT 73 Query: 65 DVYNDGF--FQKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLT 122 ++ + + ++ + + + L L ++ + ++ + L Sbjct: 74 PNVSEQSSTRGALHSVFDEVIMVDRIESGDRLHLSSLGRPELGI--TFTKIHCWTL--TQ 129 Query: 123 LDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSG 182 + ++LDAD + ++ +L +V D FNSG Sbjct: 130 YSKCVFLDADTLVLDNVDELFQRD----ELSVAPDPGWP---------------DCFNSG 170 Query: 183 VVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLL-----KGMTLFLPREYNTI 237 V A L AL + DQ ++N +T LP YN Sbjct: 171 VFVFQPSLQTHASLRAHALQ-----HGSFDGGDQGLLNSFFSSWPVADITKHLPFVYNLS 225 Query: 238 YTIKSELKDKTHQNYKKLITESTLLIHYTGATKPW---------HKWAIYPSVKYYKI-A 287 + + S + H+TGA KPW + V + Sbjct: 226 SSCVYSYLPA-----FQQFGHSAKIFHFTGAVKPWSSSSFKKEGQPPCMDHFVSLWWKEY 280 Query: 288 LENS----PWKDD--SPRDAKSIIEFKKRYKHLLVQHH 319 L ++ P KD + K + +K L Sbjct: 281 LSHTTSPPPEKDFHQNVEPPKQVATSQKATSFCLSNKK 318 >UniRef50_D1N145 Glycosyl transferase family 8 n=1 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1N145_9BACT Length = 311 Score = 153 bits (387), Expect = 9e-36, Method: Composition-based stats. 
Identities = 62/271 (22%), Positives = 114/271 (42%), Gaps = 9/271 (3%) Query: 24 TSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQ 83 +E + VA D NYLD V+ S++ + + +++ + ++ F + L + Sbjct: 1 MAEDIQVAMATDRNYLDYALVAAASLLAQHPGGGITLHLLHEELDESDFARFEALRRIDG 60 Query: 84 LRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLL 143 R+ +I Q P WS + Y+RL LL L+++LYLD D++ DI++L Sbjct: 61 FRLVPRKIERGFFQGWPEL-RWSTSAYYRLILPSLLP-DLEKILYLDCDLLVLDDIAELW 118 Query: 144 HLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSI 203 + L A + YFNSGV+ +L+K A ++ + + Sbjct: 119 NTELGSRSCAAA---AVRVAPEHQKKIGLPAEAVYFNSGVMLFNLRKMAHENHEKRFIRL 175 Query: 204 LMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKK--LITESTL 261 KYPDQD++N+ + L + +N + ++ + + Sbjct: 176 FDELGGRIKYPDQDILNLAYWNDYVKLSQRWNLVTSVYRNPPTPALYSEAEVVEALRRPG 235 Query: 262 LIHYTGATKPWH--KWAIYPSVKYYKIALEN 290 + H+TG KPW K +P +Y++ E Sbjct: 236 IAHFTGTHKPWRLGKTTHHPYARYFRAYAEL 266 >UniRef50_B7AUG6 Putative uncharacterized protein n=1 Tax=Bacteroides pectinophilus ATCC 43243 RepID=B7AUG6_9BACE Length = 301 Score = 152 bits (385), Expect = 1e-35, Method: Composition-based stats. 
Identities = 55/278 (19%), Positives = 108/278 (38%), Gaps = 23/278 (8%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYI-IADVYNDGFFQKIAKLAEQNQL-- 84 +N+ ++ ++ V +TS++ NN N+ ++ D + I +L Sbjct: 1 MNILVAMNDAFVKCYQVMLTSLIKNNPDENITVHVPYTDGLSRKGLDSIKELVRNQSHGS 60 Query: 85 -RITLYRINTDKLQCLPCT--QVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQ 141 + Y D+L L +WS M+FR+FA + + + DR+L+LD D++ G I Sbjct: 61 ASVREYYFGKDRLGSLDKLPLGMWSVEMFFRIFAQEFIPESEDRILWLDGDIIVNGSIKD 120 Query: 142 LLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQ--YFNSGVVYLDLKKWADAKLT-E 198 + + A +D+ K + + Y NSGV+ ++LK + +T + Sbjct: 121 FYNTDFDSMYYAACEDIAISHGKIKEEYDNLGWSSEEIYVNSGVLLINLKALRNNGITRD 180 Query: 199 KALSILMSKDNVYKYPDQDVMNVLLKGMTLFLP-REYNTIYTIKSELKDKTHQNYKKLIT 257 A+ + + YPDQ ++N + F YN + S +I Sbjct: 181 AAVEYALENMDKLHYPDQYMLNAMFHDKIKFADAFRYNCQVSGYSY-------KLADMIL 233 Query: 258 ESTLLIHYTGATKPWHKWAIYPSV-----KYYKIALEN 290 + ++H+ G +PW + + Sbjct: 234 SESAILHFPGY-RPWQTDYQKHYSSAIPGDIWWHYAKL 270 >UniRef50_Q38VG7 Putative glycosyl transferase, family 8 n=1 Tax=Lactobacillus sakei subsp. sakei 23K RepID=Q38VG7_LACSS Length = 304 Score = 152 bits (385), Expect = 1e-35, Method: Composition-based stats. 
Identities = 46/258 (17%), Positives = 114/258 (44%), Gaps = 18/258 (6%) Query: 42 VGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLYRINTDKLQCLPC 101 + +SI +++ + ++ +II ++ + + I L N + ++ ++ ++ Sbjct: 1 MSISIATLLKKHMEDEINIFIITSNISEKYIKVIEGLF--NNPKHNIFWVSMPEIDIPLE 58 Query: 102 TQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPM 161 T S A Y RLF +L+ + RL+YLD D + + ++ +L L + +D Sbjct: 59 TDRGSLAQYGRLFFDRLIPENIQRLIYLDCDTLIEENLRELWVTDLGENTIGIARDAFSD 118 Query: 162 QEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNV 221 + K +L E + FNSGV+ +D W + ++ ++ + +L K DQ V+++ Sbjct: 119 RYK---KLLGLEKDSELFNSGVMIIDRGSWNEKRIEDRIIDLLTEKRGRISQGDQGVIDI 175 Query: 222 LLKGMTLFLPREYNTIYTI----------KSELKDKTHQNYKKLITESTLLIHYTGA--- 268 + + L ++N++ + ++K+ + + ++H+T + Sbjct: 176 IFQNDAKILDPKWNSMSSYFDFTYDDFLKYRQVKEFYSKQLILEAIQKPAIVHFTSSFLN 235 Query: 269 TKPWHKWAIYPSVKYYKI 286 +PW + + +++ Sbjct: 236 NRPWIFGSTHRYKNHWRR 253 >UniRef50_D0IR33 LPS 1,2-glycosyltransferase n=3 Tax=Helicobacter pylori RepID=D0IR33_HELP1 Length = 387 Score = 152 bits (384), Expect = 2e-35, Method: Composition-based stats. 
Identities = 64/384 (16%), Positives = 132/384 (34%), Gaps = 80/384 (20%) Query: 27 CLNVAYGVDANYLDGVGVSITSIVL--------------------------NNRHINLDF 60 + + D +Y GVS+ S++ +N+ + Sbjct: 4 IIPIVITFDNHYAIPAGVSLYSMLACTKLENPQSQNPQSQNPQSQNPQSQNDNKKLFYKI 63 Query: 61 YIIADVYNDGFFQKIAKLAEQN--QLRITLYRINTDKLQCLPCTQ--------------- 103 + + D + K+ + + + I+T L P Sbjct: 64 HCLVDNLSLENQCKLKETLAPFSAFMSVDFLDISTPNLYTTPIEPSVIDKINEAFLQLNI 123 Query: 104 ----VWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVE 159 +S+ + RLF L L D+++ DAD + D+S+ + L+ V KD Sbjct: 124 YAKTRFSKMVMCRLFLASLF-LQYDKIIMFDADTLFLNDVSESFFIPLDDYYFGVAKDFS 182 Query: 160 PMQ--------------------EKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEK 199 + E + L ++N G + ++LK W +L E+ Sbjct: 183 SPKSSKHFQTERERAPRQAFSLYEHYLKEKDIKILYENHYNVGFLVVNLKLWRADRLEER 242 Query: 200 ALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITES 259 L++ K P+QD++ + L LP YNT + +Q + Sbjct: 243 LLNLTHQKGQCVFCPEQDLLTLACYQKVLILPYIYNTHPFM-------VNQKRFIPNRQE 295 Query: 260 TLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEF-----KKRYKHL 314 +++H+ KPW S ++++ L+ S + + S + K + EF K++ Sbjct: 296 IVMLHFYFVGKPWVSPTALYSKEWHETLLKTSFYAEYSVKFLKQMTEFLSLKDKQKTFEF 355 Query: 315 LVQHHYISGIIAGVCYLCRKYYRK 338 L ++ V + + +++ Sbjct: 356 LAPLLNPKILLEYVFFRLNRIFKR 379 >UniRef50_Q39T65 Glycosyl transferase, family 8 n=1 Tax=Geobacter metallireducens GS-15 RepID=Q39T65_GEOMG Length = 317 Score = 152 bits (383), Expect = 2e-35, Method: Composition-based stats. 
Identities = 53/310 (17%), Positives = 116/310 (37%), Gaps = 39/310 (12%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLN-NRHINLDFYIIADVYNDGFFQKIAKLAEQN-QLR 85 + V + D NY+ V+ S++ N N F ++ + ++ +A++ Sbjct: 10 IPVFFAFDNNYVIPAAVAFHSLLANVNVSYKYHFIVLHEDISEENRDLLAQVVSLFSNAS 69 Query: 86 ITLYRINT---DKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQL 142 + + ++ + + +++ ++L L D++++ D DVV K DIS + Sbjct: 70 VEFRDMGESFKNEWENIKGKGHYTKECLYKL-VPMLEFPQYDKIIWSDVDVVFKDDISDV 128 Query: 143 LHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLG----QYFNSGVVYLDLKKWADAKLTE 198 + A V+ + +K ++ P + +G++ +LKK + + + Sbjct: 129 FFMLSEENYIAGVRVCGKL-DKYYENMNMPAEIKSILKNGIGAGILVYNLKKMREDNIYD 187 Query: 199 KALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDK----------- 247 + L ++ P+QD++N++LK ++P Y + + KD+ Sbjct: 188 DIMIALQGMSSIVVQPEQDILNIVLKDKIDYIPLRYCFCTYMYNLFKDRHKMKLKVKGNL 247 Query: 248 -----------------THQNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALEN 290 + ES +IHY +TKPW+ + L+ Sbjct: 248 FNYLFKGYRKNLGFDTIYSEKELLEAFESPAIIHYATSTKPWNTLFTKRKSDWLYCLLKT 307 Query: 291 SPWKDDSPRD 300 WK R Sbjct: 308 PFWKRYIFRY 317 >UniRef50_B2B5U2 Predicted CDS Pa_2_5770 n=1 Tax=Podospora anserina RepID=B2B5U2_PODAN Length = 576 Score = 151 bits (382), Expect = 3e-35, Method: Composition-based stats. 
Identities = 35/269 (13%), Positives = 77/269 (28%), Gaps = 40/269 (14%) Query: 36 ANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLYRINTDK 95 YL G V S+ L + D + + + + + Sbjct: 18 DTYLPGALVLAHSLRDAGTTKKLAILVTPDTVSTEVIATLKTVYDYVIYVDRIRNGKPAN 77 Query: 96 LQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVV 155 L + + S + ++ ++ +++Y+DADVV + +L L + Sbjct: 78 LFLMNRPDLHS--AFTKINLWK--QTQFRKIVYIDADVVAYRAVDELFDLP---HAFSAA 130 Query: 156 KDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPD 215 D+ ++ +G Y+ A+ + + + D Sbjct: 131 PDIGWPDLFNTGVMALTPNMGDYY--------------------AMMAMAERGISFDGAD 170 Query: 216 QDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTGATKPWHKW 275 Q ++N+ L YN + + S ++H+ GA KPW + Sbjct: 171 QGLLNMHFGNTYNRLSFTYNVTPSAHYQYVPAYRH-----FQGSINMVHFIGADKPWRQG 225 Query: 276 A--------IYPSVKYYKIALENSPWKDD 296 + + K+ Sbjct: 226 RESTTDAGPFDEMTGRWWAVYDRHYHKEA 254 >UniRef50_A5DLS6 Putative uncharacterized protein n=2 Tax=Pichia guilliermondii RepID=A5DLS6_PICGU Length = 390 Score = 151 bits (382), Expect = 3e-35, Method: Composition-based stats. 
Identities = 46/278 (16%), Positives = 84/278 (30%), Gaps = 45/278 (16%) Query: 31 AYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLYR 90 + +YL G ++ + + +D Q + ++ Sbjct: 6 TLLTNESYLPGALTLAHTLRSLGTQYPVVVLLDETQVSDRSLQLLEAAYDR-------II 58 Query: 91 INTDKLQCLPCTQVWSRA----MYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLH-- 144 +D+L P R + +L + + D++LYLD DV+ ++ L Sbjct: 59 PISDRLVTSPVDDRLGRPELAVTFSKLLLWN---ESYDQILYLDTDVLPLANVDHLFDEG 115 Query: 145 LGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSIL 204 L A D FNSGV+ ++ + Sbjct: 116 AALTPRQIAASPDSGWP---------------DIFNSGVLLFKPD----PQVYSDLVEFA 156 Query: 205 MSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIH 264 D+ + DQ ++N G LP YN T + H+ + ++H Sbjct: 157 SGSDSSFDGADQGLLNEFFAGNWHRLPFLYNVTPTESYQYVPAFHR-----FFKDIKILH 211 Query: 265 YTGATKPWHKWAI---YPSVKYYKIALENSPWKDDSPR 299 Y G KPWH + + S + D + Sbjct: 212 YIGQIKPWHSSTNIDHFRFHHLWWD--RFSEFFDKETK 247 >UniRef50_A2Q5F4 Glycosyl transferase, family 8 n=1 Tax=Medicago truncatula RepID=A2Q5F4_MEDTR Length = 680 Score = 151 bits (381), Expect = 4e-35, Method: Composition-based stats. 
Identities = 48/324 (14%), Positives = 106/324 (32%), Gaps = 53/324 (16%) Query: 9 IDKVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRH-INLDFYIIADVY 67 ++ + + + + A D + V + S VLN + F+I+ D Sbjct: 354 LNSSQQQFPNQEKLEDPQLYHYAIFSDN--ILATAVVVNSTVLNAKDASKHVFHIVTDRL 411 Query: 68 NDGFFQK------------------------------IAKLAEQNQLRITL--YRINTDK 95 N + + +LA + ++ +D Sbjct: 412 NYAAMRMWFLVNSPGKATIQVQNIEDFTWLNASYSPVLKQLASPAMIDYYFKAHKATSDS 471 Query: 96 LQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVV 155 + S + R + ++ L+++L+LD D+V + D++ L + L G V V Sbjct: 472 NLKFRNPKYLSILNHLRFYLPEVFP-KLNKVLFLDDDIVVQKDLTGLWSIDLKGNVNGAV 530 Query: 156 KDVEPMQEKAVSRLS--------DPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSK 207 + + L+ + + + G+ DL +W K+TE + Sbjct: 531 ETCGESFHRFDRYLNFSNPLIAKNFDPHACGWAYGMNVFDLVQWKRQKITEVYHNWQNLN 590 Query: 208 DNV--YKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHY 265 + +K + T L + ++ + + N + + ++HY Sbjct: 591 HDRQLWKLGTLPPGLITFWKRTFPLNKAWHVLGLGYN-------PNVNQKDIDRAAVMHY 643 Query: 266 TGATKPWHKWAIYPSVKYYKIALE 289 G KPW + +I Y+ + Sbjct: 644 NGNMKPWLEISIPKFRGYWTKYVN 667 >UniRef50_A7EPR4 Putative uncharacterized protein n=1 Tax=Sclerotinia sclerotiorum 1980 UF-70 RepID=A7EPR4_SCLS1 Length = 643 Score = 151 bits (381), Expect = 4e-35, Method: Composition-based stats. 
Identities = 40/281 (14%), Positives = 83/281 (29%), Gaps = 45/281 (16%) Query: 36 ANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLYRINTDK 95 YL G V S+ + + D ++ + I + R+ + Sbjct: 16 DTYLPGALVLAHSLRDAGTTKKIAVLVTTDSVTFESMAELQR---NFDFVIPVDRVVNES 72 Query: 96 LQCLPCTQVWS-RAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAV 154 L + + ++ ++ R++Y+DAD+V +L L + Sbjct: 73 PANLDLMGRPDLHSTFTKITLWK--QTQFRRIVYMDADMVALRAPDELFALP---DPFSA 127 Query: 155 VKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKA-LSILMSKDNVYKY 213 D+ FN+G++ LD + + L + + + Sbjct: 128 APDIGWP---------------DIFNTGLMVLDP------NMGDYYALEAMARRGISFDG 166 Query: 214 PDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTGATKPWH 273 DQ ++N+ K L YN + + + S H+ G KPW Sbjct: 167 ADQGLLNMHFKNTFNRLSFTYNVTPSAHYQYLPA-----FQHFQSSISAAHFIGTDKPWK 221 Query: 274 KWAI--------YPSVKYYKIALENSPWKDDSPRDAKSIIE 306 + + + +K R ++ E Sbjct: 222 VGRQASIGATPYHQMTGRWWAVYDKH-YKQTVSRVPSTMEE 261 >UniRef50_A9SH80 Predicted protein n=2 Tax=Physcomitrella patens subsp. patens RepID=A9SH80_PHYPA Length = 527 Score = 150 bits (380), Expect = 5e-35, Method: Composition-based stats. 
Identities = 65/339 (19%), Positives = 123/339 (36%), Gaps = 60/339 (17%) Query: 10 DKVKAWDFRLANINT--SECLNVAYGVDANYLDGVGVSITSIVLNNRH-INLDFYIIADV 66 + K ++ LA + + +++ D + V + S + N H L F+++ Sbjct: 175 NDEKHDEYTLAFLKKASEQVVHIFVSTDGADFRPLAVLVNSTISNAVHPERLHFHLVLPA 234 Query: 67 YNDGFFQKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRA------MY-FRLFAFQLL 119 + + +A + ++ I I+ ++ + S+A +Y F F L Sbjct: 235 SHHSRAKHLAAFFQDTKIDIVSENIDFKDMEKHITFRKNSKARPELQSVYNFAPFLLPLH 294 Query: 120 GLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQE---------KAVSRLS 170 + R +YLDAD+V KG+I +L+ + L AA V+D E K +R Sbjct: 295 FKDVGRFIYLDADIVVKGNIEELIQIDLGNRAAAAVEDCSQTFETYFDFNELAKIQARPE 354 Query: 171 DP--------ELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDN----VYKYP-DQD 217 P + FN GV+ +D +W ++TE L + + +YKY Q Sbjct: 355 KPTWVPTEPIKPDACVFNRGVLVIDTNQWIKQQVTEAILWWMDEFQSAESVLYKYGLSQP 414 Query: 218 VMNVLLKGMTLFLPREYNTIYTIKSEL-------------KDKTHQNYKKLITESTLLIH 264 + L G + L +N ++E + + L ++ ++H Sbjct: 415 PFLLALYGKYMKLDTPWNVRGLGRNEFSEREREFLESKYGHKPERKPFISLDADTAKILH 474 Query: 265 YTGATKPWHKWAIY---------------PSVKYYKIAL 288 + G KPW + K + L Sbjct: 475 FNGKFKPWKQTRPVGPSSNVVSRCGSKGIECAKLWWEYL 513 >UniRef50_A0ZYL4 Putative uncharacterized protein n=1 Tax=Archaeal BJ1 virus RepID=A0ZYL4_9CAUD Length = 286 Score = 150 bits (380), Expect = 5e-35, Method: Composition-based stats. 
Identities = 72/297 (24%), Positives = 130/297 (43%), Gaps = 22/297 (7%) Query: 27 CLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIAD-VYNDGFFQKIAKLAEQN-QL 84 LNV Y + +S S++ NN+ +++ YI+++ N+ FF+ + L E + L Sbjct: 2 TLNVCYIAGGDSWVPCYISAYSVLENNQDLDIHMYILSEEDNNNPFFEHVEYLYESHPSL 61 Query: 85 RITLYRINTDKLQCLPCT-QVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLL 143 I ++ D+ LP + S +YF++ +LL +L LDAD +C G +S LL Sbjct: 62 EIEFIEVDMDQFDDLPAPGKHLSPGVYFKIAINRLLPTD-GNVLLLDADTICDGSLSSLL 120 Query: 144 HLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSI 203 L L+G V A KA + + FN+GV+Y++L++WA + E++ Sbjct: 121 SLDLSGKVLAAAP-----SNKAETVRLGLQNNRAKFNAGVLYVNLQEWAKQDIEERSRQY 175 Query: 204 LMSKDNVYKYPDQDVMNVLLK--GMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTL 261 + + DQD +N L+ ++ YN + E +++ + Sbjct: 176 IEEHEPEL--NDQDALNALVNNPDDMEYIHPRYNATKLLVRE---------FEMVDDEPT 224 Query: 262 LIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHLLVQH 318 +IHY G KPW S + +P++D P+D R + + + Sbjct: 225 IIHYNGPDKPWRFVTERESGDLWWEYASKTPFRDYVPKDKGVKEIIFVRARSAMRRF 281 >UniRef50_D1IFB6 Whole genome shotgun sequence of line PN40024, scaffold_26.assembly12x (Fragment) n=1 Tax=Vitis vinifera RepID=D1IFB6_VITVI Length = 473 Score = 150 bits (379), Expect = 6e-35, Method: Composition-based stats. 
Identities = 50/290 (17%), Positives = 108/290 (37%), Gaps = 41/290 (14%) Query: 21 NINTSECLNVAYGVDANYLDGVGVSITSIVLNNR-HINLDFYIIADVYNDGFFQKIAKLA 79 + +++A +D+ YL G ++ SI+ ++ N+ F+ IA ++ + + +L Sbjct: 137 SSCDPSLVHIAMTLDSEYLRGSIAAVHSILRHSSCPENVFFHFIAAEFDPASPRVLTQLV 196 Query: 80 EQNQLRITL--YRINTDKLQCLPCTQVWS----RAMYFRLFAFQLLGLTLDRLLYLDADV 133 + Y D + L + + S Y R + +L ++R++Y+D+D+ Sbjct: 197 RSTFPSLNFKVYIFREDTVINLISSSIRSALENPLNYARNYLGDILDPCVERVIYIDSDL 256 Query: 134 VCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWAD 193 V DI +L ++ L YFN+GV+ +DL +W Sbjct: 257 VVVDDIRKLWNITLTEKP-------------------------CYFNTGVMVMDLVRWRK 291 Query: 194 AKLTEKALSILM--SKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQN 251 K + + + +Y+ ++ G + +N +K + Sbjct: 292 GNYRRKIENWMELQRRRRIYELGSLPPFLLVFAGNVEAIDHRWNQHGLGGDNVKG----S 347 Query: 252 YKKLITESTLLIHYTGATKPWHKWAIY---PSVKYYKIALENSPWKDDSP 298 + L L+H++G KPW + P ++ P ++ Sbjct: 348 CRPLHPGPVSLLHWSGKGKPWSRLDARKPCPVDHLWEPYDLYKPHRNHRL 397 >UniRef50_C5P955 Glycosyl transferase family 8 protein n=2 Tax=Coccidioides RepID=C5P955_COCP7 Length = 823 Score = 150 bits (379), Expect = 6e-35, Method: Composition-based stats. 
Identities = 42/256 (16%), Positives = 83/256 (32%), Gaps = 41/256 (16%) Query: 43 GVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLYRINTDKLQCLPCT 102 V S+ N + + D +++ L ++ + ++ L + Sbjct: 1 MVLAHSLRDNGTRAKIVVLVTPDSLQASTIEELKSLYDEVIPVSRVVNVSPANLYLMDRP 60 Query: 103 QVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQ 162 + S + ++ ++ + +++Y+DADVV +LL L A V D+ Sbjct: 61 DLIS--TFTKIELWR--QIQYRQIVYIDADVVALRAPDELLTLDTQ---LAAVPDIGWP- 112 Query: 163 EKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVL 222 FNSGV+ L + T +L + + DQ ++N+ Sbjct: 113 --------------DCFNSGVLVL-----RPSLQTYYSLVAFAQRGISFDGADQGLLNMH 153 Query: 223 LKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTGATKPWHKWAI----- 277 + L YN + + + S L+HY G KPW Sbjct: 154 FRN-WDRLSFAYNCTPSGHYQYIPA-----FRHFQSSISLVHYIGQKKPWSLPRQTFPVE 207 Query: 278 YPSVKY---YKIALEN 290 P + + + Sbjct: 208 GPYNQLLARWWAVYDR 223 >UniRef50_C6X2V2 Putative glycosyltransferase n=1 Tax=Flavobacteriaceae bacterium 3519-10 RepID=C6X2V2_FLAB3 Length = 315 Score = 150 bits (379), Expect = 7e-35, Method: Composition-based stats. 
Identities = 60/297 (20%), Positives = 113/297 (38%), Gaps = 23/297 (7%) Query: 22 INTSECLNVAYGVDANYLDGVGVSITSIVLNNR-HINLDFYIIADVYNDGFFQKIAKLAE 80 L + + D +Y V I+SI+ N+ + + I+++ +D K+ + Sbjct: 3 QPRMNLLPIVFTCDDHYFKYAAVVISSIIHNSSRNTKYEINIVSEYISDENQSLAQKMVQ 62 Query: 81 -QNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDI 139 ++ + I + I + + S + Y+R F F LL DR+LYLD+D++ DI Sbjct: 63 SKSNISIQFHAIKIENPEVFHLNSYMSLSTYYRFFIFDLL-KDYDRVLYLDSDLIVDNDI 121 Query: 140 SQLLHLGLNGAVAAVVKDV-----------EPMQEKAVSRLSDPELLGQYFNSGVVYLDL 188 S + A + + +++ + +YFN+GV+ ++ Sbjct: 122 SFFADIDFENKPAICCPSIYVQNSLKNNTDHKFTREYFTQILKMSDVDEYFNAGVILFNI 181 Query: 189 KKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGM--TLFLPREYNTIYTIKSELKD 246 K + K + Y DQD++N +L+ + EYN T+K LK Sbjct: 182 KLIRAQGIDRKFFEAI-KNIKDPVYQDQDILNSVLRNNGGAKLISNEYNHTKTMKFSLKR 240 Query: 247 KTHQNYKKLITES----TLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPR 299 K + + HY G KPW P + +P+ + + Sbjct: 241 IFLNALKNKFGKKRNNWFTIYHYVGKVKPWQ--NFNPDSALFLYYAYKTPFVREILK 295 >UniRef50_B6Q6I5 Glycogenin n=3 Tax=Trichocomaceae RepID=B6Q6I5_PENMQ Length = 775 Score = 150 bits (379), Expect = 7e-35, Method: Composition-based stats. 
Identities = 43/278 (15%), Positives = 83/278 (29%), Gaps = 41/278 (14%) Query: 21 NINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAE 80 E + +YL G V S+ N + + + +++ + + Sbjct: 1 MATPGEAVYCTLLTSDHYLPGAVVLAHSLRDNGTRAKIVALFTPETLKEATIRELQTVYD 60 Query: 81 QNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDIS 140 + L + + S + ++ ++ R++Y+DADV+ Sbjct: 61 EIIPVQLRSNGTPANLLLMGRLDLIS--TFTKIELWR--QTQYSRIVYMDADVLALRAPD 116 Query: 141 QLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKA 200 +LL L + A D+ FNSGV+ L A Sbjct: 117 ELLSLQED---FAAAPDIGWP---------------DIFNSGVMVL-----RPNLQDYYA 153 Query: 201 LSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITEST 260 L + + DQ ++N K L YN + + + Sbjct: 154 LRAFAERGTSFDGGDQGLLNTYFK-RWYRLSFTYNCTPSGNYQYMPAYRH-----FESTI 207 Query: 261 LLIHYTGATKPWHKWA--------IYPSVKYYKIALEN 290 LIH+ G+ KPW + Y + + + Sbjct: 208 SLIHFIGSQKPWTQSRHAFASGTPYYQLLGRWWAQYDR 245 >UniRef50_B9SU65 UDP-glucose glycoprotein:glucosyltransferase, putative n=3 Tax=Magnoliophyta RepID=B9SU65_RICCO Length = 1512 Score = 149 bits (377), Expect = 1e-34, Method: Composition-based stats. 
Identities = 34/205 (16%), Positives = 72/205 (35%), Gaps = 11/205 (5%) Query: 22 INTSECLNVAYGVDAN-YLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAE 80 + +N+ + Y + + I S++ N + + F+ I + + F I +A+ Sbjct: 1296 SRRGKPINIFSIASGHLYERFLKIMILSVLKNTQ-RPVKFWFIKNYLSPQFKDVIPCMAQ 1354 Query: 81 QNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDIS 140 + L + Y LF + L+L++++++DAD V + D+ Sbjct: 1355 EYGFEYELITYKWPSWLHKQKEKQRIIWAYKILFLDVIFPLSLEKVIFVDADQVVRADMG 1414 Query: 141 QLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLG------QYFNSGVVYLDLKKWADA 194 +L + + G A + ++ R Y S + +DL K+ + Sbjct: 1415 ELYDMDIKGRPLAYTPFCDNNKDMDGYRFWRQGFWKEHLRGRPYHISALYVVDLVKFRET 1474 Query: 195 KLTEKA---LSILMSKDNVYKYPDQ 216 + L N DQ Sbjct: 1475 AAGDNLRVFYETLSKDPNSLANLDQ 1499 >UniRef50_Q2GW94 Putative uncharacterized protein n=1 Tax=Chaetomium globosum RepID=Q2GW94_CHAGB Length = 774 Score = 149 bits (376), Expect = 1e-34, Method: Composition-based stats. Identities = 39/288 (13%), Positives = 84/288 (29%), Gaps = 46/288 (15%) Query: 36 ANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITL---YRIN 92 YL G V S+ L + D + ++ + + + + N Sbjct: 17 DTYLPGALVLAHSLRDAGTTKKLAVLVTLDTVSADVVTQLKAVYDYVIPVSRIQNEHTAN 76 Query: 93 TDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVA 152 D + +++ +R +++Y+DAD+V +L +L Sbjct: 77 LDLMNRRDLHSAFTKINLWR-------QTQFRKIVYVDADIVAYRAPDELFNLP---HPF 126 Query: 153 AVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYK 212 + D+ + +G Y+ AL+ + + + Sbjct: 127 SAAPDIGWPDLFNTGLMVLTPNMGDYY--------------------ALTAMARRGISFD 166 Query: 213 YPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTGATKPW 272 DQ ++N+ K L YN + + K ++H+ G KPW Sbjct: 167 GADQGLLNMYFKNSFNRLSFSYNVTPSAHYQYVPA-----YKHFQSGINMVHFIGPEKPW 221 Query: 273 HKWAI--------YPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYK 312 + V + + K+ S + + + K Sbjct: 222 LQGRDITTGSSPFDQMVGRWWAVYDRHYRKEPSQPEQEVPAIVQYFVK 269 >UniRef50_C7Z1L1 Putative uncharacterized protein n=1 Tax=Nectria haematococca mpVI 77-13-4 RepID=C7Z1L1_NECH7 Length = 762 Score = 149 bits 
(376), Expect = 2e-34, Method: Composition-based stats. Identities = 38/265 (14%), Positives = 82/265 (30%), Gaps = 42/265 (15%) Query: 35 DANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLYRINTD 94 +YL G V S+ H L + D + ++ + + + N Sbjct: 18 SDSYLPGALVLAHSLRDAGTHRKLAVLVTLDSVSADSITQLKAVYDYIFPVPRIRNDNPA 77 Query: 95 KLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAV 154 L + + S + ++ ++L +++Y+DAD+V +L + + Sbjct: 78 NLYLMNRGDLHS--AFTKINLWKL--TQFSKIVYIDADIVAYRAPDELFDI---THPFSA 130 Query: 155 VKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTE-KALSILMSKDNVYKY 213 D+ FN+GV+ L + + A+ + + + Sbjct: 131 APDIGWPDL---------------FNTGVMVLTP------NMGDFYAMIAMAERGISFDG 169 Query: 214 PDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTGATKPWH 273 DQ ++N+ + YN + + S ++H+ GA KPW Sbjct: 170 ADQGLINMHFGNQYNRISFTYNVTPSAHYQYVPAYRH-----FQSSINMVHFIGAKKPWF 224 Query: 274 KWAI-----YPSVK---YYKIALEN 290 P + + Sbjct: 225 TGRDAPRGADPFNDMVGRWWAVYDR 249 >UniRef50_UPI0001757CC2 PREDICTED: similar to glycogenin n=1 Tax=Tribolium castaneum RepID=UPI0001757CC2 Length = 512 Score = 149 bits (375), Expect = 2e-34, Method: Composition-based stats. 
Identities = 46/258 (17%), Positives = 87/258 (33%), Gaps = 39/258 (15%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLY 89 V + +Y G V S+ L ++ + K+A + + Q L Sbjct: 7 VTLATNDSYSLGALVLAHSLKQVGSKHQLAV-LVTPGVTNPMRAKLATVFDLVQEVNILD 65 Query: 90 RINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNG 149 + L+ L ++ + +L ++L D+ ++LDAD + + +L Sbjct: 66 SKDESNLRLLKRPELG--VTFTKLHCWRL--TQFDKCVFLDADTLVLQNCDELFERE--- 118 Query: 150 AVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDN 209 + DV FNSGV + +K + + K Sbjct: 119 -ELSAAPDVGWP---------------DCFNSGVFVFRPS----NETYDKLVQFAVEK-G 157 Query: 210 VYKYPDQDVMNVLL-----KGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIH 264 + DQ ++N+ K ++ LP YN T K +IH Sbjct: 158 SFDGGDQGLLNLYFSDWATKDISKHLPFIYNLCSTACYSYLPA-----FKQFGADAKIIH 212 Query: 265 YTGATKPWHKWAIYPSVK 282 + G++KPW ++ + K Sbjct: 213 FIGSSKPWLQYFNTETRK 230 >UniRef50_C4Y414 Putative uncharacterized protein n=1 Tax=Clavispora lusitaniae ATCC 42720 RepID=C4Y414_CLAL4 Length = 1428 Score = 148 bits (374), Expect = 3e-34, Method: Composition-based stats. 
Identities = 46/256 (17%), Positives = 95/256 (37%), Gaps = 15/256 (5%) Query: 18 RLANINTSECLNVAYGVDAN-YLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIA 76 L +NV + Y + + S+V N ++ F++I + + GF +++ Sbjct: 1140 HLRAAKEQADINVFSIASGHLYEQLMSTMMLSVVKN-TGKSVKFWLIENFLSHGFRERVP 1198 Query: 77 KLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCK 136 LAE+ Y LF L LD+++++DAD + + Sbjct: 1199 GLAEKYGFEYEYVGYQWPAWLRQQKQLHRKVWGYKMLFLDTLFPADLDKVIFVDADQIAR 1258 Query: 137 GDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLG-------QYFNSGVVYLDLK 189 D+ +L+++ L GA + +E + +Y S + +DL+ Sbjct: 1259 TDLKELVNIDLEGAPYGFAPMCDSRKEMEGYQFWKNGYWPTVLKDDLKYHISALYVVDLR 1318 Query: 190 KWADAKLTEKA---LSILMSKDNVYKYPDQDVMNVLLKGM-TLFLPREYNTIYTIKSELK 245 + + + +K L + N DQD+ N L + + LP+E+ T S+ Sbjct: 1319 RLRETLVGDKLRSHYQKLSADPNSLSNLDQDLPNNLQRQVPIHTLPQEWLWCETWCSDES 1378 Query: 246 DKTHQNYKKLITESTL 261 + + + + Sbjct: 1379 KSSAKMID--MCNNPK 1392 >UniRef50_UPI00016C0890 glycosyl transferase family 8 protein n=2 Tax=Epulopiscium sp. 'N.t. morphotype B' RepID=UPI00016C0890 Length = 593 Score = 148 bits (374), Expect = 3e-34, Method: Composition-based stats. 
Identities = 49/286 (17%), Positives = 100/286 (34%), Gaps = 26/286 (9%) Query: 30 VAYGVDANYLDGVGVSITSIVL-NNRHINLDFYIIADVYNDGFFQKIAKLA-EQNQLRIT 87 + D ++ G ++ S+V +N + N D I ++ + + ++ + Sbjct: 288 IVLTTDDRFIIGAAATLISLVKTSNVNNNYDIIIFHKDLSEKSKTLLRNVVVQRINFSLR 347 Query: 88 LYRI--NTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHL 145 Y + W +YF+L ++ + L+LD D++ DI+ LL + Sbjct: 348 FYDVGYEMSTYNVYKPGNNWQPCVYFKLLIPSIMH-NYKKSLHLDCDLIILEDIANLLSI 406 Query: 146 GLNGAVAAVVKDVEP--------MQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLT 197 L G A ++ K + +YFN GV+ ++ ++ Sbjct: 407 DLKGNAVAGCAEMGCITTSIRRTWANKYYHEKLRITNMVEYFNGGVIVFNINEFHKITSL 466 Query: 198 EKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYN--------TIYTIKSELKDKTH 249 + L K +QD+++ LP+ +N + K L + Sbjct: 467 AQLLHEAEKKHLNL---EQDILSKSFVNHIYLLPQSWNLTRDFLGTVMNLYKQYLPSNIY 523 Query: 250 QNYKKLITESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKD 295 Q Y + +IHY G KPW + Y+ + + + Sbjct: 524 QKYLD-ARQKPKIIHYIGPLKPWDNPNL-EYASYWWDTIRGTEIYE 567 >UniRef50_D2VE03 UDP-glucose-glycoprotein glucosyltransferase n=1 Tax=Naegleria gruberi RepID=D2VE03_NAEGR Length = 1404 Score = 148 bits (373), Expect = 3e-34, Method: Composition-based stats. 
Identities = 39/253 (15%), Positives = 83/253 (32%), Gaps = 15/253 (5%) Query: 24 TSECLNVA-YGVDANYLDGVGVSITSIVLN--NRHINLDFYIIADVYNDGFFQKIAKLAE 80 + +++ Y + + I S+ + + + F+ + + Q + + A+ Sbjct: 1120 EKKTIHIFSLASGLMYERLLKIMILSVRKHLKRSDVKVKFWFLKQFLSPSLKQFLPEYAK 1179 Query: 81 QNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLG-LTLDRLLYLDADVVCKGDI 139 L T+ Y LF L +++++++DAD VC+ D+ Sbjct: 1180 AYNFEYGLISYQWPHWLHKQQTKQRLIWAYKVLFLDVLFPLQEVNKIIFVDADQVCRTDM 1239 Query: 140 SQL-LHLGLNGAVAAVVKDVEPMQE------KAVSRLSDPELLGQYFNSGVVYLDLKKWA 192 S+L L + G A E +E ++ Y S + +D+ + Sbjct: 1240 SELFFDLDMQGKALAYTPFCESRKEMDGYRFWKTGYWANHLGGRPYHISALYVVDIDMFR 1299 Query: 193 DAKLTEKA---LSILMSKDNVYKYPDQDVMNVLLKGM-TLFLPREYNTIYTIKSELKDKT 248 ++ L N DQD+ N + LP+E+ + S+ Sbjct: 1300 RNYHGDQFRMVYDNLARDPNSLSNLDQDLPNYAQHNVPIRSLPQEWLWCESWCSDESKAK 1359 Query: 249 HQNYKKLITESTL 261 + T Sbjct: 1360 AKTIDLCNNPQTK 1372 >UniRef50_Q17VR5 Lipopolysaccharide biosynthesis protein n=19 Tax=Helicobacter RepID=Q17VR5_HELAH Length = 405 Score = 147 bits (372), Expect = 4e-34, Method: Composition-based stats. 
Identities = 59/374 (15%), Positives = 122/374 (32%), Gaps = 81/374 (21%) Query: 27 CLNVAYGVDANYLDGVGVSITSIVLNNRHIN------LDFYIIADVYNDGFFQKIAKLAE 80 + + D NY GVS+ S++ N + + + D + +K+ + Sbjct: 6 IIPIVVAFDNNYCIPAGVSLYSMLANAKTERERVKLFYKIHCLVDGLSAENIEKLKETLA 65 Query: 81 QN--QLRITLYRINTDKLQCL-----------------------------------PCTQ 103 + I+T + Sbjct: 66 PFSAFSSVEFLEISTHNTPKENQEIKKNQTIKSDHYQNIDPIIANKIEELFTKLSNYSQK 125 Query: 104 VWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVVK--DVEPM 161 +S+ + RL L D+++ D D + GDIS+ + L V+ D+ M Sbjct: 126 RFSKMIMCRLLLASLFP-QYDKMIMFDVDTLFVGDISESFFIPLEAHYFGAVREKDLIAM 184 Query: 162 QEKAVSRLSDPE---------------------LLGQYFNSGVVYLDLKKWADAKLTEKA 200 + L + L YFN+G + L+LK W L + Sbjct: 185 NRNSAKDLYELRQRRAKSIGVANAFPNLEEAQILFDNYFNAGFLALNLKLWRKENLENQL 244 Query: 201 LSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITEST 260 + + K+ + DQD + + +G L LP YN + + + Sbjct: 245 IGFFILKNEKLLFNDQDALCFVCRGRILELPYPYNAHPSFLDTPSFPS--------IKEV 296 Query: 261 LLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHLLVQHHY 320 ++H+ G KPW +++ K + L +P+KD + + H+ +++ Sbjct: 297 CMLHFWG-DKPWKIFSV-FGAKKWHEVLMQTPFKDKY----FNTPFLDHLFNHIQNKNNK 350 Query: 321 ISGIIAGVCYLCRK 334 + + ++ ++ Sbjct: 351 LRTFNKALSFVDKR 364 >UniRef50_B3XPR8 Glycosyl transferase family 8 n=4 Tax=Lactobacillus RepID=B3XPR8_LACRE Length = 465 Score = 147 bits (371), Expect = 5e-34, Method: Composition-based stats. 
Identities = 49/266 (18%), Positives = 99/266 (37%), Gaps = 25/266 (9%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLY 89 +A VD ++D ++ SI +N+ N+ YII +F I + +I Sbjct: 4 IALSVDYRWIDQAETTLKSIYAHNK--NVKTYIINHDIPHEWFVNINRYLGVQDSQIIDR 61 Query: 90 RINTDKLQCLPCTQ-VWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLN 148 +I+ ++ + +P + S +Y + +L+ D++LYLD+DV+ ++ QL +N Sbjct: 62 KIDEERFKDMPMPEARISPMVYGKFLIPELIPE--DQVLYLDSDVIVDKNLDQLFATKIN 119 Query: 149 GAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKD 208 V D + FNSGV+ ++ W + + + L + + Sbjct: 120 DRPLYTVVDYFNPSQ---------------FNSGVLLINNLFWRNNNIGNQLLKLGHDYN 164 Query: 209 NVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITE--STLLIHYT 266 Q +MN L +N + + ++ + +IHYT Sbjct: 165 ---LNNTQVIMNEGFAQNYGKLDPCFNFQIGYERKSYWNDKSSFYAFFDKVTDPAIIHYT 221 Query: 267 GATKPWHKWAIYPSVKYYKIALENSP 292 KP++ + + Sbjct: 222 EKDKPFNIEKTVELREKWWYYHNLEW 247 >UniRef50_A4R9Z3 Putative uncharacterized protein n=1 Tax=Magnaporthe grisea RepID=A4R9Z3_MAGGR Length = 866 Score = 147 bits (370), Expect = 7e-34, Method: Composition-based stats. 
Identities = 42/287 (14%), Positives = 88/287 (30%), Gaps = 49/287 (17%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLY 89 + + NYL G V S+ L + D + I +L I + Sbjct: 11 ITLLLSDNYLPGALVLAHSLRDAGTTRKLAIMVTLDTVAA---KVITQLKAVYDYVIPVP 67 Query: 90 RINTDKLQCLPCTQVWS-RAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLN 148 RI ++ L + + ++ ++ +L+Y+DADVV +L + Sbjct: 68 RIRNERPANLYLMNRPDLHSAFTKVNLWK--QTQFSKLVYIDADVVAYRAPDELFAI--- 122 Query: 149 GAVAAVVKDVEPMQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKA-LSILMSK 207 + D+ FN+GV+ L + + + + + Sbjct: 123 AHPFSAAPDIGWPDL---------------FNTGVMVLTP------NMGDYYAMMAMAER 161 Query: 208 DNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTG 267 + DQ ++N+ + + YN + + S ++H+ G Sbjct: 162 GISFDGADQGLINMHFRHTYNRISFTYNVTPSAHYQYVPAYRH-----FQSSINMVHFIG 216 Query: 268 ATKPWHK--------WAIYPSVKYYK-----IALENSPWKDDSPRDA 301 + KPW + A V + + ++ R Sbjct: 217 SEKPWIQGRNSTAGGGAFDEMVGRWWAVYDRHYRAPTVYEPQVQRPP 263 >UniRef50_Q68CQ7 Glycosyltransferase 8 domain-containing protein 1 n=45 Tax=Euteleostomi RepID=GL8D1_HUMAN Length = 371 Score = 146 bits (368), Expect = 1e-33, Method: Composition-based stats. 
Identities = 57/337 (16%), Positives = 100/337 (29%), Gaps = 51/337 (15%) Query: 11 KVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDG 70 R A E + V + L G +I SI +N N+ FYI+ Sbjct: 49 DFVPNALRHAVDGRQEEIPVVIAASEDRLGGAIAAINSI-QHNTRSNVIFYIVTLNNTAD 107 Query: 71 FFQKIAKLAEQNQLRITLYRINTDKLQCLPCTQV-----WSRAMYFRLFAFQLLGLTLDR 125 + +R + + L+ + R + L+ + Sbjct: 108 HLRSWLNSDSLKSIRYKIVNFDPKLLEGKVKEDPDQGESMKPLTFARFYLPILVPSA-KK 166 Query: 126 LLYLDADVVCKGDISQLLHLGLN-GAVAAVVKDVEP--------------------MQEK 164 +Y+D DV+ +GDI L + L G AA +D + +K Sbjct: 167 AIYMDDDVIVQGDILALYNTALKPGHAAAFSEDCDSASTKVVIRGAGNQYNYIGYLDYKK 226 Query: 165 AVSRLSDPELLGQYFNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYP-------DQD 217 R + FN GV +L +W +T + + Y Sbjct: 227 ERIRKLSMKASTCSFNPGVFVANLTEWKRQNITNQLEKWMKLNVEEGLYSRTLAGSITTP 286 Query: 218 VMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHYTGATKPWHKWAI 277 + ++ + +N L + Y ++ L+H+ G KPW + A Sbjct: 287 PLLIVFYQQHSTIDPMWNV-----RHLGSSAGKRYSPQFVKAAKLLHWNGHLKPWGRTAS 341 Query: 278 YPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKHL 314 Y V W+ D +RY + Sbjct: 342 YTDV-----------WEKWYIPDPTGKFNLIRRYTEI 367 >UniRef50_Q9FIK3 Emb|CAB71043.1 n=39 Tax=Embryophyta RepID=Q9FIK3_ARATH Length = 615 Score = 146 bits (368), Expect = 1e-33, Method: Composition-based stats. 
Identities = 49/324 (15%), Positives = 108/324 (33%), Gaps = 53/324 (16%) Query: 9 IDKVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHI-NLDFYIIADVY 67 ++ + + ++ + A D + V + S + N +H F+I+ D Sbjct: 289 LNSSEQQFPNQEKLEDTQLYHYALFSDN--VLATSVVVNSTITNAKHPLKHVFHIVTDRL 346 Query: 68 NDGFFQK------------------------------IAKLAEQNQLRITL--YRINTDK 95 N + + +L+ ++ + + N+D Sbjct: 347 NYAAMRMWFLDNPPGKATIQVQNVEEFTWLNSSYSPVLKQLSSRSMIDYYFRAHHTNSDT 406 Query: 96 LQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVV 155 + S + R + ++ L ++L+LD D+V + D+S L + L G V V Sbjct: 407 NLKFRNPKYLSILNHLRFYLPEIFP-KLSKVLFLDDDIVVQKDLSGLWSVDLKGNVNGAV 465 Query: 156 KDVEPMQEKAVSRLS--------DPELLGQYFNSGVVYLDLKKWADAKLTEKALSI--LM 205 + + L+ + + + G+ DL +W +TE L Sbjct: 466 ETCGESFHRFDRYLNFSNPLISKNFDPRACGWAYGMNVFDLDEWKRQNITEVYHRWQDLN 525 Query: 206 SKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITESTLLIHY 265 ++K + T L R+++ + + + + E +IHY Sbjct: 526 QDRELWKLGTLPPGLITFWRRTYPLDRKWHILGLGYN-------PSVNQRDIERAAVIHY 578 Query: 266 TGATKPWHKWAIYPSVKYYKIALE 289 G KPW + I ++ ++ Sbjct: 579 NGNLKPWLEIGIPRYRGFWSKHVD 602 >UniRef50_A3YS36 Putative sugar transferase n=2 Tax=Campylobacter jejuni RepID=A3YS36_CAMJE Length = 459 Score = 146 bits (368), Expect = 1e-33, Method: Composition-based stats. 
Identities = 71/339 (20%), Positives = 133/339 (39%), Gaps = 33/339 (9%) Query: 28 LNVAYGVDANYLDGVGVSITSIVLNNRHIN---LDFYIIADVYNDGFFQKI----AKLAE 80 ++ + Y++ + V + SI++N N F+I++ ND +K+ +L+ Sbjct: 2 YHIVFNSSNEYIENLSVLMYSIIINTNKSNTKKYCFHILSSNINDNTCKKLTLLEKELSS 61 Query: 81 QNQLRITLYRINTDKLQCLPCTQV-WSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDI 139 I +Y IN + + S Y RL +L + + LYLD D++ GDI Sbjct: 62 IYPSEIKIYHINDNLFYDYNIPKHEGSYNAYLRLMLASILSKDIKKCLYLDVDMLVLGDI 121 Query: 140 SQLLHLGLNGAVAAVVKDVEPMQEKAVSRLS--DPELLGQYFNSGVVYLDLKKWADAKLT 197 S+L L L V A V ++ S+ S + G +FNSG++ ++L W + + Sbjct: 122 SELFDLDLKDKVFAAVFILKHPWPNLNSKDSSEIFYIYGSHFNSGLMLINLDAWREKNIE 181 Query: 198 EKALSILMSKDNVYKYPDQDVMNVLL-KGMTLFLPREYNTIYTIKSELKDK--------- 247 ++LS + + Y D+ V+N +L K L E+N + + + Sbjct: 182 SRSLSFIKNYYVPYA-VDEYVLNAILSKDDIFSLKLEWNFLIGFRRLYLNNDLFFNKEEG 240 Query: 248 -------THQNYKKLITESTLLIHYTGA--TKPWHKWAIYPSVKYYKIALEN-SPWKDDS 297 + + + ++HYT KPW + Y + E W D + Sbjct: 241 DKYKIICYSKEEFEKAFKKIKILHYTYLYMPKPWENVYSFIDDDYNLVYYEFYDAWWDMA 300 Query: 298 PRDAKSIIEFKKRYKHLLVQ--HHYISGIIAGVCYLCRK 334 + F K+ + + Y + + L +K Sbjct: 301 LKTPIYGEHFAKKKREYEKKSLLTYAQAMSKKIKALEKK 339 >UniRef50_A2EY94 Glycosyl transferase family 8 protein n=1 Tax=Trichomonas vaginalis RepID=A2EY94_TRIVA Length = 1241 Score = 146 bits (368), Expect = 1e-33, Method: Composition-based stats. 
Identities = 48/265 (18%), Positives = 97/265 (36%), Gaps = 15/265 (5%) Query: 12 VKAWDFRLANINTSECLNVAYGVDAN-YLDGVGVSITSIVLNNRHINLDFYIIADVYNDG 70 +++ + T E ++V Y + + S+ + + ++ F+I+ + + Sbjct: 946 FSSFEQVRTDPKTIETVDVFIVASGQLYERLAKIMMISVRRH-TNSSVRFWILKNYLSPS 1004 Query: 71 FFQKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLD 130 F + K++++ L N + LF + L R++Y+D Sbjct: 1005 FKASLPKMSQEYNFSYNLISYNWPANLFKQKEKNRIIWANKILFLDNIFPPDLKRVIYID 1064 Query: 131 ADVVCKGDISQLLHLGLNGAVAAVVKDVE------PMQEKAVSRLSDPELLGQYFNSGVV 184 AD + + D+S+L+ L L+GA A + P + +Y S + Sbjct: 1065 ADQIVRSDLSELMKLDLSGAPYAFTPMCDSRTEIEPYRFWKRGYWQKQLRGKKYHISALF 1124 Query: 185 YLDLKKWADAK---LTEKALSILMSKDNVYKYPDQDVMNVLLKGM-TLFLPREYNTIYTI 240 +DL+++ + L N DQD+ N + + LP+E+ T Sbjct: 1125 VVDLERFRQMDAGEILRDVYQDLAPDPNSLANLDQDLPNYVQDALPIYSLPQEWLWCETW 1184 Query: 241 KSE---LKDKTHQNYKKLITESTLL 262 S+ K KT +T L Sbjct: 1185 CSDETMNKAKTIDLCNNPLTHKPKL 1209 >UniRef50_UPI000197AD97 hypothetical protein BACCOPRO_03221 n=1 Tax=Bacteroides coprophilus DSM 18228 RepID=UPI000197AD97 Length = 313 Score = 145 bits (367), Expect = 2e-33, Method: Composition-based stats. 
Identities = 59/319 (18%), Positives = 114/319 (35%), Gaps = 22/319 (6%) Query: 24 TSECLNVAYGVDANYLDGVGVSITSIVLNNRHIN-LDFYIIADVYNDGFFQKIAKLAEQN 82 + + + + D N + V I+S+++N + D +I+ D +++ +L + Sbjct: 1 MMKTVPIVFAFDNNLILPACVCISSLLMNAKEETFYDIFILHSSKVDLHKEQLDELPKYF 60 Query: 83 QLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQL 142 YR+ + + + Y+RL +L+ D ++Y D DV+ + D+S + Sbjct: 61 NRCRIQYRVVDNTFDQAFEIRGITTPTYYRLLIPELVP-EYDNIIYSDVDVIFRFDLSDI 119 Query: 143 -LHLGLNGAVAAVVKDVEPMQEKAVSRLSDPE--LLGQYFNSGVVYLDLKKWADAKLTEK 199 H LN + A V + P + +G + L+ KK + L E+ Sbjct: 120 YFHTDLNDSYVAGVNALVPFIPDMKKYYLKLGNVNIDSIIYAGNIILNSKKIREDNLVER 179 Query: 200 ALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITES 259 + N + + D DV+N+ KG +L + + + Sbjct: 180 FKELAK---NKFHFQDLDVLNIACKGKITYLKPVFCLTTYFSELALRHRNLLRDFWSDKD 236 Query: 260 T------LLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAKSIIEFKKRYKH 313 ++HY G KPW + + SP+ D+ K EF + Sbjct: 237 IDEALTEGIVHYNGQ-KPWKGICVN--SDIWWEYYRKSPFFDE-----KFYFEFFYTRLN 288 Query: 314 LLVQHHYISGIIAGVCYLC 332 L Q I + Y Sbjct: 289 ELDQLSLWKRIKILIRYFV 307 >UniRef50_A5DMZ6 Putative uncharacterized protein n=2 Tax=Pichia guilliermondii RepID=A5DMZ6_PICGU Length = 1415 Score = 145 bits (366), Expect = 2e-33, Method: Composition-based stats. 
Identities = 43/265 (16%), Positives = 89/265 (33%), Gaps = 16/265 (6%) Query: 14 AWDFRLANINTSECLNVA-YGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFF 72 + NT E +N+ Y + + + S + + ++ +++ + F Sbjct: 1093 NFWSSKGVENTGEDINIFTIASGELYEHLLSIMLASATSHTK-RSVKLWLLEGFLSPKFR 1151 Query: 73 QKIAKLAEQNQLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDAD 132 + LA + + Y LF L L R++++DAD Sbjct: 1152 SNLPALASKYGFSYEFISYKWPIWLRSQQPVSRTVWGYKILFLDALFPQDLKRVIFIDAD 1211 Query: 133 VVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPELLG-------QYFNSGVVY 185 V + D+ +L+ L GA V E +E + +Y S + Sbjct: 1212 QVLRADLMELMETDLQGAPYGFVPMCESKEEMKGYQFWKQGYWAQMLQDDLKYHISALFV 1271 Query: 186 LDLKKWADAKLTEKA---LSILMSKDNVYKYPDQDVMNVLLKGM-TLFLPREYNTIYTIK 241 +DL ++ ++ ++ L S DQD+ N L + + LP E+ T Sbjct: 1272 VDLVEFRKRRVGDRLRAHYQKLSSDPKSLSNLDQDLPNNLQRIVPIHSLPPEWLWCDTWC 1331 Query: 242 SE---LKDKTHQNYKKLITESTLLI 263 ++ + K + ++ Sbjct: 1332 AKEELGRAKAIDLCNDPTSTEDKIV 1356 >UniRef50_Q92VQ2 Putative lipopolysaccharide 1,3-galactosyltransferase n=1 Tax=Sinorhizobium meliloti RepID=Q92VQ2_RHIME Length = 337 Score = 145 bits (365), Expect = 3e-33, Method: Composition-based stats. 
Identities = 59/342 (17%), Positives = 105/342 (30%), Gaps = 28/342 (8%) Query: 9 IDKVKAWDFRLANINTSECLNVAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYN 68 +DK + + + S + D NY + S + + + + Sbjct: 1 MDKGAVFPSNWQSSSGSAA--IVLVTDQNYALPTFSAALSADQHTKGADTAIRMFVVGAE 58 Query: 69 DGFFQKIAKLAEQNQLRITLYRINTD-KLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLL 127 D + ++ + ++++ R+ +L R + LL +DR L Sbjct: 59 DTWARQFDEAVAGTKIKVIAARLPQLAELSPYHRDHYLPPIALARFWIDSLLDAGVDRFL 118 Query: 128 YLDADVVCKGDISQLLHLGLNGAVAAVVKDVEPMQEKAVSRLSDPE---------LLGQY 178 Y+D D + G++ LL D + VSR + Y Sbjct: 119 YIDGDTMVDGELDSLLASTPPAEGLMAAPDFLNIFMDEVSRGKKRDLAHLEGIGCRPETY 178 Query: 179 FNSGVVYLDLKKWADAKLTEKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIY 238 FNSGV+Y + W + A+ ++ DQ +N +G L YN Sbjct: 179 FNSGVIYASREAW--NDIVPVAMKFMVEHPEHCPASDQSALNHAARGRVTMLSLRYNYQS 236 Query: 239 TIKSELKDKTHQNYKKLITESTLLIHYTGATKPWHK--WAIYPSVKYYKIALENSPWKDD 296 L + + H+TG KPW+ W S Y A E Sbjct: 237 EHMMVLDPRRRGI-------GPAIWHFTGGPKPWNTPGWPWDESFNRYYCAAEMRLHGST 289 Query: 297 SPRDAKSIIEFKKRYKHLLVQHHYISGIIAGVCYLCRKYYRK 338 + + H ++ + Y RK R+ Sbjct: 290 IVTPVPPEAQTRAGIAHRRRSRSRMTWV-----YPWRKITRR 326 >UniRef50_A2FK31 Putative uncharacterized protein n=1 Tax=Trichomonas vaginalis RepID=A2FK31_TRIVA Length = 1298 Score = 144 bits (364), Expect = 4e-33, Method: Composition-based stats. 
Identities = 36/252 (14%), Positives = 89/252 (35%), Gaps = 15/252 (5%) Query: 24 TSECLNVAYGVDAN-YLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQN 82 +++ Y + Y + +SI S+V + + + +++ + + F + + +E+ Sbjct: 1021 DDGKIHIFYVASGHLYERLMRISILSVVKHTKS-PVKLWLLENFASPNFRNSLKEFSEKY 1079 Query: 83 QLRITLYRINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQL 142 + + + + Y LF + L R++Y+D+D + + D+ +L Sbjct: 1080 KFEYEFCSYKWPRWLPREEARQRTFWGYKILFLDVMFPNDLRRVIYIDSDQIIRTDMREL 1139 Query: 143 LHLGLNGAVAAVVKDVEP------MQEKAVSRLSDPELLGQYFNSGVVYLDLKKWADAK- 195 + + G A + + + Y S + +DL ++ Sbjct: 1140 MTMDFEGKPYAFTPFCNDRPEMQEYRFWEIGYWQNLLNGKPYHISALFAVDLPEYRSLDV 1199 Query: 196 --LTEKALSILMSKDNVYKYPDQDVMNVLLKGM---TLFLPREYNTIYTIKSELKDKTHQ 250 + K L + DQD+ N++ + LP+E+ + S+ K + Sbjct: 1200 GGMMRKGYMDLHNDKESLSNLDQDLPNMM-QNRGAPIFSLPQEWLWCGSWCSDETMKKAK 1258 Query: 251 NYKKLITESTLL 262 T + Sbjct: 1259 TIDLCNNPRTKV 1270 >UniRef50_Q9FH36 Similarity to unknown protein n=28 Tax=Embryophyta RepID=Q9FH36_ARATH Length = 535 Score = 144 bits (363), Expect = 5e-33, Method: Composition-based stats. 
Identities = 48/333 (14%), Positives = 100/333 (30%), Gaps = 71/333 (21%) Query: 20 ANINTSECLNVAYGVDANYLDGVGVSITSIVLNN-RHINLDFYIIAD------------- 65 + + + D + V S+V N R + +II D Sbjct: 201 PMLVDNNYFHFVLASDN--ILAASVVAKSLVQNALRPHKIVLHIITDRKTYFPMQAWFSL 258 Query: 66 -VYNDGFFQK----------------IAKLAEQNQLRITLY---RINTDKLQCLP----- 100 + + + + + ++R + + P Sbjct: 259 HPLSPAIIEVKALHHFDWLSKGKVPVLEAMEKDQRVRSQFRGGSSVIVANNKENPVVVAA 318 Query: 101 -----CTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNGAVAAVV 155 + S + R+ +L +L+++++LD D+V + D+S L + +NG V V Sbjct: 319 KLQALSPKYNSLMNHIRIHLPELFP-SLNKVVFLDDDIVIQTDLSPLWDIDMNGKVNGAV 377 Query: 156 K-----DVEPMQEKAVSRLS--------DPELLGQYFNSGVVYLDLKKWADAKLTEKALS 202 + D M +K S L+ + + G+ DL W ++ Sbjct: 378 ETCRGEDKFVMSKKFKSYLNFSNPTIAKNFNPEECAWAYGMNVFDLAAWRRTNISSTYYH 437 Query: 203 ILMSKDNV----YKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLITE 258 L ++ + G + ++ + E E Sbjct: 438 WLDENLKSDLSLWQLGTLPPGLIAFHGHVQTIDPFWHMLGLGYQETTSYADA-------E 490 Query: 259 STLLIHYTGATKPWHKWAIYPSVKYYKIALENS 291 S ++H+ G KPW A + L++S Sbjct: 491 SAAVVHFNGRAKPWLDIAFPHLRPLWAKYLDSS 523 >UniRef50_C5SH34 Glycosyl transferase family 8 n=1 Tax=Asticcacaulis excentricus CB 48 RepID=C5SH34_9CAUL Length = 307 Score = 143 bits (362), Expect = 5e-33, Method: Composition-based stats. 
Identities = 64/315 (20%), Positives = 118/315 (37%), Gaps = 45/315 (14%) Query: 30 VAYGVDANYLDGVGVSITSIVLNNRHINLDFYIIADVYNDGFFQKIAKLAEQNQLRITLY 89 + Y VD NYL VS + N D I+ +D + + L I L Sbjct: 6 ICYVVDDNYLFPTLVSASQARENAPSSLADIVILC--LSDASDRVRKVMPVAVALGIELI 63 Query: 90 RINTDKLQCLPCTQVWSRAMYFRLFAFQLLGLTLDRLLYLDADVVCKGDISQLLHLGLNG 149 + T ++ L MY RLF +LL +R+LY+D D + LL++ + Sbjct: 64 EVPTASIENL-------HPMYGRLFIDKLLPKAYERVLYIDGDTQIAASLEPLLNVDIPE 116 Query: 150 AVAAVVKDVEPMQEKAVSRLS------------DPELLGQYFNSGVVYLDLKKWADAKLT 197 V+D M K + + + Y N+GV+ ++K WA+ L Sbjct: 117 GKFLAVRDPAAMFAKLSDKWASRIQGERVEAGLGDNPIEDYLNTGVLVFNMKDWAE--LA 174 Query: 198 EKALSILMSKDNVYKYPDQDVMNVLLKGMTLFLPREYNTIYTIKSELKDKTHQNYKKLIT 257 + L ++ ++ +K+ DQD MN+ + L++ +N + +++ + Sbjct: 175 GETLKLIRARSTPFKFGDQDPMNLAIGDRCLYISNRWNFPGFLIGSGQEERVK------- 227 Query: 258 ESTLLIHYTGATKPWHKWAIYPSVKYYKIALENSPWKDDSPRDAK-----SIIEFKKRYK 312 ++ H+ +PW K+ ++P+K R K + Sbjct: 228 --PVIYHFMSNPRPWVHAGAPWGPKW------HTPYKAFLARFPVLESVAPKTTPVKALR 279 Query: 313 HLLVQHHYISGIIAG 327 H L Q + G+ Sbjct: 280 HHLQQA--LKGVTEY 292

  Database: uniref50.fasta
    Posted date:  Mar 8, 2010 10:38 AM
  Number of letters in database: 1,040,396,356
  Number of sequences in database:  3,077,464

Lambda     K      H
   0.312    0.139    0.388

Lambda     K      H
   0.267   0.0426    0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,899,903,011
Number of Sequences: 3077464
Number of extensions: 74426496
Number of successful extensions: 259744
Number of sequences better than 1.0e-01: 250
Number of HSP's better than 0.1 without gapping: 893
Number of HSP's successfully gapped in prelim test: 980
Number of HSP's that attempted gapping in prelim test: 253577
Number of HSP's gapped (non-prelim): 2435
length of query: 338
length of database: 1,040,396,356
effective HSP length: 129
effective length of query: 209
effective length of database: 643,403,500
effective search space: 134471331500
effective search space used: 134471331500
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.3 bits)
S2: 93 (40.4 bits)
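
The effective lengths and search space reported in the statistics above are consistent with a simple length correction: the effective HSP length is subtracted once from the query and once per database sequence, and the two effective lengths are multiplied. The following is a minimal sketch of that arithmetic, not BLAST's own code; the function name `effective_search_space` is hypothetical, and the sketch omits any lower bounds or other adjustments the real program may apply.

```python
def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Recompute the effective lengths from the reported statistics.

    Assumption: a plain subtraction of the effective HSP length,
    once for the query and once for every database sequence.
    """
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


# Values copied from the report footer above.
eff_query, eff_db, space = effective_search_space(
    query_len=338,
    db_letters=1_040_396_356,
    db_seqs=3_077_464,
    hsp_len=129,
)
print(eff_query, eff_db, space)  # 209 643403500 134471331500
```

The three printed values match the "effective length of query", "effective length of database", and "effective search space" entries in the report.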