BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (441 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P75905 Biofilm PGA synthesis N-glycosyltransferase pgaC... 910 0.0 UniRef50_B2UK24 Glycosyl transferase family 2 n=10 Tax=Burkholde... 420 e-116 UniRef50_A4SIQ6 Glycosyl transferase, family 2 n=6 Tax=Gammaprot... 407 e-112 UniRef50_Q30SK0 Glycosyl transferase, family 2 n=6 Tax=Proteobac... 402 e-110 UniRef50_A3N3L7 Biofilm PGA synthesis N-glycosyltransferase PgaC... 355 2e-96 UniRef50_Q9RQP9 Biofilm PIA synthesis N-glycosyltransferase icaA... 343 1e-92 UniRef50_C8WT25 Glycosyl transferase family 2 n=3 Tax=Bacteria R... 342 1e-92 UniRef50_B9EAX5 Intercellular adhesion protein IcaA homolog n=19... 335 3e-90 UniRef50_A4J4W8 Glycosyl transferase, family 2 n=3 Tax=Clostridi... 277 6e-73 UniRef50_C9RCL0 Glycosyl transferase family 2 n=1 Tax=Ammonifex ... 263 1e-68 UniRef50_Q1IL87 Glycosyl transferase, family 2 n=1 Tax=Candidatu... 251 3e-65 UniRef50_C0WP52 Possible VGP family vectorial glycosyl polymeriz... 229 2e-58 UniRef50_Q9CHP7 Glycosyl transferase n=3 Tax=Lactococcus lactis ... 224 6e-57 UniRef50_Q38W83 Putative glycosyl transferase, family 2 n=1 Tax=... 205 2e-51 UniRef50_UPI0001AEFEC9 N-glycosyltransferase PgaC n=1 Tax=Strept... 192 1e-47 UniRef50_A5VMQ3 Glycosyl transferase, family 2 n=38 Tax=Lactobac... 192 3e-47 UniRef50_B5HP10 Bifunctional transferase/deacetylase n=9 Tax=Str... 162 2e-38 UniRef50_D2AU81 Polysaccharide deacetylase n=1 Tax=Streptosporan... 161 6e-38 UniRef50_Q82DY8 Putative polysaccharide deacetylase/glycosyltran... 160 8e-38 UniRef50_B1L4E1 Glycosyl transferase family 2 n=1 Tax=Candidatus... 157 7e-37 UniRef50_UPI0001B4D705 bi-functional transferase/deacetylase n=3... 155 2e-36 UniRef50_C7MUK1 Glycosyl transferase n=11 Tax=Actinomycetales Re... 153 1e-35 UniRef50_D1SD50 Polysaccharide deacetylase n=1 Tax=Micromonospor... 153 1e-35 UniRef50_B5GIB2 Bi-functional transferase/deacetylase n=7 Tax=St... 152 2e-35 UniRef50_C0YGU0 Polysaccharide deacetylase/glycosyl transferase,... 149 3e-34 UniRef50_A1R2V1 Glycosyl transferase, group 2 family domain prot... 147 6e-34 UniRef50_C8WBS4 Polysaccharide deacetylase n=4 Tax=Sphingomonada... 145 3e-33 UniRef50_Q2G7Y8 Polysaccharide deacetylase n=1 Tax=Novosphingobi... 144 8e-33 UniRef50_B0T3D0 Polysaccharide deacetylase n=2 Tax=Caulobacter R... 143 1e-32 UniRef50_B4CVJ2 Glycosyl transferase family 2 n=1 Tax=Chthonioba... 143 1e-32 UniRef50_B8II97 Polysaccharide deacetylase n=1 Tax=Methylobacter... 143 2e-32 UniRef50_B5HQE3 Bifunctional transferase/deacetylase n=3 Tax=Str... 142 2e-32 UniRef50_A6Q1D6 Glucosaminyltransferase n=2 Tax=Epsilonproteobac... 142 3e-32 UniRef50_B9ZC52 Glycosyl transferase family 2 n=1 Tax=Natrialba ... 142 3e-32 UniRef50_C1V8R5 Glycosyl transferase n=1 Tax=Halogeometricum bor... 140 1e-31 UniRef50_D2L983 Polysaccharide deacetylase n=1 Tax=Desulfovibrio... 139 2e-31 UniRef50_C6WRK1 Glycosyl transferase family 2 n=1 Tax=Actinosynn... 138 3e-31 UniRef50_Q1GJ85 Glycosyl transferase family 2 n=4 Tax=Rhodobacte... 137 9e-31 UniRef50_C1ACV5 Putative glycosyl transferase/polysaccharide dea... 136 1e-30 UniRef50_C1D0J4 Putative glycosyltransferase n=1 Tax=Deinococcus... 136 2e-30 UniRef50_A9EEN3 Glycosyl transferase, family 2 n=1 Tax=Oceanibul... 136 2e-30 UniRef50_B6BG61 Glycosyl transferase, group 2 family protein n=2... 136 2e-30 UniRef50_C5CVQ8 Glycosyl transferase family 2 n=4 Tax=Proteobact... 135 2e-30 UniRef50_A0RGJ4 Glycosyl transferase and polysaccharide deacetyl... 134 7e-30 UniRef50_Q1IUP8 Polysaccharide deacetylase n=1 Tax=Candidatus Ko... 133 1e-29 UniRef50_D2LDP7 Polysaccharide deacetylase n=1 Tax=Rhodomicrobiu... 133 2e-29 UniRef50_Q3B487 Glucosaminyltransferase n=2 Tax=Chlorobium/Pelod... 132 2e-29 UniRef50_C7RT53 N-glycosyltransferase PgaC n=1 Tax=Candidatus Ac... 132 3e-29 UniRef50_D2LZZ6 Glycosyl transferase family 2 n=1 Tax=Bacillus c... 132 3e-29 UniRef50_C5SLQ2 Polysaccharide deacetylase n=1 Tax=Asticcacaulis... 131 4e-29 UniRef50_A1UIF2 Polysaccharide deacetylase n=3 Tax=Mycobacterium... 131 6e-29 UniRef50_D1PAP3 Putative biofilm PGA synthesis N-glycosyltransfe... 131 6e-29 UniRef50_C5A5B1 Glycosyltransferase, family 2 n=9 Tax=Thermococc... 130 1e-28 UniRef50_Q2IFD0 Glycosyl transferase, family 2 n=5 Tax=Bacteria ... 129 3e-28 UniRef50_Q05S47 Group-specific protein n=1 Tax=Synechococcus sp.... 128 5e-28 UniRef50_C1F4G9 Polysaccharide deacetylase domain protein/glycos... 127 9e-28 UniRef50_A7GCY8 Glycosyl transferase, group 2 family n=66 Tax=Fi... 127 1e-27 UniRef50_D2L608 Glycosyl transferase family 2 n=1 Tax=Desulfovib... 126 2e-27 UniRef50_C5A6A1 Glycosyltransferase n=2 Tax=Thermococcus RepID=C... 126 2e-27 UniRef50_A9EU26 Group-specific protein n=3 Tax=Phaeobacter galla... 126 2e-27 UniRef50_Q2RGJ6 Glycosyl transferase, family 2 n=7 Tax=Firmicute... 125 2e-27 UniRef50_B7QVL5 Glycosyl transferase, family 2 n=1 Tax=Ruegeria ... 125 2e-27 UniRef50_B9MNQ5 Glycosyl transferase family 2 n=1 Tax=Anaerocell... 125 3e-27 UniRef50_B1V7Q3 Chitin synthase n=1 Tax=Clostridium perfringens ... 125 4e-27 UniRef50_B4D4P9 Polysaccharide deacetylase n=1 Tax=Chthoniobacte... 125 4e-27 UniRef50_C4WPV5 Putative uncharacterized protein n=1 Tax=Ochroba... 124 6e-27 UniRef50_A4YD58 Glycosyl transferase, family 2 n=12 Tax=Sulfolob... 124 6e-27 UniRef50_C7RAF6 Glycosyl transferase family 2 n=1 Tax=Kangiella ... 124 7e-27 UniRef50_Q46B82 Glucosaminyltransferase n=2 Tax=Methanosarcina R... 123 1e-26 UniRef50_B9LWG0 Glycosyl transferase family 2 n=1 Tax=Halorubrum... 123 2e-26 UniRef50_B9YCC6 Putative uncharacterized protein n=1 Tax=Holdema... 122 2e-26 UniRef50_UPI0001C30EDA glycosyl transferase family 2 n=1 Tax=Con... 121 5e-26 UniRef50_C4I9P5 Inner membrane glycosyltransferase n=39 Tax=Bact... 120 1e-25 UniRef50_D1BHB0 Glycosyl transferase n=1 Tax=Sanguibacter keddie... 119 2e-25 UniRef50_B1HSU1 Biofilm PIA synthesis N-glycosyltransferase icaA... 119 3e-25 UniRef50_A6FMF5 N-acetylglucosaminyltransferase n=1 Tax=Roseobac... 119 3e-25 UniRef50_B4U2G9 Glycosyl transferase, group 2 family protein n=2... 119 3e-25 UniRef50_P96587 Uncharacterized glycosyltransferase ydaM n=6 Tax... 118 4e-25 UniRef50_A6DGB3 Glycosyl transferase, family 2 n=3 Tax=Lentispha... 118 4e-25 UniRef50_Q988K9 Mlr6694 protein n=1 Tax=Mesorhizobium loti RepID... 118 5e-25 UniRef50_Q46CB1 Glycosyltransferase n=2 Tax=Methanosarcina RepID... 116 2e-24 UniRef50_C7XH18 Glycosyltransferase n=1 Tax=Lactobacillus crispa... 115 2e-24 UniRef50_A6CMH6 Glycosyl transferase, group 2 family protein n=2... 115 3e-24 UniRef50_A1VIY0 Glycosyl transferase, family 2 n=1 Tax=Polaromon... 115 5e-24 UniRef50_A3Q6L3 Glycosyl transferase, family 2 n=6 Tax=Actinomyc... 114 5e-24 UniRef50_Q03HA4 Glycosyltransferase n=2 Tax=Bacilli RepID=Q03HA4... 113 2e-23 UniRef50_UPI00017891C0 glycosyl transferase family 2 n=1 Tax=Geo... 113 2e-23 UniRef50_C0GFE2 Glycosyl transferase family 2 n=1 Tax=Dethiobact... 111 4e-23 UniRef50_Q0AV05 Glycosyltransferase n=1 Tax=Syntrophomonas wolfe... 110 9e-23 UniRef50_Q97FY5 Glycosyltransferase n=4 Tax=Clostridiales RepID=... 109 2e-22 UniRef50_A4CLA3 Glycosyl transferase, group 2 family protein n=2... 108 5e-22 UniRef50_D1B804 Glycosyl transferase family 2 n=2 Tax=Bacteria R... 107 6e-22 UniRef50_C1TQW8 Glycosyl transferase n=1 Tax=Dethiosulfovibrio p... 107 7e-22 UniRef50_Q2S5Y1 Glycosyl transferase, group 2 family protein n=1... 107 7e-22 UniRef50_C9RLY4 Glycosyl transferase family 2 n=1 Tax=Fibrobacte... 107 1e-21 UniRef50_D1N3C9 Glycosyl transferase family 2 n=1 Tax=Victivalli... 106 2e-21 UniRef50_D1CH68 Glycosyl transferase family 2 n=2 Tax=Bacteria R... 105 2e-21 UniRef50_B7I546 Glycosyltransferase n=12 Tax=Acinetobacter RepID... 105 2e-21 UniRef50_Q0AL04 Glycosyl transferase, family 2 n=3 Tax=Bacteria ... 105 3e-21 UniRef50_Q97G47 Glycosyltransferase n=1 Tax=Clostridium acetobut... 105 4e-21 UniRef50_A3I0Z1 Glycosyltransferase n=1 Tax=Algoriphagus sp. PR1... 104 6e-21 UniRef50_B0U7V3 Glycosyl transferase family 2 n=1 Tax=Methylobac... 104 8e-21 UniRef50_D1CD85 Glycosyl transferase family 2 n=1 Tax=Thermobacu... 104 9e-21 UniRef50_A5FCY6 Candidate polysaccharide-forming beta-glycosyltr... 103 1e-20 UniRef50_C0Z7U1 Conserved hypothetical membrane protein n=1 Tax=... 102 2e-20 UniRef50_A0LZM3 Transmembrane family-2 glycosyl transferase-poss... 102 2e-20 UniRef50_Q974Y9 Putative uncharacterized protein ST0523 n=2 Tax=... 102 2e-20 UniRef50_C7RTK3 Polysaccharide deacetylase n=2 Tax=Bacteria RepI... 102 2e-20 UniRef50_Q9KAQ6 BH2231 protein n=4 Tax=Bacillus RepID=Q9KAQ6_BACHD 102 3e-20 UniRef50_A1S0Z5 Glycosyl transferase, family 2 n=1 Tax=Thermofil... 102 3e-20 UniRef50_C7TL81 Glycosyl transferase, group 2 n=3 Tax=Bacilli Re... 102 4e-20 UniRef50_A0LSJ8 Glycosyl transferase, family 2 n=1 Tax=Acidother... 101 5e-20 UniRef50_C0G271 Glycosyl transferase family 2 n=1 Tax=Natrialba ... 101 6e-20 UniRef50_Q2RZV9 Putative glucosyltransferase n=1 Tax=Salinibacte... 101 7e-20 UniRef50_C7IGQ0 Glycosyl transferase family 2 n=2 Tax=Firmicutes... 100 1e-19 UniRef50_A9KK06 Glycosyl transferase family 2 n=12 Tax=Bacteria ... 100 1e-19 UniRef50_C9KN46 Chitin synthase n=1 Tax=Mitsuokella multacida DS... 100 2e-19 UniRef50_C6X1A6 Glycosyl transferase, family 2 n=3 Tax=Flavobact... 99 2e-19 UniRef50_Q9UY40 Glycosyl transferase, family 2 n=2 Tax=Thermococ... 99 3e-19 UniRef50_UPI0001C42DFC capsule anchoring protein CapD n=1 Tax=Ba... 99 3e-19 UniRef50_Q2LWC4 Glycosyltransferase n=1 Tax=Syntrophus aciditrop... 99 5e-19 UniRef50_A6LJJ1 Glycosyl transferase, family 2 n=2 Tax=Thermosip... 98 6e-19 UniRef50_A8MM07 Glycosyl transferase family 2 n=1 Tax=Alkaliphil... 98 7e-19 UniRef50_C7NQ00 Glycosyl transferase family 2 n=1 Tax=Halorhabdu... 97 9e-19 UniRef50_C6A3S6 Putative glycosyl transferase n=1 Tax=Thermococc... 97 1e-18 UniRef50_C2EFI6 Glycosyltransferase n=1 Tax=Lactobacillus saliva... 97 1e-18 UniRef50_Q0IE23 Glycosyl transferase, group 2 family protein n=1... 97 1e-18 UniRef50_C6J2E9 Glycosyl transferase n=1 Tax=Paenibacillus sp. o... 97 1e-18 UniRef50_C4Z6K0 Putative uncharacterized protein n=1 Tax=Eubacte... 97 2e-18 UniRef50_B2UMM8 Glycosyl transferase family 2 n=3 Tax=Verrucomic... 97 2e-18 UniRef50_B9MDW0 Glycosyl transferase family 2 n=1 Tax=Diaphoroba... 97 2e-18 UniRef50_B8E0Z1 Glycosyl transferase family 2 n=1 Tax=Dictyoglom... 97 2e-18 UniRef50_A5GEA8 Glycosyl transferase, family 2 n=3 Tax=Deltaprot... 97 2e-18 UniRef50_C0A1N7 Glycosyl transferase, family 2 n=1 Tax=Opitutace... 96 2e-18 UniRef50_A7INQ0 Glycosyl transferase family 2 n=11 Tax=Rhizobial... 96 2e-18 UniRef50_C7ZM19 Putative uncharacterized protein n=1 Tax=Nectria... 96 3e-18 UniRef50_A0QPQ1 Chitin synthase n=1 Tax=Mycobacterium smegmatis ... 96 4e-18 UniRef50_C7TCU0 Phage-related glycosyl transferase, group 2 n=1 ... 96 4e-18 UniRef50_A9KYN4 Glycosyl transferase family 2 n=24 Tax=Gammaprot... 95 5e-18 UniRef50_D1JE47 Conserved hypothetical membrane protein, glycosy... 95 5e-18 UniRef50_B1YJV4 Glycosyl transferase family 2 n=4 Tax=Bacillales... 95 6e-18 UniRef50_UPI000038DF5C N-acetylglucosaminyltransferase n=1 Tax=F... 94 8e-18 UniRef50_Q110Z2 Glycosyl transferase, family 2 n=15 Tax=Cyanobac... 94 8e-18 UniRef50_C0ZQD3 Putative glycosyltransferase n=2 Tax=Rhodococcus... 94 1e-17 UniRef50_C0QGC2 NodC n=2 Tax=Desulfobacterium autotrophicum HRM2... 94 1e-17 UniRef50_C6MI20 Glycosyl transferase family 2 n=1 Tax=Nitrosomon... 94 1e-17 UniRef50_D1U9V0 Hyaluronan synthase n=1 Tax=Desulfovibrio aespoe... 93 2e-17 UniRef50_UPI0001C31B0A glycosyl transferase family 2 n=1 Tax=Con... 93 2e-17 UniRef50_A4CJ64 Glycosyltransferase n=15 Tax=Bacteroidetes RepID... 93 2e-17 UniRef50_C7QK51 Glycosyl transferase family 2 n=1 Tax=Catenulisp... 93 2e-17 UniRef50_C7M4N0 Glycosyl transferase family 2 n=4 Tax=Flavobacte... 93 2e-17 UniRef50_B2GJB7 Putative glycosyltransferase n=2 Tax=Micrococcin... 93 2e-17 UniRef50_D2S449 Glycosyl transferase family 2 n=1 Tax=Geodermato... 93 2e-17 UniRef50_B7HT47 Glycosyl transferase family 2 n=8 Tax=Bacteria R... 93 2e-17 UniRef50_Q7NHH7 Glr2559 protein n=1 Tax=Gloeobacter violaceus Re... 93 3e-17 UniRef50_Q03RZ5 Glycosyltransferase n=3 Tax=Lactobacillus RepID=... 93 3e-17 UniRef50_B7KYJ4 Glycosyl transferase family 2 n=2 Tax=Methylobac... 92 3e-17 UniRef50_A5FAD6 Candidate beta-glycosyltransferase; Glycosyltran... 92 3e-17 UniRef50_Q74D65 Glycosyl transferase, group 2 family protein n=1... 92 3e-17 UniRef50_A9QSM4 N-acetylglucosaminyltransferase n=39 Tax=Lactoba... 92 4e-17 UniRef50_B9AE19 Putative uncharacterized protein n=1 Tax=Methano... 92 4e-17 UniRef50_C1F7J6 Glycosyl transferase, group 2 family n=1 Tax=Aci... 92 4e-17 UniRef50_C7PF18 Glycosyl transferase family 2 n=9 Tax=Bacteroide... 92 5e-17 UniRef50_UPI000178A1DC glycosyl transferase family 2 n=1 Tax=Geo... 91 6e-17 UniRef50_Q4KHI6 Glycosyl transferase, group 2 family protein n=1... 91 7e-17 UniRef50_C9NZ15 Putative uncharacterized protein n=1 Tax=Vibrio ... 91 7e-17 UniRef50_A0LP64 Glycosyl transferase, family 2 n=1 Tax=Syntropho... 91 8e-17 UniRef50_D2QUU5 Glycosyl transferase family 2 n=1 Tax=Spirosoma ... 91 9e-17 UniRef50_A3HSN4 Glycosyl transferase, group 2 family protein n=1... 91 1e-16 UniRef50_A0LZL8 Transmembrane family-2 glycosyl transferase-poss... 91 1e-16 UniRef50_B5W069 Glycosyl transferase family 2 n=3 Tax=Arthrospir... 91 1e-16 UniRef50_B0C9M4 Glycosyl transferase, family 2 n=1 Tax=Acaryochl... 91 1e-16 UniRef50_B3QIT3 Glycosyl transferase family 2 n=6 Tax=Bradyrhizo... 90 2e-16 UniRef50_Q1IMJ5 Glycosyl transferase, family 2 n=10 Tax=cellular... 90 2e-16 UniRef50_B5IUE6 Glycosyl transferase, group 2 family protein n=1... 90 2e-16 UniRef50_Q1D6P4 Glycosyl transferase, group 2 n=3 Tax=Cystobacte... 89 3e-16 UniRef50_D1N5H0 Glycosyl transferase family 2 n=1 Tax=Victivalli... 89 3e-16 UniRef50_A8VSD1 Binding-protein-dependent transport systems inne... 89 3e-16 UniRef50_B3DW74 Glycosyltransferase n=1 Tax=Methylacidiphilum in... 89 4e-16 UniRef50_Q03MS9 Glycosyltransferase, probably involved in cell w... 88 6e-16 UniRef50_A1ULY3 Glycosyl transferase, family 2 n=4 Tax=Mycobacte... 88 6e-16 UniRef50_D2EES4 Glycosyl transferase family 2 n=1 Tax=Candidatus... 87 1e-15 UniRef50_Q04QL4 Glycosyltransferase plus another conserved domai... 87 1e-15 UniRef50_C3LL95 Chitin synthase n=10 Tax=Bacilli RepID=C3LL95_BACAC 87 1e-15 UniRef50_UPI0001C1680B Glycosyl transferase, family 2 n=2 Tax=No... 87 1e-15 UniRef50_C6VR67 Glycosyltransferase (Putative) n=7 Tax=Lactobaci... 87 2e-15 UniRef50_A2SR79 Glycosyl transferase, family 2 n=1 Tax=Methanoco... 86 2e-15 UniRef50_D2S4V1 Glycosyl transferase family 2 n=1 Tax=Geodermato... 86 2e-15 UniRef50_A7HQK0 Glycosyl transferase family 2 n=1 Tax=Parvibacul... 86 2e-15 UniRef50_B7HFD6 N-acetyllactosaminide beta-1,6-N-acetylglucosami... 86 2e-15 UniRef50_Q0AEM9 Cellulose synthase (UDP-forming) n=2 Tax=Nitroso... 86 3e-15 UniRef50_C6HYN9 Glycosyl transferase n=1 Tax=Leptospirillum ferr... 86 3e-15 UniRef50_Q2JKN6 Glycosyl transferase, group 2 family protein n=2... 86 3e-15 UniRef50_Q6KZU9 N-acetylglucosaminyltransferase n=1 Tax=Picrophi... 86 3e-15 UniRef50_B4WM17 Glycosyl transferase, group 2 family protein n=1... 86 4e-15 UniRef50_D2YGE5 Putative uncharacterized protein n=1 Tax=Vibrio ... 86 4e-15 UniRef50_Q1PUL2 Putative uncharacterized protein n=1 Tax=Candida... 85 5e-15 UniRef50_C1ZJA9 Glycosyl transferase n=1 Tax=Planctomyces limnop... 85 6e-15 UniRef50_A8YGM3 Similarity to the N-terminal part of Q3MEW3_ANAV... 85 6e-15 UniRef50_A9B149 Glycosyl transferase family 2 n=1 Tax=Herpetosip... 84 9e-15 UniRef50_A0Q541 Glycosyl transferase n=3 Tax=Francisella novicid... 84 1e-14 UniRef50_D1BIJ9 Glycosyl transferase n=2 Tax=Micrococcineae RepI... 84 1e-14 UniRef50_A5GQ06 Glycosyltransferase of family GT2; possible proc... 84 1e-14 UniRef50_C9NBS6 Glycosyl transferase family 2 n=1 Tax=Streptomyc... 83 2e-14 UniRef50_P74165 Sll1377 protein n=9 Tax=Cyanobacteria RepID=P741... 83 2e-14 UniRef50_A9FCX3 Glycosyltransferase n=1 Tax=Sorangium cellulosum... 83 2e-14 UniRef50_Q6ACB6 Glucosaminyltransferase n=1 Tax=Leifsonia xyli s... 83 2e-14 UniRef50_D2QHQ3 Glycosyl transferase family 2 n=1 Tax=Spirosoma ... 83 2e-14 UniRef50_Q2S964 Glycosyltransferase, probably involved in cell w... 83 2e-14 UniRef50_A9FZQ2 Glycosyltransferase n=1 Tax=Sorangium cellulosum... 83 2e-14 UniRef50_A4SFN6 Glycosyl transferase, family 2 n=1 Tax=Chlorobiu... 83 2e-14 UniRef50_D1X4H0 Glycosyl transferase family 2 n=2 Tax=Streptomyc... 83 3e-14 UniRef50_D1RA94 Putative uncharacterized protein n=1 Tax=Parachl... 82 3e-14 UniRef50_B7HZB4 Beta-1,3-N-acetylglucosaminyltransferase n=37 Ta... 82 3e-14 UniRef50_A6VAQ9 MigA n=16 Tax=Pseudomonas RepID=A6VAQ9_PSEA7 82 3e-14 UniRef50_A5G0G3 Glycosyl transferase, family 2 n=1 Tax=Acidiphil... 82 3e-14 UniRef50_B8HDP3 Glycosyl transferase family 2 n=1 Tax=Arthrobact... 82 3e-14 UniRef50_A6A3G7 Chitin synthase n=2 Tax=Vibrio cholerae RepID=A6... 82 3e-14 UniRef50_D0MHS9 Glycosyl transferase family 2 n=1 Tax=Rhodotherm... 82 4e-14 UniRef50_A5D278 Glycosyltransferases n=1 Tax=Pelotomaculum therm... 82 4e-14 UniRef50_C5SBP2 Glycosyl transferase family 2 n=1 Tax=Allochroma... 82 4e-14 UniRef50_C6D3F1 Glycosyl transferase family 2 n=10 Tax=Bacteria ... 82 4e-14 UniRef50_C4RQS0 Glycosyl transferase n=1 Tax=Micromonospora sp. ... 82 5e-14 UniRef50_D1SM82 Glycosyl transferase family 2 n=1 Tax=Methanocal... 82 6e-14 UniRef50_Q93DZ7 Eps1I n=1 Tax=Streptococcus thermophilus RepID=Q... 81 6e-14 UniRef50_A8AKI2 Putative uncharacterized protein n=1 Tax=Citroba... 81 8e-14 UniRef50_D0DDK6 Glycosyl transferase, family 2 n=1 Tax=Citreicel... 81 8e-14 UniRef50_C0GHH9 Glycosyl transferase family 2 n=1 Tax=Dethiobact... 81 9e-14 UniRef50_C6N2D6 Glucosaminyltransferase n=1 Tax=Legionella dranc... 80 1e-13 UniRef50_Q8YUP7 Glucosyltransferase n=2 Tax=Cyanobacteria RepID=... 80 2e-13 UniRef50_B8J3S6 Glycosyl transferase family 2 n=1 Tax=Desulfovib... 80 2e-13 UniRef50_A3MTD6 Glycosyl transferase, family 2 n=4 Tax=Thermopro... 80 2e-13 UniRef50_UPI0001BC6092 glycosyl transferase, group 2 family prot... 80 2e-13 UniRef50_Q0IA52 Glycosyl transferase, group 2 family protein dom... 80 2e-13 UniRef50_Q2W2R8 Cellulose synthase catalytic subunit n=5 Tax=Rho... 80 2e-13 UniRef50_C4KBM8 Glycosyl transferase family 2 n=6 Tax=Betaproteo... 80 2e-13 UniRef50_C5CI31 Glycosyl transferase family 2 n=1 Tax=Kosmotoga ... 79 2e-13 UniRef50_Q0EZB0 Glycosyl transferase, family 2 n=1 Tax=Mariprofu... 79 3e-13 UniRef50_B2A4H6 Glycosyl transferase family 2 n=1 Tax=Natranaero... 79 3e-13 UniRef50_B7K6R4 Glycosyl transferase family 2 n=2 Tax=Cyanothece... 79 3e-13 UniRef50_C6LFR4 Glycosyl transferase n=5 Tax=Clostridiales RepID... 79 3e-13 UniRef50_D2QCG6 Glycosyl transferase family 2 n=1 Tax=Spirosoma ... 79 3e-13 UniRef50_B7JW84 Glycosyl transferase family 2 n=2 Tax=Cyanothece... 79 3e-13 UniRef50_D1JHN4 Conserved hypothetical membrane protein, glycosy... 79 4e-13 UniRef50_A7HJL2 Glycosyl transferase family 2 n=1 Tax=Fervidobac... 79 4e-13 UniRef50_B2IY03 Glycosyl transferase, family 2 n=1 Tax=Nostoc pu... 79 4e-13 UniRef50_B4BNC7 CDP-glycerol:poly(Glycerophosphate) glycerophosp... 79 4e-13 UniRef50_A7HN17 Glycosyl transferase family 2 n=1 Tax=Fervidobac... 79 4e-13 UniRef50_C9RRD2 Glycosyl transferase family 2 n=1 Tax=Fibrobacte... 79 5e-13 UniRef50_A4WR49 Cellulose synthase (UDP-forming) n=6 Tax=Rhodoba... 79 5e-13 >UniRef50_P75905 Biofilm PGA synthesis N-glycosyltransferase pgaC n=146 Tax=Bacteria RepID=PGAC_ECOLI Length = 441 Score = 910 bits (2352), Expect = 0.0, Method: Compositional matrix adjust. Identities = 441/441 (100%), Positives = 441/441 (100%) Query: 1 MINRIVSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHW 60 MINRIVSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHW Sbjct: 1 MINRIVSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHW 60 Query: 61 PWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRA 120 PWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRA Sbjct: 61 PWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRA 120 Query: 121 ILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPM 180 ILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPM Sbjct: 121 ILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPM 180 Query: 181 LYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRS 240 LYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRS Sbjct: 181 LYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRS 240 Query: 241 ALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGG 300 ALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGG Sbjct: 241 ALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGG 300 Query: 301 AEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIA 360 AEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIA Sbjct: 301 AEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIA 360 Query: 361 ATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRV 420 ATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRV Sbjct: 361 ATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRV 420 Query: 421 MLMPKKQRARWVSPDRGILRG 441 MLMPKKQRARWVSPDRGILRG Sbjct: 421 MLMPKKQRARWVSPDRGILRG 441 >UniRef50_B2UK24 Glycosyl transferase family 2 n=10 Tax=Burkholderiales RepID=B2UK24_RALPJ Length = 425 Score = 420 bits (1079), Expect = e-116, Method: Compositional matrix adjust. Identities = 207/407 (50%), Positives = 277/407 (68%), Gaps = 2/407 (0%) Query: 33 FVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAP-QLKDNPSISIIIPCFNEEKNVE 91 FVF++PFFMS W++GG+ + ER ++ A +K P +SII+PC+NEE NV Sbjct: 13 FVFYYPFFMSYFWMIGGLLHYFLLERGTRRIQHPLALLGVKTYPKVSIIVPCYNEEANVR 72 Query: 92 ETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGA 151 E I RY N ++IAVNDGS+D+T L+ +AAQ P L VIH + NQGKAI L T A Sbjct: 73 EVISHLARMRYPNYDIIAVNDGSSDRTGERLNELAAQYPQLVVIHQSSNQGKAIGLTTAA 132 Query: 152 AAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGEYS 211 +EYL+CIDGD++LD DA A+++ ++ NP VGAVTGNPRIRTRSTL+G++QVGE+S Sbjct: 133 QVTDAEYLMCIDGDSILDVDAIAWMIRHLMENPAVGAVTGNPRIRTRSTLLGRMQVGEFS 192 Query: 212 SIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIF 271 SI+GLIKRTQ++YG + TVSGV+ FR+ A+ EVGYWS+DM+TEDIDISWKLQ+ WTI Sbjct: 193 SIVGLIKRTQQVYGRLLTVSGVVVMFRKRAIEEVGYWSNDMLTEDIDISWKLQVGGWTIR 252 Query: 272 YEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTI 331 YEPRAL WILMPET +GL+KQRLRWA+GG + +K + MWP+FFEY L+ + Sbjct: 253 YEPRALSWILMPETFRGLYKQRLRWAKGGIQALIKYAPAMLSLRQSMMWPIFFEYALSVV 312 Query: 332 WAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNL 391 WA+ L+ I ++V V + E A G LL C+LQ ++ I+ RY+ + Sbjct: 313 WAYNMLL-VITWSVLGLFVDMPPEWRMEAFPRWHGTLLFITCVLQLLIGCFIDRRYDDGI 371 Query: 392 TSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRGI 438 +W+PV FW+L+L TT++ F V ++ RARW SPDRGI Sbjct: 372 LRYFIDTVWYPVAFWILNLVTTVIGFPAVAFQRQRARARWTSPDRGI 418 >UniRef50_A4SIQ6 Glycosyl transferase, family 2 n=6 Tax=Gammaproteobacteria RepID=A4SIQ6_AERS4 Length = 416 Score = 407 bits (1046), Expect = e-112, Method: Compositional matrix adjust. Identities = 205/408 (50%), Positives = 276/408 (67%), Gaps = 12/408 (2%) Query: 33 FVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQ---LKDNPSISIIIPCFNEEKN 89 F +P M+++WI GG+Y++ W + A A L P +++++PC+NE N Sbjct: 11 FTLAYPSMMAMVWICGGLYYYF----QWEQRDIALARTGLVLPRYPKVTLMVPCYNEGAN 66 Query: 90 VEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKT 149 VEETI L QRY ++EV+A+NDGS D T LDR+AAQ L V+H NQGKA+AL Sbjct: 67 VEETISHLLRQRYPDLEVLAINDGSKDDTGQRLDRLAAQDARLTVLH-QHNQGKAMALNN 125 Query: 150 GAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGE 209 G AK E LV IDGDA+LD DA ++V+ + +P+VGAVTGNPR+RTRST++GKIQ GE Sbjct: 126 GLERAKGEILVGIDGDAVLDHDAVRWMVKHFIESPKVGAVTGNPRVRTRSTIIGKIQTGE 185 Query: 210 YSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWT 269 +SSIIGLIKR QRIYG VFTVSGV+ AFR+SA+ VG WS DM+TEDIDISWKLQL W Sbjct: 186 FSSIIGLIKRAQRIYGTVFTVSGVVVAFRKSAVEAVGGWSTDMVTEDIDISWKLQLAGWL 245 Query: 270 IFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLT 329 I Y+P+ALCW+LMPET++GL+KQRLRWAQGGAEV L+ + R N W L EY + Sbjct: 246 IHYQPQALCWVLMPETVRGLYKQRLRWAQGGAEVILRYGLQAMRWRNRHFWLLLLEYLSS 305 Query: 330 TIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEH 389 +W ++ L+ +I+ ++ P EL G+++ +CLLQF VS+ I+N Y+ Sbjct: 306 VLWCYSMLLLALIWLIR----PDLGELAEGQLFQWTGVIMTAICLLQFGVSIFIDNHYDK 361 Query: 390 NLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRG 437 + S W IW+P+I+W+L+L T +V+ + +L + Q A W SPDRG Sbjct: 362 TMARSFVWCIWYPLIYWLLNLVTVIVAVPKAILRRRGQLAVWESPDRG 409 >UniRef50_Q30SK0 Glycosyl transferase, family 2 n=6 Tax=Proteobacteria RepID=Q30SK0_SULDN Length = 433 Score = 402 bits (1033), Expect = e-110, Method: Compositional matrix adjust. Identities = 200/418 (47%), Positives = 281/418 (67%), Gaps = 18/418 (4%) Query: 30 MMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPS---ISIIIPCFNE 86 M+ +VF++P FMS +W++G ++F+ E+ + P+L++N S +SI+IPC+NE Sbjct: 15 MLGYVFYYPLFMSTLWMIGAIFFYYKSEKRYV---EQSIPELRENESWAGVSILIPCYNE 71 Query: 87 EKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIA 146 +N ETI AL Y EVIA+NDGS D T IL +A P L+V++LAQNQGKA+A Sbjct: 72 GENAIETITYALDVIYPEFEVIAINDGSKDDTLDILLSLAKDNPRLKVVNLAQNQGKALA 131 Query: 147 LKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQ 206 L+ G+ AK E+LVCIDGDAL+D ++ + + P V AVTGNPRIR R++L+GKIQ Sbjct: 132 LQAGSLVAKHEFLVCIDGDALIDPYCLYWMAKHFIRYPEVAAVTGNPRIRNRTSLLGKIQ 191 Query: 207 VGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLN 266 VGE+SSI+G+IKR QR +G +FTVSGVI FR+SA+ EVGYWS DM+TEDIDI+WKLQ N Sbjct: 192 VGEFSSIVGMIKRAQRSFGRLFTVSGVITGFRKSAVHEVGYWSPDMLTEDIDITWKLQRN 251 Query: 267 QWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEY 326 W + +EP++L WILMPET+KGLWKQRLRWA GGA+V KN L+ + +W L E Sbjct: 252 GWDVRFEPKSLVWILMPETIKGLWKQRLRWAMGGAQVMFKNFKVLFLYKQTHLWGLMIEL 311 Query: 327 CLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHT------AGILLCTLCLLQFIVS 380 L+ +WA+T ++ +++ L + ++A + + ++L CL+QF VS Sbjct: 312 LLSMVWAYTMVLVILVWIFGLL-----FPIGYLAPSESPILPDQGSVILIGACLVQFGVS 366 Query: 381 LMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRGI 438 ++ Y+ L + FW+IW+P FW L+L T + + +V L KK RARWVSPDRG+ Sbjct: 367 KWLDGHYDEGLGKNYFWMIWYPFAFWFLNLFTAVAALPKV-LFGKKGRARWVSPDRGV 423 >UniRef50_A3N3L7 Biofilm PGA synthesis N-glycosyltransferase PgaC n=10 Tax=Pasteurellaceae RepID=A3N3L7_ACTP2 Length = 411 Score = 355 bits (911), Expect = 2e-96, Method: Compositional matrix adjust. Identities = 182/411 (44%), Positives = 259/411 (63%), Gaps = 3/411 (0%) Query: 28 ELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEE 87 E+ FVF +P M+ W G+ +++++E+ N + ++ P +S+++PC+NE Sbjct: 4 EIFSLFVFAYPAVMAFYWAFAGLTYFLFKEKL-KVPPNFDQMKHEEVPLVSLMVPCYNES 62 Query: 88 KNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIAL 147 N++E I L +Y N E+I +NDGS D T I+D+ A + + +H A N GKA AL Sbjct: 63 DNLDEAIPHLLNLKYPNYELIFINDGSKDHTGEIIDKWAKRDKRIVALHQA-NSGKASAL 121 Query: 148 KTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQV 207 G A+ +Y+ CIDGDA+LD A Y+V+ + NPR GAVTGNPR+R RST++G++QV Sbjct: 122 NNGLRIARGKYVGCIDGDAVLDYKALDYMVQALESNPRYGAVTGNPRVRNRSTILGRLQV 181 Query: 208 GEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQ 267 E+SSIIGLIKR Q + G +FTVSGV FR+ + E+G WS +MITEDID+SWK+Q + Sbjct: 182 SEFSSIIGLIKRAQCLMGTIFTVSGVCCLFRKDIMFEIGGWSTNMITEDIDVSWKIQTSG 241 Query: 268 WTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYC 327 + IFYEPRALCW+LMPET+ GL+KQRLRWAQGGAE +K ++WR +N R+WP+F EY Sbjct: 242 YDIFYEPRALCWVLMPETINGLFKQRLRWAQGGAETMMKYFPQIWRLKNRRLWPMFIEYI 301 Query: 328 LTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRY 387 +T IWA LV ++ L I L A + +L Q +SL I+NRY Sbjct: 302 VTAIWASLLLVSILLSIYNLI-FDNQIGLLDWAELKPSIAILFIAFFTQLSISLYIDNRY 360 Query: 388 EHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRGI 438 E + +F IW+P ++W L+ T L + + K + A W SPDRG+ Sbjct: 361 EKGVVKYVFSCIWYPWLYWSLNTITLLCGIPKAIFRNKTKLAVWTSPDRGV 411 >UniRef50_Q9RQP9 Biofilm PIA synthesis N-glycosyltransferase icaA n=63 Tax=Staphylococcaceae RepID=ICAA_STAA8 Length = 412 Score = 343 bits (879), Expect = 1e-92, Method: Compositional matrix adjust. Identities = 164/414 (39%), Positives = 259/414 (62%), Gaps = 18/414 (4%) Query: 32 RFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVE 91 F+ F+P FMSI WIVG +YF+ RE + + P + + I+ ++ C+NE + +E Sbjct: 5 NFLLFYPVFMSIYWIVGSIYFYFTREIRYSLNKK-PDINVDELEGITFLLACYNESETIE 63 Query: 92 ETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGA 151 +T+ LA +YE E+I +NDGS+D T ++ ++ + + L +N+GKA AL G Sbjct: 64 DTLSNVLALKYEKKEIIIINDGSSDNTAELIYKIKENNDFI-FVDLQENRGKANALNQGI 122 Query: 152 AAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGEYS 211 A +Y++C+D D ++D+DA Y++E ++P++GAVTGNPRIR +S+++GKIQ EY+ Sbjct: 123 KQASYDYVMCLDADTIVDQDAPYYMIENFKHDPKLGAVTGNPRIRNKSSILGKIQTIEYA 182 Query: 212 SIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIF 271 S+IG IKR+Q + G V T+SGV F++SA+ +VGYW DMITEDI +SWKL L + I Sbjct: 183 SLIGCIKRSQTLAGAVNTISGVFTLFKKSAVVDVGYWDTDMITEDIAVSWKLHLRGYRIK 242 Query: 272 YEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTI 331 YEP A+CW+L+PETL GLWKQR+RWAQGG EV L++ + + F ++ L FE ++ + Sbjct: 243 YEPLAMCWMLVPETLGGLWKQRVRWAQGGHEVLLRDFFSTMKTKRFPLYILMFEQIISIL 302 Query: 332 WAFTCL--VGFIIYAVQLAGVPLNIELTHIAATHTAGILLCT------LCLLQFIVSLMI 383 W + L +G++ L + T++ I L + + ++QF V+L I Sbjct: 303 WVYIVLLYLGYLFITANF--------LDYTFMTYSFSIFLLSSFTMTFINVIQFTVALFI 354 Query: 384 ENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRG 437 ++RYE + L ++ W+P ++W+++ A LV+F + + K A W SPDRG Sbjct: 355 DSRYEKKNMAGLIFVSWYPTVYWIINAAVVLVAFPKALKRKKGGYATWSSPDRG 408 >UniRef50_C8WT25 Glycosyl transferase family 2 n=3 Tax=Bacteria RepID=C8WT25_ALIAD Length = 410 Score = 342 bits (878), Expect = 1e-92, Method: Compositional matrix adjust. Identities = 194/419 (46%), Positives = 266/419 (63%), Gaps = 20/419 (4%) Query: 29 LMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEK 88 ++ F+F +PF MSI+W+VGG + RERH P+ E+ P L + P +SI+IPC NE Sbjct: 1 MVHSFLFLYPFVMSIVWMVGGCVYAWRRERH-PFAES---PDLPETPFVSILIPCHNEGD 56 Query: 89 NVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALK 148 +E+TI L Y E++A+NDGSTD TRA+L+RMAA +RV++L +GKA AL Sbjct: 57 VLEDTIGRMLQLDYPAYEIVALNDGSTDDTRAVLERMAACDARVRVVNLPVQRGKARALN 116 Query: 149 TGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLY--NPRVGAVTGNPRIRTRSTLVGKIQ 206 G A++ E LV +D DA+L +DA ++V + + RVGAVTGNPRIR R TL+GKIQ Sbjct: 117 AGLVASRGEILVTVDADAVLAKDALRFLVWHFVAPGSERVGAVTGNPRIRNRGTLLGKIQ 176 Query: 207 VGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLN 266 V EY+SIIGLIKR QR+ G + TVSGVIAAFR+ AL + G W +DM+T+DI +SWKL+ Sbjct: 177 VLEYASIIGLIKRAQRVLGKIMTVSGVIAAFRKRALVDCGMWDEDMVTDDIAVSWKLERR 236 Query: 267 QWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEY 326 W I YEPRALC++ PE L+ L +QR RWAQGG EV ++N + LW +N RM PL+ E Sbjct: 237 AWDIRYEPRALCFMWAPERLRSLIRQRARWAQGGVEVLIRNASVLWTWKNRRMIPLYVEE 296 Query: 327 CLTTIWAFTCLVGFI-IYAVQLA-GVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIE 384 L WA+ +V + A +A G+P T++A T G L L+Q V+L IE Sbjct: 297 LLGIAWAYLWVVSLVWTLAFDVAHGIP----WTYVAET---GTWLGLTSLVQTSVALWIE 349 Query: 385 NRYEHN-LTSSLFWIIWFPVIFWMLSLATTLVSFTR---VMLMPKKQR-ARWVSPDRGI 438 RYE + L F+ IW+P +WM+ + + + M ++ R A W SPDRG+ Sbjct: 350 QRYERDSLWRYYFYAIWYPAAYWMIGAFVVVWAVPKACWAMWAARRGRYATWKSPDRGV 408 >UniRef50_B9EAX5 Intercellular adhesion protein IcaA homolog n=19 Tax=Staphylococcaceae RepID=B9EAX5_MACCJ Length = 414 Score = 335 bits (858), Expect = 3e-90, Method: Compositional matrix adjust. Identities = 164/405 (40%), Positives = 247/405 (60%), Gaps = 5/405 (1%) Query: 33 FVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEE 92 F+ +P MS++WI+G V +++ ER + P P+ + IS +IPC+NE VEE Sbjct: 11 FLIVYPVLMSLVWIIGAVLYFLLIERRL---KQTP-PERDEYEGISFLIPCYNEADTVEE 66 Query: 93 TIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAA 152 T+ L Y E+I +NDGS+D T ++ + R I L QN+GKA AL T Sbjct: 67 TLVNVLKLSYPKKEIILINDGSSDNTAEKINALKDHY-DFRFIDLQQNRGKANALNTAVK 125 Query: 153 AAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSS 212 A+ ++++ ID D ++D A ++++ + P +GAVTGNPRIR +S+++GKIQ EY+S Sbjct: 126 HAQYDFVMGIDADTIIDDAAPYFMIDNFRHAPNLGAVTGNPRIRNKSSILGKIQTVEYAS 185 Query: 213 IIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFY 272 IIG IKR Q + G V T+SGV FR+SAL VGYW DDMITEDI +SWK ++ + I Y Sbjct: 186 IIGSIKRAQTLNGYVNTISGVFTLFRKSALEAVGYWDDDMITEDIAVSWKFHMHHYEIRY 245 Query: 273 EPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIW 332 EPRALCW+L+PET G+WKQRLRWAQGG EV L+ + ++ N +W L+ E ++ IW Sbjct: 246 EPRALCWMLVPETFTGIWKQRLRWAQGGHEVVLREFKSMLKQPNLPLWLLYLEQLISIIW 305 Query: 333 AFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLT 392 ++ L + + + ++ A I+L T+ ++QF VSL+I++RYE Sbjct: 306 VYSVLFLLAFTIINIDVLDYYFYAYNVNLLILASIILTTVNVIQFTVSLVIDSRYERKNI 365 Query: 393 SSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRG 437 LF++ W+P +W+++ + +F + + K + A W SPDRG Sbjct: 366 VYLFFLCWYPTFYWLINACVAVFAFPKALRRKKGEFATWSSPDRG 410 >UniRef50_A4J4W8 Glycosyl transferase, family 2 n=3 Tax=Clostridia RepID=A4J4W8_DESRM Length = 425 Score = 277 bits (708), Expect = 6e-73, Method: Compositional matrix adjust. Identities = 148/414 (35%), Positives = 240/414 (57%), Gaps = 14/414 (3%) Query: 33 FVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEE 92 F F +P MSI+W++GG+YFW YRER P++ P ++I+IPC NEE ++ Sbjct: 17 FTFLYPVTMSIIWVIGGLYFWWYRERK---ARTNNWPEVW--PPVTILIPCHNEEISIAT 71 Query: 93 TIHAALAQRYENIEVIAVNDGSTDKTRAIL-DRMAAQIPHLRVIHLAQNQGKAIALKTG- 150 T HA Y ++ V+ ++D STD T I+ + + ++P ++ L NQGKA AL G Sbjct: 72 TCHALSKVNYPDLRVVFIDDASTDNTAQIIREWLRQEVPSFHLLRLTTNQGKAKALNCGL 131 Query: 151 AAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGEY 210 A + V ID D L+ D +++ P + PR+GAV+GNP + R L+ +Q E+ Sbjct: 132 QVAVHTPITVVIDADTLITPDTLKWLIAPFIRQPRLGAVSGNPLVGNRENLLENLQTAEF 191 Query: 211 SSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTI 270 +SI+GLIKR+QR G + TVSG I AF L ++G +S TEDIDI+W +Q N + + Sbjct: 192 ASILGLIKRSQRSLGRMLTVSGCITAFCTDTLRQLGGFSSRSATEDIDITWAIQRNFYEV 251 Query: 271 FYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLK--NMTRLWRKENFRMWPLFFEYCL 328 ++EPRA+ +I +P+T+K W QR RWA GG + ++ WR R+WP++ ++ + Sbjct: 252 WFEPRAIAYIQVPKTIKEFWHQRCRWALGGWHLLRSHWDIFTHWRWR--RLWPVYLDFVI 309 Query: 329 TTIWAFTCLVGFIIYAVQ-LAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRY 387 + +W+F ++G +++ V L I LT I A + G ++ +CL+QF +L+ +R+ Sbjct: 310 SYLWSFCLVIGTLLWLVTYLIPSKPAIGLTPIPAWY--GSVISFVCLVQFGAALLANHRH 367 Query: 388 EHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRGILRG 441 +H + S FWI W+P+ F+ + T + + R + +W SP R ++G Sbjct: 368 DHKMYQSFFWIPWYPIFFFCIGALTVVWTSCRGLFGDLVTVGKWKSPARSSMQG 421 >UniRef50_C9RCL0 Glycosyl transferase family 2 n=1 Tax=Ammonifex degensii KC4 RepID=C9RCL0_AMMDK Length = 429 Score = 263 bits (672), Expect = 1e-68, Method: Compositional matrix adjust. Identities = 149/411 (36%), Positives = 232/411 (56%), Gaps = 10/411 (2%) Query: 29 LMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEK 88 L+ ++F +PF MS++W+ G+YFW RER P E P L P ++I+IPC NE Sbjct: 9 LLGWYIFLYPFAMSVLWVTAGLYFWWRRERK-PEREKKFWPDLW--PPVTILIPCHNEAA 65 Query: 89 NVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALK 148 ++ T A Y + V+ ++D STD+T ++ R P+ ++ LA+NQGKA AL Sbjct: 66 SIAATATALQFLDYPDYRVVFIDDASTDETAEVIRRFLPSNPNFHLLRLAENQGKAQALN 125 Query: 149 TG-AAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQV 207 A A + V ID D LL A Y+V P PR+GAVTGNP R L+ K+Q Sbjct: 126 CALATAVTTPITVVIDADTLLAPQALKYLVAPFCRQPRLGAVTGNPIALNRKNLLEKLQA 185 Query: 208 GEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQ 267 E++SIIGLIKR QR+ G V TVSG IAA+R L EVG +S TEDIDI+W++Q Sbjct: 186 AEFASIIGLIKRAQRVLGRVLTVSGCIAAYRTEVLREVGGFSSRTATEDIDITWQIQRRF 245 Query: 268 WTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFL--KNMTRLWRKENFRMWPLFFE 325 + +++ P+A+ +I P LK WKQR RWA GG + K++ + WR ++P++ E Sbjct: 246 YEVWFAPQAVAFIQCPSRLKEYWKQRRRWALGGWHLLRTHKDIFKRWRWRY--LYPVYLE 303 Query: 326 YCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIEN 385 + L+ +W+F + G +++ V + + I + G +L C++Q +L++ Sbjct: 304 FVLSYLWSFAFVFGTLLWLVTRLFWGQPVGFSPIPVWY--GAMLSLACIVQIATALVLNR 361 Query: 386 RYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 RY+ L ++FW+ W+P+ F++ T + + + + Q +W SP+R Sbjct: 362 RYDPKLWRTVFWVPWYPLFFFVFGALTVVWTAPKGLFGSLAQAGKWKSPER 412 >UniRef50_Q1IL87 Glycosyl transferase, family 2 n=1 Tax=Candidatus Koribacter versatilis Ellin345 RepID=Q1IL87_ACIBL Length = 422 Score = 251 bits (642), Expect = 3e-65, Method: Compositional matrix adjust. Identities = 139/399 (34%), Positives = 224/399 (56%), Gaps = 16/399 (4%) Query: 31 MRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNV 90 +RF +P MS +WI ++F R R E+ P +SI++P F E + + Sbjct: 26 LRFYGLYPILMSWVWISLSLFF---RRRQ----EDTEMEMSGPAPMVSILVPAFAEAETI 78 Query: 91 EETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTG 150 ++TI A L Y N EVI VND S D T ++ R P +R+++ N+GKA+AL Sbjct: 79 DDTIEALLKLDYPNYEVILVNDCSPDNTAEVV-RQYLDDPRIRLLNKQVNEGKAMALNDA 137 Query: 151 AAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGEY 210 + E LV ID D ++ RD Y+V P RV AVTGNPR+R R +++ +Q E+ Sbjct: 138 LPMCRGEILVVIDADIIVSRDLLNYMV-PHFAGTRVAAVTGNPRVRNRVSILQHLQAVEF 196 Query: 211 SSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTI 270 SSI+ + +R QR+ G V TVSG + A RRSAL E+G ++ M TEDID++W+LQ+ W + Sbjct: 197 SSIVSMQRRAQRVLGRVLTVSGAVFAVRRSALLELGGFTPHMATEDIDLTWRLQMKFWDV 256 Query: 271 FYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTT 330 YEPRA+ W+ +P +L+ LWKQR RWA+G +V ++ + RMWP+F+E + Sbjct: 257 RYEPRAVVWMQVPLSLRELWKQRKRWARGLVQVLKRHREVPTNWKMRRMWPIFYESIFSI 316 Query: 331 IWA--FTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYE 388 +W+ F + + + ++ + P + + G+++ T CLLQ + ++ +Y+ Sbjct: 317 LWSYVFVLMTSYWLISLAVGYAPRGVS----PFPNFWGMMIATTCLLQLFIGAWVDRQYD 372 Query: 389 HNLTSSLFWIIWFPVIFWML-SLATTLVSFTRVMLMPKK 426 + S +++PVI+WML +L T+ + + P + Sbjct: 373 PGIMWSFPEAVFYPVIYWMLMALITSFYTIPALFKKPPR 411 >UniRef50_C0WP52 Possible VGP family vectorial glycosyl polymerization protein n=5 Tax=Lactobacillus RepID=C0WP52_LACBU Length = 504 Score = 229 bits (583), Expect = 2e-58, Method: Compositional matrix adjust. Identities = 122/373 (32%), Positives = 209/373 (56%), Gaps = 16/373 (4%) Query: 72 KDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPH 131 K+ +S+++P NE + E + + Y IE+I +NDGS D+T A++ R+ + Sbjct: 142 KEAELVSVLVPAHNEADTLGEVVDSIANLVYPRIELILINDGSQDETLAVMHRLVQKYSK 201 Query: 132 ---LRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGA 188 ++V+ + N+GKA AL GA A+ ++L+C+D D +D++ ++ NP+VGA Sbjct: 202 HFVVKVVDIQPNKGKANALNYGAQVAEGDFLLCLDADCYVDKNVLNPMMAQFYNNPKVGA 261 Query: 189 VTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQR-IYGNVFTVSGVIAAFRRSALAEVGY 247 V G P +R R++++G++++ EY +I +IKR Q + G + TVSGV+ A+R++ALA VG+ Sbjct: 262 VAGKPIVRNRTSILGRLELLEYVGVIDIIKRGQAFVTGRITTVSGVVVAYRKAALANVGW 321 Query: 248 WSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKN 307 W+ + ITEDID++W+L +QW + Y P+A+ WIL+P+ +K L QR RWA+GG EV +N Sbjct: 322 WNSEAITEDIDVTWRLYHHQWQVAYCPQAVAWILVPDHIKDLIHQRRRWARGGFEVLFRN 381 Query: 308 MTRLWRKENFRMWPLFFEYCLTTIWAFTC--LVGFIIYAVQLAGVPLNIELTHIAATHTA 365 L W L + ++ +WA +C + F ++ V G ++ Sbjct: 382 RGMLVTGRLSEQW-LLLDMIISDLWAISCSFVTLFYLFTVAFTG--------YLIMDGVV 432 Query: 366 GILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPK 425 LL + L+QF+V + + L + + V +W ++L + L + L P Sbjct: 433 LFLLLIISLVQFLVGYTDSKKSDFINWRDLLLLPLYVVFYWFINLISCLTALGSFFLDP- 491 Query: 426 KQRARWVSPDRGI 438 + W SPDRG+ Sbjct: 492 RHVGSWHSPDRGL 504 >UniRef50_Q9CHP7 Glycosyl transferase n=3 Tax=Lactococcus lactis RepID=Q9CHP7_LACLA Length = 354 Score = 224 bits (570), Expect = 6e-57, Method: Compositional matrix adjust. Identities = 121/351 (34%), Positives = 200/351 (56%), Gaps = 19/351 (5%) Query: 33 FVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEE 92 +V ++P +S +W G + ++ER + + K+ ISI+I +NEE+ +EE Sbjct: 10 YVVYYPIILSFIW-ASGAFLSRWKER-----DKKAIDETKEFEKISIVISAYNEEETIEE 63 Query: 93 TIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPH---LRVIHLAQNQGKAIALKT 149 + + Y ++E+ V+D S+D+T L + L ++ +N+GKA AL Sbjct: 64 VLLSLRNLNYPSLEIFIVDDKSSDRTLEKLYEVKKSFDDWETLTILEQKENKGKATALNV 123 Query: 150 GAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGE 209 S+Y++ ID D+ L +DA +Y++ + + +GAVTG P +R R+TL+GK+Q E Sbjct: 124 ALQQVNSKYMLVIDADSYLSKDALSYLLAELTSSSDIGAVTGRPIVRNRTTLLGKLQTLE 183 Query: 210 YSSIIGLIKRTQRIY-GNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQW 268 Y S+I IKR Q ++ G + TVSGVI +R AL E+G ++ +++TEDID++W+L NQW Sbjct: 184 YLSVIDAIKRAQSLFLGAIMTVSGVIVMYRVEALKEIGGFNTEVMTEDIDVTWRLHRNQW 243 Query: 269 TIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCL 328 + Y P+AL +IL+PE LK L KQR RWA GG EV + N++ ++++ F+ L E Sbjct: 244 KVKYVPKALSYILVPENLKSLLKQRRRWAVGGVEVLISNLSWVFKRGEFKHRFLLIEMIC 303 Query: 329 TTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIV 379 + IW++ L+ Y +Q+ LN E +G ++ L+ F V Sbjct: 304 SHIWSWFFLIESYQYFLQML---LNQEF------KVSGQIIVIFVLISFFV 345 >UniRef50_Q38W83 Putative glycosyl transferase, family 2 n=1 Tax=Lactobacillus sakei subsp. sakei 23K RepID=Q38W83_LACSS Length = 439 Score = 205 bits (522), Expect = 2e-51, Method: Compositional matrix adjust. Identities = 139/434 (32%), Positives = 225/434 (51%), Gaps = 27/434 (6%) Query: 19 LCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSIS 78 L + YF ++++ + +P + +IV +Y+W+ E+ E+ P K P I+ Sbjct: 2 LLIDYFT--QILIYTLVSYPIIGGVSFIVSSLYYWLLMEK-----EDQPRYLKKGTPFIT 54 Query: 79 IIIPCFNEEKNVEETI-HAALAQRY--ENIEVIAVNDGSTDKTRAILDRMAAQIPH-LRV 134 I++P NE+ ++E TI H A Y + E+I VNDGSTDKT IL+ + A+ LR Sbjct: 55 ILVPAHNEKASIEATIDHLATQMNYPTDQYEIIVVNDGSTDKTGIILETLQAKYGQRLRT 114 Query: 135 IHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPR-----VGAV 189 + + N+GKA + AK E+++ D D+ + DA + + M Y R +GAV Sbjct: 115 VTIINNRGKAAGFNSALGFAKGEFILSNDADSKPEVDA---LWKYMYYFEREGGAQLGAV 171 Query: 190 TGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWS 249 TGN ++TLV + Q E +SIIGLIKR+Q YG +F SG +R+SA+ +VG W Sbjct: 172 TGNMLSINKTTLVAEAQQNELNSIIGLIKRSQLSYGGLFAFSGANTMYRKSAVIDVGGWQ 231 Query: 250 DDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMT 309 + TEDI ISW +Q W ++ P ++ +PETL L KQR RW GG V L Sbjct: 232 AEQPTEDIAISWDMQTAGWQAYFAPHIRFFMDVPETLPELVKQRRRWTSGGIYVLLTKSF 291 Query: 310 RLWRK--ENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGI 367 + R ++F M P+ +Y L+ IW+ + + V G +N + + ++ + Sbjct: 292 GILRHPIKHFSMVPIIIDYGLSIIWSLFYWISMATFIVLQIGYVINQDWHRLLSSLSIAA 351 Query: 368 LLCTLCLLQFIVSLMIENRYEHNLTSSLFWII---WFPVIFWMLSLATTLVSF--TRVML 422 + T+ +L ++ L ++ Y ++ SL +I W+ +I+WM++ T V T + Sbjct: 352 IFTTIEILVGLIQL-VQASYFNDGGRSLKYIAFAPWYILIYWMVNTYTVAVEIIPTIRQI 410 Query: 423 MPKKQRARWVSPDR 436 + K W SP R Sbjct: 411 IRGKDAGVWKSPKR 424 >UniRef50_UPI0001AEFEC9 N-glycosyltransferase PgaC n=1 Tax=Streptomyces roseosporus NRRL 11379 RepID=UPI0001AEFEC9 Length = 343 Score = 192 bits (489), Expect = 1e-47, Method: Compositional matrix adjust. Identities = 105/303 (34%), Positives = 165/303 (54%), Gaps = 6/303 (1%) Query: 28 ELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEE 87 ++++ F+ F+P W+VGGV F + ER P + P P +++++P +NEE Sbjct: 5 DVLLVFLAFYPVVTGAAWMVGGVMFRLTEER--PRAVDPPG----GLPGVTVLVPAYNEE 58 Query: 88 KNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIAL 147 K + + A Y +E++ +NDGSTD T A A + V+ N+GKA L Sbjct: 59 KVIAGCVAALRLVDYPELEILILNDGSTDATVAAARAAAEGDARITVVDDGVNRGKADRL 118 Query: 148 KTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQV 207 G A+ + ++ D D L DA ++V ML +PR A+ G PR+ R +++ +Q+ Sbjct: 119 NEGMRQARHDLVLVTDADTHLHSDAVTHLVRRMLRSPRYAAIAGCPRVTNRGSVIATMQM 178 Query: 208 GEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQ 267 E +S+IGLI+RT + G V TV+GV+ FRR A+ VG + M TEDI+++W+L Sbjct: 179 LETASLIGLIRRTHALAGRVGTVAGVLGLFRRDAVLAVGGYDPKMATEDIELTWRLLQAG 238 Query: 268 WTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYC 327 + P AL + +PETL G+W QR RWA+G EV + + R ++ MWP+ Sbjct: 239 HLTGFAPNALVGMQVPETLSGIWAQRTRWARGQGEVLRTHARTVARPKHAVMWPIAPGGV 298 Query: 328 LTT 330 L T Sbjct: 299 LPT 301 >UniRef50_A5VMQ3 Glycosyl transferase, family 2 n=38 Tax=Lactobacillales RepID=A5VMQ3_LACRD Length = 443 Score = 192 bits (487), Expect = 3e-47, Method: Compositional matrix adjust. Identities = 124/416 (29%), Positives = 212/416 (50%), Gaps = 21/416 (5%) Query: 37 WPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHA 96 +P ++ W G + + ++R++ E P +L P I+I+IP NEE ++ T+ Sbjct: 30 YPILGAMFWFFGAISYILFRQKE----ELPPKAELTPEPFITIMIPAHNEEIVIKSTLEY 85 Query: 97 ALAQ-RYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAK 155 L Q Y N EV+ ++DGS D+T IL M + LRVI + QNQGKA A G AK Sbjct: 86 LLTQLNYHNYEVLVMDDGSDDRTPEILHSMQTKYSRLRVIRIEQNQGKAHAFNIGLFFAK 145 Query: 156 SEYLVCIDGDALLDRDAAAYIVEPMLY-----NPRVGAVTGNPRIRTRSTLVGKIQVGEY 210 EY++ D D + + DA +++ M Y A+T N + RS+L GK Q E+ Sbjct: 146 GEYILSNDADTIPETDA---LIKYMQYFTSANGINYAAITANMDVYNRSSLWGKSQTVEF 202 Query: 211 SSIIGLIKRTQRIYGN-VFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWT 269 SSI+G+IKR+Q N ++ SG +RR L VG + D TEDI I+W T Sbjct: 203 SSIVGIIKRSQTAINNTMYAYSGANTMYRRDFLINVGGFRQDRATEDISIAWDHTFINAT 262 Query: 270 IFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKE-NFR-MWPLFFEYC 327 + P + + +PET L++QR RWAQGG+EV+L N +++R +R + P+ + Sbjct: 263 PKFAPDIVFHMNVPETFHDLYRQRKRWAQGGSEVWLSNFLKVFRHPWQYRFVIPMLTDTT 322 Query: 328 LTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRY 387 L+ IW+F + II+ + + + + ++ L+ + ++ Sbjct: 323 LSIIWSFFFWITSIIFIITMLSFAITGNYERVWHGIVMSMIFVNFQLIAGLFQVLAALIL 382 Query: 388 EHNLTSSLFWIIWFPVIF---WMLSLATTLVSFTR-VMLMPKKQRARWVSPDRGIL 439 + N S L ++++ P+ W+++ T + +F + + + +W+SP+R ++ Sbjct: 383 DFN-GSKLRYLMFSPLYLLFTWIVNPLTIVTTFHKAIKTVTGHGSGKWISPERKVV 437 >UniRef50_B5HP10 Bifunctional transferase/deacetylase n=9 Tax=Streptomyces RepID=B5HP10_9ACTO Length = 741 Score = 162 bits (410), Expect = 2e-38, Method: Compositional matrix adjust. Identities = 91/266 (34%), Positives = 149/266 (56%), Gaps = 12/266 (4%) Query: 42 SIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQR 101 ++M ++ G++ R + + WG AP Q +S+++P +NE K +E T+ + +A Sbjct: 343 ALMLLLSGIHARRVRRKGFRWG--APVTQ-----PVSVLVPAYNEAKCIENTVRSLMASE 395 Query: 102 YENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVC 161 + IEV+ ++DGS+D T I++ M +P++RV+ N GK AL G A A+ + +V Sbjct: 396 HP-IEVLVIDDGSSDGTARIVEAMG--LPNVRVVR-QLNAGKPAALNRGLANARHDIIVM 451 Query: 162 IDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQ 221 +DGD + + +V+P +PRVGAV GN ++ R +L+G Q EY L +R Sbjct: 452 MDGDTVFEPSTVRELVQP-FGDPRVGAVAGNAKVGNRDSLIGAWQHIEYVMGFNLDRRMY 510 Query: 222 RIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWIL 281 + + T+ G + AFRRSAL VG SDD + ED DI+ + + W + Y +A W Sbjct: 511 DVLRCMPTIPGAVGAFRRSALERVGGMSDDTLAEDTDITMAMHRDGWRVVYAEKARAWTE 570 Query: 282 MPETLKGLWKQRLRWAQGGAEVFLKN 307 PE+++ LW QR RW+ G + K+ Sbjct: 571 APESVQQLWSQRYRWSYGTMQAIWKH 596 >UniRef50_D2AU81 Polysaccharide deacetylase n=1 Tax=Streptosporangium roseum DSM 43021 RepID=D2AU81_STRRD Length = 752 Score = 161 bits (407), Expect = 6e-38, Method: Compositional matrix adjust. Identities = 97/285 (34%), Positives = 148/285 (51%), Gaps = 13/285 (4%) Query: 68 APQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYENI-EVIAVNDGSTDKTRAILDRMA 126 AP + P++++I+P +NE +E T+ + + Y + EV+ V+DGS+D T AI + Sbjct: 395 APAWPEPPAVTVIVPAYNEAAGIEATVRSLVNTDYPGVLEVVVVDDGSSDDTAAIAASLG 454 Query: 127 AQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRV 186 +P +RVI +N GK AL TG A A + LV +DGD + + ++V P L +P V Sbjct: 455 --LPGVRVIR-QENGGKPSALNTGIAHASHDILVMVDGDTVFEPATIGHLVRP-LSDPAV 510 Query: 187 GAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVG 246 GAV+GN ++ R ++G+ Q EY L +R + G + TV G I AFRRSAL E+G Sbjct: 511 GAVSGNTKVGNRRGMIGRWQHIEYVIGFNLDRRAFDLLGCMPTVPGAIGAFRRSALQEIG 570 Query: 247 YWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLK 306 S D + ED D++ + W + YE AL W P +L LW+QR RW G Sbjct: 571 GVSVDTLAEDTDLTMAMCRGGWRVVYEENALAWTEAPTSLSQLWRQRYRWCYG------- 623 Query: 307 NMTRLWRKENFRMWPL-FFEYCLTTIWAFTCLVGFIIYAVQLAGV 350 + +W+ P F CL + F ++ + V + V Sbjct: 624 TLQAMWKHRRAITEPSPFGRRCLGYLTLFQVVLPLLAPVVDVMAV 668 >UniRef50_Q82DY8 Putative polysaccharide deacetylase/glycosyltransferase n=1 Tax=Streptomyces avermitilis RepID=Q82DY8_STRAW Length = 790 Score = 160 bits (405), Expect = 8e-38, Method: Compositional matrix adjust. Identities = 94/270 (34%), Positives = 146/270 (54%), Gaps = 12/270 (4%) Query: 42 SIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQR 101 ++M ++ ++ +R + WG P +S+I+P +NE++ +E TI++ LAQ Sbjct: 408 AMMLVLARTHYRQRNKRRFSWGPEITRP-------VSVIVPAYNEKECIENTINS-LAQS 459 Query: 102 YENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVC 161 IE+I V+DGSTD T I++ M +IP++RV+ +N GK AL G A + +V Sbjct: 460 THPIEIIVVDDGSTDGTADIVEAM--RIPNVRVLR-QENAGKPAALNNGVRNASYDIVVM 516 Query: 162 IDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQ 221 +DGD + + D +V+P + VGAV GN ++ R T++G Q EY L +R Sbjct: 517 MDGDTVFEADTVRRLVQP-FADDEVGAVAGNAKVGNRDTVIGAWQHIEYVMGFNLDRRMY 575 Query: 222 RIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWIL 281 + + T+ G I AFRR A+ EVG S+D + ED DI+ + W + YE A W Sbjct: 576 DLLRCMPTIPGAIGAFRREAVLEVGGMSEDTLAEDTDITIAMHRGGWRVVYEEHARAWTE 635 Query: 282 MPETLKGLWKQRLRWAQGGAEVFLKNMTRL 311 P +LK LW QR RW+ G + K+ L Sbjct: 636 APGSLKQLWSQRYRWSYGTMQALWKHRKSL 665 >UniRef50_B1L4E1 Glycosyl transferase family 2 n=1 Tax=Candidatus Korarchaeum cryptofilum OPF8 RepID=B1L4E1_KORCO Length = 489 Score = 157 bits (397), Expect = 7e-37, Method: Compositional matrix adjust. Identities = 89/235 (37%), Positives = 130/235 (55%), Gaps = 4/235 (1%) Query: 74 NPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLR 133 P +S+IIP NEEK + TI + L YE EV+ V+DGSTD+T I Q P ++ Sbjct: 126 RPLVSVIIPARNEEKVIGSTIRSVLESDYEPKEVVVVDDGSTDRTFEIAS--IYQGPKVK 183 Query: 134 VIHLA-QNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGN 192 V+ +GKA AL G A+ E +V +D D ++ RDA +V L +PRV AV GN Sbjct: 184 VLRRELGGRGKARALNFGLRFARGEVIVLMDADTIISRDAIKELVRK-LQDPRVSAVAGN 242 Query: 193 PRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDM 252 +R + L+ K+Q EY + L ++ + G V +SG + AFRR+ L G + D Sbjct: 243 VMVRNKVNLLTKLQAIEYIATFHLFRKGLSVLGAVPIISGALGAFRRNVLESSGLYDADT 302 Query: 253 ITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKN 307 +TED D++ K + + AL + PE LK L++QRLRW +G EV +K+ Sbjct: 303 LTEDFDVTLKALKSGKIVQASSYALAFTEAPEKLKSLYRQRLRWYRGAYEVLIKH 357 >UniRef50_UPI0001B4D705 bi-functional transferase/deacetylase n=3 Tax=Streptomyces RepID=UPI0001B4D705 Length = 700 Score = 155 bits (393), Expect = 2e-36, Method: Compositional matrix adjust. Identities = 87/234 (37%), Positives = 137/234 (58%), Gaps = 7/234 (2%) Query: 68 APQLKD-NPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMA 126 +P L++ N +++I+P +NEE +E T+ + LA + ++++I V+DGSTD+T +A Sbjct: 327 SPWLREVNDPVTVIVPAYNEEAGIEATVRSLLASTHPHLQIIVVDDGSTDRT----ADLA 382 Query: 127 AQIPHLRVIHLAQ-NQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPR 185 I R+ + Q N GKA AL TG A A + +V +D D + + DA +++P L +P Sbjct: 383 TWIDDPRISVIRQINSGKATALNTGLAHAAHDIVVMVDADTVFEADAVHQLIQP-LAHPA 441 Query: 186 VGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEV 245 +GAV+GN ++ R +L+G+ Q EY L +R + + TV G I AFRR AL V Sbjct: 442 IGAVSGNTKVGNRRSLLGRWQHLEYVFGFNLDRRMFEVLECMPTVPGAIGAFRRDALMGV 501 Query: 246 GYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQG 299 G S+D + ED D++ L W + YE A+ W +P +L+ LW+QR RW G Sbjct: 502 GGVSEDTLAEDTDLTMSLWRAGWRVVYEETAVAWTEVPTSLRQLWRQRYRWCFG 555 >UniRef50_C7MUK1 Glycosyl transferase n=11 Tax=Actinomycetales RepID=C7MUK1_SACVD Length = 1099 Score = 153 bits (387), Expect = 1e-35, Method: Compositional matrix adjust. Identities = 82/231 (35%), Positives = 135/231 (58%), Gaps = 5/231 (2%) Query: 77 ISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIH 136 +S+I+P +NE N+E T+ +A+A + +E+I V+DGSTD T +++ + + +RV+ Sbjct: 343 VSVIVPAYNEAANIEATVRSAVASTHP-VEIIVVDDGSTDGTADLVEGLG--LSGVRVLR 399 Query: 137 LAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIR 196 N+GKA AL TG AAA + +V +DGD + + + +V+P +P VGAV+GN +I Sbjct: 400 -RPNRGKAAALNTGIAAASYDLIVMVDGDTVFEPNTVHELVQP-FADPEVGAVSGNVKIA 457 Query: 197 TRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITED 256 R TL+ ++Q EY + +R + ++ TV G AFRRSAL +VG S + ED Sbjct: 458 NRETLLARLQHIEYVVGFNVDRRVHEVMRSMPTVPGAGGAFRRSALLQVGGLSAQTLAED 517 Query: 257 IDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKN 307 D++ + W ++ +A+ W P T++ LW+QR RW G + K+ Sbjct: 518 TDLTISIGRAGWRTVFQEKAVTWTEAPTTVRQLWRQRFRWTFGTLQALWKH 568 >UniRef50_D1SD50 Polysaccharide deacetylase n=1 Tax=Micromonospora aurantiaca ATCC 27029 RepID=D1SD50_9ACTO Length = 776 Score = 153 bits (386), Expect = 1e-35, Method: Compositional matrix adjust. Identities = 91/250 (36%), Positives = 131/250 (52%), Gaps = 12/250 (4%) Query: 59 HWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHA-ALAQRYENIEVIAVNDGSTDK 117 W WG P +S+I+P +NE + + + + AL IEV+ V+DGSTD Sbjct: 407 DWSWGPPVTEP-------VSVIVPAYNEREGIAAAVRSLALGDHPGGIEVVVVDDGSTDG 459 Query: 118 TRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIV 177 T I+ A ++P++RV+ N GK AL TG A A+ + +V +DGD + + D+ +V Sbjct: 460 TADIV--AALRLPNVRVVR-KPNGGKPSALNTGVALARHDLIVMVDGDTIFEPDSVRRLV 516 Query: 178 EPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAF 237 +P +P VG V GN ++ R L+ K Q EY L +R + TV G I AF Sbjct: 517 QP-FADPGVGVVAGNVKVGNRRGLIAKWQHIEYVIGFNLDRRLYETLRCMPTVPGAIGAF 575 Query: 238 RRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWA 297 RR AL +VG +DD + ED D++ L W I YE A W P T+ LWKQR RW+ Sbjct: 576 RRQALEQVGGMTDDTLAEDTDVTIALGRAGWHIVYEESARAWTEAPTTVGQLWKQRYRWS 635 Query: 298 QGGAEVFLKN 307 G + K+ Sbjct: 636 YGTLQAMWKH 645 >UniRef50_B5GIB2 Bi-functional transferase/deacetylase n=7 Tax=Streptomyces RepID=B5GIB2_9ACTO Length = 767 Score = 152 bits (384), Expect = 2e-35, Method: Compositional matrix adjust. Identities = 85/252 (33%), Positives = 136/252 (53%), Gaps = 12/252 (4%) Query: 56 RERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGST 115 R R + WG P +++++P +NE K V T+ +L++ +E+I ++DGST Sbjct: 370 RRRGFAWGVPVTEP-------VTVLVPAYNEAKCVTATV-TSLSRSEHPVEIIVIDDGST 421 Query: 116 DKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAY 175 D T I++R+ +P +RV+ +N GK AL G A A + +V +DGD + + Sbjct: 422 DDTAGIVERLG--LPGVRVVR-QENAGKPAALNRGIAHASHDIIVMMDGDTVFEPATVRE 478 Query: 176 IVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIA 235 +V+P +PRVGAV GN ++ R +L+G Q EY L +R + + T+ G + Sbjct: 479 LVQP-FGDPRVGAVAGNAKVGNRDSLIGAWQHIEYVMGFNLDRRMYDVLRCMPTIPGAVG 537 Query: 236 AFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLR 295 AFRR AL VG S+D + ED DI+ + + W + Y +A W PE++ LW QR R Sbjct: 538 AFRRDALTRVGGMSEDTLAEDTDITMAIHRDGWRVVYAEKARAWTEAPESVAQLWSQRYR 597 Query: 296 WAQGGAEVFLKN 307 W+ G + K+ Sbjct: 598 WSYGTMQAIWKH 609 >UniRef50_C0YGU0 Polysaccharide deacetylase/glycosyl transferase, group 2 family protein n=2 Tax=Bacteroidetes RepID=C0YGU0_9FLAO Length = 1132 Score = 149 bits (375), Expect = 3e-34, Method: Compositional matrix adjust. Identities = 102/311 (32%), Positives = 155/311 (49%), Gaps = 22/311 (7%) Query: 52 FWVYRER--HWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIA 109 +W ++ER GE P L+ P +SII+P +NEE N+ ++ L Q Y N ++I Sbjct: 749 YWAFKERKKEKKLGE---FPVLESYPKVSIIVPAYNEEVNIVSSLQNLLKQTYPNFDIIM 805 Query: 110 VNDGSTDKTRAILDRMAAQIP-HLRV-IHLAQNQGKAIALKTGAAAAKSEYLVCIDGDAL 167 V+DGS D T D+ A P H ++ I +N GKA AL G + +EY+VCID D Sbjct: 806 VDDGSKDST---YDKAKAAFPDHPKLKIFTKRNGGKATALNFGISQTDAEYVVCIDADTK 862 Query: 168 LDRDAAAYIVEPMLYNP---RVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIY 224 L +DA Y++ L + ++ AV GN ++ R + K Q EY++ + Sbjct: 863 LQQDAVKYLIARFLNSDPEEKIAAVAGNVKVGNRVNWLTKWQAIEYTTSQNFDRLAYANI 922 Query: 225 GNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPE 284 + + G I AF+RS + E G +S D + ED DI+ K+ +T+ E RA+ PE Sbjct: 923 NAITVIPGAIGAFKRSVVIETGGYSSDTLAEDCDITVKILKAGYTVANENRAVAVTEAPE 982 Query: 285 TLKGLWKQRLRWAQGGAEVFLKNMTRLW--RKENFRMWP----LFFEYCLTTIWAFTCLV 338 T+K KQR RW G ++F K R + +W L F+Y I F+ L Sbjct: 983 TVKQFLKQRFRWTYGIMQMFWKQRQTFLNPRYKGLGLWAMPNILLFQYI---IPFFSPLA 1039 Query: 339 GFIIYAVQLAG 349 I++ L+G Sbjct: 1040 DVIMFFGILSG 1050 >UniRef50_A1R2V1 Glycosyl transferase, group 2 family domain protein n=4 Tax=Actinomycetales RepID=A1R2V1_ARTAT Length = 431 Score = 147 bits (372), Expect = 6e-34, Method: Compositional matrix adjust. Identities = 86/261 (32%), Positives = 144/261 (55%), Gaps = 13/261 (4%) Query: 43 IMWIVGGVYF-WVYRE---RHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAAL 98 I+++ +YF W+YR RH L D P +S+I+P +NE + + + L Sbjct: 26 IVYVPMALYFDWLYRSICARH-------RYSVLSDRPLVSVIVPGYNEAVVITGCVESIL 78 Query: 99 AQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEY 158 A RY +EVI V+DGSTD+T +I++ +A Q +R + A N GK AL G AAA + Sbjct: 79 ASRYLRLEVILVDDGSTDETASIMEGLAQQYDRVRFLSQA-NAGKGAALNCGIAAALGDI 137 Query: 159 LVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIK 218 L+ +D D + D +++E +P+VGAV G+ R L + GL++ Sbjct: 138 LMFVDADGVFAPDTLIHMLEG-FDDPKVGAVCGDDRPVNLDRLQTMMLAILSHVGTGLVR 196 Query: 219 RTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALC 278 R + + VSG I AFR + E+G + +D + ED++++W++ + + ++P+AL Sbjct: 197 RALSLMNCLPIVSGNIGAFRSDLVRELGGFHEDTLGEDLELTWRVYKAGYRVRFQPKALV 256 Query: 279 WILMPETLKGLWKQRLRWAQG 299 + P T+ GLW+QR+RW++G Sbjct: 257 YAESPSTMGGLWRQRVRWSRG 277 >UniRef50_C8WBS4 Polysaccharide deacetylase n=4 Tax=Sphingomonadaceae RepID=C8WBS4_ZYMMN Length = 1126 Score = 145 bits (366), Expect = 3e-33, Method: Compositional matrix adjust. Identities = 81/232 (34%), Positives = 127/232 (54%), Gaps = 3/232 (1%) Query: 69 PQLKDNPSISIIIPCFNEEKNVEETIHAALAQ-RYENIEVIAVNDGSTDKTRAILDRMAA 127 P + +IS++IP FNEE +E +I LA NIEVI ++DGSTD + I++ A Sbjct: 747 PPIDPERTISVLIPAFNEEAVIEASIRRVLASAEVNNIEVIVIDDGSTDNSSQIVESQFA 806 Query: 128 QIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVG 187 P +++I L+ N GKA AL G AK E ++ +D D + A + +P++G Sbjct: 807 DDPRVQLIRLS-NGGKARALNHGVQKAKGEIIIALDADTHFEPRTIARLTR-WFSDPKLG 864 Query: 188 AVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGY 247 AV GN ++ R L+ + Q EY + L +R + + V G + A+R L +VG Sbjct: 865 AVAGNAKVGNRINLITRWQALEYITAQNLERRATVLLNAMTVVPGAVGAWRAETLRQVGG 924 Query: 248 WSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQG 299 + D + ED D++ +Q + W + Y+P A+ W PET++ L +QR RWA G Sbjct: 925 FPDQTLAEDQDLTIIIQEHDWAVRYDPYAVAWTEAPETIRALARQRFRWAFG 976 >UniRef50_Q2G7Y8 Polysaccharide deacetylase n=1 Tax=Novosphingobium aromaticivorans DSM 12444 RepID=Q2G7Y8_NOVAD Length = 1101 Score = 144 bits (362), Expect = 8e-33, Method: Compositional matrix adjust. Identities = 89/280 (31%), Positives = 141/280 (50%), Gaps = 32/280 (11%) Query: 34 VFFWPFFMSIMW------IVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEE 87 +F W FF++I I+ G+ +W R A P + P++S+IIP +NEE Sbjct: 689 LFSWLFFVAIALGIARAVIMAGLAWWQSRS------PRAEPPAFE--PTVSVIIPAWNEE 740 Query: 88 KNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIAL 147 + + ++ LA Y ++VI +DGS D T A++ R P + ++ LA N GKA AL Sbjct: 741 RVIAASVERVLASDYPALQVIVADDGSKDATSAVVARHFGHDPRVTLLTLA-NGGKAAAL 799 Query: 148 KTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPM--------LYNPRVGAVTGNPRIRTRS 199 A E ++ +D D EP+ +PR+GAV G+ R+ R Sbjct: 800 NRALRHATGEVVIALDADTQF---------EPLTIRRLARWFADPRIGAVAGDARVGNRV 850 Query: 200 TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDI 259 LV + Q EY + L +R + + V G + A+RR+AL VG + ++ + ED D+ Sbjct: 851 NLVTRWQAVEYITAQNLERRALAGFDAMTVVPGAVGAWRRAALDSVGGYPENTLAEDQDL 910 Query: 260 SWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQG 299 + +Q W + Y+PRA+ W P+T + L +QR RWA G Sbjct: 911 TIAIQRKGWRVTYDPRAVAWTEAPQTFRALARQRYRWAFG 950 >UniRef50_B0T3D0 Polysaccharide deacetylase n=2 Tax=Caulobacter RepID=B0T3D0_CAUSK Length = 1124 Score = 143 bits (361), Expect = 1e-32, Method: Compositional matrix adjust. Identities = 86/275 (31%), Positives = 136/275 (49%), Gaps = 6/275 (2%) Query: 74 NPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLR 133 P +S++IPCFNEEK + ++ L ++N+EV+ ++DGS D T + R P + Sbjct: 751 GPLVSVLIPCFNEEKVIAASVARILESEWKNLEVLVLDDGSKDNTAQEVRRAHGDDPRVT 810 Query: 134 VIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNP 193 ++ +N GKA A+ G A AK +Y+V +D D L ++ +P +GAV GN Sbjct: 811 LLSF-ENGGKARAVNRGLAIAKGDYVVALDADTLFPPKTIGRLIR-WFQDPTIGAVAGNA 868 Query: 194 RIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMI 253 + R +V + Q EY + L +R G V V G + A+R+S L +G + D + Sbjct: 869 IVGNRVNMVTRWQALEYVTAQNLERRALAALGAVTVVPGAVGAWRKSVLDALGGYPSDTL 928 Query: 254 TEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWR 313 ED D++ Q W + ++P A + P+T+ GL KQR RW+ G + K+ L+ Sbjct: 929 AEDQDLTIACQRAGWKVAFDPAAQAFTEAPDTVGGLLKQRFRWSFGTLQCVWKHRAALFS 988 Query: 314 KENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLA 348 + P L IW F L+ V LA Sbjct: 989 PKT----PALGFVALPQIWLFQILLAVAAPLVDLA 1019 >UniRef50_B4CVJ2 Glycosyl transferase family 2 n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4CVJ2_9BACT Length = 1181 Score = 143 bits (360), Expect = 1e-32, Method: Compositional matrix adjust. Identities = 105/331 (31%), Positives = 160/331 (48%), Gaps = 26/331 (7%) Query: 29 LMMRFVFFWPFFMSIMWIVG------------GVYFWVYRERHWPWGENAPAPQLKDNPS 76 LM +F F+SI ++V + WVY R P G +++ P Sbjct: 746 LMFALIFGILRFLSIAFVVAIALGIARVAFVTSLAIWVYF-RSKPRGR-----PIENPPL 799 Query: 77 ISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIH 136 +SIIIP +NE+ V TI + LA Y ++E+I V+DGSTD T +++ A +RV+ Sbjct: 800 VSIIIPAYNEQSVVGRTIRSVLANDYPHMEIIFVDDGSTDGTADAVEQEFAGHEKVRVVR 859 Query: 137 LAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIR 196 N GKA AL G +K E +V +D D ++ ++ +PRVGAV GN ++ Sbjct: 860 QV-NGGKASALNHGILVSKGEIIVGLDADTQFRKETITRLIR-HFRDPRVGAVAGNVKVG 917 Query: 197 TRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITED 256 R L+ + Q EY + + + V V G I A+RR+AL EVG + D + ED Sbjct: 918 NRINLITRWQALEYITSQNVDRLAYAQLNAVTVVPGAIGAWRRTALDEVGGYLTDTLAED 977 Query: 257 IDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKEN 316 +D++W+++ W I E A+ P T + +KQR RW+ G + K+ L+R Sbjct: 978 MDLTWRIRRKGWKIETEAGAVALTEAPATTQAFFKQRFRWSFGTLQCLWKHRRALFR--- 1034 Query: 317 FRMWPLFFEYCLTTIWAFTCLVGFIIYAVQL 347 + F L T+W F L I V L Sbjct: 1035 ---YGWFGWVGLPTLWLFQILFQVIAPLVDL 1062 >UniRef50_B8II97 Polysaccharide deacetylase n=1 Tax=Methylobacterium nodulans ORS 2060 RepID=B8II97_METNO Length = 1120 Score = 143 bits (360), Expect = 2e-32, Method: Compositional matrix adjust. Identities = 86/240 (35%), Positives = 126/240 (52%), Gaps = 3/240 (1%) Query: 76 SISIIIPCFNEEKNVEETIHAALAQR-YENIEVIAVNDGSTDKTRAILDRMAAQIPHLRV 134 I++++P +NEE + +TI LA E IE+I ++DGSTD T A++ R A Sbjct: 746 GIAVLVPAYNEEIVILKTIQTLLASTIAEQIEIIVIDDGSTDNTAAVV-RTAFPNTAAVQ 804 Query: 135 IHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPR 194 I+ N GKA AL G +E +V IDGD +L DA ++ +P++GAV G Sbjct: 805 IYTKANGGKAAALNYGLQKTSTEIIVAIDGDTVLLPDAIEHLAR-HFADPKIGAVAGTVS 863 Query: 195 IRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMIT 254 + R TL+ + Q EY+ L +R ++ + V G I A+RR AL VG +S D + Sbjct: 864 VGNRKTLIARFQALEYTMSQNLDRRAFQLINAIGVVPGAIGAWRREALMAVGGYSSDTLA 923 Query: 255 EDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRK 314 ED D++ L+L W + EPRA PE L+ KQR RW G +V K+ R+ Sbjct: 924 EDADLTISLELAGWKVVCEPRARALTEAPERLRAFLKQRFRWMFGTLQVAYKHAPASLRR 983 >UniRef50_B5HQE3 Bifunctional transferase/deacetylase n=3 Tax=Streptomyces RepID=B5HQE3_9ACTO Length = 900 Score = 142 bits (358), Expect = 2e-32, Method: Compositional matrix adjust. Identities = 89/276 (32%), Positives = 144/276 (52%), Gaps = 14/276 (5%) Query: 42 SIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQR 101 +M I+ ++ + +R + WG P +++I+P +NE++ + T+ + L++ Sbjct: 514 GMMLILARRHYRLRNKRRFSWGPTVTRP-------VTVIVPAYNEKECIANTLES-LSKS 565 Query: 102 YENIEVIAVNDGSTDKTRAILDRMAAQ---IPHLRVIHLAQNQGKAIALKTGAAAAKSEY 158 IE+I V+DGS+D T I R AA+ + ++RVI N GK AL G A + Sbjct: 566 THPIEIIVVDDGSSDGTSEI-SREAARALGMTNVRVIR-QDNAGKPAALNNGVRNASYDI 623 Query: 159 LVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIK 218 ++ +DGD + + DA +V+P +P VGAV GN ++ R T++G Q EY L + Sbjct: 624 VIMMDGDTVFEPDAVHQLVQP-FADPEVGAVAGNAKVGNRDTVIGAWQHIEYVMGFNLDR 682 Query: 219 RTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALC 278 R + + T+ G I AFRR A+ +VG S+D + ED DI+ + + Y+ A Sbjct: 683 RMYDLLRCMPTIPGAIGAFRREAVLQVGGMSEDTLAEDTDITIAIHRAGRRVVYQEHARA 742 Query: 279 WILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRK 314 W P +LK LW QR RW+ G + K+ L K Sbjct: 743 WTEAPGSLKQLWSQRYRWSYGTMQALWKHRKSLTDK 778 >UniRef50_A6Q1D6 Glucosaminyltransferase n=2 Tax=Epsilonproteobacteria RepID=A6Q1D6_NITSB Length = 438 Score = 142 bits (358), Expect = 3e-32, Method: Compositional matrix adjust. Identities = 85/287 (29%), Positives = 142/287 (49%), Gaps = 17/287 (5%) Query: 50 VYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIA 109 ++F ++R E + P +S+I+P +NE K + +I + L Q Y N+E+I Sbjct: 59 LFFSIFRTIQRSAEETYQIDPKQRYPKVSVIVPAYNEAKTIATSISSLLTQNYPNLEIIV 118 Query: 110 VNDGSTDKTRAILDRMAAQIPH------LRVIHLAQNQGKAIALKTGAAAAKSEYLVCID 163 V+DGS+D+T A Q H +RV +N+GK+ A+ G + E + +D Sbjct: 119 VDDGSSDETYF----KAKQFEHNEFCKEIRVFR-KKNEGKSKAINYGIERSTGELIFVMD 173 Query: 164 GDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRI 223 D+ + ++A ++ +P + AV G+ + + L+ K+Q EY + +++ Q Sbjct: 174 ADSKISQNAIL-LLARHFEDPDIAAVAGSVYVSNQVNLITKLQALEYIEGLNMVRNGQAF 232 Query: 224 YGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMP 283 V + G + FR++AL VG + D ED D++ KL + I +EP A+ + P Sbjct: 233 LKAVNIIPGPVGMFRKNALYNVGLYDHDTFAEDCDVTLKLIAKGYKIDFEPEAVAYTEAP 292 Query: 284 ETLKGLWKQRLRWAQGGAEVFLKNMTRLWR-KEN----FRMWPLFFE 325 E L L KQR RW +G + K+ LW KEN MW + FE Sbjct: 293 ENLLDLIKQRYRWTRGILQAIRKHRNLLWNFKENTTASMVMWYMLFE 339 >UniRef50_B9ZC52 Glycosyl transferase family 2 n=1 Tax=Natrialba magadii ATCC 43099 RepID=B9ZC52_NATMA Length = 549 Score = 142 bits (357), Expect = 3e-32, Method: Compositional matrix adjust. Identities = 95/292 (32%), Positives = 153/292 (52%), Gaps = 18/292 (6%) Query: 18 PLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSI 77 P +A H G +++ FV++W F++ + + + + R ++ P N P PQ I Sbjct: 108 PSPLALVHLGAVVLIFVYYW--FIAFIALF---HDQIGRSKYVP---NPPYPQ------I 153 Query: 78 SIIIPCFNEEKNVEETIHAALAQRY--ENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVI 135 +++IP +NEE V TI + L Y + +E+IAV+DGSTD T A AA + V+ Sbjct: 154 TVLIPAYNEEGYVGRTIQSLLDANYPADALEIIAVDDGSTDDTLAEASAFAAASEQVSVV 213 Query: 136 HLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRI 195 A N GK AL G A + +V +D D+++DRDA +IV P + +GAV N I Sbjct: 214 SKA-NGGKYSALNYGLLFAAGDIIVTVDADSIVDRDALKHIVAPFAADDDIGAVASNVTI 272 Query: 196 RTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITE 255 R +L+ + Q EY+ + + +R +G V V G + A+RR L+EV + D +TE Sbjct: 273 WNRDSLITRCQQLEYTIGVNIYRRALDYFGIVMVVPGCLGAYRREVLSEVFAYDPDTLTE 332 Query: 256 DIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKN 307 D D++ K+ + + A + P T L++QRLRW +G +K+ Sbjct: 333 DFDVTMKVLRAGYRVSVS-DARVYTEAPATWGDLYRQRLRWYRGNYMTIIKH 383 >UniRef50_C1V8R5 Glycosyl transferase n=1 Tax=Halogeometricum borinquense DSM 11551 RepID=C1V8R5_9EURY Length = 567 Score = 140 bits (353), Expect = 1e-31, Method: Compositional matrix adjust. Identities = 80/238 (33%), Positives = 126/238 (52%), Gaps = 6/238 (2%) Query: 72 KDNPSISIIIPCFNEEKNVEETIHAALAQRY--ENIEVIAVNDGSTDKTRAILDRMAAQI 129 D P +S+++P +NE V + + + LA Y + +EVI ++DGSTD T A + A + Sbjct: 199 DDAPLVSVLVPAYNESNYVGDCLDSILASDYPTDRLEVIVIDDGSTDGTYA--EASAYRN 256 Query: 130 PHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAV 189 + V H + N GK AL G + ++ + +V +D D++L A VE + +PR+GAV Sbjct: 257 DRVSVFHRS-NGGKHAALNLGLSCSRGDVVVAVDADSILAPSALRTAVEQLQSDPRLGAV 315 Query: 190 TGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWS 249 G + +VG +Q EY I ++R G V + G + FRR AL+EVG + Sbjct: 316 AGTVVVNNADGIVGSVQALEYVLGINTLRRAFSYLGTVMVIPGCLGVFRREALSEVGGYD 375 Query: 250 DDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKN 307 D +TED D++ +L W + AL + P +L L QRLRW +G + LK+ Sbjct: 376 PDTVTEDFDLTVRLLKAGWRVELS-EALVYTEAPFSLTDLLNQRLRWTRGNIQTLLKH 432 >UniRef50_D2L983 Polysaccharide deacetylase n=1 Tax=Desulfovibrio sp. FW1012B RepID=D2L983_9DELT Length = 1140 Score = 139 bits (350), Expect = 2e-31, Method: Compositional matrix adjust. Identities = 90/294 (30%), Positives = 154/294 (52%), Gaps = 22/294 (7%) Query: 65 NAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALA-QRYENIEVIAVNDGSTDKT-RAIL 122 + PAP L S+++++P +NEEK V +T+ + LA Q E+I V+DGSTD T R + Sbjct: 760 SGPAPDL----SVAVVVPAYNEEKVVLQTVQSLLACQHPATFEIIVVDDGSTDATYRVLC 815 Query: 123 DRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLY 182 + +A + L I N GK AL G A +++ +V +D D + RD + + Sbjct: 816 EALAGE--KLVTIVTKPNGGKPAALNHGIALTRADIVVTLDADTVFARDTILRLAD-WFR 872 Query: 183 NPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSAL 242 +P+VGAV GN ++ R + + Q EY + L +R + +V V G + A+RR + Sbjct: 873 DPKVGAVAGNAKVGNRINFLTRCQALEYVTSQNLDRRALTVLDSVTVVPGAVGAWRREVV 932 Query: 243 AEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAE 302 G +S + + ED D++ ++Q T+ YE RA+ P+T++G +QR RW G + Sbjct: 933 EAAGGFSGETLAEDADLTIRIQRMGHTVAYEDRAVALTEAPDTMRGFLRQRFRWMFGTLQ 992 Query: 303 VFLKNMTRLWRKE----------NFRMWPLFFEYC--LTTIW-AFTCLVGFIIY 343 V K+ L+R N ++ +FF+ + +W A+TCL ++++ Sbjct: 993 VAWKHKDALFRPRYGLLGFFGLPNIWLYQIFFQIISPVMDLWLAYTCLKSWVLW 1046 >UniRef50_C6WRK1 Glycosyl transferase family 2 n=1 Tax=Actinosynnema mirum DSM 43827 RepID=C6WRK1_ACTMD Length = 637 Score = 138 bits (348), Expect = 3e-31, Method: Compositional matrix adjust. Identities = 85/238 (35%), Positives = 130/238 (54%), Gaps = 7/238 (2%) Query: 83 CFNEEKNVEETIHAALAQRYEN-IEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQ 141 +NE + + + +A Y +EV+ V+DGS D T ++ A +P +RV+ +N Sbjct: 283 AYNEAVTITAAVRSLVASDYPAPVEVVVVDDGSADGTADVV--RALDLPGVRVV-TRENG 339 Query: 142 GKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTL 201 GKA AL TG A A + LV +DGD + + A +V P L P VGAV+GN ++ R L Sbjct: 340 GKAEALNTGVALAGHDALVLVDGDTIFEPGTLAALVAP-LGAPGVGAVSGNVKVANRRGL 398 Query: 202 VGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISW 261 +G+ Q EY+S L ++ + + T+ G + AFRR AL EVG S D + ED D++ Sbjct: 399 LGRWQHLEYTSGSNLDRQILNAWRCLPTIPGAVGAFRREALVEVGGVSSDTLAEDTDVTM 458 Query: 262 KLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRM 319 + W + YEP A W +P L+ L++QR RW+ G + K+ R+ +E RM Sbjct: 459 AITRAGWRVVYEPGARAWTEVPAGLRSLYRQRYRWSYGTFQAMWKH--RVAVREQGRM 514 >UniRef50_Q1GJ85 Glycosyl transferase family 2 n=4 Tax=Rhodobacteraceae RepID=Q1GJ85_SILST Length = 1002 Score = 137 bits (344), Expect = 9e-31, Method: Compositional matrix adjust. Identities = 78/228 (34%), Positives = 128/228 (56%), Gaps = 7/228 (3%) Query: 74 NPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLR 133 P +++IIP NEEK + I + A Y+N+E+I V+DGS+D T L+ + A H+R Sbjct: 632 TPKVAVIIPAHNEEKVIRSCIQSVRASDYKNLEIIVVDDGSSDNT---LNEIFA-FSHMR 687 Query: 134 VIHL--AQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTG 191 + L NQGK AL +E +VCID D +++ A ++V NPR+GAV G Sbjct: 688 EVRLISQPNQGKWSALNRALMNTSAEIVVCIDADTQIEKSAIGHMVR-HFDNPRIGAVAG 746 Query: 192 NPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDD 251 + L+ ++Q EY++ + ++ + + V G + A+R +AL + G++SD+ Sbjct: 747 KIIAGNKVNLLTRLQALEYTTAQNVERKAFDLINGMLVVPGALGAWRVAALRKAGHFSDE 806 Query: 252 MITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQG 299 +TED D++ ++ + I YEP A + +PE + L KQRLRW+ G Sbjct: 807 TMTEDTDLTIEVNRAGYRIAYEPLARGYTEVPERIGQLLKQRLRWSFG 854 >UniRef50_C1ACV5 Putative glycosyl transferase/polysaccharide deacetylase n=1 Tax=Gemmatimonas aurantiaca T-27 RepID=C1ACV5_GEMAT Length = 1123 Score = 136 bits (343), Expect = 1e-30, Method: Compositional matrix adjust. Identities = 88/263 (33%), Positives = 134/263 (50%), Gaps = 7/263 (2%) Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRV 134 P +S+++P +NE + + T+ + L+Q Y ++EV+ V+DGS+D T A L V Sbjct: 751 PRVSVLVPAYNEGRVIGRTVQSVLSQAYPDLEVVVVDDGSSDDTHDAASH-ATDDARLHV 809 Query: 135 IHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPR 194 + N GKA AL TG A A E +V ID D +L DA ++V P L + RVGAV GN + Sbjct: 810 VR-QTNAGKAAALNTGIAMATGEVIVVIDADTILAPDAIRHLVRP-LADARVGAVAGNAK 867 Query: 195 IRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMIT 254 + R L+ + Q EY + L +R + + V G I A+RRSA+ + G + D + Sbjct: 868 VGNRINLLTRWQAVEYVTSQNLDRRAFVMLNCITVVPGAIGAWRRSAVLDAGGFRTDTLA 927 Query: 255 EDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRK 314 ED D++ L + +A+ PET L KQR RW+ G + K+ L R+ Sbjct: 928 EDQDLTLTLLRGGHKVALAEQAVALTEAPETFGALLKQRFRWSFGTLQCAWKHRGALMRR 987 Query: 315 ENFRMWPLFFEYCLTTIWAFTCL 337 + + + L IW F L Sbjct: 988 DAGALGMV----GLPNIWLFQLL 1006 >UniRef50_C1D0J4 Putative glycosyltransferase n=1 Tax=Deinococcus deserti VCD115 RepID=C1D0J4_DEIDV Length = 430 Score = 136 bits (342), Expect = 2e-30, Method: Compositional matrix adjust. Identities = 78/238 (32%), Positives = 129/238 (54%), Gaps = 3/238 (1%) Query: 70 QLKDNP-SISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQ 128 +L P ++++I +NEE + T+ + L++ E ++V+ V+DGS+D T A ++R A Sbjct: 36 ELPATPLGVTVMIAAYNEEIGIGGTLTSVLSEPLEELQVVVVDDGSSDGTAAAVERFAQL 95 Query: 129 IPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGA 188 P +++I +N GKA AL + V +D D+ + A + ++PRVGA Sbjct: 96 DPRVQLIR-QENTGKAGALNHALKYVRYPVAVSVDADSAISPGTLAALAR-HFHDPRVGA 153 Query: 189 VTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYW 248 V G+ R+ + L+ +Q EY+ + KR Q G V V G A+R L E+G + Sbjct: 154 VAGDVRVAGEAKLLTHMQDMEYTIGQHMEKRAQCFLGTVSVVPGAAGAYRTELLRELGGY 213 Query: 249 SDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLK 306 S D +TED+D++ + + + +EP AL + P+ LK LW+QRLRW G +V K Sbjct: 214 SHDTLTEDMDLTIAVAAAGYRVRFEPLALSFTEPPQMLKHLWRQRLRWMYGTFQVMGK 271 >UniRef50_A9EEN3 Glycosyl transferase, family 2 n=1 Tax=Oceanibulbus indolifex HEL-45 RepID=A9EEN3_9RHOB Length = 1088 Score = 136 bits (342), Expect = 2e-30, Method: Compositional matrix adjust. Identities = 103/335 (30%), Positives = 166/335 (49%), Gaps = 19/335 (5%) Query: 30 MMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKN 89 +++ F+ F + I + +++ R RH P P K PS++I+IP +NE + Sbjct: 680 LLQIAFWTVFAIGICRSISLLFWAARRRRHAP-------PLSKHEPSVTIVIPAYNEARV 732 Query: 90 VEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQ-NQGKAIALK 148 +E+ I AL Y + ++I V+DGSTD T ++ + H V L Q N+GKA AL Sbjct: 733 IEKCIRKALYSEYGDFDIIVVDDGSTDDT---YEKAISFAYHPLVTVLRQPNRGKAAALN 789 Query: 149 TGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVG 208 A+SE L+CID D+ + DA + + +P+VGAV G + R L+ ++Q Sbjct: 790 AALDEAQSEILICIDADSQIAPDAVSLLAA-HFKDPKVGAVAGRVVVGNRDNLLTRLQAL 848 Query: 209 EYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQW 268 EY + + +R + + V G I A+R +AL E G +S + +TED D++ + + + Sbjct: 849 EYITAQAVERRAKEYLNAITVVPGAIGAWRTTALMEAGIFSTETLTEDADMTMAMIRSDY 908 Query: 269 TIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCL 328 + YE RA+ P +L L QRLRW+ G + K++ + N + L Sbjct: 909 QVIYEDRAVATTETPRSLSALMTQRLRWSLGMMQAGWKHLGATVERRNLGL------VAL 962 Query: 329 TTIWAFTCLVGFIIYAVQLAGVPLNIE-LTHIAAT 362 + F L+ I L V L IE T+I AT Sbjct: 963 PDLVVFGYLMPLIAPLADLFLVLLVIEFFTNIGAT 997 >UniRef50_B6BG61 Glycosyl transferase, group 2 family protein n=2 Tax=Rhodobacterales RepID=B6BG61_9RHOB Length = 1140 Score = 136 bits (342), Expect = 2e-30, Method: Compositional matrix adjust. Identities = 78/245 (31%), Positives = 134/245 (54%), Gaps = 14/245 (5%) Query: 56 RERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGST 115 RER P L+ P ++++IP NE K + ++I + A Y+N+E+I V+DGST Sbjct: 758 RERFTP---------LQRQPKVAVVIPAHNEAKVIAQSIESVRASGYKNLEIIVVDDGST 808 Query: 116 DKTRAILDRMA-AQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAA 174 D T +L+ + +R+I NQGK AL + +E+ VCID D + +DA Sbjct: 809 DDT--LLEVLKFGHKSEVRLIS-QPNQGKWSALNRAIQSTDAEFAVCIDADTQVCKDAIT 865 Query: 175 YIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVI 234 ++V +P+ GAV G R L+ ++Q EY++ + ++ + + V G I Sbjct: 866 HLVR-HFADPKTGAVAGKIIAGNRVNLLTRLQAFEYATSQNIERKAFDLINGILVVPGAI 924 Query: 235 AAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRL 294 A+R AL + G++S++ +TED D++ ++ + + +EP+A + +PE + L KQRL Sbjct: 925 GAWRVEALRKAGFFSEETLTEDTDLTIQVNRAGYNVVFEPKAKAYTEVPENVGQLLKQRL 984 Query: 295 RWAQG 299 RW+ G Sbjct: 985 RWSLG 989 >UniRef50_C5CVQ8 Glycosyl transferase family 2 n=4 Tax=Proteobacteria RepID=C5CVQ8_VARPS Length = 445 Score = 135 bits (341), Expect = 2e-30, Method: Compositional matrix adjust. Identities = 94/283 (33%), Positives = 143/283 (50%), Gaps = 9/283 (3%) Query: 64 ENAPA-PQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYEN-IEVIAVNDGSTDKTRAI 121 + PA P L P ISI+I +NEE +++ET+ + Q Y ++VI +NDGS D T + Sbjct: 80 DRRPAHPALAVYPPISILIAAYNEEASIDETLVSIDRQNYPGELQVIVINDGSADGTSLV 139 Query: 122 LDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPML 181 + R + P L + L +N GKA AL G + ++ +D D+ L R A IVE Sbjct: 140 VQRARERYPWLTFLDLEKNGGKARALNIGFKEVAHDLVITVDADSFLYRGALTSIVERYR 199 Query: 182 YN-PRVGAVTGNPRIR-TRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRR 239 + P AV G +R +R + + Q +Y I IKR Q ++ G + + R Sbjct: 200 ADPPHTRAVAGKILVRNSRLNWITRCQEWDYFHGIAAIKRVQSLFQGTLVAQGAFSIYDR 259 Query: 240 SALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQG 299 +AL EVG W + + EDI ++W L + + + A + +P T+K L KQR RWA+G Sbjct: 260 AALVEVGGWP-ECVGEDIVLTWALLKAGYRVGHCEDACLFTNVPTTVKQLVKQRQRWARG 318 Query: 300 GAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIW-AFTCLVGFI 341 AE F+K+ L + R+ F + L W C +GFI Sbjct: 319 MAEAFIKHPGILVKP---RLSTFFIYWNLLFPWLDLACTIGFI 358 >UniRef50_A0RGJ4 Glycosyl transferase and polysaccharide deacetylase fusion protein n=66 Tax=Bacillus RepID=A0RGJ4_BACAH Length = 1119 Score = 134 bits (337), Expect = 7e-30, Method: Compositional matrix adjust. Identities = 85/267 (31%), Positives = 135/267 (50%), Gaps = 6/267 (2%) Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRV 134 P +S++I +NEEK + +TI + L +Y EVI V+DGSTD T ++ + P +R Sbjct: 758 PFVSVVIAAYNEEKVIAKTIRSILDSKYGEFEVIVVDDGSTDGTSKVMQETFYKHPKVRF 817 Query: 135 IHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPR 194 I +N GK+ A+ G ++ E +V +D D ++ +DA + +V + V AV+GN + Sbjct: 818 IQ-KENGGKSSAMNLGFQQSRGEIIVTLDADTIIAQDAISLMVR-HFEDQNVAAVSGNVK 875 Query: 195 IRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMIT 254 + R L+ Q EY + L +R + V G I A+R+ + E GY S+D + Sbjct: 876 VGNRRNLLTTWQHVEYITGFNLERRAFDELNCITVVPGAIGAWRKKNVVESGYLSEDTLA 935 Query: 255 EDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRK 314 ED D++ I YE +A + PE +K L KQR RW+ G + K+ L Sbjct: 936 EDTDLTITFLRQGHRIVYEEKAYAFTESPEDVKSLIKQRYRWSYGTLQCLWKHRKALC-- 993 Query: 315 ENFRMWPLFFEYCLTTIWAFTCLVGFI 341 N + L F L +W F ++ FI Sbjct: 994 -NSKHKTLGF-IALPNMWLFQYVLQFI 1018 >UniRef50_Q1IUP8 Polysaccharide deacetylase n=1 Tax=Candidatus Koribacter versatilis Ellin345 RepID=Q1IUP8_ACIBL Length = 1154 Score = 133 bits (335), Expect = 1e-29, Method: Compositional matrix adjust. Identities = 75/227 (33%), Positives = 120/227 (52%), Gaps = 3/227 (1%) Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRM-AAQIPHLR 133 P+++++IP +NEEK +E T+ + L Y + I ++DGS D T +++++ AA+I + Sbjct: 774 PAVAVLIPAYNEEKVIERTVRSVLDSDYPKLRAIVIDDGSKDATVEVVEQLFAAEIASGK 833 Query: 134 VIHLAQ-NQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGN 192 V L + N GKA AL G E V ID D ++ DA +V P NP++ A+ GN Sbjct: 834 VTLLTKPNSGKAAALNYGLEFVTEEIFVGIDADTIIAPDAIGLLV-PHFQNPKIAAIAGN 892 Query: 193 PRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDM 252 ++ R + Q EY + +R ++G V V G I A+R A+ G + D Sbjct: 893 AKVGNRVNWWTRWQALEYITSQNFERRALDVFGAVSVVPGAIGAWRTEAVLAAGKYHHDT 952 Query: 253 ITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQG 299 + ED D++ L + + + YE AL + P T GL +QR RW+ G Sbjct: 953 VAEDADLTMALLQDGYRVEYEDLALAYTEAPSTANGLMRQRFRWSFG 999 >UniRef50_D2LDP7 Polysaccharide deacetylase n=1 Tax=Rhodomicrobium vannielii ATCC 17100 RepID=D2LDP7_RHOVA Length = 1170 Score = 133 bits (334), Expect = 2e-29, Method: Compositional matrix adjust. Identities = 97/349 (27%), Positives = 165/349 (47%), Gaps = 32/349 (9%) Query: 69 PQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQ 128 P D+ S+S++IP FNE K + +I LA ++ +EVI ++DGSTD T ++ A Sbjct: 758 PPYADDLSVSVLIPAFNEAKVITASIRQILASSHQKLEVIVIDDGSTDGTADVVRGEFAD 817 Query: 129 IPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGA 188 P + ++H N GKA A+ A A + +V +D D + + +V +P+VGA Sbjct: 818 DPRVSLMH-TPNGGKARAINLALAQATGDIVVVLDADTQFEPLTISRLVR-WFADPKVGA 875 Query: 189 VTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYW 248 V GN ++ R ++ + Q EY + L +R + V G + A+RR A+ +G + Sbjct: 876 VAGNAKVGNRINVLTRWQALEYITAQNLERRALATLDCITVVPGAVGAWRREAIMGLGGF 935 Query: 249 SDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNM 308 + + ED D++ +Q + + ++ AL W P+TL GL KQR RWA G + Sbjct: 936 PSNTLAEDQDLTISVQRAGYKVLFDADALAWTEAPDTLGGLAKQRFRWAFG-------TL 988 Query: 309 TRLWRKENFRMWPLFFEY---CLTTIWAFTCLVGFIIYAVQLA---------------GV 350 LW+ + + P + L +W F + I V L G Sbjct: 989 QCLWKHRSANLNPRYGALGMIALPQVWLFQIALALISPLVDLLLLVQVVRTGIDYLQHGS 1048 Query: 351 PLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWII 399 N E I T+ A + T+ L +++ ++E R + +L L+W++ Sbjct: 1049 QFNSENFTITLTYYA--VFMTVDLSAALIAFLLEKREDRSL---LWWLV 1092 >UniRef50_Q3B487 Glucosaminyltransferase n=2 Tax=Chlorobium/Pelodictyon group RepID=Q3B487_PELLD Length = 461 Score = 132 bits (332), Expect = 2e-29, Method: Compositional matrix adjust. Identities = 82/234 (35%), Positives = 124/234 (52%), Gaps = 7/234 (2%) Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRV 134 P +S+IIP FNEE ++ + I +ALA Y E+I V+DGS D T I++R + V Sbjct: 117 PFVSVIIPSFNEEDSIAQCIESALALDYPAYEIIIVDDGSRDLTLPIIERY-----DVSV 171 Query: 135 IHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPR 194 I L N+GK AL G AK E + D D+ LD Y+V + VGAV G Sbjct: 172 IRLRTNRGKVEALNRGIEKAKGEIIFFTDSDSSLDPKVLRYLVAG-FSDDTVGAVAGMVL 230 Query: 195 IRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTV-SGVIAAFRRSALAEVGYWSDDMI 253 + ++ + ++Q EY +IK+ Q G ++ G AFRRS L E G + + + Sbjct: 231 PKRHNSYLLRMQTIEYIYGQSIIKKAQLKSGGAVSICPGPATAFRRSVLLETGGYRNRTL 290 Query: 254 TEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKN 307 ED DI+ ++ + YEPRA+ + KGL KQR+RW++G +V+ ++ Sbjct: 291 AEDFDITLEMMELGYRAVYEPRAVSYTSAMTGWKGLRKQRIRWSRGHLQVYREH 344 >UniRef50_C7RT53 N-glycosyltransferase PgaC n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RT53_9PROT Length = 139 Score = 132 bits (332), Expect = 3e-29, Method: Compositional matrix adjust. Identities = 56/100 (56%), Positives = 80/100 (80%) Query: 186 VGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEV 245 +GA+TGNPRIR R+ L+G++QVGE+SSI+GLIKR Q +YG VFTVSGV+ AFR+ ALA Sbjct: 13 LGALTGNPRIRKRAALLGRLQVGEFSSIVGLIKRAQTVYGRVFTVSGVVCAFRKRALAAA 72 Query: 246 GYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPET 285 G+WS +T+D++++W++QL W + +E +ALCW+L T Sbjct: 73 GWWSPAALTDDVEVTWRIQLAGWQVAFEAKALCWLLQTLT 112 >UniRef50_D2LZZ6 Glycosyl transferase family 2 n=1 Tax=Bacillus cellulosilyticus DSM 2522 RepID=D2LZZ6_BACS4 Length = 457 Score = 132 bits (332), Expect = 3e-29, Method: Compositional matrix adjust. Identities = 95/322 (29%), Positives = 152/322 (47%), Gaps = 50/322 (15%) Query: 63 GENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAIL 122 G N K+N +SI+IP +NEEK + + + Y N+E I +NDGSTD+T IL Sbjct: 30 GLNKYKKVDKEN-GVSILIPAYNEEKVILNCLQGIVNLNYNNVEAIFINDGSTDRTLDIL 88 Query: 123 DRM-----------AAQIPHLRVIHL-------------AQNQGKAIALKTGAAAAKSEY 158 + A +IPH +I + +N GKA AL G A+ EY Sbjct: 89 TQHLQLVPVYNRLPAVKIPHEAIIDIYASTLYPKVLVIDKKNGGKADALNAGIEYARKEY 148 Query: 159 LVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRI---------RTRSTLVG----KI 205 +V +D D++LD++ + +++ + + A+ G +I + R T + + Sbjct: 149 VVTLDADSILDKE-SLHVINSTFSDNHILAIGGMVQIGQGFQGNYLKPRPTFINPGIIRF 207 Query: 206 QVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQ- 264 Q+ +Y + + K TQ ++ ++G AFRR L EV + I ED+DI+ K+Q Sbjct: 208 QILQYFTAFYMHKYTQSKLKSITVIAGAFGAFRRKVLFEVDGFR-KTIGEDMDITLKIQR 266 Query: 265 -----LNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRM 319 + + P+ALC+ PET + L+ QR+RW + + F + K R+ Sbjct: 267 LIKTKYKNHRLIFVPQALCYTECPETFRALFNQRVRWQKAFVDCFFMYRKNFFHKLGVRL 326 Query: 320 WPLFF---EYCLTTIWAFTCLV 338 LFF L TI AFT L+ Sbjct: 327 -SLFFLGDGVLLGTINAFTALL 347 >UniRef50_C5SLQ2 Polysaccharide deacetylase n=1 Tax=Asticcacaulis excentricus CB 48 RepID=C5SLQ2_9CAUL Length = 1126 Score = 131 bits (330), Expect = 4e-29, Method: Compositional matrix adjust. Identities = 81/223 (36%), Positives = 121/223 (54%), Gaps = 2/223 (0%) Query: 77 ISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIH 136 +S+IIP FNE + +E ++ L +IEV+ V+DGS D+T AI+ R A P + +I Sbjct: 754 VSVIIPAFNEARVIEASVRRVLTSVDVDIEVVVVDDGSKDETSAIVARAFANEPRVTLIT 813 Query: 137 LAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIR 196 A N GKA A+ G A A+ E ++ +D D + + A +V P +GAV GN ++ Sbjct: 814 QA-NAGKATAVNNGIARARGEIIIALDADTQFEPETIARLVR-WFIRPEIGAVAGNAKVG 871 Query: 197 TRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITED 256 R V + Q EY + L + +G + V G + A+RRSAL EVG + D + ED Sbjct: 872 NRFNFVTRWQGVEYVTAQNLERTALATFGAMMVVPGAVGAWRRSALEEVGGYPHDTLAED 931 Query: 257 IDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQG 299 D++ +Q W + Y+ A+ W PE+ L KQR RWA G Sbjct: 932 QDLTIAIQRRGWAVAYDQDAVAWTEAPESFASLIKQRYRWAFG 974 >UniRef50_A1UIF2 Polysaccharide deacetylase n=3 Tax=Mycobacterium RepID=A1UIF2_MYCSK Length = 789 Score = 131 bits (329), Expect = 6e-29, Method: Compositional matrix adjust. Identities = 113/411 (27%), Positives = 182/411 (44%), Gaps = 32/411 (7%) Query: 18 PLCVAYFHSGELMMRF------VFFWPFFMSIMWIVGGVYFWV-----YRERHWPWGENA 66 P A +H+ LM + FW M + ++ +Y + YR+ W + Sbjct: 349 PADAATYHTLRLMEKAPGAVLGFLFW-LGMGSLTVMSLLYLILALVCQYRQNRLRWND-- 405 Query: 67 PAPQLKDN--PSISIIIPCFNEEKNVEETIHAALAQRY--ENIEVIAVNDGSTDKTRAIL 122 + D+ P +S+++ FNEEK + TI Y EV+AVNDGSTD T IL Sbjct: 406 ----IGDDQLPMVSVVLAAFNEEKVIARTIAELRRSDYPRSRFEVVAVNDGSTDGTLRIL 461 Query: 123 DRMAAQIPHLRVIHLAQNQGKAIALKTG--AAAAKSEYLVCIDGDALLDRDAAAYIVEPM 180 +A P LRV+ A N GK+ A+ G A+A S +V +D D L D + Sbjct: 462 TELARDWPKLRVVDQA-NSGKSSAINNGINHASAVSTVMVTMDADTLFRPDTIRNLARHF 520 Query: 181 ---LYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAF 237 + +VGAV G+ ++ R L+ Q EY S I + + +R+ + V G +A+ Sbjct: 521 ARHTHGRQVGAVAGHIKVGNRRNLLTAWQSLEYISGICVTRMAERLLNAISIVPGACSAW 580 Query: 238 RRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWA 297 R+AL E+G + DD + ED D + LQ + I E A+ PET++ L KQR RW Sbjct: 581 SRTALEEIGGFCDDTMAEDCDATLALQRRGYRILQENNAIADTEAPETIRALAKQRKRWT 640 Query: 298 QGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELT 357 G + K+ L+R + + Y ++ + I A ++ N + Sbjct: 641 YGNIQALWKHRAMLFRPRYGALGLVALPYAALSLIVPLLFMPLTIVAAGMSLAAGNWQSI 700 Query: 358 HIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWML 408 + AG + ++ M R H L ++ II+ P+ ++L Sbjct: 701 AL----FAGFVAALHMIISITAVAMARERAWHLLVVPVYRIIYEPLRAYLL 747 >UniRef50_D1PAP3 Putative biofilm PGA synthesis N-glycosyltransferase PgaC n=1 Tax=Prevotella copri DSM 18205 RepID=D1PAP3_9BACT Length = 438 Score = 131 bits (329), Expect = 6e-29, Method: Compositional matrix adjust. Identities = 97/322 (30%), Positives = 154/322 (47%), Gaps = 17/322 (5%) Query: 72 KDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRM--AAQI 129 ++NP ISI++P NE K++ + +++ Q Y N E+I V+DGS D T+ I + + A I Sbjct: 69 RENPLISILVPGKNEGKHIFKMVNSLAEQTYRNYEIIVVDDGSDDDTKLICNDLYRAGYI 128 Query: 130 PHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAV 189 H + L GKA A GA A+ +Y+VC+D D+ LDRDA I+ P + V V Sbjct: 129 TH--YLRLDTRGGKAAASNYGAQMARGKYIVCLDADSSLDRDALEKILLPFYIDGMVKGV 186 Query: 190 TGNPRIRT-RSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYW 248 G ++R + T+ +Q EY I + + G +SG AF R L EVGYW Sbjct: 187 GGCVKVRNYKETICSSLQAFEYLKRIQVGRIVTSELGIYHIISGAFGAFERKTLKEVGYW 246 Query: 249 SDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNM 308 D D D++ K++ + + + A+C +P L+ QR+RW++ L+ Sbjct: 247 -DIGPGLDGDLTQKIRKAGYKVKFAEDAICMTNVPTKWYKLYHQRIRWSRSLVRFRLRKH 305 Query: 309 TRLWRKENFRMWPLFFEYCLTTIWAFTCLVGF--IIYAVQLAGVPLNIELTHIAATHTAG 366 + + W + F C + F + Y ++LA + N THI G Sbjct: 306 ADILLPT--KNWSILNWLSNMESVVFDCFLNFLWLWYIIKLA-ITFN---THIIEVLALG 359 Query: 367 ILLCTLCLLQ--FIVSLMIENR 386 + +C Q FI+ LM+ R Sbjct: 360 YFI-RVCFSQLAFILVLMVSER 380 >UniRef50_C5A5B1 Glycosyltransferase, family 2 n=9 Tax=Thermococcaceae RepID=C5A5B1_THEGJ Length = 354 Score = 130 bits (326), Expect = 1e-28, Method: Compositional matrix adjust. Identities = 79/227 (34%), Positives = 125/227 (55%), Gaps = 11/227 (4%) Query: 74 NPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLR 133 +P +S++IP +NEE+N+E+ + AAL+Q Y N EVI V+DGSTD T ++ + LR Sbjct: 35 SPMVSVVIPAYNEEENIEDAVKAALSQDYPNFEVIVVDDGSTDGTYE--KALSIKDERLR 92 Query: 134 VIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNP 193 VI + + GK AL G + E +V ID D LL+++A + +VE Y+ + AV G Sbjct: 93 VIRI-NHGGKTRALNVGLEFSAGEIIVTIDADGLLEKNAVSRLVE-RFYSDDIAAVGGQV 150 Query: 194 RIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMI 253 R+R + L + I +R + + N+ G I+AFR+ +L ++G + +D++ Sbjct: 151 RVRGETFLELAQDIEHLR--IATFRRAKELE-NLSLAPGPISAFRKRSLLKIGGFVNDLV 207 Query: 254 TEDIDISWKLQLNQ-WTIFYEPRALCWILMPETLKGLWKQRLRWAQG 299 D + L L + + Y P A W+ MP L LW+QR RW G Sbjct: 208 E---DYATTLALKKIGRVVYAPGARVWVRMPSGLGALWRQRRRWFLG 251 >UniRef50_Q2IFD0 Glycosyl transferase, family 2 n=5 Tax=Bacteria RepID=Q2IFD0_ANADE Length = 479 Score = 129 bits (323), Expect = 3e-28, Method: Compositional matrix adjust. Identities = 87/288 (30%), Positives = 130/288 (45%), Gaps = 47/288 (16%) Query: 74 NPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIP--- 130 P +SII+P FNEE+ + TI L Y +E++ VNDGSTD+T +L R +P Sbjct: 56 EPPVSIIVPAFNEEETIAGTIRCLLQLEYPELEIVVVNDGSTDRTLEVLSREFELVPFPE 115 Query: 131 ---------------------HLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLD 169 ++RV+ N GKA A+ G AA+ +DGD++L Sbjct: 116 AYRISVPSTPVRRVYRSLRHRNVRVLD-KSNGGKADAMNAGINAARYPLFCAVDGDSVLQ 174 Query: 170 RDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRI-YGNVF 228 RD+ A +V P L +P A G R+ +V V E G I R Q + Y F Sbjct: 175 RDSVARVVRPFLEDPTTIACGGTIRVLNGCRVVDGF-VEEIGMPRGWIARVQVLEYLRAF 233 Query: 229 --------------TVSGVIAAFRRSALAEVGYWSDDMITEDIDIS------WKLQLNQW 268 VSG FRR+A+ E G + D + ED+++ ++L + Sbjct: 234 LFGRLGWAPVNALPNVSGAFGMFRRTAVIEAGGYRTDTVGEDMELVLRLHRLYRLSGRPY 293 Query: 269 TIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKEN 316 I + P +CW +P+TL L +QR+RW +G E N L+ + Sbjct: 294 RISFVPDPICWTEVPDTLAALRRQRVRWHRGLLEAIAGNRQLLFHRRG 341 >UniRef50_Q05S47 Group-specific protein n=1 Tax=Synechococcus sp. RS9916 RepID=Q05S47_9SYNE Length = 422 Score = 128 bits (321), Expect = 5e-28, Method: Compositional matrix adjust. Identities = 86/288 (29%), Positives = 129/288 (44%), Gaps = 17/288 (5%) Query: 34 VFFWPFFMSIM---WIV--------GGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIP 82 + FWP +S + WI+ + R + W E P P +SI+IP Sbjct: 2 LHFWPLILSALLGTWIIRTAIVVALAHQHNQAERTKEANWSEQQSTPC---KPLVSILIP 58 Query: 83 CFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLR--VIHLAQN 140 NEE + + I + + Y N E+I +NDGS D T A P R VI A N Sbjct: 59 ACNEEATITDCIQSCIQSNYGNKEIIIINDGSNDATSDQAHIAQANHPKARITVIDFACN 118 Query: 141 QGKAIALKTGAAAAKSEYLVCIDGDA-LLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRS 199 QGK AL G A E++V +D D D + ++ P++ + A TGN +I Sbjct: 119 QGKTAALNEGLLYANGEFIVTLDADTRFSDPSSLTRLLNPLIQYQDIAACTGNLKIANPD 178 Query: 200 TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDI 259 + IQ EY+ II KR Q + + G ++AFR + L +G +S + ED D Sbjct: 179 RTIPNIQSIEYTKIIQTFKRAQSHVNAIMILPGAVSAFRATELRSIGGFSASTLAEDADA 238 Query: 260 SWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKN 307 + L L + + + A P T++ L KQR+RW G + K+ Sbjct: 239 TMSLLLRRNRLLFTSSASAITQGPNTVQELLKQRIRWRVGQLQCLWKH 286 >UniRef50_C1F4G9 Polysaccharide deacetylase domain protein/glycosyl transferase, group 2 family protein n=1 Tax=Acidobacterium capsulatum ATCC 51196 RepID=C1F4G9_ACIC5 Length = 1170 Score = 127 bits (319), Expect = 9e-28, Method: Compositional matrix adjust. Identities = 113/409 (27%), Positives = 187/409 (45%), Gaps = 51/409 (12%) Query: 33 FVFF-WPFFMSIMWIVGGVYFWV--YRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKN 89 FVFF MS ++ G++ + +R R P G PA ++I+IP +NEEK Sbjct: 750 FVFFVGDVLMSGRLLIIGLFALIERFRTRRIPPGVYEPA--------VAILIPAYNEEKV 801 Query: 90 VEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQ-IPHLRVIHLAQ-NQGKAIAL 147 + TI +AL Y ++ V+ ++DGSTD+T + AQ I ++ L + N GKA AL Sbjct: 802 IVRTIRSALNSDYPHLHVVVIDDGSTDRTLEVAREAYAQEIADGKLTVLTKPNAGKAEAL 861 Query: 148 KTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQV 207 G + E V ID D ++ DA + +V +P+VGAV GN ++ + L + Q Sbjct: 862 NFGLRQIREEVYVGIDADTVIAVDAVSKLVR-HFADPKVGAVAGNAKVGNKVNLWTRWQA 920 Query: 208 GEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQ 267 EY + +R ++ V V G I A+R +A+ G + + + ED D++ L Sbjct: 921 LEYITGQNFERRALDLFNVVTVVPGAIGAWRTAAVLAGGCYPLNTVAEDADLTMNLVEQG 980 Query: 268 WTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYC 327 + + YE AL + P GL +QR RW+ G + K+ ++ FR + Sbjct: 981 YKVIYEDHALAFTEAPINANGLMRQRFRWSFGTLQAVFKH------RQAFRTNRAMGFFA 1034 Query: 328 LTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRY 387 L I F L +PL LL + L+QF++ N++ Sbjct: 1035 LPNIVVFQIL------------LPLASPFID---------LLFAVSLIQFLI-----NKH 1068 Query: 388 EHNLTSSLF----WIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWV 432 H T+S +I+F + F ++ T+L++F+ P + W+ Sbjct: 1069 YHPETASAASFDKLLIYF-LAFIVIDFFTSLLAFSLEPRHPANKGDGWL 1116 >UniRef50_A7GCY8 Glycosyl transferase, group 2 family n=66 Tax=Firmicutes RepID=A7GCY8_CLOBL Length = 420 Score = 127 bits (318), Expect = 1e-27, Method: Compositional matrix adjust. Identities = 99/392 (25%), Positives = 193/392 (49%), Gaps = 19/392 (4%) Query: 58 RHWPWGENAPAPQLKDN--PSISIIIPCFNEEKNVEETIHAALAQRY--ENIEVIAVNDG 113 R++ N+ LK+ P +SI++P NEEK + T+ + L Y + +E+I +ND Sbjct: 34 RYYLKTLNSELKGLKNEKYPKVSILVPAHNEEKVIGRTVKSILLLNYPKDKMELIVINDN 93 Query: 114 STDKTRAILDRMAAQIP--HLRVIH---LAQNQGKAIALKTGAAAAKSEYLVCIDGDALL 168 S+D T+ IL ++ + + ++I+ + +GK+ AL G + +++ D D Sbjct: 94 SSDNTKKILKQIQKEYRSYNFKIINTDNITGGRGKSNALNIGYKHSSGDFIAVYDADNTP 153 Query: 169 DRDAAAYIVEPMLYNPRVGAVTGNPRIRT--RSTLVGKIQVGEYSSIIGLIKRTQRIYGN 226 D++A Y++E ++ + +GAV G R R R+ L I + E S + + + N Sbjct: 154 DKNALKYLMETIIEDEHLGAVIGKFRTRNKDRNMLTRFINI-ETLSFQWMCQAGRWNLLN 212 Query: 227 VFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETL 286 + T+ G R++ + ++ W I ED +IS+++ + I + P ++ W PE L Sbjct: 213 LCTIPGTNFVVRKNIIQKLNGWDPKAIAEDTEISFRIYELGYKIKFVPYSVTWEQEPENL 272 Query: 287 KGLWKQRLRWAQGGAEVFLKNMTRLWRKENFR-MWPLFFEYCLTTIWAFTCLVGFIIYAV 345 K +KQR RWA+G V LK +++ + ++ +F+ + + ++ + ++ I++ V Sbjct: 273 KVWFKQRTRWAKGNIYVLLKYFKNMFKGTSKDIIFDIFYFFSVYFLFLSSVIISDILFIV 332 Query: 346 QLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVI- 404 G+ LNI L I + IL L +L+ ++L +E + L I++F Sbjct: 333 ---GIFLNINLHVIGNFNVLWILAYVLFVLEVSLTLTLEKGESNKENLILVPIMYFTYCQ 389 Query: 405 FWMLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 WM+ ++ + R L K+ +W +R Sbjct: 390 MWMIVALRGIIQYIRDKLF--KKEIKWYKTER 419 >UniRef50_D2L608 Glycosyl transferase family 2 n=1 Tax=Desulfovibrio sp. FW1012B RepID=D2L608_9DELT Length = 555 Score = 126 bits (316), Expect = 2e-27, Method: Compositional matrix adjust. Identities = 89/318 (27%), Positives = 154/318 (48%), Gaps = 22/318 (6%) Query: 74 NPSISIIIPCFNEEKNVEETIHAALAQRY--ENIEVIAVNDGSTDKTRAILDRMAAQI-- 129 P++S++IPCFNEE +E+TI L Q Y E +EVI V+DGS+D + A+L+R+ ++ Sbjct: 193 KPAVSVVIPCFNEEVWIEKTIRGCLNQNYPEELLEVIVVDDGSSDGSVAVLERVQRKVFE 252 Query: 130 ---PHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRV 186 L V +N+GK A+ GA AK E +V +D D+ L D+ IV+P L NP++ Sbjct: 253 EVGERLFVHVFPENRGKRHAMAEGARRAKGEVVVFVDSDSFLQPDSILQIVQP-LKNPKI 311 Query: 187 GAVTGNPRIRTR-STLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAE- 244 GA TG + + + + ++Q Y + K + I+ V +SG +AA+R+ L Sbjct: 312 GAATGRCEVENKWTNTLTRMQAVRYFIGFRIFKAAESIFDAVTCLSGPLAAYRKDVLLRY 371 Query: 245 VGYWSDD-------MITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWA 297 +G W + +D ++ + + + + Y+ A+ ++P T + + Q++RW Sbjct: 372 LGAWVNQRYLGQPATFGDDRSLTNYVLGSHYAV-YQHTAITHTIVPSTYRQFYIQQMRWK 430 Query: 298 QGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELT 357 + L+ +W KE F + L + L F+ P Sbjct: 431 RSWLRESLRAALFMWYKEPFMAISFYLGVILPLLAPVVVLRAFVFVPAAYGVWP----TM 486 Query: 358 HIAATHTAGILLCTLCLL 375 ++A +L+CT LL Sbjct: 487 YVAGVFFMSMLMCTSYLL 504 >UniRef50_C5A6A1 Glycosyltransferase n=2 Tax=Thermococcus RepID=C5A6A1_THEGJ Length = 451 Score = 126 bits (316), Expect = 2e-27, Method: Compositional matrix adjust. Identities = 113/385 (29%), Positives = 178/385 (46%), Gaps = 26/385 (6%) Query: 59 HWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRY--ENIEVIAVNDGSTD 116 H+ E + P + P +S++IP NE N+E I + Q Y E EVI V+DGS D Sbjct: 74 HFRENEGS-KPNPRFFPKVSLVIPAHNEAMNIERLIESIQYQDYPFECYEVILVDDGSVD 132 Query: 117 KTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALL-DRDAAAY 175 T I +R +RVI +N GKA AL+TG AAK + ++ +D D+ D + Sbjct: 133 GTPEIAERYG-----IRVIRHERNMGKAKALETGIKAAKGDVIITLDADSYFADGSSLRN 187 Query: 176 IVEPMLYNPRVGAVTGNPRIRTRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVI 234 IVE + P VG TG RI RS L+ K QV EY + +R Q + V G Sbjct: 188 IVENLFSRPFVGVSTGAIRIDVRSGKLIEKFQVIEYLHSFEVGRRVQGYLDWLLVVPGAF 247 Query: 235 AAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRL 294 +AF+ + + D + ED +++ T +EP+A + + + L++QR+ Sbjct: 248 SAFKGYLIKSLPAIPRDTLAEDFELAMITYRAGLTSNFEPKAAVYTEPATSWRELYRQRI 307 Query: 295 RWAQGGAEVFLKNMTRLW-RKENFRMWPLFF-----EYCLTTIWAFTCLVGFIIYAVQLA 348 RW GG +V K + RK + LFF EY L + F +V F + + L Sbjct: 308 RWYYGGLQVMAKYHDMIMNRKYGEKGLFLFFHMILLEYILPVLQVF-GIVAFPL--IMLV 364 Query: 349 GVPLNIELTHIAATHTAGILLCTLCL-LQFIVSLMIEN---RYEHNLTSSLFWIIWFPVI 404 L +E+ I + + L L LQ++ +++ E N ++L ++ PVI Sbjct: 365 HNFLGLEILDITLPFPLMMAVFLLVLFLQYLPGVLMSGIAMAIERNPRTALGYL---PVI 421 Query: 405 FWMLSLATTLVSFTRVMLMPKKQRA 429 F + L+S ++ M + R Sbjct: 422 FLYYPIYNPLLSLAKIDAMLRFLRG 446 >UniRef50_A9EU26 Group-specific protein n=3 Tax=Phaeobacter gallaeciensis RepID=A9EU26_9RHOB Length = 1136 Score = 126 bits (316), Expect = 2e-27, Method: Compositional matrix adjust. Identities = 67/223 (30%), Positives = 121/223 (54%), Gaps = 2/223 (0%) Query: 77 ISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIH 136 ++++IP +NEEK++ +TI++ L Y N+ V+ V+DGSTD T ++ + P+++++ Sbjct: 759 VTVVIPAYNEEKSILKTIYSVLESDYPNLSVLVVDDGSTDDTHGLVTKTYKDNPNVQILR 818 Query: 137 LAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIR 196 N GK A + ++Y+V ID D ++ DA +++P L NP+VGAV G + Sbjct: 819 -QPNGGKWKAANLAFSHVTTDYVVAIDADTIVAPDAIRRLMQP-LRNPQVGAVAGKIMVG 876 Query: 197 TRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITED 256 + L+ K++ EY+ + +R + V G A+R A+ + GY+S + ED Sbjct: 877 NSNNLLTKLEKLEYTVAQNIDRRAYETINAIMVVPGAFGAWRTEAVRKCGYYSSQTLAED 936 Query: 257 IDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQG 299 D++ L + + RA + P ++ L KQR+RW+ G Sbjct: 937 TDLTISLLEQGYEVRAAERAYAYTEAPASVGTLMKQRMRWSIG 979 >UniRef50_Q2RGJ6 Glycosyl transferase, family 2 n=7 Tax=Firmicutes RepID=Q2RGJ6_MOOTA Length = 466 Score = 125 bits (315), Expect = 2e-27, Method: Compositional matrix adjust. Identities = 101/355 (28%), Positives = 170/355 (47%), Gaps = 28/355 (7%) Query: 24 FHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPC 83 F S L++ F+ + P ++ V + +R P+ + P D P I+I+I Sbjct: 75 FVSAILVITFIAYVPGYLVAFLAVS-----LLVDRQPPFKVSDP-----DLP-ITILIAA 123 Query: 84 FNEEKNVEETIHAALAQRYEN-IEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQG 142 NE N+ T+ Q Y I V+ V++GSTD T + A ++ + ++ G Sbjct: 124 RNEASNIGMTLQYIANQNYRGPISVVLVDNGSTDGTADVALAKARELKLNLLCLREESPG 183 Query: 143 KAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNP-RVGAVTGNPRIR-TRST 200 K AL G A + Y + +D D LL R A +IV +L +P VGAV G+ +R +R Sbjct: 184 KNFALNAGLAKIVTPYFITLDADTLLHRFAVKHIVARLLSSPPDVGAVAGHVLVRNSRDN 243 Query: 201 LVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDIS 260 + +IQ +Y I IKRTQ +Y G + ++ A+ +V W D I EDI ++ Sbjct: 244 FLTRIQEWDYFLGIASIKRTQGLYQGTLVAQGAFSLYKTEAVRQVSGWPD-TIGEDIVLT 302 Query: 261 WKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMW 320 WKLQ + +++EP A+ + +P+ ++ +QR RWA+G E +KN+ ++ + + Sbjct: 303 WKLQQAGYRVYFEPTAVAFTAVPKVVRHFVRQRSRWARGMVE-GMKNVPPWYQPSYLKKF 361 Query: 321 PLFFEYCLTTIWAFTCLV------------GFIIYAVQLAGVPLNIELTHIAATH 363 + + I F LV +I+ L +PLNI ++ I T+ Sbjct: 362 LTGIDLVIPVIDLFYTLVWIPGLILACFGKYYIVGPYTLFVLPLNIGVSLIMYTY 416 >UniRef50_B7QVL5 Glycosyl transferase, family 2 n=1 Tax=Ruegeria sp. R11 RepID=B7QVL5_9RHOB Length = 1140 Score = 125 bits (315), Expect = 2e-27, Method: Compositional matrix adjust. Identities = 68/223 (30%), Positives = 121/223 (54%), Gaps = 2/223 (0%) Query: 77 ISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIH 136 ++++IP +NEEK++ +TI++ L Y N+ V+ V+DGSTD T ++ + P ++++ Sbjct: 763 VTVVIPAYNEEKSILKTIYSVLDSDYPNLSVLVVDDGSTDATYDLVSKTYLNNPKVQILR 822 Query: 137 LAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIR 196 N GK A A ++Y+V ID D ++ DA +++P L NP+VGAV G + Sbjct: 823 -QPNGGKWKAANLAFAHVTTDYVVAIDADTIVAPDAIRRLMQP-LRNPKVGAVAGKIMVG 880 Query: 197 TRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITED 256 + L+ K++ EY+ + +R + V G A+R +A+ E GY+S + ED Sbjct: 881 NSNNLLTKLEKLEYTVAQNIDRRAYETINAIMVVPGAFGAWRTAAVRECGYYSSQTLAED 940 Query: 257 IDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQG 299 D++ L + + +A + P ++ L KQR+RW+ G Sbjct: 941 TDLTISLLEAGYVVRAAEKAYAYTEAPASVGTLMKQRMRWSIG 983 >UniRef50_B9MNQ5 Glycosyl transferase family 2 n=1 Tax=Anaerocellum thermophilum DSM 6725 RepID=B9MNQ5_ANATD Length = 464 Score = 125 bits (314), Expect = 3e-27, Method: Compositional matrix adjust. Identities = 92/348 (26%), Positives = 164/348 (47%), Gaps = 49/348 (14%) Query: 77 ISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAIL---------DRMAA 127 +S+++P +NEE+ + +++ + L Y EV+ +NDGS D T +L DR Sbjct: 58 VSLLVPAYNEEETIAKSVKSFLQIEYPEYEVVVINDGSKDGTLDVLKNEFDLYIVDRKFR 117 Query: 128 QI---PHLRVIHLA-----------QNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAA 173 +I ++VI+ + +N GKA AL G Y+ +D D++L+RD+ Sbjct: 118 KILSTKEIKVIYYSKKYSNLIVVDKENGGKADALNAGINVCTYPYVCSLDADSILERDSI 177 Query: 174 AYIVEPMLYNP-RVGAVTGNPRI---------------RTRSTLVGKIQVGEYSSIIGLI 217 A +++P NP V A TG RI + S+ + + Q+ EY Sbjct: 178 AKVMQPFFDNPYEVVATTGIVRIVNGTELDSFGNIKKLKLPSSSLARFQIIEYLRAFLGA 237 Query: 218 KRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWT------IF 271 ++ + G++ SG AAF ++A+ +VG +SD + ED++I KL+ N + + Sbjct: 238 RKGLSMIGSLVIASGAFAAFNKNAIIKVGGFSDRTVGEDMEIVVKLRKNSYKEGALGRVE 297 Query: 272 YEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKEN--FRMWPLFFEYCLT 329 + P + W PETLK L KQR RW +G +V + L+ + ++ + ++ Sbjct: 298 FVPDPIVWTQCPETLKDLSKQRRRWQRGLCQVIFMHKDILFNPKYGILGLFAMPYQLMFE 357 Query: 330 TIWAFTCLVGFIIYAVQLAGVPLNIE--LTHIAATHTAGILLCTLCLL 375 + F ++G+I + +N+E L A GI++ L +L Sbjct: 358 LLGPFVEMLGYIFIPISYFAHIINLEVALFFFAVEIMYGIVISILAVL 405 >UniRef50_B1V7Q3 Chitin synthase n=1 Tax=Clostridium perfringens D str. JGS1721 RepID=B1V7Q3_CLOPE Length = 466 Score = 125 bits (314), Expect = 4e-27, Method: Compositional matrix adjust. Identities = 92/320 (28%), Positives = 149/320 (46%), Gaps = 56/320 (17%) Query: 29 LMMRFVFFWP--FFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNE 86 M ++F + FF+S ++ + +Y + ++R + K+ ISI++P +NE Sbjct: 12 FFMYYIFIYAVIFFISTVYSIIDLYEFNLKKR---FNSTINIYDKKNYTPISILVPAYNE 68 Query: 87 EKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAIL-DRMAAQ---------IPHLRVIH 136 EK + + I + LA Y E+I VNDGS DKT +L +R + + I ++I Sbjct: 69 EKTILKCIDSLLALEYPEYEIIIVNDGSKDKTLEVLINRFSLKKIQRPIRKSINCKKIID 128 Query: 137 L-------------AQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYN 183 + +N GKA AL G A+ + +D D++L RD+ + IV P + N Sbjct: 129 IYEGLEKVNITLINKENGGKADALNMGINASTYPLFLTLDADSVLQRDSLSNIVVPFIEN 188 Query: 184 PRVGAVTGNPRIRTRSTLVGKI--------------QVGEYSSIIGLIKRTQRIYGNVFT 229 AV GN +I L + Q+ EY + T R++ N F Sbjct: 189 ENTIAVGGNVKISNSVLLENGVVKKNRLPKKWLIIFQIIEYYRVFW----TTRVWFNKFN 244 Query: 230 ----VSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQL------NQWTIFYEPRALCW 279 +SG F++S++ VG ++ + I ED+++ KL N+++I Y P A+CW Sbjct: 245 GNLIISGAFGVFKKSSVINVGGYNSNSIGEDMELVMKLHSFNKKNENKYSIQYTPNAICW 304 Query: 280 ILMPETLKGLWKQRLRWAQG 299 PE LK L QR RW G Sbjct: 305 SQAPEKLKDLKSQRKRWHMG 324 >UniRef50_B4D4P9 Polysaccharide deacetylase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D4P9_9BACT Length = 1192 Score = 125 bits (313), Expect = 4e-27, Method: Compositional matrix adjust. Identities = 94/299 (31%), Positives = 143/299 (47%), Gaps = 49/299 (16%) Query: 74 NPSISIIIPCFNEEKNVEETIHAALAQRYE-NIEVIAVNDGSTDKTRAILDRMAAQIPHL 132 PS+S++I FNE K +++T+ + L Y ++E+I V+DGS D T A ++ +AA P + Sbjct: 799 QPSVSVVIAAFNEGKVIQKTLRSVLDTDYTGDLELIIVDDGSRDDTAAKVEALAAAEPRI 858 Query: 133 RVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGN 192 R+I N+GKA AL+ G A++E + +D D L + + +V+P L +P VGAV+GN Sbjct: 859 RLIQ-QPNRGKASALRHGVTVARNEIVAFLDADTLFEHHTLSALVQP-LEHPLVGAVSGN 916 Query: 193 PRI-----------------------------RTRSTL-------VGKIQVGEYSSIIGL 216 R+ T +TL + Q EY L Sbjct: 917 ARVGNLLTGHDSSGASSASANSPGFLSRWWAGSTWATLSLLARRFIASCQSLEYICGFNL 976 Query: 217 IKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRA 276 +R + + V G ++A RRSA+ G +SDD + ED D++ + + I Y P A Sbjct: 977 DRRAYTEWNCITVVPGAVSALRRSAIVAGGGFSDDTLAEDTDLTLVMHRLGYRIEYAPEA 1036 Query: 277 LCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFE---YCLTTIW 332 + W PET GL KQR RWA G M LW+ + P + + L IW Sbjct: 1037 VAWTEAPETTIGLVKQRSRWAFG-------TMQCLWKHRDMVFNPRYGALGWFSLPNIW 1088 >UniRef50_C4WPV5 Putative uncharacterized protein n=1 Tax=Ochrobactrum intermedium LMG 3301 RepID=C4WPV5_9RHIZ Length = 426 Score = 124 bits (311), Expect = 6e-27, Method: Compositional matrix adjust. Identities = 73/230 (31%), Positives = 122/230 (53%), Gaps = 7/230 (3%) Query: 72 KDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPH 131 ++PS+S+I+ NEE +E + + Q +N E++ V+DGSTD+ + + Q Sbjct: 61 SEHPSVSVILVGHNEEDALEACVRSLHEQSIQNFEIVIVSDGSTDRMVEVAQSLVKQGLA 120 Query: 132 LRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTG 191 +V+ + GK+ + A A + ++ +D D DR A Y++ P +P VGAV G Sbjct: 121 TKVLSTSLRGGKSSGINLACAYASGDIIINVDCDCSFDRYAFEYLLRP-FSDPAVGAVCG 179 Query: 192 N--PRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEV-GYW 248 + PR + ++++ + Q EY I + KR V SG +AFRR AL ++ G+ Sbjct: 180 DIAPR-NSDASVISQFQEIEYLQSIAVGKRIASAVNQVVCASGAFSAFRRKALDDIRGF- 237 Query: 249 SDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQ 298 D ED+D + +L+L W I Y P A+C+ +P T+ +QRLRW + Sbjct: 238 -DVGGGEDLDTTIRLRLKGWRIEYAPEAVCYTDVPTTVYQYIRQRLRWER 286 >UniRef50_A4YD58 Glycosyl transferase, family 2 n=12 Tax=Sulfolobaceae RepID=A4YD58_METS5 Length = 395 Score = 124 bits (311), Expect = 6e-27, Method: Compositional matrix adjust. Identities = 93/337 (27%), Positives = 160/337 (47%), Gaps = 24/337 (7%) Query: 65 NAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYE--NIEVIAVNDGSTDKTRAIL 122 + P+ PS S+++P NEEK + + + Q Y+ E+I + DGSTD T + Sbjct: 31 KSDEPKTSSGPSFSLLVPVRNEEKVLGRLLERLVNQEYDRSKYEIIVLEDGSTDNTLGVC 90 Query: 123 DRMAAQIPHLRVIHLAQNQ---GKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEP 179 ++ + ++ +HL ++ GK+ AL G ++ + + D D + D Y+ + Sbjct: 91 NKFSEMYSIIKCVHLEKSNVVNGKSRALNYGLKISRGDIIGVFDADTVPRLDVLGYVAQK 150 Query: 180 MLYNPRVGAVTGN-PRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFT-VSGVIAAF 237 + N RVG V G I R ++V ++ E + + R +F + G + Sbjct: 151 FISNSRVGGVQGRLVPINVRESIVARLASLE--ELFSEYSISGRARAGLFVPLEGTCSFV 208 Query: 238 RRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWA 297 RR AL +VG W+++++TED+D+S KL + I Y P W +P T L +QRLRW Sbjct: 209 RRDALEKVGGWNENVLTEDLDLSLKLTSLNYLIVYSPSVQSWREVPVTFSSLVRQRLRWY 268 Query: 298 QGGAEVFLKNMTRL---WRKENFRMW---PLFF-----EYCLTTIWAFT--CLVGFIIYA 344 +G E+ ++ ++R WR + M P+F Y L I+++ L+ II Sbjct: 269 RGNFELTMR-ISRFKFTWRLVDAAMLVGTPVFMVLSLANYSLVFIYSYQLHVLIAAIISF 327 Query: 345 VQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSL 381 + + L I ++ T I+L L L F +SL Sbjct: 328 SSMMTLLLIIMISRRHMIETIYIILSAL-YLNFTISL 363 >UniRef50_C7RAF6 Glycosyl transferase family 2 n=1 Tax=Kangiella koreensis DSM 16069 RepID=C7RAF6_KANKD Length = 453 Score = 124 bits (311), Expect = 7e-27, Method: Compositional matrix adjust. Identities = 78/238 (32%), Positives = 125/238 (52%), Gaps = 10/238 (4%) Query: 77 ISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKT-----RAILDRMAAQIPH 131 +S+++P +NEE + +TI + L Y + E+I +NDGSTDKT + I D A P Sbjct: 85 VSVLLPAWNEEVGIVKTIESVLQSNYHDFELIVINDGSTDKTDQLVRQFIADYEADSNPK 144 Query: 132 LRVIHLA-QNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVT 190 + + +L+ N GKA AL G A ++ ID D ++DR+A ++ +P +GAV Sbjct: 145 VSIKYLSLPNGGKAHALNKGLELATGNIVITIDADCIVDRNAIYRFIQ-RFDDPSIGAVA 203 Query: 191 GNPRIRTRSTLVGKIQVGEYSSIIGLI-KRTQRIYGNVFTVSGVIAAFRRSALAEVGYWS 249 GN + + + +Q EY + G KR + V + G AA+RR L + + Sbjct: 204 GNIIVGNKRKPLEWLQQLEY--VCGFFYKRADSYFNAVHIIGGAAAAYRRDVLINLKGFD 261 Query: 250 DDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKN 307 +++ITEDID+S + + + Y A+ + P KG+ KQRLRW G +LK+ Sbjct: 262 EEIITEDIDLSMSILSHGYKTRYAHDAVVYTEGPTEWKGIRKQRLRWKYGRILCYLKH 319 >UniRef50_Q46B82 Glucosaminyltransferase n=2 Tax=Methanosarcina RepID=Q46B82_METBF Length = 403 Score = 123 bits (309), Expect = 1e-26, Method: Compositional matrix adjust. Identities = 73/227 (32%), Positives = 117/227 (51%), Gaps = 5/227 (2%) Query: 74 NPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLR 133 NP +SI++P +NE K + I + L Y EVI V+DGS+D T L+ M + Sbjct: 46 NPFVSIVVPAYNEGKVIGHCIKSILESNYSEYEVILVDDGSSDNT---LEEMQHYETNSH 102 Query: 134 VIHLAQ-NQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGN 192 VI + + N GKA AL G AK E + +D D + D + ++ + + VGAV GN Sbjct: 103 VIVVTKKNGGKASALNVGLKLAKGEVIFFVDADGIFAPDTISKMLSGFI-SEDVGAVCGN 161 Query: 193 PRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDM 252 L ++ G ++R + VSG I A+R S L + G + + Sbjct: 162 DAPINLDKLQTQLANLLTHVGTGFVRRALSTIDCLPVVSGNIGAYRSSTLEKTGPFLEGF 221 Query: 253 ITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQG 299 I ED++++W++ + + ++P A+ + P T+ GLWKQR+RWA+G Sbjct: 222 IGEDMELTWRVHKAGYKVVFQPWAIVYAEAPSTITGLWKQRVRWARG 268 >UniRef50_B9LWG0 Glycosyl transferase family 2 n=1 Tax=Halorubrum lacusprofundi ATCC 49239 RepID=B9LWG0_HALLT Length = 509 Score = 123 bits (308), Expect = 2e-26, Method: Compositional matrix adjust. Identities = 90/301 (29%), Positives = 149/301 (49%), Gaps = 21/301 (6%) Query: 19 LCVAYFHSGELMMRFVFFWPFFMSIMWIVG----GVYFW------VYRERHWPWGENAPA 68 LCVA+ G ++ + P S +++ +Y+W VY + + + P Sbjct: 77 LCVAFVCYGFGVVTLGWVQPSLTSGVYLFAIALIFLYYWFIALAAVYHNQRYHSQDAPPE 136 Query: 69 PQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYEN--IEVIAVNDGSTDKTRAILDRMA 126 P + SISII+P +NEE ++ TI A L Y + E+I ++DGSTD T A + A Sbjct: 137 P----SASISIIVPAYNEEGYIQRTITALLDADYPDGKREIIVIDDGSTDNTCA--EARA 190 Query: 127 AQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRV 186 + + V+ N GK AL G A +E ++ +D D++ ++DA +V P L + V Sbjct: 191 FESETVSVVT-KDNGGKYSALNYGLLFASNEIILTVDADSVPEKDALKQMVAP-LSDQSV 248 Query: 187 GAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVG 246 GAV I R +L+ Q EY+ + + +R ++G V V G + A+RR L E+ Sbjct: 249 GAVASTVTIWNRGSLLTGCQQLEYTIGVNVYRRMLDLFGIVMVVPGCLGAYRRDVLDEIQ 308 Query: 247 YWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLK 306 + +TED DI+ K+ + + A + P++L+ L+ QRLRW +G K Sbjct: 309 GFDPQTLTEDFDITVKVLRAGYEV-RSSEARVYTEAPDSLRDLYNQRLRWYRGNYMTIFK 367 Query: 307 N 307 + Sbjct: 368 H 368 >UniRef50_B9YCC6 Putative uncharacterized protein n=1 Tax=Holdemania filiformis DSM 12042 RepID=B9YCC6_9FIRM Length = 477 Score = 122 bits (307), Expect = 2e-26, Method: Compositional matrix adjust. Identities = 99/316 (31%), Positives = 145/316 (45%), Gaps = 55/316 (17%) Query: 33 FVFF---WPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDN--PSISIIIPCFNEE 87 FVF+ + F+ I +VG +YR R+ +N +LK + +S+I+P NEE Sbjct: 16 FVFYLIGYSTFLFISVVVGAS--VLYRNRYLEKMKN----ELKHDYYVPVSVIVPAHNEE 69 Query: 88 KNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDR----------MAAQIP------- 130 V +T+ + L Y+ EV+ V+DGSTDKT L + + Q+P Sbjct: 70 VTVVDTVRSLLQLDYKLYEVLVVDDGSTDKTSQRLIKAFNLIETQRPIRLQVPCQKQKAV 129 Query: 131 HLRVIH-------LAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYN 183 + R IH + +N GKA AL G A++ Y VC+D D+LL RD+ Y V P+L + Sbjct: 130 YEREIHGIRVTLIIKENGGKADALNMGINASRFPYFVCMDADSLLQRDSLEYAVRPLLES 189 Query: 184 PRVGAVTGNPRIRTRSTLV-GKI-------------QVGEYSSIIGLIKRTQRIYGNVFT 229 V AV G RI TL GK+ Q+ EY + Y Sbjct: 190 DEVIAVGGLVRISNGVTLENGKVKHYHLPFNPLVCMQILEYDRSFLASRILMDQYNGNLI 249 Query: 230 VSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLN------QWTIFYEPRALCWILMP 283 +SG F++ + VG + + ED+++ KL + + I Y P A+CW P Sbjct: 250 ISGAFGLFKKDLVITVGGYDRSTMGEDMELVTKLHVFCRAHDIPYAIRYAPDAICWSQAP 309 Query: 284 ETLKGLWKQRLRWAQG 299 TL L KQR RW G Sbjct: 310 STLLDLTKQRRRWHIG 325 >UniRef50_UPI0001C30EDA glycosyl transferase family 2 n=1 Tax=Conexibacter woesei DSM 14684 RepID=UPI0001C30EDA Length = 433 Score = 121 bits (304), Expect = 5e-26, Method: Compositional matrix adjust. Identities = 81/259 (31%), Positives = 135/259 (52%), Gaps = 13/259 (5%) Query: 67 PAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRY--ENIEVIAVNDGSTDKTRAILDR 124 P + P ++I++P FNEE V ++ + LA Y + +E++A++DGSTD+T A + Sbjct: 64 PPRDVGHEPHVAIVMPAFNEEAVVARSLRSLLALDYPVDKLELVAIDDGSTDETLARMRE 123 Query: 125 MAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNP 184 +AA+ P +RVI L +NQGK A+ G A +E L +D D+ LD DA +V+ P Sbjct: 124 VAAESPRVRVIELGRNQGKRAAMAAGMRATDAEILAFVDSDSSLDPDALHKLVQGFA-EP 182 Query: 185 RVGAVTGNPRI-RTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSA-L 242 VGAV G+ + + + ++QV Y ++K ++ I+ V SG AA+RR A L Sbjct: 183 GVGAVCGHADVANVDDSWLTRMQVVRYFVAFQVVKASESIFACVTCCSGCFAAYRRDAIL 242 Query: 243 AEVGYW-------SDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLR 295 + +W S +D ++ + L W + Y A+ ++P T++ +Q+LR Sbjct: 243 PHLDWWEHQTFLRSPATFGDDRALT-NVVLRDWKVRYAANAVSHTVVPHTMRQFLRQQLR 301 Query: 296 WAQGGAEVFLKNMTRLWRK 314 W + L +WRK Sbjct: 302 WKRSWTRESLILARFVWRK 320 >UniRef50_C4I9P5 Inner membrane glycosyltransferase n=39 Tax=Bacteria RepID=C4I9P5_BURPS Length = 520 Score = 120 bits (300), Expect = 1e-25, Method: Compositional matrix adjust. Identities = 100/342 (29%), Positives = 156/342 (45%), Gaps = 29/342 (8%) Query: 54 VYRERHWPWG-------ENAP--APQLKDNPSISIIIPCFNEEKNVEETIHAALAQRY-- 102 VY RH+ +G + AP A D P I++ + NEE V + + A LA Y Sbjct: 115 VYASRHYVFGLDRLFKPQRAPYRAITHADWPEITVFVAAHNEEAVVADCLTALLATTYPR 174 Query: 103 ENIEVIAVNDGSTDKTRAILDRMAAQIPHL-RVIHLAQNQ-GKAIALKTGAAAAKSEYLV 160 E + ++ VND STD TRA++D + A+ P L + H + GKA ALK + + +V Sbjct: 175 ERLTIVPVNDRSTDNTRALIDEVQARAPELIKPFHRESGKPGKAAALKDALREIRGDIMV 234 Query: 161 CIDGDALLDRDAAAYIVEPMLYNPRVGAVTGN--PRIRTRSTLVGKIQVGEYSSIIGLIK 218 D D L +V P ++P VGAV G P+ R+ L + + E + + + Sbjct: 235 VFDADYLPRPGLLKELVAP-FFDPEVGAVMGRVVPQNADRNLLARLLDL-ERAGGYQVNQ 292 Query: 219 RTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALC 278 + + G V G + R+SAL VG W DD + ED D++++L L+ W Y A C Sbjct: 293 QARNNLGLVPQYGGTVGGVRKSALDAVGGWRDDTLAEDTDMTYRLLLSNWRTVYLNHAEC 352 Query: 279 WILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLV 338 + +PE +Q RWA+G + + + L R P+ C L+ Sbjct: 353 YEEVPERWPVRARQLTRWAKGHNQTLFRYLIPLLRS------PVTPRRCRLD---GALLL 403 Query: 339 GFIIYAVQLA---GVPLNIELTHIAATHTAGILLCTLCLLQF 377 G + LA G+ L + LT+ + G+L+ L F Sbjct: 404 GVFVMPALLALAWGIALALYLTNGIDSLVLGLLVSVFALFAF 445 >UniRef50_D1BHB0 Glycosyl transferase n=1 Tax=Sanguibacter keddieii DSM 10542 RepID=D1BHB0_SANKS Length = 411 Score = 119 bits (299), Expect = 2e-25, Method: Compositional matrix adjust. Identities = 74/248 (29%), Positives = 129/248 (52%), Gaps = 32/248 (12%) Query: 67 PAPQLK--DNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDR 124 PA L+ NP +S+++P F+E + + + + LA RY N EV+ V+DGS D T ++ Sbjct: 37 PADPLRVPSNPFVSVVVPAFDEGRVLRRCVDSILADRYLNKEVVLVDDGSRDDTWEVMKS 96 Query: 125 MAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLY-- 182 + P + V H N GKA AL G + ++ L+ +D D + + + + ML Sbjct: 97 YRGR-PGVVVAH-QTNAGKAAALNHGISLSRGSILMFVDADGVF----SGWTIRGMLRGF 150 Query: 183 -NPRVGAVTGN----------PRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVS 231 +PRVGAV GN PR+ T T + +++R + VS Sbjct: 151 DDPRVGAVCGNDAPVNLDRFQPRLLTLLT-----------HVTAMVRRALAQVNCLTIVS 199 Query: 232 GVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWK 291 G A RR L EVG +++ ++ ED++++W+++ + + ++P A+ + +P + LW+ Sbjct: 200 GNCGAVRREVLDEVGGFTEGLLGEDLELTWRVRRAGYRVSFQPEAIVYAEVPAGVGPLWR 259 Query: 292 QRLRWAQG 299 QR+RWA+G Sbjct: 260 QRVRWARG 267 >UniRef50_B1HSU1 Biofilm PIA synthesis N-glycosyltransferase icaA n=1 Tax=Lysinibacillus sphaericus C3-41 RepID=B1HSU1_LYSSC Length = 403 Score = 119 bits (297), Expect = 3e-25, Method: Compositional matrix adjust. Identities = 76/250 (30%), Positives = 129/250 (51%), Gaps = 6/250 (2%) Query: 68 APQLKDNPSISIIIPCFNEEKNVEETIHAALAQRY--ENIEVIAVNDGSTDKTRAILDRM 125 + ++ D PS+S+ IP NE +E+T+ A Y + +EVI +ND S+D+T I + Sbjct: 22 SKKMTDVPSVSVFIPAHNEALVIEQTLRAMSRLYYPKDKLEVIVINDNSSDETGNIALQY 81 Query: 126 AAQIPHLRVIHLAQ---NQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLY 182 A + P +RVI + +GK+ AL + A + + +V D D +R A Y+V ++ Sbjct: 82 AEKFPFMRVIETVEPNKGKGKSSALNSALADSTGDIVVVYDADNTPERMAVWYLVMGLMN 141 Query: 183 NPRVGAVTGNPR-IRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSA 241 +P+ A G R I T + + E + + + + V T+ G A RR+ Sbjct: 142 DPKAAATVGKFRVINAAETWLTRFINIETICFQWMAQGGRWKWFKVATIPGTNFAIRRTV 201 Query: 242 LAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGA 301 L ++G W + ED +++ ++ + I + P+A+ W PETLK WKQR RWA+G Sbjct: 202 LEQLGGWDVKALAEDTELTIRVYNLGYHIRFFPKAITWEQEPETLKVWWKQRTRWARGNQ 261 Query: 302 EVFLKNMTRL 311 V LK +++ Sbjct: 262 YVVLKFLSQF 271 >UniRef50_A6FMF5 N-acetylglucosaminyltransferase n=1 Tax=Roseobacter sp. AzwK-3b RepID=A6FMF5_9RHOB Length = 419 Score = 119 bits (297), Expect = 3e-25, Method: Compositional matrix adjust. Identities = 83/278 (29%), Positives = 134/278 (48%), Gaps = 17/278 (6%) Query: 66 APAPQLKDNPS------ISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTR 119 A A L++ P +S++I N++ + T+ + +Q ++N+E+I V DG Sbjct: 40 AVASVLRERPDSPKDLLVSVLIVGLNDKAGLARTLQSLKSQTHKNLEIIVVQDGPGTTMY 99 Query: 120 AILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEP 179 + A + I GKA AL G SEY+V D D DR+A + I+ Sbjct: 100 QLGKEFEASGDIQKYIKTEIRGGKAAALNIGFTRCSSEYVVVADIDTSFDRNAISTIIGR 159 Query: 180 MLYNPRVGAVTGN--PRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAF 237 +L + ++G V+GN PR S +Q EY + I + ++ Q ++G + VSG AF Sbjct: 160 LLSDKKIGGVSGNLAPR-NLASGFWPSVQAIEYLNNITIGRQFQSMFGILTIVSGAFGAF 218 Query: 238 RRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWA 297 R SA+ VG W D +D +++ K++L+ W I +EP A +P++ LW QRLRW Sbjct: 219 RASAIRSVGGW-DVGPGDDSNLTTKIRLSGWKIQFEPDAWALTDVPDSFLALWTQRLRWN 277 Query: 298 QGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFT 335 + ++N WRK P + L + A T Sbjct: 278 RS----LIRNR---WRKFRSVFNPFLHNFSLVEVVAAT 308 >UniRef50_B4U2G9 Glycosyl transferase, group 2 family protein n=22 Tax=Streptococcus RepID=B4U2G9_STREM Length = 464 Score = 119 bits (297), Expect = 3e-25, Method: Compositional matrix adjust. Identities = 116/456 (25%), Positives = 199/456 (43%), Gaps = 64/456 (14%) Query: 25 HSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCF 84 +S L + F+ + F+ + IVG ++ + R + P G + P +SI++P + Sbjct: 10 YSQYLFLAFLIAYASFLLLSLIVGAIHLY-ERRQQVPLGGDVSLP------PVSILVPAY 62 Query: 85 NEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRM------------------- 125 NEE + ET+ + L Y+ E+I ++DGSTD T +L Sbjct: 63 NEELTIIETVESLLMLDYDCYEIIVIDDGSTDHTAKLLKEQFHLEAVTCSINQRIPTRPL 122 Query: 126 ----AAQIPHLRVIHLAQ-NQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPM 180 Q+ H+++ L + N GK +L G AA+ +Y +C+D D++L D+ I + + Sbjct: 123 KAVSQKQLKHVKLTLLEKANGGKGDSLNFGINAAQYDYFLCMDADSMLQEDSLKRIAKTV 182 Query: 181 LYNPRVGAVTGNPRIRTRSTL-VGKI-------------QVGEYS-SIIGLIKRTQRIYG 225 +P V AV G ++ L G+I Q EY + G + G Sbjct: 183 AKDPSVIAVGGMVKVSQGVALAAGRIVDYHLPRQLISCAQAIEYDCAFFGARVLLDQAAG 242 Query: 226 NVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKL----QLNQ--WTIFYEPRALCW 279 N+ +SG F++ V + + ED+++ KL Q N+ + I YE A+CW Sbjct: 243 NLI-ISGAFGLFKKELAIAVDGYDTGTLGEDMELVMKLHFFCQNNKIPYRIGYETDAICW 301 Query: 280 ILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVG 339 P LK L +QR RW G + LK + + F++ L Y L ++ L+ Sbjct: 302 SQAPSRLKDLCQQRRRWFLGLYQC-LKKYLQFFVSGRFKVTVL-ISYVLYLLFE---LLS 356 Query: 340 FIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENR-YEHNLTS-SLFW 397 I A + + L++ + T ++L +C I NR Y N +S + F Sbjct: 357 PFIEAFGVTIIVLSLLCHQLNTTFFVSLMLLYICYCVLIALTSFLNRVYSQNNSSITFFD 416 Query: 398 IIW--FPVIFWMLSLAT--TLVSFTRVMLMPKKQRA 429 ++W + I+ L L T TLV + + +K+ A Sbjct: 417 MVWVIYAAIYQCLVLHTVLTLVKVSSFIGYQRKKNA 452 >UniRef50_P96587 Uncharacterized glycosyltransferase ydaM n=6 Tax=Bacillus RepID=YDAM_BACSU Length = 420 Score = 118 bits (296), Expect = 4e-25, Method: Compositional matrix adjust. Identities = 78/279 (27%), Positives = 144/279 (51%), Gaps = 12/279 (4%) Query: 48 GGVYFWVYRERHWP-WGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRY--EN 104 GG ++ ER+ P W EN +K+ P +S++IP NEE + +T+ A + Y + Sbjct: 26 GGFRHYMTFERNIPKWREN-----MKELPKVSVLIPAHNEEVVIRQTLKAMVNLYYPKDR 80 Query: 105 IEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLA---QNQGKAIALKTGAAAAKSEYLVC 161 +E+I VND S+D+T I++ + + ++++ +GK+ AL +G A + + + Sbjct: 81 LEIIVVNDNSSDRTGDIVNEFSEKYDFIKMVITKPPNAGKGKSSALNSGFAESNGDVICV 140 Query: 162 IDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPR-IRTRSTLVGKIQVGEYSSIIGLIKRT 220 D D ++ A Y+V ++ + + GAV G R I TL+ + E + + Sbjct: 141 YDADNTPEKMAVYYLVLGLMNDEKAGAVVGKFRVINAAKTLLTRFINIETICFQWMAQGG 200 Query: 221 QRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWI 280 + + + T+ G A RRS + ++G W D + ED +++ ++ + I + P A+ W Sbjct: 201 RWKWFKIATIPGTNFAIRRSIIEKLGGWDDKALAEDTELTIRVYNLGYHIRFFPAAITWE 260 Query: 281 LMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRM 319 PET K W+QR RWA+G V LK + + ++ + R+ Sbjct: 261 QEPETWKVWWRQRTRWARGNQYVVLKFLAQFFKLKRKRI 299 >UniRef50_A6DGB3 Glycosyl transferase, family 2 n=3 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DGB3_9BACT Length = 396 Score = 118 bits (296), Expect = 4e-25, Method: Compositional matrix adjust. Identities = 78/248 (31%), Positives = 125/248 (50%), Gaps = 12/248 (4%) Query: 70 QLKDNPSISIIIPCFNEEKNVEETIHAALAQRYE--NIEVIAVNDGSTDKTRAILDRMAA 127 L+ NP +S+++P NEE +E + Y+ +EVI +ND S+D T+ ++D Sbjct: 20 DLEYNPKVSVLVPAHNEEAVIEGCLDCMNKLEYKTGQLEVIILNDRSSDGTKELIDNFLC 79 Query: 128 QIP--HLRVIH--LAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYN 183 + P H+R H ++ GKA A+K A KSE +V D D L D ++ P + Sbjct: 80 KNPQSHIRAHHRPMSSEPGKAAAMKEIIATLKSEIIVIFDADYLPQADLIKRLISP-FKD 138 Query: 184 PRVGAVTGNPRIRT---RSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRS 240 P+VGA G R+ T + ++ K+ E S + + + + + G R S Sbjct: 139 PQVGATMG--RVVTYNANANIMTKLIDLERRSGYAIDQNVRNYFDLLPQFGGTTGGIRLS 196 Query: 241 ALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGG 300 AL +VG W +TED D+++KL LN + I Y A C+ PET + +KQ RWA G Sbjct: 197 ALEDVGGWDTRTLTEDTDLTYKLYLNGYKIKYLNAAACYEETPETWQARYKQVRRWAYGH 256 Query: 301 AEVFLKNM 308 + +K++ Sbjct: 257 NDCMIKHL 264 >UniRef50_Q988K9 Mlr6694 protein n=1 Tax=Mesorhizobium loti RepID=Q988K9_RHILO Length = 475 Score = 118 bits (295), Expect = 5e-25, Method: Compositional matrix adjust. Identities = 101/352 (28%), Positives = 160/352 (45%), Gaps = 54/352 (15%) Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKT-------------RAI 121 P I++++P +NEE NV E++H+ LA Y N EVI +NDGS D+T R Sbjct: 63 PPIALLVPAYNEELNVVESVHSMLALEYPNFEVIVINDGSKDQTLQRLIGAFKLVKFRRP 122 Query: 122 LDRMAAQIPHLRVIHLA-----------QNQGKAIALKTGAAAAKSEYLVCIDGDALLDR 170 + A P +R ++ + +N GKA A G ++ IDGD++L+ Sbjct: 123 YEEALAHKP-IRGLYSSPMTERLFVVDKENGGKADAQNAGINVCRAPLFCVIDGDSILEP 181 Query: 171 DAAAYIVEPMLYNP-RVGAVTGNPRIRTRSTL-VGKI-------------QVGEYSSIIG 215 DA +P + +P R AV G RI S + G++ QV EY Sbjct: 182 DALMRAAQPFIDDPERTIAVGGTIRIANGSRIEAGRVREIHLPNRLLPLLQVVEYLRAFL 241 Query: 216 LIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLN------QWT 269 + + + VSG FRR+ + VG ++ + ED+D+ KL + ++ Sbjct: 242 MARLAWSHINTLMLVSGAFGMFRRTEVVAVGGFTMGSMGEDLDLVIKLHRHMRDAGRKYR 301 Query: 270 IFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCL- 328 I + P +CW PETL L +QR RW +G E F + L+ R+ L F + L Sbjct: 302 IQFIPEPVCWTEAPETLSVLARQRTRWQRGALECFFRYRAMLFNPRYGRVGFLGFGHILI 361 Query: 329 -TTIWAFTCLVGFI-IYAVQLAGVPLNIE----LTHIAATHTAGILLCTLCL 374 + ++G+I I A L G+ L+ E T + T+ + +C+L L Sbjct: 362 VDVLGPVAEVLGYILIPAFWLLGI-LSTEYLLAFTSLIFTYGVCVSVCSLVL 412 >UniRef50_Q46CB1 Glycosyltransferase n=2 Tax=Methanosarcina RepID=Q46CB1_METBF Length = 694 Score = 116 bits (290), Expect = 2e-24, Method: Compositional matrix adjust. Identities = 89/318 (27%), Positives = 146/318 (45%), Gaps = 31/318 (9%) Query: 34 VFFWPFFMSIMWIVGGVYFWVYRER-HWPWGENAPAPQ-LKD---NPSISIIIPCFNEEK 88 V+FW F ++ + + W + + +N+ PQ ++D + +SII+P ++EEK Sbjct: 302 VYFWFFIITSLLPLSFFVIWKFLSGWNIRRSKNSCQPQNIQDTGYHGLVSIIVPAYDEEK 361 Query: 89 NVEETIHAALAQRYE-NIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIAL 147 + + + L Q Y +EVI VNDGSTD+T I+ + ++ I L N GKA AL Sbjct: 362 TIGSCLESILNQDYNGEMEVIVVNDGSTDRTAEIVLKYP-----VKFIDLKANVGKANAL 416 Query: 148 KTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRT---RSTLVGK 204 AK + L+ D D+ + +A + +VE ++ N V GN I + ++ Sbjct: 417 NRAIEDAKGDILIFTDSDSYMSANAVSSLVECLITNNDAQIVAGNVFIHNSQGKKRIMKY 476 Query: 205 IQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQ 264 Q+ EY + + Q + GN+ G + A +R V + SD+ I ED D + K Sbjct: 477 FQMIEYLIEQEITRYLQSLSGNILVCPGPLTAVKREVCESVRF-SDETIVEDADFTIKAL 535 Query: 265 LNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFF 324 I EP+A + PET + +KQR RW G N ++W L Sbjct: 536 KKSMKIIREPKAKVYTNAPETFRAWYKQRKRWWYG----------------NLQVWRLHR 579 Query: 325 EYCLTTIWAFTCLVGFII 342 + + W +G+II Sbjct: 580 HWAMRNAWMVLNYLGYII 597 >UniRef50_C7XH18 Glycosyltransferase n=1 Tax=Lactobacillus crispatus 125-2-CHN RepID=C7XH18_9LACO Length = 442 Score = 115 bits (289), Expect = 2e-24, Method: Compositional matrix adjust. Identities = 86/302 (28%), Positives = 143/302 (47%), Gaps = 23/302 (7%) Query: 58 RHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYEN--IEVIAVNDGST 115 H + P P P +SIIIP +N K + + + Y + + V++GS Sbjct: 36 HHHKIRKKTPLPDFL--PVLSIIIPTYNSSKTLYRCLKSLADSDYPKNLLGIFVVDNGSK 93 Query: 116 DKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAY 175 D T + + P L + L GK+ AL A+S+Y++ ID D LD+ A Sbjct: 94 DDTFQVFAKAQTDYPQLLMRWLTSPHGKSSALNAAIYKARSQYIINIDSDGRLDKHALNN 153 Query: 176 IVEPMLYNPRVGAVTGNPRIRTRSTLVGK----IQVGEYSSIIGLIKRTQRIYGN---VF 228 IV+ NP + A+TG I T L+ K +++ EY + I N +F Sbjct: 154 IVKKFSANPNIDALTGT--ILTDRHLIKKPHKLLKLNEYFEYCQTFLAGRNIENNSNHIF 211 Query: 229 TVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLN-QWTIFYEPRALCWILMPETLK 287 T+SG +AF+R L +S D I+ED D++++++ N + TI + A+ + Sbjct: 212 TLSGAFSAFKRETLIRTYLYSTDTISEDTDMTFQIRYNLKGTISFCDDAIFYSEPISGFS 271 Query: 288 GLWKQRLRWAQGGAEVFLKNMTRL--WRK--ENFRMWPLFFEYCLT---TIWAFT--CLV 338 L+ QR RW +G EV K ++++ W++ NF++ L ++ +T IW F L+ Sbjct: 272 ELYTQRQRWQRGELEVINKYLSQVLSWKQFFSNFQIRRLIIDHTVTFLKIIWMFAIFILL 331 Query: 339 GF 340 GF Sbjct: 332 GF 333 >UniRef50_A6CMH6 Glycosyl transferase, group 2 family protein n=2 Tax=Bacillus RepID=A6CMH6_9BACI Length = 449 Score = 115 bits (288), Expect = 3e-24, Method: Compositional matrix adjust. Identities = 91/327 (27%), Positives = 147/327 (44%), Gaps = 60/327 (18%) Query: 63 GENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAIL 122 E ++ N S S+I+P +NEEK +E I Y+N E++ VNDGS D T +L Sbjct: 29 NEKKHKVRITKNHSFSVIVPAYNEEKVIENCIEGFRQVEYKNAELLIVNDGSKDNTLRVL 88 Query: 123 ---------DRM-AAQIPHLRVIHLAQ-------------NQGKAIALKTGAAAAKSEYL 159 RM ++ H RV Q N GKA +L G +++E + Sbjct: 89 HEVLQLKPASRMIDTRLKHERVTGFYQSKKFPSIFVINKKNGGKADSLNAGINFSENEIV 148 Query: 160 VCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGN---------------PRIRTRSTLVGK 204 V +D D++L++++ + V A GN P ++ + + K Sbjct: 149 VTLDADSILEKNSLNEM-NAAFQKKSVLACGGNVLISQAFNGSLENLKPTLKVNNVI--K 205 Query: 205 IQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQ 264 Q +Y S L KR Q ++ ++G AFRR L + + + ED+DI+ KL Sbjct: 206 FQFLQYLSSFYLHKRAQAAMNSITVIAGAFGAFRRETLFAIKGFR-KTVGEDMDITLKLH 264 Query: 265 L------NQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFR 318 N+ +I + P ALC+ P TLK ++KQR+RW + + + +RK FR Sbjct: 265 KWIKERGNKESITFVPTALCYTECPSTLKNMFKQRVRWQKAFIDCLI-----YYRKCYFR 319 Query: 319 MWP----LFF---EYCLTTIWAFTCLV 338 P +FF ++ + T+ AF L+ Sbjct: 320 KLPKKFSIFFLVDQFTIGTLSAFPVLL 346 >UniRef50_A1VIY0 Glycosyl transferase, family 2 n=1 Tax=Polaromonas naphthalenivorans CJ2 RepID=A1VIY0_POLNA Length = 476 Score = 115 bits (287), Expect = 5e-24, Method: Compositional matrix adjust. Identities = 77/243 (31%), Positives = 120/243 (49%), Gaps = 10/243 (4%) Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRY--ENIEVIAVNDGSTDKTRAILDRMAAQIP-H 131 P I++ I NEEK + I A L Y + +++I VND STD+T AI+DR A+ P Sbjct: 105 PMITVFIAAHNEEKVIAGCIEALLNTDYPADQLKIIPVNDRSTDRTGAIIDRYVARFPSR 164 Query: 132 LRVIHLAQNQ-GKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVT 190 + H + GK+ ALK A A+ + + D D + R + P ++P VGAV Sbjct: 165 ISPFHRTLGKAGKSAALKDALAFAEGDIAIIFDADYVPGRGLLKQLAAP-FFDPEVGAVM 223 Query: 191 GNP-RIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWS 249 G + + + L+ ++ E S + ++ + + G + R SA+ VG W Sbjct: 224 GRVVPVNSGANLLTRMLDLERSGGYQVDQQARMNMNLLPQYGGTVGGVRLSAVEAVGGWH 283 Query: 250 DDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMT 309 DD + ED DI+++L N W Y R+ C+ +PE + KQ RWA+G +V M Sbjct: 284 DDTLAEDTDITYRLMFNGWKTVYSNRSECYEEVPEEWRVRIKQVKRWAKGHNQV----MA 339 Query: 310 RLW 312 R W Sbjct: 340 RYW 342 >UniRef50_A3Q6L3 Glycosyl transferase, family 2 n=6 Tax=Actinomycetales RepID=A3Q6L3_MYCSJ Length = 485 Score = 114 bits (286), Expect = 5e-24, Method: Compositional matrix adjust. Identities = 84/302 (27%), Positives = 136/302 (45%), Gaps = 52/302 (17%) Query: 69 PQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQ 128 P L P +S+++ +E +EE + + Y ++EVI ++D STD T +L R+AA+ Sbjct: 106 PALPPLPRVSLLVAVRDEVDGIEECVRTMVGSAYPDLEVIVIDDASTDGTPDVLRRLAAE 165 Query: 129 IPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGA 188 + + VI NQGK AL G A E L D D +L DA + V ++ NPR+GA Sbjct: 166 L-DVTVIFKEINQGKKHALTDGVRVASGEVLAFTDSDCVLAPDALSRCVRALVENPRLGA 224 Query: 189 VTGNPR-IRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAE--- 244 V+G+ R + T++ + Q Y + K + +GNV VSG +A FRR A+ Sbjct: 225 VSGHARALNAADTVLTRAQDTWYDGQFRVAKAAEATFGNVTCVSGPLAVFRRDAIVNYLP 284 Query: 245 --------------------VGY--------------WSDDMITEDIDISWKLQLNQWTI 270 GY ++DD + D D + +W + Sbjct: 285 AWANDTFLGREFRFATDRQLTGYVLGQVWKGQALKRRYADDPLVADHDHPER----RWLV 340 Query: 271 FYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNM----TRLWRKENFRMWPLFFEY 326 Y A W +P + KQ++RW + F++N+ + +WR+ F LF+ + Sbjct: 341 GYVRSAHVWTTVPARFRPFLKQQVRWKKS----FIRNLCFTGSFMWRR-GFGAAALFYGH 395 Query: 327 CL 328 L Sbjct: 396 VL 397 >UniRef50_Q03HA4 Glycosyltransferase n=2 Tax=Bacilli RepID=Q03HA4_PEDPA Length = 414 Score = 113 bits (282), Expect = 2e-23, Method: Compositional matrix adjust. Identities = 100/393 (25%), Positives = 187/393 (47%), Gaps = 32/393 (8%) Query: 64 ENAPAP--QLKDN-PSISIIIPCFNEEKNVEETIHAALAQRY--ENIEVIAVNDGSTDKT 118 + AP P QL + P +++++P NE + +T+ + L+ Y ++ E+I +ND S+D + Sbjct: 32 QEAPKPDKQLPEKIPFVTVMVPAHNEGIVIVKTVLSLLSFDYPHDHYEIIVINDNSSDNS 91 Query: 119 RAILDRMAAQI--PHLRVIH---LAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAA 173 +L + L+VI+ + +GK+ AL G AK + D D +R A Sbjct: 92 AELLKEVQNDFGEERLKVINTDNVTGGKGKSNALNIGLKQAKGSVIAIYDADNTPERGAL 151 Query: 174 AYIVEPMLYNPRVGAVTGNPRIRTR-STLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSG 232 +V +L N R+ AV G R R + +T++ + E S + + ++ + T+ G Sbjct: 152 RLLVAELLGNDRLAAVIGKFRTRNKEATILTRFINIETLSFQWMAQAGRQRLFKLCTIPG 211 Query: 233 VIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQ 292 RR L ++G W + ED +IS+++ + I ++PRA+ W P+TL + Q Sbjct: 212 TNYVIRREILEKIGGWDVKALAEDTEISFRVYQMGYQIKFQPRAVTWEQEPQTLDVWFHQ 271 Query: 293 RLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFE--YCLTTIWAFTCLVGFIIYAVQLAGV 350 R RW +G V +KN L++K P+ F+ Y L+T + ++ A+ L+ V Sbjct: 272 RTRWVKGNIYVVVKNAKLLFKKAG---RPIRFDLLYFLSTYF-------LLMTALILSDV 321 Query: 351 PLNIELTHIAATHTAGI--LLCTLCLLQFIVSLMIENRYEHNLT--SSLFWIIWFPVIF- 405 + + +A + G L ++ FI+S + E S++ +II+ +++ Sbjct: 322 VFVLSVAGLAHSDLQGFSNALWLFGIIMFIISTFVTVTTEKGEMRFSNILYIIFMYLVYS 381 Query: 406 --WMLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 W+ A +V + R + K+ A+W R Sbjct: 382 QMWLAVAAYGMVGYIREQVFHKQ--AKWYKTKR 412 >UniRef50_UPI00017891C0 glycosyl transferase family 2 n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI00017891C0 Length = 450 Score = 113 bits (282), Expect = 2e-23, Method: Compositional matrix adjust. Identities = 87/311 (27%), Positives = 142/311 (45%), Gaps = 53/311 (17%) Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIP---- 130 SI++++P +NEE + I A RY N E++ +NDGS D T + LD + P Sbjct: 39 KSITVLVPAYNEELTIRNCIDAMAGLRYSNYEIMIINDGSKDGTLSALDELLDLEPSART 98 Query: 131 --------------------HLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDR 170 H+ VI N GKA +L G A S+ ++ +D D++L+ Sbjct: 99 ADSKLNYKSIKGFYRSNRYRHIYVID-KMNGGKADSLNAGIDYAVSDIVITLDADSMLEA 157 Query: 171 DAAAYIVEPMLYNPRVGAVTGNPRI--------------RTRSTLVGKIQVGEYSSIIGL 216 D+ Y V ++ + A+ G +I + R + K Q+ Y+ + Sbjct: 158 DSLKY-VNQYFHDDDIIALGGTVKIVQGAVRDKNGAVVEKFRGKGLIKSQIINYTHGFYV 216 Query: 217 IKRTQRIYGNVFTVSGVIAAFRRSALAEV-GYWSDDMITEDIDISWKLQLN------QWT 269 K TQ + ++ +SG AF + L V G+ S + EDIDI+ K+ + Sbjct: 217 RKLTQSYFNSIVVISGAFGAFYKEILVHVDGFRS--TVGEDIDITLKIHEYIKAHRLKKK 274 Query: 270 IFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFE---Y 326 + Y P A+C+ PE L+ +KQR+RW + + L +RL RK + + LFF + Sbjct: 275 LVYAPEAVCYTECPENLQNFYKQRIRWQKAFIDCILIYWSRLSRKFSVSV-SLFFAIDGF 333 Query: 327 CLTTIWAFTCL 337 L T+ AFT + Sbjct: 334 MLGTLSAFTTI 344 >UniRef50_C0GFE2 Glycosyl transferase family 2 n=1 Tax=Dethiobacter alkaliphilus AHT 1 RepID=C0GFE2_9FIRM Length = 455 Score = 111 bits (278), Expect = 4e-23, Method: Compositional matrix adjust. Identities = 99/376 (26%), Positives = 170/376 (45%), Gaps = 67/376 (17%) Query: 69 PQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILD---RM 125 P+ ++I++P FNE+ +++ I AL Y+N E+I +NDGSTD T IL+ R+ Sbjct: 34 PEKLPELGLTILVPAFNEQHVIKQCILGALRANYQNHELIIINDGSTDDTLGILESFLRL 93 Query: 126 AAQ---------------------IPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDG 164 A P++ VI N GKA AL G A ++ +V +D Sbjct: 94 EAVEESINRTLTYKTVRGFYYSTIYPNVYVID-KSNGGKADALNAGIDYANNDIVVTLDA 152 Query: 165 DALLD-----------RDAAAYIVEPMLYNPRVGAVTG---NPRIRTRSTLVGKIQVGEY 210 D++LD D + M++ + V G +P R + + Q+ +Y Sbjct: 153 DSILDSNSLQAMNSCFHDDSVIAAGGMVHIVQGAGVKGGKASPNFAVRGII--RYQIMQY 210 Query: 211 SSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTI 270 + L K Q + + ++G AFRRS L E+ + + + ED+DI+ K+Q +T Sbjct: 211 LTSFYLHKFVQAKFKAMTVIAGAFGAFRRSTLLELNGFR-NTVGEDMDITLKIQRLIYTK 269 Query: 271 FYEPR------ALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFF 324 + + A+C+ PE + L+KQR+RW + + + L+ + +F + F Sbjct: 270 ERDKKMMVVSEAVCYTECPENFRNLFKQRIRWQKAFVDCIITYGHCLFSQFSFGLSVFFL 329 Query: 325 --EYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCT----LCLLQFI 378 + L T+ AF L+ VPL + L+ HT + L T L L Q + Sbjct: 330 IEGFMLGTLIAFNTLL-----------VPLVLILSKTG--HTLVLTLLTIYFILALCQSL 376 Query: 379 VSLMIENRYEHNLTSS 394 V+ ++ R+ H+ + S Sbjct: 377 VAFLVSKRFGHSYSKS 392 >UniRef50_Q0AV05 Glycosyltransferase n=1 Tax=Syntrophomonas wolfei subsp. wolfei str. Goettingen RepID=Q0AV05_SYNWW Length = 452 Score = 110 bits (276), Expect = 9e-23, Method: Compositional matrix adjust. Identities = 69/243 (28%), Positives = 122/243 (50%), Gaps = 14/243 (5%) Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRY--ENIEVIAVNDGSTDKTRAILDRMAAQIPHL 132 P +++IIP N + + + Q Y E+I+VI VN+GS D++ I R + P + Sbjct: 56 PYVTVIIPVHNSADTLGHCLDSIARQTYPRESIQVICVNNGSEDESFDIFQRFQYRHPEM 115 Query: 133 RVI-HLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTG 191 V+ + GK+IAL G + Y++ +D D LD DA +V +P + A TG Sbjct: 116 SVMWSSLERAGKSIALNAGVYSGHGSYMMNVDSDTWLDPDAILNVVRVFEKDPSLAAATG 175 Query: 192 NPRIRTR----STLVGKI---QVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAE 244 + R+ + S+ + I +V EY + +R Q + ++FT+SG + FRR + + Sbjct: 176 SIRVDKKLGEGSSFIDMINYCEVIEYIVAFDIGRRYQDLKNSIFTLSGAFSVFRRDIILQ 235 Query: 245 VGYWSDDMITEDIDISWK----LQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGG 300 + ++ED D+++ ++ + I + A+ ++ E L L+ QRLRW +G Sbjct: 236 TFLYQTRTVSEDTDLTFNIRQAIEAGEGRIGFISDAIAYVEPIENLSRLYSQRLRWQRGE 295 Query: 301 AEV 303 EV Sbjct: 296 MEV 298 >UniRef50_Q97FY5 Glycosyltransferase n=4 Tax=Clostridiales RepID=Q97FY5_CLOAB Length = 456 Score = 109 bits (272), Expect = 2e-22, Method: Compositional matrix adjust. Identities = 75/279 (26%), Positives = 140/279 (50%), Gaps = 19/279 (6%) Query: 38 PFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAA 97 PF M I+ +GG F + + + A + +L P I+II+P +N E ++ I + Sbjct: 22 PFLMEIVPALGG--FIILLRKRLSIKDKAFSGKL---PQITIIVPVYNSEDTLKGCIESI 76 Query: 98 LAQRY--ENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAK 155 Y E I+V+ V++ S+D + I R + L + ++ QGK+ AL + Sbjct: 77 YNSDYPSEFIDVMLVDNQSSDNSFKIFTRCQKEFDGLSLRYMNSKQGKSKALNMALFNSS 136 Query: 156 SEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTG----NPRI--RTRST---LVGKIQ 206 +Y++ ID D L +DA +V NP V +TG + + +T+ + L+ K + Sbjct: 137 GKYIIHIDSDGKLHKDAIKNMVTRFEGNPHVHCMTGVILTDKELIEKTKGSFFKLLRKCE 196 Query: 207 VGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQ-- 264 EY+ + + G+++T+SG +AFR+S + + ++ + ++ED I+++++ Sbjct: 197 FFEYAQAFLAGRNFESEVGSIYTLSGAFSAFRKSTILKTQLYNSETVSEDTQITFQVKKL 256 Query: 265 LNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEV 303 LNQ ++ AL ++ E L+ QR RW +G EV Sbjct: 257 LNQ-KVYLCENALFFVDPIEGFNKLYTQRQRWQRGELEV 294 >UniRef50_A4CLA3 Glycosyl transferase, group 2 family protein n=2 Tax=Flavobacteriales RepID=A4CLA3_9FLAO Length = 467 Score = 108 bits (269), Expect = 5e-22, Method: Compositional matrix adjust. Identities = 85/306 (27%), Positives = 130/306 (42%), Gaps = 49/306 (16%) Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKT---------------- 118 PSI+II P +NE + E I + L+ RY N EV+ VNDGS D T Sbjct: 53 PSITIIAPAYNEGLTIVENIRSLLSLRYVNYEVMVVNDGSKDDTLEKMVAAYDLERVPMD 112 Query: 119 ----------RAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALL 168 R + L VI +N GK+ AL TG ++S+Y+ CID D LL Sbjct: 113 IDPEWPCKPIRGVYKSRQRSFEKLTVID-KENGGKSDALNTGMRLSESKYIGCIDVDCLL 171 Query: 169 DRDAAAYIVEPMLYN--PRVGAVTGNPRIRTRSTLVG--------------KIQVGEYSS 212 DA ++V+ + RV AV G R+ + G + Q+ EY+ Sbjct: 172 LPDALLHVVKAFYEHTSKRVIAVGGVIRVANSCKIDGGTLEEIRLPKNWLARFQLLEYTR 231 Query: 213 IIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQL------N 266 L + ++ +SG F R E G + + ED++I ++++ Sbjct: 232 SFLLGRMAWGRIDSLLIISGAFGFFDRQIALEAGGYDTGTVGEDMEIVFRMRRLMHDNDT 291 Query: 267 QWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEY 326 +T+ Y P LCW +PE L QR RW++G E ++ + + R+ L + Y Sbjct: 292 PYTVAYIPDPLCWTEVPEDLGIFINQRDRWSRGNLETLYRHRDMFFNPKYGRLGMLSYPY 351 Query: 327 CLTTIW 332 L W Sbjct: 352 WLFYEW 357 >UniRef50_D1B804 Glycosyl transferase family 2 n=2 Tax=Bacteria RepID=D1B804_THEAS Length = 450 Score = 107 bits (268), Expect = 6e-22, Method: Compositional matrix adjust. Identities = 80/266 (30%), Positives = 128/266 (48%), Gaps = 22/266 (8%) Query: 45 WIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYEN 104 +++ GV + PW E ++ +PS+S II C++E ++V+ TI Q Y Sbjct: 50 FVLAGVLKYAMERALLPWEE------VQYHPSVSCIITCYSEGEDVKLTIRTLAHQIYPG 103 Query: 105 -IEVIAVNDGS---TDKTRAILDRM--AAQIPHLRVIHLA--QNQGKAIALKTGAAAAKS 156 I++I V DG+ + RA L+ M + P+ ++ L Q G+ +L G + A+ Sbjct: 104 RIQIIPVIDGALVNSHTHRAALEMMDQVMRAPNRELLVLPKWQRGGRVSSLNAGLSVARG 163 Query: 157 EYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRT-RSTLVGKIQVGEYSSIIG 215 E ++ +DGD D D + V +P V AV G R+R RS+L +Q EY IG Sbjct: 164 EVVMALDGDTSFDNDMVSKAVR-HFRDPNVAAVAGCLRVRNWRSSLAAALQGLEYMLAIG 222 Query: 216 LIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQ-----LNQWTI 270 + K + V +SG FRR L + W D ED+DI+ +++ I Sbjct: 223 VSKTGLSQFNAVNNISGAFGIFRREVLNLIMGW-DAGSAEDLDITMRIKNYFGRYRSVRI 281 Query: 271 FYEPRALCWILMPETLKGLWKQRLRW 296 ++P + P T +G +KQRLRW Sbjct: 282 LFDPEVIGHTDAPPTFRGFFKQRLRW 307 >UniRef50_C1TQW8 Glycosyl transferase n=1 Tax=Dethiosulfovibrio peptidovorans DSM 11002 RepID=C1TQW8_9BACT Length = 444 Score = 107 bits (268), Expect = 7e-22, Method: Compositional matrix adjust. Identities = 82/278 (29%), Positives = 133/278 (47%), Gaps = 23/278 (8%) Query: 33 FVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEE 92 FV F + IM I+G + + R PW + PS+S ++ C++E V+ Sbjct: 33 FVLFLEMPVYIMIILGVLKHCLERMYQLPWRSDY-------YPSVSCLVTCYSEGDAVKH 85 Query: 93 TIHAALAQRYEN-IEVIAVNDGS-----TDKTRAILDRMAAQIPH--LRVIHLAQNQGKA 144 TI + Q Y I++I + DG+ T + ++ A++ + L+V+ Q G+ Sbjct: 86 TIRSLARQIYPGRIQIIPIVDGAALNGETFRAAKSMESAVARLSNRSLQVVPKWQRGGRV 145 Query: 145 IALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRT-RSTLVG 203 AL TG A E ++ +DGD D D + VG V+G R+R RS LV Sbjct: 146 SALNTGMNFADGEIVMALDGDTSFDNDMVERATR-HFEDRSVGCVSGCLRVRNARSGLVT 204 Query: 204 KIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKL 263 ++Q EY I K G V +SG FRRS L +G W D ED+D++ ++ Sbjct: 205 RLQAIEYFLSIQAAKTGLSALGVVNNISGAFGVFRRSVLDLIGGW-DAGTAEDLDLTLRV 263 Query: 264 Q-----LNQWTIFYEPRALCWILMPETLKGLWKQRLRW 296 + + + I ++P A+ + +P+T + +KQRLRW Sbjct: 264 KNYFGRYDGFRIIFDPEAMGFTDVPDTFRDFFKQRLRW 301 >UniRef50_Q2S5Y1 Glycosyl transferase, group 2 family protein n=1 Tax=Salinibacter ruber DSM 13855 RepID=Q2S5Y1_SALRD Length = 434 Score = 107 bits (268), Expect = 7e-22, Method: Compositional matrix adjust. Identities = 97/378 (25%), Positives = 155/378 (41%), Gaps = 54/378 (14%) Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRM--------- 125 P I++I P NEE ++ A L+ Y + E++ VNDGSTD T LDR+ Sbjct: 23 PPITLIAPAHNEEATCVSSVRAFLSTEYPSFEILVVNDGSTDNT---LDRLQEAFDLQPT 79 Query: 126 ----AAQIPHLRVIHLAQNQ-------------GKAIALKTGAAAAKSEYLVCIDGDALL 168 + +P V + Q+Q GKA AL G S +D D L+ Sbjct: 80 ARPPTSSLPTESVRQVYQSQTHPNLWVLDKENGGKADALNAGINFCDSPLFCAVDADTLV 139 Query: 169 DRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTL--------------VGKIQVGEYSSII 214 + + A +V P L AV G R+ T+ + K+QV EY Sbjct: 140 EGNGLARMVRPFLEETATVAVGGIVRVANGCTIRAGRVQEVSMPTKWLPKLQVLEYLRAF 199 Query: 215 GLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQL------NQW 268 + G + +SG FRR VG ++ D + ED+++ ++ + Sbjct: 200 YFGRVGWDALGAMLIISGAFGLFRRETAVRVGGYATDTVGEDMELVVRMHRRLREAGQDY 259 Query: 269 TIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLW--RKENFRMWPLFFEY 326 + + P + W +P TL+ L +QR RW +G A+ +++ L R M F Y Sbjct: 260 RVRFVPDPVAWTKVPSTLQDLGRQRDRWQRGLADSMIRHQQMLLNPRYGTVGMLAYPFFY 319 Query: 327 CLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENR 386 L T L+G+I + V L L+ +A A +L L L + + R Sbjct: 320 MLETFGPILELMGYITFVVALISGTLSGPFM-LAFFLVAFMLGMALSLSAVTLEELTFRR 378 Query: 387 YEHNLTSSLFWIIWFPVI 404 YE T+ +++ PV+ Sbjct: 379 YEK--TADFAQLLFLPVV 394 >UniRef50_C9RLY4 Glycosyl transferase family 2 n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RLY4_FIBSS Length = 517 Score = 107 bits (266), Expect = 1e-21, Method: Compositional matrix adjust. Identities = 74/255 (29%), Positives = 126/255 (49%), Gaps = 11/255 (4%) Query: 70 QLKDNPSISIIIPCFNEEKNVEETIHAALAQRY-ENIEVIAVNDGSTDK----TRAILDR 124 L D P ++ +P FNE VE + A A Y ++ I V D STD+ T+ + Sbjct: 57 SLADLPQVTTQLPVFNEANCVERLLEAVCAIDYPKDKHEIQVLDDSTDECYEVTKKKVAE 116 Query: 125 MAAQIPHLRVIHLAQNQG-KAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYN 183 +AA+ +++IH + KA ALK G A AK E+L D D + ++D V ++ + Sbjct: 117 LAARGYDIKLIHRTNRKDFKAGALKEGMAVAKGEFLAIFDADFVPEKDFLLKTVPYLVMD 176 Query: 184 PRVGAVTG--NPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVF-TVSGVIAAFRRS 240 P+VG V G RT S L +G + I+++ R +G +F +G +R+ Sbjct: 177 PQVGLVQGRWGHLNRTESGLTLAQSIGIDGHFV--IEQSARSWGKLFMNFNGTAGVWRKD 234 Query: 241 ALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGG 300 A+ G W D +TED+D+S++ QL W + + + +P + Q+ RWA+G Sbjct: 235 AIYGGGGWEGDTLTEDMDLSYRSQLAGWKMKFVFDVIVPAELPNDINAFKAQQFRWAKGS 294 Query: 301 AEVFLKNMTRLWRKE 315 + +K + ++ R + Sbjct: 295 IQTAIKILPKVLRSK 309 >UniRef50_D1N3C9 Glycosyl transferase family 2 n=1 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1N3C9_9BACT Length = 444 Score = 106 bits (265), Expect = 2e-21, Method: Compositional matrix adjust. Identities = 93/370 (25%), Positives = 161/370 (43%), Gaps = 27/370 (7%) Query: 67 PAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRY--ENIEVIAVNDGSTDKTRAILDR 124 P ++ P ++IIP +NE + ++ + + Y + +E+IAVNDGS D T + + Sbjct: 72 PTVPPEELPVCTVIIPAYNEGETIKLALESVCRSEYPADKLEIIAVNDGSIDDTLRWMQQ 131 Query: 125 MAAQIP-HLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYN 183 A + +RVI +N+GK AL G + E V +D D+LL DA A +V P + Sbjct: 132 AAEESGGRIRVITYPENRGKRYALYQGFRSGSGEVFVTVDSDSLLKPDALARMVSPFRRD 191 Query: 184 PRVGAVTGNPRIRT-RSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSAL 242 VGAV+GN R+ + + ++ I+ Q ++ +VF G ++A+RR+AL Sbjct: 192 HSVGAVSGNVRVANLEGGIFPPMIDAGFTFAFDFIRAGQTVFKSVFCTPGALSAYRRTAL 251 Query: 243 A--------EVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRL 294 + + I ED ++ + + + + A + +PE +G+ K L Sbjct: 252 MPLLNEWLNQTFFGQPAGIGEDRALTNLILRQGYGVIMQRSAYIYTNVPECYRGISKMLL 311 Query: 295 RWAQGGAEVFLKNMTRLWRKENF---RMWPLFFEYCLTTIWAF--TCLVGFIIYAVQLAG 349 RW + L+ + ++R F R W + L T+ ++ F +Y + Sbjct: 312 RWERSNIRENLEMYSFIFRNFRFTDPRRWFMLATLLLYTLMTVLPVVMIWFTLYNIFTTN 371 Query: 350 VPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLS 409 L + IAAT L L LQ N H L + I +FW++ Sbjct: 372 GTL---VLSIAATAMLWSTLPALVNLQ-------RNSLRHILYCYFYGIFHALTLFWVVP 421 Query: 410 LATTLVSFTR 419 A + +R Sbjct: 422 YAVFTIGNSR 431 >UniRef50_D1CH68 Glycosyl transferase family 2 n=2 Tax=Bacteria RepID=D1CH68_THET1 Length = 470 Score = 105 bits (263), Expect = 2e-21, Method: Compositional matrix adjust. Identities = 73/277 (26%), Positives = 123/277 (44%), Gaps = 45/277 (16%) Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDR---------- 124 P +S+I+P +NEE + +++ + LA Y +V+ VNDGS D T +L R Sbjct: 58 PPVSLIVPAYNEEATIADSVRSFLALHYPQHQVVVVNDGSRDATLEVLKREFDLWESDQP 117 Query: 125 --------------MAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDR 170 +++ P+L VI +N GKA +L G AA + CID D +L+ Sbjct: 118 IRLRLRTAPIRAVYASSRFPNLLVID-KENGGKADSLNAGICAASYPLVCCIDADIILEE 176 Query: 171 DAAAYIVEPMLYNPRVGAVTGNPRIRTRSTL-VGKI-------------QVGEYSSIIGL 216 DA + PM+ + +V AV G R+ + G++ QV EY Sbjct: 177 DALLRVARPMIESSKVVAVGGIVRVANGCKIEAGRVTEVRAPREWLPSFQVVEYLRAFLA 236 Query: 217 IKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQL------NQWTI 270 + + +SG FRR + VG + + ED+++ ++ + I Sbjct: 237 ARTAWSRLNCLLIISGAFGMFRREDVIAVGGYETSTVGEDMEVVTRMHRLLRERHRDYHI 296 Query: 271 FYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKN 307 + P + W +P TL+ L +QR RW +G + L++ Sbjct: 297 AFVPDPVAWTEVPATLRVLRRQRDRWHRGLIDTLLRH 333 >UniRef50_B7I546 Glycosyltransferase n=12 Tax=Acinetobacter RepID=B7I546_ACIB5 Length = 416 Score = 105 bits (263), Expect = 2e-21, Method: Compositional matrix adjust. Identities = 72/256 (28%), Positives = 129/256 (50%), Gaps = 12/256 (4%) Query: 53 WVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRY--ENIEVIAV 110 W Y + +N P P+ P +S++IP +NE +E+T+HA Q Y E+ EV+ + Sbjct: 30 WKYSKTAEKELQNLPIPE--RWPVLSVLIPAYNEGVVIEDTLHAIAQQDYPAESYEVLLI 87 Query: 111 NDGSTDKTRAILDRMAAQIPHLRVIHLAQN---QGKAIALKTGAAAAKSEYLVCIDGDAL 167 NDGS D T I + +A P ++++++ + +GK+ L G AK E +V D D+ Sbjct: 88 NDGSKDNTLEIAENLAKIYPCIKIVNVPKGMGGKGKSRTLNNGLPHAKGELIVVYDADST 147 Query: 168 LDRDAAAYIVEPMLYNPRVGAVTGNPRIRT-RSTLVGKIQVGEYSSIIGLIK--RTQRIY 224 + D + + +L + ++ AV G R R + +++ + E+ + + R QR Sbjct: 148 PEPDCVRLLAQTLLADKKLVAVNGKVRTRNWQDSILTRFIAIEFIFFQWIFQGGRWQRF- 206 Query: 225 GNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPE 284 + T+ G R AL +G + + + +D ++S+++ + Q I + P A+ W P Sbjct: 207 -ELSTLMGTNYVIWRDALETLGGFDEKSLVDDTEMSFRIFIGQKRIKWVPYAIGWQQDPP 265 Query: 285 TLKGLWKQRLRWAQGG 300 +L KQR RW QG Sbjct: 266 SLSVFVKQRSRWTQGN 281 >UniRef50_Q0AL04 Glycosyl transferase, family 2 n=3 Tax=Bacteria RepID=Q0AL04_MARMM Length = 475 Score = 105 bits (262), Expect = 3e-21, Method: Compositional matrix adjust. Identities = 92/337 (27%), Positives = 133/337 (39%), Gaps = 54/337 (16%) Query: 29 LMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEK 88 L++ F +++ + GV + R R P ++ P +SI+ P +NE + Sbjct: 18 LILASALFQDTLYAVLIAIAGVT--LMRRRFEPNAHRLMRRIGQEAPRVSILAPAYNEAE 75 Query: 89 NVEETIHAALAQRYENIEVIAVNDGSTDKT--------------RAILDRMAAQIPHLRV 134 V + A L Q + N EVI +NDGS D T R DR+ P V Sbjct: 76 TVVDCTRALLRQNHSNYEVIVINDGSGDDTLNRLVRAFDLKPAERECSDRLDF-TPVRAV 134 Query: 135 IHLAQ----------NQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNP 184 H N GKA AL G A A ID D+LL+ DA +P L NP Sbjct: 135 YHRPGPVPLWVIDKVNGGKADALNAGIAHASGSLFCAIDADSLLEADALIRTSQPFLDNP 194 Query: 185 RVGAVTGNPRI----RTRSTLVGK----------IQVGEYSSIIGLIKRTQRIYGNVFTV 230 AV G R+ R R V + +Q EY + + G + V Sbjct: 195 DTVAVAGTVRVANGCRLREGAVEEAGLPRRFLPLVQSLEYLRAFLVARFALSELGALMIV 254 Query: 231 SGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLN------QWTIFYEPRALCWILMPE 284 SG F R+ + EVG + + ED ++ +L + + + P +CW +PE Sbjct: 255 SGAFGMFDRARVVEVGGYDPRTVGEDAELIVRLHRDGRDSDRPTVVRFIPEPVCWTQVPE 314 Query: 285 TLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWP 321 L L QR RW +G E LWR + + P Sbjct: 315 RLSDLAGQRRRWQRGALET-------LWRHRDAALRP 344 >UniRef50_Q97G47 Glycosyltransferase n=1 Tax=Clostridium acetobutylicum RepID=Q97G47_CLOAB Length = 447 Score = 105 bits (262), Expect = 4e-21, Method: Compositional matrix adjust. Identities = 66/241 (27%), Positives = 121/241 (50%), Gaps = 12/241 (4%) Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRY--ENIEVIAVNDGSTDKTRAILDRMAAQIPHL 132 P I+I+IP +N E +E +++ + Q Y E I V+ +N+G DK + + P L Sbjct: 50 PDITILIPIYNSENTLENCLNSVINQTYPIEKISVMLINNGKRDKAYNVFCSFQERNPRL 109 Query: 133 RVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTG- 191 +V L + GKA AL G A +Y++ ID D +LD++A +VE NP + A+TG Sbjct: 110 KVWWLDSSSGKAKALNKGIYMANGKYIINIDSDGVLDKEAILKVVEKFETNPDICAMTGV 169 Query: 192 ---NPRIRTRST-----LVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALA 243 +P + + L+ + ++ EY + + Q +FT++G + +RR + Sbjct: 170 VLTDPELIEETKLKNLRLIQRCELFEYIEAFLIGRGFQSQVNMMFTLAGAFSCYRREIIM 229 Query: 244 EVGYWSDDMITEDIDISWKL-QLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAE 302 + ++++ + ED ++ ++ +L I A ++ + L L+ QR RW +G E Sbjct: 230 KTQLYNNETLGEDTHMTSQIRELLDGKIALCEEAFFYVDPIDDLDKLYIQRQRWQRGQME 289 Query: 303 V 303 V Sbjct: 290 V 290 >UniRef50_A3I0Z1 Glycosyltransferase n=1 Tax=Algoriphagus sp. PR1 RepID=A3I0Z1_9SPHI Length = 489 Score = 104 bits (260), Expect = 6e-21, Method: Compositional matrix adjust. Identities = 75/264 (28%), Positives = 125/264 (47%), Gaps = 12/264 (4%) Query: 52 FWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRY-ENIEVIAV 110 FW R+ W + P ++ P +++ +P FNE V+ I AA Y + + I + Sbjct: 32 FWKARK----WLASTPMKEMDTWPKVTVQLPIFNELYVVDRLIEAAANLNYPKELLEIQL 87 Query: 111 NDGSTDKTRAILDRMAAQIP--HLRVIHLAQNQG-KAIALKTGAAAAKSEYLVCIDGDAL 167 D STD+T ++ P + + IH G KA ALK G A+ E++ D D + Sbjct: 88 LDDSTDETVDLIQEKIKNYPEVNFQYIHRQDRVGFKAGALKEGLVNAEGEFIAIFDADFV 147 Query: 168 LDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRS-TLVGKIQVGEYSSIIGLIKRTQRIYGN 226 D D + P + +VG V RS +L+ ++Q + LI++ R Y + Sbjct: 148 PDPDFLLKTL-PYFSSEKVGMVQSRWTHLNRSYSLLTRLQAFALDAHF-LIEQMGRNYQH 205 Query: 227 VF-TVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPET 285 F +G +R+S + + G W DD +TED+D+S++ Q W Y P +P Sbjct: 206 AFINFNGTGGVWRKSCILDSGNWHDDTLTEDLDLSYRAQRKGWEFIYRPEIESPAELPPI 265 Query: 286 LKGLWKQRLRWAQGGAEVFLKNMT 309 + + Q+ RW +GGAE K+++ Sbjct: 266 MSAVKSQQFRWTKGGAECARKHIS 289 >UniRef50_B0U7V3 Glycosyl transferase family 2 n=1 Tax=Methylobacterium sp. 4-46 RepID=B0U7V3_METS4 Length = 483 Score = 104 bits (259), Expect = 8e-21, Method: Compositional matrix adjust. Identities = 77/276 (27%), Positives = 122/276 (44%), Gaps = 27/276 (9%) Query: 67 PAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMA 126 PA P +SI++P FNE + V + + Y N E+I V+DGSTD T +A Sbjct: 101 PAELPNPAPFVSILVPAFNESETVIGAPTSLMTLDYPNYEIIFVDDGSTDDTFIKAFPLA 160 Query: 127 AQIPHLRV-IHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPR 185 Q + + ++ N GK +L AK + L+C+D D+ L +DA +V P + Sbjct: 161 GQYGNCTLRVYTKPNGGKWSSLNFAYKKAKGDLLLCVDADSGLAKDALRVMV-PRMSEKG 219 Query: 186 VGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEV 245 V G IR R+ + K+Q EY G ++ + G V V G I ++R+ L E+ Sbjct: 220 VVGCAGQVTIRNRNNFLTKLQAAEYLLGNGGMRMALSLMGVVTVVPGPIGLYKRAILEEI 279 Query: 246 ----------------------GYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMP 283 G S + ED +S I YEPRA + P Sbjct: 280 TKIPRKLSAAAAVTHSGPGAVNGPLSGETFAEDFQLSLSALALGGRIVYEPRAYAYTKCP 339 Query: 284 ETLKGLWKQRLRWAQGGAEVFL---KNMTRLWRKEN 316 + + L QR RW +G +V++ + +++L + N Sbjct: 340 DDVATLMSQRYRWIRGTWQVYIVYVRTLSKLTNRGN 375 >UniRef50_D1CD85 Glycosyl transferase family 2 n=1 Tax=Thermobaculum terrenum ATCC BAA-798 RepID=D1CD85_THET1 Length = 387 Score = 104 bits (259), Expect = 9e-21, Method: Compositional matrix adjust. Identities = 75/229 (32%), Positives = 122/229 (53%), Gaps = 10/229 (4%) Query: 70 QLKDNPSISIIIPCFNEEKNVEETIHAALAQRYEN--IEVIAVNDGSTDKTRAILDRMAA 127 Q+K PSISI++ NE +E+++ LA Y + +EV+ V+DGSTD T I+ +++ Sbjct: 44 QVKSWPSISIVMAVHNEASVIEDSLTDLLAMDYPHGELEVVVVSDGSTDATNEIVRSISS 103 Query: 128 QIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVG 187 + +R+I L Q G+ A+ G AA+ E +V D D D+D +I + L +PR+G Sbjct: 104 KDSRVRLIEL-QRSGQTAAIIQGVLAARYEVIVRSDADTRHDKDFLIHIAKHYL-DPRIG 161 Query: 188 AVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGY 247 V G +IR S G Y L++R + G + T SG + +FRR Sbjct: 162 CVGGELKIRNTSASGISQSEGIYWRYEMLLRRLESDAGVLSTTSGAVMSFRREIFEP--- 218 Query: 248 WSDDMITEDIDISWKLQLNQ-WTIFYEPRALCWILMPETLKGLWKQRLR 295 +S D +ED+ I KL + + + + +EPRA+ + +MP + +G R R Sbjct: 219 FSPD-CSEDVVIP-KLAIKKGYVMVHEPRAVAYEVMPSSARGEVNARRR 265 >UniRef50_A5FCY6 Candidate polysaccharide-forming beta-glycosyltransferase; Glycosyltransferase family 2 n=2 Tax=Flavobacteriaceae RepID=A5FCY6_FLAJ1 Length = 477 Score = 103 bits (257), Expect = 1e-20, Method: Compositional matrix adjust. Identities = 81/283 (28%), Positives = 127/283 (44%), Gaps = 49/283 (17%) Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAIL----DRMAAQI- 129 P +S+I P +NE +EE + + L+ Y EVI VNDGS D + +L D + ++ Sbjct: 63 PKLSLIAPAYNEGFTIEENVKSLLSLNYNQYEVIVVNDGSKDNSMEVLINTYDLVLTELA 122 Query: 130 -------PHLRVIHLA-------------QNQGKAIALKTGAAAAKSEYLVCIDGDALLD 169 +R ++++ +N GKA AL G A++ Y+VCID D +LD Sbjct: 123 FHQQIETKKIRGLYVSRNAAFKKLIVVDKENGGKADALNAGLNIAQNPYVVCIDVDCILD 182 Query: 170 RDAAAYIVEPMLYNP--RVGAVTGNPRIRTR--------------STLVGKIQVGEYSSI 213 +D+ + +P L + R+ A G RI + L+ +IQV EY Sbjct: 183 KDSLLKLAKPFLESSGQRIIATGGVVRIANQCIIKNGRLIEVNVPDLLLPRIQVLEYLRA 242 Query: 214 IGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDI-----SWKLQLN-Q 267 L + + +SG AF + G +S + ED+++ + L+ N Sbjct: 243 FLLGRMAWGRLDGLLLISGAFGAFDKEIALLAGGYSTKTVGEDMELVVRMRRYMLENNLP 302 Query: 268 WTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEV--FLKNM 308 +++ Y P LCW PE K KQR RW +G E F K M Sbjct: 303 YSVRYIPDPLCWTEAPEDFKIFKKQRSRWMRGTIETLDFHKKM 345 >UniRef50_C0Z7U1 Conserved hypothetical membrane protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0Z7U1_BREBN Length = 480 Score = 102 bits (255), Expect = 2e-20, Method: Compositional matrix adjust. Identities = 88/301 (29%), Positives = 129/301 (42%), Gaps = 73/301 (24%) Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAIL---------DRM 125 P ISI++P +NEE + E + + L+ Y EV+ VNDGS D T ++ + M Sbjct: 56 PPISILVPAYNEELTIIENVRSLLSLNYPRYEVLVVNDGSKDATLQVMIDEFSLVYSEHM 115 Query: 126 AAQ----------------IPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLD 169 Q P+L +I +N GKA +L G + + ID D+LL+ Sbjct: 116 PVQPLLHTSKIRGIYHNPAFPNLYLID-KENGGKADSLNAGINLSHYPLIASIDADSLLE 174 Query: 170 RDAAAYIVEPMLYNP-RVGAVTGNPR--------------IRTRSTLVGKIQVGEY---- 210 +DA I + + NP A+ G+ R IR + +Q EY Sbjct: 175 KDALIRIAKVYMENPDETVAIGGDIRIANGCKIEDGVVKDIRLPDKFLPMMQSVEYLKAF 234 Query: 211 -------SSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKL 263 S+I GLI VSG FR+ + +VG + + ED++I KL Sbjct: 235 LGGRIGWSAINGLI-----------IVSGAFGVFRKDYIIKVGGYREGYPGEDMNIIIKL 283 Query: 264 QL----NQ--WTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENF 317 N+ + + + P A+CW P+TLK L QR RW +G LKNM R F Sbjct: 284 HKYMLENKLPYRVAFCPDAVCWTQAPDTLKILGSQRRRWGRGN----LKNMIEYGRHMAF 339 Query: 318 R 318 R Sbjct: 340 R 340 >UniRef50_A0LZM3 Transmembrane family-2 glycosyl transferase-possibly involved in biofilm formation n=2 Tax=Flavobacteriaceae RepID=A0LZM3_GRAFK Length = 473 Score = 102 bits (255), Expect = 2e-20, Method: Compositional matrix adjust. Identities = 76/270 (28%), Positives = 119/270 (44%), Gaps = 48/270 (17%) Query: 81 IPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKT-RAILDRMAAQ---------IP 130 P FNEE NV E + + L Y + E++ +NDGS D T A+L Q IP Sbjct: 64 APAFNEEANVVENVRSLLTLNYPSYEIVIINDGSKDGTLEALLTTFDMQKQEFLYHPFIP 123 Query: 131 ----------------HLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAA 174 +L VI +N GKA AL TG +E + CID D +L+ DA Sbjct: 124 SKEVKGVYKSKNEAYKNLTVID-KENGGKADALNTGINVCDNEIICCIDVDCILENDALL 182 Query: 175 YIVEPMLYNPRVGAVTGN---------------PRIRTRSTLVGKIQVGEYSSIIGLIKR 219 +V+P L N + +G I+ T + + Q+ EY + + Sbjct: 183 KLVKPFLNNTKKVIASGGIIRVANSCVIEDGRILDIKLPDTFLARAQIIEYFRAFLMGRM 242 Query: 220 TQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDI----SWKLQLNQ--WTIFYE 273 + +SG F + + +VG ++ D + ED+++ K++ N +T+ + Sbjct: 243 AWSRLDGLLLISGAFGMFDKDTVIKVGGYNPDTVGEDMELLVRMRKKMRENNIPYTVGFV 302 Query: 274 PRALCWILMPETLKGLWKQRLRWAQGGAEV 303 P LCW +P+ K L +QR RW +G AE Sbjct: 303 PNPLCWTEVPQNWKVLHRQRNRWTRGTAET 332 >UniRef50_Q974Y9 Putative uncharacterized protein ST0523 n=2 Tax=cellular organisms RepID=Q974Y9_SULTO Length = 419 Score = 102 bits (255), Expect = 2e-20, Method: Compositional matrix adjust. Identities = 77/254 (30%), Positives = 126/254 (49%), Gaps = 13/254 (5%) Query: 65 NAPAPQLKDNPSISIIIPCFNE-EKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILD 123 N PQ K SII+ NE +K +E IH L YE E+I ++D +K Sbjct: 32 NVYEPQNKSQKGFSIIVAEKNESKKTIEGLIHNLLDINYEKYEIIIISDDPKEKFEEKY- 90 Query: 124 RMAAQIPHLRVIHLAQNQ---GKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPM 180 R I L+++H +N GKA AL G +K EYLV +D DA +D+D ++ + Sbjct: 91 REYMNIEKLKILH--RNNPVGGKAGALNYGVKHSKYEYLVFLDSDARVDKD---FLSKLA 145 Query: 181 LYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRS 240 LY+ V + RI T V + ++ + + + G +G R+ Sbjct: 146 LYDFNVASY--RLRIYNAETDVQRYYKEFTEKVMDSLFKVRYYLGLPIFPNGSAFCIRKD 203 Query: 241 ALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGG 300 L +VG W +++ITED+++ +L L+ + I Y + + L P TL L+KQ RW+ G Sbjct: 204 ILIKVGGWKENIITEDLELGIRLFLHNYKIKYLDDIIVYSLAPFTLDDLYKQIERWSYGS 263 Query: 301 AEVFLKNMTRLWRK 314 A++ L++ +L++K Sbjct: 264 AQLLLQSF-KLFKK 276 >UniRef50_C7RTK3 Polysaccharide deacetylase n=2 Tax=Bacteria RepID=C7RTK3_9PROT Length = 336 Score = 102 bits (255), Expect = 2e-20, Method: Compositional matrix adjust. Identities = 59/213 (27%), Positives = 104/213 (48%), Gaps = 2/213 (0%) Query: 104 NIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCID 163 + +++ V+DGS+D T + P +R I +N GK A G A ++E +V +D Sbjct: 17 DFQIVVVDDGSSDATYRVACDAFGSDPRVR-IFTKENGGKPAAPNFGLAQTQAEVIVALD 75 Query: 164 GDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRI 223 D + RD + +PRVGAV GN ++ R L+ + Q EY + L +R + Sbjct: 76 ADTIFARDTVRKLAR-HFEDPRVGAVAGNAKVGNRINLLTRWQALEYITSQNLDRRAFDV 134 Query: 224 YGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMP 283 + V G + A+RR + G + + ED D++ ++ + + YE A+ P Sbjct: 135 LNCITVVPGAVGAWRRELVERAGGFDHQTLAEDADLTLAIRKLGYNVVYEDEAVALTEAP 194 Query: 284 ETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKEN 316 ++++G QR RW G +V K+ L+R ++ Sbjct: 195 DSVRGFIGQRYRWMYGTMQVARKHRDVLFRPQS 227 >UniRef50_Q9KAQ6 BH2231 protein n=4 Tax=Bacillus RepID=Q9KAQ6_BACHD Length = 482 Score = 102 bits (254), Expect = 3e-20, Method: Compositional matrix adjust. Identities = 78/283 (27%), Positives = 133/283 (46%), Gaps = 53/283 (18%) Query: 70 QLKDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKT----------- 118 +L +SI++P +NEE + ET+ + L+ +Y E++ VNDGSTD+T Sbjct: 59 ELTYTKPVSILVPAYNEETGIIETVRSLLSLKYPQTEIVVVNDGSTDQTLEVIIEHFQMV 118 Query: 119 ---RAILDRMAAQ----------IPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGD 165 + I ++ + PHL ++ N GKA AL G +K Y IDGD Sbjct: 119 KVGKVIRKQIETEPIKGVYQSTIFPHLLLVD-KSNGGKADALNAGLNVSKYPYFCSIDGD 177 Query: 166 ALLDRDAAAYIVEPMLYN----PRVGAVTGNPRIRTRSTL-VGKI-------------QV 207 ++L+ DA +++P++ + V A GN RI S + +G + QV Sbjct: 178 SILETDALLKVMKPIVTSRDDEDEVIASGGNVRIANGSDIQMGSVLSVQLAKNPLVVMQV 237 Query: 208 GEY--SSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKL-- 263 EY + ++G I ++ + V +SG + F + + E G +S + ED+++ +L Sbjct: 238 IEYLRAFLMGRIGLSR--HNMVLIISGAFSVFAKKWVMEAGGYSKKTVGEDMELVVRLHR 295 Query: 264 ----QLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAE 302 + + I + P +CW P T + L +QR RW +G E Sbjct: 296 LVKEKRLKKRITFVPDPVCWTEAPATFRVLQRQRSRWHRGLME 338 >UniRef50_A1S0Z5 Glycosyl transferase, family 2 n=1 Tax=Thermofilum pendens Hrk 5 RepID=A1S0Z5_THEPD Length = 420 Score = 102 bits (254), Expect = 3e-20, Method: Compositional matrix adjust. Identities = 68/238 (28%), Positives = 116/238 (48%), Gaps = 5/238 (2%) Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRY--ENIEVIAVNDGSTDKTRAILDRMAAQIPH- 131 P +++I+P +E + VE ++A L+ Y E +EVI V+ S I+ R + P Sbjct: 67 PRVTVIVPSKDEGRRVERCLNAILSSDYPLEKLEVIVVDASSDGYVEEIVRRAGERYPGA 126 Query: 132 LRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTG 191 +R+I + +GK AL A E + D D++ +RDA V+ L P V AV G Sbjct: 127 VRLIREEEPRGKPAALNRALREATGEVVAVFDADSVPERDAIRRAVK-HLEEPGVAAVQG 185 Query: 192 NPRI-RTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSD 250 + R +++ ++ E + + R + G ++G +RSAL EVG W + Sbjct: 186 KTLVLNERESVLARVASKEEKAWFHALIRGRERLGLFVPLTGSCQFVKRSALEEVGGWRE 245 Query: 251 DMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNM 308 D + ED+++S L + + Y + W P +L+ L QR RW +G E F +++ Sbjct: 246 DALAEDLELSMDLLARGYRVKYANDVVSWQEAPTSLRSLAVQRNRWYRGYMEAFARHL 303 >UniRef50_C7TL81 Glycosyl transferase, group 2 n=3 Tax=Bacilli RepID=C7TL81_LACRL Length = 417 Score = 102 bits (253), Expect = 4e-20, Method: Compositional matrix adjust. Identities = 74/258 (28%), Positives = 126/258 (48%), Gaps = 13/258 (5%) Query: 77 ISIIIPCFNEEKNVEETIHAALAQRY--ENIEVIAVNDGSTDKTRAILDRMAAQIP--HL 132 +S+++P NE + +T+ + L Y + E+I +ND S+D + A+L + + P L Sbjct: 51 VSVMVPAHNEGVVIVKTVESLLRFDYPQDRYEIIVINDNSSDNSAALLRDLQHRYPDRQL 110 Query: 133 RVIH---LAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAV 189 VI+ + +GK+ AL G A+ L D D + A +V ++ + +GAV Sbjct: 111 HVINTDAVTGGKGKSNALNIGLTKAQGSVLAIYDADNTPEFGALRILVSELMADDGLGAV 170 Query: 190 TGNPRIRTRST--LVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGY 247 G R R + L I + S Q ++G + T+ G RRS + ++G Sbjct: 171 IGKFRTRNKQATWLTRFINIETLSFQWMAQAGRQHLFG-LCTIPGTNYVIRRSLIDKIGG 229 Query: 248 WSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKN 307 W + ED +IS+ + +N I ++P+A+ W P+TL + QR RW +G V LKN Sbjct: 230 WDVKALAEDTEISFHVYMNGARIKFQPKAVTWEQEPQTLDVWFHQRTRWVKGNIYVILKN 289 Query: 308 MTRLWRKENFRMWPLFFE 325 L++K R P+ F+ Sbjct: 290 SALLFQK---RGRPIRFD 304 >UniRef50_A0LSJ8 Glycosyl transferase, family 2 n=1 Tax=Acidothermus cellulolyticus 11B RepID=A0LSJ8_ACIC1 Length = 471 Score = 101 bits (252), Expect = 5e-20, Method: Compositional matrix adjust. Identities = 80/255 (31%), Positives = 127/255 (49%), Gaps = 8/255 (3%) Query: 65 NAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRY--ENIEVIAVNDGSTDKTRAIL 122 +P +D P++S+++ C NEE V I +A Y +E++ VNDGS+D+T +L Sbjct: 82 ESPEVPSEDLPTVSVLVACHNEESVVPGLIRGLVALHYPRSRLEILIVNDGSSDRTGELL 141 Query: 123 DRMAAQIPHLRVIH--LAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPM 180 D++ A I H+RVIH GK+ AL A A E +V D D + R+ +V Sbjct: 142 DQLTAGIRHMRVIHRPAGAGGGKSGALNAALAEANGEIVVVFDADHIPRRNVIHRLVR-H 200 Query: 181 LYNPRVGAVTGNPRIRT--RSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFR 238 +P V AV G +R +STL I + +S + Q ++G + G A R Sbjct: 201 FRDPTVAAVQGRCIVRNSVQSTLARSIAIDYFSGYLVNEYGRQALFG-LPAYGGANCAVR 259 Query: 239 RSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQ 298 + L + G W+ + +TED DI+ +L L + + Y+ A+ TL +QR RWA+ Sbjct: 260 TALLRQFGGWNVNSVTEDTDITLRLVLAGYRVRYDITAVDTEEGATTLARFVRQRYRWAR 319 Query: 299 GGAEVFLKNMTRLWR 313 G +V+ +WR Sbjct: 320 GHQQVWRDYRRGVWR 334 >UniRef50_C0G271 Glycosyl transferase family 2 n=1 Tax=Natrialba magadii ATCC 43099 RepID=C0G271_NATMA Length = 466 Score = 101 bits (251), Expect = 6e-20, Method: Compositional matrix adjust. Identities = 90/322 (27%), Positives = 142/322 (44%), Gaps = 66/322 (20%) Query: 39 FFMSIMWIVGGVYFWVYRE---------RHW---PWGENAPAPQLKDNPSISIIIPCFNE 86 FF++ +V G Y +V+ R W P +P L P I++I+P +NE Sbjct: 15 FFLAYFALVNGSYLFVHLVSIVTLQRGIRDWLLDPVYHPHNSPFL---PGIAVIVPAYNE 71 Query: 87 EKNVEETIHAALAQRYENIEVIAVNDGSTDKT----------RAILDRMAAQIPHLRVIH 136 E + +T+ + L Y N E++ VNDGS D+T + + + +P V Sbjct: 72 EVTIVDTVQSLLNLEYSNHEIVVVNDGSRDETLEQLISTFELEPVGEDVPFDLPCETVHE 131 Query: 137 LAQ------------NQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNP 184 + Q N GKA AL G ID D++++R+A +V+P L +P Sbjct: 132 VYQSSRVDLVVIDKDNGGKADALNAGVFFTDQPLFCAIDADSIIERNALLEVVQPFLRDP 191 Query: 185 -RVGAVTGNPRIR-----TRSTL---------VGKIQVGEY-----SSIIGLIKRTQRIY 224 R+ A G R+ TR + + +Q EY S IGL Sbjct: 192 ERMIASGGAVRVANGCSITRGVITDIGLSRSHLANLQTVEYLRAFLSGRIGLSG-----L 246 Query: 225 GNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLN----QWTIFYEPRALCWI 280 G++ +SG FR S + EVG +S + ITED+++ +LQ + + I + P + W Sbjct: 247 GSLLIISGAFGLFRTSTVREVGGYSTETITEDMELIVRLQKHVSDEKHRITFVPYPVVWT 306 Query: 281 LMPETLKGLWKQRLRWAQGGAE 302 +PE L +QR RW +G E Sbjct: 307 EVPERRAVLARQRGRWFKGMLE 328 >UniRef50_Q2RZV9 Putative glucosyltransferase n=1 Tax=Salinibacter ruber DSM 13855 RepID=Q2RZV9_SALRD Length = 510 Score = 101 bits (251), Expect = 7e-20, Method: Compositional matrix adjust. Identities = 73/266 (27%), Positives = 123/266 (46%), Gaps = 9/266 (3%) Query: 77 ISIIIPCFNEEKNVEETIHAALAQRYENIEV-IAVNDGSTDKTRAILDRMAA--QIPHLR 133 +++ +P +NE + I A + Y + I V D STD T + R A Q + Sbjct: 61 VTVQLPLYNEAEVAHRLIDACVQLDYPRSRLDIQVLDDSTDATTERVARRVAHWQAEGVN 120 Query: 134 VIHLAQNQG---KAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYN-PRVGAV 189 + H+ ++ KA AL G A+ + + D D + R + + P ++ P +G V Sbjct: 121 ITHVRRDDRTGYKAGALANGLQRARGDLIAIFDAD-FVPRPSFLRRLVPRFFDAPDLGMV 179 Query: 190 TGN-PRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYW 248 + +L+ K+Q + + +R + + G +G +RR+ + + G W Sbjct: 180 QARWGHLNRDDSLLTKVQAFGLDAHFAIEQRVRELAGCFLNFNGTAGVWRRACIEDAGGW 239 Query: 249 SDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNM 308 + D +TED+D+S++ QL W + Y P A +P + L Q+ RWA+GGAE LK Sbjct: 240 AHDTLTEDLDLSYRAQLQGWRLTYVPAAEAPAELPPDMNALRAQQFRWAKGGAETALKLT 299 Query: 309 TRLWRKENFRMWPLFFEYCLTTIWAF 334 RLWR L + LT +AF Sbjct: 300 GRLWRSAQPWRVKLEGTFHLTAHFAF 325 >UniRef50_C7IGQ0 Glycosyl transferase family 2 n=2 Tax=Firmicutes RepID=C7IGQ0_9CLOT Length = 433 Score = 100 bits (249), Expect = 1e-19, Method: Compositional matrix adjust. Identities = 79/273 (28%), Positives = 122/273 (44%), Gaps = 22/273 (8%) Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRYEN--IEVIAVNDGSTDKTRAILDRMAAQIPHL 132 P+++I+IP NEEK + T+ L Y I++I +ND S+D T + ++ P Sbjct: 52 PTVTIMIPAHNEEKVIASTLERILELNYPQHLIQIIVINDNSSDATGVQIKKVQDNSPER 111 Query: 133 RVIHLAQNQ-----GKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVG 187 R+ + N+ GKA AL AK E++ D DA +R+ +++ + R Sbjct: 112 RINVITTNKENGGRGKAHALNLALKQAKGEWICIFDADAAPERNTLKFLIIKSFEDKRFA 171 Query: 188 AVTGNPRIRTRSTLVGKIQVGEYSSIIGL-IKRTQRIYG-------NVFTVSGVIAAFRR 239 AV G + R R+ + I L + +QRI + + G R Sbjct: 172 AVFGRNKARNRNRNF-------LTKCINLELIASQRIVNPGKWYLFKLGQIPGTNFIINR 224 Query: 240 SALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQG 299 + L +VG W ITED D+ + + + I YE RA + PE L KQR RWA+G Sbjct: 225 TILQKVGGWDSRAITEDTDLGFTIMKRNYLIAYESRAEAYQQEPEKLSVYVKQRTRWAKG 284 Query: 300 GAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIW 332 V KN L+ +R+ Y +T IW Sbjct: 285 NLYVVFKNFPDLFSGLAWRIKFDILFYWMTYIW 317 >UniRef50_A9KK06 Glycosyl transferase family 2 n=12 Tax=Bacteria RepID=A9KK06_CLOPH Length = 648 Score = 100 bits (248), Expect = 1e-19, Method: Compositional matrix adjust. Identities = 87/344 (25%), Positives = 155/344 (45%), Gaps = 47/344 (13%) Query: 1 MINRIVSFFILCLVLCI-PLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERH 59 MI+ ++ FFI ++ + V YF + + + F+ ++ G +Y R Sbjct: 182 MISSLLLFFITAFIILMRAESVIYFKFNKWLYLYSIIAACFLLSRYLFGFMY------RK 235 Query: 60 WPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRY--ENIEVIAVNDGSTDK 117 P N P +SI+IPCFNEE+ ++ TI + + Q Y + +EVI ++D S DK Sbjct: 236 VPIDINF-------TPGVSILIPCFNEEEWIQRTILSCINQDYPVDKLEVIVIDDCSHDK 288 Query: 118 T----RAILDRMAAQIPH------LRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDAL 167 + I++++ + L+ I +N GK AL GA AK E +V +D D+ Sbjct: 289 SVEKIEEIINKLYQEADQFDIKNRLKYIVQKENAGKREALALGALEAKHELVVFVDSDSF 348 Query: 168 LDRDAAAYIVEPMLYNPRVGAVTGNPRI-RTRSTLVGKIQVGEYSSIIGLIKRTQRIYGN 226 LD A +V+P +P++G V+G + T + + K+Q Y ++K + + Sbjct: 349 LDPFAIRNLVQP-FKDPKMGGVSGRTDVANTFTNTLTKMQSVRYYIAFRIMKAAESYFDA 407 Query: 227 VFTVSGVIAAFRR-------------SALAEVGYWSDDMITEDIDISWKLQLNQWTIFYE 273 V +SG +A +++ S L + + DD + L + Y+ Sbjct: 408 VTCLSGPLACYKKEIILENREAWLNQSFLGQRATFGDDRAMTN------FVLKKHRTAYQ 461 Query: 274 PRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENF 317 A+C ++P + KQ++RW + L +W+KE F Sbjct: 462 DSAICSTIVPNQYRVFLKQQMRWKRSWLRESLIAGKFMWKKEPF 505 >UniRef50_C9KN46 Chitin synthase n=1 Tax=Mitsuokella multacida DSM 20544 RepID=C9KN46_9FIRM Length = 628 Score = 100 bits (248), Expect = 2e-19, Method: Compositional matrix adjust. Identities = 86/344 (25%), Positives = 148/344 (43%), Gaps = 49/344 (14%) Query: 5 IVSFFILCLVLCIPL----CVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHW 60 + SFF+ +V I L V YF + + F+ ++ G Y Sbjct: 179 VASFFLAVIVFVIVLMRAESVIYFQFNRWLYLYSIIAATFLLTRYLFGSFY--------- 229 Query: 61 PWGENAPAPQLKD-NPSISIIIPCFNEEKNVEETIHAALAQRY--ENIEVIAV----NDG 113 P D P + II+PCFNEEK ++ TI + Q Y + +EVI V ND Sbjct: 230 -----RPTKIDPDYTPGVMIIVPCFNEEKWIQHTILGCINQDYPIDKLEVIVVDDCSNDH 284 Query: 114 STDKTRAILDRMAAQIPHLRVIHLAQ---------NQGKAIALKTGAAAAKSEYLVCIDG 164 S DK + I++R+ ++ + N+GK A+ GA AK E LV +D Sbjct: 285 SVDKIKEIIERLKQSDGDQKMYRVEDRLHYYVQPVNKGKREAMAVGAHMAKHELLVFVDS 344 Query: 165 DALLDRDAAAYIVEPMLYNPRVGAVTGNPRI-RTRSTLVGKIQVGEYSSIIGLIKRTQRI 223 D+ LD A IV+P + ++G V+G + T + + K+Q Y ++K + Sbjct: 345 DSFLDPYAVRNIVQP-FKDKKMGGVSGRTDVANTYTNSLTKMQAVRYYIAFRIMKAAEGY 403 Query: 224 YGNVFTVSGVIAAFRRSALAEVGYWSDDMITE----------DIDISWKLQLNQWTIFYE 273 + V +SG ++ +R+ + + + DD + + D L Y+ Sbjct: 404 FDAVTCLSGPLSCYRKDLVLK---YCDDWLNQKFLGQRATFGDDRSMTNFILRHHRTTYQ 460 Query: 274 PRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENF 317 A+C ++P++ K +Q++RW + + +W+KE F Sbjct: 461 DTAVCMTIVPKSHKMFLRQQMRWKRSWLRESIIAARYMWKKEPF 504 >UniRef50_C6X1A6 Glycosyl transferase, family 2 n=3 Tax=Flavobacteriaceae RepID=C6X1A6_FLAB3 Length = 824 Score = 99.4 bits (246), Expect = 2e-19, Method: Compositional matrix adjust. Identities = 79/283 (27%), Positives = 121/283 (42%), Gaps = 49/283 (17%) Query: 73 DNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKT-------------- 118 + P S+I P +NE + E + + L+ Y N+E+I VNDGS D + Sbjct: 58 NAPVFSLIAPAYNEGMTIVENVRSLLSLYYHNLEIIIVNDGSKDDSIQKLIEAYELESVN 117 Query: 119 ------------RAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDA 166 R I L V+ A N GKA AL G + +YLVCID D Sbjct: 118 FFIQGNIPTAEVRGIYKSKNPAFRKLTVVDKA-NGGKADALNVGVNISGGDYLVCIDVDC 176 Query: 167 LLDRDAAAYIVEPML--YNPRVGAVTG------NPRIRTRS--------TLVGKIQVGEY 210 +L++DA + +P L + ++ A G N RI TL+G+ QV EY Sbjct: 177 ILEQDAILKLAKPFLEQTDKKLIACGGVIRLANNCRIENGKVVEVNLPRTLLGRTQVLEY 236 Query: 211 SSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQ--LNQW 268 L + + +SG F R + G + + + ED+++ +++ + + Sbjct: 237 IRAFVLGRMAWSRASGLILISGAFGVFDRKIVLACGGYDKNTVGEDMELVVRMRQYMEER 296 Query: 269 TIFYE----PRALCWILMPETLKGLWKQRLRWAQGGAEVFLKN 307 YE P LCW PE+ + L KQR RW +G E K+ Sbjct: 297 NEPYEVINIPDPLCWTEAPESKEILVKQRNRWMRGTMETLWKH 339 >UniRef50_Q9UY40 Glycosyl transferase, family 2 n=2 Tax=Thermococcaceae RepID=Q9UY40_PYRAB Length = 447 Score = 99.0 bits (245), Expect = 3e-19, Method: Compositional matrix adjust. Identities = 74/266 (27%), Positives = 126/266 (47%), Gaps = 23/266 (8%) Query: 58 RHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDK 117 + +P+ E + + P + ++IP NEE+ + +T+ + L Q Y N++VI +ND STD+ Sbjct: 59 KRYPYDETGFNLEFLE-PLVYVLIPAHNEERVIYKTVRSVLGQDYRNMKVILINDNSTDR 117 Query: 118 TRAILDRMAAQIPH-LRVIHLAQNQGKAIALKTGAA-------AAKSEYLVCIDGDALLD 169 TR I++ + + P + +I + +G++ A Y+ +D D L+ Sbjct: 118 TRDIMEEINRKYPRKVVIIDVPPERGRSKPRALNYALEIIEKYMTHPNYVFILDADYLIP 177 Query: 170 RDAAAYIVEPMLYNPR-VGAVTGNPRIRT-RSTLVGKI-----QVGEYSSIIGLIKRTQR 222 +A +V M P+ V + GN R R R V K VG +I G +K + Sbjct: 178 PNALKTLVSIMESAPQYVIGIQGNVRPRNFRKNFVTKFITLERLVGFNVAIEGDMKLNEN 237 Query: 223 -IYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWIL 281 YG G +A R L +G + +D +TED D+ + + + +Y + W Sbjct: 238 GKYG------GTVALLRFPLLIRLGKFREDSVTEDTDLWARAMIAGYRFWYYHGVIGWEE 291 Query: 282 MPETLKGLWKQRLRWAQGGAEVFLKN 307 ETL+ KQR RWAQG +V + + Sbjct: 292 AVETLRDYIKQRSRWAQGHLQVMIDH 317 >UniRef50_UPI0001C42DFC capsule anchoring protein CapD n=1 Tax=Bacillus pseudofirmus OF4 RepID=UPI0001C42DFC Length = 1000 Score = 99.0 bits (245), Expect = 3e-19, Method: Compositional matrix adjust. Identities = 87/317 (27%), Positives = 145/317 (45%), Gaps = 47/317 (14%) Query: 70 QLKDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILD------ 123 QL++ IS+++PC+NE+ +E ++ + Y+ EVI +NDGS D+T ++L+ Sbjct: 581 QLEEEKGISVLVPCYNEQGIIETSLKNMSSLPYQKKEVIYINDGSKDRTFSLLNTYLKLK 640 Query: 124 ----RMAAQIPHLRVIHLAQ-------------NQGKAIALKTGAAAAKSEYLVCIDGDA 166 + ++ H RV + Q N GKA AL G A + +V +D D Sbjct: 641 PSTKKPLQKLSHKRVRGVYQSELYPFIYVIDKENGGKADALNAGIEYAGEDIVVSLDADT 700 Query: 167 LLDRDAAAYIVEPMLYNPRVGA---------VTGNP--RIRTR-STLVGKIQVGEYSSII 214 +L +A + E V A T NP R+ R + + ++Q+ ++ Sbjct: 701 VLTENALPKVNEAFEDEDVVAAGGMVHVLQTKTKNPLKRLSLRHANFLVRLQMFDFLKAF 760 Query: 215 GLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKL------QLNQW 268 + K + + + +SG FR+ AL EVG + I EDIDI+ K+ +N+ Sbjct: 761 YITKISLARFHALAVISGAFGIFRKDALLEVGGYR-STIGEDIDITLKMHEYIAKHVNKK 819 Query: 269 TIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLF---FE 325 I + A+ + MPET +KQR+RW + + + + +RK FR F FE Sbjct: 820 VIHLK-EAISYTEMPETWNDFFKQRVRWQKAYIDCVI-HFHSFFRKTLFRKAVSFFYIFE 877 Query: 326 YCLTTIWAFTCLVGFII 342 L I + + GF I Sbjct: 878 SFLVGIVSAFIMTGFFI 894 >UniRef50_Q2LWC4 Glycosyltransferase n=1 Tax=Syntrophus aciditrophicus SB RepID=Q2LWC4_SYNAS Length = 482 Score = 98.6 bits (244), Expect = 5e-19, Method: Compositional matrix adjust. Identities = 84/321 (26%), Positives = 145/321 (45%), Gaps = 60/321 (18%) Query: 33 FVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLK---DNPSISIIIPCFNEEKN 89 F+ F+ F+ +++ + V RH AP L+ + P +SIIIP NE + Sbjct: 15 FIGFYYGFVQVIYTLMLTIALVVILRHLNRIRYAPFRDLEKRSEMPPVSIIIPARNESEV 74 Query: 90 VEETIHAALAQRYENIEVIAVNDGSTD-----------------------KTRAILD-RM 125 + +I ++L Y +EVI VNDGSTD KTR + Sbjct: 75 IIRSIESSLTMNYPGLEVIVVNDGSTDGMLDMLISHYSLQRIDPLYRDLIKTRPVRGFYC 134 Query: 126 AAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNP- 184 ++IP+L ++ +N GKA A+ +S Y+ D D++L++DA + P++ + Sbjct: 135 TSKIPNLLIVD-KENGGKADAINCAINVCRSPYVCVQDADSVLEKDALIRLATPLVQSEV 193 Query: 185 RVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRT------------QRIYGNV----- 227 V A G R+ L G Q G + I L +R+ ++G V Sbjct: 194 PVIACGGVVRV-----LNGTKQEGNVVTSIELPRRSTLACFQIVEYLRSFLFGRVGLDAM 248 Query: 228 ---FTVSGVIAAFRRSALAEVGYWSDDMITEDIDI------SWKLQLNQWTIFYEPRALC 278 +SG + F+++AL +G ++ D + ED+++ ++L+ + I + +C Sbjct: 249 NCNLVISGAFSFFQKAALLNIGGFATDNVAEDMELIVRMHRHYRLKKIPYRIRFISDPIC 308 Query: 279 WILMPETLKGLWKQRLRWAQG 299 W +PET + L +QR RW G Sbjct: 309 WTEVPETFRMLGRQRRRWHLG 329 >UniRef50_A6LJJ1 Glycosyl transferase, family 2 n=2 Tax=Thermosipho RepID=A6LJJ1_THEM4 Length = 379 Score = 98.2 bits (243), Expect = 6e-19, Method: Compositional matrix adjust. Identities = 72/236 (30%), Positives = 110/236 (46%), Gaps = 10/236 (4%) Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRV 134 P +S+IIPC+NEE + TI A Y+N E+I ++D STD T I + + +L+V Sbjct: 43 PLVSVIIPCYNEEAVIGNTIRAVEKNSYKNFEIIVIDDNSTDSTFEIAKGLEKEYNNLKV 102 Query: 135 IHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPM--LYNPRVGAVTGN 192 I +GK ++ AK E ++ ID DA R YI + NP+V + N Sbjct: 103 IKKKGEKGKPQSINEAMEIAKGEIVLIIDADA---RIPENYISSHVYCFSNPKVEMIFTN 159 Query: 193 -PRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDD 251 R T V IQ Y + + I + I +F +G+ FR+S L + Sbjct: 160 FEAYNFRFTPVFIIQ-EIYFNFVKTIFYSNIIVKMIFMGNGIF--FRKSLLERILPIDPT 216 Query: 252 MITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGG-AEVFLK 306 + +D ++ KL + FY I ++ K LWKQ RW GG E+F++ Sbjct: 217 TLVDDFSMATKLSKLKVKEFYSTYPYIKIQFAKSFKDLWKQHKRWYTGGFREMFMR 272 >UniRef50_A8MM07 Glycosyl transferase family 2 n=1 Tax=Alkaliphilus oremlandii OhILAs RepID=A8MM07_ALKOO Length = 482 Score = 97.8 bits (242), Expect = 7e-19, Method: Compositional matrix adjust. Identities = 89/352 (25%), Positives = 155/352 (44%), Gaps = 57/352 (16%) Query: 77 ISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKT-RAILDRMAAQ------- 128 IS+++P ++EE+ + + I + L+ Y EVI +NDGS D T R I++ + Sbjct: 63 ISVLVPAYDEEETIVDNIKSLLSLNYPAFEVIVINDGSNDDTLRKIVEAYDLKEINQPVR 122 Query: 129 ----------------IPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDA 172 IP+L ++ +N GKA AL G +K ID D++L+ D+ Sbjct: 123 HLLKTKKIKGIYANLDIPNLVLVD-KENGGKADALNVGINVSKYPVFTSIDADSILESDS 181 Query: 173 AAYIVEPMLYNPRVGAVTGNPRIRTRSTLV-GKIQ-VGEYSSIIGLIKRTQRIYGN---- 226 ++ P + + AV G RI S + GK++ +G S + + + + + Sbjct: 182 LVRVIMPFIEDNLTVAVGGIVRIINGSEIKNGKVEAIGLPKSKLAMFQIVEYLRAFLTGR 241 Query: 227 --------VFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLN------QWTIFY 272 + VSG AF++ + + G ++ D I ED+++ K+ ++ + + Sbjct: 242 LAWSKLRCLLIVSGAFGAFKKDVVVQAGGYTPDTIGEDMELVVKIHKMMRKNKVKYKVNF 301 Query: 273 EPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLF---FEYCLT 329 P +CW PE +K L QR RW G M L+R ++ + P + Y + Sbjct: 302 IPDPVCWTQAPENIKDLRGQRRRWQIG-------LMDSLFRHKDLLLNPRYGVVGMYGVP 354 Query: 330 TIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSL 381 W F L G II + +PL+ + + T + L T L I+SL Sbjct: 355 YFWIFEML-GPIIETLGYIIIPLSY-IFGLLNTRYFVLFLITSILYGIILSL 404 >UniRef50_C7NQ00 Glycosyl transferase family 2 n=1 Tax=Halorhabdus utahensis DSM 12940 RepID=C7NQ00_HALUD Length = 475 Score = 97.4 bits (241), Expect = 9e-19, Method: Compositional matrix adjust. Identities = 83/295 (28%), Positives = 127/295 (43%), Gaps = 56/295 (18%) Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKT------RAILDRMAAQ 128 P I I++P +NEE + E++ + L+ Y EV+ VNDGSTD T + L+ + A+ Sbjct: 63 PGIGIVVPAYNEEATIVESVRSMLSLNYPETEVVVVNDGSTDATLDRLREQFDLEAIDAE 122 Query: 129 IP------------------HLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDR 170 IP L V+ N GK+ AL G +D D ++DR Sbjct: 123 IPFDVPAEEIHGVYRSTTDEKLLVVD-KDNGGKSDALNAGIWLTDMPLFCAVDADTIIDR 181 Query: 171 DAAAYIVEPMLYNPRVGAVTGN---------------PRIRTRSTLVGKIQVGE-----Y 210 DA IV P L P +G + T + +QV E Y Sbjct: 182 DALLQIVTPFLKEPTTAVASGGVIRVANDCTVEDGIVTDVSLPKTGLPGLQVMEYLRAFY 241 Query: 211 SSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKL------Q 264 S +GL RI G + +SG F+ + E+G + D ITED DI +L Q Sbjct: 242 SGRLGL----NRINGLIL-ISGAFGLFQTETVREIGGYRHDTITEDFDIVVRLHKHLTEQ 296 Query: 265 LNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRM 319 ++T+ + P + W +P + + L +QR RW +G E + N L+ + R+ Sbjct: 297 DREYTVDFVPEPVAWTEVPSSRRVLSRQRRRWYRGMMETVVTNRKMLFNRAYGRV 351 >UniRef50_C6A3S6 Putative glycosyl transferase n=1 Tax=Thermococcus sibiricus MM 739 RepID=C6A3S6_THESM Length = 388 Score = 97.4 bits (241), Expect = 1e-18, Method: Compositional matrix adjust. Identities = 77/296 (26%), Positives = 136/296 (45%), Gaps = 22/296 (7%) Query: 69 PQLKDN-PSISIIIPCFNEEKNVEETIHAALAQRY--ENIEVIAVNDGSTDKTRAILDRM 125 P++ + PS++I+IP NE + +T+ A Y + +EV+ ++DGSTD T I + + Sbjct: 26 PEIPEELPSVTILIPARNEGLVIRDTLRAMANLDYPKDKLEVLLLDDGSTDDTAKIAEEV 85 Query: 126 AAQIPHLRVIHL-AQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNP 184 + P ++VI + GK+ L G AK E + D D + A +V + + Sbjct: 86 SKDYPFIKVIRVEGGGSGKSYVLNYGLKLAKGEVIAVYDADNRPEPGALKDLV--AMLSD 143 Query: 185 RVGAVTG-------NPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAF 237 AVTG N I TR + + Y + G +++ + + G Sbjct: 144 ETPAVTGKVKTMNWNRNILTRFICMEYL----YFQLAGQAGKSK--FYKTAILPGTNFVI 197 Query: 238 RRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWA 297 R+ L E+G W ++ + ED+++S+++ L I Y P A+ W PE+L+ ++QR RWA Sbjct: 198 RKELLEELGGWDEEALAEDLELSFRIILTGKKIAYTPLAVTWEQEPESLRVWFRQRTRWA 257 Query: 298 QGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLN 353 G + + R + W L F+ LT + + + I+ V + L Sbjct: 258 AGNVHTVKEYVKRF---KEIPSWGLRFDLFLTLMVYYLLAMAVIVADVAFVALLLT 310 >UniRef50_C2EFI6 Glycosyltransferase n=1 Tax=Lactobacillus salivarius ATCC 11741 RepID=C2EFI6_9LACO Length = 444 Score = 97.1 bits (240), Expect = 1e-18, Method: Compositional matrix adjust. Identities = 78/283 (27%), Positives = 133/283 (46%), Gaps = 21/283 (7%) Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRYEN--IEVIAVNDGSTDKTRAILDRMAAQIPHL 132 P +++I+P +N + + + + Y I +I N+ STD + + + ++ Sbjct: 51 PQLTVIVPIYNSGDTLYDCLKSIAESNYPKHLISIICANNKSTDNSFEVYKQAQMDFRNV 110 Query: 133 RVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTG- 191 R+ L NQGKA AL T + +Y+ ID D L +DA + +P V A TG Sbjct: 111 RMQWLEANQGKAKALNTAIYGSTGDYVFTIDSDGTLSKDALLNMAVNFNKHPEVMAQTGT 170 Query: 192 ---NPRIRTRSTLVGKIQVGEY--SSIIGLIKRT-QRIYGNVFTVSGVIAAFRRSALAEV 245 N + S +++ EY +I L RT + +FT+SG +AFRRSAL + Sbjct: 171 ILSNNELIKNSKKHRLLRLEEYYEYAIAFLAGRTVESKDDQIFTMSGAFSAFRRSALMQS 230 Query: 246 GYWSDDMITEDIDISWKLQLN-QWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEV- 303 ++ + + ED D++++++ + + P A+ ++ L+ QR RW +G EV Sbjct: 231 RLYNVETVGEDTDMTFQIRYYLKGRVMLCPNAIFYVEPISDWDELYVQRQRWQRGEIEVI 290 Query: 304 --FLK---NMTRLWRKENFRMWPLFFEYC---LTTIWAFTCLV 338 FL N+ R+W NF + L ++ L IW F V Sbjct: 291 RTFLSEKLNLKRIW--SNFIVRRLLVDHTVAFLKVIWLFAIFV 331 >UniRef50_Q0IE23 Glycosyl transferase, group 2 family protein n=19 Tax=cellular organisms RepID=Q0IE23_SYNS3 Length = 467 Score = 97.1 bits (240), Expect = 1e-18, Method: Compositional matrix adjust. Identities = 70/261 (26%), Positives = 121/261 (46%), Gaps = 21/261 (8%) Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRYENIEVIA--VNDGSTDKTRAILDRMAAQIPHL 132 P++ +++ +EE V + A RY ++ ++DGS DKT +LD + Q P+L Sbjct: 105 PNVDVVVAARDEEAVVARLMERLGALRYPADQLTTWIIDDGSEDKTPDLLDELKPQYPNL 164 Query: 133 RVIHLAQNQ--GKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVT 190 VI +N GK+ AL A + E+++ +D D + D ++ + G Sbjct: 165 NVIRRPRNAGGGKSGALNAALAQSSGEWILVLDADGQISEDQLERLIPIAVMGDWSGVQM 224 Query: 191 GNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSD 250 + L+ ++Q E + LI++ + + G V + G RR L G +++ Sbjct: 225 RKAVTNADTNLLTRLQAMEM-AFDALIQQGRLLGGGVSELRGNGQLLRRDVLEACGGFNE 283 Query: 251 DMITEDIDISWK--LQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNM 308 D +T+D+D+S++ LQ + T+ + P + ET+ LWKQR RWA+GG + F Sbjct: 284 DTVTDDLDLSFRFLLQGARTTLLWNPPVREEAV--ETISALWKQRQRWAEGGLQRF---- 337 Query: 309 TRLWRKENFRMWPLFFEYCLT 329 F WP LT Sbjct: 338 --------FDYWPGLLSNRLT 350 >UniRef50_C6J2E9 Glycosyl transferase n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6J2E9_9BACL Length = 488 Score = 96.7 bits (239), Expect = 1e-18, Method: Compositional matrix adjust. Identities = 90/302 (29%), Positives = 121/302 (40%), Gaps = 78/302 (25%) Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDR---------- 124 P +S+++P FNEE V E + L Y EVI +NDGS D T +L Sbjct: 63 PPVSLLVPAFNEELTVIENVKCLLTLNYPTYEVIVINDGSKDNTLGVLREEFELMPLPNP 122 Query: 125 ---------------MAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLD 169 + PHL VI N GKA +L G A + ID D+LL+ Sbjct: 123 DVRHTIDCQPITAVYHNPKYPHLYVID-KPNGGKADSLNAGINLAHYPLISSIDADSLLE 181 Query: 170 RDAAAYIVEPMLYNP-RVGAVTGNPRIRT----------RSTLVGKI----QVGEY---- 210 +DA + + NP A+ G+ RI +L KI Q EY Sbjct: 182 KDALIRMARMYMENPEETVAIGGDVRIANGCMIENGAVKEVSLPRKIWPMFQATEYLKAF 241 Query: 211 -------SSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKL 263 SSI GLI VSG FR+ + VG + ED++I KL Sbjct: 242 LGGRIGWSSINGLI-----------IVSGAFGLFRKDYVIAVGGYRGGYPGEDMNIIIKL 290 Query: 264 QLN------QWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENF 317 ++ I + P A+CW P+T + L QR RW +G LKNM EN+ Sbjct: 291 HRYMLENKLKYRIAFCPEAVCWTQAPDTYRILSNQRKRWGRGN----LKNMI-----ENY 341 Query: 318 RM 319 M Sbjct: 342 DM 343 >UniRef50_C4Z6K0 Putative uncharacterized protein n=1 Tax=Eubacterium eligens ATCC 27750 RepID=C4Z6K0_EUBE2 Length = 465 Score = 96.7 bits (239), Expect = 2e-18, Method: Compositional matrix adjust. Identities = 73/307 (23%), Positives = 146/307 (47%), Gaps = 23/307 (7%) Query: 38 PFFMSIMWIVGGVYFWVYRE-RHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHA 96 PF M I+ +G V+ + R RH +L+ P I++IIP +N + I + Sbjct: 19 PFIMEIVPAIGSVFLLIRRNLRH-----KIHKKELEVYPEITLIIPVYNSADTLYGCIRS 73 Query: 97 ALAQRYEN--IEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAA 154 Y N I + VN+ D + I + + P L + + QGK+ AL + Sbjct: 74 INDSTYPNDRIRIFLVNNKGKDNSFDIYAQCQQEFPELLMQWMNSEQGKSRALNLALYNS 133 Query: 155 KSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGN-----PRIRTR----STLVGKI 205 +Y++ +D D +L+++A ++ + + +TG+ +I+ S L+ ++ Sbjct: 134 DGKYIINLDSDGMLEKNALVNMITRFENDTAINCMTGSILTVPEQIKKYKAGPSRLLREL 193 Query: 206 QVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQ- 264 + EY+ + +V+T+SG +AFR+SA+ + ++ D I ED I+++++ Sbjct: 194 EFMEYAQAFLAGRSYASELNSVYTLSGAFSAFRKSAVLKSWMYNTDTICEDTHITFQMRY 253 Query: 265 LNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFF 324 L + + AL ++ E + L+ QR RW +G EV ++++ ++F++ LF Sbjct: 254 LQKERVEVCEDALFFVDPIENVNKLYTQRQRWQRGSLEV-----SKMFMDKSFKVKNLFT 308 Query: 325 EYCLTTI 331 + T+ Sbjct: 309 NISVKTL 315 >UniRef50_B2UMM8 Glycosyl transferase family 2 n=3 Tax=Verrucomicrobia RepID=B2UMM8_AKKM8 Length = 505 Score = 96.7 bits (239), Expect = 2e-18, Method: Compositional matrix adjust. Identities = 65/264 (24%), Positives = 120/264 (45%), Gaps = 10/264 (3%) Query: 59 HWPWGENAPAPQ--LKDNPSISIIIPCFNEEKNVEETIHAALAQRY--ENIEVIAVNDGS 114 +W N P P+ ++ P +++ +P FNE+ V+ + + A Y + +E+ ++D + Sbjct: 34 YWKNRNNKPQPKARFQELPVVTVQLPMFNEKFVVDRLLESVAALDYPQDKLEIQILDDST 93 Query: 115 TDKTRAI---LDRMAAQIPHLRVIHLAQNQG-KAIALKTGAAAAKSEYLVCIDGDALLDR 170 D T ++ + ++ IH G KA AL+ AK E+L+ +D D + + Sbjct: 94 DDTTEQCYRKVEELKSRGFDAVCIHRTDRTGFKAGALEAATKVAKGEFLLILDADFVPEP 153 Query: 171 DAAAYIVEPMLYNPRVGAVTGN-PRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFT 229 D + + VG V I L+ +IQ + + + G FT Sbjct: 154 DLLQKTIH-FFTDENVGLVQTRWGHINREYNLLTRIQGMYLDGHFAMEQTARNRSGRFFT 212 Query: 230 VSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGL 289 +G +R+ + + G WS D +TED+D+S+++QL W Y + +P + G Sbjct: 213 FNGTAGIWRKCVIGDAGGWSHDTLTEDMDLSYRVQLRGWRFIYLNDVVTPAELPVDMDGF 272 Query: 290 WKQRLRWAQGGAEVFLKNMTRLWR 313 Q+ RW +G +V K + +WR Sbjct: 273 KSQQHRWTKGSIQVCQKILLDIWR 296 >UniRef50_B9MDW0 Glycosyl transferase family 2 n=1 Tax=Diaphorobacter sp. TPSY RepID=B9MDW0_DIAST Length = 441 Score = 96.7 bits (239), Expect = 2e-18, Method: Compositional matrix adjust. Identities = 74/258 (28%), Positives = 118/258 (45%), Gaps = 16/258 (6%) Query: 64 ENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYEN-IEVIAVNDGSTDKT---- 118 E+ P Q + P +S +I C++E ++V +TI + Q Y IE+IAV DG+ Sbjct: 54 EHRPLRQREAYPRVSCVITCYSEGEDVGKTIGSLAQQLYPGFIEIIAVIDGALQNGPTLR 113 Query: 119 ---RAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAY 175 RA Q L V+ Q G+ +L G + A E ++ +DGD D D Sbjct: 114 AARRAQSLLAGTQRRSLVVLPKWQRGGRVSSLNAGLSIATGEVVMALDGDTSFDNDMVRN 173 Query: 176 IVEPMLYNPRVGAVTGNPRIRT-RSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVI 234 +P V V GN R+R + +LV ++Q EY IG K + V +SG Sbjct: 174 ATR-HFDDPGVVGVAGNLRVRNAKRSLVTRLQALEYMLSIGAGKTGLSEFNIVNNISGAF 232 Query: 235 AAFRRSALAEVGYWSDDMITEDIDISWKLQL-----NQWTIFYEPRALCWILMPETLKGL 289 FR L +G W D ED+D++ +++ I ++P A+ P+T + Sbjct: 233 GVFRTRFLRNLGGW-DAGTAEDLDMTMRIKQYFGRHPDLRIVFDPHAVGHTDAPDTWRVF 291 Query: 290 WKQRLRWAQGGAEVFLKN 307 ++QRLRW +F++ Sbjct: 292 FRQRLRWDGDMFYIFIRK 309 >UniRef50_B8E0Z1 Glycosyl transferase family 2 n=1 Tax=Dictyoglomus turgidum DSM 6724 RepID=B8E0Z1_DICTD Length = 399 Score = 96.7 bits (239), Expect = 2e-18, Method: Compositional matrix adjust. Identities = 72/257 (28%), Positives = 126/257 (49%), Gaps = 27/257 (10%) Query: 73 DNPSISIIIPCFNEEKNVEETIHAALAQRY--ENIEVIAVNDGSTDKTRAILDRMAAQIP 130 D P +S+++P NEEK E ++A L Y + IE+I ++D STD+TR IL+ +++ P Sbjct: 22 DLPYVSVLVPMHNEEKVAENVLNALLNTDYPKDRIEIIPIDDNSTDRTREILEDYSSKYP 81 Query: 131 HL-----RVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDAL----LDRDAAAYIVEPML 181 HL R +L + GK AL A+ E ++ D D + + RD A ++P Sbjct: 82 HLIKPLYRGSYLPR--GKPSALNDALKVAEGEIIIVFDADYIPPKGIIRDLAVSFLDP-- 137 Query: 182 YNPRVGAVTGN--PRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTV---SGVIAA 236 VG V G P +++ L + IG + Q+ N+ + G + Sbjct: 138 ---EVGVVMGRVVPLNISKNLLTRLFDLER----IGGYQVDQQARYNLKLIPQFGGTVGG 190 Query: 237 FRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRW 296 FR+ + ++G ++ ++ ED +++ K +N + Y RA C+ PET + KQ RW Sbjct: 191 FRKELILKLGGFNPKILAEDTELTIKAYINGVKVCYTNRAECYEEAPETWEVRAKQIRRW 250 Query: 297 AQGGAEVFLKNMTRLWR 313 ++G +V + + L + Sbjct: 251 SRGHNQVMFRYLLPLIK 267 >UniRef50_A5GEA8 Glycosyl transferase, family 2 n=3 Tax=Deltaproteobacteria RepID=A5GEA8_GEOUR Length = 492 Score = 96.7 bits (239), Expect = 2e-18, Method: Compositional matrix adjust. Identities = 76/282 (26%), Positives = 123/282 (43%), Gaps = 18/282 (6%) Query: 44 MWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRY- 102 +W++ +Y ER P AP ++ PS+++ +P +NE E + AA + Sbjct: 28 LWLIYCLYMPKGSERSTP----APFAAPEEFPSVTVQLPLYNERFVAERLLDAAAGLDWP 83 Query: 103 -ENIEVIAVNDGSTDKTRAILDRMA---AQIPHLRVIHLAQNQG-KAIALKTGAAAAKSE 157 E +E+ ++D D R + R A Q + V+ G KA AL G A A E Sbjct: 84 RERLEIQVLDDSDDDTCRLVDQRAAWWRKQGVAITVVRRTSRDGYKAGALANGLATAHGE 143 Query: 158 YLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRS----TLVGKIQVGEYSSI 213 Y+ D D + D + P N VG V T + + +G + SI Sbjct: 144 YIAVFDADFIPPPDFL-HATMPWFRNQDVGMVQTRWSFCNADHSWFTGIQSLLLGPHFSI 202 Query: 214 IGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYE 273 ++ Q ++ F +G +RRSA+ G W D +TED+D+S++ QL W Y Sbjct: 203 EHRVRYRQGLF---FNFNGTAGVWRRSAIESAGGWQSDTVTEDLDLSYRAQLAGWRFVYR 259 Query: 274 PRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKE 315 +P T+ L Q+ RWA+G + K + RL ++ Sbjct: 260 EECQVPSELPVTMAALRSQQQRWAKGSIQTARKILPRLLQER 301 >UniRef50_C0A1N7 Glycosyl transferase, family 2 n=1 Tax=Opitutaceae bacterium TAV2 RepID=C0A1N7_9BACT Length = 498 Score = 96.3 bits (238), Expect = 2e-18, Method: Compositional matrix adjust. Identities = 70/245 (28%), Positives = 112/245 (45%), Gaps = 15/245 (6%) Query: 77 ISIIIPCFNEEKNVEETIHAALAQRY-----ENIEVIAVNDGSTDKTRAILDRMAAQIP- 130 + I +P +NE VE + A R+ E +E I + D STD+T AI++R A P Sbjct: 50 VCIQLPLYNESLVVEALLDKVSAIRWGNGGDETLE-IQILDDSTDETTAIIERWMAANPV 108 Query: 131 -----HLRVIHLAQNQG-KAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNP 184 H+ I G KA AL G ++E+ D D + D ++ P + Sbjct: 109 RVATAHISHIRRPNRHGYKAGALSYGMTLTEAEFFAIFDADFRPEPDFLEQLM-PHFADT 167 Query: 185 RVGAVTGNPRIRTR-STLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALA 243 ++G V R S+L+ + Q + + + + G F +G +RR AL Sbjct: 168 KIGVVQARWEFANRKSSLLTRFQGVFLDAHFVVEQEARYAAGLFFNFNGTAGIWRRRALD 227 Query: 244 EVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEV 303 E G W+DD +TED+D+S++ QL W Y +PE++ Q+ RW +GG +V Sbjct: 228 EAGGWTDDTVTEDLDVSYRAQLRGWKFIYRADYAVPSELPESMTAFKSQQRRWTKGGMQV 287 Query: 304 FLKNM 308 K + Sbjct: 288 MRKQI 292 >UniRef50_A7INQ0 Glycosyl transferase family 2 n=11 Tax=Rhizobiales RepID=A7INQ0_XANP2 Length = 905 Score = 96.3 bits (238), Expect = 2e-18, Method: Compositional matrix adjust. Identities = 93/328 (28%), Positives = 146/328 (44%), Gaps = 34/328 (10%) Query: 75 PSISIIIPCFNEEKN-VEETIHAALAQRYENIEVIAVNDGSTDKT--RAILDRMAAQIPH 131 P +SI +P + E +++TI A A Y N E I + + + D + + AA Sbjct: 434 PKVSIHVPAYREPPEMLKQTIDALAALEYPNFEAIIIINNTPDPAMVEPVREYCAALGER 493 Query: 132 LRVIHLAQNQG-KAIALKTG--AAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGA 188 + I+ + G KA AL+ A A +E + ID D ++ D +V P+ +P VG Sbjct: 494 FKFINAEKVAGFKAGALRIALDATAPDAEIIGVIDADYVVTPDWLKELV-PVFDDPTVGL 552 Query: 189 VTGNPRIRTRS-TLVGKIQVGEYSSI--IGLIKRTQRIYGNVFTVSGVIAAFRRSALAEV 245 V R +L+ + EY+ IG+++R + + V G + RR+A+ E Sbjct: 553 VQAPQDHRDADRSLLHEAMNAEYAGFFDIGMVQRNE---DDAIVVHGTMCLIRRAAMLEA 609 Query: 246 GYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEV-- 303 G WS D I ED D+ + N W Y + + L+P++ + KQR RWA GG ++ Sbjct: 610 GNWSSDTICEDTDLGLTIAENGWKTHYTRKRYGYGLLPDSFEAFKKQRHRWAYGGFQIIK 669 Query: 304 -----FLKNMTRLW--RKENFRM-WPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIE 355 FL N +RL +K +F + W W + VG ++ LA VP + Sbjct: 670 KHWRKFLPNRSRLTTAQKRHFVLGW---------ISWLGSESVGAVMAIASLAFVPFVLL 720 Query: 356 LTHIAATH--TAGILLCTLCLLQFIVSL 381 H T IL+ L L VSL Sbjct: 721 FGVSVPAHVLTLPILITFLVYLMHFVSL 748 >UniRef50_C7ZM19 Putative uncharacterized protein n=1 Tax=Nectria haematococca mpVI 77-13-4 RepID=C7ZM19_NECH7 Length = 332 Score = 95.9 bits (237), Expect = 3e-18, Method: Compositional matrix adjust. Identities = 74/288 (25%), Positives = 132/288 (45%), Gaps = 31/288 (10%) Query: 43 IMWIVGGVYFWVYRERHWPWGENAPAPQ-LKDN--PSISIIIPCFNEEKNVEETIHAALA 99 I+ I+ Y Y W W + P KDN P IS++IP FNE + + +I + + Sbjct: 40 ILEIISTFYKIEYLLMVWRWIKYKPTKAATKDNELPFISVVIPVFNESEFIHNSIESIVL 99 Query: 100 QRY--ENIEVIAVNDGSTDKTRAILDRMAAQIPH----LRVIHLAQNQGKAIALKTGAAA 153 Y + IE++ ++DGSTD T +++ IP R++ N GK A+++G + Sbjct: 100 SDYPKDKIELVIIDDGSTDDTWQHVNQAVKAIPQCGISYRLLQHPANLGKRQAIQSGFST 159 Query: 154 AKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGN--------PRIRTRSTLVGKI 205 A ++ +D D++L++ A IV P++ +P +GAV G+ P + + ++ Sbjct: 160 ALGSIIISVDSDSVLEKGALHRIVSPLMQDPAIGAVAGHLAVLNVSSPNTLSTKGFLPRL 219 Query: 206 QVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMIT----------- 254 Y I L + G V + G +AFR +A+ + D++ T Sbjct: 220 LDILYEHIGNLPRAALTAEGFVTILPGAFSAFRVNAVRD---HVDNLCTSTFLGSPLKHG 276 Query: 255 EDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAE 302 ED++++ + + W Y+ A+ PETL+ + RW E Sbjct: 277 EDMELAHHILKDGWKTVYQSNAVVHTTAPETLQKAFLMFSRWEHLSQE 324 >UniRef50_A0QPQ1 Chitin synthase n=1 Tax=Mycobacterium smegmatis str. MC2 155 RepID=A0QPQ1_MYCS2 Length = 431 Score = 95.5 bits (236), Expect = 4e-18, Method: Compositional matrix adjust. Identities = 69/236 (29%), Positives = 115/236 (48%), Gaps = 13/236 (5%) Query: 75 PSISIIIPCFNEEKNVEETIH-AALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLR 133 P ++II+P +NE +H + E + V+ V+D S+D T + AA+ P+L Sbjct: 86 PVVAIIVPAYNEPDIARTLLHCVGVDYPRELLRVVVVDDKSSDDTLRRITDFAAEHPNLT 145 Query: 134 VIHLAQNQGKAIALKTG-AAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGN 192 VI N GK A+ TG AAA ++ V ID D+ + RDA ++ + VGAV G+ Sbjct: 146 VIPHEVNGGKRRAMATGMAAADDADLFVFIDSDSQVTRDAIR-VIASYFADSSVGAVCGH 204 Query: 193 PRIRTRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVG-YWSD 250 + S ++ ++Q +Y + K + ++G+V SG +A+R A+ V W + Sbjct: 205 TDVTNISHNILTRMQAMQYYIAFRIYKSAEALFGSVTCCSGCFSAYRADAVRPVADKWLN 264 Query: 251 -------DMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQG 299 +D ++ L L W + Y P A + +PE LK +Q+LRW + Sbjct: 265 QTFLGRPSTFGDDRSLTNYL-LRDWRVLYAPDAQAYTNVPEHLKQFLRQQLRWKKS 319 >UniRef50_C7TCU0 Phage-related glycosyl transferase, group 2 n=1 Tax=Lactobacillus rhamnosus GG RepID=C7TCU0_LACRG Length = 249 Score = 95.5 bits (236), Expect = 4e-18, Method: Compositional matrix adjust. Identities = 63/194 (32%), Positives = 107/194 (55%), Gaps = 8/194 (4%) Query: 74 NPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLR 133 +PSIS+I+P FNEEK + +T+ + Y+ I ++ V+DGS+D T AIL + ++ Sbjct: 57 SPSISVIVPAFNEEKAIIQTLASICRSTYKKISIVVVDDGSSDNT-AILVKEFIEVHPDN 115 Query: 134 VIHLAQ--NQGKAIALKTGAAAAKSEYL-VCIDGDALLDRDAAAYIVEPMLYNPRVGAVT 190 I L Q N GK++A+ + L + +D D+LL DA +V+ L NP++ A+ Sbjct: 116 DIKLIQQPNSGKSVAINNALFNYDNSVLTMVLDADSLLKEDAIEKMVKWFL-NPKIVALA 174 Query: 191 GNPRIRTRSTLVGKIQVGEY-SSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWS 249 N ++ T T +G Q E+ S+ G K + + ++ + G+ + FR L +G + Sbjct: 175 MNVKMLTLPTFIGACQRFEFISAYRG--KCAEHVLKTLYIIGGIGSTFRTDKLKNIGGYD 232 Query: 250 DDMITEDIDISWKL 263 + TEDID++ KL Sbjct: 233 TNTPTEDIDLTLKL 246 >UniRef50_A9KYN4 Glycosyl transferase family 2 n=24 Tax=Gammaproteobacteria RepID=A9KYN4_SHEB9 Length = 443 Score = 95.1 bits (235), Expect = 5e-18, Method: Compositional matrix adjust. Identities = 86/281 (30%), Positives = 129/281 (45%), Gaps = 27/281 (9%) Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRYEN-IEVIAVNDGSTDKT---RAILDRMAAQIP 130 P +S +I C+ E + V+ TI + + Q Y+ IE+IAV DG+ R + A Sbjct: 67 PKVSCVITCYAEGEAVKSTIDSFIEQVYDGEIEIIAVVDGAVQNGLTYRVAMAAAKACQA 126 Query: 131 HLRVIHLA---QNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVG 187 H R + + Q G+ L G A A E ++ D D D D + IV P +P V Sbjct: 127 HNRTVIVLPKWQRGGRVSTLNAGLATATGEIVINADADTSFDNDMVSQIV-PHFEDPNVP 185 Query: 188 AVTGNPRIRTRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVG 246 AV G R+R S ++ ++Q EY + K + + +SG AFRR+ L ++G Sbjct: 186 AVGGALRVRNVSESIFTRMQAIEYLISMQGGKTGLGQWNLLNNISGAFGAFRRTFLIQIG 245 Query: 247 YWSDDMITEDIDISWKLQL-----NQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGA 301 W D ED+D++ +++ W I + A+ P LK L QRLRW G Sbjct: 246 GW-DTHTAEDLDLTIRIKQYFKRHPDWRIPFATLAIGHTDAPADLKTLVWQRLRW--DGD 302 Query: 302 EVFLKNMTRLWRKENFRMWPLFFEYCL-TTIWAFTCLVGFI 341 +FL +RK WP F L + + FT L GF+ Sbjct: 303 LLFL-----YFRKH----WPAFTPKLLGSGTFVFTLLYGFL 334 >UniRef50_D1JE47 Conserved hypothetical membrane protein, glycosyl transferase family 2 n=1 Tax=uncultured archaeon RepID=D1JE47_9ARCH Length = 392 Score = 94.7 bits (234), Expect = 5e-18, Method: Compositional matrix adjust. Identities = 75/278 (26%), Positives = 128/278 (46%), Gaps = 20/278 (7%) Query: 74 NPSISIIIPCFNEEKNVEETIHAALAQRYEN-IEVIAVNDGSTDKTRAILDRMAAQIPHL 132 P+++++IP +NEEK + +TI++ Q Y N ++++ V+D STDKT I A+ Sbjct: 36 KPTVTVLIPAYNEEKTIADTINSIRQQTYSNVVQILVVDDSSTDKTDEIAREGDAE---- 91 Query: 133 RVIHLAQNQG-KAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTG 191 VI N G KA A K E ID D +LD +A +V L P + + Sbjct: 92 -VIRTPSNSGTKAQAQNFALPYLKGELTATIDADTILDVEAIDTMVLRFLKEPDIASTCS 150 Query: 192 --NPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWS 249 P++R +G+ EY + + K+TQ G SG + F S L + G+ Sbjct: 151 FVVPQVRKNFWELGR--TVEYIYGLFMRKKTQEYLGIPIVCSGCFSLFNTSLLQKYGF-K 207 Query: 250 DDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMT 309 D + ED+D++W+L + + A+C+ P++ K Q RW + + + Sbjct: 208 DRTMAEDMDLTWQLLAEGKKVRFCADAICFPKDPDSWKTYKGQVSRWLRSFFQNLSVHKK 267 Query: 310 RLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQL 347 +W+ N ++ + Y + +G I YA+ L Sbjct: 268 DIWK--NKKLAVFLYSYLIEG------FMGVIFYALLL 297 >UniRef50_B1YJV4 Glycosyl transferase family 2 n=4 Tax=Bacillales RepID=B1YJV4_EXIS2 Length = 482 Score = 94.7 bits (234), Expect = 6e-18, Method: Compositional matrix adjust. Identities = 89/368 (24%), Positives = 158/368 (42%), Gaps = 57/368 (15%) Query: 56 RERHWPWGENAPAPQLKD--NPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDG 113 R++ + E++ P ++ +S+++P +NEE + ++ A LA Y EVI +NDG Sbjct: 41 RKQRFRRKESSYLPLMRSLYTRPVSVLVPAYNEEYTIVTSVQALLALDYPEHEVIVINDG 100 Query: 114 STDKTRAILDRMAAQIPHLR----------VIHL-------------AQNQGKAIALKTG 150 S+D T L + LR V H+ +N GKA AL G Sbjct: 101 SSDGTMDELFKTYELEKELRFPRILIGSQPVKHVYRSRIDARLIVIDKENGGKADALNAG 160 Query: 151 AAAAKSEYLVCIDGDALLDRDAAAYIVEPML-YNPRVGAVTGN--------------PRI 195 + Y +D D++L+RDA +++P++ + V A G+ I Sbjct: 161 IDYSSYPYFCSLDADSILERDALLKVMKPIVESDEDVIATCGSIFLVNGCTIEGGEIVEI 220 Query: 196 RTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITE 255 RT + +Q+ EY + + + VSG F+++ + + G + + ++ E Sbjct: 221 RTPKQRIVLMQMIEYFRAFLFGRLGLSRFNMLLIVSGAFGLFQKAIVVQAGGYKEGLVGE 280 Query: 256 DIDISWKL----QLNQWT--IFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVF--LKN 307 D+++ +L + N+ I Y P +CW P+T++ L QR RW +G AE K Sbjct: 281 DMELVVRLHRYMKENKIDKRIEYVPSPVCWTQAPDTMEVLRSQRSRWHRGLAETLWTHKR 340 Query: 308 MTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGI 367 M R + + + + + L+G+ I + L LN E G+ Sbjct: 341 MIGNPRYGFVGSVSMLYFFLIELLGPIVELIGYFIIPLNLLVGTLNFEF---------GV 391 Query: 368 LLCTLCLL 375 +L T+ LL Sbjct: 392 VLLTVALL 399 >UniRef50_UPI000038DF5C N-acetylglucosaminyltransferase n=1 Tax=Ferroplasma acidarmanus fer1 RepID=UPI000038DF5C Length = 405 Score = 94.4 bits (233), Expect = 8e-18, Method: Compositional matrix adjust. Identities = 69/250 (27%), Positives = 113/250 (45%), Gaps = 15/250 (6%) Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRV 134 P +SII+P NEE ++ TI + L Q Y N E+ V D S+D T I ++ + V Sbjct: 56 PMVSIIVPAKNEETVIKRTIESILNQTYTNFELFVVVDNSSDNTYKIAKEYESRDKRVNV 115 Query: 135 IHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLY-NPRV-----GA 188 + + KA AL K E + D D +L + V M Y N V Sbjct: 116 FNRPDGKSKASALNFCFEKTKGEVIATYDADTMLLPNTLENAVYGMNYFNVDVLQGYNSY 175 Query: 189 VTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFT-VSGVIAAFRRSALAEVGY 247 + I TR ++ +I V +++IG R + N+F V+G F+R + +G Sbjct: 176 INREENIFTRLAVIDEILVK--ATLIG------RTHFNLFVPVAGSNQYFKRKVIESIGG 227 Query: 248 WSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKN 307 W D+ +TED++ S ++ ++ Y A P + ++QR RW +G +VF + Sbjct: 228 WDDNFLTEDLESSIRISNARYKSAYLGSAKALQETPASYSEYFRQRTRWLRGYHQVFFHS 287 Query: 308 MTRLWRKENF 317 R + +F Sbjct: 288 KKRFSKFTDF 297 >UniRef50_Q110Z2 Glycosyl transferase, family 2 n=15 Tax=Cyanobacteria RepID=Q110Z2_TRIEI Length = 502 Score = 94.4 bits (233), Expect = 8e-18, Method: Compositional matrix adjust. Identities = 105/390 (26%), Positives = 167/390 (42%), Gaps = 51/390 (13%) Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRY--ENIEVIAVNDGSTDKTRAILDRMAAQIPHL 132 P IS+++ NEE + + + LA Y + E+ ++D STDKT +L+++A + L Sbjct: 126 PYISLLVAAKNEEAVIRKLVKNMLALDYPTNSYELWVIDDNSTDKTPLLLEQLAREYEQL 185 Query: 133 RVIHLAQNQG--KAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVT 190 +VI + + G K+ AL K + L D DA + D +V P+ VGAV Sbjct: 186 KVIRRSPDAGGGKSGALNAAIPFVKGKILGVFDADAQVTPDLLQKVV-PLFAREEVGAV- 243 Query: 191 GNPRIRTRSTLVG-----KIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEV 245 +IR G K Q E + G + + G + + G R +AL E Sbjct: 244 ---QIRKAIANAGINFWTKGQSAEMV-VDGFFQEQRIAIGGIGELRGNGQFVRMNALEEC 299 Query: 246 GYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFL 305 G W++ IT+D+D++ +L LNQW I Y + LW QR RWA+GG + +L Sbjct: 300 GGWNEQTITDDLDLTIRLHLNQWDIDYLAFPAVTEEGVTSPIALWHQRSRWAEGGYQRYL 359 Query: 306 KNMTRLWRKENFRMWPLFFEYCLT-TIWAFTCLVGFIIYAVQLAGVP---LNIELTHIAA 361 + R RM + T +W F Y + +A VP ++I L + Sbjct: 360 DYWKLILRN---RM-----RFSKTWDLWQFLV----TQYLLSVAAVPDFLMSIILRRLPI 407 Query: 362 THTAGILLCTLCLLQFIVSLMIENRYEHNLTS---------------SLFWIIWFPV--I 404 T + + LL + L + + NL SLF + V Sbjct: 408 TSPLTVFTVMVSLLGMFIGLRRTRKQQMNLAKEEKVMEFNSSKDNPLSLFLTLLESVRGT 467 Query: 405 FWMLSLATTL-VSFTRVMLMPKKQRARWVS 433 F+ML + V+ R+ ++PK R +WV Sbjct: 468 FYMLHWFVVMGVTIARMSILPK--RLKWVK 495 >UniRef50_C0ZQD3 Putative glycosyltransferase n=2 Tax=Rhodococcus RepID=C0ZQD3_RHOE4 Length = 634 Score = 94.0 bits (232), Expect = 1e-17, Method: Compositional matrix adjust. Identities = 68/247 (27%), Positives = 111/247 (44%), Gaps = 5/247 (2%) Query: 74 NPSISIIIPCFNEEKNVEETIHA--ALAQRYEN-IEVIAVNDGSTDKTRAILDRMAAQIP 130 +P +I++P NE ++ TI A A+ Y I + V++ STD T + +++ A+ Sbjct: 252 SPPTAILVPAHNEAHDIAATIAAVDTAARTYPGRIHLYVVDNASTDNTHEVAEQIIAECS 311 Query: 131 HLRVIHL-AQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAV 189 + L GKAIAL G E++ ID D ++ + NPRVG V Sbjct: 312 FITGTVLDCPTPGKAIALNMGIDHITEEFVARIDADTVIGPGCLETAMR-HFANPRVGCV 370 Query: 190 TGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWS 249 G P R T + K ++ E G + + Y V + G+ A +RRS L +VG Sbjct: 371 GGIPMPREEKTWIDKCRLIEVYMRHGFFQVSLDGYQGVMGIPGMFAVYRRSQLIKVGGMV 430 Query: 250 DDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMT 309 M ED DI +L + +P+A+ + P + L +QR RW + + +N Sbjct: 431 QGMNGEDTDICMRLTAAGYHSVADPKAVYYSETPASYAHLLEQRTRWFRSIYHLAARNRA 490 Query: 310 RLWRKEN 316 L ++ Sbjct: 491 LLLDRQT 497 >UniRef50_C0QGC2 NodC n=2 Tax=Desulfobacterium autotrophicum HRM2 RepID=C0QGC2_DESAH Length = 466 Score = 94.0 bits (232), Expect = 1e-17, Method: Compositional matrix adjust. Identities = 62/236 (26%), Positives = 118/236 (50%), Gaps = 12/236 (5%) Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRY--ENIEVIAVNDGSTDKTRAILDRMAAQIPH- 131 P ++I+P FNE V +T+ + +A Y + +++IAV+DGS D T + + ++ Sbjct: 83 PHCTVIVPAFNEGVQVYKTLKSLVASDYPKKKLQLIAVDDGSQDDTWDWIKKAKQELGKG 142 Query: 132 LRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTG 191 L++I L QNQGK AL +G + L+ +D D++++ ++V P + + RVGAV G Sbjct: 143 LKIIRLQQNQGKRHALYSGFVKSTGSILITVDSDSMVEPLTIRHMVAPFVKDARVGAVAG 202 Query: 192 NPRIRTRST-LVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEV-GYWS 249 N R+ R ++ ++ + ++ +Q + V G ++A+R+S + +V W Sbjct: 203 NVRVLNREKGMIPRMLDVAFVYGFDFMRASQSMVNTVMCTPGALSAYRKSVVIQVLPEWL 262 Query: 250 DDM-------ITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQ 298 I ED ++ + + ++ A+ + +P T L K LRW + Sbjct: 263 HQTYCGRPANIGEDRAMTNLILREGNHVLFQQNAMVYTEIPVTYTKLCKMYLRWGR 318 >UniRef50_C6MI20 Glycosyl transferase family 2 n=1 Tax=Nitrosomonas sp. AL212 RepID=C6MI20_9PROT Length = 441 Score = 93.6 bits (231), Expect = 1e-17, Method: Compositional matrix adjust. Identities = 70/236 (29%), Positives = 112/236 (47%), Gaps = 16/236 (6%) Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRYE-NIEVIAVNDGS-----TDKTRAILDRMAAQ 128 P ++ I C+ E K V+ T+ + L Q Y +IE+IA+ DG+ T + L + ++ Sbjct: 65 PHVTCAITCYAEGKAVQSTVISLLEQVYSGHIEIIAIVDGAKRNQATYQALVELISLVSR 124 Query: 129 IPH--LRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRV 186 P L +I Q G+ +L G A+ E +DGD D + NP + Sbjct: 125 YPKRSLVIIPKFQRGGRVSSLNAGLNRARGEIFFALDGDTSFDNQMVHH-AATHFRNPGM 183 Query: 187 GAVTGNPRIRTRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEV 245 AVTG R+R T++ ++Q EY + + K G + V G FR+S L +V Sbjct: 184 VAVTGPMRVRNAGKTIMTRLQSLEYMLTMQVGKLGFANLGVINNVPGAFGIFRKSFLQQV 243 Query: 246 GYWSDDMITEDIDISWKLQ-----LNQWTIFYEPRALCWILMPETLKGLWKQRLRW 296 G W+ ED+D++ +++ I +EPRA+ +PE+ KQRLRW Sbjct: 244 GGWNTGT-AEDLDLTLRIKQYYKRYPHLQIEFEPRAVSHTDVPESFMDFLKQRLRW 298 >UniRef50_D1U9V0 Hyaluronan synthase n=1 Tax=Desulfovibrio aespoeensis Aspo-2 RepID=D1U9V0_9DELT Length = 484 Score = 93.2 bits (230), Expect = 2e-17, Method: Compositional matrix adjust. Identities = 88/316 (27%), Positives = 143/316 (45%), Gaps = 31/316 (9%) Query: 7 SFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRE--------- 57 F + L++ I L V F+S + R +S +W GVYF V Sbjct: 39 GFLAISLIVLISLLV--FNSLVYVGRLHLSLAMPLSTLW-RAGVYFLVVMNLAMLAWRVL 95 Query: 58 ---RHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRY--ENIEVIAVND 112 R+ P A A +LK S ++IIP +NE V ET+ + ++ + + ++VI V+D Sbjct: 96 LVLRYRPAAPCADA-ELK---SCTVIIPAYNEGPQVLETLRSVMSSDFPADRLQVICVDD 151 Query: 113 GSTDKTRAILDRMAAQI-PHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRD 171 GS D T + + + + +I NQGK AL G A E LV ID D++++ Sbjct: 152 GSKDDTWLWMVKGGEEFGDRVELIRSPMNQGKRHALNHGFLRATGEVLVTIDSDSVIEPT 211 Query: 172 AAAYIVEPMLYNPRVGAVTGNPRI-RTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTV 230 +V NPRVGAV GN R+ T ++ K+ ++ I+ Q + V Sbjct: 212 TLRSMVSVFCRNPRVGAVAGNVRVLNTGEGVIPKMLDVAFTYSFDFIRAAQSTFNTVMCT 271 Query: 231 SGVIAAFRRSALAEV-GYWSDDM-------ITEDIDISWKLQLNQWTIFYEPRALCWILM 282 G ++A+ + +A+V W I ED ++ + Q + ++ A+ + + Sbjct: 272 PGALSAYDAALVAKVREKWLHQTFMGQPANIGEDRAMTNLILKEQRLVHFQSDAVVFTNV 331 Query: 283 PETLKGLWKQRLRWAQ 298 P KGL K LRWA+ Sbjct: 332 PVEYKGLCKMMLRWAR 347 >UniRef50_UPI0001C31B0A glycosyl transferase family 2 n=1 Tax=Conexibacter woesei DSM 14684 RepID=UPI0001C31B0A Length = 495 Score = 93.2 bits (230), Expect = 2e-17, Method: Compositional matrix adjust. Identities = 71/246 (28%), Positives = 119/246 (48%), Gaps = 13/246 (5%) Query: 69 PQLKDNPSISIIIPCFNEEKNVEETIHAALAQRY--ENIEVIAVNDGSTDKTRAILDRMA 126 P+ P ++++IP +NE + T+ LA Y + + + V+D STD T ++ A Sbjct: 40 PRAACYPRVALVIPAWNEAAVIGRTLDRLLALEYPGDRLRIYVVDDASTDATPELVRAKA 99 Query: 127 AQIPHLRVIHLAQ---NQGKAIALKTGAAAAKSE----YLVCIDGDALLDRDAAAYIVEP 179 + P R+ HL + +GKA + G ++E ++ ID D + + ++ Sbjct: 100 EEHPG-RIFHLRRERGGEGKAHTINHGLRTIRAEGWYEAVLVIDADVIFTPPSLRWMTR- 157 Query: 180 MLYNPRVGAVTGNPRIRTR-STLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFR 238 L +P VGAVTG + +R + + + EY + +R Q + G ++G R Sbjct: 158 HLADPGVGAVTGYIKEGSRPGSWMTRFVAFEYVTAQAAARRAQNVLGAQACLAGGAQLLR 217 Query: 239 RSALAEVGYWSDD-MITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWA 297 R +L VG D + ED + +QL + +EP A+ W P T+ GLWKQRLRW Sbjct: 218 RESLEAVGGEIDTATLAEDTVTTLGVQLAGGRVVFEPYAIVWAEEPRTVTGLWKQRLRWG 277 Query: 298 QGGAEV 303 +G +V Sbjct: 278 RGNVQV 283 >UniRef50_A4CJ64 Glycosyltransferase n=15 Tax=Bacteroidetes RepID=A4CJ64_9FLAO Length = 494 Score = 93.2 bits (230), Expect = 2e-17, Method: Compositional matrix adjust. Identities = 71/266 (26%), Positives = 122/266 (45%), Gaps = 17/266 (6%) Query: 65 NAPAPQL-----KDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEV-IAVNDGSTD-- 116 N AP+ K+ P ++I +P +NEE VE + Y ++ I V D STD Sbjct: 41 NKEAPKFNLLDPKEIPYVTIQLPIYNEEYVVERLLENIARIEYPKSKLEIQVLDDSTDDS 100 Query: 117 --KTRAILDRMAAQIPHLRVIHLAQNQG-KAIALKTGAAAAKSEYLVCIDGDALLDRDAA 173 +T A+++ + Q ++ I +G KA ALK G AK +++ D D L D D Sbjct: 101 VEQTAAMIEELQKQGLDIQHIRRENREGFKAGALKEGLKIAKGDFIAIFDADFLPDAD-- 158 Query: 174 AYIVEPMLY--NPRVGAVTGN-PRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTV 230 ++ + ++Y + +G V I + + KIQ + L + + G+ Sbjct: 159 -WLKKTVIYFKDEEIGVVQTRWGHINRDYSTLTKIQAFALDAHFTLEQVGRNSKGHFINF 217 Query: 231 SGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLW 290 +G +R+ + + G W D +TED+D+S++ QL W Y +P + Sbjct: 218 NGTAGIWRKECILDAGNWEGDTLTEDLDLSYRAQLKNWKFKYLEDVETPAELPVVISAAR 277 Query: 291 KQRLRWAQGGAEVFLKNMTRLWRKEN 316 Q+ RW +GGAE F K + + + +N Sbjct: 278 SQQFRWNKGGAENFRKTVWNVVKAKN 303 >UniRef50_C7QK51 Glycosyl transferase family 2 n=1 Tax=Catenulispora acidiphila DSM 44928 RepID=C7QK51_CATAD Length = 658 Score = 93.2 bits (230), Expect = 2e-17, Method: Compositional matrix adjust. Identities = 71/249 (28%), Positives = 116/249 (46%), Gaps = 10/249 (4%) Query: 63 GENAPAPQLKDNPSISIIIPCFNEEKNVEETIHA--ALAQRYEN-IEVIAVNDGSTDKTR 119 G +P P + P + +IIP FNE +E T+ + A+ Y + VI +DGS D TR Sbjct: 267 GPGSPRPD-ERRPPLEVIIPAFNEAVVIERTLRSIDRAARAYGGPVRVILCDDGSMDDTR 325 Query: 120 AILDRMAAQIPHL--RVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIV 177 A+ + A+ + VI ++ GK++AL A +++Y+ ID D LD +A Y V Sbjct: 326 ALAEATIAEFGYATGEVIQ-GRHAGKSVALNLALAECEADYVYRIDADCALDANAFVYSV 384 Query: 178 EPMLYNPRVGAVTGN--PRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIA 235 L +P VG V P+ +T + +++ E G + V V G Sbjct: 385 PHFLKHPDVGLVGAFVLPK-EPYTTWIDRMRALEMIFSFGFARVMLAEVDAVPCVPGTFC 443 Query: 236 AFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLR 295 AFRR +G + + ED+ + + + +PR + + +P T++ L QR R Sbjct: 444 AFRRLPALAIGGFVHGTLGEDVFFTCSMARLGYRAAIDPRVISYEDVPTTVRQLRVQRFR 503 Query: 296 WAQGGAEVF 304 W +GG F Sbjct: 504 WCKGGIMSF 512 >UniRef50_C7M4N0 Glycosyl transferase family 2 n=4 Tax=Flavobacteria RepID=C7M4N0_CAPOD Length = 494 Score = 92.8 bits (229), Expect = 2e-17, Method: Compositional matrix adjust. Identities = 71/271 (26%), Positives = 123/271 (45%), Gaps = 15/271 (5%) Query: 58 RHWPWGENAPAPQL---KDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEV-IAVNDG 113 +H + AP L K+ P ++I +P +NE+ V + Y ++ I V D Sbjct: 36 KHRKQNDEAPKFNLLDPKEIPYVTIQLPLYNEKYVVRRLLENIAKLEYPRTKLEIQVLDD 95 Query: 114 STDKTRA----ILDRMAAQIPHLRVIHLAQNQG-KAIALKTGAAAAKSEYLVCIDGDALL 168 STD++ A I+ + A ++ I +G KA ALK G A AK +++ D D + Sbjct: 96 STDESVAETAEIIKELQATGLDIKHIRRTNREGFKAGALKEGLAIAKGDFVAIFDADFMP 155 Query: 169 DRDAAAYIVEPMLY--NPRVGAVTGN-PRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYG 225 D ++ ++Y +P +G V I +++ KIQ L + + G Sbjct: 156 QPD---WLKRTVVYFKDPEIGVVQTRWGHINRNYSILTKIQALALDVHFTLEQVGRNSKG 212 Query: 226 NVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPET 285 +G +R++ + + G W D +TED+D+S++ QL W Y +P Sbjct: 213 YFINFNGTAGIWRKACIYDAGNWEGDTLTEDLDLSYRAQLKNWKFKYLEDVETPAELPVV 272 Query: 286 LKGLWKQRLRWAQGGAEVFLKNMTRLWRKEN 316 + Q+ RW +GGAE F K ++R+ +N Sbjct: 273 ISAARSQQFRWNKGGAENFRKTVSRVLAAKN 303 >UniRef50_B2GJB7 Putative glycosyltransferase n=2 Tax=Micrococcineae RepID=B2GJB7_KOCRD Length = 481 Score = 92.8 bits (229), Expect = 2e-17, Method: Compositional matrix adjust. Identities = 73/303 (24%), Positives = 122/303 (40%), Gaps = 47/303 (15%) Query: 43 IMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRY 102 +M ++ V F +R R P+ + + P IS++ P NEE + + + L+ ++ Sbjct: 35 VMIVLAAVDFVAHR-RRIPFAGREELIRSRTMPGISVVAPMHNEEVGIRVAVESFLSLQH 93 Query: 103 ENIEVIAVNDGSTDKTRAILDRMAAQIPHLR-------------VIHL------------ 137 EVI V+DGSTD +L R +P R IH+ Sbjct: 94 PRHEVIVVDDGSTDNGFEVLRRAFDLVPVSRQMPEDVPVREAPTSIHVPRDGRTRLTVVR 153 Query: 138 AQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNP---- 193 +N G++ ++ G A+ + ++ +D D++LD A + P + +P G Sbjct: 154 KRNSGRSDSINVGVNLAREDLVLFVDSDSILDPGALLAVSRPFIEDPISTVAAGGVIRAV 213 Query: 194 -----------RIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSAL 242 +R + IQV EY L + + +SG FRR AL Sbjct: 214 NGCRVKGGRVLEVRMPGNPLADIQVMEYLRAFHLGRAGWSRINALLLISGAFGVFRRDAL 273 Query: 243 AEVGYWSDDMITEDIDISWKL------QLNQWTIFYEPRALCWILMPETLKGLWKQRLRW 296 VG I ED ++ +L Q + + + +CW +P TL L +QR RW Sbjct: 274 VTVGGLDASSIGEDFELVMRLHRTFRKQRRPYRVTFLTEPICWTEVPSTLPVLRRQRARW 333 Query: 297 AQG 299 +G Sbjct: 334 HRG 336 >UniRef50_D2S449 Glycosyl transferase family 2 n=1 Tax=Geodermatophilus obscurus DSM 43160 RepID=D2S449_9ACTO Length = 484 Score = 92.8 bits (229), Expect = 2e-17, Method: Compositional matrix adjust. Identities = 76/283 (26%), Positives = 122/283 (43%), Gaps = 52/283 (18%) Query: 74 NPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKT----------RAILD 123 P +S++IP +EE + +T+ A LAQRY +EV+ V+DGSTD+T + Sbjct: 59 TPGVSVLIPAHDEEAGILDTLAAVLAQRYPVLEVVLVDDGSTDRTFDMVAERFGLVQVTA 118 Query: 124 RMAAQIP---------------HLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALL 168 R A +P L V+ ++ AL A+ + +D D+LL Sbjct: 119 RPTAALPVDGTVLSVHRATTGDPLTVVRKTSVGRRSDALNAALNVARHPLVCMVDADSLL 178 Query: 169 DRDAAAYIVEPMLYNPRVGAVTGN-----------------PRIRTRSTLVGKIQVGEYS 211 + +A + P + +P +G PR+ +R V + QV EY Sbjct: 179 EPEALVQVARPFVEDPANVVASGGVIRAANGARVARGSVLEPRLSSR--WVVRFQVVEYL 236 Query: 212 SIIGLIKRTQRIYGN-VFTVSGVIAAFRRSALAEVGYWSDDMITEDIDI------SWKLQ 264 L+ RT N + +SG FRR + EVG + ED ++ + Q Sbjct: 237 RSF-LLGRTGWADANALLIISGAFGLFRRDLVVEVGGLDALSLAEDAELVVTVQEHLRRQ 295 Query: 265 LNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKN 307 + + P +CW +PETL L +QR RW+ G A++ K+ Sbjct: 296 RRPHRVVFVPEPVCWTEVPETLAVLGRQRRRWSHGLAQLLWKH 338 >UniRef50_B7HT47 Glycosyl transferase family 2 n=8 Tax=Bacteria RepID=B7HT47_BACC7 Length = 1184 Score = 92.8 bits (229), Expect = 2e-17, Method: Compositional matrix adjust. Identities = 73/291 (25%), Positives = 120/291 (41%), Gaps = 51/291 (17%) Query: 77 ISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILD------------R 124 +SII+P +NE + ++ + L Y EVI VNDGS+D T L R Sbjct: 61 VSIIVPAYNEGVTIVSSVQSLLTLEYPEFEVIVVNDGSSDDTLEQLKDHFQLYEIQNVVR 120 Query: 125 MAAQIPHLRVIHLA-----------QNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAA 173 + + +R + + +N GKA AL G + Y+ +D D+LL+RDA Sbjct: 121 LQLETETIRKTYRSSVNKQIIVVDKENGGKADALNAGINISNYPYVCSLDADSLLERDAL 180 Query: 174 AYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGEY---SSIIGLIKRTQRIYGNVFT- 229 ++P+ +P VTG + + Q+ E + L++ + + G +F Sbjct: 181 MKAMKPIYESPEKVMVTGGSVRIVNGSYIQNGQMIENRLPKQPLALMQIIEYLRGFLFGR 240 Query: 230 -----------VSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLN------QWTIFY 272 +SG F + + VG + + ED+++ L + I Y Sbjct: 241 LAWSKYNILPIISGAFGIFDKGEVIRVGGYQRKTVGEDMELVVHLHKKALQDGEEKKIIY 300 Query: 273 EPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLF 323 P A+CW P+ L KQR RW +G E LWR ++ P + Sbjct: 301 NPNAICWTQAPDDLTTFRKQRSRWHRGLGET-------LWRHKDILFRPKY 344 >UniRef50_Q7NHH7 Glr2559 protein n=1 Tax=Gloeobacter violaceus RepID=Q7NHH7_GLOVI Length = 426 Score = 92.8 bits (229), Expect = 3e-17, Method: Compositional matrix adjust. Identities = 96/376 (25%), Positives = 150/376 (39%), Gaps = 36/376 (9%) Query: 66 APAPQLKDNPS----ISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKT--R 119 A +P+ K +P+ +S+++ NE+ + + Y + EV +DGSTD+T R Sbjct: 53 ALSPRPKSDPAYRPRVSVLVAAKNEQAVAAQLVAMLRRLDYPDFEVWIADDGSTDRTYQR 112 Query: 120 AILDRMAAQIPHL-RVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVE 178 + Q HL R I GK+ L A + LV D DA ++ D + V Sbjct: 113 LLEAGRGWQALHLVRRIPERSRPGKSAVLNELRERATGDILVVFDADARVEPDFLSRTV- 171 Query: 179 PMLYNPRVGAVTGNPRIR-------TRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVS 231 P+ VGA+ R+ TR + Y I T + GN V Sbjct: 172 PLFAVSSVGALQVRKRVHNADFNFWTRGQSAEMLLDAFYQQQRAAIGGTAELRGNGQLV- 230 Query: 232 GVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWK 291 R +AL VG W++ +T+D+D++ +L L W I + T LW+ Sbjct: 231 ------RAAALEAVGGWNEATVTDDLDLTLRLHLGGWQIAFASDPCVDEEGVTTWSALWR 284 Query: 292 QRLRWAQGGAEVFLKNMTRLWRKENFRMWPLF-FEYCLTTIWAFTCLVGFIIYAVQLAGV 350 QR RWA+GG + +L RL+ + +C V +++A Q Sbjct: 285 QRSRWAEGGFQRYLDYAPRLFGGAMGTTKTVDQLIFCTIQYLMPVAAVLDLLFAFQRGAA 344 Query: 351 PLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSL 410 PL L +A T +C Q + + L +++++ WFPVI Sbjct: 345 PLLTPLVLVATVFT----VCGFYFGQRERGVQVGRAMLETLAGTIYFLHWFPVI------ 394 Query: 411 ATTLVSFTRVMLMPKK 426 LV R L PKK Sbjct: 395 ---LVKLARTALEPKK 407 >UniRef50_Q03RZ5 Glycosyltransferase n=3 Tax=Lactobacillus RepID=Q03RZ5_LACBA Length = 374 Score = 92.8 bits (229), Expect = 3e-17, Method: Compositional matrix adjust. Identities = 79/289 (27%), Positives = 135/289 (46%), Gaps = 31/289 (10%) Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRY--ENIEVIAVNDGSTDKTRAILDRMAAQIPHL 132 P ++II+P +N E + E I + Q Y E +++I ++ STD + R P L Sbjct: 53 PWVTIIVPVYNSEDTLFECIQSINEQTYPKEAMQIILADNQSTDGSFGAYARAQNTFPEL 112 Query: 133 RVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGN 192 + ++ ++GKA AL T + Y++ ID D LL+++A +V + + A+TG Sbjct: 113 ILRYVNTDKGKARALNTAIYESIGIYVINIDSDGLLEKNAVKNMVLRFENDLSIAAMTG- 171 Query: 193 PRIRTRSTLVGKI-----------QVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSA 241 I + LV K+ + EY+ + + +FT+SG +AFRR Sbjct: 172 -AILPKEELVQKVKGFGHRLLAKNEYYEYAQAFLSGRTIEASRDQLFTMSGAFSAFRREV 230 Query: 242 LAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALC--WILMPETLKG---LWKQRLRW 296 L E + D I ED D++++++ T + +C I E + G L+ QR RW Sbjct: 231 LMETFLYDIDTIGEDTDMTFQIR----TRLGKKVVICADAIFYVEPISGFSELYTQRQRW 286 Query: 297 AQGGAEV---FLKNMTRLWR-KENFRMWPLFFEYCLT---TIWAFTCLV 338 +G EV ++K L R +NF + + ++ T IW F +V Sbjct: 287 QRGELEVTENYMKQTASLKRFFKNFLVRRMMIDHTFTFPRMIWTFATVV 335 >UniRef50_B7KYJ4 Glycosyl transferase family 2 n=2 Tax=Methylobacterium extorquens group RepID=B7KYJ4_METC4 Length = 423 Score = 92.4 bits (228), Expect = 3e-17, Method: Compositional matrix adjust. Identities = 70/222 (31%), Positives = 104/222 (46%), Gaps = 3/222 (1%) Query: 76 SISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVI 135 ++S+IIP FN + TI + Q +E+I V+DGSTD TRA+ +R VI Sbjct: 54 AVSVIIPTFNGGSGLAPTIASLRQQTLSPLEIIIVDDGSTDDTRAVANRARQDGLVDMVI 113 Query: 136 HLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRI 195 G++ A+ A A + + ID D +L+RDA A + +PRV GN I Sbjct: 114 CHGTRCGRSAAINAAARFASGDLIFTIDADTVLERDAIARLAN-AFNDPRVAGACGNIAI 172 Query: 196 RTRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMIT 254 R S +L +Q EY I K + G + +SG + +RR A G D Sbjct: 173 RNESASLWTGLQSVEYMMSISAGKSILDVVGAIACLSGACSMYRRDVFARQGGL-DVGPG 231 Query: 255 EDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRW 296 ED++ S +L+ + + + P A P GL +QR RW Sbjct: 232 EDLEFSLRLRRLGYRVRFVPDAWAETDGPVAAVGLLRQRARW 273 >UniRef50_A5FAD6 Candidate beta-glycosyltransferase; Glycosyltransferase family 2 n=7 Tax=Flavobacteriales RepID=A5FAD6_FLAJ1 Length = 478 Score = 92.4 bits (228), Expect = 3e-17, Method: Compositional matrix adjust. Identities = 63/256 (24%), Positives = 121/256 (47%), Gaps = 14/256 (5%) Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRY--ENIEVIAVNDGSTDKTRAILDRMAAQI-PH 131 P+ ++I+P +NE K V ET+ + + +E++A++DGS D T + + ++ Sbjct: 124 PTCTVIVPAYNEGKLVYETLMSLAESDFPAHKLELLAIDDGSKDDTWYWIQQAKIKLGDR 183 Query: 132 LRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTG 191 + + QN+GK AL G K E V +D D+++ +D +V P + + + GAV G Sbjct: 184 VSIFQQPQNKGKRHALYRGFNLGKGEVFVTVDSDSIVKKDTLRNLVSPFVVDEKCGAVAG 243 Query: 192 NPRI-RTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSAL-AEVGYWS 249 N ++ ++ K+ + ++ + G+V G +AA+RR+A+ A + W Sbjct: 244 NVQVLNNEKAILPKMLNVSFVMSFEFMRSAESSLGSVLCTPGALAAYRRNAVFACLPEWI 303 Query: 250 DDM-------ITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAE 302 D I ED ++ + + ++ A +PE KGL+K +RW + Sbjct: 304 DQTFMGQPSDIGEDRAMTNMILKQGHHVLFQRNAYVLTNVPEDYKGLYKMFIRWGRSNVR 363 Query: 303 VFLKNMTRLWRKENFR 318 + +W+ +FR Sbjct: 364 ENIMMAKYVWK--DFR 377 >UniRef50_Q74D65 Glycosyl transferase, group 2 family protein n=1 Tax=Geobacter sulfurreducens RepID=Q74D65_GEOSL Length = 472 Score = 92.4 bits (228), Expect = 3e-17, Method: Compositional matrix adjust. Identities = 73/285 (25%), Positives = 122/285 (42%), Gaps = 51/285 (17%) Query: 65 NAPAPQLKDN-------PSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDK 117 NA Q DN +SI++P +NEEK + ++ + LA RY EVI +NDGSTD Sbjct: 43 NAVTSQSIDNVLNGMFYRPLSILVPAYNEEKTIVSSVKSLLALRYPEYEVIVINDGSTDG 102 Query: 118 TRAIL---------DR-MAAQIPHLRVIHL-------------AQNQGKAIALKTGAAAA 154 T L D+ ++ +PH +I +N GKA AL G A+ Sbjct: 103 TLESLINEFRLVRIDKPISLHVPHRPIIAKYVSVDHPYLFVLDKENGGKADALNAGINAS 162 Query: 155 KSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPR--------------IRTRST 200 + ID D++L+ DA + + V A G R +R Sbjct: 163 QFPLFCSIDADSVLEDDALIRATRLFVEDREVVATGGIVRVLNGCEVEDGIVKGVRAPRG 222 Query: 201 LVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDIS 260 ++ Q EY+ + + + ++ +SG FR+ + V + + + ED+D+ Sbjct: 223 MLECFQTVEYTKGFLSGRTSWNYFRSLLIISGAFGIFRKDMVMAVKGYRES-VGEDMDLV 281 Query: 261 WKLQLN------QWTIFYEPRALCWILMPETLKGLWKQRLRWAQG 299 +L + ++ + + P +CW +P + L KQR RW +G Sbjct: 282 VRLHRHCRQNRIRYKVVFVPDPVCWTQVPSDMASLLKQRNRWHRG 326 >UniRef50_A9QSM4 N-acetylglucosaminyltransferase n=39 Tax=Lactobacillales RepID=A9QSM4_LACLK Length = 438 Score = 92.0 bits (227), Expect = 4e-17, Method: Compositional matrix adjust. Identities = 85/341 (24%), Positives = 147/341 (43%), Gaps = 31/341 (9%) Query: 41 MSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQ 100 + +M + +FW+ + E +L P I+I++P NEE + +TI + + Sbjct: 21 LGLMILFSATHFWLTTIKK----ERQKVFRLATYPKITIVVPAHNEEVVIAQTIESIMNL 76 Query: 101 RY--ENIEVIAVNDGSTDKTR----AILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAA 154 Y +E++ D D T +++ + +VI GKA L + A Sbjct: 77 NYPKTQLELLLYADNCQDDTYNQIISVVSLEKYKYISAQVIQRTGTGGKAGVLNDALSIA 136 Query: 155 KSEYLVCIDGDALLDRDAAAYIVEPMLYNP-RVGAVTGNPRIRTRSTLVGKIQVGEYSSI 213 E++ D DA+ +++A ++VE L NP + GAV G + R + Sbjct: 137 TGEWICVYDADAMPEKNALHFLVEKALENPEKYGAVFGRNKTRNYKQ-------NFLTRC 189 Query: 214 IGL-IKRTQRIYG-------NVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQL 265 I L I TQRI+ + + G + + +G W + +TED IS+K+ Sbjct: 190 INLEIVNTQRIHHVGLWHLFKIGRIPGTNFIIKTDFVKHIGGWDNGALTEDTAISFKIMQ 249 Query: 266 NQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRM-WPLFF 324 + I + + PET+ + QR RWA+G +V L N L+ + +R+ W +F+ Sbjct: 250 SGKLIALAHNSEAFQQEPETVSAYYHQRKRWAKGNYQVILDNFKHLFDRSVWRIKWEVFY 309 Query: 325 EYCLTTIWAFTCLV-GFIIYAVQLA--GVPLNIELTHIAAT 362 Y T W +V II+ + G+ + THI T Sbjct: 310 -YISTFFWFNAAIVISDIIFVANIVTWGISMFNPSTHILFT 349 >UniRef50_B9AE19 Putative uncharacterized protein n=1 Tax=Methanobrevibacter smithii DSM 2375 RepID=B9AE19_METSM Length = 461 Score = 92.0 bits (227), Expect = 4e-17, Method: Compositional matrix adjust. Identities = 77/293 (26%), Positives = 127/293 (43%), Gaps = 21/293 (7%) Query: 64 ENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYE-----NIEVIAVNDGSTDKT 118 + P P K P +S+IIP NEE + +T+ + Y N E+I VNDGSTD T Sbjct: 82 QEKPIPYEKP-PFVSLIIPAHNEEYTIAQTVTSISKIDYTLNGKPNFELIVVNDGSTDST 140 Query: 119 RAILDRMAAQIPHLRVIHL---AQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAY 175 L + IP LR++ +GK L + +K E + D D +++D Sbjct: 141 GEKLSELKKDIPILRIVTRKPPKSGKGKGFVLNDALSLSKGEIIGVFDADTQVEKDFLN- 199 Query: 176 IVEPMLYNPRVGAVTGNPRIRTRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVI 234 IV P L NP+V V ++ + + +Q E+ S + + + F + G Sbjct: 200 IVMPYLNNPKVQGVQTRVKMYNKDENFLANMQHVEFESFGNTLIAKDNLGKSGF-LGGNG 258 Query: 235 AAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRL 294 ++ A+ + W +TED+++S K+ L I Y + K +KQR Sbjct: 259 QFVKKQAILDGEKWDGFAVTEDLNLSVKILLKGGQIRYCGETAVYQEAVTDWKSFFKQRT 318 Query: 295 RWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCL---TTIWAFTCLV--GFII 342 RWA G E + ++ + PL +Y + + +AF + GFI+ Sbjct: 319 RWAIGNFETIFIYLPKILKSP----LPLIKKYGIIEHISFYAFNLFIFFGFIV 367 >UniRef50_C1F7J6 Glycosyl transferase, group 2 family n=1 Tax=Acidobacterium capsulatum ATCC 51196 RepID=C1F7J6_ACIC5 Length = 627 Score = 92.0 bits (227), Expect = 4e-17, Method: Compositional matrix adjust. Identities = 67/271 (24%), Positives = 119/271 (43%), Gaps = 12/271 (4%) Query: 52 FWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEV-IAV 110 ++ +R+R P + + P +++ +P FNE+ ++ I A A Y + I V Sbjct: 152 YYRFRKR----ATKEPPKEFPELPRVTVQLPIFNEQFVIDRLIEAICAMDYPRDRLEIQV 207 Query: 111 NDGSTDKTRAI----LDRMAAQIPHLRVIHLAQNQG-KAIALKTGAAAAKSEYLVCIDGD 165 D STD+T+A+ + + Q + +H QG KA AL G AK E++ D D Sbjct: 208 LDDSTDETQAVAAALVKKYQEQGQPIVYLHRTNRQGYKAGALDEGLKVAKGEFVAIFDAD 267 Query: 166 ALLDRDAAAYIVEPMLYNPRVGAVTGN-PRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIY 224 + D ++ +P +G V + + + +++ L + Sbjct: 268 FVPSPDWLMKVIH-HFSDPAIGMVQTRWTHLNRDYSFLTQVEAILLDGHFVLEHGARSRA 326 Query: 225 GNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPE 284 G F +G +RR+A+ + G W D +TED D+S++ QL W Y C +P Sbjct: 327 GVFFNFNGTAGMWRRTAITDAGGWQHDTLTEDTDLSYRAQLVGWKFKYLQDVECPAELPI 386 Query: 285 TLKGLWKQRLRWAQGGAEVFLKNMTRLWRKE 315 + Q+ RWA+G + K M ++ R + Sbjct: 387 EMTAFKTQQARWAKGLIQTSKKIMPQVLRAD 417 >UniRef50_C7PF18 Glycosyl transferase family 2 n=9 Tax=Bacteroidetes RepID=C7PF18_CHIPD Length = 486 Score = 91.7 bits (226), Expect = 5e-17, Method: Compositional matrix adjust. Identities = 82/354 (23%), Positives = 142/354 (40%), Gaps = 54/354 (15%) Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKT---------------- 118 P I+++ P FNE + + + L Y E+I VNDGSTD + Sbjct: 61 PGITVLAPAFNEGLTIIFNVRSLLTLNYPLYEIIIVNDGSTDNSLEQMIDEFELVPVEFA 120 Query: 119 ----------RAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALL 168 R I L VI + KA A+ G AA ++ VC D D +L Sbjct: 121 YNAKIQTKPVRKIYKSTNPAYAKLMVIDKVNGKSKADAVNAGINAALYDHFVCTDVDCIL 180 Query: 169 DRDAAAYIVEPMLYNPRVGAVTGNP----------------RIRTRSTLVGKIQVGEYSS 212 D++ +++P++ + + R+R S L+ + Q EY Sbjct: 181 DKNTLLELIKPVMQEEKKRVIATGATLRIANSCEFDQGVMTRMRPPSQLLPRFQEVEYIR 240 Query: 213 IIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKL----QLNQ- 267 L K + V VSG + F + G + ED+++ ++ + N+ Sbjct: 241 AFVLGKMGWSLLNCVPNVSGGLGLFDKEIAIRSGGYDHSSFGEDMELMTRMCRYAKDNKI 300 Query: 268 -WTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEY 326 + I Y P+ LCW P ++K +QR RWA+G A++ + + +M + F Y Sbjct: 301 DYAIRYIPKTLCWTEAPASVKIFNRQRTRWARGLAQLMYAHFGMFFNPRYGKMGLIIFPY 360 Query: 327 -----CLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLL 375 L I FT ++ +I+ A L + L + +T +++ T+ +L Sbjct: 361 NFFFELLAPIIEFTGIIYYIVMAC-LGLINWPTALLLLVFVYTYSVMITTIAIL 413 >UniRef50_UPI000178A1DC glycosyl transferase family 2 n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI000178A1DC Length = 462 Score = 91.3 bits (225), Expect = 6e-17, Method: Compositional matrix adjust. Identities = 73/284 (25%), Positives = 119/284 (41%), Gaps = 57/284 (20%) Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAA------- 127 PS+S+++P +NEE + E + + Y EVI +NDGS+D+T +L Sbjct: 47 PSVSLLVPAYNEELTIAENVRCLMTLNYPTYEVIVINDGSSDETLKVLIEEYGLTLLPNP 106 Query: 128 ------------------QIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLD 169 Q P L +I +N GKA +L G ++ + ID D+LL+ Sbjct: 107 EIRGNISTNKVRGIYYNPQYPQLYLID-KENGGKADSLNAGINLSRYPLISSIDADSLLE 165 Query: 170 RDAAAYIVEPMLYNPR-VGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRT----QRI- 223 +DA + + NP A+ G+ RI I+ GE + L +R Q I Sbjct: 166 KDALIRMARMYMENPEETVAIGGDVRIANGCV----IENGEVKDV-SLPRRMWPMFQSIE 220 Query: 224 --------------YGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLN--- 266 + + VSG FR+ + VG + ED++I KL Sbjct: 221 YLKAFLGGRIGWSHFNGLIIVSGAFGLFRKDYVIAVGGYRGGYPGEDMNIIIKLHRYMLE 280 Query: 267 ---QWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKN 307 ++ + + P A+CW P++ L QR RW +G + ++N Sbjct: 281 NKIKYRVSFCPEAVCWTQAPDSYNILSNQRKRWGRGNLKNMIEN 324 >UniRef50_Q4KHI6 Glycosyl transferase, group 2 family protein n=11 Tax=Pseudomonadaceae RepID=Q4KHI6_PSEF5 Length = 863 Score = 91.3 bits (225), Expect = 7e-17, Method: Compositional matrix adjust. Identities = 65/251 (25%), Positives = 116/251 (46%), Gaps = 14/251 (5%) Query: 75 PSISIIIPCFNEEKN-VEETIHAALAQRYENIEVIAVNDGSTDKT-----RAILDRMAAQ 128 P +SI +PC+NE V++T++A Y + EV+ +++ + D +A + + + Sbjct: 421 PKVSIHVPCYNEPPEMVKQTLNALANLDYPDFEVLIIDNNTKDPAVWEPVQAYCETLGPR 480 Query: 129 IPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGA 188 V LA +G A+ A +E + ID D +DR+ ++V P +P++ Sbjct: 481 FKFFHVAPLAGFKGGALNYLLPHTAKDAEVIAVIDSDYCVDRNWLKHMV-PHFADPKIAI 539 Query: 189 VTGNPRIRTRS-TLVGKIQVGEYSSI--IGLIKRTQRIYGNVFTVSGVIAAFRRSALAEV 245 V R ++ + K+ EY IG++ R R + G + RR+ L E+ Sbjct: 540 VQSPQDYRDQNESTFKKLCYAEYKGFFHIGMVTRNDR---DAIIQHGTMTMTRRTVLEEL 596 Query: 246 GYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFL 305 G W+D I ED ++ ++ + Y + LMP+T KQR RWA G ++ Sbjct: 597 G-WADWCICEDAELGLRVFEKGLSAAYHHESYGKGLMPDTFIDFKKQRFRWAYGAIQIIK 655 Query: 306 KNMTRLWRKEN 316 ++ L R +N Sbjct: 656 RHTASLLRGKN 666 >UniRef50_C9NZ15 Putative uncharacterized protein n=1 Tax=Vibrio coralliilyticus ATCC BAA-450 RepID=C9NZ15_9VIBR Length = 391 Score = 91.3 bits (225), Expect = 7e-17, Method: Compositional matrix adjust. Identities = 74/254 (29%), Positives = 120/254 (47%), Gaps = 18/254 (7%) Query: 72 KDNPSISIIIPCFNEEKNVEETIHAAL----AQRYEN-IEVIAVNDGSTDK--TRAILDR 124 ++P +S+ +P +NE EE + AL Q Y + I+VI ++DGST+ +R Sbjct: 37 DNDPEVSVFLPFYNEN---EEYLICALNKLNEQNYSSRIQVIVIDDGSTNDAISRVKAWI 93 Query: 125 MAAQIPH-LRVIHLAQNQG-KAIALKT--GAAAAKSEYLVCIDGDALLDRDAAAYIVEPM 180 + +I H +++ A N G K AL A + V +D D ++ + +V + Sbjct: 94 TSTEINHDFQIVERAVNGGRKGFALDHVLHLGVATGDVYVVVDSDTYIEPNGIYELVTKL 153 Query: 181 LYNPRVGAVTG--NPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFR 238 + + AV G P + +GK+Q E+ S G I+ Q G V ++G A R Sbjct: 154 WSDEKYAAVCGYITPE-NHEVSFIGKLQHYEHVSFYGAIRAAQDKLGCVPVLAGAFVAHR 212 Query: 239 RSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQ 298 S + ++G WS+ ++ EDI WK N + Y P+A P KGL+ QR RWA+ Sbjct: 213 ASVVKKLGGWSEWLV-EDIAWCWKAISNGYRTGYAPKAKATTQCPTDAKGLFNQRRRWAR 271 Query: 299 GGAEVFLKNMTRLW 312 G E + + W Sbjct: 272 GRVEAYTEAWKTHW 285 >UniRef50_A0LP64 Glycosyl transferase, family 2 n=1 Tax=Syntrophobacter fumaroxidans MPOB RepID=A0LP64_SYNFM Length = 924 Score = 90.9 bits (224), Expect = 8e-17, Method: Compositional matrix adjust. Identities = 69/235 (29%), Positives = 105/235 (44%), Gaps = 15/235 (6%) Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRV 134 PSIS+++ +N + + + Y N EV+ VNDGSTD T I AA+ +V Sbjct: 399 PSISVVVCSYNGSRTIRNCLDGLRRLDYPNFEVVVVNDGSTDGTGEI----AAEY-GFKV 453 Query: 135 IHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPR 194 I+ N G + A TG AAK E + ID DA D Y+ L VG N Sbjct: 454 IN-TTNHGLSSARNTGMRAAKGEIVAYIDDDARPDPHWLTYLAYTFLTTDHVGVGGPNLP 512 Query: 195 IRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDD--M 252 + + I L+ + + + G AFR+SAL + + + Sbjct: 513 PPEDGPIAACVANSPGGPIHVLLTDQEAEH-----IPGCNMAFRKSALEAIDGFDHRFRI 567 Query: 253 ITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKN 307 +D+D+ W+LQ WT+ + P A+ W ++K WKQ+ + G AE FL+ Sbjct: 568 AGDDVDLCWRLQKKGWTLGFNPAAMVWHHRRNSVKAYWKQQKNY--GKAEAFLEK 620 >UniRef50_D2QUU5 Glycosyl transferase family 2 n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QUU5_9SPHI Length = 508 Score = 90.9 bits (224), Expect = 9e-17, Method: Compositional matrix adjust. Identities = 66/238 (27%), Positives = 113/238 (47%), Gaps = 14/238 (5%) Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRYENIEV-IAVNDGSTDKTRAILDRMAAQ----- 128 P +++ +P +NE VE I A + +Y ++ I V D STD+T +I+ R A+ Sbjct: 53 PRLTVQLPVYNELYVVERLIDAVVLLKYPKDKLDIQVLDDSTDETVSIIARKVAEYKKQG 112 Query: 129 --IPHLRVIHLAQNQG-KAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPR 185 I H+R + +G KA AL G AK E++ D D + D + V P +P+ Sbjct: 113 FDIEHIR---RPERKGFKAGALAYGLTLAKGEFVAIFDADFVPDPEFLLKTV-PHFADPK 168 Query: 186 VGAVTGN-PRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAE 244 V V + +L+ ++Q ++ + + + G + +G +R+ A+A+ Sbjct: 169 VAIVQTRWEHLNEDFSLITQLQAFGLNAHFTVEQSGRYAAGLLANFNGTGGVWRKVAIAD 228 Query: 245 VGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAE 302 G W D +TED+D+S++ QL W Y +P + L Q+ RW +G AE Sbjct: 229 AGGWQSDTLTEDLDLSYRAQLRGWKFVYREDVGSPAELPVAMNALKSQQYRWMKGAAE 286 >UniRef50_A3HSN4 Glycosyl transferase, group 2 family protein n=1 Tax=Algoriphagus sp. PR1 RepID=A3HSN4_9SPHI Length = 486 Score = 90.9 bits (224), Expect = 1e-16, Method: Compositional matrix adjust. Identities = 74/278 (26%), Positives = 122/278 (43%), Gaps = 53/278 (19%) Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKT---------------- 118 P +SII P +NE +++ + + ++ Y EVI +NDGS D T Sbjct: 61 PGVSIIAPAYNEGQSIVQNAKSLMSLHYGKFEVIIINDGSKDDTLQKLLEAFELEKTEFA 120 Query: 119 ----------RAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALL 168 R + L V+ +N GKA AL +G A E + CID D +L Sbjct: 121 YDYQIDCKQVRGVYKSKNRSYSKLVVVD-KENGGKADALNSGINLASLEIIACIDVDCIL 179 Query: 169 DRDAAAYIVEPML--YNPRVGAVTGN--------------PRIRTRSTLVGKIQVGEY-- 210 D+ +V P + N +V AV G + R ++L+G+ QV EY Sbjct: 180 SHDSITRMVRPFMEETNRKVIAVGGAIGIANNCDVQDGTVTKYRLPTSLLGRFQVIEYFR 239 Query: 211 SSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKL------Q 264 + ++G + T R+ G + +SG F + + +VG + + ED+++ ++ Q Sbjct: 240 AFLMGRMAWT-RVNG-LMLISGAFGFFNKDLVLKVGGYFPKTVGEDMELVVRMRRYMEEQ 297 Query: 265 LNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAE 302 + + + P LCW +PE L +QR RW +G E Sbjct: 298 KIPYRVGFVPDPLCWTEVPEDESVLARQRNRWIRGTIE 335 >UniRef50_A0LZL8 Transmembrane family-2 glycosyl transferase-possibly involved in biofilm formation n=1 Tax=Gramella forsetii KT0803 RepID=A0LZL8_GRAFK Length = 483 Score = 90.5 bits (223), Expect = 1e-16, Method: Compositional matrix adjust. Identities = 80/326 (24%), Positives = 133/326 (40%), Gaps = 56/326 (17%) Query: 73 DNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKT-RAILDRMAAQ--- 128 D PS+++I P +NE + E + + L+ +Y E++ VNDGS D + + ++D + Sbjct: 64 DIPSVTVIAPAYNEGLTIIENVKSLLSIQYPYYELLLVNDGSKDDSLQKLIDEFDLEQQD 123 Query: 129 -------IPHLRVIHL---------------AQNQGKAIALKTGAAAAKSEYLVCIDGDA 166 IP V H+ N GKA A+ G A +E ++ D D Sbjct: 124 ATFITQPIPTASVKHIYRSRKTKYTHLTVIDKNNGGKADAINAGVNFANTELVIFTDVDC 183 Query: 167 LLDRDAAAYIVEPMLYNPRV-------------GAVTGN---PRIRTRSTLVGKIQVGEY 210 ++++DA +V P L +V N +R S L+ QV EY Sbjct: 184 IIEQDAILKMVRPYLEEDDKEIIGCGGGIGIANDSVIKNGILTELRLPSRLIPMTQVVEY 243 Query: 211 SSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQ--- 267 L + + +SG + R + EVG ++ + ED+++ +L+ + Sbjct: 244 IRAFLLGRMAWSEINGLMLISGAFGMYPRKRVIEVGGFNPKTVGEDLELCIRLRKHMEDK 303 Query: 268 ---WTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFF 324 + + Y P LCW P K L QR RWA+G E + L+ ++ M LF+ Sbjct: 304 KIPYKVVYLPETLCWTEGPSDYKILVMQRDRWARGLWETMSLHKDLLFNRKYGNMGLLFY 363 Query: 325 EYCL--------TTIWAFTCLVGFII 342 Y L C++ F I Sbjct: 364 PYWLLFEFGAPIVEFMGVVCIIAFTI 389 >UniRef50_B5W069 Glycosyl transferase family 2 n=3 Tax=Arthrospira RepID=B5W069_SPIMA Length = 561 Score = 90.5 bits (223), Expect = 1e-16, Method: Compositional matrix adjust. Identities = 62/245 (25%), Positives = 115/245 (46%), Gaps = 7/245 (2%) Query: 67 PAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYEN--IEVIAVNDGSTDKTRAILDR 124 P + ++ P +S+++ NEE + + A Y + E+ ++D S+D+T +L Sbjct: 161 PEHRPEELPFVSLLVAAKNEEAVIGNLVKNLCALNYPSHCYELWVIDDNSSDRTPIVLQE 220 Query: 125 MAAQIPHLRVIHLAQNQ--GKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLY 182 +A + L ++H +N GK+ AL + + L D DA +D D ++ P Sbjct: 221 LAKEYQQLHILHRDENATGGKSGALNQALPLTRGKILGVFDADATVDSDLLQQVI-PKFQ 279 Query: 183 NPRVGAVTGNPRIRTRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSA 241 +VGAV I + + + Q E + + ++ + G + + G RR+A Sbjct: 280 AEQVGAVQLQKAIANSNFNFLTRCQASEMA-LDAFFQKQRVAVGGIGELRGNGEFIRRAA 338 Query: 242 LAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGA 301 L G W + IT+D+D++ +L L+ W I + ++ + LW QR RWA+GG Sbjct: 339 LESCGGWCEQTITDDLDLTIRLHLDHWDIEFLDTSVVFEEGVTNWVALWHQRNRWAEGGY 398 Query: 302 EVFLK 306 + +L Sbjct: 399 QRYLD 403 >UniRef50_B0C9M4 Glycosyl transferase, family 2 n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0C9M4_ACAM1 Length = 492 Score = 90.5 bits (223), Expect = 1e-16, Method: Compositional matrix adjust. Identities = 74/275 (26%), Positives = 124/275 (45%), Gaps = 20/275 (7%) Query: 41 MSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDN--PSISIIIPCFNEEKNVEETIHAAL 98 +SI+W ++++R R E +P + D P ++I +P FNE V+ + A Sbjct: 32 ISIIW-----RYYLHRRR-----EISPLHKFSDADLPQVTIQLPLFNEMYVVDRLLEAVA 81 Query: 99 AQRYENIEV-IAVNDGSTDKTRAILDRMAAQIP--HLRV--IHLAQNQG-KAIALKTGAA 152 A Y ++ I V D STD+TR I ++ HL + IH +G KA AL G Sbjct: 82 ALEYPVDKLQIQVLDDSTDETREICRAKVRELKQRHLNIDYIHRCDRKGYKAGALAYGLQ 141 Query: 153 AAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGN-PRIRTRSTLVGKIQVGEYS 211 +A + ++ D D + D +V NP+VG V I +++ +IQ Sbjct: 142 SATGDLVMIFDADFVPSPDTLINMVH-YFANPKVGMVQARWGHINRHYSILTEIQALMLD 200 Query: 212 SIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIF 271 + ++ G F +G +R + + G W +TED+D+S++ QL W Sbjct: 201 GHFVTEQTSRNRSGCFFNFNGTAGIWRIQTIEDAGGWQHTTVTEDLDLSYRAQLKGWECI 260 Query: 272 YEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLK 306 Y P + +P + Q+ RWA+G ++V K Sbjct: 261 YLPNIVVPAELPMEMNSFKSQQFRWAKGASQVAKK 295 >UniRef50_B3QIT3 Glycosyl transferase family 2 n=6 Tax=Bradyrhizobiaceae RepID=B3QIT3_RHOPT Length = 944 Score = 89.7 bits (221), Expect = 2e-16, Method: Compositional matrix adjust. Identities = 75/257 (29%), Positives = 120/257 (46%), Gaps = 20/257 (7%) Query: 75 PSISIIIPCFNEEKN-VEETIHAALAQRYENIEVIAVNDGSTDK--TRAILDRMAAQIPH 131 P +SI +P + E +++T+ A Y N EV+ + + + D T+ I D Sbjct: 473 PKVSIHVPAYFEPPEMLKQTLDALARLDYPNFEVVVIINNTPDPAFTQPIQDHCRELGER 532 Query: 132 LRVIHLAQNQG-KAIALKTGA--AAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGA 188 + I+ + +G KA AL+ A +E + ID D ++ D +V P +PRVG Sbjct: 533 FKFINAEKVKGFKAGALRIAMERTAVDAEIIGIIDADYVVTPDWLKDLV-PAFDDPRVGL 591 Query: 189 VTGNPRIRTRS-TLVGKIQVGEYSSI--IGLIKRTQRIYGNVFTVSGVIAAFRRSALAEV 245 V R +L+ I GEY+ IG+++R + N V G + RR+A+ Sbjct: 592 VQAPQEHRDGDRSLMHYIMNGEYAGFFDIGMVQRNEY---NGIIVHGTMCLIRRAAMDMA 648 Query: 246 GYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEV-- 303 G WS D I ED D+ ++ + W Y + L+P+T + KQR RWA GG ++ Sbjct: 649 GGWSSDTICEDSDLGLEIMEHGWLTHYTNTRYGYGLLPDTYEAFKKQRHRWAYGGFQIIK 708 Query: 304 -----FLKNMTRLWRKE 315 FL +RL R + Sbjct: 709 KHWRRFLPGNSRLSRDQ 725 >UniRef50_Q1IMJ5 Glycosyl transferase, family 2 n=10 Tax=cellular organisms RepID=Q1IMJ5_ACIBL Length = 546 Score = 89.7 bits (221), Expect = 2e-16, Method: Compositional matrix adjust. Identities = 64/256 (25%), Positives = 110/256 (42%), Gaps = 8/256 (3%) Query: 67 PAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEV-IAVNDGSTDKT----RAI 121 P + P +++ +P FNE+ ++ + A Y ++ I V D STD+T R + Sbjct: 82 PPQHFAELPRVTVQLPIFNEQYVIDRLVEAVCKLDYPKDKLDIQVLDDSTDETVEVAREV 141 Query: 122 LDRMAAQIPHLRVIHLAQNQG-KAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPM 180 ++R AA + IH G KA AL+ G A K E++ D D + D + Sbjct: 142 VERYAALGNPISYIHRTNRHGFKAGALQEGMAVCKGEFIAIFDADFVPPADFLQKCIH-H 200 Query: 181 LYNPRVGAVTGN-PRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRR 239 P +G V + + + +++ L + G F +G +R+ Sbjct: 201 FAEPEIGMVQTRWTHLNRNYSFLTEVEAILLDGHFVLEHGGRSRKGVFFNFNGTAGMWRK 260 Query: 240 SALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQG 299 A+ E G W D +TED D+S++ Q+ W Y C +P + Q+ RWA+G Sbjct: 261 QAIEEAGGWQHDTLTEDTDLSYRAQVKGWRFKYLQDVECPAELPIEMTAFKTQQARWAKG 320 Query: 300 GAEVFLKNMTRLWRKE 315 + K + L+R + Sbjct: 321 LIQCSKKVLPFLYRSD 336 >UniRef50_B5IUE6 Glycosyl transferase, group 2 family protein n=1 Tax=Thermococcus barophilus MP RepID=B5IUE6_9EURY Length = 309 Score = 89.7 bits (221), Expect = 2e-16, Method: Compositional matrix adjust. Identities = 71/230 (30%), Positives = 117/230 (50%), Gaps = 11/230 (4%) Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRYENI-EVIAVNDGSTDKTRAILDRMAAQIPH-- 131 P ISI IP +NE++N+++ + + L Q+ NI E+I +NDGSTD T + +++ I Sbjct: 4 PRISIGIPVYNEQENIQKLLLSILCQKGVNIREIIIINDGSTDNTLKKIMEISSIIERDI 63 Query: 132 -LRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDA-LLDRDAAAYIVEPMLYNPRVGAV 189 L+VI+L N GKA AL A SEYLV +D D L+D +V P++ N +G V Sbjct: 64 VLKVINLRHNIGKANALNIIFRIASSEYLVLLDSDVILMDEYTLKRLVYPLIKNEDIGLV 123 Query: 190 TGNPRIRTRSTL----VGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEV 245 +G I + S++ KI Y L+K +G V+ +G I A R+ E+ Sbjct: 124 SGWYEISSYSSINMSVFNKIITWAYKFSAKLLKEIGSNFG-VYGATGAIMAVPRNIYKEL 182 Query: 246 GYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLR 295 +M+ +D+ + + ++P A + + TLK + ++R Sbjct: 183 Q-LPTNMLRDDLYVYLDVLSKNKKFVFQPNAKVIVHINPTLKNYLRYQVR 231 >UniRef50_Q1D6P4 Glycosyl transferase, group 2 n=3 Tax=Cystobacterineae RepID=Q1D6P4_MYXXD Length = 417 Score = 89.4 bits (220), Expect = 3e-16, Method: Compositional matrix adjust. Identities = 81/333 (24%), Positives = 139/333 (41%), Gaps = 41/333 (12%) Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRYEN--IEVIAVNDGSTDKTRAILDRMAAQIPHL 132 P ++I+IP FNE + + + + L Q Y + ++++ V+D S D + A + A P++ Sbjct: 41 PRVAIVIPLFNEGEGIYHGVRSLLEQDYPSHLLQIVVVDDCSKDDSYAWALKAAEGNPNV 100 Query: 133 RVIHLAQNQGKAIALKTGA-AAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTG 191 V+ +N GK + AA +E +V +D D ++D+ A +V + +PR+ AV G Sbjct: 101 IVMRNPENMGKRKGINRAVKAATDAEIIVSVDSDVIVDKAAVRQLVRRFV-SPRIAAVGG 159 Query: 192 NPRIRTRST--LVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAE----- 244 + R + I++ ++ +K +R + V +SG + A+RR L E Sbjct: 160 RTYVTNRHQNWMTRMIEI-KFHFAQEWLKDLERSFRQVMCLSGCLTAYRRHVLLELEPIL 218 Query: 245 ---------VGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLR 295 + Y D +T I + + Y A C+ P TL G + Q+LR Sbjct: 219 EARAIAGIPIKYGEDRFLTRQI------VKHDYETVYTLDAFCFTAAPSTLAGYFSQQLR 272 Query: 296 WAQGGAEVFLKNMTRLWRKEN------FRMWPLFFEYCLTTIW-----AFTCLVGFIIYA 344 W + L ++ WR L Y L + F ++ F Sbjct: 273 WRRSNLVDLLGGLSHAWRLHPVVAVHYVSQLALLLSYPLVIVHNVLTGEFWDILAFHFLV 332 Query: 345 VQLAGVPLNIELTHIAA---THTAGILLCTLCL 374 + L GV IE H+ A H A L L + Sbjct: 333 IGLLGVIYRIETRHLPADRRVHGASFLPMALLM 365 >UniRef50_D1N5H0 Glycosyl transferase family 2 n=1 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1N5H0_9BACT Length = 463 Score = 89.0 bits (219), Expect = 3e-16, Method: Compositional matrix adjust. Identities = 67/280 (23%), Positives = 133/280 (47%), Gaps = 15/280 (5%) Query: 67 PAPQLKDN--PSISIIIPCFNEEKNVEETIHAALAQRY--ENIEVIAVNDGSTDKTRAIL 122 P L D P+ ++I+P +NE K+V + + LA Y +E++A++DGS D T + Sbjct: 82 PRRTLSDGELPTCTVIVPAYNEGKSVLNALDSVLASDYPAAKLEILAIDDGSADDTWYWI 141 Query: 123 DRMAAQIP-HLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPML 181 AA+ + + L +N GK AL G +++E +V +D D++++ D + P Sbjct: 142 KLAAARSGGRITPVKLEKNGGKRHALYRGIRQSRAEVIVTVDSDSVVEPDTLRLLNSPFA 201 Query: 182 YNPRVGAVTGNPRI-RTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRS 240 + ++ V G+ R+ + ++ ++ + +++ Q + G+VF G ++A+RR+ Sbjct: 202 -DSKIAGVAGSIRVLNLQDGILPRMMDVNFVFGFEIMRSAQSVLGSVFCTPGALSAYRRA 260 Query: 241 ALAE-VGYWSDDM-------ITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQ 292 AL + W D I ED ++ L + I ++ A+ ++P + + K Sbjct: 261 ALLPFLDQWVDQKFFGEPAHIAEDRALATFLLADDHRIVFQRNAVARTIIPSSYRTTCKM 320 Query: 293 RLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIW 332 LRW +G +RK ++ + F + T+W Sbjct: 321 LLRWGRGDVRETCSMYRFAFRKFSWFHLGIQFNLLMQTMW 360 >UniRef50_A8VSD1 Binding-protein-dependent transport systems inner membrane component n=2 Tax=Bacillus selenitireducens MLS10 RepID=A8VSD1_9BACI Length = 476 Score = 89.0 bits (219), Expect = 3e-16, Method: Compositional matrix adjust. Identities = 81/327 (24%), Positives = 136/327 (41%), Gaps = 53/327 (16%) Query: 72 KDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIP- 130 +D IS+++P +NEE + ++++ L Y EVI +NDGS D+T + P Sbjct: 58 QDTYPISVLVPAYNEEVGIVASVNSLLTLNYPEYEVIVINDGSDDETSLRIKDHFNMKPV 117 Query: 131 ------HLRVIHLA----------------QNQGKAIALKTGAAAAKSEYLVCIDGDALL 168 H + + +N GK AL G + Y ID D++L Sbjct: 118 KLAMRKHFQTAEVTAAYQSSEYGNLYLIEKENGGKGDALNAGINFSSYPYFASIDADSIL 177 Query: 169 DRDAAAYIVEPML-YNPRVGAVTGNPRI----RTRSTLVGKI----------QVGEY--S 211 ++DA ++P++ N V A G RI + ++V I Q+ EY Sbjct: 178 EKDALLKTMKPIIDSNGEVTATGGTVRIANGCQIEKSVVQTITLPKKPLEVMQIIEYFRG 237 Query: 212 SIIGLI--KRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDI------SWKL 263 +IG + RT + VSG F +S + VG + + ED+++ S K Sbjct: 238 FLIGRLGFSRTNML----MIVSGAFGVFEKSRVIRVGGYDSSTVGEDMELVVRIHRSIKD 293 Query: 264 QLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLF 323 + + I Y +CW P T L QR+RW +G E K+ ++ + R+ Sbjct: 294 EKSPQRIEYIQDPVCWTEAPSTFSSLKSQRIRWQRGLTETIWKHKGMMFNPKYKRVGLFS 353 Query: 324 FEYCLTTIWAFTCLVGFIIYAVQLAGV 350 Y T + + +V F Y + + G+ Sbjct: 354 LPYY-TFVELLSAVVEFFGYLIIILGL 379 >UniRef50_B3DW74 Glycosyltransferase n=1 Tax=Methylacidiphilum infernorum V4 RepID=B3DW74_METI4 Length = 480 Score = 88.6 bits (218), Expect = 4e-16, Method: Compositional matrix adjust. Identities = 70/295 (23%), Positives = 130/295 (44%), Gaps = 24/295 (8%) Query: 36 FWPFFMSIMWIVGGVY-------FWVYRERHWPWGENAPAPQL-KDNPSISIIIPCFNEE 87 FW ++++ + G+Y W+ R + AP L P ++I +P +NE+ Sbjct: 4 FWFLILALVLFIYGIYRMSLILRLWMGSHRD----KKAPTDALFYTYPEVTIQLPIYNEK 59 Query: 88 KNVEETIHAALAQRY-ENIEVIAVNDGSTDKTRAILDRMAAQIPH----LRVIHLAQNQG 142 VE +HA Y +N I + D STD+T AI+ + + + + +G Sbjct: 60 SVVERLLHAVCKIDYPKNKMEIQIIDDSTDETTAIISKWVCEYQKKGFDIYQLRRGTREG 119 Query: 143 -KAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVE--PMLYNPRVGAVTGN-PRIRTR 198 KA L+ G +K E++ D D L +++ E P + VG V + + Sbjct: 120 FKAGGLQYGLERSKGEFIAIFDADFL---PPPSFLKETLPYFRSRDVGMVQARWGYLNRQ 176 Query: 199 STLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDID 258 ++L+ + Q L + + Y F +G +R+ + + G W D +TED+D Sbjct: 177 ASLLTRCQALFLDGHFLLEQPVRYKYNLFFNFNGTAGIWRKKCIIDAGGWEGDTLTEDLD 236 Query: 259 ISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWR 313 +S++ Q W Y P+ + +P + Q+ RWA+G + K++ L++ Sbjct: 237 LSYRAQFKGWKFVYTPQMVVPSELPSPIVAFRTQQHRWAKGAIQTAKKHLFSLFK 291 >UniRef50_Q03MS9 Glycosyltransferase, probably involved in cell wall biogenesis n=5 Tax=Lactobacillales RepID=Q03MS9_STRTD Length = 397 Score = 88.2 bits (217), Expect = 6e-16, Method: Compositional matrix adjust. Identities = 69/284 (24%), Positives = 136/284 (47%), Gaps = 18/284 (6%) Query: 76 SISIIIPCFNEEKNV-EETIHAALAQRYENIEVIAVNDGSTDKTRAI--LDRMAAQIPHL 132 S+S+I+P +NEE + + I + +AQ+ ++E+I V+DGS ++ I + ++ Sbjct: 49 SVSVIVPAYNEEPQILKNCIDSIVAQKAPDLEIIVVDDGSKNREELIEKVYNTYQSNQNV 108 Query: 133 RVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALL-DRDAAAYIVEPMLYNPRVGAVTG 191 +++ +N+GK K G AK + +V +D D LL D +A +++ Y VGAVTG Sbjct: 109 KILLPEENKGKRHCQKLGFDIAKGDIIVTVDSDTLLHDENAVEKLIQRFAY-KNVGAVTG 167 Query: 192 NPRIRTRST-LVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEV--GYW 248 + R+ ++T ++ ++ Y + + Q + V SG +A+R+ + +V Y Sbjct: 168 DVRVENKNTNILTRLITYRYWTAFHQERAAQSRFHVVMCCSGPFSAYRKEIIDKVKEKYI 227 Query: 249 SDDMITEDIDISWKLQLNQWT------IFYEPRALCWILMPETLKGLWKQRLRWAQGGAE 302 + + E+ L + + + + +PET++G KQ++RW + Sbjct: 228 TQYFLGENCTYGDDRHLTNLVLEEGHDVAFHRDSRVYTFVPETIRGYIKQQVRWNKS--- 284 Query: 303 VFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQ 346 F + M + R + + ++ + I F +V I AVQ Sbjct: 285 -FYREMLWTIKFAPKRHFYMLYDLVMQFILPFMLVVSLIAMAVQ 327 >UniRef50_A1ULY3 Glycosyl transferase, family 2 n=4 Tax=Mycobacterium RepID=A1ULY3_MYCSK Length = 651 Score = 88.2 bits (217), Expect = 6e-16, Method: Compositional matrix adjust. Identities = 59/226 (26%), Positives = 101/226 (44%), Gaps = 5/226 (2%) Query: 75 PSISIIIPCFNEEKNVEETIHA---ALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPH 131 P +++IIP NE ++ TI A A A+ E + + +++ STD T + A H Sbjct: 267 PPLAVIIPAHNEAHDITATIEAVDRAAARYAETVHIYVIDNASTDDTADVAQTAIAACAH 326 Query: 132 -LRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVT 190 +H GKA+AL G + + E++V ID D ++ + + + +V AV Sbjct: 327 STGEVHECAVPGKAVALNYGLSVIREEFVVRIDADTVIGENCLDVTLR-HFTDAKVAAVG 385 Query: 191 GNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSD 250 G PR T ++++ E G + Y + G+ +RR + EVG Sbjct: 386 GMPRPERIRTFFDRVRLVEVLVKHGFFQVAMMGYDGIIGEPGMFVVYRRRVVEEVGGIVQ 445 Query: 251 DMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRW 296 M ED DI ++ + +P A+ + P++ L +QR RW Sbjct: 446 GMNGEDTDICMRMSSQGYLSLVDPTAVYFSETPQSWAHLREQRTRW 491 >UniRef50_D2EES4 Glycosyl transferase family 2 n=1 Tax=Candidatus Parvarchaeum acidiphilum ARMAN-4 RepID=D2EES4_9EURY Length = 420 Score = 87.0 bits (214), Expect = 1e-15, Method: Compositional matrix adjust. Identities = 87/330 (26%), Positives = 154/330 (46%), Gaps = 19/330 (5%) Query: 27 GELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNE 86 +L + F +S+ WI F+ + H E +LK+ PS++++IP E Sbjct: 5 SDLFTSLIVFVLLVLSLFWI-----FYFFDINHEIKKEGENELKLKEFPSVAVLIPTIYE 59 Query: 87 EKNVEETIHAALAQRYENIEV-IAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAI 145 K + +TI Y N ++ + +N ST +T AA ++VI A GKA Sbjct: 60 GKRLRDTIENVKQTIYPNYKIYVVLNKSSTKETIK-----AAYTKGVKVIK-APFDGKAR 113 Query: 146 ALKTGAAAAKSEYLVCI-DGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGK 204 + +E L+ + D D +++D +V N +V AV + ++ +V Sbjct: 114 VMNYAIKNYINEKLLLVLDADTFIEKDLVTRLVYH-FKNKKVAAVVSSVKVYKPKKVVEY 172 Query: 205 IQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQ 264 +Q EY I K +I G + + G + F L ++GY+ D TED++I ++ Sbjct: 173 LQYYEYLLSILARKALSKI-GGLVIIHGAGSMFNIEILKKIGYFDDSNYTEDMEIGMRIL 231 Query: 265 LNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLK-NMTRLWRKENFRMWPLF 323 + + + +AL + + P T + L+KQR+RW G LK T L K+N +W + Sbjct: 232 TSGYKVESSVKALSYTVTPNTFRKLFKQRVRWFSGFFFNILKYRKTALDSKKN-EIWKIA 290 Query: 324 FEYCLTTIW-AFTCLVGFIIYAVQLAGVPL 352 + LT+I+ +F ++G I Y V + +P+ Sbjct: 291 LPFMLTSIFLSFFVIIG-IAYTVIVFLLPV 319 >UniRef50_Q04QL4 Glycosyltransferase plus another conserved domain n=6 Tax=Leptospira RepID=Q04QL4_LEPBJ Length = 517 Score = 87.0 bits (214), Expect = 1e-15, Method: Compositional matrix adjust. Identities = 68/285 (23%), Positives = 129/285 (45%), Gaps = 27/285 (9%) Query: 51 YFWVY-RERHWPWGENAPAPQLKDN----PSISIIIPCFNEEKNVEETIHAALAQRYENI 105 Y VY ++++ + E+ P L N P +++ +P FNE V+ I +A +Y Sbjct: 28 YIMVYLYKKNYAYCESEPDKILDVNDPNLPVVTVQLPIFNEFYVVDRLIETTVALKYPKD 87 Query: 106 EV-IAVNDGSTD----KTRAILDRMAA---QIPHLRVIHLAQNQGKAIALKTGAAAAKSE 157 ++ I + D STD K+R +++ A I HL + KA AL+ G A+ E Sbjct: 88 KLEIQLLDDSTDETIEKSRNLINHYKALGFDIHHLHRAGAERTGHKAGALEAGMKVARGE 147 Query: 158 YLVCIDGDALLDRDAAAYIVEPMLYNPRVGAV-------TGNPRIRTRSTLVGKIQVGEY 210 Y+ D D + D D V P +P++G V + + T++ G + + Sbjct: 148 YIAIFDADFMPDPDFLIKTV-PYFEDPQIGMVQVRWGHINADYNVLTKAQSFG---IDGH 203 Query: 211 SSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTI 270 I + + ++ N +G+ +++ + + G W D +TED D+S++ ++ W Sbjct: 204 FMIEQVARNGSHLWMNFNGTAGI---WKKECIIDSGGWEHDTLTEDFDLSYRAEMRGWKF 260 Query: 271 FYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKE 315 Y C +P + Q+ RW +G + +K + R++R + Sbjct: 261 RYFKDIECKAEIPAMISAYKSQQFRWCKGSIQTAVKLLPRIFRAD 305 >UniRef50_C3LL95 Chitin synthase n=10 Tax=Bacilli RepID=C3LL95_BACAC Length = 366 Score = 86.7 bits (213), Expect = 1e-15, Method: Compositional matrix adjust. Identities = 76/279 (27%), Positives = 124/279 (44%), Gaps = 32/279 (11%) Query: 76 SISIIIPCFNEEKN-VEETIHAALAQRYENIEVIAVNDGSTDKT--------RAILDRMA 126 +++++P +NE + + TI++ LAQ Y E+ V+DGS DK+ R L R Sbjct: 2 KVAVVVPSYNESASAIVNTINSVLAQDYPIHEIFFVDDGSKDKSAYEVALKMREELLRTQ 61 Query: 127 AQ--------------IPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDA 172 + IP L V L +N GK A ++ +V ID D L +A Sbjct: 62 REIAATTKNICSEILGIPDLIVHRLPKNCGKRHAQLWAFKRTTADAIVTIDSDGDLFPNA 121 Query: 173 AAYIVEPMLYNPRVGAVTGNPRIRTRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVS 231 +++P + +V A TG+ IR R+ L+ K+ Y + + + Q + GNV S Sbjct: 122 VRELLKP-FNDEKVMATTGHVNIRNRNDNLLTKLIDMRYDNAFRVERAAQSVTGNVLVCS 180 Query: 232 GVIAAFRRSALAE--VGYWSDDMITEDIDISWKLQLNQWTIF-----YEPRALCWILMPE 284 G ++ +RR + E Y S + E++ L + I Y+ A C P Sbjct: 181 GPLSCYRREVITENLENYGSQMFLGEEVQFGDDRCLTNYAILKGKTVYQSTARCITDAPT 240 Query: 285 TLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLF 323 TLK KQ+LRW + L ++ +K N +W +F Sbjct: 241 TLKQFLKQQLRWNKSFFRESLISLGIGMKKPNVLVWTIF 279 >UniRef50_UPI0001C1680B Glycosyl transferase, family 2 n=2 Tax=Nostocaceae RepID=UPI0001C1680B Length = 467 Score = 86.7 bits (213), Expect = 1e-15, Method: Compositional matrix adjust. Identities = 69/245 (28%), Positives = 114/245 (46%), Gaps = 19/245 (7%) Query: 73 DNPSISIIIPCFNEEKNVEETIHAALAQRYEN--IEVIAVNDGSTDKTRAILDRMAAQIP 130 D PS+S+++ NE+ ++ +H + Y + EV ++D STDKT IL ++ Sbjct: 105 DLPSVSVLVSAKNEQAVIDRLVHNLCSLEYPHGEYEVWLIDDHSTDKTPEILAQLQQDYK 164 Query: 131 HLRVIHLAQNQ--GKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGA 188 L V N GK+ AL K E + D DA + D ++ P +VGA Sbjct: 165 QLNVFRRDANATGGKSGALNQVLPMTKGEIIAVFDADAQVSPDLLLQVI-PTFQREKVGA 223 Query: 189 VTGNPRI-RTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGY 247 V I + + Q+ E + + ++ + G + + G RR AL + G Sbjct: 224 VQVRKAIANAKENFWTRGQMAEMA-LDTWFQQQRTAIGGLGELRGNGQFVRREALNDCGG 282 Query: 248 WSDDMITEDIDISWKLQLNQWTI---FYEPRALCWILMPETLK---GLWKQRLRWAQGGA 301 W+++ IT+D+D++ +L L W I FY P ++ E + LW QR RWA+GG Sbjct: 283 WNEETITDDLDLTIRLNLTGWDIECMFYPP------VLEEGVTNVVALWHQRNRWAEGGY 336 Query: 302 EVFLK 306 + +L Sbjct: 337 QRYLD 341 >UniRef50_C6VR67 Glycosyltransferase (Putative) n=7 Tax=Lactobacillales RepID=C6VR67_LACPJ Length = 451 Score = 86.7 bits (213), Expect = 2e-15, Method: Compositional matrix adjust. Identities = 68/266 (25%), Positives = 122/266 (45%), Gaps = 17/266 (6%) Query: 57 ERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYE--NIEVIAVNDGS 114 RH P APA P ++II+P +N E + + I + Q Y+ ++++I N+ S Sbjct: 41 HRHMP-RLKAPAKW----PWVTIIVPVYNSEDTLFDCIRSIDHQTYDKKSMQIILANNQS 95 Query: 115 TDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAA 174 TD + R P+L + ++ ++GK AL + Y++ ID D LL+ +A Sbjct: 96 TDDSFGAYARAQNAFPNLFLRYVNTDKGKDRALNAAIYESIGSYVINIDSDGLLEPNAIK 155 Query: 175 YIVEPMLYNPRVGAVTGN---PRIRTRST------LVGKIQVGEYSSIIGLIKRTQRIYG 225 +V + ++ A+TG R + L+ K + EY+ + + Sbjct: 156 NMVLRFENDEQIAAMTGAVLPQRNLVKQVHGWWHRLLAKNEYYEYAQAFLSGRTIESFRD 215 Query: 226 NVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLN-QWTIFYEPRALCWILMPE 284 +FT+SG +AFRR L E + D I ED D++++++ + A+ ++ Sbjct: 216 QLFTMSGAFSAFRREVLMETFLYDTDTIGEDTDMTFQIRTRLGKKVVICADAIFYVAPIS 275 Query: 285 TLKGLWKQRLRWAQGGAEVFLKNMTR 310 L+ QR RW +G EV M++ Sbjct: 276 GFSELYTQRQRWQRGELEVAQHYMSQ 301 >UniRef50_A2SR79 Glycosyl transferase, family 2 n=1 Tax=Methanocorpusculum labreanum Z RepID=A2SR79_METLZ Length = 376 Score = 86.3 bits (212), Expect = 2e-15, Method: Compositional matrix adjust. Identities = 65/240 (27%), Positives = 114/240 (47%), Gaps = 7/240 (2%) Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRYEN--IEVIAVNDGSTDKTRAILDRMAAQIPHL 132 P+ISI+I +NEE+ + I + + Y N +EV+ V D S DKT + A + Sbjct: 45 PAISIVICAYNEERTIARKIQSISSCTYPNELMEVVLVIDCSDDKTEEVARSELANVDFS 104 Query: 133 RVIHL-AQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTG 191 IH+ Q GK +L TG +SE ++ D D + D + +V +L + R A TG Sbjct: 105 WKIHINEQRSGKNTSLNTGIGLTESELVIATDADLVWDNRSVEQLVLRILADDRFAAGTG 164 Query: 192 NPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDD 251 + + + V ++ Y S G + + + +G + R+S VG ++ Sbjct: 165 DLQPNPGADRVTSME-KTYRSYFGRMAEWESAHDATPAFNGCLLILRKSI---VGGVNET 220 Query: 252 MITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRL 311 +D +++++ + FYE RA + +PE LK + Q++R A+G + L N+ L Sbjct: 221 NGADDANLAYEAIRKGYRTFYETRAAIYEELPENLKKQYTQKVRRAKGLIQTTLANLDLL 280 >UniRef50_D2S4V1 Glycosyl transferase family 2 n=1 Tax=Geodermatophilus obscurus DSM 43160 RepID=D2S4V1_9ACTO Length = 483 Score = 86.3 bits (212), Expect = 2e-15, Method: Compositional matrix adjust. Identities = 84/294 (28%), Positives = 130/294 (44%), Gaps = 21/294 (7%) Query: 42 SIMWIVGGVYFWVYRERHWP-------WGEN----APAPQLKDNPSISIIIPCFNEEKNV 90 ++ IV GVYF RH W EN A PS+++ + C NEE V Sbjct: 67 GVLNIVFGVYFC----RHLGFAVAAARWAENDMLAADVGLHSYTPSVAVFVGCKNEELVV 122 Query: 91 EETIHAALAQRY--ENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIH--LAQNQGKAIA 146 + + A LA Y + + ++ V+DGS D T A LD AA+ P LRV+H GK+ A Sbjct: 123 DGMVTALLALDYPADRLTLVVVDDGSDDTTGARLDAWAAREPRLRVLHRPPGSGGGKSGA 182 Query: 147 LKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRT-RSTLVGKI 205 L +E V D D D A +V +P VGAV G IR R + + Sbjct: 183 LNDALHLVDAEIAVVFDADHEPDPTALRRLVR-HFRDPVVGAVMGRCVIRNGRDSQMAST 241 Query: 206 QVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQL 265 ++ S + + ++ + G A R S L +G W+ +TED D++ ++ + Sbjct: 242 VFVDFLSGYLVNEYGRQALFELPAYGGANCAVRMSTLRGLGGWNPHTVTEDTDLTLRVLM 301 Query: 266 NQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRM 319 + Y+ A+ + + + WKQR RWA+G + F L R + + Sbjct: 302 AGQRVRYDLSAVDFEEAVVSAQRFWKQRYRWARGHQKCFRDYWRPLMRSPHLSL 355 >UniRef50_A7HQK0 Glycosyl transferase family 2 n=1 Tax=Parvibaculum lavamentivorans DS-1 RepID=A7HQK0_PARL1 Length = 917 Score = 86.3 bits (212), Expect = 2e-15, Method: Compositional matrix adjust. Identities = 80/274 (29%), Positives = 120/274 (43%), Gaps = 25/274 (9%) Query: 40 FMSIMWIVGGVYFWVYRERHW-PWGENAPAPQLKDNPSISIIIPCFNEEKN-VEETIHAA 97 F+ ++W G RH+ P+ ENA +P +SI +P NE V ET+ A Sbjct: 439 FVEVIWTRHG-------SRHFKPFDENAVSPTAM----VSIHVPIHNEPPEMVRETLQAL 487 Query: 98 LAQRYENIEVIAVNDGSTDKT--RAILDRMAAQIPHLRVIHLAQNQG-KAIALKTG--AA 152 Y+N EV+ +++ + D + + D A P R HL G KA AL G Sbjct: 488 ANLDYDNYEVLVLDNNTVDPEVWQPVRDYCAQLGPRFRFFHLENWPGFKAGALNFGLEKT 547 Query: 153 AAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTR-STLVGKIQVGEYS 211 A ++E + ID D ++ +V P VG V G R R + + EY+ Sbjct: 548 AEEAEIIAVIDSDYQVEPSWLKVLV-PYFDKQDVGFVQGPQDYRDRHESAFKNMAYWEYA 606 Query: 212 SI--IGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWT 269 IG+++R N G + R+SAL VG W++ I ED ++ KL + Sbjct: 607 GFFHIGMVQRNNF---NAIIQHGTMTQVRKSALKRVGGWAEWCICEDAELGIKLYRAGYD 663 Query: 270 IFYEPRALCWILMPETLKGLWKQRLRWAQGGAEV 303 Y + L P+TL G QR RWA G ++ Sbjct: 664 SVYVNHSFGRGLTPDTLSGYITQRFRWAYGAVQI 697 >UniRef50_B7HFD6 N-acetyllactosaminide beta-1,6-N-acetylglucosaminyl-transferase n=76 Tax=Firmicutes RepID=B7HFD6_BACC4 Length = 433 Score = 86.3 bits (212), Expect = 2e-15, Method: Compositional matrix adjust. Identities = 67/246 (27%), Positives = 111/246 (45%), Gaps = 6/246 (2%) Query: 71 LKDNPSISIIIPCFNEEKNVEETIHAALAQRYE-NIEVIAVNDGSTDKTRAI---LDRMA 126 L+ PS+ I IP NE +++T+ A Y + + +ND S D+T I D+ Sbjct: 64 LEHYPSVDIFIPAHNEGIVIKDTLEAMAKIEYPGKLTIYLLNDNSQDETPEIGDDFDKAY 123 Query: 127 AQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVE-PMLYNPR 185 A I H+RV + +GK+ L G + + EY D D + A +VE Sbjct: 124 AHICHIRVPP-GEPKGKSRVLNYGLSISDGEYFCVYDADNQPEPHALRMLVEHAETTEDA 182 Query: 186 VGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEV 245 VGAV + + ++ E+ L++ + + +++G RRSAL E+ Sbjct: 183 VGAVGHVRTVNENRNWLTRMISLEFQIFQLLMQSGRWLLFQTGSLTGTNMLLRRSALEEL 242 Query: 246 GYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFL 305 G + I ED +++ ++ + + P ++ W PE LK L KQR RW QG + Sbjct: 243 GGYDPYAIAEDAELTLRITQKGYLLPIVPESVTWEQEPEHLKILIKQRTRWLQGNLYILE 302 Query: 306 KNMTRL 311 K + L Sbjct: 303 KMFSSL 308 >UniRef50_Q0AEM9 Cellulose synthase (UDP-forming) n=2 Tax=Nitrosomonas RepID=Q0AEM9_NITEC Length = 492 Score = 85.9 bits (211), Expect = 3e-15, Method: Compositional matrix adjust. Identities = 64/253 (25%), Positives = 118/253 (46%), Gaps = 19/253 (7%) Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRY--ENIEVIAVNDGSTDKT----RAILDRMAAQ 128 P +S+++P +NE + I A +Y ++E++ ++D STD T +A +D A Sbjct: 61 PRVSVLLPIYNEAAVIVRLIDAVCRLQYPTSSLEILVLDD-STDNTSTLAQARIDYHADL 119 Query: 129 IPHLRVIHLAQNQG-KAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVG 187 +R + A N+G KA L G + E+ V D D + D V P +P++G Sbjct: 120 GVSIRYVRRASNEGYKAGNLLNGIRQSSGEFYVIFDADFIPQEDFLLRTV-PYFQDPQLG 178 Query: 188 AV-TGNPRIRTRSTLVGKIQVGEYSS----IIGLIKRTQRIYGNVFTVSGVIAAFRRSAL 242 + TG ++ + + Q E +GL + G++ ++SG +RR+ + Sbjct: 179 FLQTGIGYENRDASFLTRFQAMEMGHQQYVTVGLSED-----GDMASLSGSSCVWRRACV 233 Query: 243 AEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAE 302 +G W+ +TED+D+ ++ Q W Y + ++PET QR RW +G Sbjct: 234 ESLGGWNASTVTEDVDLGYRAQFGNWKYAYMRDVVSMSVLPETTSAFRVQRKRWGRGLIH 293 Query: 303 VFLKNMTRLWRKE 315 K+ +++R+ Sbjct: 294 SAFKHARQMFRQR 306 >UniRef50_C6HYN9 Glycosyl transferase n=1 Tax=Leptospirillum ferrodiazotrophum RepID=C6HYN9_9BACT Length = 384 Score = 85.9 bits (211), Expect = 3e-15, Method: Compositional matrix adjust. Identities = 86/334 (25%), Positives = 143/334 (42%), Gaps = 35/334 (10%) Query: 33 FVFFWPFFM--SIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNV 90 F F +P + I+W++ ++ P ++ PAP P +S++IP NEEK V Sbjct: 14 FAFLYPLVLYPGIIWLLSRLF-------PEPVRKSRPAPL----PFVSLVIPAHNEEKVV 62 Query: 91 EETIHAALAQRY--ENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALK 148 I +L+ Y + EV+ +DGSTD+T AI + P R++ +GK L Sbjct: 63 AAKIENSLSLDYPADRYEVLLASDGSTDRTAAIARE--TRHPAFRLLDFPVRRGKLAVLT 120 Query: 149 TGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRST----LVGK 204 A+ E +V D A+L DA VE +P VG V G RI T G+ Sbjct: 121 DAVGEARGELVVLTDTSAILAPDALLRAVE-NFSDPGVGCVAGRYRIAREMTPELDARGE 179 Query: 205 IQVGEYSSIIGLIKRTQRI----YGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDIS 260 + G + I QRI + + G RRS V + I +D I Sbjct: 180 SERGYFEFEI-----FQRICESRFHSTLGAHGAFYMIRRSLFPRV---PEGTINDDFVIP 231 Query: 261 WKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMW 320 + + YE +A+ + + T+K +++R+R +QG + + L K+ ++ Sbjct: 232 MLILAQGFRTVYEEKAVAYEIHLATVKSEFRRRVRISQGNFQQIFLLLPALGLKDPRALF 291 Query: 321 PLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNI 354 +FF + + + L+G + + L P I Sbjct: 292 -VFFSHKVIRAFQPITLLGLLATSALLPAWPFRI 324 >UniRef50_Q2JKN6 Glycosyl transferase, group 2 family protein n=2 Tax=Synechococcus RepID=Q2JKN6_SYNJB Length = 493 Score = 85.9 bits (211), Expect = 3e-15, Method: Compositional matrix adjust. Identities = 68/252 (26%), Positives = 114/252 (45%), Gaps = 27/252 (10%) Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRY--ENIEVIAVNDGSTDKTRAILDRMAAQIPHL 132 P ++++IP NE + +H+ RY ++E+ A++D S+D T +L IPHL Sbjct: 129 PRVAVLIPAKNESAVLPRLLHSLTQLRYPTSHLELWAIDDNSSDATPEVLREAQKWIPHL 188 Query: 133 RVI--HLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVT 190 RV + GK+ AL + E ++ D DA++ D A + P+ Sbjct: 189 RVYRRQPGRGGGKSGALNEVLPLTQGEIILVCDADAVVPSDFLARTL-PLFVQ------V 241 Query: 191 GNPRIRTRSTLVGKIQVGEYSS-------IIGLIK--------RTQRI-YGNVFTVSGVI 234 G+ R R VG +QV + S +G + + QR+ + + G Sbjct: 242 GSLRSRFSRRTVGAVQVRKALSNPSVNFWTLGQVAEMASDAYFQQQRVAVRGIGELRGNG 301 Query: 235 AAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRL 294 RR L + G W++ +T+D+D+++KL L I + P + K LW QR Sbjct: 302 QLVRRDVLEKCGGWNEATLTDDLDLTFKLHLAGVDIAFLPEPAIVEEGVTSWKSLWHQRC 361 Query: 295 RWAQGGAEVFLK 306 RWA+GG + +L Sbjct: 362 RWAEGGYQRYLD 373 >UniRef50_Q6KZU9 N-acetylglucosaminyltransferase n=1 Tax=Picrophilus torridus RepID=Q6KZU9_PICTO Length = 395 Score = 85.9 bits (211), Expect = 3e-15, Method: Compositional matrix adjust. Identities = 66/245 (26%), Positives = 108/245 (44%), Gaps = 12/245 (4%) Query: 73 DN--PSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIP 130 DN P +SII+P NEE + I + L Q Y+N E+ V D S D T R+A Sbjct: 43 DNYKPLVSIIVPAKNEETVIGRCIESILGQAYDNFELFVVVDNSDDDTY----RIAKSYE 98 Query: 131 HLRVIHLAQNQG---KAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVG 187 +H+ + G KA AL + + E + D D +L+++ V M Y Sbjct: 99 RDGRVHVFERHGNLTKASALNYAYSMSHGEIIATYDADTVLEKNTLKNAVYGMRYMDADV 158 Query: 188 AVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFT-VSGVIAAFRRSALAEVG 246 N I + ++ + II + R+Y ++F V+G F+R + +G Sbjct: 159 LQGYNTYINREENIFTRLAAID--EIIVKVSMIGRMYLHLFVPVAGSNQYFKRETIRIIG 216 Query: 247 YWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLK 306 W+ + +TED++ ++ + Y P A + P T KQR+RW +G +V L Sbjct: 217 GWNGNFLTEDLESGVRMAAKRMRSAYLPSAKVYQETPATYSEYIKQRIRWLRGYHQVLLH 276 Query: 307 NMTRL 311 + L Sbjct: 277 SKKEL 281 >UniRef50_B4WM17 Glycosyl transferase, group 2 family protein n=1 Tax=Synechococcus sp. PCC 7335 RepID=B4WM17_9SYNE Length = 475 Score = 85.5 bits (210), Expect = 4e-15, Method: Compositional matrix adjust. Identities = 71/251 (28%), Positives = 119/251 (47%), Gaps = 13/251 (5%) Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRY--ENIEVIAVNDGSTDKTRAILDRMAAQIPHL 132 P +S+++ NEE+ + + + L Y + ++ ++D STD T ILD + + L Sbjct: 118 PYVSLLVAAKNEEQVIGSLVESLLHIDYPTDRYDLWIIDDYSTDATPEILDNLVKRHRQL 177 Query: 133 RVIHLAQNQ--GKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVT 190 VIH GK+ AL K + L D DA + D Y+V PM + G T Sbjct: 178 NVIHRGPGAIGGKSGALNLVWPQTKGDLLAVFDADAQVSSDLLRYVV-PMFDPEKGGKKT 236 Query: 191 GNPRIRTRSTLVGK---IQVGEYSSIIGLIKRTQRI-YGNVFTVSGVIAAFRRSALAEVG 246 G ++R K + + + + +RI G + + G RR A+A+ G Sbjct: 237 GAVQVRKAIANATKNFWTRGQKAEMALDCYMQQRRIAVGGIGELRGNGQFVRREAIAQCG 296 Query: 247 YWSDDMITEDIDISWKLQLNQWTI--FYEPRALCWILMPETLKGLWKQRLRWAQGGAEVF 304 W+++ IT+D+D++ +L L QW I + P A+ + L LW QR RWA+GG + + Sbjct: 297 GWNEETITDDLDLTIQLHLQQWDIGLLFAP-AVGEEGVTNPL-ALWHQRNRWAEGGFQRY 354 Query: 305 LKNMTRLWRKE 315 L +L + + Sbjct: 355 LDYWRQLGKNQ 365 >UniRef50_D2YGE5 Putative uncharacterized protein n=1 Tax=Vibrio mimicus VM603 RepID=D2YGE5_VIBMI Length = 333 Score = 85.5 bits (210), Expect = 4e-15, Method: Compositional matrix adjust. Identities = 37/93 (39%), Positives = 59/93 (63%) Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRV 134 P +S+IIPCFN E+ + +++ + L Q Y+NIE+I V+DGSTD+T ++ + R+ Sbjct: 7 PLVSVIIPCFNGEEYICDSVDSVLGQSYDNIEIIVVDDGSTDRTIELIKEKYHALDGFRL 66 Query: 135 IHLAQNQGKAIALKTGAAAAKSEYLVCIDGDAL 167 + LA N G A+A G + A+ EY+ +D D L Sbjct: 67 VQLASNCGAAVARNKGVSEARGEYVCFLDADDL 99 >UniRef50_Q1PUL2 Putative uncharacterized protein n=1 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1PUL2_9BACT Length = 342 Score = 85.1 bits (209), Expect = 5e-15, Method: Compositional matrix adjust. Identities = 62/239 (25%), Positives = 117/239 (48%), Gaps = 12/239 (5%) Query: 78 SIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHL 137 SI+IP +NEEK++ + + L+ Y + E+I ++D S+D T ++ A++ + ++ Sbjct: 4 SILIPAYNEEKSISTCLDSLLSVTYPDKEIIVIDDASSDHTVKEVETFASR--GVILVKR 61 Query: 138 AQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRT 197 +N G+A AL +G + A E +V D D + + E + V AV G + + Sbjct: 62 EKNGGRAAALNSGLSRATGEIVVTTDADTGVSPHWLT-LFERHFADEEVIAVGGAYKAQN 120 Query: 198 RSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDM-ITED 256 + G S + ++ T R VSGV +A RR L ++G +++ +ED Sbjct: 121 K----GNPLANATSVLDQILNGTFRKTFIPNKVSGVNSAIRRDTLLDLGGFNESAWWSED 176 Query: 257 IDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMT---RLW 312 ++ WKL + Y+P + + P+T +G+W+++ W F ++M +LW Sbjct: 177 SELGWKLD-KTGKVIYDPENVVYTQYPDTWEGIWRRKFYWGYAMGLKFRQHMPFRLKLW 234 >UniRef50_C1ZJA9 Glycosyl transferase n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZJA9_PLALI Length = 533 Score = 84.7 bits (208), Expect = 6e-15, Method: Compositional matrix adjust. Identities = 78/336 (23%), Positives = 144/336 (42%), Gaps = 26/336 (7%) Query: 58 RHWP--WGENAPAPQLKDN---PSISIIIPCFNEEKNVEETIHAALAQRYENIEV-IAVN 111 R+W W + P + D P+++I +P FNE + A Y + + + Sbjct: 29 RYWRSVWKKQKPTSRPIDREHLPAVTIQLPMFNESIIAPRILEAVSRIDYPRDRLQVQIL 88 Query: 112 DGSTDKT----RAILDRMAAQIPHLRV--IHLAQNQG-KAIALKTGAAAAKSEYLVCIDG 164 D STD + IL+ + P L + +H QG KA AL+ E++ D Sbjct: 89 DDSTDHSPEIIAGILEELRQSQPELNIEYLHRTDRQGFKAGALQAAMPLVTGEFIAIFDA 148 Query: 165 DALLDRDAAAYIVEPMLYNPRVGAVTGN-PRIRTRSTLVGKIQVGEYSSIIGLIKRTQRI 223 D + D +++ P +P V V + +++ + Q + + + Sbjct: 149 DFIPQPDFLTHLL-PYFDSPEVAVVQSRWGHLNAHDSVLTQAQQFFLDGHHSVEQNGRNR 207 Query: 224 YGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMP 283 G T +G ++RSA+ G WS D + ED+D+S++ Q + I Y + +P Sbjct: 208 AGYFITFNGTAGIWQRSAMEAAGGWSADTLVEDLDLSYRTQSLGYRIVYVEDYVTPGELP 267 Query: 284 ETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFE-YCLTTIWAFTCLVGFII 342 ++ GL Q RW +G A+V LK + ++W++ PL + + ++A ++ ++ Sbjct: 268 NSVSGLRVQLFRWFKGNAQVGLKILGKVWKQP----LPLSVKIHATAQLFAPFTMLSSLV 323 Query: 343 YAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFI 378 + +PL + H A H L LC + F+ Sbjct: 324 MLLITGALPL---ILHAAPEHAG---LVKLCYMGFV 353 >UniRef50_A8YGM3 Similarity to the N-terminal part of Q3MEW3_ANAVT Glycosyl transferase n=1 Tax=Microcystis aeruginosa PCC 7806 RepID=A8YGM3_MICAE Length = 398 Score = 84.7 bits (208), Expect = 6e-15, Method: Compositional matrix adjust. Identities = 82/317 (25%), Positives = 140/317 (44%), Gaps = 37/317 (11%) Query: 77 ISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIH 136 +S++IPC+ + + + + L Q Y+N E+I VNDGSTD T ++ A + P RV + Sbjct: 13 VSVLIPCYQHGHLLAQAVESVLRQSYQNYEIIIVNDGSTDNTHQVVQEYAQRYPQ-RVKY 71 Query: 137 LAQ-NQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNP-- 193 L Q QG+ A K G+A + E+L+ +D D LL VE + NP AV N Sbjct: 72 LYQEQQGQVAARKNGSALIRGEFLITLDADDLLAPSMMGKCVESLSKNPEAAAVASNVWM 131 Query: 194 RIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMI 253 T++G++ E S +++ +G ++G++ R +L +VG+++ D Sbjct: 132 MAEDGQTILGQLDQSEVPSWPNILEHN--TWGG---IAGIM--IRSQSLQKVGWFNFDGT 184 Query: 254 T--EDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEV---FLKNM 308 ED DI + ++ + + P L + +Q LR +G +V K Sbjct: 185 AGGEDWDIWIRFARSRQKLLHIPDFLAQ--YRQNCSSHSRQGLRVLKGILQVLDCLTKED 242 Query: 309 TRLWRKEN------------FRMWPLFFEY--CLTTIWAFTCLVGFIIYAVQLAGVPLNI 354 RL + EN R +FF Y L L + Y VP ++ Sbjct: 243 PRLIQLENVHPPIDISYYNQLRNRAVFFNYGVALAAAEGEETLQDILQYF-----VPSDL 297 Query: 355 ELTHIAATHTAGILLCT 371 +L +A + G++ + Sbjct: 298 DLETLATAYIKGLIFVS 314 >UniRef50_A9B149 Glycosyl transferase family 2 n=1 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9B149_HERA2 Length = 331 Score = 84.3 bits (207), Expect = 9e-15, Method: Compositional matrix adjust. Identities = 56/210 (26%), Positives = 109/210 (51%), Gaps = 12/210 (5%) Query: 78 SIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHL 137 SIII +N + + ++A L Q +IEV+ V++GS D + A L + P +R++ L Sbjct: 4 SIIILNWNGRALLADCLNALLPQCDASIEVLVVDNGSHDGSAAWLHQ---HYPQVRLLAL 60 Query: 138 AQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRT 197 N+G + + G A+ + L+ ++ DA+++ + + I+ P + P + A G Sbjct: 61 TNNRGFSGGVNVGLHVARGDVLLLLNNDAIVEPNFISAILAPFQHQPTLAASAGLMTFAH 120 Query: 198 RSTLVGKIQVGEY----SSIIGLIKRTQRIYGN---VFTVSGVIAAFRRSALAEVGYWSD 250 R ++ + Y ++ GL++ ++ ++ SG A+RR+ALA+VG + + Sbjct: 121 RPEIIASAGIQLYRDGVATDAGLLQPVAQLASQPSPIWGGSGGAVAYRRAALADVGMFDE 180 Query: 251 DMIT--EDIDISWKLQLNQWTIFYEPRALC 278 ED+D++W+LQL +W P+A+ Sbjct: 181 GYFAYLEDVDLAWRLQLREWQTVLAPQAVA 210 >UniRef50_A0Q541 Glycosyl transferase n=3 Tax=Francisella novicida RepID=A0Q541_FRATN Length = 438 Score = 84.0 bits (206), Expect = 1e-14, Method: Compositional matrix adjust. Identities = 94/389 (24%), Positives = 171/389 (43%), Gaps = 50/389 (12%) Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRYE-NIEVIAVNDGSTDKTRAILDR--------M 125 P +S II ++E K++ T+ + Q Y+ +IEV+ + DG+ D + LD + Sbjct: 63 PKVSCIITAYSEGKDILLTLKSLEEQLYKGHIEVLILIDGA-DVNKKTLDVAMRYKKSFV 121 Query: 126 AAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPR 185 A L+VI G A + G + E ++ +DGD +D D + +V+ + + Sbjct: 122 ANNKRSLKVIPKWHRGGHASSSNLGFNLSTGEIIIMLDGDTSVDNDMVSNMVQQFVDSNV 181 Query: 186 VGAVTGNPRIRT-RSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAE 244 VG +G R+R + L+ ++Q EY I + + ++ VSG AFR+S L++ Sbjct: 182 VGC-SGTLRVRNIKKNLLTRMQGIEYMLGIHMSRIGLASINSINNVSGAFGAFRKSFLSK 240 Query: 245 VGYWSDDMITEDIDISWKLQL-----NQWTIFYEPRALCWILMPETLKGLWKQRLRWAQG 299 W + ED+D++ +LQ I + ++ +PETL L KQRLRW Sbjct: 241 TTRWRNGS-AEDLDLTLRLQSYFKRHKHLRIVHSQDSIAHTDVPETLWQLIKQRLRW--- 296 Query: 300 GAEVFLKNMTRLWRK--ENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQ----------- 346 +++ + R +K + WP F T G +AVQ Sbjct: 297 DGDIYFIYIRRHLKKLLPRYLGWPTFL---------ITVFSGLFFHAVQPIVIVISLIYS 347 Query: 347 LAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWF-PVIF 405 + P+N+ + A + + + + FI L++ R + +L +++++I F P Sbjct: 348 IFVFPINVIIPIYIAVYLYYLAISIFLFITFI--LLVSERKKQDL--AMWYVIPFMPFYS 403 Query: 406 WMLSLATTLVSFTRVMLMPKKQRARWVSP 434 L T+ F+ + K R ++P Sbjct: 404 VFLKFVATVAIFSEFFI--KTHRDSSMAP 430 >UniRef50_D1BIJ9 Glycosyl transferase n=2 Tax=Micrococcineae RepID=D1BIJ9_SANKS Length = 497 Score = 84.0 bits (206), Expect = 1e-14, Method: Compositional matrix adjust. Identities = 71/255 (27%), Positives = 119/255 (46%), Gaps = 18/255 (7%) Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRY--ENIEVIAVNDGSTDKTRAILDRMAAQIPHL 132 P ++++IP +NE V ++ + Y + + V V+D STD T ++ + Q P Sbjct: 54 PRVAVVIPAWNEGAVVGNSVDMLMRLDYPADALRVYVVDDASTDDTPDVVLAKSRQYPG- 112 Query: 133 RVIHLAQ---NQGKAIALKTGAAAAKS----EYLVCIDGDALLDRDAAAYIVEPMLYNPR 185 +V HL + +GKA L G A E L+ +D D + D+ + L + + Sbjct: 113 QVFHLRREKGGEGKAHTLNHGLAVILEDDWMEALLIMDADVVYLPDSLRKMTR-HLADEQ 171 Query: 186 VGAVTGNPRIRT-RSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAE 244 VGAVT R + + + EY+ +R Q + G V ++G R+ L Sbjct: 172 VGAVTAYIREGSAEKNYLTRFIALEYALSQAASRRAQNVMGAVACLAGGAQLHSRANLEA 231 Query: 245 VGYWSD-DMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEV 303 +G D + ED ++K QL + +EP A+ P+++ LWKQRLRW +G + Sbjct: 232 LGGRIDTSSLAEDTVTTFKTQLAGRKVVFEPHAVVLAEEPQSVAALWKQRLRWGRGNVHI 291 Query: 304 FLKNMTRLWRKENFR 318 TR++R+ FR Sbjct: 292 -----TRMYRRLWFR 301 >UniRef50_A5GQ06 Glycosyltransferase of family GT2; possible processive Glycosyltransferase n=1 Tax=Synechococcus sp. RCC307 RepID=A5GQ06_SYNR3 Length = 453 Score = 83.6 bits (205), Expect = 1e-14, Method: Compositional matrix adjust. Identities = 74/289 (25%), Positives = 139/289 (48%), Gaps = 34/289 (11%) Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRY--ENIEVIAVNDGSTDKTRAILDRMAAQIPHL 132 P+I +++ +EE+ + + A Y + +++ ++DGS D+T IL++++A L Sbjct: 93 PAIDVLVAARDEEQVIARLVQAIAGLEYPADRLKLWVMDDGSNDRTPEILNQLSATHSFL 152 Query: 133 RVIHLAQNQG--KAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVT 190 +V +N G K+ AL + ++L+ +D DA L D + +P L++ +GAV Sbjct: 153 QVRRRPRNAGGGKSGALNALLPQLQGDWLLVLDADAALTPDVLLRL-QPQLHDSALGAVQ 211 Query: 191 -GNPRIRTRSTLVGKIQVGE--YSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGY 247 + S + + Q E + + + L + Q G + + G RR L + G Sbjct: 212 LRKAVVGAESNWLTRSQALEMAFDAEMQLGRIAQ---GGIGELRGNGQLIRRQVLLDCGG 268 Query: 248 WSDDMITEDIDISWKLQL--NQWTIFYEPRALCWILMPETL---KGLWKQRLRWAQGGAE 302 +++D +T+D+D+S++L L Q ++ + P ++ E + LW+QR RWA+GG + Sbjct: 269 FNEDTVTDDLDLSFRLLLAGQQVSVLWNP-----VVDEEPVMRWGALWRQRQRWAEGGLQ 323 Query: 303 VFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFII-YAVQLAGV 350 FL WP F L+ +V F++ Y + LA V Sbjct: 324 RFLD------------YWPALFSDRLSGRQQLDLIVFFLLQYGLPLATV 360 >UniRef50_C9NBS6 Glycosyl transferase family 2 n=1 Tax=Streptomyces flavogriseus ATCC 33331 RepID=C9NBS6_9ACTO Length = 492 Score = 83.2 bits (204), Expect = 2e-14, Method: Compositional matrix adjust. Identities = 67/280 (23%), Positives = 122/280 (43%), Gaps = 35/280 (12%) Query: 69 PQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQ 128 P L +P ++ ++ +E +E + + A + ++ V+D S D TR +L R+ Sbjct: 113 PPLPAHPMVAFLLAVKDEVDGIEACVRSMAASDCPGVRIVVVDDHSQDGTRDVLRRLEDD 172 Query: 129 IPHLRVIHLAQNQGKAIAL-KTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVG 187 + + V++L N GK AL + AA +E + D D +L DA V ++ +P +G Sbjct: 173 L-GITVVYLDANVGKKHALVRACEFAADTEIIAFTDSDCVLAPDALRRCVRALVTHPELG 231 Query: 188 AVTGNPR-IRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAE-V 245 AV+G+ R + T S + + Q Y + K + +G+V VSG +A FRR A+ + Sbjct: 232 AVSGHCRALNTDSGFLARAQDVWYEGQFRVSKAAEASFGSVSCVSGPLAVFRRDAIFNYL 291 Query: 246 GYWSDD--------------MITEDIDISWKLQL-----------------NQWTIFYEP 274 W+DD + + +W+ + +W + Y Sbjct: 292 PAWADDRFMGAPFRFATDRQLTGYVLGQAWRGKALKRRYAGSPFTAEDFAERRWRVGYVR 351 Query: 275 RALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRK 314 A W +P L L +Q++RW + + +WR+ Sbjct: 352 SAKVWTDVPARLGPLLRQQVRWKKSFVRNLFFTGSFMWRR 391 >UniRef50_P74165 Sll1377 protein n=9 Tax=Cyanobacteria RepID=P74165_SYNY3 Length = 479 Score = 83.2 bits (204), Expect = 2e-14, Method: Compositional matrix adjust. Identities = 64/236 (27%), Positives = 110/236 (46%), Gaps = 5/236 (2%) Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRY--ENIEVIAVNDGSTDKTRAILDRMAAQIPHL 132 P + +++ NEE + + + + Y + EV V+D STD+T AILD++ Q P L Sbjct: 108 PQVCLMVAAKNEEAVIGKIVQQLCSLDYPGDRHEVWIVDDNSTDRTPAILDQLRQQYPQL 167 Query: 133 RVIH--LAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVT 190 +V+ + GK+ AL A + + + D DA + +D +V P +P GA+ Sbjct: 168 KVVRRGAGASGGKSGALNEVLAQTQGDIVGVFDADANVPKDLLRRVV-PYFASPTFGALQ 226 Query: 191 GNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSD 250 I + G ++ ++ + + G + + G R AL VG W++ Sbjct: 227 VRKAIANEAVNFWTRGQGAEMALDAYFQQQRIVTGGIGELRGNGQFVARQALDAVGGWNE 286 Query: 251 DMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLK 306 IT+D+D++ +L L+QW + T LW QR RWA+GG + +L Sbjct: 287 QTITDDLDLTIRLHLHQWKVGILVNPPVEEEGVTTAIALWHQRNRWAEGGYQRYLD 342 >UniRef50_A9FCX3 Glycosyltransferase n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9FCX3_SORC5 Length = 358 Score = 83.2 bits (204), Expect = 2e-14, Method: Compositional matrix adjust. Identities = 69/241 (28%), Positives = 115/241 (47%), Gaps = 11/241 (4%) Query: 71 LKDNPSISIIIPCFNEEKNVEETIHAALAQRY--ENIEVIAVNDGSTDKTRAILDRMAAQ 128 L + P SI+IPC+NEE+ +E+ + AL QRY E +E+ V+ GSTD+TR ++ R+AA+ Sbjct: 18 LAERPRCSIVIPCYNEERYIEDVLRCALEQRYPPELLEIFVVDGGSTDRTRELVGRVAAE 77 Query: 129 IPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGA 188 P + ++H ++ +A A+ G A+ E +V +D A D A V + R GA Sbjct: 78 DPRVTLLH-NPDRIQAAAMNIGIRRARGEVIVRMDAHADYAADYVAASVAALR---RTGA 133 Query: 189 VT--GNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVI-AAFRRSALAEV 245 + G R R R+ + SS +G+ R V V AFRR A Sbjct: 134 LNAGGAARPRHRNGFQ-RALCAALSSPLGVGGSAYRDPSREGFVESVFNGAFRREAFEIA 192 Query: 246 GYWSDDMIT-EDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVF 304 G + IT ED +++ ++ ++ + + + L++Q + +G A Sbjct: 193 GLYDPAAITNEDAELNQRIIEAGGAVYLSREIVAYYYPRSSFAALFRQYFAYGKGRARTL 252 Query: 305 L 305 L Sbjct: 253 L 253 >UniRef50_Q6ACB6 Glucosaminyltransferase n=1 Tax=Leifsonia xyli subsp. xyli RepID=Q6ACB6_LEIXX Length = 421 Score = 83.2 bits (204), Expect = 2e-14, Method: Compositional matrix adjust. Identities = 100/392 (25%), Positives = 165/392 (42%), Gaps = 30/392 (7%) Query: 46 IVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRY--E 103 + F+V W + AP L P ++I++P +NE + +I + Y E Sbjct: 27 VAAAATFFVIPLHAWINHYHKAAPYL---PRVAIVVPAWNEGAVIGASIDRLVTLDYPKE 83 Query: 104 NIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQ--NQGKAIALKTGA----AAAKSE 157 + + V+D STD T ++ A + P + + QGKA L G A E Sbjct: 84 ALRIYVVDDASTDDTSVVVRERAMRYPGNVFLFRREKGGQGKAHTLNHGIERIFADDWME 143 Query: 158 YLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTG--NPRIRTRSTLVGKIQVGEYSSIIG 215 L+ +D D + D+ + L +P+VGAV+ + R+ L K EY Sbjct: 144 ALLIMDADVIYQPDSLRKMTR-HLADPKVGAVSAYIHEGSADRNYLT-KFVSTEYVLSQP 201 Query: 216 LIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSD-DMITEDIDISWKLQLNQWTIFYEP 274 +R Q + G ++G R L +G D + ED +++ QL + +EP Sbjct: 202 TARRAQNVLGAQACLAGGAQLHSRENLIAIGGQVDTSTLAEDTITTFETQLRGKRVVFEP 261 Query: 275 RALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLW----RKENFRMWPLFFEYCLTT 330 A P T+ LWKQRLRWA+G V + +++W RK + W F + Sbjct: 262 HAHVLAEEPRTIDALWKQRLRWARGNVSVTAR-YSKVWFRPSRKHHLGAWT--FGIVWFS 318 Query: 331 IWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHN 390 +W ++ +I ++ LAG+ + L + AT IL + L V ++ Sbjct: 319 LWLLPLIM--VISSIGLAGL---LFLHNGFATTVFRILWISAALAYLFVLILGSQTDVGT 373 Query: 391 LTSSLFWIIWFPVIFWMLSLATTLVSFTRVML 422 + ++ FP I ML + T L F +ML Sbjct: 374 TRKEVGHVMLFPGIVNMLVMLTAL--FPTMML 403 >UniRef50_D2QHQ3 Glycosyl transferase family 2 n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QHQ3_9SPHI Length = 403 Score = 83.2 bits (204), Expect = 2e-14, Method: Compositional matrix adjust. Identities = 70/271 (25%), Positives = 126/271 (46%), Gaps = 15/271 (5%) Query: 63 GENAPAPQLKD-NPSISIIIPCFNEEKNVEETIHAALAQRY--ENIEVIAVNDGSTDKTR 119 G+ P P P + +++P +NE ++E ++ +LA Y + + ++ V +GSTD T Sbjct: 38 GKKRPVPNDSSFVPDVMLVVPAYNERDCIDEKVNNSLALDYPSDKLHILFVTEGSTDGTT 97 Query: 120 AILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEP 179 ++++ AA P + V+ Q GK A+ K+ +++ D + L+ + +I+ Sbjct: 98 ELVEQYAATNPRVHVMGGRQRLGKIEAVNQAMHGIKAPFVIFTDANTTLNSASIKHIIR- 156 Query: 180 MLYNPRVGAVTGNPRIRTRST--LVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAF 237 +P VGAV G RI+ S+ VG + G Y +KR +V +G + A Sbjct: 157 HFADPNVGAVAGEKRIQLLSSEAAVGSGE-GLYWKYESFLKRLDTELYSVVGAAGELFAI 215 Query: 238 RRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWA 297 R +V D + +D IS ++ + + YEP A ++ K+++R A Sbjct: 216 RTELYEQV---EKDTLLDDFVISLRIAERGYRVVYEPDAYALERPSFSVGDEKKRKVRIA 272 Query: 298 QGGAEVFLKNMTRLWRKEN-FRMWPLFFEYC 327 GG + ++RL N FR L F+Y Sbjct: 273 AGG----FQAISRLQPLLNIFRYGWLSFQYT 299 >UniRef50_Q2S964 Glycosyltransferase, probably involved in cell wall biogenesis n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2S964_HAHCH Length = 269 Score = 83.2 bits (204), Expect = 2e-14, Method: Compositional matrix adjust. Identities = 39/109 (35%), Positives = 65/109 (59%), Gaps = 3/109 (2%) Query: 64 ENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALA---QRYENIEVIAVNDGSTDKTRA 120 E + + Q K S+S++ P +NEE N+E T+ A+ + +E +E+I VNDGS+D TR Sbjct: 21 EASASAQNKKLSSVSVVFPAYNEETNIEMTVLKAIGAFKKHFETVEIIVVNDGSSDGTRD 80 Query: 121 ILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLD 169 IL+R+ + +R IH QN+G A++TG + + +++ D D D Sbjct: 81 ILERLRQEHEEVRPIHHVQNKGYGGAVRTGLKSGRGDFIFFSDSDGQFD 129 >UniRef50_A9FZQ2 Glycosyltransferase n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9FZQ2_SORC5 Length = 521 Score = 83.2 bits (204), Expect = 2e-14, Method: Compositional matrix adjust. Identities = 69/244 (28%), Positives = 108/244 (44%), Gaps = 8/244 (3%) Query: 72 KDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEV-IAVNDGSTDKT----RAILDRMA 126 +D P ++I +P FNE + A Y ++ I V D STD+T RA ++R+ Sbjct: 50 RDLPPVTIQLPLFNESTVAARLLDAVAKMDYPRDKLEIQVLDDSTDETQGLVRAHVERLR 109 Query: 127 AQIPHLRVIHLAQNQG-KAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPR 185 A +H G KA AL G AK E + D D + D IV +P Sbjct: 110 ALGLDAVYLHRVDRVGYKAGALDAGLKIAKGELVAIFDADFIPQPDFVRSIV-GHFEDPT 168 Query: 186 VGAVTGN-PRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAE 244 VG V + +++ ++Q + R + G +F SG +R+ A+ E Sbjct: 169 VGMVQTRWGHLNRDVSILTQVQALMLDGHHLVENRARFGAGLLFNFSGTGGMWRKDAIRE 228 Query: 245 VGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVF 304 G W D +TED+D+S++ QL + Y + +PE + L Q+ RWA+G + Sbjct: 229 AGGWQHDTLTEDLDLSYRAQLAGYRFVYREDVVSPAELPEDISALRAQQYRWAKGTVQTA 288 Query: 305 LKNM 308 K M Sbjct: 289 RKLM 292 >UniRef50_A4SFN6 Glycosyl transferase, family 2 n=1 Tax=Chlorobium phaeovibrioides DSM 265 RepID=A4SFN6_PROVI Length = 474 Score = 82.8 bits (203), Expect = 2e-14, Method: Compositional matrix adjust. Identities = 78/293 (26%), Positives = 121/293 (41%), Gaps = 59/293 (20%) Query: 58 RHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDK 117 R +GE + + +S+II FNEEK + T+ L RY E++ VNDGS D Sbjct: 44 REICFGEFSIISRSAQTFPVSVIISAFNEEKMIVATVENVLQLRYPQFEIVVVNDGSVDG 103 Query: 118 TRAILD-----RMAAQI-------------------PHLRVIHLAQNQGKAIALKTGAAA 153 T A+L R+ ++ P L V+ +N +A A G Sbjct: 104 TMAVLTERFGLRLVHEVFREHILTKPVRGLYRSPEYPQLLVVD-KENGRRADANNAGVNH 162 Query: 154 AKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTG-----NPRIRTRSTLVGK---- 204 A+ + ID D +L+ D+ Y+V P L N V A TG N + ++ +V + Sbjct: 163 ARYPLICQIDADCILEEDSLLYMVRPFLGNSDVIAATGIIRPSNGLVVSKGRIVSRGLPD 222 Query: 205 -----IQVGEYSSII----GLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITE 255 Q EY G + R Q ++ +SG +R VG + + + Sbjct: 223 RWLPLFQWLEYLRAFQWARGGLSRMQ----SMLCMSGAYTFIKREVFLAVGGANPHAVVD 278 Query: 256 DIDISWKLQLNQW----------TIFYEPRALCWILMPETLKGLWKQRLRWAQ 298 D +++ L LN++ I Y P C+ +PETLK L QR W + Sbjct: 279 DFELT--LTLNRYIHTHKKERRMKIAYVPDPGCYSEVPETLKALASQRNFWQR 329 >UniRef50_D1X4H0 Glycosyl transferase family 2 n=2 Tax=Streptomyces RepID=D1X4H0_9ACTO Length = 343 Score = 82.8 bits (203), Expect = 3e-14, Method: Compositional matrix adjust. Identities = 62/195 (31%), Positives = 99/195 (50%), Gaps = 14/195 (7%) Query: 72 KDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPH 131 + P +S++IP +N + +E + AALAQ YE +EV+ ++D STD + A+ A+ Sbjct: 3 RKQPLVSVVIPNYNYGRALELCLRAALAQTYEPLEVLLIDDCSTDDSVAV-----AEACG 57 Query: 132 LRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTG 191 +RVI N G A TGAA A+ E +V +D D ++ DA A V + +PR+GAV G Sbjct: 58 VRVISTGVNSGVATTRNTGAAHARGEIIVFVDSDVAMEPDAVANAVALLDSDPRIGAVCG 117 Query: 192 NPRIRTRSTLVGKIQVGEYSSIIGL--IKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWS 249 L+ V EY + + T+ G + T+ I A AE+G ++ Sbjct: 118 T---YDAEPLIRDSLVEEYRCLHQFYWLAETE---GRIGTLHTAICAMPARVFAEIGPFN 171 Query: 250 DDM-ITEDIDISWKL 263 + TED D + ++ Sbjct: 172 PRLRHTEDGDYAARI 186 >UniRef50_D1RA94 Putative uncharacterized protein n=1 Tax=Parachlamydia acanthamoebae str. Hall's coccus RepID=D1RA94_9CHLA Length = 620 Score = 82.4 bits (202), Expect = 3e-14, Method: Compositional matrix adjust. Identities = 62/243 (25%), Positives = 111/243 (45%), Gaps = 9/243 (3%) Query: 72 KDNPSISIIIPCFNEEKN-VEETIHAALAQRYENIEVIAVNDGSTDKT-----RAILDRM 125 K++P +SI IPCFNE V ET++A Y + EVI +++ + D T A ++ Sbjct: 178 KNSPFVSIHIPCFNEPPELVIETLNAISRFNYPHFEVIVLDNNTKDPTVWAPVEAHCLQL 237 Query: 126 AAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPR 185 + + LA + A+ A+++E + D D + D + +V +P+ Sbjct: 238 GERFRFYHIDKLAGAKAGALNACLKCTASQAELIAVFDADYVAKEDFLSRLVG-FFDDPK 296 Query: 186 VGAVTGNPRIRT-RSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAE 244 +G V R + EY + L Q + +T+ G + RR+AL E Sbjct: 297 IGFVQSCQDYRDWDHSHYQAACYYEYETHFKLELPGQNEWDVTYTI-GTMCLIRRTALDE 355 Query: 245 VGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVF 304 VG W++ +TED +++ ++ + +Y + L+PET + QR RW+ G + Sbjct: 356 VGGWAEWCLTEDSEVAVRIHALGFAGYYLKETFGYGLIPETFESYKLQRFRWSAGPVQQI 415 Query: 305 LKN 307 K+ Sbjct: 416 QKH 418 >UniRef50_B7HZB4 Beta-1,3-N-acetylglucosaminyltransferase n=37 Tax=Bacillus cereus group RepID=B7HZB4_BACC7 Length = 326 Score = 82.4 bits (202), Expect = 3e-14, Method: Compositional matrix adjust. Identities = 38/98 (38%), Positives = 64/98 (65%), Gaps = 2/98 (2%) Query: 71 LKDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIP 130 +K N +S++IP +N EK +EET+ + L Q Y+NIE++ V+DGS D++ +I+ +A + P Sbjct: 2 MKQNQLVSVVIPLYNAEKYIEETMQSILDQTYKNIEIVIVDDGSKDQSPSIVKNLAEKYP 61 Query: 131 -HLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDAL 167 ++ +H +NQG ++A TG A EY+ +D D L Sbjct: 62 GQVKYVH-QKNQGVSVARNTGIENASGEYIAFLDSDDL 98 >UniRef50_A6VAQ9 MigA n=16 Tax=Pseudomonas RepID=A6VAQ9_PSEA7 Length = 300 Score = 82.4 bits (202), Expect = 3e-14, Method: Compositional matrix adjust. Identities = 56/226 (24%), Positives = 106/226 (46%), Gaps = 19/226 (8%) Query: 73 DNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHL 132 D P +S++ PCFN EK +EE + + AQ Y N EVI V+DGSTD + A+L+++ A Sbjct: 15 DTPLVSVVAPCFNAEKYLEEALRSIYAQDYPNFEVIIVDDGSTDASYAMLEQLQASYGF- 73 Query: 133 RVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTG- 191 ++ +NQG + AL G A+ +Y+ D D ++ + + E + +P VG V Sbjct: 74 -QLYRQRNQGVSAALNFGLQHARGDYVATPDLDDIMLPHSLSVRAEYLDQHPEVGCVGAL 132 Query: 192 -----NPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVG 246 + T+ +I+ ++ ++ GN + ++ +R AL G Sbjct: 133 VIYIDSEGQETKRQSSNRIRQLDFDYLL----------GNAYVCGAPVSLYRMEALRAAG 182 Query: 247 YWSDDMITEDIDISWKLQLNQWTIFYEPRALC-WILMPETLKGLWK 291 ++ + +D ++ ++ + I P + + P+ L +K Sbjct: 183 FYDPQIKVQDFQMTLRIASQGYQIHKLPVLVTRYRRHPDNLSRRYK 228 >UniRef50_A5G0G3 Glycosyl transferase, family 2 n=1 Tax=Acidiphilium cryptum JF-5 RepID=A5G0G3_ACICJ Length = 903 Score = 82.4 bits (202), Expect = 3e-14, Method: Compositional matrix adjust. Identities = 74/263 (28%), Positives = 119/263 (45%), Gaps = 20/263 (7%) Query: 52 FWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKN-VEETIHAALAQRYENIEVIAV 110 F R RH+ E PA P +S+ + NE V++T++A A YEN EV+ + Sbjct: 422 FGRVRMRHF---EPVPAAPGTKLPKVSLHLAICNEPPEMVKQTLNALAALDYENFEVLVI 478 Query: 111 NDGSTDKTRAILDRMAAQIPHL----RVIHLAQNQG-KAIALKTG--AAAAKSEYLVCID 163 ++ + D A+ + +AA L R L ++ G KA AL A +E + +D Sbjct: 479 DNNTKDP--AVWEPVAAHCARLGKQFRFFTLGKHPGYKAGALNFALRETAPDAEIVGVLD 536 Query: 164 GDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRS-TLVGKIQVGEYSSI--IGLIKRT 220 D ++D D +V P +P VG R +L ++ EY+ IG++ R Sbjct: 537 SDYIVDPDWLRCMV-PAFADPNVGFTQSPQDYRDNDGSLFKRMMFWEYAGFFHIGMVNRN 595 Query: 221 QRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWI 280 +R N G + R++AL G W++ ITED ++ +L + Y R+ Sbjct: 596 ER---NAVIQHGTMTLIRKAALDAEGGWAEWCITEDSELGLRLFREGYEAVYSKRSFGRG 652 Query: 281 LMPETLKGLWKQRLRWAQGGAEV 303 +MP+ KQR RWA G + Sbjct: 653 VMPDDFNAFRKQRYRWAYGAMRI 675 >UniRef50_B8HDP3 Glycosyl transferase family 2 n=1 Tax=Arthrobacter chlorophenolicus A6 RepID=B8HDP3_ARTCA Length = 445 Score = 82.4 bits (202), Expect = 3e-14, Method: Compositional matrix adjust. Identities = 63/241 (26%), Positives = 116/241 (48%), Gaps = 18/241 (7%) Query: 77 ISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIP--HLRV 134 +SI+I NEE + T+ + Y + EV +D S+D T ++ + P L + Sbjct: 73 VSIVIAAHNEEAVIVRTLESIGTSTYRSFEVFVADDASSDLTGRLVRDFQLRHPTMDLHL 132 Query: 135 IHLAQNQGKAIALKTG-AAAAKSEYLVCIDGDALLDRDAAAYIVEPMLY--NPRVGAVTG 191 + + +N GK AL + A+ ++ + +D D+++ + I + Y +P V V Sbjct: 133 VRMRRNIGKGAALNSVLRRHARGQFAMTLDADSIIGPET---ITNALSYFDDPCVAGVAA 189 Query: 192 NPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNV---FTVSGVIAAFRRSALAEVGYW 248 N +I +T++G +Q + +IG R+++ Y + F V GV + +R L +V ++ Sbjct: 190 NVQILEETTVLGILQ--RFEHMIGY--RSKKFYSLLNAEFVVGGVASTYRMRVLRKVDFY 245 Query: 249 SDDMITEDIDISWK---LQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFL 305 D +TEDI +S K L ++ + Y + +L+ L KQR RW G + + Sbjct: 246 DTDTVTEDIGLSAKITNLGNRRFRMLYGADVVAKTEGVLSLRALAKQRYRWKYGSIQNLI 305 Query: 306 K 306 K Sbjct: 306 K 306 >UniRef50_A6A3G7 Chitin synthase n=2 Tax=Vibrio cholerae RepID=A6A3G7_VIBCH Length = 390 Score = 82.4 bits (202), Expect = 3e-14, Method: Compositional matrix adjust. Identities = 69/252 (27%), Positives = 118/252 (46%), Gaps = 11/252 (4%) Query: 62 WGENAPAPQLKDNPSISIIIPCFNEEKN-VEETIHAALAQRYEN-IEVIAVNDGSTDKTR 119 W PA L + PS+++ +P +NEE + E + Q Y + ++VI +NDGST+KT Sbjct: 27 WFNWKPAKVLNNLPSVTVFVPFYNEEPELIIEALTKIDQQSYSSKLQVIIINDGSTNKTP 86 Query: 120 AILDRMAAQIPHLRVI-----HLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAA 174 I+ + I H + +G A+ A + + +D D ++ + Sbjct: 87 LIVSQWINSFRKQNYILLNRPHNSGRKGAALDYALDKNIATGDIYIVVDSDTFIEPNGIL 146 Query: 175 YIVEPMLYNPRVGAVTGN--PRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSG 232 + + +P AV G P+ R+ + L + Q E+ + G I+ +Q G V ++G Sbjct: 147 ALANKIWSHPEYAAVCGYIVPKNRSNNILC-QSQFYEHLGVHGAIRTSQDHLGIVPVLAG 205 Query: 233 VIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQ 292 A R S + E+G WS+ ++ EDI W+ N++ Y AL P L++Q Sbjct: 206 AFVAHRASVVHEIGGWSEWLV-EDISWCWRALANRYRTGYTAEALATTHCPNNRDALFRQ 264 Query: 293 RLRWAQGGAEVF 304 R RW++G E F Sbjct: 265 RRRWSRGRVEAF 276 >UniRef50_D0MHS9 Glycosyl transferase family 2 n=1 Tax=Rhodothermus marinus DSM 4252 RepID=D0MHS9_RHOM4 Length = 330 Score = 82.0 bits (201), Expect = 4e-14, Method: Compositional matrix adjust. Identities = 44/117 (37%), Positives = 70/117 (59%), Gaps = 3/117 (2%) Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRV 134 P ++I+IP +N E+ + + + + LAQ + + E + VNDGSTD++ I+ R + P +R+ Sbjct: 2 PRVTIVIPLYNTERFIADALESVLAQTFTDWECVVVNDGSTDRSAEIVARYVERDPRIRL 61 Query: 135 IHLAQNQGKAIALKTGAAAA--KSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAV 189 +H NQG A TGAA A SEYL +D D +L+ +A +V + +P VG V Sbjct: 62 VH-QPNQGVCAARNTGAAHASPDSEYLFFLDADDMLEPEALERLVAYLDAHPEVGLV 117 >UniRef50_A5D278 Glycosyltransferases n=1 Tax=Pelotomaculum thermopropionicum SI RepID=A5D278_PELTS Length = 230 Score = 82.0 bits (201), Expect = 4e-14, Method: Compositional matrix adjust. Identities = 42/94 (44%), Positives = 56/94 (59%), Gaps = 3/94 (3%) Query: 76 SISIIIPCFNEEKNVEETIHAALA---QRYENIEVIAVNDGSTDKTRAILDRMAAQIPHL 132 SI+I PC+NEE+NVE ALA Q ++ E+I VNDGS D+T I D +A + P + Sbjct: 5 SITIFFPCYNEEQNVERVAREALAVARQISDDFEIIIVNDGSRDRTGEIADGLAKEFPEV 64 Query: 133 RVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDA 166 RVIH N+G AL++G A E + DGD Sbjct: 65 RVIHHEVNRGYGAALQSGFKNATKELVFYTDGDG 98 >UniRef50_C5SBP2 Glycosyl transferase family 2 n=1 Tax=Allochromatium vinosum DSM 180 RepID=C5SBP2_CHRVI Length = 481 Score = 82.0 bits (201), Expect = 4e-14, Method: Compositional matrix adjust. Identities = 67/264 (25%), Positives = 115/264 (43%), Gaps = 25/264 (9%) Query: 73 DNPSISIIIPCFNEEKNVEETIHAALAQRYENIEV-IAVNDGSTDKTRAILDRMAA---- 127 ++P + + +P FNE VE + A + + + I V D S D + AI R A Sbjct: 50 EHPRVLVQLPLFNEGDLVERILEAVMDLDWPRDRLEIQVLDDSIDGSLAISQRAVAVLKQ 109 Query: 128 QIPHLRVIHLAQNQG-KAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRV 186 Q ++ ++H Q KA AL G +++ ++ D D +I P V Sbjct: 110 QGVNIELLHRVQRTAFKAGALAAGLERSEAPFVAIFDAD---------FIPPPDFLQRTV 160 Query: 187 GAVTGNPR---IRTR-------STLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAA 236 GA+ NP ++TR +L+ +IQ S G+ + + G +G Sbjct: 161 GALVANPDLAYVQTRWGHLNRDESLLTRIQARLLDSHFGVEQEARWRLGLPLPFNGTCGL 220 Query: 237 FRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRW 296 +RR+A+ E G W D +TED+D+S + L W + + +P + + Q+ RW Sbjct: 221 WRRAAIDEAGGWDGDTLTEDLDLSLRANLAGWRSGFMGDLVVPGSLPVSARAWRVQQFRW 280 Query: 297 AQGGAEVFLKNMTRLWRKENFRMW 320 +G + F+K +WR +W Sbjct: 281 TKGFVQCFIKLTPLVWRSRRLPIW 304 >UniRef50_C6D3F1 Glycosyl transferase family 2 n=10 Tax=Bacteria RepID=C6D3F1_PAESJ Length = 235 Score = 82.0 bits (201), Expect = 4e-14, Method: Compositional matrix adjust. Identities = 48/134 (35%), Positives = 75/134 (55%), Gaps = 9/134 (6%) Query: 76 SISIIIPCFNEEKNVE---ETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHL 132 +S++ P +NEE+N++ E I A+ + E+ E++ VNDGS D++ +LD +A Q + Sbjct: 3 KLSVVAPIYNEEENIKLLHEAITNAVKDKVESYEIVLVNDGSKDRSAMLLDELAEQDSRV 62 Query: 133 RVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLD-RDAAAYIVEPMLYNPRVGAVTG 191 RVIH +N G+ A+ G + E + ID D D RD I + M Y P + V G Sbjct: 63 RVIHFEKNCGQTAAIYAGLKGSAGELVALIDADLQTDPRD----IFKLMPYIPNMDFVNG 118 Query: 192 NPRIRTRSTLVGKI 205 R+R R T+V K+ Sbjct: 119 -KRVRRRDTVVKKV 131 >UniRef50_C4RQS0 Glycosyl transferase n=1 Tax=Micromonospora sp. ATCC 39149 RepID=C4RQS0_9ACTO Length = 351 Score = 81.6 bits (200), Expect = 5e-14, Method: Compositional matrix adjust. Identities = 51/127 (40%), Positives = 69/127 (54%), Gaps = 9/127 (7%) Query: 74 NPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLR 133 NP +S+IIP +N ++ I AAL Q Y+ IEVI V+D STD + A + A + Sbjct: 7 NPLVSVIIPNYNYGNSIGSCIDAALGQTYQPIEVIVVDDHSTDDSLAKARQRA-----VT 61 Query: 134 VIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTG-- 191 V+ +N G A+A TGAAAA+ E L +D D L DA V + +P GAV G Sbjct: 62 VLQTPRNSGVAVARNTGAAAARGEILFFVDSDVALRPDAVQRAVAELAAHPEYGAVCGIY 121 Query: 192 --NPRIR 196 +P IR Sbjct: 122 EEDPLIR 128 >UniRef50_D1SM82 Glycosyl transferase family 2 n=1 Tax=Methanocaldococcus sp. FS406-22 RepID=D1SM82_9EURY Length = 307 Score = 81.6 bits (200), Expect = 6e-14, Method: Compositional matrix adjust. Identities = 74/314 (23%), Positives = 130/314 (41%), Gaps = 29/314 (9%) Query: 71 LKDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIP 130 + NP +SI+IP N +K+VE I++ L Y+NIE+I V+D STD T L +P Sbjct: 1 MDKNPLVSIVIPTHNRKKHVERLINSILENTYKNIEIIVVDDASTDGTYEYLKEKFGNLP 60 Query: 131 HLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAV- 189 + +++ +N + + G +K E + +D D +LD+ +V+ ++ + ++G V Sbjct: 61 NFKIVRNNKNLLVSGSRNRGIGLSKGELIFLVDDDNILDKKCIENLVKLLILDDKIGVVG 120 Query: 190 ------TGNPRIRTRSTLVGKIQVGEY--SSIIGLIKRT---QRIYGNVFTVSGVIAAFR 238 RI + T I Y + L R + N F V + Sbjct: 121 PIMYYWKDKKRIWSAGTKRNMITSRTYFIGRELPLPNRKTWETDDFPNAFMV-------K 173 Query: 239 RSALAEVGYWSDDMIT---EDIDISWKLQLNQWTIFYEPRALCW--ILMPETLKGLWKQR 293 R A+ +VG + D E+ D +++ + I +A+ W I +PE +K K R Sbjct: 174 REAIEKVGVFDDKTFPIHYEESDFGERIRKAGYKIVCSSKAIIWHDIPLPEDVKD--KSR 231 Query: 294 LRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLN 353 L + R+ + + W F + L W + G+ + + Sbjct: 232 LLHCHNEFRAYYCGRNRIMYHKKYSKWWQFLVFILIFNW---LVAGYYLKVIIFESRKPF 288 Query: 354 IELTHIAATHTAGI 367 E IA + GI Sbjct: 289 KESLKIAKAYLRGI 302 >UniRef50_Q93DZ7 Eps1I n=1 Tax=Streptococcus thermophilus RepID=Q93DZ7_STRTR Length = 316 Score = 81.3 bits (199), Expect = 6e-14, Method: Compositional matrix adjust. Identities = 42/98 (42%), Positives = 62/98 (63%), Gaps = 1/98 (1%) Query: 74 NPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLR 133 NP ISII+P +N EK + I + LAQ Y NIEVI VNDGSTD++ A++ + +++ Sbjct: 2 NPLISIIVPTYNVEKYIRTCIESILAQTYRNIEVIIVNDGSTDQSLAVISDLICSHHNIK 61 Query: 134 VIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRD 171 VI+ +NQG ++A TG AA +Y+ +D D + D Sbjct: 62 VIN-QKNQGLSVARNTGIDAATGKYIAFVDADDKIKPD 98 >UniRef50_A8AKI2 Putative uncharacterized protein n=1 Tax=Citrobacter koseri ATCC BAA-895 RepID=A8AKI2_CITK8 Length = 357 Score = 80.9 bits (198), Expect = 8e-14, Method: Compositional matrix adjust. Identities = 40/108 (37%), Positives = 66/108 (61%), Gaps = 1/108 (0%) Query: 73 DNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHL 132 ++P +SII+P +N +E + + +AQ + ++E+IAVNDGSTD +R IL R +++P + Sbjct: 2 NHPVVSIILPVYNVSAYLERCLDSLVAQTWRHLEIIAVNDGSTDNSRDILARYQSRLPGM 61 Query: 133 RVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPM 180 R++ +N+G A A A+A EY+ C+D D + D IV M Sbjct: 62 RIVD-QENRGLAEARNAALASASGEYIYCLDSDDHIASDGIETIVREM 108 >UniRef50_D0DDK6 Glycosyl transferase, family 2 n=1 Tax=Citreicella sp. SE45 RepID=D0DDK6_9RHOB Length = 618 Score = 80.9 bits (198), Expect = 8e-14, Method: Compositional matrix adjust. Identities = 66/280 (23%), Positives = 123/280 (43%), Gaps = 6/280 (2%) Query: 69 PQLKDNPSISIIIPCFNEEKNVEETIHA---ALAQRYENIEVIAVNDGSTDKTRAILDRM 125 P + P I++IIP NE +++TIH+ A Q + ++ ++ +++ S+D T I+ Sbjct: 252 PPGRFAPPITVIIPAHNEAHIIDDTIHSLDVAAEQYHNDVTLLVIDNNSSDATPKIVREA 311 Query: 126 AAQIPHLRVIHLAQN-QGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNP 184 + +R L++ GKA AL G A +++Y+V ID D + +A ++ NP Sbjct: 312 LLRARRIRGRLLSEPVPGKAHALNRGIAETRTDYVVRIDADTQVSPEALWRVMR-HFSNP 370 Query: 185 RVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAE 244 V AV G + ++ E +GL + + V G+ A+ AL + Sbjct: 371 LV-AVAGGQALAPGGGFFDGVRDLEVILKVGLDQVAYGAGDAIIGVPGMFVAYNTQALRQ 429 Query: 245 VGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVF 304 G ++ M ED D + ++ + + +P A +P T + +QR+RW + V Sbjct: 430 AGSFASGMNGEDTDAAIRIAESGRRLVVDPSATFVSEVPRTFSHMREQRMRWFRSLFHVM 489 Query: 305 LKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYA 344 +N + L Y L + F+++A Sbjct: 490 ARNRQHFHLRNLSIRSKLLMPYLLINTARRAIALPFLLFA 529 >UniRef50_C0GHH9 Glycosyl transferase family 2 n=1 Tax=Dethiobacter alkaliphilus AHT 1 RepID=C0GHH9_9FIRM Length = 357 Score = 80.9 bits (198), Expect = 9e-14, Method: Compositional matrix adjust. Identities = 68/221 (30%), Positives = 108/221 (48%), Gaps = 39/221 (17%) Query: 77 ISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIH 136 +S+++ +N E+ ++E + AA++Q Y N+E+I VNDGSTD T+AILD A ++VIH Sbjct: 3 VSLVMAVYNGEEFLKEALGAAISQTYSNLEIIVVNDGSTDSTKAILD--AVTDSKVKVIH 60 Query: 137 LAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLY---NPR-------V 186 L +NQG A A+ TG + A +++ D D D I E + + NPR + Sbjct: 61 LKENQGAANAMNTGISQATGDWIAVQDAD---DNSFPTRIEEQVKHIQKNPRLVGLGTFI 117 Query: 187 GAVTGNPRI---------RTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAF 237 + G+P + R +++ V Q+ R +G T S V+ F Sbjct: 118 ECIPGSPTVSQARLRRFAREKNSCVSSTQI-----------RKAVYHGCPLTHSSVM--F 164 Query: 238 RRSALAEVGYWSDDM-ITEDIDISWKLQLNQWTIFYEPRAL 277 + +VG +S D I D D+ KL L + I P+ L Sbjct: 165 CKEVFFQVGGYSTDFRIAYDYDLWLKL-LEKGEIENLPKVL 204 >UniRef50_C6N2D6 Glucosaminyltransferase n=1 Tax=Legionella drancourtii LLAP12 RepID=C6N2D6_9GAMM Length = 526 Score = 80.5 bits (197), Expect = 1e-13, Method: Compositional matrix adjust. Identities = 64/254 (25%), Positives = 120/254 (47%), Gaps = 15/254 (5%) Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRY--ENIEVIAVNDGSTDKTRAILDRMAAQIPHL 132 P + IIIP +NEE + +I + + Y +++ + AV+D STD+T I+ A Q P+ Sbjct: 47 PRVVIIIPVWNEEDVISLSIDSLMRLHYPADSLRIYAVDDVSTDRTPEIIAAKALQYPN- 105 Query: 133 RVIHLAQNQ----GKAIALKTGA----AAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNP 184 RV ++ ++ GKA + G +E ++ ID D +L+ + + L + Sbjct: 106 RVFNIRRHAKIDFGKAAVINHGLNIILQDTWAEAIMIIDADVMLETNTLIRMTR-HLADQ 164 Query: 185 RVGAVTGNPRI-RTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALA 243 V AVT + + L+ EY +R Q + G + ++G R+++ Sbjct: 165 EVSAVTAYIKEGQVPGNLLTHFIGFEYIVAQAATRRAQNVLGVMACLAGGAQLHTRASIE 224 Query: 244 EVGYWSD-DMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAE 302 ++G D + ED ++ QL+ + ++ A+ P + LWKQR RWA+G + Sbjct: 225 QIGGRIDTSTLAEDTYTTFSTQLHNNKVIFDGNAIAIAEEPTYISDLWKQRFRWAEGNVQ 284 Query: 303 VFLKNMTRLWRKEN 316 + ++ ++W N Sbjct: 285 I-TRHFKKIWFHRN 297 >UniRef50_Q8YUP7 Glucosyltransferase n=2 Tax=Cyanobacteria RepID=Q8YUP7_ANASP Length = 343 Score = 80.1 bits (196), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 46/112 (41%), Positives = 67/112 (59%), Gaps = 3/112 (2%) Query: 76 SISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVI 135 ++S+IIP +N E +V ET+ + LAQ Y ++E+I V+DGSTDK+ I + Q +R+I Sbjct: 2 NLSVIIPVYNSESSVAETLRSVLAQTYRDLEIIIVDDGSTDKSIDICKQF--QDERIRII 59 Query: 136 HLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVG 187 H QN+G A A TG A+ EYL +D D L + A +E +P VG Sbjct: 60 H-QQNRGLAGARNTGIRQARGEYLAFVDSDDLWLPEKLAKHLEHFERSPEVG 110 >UniRef50_B8J3S6 Glycosyl transferase family 2 n=1 Tax=Desulfovibrio desulfuricans subsp. desulfuricans str. ATCC 27774 RepID=B8J3S6_DESDA Length = 352 Score = 80.1 bits (196), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 68/222 (30%), Positives = 109/222 (49%), Gaps = 18/222 (8%) Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRV 134 PSI++++P FN E+ ++E + + LAQ + N +++ +NDGSTD T IL R+AA+ P +R Sbjct: 10 PSITVLLPVFNGEQFIKEAVTSILAQSFTNFDLLIINDGSTDGTGEILQRLAAKDPRIR- 68 Query: 135 IHLAQNQGKAIALKTGAAAAKSEYLVCIDGD--ALLDRDAAAYIVEPMLYNPRVGAV-TG 191 IH +N+G L G A SE++ +D D AL R A Y + M + TG Sbjct: 69 IHQRENRGLIATLNEGIALCSSEFVARMDADDCALPHRLAVQY--DYMTKHAETAVCSTG 126 Query: 192 NPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDD 251 T TL + G R + ++G ++A RRSAL VG + Sbjct: 127 MEEYETGRTLTWE--------GYGEAVRARLLFGCCLYHPTILA--RRSALLAVGGYDAA 176 Query: 252 M-ITEDIDISWKLQLNQWTIFYEPRALC-WILMPETLKGLWK 291 M ED D+ +L +++ P+ L + + P + +K Sbjct: 177 MPCAEDYDLWMRLTEAGYSLAVLPQVLLRYRVHPHLSRAAYK 218 >UniRef50_A3MTD6 Glycosyl transferase, family 2 n=4 Tax=Thermoproteaceae RepID=A3MTD6_PYRCJ Length = 365 Score = 80.1 bits (196), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 76/266 (28%), Positives = 123/266 (46%), Gaps = 23/266 (8%) Query: 50 VYFWVYRERHW---PWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRY--EN 104 +Y+W R + W PW + AP P + P +++I+P +NE +EE + Q Y + Sbjct: 20 LYYW--RAKRWLKKPW-DVAPDPTYR--PRVTVIVPTYNEAPLIEEKLDNIYEQDYPRDK 74 Query: 105 IEVIAVNDGSTDKTRAILDRMAAQIPHL--RVIHLAQNQGKAIALKTGAAAAKSEYLVCI 162 +EV+ V+ STD T + + R A P L ++ + +GKA AL T A E +V Sbjct: 75 LEVVVVDSASTDGTPSAVRRWAETHPDLALTLVEETERRGKAHALNTALRHATGEIVVIT 134 Query: 163 DGDAL-LDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGE-YSSIIGLIKRT 220 D DAL RD A V+ L +P VGAV+ R G V + Y +++ Sbjct: 135 DADALWPARDTLANAVK-WLADPTVGAVSCVKRP------AGPAGVEDSYRDFYNVLRVA 187 Query: 221 QRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWI 280 + G +AAF+R L +G + D+ +D + K+ + + P +C Sbjct: 188 ESKAWATPIFHGELAAFKRELLERLGGFPTDVGADDSHTATKIAMMGYRAITPPDVVCVE 247 Query: 281 LMPETLKGLWKQRLRWAQGGAEVFLK 306 +P+ +G R+R AQ + F K Sbjct: 248 AVPK--RGYHAWRIRRAQHLVQHFAK 271 >UniRef50_UPI0001BC6092 glycosyl transferase, group 2 family protein n=1 Tax=Fusobacterium ulcerans ATCC 49185 RepID=UPI0001BC6092 Length = 332 Score = 80.1 bits (196), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 39/94 (41%), Positives = 58/94 (61%), Gaps = 1/94 (1%) Query: 78 SIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHL 137 SII+PC+N EK +E+TI + L Q +++ E+I VNDGSTDKT+ ILD + + VI+ Sbjct: 7 SIIVPCYNVEKYIEKTIKSILTQTFDDFELILVNDGSTDKTKNILDEYFQKDKRINVIN- 65 Query: 138 AQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRD 171 +N G + A G A +Y+ +DGD + D Sbjct: 66 KKNSGVSEARNEGIKIAVGKYIYFLDGDDYITND 99 >UniRef50_Q0IA52 Glycosyl transferase, group 2 family protein domain protein n=1 Tax=Synechococcus sp. CC9311 RepID=Q0IA52_SYNS3 Length = 667 Score = 80.1 bits (196), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 56/238 (23%), Positives = 107/238 (44%), Gaps = 11/238 (4%) Query: 71 LKDNPSISIIIPCFNEEKNV-EETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQI 129 L+++PS+ I I +NE++ + TI AA+ +Y+N V ++DG+ R +D++ ++ Sbjct: 121 LRNSPSVDIFITTYNEDEEIITPTIMAAMNVKYQNSNVWVLDDGN----RKWVDKLCTEL 176 Query: 130 PHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAV 189 + + KA L +Y V +D D ++ D+ I+ N + + Sbjct: 177 QVGYITRTSNVHAKAGNLNNALQKTFGKYFVILDADFVIFADSLGPIISKFESNSNIATI 236 Query: 190 TG-----NPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAE 244 NP + + V E + +++ + G + T G + RR+A+ E Sbjct: 237 QTPHCFFNPDPILSNLRLNNFWVDEQAFYHNVVQPGKNSIG-LATCCGSSSIHRRTAILE 295 Query: 245 VGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAE 302 +G + +TED +S L W ++ + L PE+LK + QR RW +G + Sbjct: 296 IGGFPTKSVTEDTLLSIILYSKGWKTYFHSLPMARGLSPESLKSFFIQRRRWVRGAIQ 353 >UniRef50_Q2W2R8 Cellulose synthase catalytic subunit n=5 Tax=Rhodospirillaceae RepID=Q2W2R8_MAGSA Length = 885 Score = 79.7 bits (195), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 64/247 (25%), Positives = 112/247 (45%), Gaps = 13/247 (5%) Query: 67 PAPQLKDNPSISIIIPCFNEEKN-VEETIHAALAQRYENIEVIAVNDGSTDKT--RAILD 123 P Q K P +SI +PC+NE V+ T+ + +A Y N E+I +++ + + R + + Sbjct: 425 PPDQCKRFPKVSIHLPCYNEPPAMVKLTLDSLMALDYPNFEIIVLDNNTKKEEVWRPVEE 484 Query: 124 RMAAQIPHLRVIHLAQNQG-KAIALKTG--AAAAKSEYLVCIDGDALLDRDAAAYIVEPM 180 ++ HLA G KA AL G ++E + +D D ++D++ +V P Sbjct: 485 YCKTLGDKVKFFHLAPWPGAKAGALNFGLTVTDPEAEIIGVVDSDYMVDKNWLKGLV-PY 543 Query: 181 LYNPRVGAVTGNPRIRT-RSTLVGKIQVGEYSSI--IGLIKRTQRIYGNVFTVSGVIAAF 237 NP+VG V R L ++ EY+ IG++ R + + G + Sbjct: 544 FENPKVGHVQAPQDHREWEHDLFKEMINWEYAGFFDIGMVFRNE---ADAIIQHGTMTLV 600 Query: 238 RRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWA 297 R+ L + G W + I ED ++ ++ + Y L L+P++ KQR RWA Sbjct: 601 RKKTLEDAGRWGEWCIVEDAELGLRMMKAGYQSVYVQDRLGHGLVPDSFMAYKKQRFRWA 660 Query: 298 QGGAEVF 304 G ++ Sbjct: 661 YGAVQIL 667 >UniRef50_C4KBM8 Glycosyl transferase family 2 n=6 Tax=Betaproteobacteria RepID=C4KBM8_THASP Length = 868 Score = 79.7 bits (195), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 70/254 (27%), Positives = 108/254 (42%), Gaps = 13/254 (5%) Query: 67 PAPQLKDNPSISIIIPCFNEEKN-VEETIHAALAQRYENIEVIAVNDGSTDKT--RAILD 123 P + P +SI + C+NE V TI + Y+N EV+ +++ + D+ + + Sbjct: 418 PPHPPEQQPFVSIHLACYNEPPEMVIATIDSLAQMNYQNFEVLILDNNTRDEALWKPLER 477 Query: 124 RMAAQIPHLRVIHLAQNQG-KAIALKTG--AAAAKSEYLVCIDGDALLDRDAAAYIVEPM 180 R A P R HLA G KA AL G ++E + +D D ++D D A +V P Sbjct: 478 RCAELGPRFRFFHLANWPGFKAGALNYGLKVTDPRAEVVGVVDADYVVDPDWLACLV-PH 536 Query: 181 LYNPRVGAVTGNPRIRT-RSTLVGKIQVGEYSSI--IGLIKRTQRIYGNVFTVSGVIAAF 237 P V V R ++ E+ IG+ R +R N G + Sbjct: 537 FDQPEVAVVQAPQAHRDWEGQPFKRMCNWEFDGFFRIGMHHRNER---NALIQHGTMTMV 593 Query: 238 RRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWA 297 RR AL EVG WS+ I ED ++ +L + Y L L P + QR RWA Sbjct: 594 RRRALEEVGGWSEWCICEDTELGLRLIEKGYDTRYIDHILGRGLTPSGFAAIKSQRFRWA 653 Query: 298 QGGAEVFLKNMTRL 311 G ++ ++ + Sbjct: 654 FGAMQILKAHLPHM 667 >UniRef50_C5CI31 Glycosyl transferase family 2 n=1 Tax=Kosmotoga olearia TBF 19.5.1 RepID=C5CI31_KOSOT Length = 377 Score = 79.3 bits (194), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 61/227 (26%), Positives = 98/227 (43%), Gaps = 7/227 (3%) Query: 76 SISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVI 135 S+ +I NEE +E T+ L + VI V+D STD T IL ++ + P+L V+ Sbjct: 40 SVVAVIAAKNEESVIENTVRNLLTNSPRSFRVIVVDDKSTDSTYKILSELSREFPNLTVV 99 Query: 136 HLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVE--PMLYNPRVGAVTGNP 193 GK+ AL K + ++ +D DA +D D +I E + +P + V + Sbjct: 100 RNTGAPGKSEALNVALDLVKEDIVLFLDADARVDWD---FIKEYSKLFRSPEINVVFTDF 156 Query: 194 RIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMI 253 +S + I Y S I + +F SGV RR VG + + Sbjct: 157 ESYNQSRTLAVILQDLYFSFIKAFVYSGLFSPTIFMNSGVF--IRRKVFDIVGKFDPQTL 214 Query: 254 TEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGG 300 +D D++ +L + + + C I L+ L+KQ RW GG Sbjct: 215 VDDFDLALRLWKRKLKVKFVRGKRCKIQYAFRLQDLFKQHCRWYIGG 261 >UniRef50_Q0EZB0 Glycosyl transferase, family 2 n=1 Tax=Mariprofundus ferrooxydans PV-1 RepID=Q0EZB0_9PROT Length = 348 Score = 79.3 bits (194), Expect = 3e-13, Method: Compositional matrix adjust. Identities = 60/188 (31%), Positives = 91/188 (48%), Gaps = 19/188 (10%) Query: 77 ISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIP-HLRVI 135 +S IIP +N + + + + + LAQ Y IE+I V+DGSTD T A D++A Q P +RV+ Sbjct: 20 VSTIIPVYNRVEMLRQAVGSVLAQSYRPIEIIIVDDGSTDATGAEADKLAGQHPDEIRVL 79 Query: 136 HLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRI 195 H+ N G +A + G AA+ E++ +D D LL + + V + +P G G R+ Sbjct: 80 HI-DNGGAGLAREAGRKAARGEFIQYLDSDDLLLPEKFSVQVAALNEHPECGIAYGYTRL 138 Query: 196 RTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTV-------SGVIAAFRRSALAEVGYW 248 + G I Y K T R + +F + FRRS VG W Sbjct: 139 ---IDIDGNILKAPY-------KWTDRPFDTLFPALLVDRWWNTHTPLFRRSVCDAVGPW 188 Query: 249 SDDMITED 256 S ++ED Sbjct: 189 SGMRMSED 196 >UniRef50_B2A4H6 Glycosyl transferase family 2 n=1 Tax=Natranaerobius thermophilus JW/NM-WN-LF RepID=B2A4H6_NATTJ Length = 336 Score = 79.3 bits (194), Expect = 3e-13, Method: Compositional matrix adjust. Identities = 53/158 (33%), Positives = 85/158 (53%), Gaps = 12/158 (7%) Query: 76 SISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVI 135 ++SI++ +NEEK +++ I +AL+Q Y + ++I VNDGSTD T+ ILD + Q + VI Sbjct: 4 AVSIVMAVYNEEKFLQQAIDSALSQTYPSTDLIVVNDGSTDSTKTILDNLNDQ--RVTVI 61 Query: 136 HLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRI 195 HL +NQG A AL G A ++ ++ D D +Y P +V + NP++ Sbjct: 62 HLEENQGAANALNIGIEQASGNWI------SIHDADDISY---PTKIEEQVKYIKQNPQL 112 Query: 196 RTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGV 233 TLV I G + L+KR +I +V + + Sbjct: 113 VGVGTLVEYI-AGSPNVTQKLLKRLVKINKSVISAEQI 149 >UniRef50_B7K6R4 Glycosyl transferase family 2 n=2 Tax=Cyanothece RepID=B7K6R4_CYAP7 Length = 374 Score = 79.3 bits (194), Expect = 3e-13, Method: Compositional matrix adjust. Identities = 66/200 (33%), Positives = 95/200 (47%), Gaps = 20/200 (10%) Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRV 134 P +S+IIP +N EK +E T+ + L+Q Y+NIEV+ ++DGS DKT I+ +A Q RV Sbjct: 11 PLVSVIIPAYNAEKFIERTLRSVLSQTYQNIEVLVIDDGSQDKTAEIVQTIANQ--DQRV 68 Query: 135 IHLAQ-NQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLY-NPRVGAVTG- 191 I L Q N G A A G AK E++ ID D + + V+ ML VG V Sbjct: 69 ILLRQANSGVAAARNLGIQNAKGEFIAPIDADDIWYPENIEKQVQCMLEGGTSVGLVYSW 128 Query: 192 NPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNV----FTVSGVIAAFRRSALAEVGY 247 + I L G +V E +YG + F + A RR+ L EV Sbjct: 129 SVDIDENDQLTGAFRVAEIEG---------DVYGTLVCHYFLGNASCALIRRTCLEEVSS 179 Query: 248 WSDDMITEDID--ISWKLQL 265 ++ T ++ W+L L Sbjct: 180 YNCQFKTHNLQGCEDWELAL 199 >UniRef50_C6LFR4 Glycosyl transferase n=5 Tax=Clostridiales RepID=C6LFR4_9FIRM Length = 317 Score = 79.3 bits (194), Expect = 3e-13, Method: Compositional matrix adjust. Identities = 51/217 (23%), Positives = 104/217 (47%), Gaps = 21/217 (9%) Query: 76 SISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVI 135 +S+IIP +N EK + + + L Q ++ VI V++ S D + I++ A+ P +R+I Sbjct: 2 DVSVIIPNYNGEKYIRACLDSLLTQNIGDVPVIVVDNASADGSCGIIE---AEYPSVRLI 58 Query: 136 HLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRI 195 L +N G + A+ G AA++E+++ ++ D + D + ++ + + R+ + R Sbjct: 59 RLDKNYGFSRAVNEGIRAARTEFVILLNNDTRAEADFVSELLRAIRRDARIFSCQAQMRQ 118 Query: 196 RTRSTLVGK-----------IQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAE 244 L+ + G+ S G +K VF+ A +R+S + Sbjct: 119 MDEPQLLDNAGDFYCALGWALTRGKGKSCAGYLKEDA-----VFSACAGAAIYRKSIFEK 173 Query: 245 VGYWSDDMIT--EDIDISWKLQLNQWTIFYEPRALCW 279 +GY+ + ED+DI ++ ++ + Y PRA+ + Sbjct: 174 IGYFDEKHFAYLEDVDICYRAKIAGYENRYAPRAVVY 210 >UniRef50_D2QCG6 Glycosyl transferase family 2 n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QCG6_9SPHI Length = 331 Score = 79.0 bits (193), Expect = 3e-13, Method: Compositional matrix adjust. Identities = 34/91 (37%), Positives = 60/91 (65%), Gaps = 2/91 (2%) Query: 77 ISIIIPCFNEEKNVEETIHAALA--QRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRV 134 IS+++P FNEE+N+ +H +A +YE E++ V+DGS+D+TR +L +++ P +R Sbjct: 2 ISLVVPAFNEEENLSVLVHRLMAVMSQYECYEILIVDDGSSDQTRHVLRQLSQAYPAVRF 61 Query: 135 IHLAQNQGKAIALKTGAAAAKSEYLVCIDGD 165 I ++N G +AL+ G A+ E ++C+D D Sbjct: 62 ISFSRNFGHQMALRAGYDNARGEAVICLDAD 92 >UniRef50_B7JW84 Glycosyl transferase family 2 n=2 Tax=Cyanothece RepID=B7JW84_CYAP8 Length = 1177 Score = 79.0 bits (193), Expect = 3e-13, Method: Compositional matrix adjust. Identities = 61/191 (31%), Positives = 97/191 (50%), Gaps = 27/191 (14%) Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTD-KTRAILDRMAAQIPHLR 133 P +S+IIPC+N + +EE I + L Q ++N E+I V+DGSTD KT+ +LD + P Sbjct: 558 PLVSVIIPCYNYGQYLEEAIDSVLQQTFQNFEIIVVDDGSTDSKTQEVLDNLNK--PKTT 615 Query: 134 VIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNP 193 +I +NQG AIA G AK +Y+ C+D D D+ AY+ + ++ Sbjct: 616 LIR-QKNQGVAIARNEGIFQAKGKYICCLDAD---DKLKPAYLEKCLI------------ 659 Query: 194 RIRTRSTLVGK--IQVGEYSSII---GLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYW 248 ++ T + + IQ E S ++ + ++ + N VS V FRR +VG + Sbjct: 660 KLETENLDICYTWIQEFEESDLVWKTASFELSKLLEENCLEVSAV---FRRDIWEKVGGY 716 Query: 249 SDDMITEDIDI 259 M ED D+ Sbjct: 717 DPQMAYEDWDL 727 >UniRef50_D1JHN4 Conserved hypothetical membrane protein, glycosyl transferase family 2 n=1 Tax=uncultured archaeon RepID=D1JHN4_9ARCH Length = 538 Score = 79.0 bits (193), Expect = 4e-13, Method: Compositional matrix adjust. Identities = 77/372 (20%), Positives = 151/372 (40%), Gaps = 57/372 (15%) Query: 74 NPSISIIIPCFNEEKN-VEETIHAALAQRYENIEVIAVNDGSTDK----TRAILDRMAAQ 128 P +++ I +NE+ + +E TI A Y N ++ ++D + + ++A+ ++ Sbjct: 104 EPKVAVFITAYNEDADTLENTISACTLMDYRNKQIYLLDDSTKPELMRTSKALTEKYG-- 161 Query: 129 IPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGD-------------ALLDRDAAAY 175 + +H +G ++YL+ +D D L ++ A+ Sbjct: 162 ---IEYVHRENRRGFKAGAINDMLNVDAKYLLILDADQRPGYNFLREVVPTLEEKPELAF 218 Query: 176 IVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIA 235 + P Y R + N +ST + G+ S N G Sbjct: 219 VQTPQYYVNRDSSKVSNAASAQQSTFYANVSEGKSVS-------------NAMFACGTNI 265 Query: 236 AFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLR 295 R SAL ++G + ++ +TED S+ L ++ +Y P ++ G + Q++R Sbjct: 266 VLRVSALKDIGGFDEESVTEDFATSFMLHERGYSSYYYNNVFVEGDGPASIPGYYMQQMR 325 Query: 296 WAQGGAEVFLKNMTRLWRKENFRMWPL-FFEYCLTTIWAFT---------CLVGFIIYAV 345 WA G +F K + L+R R+ P+ ++EY L+ W F C V ++++ + Sbjct: 326 WAYGTIGIFKKLLKELFRHPR-RLTPVQWWEYFLSGTWYFVGWAFFFMMICPVAYLLFEI 384 Query: 346 QLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIEN-RYEHNLTSSLFWIIWFPVI 404 + PL E + +L T LQF VS+ + R + + + FP+ Sbjct: 385 R----PLLAEPYIYVVAYLPYLLFST---LQFFVSMSMRGFRAKDQWFGQILTYLTFPI- 436 Query: 405 FWMLSLATTLVS 416 +ML+ L++ Sbjct: 437 -YMLAATYALIN 447 >UniRef50_A7HJL2 Glycosyl transferase family 2 n=1 Tax=Fervidobacterium nodosum Rt17-B1 RepID=A7HJL2_FERNB Length = 333 Score = 79.0 bits (193), Expect = 4e-13, Method: Compositional matrix adjust. Identities = 69/275 (25%), Positives = 129/275 (46%), Gaps = 32/275 (11%) Query: 76 SISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVI 135 +SI++P NE K +++ + L Q YE+IE+I ++ STD+TR I+ R + P + Sbjct: 5 KVSIVLPVRNEAKTIQKVLDGLLRQEYEDIEIIVIDGMSTDETRNIVQRYVGRFPE--KV 62 Query: 136 HLAQNQG--KAIALKTGAAAAKSEYLVCIDGDALLDRD---AAAYIVEPMLYNPRVGAVT 190 L +N+ L G + EY++ G A ++ +E + G V Sbjct: 63 KLIENEKVYTPSGLNIGIRNSSGEYVMIASGHASYSKNYIKECVKAIESSECDIAGGVVE 122 Query: 191 GNPRIRTRSTL-VGKI-----QVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAE 244 PR ++ L + +I +G +G +++T V TV+ ++R + Sbjct: 123 VLPRSESKKALAIAEILKHPFGIGGARYKLGAVEKTY-----VDTVA--YGVYKREVFEK 175 Query: 245 VGYWSDDMI-TEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQG---- 299 VG +++++I +DI+ + +L+ + + I P+A + + K LWK ++ G Sbjct: 176 VGLFNENLIRNQDIEFNLRLKGSGYRIMLIPQARAYYFARDKYKELWKN--NFSNGFWVT 233 Query: 300 GAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAF 334 ++ F +N RL + PLFF L T++AF Sbjct: 234 YSKKFAENPYRLR-----HLIPLFFVVYLITLFAF 263 >UniRef50_B2IY03 Glycosyl transferase, family 2 n=1 Tax=Nostoc punctiforme PCC 73102 RepID=B2IY03_NOSP7 Length = 427 Score = 78.6 bits (192), Expect = 4e-13, Method: Compositional matrix adjust. Identities = 70/256 (27%), Positives = 113/256 (44%), Gaps = 25/256 (9%) Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRY--ENIEVIAVNDGSTDKTRAILDRMAAQIPHL 132 P + I++P NE + + + Y ++++ V+DGSTD+T IL + Q P L Sbjct: 75 PPVLILVPAKNESSVLANLVDSLYQLNYPSNSLDIWIVDDGSTDETPEILRELQTQFPSL 134 Query: 133 RVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGN 192 +V GK+ AL + E ++ D DA L + V P+ +GAV Sbjct: 135 QVHRRESKGGKSGALNAVFPFTQGEIILVCDADAQLPVNFLQQTV-PLFQKQAIGAVQVR 193 Query: 193 PRIRTRST-LVGKIQVGEYSSIIGLIKRTQRI-YGNVFTVSGVIAAFRRSALAEVGYWSD 250 I +T + + Q E L +T RI G + + G RR L + W++ Sbjct: 194 KTISNANTNFLTRCQQMEMCCDSFL--QTHRIAIGGMSELRGNGMLVRRELLEKCQGWNE 251 Query: 251 DMITEDIDISWKLQLNQWTIFY-------EPRALCWILMPETLKGLWKQRLRWAQGGAEV 303 + +T+D+D+ +KL L I + E W + LW QR RWA+GG + Sbjct: 252 NTVTDDLDLCFKLYLAGAEIEFVTVPSIQEESVTSW-------EKLWHQRCRWAEGGYQR 304 Query: 304 FL----KNMTRLWRKE 315 +L + +T W KE Sbjct: 305 YLDYFPQILTLGWAKE 320 >UniRef50_B4BNC7 CDP-glycerol:poly(Glycerophosphate) glycerophosphotransferase n=3 Tax=Geobacillus RepID=B4BNC7_9BACI Length = 888 Score = 78.6 bits (192), Expect = 4e-13, Method: Compositional matrix adjust. Identities = 36/107 (33%), Positives = 63/107 (58%), Gaps = 1/107 (0%) Query: 72 KDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPH 131 + +P +++++ +N E +EE + + Q +E++EV+ VND STD+T I+DR + P Sbjct: 7 RKSPELTVVVIAYNNELYIEEALESLDEQTFEDMEVVVVNDASTDRTGEIIDRFVSTRPK 66 Query: 132 LRVIHLAQNQGK-AIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIV 177 R IHL N G ++ TG A + +Y++ +DGD +DA +V Sbjct: 67 FRAIHLPNNSGGCSVPRNTGIANSTGKYIMFLDGDDWYAKDACELMV 113 >UniRef50_A7HN17 Glycosyl transferase family 2 n=1 Tax=Fervidobacterium nodosum Rt17-B1 RepID=A7HN17_FERNB Length = 341 Score = 78.6 bits (192), Expect = 4e-13, Method: Compositional matrix adjust. Identities = 36/92 (39%), Positives = 56/92 (60%) Query: 77 ISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIH 136 +S+IIP +N E +E ++ + L Q YENIE+I VNDGS D T + +++ + I Sbjct: 8 VSVIIPAYNLESYIERSLRSVLEQTYENIEIIVVNDGSKDSTAEVAEKILKNAGKIYKII 67 Query: 137 LAQNQGKAIALKTGAAAAKSEYLVCIDGDALL 168 +NQG ++A TG AA+ +Y+ +DGD L Sbjct: 68 NQKNQGASVARNTGLTAAQGKYIKFLDGDDTL 99 >UniRef50_C9RRD2 Glycosyl transferase family 2 n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RRD2_FIBSS Length = 367 Score = 78.6 bits (192), Expect = 5e-13, Method: Compositional matrix adjust. Identities = 70/245 (28%), Positives = 113/245 (46%), Gaps = 23/245 (9%) Query: 72 KDNPSISIIIPCFNEEKNVEETIHAALAQRY-ENIEVIAVNDGSTDKTRAILDRMAAQIP 130 K PS+S++IP NEE+ E T+ A AQ Y EVI V+D STD+TR IL++ AA P Sbjct: 34 KATPSVSVVIPMRNEEEFAERTLEAVAAQDYIGEWEVICVDDRSTDRTREILEKFAATHP 93 Query: 131 HLRVIHLAQN-----QGKAIALKTGAAAAKSEYLVCIDGDALLDRD----AAAYIVEPML 181 RV+ L Q+ K AL++ AK+E L+ +D D + + A V+ + Sbjct: 94 RFRVLSLPQDLPQIASPKKRALESAFKIAKNEVLLTMDADCIPRKSWITAMAGRFVDGIC 153 Query: 182 YNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSA 241 V N R+ L K++ Y++ ++ G S A+++ Sbjct: 154 I---VQGPKQNNGSRSMPHLYQKLETLGYTA----MEAAGFSLGRPIVASAACLAYKKDL 206 Query: 242 LAEVGYWSD--DMITEDID--ISWKLQLNQWTIFY--EPRALCWILMPETLKGLWKQRLR 295 +VG + D ++ + D D I +++ + Y + A+ T K L+ QR R Sbjct: 207 FFKVGGFGDLINLSSGDDDMLIHKMMKIPGTKVCYNLDKDAVIETAPVHTWKQLFNQRAR 266 Query: 296 WAQGG 300 W+ G Sbjct: 267 WSSNG 271 >UniRef50_A4WR49 Cellulose synthase (UDP-forming) n=6 Tax=Rhodobacteraceae RepID=A4WR49_RHOS5 Length = 788 Score = 78.6 bits (192), Expect = 5e-13, Method: Compositional matrix adjust. Identities = 72/265 (27%), Positives = 109/265 (41%), Gaps = 25/265 (9%) Query: 67 PAPQLKDNPSISIIIPCFNEEKN-VEETIHAALAQRYENI--EVIAVNDGSTDKTRAILD 123 P Q +D P++ I++P +NE + + T+ AA Y + V+ +DG TD+ D Sbjct: 132 PQVQPEDLPTVDILVPSYNEPADMLSVTLSAAKNMIYPSRLRTVVLCDDGGTDQRCLSPD 191 Query: 124 RMAAQIPHLRVIHLAQ---------------NQGKAIALKTGAAAAKSEYLVCIDGDALL 168 A+ R L Q KA + K + +V D D + Sbjct: 192 PEIARQAQERRRELQQLCRELGVVYSTRERNEHAKAGNMSAALERLKGDLVVVFDADHVP 251 Query: 169 DRDAAAYIVEPMLYNPRVGAVTG-----NPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRI 223 RD A V + +P++ V NP R+ +G E I R Sbjct: 252 SRDFLARTVGYFVEDPKLFLVQTPHFFINPDPIQRNLALGDDCPPENEMFYAKIHRGLDR 311 Query: 224 YGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMP 283 +G F G A RR AL +VG ++ + ITED + + ++ W Y RA+ L P Sbjct: 312 WGGAF-FCGSAAVLRRRALDDVGGFAGETITEDAETALEIHARGWKSLYIDRAMIAGLQP 370 Query: 284 ETLKGLWKQRLRWAQGGAEVF-LKN 307 ET +QR RWA G ++ LKN Sbjct: 371 ETFASFIQQRGRWATGMMQMLRLKN 395 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P75905 Biofilm PGA synthesis N-glycosyltransferase pgaC... 611 e-173 UniRef50_A4SIQ6 Glycosyl transferase, family 2 n=6 Tax=Gammaprot... 418 e-115 UniRef50_B2UK24 Glycosyl transferase family 2 n=10 Tax=Burkholde... 418 e-115 UniRef50_A3N3L7 Biofilm PGA synthesis N-glycosyltransferase PgaC... 404 e-111 UniRef50_Q30SK0 Glycosyl transferase, family 2 n=6 Tax=Proteobac... 403 e-111 UniRef50_Q9RQP9 Biofilm PIA synthesis N-glycosyltransferase icaA... 393 e-108 UniRef50_B9EAX5 Intercellular adhesion protein IcaA homolog n=19... 390 e-107 UniRef50_C8WT25 Glycosyl transferase family 2 n=3 Tax=Bacteria R... 382 e-104 UniRef50_A4J4W8 Glycosyl transferase, family 2 n=3 Tax=Clostridi... 369 e-100 UniRef50_C9RCL0 Glycosyl transferase family 2 n=1 Tax=Ammonifex ... 359 9e-98 UniRef50_Q1IL87 Glycosyl transferase, family 2 n=1 Tax=Candidatu... 344 4e-93 UniRef50_B4CVJ2 Glycosyl transferase family 2 n=1 Tax=Chthonioba... 327 4e-88 UniRef50_Q82DY8 Putative polysaccharide deacetylase/glycosyltran... 326 1e-87 UniRef50_A0RGJ4 Glycosyl transferase and polysaccharide deacetyl... 324 5e-87 UniRef50_C0WP52 Possible VGP family vectorial glycosyl polymeriz... 323 9e-87 UniRef50_B5HQE3 Bifunctional transferase/deacetylase n=3 Tax=Str... 320 6e-86 UniRef50_UPI0001B4D705 bi-functional transferase/deacetylase n=3... 315 2e-84 UniRef50_D2LDP7 Polysaccharide deacetylase n=1 Tax=Rhodomicrobiu... 315 2e-84 UniRef50_D2AU81 Polysaccharide deacetylase n=1 Tax=Streptosporan... 309 1e-82 UniRef50_B5HP10 Bifunctional transferase/deacetylase n=9 Tax=Str... 309 2e-82 UniRef50_Q38W83 Putative glycosyl transferase, family 2 n=1 Tax=... 309 2e-82 UniRef50_Q2G7Y8 Polysaccharide deacetylase n=1 Tax=Novosphingobi... 308 3e-82 UniRef50_B0T3D0 Polysaccharide deacetylase n=2 Tax=Caulobacter R... 307 4e-82 UniRef50_C8WBS4 Polysaccharide deacetylase n=4 Tax=Sphingomonada... 307 5e-82 UniRef50_C1ACV5 Putative glycosyl transferase/polysaccharide dea... 307 6e-82 UniRef50_A5VMQ3 Glycosyl transferase, family 2 n=38 Tax=Lactobac... 304 6e-81 UniRef50_C0YGU0 Polysaccharide deacetylase/glycosyl transferase,... 302 1e-80 UniRef50_D2L983 Polysaccharide deacetylase n=1 Tax=Desulfovibrio... 302 1e-80 UniRef50_B5GIB2 Bi-functional transferase/deacetylase n=7 Tax=St... 298 2e-79 UniRef50_A1UIF2 Polysaccharide deacetylase n=3 Tax=Mycobacterium... 297 7e-79 UniRef50_Q1IUP8 Polysaccharide deacetylase n=1 Tax=Candidatus Ko... 296 8e-79 UniRef50_D1SD50 Polysaccharide deacetylase n=1 Tax=Micromonospor... 295 2e-78 UniRef50_A9EEN3 Glycosyl transferase, family 2 n=1 Tax=Oceanibul... 294 3e-78 UniRef50_A7GCY8 Glycosyl transferase, group 2 family n=66 Tax=Fi... 294 4e-78 UniRef50_C1F4G9 Polysaccharide deacetylase domain protein/glycos... 294 5e-78 UniRef50_B6BG61 Glycosyl transferase, group 2 family protein n=2... 293 1e-77 UniRef50_UPI0001AEFEC9 N-glycosyltransferase PgaC n=1 Tax=Strept... 292 1e-77 UniRef50_A1R2V1 Glycosyl transferase, group 2 family domain prot... 291 3e-77 UniRef50_Q1GJ85 Glycosyl transferase family 2 n=4 Tax=Rhodobacte... 291 4e-77 UniRef50_Q9CHP7 Glycosyl transferase n=3 Tax=Lactococcus lactis ... 287 4e-76 UniRef50_C7MUK1 Glycosyl transferase n=11 Tax=Actinomycetales Re... 285 3e-75 UniRef50_C1V8R5 Glycosyl transferase n=1 Tax=Halogeometricum bor... 282 1e-74 UniRef50_A6Q1D6 Glucosaminyltransferase n=2 Tax=Epsilonproteobac... 280 9e-74 UniRef50_P96587 Uncharacterized glycosyltransferase ydaM n=6 Tax... 279 1e-73 UniRef50_B9ZC52 Glycosyl transferase family 2 n=1 Tax=Natrialba ... 279 1e-73 UniRef50_B8II97 Polysaccharide deacetylase n=1 Tax=Methylobacter... 278 3e-73 UniRef50_B4D4P9 Polysaccharide deacetylase n=1 Tax=Chthoniobacte... 276 1e-72 UniRef50_B9LWG0 Glycosyl transferase family 2 n=1 Tax=Halorubrum... 275 2e-72 UniRef50_B7QVL5 Glycosyl transferase, family 2 n=1 Tax=Ruegeria ... 274 4e-72 UniRef50_Q03HA4 Glycosyltransferase n=2 Tax=Bacilli RepID=Q03HA4... 273 8e-72 UniRef50_A9EU26 Group-specific protein n=3 Tax=Phaeobacter galla... 272 2e-71 UniRef50_C5SLQ2 Polysaccharide deacetylase n=1 Tax=Asticcacaulis... 272 2e-71 UniRef50_B1L4E1 Glycosyl transferase family 2 n=1 Tax=Candidatus... 271 4e-71 UniRef50_Q3B487 Glucosaminyltransferase n=2 Tax=Chlorobium/Pelod... 271 4e-71 UniRef50_Q46B82 Glucosaminyltransferase n=2 Tax=Methanosarcina R... 269 2e-70 UniRef50_B4U2G9 Glycosyl transferase, group 2 family protein n=2... 269 2e-70 UniRef50_B9MNQ5 Glycosyl transferase family 2 n=1 Tax=Anaerocell... 267 4e-70 UniRef50_B1HSU1 Biofilm PIA synthesis N-glycosyltransferase icaA... 267 4e-70 UniRef50_C6A3S6 Putative glycosyl transferase n=1 Tax=Thermococc... 267 6e-70 UniRef50_C1D0J4 Putative glycosyltransferase n=1 Tax=Deinococcus... 265 2e-69 UniRef50_B7HT47 Glycosyl transferase family 2 n=8 Tax=Bacteria R... 265 3e-69 UniRef50_Q2S5Y1 Glycosyl transferase, group 2 family protein n=1... 261 3e-68 UniRef50_D1BHB0 Glycosyl transferase n=1 Tax=Sanguibacter keddie... 258 3e-67 UniRef50_Q2IFD0 Glycosyl transferase, family 2 n=5 Tax=Bacteria ... 258 4e-67 UniRef50_B7I546 Glycosyltransferase n=12 Tax=Acinetobacter RepID... 257 4e-67 UniRef50_Q0AL04 Glycosyl transferase, family 2 n=3 Tax=Bacteria ... 257 6e-67 UniRef50_C7RAF6 Glycosyl transferase family 2 n=1 Tax=Kangiella ... 256 1e-66 UniRef50_Q7NHH7 Glr2559 protein n=1 Tax=Gloeobacter violaceus Re... 255 3e-66 UniRef50_Q2LWC4 Glycosyltransferase n=1 Tax=Syntrophus aciditrop... 254 6e-66 UniRef50_A8MM07 Glycosyl transferase family 2 n=1 Tax=Alkaliphil... 252 2e-65 UniRef50_C7TL81 Glycosyl transferase, group 2 n=3 Tax=Bacilli Re... 252 2e-65 UniRef50_Q110Z2 Glycosyl transferase, family 2 n=15 Tax=Cyanobac... 252 3e-65 UniRef50_Q988K9 Mlr6694 protein n=1 Tax=Mesorhizobium loti RepID... 251 3e-65 UniRef50_Q46CB1 Glycosyltransferase n=2 Tax=Methanosarcina RepID... 251 3e-65 UniRef50_C1F7J6 Glycosyl transferase, group 2 family n=1 Tax=Aci... 251 5e-65 UniRef50_C7RTK3 Polysaccharide deacetylase n=2 Tax=Bacteria RepI... 250 7e-65 UniRef50_D1CH68 Glycosyl transferase family 2 n=2 Tax=Bacteria R... 250 7e-65 UniRef50_C5A6A1 Glycosyltransferase n=2 Tax=Thermococcus RepID=C... 250 8e-65 UniRef50_B1YJV4 Glycosyl transferase family 2 n=4 Tax=Bacillales... 250 1e-64 UniRef50_C5CVQ8 Glycosyl transferase family 2 n=4 Tax=Proteobact... 249 2e-64 UniRef50_Q1IMJ5 Glycosyl transferase, family 2 n=10 Tax=cellular... 248 3e-64 UniRef50_Q05S47 Group-specific protein n=1 Tax=Synechococcus sp.... 248 3e-64 UniRef50_C4I9P5 Inner membrane glycosyltransferase n=39 Tax=Bact... 248 4e-64 UniRef50_A0LZM3 Transmembrane family-2 glycosyl transferase-poss... 248 4e-64 UniRef50_C6X1A6 Glycosyl transferase, family 2 n=3 Tax=Flavobact... 247 6e-64 UniRef50_A4CLA3 Glycosyl transferase, group 2 family protein n=2... 247 8e-64 UniRef50_C7IGQ0 Glycosyl transferase family 2 n=2 Tax=Firmicutes... 245 2e-63 UniRef50_B1V7Q3 Chitin synthase n=1 Tax=Clostridium perfringens ... 244 4e-63 UniRef50_A9KK06 Glycosyl transferase family 2 n=12 Tax=Bacteria ... 243 8e-63 UniRef50_A6DGB3 Glycosyl transferase, family 2 n=3 Tax=Lentispha... 243 1e-62 UniRef50_A8VSD1 Binding-protein-dependent transport systems inne... 243 1e-62 UniRef50_D1PAP3 Putative biofilm PGA synthesis N-glycosyltransfe... 242 2e-62 UniRef50_A1S0Z5 Glycosyl transferase, family 2 n=1 Tax=Thermofil... 242 2e-62 UniRef50_Q9KAQ6 BH2231 protein n=4 Tax=Bacillus RepID=Q9KAQ6_BACHD 242 3e-62 UniRef50_D2L608 Glycosyl transferase family 2 n=1 Tax=Desulfovib... 242 3e-62 UniRef50_B5W069 Glycosyl transferase family 2 n=3 Tax=Arthrospir... 241 3e-62 UniRef50_D2LZZ6 Glycosyl transferase family 2 n=1 Tax=Bacillus c... 240 6e-62 UniRef50_A0LZL8 Transmembrane family-2 glycosyl transferase-poss... 239 2e-61 UniRef50_B9YCC6 Putative uncharacterized protein n=1 Tax=Holdema... 239 2e-61 UniRef50_B2GJB7 Putative glycosyltransferase n=2 Tax=Micrococcin... 239 2e-61 UniRef50_Q74D65 Glycosyl transferase, group 2 family protein n=1... 239 2e-61 UniRef50_UPI0001C1680B Glycosyl transferase, family 2 n=2 Tax=No... 238 3e-61 UniRef50_C0Z7U1 Conserved hypothetical membrane protein n=1 Tax=... 238 3e-61 UniRef50_B3DW74 Glycosyltransferase n=1 Tax=Methylacidiphilum in... 237 5e-61 UniRef50_Q2RZV9 Putative glucosyltransferase n=1 Tax=Salinibacte... 237 6e-61 UniRef50_C0G271 Glycosyl transferase family 2 n=1 Tax=Natrialba ... 237 8e-61 UniRef50_A5FCY6 Candidate polysaccharide-forming beta-glycosyltr... 236 1e-60 UniRef50_UPI0001C42DFC capsule anchoring protein CapD n=1 Tax=Ba... 236 1e-60 UniRef50_C1TQW8 Glycosyl transferase n=1 Tax=Dethiosulfovibrio p... 236 1e-60 UniRef50_A1VIY0 Glycosyl transferase, family 2 n=1 Tax=Polaromon... 235 2e-60 UniRef50_B0C9M4 Glycosyl transferase, family 2 n=1 Tax=Acaryochl... 235 2e-60 UniRef50_B8E0Z1 Glycosyl transferase family 2 n=1 Tax=Dictyoglom... 235 3e-60 UniRef50_C6J2E9 Glycosyl transferase n=1 Tax=Paenibacillus sp. o... 235 3e-60 UniRef50_B7HFD6 N-acetyllactosaminide beta-1,6-N-acetylglucosami... 234 4e-60 UniRef50_C0GFE2 Glycosyl transferase family 2 n=1 Tax=Dethiobact... 233 1e-59 UniRef50_A9QSM4 N-acetylglucosaminyltransferase n=39 Tax=Lactoba... 233 1e-59 UniRef50_A4YD58 Glycosyl transferase, family 2 n=12 Tax=Sulfolob... 233 1e-59 UniRef50_B0U7V3 Glycosyl transferase family 2 n=1 Tax=Methylobac... 233 1e-59 UniRef50_D1N3C9 Glycosyl transferase family 2 n=1 Tax=Victivalli... 233 1e-59 UniRef50_A3I0Z1 Glycosyltransferase n=1 Tax=Algoriphagus sp. PR1... 232 2e-59 UniRef50_A6CMH6 Glycosyl transferase, group 2 family protein n=2... 232 2e-59 UniRef50_Q6ACB6 Glucosaminyltransferase n=1 Tax=Leifsonia xyli s... 232 2e-59 UniRef50_A3HSN4 Glycosyl transferase, group 2 family protein n=1... 232 2e-59 UniRef50_B2UMM8 Glycosyl transferase family 2 n=3 Tax=Verrucomic... 232 2e-59 UniRef50_Q2RGJ6 Glycosyl transferase, family 2 n=7 Tax=Firmicute... 232 2e-59 UniRef50_UPI000178A1DC glycosyl transferase family 2 n=1 Tax=Geo... 231 4e-59 UniRef50_D1JE47 Conserved hypothetical membrane protein, glycosy... 231 4e-59 UniRef50_B9AE19 Putative uncharacterized protein n=1 Tax=Methano... 231 4e-59 UniRef50_A9FZQ2 Glycosyltransferase n=1 Tax=Sorangium cellulosum... 231 5e-59 UniRef50_Q9UY40 Glycosyl transferase, family 2 n=2 Tax=Thermococ... 230 5e-59 UniRef50_C7PF18 Glycosyl transferase family 2 n=9 Tax=Bacteroide... 230 8e-59 UniRef50_C4WPV5 Putative uncharacterized protein n=1 Tax=Ochroba... 229 1e-58 UniRef50_C7XH18 Glycosyltransferase n=1 Tax=Lactobacillus crispa... 229 1e-58 UniRef50_C6WRK1 Glycosyl transferase family 2 n=1 Tax=Actinosynn... 229 2e-58 UniRef50_UPI0001C30EDA glycosyl transferase family 2 n=1 Tax=Con... 228 2e-58 UniRef50_A5GEA8 Glycosyl transferase, family 2 n=3 Tax=Deltaprot... 228 3e-58 UniRef50_Q0AV05 Glycosyltransferase n=1 Tax=Syntrophomonas wolfe... 227 6e-58 UniRef50_A7INQ0 Glycosyl transferase family 2 n=11 Tax=Rhizobial... 226 1e-57 UniRef50_C5SBP2 Glycosyl transferase family 2 n=1 Tax=Allochroma... 225 2e-57 UniRef50_D2S4V1 Glycosyl transferase family 2 n=1 Tax=Geodermato... 225 3e-57 UniRef50_A4CJ64 Glycosyltransferase n=15 Tax=Bacteroidetes RepID... 225 3e-57 UniRef50_Q7NLY9 Gll0980 protein n=1 Tax=Gloeobacter violaceus Re... 225 3e-57 UniRef50_C4KBM8 Glycosyl transferase family 2 n=6 Tax=Betaproteo... 224 5e-57 UniRef50_D2S449 Glycosyl transferase family 2 n=1 Tax=Geodermato... 224 5e-57 UniRef50_Q0IE23 Glycosyl transferase, group 2 family protein n=1... 224 6e-57 UniRef50_UPI00017891C0 glycosyl transferase family 2 n=1 Tax=Geo... 224 6e-57 UniRef50_C7M4N0 Glycosyl transferase family 2 n=4 Tax=Flavobacte... 223 7e-57 UniRef50_UPI000038DF5C N-acetylglucosaminyltransferase n=1 Tax=F... 223 8e-57 UniRef50_C2EFI6 Glycosyltransferase n=1 Tax=Lactobacillus saliva... 221 4e-56 UniRef50_B2IY03 Glycosyl transferase, family 2 n=1 Tax=Nostoc pu... 221 5e-56 UniRef50_Q6KZU9 N-acetylglucosaminyltransferase n=1 Tax=Picrophi... 220 8e-56 UniRef50_C7NQ00 Glycosyl transferase family 2 n=1 Tax=Halorhabdu... 220 9e-56 UniRef50_C9RLY4 Glycosyl transferase family 2 n=1 Tax=Fibrobacte... 220 1e-55 UniRef50_Q2JKN6 Glycosyl transferase, group 2 family protein n=2... 219 1e-55 UniRef50_B9MDW0 Glycosyl transferase family 2 n=1 Tax=Diaphoroba... 219 2e-55 UniRef50_C9KN46 Chitin synthase n=1 Tax=Mitsuokella multacida DS... 219 2e-55 UniRef50_A0Q541 Glycosyl transferase n=3 Tax=Francisella novicid... 218 2e-55 UniRef50_C1ZJA9 Glycosyl transferase n=1 Tax=Planctomyces limnop... 218 3e-55 UniRef50_C4Z6K0 Putative uncharacterized protein n=1 Tax=Eubacte... 218 4e-55 UniRef50_C5A5B1 Glycosyltransferase, family 2 n=9 Tax=Thermococc... 217 6e-55 UniRef50_Q97FY5 Glycosyltransferase n=4 Tax=Clostridiales RepID=... 217 8e-55 UniRef50_C0A1N7 Glycosyl transferase, family 2 n=1 Tax=Opitutace... 217 9e-55 UniRef50_D2QUU5 Glycosyl transferase family 2 n=1 Tax=Spirosoma ... 216 1e-54 UniRef50_Q0AEM9 Cellulose synthase (UDP-forming) n=2 Tax=Nitroso... 215 2e-54 UniRef50_B3QIT3 Glycosyl transferase family 2 n=6 Tax=Bradyrhizo... 215 2e-54 UniRef50_C5S5F1 Glycosyl transferase family 2 n=1 Tax=Allochroma... 215 2e-54 UniRef50_A6FMF5 N-acetylglucosaminyltransferase n=1 Tax=Roseobac... 215 2e-54 UniRef50_Q03RZ5 Glycosyltransferase n=3 Tax=Lactobacillus RepID=... 214 4e-54 UniRef50_UPI0001C31B0A glycosyl transferase family 2 n=1 Tax=Con... 214 6e-54 UniRef50_C0ZQD3 Putative glycosyltransferase n=2 Tax=Rhodococcus... 214 7e-54 UniRef50_A5FAD6 Candidate beta-glycosyltransferase; Glycosyltran... 214 7e-54 UniRef50_C6VR67 Glycosyltransferase (Putative) n=7 Tax=Lactobaci... 213 8e-54 UniRef50_B4WM17 Glycosyl transferase, group 2 family protein n=1... 213 1e-53 UniRef50_Q7NDK6 Glr4229 protein n=1 Tax=Gloeobacter violaceus Re... 213 1e-53 UniRef50_A7HQK0 Glycosyl transferase family 2 n=1 Tax=Parvibacul... 212 2e-53 UniRef50_Q2W2R8 Cellulose synthase catalytic subunit n=5 Tax=Rho... 212 2e-53 UniRef50_B4RSC1 Cellulose synthase catalytic subunit n=2 Tax=Alt... 212 2e-53 UniRef50_Q46DE3 Dolichyl-phosphate mannose synthase related prot... 212 2e-53 UniRef50_Q97G47 Glycosyltransferase n=1 Tax=Clostridium acetobut... 210 6e-53 UniRef50_A1SGB7 Glycosyl transferase, family 2 n=1 Tax=Nocardioi... 210 6e-53 UniRef50_Q1PUL2 Putative uncharacterized protein n=1 Tax=Candida... 210 6e-53 UniRef50_Q03MS9 Glycosyltransferase, probably involved in cell w... 210 7e-53 UniRef50_Q6LKT1 Hypothetical glycosyltransferase, probably invol... 210 8e-53 UniRef50_D1U9V0 Hyaluronan synthase n=1 Tax=Desulfovibrio aespoe... 210 9e-53 UniRef50_A5G0G3 Glycosyl transferase, family 2 n=1 Tax=Acidiphil... 209 1e-52 UniRef50_Q04QL4 Glycosyltransferase plus another conserved domai... 209 2e-52 UniRef50_Q9FNI7 Glucomannan 4-beta-mannosyltransferase 2 n=8 Tax... 209 2e-52 UniRef50_P74165 Sll1377 protein n=9 Tax=Cyanobacteria RepID=P741... 208 4e-52 UniRef50_Q4KHI6 Glycosyl transferase, group 2 family protein n=1... 208 4e-52 UniRef50_A0LSJ8 Glycosyl transferase, family 2 n=1 Tax=Acidother... 208 4e-52 UniRef50_A5GQ06 Glycosyltransferase of family GT2; possible proc... 207 5e-52 UniRef50_B3DVV3 Glycosyltransferase n=1 Tax=Methylacidiphilum in... 206 1e-51 UniRef50_D1N5H0 Glycosyl transferase family 2 n=1 Tax=Victivalli... 206 1e-51 UniRef50_C7QK51 Glycosyl transferase family 2 n=1 Tax=Catenulisp... 206 1e-51 UniRef50_D1B804 Glycosyl transferase family 2 n=2 Tax=Bacteria R... 206 2e-51 UniRef50_Q7XIF5 Probable mannan synthase 7 n=9 Tax=Magnoliophyta... 205 2e-51 UniRef50_D0DDK6 Glycosyl transferase, family 2 n=1 Tax=Citreicel... 205 2e-51 UniRef50_Q0AQN9 Glycosyl transferase, family 2 n=2 Tax=Hyphomona... 205 3e-51 UniRef50_A4SFN6 Glycosyl transferase, family 2 n=1 Tax=Chlorobiu... 205 3e-51 UniRef50_B8HDP3 Glycosyl transferase family 2 n=1 Tax=Arthrobact... 205 3e-51 UniRef50_D1BIJ9 Glycosyl transferase n=2 Tax=Micrococcineae RepI... 205 4e-51 UniRef50_D1J3P1 Whole genome shotgun sequence of line PN40024, s... 204 5e-51 UniRef50_A4WR49 Cellulose synthase (UDP-forming) n=6 Tax=Rhodoba... 204 6e-51 UniRef50_C6HYN9 Glycosyl transferase n=1 Tax=Leptospirillum ferr... 203 9e-51 UniRef50_C9NZ15 Putative uncharacterized protein n=1 Tax=Vibrio ... 203 1e-50 UniRef50_O67406 Cellulose synthase catalytic subunit n=1 Tax=Aqu... 203 1e-50 UniRef50_C6MI20 Glycosyl transferase family 2 n=1 Tax=Nitrosomon... 203 1e-50 UniRef50_A3Q6L3 Glycosyl transferase, family 2 n=6 Tax=Actinomyc... 202 2e-50 UniRef50_Q1D6P4 Glycosyl transferase, group 2 n=3 Tax=Cystobacte... 202 3e-50 UniRef50_B7KYJ4 Glycosyl transferase family 2 n=2 Tax=Methylobac... 201 4e-50 UniRef50_D0LRC3 Glycosyl transferase family 2 n=1 Tax=Haliangium... 201 4e-50 UniRef50_A0QPQ1 Chitin synthase n=1 Tax=Mycobacterium smegmatis ... 201 4e-50 UniRef50_Q30R23 Glycosyl transferase, family 2 n=3 Tax=Campyloba... 200 7e-50 UniRef50_A0LP64 Glycosyl transferase, family 2 n=1 Tax=Syntropho... 200 9e-50 UniRef50_A3MTD6 Glycosyl transferase, family 2 n=4 Tax=Thermopro... 199 2e-49 UniRef50_A5G7E0 Glycosyl transferase, family 2 n=3 Tax=Bacteria ... 198 3e-49 UniRef50_B2ID82 Glycosyl transferase family 2 n=2 Tax=Beijerinck... 198 3e-49 UniRef50_A9KYN4 Glycosyl transferase family 2 n=24 Tax=Gammaprot... 198 3e-49 UniRef50_C0QGC2 NodC n=2 Tax=Desulfobacterium autotrophicum HRM2... 198 5e-49 UniRef50_D1JHN4 Conserved hypothetical membrane protein, glycosy... 197 5e-49 UniRef50_C8VZJ3 Glycosyl transferase family 2 n=1 Tax=Desulfotom... 197 7e-49 UniRef50_D2EES4 Glycosyl transferase family 2 n=1 Tax=Candidatus... 197 9e-49 UniRef50_C6N2D6 Glucosaminyltransferase n=1 Tax=Legionella dranc... 197 9e-49 UniRef50_Q974Y9 Putative uncharacterized protein ST0523 n=2 Tax=... 196 2e-48 UniRef50_Q02IY3 Probable glucosyl transferase n=6 Tax=Proteobact... 196 2e-48 UniRef50_A6LJJ1 Glycosyl transferase, family 2 n=2 Tax=Thermosip... 195 3e-48 UniRef50_C3MU39 Glycosyl transferase family 2 n=9 Tax=Sulfolobus... 195 4e-48 UniRef50_D1RA94 Putative uncharacterized protein n=1 Tax=Parachl... 194 4e-48 UniRef50_B1M838 Cellulose synthase catalytic subunit (UDP-formin... 193 9e-48 UniRef50_C7ZM19 Putative uncharacterized protein n=1 Tax=Nectria... 192 3e-47 UniRef50_C4DG81 Glycosyl transferase n=1 Tax=Stackebrandtia nass... 192 3e-47 UniRef50_D1RIK1 Glycosyl transferase domain-containing protein n... 191 4e-47 Sequences not found previously or not previously below threshold: UniRef50_B7A6V1 Glycosyl transferase family 2 n=1 Tax=Thermus aq... 259 2e-67 UniRef50_C7PF17 Glycosyl transferase family 2 n=1 Tax=Chitinopha... 227 5e-58 UniRef50_A8USY3 Cellulose synthase (UDP-forming) n=1 Tax=Hydroge... 202 2e-50 UniRef50_Q8S7W0 Probable mannan synthase 4 n=4 Tax=Magnoliophyta... 202 3e-50 UniRef50_B6AKK5 Glycosyltransferase n=1 Tax=Leptospirillum sp. G... 201 4e-50 UniRef50_C1ZM23 Glycosyl transferase n=1 Tax=Planctomyces limnop... 200 1e-49 UniRef50_B9XJM2 Glycosyl transferase family 2 n=1 Tax=bacterium ... 198 3e-49 UniRef50_B2JFS7 Cellulose synthase (UDP-forming) n=8 Tax=Burkhol... 198 3e-49 UniRef50_Q9LJP4 Xyloglucan glycosyltransferase 4 n=75 Tax=Embryo... 198 3e-49 UniRef50_C1DW58 Cellulose synthase catalytic subunit (UDP-formin... 197 7e-49 UniRef50_P58931 Cellulose synthase catalytic subunit [UDP-formin... 195 2e-48 UniRef50_Q0IZF3 Os09g0572500 protein n=2 Tax=Oryza sativa Japoni... 195 2e-48 UniRef50_Q7PC76 Glucomannan 4-beta-mannosyltransferase 1 n=53 Ta... 195 3e-48 UniRef50_A9CZ20 Putative cellulose synthase catalytic subunit (U... 193 1e-47 UniRef50_B7IR36 Glycosyltransferase n=11 Tax=Bacillus cereus gro... 192 2e-47 UniRef50_A3ERE6 Cellulose synthase catalytic subunit (UDP-formin... 192 2e-47 UniRef50_Q2K9Z1 Cellulose synthase protein n=17 Tax=Rhizobiaceae... 192 3e-47 UniRef50_Q6YWK8 Probable mannan synthase 11 n=3 Tax=Poaceae RepI... 192 3e-47 >UniRef50_P75905 Biofilm PGA synthesis N-glycosyltransferase pgaC n=146 Tax=Bacteria RepID=PGAC_ECOLI Length = 441 Score = 611 bits (1576), Expect = e-173, Method: Composition-based stats. Identities = 441/441 (100%), Positives = 441/441 (100%) Query: 1 MINRIVSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHW 60 MINRIVSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHW Sbjct: 1 MINRIVSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHW 60 Query: 61 PWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRA 120 PWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRA Sbjct: 61 PWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRA 120 Query: 121 ILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPM 180 ILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPM Sbjct: 121 ILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPM 180 Query: 181 LYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRS 240 LYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRS Sbjct: 181 LYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRS 240 Query: 241 ALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGG 300 ALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGG Sbjct: 241 ALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGG 300 Query: 301 AEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIA 360 AEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIA Sbjct: 301 AEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIA 360 Query: 361 ATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRV 420 ATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRV Sbjct: 361 ATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRV 420 Query: 421 MLMPKKQRARWVSPDRGILRG 441 MLMPKKQRARWVSPDRGILRG Sbjct: 421 MLMPKKQRARWVSPDRGILRG 441 >UniRef50_A4SIQ6 Glycosyl transferase, family 2 n=6 Tax=Gammaproteobacteria RepID=A4SIQ6_AERS4 Length = 416 Score = 418 bits (1076), Expect = e-115, Method: Composition-based stats. Identities = 205/408 (50%), Positives = 276/408 (67%), Gaps = 12/408 (2%) Query: 33 FVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQ---LKDNPSISIIIPCFNEEKN 89 F +P M+++WI GG+Y++ W + A A L P +++++PC+NE N Sbjct: 11 FTLAYPSMMAMVWICGGLYYYF----QWEQRDIALARTGLVLPRYPKVTLMVPCYNEGAN 66 Query: 90 VEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKT 149 VEETI L QRY ++EV+A+NDGS D T LDR+AAQ L V+H NQGKA+AL Sbjct: 67 VEETISHLLRQRYPDLEVLAINDGSKDDTGQRLDRLAAQDARLTVLH-QHNQGKAMALNN 125 Query: 150 GAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGE 209 G AK E LV IDGDA+LD DA ++V+ + +P+VGAVTGNPR+RTRST++GKIQ GE Sbjct: 126 GLERAKGEILVGIDGDAVLDHDAVRWMVKHFIESPKVGAVTGNPRVRTRSTIIGKIQTGE 185 Query: 210 YSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWT 269 +SSIIGLIKR QRIYG VFTVSGV+ AFR+SA+ VG WS DM+TEDIDISWKLQL W Sbjct: 186 FSSIIGLIKRAQRIYGTVFTVSGVVVAFRKSAVEAVGGWSTDMVTEDIDISWKLQLAGWL 245 Query: 270 IFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLT 329 I Y+P+ALCW+LMPET++GL+KQRLRWAQGGAEV L+ + R N W L EY + Sbjct: 246 IHYQPQALCWVLMPETVRGLYKQRLRWAQGGAEVILRYGLQAMRWRNRHFWLLLLEYLSS 305 Query: 330 TIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEH 389 +W ++ L+ +I+ ++ P EL G+++ +CLLQF VS+ I+N Y+ Sbjct: 306 VLWCYSMLLLALIWLIR----PDLGELAEGQLFQWTGVIMTAICLLQFGVSIFIDNHYDK 361 Query: 390 NLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRG 437 + S W IW+P+I+W+L+L T +V+ + +L + Q A W SPDRG Sbjct: 362 TMARSFVWCIWYPLIYWLLNLVTVIVAVPKAILRRRGQLAVWESPDRG 409 >UniRef50_B2UK24 Glycosyl transferase family 2 n=10 Tax=Burkholderiales RepID=B2UK24_RALPJ Length = 425 Score = 418 bits (1075), Expect = e-115, Method: Composition-based stats. Identities = 207/416 (49%), Positives = 280/416 (67%), Gaps = 2/416 (0%) Query: 26 SGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQ-LKDNPSISIIIPCF 84 + + FVF++PFFMS W++GG+ + ER ++ A +K P +SII+PC+ Sbjct: 6 AKRWISGFVFYYPFFMSYFWMIGGLLHYFLLERGTRRIQHPLALLGVKTYPKVSIIVPCY 65 Query: 85 NEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKA 144 NEE NV E I RY N ++IAVNDGS+D+T L+ +AAQ P L VIH + NQGKA Sbjct: 66 NEEANVREVISHLARMRYPNYDIIAVNDGSSDRTGERLNELAAQYPQLVVIHQSSNQGKA 125 Query: 145 IALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGK 204 I L T A +EYL+CIDGD++LD DA A+++ ++ NP VGAVTGNPRIRTRSTL+G+ Sbjct: 126 IGLTTAAQVTDAEYLMCIDGDSILDVDAIAWMIRHLMENPAVGAVTGNPRIRTRSTLLGR 185 Query: 205 IQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQ 264 +QVGE+SSI+GLIKRTQ++YG + TVSGV+ FR+ A+ EVGYWS+DM+TEDIDISWKLQ Sbjct: 186 MQVGEFSSIVGLIKRTQQVYGRLLTVSGVVVMFRKRAIEEVGYWSNDMLTEDIDISWKLQ 245 Query: 265 LNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFF 324 + WTI YEPRAL WILMPET +GL+KQRLRWA+GG + +K + MWP+FF Sbjct: 246 VGGWTIRYEPRALSWILMPETFRGLYKQRLRWAKGGIQALIKYAPAMLSLRQSMMWPIFF 305 Query: 325 EYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIE 384 EY L+ +WA+ L+ I ++V V + E A G LL C+LQ ++ I+ Sbjct: 306 EYALSVVWAYNMLL-VITWSVLGLFVDMPPEWRMEAFPRWHGTLLFITCVLQLLIGCFID 364 Query: 385 NRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRGILR 440 RY+ + +W+PV FW+L+L TT++ F V ++ RARW SPDRGI + Sbjct: 365 RRYDDGILRYFIDTVWYPVAFWILNLVTTVIGFPAVAFQRQRARARWTSPDRGIQQ 420 >UniRef50_A3N3L7 Biofilm PGA synthesis N-glycosyltransferase PgaC n=10 Tax=Pasteurellaceae RepID=A3N3L7_ACTP2 Length = 411 Score = 404 bits (1039), Expect = e-111, Method: Composition-based stats. Identities = 181/412 (43%), Positives = 258/412 (62%), Gaps = 3/412 (0%) Query: 27 GELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNE 86 E+ FVF +P M+ W G+ +++++E+ N + ++ P +S+++PC+NE Sbjct: 3 LEIFSLFVFAYPAVMAFYWAFAGLTYFLFKEK-LKVPPNFDQMKHEEVPLVSLMVPCYNE 61 Query: 87 EKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIA 146 N++E I L +Y N E+I +NDGS D T I+D+ A + + +H N GKA A Sbjct: 62 SDNLDEAIPHLLNLKYPNYELIFINDGSKDHTGEIIDKWAKRDKRIVALH-QANSGKASA 120 Query: 147 LKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQ 206 L G A+ +Y+ CIDGDA+LD A Y+V+ + NPR GAVTGNPR+R RST++G++Q Sbjct: 121 LNNGLRIARGKYVGCIDGDAVLDYKALDYMVQALESNPRYGAVTGNPRVRNRSTILGRLQ 180 Query: 207 VGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLN 266 V E+SSIIGLIKR Q + G +FTVSGV FR+ + E+G WS +MITEDID+SWK+Q + Sbjct: 181 VSEFSSIIGLIKRAQCLMGTIFTVSGVCCLFRKDIMFEIGGWSTNMITEDIDVSWKIQTS 240 Query: 267 QWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEY 326 + IFYEPRALCW+LMPET+ GL+KQRLRWAQGGAE +K ++WR +N R+WP+F EY Sbjct: 241 GYDIFYEPRALCWVLMPETINGLFKQRLRWAQGGAETMMKYFPQIWRLKNRRLWPMFIEY 300 Query: 327 CLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENR 386 +T IWA LV ++ L I L A + +L Q +SL I+NR Sbjct: 301 IVTAIWASLLLVSILLSIYNLI-FDNQIGLLDWAELKPSIAILFIAFFTQLSISLYIDNR 359 Query: 387 YEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRGI 438 YE + +F IW+P ++W L+ T L + + K + A W SPDRG+ Sbjct: 360 YEKGVVKYVFSCIWYPWLYWSLNTITLLCGIPKAIFRNKTKLAVWTSPDRGV 411 >UniRef50_Q30SK0 Glycosyl transferase, family 2 n=6 Tax=Proteobacteria RepID=Q30SK0_SULDN Length = 433 Score = 403 bits (1036), Expect = e-111, Method: Composition-based stats. Identities = 195/412 (47%), Positives = 272/412 (66%), Gaps = 2/412 (0%) Query: 30 MMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKN 89 M+ +VF++P FMS +W++G ++F+ E+ + + + +SI+IPC+NE +N Sbjct: 15 MLGYVFYYPLFMSTLWMIGAIFFYYKSEKRYVEQSIPELRENESWAGVSILIPCYNEGEN 74 Query: 90 VEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKT 149 ETI AL Y EVIA+NDGS D T IL +A P L+V++LAQNQGKA+AL+ Sbjct: 75 AIETITYALDVIYPEFEVIAINDGSKDDTLDILLSLAKDNPRLKVVNLAQNQGKALALQA 134 Query: 150 GAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGE 209 G+ AK E+LVCIDGDAL+D ++ + + P V AVTGNPRIR R++L+GKIQVGE Sbjct: 135 GSLVAKHEFLVCIDGDALIDPYCLYWMAKHFIRYPEVAAVTGNPRIRNRTSLLGKIQVGE 194 Query: 210 YSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWT 269 +SSI+G+IKR QR +G +FTVSGVI FR+SA+ EVGYWS DM+TEDIDI+WKLQ N W Sbjct: 195 FSSIVGMIKRAQRSFGRLFTVSGVITGFRKSAVHEVGYWSPDMLTEDIDITWKLQRNGWD 254 Query: 270 IFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLT 329 + +EP++L WILMPET+KGLWKQRLRWA GGA+V KN L+ + +W L E L+ Sbjct: 255 VRFEPKSLVWILMPETIKGLWKQRLRWAMGGAQVMFKNFKVLFLYKQTHLWGLMIELLLS 314 Query: 330 TIWAFTCLVGFIIYAVQLAG-VPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYE 388 +WA+T ++ +++ L + ++L CL+QF VS ++ Y+ Sbjct: 315 MVWAYTMVLVILVWIFGLLFPIGYLAPSESPILPDQGSVILIGACLVQFGVSKWLDGHYD 374 Query: 389 HNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRGILR 440 L + FW+IW+P FW L+L T + + +V L KK RARWVSPDRG+ + Sbjct: 375 EGLGKNYFWMIWYPFAFWFLNLFTAVAALPKV-LFGKKGRARWVSPDRGVHQ 425 >UniRef50_Q9RQP9 Biofilm PIA synthesis N-glycosyltransferase icaA n=63 Tax=Staphylococcaceae RepID=ICAA_STAA8 Length = 412 Score = 393 bits (1011), Expect = e-108, Method: Composition-based stats. Identities = 159/411 (38%), Positives = 254/411 (61%), Gaps = 2/411 (0%) Query: 30 MMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKN 89 F+ F+P FMSI WIVG +YF+ RE + + P + + I+ ++ C+NE + Sbjct: 3 FFNFLLFYPVFMSIYWIVGSIYFYFTREIRYSLNKK-PDINVDELEGITFLLACYNESET 61 Query: 90 VEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKT 149 +E+T+ LA +YE E+I +NDGS+D T ++ ++ + + L +N+GKA AL Sbjct: 62 IEDTLSNVLALKYEKKEIIIINDGSSDNTAELIYKIKENNDFI-FVDLQENRGKANALNQ 120 Query: 150 GAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGE 209 G A +Y++C+D D ++D+DA Y++E ++P++GAVTGNPRIR +S+++GKIQ E Sbjct: 121 GIKQASYDYVMCLDADTIVDQDAPYYMIENFKHDPKLGAVTGNPRIRNKSSILGKIQTIE 180 Query: 210 YSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWT 269 Y+S+IG IKR+Q + G V T+SGV F++SA+ +VGYW DMITEDI +SWKL L + Sbjct: 181 YASLIGCIKRSQTLAGAVNTISGVFTLFKKSAVVDVGYWDTDMITEDIAVSWKLHLRGYR 240 Query: 270 IFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLT 329 I YEP A+CW+L+PETL GLWKQR+RWAQGG EV L++ + + F ++ L FE ++ Sbjct: 241 IKYEPLAMCWMLVPETLGGLWKQRVRWAQGGHEVLLRDFFSTMKTKRFPLYILMFEQIIS 300 Query: 330 TIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEH 389 +W + L+ + + + + + + ++QF V+L I++RYE Sbjct: 301 ILWVYIVLLYLGYLFITANFLDYTFMTYSFSIFLLSSFTMTFINVIQFTVALFIDSRYEK 360 Query: 390 NLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRGILR 440 + L ++ W+P ++W+++ A LV+F + + K A W SPDRG + Sbjct: 361 KNMAGLIFVSWYPTVYWIINAAVVLVAFPKALKRKKGGYATWSSPDRGNTQ 411 >UniRef50_B9EAX5 Intercellular adhesion protein IcaA homolog n=19 Tax=Staphylococcaceae RepID=B9EAX5_MACCJ Length = 414 Score = 390 bits (1002), Expect = e-107, Method: Composition-based stats. Identities = 163/412 (39%), Positives = 247/412 (59%), Gaps = 5/412 (1%) Query: 29 LMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEK 88 F+ +P MS++WI+G V +++ ER P+ + IS +IPC+NE Sbjct: 7 FFGGFLIVYPVLMSLVWIIGAVLYFLLIERRLKQTP----PERDEYEGISFLIPCYNEAD 62 Query: 89 NVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALK 148 VEET+ L Y E+I +NDGS+D T ++ + R I L QN+GKA AL Sbjct: 63 TVEETLVNVLKLSYPKKEIILINDGSSDNTAEKINALKDHY-DFRFIDLQQNRGKANALN 121 Query: 149 TGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVG 208 T A+ ++++ ID D ++D A ++++ + P +GAVTGNPRIR +S+++GKIQ Sbjct: 122 TAVKHAQYDFVMGIDADTIIDDAAPYFMIDNFRHAPNLGAVTGNPRIRNKSSILGKIQTV 181 Query: 209 EYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQW 268 EY+SIIG IKR Q + G V T+SGV FR+SAL VGYW DDMITEDI +SWK ++ + Sbjct: 182 EYASIIGSIKRAQTLNGYVNTISGVFTLFRKSALEAVGYWDDDMITEDIAVSWKFHMHHY 241 Query: 269 TIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCL 328 I YEPRALCW+L+PET G+WKQRLRWAQGG EV L+ + ++ N +W L+ E + Sbjct: 242 EIRYEPRALCWMLVPETFTGIWKQRLRWAQGGHEVVLREFKSMLKQPNLPLWLLYLEQLI 301 Query: 329 TTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYE 388 + IW ++ L + + + ++ A I+L T+ ++QF VSL+I++RYE Sbjct: 302 SIIWVYSVLFLLAFTIINIDVLDYYFYAYNVNLLILASIILTTVNVIQFTVSLVIDSRYE 361 Query: 389 HNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRGILR 440 LF++ W+P +W+++ + +F + + K + A W SPDRG ++ Sbjct: 362 RKNIVYLFFLCWYPTFYWLINACVAVFAFPKALRRKKGEFATWSSPDRGNIQ 413 >UniRef50_C8WT25 Glycosyl transferase family 2 n=3 Tax=Bacteria RepID=C8WT25_ALIAD Length = 410 Score = 382 bits (982), Expect = e-104, Method: Composition-based stats. Identities = 189/417 (45%), Positives = 258/417 (61%), Gaps = 16/417 (3%) Query: 29 LMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEK 88 ++ F+F +PF MSI+W+VGG + RERH A +P L + P +SI+IPC NE Sbjct: 1 MVHSFLFLYPFVMSIVWMVGGCVYAWRRERH----PFAESPDLPETPFVSILIPCHNEGD 56 Query: 89 NVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALK 148 +E+TI L Y E++A+NDGSTD TRA+L+RMAA +RV++L +GKA AL Sbjct: 57 VLEDTIGRMLQLDYPAYEIVALNDGSTDDTRAVLERMAACDARVRVVNLPVQRGKARALN 116 Query: 149 TGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLY--NPRVGAVTGNPRIRTRSTLVGKIQ 206 G A++ E LV +D DA+L +DA ++V + + RVGAVTGNPRIR R TL+GKIQ Sbjct: 117 AGLVASRGEILVTVDADAVLAKDALRFLVWHFVAPGSERVGAVTGNPRIRNRGTLLGKIQ 176 Query: 207 VGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLN 266 V EY+SIIGLIKR QR+ G + TVSGVIAAFR+ AL + G W +DM+T+DI +SWKL+ Sbjct: 177 VLEYASIIGLIKRAQRVLGKIMTVSGVIAAFRKRALVDCGMWDEDMVTDDIAVSWKLERR 236 Query: 267 QWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEY 326 W I YEPRALC++ PE L+ L +QR RWAQGG EV ++N + LW +N RM PL+ E Sbjct: 237 AWDIRYEPRALCFMWAPERLRSLIRQRARWAQGGVEVLIRNASVLWTWKNRRMIPLYVEE 296 Query: 327 CLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENR 386 L WA+ +V + V I T++A T G L L+Q V+L IE R Sbjct: 297 LLGIAWAYLWVVSLVWTLA--FDVAHGIPWTYVAET---GTWLGLTSLVQTSVALWIEQR 351 Query: 387 YEHN-LTSSLFWIIWFPVIFWMLSLATTLVSFTRVML----MPKKQRARWVSPDRGI 438 YE + L F+ IW+P +WM+ + + + + + A W SPDRG+ Sbjct: 352 YERDSLWRYYFYAIWYPAAYWMIGAFVVVWAVPKACWAMWAARRGRYATWKSPDRGV 408 >UniRef50_A4J4W8 Glycosyl transferase, family 2 n=3 Tax=Clostridia RepID=A4J4W8_DESRM Length = 425 Score = 369 bits (948), Expect = e-100, Method: Composition-based stats. Identities = 140/414 (33%), Positives = 231/414 (55%), Gaps = 8/414 (1%) Query: 30 MMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKN 89 + F F +P MSI+W++GG+YFW YRER + P ++I+IPC NEE + Sbjct: 14 IGYFTFLYPVTMSIIWVIGGLYFWWYRERKARTNNW-----PEVWPPVTILIPCHNEEIS 68 Query: 90 VEETIHAALAQRYENIEVIAVNDGSTDKTRAILDR-MAAQIPHLRVIHLAQNQGKAIALK 148 + T HA Y ++ V+ ++D STD T I+ + ++P ++ L NQGKA AL Sbjct: 69 IATTCHALSKVNYPDLRVVFIDDASTDNTAQIIREWLRQEVPSFHLLRLTTNQGKAKALN 128 Query: 149 TGAAAA-KSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQV 207 G A + V ID D L+ D +++ P + PR+GAV+GNP + R L+ +Q Sbjct: 129 CGLQVAVHTPITVVIDADTLITPDTLKWLIAPFIRQPRLGAVSGNPLVGNRENLLENLQT 188 Query: 208 GEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQ 267 E++SI+GLIKR+QR G + TVSG I AF L ++G +S TEDIDI+W +Q N Sbjct: 189 AEFASILGLIKRSQRSLGRMLTVSGCITAFCTDTLRQLGGFSSRSATEDIDITWAIQRNF 248 Query: 268 WTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYC 327 + +++EPRA+ +I +P+T+K W QR RWA GG + + R+WP++ ++ Sbjct: 249 YEVWFEPRAIAYIQVPKTIKEFWHQRCRWALGGWHLLRSHWDIFTHWRWRRLWPVYLDFV 308 Query: 328 LTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRY 387 ++ +W+F ++G +++ V +P + G ++ +CL+QF +L+ +R+ Sbjct: 309 ISYLWSFCLVIGTLLWLVTYL-IPSKPAIGLTPIPAWYGSVISFVCLVQFGAALLANHRH 367 Query: 388 EHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRGILRG 441 +H + S FWI W+P+ F+ + T + + R + +W SP R ++G Sbjct: 368 DHKMYQSFFWIPWYPIFFFCIGALTVVWTSCRGLFGDLVTVGKWKSPARSSMQG 421 >UniRef50_C9RCL0 Glycosyl transferase family 2 n=1 Tax=Ammonifex degensii KC4 RepID=C9RCL0_AMMDK Length = 429 Score = 359 bits (923), Expect = 9e-98, Method: Composition-based stats. Identities = 147/409 (35%), Positives = 228/409 (55%), Gaps = 6/409 (1%) Query: 29 LMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEK 88 L+ ++F +PF MS++W+ G+YFW RER P E P L P ++I+IPC NE Sbjct: 9 LLGWYIFLYPFAMSVLWVTAGLYFWWRRERK-PEREKKFWPDL--WPPVTILIPCHNEAA 65 Query: 89 NVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALK 148 ++ T A Y + V+ ++D STD+T ++ R P+ ++ LA+NQGKA AL Sbjct: 66 SIAATATALQFLDYPDYRVVFIDDASTDETAEVIRRFLPSNPNFHLLRLAENQGKAQALN 125 Query: 149 TGAAAA-KSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQV 207 A A + V ID D LL A Y+V P PR+GAVTGNP R L+ K+Q Sbjct: 126 CALATAVTTPITVVIDADTLLAPQALKYLVAPFCRQPRLGAVTGNPIALNRKNLLEKLQA 185 Query: 208 GEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQ 267 E++SIIGLIKR QR+ G V TVSG IAA+R L EVG +S TEDIDI+W++Q Sbjct: 186 AEFASIIGLIKRAQRVLGRVLTVSGCIAAYRTEVLREVGGFSSRTATEDIDITWQIQRRF 245 Query: 268 WTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYC 327 + +++ P+A+ +I P LK WKQR RWA GG + + R ++P++ E+ Sbjct: 246 YEVWFAPQAVAFIQCPSRLKEYWKQRRRWALGGWHLLRTHKDIFKRWRWRYLYPVYLEFV 305 Query: 328 LTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRY 387 L+ +W+F + G +++ V + + I + G +L C++Q +L++ RY Sbjct: 306 LSYLWSFAFVFGTLLWLVTRLFWGQPVGFSPIPVWY--GAMLSLACIVQIATALVLNRRY 363 Query: 388 EHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 + L ++FW+ W+P+ F++ T + + + + Q +W SP+R Sbjct: 364 DPKLWRTVFWVPWYPLFFFVFGALTVVWTAPKGLFGSLAQAGKWKSPER 412 >UniRef50_Q1IL87 Glycosyl transferase, family 2 n=1 Tax=Candidatus Koribacter versatilis Ellin345 RepID=Q1IL87_ACIBL Length = 422 Score = 344 bits (883), Expect = 4e-93, Method: Composition-based stats. Identities = 138/399 (34%), Positives = 224/399 (56%), Gaps = 16/399 (4%) Query: 31 MRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNV 90 +RF +P MS +WI ++F R R E+ P +SI++P F E + + Sbjct: 26 LRFYGLYPILMSWVWISLSLFF---RRRQ----EDTEMEMSGPAPMVSILVPAFAEAETI 78 Query: 91 EETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTG 150 ++TI A L Y N EVI VND S D T ++ + P +R+++ N+GKA+AL Sbjct: 79 DDTIEALLKLDYPNYEVILVNDCSPDNTAEVVRQY-LDDPRIRLLNKQVNEGKAMALNDA 137 Query: 151 AAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGEY 210 + E LV ID D ++ RD Y+V P RV AVTGNPR+R R +++ +Q E+ Sbjct: 138 LPMCRGEILVVIDADIIVSRDLLNYMV-PHFAGTRVAAVTGNPRVRNRVSILQHLQAVEF 196 Query: 211 SSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTI 270 SSI+ + +R QR+ G V TVSG + A RRSAL E+G ++ M TEDID++W+LQ+ W + Sbjct: 197 SSIVSMQRRAQRVLGRVLTVSGAVFAVRRSALLELGGFTPHMATEDIDLTWRLQMKFWDV 256 Query: 271 FYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTT 330 YEPRA+ W+ +P +L+ LWKQR RWA+G +V ++ + RMWP+F+E + Sbjct: 257 RYEPRAVVWMQVPLSLRELWKQRKRWARGLVQVLKRHREVPTNWKMRRMWPIFYESIFSI 316 Query: 331 IWA--FTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYE 388 +W+ F + + + ++ + P + + G+++ T CLLQ + ++ +Y+ Sbjct: 317 LWSYVFVLMTSYWLISLAVGYAPRGVS----PFPNFWGMMIATTCLLQLFIGAWVDRQYD 372 Query: 389 HNLTSSLFWIIWFPVIFWML-SLATTLVSFTRVMLMPKK 426 + S +++PVI+WML +L T+ + + P + Sbjct: 373 PGIMWSFPEAVFYPVIYWMLMALITSFYTIPALFKKPPR 411 >UniRef50_B4CVJ2 Glycosyl transferase family 2 n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4CVJ2_9BACT Length = 1181 Score = 327 bits (840), Expect = 4e-88, Method: Composition-based stats. Identities = 110/427 (25%), Positives = 179/427 (41%), Gaps = 31/427 (7%) Query: 27 GELMMRFVFFWPFFMSIMWIVG-----GVYFWVYRERHWPWGENAPA-PQLKDNPSISII 80 LM +F F+SI ++V +V W + + P +++ P +SII Sbjct: 744 DRLMFALIFGILRFLSIAFVVAIALGIARVAFVTSLAIWVYFRSKPRGRPIENPPLVSII 803 Query: 81 IPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQN 140 IP +NE+ V TI + LA Y ++E+I V+DGSTD T +++ A +RV+ N Sbjct: 804 IPAYNEQSVVGRTIRSVLANDYPHMEIIFVDDGSTDGTADAVEQEFAGHEKVRVVR-QVN 862 Query: 141 QGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRST 200 GKA AL G +K E +V +D D ++ ++ +PRVGAV GN ++ R Sbjct: 863 GGKASALNHGILVSKGEIIVGLDADTQFRKETITRLIR-HFRDPRVGAVAGNVKVGNRIN 921 Query: 201 LVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDIS 260 L+ + Q EY + + + V V G I A+RR+AL EVG + D + ED+D++ Sbjct: 922 LITRWQALEYITSQNVDRLAYAQLNAVTVVPGAIGAWRRTALDEVGGYLTDTLAEDMDLT 981 Query: 261 WKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMW 320 W+++ W I E A+ P T + +KQR RW+ G + K+ L+R + Sbjct: 982 WRIRRKGWKIETEAGAVALTEAPATTQAFFKQRFRWSFGTLQCLWKHRRALFR------Y 1035 Query: 321 PLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTA--------------- 365 F L T+W F L I V L + + Sbjct: 1036 GWFGWVGLPTLWLFQILFQVIAPLVDLQVLYSLWSFGSSWFSEHYLGIVNQAATPPGALL 1095 Query: 366 --GILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLM 423 +L L + + + + L W+ + L A + R L Sbjct: 1096 QQTLLFYALFYAVELAGAAVAVAIDRDGWRLLPWLFLQRFFYRQLMYAVLWKAVLRAFLG 1155 Query: 424 PKKQRAR 430 + + Sbjct: 1156 ERTGWGK 1162 >UniRef50_Q82DY8 Putative polysaccharide deacetylase/glycosyltransferase n=1 Tax=Streptomyces avermitilis RepID=Q82DY8_STRAW Length = 790 Score = 326 bits (835), Expect = 1e-87, Method: Composition-based stats. Identities = 104/395 (26%), Positives = 178/395 (45%), Gaps = 23/395 (5%) Query: 42 SIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQR 101 ++M ++ ++ +R + WG P +S+I+P +NE++ +E TI++ Sbjct: 408 AMMLVLARTHYRQRNKRRFSWGPEITRP-------VSVIVPAYNEKECIENTINSLAQST 460 Query: 102 YENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVC 161 + IE+I V+DGSTD T I++ A +IP++RV+ +N GK AL G A + +V Sbjct: 461 HP-IEIIVVDDGSTDGTADIVE--AMRIPNVRVLR-QENAGKPAALNNGVRNASYDIVVM 516 Query: 162 IDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQ 221 +DGD + + D +V+P + VGAV GN ++ R T++G Q EY L +R Sbjct: 517 MDGDTVFEADTVRRLVQP-FADDEVGAVAGNAKVGNRDTVIGAWQHIEYVMGFNLDRRMY 575 Query: 222 RIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWIL 281 + + T+ G I AFRR A+ EVG S+D + ED DI+ + W + YE A W Sbjct: 576 DLLRCMPTIPGAIGAFRREAVLEVGGMSEDTLAEDTDITIAMHRGGWRVVYEEHARAWTE 635 Query: 282 MPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFI 341 P +LK LW QR RW+ G + K+ L + F + + F + Sbjct: 636 APGSLKQLWSQRYRWSYGTMQALWKHRKSLTDRGPSGR---FGRVGMPLVVIFQIVTPVF 692 Query: 342 IYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWF 401 + + V ++ A+ A + + + L+ + ++ +++ Sbjct: 693 APLIDVFTV-YSMIFVDFWASLLAWLAVLVVQLVCAAYAFRLDREKYR-------YLLMM 744 Query: 402 PVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 P+ LV + R RW R Sbjct: 745 PLQQLAYRQMMYLVLIHSCITALTGGRLRWQKLKR 779 >UniRef50_A0RGJ4 Glycosyl transferase and polysaccharide deacetylase fusion protein n=66 Tax=Bacillus RepID=A0RGJ4_BACAH Length = 1119 Score = 324 bits (830), Expect = 5e-87, Method: Composition-based stats. Identities = 102/407 (25%), Positives = 174/407 (42%), Gaps = 18/407 (4%) Query: 30 MMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKN 89 ++ +F+ + I V +YF ++R P +S++I +NEEK Sbjct: 717 ILTTIFYVAIGLGIFRFVFLIYFAWKQKRKTLSRYIH----SSYQPFVSVVIAAYNEEKV 772 Query: 90 VEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKT 149 + +TI + L +Y EVI V+DGSTD T ++ + P +R I +N GK+ A+ Sbjct: 773 IAKTIRSILDSKYGEFEVIVVDDGSTDGTSKVMQETFYKHPKVRFI-QKENGGKSSAMNL 831 Query: 150 GAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGE 209 G ++ E +V +D D ++ +DA + +V + V AV+GN ++ R L+ Q E Sbjct: 832 GFQQSRGEIIVTLDADTIIAQDAISLMVR-HFEDQNVAAVSGNVKVGNRRNLLTTWQHVE 890 Query: 210 YSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWT 269 Y + L +R + V G I A+R+ + E GY S+D + ED D++ Sbjct: 891 YITGFNLERRAFDELNCITVVPGAIGAWRKKNVVESGYLSEDTLAEDTDLTITFLRQGHR 950 Query: 270 IFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLT 329 I YE +A + PE +K L KQR RW+ G + K+ L ++ + + L Sbjct: 951 IVYEEKAYAFTESPEDVKSLIKQRYRWSYGTLQCLWKHRKALCNSKHKTLGFI----ALP 1006 Query: 330 TIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEH 389 +W F ++ FI A + + I L G L+ + SL E Sbjct: 1007 NMWLFQYVLQFI---APFADILMIIGLFSSDPLKVLGFYFVFF-LMDLLASLFAFKLEEE 1062 Query: 390 NLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 N ++W + ++ T V + + W R Sbjct: 1063 NPK----PLVWLILQRFIYRQFMTYVVVKSIFSSIRGIAVGWNKLKR 1105 >UniRef50_C0WP52 Possible VGP family vectorial glycosyl polymerization protein n=5 Tax=Lactobacillus RepID=C0WP52_LACBU Length = 504 Score = 323 bits (828), Expect = 9e-87, Method: Composition-based stats. Identities = 126/403 (31%), Positives = 219/403 (54%), Gaps = 15/403 (3%) Query: 40 FMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALA 99 F+S+ + G F+ + + P + K+ +S+++P NE + E + + Sbjct: 113 FLSVTPSLIGSIFFAIQAKQVPL---VNLYKGKEAELVSVLVPAHNEADTLGEVVDSIAN 169 Query: 100 QRYENIEVIAVNDGSTDKTRAILDRMAAQIPH---LRVIHLAQNQGKAIALKTGAAAAKS 156 Y IE+I +NDGS D+T A++ R+ + ++V+ + N+GKA AL GA A+ Sbjct: 170 LVYPRIELILINDGSQDETLAVMHRLVQKYSKHFVVKVVDIQPNKGKANALNYGAQVAEG 229 Query: 157 EYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGL 216 ++L+C+D D +D++ ++ NP+VGAV G P +R R++++G++++ EY +I + Sbjct: 230 DFLLCLDADCYVDKNVLNPMMAQFYNNPKVGAVAGKPIVRNRTSILGRLELLEYVGVIDI 289 Query: 217 IKRTQR-IYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPR 275 IKR Q + G + TVSGV+ A+R++ALA VG+W+ + ITEDID++W+L +QW + Y P+ Sbjct: 290 IKRGQAFVTGRITTVSGVVVAYRKAALANVGWWNSEAITEDIDVTWRLYHHQWQVAYCPQ 349 Query: 276 ALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFT 335 A+ WIL+P+ +K L QR RWA+GG EV +N L W L + ++ +WA + Sbjct: 350 AVAWILVPDHIKDLIHQRRRWARGGFEVLFRNRGMLVTGRLSEQW-LLLDMIISDLWAIS 408 Query: 336 CLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSL 395 C + Y +A ++ LL + L+QF+V + + L Sbjct: 409 CSFVTLFYLFTVAF------TGYLIMDGVVLFLLLIISLVQFLVGYTDSKKSDFINWRDL 462 Query: 396 FWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRGI 438 + + V +W ++L + L + L P+ W SPDRG+ Sbjct: 463 LLLPLYVVFYWFINLISCLTALGSFFLDPR-HVGSWHSPDRGL 504 >UniRef50_B5HQE3 Bifunctional transferase/deacetylase n=3 Tax=Streptomyces RepID=B5HQE3_9ACTO Length = 900 Score = 320 bits (821), Expect = 6e-86, Method: Composition-based stats. Identities = 100/397 (25%), Positives = 170/397 (42%), Gaps = 23/397 (5%) Query: 42 SIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQR 101 +M I+ ++ + +R + WG P +++I+P +NE++ + T+ + Sbjct: 514 GMMLILARRHYRLRNKRRFSWGPTVTRP-------VTVIVPAYNEKECIANTLESLSKST 566 Query: 102 YENIEVIAVNDGSTDKTRAILDRMAAQIP--HLRVIHLAQNQGKAIALKTGAAAAKSEYL 159 + IE+I V+DGS+D T I A + ++RVI N GK AL G A + + Sbjct: 567 HP-IEIIVVDDGSSDGTSEISREAARALGMTNVRVIR-QDNAGKPAALNNGVRNASYDIV 624 Query: 160 VCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKR 219 + +DGD + + DA +V+P +P VGAV GN ++ R T++G Q EY L +R Sbjct: 625 IMMDGDTVFEPDAVHQLVQP-FADPEVGAVAGNAKVGNRDTVIGAWQHIEYVMGFNLDRR 683 Query: 220 TQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCW 279 + + T+ G I AFRR A+ +VG S+D + ED DI+ + + Y+ A W Sbjct: 684 MYDLLRCMPTIPGAIGAFRREAVLQVGGMSEDTLAEDTDITIAIHRAGRRVVYQEHARAW 743 Query: 280 ILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVG 339 P +LK LW QR RW+ G + K+ L K F + + F + Sbjct: 744 TEAPGSLKQLWSQRYRWSYGTMQALWKHRKSLTDKGPSGR---FGRVGMPLVVIFQIVTP 800 Query: 340 FIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWII 399 + + I + AA + + +QF+ + + +++ Sbjct: 801 VFAPLIDVFTAYSMIFIDFRAALY----AWLAVLGIQFVCAAYAFRLDKEKYR----YLL 852 Query: 400 WFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 P+ LV + R RW R Sbjct: 853 MMPLQQLAYRQMMYLVLIHSCITALTGGRLRWQKLKR 889 >UniRef50_UPI0001B4D705 bi-functional transferase/deacetylase n=3 Tax=Streptomyces RepID=UPI0001B4D705 Length = 700 Score = 315 bits (808), Expect = 2e-84, Method: Composition-based stats. Identities = 109/408 (26%), Positives = 187/408 (45%), Gaps = 17/408 (4%) Query: 31 MRFVFFWPFFMSIMWIVGGVYFWVYRERHW-PWGENAPAPQLKDNPSISIIIPCFNEEKN 89 M +V + ++ +V V+F R + +P + ++P +++I+P +NEE Sbjct: 291 MAWVLAIAGALGLLRLVTLVFFARAHVRRLTRFRPGSPWLREVNDP-VTVIVPAYNEEAG 349 Query: 90 VEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKT 149 +E T+ + LA + ++++I V+DGSTD+T + + P + VI N GKA AL T Sbjct: 350 IEATVRSLLASTHPHLQIIVVDDGSTDRTADLATWI--DDPRISVIR-QINSGKATALNT 406 Query: 150 GAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGE 209 G A A + +V +D D + + DA +++P+ +P +GAV+GN ++ R +L+G+ Q E Sbjct: 407 GLAHAAHDIVVMVDADTVFEADAVHQLIQPL-AHPAIGAVSGNTKVGNRRSLLGRWQHLE 465 Query: 210 YSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWT 269 Y L +R + + TV G I AFRR AL VG S+D + ED D++ L W Sbjct: 466 YVFGFNLDRRMFEVLECMPTVPGAIGAFRRDALMGVGGVSEDTLAEDTDLTMSLWRAGWR 525 Query: 270 IFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLT 329 + YE A+ W +P +L+ LW+QR RW G + K+ F L+ Sbjct: 526 VVYEETAVAWTEVPTSLRQLWRQRYRWCFGTLQSMWKHRRAAVELGPAGR---FGRRGLS 582 Query: 330 TIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEH 389 + F L+ I V L + + L+ + + L + R + Sbjct: 583 YLVLFQVLLPLIAPIVDLFALYGALFLSPVQSAGIWCAFLAVQLICAGYAL-----RLDG 637 Query: 390 NLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRG 437 SL+ + + ++ L + S +L R W R Sbjct: 638 ERMRSLWAMPFQLFVYRQLMYLVVIQSVVAALL---GARLTWHRMHRS 682 >UniRef50_D2LDP7 Polysaccharide deacetylase n=1 Tax=Rhodomicrobium vannielii ATCC 17100 RepID=D2LDP7_RHOVA Length = 1170 Score = 315 bits (807), Expect = 2e-84, Method: Composition-based stats. Identities = 97/424 (22%), Positives = 180/424 (42%), Gaps = 30/424 (7%) Query: 27 GELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNE 86 L F+ + + ++ G+ + R R P D+ S+S++IP FNE Sbjct: 721 NTLTTLFIAAIALGLGRLVVLCGLAAYGNRRRKRRAD-----PPYADDLSVSVLIPAFNE 775 Query: 87 EKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIA 146 K + +I LA ++ +EVI ++DGSTD T ++ A P + ++H N GKA A Sbjct: 776 AKVITASIRQILASSHQKLEVIVIDDGSTDGTADVVRGEFADDPRVSLMHTP-NGGKARA 834 Query: 147 LKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQ 206 + A A + +V +D D + + +V +P+VGAV GN ++ R ++ + Q Sbjct: 835 INLALAQATGDIVVVLDADTQFEPLTISRLVR-WFADPKVGAVAGNAKVGNRINVLTRWQ 893 Query: 207 VGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLN 266 EY + L +R + V G + A+RR A+ +G + + + ED D++ +Q Sbjct: 894 ALEYITAQNLERRALATLDCITVVPGAVGAWRREAIMGLGGFPSNTLAEDQDLTISVQRA 953 Query: 267 QWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEY 326 + + ++ AL W P+TL GL KQR RWA G + K+ + + + Sbjct: 954 GYKVLFDADALAWTEAPDTLGGLAKQRFRWAFGTLQCLWKHRSANLNPRYGALGMI---- 1009 Query: 327 CLTTIWAFTCLVGFIIYA-------------VQLAGVPLNIELTHIAATHTAGILLCTLC 373 L +W F + I + + T T + T+ Sbjct: 1010 ALPQVWLFQIALALISPLVDLLLLVQVVRTGIDYLQHGSQFNSENFTITLTYYAVFMTVD 1069 Query: 374 LLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVS 433 L +++ ++E R + +L L+W++ + + S + + + + W Sbjct: 1070 LSAALIAFLLEKREDRSL---LWWLVLQRFGYRQVMYYVVAKSVVKAL---QGRVVGWGK 1123 Query: 434 PDRG 437 +R Sbjct: 1124 LERK 1127 >UniRef50_D2AU81 Polysaccharide deacetylase n=1 Tax=Streptosporangium roseum DSM 43021 RepID=D2AU81_STRRD Length = 752 Score = 309 bits (792), Expect = 1e-82, Method: Composition-based stats. Identities = 115/409 (28%), Positives = 181/409 (44%), Gaps = 31/409 (7%) Query: 41 MSIMWIVGGV-----------YFWVYRER-HWPWGENAPAPQLKDNPSISIIIPCFNEEK 88 MS + +V GV WV+ R AP + P++++I+P +NE Sbjct: 356 MSWILVVAGVITLLRLLFFLVLAWVHARRVRGGKRRAGRAPAWPEPPAVTVIVPAYNEAA 415 Query: 89 NVEETIHAALAQRYEN-IEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIAL 147 +E T+ + + Y +EV+ V+DGS+D T AI + +P +RVI +N GK AL Sbjct: 416 GIEATVRSLVNTDYPGVLEVVVVDDGSSDDTAAIAASLG--LPGVRVIR-QENGGKPSAL 472 Query: 148 KTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQV 207 TG A A + LV +DGD + + ++V P+ +P VGAV+GN ++ R ++G+ Q Sbjct: 473 NTGIAHASHDILVMVDGDTVFEPATIGHLVRPL-SDPAVGAVSGNTKVGNRRGMIGRWQH 531 Query: 208 GEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQ 267 EY L +R + G + TV G I AFRRSAL E+G S D + ED D++ + Sbjct: 532 IEYVIGFNLDRRAFDLLGCMPTVPGAIGAFRRSALQEIGGVSVDTLAEDTDLTMAMCRGG 591 Query: 268 WTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYC 327 W + YE AL W P +L LW+QR RW G + K+ + F C Sbjct: 592 WRVVYEENALAWTEAPTSLSQLWRQRYRWCYGTLQAMWKHRRAITEPSPFGRR------C 645 Query: 328 LTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRY 387 L + F ++ + V + V + + L+Q R Sbjct: 646 LGYLTLFQVVLPLLAPVVDVMAV----YSVVMGDPLPVVAVWAGFVLVQAFSG-WYALRL 700 Query: 388 EHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 + S L+ + ++ L + S +L R RW + R Sbjct: 701 DRERASVLWVLPLQQFVYRQLMYLVVIQSVATAVL---GVRLRWQTIRR 746 >UniRef50_B5HP10 Bifunctional transferase/deacetylase n=9 Tax=Streptomyces RepID=B5HP10_9ACTO Length = 741 Score = 309 bits (791), Expect = 2e-82, Method: Composition-based stats. Identities = 109/423 (25%), Positives = 197/423 (46%), Gaps = 33/423 (7%) Query: 26 SGELMMRFVFFWPFFM---SIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIP 82 + L++ + ++M ++ G++ R + + WG P +S+++P Sbjct: 324 TDVLVVGLSIIGSLVIGRFALMLLLSGIHARRVRRKGFRWGAPVTQP-------VSVLVP 376 Query: 83 CFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQG 142 +NE K +E T+ + +A + IEV+ ++DGS+D T I++ M +P++RV+ N G Sbjct: 377 AYNEAKCIENTVRSLMASEHP-IEVLVIDDGSSDGTARIVEAMG--LPNVRVVR-QLNAG 432 Query: 143 KAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLV 202 K AL G A A+ + +V +DGD + + +V+P +PRVGAV GN ++ R +L+ Sbjct: 433 KPAALNRGLANARHDIIVMMDGDTVFEPSTVRELVQP-FGDPRVGAVAGNAKVGNRDSLI 491 Query: 203 GKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWK 262 G Q EY L +R + + T+ G + AFRRSAL VG SDD + ED DI+ Sbjct: 492 GAWQHIEYVMGFNLDRRMYDVLRCMPTIPGAVGAFRRSALERVGGMSDDTLAEDTDITMA 551 Query: 263 LQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPL 322 + + W + Y +A W PE+++ LW QR RW+ G + K+ L + Sbjct: 552 MHRDGWRVVYAEKARAWTEAPESVQQLWSQRYRWSYGTMQAIWKHRRALVERGPSGR--- 608 Query: 323 FFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLM 382 F L + F + + + + + + T A + + + ++ + Sbjct: 609 FGRVGLPLVSLFMVVAPLLAPLIDVFLL-YGVVFGPTQKTIVAWLGVLAIQVVCAAYAFR 667 Query: 383 IEN-RYEHNLTSSLFWIIWFPVIFWML--SLATTLVSFTRVMLMPKKQRARWVSPDR-GI 438 ++ R H ++ L I++ +++ +L S T L R RW R G+ Sbjct: 668 LDRERMTHLISLPLQQILYRQLMYVVLLQSWITALTG----------GRLRWQKLRRTGV 717 Query: 439 LRG 441 + Sbjct: 718 VEA 720 >UniRef50_Q38W83 Putative glycosyl transferase, family 2 n=1 Tax=Lactobacillus sakei subsp. sakei 23K RepID=Q38W83_LACSS Length = 439 Score = 309 bits (791), Expect = 2e-82, Method: Composition-based stats. Identities = 135/432 (31%), Positives = 220/432 (50%), Gaps = 21/432 (4%) Query: 19 LCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSIS 78 L + YF ++++ + +P + +IV +Y+W+ E+ E+ P K P I+ Sbjct: 2 LLIDYFT--QILIYTLVSYPIIGGVSFIVSSLYYWLLMEK-----EDQPRYLKKGTPFIT 54 Query: 79 IIIPCFNEEKNVEETIHAALAQ-RYE--NIEVIAVNDGSTDKTRAILDRMAAQI-PHLRV 134 I++P NE+ ++E TI Q Y E+I VNDGSTDKT IL+ + A+ LR Sbjct: 55 ILVPAHNEKASIEATIDHLATQMNYPTDQYEIIVVNDGSTDKTGIILETLQAKYGQRLRT 114 Query: 135 IHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYN--PRVGAVTGN 192 + + N+GKA + AK E+++ D D+ + DA + ++GAVTGN Sbjct: 115 VTIINNRGKAAGFNSALGFAKGEFILSNDADSKPEVDALWKYMYYFEREGGAQLGAVTGN 174 Query: 193 PRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDM 252 ++TLV + Q E +SIIGLIKR+Q YG +F SG +R+SA+ +VG W + Sbjct: 175 MLSINKTTLVAEAQQNELNSIIGLIKRSQLSYGGLFAFSGANTMYRKSAVIDVGGWQAEQ 234 Query: 253 ITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLW 312 TEDI ISW +Q W ++ P ++ +PETL L KQR RW GG V L + Sbjct: 235 PTEDIAISWDMQTAGWQAYFAPHIRFFMDVPETLPELVKQRRRWTSGGIYVLLTKSFGIL 294 Query: 313 RK--ENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLC 370 R ++F M P+ +Y L+ IW+ + + V G +N + + ++ + + Sbjct: 295 RHPIKHFSMVPIIIDYGLSIIWSLFYWISMATFIVLQIGYVINQDWHRLLSSLSIAAIFT 354 Query: 371 TLCLLQFIVSLMIENRYEHNLTSSLFWI---IWFPVIFWMLSLATTLVSF--TRVMLMPK 425 T+ +L ++ L ++ Y ++ SL +I W+ +I+WM++ T V T ++ Sbjct: 355 TIEILVGLIQL-VQASYFNDGGRSLKYIAFAPWYILIYWMVNTYTVAVEIIPTIRQIIRG 413 Query: 426 KQRARWVSPDRG 437 K W SP R Sbjct: 414 KDAGVWKSPKRS 425 >UniRef50_Q2G7Y8 Polysaccharide deacetylase n=1 Tax=Novosphingobium aromaticivorans DSM 12444 RepID=Q2G7Y8_NOVAD Length = 1101 Score = 308 bits (790), Expect = 3e-82, Method: Composition-based stats. Identities = 94/350 (26%), Positives = 160/350 (45%), Gaps = 9/350 (2%) Query: 17 IPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPS 76 + + VA F + + ++ +F W FF++I + W + A P+ Sbjct: 673 VRVDVAAFLALDGLVT-LFSWLFFVAIALGIARAVIMAGL--AWWQSRSPRAEPPAFEPT 729 Query: 77 ISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIH 136 +S+IIP +NEE+ + ++ LA Y ++VI +DGS D T A++ R P + ++ Sbjct: 730 VSVIIPAWNEERVIAASVERVLASDYPALQVIVADDGSKDATSAVVARHFGHDPRVTLLT 789 Query: 137 LAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIR 196 LA N GKA AL A E ++ +D D + + +PR+GAV G+ R+ Sbjct: 790 LA-NGGKAAALNRALRHATGEVVIALDADTQFEPLTIRRLAR-WFADPRIGAVAGDARVG 847 Query: 197 TRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITED 256 R LV + Q EY + L +R + + V G + A+RR+AL VG + ++ + ED Sbjct: 848 NRVNLVTRWQAVEYITAQNLERRALAGFDAMTVVPGAVGAWRRAALDSVGGYPENTLAED 907 Query: 257 IDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKEN 316 D++ +Q W + Y+PRA+ W P+T + L +QR RWA G + K+ + ++ Sbjct: 908 QDLTIAIQRKGWRVTYDPRAVAWTEAPQTFRALARQRYRWAFGTLQCLWKHRKVITSRKP 967 Query: 317 FRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAG 366 + + L W F I + A + I H Sbjct: 968 AGLGLV----GLPQAWLFQIAFAAISPLIDGALIASIISTVVRVVQHGWA 1013 >UniRef50_B0T3D0 Polysaccharide deacetylase n=2 Tax=Caulobacter RepID=B0T3D0_CAUSK Length = 1124 Score = 307 bits (788), Expect = 4e-82, Method: Composition-based stats. Identities = 103/413 (24%), Positives = 170/413 (41%), Gaps = 26/413 (6%) Query: 38 PFFMSIMWIVGGVYFWVYRERHWPWGENAPAP-QLKDNPSISIIIPCFNEEKNVEETIHA 96 F +I + + F W +P + P +S++IPCFNEEK + ++ Sbjct: 714 LFLTAIALGLARLVFLACLALVHRWTHQSPENLDPETGPLVSVLIPCFNEEKVIAASVAR 773 Query: 97 ALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKS 156 L ++N+EV+ ++DGS D T + R P + ++ +N GKA A+ G A AK Sbjct: 774 ILESEWKNLEVLVLDDGSKDNTAQEVRRAHGDDPRVTLLSF-ENGGKARAVNRGLAIAKG 832 Query: 157 EYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGL 216 +Y+V +D D L ++ +P +GAV GN + R +V + Q EY + L Sbjct: 833 DYVVALDADTLFPPKTIGRLIR-WFQDPTIGAVAGNAIVGNRVNMVTRWQALEYVTAQNL 891 Query: 217 IKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRA 276 +R G V V G + A+R+S L +G + D + ED D++ Q W + ++P A Sbjct: 892 ERRALAALGAVTVVPGAVGAWRKSVLDALGGYPSDTLAEDQDLTIACQRAGWKVAFDPAA 951 Query: 277 LCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTC 336 + P+T+ GL KQR RW+ G + K+ L+ + P L IW F Sbjct: 952 QAFTEAPDTVGGLLKQRFRWSFGTLQCVWKHRAALFSPKT----PALGFVALPQIWLFQI 1007 Query: 337 LVGFIIYAVQLAGV-------------PLNIELTHIAATHTAGILLCTLCLLQFIVSLMI 383 L+ V LA V P+ + + L + + + Sbjct: 1008 LLAVAAPLVDLAVVWSLISGVYGAIAHPVEWSPDDTIQGLLYWAVFILVDLSAGALGMAL 1067 Query: 384 ENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 E R + + PV + V V+ + R W +R Sbjct: 1068 EKRAPW------ADLPYLPVQRFGYRQLMYYVVVKSVLTAARGGRVGWGKLER 1114 >UniRef50_C8WBS4 Polysaccharide deacetylase n=4 Tax=Sphingomonadaceae RepID=C8WBS4_ZYMMN Length = 1126 Score = 307 bits (787), Expect = 5e-82, Method: Composition-based stats. Identities = 102/440 (23%), Positives = 185/440 (42%), Gaps = 28/440 (6%) Query: 18 PLCVAYFHSGEL-MMRFVFFWPFFMSIMWIVG---------GVYFWVYRERHWPWGENAP 67 PL S F +++ WI ++ + Sbjct: 686 PLSPVERVSARWNFALFSCLGASVIALRWIFAIAITLGILRALFLSAFSIIQARKENRLI 745 Query: 68 APQLKDNPSISIIIPCFNEEKNVEETIHAALAQ-RYENIEVIAVNDGSTDKTRAILDRMA 126 P + +IS++IP FNEE +E +I LA NIEVI ++DGSTD + I++ Sbjct: 746 FPPIDPERTISVLIPAFNEEAVIEASIRRVLASAEVNNIEVIVIDDGSTDNSSQIVESQF 805 Query: 127 AQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRV 186 A P +++I L+ N GKA AL G AK E ++ +D D + A + +P++ Sbjct: 806 ADDPRVQLIRLS-NGGKARALNHGVQKAKGEIIIALDADTHFEPRTIARLTR-WFSDPKL 863 Query: 187 GAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVG 246 GAV GN ++ R L+ + Q EY + L +R + + V G + A+R L +VG Sbjct: 864 GAVAGNAKVGNRINLITRWQALEYITAQNLERRATVLLNAMTVVPGAVGAWRAETLRQVG 923 Query: 247 YWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLK 306 + D + ED D++ +Q + W + Y+P A+ W PET++ L +QR RWA G + K Sbjct: 924 GFPDQTLAEDQDLTIIIQEHDWAVRYDPYAVAWTEAPETIRALARQRFRWAFGTLQCLWK 983 Query: 307 NMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFI--IYAVQLAGVPLNIELT------- 357 + + + + + + L F + I + + + Sbjct: 984 HWSIIKNRRPKGLAYIGLPQSLIFQIGFATISPIIDLALVISIMATVFAVYQHGWVQQGD 1043 Query: 358 HIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSF 417 + + + L+ +V+ +E + + +L L+ +I + + + + + Sbjct: 1044 DLQKMAAYWSVFTLIDLMSGVVAFALERKEKWSL---LWLLIPQRIGYRQIMYYVVIKAL 1100 Query: 418 TRVMLMPKKQRARWVSPDRG 437 T+ + PK W +R Sbjct: 1101 TQAIRGPKVG---WDKLERS 1117 >UniRef50_C1ACV5 Putative glycosyl transferase/polysaccharide deacetylase n=1 Tax=Gemmatimonas aurantiaca T-27 RepID=C1ACV5_GEMAT Length = 1123 Score = 307 bits (786), Expect = 6e-82, Method: Composition-based stats. Identities = 103/388 (26%), Positives = 168/388 (43%), Gaps = 18/388 (4%) Query: 58 RHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDK 117 + + A + P +S+++P +NE + + T+ + L+Q Y ++EV+ V+DGS+D Sbjct: 734 QRFSKRYARRAADAEWLPRVSVLVPAYNEGRVIGRTVQSVLSQAYPDLEVVVVDDGSSDD 793 Query: 118 TRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIV 177 T A L V+ N GKA AL TG A A E +V ID D +L DA ++V Sbjct: 794 THD-AASHATDDARLHVVR-QTNAGKAAALNTGIAMATGEVIVVIDADTILAPDAIRHLV 851 Query: 178 EPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAF 237 P+ + RVGAV GN ++ R L+ + Q EY + L +R + + V G I A+ Sbjct: 852 RPL-ADARVGAVAGNAKVGNRINLLTRWQAVEYVTSQNLDRRAFVMLNCITVVPGAIGAW 910 Query: 238 RRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWA 297 RRSA+ + G + D + ED D++ L + +A+ PET L KQR RW+ Sbjct: 911 RRSAVLDAGGFRTDTLAEDQDLTLTLLRGGHKVALAEQAVALTEAPETFGALLKQRFRWS 970 Query: 298 QGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQL---------A 348 G + K+ L R++ + + L IW F L + A + Sbjct: 971 FGTLQCAWKHRGALMRRDAGALGMV----GLPNIWLFQLLFPLLAPAADVALLASLARLL 1026 Query: 349 GVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWML 408 + + A LL L LL + ++ +E S ++ + + Sbjct: 1027 LEAPALGVHAAWAHAEPVFLLYALFLLIDTFTAVLGVAFEKGEPLSQALLVPLQRVAYRQ 1086 Query: 409 SLATTLVSFTRVMLMPKKQRARWVSPDR 436 L L+ R + K W +R Sbjct: 1087 VLYVALLKAMRAAV--KGWAPGWGKLER 1112 >UniRef50_A5VMQ3 Glycosyl transferase, family 2 n=38 Tax=Lactobacillales RepID=A5VMQ3_LACRD Length = 443 Score = 304 bits (778), Expect = 6e-81, Method: Composition-based stats. Identities = 119/412 (28%), Positives = 202/412 (49%), Gaps = 13/412 (3%) Query: 37 WPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHA 96 +P ++ W G + + ++R++ E P +L P I+I+IP NEE ++ T+ Sbjct: 30 YPILGAMFWFFGAISYILFRQK----EELPPKAELTPEPFITIMIPAHNEEIVIKSTLEY 85 Query: 97 ALAQ-RYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAK 155 L Q Y N EV+ ++DGS D+T IL M + LRVI + QNQGKA A G AK Sbjct: 86 LLTQLNYHNYEVLVMDDGSDDRTPEILHSMQTKYSRLRVIRIEQNQGKAHAFNIGLFFAK 145 Query: 156 SEYLVCIDGDALLDRDAAAYIVEPMLYNP--RVGAVTGNPRIRTRSTLVGKIQVGEYSSI 213 EY++ D D + + DA ++ A+T N + RS+L GK Q E+SSI Sbjct: 146 GEYILSNDADTIPETDALIKYMQYFTSANGINYAAITANMDVYNRSSLWGKSQTVEFSSI 205 Query: 214 IGLIKRTQRIYGNVFTV-SGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFY 272 +G+IKR+Q N SG +RR L VG + D TEDI I+W T + Sbjct: 206 VGIIKRSQTAINNTMYAYSGANTMYRRDFLINVGGFRQDRATEDISIAWDHTFINATPKF 265 Query: 273 EPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRK--ENFRMWPLFFEYCLTT 330 P + + +PET L++QR RWAQGG+EV+L N +++R + + P+ + L+ Sbjct: 266 APDIVFHMNVPETFHDLYRQRKRWAQGGSEVWLSNFLKVFRHPWQYRFVIPMLTDTTLSI 325 Query: 331 IWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHN 390 IW+F + II+ + + + + ++ L+ + ++ + N Sbjct: 326 IWSFFFWITSIIFIITMLSFAITGNYERVWHGIVMSMIFVNFQLIAGLFQVLAALILDFN 385 Query: 391 LT--SSLFWIIWFPVIFWMLSLATTLVSFTRVMLM-PKKQRARWVSPDRGIL 439 + L + + + W+++ T + +F + + +W+SP+R ++ Sbjct: 386 GSKLRYLMFSPLYLLFTWIVNPLTIVTTFHKAIKTVTGHGSGKWISPERKVV 437 >UniRef50_C0YGU0 Polysaccharide deacetylase/glycosyl transferase, group 2 family protein n=2 Tax=Bacteroidetes RepID=C0YGU0_9FLAO Length = 1132 Score = 302 bits (775), Expect = 1e-80, Method: Composition-based stats. Identities = 100/366 (27%), Positives = 161/366 (43%), Gaps = 10/366 (2%) Query: 16 CIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNP 75 + + Y S L+ F F + + ++ +W ++ER P L+ P Sbjct: 716 LVLATIIYGVSHFLVALFTIFIVLGLIRLLLMA---YWAFKERKKEKKLGEF-PVLESYP 771 Query: 76 SISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVI 135 +SII+P +NEE N+ ++ L Q Y N ++I V+DGS D T P L+ I Sbjct: 772 KVSIIVPAYNEEVNIVSSLQNLLKQTYPNFDIIMVDDGSKDSTYDKAKAAFPDHPKLK-I 830 Query: 136 HLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPML-YNP--RVGAVTGN 192 +N GKA AL G + +EY+VCID D L +DA Y++ L +P ++ AV GN Sbjct: 831 FTKRNGGKATALNFGISQTDAEYVVCIDADTKLQQDAVKYLIARFLNSDPEEKIAAVAGN 890 Query: 193 PRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDM 252 ++ R + K Q EY++ + + + G I AF+RS + E G +S D Sbjct: 891 VKVGNRVNWLTKWQAIEYTTSQNFDRLAYANINAITVIPGAIGAFKRSVVIETGGYSSDT 950 Query: 253 ITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLW 312 + ED DI+ K+ +T+ E RA+ PET+K KQR RW G ++F K Sbjct: 951 LAEDCDITVKILKAGYTVANENRAVAVTEAPETVKQFLKQRFRWTYGIMQMFWKQRQTFL 1010 Query: 313 --RKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLC 370 R + +W + I F + +I + + T+ L Sbjct: 1011 NPRYKGLGLWAMPNILLFQYIIPFFSPLADVIMFFGILSGNGDKIFTYYLIFLLVDASLA 1070 Query: 371 TLCLLQ 376 + + Sbjct: 1071 LIAFIM 1076 >UniRef50_D2L983 Polysaccharide deacetylase n=1 Tax=Desulfovibrio sp. FW1012B RepID=D2L983_9DELT Length = 1140 Score = 302 bits (775), Expect = 1e-80, Method: Composition-based stats. Identities = 99/403 (24%), Positives = 172/403 (42%), Gaps = 30/403 (7%) Query: 49 GVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYE-NIEV 107 V + R + PAP L S+++++P +NEEK V +T+ + LA ++ E+ Sbjct: 744 AVLAVFEKVRGRRRPVSGPAPDL----SVAVVVPAYNEEKVVLQTVQSLLACQHPATFEI 799 Query: 108 IAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDAL 167 I V+DGSTD T +L A L I N GK AL G A +++ +V +D D + Sbjct: 800 IVVDDGSTDATYRVLCEALA-GEKLVTIVTKPNGGKPAALNHGIALTRADIVVTLDADTV 858 Query: 168 LDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNV 227 RD + + +P+VGAV GN ++ R + + Q EY + L +R + +V Sbjct: 859 FARDTILRLAD-WFRDPKVGAVAGNAKVGNRINFLTRCQALEYVTSQNLDRRALTVLDSV 917 Query: 228 FTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLK 287 V G + A+RR + G +S + + ED D++ ++Q T+ YE RA+ P+T++ Sbjct: 918 TVVPGAVGAWRREVVEAAGGFSGETLAEDADLTIRIQRMGHTVAYEDRAVALTEAPDTMR 977 Query: 288 GLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQL 347 G +QR RW G +V K+ L+R L + L IW + I + L Sbjct: 978 GFLRQRFRWMFGTLQVAWKHKDALFRPRYG----LLGFFGLPNIWLYQIFFQIISPVMDL 1033 Query: 348 -------------AGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSS 394 P + + L +L +V+ ++E + L Sbjct: 1034 WLAYTCLKSWVLWLWHPATWDPDALFRVLFYYALFMAADILAGLVAFLLERGEDKRL--- 1090 Query: 395 LFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRG 437 + W + +V+ ++ + + W +R Sbjct: 1091 ---LAWLVPQRFFYRQLMYVVALRTLLASLRGREMGWSKLERK 1130 >UniRef50_B5GIB2 Bi-functional transferase/deacetylase n=7 Tax=Streptomyces RepID=B5GIB2_9ACTO Length = 767 Score = 298 bits (764), Expect = 2e-79, Method: Composition-based stats. Identities = 101/395 (25%), Positives = 174/395 (44%), Gaps = 23/395 (5%) Query: 42 SIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQR 101 +M ++ + R R + WG P +++++P +NE K V T+ + Sbjct: 356 GLMLLLSFAHARRTRRRGFAWGVPVTEP-------VTVLVPAYNEAKCVTATVTSLSRSE 408 Query: 102 YENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVC 161 + +E+I ++DGSTD T I++R+ +P +RV+ +N GK AL G A A + +V Sbjct: 409 HP-VEIIVIDDGSTDDTAGIVERLG--LPGVRVVR-QENAGKPAALNRGIAHASHDIIVM 464 Query: 162 IDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQ 221 +DGD + + +V+P +PRVGAV GN ++ R +L+G Q EY L +R Sbjct: 465 MDGDTVFEPATVRELVQP-FGDPRVGAVAGNAKVGNRDSLIGAWQHIEYVMGFNLDRRMY 523 Query: 222 RIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWIL 281 + + T+ G + AFRR AL VG S+D + ED DI+ + + W + Y +A W Sbjct: 524 DVLRCMPTIPGAVGAFRRDALTRVGGMSEDTLAEDTDITMAIHRDGWRVVYAEKARAWTE 583 Query: 282 MPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFI 341 PE++ LW QR RW+ G + K+ + + F L + F L + Sbjct: 584 APESVAQLWSQRYRWSYGTMQAIWKHRHAVLERGASGR---FGRVGLPLVSLFMVLAPLL 640 Query: 342 IYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWF 401 + + + I T A + + + + + R + L + Sbjct: 641 APLIDVFLL-YGIVFGPTGRTLLAWLGVLAVQAVCAAYAF----RLDKEPMRHLVSLPLQ 695 Query: 402 PVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 +++ L L S+ + R RW R Sbjct: 696 QLLYRQLMYVVLLQSWITALT---GGRLRWQKLRR 727 >UniRef50_A1UIF2 Polysaccharide deacetylase n=3 Tax=Mycobacterium RepID=A1UIF2_MYCSK Length = 789 Score = 297 bits (760), Expect = 7e-79, Method: Composition-based stats. Identities = 112/437 (25%), Positives = 183/437 (41%), Gaps = 36/437 (8%) Query: 18 PLCVAYFHSGELM-------MRFVFFW----PFFMSIMWIVGGVYFWVYRERHWPWGENA 66 P A +H+ LM + F+F+ MS+++++ + YR+ W + Sbjct: 349 PADAATYHTLRLMEKAPGAVLGFLFWLGMGSLTVMSLLYLILALV-CQYRQNRLRWNDIG 407 Query: 67 PAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYEN--IEVIAVNDGSTDKTRAILDR 124 P +S+++ FNEEK + TI Y EV+AVNDGSTD T IL Sbjct: 408 D----DQLPMVSVVLAAFNEEKVIARTIAELRRSDYPRSRFEVVAVNDGSTDGTLRILTE 463 Query: 125 MAAQIPHLRVIHLAQNQGKAIALKTGAAAAKS--EYLVCIDGDALLDRDAAAYIVEPMLY 182 +A P LRV+ N GK+ A+ G A + +V +D D L D + Sbjct: 464 LARDWPKLRVVD-QANSGKSSAINNGINHASAVSTVMVTMDADTLFRPDTIRNLARHFAR 522 Query: 183 N---PRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRR 239 + +VGAV G+ ++ R L+ Q EY S I + + +R+ + V G +A+ R Sbjct: 523 HTHGRQVGAVAGHIKVGNRRNLLTAWQSLEYISGICVTRMAERLLNAISIVPGACSAWSR 582 Query: 240 SALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQG 299 +AL E+G + DD + ED D + LQ + I E A+ PET++ L KQR RW G Sbjct: 583 TALEEIGGFCDDTMAEDCDATLALQRRGYRILQENNAIADTEAPETIRALAKQRKRWTYG 642 Query: 300 GAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHI 359 + K+ L+R + + Y ++ + I A ++ N + Sbjct: 643 NIQALWKHRAMLFRPRYGALGLVALPYAALSLIVPLLFMPLTIVAAGMSLAAGNWQS--- 699 Query: 360 AATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTR 419 AG + ++ M R H L ++ II+ P+ ++L + Sbjct: 700 -IALFAGFVAALHMIISITAVAMARERAWHLLVVPVYRIIYEPLRAYLL--------YAS 750 Query: 420 VMLMPKKQRARWVSPDR 436 K W +R Sbjct: 751 AYRAIKGTIVAWDKLER 767 >UniRef50_Q1IUP8 Polysaccharide deacetylase n=1 Tax=Candidatus Koribacter versatilis Ellin345 RepID=Q1IUP8_ACIBL Length = 1154 Score = 296 bits (759), Expect = 8e-79, Method: Composition-based stats. Identities = 93/438 (21%), Positives = 178/438 (40%), Gaps = 25/438 (5%) Query: 14 VLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKD 73 + +++ G + +F+ FF+ + + G + F + Sbjct: 715 WAAMVASLSFILFGA-VSQFIIAV-FFVGDVLMTGRLVFIGTLAIYDRIRGPRLTADPDY 772 Query: 74 NPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRM-AAQIPHL 132 P+++++IP +NEEK +E T+ + L Y + I ++DGS D T +++++ AA+I Sbjct: 773 RPAVAVLIPAYNEEKVIERTVRSVLDSDYPKLRAIVIDDGSKDATVEVVEQLFAAEIASG 832 Query: 133 RV-IHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTG 191 +V + N GKA AL G E V ID D ++ DA +V P NP++ A+ G Sbjct: 833 KVTLLTKPNSGKAAALNYGLEFVTEEIFVGIDADTIIAPDAIGLLV-PHFQNPKIAAIAG 891 Query: 192 NPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDD 251 N ++ R + Q EY + +R ++G V V G I A+R A+ G + D Sbjct: 892 NAKVGNRVNWWTRWQALEYITSQNFERRALDVFGAVSVVPGAIGAWRTEAVLAAGKYHHD 951 Query: 252 MITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRL 311 + ED D++ L + + + YE AL + P T GL +QR RW+ G + K+ + Sbjct: 952 TVAEDADLTMALLQDGYRVEYEDLALAYTEAPSTANGLMRQRFRWSFGIMQSVYKHRSA- 1010 Query: 312 WRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGV-------------PLNIELTH 358 F+ + L + F L+ + + L + P + + + Sbjct: 1011 -----FKQGGALGWFALPNVVIFQILLPLVSPFIDLMFLFGAGSYAWNRYMHPESTDPSS 1065 Query: 359 IAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFT 418 L + + ++ +E R + + + + + ++V Sbjct: 1066 FHKLVLYFALFLVIDFVASTIAFTLERR-QPGGQKDFWLLAHVWLQRFAYRQLFSIVLIK 1124 Query: 419 RVMLMPKKQRARWVSPDR 436 + + W +R Sbjct: 1125 TLKRAIEGGEFAWDKLER 1142 >UniRef50_D1SD50 Polysaccharide deacetylase n=1 Tax=Micromonospora aurantiaca ATCC 27029 RepID=D1SD50_9ACTO Length = 776 Score = 295 bits (756), Expect = 2e-78, Method: Composition-based stats. Identities = 106/378 (28%), Positives = 169/378 (44%), Gaps = 23/378 (6%) Query: 60 WPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYEN-IEVIAVNDGSTDKT 118 W WG P +S+I+P +NE + + + + + IEV+ V+DGSTD T Sbjct: 408 WSWGPPVTEP-------VSVIVPAYNEREGIAAAVRSLALGDHPGGIEVVVVDDGSTDGT 460 Query: 119 RAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVE 178 I+ + ++P++RV+ N GK AL TG A A+ + +V +DGD + + D+ +V+ Sbjct: 461 ADIVAAL--RLPNVRVVR-KPNGGKPSALNTGVALARHDLIVMVDGDTIFEPDSVRRLVQ 517 Query: 179 PMLYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFR 238 P +P VG V GN ++ R L+ K Q EY L +R + TV G I AFR Sbjct: 518 P-FADPGVGVVAGNVKVGNRRGLIAKWQHIEYVIGFNLDRRLYETLRCMPTVPGAIGAFR 576 Query: 239 RSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQ 298 R AL +VG +DD + ED D++ L W I YE A W P T+ LWKQR RW+ Sbjct: 577 RQALEQVGGMTDDTLAEDTDVTIALGRAGWHIVYEESARAWTEAPTTVGQLWKQRYRWSY 636 Query: 299 GGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTH 358 G + K+ + F CL+ + F L+ + L + + Sbjct: 637 GTLQAMWKHRRSVIDSGRSGR---FGRRCLSFLTLFGVLLPLAAPVIDLLAI-YGLIFLD 692 Query: 359 IAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFT 418 + T A + + L L +++ ++ + W++ P+ ++ LV Sbjct: 693 RSDTVVAWLAMLALQFLTAVLAFRLDR-----EKLGVLWVL--PLQQFVYRQVMYLVLLQ 745 Query: 419 RVMLMPKKQRARWVSPDR 436 V R W R Sbjct: 746 AVGTALTGGRLGWQKLRR 763 >UniRef50_A9EEN3 Glycosyl transferase, family 2 n=1 Tax=Oceanibulbus indolifex HEL-45 RepID=A9EEN3_9RHOB Length = 1088 Score = 294 bits (754), Expect = 3e-78, Method: Composition-based stats. Identities = 110/395 (27%), Positives = 184/395 (46%), Gaps = 25/395 (6%) Query: 30 MMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKN 89 +++ F+ F + I + +++ R RH P P K PS++I+IP +NE + Sbjct: 680 LLQIAFWTVFAIGICRSISLLFWAARRRRHAP-------PLSKHEPSVTIVIPAYNEARV 732 Query: 90 VEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKT 149 +E+ I AL Y + ++I V+DGSTD T A P + V+ N+GKA AL Sbjct: 733 IEKCIRKALYSEYGDFDIIVVDDGSTDDTYEKAISFAY-HPLVTVLR-QPNRGKAAALNA 790 Query: 150 GAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGE 209 A+SE L+CID D+ + DA + ++ +P+VGAV G + R L+ ++Q E Sbjct: 791 ALDEAQSEILICIDADSQIAPDAVS-LLAAHFKDPKVGAVAGRVVVGNRDNLLTRLQALE 849 Query: 210 YSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWT 269 Y + + +R + + V G I A+R +AL E G +S + +TED D++ + + + Sbjct: 850 YITAQAVERRAKEYLNAITVVPGAIGAWRTTALMEAGIFSTETLTEDADMTMAMIRSDYQ 909 Query: 270 IFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLT 329 + YE RA+ P +L L QRLRW+ G + K++ + N + L Sbjct: 910 VIYEDRAVATTETPRSLSALMTQRLRWSLGMMQAGWKHLGATVERRNLGL------VALP 963 Query: 330 TIWAFTCLVGFIIYAVQLAGVPLNIELT--------HIAATHTAGILLCTLCLLQF-IVS 380 + F L+ I L V L IE A+ T +++ L L I S Sbjct: 964 DLVVFGYLMPLIAPLADLFLVLLVIEFFTNIGATDQDFASVITNPLIIAYLALPALEIAS 1023 Query: 381 LMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLV 415 +I R + L ++ IF+ L +++ Sbjct: 1024 AVIAFRLDPTEDRRLLLLLPVQRIFYRQVLYVSVI 1058 >UniRef50_A7GCY8 Glycosyl transferase, group 2 family n=66 Tax=Firmicutes RepID=A7GCY8_CLOBL Length = 420 Score = 294 bits (754), Expect = 4e-78, Method: Composition-based stats. Identities = 96/428 (22%), Positives = 198/428 (46%), Gaps = 20/428 (4%) Query: 19 LCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSIS 78 + + + + F++I+ + G +++ G + P +S Sbjct: 2 ESLNWVDYLFIFSLVSIWMLLFVNIILSLAGYRYYLKTLNSELKGLKN-----EKYPKVS 56 Query: 79 IIIPCFNEEKNVEETIHAALAQRYE--NIEVIAVNDGSTDKTRAILDRMAAQIP--HLRV 134 I++P NEEK + T+ + L Y +E+I +ND S+D T+ IL ++ + + ++ Sbjct: 57 ILVPAHNEEKVIGRTVKSILLLNYPKDKMELIVINDNSSDNTKKILKQIQKEYRSYNFKI 116 Query: 135 IHLAQ---NQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTG 191 I+ +GK+ AL G + +++ D D D++A Y++E ++ + +GAV G Sbjct: 117 INTDNITGGRGKSNALNIGYKHSSGDFIAVYDADNTPDKNALKYLMETIIEDEHLGAVIG 176 Query: 192 NPRIRTRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSD 250 R R + ++ + E S + + + N+ T+ G R++ + ++ W Sbjct: 177 KFRTRNKDRNMLTRFINIETLSFQWMCQAGRWNLLNLCTIPGTNFVVRKNIIQKLNGWDP 236 Query: 251 DMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTR 310 I ED +IS+++ + I + P ++ W PE LK +KQR RWA+G V LK Sbjct: 237 KAIAEDTEISFRIYELGYKIKFVPYSVTWEQEPENLKVWFKQRTRWAKGNIYVLLKYFKN 296 Query: 311 LWRKENF-RMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILL 369 +++ + ++ +F+ + + ++ + ++ I++ V G+ LNI L I + IL Sbjct: 297 MFKGTSKDIIFDIFYFFSVYFLFLSSVIISDILFIV---GIFLNINLHVIGNFNVLWILA 353 Query: 370 CTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVI-FWMLSLATTLVSFTRVMLMPKKQR 428 L +L+ ++L +E + L I++F WM+ ++ + R L K+ Sbjct: 354 YVLFVLEVSLTLTLEKGESNKENLILVPIMYFTYCQMWMIVALRGIIQYIRDKLF--KKE 411 Query: 429 ARWVSPDR 436 +W +R Sbjct: 412 IKWYKTER 419 >UniRef50_C1F4G9 Polysaccharide deacetylase domain protein/glycosyl transferase, group 2 family protein n=1 Tax=Acidobacterium capsulatum ATCC 51196 RepID=C1F4G9_ACIC5 Length = 1170 Score = 294 bits (753), Expect = 5e-78, Method: Composition-based stats. Identities = 107/409 (26%), Positives = 180/409 (44%), Gaps = 47/409 (11%) Query: 31 MRFVFFW-PFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKN 89 + FVFF MS ++ G++ + R R P P+++I+IP +NEEK Sbjct: 748 LIFVFFVGDVLMSGRLLIIGLFALIERFR------TRRIPPGVYEPAVAILIPAYNEEKV 801 Query: 90 VEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQ---IPHLRVIHLAQNQGKAIA 146 + TI +AL Y ++ V+ ++DGSTD+T + AQ L V+ N GKA A Sbjct: 802 IVRTIRSALNSDYPHLHVVVIDDGSTDRTLEVAREAYAQEIADGKLTVL-TKPNAGKAEA 860 Query: 147 LKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQ 206 L G + E V ID D ++ DA + +V +P+VGAV GN ++ + L + Q Sbjct: 861 LNFGLRQIREEVYVGIDADTVIAVDAVSKLVR-HFADPKVGAVAGNAKVGNKVNLWTRWQ 919 Query: 207 VGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLN 266 EY + +R ++ V V G I A+R +A+ G + + + ED D++ L Sbjct: 920 ALEYITGQNFERRALDLFNVVTVVPGAIGAWRTAAVLAGGCYPLNTVAEDADLTMNLVEQ 979 Query: 267 QWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEY 326 + + YE AL + P GL +QR RW+ G + K+ FR + Sbjct: 980 GYKVIYEDHALAFTEAPINANGLMRQRFRWSFGTLQAVFKHRQA------FRTNRAMGFF 1033 Query: 327 CLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENR 386 L I F L+ + L L + L+QF++ N+ Sbjct: 1034 ALPNIVVFQILLPLASPFIDL---------------------LFAVSLIQFLI-----NK 1067 Query: 387 YEHNLT---SSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWV 432 + H T +S ++ + + F ++ T+L++F+ P + W+ Sbjct: 1068 HYHPETASAASFDKLLIYFLAFIVIDFFTSLLAFSLEPRHPANKGDGWL 1116 >UniRef50_B6BG61 Glycosyl transferase, group 2 family protein n=2 Tax=Rhodobacterales RepID=B6BG61_9RHOB Length = 1140 Score = 293 bits (750), Expect = 1e-77, Method: Composition-based stats. Identities = 95/394 (24%), Positives = 176/394 (44%), Gaps = 17/394 (4%) Query: 48 GGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEV 107 G+ R + L+ P ++++IP NE K + ++I + A Y+N+E+ Sbjct: 747 AGILILALLNRRERFT------PLQRQPKVAVVIPAHNEAKVIAQSIESVRASGYKNLEI 800 Query: 108 IAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDAL 167 I V+DGSTD T + + + +R+I NQGK AL + +E+ VCID D Sbjct: 801 IVVDDGSTDDTLLEVLKFGHK-SEVRLIS-QPNQGKWSALNRAIQSTDAEFAVCIDADTQ 858 Query: 168 LDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNV 227 + +DA ++V +P+ GAV G R L+ ++Q EY++ + ++ + + Sbjct: 859 VCKDAITHLVR-HFADPKTGAVAGKIIAGNRVNLLTRLQAFEYATSQNIERKAFDLINGI 917 Query: 228 FTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLK 287 V G I A+R AL + G++S++ +TED D++ ++ + + +EP+A + +PE + Sbjct: 918 LVVPGAIGAWRVEALRKAGFFSEETLTEDTDLTIQVNRAGYNVVFEPKAKAYTEVPENVG 977 Query: 288 GLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLF----FEYCLTTIWAFT-CLVGFII 342 L KQRLRW+ G + K+ + + + + F Y + V F++ Sbjct: 978 QLLKQRLRWSLGMFQSAWKHKRAIIEGRSIGLVSISDMFVFGYVFPLLAPIADLFVIFML 1037 Query: 343 YAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFP 402 Y + G ++ T T L L+ +++ + + SL Sbjct: 1038 YNLMAGGWTGDVGSTQQVQTTQYLWAFLALPALELLIAAIAITTDKDESNWSLLLFPLQR 1097 Query: 403 VIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 + + L + + S R + A W S R Sbjct: 1098 LAYRPLLYFSVIRSILRAVTGR---LANWGSVKR 1128 >UniRef50_UPI0001AEFEC9 N-glycosyltransferase PgaC n=1 Tax=Streptomyces roseosporus NRRL 11379 RepID=UPI0001AEFEC9 Length = 343 Score = 292 bits (749), Expect = 1e-77, Method: Composition-based stats. Identities = 103/303 (33%), Positives = 162/303 (53%), Gaps = 6/303 (1%) Query: 28 ELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEE 87 ++++ F+ F+P W+VGGV F + ER P +++++P +NEE Sbjct: 5 DVLLVFLAFYPVVTGAAWMVGGVMFRLTEER------PRAVDPPGGLPGVTVLVPAYNEE 58 Query: 88 KNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIAL 147 K + + A Y +E++ +NDGSTD T A A + V+ N+GKA L Sbjct: 59 KVIAGCVAALRLVDYPELEILILNDGSTDATVAAARAAAEGDARITVVDDGVNRGKADRL 118 Query: 148 KTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQV 207 G A+ + ++ D D L DA ++V ML +PR A+ G PR+ R +++ +Q+ Sbjct: 119 NEGMRQARHDLVLVTDADTHLHSDAVTHLVRRMLRSPRYAAIAGCPRVTNRGSVIATMQM 178 Query: 208 GEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQ 267 E +S+IGLI+RT + G V TV+GV+ FRR A+ VG + M TEDI+++W+L Sbjct: 179 LETASLIGLIRRTHALAGRVGTVAGVLGLFRRDAVLAVGGYDPKMATEDIELTWRLLQAG 238 Query: 268 WTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYC 327 + P AL + +PETL G+W QR RWA+G EV + + R ++ MWP+ Sbjct: 239 HLTGFAPNALVGMQVPETLSGIWAQRTRWARGQGEVLRTHARTVARPKHAVMWPIAPGGV 298 Query: 328 LTT 330 L T Sbjct: 299 LPT 301 >UniRef50_A1R2V1 Glycosyl transferase, group 2 family domain protein n=4 Tax=Actinomycetales RepID=A1R2V1_ARTAT Length = 431 Score = 291 bits (746), Expect = 3e-77, Method: Composition-based stats. Identities = 105/396 (26%), Positives = 187/396 (47%), Gaps = 17/396 (4%) Query: 43 IMWIVGGVYF-WVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQR 101 I+++ +YF W+YR + L D P +S+I+P +NE + + + LA R Sbjct: 26 IVYVPMALYFDWLYRSICARHRYSV----LSDRPLVSVIVPGYNEAVVITGCVESILASR 81 Query: 102 YENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVC 161 Y +EVI V+DGSTD+T +I++ +A Q +R + N GK AL G AAA + L+ Sbjct: 82 YLRLEVILVDDGSTDETASIMEGLAQQYDRVRFLS-QANAGKGAALNCGIAAALGDILMF 140 Query: 162 IDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQ 221 +D D + D +++E +P+VGAV G+ R L + GL++R Sbjct: 141 VDADGVFAPDTLIHMLEG-FDDPKVGAVCGDDRPVNLDRLQTMMLAILSHVGTGLVRRAL 199 Query: 222 RIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWIL 281 + + VSG I AFR + E+G + +D + ED++++W++ + + ++P+AL + Sbjct: 200 SLMNCLPIVSGNIGAFRSDLVRELGGFHEDTLGEDLELTWRVYKAGYRVRFQPKALVYAE 259 Query: 282 MPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFI 341 P T+ GLW+QR+RW++G + + L + M+ F + T+ L + Sbjct: 260 SPSTMGGLWRQRVRWSRGLLQTLRLHSGMLGSRRYG-MFGAFLVFNAITMVLIPILQLVV 318 Query: 342 IYAVQLAGVPLNIELTHIAATHTAGI-LLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIW 400 + + V + + + +L L+ F V L NR +L L+ + Sbjct: 319 LALLPFLYVAGMGPVPAEVLAILGWLGVFVSLALIVFSVGL---NRSWRDL-RFLWTLPL 374 Query: 401 FPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 +P + LA + ++ + ARW R Sbjct: 375 WPFYSVFVGLALA----SAIVKEIRGSPARWNKLQR 406 >UniRef50_Q1GJ85 Glycosyl transferase family 2 n=4 Tax=Rhodobacteraceae RepID=Q1GJ85_SILST Length = 1002 Score = 291 bits (744), Expect = 4e-77, Method: Composition-based stats. Identities = 95/366 (25%), Positives = 173/366 (47%), Gaps = 10/366 (2%) Query: 74 NPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLR 133 P +++IIP NEEK + I + A Y+N+E+I V+DGS+D T + + + +R Sbjct: 632 TPKVAVIIPAHNEEKVIRSCIQSVRASDYKNLEIIVVDDGSSDNTLNEIFAF-SHMREVR 690 Query: 134 VIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNP 193 +I NQGK AL +E +VCID D +++ A ++V NPR+GAV G Sbjct: 691 LIS-QPNQGKWSALNRALMNTSAEIVVCIDADTQIEKSAIGHMVR-HFDNPRIGAVAGKI 748 Query: 194 RIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMI 253 + L+ ++Q EY++ + ++ + + V G + A+R +AL + G++SD+ + Sbjct: 749 IAGNKVNLLTRLQALEYTTAQNVERKAFDLINGMLVVPGALGAWRVAALRKAGHFSDETM 808 Query: 254 TEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWR 313 TED D++ ++ + I YEP A + +PE + L KQRLRW+ G + K+ ++ Sbjct: 809 TEDTDLTIEVNRAGYRIAYEPLARGYTEVPERIGQLLKQRLRWSFGMFQSAWKHKKAMFE 868 Query: 314 KENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLC 373 + + + + ++ + + A+ L + + LL L Sbjct: 869 GRSVGLISIPDMFIFGYLFPLLAPIADLFVAILLYQMVSGGWDSGAVGAQNMQYLLAYLT 928 Query: 374 L--LQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRA-R 430 L L+F+++ R + S++ ++ FPV + S R +L R Sbjct: 929 LPALEFVIAAFALARDKDE---SMWSLLLFPVQRVLYRPI-LYYSVIRAILRAITGRLFS 984 Query: 431 WVSPDR 436 W + R Sbjct: 985 WGAQKR 990 >UniRef50_Q9CHP7 Glycosyl transferase n=3 Tax=Lactococcus lactis RepID=Q9CHP7_LACLA Length = 354 Score = 287 bits (736), Expect = 4e-76, Method: Composition-based stats. Identities = 120/358 (33%), Positives = 199/358 (55%), Gaps = 19/358 (5%) Query: 29 LMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEK 88 ++ +V ++P +S +W G W + + K+ ISI+I +NEE+ Sbjct: 6 MVDVYVVYYPIILSFIWASGAFL------SRWKERDKKAIDETKEFEKISIVISAYNEEE 59 Query: 89 NVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPH---LRVIHLAQNQGKAI 145 +EE + + Y ++E+ V+D S+D+T L + L ++ +N+GKA Sbjct: 60 TIEEVLLSLRNLNYPSLEIFIVDDKSSDRTLEKLYEVKKSFDDWETLTILEQKENKGKAT 119 Query: 146 ALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKI 205 AL S+Y++ ID D+ L +DA +Y++ + + +GAVTG P +R R+TL+GK+ Sbjct: 120 ALNVALQQVNSKYMLVIDADSYLSKDALSYLLAELTSSSDIGAVTGRPIVRNRTTLLGKL 179 Query: 206 QVGEYSSIIGLIKRTQRIY-GNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQ 264 Q EY S+I IKR Q ++ G + TVSGVI +R AL E+G ++ +++TEDID++W+L Sbjct: 180 QTLEYLSVIDAIKRAQSLFLGAIMTVSGVIVMYRVEALKEIGGFNTEVMTEDIDVTWRLH 239 Query: 265 LNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFF 324 NQW + Y P+AL +IL+PE LK L KQR RWA GG EV + N++ ++++ F+ L Sbjct: 240 RNQWKVKYVPKALSYILVPENLKSLLKQRRRWAVGGVEVLISNLSWVFKRGEFKHRFLLI 299 Query: 325 EYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLM 382 E + IW++ L+ Y +Q+ LN E +G ++ L+ F V + Sbjct: 300 EMICSHIWSWFFLIESYQYFLQML---LNQEF------KVSGQIIVIFVLISFFVFFL 348 >UniRef50_C7MUK1 Glycosyl transferase n=11 Tax=Actinomycetales RepID=C7MUK1_SACVD Length = 1099 Score = 285 bits (729), Expect = 3e-75, Method: Composition-based stats. Identities = 97/362 (26%), Positives = 170/362 (46%), Gaps = 16/362 (4%) Query: 76 SISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVI 135 +S+I+P +NE N+E T+ +A+A + +E+I V+DGSTD T +++ + + +RV+ Sbjct: 342 PVSVIVPAYNEAANIEATVRSAVASTHP-VEIIVVDDGSTDGTADLVEGLG--LSGVRVL 398 Query: 136 HLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRI 195 N+GKA AL TG AAA + +V +DGD + + + +V+P +P VGAV+GN +I Sbjct: 399 RRP-NRGKAAALNTGIAAASYDLIVMVDGDTVFEPNTVHELVQP-FADPEVGAVSGNVKI 456 Query: 196 RTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITE 255 R TL+ ++Q EY + +R + ++ TV G AFRRSAL +VG S + E Sbjct: 457 ANRETLLARLQHIEYVVGFNVDRRVHEVMRSMPTVPGAGGAFRRSALLQVGGLSAQTLAE 516 Query: 256 DIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKE 315 D D++ + W ++ +A+ W P T++ LW+QR RW G + K+ + ++ Sbjct: 517 DTDLTISIGRAGWRTVFQEKAVTWTEAPTTVRQLWRQRFRWTFGTLQALWKHRKAIVQRG 576 Query: 316 NFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLL 375 F + + F L+ I + + V + +L L +L Sbjct: 577 AAGRVGRFG---MLHVVCFQVLLPMIAPVIDVFLV------YGVLFLDPWTTVLLWLTML 627 Query: 376 QFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPD 435 + + ++ W++ P + +V + R RW Sbjct: 628 GIQAAAAAYAFHLDGERKTVLWLL--PAQQLIYRQLMYVVLMQSLAAAASGVRVRWQHMR 685 Query: 436 RG 437 R Sbjct: 686 RS 687 >UniRef50_C1V8R5 Glycosyl transferase n=1 Tax=Halogeometricum borinquense DSM 11551 RepID=C1V8R5_9EURY Length = 567 Score = 282 bits (723), Expect = 1e-74, Method: Composition-based stats. Identities = 107/436 (24%), Positives = 173/436 (39%), Gaps = 29/436 (6%) Query: 11 LCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWP--------W 62 L + L I L G + V P + G V +P Sbjct: 133 LSIWLAIVLSALVLTGGFVFGWSV---PTPYEMAVFAGFVVIAFIVVVVFPLTIVQMRGR 189 Query: 63 GENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYE--NIEVIAVNDGSTDKTRA 120 E+ D P +S+++P +NE V + + + LA Y +EVI ++DGSTD T A Sbjct: 190 EESDHTLDDDDAPLVSVLVPAYNESNYVGDCLDSILASDYPTDRLEVIVIDDGSTDGTYA 249 Query: 121 ILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPM 180 + A + + V H + N GK AL G + ++ + +V +D D++L A VE + Sbjct: 250 --EASAYRNDRVSVFHRS-NGGKHAALNLGLSCSRGDVVVAVDADSILAPSALRTAVEQL 306 Query: 181 LYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRS 240 +PR+GAV G + +VG +Q EY I ++R G V + G + FRR Sbjct: 307 QSDPRLGAVAGTVVVNNADGIVGSVQALEYVLGINTLRRAFSYLGTVMVIPGCLGVFRRE 366 Query: 241 ALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGG 300 AL+EVG + D +TED D++ +L W + AL + P +L L QRLRW +G Sbjct: 367 ALSEVGGYDPDTVTEDFDLTVRLLKAGWRVE-LSEALVYTEAPFSLTDLLNQRLRWTRGN 425 Query: 301 AEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIA 360 + LK+ Y +AF I++ + V + I Sbjct: 426 IQTLLKHRDVFSEPA----------YGFLHRFAFPLSALSILFVPFASIVVTTMIFVAIL 475 Query: 361 ATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRV 420 G+ L L V L + + + P+ ++ Sbjct: 476 NGAFLGVALVAAYFL--FVLLFVAAMALDLSDGDWRLLAYAPLHLVGYRQFLDVIVIRTA 533 Query: 421 MLMPKKQRARWVSPDR 436 +++ + RW S R Sbjct: 534 LMLLRGTNKRWESVTR 549 >UniRef50_A6Q1D6 Glucosaminyltransferase n=2 Tax=Epsilonproteobacteria RepID=A6Q1D6_NITSB Length = 438 Score = 280 bits (716), Expect = 9e-74, Method: Composition-based stats. Identities = 91/428 (21%), Positives = 181/428 (42%), Gaps = 22/428 (5%) Query: 4 RIVSFFILCLVLCIPLCVAYFHSGELMMRF----------VFFWPFFMSIMWIVGGVYFW 53 + V +L L L + L + + + + F + + I + ++F Sbjct: 3 KTVFIILLSLFLIVTLGIILYSFYQYYILFNSMFIKIGYGIILGFTSLVIFRYMLLLFFS 62 Query: 54 VYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDG 113 ++R E + P +S+I+P +NE K + +I + L Q Y N+E+I V+DG Sbjct: 63 IFRTIQRSAEETYQIDPKQRYPKVSVIVPAYNEAKTIATSISSLLTQNYPNLEIIVVDDG 122 Query: 114 STDKTRAILDRMAAQ--IPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRD 171 S+D+T + +RV +N+GK+ A+ G + E + +D D+ + ++ Sbjct: 123 SSDETYFKAKQFEHNEFCKEIRVFR-KKNEGKSKAINYGIERSTGELIFVMDADSKISQN 181 Query: 172 AAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVS 231 A ++ +P + AV G+ + + L+ K+Q EY + +++ Q V + Sbjct: 182 AIL-LLARHFEDPDIAAVAGSVYVSNQVNLITKLQALEYIEGLNMVRNGQAFLKAVNIIP 240 Query: 232 GVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWK 291 G + FR++AL VG + D ED D++ KL + I +EP A+ + PE L L K Sbjct: 241 GPVGMFRKNALYNVGLYDHDTFAEDCDVTLKLIAKGYKIDFEPEAVAYTEAPENLLDLIK 300 Query: 292 QRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCL--TTIWAFTCLVGFIIYAVQLAG 349 QR RW +G + K+ LW + + Y L + W F + V + Sbjct: 301 QRYRWTRGILQAIRKHRNLLWNFKENTTASMVMWYMLFESLFWPFADI------WVNIFV 354 Query: 350 VPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLS 409 + + + L + + +++ + ++++ I+F + Sbjct: 355 LYWAVTSGMSIFIFYWWSIFTILDVAGALYCILLTGEKLGLVFYAVYYRIFFIETINLAK 414 Query: 410 LATTLVSF 417 + ++ F Sbjct: 415 IFASIEEF 422 >UniRef50_P96587 Uncharacterized glycosyltransferase ydaM n=6 Tax=Bacillus RepID=YDAM_BACSU Length = 420 Score = 279 bits (715), Expect = 1e-73, Method: Composition-based stats. Identities = 100/417 (23%), Positives = 196/417 (47%), Gaps = 18/417 (4%) Query: 29 LMMRFVFFWPFFMS-IMWIVGGVYFWVYRERHWP-WGENAPAPQLKDNPSISIIIPCFNE 86 + W + + + GG ++ ER+ P W EN +K+ P +S++IP NE Sbjct: 6 FFISLSLIWVMLLYHMFLMQGGFRHYMTFERNIPKWREN-----MKELPKVSVLIPAHNE 60 Query: 87 EKNVEETIHAALAQRYEN--IEVIAVNDGSTDKTRAILDRMAAQIPHLR-VIHLAQN--Q 141 E + +T+ A + Y +E+I VND S+D+T I++ + + ++ VI N + Sbjct: 61 EVVIRQTLKAMVNLYYPKDRLEIIVVNDNSSDRTGDIVNEFSEKYDFIKMVITKPPNAGK 120 Query: 142 GKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTR-ST 200 GK+ AL +G A + + + D D ++ A Y+V ++ + + GAV G R+ T Sbjct: 121 GKSSALNSGFAESNGDVICVYDADNTPEKMAVYYLVLGLMNDEKAGAVVGKFRVINAAKT 180 Query: 201 LVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDIS 260 L+ + E + + + + + T+ G A RRS + ++G W D + ED +++ Sbjct: 181 LLTRFINIETICFQWMAQGGRWKWFKIATIPGTNFAIRRSIIEKLGGWDDKALAEDTELT 240 Query: 261 WKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMW 320 ++ + I + P A+ W PET K W+QR RWA+G V LK + + ++ + R+ Sbjct: 241 IRVYNLGYHIRFFPAAITWEQEPETWKVWWRQRTRWARGNQYVVLKFLAQFFKLKRKRII 300 Query: 321 PLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVS 380 F + T F ++ + A+ + + ++ L+ IL L + + +++ Sbjct: 301 FDLFYFFFTYFLFFFGVI--MSNAIFVVNLFYDLHLSVGFLAMILWILAFFLFMTEVMIT 358 Query: 381 LMIENRYEHNLTSSLFWIIWFPV-IFWMLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 L IE + + ++++F W++ + +L F + KQ +W +R Sbjct: 359 LSIEKTEMNKQNFFIVFLMYFTYSQAWIVLVIYSL--FVEIKHRLFKQEVKWYKTER 413 >UniRef50_B9ZC52 Glycosyl transferase family 2 n=1 Tax=Natrialba magadii ATCC 43099 RepID=B9ZC52_NATMA Length = 549 Score = 279 bits (715), Expect = 1e-73, Method: Composition-based stats. Identities = 104/409 (25%), Positives = 180/409 (44%), Gaps = 25/409 (6%) Query: 5 IVSFFILCLVL--CIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPW 62 + S+ L + L P +A H G +++ FV++W + + + R ++ P Sbjct: 93 VGSYLGLIVWLGWLTPSPLALVHLGAVVLIFVYYWFIA-----FIALFHDQIGRSKYVPN 147 Query: 63 GENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYEN--IEVIAVNDGSTDKTRA 120 P I+++IP +NEE V TI + L Y +E+IAV+DGSTD T A Sbjct: 148 PP---------YPQITVLIPAYNEEGYVGRTIQSLLDANYPADALEIIAVDDGSTDDTLA 198 Query: 121 ILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPM 180 AA + V+ N GK AL G A + +V +D D+++DRDA +IV P Sbjct: 199 EASAFAAASEQVSVVS-KANGGKYSALNYGLLFAAGDIIVTVDADSIVDRDALKHIVAPF 257 Query: 181 LYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRS 240 + +GAV N I R +L+ + Q EY+ + + +R +G V V G + A+RR Sbjct: 258 AADDDIGAVASNVTIWNRDSLITRCQQLEYTIGVNIYRRALDYFGIVMVVPGCLGAYRRE 317 Query: 241 ALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGG 300 L+EV + D +TED D++ K+ + + A + P T L++QRLRW +G Sbjct: 318 VLSEVFAYDPDTLTEDFDVTMKVLRAGYRVS-VSDARVYTEAPATWGDLYRQRLRWYRGN 376 Query: 301 AEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIA 360 +K+ + + + + + L ++ V LA + I H+ Sbjct: 377 YMTIIKHWSVVTDSSYGYLNRIALPFRLVEMFFLP-----FASFVVLAYILWLIAAGHVL 431 Query: 361 ATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLS 409 + ++ L + + IE L + ++ + L+ Sbjct: 432 TVFAVFVFFTSIVFLIAALGIQIEGEDWRLLVYAPLLVVGYKQFHDALN 480 >UniRef50_B8II97 Polysaccharide deacetylase n=1 Tax=Methylobacterium nodulans ORS 2060 RepID=B8II97_METNO Length = 1120 Score = 278 bits (711), Expect = 3e-73, Method: Composition-based stats. Identities = 98/405 (24%), Positives = 163/405 (40%), Gaps = 25/405 (6%) Query: 38 PFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAA 97 F + I+G + R P P I++++P +NEE + +TI Sbjct: 715 AIFRLTLIIIGATAHGLRGSRRDPPEGWRPR-------GIAVLVPAYNEEIVILKTIQTL 767 Query: 98 LAQR-YENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKS 156 LA E IE+I ++DGSTD T A++ ++ I+ N GKA AL G + Sbjct: 768 LASTIAEQIEIIVIDDGSTDNTAAVVRTAFPNTAAVQ-IYTKANGGKAAALNYGLQKTST 826 Query: 157 EYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGL 216 E +V IDGD +L DA ++ +P++GAV G + R TL+ + Q EY+ L Sbjct: 827 EIIVAIDGDTVLLPDAIEHLAR-HFADPKIGAVAGTVSVGNRKTLIARFQALEYTMSQNL 885 Query: 217 IKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRA 276 +R ++ + V G I A+RR AL VG +S D + ED D++ L+L W + EPRA Sbjct: 886 DRRAFQLINAIGVVPGAIGAWRREALMAVGGYSSDTLAEDADLTISLELAGWKVVCEPRA 945 Query: 277 LCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYC--------- 327 PE L+ KQR RW G +V K+ R+ L Sbjct: 946 RALTEAPERLRAFLKQRFRWMFGTLQVAYKHAPASLRRPRGISLILIPNVLLFQFLFTLL 1005 Query: 328 --LTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIEN 385 L + +V ++ + + + ++ L +L++ Sbjct: 1006 APLMDLILIFSVVSSVVDITLIGSRSEGYGTLELLMAY--WLVFQVFDFLAGCAALLLHG 1063 Query: 386 RYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRAR 430 + L ++ + L T + + + + Sbjct: 1064 KS--PEWRLLPLLVLQRFCYRQLLYITAIRTLLTALRGTFVGWGK 1106 >UniRef50_B4D4P9 Polysaccharide deacetylase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D4P9_9BACT Length = 1192 Score = 276 bits (707), Expect = 1e-72, Method: Composition-based stats. Identities = 103/414 (24%), Positives = 173/414 (41%), Gaps = 50/414 (12%) Query: 35 FFWPFFM---SIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVE 91 FFW F + ++ + + W+ H + PS+S++I FNE K ++ Sbjct: 761 FFWAFMIFSTGLVVLRTLLVAWLAHRHH----RGEIVSKSTFQPSVSVVIAAFNEGKVIQ 816 Query: 92 ETIHAALAQRYE-NIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTG 150 +T+ + L Y ++E+I V+DGS D T A ++ +AA P +R+I N+GKA AL+ G Sbjct: 817 KTLRSVLDTDYTGDLELIIVDDGSRDDTAAKVEALAAAEPRIRLI-QQPNRGKASALRHG 875 Query: 151 AAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRT------------- 197 A++E + +D D L + + +V+P+ +P VGAV+GN R+ Sbjct: 876 VTVARNEIVAFLDADTLFEHHTLSALVQPL-EHPLVGAVSGNARVGNLLTGHDSSGASSA 934 Query: 198 -----------------------RSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVI 234 + Q EY L +R + + V G + Sbjct: 935 SANSPGFLSRWWAGSTWATLSLLARRFIASCQSLEYICGFNLDRRAYTEWNCITVVPGAV 994 Query: 235 AAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRL 294 +A RRSA+ G +SDD + ED D++ + + I Y P A+ W PET GL KQR Sbjct: 995 SALRRSAIVAGGGFSDDTLAEDTDLTLVMHRLGYRIEYAPEAVAWTEAPETTIGLVKQRS 1054 Query: 295 RWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNI 354 RWA G + K+ ++ + + L IW + + V + + Sbjct: 1055 RWAFGTMQCLWKHRDMVFNPRYGALGW----FSLPNIWFMQIGLVAVTPLVDAILLYSLL 1110 Query: 355 ELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWML 408 + I+L L + M R +L + PV+ W++ Sbjct: 1111 FGNARGIWYWFVIVLGLDLFLAALACWMDGERLRRSLLILPMRFFYRPVLAWVI 1164 >UniRef50_B9LWG0 Glycosyl transferase family 2 n=1 Tax=Halorubrum lacusprofundi ATCC 49239 RepID=B9LWG0_HALLT Length = 509 Score = 275 bits (704), Expect = 2e-72, Method: Composition-based stats. Identities = 102/426 (23%), Positives = 172/426 (40%), Gaps = 25/426 (5%) Query: 19 LCVAYFHSGELMMRFVFFWPFFMSIMWIVG----GVYFWVYRERHWPWGENAPAPQLKDN 74 LCVA+ G ++ + P S +++ +Y+W + + Sbjct: 77 LCVAFVCYGFGVVTLGWVQPSLTSGVYLFAIALIFLYYWFIALAAVYHNQRYHSQDAPPE 136 Query: 75 PS--ISIIIPCFNEEKNVEETIHAALAQRYEN--IEVIAVNDGSTDKTRAILDRMAAQIP 130 PS ISII+P +NEE ++ TI A L Y + E+I ++DGSTD T A ++ Sbjct: 137 PSASISIIVPAYNEEGYIQRTITALLDADYPDGKREIIVIDDGSTDNTCAEARAFESETV 196 Query: 131 HLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVT 190 + N GK AL G A +E ++ +D D++ ++DA +V P+ + VGAV Sbjct: 197 SVV---TKDNGGKYSALNYGLLFASNEIILTVDADSVPEKDALKQMVAPL-SDQSVGAVA 252 Query: 191 GNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSD 250 I R +L+ Q EY+ + + +R ++G V V G + A+RR L E+ + Sbjct: 253 STVTIWNRGSLLTGCQQLEYTIGVNVYRRMLDLFGIVMVVPGCLGAYRRDVLDEIQGFDP 312 Query: 251 DMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTR 310 +TED DI+ K+ + + A + P++L+ L+ QRLRW +G K+ Sbjct: 313 QTLTEDFDITVKVLRAGYEVR-SSEARVYTEAPDSLRDLYNQRLRWYRGNYMTIFKHRGV 371 Query: 311 LWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLC 370 L ++ F L + I+ + + + I+ Sbjct: 372 LSEPTTGFLYRFAFPLRLVELLFLPFASWVILGLIVKILLSGFVIQVVSLFIFFLSIIFL 431 Query: 371 TLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRAR 430 L +I E L++ F V + A L S V+L Sbjct: 432 IAALGVYI---------EGEDWRLLWYTPLFLVGYKHFHDALLLKSLADVLLGR---NLS 479 Query: 431 WVSPDR 436 W R Sbjct: 480 WTRATR 485 >UniRef50_B7QVL5 Glycosyl transferase, family 2 n=1 Tax=Ruegeria sp. R11 RepID=B7QVL5_9RHOB Length = 1140 Score = 274 bits (702), Expect = 4e-72, Method: Composition-based stats. Identities = 87/408 (21%), Positives = 176/408 (43%), Gaps = 11/408 (2%) Query: 35 FFWPFFMSIMW-IVGGVYFWVYRERHWPWGENAPAPQLKD-NPSISIIIPCFNEEKNVEE 92 + ++++ ++ + +V R + + D ++++IP +NEEK++ + Sbjct: 719 VSYIVPLAVLLGVLRALTLFVLAIRSKRLSRSDLRHRTGDFTAPVTVVIPAYNEEKSILK 778 Query: 93 TIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAA 152 TI++ L Y N+ V+ V+DGSTD T ++ + P ++++ N GK A A Sbjct: 779 TIYSVLDSDYPNLSVLVVDDGSTDATYDLVSKTYLNNPKVQILR-QPNGGKWKAANLAFA 837 Query: 153 AAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSS 212 ++Y+V ID D ++ DA +++P+ NP+VGAV G + + L+ K++ EY+ Sbjct: 838 HVTTDYVVAIDADTIVAPDAIRRLMQPL-RNPKVGAVAGKIMVGNSNNLLTKLEKLEYTV 896 Query: 213 IIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFY 272 + +R + V G A+R +A+ E GY+S + ED D++ L + + Sbjct: 897 AQNIDRRAYETINAIMVVPGAFGAWRTAAVRECGYYSSQTLAEDTDLTISLLEAGYVVRA 956 Query: 273 EPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPL----FFEYCL 328 +A + P ++ L KQR+RW+ G + K+ + + + L F + Sbjct: 957 AEKAYAYTEAPASVGTLMKQRMRWSIGILQSAWKHRATIRKGHAIGLVGLTDLVLFGVIM 1016 Query: 329 TTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYE 388 + L+ ++ A + A + + L LL+ I++ Sbjct: 1017 PLLGPIIDLLLVLMLARFITSFDGTSFDAFTARDYIVLSIFLALPLLEMIMADYAVRSEP 1076 Query: 389 HNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 +F ++ +I+ L + S R++ W R Sbjct: 1077 STPRRMVFLLLLNRLIYRQLLIINVYRSLWRILTGRLTG---WHKLRR 1121 >UniRef50_Q03HA4 Glycosyltransferase n=2 Tax=Bacilli RepID=Q03HA4_PEDPA Length = 414 Score = 273 bits (699), Expect = 8e-72, Method: Composition-based stats. Identities = 100/425 (23%), Positives = 186/425 (43%), Gaps = 32/425 (7%) Query: 28 ELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDN-PSISIIIPCFNE 86 LM+ + W + + + Y W + P QL + P +++++P NE Sbjct: 4 FLMVAIIAIWGTLIINLILTVAGYTW------YLQEAPKPDKQLPEKIPFVTVMVPAHNE 57 Query: 87 EKNVEETIHAALAQRYEN--IEVIAVNDGSTDKTRAILDRMAAQI--PHLRVIHLAQ--- 139 + +T+ + L+ Y + E+I +ND S+D + +L + L+VI+ Sbjct: 58 GIVIVKTVLSLLSFDYPHDHYEIIVINDNSSDNSAELLKEVQNDFGEERLKVINTDNVTG 117 Query: 140 NQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTR- 198 +GK+ AL G AK + D D +R A +V +L N R+ AV G R R + Sbjct: 118 GKGKSNALNIGLKQAKGSVIAIYDADNTPERGALRLLVAELLGNDRLAAVIGKFRTRNKE 177 Query: 199 STLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDID 258 +T++ + E S + + ++ + T+ G RR L ++G W + ED + Sbjct: 178 ATILTRFINIETLSFQWMAQAGRQRLFKLCTIPGTNYVIRREILEKIGGWDVKALAEDTE 237 Query: 259 ISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFR 318 IS+++ + I ++PRA+ W P+TL + QR RW +G V +KN L++K Sbjct: 238 ISFRVYQMGYQIKFQPRAVTWEQEPQTLDVWFHQRTRWVKGNIYVVVKNAKLLFKKAG-- 295 Query: 319 MWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGI--LLCTLCLLQ 376 P+ F+ F + I L+ V + + +A + G L ++ Sbjct: 296 -RPIRFDLLYFLSTYFLLMTALI-----LSDVVFVLSVAGLAHSDLQGFSNALWLFGIIM 349 Query: 377 FIVSLMIENRYEHNLTS--SLFWIIWFPVIF---WMLSLATTLVSFTRVMLMPKKQRARW 431 FI+S + E ++ +II+ +++ W+ A +V + R + K+ A+W Sbjct: 350 FIISTFVTVTTEKGEMRFSNILYIIFMYLVYSQMWLAVAAYGMVGYIREQVFHKQ--AKW 407 Query: 432 VSPDR 436 R Sbjct: 408 YKTKR 412 >UniRef50_A9EU26 Group-specific protein n=3 Tax=Phaeobacter gallaeciensis RepID=A9EU26_9RHOB Length = 1136 Score = 272 bits (696), Expect = 2e-71, Method: Composition-based stats. Identities = 83/365 (22%), Positives = 166/365 (45%), Gaps = 8/365 (2%) Query: 21 VAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNP-SISI 79 +A+ G F PF + ++ ++ + +V R + D +++ Sbjct: 703 LAFTGFGNSATWVAFLVPFAV-LLGVIRALVLFVMAIRSRRADRADLRHRTGDFSAPVTV 761 Query: 80 IIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQ 139 +IP +NEEK++ +TI++ L Y N+ V+ V+DGSTD T ++ + P+++++ Sbjct: 762 VIPAYNEEKSILKTIYSVLESDYPNLSVLVVDDGSTDDTHGLVTKTYKDNPNVQILR-QP 820 Query: 140 NQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRS 199 N GK A + ++Y+V ID D ++ DA +++P+ NP+VGAV G + + Sbjct: 821 NGGKWKAANLAFSHVTTDYVVAIDADTIVAPDAIRRLMQPL-RNPQVGAVAGKIMVGNSN 879 Query: 200 TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDI 259 L+ K++ EY+ + +R + V G A+R A+ + GY+S + ED D+ Sbjct: 880 NLLTKLEKLEYTVAQNIDRRAYETINAIMVVPGAFGAWRTEAVRKCGYYSSQTLAEDTDL 939 Query: 260 SWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRM 319 + L + + RA + P ++ L KQR+RW+ G + K+ + + + + Sbjct: 940 TISLLEQGYEVRAAERAYAYTEAPASVGTLMKQRMRWSIGILQSAWKHRSTIRKGHAVGL 999 Query: 320 WPL----FFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLL 375 L F + + L+ ++ ++G + + L LL Sbjct: 1000 VGLTDLVIFGVIMPLLGPIIDLLLVLMLVRFVSGFDGTTFDAFTVRDYAVLSIFLALPLL 1059 Query: 376 QFIVS 380 + I++ Sbjct: 1060 EMIMA 1064 >UniRef50_C5SLQ2 Polysaccharide deacetylase n=1 Tax=Asticcacaulis excentricus CB 48 RepID=C5SLQ2_9CAUL Length = 1126 Score = 272 bits (695), Expect = 2e-71, Method: Composition-based stats. Identities = 96/370 (25%), Positives = 162/370 (43%), Gaps = 17/370 (4%) Query: 77 ISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIH 136 +S+IIP FNE + +E ++ L +IEV+ V+DGS D+T AI+ R A P + +I Sbjct: 754 VSVIIPAFNEARVIEASVRRVLTSVDVDIEVVVVDDGSKDETSAIVARAFANEPRVTLI- 812 Query: 137 LAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIR 196 N GKA A+ G A A+ E ++ +D D + + A +V + P +GAV GN ++ Sbjct: 813 TQANAGKATAVNNGIARARGEIIIALDADTQFEPETIARLVRWFIR-PEIGAVAGNAKVG 871 Query: 197 TRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITED 256 R V + Q EY + L + +G + V G + A+RRSAL EVG + D + ED Sbjct: 872 NRFNFVTRWQGVEYVTAQNLERTALATFGAMMVVPGAVGAWRRSALEEVGGYPHDTLAED 931 Query: 257 IDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKEN 316 D++ +Q W + Y+ A+ W PE+ L KQR RWA G + K+ L + Sbjct: 932 QDLTIAIQRRGWAVAYDQDAVAWTEAPESFASLIKQRYRWAFGTLQCLWKHRRILKEGKP 991 Query: 317 FRMWPLFFEYCLTTIWAFTCLVGFI--IYAVQLAGVPLNIELTHIAAT-------HTAGI 367 + + F+ + I + +A I + T Sbjct: 992 KGLAHIGLPQAWIFQIGFSVISPLIDLALLINIAATLWTISQHGLNQTDGTLERMLIYWT 1051 Query: 368 LLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQ 427 + + L ++ M+E R + +FW++ ++ + L + + Sbjct: 1052 VFVAIDALCGWIAYMLEPREKR---YPVFWLLSQRFVYRQIMYYVVLKALASA---GRGL 1105 Query: 428 RARWVSPDRG 437 W +R Sbjct: 1106 AVGWGKLERS 1115 >UniRef50_B1L4E1 Glycosyl transferase family 2 n=1 Tax=Candidatus Korarchaeum cryptofilum OPF8 RepID=B1L4E1_KORCO Length = 489 Score = 271 bits (693), Expect = 4e-71, Method: Composition-based stats. Identities = 111/431 (25%), Positives = 185/431 (42%), Gaps = 18/431 (4%) Query: 10 ILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYR---ERHWPWGENA 66 + L++ +P+ +A S E + F +S + + ++ + + E Sbjct: 59 LFSLLIIVPISLALVASLESLSTFETILYGIISFGFTLISWHYLLLVPLAAYYKRMEEIE 118 Query: 67 PAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMA 126 L P +S+IIP NEEK + TI + L YE EV+ V+DGSTD+T I Sbjct: 119 ARKPLLYRPLVSVIIPARNEEKVIGSTIRSVLESDYEPKEVVVVDDGSTDRTFEIASIY- 177 Query: 127 AQIPHLRVIHLA-QNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPR 185 Q P ++V+ +GKA AL G A+ E +V +D D ++ RDA +V + +PR Sbjct: 178 -QGPKVKVLRRELGGRGKARALNFGLRFARGEVIVLMDADTIISRDAIKELVRKL-QDPR 235 Query: 186 VGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEV 245 V AV GN +R + L+ K+Q EY + L ++ + G V +SG + AFRR+ L Sbjct: 236 VSAVAGNVMVRNKVNLLTKLQAIEYIATFHLFRKGLSVLGAVPIISGALGAFRRNVLESS 295 Query: 246 GYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFL 305 G + D +TED D++ K + + AL + PE LK L++QRLRW +G EV + Sbjct: 296 GLYDADTLTEDFDVTLKALKSGKIVQASSYALAFTEAPEKLKSLYRQRLRWYRGAYEVLI 355 Query: 306 KNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTA 365 K+ ++ + II + + I Sbjct: 356 KHRDAF-----------SLSGLTMLDFSLILMNNLIIPLIDFLSIISIIIAILRGLIWPL 404 Query: 366 GILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPK 425 I + LQ +++L I E S +F ++ + F ++ + Sbjct: 405 IIQIILFSTLQILINLFILQLAEERDISLVFLPLFAIGYKQFHEILMLKCFFDVIIARMR 464 Query: 426 KQRARWVSPDR 436 + W +R Sbjct: 465 GKSFSWTFIER 475 >UniRef50_Q3B487 Glucosaminyltransferase n=2 Tax=Chlorobium/Pelodictyon group RepID=Q3B487_PELLD Length = 461 Score = 271 bits (693), Expect = 4e-71, Method: Composition-based stats. Identities = 98/362 (27%), Positives = 160/362 (44%), Gaps = 7/362 (1%) Query: 41 MSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQ 100 M + + G F P +S+IIP FNEE ++ + I +ALA Sbjct: 83 MFFLIYLAGSLFPFLFFIGLTMLPERRIAGNTSLPFVSVIIPSFNEEDSIAQCIESALAL 142 Query: 101 RYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLV 160 Y E+I V+DGS D T I++R + VI L N+GK AL G AK E + Sbjct: 143 DYPAYEIIIVDDGSRDLTLPIIERY-----DVSVIRLRTNRGKVEALNRGIEKAKGEIIF 197 Query: 161 CIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRT 220 D D+ LD Y+V + VGAV G + ++ + ++Q EY +IK+ Sbjct: 198 FTDSDSSLDPKVLRYLVAG-FSDDTVGAVAGMVLPKRHNSYLLRMQTIEYIYGQSIIKKA 256 Query: 221 QRIYGN-VFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCW 279 Q G V G AFRRS L E G + + + ED DI+ ++ + YEPRA+ + Sbjct: 257 QLKSGGAVSICPGPATAFRRSVLLETGGYRNRTLAEDFDITLEMMELGYRAVYEPRAVSY 316 Query: 280 ILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVG 339 KGL KQR+RW++G +V+ ++ ++ + + + L + L Sbjct: 317 TSAMTGWKGLRKQRIRWSRGHLQVYREHRGMMFSDIGGPISLFWMPWSLFIGYGSAFLEM 376 Query: 340 FIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWII 399 + L + ++ L + + ++L + +Q I+ LM R +L + F Sbjct: 377 AFLGVYPLLVIISSMPLLFLKLGLSYMVILECISAMQGIIPLMQARRLTPSLALASFTTQ 436 Query: 400 WF 401 + Sbjct: 437 PY 438 >UniRef50_Q46B82 Glucosaminyltransferase n=2 Tax=Methanosarcina RepID=Q46B82_METBF Length = 403 Score = 269 bits (688), Expect = 2e-70, Method: Composition-based stats. Identities = 96/408 (23%), Positives = 174/408 (42%), Gaps = 16/408 (3%) Query: 29 LMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEK 88 L + F F + I+ I + R NP +SI++P +NE K Sbjct: 6 LFFYIIGFTCFSVGILTIPYYPLSLAFEMRKSKQN-----IFNSKNPFVSIVVPAYNEGK 60 Query: 89 NVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALK 148 + I + L Y EVI V+DGS+D T + H+ V+ +N GKA AL Sbjct: 61 VIGHCIKSILESNYSEYEVILVDDGSSDNTLEEMQHY-ETNSHVIVV-TKKNGGKASALN 118 Query: 149 TGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVG 208 G AK E + +D D + D + ++ + VGAV GN L ++ Sbjct: 119 VGLKLAKGEVIFFVDADGIFAPDTISKMLSGFIS-EDVGAVCGNDAPINLDKLQTQLANL 177 Query: 209 EYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQW 268 G ++R + VSG I A+R S L + G + + I ED++++W++ + Sbjct: 178 LTHVGTGFVRRALSTIDCLPVVSGNIGAYRSSTLEKTGPFLEGFIGEDMELTWRVHKAGY 237 Query: 269 TIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCL 328 + ++P A+ + P T+ GLWKQR+RWA+G + + L+ + ++ + L Sbjct: 238 KVVFQPWAIVYAEAPSTITGLWKQRVRWARGLLKTAYIHRDMLFNPKYG-LFAFYLPINL 296 Query: 329 TTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYE 388 T++ L I + + NI ++ G L + + S+ ++ + Sbjct: 297 TSMIIIPLLQLISIILLPILLF-RNINPIPLSWISIMGWLGMFSSIFALLFSIALDRAW- 354 Query: 389 HNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 L F++I V++ + L + ++ +++ A+W DR Sbjct: 355 --LDLKYFYVIPLWVLYSFMMDVVMLWAI---IVELRRKEAKWNKLDR 397 >UniRef50_B4U2G9 Glycosyl transferase, group 2 family protein n=22 Tax=Streptococcus RepID=B4U2G9_STREM Length = 464 Score = 269 bits (687), Expect = 2e-70, Method: Composition-based stats. Identities = 105/461 (22%), Positives = 189/461 (40%), Gaps = 62/461 (13%) Query: 19 LCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSIS 78 L + +S L + F+ + F+ + IVG ++ + R + P P +S Sbjct: 4 LFLVLKYSQYLFLAFLIAYASFLLLSLIVGAIHLYERR-------QQVPLGGDVSLPPVS 56 Query: 79 IIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRM------------- 125 I++P +NEE + ET+ + L Y+ E+I ++DGSTD T +L Sbjct: 57 ILVPAYNEELTIIETVESLLMLDYDCYEIIVIDDGSTDHTAKLLKEQFHLEAVTCSINQR 116 Query: 126 ----------AAQIPHLRVIHLAQ-NQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAA 174 Q+ H+++ L + N GK +L G AA+ +Y +C+D D++L D+ Sbjct: 117 IPTRPLKAVSQKQLKHVKLTLLEKANGGKGDSLNFGINAAQYDYFLCMDADSMLQEDSLK 176 Query: 175 YIVEPMLYNPRVGAVTGNPRIRT--------------RSTLVGKIQVGEYSSIIGLIKRT 220 I + + +P V AV G ++ L+ Q EY + Sbjct: 177 RIAKTVAKDPSVIAVGGMVKVSQGVALAAGRIVDYHLPRQLISCAQAIEYDCAFFGARVL 236 Query: 221 QRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQ------LNQWTIFYEP 274 +SG F++ V + + ED+++ KL + I YE Sbjct: 237 LDQAAGNLIISGAFGLFKKELAIAVDGYDTGTLGEDMELVMKLHFFCQNNKIPYRIGYET 296 Query: 275 RALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAF 334 A+CW P LK L +QR RW G + K + + + F++ + Y L ++ Sbjct: 297 DAICWSQAPSRLKDLCQQRRRWFLGLYQCLKKYL-QFFVSGRFKV-TVLISYVLYLLFE- 353 Query: 335 TCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENR--YEHNLT 392 L+ I A + + L++ + T ++L +C I NR ++N + Sbjct: 354 --LLSPFIEAFGVTIIVLSLLCHQLNTTFFVSLMLLYICYCVLIALTSFLNRVYSQNNSS 411 Query: 393 SSLFWIIW--FPVIFWMLSLAT--TLVSFTRVMLMPKKQRA 429 + F ++W + I+ L L T TLV + + +K+ A Sbjct: 412 ITFFDMVWVIYAAIYQCLVLHTVLTLVKVSSFIGYQRKKNA 452 >UniRef50_B9MNQ5 Glycosyl transferase family 2 n=1 Tax=Anaerocellum thermophilum DSM 6725 RepID=B9MNQ5_ANATD Length = 464 Score = 267 bits (684), Expect = 4e-70, Method: Composition-based stats. Identities = 102/467 (21%), Positives = 192/467 (41%), Gaps = 57/467 (12%) Query: 25 HSGELMMRFVFFWPFFMSIMW---IVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIII 81 H FV ++ ++ ++ I+ ++ V R+ G +S+++ Sbjct: 3 HIINWFSWFVSYYVLVLNTVYAILILISLFGIVSYWRNKIKGRIVEVVSSDFALPVSLLV 62 Query: 82 PCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDR----------------- 124 P +NEE+ + +++ + L Y EV+ +NDGS D T +L Sbjct: 63 PAYNEEETIAKSVKSFLQIEYPEYEVVVINDGSKDGTLDVLKNEFDLYIVDRKFRKILST 122 Query: 125 -------MAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIV 177 + + +L V+ +N GKA AL G Y+ +D D++L+RD+ A ++ Sbjct: 123 KEIKVIYYSKKYSNLIVVD-KENGGKADALNAGINVCTYPYVCSLDADSILERDSIAKVM 181 Query: 178 EPMLYNP-RVGAVTGNPRIRTRSTL---------------VGKIQVGEYSSIIGLIKRTQ 221 +P NP V A TG RI + L + + Q+ EY ++ Sbjct: 182 QPFFDNPYEVVATTGIVRIVNGTELDSFGNIKKLKLPSSSLARFQIIEYLRAFLGARKGL 241 Query: 222 RIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQW------TIFYEPR 275 + G++ SG AAF ++A+ +VG +SD + ED++I KL+ N + + + P Sbjct: 242 SMIGSLVIASGAFAAFNKNAIIKVGGFSDRTVGEDMEIVVKLRKNSYKEGALGRVEFVPD 301 Query: 276 ALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTT--IWA 333 + W PETLK L KQR RW +G +V + L+ + + Y L + Sbjct: 302 PIVWTQCPETLKDLSKQRRRWQRGLCQVIFMHKDILFNPKYGILGLFAMPYQLMFELLGP 361 Query: 334 FTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTS 393 F ++G+I + +N+E+ + + + +L ++ + +YE Sbjct: 362 FVEMLGYIFIPISYFAHIINLEVA-LFFFAVEIMYGIVISILAVLLGEFSDRKYEG--WR 418 Query: 394 SLFWIIWFPVIFWM-LSLATTLVSFTRVMLMPKKQRARWVSPDRGIL 439 ++ F ++ T L +++ W P+R L Sbjct: 419 EFGILVLFAILENFGYRQMTMLFRIVGTFEAILRKKG-WAKPERKKL 464 >UniRef50_B1HSU1 Biofilm PIA synthesis N-glycosyltransferase icaA n=1 Tax=Lysinibacillus sphaericus C3-41 RepID=B1HSU1_LYSSC Length = 403 Score = 267 bits (684), Expect = 4e-70, Method: Composition-based stats. Identities = 102/396 (25%), Positives = 186/396 (46%), Gaps = 16/396 (4%) Query: 48 GGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYEN--I 105 G ++F ++ + W + + D PS+S+ IP NE +E+T+ A Y + Sbjct: 7 GYLHFRLFEKPIDEWSKK-----MTDVPSVSVFIPAHNEALVIEQTLRAMSRLYYPKDKL 61 Query: 106 EVIAVNDGSTDKTRAILDRMAAQIPHLRVIHL---AQNQGKAIALKTGAAAAKSEYLVCI 162 EVI +ND S+D+T I + A + P +RVI + +GK+ AL + A + + +V Sbjct: 62 EVIVINDNSSDETGNIALQYAEKFPFMRVIETVEPNKGKGKSSALNSALADSTGDIVVVY 121 Query: 163 DGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRS-TLVGKIQVGEYSSIIGLIKRTQ 221 D D +R A Y+V ++ +P+ A G R+ + T + + E + + + Sbjct: 122 DADNTPERMAVWYLVMGLMNDPKAAATVGKFRVINAAETWLTRFINIETICFQWMAQGGR 181 Query: 222 RIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWIL 281 + V T+ G A RR+ L ++G W + ED +++ ++ + I + P+A+ W Sbjct: 182 WKWFKVATIPGTNFAIRRTVLEQLGGWDVKALAEDTELTIRVYNLGYHIRFFPKAITWEQ 241 Query: 282 MPETLKGLWKQRLRWAQGGAEVFLKNMTRLWR-KENFRMWPLFFEYCLTTIWAFTCLVGF 340 PETLK WKQR RWA+G V LK +++ + K ++ LF+ + ++ F ++ Sbjct: 242 EPETLKVWWKQRTRWARGNQYVVLKFLSQFFTLKRKSIIFDLFYFFFTYFLFFFGVILSN 301 Query: 341 IIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIW 400 ++ + L +I LT +L L L + +++L IE E N + L+ I+ Sbjct: 302 ALFVINLF---YDIGLTVGDIALVLWVLAFLLFLGEVMITLSIEKT-EMNKKNFLYVILM 357 Query: 401 FPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 + M + F + M Q +W +R Sbjct: 358 YFTYSQMWIVLVVWSLFLEIKRMFTGQEVQWYKTER 393 >UniRef50_C6A3S6 Putative glycosyl transferase n=1 Tax=Thermococcus sibiricus MM 739 RepID=C6A3S6_THESM Length = 388 Score = 267 bits (683), Expect = 6e-70, Method: Composition-based stats. Identities = 86/401 (21%), Positives = 154/401 (38%), Gaps = 18/401 (4%) Query: 44 MWIVGGVYFWVYRERHWPWGENAPAPQ-LKDNPSISIIIPCFNEEKNVEETIHAALAQRY 102 M VY+ V + P+ ++ PS++I+IP NE + +T+ A Y Sbjct: 1 MIYPIFVYYIVLTIAGLRYNSRFKRPEIPEELPSVTILIPARNEGLVIRDTLRAMANLDY 60 Query: 103 E--NIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQ-GKAIALKTGAAAAKSEYL 159 +EV+ ++DGSTD T I + ++ P ++VI + GK+ L G AK E + Sbjct: 61 PKDKLEVLLLDDGSTDDTAKIAEEVSKDYPFIKVIRVEGGGSGKSYVLNYGLKLAKGEVI 120 Query: 160 VCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRT-RSTLVGKIQVGEYSSIIGLIK 218 D D + A +V + + AVTG + ++ + EY + Sbjct: 121 AVYDADNRPEPGALKDLVAML--SDETPAVTGKVKTMNWNRNILTRFICMEYLYFQLAGQ 178 Query: 219 RTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALC 278 + + + G R+ L E+G W ++ + ED+++S+++ L I Y P A+ Sbjct: 179 AGKSKFYKTAILPGTNFVIRKELLEELGGWDEEALAEDLELSFRIILTGKKIAYTPLAVT 238 Query: 279 WILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLV 338 W PE+L+ ++QR RWA G + + R + W L F+ LT + + + Sbjct: 239 WEQEPESLRVWFRQRTRWAAGNVHTVKEYVK---RFKEIPSWGLRFDLFLTLMVYYLLAM 295 Query: 339 GFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWI 398 I+ V + L G + LL+ L + Sbjct: 296 AVIVADVAFVALLLTSGSVTWFTWSVLGFVYLAF-LLEIFAGLYDGKIKSPGCWLLALLM 354 Query: 399 IWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRGIL 439 W+L L ++ + W R + Sbjct: 355 YHTYSQIWILISLAGLW-------EARRAKKVWYKTPRTAV 388 >UniRef50_C1D0J4 Putative glycosyltransferase n=1 Tax=Deinococcus deserti VCD115 RepID=C1D0J4_DEIDV Length = 430 Score = 265 bits (679), Expect = 2e-69, Method: Composition-based stats. Identities = 94/383 (24%), Positives = 176/383 (45%), Gaps = 18/383 (4%) Query: 54 VYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDG 113 +R R PA L ++++I +NEE + T+ + L++ E ++V+ V+DG Sbjct: 25 AHRSRIRRQHRELPATPLG----VTVMIAAYNEEIGIGGTLTSVLSEPLEELQVVVVDDG 80 Query: 114 STDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAA 173 S+D T A ++R A P +++I +N GKA AL + V +D D+ + Sbjct: 81 SSDGTAAAVERFAQLDPRVQLIR-QENTGKAGALNHALKYVRYPVAVSVDADSAISPGTL 139 Query: 174 AYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGV 233 A + ++PRVGAV G+ R+ + L+ +Q EY+ + KR Q G V V G Sbjct: 140 AALAR-HFHDPRVGAVAGDVRVAGEAKLLTHMQDMEYTIGQHMEKRAQCFLGTVSVVPGA 198 Query: 234 IAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQR 293 A+R L E+G +S D +TED+D++ + + + +EP AL + P+ LK LW+QR Sbjct: 199 AGAYRTELLRELGGYSHDTLTEDMDLTIAVAAAGYRVRFEPLALSFTEPPQMLKHLWRQR 258 Query: 294 LRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLN 353 LRW G +V K ++ R+ L L + + ++G + +++ + + Sbjct: 259 LRWMYGTFQVMGKYRHLIFNPRAGRLGLL----TLPYVLLYGLVLGGLSPLIEILAITIL 314 Query: 354 IELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATT 413 ++ T + L+Q + +++ +S +++ P + L + Sbjct: 315 VQYTVVENLWPI--------LMQILAEVVVIGAALALGGASWRMLLYTPTQRFFLRPFVS 366 Query: 414 LVSFTRVMLMPKKQRARWVSPDR 436 +V +++ W R Sbjct: 367 IVVLMTCWTFLRRRNVHWNKLPR 389 >UniRef50_B7HT47 Glycosyl transferase family 2 n=8 Tax=Bacteria RepID=B7HT47_BACC7 Length = 1184 Score = 265 bits (677), Expect = 3e-69, Method: Composition-based stats. Identities = 98/471 (20%), Positives = 170/471 (36%), Gaps = 56/471 (11%) Query: 19 LCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSIS 78 L + S L + ++ F + ++++ RER P +S Sbjct: 4 LDIFLKISQWLFLLYMGVAILFYTAIFLISAFTLRKKRERDENRELLHYLQSSITRP-VS 62 Query: 79 IIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLA 138 II+P +NE + ++ + L Y EVI VNDGS+D T L V+ L Sbjct: 63 IIVPAYNEGVTIVSSVQSLLTLEYPEFEVIVVNDGSSDDTLEQLKDHFQLYEIQNVVRLQ 122 Query: 139 -----------------------QNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAY 175 +N GKA AL G + Y+ +D D+LL+RDA Sbjct: 123 LETETIRKTYRSSVNKQIIVVDKENGGKADALNAGINISNYPYVCSLDADSLLERDALMK 182 Query: 176 IVEPMLYNPRVGAVTGN-PRIRTRSTL--------------VGKIQVGEYSSIIGLIKRT 220 ++P+ +P VTG RI S + + +Q+ EY + Sbjct: 183 AMKPIYESPEKVMVTGGSVRIVNGSYIQNGQMIENRLPKQPLALMQIIEYLRGFLFGRLA 242 Query: 221 QRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLN------QWTIFYEP 274 Y + +SG F + + VG + + ED+++ L + I Y P Sbjct: 243 WSKYNILPIISGAFGIFDKGEVIRVGGYQRKTVGEDMELVVHLHKKALQDGEEKKIIYNP 302 Query: 275 RALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAF 334 A+CW P+ L KQR RW +G E ++ L+R + + + L W Sbjct: 303 NAICWTQAPDDLTTFRKQRSRWHRGLGETLWRHKDILFRPKYKAFGMIAMPFYLLLEW-- 360 Query: 335 TCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFI-VSLMIENRYEHNLTS 393 +G II + + ++ I + +L T+ F+ V +++ + + Sbjct: 361 ---LGPIIEILGYLLLLYHLLFDEIFTEYVFLLLAATVLYGSFLSVGVVLLEEWSMKKQN 417 Query: 394 SL----FWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRGILR 440 S+ ++W + T F + + W R L Sbjct: 418 SIKDFTLLLLWSLTESFWYRPLTVWYRF-LGLFQSLFRIKGWGKMKRKSLE 467 >UniRef50_Q2S5Y1 Glycosyl transferase, group 2 family protein n=1 Tax=Salinibacter ruber DSM 13855 RepID=Q2S5Y1_SALRD Length = 434 Score = 261 bits (668), Expect = 3e-68, Method: Composition-based stats. Identities = 102/436 (23%), Positives = 166/436 (38%), Gaps = 58/436 (13%) Query: 54 VYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDG 113 + R + A P I++I P NEE ++ A L+ Y + E++ VNDG Sbjct: 2 LRYHRRMGAYVSETALGADALPPITLIAPAHNEEATCVSSVRAFLSTEYPSFEILVVNDG 61 Query: 114 STDKTRAILDRM------------------------AAQIPHLRVIHLAQNQGKAIALKT 149 STD T L + P+L V+ +N GKA AL Sbjct: 62 STDNTLDRLQEAFDLQPTARPPTSSLPTESVRQVYQSQTHPNLWVLD-KENGGKADALNA 120 Query: 150 GAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRST--------- 200 G S +D D L++ + A +V P L AV G R+ T Sbjct: 121 GINFCDSPLFCAVDADTLVEGNGLARMVRPFLEETATVAVGGIVRVANGCTIRAGRVQEV 180 Query: 201 -----LVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITE 255 + K+QV EY + G + +SG FRR VG ++ D + E Sbjct: 181 SMPTKWLPKLQVLEYLRAFYFGRVGWDALGAMLIISGAFGLFRRETAVRVGGYATDTVGE 240 Query: 256 DIDISWKLQLN------QWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMT 309 D+++ ++ + + + P + W +P TL+ L +QR RW +G A+ +++ Sbjct: 241 DMELVVRMHRRLREAGQDYRVRFVPDPVAWTKVPSTLQDLGRQRDRWQRGLADSMIRHQQ 300 Query: 310 RLWRKEN--FRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGI 367 L M F Y L T L+G+I + V L L+ +A A + Sbjct: 301 MLLNPRYGTVGMLAYPFFYMLETFGPILELMGYITFVVALISGTLSGPFM-LAFFLVAFM 359 Query: 368 LLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVI----FWMLSLATTLVSFTRVMLM 423 L L L + + RYE + +++ PV+ + L+ + Sbjct: 360 LGMALSLSAVTLEELTFRRYEKT--ADFAQLLFLPVVETFGYRQLNAWWRVRGLWSYFRG 417 Query: 424 PKK----QRARWVSPD 435 K+ +RA + SPD Sbjct: 418 NKEWGAMERAGFDSPD 433 >UniRef50_B7A6V1 Glycosyl transferase family 2 n=1 Tax=Thermus aquaticus Y51MC23 RepID=B7A6V1_THEAQ Length = 467 Score = 259 bits (662), Expect = 2e-67, Method: Composition-based stats. Identities = 99/473 (20%), Positives = 176/473 (37%), Gaps = 54/473 (11%) Query: 17 IPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPS 76 + L + ++++ + F F+ + + G+ R + + Sbjct: 1 MELLAWLLYGYQVLVLYYFAVLNFLYATFALFGLRMVARYARELSELSLKDLLEREAYLP 60 Query: 77 ISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAA--------- 127 +SI++P +NEEK + ++ + LA RY EVI V DG D+T +L Sbjct: 61 VSILVPTYNEEKTIAASVRSFLALRYPEFEVIVVCDGPRDRTLEVLKEAFRLVQVDWVFK 120 Query: 128 ---------------QIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDA 172 P+L V+ +N GKA AL G A+ +D D+LLD +A Sbjct: 121 RALPTKPIRGVYRSLTYPNLLVVD-KENGGKADALNAGLNFARYPLFCGVDADSLLDPEA 179 Query: 173 AAYIVEPMLYNPRVGAVTGNPRIRTRST--------------LVGKIQVGEYSSIIGLIK 218 L + RV AV G R + + K+Q+ EY+ + + Sbjct: 180 LLRASRLFLEDERVLAVGGTIRPLNGARVREGIVEELRLPRGFLEKMQIIEYARAFFMGR 239 Query: 219 RTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQL------NQWTIFY 272 + +SG FRRS +VG + D + ED+++ +L + + Y Sbjct: 240 AGWSAMNALLIISGAFGLFRRSEAMKVGGYRTDTVGEDMELVVRLHRRAREEGRDYRVLY 299 Query: 273 EPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTT-- 330 P +C+ +P L +QR RW +G EV + L+ R+ L Y Sbjct: 300 TPDPICYTEVPADWATLRRQRNRWHRGLWEVLWLHRAMLFNPRYGRLGLLAMPYFFLFEA 359 Query: 331 IWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHT--AGILLCTLCLLQFIVSLMIENRYE 388 + ++G+++ V N E + G+LL L + + L R Sbjct: 360 LGPVVEVLGYLLLPVFYLLGVFNAEFALLFLLLALGYGVLLSQLAIGMETLLLKRYPRLR 419 Query: 389 HNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRGILRG 441 + ++ +L+L + +F ++R +W R L G Sbjct: 420 DRVALLFLSLLEALGYRQVLALERFIATF-----QVLRKRGQWGEMRRKGLGG 467 >UniRef50_D1BHB0 Glycosyl transferase n=1 Tax=Sanguibacter keddieii DSM 10542 RepID=D1BHB0_SANKS Length = 411 Score = 258 bits (660), Expect = 3e-67, Method: Composition-based stats. Identities = 86/408 (21%), Positives = 176/408 (43%), Gaps = 17/408 (4%) Query: 29 LMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEK 88 L ++ + +S++ V R A ++ NP +S+++P F+E + Sbjct: 6 LALQLLVLPAVLLSLVEAAYLPLALVAHARR-----PADPLRVPSNPFVSVVVPAFDEGR 60 Query: 89 NVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALK 148 + + + LA RY N EV+ V+DGS D T ++ + P + V H N GKA AL Sbjct: 61 VLRRCVDSILADRYLNKEVVLVDDGSRDDTWEVMKSYRGR-PGVVVAH-QTNAGKAAALN 118 Query: 149 TGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVG 208 G + ++ L+ +D D + ++ +PRVGAV GN ++ Sbjct: 119 HGISLSRGSILMFVDADGVFSGWTIRGMLRG-FDDPRVGAVCGNDAPVNLDRFQPRLLTL 177 Query: 209 EYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQW 268 + + +++R + VSG A RR L EVG +++ ++ ED++++W+++ + Sbjct: 178 -LTHVTAMVRRALAQVNCLTIVSGNCGAVRREVLDEVGGFTEGLLGEDLELTWRVRRAGY 236 Query: 269 TIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCL 328 + ++P A+ + +P + LW+QR+RWA+G + + + + + P L Sbjct: 237 RVSFQPEAIVYAEVPAGVGPLWRQRVRWARGYLQTLWMHRDLVGSVRHGVVGPYLAVNAL 296 Query: 329 TTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYE 388 T + A ++ + + L + + + T L +L +VS + + Sbjct: 297 TMVVAP--VLQLVALGIILGLAVDDSQTLYGTPASTLAWALLLFPVLSVVVSAALNGSWR 354 Query: 389 HNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 + ++ PV +++ ++ P +RW +R Sbjct: 355 D--LRFFYVLLVMPVYSTVMNAVMVTAIVQQLRRRP----SRWNKLER 396 >UniRef50_Q2IFD0 Glycosyl transferase, family 2 n=5 Tax=Bacteria RepID=Q2IFD0_ANADE Length = 479 Score = 258 bits (659), Expect = 4e-67, Method: Composition-based stats. Identities = 105/454 (23%), Positives = 178/454 (39%), Gaps = 49/454 (10%) Query: 28 ELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQL--KDNPSISIIIPCFN 85 E++ + +++ ++ + R + P+ + P +SII+P FN Sbjct: 8 EVVQWAFLAYFIALTVSYLALNLVSLRVIARDADARASLLLPRFFSRLEPPVSIIVPAFN 67 Query: 86 EEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAA------------------ 127 EE+ + TI L Y +E++ VNDGSTD+T +L R Sbjct: 68 EEETIAGTIRCLLQLEYPELEIVVVNDGSTDRTLEVLSREFELVPFPEAYRISVPSTPVR 127 Query: 128 ------QIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPML 181 + ++RV+ N GKA A+ G AA+ +DGD++L RD+ A +V P L Sbjct: 128 RVYRSLRHRNVRVLD-KSNGGKADAMNAGINAARYPLFCAVDGDSVLQRDSVARVVRPFL 186 Query: 182 YNPRVGAVTGNPRIRT--------------RSTLVGKIQVGEYSSIIGLIKRTQRIYGNV 227 +P A G R+ + ++QV EY + + Sbjct: 187 EDPTTIACGGTIRVLNGCRVVDGFVEEIGMPRGWIARVQVLEYLRAFLFGRLGWAPVNAL 246 Query: 228 FTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQL------NQWTIFYEPRALCWIL 281 VSG FRR+A+ E G + D + ED+++ +L + I + P +CW Sbjct: 247 PNVSGAFGMFRRTAVIEAGGYRTDTVGEDMELVLRLHRLYRLSGRPYRISFVPDPICWTE 306 Query: 282 MPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFI 341 +P+TL L +QR+RW +G E N L+ + L L AF +V Sbjct: 307 VPDTLAALRRQRVRWHRGLLEAIAGNRQLLFHRRGGAAAWLTVPNLL-FFEAFGPVVEVA 365 Query: 342 IYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWF 401 Y V A L + A A L L + L ++ + + L +I Sbjct: 366 GYVVIGAAWALGVISWSSFAAFFAAALGVATLLSANALLLEEKSFHVYQRPKQLAVLIGA 425 Query: 402 PVIFWM-LSLATTLVSFTRVMLMPKKQRARWVSP 434 ++ + L T + ++ ++ W SP Sbjct: 426 VLVENLGLRQLTAFWRISGMLSWLFRRPIAWGSP 459 >UniRef50_B7I546 Glycosyltransferase n=12 Tax=Acinetobacter RepID=B7I546_ACIB5 Length = 416 Score = 257 bits (658), Expect = 4e-67, Method: Composition-based stats. Identities = 91/419 (21%), Positives = 175/419 (41%), Gaps = 20/419 (4%) Query: 28 ELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEE 87 +++ F F + W W Y + +N P P + P +S++IP +NE Sbjct: 5 DILFLFGFLGIWIPQAFWAWLSYQAWKYSKTAEKELQNLPIP--ERWPVLSVLIPAYNEG 62 Query: 88 KNVEETIHAALAQRYE--NIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQN---QG 142 +E+T+HA Q Y + EV+ +NDGS D T I + +A P ++++++ + +G Sbjct: 63 VVIEDTLHAIAQQDYPAESYEVLLINDGSKDNTLEIAENLAKIYPCIKIVNVPKGMGGKG 122 Query: 143 KAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRT-RSTL 201 K+ L G AK E +V D D+ + D + + +L + ++ AV G R R + ++ Sbjct: 123 KSRTLNNGLPHAKGELIVVYDADSTPEPDCVRLLAQTLLADKKLVAVNGKVRTRNWQDSI 182 Query: 202 VGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISW 261 + + E+ + + + + T+ G R AL +G + + + +D ++S+ Sbjct: 183 LTRFIAIEFIFFQWIFQGGRWQRFELSTLMGTNYVIWRDALETLGGFDEKSLVDDTEMSF 242 Query: 262 KLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWP 321 ++ + Q I + P A+ W P +L KQR RW QG V K + R +P Sbjct: 243 RIFIGQKRIKWVPYAIGWQQDPPSLSVFVKQRSRWTQGNFYVTRKYLPVALRTP----FP 298 Query: 322 LFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAG--ILLCTLCLLQFIV 379 + E L I + V + ++ + L + L IA G LL L ++ Sbjct: 299 IGIE-ILNNIMCYILFVPALFWS----HITLTLGLLDIAGISVLGPFTLLWGLSFCLYVA 353 Query: 380 SLMIENRYEH-NLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRG 437 + E + ++ + + +F + + +W +R Sbjct: 354 QMWFTLSLEKVKPELYFYSVLSYVSYSQIFLFIVFKAAFDMLKNKIQGNSLQWYKTERS 412 >UniRef50_Q0AL04 Glycosyl transferase, family 2 n=3 Tax=Bacteria RepID=Q0AL04_MARMM Length = 475 Score = 257 bits (657), Expect = 6e-67, Method: Composition-based stats. Identities = 97/402 (24%), Positives = 146/402 (36%), Gaps = 51/402 (12%) Query: 22 AYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIII 81 F L++ F +++ + GV + R R P ++ P +SI+ Sbjct: 11 VTFPVTFLILASALFQDTLYAVLIAIAGVT--LMRRRFEPNAHRLMRRIGQEAPRVSILA 68 Query: 82 PCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLR-------- 133 P +NE + V + A L Q + N EVI +NDGS D T L R P R Sbjct: 69 PAYNEAETVVDCTRALLRQNHSNYEVIVINDGSGDDTLNRLVRAFDLKPAERECSDRLDF 128 Query: 134 -----VIHLAQ----------NQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVE 178 V H N GKA AL G A A ID D+LL+ DA + Sbjct: 129 TPVRAVYHRPGPVPLWVIDKVNGGKADALNAGIAHASGSLFCAIDADSLLEADALIRTSQ 188 Query: 179 PMLYNPRVGAVTGNPRIRT--------------RSTLVGKIQVGEYSSIIGLIKRTQRIY 224 P L NP AV G R+ + +Q EY + + Sbjct: 189 PFLDNPDTVAVAGTVRVANGCRLREGAVEEAGLPRRFLPLVQSLEYLRAFLVARFALSEL 248 Query: 225 GNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWT------IFYEPRALC 278 G + VSG F R+ + EVG + + ED ++ +L + + + P +C Sbjct: 249 GALMIVSGAFGMFDRARVVEVGGYDPRTVGEDAELIVRLHRDGRDSDRPTVVRFIPEPVC 308 Query: 279 WILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLV 338 W +PE L L QR RW +G E ++ R L + ++ Sbjct: 309 WTQVPERLSDLAGQRRRWQRGALETLWRHRDAALRPGYGLPGTLGLGSMIVI-----DVI 363 Query: 339 GFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVS 380 G + + A L HI+ L +C F++S Sbjct: 364 GPCLEVLGYALAFLLAATGHIS-WSWFAALFALVCSFGFLLS 404 >UniRef50_C7RAF6 Glycosyl transferase family 2 n=1 Tax=Kangiella koreensis DSM 16069 RepID=C7RAF6_KANKD Length = 453 Score = 256 bits (654), Expect = 1e-66, Method: Composition-based stats. Identities = 99/437 (22%), Positives = 186/437 (42%), Gaps = 22/437 (5%) Query: 8 FFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAP 67 + + VL + + S L+ VF + + ++ + + R P + Sbjct: 19 YLLSATVLILAFFIDRPVSDSLISLRVFLFIMVLPLITKLMVQLYAAIRYSLQPVNQGYS 78 Query: 68 APQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAA 127 +S+++P +NEE + +TI + L Y + E+I +NDGSTDKT ++ + A Sbjct: 79 TMVPY---KVSVLLPAWNEEVGIVKTIESVLQSNYHDFELIVINDGSTDKTDQLVRQFIA 135 Query: 128 QI-----PHLRVIHLA-QNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPML 181 P + + +L+ N GKA AL G A ++ ID D ++DR+A ++ Sbjct: 136 DYEADSNPKVSIKYLSLPNGGKAHALNKGLELATGNIVITIDADCIVDRNAIYRFIQR-F 194 Query: 182 YNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSA 241 +P +GAV GN + + + +Q EY KR + V + G AA+RR Sbjct: 195 DDPSIGAVAGNIIVGNKRKPLEWLQQLEYVCGF-FYKRADSYFNAVHIIGGAAAAYRRDV 253 Query: 242 LAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGA 301 L + + +++ITEDID+S + + + Y A+ + P KG+ KQRLRW G Sbjct: 254 LINLKGFDEEIITEDIDLSMSILSHGYKTRYAHDAVVYTEGPTEWKGIRKQRLRWKYGRI 313 Query: 302 EVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAA 361 +LK+ + L + ++A L+ ++ + + Sbjct: 314 LCYLKHRELFLSSNSQHNRYLSWFILPLAVYAEIMLLNVGLFLSLFYCYVIASQ----DF 369 Query: 362 THTAGILLCTLCLLQFIVSLMIENRYEHNL--TSSLFWIIWFPVIFWMLSLATTLVSFTR 419 A ++L + L+ + L ++ Y NL + W++ + + F L T Sbjct: 370 ALLASVILVSSSLVALQIKLDSKSSYHKNLLPFIPVAWLLLYVIEFIELVALTM-----S 424 Query: 420 VMLMPKKQRARWVSPDR 436 + + ++ +W +R Sbjct: 425 LQKLYNRESVKWQKWNR 441 >UniRef50_Q7NHH7 Glr2559 protein n=1 Tax=Gloeobacter violaceus RepID=Q7NHH7_GLOVI Length = 426 Score = 255 bits (651), Expect = 3e-66, Method: Composition-based stats. Identities = 95/414 (22%), Positives = 158/414 (38%), Gaps = 23/414 (5%) Query: 26 SGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFN 85 + L + + P S++W+ G+ +Y R P P +S+++ N Sbjct: 18 ALVLFVLALHLIPTGASVVWLAAGLT-GLYAVRVLFALSPRPKSDPAYRPRVSVLVAAKN 76 Query: 86 EEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHL---AQNQG 142 E+ + + Y + EV +DGSTD+T L L ++ G Sbjct: 77 EQAVAAQLVAMLRRLDYPDFEVWIADDGSTDRTYQRLLEAGRGWQALHLVRRIPERSRPG 136 Query: 143 KAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRS-TL 201 K+ L A + LV D DA ++ D + V P+ VGA+ R+ Sbjct: 137 KSAVLNELRERATGDILVVFDADARVEPDFLSRTV-PLFAVSSVGALQVRKRVHNADFNF 195 Query: 202 VGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISW 261 + Q E + ++ + G + G R +AL VG W++ +T+D+D++ Sbjct: 196 WTRGQSAEML-LDAFYQQQRAAIGGTAELRGNGQLVRAAALEAVGGWNEATVTDDLDLTL 254 Query: 262 KLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWP 321 +L L W I + T LW+QR RWA+GG + +L RL+ Sbjct: 255 RLHLGGWQIAFASDPCVDEEGVTTWSALWRQRSRWAEGGFQRYLDYAPRLFGGAMGTTKT 314 Query: 322 L-FFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVS 380 + +C V +++A Q PL L +A T +C Q Sbjct: 315 VDQLIFCTIQYLMPVAAVLDLLFAFQRGAAPLLTPLVLVATVFT----VCGFYFGQRERG 370 Query: 381 LMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSP 434 + + L +++++ WFPVI LV R L PKK WV Sbjct: 371 VQVGRAMLETLAGTIYFLHWFPVI---------LVKLARTALEPKK--LVWVKT 413 >UniRef50_Q2LWC4 Glycosyltransferase n=1 Tax=Syntrophus aciditrophicus SB RepID=Q2LWC4_SYNAS Length = 482 Score = 254 bits (648), Expect = 6e-66, Method: Composition-based stats. Identities = 95/458 (20%), Positives = 181/458 (39%), Gaps = 65/458 (14%) Query: 30 MMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLK---DNPSISIIIPCFNE 86 F+ F+ F+ +++ + V RH AP L+ + P +SIIIP NE Sbjct: 12 FNYFIGFYYGFVQVIYTLMLTIALVVILRHLNRIRYAPFRDLEKRSEMPPVSIIIPARNE 71 Query: 87 EKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMA-------------------- 126 + + +I ++L Y +EVI VNDGSTD +L Sbjct: 72 SEVIIRSIESSLTMNYPGLEVIVVNDGSTDGMLDMLISHYSLQRIDPLYRDLIKTRPVRG 131 Query: 127 ----AQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLY 182 ++IP+L +I +N GKA A+ +S Y+ D D++L++DA + P++ Sbjct: 132 FYCTSKIPNL-LIVDKENGGKADAINCAINVCRSPYVCVQDADSVLEKDALIRLATPLVQ 190 Query: 183 NPR-VGAVTGNPRIRT---------------RSTLVGKIQVGEYSSIIGLIKRTQRIYGN 226 + V A G R+ R + + Q+ EY + Sbjct: 191 SEVPVIACGGVVRVLNGTKQEGNVVTSIELPRRSTLACFQIVEYLRSFLFGRVGLDAMNC 250 Query: 227 VFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLN------QWTIFYEPRALCWI 280 +SG + F+++AL +G ++ D + ED+++ ++ + + I + +CW Sbjct: 251 NLVISGAFSFFQKAALLNIGGFATDNVAEDMELIVRMHRHYRLKKIPYRIRFISDPICWT 310 Query: 281 LMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGF 340 +PET + L +QR RW G + K+ + ++ ++ Y + F ++ Sbjct: 311 EVPETFRMLGRQRRRWHLGMIQTVWKHKSLIFNPRYGQIGLFAMPYTV-----FVEMISP 365 Query: 341 IIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQF------IVSLMIENRYEHNLTSS 394 ++ + +PL+ I A + L + F + + RY S Sbjct: 366 LLEFLGYIVIPLSYFFGLINADYFLLFLALAIAYGVFLSTMGIFLEEITFRRY--PKWSD 423 Query: 395 LFWIIWFPVIFWM-LSLATTLVSFTRVMLMPKKQRARW 431 LF ++ + V+ + + F + R +W Sbjct: 424 LFKLLLYGVLENIGYRQLNSFWRF-QAFFQFLFGRIKW 460 >UniRef50_A8MM07 Glycosyl transferase family 2 n=1 Tax=Alkaliphilus oremlandii OhILAs RepID=A8MM07_ALKOO Length = 482 Score = 252 bits (644), Expect = 2e-65, Method: Composition-based stats. Identities = 91/462 (19%), Positives = 174/462 (37%), Gaps = 53/462 (11%) Query: 22 AYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIII 81 +++ +V + IV + Y + + P IS+++ Sbjct: 9 IVISFNHMVLYYVLAINSIYFMQLIVSAFSLYDYIRKMYYSDFKKYTASSNMIP-ISVLV 67 Query: 82 PCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKT-RAILDRMAAQIPHLRVIHL--- 137 P ++EE+ + + I + L+ Y EVI +NDGS D T R I++ + + V HL Sbjct: 68 PAYDEEETIVDNIKSLLSLNYPAFEVIVINDGSNDDTLRKIVEAYDLKEINQPVRHLLKT 127 Query: 138 -------------------AQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVE 178 +N GKA AL G +K ID D++L+ D+ ++ Sbjct: 128 KKIKGIYANLDIPNLVLVDKENGGKADALNVGINVSKYPVFTSIDADSILESDSLVRVIM 187 Query: 179 PMLYNPRVGAVTGNPRIRTRSTL--------------VGKIQVGEYSSIIGLIKRTQRIY 224 P + + AV G RI S + + Q+ EY + Sbjct: 188 PFIEDNLTVAVGGIVRIINGSEIKNGKVEAIGLPKSKLAMFQIVEYLRAFLTGRLAWSKL 247 Query: 225 GNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLN------QWTIFYEPRALC 278 + VSG AF++ + + G ++ D I ED+++ K+ ++ + + P +C Sbjct: 248 RCLLIVSGAFGAFKKDVVVQAGGYTPDTIGEDMELVVKIHKMMRKNKVKYKVNFIPDPVC 307 Query: 279 WILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLV 338 W PE +K L QR RW G + ++ L + Y + W F ++ Sbjct: 308 WTQAPENIKDLRGQRRRWQIGLMDSLFRHKDLLLNPRYGVVG----MYGVPYFWIF-EML 362 Query: 339 GFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCL-LQFIVSLMIENRYEHNLTSSLFW 397 G II + +PL+ + + L+ ++ + + ++ Y + S+ Sbjct: 363 GPIIETLGYIIIPLSYIFGLLNTRYFVLFLITSILYGIILSLGAILLEEYTFSKYPSMKQ 422 Query: 398 IIWFPVIFWMLSLAT---TLVSFTRVMLMPKKQRARWVSPDR 436 ++ + + + T + ++ KK R W R Sbjct: 423 LLRLSLYGILENFGYRQMTTLFRIDAIIRFKKFRGDWGKIKR 464 >UniRef50_C7TL81 Glycosyl transferase, group 2 n=3 Tax=Bacilli RepID=C7TL81_LACRL Length = 417 Score = 252 bits (643), Expect = 2e-65, Method: Composition-based stats. Identities = 93/423 (21%), Positives = 180/423 (42%), Gaps = 22/423 (5%) Query: 26 SGELMMRFVF--FWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPC 83 + + +M F W M + +V YF + P P P + +S+++P Sbjct: 3 ALDWIMLFAIGAIWLILMVNVILVVAGYFEYMKMTQQPEPSLPPTPPM-----VSVMVPA 57 Query: 84 FNEEKNVEETIHAALAQRYE--NIEVIAVNDGSTDKTRAILDRMAAQIP--HLRVIH--- 136 NE + +T+ + L Y E+I +ND S+D + A+L + + P L VI+ Sbjct: 58 HNEGVVIVKTVESLLRFDYPQDRYEIIVINDNSSDNSAALLRDLQHRYPDRQLHVINTDA 117 Query: 137 LAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIR 196 + +GK+ AL G A+ L D D + A +V ++ + +GAV G R R Sbjct: 118 VTGGKGKSNALNIGLTKAQGSVLAIYDADNTPEFGALRILVSELMADDGLGAVIGKFRTR 177 Query: 197 TR-STLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITE 255 + +T + + E S + + ++ + T+ G RRS + ++G W + E Sbjct: 178 NKQATWLTRFINIETLSFQWMAQAGRQHLFGLCTIPGTNYVIRRSLIDKIGGWDVKALAE 237 Query: 256 DIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKE 315 D +IS+ + +N I ++P+A+ W P+TL + QR RW +G V LKN L++K Sbjct: 238 DTEISFHVYMNGARIKFQPKAVTWEQEPQTLDVWFHQRTRWVKGNIYVILKNSALLFQKR 297 Query: 316 NFRMWP-LFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCL 374 + L + + + + ++ ++ + AG + + +L L + Sbjct: 298 GRPIRFDLIYFLSIYFLLMTSLVLSDAVFILSTAGWA---HVGLKGFSTGLWLLAILLFI 354 Query: 375 LQFIVSLMIENRYEHNLTSSLFWIIWFPV-IFWMLSLATTLVSFTRVMLMPKKQRARWVS 433 + +++ E + +++ W+ +V++ R L Q+A W Sbjct: 355 VSTFITISTEKGEMTVGNVGIIGLMYITYSQLWLAVALYGMVAYIREQLF--HQQAHWYK 412 Query: 434 PDR 436 R Sbjct: 413 TKR 415 >UniRef50_Q110Z2 Glycosyl transferase, family 2 n=15 Tax=Cyanobacteria RepID=Q110Z2_TRIEI Length = 502 Score = 252 bits (643), Expect = 3e-65, Method: Composition-based stats. Identities = 97/434 (22%), Positives = 175/434 (40%), Gaps = 35/434 (8%) Query: 24 FHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPC 83 + + ++ + + + + ++ + + + + + P IS+++ Sbjct: 75 WTTTIILHLLSWGYWIILGLTGLLSVQFLRILFAKPKLAPKTLSEENFTEWPYISLLVAA 134 Query: 84 FNEEKNVEETIHAALAQRYE--NIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQ-- 139 NEE + + + LA Y + E+ ++D STDKT +L+++A + L+VI + Sbjct: 135 KNEEAVIRKLVKNMLALDYPTNSYELWVIDDNSTDKTPLLLEQLAREYEQLKVIRRSPDA 194 Query: 140 NQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRS 199 GK+ AL K + L D DA + D +V P+ VGAV I Sbjct: 195 GGGKSGALNAAIPFVKGKILGVFDADAQVTPDLLQKVV-PLFAREEVGAVQIRKAIANAG 253 Query: 200 -TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDID 258 K Q E + G + + G + + G R +AL E G W++ IT+D+D Sbjct: 254 INFWTKGQSAEMV-VDGFFQEQRIAIGGIGELRGNGQFVRMNALEECGGWNEQTITDDLD 312 Query: 259 ISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFR 318 ++ +L LNQW I Y + LW QR RWA+GG + +L + R Sbjct: 313 LTIRLHLNQWDIDYLAFPAVTEEGVTSPIALWHQRSRWAEGGYQRYLDYWKLILRNR--- 369 Query: 319 MWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFI 378 + F +W F + +++ + ++I L + T + + LL Sbjct: 370 ---MRFSKTW-DLWQF-LVTQYLLSVAAVPDFLMSIILRRLPITSPLTVFTVMVSLLGMF 424 Query: 379 VSLMIENRYEHNLTS---------------SLFWIIWFPV--IFWMLSLATTL-VSFTRV 420 + L + + NL SLF + V F+ML + V+ R+ Sbjct: 425 IGLRRTRKQQMNLAKEEKVMEFNSSKDNPLSLFLTLLESVRGTFYMLHWFVVMGVTIARM 484 Query: 421 MLMPKKQRARWVSP 434 ++PK R +WV Sbjct: 485 SILPK--RLKWVKT 496 >UniRef50_Q988K9 Mlr6694 protein n=1 Tax=Mesorhizobium loti RepID=Q988K9_RHILO Length = 475 Score = 251 bits (642), Expect = 3e-65, Method: Composition-based stats. Identities = 98/402 (24%), Positives = 167/402 (41%), Gaps = 54/402 (13%) Query: 26 SGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPA--PQLKDNPSISIIIPC 83 + ++ F+ + ++++ + + P +A P I++++P Sbjct: 12 AASILSWFIIGAGLAQTTIYLLQLIIAAYALSKRPPVARSALLWHRYGDVAPPIALLVPA 71 Query: 84 FNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAIL-------------DRMAAQIP 130 +NEE NV E++H+ LA Y N EVI +NDGS D+T L + A P Sbjct: 72 YNEELNVVESVHSMLALEYPNFEVIVINDGSKDQTLQRLIGAFKLVKFRRPYEEALAHKP 131 Query: 131 -----------HLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEP 179 L V+ +N GKA A G ++ IDGD++L+ DA +P Sbjct: 132 IRGLYSSPMTERLFVVD-KENGGKADAQNAGINVCRAPLFCVIDGDSILEPDALMRAAQP 190 Query: 180 MLYNP-RVGAVTGNPRIRT--------------RSTLVGKIQVGEYSSIIGLIKRTQRIY 224 + +P R AV G RI + L+ +QV EY + + Sbjct: 191 FIDDPERTIAVGGTIRIANGSRIEAGRVREIHLPNRLLPLLQVVEYLRAFLMARLAWSHI 250 Query: 225 GNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQL------NQWTIFYEPRALC 278 + VSG FRR+ + VG ++ + ED+D+ KL ++ I + P +C Sbjct: 251 NTLMLVSGAFGMFRRTEVVAVGGFTMGSMGEDLDLVIKLHRHMRDAGRKYRIQFIPEPVC 310 Query: 279 WILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLT--TIWAFTC 336 W PETL L +QR RW +G E F + L+ R+ L F + L + Sbjct: 311 WTEAPETLSVLARQRTRWQRGALECFFRYRAMLFNPRYGRVGFLGFGHILIVDVLGPVAE 370 Query: 337 LVGFI----IYAVQLAGVPLNIELTHIAATHTAGILLCTLCL 374 ++G+I + + + + T + T+ + +C+L L Sbjct: 371 VLGYILIPAFWLLGILSTEYLLAFTSLIFTYGVCVSVCSLVL 412 >UniRef50_Q46CB1 Glycosyltransferase n=2 Tax=Methanosarcina RepID=Q46CB1_METBF Length = 694 Score = 251 bits (642), Expect = 3e-65, Method: Composition-based stats. Identities = 98/389 (25%), Positives = 165/389 (42%), Gaps = 30/389 (7%) Query: 34 VFFWPFFMSIMWIVGGVYFWVYR-----ERHWPWGENAPAPQLKDNPSISIIIPCFNEEK 88 V+FW F ++ + + W + R + + +SII+P ++EEK Sbjct: 302 VYFWFFIITSLLPLSFFVIWKFLSGWNIRRSKNSCQPQNIQDTGYHGLVSIIVPAYDEEK 361 Query: 89 NVEETIHAALAQRYEN-IEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIAL 147 + + + L Q Y +EVI VNDGSTD+T I+ + ++ I L N GKA AL Sbjct: 362 TIGSCLESILNQDYNGEMEVIVVNDGSTDRTAEIVLKY-----PVKFIDLKANVGKANAL 416 Query: 148 KTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRT---RSTLVGK 204 AK + L+ D D+ + +A + +VE ++ N V GN I + ++ Sbjct: 417 NRAIEDAKGDILIFTDSDSYMSANAVSSLVECLITNNDAQIVAGNVFIHNSQGKKRIMKY 476 Query: 205 IQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQ 264 Q+ EY + + Q + GN+ G + A +R V +SD+ I ED D + K Sbjct: 477 FQMIEYLIEQEITRYLQSLSGNILVCPGPLTAVKREVCESV-RFSDETIVEDADFTIKAL 535 Query: 265 LNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFF 324 I EP+A + PET + +KQR RW G +V+ + R + L + Sbjct: 536 KKSMKIIREPKAKVYTNAPETFRAWYKQRKRWWYGNLQVWRLHRHWAMRNAWMVLNYLGY 595 Query: 325 EYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIE 384 + +I LV F+ Y + L I L ++ +L L + F SL Sbjct: 596 IIGVCSI----ILVMFLPYLLLQYDNLLLISLRGLS-----YSILPVLIFIVFTASLF-- 644 Query: 385 NRYEHNLTSSLFWIIWFPVIFWMLSLATT 413 L +I + +++ + + T Sbjct: 645 ----SKDKKLLPMLIPYVLVYSTIKVVTI 669 >UniRef50_C1F7J6 Glycosyl transferase, group 2 family n=1 Tax=Acidobacterium capsulatum ATCC 51196 RepID=C1F7J6_ACIC5 Length = 627 Score = 251 bits (641), Expect = 5e-65, Method: Composition-based stats. Identities = 81/372 (21%), Positives = 149/372 (40%), Gaps = 12/372 (3%) Query: 39 FFMSIMWIVGGVYFWVY---RERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIH 95 F + I+ GV+ ++ R P + + P +++ +P FNE+ ++ I Sbjct: 132 FLVMIILSFYGVHRYIMVWEYYRFRKRATKEPPKEFPELPRVTVQLPIFNEQFVIDRLIE 191 Query: 96 AALAQRYENIEV-IAVNDGSTDKTRAI----LDRMAAQIPHLRVIHLAQNQG-KAIALKT 149 A A Y + I V D STD+T+A+ + + Q + +H QG KA AL Sbjct: 192 AICAMDYPRDRLEIQVLDDSTDETQAVAAALVKKYQEQGQPIVYLHRTNRQGYKAGALDE 251 Query: 150 GAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRS-TLVGKIQVG 208 G AK E++ D D + D ++ +P +G V R + + +++ Sbjct: 252 GLKVAKGEFVAIFDADFVPSPDWLMKVI-HHFSDPAIGMVQTRWTHLNRDYSFLTQVEAI 310 Query: 209 EYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQW 268 L + G F +G +RR+A+ + G W D +TED D+S++ QL W Sbjct: 311 LLDGHFVLEHGARSRAGVFFNFNGTAGMWRRTAITDAGGWQHDTLTEDTDLSYRAQLVGW 370 Query: 269 TIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCL 328 Y C +P + Q+ RWA+G + K M ++ R + L Y L Sbjct: 371 KFKYLQDVECPAELPIEMTAFKTQQARWAKGLIQTSKKIMPQVLRADLSWHEKLEAWYHL 430 Query: 329 TTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYE 388 T ++ ++ I + + + + T + F + + + Y Sbjct: 431 TANISYPLMIVLSILLLPTEIIQFHQGWFQMLFIDFPLFAASTFSIASFYM-VSQQILYP 489 Query: 389 HNLTSSLFWIIW 400 H +L ++ + Sbjct: 490 HRWFRTLCYLPF 501 >UniRef50_C7RTK3 Polysaccharide deacetylase n=2 Tax=Bacteria RepID=C7RTK3_9PROT Length = 336 Score = 250 bits (639), Expect = 7e-65, Method: Composition-based stats. Identities = 79/338 (23%), Positives = 144/338 (42%), Gaps = 14/338 (4%) Query: 90 VEETIHAALAQRYE-NIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALK 148 + +TI + LA + +++ V+DGS+D T + P +R I +N GK A Sbjct: 2 IVQTIASLLACDPPRDFQIVVVDDGSSDATYRVACDAFGSDPRVR-IFTKENGGKPAAPN 60 Query: 149 TGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVG 208 G A ++E +V +D D + RD + +PRVGAV GN ++ R L+ + Q Sbjct: 61 FGLAQTQAEVIVALDADTIFARDTVRKLAR-HFEDPRVGAVAGNAKVGNRINLLTRWQAL 119 Query: 209 EYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQW 268 EY + L +R + + V G + A+RR + G + + ED D++ ++ + Sbjct: 120 EYITSQNLDRRAFDVLNCITVVPGAVGAWRRELVERAGGFDHQTLAEDADLTLAIRKLGY 179 Query: 269 TIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCL 328 + YE A+ P++++G QR RW G +V K+ L+R ++ + + + Sbjct: 180 NVVYEDEAVALTEAPDSVRGFIGQRYRWMYGTMQVARKHRDVLFRPQSGALGFVALPNVI 239 Query: 329 TTIWAFTCLVGFIIYAVQLAGV----------PLNIELTHIAATHTAGILLCTLCLLQFI 378 F LV I+ + + + P ++ L T+ L I Sbjct: 240 VFQILFP-LVSPIMDLLLIGSLASAAVNHWQHPEEFSAGNLWRVLFYYALFVTVDYLAAI 298 Query: 379 VSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVS 416 ++ +E + L LFW +F + V+ Sbjct: 299 LAFALERKESWWLLIWLFWQRFFYRQLMYYVAIKSTVT 336 >UniRef50_D1CH68 Glycosyl transferase family 2 n=2 Tax=Bacteria RepID=D1CH68_THET1 Length = 470 Score = 250 bits (639), Expect = 7e-65, Method: Composition-based stats. Identities = 86/410 (20%), Positives = 156/410 (38%), Gaps = 49/410 (11%) Query: 24 FHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPC 83 F ++ + +++++ V Y R + + P +S+I+P Sbjct: 8 FAVNAFVLCYFLALNGTYLVLYLISFVEIVDYSRREI-FSGLSELFTSAYAPPVSLIVPA 66 Query: 84 FNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDR------------------- 124 +NEE + +++ + LA Y +V+ VNDGS D T +L R Sbjct: 67 YNEEATIADSVRSFLALHYPQHQVVVVNDGSRDATLEVLKREFDLWESDQPIRLRLRTAP 126 Query: 125 -----MAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEP 179 +++ P+L VI +N GKA +L G AA + CID D +L+ DA + P Sbjct: 127 IRAVYASSRFPNLLVID-KENGGKADSLNAGICAASYPLVCCIDADIILEEDALLRVARP 185 Query: 180 MLYNPRVGAVTGNPRIRT--------------RSTLVGKIQVGEYSSIIGLIKRTQRIYG 225 M+ + +V AV G R+ + QV EY + Sbjct: 186 MIESSKVVAVGGIVRVANGCKIEAGRVTEVRAPREWLPSFQVVEYLRAFLAARTAWSRLN 245 Query: 226 NVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQL------NQWTIFYEPRALCW 279 + +SG FRR + VG + + ED+++ ++ + I + P + W Sbjct: 246 CLLIISGAFGMFRREDVIAVGGYETSTVGEDMEVVTRMHRLLRERHRDYHIAFVPDPVAW 305 Query: 280 ILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKEN--FRMWPLFFEYCLTTIWAFTCL 337 +P TL+ L +QR RW +G + L++ L M + + + + L Sbjct: 306 TEVPATLRVLRRQRDRWHRGLIDTLLRHRQMLLNPTYGLVGMLAMPYYFLFEMLGPVVEL 365 Query: 338 VGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRY 387 +G++ + L +N G L + ++ RY Sbjct: 366 LGYMALVLGLVLGVVNAPFALAFFGAAVG-LGVLMSTAAVLLEEFRFRRY 414 >UniRef50_C5A6A1 Glycosyltransferase n=2 Tax=Thermococcus RepID=C5A6A1_THEGJ Length = 451 Score = 250 bits (639), Expect = 8e-65, Method: Composition-based stats. Identities = 111/433 (25%), Positives = 186/433 (42%), Gaps = 20/433 (4%) Query: 8 FFILCLVLCIPLCVAYF--HSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGEN 65 F + I L + +R++F+ + + ++ YF Y H+ E Sbjct: 25 FVQEAVWKVISLLFQGVPRYPVYAFVRYLFY--LTGAFVILIYAAYFIFYL--HFRENEG 80 Query: 66 APAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYE--NIEVIAVNDGSTDKTRAILD 123 + P + P +S++IP NE N+E I + Q Y EVI V+DGS D T I + Sbjct: 81 S-KPNPRFFPKVSLVIPAHNEAMNIERLIESIQYQDYPFECYEVILVDDGSVDGTPEIAE 139 Query: 124 RMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALL-DRDAAAYIVEPMLY 182 R +RVI +N GKA AL+TG AAK + ++ +D D+ D + IVE + Sbjct: 140 RYG-----IRVIRHERNMGKAKALETGIKAAKGDVIITLDADSYFADGSSLRNIVENLFS 194 Query: 183 NPRVGAVTGNPRIRTRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSA 241 P VG TG RI RS L+ K QV EY + +R Q + V G +AF+ Sbjct: 195 RPFVGVSTGAIRIDVRSGKLIEKFQVIEYLHSFEVGRRVQGYLDWLLVVPGAFSAFKGYL 254 Query: 242 LAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGA 301 + + D + ED +++ T +EP+A + + + L++QR+RW GG Sbjct: 255 IKSLPAIPRDTLAEDFELAMITYRAGLTSNFEPKAAVYTEPATSWRELYRQRIRWYYGGL 314 Query: 302 EVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVP---LNIELTH 358 +V K + ++ F + + + L F I A L + L +E+ Sbjct: 315 QVMAKYHDMIMNRKYGEKGLFLFFHMILLEYILPVLQVFGIVAFPLIMLVHNFLGLEILD 374 Query: 359 IAATHTAGILLCTLCL-LQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSF 417 I + + L L LQ++ +++ + + + PVIF + L+S Sbjct: 375 ITLPFPLMMAVFLLVLFLQYLPGVLMSGIAMAIERNPRTALGYLPVIFLYYPIYNPLLSL 434 Query: 418 TRVMLMPKKQRAR 430 ++ M + R Sbjct: 435 AKIDAMLRFLRGV 447 >UniRef50_B1YJV4 Glycosyl transferase family 2 n=4 Tax=Bacillales RepID=B1YJV4_EXIS2 Length = 482 Score = 250 bits (638), Expect = 1e-64, Method: Composition-based stats. Identities = 90/457 (19%), Positives = 182/457 (39%), Gaps = 58/457 (12%) Query: 30 MMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKD--NPSISIIIPCFNEE 87 ++ ++ F S ++++ + R++ + E++ P ++ +S+++P +NEE Sbjct: 18 VLIYMILVIIFYSTLYLISA---QMVRKQRFRRKESSYLPLMRSLYTRPVSVLVPAYNEE 74 Query: 88 KNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAA-----QIPHL---------- 132 + ++ A LA Y EVI +NDGS+D T L + + P + Sbjct: 75 YTIVTSVQALLALDYPEHEVIVINDGSSDGTMDELFKTYELEKELRFPRILIGSQPVKHV 134 Query: 133 -------RVIHL-AQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNP 184 R+I + +N GKA AL G + Y +D D++L+RDA +++P++ + Sbjct: 135 YRSRIDARLIVIDKENGGKADALNAGIDYSSYPYFCSLDADSILERDALLKVMKPIVESD 194 Query: 185 RVGAVTGN---------------PRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFT 229 T IRT + +Q+ EY + + + Sbjct: 195 EDVIATCGSIFLVNGCTIEGGEIVEIRTPKQRIVLMQMIEYFRAFLFGRLGLSRFNMLLI 254 Query: 230 VSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLN------QWTIFYEPRALCWILMP 283 VSG F+++ + + G + + ++ ED+++ +L I Y P +CW P Sbjct: 255 VSGAFGLFQKAIVVQAGGYKEGLVGEDMELVVRLHRYMKENKIDKRIEYVPSPVCWTQAP 314 Query: 284 ETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMW--PLFFEYCLTTIWAFTCLVGFI 341 +T++ L QR RW +G AE + + + + + + + + L+G+ Sbjct: 315 DTMEVLRSQRSRWHRGLAETLWTHKRMIGNPRYGFVGSVSMLYFFLIELLGPIVELIGYF 374 Query: 342 IYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYE--HNLTSSLFWII 399 I + L LN E + T A + L ++ +R++ L W + Sbjct: 375 IIPLNLLVGTLNFEFGVVLLT-VALLYGSLLSAGAVLLEEWTMHRFKNVKELIRLYLWSL 433 Query: 400 WFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 + + + L + M R +W +R Sbjct: 434 TESFWYRPILVFFKLGGLIQFM----TGRGQWGKMER 466 >UniRef50_C5CVQ8 Glycosyl transferase family 2 n=4 Tax=Proteobacteria RepID=C5CVQ8_VARPS Length = 445 Score = 249 bits (636), Expect = 2e-64, Method: Composition-based stats. Identities = 103/409 (25%), Positives = 175/409 (42%), Gaps = 24/409 (5%) Query: 27 GELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPA-PQLKDNPSISIIIPCFN 85 ++ + P FM+ + + + PA P L P ISI+I +N Sbjct: 55 AIFLVYGIAVVPGFMNAFLAISLLL------------DRRPAHPALAVYPPISILIAAYN 102 Query: 86 EEKNVEETIHAALAQRYEN-IEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKA 144 EE +++ET+ + Q Y ++VI +NDGS D T ++ R + P L + L +N GKA Sbjct: 103 EEASIDETLVSIDRQNYPGELQVIVINDGSADGTSLVVQRARERYPWLTFLDLEKNGGKA 162 Query: 145 IALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYN-PRVGAVTGNPRIRT-RSTLV 202 AL G + ++ +D D+ L R A IVE + P AV G +R R + Sbjct: 163 RALNIGFKEVAHDLVITVDADSFLYRGALTSIVERYRADPPHTRAVAGKILVRNSRLNWI 222 Query: 203 GKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWK 262 + Q +Y I IKR Q ++ G + + R+AL EVG W + + EDI ++W Sbjct: 223 TRCQEWDYFHGIAAIKRVQSLFQGTLVAQGAFSIYDRAALVEVGGWPE-CVGEDIVLTWA 281 Query: 263 LQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPL 322 L + + + A + +P T+K L KQR RWA+G AE F+K+ L + R+ Sbjct: 282 LLKAGYRVGHCEDACLFTNVPTTVKQLVKQRQRWARGMAEAFIKHPGILVKP---RLSTF 338 Query: 323 FFEYCLTTIWA-FTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSL 381 F + L W C +GFI + T L LL F + Sbjct: 339 FIYWNLLFPWLDLACTIGFI---PGIILAFFGHYWIVGIMTLALIPATFALSLLMFRIER 395 Query: 382 MIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRAR 430 + + + ++ + +++ ++ +++ + + K Sbjct: 396 RMFRKTGLVVRKNVQGFFVYVLLYSLILQPASVLGYLDELFRKPKAWGT 444 >UniRef50_Q1IMJ5 Glycosyl transferase, family 2 n=10 Tax=cellular organisms RepID=Q1IMJ5_ACIBL Length = 546 Score = 248 bits (634), Expect = 3e-64, Method: Composition-based stats. Identities = 78/371 (21%), Positives = 144/371 (38%), Gaps = 12/371 (3%) Query: 39 FFMSIMWIVGGVYFW---VYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIH 95 F + I+ GV+ + R+ P + P +++ +P FNE+ ++ + Sbjct: 51 FIVLIILAAYGVHRYQLVWMYYRNRKNKTTDPPQHFAELPRVTVQLPIFNEQYVIDRLVE 110 Query: 96 AALAQRYENIEV-IAVNDGSTDKT----RAILDRMAAQIPHLRVIHLAQNQG-KAIALKT 149 A Y ++ I V D STD+T R +++R AA + IH G KA AL+ Sbjct: 111 AVCKLDYPKDKLDIQVLDDSTDETVEVAREVVERYAALGNPISYIHRTNRHGFKAGALQE 170 Query: 150 GAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRS-TLVGKIQVG 208 G A K E++ D D + D + P +G V R+ + + +++ Sbjct: 171 GMAVCKGEFIAIFDADFVPPADFLQKCI-HHFAEPEIGMVQTRWTHLNRNYSFLTEVEAI 229 Query: 209 EYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQW 268 L + G F +G +R+ A+ E G W D +TED D+S++ Q+ W Sbjct: 230 LLDGHFVLEHGGRSRKGVFFNFNGTAGMWRKQAIEEAGGWQHDTLTEDTDLSYRAQVKGW 289 Query: 269 TIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCL 328 Y C +P + Q+ RWA+G + K + L+R + R + Y L Sbjct: 290 RFKYLQDVECPAELPIEMTAFKTQQARWAKGLIQCSKKVLPFLYRSDVPRRVKVEAWYHL 349 Query: 329 TTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYE 388 T ++ ++ + + + L T + F + + + Y Sbjct: 350 TANISYPLMIVLSALMLPAMVLRFYQGWFQMLYIDMPLFLASTFSISSFYL-VSQKELYP 408 Query: 389 HNLTSSLFWII 399 + ++ Sbjct: 409 KTWLRTFMYLP 419 >UniRef50_Q05S47 Group-specific protein n=1 Tax=Synechococcus sp. RS9916 RepID=Q05S47_9SYNE Length = 422 Score = 248 bits (634), Expect = 3e-64, Method: Composition-based stats. Identities = 96/387 (24%), Positives = 156/387 (40%), Gaps = 21/387 (5%) Query: 34 VFFWPFFMSIM---WIV--------GGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIP 82 + FWP +S + WI+ + R + W E P P +SI+IP Sbjct: 2 LHFWPLILSALLGTWIIRTAIVVALAHQHNQAERTKEANWSEQQSTPC---KPLVSILIP 58 Query: 83 CFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIP--HLRVIHLAQN 140 NEE + + I + + Y N E+I +NDGS D T A P + VI A N Sbjct: 59 ACNEEATITDCIQSCIQSNYGNKEIIIINDGSNDATSDQAHIAQANHPKARITVIDFACN 118 Query: 141 QGKAIALKTGAAAAKSEYLVCIDGDALL-DRDAAAYIVEPMLYNPRVGAVTGNPRIRTRS 199 QGK AL G A E++V +D D D + ++ P++ + A TGN +I Sbjct: 119 QGKTAALNEGLLYANGEFIVTLDADTRFSDPSSLTRLLNPLIQYQDIAACTGNLKIANPD 178 Query: 200 TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDI 259 + IQ EY+ II KR Q + + G ++AFR + L +G +S + ED D Sbjct: 179 RTIPNIQSIEYTKIIQTFKRAQSHVNAIMILPGAVSAFRATELRSIGGFSASTLAEDADA 238 Query: 260 SWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRM 319 + L L + + + A P T++ L KQR+RW G + K+ + + Sbjct: 239 TMSLLLRRNRLLFTSSASAITQGPNTVQELLKQRIRWRVGQLQCLWKHR----KLCGQSL 294 Query: 320 WPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIV 379 FF + + F++ + + + IA + + + Sbjct: 295 STAFFFVDTASTNLISAAAPFVLAVTGVYVIQKGLWQQPIATVAGFVAIDIAATVFAYTA 354 Query: 380 SLMIENRYEHNLTSSLFWIIWFPVIFW 406 + L LF+ ++ P I W Sbjct: 355 AQQKTPSIPDYLRHLLFFSLFNPFITW 381 >UniRef50_C4I9P5 Inner membrane glycosyltransferase n=39 Tax=Bacteria RepID=C4I9P5_BURPS Length = 520 Score = 248 bits (633), Expect = 4e-64, Method: Composition-based stats. Identities = 91/416 (21%), Positives = 171/416 (41%), Gaps = 9/416 (2%) Query: 29 LMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLK--DNPSISIIIPCFNE 86 +++ + + +++ +V +V+ + AP + D P I++ + NE Sbjct: 97 IVISLCAAYLWVFALLTLVYASRHYVFGLDRLFKPQRAPYRAITHADWPEITVFVAAHNE 156 Query: 87 EKNVEETIHAALAQRYEN--IEVIAVNDGSTDKTRAILDRMAAQIPH-LRVIHLAQNQ-G 142 E V + + A LA Y + ++ VND STD TRA++D + A+ P ++ H + G Sbjct: 157 EAVVADCLTALLATTYPRERLTIVPVNDRSTDNTRALIDEVQARAPELIKPFHRESGKPG 216 Query: 143 KAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRS-TL 201 KA ALK + + +V D D L +V P +P VGAV G + L Sbjct: 217 KAAALKDALREIRGDIMVVFDADYLPRPGLLKELVAPFF-DPEVGAVMGRVVPQNADRNL 275 Query: 202 VGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISW 261 + ++ E + + ++ + G V G + R+SAL VG W DD + ED D+++ Sbjct: 276 LARLLDLERAGGYQVNQQARNNLGLVPQYGGTVGGVRKSALDAVGGWRDDTLAEDTDMTY 335 Query: 262 KLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWP 321 +L L+ W Y A C+ +PE +Q RWA+G + + + L R Sbjct: 336 RLLLSNWRTVYLNHAECYEEVPERWPVRARQLTRWAKGHNQTLFRYLIPLLRSPVTPRRC 395 Query: 322 LFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSL 381 L ++ L+ + I+ + + L F V Sbjct: 396 RLDGALLLGVFVMPALLALAWGIALALYLTNGIDSLVLGLLVSVFALFAFSTFGNFGVFF 455 Query: 382 MIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMP-KKQRARWVSPDR 436 I + ++ ++ V+ + +++A + + + L ++ RW +R Sbjct: 456 EIVVAARLDGRATRLRLVPVNVVGFCVTIAAVVAALWGLALDALLRRELRWDKTER 511 >UniRef50_A0LZM3 Transmembrane family-2 glycosyl transferase-possibly involved in biofilm formation n=2 Tax=Flavobacteriaceae RepID=A0LZM3_GRAFK Length = 473 Score = 248 bits (633), Expect = 4e-64, Method: Composition-based stats. Identities = 85/408 (20%), Positives = 154/408 (37%), Gaps = 61/408 (14%) Query: 82 PCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRA--------------------- 120 P FNEE NV E + + L Y + E++ +NDGS D T Sbjct: 65 PAFNEEANVVENVRSLLTLNYPSYEIVIINDGSKDGTLEALLTTFDMQKQEFLYHPFIPS 124 Query: 121 -----ILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAY 175 + +L VI +N GKA AL TG +E + CID D +L+ DA Sbjct: 125 KEVKGVYKSKNEAYKNLTVID-KENGGKADALNTGINVCDNEIICCIDVDCILENDALLK 183 Query: 176 IVEPMLYNPRVGAVTGNP---------------RIRTRSTLVGKIQVGEYSSIIGLIKRT 220 +V+P L N + +G I+ T + + Q+ EY + + Sbjct: 184 LVKPFLNNTKKVIASGGIIRVANSCVIEDGRILDIKLPDTFLARAQIIEYFRAFLMGRMA 243 Query: 221 QRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLN------QWTIFYEP 274 + +SG F + + +VG ++ D + ED+++ +++ +T+ + P Sbjct: 244 WSRLDGLLLISGAFGMFDKDTVIKVGGYNPDTVGEDMELLVRMRKKMRENNIPYTVGFVP 303 Query: 275 RALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAF 334 LCW +P+ K L +QR RW +G AE + L+ + + L Y W Sbjct: 304 NPLCWTEVPQNWKVLHRQRNRWTRGTAETLWLHKKMLFNPKYGVLGILSTPYWFFFEWLA 363 Query: 335 TCL----VGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHN 390 + ++ + + +G+ + L ++ IL L +Y+ Sbjct: 364 PLIEFLGFLYLTFLIIFSGLNWWVYLIFFMVVYSFAILFSITSL---FFEEFSFQQYKKP 420 Query: 391 --LTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 + L I+ P+I+ + + + + W + R Sbjct: 421 KYIIRLLGTILLEPLIYH----PFIMWASVKGNWDLLNGKKSWGAMSR 464 >UniRef50_C6X1A6 Glycosyl transferase, family 2 n=3 Tax=Flavobacteriaceae RepID=C6X1A6_FLAB3 Length = 824 Score = 247 bits (631), Expect = 6e-64, Method: Composition-based stats. Identities = 95/451 (21%), Positives = 173/451 (38%), Gaps = 55/451 (12%) Query: 28 ELMMRFVFFWPFFMSIMWIVGGVYFW---VYRERHWPWGENAPAPQLKDNPSISIIIPCF 84 E+++ F + + +++ G+Y + +R + + + + P S+I P + Sbjct: 10 EIVIWFFLLYACGVVLVYSWMGIYALGAVLRYKRENTFTDYSIIASNPNAPVFSLIAPAY 69 Query: 85 NEEKNVEETIHAALAQRYENIEVIAVNDGSTDK-----------------------TRAI 121 NE + E + + L+ Y N+E+I VNDGS D T + Sbjct: 70 NEGMTIVENVRSLLSLYYHNLEIIIVNDGSKDDSIQKLIEAYELESVNFFIQGNIPTAEV 129 Query: 122 LDRMAAQIPHLR--VIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEP 179 ++ P R + N GKA AL G + +YLVCID D +L++DA + +P Sbjct: 130 RGIYKSKNPAFRKLTVVDKANGGKADALNVGVNISGGDYLVCIDVDCILEQDAILKLAKP 189 Query: 180 MLY--NPRVGAVTGNPRIRTR--------------STLVGKIQVGEYSSIIGLIKRTQRI 223 L + ++ A G R+ TL+G+ QV EY L + Sbjct: 190 FLEQTDKKLIACGGVIRLANNCRIENGKVVEVNLPRTLLGRTQVLEYIRAFVLGRMAWSR 249 Query: 224 YGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQL------NQWTIFYEPRAL 277 + +SG F R + G + + + ED+++ +++ + + P L Sbjct: 250 ASGLILISGAFGVFDRKIVLACGGYDKNTVGEDMELVVRMRQYMEERNEPYEVINIPDPL 309 Query: 278 CWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLT--TIWAFT 335 CW PE+ + L KQR RW +G E K+ ++ ++ + Y T + Sbjct: 310 CWTEAPESKEILVKQRNRWMRGTMETLWKHRKMMFNPRYGKLGTVSLPYWFTFEFLGPLI 369 Query: 336 CLVGFIIYAVQLAGVPLNIELTHIAATHT--AGILLCTLCLLQFIVSLMIENRYEHNLTS 393 G+ I+ + L +N I AG L +L +VS + + S Sbjct: 370 EFAGYFIFVIFLLLGIINWSFFWILFALVISAGFLYSIYAILVDLVSRQVY-TKRKDFMS 428 Query: 394 SLFWIIWFPVIFWMLSLATTLVSFTRVMLMP 424 + + P F + + + F Sbjct: 429 LITTALLEPFYFHPIVVRAGVKGFVDYFKKS 459 >UniRef50_A4CLA3 Glycosyl transferase, group 2 family protein n=2 Tax=Flavobacteriales RepID=A4CLA3_9FLAO Length = 467 Score = 247 bits (630), Expect = 8e-64, Method: Composition-based stats. Identities = 106/469 (22%), Positives = 175/469 (37%), Gaps = 60/469 (12%) Query: 21 VAYFHSGELMMRFVFFWPF-FMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISI 79 + Y + L+ V F F M + +++ G+ + PSI+I Sbjct: 2 LDYINIVFLLFTVVLFTLFTIMGYLSTRNSIHY----RHKNSIGDISRTMASPLAPSITI 57 Query: 80 IIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRA------------------- 120 I P +NE + E I + L+ RY N EV+ VNDGS D T Sbjct: 58 IAPAYNEGLTIVENIRSLLSLRYVNYEVMVVNDGSKDDTLEKMVAAYDLERVPMDIDPEW 117 Query: 121 -------ILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAA 173 + L VI +N GK+ AL TG ++S+Y+ CID D LL DA Sbjct: 118 PCKPIRGVYKSRQRSFEKLTVID-KENGGKSDALNTGMRLSESKYIGCIDVDCLLLPDAL 176 Query: 174 AYIVEPMLYN--PRVGAVTGNPRIRT--------------RSTLVGKIQVGEYSSIIGLI 217 ++V+ + RV AV G R+ + + Q+ EY+ L Sbjct: 177 LHVVKAFYEHTSKRVIAVGGVIRVANSCKIDGGTLEEIRLPKNWLARFQLLEYTRSFLLG 236 Query: 218 KRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLN------QWTIF 271 + ++ +SG F R E G + + ED++I ++++ +T+ Sbjct: 237 RMAWGRIDSLLIISGAFGFFDRQIALEAGGYDTGTVGEDMEIVFRMRRLMHDNDTPYTVA 296 Query: 272 YEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTI 331 Y P LCW +PE L QR RW++G E ++ + + R+ L + Y L Sbjct: 297 YIPDPLCWTEVPEDLGIFINQRDRWSRGNLETLYRHRDMFFNPKYGRLGMLSYPYWLFYE 356 Query: 332 WAFTCL--VGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEH 389 W L GF LN E +A T T + + N+Y Sbjct: 357 WLAPILEFTGFFSILAFAYLGILNWEFF-VAITATVYAFAAMFSFYAVLWDVWSYNQYTR 415 Query: 390 NLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQ-RARWVSPDRG 437 + +++ ++ ++ + S R + + W S R Sbjct: 416 T--RDILTLLFCALVEPVVFHPVVVWSSIRGNFKKLFKINSGWGSQVRK 462 >UniRef50_C7IGQ0 Glycosyl transferase family 2 n=2 Tax=Firmicutes RepID=C7IGQ0_9CLOT Length = 433 Score = 245 bits (627), Expect = 2e-63, Method: Composition-based stats. Identities = 92/418 (22%), Positives = 168/418 (40%), Gaps = 25/418 (5%) Query: 39 FFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAAL 98 ++I+ + G + + + ++ + P+++I+IP NEEK + T+ L Sbjct: 20 VLINIILVFGAIGYVAHERKNKVNLDKYN----DRLPTVTIMIPAHNEEKVIASTLERIL 75 Query: 99 AQRYEN--IEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQN-----QGKAIALKTGA 151 Y I++I +ND S+D T + ++ P R+ + N +GKA AL Sbjct: 76 ELNYPQHLIQIIVINDNSSDATGVQIKKVQDNSPERRINVITTNKENGGRGKAHALNLAL 135 Query: 152 AAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRST-LVGKIQVGEY 210 AK E++ D DA +R+ +++ + R AV G + R R+ + K E Sbjct: 136 KQAKGEWICIFDADAAPERNTLKFLIIKSFEDKRFAAVFGRNKARNRNRNFLTKCINLEL 195 Query: 211 SSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTI 270 + ++ + + + G R+ L +VG W ITED D+ + + + I Sbjct: 196 IASQRIVNPGKWYLFKLGQIPGTNFIINRTILQKVGGWDSRAITEDTDLGFTIMKRNYLI 255 Query: 271 FYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTT 330 YE RA + PE L KQR RWA+G V KN L+ +R+ Y +T Sbjct: 256 AYESRAEAYQQEPEKLSVYVKQRTRWAKGNLYVVFKNFPDLFSGLAWRIKFDILFYWMTY 315 Query: 331 IW------------AFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFI 378 IW F + G I + + L + + +L+ + +LQ Sbjct: 316 IWFLLFVLISDGILLFNIVSGVIDFIYPGLNLLGQTSLAVNFNFNLSFLLMYVIFVLQIN 375 Query: 379 VSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 V L + L + + + + +++ + K+ A+W +R Sbjct: 376 VGLASDKGQSTAENV-LISCVSYFTYAQVFIFISLKATYSYISDRIFKREAKWYKTER 432 >UniRef50_B1V7Q3 Chitin synthase n=1 Tax=Clostridium perfringens D str. JGS1721 RepID=B1V7Q3_CLOPE Length = 466 Score = 244 bits (624), Expect = 4e-63, Method: Composition-based stats. Identities = 105/465 (22%), Positives = 183/465 (39%), Gaps = 59/465 (12%) Query: 21 VAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISII 80 V + + + F++ FF+S ++ + +Y + ++R + K+ ISI+ Sbjct: 6 VDFINLFFMYYIFIYAVIFFISTVYSIIDLYEFNLKKR---FNSTINIYDKKNYTPISIL 62 Query: 81 IPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIH---- 136 +P +NEEK + + I + LA Y E+I VNDGS DKT +L + R I Sbjct: 63 VPAYNEEKTILKCIDSLLALEYPEYEIIIVNDGSKDKTLEVLINRFSLKKIQRPIRKSIN 122 Query: 137 -------------------LAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIV 177 +N GKA AL G A+ + +D D++L RD+ + IV Sbjct: 123 CKKIIDIYEGLEKVNITLINKENGGKADALNMGINASTYPLFLTLDADSVLQRDSLSNIV 182 Query: 178 EPMLYNPRVGAVTGNPRIRT--------------RSTLVGKIQVGEYSSIIGLIKRTQRI 223 P + N AV GN +I + Q+ EY + + Sbjct: 183 VPFIENENTIAVGGNVKISNSVLLENGVVKKNRLPKKWLIIFQIIEYYRVFWTTRVWFNK 242 Query: 224 YGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQ------LNQWTIFYEPRAL 277 + +SG F++S++ VG ++ + I ED+++ KL N+++I Y P A+ Sbjct: 243 FNGNLIISGAFGVFKKSSVINVGGYNSNSIGEDMELVMKLHSFNKKNENKYSIQYTPNAI 302 Query: 278 CWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTT------I 331 CW PE LK L QR RW G + ++ + + F Y I Sbjct: 303 CWSQAPEKLKDLKSQRKRWHMGLMQSLAEHKYIFLNIKYGVVGVFSFLYYFIYEMLSCLI 362 Query: 332 WAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNL 391 F + FI Y + I I +++ I + ++ ++ I R + + Sbjct: 363 EVFGVVFLFISYFTGFINLKFFITFMCIYIFYSSLISISSIFSEEYF--FNINLRIKDKI 420 Query: 392 TSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 LF + M S+ + F + ++ W +R Sbjct: 421 KLILFSFLEAFGYRQMCSIFRIVAIF-----KYRTKKNHWEKIER 460 >UniRef50_A9KK06 Glycosyl transferase family 2 n=12 Tax=Bacteria RepID=A9KK06_CLOPH Length = 648 Score = 243 bits (621), Expect = 8e-63, Method: Composition-based stats. Identities = 101/439 (23%), Positives = 185/439 (42%), Gaps = 53/439 (12%) Query: 1 MINRIVSFFILCLVLCI-PLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERH 59 MI+ ++ FFI ++ + V YF + + + F+ ++ G +Y R Sbjct: 182 MISSLLLFFITAFIILMRAESVIYFKFNKWLYLYSIIAACFLLSRYLFGFMY------RK 235 Query: 60 WPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYE--NIEVIAVNDGSTDK 117 P N P +SI+IPCFNEE+ ++ TI + + Q Y +EVI ++D S DK Sbjct: 236 VPIDINF-------TPGVSILIPCFNEEEWIQRTILSCINQDYPVDKLEVIVIDDCSHDK 288 Query: 118 TRAILDRMAAQI----------PHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDAL 167 + ++ + ++ L+ I +N GK AL GA AK E +V +D D+ Sbjct: 289 SVEKIEEIINKLYQEADQFDIKNRLKYIVQKENAGKREALALGALEAKHELVVFVDSDSF 348 Query: 168 LDRDAAAYIVEPMLYNPRVGAVTGNPRIRTR-STLVGKIQVGEYSSIIGLIKRTQRIYGN 226 LD A +V+P +P++G V+G + + + K+Q Y ++K + + Sbjct: 349 LDPFAIRNLVQP-FKDPKMGGVSGRTDVANTFTNTLTKMQSVRYYIAFRIMKAAESYFDA 407 Query: 227 VFTVSGVIAAFRRSALAE----------VGYWSDDMITEDIDISWKLQLNQWTIFYEPRA 276 V +SG +A +++ + E +G +D ++ L + Y+ A Sbjct: 408 VTCLSGPLACYKKEIILENREAWLNQSFLG--QRATFGDDRAMT-NFVLKKHRTAYQDSA 464 Query: 277 LCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTC 336 +C ++P + KQ++RW + L +W+KE F + + Sbjct: 465 ICSTIVPNQYRVFLKQQMRWKRSWLRESLIAGKFMWKKEPFAAITFYMGLIVP------- 517 Query: 337 LVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLF 396 + I+ L VPL + GILL +L + V+ ++ + L LF Sbjct: 518 IAAPIVVIYNLIYVPL--QHRVFPTAFLVGILLMSLLM---SVAQLLFRKSSTWLFGMLF 572 Query: 397 WIIWFPVIFWMLSLATTLV 415 + V+ W + +A Sbjct: 573 CFYYEAVLLWQMPIAWVTF 591 >UniRef50_A6DGB3 Glycosyl transferase, family 2 n=3 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DGB3_9BACT Length = 396 Score = 243 bits (620), Expect = 1e-62, Method: Composition-based stats. Identities = 100/379 (26%), Positives = 158/379 (41%), Gaps = 21/379 (5%) Query: 70 QLKDNPSISIIIPCFNEEKNVEETIHAALAQRYE--NIEVIAVNDGSTDKTRAILDRMAA 127 L+ NP +S+++P NEE +E + Y+ +EVI +ND S+D T+ ++D Sbjct: 20 DLEYNPKVSVLVPAHNEEAVIEGCLDCMNKLEYKTGQLEVIILNDRSSDGTKELIDNFLC 79 Query: 128 QIP--HLRVIHLA--QNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYN 183 + P H+R H GKA A+K A KSE +V D D L D ++ P + Sbjct: 80 KNPQSHIRAHHRPMSSEPGKAAAMKEIIATLKSEIIVIFDADYLPQADLIKRLISP-FKD 138 Query: 184 PRVGAVTGNPRIRTRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSAL 242 P+VGA G + ++ K+ E S + + + + + G R SAL Sbjct: 139 PQVGATMGRVVTYNANANIMTKLIDLERRSGYAIDQNVRNYFDLLPQFGGTTGGIRLSAL 198 Query: 243 AEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAE 302 +VG W +TED D+++KL LN + I Y A C+ PET + +KQ RWA G + Sbjct: 199 EDVGGWDTRTLTEDTDLTYKLYLNGYKIKYLNAAACYEETPETWQARYKQVRRWAYGHND 258 Query: 303 VFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAAT 362 +K++ ++ + L TI+A + + L G NI + A+ Sbjct: 259 CMIKHLIPTLMHKDKNLLRKLDALLLLTIYAAPAALLVLSIVAFLFG---NISVNMSASL 315 Query: 363 HTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFW-----MLSLATTLVSF 417 T +L C + + I + P IF ML+ L+ Sbjct: 316 ITLFLLFCGFGNFSPFFQMFAACIKDRQPHC----IRYIPYIFVSSTISMLASTHALLLL 371 Query: 418 TRVMLMPKKQRARWVSPDR 436 L KK W R Sbjct: 372 PIEKLGLKK-SLSWDKTLR 389 >UniRef50_A8VSD1 Binding-protein-dependent transport systems inner membrane component n=2 Tax=Bacillus selenitireducens MLS10 RepID=A8VSD1_9BACI Length = 476 Score = 243 bits (620), Expect = 1e-62, Method: Composition-based stats. Identities = 94/465 (20%), Positives = 175/465 (37%), Gaps = 54/465 (11%) Query: 23 YFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQL---KDNPSISI 79 + E++ + + F+S+++IV R + +D IS+ Sbjct: 6 LVSAVEVLAWMIVIYMLFVSLLYIVLFSVAMPKIRREQKLNRIEQIEGMTVNQDTYPISV 65 Query: 80 IIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDK---------------------- 117 ++P +NEE + ++++ L Y EVI +NDGS D+ Sbjct: 66 LVPAYNEEVGIVASVNSLLTLNYPEYEVIVINDGSDDETSLRIKDHFNMKPVKLAMRKHF 125 Query: 118 -TRAILDRMA-AQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAY 175 T + ++ +L +I +N GK AL G + Y ID D++L++DA Sbjct: 126 QTAEVTAAYQSSEYGNLYLIE-KENGGKGDALNAGINFSSYPYFASIDADSILEKDALLK 184 Query: 176 IVEPML-YNPRVGAVTGNPRIRTRSTL--------------VGKIQVGEYSSIIGLIKRT 220 ++P++ N V A G RI + + +Q+ EY + + Sbjct: 185 TMKPIIDSNGEVTATGGTVRIANGCQIEKSVVQTITLPKKPLEVMQIIEYFRGFLIGRLG 244 Query: 221 QRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQL------NQWTIFYEP 274 + VSG F +S + VG + + ED+++ ++ + I Y Sbjct: 245 FSRTNMLMIVSGAFGVFEKSRVIRVGGYDSSTVGEDMELVVRIHRSIKDEKSPQRIEYIQ 304 Query: 275 RALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAF 334 +CW P T L QR+RW +G E K+ ++ + R+ Y T + Sbjct: 305 DPVCWTEAPSTFSSLKSQRIRWQRGLTETIWKHKGMMFNPKYKRVGLFSLPYY-TFVELL 363 Query: 335 TCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSS 394 + +V F Y + + G+ IA A +L L V L +++ S Sbjct: 364 SAVVEFFGYLIIILGLVFAFIDPAIAGMMFAVTILYGSLLSSLSVLLEEWTYHKYQDVKS 423 Query: 395 LFWIIWFPV--IFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRG 437 L + + + +FW L + +K+ W + +R Sbjct: 424 LPILFLYALTEVFWYRPLTVSFRLVGLFSAFGRKRE--WGNMERK 466 >UniRef50_D1PAP3 Putative biofilm PGA synthesis N-glycosyltransferase PgaC n=1 Tax=Prevotella copri DSM 18205 RepID=D1PAP3_9BACT Length = 438 Score = 242 bits (618), Expect = 2e-62, Method: Composition-based stats. Identities = 99/418 (23%), Positives = 177/418 (42%), Gaps = 15/418 (3%) Query: 21 VAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQL---KDNPSI 77 ++ ++ + F + M +I+ + + + + ++NP I Sbjct: 15 ISTLSFSSIVNTYWFLFFIEMPRYYILEYLVIGNRLMKRNQLDKEKEYAKYFLYRENPLI 74 Query: 78 SIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHL 137 SI++P NE K++ + +++ Q Y N E+I V+DGS D T+ I + + + L Sbjct: 75 SILVPGKNEGKHIFKMVNSLAEQTYRNYEIIVVDDGSDDDTKLICNDLYRAGYITHYLRL 134 Query: 138 AQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRT 197 GKA A GA A+ +Y+VC+D D+ LDRDA I+ P + V V G ++R Sbjct: 135 DTRGGKAAASNYGAQMARGKYIVCLDADSSLDRDALEKILLPFYIDGMVKGVGGCVKVRN 194 Query: 198 -RSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITED 256 + T+ +Q EY I + + G +SG AF R L EVGYW D Sbjct: 195 YKETICSSLQAFEYLKRIQVGRIVTSELGIYHIISGAFGAFERKTLKEVGYWDIG-PGLD 253 Query: 257 IDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAE-VFLKNMTRLWRKE 315 D++ K++ + + + A+C +P L+ QR+RW++ K+ L + Sbjct: 254 GDLTQKIRKAGYKVKFAEDAICMTNVPTKWYKLYHQRIRWSRSLVRFRLRKHADILLPTK 313 Query: 316 NFRMWPLFFEYCLTTIWAFTCLVGFI-IYAVQLAGVPLNIELTHIAATHTAGILL-CTLC 373 N W + F C + F+ ++ + + N THI G + Sbjct: 314 N---WSILNWLSNMESVVFDCFLNFLWLWYIIKLAITFN---THIIEVLALGYFIRVCFS 367 Query: 374 LLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARW 431 L FI+ LM+ R + + + P + L +A L + + + + + W Sbjct: 368 QLAFILVLMVSERKKDVWFLYRYLPLMSPYTGYFLRIAR-LSAHLQELFFRRSYKDAW 424 >UniRef50_A1S0Z5 Glycosyl transferase, family 2 n=1 Tax=Thermofilum pendens Hrk 5 RepID=A1S0Z5_THEPD Length = 420 Score = 242 bits (617), Expect = 2e-62, Method: Composition-based stats. Identities = 86/358 (24%), Positives = 151/358 (42%), Gaps = 19/358 (5%) Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRYE--NIEVIAVNDGSTDKTRAILDRMAAQIPH- 131 P +++I+P +E + VE ++A L+ Y +EVI V+ S I+ R + P Sbjct: 67 PRVTVIVPSKDEGRRVERCLNAILSSDYPLEKLEVIVVDASSDGYVEEIVRRAGERYPGA 126 Query: 132 LRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTG 191 +R+I + +GK AL A E + D D++ +RDA V+ L P V AV G Sbjct: 127 VRLIREEEPRGKPAALNRALREATGEVVAVFDADSVPERDAIRRAVK-HLEEPGVAAVQG 185 Query: 192 NPRIRT-RSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSD 250 + R +++ ++ E + + R + G ++G +RSAL EVG W + Sbjct: 186 KTLVLNERESVLARVASKEEKAWFHALIRGRERLGLFVPLTGSCQFVKRSALEEVGGWRE 245 Query: 251 DMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTR 310 D + ED+++S L + + Y + W P +L+ L QR RW +G E F +++ Sbjct: 246 DALAEDLELSMDLLARGYRVKYANDVVSWQEAPTSLRSLAVQRNRWYRGYMEAFARHLRL 305 Query: 311 LWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLC 370 + + + + + + A LA L + H + L Sbjct: 306 ALAGRRG------LDAAILSAGPYLMALSLLAVAAWLASTALP-HVNHFSTPAALVAALN 358 Query: 371 TLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVI--FWMLSLATTLVSFTRVMLMPKK 426 + L V+L + R S + W PVI +W A L + ++L + Sbjct: 359 AVSLFSVSVALALSER-----PVSAKNLAWVPVIYAYWFTLSAVALHALAEIILRRPR 411 >UniRef50_Q9KAQ6 BH2231 protein n=4 Tax=Bacillus RepID=Q9KAQ6_BACHD Length = 482 Score = 242 bits (617), Expect = 3e-62, Method: Composition-based stats. Identities = 93/436 (21%), Positives = 161/436 (36%), Gaps = 62/436 (14%) Query: 27 GELMMRFVFFWPFFMSIMWIV--GGVYFWVYRERHWPWGENAPAP----QLKDNPSISII 80 L F +F +M I+ +V G + V + + +L +SI+ Sbjct: 10 SYLPYLFAWFLLGYMGIVVLVYTGMLLLSVVQLKREQKLNKQEVYDDYLELTYTKPVSIL 69 Query: 81 IPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLA-- 138 +P +NEE + ET+ + L+ +Y E++ VNDGSTD+T ++ + +VI Sbjct: 70 VPAYNEETGIIETVRSLLSLKYPQTEIVVVNDGSTDQTLEVIIEHFQMVKVGKVIRKQIE 129 Query: 139 ---------------------QNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIV 177 N GKA AL G +K Y IDGD++L+ DA ++ Sbjct: 130 TEPIKGVYQSTIFPHLLLVDKSNGGKADALNAGLNVSKYPYFCSIDGDSILETDALLKVM 189 Query: 178 EPML----YNPRVGAVTGNPRIRTRSTL--------------VGKIQVGEYSSIIGLIKR 219 +P++ V A GN RI S + + +QV EY + + Sbjct: 190 KPIVTSRDDEDEVIASGGNVRIANGSDIQMGSVLSVQLAKNPLVVMQVIEYLRAFLMGRI 249 Query: 220 TQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQ------LNQWTIFYE 273 + V +SG + F + + E G +S + ED+++ +L + I + Sbjct: 250 GLSRHNMVLIISGAFSVFAKKWVMEAGGYSKKTVGEDMELVVRLHRLVKEKRLKKRITFV 309 Query: 274 PRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLT--TI 331 P +CW P T + L +QR RW +G E + + + + Y Sbjct: 310 PDPVCWTEAPATFRVLQRQRSRWHRGLMESLWLHRGMTFNPKYGLVGTASIPYFWIVEFF 369 Query: 332 WAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRY---- 387 L+G++ L +E + I+ RY Sbjct: 370 GPVVELMGYLYIVFAFFFGGLYVEFALALFLLFVLY-GTVFSMTAVILEGWSLKRYPKVS 428 Query: 388 --EHNLTSSLFWIIWF 401 + SLF +W+ Sbjct: 429 DMSRLMIFSLFEALWY 444 >UniRef50_D2L608 Glycosyl transferase family 2 n=1 Tax=Desulfovibrio sp. FW1012B RepID=D2L608_9DELT Length = 555 Score = 242 bits (617), Expect = 3e-62, Method: Composition-based stats. Identities = 103/416 (24%), Positives = 182/416 (43%), Gaps = 28/416 (6%) Query: 15 LCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDN 74 + + L +F E + F F P F+ + F++ + Sbjct: 138 ILLALVGVWFIKKESIFYFWFDVPVFLYGL----CALFYLLSRFFFAALYRPYPVDPGYK 193 Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRYEN--IEVIAVNDGSTDKTRAILDRMAAQI--- 129 P++S++IPCFNEE +E+TI L Q Y +EVI V+DGS+D + A+L+R+ ++ Sbjct: 194 PAVSVVIPCFNEEVWIEKTIRGCLNQNYPEELLEVIVVDDGSSDGSVAVLERVQRKVFEE 253 Query: 130 --PHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVG 187 L V +N+GK A+ GA AK E +V +D D+ L D+ IV+P+ NP++G Sbjct: 254 VGERLFVHVFPENRGKRHAMAEGARRAKGEVVVFVDSDSFLQPDSILQIVQPL-KNPKIG 312 Query: 188 AVTGNPRIRTR-STLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAE-V 245 A TG + + + + ++Q Y + K + I+ V +SG +AA+R+ L + Sbjct: 313 AATGRCEVENKWTNTLTRMQAVRYFIGFRIFKAAESIFDAVTCLSGPLAAYRKDVLLRYL 372 Query: 246 GYW-------SDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQ 298 G W +D ++ + + + + Y+ A+ ++P T + + Q++RW + Sbjct: 373 GAWVNQRYLGQPATFGDDRSLTNYVLGSHYAV-YQHTAITHTIVPSTYRQFYIQQMRWKR 431 Query: 299 GGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTH 358 L+ +W KE F + L + L F+ P + Sbjct: 432 SWLRESLRAALFMWYKEPFMAISFYLGVILPLLAPVVVLRAFVFVPAAYGVWP----TMY 487 Query: 359 IAATHTAGILLCTLCLLQFIVSLMIENR--YEHNLTSSLFWIIWFPVIFWMLSLAT 412 +A +L+CT LL +L + L L+ I+W + FW T Sbjct: 488 VAGVFFMSMLMCTSYLLIKRSNLWLYGVPFCFFYLFVLLWQIVWAVLTFWKSEWGT 543 >UniRef50_B5W069 Glycosyl transferase family 2 n=3 Tax=Arthrospira RepID=B5W069_SPIMA Length = 561 Score = 241 bits (616), Expect = 3e-62, Method: Composition-based stats. Identities = 79/440 (17%), Positives = 168/440 (38%), Gaps = 49/440 (11%) Query: 30 MMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKN 89 ++ + ++ ++ + ++ + R + P + ++ P +S+++ NEE Sbjct: 127 LISWGYWLILGLTGLLVIQAIRIVWAGHRGL---DINPEHRPEELPFVSLLVAAKNEEAV 183 Query: 90 VEETIHAALAQRYEN--IEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQN--QGKAI 145 + + A Y + E+ ++D S+D+T +L +A + L ++H +N GK+ Sbjct: 184 IGNLVKNLCALNYPSHCYELWVIDDNSSDRTPIVLQELAKEYQQLHILHRDENATGGKSG 243 Query: 146 ALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRS-TLVGK 204 AL + + L D DA +D D V P +VGAV I + + + Sbjct: 244 ALNQALPLTRGKILGVFDADATVDSD-LLQQVIPKFQAEQVGAVQLQKAIANSNFNFLTR 302 Query: 205 IQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQ 264 Q E + + ++ + G + + G RR+AL G W + IT+D+D++ +L Sbjct: 303 CQASEMA-LDAFFQKQRVAVGGIGELRGNGEFIRRAALESCGGWCEQTITDDLDLTIRLH 361 Query: 265 LNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFF 324 L+ W I + ++ + LW QR RWA+GG + +L + Sbjct: 362 LDHWDIEFLDTSVVFEEGVTNWVALWHQRNRWAEGGYQRYLDYGKFIIANRMGVGKTF-- 419 Query: 325 EYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMI- 383 ++ F + +I+ + + +++ L T +L +L L+ + L Sbjct: 420 -----DLFGF-LITQYILPMAAIPDLFMSLILRRPPITSPLTLLAVSLSLIGMFIGLRRT 473 Query: 384 ----------------------ENRYEHNLTSSLFWIIWFPVIF-------WMLSLATTL 414 +L + + +F +M + Sbjct: 474 VSRGNMPSQKTNSPDNLTTDGDRQAATESLGKYYPPLSYMTTLFQTLRGTVYMFHWF-IV 532 Query: 415 VSFTRVMLMPKKQRARWVSP 434 V+ + + +R +WV Sbjct: 533 VASATARISVRPKRLKWVKT 552 >UniRef50_D2LZZ6 Glycosyl transferase family 2 n=1 Tax=Bacillus cellulosilyticus DSM 2522 RepID=D2LZZ6_BACS4 Length = 457 Score = 240 bits (614), Expect = 6e-62, Method: Composition-based stats. Identities = 106/452 (23%), Positives = 185/452 (40%), Gaps = 66/452 (14%) Query: 35 FFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETI 94 F + ++ V F+++ G N K+N +SI+IP +NEEK + + Sbjct: 2 FLQISLIFVLIFVIFQTFYMFIPLLTIKGLNKYKKVDKEN-GVSILIPAYNEEKVILNCL 60 Query: 95 HAALAQRYENIEVIAVNDGSTDKTRAILDRM-----------AAQIPHLRVIHL------ 137 + Y N+E I +NDGSTD+T IL + A +IPH +I + Sbjct: 61 QGIVNLNYNNVEAIFINDGSTDRTLDILTQHLQLVPVYNRLPAVKIPHEAIIDIYASTLY 120 Query: 138 -------AQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVT 190 +N GKA AL G A+ EY+V +D D++LD+++ +++ + + A+ Sbjct: 121 PKVLVIDKKNGGKADALNAGIEYARKEYVVTLDADSILDKESL-HVINSTFSDNHILAIG 179 Query: 191 GNPRI---------RTRSTLVG----KIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAF 237 G +I + R T + + Q+ +Y + + K TQ ++ ++G AF Sbjct: 180 GMVQIGQGFQGNYLKPRPTFINPGIIRFQILQYFTAFYMHKYTQSKLKSITVIAGAFGAF 239 Query: 238 RRSALAEVGYWSDDMITEDIDISWKLQL------NQWTIFYEPRALCWILMPETLKGLWK 291 RR L EV + I ED+DI+ K+Q + + P+ALC+ PET + L+ Sbjct: 240 RRKVLFEVDGFR-KTIGEDMDITLKIQRLIKTKYKNHRLIFVPQALCYTECPETFRALFN 298 Query: 292 QRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFF--EYCLTTIWAFTCLVGFIIYAVQLAG 349 QR+RW + + F + K R+ F L TI AFT L+ + Sbjct: 299 QRVRWQKAFVDCFFMYRKNFFHKLGVRLSLFFLGDGVLLGTINAFTALLIPLFLIFNR-- 356 Query: 350 VPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENR-----YEHNLTSSLFWIIWFPVI 404 H + L + Q I ++ + R + +I V Sbjct: 357 -------DHYMVALGLFTITFILGIFQSIATIEVSKRLGIKYTRMDYLRISLFIPLEIVT 409 Query: 405 FWMLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 + +L + + + + +W +R Sbjct: 410 YRLLGILFVTIGTFMFFV----NKDKWDKANR 437 >UniRef50_A0LZL8 Transmembrane family-2 glycosyl transferase-possibly involved in biofilm formation n=1 Tax=Gramella forsetii KT0803 RepID=A0LZL8_GRAFK Length = 483 Score = 239 bits (610), Expect = 2e-61, Method: Composition-based stats. Identities = 87/415 (20%), Positives = 152/415 (36%), Gaps = 60/415 (14%) Query: 25 HSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWG----ENAPAPQLKDNPSISII 80 + + F+F + + G+ + D PS+++I Sbjct: 12 YYIYWISVFLFIGYGASLLTLYLTGILLARRSIKRTKQRSIFLRAEDIVSATDIPSVTVI 71 Query: 81 IPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRA-------------------- 120 P +NE + E + + L+ +Y E++ VNDGS D + Sbjct: 72 APAYNEGLTIIENVKSLLSIQYPYYELLLVNDGSKDDSLQKLIDEFDLEQQDATFITQPI 131 Query: 121 -------ILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAA 173 I + HL VI N GKA A+ G A +E ++ D D ++++DA Sbjct: 132 PTASVKHIYRSRKTKYTHLTVID-KNNGGKADAINAGVNFANTELVIFTDVDCIIEQDAI 190 Query: 174 AYIVEPMLYNPRVGAVTGN----------------PRIRTRSTLVGKIQVGEYSSIIGLI 217 +V P L + +R S L+ QV EY L Sbjct: 191 LKMVRPYLEEDDKEIIGCGGGIGIANDSVIKNGILTELRLPSRLIPMTQVVEYIRAFLLG 250 Query: 218 KRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLN------QWTIF 271 + + +SG + R + EVG ++ + ED+++ +L+ + + + Sbjct: 251 RMAWSEINGLMLISGAFGMYPRKRVIEVGGFNPKTVGEDLELCIRLRKHMEDKKIPYKVV 310 Query: 272 YEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTT- 330 Y P LCW P K L QR RWA+G E + L+ ++ M LF+ Y L Sbjct: 311 YLPETLCWTEGPSDYKILVMQRDRWARGLWETMSLHKDLLFNRKYGNMGLLFYPYWLLFE 370 Query: 331 ----IWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSL 381 I F +V I + + L + + A + G + T+ + ++ S Sbjct: 371 FGAPIVEFMGVVCIIAFTI-LGWINWPFAILLFFAAYLVGCVFSTIAIFMYVKSF 424 >UniRef50_B9YCC6 Putative uncharacterized protein n=1 Tax=Holdemania filiformis DSM 12042 RepID=B9YCC6_9FIRM Length = 477 Score = 239 bits (610), Expect = 2e-61, Method: Composition-based stats. Identities = 105/454 (23%), Positives = 170/454 (37%), Gaps = 57/454 (12%) Query: 33 FVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEE 92 F F+ S + V R+ + + +S+I+P NEE V + Sbjct: 15 FFVFYLIGYSTFLFISVVVGASVLYRNRYLEKMKNELKHDYYVPVSVIVPAHNEEVTVVD 74 Query: 93 TIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLA-------------- 138 T+ + L Y+ EV+ V+DGSTDKT L + I R I L Sbjct: 75 TVRSLLQLDYKLYEVLVVDDGSTDKTSQRLIKAFNLIETQRPIRLQVPCQKQKAVYEREI 134 Query: 139 ----------QNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGA 188 +N GKA AL G A++ Y VC+D D+LL RD+ Y V P+L + V A Sbjct: 135 HGIRVTLIIKENGGKADALNMGINASRFPYFVCMDADSLLQRDSLEYAVRPLLESDEVIA 194 Query: 189 VTGNPRIRTRSTL--------------VGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVI 234 V G RI TL + +Q+ EY + Y +SG Sbjct: 195 VGGLVRISNGVTLENGKVKHYHLPFNPLVCMQILEYDRSFLASRILMDQYNGNLIISGAF 254 Query: 235 AAFRRSALAEVGYWSDDMITEDIDISWKLQL------NQWTIFYEPRALCWILMPETLKG 288 F++ + VG + + ED+++ KL + + I Y P A+CW P TL Sbjct: 255 GLFKKDLVITVGGYDRSTMGEDMELVTKLHVFCRAHDIPYAIRYAPDAICWSQAPSTLLD 314 Query: 289 LWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPL------FFEYCLTTIWAFTCLVGFII 342 L KQR RW G + K + + + +E I F L + Sbjct: 315 LTKQRRRWHIGLFQSLTKYRRLFLNFKYGFVSLISYFYYLLYELLSPYIELFGVLTIVLA 374 Query: 343 YAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFP 402 + L P + + A + + L +L I ++ ++ ++ Sbjct: 375 WVTNLINYPFVVYFYLVYACFSGVLTLVAFFSRIHTQNLTIS---FSDVVKAVGLCLFEI 431 Query: 403 VIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 + + A ++S L +K++ W + R Sbjct: 432 IFLRLYLAAVRMLSL----LSYRKRKNHWEAITR 461 >UniRef50_B2GJB7 Putative glycosyltransferase n=2 Tax=Micrococcineae RepID=B2GJB7_KOCRD Length = 481 Score = 239 bits (609), Expect = 2e-61, Method: Composition-based stats. Identities = 88/463 (19%), Positives = 171/463 (36%), Gaps = 53/463 (11%) Query: 18 PLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSI 77 + F + +++ F+ ++ +M ++ V F +R R P+ + + P I Sbjct: 11 VAAILAFIAVPVLIYFLLINTSYL-VMIVLAAVDFVAHR-RRIPFAGREELIRSRTMPGI 68 Query: 78 SIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIP------- 130 S++ P NEE + + + L+ ++ EVI V+DGSTD +L R +P Sbjct: 69 SVVAPMHNEEVGIRVAVESFLSLQHPRHEVIVVDDGSTDNGFEVLRRAFDLVPVSRQMPE 128 Query: 131 -------------------HLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRD 171 L V+ +N G++ ++ G A+ + ++ +D D++LD Sbjct: 129 DVPVREAPTSIHVPRDGRTRLTVVR-KRNSGRSDSINVGVNLAREDLVLFVDSDSILDPG 187 Query: 172 AAAYIVEPMLYNPRVGAVTGNP---------------RIRTRSTLVGKIQVGEYSSIIGL 216 A + P + +P G +R + IQV EY L Sbjct: 188 ALLAVSRPFIEDPISTVAAGGVIRAVNGCRVKGGRVLEVRMPGNPLADIQVMEYLRAFHL 247 Query: 217 IKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQ------LNQWTI 270 + + +SG FRR AL VG I ED ++ +L + + Sbjct: 248 GRAGWSRINALLLISGAFGVFRRDALVTVGGLDASSIGEDFELVMRLHRTFRKQRRPYRV 307 Query: 271 FYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFE-YCLT 329 + +CW +P TL L +QR RW +G E L R+ + Y L Sbjct: 308 TFLTEPICWTEVPSTLPVLRRQRARWHRGLWETLWAYRDMLGNPRYGRLGLVAVPYYWLF 367 Query: 330 TIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRY-- 387 + A + ++ + L + A + L + + ++Y Sbjct: 368 ELIAPLLELAGLVLIALGFALGLVTVQYFVLFMLVAYAYAMIVTLAAVTIEELSYHKYPR 427 Query: 388 EHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRAR 430 +L +L+ ++ + + L+ + + +L ++Q Sbjct: 428 WRDLGITLWAVVLENLGYRQLTAWWRMEGWVASLLGRRQQWGT 470 >UniRef50_Q74D65 Glycosyl transferase, group 2 family protein n=1 Tax=Geobacter sulfurreducens RepID=Q74D65_GEOSL Length = 472 Score = 239 bits (609), Expect = 2e-61, Method: Composition-based stats. Identities = 99/468 (21%), Positives = 174/468 (37%), Gaps = 62/468 (13%) Query: 29 LMMRFVFFWPFFMSIMWIVGGVY--FWVYRERHWPWGENAPAPQLKDN-------PSISI 79 ++ + P ++ G Y F + R NA Q DN +SI Sbjct: 5 IVAFLLAIQPVIFIYFIVLNGFYTLFTIISLRDIRNYLNAVTSQSIDNVLNGMFYRPLSI 64 Query: 80 IIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRA------------------- 120 ++P +NEEK + ++ + LA RY EVI +NDGSTD T Sbjct: 65 LVPAYNEEKTIVSSVKSLLALRYPEYEVIVINDGSTDGTLESLINEFRLVRIDKPISLHV 124 Query: 121 -----ILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAY 175 I ++ P+L V+ +N GKA AL G A++ ID D++L+ DA Sbjct: 125 PHRPIIAKYVSVDHPYLFVLD-KENGGKADALNAGINASQFPLFCSIDADSVLEDDALIR 183 Query: 176 IVEPMLYNPRVGAVTGNPRI--------------RTRSTLVGKIQVGEYSSIIGLIKRTQ 221 + + V A G R+ R ++ Q EY+ + + Sbjct: 184 ATRLFVEDREVVATGGIVRVLNGCEVEDGIVKGVRAPRGMLECFQTVEYTKGFLSGRTSW 243 Query: 222 RIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLN------QWTIFYEPR 275 + ++ +SG FR+ + V + + + ED+D+ +L + ++ + + P Sbjct: 244 NYFRSLLIISGAFGIFRKDMVMAVKGYRES-VGEDMDLVVRLHRHCRQNRIRYKVVFVPD 302 Query: 276 ALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTT--IWA 333 +CW +P + L KQR RW +G + N + F Y +T + Sbjct: 303 PVCWTQVPSDMASLLKQRNRWHRGLIDSLWHNRGMFLNPRYGTVGLFGFPYFVTVELLGP 362 Query: 334 FTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTS 393 +G+ + + +N E I A + + L + +I RY+ L Sbjct: 363 AVEFIGYFGFVLLFFLGQVNREFA-ILFFLLAVLWGTWINLGSIFLDNLIYKRYK-GLGD 420 Query: 394 SLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRGILRG 441 L ++ + F+ + + KK W P R + G Sbjct: 421 VLKLCLFGLLEFFGYRQIIVVERLIATFMFWKKG---WGKPKRKEIDG 465 >UniRef50_UPI0001C1680B Glycosyl transferase, family 2 n=2 Tax=Nostocaceae RepID=UPI0001C1680B Length = 467 Score = 238 bits (608), Expect = 3e-61, Method: Composition-based stats. Identities = 87/411 (21%), Positives = 159/411 (38%), Gaps = 24/411 (5%) Query: 30 MMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKN 89 ++ + F + ++ + V + + R RH D PS+S+++ NE+ Sbjct: 68 LVSWGFAFILGLTTILGVHALRIILVRPRHHHKQ------IQGDLPSVSVLVSAKNEQAV 121 Query: 90 VEETIHAALAQRYEN--IEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQN--QGKAI 145 ++ +H + Y + EV ++D STDKT IL ++ L V N GK+ Sbjct: 122 IDRLVHNLCSLEYPHGEYEVWLIDDHSTDKTPEILAQLQQDYKQLNVFRRDANATGGKSG 181 Query: 146 ALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRT-RSTLVGK 204 AL K E + D DA + D ++ P +VGAV I + + Sbjct: 182 ALNQVLPMTKGEIIAVFDADAQVSPDLLLQVI-PTFQREKVGAVQVRKAIANAKENFWTR 240 Query: 205 IQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQ 264 Q+ E + + ++ + G + + G RR AL + G W+++ IT+D+D++ +L Sbjct: 241 GQMAEMA-LDTWFQQQRTAIGGLGELRGNGQFVRREALNDCGGWNEETITDDLDLTIRLN 299 Query: 265 LNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFF 324 L W I + LW QR RWA+GG + +L + + + Sbjct: 300 LTGWDIECMFYPPVLEEGVTNVVALWHQRNRWAEGGYQRYLDYWDLILKGRMRAGKTVDL 359 Query: 325 EYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIE 384 L+ +II + + +++ L T+ + L Sbjct: 360 --------LIFMLIMYIIPTAAVPDLLMSLIRHRPPILAPITGLSVTMSFIGMFSGLRRT 411 Query: 385 NRYEHNLTSSLFWIIWFPVIFWMLSLATTL-VSFTRVMLMPKKQRARWVSP 434 + + N + + +ML + + RV L PK R +WV Sbjct: 412 RQDQKNSNYLMLLLQTIRGSVYMLHWLVVMSSTTARVSLRPK--RLKWVKT 460 >UniRef50_C0Z7U1 Conserved hypothetical membrane protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0Z7U1_BREBN Length = 480 Score = 238 bits (608), Expect = 3e-61, Method: Composition-based stats. Identities = 91/437 (20%), Positives = 167/437 (38%), Gaps = 63/437 (14%) Query: 30 MMRFVFFWPFF-MSIMWIVGGVYFWVYR---------ERHWPWGENAPAPQLKDNPSISI 79 M F+ ++ F + + + +YF + + + + P ISI Sbjct: 1 MRDFLLYYGVFAVYYVIAINSLYFIIMLFSFNSIMGILKSSIYSRFQTLSGSEHVPPISI 60 Query: 80 IIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAI-LDRMAA----------- 127 ++P +NEE + E + + L+ Y EV+ VNDGS D T + +D + Sbjct: 61 LVPAYNEELTIIENVRSLLSLNYPRYEVLVVNDGSKDATLQVMIDEFSLVYSEHMPVQPL 120 Query: 128 -------------QIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAA 174 P+L +I +N GKA +L G + + ID D+LL++DA Sbjct: 121 LHTSKIRGIYHNPAFPNLYLID-KENGGKADSLNAGINLSHYPLIASIDADSLLEKDALI 179 Query: 175 YIVEPMLYNP-RVGAVTGNPRIRT--------------RSTLVGKIQVGEYSSIIGLIKR 219 I + + NP A+ G+ RI + +Q EY + Sbjct: 180 RIAKVYMENPDETVAIGGDIRIANGCKIEDGVVKDIRLPDKFLPMMQSVEYLKAFLGGRI 239 Query: 220 TQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQ------LNQWTIFYE 273 + VSG FR+ + +VG + + ED++I KL + + + Sbjct: 240 GWSAINGLIIVSGAFGVFRKDYIIKVGGYREGYPGEDMNIIIKLHKYMLENKLPYRVAFC 299 Query: 274 PRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRL-WRKEN--FRMWPLFFEYCLTT 330 P A+CW P+TLK L QR RW +G + ++ + +R + F M L F T Sbjct: 300 PDAVCWTQAPDTLKILGSQRRRWGRGNLKNMIEYGRHMAFRPKYKLFGMLTLPFNILFET 359 Query: 331 IWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRY--E 388 + + + G ++ + + + + + L + ++ + RY Sbjct: 360 LNPYFRITG-LLALIGYTLLDMTNGYVLLLYGLLNVLYGMLLGIGSLLLEEIAFRRYPRF 418 Query: 389 HNLTSSLFWIIWFPVIF 405 ++ LF+ I + Sbjct: 419 QDIVKMLFYTILMFFGY 435 >UniRef50_B3DW74 Glycosyltransferase n=1 Tax=Methylacidiphilum infernorum V4 RepID=B3DW74_METI4 Length = 480 Score = 237 bits (606), Expect = 5e-61, Method: Composition-based stats. Identities = 79/368 (21%), Positives = 149/368 (40%), Gaps = 16/368 (4%) Query: 34 VFFWPFFMSIMWIVGGVYFWVYRERHW---PWGENAPAPQL-KDNPSISIIIPCFNEEKN 89 + FW ++++ + G+Y R W + AP L P ++I +P +NE+ Sbjct: 2 ISFWFLILALVLFIYGIYRMSLILRLWMGSHRDKKAPTDALFYTYPEVTIQLPIYNEKSV 61 Query: 90 VEETIHAALAQRYEN--IEVIAVNDGSTDKTRAILDRMAAQIPH----LRVIHLAQNQG- 142 VE +HA Y +E+ ++D STD+T AI+ + + + + +G Sbjct: 62 VERLLHAVCKIDYPKNKMEIQIIDD-STDETTAIISKWVCEYQKKGFDIYQLRRGTREGF 120 Query: 143 KAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNP-RIRTRSTL 201 KA L+ G +K E++ D D L + P + VG V + +++L Sbjct: 121 KAGGLQYGLERSKGEFIAIFDADFLPPPSFLKETL-PYFRSRDVGMVQARWGYLNRQASL 179 Query: 202 VGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISW 261 + + Q L + + Y F +G +R+ + + G W D +TED+D+S+ Sbjct: 180 LTRCQALFLDGHFLLEQPVRYKYNLFFNFNGTAGIWRKKCIIDAGGWEGDTLTEDLDLSY 239 Query: 262 KLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWP 321 + Q W Y P+ + +P + Q+ RWA+G + K++ L++ Sbjct: 240 RAQFKGWKFVYTPQMVVPSELPSPIVAFRTQQHRWAKGAIQTAKKHLFSLFKGSYSLGSK 299 Query: 322 L-FFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVS 380 + + L V I+ A+ PL T + + ++ L F V Sbjct: 300 IEGLFHLLAHSIHPIVAVLVILNAISFFCSPLPQSFT-LEVSGMLFSVISLFYLSYFAVV 358 Query: 381 LMIENRYE 388 + + E Sbjct: 359 IFLSKNLE 366 >UniRef50_Q2RZV9 Putative glucosyltransferase n=1 Tax=Salinibacter ruber DSM 13855 RepID=Q2RZV9_SALRD Length = 510 Score = 237 bits (605), Expect = 6e-61, Method: Composition-based stats. Identities = 72/281 (25%), Positives = 122/281 (43%), Gaps = 7/281 (2%) Query: 77 ISIIIPCFNEEKNVEETIHAALAQRYENIEV-IAVNDGSTDKTRAILDR----MAAQIPH 131 +++ +P +NE + I A + Y + I V D STD T + R A+ + Sbjct: 61 VTVQLPLYNEAEVAHRLIDACVQLDYPRSRLDIQVLDDSTDATTERVARRVAHWQAEGVN 120 Query: 132 LRVIHLAQNQG-KAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVT 190 + + G KA AL G A+ + + D D + +V P +G V Sbjct: 121 ITHVRRDDRTGYKAGALANGLQRARGDLIAIFDADFVPRPSFLRRLVPRFFDAPDLGMVQ 180 Query: 191 GNP-RIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWS 249 + +L+ K+Q + + +R + + G +G +RR+ + + G W+ Sbjct: 181 ARWGHLNRDDSLLTKVQAFGLDAHFAIEQRVRELAGCFLNFNGTAGVWRRACIEDAGGWA 240 Query: 250 DDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMT 309 D +TED+D+S++ QL W + Y P A +P + L Q+ RWA+GGAE LK Sbjct: 241 HDTLTEDLDLSYRAQLQGWRLTYVPAAEAPAELPPDMNALRAQQFRWAKGGAETALKLTG 300 Query: 310 RLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGV 350 RLWR L + LT +AF ++ + L + Sbjct: 301 RLWRSAQPWRVKLEGTFHLTAHFAFPFILLAALTHAPLLLL 341 >UniRef50_C0G271 Glycosyl transferase family 2 n=1 Tax=Natrialba magadii ATCC 43099 RepID=C0G271_NATMA Length = 466 Score = 237 bits (604), Expect = 8e-61, Method: Composition-based stats. Identities = 97/407 (23%), Positives = 160/407 (39%), Gaps = 59/407 (14%) Query: 39 FFMSIMWIVGGVYFWVYR---------ERHWPWGENAPAPQLKDNPSISIIIPCFNEEKN 89 FF++ +V G Y +V+ R W P I++I+P +NEE Sbjct: 15 FFLAYFALVNGSYLFVHLVSIVTLQRGIRDWLLDPVYHPHNSPFLPGIAVIVPAYNEEVT 74 Query: 90 VEETIHAALAQRYENIEVIAVNDGSTDKTRAIL-----------------------DRMA 126 + +T+ + L Y N E++ VNDGS D+T L + Sbjct: 75 IVDTVQSLLNLEYSNHEIVVVNDGSRDETLEQLISTFELEPVGEDVPFDLPCETVHEVYQ 134 Query: 127 AQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNP-R 185 + L VI N GKA AL G ID D++++R+A +V+P L +P R Sbjct: 135 SSRVDLVVID-KDNGGKADALNAGVFFTDQPLFCAIDADSIIERNALLEVVQPFLRDPER 193 Query: 186 VGAVTGNPRIRTRSTL--------------VGKIQVGEYSSIIGLIKRTQRIYGNVFTVS 231 + A G R+ ++ + +Q EY + G++ +S Sbjct: 194 MIASGGAVRVANGCSITRGVITDIGLSRSHLANLQTVEYLRAFLSGRIGLSGLGSLLIIS 253 Query: 232 GVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLN----QWTIFYEPRALCWILMPETLK 287 G FR S + EVG +S + ITED+++ +LQ + + I + P + W +PE Sbjct: 254 GAFGLFRTSTVREVGGYSTETITEDMELIVRLQKHVSDEKHRITFVPYPVVWTEVPERRA 313 Query: 288 GLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCL--VGFIIYAV 345 L +QR RW +G E + + + + + L + +G+II Sbjct: 314 VLARQRGRWFKGMLETLWVHRRMIGNPKYGMVGLFALPFFLFVEALGPLIEGLGYIIVPA 373 Query: 346 QLAGVPLNIELTH--IAATHTAGILLCTLCLLQFIVSLMIENRYEHN 390 LN+ + G LL L +L +VS RY + Sbjct: 374 AFLLGYLNVPFFVVFLGVAVGVGTLLSWLSVLNEVVSF---RRYRNP 417 >UniRef50_A5FCY6 Candidate polysaccharide-forming beta-glycosyltransferase; Glycosyltransferase family 2 n=2 Tax=Flavobacteriaceae RepID=A5FCY6_FLAJ1 Length = 477 Score = 236 bits (602), Expect = 1e-60, Method: Composition-based stats. Identities = 92/458 (20%), Positives = 167/458 (36%), Gaps = 54/458 (11%) Query: 22 AYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYR--ERHWPWGENAPAPQLKDNPSISI 79 + ++ + + MS I+ + R + + + + P +S+ Sbjct: 8 ITIFTHIYLILILAYAILIMSSYLILAYLSGKELRGYLKKNSFIDYDVLLTSEFAPKLSL 67 Query: 80 IIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDR--------------- 124 I P +NE +EE + + L+ Y EVI VNDGS D + +L Sbjct: 68 IAPAYNEGFTIEENVKSLLSLNYNQYEVIVVNDGSKDNSMEVLINTYDLVLTELAFHQQI 127 Query: 125 --------MAAQIPHLR--VIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAA 174 ++ + ++ +N GKA AL G A++ Y+VCID D +LD+D+ Sbjct: 128 ETKKIRGLYVSRNAAFKKLIVVDKENGGKADALNAGLNIAQNPYVVCIDVDCILDKDSLL 187 Query: 175 YIVEPMLYNP--RVGAVTGNPRIRTR--------------STLVGKIQVGEYSSIIGLIK 218 + +P L + R+ A G RI + L+ +IQV EY L + Sbjct: 188 KLAKPFLESSGQRIIATGGVVRIANQCIIKNGRLIEVNVPDLLLPRIQVLEYLRAFLLGR 247 Query: 219 RTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQL------NQWTIFY 272 + +SG AF + G +S + ED+++ +++ +++ Y Sbjct: 248 MAWGRLDGLLLISGAFGAFDKEIALLAGGYSTKTVGEDMELVVRMRRYMLENNLPYSVRY 307 Query: 273 EPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYC--LTT 330 P LCW PE K KQR RW +G E + + + L Y Sbjct: 308 IPDPLCWTEAPEDFKIFKKQRSRWMRGTIETLDFHKKMFLNPKYKLLGMLSIPYWTLFEF 367 Query: 331 IWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRY--E 388 + +G +I V LN + + ++ ++Y + Sbjct: 368 LAPGIEFIGLVITIVFFIFGLLNWHFFLLLLLFVYTFAVF-FSVIALYSEEQTYHKYPKQ 426 Query: 389 HNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKK 426 + L P F L++ L+ + ++ K Sbjct: 427 RDFFKLLLAAFIEPFYFHPLTVYAALIGYKEKVMGTKG 464 >UniRef50_UPI0001C42DFC capsule anchoring protein CapD n=1 Tax=Bacillus pseudofirmus OF4 RepID=UPI0001C42DFC Length = 1000 Score = 236 bits (602), Expect = 1e-60, Method: Composition-based stats. Identities = 92/418 (22%), Positives = 173/418 (41%), Gaps = 55/418 (13%) Query: 46 IVGGVYFWVYRERHWPWGEN---APAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRY 102 I+G ++ ++ PW ++ + QL++ IS+++PC+NE+ +E ++ + Y Sbjct: 554 ILGIIFPALHLLHSLPWFKSKGESIRRQLEEEKGISVLVPCYNEQGIIETSLKNMSSLPY 613 Query: 103 ENIEVIAVNDGSTDKTRAILDRM------------------------AAQIPHLRVIHLA 138 + EVI +NDGS D+T ++L+ + P + VI Sbjct: 614 QKKEVIYINDGSKDRTFSLLNTYLKLKPSTKKPLQKLSHKRVRGVYQSELYPFIYVID-K 672 Query: 139 QNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRI--- 195 +N GKA AL G A + +V +D D +L +A V + V A G + Sbjct: 673 ENGGKADALNAGIEYAGEDIVVSLDADTVLTENALPK-VNEAFEDEDVVAAGGMVHVLQT 731 Query: 196 ----------RTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEV 245 + + ++Q+ ++ + K + + + +SG FR+ AL EV Sbjct: 732 KTKNPLKRLSLRHANFLVRLQMFDFLKAFYITKISLARFHALAVISGAFGIFRKDALLEV 791 Query: 246 GYWSDDMITEDIDISWKLQLN-----QWTIFYEPRALCWILMPETLKGLWKQRLRWAQGG 300 G + I EDIDI+ K+ + + A+ + MPET +KQR+RW + Sbjct: 792 GGYR-STIGEDIDITLKMHEYIAKHVNKKVIHLKEAISYTEMPETWNDFFKQRVRWQKAY 850 Query: 301 AEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIA 360 + + + +RK FR FF + + + FI+ +A ++ +++A Sbjct: 851 IDCVI-HFHSFFRKTLFRKAVSFF--YIFESFLVGIVSAFIMTGFFIANAIISPPESYLA 907 Query: 361 ATHTAGILLCTLCLLQFIVSLMIEN----RYEHNLTSSLFWIIWFPVIFWMLSLATTL 414 L ++ I +L++ R+ SLF I V + +L + Sbjct: 908 YVLYYLTFLLIFGVVYDIFALIMNRYHGFRFRWKDIPSLFVTILLDVFVYRFALMYIV 965 >UniRef50_C1TQW8 Glycosyl transferase n=1 Tax=Dethiosulfovibrio peptidovorans DSM 11002 RepID=C1TQW8_9BACT Length = 444 Score = 236 bits (602), Expect = 1e-60, Method: Composition-based stats. Identities = 105/446 (23%), Positives = 187/446 (41%), Gaps = 37/446 (8%) Query: 8 FFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAP 67 F + L L PL + + FV F + IM I+G + + R PW + Sbjct: 8 FRAVSLFLDAPLSPETWQVLLKFIPFVLFLEMPVYIMIILGVLKHCLERMYQLPWRSD-- 65 Query: 68 APQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYE-NIEVIAVNDGS-----TDKTRAI 121 PS+S ++ C++E V+ TI + Q Y I++I + DG+ T + Sbjct: 66 -----YYPSVSCLVTCYSEGDAVKHTIRSLARQIYPGRIQIIPIVDGAALNGETFRAAKS 120 Query: 122 LDRMAAQIPH--LRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEP 179 ++ A++ + L+V+ Q G+ AL TG A E ++ +DGD D D Sbjct: 121 MESAVARLSNRSLQVVPKWQRGGRVSALNTGMNFADGEIVMALDGDTSFDNDMVERATR- 179 Query: 180 MLYNPRVGAVTGNPRIRT-RSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFR 238 + VG V+G R+R RS LV ++Q EY I K G V +SG FR Sbjct: 180 HFEDRSVGCVSGCLRVRNARSGLVTRLQAIEYFLSIQAAKTGLSALGVVNNISGAFGVFR 239 Query: 239 RSALAEVGYWSDDMITEDIDISWKL-----QLNQWTIFYEPRALCWILMPETLKGLWKQR 293 RS L +G W ED+D++ ++ + + + I ++P A+ + +P+T + +KQR Sbjct: 240 RSVLDLIGGWDAGT-AEDLDLTLRVKNYFGRYDGFRIIFDPEAMGFTDVPDTFRDFFKQR 298 Query: 294 LRWAQGGAEVFLKNMTRLWRKENFRM--WPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVP 351 LRW ++ + WR + ++ W F T + + I+ V Sbjct: 299 LRW---DGDLSFLYFRKHWRSFSPKLVGWKNFLALLWTGLLFQVVMPLLIVCYTAYLFVA 355 Query: 352 LNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFW---ML 408 + L ++ + L++ R + +L +++ WI+ P+ + + Sbjct: 356 YPLSFAVWITVLVYLFYLAVTAVMFLLFVLLLSERPKEDL-ATMPWIVLSPLFAFACRVY 414 Query: 409 SLATTLVSFTRVMLMPKKQRARWVSP 434 TL L+ + R ++P Sbjct: 415 GAFATLWE-----LLGRGHRDTNMAP 435 >UniRef50_A1VIY0 Glycosyl transferase, family 2 n=1 Tax=Polaromonas naphthalenivorans CJ2 RepID=A1VIY0_POLNA Length = 476 Score = 235 bits (601), Expect = 2e-60, Method: Composition-based stats. Identities = 89/396 (22%), Positives = 165/396 (41%), Gaps = 12/396 (3%) Query: 30 MMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLK--DNPSISIIIPCFNEE 87 + + F + ++ ++ V +++ + P + P I++ I NEE Sbjct: 58 LAATLASVLFLIVVIMMLYAVRHFIFTINRLLGEQRHPYLDIAIARWPMITVFIAAHNEE 117 Query: 88 KNVEETIHAALAQRYE--NIEVIAVNDGSTDKTRAILDRMAAQIP-HLRVIHLA-QNQGK 143 K + I A L Y +++I VND STD+T AI+DR A+ P + H GK Sbjct: 118 KVIAGCIEALLNTDYPADQLKIIPVNDRSTDRTGAIIDRYVARFPSRISPFHRTLGKAGK 177 Query: 144 AIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPR-IRTRSTLV 202 + ALK A A+ + + D D + R + P +P VGAV G + + + L+ Sbjct: 178 SAALKDALAFAEGDIAIIFDADYVPGRGLLKQLAAPFF-DPEVGAVMGRVVPVNSGANLL 236 Query: 203 GKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWK 262 ++ E S + ++ + + G + R SA+ VG W DD + ED DI+++ Sbjct: 237 TRMLDLERSGGYQVDQQARMNMNLLPQYGGTVGGVRLSAVEAVGGWHDDTLAEDTDITYR 296 Query: 263 LQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPL 322 L N W Y R+ C+ +PE + KQ RWA+G +V + + + Sbjct: 297 LMFNGWKTVYSNRSECYEEVPEEWRVRIKQVKRWAKGHNQVMARYWWQFACSPYLTLAQR 356 Query: 323 FFEYCLTTIWAFTCLV----GFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFI 378 L ++ L+ G ++ L + +L I A G L + + Sbjct: 357 IDGLLLLFVFLIPLLMLIGWGLVLGLYFLNAGSMLSQLIPIFALMVYGTLGNFAAFFEIV 416 Query: 379 VSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTL 414 ++++++ + L + + +F + +L Sbjct: 417 IAVLLDGHRKRLRLLPLNILGFLVSLFAISGAVVSL 452 >UniRef50_B0C9M4 Glycosyl transferase, family 2 n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0C9M4_ACAM1 Length = 492 Score = 235 bits (600), Expect = 2e-60, Method: Composition-based stats. Identities = 88/382 (23%), Positives = 160/382 (41%), Gaps = 22/382 (5%) Query: 30 MMRFVFFWPFF-MSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEK 88 ++ + + F +SI+W ++++R R D P ++I +P FNE Sbjct: 20 ILTLIAIYSFHKISIIW-----RYYLHRRREISPLHKF---SDADLPQVTIQLPLFNEMY 71 Query: 89 NVEETIHAALAQRYE--NIEVIAVNDGSTDKTRAILD----RMAAQIPHLRVIHLAQNQG 142 V+ + A A Y +++ ++D STD+TR I + + ++ IH +G Sbjct: 72 VVDRLLEAVAALEYPVDKLQIQVLDD-STDETREICRAKVRELKQRHLNIDYIHRCDRKG 130 Query: 143 -KAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNP-RIRTRST 200 KA AL G +A + ++ D D + D +V NP+VG V I + Sbjct: 131 YKAGALAYGLQSATGDLVMIFDADFVPSPDTLINMV-HYFANPKVGMVQARWGHINRHYS 189 Query: 201 LVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDIS 260 ++ +IQ + ++ G F +G +R + + G W +TED+D+S Sbjct: 190 ILTEIQALMLDGHFVTEQTSRNRSGCFFNFNGTAGIWRIQTIEDAGGWQHTTVTEDLDLS 249 Query: 261 WKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMW 320 ++ QL W Y P + +P + Q+ RWA+G ++V K + + Sbjct: 250 YRAQLKGWECIYLPNIVVPAELPMEMNSFKSQQFRWAKGASQVAKKLLLPILTSNAPGHV 309 Query: 321 PLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVS 380 L + LT + + L+ ++ ++ + A H L+ TL LL F Sbjct: 310 KLEAFFHLTNNFNYLLLLVLLLLSLPYQLFLAETGWRYGLAIHLPLFLITTLSLLAFYSV 369 Query: 381 LMIENRYE---HNLTSSLFWII 399 E R + LTS+LF ++ Sbjct: 370 AQEEQRGQNSPWKLTSNLFLLM 391 >UniRef50_B8E0Z1 Glycosyl transferase family 2 n=1 Tax=Dictyoglomus turgidum DSM 6724 RepID=B8E0Z1_DICTD Length = 399 Score = 235 bits (600), Expect = 3e-60, Method: Composition-based stats. Identities = 74/378 (19%), Positives = 162/378 (42%), Gaps = 23/378 (6%) Query: 72 KDNPSISIIIPCFNEEKNVEETIHAALAQRYEN--IEVIAVNDGSTDKTRAILDRMAAQI 129 D P +S+++P NEEK E ++A L Y IE+I ++D STD+TR IL+ +++ Sbjct: 21 SDLPYVSVLVPMHNEEKVAENVLNALLNTDYPKDRIEIIPIDDNSTDRTREILEDYSSKY 80 Query: 130 PH-LRVIHLAQ--NQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRV 186 PH ++ ++ +GK AL A+ E ++ D D + + + L +P V Sbjct: 81 PHLIKPLYRGSYLPRGKPSALNDALKVAEGEIIIVFDADYIPPKGIIRDLAVSFL-DPEV 139 Query: 187 GAVTGNPRIRT-RSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEV 245 G V G L+ ++ E + ++ + + G + FR+ + ++ Sbjct: 140 GVVMGRVVPLNISKNLLTRLFDLERIGGYQVDQQARYNLKLIPQFGGTVGGFRKELILKL 199 Query: 246 GYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFL 305 G ++ ++ ED +++ K +N + Y RA C+ PET + KQ RW++G +V Sbjct: 200 GGFNPKILAEDTELTIKAYINGVKVCYTNRAECYEEAPETWEVRAKQIRRWSRGHNQVMF 259 Query: 306 KNMTRLWRKENFRMW------PLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHI 359 + + L + + L Y ++ ++ + +++ + + + + Sbjct: 260 RYLLPLIKSPYLSLREKVDGVFLLCVYLISPLFLIGLVDSIVLFFLGEMQILGSSFFIFL 319 Query: 360 AATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTR 419 +A +T G + + +++ +++ P+ + ++ + + F Sbjct: 320 SAYNTFGNFAP---FYEIGLGALLDGATYR------IFLLPLPLFNFFFNMWYSSLGFFD 370 Query: 420 VMLMPKKQRA-RWVSPDR 436 +L +R W +R Sbjct: 371 SVLDLITRRDPVWHKTER 388 >UniRef50_C6J2E9 Glycosyl transferase n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6J2E9_9BACL Length = 488 Score = 235 bits (599), Expect = 3e-60, Method: Composition-based stats. Identities = 88/428 (20%), Positives = 148/428 (34%), Gaps = 58/428 (13%) Query: 9 FILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPA 68 + ++L L Y+ ++ F F I+ I R + + Sbjct: 7 LLRDILLIYGLFATYYVMSVNLIYFSIMTFSFRQILTIF----------RRAAYSKYNTL 56 Query: 69 PQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAA- 127 + P +S+++P FNEE V E + L Y EVI +NDGS D T +L Sbjct: 57 SGSELVPPVSLLVPAFNEELTVIENVKCLLTLNYPTYEVIVINDGSKDNTLGVLREEFEL 116 Query: 128 ------------------------QIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCID 163 + PHL VI N GKA +L G A + ID Sbjct: 117 MPLPNPDVRHTIDCQPITAVYHNPKYPHLYVID-KPNGGKADSLNAGINLAHYPLISSID 175 Query: 164 GDALLDRDAAAYIVEPMLYNP-RVGAVTGNPRIRT--------------RSTLVGKIQVG 208 D+LL++DA + + NP A+ G+ RI + Q Sbjct: 176 ADSLLEKDALIRMARMYMENPEETVAIGGDVRIANGCMIENGAVKEVSLPRKIWPMFQAT 235 Query: 209 EYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQL--- 265 EY + + VSG FR+ + VG + ED++I KL Sbjct: 236 EYLKAFLGGRIGWSSINGLIIVSGAFGLFRKDYVIAVGGYRGGYPGEDMNIIIKLHRYML 295 Query: 266 ---NQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPL 322 ++ I + P A+CW P+T + L QR RW +G + ++N + M L Sbjct: 296 ENKLKYRIAFCPEAVCWTQAPDTYRILSNQRKRWGRGNLKNMIENYDMAFNPRYKVMGLL 355 Query: 323 FFEYCLTTIWAFTCL-VGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSL 381 Y + + ++ + + + + L + ++ Sbjct: 356 TMPYNIIFEALNPYFKITGLLALIGYTLMDMTHWRILAIFWLINFLSGYVLSIGALLLEE 415 Query: 382 MIENRYEH 389 + R++ Sbjct: 416 LAFKRFDK 423 >UniRef50_B7HFD6 N-acetyllactosaminide beta-1,6-N-acetylglucosaminyl-transferase n=76 Tax=Firmicutes RepID=B7HFD6_BACC4 Length = 433 Score = 234 bits (598), Expect = 4e-60, Method: Composition-based stats. Identities = 78/427 (18%), Positives = 155/427 (36%), Gaps = 21/427 (4%) Query: 14 VLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKD 73 + I + Y + F+F + G+ L+ Sbjct: 19 WISITFSIEYV---LIFTAFLFSGLLVYYSFLTIAGLIH---------RNSKRKDRTLEH 66 Query: 74 NPSISIIIPCFNEEKNVEETIHAALAQRYE-NIEVIAVNDGSTDKTRAILDRMAAQIPHL 132 PS+ I IP NE +++T+ A Y + + +ND S D+T I D H+ Sbjct: 67 YPSVDIFIPAHNEGIVIKDTLEAMAKIEYPGKLTIYLLNDNSQDETPEIGDDFDKAYAHI 126 Query: 133 RVIHLA--QNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVT 190 I + + +GK+ L G + + EY D D + A +VE Sbjct: 127 CHIRVPPGEPKGKSRVLNYGLSISDGEYFCVYDADNQPEPHALRMLVEHAETTEDAVGAV 186 Query: 191 GNPR-IRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWS 249 G+ R + + ++ E+ L++ + + +++G RRSAL E+G + Sbjct: 187 GHVRTVNENRNWLTRMISLEFQIFQLLMQSGRWLLFQTGSLTGTNMLLRRSALEELGGYD 246 Query: 250 DDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMT 309 I ED +++ ++ + + P ++ W PE LK L KQR RW QG + K + Sbjct: 247 PYAIAEDAELTLRITQKGYLLPIVPESVTWEQEPEHLKILIKQRTRWLQGNLYILEKMFS 306 Query: 310 RLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILL 369 L + + + + ++ ++ + + + L G+ ++ + + Sbjct: 307 SLSFFKGKLLVHSLQQVLVYVVFWLFLIISNVWFVIGLLGI---FQIQYSIPLLFMWYVA 363 Query: 370 CTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRA 429 + Q + +E + + + + +L+ + R KKQ Sbjct: 364 YITYVSQLFSAQSVERTFTPTNIFISVIMYFTYAQLFTYLFIRSLILYLRA--KSKKQVI 421 Query: 430 RWVSPDR 436 W R Sbjct: 422 GWDKTVR 428 >UniRef50_C0GFE2 Glycosyl transferase family 2 n=1 Tax=Dethiobacter alkaliphilus AHT 1 RepID=C0GFE2_9FIRM Length = 455 Score = 233 bits (595), Expect = 1e-59, Method: Composition-based stats. Identities = 98/450 (21%), Positives = 182/450 (40%), Gaps = 68/450 (15%) Query: 39 FFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSI--SIIIPCFNEEKNVEETIHA 96 ++++ V F + + P + +I++P FNE+ +++ I Sbjct: 2 LVLTLVIFSIFVMFQILYITIPLLNHKRGTELPEKLPELGLTILVPAFNEQHVIKQCILG 61 Query: 97 ALAQRYENIEVIAVNDGSTDKTRAILDR------------------------MAAQIPHL 132 AL Y+N E+I +NDGSTD T IL+ + P++ Sbjct: 62 ALRANYQNHELIIINDGSTDDTLGILESFLRLEAVEESINRTLTYKTVRGFYYSTIYPNV 121 Query: 133 RVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGN 192 VI N GKA AL G A ++ +V +D D++LD ++ + ++ V A G Sbjct: 122 YVID-KSNGGKADALNAGIDYANNDIVVTLDADSILDSNSLQAM-NSCFHDDSVIAAGGM 179 Query: 193 PRIRTRSTL-------------VGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRR 239 I + + + + Q+ +Y + L K Q + + ++G AFRR Sbjct: 180 VHIVQGAGVKGGKASPNFAVRGIIRYQIMQYLTSFYLHKFVQAKFKAMTVIAGAFGAFRR 239 Query: 240 SALAEVGYWSDDMITEDIDISWKLQL------NQWTIFYEPRALCWILMPETLKGLWKQR 293 S L E+ + + + ED+DI+ K+Q + A+C+ PE + L+KQR Sbjct: 240 STLLELNGFR-NTVGEDMDITLKIQRLIYTKERDKKMMVVSEAVCYTECPENFRNLFKQR 298 Query: 294 LRWAQGGAEVFLKNMTRLWRKENFRMWPLFF--EYCLTTIWAFTCLVGFIIYAVQLAGVP 351 +RW + + + L+ + +F + F + L T+ AF L+ ++ + G Sbjct: 299 IRWQKAFVDCIITYGHCLFSQFSFGLSVFFLIEGFMLGTLIAFNTLLVPLVLILSKTGHT 358 Query: 352 LNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSS-----LFWIIWFPVIFW 406 L + L I L L Q +V+ ++ R+ H+ + S L +I + + Sbjct: 359 LVLTLLTIYF---------ILALCQSLVAFLVSKRFGHSYSKSNYARILSFIPLEIITYR 409 Query: 407 MLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 +L L + + W +R Sbjct: 410 LLGLLWVTAGTIQYFYKKQH----WDKVER 435 >UniRef50_A9QSM4 N-acetylglucosaminyltransferase n=39 Tax=Lactobacillales RepID=A9QSM4_LACLK Length = 438 Score = 233 bits (595), Expect = 1e-59, Method: Composition-based stats. Identities = 90/439 (20%), Positives = 174/439 (39%), Gaps = 31/439 (7%) Query: 23 YFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIP 82 + + ++ + + +M + +FW+ + E +L P I+I++P Sbjct: 3 FIQTIMILAIISIWIATGLGLMILFSATHFWLTTIK----KERQKVFRLATYPKITIVVP 58 Query: 83 CFNEEKNVEETIHAALAQRYEN--IEVIAVNDGSTDKTRA----ILDRMAAQIPHLRVIH 136 NEE + +TI + + Y +E++ D D T ++ + +VI Sbjct: 59 AHNEEVVIAQTIESIMNLNYPKTQLELLLYADNCQDDTYNQIISVVSLEKYKYISAQVIQ 118 Query: 137 LAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNP-RVGAVTGNPRI 195 GKA L + A E++ D DA+ +++A ++VE L NP + GAV G + Sbjct: 119 RTGTGGKAGVLNDALSIATGEWICVYDADAMPEKNALHFLVEKALENPEKYGAVFGRNKT 178 Query: 196 RTRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMIT 254 R + + E + + + + G + + +G W + +T Sbjct: 179 RNYKQNFLTRCINLEIVNTQRIHHVGLWHLFKIGRIPGTNFIIKTDFVKHIGGWDNGALT 238 Query: 255 EDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRK 314 ED IS+K+ + I + + PET+ + QR RWA+G +V L N L+ + Sbjct: 239 EDTAISFKIMQSGKLIALAHNSEAFQQEPETVSAYYHQRKRWAKGNYQVILDNFKHLFDR 298 Query: 315 ENFRMWPLFFEYCLTTIWA-FTCLVGFIIYAVQLA--GVPLNIELTHIAATH-------- 363 +R+ F Y T W ++ II+ + G+ + THI T Sbjct: 299 SVWRIKWEVFYYISTFFWFNAAIVISDIIFVANIVTWGISMFNPSTHILFTIGSNNIELA 358 Query: 364 ----TAGILLCTLCLLQFIVSL--MIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSF 417 L+ + LLQ ++L NL+ +L + + +F ++S+ + + Sbjct: 359 NLLMINWFLMVGIYLLQINIALASQFGQLTVKNLSVALISYVTYAQLFIVVSVVSVVSVL 418 Query: 418 TRVMLMPKKQRARWVSPDR 436 +L + +W R Sbjct: 419 LDHILH--RDGTKWYKTKR 435 >UniRef50_A4YD58 Glycosyl transferase, family 2 n=12 Tax=Sulfolobaceae RepID=A4YD58_METS5 Length = 395 Score = 233 bits (595), Expect = 1e-59, Method: Composition-based stats. Identities = 97/359 (27%), Positives = 164/359 (45%), Gaps = 22/359 (6%) Query: 41 MSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQ 100 +SI+ + VY + W + P+ PS S+++P NEEK + + + Q Sbjct: 9 LSIIVSIWSVYNSAFAIYGLSW--KSDEPKTSSGPSFSLLVPVRNEEKVLGRLLERLVNQ 66 Query: 101 RYE--NIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQ---GKAIALKTGAAAAK 155 Y+ E+I + DGSTD T + ++ + ++ +HL ++ GK+ AL G ++ Sbjct: 67 EYDRSKYEIIVLEDGSTDNTLGVCNKFSEMYSIIKCVHLEKSNVVNGKSRALNYGLKISR 126 Query: 156 SEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGN-PRIRTRSTLVGKIQVGEYSSII 214 + + D D + D Y+ + + N RVG V G I R ++V ++ E Sbjct: 127 GDIIGVFDADTVPRLDVLGYVAQKFISNSRVGGVQGRLVPINVRESIVARLASLEELFSE 186 Query: 215 GLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEP 274 I + G + G + RR AL +VG W+++++TED+D+S KL + I Y P Sbjct: 187 YSI-SGRARAGLFVPLEGTCSFVRRDALEKVGGWNENVLTEDLDLSLKLTSLNYLIVYSP 245 Query: 275 RALCWILMPETLKGLWKQRLRWAQGGAEVFLKN--MTRLWRKENFRMW---PLFF----- 324 W +P T L +QRLRW +G E+ ++ WR + M P+F Sbjct: 246 SVQSWREVPVTFSSLVRQRLRWYRGNFELTMRISRFKFTWRLVDAAMLVGTPVFMVLSLA 305 Query: 325 EYCLTTIWAFTC--LVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSL 381 Y L I+++ L+ II + + L I ++ T I+L L L F +SL Sbjct: 306 NYSLVFIYSYQLHVLIAAIISFSSMMTLLLIIMISRRHMIETIYIILSAL-YLNFTISL 363 >UniRef50_B0U7V3 Glycosyl transferase family 2 n=1 Tax=Methylobacterium sp. 4-46 RepID=B0U7V3_METS4 Length = 483 Score = 233 bits (594), Expect = 1e-59, Method: Composition-based stats. Identities = 84/380 (22%), Positives = 148/380 (38%), Gaps = 31/380 (8%) Query: 26 SGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFN 85 + + + ++ ++ + F+ H PA P +SI++P FN Sbjct: 64 TTAWGIFLIVIGASIIARWLVIQAMAFY----EHDRVRRKPPAELPNPAPFVSILVPAFN 119 Query: 86 EEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRV-IHLAQNQGKA 144 E + V + + Y N E+I V+DGSTD T +A Q + + ++ N GK Sbjct: 120 ESETVIGAPTSLMTLDYPNYEIIFVDDGSTDDTFIKAFPLAGQYGNCTLRVYTKPNGGKW 179 Query: 145 IALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGK 204 +L AK + L+C+D D+ L +DA +V P + V G IR R+ + K Sbjct: 180 SSLNFAYKKAKGDLLLCVDADSGLAKDALRVMV-PRMSEKGVVGCAGQVTIRNRNNFLTK 238 Query: 205 IQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEV------------------- 245 +Q EY G ++ + G V V G I ++R+ L E+ Sbjct: 239 LQAAEYLLGNGGMRMALSLMGVVTVVPGPIGLYKRAILEEITKIPRKLSAAAAVTHSGPG 298 Query: 246 ---GYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAE 302 G S + ED +S I YEPRA + P+ + L QR RW +G + Sbjct: 299 AVNGPLSGETFAEDFQLSLSALALGGRIVYEPRAYAYTKCPDDVATLMSQRYRWIRGTWQ 358 Query: 303 VFLKNMTRLWRKENFRMWPLFFEYCLTTIWA---FTCLVGFIIYAVQLAGVPLNIELTHI 359 V++ + L + N +T ++ F + + + +A + + + Sbjct: 359 VYIVYVRTLSKLTNRGNKSSLLNTVMTVLYPADIFLVPILNFFFWISIAVSMASGQSMNF 418 Query: 360 AATHTAGILLCTLCLLQFIV 379 +LL + + Sbjct: 419 ILAWIGSVLLMNVMTAMVYI 438 >UniRef50_D1N3C9 Glycosyl transferase family 2 n=1 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1N3C9_9BACT Length = 444 Score = 233 bits (594), Expect = 1e-59, Method: Composition-based stats. Identities = 100/423 (23%), Positives = 175/423 (41%), Gaps = 36/423 (8%) Query: 26 SGELMMRFVFFWPF-FMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCF 84 + F FF+ + ++ ++ YRE P ++ P ++IIP + Sbjct: 38 FDRFSLAFTFFFLCCPLGYFLLIAVCRYFFYRE--------IPTVPPEELPVCTVIIPAY 89 Query: 85 NEEKNVEETIHAALAQRYE--NIEVIAVNDGSTDKTRAILDRMAAQ-IPHLRVIHLAQNQ 141 NE + ++ + + Y +E+IAVNDGS D T + + A + +RVI +N+ Sbjct: 90 NEGETIKLALESVCRSEYPADKLEIIAVNDGSIDDTLRWMQQAAEESGGRIRVITYPENR 149 Query: 142 GKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRT-RST 200 GK AL G + E V +D D+LL DA A +V P + VGAV+GN R+ Sbjct: 150 GKRYALYQGFRSGSGEVFVTVDSDSLLKPDALARMVSPFRRDHSVGAVSGNVRVANLEGG 209 Query: 201 LVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALA--------EVGYWSDDM 252 + + ++ I+ Q ++ +VF G ++A+RR+AL + + Sbjct: 210 IFPPMIDAGFTFAFDFIRAGQTVFKSVFCTPGALSAYRRTALMPLLNEWLNQTFFGQPAG 269 Query: 253 ITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLW 312 I ED ++ + + + + A + +PE +G+ K LRW + L+ + ++ Sbjct: 270 IGEDRALTNLILRQGYGVIMQRSAYIYTNVPECYRGISKMLLRWERSNIRENLEMYSFIF 329 Query: 313 RKENF---RMWPLFFEYCLTTIW--AFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGI 367 R F R W + L T+ ++ F +Y + L + IAAT Sbjct: 330 RNFRFTDPRRWFMLATLLLYTLMTVLPVVMIWFTLYNIFTTNGTL---VLSIAATAMLWS 386 Query: 368 LLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQ 427 L L LQ N H L + I +FW++ A + +R + Sbjct: 387 TLPALVNLQ-------RNSLRHILYCYFYGIFHALTLFWVVPYAVFTIGNSRWLTRSGAN 439 Query: 428 RAR 430 A Sbjct: 440 SAG 442 >UniRef50_A3I0Z1 Glycosyltransferase n=1 Tax=Algoriphagus sp. PR1 RepID=A3I0Z1_9SPHI Length = 489 Score = 232 bits (593), Expect = 2e-59, Method: Composition-based stats. Identities = 89/396 (22%), Positives = 162/396 (40%), Gaps = 11/396 (2%) Query: 26 SGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFN 85 L++ F+ I + G W + + W + P ++ P +++ +P FN Sbjct: 3 FIYLLIGIYTLGLLFIFIYSLAQGNLLWNF-WKARKWLASTPMKEMDTWPKVTVQLPIFN 61 Query: 86 EEKNVEETIHAALAQRYEN--IEVIAVNDGSTDKTRAILDRMAAQIP--HLRVIHLAQNQ 141 E V+ I AA Y +E+ ++D STD+T ++ P + + IH Sbjct: 62 ELYVVDRLIEAAANLNYPKELLEIQLLDD-STDETVDLIQEKIKNYPEVNFQYIHRQDRV 120 Query: 142 G-KAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRS- 199 G KA ALK G A+ E++ D D + D D + P + +VG V RS Sbjct: 121 GFKAGALKEGLVNAEGEFIAIFDADFVPDPDFLLKTL-PYFSSEKVGMVQSRWTHLNRSY 179 Query: 200 TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDI 259 +L+ ++Q + + + + +G +R+S + + G W DD +TED+D+ Sbjct: 180 SLLTRLQAFALDAHFLIEQMGRNYQHAFINFNGTGGVWRKSCILDSGNWHDDTLTEDLDL 239 Query: 260 SWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLW-RKENFR 318 S++ Q W Y P +P + + Q+ RW +GGAE K+++ + +K FR Sbjct: 240 SYRAQRKGWEFIYRPEIESPAELPPIMSAVKSQQFRWTKGGAECARKHISGVMSQKLPFR 299 Query: 319 MWPLFFEYCL-TTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQF 377 + F + ++I+ LV V AG+ I I + + Sbjct: 300 VKFHAFAHLFNSSIFIAILLVSLSSIGVWWAGIKGMIPERLFQLAGIFMIGFVIIAGVYL 359 Query: 378 IVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATT 413 + + +L + + F + LSL + Sbjct: 360 VSYFYARRSFLKSLGQVFWQLPIFLSVSMALSLHNS 395 >UniRef50_A6CMH6 Glycosyl transferase, group 2 family protein n=2 Tax=Bacillus RepID=A6CMH6_9BACI Length = 449 Score = 232 bits (593), Expect = 2e-59, Method: Composition-based stats. Identities = 94/393 (23%), Positives = 163/393 (41%), Gaps = 53/393 (13%) Query: 30 MMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKN 89 M + F+S + ++ +Y E ++ N S S+I+P +NEEK Sbjct: 1 MTMLYITFAVFLSFV-----IFQTIYVLIPLFKNEKKHKVRITKNHSFSVIVPAYNEEKV 55 Query: 90 VEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRM------------------------ 125 +E I Y+N E++ VNDGS D T +L + Sbjct: 56 IENCIEGFRQVEYKNAELLIVNDGSKDNTLRVLHEVLQLKPASRMIDTRLKHERVTGFYQ 115 Query: 126 AAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPR 185 + + P + VI+ +N GKA +L G +++E +V +D D++L++++ + Sbjct: 116 SKKFPSIFVIN-KKNGGKADSLNAGINFSENEIVVTLDADSILEKNSLNEM-NAAFQKKS 173 Query: 186 VGAVTGNPRIRT-------------RSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSG 232 V A GN I + V K Q +Y S L KR Q ++ ++G Sbjct: 174 VLACGGNVLISQAFNGSLENLKPTLKVNNVIKFQFLQYLSSFYLHKRAQAAMNSITVIAG 233 Query: 233 VIAAFRRSALAEVGYWSDDMITEDIDISWKLQL----NQWT--IFYEPRALCWILMPETL 286 AFRR L + + + ED+DI+ KL I + P ALC+ P TL Sbjct: 234 AFGAFRRETLFAIKGFR-KTVGEDMDITLKLHKWIKERGNKESITFVPTALCYTECPSTL 292 Query: 287 KGLWKQRLRWAQGGAEVFLKNMTRLWRK--ENFRMWPLFFEYCLTTIWAFTCLVGFIIYA 344 K ++KQR+RW + + + +RK + F ++ L ++ + T+ AF L+ ++ Sbjct: 293 KNMFKQRVRWQKAFIDCLIYYRKCYFRKLPKKFSIFFLVDQFTIGTLSAFPVLLTPVLLY 352 Query: 345 VQLAGVPLNIELTHIAATHTAGILLCTLCLLQF 377 + L + L A L T+ + Sbjct: 353 FNGGQLTLLLMLMSGAFLLFVYQSLTTMFVSYL 385 >UniRef50_Q6ACB6 Glucosaminyltransferase n=1 Tax=Leifsonia xyli subsp. xyli RepID=Q6ACB6_LEIXX Length = 421 Score = 232 bits (592), Expect = 2e-59, Method: Composition-based stats. Identities = 93/422 (22%), Positives = 162/422 (38%), Gaps = 21/422 (4%) Query: 26 SGELMMRFVFFWPFFMSIMWIVGGV-YFWVYRERHWPWGENAPAPQLKDNPSISIIIPCF 84 + + + V ++ +V F+V W + AP L P ++I++P + Sbjct: 6 ALQWIGLVVCLILALTGLIPVVAAAATFFVIPLHAWINHYHKAAPYL---PRVAIVVPAW 62 Query: 85 NEEKNVEETIHAALAQRYEN--IEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQG 142 NE + +I + Y + + V+D STD T ++ A + P + + G Sbjct: 63 NEGAVIGASIDRLVTLDYPKEALRIYVVDDASTDDTSVVVRERAMRYPGNVFLFRREKGG 122 Query: 143 --KAIALKTGAAAAKS----EYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIR 196 KA L G + E L+ +D D + D+ + L +P+VGAV+ Sbjct: 123 QGKAHTLNHGIERIFADDWMEALLIMDADVIYQPDSLRKMTR-HLADPKVGAVSAYIHEG 181 Query: 197 TRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGY-WSDDMIT 254 + + K EY +R Q + G ++G R L +G + Sbjct: 182 SADRNYLTKFVSTEYVLSQPTARRAQNVLGAQACLAGGAQLHSRENLIAIGGQVDTSTLA 241 Query: 255 EDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRK 314 ED +++ QL + +EP A P T+ LWKQRLRWA+G V + +R Sbjct: 242 EDTITTFETQLRGKRVVFEPHAHVLAEEPRTIDALWKQRLRWARGNVSVTARYSKVWFRP 301 Query: 315 ENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCL 374 + +W F+ + +I + G+ + L + AT IL + L Sbjct: 302 SRKHH---LGAWTFGIVW-FSLWLLPLIMVISSIGLAGLLFLHNGFATTVFRILWISAAL 357 Query: 375 LQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSP 434 V ++ + ++ FP I ML + T L F +ML + + S Sbjct: 358 AYLFVLILGSQTDVGTTRKEVGHVMLFPGIVNMLVMLTAL--FPTMMLTGFRGCSASGSG 415 Query: 435 DR 436 R Sbjct: 416 AR 417 >UniRef50_A3HSN4 Glycosyl transferase, group 2 family protein n=1 Tax=Algoriphagus sp. PR1 RepID=A3HSN4_9SPHI Length = 486 Score = 232 bits (592), Expect = 2e-59, Method: Composition-based stats. Identities = 82/397 (20%), Positives = 146/397 (36%), Gaps = 58/397 (14%) Query: 24 FHSGELMMRFVFFWPFFMSIMWIVGGVYFWVY------RERHWPWGENAPAPQLKDNPSI 77 F LM F+ + I+ V + R + + P + Sbjct: 4 FLFYFLMEALGIFFLLISFTVIIIYFVTMILSALELRDHLRKNQFADYQDIITSPIAPGV 63 Query: 78 SIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDR------------M 125 SII P +NE +++ + + ++ Y EVI +NDGS D T L Sbjct: 64 SIIAPAYNEGQSIVQNAKSLMSLHYGKFEVIIINDGSKDDTLQKLLEAFELEKTEFAYDY 123 Query: 126 AAQIPHLR-------------VIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDA 172 +R V+ +N GKA AL +G A E + CID D +L D+ Sbjct: 124 QIDCKQVRGVYKSKNRSYSKLVVVDKENGGKADALNSGINLASLEIIACIDVDCILSHDS 183 Query: 173 AAYIVEPMLY--NPRVGAVTGNPRIRTR--------------STLVGKIQVGEYSSIIGL 216 +V P + N +V AV G I ++L+G+ QV EY + Sbjct: 184 ITRMVRPFMEETNRKVIAVGGAIGIANNCDVQDGTVTKYRLPTSLLGRFQVIEYFRAFLM 243 Query: 217 IKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLN------QWTI 270 + + +SG F + + +VG + + ED+++ +++ + + Sbjct: 244 GRMAWTRVNGLMLISGAFGFFNKDLVLKVGGYFPKTVGEDMELVVRMRRYMEEQKIPYRV 303 Query: 271 FYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTT 330 + P LCW +PE L +QR RW +G E + + + + L + + Sbjct: 304 GFVPDPLCWTEVPEDESVLARQRNRWIRGTIETLQLHRPIRFNPKYGILGMLAYPF---- 359 Query: 331 IWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGI 367 W + +I + + I L +A + + Sbjct: 360 -WNVFEKMAPLIELLGILYTLTLIVLGDFSAFYFFAL 395 >UniRef50_B2UMM8 Glycosyl transferase family 2 n=3 Tax=Verrucomicrobia RepID=B2UMM8_AKKM8 Length = 505 Score = 232 bits (592), Expect = 2e-59, Method: Composition-based stats. Identities = 82/371 (22%), Positives = 157/371 (42%), Gaps = 18/371 (4%) Query: 31 MRFVFFWPFFMSIMWI-VGGVYFWVYRERHWPW----GENAPAPQLKDNPSISIIIPCFN 85 + + F W ++ + + G F + W + P + ++ P +++ +P FN Sbjct: 3 LNYNFIWLLCYLLVLVGLAGYGFHRLSIVYLYWKNRNNKPQPKARFQELPVVTVQLPMFN 62 Query: 86 EEKNVEETIHAALAQRYE--NIEVIAVNDGSTDKTRAIL----DRMAAQIPHLRVIHLAQ 139 E+ V+ + + A Y +E+ ++D STD T + + ++ IH Sbjct: 63 EKFVVDRLLESVAALDYPQDKLEIQILDD-STDDTTEQCYRKVEELKSRGFDAVCIHRTD 121 Query: 140 NQG-KAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNP-RIRT 197 G KA AL+ AK E+L+ +D D + + D + + VG V I Sbjct: 122 RTGFKAGALEAATKVAKGEFLLILDADFVPEPDLLQKTI-HFFTDENVGLVQTRWGHINR 180 Query: 198 RSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDI 257 L+ +IQ + + + G FT +G +R+ + + G WS D +TED+ Sbjct: 181 EYNLLTRIQGMYLDGHFAMEQTARNRSGRFFTFNGTAGIWRKCVIGDAGGWSHDTLTEDM 240 Query: 258 DISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENF 317 D+S+++QL W Y + +P + G Q+ RW +G +V K + +WR Sbjct: 241 DLSYRVQLRGWRFIYLNDVVTPAELPVDMDGFKSQQHRWTKGSIQVCQKILLDIWRSNAP 300 Query: 318 RMWPLFFEYCLTTIWAF--TCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLL 375 + LT +++ L+ F++Y + +P N E + + A + ++ + Sbjct: 301 LKAKVEATTHLTCNYSYLLLALLCFLVYPICTQRIPEN-ETVFMWFVNVALFFMTSVAVC 359 Query: 376 QFIVSLMIENR 386 F +S I R Sbjct: 360 IFYMSAQIVVR 370 >UniRef50_Q2RGJ6 Glycosyl transferase, family 2 n=7 Tax=Firmicutes RepID=Q2RGJ6_MOOTA Length = 466 Score = 232 bits (592), Expect = 2e-59, Method: Composition-based stats. Identities = 104/424 (24%), Positives = 188/424 (44%), Gaps = 20/424 (4%) Query: 6 VSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGEN 65 +S + L V F S L++ F+ + P ++ V + +R P+ + Sbjct: 57 LSVILSLPWLRDLSEVIGFVSAILVITFIAYVPGYLVAFLAVS-----LLVDRQPPFKVS 111 Query: 66 APAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYEN-IEVIAVNDGSTDKTRAILDR 124 P D P I+I+I NE N+ T+ Q Y I V+ V++GSTD T + Sbjct: 112 DP-----DLP-ITILIAARNEASNIGMTLQYIANQNYRGPISVVLVDNGSTDGTADVALA 165 Query: 125 MAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYN- 183 A ++ + ++ GK AL G A + Y + +D D LL R A +IV +L + Sbjct: 166 KARELKLNLLCLREESPGKNFALNAGLAKIVTPYFITLDADTLLHRFAVKHIVARLLSSP 225 Query: 184 PRVGAVTGNPRIRT-RSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSAL 242 P VGAV G+ +R R + +IQ +Y I IKRTQ +Y G + ++ A+ Sbjct: 226 PDVGAVAGHVLVRNSRDNFLTRIQEWDYFLGIASIKRTQGLYQGTLVAQGAFSLYKTEAV 285 Query: 243 AEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAE 302 +V W D I EDI ++WKLQ + +++EP A+ + +P+ ++ +QR RWA+G E Sbjct: 286 RQVSGW-PDTIGEDIVLTWKLQQAGYRVYFEPTAVAFTAVPKVVRHFVRQRSRWARGMVE 344 Query: 303 VFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAAT 362 +KN+ ++ + + + + I F LV + L T Sbjct: 345 -GMKNVPPWYQPSYLKKFLTGIDLVIPVIDLFYTLV----WIPGLILACFGKYYIVGPYT 399 Query: 363 HTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVML 422 L + L+ + L + + + + + + +I+ ++S +L + + + Sbjct: 400 LFVLPLNIGVSLIMYTYQLRVFGQLDLRVRKNRLGFFAYVLIYQLISSPVSLWGYVQELF 459 Query: 423 MPKK 426 ++ Sbjct: 460 QTRR 463 >UniRef50_UPI000178A1DC glycosyl transferase family 2 n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI000178A1DC Length = 462 Score = 231 bits (590), Expect = 4e-59, Method: Composition-based stats. Identities = 89/433 (20%), Positives = 161/433 (37%), Gaps = 54/433 (12%) Query: 34 VFFWPFFMSIMWIVGGVYFWVYRE---RHWPWGENAPAPQLKDNPSISIIIPCFNEEKNV 90 ++ +S ++ V ++ + + R + + + PS+S+++P +NEE + Sbjct: 3 AIYYVMIVSALYFVILMFSFRNIKDIFRRATYSKYNTLSGSELVPSVSLLVPAYNEELTI 62 Query: 91 EETIHAALAQRYENIEVIAVNDGSTDKTRAILDR-------------------------M 125 E + + Y EVI +NDGS+D+T +L Sbjct: 63 AENVRCLMTLNYPTYEVIVINDGSSDETLKVLIEEYGLTLLPNPEIRGNISTNKVRGIYY 122 Query: 126 AAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNP- 184 Q P L +I +N GKA +L G ++ + ID D+LL++DA + + NP Sbjct: 123 NPQYPQLYLID-KENGGKADSLNAGINLSRYPLISSIDADSLLEKDALIRMARMYMENPE 181 Query: 185 RVGAVTGNPRIRT--------------RSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTV 230 A+ G+ RI + Q EY + + + V Sbjct: 182 ETVAIGGDVRIANGCVIENGEVKDVSLPRRMWPMFQSIEYLKAFLGGRIGWSHFNGLIIV 241 Query: 231 SGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLN------QWTIFYEPRALCWILMPE 284 SG FR+ + VG + ED++I KL ++ + + P A+CW P+ Sbjct: 242 SGAFGLFRKDYVIAVGGYRGGYPGEDMNIIIKLHRYMLENKIKYRVSFCPEAVCWTQAPD 301 Query: 285 TLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYA 344 + L QR RW +G + ++N ++ + M L Y + A Sbjct: 302 SYNILSNQRKRWGRGNLKNMIENRDMVFNPKYKVMGLLTMPYNVI-FEALNPYFRITGLL 360 Query: 345 VQLAGVPLNIELTHIAATHTAGILLC--TLCLLQFIVSLMIENRYEHNLTSSLFWIIWFP 402 L V L++ H+ L L + I+ M RY L+ + +++ Sbjct: 361 ALLGYVLLDMTNWHVLVVFGLVNFLSGYLLSVGALILEEMAFRRYT-KLSDLVKMLLFSA 419 Query: 403 VIFWMLSLATTLV 415 + F L Sbjct: 420 LKFVGYHQLGVLW 432 >UniRef50_D1JE47 Conserved hypothetical membrane protein, glycosyl transferase family 2 n=1 Tax=uncultured archaeon RepID=D1JE47_9ARCH Length = 392 Score = 231 bits (590), Expect = 4e-59, Method: Composition-based stats. Identities = 74/366 (20%), Positives = 146/366 (39%), Gaps = 15/366 (4%) Query: 73 DNPSISIIIPCFNEEKNVEETIHAALAQRYENI-EVIAVNDGSTDKTRAILDRMAAQIPH 131 P+++++IP +NEEK + +TI++ Q Y N+ +++ V+D STDKT I A+ Sbjct: 35 KKPTVTVLIPAYNEEKTIADTINSIRQQTYSNVVQILVVDDSSTDKTDEIAREGDAE--- 91 Query: 132 LRVIHLAQNQG-KAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVT 190 VI N G KA A K E ID D +LD +A +V L P + + Sbjct: 92 --VIRTPSNSGTKAQAQNFALPYLKGELTATIDADTILDVEAIDTMVLRFLKEPDIASTC 149 Query: 191 GNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSD 250 + R + EY + + K+TQ G SG + F S L + G + D Sbjct: 150 SFVVPQVRKNFWELGRTVEYIYGLFMRKKTQEYLGIPIVCSGCFSLFNTSLLQKYG-FKD 208 Query: 251 DMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTR 310 + ED+D++W+L + + A+C+ P++ K Q RW + + + Sbjct: 209 RTMAEDMDLTWQLLAEGKKVRFCADAICFPKDPDSWKTYKGQVSRWLRSFFQNLSVHKKD 268 Query: 311 LWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLA---GVPLNIELTHIAATHTAGI 367 +W+ + +F L + +++++ + + ++ I Sbjct: 269 IWKNKKL---AVFLYSYLIEGFMGVIFYALLLWSIAFPPTQYIVVTSPVSLIIPVKAIMP 325 Query: 368 LLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQ 427 LL + S++ + + + +++ + +F + ++ Sbjct: 326 LLLLSSVAIVTASVLYQGHKLKISKIEILKGVPCYFALSIVNCCLFMEAFLNEWVFKRRL 385 Query: 428 RARWVS 433 +W Sbjct: 386 E-KWEK 390 >UniRef50_B9AE19 Putative uncharacterized protein n=1 Tax=Methanobrevibacter smithii DSM 2375 RepID=B9AE19_METSM Length = 461 Score = 231 bits (589), Expect = 4e-59, Method: Composition-based stats. Identities = 90/433 (20%), Positives = 162/433 (37%), Gaps = 26/433 (6%) Query: 14 VLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKD 73 + + C A SG L + + + + +++ P P Sbjct: 38 IFLLVTCSALIISGILARNVTWLQWLLLIPTLTMLFLAIISTKKQEKPIPYEKP------ 91 Query: 74 NPSISIIIPCFNEEKNVEETIHAALAQRY-----ENIEVIAVNDGSTDKTRAILDRMAAQ 128 P +S+IIP NEE + +T+ + Y N E+I VNDGSTD T L + Sbjct: 92 -PFVSLIIPAHNEEYTIAQTVTSISKIDYTLNGKPNFELIVVNDGSTDSTGEKLSELKKD 150 Query: 129 IPHLRVIHLAQ---NQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPR 185 IP LR++ +GK L + +K E + D D +++D IV P L NP+ Sbjct: 151 IPILRIVTRKPPKSGKGKGFVLNDALSLSKGEIIGVFDADTQVEKDFL-NIVMPYLNNPK 209 Query: 186 VGAVTGNPRIRTRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAE 244 V V ++ + + +Q E+ S + + G + G ++ A+ + Sbjct: 210 VQGVQTRVKMYNKDENFLANMQHVEFESFGNTL-IAKDNLGKSGFLGGNGQFVKKQAILD 268 Query: 245 VGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVF 304 W +TED+++S K+ L I Y + K +KQR RWA G E Sbjct: 269 GEKWDGFAVTEDLNLSVKILLKGGQIRYCGETAVYQEAVTDWKSFFKQRTRWAIGNFETI 328 Query: 305 LKNMTRLWRKE-NFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATH 363 + ++ + E+ + GFI+ + + +T I Sbjct: 329 FIYLPKILKSPLPLIKKYGIIEHISFYAFNLFIFFGFIVSMLNAISWFIFHGVTIIRMD- 387 Query: 364 TAGILLCTLCLLQFIVSLMIE-NRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVML 422 A +++ + + FI + I R + + II +W+ + F + Sbjct: 388 -APLIVGIISTIAFIPGISIALLREKAGPLKFIKDII----GYWIYCFHLIPLFFETMFK 442 Query: 423 MPKKQRARWVSPD 435 M ++ +W Sbjct: 443 MIIRKERKWDKTK 455 >UniRef50_A9FZQ2 Glycosyltransferase n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9FZQ2_SORC5 Length = 521 Score = 231 bits (589), Expect = 5e-59, Method: Composition-based stats. Identities = 83/362 (22%), Positives = 141/362 (38%), Gaps = 11/362 (3%) Query: 47 VGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYE--N 104 + V + A +D P ++I +P FNE + A Y Sbjct: 25 LHLVVLCRLNRAKITRAQEVAALTDRDLPPVTIQLPLFNESTVAARLLDAVAKMDYPRDK 84 Query: 105 IEVIAVNDGSTDKT----RAILDRMAAQIPHLRVIHLAQNQG-KAIALKTGAAAAKSEYL 159 +E+ ++D STD+T RA ++R+ A +H G KA AL G AK E + Sbjct: 85 LEIQVLDD-STDETQGLVRAHVERLRALGLDAVYLHRVDRVGYKAGALDAGLKIAKGELV 143 Query: 160 VCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRS-TLVGKIQVGEYSSIIGLIK 218 D D + D IV +P VG V R +++ ++Q + Sbjct: 144 AIFDADFIPQPDFVRSIVG-HFEDPTVGMVQTRWGHLNRDVSILTQVQALMLDGHHLVEN 202 Query: 219 RTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALC 278 R + G +F SG +R+ A+ E G W D +TED+D+S++ QL + Y + Sbjct: 203 RARFGAGLLFNFSGTGGMWRKDAIREAGGWQHDTLTEDLDLSYRAQLAGYRFVYREDVVS 262 Query: 279 WILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLV 338 +PE + L Q+ RWA+G + K M + + + + LT +A+ LV Sbjct: 263 PAELPEDISALRAQQYRWAKGTVQTARKLMATVLSAKLSLGQRIEAFFHLTPHFAYPLLV 322 Query: 339 GFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQF-IVSLMIENRYEHNLTSSLFW 397 + + + + + A T L F +++ + R L Sbjct: 323 LLSVLLLPALVLFPAADTLTMIAIDLPLCTATTGSLAAFYMLAETAQGRSRWGAVRRLPM 382 Query: 398 II 399 +I Sbjct: 383 LI 384 >UniRef50_Q9UY40 Glycosyl transferase, family 2 n=2 Tax=Thermococcaceae RepID=Q9UY40_PYRAB Length = 447 Score = 230 bits (588), Expect = 5e-59, Method: Composition-based stats. Identities = 93/438 (21%), Positives = 170/438 (38%), Gaps = 14/438 (3%) Query: 10 ILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAP 69 L L + I + +A + + V F M + + +P+ E Sbjct: 11 ALYLYILIIIGLALVIPPKYALEIVLIILFLMVSSGSIFYTLLMASLGKRYPYDETGFNL 70 Query: 70 QLKDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQI 129 + + P + ++IP NEE+ + +T+ + L Q Y N++VI +ND STD+TR I++ + + Sbjct: 71 EFLE-PLVYVLIPAHNEERVIYKTVRSVLGQDYRNMKVILINDNSTDRTRDIMEEINRKY 129 Query: 130 PH-LRVIHLAQNQG--KAIALKTGAA-----AAKSEYLVCIDGDALLDRDAAAYIVEPML 181 P + +I + +G K AL Y+ +D D L+ +A +V M Sbjct: 130 PRKVVIIDVPPERGRSKPRALNYALEIIEKYMTHPNYVFILDADYLIPPNALKTLVSIME 189 Query: 182 YNPRVGA-VTGNPRIRT-RSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRR 239 P+ + GN R R R V K E + G +A R Sbjct: 190 SAPQYVIGIQGNVRPRNFRKNFVTKFITLERLVGFNVAIEGDMKLNENGKYGGTVALLRF 249 Query: 240 SALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQG 299 L +G + +D +TED D+ + + + +Y + W ETL+ KQR RWAQG Sbjct: 250 PLLIRLGKFREDSVTEDTDLWARAMIAGYRFWYYHGVIGWEEAVETLRDYIKQRSRWAQG 309 Query: 300 GAEVFLKNMTRLWRK-ENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTH 358 +V + + + R N + Y ++ + + I+ + + + Sbjct: 310 HLQVMIDHYWPVMRSCSNIIESFIEHFYMMSYLVPVFWFLSVILNSYLIITGAPPLSFAR 369 Query: 359 IAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFT 418 + I L S +E + HN ++ +P+ F + +A + + Sbjct: 370 PKLFLSVSIFTFLLFWFSVAYSNWVEKK-RHNYYVPWSFVALYPLYFMVFVIAGVIYTM- 427 Query: 419 RVMLMPKKQRARWVSPDR 436 R ++ R W R Sbjct: 428 RGLIRLLVGRLHWEKTKR 445 >UniRef50_C7PF18 Glycosyl transferase family 2 n=9 Tax=Bacteroidetes RepID=C7PF18_CHIPD Length = 486 Score = 230 bits (587), Expect = 8e-59, Method: Composition-based stats. Identities = 92/466 (19%), Positives = 170/466 (36%), Gaps = 61/466 (13%) Query: 27 GELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPS---ISIIIPC 83 G VF + + I + + V+ + + + L +P I+++ P Sbjct: 10 GRFYESTVFIYGTVLLITYALLAVFSMISVRAYARKDKFHQENILLSSPLAPGITVLAPA 69 Query: 84 FNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKT------------------------- 118 FNE + + + L Y E+I VNDGSTD + Sbjct: 70 FNEGLTIIFNVRSLLTLNYPLYEIIIVNDGSTDNSLEQMIDEFELVPVEFAYNAKIQTKP 129 Query: 119 -RAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIV 177 R I L VI + KA A+ G AA ++ VC D D +LD++ ++ Sbjct: 130 VRKIYKSTNPAYAKLMVIDKVNGKSKADAVNAGINAALYDHFVCTDVDCILDKNTLLELI 189 Query: 178 EPMLYNPRVGAVTGNP----------------RIRTRSTLVGKIQVGEYSSIIGLIKRTQ 221 +P++ + + R+R S L+ + Q EY L K Sbjct: 190 KPVMQEEKKRVIATGATLRIANSCEFDQGVMTRMRPPSQLLPRFQEVEYIRAFVLGKMGW 249 Query: 222 RIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLN------QWTIFYEPR 275 + V VSG + F + G + ED+++ ++ + I Y P+ Sbjct: 250 SLLNCVPNVSGGLGLFDKEIAIRSGGYDHSSFGEDMELMTRMCRYAKDNKIDYAIRYIPK 309 Query: 276 ALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEY-----CLTT 330 LCW P ++K +QR RWA+G A++ + + +M + F Y L Sbjct: 310 TLCWTEAPASVKIFNRQRTRWARGLAQLMYAHFGMFFNPRYGKMGLIIFPYNFFFELLAP 369 Query: 331 IWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHN 390 I FT ++ +I+ A L + L + +T +++ T+ +L ++ RY Sbjct: 370 IIEFTGIIYYIVMAC-LGLINWPTALLLLVFVYTYSVMITTIAILWDQLTF----RYYKT 424 Query: 391 LTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 + F++ + + +++ W + R Sbjct: 425 WKEVALLCTTPFLEFFLYHPLIVIFALRGYYYFLTGKKSTWGNMQR 470 >UniRef50_C4WPV5 Putative uncharacterized protein n=1 Tax=Ochrobactrum intermedium LMG 3301 RepID=C4WPV5_9RHIZ Length = 426 Score = 229 bits (585), Expect = 1e-58, Method: Composition-based stats. Identities = 86/362 (23%), Positives = 156/362 (43%), Gaps = 7/362 (1%) Query: 72 KDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPH 131 ++PS+S+I+ NEE +E + + Q +N E++ V+DGSTD+ + + Q Sbjct: 61 SEHPSVSVILVGHNEEDALEACVRSLHEQSIQNFEIVIVSDGSTDRMVEVAQSLVKQGLA 120 Query: 132 LRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTG 191 +V+ + GK+ + A A + ++ +D D DR A Y++ P +P VGAV G Sbjct: 121 TKVLSTSLRGGKSSGINLACAYASGDIIINVDCDCSFDRYAFEYLLRP-FSDPAVGAVCG 179 Query: 192 NPRIRT-RSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSD 250 + R ++++ + Q EY I + KR V SG +AFRR AL ++ + Sbjct: 180 DIAPRNSDASVISQFQEIEYLQSIAVGKRIASAVNQVVCASGAFSAFRRKALDDIRGFDV 239 Query: 251 DMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTR 310 ED+D + +L+L W I Y P A+C+ +P T+ +QRLRW + +++ Sbjct: 240 GG-GEDLDTTIRLRLKGWRIEYAPEAVCYTDVPTTVYQYIRQRLRWERDAI--WIRYRKH 296 Query: 311 LWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLC 370 + E + +VG I+ V L + + A I L Sbjct: 297 VQLMNPLSPAFRITEAVHQWDFLLFNVVGAFIFPVYLVWLFIQYGAFTPAILIAMQIGLL 356 Query: 371 TLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRAR 430 + +L S R H +L ++ + + + L+++ + R Sbjct: 357 AIDVLILAASAWCTGR--HVFLRNLLFLPGYSLFMTYVMRPVRLIAYIDEWVFSGSHRDN 414 Query: 431 WV 432 + Sbjct: 415 YT 416 >UniRef50_C7XH18 Glycosyltransferase n=1 Tax=Lactobacillus crispatus 125-2-CHN RepID=C7XH18_9LACO Length = 442 Score = 229 bits (585), Expect = 1e-58, Method: Composition-based stats. Identities = 89/405 (21%), Positives = 172/405 (42%), Gaps = 14/405 (3%) Query: 24 FHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPC 83 F L M F W I + + ++ H + P P P +SIIIP Sbjct: 2 FDQTVLQMGFWATWLLIPVIFEEIPMIINFIRLLHHHKIRKKTPLPDF--LPVLSIIIPT 59 Query: 84 FNEEKNVEETIHAALAQRYEN--IEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQ 141 +N K + + + Y + + V++GS D T + + P L + L Sbjct: 60 YNSSKTLYRCLKSLADSDYPKNLLGIFVVDNGSKDDTFQVFAKAQTDYPQLLMRWLTSPH 119 Query: 142 GKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNP-----RIR 196 GK+ AL A+S+Y++ ID D LD+ A IV+ NP + A+TG I+ Sbjct: 120 GKSSALNAAIYKARSQYIINIDSDGRLDKHALNNIVKKFSANPNIDALTGTILTDRHLIK 179 Query: 197 TRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITED 256 L+ + EY + + ++FT+SG +AF+R L +S D I+ED Sbjct: 180 KPHKLLKLNEYFEYCQTFLAGRNIENNSNHIFTLSGAFSAFKRETLIRTYLYSTDTISED 239 Query: 257 IDISWKLQLN-QWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKE 315 D++++++ N + TI + A+ + L+ QR RW +G EV K ++++ + Sbjct: 240 TDMTFQIRYNLKGTISFCDDAIFYSEPISGFSELYTQRQRWQRGELEVINKYLSQVLSWK 299 Query: 316 ----NFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCT 371 NF++ L ++ +T + FI+ + + L I + + + Sbjct: 300 QFFSNFQIRRLIIDHTVTFLKIIWMFAIFILLGFGYSWISLLFSYAVIYLLYLFLEISSS 359 Query: 372 LCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVS 416 +C+ ++ S E ++ +F ++++ ++ + + + Sbjct: 360 ICVYIYLGSFPEERKFYLKNIWMIFLLLFYNLLCSFIRCVGIINA 404 >UniRef50_C6WRK1 Glycosyl transferase family 2 n=1 Tax=Actinosynnema mirum DSM 43827 RepID=C6WRK1_ACTMD Length = 637 Score = 229 bits (584), Expect = 2e-58, Method: Composition-based stats. Identities = 95/362 (26%), Positives = 150/362 (41%), Gaps = 33/362 (9%) Query: 83 CFNEEKNVEETIHAALAQRYE-NIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQ-- 139 +NE + + + +A Y +EV+ V+DGS D T ++ + +P +RV+ Sbjct: 283 AYNEAVTITAAVRSLVASDYPAPVEVVVVDDGSADGTADVVRAL--DLPGVRVVTRENGG 340 Query: 140 -----NQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPR 194 N G A A + LV +DGD + + A +V P+ P VGAV+GN + Sbjct: 341 KAEALNTGVA--------LAGHDALVLVDGDTIFEPGTLAALVAPLGA-PGVGAVSGNVK 391 Query: 195 IRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMIT 254 + R L+G+ Q EY+S L ++ + + T+ G + AFRR AL EVG S D + Sbjct: 392 VANRRGLLGRWQHLEYTSGSNLDRQILNAWRCLPTIPGAVGAFRREALVEVGGVSSDTLA 451 Query: 255 EDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRK 314 ED D++ + W + YEP A W +P L+ L++QR RW+ G + K+ + + Sbjct: 452 EDTDVTMAITRAGWRVVYEPGARAWTEVPAGLRSLYRQRYRWSYGTFQAMWKHRVAV--R 509 Query: 315 ENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCL 374 E RM L Y L L V V + A + L Sbjct: 510 EQGRMGRLGLLYLLLFHLLLPLLA-----PVMDVYVLYGALVADAPAALPIWLAFLALQT 564 Query: 375 LQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSP 434 +L ++ L + FP+ T LV ++ RW S Sbjct: 565 ASAGYALRLDGESLRPLWA-------FPLQQLAYRQLTYLVVVQSLVTAAHGVPLRWQSI 617 Query: 435 DR 436 R Sbjct: 618 QR 619 >UniRef50_UPI0001C30EDA glycosyl transferase family 2 n=1 Tax=Conexibacter woesei DSM 14684 RepID=UPI0001C30EDA Length = 433 Score = 228 bits (583), Expect = 2e-58, Method: Composition-based stats. Identities = 93/371 (25%), Positives = 163/371 (43%), Gaps = 23/371 (6%) Query: 42 SIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQR 101 ++ I G V R P + P ++I++P FNEE V ++ + LA Sbjct: 39 ALFAIYGLVVVSYIVSRFVLSIPYRPPRDVGHEPHVAIVMPAFNEEAVVARSLRSLLALD 98 Query: 102 YE--NIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYL 159 Y +E++A++DGSTD+T A + +AA+ P +RVI L +NQGK A+ G A +E L Sbjct: 99 YPVDKLELVAIDDGSTDETLARMREVAAESPRVRVIELGRNQGKRAAMAAGMRATDAEIL 158 Query: 160 VCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRT-RSTLVGKIQVGEYSSIIGLIK 218 +D D+ LD DA +V+ P VGAV G+ + + + ++QV Y ++K Sbjct: 159 AFVDSDSSLDPDALHKLVQG-FAEPGVGAVCGHADVANVDDSWLTRMQVVRYFVAFQVVK 217 Query: 219 RTQRIYGNVFTVSGVIAAFRRSALA--------EVGYWSDDMITEDIDISWKLQLNQWTI 270 ++ I+ V SG AA+RR A+ + S +D ++ + L W + Sbjct: 218 ASESIFACVTCCSGCFAAYRRDAILPHLDWWEHQTFLRSPATFGDDRALT-NVVLRDWKV 276 Query: 271 FYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTT 330 Y A+ ++P T++ +Q+LRW + L +WRK + + Sbjct: 277 RYAANAVSHTVVPHTMRQFLRQQLRWKRSWTRESLILARFVWRKYPVAALSAYVGIAIAL 336 Query: 331 IWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHN 390 + I AV+ +E + AGI + + + V+ RY+ Sbjct: 337 VAP--------IVAVRAIVWLPLVEGRGAPLVYLAGIYMLAVAYGLYYVAC--RRRYDPR 386 Query: 391 LTSSLFWIIWF 401 + + ++ Sbjct: 387 WVYGIAFCFFY 397 >UniRef50_A5GEA8 Glycosyl transferase, family 2 n=3 Tax=Deltaproteobacteria RepID=A5GEA8_GEOUR Length = 492 Score = 228 bits (582), Expect = 3e-58, Method: Composition-based stats. Identities = 94/405 (23%), Positives = 157/405 (38%), Gaps = 25/405 (6%) Query: 44 MWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYE 103 +W++ +Y ER P AP ++ PS+++ +P +NE E + AA + Sbjct: 28 LWLIYCLYMPKGSERSTP----APFAAPEEFPSVTVQLPLYNERFVAERLLDAAAGLDWP 83 Query: 104 N--IEVIAVNDGSTDKTRAILDRMAA----QIPHLRVIHLAQNQG-KAIALKTGAAAAKS 156 +E+ ++D S D T ++D+ AA Q + V+ G KA AL G A A Sbjct: 84 RERLEIQVLDD-SDDDTCRLVDQRAAWWRKQGVAITVVRRTSRDGYKAGALANGLATAHG 142 Query: 157 EYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRS-TLVGKIQVGEYSSIIG 215 EY+ D D + D + P N VG V + IQ Sbjct: 143 EYIAVFDADFIPPPDFL-HATMPWFRNQDVGMVQTRWSFCNADHSWFTGIQSLLLGPHFS 201 Query: 216 LIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPR 275 + R + G F +G +RRSA+ G W D +TED+D+S++ QL W Y Sbjct: 202 IEHRVRYRQGLFFNFNGTAGVWRRSAIESAGGWQSDTVTEDLDLSYRAQLAGWRFVYREE 261 Query: 276 ALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLW--RKENFRMWPLFFEYCLTTIWA 333 +P T+ L Q+ RWA+G + K + RL R W Sbjct: 262 CQVPSELPVTMAALRSQQQRWAKGSIQTARKILPRLLQERLPPAVKIEAMAHLMANIYWL 321 Query: 334 FTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTS 393 +V +Y + I L + L + ++ + + I + S Sbjct: 322 LGMIVMLTLYPA--VTWRVGIGLHQVLRIDLPLFLATSGAIMSYFLLYSIRS----GSKS 375 Query: 394 SLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRGI 438 L ++ P + L+ + +L S + + P + R +P G+ Sbjct: 376 LLRHVVLLPALTIGLAPSISL-SVLKGLFRPGGEFER--TPKFGV 417 >UniRef50_C7PF17 Glycosyl transferase family 2 n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PF17_CHIPD Length = 498 Score = 227 bits (580), Expect = 5e-58, Method: Composition-based stats. Identities = 92/479 (19%), Positives = 163/479 (34%), Gaps = 88/479 (18%) Query: 33 FVFFW---PFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKN 89 FVF + FM + + + + +R + + Q P ISII P FNE Sbjct: 11 FVFVYGCTMLFMYALLAILSLRGIIRFQRKNSYVDYNKMLQSPLAPGISIIAPAFNEGVT 70 Query: 90 VEETIHAALAQRYENIEVIAVNDGSTDKTR-AILDRMAAQ-------------------- 128 + + + L Y EVI VNDGSTD T +++ Q Sbjct: 71 IISNVRSLLTLNYPRFEVIIVNDGSTDDTLTKLINEFQLQEVDFAYNERIKSQPVKRLFK 130 Query: 129 -----IPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYN 183 L VI + KA A G AA +Y +C D D ++++D +++P + Sbjct: 131 STNTAYDKLVVIDKVNGKSKADASNAGINAAAYDYFLCTDVDCIIEKDTLLRMIKPFMDE 190 Query: 184 ------------PRVGAVTGN--------------------------PRIRTRSTLVGKI 205 P G + R+R + + Sbjct: 191 EHNKIKEIGEPCPECGYIHVKEDSVRVIATGATLRIANSCEIDEGVITRVRPPEKWLPRF 250 Query: 206 QVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKL-- 263 Q EY L K + V VSG + F + + G + ED+DI ++ Sbjct: 251 QEMEYLRAYVLGKMGWSVINCVPNVSGGLGLFDKEIAIKAGGYDSKSFAEDMDIVTRMCT 310 Query: 264 ----QLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRM 319 ++ I Y P CW P +K +QR RW +G AE+ + ++ ++ Sbjct: 311 YMIDNKLKYAIRYIPTTQCWTEGPPNMKVFSRQRTRWGRGLAEIITIHRKVIFNPRYRKL 370 Query: 320 WPLFFEYCLTTIWAFTCL----VGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLL 375 + Y L + + + + IY + + + L + + +++ TL +L Sbjct: 371 GLVVLPYNLFFEFLAPIIEFTGILYYIYQIITGNINWHNALILLLFVYLYSVMITTLAVL 430 Query: 376 QFIVSLMIENRYEH-NLTSSLFWI----IWFPVIFWMLSLATTLVSFTRVMLMPKKQRA 429 + Y+H + + P ++ L + L + + K Sbjct: 431 ------WDQITYKHYKTWREVVGLALMAFIEPFVYHPLIVFFALRGYFNFITGKKHTWG 483 >UniRef50_Q0AV05 Glycosyltransferase n=1 Tax=Syntrophomonas wolfei subsp. wolfei str. Goettingen RepID=Q0AV05_SYNWW Length = 452 Score = 227 bits (579), Expect = 6e-58, Method: Composition-based stats. Identities = 93/416 (22%), Positives = 167/416 (40%), Gaps = 26/416 (6%) Query: 30 MMRFVFFWPFFMSIMWIV----GGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFN 85 ++ F FW ++ + +V VYF + E +L P +++IIP N Sbjct: 9 ILAFGLFWGIWLMVPLLVDVSTAMVYFISFLNNRKQKVEKK--EELFFFPYVTVIIPVHN 66 Query: 86 EEKNVEETIHAALAQRYEN--IEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLA-QNQG 142 + + + Q Y I+VI VN+GS D++ I R + P + V+ + + G Sbjct: 67 SADTLGHCLDSIARQTYPRESIQVICVNNGSEDESFDIFQRFQYRHPEMSVMWSSLERAG 126 Query: 143 KAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTR---- 198 K+IAL G + Y++ +D D LD DA +V +P + A TG+ R+ + Sbjct: 127 KSIALNAGVYSGHGSYMMNVDSDTWLDPDAILNVVRVFEKDPSLAAATGSIRVDKKLGEG 186 Query: 199 ---STLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITE 255 ++ +V EY + +R Q + ++FT+SG + FRR + + + ++E Sbjct: 187 SSFIDMINYCEVIEYIVAFDIGRRYQDLKNSIFTLSGAFSVFRRDIILQTFLYQTRTVSE 246 Query: 256 DIDISWKLQLN----QWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRL 311 D D+++ ++ + I + A+ ++ E L L+ QRLRW +G EV Sbjct: 247 DTDLTFNIRQAIEAGEGRIGFISDAIAYVEPIENLSRLYSQRLRWQRGEMEV----TGLY 302 Query: 312 WRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCT 371 + K LF I T + + L + G+ +C Sbjct: 303 YEKIPGIFGALFDFVGRILISDHTLAFSRLAWTFLLPFLFFLGYPLPTIVVAMIGMYICY 362 Query: 372 LCL--LQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPK 425 + L F ++ ++WII F I+ L L + K Sbjct: 363 IILESCTFYIAYKGSAPEFQKELQKIWWIILFMPIYRYLVYWFRLAGIIAALTEEK 418 >UniRef50_A7INQ0 Glycosyl transferase family 2 n=11 Tax=Rhizobiales RepID=A7INQ0_XANP2 Length = 905 Score = 226 bits (577), Expect = 1e-57, Method: Composition-based stats. Identities = 92/372 (24%), Positives = 149/372 (40%), Gaps = 13/372 (3%) Query: 22 AYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQ-LKDNPSISII 80 Y ++ V + + + + F R + A P P +SI Sbjct: 380 DYVTLALSVVMLVPLVFVLLYRIEEMAAIAFGSGPRRLIDARKAAVVPTVPSRFPKVSIH 439 Query: 81 IPCFNEEKN-VEETIHAALAQRYENIEVIAVNDGSTDKT--RAILDRMAAQIPHLRVIHL 137 +P + E +++TI A A Y N E I + + + D + + AA + I+ Sbjct: 440 VPAYREPPEMLKQTIDALAALEYPNFEAIIIINNTPDPAMVEPVREYCAALGERFKFINA 499 Query: 138 AQNQG-KAIALKTGAAAA--KSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPR 194 + G KA AL+ A +E + ID D ++ D +V P+ +P VG V Sbjct: 500 EKVAGFKAGALRIALDATAPDAEIIGVIDADYVVTPDWLKELV-PVFDDPTVGLVQAPQD 558 Query: 195 IRTRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMI 253 R +L+ + EY+ + QR + V G + RR+A+ E G WS D I Sbjct: 559 HRDADRSLLHEAMNAEYAGFFDIG-MVQRNEDDAIVVHGTMCLIRRAAMLEAGNWSSDTI 617 Query: 254 TEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWR 313 ED D+ + N W Y + + L+P++ + KQR RWA GG ++ K+ + Sbjct: 618 CEDTDLGLTIAENGWKTHYTRKRYGYGLLPDSFEAFKKQRHRWAYGGFQIIKKHWRKFLP 677 Query: 314 KENFRMWPLFFEYCLTTI-WAFTCLVGFIIYAVQLAGVPLNIELTHIAATH--TAGILLC 370 + + L I W + VG ++ LA VP + H T IL+ Sbjct: 678 NRSRLTTAQKRHFVLGWISWLGSESVGAVMAIASLAFVPFVLLFGVSVPAHVLTLPILIT 737 Query: 371 TLCLLQFIVSLM 382 L L VSL Sbjct: 738 FLVYLMHFVSLY 749 >UniRef50_C5SBP2 Glycosyl transferase family 2 n=1 Tax=Allochromatium vinosum DSM 180 RepID=C5SBP2_CHRVI Length = 481 Score = 225 bits (575), Expect = 2e-57, Method: Composition-based stats. Identities = 70/356 (19%), Positives = 137/356 (38%), Gaps = 12/356 (3%) Query: 37 WPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLK----DNPSISIIIPCFNEEKNVEE 92 + + ++ + V + P K ++P + + +P FNE VE Sbjct: 10 YYLILGLVILALCSLNLVALTLARRFMPGRPLSVAKLTDAEHPRVLVQLPLFNEGDLVER 69 Query: 93 TIHAALAQRYENIEV-IAVNDGSTDKTRAILDR----MAAQIPHLRVIHLAQNQG-KAIA 146 + A + + + I V D S D + AI R + Q ++ ++H Q KA A Sbjct: 70 ILEAVMDLDWPRDRLEIQVLDDSIDGSLAISQRAVAVLKQQGVNIELLHRVQRTAFKAGA 129 Query: 147 LKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNP-RIRTRSTLVGKI 205 L G +++ ++ D D + D V ++ NP + V + +L+ +I Sbjct: 130 LAAGLERSEAPFVAIFDADFIPPPDFLQRTVGALVANPDLAYVQTRWGHLNRDESLLTRI 189 Query: 206 QVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQL 265 Q S G+ + + G +G +RR+A+ E G W D +TED+D+S + L Sbjct: 190 QARLLDSHFGVEQEARWRLGLPLPFNGTCGLWRRAAIDEAGGWDGDTLTEDLDLSLRANL 249 Query: 266 NQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMW-PLFF 324 W + + +P + + Q+ RW +G + F+K +WR +W + Sbjct: 250 AGWRSGFMGDLVVPGSLPVSARAWRVQQFRWTKGFVQCFIKLTPLVWRSRRLPIWQKIMI 309 Query: 325 EYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVS 380 + + AF V +I + + +L ++ ++ Sbjct: 310 SFQIGQPLAFVVGVACLIMGLPFMAGAVVGGAGLSVVASVTSVLGFAAPIMFLTMA 365 >UniRef50_D2S4V1 Glycosyl transferase family 2 n=1 Tax=Geodermatophilus obscurus DSM 43160 RepID=D2S4V1_9ACTO Length = 483 Score = 225 bits (574), Expect = 3e-57, Method: Composition-based stats. Identities = 105/435 (24%), Positives = 169/435 (38%), Gaps = 19/435 (4%) Query: 18 PLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHW-------PWGEN----A 66 PL + + G M+ + + ++ V + F VY RH W EN A Sbjct: 39 PLVLLAYVCGFTMVALLHGLDETLGVVAGVLNIVFGVYFCRHLGFAVAAARWAENDMLAA 98 Query: 67 PAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYE--NIEVIAVNDGSTDKTRAILDR 124 PS+++ + C NEE V+ + A LA Y + ++ V+DGS D T A LD Sbjct: 99 DVGLHSYTPSVAVFVGCKNEELVVDGMVTALLALDYPADRLTLVVVDDGSDDTTGARLDA 158 Query: 125 MAAQIPHLRVIHLAQNQGKAI--ALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLY 182 AA+ P LRV+H G AL +E V D D D A +V Sbjct: 159 WAAREPRLRVLHRPPGSGGGKSGALNDALHLVDAEIAVVFDADHEPDPTALRRLVR-HFR 217 Query: 183 NPRVGAVTGNPRIRT-RSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSA 241 +P VGAV G IR R + + ++ S + + ++ + G A R S Sbjct: 218 DPVVGAVMGRCVIRNGRDSQMASTVFVDFLSGYLVNEYGRQALFELPAYGGANCAVRMST 277 Query: 242 LAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGA 301 L +G W+ +TED D++ ++ + + Y+ A+ + + + WKQR RWA+G Sbjct: 278 LRGLGGWNPHTVTEDTDLTLRVLMAGQRVRYDLSAVDFEEAVVSAQRFWKQRYRWARGHQ 337 Query: 302 EVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAA 361 + F L R + + ++ L G I + I + A Sbjct: 338 KCFRDYWRPLMRSPHLSLVEKVETTLFLLVYHVPLLSG--IGVLLTVLRAFGIGDLPLVA 395 Query: 362 THTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVM 421 +LL L + V L+I + L ++ F + W+ + A R Sbjct: 396 LLPLSMLLFVGPLAELSVGLLIGRVERRSAWRLLGFLPAFALSIWITTRAYVDGMLGRPY 455 Query: 422 LMPKKQRARWVSPDR 436 K R+ S R Sbjct: 456 TWVKTSRSGATSTVR 470 >UniRef50_A4CJ64 Glycosyltransferase n=15 Tax=Bacteroidetes RepID=A4CJ64_9FLAO Length = 494 Score = 225 bits (573), Expect = 3e-57, Method: Composition-based stats. Identities = 88/427 (20%), Positives = 170/427 (39%), Gaps = 16/427 (3%) Query: 17 IPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPS 76 + + +AYF + + + + +S + ++ + R + P K+ P Sbjct: 1 MGITIAYFIIAIYSLALLLIFFYSLSQLNLLLNYLGFKRRNKEAP---KFNLLDPKEIPY 57 Query: 77 ISIIIPCFNEEKNVEETIHAALAQRYEN--IEVIAVNDGSTDK----TRAILDRMAAQIP 130 ++I +P +NEE VE + Y +E+ ++D STD T A+++ + Q Sbjct: 58 VTIQLPIYNEEYVVERLLENIARIEYPKSKLEIQVLDD-STDDSVEQTAAMIEELQKQGL 116 Query: 131 HLRVIHLAQNQG-KAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAV 189 ++ I +G KA ALK G AK +++ D D L D D V + +G V Sbjct: 117 DIQHIRRENREGFKAGALKEGLKIAKGDFIAIFDADFLPDADWLKKTV-IYFKDEEIGVV 175 Query: 190 TGNP-RIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYW 248 I + + KIQ + L + + G+ +G +R+ + + G W Sbjct: 176 QTRWGHINRDYSTLTKIQAFALDAHFTLEQVGRNSKGHFINFNGTAGIWRKECILDAGNW 235 Query: 249 SDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNM 308 D +TED+D+S++ QL W Y +P + Q+ RW +GGAE F K + Sbjct: 236 EGDTLTEDLDLSYRAQLKNWKFKYLEDVETPAELPVVISAARSQQFRWNKGGAENFRKTV 295 Query: 309 TRLWRKENFRMWPLFFEYC-LTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGI 367 + + +N F L F C+ + ++ + + T+ Sbjct: 296 WNVVKAKNIPFKTKFHGVMHLLNSSMFLCVFIVALLSIPMLYIKNTFGHLDWIFEVTSFF 355 Query: 368 LLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQ 427 ++ T+ L F+ + + + I F+ ++L +L + V+ + Sbjct: 356 IVSTIIL--FVCYWFTYKSIQGSSFDNFVDYIKLFFTFFSVALGFSLHNTVAVLEGHMGK 413 Query: 428 RARWVSP 434 R+ +V Sbjct: 414 RSEFVRT 420 >UniRef50_Q7NLY9 Gll0980 protein n=1 Tax=Gloeobacter violaceus RepID=Q7NLY9_GLOVI Length = 804 Score = 225 bits (573), Expect = 3e-57, Method: Composition-based stats. Identities = 80/367 (21%), Positives = 151/367 (41%), Gaps = 22/367 (5%) Query: 37 WPFFMSIMWI--VGGVYFW--VYRERHWPWGENAPAPQLKDNPSISIIIPCFNEE-KNVE 91 +P + ++ I + + W + R+R P + + P + + +PC+ E + V Sbjct: 130 YPLLVLMLPIGVIASLEQWEVLTRKRWLRPRTVKPIGEREHYPKVCLQVPCYAEPPEVVT 189 Query: 92 ETIHAALAQRYENIEVIAVNDGSTDK-----TRAILDRMAAQIPHLRVIHLAQNQGKAIA 146 T+ A RY N EV+ +++ + D A +++ + V LA KA A Sbjct: 190 ATLDRLAALRYPNFEVMVIDNNTKDPNLWKPVEAYCEQLGERFRFFHVDPLA--GAKAGA 247 Query: 147 LKTGAAAAKS--EYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRT-RSTLVG 203 L E + ID D + D + ++ PR+G V R ++L Sbjct: 248 LNWAMERVAGDVEIIGVIDADYHAEPDFLSSLLA-HFDEPRMGFVQTPHDYRGWENSLYQ 306 Query: 204 KIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKL 263 ++ EY + + TV G + RR AL E G WS+ TED +++ ++ Sbjct: 307 RMCYWEYKTFFATTMPSLNEKDAGLTV-GTMCLIRRRALDEAGGWSEWCQTEDSELAIRI 365 Query: 264 QLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKE---NFRMW 320 +T + P+ L+PET G KQR RW G + F +++ K + Sbjct: 366 HALGYTSVFVPQTFGRGLIPETFAGYKKQRFRWTFGPVQEFKQHLRLFLPKPWAAPSELR 425 Query: 321 PL-FFEYCLTTIWAFTCLVGFIIYAVQLAGVP-LNIELTHIAATHTAGILLCTLCLLQFI 378 P+ + + FT + F++ V LA + + ++ +AA + I + ++ Sbjct: 426 PIQKLHHMNHGLGPFTTGLSFLMMPVNLAVLVSMLVQGESVAAPRSLWIAAVCATVAGWL 485 Query: 379 VSLMIEN 385 ++ +I Sbjct: 486 LTWLIHR 492 >UniRef50_C4KBM8 Glycosyl transferase family 2 n=6 Tax=Betaproteobacteria RepID=C4KBM8_THASP Length = 868 Score = 224 bits (571), Expect = 5e-57, Method: Composition-based stats. Identities = 89/420 (21%), Positives = 165/420 (39%), Gaps = 31/420 (7%) Query: 2 INRIVSFFILCLVLCIPLCVAYFHSGELMMRFVFFW--PFFMSIMWIVGGVYF--WVYRE 57 + ++ I L++ I + V Y+ + ++ V + + + G F ++++ Sbjct: 349 LAALIQACISTLIIGINVPVEYYLTQRDLIGLVLLIGATCMTAAVLLSHGFEFGEVLFKK 408 Query: 58 RHWPWGENAPAPQLKDNPSISIIIPCFNEEKN-VEETIHAALAQRYENIEVIAVNDGSTD 116 + P + P +SI + C+NE V TI + Y+N EV+ +++ + D Sbjct: 409 KWARRFTPLPPHPPEQQPFVSIHLACYNEPPEMVIATIDSLAQMNYQNFEVLILDNNTRD 468 Query: 117 KT--RAILDRMAAQIPHLRVIHLAQNQG-KAIALKTGAAAA--KSEYLVCIDGDALLDRD 171 + + + R A P R HLA G KA AL G ++E + +D D ++D D Sbjct: 469 EALWKPLERRCAELGPRFRFFHLANWPGFKAGALNYGLKVTDPRAEVVGVVDADYVVDPD 528 Query: 172 AAAYIVEPMLYNPRVGAVTGNPRIRT-RSTLVGKIQVGEYSSII--GLIKRTQRIYGNVF 228 A +V P P V V R ++ E+ G+ R +R N Sbjct: 529 WLACLV-PHFDQPEVAVVQAPQAHRDWEGQPFKRMCNWEFDGFFRIGMHHRNER---NAL 584 Query: 229 TVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKG 288 G + RR AL EVG WS+ I ED ++ +L + Y L L P Sbjct: 585 IQHGTMTMVRRRALEEVGGWSEWCICEDTELGLRLIEKGYDTRYIDHILGRGLTPSGFAA 644 Query: 289 LWKQRLRWAQGGAEVFLKNMTRLW---------RKENFRMWPLFFEYCLTTIWAFTCLVG 339 + QR RWA G ++ ++ + R W + L ++AF G Sbjct: 645 IKSQRFRWAFGAMQILKAHLPHMIGRSTLNLAQRYHFLTGWFAWLGDALQLVFAF----G 700 Query: 340 FIIYAVQLAGVPLNIELTHIAATH-TAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWI 398 +++ + + P L ++ G ++ L + ++ ++ L +S+ + Sbjct: 701 SLLWTLGILLFPKAFGLPVVSLALPIFGFMIFKAALGPILYRRTMDCPWKDILGASILSV 760 >UniRef50_D2S449 Glycosyl transferase family 2 n=1 Tax=Geodermatophilus obscurus DSM 43160 RepID=D2S449_9ACTO Length = 484 Score = 224 bits (571), Expect = 5e-57, Method: Composition-based stats. Identities = 70/288 (24%), Positives = 115/288 (39%), Gaps = 46/288 (15%) Query: 74 NPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRM-------- 125 P +S++IP +EE + +T+ A LAQRY +EV+ V+DGSTD+T ++ Sbjct: 59 TPGVSVLIPAHDEEAGILDTLAAVLAQRYPVLEVVLVDDGSTDRTFDMVAERFGLVQVTA 118 Query: 126 -----------------AAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALL 168 A L V+ ++ AL A+ + +D D+LL Sbjct: 119 RPTAALPVDGTVLSVHRATTGDPLTVVRKTSVGRRSDALNAALNVARHPLVCMVDADSLL 178 Query: 169 DRDAAAYIVEPMLYNPRVGAVTGNP---------------RIRTRSTLVGKIQVGEYSSI 213 + +A + P + +P +G R S V + QV EY Sbjct: 179 EPEALVQVARPFVEDPANVVASGGVIRAANGARVARGSVLEPRLSSRWVVRFQVVEYLRS 238 Query: 214 IGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDIS------WKLQLNQ 267 L + + +SG FRR + EVG + ED ++ + Q Sbjct: 239 FLLGRTGWADANALLIISGAFGLFRRDLVVEVGGLDALSLAEDAELVVTVQEHLRRQRRP 298 Query: 268 WTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKE 315 + + P +CW +PETL L +QR RW+ G A++ K+ + Sbjct: 299 HRVVFVPEPVCWTEVPETLAVLGRQRRRWSHGLAQLLWKHRRMIGNPR 346 >UniRef50_Q0IE23 Glycosyl transferase, group 2 family protein n=19 Tax=cellular organisms RepID=Q0IE23_SYNS3 Length = 467 Score = 224 bits (571), Expect = 6e-57, Method: Composition-based stats. Identities = 74/302 (24%), Positives = 128/302 (42%), Gaps = 12/302 (3%) Query: 63 GENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEV--IAVNDGSTDKTRA 120 E A L P++ +++ +EE V + A RY ++ ++DGS DKT Sbjct: 93 AEPAVVESLASWPNVDVVVAARDEEAVVARLMERLGALRYPADQLTTWIIDDGSEDKTPD 152 Query: 121 ILDRMAAQIPHLRVIHLAQN--QGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVE 178 +LD + Q P+L VI +N GK+ AL A + E+++ +D D + D ++ Sbjct: 153 LLDELKPQYPNLNVIRRPRNAGGGKSGALNAALAQSSGEWILVLDADGQISEDQLERLIP 212 Query: 179 PMLYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFR 238 + G + L+ ++Q E + LI++ + + G V + G R Sbjct: 213 IAVMGDWSGVQMRKAVTNADTNLLTRLQAMEMA-FDALIQQGRLLGGGVSELRGNGQLLR 271 Query: 239 RSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQ 298 R L G +++D +T+D+D+S++ L ET+ LWKQR RWA+ Sbjct: 272 RDVLEACGGFNEDTVTDDLDLSFRFLLQGARTTLLWNPPVREEAVETISALWKQRQRWAE 331 Query: 299 GGAEVFLKNMTRLWR------KENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPL 352 GG + F L + +Y L + +F L+ + A PL Sbjct: 332 GGLQRFFDYWPGLLSNRLTVGQRRDLACFFLLQYALPVL-SFADLLTSVFTRTSPAYWPL 390 Query: 353 NI 354 +I Sbjct: 391 SI 392 >UniRef50_UPI00017891C0 glycosyl transferase family 2 n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI00017891C0 Length = 450 Score = 224 bits (571), Expect = 6e-57, Method: Composition-based stats. Identities = 94/423 (22%), Positives = 175/423 (41%), Gaps = 53/423 (12%) Query: 35 FFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETI 94 +F+ + + +Y ++ N+ + SI++++P +NEE + I Sbjct: 3 YFFYGLAGVFLVFQALYTFIPLLCSKVKKLNSDLAEK----SITVLVPAYNEELTIRNCI 58 Query: 95 HAALAQRYENIEVIAVNDGSTDKTRAILDRM------------------------AAQIP 130 A RY N E++ +NDGS D T + LD + + + Sbjct: 59 DAMAGLRYSNYEIMIINDGSKDGTLSALDELLDLEPSARTADSKLNYKSIKGFYRSNRYR 118 Query: 131 HLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVT 190 H+ VI N GKA +L G A S+ ++ +D D++L+ D+ Y V ++ + A+ Sbjct: 119 HIYVID-KMNGGKADSLNAGIDYAVSDIVITLDADSMLEADSLKY-VNQYFHDDDIIALG 176 Query: 191 GNPRI--------------RTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAA 236 G +I + R + K Q+ Y+ + K TQ + ++ +SG A Sbjct: 177 GTVKIVQGAVRDKNGAVVEKFRGKGLIKSQIINYTHGFYVRKLTQSYFNSIVVISGAFGA 236 Query: 237 FRRSALAEVGYWSDDMITEDIDISWKL------QLNQWTIFYEPRALCWILMPETLKGLW 290 F + L V + + EDIDI+ K+ + + Y P A+C+ PE L+ + Sbjct: 237 FYKEILVHVDGFR-STVGEDIDITLKIHEYIKAHRLKKKLVYAPEAVCYTECPENLQNFY 295 Query: 291 KQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFF--EYCLTTIWAFTCLVGFIIYAVQLA 348 KQR+RW + + L +RL RK + + F + L T+ AFT + V Sbjct: 296 KQRIRWQKAFIDCILIYWSRLSRKFSVSVSLFFAIDGFMLGTLSAFTTIFYLGQTLVVGG 355 Query: 349 GVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWML 408 + I ++ A ++ +L L + S + Y S ++ + + + Sbjct: 356 NLWHAIIFLLMSVVLNAAQIVISLYLCRKYGSTYAFSDYVRMFLFSQAELLTYRNLLLYI 415 Query: 409 SLA 411 ++A Sbjct: 416 NIA 418 >UniRef50_C7M4N0 Glycosyl transferase family 2 n=4 Tax=Flavobacteria RepID=C7M4N0_CAPOD Length = 494 Score = 223 bits (570), Expect = 7e-57, Method: Composition-based stats. Identities = 85/388 (21%), Positives = 157/388 (40%), Gaps = 14/388 (3%) Query: 58 RHWPWGENAPAPQL---KDNPSISIIIPCFNEEKNVEETIHAALAQRYE--NIEVIAVND 112 +H + AP L K+ P ++I +P +NE+ V + Y +E+ ++D Sbjct: 36 KHRKQNDEAPKFNLLDPKEIPYVTIQLPLYNEKYVVRRLLENIAKLEYPRTKLEIQVLDD 95 Query: 113 GSTDK----TRAILDRMAAQIPHLRVIHLAQNQG-KAIALKTGAAAAKSEYLVCIDGDAL 167 STD+ T I+ + A ++ I +G KA ALK G A AK +++ D D + Sbjct: 96 -STDESVAETAEIIKELQATGLDIKHIRRTNREGFKAGALKEGLAIAKGDFVAIFDADFM 154 Query: 168 LDRDAAAYIVEPMLYNPRVGAVTGNP-RIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGN 226 D V +P +G V I +++ KIQ L + + G Sbjct: 155 PQPDWLKRTV-VYFKDPEIGVVQTRWGHINRNYSILTKIQALALDVHFTLEQVGRNSKGY 213 Query: 227 VFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETL 286 +G +R++ + + G W D +TED+D+S++ QL W Y +P + Sbjct: 214 FINFNGTAGIWRKACIYDAGNWEGDTLTEDLDLSYRAQLKNWKFKYLEDVETPAELPVVI 273 Query: 287 KGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQ 346 Q+ RW +GGAE F K ++R+ +N W F + + + L F++ + Sbjct: 274 SAARSQQFRWNKGGAENFRKTVSRVLAAKNIG-WKTKFHGVMHLLNSSMFLWVFVVSVLS 332 Query: 347 LAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFW 406 + + + + H+A ++ FI + I V F+ Sbjct: 333 IPMLYIKNQYGHLAWIFHITSFFIISTVILFICYWFTYKNLQGKTFDDFLNYIKLFVTFF 392 Query: 407 MLSLATTLVSFTRVMLMPKKQRARWVSP 434 ++L + + V+ +R+ +V Sbjct: 393 SVALGFSFHNTVAVLEGHTGKRSEFVRT 420 >UniRef50_UPI000038DF5C N-acetylglucosaminyltransferase n=1 Tax=Ferroplasma acidarmanus fer1 RepID=UPI000038DF5C Length = 405 Score = 223 bits (570), Expect = 8e-57, Method: Composition-based stats. Identities = 77/407 (18%), Positives = 147/407 (36%), Gaps = 18/407 (4%) Query: 30 MMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKN 89 ++ + + + G F Y E + P +SII+P NEE Sbjct: 11 LIAIGLVYVIYQFPIIYFGYKDFTKYDIDFSKLNEAQFSGLKMYKPMVSIIVPAKNEETV 70 Query: 90 VEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKT 149 ++ TI + L Q Y N E+ V D S+D T I ++ + V + + KA AL Sbjct: 71 IKRTIESILNQTYTNFELFVVVDNSSDNTYKIAKEYESRDKRVNVFNRPDGKSKASALNF 130 Query: 150 GAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGE 209 K E + D D +L + V M Y N I + ++ V + Sbjct: 131 CFEKTKGEVIATYDADTMLLPNTLENAVYGMNYFNVDVLQGYNSYINREENIFTRLAVID 190 Query: 210 YSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWT 269 ++ + + V+G F+R + +G W D+ +TED++ S ++ ++ Sbjct: 191 -EILVKATLIGRTHFNLFVPVAGSNQYFKRKVIESIGGWDDNFLTEDLESSIRISNARYK 249 Query: 270 IFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLT 329 Y A P + ++QR RW +G +VF + R + +F + + Sbjct: 250 SAYLGSAKALQETPASYSEYFRQRTRWLRGYHQVFFHSKKRFSKFTDFDALMIVLAPTFS 309 Query: 330 TIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEH 389 I F L ++ + + ++ +V ++I+ R Sbjct: 310 GILFFGWLYISLLNFYNPFVHSMRTYFISLILISLIIYVVA-------LVLVLIKKRQNF 362 Query: 390 NLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 I+ P+I+ L +L++ + L + W + Sbjct: 363 ---------IYIPLIYIYL-TLNSLIAIYTLFLEITGAKRVWHKVKK 399 >UniRef50_C2EFI6 Glycosyltransferase n=1 Tax=Lactobacillus salivarius ATCC 11741 RepID=C2EFI6_9LACO Length = 444 Score = 221 bits (564), Expect = 4e-56, Method: Composition-based stats. Identities = 84/424 (19%), Positives = 172/424 (40%), Gaps = 28/424 (6%) Query: 24 FHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPC 83 F+ + M F W +I + + + H+ + P +++I+P Sbjct: 2 FNQVVIQMGFWTAWLVIPAIFELSPAI--VAFFRNHFSKISSKQEYHPGRMPQLTVIVPI 59 Query: 84 FNEEKNVEETIHAALAQRYEN--IEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQ 141 +N + + + + Y I +I N+ STD + + + ++R+ L NQ Sbjct: 60 YNSGDTLYDCLKSIAESNYPKHLISIICANNKSTDNSFEVYKQAQMDFRNVRMQWLEANQ 119 Query: 142 GKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRT---- 197 GKA AL T + +Y+ ID D L +DA + +P V A TG Sbjct: 120 GKAKALNTAIYGSTGDYVFTIDSDGTLSKDALLNMAVNFNKHPEVMAQTGTILSNNELIK 179 Query: 198 ---RSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMIT 254 + L+ + EY+ + + +FT+SG +AFRRSAL + ++ + + Sbjct: 180 NSKKHRLLRLEEYYEYAIAFLAGRTVESKDDQIFTMSGAFSAFRRSALMQSRLYNVETVG 239 Query: 255 EDIDISWKLQ-LNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLK------N 307 ED D++++++ + + P A+ ++ L+ QR RW +G EV N Sbjct: 240 EDTDMTFQIRYYLKGRVMLCPNAIFYVEPISDWDELYVQRQRWQRGEIEVIRTFLSEKLN 299 Query: 308 MTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGI 367 + R+W NF + L ++ + + F++ + + + + L + + Sbjct: 300 LKRIW--SNFIVRRLLVDHTVAFLKVIWLFAIFVLIPFGYSPILIVMSLLLMYLLYLFIG 357 Query: 368 LLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQ 427 L ++ ++ IE +Y N +W+ + I+ ++ ++ ML + Sbjct: 358 FLNFTNVMHYLKFDPIERKYFRNH----WWVTFMMPIYNLIVSFFRVMGVINTML----K 409 Query: 428 RARW 431 +W Sbjct: 410 SGKW 413 >UniRef50_B2IY03 Glycosyl transferase, family 2 n=1 Tax=Nostoc punctiforme PCC 73102 RepID=B2IY03_NOSP7 Length = 427 Score = 221 bits (563), Expect = 5e-56, Method: Composition-based stats. Identities = 83/399 (20%), Positives = 158/399 (39%), Gaps = 27/399 (6%) Query: 45 WIVGGVYFWVYRERHWPWGENAPAPQLKDN---PSISIIIPCFNEEKNVEETIHAALAQR 101 W++ G+ + + A ++ P + I++P NE + + + Sbjct: 42 WLMLGLTAILTVQTLRMLVAKPDAAVMESKIYLPPVLILVPAKNESSVLANLVDSLYQLN 101 Query: 102 YEN--IEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYL 159 Y + +++ V+DGSTD+T IL + Q P L+V GK+ AL + E + Sbjct: 102 YPSNSLDIWIVDDGSTDETPEILRELQTQFPSLQVHRRESKGGKSGALNAVFPFTQGEII 161 Query: 160 VCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRT-RSTLVGKIQVGEYSSIIGLIK 218 + D DA L + V P+ +GAV I + + + Q E ++ Sbjct: 162 LVCDADAQLPVNFLQQTV-PLFQKQAIGAVQVRKTISNANTNFLTRCQQMEMCC-DSFLQ 219 Query: 219 RTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALC 278 + G + + G RR L + W+++ +T+D+D+ +KL L I + Sbjct: 220 THRIAIGGMSELRGNGMLVRRELLEKCQGWNENTVTDDLDLCFKLYLAGAEIEFVTVPSI 279 Query: 279 WILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLV 338 + + LW QR RWA+GG + +L ++ W + L + F + Sbjct: 280 QEESVTSWEKLWHQRCRWAEGGYQRYLDYFPQILTLG----WAKEIDLLLFFLLQFLLPI 335 Query: 339 GFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWI 398 G I + I +H LL + + FI L Y+ + + Sbjct: 336 GLI------PDLLWTIFYSHHPVLFPLQTLLSIILTIAFIAGL-----YQFQNLRG-WSL 383 Query: 399 IWFPVI--FWMLSLATTLVSFTRVMLMPKKQRARWVSPD 435 +W + F+M+ ++ T + + +K+R+ WV + Sbjct: 384 LWATIQGSFYMVHWIPVMI-VTTLKMCVQKERSHWVKTE 421 >UniRef50_Q6KZU9 N-acetylglucosaminyltransferase n=1 Tax=Picrophilus torridus RepID=Q6KZU9_PICTO Length = 395 Score = 220 bits (561), Expect = 8e-56, Method: Composition-based stats. Identities = 76/403 (18%), Positives = 148/403 (36%), Gaps = 20/403 (4%) Query: 34 VFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEET 93 + + I++ + + H+ + P +SII+P NEE + Sbjct: 7 IGILLIVVGIVYSIYQFPILLIGYLHF-HDYDIDFDWDNYKPLVSIIVPAKNEETVIGRC 65 Query: 94 IHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAA 153 I + L Q Y+N E+ V D S D T I + + V N KA AL + Sbjct: 66 IESILGQAYDNFELFVVVDNSDDDTYRIAKSY-ERDGRVHVFERHGNLTKASALNYAYSM 124 Query: 154 AKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSI 213 + E + D D +L+++ V M Y N I + ++ + I Sbjct: 125 SHGEIIATYDADTVLEKNTLKNAVYGMRYMDADVLQGYNTYINREENIFTRLAAID-EII 183 Query: 214 IGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYE 273 + + + V+G F+R + +G W+ + +TED++ ++ + Y Sbjct: 184 VKVSMIGRMYLHLFVPVAGSNQYFKRETIRIIGGWNGNFLTEDLESGVRMAAKRMRSAYL 243 Query: 274 PRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWA 333 P A + P T KQR+RW +G +V L + L + + + I Sbjct: 244 PSAKVYQETPATYSEYIKQRIRWLRGYHQVLLHSKKELSGLSGLDILMIVLAPTFSGILL 303 Query: 334 FTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTS 393 F+ ++ ++ + H T+ +L + L + + E+ L Sbjct: 304 FS--------SIYISILNFYNPYVHSMRTYFISLLFIFFMIYIIAFVLALIKKKENALYI 355 Query: 394 SLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 L +I + +L+ ++ +F +L K+ W + Sbjct: 356 PLVYI------YVVLNAIISIYTFFLEILGVKR---VWYKVKK 389 >UniRef50_C7NQ00 Glycosyl transferase family 2 n=1 Tax=Halorhabdus utahensis DSM 12940 RepID=C7NQ00_HALUD Length = 475 Score = 220 bits (560), Expect = 9e-56, Method: Composition-based stats. Identities = 95/465 (20%), Positives = 165/465 (35%), Gaps = 60/465 (12%) Query: 30 MMRFVFFWPFFMSIMWIVGGVYFWVYRE---RHWPWGENAPAPQLKDNPSISIIIPCFNE 86 FV + +S+ ++ V + W P I I++P +NE Sbjct: 15 FGVFVVGYYALISVGYLFLHVLALFELREDVKESRWDPPFRKFSSPFYPGIGIVVPAYNE 74 Query: 87 EKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDR------MAAQIPHLR------- 133 E + E++ + L+ Y EV+ VNDGSTD T L + A+IP Sbjct: 75 EATIVESVRSMLSLNYPETEVVVVNDGSTDATLDRLREQFDLEAIDAEIPFDVPAEEIHG 134 Query: 134 ----------VIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYN 183 ++ N GK+ AL G +D D ++DRDA IV P L Sbjct: 135 VYRSTTDEKLLVVDKDNGGKSDALNAGIWLTDMPLFCAVDADTIIDRDALLQIVTPFLKE 194 Query: 184 PRVGAVTGNP---------------RIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVF 228 P +G + T + +QV EY + + Sbjct: 195 PTTAVASGGVIRVANDCTVEDGIVTDVSLPKTGLPGLQVMEYLRAFYSGRLGLNRINGLI 254 Query: 229 TVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQL------NQWTIFYEPRALCWILM 282 +SG F+ + E+G + D ITED DI +L ++T+ + P + W + Sbjct: 255 LISGAFGLFQTETVREIGGYRHDTITEDFDIVVRLHKHLTEQDREYTVDFVPEPVAWTEV 314 Query: 283 PETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFII 342 P + + L +QR RW +G E + N L+ + R+ + + + +G +I Sbjct: 315 PSSRRVLSRQRRRWYRGMMETVVTNRKMLFNRAYGRVGTVVMPF-----FVAAETLGPLI 369 Query: 343 YAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQF------IVSLMIENRYEHNLTSSLF 396 + +P+ + T L T F + NRY+ + Sbjct: 370 EGLGYVVLPVAWYFGALDVTFVLMFFLLTTGYGVFLSWFGVFSEVWSFNRYDSPW--QVL 427 Query: 397 WIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRGILRG 441 ++W+ V+ + R ++ + W +R G Sbjct: 428 RLLWYGVLENFGYRQWKTIVAWRGLIEYLRGVETWGVMERSGFAG 472 >UniRef50_C9RLY4 Glycosyl transferase family 2 n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RLY4_FIBSS Length = 517 Score = 220 bits (560), Expect = 1e-55, Method: Composition-based stats. Identities = 80/385 (20%), Positives = 151/385 (39%), Gaps = 15/385 (3%) Query: 11 LCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQ 70 + +L + V Y +G ++ + F + SI + +R Sbjct: 1 MSTILLYAMFVVYVIAGVGLVIYGFS--CYYSIYLFLKNSRKTRLSDRKAILKYYR-EHS 57 Query: 71 LKDNPSISIIIPCFNEEKNVEETIHAALAQRYEN--IEVIAVNDGSTDKTRAI----LDR 124 L D P ++ +P FNE VE + A A Y E+ ++D STD+ + + Sbjct: 58 LADLPQVTTQLPVFNEANCVERLLEAVCAIDYPKDKHEIQVLDD-STDECYEVTKKKVAE 116 Query: 125 MAAQIPHLRVIHLAQNQG-KAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYN 183 +AA+ +++IH + KA ALK G A AK E+L D D + ++D V ++ + Sbjct: 117 LAARGYDIKLIHRTNRKDFKAGALKEGMAVAKGEFLAIFDADFVPEKDFLLKTVPYLVMD 176 Query: 184 PRVGAVTGNP-RIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSAL 242 P+VG V G + + + Q + + + +G +R+ A+ Sbjct: 177 PQVGLVQGRWGHLNRTESGLTLAQSIGIDGHFVIEQSARSWGKLFMNFNGTAGVWRKDAI 236 Query: 243 AEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAE 302 G W D +TED+D+S++ QL W + + + +P + Q+ RWA+G + Sbjct: 237 YGGGGWEGDTLTEDMDLSYRSQLAGWKMKFVFDVIVPAELPNDINAFKAQQFRWAKGSIQ 296 Query: 303 VFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAAT 362 +K + ++ R + + T C++ + + H+ Sbjct: 297 TAIKILPKVLRSKVPLRVKIGAILHTTHYSIHPCML--FTALCAWPLLAFFEPVGHLPTW 354 Query: 363 -HTAGILLCTLCLLQFIVSLMIENR 386 +T G L + V + R Sbjct: 355 AYTVGFAFIFLAAIAPSVLYFVAQR 379 >UniRef50_Q2JKN6 Glycosyl transferase, group 2 family protein n=2 Tax=Synechococcus RepID=Q2JKN6_SYNJB Length = 493 Score = 219 bits (559), Expect = 1e-55, Method: Composition-based stats. Identities = 87/374 (23%), Positives = 148/374 (39%), Gaps = 28/374 (7%) Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRYE--NIEVIAVNDGSTDKTRAILDRMAAQIPHL 132 P ++++IP NE + +H+ RY ++E+ A++D S+D T +L IPHL Sbjct: 129 PRVAVLIPAKNESAVLPRLLHSLTQLRYPTSHLELWAIDDNSSDATPEVLREAQKWIPHL 188 Query: 133 RVIHLA--QNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPM---------L 181 RV + GK+ AL + E ++ D DA++ D A + Sbjct: 189 RVYRRQPGRGGGKSGALNEVLPLTQGEIILVCDADAVVPSDFLARTLPLFVQVGSLRSRF 248 Query: 182 YNPRVGAVTGNPRIRTRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRS 240 VGAV + S QV E +S ++ + + + G RR Sbjct: 249 SRRTVGAVQVRKALSNPSVNFWTLGQVAEMAS-DAYFQQQRVAVRGIGELRGNGQLVRRD 307 Query: 241 ALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGG 300 L + G W++ +T+D+D+++KL L I + P + K LW QR RWA+GG Sbjct: 308 VLEKCGGWNEATLTDDLDLTFKLHLAGVDIAFLPEPAIVEEGVTSWKSLWHQRCRWAEGG 367 Query: 301 AEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIA 360 + +L + + + +WAF Y + +A VP + + Sbjct: 368 YQRYLDYWPGILGRRMGMAKTV-------DLWAFFIS----QYLLPMALVPDTLWVLLTG 416 Query: 361 ATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRV 420 + + L L + V+ R L W + +ML +++ T Sbjct: 417 HSSVL-LPLNALVMGCLTVAFYRGLRQVEGLRGRRLWWHTALGLVYMLHWLPVMIATTAR 475 Query: 421 MLMPKKQRARWVSP 434 M + K R RWV Sbjct: 476 MCVQPK-RLRWVKT 488 >UniRef50_B9MDW0 Glycosyl transferase family 2 n=1 Tax=Diaphorobacter sp. TPSY RepID=B9MDW0_DIAST Length = 441 Score = 219 bits (558), Expect = 2e-55, Method: Composition-based stats. Identities = 90/435 (20%), Positives = 165/435 (37%), Gaps = 22/435 (5%) Query: 14 VLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKD 73 V A + + F+PF + + + + + E+ P Q + Sbjct: 6 VFAFATLFASSSQQNWVELALKFFPFVV-FLEAPFFLLVTAGMVK-YGLREHRPLRQREA 63 Query: 74 NPSISIIIPCFNEEKNVEETIHAALAQRYENI-EVIAVNDGSTDK--TRAILDRMAA--- 127 P +S +I C++E ++V +TI + Q Y E+IAV DG+ T R + Sbjct: 64 YPRVSCVITCYSEGEDVGKTIGSLAQQLYPGFIEIIAVIDGALQNGPTLRAARRAQSLLA 123 Query: 128 --QIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPR 185 Q L V+ Q G+ +L G + A E ++ +DGD D D +P Sbjct: 124 GTQRRSLVVLPKWQRGGRVSSLNAGLSIATGEVVMALDGDTSFDNDMVRNATR-HFDDPG 182 Query: 186 VGAVTGNPRIRT-RSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAE 244 V V GN R+R + +LV ++Q EY IG K + V +SG FR L Sbjct: 183 VVGVAGNLRVRNAKRSLVTRLQALEYMLSIGAGKTGLSEFNIVNNISGAFGVFRTRFLRN 242 Query: 245 VGYWSDDMITEDIDISWKLQLN-----QWTIFYEPRALCWILMPETLKGLWKQRLRWAQG 299 +G W ED+D++ +++ I ++P A+ P+T + ++QRLRW Sbjct: 243 LGGWDAGT-AEDLDMTMRIKQYFGRHPDLRIVFDPHAVGHTDAPDTWRVFFRQRLRWDGD 301 Query: 300 GAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHI 359 +F++ R W F + + L I+ + + + Sbjct: 302 MFYIFIRKFRFNLRPR-LLGWRNFLFVIVNGLLMQLVLPFLIVAYTGTMLFTMPLGVVMG 360 Query: 360 AATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTR 419 L L + + + R ++ L ++ FP+ ++ + + Sbjct: 361 VLAFIYLAYLAALLFYFLLYVVAVSERPRDDVW-YLGFLPLFPLFAFVNRIHCSFSIMAE 419 Query: 420 VMLMPKKQRARWVSP 434 + + K ++P Sbjct: 420 MFM--KSHLDSSMAP 432 >UniRef50_C9KN46 Chitin synthase n=1 Tax=Mitsuokella multacida DSM 20544 RepID=C9KN46_9FIRM Length = 628 Score = 219 bits (558), Expect = 2e-55, Method: Composition-based stats. Identities = 95/440 (21%), Positives = 162/440 (36%), Gaps = 59/440 (13%) Query: 5 IVSFFILCLVLCIPL----CVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHW 60 + SFF+ +V I L V YF + + F+ ++ G Y Sbjct: 179 VASFFLAVIVFVIVLMRAESVIYFQFNRWLYLYSIIAATFLLTRYLFGSFY--------- 229 Query: 61 PWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYE--NIEVIAVNDGSTDKT 118 P + II+PCFNEEK ++ TI + Q Y +EVI V+D S D + Sbjct: 230 ----RPTKIDPDYTPGVMIIVPCFNEEKWIQHTILGCINQDYPIDKLEVIVVDDCSNDHS 285 Query: 119 ----RAILDRMAAQI---------PHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGD 165 + I++R+ L N+GK A+ GA AK E LV +D D Sbjct: 286 VDKIKEIIERLKQSDGDQKMYRVEDRLHYYVQPVNKGKREAMAVGAHMAKHELLVFVDSD 345 Query: 166 ALLDRDAAAYIVEPMLYNPRVGAVTGNPRI-RTRSTLVGKIQVGEYSSIIGLIKRTQRIY 224 + LD A IV+P + ++G V+G + T + + K+Q Y ++K + + Sbjct: 346 SFLDPYAVRNIVQP-FKDKKMGGVSGRTDVANTYTNSLTKMQAVRYYIAFRIMKAAEGYF 404 Query: 225 GNVFTVSGVIAAFRRSALAEVGYWSDD----------MITEDIDISWKLQLNQWTIFYEP 274 V +SG ++ +R+ + + Y D +D ++ + L Y+ Sbjct: 405 DAVTCLSGPLSCYRKDLVLK--YCDDWLNQKFLGQRATFGDDRSMTNFI-LRHHRTTYQD 461 Query: 275 RALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAF 334 A+C ++P++ K +Q++RW + + +W+KE F + + Sbjct: 462 TAVCMTIVPKSHKMFLRQQMRWKRSWLRESIIAARYMWKKEPFMALSFYMGLLVPIAAPI 521 Query: 335 TCLVGFIIYAVQLAGVPLNIELTHIAATH------------TAGILLCTLCLLQFIVSLM 382 L I + PL + + T I CL V L Sbjct: 522 IVLYNLIYIPIMHRVFPLTFLVGMLMMALLMSMAQLFLRRSTTWIFGVWFCLYYEAVLLW 581 Query: 383 IENRYEHNLTSSLFWIIWFP 402 S + P Sbjct: 582 QMPVAWFTFWKSTWGTRLTP 601 >UniRef50_A0Q541 Glycosyl transferase n=3 Tax=Francisella novicida RepID=A0Q541_FRATN Length = 438 Score = 218 bits (557), Expect = 2e-55, Method: Composition-based stats. Identities = 100/450 (22%), Positives = 184/450 (40%), Gaps = 42/450 (9%) Query: 5 IVSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYF--WVYRERHWPW 62 IV F L + F S + + P+ + I V G YF V + P+ Sbjct: 3 IVYVFYQQLWMIFSHPRELFVS-VFAVALLLETPYTLFIFLGVLGAYFKQIVNITKRAPY 61 Query: 63 GENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYE-NIEVIAVNDGST--DKTR 119 P +S II ++E K++ T+ + Q Y+ +IEV+ + DG+ KT Sbjct: 62 -----------YPKVSCIITAYSEGKDILLTLKSLEEQLYKGHIEVLILIDGADVNKKTL 110 Query: 120 AILDRMAAQI-----PHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAA 174 + R L+VI G A + G + E ++ +DGD +D D + Sbjct: 111 DVAMRYKKSFVANNKRSLKVIPKWHRGGHASSSNLGFNLSTGEIIIMLDGDTSVDNDMVS 170 Query: 175 YIVEPMLYNPRVGAVTGNPRIRT-RSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGV 233 +V+ + + V +G R+R + L+ ++Q EY I + + ++ VSG Sbjct: 171 NMVQQFV-DSNVVGCSGTLRVRNIKKNLLTRMQGIEYMLGIHMSRIGLASINSINNVSGA 229 Query: 234 IAAFRRSALAEVGYWSDDMITEDIDISWKLQL-----NQWTIFYEPRALCWILMPETLKG 288 AFR+S L++ W + ED+D++ +LQ I + ++ +PETL Sbjct: 230 FGAFRKSFLSKTTRWRNGS-AEDLDLTLRLQSYFKRHKHLRIVHSQDSIAHTDVPETLWQ 288 Query: 289 LWKQRLRWAQGGAEVFLKNMTRLWRK--ENFRMWPLFFEYCLTTIW--AFTCLVGFIIYA 344 L KQRLRW +++ + R +K + WP F + ++ A +V I Sbjct: 289 LIKQRLRW---DGDIYFIYIRRHLKKLLPRYLGWPTFLITVFSGLFFHAVQPIVIVISLI 345 Query: 345 VQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVI 404 + P+N+ + A + + + + FI+ + + + + + I + P Sbjct: 346 YSIFVFPINVIIPIYIAVYLYYLAISIFLFITFILLVSERKKQDLAMW---YVIPFMPFY 402 Query: 405 FWMLSLATTLVSFTRVMLMPKKQRARWVSP 434 L T+ F+ + K R ++P Sbjct: 403 SVFLKFVATVAIFSEFFI--KTHRDSSMAP 430 >UniRef50_C1ZJA9 Glycosyl transferase n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZJA9_PLALI Length = 533 Score = 218 bits (556), Expect = 3e-55, Method: Composition-based stats. Identities = 87/431 (20%), Positives = 178/431 (41%), Gaps = 28/431 (6%) Query: 30 MMRFVFFWPFFMSIMWIVGGVYFWVYRERHWP--WGENAPAPQL---KDNPSISIIIPCF 84 M+ + F++++ V + + R+W W + P + + P+++I +P F Sbjct: 1 MLLPLIVTLLFVTLVNTVFQLTQFDLAYRYWRSVWKKQKPTSRPIDREHLPAVTIQLPMF 60 Query: 85 NEEKNVEETIHAALAQRYENIEV-IAVNDGSTDKTRAI----LDRMAAQIP--HLRVIHL 137 NE + A Y + + + D STD + I L+ + P ++ +H Sbjct: 61 NESIIAPRILEAVSRIDYPRDRLQVQILDDSTDHSPEIIAGILEELRQSQPELNIEYLHR 120 Query: 138 AQNQG-KAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNP-RI 195 QG KA AL+ E++ D D + D +++ P +P V V + Sbjct: 121 TDRQGFKAGALQAAMPLVTGEFIAIFDADFIPQPDFLTHLL-PYFDSPEVAVVQSRWGHL 179 Query: 196 RTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITE 255 +++ + Q + + + G T +G ++RSA+ G WS D + E Sbjct: 180 NAHDSVLTQAQQFFLDGHHSVEQNGRNRAGYFITFNGTAGIWQRSAMEAAGGWSADTLVE 239 Query: 256 DIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKE 315 D+D+S++ Q + I Y + +P ++ GL Q RW +G A+V LK + ++W++ Sbjct: 240 DLDLSYRTQSLGYRIVYVEDYVTPGELPNSVSGLRVQLFRWFKGNAQVGLKILGKVWKQP 299 Query: 316 NFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLL 375 + + ++A ++ ++ + +PL + H A H + LC + + Sbjct: 300 LPLSVKI---HATAQLFAPFTMLSSLVMLLITGALPL---ILHAAPEHAGLVKLCYMGFV 353 Query: 376 QFIVSLMIEN----RYEHNLTSSLFWIIWFPVIFWMLSLATTL-VSFTRVMLMPKKQRA- 429 L++ R++ + P F +++ T L + +L +RA Sbjct: 354 WVPAVLLVYGTPRIRFDEGPW-YIRLAKLVPRTFVFMAMMTGLSCQSSIAVLEAVFKRAN 412 Query: 430 RWVSPDRGILR 440 +WV +G + Sbjct: 413 QWVVTPKGFSQ 423 >UniRef50_C4Z6K0 Putative uncharacterized protein n=1 Tax=Eubacterium eligens ATCC 27750 RepID=C4Z6K0_EUBE2 Length = 465 Score = 218 bits (555), Expect = 4e-55, Method: Composition-based stats. Identities = 82/399 (20%), Positives = 166/399 (41%), Gaps = 20/399 (5%) Query: 35 FFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETI 94 PF M I+ +G V+ + R +L+ P I++IIP +N + I Sbjct: 16 IIIPFIMEIVPAIGSVFLLIRRN----LRHKIHKKELEVYPEITLIIPVYNSADTLYGCI 71 Query: 95 HAALAQRYEN--IEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAA 152 + Y N I + VN+ D + I + + P L + + QGK+ AL Sbjct: 72 RSINDSTYPNDRIRIFLVNNKGKDNSFDIYAQCQQEFPELLMQWMNSEQGKSRALNLALY 131 Query: 153 AAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTR---------STLVG 203 + +Y++ +D D +L+++A ++ + + +TG+ S L+ Sbjct: 132 NSDGKYIINLDSDGMLEKNALVNMITRFENDTAINCMTGSILTVPEQIKKYKAGPSRLLR 191 Query: 204 KIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKL 263 +++ EY+ + +V+T+SG +AFR+SA+ + ++ D I ED I++++ Sbjct: 192 ELEFMEYAQAFLAGRSYASELNSVYTLSGAFSAFRKSAVLKSWMYNTDTICEDTHITFQM 251 Query: 264 Q-LNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENF----R 318 + L + + AL ++ E + L+ QR RW +G EV M + ++ +N Sbjct: 252 RYLQKERVEVCEDALFFVDPIENVNKLYTQRQRWQRGSLEVSKMFMDKSFKVKNLFTNIS 311 Query: 319 MWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFI 378 + L +++ L + V +G + + I +T L + + F+ Sbjct: 312 VKTLLYDHTFAFPRLIWYLALICLIVVGYSGKTVILSTAIIFGLYTLIGYLYFIVVAYFL 371 Query: 379 VSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSF 417 RY + ++ +F + + LA + S Sbjct: 372 RINKDIRRYYTRHWWCILFLPFFNFAVFFIRLAGIINSI 410 >UniRef50_C5A5B1 Glycosyltransferase, family 2 n=9 Tax=Thermococcaceae RepID=C5A5B1_THEGJ Length = 354 Score = 217 bits (554), Expect = 6e-55, Method: Composition-based stats. Identities = 98/377 (25%), Positives = 168/377 (44%), Gaps = 52/377 (13%) Query: 39 FFMSIMWIVGGVYFWVYR---ERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIH 95 +S++ + G +F Y R +P E + P +S++IP +NEE+N+E+ + Sbjct: 5 ILLSLILLWDGYFFLRYLLSLRRAYPTREWS--------PMVSVVIPAYNEEENIEDAVK 56 Query: 96 AALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAK 155 AAL+Q Y N EVI V+DGSTD T ++ + LRVI + + GK AL G + Sbjct: 57 AALSQDYPNFEVIVVDDGSTDGTYE--KALSIKDERLRVIRI-NHGGKTRALNVGLEFSA 113 Query: 156 SEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIG 215 E +V ID D LL+++A + +VE + + AV G R+R T + Q E+ I Sbjct: 114 GEIIVTIDADGLLEKNAVSRLVERFYSD-DIAAVGGQVRVRGE-TFLELAQDIEHLRIAT 171 Query: 216 LIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPR 275 + + N+ G I+AFR+ +L ++G + +D++ ED + L+ + Y P Sbjct: 172 FRRAKE--LENLSLAPGPISAFRKRSLLKIGGFVNDLV-EDYATTLALKKIG-RVVYAPG 227 Query: 276 ALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFT 335 A W+ MP L LW+QR RW G K ++ ++++ + ++ + A Sbjct: 228 ARVWVRMPSGLGALWRQRRRWFLGD---LPKLSSKPFKEK--------LVFIMSDLVA-- 274 Query: 336 CLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSL 395 L+ + L ++ L + L+ + L Sbjct: 275 -LLDVLFPLAALMAGKWSLLLIFLVLEWLMMYLIVRV------------------EGGML 315 Query: 396 FWIIWFPVIFWMLSLAT 412 + FPVI W + Sbjct: 316 VEVALFPVILWFWAAFY 332 >UniRef50_Q97FY5 Glycosyltransferase n=4 Tax=Clostridiales RepID=Q97FY5_CLOAB Length = 456 Score = 217 bits (553), Expect = 8e-55, Method: Composition-based stats. Identities = 85/414 (20%), Positives = 175/414 (42%), Gaps = 26/414 (6%) Query: 35 FFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETI 94 PF M I+ +GG F + + + A + +L P I+II+P +N E ++ I Sbjct: 19 IIIPFLMEIVPALGG--FIILLRKRLSIKDKAFSGKL---PQITIIVPVYNSEDTLKGCI 73 Query: 95 HAALAQRYEN--IEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAA 152 + Y + I+V+ V++ S+D + I R + L + ++ QGK+ AL Sbjct: 74 ESIYNSDYPSEFIDVMLVDNQSSDNSFKIFTRCQKEFDGLSLRYMNSKQGKSKALNMALF 133 Query: 153 AAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTR---------STLVG 203 + +Y++ ID D L +DA +V NP V +TG L+ Sbjct: 134 NSSGKYIIHIDSDGKLHKDAIKNMVTRFEGNPHVHCMTGVILTDKELIEKTKGSFFKLLR 193 Query: 204 KIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKL 263 K + EY+ + + G+++T+SG +AFR+S + + ++ + ++ED I++++ Sbjct: 194 KCEFFEYAQAFLAGRNFESEVGSIYTLSGAFSAFRKSTILKTQLYNSETVSEDTQITFQV 253 Query: 264 -QLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPL 322 +L ++ AL ++ E L+ QR RW +G EV + ++ ++ + Sbjct: 254 KKLLNQKVYLCENALFFVDPIEGFNKLYTQRQRWQRGELEV-----SHMFLRDGLKFGKG 308 Query: 323 FFEYCLTTIWAFTCLVGF---IIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIV 379 F + + F I Y + V +N + + + LL +L F + Sbjct: 309 FVSDFMVRTLMYDHTFAFPRMIWYFALIFLVFMNYPIYLVVGSVAVIYLLYSLSAFLFYL 368 Query: 380 SLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVS 433 +++ + + S + + M + + F ++ + + W + Sbjct: 369 NVISYLKKYKEVRSYYAKKFYLIFLMPMYNFGVFWIRFC-GIINSIQLQGNWKT 421 >UniRef50_C0A1N7 Glycosyl transferase, family 2 n=1 Tax=Opitutaceae bacterium TAV2 RepID=C0A1N7_9BACT Length = 498 Score = 217 bits (552), Expect = 9e-55, Method: Composition-based stats. Identities = 87/392 (22%), Positives = 151/392 (38%), Gaps = 28/392 (7%) Query: 33 FVFFWPFFMSIMWIVGGVYFWVYRER------HWPWGENAPAPQLKDNPSISIIIPCFNE 86 F+ + + G ++F ++R + + + P P P + I +P +NE Sbjct: 5 FLVVYACTLG-----GLLFFALHRIKMLWLYARYSRRQTPPPPLNGPLPRVCIQLPLYNE 59 Query: 87 EKNVEETIHAALAQRYEN-----IEVIAVNDGSTDKTRAILDRMAAQIP------HLRVI 135 VE + A R+ N +E+ ++D STD+T AI++R A P H+ I Sbjct: 60 SLVVEALLDKVSAIRWGNGGDETLEIQILDD-STDETTAIIERWMAANPVRVATAHISHI 118 Query: 136 HLAQNQG-KAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPR 194 G KA AL G ++E+ D D + D ++ P + ++G V Sbjct: 119 RRPNRHGYKAGALSYGMTLTEAEFFAIFDADFRPEPDFLEQLM-PHFADTKIGVVQARWE 177 Query: 195 IRTR-STLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMI 253 R S+L+ + Q + + + + G F +G +RR AL E G W+DD + Sbjct: 178 FANRKSSLLTRFQGVFLDAHFVVEQEARYAAGLFFNFNGTAGIWRRRALDEAGGWTDDTV 237 Query: 254 TEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWR 313 TED+D+S++ QL W Y +PE++ Q+ RW +GG +V K + + Sbjct: 238 TEDLDVSYRAQLRGWKFIYRADYAVPSELPESMTAFKSQQRRWTKGGMQVMRKQIATIAC 297 Query: 314 KENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLC 373 L + LV F I V + LL Sbjct: 298 SGAPSRSKQEAILHLLVGFVHPLLVLFAISFVPYLILAGQRPTGLWVFFSPVMALLIGAG 357 Query: 374 LLQFIVSLMIENRYEHNLTSSLFWIIWFPVIF 405 + F ++ + W++ P+ Sbjct: 358 SVAFYITAQYFRH--REWREGVLWLLTSPIFM 387 >UniRef50_D2QUU5 Glycosyl transferase family 2 n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QUU5_9SPHI Length = 508 Score = 216 bits (550), Expect = 1e-54, Method: Composition-based stats. Identities = 77/343 (22%), Positives = 146/343 (42%), Gaps = 17/343 (4%) Query: 50 VYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEV-I 108 + + ++ ++A + P +++ +P +NE VE I A + +Y ++ I Sbjct: 28 ISYLRSEKKRRALAQSAADYSPEALPRLTVQLPVYNELYVVERLIDAVVLLKYPKDKLDI 87 Query: 109 AVNDGSTDKTRAILDRMAAQIPH----LRVIHLAQNQG-KAIALKTGAAAAKSEYLVCID 163 V D STD+T +I+ R A+ + I + +G KA AL G AK E++ D Sbjct: 88 QVLDDSTDETVSIIARKVAEYKKQGFDIEHIRRPERKGFKAGALAYGLTLAKGEFVAIFD 147 Query: 164 GDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRS-TLVGKIQVGEYSSIIGLIKRTQR 222 D + D + V P +P+V V +L+ ++Q ++ + + + Sbjct: 148 ADFVPDPEFLLKTV-PHFADPKVAIVQTRWEHLNEDFSLITQLQAFGLNAHFTVEQSGRY 206 Query: 223 IYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILM 282 G + +G +R+ A+A+ G W D +TED+D+S++ QL W Y + Sbjct: 207 AAGLLANFNGTGGVWRKVAIADAGGWQSDTLTEDLDLSYRAQLRGWKFVYREDVGSPAEL 266 Query: 283 PETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFII 342 P + L Q+ RW +G AE K + + + + AF L Sbjct: 267 PVAMNALKSQQYRWMKGAAECARKLFVNVLKTPGVSL--------SMKLHAFFHLFSSAT 318 Query: 343 YAVQLAGVPLNIELTHIAATHTAG-ILLCTLCLLQFIVSLMIE 384 + + L +++ L +I + H + + L QF + ++I Sbjct: 319 FILVLILGVMSVPLIYIRSQHPEWEWVFVVINLFQFNLLILIT 361 >UniRef50_Q0AEM9 Cellulose synthase (UDP-forming) n=2 Tax=Nitrosomonas RepID=Q0AEM9_NITEC Length = 492 Score = 215 bits (549), Expect = 2e-54, Method: Composition-based stats. Identities = 72/401 (17%), Positives = 153/401 (38%), Gaps = 27/401 (6%) Query: 29 LMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKD-------------NP 75 LM+ + + + + +V + + + R +L + P Sbjct: 2 LMLTVLSYAAQAVFFLVVVCFILYVLLELRVLGISRRVERHKLTELVSQLPGMLALPVWP 61 Query: 76 SISIIIPCFNEEKNVEETIHAALAQRYE--NIEVIAVNDGSTDKT----RAILDRMAAQI 129 +S+++P +NE + I A +Y ++E++ ++D STD T +A +D A Sbjct: 62 RVSVLLPIYNEAAVIVRLIDAVCRLQYPTSSLEILVLDD-STDNTSTLAQARIDYHADLG 120 Query: 130 PHLRVIHLAQNQG-KAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGA 188 +R + A N+G KA L G + E+ V D D + D V P +P++G Sbjct: 121 VSIRYVRRASNEGYKAGNLLNGIRQSSGEFYVIFDADFIPQEDFLLRTV-PYFQDPQLGF 179 Query: 189 VTGNPRIRTRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGY 247 + R + + + Q E + G++ ++SG +RR+ + +G Sbjct: 180 LQTGIGYENRDASFLTRFQAMEM-GHQQYVTVGLSEDGDMASLSGSSCVWRRACVESLGG 238 Query: 248 WSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKN 307 W+ +TED+D+ ++ Q W Y + ++PET QR RW +G K+ Sbjct: 239 WNASTVTEDVDLGYRAQFGNWKYAYMRDVVSMSVLPETTSAFRVQRKRWGRGLIHSAFKH 298 Query: 308 MTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGI 367 +++R+ M L + + +++ L I + + ++ + A + Sbjct: 299 ARQMFRQRMPLMQRL---HAIAMMFSSLLLASIYILLLLCLPLSYLVDFEGLGIHWVAWL 355 Query: 368 LLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWML 408 + + + +W ++ + Sbjct: 356 FFGFVAIWGLNNVFAARKGAHIGEQRGILRTLWETYLYVAM 396 >UniRef50_B3QIT3 Glycosyl transferase family 2 n=6 Tax=Bradyrhizobiaceae RepID=B3QIT3_RHOPT Length = 944 Score = 215 bits (549), Expect = 2e-54, Method: Composition-based stats. Identities = 79/320 (24%), Positives = 134/320 (41%), Gaps = 12/320 (3%) Query: 73 DNPSISIIIPCFNEEKN-VEETIHAALAQRYENIEVIAVNDGSTDK--TRAILDRMAAQI 129 + P +SI +P + E +++T+ A Y N EV+ + + + D T+ I D Sbjct: 471 NVPKVSIHVPAYFEPPEMLKQTLDALARLDYPNFEVVVIINNTPDPAFTQPIQDHCRELG 530 Query: 130 PHLRVIHLAQNQG-KAIALKTGAAAA--KSEYLVCIDGDALLDRDAAAYIVEPMLYNPRV 186 + I+ + +G KA AL+ +E + ID D ++ D +V P +PRV Sbjct: 531 ERFKFINAEKVKGFKAGALRIAMERTAVDAEIIGIIDADYVVTPDWLKDLV-PAFDDPRV 589 Query: 187 GAVTGNPRIRTRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEV 245 G V R +L+ I GEY+ + + Y + V G + RR+A+ Sbjct: 590 GLVQAPQEHRDGDRSLMHYIMNGEYAGFFDIGMVQRNEYNGI-IVHGTMCLIRRAAMDMA 648 Query: 246 GYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFL 305 G WS D I ED D+ ++ + W Y + L+P+T + KQR RWA GG ++ Sbjct: 649 GGWSSDTICEDSDLGLEIMEHGWLTHYTNTRYGYGLLPDTYEAFKKQRHRWAYGGFQIIK 708 Query: 306 KNMTRLWRKENFRMWPLFFEYCLTTI-WAFTCLVGFIIYAVQLAGVPLNIELTHIAA--T 362 K+ R + E+ L + W +G ++ + L VP+ Sbjct: 709 KHWRRFLPGNSRLSRDQRREFGLGWLNWLGAESLGVVVAILNLIWVPIVAFADIAIPDKI 768 Query: 363 HTAGILLCTLCLLQFIVSLM 382 T I+ + L + L Sbjct: 769 LTLPIIASFIVTLAHFLVLY 788 >UniRef50_C5S5F1 Glycosyl transferase family 2 n=1 Tax=Allochromatium vinosum DSM 180 RepID=C5S5F1_CHRVI Length = 879 Score = 215 bits (549), Expect = 2e-54, Method: Composition-based stats. Identities = 77/388 (19%), Positives = 136/388 (35%), Gaps = 14/388 (3%) Query: 11 LCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQ 70 + + Y ++ + I+ ++ + W W P Sbjct: 353 AAVWVVYDYTRQYMTPATAIVGVLLLIGGVGVIVLLMAEAHEWAESVWLRRWRRPFPLRS 412 Query: 71 LKDN--PSISIIIPCFNEEKNV-EETIHAALAQRYENIEVIAVNDGSTDKT--RAILDRM 125 + D+ P +S+ +P +NE + +ET+ A Y + EV+ +++ + D + D Sbjct: 413 VPDDQLPFVSVHVPAYNEPPELLKETLDGLAALDYPHFEVLVIDNNTKDPAVWEPVRDYC 472 Query: 126 AAQIPHLRVIHLAQNQG-KAIALKTGAAAA--KSEYLVCIDGDALLDRDAAAYIVEPMLY 182 A R H+ G KA AL +++ + ID D ++ ++V P Sbjct: 473 ARLGERFRFFHVDPLAGYKAGALNFALRHTDPRADVVAVIDADYIVRPPWLRHLV-PAFG 531 Query: 183 NPRVGAVTGNPRIRTRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSA 241 +P V V R + + EY L T+ + G + RR Sbjct: 532 DPEVAIVQAPQDYRDAHQNAFKAMCMAEYRGFFHLGMVTRNERNAIIQ-HGTMTMIRRQT 590 Query: 242 LAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGA 301 L V W++ ITED ++ +L Y P LMP+T KQR RWA G Sbjct: 591 LDAVDGWAEWCITEDAELGLRLFEGGHKALYIPCTYGQGLMPDTFADFRKQRYRWAYGAV 650 Query: 302 EVFLKNMTRL--WRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHI 359 + L + L R + + + W +A V + + I Sbjct: 651 RILLHHRRELLGLRGTSLSLGQRYHFVAGWLPWFADGFNLLFNFAALAWSVAMVMAPDTI 710 Query: 360 AATH-TAGILLCTLCLLQFIVSLMIENR 386 + T ++ L L + SL + R Sbjct: 711 TPPYMTIALVPLVLFLFKMSKSLFLYRR 738 >UniRef50_A6FMF5 N-acetylglucosaminyltransferase n=1 Tax=Roseobacter sp. AzwK-3b RepID=A6FMF5_9RHOB Length = 419 Score = 215 bits (549), Expect = 2e-54, Method: Composition-based stats. Identities = 101/432 (23%), Positives = 173/432 (40%), Gaps = 25/432 (5%) Query: 8 FFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAP 67 F L L + F+F +P F I+ + V RER Sbjct: 1 MFELSFAFLANLSASSAVHLYW-YYFLFEFPRF--ILSALAVAVASVLRER--------- 48 Query: 68 APQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAA 127 P + +S++I N++ + T+ + +Q ++N+E+I V DG + A Sbjct: 49 -PDSPKDLLVSVLIVGLNDKAGLARTLQSLKSQTHKNLEIIVVQDGPGTTMYQLGKEFEA 107 Query: 128 QIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVG 187 + I GKA AL G SEY+V D D DR+A + I+ +L + ++G Sbjct: 108 SGDIQKYIKTEIRGGKAAALNIGFTRCSSEYVVVADIDTSFDRNAISTIIGRLLSDKKIG 167 Query: 188 AVTGNPRIRT-RSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVG 246 V+GN R S +Q EY + I + ++ Q ++G + VSG AFR SA+ VG Sbjct: 168 GVSGNLAPRNLASGFWPSVQAIEYLNNITIGRQFQSMFGILTIVSGAFGAFRASAIRSVG 227 Query: 247 YWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLK 306 W +D +++ K++L+ W I +EP A +P++ LW QRLRW + ++ Sbjct: 228 GWDVG-PGDDSNLTTKIRLSGWKIQFEPDAWALTDVPDSFLALWTQRLRWNRS----LIR 282 Query: 307 NMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAG 366 N WRK P + L + A T + F A + + Sbjct: 283 NR---WRKFRSVFNPFLHNFSLVEVVAATNQLWFNFIATFAYLFYIFYSIITYGEWAIMI 339 Query: 367 ILLCTLCLLQFIVSLMIENRYEHN---LTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLM 423 IL + + F V L ++ H + ++ + + L A + ++ +L Sbjct: 340 ILSLHIIMFVFDVLLFLQAATLHPRGYFWKLVLYLPAYALYGGFLMRANRVYAYLSELLF 399 Query: 424 PKKQRARWVSPD 435 + Sbjct: 400 RSSYNDSFYPTK 411 >UniRef50_Q03RZ5 Glycosyltransferase n=3 Tax=Lactobacillus RepID=Q03RZ5_LACBA Length = 374 Score = 214 bits (546), Expect = 4e-54, Method: Composition-based stats. Identities = 77/326 (23%), Positives = 139/326 (42%), Gaps = 24/326 (7%) Query: 36 FWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIH 95 P + I+ + ++ RH P P ++II+P +N E + E I Sbjct: 19 LIPIVVEIIPALISTAKLMHFHRHEPK-----LTPPGKWPWVTIIVPVYNSEDTLFECIQ 73 Query: 96 AALAQRYEN--IEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAA 153 + Q Y +++I ++ STD + R P L + ++ ++GKA AL T Sbjct: 74 SINEQTYPKEAMQIILADNQSTDGSFGAYARAQNTFPELILRYVNTDKGKARALNTAIYE 133 Query: 154 AKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTR---------STLVGK 204 + Y++ ID D LL+++A +V + + A+TG + L+ K Sbjct: 134 SIGIYVINIDSDGLLEKNAVKNMVLRFENDLSIAAMTGAILPKEELVQKVKGFGHRLLAK 193 Query: 205 IQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQ 264 + EY+ + + +FT+SG +AFRR L E + D I ED D++++++ Sbjct: 194 NEYYEYAQAFLSGRTIEASRDQLFTMSGAFSAFRREVLMETFLYDIDTIGEDTDMTFQIR 253 Query: 265 -LNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEV---FLKNMTRLWR-KENFRM 319 + A+ ++ L+ QR RW +G EV ++K L R +NF + Sbjct: 254 TRLGKKVVICADAIFYVEPISGFSELYTQRQRWQRGELEVTENYMKQTASLKRFFKNFLV 313 Query: 320 WPLFFEYCLTT---IWAFTCLVGFII 342 + ++ T IW F +V Sbjct: 314 RRMMIDHTFTFPRMIWTFATVVLIAF 339 >UniRef50_UPI0001C31B0A glycosyl transferase family 2 n=1 Tax=Conexibacter woesei DSM 14684 RepID=UPI0001C31B0A Length = 495 Score = 214 bits (545), Expect = 6e-54, Method: Composition-based stats. Identities = 83/392 (21%), Positives = 152/392 (38%), Gaps = 19/392 (4%) Query: 34 VFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEET 93 + + +V G Y ++ H P+ P ++++IP +NE + T Sbjct: 8 ILIALVLAGAIPLVAGCYQFLLAGLHTFRVR---TPRAACYPRVALVIPAWNEAAVIGRT 64 Query: 94 IHAALAQRYE--NIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQG--KAIALKT 149 + LA Y + + V+D STD T ++ A + P + G KA + Sbjct: 65 LDRLLALEYPGDRLRIYVVDDASTDATPELVRAKAEEHPGRIFHLRRERGGEGKAHTINH 124 Query: 150 GAAAAKSE----YLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTR-STLVGK 204 G ++E ++ ID D + + ++ L +P VGAVTG + +R + + + Sbjct: 125 GLRTIRAEGWYEAVLVIDADVIFTPPSLRWMTR-HLADPGVGAVTGYIKEGSRPGSWMTR 183 Query: 205 IQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGY-WSDDMITEDIDISWKL 263 EY + +R Q + G ++G RR +L VG + ED + + Sbjct: 184 FVAFEYVTAQAAARRAQNVLGAQACLAGGAQLLRRESLEAVGGEIDTATLAEDTVTTLGV 243 Query: 264 QLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLF 323 QL + +EP A+ W P T+ GLWKQRLRW +G +V + + + + + Sbjct: 244 QLAGGRVVFEPYAIVWAEEPRTVTGLWKQRLRWGRGNVQVTRRFRGVWLHRGSGSLGGVG 303 Query: 324 FEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMI 383 F IW F+ + + A + + + L + V+L Sbjct: 304 F----ALIW-FSVFLMPVFMVTSSAALVTLWAIDRGQSIEVFRTLWFLNVVTYLFVTLSS 358 Query: 384 ENRYEHNLTSSLFWIIWFPVIFWMLSLATTLV 415 + + + FP + +L + L Sbjct: 359 FSIDAATARRTWREGLLFPGVVSLLIILYALY 390 >UniRef50_C0ZQD3 Putative glycosyltransferase n=2 Tax=Rhodococcus RepID=C0ZQD3_RHOE4 Length = 634 Score = 214 bits (545), Expect = 7e-54, Method: Composition-based stats. Identities = 76/312 (24%), Positives = 131/312 (41%), Gaps = 10/312 (3%) Query: 72 KDNPSISIIIPCFNEEKNVEETIHAA--LAQRYE-NIEVIAVNDGSTDKTRAILDRMAAQ 128 + +P +I++P NE ++ TI A A+ Y I + V++ STD T + +++ A+ Sbjct: 250 EFSPPTAILVPAHNEAHDIAATIAAVDTAARTYPGRIHLYVVDNASTDNTHEVAEQIIAE 309 Query: 129 IPHLRVIHLA-QNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVG 187 + L GKAIAL G E++ ID D ++ + NPRVG Sbjct: 310 CSFITGTVLDCPTPGKAIALNMGIDHITEEFVARIDADTVIGPGCLETAMR-HFANPRVG 368 Query: 188 AVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGY 247 V G P R T + K ++ E G + + Y V + G+ A +RRS L +VG Sbjct: 369 CVGGIPMPREEKTWIDKCRLIEVYMRHGFFQVSLDGYQGVMGIPGMFAVYRRSQLIKVGG 428 Query: 248 WSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKN 307 M ED DI +L + +P+A+ + P + L +QR RW + + +N Sbjct: 429 MVQGMNGEDTDICMRLTAAGYHSVADPKAVYYSETPASYAHLLEQRTRWFRSIYHLAARN 488 Query: 308 MTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYA----VQLAGVPLNIELTHIAATH 363 L ++ + + L L +I+A + ++ I AT Sbjct: 489 RALLLDRQTM-IGTFVLPFNLVNAARRAMLAPLLIFACIAGIAFQSTFTTLQWQPIVATL 547 Query: 364 TAGILLCTLCLL 375 ++ T+ + Sbjct: 548 LGMSMIMTVVVC 559 >UniRef50_A5FAD6 Candidate beta-glycosyltransferase; Glycosyltransferase family 2 n=7 Tax=Flavobacteriales RepID=A5FAD6_FLAJ1 Length = 478 Score = 214 bits (545), Expect = 7e-54, Method: Composition-based stats. Identities = 80/389 (20%), Positives = 158/389 (40%), Gaps = 24/389 (6%) Query: 34 VFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDN--PSISIIIPCFNEEKNVE 91 + F +++ GG++ + + + + P + D P+ ++I+P +NE K V Sbjct: 85 IIFLAVTVALFLFKGGMFLYNL----YLYFKYKPIESVSDELLPTCTVIVPAYNEGKLVY 140 Query: 92 ETIHAALAQRYE--NIEVIAVNDGSTDKTRAILDRMAAQ-IPHLRVIHLAQNQGKAIALK 148 ET+ + + +E++A++DGS D T + + + + + QN+GK AL Sbjct: 141 ETLMSLAESDFPAHKLELLAIDDGSKDDTWYWIQQAKIKLGDRVSIFQQPQNKGKRHALY 200 Query: 149 TGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRI-RTRSTLVGKIQV 207 G K E V +D D+++ +D +V P + + + GAV GN ++ ++ K+ Sbjct: 201 RGFNLGKGEVFVTVDSDSIVKKDTLRNLVSPFVVDEKCGAVAGNVQVLNNEKAILPKMLN 260 Query: 208 GEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGY-WSDDM-------ITEDIDI 259 + ++ + G+V G +AA+RR+A+ W D I ED + Sbjct: 261 VSFVMSFEFMRSAESSLGSVLCTPGALAAYRRNAVFACLPEWIDQTFMGQPSDIGEDRAM 320 Query: 260 SWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKEN--- 316 + + + ++ A +PE KGL+K +RW + + +W+ Sbjct: 321 TNMILKQGHHVLFQRNAYVLTNVPEDYKGLYKMFIRWGRSNVRENIMMAKYVWKDFRKES 380 Query: 317 -FRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLL 375 LF E + I + + + + ++ L I + +L Sbjct: 381 KVGARLLFTEQSIRIIMTYPFFIFMFYFIATHPLLFISSALLGILIVSSFSVLFYAKRY- 439 Query: 376 QFIVSLMIENRYEHNLTSSLFWIIWFPVI 404 F + Y T LFWI + +I Sbjct: 440 SFTEAFWAY-SYSVFYTFGLFWIAPYSII 467 >UniRef50_C6VR67 Glycosyltransferase (Putative) n=7 Tax=Lactobacillales RepID=C6VR67_LACPJ Length = 451 Score = 213 bits (544), Expect = 8e-54, Method: Composition-based stats. Identities = 75/362 (20%), Positives = 150/362 (41%), Gaps = 24/362 (6%) Query: 36 FWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIH 95 P + I+ + + RH P APA P ++II+P +N E + + I Sbjct: 20 LVPIVVEIIPAIMSTIKLMRLHRHMP-RLKAPAK----WPWVTIIVPVYNSEDTLFDCIR 74 Query: 96 AALAQRYEN--IEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAA 153 + Q Y+ +++I N+ STD + R P+L + ++ ++GK AL Sbjct: 75 SIDHQTYDKKSMQIILANNQSTDDSFGAYARAQNAFPNLFLRYVNTDKGKDRALNAAIYE 134 Query: 154 AKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTR---------STLVGK 204 + Y++ ID D LL+ +A +V + ++ A+TG + L+ K Sbjct: 135 SIGSYVINIDSDGLLEPNAIKNMVLRFENDEQIAAMTGAVLPQRNLVKQVHGWWHRLLAK 194 Query: 205 IQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQ 264 + EY+ + + +FT+SG +AFRR L E + D I ED D++++++ Sbjct: 195 NEYYEYAQAFLSGRTIESFRDQLFTMSGAFSAFRREVLMETFLYDTDTIGEDTDMTFQIR 254 Query: 265 -LNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWR----KENFRM 319 + A+ ++ L+ QR RW +G EV M++ + F + Sbjct: 255 TRLGKKVVICADAIFYVAPISGFSELYTQRQRWQRGELEVAQHYMSQTASIKGFFKEFLV 314 Query: 320 WPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIV 379 + ++ T ++ A + V + + I ++ +++ + + + Sbjct: 315 RRIMIDHTFTFPRMIWTFATIVLIAFGYSTVVVGLSYLLI---YSLYVMVGVINFISVGI 371 Query: 380 SL 381 L Sbjct: 372 LL 373 >UniRef50_B4WM17 Glycosyl transferase, group 2 family protein n=1 Tax=Synechococcus sp. PCC 7335 RepID=B4WM17_9SYNE Length = 475 Score = 213 bits (543), Expect = 1e-53, Method: Composition-based stats. Identities = 89/377 (23%), Positives = 155/377 (41%), Gaps = 26/377 (6%) Query: 72 KDNPSISIIIPCFNEEKNVEETIHAALAQRYE--NIEVIAVNDGSTDKTRAILDRMAAQI 129 K P +S+++ NEE+ + + + L Y ++ ++D STD T ILD + + Sbjct: 115 KSWPYVSLLVAAKNEEQVIGSLVESLLHIDYPTDRYDLWIIDDYSTDATPEILDNLVKRH 174 Query: 130 PHLRVIHLAQN--QGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVG 187 L VIH GK+ AL K + L D DA + D Y+V PM + G Sbjct: 175 RQLNVIHRGPGAIGGKSGALNLVWPQTKGDLLAVFDADAQVSSDLLRYVV-PMFDPEKGG 233 Query: 188 AVTGNPRIRTR-----STLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSAL 242 TG ++R + Q E + + +R + G + + G RR A+ Sbjct: 234 KKTGAVQVRKAIANATKNFWTRGQKAEMALDCYMQQR-RIAVGGIGELRGNGQFVRREAI 292 Query: 243 AEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAE 302 A+ G W+++ IT+D+D++ +L L QW I LW QR RWA+GG + Sbjct: 293 AQCGGWNEETITDDLDLTIQLHLQQWDIGLLFAPAVGEEGVTNPLALWHQRNRWAEGGFQ 352 Query: 303 VFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVP-LNIELTHIAA 361 +L +L + + L + W I Y + L +P L I L Sbjct: 353 RYLDYWRQLGKNQLGWGKSLD----MLGFW-------IIQYMLPLVALPDLVIALIRRQT 401 Query: 362 THTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVM 421 A I + +C+ + + ++ S++ I + +ML ++ + Sbjct: 402 PVYAPITILAVCMSGIGMFTTLRQSERTSVWSAIVQTIRGSI--YMLHWL-VVIGSMAIR 458 Query: 422 LMPKKQRARWVSPDRGI 438 + +++R +WV G Sbjct: 459 ISVRQKRLKWVKTAHGS 475 >UniRef50_Q7NDK6 Glr4229 protein n=1 Tax=Gloeobacter violaceus RepID=Q7NDK6_GLOVI Length = 492 Score = 213 bits (542), Expect = 1e-53, Method: Composition-based stats. Identities = 75/341 (21%), Positives = 132/341 (38%), Gaps = 19/341 (5%) Query: 41 MSIMWIV-GGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALA 99 ++ +W+ G+ + R P P +++ +P FNE + A A Sbjct: 5 LAALWLFTYGINAYWLTARRG--APPPVPPSPGRWPLVTVQLPVFNELYVCRRLLAATCA 62 Query: 100 QRYEN--IEVIAVNDGSTDKTRAILDRMAAQIPHLRV----IHLAQNQG-KAIALKTGAA 152 Y + + ++D STD+T +L + L +H + G KA AL Sbjct: 63 LDYPREALHIQVLDD-STDETVQLLAAAIEEQRRLGFSIEHLHRKERHGFKAGALAAATP 121 Query: 153 AAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNP-RIRTRSTLVGKIQVGEYS 211 A EY+ D D L D + + RVG V +L+ ++Q Sbjct: 122 LANGEYIAIFDADFLPPPDWLKRAL-VHFADGRVGLVQTRWGHTNPGYSLLTRLQALGID 180 Query: 212 SIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIF 271 + ++ + G F +G +R+ A+ G W D + ED+D+S++ QL W Sbjct: 181 GHFAVEQQARCANGYYFNFNGTAGVWRKRAIEAGGGWQADTLAEDLDLSYRSQLAGWKAV 240 Query: 272 YEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTI 331 Y+ R + +P ++ Q+ RWA+G + K + R+ M L LT Sbjct: 241 YDGRIVAPAELPVSMAAYKMQQYRWAKGSIQCARKLLGRVMDCGGSPMQKLQAALHLTGY 300 Query: 332 WAFTCLVGFIIYAVQLAGV------PLNIELTHIAATHTAG 366 +V ++ +V L V PL++ + T G Sbjct: 301 AVHPLMVWIVLCSVPLLTVSWVGQHPLSLVWGTLMVPATFG 341 >UniRef50_A7HQK0 Glycosyl transferase family 2 n=1 Tax=Parvibaculum lavamentivorans DS-1 RepID=A7HQK0_PARL1 Length = 917 Score = 212 bits (541), Expect = 2e-53, Method: Composition-based stats. Identities = 89/382 (23%), Positives = 147/382 (38%), Gaps = 24/382 (6%) Query: 14 VLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKD 73 V +F G L++ + F+ ++W G P+ ENA +P Sbjct: 413 VGITVWSFLFFAQGLLLIVLLAEAIEFVEVIWTRHG------SRHFKPFDENAVSPTA-- 464 Query: 74 NPSISIIIPCFNEEKN-VEETIHAALAQRYENIEVIAVNDGSTDKT--RAILDRMAAQIP 130 +SI +P NE V ET+ A Y+N EV+ +++ + D + + D A P Sbjct: 465 --MVSIHVPIHNEPPEMVRETLQALANLDYDNYEVLVLDNNTVDPEVWQPVRDYCAQLGP 522 Query: 131 HLRVIHLAQNQG-KAIALKTGAAAA--KSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVG 187 R HL G KA AL G ++E + ID D ++ +V P VG Sbjct: 523 RFRFFHLENWPGFKAGALNFGLEKTAEEAEIIAVIDSDYQVEPSWLKVLV-PYFDKQDVG 581 Query: 188 AVTGNPRIRTR-STLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVG 246 V G R R + + EY+ + + + + G + R+SAL VG Sbjct: 582 FVQGPQDYRDRHESAFKNMAYWEYAGFFHIGMVQRNNFNAIIQ-HGTMTQVRKSALKRVG 640 Query: 247 YWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLK 306 W++ I ED ++ KL + Y + L P+TL G QR RWA G ++ Sbjct: 641 GWAEWCICEDAELGIKLYRAGYDSVYVNHSFGRGLTPDTLSGYITQRFRWAYGAVQIVKH 700 Query: 307 NMTRL--WRKENFRMWPLFFEYCLTTIWA-FTCLVGFIIYAVQLAGVPLNIELTHIAATH 363 + L W + Y L F + + + + H+ + Sbjct: 701 HWDALAPWASSKKGLTGAQRYYFLAGWLPWFADGLALLFTTASIVLSAWALYQPHMVSLP 760 Query: 364 TAGILLCTL--CLLQFIVSLMI 383 A L+ T+ + +F+ SL + Sbjct: 761 VAAFLIPTIGSFVFKFVRSLWL 782 >UniRef50_Q2W2R8 Cellulose synthase catalytic subunit n=5 Tax=Rhodospirillaceae RepID=Q2W2R8_MAGSA Length = 885 Score = 212 bits (541), Expect = 2e-53, Method: Composition-based stats. Identities = 73/358 (20%), Positives = 131/358 (36%), Gaps = 26/358 (7%) Query: 67 PAPQLKDNPSISIIIPCFNEEK-NVEETIHAALAQRYENIEVIAVNDGSTDKT--RAILD 123 P Q K P +SI +PC+NE V+ T+ + +A Y N E+I +++ + + R + + Sbjct: 425 PPDQCKRFPKVSIHLPCYNEPPAMVKLTLDSLMALDYPNFEIIVLDNNTKKEEVWRPVEE 484 Query: 124 RMAAQIPHLRVIHLAQNQG-KAIALKTGAAAAK--SEYLVCIDGDALLDRDAAAYIVEPM 180 ++ HLA G KA AL G +E + +D D ++D++ +V P Sbjct: 485 YCKTLGDKVKFFHLAPWPGAKAGALNFGLTVTDPEAEIIGVVDSDYMVDKNWLKGLV-PY 543 Query: 181 LYNPRVGAVTGNPRIRT-RSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRR 239 NP+VG V R L ++ EY+ + + + G + R+ Sbjct: 544 FENPKVGHVQAPQDHREWEHDLFKEMINWEYAGFFDIGMVFRNEADAIIQ-HGTMTLVRK 602 Query: 240 SALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQG 299 L + G W + I ED ++ ++ + Y L L+P++ KQR RWA G Sbjct: 603 KTLEDAGRWGEWCIVEDAELGLRMMKAGYQSVYVQDRLGHGLVPDSFMAYKKQRFRWAYG 662 Query: 300 GAEVFLKNMTRLWRKENFRM-----------WPLFFEYCLTTIWAFTCLVGFIIYAVQLA 348 ++ + L + + W +F ++ L + V Sbjct: 663 AVQILKAHWRSLIPFKQTGLTTGQKYHFVAGWLPWFADAFYLLFCVAALAWSLGMIVAPR 722 Query: 349 GVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENR------YEHNLTSSLFWIIW 400 + + + L R LT S+ W +W Sbjct: 723 YFSTPLPFFTLPTVGVFVAKIFHHFFLYTTRVNCGLKRRSLAAIAGMGLTYSIAWAMW 780 >UniRef50_B4RSC1 Cellulose synthase catalytic subunit n=2 Tax=Alteromonas macleodii RepID=B4RSC1_ALTMD Length = 707 Score = 212 bits (541), Expect = 2e-53, Method: Composition-based stats. Identities = 102/454 (22%), Positives = 170/454 (37%), Gaps = 53/454 (11%) Query: 8 FFILCLVLCIPLCVAYF------------HSGELMMRFVFFWPFFMSIMWIVGGVYFWVY 55 F ++ LVL I L V Y + + +F + I ++G + Sbjct: 34 FRVVALVLAIALSVRYLLWRGLYTLTYSDVLSMIAVWLLFAAEIYAGITSVLGAIVNAFP 93 Query: 56 RERHWPWGENAPAPQLKDNPSISIIIPCFNE-EKNVEETIHAALAQRYEN--IEVIAVND 112 R E L PS+ ++IP +NE E+ +E TI AA Y + + ++D Sbjct: 94 LSRPLLSLEGMDKSTL---PSVDVMIPSYNEDEEILEVTIRAAKMLDYPKEKLHIHLLDD 150 Query: 113 GSTDKT---------------RAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSE 157 G TD+ R L + ++ KA + + + Sbjct: 151 GGTDQKINADNPISAAQAKQRRQDLQALCERLGITYHTRAKNEFAKAGNVNSAIQNTHGD 210 Query: 158 YLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVT-----GNPRIRTRSTLVG-KIQVGEYS 211 +V +D D + D V ML N +V V NP R+ E Sbjct: 211 LIVILDADHVPTSDFLDRTVPWMLKNEKVFLVQTPHFMANPDPVERNYFSAFSRMPSEND 270 Query: 212 SIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIF 271 G I++ +G+ F G A RR L VG S + ITED + + L + Sbjct: 271 MFYGTIQKGLDYWGSSF-FCGSAALMRRKHLDLVGGISGESITEDAETALDLHKMGYESV 329 Query: 272 YEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTI 331 Y R + L PET +QR+RWAQG ++ L + + ++ E + W Y + + Sbjct: 330 YVDRPMVSGLAPETFDAFIQQRMRWAQGMTQILL--LKKPFKAEGLK-WYQRVGYMSSIM 386 Query: 332 WAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNL 391 + I + LA + +++ H + L + + +++S M+ R L Sbjct: 387 FWLFPFARIIFLFMPLAYLVFGLQVYHASILEIFAYTLPHV-IATYMISTMLFGRTRWPL 445 Query: 392 TSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPK 425 S L+ I L A TL + +V L P+ Sbjct: 446 VSELYEI---------LQCAFTLNALVKVFLKPR 470 >UniRef50_Q46DE3 Dolichyl-phosphate mannose synthase related protein n=2 Tax=Methanosarcina RepID=Q46DE3_METBF Length = 337 Score = 212 bits (540), Expect = 2e-53, Method: Composition-based stats. Identities = 87/363 (23%), Positives = 145/363 (39%), Gaps = 44/363 (12%) Query: 73 DNPSISIIIPCFNEEKNVEETIHAALAQRYEN--IEVIAVNDGSTDKTRAILDRMAAQIP 130 P +S+++ NEEK +EE I + + Y E+I ++ STDKTR I+ + P Sbjct: 2 KYPFVSVVVGIRNEEKFIEECIESLINLDYPRDSYEIIIIDGMSTDKTRDIVQ----KYP 57 Query: 131 HLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVT 190 +++ +N A A G A+ E + DGD +D +V M +P Sbjct: 58 VKLLLNEKKN--VAAARNLGVENARGELVAFTDGDCKVDPQWLKALVHEMQTSPDDVVCF 115 Query: 191 GNPR-IRTRSTLVGKIQVGEYSSIIGLIKRTQRIY----GNVFTVSGVIAAFRRSALAEV 245 G P I + G++ S +G Q V ++ A ++++A+ EV Sbjct: 116 GGPNLIFDTDPVFGRVVGYAQESFLGSGGSAQSKNSTKKHYVSSLPNCNAMYKKAAIKEV 175 Query: 246 GYWSDDM-ITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVF 304 G + + + +D D+++++ Y P A TLK + ++ AE+F Sbjct: 176 GGFDERFVVGQDGDLNYRIGKKGNKFLYIPNAQVLHHRRGTLKSFSVRMFKYGMWMAELF 235 Query: 305 LKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHT 364 K+ F W F + T++A L+ I Y+ T Sbjct: 236 KKH-------GEFVRWYAFLP-SIATLFAVILLIASIKYS-------------------T 268 Query: 365 AGILLCTLCLLQFIVSLM--IENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVML 422 AG+LL L + FI+ + I+ Y+ LF + PV L L SFT L Sbjct: 269 AGLLLLVLVTIYFILVFITSIQVTYKMKSKYGLFALFIIPVQHVAYGLG-FLYSFTNSPL 327 Query: 423 MPK 425 K Sbjct: 328 TSK 330 >UniRef50_Q97G47 Glycosyltransferase n=1 Tax=Clostridium acetobutylicum RepID=Q97G47_CLOAB Length = 447 Score = 210 bits (536), Expect = 6e-53, Method: Composition-based stats. Identities = 93/426 (21%), Positives = 180/426 (42%), Gaps = 25/426 (5%) Query: 31 MRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNV 90 + F W I+ I+GGV ++ + + + P I+I+IP +N E + Sbjct: 9 LLFWGIWLVIPLIVDIIGGVIGAIFITVSYFRKKKL---DIGYLPDITILIPIYNSENTL 65 Query: 91 EETIHAALAQRYE--NIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALK 148 E +++ + Q Y I V+ +N+G DK + + P L+V L + GKA AL Sbjct: 66 ENCLNSVINQTYPIEKISVMLINNGKRDKAYNVFCSFQERNPRLKVWWLDSSSGKAKALN 125 Query: 149 TGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTG----NPRIRTRS----- 199 G A +Y++ ID D +LD++A +VE NP + A+TG +P + + Sbjct: 126 KGIYMANGKYIINIDSDGVLDKEAILKVVEKFETNPDICAMTGVVLTDPELIEETKLKNL 185 Query: 200 TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDI 259 L+ + ++ EY + + Q +FT++G + +RR + + ++++ + ED + Sbjct: 186 RLIQRCELFEYIEAFLIGRGFQSQVNMMFTLAGAFSCYRREIIMKTQLYNNETLGEDTHM 245 Query: 260 SWKLQ-LNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFR 318 + +++ L I A ++ + L L+ QR RW +G EV + L+ + + Sbjct: 246 TSQIRELLDGKIALCEEAFFYVDPIDDLDKLYIQRQRWQRGQMEV-----SALFNEIANK 300 Query: 319 MWPLFFEYCLTTIWAFTCLV--GFIIYAVQLAGVPLNIELTHIAAT-HTAGILLCTLCLL 375 F TI LV FI + V L+ LT + I+ + Sbjct: 301 NTKRFINVLKVTIIKDHSLVFPRFIWIFAMMYLVFLDYPLTLVVGANLILYIVYVITSTI 360 Query: 376 QFIVS--LMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVS 433 F ++ + E + + + ++II I+ + + + + S Sbjct: 361 YFFIAKLFLKEKKQIKDYVNKHWYIILLLPIYRFVVFFIRVAGIINAVEKNASWNTKTFS 420 Query: 434 PDRGIL 439 +R I+ Sbjct: 421 QEREIV 426 >UniRef50_A1SGB7 Glycosyl transferase, family 2 n=1 Tax=Nocardioides sp. JS614 RepID=A1SGB7_NOCSJ Length = 772 Score = 210 bits (536), Expect = 6e-53, Method: Composition-based stats. Identities = 87/416 (20%), Positives = 155/416 (37%), Gaps = 30/416 (7%) Query: 46 IVGGVYFW---------VYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKN-VEETIH 95 ++G Y W +R R E A A P +S+ +P E V ET+ Sbjct: 138 LLGCAYLWELCDAMGSEQWRRRVSDGVEPARAADAGIRPFVSLHVPAHEEPPEMVIETLE 197 Query: 96 AALAQRYENIEVIAVNDGSTDKT-RAILDRMAAQIPHLRVIHLAQNQG-KAIALKTGAAA 153 + YE+ E+IA++D +TD++ ++ A ++ HL G K+ AL Sbjct: 198 SLRGLDYEHYEIIAIDDNTTDESLWRPVEAWCAAH-GVKFAHLEDWPGYKSGALNYALRE 256 Query: 154 A---KSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRT-RSTLVGKIQVGE 209 ++E + +D D L+ D A P+ +PRVG + R + Sbjct: 257 MTDDRAELIGVVDSDYQLEPDFLARC-APLFADPRVGFIQSPQDYRDWEGAPFYRRLYYS 315 Query: 210 YSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWT 269 Y + + ++ +G + RR AL +VG W + ITED ++S ++ W+ Sbjct: 316 YKYFFSVSQPSRNERDGA-IFAGTMGLIRRQALEDVGGWDEWCITEDAELSLRVLRAGWS 374 Query: 270 IFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFE---- 325 + + +MP T + L QR RW GG ++ + L L Sbjct: 375 GMHVDASFGHGVMPLTFEALKGQRFRWCFGGIQILRMHWRSLLPGLRGGGNRLTLGQRWA 434 Query: 326 YCLTTIWAFTCLVGFIIYAVQLAG---VPLNIELTHIAATHTAGILLCTLCLLQFIVSLM 382 Y + + LVG + + L G + L L + ++ L LL F+ ++ Sbjct: 435 YLSGGLQWYGDLVGVLFFLFLLGGATNLALGGGLLFRKLSPFLLAVIPLLVLLGFLRAVS 494 Query: 383 IENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRGI 438 + R S W + + + + K + R + G+ Sbjct: 495 LIRRGTGAGWSDAIGAFLI----WQSTTLVVARASVQGLFARKAEFLRTPKTEEGL 546 >UniRef50_Q1PUL2 Putative uncharacterized protein n=1 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1PUL2_9BACT Length = 342 Score = 210 bits (536), Expect = 6e-53, Method: Composition-based stats. Identities = 73/359 (20%), Positives = 145/359 (40%), Gaps = 28/359 (7%) Query: 76 SISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVI 135 SI+IP +NEEK++ + + L+ Y + E+I ++D S+D T ++ A++ + ++ Sbjct: 2 KFSILIPAYNEEKSISTCLDSLLSVTYPDKEIIVIDDASSDHTVKEVETFASRG--VILV 59 Query: 136 HLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRI 195 +N G+A AL +G + A E +V D D + + E + V AV G + Sbjct: 60 KREKNGGRAAALNSGLSRATGEIVVTTDADTGVSPHWLT-LFERHFADEEVIAVGGAYKA 118 Query: 196 RTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDM-IT 254 + + + S + ++ T R VSGV +A RR L ++G +++ + Sbjct: 119 QNKGNPL----ANATSVLDQILNGTFRKTFIPNKVSGVNSAIRRDTLLDLGGFNESAWWS 174 Query: 255 EDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRK 314 ED ++ WKL + Y+P + + P+T +G+W+++ W F ++M Sbjct: 175 EDSELGWKLDKTG-KVIYDPENVVYTQYPDTWEGIWRRKFYWGYAMGLKFRQHMP----- 228 Query: 315 ENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCL 374 + L T + + V + G LL L L Sbjct: 229 -----------FRLKLWIRPTVFMALFVSLVVFLATIPCGMRVYFVPGLFFGTLLSALTL 277 Query: 375 LQFIVSLMIENRY-EHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWV 432 + + ++ R + L +L + P+ + + + + P + W Sbjct: 278 VYVPIGAIVMIRTGDTGLAKTLPLLAILPIFREFAYVYGMFLGYCKG--KPSALKPSWK 334 >UniRef50_Q03MS9 Glycosyltransferase, probably involved in cell wall biogenesis n=5 Tax=Lactobacillales RepID=Q03MS9_STRTD Length = 397 Score = 210 bits (536), Expect = 7e-53, Method: Composition-based stats. Identities = 79/397 (19%), Positives = 169/397 (42%), Gaps = 29/397 (7%) Query: 30 MMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNP--SISIIIPCFNEE 87 M+ F+ + ++ I + + N S+S+I+P +NEE Sbjct: 1 MLNFLSIFLLTYGVLAISHIAFQIILCHSDHRRQSKKSFKDFHSNYQASVSVIVPAYNEE 60 Query: 88 KNV-EETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIP---HLRVIHLAQNQGK 143 + + I + +AQ+ ++E+I V+DGS ++ +++++ +++++ +N+GK Sbjct: 61 PQILKNCIDSIVAQKAPDLEIIVVDDGSKNR-EELIEKVYNTYQSNQNVKILLPEENKGK 119 Query: 144 AIALKTGAAAAKSEYLVCIDGDALL-DRDAAAYIVEPMLYNPRVGAVTGNPRIRTRST-L 201 K G AK + +V +D D LL D +A +++ Y VGAVTG+ R+ ++T + Sbjct: 120 RHCQKLGFDIAKGDIIVTVDSDTLLHDENAVEKLIQRFAY-KNVGAVTGDVRVENKNTNI 178 Query: 202 VGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEV------GYW--SDDMI 253 + ++ Y + + Q + V SG +A+R+ + +V Y+ + Sbjct: 179 LTRLITYRYWTAFHQERAAQSRFHVVMCCSGPFSAYRKEIIDKVKEKYITQYFLGENCTY 238 Query: 254 TEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWR 313 +D ++ + + + + + +PET++G KQ++RW + F + M + Sbjct: 239 GDDRHLTNLVLEEGHDVAFHRDSRVYTFVPETIRGYIKQQVRWNKS----FYREMLWTIK 294 Query: 314 KENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLC 373 R + + ++ + I F +V I AVQ H + +L Sbjct: 295 FAPKRHFYMLYDLVMQFILPFMLVVSLIAMAVQTISYHDLGHFYHYLLVLILIAIFRSLY 354 Query: 374 LLQFI--VSLMIENRYEHNLTSSLFWIIWFPVIFWML 408 + + ++ L + +I PV F+ L Sbjct: 355 GIYRTKDIGFLLFV-----LYGFMHVLILLPVRFYAL 386 >UniRef50_Q6LKT1 Hypothetical glycosyltransferase, probably involved in cell wall biogenesis n=1 Tax=Photobacterium profundum RepID=Q6LKT1_PHOPR Length = 743 Score = 210 bits (535), Expect = 8e-53, Method: Composition-based stats. Identities = 83/419 (19%), Positives = 152/419 (36%), Gaps = 27/419 (6%) Query: 31 MRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEE-KN 89 + F ++ + + ++ + P + P++ + IP +NE + Sbjct: 88 LDLPFAIALLVTECYGITIYLLGMFVNVRQRSRDITPINVNQAIPTVDVFIPTYNESIRV 147 Query: 90 VEETIHAALAQRYE-NIEVIAVNDGSTDKTRAILDRMAAQIPHLR--------------V 134 V T+ AA+ +Y + V ++DG T + D A+I R Sbjct: 148 VAPTVSAAMQLQYPGKVNVWVLDDGGTPQKLNDEDPQQARIAQQRTHKLKDLCQKLGANY 207 Query: 135 IHLAQN-QGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNP 193 I N KA + + E ++ +D D + +D V P++G + Sbjct: 208 ITRPANVHAKAGNINHALKHSNGELILILDADHVPTKDFLINTVGMFQQQPKLGFIQTPH 267 Query: 194 RIRTRSTLVGKIQVG-----EYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYW 248 T + + + E I + F G A RR AL +VG Sbjct: 268 FFATPGPIEKNLGIEDKVPSENEMFYNRILSGMDFWNASF-FCGSAAIIRREALLDVGGI 326 Query: 249 SDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNM 308 S ITED D + ++ W Y RA+ L P+T QR RWAQG ++FL N Sbjct: 327 STRTITEDADTALEIHAKGWDSIYLNRAMIAGLSPDTFGAYVTQRSRWAQGMLQIFLLNN 386 Query: 309 TRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGIL 368 L R +F P Y +T++ F L + L + ++++ A Sbjct: 387 PLLKRGLSF---PQKICYLNSTLFWFFPLFRVMYLIAPLCYLVFDLQIFVGNANDFLVFA 443 Query: 369 LCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQ 427 + L + ++S + + + L S + + +F+ + + + PK + Sbjct: 444 IPHLAISM-VISQHLFGKTRNALFSEFYETVLSLYLFFPVLSVLITPKKPKFTVTPKGE 501 >UniRef50_D1U9V0 Hyaluronan synthase n=1 Tax=Desulfovibrio aespoeensis Aspo-2 RepID=D1U9V0_9DELT Length = 484 Score = 210 bits (535), Expect = 9e-53, Method: Composition-based stats. Identities = 98/421 (23%), Positives = 166/421 (39%), Gaps = 29/421 (6%) Query: 7 SFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRE------RHW 60 F + L++ I L V F+S + R +S +W GVYF V R Sbjct: 39 GFLAISLIVLISLLV--FNSLVYVGRLHLSLAMPLSTLW-RAGVYFLVVMNLAMLAWRVL 95 Query: 61 PWGENAPAPQLKDN--PSISIIIPCFNEEKNVEETIHAALAQRYE--NIEVIAVNDGSTD 116 PA D S ++IIP +NE V ET+ + ++ + ++VI V+DGS D Sbjct: 96 LVLRYRPAAPCADAELKSCTVIIPAYNEGPQVLETLRSVMSSDFPADRLQVICVDDGSKD 155 Query: 117 KTRAILDRMAAQI-PHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAY 175 T + + + + +I NQGK AL G A E LV ID D++++ Sbjct: 156 DTWLWMVKGGEEFGDRVELIRSPMNQGKRHALNHGFLRATGEVLVTIDSDSVIEPTTLRS 215 Query: 176 IVEPMLYNPRVGAVTGNPRIRTRST-LVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVI 234 +V NPRVGAV GN R+ ++ K+ ++ I+ Q + V G + Sbjct: 216 MVSVFCRNPRVGAVAGNVRVLNTGEGVIPKMLDVAFTYSFDFIRAAQSTFNTVMCTPGAL 275 Query: 235 AAFRRSALAEVG-YWSDDM-------ITEDIDISWKLQLNQWTIFYEPRALCWILMPETL 286 +A+ + +A+V W I ED ++ + Q + ++ A+ + +P Sbjct: 276 SAYDAALVAKVREKWLHQTFMGQPANIGEDRAMTNLILKEQRLVHFQSDAVVFTNVPVEY 335 Query: 287 KGLWKQRLRWAQGGAE---VFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIY 343 KGL K LRWA+ + +R E + + A L +I Sbjct: 336 KGLCKMMLRWARSNIRETIALSRFGFTRFRPEPVLGAQVDLAQQWVRMGANEMLKAGLI- 394 Query: 344 AVQLAGVPLNIELTHIAATHTAGILLCTLCLLQF-IVSLMIENRYEHNLTSSLFWIIWFP 402 L P + A ++ L++ ++ + Y + L WI + Sbjct: 395 -ASLITWPGVTLYGLLGGIFVASLMPAFFYLIRHRSLACLWAFPYSMFMLFGLSWISLYA 453 Query: 403 V 403 + Sbjct: 454 L 454 >UniRef50_A5G0G3 Glycosyl transferase, family 2 n=1 Tax=Acidiphilium cryptum JF-5 RepID=A5G0G3_ACICJ Length = 903 Score = 209 bits (533), Expect = 1e-52, Method: Composition-based stats. Identities = 76/344 (22%), Positives = 131/344 (38%), Gaps = 22/344 (6%) Query: 51 YFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKN-VEETIHAALAQRYENIEVIA 109 F R RH+ E PA P +S+ + NE V++T++A A YEN EV+ Sbjct: 421 LFGRVRMRHF---EPVPAAPGTKLPKVSLHLAICNEPPEMVKQTLNALAALDYENFEVLV 477 Query: 110 VNDGSTDKT--RAILDRMAAQIPHLRVIHLAQNQG-KAIALKTGAAAA--KSEYLVCIDG 164 +++ + D + A R L ++ G KA AL +E + +D Sbjct: 478 IDNNTKDPAVWEPVAAHCARLGKQFRFFTLGKHPGYKAGALNFALRETAPDAEIVGVLDS 537 Query: 165 DALLDRDAAAYIVEPMLYNPRVGAVTGNPRIR-TRSTLVGKIQVGEYSSIIGLIKRTQRI 223 D ++D D +V P +P VG R +L ++ EY+ + + Sbjct: 538 DYIVDPDWLRCMV-PAFADPNVGFTQSPQDYRDNDGSLFKRMMFWEYAGFFHIGMVNRNE 596 Query: 224 YGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMP 283 V G + R++AL G W++ ITED ++ +L + Y R+ +MP Sbjct: 597 RNAVIQ-HGTMTLIRKAALDAEGGWAEWCITEDSELGLRLFREGYEAVYSKRSFGRGVMP 655 Query: 284 ETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTC------- 336 + KQR RWA G + ++ + R + + T W Sbjct: 656 DDFNAFRKQRYRWAYGAMRISRRHWKAFLSPFD-RTLTIGQRWHFVTGWLPWIGDALGLA 714 Query: 337 -LVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIV 379 L+ + ++ L P+ E + I L ++Q Sbjct: 715 FLLLGLAWSAGLILDPVRFEFPILLFM-LPSIGLFAFKIVQIFA 757 >UniRef50_Q04QL4 Glycosyltransferase plus another conserved domain n=6 Tax=Leptospira RepID=Q04QL4_LEPBJ Length = 517 Score = 209 bits (532), Expect = 2e-52, Method: Composition-based stats. Identities = 79/405 (19%), Positives = 152/405 (37%), Gaps = 23/405 (5%) Query: 30 MMRFVFFWPFFMSIMWIVGGVYFWVY-RERHWPWGENAPAPQLK----DNPSISIIIPCF 84 ++ + +++ + Y VY ++++ + E+ P L + P +++ +P F Sbjct: 7 VLFLAIYGIDIVALFFFGIHTYIMVYLYKKNYAYCESEPDKILDVNDPNLPVVTVQLPIF 66 Query: 85 NEEKNVEETIHAALAQRYEN--IEVIAVNDGSTDKTRAILDRMAAQIPHLRV-IHLAQNQ 141 NE V+ I +A +Y +E+ ++D STD+T + L IH Sbjct: 67 NEFYVVDRLIETTVALKYPKDKLEIQLLDD-STDETIEKSRNLINHYKALGFDIHHLHRA 125 Query: 142 G------KAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNP-R 194 G KA AL+ G A+ EY+ D D + D D V P +P++G V Sbjct: 126 GAERTGHKAGALEAGMKVARGEYIAIFDADFMPDPDFLIKTV-PYFEDPQIGMVQVRWGH 184 Query: 195 IRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMIT 254 I ++ K Q + + + +G +++ + + G W D +T Sbjct: 185 INADYNVLTKAQSFGIDGHFMIEQVARNGSHLWMNFNGTAGIWKKECIIDSGGWEHDTLT 244 Query: 255 EDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRK 314 ED D+S++ ++ W Y C +P + Q+ RW +G + +K + R++R Sbjct: 245 EDFDLSYRAEMRGWKFRYFKDIECKAEIPAMISAYKSQQFRWCKGSIQTAVKLLPRIFRA 304 Query: 315 ENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPL--NIELTHIAATHTAGILLCTL 372 + L +V I+++ L + + + G Sbjct: 305 DLPWKIKSEAIVHLINYSVHPLMVINILFSAPLLLMNYWSGFDFYDLPIEILMG---TAA 361 Query: 373 CLLQFIVSLMIENRYEHN-LTSSLFWIIWFPVIFWMLSLATTLVS 416 L V MI Y L + + I M+ +V+ Sbjct: 362 VLSVGSVGPMIFYAYSQKILHKDWKRRMVYLPILIMIGTGIAIVN 406 >UniRef50_Q9FNI7 Glucomannan 4-beta-mannosyltransferase 2 n=8 Tax=Spermatophyta RepID=CSLA2_ARATH Length = 534 Score = 209 bits (532), Expect = 2e-52, Method: Composition-based stats. Identities = 83/400 (20%), Positives = 161/400 (40%), Gaps = 21/400 (5%) Query: 15 LCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQL--K 72 + +PL + LM + +M I+ ++ + FW ++ + + +L Sbjct: 36 VIVPLLQLAVYICLLMSVMLLCERVYMGIVIVLVKL-FWKKPDKRYKFEPIHDDEELGSS 94 Query: 73 DNPSISIIIPCFNEEKNVEETIHAALAQRYENIE-VIAVNDGSTDKTRAI-----LDRMA 126 + P + + IP FNE + + +I AA + + VI V D STD T R A Sbjct: 95 NFPVVLVQIPMFNEREVYKLSIGAACGLSWPSDRLVIQVLDDSTDPTVKQMVEVECQRWA 154 Query: 127 AQIPHLRV-IHLAQNQGKAIALKTGAAAA---KSEYLVCIDGDALLDRDAAAYIVEPMLY 182 ++ ++R I + KA ALK G + EY+V D D + D + +++ Sbjct: 155 SKGINIRYQIRENRVGYKAGALKEGLKRSYVKHCEYVVIFDADFQPEPDFLRRSIPFLMH 214 Query: 183 NPRVGAVTGNPR-IRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSA 241 NP + V R + + L+ ++Q + + F +G +R +A Sbjct: 215 NPNIALVQARWRFVNSDECLLTRMQEMSLDYHFTVEQEVGSSTHAFFGFNGTAGIWRIAA 274 Query: 242 LAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGA 301 + E G W D ED+D++ + L W Y +P T + Q+ RW+ G A Sbjct: 275 INEAGGWKDRTTVEDMDLAVRASLRGWKFLYLGDLQVKSELPSTFRAFRFQQHRWSCGPA 334 Query: 302 EVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAA 361 +F K + + R + R W + I++F + I + V + + LT + Sbjct: 335 NLFRKMVMEIVRNKKVRFWKKVY-----VIYSFFFVRKIIAHWVTFCFYCVVLPLTILVP 389 Query: 362 THTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWF 401 + + + I++++ ++ +WI++ Sbjct: 390 E--VKVPIWGSVYIPSIITILNSVGTPRSIHLLFYWILFE 427 >UniRef50_P74165 Sll1377 protein n=9 Tax=Cyanobacteria RepID=P74165_SYNY3 Length = 479 Score = 208 bits (530), Expect = 4e-52, Method: Composition-based stats. Identities = 90/406 (22%), Positives = 158/406 (38%), Gaps = 20/406 (4%) Query: 43 IMWIVGGVYFWVYRERHWPWGENAPAPQLKD-----NPSISIIIPCFNEEKNVEETIHAA 97 ++W + G E AP D P + +++ NEE + + + Sbjct: 71 LVWALTGALSLQALRLMKATPEEAPPLLTGDASTVPYPQVCLMVAAKNEEAVIGKIVQQL 130 Query: 98 LAQRYE--NIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIA--LKTGAAA 153 + Y EV V+D STD+T AILD++ Q P L+V+ + L A Sbjct: 131 CSLDYPGDRHEVWIVDDNSTDRTPAILDQLRQQYPQLKVVRRGAGASGGKSGALNEVLAQ 190 Query: 154 AKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRS-TLVGKIQVGEYSS 212 + + + D DA + +D +V P +P GA+ I + + Q E + Sbjct: 191 TQGDIVGVFDADANVPKDLLRRVV-PYFASPTFGALQVRKAIANEAVNFWTRGQGAEMA- 248 Query: 213 IIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFY 272 + ++ + + G + + G R AL VG W++ IT+D+D++ +L L+QW + Sbjct: 249 LDAYFQQQRIVTGGIGELRGNGQFVARQALDAVGGWNEQTITDDLDLTIRLHLHQWKVGI 308 Query: 273 EPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPL-FFEYCLTTI 331 T LW QR RWA+GG + +L + + L F + L Sbjct: 309 LVNPPVEEEGVTTAIALWHQRNRWAEGGYQRYLDYWRWICTQPMGWKKKLDLFSFLLMQY 368 Query: 332 WAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNL 391 T V ++ A+ PL L+++A + + L L + L Sbjct: 369 LLPTAAVPDLLMALWQRRFPLLTPLSYLAIGFSCWGMYYGLKRLTPSEGESPWQQMPALL 428 Query: 392 TSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRG 437 ++ I+ W++ + R+ PK R +WV G Sbjct: 429 ARTIGGTIYM--FHWLIIMPAVT---ARMAFRPK--RLKWVKTVHG 467 >UniRef50_Q4KHI6 Glycosyl transferase, group 2 family protein n=11 Tax=Pseudomonadaceae RepID=Q4KHI6_PSEF5 Length = 863 Score = 208 bits (529), Expect = 4e-52, Method: Composition-based stats. Identities = 82/392 (20%), Positives = 152/392 (38%), Gaps = 29/392 (7%) Query: 4 RIVSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHW--- 60 R FI L + + + Y +S + F F +++ + GV+ + E H Sbjct: 342 RTFLTFIAFLCGSVLVWIGYDYSQQYSTWFSLTVGFLLALGAL--GVFIVLLTEAHELAE 399 Query: 61 ------PWGENAPAP-QLKDNPSISIIIPCFNEEKN-VEETIHAALAQRYENIEVIAVND 112 E P P +SI +PC+NE V++T++A Y + EV+ +++ Sbjct: 400 AVWIHKRRREFLPVEGDSSYRPKVSIHVPCYNEPPEMVKQTLNALANLDYPDFEVLIIDN 459 Query: 113 GSTDKT-RAILDRMAAQ-IPHLRVIHLAQNQG-KAIALKTGAAAA--KSEYLVCIDGDAL 167 + D + P + H+A G K AL +E + ID D Sbjct: 460 NTKDPAVWEPVQAYCETLGPRFKFFHVAPLAGFKGGALNYLLPHTAKDAEVIAVIDSDYC 519 Query: 168 LDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRS-TLVGKIQVGEYSSIIGLIKRTQRIYGN 226 +DR+ ++V P +P++ V R ++ + K+ EY + T+ Sbjct: 520 VDRNWLKHMV-PHFADPKIAIVQSPQDYRDQNESTFKKLCYAEYKGFFHIGMVTRNDRDA 578 Query: 227 VFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETL 286 + G + RR+ L E+G W+D I ED ++ ++ + Y + LMP+T Sbjct: 579 IIQ-HGTMTMTRRTVLEELG-WADWCICEDAELGLRVFEKGLSAAYHHESYGKGLMPDTF 636 Query: 287 KGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAF-------TCLVG 339 KQR RWA G ++ ++ L R +N + + L + VG Sbjct: 637 IDFKKQRFRWAYGAIQIIKRHTASLLRGKNTELTRGQRYHFLAGWLPWVADGMNIFFTVG 696 Query: 340 FIIYAVQLAGVPLNIELTHIAATHTAGILLCT 371 ++++ + VP ++ + L Sbjct: 697 ALLWSAAMIIVPQRVDPPLLIFAIPPLALFVF 728 >UniRef50_A0LSJ8 Glycosyl transferase, family 2 n=1 Tax=Acidothermus cellulolyticus 11B RepID=A0LSJ8_ACIC1 Length = 471 Score = 208 bits (529), Expect = 4e-52, Method: Composition-based stats. Identities = 92/430 (21%), Positives = 171/430 (39%), Gaps = 11/430 (2%) Query: 11 LCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQ 70 + L I V + V + ++ + +P Sbjct: 28 IVLAGLISAAVGILGPDRAIFLGVLVINLVFLVFFLRHVAFAAAAARWAPTDLYESPEVP 87 Query: 71 LKDNPSISIIIPCFNEEKNVEETIHAALAQRYEN--IEVIAVNDGSTDKTRAILDRMAAQ 128 +D P++S+++ C NEE V I +A Y +E++ VNDGS+D+T +LD++ A Sbjct: 88 SEDLPTVSVLVACHNEESVVPGLIRGLVALHYPRSRLEILIVNDGSSDRTGELLDQLTAG 147 Query: 129 IPHLRVIHLAQNQGKAIALKTGAAAAK--SEYLVCIDGDALLDRDAAAYIVEPMLYNPRV 186 I H+RVIH G + AA A+ E +V D D + R+ +V +P V Sbjct: 148 IRHMRVIHRPAGAGGGKSGALNAALAEANGEIVVVFDADHIPRRNVIHRLVR-HFRDPTV 206 Query: 187 GAVTGNPRIRTR-STLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEV 245 AV G +R + + + +Y S + + ++ + G A R + L + Sbjct: 207 AAVQGRCIVRNSVQSTLARSIAIDYFSGYLVNEYGRQALFGLPAYGGANCAVRTALLRQF 266 Query: 246 GYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFL 305 G W+ + +TED DI+ +L L + + Y+ A+ TL +QR RWA+G +V+ Sbjct: 267 GGWNVNSVTEDTDITLRLVLAGYRVRYDITAVDTEEGATTLARFVRQRYRWARGHQQVWR 326 Query: 306 KNMTRLWRKENFRMWPLF--FEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATH 363 +WR + + + L C + ++ ++LAG+ ++ + + Sbjct: 327 DYRRGVWRCPHLTLGQKIETTLFLLVYHVPVFCTLTLVLTLLRLAGIGPHVSVVELLPLA 386 Query: 364 TAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLM 423 + V L++ S + F V + + A R Sbjct: 387 ALLFAGP---FCELAVGLLVGRAPRRAAWSVAWMTPLFVVFMLVCTRAWIDGLLGRPYTW 443 Query: 424 PKKQRARWVS 433 K +R++W + Sbjct: 444 VKTKRSQWSA 453 >UniRef50_A5GQ06 Glycosyltransferase of family GT2; possible processive Glycosyltransferase n=1 Tax=Synechococcus sp. RCC307 RepID=A5GQ06_SYNR3 Length = 453 Score = 207 bits (528), Expect = 5e-52, Method: Composition-based stats. Identities = 89/427 (20%), Positives = 174/427 (40%), Gaps = 20/427 (4%) Query: 14 VLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKD 73 ++ + C+ + + P F + + G + + R R ++ P Sbjct: 32 LIVVLACLWAGAAPHWLGPVRGLLPAFGLTLMLGGYGFSTLRRGRSSAAAVSSAPPAGAV 91 Query: 74 NPSISIIIPCFNEEKNVEETIHAALAQRYE--NIEVIAVNDGSTDKTRAILDRMAAQIPH 131 P+I +++ +EE+ + + A Y +++ ++DGS D+T IL++++A Sbjct: 92 VPAIDVLVAARDEEQVIARLVQAIAGLEYPADRLKLWVMDDGSNDRTPEILNQLSATHSF 151 Query: 132 LRVIHLAQN--QGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAV 189 L+V +N GK+ AL + ++L+ +D DA L D + +P L++ +GAV Sbjct: 152 LQVRRRPRNAGGGKSGALNALLPQLQGDWLLVLDADAALTPDVLLRL-QPQLHDSALGAV 210 Query: 190 TGNPRIRT-RSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYW 248 + S + + Q E + ++ + G + + G RR L + G + Sbjct: 211 QLRKAVVGAESNWLTRSQALEMA-FDAEMQLGRIAQGGIGELRGNGQLIRRQVLLDCGGF 269 Query: 249 SDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNM 308 ++D +T+D+D+S++L L + + LW+QR RWA+GG + FL Sbjct: 270 NEDTVTDDLDLSFRLLLAGQQVSVLWNPVVDEEPVMRWGALWRQRQRWAEGGLQRFLDYW 329 Query: 309 TRLW-RKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGI 367 L+ + + R + L VG + + PL L+ ++ + + Sbjct: 330 PALFSDRLSGRQQLDLIVFFLLQYGLPLATVGDVFGMIWWRQWPLLWPLS-VSTLSLSAL 388 Query: 368 LLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQ 427 L + E + L ++ + I WF VI W+ R+ L PK Sbjct: 389 ALWRSGRRHSEGPELPEATGWNLLVANTYLIHWFLVIPWVAV---------RMALRPK-- 437 Query: 428 RARWVSP 434 R W Sbjct: 438 RLVWAKT 444 >UniRef50_B3DVV3 Glycosyltransferase n=1 Tax=Methylacidiphilum infernorum V4 RepID=B3DVV3_METI4 Length = 858 Score = 206 bits (525), Expect = 1e-51, Method: Composition-based stats. Identities = 67/288 (23%), Positives = 120/288 (41%), Gaps = 23/288 (7%) Query: 70 QLKDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQI 129 L+ P +S+I+ +N K + + + + +Y + E+I V+DGSTD TR IL Q Sbjct: 314 PLERTPKVSVIVCTYNGAKTLTDCLSSLKRLQYPDYEIIVVDDGSTDHTRKILH----QF 369 Query: 130 PHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAV 189 P + H N+G + A G A E + D D + D D ++V+ +L + V Sbjct: 370 PEVIAFH-QPNKGLSAARNLGIEKATGEIVAFTDSDCMADPDWLYFMVKTLLNDRYVACG 428 Query: 190 TGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWS 249 N R ++ I + S L+ T + + G A+ + ++GY+ Sbjct: 429 GPNFSPVPRQSIQEIISLAPGSPSHVLLSDTHAEH-----IPGCNMAYWKWIFEKIGYFD 483 Query: 250 D--DMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKN 307 +D+DI W++ + + I + P A+ W T+K KQ++ G AE L+ Sbjct: 484 PIFRKAGDDVDICWRILSSGYQIGFNPSAVIWHYRRFTVKDYLKQQI--GYGEAEALLRF 541 Query: 308 MT---------RLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQ 346 LW+ + FF ++++ GF + Sbjct: 542 KHPHYFGSLGNALWKGRIYEQNLSFFSLKKSSLYQGIFGTGFFQFLYP 589 >UniRef50_D1N5H0 Glycosyl transferase family 2 n=1 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1N5H0_9BACT Length = 463 Score = 206 bits (525), Expect = 1e-51, Method: Composition-based stats. Identities = 85/415 (20%), Positives = 167/415 (40%), Gaps = 30/415 (7%) Query: 24 FHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPC 83 FHS L F F + I G ++ R+ P + P+ ++I+P Sbjct: 48 FHSIRL---FALFGLGVLGINIFYYGFLMFLACLRYRPRR----TLSDGELPTCTVIVPA 100 Query: 84 FNEEKNVEETIHAALAQRYE--NIEVIAVNDGSTDKTRAILDRMAAQ-IPHLRVIHLAQN 140 +NE K+V + + LA Y +E++A++DGS D T + AA+ + + L +N Sbjct: 101 YNEGKSVLNALDSVLASDYPAAKLEILAIDDGSADDTWYWIKLAAARSGGRITPVKLEKN 160 Query: 141 QGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRT-RS 199 GK AL G +++E +V +D D++++ D + P + ++ V G+ R+ + Sbjct: 161 GGKRHALYRGIRQSRAEVIVTVDSDSVVEPDTLRLLNSP-FADSKIAGVAGSIRVLNLQD 219 Query: 200 TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAE-VGYWSDD------- 251 ++ ++ + +++ Q + G+VF G ++A+RR+AL + W D Sbjct: 220 GILPRMMDVNFVFGFEIMRSAQSVLGSVFCTPGALSAYRRAALLPFLDQWVDQKFFGEPA 279 Query: 252 MITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRL 311 I ED ++ L + I ++ A+ ++P + + K LRW +G Sbjct: 280 HIAEDRALATFLLADDHRIVFQRNAVARTIIPSSYRTTCKMLLRWGRGDVRETCSMYRFA 339 Query: 312 WRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCT 371 +RK ++ + F + T+W F L G++L Sbjct: 340 FRKFSWFHLGIQFNLLMQTMWLFLP---------VLLLPLTLAAACAAPEAFLQGLILGM 390 Query: 372 LCLLQFIVSLMIENR-YEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPK 425 + + R + + F I +FW+ V ++ M + Sbjct: 391 IAWSTIPAFIYSTRRGGSEAIFAYTFAIFKLVFLFWIGPYCLVSVRNSKWMTRGR 445 >UniRef50_C7QK51 Glycosyl transferase family 2 n=1 Tax=Catenulispora acidiphila DSM 44928 RepID=C7QK51_CATAD Length = 658 Score = 206 bits (525), Expect = 1e-51, Method: Composition-based stats. Identities = 98/437 (22%), Positives = 170/437 (38%), Gaps = 22/437 (5%) Query: 10 ILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAP 69 LC++ + + + EL + F+ +M + + V G +P P Sbjct: 219 ALCMMGSLAVVLIARAGAELFVAFLVIRALRAGVMDCIYVLLGLVRM-----AGPGSPRP 273 Query: 70 QLKDNPSISIIIPCFNEEKNVEETIHAA--LAQRYEN-IEVIAVNDGSTDKTRAILDRMA 126 + P + +IIP FNE +E T+ + A+ Y + VI +DGS D TRA+ + Sbjct: 274 D-ERRPPLEVIIPAFNEAVVIERTLRSIDRAARAYGGPVRVILCDDGSMDDTRALAEATI 332 Query: 127 AQIPHLRVIHLAQNQ-GKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPR 185 A+ + + GK++AL A +++Y+ ID D LD +A Y V L +P Sbjct: 333 AEFGYATGEVIQGRHAGKSVALNLALAECEADYVYRIDADCALDANAFVYSVPHFLKHPD 392 Query: 186 VGAVTGNPRIR-TRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAE 244 VG V + +T + +++ E G + V V G AFRR Sbjct: 393 VGLVGAFVLPKEPYTTWIDRMRALEMIFSFGFARVMLAEVDAVPCVPGTFCAFRRLPALA 452 Query: 245 VGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVF 304 +G + + ED+ + + + +PR + + +P T++ L QR RW +GG F Sbjct: 453 IGGFVHGTLGEDVFFTCSMARLGYRAAIDPRVISYEDVPTTVRQLRVQRFRWCKGGIMSF 512 Query: 305 LKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHT 364 R W + A L+ A+ L + L+I + Sbjct: 513 AAYTPFGCGAPGPRNWFKM------PLGAGKGLLKPFSSAMFLLTIELSILDPTVRHNVL 566 Query: 365 AGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPV----IFWMLSLATTLVSFT-R 419 +LL + IV +I Y L + ++W P F+M+ + + Sbjct: 567 RFVLLLLASQITSIVPRLIVMGYYRRLRLLPWMVLWIPFSFLKRFFMIEAFLSFGMRPVK 626 Query: 420 VMLMPKKQRARWVSPDR 436 L + + W S R Sbjct: 627 PPLRVRSRFPTWSSVFR 643 >UniRef50_D1B804 Glycosyl transferase family 2 n=2 Tax=Bacteria RepID=D1B804_THEAS Length = 450 Score = 206 bits (524), Expect = 2e-51, Method: Composition-based stats. Identities = 102/454 (22%), Positives = 179/454 (39%), Gaps = 37/454 (8%) Query: 1 MINRIVSFFILCLVLCI--PLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRER 58 ++ ++ + + PL + + V F ++ GV + Sbjct: 5 LVTDVLRMAAAAIRGLLEDPLSSQVWVVIFKFVPMVVFLELPYYAF-VLAGVLKYAMERA 63 Query: 59 HWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYE-NIEVIAVNDGS--- 114 PW E ++ +PS+S II C++E ++V+ TI Q Y I++I V DG+ Sbjct: 64 LLPWEE------VQYHPSVSCIITCYSEGEDVKLTIRTLAHQIYPGRIQIIPVIDGALVN 117 Query: 115 TDKTRAILDRM----AAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDR 170 + RA L+ M A L V+ Q G+ +L G + A+ E ++ +DGD D Sbjct: 118 SHTHRAALEMMDQVMRAPNRELLVLPKWQRGGRVSSLNAGLSVARGEVVMALDGDTSFDN 177 Query: 171 DAAAYIVEPMLYNPRVGAVTGNPRIRT-RSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFT 229 D + V +P V AV G R+R RS+L +Q EY IG+ K + V Sbjct: 178 DMVSKAVR-HFRDPNVAAVAGCLRVRNWRSSLAAALQGLEYMLAIGVSKTGLSQFNAVNN 236 Query: 230 VSGVIAAFRRSALAEVGYWSDDMITEDIDISWKL-----QLNQWTIFYEPRALCWILMPE 284 +SG FRR L + W ED+DI+ ++ + I ++P + P Sbjct: 237 ISGAFGIFRREVLNLIMGWDAGS-AEDLDITMRIKNYFGRYRSVRILFDPEVIGHTDAPP 295 Query: 285 TLKGLWKQRLRWAQGGAEVF-LKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIY 343 T +G +KQRLRW ++ K++ + W F L ++ F +V I+ Sbjct: 296 TFRGFFKQRLRWDGDLFYIYVRKHLKSF--SPSMMGWRNFITVTLGGLY-FQIVVPLILL 352 Query: 344 ---AVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIW 400 A +PL + + + + + L + + R + L L Sbjct: 353 VYTAFSFITLPLGTVFRILLLVYLFYLGVTLVFYLLGLAFVSERPRDDLILGVLLPLFPV 412 Query: 401 FPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSP 434 F + S A + ++ + ++P Sbjct: 413 FTFASRLWSAAAVIWE-----MVARGHLDSSMAP 441 >UniRef50_Q7XIF5 Probable mannan synthase 7 n=9 Tax=Magnoliophyta RepID=CSLA7_ORYSJ Length = 585 Score = 205 bits (523), Expect = 2e-51, Method: Composition-based stats. Identities = 82/414 (19%), Positives = 165/414 (39%), Gaps = 33/414 (7%) Query: 31 MRFVFFWPFFMSIMWIVGGVY----------FWVYRERHWPWGENAPAPQLK---DNPSI 77 ++ + MS+M +V + E + W A A + K + P + Sbjct: 92 LQVAVWVCMVMSVMLVVEATFNSAVSLGVKAIGWRPEWRFKWEPLAGADEEKGRGEYPMV 151 Query: 78 SIIIPCFNEEKNVEETIHAALAQRYENIEVIA-VNDGSTDKTRA-----ILDRMAAQIPH 131 + IP +NE + + +I AA ++ ++I V D STD + A++ + Sbjct: 152 MVQIPMYNELEVYKLSIGAACELKWPKDKLIVQVLDDSTDPFIKNLVELECESWASKGVN 211 Query: 132 LRVIHLAQNQG-KAIALKTGAA---AAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVG 187 ++ + + +G KA ALK G + EY+ D D + + V +++NP V Sbjct: 212 IKYVTRSSRKGFKAGALKKGMECDYTKQCEYIAIFDADFQPEPNFLLRTVPFLMHNPNVA 271 Query: 188 AVTGNPR-IRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVG 246 V + ++L+ ++Q + + + F+ +G +R +A+ E G Sbjct: 272 LVQARWAFVNDTTSLLTRVQKMFFDYHFKVEQEAGSATFAFFSFNGTAGVWRTTAINEAG 331 Query: 247 YWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLK 306 W D ED+D++ + LN W Y +P T +Q+ RWA GGA +F K Sbjct: 332 GWKDRTTVEDMDLAVRASLNGWKFIYVGDIRVKSELPSTYGAYCRQQFRWACGGANLFRK 391 Query: 307 NMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAG 366 + ++ + F+ +++F + + V + + L+ + Sbjct: 392 IAMDVLVAKDISLLKKFY-----MLYSFFLVRRVVAPMVACVLYNIIVPLSVMIPELFIP 446 Query: 367 ILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFW--MLSLATTLVSFT 418 + + + + ++ R NL FWI++ V+ M + T L+ + Sbjct: 447 --IWGVAYIPMALLIITTIRNPRNLHIMPFWILFESVMTVLRMRAALTGLMELS 498 >UniRef50_D0DDK6 Glycosyl transferase, family 2 n=1 Tax=Citreicella sp. SE45 RepID=D0DDK6_9RHOB Length = 618 Score = 205 bits (523), Expect = 2e-51, Method: Composition-based stats. Identities = 72/370 (19%), Positives = 144/370 (38%), Gaps = 13/370 (3%) Query: 69 PQLKDNPSISIIIPCFNEEKNVEETIHAA---LAQRYENIEVIAVNDGSTDKTRAILDRM 125 P + P I++IIP NE +++TIH+ Q + ++ ++ +++ S+D T I+ Sbjct: 252 PPGRFAPPITVIIPAHNEAHIIDDTIHSLDVAAEQYHNDVTLLVIDNNSSDATPKIVREA 311 Query: 126 AAQIPHLR-VIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNP 184 + +R + GKA AL G A +++Y+V ID D + +A ++ NP Sbjct: 312 LLRARRIRGRLLSEPVPGKAHALNRGIAETRTDYVVRIDADTQVSPEALWRVMR-HFSNP 370 Query: 185 RVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAE 244 + AV G + ++ E +GL + + V G+ A+ AL + Sbjct: 371 -LVAVAGGQALAPGGGFFDGVRDLEVILKVGLDQVAYGAGDAIIGVPGMFVAYNTQALRQ 429 Query: 245 VGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVF 304 G ++ M ED D + ++ + + +P A +P T + +QR+RW + V Sbjct: 430 AGSFASGMNGEDTDAAIRIAESGRRLVVDPSATFVSEVPRTFSHMREQRMRWFRSLFHVM 489 Query: 305 LKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHT 364 +N + L Y L + F+++A + + + I+ Sbjct: 490 ARNRQHFHLRNLSIRSKLLMPYLLINTARRAIALPFLLFATIFLLMAPD-PRSSISVQAV 548 Query: 365 AGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMP 424 L+ ++ +L + +L + + V S T + Sbjct: 549 LAFLMGAPMIMVIFAALTMCR------FRALLHLPQYLVFRMFRSYLTLESMLSISFQGY 602 Query: 425 KKQRARWVSP 434 + + SP Sbjct: 603 QTGVSARRSP 612 >UniRef50_Q0AQN9 Glycosyl transferase, family 2 n=2 Tax=Hyphomonadaceae RepID=Q0AQN9_MARMM Length = 632 Score = 205 bits (522), Expect = 3e-51, Method: Composition-based stats. Identities = 81/375 (21%), Positives = 145/375 (38%), Gaps = 10/375 (2%) Query: 46 IVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAA--LAQRYE 103 + F+ + W N + + +P ++++IP NE + +TI A A Y+ Sbjct: 228 VFLAQQFFTVQLASLGWMRNTLSRRRSFSPPVAVLIPAHNESHLIGQTIDAIDEAAAHYD 287 Query: 104 N-IEVIAVNDGSTDKTRAILDRMAAQIPHLR--VIHLAQNQGKAIALKTGAAAAKSEYLV 160 + ++ + + S+D T + + A+ R VI GKA AL G +Y+V Sbjct: 288 GPVRILLMENCSSDDTADVARQAIAKCRCARGEVIESII-PGKAKALNHGLELISEDYVV 346 Query: 161 CIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRT 220 ID D +D + + N VG V G P R+ L+ K + E + G + Sbjct: 347 RIDADTQIDPQSLRLAMR-HFANETVGTVGGLPLPLKRTGLLDKFRTIEVLNRHGYFQVA 405 Query: 221 QRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWI 280 + + + G+ +RR L E G ++M ED DI ++ + +PR Sbjct: 406 LGAFNGILGIPGMFCIYRRDVLMEAGGIVEEMNGEDTDIVLRMTNLGYRAVSDPRVKFRT 465 Query: 281 LMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGF 340 +P++++ L +QR RW + V N L++ N + + L + Sbjct: 466 EVPDSMEFLREQRTRWFRSLYHVTAHNREMLFQG-NLITGAVVLPFTLMNGARRAMMAPL 524 Query: 341 IIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIW 400 IY V L + I T A + + +V+ R + L + Sbjct: 525 AIYGVVLFFMFGGIYDHPHLTTVLAVMFGMPFIMACAVVAFW--RRPDLILYMPAYMAFR 582 Query: 401 FPVIFWMLSLATTLV 415 ++ L TLV Sbjct: 583 LLRSYYTLGATLTLV 597 >UniRef50_A4SFN6 Glycosyl transferase, family 2 n=1 Tax=Chlorobium phaeovibrioides DSM 265 RepID=A4SFN6_PROVI Length = 474 Score = 205 bits (521), Expect = 3e-51, Method: Composition-based stats. Identities = 89/405 (21%), Positives = 145/405 (35%), Gaps = 55/405 (13%) Query: 26 SGELMMRFVFFWPFFM---SIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIP 82 + L++ +F + M + IV G R +GE + + +S+II Sbjct: 9 ALYLLIAVIFIYFVAMHAIYLFLIVLGALEQKKFRREICFGEFSIISRSAQTFPVSVIIS 68 Query: 83 CFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAIL-DRMAAQ------------- 128 FNEEK + T+ L RY E++ VNDGS D T A+L +R + Sbjct: 69 AFNEEKMIVATVENVLQLRYPQFEIVVVNDGSVDGTMAVLTERFGLRLVHEVFREHILTK 128 Query: 129 ----------IPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVE 178 P L V+ +N +A A G A+ + ID D +L+ D+ Y+V Sbjct: 129 PVRGLYRSPEYPQLLVVD-KENGRRADANNAGVNHARYPLICQIDADCILEEDSLLYMVR 187 Query: 179 PMLYNPRVGAVTGNPRIRT--------------RSTLVGKIQVGEYSSIIGLIKRTQRIY 224 P L N V A TG R + Q EY + Sbjct: 188 PFLGNSDVIAATGIIRPSNGLVVSKGRIVSRGLPDRWLPLFQWLEYLRAFQWARGGLSRM 247 Query: 225 GNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKL--------QLNQWTIFYEPRA 276 ++ +SG +R VG + + +D +++ L + + I Y P Sbjct: 248 QSMLCMSGAYTFIKREVFLAVGGANPHAVVDDFELTLTLNRYIHTHKKERRMKIAYVPDP 307 Query: 277 LCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTT--IWAF 334 C+ +PETLK L QR W + + + N ++ + I A Sbjct: 308 GCYSEVPETLKALASQRNFWQRTLLQSIIWNRDMMFNPRYGLAGMFGMPFFFFFEAISAV 367 Query: 335 TCLVGFIIYAVQL---AGVPLNIELTHIAATHTAGILLCTLCLLQ 376 +II + L P+ + L A + + L+Q Sbjct: 368 VEATSYIIAPIALCIGMVTPIELLLLFSAGVLSGSFVSVVAVLMQ 412 >UniRef50_B8HDP3 Glycosyl transferase family 2 n=1 Tax=Arthrobacter chlorophenolicus A6 RepID=B8HDP3_ARTCA Length = 445 Score = 205 bits (521), Expect = 3e-51, Method: Composition-based stats. Identities = 78/368 (21%), Positives = 153/368 (41%), Gaps = 11/368 (2%) Query: 76 SISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIP--HLR 133 +SI+I NEE + T+ + Y + EV +D S+D T ++ + P L Sbjct: 72 LVSIVIAAHNEEAVIVRTLESIGTSTYRSFEVFVADDASSDLTGRLVRDFQLRHPTMDLH 131 Query: 134 VIHLAQNQGKAIALKTGA-AAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGN 192 ++ + +N GK AL + A+ ++ + +D D+++ + + +P V V N Sbjct: 132 LVRMRRNIGKGAALNSVLRRHARGQFAMTLDADSIIGPETITNALS-YFDDPCVAGVAAN 190 Query: 193 PRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDM 252 +I +T++G +Q E+ K+ + F V GV + +R L +V ++ D Sbjct: 191 VQILEETTVLGILQRFEHMIGYRS-KKFYSLLNAEFVVGGVASTYRMRVLRKVDFYDTDT 249 Query: 253 ITEDIDISWKLQLNQ---WTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMT 309 +TEDI +S K+ + + Y + +L+ L KQR RW G + +K Sbjct: 250 VTEDIGLSAKITNLGNRRFRMLYGADVVAKTEGVLSLRALAKQRYRWKYGSIQNLIKYRG 309 Query: 310 RLWRK-ENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGIL 368 + + ++ + + FT LV + + + L + A I Sbjct: 310 MMLNPSRRYTPTLTYYRMPMALLSEFTLLVSPLAWTYAV-YWSLTTHTPALVLGAYATIT 368 Query: 369 LCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQR 428 TL + +L + R +L + + +++ + L+ A + +R L + + Sbjct: 369 AYTLLTVWMDENLTVRERSRLSLYAPTAYFLFYIMDLVQLTAALRCILRSRG-LFRRPEI 427 Query: 429 ARWVSPDR 436 W SP R Sbjct: 428 NTWKSPQR 435 >UniRef50_D1BIJ9 Glycosyl transferase n=2 Tax=Micrococcineae RepID=D1BIJ9_SANKS Length = 497 Score = 205 bits (521), Expect = 4e-51, Method: Composition-based stats. Identities = 81/362 (22%), Positives = 141/362 (38%), Gaps = 15/362 (4%) Query: 64 ENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYEN--IEVIAVNDGSTDKTRAI 121 N A P ++++IP +NE V ++ + Y + V V+D STD T + Sbjct: 43 RNHYAKAAPYLPRVAVVIPAWNEGAVVGNSVDMLMRLDYPADALRVYVVDDASTDDTPDV 102 Query: 122 LDRMAAQIPH--LRVIHLAQNQGKAIALKTGAAAA----KSEYLVCIDGDALLDRDAAAY 175 + + Q P + +GKA L G A E L+ +D D + D+ Sbjct: 103 VLAKSRQYPGQVFHLRREKGGEGKAHTLNHGLAVILEDDWMEALLIMDADVVYLPDSLRK 162 Query: 176 IVEPMLYNPRVGAVTGNPRIRT-RSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVI 234 + L + +VGAVT R + + + EY+ +R Q + G V ++G Sbjct: 163 MTR-HLADEQVGAVTAYIREGSAEKNYLTRFIALEYALSQAASRRAQNVMGAVACLAGGA 221 Query: 235 AAFRRSALAEVGY-WSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQR 293 R+ L +G + ED ++K QL + +EP A+ P+++ LWKQR Sbjct: 222 QLHSRANLEALGGRIDTSSLAEDTVTTFKTQLAGRKVVFEPHAVVLAEEPQSVAALWKQR 281 Query: 294 LRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLN 353 LRW +G + +R + + L V ++ +V L G+ L Sbjct: 282 LRWGRGNVHITRMYRRLWFRPQEGNRLG-SIAFGLVWFGVAFQPVAMVLSSVGLVGLYLL 340 Query: 354 IELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATT 413 T T L + ++SL ++ +L ++FP + +L T Sbjct: 341 ESTTSAQVFRTIWALAACTYVFTMVMSLALDPTTARK---TLREAVFFPGLVSVLITVET 397 Query: 414 LV 415 Sbjct: 398 FF 399 >UniRef50_D1J3P1 Whole genome shotgun sequence of line PN40024, scaffold_74.assembly12x (Fragment) n=16 Tax=Spermatophyta RepID=D1J3P1_VITVI Length = 449 Score = 204 bits (520), Expect = 5e-51, Method: Composition-based stats. Identities = 72/339 (21%), Positives = 132/339 (38%), Gaps = 18/339 (5%) Query: 74 NPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIA-VNDGSTDKTRAI-----LDRMAA 127 P + + IP +NE++ + +I AA + + +I V D STD T R A+ Sbjct: 12 YPMVLVQIPMYNEKEVYQLSIGAACGLSWPSERIIIQVLDDSTDPTIKDLVEMECQRWAS 71 Query: 128 QIPHLRVIHLAQNQG-KAIALKTGAAAA---KSEYLVCIDGDALLDRDAAAYIVEPMLYN 183 + +++ G KA ALK G + + +Y+ D D + D + +++N Sbjct: 72 KGINIKYEIRGNRHGYKAGALKEGMKHSYVKQCDYVAIFDADFQPEPDFLWRTIPFLVHN 131 Query: 184 PRVGAVTGNPR-IRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSAL 242 P +G V + + + L+ ++Q + + F +G +R SAL Sbjct: 132 PEIGLVQARWKFVNSDECLMTRMQEMSLDYHFTVEQEVGSSSHAFFGFNGTAGVWRISAL 191 Query: 243 AEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAE 302 E G W D ED+D++ + L W Y +P T K Q+ RW+ G A Sbjct: 192 NEAGGWKDRTTVEDMDLAVRASLKGWKFVYVGALKVKNELPSTFKAYRYQQHRWSCGPAN 251 Query: 303 VFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAAT 362 +F K + R + +W Y + + + +V I+ + V Sbjct: 252 LFKKMAIEIARNKKVNLWKK--VYVIYSFFFVRKIVAHIVTFLFYCVVFPATVFFPEVEV 309 Query: 363 HTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWF 401 G + ++L+ +L +FWI++ Sbjct: 310 PMWG-----AVYIPSTITLLNAVGTPRSLHLLVFWILFE 343 >UniRef50_A4WR49 Cellulose synthase (UDP-forming) n=6 Tax=Rhodobacteraceae RepID=A4WR49_RHOS5 Length = 788 Score = 204 bits (519), Expect = 6e-51, Method: Composition-based stats. Identities = 87/396 (21%), Positives = 146/396 (36%), Gaps = 28/396 (7%) Query: 32 RFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNE-EKNV 90 FVF F + +G + + P Q +D P++ I++P +NE + Sbjct: 97 SFVFAMLLFAVETFSIGIFFLNGFLSADPTDRPFPPQVQPEDLPTVDILVPSYNEPADML 156 Query: 91 EETIHAALAQRYENI--EVIAVNDGSTDKT-----RAILDRMAAQIPHLR--------VI 135 T+ AA Y + V+ +DG TD+ I + + L+ V Sbjct: 157 SVTLSAAKNMIYPSRLRTVVLCDDGGTDQRCLSPDPEIARQAQERRRELQQLCRELGVVY 216 Query: 136 HLAQN--QGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTG-- 191 + KA + K + +V D D + RD A V + +P++ V Sbjct: 217 STRERNEHAKAGNMSAALERLKGDLVVVFDADHVPSRDFLARTVGYFVEDPKLFLVQTPH 276 Query: 192 ---NPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYW 248 NP R+ +G E I R +G F G A RR AL +VG + Sbjct: 277 FFINPDPIQRNLALGDDCPPENEMFYAKIHRGLDRWGGAF-FCGSAAVLRRRALDDVGGF 335 Query: 249 SDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNM 308 + + ITED + + ++ W Y RA+ L PET +QR RWA G + L+ Sbjct: 336 AGETITEDAETALEIHARGWKSLYIDRAMIAGLQPETFASFIQQRGRWATGMMQ-MLRLK 394 Query: 309 TRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGIL 368 L+R Y + + F LV L + +++ + Sbjct: 395 NPLFR--PGLGIAQRLCYLNSMSFWFFPLVRMTFLIAPLIYLFFGVQVFVATFEEVLAYM 452 Query: 369 LCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVI 404 L + F+V + +R L S ++ + P + Sbjct: 453 PGYLA-VSFLVQNALFSRQRWPLVSEVYEVAQAPYL 487 >UniRef50_C6HYN9 Glycosyl transferase n=1 Tax=Leptospirillum ferrodiazotrophum RepID=C6HYN9_9BACT Length = 384 Score = 203 bits (517), Expect = 9e-51, Method: Composition-based stats. Identities = 83/338 (24%), Positives = 140/338 (41%), Gaps = 25/338 (7%) Query: 27 GELMMRFVFFWPFFM--SIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCF 84 + F F +P + I+W++ ++ P ++ PAP P +S++IP Sbjct: 8 LFWLFVFAFLYPLVLYPGIIWLLSRLF-------PEPVRKSRPAP----LPFVSLVIPAH 56 Query: 85 NEEKNVEETIHAALAQRYE--NIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQG 142 NEEK V I +L+ Y EV+ +DGSTD+T AI + P R++ +G Sbjct: 57 NEEKVVAAKIENSLSLDYPADRYEVLLASDGSTDRTAAIARE--TRHPAFRLLDFPVRRG 114 Query: 143 KAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRST-- 200 K L A+ E +V D A+L DA VE +P VG V G RI T Sbjct: 115 KLAVLTDAVGEARGELVVLTDTSAILAPDALLRAVEN-FSDPGVGCVAGRYRIAREMTPE 173 Query: 201 LVGKIQVGEYSSIIGLIKRT-QRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDI 259 L + + + +R + + + G RRS V + I +D I Sbjct: 174 LDARGESERGYFEFEIFQRICESRFHSTLGAHGAFYMIRRSLFPRV---PEGTINDDFVI 230 Query: 260 SWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRM 319 + + YE +A+ + + T+K +++R+R +QG + + L K+ R Sbjct: 231 PMLILAQGFRTVYEEKAVAYEIHLATVKSEFRRRVRISQGNFQQIFLLLPALGLKDP-RA 289 Query: 320 WPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELT 357 +FF + + + L+G + + L P I Sbjct: 290 LFVFFSHKVIRAFQPITLLGLLATSALLPAWPFRIFFL 327 >UniRef50_C9NZ15 Putative uncharacterized protein n=1 Tax=Vibrio coralliilyticus ATCC BAA-450 RepID=C9NZ15_9VIBR Length = 391 Score = 203 bits (517), Expect = 1e-50, Method: Composition-based stats. Identities = 84/365 (23%), Positives = 151/365 (41%), Gaps = 23/365 (6%) Query: 72 KDNPSISIIIPCFNE-EKNVEETIHAALAQRYE-NIEVIAVNDGSTDKTRAILDRMAAQI 129 ++P +S+ +P +NE E+ + ++ Q Y I+VI ++DGST+ + + Sbjct: 37 DNDPEVSVFLPFYNENEEYLICALNKLNEQNYSSRIQVIVIDDGSTNDAISRVKAWITST 96 Query: 130 P---HLRVIHLAQNQG-KAIALKT--GAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYN 183 +++ A N G K AL A + V +D D ++ + +V + + Sbjct: 97 EINHDFQIVERAVNGGRKGFALDHVLHLGVATGDVYVVVDSDTYIEPNGIYELVTKLWSD 156 Query: 184 PRVGAVTGNPRIRT-RSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSAL 242 + AV G + +GK+Q E+ S G I+ Q G V ++G A R S + Sbjct: 157 EKYAAVCGYITPENHEVSFIGKLQHYEHVSFYGAIRAAQDKLGCVPVLAGAFVAHRASVV 216 Query: 243 AEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAE 302 ++G WS+ ++ EDI WK N + Y P+A P KGL+ QR RWA+G E Sbjct: 217 KKLGGWSEWLV-EDIAWCWKAISNGYRTGYAPKAKATTQCPTDAKGLFNQRRRWARGRVE 275 Query: 303 VFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAAT 362 + + W P F I+ + ++ F+ VP+ I +I Sbjct: 276 AYTEAWKTHW-SAGLCASPWFLLTAAQYIFPSSIIL-FVFMVATGIWVPIAIGALNIIF- 332 Query: 363 HTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVML 422 LL + F +S + N+ + + + ++ L+ ++ +L Sbjct: 333 ---YFLLVNSYIRDFDLSGDLSNQ-------KIIKAPIYSALLESITWIPNLLGYSDEIL 382 Query: 423 MPKKQ 427 KK Sbjct: 383 GKKKG 387 >UniRef50_O67406 Cellulose synthase catalytic subunit n=1 Tax=Aquifex aeolicus RepID=O67406_AQUAE Length = 759 Score = 203 bits (516), Expect = 1e-50, Method: Composition-based stats. Identities = 88/452 (19%), Positives = 159/452 (35%), Gaps = 39/452 (8%) Query: 5 IVSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGE 64 +V + + + + ++ F S+ + G++F + P Sbjct: 80 LVFIMLRYFWWRTFNTINTDNVWNFIFSTALYFAEFYSVTIALLGIFFSIR-----PIDR 134 Query: 65 NAPAPQLKDNPSISIIIPCFNEEKNVEETIH-AALAQRYE--NIEVIAVNDGST-----D 116 A ++ P++ + IP +NE + ET AA+ Y V ++DG T D Sbjct: 135 KAIKVDMESLPTVDVFIPTYNEPPEIPETTALAAINMDYPSDKFNVYILDDGGTKQRLND 194 Query: 117 KTRAILDRMAAQ---------------IPHLRVIHLAQN-QGKAIALKTGAAAAKSEYLV 160 + + ++ + +N KA + K + ++ Sbjct: 195 PDPERREYFRKRAEELKSFVERLRKLGYKNIHYLTREKNVHAKAGNINEALKKTKGDLIL 254 Query: 161 CIDGDALLDRDAAAYIVEPMLYNPRVGAVTG-----NPRIRTRSTLVGKIQVGEYSSIIG 215 +D D + +D V + NP+V V NP ++ V GE Sbjct: 255 ILDADHVPSKDFLKETVGFFVKNPKVFLVQTPHTFYNPDPIEKNLGVFGRMPGENEMFYF 314 Query: 216 LIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPR 275 LI++ ++ + F G A RR L EVG +TED + + +L + Y R Sbjct: 315 LIQKGFDLWNSSF-FCGSAALLRRKYLEEVGGIQTTTVTEDAETALELHSRGYESVYYDR 373 Query: 276 ALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFT 335 L + L PETL G+ QR+RWAQG ++F+ L + W Y + + F Sbjct: 374 PLIFGLNPETLSGMIVQRIRWAQGMIQIFILKNPIL---KKGLKWYQKLAYFNASFFWFF 430 Query: 336 CLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSL 395 L I LA + + + + + L ++ + +R S + Sbjct: 431 GLARTIFLIAPLAYLLFGLHIYDASLAEVLAYPIPHF-LASLLLFYYLFSRVRWPFISEI 489 Query: 396 FWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQ 427 + I IF L + PK + Sbjct: 490 YESILGIFIFIPSILTLLNPKNPTFRVTPKGE 521 >UniRef50_C6MI20 Glycosyl transferase family 2 n=1 Tax=Nitrosomonas sp. AL212 RepID=C6MI20_9PROT Length = 441 Score = 203 bits (516), Expect = 1e-50, Method: Composition-based stats. Identities = 88/426 (20%), Positives = 171/426 (40%), Gaps = 29/426 (6%) Query: 24 FHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPC 83 F +++ FV F+ + + + + + P L P ++ I C Sbjct: 21 FQYAFMLIPFVLFFELPLYFINWMSAFRYLIKNAIEAPLA-------LHYYPHVTCAITC 73 Query: 84 FNEEKNVEETIHAALAQRYE-NIEVIAVNDGS-----TDKTRAILDRMAAQIPH--LRVI 135 + E K V+ T+ + L Q Y +IE+IA+ DG+ T + L + ++ P L +I Sbjct: 74 YAEGKAVQSTVISLLEQVYSGHIEIIAIVDGAKRNQATYQALVELISLVSRYPKRSLVII 133 Query: 136 HLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRI 195 Q G+ +L G A+ E +DGD D + NP + AVTG R+ Sbjct: 134 PKFQRGGRVSSLNAGLNRARGEIFFALDGDTSFDNQMVHHA-ATHFRNPGMVAVTGPMRV 192 Query: 196 RTRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMIT 254 R T++ ++Q EY + + K G + V G FR+S L +VG W+ Sbjct: 193 RNAGKTIMTRLQSLEYMLTMQVGKLGFANLGVINNVPGAFGIFRKSFLQQVGGWNTGT-A 251 Query: 255 EDIDISWKLQLNQWT-----IFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMT 309 ED+D++ +++ I +EPRA+ +PE+ KQRLRW ++ Sbjct: 252 EDLDLTLRIKQYYKRYPHLQIEFEPRAVSHTDVPESFMDFLKQRLRWDGDLWYIYSNKHK 311 Query: 310 RLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAG-VPLNIELTHIAATHTAGIL 368 F + ++ +IY + +A +P + L + + Sbjct: 312 HGISASMMGWRNFIFLFWYGILFQIVMPFSIVIYTIYIAVKLPFHAFLLSMILVYAFYFC 371 Query: 369 LCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQR 428 L + + +++ + + + N ++ + +P ++ + + L ++ K Sbjct: 372 LIAIQYILYLLFITDRRKEDCN---AILILPIYPAFQFISRVWSALAIL--NQIINKGHL 426 Query: 429 ARWVSP 434 ++P Sbjct: 427 DSAMAP 432 >UniRef50_A3Q6L3 Glycosyl transferase, family 2 n=6 Tax=Actinomycetales RepID=A3Q6L3_MYCSJ Length = 485 Score = 202 bits (515), Expect = 2e-50, Method: Composition-based stats. Identities = 96/403 (23%), Positives = 160/403 (39%), Gaps = 45/403 (11%) Query: 10 ILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENA--- 66 +L L+ +PL V L+ + M ++ + + + P + Sbjct: 38 VLVLIGLMPLIVLLAVQAPLLSHGALLLGYGMLVLTATVSMMYLGFARYEDPSVHPSTDA 97 Query: 67 ------PAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRA 120 P L P +S+++ +E +EE + + Y ++EVI ++D STD T Sbjct: 98 GARDRCDFPALPPLPRVSLLVAVRDEVDGIEECVRTMVGSAYPDLEVIVIDDASTDGTPD 157 Query: 121 ILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPM 180 +L R+AA++ + VI NQGK AL G A E L D D +L DA + V + Sbjct: 158 VLRRLAAEL-DVTVIFKEINQGKKHALTDGVRVASGEVLAFTDSDCVLAPDALSRCVRAL 216 Query: 181 LYNPRVGAVTGNPR-IRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRR 239 + NPR+GAV+G+ R + T++ + Q Y + K + +GNV VSG +A FRR Sbjct: 217 VENPRLGAVSGHARALNAADTVLTRAQDTWYDGQFRVAKAAEATFGNVTCVSGPLAVFRR 276 Query: 240 SALAE-VGYWSDDM-------ITEDIDIS-------WKLQ------------------LN 266 A+ + W++D D ++ WK Q Sbjct: 277 DAIVNYLPAWANDTFLGREFRFATDRQLTGYVLGQVWKGQALKRRYADDPLVADHDHPER 336 Query: 267 QWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEY 326 +W + Y A W +P + KQ++RW + + +WR F LF+ + Sbjct: 337 RWLVGYVRSAHVWTTVPARFRPFLKQQVRWKKSFIRNLCFTGSFMWR-RGFGAAALFYGH 395 Query: 327 CLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILL 369 L A V +++A L + T L Sbjct: 396 VLFVAVAPLMAVRHLVWAPANGLYFLTLLYLCGVVTKGFAWAL 438 >UniRef50_A8USY3 Cellulose synthase (UDP-forming) n=1 Tax=Hydrogenivirga sp. 128-5-R1-1 RepID=A8USY3_9AQUI Length = 717 Score = 202 bits (514), Expect = 2e-50, Method: Composition-based stats. Identities = 82/418 (19%), Positives = 147/418 (35%), Gaps = 26/418 (6%) Query: 13 LVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMW------IVGGVYFWVYRERHWPWGENA 66 ++ + + V+ ++ + P F ++++ ++ + F+ R Sbjct: 9 FLILLSVSVSTYYIVWRLGTLNPDAPLFSAVLYGAEVYGLIASIMFFYMVWRLPNRKTKD 68 Query: 67 PAPQLKDNPSISIIIPCFNEEKNV-EETIHAALAQRYENIEVIAVNDGSTDKTRAILDRM 125 P LK + I IP NE + +TI A++ RY + ++DGS + R + + + Sbjct: 69 PPKDLK----VDIFIPTLNESPELLRKTIQGAISVRYPH-RTYVLDDGSRREVRELCEEL 123 Query: 126 AAQIPHLRVIHLAQN-QGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNP 184 + I N GKA L +++ +D D + D + + Sbjct: 124 GCRY-----IPRYTNEHGKAGNLNNALGKTDGDFIAVLDADHVPQPDFLDRTLG-YFADE 177 Query: 185 RVGAVTGNPRIRTRST----LVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRS 240 V V + LV E S +I R + + F G A RR Sbjct: 178 EVAFVQTPQDFYNVDSYQHRLVRGKLWNEQSLFFRVIMRGKDRTNSAF-FCGSCAVVRRK 236 Query: 241 ALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGG 300 AL ++G ++ +TED+ S +L W Y P L + + P L QR RW +G Sbjct: 237 ALEDIGGFATGTVTEDLHTSIRLHAKGWKSVYYPHVLAYGVAPADLAPFKNQRGRWGEGA 296 Query: 301 AEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAG--VPLNIELTH 358 +VF K L R +F T + F + + V L +P+++ L Sbjct: 297 MQVFAKENPLLTRGLTLPQRINYFASMSTYLDGFQKAIFYTAPVVVLLTGKLPISVSLKE 356 Query: 359 IAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVS 416 I L + E + F + + F L +V+ Sbjct: 357 FLPVFLPHIFLSVWAFEEASRGYGRMALLEQYNMARFFTFMKSLLGFIRLKRPRFMVT 414 >UniRef50_Q8S7W0 Probable mannan synthase 4 n=4 Tax=Magnoliophyta RepID=CSLA4_ORYSJ Length = 549 Score = 202 bits (513), Expect = 3e-50, Method: Composition-based stats. Identities = 72/398 (18%), Positives = 141/398 (35%), Gaps = 41/398 (10%) Query: 8 FFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAP 67 + C+ + V M+ + MS++ +V ER + W Sbjct: 38 VLQFAVWACMAMSV--------MLVLEVAY---MSLVSLVAVKLLRRVPERRYKWEPITT 86 Query: 68 A------------------PQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIA 109 + P + + IP +NE++ + +I AA A + +I Sbjct: 87 GSGGVGGGDGEDEEAATGGREAAAFPMVLVQIPMYNEKEVYKLSIGAACALTWPPDRIII 146 Query: 110 -VNDGSTDKTRAI-----LDRMAAQIPHLRVIHLAQNQG-KAIALKTGAAAA---KSEYL 159 V D STD A + +++ +G KA ALK G + +++ Sbjct: 147 QVLDDSTDPAIKDLVELECKDWARKEINIKYEIRDNRKGYKAGALKKGMEHIYTQQCDFV 206 Query: 160 VCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRST-LVGKIQVGEYSSIIGLIK 218 D D + D + +++NP++G V L+ +IQ + + Sbjct: 207 AIFDADFQPESDFLLKTIPFLVHNPKIGLVQTRWEFVNYDVCLMTRIQKMSLDYHFKVEQ 266 Query: 219 RTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALC 278 + + F +G +R SA+ E G W D ED+D++ + L W Y Sbjct: 267 ESGSSMHSFFGFNGTAGVWRVSAINEAGGWKDRTTVEDMDLAVRASLKGWQFLYVGDIRV 326 Query: 279 WILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLV 338 +P T K Q+ RW G A +F K T + + + +W + + + Sbjct: 327 KSELPSTFKAYRHQQHRWTCGAANLFRKMATEIAKNKGVSVWK-KLHLLYSFFFVRRVVA 385 Query: 339 GFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQ 376 + + +PL++ + ++ + + T + Sbjct: 386 PILTFLFYCVVIPLSVMVPEVSIPVWGMVYIPTAITIM 423 >UniRef50_Q1D6P4 Glycosyl transferase, group 2 n=3 Tax=Cystobacterineae RepID=Q1D6P4_MYXXD Length = 417 Score = 202 bits (513), Expect = 3e-50, Method: Composition-based stats. Identities = 83/365 (22%), Positives = 148/365 (40%), Gaps = 28/365 (7%) Query: 36 FWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIH 95 +P + M ++ Y R G P ++I+IP FNE + + + Sbjct: 3 LFPIQLLFMVVLMNRYVLGPLLRRLK-GRKFDRVDDTYLPRVAIVIPLFNEGEGIYHGVR 61 Query: 96 AALAQRYEN--IEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAA 153 + L Q Y + ++++ V+D S D + A + A P++ V+ +N GK + A Sbjct: 62 SLLEQDYPSHLLQIVVVDDCSKDDSYAWALKAAEGNPNVIVMRNPENMGKRKGINRAVKA 121 Query: 154 AK-SEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRS-TLVGKIQVGEYS 211 A +E +V +D D ++D+ A +V + PR+ AV G + R + ++ ++ Sbjct: 122 ATDAEIIVSVDSDVIVDKAAVRQLVRRFVS-PRIAAVGGRTYVTNRHQNWMTRMIEIKFH 180 Query: 212 SIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDM--------ITEDIDISWKL 263 +K +R + V +SG + A+RR L E+ + ED ++ ++ Sbjct: 181 FAQEWLKDLERSFRQVMCLSGCLTAYRRHVLLELEPILEARAIAGIPIKYGEDRFLTRQI 240 Query: 264 QLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRM---- 319 + + Y A C+ P TL G + Q+LRW + L ++ WR Sbjct: 241 VKHDYETVYTLDAFCFTAAPSTLAGYFSQQLRWRRSNLVDLLGGLSHAWRLHPVVAVHYV 300 Query: 320 --WPLFFEYCLTTIW-----AFTCLVGFIIYAVQLAGVPLNIELTHIAA---THTAGILL 369 L Y L + F ++ F + L GV IE H+ A H A L Sbjct: 301 SQLALLLSYPLVIVHNVLTGEFWDILAFHFLVIGLLGVIYRIETRHLPADRRVHGASFLP 360 Query: 370 CTLCL 374 L + Sbjct: 361 MALLM 365 >UniRef50_B6AKK5 Glycosyltransferase n=1 Tax=Leptospirillum sp. Group II '5-way CG' RepID=B6AKK5_9BACT Length = 714 Score = 201 bits (512), Expect = 4e-50, Method: Composition-based stats. Identities = 86/417 (20%), Positives = 154/417 (36%), Gaps = 29/417 (6%) Query: 29 LMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAP-QLKDNPSISIIIPCFNEE 87 +M+ V + M+ + G + V R + P P + K+ P++ ++IP +NE Sbjct: 109 VMLVLVEAYAAVMTFL----GFFVMVSPVRRT----SPPLPKEEKECPTVDVMIPVYNEP 160 Query: 88 -KNVEETIHAALAQRYE--NIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKA 144 + TI AA Y + V ++DG R ++ ++ ++ + KA Sbjct: 161 VDVIRPTIFAASQLEYPLSRLRVWILDDGR----RKEIEALSGELGVGYLTRPDNKGAKA 216 Query: 145 IALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTL--- 201 L + + D D + L P + V +R Sbjct: 217 GNLNHALGKTDGDLIAIFDCDHVPLPRFLQKTAGFFLNRPDLALVQTPHHFYSRDPFERN 276 Query: 202 --VGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDI 259 G GE +I+ ++ + G A RRSAL E+G + + +TED Sbjct: 277 IGFGNQVPGEPDLFYHVIQPGMDLWNAAY-FCGSAAVLRRSALKEIGGFRTETVTEDAHT 335 Query: 260 SWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRM 319 S L + +Y AL L P++++ L KQR+RW +G ++F + + +F Sbjct: 336 SLCLHDRGYRSYYLEEALVTGLSPDSMRDLIKQRVRWCRGMIQIFRIDNPLFKKGLSF-- 393 Query: 320 WPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIV 379 P Y + F + L + I H LL L + Q Sbjct: 394 -PQKLCYMNAIFYWFFSIPRLFFLVAPLLYIYFRIYSVHAGLEDMLLYLLPYLLVAQG-- 450 Query: 380 SLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 S ++ R +L S+ + I P+ F+++ A T + F + + DR Sbjct: 451 SSLLFYRSRRSLLWSIVYEI--PLSFYLIFPAFTALLFPKKGTFHVTPKGTRTDSDR 505 >UniRef50_B7KYJ4 Glycosyl transferase family 2 n=2 Tax=Methylobacterium extorquens group RepID=B7KYJ4_METC4 Length = 423 Score = 201 bits (512), Expect = 4e-50, Method: Composition-based stats. Identities = 89/359 (24%), Positives = 143/359 (39%), Gaps = 7/359 (1%) Query: 77 ISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIH 136 +S+IIP FN + TI + Q +E+I V+DGSTD TRA+ +R VI Sbjct: 55 VSVIIPTFNGGSGLAPTIASLRQQTLSPLEIIIVDDGSTDDTRAVANRARQDGLVDMVIC 114 Query: 137 LAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIR 196 G++ A+ A A + + ID D +L+RDA A + +PRV GN IR Sbjct: 115 HGTRCGRSAAINAAARFASGDLIFTIDADTVLERDAIARLANA-FNDPRVAGACGNIAIR 173 Query: 197 TRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITE 255 S +L +Q EY I K + G + +SG + +RR A G E Sbjct: 174 NESASLWTGLQSVEYMMSISAGKSILDVVGAIACLSGACSMYRRDVFARQGGLDVG-PGE 232 Query: 256 DIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKE 315 D++ S +L+ + + + P A P GL +QR RW + + L Sbjct: 233 DLEFSLRLRRLGYRVRFVPDAWAETDGPVAAVGLLRQRARWDRDALRIRSMMYGELSFFH 292 Query: 316 NFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLL 375 P + ++ + Y++ + V L I LL + L Sbjct: 293 PLERLPDTLQRLDFIVFDLIPTMSLPFYSIYI--VILFGPAHAILFIAAIYFLLLWISLF 350 Query: 376 QFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSP 434 ++ ++ +R SL FP + VS++ +L +R +V P Sbjct: 351 NMALAFVLFSRS--PSFFSLGAAFIFPFYQGIYMKCARFVSYSSEILFATSRRDDYVPP 407 >UniRef50_D0LRC3 Glycosyl transferase family 2 n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LRC3_HALO1 Length = 488 Score = 201 bits (512), Expect = 4e-50, Method: Composition-based stats. Identities = 86/373 (23%), Positives = 146/373 (39%), Gaps = 27/373 (7%) Query: 49 GVYFWVYRERHWPWGENAPAPQLK--------------DNPSISIIIPCFNEEKNVEETI 94 G + R+W E PA + P +++ +P +NE E I Sbjct: 19 GAHRLWLTVRYWRLRERGPAVAALPQAPADAPADAAIAELPVVTVQLPIYNERCVAERLI 78 Query: 95 HAALAQRYENIEV-IAVNDGSTDKTRAILDRMAAQIPH----LRVIHLAQNQG-KAIALK 148 A A + + I V D S+D T A+ A + + H QG KA AL+ Sbjct: 79 DAVAALDWPRDRLHIQVLDDSSDDTAALCRDKVAALRRAGYDIEHRHRQDRQGFKAGALE 138 Query: 149 TGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRS-TLVGKIQV 207 G AA+K +++ +D D ++ D + +PRVG V R +++ +IQ Sbjct: 139 AGLAASKGAFVLILDADFVVPSDFLRAAMGC-FADPRVGMVQTRWAHLNRDASVLTRIQA 197 Query: 208 GEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQ 267 + + + G+ F +G +RR A+ G W D +TED+D+S++ L Sbjct: 198 LLLDGHFVVDQTARARSGHFFNFNGTAGIWRREAIVAAGGWQHDTLTEDLDLSYRALLAG 257 Query: 268 WTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYC 327 W Y +PE + Q+ RWA+G EV K + + + + W + + C Sbjct: 258 WRFEYLLERETPAELPEDMNAFKSQQFRWAKGSLEVAKKLLPAVL--GSAQPWRVKLDAC 315 Query: 328 LTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRY 387 LV ++ + L ++ A AG+L T L F + M R Sbjct: 316 FHLTQNLPYLVTLVLLLCAAPVLVLAGDMPSWLAAVHAGLLAATA--LTFAIYCMTAQRA 373 Query: 388 -EHNLTSSLFWII 399 + +L + Sbjct: 374 LRRSPWRTLCLLP 386 >UniRef50_A0QPQ1 Chitin synthase n=1 Tax=Mycobacterium smegmatis str. MC2 155 RepID=A0QPQ1_MYCS2 Length = 431 Score = 201 bits (512), Expect = 4e-50, Method: Composition-based stats. Identities = 94/430 (21%), Positives = 174/430 (40%), Gaps = 34/430 (7%) Query: 8 FFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAP 67 ++ L+L + L AY M+ FW + I+ F + P + Sbjct: 24 VIVMALLLIVMLIFAYKVVSLHNMKDEPFWAAYGLIITAFIFTRFGLAALYRPPRRDTG- 82 Query: 68 APQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYEN--IEVIAVNDGSTDKTRAILDRM 125 P ++II+P +NE ++ T+ + Y + V+ V+D S+D T + Sbjct: 83 ----NYWPVVAIIVPAYNE-PDIARTLLHCVGVDYPRELLRVVVVDDKSSDDTLRRITDF 137 Query: 126 AAQIPHLRVIHLAQNQGKAIALKTGAAAAK-SEYLVCIDGDALLDRDAAAYIVEPMLYNP 184 AA+ P+L VI N GK A+ TG AAA ++ V ID D+ + RDA ++ + Sbjct: 138 AAEHPNLTVIPHEVNGGKRRAMATGMAAADDADLFVFIDSDSQVTRDAI-RVIASYFADS 196 Query: 185 RVGAVTGNPRIRTRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALA 243 VGAV G+ + S ++ ++Q +Y + K + ++G+V SG +A+R A+ Sbjct: 197 SVGAVCGHTDVTNISHNILTRMQAMQYYIAFRIYKSAEALFGSVTCCSGCFSAYRADAVR 256 Query: 244 EV-GYW-------SDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLR 295 V W +D ++ L L W + Y P A + +PE LK +Q+LR Sbjct: 257 PVADKWLNQTFLGRPSTFGDDRSLTNYL-LRDWRVLYAPDAQAYTNVPEHLKQFLRQQLR 315 Query: 296 WAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIE 355 W + + M + RK + L I + F++ ++ +P Sbjct: 316 WKKSWLREAPRAMCAVRRKNPVMVVMFALSIVLPLIAPQVVMRAFVVQPHFISQLPF--- 372 Query: 356 LTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLV 415 + G+ + F +R +F+ +++ +I +L + +V Sbjct: 373 ------WYFGGVAAIAVIYGLF----YRLHRPVKRWYQGIFFTMFYTIIL-VLQMPYAMV 421 Query: 416 SFTRVMLMPK 425 + + Sbjct: 422 TIRDSKWGTR 431 >UniRef50_Q30R23 Glycosyl transferase, family 2 n=3 Tax=Campylobacterales RepID=Q30R23_SULDN Length = 889 Score = 200 bits (510), Expect = 7e-50, Method: Composition-based stats. Identities = 76/402 (18%), Positives = 145/402 (36%), Gaps = 16/402 (3%) Query: 21 VAYFHSGELMMR---FVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSI 77 V Y + G +M + P M + ++ P + P + Sbjct: 402 VHYMNFGTWVMWGMGTILMVPLTMITLAKANELFKC--TIGRPPERLIPLDLKSDHVPFV 459 Query: 78 SIIIPCFNE-EKNVEETIHAALAQRYENIEVIAVNDGSTDKTRA--ILDRMAAQIPHLRV 134 SI +P + E ++ET+ A RY N EV+ + + + + I + Sbjct: 460 SIHVPAYKEQPAVLKETLEALANLRYTNFEVLVIINNTPEDFYKTPIKELCEELGERFVY 519 Query: 135 IHLAQNQGKAIALKTGAAAA--KSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGN 192 + + KA AL E L ID D ++ + +V P+ + +V V Sbjct: 520 LDIECTGFKAGALNRALEFTNPTCEILAVIDADYVIKPNWLIDLV-PIFDDAKVALVQAP 578 Query: 193 PRIRT-RSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDD 251 R +L+ + EY+ + +R N G + R+SA EVG W+ D Sbjct: 579 QDHRDGNESLLKRAMNAEYAGFFDIG-MVERNEENAIVAHGTMIMVRKSAFDEVGQWNTD 637 Query: 252 MITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRL 311 I ED ++ +L + Y R L+P+T++ QR RWA G ++ K+ Sbjct: 638 TIVEDSELGLRLFEAGYIGHYTNRRYGHGLLPDTIEAFKNQRHRWAYGAVQILKKHWEHF 697 Query: 312 WRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCT 371 + + + +G + + + VP+ I + T + + T Sbjct: 698 KPSSKTLTRSQKHHFITGWFFWLSDALGTVTSILNIIWVPVIIFVGVTIPTIALTVPIIT 757 Query: 372 LCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATT 413 ++ I S ++ Y + ++ + + + L L Sbjct: 758 AFIVNVIHSFVL---YRTRVGANFYHSLLSAIAAMSLQLTIF 796 >UniRef50_A0LP64 Glycosyl transferase, family 2 n=1 Tax=Syntrophobacter fumaroxidans MPOB RepID=A0LP64_SYNFM Length = 924 Score = 200 bits (509), Expect = 9e-50, Method: Composition-based stats. Identities = 78/320 (24%), Positives = 116/320 (36%), Gaps = 30/320 (9%) Query: 67 PAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMA 126 P + PSIS+++ +N + + + Y N EV+ VNDGSTD T I Sbjct: 391 PFAPDLEWPSISVVVCSYNGSRTIRNCLDGLRRLDYPNFEVVVVNDGSTDGTGEIAAEYG 450 Query: 127 AQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRV 186 +VI+ N G + A TG AAK E + ID DA D Y+ L V Sbjct: 451 -----FKVINTT-NHGLSSARNTGMRAAKGEIVAYIDDDARPDPHWLTYLAYTFLTTDHV 504 Query: 187 GAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVG 246 G N + + I L+ + + + G AFR+SAL + Sbjct: 505 GVGGPNLPPPEDGPIAACVANSPGGPIHVLLTDQEAEH-----IPGCNMAFRKSALEAID 559 Query: 247 YWSDDM--ITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVF 304 + +D+D+ W+LQ WT+ + P A+ W ++K WKQ+ G AE F Sbjct: 560 GFDHRFRIAGDDVDLCWRLQKKGWTLGFNPAAMVWHHRRNSVKAYWKQQK--NYGKAEAF 617 Query: 305 L-KNMTRLWRKENFRMWPLFFEY-------------CLTTIWAFTCLVGFIIYAVQLA-G 349 L K + W + W A L Sbjct: 618 LEKKWPEKYNAAGHVPWAGRLYFKGFEQLLSWSRGRIYQGTWGSALFQSIYQPAPTLLWS 677 Query: 350 VPLNIELTHIAATHTAGILL 369 +P+ E I A +L Sbjct: 678 LPMMPEWYAIIAALAVFSVL 697 >UniRef50_C1ZM23 Glycosyl transferase n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZM23_PLALI Length = 523 Score = 200 bits (508), Expect = 1e-49, Method: Composition-based stats. Identities = 71/364 (19%), Positives = 138/364 (37%), Gaps = 16/364 (4%) Query: 39 FFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAAL 98 F M+I + +W+ R E P P + P ++I +P +NE V + AA Sbjct: 17 FVMTIAQLDLYRRYWLSVLRKDRQREVPPLP--ESLPRVTIQLPIYNESPVVHRLLEAAS 74 Query: 99 AQRYEN--IEVIAVNDGSTDKTRAIL----DRMAAQIP--HLRVIHLAQNQG-KAIALKT 149 Y + +++ ++D STD IL + + P +++ H G KA L Sbjct: 75 RIDYPHNLLQIQVLDD-STDDCSKILVDKVAEIQQRDPSLNIQYRHRIDRTGYKAGNLDE 133 Query: 150 GAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGE 209 G A E++ D D + D + + + S++V ++Q Sbjct: 134 GTTWATGEFMAIFDADFVPKPDYLQQTIRYFQNEEIAIVQSRWGHLNPDSSIVTRVQQFF 193 Query: 210 YSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWT 269 + + +R + +G +R+ + + G W ED+D+S++ QL Sbjct: 194 LDGHLSVEQRGRGDSDLFLIYNGSAGIWRKQVIVDCGGWMTTAAIEDVDMSYRAQLRGKK 253 Query: 270 IFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLT 329 I Y +P+++ L Q RW +G ++ +K + ++W+ + +PL + T Sbjct: 254 IVYLEDYTTPGELPDSMIALRLQLFRWWKGNLQIAIKYIRQVWQSD----YPLIKKLHAT 309 Query: 330 TIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEH 389 T + + LAG I + + L + L+ + + R Sbjct: 310 THLFGPLMSAVTFANIILAGAVPLIVTWYPETRYWLASTLLGVALIPVLFLVYGTGRIRF 369 Query: 390 NLTS 393 S Sbjct: 370 GEGS 373 >UniRef50_A3MTD6 Glycosyl transferase, family 2 n=4 Tax=Thermoproteaceae RepID=A3MTD6_PYRCJ Length = 365 Score = 199 bits (507), Expect = 2e-49, Method: Composition-based stats. Identities = 70/277 (25%), Positives = 117/277 (42%), Gaps = 13/277 (4%) Query: 47 VGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYE--N 104 G + +R + W AP P +++I+P +NE +EE + Q Y Sbjct: 15 FGAPLLYYWRAKRWLKKPWDVAPDPTYRPRVTVIVPTYNEAPLIEEKLDNIYEQDYPRDK 74 Query: 105 IEVIAVNDGSTDKTRAILDRMAAQIPH--LRVIHLAQNQGKAIALKTGAAAAKSEYLVCI 162 +EV+ V+ STD T + + R A P L ++ + +GKA AL T A E +V Sbjct: 75 LEVVVVDSASTDGTPSAVRRWAETHPDLALTLVEETERRGKAHALNTALRHATGEIVVIT 134 Query: 163 DGDA-LLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQ 221 D DA RD A V+ L +P VGAV+ R + + Y +++ + Sbjct: 135 DADALWPARDTLANAVK-WLADPTVGAVSCVKRPAGPAGVEDS-----YRDFYNVLRVAE 188 Query: 222 RIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWIL 281 G +AAF+R L +G + D+ +D + K+ + + P +C Sbjct: 189 SKAWATPIFHGELAAFKRELLERLGGFPTDVGADDSHTATKIAMMGYRAITPPDVVCVEA 248 Query: 282 MPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFR 318 +P+ +G R+R AQ + F K + F+ Sbjct: 249 VPK--RGYHAWRIRRAQHLVQHFAKAIRDGKAPPPFK 283 >UniRef50_A5G7E0 Glycosyl transferase, family 2 n=3 Tax=Bacteria RepID=A5G7E0_GEOUR Length = 477 Score = 198 bits (505), Expect = 3e-49, Method: Composition-based stats. Identities = 80/358 (22%), Positives = 140/358 (39%), Gaps = 23/358 (6%) Query: 70 QLKDNPSISIIIPCFNEEKNVEETIHAALAQRYEN--IEVIAVNDGSTDKTRAILDRMAA 127 Q P++S++IP + + ++ + + Y I+VI V+DGS D + + A Sbjct: 76 QPAALPTVSVVIPVKDRAEELKRCLASLAQLDYPQEMIQVIVVDDGSRDDSPLVAREFGA 135 Query: 128 QIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVG 187 L V +G A A GAA A+ E L ID D A ++ P+ +P+ Sbjct: 136 ----LVVPSGGTGRGPAAARNVGAANARGEILAFIDSDCTASEKWLAELI-PLFNDPKTA 190 Query: 188 AVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGY 247 AV G ++ V + + S +G +R+ + F + RR+ V Sbjct: 191 AVGGMVDGMCTTSAVDRYEAVMSSLSLGSRERSGSGGDDTFYLPSCNMLVRRTIFLSVDG 250 Query: 248 WSDDM-ITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLK 306 + D M + ED+D++W+L+ WTI Y P + TL+ +R G +E L+ Sbjct: 251 FDDAMHVGEDVDLTWRLRDEGWTIAYLPLGRVYHEHRSTLRSFMSRRF--DYGTSEGLLQ 308 Query: 307 NMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAG 366 + RK L F L + FT + A LA + + L G Sbjct: 309 LLHPHRRKRMIIPPLLAFVLALCLMAPFTGCWSLLPAAGVLAADAMVVRLRFARRRLPIG 368 Query: 367 ILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMP 424 + S ++ R L S ++++ + V ++ L + +P Sbjct: 369 L------------SALLAGRL-RALGSLVYYLCYHLVRYYAPVLIAIALIVPLFCAVP 413 >UniRef50_B9XJM2 Glycosyl transferase family 2 n=1 Tax=bacterium Ellin514 RepID=B9XJM2_9BACT Length = 852 Score = 198 bits (505), Expect = 3e-49, Method: Composition-based stats. Identities = 76/370 (20%), Positives = 132/370 (35%), Gaps = 25/370 (6%) Query: 66 APAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRM 125 AP L P +SI++ +N + + + + Y + EV+ V+DGSTD T I+ Sbjct: 292 APYFPLNRYPKVSIVVASYNGGRTTKACLDSLERLNYPDYEVVFVDDGSTDDTANIVRSY 351 Query: 126 AAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPR 185 + + +N G ++A +G +AA E + D D D D Y+V +L Sbjct: 352 SK----VHYFKHEKNLGLSVARNSGISAATGEIVAFTDSDCRADEDWLYYLVGDLLDGDF 407 Query: 186 VGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEV 245 G N S + + V ++ Q + + G AF + AL E+ Sbjct: 408 AGMGGHNFLPHDDSLVAAAVMVSPGGPAHVMLTDRQAEH-----IPGCNMAFYKWALEEI 462 Query: 246 GYWSD--DMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEV 303 G + +D+D+ W++Q I + P W T+ KQ+ G AE Sbjct: 463 GGFDPIFWKAGDDVDVCWRVQQAGGRIGFSPAGFVWHYRRSTVNAYLKQQR--GYGEAEA 520 Query: 304 FLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATH 363 L RK + I++ + + G + + + Sbjct: 521 L------LVRKHPEYFNAMGGGIWQGRIYSPAKFGIVMRPPIIYHGAFASGFFQTLYTSQ 574 Query: 364 TAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLM 423 G+L L I+ + L S ++ PV L L+ + Sbjct: 575 AVGVLSLFSSLEYHILVTL------PLLVLSSTFLYLLPVAVTSLVLSLGVCGVAAAQAS 628 Query: 424 PKKQRARWVS 433 K++ RW S Sbjct: 629 LPKKKIRWWS 638 >UniRef50_B2JFS7 Cellulose synthase (UDP-forming) n=8 Tax=Burkholderiaceae RepID=B2JFS7_BURP8 Length = 716 Score = 198 bits (504), Expect = 3e-49, Method: Composition-based stats. Identities = 77/403 (19%), Positives = 146/403 (36%), Gaps = 27/403 (6%) Query: 9 FILCLVLCIPLCVAYFHSGELMMRFVFFWPFF---MSIMWIVGGVYFWVYRERHWPWGEN 65 F + +++ + + + G + P F + + + G V +Y + Sbjct: 11 FYVSVLVVAYVLAGFLYFGWRFTTINWQLPLFSIALYVAELFGFVTGLLYALMTYRLAVR 70 Query: 66 APAPQLKDNPSISIIIPCFNEE-KNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDR 124 P L + S+ + +P +NE + V T+ AA Y + ++DG+ + R + Sbjct: 71 VPEQPLSNA-SVDVFVPTYNESVELVRRTLLAARNIEYPHKT-WLLDDGNRIEMRELAAE 128 Query: 125 MAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNP 184 + + KA L + E++ D D +D ++ +P Sbjct: 129 LGCDY----LARGENAHAKAGNLNHALKHSSGEFIAIFDADHAARKDFLTKVIG-FFADP 183 Query: 185 RVGAVTGNPRIRTRSTL----VGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRS 240 RVG V + +G+ E S +I+R + + F G A RRS Sbjct: 184 RVGFVQTPQDFYNVDSFSHRQIGRRVWHEQSLFFKVIQRGKDSWNAAF-FCGSCAVLRRS 242 Query: 241 ALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGG 300 AL +G ++ + +TEDI S L + Y P +L + L P +L +QR+RW GG Sbjct: 243 ALERIGGFATETVTEDIHTSILLHKAGYKSVYYPESLAFGLAPHSLDTYLQQRIRWGVGG 302 Query: 301 AEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQ-----LAGVPLNIE 355 +V + LW + Y + + F F+ + +P+ Sbjct: 303 MQVLRRER-VLW--SSGLTLAQRLNYMASMLIYFEGWQKFVFFVTPPIVFFFGLLPIVAP 359 Query: 356 LTHIAATHTAGILLCTLCLLQFIVS---LMIENRYEHNLTSSL 395 + LL L+ L++ +Y + + Sbjct: 360 MGQFLGWFFLYYLLSMAMYLELARGYGALLLSEQYGMSRFFAF 402 >UniRef50_B2ID82 Glycosyl transferase family 2 n=2 Tax=Beijerinckiaceae RepID=B2ID82_BEII9 Length = 443 Score = 198 bits (504), Expect = 3e-49, Method: Composition-based stats. Identities = 82/402 (20%), Positives = 150/402 (37%), Gaps = 13/402 (3%) Query: 43 IMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRY 102 + ++G F + R P + P + I IP FNE + V E + AA A ++ Sbjct: 32 FLALIGINIFELLIGRRLRVPLPGPLVAEANLPHVLIQIPVFNEAEMVAEALQAAAALQW 91 Query: 103 ENIEV-IAVNDGSTDKTRAILDR----MAAQIPHLRVIHLAQNQG-KAIALKTGAAAAKS 156 + I + D STD+T I R + AQ + + A G KA AL G A + Sbjct: 92 PRDRLHIQLLDDSTDETSTIAQRIVLNLRAQGTDVLHLRRADRSGYKAGALAAGMARCDA 151 Query: 157 EYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTR-STLVGKIQVGEYSSIIG 215 Y+ D D + +V ++ + + G V + + +IQ + Sbjct: 152 PYVAIFDVDFRPPSNWLRAVVPMLIADKKAGFVQSRCEFSNYATNWLTRIQGLMMDAHYT 211 Query: 216 LIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPR 275 + + T+ G +F +G +RR A+ G WS D + ED+D++ + ++ W + Sbjct: 212 MEQATRYRAGWLFQFNGTAGLWRREAIEAAGGWSADSLCEDLDLTVRARVAGWHGVFTME 271 Query: 276 ALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFT 335 + L+PE ++ Q+ RW+ G +V K M ++W + L + L I AF Sbjct: 272 PVVPGLVPEKVRHWRVQQRRWSTGFVQVTRKLMKQIWTSDCSFGLKLSSSF-LILIQAFY 330 Query: 336 CLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSL 395 ++ + + T + + + R L L Sbjct: 331 PCAAVATASILAGVLLRGGDATDYVPLLDGFAAFVAFIAIGLTLVPYLVLR-RGPLHQYL 389 Query: 396 FWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRG 437 +I P + +SL+ ++ W + Sbjct: 390 LTLILLPPMMIYISLSNA----PAILKSSFGGTETWKRTPKS 427 >UniRef50_A9KYN4 Glycosyl transferase family 2 n=24 Tax=Gammaproteobacteria RepID=A9KYN4_SHEB9 Length = 443 Score = 198 bits (504), Expect = 3e-49, Method: Composition-based stats. Identities = 94/382 (24%), Positives = 153/382 (40%), Gaps = 25/382 (6%) Query: 74 NPSISIIIPCFNEEKNVEETIHAALAQRYEN-IEVIAVNDGSTDK--TRAILDRMAA--- 127 P +S +I C+ E + V+ TI + + Q Y+ IE+IAV DG+ T + A Sbjct: 66 RPKVSCVITCYAEGEAVKSTIDSFIEQVYDGEIEIIAVVDGAVQNGLTYRVAMAAAKACQ 125 Query: 128 -QIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRV 186 + V+ Q G+ L G A A E ++ D D D D + IV P +P V Sbjct: 126 AHNRTVIVLPKWQRGGRVSTLNAGLATATGEIVINADADTSFDNDMVSQIV-PHFEDPNV 184 Query: 187 GAVTGNPRIRTRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEV 245 AV G R+R S ++ ++Q EY + K + + +SG AFRR+ L ++ Sbjct: 185 PAVGGALRVRNVSESIFTRMQAIEYLISMQGGKTGLGQWNLLNNISGAFGAFRRTFLIQI 244 Query: 246 GYWSDDMITEDIDISWKLQLN-----QWTIFYEPRALCWILMPETLKGLWKQRLRWAQGG 300 G W ED+D++ +++ W I + A+ P LK L QRLRW Sbjct: 245 GGWDTHT-AEDLDLTIRIKQYFKRHPDWRIPFATLAIGHTDAPADLKTLVWQRLRW---D 300 Query: 301 AEVFLKNMTRLWRKE-NFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVP-LNIELTH 358 ++ + W + F + L + L+ F+I + V + Sbjct: 301 GDLLFLYFRKHWPAFTPKLLGSGTFVFTLLYGFLQNVLMPFVIVFYGIGIVVSFPWQFIL 360 Query: 359 IAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFT 418 A G L L +V L I R +L ++ W+ +P + + L + Sbjct: 361 SIALTIYGFYLVILAFFYSMVLLAISERIPQDLRLAV-WLPLYPFYALFMRM-VCLFALL 418 Query: 419 RVMLMPKKQR---ARWVSPDRG 437 M+ + A W RG Sbjct: 419 NEMVRRSHEESSMAPWWVLKRG 440 >UniRef50_Q9LJP4 Xyloglucan glycosyltransferase 4 n=75 Tax=Embryophyta RepID=CSLC4_ARATH Length = 673 Score = 198 bits (504), Expect = 3e-49, Method: Composition-based stats. Identities = 78/358 (21%), Positives = 133/358 (37%), Gaps = 22/358 (6%) Query: 52 FWVYRERHWPW--GENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEV-I 108 FW+ ++ P E+ P + I IP NE + E++I AA + + I Sbjct: 174 FWIKFKKIEPKLTEESIDLEDPSSFPMVLIQIPMCNEREVYEQSIGAASQLDWPKDRILI 233 Query: 109 AVNDGSTDKTRAILDR-----MAAQIPHLRVIHLAQNQG-KAIALKTGAA---AAKSEYL 159 V D S D +L + A + ++ H G KA LK+ E++ Sbjct: 234 QVLDDSDDPNLQLLIKEEVSVWAEKGVNIIYRHRLIRTGYKAGNLKSAMTCDYVKDYEFV 293 Query: 160 VCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRS-TLVGKIQVGEYSSIIGLIK 218 D D + D V NP +G V + L+ ++Q + + Sbjct: 294 TIFDADFTPNPDFLKKTVPHFKGNPELGLVQARWSFVNKDENLLTRLQNINLCFHFEVEQ 353 Query: 219 RTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALC 278 + ++ N F +G +R AL E G W + ED+DI+ + LN W Y Sbjct: 354 QVNGVFLNFFGFNGTAGVWRIKALEESGGWLERTTVEDMDIAVRAHLNGWKFIYLNDVEV 413 Query: 279 WILMPETLKGLWKQRLRWAQGGAEVF-------LKNMTRLWRKENFRMWPLFFEYCLTTI 331 +PE+ + KQ+ RW G ++F +K+ +W+K N + Sbjct: 414 TCELPESYEAYKKQQHRWHSGPMQLFRLCLPSIIKSKISVWKKANLIFLFFLLRKLILPF 473 Query: 332 WAF--TCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRY 387 ++F C++ + + A +PL I IL + L+ EN Sbjct: 474 YSFTLFCIILPLTMFIPEAELPLWIICYVPIFISLLNILPSPKSFPFLVPYLLFENTM 531 >UniRef50_C0QGC2 NodC n=2 Tax=Desulfobacterium autotrophicum HRM2 RepID=C0QGC2_DESAH Length = 466 Score = 198 bits (503), Expect = 5e-49, Method: Composition-based stats. Identities = 76/354 (21%), Positives = 146/354 (41%), Gaps = 19/354 (5%) Query: 70 QLKDNPSISIIIPCFNEEKNVEETIHAALAQRYEN--IEVIAVNDGSTDKTRAILDRMAA 127 + P ++I+P FNE V +T+ + +A Y +++IAV+DGS D T + + Sbjct: 78 ETSQLPHCTVIVPAFNEGVQVYKTLKSLVASDYPKKKLQLIAVDDGSQDDTWDWIKKAKQ 137 Query: 128 Q-IPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRV 186 + L++I L QNQGK AL +G + L+ +D D++++ ++V P + + RV Sbjct: 138 ELGKGLKIIRLQQNQGKRHALYSGFVKSTGSILITVDSDSMVEPLTIRHMVAPFVKDARV 197 Query: 187 GAVTGNPRIRTR-STLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEV 245 GAV GN R+ R ++ ++ + ++ +Q + V G ++A+R+S + +V Sbjct: 198 GAVAGNVRVLNREKGMIPRMLDVAFVYGFDFMRASQSMVNTVMCTPGALSAYRKSVVIQV 257 Query: 246 GY-WSDDM-------ITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWA 297 W I ED ++ + + ++ A+ + +P T L K LRW Sbjct: 258 LPEWLHQTYCGRPANIGEDRAMTNLILREGNHVLFQQNAMVYTEIPVTYTKLCKMYLRWG 317 Query: 298 QGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELT 357 + + +RK FR + + + + + +P N Sbjct: 318 RSNVRETIAMSRFAFRK--FRKGSMVGARINLLSGWLSLIKTPLFVITLVKIMPQNFSSF 375 Query: 358 HIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLA 411 I+ + L + NR L + + F + W+ A Sbjct: 376 GISIVAG----ILVFQSLAAGIYAWKYNRTS-ALWAYTYGFYGFLALSWIKPYA 424 >UniRef50_D1JHN4 Conserved hypothetical membrane protein, glycosyl transferase family 2 n=1 Tax=uncultured archaeon RepID=D1JHN4_9ARCH Length = 538 Score = 197 bits (502), Expect = 5e-49, Method: Composition-based stats. Identities = 84/444 (18%), Positives = 161/444 (36%), Gaps = 20/444 (4%) Query: 3 NRIVSFFILCLV----LCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRER 58 I++ +C+ + V Y H + + F I I + + Sbjct: 29 TAIIALLCICIYLTFRFFVFFAVGYSHLDRIFAICLLLGELFFFIHGIGYSIDVIKASRK 88 Query: 59 HWPWGENAPAPQLKDNPSISIIIPCFNE-EKNVEETIHAALAQRYENIEVIAVNDGSTDK 117 + ++ P +++ I +NE +E TI A Y N ++ ++D ST Sbjct: 89 YREEFDSQHYFLNVKEPKVAVFITAYNEDADTLENTISACTLMDYRNKQIYLLDD-STKP 147 Query: 118 TRAILDRMAAQIPHLRVIHLAQNQG-KAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYI 176 + + + +H +G KA A+ ++YL+ +D D + + Sbjct: 148 ELMRTSKALTEKYGIEYVHRENRRGFKAGAINDMLNV-DAKYLLILDADQRPGYNFLREV 206 Query: 177 VEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAA 236 V + P + V R + S + N G Sbjct: 207 VPTLEEKPELAFVQTPQYYVNRDSSKVSNAASAQQSTFYANVSEGKSVSNAMFACGTNIV 266 Query: 237 FRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRW 296 R SAL ++G + ++ +TED S+ L ++ +Y P ++ G + Q++RW Sbjct: 267 LRVSALKDIGGFDEESVTEDFATSFMLHERGYSSYYYNNVFVEGDGPASIPGYYMQQMRW 326 Query: 297 AQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGV-----P 351 A G +F K + L+R ++EY L+ W F F + +A + P Sbjct: 327 AYGTIGIFKKLLKELFRHPRRLTPVQWWEYFLSGTWYFVGWAFFFMMICPVAYLLFEIRP 386 Query: 352 LNIELTHIAATHTAGILLCTLCLLQFIVSLMIEN-RYEHNLTSSLFWIIWFPVIFWMLSL 410 L E + +L T LQF VS+ + R + + + FP+ +ML+ Sbjct: 387 LLAEPYIYVVAYLPYLLFST---LQFFVSMSMRGFRAKDQWFGQILTYLTFPI--YMLAA 441 Query: 411 ATTLVSFTRVMLMPKKQRARWVSP 434 L++ ++ + + SP Sbjct: 442 TYALIN-KKIPFVVTPKGGSGKSP 464 >UniRef50_C1DW58 Cellulose synthase catalytic subunit (UDP-forming) n=1 Tax=Sulfurihydrogenibium azorense Az-Fu1 RepID=C1DW58_SULAA Length = 764 Score = 197 bits (501), Expect = 7e-49, Method: Composition-based stats. Identities = 81/432 (18%), Positives = 150/432 (34%), Gaps = 33/432 (7%) Query: 23 YFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIP 82 + E ++ F +++ ++ + + + E P + +D P++ + IP Sbjct: 84 TLNLDEGLLNATFSILLYLAELYSITILLLGSFISLRLLEREPIPVEKREDYPTVDVFIP 143 Query: 83 CFNEEKNV-EETIHAALAQRYE--NIEVIAVNDGST---------DKTRA------ILDR 124 +NE + T AA + Y +V ++DG T D+ R L + Sbjct: 144 TYNEPPEIVITTALAAASMDYPQEKFKVYILDDGGTAQKLNDPDPDRRRENYERAMQLKK 203 Query: 125 MAAQI-PHLRVIHLAQN-QGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLY 182 ++ + +N KA + + ++ +D D + D V Sbjct: 204 FVEDYNGNIYYLTRERNLHAKAGNINEALKKTNGDLVLILDCDHVPAEDFLKRTVGFFNR 263 Query: 183 NPRVGAVTG-----NPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAF 237 P++ V NP ++ + KI E I++ + F G A Sbjct: 264 YPKLFLVQTPHSFYNPDPIEKNLGIFKIVPSEADMFYKHIQKGLDFWSASF-FCGSAAIL 322 Query: 238 RRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWA 297 RR L EVG ITED + + +L + Y R + + L PET QR RWA Sbjct: 323 RRKYLNEVGGIQGTTITEDAETALELHSRGYDSAYYARPMIYGLQPETFSAFIVQRTRWA 382 Query: 298 QGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELT 357 QG ++FL L + W Y I+ F FI + L ++ Sbjct: 383 QGMIQIFLLKNPLL---KKGLRWYQKLSYLNANIFWFFGFARFIFLIAPVLYPILGAKIY 439 Query: 358 HIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSF 417 + + + + +VS + ++ N S ++ I I L +++ Sbjct: 440 NASLEDAMVYAIPHF-IFSVMVSYYLYSKVRWNFFSEIYEAIQSIFI---LPAIVSVILS 495 Query: 418 TRVMLMPKKQRA 429 R + Sbjct: 496 PRKPTFKVTPKG 507 >UniRef50_C8VZJ3 Glycosyl transferase family 2 n=1 Tax=Desulfotomaculum acetoxidans DSM 771 RepID=C8VZJ3_DESAS Length = 528 Score = 197 bits (501), Expect = 7e-49, Method: Composition-based stats. Identities = 62/311 (19%), Positives = 126/311 (40%), Gaps = 13/311 (4%) Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRYEN--IEVIAVNDGSTDKTRAILDRMAAQIPHL 132 P+++++IP N + + + + Y +E+I V+DGSTD T I+ ++ Sbjct: 136 PTVTVVIPVKNRPGEIRDCLDSLKVLNYPKEKMEIIVVDDGSTDSTGDIIASY-----NV 190 Query: 133 RVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGN 192 ++I L +++G + G AK E + +D D + ++ P VGAV G Sbjct: 191 KLISLPKSKGASACRNIGVKEAKGEIIAFLDSDCTVSPGWIKELL-PYFAFEGVGAVGGF 249 Query: 193 PRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDM 252 S+ + K + S +G +R F V ++ A + G + + M Sbjct: 250 VNSYYNSSCLDKYEAACSSLNMGKRVLFERDAKTNFYVPSCNLFVKKDAFNQTGGFKESM 309 Query: 253 -ITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRL 311 + ED+D W+++ + + Y P+ + L + K+R+ + A+++ K+ Sbjct: 310 HVGEDVDFCWRMRKLGYFLLYVPQGVIAHKHRNILAKMLKRRMEYGTSEADLYKKHSD-- 367 Query: 312 WRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCT 371 KE P++ + C+ ++ AV +PL + + Sbjct: 368 --KEKVFPVPVYEGLSFLSFLLAICMTKPLLLAVNAPLLPLGVFAKAKKISEYRSEFGYM 425 Query: 372 LCLLQFIVSLM 382 L F+ S + Sbjct: 426 SLLKSFLRSTL 436 >UniRef50_D2EES4 Glycosyl transferase family 2 n=1 Tax=Candidatus Parvarchaeum acidiphilum ARMAN-4 RepID=D2EES4_9EURY Length = 420 Score = 197 bits (500), Expect = 9e-49, Method: Composition-based stats. Identities = 92/425 (21%), Positives = 174/425 (40%), Gaps = 33/425 (7%) Query: 27 GELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNE 86 +L + F +S+ WI F+ + H E +LK+ PS++++IP E Sbjct: 5 SDLFTSLIVFVLLVLSLFWI-----FYFFDINHEIKKEGENELKLKEFPSVAVLIPTIYE 59 Query: 87 EKNVEETIHAALAQRYENIEVIAV-NDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAI 145 K + +TI Y N ++ V N ST +T AA ++VI A GKA Sbjct: 60 GKRLRDTIENVKQTIYPNYKIYVVLNKSSTKETIK-----AAYTKGVKVI-KAPFDGKAR 113 Query: 146 ALKTGAAAAKSE-YLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGK 204 + +E L+ +D D +++D +V N +V AV + ++ +V Sbjct: 114 VMNYAIKNYINEKLLLVLDADTFIEKDLVTRLV-YHFKNKKVAAVVSSVKVYKPKKVVEY 172 Query: 205 IQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQ 264 +Q EY I L ++ G + + G + F L ++GY+ D TED++I ++ Sbjct: 173 LQYYEYLLSI-LARKALSKIGGLVIIHGAGSMFNIEILKKIGYFDDSNYTEDMEIGMRIL 231 Query: 265 LNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFF 324 + + + +AL + + P T + L+KQR+RW G LK + +W + Sbjct: 232 TSGYKVESSVKALSYTVTPNTFRKLFKQRVRWFSGFFFNILKYRKTALDSKKNEIWKIAL 291 Query: 325 EYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIE 384 + LT+I+ ++ I Y V + +P+ +++ + T L +L + +E Sbjct: 292 PFMLTSIFLSFFVIIGIAYTVIVFLLPVYSIISNTSITFFISYFTPHLSILSLNARIFLE 351 Query: 385 NR-----------------YEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQ 427 ++ + F I+ + + L S + V+ K Sbjct: 352 ITGAAIGIFSIIYSLTTIDHKFKVVKDFFGILGYMTFYSFFLSFIWLYSLS-VLGKFKAG 410 Query: 428 RARWV 432 + W+ Sbjct: 411 KLIWL 415 >UniRef50_C6N2D6 Glucosaminyltransferase n=1 Tax=Legionella drancourtii LLAP12 RepID=C6N2D6_9GAMM Length = 526 Score = 197 bits (500), Expect = 9e-49, Method: Composition-based stats. Identities = 78/413 (18%), Positives = 164/413 (39%), Gaps = 20/413 (4%) Query: 32 RFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVE 91 + F + + I V F +N A P + IIIP +NEE + Sbjct: 4 LLINFLSMLVLLGLIPLFVLFAQLSLVGLHGLKNHYAKCKNYTPRVVIIIPVWNEEDVIS 63 Query: 92 ETIHAALAQRYE--NIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQ----GKAI 145 +I + + Y ++ + AV+D STD+T I+ A Q P+ RV ++ ++ GKA Sbjct: 64 LSIDSLMRLHYPADSLRIYAVDDVSTDRTPEIIAAKALQYPN-RVFNIRRHAKIDFGKAA 122 Query: 146 ALKTGAAA----AKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRI-RTRST 200 + G +E ++ ID D +L+ + + L + V AVT + + Sbjct: 123 VINHGLNIILQDTWAEAIMIIDADVMLETNTLIRMTR-HLADQEVSAVTAYIKEGQVPGN 181 Query: 201 LVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGY-WSDDMITEDIDI 259 L+ EY +R Q + G + ++G R+++ ++G + ED Sbjct: 182 LLTHFIGFEYIVAQAATRRAQNVLGVMACLAGGAQLHTRASIEQIGGRIDTSTLAEDTYT 241 Query: 260 SWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRM 319 ++ QL+ + ++ A+ P + LWKQR RWA+G ++ ++ ++W N Sbjct: 242 TFSTQLHNNKVIFDGNAIAIAEEPTYISDLWKQRFRWAEGNVQI-TRHFKKIWFHRNT-- 298 Query: 320 WPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIV 379 P + F L+ + + + + + + + + L L ++ Sbjct: 299 -PGGLGGIFFGLIWFCTLLTPLAMIIASIALN-VLYFLNASVSWQLFQIFSLLSFLSYLF 356 Query: 380 SLMIENRYEHNLTSSLFWI-IWFPVIFWMLSLATTLVSFTRVMLMPKKQRARW 431 + + +++ + P + +L++ + + +L+ + W Sbjct: 357 TTFYSFSIDPGTAKRVWFAGMMLPGLLSLLNMMAAVFPYHAGLLIHQYFPTTW 409 >UniRef50_Q974Y9 Putative uncharacterized protein ST0523 n=2 Tax=cellular organisms RepID=Q974Y9_SULTO Length = 419 Score = 196 bits (498), Expect = 2e-48, Method: Composition-based stats. Identities = 74/314 (23%), Positives = 129/314 (41%), Gaps = 10/314 (3%) Query: 65 NAPAPQLKDNPSISIIIPCFNEEK-NVEETIHAALAQRYENIEVIAVNDGSTDKTRAILD 123 N PQ K SII+ NE K +E IH L YE E+I ++D +K Sbjct: 32 NVYEPQNKSQKGFSIIVAEKNESKKTIEGLIHNLLDINYEKYEIIIISDDPKEKFEEKYR 91 Query: 124 RMAAQIPHLRVIHLAQN-QGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLY 182 I L+++H GKA AL G +K EYLV +D DA +D+D + + Sbjct: 92 EY-MNIEKLKILHRNNPVGGKAGALNYGVKHSKYEYLVFLDSDARVDKDFLSKLALY--- 147 Query: 183 NPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSAL 242 + RI T V + ++ + + + G +G R+ L Sbjct: 148 --DFNVASYRLRIYNAETDVQRYYKEFTEKVMDSLFKVRYYLGLPIFPNGSAFCIRKDIL 205 Query: 243 AEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAE 302 +VG W +++ITED+++ +L L+ + I Y + + L P TL L+KQ RW+ G A+ Sbjct: 206 IKVGGWKENIITEDLELGIRLFLHNYKIKYLDDIIVYSLAPFTLDDLYKQIERWSYGSAQ 265 Query: 303 VFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAAT 362 + L++ + L+ + I+ + ++ ++ + Sbjct: 266 LLLQSFKLFKKGIRGIEGVLYAQQW--GIYGLFIFMLILLSSLNFILKIPLLVFILSIVV 323 Query: 363 HTAGILLCTLCLLQ 376 + + + L Q Sbjct: 324 YGMSVSIYALVYKQ 337 >UniRef50_Q02IY3 Probable glucosyl transferase n=6 Tax=Proteobacteria RepID=Q02IY3_PSEAB Length = 869 Score = 196 bits (498), Expect = 2e-48, Method: Composition-based stats. Identities = 82/420 (19%), Positives = 155/420 (36%), Gaps = 35/420 (8%) Query: 8 FFILCLVLCIPLCV--AYFHSGELMMRFVFFWPFFMSI--MWIVGGVYFWVYRERHWPWG 63 F + C + V AY +S + F + + + +V ++ + W Sbjct: 344 FLAVVSFACASVLVWIAYDYSQQYSTWFSLTVGALLGVGALGVVIVLFTEAHELAEAVWT 403 Query: 64 ENAPAPQLK------DNPSISIIIPCFNEEKNV-EETIHAALAQRYENIEVIAVNDGSTD 116 P L P +SI +PC+NE + ++T+ A Y + EV+ +++ + D Sbjct: 404 RKRRRPFLPITAAQAYRPKVSIHVPCYNEPPELLKQTLDALARLDYPDYEVLVIDNNTRD 463 Query: 117 KT--RAILDRMAAQIPHLRVIHLAQNQG-KAIALKTGAAAAKSEYLVC--IDGDALLDRD 171 + + A R H+A +G KA AL ++ V ID D +D D Sbjct: 464 PAVWQPVEAHCARLGERFRFFHVAPLEGFKAGALNFALGHVAADVEVVAVIDADYCVDPD 523 Query: 172 AAAYIVEPMLYNPRVGAVTGNPRIRTR-STLVGKIQVGEYSSIIGLIKRTQRIYGNVFTV 230 ++V P +PR+ V R + + ++ EY + T+ + Sbjct: 524 WLRHMV-PHFGDPRIAVVQSPQDYRDQHESAFKRLCYAEYKGFFHIGMVTRNDRDAIIE- 581 Query: 231 SGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLW 290 G + RRS L E+ W + ITED ++ ++ + Y R+ +MP+T Sbjct: 582 HGTMTMIRRSVLDEL-RWPEWCITEDAELGLRVFEKGLSAAYFERSYGKGVMPDTFIDFK 640 Query: 291 KQRLRWAQGGAEVFLKNMTRLWRKE-----------NFRMWPLFFEYCLTTIWAFTCLVG 339 KQR RWA G ++ ++ L R + + + + F L G Sbjct: 641 KQRFRWAYGAIQIMKRHTDALLRGRGPDGSRLTRGQRYHFVAGWLPWIADGLNIFFTL-G 699 Query: 340 FIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWII 399 ++++ + VP ++ + L + + +L S F + Sbjct: 700 ALLWSAAMIIVPKRVDPPLLIFAILP---LALFAFKVGKILFLYRRTVGVDLRDSFFAAL 756 >UniRef50_P58931 Cellulose synthase catalytic subunit [UDP-forming] n=29 Tax=Proteobacteria RepID=BCSA_PSEFS Length = 739 Score = 195 bits (497), Expect = 2e-48, Method: Composition-based stats. Identities = 82/439 (18%), Positives = 154/439 (35%), Gaps = 26/439 (5%) Query: 1 MINRIVSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHW 60 ++ +V+ + F+ I+ I G V R Sbjct: 89 VVLSLVASLRYMFWRLTSTLGFETWVDMFFGYGLVAAEFYALIVLIFGYVQTAWPLRRTP 148 Query: 61 PWGENAPAPQLKDNPSISIIIPCFNEEKNVEE-TIHAALAQRYEN--IEVIAVNDGSTDK 117 W + P ++ P++ + IP +NE ++ + TI AA A + + V ++DG D Sbjct: 149 VWLKTEP----EEWPTVDVFIPTYNEALSIVKLTIFAAQAMDWPKDKLRVHVLDDGRRDD 204 Query: 118 TRAILDRMAAQIPHLRVIHLAQN-QGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYI 176 R ++ + I N KA L EY+ D D + R Sbjct: 205 FREFCRKVG-----VNYIRRDNNFHAKAGNLNEALKVTDGEYIALFDADHVPTRSFLQVS 259 Query: 177 VEPMLYNPRVGAVTG-----NPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVS 231 + L +P++ + +P ++ + E GL++ ++ F Sbjct: 260 LGWFLKDPKLAMLQTPHFFFSPDPFEKNLDTFRAVPNEGELFYGLVQDGNDLWNATF-FC 318 Query: 232 GVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWK 291 G A RR L E+G + + +TED + KL + Y L E+L Sbjct: 319 GSCAVIRREPLLEIGGVAVETVTEDAHTALKLNRLGYNTAYLAIPQAAGLATESLSRHIN 378 Query: 292 QRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVP 351 QR+RWA+G A++F + L + W Y + F L + LA + Sbjct: 379 QRIRWARGMAQIFRTDNPLLGKGLK---WGQRICYANAMLHFFYGLPRLVFLTAPLAYLI 435 Query: 352 LNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLA 411 E+ H +A +L L S I+ R+ H+ + ++ + + +++L Sbjct: 436 FGAEIFHASALMIVAYVLPHLVHSSLTNS-RIQGRFRHSFWNEVYETV---LAWYILPPV 491 Query: 412 TTLVSFTRVMLMPKKQRAR 430 + + + Sbjct: 492 LVALVNPKAGGFNVTDKGG 510 >UniRef50_Q0IZF3 Os09g0572500 protein n=2 Tax=Oryza sativa Japonica Group RepID=Q0IZF3_ORYSJ Length = 541 Score = 195 bits (496), Expect = 2e-48, Method: Composition-based stats. Identities = 73/315 (23%), Positives = 125/315 (39%), Gaps = 18/315 (5%) Query: 10 ILCLVLCIPLCVA-YFHSGELMMRFV--FFWPFFMSIMWIVGGVYFWVYRERH----WPW 62 + C V+ L +A Y +M F+ + ++ +W++ R Sbjct: 13 VRCGVVLPTLQLAVYVCVAMSIMLFLERLYMALVVAALWLIRRRRRRSNRREQDDDGAEN 72 Query: 63 GENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYENIE-VIAVNDGSTDKT-RA 120 + P+ ++P + + IP FNE++ +I AA + + + VI V D STD R Sbjct: 73 DQLLQDPEAANSPMVLVQIPMFNEKQVYRLSIGAACGMTWPSDKLVIQVLDDSTDPAIRE 132 Query: 121 ILD----RMAAQIPHLRVIHLAQNQG-KAIALKTGAAAA---KSEYLVCIDGDALLDRDA 172 +++ R A + +R + G KA A++ G A + E + D D D D Sbjct: 133 MVEGECGRWAGKGVSIRYENRRNRSGYKAGAMREGLRKAYARECELVAIFDADFQPDADF 192 Query: 173 AAYIVEPMLYNPRVGAVTGNPRIRTRST-LVGKIQVGEYSSIIGLIKRTQRIYGNVFTVS 231 V ++ +P V V R L+ +IQ + + F + Sbjct: 193 LLRTVPVLVADPGVALVQARWRFVNADECLLTRIQEMSLDYHFRVEQEVGSACHGFFGFN 252 Query: 232 GVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWK 291 G +R AL E G W + ED+D++ + L W Y +P TL+ Sbjct: 253 GTAGVWRVRALEEAGGWKERTTVEDMDLAVRASLRGWRFVYVGHVGVRNELPSTLRAYRY 312 Query: 292 QRLRWAQGGAEVFLK 306 Q+ RW+ G A +F K Sbjct: 313 QQHRWSCGPANLFRK 327 >UniRef50_A6LJJ1 Glycosyl transferase, family 2 n=2 Tax=Thermosipho RepID=A6LJJ1_THEM4 Length = 379 Score = 195 bits (496), Expect = 3e-48, Method: Composition-based stats. Identities = 95/399 (23%), Positives = 159/399 (39%), Gaps = 35/399 (8%) Query: 39 FFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAAL 98 +S++ ++ G Y R + N K P +S+IIPC+NEE + TI A Sbjct: 12 ILISLIIVLYGKYKNEKRTNYLHENRN-----PKYYPLVSVIIPCYNEEAVIGNTIRAVE 66 Query: 99 AQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEY 158 Y+N E+I ++D STD T I + + +L+VI +GK ++ AK E Sbjct: 67 KNSYKNFEIIVIDDNSTDSTFEIAKGLEKEYNNLKVIKKKGEKGKPQSINEAMEIAKGEI 126 Query: 159 LVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRT-RSTLVGKIQVGEYSSIIGLI 217 ++ ID DA + + + V NP+V + N R T V IQ Y + + I Sbjct: 127 VLIIDADARIPENYISSHVYC-FSNPKVEMIFTNFEAYNFRFTPVFIIQEI-YFNFVKTI 184 Query: 218 KRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRAL 277 + I +F +G+ FR+S L + + +D ++ KL + FY Sbjct: 185 FYSNIIVKMIFMGNGIF--FRKSLLERILPIDPTTLVDDFSMATKLSKLKVKEFYSTYPY 242 Query: 278 CWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCL 337 I ++ K LWKQ RW GG R + LF+ Y LT + F + Sbjct: 243 IKIQFAKSFKDLWKQHKRWYTGGFREMF------MRIKEGHHSYLFY-YMLTGLVIFLPV 295 Query: 338 VGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFW 397 + + + HI L+ + +L +I SL +L S + Sbjct: 296 ISLFVDLI-----------FHIGMFKIVFGLIMAVYILLWISSLDSRQFGFFSLFYSFLY 344 Query: 398 IIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 + +L + L+ + P ++ W +R Sbjct: 345 VP-------LLMIFELLILVYSYITGPFNKKLPWEKVER 376 >UniRef50_Q7PC76 Glucomannan 4-beta-mannosyltransferase 1 n=53 Tax=Embryophyta RepID=CSLA1_ORYSJ Length = 521 Score = 195 bits (496), Expect = 3e-48, Method: Composition-based stats. Identities = 85/402 (21%), Positives = 146/402 (36%), Gaps = 24/402 (5%) Query: 15 LCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHW----PWGENAPAPQ 70 + +PL M +F +M++ ++ GV R P ++ P Sbjct: 22 VIVPLLRLAVAVCLTMSVLLFLERMYMAV--VISGVKILRRRPDRRYRCDPIPDDDPELG 79 Query: 71 LKDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIA-VNDGSTDKTRAI-----LDR 124 P + I IP FNE + + +I A + + ++ V D STD +R Sbjct: 80 TSAFPVVLIQIPMFNEREVYQLSIGAVCGLSWPSDRLVVQVLDDSTDPVIKEMVRIECER 139 Query: 125 MAAQIPHLRVIHLAQNQG-KAIALKTGAAAA---KSEYLVCIDGDALLDRDAAAYIVEPM 180 A + ++ +G KA ALK G + EY+ D D D D + + Sbjct: 140 WAHKGVNITYQIRENRKGYKAGALKEGMKHGYVRECEYVAIFDADFQPDPDFLRRTIPFL 199 Query: 181 LYNPRVGAVTGNPRIRTRST-LVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRR 239 ++N + V R L+ ++Q + + F +G +R Sbjct: 200 VHNSDIALVQARWRFVNADECLMTRMQEMSLDYHFTVEQEVSSSVCAFFGFNGTAGVWRV 259 Query: 240 SALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQG 299 SA+ E G W D ED+D++ + L W Y +P T K Q+ RW+ G Sbjct: 260 SAVNEAGGWKDRTTVEDMDLAIRASLKGWKFVYLGDVQVKSELPSTFKAFRFQQHRWSCG 319 Query: 300 GAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHI 359 A +F K + + R + +W + I+ F + I + V A L I T Sbjct: 320 PANLFRKMLMEIVRNKKVTIWK-----KIHVIYNFFLIRKIIAHIVTFAFYCLIIPATIF 374 Query: 360 AATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWF 401 C + I++L+ + FWI++ Sbjct: 375 VPEVRIPKWGC--VYIPTIITLLNSVGTPRSFHLLFFWILFE 414 >UniRef50_C3MU39 Glycosyl transferase family 2 n=9 Tax=Sulfolobus RepID=C3MU39_SULIM Length = 427 Score = 195 bits (495), Expect = 4e-48, Method: Composition-based stats. Identities = 81/374 (21%), Positives = 152/374 (40%), Gaps = 28/374 (7%) Query: 30 MMRFVFFWPFFMSI----MWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFN 85 M+R +F F+S +WI+ +++ + G + N I II+ + Sbjct: 1 MIREIFEILLFISSSITSLWILLQAFYYKVSNENLVQGLG--NDRKNSNKKIDIIVAIKD 58 Query: 86 E-EKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQG-K 143 E E+ ++E I Y VI ++D S I+ + + +I +N+G K Sbjct: 59 ENERTIKELIDNLSELNYRPYRVIIISDDSLKNFEKIMKVVDRIPENFVLIRRPENRGRK 118 Query: 144 AIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVG 203 A AL + E LV +D +A ++++ I + A ++R T V Sbjct: 119 AGALNFAFNLSDGELLVFLDAEARVEKNFLRKISKL-----NYDAAAFRLKVRDPVTPVQ 173 Query: 204 KIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKL 263 KI ++ + + + G + +G A RR L +VG W ++ + ED+++ +L Sbjct: 174 KIYSHTNEFVMNALFKAREKLGLIIFANGSAFAIRRDILRKVGGWKENSVAEDLELGIRL 233 Query: 264 QLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRL----------WR 313 LN + Y + + L P TL L+ Q RWA G AE+ +M L Sbjct: 234 ALNGIRVKYVDDIIVYTLAPYTLTDLYNQIKRWAYGSAELLAYSMRLLKLGVKGLEGFIY 293 Query: 314 KENFRMWPLFFEYCLTTI-WAFTCLVGFIIYAVQLAGVPLNIELTHIAA----THTAGIL 368 + + ++PL+ L I F V ++ L + ++ + + +GI+ Sbjct: 294 SQQWGIYPLYLLVFLIVISLQFILNVNYLYVFSSLVPILVSNGIYITLIKPKDDYRSGIV 353 Query: 369 LCTLCLLQFIVSLM 382 L+ +I + Sbjct: 354 TIIASLIGYIQGAL 367 >UniRef50_D1RA94 Putative uncharacterized protein n=1 Tax=Parachlamydia acanthamoebae str. Hall's coccus RepID=D1RA94_9CHLA Length = 620 Score = 194 bits (494), Expect = 4e-48, Method: Composition-based stats. Identities = 83/417 (19%), Positives = 155/417 (37%), Gaps = 23/417 (5%) Query: 1 MINRIVSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPF-----------FMSIMWIVGG 49 +++ I+ F + L + ++ + + +F M + Sbjct: 96 LVSMILPAFYGIVWLSELVMASFHYFEHWLSAAIFITLMGSLGCLMIFNTVMLSWITLVR 155 Query: 50 VYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKN-VEETIHAALAQRYENIEVI 108 +R G K++P +SI IPCFNE V ET++A Y + EVI Sbjct: 156 YSDLYFRFPRLKRGLTIADQSKKNSPFVSIHIPCFNEPPELVIETLNAISRFNYPHFEVI 215 Query: 109 AVNDGSTDKT--RAILDRMAAQIPHLRVIHLAQNQG-KAIALKTGAA--AAKSEYLVCID 163 +++ + D T + R H+ + G KA AL A+++E + D Sbjct: 216 VLDNNTKDPTVWAPVEAHCLQLGERFRFYHIDKLAGAKAGALNACLKCTASQAELIAVFD 275 Query: 164 GDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRT-RSTLVGKIQVGEYSSIIGLIKRTQR 222 D + D + +V +P++G V R + EY + L Q Sbjct: 276 ADYVAKEDFLSRLVG-FFDDPKIGFVQSCQDYRDWDHSHYQAACYYEYETHFKLELPGQN 334 Query: 223 IYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILM 282 + +T+ G + RR+AL EVG W++ +TED +++ ++ + +Y + L+ Sbjct: 335 EWDVTYTI-GTMCLIRRTALDEVGGWAEWCLTEDSEVAVRIHALGFAGYYLKETFGYGLI 393 Query: 283 PETLKGLWKQRLRWAQGGAEVFLKNMTRL--WRKENFRMWPLFFEYCLTTIWAFTCLVGF 340 PET + QR RW+ G + K+ W K + F E + F+ + F Sbjct: 394 PETFESYKLQRFRWSAGPVQQIQKHWRLYLPWAKNGLSLAQKFGEIFHSLSIFFSESLSF 453 Query: 341 IIYAVQLAGVPLNIELTHIAATHTAGILLCT-LCLLQFIVSLMIENRYEHNLTSSLF 396 +I L + + +L + ++ I+ + R Sbjct: 454 LINIPILCICLWFAIVKQQSFILPKAVLWAIPIVFIKNILCNWLSIRLLGGSWKEYL 510 >UniRef50_B1M838 Cellulose synthase catalytic subunit (UDP-forming) n=15 Tax=Rhizobiales RepID=B1M838_METRJ Length = 930 Score = 193 bits (492), Expect = 9e-48, Method: Composition-based stats. Identities = 78/357 (21%), Positives = 123/357 (34%), Gaps = 28/357 (7%) Query: 66 APAPQLKDNPSISIIIPCFNEEKNV-EETIHAALAQRYE--NIEVIAVNDGSTDKTRA-- 120 AP D P++ I IP +NE + T+ AA Y V ++DG TD+ A Sbjct: 153 APVLPEADLPTVDIFIPSYNESAEILGLTLAAARNLDYPAGRATVWLLDDGGTDQKCADP 212 Query: 121 -------------ILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDAL 167 L + A + + KA L G A+++ ++ +D D Sbjct: 213 DPARAGAARARRAALQALCAGLGVRYLTRARNAHAKAGNLNNGLTQARADLVLVLDADHA 272 Query: 168 LDRDAAAYIVEPMLYNPRVGAVTG-----NPRIRTRSTLVGKIQVGEYSSIIGLIKRTQR 222 R V +P++ V NP R+ E G+ + Sbjct: 273 PFRPFLRETVGLFARDPKLFLVQTPHVFINPDPIERNLRTFTRMPSENEMFYGVTQAGLD 332 Query: 223 IYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILM 282 + F G A RRSAL VG +S ITED + +++L WT Y R L L Sbjct: 333 KWNGSF-FCGSAALLRRSALDAVGGFSGVTITEDCETAFELHARGWTSAYVDRPLIAGLQ 391 Query: 283 PETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFII 342 PET QR RW QG ++ L R Y + + F L I Sbjct: 392 PETFADFIGQRARWCQGMFQIMLLKNPLFKRGLKPIQR---LCYLSSMTFWFFPLPRLIF 448 Query: 343 YAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWII 399 L + ++++ + T + ++ + S L+ + Sbjct: 449 MLAPLLHIFFDVKIFVSSIDEALVYT-ATYVVANMMMQNYLYGHVRWPWVSELYEYV 504 >UniRef50_A9CZ20 Putative cellulose synthase catalytic subunit (UDP-forming) n=1 Tax=Hoeflea phototrophica DFL-43 RepID=A9CZ20_9RHIZ Length = 730 Score = 193 bits (491), Expect = 1e-47, Method: Composition-based stats. Identities = 87/456 (19%), Positives = 158/456 (34%), Gaps = 37/456 (8%) Query: 3 NRIVSFFILCLVLCIPLCVAYFHSGELM------MRFVFFWPFFMSIMWIVGGVYFWVYR 56 N + F + L AY+ + E + + F+ +M+ M+ + + + Sbjct: 28 NTVFRFITFIFCAVLALRYAYWRTTETLPVFSEPLNFIPGLLLYMAEMYCLLMLAISFFM 87 Query: 57 ERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNV-EETIHAALAQRYE--NIEVIAVNDG 113 L D P++ ++IP +NE+ + T+ AA + Y + + ++DG Sbjct: 88 LADPIKRVAPKVRSLDDLPTVDVLIPTYNEDPELLAGTLAAAKSMIYPRDKLTIWLLDDG 147 Query: 114 ST-------DKTRAIL-----DRMAAQIP--HLRVIHLAQN-QGKAIALKTGAAAAKSEY 158 ST D T A+ +++ A + +N KA L G + S+ Sbjct: 148 STQAKRSHKDPTVALAATRRHEQLKALCKAMGVHYHARKKNDHAKAGNLNDGLKVSGSDL 207 Query: 159 LVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTG-----NPRIRTRSTLVGKIQVGEYSSI 213 +V D D R+ V +P++ V NP ++ + E Sbjct: 208 VVVFDADHAPVREFLKETVSFFKEDPKLFLVQTPHYFLNPDPLEKNLRTFRSMPSENEMF 267 Query: 214 IGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYE 273 +++R + F G A R AL VG +S ITED + + L W Y Sbjct: 268 YSVLQRGLDKWNASF-FCGSAAVLSRKALQAVGGFSGQSITEDCETALSLHAKGWRSLYV 326 Query: 274 PRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWA 333 + L L PET QR RW QG ++ + N L + M + W Sbjct: 327 DKPLIAGLQPETFVAFIGQRARWCQGMLQILILNRPFL--AKGLTMGQRICYAGINMFWL 384 Query: 334 FTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTS 393 F L L + ++E+ L L + F + + + S Sbjct: 385 FP-LSRLAFMLSPLLYIFFSLEIYQANIQEFGAYALTYL-IASFAMQSYLYGKVRWPWVS 442 Query: 394 SLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRA 429 L+ I + ++ +++ R + Sbjct: 443 ELYEYIQSIM---LVGSIISVIRNPRKPTFNVTAKG 475 >UniRef50_B7IR36 Glycosyltransferase n=11 Tax=Bacillus cereus group RepID=B7IR36_BACC2 Length = 741 Score = 192 bits (489), Expect = 2e-47, Method: Composition-based stats. Identities = 76/411 (18%), Positives = 139/411 (33%), Gaps = 17/411 (4%) Query: 5 IVSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGE 64 I+ F I + I + L+ + + F + + + E Sbjct: 34 IILFLICNAIYLIWRTFYSLPTINLISIIAGIILLITEWAGYLQSIVFSIVSWKPYKRKE 93 Query: 65 NAPAPQLKDNPSISIIIPCFNEE-KNVEETIHAALAQRYEN--IEVIAVNDGSTDKTRAI 121 P + P++ I I +NE ++ T+ Y + + +DG R Sbjct: 94 -VPLSTFEKLPTVDIFIATYNEPIDLLKRTVAGCTLITYPKELLNIYICDDGR----RES 148 Query: 122 LDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPML 181 + ++AA + KA L +K + +V +D D + + + Sbjct: 149 VKQLAADFSVHHITRTENKHAKAGNLNHAMLHSKGDIIVTMDADMIPRANFLERTIGYFS 208 Query: 182 YNP----RVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAF 237 N + V N + E + ++ + + V G A F Sbjct: 209 KNNVVFVQAPQVFYNADPFQYNLFFEDNIANEQDFFMRQLEEGKDRFNATMYV-GSNALF 267 Query: 238 RRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWA 297 RR+AL E+G ++ +ITED+ LQ N+W + L L PET L KQR RW Sbjct: 268 RRTALEEIGGFATGVITEDMATGMLLQANKWETIFVNETLAVGLSPETFSDLLKQRDRWC 327 Query: 298 QGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELT 357 +G +V K + +F L+ + + F + I L + +I Sbjct: 328 RGNIQVVKKWNPFTIKGLSFMQRLLYAD---GIHYWFFGVYKMIFLLAPLLFLVFDIYSL 384 Query: 358 HIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWML 408 I H Q I + N+ S ++ + P + + + Sbjct: 385 EINFMHLFMFWAPAFLTSQLIFKA-VSNKKRTTTWSHVYEVAMAPYMGFAI 434 >UniRef50_A3ERE6 Cellulose synthase catalytic subunit (UDP-forming) n=2 Tax=Leptospirillum sp. Group II RepID=A3ERE6_9BACT Length = 714 Score = 192 bits (488), Expect = 2e-47, Method: Composition-based stats. Identities = 99/427 (23%), Positives = 153/427 (35%), Gaps = 24/427 (5%) Query: 24 FHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPA--PQLKDNPSISIII 81 H E F+ F S + +V G Y + +P G A P + P++ I+I Sbjct: 91 VHWYEYPFIFLLFGAELYSWIILVLG-----YLQTAYPRGRKADPLPPDTESWPTVDILI 145 Query: 82 PCFNEE-KNVEETIHAALAQRYE--NIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLA 138 P +NE V T+ AAL + V ++DG R AAQ V Sbjct: 146 PTYNEPLAVVRTTLLAALNMDWPPEKKRVFLLDDG----NREEFATFAAQTGVTYVARPE 201 Query: 139 QNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTG-----NP 193 KA + A + +Y+ D D + R V + +PRV V +P Sbjct: 202 HRHAKAGNVNYALARSDGKYVALFDCDHVPTRSFLQTTVGILEKSPRVAFVQTPHHFYSP 261 Query: 194 RIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMI 253 R+ V K E G+I+ ++ T G RR AL E+ ++ D + Sbjct: 262 DPYERNLNVFKKLPNEGELFYGVIQDGNDLWNA-STFCGSCTVLRRKALNEIQGFAVDTV 320 Query: 254 TEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWR 313 TED S +L W Y L +L +QR+RWA+G ++F L R Sbjct: 321 TEDAHTSIRLHRLGWESAYLNLPQAAGLATPSLSAHIRQRIRWARGMIQIFRIENPLLAR 380 Query: 314 KENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLC 373 Y ++ + L I LA + + + A A T+ Sbjct: 381 GLTL---AQKLCYLNGMLFFLSSLPRIIFLTAPLAYLYFGVRIFDADAYSIASYAFPTIA 437 Query: 374 LLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVS 433 L + + I R+ H+ + ++ P IF LA + K+ R Sbjct: 438 L-SLLANAAINGRHRHSFWNQVYETTLAPYIFLPTLLAMVRPDLGTFNVTAKEGRIEKEF 496 Query: 434 PDRGILR 440 DR I R Sbjct: 497 YDRRIAR 503 >UniRef50_Q2K9Z1 Cellulose synthase protein n=17 Tax=Rhizobiaceae RepID=Q2K9Z1_RHIEC Length = 748 Score = 192 bits (488), Expect = 3e-47, Method: Composition-based stats. Identities = 80/427 (18%), Positives = 145/427 (33%), Gaps = 31/427 (7%) Query: 33 FVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNV-E 91 F+ +++ M+ V + ++ + A P + + +P +NE+ + Sbjct: 104 FIPGLLLYLAEMYSVAMLALSLFIVATPLPPRPSRAANPGRLPHVDVFVPSYNEDAGLLG 163 Query: 92 ETIHAALAQRYE--NIEVIAVNDGSTD---KTRAILDRMAA-----------QIPHLRVI 135 T+ AA A Y + V ++DG T + +L+ AA Q + + Sbjct: 164 NTLAAAKAMDYPADKLHVWLLDDGGTLQKRNSGKLLEAQAAAARHIELKQLCQDLDVTYL 223 Query: 136 HLAQN-QGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTG--- 191 +N KA L G + E + D D RD V +P++ V Sbjct: 224 TRDRNEHAKAGNLNNGMKHSTGELIAVFDADHAPARDFLLETVGYFEDDPKLFLVQTPHF 283 Query: 192 --NPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWS 249 NP R+ + E G+I+R + F G A R AL +S Sbjct: 284 FINPDPLERNLRTFEKMPSENEMFYGIIQRGLDKWNAAF-FCGSAAVLSRKALESQNGFS 342 Query: 250 DDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMT 309 ITED + + L + W Y + L L P T QR RWAQG ++ Sbjct: 343 GISITEDCETALALHGSGWNSIYVDKPLIAGLQPATFASFIGQRSRWAQGMMQILRFRFP 402 Query: 310 RLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILL 369 L R + P Y +T++ I L + ++E+ + Sbjct: 403 LLKRGLSI---PQRLCYMSSTLFWLFPFPRTIFLFAPLFYLFFDLEIFTASGGEFLAYTF 459 Query: 370 CTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRA 429 L+ ++ + + S L+ + +L +++ R + Sbjct: 460 -AYMLVNLMMQNYLYGSFRWPWISELYE---YVQTVHLLPAVVSVMLNPRKPTFKVTAKD 515 Query: 430 RWVSPDR 436 ++ R Sbjct: 516 ESIAVSR 522 >UniRef50_C7ZM19 Putative uncharacterized protein n=1 Tax=Nectria haematococca mpVI 77-13-4 RepID=C7ZM19_NECH7 Length = 332 Score = 192 bits (488), Expect = 3e-47, Method: Composition-based stats. Identities = 70/279 (25%), Positives = 128/279 (45%), Gaps = 25/279 (8%) Query: 43 IMWIVGGVYFWVYRERHWPWGENAP---APQLKDNPSISIIIPCFNEEKNVEETIHAALA 99 I+ I+ Y Y W W + P A + + P IS++IP FNE + + +I + + Sbjct: 40 ILEIISTFYKIEYLLMVWRWIKYKPTKAATKDNELPFISVVIPVFNESEFIHNSIESIVL 99 Query: 100 QRYE--NIEVIAVNDGSTDKTRAILDRMAAQIPH----LRVIHLAQNQGKAIALKTGAAA 153 Y IE++ ++DGSTD T +++ IP R++ N GK A+++G + Sbjct: 100 SDYPKDKIELVIIDDGSTDDTWQHVNQAVKAIPQCGISYRLLQHPANLGKRQAIQSGFST 159 Query: 154 AKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRST--------LVGKI 205 A ++ +D D++L++ A IV P++ +P +GAV G+ + S+ + ++ Sbjct: 160 ALGSIIISVDSDSVLEKGALHRIVSPLMQDPAIGAVAGHLAVLNVSSPNTLSTKGFLPRL 219 Query: 206 QVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAE-VGYWSDDMI-------TEDI 257 Y I L + G V + G +AFR +A+ + V ED+ Sbjct: 220 LDILYEHIGNLPRAALTAEGFVTILPGAFSAFRVNAVRDHVDNLCTSTFLGSPLKHGEDM 279 Query: 258 DISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRW 296 +++ + + W Y+ A+ PETL+ + RW Sbjct: 280 ELAHHILKDGWKTVYQSNAVVHTTAPETLQKAFLMFSRW 318 >UniRef50_Q6YWK8 Probable mannan synthase 11 n=3 Tax=Poaceae RepID=CSLAB_ORYSJ Length = 570 Score = 192 bits (488), Expect = 3e-47, Method: Composition-based stats. Identities = 64/309 (20%), Positives = 119/309 (38%), Gaps = 15/309 (4%) Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRYENIEV-IAVNDGSTDKT-----RAILDRMAAQ 128 P + + IP FNE + + +I AA + + + V I V D STD + + Sbjct: 118 PMVLVQIPMFNEREVYKLSIGAACSLDWPSDRVVIQVLDDSTDLVVKDLVEKECQKWQGK 177 Query: 129 IPHLRVIHLAQNQG-KAIALKTGAAA---AKSEYLVCIDGDALLDRDAAAYIVEPMLYNP 184 +++ +G KA ALK G + EY+ D D + D V +++N Sbjct: 178 GVNIKYEVRGNRKGYKAGALKEGLKHDYVKECEYIAMFDADFQPESDFLLRTVPFLVHNS 237 Query: 185 RVGAVTGNPRIRTRST-LVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALA 243 + V + + L+ + Q + + F +G +R +A+ Sbjct: 238 EIALVQTRWKFVNANECLLTRFQEMSLDYHFKYEQEAGSSVYSFFGFNGTAGVWRIAAID 297 Query: 244 EVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEV 303 + G W D ED+D++ + L W Y +P T K Q+ RW+ G A + Sbjct: 298 DAGGWKDRTTVEDMDLAVRATLQGWKFVYVGDVKVKSELPSTFKAYRFQQHRWSCGPANL 357 Query: 304 FLKNMTRLWRKENFRMWP---LFFEYCLTT-IWAFTCLVGFIIYAVQLAGVPLNIELTHI 359 F K M + + W L++++ I A T + + + ++ IE+ Sbjct: 358 FKKMMVEILENKKVSFWNKIHLWYDFFFVGKIAAHTVTFIYYCFVIPVSVWLPEIEIPLW 417 Query: 360 AATHTAGIL 368 + ++ Sbjct: 418 GVVYVPTVI 426 >UniRef50_C4DG81 Glycosyl transferase n=1 Tax=Stackebrandtia nassauensis DSM 44728 RepID=C4DG81_9ACTO Length = 630 Score = 192 bits (487), Expect = 3e-47, Method: Composition-based stats. Identities = 77/379 (20%), Positives = 136/379 (35%), Gaps = 16/379 (4%) Query: 37 WPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCF--NEEKNVEETI 94 + ++++ + + + E P + P S II + NE + + +T+ Sbjct: 224 YLVIVAVLVVNSALMWA---ENLHALDPKRPGEPKEPFPPASAIIAAYLPNEAETIIDTV 280 Query: 95 HAALAQRYE-NIEVIA-VNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAA 152 A LAQ Y ++VI N A L +A + P L V+ + + KA + Sbjct: 281 RALLAQNYPGRLQVILAYNTPRPLPVEAQLRLLARRDPRLVVLEVPFSTSKAQNVNAALG 340 Query: 153 AAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTR-STLVGKIQVGEYS 211 A E++ D D A + V G+ +R V + E+ Sbjct: 341 VATGEFVGVFDADHQPAPLAFHRAWRWIAD--GADVVQGHCVVRNGHDAWVPRTVAVEFE 398 Query: 212 SIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIF 271 +I + + G +R S L + DM+TEDID S + L I Sbjct: 399 AIYAVSHPGRAALHGFGIFGGSNGFWRTSLLRRI-RMRPDMLTEDIDSSMRALLQGSRIV 457 Query: 272 YEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTI 331 +P L + L PE+L LW QR+RWAQG +V +++ R +F L Sbjct: 458 SDPALLSFELAPESLGPLWHQRMRWAQGWFQVSRRHLLATLRSASFNTRNKLGSLFLLGW 517 Query: 332 -----WAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENR 386 WA +V + AG ++ A T+ L + + Sbjct: 518 RELTPWASLQIVPVVAALAWQAGGVTKLDWMIPAFLLTSLYALSVGPVQTLFAWRLAVPE 577 Query: 387 YEHNLTSSLFWIIWFPVIF 405 + ++++ + + Sbjct: 578 IKKKPWWFGVYLLFNAIFY 596 >UniRef50_D1RIK1 Glycosyl transferase domain-containing protein n=1 Tax=Legionella longbeachae D-4968 RepID=D1RIK1_LEGLO Length = 422 Score = 191 bits (486), Expect = 4e-47, Method: Composition-based stats. Identities = 95/423 (22%), Positives = 176/423 (41%), Gaps = 27/423 (6%) Query: 30 MMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNE--E 87 M F+ + F+ I+ + V +Y + W N+PA P ++++I C+NE E Sbjct: 1 MTSFLLIFFPFVVILEMPYYVLICLYSIKAWLKEYNSPATIKTYAPFVTVVITCYNETFE 60 Query: 88 KNVEETIHAALAQRYE-NIEVIAVNDGST--DKTRAILDRMAAQI-----PHLRVIHLAQ 139 + + T+ A Q Y NI+ + + D + KT ++A + VI + Sbjct: 61 EVLI-TVRAIAEQIYSGNIQTLIIIDNAQINKKTVHYARQLAMTYQTPPSRTITVIEKSS 119 Query: 140 NQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRT-R 198 G A ++ G AK + L+ +D D +D A + +P V AV+G R+R + Sbjct: 120 RGGHASSMNLGLKLAKGDVLIMLDADTSIDNQTIAKSAARFI-DPHVIAVSGAVRVRNVK 178 Query: 199 STLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDID 258 + + ++Q EY I L + +SG FR L ++G W + ED D Sbjct: 179 DSWLTRLQAIEYMIGIQLGRYGLTELNVTNNISGAFGVFRTKFLKQIGGWLNGS-AEDFD 237 Query: 259 ISWKLQL--NQW---TIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWR 313 ++++L N++ I +EP A+ W P TL+ L KQRLRW +++ + R WR Sbjct: 238 LTFRLHAYINKYPYLKIVHEPLAVAWTAAPITLRRLLKQRLRW---DGDLYYILVRRHWR 294 Query: 314 KENFRMW--PLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCT 371 K + RM + ++ L I+ + N+ + + Sbjct: 295 KFSTRMLKPTRMLFFIWYGLYYQIALPFIILLYTLILLAKFNVSILLSVSLIIYIYYFLA 354 Query: 372 LCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARW 431 + + +++ R + ++ S W+ P +L + L ++ K R Sbjct: 355 QIFMYILYLILVSERPKQDVYMS-GWLFLMPAYQQLLRFSAALFILNEILF--KGHRDSS 411 Query: 432 VSP 434 ++P Sbjct: 412 MAP 414 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P75905 Biofilm PGA synthesis N-glycosyltransferase pgaC... 469 e-131 UniRef50_A4SIQ6 Glycosyl transferase, family 2 n=6 Tax=Gammaprot... 368 e-100 UniRef50_B2UK24 Glycosyl transferase family 2 n=10 Tax=Burkholde... 362 1e-98 UniRef50_Q30SK0 Glycosyl transferase, family 2 n=6 Tax=Proteobac... 362 2e-98 UniRef50_A3N3L7 Biofilm PGA synthesis N-glycosyltransferase PgaC... 361 2e-98 UniRef50_B4CVJ2 Glycosyl transferase family 2 n=1 Tax=Chthonioba... 361 4e-98 UniRef50_Q9RQP9 Biofilm PIA synthesis N-glycosyltransferase icaA... 359 1e-97 UniRef50_B9EAX5 Intercellular adhesion protein IcaA homolog n=19... 358 2e-97 UniRef50_A0RGJ4 Glycosyl transferase and polysaccharide deacetyl... 353 6e-96 UniRef50_Q82DY8 Putative polysaccharide deacetylase/glycosyltran... 346 8e-94 UniRef50_C8WT25 Glycosyl transferase family 2 n=3 Tax=Bacteria R... 342 1e-92 UniRef50_B5HQE3 Bifunctional transferase/deacetylase n=3 Tax=Str... 342 2e-92 UniRef50_A4J4W8 Glycosyl transferase, family 2 n=3 Tax=Clostridi... 341 3e-92 UniRef50_D2LDP7 Polysaccharide deacetylase n=1 Tax=Rhodomicrobiu... 339 1e-91 UniRef50_Q2G7Y8 Polysaccharide deacetylase n=1 Tax=Novosphingobi... 334 4e-90 UniRef50_B5HP10 Bifunctional transferase/deacetylase n=9 Tax=Str... 334 5e-90 UniRef50_B0T3D0 Polysaccharide deacetylase n=2 Tax=Caulobacter R... 333 8e-90 UniRef50_C9RCL0 Glycosyl transferase family 2 n=1 Tax=Ammonifex ... 331 4e-89 UniRef50_D2L983 Polysaccharide deacetylase n=1 Tax=Desulfovibrio... 329 9e-89 UniRef50_UPI0001B4D705 bi-functional transferase/deacetylase n=3... 329 1e-88 UniRef50_C0YGU0 Polysaccharide deacetylase/glycosyl transferase,... 329 1e-88 UniRef50_Q1IUP8 Polysaccharide deacetylase n=1 Tax=Candidatus Ko... 328 3e-88 UniRef50_C8WBS4 Polysaccharide deacetylase n=4 Tax=Sphingomonada... 327 6e-88 UniRef50_Q1IL87 Glycosyl transferase, family 2 n=1 Tax=Candidatu... 326 8e-88 UniRef50_D2AU81 Polysaccharide deacetylase n=1 Tax=Streptosporan... 326 9e-88 UniRef50_B5GIB2 Bi-functional transferase/deacetylase n=7 Tax=St... 325 2e-87 UniRef50_C1ACV5 Putative glycosyl transferase/polysaccharide dea... 320 4e-86 UniRef50_D1SD50 Polysaccharide deacetylase n=1 Tax=Micromonospor... 317 3e-85 UniRef50_A1UIF2 Polysaccharide deacetylase n=3 Tax=Mycobacterium... 317 3e-85 UniRef50_Q1GJ85 Glycosyl transferase family 2 n=4 Tax=Rhodobacte... 315 2e-84 UniRef50_B6BG61 Glycosyl transferase, group 2 family protein n=2... 315 3e-84 UniRef50_A7GCY8 Glycosyl transferase, group 2 family n=66 Tax=Fi... 311 3e-83 UniRef50_B7QVL5 Glycosyl transferase, family 2 n=1 Tax=Ruegeria ... 311 4e-83 UniRef50_C1F4G9 Polysaccharide deacetylase domain protein/glycos... 310 6e-83 UniRef50_C0WP52 Possible VGP family vectorial glycosyl polymeriz... 307 4e-82 UniRef50_A9EU26 Group-specific protein n=3 Tax=Phaeobacter galla... 307 6e-82 UniRef50_A9EEN3 Glycosyl transferase, family 2 n=1 Tax=Oceanibul... 306 9e-82 UniRef50_A6Q1D6 Glucosaminyltransferase n=2 Tax=Epsilonproteobac... 306 1e-81 UniRef50_B4D4P9 Polysaccharide deacetylase n=1 Tax=Chthoniobacte... 304 5e-81 UniRef50_C1V8R5 Glycosyl transferase n=1 Tax=Halogeometricum bor... 303 8e-81 UniRef50_A1R2V1 Glycosyl transferase, group 2 family domain prot... 301 3e-80 UniRef50_C5SLQ2 Polysaccharide deacetylase n=1 Tax=Asticcacaulis... 300 6e-80 UniRef50_C7MUK1 Glycosyl transferase n=11 Tax=Actinomycetales Re... 297 7e-79 UniRef50_B8II97 Polysaccharide deacetylase n=1 Tax=Methylobacter... 297 7e-79 UniRef50_P96587 Uncharacterized glycosyltransferase ydaM n=6 Tax... 295 3e-78 UniRef50_B9LWG0 Glycosyl transferase family 2 n=1 Tax=Halorubrum... 294 3e-78 UniRef50_C1D0J4 Putative glycosyltransferase n=1 Tax=Deinococcus... 293 1e-77 UniRef50_C6A3S6 Putative glycosyl transferase n=1 Tax=Thermococc... 293 1e-77 UniRef50_B9MNQ5 Glycosyl transferase family 2 n=1 Tax=Anaerocell... 292 2e-77 UniRef50_Q3B487 Glucosaminyltransferase n=2 Tax=Chlorobium/Pelod... 292 2e-77 UniRef50_B1L4E1 Glycosyl transferase family 2 n=1 Tax=Candidatus... 291 4e-77 UniRef50_Q46B82 Glucosaminyltransferase n=2 Tax=Methanosarcina R... 290 7e-77 UniRef50_Q03HA4 Glycosyltransferase n=2 Tax=Bacilli RepID=Q03HA4... 288 2e-76 UniRef50_B1HSU1 Biofilm PIA synthesis N-glycosyltransferase icaA... 288 3e-76 UniRef50_B7HT47 Glycosyl transferase family 2 n=8 Tax=Bacteria R... 287 4e-76 UniRef50_B9ZC52 Glycosyl transferase family 2 n=1 Tax=Natrialba ... 287 4e-76 UniRef50_A5VMQ3 Glycosyl transferase, family 2 n=38 Tax=Lactobac... 287 8e-76 UniRef50_C4I9P5 Inner membrane glycosyltransferase n=39 Tax=Bact... 286 1e-75 UniRef50_B7I546 Glycosyltransferase n=12 Tax=Acinetobacter RepID... 286 1e-75 UniRef50_C1F7J6 Glycosyl transferase, group 2 family n=1 Tax=Aci... 286 1e-75 UniRef50_Q6LKT1 Hypothetical glycosyltransferase, probably invol... 285 3e-75 UniRef50_B5W069 Glycosyl transferase family 2 n=3 Tax=Arthrospir... 283 1e-74 UniRef50_UPI0001C1680B Glycosyl transferase, family 2 n=2 Tax=No... 283 1e-74 UniRef50_Q2LWC4 Glycosyltransferase n=1 Tax=Syntrophus aciditrop... 282 2e-74 UniRef50_Q2S5Y1 Glycosyl transferase, group 2 family protein n=1... 282 2e-74 UniRef50_Q110Z2 Glycosyl transferase, family 2 n=15 Tax=Cyanobac... 282 2e-74 UniRef50_Q38W83 Putative glycosyl transferase, family 2 n=1 Tax=... 282 2e-74 UniRef50_B7A6V1 Glycosyl transferase family 2 n=1 Tax=Thermus aq... 280 5e-74 UniRef50_C7RAF6 Glycosyl transferase family 2 n=1 Tax=Kangiella ... 280 7e-74 UniRef50_O67406 Cellulose synthase catalytic subunit n=1 Tax=Aqu... 279 1e-73 UniRef50_Q1IMJ5 Glycosyl transferase, family 2 n=10 Tax=cellular... 279 1e-73 UniRef50_B7HFD6 N-acetyllactosaminide beta-1,6-N-acetylglucosami... 279 1e-73 UniRef50_D1BHB0 Glycosyl transferase n=1 Tax=Sanguibacter keddie... 279 2e-73 UniRef50_P58931 Cellulose synthase catalytic subunit [UDP-formin... 278 2e-73 UniRef50_B6AKK5 Glycosyltransferase n=1 Tax=Leptospirillum sp. G... 277 4e-73 UniRef50_A8VSD1 Binding-protein-dependent transport systems inne... 277 8e-73 UniRef50_A3ERE6 Cellulose synthase catalytic subunit (UDP-formin... 277 8e-73 UniRef50_B9AE19 Putative uncharacterized protein n=1 Tax=Methano... 277 8e-73 UniRef50_C1DJH7 Cellulose synthase subunit AB n=35 Tax=Proteobac... 277 9e-73 UniRef50_P21877 Cyclic di-GMP-binding domain n=11 Tax=cellular o... 276 1e-72 UniRef50_Q9CHP7 Glycosyl transferase n=3 Tax=Lactococcus lactis ... 275 2e-72 UniRef50_UPI0001AEFEC9 N-glycosyltransferase PgaC n=1 Tax=Strept... 275 3e-72 UniRef50_D1CH68 Glycosyl transferase family 2 n=2 Tax=Bacteria R... 274 4e-72 UniRef50_P58932 Cellulose synthase catalytic subunit [UDP-formin... 273 6e-72 UniRef50_C7TL81 Glycosyl transferase, group 2 n=3 Tax=Bacilli Re... 271 4e-71 UniRef50_Q5N0N9 UDP-glucose-beta-D-glucan glucosyltransferase n=... 271 4e-71 UniRef50_B1YJV4 Glycosyl transferase family 2 n=4 Tax=Bacillales... 271 4e-71 UniRef50_A8MM07 Glycosyl transferase family 2 n=1 Tax=Alkaliphil... 271 5e-71 UniRef50_B7IR36 Glycosyltransferase n=11 Tax=Bacillus cereus gro... 270 5e-71 UniRef50_A7M791 Cellulose synthase subunit AB n=1 Tax=Asaia bogo... 270 5e-71 UniRef50_A4CLA3 Glycosyl transferase, group 2 family protein n=2... 270 5e-71 UniRef50_C1DW58 Cellulose synthase catalytic subunit (UDP-formin... 270 9e-71 UniRef50_A6DGB3 Glycosyl transferase, family 2 n=3 Tax=Lentispha... 270 9e-71 UniRef50_A0LZM3 Transmembrane family-2 glycosyl transferase-poss... 269 1e-70 UniRef50_A1VIY0 Glycosyl transferase, family 2 n=1 Tax=Polaromon... 269 2e-70 UniRef50_Q1LL41 Glycosyl transferase, family 2 n=18 Tax=Burkhold... 268 2e-70 UniRef50_Q9UY40 Glycosyl transferase, family 2 n=2 Tax=Thermococ... 268 2e-70 UniRef50_Q9KAQ6 BH2231 protein n=4 Tax=Bacillus RepID=Q9KAQ6_BACHD 268 2e-70 UniRef50_A8USY3 Cellulose synthase (UDP-forming) n=1 Tax=Hydroge... 268 2e-70 UniRef50_Q7NHH7 Glr2559 protein n=1 Tax=Gloeobacter violaceus Re... 268 3e-70 UniRef50_B8E0Z1 Glycosyl transferase family 2 n=1 Tax=Dictyoglom... 268 4e-70 UniRef50_B2JFS7 Cellulose synthase (UDP-forming) n=8 Tax=Burkhol... 267 5e-70 UniRef50_A1S0Z5 Glycosyl transferase, family 2 n=1 Tax=Thermofil... 267 5e-70 UniRef50_C5CVQ8 Glycosyl transferase family 2 n=4 Tax=Proteobact... 267 5e-70 UniRef50_Q59167 Cyclic di-GMP-binding domain n=5 Tax=cellular or... 267 7e-70 UniRef50_B0C9M4 Glycosyl transferase, family 2 n=1 Tax=Acaryochl... 267 8e-70 UniRef50_A4CJ64 Glycosyltransferase n=15 Tax=Bacteroidetes RepID... 267 9e-70 UniRef50_C7RTK3 Polysaccharide deacetylase n=2 Tax=Bacteria RepI... 266 1e-69 UniRef50_C6WRK1 Glycosyl transferase family 2 n=1 Tax=Actinosynn... 266 1e-69 UniRef50_B3DW74 Glycosyltransferase n=1 Tax=Methylacidiphilum in... 265 2e-69 UniRef50_P37653 Cellulose synthase catalytic subunit [UDP-formin... 265 2e-69 UniRef50_Q3IER5 Cellulose synthase catalytic subunit [UDP-formin... 265 3e-69 UniRef50_C1F830 Cellulose synthase I n=4 Tax=Bacteria RepID=C1F8... 265 3e-69 UniRef50_B1V7Q3 Chitin synthase n=1 Tax=Clostridium perfringens ... 265 3e-69 UniRef50_Q2IFD0 Glycosyl transferase, family 2 n=5 Tax=Bacteria ... 265 3e-69 UniRef50_A6GMN6 Cellulose synthase (UDP-forming) n=3 Tax=Burkhol... 264 5e-69 UniRef50_A9CZ20 Putative cellulose synthase catalytic subunit (U... 264 6e-69 UniRef50_B4U2G9 Glycosyl transferase, group 2 family protein n=2... 263 6e-69 UniRef50_Q0AL04 Glycosyl transferase, family 2 n=3 Tax=Bacteria ... 263 6e-69 UniRef50_D0KCI8 Cellulose synthase catalytic subunit (UDP-formin... 263 8e-69 UniRef50_Q46CB1 Glycosyltransferase n=2 Tax=Methanosarcina RepID... 263 9e-69 UniRef50_A3I0Z1 Glycosyltransferase n=1 Tax=Algoriphagus sp. PR1... 263 1e-68 UniRef50_B4RSC1 Cellulose synthase catalytic subunit n=2 Tax=Alt... 263 1e-68 UniRef50_C6X1A6 Glycosyl transferase, family 2 n=3 Tax=Flavobact... 262 1e-68 UniRef50_C0Z7U1 Conserved hypothetical membrane protein n=1 Tax=... 262 1e-68 UniRef50_B2UMM8 Glycosyl transferase family 2 n=3 Tax=Verrucomic... 262 2e-68 UniRef50_C4KBM8 Glycosyl transferase family 2 n=6 Tax=Betaproteo... 262 2e-68 UniRef50_Q2K9Z1 Cellulose synthase protein n=17 Tax=Rhizobiaceae... 262 2e-68 UniRef50_B2IY03 Glycosyl transferase, family 2 n=1 Tax=Nostoc pu... 262 3e-68 UniRef50_Q8DMV4 Cellulose synthase n=1 Tax=Thermosynechococcus e... 261 3e-68 UniRef50_Q74D65 Glycosyl transferase, group 2 family protein n=1... 261 4e-68 UniRef50_D2S4V1 Glycosyl transferase family 2 n=1 Tax=Geodermato... 260 6e-68 UniRef50_Q988K9 Mlr6694 protein n=1 Tax=Mesorhizobium loti RepID... 260 7e-68 UniRef50_C7IGQ0 Glycosyl transferase family 2 n=2 Tax=Firmicutes... 260 1e-67 UniRef50_C7M4N0 Glycosyl transferase family 2 n=4 Tax=Flavobacte... 260 1e-67 UniRef50_C8WFV0 Cellulose synthase catalytic subunit (UDP-formin... 259 1e-67 UniRef50_C6X7P6 Cellulose synthase catalytic subunit (UDP-formin... 259 1e-67 UniRef50_A5FCY6 Candidate polysaccharide-forming beta-glycosyltr... 259 1e-67 UniRef50_A4WR49 Cellulose synthase (UDP-forming) n=6 Tax=Rhodoba... 259 2e-67 UniRef50_C9RLY4 Glycosyl transferase family 2 n=1 Tax=Fibrobacte... 259 2e-67 UniRef50_B9YCC6 Putative uncharacterized protein n=1 Tax=Holdema... 259 2e-67 UniRef50_Q31NE1 Cellulose synthase (UDP-forming) n=2 Tax=Synecho... 258 2e-67 UniRef50_A0LZL8 Transmembrane family-2 glycosyl transferase-poss... 258 2e-67 UniRef50_A3HSN4 Glycosyl transferase, group 2 family protein n=1... 258 2e-67 UniRef50_C6J2E9 Glycosyl transferase n=1 Tax=Paenibacillus sp. o... 258 2e-67 UniRef50_C4WPV5 Putative uncharacterized protein n=1 Tax=Ochroba... 258 3e-67 UniRef50_C0G271 Glycosyl transferase family 2 n=1 Tax=Natrialba ... 258 3e-67 UniRef50_UPI000178A1DC glycosyl transferase family 2 n=1 Tax=Geo... 258 4e-67 UniRef50_C5A6A1 Glycosyltransferase n=2 Tax=Thermococcus RepID=C... 257 6e-67 UniRef50_Q31R21 Cellulose synthase (UDP-forming) n=3 Tax=Synecho... 257 6e-67 UniRef50_B5WDV1 Cellulose synthase catalytic subunit (UDP-formin... 257 7e-67 UniRef50_Q7NUL9 Cellulose synthase, subunit A n=1 Tax=Chromobact... 257 8e-67 UniRef50_Q1Q2A6 Strongly similar to cellulose synthase, catalyti... 257 8e-67 UniRef50_C5S5F1 Glycosyl transferase family 2 n=1 Tax=Allochroma... 257 9e-67 UniRef50_Q9RBJ2 Cyclic di-GMP-binding domain n=2 Tax=Gluconaceto... 256 1e-66 UniRef50_B4AY07 Cellulose synthase (UDP-forming) n=1 Tax=Cyanoth... 256 1e-66 UniRef50_Q2RZV9 Putative glucosyltransferase n=1 Tax=Salinibacte... 256 1e-66 UniRef50_A9KK06 Glycosyl transferase family 2 n=12 Tax=Bacteria ... 256 1e-66 UniRef50_B1Y242 Cellulose synthase catalytic subunit (UDP-formin... 256 1e-66 UniRef50_D1PAP3 Putative biofilm PGA synthesis N-glycosyltransfe... 256 2e-66 UniRef50_A5GEA8 Glycosyl transferase, family 2 n=3 Tax=Deltaprot... 255 2e-66 UniRef50_B2GJB7 Putative glycosyltransferase n=2 Tax=Micrococcin... 255 3e-66 UniRef50_B4W4T5 Cellulose synthase catalytic subunit (UDP-formin... 255 3e-66 UniRef50_Q97J01 Glycosyltransferase, involved in cell wall bioge... 254 4e-66 UniRef50_C2KK33 Glycosyltransferase (Fragment) n=2 Tax=Lactobaci... 254 4e-66 UniRef50_Q05S47 Group-specific protein n=1 Tax=Synechococcus sp.... 254 4e-66 UniRef50_C0A1N7 Glycosyl transferase, family 2 n=1 Tax=Opitutace... 254 5e-66 UniRef50_C5AKK6 Glycosyl transferase, family 2 n=5 Tax=Burkholde... 254 5e-66 UniRef50_A9FZQ2 Glycosyltransferase n=1 Tax=Sorangium cellulosum... 253 7e-66 UniRef50_Q30R23 Glycosyl transferase, family 2 n=3 Tax=Campyloba... 253 8e-66 UniRef50_A7INQ0 Glycosyl transferase family 2 n=11 Tax=Rhizobial... 253 8e-66 UniRef50_B0CG43 Glycosyl transferase, group 2 family protein, pu... 253 8e-66 UniRef50_A7HQK0 Glycosyl transferase family 2 n=1 Tax=Parvibacul... 253 9e-66 UniRef50_Q7NLY9 Gll0980 protein n=1 Tax=Gloeobacter violaceus Re... 252 2e-65 UniRef50_B1M838 Cellulose synthase catalytic subunit (UDP-formin... 252 2e-65 UniRef50_C7PF17 Glycosyl transferase family 2 n=1 Tax=Chitinopha... 252 2e-65 UniRef50_Q3B477 Cellulose synthase (UDP-forming) n=1 Tax=Chlorob... 251 3e-65 UniRef50_A3V6V4 Putative cellulose synthase catalytic subunit pr... 251 4e-65 UniRef50_Q6KZU9 N-acetylglucosaminyltransferase n=1 Tax=Picrophi... 251 4e-65 UniRef50_C5SBP2 Glycosyl transferase family 2 n=1 Tax=Allochroma... 251 4e-65 UniRef50_Q0IE23 Glycosyl transferase, group 2 family protein n=1... 251 5e-65 UniRef50_C0GFE2 Glycosyl transferase family 2 n=1 Tax=Dethiobact... 250 6e-65 UniRef50_UPI0001C42DFC capsule anchoring protein CapD n=1 Tax=Ba... 250 1e-64 UniRef50_B1ZCT8 Cellulose synthase (UDP-forming) n=14 Tax=Rhizob... 250 1e-64 UniRef50_D1JE47 Conserved hypothetical membrane protein, glycosy... 249 1e-64 UniRef50_B0U7V3 Glycosyl transferase family 2 n=1 Tax=Methylobac... 249 1e-64 UniRef50_UPI000038DF5C N-acetylglucosaminyltransferase n=1 Tax=F... 249 1e-64 UniRef50_A0LSJ8 Glycosyl transferase, family 2 n=1 Tax=Acidother... 249 1e-64 UniRef50_Q0AEM9 Cellulose synthase (UDP-forming) n=2 Tax=Nitroso... 249 1e-64 UniRef50_D2L608 Glycosyl transferase family 2 n=1 Tax=Desulfovib... 249 2e-64 UniRef50_A5FW11 Cellulose synthase (UDP-forming) n=1 Tax=Acidiph... 248 2e-64 UniRef50_A8FII3 Glycosyltransferase n=3 Tax=Bacillus RepID=A8FII... 248 2e-64 UniRef50_B6R4T3 Glycosyl transferase, family 2 n=1 Tax=Pseudovib... 248 2e-64 UniRef50_A9QSM4 N-acetylglucosaminyltransferase n=39 Tax=Lactoba... 248 3e-64 UniRef50_B8HQQ1 Cellulose synthase (UDP-forming) n=5 Tax=Cyanoba... 248 3e-64 UniRef50_A4YD58 Glycosyl transferase, family 2 n=12 Tax=Sulfolob... 248 4e-64 UniRef50_Q6ACB6 Glucosaminyltransferase n=1 Tax=Leifsonia xyli s... 248 4e-64 UniRef50_C7PF18 Glycosyl transferase family 2 n=9 Tax=Bacteroide... 247 5e-64 UniRef50_Q4KHI6 Glycosyl transferase, group 2 family protein n=1... 247 5e-64 UniRef50_D2QUU5 Glycosyl transferase family 2 n=1 Tax=Spirosoma ... 247 8e-64 UniRef50_A3ZTV9 Cellulose synthase (UDP-forming) n=2 Tax=Blastop... 245 2e-63 UniRef50_Q2JNJ5 Glycosyl transferase, group 2 family protein n=6... 245 2e-63 UniRef50_C1ZJA9 Glycosyl transferase n=1 Tax=Planctomyces limnop... 245 3e-63 UniRef50_D2LZZ6 Glycosyl transferase family 2 n=1 Tax=Bacillus c... 245 3e-63 UniRef50_B3QIT3 Glycosyl transferase family 2 n=6 Tax=Bradyrhizo... 245 4e-63 UniRef50_A1SGB7 Glycosyl transferase, family 2 n=1 Tax=Nocardioi... 244 5e-63 UniRef50_C7XH18 Glycosyltransferase n=1 Tax=Lactobacillus crispa... 244 5e-63 UniRef50_C6QIA2 Cellulose synthase (UDP-forming) n=1 Tax=Hyphomi... 244 6e-63 UniRef50_A6CMH6 Glycosyl transferase, group 2 family protein n=2... 244 6e-63 UniRef50_Q9FNI7 Glucomannan 4-beta-mannosyltransferase 2 n=8 Tax... 244 6e-63 UniRef50_A0YM95 UDP-glucose-beta-D-glucan glucosyltransferase n=... 243 9e-63 UniRef50_Q04QL4 Glycosyltransferase plus another conserved domai... 243 9e-63 UniRef50_B7K7L6 Cellulose synthase (UDP-forming) n=1 Tax=Cyanoth... 243 1e-62 UniRef50_B4VZ62 Type IV pilus assembly protein PilZ n=1 Tax=Micr... 243 1e-62 UniRef50_Q9LJP4 Xyloglucan glycosyltransferase 4 n=75 Tax=Embryo... 243 1e-62 UniRef50_A5FY21 Glycosyl transferase, family 2 n=2 Tax=Alphaprot... 243 1e-62 UniRef50_Q182N2 Cellulose synthase catalytic subunit [UDP-formin... 243 1e-62 UniRef50_D1J3P1 Whole genome shotgun sequence of line PN40024, s... 242 1e-62 UniRef50_A4SY23 Cellulose synthase (UDP-forming) n=1 Tax=Polynuc... 242 2e-62 UniRef50_B4WM17 Glycosyl transferase, group 2 family protein n=1... 242 2e-62 UniRef50_B6JD78 Glycosyl transferase, family 2 n=1 Tax=Oligotrop... 241 3e-62 UniRef50_Q89FJ5 Bll6705 protein n=1 Tax=Bradyrhizobium japonicum... 241 3e-62 UniRef50_Q2RGJ6 Glycosyl transferase, family 2 n=7 Tax=Firmicute... 241 5e-62 UniRef50_B0C4P4 Glycosyl transferase, group 2 family protein n=1... 240 7e-62 UniRef50_Q2JKN6 Glycosyl transferase, group 2 family protein n=2... 240 8e-62 UniRef50_C7NQ00 Glycosyl transferase family 2 n=1 Tax=Halorhabdu... 240 9e-62 UniRef50_A5AGR2 Putative uncharacterized protein n=1 Tax=Vitis v... 240 1e-61 UniRef50_C9B2I5 Cellulose synthase catalytic subunit n=2 Tax=Ent... 240 1e-61 UniRef50_A5GQ06 Glycosyltransferase of family GT2; possible proc... 239 1e-61 UniRef50_B4UAH9 Glycosyl transferase family 2 n=4 Tax=Anaeromyxo... 239 1e-61 UniRef50_Q7NDK6 Glr4229 protein n=1 Tax=Gloeobacter violaceus Re... 239 2e-61 UniRef50_Q3ATR2 Glycosyltransferases probably involved in cell w... 239 2e-61 UniRef50_P74165 Sll1377 protein n=9 Tax=Cyanobacteria RepID=P741... 239 2e-61 UniRef50_B2IZC1 Glycosyl transferase, family 2 n=1 Tax=Nostoc pu... 239 2e-61 UniRef50_Q8S7W0 Probable mannan synthase 4 n=4 Tax=Magnoliophyta... 238 2e-61 UniRef50_A9CLP5 Curdlan Synthase n=5 Tax=Alphaproteobacteria Rep... 238 3e-61 UniRef50_C1TQW8 Glycosyl transferase n=1 Tax=Dethiosulfovibrio p... 238 4e-61 UniRef50_Q2W2R8 Cellulose synthase catalytic subunit n=5 Tax=Rho... 237 5e-61 UniRef50_UPI0001C31B0A glycosyl transferase family 2 n=1 Tax=Con... 237 8e-61 UniRef50_UPI0001C30EDA glycosyl transferase family 2 n=1 Tax=Con... 237 8e-61 UniRef50_A6CBT2 Glycosyl transferase, group 2 family protein n=1... 237 9e-61 UniRef50_B0C9B5 Glycosyl transferase, group 2 family protein, pu... 236 1e-60 UniRef50_Q7XIF5 Probable mannan synthase 7 n=9 Tax=Magnoliophyta... 236 1e-60 UniRef50_C9KN46 Chitin synthase n=1 Tax=Mitsuokella multacida DS... 236 1e-60 UniRef50_UPI00017891C0 glycosyl transferase family 2 n=1 Tax=Geo... 236 1e-60 UniRef50_C6QAR9 Glycosyl transferase family 2 n=1 Tax=Hyphomicro... 236 1e-60 UniRef50_Q886Q3 Glycosyl transferase, group 2 family protein n=2... 236 1e-60 UniRef50_Q7PC76 Glucomannan 4-beta-mannosyltransferase 1 n=53 Ta... 235 2e-60 UniRef50_C1ZM23 Glycosyl transferase n=1 Tax=Planctomyces limnop... 235 2e-60 Sequences not found previously or not previously below threshold: UniRef50_A1SEL1 General secretory system II, protein E domain pr... 240 9e-62 >UniRef50_P75905 Biofilm PGA synthesis N-glycosyltransferase pgaC n=146 Tax=Bacteria RepID=PGAC_ECOLI Length = 441 Score = 469 bits (1208), Expect = e-131, Method: Composition-based stats. Identities = 441/441 (100%), Positives = 441/441 (100%) Query: 1 MINRIVSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHW 60 MINRIVSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHW Sbjct: 1 MINRIVSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHW 60 Query: 61 PWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRA 120 PWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRA Sbjct: 61 PWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRA 120 Query: 121 ILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPM 180 ILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPM Sbjct: 121 ILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPM 180 Query: 181 LYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRS 240 LYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRS Sbjct: 181 LYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRS 240 Query: 241 ALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGG 300 ALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGG Sbjct: 241 ALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGG 300 Query: 301 AEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIA 360 AEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIA Sbjct: 301 AEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIA 360 Query: 361 ATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRV 420 ATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRV Sbjct: 361 ATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRV 420 Query: 421 MLMPKKQRARWVSPDRGILRG 441 MLMPKKQRARWVSPDRGILRG Sbjct: 421 MLMPKKQRARWVSPDRGILRG 441 >UniRef50_A4SIQ6 Glycosyl transferase, family 2 n=6 Tax=Gammaproteobacteria RepID=A4SIQ6_AERS4 Length = 416 Score = 368 bits (945), Expect = e-100, Method: Composition-based stats. Identities = 202/412 (49%), Positives = 274/412 (66%), Gaps = 6/412 (1%) Query: 26 SGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFN 85 + F +P M+++WI GG+Y++ ++ L P +++++PC+N Sbjct: 4 FLSGIAIFTLAYPSMMAMVWICGGLYYY-FQWEQRDIALARTGLVLPRYPKVTLMVPCYN 62 Query: 86 EEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAI 145 E NVEETI L QRY ++EV+A+NDGS D T LDR+AAQ L V+H NQGKA+ Sbjct: 63 EGANVEETISHLLRQRYPDLEVLAINDGSKDDTGQRLDRLAAQDARLTVLH-QHNQGKAM 121 Query: 146 ALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKI 205 AL G AK E LV IDGDA+LD DA ++V+ + +P+VGAVTGNPR+RTRST++GKI Sbjct: 122 ALNNGLERAKGEILVGIDGDAVLDHDAVRWMVKHFIESPKVGAVTGNPRVRTRSTIIGKI 181 Query: 206 QVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQL 265 Q GE+SSIIGLIKR QRIYG VFTVSGV+ AFR+SA+ VG WS DM+TEDIDISWKLQL Sbjct: 182 QTGEFSSIIGLIKRAQRIYGTVFTVSGVVVAFRKSAVEAVGGWSTDMVTEDIDISWKLQL 241 Query: 266 NQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFE 325 W I Y+P+ALCW+LMPET++GL+KQRLRWAQGGAEV L+ + R N W L E Sbjct: 242 AGWLIHYQPQALCWVLMPETVRGLYKQRLRWAQGGAEVILRYGLQAMRWRNRHFWLLLLE 301 Query: 326 YCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIEN 385 Y + +W ++ L+ +I+ + P EL G+++ +CLLQF VS+ I+N Sbjct: 302 YLSSVLWCYSMLLLALIWLI----RPDLGELAEGQLFQWTGVIMTAICLLQFGVSIFIDN 357 Query: 386 RYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRG 437 Y+ + S W IW+P+I+W+L+L T +V+ + +L + Q A W SPDRG Sbjct: 358 HYDKTMARSFVWCIWYPLIYWLLNLVTVIVAVPKAILRRRGQLAVWESPDRG 409 >UniRef50_B2UK24 Glycosyl transferase family 2 n=10 Tax=Burkholderiales RepID=B2UK24_RALPJ Length = 425 Score = 362 bits (930), Expect = 1e-98, Method: Composition-based stats. Identities = 206/418 (49%), Positives = 277/418 (66%), Gaps = 2/418 (0%) Query: 24 FHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPW-GENAPAPQLKDNPSISIIIP 82 + + FVF++PFFMS W++GG+ + ER +K P +SII+P Sbjct: 4 LLAKRWISGFVFYYPFFMSYFWMIGGLLHYFLLERGTRRIQHPLALLGVKTYPKVSIIVP 63 Query: 83 CFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQG 142 C+NEE NV E I RY N ++IAVNDGS+D+T L+ +AAQ P L VIH + NQG Sbjct: 64 CYNEEANVREVISHLARMRYPNYDIIAVNDGSSDRTGERLNELAAQYPQLVVIHQSSNQG 123 Query: 143 KAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLV 202 KAI L T A +EYL+CIDGD++LD DA A+++ ++ NP VGAVTGNPRIRTRSTL+ Sbjct: 124 KAIGLTTAAQVTDAEYLMCIDGDSILDVDAIAWMIRHLMENPAVGAVTGNPRIRTRSTLL 183 Query: 203 GKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWK 262 G++QVGE+SSI+GLIKRTQ++YG + TVSGV+ FR+ A+ EVGYWS+DM+TEDIDISWK Sbjct: 184 GRMQVGEFSSIVGLIKRTQQVYGRLLTVSGVVVMFRKRAIEEVGYWSNDMLTEDIDISWK 243 Query: 263 LQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPL 322 LQ+ WTI YEPRAL WILMPET +GL+KQRLRWA+GG + +K + MWP+ Sbjct: 244 LQVGGWTIRYEPRALSWILMPETFRGLYKQRLRWAKGGIQALIKYAPAMLSLRQSMMWPI 303 Query: 323 FFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLM 382 FFEY L+ +WA+ L+ I ++V V + E A G LL C+LQ ++ Sbjct: 304 FFEYALSVVWAYNMLL-VITWSVLGLFVDMPPEWRMEAFPRWHGTLLFITCVLQLLIGCF 362 Query: 383 IENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRGILR 440 I+ RY+ + +W+PV FW+L+L TT++ F V ++ RARW SPDRGI + Sbjct: 363 IDRRYDDGILRYFIDTVWYPVAFWILNLVTTVIGFPAVAFQRQRARARWTSPDRGIQQ 420 >UniRef50_Q30SK0 Glycosyl transferase, family 2 n=6 Tax=Proteobacteria RepID=Q30SK0_SULDN Length = 433 Score = 362 bits (929), Expect = 2e-98, Method: Composition-based stats. Identities = 194/424 (45%), Positives = 275/424 (64%), Gaps = 2/424 (0%) Query: 18 PLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSI 77 + + + M+ +VF++P FMS +W++G ++F+ E+ + + + + Sbjct: 3 TFFIWWHYIWHAMLGYVFYYPLFMSTLWMIGAIFFYYKSEKRYVEQSIPELRENESWAGV 62 Query: 78 SIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHL 137 SI+IPC+NE +N ETI AL Y EVIA+NDGS D T IL +A P L+V++L Sbjct: 63 SILIPCYNEGENAIETITYALDVIYPEFEVIAINDGSKDDTLDILLSLAKDNPRLKVVNL 122 Query: 138 AQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRT 197 AQNQGKA+AL+ G+ AK E+LVCIDGDAL+D ++ + + P V AVTGNPRIR Sbjct: 123 AQNQGKALALQAGSLVAKHEFLVCIDGDALIDPYCLYWMAKHFIRYPEVAAVTGNPRIRN 182 Query: 198 RSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDI 257 R++L+GKIQVGE+SSI+G+IKR QR +G +FTVSGVI FR+SA+ EVGYWS DM+TEDI Sbjct: 183 RTSLLGKIQVGEFSSIVGMIKRAQRSFGRLFTVSGVITGFRKSAVHEVGYWSPDMLTEDI 242 Query: 258 DISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENF 317 DI+WKLQ N W + +EP++L WILMPET+KGLWKQRLRWA GGA+V KN L+ + Sbjct: 243 DITWKLQRNGWDVRFEPKSLVWILMPETIKGLWKQRLRWAMGGAQVMFKNFKVLFLYKQT 302 Query: 318 RMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAG-VPLNIELTHIAATHTAGILLCTLCLLQ 376 +W L E L+ +WA+T ++ +++ L + ++L CL+Q Sbjct: 303 HLWGLMIELLLSMVWAYTMVLVILVWIFGLLFPIGYLAPSESPILPDQGSVILIGACLVQ 362 Query: 377 FIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 F VS ++ Y+ L + FW+IW+P FW L+L T + + +V+ KK RARWVSPDR Sbjct: 363 FGVSKWLDGHYDEGLGKNYFWMIWYPFAFWFLNLFTAVAALPKVLFG-KKGRARWVSPDR 421 Query: 437 GILR 440 G+ + Sbjct: 422 GVHQ 425 >UniRef50_A3N3L7 Biofilm PGA synthesis N-glycosyltransferase PgaC n=10 Tax=Pasteurellaceae RepID=A3N3L7_ACTP2 Length = 411 Score = 361 bits (928), Expect = 2e-98, Method: Composition-based stats. Identities = 181/413 (43%), Positives = 258/413 (62%), Gaps = 3/413 (0%) Query: 26 SGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFN 85 E+ FVF +P M+ W G+ +++++E+ N + ++ P +S+++PC+N Sbjct: 2 ILEIFSLFVFAYPAVMAFYWAFAGLTYFLFKEK-LKVPPNFDQMKHEEVPLVSLMVPCYN 60 Query: 86 EEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAI 145 E N++E I L +Y N E+I +NDGS D T I+D+ A + + +H N GKA Sbjct: 61 ESDNLDEAIPHLLNLKYPNYELIFINDGSKDHTGEIIDKWAKRDKRIVALH-QANSGKAS 119 Query: 146 ALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKI 205 AL G A+ +Y+ CIDGDA+LD A Y+V+ + NPR GAVTGNPR+R RST++G++ Sbjct: 120 ALNNGLRIARGKYVGCIDGDAVLDYKALDYMVQALESNPRYGAVTGNPRVRNRSTILGRL 179 Query: 206 QVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQL 265 QV E+SSIIGLIKR Q + G +FTVSGV FR+ + E+G WS +MITEDID+SWK+Q Sbjct: 180 QVSEFSSIIGLIKRAQCLMGTIFTVSGVCCLFRKDIMFEIGGWSTNMITEDIDVSWKIQT 239 Query: 266 NQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFE 325 + + IFYEPRALCW+LMPET+ GL+KQRLRWAQGGAE +K ++WR +N R+WP+F E Sbjct: 240 SGYDIFYEPRALCWVLMPETINGLFKQRLRWAQGGAETMMKYFPQIWRLKNRRLWPMFIE 299 Query: 326 YCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIEN 385 Y +T IWA LV ++ L I L A + +L Q +SL I+N Sbjct: 300 YIVTAIWASLLLVSILLSIYNLIF-DNQIGLLDWAELKPSIAILFIAFFTQLSISLYIDN 358 Query: 386 RYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRGI 438 RYE + +F IW+P ++W L+ T L + + K + A W SPDRG+ Sbjct: 359 RYEKGVVKYVFSCIWYPWLYWSLNTITLLCGIPKAIFRNKTKLAVWTSPDRGV 411 >UniRef50_B4CVJ2 Glycosyl transferase family 2 n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4CVJ2_9BACT Length = 1181 Score = 361 bits (926), Expect = 4e-98, Method: Composition-based stats. Identities = 111/440 (25%), Positives = 180/440 (40%), Gaps = 29/440 (6%) Query: 15 LCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDN 74 L + F ++RF+ F ++I + V F + +++ Sbjct: 739 LITSIDRLMFALIFGILRFLSI-AFVVAIALGIARVAFVTSLAIWVYFRSKPRGRPIENP 797 Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRV 134 P +SIIIP +NE+ V TI + LA Y ++E+I V+DGSTD T +++ A +RV Sbjct: 798 PLVSIIIPAYNEQSVVGRTIRSVLANDYPHMEIIFVDDGSTDGTADAVEQEFAGHEKVRV 857 Query: 135 IHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPR 194 + N GKA AL G +K E +V +D D ++ ++ +PRVGAV GN + Sbjct: 858 VR-QVNGGKASALNHGILVSKGEIIVGLDADTQFRKETITRLIR-HFRDPRVGAVAGNVK 915 Query: 195 IRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMIT 254 + R L+ + Q EY + + + V V G I A+RR+AL EVG + D + Sbjct: 916 VGNRINLITRWQALEYITSQNVDRLAYAQLNAVTVVPGAIGAWRRTALDEVGGYLTDTLA 975 Query: 255 EDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRK 314 ED+D++W+++ W I E A+ P T + +KQR RW+ G + K+ L+R Sbjct: 976 EDMDLTWRIRRKGWKIETEAGAVALTEAPATTQAFFKQRFRWSFGTLQCLWKHRRALFRY 1035 Query: 315 ENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAG-------- 366 F L T+W F L I V L + + Sbjct: 1036 G------WFGWVGLPTLWLFQILFQVIAPLVDLQVLYSLWSFGSSWFSEHYLGIVNQAAT 1089 Query: 367 ---------ILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSF 417 +L L + + + + L W+ + L A + Sbjct: 1090 PPGALLQQTLLFYALFYAVELAGAAVAVAIDRDGWRLLPWLFLQRFFYRQLMYAVLWKAV 1149 Query: 418 TRVMLMPKKQRARWVSPDRG 437 R L +R W +R Sbjct: 1150 LRAFL---GERTGWGKLERR 1166 >UniRef50_Q9RQP9 Biofilm PIA synthesis N-glycosyltransferase icaA n=63 Tax=Staphylococcaceae RepID=ICAA_STAA8 Length = 412 Score = 359 bits (923), Expect = 1e-97, Method: Composition-based stats. Identities = 159/411 (38%), Positives = 253/411 (61%), Gaps = 2/411 (0%) Query: 30 MMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKN 89 F+ F+P FMSI WIVG +YF+ RE + P + + I+ ++ C+NE + Sbjct: 3 FFNFLLFYPVFMSIYWIVGSIYFYFTREIRYSLN-KKPDINVDELEGITFLLACYNESET 61 Query: 90 VEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKT 149 +E+T+ LA +YE E+I +NDGS+D T ++ ++ + + L +N+GKA AL Sbjct: 62 IEDTLSNVLALKYEKKEIIIINDGSSDNTAELIYKIKENNDFI-FVDLQENRGKANALNQ 120 Query: 150 GAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGE 209 G A +Y++C+D D ++D+DA Y++E ++P++GAVTGNPRIR +S+++GKIQ E Sbjct: 121 GIKQASYDYVMCLDADTIVDQDAPYYMIENFKHDPKLGAVTGNPRIRNKSSILGKIQTIE 180 Query: 210 YSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWT 269 Y+S+IG IKR+Q + G V T+SGV F++SA+ +VGYW DMITEDI +SWKL L + Sbjct: 181 YASLIGCIKRSQTLAGAVNTISGVFTLFKKSAVVDVGYWDTDMITEDIAVSWKLHLRGYR 240 Query: 270 IFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLT 329 I YEP A+CW+L+PETL GLWKQR+RWAQGG EV L++ + + F ++ L FE ++ Sbjct: 241 IKYEPLAMCWMLVPETLGGLWKQRVRWAQGGHEVLLRDFFSTMKTKRFPLYILMFEQIIS 300 Query: 330 TIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEH 389 +W + L+ + + + + + + ++QF V+L I++RYE Sbjct: 301 ILWVYIVLLYLGYLFITANFLDYTFMTYSFSIFLLSSFTMTFINVIQFTVALFIDSRYEK 360 Query: 390 NLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRGILR 440 + L ++ W+P ++W+++ A LV+F + + K A W SPDRG + Sbjct: 361 KNMAGLIFVSWYPTVYWIINAAVVLVAFPKALKRKKGGYATWSSPDRGNTQ 411 >UniRef50_B9EAX5 Intercellular adhesion protein IcaA homolog n=19 Tax=Staphylococcaceae RepID=B9EAX5_MACCJ Length = 414 Score = 358 bits (919), Expect = 2e-97, Method: Composition-based stats. Identities = 163/414 (39%), Positives = 247/414 (59%), Gaps = 5/414 (1%) Query: 27 GELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNE 86 F+ +P MS++WI+G V +++ ER P+ + IS +IPC+NE Sbjct: 5 TNFFGGFLIVYPVLMSLVWIIGAVLYFLLIERRLKQTP----PERDEYEGISFLIPCYNE 60 Query: 87 EKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIA 146 VEET+ L Y E+I +NDGS+D T ++ + R I L QN+GKA A Sbjct: 61 ADTVEETLVNVLKLSYPKKEIILINDGSSDNTAEKINALKDHY-DFRFIDLQQNRGKANA 119 Query: 147 LKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQ 206 L T A+ ++++ ID D ++D A ++++ + P +GAVTGNPRIR +S+++GKIQ Sbjct: 120 LNTAVKHAQYDFVMGIDADTIIDDAAPYFMIDNFRHAPNLGAVTGNPRIRNKSSILGKIQ 179 Query: 207 VGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLN 266 EY+SIIG IKR Q + G V T+SGV FR+SAL VGYW DDMITEDI +SWK ++ Sbjct: 180 TVEYASIIGSIKRAQTLNGYVNTISGVFTLFRKSALEAVGYWDDDMITEDIAVSWKFHMH 239 Query: 267 QWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEY 326 + I YEPRALCW+L+PET G+WKQRLRWAQGG EV L+ + ++ N +W L+ E Sbjct: 240 HYEIRYEPRALCWMLVPETFTGIWKQRLRWAQGGHEVVLREFKSMLKQPNLPLWLLYLEQ 299 Query: 327 CLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENR 386 ++ IW ++ L + + + ++ A I+L T+ ++QF VSL+I++R Sbjct: 300 LISIIWVYSVLFLLAFTIINIDVLDYYFYAYNVNLLILASIILTTVNVIQFTVSLVIDSR 359 Query: 387 YEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRGILR 440 YE LF++ W+P +W+++ + +F + + K + A W SPDRG ++ Sbjct: 360 YERKNIVYLFFLCWYPTFYWLINACVAVFAFPKALRRKKGEFATWSSPDRGNIQ 413 >UniRef50_A0RGJ4 Glycosyl transferase and polysaccharide deacetylase fusion protein n=66 Tax=Bacillus RepID=A0RGJ4_BACAH Length = 1119 Score = 353 bits (907), Expect = 6e-96, Method: Composition-based stats. Identities = 101/407 (24%), Positives = 174/407 (42%), Gaps = 18/407 (4%) Query: 30 MMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKN 89 ++ +F+ + I V +YF ++R P +S++I +NEEK Sbjct: 717 ILTTIFYVAIGLGIFRFVFLIYFAWKQKRKTLSRYIH----SSYQPFVSVVIAAYNEEKV 772 Query: 90 VEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKT 149 + +TI + L +Y EVI V+DGSTD T ++ + P +R I +N GK+ A+ Sbjct: 773 IAKTIRSILDSKYGEFEVIVVDDGSTDGTSKVMQETFYKHPKVRFI-QKENGGKSSAMNL 831 Query: 150 GAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGE 209 G ++ E +V +D D ++ +DA + +V + V AV+GN ++ R L+ Q E Sbjct: 832 GFQQSRGEIIVTLDADTIIAQDAISLMVR-HFEDQNVAAVSGNVKVGNRRNLLTTWQHVE 890 Query: 210 YSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWT 269 Y + L +R + V G I A+R+ + E GY S+D + ED D++ Sbjct: 891 YITGFNLERRAFDELNCITVVPGAIGAWRKKNVVESGYLSEDTLAEDTDLTITFLRQGHR 950 Query: 270 IFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLT 329 I YE +A + PE +K L KQR RW+ G + K+ L ++ + + L Sbjct: 951 IVYEEKAYAFTESPEDVKSLIKQRYRWSYGTLQCLWKHRKALCNSKHKTLGFI----ALP 1006 Query: 330 TIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEH 389 +W F ++ FI A + + I L G L +++ + + E Sbjct: 1007 NMWLFQYVLQFI---APFADILMIIGLFSSDPLKVLGFYFVFF--LMDLLASLFAFKLEE 1061 Query: 390 NLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 L W+I I+ + S + + W R Sbjct: 1062 ENPKPLVWLILQRFIYRQFMTYVVVKSIFSSI---RGIAVGWNKLKR 1105 >UniRef50_Q82DY8 Putative polysaccharide deacetylase/glycosyltransferase n=1 Tax=Streptomyces avermitilis RepID=Q82DY8_STRAW Length = 790 Score = 346 bits (889), Expect = 8e-94, Method: Composition-based stats. Identities = 107/419 (25%), Positives = 179/419 (42%), Gaps = 22/419 (5%) Query: 24 FHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPC 83 + L V M ++ + YR+R+ P +S+I+P Sbjct: 387 WSMPALASGLVVVGVAVMGRFAMMLVLARTHYRQRNKRRFSWGPEITRP----VSVIVPA 442 Query: 84 FNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGK 143 +NE++ +E TI++ + IE+I V+DGSTD T I++ M +IP++RV+ +N GK Sbjct: 443 YNEKECIENTINSLAQSTHP-IEIIVVDDGSTDGTADIVEAM--RIPNVRVLR-QENAGK 498 Query: 144 AIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVG 203 AL G A + +V +DGD + + D +V+P + VGAV GN ++ R T++G Sbjct: 499 PAALNNGVRNASYDIVVMMDGDTVFEADTVRRLVQP-FADDEVGAVAGNAKVGNRDTVIG 557 Query: 204 KIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKL 263 Q EY L +R + + T+ G I AFRR A+ EVG S+D + ED DI+ + Sbjct: 558 AWQHIEYVMGFNLDRRMYDLLRCMPTIPGAIGAFRREAVLEVGGMSEDTLAEDTDITIAM 617 Query: 264 QLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLW-RKENFRMWPL 322 W + YE A W P +LK LW QR RW+ G + K+ L R + R + Sbjct: 618 HRGGWRVVYEEHARAWTEAPGSLKQLWSQRYRWSYGTMQALWKHRKSLTDRGPSGRFGRV 677 Query: 323 FFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLM 382 + +V + + ++ A+ A + + + L+ + Sbjct: 678 GMP-----LVVIFQIVTPVFAPLIDVFTVYSMIFVDFWASLLAWLAVLVVQLVCAAYAFR 732 Query: 383 IENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRGILRG 441 ++ +++ P+ LV + R RW R G Sbjct: 733 LDREKYR-------YLLMMPLQQLAYRQMMYLVLIHSCITALTGGRLRWQKLKRTGEVG 784 >UniRef50_C8WT25 Glycosyl transferase family 2 n=3 Tax=Bacteria RepID=C8WT25_ALIAD Length = 410 Score = 342 bits (878), Expect = 1e-92, Method: Composition-based stats. Identities = 189/418 (45%), Positives = 258/418 (61%), Gaps = 16/418 (3%) Query: 29 LMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEK 88 ++ F+F +PF MSI+W+VGG + RERH A +P L + P +SI+IPC NE Sbjct: 1 MVHSFLFLYPFVMSIVWMVGGCVYAWRRERH----PFAESPDLPETPFVSILIPCHNEGD 56 Query: 89 NVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALK 148 +E+TI L Y E++A+NDGSTD TRA+L+RMAA +RV++L +GKA AL Sbjct: 57 VLEDTIGRMLQLDYPAYEIVALNDGSTDDTRAVLERMAACDARVRVVNLPVQRGKARALN 116 Query: 149 TGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLY--NPRVGAVTGNPRIRTRSTLVGKIQ 206 G A++ E LV +D DA+L +DA ++V + + RVGAVTGNPRIR R TL+GKIQ Sbjct: 117 AGLVASRGEILVTVDADAVLAKDALRFLVWHFVAPGSERVGAVTGNPRIRNRGTLLGKIQ 176 Query: 207 VGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLN 266 V EY+SIIGLIKR QR+ G + TVSGVIAAFR+ AL + G W +DM+T+DI +SWKL+ Sbjct: 177 VLEYASIIGLIKRAQRVLGKIMTVSGVIAAFRKRALVDCGMWDEDMVTDDIAVSWKLERR 236 Query: 267 QWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEY 326 W I YEPRALC++ PE L+ L +QR RWAQGG EV ++N + LW +N RM PL+ E Sbjct: 237 AWDIRYEPRALCFMWAPERLRSLIRQRARWAQGGVEVLIRNASVLWTWKNRRMIPLYVEE 296 Query: 327 CLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENR 386 L WA+ +V + V I T++A T G L L+Q V+L IE R Sbjct: 297 LLGIAWAYLWVVSLVWTLA--FDVAHGIPWTYVAET---GTWLGLTSLVQTSVALWIEQR 351 Query: 387 YEH-NLTSSLFWIIWFPVIFWMLSLATTLVSFTRVML----MPKKQRARWVSPDRGIL 439 YE +L F+ IW+P +WM+ + + + + + A W SPDRG+ Sbjct: 352 YERDSLWRYYFYAIWYPAAYWMIGAFVVVWAVPKACWAMWAARRGRYATWKSPDRGVS 409 >UniRef50_B5HQE3 Bifunctional transferase/deacetylase n=3 Tax=Streptomyces RepID=B5HQE3_9ACTO Length = 900 Score = 342 bits (878), Expect = 2e-92, Method: Composition-based stats. Identities = 102/420 (24%), Positives = 171/420 (40%), Gaps = 20/420 (4%) Query: 24 FHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPC 83 + L + V M ++ + YR R+ P +++I+P Sbjct: 493 YTVPTLSVGLVIVGVAVMGRFGMMLILARRHYRLRNKRRFSWGPTVTRP----VTVIVPA 548 Query: 84 FNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIP--HLRVIHLAQNQ 141 +NE++ + T+ + + IE+I V+DGS+D T I A + ++RVI N Sbjct: 549 YNEKECIANTLESLSKSTHP-IEIIVVDDGSSDGTSEISREAARALGMTNVRVIR-QDNA 606 Query: 142 GKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTL 201 GK AL G A + ++ +DGD + + DA +V+P +P VGAV GN ++ R T+ Sbjct: 607 GKPAALNNGVRNASYDIVIMMDGDTVFEPDAVHQLVQP-FADPEVGAVAGNAKVGNRDTV 665 Query: 202 VGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISW 261 +G Q EY L +R + + T+ G I AFRR A+ +VG S+D + ED DI+ Sbjct: 666 IGAWQHIEYVMGFNLDRRMYDLLRCMPTIPGAIGAFRREAVLQVGGMSEDTLAEDTDITI 725 Query: 262 KLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWP 321 + + Y+ A W P +LK LW QR RW+ G + K+ L K Sbjct: 726 AIHRAGRRVVYQEHARAWTEAPGSLKQLWSQRYRWSYGTMQALWKHRKSLTDKGPSGR-- 783 Query: 322 LFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSL 381 F + + F + + + I + AA + +L + Sbjct: 784 -FGRVGMPLVVIFQIVTPVFAPLIDVFTAYSMIFIDFRAALYAWLAVLGIQFVCAAYAFR 842 Query: 382 MIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRGILRG 441 + + +Y +++ P+ LV + R RW R G Sbjct: 843 LDKEKYR--------YLLMMPLQQLAYRQMMYLVLIHSCITALTGGRLRWQKLKRTGEVG 894 >UniRef50_A4J4W8 Glycosyl transferase, family 2 n=3 Tax=Clostridia RepID=A4J4W8_DESRM Length = 425 Score = 341 bits (875), Expect = 3e-92, Method: Composition-based stats. Identities = 140/427 (32%), Positives = 232/427 (54%), Gaps = 8/427 (1%) Query: 17 IPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPS 76 + + + F F +P MSI+W++GG+YFW YRER + P Sbjct: 1 MVNTITADTLLHGIGYFTFLYPVTMSIIWVIGGLYFWWYRERKARTNNW-----PEVWPP 55 Query: 77 ISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRM-AAQIPHLRVI 135 ++I+IPC NEE ++ T HA Y ++ V+ ++D STD T I+ ++P ++ Sbjct: 56 VTILIPCHNEEISIATTCHALSKVNYPDLRVVFIDDASTDNTAQIIREWLRQEVPSFHLL 115 Query: 136 HLAQNQGKAIALKTGAAAA-KSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPR 194 L NQGKA AL G A + V ID D L+ D +++ P + PR+GAV+GNP Sbjct: 116 RLTTNQGKAKALNCGLQVAVHTPITVVIDADTLITPDTLKWLIAPFIRQPRLGAVSGNPL 175 Query: 195 IRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMIT 254 + R L+ +Q E++SI+GLIKR+QR G + TVSG I AF L ++G +S T Sbjct: 176 VGNRENLLENLQTAEFASILGLIKRSQRSLGRMLTVSGCITAFCTDTLRQLGGFSSRSAT 235 Query: 255 EDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRK 314 EDIDI+W +Q N + +++EPRA+ +I +P+T+K W QR RWA GG + + Sbjct: 236 EDIDITWAIQRNFYEVWFEPRAIAYIQVPKTIKEFWHQRCRWALGGWHLLRSHWDIFTHW 295 Query: 315 ENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCL 374 R+WP++ ++ ++ +W+F ++G +++ V +P + G ++ +CL Sbjct: 296 RWRRLWPVYLDFVISYLWSFCLVIGTLLWLVTYL-IPSKPAIGLTPIPAWYGSVISFVCL 354 Query: 375 LQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSP 434 +QF +L+ +R++H + S FWI W+P+ F+ + T + + R + +W SP Sbjct: 355 VQFGAALLANHRHDHKMYQSFFWIPWYPIFFFCIGALTVVWTSCRGLFGDLVTVGKWKSP 414 Query: 435 DRGILRG 441 R ++G Sbjct: 415 ARSSMQG 421 >UniRef50_D2LDP7 Polysaccharide deacetylase n=1 Tax=Rhodomicrobium vannielii ATCC 17100 RepID=D2LDP7_RHOVA Length = 1170 Score = 339 bits (871), Expect = 1e-91, Method: Composition-based stats. Identities = 97/432 (22%), Positives = 180/432 (41%), Gaps = 31/432 (7%) Query: 24 FHSGELM---MRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISII 80 F++ L + +F + + +V Y R + P S+S++ Sbjct: 713 FYTLNLFQNTLTTLFIAAIALGLGRLVVLCGLAAYGNRRRKRRADPPYADDL---SVSVL 769 Query: 81 IPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQN 140 IP FNE K + +I LA ++ +EVI ++DGSTD T ++ A P + ++H N Sbjct: 770 IPAFNEAKVITASIRQILASSHQKLEVIVIDDGSTDGTADVVRGEFADDPRVSLMHTP-N 828 Query: 141 QGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRST 200 GKA A+ A A + +V +D D + + +V +P+VGAV GN ++ R Sbjct: 829 GGKARAINLALAQATGDIVVVLDADTQFEPLTISRLVRW-FADPKVGAVAGNAKVGNRIN 887 Query: 201 LVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDIS 260 ++ + Q EY + L +R + V G + A+RR A+ +G + + + ED D++ Sbjct: 888 VLTRWQALEYITAQNLERRALATLDCITVVPGAVGAWRREAIMGLGGFPSNTLAEDQDLT 947 Query: 261 WKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMW 320 +Q + + ++ AL W P+TL GL KQR RWA G + K+ + + Sbjct: 948 ISVQRAGYKVLFDADALAWTEAPDTLGGLAKQRFRWAFGTLQCLWKHRSANLNPRYGALG 1007 Query: 321 PLFFEYCLTTIWAFTCLVGFIIYA-------------VQLAGVPLNIELTHIAATHTAGI 367 + L +W F + I + + T T Sbjct: 1008 MI----ALPQVWLFQIALALISPLVDLLLLVQVVRTGIDYLQHGSQFNSENFTITLTYYA 1063 Query: 368 LLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQ 427 + T+ L +++ ++E R + +L L+W++ + + S + + + + Sbjct: 1064 VFMTVDLSAALIAFLLEKREDRSL---LWWLVLQRFGYRQVMYYVVAKSVVKAL---QGR 1117 Query: 428 RARWVSPDRGIL 439 W +R Sbjct: 1118 VVGWGKLERKAT 1129 >UniRef50_Q2G7Y8 Polysaccharide deacetylase n=1 Tax=Novosphingobium aromaticivorans DSM 12444 RepID=Q2G7Y8_NOVAD Length = 1101 Score = 334 bits (857), Expect = 4e-90, Method: Composition-based stats. Identities = 96/429 (22%), Positives = 177/429 (41%), Gaps = 14/429 (3%) Query: 14 VLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKD 73 + + + VA F + + ++ +F W FF++I + W + A Sbjct: 670 LAAVRVDVAAFLALDGLVT-LFSWLFFVAIALGIARAVIMAGLA--WWQSRSPRAEPPAF 726 Query: 74 NPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLR 133 P++S+IIP +NEE+ + ++ LA Y ++VI +DGS D T A++ R P + Sbjct: 727 EPTVSVIIPAWNEERVIAASVERVLASDYPALQVIVADDGSKDATSAVVARHFGHDPRVT 786 Query: 134 VIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNP 193 ++ L N GKA AL A E ++ +D D + + +PR+GAV G+ Sbjct: 787 LLTL-ANGGKAAALNRALRHATGEVVIALDADTQFEPLTIRRLARW-FADPRIGAVAGDA 844 Query: 194 RIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMI 253 R+ R LV + Q EY + L +R + + V G + A+RR+AL VG + ++ + Sbjct: 845 RVGNRVNLVTRWQAVEYITAQNLERRALAGFDAMTVVPGAVGAWRRAALDSVGGYPENTL 904 Query: 254 TEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWR 313 ED D++ +Q W + Y+PRA+ W P+T + L +QR RWA G + K+ + Sbjct: 905 AEDQDLTIAIQRKGWRVTYDPRAVAWTEAPQTFRALARQRYRWAFGTLQCLWKHRKVITS 964 Query: 314 KENFRMWPLFFEYCLTTIWAFTCLVGFI--IYAVQLAGVPLNIELTHIAATHT----AGI 367 ++ + + AF + I + + + A T I Sbjct: 965 RKPAGLGLVGLPQAWLFQIAFAAISPLIDGALIASIISTVVRVVQHGWAQTQGDLGRMAI 1024 Query: 368 LLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQ 427 + I + + ++ +++ + L + + Sbjct: 1025 YWSLFTAIDVICGWIAYRLDDKRPPYPAHLLVAQRIVYRQIMYWVVLRALASAI---GGW 1081 Query: 428 RARWVSPDR 436 W +R Sbjct: 1082 IVGWGKLER 1090 >UniRef50_B5HP10 Bifunctional transferase/deacetylase n=9 Tax=Streptomyces RepID=B5HP10_9ACTO Length = 741 Score = 334 bits (856), Expect = 5e-90, Method: Composition-based stats. Identities = 106/432 (24%), Positives = 195/432 (45%), Gaps = 29/432 (6%) Query: 15 LCIPLCVAYFHSGELMMRFVFFWPFFMS---IMWIVGGVYFWVYRERHWPWGENAPAPQL 71 + + + L++ + +M ++ G++ R + + WG P Sbjct: 313 WIFLVQASERITDVLVVGLSIIGSLVIGRFALMLLLSGIHARRVRRKGFRWGAPVTQP-- 370 Query: 72 KDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPH 131 +S+++P +NE K +E T+ + +A + IEV+ ++DGS+D T I++ M +P+ Sbjct: 371 -----VSVLVPAYNEAKCIENTVRSLMASEHP-IEVLVIDDGSSDGTARIVEAMG--LPN 422 Query: 132 LRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTG 191 +RV+ N GK AL G A A+ + +V +DGD + + +V+P +PRVGAV G Sbjct: 423 VRVVR-QLNAGKPAALNRGLANARHDIIVMMDGDTVFEPSTVRELVQP-FGDPRVGAVAG 480 Query: 192 NPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDD 251 N ++ R +L+G Q EY L +R + + T+ G + AFRRSAL VG SDD Sbjct: 481 NAKVGNRDSLIGAWQHIEYVMGFNLDRRMYDVLRCMPTIPGAVGAFRRSALERVGGMSDD 540 Query: 252 MITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRL 311 + ED DI+ + + W + Y +A W PE+++ LW QR RW+ G + K+ L Sbjct: 541 TLAEDTDITMAMHRDGWRVVYAEKARAWTEAPESVQQLWSQRYRWSYGTMQAIWKHRRAL 600 Query: 312 W-RKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLC 370 R + R + + + +V ++ + + + T A + + Sbjct: 601 VERGPSGRFGRVGLP-----LVSLFMVVAPLLAPLIDVFLLYGVVFGPTQKTIVAWLGVL 655 Query: 371 TLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRAR 430 + ++ + R + + L + +++ L L S+ + R R Sbjct: 656 AIQVVCAAYAF----RLDRERMTHLISLPLQQILYRQLMYVVLLQSWITALT---GGRLR 708 Query: 431 WVSPDR-GILRG 441 W R G++ Sbjct: 709 WQKLRRTGVVEA 720 >UniRef50_B0T3D0 Polysaccharide deacetylase n=2 Tax=Caulobacter RepID=B0T3D0_CAUSK Length = 1124 Score = 333 bits (854), Expect = 8e-90, Method: Composition-based stats. Identities = 101/416 (24%), Positives = 171/416 (41%), Gaps = 20/416 (4%) Query: 35 FFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPA-PQLKDNPSISIIIPCFNEEKNVEET 93 F +I + + F W +P + P +S++IPCFNEEK + + Sbjct: 711 LTALFLTAIALGLARLVFLACLALVHRWTHQSPENLDPETGPLVSVLIPCFNEEKVIAAS 770 Query: 94 IHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAA 153 + L ++N+EV+ ++DGS D T + R P + ++ +N GKA A+ G A Sbjct: 771 VARILESEWKNLEVLVLDDGSKDNTAQEVRRAHGDDPRVTLLSF-ENGGKARAVNRGLAI 829 Query: 154 AKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSI 213 AK +Y+V +D D L ++ +P +GAV GN + R +V + Q EY + Sbjct: 830 AKGDYVVALDADTLFPPKTIGRLIRW-FQDPTIGAVAGNAIVGNRVNMVTRWQALEYVTA 888 Query: 214 IGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYE 273 L +R G V V G + A+R+S L +G + D + ED D++ Q W + ++ Sbjct: 889 QNLERRALAALGAVTVVPGAVGAWRKSVLDALGGYPSDTLAEDQDLTIACQRAGWKVAFD 948 Query: 274 PRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWA 333 P A + P+T+ GL KQR RW+ G + K+ L+ + + + L IW Sbjct: 949 PAAQAFTEAPDTVGGLLKQRFRWSFGTLQCVWKHRAALFSPKTPALGFV----ALPQIWL 1004 Query: 334 FTCLVGFIIYAVQLAGVPLNIELTHIAATHTA----------GILLCTLCLLQFIVSLMI 383 F L+ V LA V I + A H + L+ + Sbjct: 1005 FQILLAVAAPLVDLAVVWSLISGVYGAIAHPVEWSPDDTIQGLLYWAVFILVDLSAGALG 1064 Query: 384 ENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRGIL 439 + + L ++ + L + S + + R W +R Sbjct: 1065 MALEKRAPWADLPYLPVQRFGYRQLMYYVVVKSV---LTAARGGRVGWGKLERRAT 1117 >UniRef50_C9RCL0 Glycosyl transferase family 2 n=1 Tax=Ammonifex degensii KC4 RepID=C9RCL0_AMMDK Length = 429 Score = 331 bits (849), Expect = 4e-89, Method: Composition-based stats. Identities = 146/412 (35%), Positives = 227/412 (55%), Gaps = 6/412 (1%) Query: 26 SGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFN 85 + L+ ++F +PF MS++W+ G+YFW RER P E P L P ++I+IPC N Sbjct: 6 AWWLLGWYIFLYPFAMSVLWVTAGLYFWWRRER-KPEREKKFWPDL--WPPVTILIPCHN 62 Query: 86 EEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAI 145 E ++ T A Y + V+ ++D STD+T ++ R P+ ++ LA+NQGKA Sbjct: 63 EAASIAATATALQFLDYPDYRVVFIDDASTDETAEVIRRFLPSNPNFHLLRLAENQGKAQ 122 Query: 146 ALKTGAAAA-KSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGK 204 AL A A + V ID D LL A Y+V P PR+GAVTGNP R L+ K Sbjct: 123 ALNCALATAVTTPITVVIDADTLLAPQALKYLVAPFCRQPRLGAVTGNPIALNRKNLLEK 182 Query: 205 IQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQ 264 +Q E++SIIGLIKR QR+ G V TVSG IAA+R L EVG +S TEDIDI+W++Q Sbjct: 183 LQAAEFASIIGLIKRAQRVLGRVLTVSGCIAAYRTEVLREVGGFSSRTATEDIDITWQIQ 242 Query: 265 LNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFF 324 + +++ P+A+ +I P LK WKQR RWA GG + + R ++P++ Sbjct: 243 RRFYEVWFAPQAVAFIQCPSRLKEYWKQRRRWALGGWHLLRTHKDIFKRWRWRYLYPVYL 302 Query: 325 EYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIE 384 E+ L+ +W+F + G +++ V + + G +L C++Q +L++ Sbjct: 303 EFVLSYLWSFAFVFGTLLWLVT--RLFWGQPVGFSPIPVWYGAMLSLACIVQIATALVLN 360 Query: 385 NRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 RY+ L ++FW+ W+P+ F++ T + + + + Q +W SP+R Sbjct: 361 RRYDPKLWRTVFWVPWYPLFFFVFGALTVVWTAPKGLFGSLAQAGKWKSPER 412 >UniRef50_D2L983 Polysaccharide deacetylase n=1 Tax=Desulfovibrio sp. FW1012B RepID=D2L983_9DELT Length = 1140 Score = 329 bits (845), Expect = 9e-89, Method: Composition-based stats. Identities = 96/435 (22%), Positives = 174/435 (40%), Gaps = 19/435 (4%) Query: 16 CIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWP-WGENAPAPQLKDN 74 + L + F + +G + G P + Sbjct: 706 LVALGNRVGFFLLFAWSAGLQYLFVTGTILGLGRLLILAVLAVFEKVRGRRRPVSGPAPD 765 Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRYEN-IEVIAVNDGSTDKTRAILDRMAAQIPHLR 133 S+++++P +NEEK V +T+ + LA ++ E+I V+DGSTD T +L A L Sbjct: 766 LSVAVVVPAYNEEKVVLQTVQSLLACQHPATFEIIVVDDGSTDATYRVLCEALA-GEKLV 824 Query: 134 VIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNP 193 I N GK AL G A +++ +V +D D + RD + + +P+VGAV GN Sbjct: 825 TIVTKPNGGKPAALNHGIALTRADIVVTLDADTVFARDTILRLADW-FRDPKVGAVAGNA 883 Query: 194 RIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMI 253 ++ R + + Q EY + L +R + +V V G + A+RR + G +S + + Sbjct: 884 KVGNRINFLTRCQALEYVTSQNLDRRALTVLDSVTVVPGAVGAWRREVVEAAGGFSGETL 943 Query: 254 TEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWR 313 ED D++ ++Q T+ YE RA+ P+T++G +QR RW G +V K+ L+R Sbjct: 944 AEDADLTIRIQRMGHTVAYEDRAVALTEAPDTMRGFLRQRFRWMFGTLQVAWKHKDALFR 1003 Query: 314 KENFRMWPLFFEYCLTTIWAFTCLVGFI-IYA--------VQLAGVPLNIELTHIAATHT 364 + F + + ++ V P + + Sbjct: 1004 PRYGLLGFFGLPNIWLYQIFFQIISPVMDLWLAYTCLKSWVLWLWHPATWDPDALFRVLF 1063 Query: 365 AGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMP 424 L +L +V+ ++E + L L W++ + L L + + Sbjct: 1064 YYALFMAADILAGLVAFLLERGEDKRL---LAWLVPQRFFYRQLMYVVALRTLLASL--- 1117 Query: 425 KKQRARWVSPDRGIL 439 + + W +R Sbjct: 1118 RGREMGWSKLERKAT 1132 >UniRef50_UPI0001B4D705 bi-functional transferase/deacetylase n=3 Tax=Streptomyces RepID=UPI0001B4D705 Length = 700 Score = 329 bits (845), Expect = 1e-88, Method: Composition-based stats. Identities = 105/426 (24%), Positives = 183/426 (42%), Gaps = 16/426 (3%) Query: 16 CIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNP 75 + M + + + + + R + +P + ++P Sbjct: 277 LLWAGAIGRTFTNAMAWVLAIAGALGLLRLVTLVFFARAHVRRLTRFRPGSPWLREVNDP 336 Query: 76 SISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVI 135 +++I+P +NEE +E T+ + LA + ++++I V+DGSTD+T + + P + VI Sbjct: 337 -VTVIVPAYNEEAGIEATVRSLLASTHPHLQIIVVDDGSTDRTADLATWI--DDPRISVI 393 Query: 136 HLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRI 195 N GKA AL TG A A + +V +D D + + DA +++P+ +P +GAV+GN ++ Sbjct: 394 R-QINSGKATALNTGLAHAAHDIVVMVDADTVFEADAVHQLIQPL-AHPAIGAVSGNTKV 451 Query: 196 RTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITE 255 R +L+G+ Q EY L +R + + TV G I AFRR AL VG S+D + E Sbjct: 452 GNRRSLLGRWQHLEYVFGFNLDRRMFEVLECMPTVPGAIGAFRRDALMGVGGVSEDTLAE 511 Query: 256 DIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKE 315 D D++ L W + YE A+ W +P +L+ LW+QR RW G + K+ Sbjct: 512 DTDLTMSLWRAGWRVVYEETAVAWTEVPTSLRQLWRQRYRWCFGTLQSMWKHRRAAVELG 571 Query: 316 NFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLL 375 F L+ + F L+ I V L + + L+ + + L + Sbjct: 572 PAGR---FGRRGLSYLVLFQVLLPLIAPIVDLFALYGALFLSPVQSAGIWCAFLAVQLIC 628 Query: 376 QFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPD 435 + R SL+ + + ++ L + S +L R W Sbjct: 629 AGYALRLDGERMR-----SLWAMPFQLFVYRQLMYLVVIQSVVAALL---GARLTWHRMH 680 Query: 436 RGILRG 441 R Sbjct: 681 RSGTAA 686 >UniRef50_C0YGU0 Polysaccharide deacetylase/glycosyl transferase, group 2 family protein n=2 Tax=Bacteroidetes RepID=C0YGU0_9FLAO Length = 1132 Score = 329 bits (844), Expect = 1e-88, Method: Composition-based stats. Identities = 103/426 (24%), Positives = 170/426 (39%), Gaps = 24/426 (5%) Query: 16 CIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNP 75 + + Y S L+ F F + + ++ + + + P L+ P Sbjct: 716 LVLATIIYGVSHFLVALFTIFIVLGLIRLLLMA----YWAFKERKKEKKLGEFPVLESYP 771 Query: 76 SISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVI 135 +SII+P +NEE N+ ++ L Q Y N ++I V+DGS D T P L++ Sbjct: 772 KVSIIVPAYNEEVNIVSSLQNLLKQTYPNFDIIMVDDGSKDSTYDKAKAAFPDHPKLKIF 831 Query: 136 HLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPML-YNP--RVGAVTGN 192 +N GKA AL G + +EY+VCID D L +DA Y++ L +P ++ AV GN Sbjct: 832 -TKRNGGKATALNFGISQTDAEYVVCIDADTKLQQDAVKYLIARFLNSDPEEKIAAVAGN 890 Query: 193 PRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDM 252 ++ R + K Q EY++ + + + G I AF+RS + E G +S D Sbjct: 891 VKVGNRVNWLTKWQAIEYTTSQNFDRLAYANINAITVIPGAIGAFKRSVVIETGGYSSDT 950 Query: 253 ITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLW 312 + ED DI+ K+ +T+ E RA+ PET+K KQR RW G ++F K Sbjct: 951 LAEDCDITVKILKAGYTVANENRAVAVTEAPETVKQFLKQRFRWTYGIMQMFWKQRQTFL 1010 Query: 313 RKENFRMWPLFFEYCLTT--IWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLC 370 + L I F + +I + + T+ L Sbjct: 1011 NPRYKGLGLWAMPNILLFQYIIPFFSPLADVIMFFGILSGNGDKIFTYYLIFLLVDASLA 1070 Query: 371 TLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRAR 430 + + + +L +II + L S + + K + Sbjct: 1071 LIAFIM-----------QREKLINLLYIIPQRFGYRWLMYIVLFKSLRKAL---KGEMQS 1116 Query: 431 WVSPDR 436 W R Sbjct: 1117 WGFLKR 1122 >UniRef50_Q1IUP8 Polysaccharide deacetylase n=1 Tax=Candidatus Koribacter versatilis Ellin345 RepID=Q1IUP8_ACIBL Length = 1154 Score = 328 bits (841), Expect = 3e-88, Method: Composition-based stats. Identities = 91/438 (20%), Positives = 169/438 (38%), Gaps = 25/438 (5%) Query: 14 VLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKD 73 + +++ G + FF+ + + G + F + Sbjct: 715 WAAMVASLSFILFGAVSQ--FIIAVFFVGDVLMTGRLVFIGTLAIYDRIRGPRLTADPDY 772 Query: 74 NPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDR-MAAQIPHL 132 P+++++IP +NEEK +E T+ + L Y + I ++DGS D T ++++ AA+I Sbjct: 773 RPAVAVLIPAYNEEKVIERTVRSVLDSDYPKLRAIVIDDGSKDATVEVVEQLFAAEIASG 832 Query: 133 RV-IHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTG 191 +V + N GKA AL G E V ID D ++ DA +V P NP++ A+ G Sbjct: 833 KVTLLTKPNSGKAAALNYGLEFVTEEIFVGIDADTIIAPDAIGLLV-PHFQNPKIAAIAG 891 Query: 192 NPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDD 251 N ++ R + Q EY + +R ++G V V G I A+R A+ G + D Sbjct: 892 NAKVGNRVNWWTRWQALEYITSQNFERRALDVFGAVSVVPGAIGAWRTEAVLAAGKYHHD 951 Query: 252 MITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRL 311 + ED D++ L + + + YE AL + P T GL +QR RW+ G + K+ + Sbjct: 952 TVAEDADLTMALLQDGYRVEYEDLALAYTEAPSTANGLMRQRFRWSFGIMQSVYKHRSA- 1010 Query: 312 WRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTH------------- 358 F+ + L + F L+ + + L + Sbjct: 1011 -----FKQGGALGWFALPNVVIFQILLPLVSPFIDLMFLFGAGSYAWNRYMHPESTDPSS 1065 Query: 359 IAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFT 418 L + + ++ +E R + + + + ++V Sbjct: 1066 FHKLVLYFALFLVIDFVASTIAFTLERRQPGGQ-KDFWLLAHVWLQRFAYRQLFSIVLIK 1124 Query: 419 RVMLMPKKQRARWVSPDR 436 + + W +R Sbjct: 1125 TLKRAIEGGEFAWDKLER 1142 >UniRef50_C8WBS4 Polysaccharide deacetylase n=4 Tax=Sphingomonadaceae RepID=C8WBS4_ZYMMN Length = 1126 Score = 327 bits (838), Expect = 6e-88, Method: Composition-based stats. Identities = 99/434 (22%), Positives = 186/434 (42%), Gaps = 19/434 (4%) Query: 14 VLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKD 73 V + G ++ + + ++ + I+ ++ + P + Sbjct: 693 VSARWNFALFSCLGASVIALRWIFAIAIT-LGILRALFLSAFSIIQARKENRLIFPPIDP 751 Query: 74 NPSISIIIPCFNEEKNVEETIHAALAQ-RYENIEVIAVNDGSTDKTRAILDRMAAQIPHL 132 +IS++IP FNEE +E +I LA NIEVI ++DGSTD + I++ A P + Sbjct: 752 ERTISVLIPAFNEEAVIEASIRRVLASAEVNNIEVIVIDDGSTDNSSQIVESQFADDPRV 811 Query: 133 RVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGN 192 ++I L+ N GKA AL G AK E ++ +D D + A + +P++GAV GN Sbjct: 812 QLIRLS-NGGKARALNHGVQKAKGEIIIALDADTHFEPRTIARLTRW-FSDPKLGAVAGN 869 Query: 193 PRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDM 252 ++ R L+ + Q EY + L +R + + V G + A+R L +VG + D Sbjct: 870 AKVGNRINLITRWQALEYITAQNLERRATVLLNAMTVVPGAVGAWRAETLRQVGGFPDQT 929 Query: 253 ITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLW 312 + ED D++ +Q + W + Y+P A+ W PET++ L +QR RWA G + K+ + + Sbjct: 930 LAEDQDLTIIIQEHDWAVRYDPYAVAWTEAPETIRALARQRFRWAFGTLQCLWKHWSIIK 989 Query: 313 RKENFRMWPLFFEYCLTTIWAFTCLVGFI--IYAVQLAGVPLNIELTHIAA-------TH 363 + + + L F + I + + + Sbjct: 990 NRRPKGLAYIGLPQSLIFQIGFATISPIIDLALVISIMATVFAVYQHGWVQQGDDLQKMA 1049 Query: 364 TAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLM 423 + + L+ +V+ +E + + S L+ +I + + + + + T+ + Sbjct: 1050 AYWSVFTLIDLMSGVVAFALERKEK---WSLLWLLIPQRIGYRQIMYYVVIKALTQAIRG 1106 Query: 424 PKKQRARWVSPDRG 437 PK W +R Sbjct: 1107 PK---VGWDKLERS 1117 >UniRef50_Q1IL87 Glycosyl transferase, family 2 n=1 Tax=Candidatus Koribacter versatilis Ellin345 RepID=Q1IL87_ACIBL Length = 422 Score = 326 bits (837), Expect = 8e-88, Method: Composition-based stats. Identities = 137/410 (33%), Positives = 218/410 (53%), Gaps = 12/410 (2%) Query: 28 ELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEE 87 +RF +P MS +WI ++F E+ P +SI++P F E Sbjct: 23 VYALRFYGLYPILMSWVWISLSLFF-------RRRQEDTEMEMSGPAPMVSILVPAFAEA 75 Query: 88 KNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIAL 147 + +++TI A L Y N EVI VND S D T ++ + P +R+++ N+GKA+AL Sbjct: 76 ETIDDTIEALLKLDYPNYEVILVNDCSPDNTAEVVRQY-LDDPRIRLLNKQVNEGKAMAL 134 Query: 148 KTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQV 207 + E LV ID D ++ RD Y+V P RV AVTGNPR+R R +++ +Q Sbjct: 135 NDALPMCRGEILVVIDADIIVSRDLLNYMV-PHFAGTRVAAVTGNPRVRNRVSILQHLQA 193 Query: 208 GEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQ 267 E+SSI+ + +R QR+ G V TVSG + A RRSAL E+G ++ M TEDID++W+LQ+ Sbjct: 194 VEFSSIVSMQRRAQRVLGRVLTVSGAVFAVRRSALLELGGFTPHMATEDIDLTWRLQMKF 253 Query: 268 WTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYC 327 W + YEPRA+ W+ +P +L+ LWKQR RWA+G +V ++ + RMWP+F+E Sbjct: 254 WDVRYEPRAVVWMQVPLSLRELWKQRKRWARGLVQVLKRHREVPTNWKMRRMWPIFYESI 313 Query: 328 LTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRY 387 + +W++ ++ + + LA + G+++ T CLLQ + ++ +Y Sbjct: 314 FSILWSYVFVLMTSYWLISLA--VGYAPRGVSPFPNFWGMMIATTCLLQLFIGAWVDRQY 371 Query: 388 EHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRG 437 + + S +++PVI+WML T + + W RG Sbjct: 372 DPGIMWSFPEAVFYPVIYWMLMALITSFYTIPALFKKPPRVQTW-RIRRG 420 >UniRef50_D2AU81 Polysaccharide deacetylase n=1 Tax=Streptosporangium roseum DSM 43021 RepID=D2AU81_STRRD Length = 752 Score = 326 bits (837), Expect = 9e-88, Method: Composition-based stats. Identities = 113/413 (27%), Positives = 180/413 (43%), Gaps = 20/413 (4%) Query: 26 SGELMMRFVFFWPFFMSIMWIVGGVYFWVYRER-HWPWGENAPAPQLKDNPSISIIIPCF 84 M + + + V WV+ R AP + P++++I+P + Sbjct: 352 FVTAMSWILVVAGVITLLRLLFFLVLAWVHARRVRGGKRRAGRAPAWPEPPAVTVIVPAY 411 Query: 85 NEEKNVEETIHAALAQRYEN-IEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGK 143 NE +E T+ + + Y +EV+ V+DGS+D T AI + +P +RVI +N GK Sbjct: 412 NEAAGIEATVRSLVNTDYPGVLEVVVVDDGSSDDTAAIAASLG--LPGVRVIR-QENGGK 468 Query: 144 AIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVG 203 AL TG A A + LV +DGD + + ++V P+ +P VGAV+GN ++ R ++G Sbjct: 469 PSALNTGIAHASHDILVMVDGDTVFEPATIGHLVRPL-SDPAVGAVSGNTKVGNRRGMIG 527 Query: 204 KIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKL 263 + Q EY L +R + G + TV G I AFRRSAL E+G S D + ED D++ + Sbjct: 528 RWQHIEYVIGFNLDRRAFDLLGCMPTVPGAIGAFRRSALQEIGGVSVDTLAEDTDLTMAM 587 Query: 264 QLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLF 323 W + YE AL W P +L LW+QR RW G + K+ + F Sbjct: 588 CRGGWRVVYEENALAWTEAPTSLSQLWRQRYRWCYGTLQAMWKHRRAITEPSPFGRR--- 644 Query: 324 FEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMI 383 CL + F ++ + V + V + + L+Q S Sbjct: 645 ---CLGYLTLFQVVLPLLAPVVDVMAVYSVVMGDPLPVVAV----WAGFVLVQAF-SGWY 696 Query: 384 ENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 R + S L+ + ++ L + S +L R RW + R Sbjct: 697 ALRLDRERASVLWVLPLQQFVYRQLMYLVVIQSVATAVLGV---RLRWQTIRR 746 >UniRef50_B5GIB2 Bi-functional transferase/deacetylase n=7 Tax=Streptomyces RepID=B5GIB2_9ACTO Length = 767 Score = 325 bits (833), Expect = 2e-87, Method: Composition-based stats. Identities = 99/438 (22%), Positives = 180/438 (41%), Gaps = 18/438 (4%) Query: 3 NRIVSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPW 62 + + L S + V S++ G+ + Sbjct: 310 SALAPVSGTELWKGRAWVWLVAASDSVTGVLVTGLAVTGSLVLARFGLMLLLSFAHARRT 369 Query: 63 GENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAIL 122 A + +++++P +NE K V T+ + + +E+I ++DGSTD T I+ Sbjct: 370 RRRGFAWGVPVTEPVTVLVPAYNEAKCVTATVTSLSRSEHP-VEIIVIDDGSTDDTAGIV 428 Query: 123 DRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLY 182 +R+ +P +RV+ +N GK AL G A A + +V +DGD + + +V+P Sbjct: 429 ERLG--LPGVRVVR-QENAGKPAALNRGIAHASHDIIVMMDGDTVFEPATVRELVQP-FG 484 Query: 183 NPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSAL 242 +PRVGAV GN ++ R +L+G Q EY L +R + + T+ G + AFRR AL Sbjct: 485 DPRVGAVAGNAKVGNRDSLIGAWQHIEYVMGFNLDRRMYDVLRCMPTIPGAVGAFRRDAL 544 Query: 243 AEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAE 302 VG S+D + ED DI+ + + W + Y +A W PE++ LW QR RW+ G + Sbjct: 545 TRVGGMSEDTLAEDTDITMAIHRDGWRVVYAEKARAWTEAPESVAQLWSQRYRWSYGTMQ 604 Query: 303 VFLKNMTRLW-RKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAA 361 K+ + R + R + + + ++ ++ + + I Sbjct: 605 AIWKHRHAVLERGASGRFGRVGLP-----LVSLFMVLAPLLAPLIDVFLLYGIVFGPTGR 659 Query: 362 THTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVM 421 T A + + + + + R + L + +++ L L S+ + Sbjct: 660 TLLAWLGVLAVQAVCAAYAF----RLDKEPMRHLVSLPLQQLLYRQLMYVVLLQSWITAL 715 Query: 422 LMPKKQRARWVSPDRGIL 439 R RW R + Sbjct: 716 T---GGRLRWQKLRRTGV 730 >UniRef50_C1ACV5 Putative glycosyl transferase/polysaccharide deacetylase n=1 Tax=Gemmatimonas aurantiaca T-27 RepID=C1ACV5_GEMAT Length = 1123 Score = 320 bits (822), Expect = 4e-86, Method: Composition-based stats. Identities = 102/401 (25%), Positives = 167/401 (41%), Gaps = 18/401 (4%) Query: 45 WIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYEN 104 I + + + + A + P +S+++P +NE + + T+ + L+Q Y + Sbjct: 721 VIRLLLIGTLATVQRFSKRYARRAADAEWLPRVSVLVPAYNEGRVIGRTVQSVLSQAYPD 780 Query: 105 IEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDG 164 +EV+ V+DGS+D T A L V+ N GKA AL TG A A E +V ID Sbjct: 781 LEVVVVDDGSSDDTHD-AASHATDDARLHVVR-QTNAGKAAALNTGIAMATGEVIVVIDA 838 Query: 165 DALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIY 224 D +L DA ++V P+ + RVGAV GN ++ R L+ + Q EY + L +R + Sbjct: 839 DTILAPDAIRHLVRPL-ADARVGAVAGNAKVGNRINLLTRWQAVEYVTSQNLDRRAFVML 897 Query: 225 GNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPE 284 + V G I A+RRSA+ + G + D + ED D++ L + +A+ PE Sbjct: 898 NCITVVPGAIGAWRRSAVLDAGGFRTDTLAEDQDLTLTLLRGGHKVALAEQAVALTEAPE 957 Query: 285 TLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYA 344 T L KQR RW+ G + K+ L R++ + + L IW F L + A Sbjct: 958 TFGALLKQRFRWSFGTLQCAWKHRGALMRRDAGALGMV----GLPNIWLFQLLFPLLAPA 1013 Query: 345 VQL---------AGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSL 395 + + + A LL L LL + ++ +E L Sbjct: 1014 ADVALLASLARLLLEAPALGVHAAWAHAEPVFLLYALFLLIDTFTAVLGVAFEKGE--PL 1071 Query: 396 FWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 + P+ + + K W +R Sbjct: 1072 SQALLVPLQRVAYRQVLYVALLKAMRAAVKGWAPGWGKLER 1112 >UniRef50_D1SD50 Polysaccharide deacetylase n=1 Tax=Micromonospora aurantiaca ATCC 27029 RepID=D1SD50_9ACTO Length = 776 Score = 317 bits (814), Expect = 3e-85, Method: Composition-based stats. Identities = 105/420 (25%), Positives = 178/420 (42%), Gaps = 17/420 (4%) Query: 19 LCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAP-QLKDNPSI 77 + V + M + + + ++ + + +V RH + Sbjct: 359 VLVGAVRVADGMFGMLGVFFVLVGVLTVGRTLLLFVMAPRHAARRRRRDWSWGPPVTEPV 418 Query: 78 SIIIPCFNEEKNVEETIHAALAQRYEN-IEVIAVNDGSTDKTRAILDRMAAQIPHLRVIH 136 S+I+P +NE + + + + + IEV+ V+DGSTD T I+ + ++P++RV+ Sbjct: 419 SVIVPAYNEREGIAAAVRSLALGDHPGGIEVVVVDDGSTDGTADIVAAL--RLPNVRVVR 476 Query: 137 LAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIR 196 N GK AL TG A A+ + +V +DGD + + D+ +V+P +P VG V GN ++ Sbjct: 477 -KPNGGKPSALNTGVALARHDLIVMVDGDTIFEPDSVRRLVQP-FADPGVGVVAGNVKVG 534 Query: 197 TRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITED 256 R L+ K Q EY L +R + TV G I AFRR AL +VG +DD + ED Sbjct: 535 NRRGLIAKWQHIEYVIGFNLDRRLYETLRCMPTVPGAIGAFRRQALEQVGGMTDDTLAED 594 Query: 257 IDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKEN 316 D++ L W I YE A W P T+ LWKQR RW+ G + K+ + Sbjct: 595 TDVTIALGRAGWHIVYEESARAWTEAPTTVGQLWKQRYRWSYGTLQAMWKHRRSVIDSGR 654 Query: 317 FRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQ 376 F CL+ + F L+ + L + + + T A + + L L Sbjct: 655 SGR---FGRRCLSFLTLFGVLLPLAAPVIDLLAI-YGLIFLDRSDTVVAWLAMLALQFLT 710 Query: 377 FIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 +++ R + L+ + ++ + L + + R W R Sbjct: 711 AVLAF----RLDREKLGVLWVLPLQQFVYRQVMYLVLLQAVGTALT---GGRLGWQKLRR 763 >UniRef50_A1UIF2 Polysaccharide deacetylase n=3 Tax=Mycobacterium RepID=A1UIF2_MYCSK Length = 789 Score = 317 bits (814), Expect = 3e-85, Method: Composition-based stats. Identities = 108/434 (24%), Positives = 176/434 (40%), Gaps = 26/434 (5%) Query: 17 IPLCVAYFHSGELM-----MRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENA-PAPQ 70 P A +H+ LM F + M + ++ +Y + + Sbjct: 348 YPADAATYHTLRLMEKAPGAVLGFLFWLGMGSLTVMSLLYLILALVCQYRQNRLRWNDIG 407 Query: 71 LKDNPSISIIIPCFNEEKNVEETIHAALAQRYEN--IEVIAVNDGSTDKTRAILDRMAAQ 128 P +S+++ FNEEK + TI Y EV+AVNDGSTD T IL +A Sbjct: 408 DDQLPMVSVVLAAFNEEKVIARTIAELRRSDYPRSRFEVVAVNDGSTDGTLRILTELARD 467 Query: 129 IPHLRVIHLAQNQGKAIALKTGAAAAKS--EYLVCIDGDALLDRDAAAYIVEPMLYN--- 183 P LRV+ N GK+ A+ G A + +V +D D L D + + Sbjct: 468 WPKLRVVD-QANSGKSSAINNGINHASAVSTVMVTMDADTLFRPDTIRNLARHFARHTHG 526 Query: 184 PRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALA 243 +VGAV G+ ++ R L+ Q EY S I + + +R+ + V G +A+ R+AL Sbjct: 527 RQVGAVAGHIKVGNRRNLLTAWQSLEYISGICVTRMAERLLNAISIVPGACSAWSRTALE 586 Query: 244 EVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEV 303 E+G + DD + ED D + LQ + I E A+ PET++ L KQR RW G + Sbjct: 587 EIGGFCDDTMAEDCDATLALQRRGYRILQENNAIADTEAPETIRALAKQRKRWTYGNIQA 646 Query: 304 FLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATH 363 K+ L+R + + Y ++ + I A ++ N + + Sbjct: 647 LWKHRAMLFRPRYGALGLVALPYAALSLIVPLLFMPLTIVAAGMSLAAGNWQ----SIAL 702 Query: 364 TAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLM 423 AG + ++ M R H L ++ II+ P+ ++L + Sbjct: 703 FAGFVAALHMIISITAVAMARERAWHLLVVPVYRIIYEPLRAYLL--------YASAYRA 754 Query: 424 PKKQRARWVSPDRG 437 K W +R Sbjct: 755 IKGTIVAWDKLERR 768 >UniRef50_Q1GJ85 Glycosyl transferase family 2 n=4 Tax=Rhodobacteraceae RepID=Q1GJ85_SILST Length = 1002 Score = 315 bits (809), Expect = 2e-84, Method: Composition-based stats. Identities = 96/420 (22%), Positives = 185/420 (44%), Gaps = 15/420 (3%) Query: 19 LCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSIS 78 + + G+ + +F+ + ++ V + V R P ++ Sbjct: 584 VSFSLVAWGQDAIVILFWLALGIGVVRSVAILLLAVLNWRGHRTISLT-------TPKVA 636 Query: 79 IIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLA 138 +IIP NEEK + I + A Y+N+E+I V+DGS+D T + + + +R+I Sbjct: 637 VIIPAHNEEKVIRSCIQSVRASDYKNLEIIVVDDGSSDNTLNEIFAFS-HMREVRLIS-Q 694 Query: 139 QNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTR 198 NQGK AL +E +VCID D +++ A ++V NPR+GAV G + Sbjct: 695 PNQGKWSALNRALMNTSAEIVVCIDADTQIEKSAIGHMVR-HFDNPRIGAVAGKIIAGNK 753 Query: 199 STLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDID 258 L+ ++Q EY++ + ++ + + V G + A+R +AL + G++SD+ +TED D Sbjct: 754 VNLLTRLQALEYTTAQNVERKAFDLINGMLVVPGALGAWRVAALRKAGHFSDETMTEDTD 813 Query: 259 ISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFR 318 ++ ++ + I YEP A + +PE + L KQRLRW+ G + K+ ++ + Sbjct: 814 LTIEVNRAGYRIAYEPLARGYTEVPERIGQLLKQRLRWSFGMFQSAWKHKKAMFEGRSVG 873 Query: 319 MWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCL--LQ 376 + + + ++ + + A+ L + + LL L L L+ Sbjct: 874 LISIPDMFIFGYLFPLLAPIADLFVAILLYQMVSGGWDSGAVGAQNMQYLLAYLTLPALE 933 Query: 377 FIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 F+++ R + SL V++ + + + + R + W + R Sbjct: 934 FVIAAFALARDKDESMWSLLLFPVQRVLYRPILYYSVIRAILRAITGRLF---SWGAQKR 990 >UniRef50_B6BG61 Glycosyl transferase, group 2 family protein n=2 Tax=Rhodobacterales RepID=B6BG61_9RHOB Length = 1140 Score = 315 bits (807), Expect = 3e-84, Method: Composition-based stats. Identities = 92/396 (23%), Positives = 175/396 (44%), Gaps = 11/396 (2%) Query: 46 IVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYENI 105 + G+ L+ P ++++IP NE K + ++I + A Y+N+ Sbjct: 739 LAIGLLRSAGILILALLNRRERFTPLQRQPKVAVVIPAHNEAKVIAQSIESVRASGYKNL 798 Query: 106 EVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGD 165 E+I V+DGSTD T + + + +R+I NQGK AL + +E+ VCID D Sbjct: 799 EIIVVDDGSTDDTLLEVLKFGHK-SEVRLIS-QPNQGKWSALNRAIQSTDAEFAVCIDAD 856 Query: 166 ALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYG 225 + +DA ++V +P+ GAV G R L+ ++Q EY++ + ++ + Sbjct: 857 TQVCKDAITHLVR-HFADPKTGAVAGKIIAGNRVNLLTRLQAFEYATSQNIERKAFDLIN 915 Query: 226 NVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPET 285 + V G I A+R AL + G++S++ +TED D++ ++ + + +EP+A + +PE Sbjct: 916 GILVVPGAIGAWRVEALRKAGFFSEETLTEDTDLTIQVNRAGYNVVFEPKAKAYTEVPEN 975 Query: 286 LKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFT-----CLVGF 340 + L KQRLRW+ G + K+ + + + + + ++ V F Sbjct: 976 VGQLLKQRLRWSLGMFQSAWKHKRAIIEGRSIGLVSISDMFVFGYVFPLLAPIADLFVIF 1035 Query: 341 IIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIW 400 ++Y + G ++ T T L L+ +++ + + SL Sbjct: 1036 MLYNLMAGGWTGDVGSTQQVQTTQYLWAFLALPALELLIAAIAITTDKDESNWSLLLFPL 1095 Query: 401 FPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 + + L + + S R + A W S R Sbjct: 1096 QRLAYRPLLYFSVIRSILRAVTGRL---ANWGSVKR 1128 >UniRef50_A7GCY8 Glycosyl transferase, group 2 family n=66 Tax=Firmicutes RepID=A7GCY8_CLOBL Length = 420 Score = 311 bits (798), Expect = 3e-83, Method: Composition-based stats. Identities = 96/427 (22%), Positives = 198/427 (46%), Gaps = 20/427 (4%) Query: 20 CVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISI 79 + + + + F++I+ + G +++ G + P +SI Sbjct: 3 SLNWVDYLFIFSLVSIWMLLFVNIILSLAGYRYYLKTLNSELKGLK-----NEKYPKVSI 57 Query: 80 IIPCFNEEKNVEETIHAALAQRYEN--IEVIAVNDGSTDKTRAILDRMAAQIP--HLRVI 135 ++P NEEK + T+ + L Y +E+I +ND S+D T+ IL ++ + + ++I Sbjct: 58 LVPAHNEEKVIGRTVKSILLLNYPKDKMELIVINDNSSDNTKKILKQIQKEYRSYNFKII 117 Query: 136 HLAQ---NQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGN 192 + +GK+ AL G + +++ D D D++A Y++E ++ + +GAV G Sbjct: 118 NTDNITGGRGKSNALNIGYKHSSGDFIAVYDADNTPDKNALKYLMETIIEDEHLGAVIGK 177 Query: 193 PRIRTRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDD 251 R R + ++ + E S + + + N+ T+ G R++ + ++ W Sbjct: 178 FRTRNKDRNMLTRFINIETLSFQWMCQAGRWNLLNLCTIPGTNFVVRKNIIQKLNGWDPK 237 Query: 252 MITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRL 311 I ED +IS+++ + I + P ++ W PE LK +KQR RWA+G V LK + Sbjct: 238 AIAEDTEISFRIYELGYKIKFVPYSVTWEQEPENLKVWFKQRTRWAKGNIYVLLKYFKNM 297 Query: 312 WRKENF-RMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLC 370 ++ + ++ +F+ + + ++ + ++ I++ V G+ LNI L I + IL Sbjct: 298 FKGTSKDIIFDIFYFFSVYFLFLSSVIISDILFIV---GIFLNINLHVIGNFNVLWILAY 354 Query: 371 TLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPV-IFWMLSLATTLVSFTRVMLMPKKQRA 429 L +L+ ++L +E + L I++F WM+ ++ + R L K+ Sbjct: 355 VLFVLEVSLTLTLEKGESNKENLILVPIMYFTYCQMWMIVALRGIIQYIRDKLF--KKEI 412 Query: 430 RWVSPDR 436 +W +R Sbjct: 413 KWYKTER 419 >UniRef50_B7QVL5 Glycosyl transferase, family 2 n=1 Tax=Ruegeria sp. R11 RepID=B7QVL5_9RHOB Length = 1140 Score = 311 bits (797), Expect = 4e-83, Method: Composition-based stats. Identities = 87/423 (20%), Positives = 176/423 (41%), Gaps = 11/423 (2%) Query: 18 PLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSI 77 L + SG + ++ + ++ + + +R + Sbjct: 706 SLAFSGVGSGATWVSYIVPLAVLLGVLRALTLFVLAIRSKRLSRSDLRHRTGDFTA--PV 763 Query: 78 SIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHL 137 +++IP +NEEK++ +TI++ L Y N+ V+ V+DGSTD T ++ + P ++++ Sbjct: 764 TVVIPAYNEEKSILKTIYSVLDSDYPNLSVLVVDDGSTDATYDLVSKTYLNNPKVQILR- 822 Query: 138 AQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRT 197 N GK A A ++Y+V ID D ++ DA +++P+ NP+VGAV G + Sbjct: 823 QPNGGKWKAANLAFAHVTTDYVVAIDADTIVAPDAIRRLMQPL-RNPKVGAVAGKIMVGN 881 Query: 198 RSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDI 257 + L+ K++ EY+ + +R + V G A+R +A+ E GY+S + ED Sbjct: 882 SNNLLTKLEKLEYTVAQNIDRRAYETINAIMVVPGAFGAWRTAAVRECGYYSSQTLAEDT 941 Query: 258 DISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENF 317 D++ L + + +A + P ++ L KQR+RW+ G + K+ + + Sbjct: 942 DLTISLLEAGYVVRAAEKAYAYTEAPASVGTLMKQRMRWSIGILQSAWKHRATIRKGHAI 1001 Query: 318 RMW----PLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLC 373 + + F + + L+ ++ A + A + + L Sbjct: 1002 GLVGLTDLVLFGVIMPLLGPIIDLLLVLMLARFITSFDGTSFDAFTARDYIVLSIFLALP 1061 Query: 374 LLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVS 433 LL+ I++ +F ++ +I+ L + S R++ W Sbjct: 1062 LLEMIMADYAVRSEPSTPRRMVFLLLLNRLIYRQLLIINVYRSLWRILTGRLTG---WHK 1118 Query: 434 PDR 436 R Sbjct: 1119 LRR 1121 >UniRef50_C1F4G9 Polysaccharide deacetylase domain protein/glycosyl transferase, group 2 family protein n=1 Tax=Acidobacterium capsulatum ATCC 51196 RepID=C1F4G9_ACIC5 Length = 1170 Score = 310 bits (795), Expect = 6e-83, Method: Composition-based stats. Identities = 97/416 (23%), Positives = 169/416 (40%), Gaps = 17/416 (4%) Query: 30 MMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKN 89 ++ F MS ++ G++ + R R P P+++I+IP +NEEK Sbjct: 748 LIFVFFVGDVLMSGRLLIIGLFALIERFRTRR------IPPGVYEPAVAILIPAYNEEKV 801 Query: 90 VEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQ---IPHLRVIHLAQNQGKAIA 146 + TI +AL Y ++ V+ ++DGSTD+T + AQ L V+ N GKA A Sbjct: 802 IVRTIRSALNSDYPHLHVVVIDDGSTDRTLEVAREAYAQEIADGKLTVL-TKPNAGKAEA 860 Query: 147 LKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQ 206 L G + E V ID D ++ DA + +V +P+VGAV GN ++ + L + Q Sbjct: 861 LNFGLRQIREEVYVGIDADTVIAVDAVSKLVR-HFADPKVGAVAGNAKVGNKVNLWTRWQ 919 Query: 207 VGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLN 266 EY + +R ++ V V G I A+R +A+ G + + + ED D++ L Sbjct: 920 ALEYITGQNFERRALDLFNVVTVVPGAIGAWRTAAVLAGGCYPLNTVAEDADLTMNLVEQ 979 Query: 267 QWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEY 326 + + YE AL + P GL +QR RW+ G + K+ + L Sbjct: 980 GYKVIYEDHALAFTEAPINANGLMRQRFRWSFGTLQAVFKHRQAFRTNRAMGFFALPNIV 1039 Query: 327 CLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHT----AGILLCTLCLLQFIVSLM 382 + +++AV L +N A+ I ++ F SL+ Sbjct: 1040 VFQILLPLASPFIDLLFAVSLIQFLINKHYHPETASAASFDKLLIYFLAFIVIDFFTSLL 1099 Query: 383 IENRYEH--NLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 + + + + + ++V F + + W +R Sbjct: 1100 AFSLEPRHPANKGDGWLLFHIWLQRFSYRQIFSIVLFRTLKRAIDGRPFNWDKLER 1155 >UniRef50_C0WP52 Possible VGP family vectorial glycosyl polymerization protein n=5 Tax=Lactobacillus RepID=C0WP52_LACBU Length = 504 Score = 307 bits (788), Expect = 4e-82, Method: Composition-based stats. Identities = 126/403 (31%), Positives = 217/403 (53%), Gaps = 15/403 (3%) Query: 40 FMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALA 99 F+S+ + G F+ + + P + K+ +S+++P NE + E + + Sbjct: 113 FLSVTPSLIGSIFFAIQAKQVPL---VNLYKGKEAELVSVLVPAHNEADTLGEVVDSIAN 169 Query: 100 QRYENIEVIAVNDGSTDKTRAILDRMAAQIPH---LRVIHLAQNQGKAIALKTGAAAAKS 156 Y IE+I +NDGS D+T A++ R+ + ++V+ + N+GKA AL GA A+ Sbjct: 170 LVYPRIELILINDGSQDETLAVMHRLVQKYSKHFVVKVVDIQPNKGKANALNYGAQVAEG 229 Query: 157 EYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGL 216 ++L+C+D D +D++ ++ NP+VGAV G P +R R++++G++++ EY +I + Sbjct: 230 DFLLCLDADCYVDKNVLNPMMAQFYNNPKVGAVAGKPIVRNRTSILGRLELLEYVGVIDI 289 Query: 217 IKRTQRIY-GNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPR 275 IKR Q G + TVSGV+ A+R++ALA VG+W+ + ITEDID++W+L +QW + Y P+ Sbjct: 290 IKRGQAFVTGRITTVSGVVVAYRKAALANVGWWNSEAITEDIDVTWRLYHHQWQVAYCPQ 349 Query: 276 ALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFT 335 A+ WIL+P+ +K L QR RWA+GG EV +N L W L + ++ +WA + Sbjct: 350 AVAWILVPDHIKDLIHQRRRWARGGFEVLFRNRGMLVTGRLSEQW-LLLDMIISDLWAIS 408 Query: 336 CLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSL 395 C V L + ++ LL + L+QF+V + + L Sbjct: 409 C------SFVTLFYLFTVAFTGYLIMDGVVLFLLLIISLVQFLVGYTDSKKSDFINWRDL 462 Query: 396 FWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRGI 438 + + V +W ++L + L + L P+ W SPDRG+ Sbjct: 463 LLLPLYVVFYWFINLISCLTALGSFFLDPR-HVGSWHSPDRGL 504 >UniRef50_A9EU26 Group-specific protein n=3 Tax=Phaeobacter gallaeciensis RepID=A9EU26_9RHOB Length = 1136 Score = 307 bits (787), Expect = 6e-82, Method: Composition-based stats. Identities = 90/427 (21%), Positives = 181/427 (42%), Gaps = 10/427 (2%) Query: 20 CVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISI 79 +A+ G F PF + + I V F + + + +++ Sbjct: 702 ALAFTGFGNSATWVAFLVPFAVLLGVIRALVLFVMAIRSRRADRADLRHRTGDFSAPVTV 761 Query: 80 IIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQ 139 +IP +NEEK++ +TI++ L Y N+ V+ V+DGSTD T ++ + P+++++ Sbjct: 762 VIPAYNEEKSILKTIYSVLESDYPNLSVLVVDDGSTDDTHGLVTKTYKDNPNVQILR-QP 820 Query: 140 NQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRS 199 N GK A + ++Y+V ID D ++ DA +++P+ NP+VGAV G + + Sbjct: 821 NGGKWKAANLAFSHVTTDYVVAIDADTIVAPDAIRRLMQPL-RNPQVGAVAGKIMVGNSN 879 Query: 200 TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDI 259 L+ K++ EY+ + +R + V G A+R A+ + GY+S + ED D+ Sbjct: 880 NLLTKLEKLEYTVAQNIDRRAYETINAIMVVPGAFGAWRTEAVRKCGYYSSQTLAEDTDL 939 Query: 260 SWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRM 319 + L + + RA + P ++ L KQR+RW+ G + K+ + + + + Sbjct: 940 TISLLEQGYEVRAAERAYAYTEAPASVGTLMKQRMRWSIGILQSAWKHRSTIRKGHAVGL 999 Query: 320 W----PLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLL 375 + F + + L+ ++ ++G + + L LL Sbjct: 1000 VGLTDLVIFGVIMPLLGPIIDLLLVLMLVRFVSGFDGTTFDAFTVRDYAVLSIFLALPLL 1059 Query: 376 QFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPD 435 + I++ S + ++ +I+ L + S R++ W Sbjct: 1060 EMIMADYAVRSEPTMPRSMVLLLLLNRLIYRQLLIINVYRSLWRILTGRLTG---WHKLR 1116 Query: 436 R-GILRG 441 R G ++ Sbjct: 1117 RFGSVQA 1123 >UniRef50_A9EEN3 Glycosyl transferase, family 2 n=1 Tax=Oceanibulbus indolifex HEL-45 RepID=A9EEN3_9RHOB Length = 1088 Score = 306 bits (785), Expect = 9e-82, Method: Composition-based stats. Identities = 102/423 (24%), Positives = 182/423 (43%), Gaps = 17/423 (4%) Query: 18 PLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSI 77 + + +++ F+ F + I + +++ R RH P P K PS+ Sbjct: 668 SMSFLAVGNSWSLLQIAFWTVFAIGICRSISLLFWAARRRRHAP-------PLSKHEPSV 720 Query: 78 SIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHL 137 +I+IP +NE + +E+ I AL Y + ++I V+DGSTD T A P + V+ Sbjct: 721 TIVIPAYNEARVIEKCIRKALYSEYGDFDIIVVDDGSTDDTYEKAISFAY-HPLVTVLR- 778 Query: 138 AQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRT 197 N+GKA AL A+SE L+CID D+ + DA + ++ +P+VGAV G + Sbjct: 779 QPNRGKAAALNAALDEAQSEILICIDADSQIAPDAVS-LLAAHFKDPKVGAVAGRVVVGN 837 Query: 198 RSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDI 257 R L+ ++Q EY + + +R + + V G I A+R +AL E G +S + +TED Sbjct: 838 RDNLLTRLQALEYITAQAVERRAKEYLNAITVVPGAIGAWRTTALMEAGIFSTETLTEDA 897 Query: 258 DISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENF 317 D++ + + + + YE RA+ P +L L QRLRW+ G + K++ + N Sbjct: 898 DMTMAMIRSDYQVIYEDRAVATTETPRSLSALMTQRLRWSLGMMQAGWKHLGATVERRNL 957 Query: 318 RMWPL----FFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLC 373 + L F Y + I L ++ + + T+ I L Sbjct: 958 GLVALPDLVVFGYLMPLIAPLADLFLVLLVIEFFTNIGATDQDFASVITNPLIIAYLALP 1017 Query: 374 LLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVS 433 L+ +++ L + + + + + + + R + W Sbjct: 1018 ALEIASAVIAFRLDPTEDRRLLLLLPVQRIFYRQVLYVSVIRALWRAVTGSLT---NWGR 1074 Query: 434 PDR 436 R Sbjct: 1075 MTR 1077 >UniRef50_A6Q1D6 Glucosaminyltransferase n=2 Tax=Epsilonproteobacteria RepID=A6Q1D6_NITSB Length = 438 Score = 306 bits (784), Expect = 1e-81, Method: Composition-based stats. Identities = 93/450 (20%), Positives = 187/450 (41%), Gaps = 29/450 (6%) Query: 2 INRIVSFFILCLVLCIPLCVAYFHSGELMMRF----------VFFWPFFMSIMWIVGGVY 51 + + V +L L L + L + + + + F + + I + ++ Sbjct: 1 MVKTVFIILLSLFLIVTLGIILYSFYQYYILFNSMFIKIGYGIILGFTSLVIFRYMLLLF 60 Query: 52 FWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVN 111 F ++R E + P +S+I+P +NE K + +I + L Q Y N+E+I V+ Sbjct: 61 FSIFRTIQRSAEETYQIDPKQRYPKVSVIVPAYNEAKTIATSISSLLTQNYPNLEIIVVD 120 Query: 112 DGSTDKTRAILDRMAAQ--IPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLD 169 DGS+D+T + +RV +N+GK+ A+ G + E + +D D+ + Sbjct: 121 DGSSDETYFKAKQFEHNEFCKEIRVFR-KKNEGKSKAINYGIERSTGELIFVMDADSKIS 179 Query: 170 RDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFT 229 ++A ++ +P + AV G+ + + L+ K+Q EY + +++ Q V Sbjct: 180 QNAIL-LLARHFEDPDIAAVAGSVYVSNQVNLITKLQALEYIEGLNMVRNGQAFLKAVNI 238 Query: 230 VSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGL 289 + G + FR++AL VG + D ED D++ KL + I +EP A+ + PE L L Sbjct: 239 IPGPVGMFRKNALYNVGLYDHDTFAEDCDVTLKLIAKGYKIDFEPEAVAYTEAPENLLDL 298 Query: 290 WKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCL--TTIWAFTCLVGFIIYAVQL 347 KQR RW +G + K+ LW + + Y L + W F + V + Sbjct: 299 IKQRYRWTRGILQAIRKHRNLLWNFKENTTASMVMWYMLFESLFWPFADIW------VNI 352 Query: 348 AGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWM 407 + + + L + + +++ + ++++ I+F + Sbjct: 353 FVLYWAVTSGMSIFIFYWWSIFTILDVAGALYCILLTGEKLGLVFYAVYYRIFFIETINL 412 Query: 408 LSLATTLVSFTRVMLMPKKQRARWVSPDRG 437 + ++ F + + W +R Sbjct: 413 AKIFASIEEFMGIQM-------SWGKLERK 435 >UniRef50_B4D4P9 Polysaccharide deacetylase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D4P9_9BACT Length = 1192 Score = 304 bits (779), Expect = 5e-81, Method: Composition-based stats. Identities = 105/450 (23%), Positives = 181/450 (40%), Gaps = 54/450 (12%) Query: 24 FHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPC 83 F + ++ F + + F + + ++ + RH + PS+S++I Sbjct: 752 FRTWHGIVEFFWAFMIFSTGLVVLRTLLVAWLAHRHHR---GEIVSKSTFQPSVSVVIAA 808 Query: 84 FNEEKNVEETIHAALAQRYE-NIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQG 142 FNE K +++T+ + L Y ++E+I V+DGS D T A ++ +AA P +R+I N+G Sbjct: 809 FNEGKVIQKTLRSVLDTDYTGDLELIIVDDGSRDDTAAKVEALAAAEPRIRLI-QQPNRG 867 Query: 143 KAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRT----- 197 KA AL+ G A++E + +D D L + + +V+P+ +P VGAV+GN R+ Sbjct: 868 KASALRHGVTVARNEIVAFLDADTLFEHHTLSALVQPLE-HPLVGAVSGNARVGNLLTGH 926 Query: 198 -------------------------------RSTLVGKIQVGEYSSIIGLIKRTQRIYGN 226 + Q EY L +R + Sbjct: 927 DSSGASSASANSPGFLSRWWAGSTWATLSLLARRFIASCQSLEYICGFNLDRRAYTEWNC 986 Query: 227 VFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETL 286 + V G ++A RRSA+ G +SDD + ED D++ + + I Y P A+ W PET Sbjct: 987 ITVVPGAVSALRRSAIVAGGGFSDDTLAEDTDLTLVMHRLGYRIEYAPEAVAWTEAPETT 1046 Query: 287 KGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQ 346 GL KQR RWA G + K+ ++ + + L IW + + V Sbjct: 1047 IGLVKQRSRWAFGTMQCLWKHRDMVFNPRYGALGW----FSLPNIWFMQIGLVAVTPLVD 1102 Query: 347 LAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFW 406 + + + I+L L + M R +L I+ + Sbjct: 1103 AILLYSLLFGNARGIWYWFVIVLGLDLFLAALACWMDGERLRRSL-----LILPMRFFYR 1157 Query: 407 MLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 + S +R + K W +R Sbjct: 1158 PVLAWVIWRSISRAL---KGAFVAWGKLER 1184 >UniRef50_C1V8R5 Glycosyl transferase n=1 Tax=Halogeometricum borinquense DSM 11551 RepID=C1V8R5_9EURY Length = 567 Score = 303 bits (777), Expect = 8e-81, Method: Composition-based stats. Identities = 98/443 (22%), Positives = 177/443 (39%), Gaps = 24/443 (5%) Query: 2 INRIVSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIM------WIVGGVYFWVY 55 + R+ + + + + A +G + + P+ M++ + V F + Sbjct: 123 LRRLANTRVRLSIWLAIVLSALVLTGGFVFGWSVPTPYEMAVFAGFVVIAFIVVVVFPLT 182 Query: 56 RERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYE--NIEVIAVNDG 113 + E+ D P +S+++P +NE V + + + LA Y +EVI ++DG Sbjct: 183 IVQMRGREESDHTLDDDDAPLVSVLVPAYNESNYVGDCLDSILASDYPTDRLEVIVIDDG 242 Query: 114 STDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAA 173 STD T A + + V H + N GK AL G + ++ + +V +D D++L A Sbjct: 243 STDGTYAEASAY--RNDRVSVFHRS-NGGKHAALNLGLSCSRGDVVVAVDADSILAPSAL 299 Query: 174 AYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGV 233 VE + +PR+GAV G + +VG +Q EY I ++R G V + G Sbjct: 300 RTAVEQLQSDPRLGAVAGTVVVNNADGIVGSVQALEYVLGINTLRRAFSYLGTVMVIPGC 359 Query: 234 IAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQR 293 + FRR AL+EVG + D +TED D++ +L W + AL + P +L L QR Sbjct: 360 LGVFRREALSEVGGYDPDTVTEDFDLTVRLLKAGWRVE-LSEALVYTEAPFSLTDLLNQR 418 Query: 294 LRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLN 353 LRW +G + LK+ + F +I F ++ + + Sbjct: 419 LRWTRGNIQTLLKHRDVFSEPAYGFLHRFAFPLSALSI-LFVPFASIVVTTMIFVAILNG 477 Query: 354 IELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATT 413 L +L + + +S + + P+ Sbjct: 478 AFLGVALVAAYFLFVLLFVAAMALDLSD-----------GDWRLLAYAPLHLVGYRQFLD 526 Query: 414 LVSFTRVMLMPKKQRARWVSPDR 436 ++ +++ + RW S R Sbjct: 527 VIVIRTALMLLRGTNKRWESVTR 549 >UniRef50_A1R2V1 Glycosyl transferase, group 2 family domain protein n=4 Tax=Actinomycetales RepID=A1R2V1_ARTAT Length = 431 Score = 301 bits (772), Expect = 3e-80, Method: Composition-based stats. Identities = 102/406 (25%), Positives = 184/406 (45%), Gaps = 14/406 (3%) Query: 36 FWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIH 95 F M+ + V ++ + R L D P +S+I+P +NE + + Sbjct: 18 ILGFGMAKIVYVPMALYFDWLYRSICARHRYSV--LSDRPLVSVIVPGYNEAVVITGCVE 75 Query: 96 AALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAK 155 + LA RY +EVI V+DGSTD+T +I++ +A Q +R + N GK AL G AAA Sbjct: 76 SILASRYLRLEVILVDDGSTDETASIMEGLAQQYDRVRFLS-QANAGKGAALNCGIAAAL 134 Query: 156 SEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIG 215 + L+ +D D + D +++E +P+VGAV G+ R L + G Sbjct: 135 GDILMFVDADGVFAPDTLIHMLEG-FDDPKVGAVCGDDRPVNLDRLQTMMLAILSHVGTG 193 Query: 216 LIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPR 275 L++R + + VSG I AFR + E+G + +D + ED++++W++ + + ++P+ Sbjct: 194 LVRRALSLMNCLPIVSGNIGAFRSDLVRELGGFHEDTLGEDLELTWRVYKAGYRVRFQPK 253 Query: 276 ALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFT 335 AL + P T+ GLW+QR+RW++G + + L + F + T+ Sbjct: 254 ALVYAESPSTMGGLWRQRVRWSRGLLQTLRLHSGMLGSRRYGMFGA-FLVFNAITMVLIP 312 Query: 336 CLVGFIIYAVQLAGVPLNIELTHIAATHTAGI-LLCTLCLLQFIVSLMIENRYEHNLTSS 394 L ++ + V + + + +L L+ F V L NR +L Sbjct: 313 ILQLVVLALLPFLYVAGMGPVPAEVLAILGWLGVFVSLALIVFSVGL---NRSWRDL-RF 368 Query: 395 LFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRGILR 440 L+ + +P + LA + ++ + ARW R ++ Sbjct: 369 LWTLPLWPFYSVFVGLALA----SAIVKEIRGSPARWNKLQRTGIK 410 >UniRef50_C5SLQ2 Polysaccharide deacetylase n=1 Tax=Asticcacaulis excentricus CB 48 RepID=C5SLQ2_9CAUL Length = 1126 Score = 300 bits (769), Expect = 6e-80, Method: Composition-based stats. Identities = 97/438 (22%), Positives = 173/438 (39%), Gaps = 16/438 (3%) Query: 11 LCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQ 70 L V G + ++++F + ++ + ++ + A Sbjct: 683 LAAVRFDVGIFMLLAWGAIGLQWLFIAAIVLGVLRALILSVMAIWADYSGHMDPPAETDD 742 Query: 71 LK-----DNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRM 125 + +S+IIP FNE + +E ++ L +IEV+ V+DGS D+T AI+ R Sbjct: 743 GRIADFVHARQVSVIIPAFNEARVIEASVRRVLTSVDVDIEVVVVDDGSKDETSAIVARA 802 Query: 126 AAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPR 185 A P + +I N GKA A+ G A A+ E ++ +D D + + A +V + P Sbjct: 803 FANEPRVTLI-TQANAGKATAVNNGIARARGEIIIALDADTQFEPETIARLVRWFIR-PE 860 Query: 186 VGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEV 245 +GAV GN ++ R V + Q EY + L + +G + V G + A+RRSAL EV Sbjct: 861 IGAVAGNAKVGNRFNFVTRWQGVEYVTAQNLERTALATFGAMMVVPGAVGAWRRSALEEV 920 Query: 246 GYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFL 305 G + D + ED D++ +Q W + Y+ A+ W PE+ L KQR RWA G + Sbjct: 921 GGYPHDTLAEDQDLTIAIQRRGWAVAYDQDAVAWTEAPESFASLIKQRYRWAFGTLQCLW 980 Query: 306 KNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFI--IYAVQLAGVPLNIELTHI---- 359 K+ L + + + F+ + I + +A I + Sbjct: 981 KHRRILKEGKPKGLAHIGLPQAWIFQIGFSVISPLIDLALLINIAATLWTISQHGLNQTD 1040 Query: 360 AATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTR 419 I + + + +FW++ ++ + L + Sbjct: 1041 GTLERMLIYWTVFVAIDALCGWIAYMLEPREKRYPVFWLLSQRFVYRQIMYYVVLKALAS 1100 Query: 420 VMLMPKKQRARWVSPDRG 437 + W +R Sbjct: 1101 A---GRGLAVGWGKLERS 1115 >UniRef50_C7MUK1 Glycosyl transferase n=11 Tax=Actinomycetales RepID=C7MUK1_SACVD Length = 1099 Score = 297 bits (760), Expect = 7e-79, Method: Composition-based stats. Identities = 96/364 (26%), Positives = 171/364 (46%), Gaps = 16/364 (4%) Query: 76 SISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVI 135 +S+I+P +NE N+E T+ +A+A + +E+I V+DGSTD T +++ + + +RV+ Sbjct: 342 PVSVIVPAYNEAANIEATVRSAVASTHP-VEIIVVDDGSTDGTADLVEGLG--LSGVRVL 398 Query: 136 HLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRI 195 N+GKA AL TG AAA + +V +DGD + + + +V+P +P VGAV+GN +I Sbjct: 399 RRP-NRGKAAALNTGIAAASYDLIVMVDGDTVFEPNTVHELVQPF-ADPEVGAVSGNVKI 456 Query: 196 RTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITE 255 R TL+ ++Q EY + +R + ++ TV G AFRRSAL +VG S + E Sbjct: 457 ANRETLLARLQHIEYVVGFNVDRRVHEVMRSMPTVPGAGGAFRRSALLQVGGLSAQTLAE 516 Query: 256 DIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKE 315 D D++ + W ++ +A+ W P T++ LW+QR RW G + K+ + ++ Sbjct: 517 DTDLTISIGRAGWRTVFQEKAVTWTEAPTTVRQLWRQRFRWTFGTLQALWKHRKAIVQRG 576 Query: 316 NFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLL 375 F + + F L+ I + + + + T + + + Sbjct: 577 AAGRVGRFG---MLHVVCFQVLLPMIAPVIDV-FLVYGVLFLDPWTTVLLWLTMLGIQAA 632 Query: 376 QFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPD 435 + ++ + L+ + +I+ L + S R RW Sbjct: 633 AAAYAFHLDGERKT----VLWLLPAQQLIYRQLMYVVLMQSLAAA---ASGVRVRWQHMR 685 Query: 436 RGIL 439 R L Sbjct: 686 RSGL 689 >UniRef50_B8II97 Polysaccharide deacetylase n=1 Tax=Methylobacterium nodulans ORS 2060 RepID=B8II97_METNO Length = 1120 Score = 297 bits (760), Expect = 7e-79, Method: Composition-based stats. Identities = 99/422 (23%), Positives = 168/422 (39%), Gaps = 17/422 (4%) Query: 25 HSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCF 84 + + + +++ I + H G P+ I++++P + Sbjct: 695 QTVSWLCDAIPAVAITTTVLAIFRLTLIIIGATAHGLRGSRRDPPEGWRPRGIAVLVPAY 754 Query: 85 NEEKNVEETIHAALAQR-YENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGK 143 NEE + +TI LA E IE+I ++DGSTD T A++ ++ I+ N GK Sbjct: 755 NEEIVILKTIQTLLASTIAEQIEIIVIDDGSTDNTAAVVRTAFPNTAAVQ-IYTKANGGK 813 Query: 144 AIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVG 203 A AL G +E +V IDGD +L DA ++ +P++GAV G + R TL+ Sbjct: 814 AAALNYGLQKTSTEIIVAIDGDTVLLPDAIEHLAR-HFADPKIGAVAGTVSVGNRKTLIA 872 Query: 204 KIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKL 263 + Q EY+ L +R ++ + V G I A+RR AL VG +S D + ED D++ L Sbjct: 873 RFQALEYTMSQNLDRRAFQLINAIGVVPGAIGAWRREALMAVGGYSSDTLAEDADLTISL 932 Query: 264 QLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLF 323 +L W + EPRA PE L+ KQR RW G +V K+ R+ L Sbjct: 933 ELAGWKVVCEPRARALTEAPERLRAFLKQRFRWMFGTLQVAYKHAPASLRRPRGISLILI 992 Query: 324 --FEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIEL-------THIAATHTAGILLCTLCL 374 L+ I+ ++ V + + ++ Sbjct: 993 PNVLLFQFLFTLLAPLMDLILIFSVVSSVVDITLIGSRSEGYGTLELLMAYWLVFQVFDF 1052 Query: 375 LQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSP 434 L +L++ + L ++ + L T + + + + W Sbjct: 1053 LAGCAALLLHGKS--PEWRLLPLLVLQRFCYRQLLYITAIRTLLTAL---RGTFVGWGKL 1107 Query: 435 DR 436 R Sbjct: 1108 VR 1109 >UniRef50_P96587 Uncharacterized glycosyltransferase ydaM n=6 Tax=Bacillus RepID=YDAM_BACSU Length = 420 Score = 295 bits (755), Expect = 3e-78, Method: Composition-based stats. Identities = 94/418 (22%), Positives = 190/418 (45%), Gaps = 13/418 (3%) Query: 25 HSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCF 84 ++ + + + + + GG ++ ER+ P +K+ P +S++IP Sbjct: 3 NTLFFISLSLIWVMLLYHMFLMQGGFRHYMTFERNIPKWRE----NMKELPKVSVLIPAH 58 Query: 85 NEEKNVEETIHAALAQRYEN--IEVIAVNDGSTDKTRAILDRMAAQIPHLR-VIHLAQN- 140 NEE + +T+ A + Y +E+I VND S+D+T I++ + + ++ VI N Sbjct: 59 NEEVVIRQTLKAMVNLYYPKDRLEIIVVNDNSSDRTGDIVNEFSEKYDFIKMVITKPPNA 118 Query: 141 -QGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTR- 198 +GK+ AL +G A + + + D D ++ A Y+V ++ + + GAV G R+ Sbjct: 119 GKGKSSALNSGFAESNGDVICVYDADNTPEKMAVYYLVLGLMNDEKAGAVVGKFRVINAA 178 Query: 199 STLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDID 258 TL+ + E + + + + + T+ G A RRS + ++G W D + ED + Sbjct: 179 KTLLTRFINIETICFQWMAQGGRWKWFKIATIPGTNFAIRRSIIEKLGGWDDKALAEDTE 238 Query: 259 ISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFR 318 ++ ++ + I + P A+ W PET K W+QR RWA+G V LK + + ++ + R Sbjct: 239 LTIRVYNLGYHIRFFPAAITWEQEPETWKVWWRQRTRWARGNQYVVLKFLAQFFKLKRKR 298 Query: 319 MWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFI 378 + F + T F ++ + A+ + + ++ L+ IL L + + + Sbjct: 299 IIFDLFYFFFTYFLFFFGVI--MSNAIFVVNLFYDLHLSVGFLAMILWILAFFLFMTEVM 356 Query: 379 VSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 ++L IE + + ++++F + F + KQ +W +R Sbjct: 357 ITLSIEKTEMNKQNFFIVFLMYFT-YSQAWIVLVIYSLFVEIKHRLFKQEVKWYKTER 413 >UniRef50_B9LWG0 Glycosyl transferase family 2 n=1 Tax=Halorubrum lacusprofundi ATCC 49239 RepID=B9LWG0_HALLT Length = 509 Score = 294 bits (754), Expect = 3e-78, Method: Composition-based stats. Identities = 100/434 (23%), Positives = 167/434 (38%), Gaps = 27/434 (6%) Query: 5 IVSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGE 64 + F+ + L + F F + VY + + + Sbjct: 77 LCVAFVCYGFGVVTLGWVQPSLTSGVYLFAIALIFLYYWFIALAAVYH----NQRYHSQD 132 Query: 65 NAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYE--NIEVIAVNDGSTDKTRAIL 122 P P SISII+P +NEE ++ TI A L Y E+I ++DGSTD T A Sbjct: 133 APPEPSA----SISIIVPAYNEEGYIQRTITALLDADYPDGKREIIVIDDGSTDNTCAEA 188 Query: 123 DRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLY 182 ++ + N GK AL G A +E ++ +D D++ ++DA +V P+ Sbjct: 189 RAFESETVSVV---TKDNGGKYSALNYGLLFASNEIILTVDADSVPEKDALKQMVAPL-S 244 Query: 183 NPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSAL 242 + VGAV I R +L+ Q EY+ + + +R ++G V V G + A+RR L Sbjct: 245 DQSVGAVASTVTIWNRGSLLTGCQQLEYTIGVNVYRRMLDLFGIVMVVPGCLGAYRRDVL 304 Query: 243 AEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAE 302 E+ + +TED DI+ K+ + + A + P++L+ L+ QRLRW +G Sbjct: 305 DEIQGFDPQTLTEDFDITVKVLRAGYEVR-SSEARVYTEAPDSLRDLYNQRLRWYRGNYM 363 Query: 303 VFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAAT 362 K+ L ++ F L + I+ + + + Sbjct: 364 TIFKHRGVLSEPTTGFLYRFAFPLRLVELLFLPFASWVILGLIVKILLSGFVIQVVSLFI 423 Query: 363 HTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVML 422 I+ L +I E L++ F V + A L S V+L Sbjct: 424 FFLSIIFLIAALGVYI---------EGEDWRLLWYTPLFLVGYKHFHDALLLKSLADVLL 474 Query: 423 MPKKQRARWVSPDR 436 + W R Sbjct: 475 ---GRNLSWTRATR 485 >UniRef50_C1D0J4 Putative glycosyltransferase n=1 Tax=Deinococcus deserti VCD115 RepID=C1D0J4_DEIDV Length = 430 Score = 293 bits (750), Expect = 1e-77, Method: Composition-based stats. Identities = 95/401 (23%), Positives = 169/401 (42%), Gaps = 15/401 (3%) Query: 37 WPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPS-ISIIIPCFNEEKNVEETIH 95 + I+ ++ V +L P ++++I +NEE + T+ Sbjct: 3 VALTVVIVLLLTKAVLTVTIAVAHRSRIRRQHRELPATPLGVTVMIAAYNEEIGIGGTLT 62 Query: 96 AALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAK 155 + L++ E ++V+ V+DGS+D T A ++R A P +++I +N GKA AL + Sbjct: 63 SVLSEPLEELQVVVVDDGSSDGTAAAVERFAQLDPRVQLIR-QENTGKAGALNHALKYVR 121 Query: 156 SEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIG 215 V +D D+ + A + ++PRVGAV G+ R+ + L+ +Q EY+ Sbjct: 122 YPVAVSVDADSAISPGTLAALAR-HFHDPRVGAVAGDVRVAGEAKLLTHMQDMEYTIGQH 180 Query: 216 LIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPR 275 + KR Q G V V G A+R L E+G +S D +TED+D++ + + + +EP Sbjct: 181 MEKRAQCFLGTVSVVPGAAGAYRTELLRELGGYSHDTLTEDMDLTIAVAAAGYRVRFEPL 240 Query: 276 ALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFT 335 AL + P+ LK LW+QRLRW G +V K ++ R+ L Y L Sbjct: 241 ALSFTEPPQMLKHLWRQRLRWMYGTFQVMGKYRHLIFNPRAGRLGLLTLPYVLLYGLVLG 300 Query: 336 CLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSL 395 L I V + L+Q + +++ +S Sbjct: 301 GLSPLIEILAITILVQYTVVENLWPI------------LMQILAEVVVIGAALALGGASW 348 Query: 396 FWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 +++ P + L ++V +++ W R Sbjct: 349 RMLLYTPTQRFFLRPFVSIVVLMTCWTFLRRRNVHWNKLPR 389 >UniRef50_C6A3S6 Putative glycosyl transferase n=1 Tax=Thermococcus sibiricus MM 739 RepID=C6A3S6_THESM Length = 388 Score = 293 bits (750), Expect = 1e-77, Method: Composition-based stats. Identities = 86/401 (21%), Positives = 158/401 (39%), Gaps = 18/401 (4%) Query: 44 MWIVGGVYFWVYRERHWPWGENAPAPQ-LKDNPSISIIIPCFNEEKNVEETIHAALAQRY 102 M VY+ V + P+ ++ PS++I+IP NE + +T+ A Y Sbjct: 1 MIYPIFVYYIVLTIAGLRYNSRFKRPEIPEELPSVTILIPARNEGLVIRDTLRAMANLDY 60 Query: 103 E--NIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQ-GKAIALKTGAAAAKSEYL 159 +EV+ ++DGSTD T I + ++ P ++VI + GK+ L G AK E + Sbjct: 61 PKDKLEVLLLDDGSTDDTAKIAEEVSKDYPFIKVIRVEGGGSGKSYVLNYGLKLAKGEVI 120 Query: 160 VCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRT-RSTLVGKIQVGEYSSIIGLIK 218 D D + A +V + + AVTG + ++ + EY + Sbjct: 121 AVYDADNRPEPGALKDLVAML--SDETPAVTGKVKTMNWNRNILTRFICMEYLYFQLAGQ 178 Query: 219 RTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALC 278 + + + G R+ L E+G W ++ + ED+++S+++ L I Y P A+ Sbjct: 179 AGKSKFYKTAILPGTNFVIRKELLEELGGWDEEALAEDLELSFRIILTGKKIAYTPLAVT 238 Query: 279 WILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLV 338 W PE+L+ ++QR RWA G + + R + W L F+ LT + + + Sbjct: 239 WEQEPESLRVWFRQRTRWAAGNVHTVKEYVK---RFKEIPSWGLRFDLFLTLMVYYLLAM 295 Query: 339 GFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWI 398 I+ V + L G + LL+ L Sbjct: 296 AVIVADVAFVALLLTSGSVTWFTWSVLGFVYLAF-LLEIFAGLYDGKIKSPGCW------ 348 Query: 399 IWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRGIL 439 + +++ S L+S + ++ + W R + Sbjct: 349 LLALLMYHTYSQIWILISLA-GLWEARRAKKVWYKTPRTAV 388 >UniRef50_B9MNQ5 Glycosyl transferase family 2 n=1 Tax=Anaerocellum thermophilum DSM 6725 RepID=B9MNQ5_ANATD Length = 464 Score = 292 bits (749), Expect = 2e-77, Method: Composition-based stats. Identities = 101/466 (21%), Positives = 189/466 (40%), Gaps = 59/466 (12%) Query: 25 HSGELMMRFVFFWPFFM---SIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIII 81 H FV ++ + + I+ ++ V R+ G +S+++ Sbjct: 3 HIINWFSWFVSYYVLVLNTVYAILILISLFGIVSYWRNKIKGRIVEVVSSDFALPVSLLV 62 Query: 82 PCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDR----------------- 124 P +NEE+ + +++ + L Y EV+ +NDGS D T +L Sbjct: 63 PAYNEEETIAKSVKSFLQIEYPEYEVVVINDGSKDGTLDVLKNEFDLYIVDRKFRKILST 122 Query: 125 -------MAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIV 177 + + +L V+ +N GKA AL G Y+ +D D++L+RD+ A ++ Sbjct: 123 KEIKVIYYSKKYSNLIVVD-KENGGKADALNAGINVCTYPYVCSLDADSILERDSIAKVM 181 Query: 178 EPMLYNP-RVGAVTGNPRIRT---------------RSTLVGKIQVGEYSSIIGLIKRTQ 221 +P NP V A TG RI S+ + + Q+ EY ++ Sbjct: 182 QPFFDNPYEVVATTGIVRIVNGTELDSFGNIKKLKLPSSSLARFQIIEYLRAFLGARKGL 241 Query: 222 RIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQW------TIFYEPR 275 + G++ SG AAF ++A+ +VG +SD + ED++I KL+ N + + + P Sbjct: 242 SMIGSLVIASGAFAAFNKNAIIKVGGFSDRTVGEDMEIVVKLRKNSYKEGALGRVEFVPD 301 Query: 276 ALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTT--IWA 333 + W PETLK L KQR RW +G +V + L+ + + Y L + Sbjct: 302 PIVWTQCPETLKDLSKQRRRWQRGLCQVIFMHKDILFNPKYGILGLFAMPYQLMFELLGP 361 Query: 334 FTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRY--EHNL 391 F ++G+I + +N+E+ + + +L ++ + +Y Sbjct: 362 FVEMLGYIFIPISYFAHIINLEVALFFFAVEIMYGIV-ISILAVLLGEFSDRKYEGWREF 420 Query: 392 TSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRG 437 + + I + +++ +V +L K W P+R Sbjct: 421 GILVLFAILENFGYRQMTMLFRIVGTFEAILRKKG----WAKPERK 462 >UniRef50_Q3B487 Glucosaminyltransferase n=2 Tax=Chlorobium/Pelodictyon group RepID=Q3B487_PELLD Length = 461 Score = 292 bits (748), Expect = 2e-77, Method: Composition-based stats. Identities = 103/419 (24%), Positives = 178/419 (42%), Gaps = 13/419 (3%) Query: 1 MINRIVSFFILC-LVLCIPLCVAYFHSGELMMRFVFFWPFF-----MSIMWIVGGVYFWV 54 +++ I+S + + +P V + F + M + + G F Sbjct: 37 ILSAIISLLVSMGIWSFMPGWVDSLNVLIWGNPTRSFPVWMHQVLDMFFLIYLAGSLFPF 96 Query: 55 YRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGS 114 P +S+IIP FNEE ++ + I +ALA Y E+I V+DGS Sbjct: 97 LFFIGLTMLPERRIAGNTSLPFVSVIIPSFNEEDSIAQCIESALALDYPAYEIIIVDDGS 156 Query: 115 TDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAA 174 D T I++R + VI L N+GK AL G AK E + D D+ LD Sbjct: 157 RDLTLPIIERY-----DVSVIRLRTNRGKVEALNRGIEKAKGEIIFFTDSDSSLDPKVLR 211 Query: 175 YIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYG-NVFTVSGV 233 Y+V + VGAV G + ++ + ++Q EY +IK+ Q G V G Sbjct: 212 YLVAGF-SDDTVGAVAGMVLPKRHNSYLLRMQTIEYIYGQSIIKKAQLKSGGAVSICPGP 270 Query: 234 IAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQR 293 AFRRS L E G + + + ED DI+ ++ + YEPRA+ + KGL KQR Sbjct: 271 ATAFRRSVLLETGGYRNRTLAEDFDITLEMMELGYRAVYEPRAVSYTSAMTGWKGLRKQR 330 Query: 294 LRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLN 353 +RW++G +V+ ++ ++ + + + L + L + L + + Sbjct: 331 IRWSRGHLQVYREHRGMMFSDIGGPISLFWMPWSLFIGYGSAFLEMAFLGVYPLLVIISS 390 Query: 354 IELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLAT 412 + L + + ++L + +Q I+ LM R +L + F + ++ + + Sbjct: 391 MPLLFLKLGLSYMVILECISAMQGIIPLMQARRLTPSLALASFTTQPYRIVLSYMRIVA 449 >UniRef50_B1L4E1 Glycosyl transferase family 2 n=1 Tax=Candidatus Korarchaeum cryptofilum OPF8 RepID=B1L4E1_KORCO Length = 489 Score = 291 bits (745), Expect = 4e-77, Method: Composition-based stats. Identities = 112/441 (25%), Positives = 181/441 (41%), Gaps = 15/441 (3%) Query: 1 MINRIVSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHW 60 ++ F +L +V VA S ++ F + + + Sbjct: 53 LLGGAALFSLLIIVPISLALVASLESLSTFETILYGIISFGFTLISWHYLLLVPLAAYYK 112 Query: 61 PWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRA 120 E L P +S+IIP NEEK + TI + L YE EV+ V+DGSTD+T Sbjct: 113 RMEEIEARKPLLYRPLVSVIIPARNEEKVIGSTIRSVLESDYEPKEVVVVDDGSTDRTFE 172 Query: 121 ILDRMAAQIPHLRVIHLA-QNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEP 179 I Q P ++V+ +GKA AL G A+ E +V +D D ++ RDA +V Sbjct: 173 IASIY--QGPKVKVLRRELGGRGKARALNFGLRFARGEVIVLMDADTIISRDAIKELVRK 230 Query: 180 MLYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRR 239 + +PRV AV GN +R + L+ K+Q EY + L ++ + G V +SG + AFRR Sbjct: 231 L-QDPRVSAVAGNVMVRNKVNLLTKLQAIEYIATFHLFRKGLSVLGAVPIISGALGAFRR 289 Query: 240 SALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQG 299 + L G + D +TED D++ K + + AL + PE LK L++QRLRW +G Sbjct: 290 NVLESSGLYDADTLTEDFDVTLKALKSGKIVQASSYALAFTEAPEKLKSLYRQRLRWYRG 349 Query: 300 GAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHI 359 EV +K+ ++ + II + + I Sbjct: 350 AYEVLIKHRDA-----------FSLSGLTMLDFSLILMNNLIIPLIDFLSIISIIIAILR 398 Query: 360 AATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTR 419 I + LQ +++L I E S +F ++ + F Sbjct: 399 GLIWPLIIQIILFSTLQILINLFILQLAEERDISLVFLPLFAIGYKQFHEILMLKCFFDV 458 Query: 420 VMLMPKKQRARWVSPDRGILR 440 ++ + + W +R L Sbjct: 459 IIARMRGKSFSWTFIERRGLE 479 >UniRef50_Q46B82 Glucosaminyltransferase n=2 Tax=Methanosarcina RepID=Q46B82_METBF Length = 403 Score = 290 bits (743), Expect = 7e-77, Method: Composition-based stats. Identities = 95/414 (22%), Positives = 174/414 (42%), Gaps = 16/414 (3%) Query: 26 SGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFN 85 L + F F + I+ I + R NP +SI++P +N Sbjct: 3 PLLLFFYIIGFTCFSVGILTIPYYPLSLAFEMRKSKQN-----IFNSKNPFVSIVVPAYN 57 Query: 86 EEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAI 145 E K + I + L Y EVI V+DGS+D T + H+ V+ +N GKA Sbjct: 58 EGKVIGHCIKSILESNYSEYEVILVDDGSSDNTLEEMQHY-ETNSHVIVV-TKKNGGKAS 115 Query: 146 ALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKI 205 AL G AK E + +D D + D + ++ + VGAV GN L ++ Sbjct: 116 ALNVGLKLAKGEVIFFVDADGIFAPDTISKMLSGFIS-EDVGAVCGNDAPINLDKLQTQL 174 Query: 206 QVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQL 265 G ++R + VSG I A+R S L + G + + I ED++++W++ Sbjct: 175 ANLLTHVGTGFVRRALSTIDCLPVVSGNIGAYRSSTLEKTGPFLEGFIGEDMELTWRVHK 234 Query: 266 NQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFE 325 + + ++P A+ + P T+ GLWKQR+RWA+G + + L+ + ++ + Sbjct: 235 AGYKVVFQPWAIVYAEAPSTITGLWKQRVRWARGLLKTAYIHRDMLFNPKYG-LFAFYLP 293 Query: 326 YCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIEN 385 LT++ L + + NI ++ G L + + S+ ++ Sbjct: 294 INLTSMIIIPLLQLI-SIILLPILLFRNINPIPLSWISIMGWLGMFSSIFALLFSIALDR 352 Query: 386 RYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRGIL 439 + L F++I V++ + L + ++ +++ A+W DR + Sbjct: 353 AW---LDLKYFYVIPLWVLYSFMMDVVMLWAI---IVELRRKEAKWNKLDRTGI 400 >UniRef50_Q03HA4 Glycosyltransferase n=2 Tax=Bacilli RepID=Q03HA4_PEDPA Length = 414 Score = 288 bits (738), Expect = 2e-76, Method: Composition-based stats. Identities = 96/424 (22%), Positives = 183/424 (43%), Gaps = 30/424 (7%) Query: 28 ELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEE 87 LM+ + W + + + Y W + + P +++++P NE Sbjct: 4 FLMVAIIAIWGTLIINLILTVAGYTW-----YLQEAPKPDKQLPEKIPFVTVMVPAHNEG 58 Query: 88 KNVEETIHAALAQRYE--NIEVIAVNDGSTDKTRAILDRMAAQIP--HLRVIHLAQ---N 140 + +T+ + L+ Y + E+I +ND S+D + +L + L+VI+ Sbjct: 59 IVIVKTVLSLLSFDYPHDHYEIIVINDNSSDNSAELLKEVQNDFGEERLKVINTDNVTGG 118 Query: 141 QGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRS- 199 +GK+ AL G AK + D D +R A +V +L N R+ AV G R R + Sbjct: 119 KGKSNALNIGLKQAKGSVIAIYDADNTPERGALRLLVAELLGNDRLAAVIGKFRTRNKEA 178 Query: 200 TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDI 259 T++ + E S + + ++ + T+ G RR L ++G W + ED +I Sbjct: 179 TILTRFINIETLSFQWMAQAGRQRLFKLCTIPGTNYVIRREILEKIGGWDVKALAEDTEI 238 Query: 260 SWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRM 319 S+++ + I ++PRA+ W P+TL + QR RW +G V +KN L++K + Sbjct: 239 SFRVYQMGYQIKFQPRAVTWEQEPQTLDVWFHQRTRWVKGNIYVVVKNAKLLFKKAGRPI 298 Query: 320 WPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGI--LLCTLCLLQF 377 + T ++ A+ L+ V + + +A + G L ++ F Sbjct: 299 RFDLLYFLSTYF--------LLMTALILSDVVFVLSVAGLAHSDLQGFSNALWLFGIIMF 350 Query: 378 IVSLMIENRYEHN--LTSSLFWIIWFPVIF---WMLSLATTLVSFTRVMLMPKKQRARWV 432 I+S + E S++ +II+ +++ W+ A +V + R + K+ A+W Sbjct: 351 IISTFVTVTTEKGEMRFSNILYIIFMYLVYSQMWLAVAAYGMVGYIREQVFHKQ--AKWY 408 Query: 433 SPDR 436 R Sbjct: 409 KTKR 412 >UniRef50_B1HSU1 Biofilm PIA synthesis N-glycosyltransferase icaA n=1 Tax=Lysinibacillus sphaericus C3-41 RepID=B1HSU1_LYSSC Length = 403 Score = 288 bits (737), Expect = 3e-76, Method: Composition-based stats. Identities = 99/389 (25%), Positives = 179/389 (46%), Gaps = 9/389 (2%) Query: 54 VYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYEN--IEVIAVN 111 R + + + ++ D PS+S+ IP NE +E+T+ A Y +EVI +N Sbjct: 8 YLHFRLFEKPIDEWSKKMTDVPSVSVFIPAHNEALVIEQTLRAMSRLYYPKDKLEVIVIN 67 Query: 112 DGSTDKTRAILDRMAAQIPHLRVIHL---AQNQGKAIALKTGAAAAKSEYLVCIDGDALL 168 D S+D+T I + A + P +RVI + +GK+ AL + A + + +V D D Sbjct: 68 DNSSDETGNIALQYAEKFPFMRVIETVEPNKGKGKSSALNSALADSTGDIVVVYDADNTP 127 Query: 169 DRDAAAYIVEPMLYNPRVGAVTGNPRIRTR-STLVGKIQVGEYSSIIGLIKRTQRIYGNV 227 +R A Y+V ++ +P+ A G R+ T + + E + + + + V Sbjct: 128 ERMAVWYLVMGLMNDPKAAATVGKFRVINAAETWLTRFINIETICFQWMAQGGRWKWFKV 187 Query: 228 FTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLK 287 T+ G A RR+ L ++G W + ED +++ ++ + I + P+A+ W PETLK Sbjct: 188 ATIPGTNFAIRRTVLEQLGGWDVKALAEDTELTIRVYNLGYHIRFFPKAITWEQEPETLK 247 Query: 288 GLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQL 347 WKQR RWA+G V LK +++ + + + F + T F ++ + A+ + Sbjct: 248 VWWKQRTRWARGNQYVVLKFLSQFFTLKRKSIIFDLFYFFFTYFLFFFGVI--LSNALFV 305 Query: 348 AGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWM 407 + +I LT +L L L + +++L IE + E N + L+ I+ + M Sbjct: 306 INLFYDIGLTVGDIALVLWVLAFLLFLGEVMITLSIE-KTEMNKKNFLYVILMYFTYSQM 364 Query: 408 LSLATTLVSFTRVMLMPKKQRARWVSPDR 436 + F + M Q +W +R Sbjct: 365 WIVLVVWSLFLEIKRMFTGQEVQWYKTER 393 >UniRef50_B7HT47 Glycosyl transferase family 2 n=8 Tax=Bacteria RepID=B7HT47_BACC7 Length = 1184 Score = 287 bits (736), Expect = 4e-76, Method: Composition-based stats. Identities = 98/475 (20%), Positives = 168/475 (35%), Gaps = 62/475 (13%) Query: 18 PLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSI 77 L + S L + ++ F + ++++ RER P + Sbjct: 3 YLDIFLKISQWLFLLYMGVAILFYTAIFLISAFTLRKKRERDENRELLHYLQSSITRP-V 61 Query: 78 SIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHL 137 SII+P +NE + ++ + L Y EVI VNDGS+D T L V+ L Sbjct: 62 SIIVPAYNEGVTIVSSVQSLLTLEYPEFEVIVVNDGSSDDTLEQLKDHFQLYEIQNVVRL 121 Query: 138 A-----------------------QNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAA 174 +N GKA AL G + Y+ +D D+LL+RDA Sbjct: 122 QLETETIRKTYRSSVNKQIIVVDKENGGKADALNAGINISNYPYVCSLDADSLLERDALM 181 Query: 175 YIVEPMLYNPRVGAVTGN-PRIRTRSTL--------------VGKIQVGEYSSIIGLIKR 219 ++P+ +P VTG RI S + + +Q+ EY + Sbjct: 182 KAMKPIYESPEKVMVTGGSVRIVNGSYIQNGQMIENRLPKQPLALMQIIEYLRGFLFGRL 241 Query: 220 TQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQ------WTIFYE 273 Y + +SG F + + VG + + ED+++ L I Y Sbjct: 242 AWSKYNILPIISGAFGIFDKGEVIRVGGYQRKTVGEDMELVVHLHKKALQDGEEKKIIYN 301 Query: 274 PRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWA 333 P A+CW P+ L KQR RW +G E ++ L+R + + + L W Sbjct: 302 PNAICWTQAPDDLTTFRKQRSRWHRGLGETLWRHKDILFRPKYKAFGMIAMPFYLLLEW- 360 Query: 334 FTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYE----- 388 +G II + + ++ I + +L T+ F+ ++ Sbjct: 361 ----LGPIIEILGYLLLLYHLLFDEIFTEYVFLLLAATVLYGSFLSVGVVLLEEWSMKKQ 416 Query: 389 ---HNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRGILR 440 + T L W + + L++ + + + K W R L Sbjct: 417 NSIKDFTLLLLWSLTESFWYRPLTVWYRFLGLFQSLFRIKG----WGKMKRKSLE 467 >UniRef50_B9ZC52 Glycosyl transferase family 2 n=1 Tax=Natrialba magadii ATCC 43099 RepID=B9ZC52_NATMA Length = 549 Score = 287 bits (736), Expect = 4e-76, Method: Composition-based stats. Identities = 100/426 (23%), Positives = 171/426 (40%), Gaps = 16/426 (3%) Query: 13 LVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLK 72 L + + + + +++ I +F + + Sbjct: 89 LAVAVGSYLGLIVWLGWLTPSPLALVHLGAVVLIFVYYWFIAFIALFHDQIGRSKYVPNP 148 Query: 73 DNPSISIIIPCFNEEKNVEETIHAALAQRYEN--IEVIAVNDGSTDKTRAILDRMAAQIP 130 P I+++IP +NEE V TI + L Y +E+IAV+DGSTD T A AA Sbjct: 149 PYPQITVLIPAYNEEGYVGRTIQSLLDANYPADALEIIAVDDGSTDDTLAEASAFAAASE 208 Query: 131 HLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVT 190 + V+ N GK AL G A + +V +D D+++DRDA +IV P + +GAV Sbjct: 209 QVSVVS-KANGGKYSALNYGLLFAAGDIIVTVDADSIVDRDALKHIVAPFAADDDIGAVA 267 Query: 191 GNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSD 250 N I R +L+ + Q EY+ + + +R +G V V G + A+RR L+EV + Sbjct: 268 SNVTIWNRDSLITRCQQLEYTIGVNIYRRALDYFGIVMVVPGCLGAYRREVLSEVFAYDP 327 Query: 251 DMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTR 310 D +TED D++ K+ + + A + P T L++QRLRW +G +K+ + Sbjct: 328 DTLTEDFDVTMKVLRAGYRVS-VSDARVYTEAPATWGDLYRQRLRWYRGNYMTIIKHWSV 386 Query: 311 LWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLC 370 + + + + L ++ V LA + I H+ + Sbjct: 387 VTDSSYGYLNRIALPFRLVEMFFLP-----FASFVVLAYILWLIAAGHVLTVFAVFVFFT 441 Query: 371 TLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRAR 430 ++ L + + IE L + V + A + V+ P Sbjct: 442 SIVFLIAALGIQIEG----EDWRLLVYAPLLVVGYKQFHDALNVKCLFDVLTSP---ELG 494 Query: 431 WVSPDR 436 W R Sbjct: 495 WTRAAR 500 >UniRef50_A5VMQ3 Glycosyl transferase, family 2 n=38 Tax=Lactobacillales RepID=A5VMQ3_LACRD Length = 443 Score = 287 bits (734), Expect = 8e-76, Method: Composition-based stats. Identities = 118/427 (27%), Positives = 205/427 (48%), Gaps = 13/427 (3%) Query: 23 YFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIP 82 + ++ + +P ++ W G + + ++R++ E P +L P I+I+IP Sbjct: 16 WMVFLIIINLILCIYPILGAMFWFFGAISYILFRQK----EELPPKAELTPEPFITIMIP 71 Query: 83 CFNEEKNVEETIHAALAQ-RYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQ 141 NEE ++ T+ L Q Y N EV+ ++DGS D+T IL M + LRVI + QNQ Sbjct: 72 AHNEEIVIKSTLEYLLTQLNYHNYEVLVMDDGSDDRTPEILHSMQTKYSRLRVIRIEQNQ 131 Query: 142 GKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRV--GAVTGNPRIRTRS 199 GKA A G AK EY++ D D + + DA ++ + A+T N + RS Sbjct: 132 GKAHAFNIGLFFAKGEYILSNDADTIPETDALIKYMQYFTSANGINYAAITANMDVYNRS 191 Query: 200 TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTV-SGVIAAFRRSALAEVGYWSDDMITEDID 258 +L GK Q E+SSI+G+IKR+Q N SG +RR L VG + D TEDI Sbjct: 192 SLWGKSQTVEFSSIVGIIKRSQTAINNTMYAYSGANTMYRRDFLINVGGFRQDRATEDIS 251 Query: 259 ISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKE--N 316 I+W T + P + + +PET L++QR RWAQGG+EV+L N +++R Sbjct: 252 IAWDHTFINATPKFAPDIVFHMNVPETFHDLYRQRKRWAQGGSEVWLSNFLKVFRHPWQY 311 Query: 317 FRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQ 376 + P+ + L+ IW+F + II+ + + + + ++ L+ Sbjct: 312 RFVIPMLTDTTLSIIWSFFFWITSIIFIITMLSFAITGNYERVWHGIVMSMIFVNFQLIA 371 Query: 377 FIVSLMIENRYEH--NLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKK-QRARWVS 433 + ++ + + L + + + W+++ T + +F + + +W+S Sbjct: 372 GLFQVLAALILDFNGSKLRYLMFSPLYLLFTWIVNPLTIVTTFHKAIKTVTGHGSGKWIS 431 Query: 434 PDRGILR 440 P+R ++ Sbjct: 432 PERKVVD 438 >UniRef50_C4I9P5 Inner membrane glycosyltransferase n=39 Tax=Bacteria RepID=C4I9P5_BURPS Length = 520 Score = 286 bits (733), Expect = 1e-75, Method: Composition-based stats. Identities = 90/416 (21%), Positives = 168/416 (40%), Gaps = 9/416 (2%) Query: 29 LMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLK--DNPSISIIIPCFNE 86 +++ + + +++ +V +V+ + AP + D P I++ + NE Sbjct: 97 IVISLCAAYLWVFALLTLVYASRHYVFGLDRLFKPQRAPYRAITHADWPEITVFVAAHNE 156 Query: 87 EKNVEETIHAALAQRYEN--IEVIAVNDGSTDKTRAILDRMAAQIPHLR--VIHLAQNQG 142 E V + + A LA Y + ++ VND STD TRA++D + A+ P L + G Sbjct: 157 EAVVADCLTALLATTYPRERLTIVPVNDRSTDNTRALIDEVQARAPELIKPFHRESGKPG 216 Query: 143 KAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRS-TL 201 KA ALK + + +V D D L +V P +P VGAV G + L Sbjct: 217 KAAALKDALREIRGDIMVVFDADYLPRPGLLKELVAPFF-DPEVGAVMGRVVPQNADRNL 275 Query: 202 VGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISW 261 + ++ E + + ++ + G V G + R+SAL VG W DD + ED D+++ Sbjct: 276 LARLLDLERAGGYQVNQQARNNLGLVPQYGGTVGGVRKSALDAVGGWRDDTLAEDTDMTY 335 Query: 262 KLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWP 321 +L L+ W Y A C+ +PE +Q RWA+G + + + L R Sbjct: 336 RLLLSNWRTVYLNHAECYEEVPERWPVRARQLTRWAKGHNQTLFRYLIPLLRSPVTPRRC 395 Query: 322 LFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSL 381 L ++ L+ + I+ + + L F V Sbjct: 396 RLDGALLLGVFVMPALLALAWGIALALYLTNGIDSLVLGLLVSVFALFAFSTFGNFGVFF 455 Query: 382 MIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTR-VMLMPKKQRARWVSPDR 436 I + ++ ++ V+ + +++A + + + ++ RW +R Sbjct: 456 EIVVAARLDGRATRLRLVPVNVVGFCVTIAAVVAALWGLALDALLRRELRWDKTER 511 >UniRef50_B7I546 Glycosyltransferase n=12 Tax=Acinetobacter RepID=B7I546_ACIB5 Length = 416 Score = 286 bits (732), Expect = 1e-75, Method: Composition-based stats. Identities = 85/421 (20%), Positives = 167/421 (39%), Gaps = 16/421 (3%) Query: 24 FHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPC 83 +++ F F + W W Y + +N P P + P +S++IP Sbjct: 1 MAPIDILFLFGFLGIWIPQAFWAWLSYQAWKYSKTAEKELQNLPIP--ERWPVLSVLIPA 58 Query: 84 FNEEKNVEETIHAALAQRYE--NIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQN- 140 +NE +E+T+HA Q Y + EV+ +NDGS D T I + +A P ++++++ + Sbjct: 59 YNEGVVIEDTLHAIAQQDYPAESYEVLLINDGSKDNTLEIAENLAKIYPCIKIVNVPKGM 118 Query: 141 --QGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRT- 197 +GK+ L G AK E +V D D+ + D + + +L + ++ AV G R R Sbjct: 119 GGKGKSRTLNNGLPHAKGELIVVYDADSTPEPDCVRLLAQTLLADKKLVAVNGKVRTRNW 178 Query: 198 RSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDI 257 + +++ + E+ + + + + T+ G R AL +G + + + +D Sbjct: 179 QDSILTRFIAIEFIFFQWIFQGGRWQRFELSTLMGTNYVIWRDALETLGGFDEKSLVDDT 238 Query: 258 DISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENF 317 ++S+++ + Q I + P A+ W P +L KQR RW QG V K + R Sbjct: 239 EMSFRIFIGQKRIKWVPYAIGWQQDPPSLSVFVKQRSRWTQGNFYVTRKYLPVALRTPFP 298 Query: 318 RMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQF 377 + ++ I + L + + LL L + Sbjct: 299 IGIEILNNIMCYILFVPALFWSHITLTLGLLDIA-------GISVLGPFTLLWGLSFCLY 351 Query: 378 IVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVML-MPKKQRARWVSPDR 436 + + E F+ + V + + L + ++ + +W +R Sbjct: 352 VAQMWFTLSLEKVKPELYFYSVLSYVSYSQIFLFIVFKAAFDMLKNKIQGNSLQWYKTER 411 Query: 437 G 437 Sbjct: 412 S 412 >UniRef50_C1F7J6 Glycosyl transferase, group 2 family n=1 Tax=Acidobacterium capsulatum ATCC 51196 RepID=C1F7J6_ACIC5 Length = 627 Score = 286 bits (732), Expect = 1e-75, Method: Composition-based stats. Identities = 86/428 (20%), Positives = 160/428 (37%), Gaps = 18/428 (4%) Query: 24 FHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPC 83 ++ + M +F + + V R P + + P +++ +P Sbjct: 120 WNLFDAAMLTPYFLVMIILSFYGVHRYIMVWEYYRFRKRATKEPPKEFPELPRVTVQLPI 179 Query: 84 FNEEKNVEETIHAALAQRYEN--IEVIAVNDGSTDKT----RAILDRMAAQIPHLRVIHL 137 FNE+ ++ I A A Y +E+ ++D STD+T A++ + Q + +H Sbjct: 180 FNEQFVIDRLIEAICAMDYPRDRLEIQVLDD-STDETQAVAAALVKKYQEQGQPIVYLHR 238 Query: 138 AQNQG-KAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIR 196 QG KA AL G AK E++ D D + D V +P +G V Sbjct: 239 TNRQGYKAGALDEGLKVAKGEFVAIFDADFVPSPDWLMK-VIHHFSDPAIGMVQTRWTHL 297 Query: 197 TRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITE 255 R + + +++ L + G F +G +RR+A+ + G W D +TE Sbjct: 298 NRDYSFLTQVEAILLDGHFVLEHGARSRAGVFFNFNGTAGMWRRTAITDAGGWQHDTLTE 357 Query: 256 DIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKE 315 D D+S++ QL W Y C +P + Q+ RWA+G + K M ++ R + Sbjct: 358 DTDLSYRAQLVGWKFKYLQDVECPAELPIEMTAFKTQQARWAKGLIQTSKKIMPQVLRAD 417 Query: 316 NFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLL 375 L Y LT ++ ++ I + + + + T + Sbjct: 418 LSWHEKLEAWYHLTANISYPLMIVLSILLLPTEIIQFHQGWFQMLFIDFPLFAASTFSIA 477 Query: 376 QFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLAT------TLVSFTRVMLMPKKQRA 429 F + + + Y H +L ++ + + L+L L+ K R Sbjct: 478 SFYM-VSQQILYPHRWFRTLCYLPFLMALGIGLTLTNSKAVIEALLGIKSSFKRTPKYRV 536 Query: 430 RWVSPDRG 437 + +R Sbjct: 537 Q-AKGERS 543 >UniRef50_Q6LKT1 Hypothetical glycosyltransferase, probably involved in cell wall biogenesis n=1 Tax=Photobacterium profundum RepID=Q6LKT1_PHOPR Length = 743 Score = 285 bits (729), Expect = 3e-75, Method: Composition-based stats. Identities = 83/444 (18%), Positives = 155/444 (34%), Gaps = 31/444 (6%) Query: 6 VSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGEN 65 I + + + + +L I + G++ V + Sbjct: 67 AFISIRYIWWRYSETLPWESTLDLPFAIALLVTECYGITIYLLGMFVNV----RQRSRDI 122 Query: 66 APAPQLKDNPSISIIIPCFNEE-KNVEETIHAALAQRYE-NIEVIAVNDGST-----DKT 118 P + P++ + IP +NE + V T+ AA+ +Y + V ++DG T D+ Sbjct: 123 TPINVNQAIPTVDVFIPTYNESIRVVAPTVSAAMQLQYPGKVNVWVLDDGGTPQKLNDED 182 Query: 119 RAILD----------RMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALL 168 + ++ + A KA + + E ++ +D D + Sbjct: 183 PQQARIAQQRTHKLKDLCQKLGANYITRPANVHAKAGNINHALKHSNGELILILDADHVP 242 Query: 169 DRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTL-----VGKIQVGEYSSIIGLIKRTQRI 223 +D V P++G + T + + E I Sbjct: 243 TKDFLINTVGMFQQQPKLGFIQTPHFFATPGPIEKNLGIEDKVPSENEMFYNRILSGMDF 302 Query: 224 YGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMP 283 + F G A RR AL +VG S ITED D + ++ W Y RA+ L P Sbjct: 303 WNASF-FCGSAAIIRREALLDVGGISTRTITEDADTALEIHAKGWDSIYLNRAMIAGLSP 361 Query: 284 ETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIY 343 +T QR RWAQG ++FL N L R +F P Y +T++ F L + Sbjct: 362 DTFGAYVTQRSRWAQGMLQIFLLNNPLLKRGLSF---PQKICYLNSTLFWFFPLFRVMYL 418 Query: 344 AVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPV 403 L + ++++ A + L + ++S + + + L S + + Sbjct: 419 IAPLCYLVFDLQIFVGNANDFLVFAIPHLA-ISMVISQHLFGKTRNALFSEFYETVLSLY 477 Query: 404 IFWMLSLATTLVSFTRVMLMPKKQ 427 +F+ + + + PK + Sbjct: 478 LFFPVLSVLITPKKPKFTVTPKGE 501 >UniRef50_B5W069 Glycosyl transferase family 2 n=3 Tax=Arthrospira RepID=B5W069_SPIMA Length = 561 Score = 283 bits (724), Expect = 1e-74, Method: Composition-based stats. Identities = 80/456 (17%), Positives = 168/456 (36%), Gaps = 48/456 (10%) Query: 15 LCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDN 74 + + L + G L+ ++ ++ + ++ + R + P + ++ Sbjct: 113 IALSLIWSLMIIGHLISW-GYWLILGLTGLLVIQAIRIVWAGHRGL---DINPEHRPEEL 168 Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRYEN--IEVIAVNDGSTDKTRAILDRMAAQIPHL 132 P +S+++ NEE + + A Y + E+ ++D S+D+T +L +A + L Sbjct: 169 PFVSLLVAAKNEEAVIGNLVKNLCALNYPSHCYELWVIDDNSSDRTPIVLQELAKEYQQL 228 Query: 133 RVIHLAQN--QGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVT 190 ++H +N GK+ AL + + L D DA +D D ++ +VGAV Sbjct: 229 HILHRDENATGGKSGALNQALPLTRGKILGVFDADATVDSDLLQQVIPKFQA-EQVGAVQ 287 Query: 191 GNPRIRTRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWS 249 I + + + Q E + ++ + G + + G RR+AL G W Sbjct: 288 LQKAIANSNFNFLTRCQASEMAL-DAFFQKQRVAVGGIGELRGNGEFIRRAALESCGGWC 346 Query: 250 DDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMT 309 + IT+D+D++ +L L+ W I + ++ + LW QR RWA+GG + +L Sbjct: 347 EQTITDDLDLTIRLHLDHWDIEFLDTSVVFEEGVTNWVALWHQRNRWAEGGYQRYLDYGK 406 Query: 310 RLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILL 369 + L + +I+ + + +++ L T +L Sbjct: 407 FIIANRMGVGKTFDLFGFL--------ITQYILPMAAIPDLFMSLILRRPPITSPLTLLA 458 Query: 370 CTLCLLQFIVSLMI-----------------------ENRYEHNLTSSLFWIIWFPVIFW 406 +L L+ + L +L + + +F Sbjct: 459 VSLSLIGMFIGLRRTVSRGNMPSQKTNSPDNLTTDGDRQAATESLGKYYPPLSYMTTLFQ 518 Query: 407 MLS------LATTLVSFTRVMLMPKKQRARWVSPDR 436 L +V+ + + +R +WV Sbjct: 519 TLRGTVYMFHWFIVVASATARISVRPKRLKWVKTVH 554 >UniRef50_UPI0001C1680B Glycosyl transferase, family 2 n=2 Tax=Nostocaceae RepID=UPI0001C1680B Length = 467 Score = 283 bits (724), Expect = 1e-74, Method: Composition-based stats. Identities = 84/431 (19%), Positives = 160/431 (37%), Gaps = 22/431 (5%) Query: 15 LCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDN 74 + + V ++ + F + ++ + V + + R RH D Sbjct: 53 ILVLTMVWGGTIALHLVSWGFAFILGLTTILGVHALRIILVRPRHHHKQ------IQGDL 106 Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRYEN--IEVIAVNDGSTDKTRAILDRMAAQIPHL 132 PS+S+++ NE+ ++ +H + Y + EV ++D STDKT IL ++ L Sbjct: 107 PSVSVLVSAKNEQAVIDRLVHNLCSLEYPHGEYEVWLIDDHSTDKTPEILAQLQQDYKQL 166 Query: 133 RVIHLAQN--QGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVT 190 V N GK+ AL K E + D DA + D ++ +VGAV Sbjct: 167 NVFRRDANATGGKSGALNQVLPMTKGEIIAVFDADAQVSPDLLLQVIPTFQR-EKVGAVQ 225 Query: 191 GNPRIRT-RSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWS 249 I + + Q+ E + ++ + G + + G RR AL + G W+ Sbjct: 226 VRKAIANAKENFWTRGQMAEMAL-DTWFQQQRTAIGGLGELRGNGQFVRREALNDCGGWN 284 Query: 250 DDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMT 309 ++ IT+D+D++ +L L W I + LW QR RWA+GG + +L Sbjct: 285 EETITDDLDLTIRLNLTGWDIECMFYPPVLEEGVTNVVALWHQRNRWAEGGYQRYLDYWD 344 Query: 310 RLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILL 369 + + + L+ +II + + +++ L Sbjct: 345 LILKGRMRAGKTVDL--------LIFMLIMYIIPTAAVPDLLMSLIRHRPPILAPITGLS 396 Query: 370 CTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRA 429 T+ + L + + N + + +ML + S T + + +R Sbjct: 397 VTMSFIGMFSGLRRTRQDQKNSNYLMLLLQTIRGSVYMLHWLVVM-SSTTARVSLRPKRL 455 Query: 430 RWVSPDRGILR 440 +WV + Sbjct: 456 KWVKTVHTGSQ 466 >UniRef50_Q2LWC4 Glycosyltransferase n=1 Tax=Syntrophus aciditrophicus SB RepID=Q2LWC4_SYNAS Length = 482 Score = 282 bits (722), Expect = 2e-74, Method: Composition-based stats. Identities = 93/473 (19%), Positives = 180/473 (38%), Gaps = 65/473 (13%) Query: 21 VAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLK---DNPSI 77 F+ F+ F+ +++ + V RH AP L+ + P + Sbjct: 3 AILVSIIIGFNYFIGFYYGFVQVIYTLMLTIALVVILRHLNRIRYAPFRDLEKRSEMPPV 62 Query: 78 SIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAIL------DRMAAQIPH 131 SIIIP NE + + +I ++L Y +EVI VNDGSTD +L R+ Sbjct: 63 SIIIPARNESEVIIRSIESSLTMNYPGLEVIVVNDGSTDGMLDMLISHYSLQRIDPLYRD 122 Query: 132 LR-----------------VIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAA 174 L +I +N GKA A+ +S Y+ D D++L++DA Sbjct: 123 LIKTRPVRGFYCTSKIPNLLIVDKENGGKADAINCAINVCRSPYVCVQDADSVLEKDALI 182 Query: 175 YIVEPMLYNPR-VGAVTGNPRIRT---------------RSTLVGKIQVGEYSSIIGLIK 218 + P++ + V A G R+ R + + Q+ EY + Sbjct: 183 RLATPLVQSEVPVIACGGVVRVLNGTKQEGNVVTSIELPRRSTLACFQIVEYLRSFLFGR 242 Query: 219 RTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLN------QWTIFY 272 +SG + F+++AL +G ++ D + ED+++ ++ + + I + Sbjct: 243 VGLDAMNCNLVISGAFSFFQKAALLNIGGFATDNVAEDMELIVRMHRHYRLKKIPYRIRF 302 Query: 273 EPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIW 332 +CW +PET + L +QR RW G + K+ + ++ ++ Y + Sbjct: 303 ISDPICWTEVPETFRMLGRQRRRWHLGMIQTVWKHKSLIFNPRYGQIGLFAMPYTV---- 358 Query: 333 AFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQF------IVSLMIENR 386 F ++ ++ + +PL+ I A + L + F + + R Sbjct: 359 -FVEMISPLLEFLGYIVIPLSYFFGLINADYFLLFLALAIAYGVFLSTMGIFLEEITFRR 417 Query: 387 Y--EHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRG 437 Y +L L + + + + L+ +F + + R +W + Sbjct: 418 YPKWSDLFKLLLYGVLENIGYRQLNSFWRFQAFFQFLF----GRIKWEHVKKK 466 >UniRef50_Q2S5Y1 Glycosyl transferase, group 2 family protein n=1 Tax=Salinibacter ruber DSM 13855 RepID=Q2S5Y1_SALRD Length = 434 Score = 282 bits (722), Expect = 2e-74, Method: Composition-based stats. Identities = 97/430 (22%), Positives = 161/430 (37%), Gaps = 50/430 (11%) Query: 53 WVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVND 112 + R + A P I++I P NEE ++ A L+ Y + E++ VND Sbjct: 1 MLRYHRRMGAYVSETALGADALPPITLIAPAHNEEATCVSSVRAFLSTEYPSFEILVVND 60 Query: 113 GSTDKTRAILDRM------------------------AAQIPHLRVIHLAQNQGKAIALK 148 GSTD T L + P+L V+ +N GKA AL Sbjct: 61 GSTDNTLDRLQEAFDLQPTARPPTSSLPTESVRQVYQSQTHPNLWVLD-KENGGKADALN 119 Query: 149 TGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRT----------- 197 G S +D D L++ + A +V P L AV G R+ Sbjct: 120 AGINFCDSPLFCAVDADTLVEGNGLARMVRPFLEETATVAVGGIVRVANGCTIRAGRVQE 179 Query: 198 ---RSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMIT 254 + + K+QV EY + G + +SG FRR VG ++ D + Sbjct: 180 VSMPTKWLPKLQVLEYLRAFYFGRVGWDALGAMLIISGAFGLFRRETAVRVGGYATDTVG 239 Query: 255 EDIDISWKLQLN------QWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNM 308 ED+++ ++ + + + P + W +P TL+ L +QR RW +G A+ +++ Sbjct: 240 EDMELVVRMHRRLREAGQDYRVRFVPDPVAWTKVPSTLQDLGRQRDRWQRGLADSMIRHQ 299 Query: 309 TRLWRKENFRMWPLF--FEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAG 366 L + L F Y L T L+G+I + V L L+ +A A Sbjct: 300 QMLLNPRYGTVGMLAYPFFYMLETFGPILELMGYITFVVALISGTLSGPFM-LAFFLVAF 358 Query: 367 ILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKK 426 +L L L + + RYE + +++ PV+ R + + Sbjct: 359 MLGMALSLSAVTLEELTFRRYEKT--ADFAQLLFLPVVETFGYRQLNAWWRVRGLWSYFR 416 Query: 427 QRARWVSPDR 436 W + +R Sbjct: 417 GNKEWGAMER 426 >UniRef50_Q110Z2 Glycosyl transferase, family 2 n=15 Tax=Cyanobacteria RepID=Q110Z2_TRIEI Length = 502 Score = 282 bits (721), Expect = 2e-74, Method: Composition-based stats. Identities = 92/444 (20%), Positives = 167/444 (37%), Gaps = 33/444 (7%) Query: 16 CIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNP 75 + + + ++ + + + + ++ + + + + + P Sbjct: 67 AALMLWLIWTTTIILHLLSWGYWIILGLTGLLSVQFLRILFAKPKLAPKTLSEENFTEWP 126 Query: 76 SISIIIPCFNEEKNVEETIHAALAQRYE--NIEVIAVNDGSTDKTRAILDRMAAQIPHLR 133 IS+++ NEE + + + LA Y + E+ ++D STDKT +L+++A + L+ Sbjct: 127 YISLLVAAKNEEAVIRKLVKNMLALDYPTNSYELWVIDDNSTDKTPLLLEQLAREYEQLK 186 Query: 134 VIHLAQ--NQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTG 191 VI + GK+ AL K + L D DA + D +V P+ VGAV Sbjct: 187 VIRRSPDAGGGKSGALNAAIPFVKGKILGVFDADAQVTPDLLQKVV-PLFAREEVGAVQI 245 Query: 192 NPRIRTRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSD 250 I K Q E G + + G + + G R +AL E G W++ Sbjct: 246 RKAIANAGINFWTKGQSAEMVV-DGFFQEQRIAIGGIGELRGNGQFVRMNALEECGGWNE 304 Query: 251 DMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTR 310 IT+D+D++ +L LNQW I Y + LW QR RWA+GG + +L Sbjct: 305 QTITDDLDLTIRLHLNQWDIDYLAFPAVTEEGVTSPIALWHQRSRWAEGGYQRYLDYWKL 364 Query: 311 LWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLC 370 + R + T + +++ + ++I L + T + Sbjct: 365 ILRNR--------MRFSKTWDLWQFLVTQYLLSVAAVPDFLMSIILRRLPITSPLTVFTV 416 Query: 371 TLCLLQFIVSLMIENRYEHNLTS---------------SLFWIIWFPV--IFWMLSLATT 413 + LL + L + + NL SLF + V F+ML Sbjct: 417 MVSLLGMFIGLRRTRKQQMNLAKEEKVMEFNSSKDNPLSLFLTLLESVRGTFYMLHWFVV 476 Query: 414 LVSFTRVMLMPKKQRARWVSPDRG 437 + T + +R +WV Sbjct: 477 M-GVTIARMSILPKRLKWVKTVHR 499 >UniRef50_Q38W83 Putative glycosyl transferase, family 2 n=1 Tax=Lactobacillus sakei subsp. sakei 23K RepID=Q38W83_LACSS Length = 439 Score = 282 bits (721), Expect = 2e-74, Method: Composition-based stats. Identities = 126/429 (29%), Positives = 213/429 (49%), Gaps = 17/429 (3%) Query: 21 VAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISII 80 + + ++++ + +P + +IV +Y+W+ E+ E+ P K P I+I+ Sbjct: 2 LLIDYFTQILIYTLVSYPIIGGVSFIVSSLYYWLLMEK-----EDQPRYLKKGTPFITIL 56 Query: 81 IPCFNEEKNVEETIHAALAQ-RYE--NIEVIAVNDGSTDKTRAILDRMAAQIP-HLRVIH 136 +P NE+ ++E TI Q Y E+I VNDGSTDKT IL+ + A+ LR + Sbjct: 57 VPAHNEKASIEATIDHLATQMNYPTDQYEIIVVNDGSTDKTGIILETLQAKYGQRLRTVT 116 Query: 137 LAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNP--RVGAVTGNPR 194 + N+GKA + AK E+++ D D+ + DA + ++GAVTGN Sbjct: 117 IINNRGKAAGFNSALGFAKGEFILSNDADSKPEVDALWKYMYYFEREGGAQLGAVTGNML 176 Query: 195 IRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMIT 254 ++TLV + Q E +SIIGLIKR+Q YG +F SG +R+SA+ +VG W + T Sbjct: 177 SINKTTLVAEAQQNELNSIIGLIKRSQLSYGGLFAFSGANTMYRKSAVIDVGGWQAEQPT 236 Query: 255 EDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRK 314 EDI ISW +Q W ++ P ++ +PETL L KQR RW GG V L + R Sbjct: 237 EDIAISWDMQTAGWQAYFAPHIRFFMDVPETLPELVKQRRRWTSGGIYVLLTKSFGILRH 296 Query: 315 --ENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTL 372 ++F M P+ +Y L+ IW+ + + V G +N + + ++ + + T+ Sbjct: 297 PIKHFSMVPIIIDYGLSIIWSLFYWISMATFIVLQIGYVINQDWHRLLSSLSIAAIFTTI 356 Query: 373 CLLQFIVSLMIENRYEHNLT--SSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQR-- 428 +L ++ L+ + + + + W+ +I+WM++ T V + + + Sbjct: 357 EILVGLIQLVQASYFNDGGRSLKYIAFAPWYILIYWMVNTYTVAVEIIPTIRQIIRGKDA 416 Query: 429 ARWVSPDRG 437 W SP R Sbjct: 417 GVWKSPKRS 425 >UniRef50_B7A6V1 Glycosyl transferase family 2 n=1 Tax=Thermus aquaticus Y51MC23 RepID=B7A6V1_THEAQ Length = 467 Score = 280 bits (718), Expect = 5e-74, Method: Composition-based stats. Identities = 99/473 (20%), Positives = 174/473 (36%), Gaps = 54/473 (11%) Query: 17 IPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPS 76 + L + ++++ + F F+ + + G+ R + + Sbjct: 1 MELLAWLLYGYQVLVLYYFAVLNFLYATFALFGLRMVARYARELSELSLKDLLEREAYLP 60 Query: 77 ISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAA--------- 127 +SI++P +NEEK + ++ + LA RY EVI V DG D+T +L Sbjct: 61 VSILVPTYNEEKTIAASVRSFLALRYPEFEVIVVCDGPRDRTLEVLKEAFRLVQVDWVFK 120 Query: 128 ---------------QIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDA 172 P+L V+ +N GKA AL G A+ +D D+LLD +A Sbjct: 121 RALPTKPIRGVYRSLTYPNLLVVD-KENGGKADALNAGLNFARYPLFCGVDADSLLDPEA 179 Query: 173 AAYIVEPMLYNPRVGAVTGNPRIRT--------------RSTLVGKIQVGEYSSIIGLIK 218 L + RV AV G R + K+Q+ EY+ + + Sbjct: 180 LLRASRLFLEDERVLAVGGTIRPLNGARVREGIVEELRLPRGFLEKMQIIEYARAFFMGR 239 Query: 219 RTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQL------NQWTIFY 272 + +SG FRRS +VG + D + ED+++ +L + + Y Sbjct: 240 AGWSAMNALLIISGAFGLFRRSEAMKVGGYRTDTVGEDMELVVRLHRRAREEGRDYRVLY 299 Query: 273 EPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTT-- 330 P +C+ +P L +QR RW +G EV + L+ R+ L Y Sbjct: 300 TPDPICYTEVPADWATLRRQRNRWHRGLWEVLWLHRAMLFNPRYGRLGLLAMPYFFLFEA 359 Query: 331 IWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGI--LLCTLCLLQFIVSLMIENRYE 388 + ++G+++ V N E + G LL L + + L R Sbjct: 360 LGPVVEVLGYLLLPVFYLLGVFNAEFALLFLLLALGYGVLLSQLAIGMETLLLKRYPRLR 419 Query: 389 HNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRGILRG 441 + ++ +L+L + +F ++R +W R L G Sbjct: 420 DRVALLFLSLLEALGYRQVLALERFIATF-----QVLRKRGQWGEMRRKGLGG 467 >UniRef50_C7RAF6 Glycosyl transferase family 2 n=1 Tax=Kangiella koreensis DSM 16069 RepID=C7RAF6_KANKD Length = 453 Score = 280 bits (717), Expect = 7e-74, Method: Composition-based stats. Identities = 96/439 (21%), Positives = 184/439 (41%), Gaps = 20/439 (4%) Query: 5 IVSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGE 64 + + VL + + S L+ VF + + ++ + + R P + Sbjct: 16 TSLYLLSATVLILAFFIDRPVSDSLISLRVFLFIMVLPLITKLMVQLYAAIRYSLQPVNQ 75 Query: 65 NAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDR 124 +S+++P +NEE + +TI + L Y + E+I +NDGSTDKT ++ + Sbjct: 76 GYSTMVPY---KVSVLLPAWNEEVGIVKTIESVLQSNYHDFELIVINDGSTDKTDQLVRQ 132 Query: 125 MAAQIPH-------LRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIV 177 A ++ + L N GKA AL G A ++ ID D ++DR+A + Sbjct: 133 FIADYEADSNPKVSIKYLSLP-NGGKAHALNKGLELATGNIVITIDADCIVDRNAIYRFI 191 Query: 178 EPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAF 237 + +P +GAV GN + + + +Q EY KR + V + G AA+ Sbjct: 192 QR-FDDPSIGAVAGNIIVGNKRKPLEWLQQLEYVCGF-FYKRADSYFNAVHIIGGAAAAY 249 Query: 238 RRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWA 297 RR L + + +++ITEDID+S + + + Y A+ + P KG+ KQRLRW Sbjct: 250 RRDVLINLKGFDEEIITEDIDLSMSILSHGYKTRYAHDAVVYTEGPTEWKGIRKQRLRWK 309 Query: 298 QGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELT 357 G +LK+ + L + ++A L+ ++ + + Sbjct: 310 YGRILCYLKHRELFLSSNSQHNRYLSWFILPLAVYAEIMLLNVGLF----LSLFYCYVIA 365 Query: 358 HIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSF 417 A ++L + L+ + L ++ Y NL L +I ++ +++ + Sbjct: 366 SQDFALLASVILVSSSLVALQIKLDSKSSYHKNL---LPFIPVAWLLLYVIEFIELVALT 422 Query: 418 TRVMLMPKKQRARWVSPDR 436 + + ++ +W +R Sbjct: 423 MSLQKLYNRESVKWQKWNR 441 >UniRef50_O67406 Cellulose synthase catalytic subunit n=1 Tax=Aquifex aeolicus RepID=O67406_AQUAE Length = 759 Score = 279 bits (715), Expect = 1e-73, Method: Composition-based stats. Identities = 85/452 (18%), Positives = 156/452 (34%), Gaps = 39/452 (8%) Query: 5 IVSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGE 64 +V + + + + ++ F S+ + G++F + P Sbjct: 80 LVFIMLRYFWWRTFNTINTDNVWNFIFSTALYFAEFYSVTIALLGIFFSIR-----PIDR 134 Query: 65 NAPAPQLKDNPSISIIIPCFNEEKNVEETIH-AALAQRYE--NIEVIAVNDGST-----D 116 A ++ P++ + IP +NE + ET AA+ Y V ++DG T D Sbjct: 135 KAIKVDMESLPTVDVFIPTYNEPPEIPETTALAAINMDYPSDKFNVYILDDGGTKQRLND 194 Query: 117 KTRAILDRMAAQ---------------IPHLRVIHLAQN-QGKAIALKTGAAAAKSEYLV 160 + + ++ + +N KA + K + ++ Sbjct: 195 PDPERREYFRKRAEELKSFVERLRKLGYKNIHYLTREKNVHAKAGNINEALKKTKGDLIL 254 Query: 161 CIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKI-----QVGEYSSIIG 215 +D D + +D V + NP+V V + + GE Sbjct: 255 ILDADHVPSKDFLKETVGFFVKNPKVFLVQTPHTFYNPDPIEKNLGVFGRMPGENEMFYF 314 Query: 216 LIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPR 275 LI++ ++ + F G A RR L EVG +TED + + +L + Y R Sbjct: 315 LIQKGFDLWNSSF-FCGSAALLRRKYLEEVGGIQTTTVTEDAETALELHSRGYESVYYDR 373 Query: 276 ALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFT 335 L + L PETL G+ QR+RWAQG ++F+ L + W Y + + F Sbjct: 374 PLIFGLNPETLSGMIVQRIRWAQGMIQIFILKNPIL---KKGLKWYQKLAYFNASFFWFF 430 Query: 336 CLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSL 395 L I LA + + + + + L ++ + +R S + Sbjct: 431 GLARTIFLIAPLAYLLFGLHIYDASLAEVLAYPIPHF-LASLLLFYYLFSRVRWPFISEI 489 Query: 396 FWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQ 427 + I IF L + PK + Sbjct: 490 YESILGIFIFIPSILTLLNPKNPTFRVTPKGE 521 >UniRef50_Q1IMJ5 Glycosyl transferase, family 2 n=10 Tax=cellular organisms RepID=Q1IMJ5_ACIBL Length = 546 Score = 279 bits (715), Expect = 1e-73, Method: Composition-based stats. Identities = 77/405 (19%), Positives = 147/405 (36%), Gaps = 17/405 (4%) Query: 14 VLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVY------FWVYRERHWPWGENAP 67 + + F F I+ I+ Y R+ P Sbjct: 23 WIRTHVLDTTFKGLYQANAFDLCLLIPYFIVLIILAAYGVHRYQLVWMYYRNRKNKTTDP 82 Query: 68 APQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYEN--IEVIAVNDGSTDKT----RAI 121 + P +++ +P FNE+ ++ + A Y +++ ++D STD+T R + Sbjct: 83 PQHFAELPRVTVQLPIFNEQYVIDRLVEAVCKLDYPKDKLDIQVLDD-STDETVEVAREV 141 Query: 122 LDRMAAQIPHLRVIHLAQNQG-KAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPM 180 ++R AA + IH G KA AL+ G A K E++ D D + D + Sbjct: 142 VERYAALGNPISYIHRTNRHGFKAGALQEGMAVCKGEFIAIFDADFVPPADFLQKCI-HH 200 Query: 181 LYNPRVGAVTGNPRIRTRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRR 239 P +G V R+ + + +++ L + G F +G +R+ Sbjct: 201 FAEPEIGMVQTRWTHLNRNYSFLTEVEAILLDGHFVLEHGGRSRKGVFFNFNGTAGMWRK 260 Query: 240 SALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQG 299 A+ E G W D +TED D+S++ Q+ W Y C +P + Q+ RWA+G Sbjct: 261 QAIEEAGGWQHDTLTEDTDLSYRAQVKGWRFKYLQDVECPAELPIEMTAFKTQQARWAKG 320 Query: 300 GAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHI 359 + K + L+R + R + Y LT ++ ++ + + + Sbjct: 321 LIQCSKKVLPFLYRSDVPRRVKVEAWYHLTANISYPLMIVLSALMLPAMVLRFYQGWFQM 380 Query: 360 AATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVI 404 L T + F + + + Y + ++ + Sbjct: 381 LYIDMPLFLASTFSISSFYL-VSQKELYPKTWLRTFMYLPALMAL 424 >UniRef50_B7HFD6 N-acetyllactosaminide beta-1,6-N-acetylglucosaminyl-transferase n=76 Tax=Firmicutes RepID=B7HFD6_BACC4 Length = 433 Score = 279 bits (715), Expect = 1e-73, Method: Composition-based stats. Identities = 78/427 (18%), Positives = 155/427 (36%), Gaps = 21/427 (4%) Query: 14 VLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKD 73 + I + Y + F+F + G+ L+ Sbjct: 19 WISITFSIEYVL---IFTAFLFSGLLVYYSFLTIAGLIH---------RNSKRKDRTLEH 66 Query: 74 NPSISIIIPCFNEEKNVEETIHAALAQRYE-NIEVIAVNDGSTDKTRAILDRMAAQIPHL 132 PS+ I IP NE +++T+ A Y + + +ND S D+T I D H+ Sbjct: 67 YPSVDIFIPAHNEGIVIKDTLEAMAKIEYPGKLTIYLLNDNSQDETPEIGDDFDKAYAHI 126 Query: 133 RVIHLA--QNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVT 190 I + + +GK+ L G + + EY D D + A +VE Sbjct: 127 CHIRVPPGEPKGKSRVLNYGLSISDGEYFCVYDADNQPEPHALRMLVEHAETTEDAVGAV 186 Query: 191 GNPRIRTRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWS 249 G+ R + + ++ E+ L++ + + +++G RRSAL E+G + Sbjct: 187 GHVRTVNENRNWLTRMISLEFQIFQLLMQSGRWLLFQTGSLTGTNMLLRRSALEELGGYD 246 Query: 250 DDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMT 309 I ED +++ ++ + + P ++ W PE LK L KQR RW QG + K + Sbjct: 247 PYAIAEDAELTLRITQKGYLLPIVPESVTWEQEPEHLKILIKQRTRWLQGNLYILEKMFS 306 Query: 310 RLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILL 369 L + + + + ++ ++ + + + L G+ ++ + + Sbjct: 307 SLSFFKGKLLVHSLQQVLVYVVFWLFLIISNVWFVIGLLGIF---QIQYSIPLLFMWYVA 363 Query: 370 CTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRA 429 + Q + +E + + + + +L+ + R KKQ Sbjct: 364 YITYVSQLFSAQSVERTFTPTNIFISVIMYFTYAQLFTYLFIRSLILYLRA--KSKKQVI 421 Query: 430 RWVSPDR 436 W R Sbjct: 422 GWDKTVR 428 >UniRef50_D1BHB0 Glycosyl transferase n=1 Tax=Sanguibacter keddieii DSM 10542 RepID=D1BHB0_SANKS Length = 411 Score = 279 bits (714), Expect = 2e-73, Method: Composition-based stats. Identities = 81/408 (19%), Positives = 175/408 (42%), Gaps = 12/408 (2%) Query: 32 RFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVE 91 + + + + ++ A ++ NP +S+++P F+E + + Sbjct: 4 WALALQLLVLPAVLLSLVEAAYLPLALVAHARRPADPLRVPSNPFVSVVVPAFDEGRVLR 63 Query: 92 ETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGA 151 + + LA RY N EV+ V+DGS D T ++ + P + V H N GKA AL G Sbjct: 64 RCVDSILADRYLNKEVVLVDDGSRDDTWEVMKSYRGR-PGVVVAH-QTNAGKAAALNHGI 121 Query: 152 AAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGEYS 211 + ++ L+ +D D + ++ +PRVGAV GN ++ + Sbjct: 122 SLSRGSILMFVDADGVFSGWTIRGMLRG-FDDPRVGAVCGNDAPVNLDRFQPRLLTL-LT 179 Query: 212 SIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIF 271 + +++R + VSG A RR L EVG +++ ++ ED++++W+++ + + Sbjct: 180 HVTAMVRRALAQVNCLTIVSGNCGAVRREVLDEVGGFTEGLLGEDLELTWRVRRAGYRVS 239 Query: 272 YEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTI 331 ++P A+ + +P + LW+QR+RWA+G + + + + + P LT + Sbjct: 240 FQPEAIVYAEVPAGVGPLWRQRVRWARGYLQTLWMHRDLVGSVRHGVVGPYLAVNALTMV 299 Query: 332 WAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNL 391 A ++ + + L + + + T L +L +VS + + Sbjct: 300 VAP--VLQLVALGIILGLAVDDSQTLYGTPASTLAWALLLFPVLSVVVSAALNGSWRD-- 355 Query: 392 TSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRGIL 439 + ++ PV +++ ++ +++ +RW +R + Sbjct: 356 LRFFYVLLVMPVYSTVMNAVMVTAIVQQL----RRRPSRWNKLERTGV 399 >UniRef50_P58931 Cellulose synthase catalytic subunit [UDP-forming] n=29 Tax=Proteobacteria RepID=BCSA_PSEFS Length = 739 Score = 278 bits (713), Expect = 2e-73, Method: Composition-based stats. Identities = 76/439 (17%), Positives = 156/439 (35%), Gaps = 25/439 (5%) Query: 1 MINRIVSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHW 60 ++ + + + + + + ++ + F +++ ++ G + R Sbjct: 88 LVVLSLVASLRYMFWRLTSTLGFETWVDMFFGYGLVAAEFYALIVLIFGYVQTAWPLRRT 147 Query: 61 PWGENAPAPQLKDNPSISIIIPCFNEEKNVEE-TIHAALAQRYEN--IEVIAVNDGSTDK 117 P + ++ P++ + IP +NE ++ + TI AA A + + V ++DG D Sbjct: 148 PVWLKT---EPEEWPTVDVFIPTYNEALSIVKLTIFAAQAMDWPKDKLRVHVLDDGRRDD 204 Query: 118 TRAILDRMAAQIPHLRVIHLAQN-QGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYI 176 R ++ + I N KA L EY+ D D + R Sbjct: 205 FREFCRKVG-----VNYIRRDNNFHAKAGNLNEALKVTDGEYIALFDADHVPTRSFLQVS 259 Query: 177 VEPMLYNPRVGAVTGNPRIRTRSTL-----VGKIQVGEYSSIIGLIKRTQRIYGNVFTVS 231 + L +P++ + + + E GL++ ++ F Sbjct: 260 LGWFLKDPKLAMLQTPHFFFSPDPFEKNLDTFRAVPNEGELFYGLVQDGNDLWNATF-FC 318 Query: 232 GVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWK 291 G A RR L E+G + + +TED + KL + Y L E+L Sbjct: 319 GSCAVIRREPLLEIGGVAVETVTEDAHTALKLNRLGYNTAYLAIPQAAGLATESLSRHIN 378 Query: 292 QRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVP 351 QR+RWA+G A++F + L + W Y + F L + LA + Sbjct: 379 QRIRWARGMAQIFRTDNPLLGK---GLKWGQRICYANAMLHFFYGLPRLVFLTAPLAYLI 435 Query: 352 LNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLA 411 E+ H +A +L L S I+ R+ H+ + ++ + + +++L Sbjct: 436 FGAEIFHASALMIVAYVLPHLVHSSLTNS-RIQGRFRHSFWNEVYETV---LAWYILPPV 491 Query: 412 TTLVSFTRVMLMPKKQRAR 430 + + + Sbjct: 492 LVALVNPKAGGFNVTDKGG 510 >UniRef50_B6AKK5 Glycosyltransferase n=1 Tax=Leptospirillum sp. Group II '5-way CG' RepID=B6AKK5_9BACT Length = 714 Score = 277 bits (710), Expect = 4e-73, Method: Composition-based stats. Identities = 83/444 (18%), Positives = 153/444 (34%), Gaps = 24/444 (5%) Query: 1 MINRIVSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHW 60 ++ ++ I + Y + + + + + G + V R Sbjct: 78 LVMISMALAGRYFSWRIAQTI-YGSPLDWVASVMLVLVEAYAAVMTFLGFFVMVSPVRR- 135 Query: 61 PWGENAPAPQLKDNPSISIIIPCFNEE-KNVEETIHAALAQRYE--NIEVIAVNDGSTDK 117 + K+ P++ ++IP +NE + TI AA Y + V ++DG Sbjct: 136 --TSPPLPKEEKECPTVDVMIPVYNEPVDVIRPTIFAASQLEYPLSRLRVWILDDGR--- 190 Query: 118 TRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIV 177 R ++ ++ ++ + KA L + + D D + Sbjct: 191 -RKEIEALSGELGVGYLTRPDNKGAKAGNLNHALGKTDGDLIAIFDCDHVPLPRFLQKTA 249 Query: 178 EPMLYNPRVGAVTGNPRIRTRSTL-----VGKIQVGEYSSIIGLIKRTQRIYGNVFTVSG 232 L P + V +R G GE +I+ ++ + G Sbjct: 250 GFFLNRPDLALVQTPHHFYSRDPFERNIGFGNQVPGEPDLFYHVIQPGMDLWNAAY-FCG 308 Query: 233 VIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQ 292 A RRSAL E+G + + +TED S L + +Y AL L P++++ L KQ Sbjct: 309 SAAVLRRSALKEIGGFRTETVTEDAHTSLCLHDRGYRSYYLEEALVTGLSPDSMRDLIKQ 368 Query: 293 RLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPL 352 R+RW +G ++F + + +F P Y + F + L + Sbjct: 369 RVRWCRGMIQIFRIDNPLFKKGLSF---PQKLCYMNAIFYWFFSIPRLFFLVAPLLYIYF 425 Query: 353 NIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLAT 412 I H LL L + Q SL+ L S ++ I P+ F+++ A Sbjct: 426 RIYSVHAGLEDMLLYLLPYLLVAQGS-SLLFYRSRRSLLWSIVYEI---PLSFYLIFPAF 481 Query: 413 TLVSFTRVMLMPKKQRARWVSPDR 436 T + F + + DR Sbjct: 482 TALLFPKKGTFHVTPKGTRTDSDR 505 >UniRef50_A8VSD1 Binding-protein-dependent transport systems inner membrane component n=2 Tax=Bacillus selenitireducens MLS10 RepID=A8VSD1_9BACI Length = 476 Score = 277 bits (708), Expect = 8e-73, Method: Composition-based stats. Identities = 87/473 (18%), Positives = 171/473 (36%), Gaps = 62/473 (13%) Query: 22 AYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLK---DNPSIS 78 + E++ + + F+S+++IV R + D IS Sbjct: 5 WLVSAVEVLAWMIVIYMLFVSLLYIVLFSVAMPKIRREQKLNRIEQIEGMTVNQDTYPIS 64 Query: 79 IIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDK--------------------- 117 +++P +NEE + ++++ L Y EVI +NDGS D+ Sbjct: 65 VLVPAYNEEVGIVASVNSLLTLNYPEYEVIVINDGSDDETSLRIKDHFNMKPVKLAMRKH 124 Query: 118 --TRAILDRMA-AQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAA 174 T + ++ +L +I +N GK AL G + Y ID D++L++DA Sbjct: 125 FQTAEVTAAYQSSEYGNLYLIE-KENGGKGDALNAGINFSSYPYFASIDADSILEKDALL 183 Query: 175 YIVEPML-YNPRVGAVTGNPRIRT--------------RSTLVGKIQVGEYSSIIGLIKR 219 ++P++ N V A G RI + +Q+ EY + + Sbjct: 184 KTMKPIIDSNGEVTATGGTVRIANGCQIEKSVVQTITLPKKPLEVMQIIEYFRGFLIGRL 243 Query: 220 TQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQL------NQWTIFYE 273 + VSG F +S + VG + + ED+++ ++ + I Y Sbjct: 244 GFSRTNMLMIVSGAFGVFEKSRVIRVGGYDSSTVGEDMELVVRIHRSIKDEKSPQRIEYI 303 Query: 274 PRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWA 333 +CW P T L QR+RW +G E K+ ++ + R+ Y + Sbjct: 304 QDPVCWTEAPSTFSSLKSQRIRWQRGLTETIWKHKGMMFNPKYKRVGLFSLPY-----YT 358 Query: 334 FTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSL------MIENRY 387 F L+ ++ + L + I + T+ + SL ++Y Sbjct: 359 FVELLSAVVEFFGYLIIILGLVFAFIDPAIAGMMFAVTILYGSLLSSLSVLLEEWTYHKY 418 Query: 388 EHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRGILR 440 + + + ++ +FW L + + ++ W + +R + Sbjct: 419 QDVKSLPILFLYALTEVFWYRPLTVSFRLV--GLFSAFGRKREWGNMERKGID 469 >UniRef50_A3ERE6 Cellulose synthase catalytic subunit (UDP-forming) n=2 Tax=Leptospirillum sp. Group II RepID=A3ERE6_9BACT Length = 714 Score = 277 bits (708), Expect = 8e-73, Method: Composition-based stats. Identities = 95/452 (21%), Positives = 151/452 (33%), Gaps = 24/452 (5%) Query: 1 MINRIVSFFILCLVLCIPLCVAY----FHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYR 56 +I + + H E F+ F S + +V G Y Sbjct: 64 LITLSFFSTLRYAWWRTTETLGTGDPKVHWYEYPFIFLLFGAELYSWIILVLGYLQTAYP 123 Query: 57 ERHWPWGENAPAPQLKDNPSISIIIPCFNEE-KNVEETIHAALAQRYE--NIEVIAVNDG 113 + P + P++ I+IP +NE V T+ AAL + V ++DG Sbjct: 124 R---GRKADPLPPDTESWPTVDILIPTYNEPLAVVRTTLLAALNMDWPPEKKRVFLLDDG 180 Query: 114 STDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAA 173 + R AAQ V KA + A + +Y+ D D + R Sbjct: 181 N----REEFATFAAQTGVTYVARPEHRHAKAGNVNYALARSDGKYVALFDCDHVPTRSFL 236 Query: 174 AYIVEPMLYNPRVGAVTGNPRIRTRSTL-----VGKIQVGEYSSIIGLIKRTQRIYGNVF 228 V + +PRV V + V K E G+I+ ++ Sbjct: 237 QTTVGILEKSPRVAFVQTPHHFYSPDPYERNLNVFKKLPNEGELFYGVIQDGNDLWNAS- 295 Query: 229 TVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKG 288 T G RR AL E+ ++ D +TED S +L W Y L +L Sbjct: 296 TFCGSCTVLRRKALNEIQGFAVDTVTEDAHTSIRLHRLGWESAYLNLPQAAGLATPSLSA 355 Query: 289 LWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLA 348 +QR+RWA+G ++F L R Y ++ + L I LA Sbjct: 356 HIRQRIRWARGMIQIFRIENPLLAR---GLTLAQKLCYLNGMLFFLSSLPRIIFLTAPLA 412 Query: 349 GVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWML 408 + + + A A T+ L + + I R+ H+ + ++ P IF Sbjct: 413 YLYFGVRIFDADAYSIASYAFPTIAL-SLLANAAINGRHRHSFWNQVYETTLAPYIFLPT 471 Query: 409 SLATTLVSFTRVMLMPKKQRARWVSPDRGILR 440 LA + K+ R DR I R Sbjct: 472 LLAMVRPDLGTFNVTAKEGRIEKEFYDRRIAR 503 >UniRef50_B9AE19 Putative uncharacterized protein n=1 Tax=Methanobrevibacter smithii DSM 2375 RepID=B9AE19_METSM Length = 461 Score = 277 bits (708), Expect = 8e-73, Method: Composition-based stats. Identities = 86/452 (19%), Positives = 162/452 (35%), Gaps = 34/452 (7%) Query: 1 MINRIVSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHW 60 +I + S F+L + + + + + M + I+ Sbjct: 31 LIEKSNSIFLLVTCSALIISGILARNVTWLQWLLLIPTLTMLFLAIIST----------- 79 Query: 61 PWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRY-----ENIEVIAVNDGST 115 + + P +S+IIP NEE + +T+ + Y N E+I VNDGST Sbjct: 80 --KKQEKPIPYEKPPFVSLIIPAHNEEYTIAQTVTSISKIDYTLNGKPNFELIVVNDGST 137 Query: 116 DKTRAILDRMAAQIPHLRVIHLAQNQ---GKAIALKTGAAAAKSEYLVCIDGDALLDRDA 172 D T L + IP LR++ + GK L + +K E + D D +++D Sbjct: 138 DSTGEKLSELKKDIPILRIVTRKPPKSGKGKGFVLNDALSLSKGEIIGVFDADTQVEKDF 197 Query: 173 AAYIVEPMLYNPRVGAVTGNPRIRTRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVS 231 IV P L NP+V V ++ + + +Q E+ S G + G + Sbjct: 198 L-NIVMPYLNNPKVQGVQTRVKMYNKDENFLANMQHVEFES-FGNTLIAKDNLGKSGFLG 255 Query: 232 GVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWK 291 G ++ A+ + W +TED+++S K+ L I Y + K +K Sbjct: 256 GNGQFVKKQAILDGEKWDGFAVTEDLNLSVKILLKGGQIRYCGETAVYQEAVTDWKSFFK 315 Query: 292 QRLRWAQGGAEVFLKNMTRLWR-KENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGV 350 QR RWA G E + ++ + E+ + GFI+ + Sbjct: 316 QRTRWAIGNFETIFIYLPKILKSPLPLIKKYGIIEHISFYAFNLFIFFGFIVSMLNAISW 375 Query: 351 PL--NIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWML 408 + + + + A GI+ + ++L+ E + + +W+ Sbjct: 376 FIFHGVTIIRMDAPLIVGIISTIAFIPGISIALLREKAGPLKFIKDI-------IGYWIY 428 Query: 409 SLATTLVSFTRVMLMPKKQRARWVSPDRGILR 440 + F + M ++ +W + Sbjct: 429 CFHLIPLFFETMFKMIIRKERKWDKTKHKGIE 460 >UniRef50_C1DJH7 Cellulose synthase subunit AB n=35 Tax=Proteobacteria RepID=C1DJH7_AZOVD Length = 1455 Score = 277 bits (708), Expect = 9e-73, Method: Composition-based stats. Identities = 86/448 (19%), Positives = 149/448 (33%), Gaps = 25/448 (5%) Query: 1 MINRIVSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHW 60 +I V+ L + + +S E + + M + G + V + Sbjct: 62 LIVVSVAVSTRYLYWRTTQTLVFDNSLEAFLGTGLYLAEVYVWMILTLGYFQTVMPLKRA 121 Query: 61 PWGENAPAPQLKDNPSISIIIPCFNEE-KNVEETIHAALAQRYE--NIEVIAVNDGSTDK 117 L P++ + IP +NE V++T+ AA Y + V ++DG + Sbjct: 122 IRPLPE---DLSQWPTVDVYIPTYNESLDVVQDTVLAAQNIDYPADKLRVYLLDDGRRPE 178 Query: 118 TRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIV 177 A + KA L + E + D D + R V Sbjct: 179 FGAFAAAAGVGY----ITRSDNRHAKAGNLNNALSLTDGELICIFDCDHVSTRAFLQATV 234 Query: 178 EPMLYNPRVGAVTGNPRIRTRSTL-----VGKIQVGEYSSIIGLIKRTQRIYGNVFTVSG 232 L + R+ V + G+ E G +++ + F G Sbjct: 235 GAFLQDARLALVQTPHHFHSPDPFERNLATGREVPNEGELFYGPVQQGNDFWNAAF-FCG 293 Query: 233 VIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQ 292 A RRSAL E+G ++ + +TED + KLQ W + P L L E L Q Sbjct: 294 SCAVIRRSALEEIGGFAVETVTEDSHTALKLQRRGWNTAFLPVPLAADLATERLALHIGQ 353 Query: 293 RLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPL 352 R+RWA+G ++F + L R P Y + L + A + L Sbjct: 354 RMRWARGMTQIFRLDNPLLGR---GLRLPQRLCYLNALLHYQFALPRIVFLTAPTAYLLL 410 Query: 353 NIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLAT 412 + AT +L + + + I+ R+ H+ L+ + + F +L Sbjct: 411 GQNIIAAPATLIFAYMLPHMAH-SIMANARIQGRHRHSFWGELYETV---LAFHILKPTL 466 Query: 413 TLVSFTRVMLMPKKQRARWVSPDRGILR 440 + F + + DRG Sbjct: 467 VTLFFPKRGKFNVTDKG--ELLDRGFFD 492 >UniRef50_P21877 Cyclic di-GMP-binding domain n=11 Tax=cellular organisms RepID=ACSA1_ACEXY Length = 1550 Score = 276 bits (707), Expect = 1e-72, Method: Composition-based stats. Identities = 62/420 (14%), Positives = 144/420 (34%), Gaps = 24/420 (5%) Query: 7 SFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENA 66 + L + +++ + ++ + ++M + + + P Sbjct: 84 LVSLRYLTWRLTETLSFDTWLQGLLGTMLLVAELYALMMLFLSYFQTIAPLHRAPLPLP- 142 Query: 67 PAPQLKDNPSISIIIPCFNEEKNVEE-TIHAALAQRYE--NIEVIAVNDGSTDKTRAILD 123 P + P++ I +P +NEE ++ T+ +L + + V ++DG + A Sbjct: 143 --PNPDEWPTVDIFVPTYNEELSIVRLTVLGSLGIDWPPEKVRVHILDDGRRPEFAAFAA 200 Query: 124 RMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYN 183 A + KA L +Y++ D D + R + M+ + Sbjct: 201 ECGANY----IARPTNEHAKAGNLNYAIGHTDGDYILIFDCDHVPTRAFLQLTMGWMVED 256 Query: 184 PRVGAVTGNPRIRTRSTLVGKIQVG-----EYSSIIGLIKRTQRIYGNVFTVSGVIAAFR 238 P++ + + + G E + G+++ + F G A R Sbjct: 257 PKIALMQTPHHFYSPDPFQRNLSAGYRTPPEGNLFYGVVQDGNDFWDATF-FCGSCAILR 315 Query: 239 RSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQ 298 R+A+ ++G ++ +TED + K+Q W+ Y L L E L QR+RWA+ Sbjct: 316 RTAIEQIGGFATQTVTEDAHTALKMQRLGWSTAYLRIPLAGGLATERLILHIGQRVRWAR 375 Query: 299 GGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTH 358 G ++F + R ++ + + ++A I + LA + + Sbjct: 376 GMLQIFRIDNPLFGRGLSWGQRLCYLSAMTSFLFAVP---RVIFLSSPLAFLFFGQNIIA 432 Query: 359 IAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFT 418 + + ++ ++ S + ++ + L T + + Sbjct: 433 ASPLALLAYAIP-----HMFHAVGTASKINKGWRYSFWSEVYETTMALFLVRVTIVTLLS 487 >UniRef50_Q9CHP7 Glycosyl transferase n=3 Tax=Lactococcus lactis RepID=Q9CHP7_LACLA Length = 354 Score = 275 bits (705), Expect = 2e-72, Method: Composition-based stats. Identities = 114/354 (32%), Positives = 188/354 (53%), Gaps = 10/354 (2%) Query: 24 FHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPC 83 ++ +V ++P +S +W G W + + K+ ISI+I Sbjct: 1 MLISTMVDVYVVYYPIILSFIWASGAFL------SRWKERDKKAIDETKEFEKISIVISA 54 Query: 84 FNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPH---LRVIHLAQN 140 +NEE+ +EE + + Y ++E+ V+D S+D+T L + L ++ +N Sbjct: 55 YNEEETIEEVLLSLRNLNYPSLEIFIVDDKSSDRTLEKLYEVKKSFDDWETLTILEQKEN 114 Query: 141 QGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRST 200 +GKA AL S+Y++ ID D+ L +DA +Y++ + + +GAVTG P +R R+T Sbjct: 115 KGKATALNVALQQVNSKYMLVIDADSYLSKDALSYLLAELTSSSDIGAVTGRPIVRNRTT 174 Query: 201 LVGKIQVGEYSSIIGLIKRTQRIY-GNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDI 259 L+GK+Q EY S+I IKR Q ++ G + TVSGVI +R AL E+G ++ +++TEDID+ Sbjct: 175 LLGKLQTLEYLSVIDAIKRAQSLFLGAIMTVSGVIVMYRVEALKEIGGFNTEVMTEDIDV 234 Query: 260 SWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRM 319 +W+L NQW + Y P+AL +IL+PE LK L KQR RWA GG EV + N++ ++++ F+ Sbjct: 235 TWRLHRNQWKVKYVPKALSYILVPENLKSLLKQRRRWAVGGVEVLISNLSWVFKRGEFKH 294 Query: 320 WPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLC 373 L E + IW++ L+ Y +Q+ I Sbjct: 295 RFLLIEMICSHIWSWFFLIESYQYFLQMLLNQEFKVSGQIIVIFVLISFFVFFL 348 >UniRef50_UPI0001AEFEC9 N-glycosyltransferase PgaC n=1 Tax=Streptomyces roseosporus NRRL 11379 RepID=UPI0001AEFEC9 Length = 343 Score = 275 bits (704), Expect = 3e-72, Method: Composition-based stats. Identities = 101/299 (33%), Positives = 160/299 (53%), Gaps = 6/299 (2%) Query: 24 FHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPC 83 ++++ F+ F+P W+VGGV F + ER P +++++P Sbjct: 1 MSPIDVLLVFLAFYPVVTGAAWMVGGVMFRLTEER------PRAVDPPGGLPGVTVLVPA 54 Query: 84 FNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGK 143 +NEEK + + A Y +E++ +NDGSTD T A A + V+ N+GK Sbjct: 55 YNEEKVIAGCVAALRLVDYPELEILILNDGSTDATVAAARAAAEGDARITVVDDGVNRGK 114 Query: 144 AIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVG 203 A L G A+ + ++ D D L DA ++V ML +PR A+ G PR+ R +++ Sbjct: 115 ADRLNEGMRQARHDLVLVTDADTHLHSDAVTHLVRRMLRSPRYAAIAGCPRVTNRGSVIA 174 Query: 204 KIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKL 263 +Q+ E +S+IGLI+RT + G V TV+GV+ FRR A+ VG + M TEDI+++W+L Sbjct: 175 TMQMLETASLIGLIRRTHALAGRVGTVAGVLGLFRRDAVLAVGGYDPKMATEDIELTWRL 234 Query: 264 QLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPL 322 + P AL + +PETL G+W QR RWA+G EV + + R ++ MWP+ Sbjct: 235 LQAGHLTGFAPNALVGMQVPETLSGIWAQRTRWARGQGEVLRTHARTVARPKHAVMWPI 293 >UniRef50_D1CH68 Glycosyl transferase family 2 n=2 Tax=Bacteria RepID=D1CH68_THET1 Length = 470 Score = 274 bits (702), Expect = 4e-72, Method: Composition-based stats. Identities = 92/471 (19%), Positives = 172/471 (36%), Gaps = 55/471 (11%) Query: 17 IPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPS 76 + + F ++ + +++++ V Y R G + P Sbjct: 1 MTIAEFIFAVNAFVLCYFLALNGTYLVLYLISFVEIVDYSRREIFSGLSE-LFTSAYAPP 59 Query: 77 ISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDR------------ 124 +S+I+P +NEE + +++ + LA Y +V+ VNDGS D T +L R Sbjct: 60 VSLIVPAYNEEATIADSVRSFLALHYPQHQVVVVNDGSRDATLEVLKREFDLWESDQPIR 119 Query: 125 ------------MAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDA 172 +++ P+L VI +N GKA +L G AA + CID D +L+ DA Sbjct: 120 LRLRTAPIRAVYASSRFPNLLVID-KENGGKADSLNAGICAASYPLVCCIDADIILEEDA 178 Query: 173 AAYIVEPMLYNPRVGAVTGNPRIRT--------------RSTLVGKIQVGEYSSIIGLIK 218 + PM+ + +V AV G R+ + QV EY + Sbjct: 179 LLRVARPMIESSKVVAVGGIVRVANGCKIEAGRVTEVRAPREWLPSFQVVEYLRAFLAAR 238 Query: 219 RTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQL------NQWTIFY 272 + +SG FRR + VG + + ED+++ ++ + I + Sbjct: 239 TAWSRLNCLLIISGAFGMFRREDVIAVGGYETSTVGEDMEVVTRMHRLLRERHRDYHIAF 298 Query: 273 EPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTT-- 330 P + W +P TL+ L +QR RW +G + L++ L + L Y Sbjct: 299 VPDPVAWTEVPATLRVLRRQRDRWHRGLIDTLLRHRQMLLNPTYGLVGMLAMPYYFLFEM 358 Query: 331 IWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRY--E 388 + L+G++ + L +N G L + ++ RY Sbjct: 359 LGPVVELLGYMALVLGLVLGVVNAPFALAFFGAAVG-LGVLMSTAAVLLEEFRFRRYAGW 417 Query: 389 HNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRGIL 439 +L + + + L+ + + ++ W + +R Sbjct: 418 RDLGKLILCGVLENFGYRQLTTWWRAWAIVSYL----RKDTSWGAMERRGF 464 >UniRef50_P58932 Cellulose synthase catalytic subunit [UDP-forming] n=3 Tax=Proteobacteria RepID=BCSA_XANAC Length = 729 Score = 273 bits (700), Expect = 6e-72, Method: Composition-based stats. Identities = 78/435 (17%), Positives = 148/435 (34%), Gaps = 20/435 (4%) Query: 1 MINRIVSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHW 60 M+ ++ + + + + + ++ + + +V G + ++ Sbjct: 82 MMGMSLAVSCRYIWWRMTQTMGVGSAVDFILGLGLLGAELYAFVILVLGYFQVLWPLNRK 141 Query: 61 PWGENAPAPQLKDNPSISIIIPCFNEE-KNVEETIHAALAQRYE--NIEVIAVNDGSTDK 117 P A PS+ + IP +NE V T+ AA + I + ++DG D+ Sbjct: 142 PVPLPADQRL---WPSVDVFIPTYNEPLSVVRTTVLAASVIDWPAGKITIHLLDDGRRDE 198 Query: 118 TRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIV 177 RA + V KA + +Y+ D D + R + Sbjct: 199 FRAFCAEVGINY----VTRTNNAHAKAGNINAALKKCSGDYVAIFDCDHIPTRSFLQVAM 254 Query: 178 EPMLYNPRVGAVTGNPRIRTRSTLVGKI-----QVGEYSSIIGLIKRTQRIYGNVFTVSG 232 L++ ++ V + + E GL++ + F G Sbjct: 255 GWFLHDTKLALVQMPHYFFSPDPFERNLDTHGKVPNEGELFYGLLQDGNDQWNATF-FCG 313 Query: 233 VIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQ 292 A +R+AL EVG + + +TED + KLQ + Y L E+L G Q Sbjct: 314 SCAVIKRTALEEVGGVAVETVTEDAHTALKLQRRGYRTAYLAVPQAAGLATESLSGHVAQ 373 Query: 293 RLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPL 352 R+RWA+G A++ + L R Y + F + I LA + Sbjct: 374 RIRWARGMAQIARIDNPLLGRGLKLSQR---LCYLNAMLHFFYGVPRIIYLTAPLAYLFF 430 Query: 353 NIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLAT 412 + +A L + +L +++R+ H L + ++ IF +A Sbjct: 431 GAHVIQASALMILAYALPHILQANLT-NLRVQSRFRHLLWNEVYETTLAWYIFRPTLVAL 489 Query: 413 TLVSFTRVMLMPKKQ 427 + + PK Sbjct: 490 LNPKLGKFNVTPKGG 504 >UniRef50_C7TL81 Glycosyl transferase, group 2 n=3 Tax=Bacilli RepID=C7TL81_LACRL Length = 417 Score = 271 bits (694), Expect = 4e-71, Method: Composition-based stats. Identities = 95/424 (22%), Positives = 175/424 (41%), Gaps = 20/424 (4%) Query: 24 FHSGELMMRFVF--FWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIII 81 + + +M F W M + +V YF + P P P + +S+++ Sbjct: 1 MSALDWIMLFAIGAIWLILMVNVILVVAGYFEYMKMTQQPEPSLPPTPPM-----VSVMV 55 Query: 82 PCFNEEKNVEETIHAALAQRYE--NIEVIAVNDGSTDKTRAILDRMAAQIP--HLRVIHL 137 P NE + +T+ + L Y E+I +ND S+D + A+L + + P L VI+ Sbjct: 56 PAHNEGVVIVKTVESLLRFDYPQDRYEIIVINDNSSDNSAALLRDLQHRYPDRQLHVINT 115 Query: 138 A---QNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPR 194 +GK+ AL G A+ L D D + A +V ++ + +GAV G R Sbjct: 116 DAVTGGKGKSNALNIGLTKAQGSVLAIYDADNTPEFGALRILVSELMADDGLGAVIGKFR 175 Query: 195 IRTR-STLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMI 253 R + +T + + E S + + ++ + T+ G RRS + ++G W + Sbjct: 176 TRNKQATWLTRFINIETLSFQWMAQAGRQHLFGLCTIPGTNYVIRRSLIDKIGGWDVKAL 235 Query: 254 TEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWR 313 ED +IS+ + +N I ++P+A+ W P+TL + QR RW +G V LKN L++ Sbjct: 236 AEDTEISFHVYMNGARIKFQPKAVTWEQEPQTLDVWFHQRTRWVKGNIYVILKNSALLFQ 295 Query: 314 KENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLC 373 K + + T LV + AV + + + +L L Sbjct: 296 KRGRPIRFDLIYFLSIYFLLMTSLV--LSDAVFILSTAGWAHVGLKGFSTGLWLLAILLF 353 Query: 374 LLQFIVSLMIENRYEHNLTSSLFWIIWFPV-IFWMLSLATTLVSFTRVMLMPKKQRARWV 432 ++ +++ E + +++ W+ +V++ R L Q+A W Sbjct: 354 IVSTFITISTEKGEMTVGNVGIIGLMYITYSQLWLAVALYGMVAYIREQLF--HQQAHWY 411 Query: 433 SPDR 436 R Sbjct: 412 KTKR 415 >UniRef50_Q5N0N9 UDP-glucose-beta-D-glucan glucosyltransferase n=3 Tax=Chroococcales RepID=Q5N0N9_SYNP6 Length = 737 Score = 271 bits (693), Expect = 4e-71, Method: Composition-based stats. Identities = 80/441 (18%), Positives = 153/441 (34%), Gaps = 32/441 (7%) Query: 4 RIVSFFILCLVLCIPLCVAYFHSGEL------MMRFVFFWPFFMSIMWIVGGVYFWVYRE 57 R+++ + ++ +F S L + + F+ ++++ + ++ Sbjct: 87 RLITSIGIAVLGIRYELWRFFGSLNLDDPLNGTLSVLLFFAELLTVVNTIAFFLHTIFSY 146 Query: 58 RHWPWGE--NAPAPQLKDNPSISIIIPCFNEEKNV-EETIHAALAQRYENIEVIAVNDGS 114 P + + PS+ II+P +NE ++ ++ A A Y N +V ++D Sbjct: 147 DRSPEADILSQAVISQAYQPSVDIILPTYNEGVDILRRSVIACQAMDYSNKKVFLLDDTR 206 Query: 115 TDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAA 174 + RA+ + + KA + E LV D D + ++ Sbjct: 207 RPEVRALAEELGCGYFD----RPDNKHAKAGNINHALPYLSGELLVVFDADFVPSKNFLI 262 Query: 175 YIVEPMLYNPRVGAVTGNPRIRTRSTL-----VGKIQVGEYSSIIGLIKRTQRIYGNVFT 229 + +P+ G + + + I E S I+ ++ + +V Sbjct: 263 RTIG-FFQDPKTGLLQTPQHFFNEDPISVNLGLEGILNNEQSLFFRFIQPSRDYFNSVVC 321 Query: 230 VSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGL 289 G RRSAL E+G + ITED S KLQ + + + Y AL L PET+ Sbjct: 322 -CGTCFVIRRSALDEIGGIPTESITEDYFTSIKLQTHGYRVKYLNEALAAGLAPETISAY 380 Query: 290 WKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAG 349 QRLRW QG + + L W L+ ++ F + + V LA Sbjct: 381 VNQRLRWGQGTLQSLFTDSNFL--TVPGLNWQERLSQSLSIVYWFLSIPRVVFLVVPLAF 438 Query: 350 VPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLS 409 + + L S + E R S ++ + Sbjct: 439 LIFGLAPLRATVNEILYFFLPYYIANIMAFSWLTEGR-RSAFWSDVYET---------II 488 Query: 410 LATTLVSFTRVMLMPKKQRAR 430 ++ R ++ PK + + Sbjct: 489 CVPMAITILRTLVSPKGKPFK 509 >UniRef50_B1YJV4 Glycosyl transferase family 2 n=4 Tax=Bacillales RepID=B1YJV4_EXIS2 Length = 482 Score = 271 bits (693), Expect = 4e-71, Method: Composition-based stats. Identities = 85/463 (18%), Positives = 172/463 (37%), Gaps = 56/463 (12%) Query: 25 HSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENA---PAPQLKDNPSISIII 81 + ++ V + + I + + + + + P + +S+++ Sbjct: 9 NIINVLSWTVLIYMILVIIFYSTLYLISAQMVRKQRFRRKESSYLPLMRSLYTRPVSVLV 68 Query: 82 PCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAIL-----DRMAAQIPHL---- 132 P +NEE + ++ A LA Y EVI +NDGS+D T L + P + Sbjct: 69 PAYNEEYTIVTSVQALLALDYPEHEVIVINDGSSDGTMDELFKTYELEKELRFPRILIGS 128 Query: 133 --------------RVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVE 178 ++ +N GKA AL G + Y +D D++L+RDA +++ Sbjct: 129 QPVKHVYRSRIDARLIVIDKENGGKADALNAGIDYSSYPYFCSLDADSILERDALLKVMK 188 Query: 179 PM-LYNPRVGAVTGNPRIRTRSTL--------------VGKIQVGEYSSIIGLIKRTQRI 223 P+ + V A G+ + T+ + +Q+ EY + Sbjct: 189 PIVESDEDVIATCGSIFLVNGCTIEGGEIVEIRTPKQRIVLMQMIEYFRAFLFGRLGLSR 248 Query: 224 YGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLN------QWTIFYEPRAL 277 + + VSG F+++ + + G + + ++ ED+++ +L I Y P + Sbjct: 249 FNMLLIVSGAFGLFQKAIVVQAGGYKEGLVGEDMELVVRLHRYMKENKIDKRIEYVPSPV 308 Query: 278 CWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMW--PLFFEYCLTTIWAFT 335 CW P+T++ L QR RW +G AE + + + + + + + + Sbjct: 309 CWTQAPDTMEVLRSQRSRWHRGLAETLWTHKRMIGNPRYGFVGSVSMLYFFLIELLGPIV 368 Query: 336 CLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYE--HNLTS 393 L+G+ I + L LN E + T L ++ +R++ L Sbjct: 369 ELIGYFIIPLNLLVGTLNFEFGVVLLTVALLY-GSLLSAGAVLLEEWTMHRFKNVKELIR 427 Query: 394 SLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 W + + + + L + M R +W +R Sbjct: 428 LYLWSLTESFWYRPILVFFKLGGLIQFM----TGRGQWGKMER 466 >UniRef50_A8MM07 Glycosyl transferase family 2 n=1 Tax=Alkaliphilus oremlandii OhILAs RepID=A8MM07_ALKOO Length = 482 Score = 271 bits (693), Expect = 5e-71, Method: Composition-based stats. Identities = 90/465 (19%), Positives = 170/465 (36%), Gaps = 59/465 (12%) Query: 22 AYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIII 81 +++ +V + IV + Y + + P IS+++ Sbjct: 9 IVISFNHMVLYYVLAINSIYFMQLIVSAFSLYDYIRKMYYSDFKKYTASSNMIP-ISVLV 67 Query: 82 PCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKT-RAILDRMAAQ------------ 128 P ++EE+ + + I + L+ Y EVI +NDGS D T R I++ + Sbjct: 68 PAYDEEETIVDNIKSLLSLNYPAFEVIVINDGSNDDTLRKIVEAYDLKEINQPVRHLLKT 127 Query: 129 -----------IPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIV 177 IP+L ++ +N GKA AL G +K ID D++L+ D+ ++ Sbjct: 128 KKIKGIYANLDIPNLVLVD-KENGGKADALNVGINVSKYPVFTSIDADSILESDSLVRVI 186 Query: 178 EPMLYNPRVGAVTGNPRIRT--------------RSTLVGKIQVGEYSSIIGLIKRTQRI 223 P + + AV G RI + + Q+ EY + Sbjct: 187 MPFIEDNLTVAVGGIVRIINGSEIKNGKVEAIGLPKSKLAMFQIVEYLRAFLTGRLAWSK 246 Query: 224 YGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLN------QWTIFYEPRAL 277 + VSG AF++ + + G ++ D I ED+++ K+ ++ + + P + Sbjct: 247 LRCLLIVSGAFGAFKKDVVVQAGGYTPDTIGEDMELVVKIHKMMRKNKVKYKVNFIPDPV 306 Query: 278 CWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCL 337 CW PE +K L QR RW G + ++ L + Y + + Sbjct: 307 CWTQAPENIKDLRGQRRRWQIGLMDSLFRHKDLLLNPRYGVVGMYGVPY-----FWIFEM 361 Query: 338 VGFIIYAVQLAGVPLNIELTHIAATHTAGILL------CTLCLLQFIVSLMIENRYEHNL 391 +G II + +PL+ + + L+ L L ++ ++Y Sbjct: 362 LGPIIETLGYIIIPLSYIFGLLNTRYFVLFLITSILYGIILSLGAILLEEYTFSKY--PS 419 Query: 392 TSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 L + + ++ T + ++ KK R W R Sbjct: 420 MKQLLRLSLYGILENFGYRQMTTLFRIDAIIRFKKFRGDWGKIKR 464 >UniRef50_B7IR36 Glycosyltransferase n=11 Tax=Bacillus cereus group RepID=B7IR36_BACC2 Length = 741 Score = 270 bits (692), Expect = 5e-71, Method: Composition-based stats. Identities = 78/430 (18%), Positives = 141/430 (32%), Gaps = 19/430 (4%) Query: 5 IVSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGE 64 I+ F I + I + L+ + + F + + + E Sbjct: 34 IILFLICNAIYLIWRTFYSLPTINLISIIAGIILLITEWAGYLQSIVFSIVSWKPYKRKE 93 Query: 65 NAPAPQLKDNPSISIIIPCFNEE-KNVEETIHAALAQRYEN--IEVIAVNDGSTDKTRAI 121 P + P++ I I +NE ++ T+ Y + + +DG R Sbjct: 94 -VPLSTFEKLPTVDIFIATYNEPIDLLKRTVAGCTLITYPKELLNIYICDDGR----RES 148 Query: 122 LDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPML 181 + ++AA + KA L +K + +V +D D + + + Sbjct: 149 VKQLAADFSVHHITRTENKHAKAGNLNHAMLHSKGDIIVTMDADMIPRANFLERTIGYFS 208 Query: 182 YNPRVGAVTGNPRIRTRSTL-----VGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAA 236 N V V E + ++ + + V G A Sbjct: 209 KN-NVVFVQAPQVFYNADPFQYNLFFEDNIANEQDFFMRQLEEGKDRFNATMYV-GSNAL 266 Query: 237 FRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRW 296 FRR+AL E+G ++ +ITED+ LQ N+W + L L PET L KQR RW Sbjct: 267 FRRTALEEIGGFATGVITEDMATGMLLQANKWETIFVNETLAVGLSPETFSDLLKQRDRW 326 Query: 297 AQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIEL 356 +G +V K + +F L+ + + F + I L + +I Sbjct: 327 CRGNIQVVKKWNPFTIKGLSFMQRLLYAD---GIHYWFFGVYKMIFLLAPLLFLVFDIYS 383 Query: 357 THIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVS 416 I H Q I + N+ S ++ + P + + + L Sbjct: 384 LEINFMHLFMFWAPAFLTSQLIFKA-VSNKKRTTTWSHVYEVAMAPYMGFAILSEIFLRK 442 Query: 417 FTRVMLMPKK 426 + + K Sbjct: 443 KFQFHVTRKG 452 >UniRef50_A7M791 Cellulose synthase subunit AB n=1 Tax=Asaia bogorensis RepID=A7M791_9PROT Length = 1563 Score = 270 bits (692), Expect = 5e-71, Method: Composition-based stats. Identities = 80/437 (18%), Positives = 152/437 (34%), Gaps = 25/437 (5%) Query: 3 NRIVSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPW 62 + + L + + + + ++ V + + ++ + ++ Sbjct: 80 GLSIIVSLRYLSWRLTETLNFSSWTQSILGSVLVLAEIYAFVMLILSYFQTLHP---LGR 136 Query: 63 GENAPAPQLKDNPSISIIIPCFNEE-KNVEETIHAALAQRYENIEV--IAVNDGSTDKTR 119 + PS+ I IP +NEE + V +T+ AAL + + ++DG ++ R Sbjct: 137 EVTPMPKDVTTWPSVDIYIPTYNEELRIVRQTVLAALGIDWPRDRINVYLLDDGHREEFR 196 Query: 120 AILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEP 179 + A + N KA L K E + D D + R V P Sbjct: 197 EFAEACGAGY----IARSEHNHAKAGNLNHAMQLTKGELIAIFDCDHIASRTFLQETVGP 252 Query: 180 MLYNPRVGAVTGNPRIRTRSTLVGKI-----QVGEYSSIIGLIKRTQRIYGNVFTVSGVI 234 L P + + + E + GLI+ + F G Sbjct: 253 FLLKPHLALIQTPHHFYSPDPFQRNFSHGAVVPPEGNLFYGLIQDGNDFWNATF-FCGSC 311 Query: 235 AAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRL 294 A RR+AL E+G + + +TED+ + KLQ W+ Y + L L E L QR+ Sbjct: 312 AVIRRTALEEIGGVAVETVTEDMHTALKLQRRGWSTAYLAKPLAAGLSTERLILHVGQRM 371 Query: 295 RWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNI 354 RWA+G ++ + NF +F + ++A I +A + N Sbjct: 372 RWARGMLQIMRLDNPLTGPGLNFAQRLCYFAAIASFLFAIP---RLIFLTSPMAYLLFNQ 428 Query: 355 ELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTL 414 L + + + I S++ R N SL+ ++ + L+ + + Sbjct: 429 TLIDASP-----FAIAIYAMPHLIHSVLTSARTNRNWRYSLWSEVYETTLAPFLARVSVM 483 Query: 415 -VSFTRVMLMPKKQRAR 430 + F R ++ Sbjct: 484 TLIFPRRGKFNVTEKGG 500 >UniRef50_A4CLA3 Glycosyl transferase, group 2 family protein n=2 Tax=Flavobacteriales RepID=A4CLA3_9FLAO Length = 467 Score = 270 bits (692), Expect = 5e-71, Method: Composition-based stats. Identities = 99/468 (21%), Positives = 176/468 (37%), Gaps = 61/468 (13%) Query: 29 LMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDN------PSISIIIP 82 ++ + F +++ + + ++ + + PSI+II P Sbjct: 1 MLDYINIVFLLFTVVLFTLFTIMGYLSTRNSIHYRHKNSIGDISRTMASPLAPSITIIAP 60 Query: 83 CFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRA---------------------- 120 +NE + E I + L+ RY N EV+ VNDGS D T Sbjct: 61 AYNEGLTIVENIRSLLSLRYVNYEVMVVNDGSKDDTLEKMVAAYDLERVPMDIDPEWPCK 120 Query: 121 ----ILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYI 176 + L VI +N GK+ AL TG ++S+Y+ CID D LL DA ++ Sbjct: 121 PIRGVYKSRQRSFEKLTVID-KENGGKSDALNTGMRLSESKYIGCIDVDCLLLPDALLHV 179 Query: 177 VEPMLYN--PRVGAVTGNPRIRT--------------RSTLVGKIQVGEYSSIIGLIKRT 220 V+ + RV AV G R+ + + Q+ EY+ L + Sbjct: 180 VKAFYEHTSKRVIAVGGVIRVANSCKIDGGTLEEIRLPKNWLARFQLLEYTRSFLLGRMA 239 Query: 221 QRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLN------QWTIFYEP 274 ++ +SG F R E G + + ED++I ++++ +T+ Y P Sbjct: 240 WGRIDSLLIISGAFGFFDRQIALEAGGYDTGTVGEDMEIVFRMRRLMHDNDTPYTVAYIP 299 Query: 275 RALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAF 334 LCW +PE L QR RW++G E ++ + + R+ L + Y L W Sbjct: 300 DPLCWTEVPEDLGIFINQRDRWSRGNLETLYRHRDMFFNPKYGRLGMLSYPYWLFYEWLA 359 Query: 335 TCL-VGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRY--EHNL 391 L + A + + +A T T + + N+Y ++ Sbjct: 360 PILEFTGFFSILAFAYLGILNWEFFVAITATVYAFAAMFSFYAVLWDVWSYNQYTRTRDI 419 Query: 392 TSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRGIL 439 + LF + PV+F + + +++ + + + W S R Sbjct: 420 LTLLFCALVEPVVFHPVVVWSSIRGNFKKLFKI---NSGWGSQVRKGF 464 >UniRef50_C1DW58 Cellulose synthase catalytic subunit (UDP-forming) n=1 Tax=Sulfurihydrogenibium azorense Az-Fu1 RepID=C1DW58_SULAA Length = 764 Score = 270 bits (690), Expect = 9e-71, Method: Composition-based stats. Identities = 82/456 (17%), Positives = 154/456 (33%), Gaps = 33/456 (7%) Query: 2 INRIVSFFILCLVLCIPLCVAYFHSGEL---MMRFVFFWPFFMSIMWIVGGVYFWVYRER 58 + ++V F ++ F++ L ++ F +++ ++ + + + Sbjct: 60 LVKLVIIFFGIIISLRYFWWRTFNTLNLDEGLLNATFSILLYLAELYSITILLLGSFISL 119 Query: 59 HWPWGENAPAPQLKDNPSISIIIPCFNEEKNVE-ETIHAALAQRYE--NIEVIAVNDGST 115 E P + +D P++ + IP +NE + T AA + Y +V ++DG T Sbjct: 120 RLLEREPIPVEKREDYPTVDVFIPTYNEPPEIVITTALAAASMDYPQEKFKVYILDDGGT 179 Query: 116 ---------DKTRA------ILDRMAAQI-PHLRVIHLAQN-QGKAIALKTGAAAAKSEY 158 D+ R L + ++ + +N KA + + Sbjct: 180 AQKLNDPDPDRRRENYERAMQLKKFVEDYNGNIYYLTRERNLHAKAGNINEALKKTNGDL 239 Query: 159 LVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTL-----VGKIQVGEYSSI 213 ++ +D D + D V P++ V + + KI E Sbjct: 240 VLILDCDHVPAEDFLKRTVGFFNRYPKLFLVQTPHSFYNPDPIEKNLGIFKIVPSEADMF 299 Query: 214 IGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYE 273 I++ + F G A RR L EVG ITED + + +L + Y Sbjct: 300 YKHIQKGLDFWSASF-FCGSAAILRRKYLNEVGGIQGTTITEDAETALELHSRGYDSAYY 358 Query: 274 PRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWA 333 R + + L PET QR RWAQG ++FL L + W Y I+ Sbjct: 359 ARPMIYGLQPETFSAFIVQRTRWAQGMIQIFLLKNPLL---KKGLRWYQKLSYLNANIFW 415 Query: 334 FTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTS 393 F FI + L ++ + + + +VS + ++ N S Sbjct: 416 FFGFARFIFLIAPVLYPILGAKIYNASLEDAMVYAIPHFIF-SVMVSYYLYSKVRWNFFS 474 Query: 394 SLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRA 429 ++ I I + + PK + Sbjct: 475 EIYEAIQSIFILPAIVSVILSPRKPTFKVTPKGENL 510 >UniRef50_A6DGB3 Glycosyl transferase, family 2 n=3 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DGB3_9BACT Length = 396 Score = 270 bits (690), Expect = 9e-71, Method: Composition-based stats. Identities = 97/376 (25%), Positives = 158/376 (42%), Gaps = 15/376 (3%) Query: 70 QLKDNPSISIIIPCFNEEKNVEETIHAALAQRYE--NIEVIAVNDGSTDKTRAILDRMAA 127 L+ NP +S+++P NEE +E + Y+ +EVI +ND S+D T+ ++D Sbjct: 20 DLEYNPKVSVLVPAHNEEAVIEGCLDCMNKLEYKTGQLEVIILNDRSSDGTKELIDNFLC 79 Query: 128 QIP--HLRVIHLA--QNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYN 183 + P H+R H GKA A+K A KSE +V D D L D ++ P + Sbjct: 80 KNPQSHIRAHHRPMSSEPGKAAAMKEIIATLKSEIIVIFDADYLPQADLIKRLISP-FKD 138 Query: 184 PRVGAVTGNPRIRTRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSAL 242 P+VGA G + ++ K+ E S + + + + + G R SAL Sbjct: 139 PQVGATMGRVVTYNANANIMTKLIDLERRSGYAIDQNVRNYFDLLPQFGGTTGGIRLSAL 198 Query: 243 AEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAE 302 +VG W +TED D+++KL LN + I Y A C+ PET + +KQ RWA G + Sbjct: 199 EDVGGWDTRTLTEDTDLTYKLYLNGYKIKYLNAAACYEETPETWQARYKQVRRWAYGHND 258 Query: 303 VFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAAT 362 +K++ ++ + L TI+A + + L G NI + A+ Sbjct: 259 CMIKHLIPTLMHKDKNLLRKLDALLLLTIYAAPAALLVLSIVAFLFG---NISVNMSASL 315 Query: 363 HTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPV--IFWMLSLATTLVSFTRV 420 T +L C + + + +I + V ML+ L+ Sbjct: 316 ITLFLLFCGFGNFSPFFQMFAACIKDRQPHC-IRYIPYIFVSSTISMLASTHALLLLPIE 374 Query: 421 MLMPKKQRARWVSPDR 436 L K + W R Sbjct: 375 KLGLK-KSLSWDKTLR 389 >UniRef50_A0LZM3 Transmembrane family-2 glycosyl transferase-possibly involved in biofilm formation n=2 Tax=Flavobacteriaceae RepID=A0LZM3_GRAFK Length = 473 Score = 269 bits (689), Expect = 1e-70, Method: Composition-based stats. Identities = 97/470 (20%), Positives = 176/470 (37%), Gaps = 60/470 (12%) Query: 24 FHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPA---PQLKDNPSISII 80 + E+ F+ + + + ++ ++ + Q PSISI+ Sbjct: 4 YQISEIASWLFLFYGIIICSGYTIAAIFSFLEIRDYKRKNNFQAEFALLQSSSLPSISIL 63 Query: 81 IPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAIL------------------ 122 P FNEE NV E + + L Y + E++ +NDGS D T L Sbjct: 64 APAFNEEANVVENVRSLLTLNYPSYEIVIINDGSKDGTLEALLTTFDMQKQEFLYHPFIP 123 Query: 123 --------DRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAA 174 +L VI +N GKA AL TG +E + CID D +L+ DA Sbjct: 124 SKEVKGVYKSKNEAYKNLTVID-KENGGKADALNTGINVCDNEIICCIDVDCILENDALL 182 Query: 175 YIVEPMLYN-PRVGAVTGNPRIRT--------------RSTLVGKIQVGEYSSIIGLIKR 219 +V+P L N +V A G R+ T + + Q+ EY + + Sbjct: 183 KLVKPFLNNTKKVIASGGIIRVANSCVIEDGRILDIKLPDTFLARAQIIEYFRAFLMGRM 242 Query: 220 TQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLN------QWTIFYE 273 + +SG F + + +VG ++ D + ED+++ +++ +T+ + Sbjct: 243 AWSRLDGLLLISGAFGMFDKDTVIKVGGYNPDTVGEDMELLVRMRKKMRENNIPYTVGFV 302 Query: 274 PRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWA 333 P LCW +P+ K L +QR RW +G AE + L+ + + L Y W Sbjct: 303 PNPLCWTEVPQNWKVLHRQRNRWTRGTAETLWLHKKMLFNPKYGVLGILSTPYWFFFEWL 362 Query: 334 FTCL----VGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEH 389 + ++ + + +G+ + L ++ IL L F + + Sbjct: 363 APLIEFLGFLYLTFLIIFSGLNWWVYLIFFMVVYSFAILFSITSLF-FEEFSFQQYKKPK 421 Query: 390 NLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRGIL 439 + L I+ P+I+ + + + + W + R L Sbjct: 422 YIIRLLGTILLEPLIYH----PFIMWASVKGNWDLLNGKKSWGAMSRTGL 467 >UniRef50_A1VIY0 Glycosyl transferase, family 2 n=1 Tax=Polaromonas naphthalenivorans CJ2 RepID=A1VIY0_POLNA Length = 476 Score = 269 bits (688), Expect = 2e-70, Method: Composition-based stats. Identities = 92/418 (22%), Positives = 167/418 (39%), Gaps = 17/418 (4%) Query: 30 MMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLK--DNPSISIIIPCFNEE 87 + + F + ++ ++ V +++ + P + P I++ I NEE Sbjct: 58 LAATLASVLFLIVVIMMLYAVRHFIFTINRLLGEQRHPYLDIAIARWPMITVFIAAHNEE 117 Query: 88 KNVEETIHAALAQRYE--NIEVIAVNDGSTDKTRAILDRMAAQIP-HLRVIHLA-QNQGK 143 K + I A L Y +++I VND STD+T AI+DR A+ P + H GK Sbjct: 118 KVIAGCIEALLNTDYPADQLKIIPVNDRSTDRTGAIIDRYVARFPSRISPFHRTLGKAGK 177 Query: 144 AIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRS-TLV 202 + ALK A A+ + + D D + R + P +P VGAV G L+ Sbjct: 178 SAALKDALAFAEGDIAIIFDADYVPGRGLLKQLAAPFF-DPEVGAVMGRVVPVNSGANLL 236 Query: 203 GKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWK 262 ++ E S + ++ + + G + R SA+ VG W DD + ED DI+++ Sbjct: 237 TRMLDLERSGGYQVDQQARMNMNLLPQYGGTVGGVRLSAVEAVGGWHDDTLAEDTDITYR 296 Query: 263 LQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPL 322 L N W Y R+ C+ +PE + KQ RWA+G +V + + + Sbjct: 297 LMFNGWKTVYSNRSECYEEVPEEWRVRIKQVKRWAKGHNQVMARYWWQFACSPYLTLAQR 356 Query: 323 FFEYCLTTIWAFTCLV----GFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFI 378 L ++ L+ G ++ L + +L I A G L + + Sbjct: 357 IDGLLLLFVFLIPLLMLIGWGLVLGLYFLNAGSMLSQLIPIFALMVYGTLGNFAAFFEIV 416 Query: 379 VSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 ++++++ + L + + F + A + + + K+ W R Sbjct: 417 IAVLLDGHRKR-----LRLLPLNILGFLVSLFAISGAVVSLALDGLFKREMVWEKTIR 469 >UniRef50_Q1LL41 Glycosyl transferase, family 2 n=18 Tax=Burkholderiaceae RepID=Q1LL41_RALME Length = 857 Score = 268 bits (687), Expect = 2e-70, Method: Composition-based stats. Identities = 70/439 (15%), Positives = 151/439 (34%), Gaps = 25/439 (5%) Query: 1 MINRIVSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHW 60 MI + + + + + + E ++ ++ + + + + + Sbjct: 213 MIAVSMLMAVRYIWWRSTQTLQFATTTEAVIGYLLYAAECYTWLVLALSYLQTAWPLHRK 272 Query: 61 PWGENAPAPQLKDNPSISIIIPCFNEE-KNVEETIHAALAQRYE--NIEVIAVNDGSTDK 117 P A P++ + IP +NE V+ T++AA + + + V ++D Sbjct: 273 PKPLPA---DSSLWPTVDVFIPTYNESLAVVQPTVYAARSMDWPPDKLRVYILDDSHRPA 329 Query: 118 TRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIV 177 R + + KA + +Y+ D D + R + Sbjct: 330 MREFAEAAGVGY----ITRDNNRHAKAGNINQALPRTSGDYIAIFDCDHIPTRSFLQMTM 385 Query: 178 EPMLYNPRVGAVTGNPRIRTRSTL-----VGKIQVGEYSSIIGLIKRTQRIYGNVFTVSG 232 L +P+ V + + E S GLI+ ++ F G Sbjct: 386 GEFLDDPKCALVQTPHHFFSPDPFERNFDTFRRVPNEGSLFYGLIQDGNDLWNATF-FCG 444 Query: 233 VIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQ 292 A +R+ L E+G + + +TED + KL + Y L E+L Q Sbjct: 445 SCAVIKRAPLLEIGGIAVETVTEDSHTALKLHRRGYNSAYLRTVQAAGLATESLSSHIGQ 504 Query: 293 RLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPL 352 R+RWA+G A++F + L + Y + F + I + +A + Sbjct: 505 RIRWARGMAQIFRLDNPLLGK---GLTLFQRLCYSNAMLHFFYGIPRLIFLTMPIAYLYF 561 Query: 353 NIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLAT 412 + + + T+ +++ L +++ + +R + S + ++ V+ W + L T Sbjct: 562 GLHVIN-----TSALMIMAYVLPYLLIANVTNSRLQGRYRHSFWAEVYESVLAWYIVLPT 616 Query: 413 TLVSF-TRVMLMPKKQRAR 430 T+ R + Sbjct: 617 TVAFINPRAGKFNVTAKGG 635 >UniRef50_Q9UY40 Glycosyl transferase, family 2 n=2 Tax=Thermococcaceae RepID=Q9UY40_PYRAB Length = 447 Score = 268 bits (687), Expect = 2e-70, Method: Composition-based stats. Identities = 96/447 (21%), Positives = 173/447 (38%), Gaps = 14/447 (3%) Query: 1 MINRIVSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHW 60 M I L L + I + +A + + V F M + + + Sbjct: 2 MRASIKFQSALYLYILIIIGLALVIPPKYALEIVLIILFLMVSSGSIFYTLLMASLGKRY 61 Query: 61 PWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRA 120 P+ E + + P + ++IP NEE+ + +T+ + L Q Y N++VI +ND STD+TR Sbjct: 62 PYDETGFNLEFLE-PLVYVLIPAHNEERVIYKTVRSVLGQDYRNMKVILINDNSTDRTRD 120 Query: 121 ILDRMAAQIPH-LRVIHLAQNQG--KAIALKTGAA-----AAKSEYLVCIDGDALLDRDA 172 I++ + + P + +I + +G K AL Y+ +D D L+ +A Sbjct: 121 IMEEINRKYPRKVVIIDVPPERGRSKPRALNYALEIIEKYMTHPNYVFILDADYLIPPNA 180 Query: 173 AAYIVEPMLYNPR-VGAVTGNPRIRT-RSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTV 230 +V M P+ V + GN R R R V K E + Sbjct: 181 LKTLVSIMESAPQYVIGIQGNVRPRNFRKNFVTKFITLERLVGFNVAIEGDMKLNENGKY 240 Query: 231 SGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLW 290 G +A R L +G + +D +TED D+ + + + +Y + W ETL+ Sbjct: 241 GGTVALLRFPLLIRLGKFREDSVTEDTDLWARAMIAGYRFWYYHGVIGWEEAVETLRDYI 300 Query: 291 KQRLRWAQGGAEVFLKNMTRLWRK-ENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAG 349 KQR RWAQG +V + + + R N + Y ++ + + I+ + + Sbjct: 301 KQRSRWAQGHLQVMIDHYWPVMRSCSNIIESFIEHFYMMSYLVPVFWFLSVILNSYLIIT 360 Query: 350 VPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLS 409 + + I L S +E + HN ++ +P+ F + Sbjct: 361 GAPPLSFARPKLFLSVSIFTFLLFWFSVAYSNWVEKK-RHNYYVPWSFVALYPLYFMVFV 419 Query: 410 LATTLVSFTRVMLMPKKQRARWVSPDR 436 +A + + R ++ R W R Sbjct: 420 IAGVIYTM-RGLIRLLVGRLHWEKTKR 445 >UniRef50_Q9KAQ6 BH2231 protein n=4 Tax=Bacillus RepID=Q9KAQ6_BACHD Length = 482 Score = 268 bits (687), Expect = 2e-70, Method: Composition-based stats. Identities = 89/476 (18%), Positives = 171/476 (35%), Gaps = 59/476 (12%) Query: 18 PLCVAYFHSGELMMRFVFFWPFFMSIMW---IVGGVYFWVYRERHWPWGENAPAPQLKDN 74 + + L F+ + + +++ ++ V ++ +L Sbjct: 4 TITNIVSYLPYLFAWFLLGYMGIVVLVYTGMLLLSVVQLKREQKLNKQEVYDDYLELTYT 63 Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRV 134 +SI++P +NEE + ET+ + L+ +Y E++ VNDGSTD+T ++ + +V Sbjct: 64 KPVSILVPAYNEETGIIETVRSLLSLKYPQTEIVVVNDGSTDQTLEVIIEHFQMVKVGKV 123 Query: 135 IHLA-----------------------QNQGKAIALKTGAAAAKSEYLVCIDGDALLDRD 171 I N GKA AL G +K Y IDGD++L+ D Sbjct: 124 IRKQIETEPIKGVYQSTIFPHLLLVDKSNGGKADALNAGLNVSKYPYFCSIDGDSILETD 183 Query: 172 AAAYIVEPML----YNPRVGAVTGNPRIRTRS--------------TLVGKIQVGEYSSI 213 A +++P++ V A GN RI S + +QV EY Sbjct: 184 ALLKVMKPIVTSRDDEDEVIASGGNVRIANGSDIQMGSVLSVQLAKNPLVVMQVIEYLRA 243 Query: 214 IGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLN------Q 267 + + + V +SG + F + + E G +S + ED+++ +L + Sbjct: 244 FLMGRIGLSRHNMVLIISGAFSVFAKKWVMEAGGYSKKTVGEDMELVVRLHRLVKEKRLK 303 Query: 268 WTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYC 327 I + P +CW P T + L +QR RW +G E + + + + Y Sbjct: 304 KRITFVPDPVCWTEAPATFRVLQRQRSRWHRGLMESLWLHRGMTFNPKYGLVGTASIPYF 363 Query: 328 LT--TIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIEN 385 L+G++ L +E + I+ Sbjct: 364 WIVEFFGPVVELMGYLYIVFAFFFGGLYVEFALALFLLFVLY-GTVFSMTAVILEGWSLK 422 Query: 386 RYEH--NLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRGIL 439 RY +++ + + ++ + + L++ + + K W R L Sbjct: 423 RYPKVSDMSRLMIFSLFEALWYRPLTVLWRFGAIIEALFRSK----AWGEMTRKGL 474 >UniRef50_A8USY3 Cellulose synthase (UDP-forming) n=1 Tax=Hydrogenivirga sp. 128-5-R1-1 RepID=A8USY3_9AQUI Length = 717 Score = 268 bits (687), Expect = 2e-70, Method: Composition-based stats. Identities = 76/424 (17%), Positives = 145/424 (34%), Gaps = 27/424 (6%) Query: 13 LVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMW------IVGGVYFWVYRERHWPWGENA 66 ++ + + V+ ++ + P F ++++ ++ + F+ R Sbjct: 9 FLILLSVSVSTYYIVWRLGTLNPDAPLFSAVLYGAEVYGLIASIMFFYMVWRLPNRKTKD 68 Query: 67 PAPQLKDNPSISIIIPCFNE-EKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRM 125 P L + I IP NE + + +TI A++ RY + ++DGS + R + + + Sbjct: 69 PPKDL----KVDIFIPTLNESPELLRKTIQGAISVRYPH-RTYVLDDGSRREVRELCEEL 123 Query: 126 AAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPR 185 + + GKA L +++ +D D + D + + Sbjct: 124 GCRY----IPRYTNEHGKAGNLNNALGKTDGDFIAVLDADHVPQPDFLDRTLG-YFADEE 178 Query: 186 VGAVTGNPRIRTRSTLVGKIQ----VGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSA 241 V V + ++ E S +I R + + F G A RR A Sbjct: 179 VAFVQTPQDFYNVDSYQHRLVRGKLWNEQSLFFRVIMRGKDRTNSAF-FCGSCAVVRRKA 237 Query: 242 LAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGA 301 L ++G ++ +TED+ S +L W Y P L + + P L QR RW +G Sbjct: 238 LEDIGGFATGTVTEDLHTSIRLHAKGWKSVYYPHVLAYGVAPADLAPFKNQRGRWGEGAM 297 Query: 302 EVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAA 361 +VF K L R +F T + F I Y + + ++ Sbjct: 298 QVFAKENPLLTRGLTLPQRINYFASMSTYLDGFQ---KAIFYTAPVVVLLTGKLPISVSL 354 Query: 362 THTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVM 421 + L + L + L +F + +L + R M Sbjct: 355 KEFLPVFLPHIFLSVWAFEEASRGYGRMALLEQYNMARFFTFMKSLLGFIR--LKRPRFM 412 Query: 422 LMPK 425 + K Sbjct: 413 VTEK 416 >UniRef50_Q7NHH7 Glr2559 protein n=1 Tax=Gloeobacter violaceus RepID=Q7NHH7_GLOVI Length = 426 Score = 268 bits (685), Expect = 3e-70, Method: Composition-based stats. Identities = 94/422 (22%), Positives = 158/422 (37%), Gaps = 23/422 (5%) Query: 20 CVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISI 79 + + L + + P S++W+ G+ +Y R P P +S+ Sbjct: 12 TALVWSALVLFVLALHLIPTGASVVWLAAGLT-GLYAVRVLFALSPRPKSDPAYRPRVSV 70 Query: 80 IIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHL-- 137 ++ NE+ + + Y + EV +DGSTD+T L L ++ Sbjct: 71 LVAAKNEQAVAAQLVAMLRRLDYPDFEVWIADDGSTDRTYQRLLEAGRGWQALHLVRRIP 130 Query: 138 -AQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIR 196 GK+ L A + LV D DA ++ D + V P+ VGA+ R+ Sbjct: 131 ERSRPGKSAVLNELRERATGDILVVFDADARVEPDFLSRTV-PLFAVSSVGALQVRKRVH 189 Query: 197 TRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITE 255 + Q E ++ + G + G R +AL VG W++ +T+ Sbjct: 190 NADFNFWTRGQSAEMLL-DAFYQQQRAAIGGTAELRGNGQLVRAAALEAVGGWNEATVTD 248 Query: 256 DIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKE 315 D+D++ +L L W I + T LW+QR RWA+GG + +L RL+ Sbjct: 249 DLDLTLRLHLGGWQIAFASDPCVDEEGVTTWSALWRQRSRWAEGGFQRYLDYAPRLFGGA 308 Query: 316 NFRMWPL-FFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCL 374 + +C V +++A Q PL L +A T +C Sbjct: 309 MGTTKTVDQLIFCTIQYLMPVAAVLDLLFAFQRGAAPLLTPLVLVATVFT----VCGFYF 364 Query: 375 LQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSP 434 Q + + L +++++ WFPVI LV R L PK + WV Sbjct: 365 GQRERGVQVGRAMLETLAGTIYFLHWFPVI---------LVKLARTALEPK--KLVWVKT 413 Query: 435 DR 436 Sbjct: 414 AH 415 >UniRef50_B8E0Z1 Glycosyl transferase family 2 n=1 Tax=Dictyoglomus turgidum DSM 6724 RepID=B8E0Z1_DICTD Length = 399 Score = 268 bits (685), Expect = 4e-70, Method: Composition-based stats. Identities = 74/379 (19%), Positives = 162/379 (42%), Gaps = 23/379 (6%) Query: 71 LKDNPSISIIIPCFNEEKNVEETIHAALAQRYEN--IEVIAVNDGSTDKTRAILDRMAAQ 128 D P +S+++P NEEK E ++A L Y IE+I ++D STD+TR IL+ +++ Sbjct: 20 DSDLPYVSVLVPMHNEEKVAENVLNALLNTDYPKDRIEIIPIDDNSTDRTREILEDYSSK 79 Query: 129 IPH-LRVIHLAQ--NQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPR 185 PH ++ ++ +GK AL A+ E ++ D D + + + L +P Sbjct: 80 YPHLIKPLYRGSYLPRGKPSALNDALKVAEGEIIIVFDADYIPPKGIIRDLAVSFL-DPE 138 Query: 186 VGAVTGNPRIRT-RSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAE 244 VG V G L+ ++ E + ++ + + G + FR+ + + Sbjct: 139 VGVVMGRVVPLNISKNLLTRLFDLERIGGYQVDQQARYNLKLIPQFGGTVGGFRKELILK 198 Query: 245 VGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVF 304 +G ++ ++ ED +++ K +N + Y RA C+ PET + KQ RW++G +V Sbjct: 199 LGGFNPKILAEDTELTIKAYINGVKVCYTNRAECYEEAPETWEVRAKQIRRWSRGHNQVM 258 Query: 305 LKNMTRLWRKENFRMW------PLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTH 358 + + L + + L Y ++ ++ + +++ + + + Sbjct: 259 FRYLLPLIKSPYLSLREKVDGVFLLCVYLISPLFLIGLVDSIVLFFLGEMQILGSSFFIF 318 Query: 359 IAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFT 418 ++A +T G + + +++ +++ P+ + ++ + + F Sbjct: 319 LSAYNTFGNFAPFY---EIGLGALLDGATYR------IFLLPLPLFNFFFNMWYSSLGFF 369 Query: 419 RVMLMPKKQRA-RWVSPDR 436 +L +R W +R Sbjct: 370 DSVLDLITRRDPVWHKTER 388 >UniRef50_B2JFS7 Cellulose synthase (UDP-forming) n=8 Tax=Burkholderiaceae RepID=B2JFS7_BURP8 Length = 716 Score = 267 bits (684), Expect = 5e-70, Method: Composition-based stats. Identities = 78/406 (19%), Positives = 139/406 (34%), Gaps = 21/406 (5%) Query: 6 VSFFILCLVLCIPLCVAY-FHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGE 64 VS ++ VL L + F + + + + V G+ + + R Sbjct: 13 VSVLVVAYVLAGFLYFGWRFTTINWQLPLFSIALYVAELFGFVTGLLYALMTYRLAVRVP 72 Query: 65 NAPAPQLKDNPSISIIIPCFNEE-KNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILD 123 P S+ + +P +NE + V T+ AA Y + ++DG+ + R + Sbjct: 73 EQPLSNA----SVDVFVPTYNESVELVRRTLLAARNIEYPHKT-WLLDDGNRIEMRELAA 127 Query: 124 RMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYN 183 + + KA L + E++ D D +D ++ + Sbjct: 128 ELGCDY----LARGENAHAKAGNLNHALKHSSGEFIAIFDADHAARKDFLTKVIG-FFAD 182 Query: 184 PRVGAVTGNPRIRTRSTL----VGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRR 239 PRVG V + +G+ E S +I+R + + F G A RR Sbjct: 183 PRVGFVQTPQDFYNVDSFSHRQIGRRVWHEQSLFFKVIQRGKDSWNAAF-FCGSCAVLRR 241 Query: 240 SALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQG 299 SAL +G ++ + +TEDI S L + Y P +L + L P +L +QR+RW G Sbjct: 242 SALERIGGFATETVTEDIHTSILLHKAGYKSVYYPESLAFGLAPHSLDTYLQQRIRWGVG 301 Query: 300 GAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHI 359 G +V + + Y + + F F+ + + Sbjct: 302 GMQVLRRERVL---WSSGLTLAQRLNYMASMLIYFEGWQKFVFFVTPPIVFFFGLLPIVA 358 Query: 360 AATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIF 405 G LL + L + Y L S + + F Sbjct: 359 PMGQFLG-WFFLYYLLSMAMYLELARGYGALLLSEQYGMSRFFAFM 403 >UniRef50_A1S0Z5 Glycosyl transferase, family 2 n=1 Tax=Thermofilum pendens Hrk 5 RepID=A1S0Z5_THEPD Length = 420 Score = 267 bits (684), Expect = 5e-70, Method: Composition-based stats. Identities = 86/397 (21%), Positives = 156/397 (39%), Gaps = 20/397 (5%) Query: 45 WIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYE- 103 + Y + P +++I+P +E + VE ++A L+ Y Sbjct: 37 ILAWSAYHAPIIVAGLLAPRGSGDDPGNGLPRVTVIVPSKDEGRRVERCLNAILSSDYPL 96 Query: 104 -NIEVIAVNDGSTDK-TRAILDRMAAQIPH-LRVIHLAQNQGKAIALKTGAAAAKSEYLV 160 +EVI V D S+D I+ R + P +R+I + +GK AL A E + Sbjct: 97 EKLEVIVV-DASSDGYVEEIVRRAGERYPGAVRLIREEEPRGKPAALNRALREATGEVVA 155 Query: 161 CIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRT-RSTLVGKIQVGEYSSIIGLIKR 219 D D++ +RDA V+ + P V AV G + R +++ ++ E + + R Sbjct: 156 VFDADSVPERDAIRRAVKHLEE-PGVAAVQGKTLVLNERESVLARVASKEEKAWFHALIR 214 Query: 220 TQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCW 279 + G ++G +RSAL EVG W +D + ED+++S L + + Y + W Sbjct: 215 GRERLGLFVPLTGSCQFVKRSALEEVGGWREDALAEDLELSMDLLARGYRVKYANDVVSW 274 Query: 280 ILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVG 339 P +L+ L QR RW +G E F +++ + + + + + Sbjct: 275 QEAPTSLRSLAVQRNRWYRGYMEAFARHLRLALAGRRG------LDAAILSAGPYLMALS 328 Query: 340 FIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWII 399 + A LA L + H + L + L V+L + R +L W+ Sbjct: 329 LLAVAAWLASTALP-HVNHFSTPAALVAALNAVSLFSVSVALALSER--PVSAKNLAWVP 385 Query: 400 WFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 +W A L + ++L + W + Sbjct: 386 VI-YAYWFTLSAVALHALAEIILRRPR---VWRRTPK 418 >UniRef50_C5CVQ8 Glycosyl transferase family 2 n=4 Tax=Proteobacteria RepID=C5CVQ8_VARPS Length = 445 Score = 267 bits (684), Expect = 5e-70, Method: Composition-based stats. Identities = 97/429 (22%), Positives = 172/429 (40%), Gaps = 9/429 (2%) Query: 5 IVSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGE 64 +++ + + A +L R F+ V + + Sbjct: 22 VIALIGASAWMWFSIWAAGPWLADLTDRVGMALAIFLVYGIAVVPGFMNAFLAISLLLDR 81 Query: 65 NAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYE-NIEVIAVNDGSTDKTRAILD 123 P L P ISI+I +NEE +++ET+ + Q Y ++VI +NDGS D T ++ Sbjct: 82 RPAHPALAVYPPISILIAAYNEEASIDETLVSIDRQNYPGELQVIVINDGSADGTSLVVQ 141 Query: 124 RMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYN 183 R + P L + L +N GKA AL G + ++ +D D+ L R A IVE + Sbjct: 142 RARERYPWLTFLDLEKNGGKARALNIGFKEVAHDLVITVDADSFLYRGALTSIVERYRAD 201 Query: 184 -PRVGAVTGNPRIRTRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSA 241 P AV G +R + + Q +Y I IKR Q ++ G + + R+A Sbjct: 202 PPHTRAVAGKILVRNSRLNWITRCQEWDYFHGIAAIKRVQSLFQGTLVAQGAFSIYDRAA 261 Query: 242 LAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGA 301 L EVG W + + EDI ++W L + + + A + +P T+K L KQR RWA+G A Sbjct: 262 LVEVGGWPE-CVGEDIVLTWALLKAGYRVGHCEDACLFTNVPTTVKQLVKQRQRWARGMA 320 Query: 302 EVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAA 361 E F+K+ L + F ++ + L I + + Sbjct: 321 EAFIKHPGILVKPR-----LSTFFIYWNLLFPWLDLACTIGFIPGIILAFFGHYWIVGIM 375 Query: 362 THTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVM 421 T L LL F + + + + ++ + +++ ++ +++ + + Sbjct: 376 TLALIPATFALSLLMFRIERRMFRKTGLVVRKNVQGFFVYVLLYSLILQPASVLGYLDEL 435 Query: 422 LMPKKQRAR 430 K Sbjct: 436 FRKPKAWGT 444 >UniRef50_Q59167 Cyclic di-GMP-binding domain n=5 Tax=cellular organisms RepID=ACSA2_ACEXY Length = 1596 Score = 267 bits (682), Expect = 7e-70, Method: Composition-based stats. Identities = 75/415 (18%), Positives = 140/415 (33%), Gaps = 24/415 (5%) Query: 11 LCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQ 70 +V + + + + + +++ + + + P A Sbjct: 86 RYMVWRLTTTLELHSPLQAALSLLLVAAELYALLTLCLSYFQMSWPLDRKPLPLPA---D 142 Query: 71 LKDNPSISIIIPCFNEEKNVEE-TIHAALAQRYE--NIEVIAVNDGSTDKTRAILDRMAA 127 D P + + +P +NEE ++ T+ ALA + + V ++DG A A Sbjct: 143 TTDWPVVDVYVPSYNEELSLVRSTVLGALAIDWPADKLNVYILDDGRRKSFHAFAMEAGA 202 Query: 128 QIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVG 187 +I N KA L + EY+V D D + R + M+ +P++ Sbjct: 203 GY----IIRDQNNHAKAGNLNHALRVTEGEYVVIFDCDHIPTRGFLKKTIGWMMADPKLA 258 Query: 188 AVTGNPRIRTRSTLVGKIQVG-----EYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSAL 242 + + + G E + GL++ + F G AA RRSA+ Sbjct: 259 LLQTPHHFYSPDPFQRNLATGQNVPPEGNMFYGLVQDGNDFWDATF-FCGSCAAIRRSAV 317 Query: 243 AEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAE 302 +G ++ + +TED + K+Q W Y + L L E L QR+RWA+G + Sbjct: 318 LGIGGFATETVTEDAHTALKMQREGWHTAYLRQPLAAGLSTERLMLHIGQRVRWARGMLQ 377 Query: 303 VFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAAT 362 + + L + W Y + + A LA + L + + Sbjct: 378 IMRLDNPLL---GSGLRWQQRLCYLSAMSHFLFAIPRLVFLASPLAFLFLGQNIIAASPF 434 Query: 363 HTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSF 417 + +S R E S + I+ + L T + Sbjct: 435 AILVYAFPHVFHSIGTLS-----RVEGRWRYSFWSEIYETTLALFLVRVTIMTLL 484 >UniRef50_B0C9M4 Glycosyl transferase, family 2 n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0C9M4_ACAM1 Length = 492 Score = 267 bits (682), Expect = 8e-70, Method: Composition-based stats. Identities = 86/428 (20%), Positives = 163/428 (38%), Gaps = 17/428 (3%) Query: 24 FHSGELMMRFVFFWPFFMSIMWIVGGVY----FWVYRERHWPWGENAPAPQLKDNPSISI 79 + + F+ + + + + W Y D P ++I Sbjct: 3 ISFIDGISGFIPELYLGILTLIAIYSFHKISIIWRYYLHRRREISPLHKFSDADLPQVTI 62 Query: 80 IIPCFNEEKNVEETIHAALAQRYE--NIEVIAVNDGSTDKTRAILD----RMAAQIPHLR 133 +P FNE V+ + A A Y +++ ++D STD+TR I + + ++ Sbjct: 63 QLPLFNEMYVVDRLLEAVAALEYPVDKLQIQVLDD-STDETREICRAKVRELKQRHLNID 121 Query: 134 VIHLAQNQG-KAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGN 192 IH +G KA AL G +A + ++ D D + D +V NP+VG V Sbjct: 122 YIHRCDRKGYKAGALAYGLQSATGDLVMIFDADFVPSPDTLINMV-HYFANPKVGMVQAR 180 Query: 193 PRIRTRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDD 251 R +++ +IQ + ++ G F +G +R + + G W Sbjct: 181 WGHINRHYSILTEIQALMLDGHFVTEQTSRNRSGCFFNFNGTAGIWRIQTIEDAGGWQHT 240 Query: 252 MITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRL 311 +TED+D+S++ QL W Y P + +P + Q+ RWA+G ++V K + + Sbjct: 241 TVTEDLDLSYRAQLKGWECIYLPNIVVPAELPMEMNSFKSQQFRWAKGASQVAKKLLLPI 300 Query: 312 WRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCT 371 L + LT + + L+ ++ ++ + A H L+ T Sbjct: 301 LTSNAPGHVKLEAFFHLTNNFNYLLLLVLLLLSLPYQLFLAETGWRYGLAIHLPLFLITT 360 Query: 372 LCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARW 431 L LL F E R +++ + + LS+ +L + + + R Sbjct: 361 LSLLAFYSVAQEEQRGQNSPWKLTSNLFLLMSVGIGLSINQSL-AVYDGLFRVGRDFVR- 418 Query: 432 VSPDRGIL 439 +P G+ Sbjct: 419 -TPKHGVT 425 >UniRef50_A4CJ64 Glycosyltransferase n=15 Tax=Bacteroidetes RepID=A4CJ64_9FLAO Length = 494 Score = 267 bits (682), Expect = 9e-70, Method: Composition-based stats. Identities = 86/429 (20%), Positives = 169/429 (39%), Gaps = 16/429 (3%) Query: 17 IPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPS 76 + + +AYF + + + + + + + ++ +R K+ P Sbjct: 1 MGITIAYFIIAIYSLALLLIFFYS---LSQLNLLLNYLGFKRRNKEAPKFNLLDPKEIPY 57 Query: 77 ISIIIPCFNEEKNVEETIHAALAQRYEN--IEVIAVNDGSTDK----TRAILDRMAAQIP 130 ++I +P +NEE VE + Y +E+ ++D STD T A+++ + Q Sbjct: 58 VTIQLPIYNEEYVVERLLENIARIEYPKSKLEIQVLDD-STDDSVEQTAAMIEELQKQGL 116 Query: 131 HLRVIHLAQNQG-KAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAV 189 ++ I +G KA ALK G AK +++ D D L D D V + +G V Sbjct: 117 DIQHIRRENREGFKAGALKEGLKIAKGDFIAIFDADFLPDADWLKKTV-IYFKDEEIGVV 175 Query: 190 TGNPRIRTRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYW 248 R + + KIQ + L + + G+ +G +R+ + + G W Sbjct: 176 QTRWGHINRDYSTLTKIQAFALDAHFTLEQVGRNSKGHFINFNGTAGIWRKECILDAGNW 235 Query: 249 SDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNM 308 D +TED+D+S++ QL W Y +P + Q+ RW +GGAE F K + Sbjct: 236 EGDTLTEDLDLSYRAQLKNWKFKYLEDVETPAELPVVISAARSQQFRWNKGGAENFRKTV 295 Query: 309 TRLWRKENFRMWPLFFEYC-LTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGI 367 + + +N F L F C+ + ++ + + T+ Sbjct: 296 WNVVKAKNIPFKTKFHGVMHLLNSSMFLCVFIVALLSIPMLYIKNTFGHLDWIFEVTSFF 355 Query: 368 LLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQ 427 ++ T+ L F+ + + + I F+ ++L +L + V+ + Sbjct: 356 IVSTIIL--FVCYWFTYKSIQGSSFDNFVDYIKLFFTFFSVALGFSLHNTVAVLEGHMGK 413 Query: 428 RARWVSPDR 436 R+ +V + Sbjct: 414 RSEFVRTPK 422 >UniRef50_C7RTK3 Polysaccharide deacetylase n=2 Tax=Bacteria RepID=C7RTK3_9PROT Length = 336 Score = 266 bits (681), Expect = 1e-69, Method: Composition-based stats. Identities = 76/338 (22%), Positives = 141/338 (41%), Gaps = 17/338 (5%) Query: 90 VEETIHAALAQRYEN-IEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALK 148 + +TI + LA +++ V+DGS+D T + P +R+ +N GK A Sbjct: 2 IVQTIASLLACDPPRDFQIVVVDDGSSDATYRVACDAFGSDPRVRIF-TKENGGKPAAPN 60 Query: 149 TGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVG 208 G A ++E +V +D D + RD + +PRVGAV GN ++ R L+ + Q Sbjct: 61 FGLAQTQAEVIVALDADTIFARDTVRKLAR-HFEDPRVGAVAGNAKVGNRINLLTRWQAL 119 Query: 209 EYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQW 268 EY + L +R + + V G + A+RR + G + + ED D++ ++ + Sbjct: 120 EYITSQNLDRRAFDVLNCITVVPGAVGAWRRELVERAGGFDHQTLAEDADLTLAIRKLGY 179 Query: 269 TIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCL 328 + YE A+ P++++G QR RW G +V K+ L+R ++ + + + Sbjct: 180 NVVYEDEAVALTEAPDSVRGFIGQRYRWMYGTMQVARKHRDVLFRPQSGALGFVALPNVI 239 Query: 329 TTIWAFTCLVGFIIYAVQLAGVPLN----------IELTHIAATHTAGILLCTLCLLQFI 378 F LV I+ + + + ++ L T+ L I Sbjct: 240 VFQILFP-LVSPIMDLLLIGSLASAAVNHWQHPEEFSAGNLWRVLFYYALFVTVDYLAAI 298 Query: 379 VSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVS 416 ++ +E + L W+ W + L + S Sbjct: 299 LAFALERKES---WWLLIWLFWQRFFYRQLMYYVAIKS 333 >UniRef50_C6WRK1 Glycosyl transferase family 2 n=1 Tax=Actinosynnema mirum DSM 43827 RepID=C6WRK1_ACTMD Length = 637 Score = 266 bits (680), Expect = 1e-69, Method: Composition-based stats. Identities = 95/355 (26%), Positives = 154/355 (43%), Gaps = 19/355 (5%) Query: 83 CFNEEKNVEETIHAALAQRYE-NIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQ 141 +NE + + + +A Y +EV+ V+DGS D T ++ + +P +RV+ +N Sbjct: 283 AYNEAVTITAAVRSLVASDYPAPVEVVVVDDGSADGTADVVRAL--DLPGVRVVTR-ENG 339 Query: 142 GKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTL 201 GKA AL TG A A + LV +DGD + + A +V P+ P VGAV+GN ++ R L Sbjct: 340 GKAEALNTGVALAGHDALVLVDGDTIFEPGTLAALVAPLGA-PGVGAVSGNVKVANRRGL 398 Query: 202 VGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISW 261 +G+ Q EY+S L ++ + + T+ G + AFRR AL EVG S D + ED D++ Sbjct: 399 LGRWQHLEYTSGSNLDRQILNAWRCLPTIPGAVGAFRREALVEVGGVSSDTLAEDTDVTM 458 Query: 262 KLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWP 321 + W + YEP A W +P L+ L++QR RW+ G + K+ + + Sbjct: 459 AITRAGWRVVYEPGARAWTEVPAGLRSLYRQRYRWSYGTFQAMWKHRVAVREQGRMGRLG 518 Query: 322 LFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSL 381 L + + V + V + A + L +L Sbjct: 519 LLYLLLFHLLLPLLAPVMDV-------YVLYGALVADAPAALPIWLAFLALQTASAGYAL 571 Query: 382 MIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 ++ L + FP+ T LV ++ RW S R Sbjct: 572 RLDGESLRPLWA-------FPLQQLAYRQLTYLVVVQSLVTAAHGVPLRWQSIQR 619 >UniRef50_B3DW74 Glycosyltransferase n=1 Tax=Methylacidiphilum infernorum V4 RepID=B3DW74_METI4 Length = 480 Score = 265 bits (678), Expect = 2e-69, Method: Composition-based stats. Identities = 79/417 (18%), Positives = 155/417 (37%), Gaps = 18/417 (4%) Query: 34 VFFWPFFMSIMWIVGGVYFWVYRERHW---PWGENAPAPQLKD-NPSISIIIPCFNEEKN 89 + FW ++++ + G+Y R W + AP L P ++I +P +NE+ Sbjct: 2 ISFWFLILALVLFIYGIYRMSLILRLWMGSHRDKKAPTDALFYTYPEVTIQLPIYNEKSV 61 Query: 90 VEETIHAALAQRYE--NIEVIAVNDGSTDKTRAILDRMAAQIP----HLRVIHLAQNQG- 142 VE +HA Y +E+ ++D STD+T AI+ + + + + +G Sbjct: 62 VERLLHAVCKIDYPKNKMEIQIIDD-STDETTAIISKWVCEYQKKGFDIYQLRRGTREGF 120 Query: 143 KAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTR-STL 201 KA L+ G +K E++ D D L + P + VG V R ++L Sbjct: 121 KAGGLQYGLERSKGEFIAIFDADFLPPPSFLKETL-PYFRSRDVGMVQARWGYLNRQASL 179 Query: 202 VGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISW 261 + + Q L + + Y F +G +R+ + + G W D +TED+D+S+ Sbjct: 180 LTRCQALFLDGHFLLEQPVRYKYNLFFNFNGTAGIWRKKCIIDAGGWEGDTLTEDLDLSY 239 Query: 262 KLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWP 321 + Q W Y P+ + +P + Q+ RWA+G + K++ L++ Sbjct: 240 RAQFKGWKFVYTPQMVVPSELPSPIVAFRTQQHRWAKGAIQTAKKHLFSLFKGSYSLGSK 299 Query: 322 LFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSL 381 + + L + +I + + + ++ L F V + Sbjct: 300 IEGLFHLLAHSIHPIVAVLVILNAISFFCSPLPQSFTLEVSGMLFSVISLFYLSYFAVVI 359 Query: 382 MIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTR----VMLMPKKQRARWVSP 434 + E L + + + + + + + PK P Sbjct: 360 FLSKNLEAGALLILPFSMAVALGMTFANTKSVIDGLFGKNNVFVRTPKNGFFNSDKP 416 >UniRef50_P37653 Cellulose synthase catalytic subunit [UDP-forming] n=143 Tax=Proteobacteria RepID=BCSA_ECOLI Length = 872 Score = 265 bits (678), Expect = 2e-69, Method: Composition-based stats. Identities = 72/425 (16%), Positives = 152/425 (35%), Gaps = 24/425 (5%) Query: 1 MINRIVSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHW 60 +I ++ + + + L+ + + + + +V G + V+ Sbjct: 202 LIVLSLTVSCRYIWWRYTSTLNWDDPVSLVCGLILLFAETYAWIVLVLGYFQVVWPLNRQ 261 Query: 61 PWGENAPAPQLKDNPSISIIIPCFNEE-KNVEETIHAALAQRYEN--IEVIAVNDGSTDK 117 P + PS+ I +P +NE+ V+ TI+A+L + + + ++DG ++ Sbjct: 262 PVPLPK---DMSLWPSVDIFVPTYNEDLNVVKNTIYASLGIDWPKDKLNIWILDDGGREE 318 Query: 118 TRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIV 177 R + + + KA + AK E++ D D + R + Sbjct: 319 FRQFAQNVGVKY----IARTTHEHAKAGNINNALKYAKGEFVSIFDCDHVPTRSFLQMTM 374 Query: 178 EPMLYNPRVGAVTGNPRIRTRSTL---VGKIQ--VGEYSSIIGLIKRTQRIYGNVFTVSG 232 L ++ + + +G+ + E + GL++ ++ F G Sbjct: 375 GWFLKEKQLAMMQTPHHFFSPDPFERNLGRFRKTPNEGTLFYGLVQDGNDMWDATF-FCG 433 Query: 233 VIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQ 292 A RR L E+G + + +TED S +L +T Y L E+L Q Sbjct: 434 SCAVIRRKPLDEIGGIAVETVTEDAHTSLRLHRRGYTSAYMRIPQAAGLATESLSAHIGQ 493 Query: 293 RLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPL 352 R+RWA+G ++F + + F Y + + I LA + L Sbjct: 494 RIRWARGMVQIFRLDNPLTGKGLKFAQR---LCYVNAMFHFLSGIPRLIFLTAPLAFLLL 550 Query: 353 NIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLAT 412 H + +++ L I + + ++ + S + I+ V+ W ++ T Sbjct: 551 -----HAYIIYAPALMIALFVLPHMIHASLTNSKIQGKYRHSFWSEIYETVLAWYIAPPT 605 Query: 413 TLVSF 417 + Sbjct: 606 LVALI 610 >UniRef50_Q3IER5 Cellulose synthase catalytic subunit [UDP-forming] n=6 Tax=Gammaproteobacteria RepID=Q3IER5_PSEHT Length = 884 Score = 265 bits (677), Expect = 3e-69, Method: Composition-based stats. Identities = 72/425 (16%), Positives = 148/425 (34%), Gaps = 24/425 (5%) Query: 1 MINRIVSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHW 60 MI V+ L + + L + + + ++ G + + Sbjct: 223 MITLSVTTSSRYLWWRYTETLNWDDPLALFLGLGLLLAETYAWLVLILGFFQTINPLERK 282 Query: 61 PWGENAPAPQLKDNPSISIIIPCFNEE-KNVEETIHAALAQRYE--NIEVIAVNDGSTDK 117 P P++ + IP +NE V+ T AA++ + + V ++DG + Sbjct: 283 PIAMPKNT---DLWPTVDVYIPTYNEPLSVVKPTTLAAMSIDWPADKLNVYILDDGKRPE 339 Query: 118 TRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIV 177 +++ + N KA + + EY+ D D + R + Sbjct: 340 FAQFAEQVGVGY----LTRPDNNHAKAGNMNSAMRYTDGEYIAIFDCDHVPARSFLQMTM 395 Query: 178 EPMLYNPRVGAVTGNPRIRTRSTLVGKI-----QVGEYSSIIGLIKRTQRIYGNVFTVSG 232 L + +V V + + E GLI+ ++ F G Sbjct: 396 GQFLKDSKVCLVQTPHHFFSADPFERNLNNHSQVPNENMLFYGLIQDGNDMWDATF-FCG 454 Query: 233 VIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQ 292 A +R AL ++G ++ + +TED + ++Q + Y L ++L Q Sbjct: 455 SCAVLKRQALDDIGGFAFETVTEDAHTALRMQRAGYKTAYINIPQAAGLATDSLSAHIGQ 514 Query: 293 RLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPL 352 R+RWA+G A++F + L + N P Y + + + + LA + Sbjct: 515 RIRWARGMAQIFRLDNPLLGKGLNL---PQRLCYINAMLHFLSGVPRIVFLTAPLALIYF 571 Query: 353 NIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLAT 412 N + + ++ TL ++ +R + S + ++ V+ W + T Sbjct: 572 NAYIIYAPFLAIFIYVVPTLIQVKAT-----NSRIQGKYRYSFWGEVYETVLAWYILKPT 626 Query: 413 TLVSF 417 T+ F Sbjct: 627 TVALF 631 >UniRef50_C1F830 Cellulose synthase I n=4 Tax=Bacteria RepID=C1F830_ACIC5 Length = 1495 Score = 265 bits (677), Expect = 3e-69, Method: Composition-based stats. Identities = 79/441 (17%), Positives = 145/441 (32%), Gaps = 26/441 (5%) Query: 1 MINRIVSFFILCLVLCIPLCVAY------FHSGELMMRFVFFWPFFMSIMWIVGGVYFWV 54 M+ + + F + + + + + + + G + + Sbjct: 74 MMMSLFATFRYAFWRIGTVVAFFQDPGSKYTPADAFFIVILVGAEAYAFSILFLGFFQTI 133 Query: 55 YRERHWPWGENAPAPQLKDNPSISIIIPCFNEE-KNVEETIHAALAQRYE--NIEVIAVN 111 + R P +D P + ++IP +NE V T AAL + + V ++ Sbjct: 134 WPLRRAPVPLP---DDPEDWPHVDLLIPTYNEPMSVVRYTALAALNIDWPADKLHVYILD 190 Query: 112 DGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRD 171 DG+ + R + + KA + A S Y+ D D + R Sbjct: 191 DGNRPEFRDFAIQAGLGY----MTRDNNAHAKAGNINQALARLDSPYVAIFDSDHVPTRS 246 Query: 172 AAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVG-----KIQVGEYSSIIGLIKRTQRIYGN 226 + L + ++G + + K E G+++ + Sbjct: 247 FLQVTMGWFLRDEQLGMLQTPHHFYSPDPFERNLGQYKTIPNEGELFYGIVQDGNDFWNA 306 Query: 227 VFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETL 286 F G A RR+AL E+G + + +TED S ++Q+N W Y A L E L Sbjct: 307 SF-FCGSCAVLRRTALDEIGGIAVETVTEDAHTSLRMQINGWNTAYINIAQAAGLATERL 365 Query: 287 KGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQ 346 G KQR+RWA+G ++ + F +F ++A L I Sbjct: 366 SGHVKQRIRWARGMVQILRIDNPLFAPNLTFAQRLCYFNAMTHFMYA---LPRLIFLTAP 422 Query: 347 LAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFW 406 L + L L L S I+ ++ H+ + ++ + P I Sbjct: 423 LIYLIFGFTNVPGYWVAILAYALPHLTLSNVTNS-RIQGQHRHSYWNEVYETVLAPYILL 481 Query: 407 MLSLATTLVSFTRVMLMPKKQ 427 LA + + K Sbjct: 482 PTLLALLNPRLGKFNVTAKGG 502 >UniRef50_B1V7Q3 Chitin synthase n=1 Tax=Clostridium perfringens D str. JGS1721 RepID=B1V7Q3_CLOPE Length = 466 Score = 265 bits (677), Expect = 3e-69, Method: Composition-based stats. Identities = 99/466 (21%), Positives = 184/466 (39%), Gaps = 59/466 (12%) Query: 20 CVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISI 79 V + + + F++ FF+S ++ + +Y + ++R + K+ ISI Sbjct: 5 IVDFINLFFMYYIFIYAVIFFISTVYSIIDLYEFNLKKR---FNSTINIYDKKNYTPISI 61 Query: 80 IIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIH--- 136 ++P +NEEK + + I + LA Y E+I VNDGS DKT +L + R I Sbjct: 62 LVPAYNEEKTILKCIDSLLALEYPEYEIIIVNDGSKDKTLEVLINRFSLKKIQRPIRKSI 121 Query: 137 --------------------LAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYI 176 +N GKA AL G A+ + +D D++L RD+ + I Sbjct: 122 NCKKIIDIYEGLEKVNITLINKENGGKADALNMGINASTYPLFLTLDADSVLQRDSLSNI 181 Query: 177 VEPMLYNPRVGAVTGNPRIRT--------------RSTLVGKIQVGEYSSIIGLIKRTQR 222 V P + N AV GN +I + Q+ EY + + Sbjct: 182 VVPFIENENTIAVGGNVKISNSVLLENGVVKKNRLPKKWLIIFQIIEYYRVFWTTRVWFN 241 Query: 223 IYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQ------LNQWTIFYEPRA 276 + +SG F++S++ VG ++ + I ED+++ KL N+++I Y P A Sbjct: 242 KFNGNLIISGAFGVFKKSSVINVGGYNSNSIGEDMELVMKLHSFNKKNENKYSIQYTPNA 301 Query: 277 LCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTT------ 330 +CW PE LK L QR RW G + ++ + + F Y Sbjct: 302 ICWSQAPEKLKDLKSQRKRWHMGLMQSLAEHKYIFLNIKYGVVGVFSFLYYFIYEMLSCL 361 Query: 331 IWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHN 390 I F + FI Y + I I +++ I + ++ ++ ++ + + + Sbjct: 362 IEVFGVVFLFISYFTGFINLKFFITFMCIYIFYSSLISISSIFSEEYFFNINLRIKDK-- 419 Query: 391 LTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 + + + + +V+ + + ++ W +R Sbjct: 420 -IKLILFSFLEAFGYRQMCSIFRIVAIFKY----RTKKNHWEKIER 460 >UniRef50_Q2IFD0 Glycosyl transferase, family 2 n=5 Tax=Bacteria RepID=Q2IFD0_ANADE Length = 479 Score = 265 bits (677), Expect = 3e-69, Method: Composition-based stats. Identities = 105/459 (22%), Positives = 178/459 (38%), Gaps = 49/459 (10%) Query: 23 YFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQ--LKDNPSISII 80 E++ + +++ ++ + R + P+ + P +SII Sbjct: 3 LLRLIEVVQWAFLAYFIALTVSYLALNLVSLRVIARDADARASLLLPRFFSRLEPPVSII 62 Query: 81 IPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAA------------- 127 +P FNEE+ + TI L Y +E++ VNDGSTD+T +L R Sbjct: 63 VPAFNEEETIAGTIRCLLQLEYPELEIVVVNDGSTDRTLEVLSREFELVPFPEAYRISVP 122 Query: 128 -----------QIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYI 176 + ++RV+ N GKA A+ G AA+ +DGD++L RD+ A + Sbjct: 123 STPVRRVYRSLRHRNVRVLD-KSNGGKADAMNAGINAARYPLFCAVDGDSVLQRDSVARV 181 Query: 177 VEPMLYNPRVGAVTGNPRIRT--------------RSTLVGKIQVGEYSSIIGLIKRTQR 222 V P L +P A G R+ + ++QV EY + Sbjct: 182 VRPFLEDPTTIACGGTIRVLNGCRVVDGFVEEIGMPRGWIARVQVLEYLRAFLFGRLGWA 241 Query: 223 IYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQL------NQWTIFYEPRA 276 + VSG FRR+A+ E G + D + ED+++ +L + I + P Sbjct: 242 PVNALPNVSGAFGMFRRTAVIEAGGYRTDTVGEDMELVLRLHRLYRLSGRPYRISFVPDP 301 Query: 277 LCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTC 336 +CW +P+TL L +QR+RW +G E N L+ + L L AF Sbjct: 302 ICWTEVPDTLAALRRQRVRWHRGLLEAIAGNRQLLFHRRGGAAAWLTVP-NLLFFEAFGP 360 Query: 337 LVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLF 396 +V Y V A L + A A L L + L ++ + + L Sbjct: 361 VVEVAGYVVIGAAWALGVISWSSFAAFFAAALGVATLLSANALLLEEKSFHVYQRPKQLA 420 Query: 397 WIIWFPVIFWM-LSLATTLVSFTRVMLMPKKQRARWVSP 434 +I ++ + L T + ++ ++ W SP Sbjct: 421 VLIGAVLVENLGLRQLTAFWRISGMLSWLFRRPIAWGSP 459 >UniRef50_A6GMN6 Cellulose synthase (UDP-forming) n=3 Tax=Burkholderiaceae RepID=A6GMN6_9BURK Length = 828 Score = 264 bits (675), Expect = 5e-69, Method: Composition-based stats. Identities = 77/425 (18%), Positives = 141/425 (33%), Gaps = 20/425 (4%) Query: 11 LCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQ 70 + + E+ + + + + ++ G + + P Sbjct: 187 RYVFWRFTSSLDLETGTEIFLGYTLVAAEAYTWIILIFGFIQTAWPLKRKPM---PLELN 243 Query: 71 LKDNPSISIIIPCFNEE-KNVEETIHAALAQRYE--NIEVIAVNDGSTDKTRAILDRMAA 127 L D P++ + IP +NE V T++AA + I V ++DG + Sbjct: 244 LNDWPTVDVFIPSYNEPLSVVRPTVYAAKGLDWPLEKITVYILDDGHRPAFEEFAKQAGV 303 Query: 128 QIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVG 187 + + + KA + +Y+ D D + R + +P+ Sbjct: 304 EY----ISRPDNSHAKAGNINYALKHTHGQYIAIFDCDHIPTRSFLQTSMGWFSRDPKCV 359 Query: 188 AVTGNPRIRTRSTLVG-----KIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSAL 242 V + + E S GLI+ + F G A R AL Sbjct: 360 LVQTPHHFFSADPFERNFNSFRQMPNEGSLFYGLIQDGNDYWNASF-FCGSCAVIERKAL 418 Query: 243 AEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAE 302 EVG + + +TED + KL + Y L ETL G QR+RWA+G A+ Sbjct: 419 LEVGGIAVETVTEDAHTALKLHSLGYNSVYLNNIQAAGLATETLAGHIGQRIRWARGMAQ 478 Query: 303 VFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAAT 362 +F + + N Y + F + I + +A + L + AA Sbjct: 479 IFRTDNPMFKKGLNL---FQRVCYSNAMLHFFYGIPRMIFLVMPMAYLFFEQHLINAAAV 535 Query: 363 HTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVML 422 L L + + S+ I+ +Y + S ++ + I +LA + + Sbjct: 536 TILSYALPQLMISSYTNSV-IQGKYRRSFWSEVYETVLSWYIILPTTLAFINPKLGKFNV 594 Query: 423 MPKKQ 427 K Sbjct: 595 TSKGG 599 >UniRef50_A9CZ20 Putative cellulose synthase catalytic subunit (UDP-forming) n=1 Tax=Hoeflea phototrophica DFL-43 RepID=A9CZ20_9RHIZ Length = 730 Score = 264 bits (675), Expect = 6e-69, Method: Composition-based stats. Identities = 82/456 (17%), Positives = 151/456 (33%), Gaps = 37/456 (8%) Query: 3 NRIVSFFILCLVLCIPLCVAYFHSGELM------MRFVFFWPFFMSIMWIVGGVYFWVYR 56 N + F + L AY+ + E + + F+ +M+ M+ + + + Sbjct: 28 NTVFRFITFIFCAVLALRYAYWRTTETLPVFSEPLNFIPGLLLYMAEMYCLLMLAISFFM 87 Query: 57 ERHWPWGENAPAPQLKDNPSISIIIPCFNE-EKNVEETIHAALAQRYE--NIEVIAVNDG 113 L D P++ ++IP +NE + + T+ AA + Y + + ++DG Sbjct: 88 LADPIKRVAPKVRSLDDLPTVDVLIPTYNEDPELLAGTLAAAKSMIYPRDKLTIWLLDDG 147 Query: 114 ST-------DKTRAILD-------RMAAQIPHLRVIHLAQN-QGKAIALKTGAAAAKSEY 158 ST D T A+ + + + +N KA L G + S+ Sbjct: 148 STQAKRSHKDPTVALAATRRHEQLKALCKAMGVHYHARKKNDHAKAGNLNDGLKVSGSDL 207 Query: 159 LVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTL-----VGKIQVGEYSSI 213 +V D D R+ V +P++ V L + E Sbjct: 208 VVVFDADHAPVREFLKETVSFFKEDPKLFLVQTPHYFLNPDPLEKNLRTFRSMPSENEMF 267 Query: 214 IGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYE 273 +++R + F G A R AL VG +S ITED + + L W Y Sbjct: 268 YSVLQRGLDKWNASF-FCGSAAVLSRKALQAVGGFSGQSITEDCETALSLHAKGWRSLYV 326 Query: 274 PRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWA 333 + L L PET QR RW QG ++ + N L + Y ++ Sbjct: 327 DKPLIAGLQPETFVAFIGQRARWCQGMLQILILNRPFLAKGLTMGQR---ICYAGINMFW 383 Query: 334 FTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTS 393 L L + ++E+ L T + F + + + S Sbjct: 384 LFPLSRLAFMLSPLLYIFFSLEIYQANIQEFGAYAL-TYLIASFAMQSYLYGKVRWPWVS 442 Query: 394 SLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRA 429 L+ + ++ +++ R + Sbjct: 443 ELYE---YIQSIMLVGSIISVIRNPRKPTFNVTAKG 475 >UniRef50_B4U2G9 Glycosyl transferase, group 2 family protein n=22 Tax=Streptococcus RepID=B4U2G9_STREM Length = 464 Score = 263 bits (674), Expect = 6e-69, Method: Composition-based stats. Identities = 98/468 (20%), Positives = 175/468 (37%), Gaps = 61/468 (13%) Query: 18 PLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSI 77 L + +S L + F+ + F+ + IVG ++ + + P P + Sbjct: 3 LLFLVLKYSQYLFLAFLIAYASFLLLSLIVGAIHLY-------ERRQQVPLGGDVSLPPV 55 Query: 78 SIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRM------------ 125 SI++P +NEE + ET+ + L Y+ E+I ++DGSTD T +L Sbjct: 56 SILVPAYNEELTIIETVESLLMLDYDCYEIIVIDDGSTDHTAKLLKEQFHLEAVTCSINQ 115 Query: 126 -----------AAQIPHLRVIHLAQ-NQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAA 173 Q+ H+++ L + N GK +L G AA+ +Y +C+D D++L D+ Sbjct: 116 RIPTRPLKAVSQKQLKHVKLTLLEKANGGKGDSLNFGINAAQYDYFLCMDADSMLQEDSL 175 Query: 174 AYIVEPMLYNPRVGAVTGNPRIRT--------------RSTLVGKIQVGEYSSIIGLIKR 219 I + + +P V AV G ++ L+ Q EY + Sbjct: 176 KRIAKTVAKDPSVIAVGGMVKVSQGVALAAGRIVDYHLPRQLISCAQAIEYDCAFFGARV 235 Query: 220 TQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQ------LNQWTIFYE 273 +SG F++ V + + ED+++ KL + I YE Sbjct: 236 LLDQAAGNLIISGAFGLFKKELAIAVDGYDTGTLGEDMELVMKLHFFCQNNKIPYRIGYE 295 Query: 274 PRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWA 333 A+CW P LK L +QR RW G + K + ++ Sbjct: 296 TDAICWSQAPSRLKDLCQQRRRWFLGLYQCLKKYLQFFVSGR-----FKVTVLISYVLYL 350 Query: 334 FTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTS 393 L+ I A + + L++ + T ++L +C I NR S Sbjct: 351 LFELLSPFIEAFGVTIIVLSLLCHQLNTTFFVSLMLLYICYCVLIALTSFLNRVYSQNNS 410 Query: 394 SLFWI----IWFPVIFWMLSLATTLVSF-TRVMLMPKKQRARWVSPDR 436 S+ + + + I+ L L T L + ++++ W R Sbjct: 411 SITFFDMVWVIYAAIYQCLVLHTVLTLVKVSSFIGYQRKKNAWGQITR 458 >UniRef50_Q0AL04 Glycosyl transferase, family 2 n=3 Tax=Bacteria RepID=Q0AL04_MARMM Length = 475 Score = 263 bits (674), Expect = 6e-69, Method: Composition-based stats. Identities = 99/465 (21%), Positives = 153/465 (32%), Gaps = 52/465 (11%) Query: 22 AYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIII 81 F L++ F +++ + GV + R R P ++ P +SI+ Sbjct: 11 VTFPVTFLILASALFQDTLYAVLIAIAGVT--LMRRRFEPNAHRLMRRIGQEAPRVSILA 68 Query: 82 PCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLR-------- 133 P +NE + V + A L Q + N EVI +NDGS D T L R P R Sbjct: 69 PAYNEAETVVDCTRALLRQNHSNYEVIVINDGSGDDTLNRLVRAFDLKPAERECSDRLDF 128 Query: 134 -----VIHLAQ----------NQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVE 178 V H N GKA AL G A A ID D+LL+ DA + Sbjct: 129 TPVRAVYHRPGPVPLWVIDKVNGGKADALNAGIAHASGSLFCAIDADSLLEADALIRTSQ 188 Query: 179 PMLYNPRVGAVTGNPRIRT--------------RSTLVGKIQVGEYSSIIGLIKRTQRIY 224 P L NP AV G R+ + +Q EY + + Sbjct: 189 PFLDNPDTVAVAGTVRVANGCRLREGAVEEAGLPRRFLPLVQSLEYLRAFLVARFALSEL 248 Query: 225 GNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWT------IFYEPRALC 278 G + VSG F R+ + EVG + + ED ++ +L + + + P +C Sbjct: 249 GALMIVSGAFGMFDRARVVEVGGYDPRTVGEDAELIVRLHRDGRDSDRPTVVRFIPEPVC 308 Query: 279 WILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTT--IWAFTC 336 W +PE L L QR RW +G E ++ R L + I Sbjct: 309 WTQVPERLSDLAGQRRRWQRGALETLWRHRDAALRPGYGLPGTLGLGSMIVIDVIGPCLE 368 Query: 337 LVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYE-HNLTSSL 395 ++G+ + + A ++ + + + + L Sbjct: 369 VLGYALAFLLAATGHISWSWFAALFALVCSFGFLLSASAILLQEIELRRTASLKAIGQLL 428 Query: 396 FWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRGILR 440 F + LS L + K W R Sbjct: 429 FAAAIENFGYRQLSNIWRLAGTIDFLKGSKH----WGHIKRSAFD 469 >UniRef50_D0KCI8 Cellulose synthase catalytic subunit (UDP-forming) n=4 Tax=Pectobacterium RepID=D0KCI8_PECWW Length = 899 Score = 263 bits (673), Expect = 8e-69, Method: Composition-based stats. Identities = 73/425 (17%), Positives = 150/425 (35%), Gaps = 24/425 (5%) Query: 1 MINRIVSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHW 60 MI + L + + L+ + + + +V G + ++ Sbjct: 202 MIALSFTVSCRYLWWRYTETLNWDDPVSLVCGLLLLAAETYAWVVLVLGYFQTIWPLNRH 261 Query: 61 PWGENAPAPQLKDNPSISIIIPCFNEE-KNVEETIHAALAQRYEN--IEVIAVNDGSTDK 117 P P++ ++IP +NE V+ T++AAL + + + ++DG Sbjct: 262 PVSLPE---DNTTWPTVDLMIPTYNEPLSVVKPTVYAALGIDWPKDKLNIYILDDGG--- 315 Query: 118 TRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIV 177 RA A ++ + + KA + AK E++ D D + R + Sbjct: 316 -RAEFKAFADEVGVHYIARVTHEHAKAGNINNALKQAKGEFVAIFDCDHVPTRSFLQLTM 374 Query: 178 EPMLYNPRVGAVTGNPRIRTRSTL---VGKIQ--VGEYSSIIGLIKRTQRIYGNVFTVSG 232 + ++ + + +G+ + E + GL++ ++ F G Sbjct: 375 GWFFKDKKLAMLQTPHHFFSPDPFERNLGRFRRTPNEGTLFYGLVQDGNDMWDATF-FCG 433 Query: 233 VIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQ 292 A RR L E+ + + +TED S +L ++ Y L E+L Q Sbjct: 434 SCAILRRKPLDEISGIAVETVTEDAHTSLRLHRRGYSSAYIRIPQAAGLATESLSAHIGQ 493 Query: 293 RLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPL 352 R+RWA+G ++F + L + Y + + + I LA + Sbjct: 494 RIRWARGMVQIFRLDNPLLGKGLKLGQR---LCYANAMMHFLSGIPRLIFLTAPLAFL-- 548 Query: 353 NIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLAT 412 L H + + L + + + +R + S + I+ V+ W ++ T Sbjct: 549 ---LIHAYIIFAPALAIALYVLPHMVHASLTNSRIQGRYRHSFWSEIYETVLAWYIARPT 605 Query: 413 TLVSF 417 T+ F Sbjct: 606 TVALF 610 >UniRef50_Q46CB1 Glycosyltransferase n=2 Tax=Methanosarcina RepID=Q46CB1_METBF Length = 694 Score = 263 bits (673), Expect = 9e-69, Method: Composition-based stats. Identities = 93/403 (23%), Positives = 162/403 (40%), Gaps = 30/403 (7%) Query: 31 MRFVFFWPFFMSIMWIVGGVYFWVYR-----ERHWPWGENAPAPQLKDNPSISIIIPCFN 85 + V+FW F ++ + + W + R + + +SII+P ++ Sbjct: 299 IMVVYFWFFIITSLLPLSFFVIWKFLSGWNIRRSKNSCQPQNIQDTGYHGLVSIIVPAYD 358 Query: 86 EEKNVEETIHAALAQRY-ENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKA 144 EEK + + + L Q Y +EVI VNDGSTD+T I+ + ++ I L N GKA Sbjct: 359 EEKTIGSCLESILNQDYNGEMEVIVVNDGSTDRTAEIVLKYP-----VKFIDLKANVGKA 413 Query: 145 IALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRT---RSTL 201 AL AK + L+ D D+ + +A + +VE ++ N V GN I + + Sbjct: 414 NALNRAIEDAKGDILIFTDSDSYMSANAVSSLVECLITNNDAQIVAGNVFIHNSQGKKRI 473 Query: 202 VGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISW 261 + Q+ EY + + Q + GN+ G + A +R V +SD+ I ED D + Sbjct: 474 MKYFQMIEYLIEQEITRYLQSLSGNILVCPGPLTAVKREVCESV-RFSDETIVEDADFTI 532 Query: 262 KLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWP 321 K I EP+A + PET + +KQR RW G +V+ + R Sbjct: 533 KALKKSMKIIREPKAKVYTNAPETFRAWYKQRKRWWYGNLQVWRLHRHWAMRN-----AW 587 Query: 322 LFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSL 381 + Y + + I+ + L I+ + +L L + F SL Sbjct: 588 MVLNY----LGYIIGVCSIILVMFLPYLLLQYDNLLLISLRGLSYSILPVLIFIVFTASL 643 Query: 382 MIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMP 424 L +I + +++ + + T + + Sbjct: 644 F------SKDKKLLPMLIPYVLVYSTIKVVTISYLYLCYLTGR 680 >UniRef50_A3I0Z1 Glycosyltransferase n=1 Tax=Algoriphagus sp. PR1 RepID=A3I0Z1_9SPHI Length = 489 Score = 263 bits (672), Expect = 1e-68, Method: Composition-based stats. Identities = 86/410 (20%), Positives = 164/410 (40%), Gaps = 15/410 (3%) Query: 29 LMMRFVFFWPFFMSIMWIVGGVY----FWVYRERHWPWGENAPAPQLKDNPSISIIIPCF 84 ++ ++ + + +++I + W + P ++ P +++ +P F Sbjct: 1 MIFIYLLIGIYTLGLLFIFIYSLAQGNLLWNFWKARKWLASTPMKEMDTWPKVTVQLPIF 60 Query: 85 NEEKNVEETIHAALAQRYEN--IEVIAVNDGSTDKTRAILDRMAAQIP--HLRVIHLAQN 140 NE V+ I AA Y +E+ ++D STD+T ++ P + + IH Sbjct: 61 NELYVVDRLIEAAANLNYPKELLEIQLLDD-STDETVDLIQEKIKNYPEVNFQYIHRQDR 119 Query: 141 QG-KAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRS 199 G KA ALK G A+ E++ D D + D D + P + +VG V RS Sbjct: 120 VGFKAGALKEGLVNAEGEFIAIFDADFVPDPDFLLKTL-PYFSSEKVGMVQSRWTHLNRS 178 Query: 200 -TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDID 258 +L+ ++Q + + + + +G +R+S + + G W DD +TED+D Sbjct: 179 YSLLTRLQAFALDAHFLIEQMGRNYQHAFINFNGTGGVWRKSCILDSGNWHDDTLTEDLD 238 Query: 259 ISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWR-KENF 317 +S++ Q W Y P +P + + Q+ RW +GGAE K+++ + K F Sbjct: 239 LSYRAQRKGWEFIYRPEIESPAELPPIMSAVKSQQFRWTKGGAECARKHISGVMSQKLPF 298 Query: 318 RMWPLFFEYCL-TTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQ 376 R+ F + ++I+ LV V AG+ I I + + Sbjct: 299 RVKFHAFAHLFNSSIFIAILLVSLSSIGVWWAGIKGMIPERLFQLAGIFMIGFVIIAGVY 358 Query: 377 FIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKK 426 + + +L + + F + LSL + + + K Sbjct: 359 LVSYFYARRSFLKSLGQVFWQLPIFLSVSMALSLHNS-QAVWEGLTGKKS 407 >UniRef50_B4RSC1 Cellulose synthase catalytic subunit n=2 Tax=Alteromonas macleodii RepID=B4RSC1_ALTMD Length = 707 Score = 263 bits (672), Expect = 1e-68, Method: Composition-based stats. Identities = 94/459 (20%), Positives = 159/459 (34%), Gaps = 44/459 (9%) Query: 7 SFFILCLVLCIPLCVAYFHS------------GELMMRFVFFWPFFMSIMWIVGGVYFWV 54 F ++ LVL I L V Y + + +F + I ++G + Sbjct: 33 LFRVVALVLAIALSVRYLLWRGLYTLTYSDVLSMIAVWLLFAAEIYAGITSVLGAIVNAF 92 Query: 55 YRERHWPWGENAPAPQLKDNPSISIIIPCFNE-EKNVEETIHAALAQRYEN--IEVIAVN 111 R E PS+ ++IP +NE E+ +E TI AA Y + + ++ Sbjct: 93 PLSRPLLSLEGM---DKSTLPSVDVMIPSYNEDEEILEVTIRAAKMLDYPKEKLHIHLLD 149 Query: 112 DGSTDKT---------------RAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKS 156 DG TD+ R L + ++ KA + + Sbjct: 150 DGGTDQKINADNPISAAQAKQRRQDLQALCERLGITYHTRAKNEFAKAGNVNSAIQNTHG 209 Query: 157 EYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKI------QVGEY 210 + +V +D D + D V ML N +V V + + E Sbjct: 210 DLIVILDADHVPTSDFLDRTVPWMLKNEKVFLVQTPHFMANPDPVERNYFSAFSRMPSEN 269 Query: 211 SSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTI 270 G I++ +G+ F G A RR L VG S + ITED + + L + Sbjct: 270 DMFYGTIQKGLDYWGSSF-FCGSAALMRRKHLDLVGGISGESITEDAETALDLHKMGYES 328 Query: 271 FYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTT 330 Y R + L PET +QR+RWAQG ++ L K W Y + Sbjct: 329 VYVDRPMVSGLAPETFDAFIQQRMRWAQGMTQILLLKKPF---KAEGLKWYQRVGYMSSI 385 Query: 331 IWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHN 390 ++ I + LA + +++ H + L + + +++S M+ R Sbjct: 386 MFWLFPFARIIFLFMPLAYLVFGLQVYHASILEIFAYTLPHV-IATYMISTMLFGRTRWP 444 Query: 391 LTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRA 429 L S L+ I+ L ++ PK + Sbjct: 445 LVSELYEILQCAFTLNALVKVFLKPRAPSFVVTPKGESL 483 >UniRef50_C6X1A6 Glycosyl transferase, family 2 n=3 Tax=Flavobacteriaceae RepID=C6X1A6_FLAB3 Length = 824 Score = 262 bits (671), Expect = 1e-68, Method: Composition-based stats. Identities = 93/462 (20%), Positives = 172/462 (37%), Gaps = 57/462 (12%) Query: 28 ELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWP---WGENAPAPQLKDNPSISIIIPCF 84 E+++ F + + +++ G+Y R+ + + + + P S+I P + Sbjct: 10 EIVIWFFLLYACGVVLVYSWMGIYALGAVLRYKRENTFTDYSIIASNPNAPVFSLIAPAY 69 Query: 85 NEEKNVEETIHAALAQRYENIEVIAVNDGSTDK-----------------------TRAI 121 NE + E + + L+ Y N+E+I VNDGS D T + Sbjct: 70 NEGMTIVENVRSLLSLYYHNLEIIIVNDGSKDDSIQKLIEAYELESVNFFIQGNIPTAEV 129 Query: 122 LDRMAAQIPHLR--VIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEP 179 ++ P R + N GKA AL G + +YLVCID D +L++DA + +P Sbjct: 130 RGIYKSKNPAFRKLTVVDKANGGKADALNVGVNISGGDYLVCIDVDCILEQDAILKLAKP 189 Query: 180 MLY--NPRVGAVTGNPRIRT--------------RSTLVGKIQVGEYSSIIGLIKRTQRI 223 L + ++ A G R+ TL+G+ QV EY L + Sbjct: 190 FLEQTDKKLIACGGVIRLANNCRIENGKVVEVNLPRTLLGRTQVLEYIRAFVLGRMAWSR 249 Query: 224 YGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQL------NQWTIFYEPRAL 277 + +SG F R + G + + + ED+++ +++ + + P L Sbjct: 250 ASGLILISGAFGVFDRKIVLACGGYDKNTVGEDMELVVRMRQYMEERNEPYEVINIPDPL 309 Query: 278 CWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLT--TIWAFT 335 CW PE+ + L KQR RW +G E K+ ++ ++ + Y T + Sbjct: 310 CWTEAPESKEILVKQRNRWMRGTMETLWKHRKMMFNPRYGKLGTVSLPYWFTFEFLGPLI 369 Query: 336 CLVGFIIYAVQLAGVPLNIELTHIAATHTAGI-LLCTLCLLQFIVSLMIENRYEHNLTSS 394 G+ I+ + L +N I L ++ + + + S Sbjct: 370 EFAGYFIFVIFLLLGIINWSFFWILFALVISAGFLYSIYAILVDLVSRQVYTKRKDFMSL 429 Query: 395 LFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 + + P F + + + F K+ W R Sbjct: 430 ITTALLEPFYFHPIVVRAGVKGFVDYF----KKSHSWGEMTR 467 >UniRef50_C0Z7U1 Conserved hypothetical membrane protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0Z7U1_BREBN Length = 480 Score = 262 bits (671), Expect = 1e-68, Method: Composition-based stats. Identities = 85/468 (18%), Positives = 170/468 (36%), Gaps = 58/468 (12%) Query: 26 SGELMMRFVFFWPFFMSIMWIVGGVYFW---VYRERHWPWGENAPAPQLKDNPSISIIIP 82 F ++ ++ ++ + ++ + + + + + P ISI++P Sbjct: 4 FLLYYGVFAVYYVIAINSLYFIIMLFSFNSIMGILKSSIYSRFQTLSGSEHVPPISILVP 63 Query: 83 CFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAI-LDRMAAQI------------ 129 +NEE + E + + L+ Y EV+ VNDGS D T + +D + Sbjct: 64 AYNEELTIIENVRSLLSLNYPRYEVLVVNDGSKDATLQVMIDEFSLVYSEHMPVQPLLHT 123 Query: 130 ------------PHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIV 177 P+L +I +N GKA +L G + + ID D+LL++DA I Sbjct: 124 SKIRGIYHNPAFPNLYLID-KENGGKADSLNAGINLSHYPLIASIDADSLLEKDALIRIA 182 Query: 178 EPMLYNP-RVGAVTGNPRIRT--------------RSTLVGKIQVGEYSSIIGLIKRTQR 222 + + NP A+ G+ RI + +Q EY + Sbjct: 183 KVYMENPDETVAIGGDIRIANGCKIEDGVVKDIRLPDKFLPMMQSVEYLKAFLGGRIGWS 242 Query: 223 IYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQ------LNQWTIFYEPRA 276 + VSG FR+ + +VG + + ED++I KL + + + P A Sbjct: 243 AINGLIIVSGAFGVFRKDYIIKVGGYREGYPGEDMNIIIKLHKYMLENKLPYRVAFCPDA 302 Query: 277 LCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRL-WRKENFRMWPLFFEY-CLTTIWAF 334 +CW P+TLK L QR RW +G + ++ + +R + L + L Sbjct: 303 VCWTQAPDTLKILGSQRRRWGRGNLKNMIEYGRHMAFRPKYKLFGMLTLPFNILFETLNP 362 Query: 335 TCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEH--NLT 392 + ++ + + + + + L + ++ + RY ++ Sbjct: 363 YFRITGLLALIGYTLLDMTNGYVLLLYGLLNVLYGMLLGIGSLLLEEIAFRRYPRFQDIV 422 Query: 393 SSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRGILR 440 LF+ I + + + + + + W + R + Sbjct: 423 KMLFYTILMFFGYRQIGVIWRFLGHIEYL----RNNNSWGTMTRTSWQ 466 >UniRef50_B2UMM8 Glycosyl transferase family 2 n=3 Tax=Verrucomicrobia RepID=B2UMM8_AKKM8 Length = 505 Score = 262 bits (670), Expect = 2e-68, Method: Composition-based stats. Identities = 75/371 (20%), Positives = 145/371 (39%), Gaps = 11/371 (2%) Query: 30 MMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKN 89 + + + + + VY + P + ++ P +++ +P FNE+ Sbjct: 7 FIWLLCYLLVLVGLAGYGFHRLSIVYLYWKNRNNKPQPKARFQELPVVTVQLPMFNEKFV 66 Query: 90 VEETIHAALAQRYE--NIEVIAVNDGSTDKTRAIL----DRMAAQIPHLRVIHLAQNQG- 142 V+ + + A Y +E+ ++D STD T + + ++ IH G Sbjct: 67 VDRLLESVAALDYPQDKLEIQILDD-STDDTTEQCYRKVEELKSRGFDAVCIHRTDRTGF 125 Query: 143 KAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRS-TL 201 KA AL+ AK E+L+ +D D + + D + + VG V R L Sbjct: 126 KAGALEAATKVAKGEFLLILDADFVPEPDLLQKTI-HFFTDENVGLVQTRWGHINREYNL 184 Query: 202 VGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISW 261 + +IQ + + + G FT +G +R+ + + G WS D +TED+D+S+ Sbjct: 185 LTRIQGMYLDGHFAMEQTARNRSGRFFTFNGTAGIWRKCVIGDAGGWSHDTLTEDMDLSY 244 Query: 262 KLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWP 321 ++QL W Y + +P + G Q+ RW +G +V K + +WR Sbjct: 245 RVQLRGWRFIYLNDVVTPAELPVDMDGFKSQQHRWTKGSIQVCQKILLDIWRSNAPLKAK 304 Query: 322 LFFEYCLTTIWAFTCLVGFIIYAVQLAG-VPLNIELTHIAATHTAGILLCTLCLLQFIVS 380 + LT +++ L + E + + A + ++ + F +S Sbjct: 305 VEATTHLTCNYSYLLLALLCFLVYPICTQRIPENETVFMWFVNVALFFMTSVAVCIFYMS 364 Query: 381 LMIENRYEHNL 391 I R + Sbjct: 365 AQIVVRPKSWW 375 >UniRef50_C4KBM8 Glycosyl transferase family 2 n=6 Tax=Betaproteobacteria RepID=C4KBM8_THASP Length = 868 Score = 262 bits (670), Expect = 2e-68, Method: Composition-based stats. Identities = 81/446 (18%), Positives = 152/446 (34%), Gaps = 20/446 (4%) Query: 2 INRIVSFFILCLVLCIPLCVAYFHSGELMMRFVFFWP--FFMSIMWIVGGVYF--WVYRE 57 + ++ I L++ I + V Y+ + ++ V + + + G F ++++ Sbjct: 349 LAALIQACISTLIIGINVPVEYYLTQRDLIGLVLLIGATCMTAAVLLSHGFEFGEVLFKK 408 Query: 58 RHWPWGENAPAPQLKDNPSISIIIPCFNEEKN-VEETIHAALAQRYENIEVIAVNDGSTD 116 + P + P +SI + C+NE V TI + Y+N EV+ +++ + D Sbjct: 409 KWARRFTPLPPHPPEQQPFVSIHLACYNEPPEMVIATIDSLAQMNYQNFEVLILDNNTRD 468 Query: 117 KTR--AILDRMAAQIPHLRVIHLAQNQG-KAIALKTGAAAA--KSEYLVCIDGDALLDRD 171 + + R A P R HLA G KA AL G ++E + +D D ++D D Sbjct: 469 EALWKPLERRCAELGPRFRFFHLANWPGFKAGALNYGLKVTDPRAEVVGVVDADYVVDPD 528 Query: 172 AAAYIVEPMLYNPRVGAVTGNPRIRT-RSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTV 230 A +V P P V V R ++ E+ + + + Sbjct: 529 WLACLV-PHFDQPEVAVVQAPQAHRDWEGQPFKRMCNWEFDGFFRIGMHHRNERNALIQ- 586 Query: 231 SGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLW 290 G + RR AL EVG WS+ I ED ++ +L + Y L L P + Sbjct: 587 HGTMTMVRRRALEEVGGWSEWCICEDTELGLRLIEKGYDTRYIDHILGRGLTPSGFAAIK 646 Query: 291 KQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGV 350 QR RWA G ++ ++ + + + + + + L Sbjct: 647 SQRFRWAFGAMQILKAHLPHMI-GRSTLNLAQRYHFLTGWFAWLGDALQLVFAFGSLLWT 705 Query: 351 PLNIELTHIAATHTAGILLCTLCLLQF--IVSLMIENRYEHNLTSSLFWIIWFPVIFWML 408 + + L + F + ++ R + V Sbjct: 706 LGILLFPKAFGLPVVSLALPIFGFMIFKAALGPILYRRTMDCPWKDILGASILSVGLAHA 765 Query: 409 SLATTLVSFTR----VMLMPKKQRAR 430 + ++ PK R Sbjct: 766 IARGVFAGLVKKKGVFVVTPKGWRGG 791 >UniRef50_Q2K9Z1 Cellulose synthase protein n=17 Tax=Rhizobiaceae RepID=Q2K9Z1_RHIEC Length = 748 Score = 262 bits (669), Expect = 2e-68, Method: Composition-based stats. Identities = 81/482 (16%), Positives = 149/482 (30%), Gaps = 54/482 (11%) Query: 1 MINRIVSFFILCL---------VLCIPLCVAYFHSGELMMR--------------FVFFW 37 +I I ++ L + L + F+ Sbjct: 49 LITSITVVTVMALIKILKGEGTWRLVALAFGTSIVLRYVYWRTTNTLPPLNQPENFIPGL 108 Query: 38 PFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNV-EETIHA 96 +++ M+ V + ++ + A P + + +P +NE+ + T+ A Sbjct: 109 LLYLAEMYSVAMLALSLFIVATPLPPRPSRAANPGRLPHVDVFVPSYNEDAGLLGNTLAA 168 Query: 97 ALAQRYE--NIEVIAVNDGSTD---KTRAILDRMAAQIPHL------------RVIHLAQ 139 A A Y + V ++DG T + +L+ AA H+ + Sbjct: 169 AKAMDYPADKLHVWLLDDGGTLQKRNSGKLLEAQAAAARHIELKQLCQDLDVTYLTRDRN 228 Query: 140 NQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRS 199 KA L G + E + D D RD V +P++ V Sbjct: 229 EHAKAGNLNNGMKHSTGELIAVFDADHAPARDFLLETVGYFEDDPKLFLVQTPHFFINPD 288 Query: 200 TLVGKI-----QVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMIT 254 L + E G+I+R + F G A R AL +S IT Sbjct: 289 PLERNLRTFEKMPSENEMFYGIIQRGLDKWNAAF-FCGSAAVLSRKALESQNGFSGISIT 347 Query: 255 EDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRK 314 ED + + L + W Y + L L P T QR RWAQG ++ L R Sbjct: 348 EDCETALALHGSGWNSIYVDKPLIAGLQPATFASFIGQRSRWAQGMMQILRFRFPLLKRG 407 Query: 315 ENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCL 374 + P Y +T++ I L + ++E+ + L Sbjct: 408 LS---IPQRLCYMSSTLFWLFPFPRTIFLFAPLFYLFFDLEIFTASGGEFLAYTF-AYML 463 Query: 375 LQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSP 434 + ++ + + S L+ + +L +++ R + ++ Sbjct: 464 VNLMMQNYLYGSFRWPWISELYE---YVQTVHLLPAVVSVMLNPRKPTFKVTAKDESIAV 520 Query: 435 DR 436 R Sbjct: 521 SR 522 >UniRef50_B2IY03 Glycosyl transferase, family 2 n=1 Tax=Nostoc punctiforme PCC 73102 RepID=B2IY03_NOSP7 Length = 427 Score = 262 bits (669), Expect = 3e-68, Method: Composition-based stats. Identities = 81/410 (19%), Positives = 151/410 (36%), Gaps = 22/410 (5%) Query: 31 MRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNV 90 + + + P +M + + + A P + I++P NE + Sbjct: 31 VSLLHWLPETQWLMLGLTAILTVQTLRMLVAKPDAAVMESKIYLPPVLILVPAKNESSVL 90 Query: 91 EETIHAALAQRYEN--IEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALK 148 + + Y + +++ V+DGSTD+T IL + Q P L+V GK+ AL Sbjct: 91 ANLVDSLYQLNYPSNSLDIWIVDDGSTDETPEILRELQTQFPSLQVHRRESKGGKSGALN 150 Query: 149 TGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRS-TLVGKIQV 207 + E ++ D DA L + V P+ +GAV I + + + Q Sbjct: 151 AVFPFTQGEIILVCDADAQLPVNFLQQTV-PLFQKQAIGAVQVRKTISNANTNFLTRCQQ 209 Query: 208 GEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQ 267 E ++ + G + + G RR L + W+++ +T+D+D+ +KL L Sbjct: 210 MEMCC-DSFLQTHRIAIGGMSELRGNGMLVRRELLEKCQGWNENTVTDDLDLCFKLYLAG 268 Query: 268 WTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYC 327 I + + + LW QR RWA+GG + +L ++ W + Sbjct: 269 AEIEFVTVPSIQEESVTSWEKLWHQRCRWAEGGYQRYLDYFPQILT----LGWAKEIDLL 324 Query: 328 LTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRY 387 L + F +G I + I +H LL + + FI L Sbjct: 325 LFFLLQFLLPIGLI------PDLLWTIFYSHHPVLFPLQTLLSIILTIAFIAGLYQFQNL 378 Query: 388 EHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLM-PKKQRARWVSPDR 436 + ++W + + V + M +K+R+ WV + Sbjct: 379 RG------WSLLWATIQGSFYMVHWIPVMIVTTLKMCVQKERSHWVKTEH 422 >UniRef50_Q8DMV4 Cellulose synthase n=1 Tax=Thermosynechococcus elongatus BP-1 RepID=Q8DMV4_THEEB Length = 736 Score = 261 bits (668), Expect = 3e-68, Method: Composition-based stats. Identities = 70/432 (16%), Positives = 128/432 (29%), Gaps = 18/432 (4%) Query: 5 IVSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGE 64 ++ F LV + + H F+ +S + Y + W + Sbjct: 101 VILFGCRYLVWRGWVTLNDAHWLSFAASLGFYGLEVLSFFTYLLYFYQTAWLTTAWRSQQ 160 Query: 65 ----NAPAPQLKDNPSISIIIPCFNEEKNV-EETIHAALAQRYENIEVIAVNDGSTDKTR 119 + + PS+ I IP +NE + TI A A Y N + ++DG + Sbjct: 161 ANAYQQAVLRGEYCPSVDIFIPTYNEPPYILRRTIVACQAIHYRNKSIYVLDDGRRSEIA 220 Query: 120 AILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEP 179 + + + KA L E + D D + ++ V Sbjct: 221 DLCAHLGVNY----LTRPTNEHRKAGNLNHALKHTTGELIAVFDADFIPFQNFLTRTVG- 275 Query: 180 MLYNPRVGAVTGNPRIRTRS-----TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVI 234 + +V V + + G+ G I+ + +GN G Sbjct: 276 FFQDDQVSMVQTPQHFFNPDYHSQNLGIEFMMPGDMEYFFGFIQPGRD-FGNAIICCGTS 334 Query: 235 AAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRL 294 RR AL +G + + ED K+Q+ + + Y L P + +QRL Sbjct: 335 YVVRRRALEAIGGYYTRCVVEDFQTGTKMQIAGYRLVYLNEILSMGESPRNFQDYLEQRL 394 Query: 295 RWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNI 354 RW QG +++ + W + + + + I Sbjct: 395 RWLQGNMQIYFCGDDLPIWSK--LSWFQRSCHLSLLLHNINPFIRTCFLIGPFLSLMTGI 452 Query: 355 ELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTL 414 LT + L L+ S E RY FP + ++ + Sbjct: 453 SLTVATLSEYLFYALPYTLLMIATFSWATEGRYFSLWGEVYEVTFAFPGMVQLIKILRNP 512 Query: 415 VSFTRVMLMPKK 426 ++ K Sbjct: 513 FGKIGSIVTNKG 524 >UniRef50_Q74D65 Glycosyl transferase, group 2 family protein n=1 Tax=Geobacter sulfurreducens RepID=Q74D65_GEOSL Length = 472 Score = 261 bits (667), Expect = 4e-68, Method: Composition-based stats. Identities = 91/468 (19%), Positives = 168/468 (35%), Gaps = 53/468 (11%) Query: 20 CVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISI 79 VA+ + + ++ F ++ + + +SI Sbjct: 5 IVAFLLAIQPVIFIYFIVLNGFYTLFTIISLRDIRNYLNAVTSQSIDNVLNGMFYRPLSI 64 Query: 80 IIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRA------------------- 120 ++P +NEEK + ++ + LA RY EVI +NDGSTD T Sbjct: 65 LVPAYNEEKTIVSSVKSLLALRYPEYEVIVINDGSTDGTLESLINEFRLVRIDKPISLHV 124 Query: 121 -----ILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAY 175 I ++ P+L V+ +N GKA AL G A++ ID D++L+ DA Sbjct: 125 PHRPIIAKYVSVDHPYLFVLD-KENGGKADALNAGINASQFPLFCSIDADSVLEDDALIR 183 Query: 176 IVEPMLYNPRVGAVTGNPRIRT--------------RSTLVGKIQVGEYSSIIGLIKRTQ 221 + + V A G R+ ++ Q EY+ + + Sbjct: 184 ATRLFVEDREVVATGGIVRVLNGCEVEDGIVKGVRAPRGMLECFQTVEYTKGFLSGRTSW 243 Query: 222 RIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLN------QWTIFYEPR 275 + ++ +SG FR+ + V + + + ED+D+ +L + ++ + + P Sbjct: 244 NYFRSLLIISGAFGIFRKDMVMAVKGYRE-SVGEDMDLVVRLHRHCRQNRIRYKVVFVPD 302 Query: 276 ALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLT--TIWA 333 +CW +P + L KQR RW +G + N + F Y +T + Sbjct: 303 PVCWTQVPSDMASLLKQRNRWHRGLIDSLWHNRGMFLNPRYGTVGLFGFPYFVTVELLGP 362 Query: 334 FTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTS 393 +G+ + + +N E I A + + L + +I RY L Sbjct: 363 AVEFIGYFGFVLLFFLGQVNREF-AILFFLLAVLWGTWINLGSIFLDNLIYKRY-KGLGD 420 Query: 394 SLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRGILRG 441 L ++ + F+ + + KK W P R + G Sbjct: 421 VLKLCLFGLLEFFGYRQIIVVERLIATFMFWKKG---WGKPKRKEIDG 465 >UniRef50_D2S4V1 Glycosyl transferase family 2 n=1 Tax=Geodermatophilus obscurus DSM 43160 RepID=D2S4V1_9ACTO Length = 483 Score = 260 bits (666), Expect = 6e-68, Method: Composition-based stats. Identities = 105/435 (24%), Positives = 170/435 (39%), Gaps = 19/435 (4%) Query: 18 PLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHW-------PWGENAPAPQ 70 PL + + G M+ + + ++ V + F VY RH W EN Sbjct: 39 PLVLLAYVCGFTMVALLHGLDETLGVVAGVLNIVFGVYFCRHLGFAVAAARWAENDMLAA 98 Query: 71 L----KDNPSISIIIPCFNEEKNVEETIHAALAQRYE--NIEVIAVNDGSTDKTRAILDR 124 PS+++ + C NEE V+ + A LA Y + ++ V+DGS D T A LD Sbjct: 99 DVGLHSYTPSVAVFVGCKNEELVVDGMVTALLALDYPADRLTLVVVDDGSDDTTGARLDA 158 Query: 125 MAAQIPHLRVIHL--AQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLY 182 AA+ P LRV+H GK+ AL +E V D D D A +V Sbjct: 159 WAAREPRLRVLHRPPGSGGGKSGALNDALHLVDAEIAVVFDADHEPDPTALRRLVR-HFR 217 Query: 183 NPRVGAVTGNPRIRT-RSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSA 241 +P VGAV G IR R + + ++ S + + ++ + G A R S Sbjct: 218 DPVVGAVMGRCVIRNGRDSQMASTVFVDFLSGYLVNEYGRQALFELPAYGGANCAVRMST 277 Query: 242 LAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGA 301 L +G W+ +TED D++ ++ + + Y+ A+ + + + WKQR RWA+G Sbjct: 278 LRGLGGWNPHTVTEDTDLTLRVLMAGQRVRYDLSAVDFEEAVVSAQRFWKQRYRWARGHQ 337 Query: 302 EVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAA 361 + F L R + + ++ L G + L I + A Sbjct: 338 KCFRDYWRPLMRSPHLSLVEKVETTLFLLVYHVPLLSGIGVLLTVL--RAFGIGDLPLVA 395 Query: 362 THTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVM 421 +LL L + V L+I + L ++ F + W+ + A R Sbjct: 396 LLPLSMLLFVGPLAELSVGLLIGRVERRSAWRLLGFLPAFALSIWITTRAYVDGMLGRPY 455 Query: 422 LMPKKQRARWVSPDR 436 K R+ S R Sbjct: 456 TWVKTSRSGATSTVR 470 >UniRef50_Q988K9 Mlr6694 protein n=1 Tax=Mesorhizobium loti RepID=Q988K9_RHILO Length = 475 Score = 260 bits (665), Expect = 7e-68, Method: Composition-based stats. Identities = 100/468 (21%), Positives = 179/468 (38%), Gaps = 55/468 (11%) Query: 24 FHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPA--PQLKDNPSISIII 81 + ++ F+ + ++++ + + P +A P I++++ Sbjct: 10 VLAASILSWFIIGAGLAQTTIYLLQLIIAAYALSKRPPVARSALLWHRYGDVAPPIALLV 69 Query: 82 PCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAIL--------------DRMAA 127 P +NEE NV E++H+ LA Y N EVI +NDGS D+T L + +A Sbjct: 70 PAYNEELNVVESVHSMLALEYPNFEVIVINDGSKDQTLQRLIGAFKLVKFRRPYEEALAH 129 Query: 128 QIPHLRV---------IHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVE 178 + + +N GKA A G ++ IDGD++L+ DA + Sbjct: 130 KPIRGLYSSPMTERLFVVDKENGGKADAQNAGINVCRAPLFCVIDGDSILEPDALMRAAQ 189 Query: 179 PMLYNPR-VGAVTGNPRIRT--------------RSTLVGKIQVGEYSSIIGLIKRTQRI 223 P + +P AV G RI + L+ +QV EY + + Sbjct: 190 PFIDDPERTIAVGGTIRIANGSRIEAGRVREIHLPNRLLPLLQVVEYLRAFLMARLAWSH 249 Query: 224 YGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQL------NQWTIFYEPRAL 277 + VSG FRR+ + VG ++ + ED+D+ KL ++ I + P + Sbjct: 250 INTLMLVSGAFGMFRRTEVVAVGGFTMGSMGEDLDLVIKLHRHMRDAGRKYRIQFIPEPV 309 Query: 278 CWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLT--TIWAFT 335 CW PETL L +QR RW +G E F + L+ R+ L F + L + Sbjct: 310 CWTEAPETLSVLARQRTRWQRGALECFFRYRAMLFNPRYGRVGFLGFGHILIVDVLGPVA 369 Query: 336 CLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYE--HNLTS 393 ++G+I+ L+ E +A T + + ++ R+ +L Sbjct: 370 EVLGYILIPAFWLLGILSTE-YLLAFTSLIFTYGVCVSVCSLVLEEAELQRFPRARDLAV 428 Query: 394 SLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRGILRG 441 + + L+ + + + + + W R L G Sbjct: 429 LTAVAVIENFGYRQLNNFWRVKGWWQYL----RADQSWGEMTRTGLSG 472 >UniRef50_C7IGQ0 Glycosyl transferase family 2 n=2 Tax=Firmicutes RepID=C7IGQ0_9CLOT Length = 433 Score = 260 bits (664), Expect = 1e-67, Method: Composition-based stats. Identities = 92/415 (22%), Positives = 166/415 (40%), Gaps = 21/415 (5%) Query: 42 SIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQR 101 S++ + V+ + H + P+++I+IP NEEK + T+ L Sbjct: 19 SVLINIILVFGAIGYVAHERKNKVNLDKYNDRLPTVTIMIPAHNEEKVIASTLERILELN 78 Query: 102 YEN--IEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQN-----QGKAIALKTGAAAA 154 Y I++I +ND S+D T + ++ P R+ + N +GKA AL A Sbjct: 79 YPQHLIQIIVINDNSSDATGVQIKKVQDNSPERRINVITTNKENGGRGKAHALNLALKQA 138 Query: 155 KSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRST-LVGKIQVGEYSSI 213 K E++ D DA +R+ +++ + R AV G + R R+ + K E + Sbjct: 139 KGEWICIFDADAAPERNTLKFLIIKSFEDKRFAAVFGRNKARNRNRNFLTKCINLELIAS 198 Query: 214 IGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYE 273 ++ + + + G R+ L +VG W ITED D+ + + + I YE Sbjct: 199 QRIVNPGKWYLFKLGQIPGTNFIINRTILQKVGGWDSRAITEDTDLGFTIMKRNYLIAYE 258 Query: 274 PRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWA 333 RA + PE L KQR RWA+G V KN L+ +R+ Y +T IW Sbjct: 259 SRAEAYQQEPEKLSVYVKQRTRWAKGNLYVVFKNFPDLFSGLAWRIKFDILFYWMTYIWF 318 Query: 334 FTCLV------------GFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSL 381 ++ G I + + L + + +L+ + +LQ V L Sbjct: 319 LLFVLISDGILLFNIVSGVIDFIYPGLNLLGQTSLAVNFNFNLSFLLMYVIFVLQINVGL 378 Query: 382 MIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 + + L + + + + +++ + K+ A+W +R Sbjct: 379 ASDKGQSTAE-NVLISCVSYFTYAQVFIFISLKATYSYISDRIFKREAKWYKTER 432 >UniRef50_C7M4N0 Glycosyl transferase family 2 n=4 Tax=Flavobacteria RepID=C7M4N0_CAPOD Length = 494 Score = 260 bits (664), Expect = 1e-67, Method: Composition-based stats. Identities = 81/388 (20%), Positives = 154/388 (39%), Gaps = 11/388 (2%) Query: 57 ERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYE--NIEVIAVNDGS 114 + K+ P ++I +P +NE+ V + Y +E+ ++D S Sbjct: 38 RKQNDEAPKFNLLDPKEIPYVTIQLPLYNEKYVVRRLLENIAKLEYPRTKLEIQVLDD-S 96 Query: 115 TDK----TRAILDRMAAQIPHLRVIHLAQNQG-KAIALKTGAAAAKSEYLVCIDGDALLD 169 TD+ T I+ + A ++ I +G KA ALK G A AK +++ D D + Sbjct: 97 TDESVAETAEIIKELQATGLDIKHIRRTNREGFKAGALKEGLAIAKGDFVAIFDADFMPQ 156 Query: 170 RDAAAYIVEPMLYNPRVGAVTGNPRIRTRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVF 228 D V +P +G V R+ +++ KIQ L + + G Sbjct: 157 PDWLKRTV-VYFKDPEIGVVQTRWGHINRNYSILTKIQALALDVHFTLEQVGRNSKGYFI 215 Query: 229 TVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKG 288 +G +R++ + + G W D +TED+D+S++ QL W Y +P + Sbjct: 216 NFNGTAGIWRKACIYDAGNWEGDTLTEDLDLSYRAQLKNWKFKYLEDVETPAELPVVISA 275 Query: 289 LWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLA 348 Q+ RW +GGAE F K ++R+ +N W F + + + L F++ + + Sbjct: 276 ARSQQFRWNKGGAENFRKTVSRVLAAKNIG-WKTKFHGVMHLLNSSMFLWVFVVSVLSIP 334 Query: 349 GVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWML 408 + + + H+A ++ FI + I V F+ + Sbjct: 335 MLYIKNQYGHLAWIFHITSFFIISTVILFICYWFTYKNLQGKTFDDFLNYIKLFVTFFSV 394 Query: 409 SLATTLVSFTRVMLMPKKQRARWVSPDR 436 +L + + V+ +R+ +V + Sbjct: 395 ALGFSFHNTVAVLEGHTGKRSEFVRTPK 422 >UniRef50_C8WFV0 Cellulose synthase catalytic subunit (UDP-forming) n=4 Tax=Zymomonas mobilis RepID=C8WFV0_ZYMMN Length = 725 Score = 259 bits (663), Expect = 1e-67, Method: Composition-based stats. Identities = 77/435 (17%), Positives = 144/435 (33%), Gaps = 25/435 (5%) Query: 4 RIVSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWG 63 + + I + + H + + V F + ++ G++ ++ + Sbjct: 82 SSILTSTRYIFWRITQTLRFDHIMDAVFGGVLFMAELYAWAILILGLFQILWPMQR---P 138 Query: 64 ENAPAPQLKDNPSISIIIPCFNEEKNVEE-TIHAALAQRYEN--IEVIAVNDGSTDKTRA 120 + + ++ P++ + IP +NE + + T+ AAL Y V ++DG ++ R Sbjct: 139 VVPLSGEDEELPTVDVFIPTYNESMEIVQNTVFAALGMDYPKDRFNVYLLDDGHREEFRL 198 Query: 121 ILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPM 180 + + KA L K E + D D + R V + Sbjct: 199 FAEEAGCHY----LTRNDNLNAKAGNLNAALRKTKGELVCIFDCDHVPTRAFLQLTVGWL 254 Query: 181 LYNPRVGAVTGNPRIRTRSTLVGKI-----QVGEYSSIIGLIKRTQRIYGNVFTVSGVIA 235 P + V + + + G+ G ++R + F G A Sbjct: 255 QKEPNLALVQTPHFFYSPDPIQRNVPGGDELPGDNELFYGTVQRGNDLRDATF-FCGSCA 313 Query: 236 AFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLR 295 RR AL E +S + +TED + KLQ W Y L L +TL KQR R Sbjct: 314 ILRREALEENNGFSGETVTEDAHTALKLQRRGWDTAYINIRLSAGLATDTLLAHIKQRAR 373 Query: 296 WAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIE 355 WA+G ++ + R Y + L I LA + N Sbjct: 374 WARGMTQILRIDNPLWGR---GLTIAQRLCYMNALLHYQFALPRVIFLISPLAFILFNSS 430 Query: 356 LTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVI-FWMLSLATTL 414 + H +A + L +SL R + + I ++ F ++ Sbjct: 431 IIHASAVMVFAYAIPHL-----FMSLWAHERLSNGRRQPFWGEINETLLAFHLIKPTLIT 485 Query: 415 VSFTRVMLMPKKQRA 429 +S+ + + Sbjct: 486 LSYPKKGSFNVTDKG 500 >UniRef50_C6X7P6 Cellulose synthase catalytic subunit (UDP-forming) n=1 Tax=Methylovorus sp. SIP3-4 RepID=C6X7P6_METSD Length = 744 Score = 259 bits (663), Expect = 1e-67, Method: Composition-based stats. Identities = 78/441 (17%), Positives = 152/441 (34%), Gaps = 29/441 (6%) Query: 7 SFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENA 66 + L+ + Y + + + + S+ + G+ ++ R+ P Sbjct: 83 FLALRYLIWRSTETLIYTGPLDFIAMTLLYLAEVYSLTIYMLGILLNLWPLRNRPLKLP- 141 Query: 67 PAPQLKDNPSISIIIPCFNEEKNVEE-TIHAALAQRYEN--IEVIAVNDGST------DK 117 P L D P++ + IP ++E ++ T AA Y + + +DGST Sbjct: 142 --PNLADYPTVDVFIPTYDESDDIIRITTIAASQIDYPKDKLRIHICDDGSTINKRNNPA 199 Query: 118 TRAI-------LDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDR 170 T + L R+A + + KA L E + +D D + Sbjct: 200 TAEVAWARYYRLRRLAQDLGANYITRETNVAAKAGNLNHALHQTHGELFLVLDCDHVPTH 259 Query: 171 DAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVG-----EYSSIIGLIKRTQRIYG 225 D V +P++ V + + + E I R+ + Sbjct: 260 DILRRTVGFFAADPKLFLVQTPHFFINPTPVEKNLIGVGNPNRENDMFYKEIHRSLDFWN 319 Query: 226 NVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPET 285 + + G A RR + EVG + ITED + S+ L + Y R + L PE+ Sbjct: 320 SSY-FCGSAALLRRRYVMEVGGIAGKTITEDAETSFHLHSLGYNSIYLDRPMVCGLSPES 378 Query: 286 LKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAV 345 K QR RWAQG ++FL + + + +F +F +C + F L + + Sbjct: 379 YKDYILQRTRWAQGMVQLFLLSNPLMAKGLSFHQRLCYFNFC---FYWFFGLARVMFFIA 435 Query: 346 QLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIF 405 + L++++ + T + + F++ R S ++ + + Sbjct: 436 PALFLLLSLKIYFASFTQVISYAVPYI-FSTFMLMDFFYGRARQPFFSEIYESVQSIFLA 494 Query: 406 WMLSLATTLVSFTRVMLMPKK 426 + + PK Sbjct: 495 PAVIAVLFNPHKPSFKVTPKG 515 >UniRef50_A5FCY6 Candidate polysaccharide-forming beta-glycosyltransferase; Glycosyltransferase family 2 n=2 Tax=Flavobacteriaceae RepID=A5FCY6_FLAJ1 Length = 477 Score = 259 bits (663), Expect = 1e-67, Method: Composition-based stats. Identities = 93/471 (19%), Positives = 167/471 (35%), Gaps = 58/471 (12%) Query: 22 AYFHSGELMMRFVFFWPFFMSIMWIVGGVYF--WVYRERHWPWGENAPAPQLKDNPSISI 79 + ++ + + MS I+ + + + + + P +S+ Sbjct: 8 ITIFTHIYLILILAYAILIMSSYLILAYLSGKELRGYLKKNSFIDYDVLLTSEFAPKLSL 67 Query: 80 IIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDR--------------- 124 I P +NE +EE + + L+ Y EVI VNDGS D + +L Sbjct: 68 IAPAYNEGFTIEENVKSLLSLNYNQYEVIVVNDGSKDNSMEVLINTYDLVLTELAFHQQI 127 Query: 125 --------MAAQIPHLR--VIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAA 174 ++ + ++ +N GKA AL G A++ Y+VCID D +LD+D+ Sbjct: 128 ETKKIRGLYVSRNAAFKKLIVVDKENGGKADALNAGLNIAQNPYVVCIDVDCILDKDSLL 187 Query: 175 YIVEPMLYNP--RVGAVTGNPRIRT--------------RSTLVGKIQVGEYSSIIGLIK 218 + +P L + R+ A G RI L+ +IQV EY L + Sbjct: 188 KLAKPFLESSGQRIIATGGVVRIANQCIIKNGRLIEVNVPDLLLPRIQVLEYLRAFLLGR 247 Query: 219 RTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQL------NQWTIFY 272 + +SG AF + G +S + ED+++ +++ +++ Y Sbjct: 248 MAWGRLDGLLLISGAFGAFDKEIALLAGGYSTKTVGEDMELVVRMRRYMLENNLPYSVRY 307 Query: 273 EPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIW 332 P LCW PE K KQR RW +G E + + + L Y + Sbjct: 308 IPDPLCWTEAPEDFKIFKKQRSRWMRGTIETLDFHKKMFLNPKYKLLGMLSIPYWTLFEF 367 Query: 333 AFTC--LVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRY--E 388 +G +I V LN + + ++ ++Y + Sbjct: 368 LAPGIEFIGLVITIVFFIFGLLNWHFFLLLLLFVYTFAVF-FSVIALYSEEQTYHKYPKQ 426 Query: 389 HNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRGIL 439 + L P F L++ L+ + ++ K W R Sbjct: 427 RDFFKLLLAAFIEPFYFHPLTVYAALIGYKEKVMGTKG----WGEMTRKGF 473 >UniRef50_A4WR49 Cellulose synthase (UDP-forming) n=6 Tax=Rhodobacteraceae RepID=A4WR49_RHOS5 Length = 788 Score = 259 bits (662), Expect = 2e-67, Method: Composition-based stats. Identities = 83/434 (19%), Positives = 150/434 (34%), Gaps = 31/434 (7%) Query: 1 MINRIVSFFILCLVLCIPLCVAYFHSG---ELMMRFVFFWPFFMSIMWIVGGVYFWVYRE 57 M+ R + +++ F + L FVF F + +G + + Sbjct: 63 MVPRFLLLSAASMLVMRYWFWRLFETLPPPALDPSFVFAMLLFAVETFSIGIFFLNGFLS 122 Query: 58 RHWPWGENAPAPQLKDNPSISIIIPCFNEEKN-VEETIHAALAQRYE-NIE-VIAVNDGS 114 P Q +D P++ I++P +NE + + T+ AA Y + V+ +DG Sbjct: 123 ADPTDRPFPPQVQPEDLPTVDILVPSYNEPADMLSVTLSAAKNMIYPSRLRTVVLCDDGG 182 Query: 115 TD---------------KTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYL 159 TD + R L ++ ++ + KA + K + + Sbjct: 183 TDQRCLSPDPEIARQAQERRRELQQLCRELGVVYSTRERNEHAKAGNMSAALERLKGDLV 242 Query: 160 VCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTL-----VGKIQVGEYSSII 214 V D D + RD A V + +P++ V + +G E Sbjct: 243 VVFDADHVPSRDFLARTVGYFVEDPKLFLVQTPHFFINPDPIQRNLALGDDCPPENEMFY 302 Query: 215 GLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEP 274 I R +G F G A RR AL +VG ++ + ITED + + ++ W Y Sbjct: 303 AKIHRGLDRWGGAF-FCGSAAVLRRRALDDVGGFAGETITEDAETALEIHARGWKSLYID 361 Query: 275 RALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAF 334 RA+ L PET +QR RWA G ++ Y + + F Sbjct: 362 RAMIAGLQPETFASFIQQRGRWATGMMQMLRLKNPLF---RPGLGIAQRLCYLNSMSFWF 418 Query: 335 TCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSS 394 LV L + +++ + + F+V + +R L S Sbjct: 419 FPLVRMTFLIAPLIYLFFGVQVFVATFEEVLAYM-PGYLAVSFLVQNALFSRQRWPLVSE 477 Query: 395 LFWIIWFPVIFWML 408 ++ + P + + Sbjct: 478 VYEVAQAPYLSRAV 491 >UniRef50_C9RLY4 Glycosyl transferase family 2 n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RLY4_FIBSS Length = 517 Score = 259 bits (662), Expect = 2e-67, Method: Composition-based stats. Identities = 87/433 (20%), Positives = 164/433 (37%), Gaps = 15/433 (3%) Query: 11 LCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQ 70 + +L + V Y +G ++ + F + SI + +R Sbjct: 1 MSTILLYAMFVVYVIAGVGLVIYGF--SCYYSIYLFLKNSRKTRLSDRKAILKYYR-EHS 57 Query: 71 LKDNPSISIIIPCFNEEKNVEETIHAALAQRYEN--IEVIAVNDGSTDKTRAI----LDR 124 L D P ++ +P FNE VE + A A Y E+ ++D STD+ + + Sbjct: 58 LADLPQVTTQLPVFNEANCVERLLEAVCAIDYPKDKHEIQVLDD-STDECYEVTKKKVAE 116 Query: 125 MAAQIPHLRVIHLAQNQG-KAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYN 183 +AA+ +++IH + KA ALK G A AK E+L D D + ++D V ++ + Sbjct: 117 LAARGYDIKLIHRTNRKDFKAGALKEGMAVAKGEFLAIFDADFVPEKDFLLKTVPYLVMD 176 Query: 184 PRVGAVTGNPRIRTRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSAL 242 P+VG V G R+ + + Q + + + +G +R+ A+ Sbjct: 177 PQVGLVQGRWGHLNRTESGLTLAQSIGIDGHFVIEQSARSWGKLFMNFNGTAGVWRKDAI 236 Query: 243 AEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAE 302 G W D +TED+D+S++ QL W + + + +P + Q+ RWA+G + Sbjct: 237 YGGGGWEGDTLTEDMDLSYRSQLAGWKMKFVFDVIVPAELPNDINAFKAQQFRWAKGSIQ 296 Query: 303 VFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAAT 362 +K + ++ R + + T C++ + A L + A Sbjct: 297 TAIKILPKVLRSKVPLRVKIGAILHTTHYSIHPCMLFTALCAWPLLAFFEPVGHLPTWA- 355 Query: 363 HTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVML 422 +T G L + V + R + + I L + + + V Sbjct: 356 YTVGFAFIFLAAIAPSVLYFVAQRCSGYTGWKIRLLSL--PILMALGVGIAVSNSRAVFA 413 Query: 423 MPKKQRARWVSPD 435 + +V Sbjct: 414 AVLGTKGSFVRTP 426 >UniRef50_B9YCC6 Putative uncharacterized protein n=1 Tax=Holdemania filiformis DSM 12042 RepID=B9YCC6_9FIRM Length = 477 Score = 259 bits (662), Expect = 2e-67, Method: Composition-based stats. Identities = 103/461 (22%), Positives = 170/461 (36%), Gaps = 57/461 (12%) Query: 26 SGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFN 85 + F F+ S + V R+ + + +S+I+P N Sbjct: 8 FLANVEFFFVFYLIGYSTFLFISVVVGASVLYRNRYLEKMKNELKHDYYVPVSVIVPAHN 67 Query: 86 EEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLA------- 138 EE V +T+ + L Y+ EV+ V+DGSTDKT L + I R I L Sbjct: 68 EEVTVVDTVRSLLQLDYKLYEVLVVDDGSTDKTSQRLIKAFNLIETQRPIRLQVPCQKQK 127 Query: 139 -----------------QNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPML 181 +N GKA AL G A++ Y VC+D D+LL RD+ Y V P+L Sbjct: 128 AVYEREIHGIRVTLIIKENGGKADALNMGINASRFPYFVCMDADSLLQRDSLEYAVRPLL 187 Query: 182 YNPRVGAVTGNPRIRTRSTL--------------VGKIQVGEYSSIIGLIKRTQRIYGNV 227 + V AV G RI TL + +Q+ EY + Y Sbjct: 188 ESDEVIAVGGLVRISNGVTLENGKVKHYHLPFNPLVCMQILEYDRSFLASRILMDQYNGN 247 Query: 228 FTVSGVIAAFRRSALAEVGYWSDDMITEDIDIS------WKLQLNQWTIFYEPRALCWIL 281 +SG F++ + VG + + ED+++ + + I Y P A+CW Sbjct: 248 LIISGAFGLFKKDLVITVGGYDRSTMGEDMELVTKLHVFCRAHDIPYAIRYAPDAICWSQ 307 Query: 282 MPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPL------FFEYCLTTIWAFT 335 P TL L KQR RW G + K + + + +E I F Sbjct: 308 APSTLLDLTKQRRRWHIGLFQSLTKYRRLFLNFKYGFVSLISYFYYLLYELLSPYIELFG 367 Query: 336 CLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSL 395 L + + L P + + A + + L +L I ++ ++ Sbjct: 368 VLTIVLAWVTNLINYPFVVYFYLVYACFSGVLTLVAFFSRIHTQNLTIS---FSDVVKAV 424 Query: 396 FWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 ++ + + A ++S +L +K++ W + R Sbjct: 425 GLCLFEIIFLRLYLAAVRMLS----LLSYRKRKNHWEAITR 461 >UniRef50_Q31NE1 Cellulose synthase (UDP-forming) n=2 Tax=Synechococcus elongatus RepID=Q31NE1_SYNE7 Length = 740 Score = 258 bits (661), Expect = 2e-67, Method: Composition-based stats. Identities = 84/421 (19%), Positives = 141/421 (33%), Gaps = 15/421 (3%) Query: 14 VLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKD 73 L + S L + F+ ++ + +R + A K Sbjct: 116 WRLFALNNGHIFSLTLSIIFLLTESLYVLSFLMQFYPTLVFDPKRRSRQADQQEALLSKF 175 Query: 74 NPSISIIIPCFNE-EKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHL 132 +PS++I IP +NE + + TI A YEN E+ ++DG + RAI + + Sbjct: 176 SPSVAIWIPIYNEHPRIIRRTILACQLIDYENKEIYVLDDGHRSEIRAIATELG-----V 230 Query: 133 RVIHLAQN-QGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTG 191 + N KA L S+ + D D L + V N + V Sbjct: 231 HYLSRPDNTHRKAGNLNYALNHTNSDLIAVFDCDFLPFNNFLKRTVG-FFANEEIALVQT 289 Query: 192 NPRIRT-----RSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVG 246 R+ + + + I+ + + +V G RRSAL +VG Sbjct: 290 PQHYYNSDFHTRNLGLDYVLPNDMDYFFHYIQPIRDQFNSVIC-CGTSYVARRSALEDVG 348 Query: 247 YWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLK 306 + D I ED KL LN W + Y L +P L +QRLRW QG +++ Sbjct: 349 GYYTDCIVEDFQTGTKLLLNHWRVVYLNEVLSIGEVPRHLSEYLQQRLRWMQGNIQLYCS 408 Query: 307 NMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAG 366 + W Y I+ T + I + L L Sbjct: 409 HKQLPIWSGRLTTW-QRLFYLSILIYCLTPFMRAIYILLPLLSFLFGFTLIAAPPIEYFY 467 Query: 367 ILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKK 426 L + L+ S + N Y T I+ +P + ++ + ++ K Sbjct: 468 YGLPFILLIYGATSWLTYNHYFLYWTEIYESIMCWPSLQRIIQVLFNPFGNFGSLVTAKG 527 Query: 427 Q 427 + Sbjct: 528 E 528 >UniRef50_A0LZL8 Transmembrane family-2 glycosyl transferase-possibly involved in biofilm formation n=1 Tax=Gramella forsetii KT0803 RepID=A0LZL8_GRAFK Length = 483 Score = 258 bits (661), Expect = 2e-67, Method: Composition-based stats. Identities = 89/472 (18%), Positives = 164/472 (34%), Gaps = 60/472 (12%) Query: 25 HSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWG----ENAPAPQLKDNPSISII 80 + + F+F + + G+ + D PS+++I Sbjct: 12 YYIYWISVFLFIGYGASLLTLYLTGILLARRSIKRTKQRSIFLRAEDIVSATDIPSVTVI 71 Query: 81 IPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRA-------------------- 120 P +NE + E + + L+ +Y E++ VNDGS D + Sbjct: 72 APAYNEGLTIIENVKSLLSIQYPYYELLLVNDGSKDDSLQKLIDEFDLEQQDATFITQPI 131 Query: 121 -------ILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAA 173 I + HL VI N GKA A+ G A +E ++ D D ++++DA Sbjct: 132 PTASVKHIYRSRKTKYTHLTVID-KNNGGKADAINAGVNFANTELVIFTDVDCIIEQDAI 190 Query: 174 AYIVEPML----------------YNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLI 217 +V P L N V +R S L+ QV EY L Sbjct: 191 LKMVRPYLEEDDKEIIGCGGGIGIANDSVIKNGILTELRLPSRLIPMTQVVEYIRAFLLG 250 Query: 218 KRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLN------QWTIF 271 + + +SG + R + EVG ++ + ED+++ +L+ + + + Sbjct: 251 RMAWSEINGLMLISGAFGMYPRKRVIEVGGFNPKTVGEDLELCIRLRKHMEDKKIPYKVV 310 Query: 272 YEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTI 331 Y P LCW P K L QR RWA+G E + L+ ++ M LF+ Y L Sbjct: 311 YLPETLCWTEGPSDYKILVMQRDRWARGLWETMSLHKDLLFNRKYGNMGLLFYPYWLLFE 370 Query: 332 WAFTCL-VGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHN 390 + + ++ + + I A ++ C + + + + Y Sbjct: 371 FGAPIVEFMGVVCIIAFTILGWINWPFAILLFFAAYLVGCVFSTIAIFMYVKSFDHYTKP 430 Query: 391 --LTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRGILR 440 + L P ++ + L + + + + W + R + Sbjct: 431 KQIVELLLAAYLEPFLYHPILLFGQMKGYYKKLFDITSG---WGNMTRKGFK 479 >UniRef50_A3HSN4 Glycosyl transferase, group 2 family protein n=1 Tax=Algoriphagus sp. PR1 RepID=A3HSN4_9SPHI Length = 486 Score = 258 bits (661), Expect = 2e-67, Method: Composition-based stats. Identities = 85/469 (18%), Positives = 162/469 (34%), Gaps = 60/469 (12%) Query: 24 FHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRE------RHWPWGENAPAPQLKDNPSI 77 F LM F+ + I+ V + R + + P + Sbjct: 4 FLFYFLMEALGIFFLLISFTVIIIYFVTMILSALELRDHLRKNQFADYQDIITSPIAPGV 63 Query: 78 SIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTR------------AILDRM 125 SII P +NE +++ + + ++ Y EVI +NDGS D T Sbjct: 64 SIIAPAYNEGQSIVQNAKSLMSLHYGKFEVIIINDGSKDDTLQKLLEAFELEKTEFAYDY 123 Query: 126 AAQIPHLR-------------VIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDA 172 +R V+ +N GKA AL +G A E + CID D +L D+ Sbjct: 124 QIDCKQVRGVYKSKNRSYSKLVVVDKENGGKADALNSGINLASLEIIACIDVDCILSHDS 183 Query: 173 AAYIVEPMLY--NPRVGAVTGNPRIRT--------------RSTLVGKIQVGEYSSIIGL 216 +V P + N +V AV G I ++L+G+ QV EY + Sbjct: 184 ITRMVRPFMEETNRKVIAVGGAIGIANNCDVQDGTVTKYRLPTSLLGRFQVIEYFRAFLM 243 Query: 217 IKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLN------QWTI 270 + + +SG F + + +VG + + ED+++ +++ + + Sbjct: 244 GRMAWTRVNGLMLISGAFGFFNKDLVLKVGGYFPKTVGEDMELVVRMRRYMEEQKIPYRV 303 Query: 271 FYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTT 330 + P LCW +PE L +QR RW +G E + + + + L + + Sbjct: 304 GFVPDPLCWTEVPEDESVLARQRNRWIRGTIETLQLHRPIRFNPKYGILGMLAYPFWNVF 363 Query: 331 --IWAFTCLVGFIIYAVQLAGVPLN-IELTHIAATHTAGILLCTLCLLQFIVSLMIENRY 387 + L+G + + + + LL + + + + Sbjct: 364 EKMAPLIELLGILYTLTLIVLGDFSAFYFFALYTMLYLLSLLVSSFSILYEQVAYHNYKD 423 Query: 388 EHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 + +L ++ ++ P + + + + K W R Sbjct: 424 KKDLHKLIYIVLIEPFVIHPKVVFWGIKGHIDFI----KGIGGWGQMIR 468 >UniRef50_C6J2E9 Glycosyl transferase n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6J2E9_9BACL Length = 488 Score = 258 bits (661), Expect = 2e-67, Method: Composition-based stats. Identities = 90/470 (19%), Positives = 161/470 (34%), Gaps = 55/470 (11%) Query: 14 VLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKD 73 +L L + + +M + M+ + R + + + Sbjct: 7 LLRDILLIYGLFATYYVMSVNLIYFSIMTF-----SFRQILTIFRRAAYSKYNTLSGSEL 61 Query: 74 NPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAA------ 127 P +S+++P FNEE V E + L Y EVI +NDGS D T +L Sbjct: 62 VPPVSLLVPAFNEELTVIENVKCLLTLNYPTYEVIVINDGSKDNTLGVLREEFELMPLPN 121 Query: 128 -------------------QIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALL 168 + PHL VI N GKA +L G A + ID D+LL Sbjct: 122 PDVRHTIDCQPITAVYHNPKYPHLYVID-KPNGGKADSLNAGINLAHYPLISSIDADSLL 180 Query: 169 DRDAAAYIVEPMLYNP-RVGAVTGNPRIRT--------------RSTLVGKIQVGEYSSI 213 ++DA + + NP A+ G+ RI + Q EY Sbjct: 181 EKDALIRMARMYMENPEETVAIGGDVRIANGCMIENGAVKEVSLPRKIWPMFQATEYLKA 240 Query: 214 IGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQL------NQ 267 + + VSG FR+ + VG + ED++I KL + Sbjct: 241 FLGGRIGWSSINGLIIVSGAFGLFRKDYVIAVGGYRGGYPGEDMNIIIKLHRYMLENKLK 300 Query: 268 WTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYC 327 + I + P A+CW P+T + L QR RW +G + ++N + M L Y Sbjct: 301 YRIAFCPEAVCWTQAPDTYRILSNQRKRWGRGNLKNMIENYDMAFNPRYKVMGLLTMPYN 360 Query: 328 LTT-IWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENR 386 + + ++ + + + + L + ++ + R Sbjct: 361 IIFEALNPYFKITGLLALIGYTLMDMTHWRILAIFWLINFLSGYVLSIGALLLEELAFKR 420 Query: 387 YEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 ++ L+ +++ + F+ + L + + + W + R Sbjct: 421 FD-KLSDLCKLLLFSALKFFGYAQLGGLWRI-QGHIQFMRNNNSWGTMTR 468 >UniRef50_C4WPV5 Putative uncharacterized protein n=1 Tax=Ochrobactrum intermedium LMG 3301 RepID=C4WPV5_9RHIZ Length = 426 Score = 258 bits (659), Expect = 3e-67, Method: Composition-based stats. Identities = 92/420 (21%), Positives = 168/420 (40%), Gaps = 9/420 (2%) Query: 19 LCVAYFHSGELMMRFVFFWPFFMSIM--WIVGGVYFWVYRERHWPWGENAPAPQLKDNPS 76 L A+ S + FW + +I+ + + ++PS Sbjct: 6 LGFAFLLSQTTLSIMSVFWYTLIFEFPRYILPFLVAGMVMRSPDAEALIEEDGTASEHPS 65 Query: 77 ISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIH 136 +S+I+ NEE +E + + Q +N E++ V+DGSTD+ + + Q +V+ Sbjct: 66 VSVILVGHNEEDALEACVRSLHEQSIQNFEIVIVSDGSTDRMVEVAQSLVKQGLATKVLS 125 Query: 137 LAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIR 196 + GK+ + A A + ++ +D D DR A Y++ P +P VGAV G+ R Sbjct: 126 TSLRGGKSSGINLACAYASGDIIINVDCDCSFDRYAFEYLLRP-FSDPAVGAVCGDIAPR 184 Query: 197 TRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITE 255 +++ + Q EY I + KR V SG +AFRR AL ++ + E Sbjct: 185 NSDASVISQFQEIEYLQSIAVGKRIASAVNQVVCASGAFSAFRRKALDDIRGFDVG-GGE 243 Query: 256 DIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKE 315 D+D + +L+L W I Y P A+C+ +P T+ +QRLRW + +++ + Sbjct: 244 DLDTTIRLRLKGWRIEYAPEAVCYTDVPTTVYQYIRQRLRWERDAI--WIRYRKHVQLMN 301 Query: 316 NFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLL 375 E + +VG I+ V L + + A I L + +L Sbjct: 302 PLSPAFRITEAVHQWDFLLFNVVGAFIFPVYLVWLFIQYGAFTPAILIAMQIGLLAIDVL 361 Query: 376 QFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPD 435 S R H +L ++ + + + L+++ + R + Sbjct: 362 ILAASAWCTGR--HVFLRNLLFLPGYSLFMTYVMRPVRLIAYIDEWVFSGSHRDNYTPVK 419 >UniRef50_C0G271 Glycosyl transferase family 2 n=1 Tax=Natrialba magadii ATCC 43099 RepID=C0G271_NATMA Length = 466 Score = 258 bits (659), Expect = 3e-67, Method: Composition-based stats. Identities = 98/458 (21%), Positives = 169/458 (36%), Gaps = 53/458 (11%) Query: 27 GELMMRFVFFWPFFM---SIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPC 83 L+ F + + + + + R W P I++I+P Sbjct: 9 VWLISWFFLAYFALVNGSYLFVHLVSIVTLQRGIRDWLLDPVYHPHNSPFLPGIAVIVPA 68 Query: 84 FNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLR---------- 133 +NEE + +T+ + L Y N E++ VNDGS D+T L P Sbjct: 69 YNEEVTIVDTVQSLLNLEYSNHEIVVVNDGSRDETLEQLISTFELEPVGEDVPFDLPCET 128 Query: 134 ------------VIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPML 181 V+ N GKA AL G ID D++++R+A +V+P L Sbjct: 129 VHEVYQSSRVDLVVIDKDNGGKADALNAGVFFTDQPLFCAIDADSIIERNALLEVVQPFL 188 Query: 182 YNPR-VGAVTGNPRIRTRSTL--------------VGKIQVGEYSSIIGLIKRTQRIYGN 226 +P + A G R+ ++ + +Q EY + G+ Sbjct: 189 RDPERMIASGGAVRVANGCSITRGVITDIGLSRSHLANLQTVEYLRAFLSGRIGLSGLGS 248 Query: 227 VFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLN----QWTIFYEPRALCWILM 282 + +SG FR S + EVG +S + ITED+++ +LQ + + I + P + W + Sbjct: 249 LLIISGAFGLFRTSTVREVGGYSTETITEDMELIVRLQKHVSDEKHRITFVPYPVVWTEV 308 Query: 283 PETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTT--IWAFTCLVGF 340 PE L +QR RW +G E + + + + + L + +G+ Sbjct: 309 PERRAVLARQRGRWFKGMLETLWVHRRMIGNPKYGMVGLFALPFFLFVEALGPLIEGLGY 368 Query: 341 IIYAVQLAGVPLNIELTHIAATHTAGI--LLCTLCLLQFIVSLMIENRYEHNLTSSLFWI 398 II LN+ + G+ LL L +L +VS R ++ L + Sbjct: 369 IIVPAAFLLGYLNVPFFVVFLGVAVGVGTLLSWLSVLNEVVSFR-RYRNPRDIALLLLYG 427 Query: 399 IWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 + V F T V+F + + + W R Sbjct: 428 LLEHVPFRQWKSFITWVAFVQYL----RGETSWGEMSR 461 >UniRef50_UPI000178A1DC glycosyl transferase family 2 n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI000178A1DC Length = 462 Score = 258 bits (659), Expect = 4e-67, Method: Composition-based stats. Identities = 82/454 (18%), Positives = 159/454 (35%), Gaps = 48/454 (10%) Query: 29 LMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEK 88 ++ + + + ++ R + + + PS+S+++P +NEE Sbjct: 1 MVAIYYVMIVSALYFVILMFSFRNIKDIFRRATYSKYNTLSGSELVPSVSLLVPAYNEEL 60 Query: 89 NVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDR------------------------ 124 + E + + Y EVI +NDGS+D+T +L Sbjct: 61 TIAENVRCLMTLNYPTYEVIVINDGSSDETLKVLIEEYGLTLLPNPEIRGNISTNKVRGI 120 Query: 125 -MAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYN 183 Q P L +I +N GKA +L G ++ + ID D+LL++DA + + N Sbjct: 121 YYNPQYPQLYLID-KENGGKADSLNAGINLSRYPLISSIDADSLLEKDALIRMARMYMEN 179 Query: 184 P-RVGAVTGNPRIRT--------------RSTLVGKIQVGEYSSIIGLIKRTQRIYGNVF 228 P A+ G+ RI + Q EY + + + Sbjct: 180 PEETVAIGGDVRIANGCVIENGEVKDVSLPRRMWPMFQSIEYLKAFLGGRIGWSHFNGLI 239 Query: 229 TVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLN------QWTIFYEPRALCWILM 282 VSG FR+ + VG + ED++I KL ++ + + P A+CW Sbjct: 240 IVSGAFGLFRKDYVIAVGGYRGGYPGEDMNIIIKLHRYMLENKIKYRVSFCPEAVCWTQA 299 Query: 283 PETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFII 342 P++ L QR RW +G + ++N ++ + M L Y + + Sbjct: 300 PDSYNILSNQRKRWGRGNLKNMIENRDMVFNPKYKVMGLLTMPYNVIFEALNPYFRITGL 359 Query: 343 YAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFP 402 A+ + + L L + ++ + R L+ + +++ Sbjct: 360 LALLGYVLLDMTNWHVLVVFGLVNFLSGYLLSVGALILEEMAFRRYTKLSDLVKMLLFSA 419 Query: 403 VIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 + F L + + + W + R Sbjct: 420 LKFVGYHQLGVLWRV-QGHIQYLQNNNSWGTMTR 452 >UniRef50_C5A6A1 Glycosyltransferase n=2 Tax=Thermococcus RepID=C5A6A1_THEGJ Length = 451 Score = 257 bits (657), Expect = 6e-67, Method: Composition-based stats. Identities = 105/436 (24%), Positives = 182/436 (41%), Gaps = 19/436 (4%) Query: 7 SFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENA 66 F + I L + FV + + I+ ++++ + Sbjct: 24 LFVQEAVWKVISLLFQGV-PRYPVYAFVRYLFYLTGAFVILIYAAYFIFYLHFRENEGSK 82 Query: 67 PAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYE--NIEVIAVNDGSTDKTRAILDR 124 P P+ P +S++IP NE N+E I + Q Y EVI V+DGS D T I +R Sbjct: 83 PNPR--FFPKVSLVIPAHNEAMNIERLIESIQYQDYPFECYEVILVDDGSVDGTPEIAER 140 Query: 125 MAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALL-DRDAAAYIVEPMLYN 183 +RVI +N GKA AL+TG AAK + ++ +D D+ D + IVE + Sbjct: 141 YG-----IRVIRHERNMGKAKALETGIKAAKGDVIITLDADSYFADGSSLRNIVENLFSR 195 Query: 184 PRVGAVTGNPRIRTRST-LVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSAL 242 P VG TG RI RS L+ K QV EY + +R Q + V G +AF+ + Sbjct: 196 PFVGVSTGAIRIDVRSGKLIEKFQVIEYLHSFEVGRRVQGYLDWLLVVPGAFSAFKGYLI 255 Query: 243 AEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAE 302 + D + ED +++ T +EP+A + + + L++QR+RW GG + Sbjct: 256 KSLPAIPRDTLAEDFELAMITYRAGLTSNFEPKAAVYTEPATSWRELYRQRIRWYYGGLQ 315 Query: 303 VFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGV---PLNIELTHI 359 V K + ++ F + + + L F I A L + L +E+ I Sbjct: 316 VMAKYHDMIMNRKYGEKGLFLFFHMILLEYILPVLQVFGIVAFPLIMLVHNFLGLEILDI 375 Query: 360 AATHTAGILLCTLCL-LQFIVSLMIEN---RYEHNLTSSLFWIIWFPVIFWMLSLATTLV 415 + + L L LQ++ +++ E N ++L ++ + + + + +L Sbjct: 376 TLPFPLMMAVFLLVLFLQYLPGVLMSGIAMAIERNPRTALGYLPVIFLYYPIYNPLLSLA 435 Query: 416 SFTRVMLMPKKQRARW 431 ++ + W Sbjct: 436 KIDAMLRFLRGVVQSW 451 >UniRef50_Q31R21 Cellulose synthase (UDP-forming) n=3 Tax=Synechococcus elongatus RepID=Q31R21_SYNE7 Length = 749 Score = 257 bits (657), Expect = 6e-67, Method: Composition-based stats. Identities = 69/436 (15%), Positives = 138/436 (31%), Gaps = 18/436 (4%) Query: 1 MINRIVSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHW 60 ++ + + L+ + + + + + ++ + + V Sbjct: 109 VVGILALLTVRYLLWRSLSTLNVSTPLNGGLSLILYGMELLLLVTGLIQLLLSVAARDRQ 168 Query: 61 PWGENAPAP--QLKDNPSISIIIPCFNEEKN-VEETIHAALAQRYENIEVIAVNDGSTDK 117 + Q + P + I++P +NE + T+ L Y V ++DG + Sbjct: 169 AQADQYAVAVQQGQYQPHVDILVPTYNEPVGLLRRTLVGCLTLDYAAKTVHVLDDGDRPE 228 Query: 118 TRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIV 177 A+ ++ + + KA L + E + D D + + A V Sbjct: 229 VAALARQLGCRYQA----RRDRQGAKAGNLNYALPNCRGELVAVFDADFIPRQSFLARTV 284 Query: 178 EPMLYNPRVGAVTGNPRIRTRSTL-----VGKIQVGEYSSIIGLIKRTQRIYGNVFTVSG 232 + R+G V + + + E ++ + G+V V G Sbjct: 285 G-FFQDGRIGLVQTPQSFYNPDPIAYNLDLAEQIPPEDEIFYRHVQPMRDGVGSVVCV-G 342 Query: 233 VIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQ 292 RR AL +G + + ++ED ++ + + Y L L PE+L KQ Sbjct: 343 TSFVVRRQALDAIGGFVTESLSEDYFTGIRIAAAGYQLVYLNEKLSQGLAPESLAAYAKQ 402 Query: 293 RLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPL 352 RLRWA+G + F L + + F+ L + + L Sbjct: 403 RLRWARGTLQAFFIQANPL--TVPGLNPLQRLGHLEGLLHWFSSLPRILFLVMPLGYGV- 459 Query: 353 NIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLAT 412 I LL + L V + R L S ++ ++ + A Sbjct: 460 GIMPLRATGPELLYFLLP-IYLGHLTVFNWLNRRSRSALLSEIYSLVLAVPLAITSLQAL 518 Query: 413 TLVSFTRVMLMPKKQR 428 + + PK R Sbjct: 519 VQPFRCQFAVTPKGLR 534 >UniRef50_B5WDV1 Cellulose synthase catalytic subunit (UDP-forming) n=1 Tax=Burkholderia sp. H160 RepID=B5WDV1_9BURK Length = 779 Score = 257 bits (656), Expect = 7e-67, Method: Composition-based stats. Identities = 74/446 (16%), Positives = 135/446 (30%), Gaps = 34/446 (7%) Query: 7 SFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENA 66 LV + Y + + + I+ G++ R Sbjct: 98 FLTARYLVWRTSSTLDYSGFFDFTFSLLLYSAEIYGIVVASIGLF---VNSRPLHRNVTP 154 Query: 67 PAPQLKDNPSISIIIPCFNEEKNVEE-TIHAALAQRYEN--IEVIAVNDGSTDKTRA--- 120 PS+ + IP +NE + E T+ AAL RY + V ++DG T++ + Sbjct: 155 LLTDAGPLPSVDVFIPTYNESPELLEVTLRAALDIRYPRDRLNVYLLDDGGTEQKLSQAD 214 Query: 121 ------------ILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALL 168 L MA + + KA + E +V D D + Sbjct: 215 PAKAADAQARAQALRNMAQRHGGRYLTRTRNEHAKAGNINAALEQTSGELVVIFDADHVP 274 Query: 169 DRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKI-----QVGEYSSIIGLIKRTQRI 223 D V + ++ V L + E +I+ Sbjct: 275 TADFLERTVGFFQRDAQLYLVQTPHFFINPDPLEKNLGMFGKMPPESEMFYSVIQPGLDF 334 Query: 224 YGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMP 283 + F G A RR + E+G + ITED + + L + Y + L P Sbjct: 335 WNASF-FCGSAAVLRRRCVMEIGGITGQSITEDAETALMLHARGYHSLYLNEPMISGLQP 393 Query: 284 ETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIY 343 ET QR+RWAQG ++ L + Y + F C + Sbjct: 394 ETFASFVVQRVRWAQGMIQLLLLRNPLRMKGLKLAQR---ICYFSNAFFWFFCYARVVFL 450 Query: 344 AVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPV 403 + ++ H L + L + + + + L S ++ ++ V Sbjct: 451 VAPAMYLLFGLQFYHATIPQFFAYGLPQI-LGAILTTDYLFGGHRWTLISEIYELL---V 506 Query: 404 IFWMLSLATTLVSFTRVMLMPKKQRA 429 + L ++++ R + Sbjct: 507 SMFSLRGVISVLAKPRAPTFKVTPKG 532 >UniRef50_Q7NUL9 Cellulose synthase, subunit A n=1 Tax=Chromobacterium violaceum RepID=Q7NUL9_CHRVO Length = 852 Score = 257 bits (656), Expect = 8e-67, Method: Composition-based stats. Identities = 63/435 (14%), Positives = 134/435 (30%), Gaps = 20/435 (4%) Query: 1 MINRIVSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHW 60 ++ + + + L + + + + G + + + Sbjct: 193 LMVFSMVMSTRYIWWRATSTINDDTWLNLAFGLMLLAAEIFAWIVLSLGFFQSSWALKRR 252 Query: 61 PWGENAPAPQLKDNPSISIIIPCFNEE-KNVEETIHAALAQRYE--NIEVIAVNDGSTDK 117 P++ + IP +NE + + T+ +AL + + V ++DG Sbjct: 253 VAPLP---DDRALWPAVDVFIPIYNEPLRVLRPTVMSALEMDWPPDKLRVHILDDGC--- 306 Query: 118 TRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIV 177 R + AA I + KA + + ++ D D + R + Sbjct: 307 -REEVREFAAGIGAGYITRPVHKHAKAGNINHALTVTSAGFIAVFDCDHIPTRSFLRSTM 365 Query: 178 EPMLYNPRVGAVTGNPRIRTRSTLVGKI-----QVGEYSSIIGLIKRTQRIYGNVFTVSG 232 L + ++ V + + E G ++ ++ F G Sbjct: 366 GGFLSDGKLALVQTPHHFFSADPFERNLETHGKMPNEGELFYGRVQDGNDLWNATF-FCG 424 Query: 233 VIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQ 292 A RRS L E G + D +TED S +L + Y L E+L Q Sbjct: 425 SCAVLRRSHLVEAGGIAVDTVTEDAHTSLRLHRLGYRSAYINVVQAAGLATESLSAHIGQ 484 Query: 293 RLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPL 352 R+RWA+G A++F + L + P F Y + I + + Sbjct: 485 RIRWARGMAQIFRSDNPVLGK---GLTLPQRFCYLNAMLHFLNGAPRLIFLTAPMVFLIF 541 Query: 353 NIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLAT 412 ++ + A ++ + + + + R+ + ++ + I + A Sbjct: 542 HVYTIYAQALSIVLYVVPHMAF-SVLANSRLNGRFRRSFWGEVYETVLAWYIVIPTTAAL 600 Query: 413 TLVSFTRVMLMPKKQ 427 + K Sbjct: 601 LFPGLGTFNVTAKGG 615 >UniRef50_Q1Q2A6 Strongly similar to cellulose synthase, catalytic subunit (UDP-forming) n=1 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q2A6_9BACT Length = 763 Score = 257 bits (656), Expect = 8e-67, Method: Composition-based stats. Identities = 76/441 (17%), Positives = 143/441 (32%), Gaps = 29/441 (6%) Query: 10 ILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAP 69 + L+ + Y + + + ++ G++ ++ H A Sbjct: 87 LRYLLWRTFQSLVYTGPLDYAATILLYLAEAYAVAIHFFGLFVNIWPLNHKIAPLPA--- 143 Query: 70 QLKDNPSISIIIPCFNE-EKNVEETIHAALAQRYE--NIEVIAVNDGSTDKTRAI----- 121 + P++ I IP ++E E + T + Y V ++DG T R Sbjct: 144 DSRLYPTVDIFIPTYHEPEDVIRTTALSCTQIDYPKDKFNVYILDDGGTVAKRNHPEKGM 203 Query: 122 --------LDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAA 173 L RMA ++ + KA L +E ++ +D D + RD Sbjct: 204 DAWKRHYSLKRMAKELGISYITRQQNVHAKAGNLNHALQHTSAELVLILDCDHVPTRDIL 263 Query: 174 AYIVEPMLYNPRVGAVTGNPRIRTRSTL-----VGKIQVGEYSSIIGLIKRTQRIYGNVF 228 L + + V + GE I + + Sbjct: 264 RNTAGWFLKDEMLFLVQTPHFFINPDPFEKNLAIFGEAPGENEMFYHGIHHGLDSWNASY 323 Query: 229 TVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKG 288 G A RR L EVG + + ITED + + L + Y R + L PE Sbjct: 324 -FCGSAALLRRKYLMEVGGIAGETITEDAETTVALHSKGYNSVYIDRPMVCGLSPENFDD 382 Query: 289 LWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLA 348 + QR RWAQG ++F+ + W Y +++ F FI Y V A Sbjct: 383 MLVQRSRWAQGMIQIFILKNPLFAK---GMKWYQRLCYMSASLFWFFGFSRFIFYIVPAA 439 Query: 349 GVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWML 408 + +++ H + L + + +V+ + ++ S ++ + + ++ Sbjct: 440 FLLFGLKVYHASFAEIVAYALPHV-IGCIVVTDFLHGKFRWPFFSEIYESVLSIFLMPVI 498 Query: 409 SLATTLVSFTRVMLMPKKQRA 429 + PK Q Sbjct: 499 FSVIRNPRHPTFKVTPKGQSL 519 >UniRef50_C5S5F1 Glycosyl transferase family 2 n=1 Tax=Allochromatium vinosum DSM 180 RepID=C5S5F1_CHRVI Length = 879 Score = 257 bits (656), Expect = 9e-67, Method: Composition-based stats. Identities = 75/392 (19%), Positives = 130/392 (33%), Gaps = 14/392 (3%) Query: 7 SFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENA 66 + + + Y ++ + I+ ++ + W W Sbjct: 349 AISTAAVWVVYDYTRQYMTPATAIVGVLLLIGGVGVIVLLMAEAHEWAESVWLRRWRRPF 408 Query: 67 PAP--QLKDNPSISIIIPCFNEEKNV-EETIHAALAQRYENIEVIAVNDGSTDK-TRAIL 122 P P +S+ +P +NE + +ET+ A Y + EV+ +++ + D + Sbjct: 409 PLRSVPDDQLPFVSVHVPAYNEPPELLKETLDGLAALDYPHFEVLVIDNNTKDPAVWEPV 468 Query: 123 DRMAAQ-IPHLRVIHLAQNQG-KAIALKTGAAAA--KSEYLVCIDGDALLDRDAAAYIVE 178 A+ R H+ G KA AL +++ + ID D ++ ++V Sbjct: 469 RDYCARLGERFRFFHVDPLAGYKAGALNFALRHTDPRADVVAVIDADYIVRPPWLRHLV- 527 Query: 179 PMLYNPRVGAVTGNPRIRTRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAF 237 P +P V V R + + EY L T+ + G + Sbjct: 528 PAFGDPEVAIVQAPQDYRDAHQNAFKAMCMAEYRGFFHLGMVTRNERNAIIQ-HGTMTMI 586 Query: 238 RRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWA 297 RR L V W++ ITED ++ +L Y P LMP+T KQR RWA Sbjct: 587 RRQTLDAVDGWAEWCITEDAELGLRLFEGGHKALYIPCTYGQGLMPDTFADFRKQRYRWA 646 Query: 298 QGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIW-AFTCLVGFIIYAVQLAGVPLNIEL 356 G + L + L + + + F + LA + Sbjct: 647 YGAVRILLHHRRELLGLRGTSLSLGQRYHFVAGWLPWFADGFNLLFNFAALAWSVAMVMA 706 Query: 357 THIAATHTAGILLCTLCLLQFIV--SLMIENR 386 I L L L F + SL + R Sbjct: 707 PDTITPPYMTIALVPLVLFLFKMSKSLFLYRR 738 >UniRef50_Q9RBJ2 Cyclic di-GMP-binding domain n=2 Tax=Gluconacetobacter xylinus RepID=BCSA4_ACEXY Length = 1518 Score = 256 bits (655), Expect = 1e-66, Method: Composition-based stats. Identities = 75/437 (17%), Positives = 142/437 (32%), Gaps = 31/437 (7%) Query: 11 LCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQ 70 +V + V + + + + + ++M + + + R E+ Sbjct: 85 RYIVWRLTATVQFSNWLQTALAVLLLLAEAYALMTLCLSYFQMAWPLRR---REHPLPED 141 Query: 71 LKDNPSISIIIPCFNEEKNVEE-TIHAALAQRYE--NIEVIAVNDGSTDKTRAILDRMAA 127 + PS+ + +P +NEE ++ T+ AL + + V ++DG A Sbjct: 142 MAQWPSVDVFVPSYNEELSLVRSTVLGALDLDWPADRLNVYILDDGRRKAFHDFAVEAGA 201 Query: 128 QIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVG 187 +I N KA L A S + V D D + R + M+ +P + Sbjct: 202 GY----IIRAENNHAKAGNLNHALAVTDSPFAVIFDCDHVPTRGFLRRTIGWMMADPNLA 257 Query: 188 AVTGNPRIRTRSTLVGKI-----QVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSAL 242 + + E + GL++ + F G A RR A+ Sbjct: 258 LLQTPHHFYAPDPFQRNLAGGMHVPPEGNMFYGLVQDGNDFWDATF-FCGSCAIIRREAV 316 Query: 243 AEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAE 302 +G ++ + +TED + K+Q W Y L L E L QR+RWA+G + Sbjct: 317 MGIGGFATETVTEDAHTALKMQRRGWGTAYLREPLAAGLATERLILHIGQRVRWARGMIQ 376 Query: 303 VFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAAT 362 + + L W Y + LA + L + + Sbjct: 377 IMRLDNPML---GAGLRWEQRLCYLSAMSHFLFAIPRLTFLVSPLAFLFLGQNIIAASPL 433 Query: 363 HTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVML 422 + L + +S R E S + I+ + L T + Sbjct: 434 AISVYALPHIFHSVITLS-----RIEGRWRYSFWSEIYETSLALFLVRITIVTLL----- 483 Query: 423 MPKKQRARWVSPDRGIL 439 + + ++ D+G L Sbjct: 484 --QPHKGKFNVTDKGGL 498 >UniRef50_B4AY07 Cellulose synthase (UDP-forming) n=1 Tax=Cyanothece sp. PCC 7822 RepID=B4AY07_9CHRO Length = 734 Score = 256 bits (655), Expect = 1e-66, Method: Composition-based stats. Identities = 65/398 (16%), Positives = 129/398 (32%), Gaps = 19/398 (4%) Query: 11 LCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQ 70 LV + + + V + I + + + + H P+P Sbjct: 107 RYLVWRATSTLNLATPLDGVFSIVIYGIELFFIFSLTVQLCLSLKIKDHSAELVPHPSPV 166 Query: 71 L----KDNPSISIIIPCFNEEKN-VEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRM 125 L + P + I I +NE + T+ A Y + V ++DG+ + +A+ + Sbjct: 167 LGKNTEFTPLVDIFIVTYNEPSFFIRRTVVGCQALDYPHKNVYVLDDGNRPEIKALAREL 226 Query: 126 AAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPR 185 + L KA L + + D D + ++ V + + Sbjct: 227 GCDY----ITRLDNRYAKAGNLNNALVQTTGKLIAVFDADFVPTKNFLRRTVG-FFQDEK 281 Query: 186 VGAVTGNPRIRTRSTL-----VGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRS 240 +G V N + V I E + +R + G +G RRS Sbjct: 282 MGLVQTNQSFYNADPVAKNLGVENILPEELENFYRYYQRLRNTVGATIC-NGSSFVVRRS 340 Query: 241 ALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGG 300 L E+G + + ++ED +L + + Y L L E + QRLRW +G Sbjct: 341 HLEEIGGFVTESVSEDYFTGIRLTAKGYRLIYLNEKLSAGLAAENMSCYMIQRLRWCRGT 400 Query: 301 AEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIA 360 + F N L + ++ T + I+ N+E ++ Sbjct: 401 LQAFFVNSNPL--TIPGLLPIQRLANLEGILFWLTPIPRLILLITPFLFFVFNLEPYQVS 458 Query: 361 ATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWI 398 ++ + +L + + L S ++ + Sbjct: 459 SSDFIYFFMP-YYILNILTFAWLNYHARSALISDIYSV 495 >UniRef50_Q2RZV9 Putative glucosyltransferase n=1 Tax=Salinibacter ruber DSM 13855 RepID=Q2RZV9_SALRD Length = 510 Score = 256 bits (655), Expect = 1e-66, Method: Composition-based stats. Identities = 80/395 (20%), Positives = 146/395 (36%), Gaps = 10/395 (2%) Query: 17 IPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPS 76 + + + V + +++W+ + D P Sbjct: 1 MLAALEVAVPALYAVAIVVLTAYGGNLLWLALVHAASERLRDGPVPDPDDLPVPDDDWPV 60 Query: 77 ISIIIPCFNEEKNVEETIHAALAQRYE--NIEVIAVNDGSTDKTRAILDR----MAAQIP 130 +++ +P +NE + I A + Y +++ ++D STD T + R A+ Sbjct: 61 VTVQLPLYNEAEVAHRLIDACVQLDYPRSRLDIQVLDD-STDATTERVARRVAHWQAEGV 119 Query: 131 HLRVIHLAQNQG-KAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAV 189 ++ + G KA AL G A+ + + D D + +V P +G V Sbjct: 120 NITHVRRDDRTGYKAGALANGLQRARGDLIAIFDADFVPRPSFLRRLVPRFFDAPDLGMV 179 Query: 190 TGNPRIRTR-STLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYW 248 R +L+ K+Q + + +R + + G +G +RR+ + + G W Sbjct: 180 QARWGHLNRDDSLLTKVQAFGLDAHFAIEQRVRELAGCFLNFNGTAGVWRRACIEDAGGW 239 Query: 249 SDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNM 308 + D +TED+D+S++ QL W + Y P A +P + L Q+ RWA+GGAE LK Sbjct: 240 AHDTLTEDLDLSYRAQLQGWRLTYVPAAEAPAELPPDMNALRAQQFRWAKGGAETALKLT 299 Query: 309 TRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGIL 368 RLWR L + LT +AF ++ + L + I Sbjct: 300 GRLWRSAQPWRVKLEGTFHLTAHFAFPFILLAALTHAPLLLLK-GIGHGPGEVYFAVMGF 358 Query: 369 LCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPV 403 F+ L + + L + F Sbjct: 359 GLFGFAGFFLAQLFAQRALHPDWRRRLRLFVPFMA 393 >UniRef50_A9KK06 Glycosyl transferase family 2 n=12 Tax=Bacteria RepID=A9KK06_CLOPH Length = 648 Score = 256 bits (654), Expect = 1e-66, Method: Composition-based stats. Identities = 99/449 (22%), Positives = 186/449 (41%), Gaps = 55/449 (12%) Query: 1 MINRIVSFFILCLVLCI-PLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERH 59 MI+ ++ FFI ++ + V YF + + + F+ ++ G +Y Sbjct: 182 MISSLLLFFITAFIILMRAESVIYFKFNKWLYLYSIIAACFLLSRYLFGFMY-------- 233 Query: 60 WPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYE--NIEVIAVNDGSTDK 117 + P +SI+IPCFNEE+ ++ TI + + Q Y +EVI ++D S DK Sbjct: 234 -----RKVPIDINFTPGVSILIPCFNEEEWIQRTILSCINQDYPVDKLEVIVIDDCSHDK 288 Query: 118 TRAILDRMAAQI----------PHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDAL 167 + ++ + ++ L+ I +N GK AL GA AK E +V +D D+ Sbjct: 289 SVEKIEEIINKLYQEADQFDIKNRLKYIVQKENAGKREALALGALEAKHELVVFVDSDSF 348 Query: 168 LDRDAAAYIVEPMLYNPRVGAVTGNPRIRTR-STLVGKIQVGEYSSIIGLIKRTQRIYGN 226 LD A +V+P +P++G V+G + + + K+Q Y ++K + + Sbjct: 349 LDPFAIRNLVQP-FKDPKMGGVSGRTDVANTFTNTLTKMQSVRYYIAFRIMKAAESYFDA 407 Query: 227 VFTVSGVIAAFRRSALAE----------VGYWSDDMITEDIDISWKLQLNQWTIFYEPRA 276 V +SG +A +++ + E +G +D ++ L + Y+ A Sbjct: 408 VTCLSGPLACYKKEIILENREAWLNQSFLG--QRATFGDDRAMT-NFVLKKHRTAYQDSA 464 Query: 277 LCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTC 336 +C ++P + KQ++RW + L +W+KE F + + Sbjct: 465 ICSTIVPNQYRVFLKQQMRWKRSWLRESLIAGKFMWKKEPFAAITFYMGLIVP------- 517 Query: 337 LVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLF 396 + I+ L VPL + GILL +L + V+ ++ + L LF Sbjct: 518 IAAPIVVIYNLIYVPLQHRVF--PTAFLVGILLMSLLMS---VAQLLFRKSSTWLFGMLF 572 Query: 397 WIIWFPVIFWMLSLATTLVSFTRVMLMPK 425 + V+ W + V+F + + Sbjct: 573 CFYYEAVLLWQM--PIAWVTFWKSTWGTR 599 >UniRef50_B1Y242 Cellulose synthase catalytic subunit (UDP-forming) n=2 Tax=cellular organisms RepID=B1Y242_LEPCP Length = 851 Score = 256 bits (654), Expect = 1e-66, Method: Composition-based stats. Identities = 78/425 (18%), Positives = 136/425 (32%), Gaps = 24/425 (5%) Query: 1 MINRIVSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHW 60 +I + + + E ++ + + +V G R Sbjct: 197 LIGLALLASFRYGWWRVTQSMDLAPGWESLLGGGLLCAELYAWLIMVLGFVQ---TSRPL 253 Query: 61 PWGENAPAPQLKDNPSISIIIPCFNEE-KNVEETIHAALAQRYE--NIEVIAVNDGSTDK 117 P++ + IP +NE + T+ AA + + V ++DG + Sbjct: 254 KRPVAPIELPRDQWPTVDVYIPTYNEPLSVIGPTVLAARDLDWPADRLVVHVLDDGHRPE 313 Query: 118 TRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIV 177 RA + + KA L A EY+ D D + R + Sbjct: 314 VRAYAEAAGVNY----ISRDNNRHAKAGNLNNALAETGGEYIAIFDCDHMPARGFLVNTM 369 Query: 178 EPMLYNPRVGAVTGNPRIRTRSTL-----VGKIQVGEYSSIIGLIKRTQRIYGNVFTVSG 232 ML + V + + E GL++ ++ F G Sbjct: 370 GWMLRDSNCAMVQTPHHFFSADPFERNLDTFRRVPNEGVLFYGLVQDGNDVWNASF-FCG 428 Query: 233 VIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQ 292 A RRSAL ++G + + +TED + KL W Y L E+L G +Q Sbjct: 429 SCAVLRRSALEQIGGIATETVTEDAHTALKLHRKGWRTAYLNVTHAAGLATESLSGHVRQ 488 Query: 293 RLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPL 352 R+RWA+G A++F + L R Y + F L I L + Sbjct: 489 RIRWARGMAQIFRLDNPLLGRGLTVMQR---LCYANAMLHFFYGLPRLIFLTAPLGYLYF 545 Query: 353 NIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLAT 412 + H + LL L L + + ++ V+ W +++ T Sbjct: 546 EFHIIHASVGVLLLYLLPHLVLPHIA-----NAHIAGPHRHTFWSELYETVLAWYVAVPT 600 Query: 413 TLVSF 417 T+ F Sbjct: 601 TVALF 605 >UniRef50_D1PAP3 Putative biofilm PGA synthesis N-glycosyltransferase PgaC n=1 Tax=Prevotella copri DSM 18205 RepID=D1PAP3_9BACT Length = 438 Score = 256 bits (654), Expect = 2e-66, Method: Composition-based stats. Identities = 96/423 (22%), Positives = 174/423 (41%), Gaps = 13/423 (3%) Query: 21 VAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQL---KDNPSI 77 ++ ++ + F + M +I+ + + + + ++NP I Sbjct: 15 ISTLSFSSIVNTYWFLFFIEMPRYYILEYLVIGNRLMKRNQLDKEKEYAKYFLYRENPLI 74 Query: 78 SIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHL 137 SI++P NE K++ + +++ Q Y N E+I V+DGS D T+ I + + + L Sbjct: 75 SILVPGKNEGKHIFKMVNSLAEQTYRNYEIIVVDDGSDDDTKLICNDLYRAGYITHYLRL 134 Query: 138 AQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRT 197 GKA A GA A+ +Y+VC+D D+ LDRDA I+ P + V V G ++R Sbjct: 135 DTRGGKAAASNYGAQMARGKYIVCLDADSSLDRDALEKILLPFYIDGMVKGVGGCVKVRN 194 Query: 198 -RSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITED 256 + T+ +Q EY I + + G +SG AF R L EVGYW D Sbjct: 195 YKETICSSLQAFEYLKRIQVGRIVTSELGIYHIISGAFGAFERKTLKEVGYWDIG-PGLD 253 Query: 257 IDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAE-VFLKNMTRLWRKE 315 D++ K++ + + + A+C +P L+ QR+RW++ K+ L + Sbjct: 254 GDLTQKIRKAGYKVKFAEDAICMTNVPTKWYKLYHQRIRWSRSLVRFRLRKHADILLPTK 313 Query: 316 NFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLL 375 N W + F C + F+ + + + + I + L Sbjct: 314 N---WSILNWLSNMESVVFDCFLNFL-WLWYIIKLAITFNTHIIEVLALGYFIRVCFSQL 369 Query: 376 QFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARW--VS 433 FI+ LM+ R + + + P + L +A L + + + + + W Sbjct: 370 AFILVLMVSERKKDVWFLYRYLPLMSPYTGYFLRIAR-LSAHLQELFFRRSYKDAWNPEK 428 Query: 434 PDR 436 R Sbjct: 429 TSR 431 >UniRef50_A5GEA8 Glycosyl transferase, family 2 n=3 Tax=Deltaproteobacteria RepID=A5GEA8_GEOUR Length = 492 Score = 255 bits (652), Expect = 2e-66, Method: Composition-based stats. Identities = 89/422 (21%), Positives = 160/422 (37%), Gaps = 19/422 (4%) Query: 17 IPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPS 76 + + + + + + +W++ +Y ER P AP ++ PS Sbjct: 1 MLSAIIPVLTAIHFAALLGLCLYGVHRLWLIYCLYMPKGSERSTP----APFAAPEEFPS 56 Query: 77 ISIIIPCFNEEKNVEETIHAALAQRYEN--IEVIAVNDGSTDKTRAILDRMAA----QIP 130 +++ +P +NE E + AA + +E+ ++D S D T ++D+ AA Q Sbjct: 57 VTVQLPLYNERFVAERLLDAAAGLDWPRERLEIQVLDD-SDDDTCRLVDQRAAWWRKQGV 115 Query: 131 HLRVIHLAQNQG-KAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAV 189 + V+ G KA AL G A A EY+ D D + D + P N VG V Sbjct: 116 AITVVRRTSRDGYKAGALANGLATAHGEYIAVFDADFIPPPDFLHATM-PWFRNQDVGMV 174 Query: 190 TGNPRIRTRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYW 248 + IQ + R + G F +G +RRSA+ G W Sbjct: 175 QTRWSFCNADHSWFTGIQSLLLGPHFSIEHRVRYRQGLFFNFNGTAGVWRRSAIESAGGW 234 Query: 249 SDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNM 308 D +TED+D+S++ QL W Y +P T+ L Q+ RWA+G + K + Sbjct: 235 QSDTVTEDLDLSYRAQLAGWRFVYREECQVPSELPVTMAALRSQQQRWAKGSIQTARKIL 294 Query: 309 TRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGIL 368 RL ++ + L + + ++ + I L + L Sbjct: 295 PRLLQERLPPAVKIEAMAHLMANIYWLLGMIVMLTLYPAVTWRVGIGLHQVLRIDLPLFL 354 Query: 369 LCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQR 428 + ++ + + I S L ++ P + L+ + +L S + + P + Sbjct: 355 ATSGAIMSYFLLYSI----RSGSKSLLRHVVLLPALTIGLAPSISL-SVLKGLFRPGGEF 409 Query: 429 AR 430 R Sbjct: 410 ER 411 >UniRef50_B2GJB7 Putative glycosyltransferase n=2 Tax=Micrococcineae RepID=B2GJB7_KOCRD Length = 481 Score = 255 bits (652), Expect = 3e-66, Method: Composition-based stats. Identities = 90/479 (18%), Positives = 168/479 (35%), Gaps = 62/479 (12%) Query: 15 LCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDN 74 L A + + F ++ IV +V R P+ + + Sbjct: 6 LLRGAVAAILAFIAVPVLIYFLLINTSYLVMIVLAAVDFVAHRRRIPFAGREELIRSRTM 65 Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAA------- 127 P IS++ P NEE + + + L+ ++ EVI V+DGSTD +L R Sbjct: 66 PGISVVAPMHNEEVGIRVAVESFLSLQHPRHEVIVVDDGSTDNGFEVLRRAFDLVPVSRQ 125 Query: 128 -----------------QIPHLRV-IHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLD 169 + R+ + +N G++ ++ G A+ + ++ +D D++LD Sbjct: 126 MPEDVPVREAPTSIHVPRDGRTRLTVVRKRNSGRSDSINVGVNLAREDLVLFVDSDSILD 185 Query: 170 RDAAAYIVEPMLYNP-RVGAVTGNPRIRT--------------RSTLVGKIQVGEYSSII 214 A + P + +P A G R + IQV EY Sbjct: 186 PGALLAVSRPFIEDPISTVAAGGVIRAVNGCRVKGGRVLEVRMPGNPLADIQVMEYLRAF 245 Query: 215 GLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQ------LNQW 268 L + + +SG FRR AL VG I ED ++ +L + Sbjct: 246 HLGRAGWSRINALLLISGAFGVFRRDALVTVGGLDASSIGEDFELVMRLHRTFRKQRRPY 305 Query: 269 TIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCL 328 + + +CW +P TL L +QR RW +G E L R+ + Y Sbjct: 306 RVTFLTEPICWTEVPSTLPVLRRQRARWHRGLWETLWAYRDMLGNPRYGRLGLVAVPY-- 363 Query: 329 TTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLC------LLQFIVSLM 382 + L+ ++ L + L L + + +L L + + Sbjct: 364 ---YWLFELIAPLLELAGLVLIALGFALGLVTVQYFVLFMLVAYAYAMIVTLAAVTIEEL 420 Query: 383 IENRY--EHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRGIL 439 ++Y +L +L+ ++ + + L+ + + +L ++Q W + R Sbjct: 421 SYHKYPRWRDLGITLWAVVLENLGYRQLTAWWRMEGWVASLLGRRQQ---WGTMTRSGF 476 >UniRef50_B4W4T5 Cellulose synthase catalytic subunit (UDP-forming) n=2 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4W4T5_9CYAN Length = 883 Score = 255 bits (651), Expect = 3e-66, Method: Composition-based stats. Identities = 69/428 (16%), Positives = 146/428 (34%), Gaps = 21/428 (4%) Query: 1 MINRIVSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHW 60 ++ + + L + + + + F +I+ +V + + + Sbjct: 109 LVLLSLVTTVRYLYYRVNYTLNLDDWLNGIFSLLLFVAELYAILTLVLAYFQTLKIKDRQ 168 Query: 61 PWGENAPAPQLKDNPSISIIIPCFNEEKNVEE-TIHAALAQRYE--NIEVIAVNDGSTDK 117 P + + P + I IP +NE+ + T ALA Y +V ++DG +K Sbjct: 169 PVDLS--LYPEEQWPKVDIYIPTYNEDVEIVRKTTLCALAIDYPADKKKVYVLDDGRAEK 226 Query: 118 T---RAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAA 174 RA L + ++ + + + KA + T + ++ +D D + + Sbjct: 227 YKARRAELRKTCEELGAIMLTRDNNDHAKAGNINTAFRRTTGDLVLILDCDHMPVKQFLK 286 Query: 175 YIVEPMLYNPRVGAVTGNPRIRTRS----TLVGKI-QVGEYSSIIGLIKRTQRIYGNVFT 229 + V +P+V V L+ K ++++ + F Sbjct: 287 HTVGFFF-DPQVAFVQTPHWFFNPDPFERNLLTKGKIPVGNELFYKVLQKGNDFWNAAF- 344 Query: 230 VSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGL 289 G A R+ ++G + + +TED +++L + FY + + L PE Sbjct: 345 FCGSAAVIRKEYALQIGGIATETVTEDCHTAFRLHSLGYKSFYYDKIMVAGLAPEKFSAY 404 Query: 290 WKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAG 349 Q++RWA+G A++ + K N + P Y T F + Sbjct: 405 VGQQVRWARGMAQILRLENPLINPKLNLTI-PQRICYFSATSHFFYGYPRMMYALAPPLF 463 Query: 350 VPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLS 409 + I T L + L S+ + ++ S + I+ + + Sbjct: 464 LLFGINSVKGLGLETLCYALPHIIL-----SMQTNHIPYKHVRFSFWNEIFEFAMSFQAG 518 Query: 410 LATTLVSF 417 + T L Sbjct: 519 IVTLLAVV 526 >UniRef50_Q97J01 Glycosyltransferase, involved in cell wall biogenesis n=1 Tax=Clostridium acetobutylicum RepID=Q97J01_CLOAB Length = 768 Score = 254 bits (650), Expect = 4e-66, Method: Composition-based stats. Identities = 69/436 (15%), Positives = 138/436 (31%), Gaps = 28/436 (6%) Query: 9 FILCLVLCIPLCVAYFHSGELMMRFV---------FFWPFFMSIMWIVGGVYFWVYRERH 59 +++ + ++Y + + + I +G + ++ Sbjct: 26 LFYKVIILMVFYISYGIYLYWRFAYTIPRGYGDIALIAGYVLFIAETIGFLESSMFYLTL 85 Query: 60 WPWG-ENAPAPQLKDNPSISIIIPCFNEEKNV-EETIHAALAQRYE---NIEVIAVNDGS 114 W + P Q K+ P + I I +NE +N+ +T+ Y + + +DG+ Sbjct: 86 WDTNTPSTPEVQDKEFPHVDIFIASYNEPRNLLYKTLIGCKNMDYPDKSKVHIYICDDGN 145 Query: 115 TDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAA 174 + + + + + KA L S Y+V D D + D Sbjct: 146 RKDIKELCESLKVGY----ITREDNTHAKAGNLNNALKQTSSPYVVTFDADMIPMHDFLL 201 Query: 175 YIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQ-----VGEYSSIIGLIKRTQRIYGNVFT 229 + + ++G V + E + LI+ + + + Sbjct: 202 KTIPFFMVEDKIGFVQVPQNFYNADPFQYNLFNESRVPNEQNLFSRLIQAGKNRFNAI-I 260 Query: 230 VSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGL 289 SG R AL E+ ITED K+Q + Y L PE+L+ L Sbjct: 261 YSGSNTVLSREALNEINGLVVGTITEDFATGMKIQSKGYKCIYLNETHASGLSPESLEDL 320 Query: 290 WKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAG 349 + QR+RW +G + F R N ++F + + L I + + Sbjct: 321 YSQRIRWGRGVIQTFKAFNPLFMRGLNIYQKLMYFS---AFSYWYFGLWRLIFFMAPIVF 377 Query: 350 VPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLS 409 + + +A + L + + ++ I FP I + Sbjct: 378 SVFGVVVLSTSAIQMLEVWLPMFIFTNLTF-YHFSKKVRSVSWTHIYDTILFPQITVGIL 436 Query: 410 LATTLVSFTRVMLMPK 425 T ++ + PK Sbjct: 437 KETFGFKMSKFKVTPK 452 >UniRef50_C2KK33 Glycosyltransferase (Fragment) n=2 Tax=Lactobacillales RepID=C2KK33_LEUMC Length = 426 Score = 254 bits (650), Expect = 4e-66, Method: Composition-based stats. Identities = 71/385 (18%), Positives = 135/385 (35%), Gaps = 19/385 (4%) Query: 51 YFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEE-KNVEETIHAALAQRYENIE--- 106 + ++ +N P++ + I +NEE ++ +I AAL Y + E Sbjct: 6 FLQNVLKKFQRPVQNLDVYADGQYPTVDVFIATYNEETPILKRSIVAALLMHYPSPELVN 65 Query: 107 VIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDA 166 + +DGS D + + + + V+ KA L + E +V +D D Sbjct: 66 IYVCDDGSRDNVKKLCEELGVHH----VVRETHEHAKAGNLNHALKVSSGELVVTMDADM 121 Query: 167 LLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRST-----LVGKIQVGEYSSIIGLIKRTQ 221 + D V +N +G + + + I++ + Sbjct: 122 VPRTDFLEKTVG-YFHNKNMGFIQTPQTFFNDDPYQFNLFATNTIGNDQDFFMRSIEQQK 180 Query: 222 RIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWIL 281 + V G A FRR AL VG ++ +ITED+ LQ +W + L L Sbjct: 181 DAFNATMYV-GSNAIFRRKALDSVGGFATGVITEDMATGMLLQAKKWDTGFVNENLASGL 239 Query: 282 MPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFI 341 PET L KQR RWA+G +V K + + + Y + F+ + + Sbjct: 240 APETFGDLIKQRDRWARGNIQVAKKWHPWRVKGLDLMQCVI---YTDGIHYWFSGIYKLV 296 Query: 342 IYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWF 401 + + + + L Q +L + + + + S+++ Sbjct: 297 FLLAPILFLVFGQYSLQTSFSQILIFWLPAFIASQLTFNL-VSEKKQTVMLSNIYETATA 355 Query: 402 PVIFWMLSLATTLVSFTRVMLMPKK 426 P + + + L S + + K Sbjct: 356 PFMAYGVFNELFLKSKQKFAVTRKG 380 >UniRef50_Q05S47 Group-specific protein n=1 Tax=Synechococcus sp. RS9916 RepID=Q05S47_9SYNE Length = 422 Score = 254 bits (650), Expect = 4e-66, Method: Composition-based stats. Identities = 92/384 (23%), Positives = 153/384 (39%), Gaps = 10/384 (2%) Query: 26 SGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFN 85 ++ + W +I+ + + R + W E P P +SI+IP N Sbjct: 5 WPLILSALLGTWIIRTAIVVALAHQHNQAERTKEANWSEQQSTPC---KPLVSILIPACN 61 Query: 86 EEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIP--HLRVIHLAQNQGK 143 EE + + I + + Y N E+I +NDGS D T A P + VI A NQGK Sbjct: 62 EEATITDCIQSCIQSNYGNKEIIIINDGSNDATSDQAHIAQANHPKARITVIDFACNQGK 121 Query: 144 AIALKTGAAAAKSEYLVCIDGDALL-DRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLV 202 AL G A E++V +D D D + ++ P++ + A TGN +I + Sbjct: 122 TAALNEGLLYANGEFIVTLDADTRFSDPSSLTRLLNPLIQYQDIAACTGNLKIANPDRTI 181 Query: 203 GKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWK 262 IQ EY+ II KR Q + + G ++AFR + L +G +S + ED D + Sbjct: 182 PNIQSIEYTKIIQTFKRAQSHVNAIMILPGAVSAFRATELRSIGGFSASTLAEDADATMS 241 Query: 263 LQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPL 322 L L + + + A P T++ L KQR+RW G + K+ + Sbjct: 242 LLLRRNRLLFTSSASAITQGPNTVQELLKQRIRWRVGQLQCLWKHRKLC----GQSLSTA 297 Query: 323 FFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLM 382 FF + + F++ + + + IA + + + + Sbjct: 298 FFFVDTASTNLISAAAPFVLAVTGVYVIQKGLWQQPIATVAGFVAIDIAATVFAYTAAQQ 357 Query: 383 IENRYEHNLTSSLFWIIWFPVIFW 406 L LF+ ++ P I W Sbjct: 358 KTPSIPDYLRHLLFFSLFNPFITW 381 >UniRef50_C0A1N7 Glycosyl transferase, family 2 n=1 Tax=Opitutaceae bacterium TAV2 RepID=C0A1N7_9BACT Length = 498 Score = 254 bits (649), Expect = 5e-66, Method: Composition-based stats. Identities = 88/421 (20%), Positives = 152/421 (36%), Gaps = 16/421 (3%) Query: 32 RFVFFWPFFMSIMWIVGGVYF-WVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNV 90 F+ + + + ++ + + P P P + I +P +NE V Sbjct: 4 IFLVVYACTLGGLLFFALHRIKMLWLYARYSRRQTPPPPLNGPLPRVCIQLPLYNESLVV 63 Query: 91 EETIHAALAQRYEN-----IEVIAVNDGSTDKTRAILDRMAAQIP------HLRVIHLAQ 139 E + A R+ N +E+ ++D STD+T AI++R A P H+ I Sbjct: 64 EALLDKVSAIRWGNGGDETLEIQILDD-STDETTAIIERWMAANPVRVATAHISHIRRPN 122 Query: 140 NQG-KAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTR 198 G KA AL G ++E+ D D + D ++ P + ++G V R Sbjct: 123 RHGYKAGALSYGMTLTEAEFFAIFDADFRPEPDFLEQLM-PHFADTKIGVVQARWEFANR 181 Query: 199 -STLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDI 257 S+L+ + Q + + + + G F +G +RR AL E G W+DD +TED+ Sbjct: 182 KSSLLTRFQGVFLDAHFVVEQEARYAAGLFFNFNGTAGIWRRRALDEAGGWTDDTVTEDL 241 Query: 258 DISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENF 317 D+S++ QL W Y +PE++ Q+ RW +GG +V K + + Sbjct: 242 DVSYRAQLRGWKFIYRADYAVPSELPESMTAFKSQQRRWTKGGMQVMRKQIATIACSGAP 301 Query: 318 RMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQF 377 L + LV F I V + LL + F Sbjct: 302 SRSKQEAILHLLVGFVHPLLVLFAISFVPYLILAGQRPTGLWVFFSPVMALLIGAGSVAF 361 Query: 378 IVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRG 437 ++ E + F + T V+ + + R R Sbjct: 362 YITAQYFRHREWREGVLWLLTSPIFMAFGLAMSVTGTVAVIEGLCQRGGEFVRTPKGGRA 421 Query: 438 I 438 + Sbjct: 422 V 422 >UniRef50_C5AKK6 Glycosyl transferase, family 2 n=5 Tax=Burkholderiales RepID=C5AKK6_BURGB Length = 734 Score = 254 bits (649), Expect = 5e-66, Method: Composition-based stats. Identities = 72/425 (16%), Positives = 147/425 (34%), Gaps = 24/425 (5%) Query: 1 MINRIVSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHW 60 MI V +V + + +++ + + + ++ G + + + Sbjct: 91 MIMLSVITTGRYMVWRLSDTTYWSRPLDMIWGVLLVAAEIYAALILLLGYFQTAWPLKRK 150 Query: 61 PWGENAPAPQLKDNPSISIIIPCFNEE-KNVEETIHAALAQRYE--NIEVIAVNDGSTDK 117 P A D P++ + IP +NE V+ T++AALA Y + + ++DG Sbjct: 151 PVPMPA---SRADWPTVDVFIPTYNEPLSVVKPTVYAALALDYPPDKLTIHVLDDGRRAD 207 Query: 118 TRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIV 177 +A + + I KA + E+ D D + R + Sbjct: 208 FKAFCEEVGVNW----TIRAHNRYAKAGNINEALKITHGEFFAVFDCDHIPTRSFLQLTL 263 Query: 178 EPMLYNPRVGAVTGNPRIRTRSTL-----VGKIQVGEYSSIIGLIKRTQRIYGNVFTVSG 232 L + ++ + + + E GL++ ++ F G Sbjct: 264 GWFLRDRQLSLLQTPHHFFSADPFERNLGTFRKVPNENELFYGLVQDGNDLWNAAF-FCG 322 Query: 233 VIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQ 292 A RRS + E+G + + +TED S KL +T Y L E+L G Q Sbjct: 323 SCAVLRRSMVEEIGGIAVETVTEDAHTSLKLHRLGYTTAYLAIPQAAGLATESLSGHIGQ 382 Query: 293 RLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPL 352 R+RWA+G ++F + + R Y + F + + L+ + Sbjct: 383 RIRWARGMTQIFRIDNPLVGR---GLTIAQRLCYLNGMLHFFYGIPRLVFLTAPLSFLFF 439 Query: 353 NIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLAT 412 ++ +A + + L + + +R + S + ++ V+ ++ T Sbjct: 440 GAQVIQASA-----VTIALFALPHMLHANATNSRMQRQFRHSFWAEVYESVLASYITPPT 494 Query: 413 TLVSF 417 L Sbjct: 495 LLALI 499 >UniRef50_A9FZQ2 Glycosyltransferase n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9FZQ2_SORC5 Length = 521 Score = 253 bits (648), Expect = 7e-66, Method: Composition-based stats. Identities = 80/382 (20%), Positives = 145/382 (37%), Gaps = 13/382 (3%) Query: 29 LMMRFVFFWPFFMSIMWI--VGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNE 86 L+ F +S + + V + A +D P ++I +P FNE Sbjct: 5 LLCVLYFGVLIGLSGYGLHRLHLVVLCRLNRAKITRAQEVAALTDRDLPPVTIQLPLFNE 64 Query: 87 EKNVEETIHAALAQRYE--NIEVIAVNDGSTDKTRAILDRMAAQIPH-----LRVIHLAQ 139 + A Y +E+ ++D STD+T+ ++ ++ + + + + Sbjct: 65 STVAARLLDAVAKMDYPRDKLEIQVLDD-STDETQGLVRAHVERLRALGLDAVYLHRVDR 123 Query: 140 NQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRS 199 KA AL G AK E + D D + D IV +P VG V R Sbjct: 124 VGYKAGALDAGLKIAKGELVAIFDADFIPQPDFVRSIVG-HFEDPTVGMVQTRWGHLNRD 182 Query: 200 -TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDID 258 +++ ++Q + R + G +F SG +R+ A+ E G W D +TED+D Sbjct: 183 VSILTQVQALMLDGHHLVENRARFGAGLLFNFSGTGGMWRKDAIREAGGWQHDTLTEDLD 242 Query: 259 ISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFR 318 +S++ QL + Y + +PE + L Q+ RWA+G + K M + + Sbjct: 243 LSYRAQLAGYRFVYREDVVSPAELPEDISALRAQQYRWAKGTVQTARKLMATVLSAKLSL 302 Query: 319 MWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFI 378 + + LT +A+ LV + + + + + A T L F Sbjct: 303 GQRIEAFFHLTPHFAYPLLVLLSVLLLPALVLFPAADTLTMIAIDLPLCTATTGSLAAFY 362 Query: 379 -VSLMIENRYEHNLTSSLFWII 399 ++ + R L +I Sbjct: 363 MLAETAQGRSRWGAVRRLPMLI 384 >UniRef50_Q30R23 Glycosyl transferase, family 2 n=3 Tax=Campylobacterales RepID=Q30R23_SULDN Length = 889 Score = 253 bits (648), Expect = 8e-66, Method: Composition-based stats. Identities = 76/436 (17%), Positives = 155/436 (35%), Gaps = 22/436 (5%) Query: 12 CLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAP-Q 70 ++ + V Y + G +M + M + P P + Sbjct: 393 IVMSAVYPFVHYMNFGTWVMWGMGTILMVPLTMITLAKANELFKCTIGRPPERLIPLDLK 452 Query: 71 LKDNPSISIIIPCFNE-EKNVEETIHAALAQRYENIEVIAVNDGSTDKTR-----AILDR 124 P +SI +P + E ++ET+ A RY N EV+ + + + + + + Sbjct: 453 SDHVPFVSIHVPAYKEQPAVLKETLEALANLRYTNFEVLVIINNTPEDFYKTPIKELCEE 512 Query: 125 MAAQIPHLRVIHLAQNQGKAIALKTGAAAA--KSEYLVCIDGDALLDRDAAAYIVEPMLY 182 + + + KA AL E L ID D ++ + +V P+ Sbjct: 513 LGE---RFVYLDIECTGFKAGALNRALEFTNPTCEILAVIDADYVIKPNWLIDLV-PIFD 568 Query: 183 NPRVGAVTGNPRIRTRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSA 241 + +V V R + +L+ + EY+ + + + G + R+SA Sbjct: 569 DAKVALVQAPQDHRDGNESLLKRAMNAEYAGFFDIGMVERNEENAI-VAHGTMIMVRKSA 627 Query: 242 LAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGA 301 EVG W+ D I ED ++ +L + Y R L+P+T++ QR RWA G Sbjct: 628 FDEVGQWNTDTIVEDSELGLRLFEAGYIGHYTNRRYGHGLLPDTIEAFKNQRHRWAYGAV 687 Query: 302 EVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAA 361 ++ K+ + + + +G + + + VP+ I + Sbjct: 688 QILKKHWEHFKPSSKTLTRSQKHHFITGWFFWLSDALGTVTSILNIIWVPVIIFVGVTIP 747 Query: 362 THTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVM 421 T + + T ++ I S ++ Y + ++ + + + L L + + Sbjct: 748 TIALTVPIITAFIVNVIHSFVL---YRTRVGANFYHSLLSAIAAMSLQLTIFKAVYDGFV 804 Query: 422 LMPKKQRARWVSPDRG 437 K R + ++G Sbjct: 805 ----KDRLPFKRTEKG 816 >UniRef50_A7INQ0 Glycosyl transferase family 2 n=11 Tax=Rhizobiales RepID=A7INQ0_XANP2 Length = 905 Score = 253 bits (647), Expect = 8e-66, Method: Composition-based stats. Identities = 90/421 (21%), Positives = 156/421 (37%), Gaps = 13/421 (3%) Query: 22 AYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQ-LKDNPSISII 80 Y ++ V + + + + F R + A P P +SI Sbjct: 380 DYVTLALSVVMLVPLVFVLLYRIEEMAAIAFGSGPRRLIDARKAAVVPTVPSRFPKVSIH 439 Query: 81 IPCFNEEKN-VEETIHAALAQRYENIEVIAVNDGSTDK--TRAILDRMAAQIPHLRVIHL 137 +P + E +++TI A A Y N E I + + + D + + AA + I+ Sbjct: 440 VPAYREPPEMLKQTIDALAALEYPNFEAIIIINNTPDPAMVEPVREYCAALGERFKFINA 499 Query: 138 AQNQG-KAIALKTGAAAA--KSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPR 194 + G KA AL+ A +E + ID D ++ D +V P+ +P VG V Sbjct: 500 EKVAGFKAGALRIALDATAPDAEIIGVIDADYVVTPDWLKELV-PVFDDPTVGLVQAPQD 558 Query: 195 IRTRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMI 253 R +L+ + EY+ + + + V G + RR+A+ E G WS D I Sbjct: 559 HRDADRSLLHEAMNAEYAGFFDIGMVQRNEDDAI-VVHGTMCLIRRAAMLEAGNWSSDTI 617 Query: 254 TEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWR 313 ED D+ + N W Y + + L+P++ + KQR RWA GG ++ K+ + Sbjct: 618 CEDTDLGLTIAENGWKTHYTRKRYGYGLLPDSFEAFKKQRHRWAYGGFQIIKKHWRKFLP 677 Query: 314 KENFRMWPLFFEYCLTTI-WAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGI--LLC 370 + + L I W + VG ++ LA VP + H + L+ Sbjct: 678 NRSRLTTAQKRHFVLGWISWLGSESVGAVMAIASLAFVPFVLLFGVSVPAHVLTLPILIT 737 Query: 371 TLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRAR 430 L L VSL + + V + + + + + Sbjct: 738 FLVYLMHFVSLYRLRVETTPMRMLGAAVAASAVQYTVAKAVLDGFRYKDLAFARTAKGNN 797 Query: 431 W 431 W Sbjct: 798 W 798 >UniRef50_B0CG43 Glycosyl transferase, group 2 family protein, putative n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0CG43_ACAM1 Length = 716 Score = 253 bits (647), Expect = 8e-66, Method: Composition-based stats. Identities = 69/415 (16%), Positives = 133/415 (32%), Gaps = 16/415 (3%) Query: 5 IVSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGE 64 +V + + + + + + F M+ + + P + Sbjct: 93 LVGLGVRYELWRLFNTLNLSDPLNGTLSILLFVMELMAFTNTAAFMLQTIPSIDRSPEAD 152 Query: 65 --NAPAPQLKDNPSISIIIPCFNEEK-NVEETIHAALAQRYENIEVIAVNDGSTDKTRAI 121 + P + ++IP +NE + TI A +Y++ V ++D R + Sbjct: 153 HLSKAVISGAYQPWVDVLIPTYNEPPDVLRRTIIGCQAMKYDHKRVFLLDDLRRPHIREL 212 Query: 122 LDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPML 181 + +I + KA L A + +V D D + + V Sbjct: 213 AQELDCHY----IIRPNNDHAKAGNLNHAIAKTDGDLIVVFDADFIPTTNFLTRTVG-FF 267 Query: 182 YNPRVGAVTGNPRIRTRSTL-----VGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAA 236 + +V + + + ++ E + I+ ++ + +V G Sbjct: 268 QDEKVALLQTPQNFYNEDPIGVNLGLQEVLTNEQALFFRYIQPSRDVTNSVIC-CGSSFV 326 Query: 237 FRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRW 296 RR+ALAE+G + ITED S KLQ + I Y AL + PE + QRLRW Sbjct: 327 IRRTALAEIGGIPTESITEDFFTSLKLQSLGYKIKYLNEALSAGMSPENMGAYIDQRLRW 386 Query: 297 AQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIEL 356 QG + + L ++ + FI + L+ + + Sbjct: 387 GQGTLQSMFC--PANVFTIPGLNLLQRVSHSLGLLYWCLSVPRFIFLVMPLSYLLFELAP 444 Query: 357 THIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLA 411 L S M E R + I+ P++ ++ Sbjct: 445 LRATIDGVLFFYLPYFVCNAISFSWMTEGRRSAFWSDVYETILCIPMMLTVIQTL 499 >UniRef50_A7HQK0 Glycosyl transferase family 2 n=1 Tax=Parvibaculum lavamentivorans DS-1 RepID=A7HQK0_PARL1 Length = 917 Score = 253 bits (647), Expect = 9e-66, Method: Composition-based stats. Identities = 84/395 (21%), Positives = 148/395 (37%), Gaps = 17/395 (4%) Query: 11 LCLVLCIPLCVAYFHS-GELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAP 69 + + YF G + F+FF + I+ + + F P Sbjct: 397 AIAWAGLSVTGLYFTWVGITVWSFLFFAQGLLLIVLLAEAIEFVEVIWTRHGSRHFKPFD 456 Query: 70 QLKDNPS--ISIIIPCFNEEKN-VEETIHAALAQRYENIEVIAVNDGSTD-KTRAILDRM 125 + +P+ +SI +P NE V ET+ A Y+N EV+ +++ + D + + Sbjct: 457 ENAVSPTAMVSIHVPIHNEPPEMVRETLQALANLDYDNYEVLVLDNNTVDPEVWQPVRDY 516 Query: 126 AAQ-IPHLRVIHLAQNQG-KAIALKTGAAAA--KSEYLVCIDGDALLDRDAAAYIVEPML 181 AQ P R HL G KA AL G ++E + ID D ++ +V P Sbjct: 517 CAQLGPRFRFFHLENWPGFKAGALNFGLEKTAEEAEIIAVIDSDYQVEPSWLKVLV-PYF 575 Query: 182 YNPRVGAVTGNPRIRTRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRS 240 VG V G R R + + EY+ + + + + G + R+S Sbjct: 576 DKQDVGFVQGPQDYRDRHESAFKNMAYWEYAGFFHIGMVQRNNFNAIIQ-HGTMTQVRKS 634 Query: 241 ALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGG 300 AL VG W++ I ED ++ KL + Y + L P+TL G QR RWA G Sbjct: 635 ALKRVGGWAEWCICEDAELGIKLYRAGYDSVYVNHSFGRGLTPDTLSGYITQRFRWAYGA 694 Query: 301 AEVFLKNMTRLWRK---ENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELT 357 ++ + L + + + + F + + + + Sbjct: 695 VQIVKHHWDALAPWASSKKGLTGAQRYYFLAGWLPWFADGLALLFTTASIVLSAWALYQP 754 Query: 358 HIAATHTAGILLCTL--CLLQFIVSLMIENRYEHN 390 H+ + A L+ T+ + +F+ SL + + Sbjct: 755 HMVSLPVAAFLIPTIGSFVFKFVRSLWLYAVRVRD 789 >UniRef50_Q7NLY9 Gll0980 protein n=1 Tax=Gloeobacter violaceus RepID=Q7NLY9_GLOVI Length = 804 Score = 252 bits (645), Expect = 2e-65, Method: Composition-based stats. Identities = 82/448 (18%), Positives = 172/448 (38%), Gaps = 22/448 (4%) Query: 1 MINRIVSFFILCLVLC-IPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFW--VYRE 57 ++N + + + + ++M + M + ++ + W + R+ Sbjct: 95 LVNNFLLALTVLAWGVNFIVNLEISLLTRMLMLAGYPLLVLMLPIGVIASLEQWEVLTRK 154 Query: 58 RHWPWGENAPAPQLKDNPSISIIIPCFNEE-KNVEETIHAALAQRYENIEVIAVNDGSTD 116 R P + + P + + +PC+ E + V T+ A RY N EV+ +++ + D Sbjct: 155 RWLRPRTVKPIGEREHYPKVCLQVPCYAEPPEVVTATLDRLAALRYPNFEVMVIDNNTKD 214 Query: 117 K-----TRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKS--EYLVCIDGDALLD 169 A +++ + V KA AL E + ID D + Sbjct: 215 PNLWKPVEAYCEQLGERFRFFHV--DPLAGAKAGALNWAMERVAGDVEIIGVIDADYHAE 272 Query: 170 RDAAAYIVEPMLYNPRVGAVTGNPRIRT-RSTLVGKIQVGEYSSIIGLIKRTQRIYGNVF 228 D + ++ PR+G V R ++L ++ EY + + Sbjct: 273 PDFLSSLLA-HFDEPRMGFVQTPHDYRGWENSLYQRMCYWEYKTFFATTMPSLNEKDAGL 331 Query: 229 TVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKG 288 TV G + RR AL E G WS+ TED +++ ++ +T + P+ L+PET G Sbjct: 332 TV-GTMCLIRRRALDEAGGWSEWCQTEDSELAIRIHALGYTSVFVPQTFGRGLIPETFAG 390 Query: 289 LWKQRLRWAQGGAEVFLKNMTRLWRK----ENFRMWPLFFEYCLTTIWAFTCLVGFIIYA 344 KQR RW G + F +++ K + + + FT + F++ Sbjct: 391 YKKQRFRWTFGPVQEFKQHLRLFLPKPWAAPSELRPIQKLHHMNHGLGPFTTGLSFLMMP 450 Query: 345 VQLAGVP-LNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEH-NLTSSLFWIIWFP 402 V LA + + ++ +AA + I + ++++ +I +L ++ I+ Sbjct: 451 VNLAVLVSMLVQGESVAAPRSLWIAAVCATVAGWLLTWLIHRVEMGCSLRDTIGAILAKQ 510 Query: 403 VIFWMLSLATTLVSFTRVMLMPKKQRAR 430 ++ + A+ + F + + + + Sbjct: 511 ALYHTMITASVMGLFVGRIAWRRTDKFK 538 >UniRef50_B1M838 Cellulose synthase catalytic subunit (UDP-forming) n=15 Tax=Rhizobiales RepID=B1M838_METRJ Length = 930 Score = 252 bits (644), Expect = 2e-65, Method: Composition-based stats. Identities = 79/449 (17%), Positives = 136/449 (30%), Gaps = 37/449 (8%) Query: 5 IVSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGE 64 + V ++ + + V Sbjct: 96 TAVVIRYAYWRVTGTLPTLDDPVSFGLGVVLALAELYCVLILTVSLIINVAPLARGA--- 152 Query: 65 NAPAPQLKDNPSISIIIPCFNEEKNV-EETIHAALAQRYE--NIEVIAVNDGSTDKTRA- 120 AP D P++ I IP +NE + T+ AA Y V ++DG TD+ A Sbjct: 153 -APVLPEADLPTVDIFIPSYNESAEILGLTLAAARNLDYPAGRATVWLLDDGGTDQKCAD 211 Query: 121 --------------ILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDA 166 L + A + + KA L G A+++ ++ +D D Sbjct: 212 PDPARAGAARARRAALQALCAGLGVRYLTRARNAHAKAGNLNNGLTQARADLVLVLDADH 271 Query: 167 LLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTL------VGKIQVGEYSSIIGLIKRT 220 R V +P++ V + ++ E G+ + Sbjct: 272 APFRPFLRETVGLFARDPKLFLVQTPHVFINPDPIERNLRTFTRM-PSENEMFYGVTQAG 330 Query: 221 QRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWI 280 + F G A RRSAL VG +S ITED + +++L WT Y R L Sbjct: 331 LDKWNGSF-FCGSAALLRRSALDAVGGFSGVTITEDCETAFELHARGWTSAYVDRPLIAG 389 Query: 281 LMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGF 340 L PET QR RW QG ++ L R Y + + F L Sbjct: 390 LQPETFADFIGQRARWCQGMFQIMLLKNPLFKRGLKPIQR---LCYLSSMTFWFFPLPRL 446 Query: 341 IIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIW 400 I L + ++++ + T + ++ + S L+ Sbjct: 447 IFMLAPLLHIFFDVKIFVSSIDEALVYT-ATYVVANMMMQNYLYGHVRWPWVSELYE--- 502 Query: 401 FPVIFWMLSLATTLVSFTRVMLMPKKQRA 429 + ++ ++V R + Sbjct: 503 YVQGVYLARSIVSVVLSPRKPSFTVTNKG 531 >UniRef50_C7PF17 Glycosyl transferase family 2 n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PF17_CHIPD Length = 498 Score = 252 bits (644), Expect = 2e-65, Method: Composition-based stats. Identities = 91/489 (18%), Positives = 163/489 (33%), Gaps = 78/489 (15%) Query: 22 AYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIII 81 + + E + FM + + + + +R + + Q P ISII Sbjct: 3 LFRNFYEGFVFVYGCTMLFMYALLAILSLRGIIRFQRKNSYVDYNKMLQSPLAPGISIIA 62 Query: 82 PCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTR-AILDRMAAQ------------ 128 P FNE + + + L Y EVI VNDGSTD T +++ Q Sbjct: 63 PAFNEGVTIISNVRSLLTLNYPRFEVIIVNDGSTDDTLTKLINEFQLQEVDFAYNERIKS 122 Query: 129 -------------IPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAY 175 L VI + KA A G AA +Y +C D D ++++D Sbjct: 123 QPVKRLFKSTNTAYDKLVVIDKVNGKSKADASNAGINAAAYDYFLCTDVDCIIEKDTLLR 182 Query: 176 IVEPMLYNP------------------------RVGAVTGNPRIRT-------------- 197 +++P + RV A RI Sbjct: 183 MIKPFMDEEHNKIKEIGEPCPECGYIHVKEDSVRVIATGATLRIANSCEIDEGVITRVRP 242 Query: 198 RSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDI 257 + + Q EY L K + V VSG + F + + G + ED+ Sbjct: 243 PEKWLPRFQEMEYLRAYVLGKMGWSVINCVPNVSGGLGLFDKEIAIKAGGYDSKSFAEDM 302 Query: 258 DISWKLQ------LNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRL 311 DI ++ ++ I Y P CW P +K +QR RW +G AE+ + + Sbjct: 303 DIVTRMCTYMIDNKLKYAIRYIPTTQCWTEGPPNMKVFSRQRTRWGRGLAEIITIHRKVI 362 Query: 312 WRKENFRMWPLFFEYCLTTIWAFTCL----VGFIIYAVQLAGVPLNIELTHIAATHTAGI 367 + ++ + Y L + + + + IY + + + L + + + Sbjct: 363 FNPRYRKLGLVVLPYNLFFEFLAPIIEFTGILYYIYQIITGNINWHNALILLLFVYLYSV 422 Query: 368 LLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQ 427 ++ TL +L ++ + + P ++ L + L + + K Sbjct: 423 MITTLAVLWDQIT-YKHYKTWREVVGLALMAFIEPFVYHPLIVFFALRGYFNFITGKKH- 480 Query: 428 RARWVSPDR 436 W + R Sbjct: 481 --TWGNMQR 487 >UniRef50_Q3B477 Cellulose synthase (UDP-forming) n=1 Tax=Chlorobium luteolum DSM 273 RepID=Q3B477_PELLD Length = 501 Score = 251 bits (642), Expect = 3e-65, Method: Composition-based stats. Identities = 75/416 (18%), Positives = 149/416 (35%), Gaps = 8/416 (1%) Query: 9 FILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPA 68 L + + + L+ R F F + ++ G ++ Sbjct: 19 IALAVYVAVRGYFLVVSDYPLVERIYGFVFFAAELFVLLHGFGYFTNVLALSIKPYRNQQ 78 Query: 69 PQLKDNPSISIIIPCFNEEK-NVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAA 127 ++ P+++I+IP +E K +E+T+ A Y N + ++D S + +A+ Sbjct: 79 AASQEQPAVAILIPARHEPKEVLEQTLLACRNLGYPNKTIYILDDSSILSYKDEARELAS 138 Query: 128 QIPHLRVIHLAQNQG-KAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRV 186 + ++ + N+G KA L A ++Y+ D D IV + + R+ Sbjct: 139 RF-NVELFSRDGNRGAKAGMLNDALAHINAKYIAVFDADQNPMPGFLQKIVPVLEADSRL 197 Query: 187 GAVTGNPRIRT-RSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEV 245 V + V + + I + + +F G R+ AL V Sbjct: 198 ALVQTPQFYTNTEESRVAWSSNIQQAVFYEYISEGKSVKNAMFC-CGTNFVMRKDALDSV 256 Query: 246 GYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFL 305 G + + +TED+ + KL + W Y A + + PE L + Q+ RWA G A++F Sbjct: 257 GGFEEGSVTEDVATTLKLHMAGWKSLYYEHAYVFGMAPENLGSYFMQQNRWAMGSAQLFR 316 Query: 306 KNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTA 365 K R +FEY L+ + F I+ + ++ + + + + Sbjct: 317 KAAGLFLRLPGSLSPLQWFEYFLSGSYYFIGWAYLILLSAPISYILFDTPSYFMDPSVFM 376 Query: 366 GILLCTLCLLQFIVSLMIENRYEHNL---TSSLFWIIWFPVIFWMLSLATTLVSFT 418 + L + +E R+ + +I PV + +S Sbjct: 377 LTFIPYFILSVLLFYSGLEMRHFSPFNLIKGQMLAMISIPVYMRAVVQGVLNISVP 432 >UniRef50_A3V6V4 Putative cellulose synthase catalytic subunit protein n=1 Tax=Loktanella vestfoldensis SKA53 RepID=A3V6V4_9RHOB Length = 652 Score = 251 bits (642), Expect = 4e-65, Method: Composition-based stats. Identities = 80/422 (18%), Positives = 146/422 (34%), Gaps = 23/422 (5%) Query: 10 ILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAP 69 + + + V + + ++ F + + + V F++ R+ P E A A Sbjct: 45 LRYVYWRLNSTVLPYDEQFGIEFIWIWFVFIVEVAIYIEAVVFFLIMSRYKPRHEEADAG 104 Query: 70 Q-----LKDNPSISIIIPCFNEE-KNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILD 123 + P + + IP +NE + +E+TI A Y N+ V ++DG D L Sbjct: 105 EAALIARGVYPPVDVFIPTYNEPPEVLEKTIVGAAGIDYPNLTVWVLDDGKRD----WLC 160 Query: 124 RMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYN 183 M + + KA L A+ +++ D D + R+ + + + Sbjct: 161 GMCETEGARYLTRPDNSHAKAGNLNNALLYAEGDFIAIFDADFVPHRNFLKRTIG-LFDD 219 Query: 184 PRVGAVTGNPRIRTRSTL-----VGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFR 238 P VG V + + + K E + + +G F G + R Sbjct: 220 PEVGIVQTPQHFFNQDPIQFNLDLTKDWPDEQRLFFDKMAECRDAWGVAFC-CGSCSVTR 278 Query: 239 RSALAEVG-YWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWA 297 R+AL G + + ITED+ + KL + Y L L E+L+G + QR RW Sbjct: 279 RAALRAAGDQYPTESITEDLLTTLKLSRAGYKTRYLNERLSMGLAAESLEGFFVQRARWC 338 Query: 298 QGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQL-AGVPLNIEL 356 +GG + L LFF + W F+I + + + L Sbjct: 339 RGGIQSIFLPDGPLGPGIPLLNRLLFFPWS----WLIQYPSRFVIMIIPIVVLWTGLMPL 394 Query: 357 THIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVS 416 + + M+ Y L+++ W F ++ +LS Sbjct: 395 HFTSIEELIFYQFPVFLTYYLFMRWMLPYHYMPVLSTAQSWFATFRLLPTVLSSIIKPFG 454 Query: 417 FT 418 Sbjct: 455 VP 456 >UniRef50_Q6KZU9 N-acetylglucosaminyltransferase n=1 Tax=Picrophilus torridus RepID=Q6KZU9_PICTO Length = 395 Score = 251 bits (642), Expect = 4e-65, Method: Composition-based stats. Identities = 75/403 (18%), Positives = 147/403 (36%), Gaps = 20/403 (4%) Query: 34 VFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEET 93 + + I++ + + H+ + P +SII+P NEE + Sbjct: 7 IGILLIVVGIVYSIYQFPILLIGYLHFH-DYDIDFDWDNYKPLVSIIVPAKNEETVIGRC 65 Query: 94 IHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAA 153 I + L Q Y+N E+ V D S D T I + + V N KA AL + Sbjct: 66 IESILGQAYDNFELFVVVDNSDDDTYRIAKSY-ERDGRVHVFERHGNLTKASALNYAYSM 124 Query: 154 AKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSI 213 + E + D D +L+++ V M Y N I + ++ + I Sbjct: 125 SHGEIIATYDADTVLEKNTLKNAVYGMRYMDADVLQGYNTYINREENIFTRLAAID-EII 183 Query: 214 IGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYE 273 + + + V+G F+R + +G W+ + +TED++ ++ + Y Sbjct: 184 VKVSMIGRMYLHLFVPVAGSNQYFKRETIRIIGGWNGNFLTEDLESGVRMAAKRMRSAYL 243 Query: 274 PRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWA 333 P A + P T KQR+RW +G +V L + L + + + I Sbjct: 244 PSAKVYQETPATYSEYIKQRIRWLRGYHQVLLHSKKELSGLSGLDILMIVLAPTFSGI-- 301 Query: 334 FTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTS 393 + ++ ++ + H T+ +L + L + + E+ L Sbjct: 302 ------LLFSSIYISILNFYNPYVHSMRTYFISLLFIFFMIYIIAFVLALIKKKENALYI 355 Query: 394 SLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 L +I + +L+ ++ +F +L K+ W + Sbjct: 356 PLVYI------YVVLNAIISIYTFFLEILGVKR---VWYKVKK 389 >UniRef50_C5SBP2 Glycosyl transferase family 2 n=1 Tax=Allochromatium vinosum DSM 180 RepID=C5SBP2_CHRVI Length = 481 Score = 251 bits (641), Expect = 4e-65, Method: Composition-based stats. Identities = 76/414 (18%), Positives = 150/414 (36%), Gaps = 17/414 (4%) Query: 37 WPFFMSIMWIVGGVYFWVYRERHWPWGENAPAP----QLKDNPSISIIIPCFNEEKNVEE 92 + + ++ + V + P ++P + + +P FNE VE Sbjct: 10 YYLILGLVILALCSLNLVALTLARRFMPGRPLSVAKLTDAEHPRVLVQLPLFNEGDLVER 69 Query: 93 TIHAALAQRYE--NIEVIAVNDGSTDKTRAILDR----MAAQIPHLRVIHLAQNQG-KAI 145 + A + + +E+ ++D S D + AI R + Q ++ ++H Q KA Sbjct: 70 ILEAVMDLDWPRDRLEIQVLDD-SIDGSLAISQRAVAVLKQQGVNIELLHRVQRTAFKAG 128 Query: 146 ALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRS-TLVGK 204 AL G +++ ++ D D + D V ++ NP + V R +L+ + Sbjct: 129 ALAAGLERSEAPFVAIFDADFIPPPDFLQRTVGALVANPDLAYVQTRWGHLNRDESLLTR 188 Query: 205 IQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQ 264 IQ S G+ + + G +G +RR+A+ E G W D +TED+D+S + Sbjct: 189 IQARLLDSHFGVEQEARWRLGLPLPFNGTCGLWRRAAIDEAGGWDGDTLTEDLDLSLRAN 248 Query: 265 LNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMW-PLF 323 L W + + +P + + Q+ RW +G + F+K +WR +W + Sbjct: 249 LAGWRSGFMGDLVVPGSLPVSARAWRVQQFRWTKGFVQCFIKLTPLVWRSRRLPIWQKIM 308 Query: 324 FEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMI 383 + + AF V +I + + +L ++ ++ Sbjct: 309 ISFQIGQPLAFVVGVACLIMGLPFMAGAVVGGAGLSVVASVTSVLGFAAPIMFLTMAGRD 368 Query: 384 EN-RYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 E R + ++ L+ + K R R R Sbjct: 369 EGIRRTAIEILGALALTSGLLLSNARGGLEALLGHRTEFVRTPKARDR--KTAR 420 >UniRef50_Q0IE23 Glycosyl transferase, group 2 family protein n=19 Tax=cellular organisms RepID=Q0IE23_SYNS3 Length = 467 Score = 251 bits (641), Expect = 5e-65, Method: Composition-based stats. Identities = 77/380 (20%), Positives = 144/380 (37%), Gaps = 18/380 (4%) Query: 63 GENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEV--IAVNDGSTDKTRA 120 E A L P++ +++ +EE V + A RY ++ ++DGS DKT Sbjct: 93 AEPAVVESLASWPNVDVVVAARDEEAVVARLMERLGALRYPADQLTTWIIDDGSEDKTPD 152 Query: 121 ILDRMAAQIPHLRVIHLAQN--QGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVE 178 +LD + Q P+L VI +N GK+ AL A + E+++ +D D + D ++ Sbjct: 153 LLDELKPQYPNLNVIRRPRNAGGGKSGALNAALAQSSGEWILVLDADGQISEDQLERLIP 212 Query: 179 PMLYNPRVGAVTGNPRIRTRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAF 237 + G V + L+ ++Q E + LI++ + + G V + G Sbjct: 213 IAVMGDWSG-VQMRKAVTNADTNLLTRLQAMEMA-FDALIQQGRLLGGGVSELRGNGQLL 270 Query: 238 RRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWA 297 RR L G +++D +T+D+D+S++ L ET+ LWKQR RWA Sbjct: 271 RRDVLEACGGFNEDTVTDDLDLSFRFLLQGARTTLLWNPPVREEAVETISALWKQRQRWA 330 Query: 298 QGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELT 357 +GG + F L A L+ + + + A + ++ Sbjct: 331 EGGLQRFFDYWPGLLSNRLTVGQRRDL--------ACFFLLQYALPVLSFADLLTSVFTR 382 Query: 358 HIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSF 417 A I+ + + + R+ + + LS ++ + Sbjct: 383 TSPAYWPLSIVAFGVSGVAY---WRGCRRFSEGPSLPQPDPFSLLLGIAYLSHWFFVIPW 439 Query: 418 TRVMLMPKKQRARWVSPDRG 437 + + +R W Sbjct: 440 VTLRMAVLPKRLVWAKTSHR 459 >UniRef50_C0GFE2 Glycosyl transferase family 2 n=1 Tax=Dethiobacter alkaliphilus AHT 1 RepID=C0GFE2_9FIRM Length = 455 Score = 250 bits (640), Expect = 6e-65, Method: Composition-based stats. Identities = 97/446 (21%), Positives = 180/446 (40%), Gaps = 60/446 (13%) Query: 39 FFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSI--SIIIPCFNEEKNVEETIHA 96 ++++ V F + + P + +I++P FNE+ +++ I Sbjct: 2 LVLTLVIFSIFVMFQILYITIPLLNHKRGTELPEKLPELGLTILVPAFNEQHVIKQCILG 61 Query: 97 ALAQRYENIEVIAVNDGSTDKTRAILDRMAAQ------------------------IPHL 132 AL Y+N E+I +NDGSTD T IL+ P++ Sbjct: 62 ALRANYQNHELIIINDGSTDDTLGILESFLRLEAVEESINRTLTYKTVRGFYYSTIYPNV 121 Query: 133 RVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGN 192 VI N GKA AL G A ++ +V +D D++LD ++ + ++ V A G Sbjct: 122 YVID-KSNGGKADALNAGIDYANNDIVVTLDADSILDSNSLQAMNSC-FHDDSVIAAGGM 179 Query: 193 PRIRTRSTL-------------VGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRR 239 I + + + + Q+ +Y + L K Q + + ++G AFRR Sbjct: 180 VHIVQGAGVKGGKASPNFAVRGIIRYQIMQYLTSFYLHKFVQAKFKAMTVIAGAFGAFRR 239 Query: 240 SALAEVGYWSDDMITEDIDISWKLQL------NQWTIFYEPRALCWILMPETLKGLWKQR 293 S L E+ + + + ED+DI+ K+Q + A+C+ PE + L+KQR Sbjct: 240 STLLELNGFR-NTVGEDMDITLKIQRLIYTKERDKKMMVVSEAVCYTECPENFRNLFKQR 298 Query: 294 LRWAQGGAEVFLKNMTRLWRKENFRMWPLFF--EYCLTTIWAFTCLVGFIIYAVQLAGVP 351 +RW + + + L+ + +F + F + L T+ AF L+ ++ + G Sbjct: 299 IRWQKAFVDCIITYGHCLFSQFSFGLSVFFLIEGFMLGTLIAFNTLLVPLVLILSKTGHT 358 Query: 352 LNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLF-WIIWFPVIFWMLSL 410 L + L I IL L+ F+VS + Y + + + +I + + +L L Sbjct: 359 LVLTLLTI-----YFILALCQSLVAFLVSKRFGHSYSKSNYARILSFIPLEIITYRLLGL 413 Query: 411 ATTLVSFTRVMLMPKKQRARWVSPDR 436 + ++ W +R Sbjct: 414 LWVTAGTIQYFY----KKQHWDKVER 435 >UniRef50_UPI0001C42DFC capsule anchoring protein CapD n=1 Tax=Bacillus pseudofirmus OF4 RepID=UPI0001C42DFC Length = 1000 Score = 250 bits (638), Expect = 1e-64, Method: Composition-based stats. Identities = 91/440 (20%), Positives = 164/440 (37%), Gaps = 56/440 (12%) Query: 43 IMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRY 102 I+ I+ ++ + + QL++ IS+++PC+NE+ +E ++ + Y Sbjct: 554 ILGIIFPALHLLHSLPWFKSKGESIRRQLEEEKGISVLVPCYNEQGIIETSLKNMSSLPY 613 Query: 103 ENIEVIAVNDGSTDKTRAILDRM------------------------AAQIPHLRVIHLA 138 + EVI +NDGS D+T ++L+ + P + VI Sbjct: 614 QKKEVIYINDGSKDRTFSLLNTYLKLKPSTKKPLQKLSHKRVRGVYQSELYPFIYVID-K 672 Query: 139 QNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTR 198 +N GKA AL G A + +V +D D +L +A V + V A G + Sbjct: 673 ENGGKADALNAGIEYAGEDIVVSLDADTVLTENALPK-VNEAFEDEDVVAAGGMVHVLQT 731 Query: 199 S-------------TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEV 245 + ++Q+ ++ + K + + + +SG FR+ AL EV Sbjct: 732 KTKNPLKRLSLRHANFLVRLQMFDFLKAFYITKISLARFHALAVISGAFGIFRKDALLEV 791 Query: 246 GYWSDDMITEDIDISWKLQLN-----QWTIFYEPRALCWILMPETLKGLWKQRLRWAQGG 300 G + I EDIDI+ K+ + + A+ + MPET +KQR+RW + Sbjct: 792 GGYR-STIGEDIDITLKMHEYIAKHVNKKVIHLKEAISYTEMPETWNDFFKQRVRWQKAY 850 Query: 301 AEVFLKNMTRLWRKENFRM---WPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELT 357 + + + +RK FR + FE L I + + GF I + P + Sbjct: 851 IDCVI-HFHSFFRKTLFRKAVSFFYIFESFLVGIVSAFIMTGFFIANA-IISPPESYLAY 908 Query: 358 HIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWF-PVIFWMLSLATTLVS 416 + I + I++ R+ SLF I ++ + + Sbjct: 909 VLYYLTFLLIFGVVYDIFALIMNRYHGFRFRWKDIPSLFVTILLDVFVYRFALMYIVMHG 968 Query: 417 FTRVMLMPKKQRARWVSPDR 436 W R Sbjct: 969 TIEYFF-----NTDWNKVSR 983 >UniRef50_B1ZCT8 Cellulose synthase (UDP-forming) n=14 Tax=Rhizobiales RepID=B1ZCT8_METPB Length = 672 Score = 250 bits (638), Expect = 1e-64, Method: Composition-based stats. Identities = 78/431 (18%), Positives = 149/431 (34%), Gaps = 27/431 (6%) Query: 5 IVSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHW---- 60 +++ + LV V H + F++ W F + + ++ Sbjct: 45 VLAVAVRYLVWRFFYTVLP-HPADGSFAFIWTWTVFFVELGAFADILLFLVVMSRSVDRS 103 Query: 61 ----PWGENAPAPQLKDNPSISIIIPCFNEE-KNVEETIHAALAQRYE--NIEVIAVNDG 113 A + P++ + IP +NE +E TI ALA Y +V ++D Sbjct: 104 SEASRLERTFFARPKAELPTVDVFIPTYNEPLDVLERTIVGALALDYPRDKFKVYVLDDK 163 Query: 114 STDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAA 173 D L + + V + KA + G + E++ D D + R+ Sbjct: 164 RRD----WLKAYCEEKGAIHVTRPDNSHAKAGNMNNGLKVSSGEFIAIFDADFVPYRNFL 219 Query: 174 AYIVEPMLYNPRVGAVTGNPRIRTRSTL-----VGKIQVGEYSSIIGLIKRTQRIYGNVF 228 + P +P +G V + + + K+ E + ++ + F Sbjct: 220 RRTL-PFFTDPTIGIVQTPQHFFNKDPVQSNLSLEKVWPDEQRLFFDEMAASRDAWDVSF 278 Query: 229 TVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKG 288 G + RR+AL +G + D ITED+ + + + Y L L E LKG Sbjct: 279 C-CGSCSIARRAALDVIGGFPHDSITEDLLTTLAMLNKGYKTRYLNERLSMGLAAENLKG 337 Query: 289 LWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLA 348 + QR RW +GG + + L R ++ L+ + +T F+I V Sbjct: 338 YFVQRGRWCRGGIQTIYLHNGPL-RGPGLSLFQRVMFLPLSWLMQYTT--RFVILVVPAV 394 Query: 349 GVPLNIELTHIAATH-TAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWM 407 + + T L L ++ + RY ++S++ F ++ + Sbjct: 395 YLWTGAAPLYFTGTQDIVTYQLPVLAAYFLLMGWLTPTRYLPLVSSAVGTFATFRMLPVV 454 Query: 408 LSLATTLVSFT 418 +S Sbjct: 455 VSSVIKPFGVP 465 >UniRef50_D1JE47 Conserved hypothetical membrane protein, glycosyl transferase family 2 n=1 Tax=uncultured archaeon RepID=D1JE47_9ARCH Length = 392 Score = 249 bits (637), Expect = 1e-64, Method: Composition-based stats. Identities = 76/361 (21%), Positives = 146/361 (40%), Gaps = 9/361 (2%) Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRYENI-EVIAVNDGSTDKTRAILDRMAAQIPHLR 133 P+++++IP +NEEK + +TI++ Q Y N+ +++ V+D STDKT I A+ Sbjct: 37 PTVTVLIPAYNEEKTIADTINSIRQQTYSNVVQILVVDDSSTDKTDEIAREGDAE----- 91 Query: 134 VIHLAQNQG-KAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGN 192 VI N G KA A K E ID D +LD +A +V L P + + Sbjct: 92 VIRTPSNSGTKAQAQNFALPYLKGELTATIDADTILDVEAIDTMVLRFLKEPDIASTCSF 151 Query: 193 PRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDM 252 + R + EY + + K+TQ G SG + F S L + G + D Sbjct: 152 VVPQVRKNFWELGRTVEYIYGLFMRKKTQEYLGIPIVCSGCFSLFNTSLLQKYG-FKDRT 210 Query: 253 ITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLW 312 + ED+D++W+L + + A+C+ P++ K Q RW + + + +W Sbjct: 211 MAEDMDLTWQLLAEGKKVRFCADAICFPKDPDSWKTYKGQVSRWLRSFFQNLSVHKKDIW 270 Query: 313 RKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTL 372 + + ++ + F L+ + I + + ++ I LL Sbjct: 271 KNKKLAVFLYSYLIEGFMGVIFYALLLWSIAFPPTQYIVVTSPVSLIIPVKAIMPLLLLS 330 Query: 373 CLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWV 432 + S++ + + + +++ + +F + K++ +W Sbjct: 331 SVAIVTASVLYQGHKLKISKIEILKGVPCYFALSIVNCCLFMEAFLNEWVF-KRRLEKWE 389 Query: 433 S 433 Sbjct: 390 K 390 >UniRef50_B0U7V3 Glycosyl transferase family 2 n=1 Tax=Methylobacterium sp. 4-46 RepID=B0U7V3_METS4 Length = 483 Score = 249 bits (637), Expect = 1e-64, Method: Composition-based stats. Identities = 86/415 (20%), Positives = 158/415 (38%), Gaps = 30/415 (7%) Query: 26 SGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFN 85 + + + ++ ++ + F+ H PA P +SI++P FN Sbjct: 64 TTAWGIFLIVIGASIIARWLVIQAMAFY----EHDRVRRKPPAELPNPAPFVSILVPAFN 119 Query: 86 EEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRV-IHLAQNQGKA 144 E + V + + Y N E+I V+DGSTD T +A Q + + ++ N GK Sbjct: 120 ESETVIGAPTSLMTLDYPNYEIIFVDDGSTDDTFIKAFPLAGQYGNCTLRVYTKPNGGKW 179 Query: 145 IALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGK 204 +L AK + L+C+D D+ L +DA +V M V G IR R+ + K Sbjct: 180 SSLNFAYKKAKGDLLLCVDADSGLAKDALRVMVPRM-SEKGVVGCAGQVTIRNRNNFLTK 238 Query: 205 IQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEV------------------- 245 +Q EY G ++ + G V V G I ++R+ L E+ Sbjct: 239 LQAAEYLLGNGGMRMALSLMGVVTVVPGPIGLYKRAILEEITKIPRKLSAAAAVTHSGPG 298 Query: 246 ---GYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAE 302 G S + ED +S I YEPRA + P+ + L QR RW +G + Sbjct: 299 AVNGPLSGETFAEDFQLSLSALALGGRIVYEPRAYAYTKCPDDVATLMSQRYRWIRGTWQ 358 Query: 303 VFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAAT 362 V++ + L + N +T ++ + I+ + +++ Sbjct: 359 VYIVYVRTLSKLTNRGNKSSLLNTVMTVLYPADIFLVPILNFFFWISIAVSMASGQSMNF 418 Query: 363 HTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSF 417 A I L + + ++E + ++ + +++ A + + Sbjct: 419 ILAWIGSVLLMNVMTAMVYILEQ--DDDIALLPLVPVLDLYQSILINSAWVIAAV 471 >UniRef50_UPI000038DF5C N-acetylglucosaminyltransferase n=1 Tax=Ferroplasma acidarmanus fer1 RepID=UPI000038DF5C Length = 405 Score = 249 bits (637), Expect = 1e-64, Method: Composition-based stats. Identities = 78/408 (19%), Positives = 150/408 (36%), Gaps = 20/408 (4%) Query: 30 MMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKN 89 ++ + + + G F Y E + P +SII+P NEE Sbjct: 11 LIAIGLVYVIYQFPIIYFGYKDFTKYDIDFSKLNEAQFSGLKMYKPMVSIIVPAKNEETV 70 Query: 90 VEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKT 149 ++ TI + L Q Y N E+ V D S+D T I ++ + V + + KA AL Sbjct: 71 IKRTIESILNQTYTNFELFVVVDNSSDNTYKIAKEYESRDKRVNVFNRPDGKSKASALNF 130 Query: 150 GAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTR-STLVGKIQVG 208 K E + D D +L + V M V + G R + ++ V Sbjct: 131 CFEKTKGEVIATYDADTMLLPNTLENAVYGM-NYFNVDVLQGYNSYINREENIFTRLAVI 189 Query: 209 EYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQW 268 + ++ + + V+G F+R + +G W D+ +TED++ S ++ ++ Sbjct: 190 D-EILVKATLIGRTHFNLFVPVAGSNQYFKRKVIESIGGWDDNFLTEDLESSIRISNARY 248 Query: 269 TIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCL 328 Y A P + ++QR RW +G +VF + R + +F + Sbjct: 249 KSAYLGSAKALQETPASYSEYFRQRTRWLRGYHQVFFHSKKRFSKFTDFDALMIVLAPTF 308 Query: 329 TTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYE 388 + I F L ++ + + ++ +V ++I+ R Sbjct: 309 SGILFFGWLYISLLNFYNPFVHSMRTYFISLILISLIIYVVA-------LVLVLIKKRQN 361 Query: 389 HNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 I+ P+I+ L+L +L++ + L + W + Sbjct: 362 F---------IYIPLIYIYLTL-NSLIAIYTLFLEITGAKRVWHKVKK 399 >UniRef50_A0LSJ8 Glycosyl transferase, family 2 n=1 Tax=Acidothermus cellulolyticus 11B RepID=A0LSJ8_ACIC1 Length = 471 Score = 249 bits (637), Expect = 1e-64, Method: Composition-based stats. Identities = 92/427 (21%), Positives = 165/427 (38%), Gaps = 7/427 (1%) Query: 10 ILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAP 69 + L I V + V + ++ + +P Sbjct: 27 FIVLAGLISAAVGILGPDRAIFLGVLVINLVFLVFFLRHVAFAAAAARWAPTDLYESPEV 86 Query: 70 QLKDNPSISIIIPCFNEEKNVEETIHAALAQRYEN--IEVIAVNDGSTDKTRAILDRMAA 127 +D P++S+++ C NEE V I +A Y +E++ VNDGS+D+T +LD++ A Sbjct: 87 PSEDLPTVSVLVACHNEESVVPGLIRGLVALHYPRSRLEILIVNDGSSDRTGELLDQLTA 146 Query: 128 QIPHLRVIHLAQNQGKAIALKTGAAAAK--SEYLVCIDGDALLDRDAAAYIVEPMLYNPR 185 I H+RVIH G + AA A+ E +V D D + R+ +V +P Sbjct: 147 GIRHMRVIHRPAGAGGGKSGALNAALAEANGEIVVVFDADHIPRRNVIHRLVR-HFRDPT 205 Query: 186 VGAVTGNPRIRTR-STLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAE 244 V AV G +R + + + +Y S + + ++ + G A R + L + Sbjct: 206 VAAVQGRCIVRNSVQSTLARSIAIDYFSGYLVNEYGRQALFGLPAYGGANCAVRTALLRQ 265 Query: 245 VGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVF 304 G W+ + +TED DI+ +L L + + Y+ A+ TL +QR RWA+G +V+ Sbjct: 266 FGGWNVNSVTEDTDITLRLVLAGYRVRYDITAVDTEEGATTLARFVRQRYRWARGHQQVW 325 Query: 305 LKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHT 364 +WR + + E L + + + + L + + Sbjct: 326 RDYRRGVWRCPHLTLG-QKIETTLFLLVYHVPVFCTLTLVLTLLRLAGIGPHVSVVELLP 384 Query: 365 AGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMP 424 LL + V L++ S + F V + + A R Sbjct: 385 LAALLFAGPFCELAVGLLVGRAPRRAAWSVAWMTPLFVVFMLVCTRAWIDGLLGRPYTWV 444 Query: 425 KKQRARW 431 K +R++W Sbjct: 445 KTKRSQW 451 >UniRef50_Q0AEM9 Cellulose synthase (UDP-forming) n=2 Tax=Nitrosomonas RepID=Q0AEM9_NITEC Length = 492 Score = 249 bits (637), Expect = 1e-64, Method: Composition-based stats. Identities = 74/434 (17%), Positives = 158/434 (36%), Gaps = 14/434 (3%) Query: 15 LCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDN 74 + + ++Y + V F + + + ++G + + L Sbjct: 1 MLMLTVLSYAAQAVFFLVVVCFILYVLLELRVLGISRRVERHKLTELVSQLPGMLALPVW 60 Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRYE--NIEVIAVNDGSTDKTRAILDR----MAAQ 128 P +S+++P +NE + I A +Y ++E++ ++D STD T + A Sbjct: 61 PRVSVLLPIYNEAAVIVRLIDAVCRLQYPTSSLEILVLDD-STDNTSTLAQARIDYHADL 119 Query: 129 IPHLRVIHLAQNQG-KAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVG 187 +R + A N+G KA L G + E+ V D D + D V P +P++G Sbjct: 120 GVSIRYVRRASNEGYKAGNLLNGIRQSSGEFYVIFDADFIPQEDFLLRTV-PYFQDPQLG 178 Query: 188 AVTGNPRIRTRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVG 246 + R + + + Q E + G++ ++SG +RR+ + +G Sbjct: 179 FLQTGIGYENRDASFLTRFQAMEM-GHQQYVTVGLSEDGDMASLSGSSCVWRRACVESLG 237 Query: 247 YWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLK 306 W+ +TED+D+ ++ Q W Y + ++PET QR RW +G K Sbjct: 238 GWNASTVTEDVDLGYRAQFGNWKYAYMRDVVSMSVLPETTSAFRVQRKRWGRGLIHSAFK 297 Query: 307 NMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAG 366 + +++R+ M L + + +++ L I + + ++ + A Sbjct: 298 HARQMFRQRMPLMQRL---HAIAMMFSSLLLASIYILLLLCLPLSYLVDFEGLGIHWVAW 354 Query: 367 ILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKK 426 + + + + +W ++ + L F M Sbjct: 355 LFFGFVAIWGLNNVFAARKGAHIGEQRGILRTLWETYLYVAMFLPMAWYYFAGGMRALFG 414 Query: 427 QRARWVSPDRGILR 440 + +G + Sbjct: 415 VHGDFHRTLKGGSK 428 >UniRef50_D2L608 Glycosyl transferase family 2 n=1 Tax=Desulfovibrio sp. FW1012B RepID=D2L608_9DELT Length = 555 Score = 249 bits (636), Expect = 2e-64, Method: Composition-based stats. Identities = 100/432 (23%), Positives = 182/432 (42%), Gaps = 36/432 (8%) Query: 13 LVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLK 72 L + + L +F E + F F P F+ + F++ + Sbjct: 136 LGILLALVGVWFIKKESIFYFWFDVPVFLYGL----CALFYLLSRFFFAALYRPYPVDPG 191 Query: 73 DNPSISIIIPCFNEEKNVEETIHAALAQRYEN--IEVIAVNDGSTDKTRAILDRMAAQ-- 128 P++S++IPCFNEE +E+TI L Q Y +EVI V+DGS+D + A+L+R+ + Sbjct: 192 YKPAVSVVIPCFNEEVWIEKTIRGCLNQNYPEELLEVIVVDDGSSDGSVAVLERVQRKVF 251 Query: 129 ---IPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPR 185 L V +N+GK A+ GA AK E +V +D D+ L D+ IV+P+ NP+ Sbjct: 252 EEVGERLFVHVFPENRGKRHAMAEGARRAKGEVVVFVDSDSFLQPDSILQIVQPL-KNPK 310 Query: 186 VGAVTGNPRIRTR-STLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAE 244 +GA TG + + + + ++Q Y + K + I+ V +SG +AA+R+ L Sbjct: 311 IGAATGRCEVENKWTNTLTRMQAVRYFIGFRIFKAAESIFDAVTCLSGPLAAYRKDVLLR 370 Query: 245 -VGYW-------SDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRW 296 +G W +D ++ + + + Y+ A+ ++P T + + Q++RW Sbjct: 371 YLGAWVNQRYLGQPATFGDDRSLTNYVLGSHY-AVYQHTAITHTIVPSTYRQFYIQQMRW 429 Query: 297 AQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIEL 356 + L+ +W KE F + L + L F+ P Sbjct: 430 KRSWLRESLRAALFMWYKEPFMAISFYLGVILPLLAPVVVLRAFVFVPAAYGVWP----T 485 Query: 357 THIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVS 416 ++A +L+CT LL NL +F + + + +++ Sbjct: 486 MYVAGVFFMSMLMCTSYLLI----------KRSNLWLYGVPFCFFYLFVLLWQIVWAVLT 535 Query: 417 FTRVMLMPKKQR 428 F + + + Sbjct: 536 FWKSEWGTRASK 547 >UniRef50_A5FW11 Cellulose synthase (UDP-forming) n=1 Tax=Acidiphilium cryptum JF-5 RepID=A5FW11_ACICJ Length = 758 Score = 248 bits (635), Expect = 2e-64, Method: Composition-based stats. Identities = 79/436 (18%), Positives = 140/436 (32%), Gaps = 30/436 (6%) Query: 14 VLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKD 73 F + + + + I + G V P Sbjct: 85 WRLTDTLPFGFGTINIALGILLVLAELHGIGSAIMGQIINVLPLDRTP---QHFTLDPAS 141 Query: 74 NPSISIIIPCFNEEKNVEET-IHAALAQRYE--NIEVIAVNDGSTDKTRAI--------- 121 P++ I+IP +NE+ + ET + AA Y V ++DG TD RA Sbjct: 142 CPTVDILIPTYNEDPEIVETTVIAATQIDYPASRFRVYILDDGGTDAKRAKPGAAGREAA 201 Query: 122 -----LDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYI 176 L +A + + + KA + E ++ +D D + RD + Sbjct: 202 RRAETLREIARKFGAIYLTRPENVHAKAGNINHALGQIDGELILVLDCDHVPTRDFLSRT 261 Query: 177 VEPMLYNPRVGAVTGN-----PRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVS 231 V +P++ + P R+ ++ E +++ +G F Sbjct: 262 VGFFQADPKLFLLQTPHNFVTPDPIERNLSTFEVMPAENELFYHVMQPGLDFWGAAF-FC 320 Query: 232 GVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWK 291 G A RR L E+G S ITED + + K + + R + L PET G Sbjct: 321 GSAAVLRRHVLDEIGGISGKSITEDAETTVKAMCRGYRTAFYNRPMVSGLQPETFSGFIL 380 Query: 292 QRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVP 351 QR+RWAQG ++F+ + L F LF + ++ + I+ + + Sbjct: 381 QRVRWAQGMMQIFMLDNVWLKSGLTFMQRLLFTNFAFYWLFP---ISRSILLLMPPLFLF 437 Query: 352 LNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLA 411 + L L + S R S ++ + L Sbjct: 438 FGFNVAATTPRQLLVYALP-YYLAALVNSQYFYGRVRWPFISQVYETAQTVFLSAALVSV 496 Query: 412 TTLVSFTRVMLMPKKQ 427 + + PK + Sbjct: 497 LRRPRAPKFKVTPKGE 512 >UniRef50_A8FII3 Glycosyltransferase n=3 Tax=Bacillus RepID=A8FII3_BACP2 Length = 758 Score = 248 bits (635), Expect = 2e-64, Method: Composition-based stats. Identities = 68/431 (15%), Positives = 135/431 (31%), Gaps = 21/431 (4%) Query: 6 VSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGEN 65 + + +C F + F + +G + V+ W Sbjct: 35 IIIVLCVFTNAAYICWRLFFTLPHEGTFNIVMGILLVACECIGFLQLLVFYTLVWKPSNR 94 Query: 66 APA--PQLKDNPSISIIIPCFNEE-KNVEETIHAALAQRYEN--IEVIAVNDGSTDKTRA 120 L P++ I I +NE + ++ T+ + Y + + +DG RA Sbjct: 95 KQVMISDLDRLPTVDIFIATYNEPIEVLKRTVAGCVNLSYPKDKVHIYLCDDG----KRA 150 Query: 121 ILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPM 180 ++++A++ + KA L + E ++ +D D + V P Sbjct: 151 SVEQLASEFGVHYLTRTDNKFAKAGNLNHAMSQTNGELILTLDADMVPLPAFLEKTV-PY 209 Query: 181 LYNPRVGAVTGNPRIRTRSTLVGKI-----QVGEYSSIIGLIKRTQRIYGNVFTVSGVIA 235 ++ V + E + ++ + + V V G Sbjct: 210 FHDGATAFVQVPQAFYNEDPFQYNMFSKDRIPNEQDFFMQTLQAGKDRFNAVMYV-GSNT 268 Query: 236 AFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLR 295 FRR AL E+G ++ +ITED+ LQ + L L PE+ L KQR R Sbjct: 269 VFRRMALDEIGGFATGVITEDMATGMLLQGK-FKSVSVGEVLAVGLAPESWLDLLKQRDR 327 Query: 296 WAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIE 355 W++G + K R L+ + ++ F L I + + I+ Sbjct: 328 WSRGNIQCARKFNPLKVRGLTLMQRVLYLD---GIVYWFNGLFKMIYILTPILFLLFGIQ 384 Query: 356 LTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLV 415 L ++ ++ N+ S ++ + + Sbjct: 385 SFWADFQSIFMFWLPAF-FSSYLAFKLVSNQKRSMFWSHIYESSMAFHLAGVALSELFFK 443 Query: 416 SFTRVMLMPKK 426 ++ PK Sbjct: 444 KRVEFLVTPKG 454 >UniRef50_B6R4T3 Glycosyl transferase, family 2 n=1 Tax=Pseudovibrio sp. JE062 RepID=B6R4T3_9RHOB Length = 672 Score = 248 bits (635), Expect = 2e-64, Method: Composition-based stats. Identities = 80/432 (18%), Positives = 160/432 (37%), Gaps = 13/432 (3%) Query: 16 CIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGV-YFWVYRERHWPWGENAPAPQLKDN 74 + + +F VF+W F+ + +G + + + ++ P+L++ Sbjct: 236 LLTVSPIFFLGTIGFAFDVFYWALFILLTLSLGAIGLLRIASFFTYSDKQDFAVPELENW 295 Query: 75 PSISIIIPCFNEEKNVEETIHAALAQRYEN--IEVIAVNDGSTDKTRAILDRMAAQIPHL 132 P +I++P + E + + A A Y ++VI + + + T+ L ++ + + Sbjct: 296 PHYTILVPLYKESAICRQLVDALDALDYPKEALDVIFLVEQDDELTQKNLRKLLRKSMRM 355 Query: 133 RVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPM-LYNPRVGAVTG 191 ++ + Q K AL G AA K E++ D + + + L V + Sbjct: 356 IILPPGKPQTKPRALSVGLAATKGEFVTVFDAEDRPEPQQLKKAICQFALEGHDVACLQA 415 Query: 192 NPRI-RTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSD 250 I + + + EY+++ + + + G FR SAL +VG W Sbjct: 416 ALSIDHAKDGWLVRQFAFEYAALFDVFLPFLSRKNLLLPLGGTSNHFRVSALRKVGGWDP 475 Query: 251 DMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTR 310 +TED D++ + + + + P TLK QR RW +G + ++ Sbjct: 476 FNVTEDADLAVRFARQGFRTRTLNSS-TYEEAPLTLKAWLHQRTRWHKGWIQTLAVHLRN 534 Query: 311 LWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAA--THTAGIL 368 F LT CL + A+ A V + T + A + L Sbjct: 535 PRLTYKQLGATNFALLLLTFFGGMLCLWAAPLTALMFAEVLWGVHQTGLQAFDAFSIYAL 594 Query: 369 LCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQR 428 C L L V +++ + + I P+ +W+L + + ++ P Sbjct: 595 FCFLFGLGGTVVTVLQGSAKRGFRPRGWEIASIPL-YWVLGCIASYKALFEFIVKPHY-- 651 Query: 429 ARWVSPDRGILR 440 W + GI+R Sbjct: 652 --WRKTEHGIVR 661 >UniRef50_A9QSM4 N-acetylglucosaminyltransferase n=39 Tax=Lactobacillales RepID=A9QSM4_LACLK Length = 438 Score = 248 bits (635), Expect = 3e-64, Method: Composition-based stats. Identities = 83/439 (18%), Positives = 167/439 (38%), Gaps = 27/439 (6%) Query: 21 VAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISII 80 + + + ++ + + +M + +FW+ + E +L P I+I+ Sbjct: 1 MNFIQTIMILAIISIWIATGLGLMILFSATHFWLTTIK----KERQKVFRLATYPKITIV 56 Query: 81 IPCFNEEKNVEETIHAALAQRYEN--IEVIAVNDGSTDKTRAILDRMAA----QIPHLRV 134 +P NEE + +TI + + Y +E++ D D T + + + + +V Sbjct: 57 VPAHNEEVVIAQTIESIMNLNYPKTQLELLLYADNCQDDTYNQIISVVSLEKYKYISAQV 116 Query: 135 IHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNP-RVGAVTGNP 193 I GKA L + A E++ D DA+ +++A ++VE L NP + GAV G Sbjct: 117 IQRTGTGGKAGVLNDALSIATGEWICVYDADAMPEKNALHFLVEKALENPEKYGAVFGRN 176 Query: 194 RIRTRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDM 252 + R + + E + + + + G + + +G W + Sbjct: 177 KTRNYKQNFLTRCINLEIVNTQRIHHVGLWHLFKIGRIPGTNFIIKTDFVKHIGGWDNGA 236 Query: 253 ITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLW 312 +TED IS+K+ + I + + PET+ + QR RWA+G +V L N L+ Sbjct: 237 LTEDTAISFKIMQSGKLIALAHNSEAFQQEPETVSAYYHQRKRWAKGNYQVILDNFKHLF 296 Query: 313 RKENFRMWPLFFEYCLTTIWA-FTCLVGFIIYAVQLAGVPLNIELTHIAATHTA------ 365 + +R+ F Y T W ++ II+ + +++ T Sbjct: 297 DRSVWRIKWEVFYYISTFFWFNAAIVISDIIFVANIVTWGISMFNPSTHILFTIGSNNIE 356 Query: 366 --------GILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSF 417 L+ + LLQ ++L + S+ I + + ++ V Sbjct: 357 LANLLMINWFLMVGIYLLQINIALASQFGQLTVKNLSVALISYVTYAQLFIVVSVVSVVS 416 Query: 418 TRVMLMPKKQRARWVSPDR 436 + + + +W R Sbjct: 417 VLLDHILHRDGTKWYKTKR 435 >UniRef50_B8HQQ1 Cellulose synthase (UDP-forming) n=5 Tax=Cyanobacteria RepID=B8HQQ1_CYAP4 Length = 768 Score = 248 bits (634), Expect = 3e-64, Method: Composition-based stats. Identities = 71/438 (16%), Positives = 138/438 (31%), Gaps = 21/438 (4%) Query: 1 MINRIVSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHW 60 ++ ++ + ++ + + F + + V ++ + Sbjct: 106 VVGILLILTVRYVLWRSLSTLNLSDPLNGVFSLGLFLLEMLVVTLAVIQLFLILKIRPDQ 165 Query: 61 PWGENAP-----APQLKDNPSISIIIPCFNEEKNV-EETIHAALAQRYENIEVIAVNDGS 114 A P++ I++P +NE + TI A Y N ++ ++D Sbjct: 166 RRQREADHYALAVEAGDYLPNVDILVPTYNEPVEILRRTIIGCQALDYANKKIYLLDDTR 225 Query: 115 TDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAA 174 RA ++ +A ++ + KA L + + E +V D D + R+ Sbjct: 226 ----RAQVEDLAGELGCEYMARPDNRHAKAGNLNHALSQTQGELIVVFDADFIPTRNFLK 281 Query: 175 YIVEPMLYNPRVGAVTGNPRIRTRSTL-----VGKIQVGEYSSIIGLIKRTQRIYGNVFT 229 + +P+V V + + I + ++ + G+V Sbjct: 282 RTIG-FFQDPQVALVQTPQSFYNPDPIAHNLGLENIITHDEEVFYRHMQEFRDAVGSVVC 340 Query: 230 VSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGL 289 G RRSAL V ++ D ITED +L Q + Y L L E + Sbjct: 341 A-GTSFVVRRSALEGVRGFATDSITEDYFTGIRLSAQQHRLVYLDEKLSAGLAAEDIAAF 399 Query: 290 WKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAG 349 QR RWA+G + F L + + F + LA Sbjct: 400 ATQRFRWARGTLQAFFIQANPL--TIPGLTPVQRLAHLEGLLSWFGTWSRVGFLLMPLAY 457 Query: 350 VPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLS 409 L++ LL L+ V + R L +F ++ + + Sbjct: 458 SFLHVIPVRATTRELMYFLLP-YVLVNLTVFSWLNKRSRSALLGEVFSLVLVFPLAITV- 515 Query: 410 LATTLVSFTRVMLMPKKQ 427 L F++ + K Sbjct: 516 FQVMLNPFSKGFNVTPKG 533 >UniRef50_A4YD58 Glycosyl transferase, family 2 n=12 Tax=Sulfolobaceae RepID=A4YD58_METS5 Length = 395 Score = 248 bits (633), Expect = 4e-64, Method: Composition-based stats. Identities = 87/407 (21%), Positives = 163/407 (40%), Gaps = 32/407 (7%) Query: 37 WPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHA 96 +SI+ + VY + W + P+ PS S+++P NEEK + + Sbjct: 5 IVIGLSIIVSIWSVYNSAFAIYGLSW--KSDEPKTSSGPSFSLLVPVRNEEKVLGRLLER 62 Query: 97 ALAQRY--ENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQ---GKAIALKTGA 151 + Q Y E+I + DGSTD T + ++ + ++ +HL ++ GK+ AL G Sbjct: 63 LVNQEYDRSKYEIIVLEDGSTDNTLGVCNKFSEMYSIIKCVHLEKSNVVNGKSRALNYGL 122 Query: 152 AAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRT-RSTLVGKIQVGEY 210 ++ + + D D + D Y+ + + N RVG V G R ++V ++ E Sbjct: 123 KISRGDIIGVFDADTVPRLDVLGYVAQKFISNSRVGGVQGRLVPINVRESIVARLASLE- 181 Query: 211 SSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTI 270 + G + G + RR AL +VG W+++++TED+D+S KL + I Sbjct: 182 ELFSEYSISGRARAGLFVPLEGTCSFVRRDALEKVGGWNENVLTEDLDLSLKLTSLNYLI 241 Query: 271 FYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTT 330 Y P W +P T L +QRLRW +G E+ ++ + Sbjct: 242 VYSPSVQSWREVPVTFSSLVRQRLRWYRGNFELTMRISRFKFTWR--------------- 286 Query: 331 IWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHN 390 + LVG ++ V + + ++ ++ ++ +MI R+ Sbjct: 287 LVDAAMLVGTPVFMVLSLANYSLVFIYSYQLHVLIAAIISFSSMMTLLLIIMISRRH--- 343 Query: 391 LTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRG 437 + I+ + L+ +L ++L W +R Sbjct: 344 ----MIETIYIILSALYLNFTISLH-LISIVLELAGAPKGWSKTERS 385 >UniRef50_Q6ACB6 Glucosaminyltransferase n=1 Tax=Leifsonia xyli subsp. xyli RepID=Q6ACB6_LEIXX Length = 421 Score = 248 bits (633), Expect = 4e-64, Method: Composition-based stats. Identities = 89/421 (21%), Positives = 157/421 (37%), Gaps = 19/421 (4%) Query: 26 SGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFN 85 + + + V ++ +V + H A P ++I++P +N Sbjct: 6 ALQWIGLVVCLILALTGLIPVVAAAATFFVIPLHAWINHYHKAAP--YLPRVAIVVPAWN 63 Query: 86 EEKNVEETIHAALAQRYEN--IEVIAVNDGSTDKTRAILDRMAAQIP-HLRVIHLAQNQ- 141 E + +I + Y + + V+D STD T ++ A + P ++ + + Sbjct: 64 EGAVIGASIDRLVTLDYPKEALRIYVVDDASTDDTSVVVRERAMRYPGNVFLFRREKGGQ 123 Query: 142 GKAIALKTGAAAAKS----EYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRT 197 GKA L G + E L+ +D D + D+ + L +P+VGAV+ + Sbjct: 124 GKAHTLNHGIERIFADDWMEALLIMDADVIYQPDSLRKMTR-HLADPKVGAVSAYIHEGS 182 Query: 198 RS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGY-WSDDMITE 255 + K EY +R Q + G ++G R L +G + E Sbjct: 183 ADRNYLTKFVSTEYVLSQPTARRAQNVLGAQACLAGGAQLHSRENLIAIGGQVDTSTLAE 242 Query: 256 DIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKE 315 D +++ QL + +EP A P T+ LWKQRLRWA+G V + +R Sbjct: 243 DTITTFETQLRGKRVVFEPHAHVLAEEPRTIDALWKQRLRWARGNVSVTARYSKVWFRPS 302 Query: 316 NFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLL 375 I F+ + +I + G+ + L + AT IL + L Sbjct: 303 RKHH----LGAWTFGIVWFSLWLLPLIMVISSIGLAGLLFLHNGFATTVFRILWISAALA 358 Query: 376 QFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPD 435 V ++ + ++ FP I ML + T L F +ML + + S Sbjct: 359 YLFVLILGSQTDVGTTRKEVGHVMLFPGIVNMLVMLTAL--FPTMMLTGFRGCSASGSGA 416 Query: 436 R 436 R Sbjct: 417 R 417 >UniRef50_C7PF18 Glycosyl transferase family 2 n=9 Tax=Bacteroidetes RepID=C7PF18_CHIPD Length = 486 Score = 247 bits (632), Expect = 5e-64, Method: Composition-based stats. Identities = 91/471 (19%), Positives = 165/471 (35%), Gaps = 59/471 (12%) Query: 25 HSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPS---ISIII 81 G VF + + I + + V+ + + + L +P I+++ Sbjct: 8 TIGRFYESTVFIYGTVLLITYALLAVFSMISVRAYARKDKFHQENILLSSPLAPGITVLA 67 Query: 82 PCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRA--------------------- 120 P FNE + + + L Y E+I VNDGSTD + Sbjct: 68 PAFNEGLTIIFNVRSLLTLNYPLYEIIIVNDGSTDNSLEQMIDEFELVPVEFAYNAKIQT 127 Query: 121 -----ILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAY 175 I L VI + KA A+ G AA ++ VC D D +LD++ Sbjct: 128 KPVRKIYKSTNPAYAKLMVIDKVNGKSKADAVNAGINAALYDHFVCTDVDCILDKNTLLE 187 Query: 176 IVEPMLYNPR--VGAVTGNPRIRT--------------RSTLVGKIQVGEYSSIIGLIKR 219 +++P++ + V A RI S L+ + Q EY L K Sbjct: 188 LIKPVMQEEKKRVIATGATLRIANSCEFDQGVMTRMRPPSQLLPRFQEVEYIRAFVLGKM 247 Query: 220 TQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLN------QWTIFYE 273 + V VSG + F + G + ED+++ ++ + I Y Sbjct: 248 GWSLLNCVPNVSGGLGLFDKEIAIRSGGYDHSSFGEDMELMTRMCRYAKDNKIDYAIRYI 307 Query: 274 PRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTT--I 331 P+ LCW P ++K +QR RWA+G A++ + + +M + F Y + Sbjct: 308 PKTLCWTEAPASVKIFNRQRTRWARGLAQLMYAHFGMFFNPRYGKMGLIIFPYNFFFELL 367 Query: 332 WAFTCLVGFIIYAVQ--LAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEH 389 G I Y V L + L + +T +++ T+ +L ++ RY Sbjct: 368 APIIEFTGIIYYIVMACLGLINWPTALLLLVFVYTYSVMITTIAILWDQLTF----RYYK 423 Query: 390 NLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRGILR 440 + F++ + + +++ W + R + Sbjct: 424 TWKEVALLCTTPFLEFFLYHPLIVIFALRGYYYFLTGKKSTWGNMQRRGFQ 474 >UniRef50_Q4KHI6 Glycosyl transferase, group 2 family protein n=11 Tax=Pseudomonadaceae RepID=Q4KHI6_PSEF5 Length = 863 Score = 247 bits (632), Expect = 5e-64, Method: Composition-based stats. Identities = 81/433 (18%), Positives = 149/433 (34%), Gaps = 21/433 (4%) Query: 3 NRIVSFFILCLVLCIPLCVAYFHSGELMMRFVF--FWPFFMSIMWIVGGVYFWVYRERHW 60 R FI L + + + Y +S + F + + + + + + Sbjct: 341 GRTFLTFIAFLCGSVLVWIGYDYSQQYSTWFSLTVGFLLALGALGVFIVLLTEAHELAEA 400 Query: 61 PWGENAPAP------QLKDNPSISIIIPCFNEEKN-VEETIHAALAQRYENIEVIAVNDG 113 W P +SI +PC+NE V++T++A Y + EV+ +++ Sbjct: 401 VWIHKRRREFLPVEGDSSYRPKVSIHVPCYNEPPEMVKQTLNALANLDYPDFEVLIIDNN 460 Query: 114 STDK-TRAILDRMAAQ-IPHLRVIHLAQNQG-KAIALKTGAAAA--KSEYLVCIDGDALL 168 + D + P + H+A G K AL +E + ID D + Sbjct: 461 TKDPAVWEPVQAYCETLGPRFKFFHVAPLAGFKGGALNYLLPHTAKDAEVIAVIDSDYCV 520 Query: 169 DRDAAAYIVEPMLYNPRVGAVTGNPRIRTRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNV 227 DR+ ++V P +P++ V R ++ + K+ EY + T+ + Sbjct: 521 DRNWLKHMV-PHFADPKIAIVQSPQDYRDQNESTFKKLCYAEYKGFFHIGMVTRNDRDAI 579 Query: 228 FTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLK 287 G + RR+ L E+G W+D I ED ++ ++ + Y + LMP+T Sbjct: 580 IQ-HGTMTMTRRTVLEELG-WADWCICEDAELGLRVFEKGLSAAYHHESYGKGLMPDTFI 637 Query: 288 GLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTC-LVGFIIYAVQ 346 KQR RWA G ++ ++ L R +N + + L + + Sbjct: 638 DFKKQRFRWAYGAIQIIKRHTASLLRGKNTELTRGQRYHFLAGWLPWVADGMNIFFTVGA 697 Query: 347 LAGVPLNIELTHIAATHTAGILLCTLCLLQFIVS--LMIENRYEHNLTSSLFWIIWFPVI 404 L I + + L L F V + + R F + Sbjct: 698 LLWSAAMIIVPQRVDPPLLIFAIPPLALFVFKVGKIIFLYRRAVGVNLKDAFCAALAGLA 757 Query: 405 FWMLSLATTLVSF 417 L F Sbjct: 758 LSHTIAKAVLYGF 770 >UniRef50_D2QUU5 Glycosyl transferase family 2 n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QUU5_9SPHI Length = 508 Score = 247 bits (630), Expect = 8e-64, Method: Composition-based stats. Identities = 73/399 (18%), Positives = 157/399 (39%), Gaps = 14/399 (3%) Query: 46 IVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYEN- 104 + + + ++ ++A + P +++ +P +NE VE I A + +Y Sbjct: 24 LSLIISYLRSEKKRRALAQSAADYSPEALPRLTVQLPVYNELYVVERLIDAVVLLKYPKD 83 Query: 105 -IEVIAVNDGSTDKTRAI----LDRMAAQIPHLRVIHLAQNQG-KAIALKTGAAAAKSEY 158 +++ ++D STD+T +I + Q + I + +G KA AL G AK E+ Sbjct: 84 KLDIQVLDD-STDETVSIIARKVAEYKKQGFDIEHIRRPERKGFKAGALAYGLTLAKGEF 142 Query: 159 LVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRS-TLVGKIQVGEYSSIIGLI 217 + D D + D + V P +P+V V +L+ ++Q ++ + Sbjct: 143 VAIFDADFVPDPEFLLKTV-PHFADPKVAIVQTRWEHLNEDFSLITQLQAFGLNAHFTVE 201 Query: 218 KRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRAL 277 + + G + +G +R+ A+A+ G W D +TED+D+S++ QL W Y Sbjct: 202 QSGRYAAGLLANFNGTGGVWRKVAIADAGGWQSDTLTEDLDLSYRAQLRGWKFVYREDVG 261 Query: 278 CWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCL 337 +P + L Q+ RW +G AE K + + + + F+ Sbjct: 262 SPAELPVAMNALKSQQYRWMKGAAECARKLFVNVLKTPGVSLSMKLHAF----FHLFSSA 317 Query: 338 VGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFW 397 ++ + + VPL + +++ I+ ++ Sbjct: 318 TFILVLILGVMSVPLIYIRSQHPEWEWVFVVINLFQFNLLILITFYGIPVWFLKGANKAR 377 Query: 398 IIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 + W+ ++ L + +L + V+ ++ +V + Sbjct: 378 LAWYFPMYSSLMMGLSLHNTIAVIEGYMGRKTPFVRTPK 416 >UniRef50_A3ZTV9 Cellulose synthase (UDP-forming) n=2 Tax=Blastopirellula marina DSM 3645 RepID=A3ZTV9_9PLAN Length = 1168 Score = 245 bits (627), Expect = 2e-63, Method: Composition-based stats. Identities = 61/431 (14%), Positives = 138/431 (32%), Gaps = 21/431 (4%) Query: 6 VSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGEN 65 + ++ + + F++ + + G + ++Y + W Sbjct: 17 FVIVLAAIMSTVYISYRLFYTLNFESYYAIAASVTLLGAETWGVILLFLYFMQVWDSIVL 76 Query: 66 APAPQLKDNPSISIIIPCFNEE-KNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDR 124 P P L + ++ + +P +NE+ + + +I+A Y + ++DG + + Sbjct: 77 DPVPPLAER-TVDVFLPTYNEDVQLLRGSINALNQLDYPH-RTYVLDDGRRAEVAELCQE 134 Query: 125 MAAQIPHLRVIHLAQN-QGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYN 183 M + I N KA L E++V D D + + + + + Sbjct: 135 MGVEY-----ISRENNLHAKAGNLNHALDRTDGEFVVIFDADHIARSNFISRTLG-YFAD 188 Query: 184 PRVGAVTGNPRIRTRSTLVGK------IQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAF 237 ++ + I E +I+ + + G A F Sbjct: 189 EKLAFIQTPHAFYNFDNFQSHVDYKRGIYWEEGQLFYNVIQPGKTSWNA-NVFCGSAAIF 247 Query: 238 RRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWA 297 RR+AL EVG + + ITED+ ++ W + + + QRLRW Sbjct: 248 RRTALEEVGLIATETITEDMHTGLRIHAKGWKSLFVNERMIAAQAAPDITSFASQRLRWG 307 Query: 298 QGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELT 357 +G +F + + + L + +I + L + Sbjct: 308 EGNLSIFAYDNPLTMKGLTIGQRLNYLGSMLGWTTGPAKALLYIAPILML----FSGVSP 363 Query: 358 HIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSF 417 T ++ ++ + +I N Y + + F + ++ A Sbjct: 364 VGHFNATLLLITLAYLIVTWTTVKLISNGYGRLWDMEVQAMTNFWIQCCCVARAMLNRGR 423 Query: 418 TRVMLMPKKQR 428 + ++ K+ R Sbjct: 424 GKFVVTSKRGR 434 >UniRef50_Q2JNJ5 Glycosyl transferase, group 2 family protein n=6 Tax=Bacteria RepID=Q2JNJ5_SYNJB Length = 764 Score = 245 bits (626), Expect = 2e-63, Method: Composition-based stats. Identities = 67/423 (15%), Positives = 137/423 (32%), Gaps = 12/423 (2%) Query: 25 HSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCF 84 + ++ + + SI++ + H E A +D P +I++P + Sbjct: 333 WTTLTILVLLINLFYVASILFKLLLSLVGSADRFHQITDEEVAALDDRDLPIYTILVPVY 392 Query: 85 NEEKNVEETIHAALAQRYE--NIEVIAVNDGSTDKTRAILDRMA-AQIPHLRVIHLAQNQ 141 E + + I + Y ++V+ + + + T + L ++ ++ + Sbjct: 393 KEPEVMPILIKSLSKLDYPHERLDVLILLEENDRDTIEAARAAKPPRYVRLLLVPDSKPK 452 Query: 142 GKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLY-NPRVGAVTGNPRIRTRS- 199 K A G A A+ EYL D + + D D V +P + V RS Sbjct: 453 TKPKACNYGLAFARGEYLTIYDAEDIPDPDQLKKAVIAFRKGDPSLVCVQAALNYFNRSE 512 Query: 200 TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDI 259 + ++ EYS + + G FR L E+ W +TED D+ Sbjct: 513 NFLTRMFTLEYSYWFDYLLPGLETLRMPIPLGGTSNHFRTDRLRELQGWDPFNVTEDADL 572 Query: 260 SWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRM 319 + + +T+ + +K +QR RW +G + +L + R Sbjct: 573 GIRASQHGYTVGVINST-TYEEANCAVKNWIRQRSRWIKGYMQTWLVHNRNPLRSLRKLG 631 Query: 320 WPLFFEY-CLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFI 378 + Y FT L I++ + + + L L+ Sbjct: 632 LKNWLSYQFFIGGSFFTFLTSPIMWLLFIYWLLTRAHWLQNLFPSWLVYLGLFNLLVGNA 691 Query: 379 VSLMIE-NRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRG 437 + + + L + I+W L ++ ++ P W G Sbjct: 692 IGIYLNLVAVFRRGYYDLAFYALLNPIYWQLHSMAAYMALWQLFTKPFY----WEKTIHG 747 Query: 438 ILR 440 + + Sbjct: 748 LSK 750 >UniRef50_C1ZJA9 Glycosyl transferase n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZJA9_PLALI Length = 533 Score = 245 bits (626), Expect = 3e-63, Method: Composition-based stats. Identities = 80/427 (18%), Positives = 166/427 (38%), Gaps = 20/427 (4%) Query: 30 MMRFVFFWPFFMSIMWIVGGVYFWVYRERHWP--WGENAPAPQL---KDNPSISIIIPCF 84 M+ + F++++ V + + R+W W + P + + P+++I +P F Sbjct: 1 MLLPLIVTLLFVTLVNTVFQLTQFDLAYRYWRSVWKKQKPTSRPIDREHLPAVTIQLPMF 60 Query: 85 NEEKNVEETIHAALAQRYENIEVIA-VNDGSTDKTRA----ILDRMAAQIP--HLRVIHL 137 NE + A Y + + D STD + IL+ + P ++ +H Sbjct: 61 NESIIAPRILEAVSRIDYPRDRLQVQILDDSTDHSPEIIAGILEELRQSQPELNIEYLHR 120 Query: 138 AQNQG-KAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIR 196 QG KA AL+ E++ D D + D +++ P +P V V Sbjct: 121 TDRQGFKAGALQAAMPLVTGEFIAIFDADFIPQPDFLTHLL-PYFDSPEVAVVQSRWGHL 179 Query: 197 TR-STLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITE 255 +++ + Q + + + G T +G ++RSA+ G WS D + E Sbjct: 180 NAHDSVLTQAQQFFLDGHHSVEQNGRNRAGYFITFNGTAGIWQRSAMEAAGGWSADTLVE 239 Query: 256 DIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKE 315 D+D+S++ Q + I Y + +P ++ GL Q RW +G A+V LK + ++W++ Sbjct: 240 DLDLSYRTQSLGYRIVYVEDYVTPGELPNSVSGLRVQLFRWFKGNAQVGLKILGKVWKQP 299 Query: 316 NFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIE--LTHIAATHTAGILLCTLC 373 + + ++A ++ ++ + +PL + H + + Sbjct: 300 LPLSVKI---HATAQLFAPFTMLSSLVMLLITGALPLILHAAPEHAGLVKLCYMGFVWVP 356 Query: 374 LLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVS 433 + + E L ++ +F + + S V+ K+ +WV Sbjct: 357 AVLLVYGTPRIRFDEGPWYIRLAKLVPRTFVFMAMMTGLSCQSSIAVLEAVFKRANQWVV 416 Query: 434 PDRGILR 440 +G + Sbjct: 417 TPKGFSQ 423 >UniRef50_D2LZZ6 Glycosyl transferase family 2 n=1 Tax=Bacillus cellulosilyticus DSM 2522 RepID=D2LZZ6_BACS4 Length = 457 Score = 245 bits (625), Expect = 3e-63, Method: Composition-based stats. Identities = 102/451 (22%), Positives = 178/451 (39%), Gaps = 62/451 (13%) Query: 33 FVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEE 92 F+ F+ I I Y ++ + + +SI+IP +NEEK + Sbjct: 2 FLQISLIFVLIFVIFQTFYMFIPLLTIKGLNKYKKVDKEN---GVSILIPAYNEEKVILN 58 Query: 93 TIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAA------------------------- 127 + + Y N+E I +NDGSTD+T IL + Sbjct: 59 CLQGIVNLNYNNVEAIFINDGSTDRTLDILTQHLQLVPVYNRLPAVKIPHEAIIDIYAST 118 Query: 128 QIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVG 187 P + VI +N GKA AL G A+ EY+V +D D++LD+++ +++ + + Sbjct: 119 LYPKVLVID-KKNGGKADALNAGIEYARKEYVVTLDADSILDKESL-HVINSTFSDNHIL 176 Query: 188 AVTGNPRIRTR-------------STLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVI 234 A+ G +I + + + Q+ +Y + + K TQ ++ ++G Sbjct: 177 AIGGMVQIGQGFQGNYLKPRPTFINPGIIRFQILQYFTAFYMHKYTQSKLKSITVIAGAF 236 Query: 235 AAFRRSALAEVGYWSDDMITEDIDISWKLQL------NQWTIFYEPRALCWILMPETLKG 288 AFRR L EV + I ED+DI+ K+Q + + P+ALC+ PET + Sbjct: 237 GAFRRKVLFEVDGFR-KTIGEDMDITLKIQRLIKTKYKNHRLIFVPQALCYTECPETFRA 295 Query: 289 LWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFF--EYCLTTIWAFTCLVGFIIYAVQ 346 L+ QR+RW + + F + K R+ F L TI AFT L+ + Sbjct: 296 LFNQRVRWQKAFVDCFFMYRKNFFHKLGVRLSLFFLGDGVLLGTINAFTALLIPLFLIFN 355 Query: 347 LAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRY-EHNLTSSLFWIIWFPVIF 405 + + L I IL + VS + +Y + +I V + Sbjct: 356 RDHYMVALGLFTITF-----ILGIFQSIATIEVSKRLGIKYTRMDYLRISLFIPLEIVTY 410 Query: 406 WMLSLATTLVSFTRVMLMPKKQRARWVSPDR 436 +L + + + + +W +R Sbjct: 411 RLLGILFVTIGTFMFFV----NKDKWDKANR 437 >UniRef50_B3QIT3 Glycosyl transferase family 2 n=6 Tax=Bradyrhizobiaceae RepID=B3QIT3_RHOPT Length = 944 Score = 245 bits (625), Expect = 4e-63, Method: Composition-based stats. Identities = 89/433 (20%), Positives = 164/433 (37%), Gaps = 21/433 (4%) Query: 17 IPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHW----------PWGENA 66 L + F M+ + ++ + + V F R Sbjct: 405 YFLFGSAFALTLGMILLIPLVAIALARIEEISAVAFGRKPRRLITRALTDAQETKRAAAI 464 Query: 67 PAPQLKDNPSISIIIPCFNEEKN-VEETIHAALAQRYENIEVIAVNDGSTDK--TRAILD 123 + + + P +SI +P + E +++T+ A Y N EV+ + + + D T+ I D Sbjct: 465 ASGEPVNVPKVSIHVPAYFEPPEMLKQTLDALARLDYPNFEVVVIINNTPDPAFTQPIQD 524 Query: 124 RMAAQIPHLRVIHLAQNQG-KAIALKTGAAAA--KSEYLVCIDGDALLDRDAAAYIVEPM 180 + I+ + +G KA AL+ +E + ID D ++ D +V P Sbjct: 525 HCRELGERFKFINAEKVKGFKAGALRIAMERTAVDAEIIGIIDADYVVTPDWLKDLV-PA 583 Query: 181 LYNPRVGAVTGNPRIRTRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRR 239 +PRVG V R +L+ I GEY+ + + Y + V G + RR Sbjct: 584 FDDPRVGLVQAPQEHRDGDRSLMHYIMNGEYAGFFDIGMVQRNEYNGI-IVHGTMCLIRR 642 Query: 240 SALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQG 299 +A+ G WS D I ED D+ ++ + W Y + L+P+T + KQR RWA G Sbjct: 643 AAMDMAGGWSSDTICEDSDLGLEIMEHGWLTHYTNTRYGYGLLPDTYEAFKKQRHRWAYG 702 Query: 300 GAEVFLKNMTRLWRKENFRMWPLFFEYCLTTI-WAFTCLVGFIIYAVQLAGVPLNIELTH 358 G ++ K+ R + E+ L + W +G ++ + L VP+ Sbjct: 703 GFQIIKKHWRRFLPGNSRLSRDQRREFGLGWLNWLGAESLGVVVAILNLIWVPIVAFADI 762 Query: 359 IAATHTAGILLCTLCLLQFIVSLMIEN-RYEHNLTSSLFWIIWFPVIFWMLSLATTLVSF 417 + + ++ L++ R + + L +I + W +S A Sbjct: 763 AIPDKILTLPIIASFIVTLAHFLVLYRLRVKIGVPQMLGAMIAAMSVQWTVSRAVAQGLI 822 Query: 418 TRVMLMPKKQRAR 430 T + + + Sbjct: 823 TEHLAFARTSKGG 835 >UniRef50_A1SGB7 Glycosyl transferase, family 2 n=1 Tax=Nocardioides sp. JS614 RepID=A1SGB7_NOCSJ Length = 772 Score = 244 bits (624), Expect = 5e-63, Method: Composition-based stats. Identities = 88/455 (19%), Positives = 164/455 (36%), Gaps = 31/455 (6%) Query: 8 FFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWV----------YRE 57 ++ + L + M R + + ++ + G + +R Sbjct: 99 VYLYVVYLVFMFEWTFASPLGAMGRAGALSLWGLEVLAALLGCAYLWELCDAMGSEQWRR 158 Query: 58 RHWPWGENAPAPQLKDNPSISIIIPCFNEEKN-VEETIHAALAQRYENIEVIAVNDGSTD 116 R E A A P +S+ +P E V ET+ + YE+ E+IA++D +TD Sbjct: 159 RVSDGVEPARAADAGIRPFVSLHVPAHEEPPEMVIETLESLRGLDYEHYEIIAIDDNTTD 218 Query: 117 KT-RAILDRMAAQIPHLRVIHLAQNQG-KAIALKTGAAAA---KSEYLVCIDGDALLDRD 171 ++ ++ A ++ HL G K+ AL ++E + +D D L+ D Sbjct: 219 ESLWRPVEAWCAAHG-VKFAHLEDWPGYKSGALNYALREMTDDRAELIGVVDSDYQLEPD 277 Query: 172 AAAYIVEPMLYNPRVGAVTGNPRIRT-RSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTV 230 A P+ +PRVG + R + Y + + ++ Sbjct: 278 FLARC-APLFADPRVGFIQSPQDYRDWEGAPFYRRLYYSYKYFFSVSQPSRNERDGA-IF 335 Query: 231 SGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLW 290 +G + RR AL +VG W + ITED ++S ++ W+ + + +MP T + L Sbjct: 336 AGTMGLIRRQALEDVGGWDEWCITEDAELSLRVLRAGWSGMHVDASFGHGVMPLTFEALK 395 Query: 291 KQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFE----YCLTTIWAFTCLVGFIIYAVQ 346 QR RW GG ++ + L L Y + + LVG + + Sbjct: 396 GQRFRWCFGGIQILRMHWRSLLPGLRGGGNRLTLGQRWAYLSGGLQWYGDLVGVLFFLFL 455 Query: 347 LAG---VPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPV 403 L G + L L + ++ L LL F+ ++ + R S Sbjct: 456 LGGATNLALGGGLLFRKLSPFLLAVIPLLVLLGFLRAVSLIRRGTGAGWSDAIGAFLI-- 513 Query: 404 IFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRGI 438 W + + + + K + R + G+ Sbjct: 514 --WQSTTLVVARASVQGLFARKAEFLRTPKTEEGL 546 >UniRef50_C7XH18 Glycosyltransferase n=1 Tax=Lactobacillus crispatus 125-2-CHN RepID=C7XH18_9LACO Length = 442 Score = 244 bits (623), Expect = 5e-63, Method: Composition-based stats. Identities = 88/405 (21%), Positives = 171/405 (42%), Gaps = 14/405 (3%) Query: 24 FHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPC 83 F L M F W I + + ++ H + P P P +SIIIP Sbjct: 2 FDQTVLQMGFWATWLLIPVIFEEIPMIINFIRLLHHHKIRKKTPLPD--FLPVLSIIIPT 59 Query: 84 FNEEKNVEETIHAALAQRYEN--IEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQ 141 +N K + + + Y + + V++GS D T + + P L + L Sbjct: 60 YNSSKTLYRCLKSLADSDYPKNLLGIFVVDNGSKDDTFQVFAKAQTDYPQLLMRWLTSPH 119 Query: 142 GKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTG-----NPRIR 196 GK+ AL A+S+Y++ ID D LD+ A IV+ NP + A+TG I+ Sbjct: 120 GKSSALNAAIYKARSQYIINIDSDGRLDKHALNNIVKKFSANPNIDALTGTILTDRHLIK 179 Query: 197 TRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITED 256 L+ + EY + + ++FT+SG +AF+R L +S D I+ED Sbjct: 180 KPHKLLKLNEYFEYCQTFLAGRNIENNSNHIFTLSGAFSAFKRETLIRTYLYSTDTISED 239 Query: 257 IDISWKLQ-LNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKE 315 D++++++ + TI + A+ + L+ QR RW +G EV K ++++ + Sbjct: 240 TDMTFQIRYNLKGTISFCDDAIFYSEPISGFSELYTQRQRWQRGELEVINKYLSQVLSWK 299 Query: 316 ----NFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCT 371 NF++ L ++ +T + FI+ + + L I + + + Sbjct: 300 QFFSNFQIRRLIIDHTVTFLKIIWMFAIFILLGFGYSWISLLFSYAVIYLLYLFLEISSS 359 Query: 372 LCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVS 416 +C+ ++ S E ++ +F ++++ ++ + + + Sbjct: 360 ICVYIYLGSFPEERKFYLKNIWMIFLLLFYNLLCSFIRCVGIINA 404 >UniRef50_C6QIA2 Cellulose synthase (UDP-forming) n=1 Tax=Hyphomicrobium denitrificans ATCC 51888 RepID=C6QIA2_9RHIZ Length = 656 Score = 244 bits (623), Expect = 6e-63, Method: Composition-based stats. Identities = 75/431 (17%), Positives = 137/431 (31%), Gaps = 29/431 (6%) Query: 13 LVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGEN--APAPQ 70 L + + + + F ++++ F + Sbjct: 54 LWRIFETIPSPENPIDFLTGLTFTAVEAIAMLGATASQIFLTRTRNRTAEADANVKRLLA 113 Query: 71 LKDNPSISIIIPCFNE-EKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQI 129 P I ++I +NE E ++ TI AL Y N + +DG R+ L + + Sbjct: 114 KPQVPKIDVLICTYNEGEDILDRTIIGALNINYPNFRIWVCDDGR----RSWLKELCDRR 169 Query: 130 PHLRVIHLAQNQGKAIALKTGAAAA-----KSEYLVCIDGDALLDRDAAAYIVEPMLYNP 184 + KA + K +++ +D D + D V + +P Sbjct: 170 GIGYLTRPDNAHAKAGNINAALKNLAALDEKPDFISILDADFIPLPDFLTRAVS-LALDP 228 Query: 185 RVGAVTGNPRIRTRSTL-----VGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRR 239 VG V + V K+ E ++ ++ + F G + R Sbjct: 229 DVGVVQTPQHFFNPDPIQSNLAVTKVWPDEQRFFFDIVMASKDAWNAAFC-CGTSSVIRF 287 Query: 240 SALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQG 299 AL +G + D +TED +S +L+ + Y L L PE L QR RW+ G Sbjct: 288 DALQSIGGFPTDSVTEDYLVSLRLREKGFRTVYLNEILSLGLAPEGLAEYTTQRARWSLG 347 Query: 300 GAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHI 359 ++ + L W+ T L + V + L+I+ H Sbjct: 348 FVQICRGHSGPLKWGNGLPFVDRIMLCETFLHWSATHLFRVLGIIVPALYLILDIQAVHA 407 Query: 360 AATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTR 419 L + + I + E R + ++ ML + L S Sbjct: 408 NVIDAIWYLAPFIVVQSAIYVWLTEGR----------VLPLMTDLYQMLCASEVLKSVWS 457 Query: 420 VMLMPKKQRAR 430 +L P Q+ + Sbjct: 458 GLLRPNGQKFK 468 >UniRef50_A6CMH6 Glycosyl transferase, group 2 family protein n=2 Tax=Bacillus RepID=A6CMH6_9BACI Length = 449 Score = 244 bits (623), Expect = 6e-63, Method: Composition-based stats. Identities = 101/455 (22%), Positives = 175/455 (38%), Gaps = 64/455 (14%) Query: 30 MMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKN 89 M + F+S + ++ +Y E ++ N S S+I+P +NEEK Sbjct: 1 MTMLYITFAVFLSFV-----IFQTIYVLIPLFKNEKKHKVRITKNHSFSVIVPAYNEEKV 55 Query: 90 VEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRM------------------------ 125 +E I Y+N E++ VNDGS D T +L + Sbjct: 56 IENCIEGFRQVEYKNAELLIVNDGSKDNTLRVLHEVLQLKPASRMIDTRLKHERVTGFYQ 115 Query: 126 AAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPR 185 + + P + VI +N GKA +L G +++E +V +D D++L++++ + Sbjct: 116 SKKFPSIFVI-NKKNGGKADSLNAGINFSENEIVVTLDADSILEKNSLNEMNAAFQK-KS 173 Query: 186 VGAVTGNPRI---RTRS----------TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSG 232 V A GN I S V K Q +Y S L KR Q ++ ++G Sbjct: 174 VLACGGNVLISQAFNGSLENLKPTLKVNNVIKFQFLQYLSSFYLHKRAQAAMNSITVIAG 233 Query: 233 VIAAFRRSALAEVGYWSDDMITEDIDISWKLQL----NQWT--IFYEPRALCWILMPETL 286 AFRR L + + + ED+DI+ KL I + P ALC+ P TL Sbjct: 234 AFGAFRRETLFAIKGFR-KTVGEDMDITLKLHKWIKERGNKESITFVPTALCYTECPSTL 292 Query: 287 KGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFF--EYCLTTIWAFTCLVGFIIYA 344 K ++KQR+RW + + + +RK + F ++ + T+ AF L+ ++ Sbjct: 293 KNMFKQRVRWQKAFIDCLIYYRKCYFRKLPKKFSIFFLVDQFTIGTLSAFPVLLTPVLLY 352 Query: 345 VQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIW--FP 402 + L + L A +L L VS + ++ + Sbjct: 353 FNGGQLTLLLMLMSGAF-----LLFVYQSLTTMFVSYLHGLKFSKRDLLRFLLFLPIDII 407 Query: 403 VIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRG 437 V +++L+ + P W +R Sbjct: 408 VFRMLINLSFVVYGTLSYFHSPH----AWDKLERS 438 >UniRef50_Q9FNI7 Glucomannan 4-beta-mannosyltransferase 2 n=8 Tax=Spermatophyta RepID=CSLA2_ARATH Length = 534 Score = 244 bits (623), Expect = 6e-63, Method: Composition-based stats. Identities = 80/403 (19%), Positives = 159/403 (39%), Gaps = 21/403 (5%) Query: 15 LCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQL--K 72 + +PL + LM + +M I+ ++ + FW ++ + + +L Sbjct: 36 VIVPLLQLAVYICLLMSVMLLCERVYMGIVIVLVKL-FWKKPDKRYKFEPIHDDEELGSS 94 Query: 73 DNPSISIIIPCFNEEKNVEETIHAALAQRYENIE-VIAVNDGSTDKTRA-----ILDRMA 126 + P + + IP FNE + + +I AA + + VI V D STD T R A Sbjct: 95 NFPVVLVQIPMFNEREVYKLSIGAACGLSWPSDRLVIQVLDDSTDPTVKQMVEVECQRWA 154 Query: 127 AQIPHLRVIHLAQNQG-KAIALKTGAAAA---KSEYLVCIDGDALLDRDAAAYIVEPMLY 182 ++ ++R G KA ALK G + EY+V D D + D + +++ Sbjct: 155 SKGINIRYQIRENRVGYKAGALKEGLKRSYVKHCEYVVIFDADFQPEPDFLRRSIPFLMH 214 Query: 183 NPRVGAVTGNPRIRTRST-LVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSA 241 NP + V R L+ ++Q + + F +G +R +A Sbjct: 215 NPNIALVQARWRFVNSDECLLTRMQEMSLDYHFTVEQEVGSSTHAFFGFNGTAGIWRIAA 274 Query: 242 LAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGA 301 + E G W D ED+D++ + L W Y +P T + Q+ RW+ G A Sbjct: 275 INEAGGWKDRTTVEDMDLAVRASLRGWKFLYLGDLQVKSELPSTFRAFRFQQHRWSCGPA 334 Query: 302 EVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAA 361 +F K + + R + R W + + + + ++ + +PL I + + Sbjct: 335 NLFRKMVMEIVRNKKVRFWKKVY-VIYSFFFVRKIIAHWVTFCFYCVVLPLTILVPEVKV 393 Query: 362 THTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVI 404 + + ++ ++++ ++ +WI++ V+ Sbjct: 394 PIWGSVYIPSI------ITILNSVGTPRSIHLLFYWILFENVM 430 >UniRef50_A0YM95 UDP-glucose-beta-D-glucan glucosyltransferase n=1 Tax=Lyngbya sp. PCC 8106 RepID=A0YM95_9CYAN Length = 758 Score = 243 bits (621), Expect = 9e-63, Method: Composition-based stats. Identities = 69/433 (15%), Positives = 140/433 (32%), Gaps = 25/433 (5%) Query: 15 LCIPLCVAYFHSGELMMRFVFFWPFFMSIM-WIVGGVYFWVYRERHWPWGE----NAPAP 69 + ++ + + M F+ ++ WI GG+ W+ + + Sbjct: 115 YILWRTLSTLNLSDPMNGTFSLGLLFLEVLIWISGGIRLWLVLKMKDRSQQANQMEINVR 174 Query: 70 QLKDNPSISIIIPCFNEEKNV-EETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQ 128 + K P++ I+IP + E ++ + TI A Y N +V ++D + + + + Sbjct: 175 EGKFLPTVDILIPTYQESTSIVKRTIIGCQAIDYPNKKVYLLDDSQRLEVKQLAQELGCN 234 Query: 129 IPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGA 188 + + KA L E +V D D + R+ + N +V Sbjct: 235 Y----IARSENSHAKAGNLNHALPLTSGELIVVFDADFIPTRNFLDRTIG-FFQNKKVAL 289 Query: 189 VTGNPRIRTRSTL-----VGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALA 243 V + + K+ + + I+ + + G RR AL Sbjct: 290 VQTPQSFYNPDPIAKNLGLEKVLISDDEEFYRHIELLKDGVNSTVCA-GTSFVMRRKALE 348 Query: 244 EVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEV 303 E+G + + I+ED +L + + Y L L E + + QR RW +G + Sbjct: 349 EIGGFVTESISEDYFTGIRLSAQGYQLVYLREKLSAGLAAEDITNYFVQRKRWGRGTLQA 408 Query: 304 FLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATH 363 F N L + + + + + LA L + Sbjct: 409 FFINSNPL--TIPGLKLSQRLAHLEGLVNWLSVWAYPYFFLMPLAYAFLGVLPIQATVEE 466 Query: 364 TAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIW-----FPVIFWMLSLATTLVSFT 418 + +Q IV + R + S ++ + V ML+ T Sbjct: 467 WLYFFIPC-YWVQIIVFRWLNERSRSVIISEIYKTVLCFPLSLSVFLVMLNPFKHKFQVT 525 Query: 419 RVMLMPKKQRARW 431 + + ++ W Sbjct: 526 QKGISSQRFSFNW 538 >UniRef50_Q04QL4 Glycosyltransferase plus another conserved domain n=6 Tax=Leptospira RepID=Q04QL4_LEPBJ Length = 517 Score = 243 bits (621), Expect = 9e-63, Method: Composition-based stats. Identities = 71/407 (17%), Positives = 139/407 (34%), Gaps = 17/407 (4%) Query: 19 LCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSIS 78 L V + FF I + + + + P ++ Sbjct: 2 LTVVTVLFLA-IYGIDIVALFFFGIHTYIMVYLYKKNYAYCESEPDKILDVNDPNLPVVT 60 Query: 79 IIIPCFNEEKNVEETIHAALAQRYEN--IEVIAVNDGSTDKTRAILDRMAAQIPHLRV-I 135 + +P FNE V+ I +A +Y +E+ ++D STD+T + L I Sbjct: 61 VQLPIFNEFYVVDRLIETTVALKYPKDKLEIQLLDD-STDETIEKSRNLINHYKALGFDI 119 Query: 136 HLAQNQG------KAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAV 189 H G KA AL+ G A+ EY+ D D + D D V P +P++G V Sbjct: 120 HHLHRAGAERTGHKAGALEAGMKVARGEYIAIFDADFMPDPDFLIKTV-PYFEDPQIGMV 178 Query: 190 TGNPRIRTRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYW 248 ++ K Q + + + +G +++ + + G W Sbjct: 179 QVRWGHINADYNVLTKAQSFGIDGHFMIEQVARNGSHLWMNFNGTAGIWKKECIIDSGGW 238 Query: 249 SDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNM 308 D +TED D+S++ ++ W Y C +P + Q+ RW +G + +K + Sbjct: 239 EHDTLTEDFDLSYRAEMRGWKFRYFKDIECKAEIPAMISAYKSQQFRWCKGSIQTAVKLL 298 Query: 309 TRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIEL----THIAATHT 364 R++R + L +V I+++ L + I Sbjct: 299 PRIFRADLPWKIKSEAIVHLINYSVHPLMVINILFSAPLLLMNYWSGFDFYDLPIEILMG 358 Query: 365 AGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLA 411 +L + I + + + ++ +I +++ Sbjct: 359 TAAVLSVGSVGPMIFYAYSQKILHKDWKRRMVYLPILIMIGTGIAIV 405 >UniRef50_B7K7L6 Cellulose synthase (UDP-forming) n=1 Tax=Cyanothece sp. PCC 7424 RepID=B7K7L6_CYAP7 Length = 718 Score = 243 bits (621), Expect = 1e-62, Method: Composition-based stats. Identities = 77/439 (17%), Positives = 154/439 (35%), Gaps = 25/439 (5%) Query: 3 NRIVSFFILCLVLCIPLCVAYFHSGEL---MMRFVFFWPFFMSIMWIVGGVYFWVYRERH 59 +R+V +L ++C L F + L + F M ++ I+ + ++ + Sbjct: 93 SRLVIVSLLVFLICRYLVWRGFFTLNLNNPLNSVFSLGLFLMELILILSYTFKICFQLKI 152 Query: 60 WPWGENAPA-----PQLKDNPSISIIIPCFNEEKNV-EETIHAALAQRYENIEVIAVNDG 113 P A A + K PS+ I IP +NE ++ + T+ A Y +V ++D Sbjct: 153 KPRHREASAMSIAVIEGKYTPSVDIFIPTYNESLHILKRTVIGCQAIDYFRKKVYLLDDS 212 Query: 114 STDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAA 173 + + + + + + + KA L E + D D + + Sbjct: 213 QRTEVKKLAEELGCEY----MTRPDNKHAKAGNLNHALTKTSGELIAVFDADFVPTTNFL 268 Query: 174 AYIVEPMLYNPRVGAVTGNPRIRTRSTL-----VGKIQVGEYSSIIGLIKRTQRIYGNVF 228 V ++ V + + + E ++ I+ + + Sbjct: 269 IRTVG-FFQKSKIALVQTYQSFYNPDPVARNLGLEDLIPRENEALFRHIQLIKDSLESAI 327 Query: 229 TVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKG 288 G RRS L EVG + + ++ED +L + + Y +L L E Sbjct: 328 CA-GTSFIVRRSHLEEVGGFVTESLSEDFFTGIRLSSQGYKVIYLGESLSAGLSAEMTNA 386 Query: 289 LWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLA 348 +Q+LRW++G + F L W Y I + L+ I + Sbjct: 387 YIQQQLRWSRGTLQAFFIKENPL--TIPGLNWRQRLIYLEGIIHWCSFLIQAIFLIFPVF 444 Query: 349 GVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWI-IWFPVIFWM 407 + L I ++ + L L+Q S + R L ++++ I + FP+ + Sbjct: 445 YLLLGIIPFNVTINEFIYYFIP-LYLVQLFTSSWLSYRSVSVLINNIYLINLCFPLAVNV 503 Query: 408 LSLATTLVSFTRVMLMPKK 426 + + S + + PK Sbjct: 504 IHTLLSPFSV-KFRVTPKG 521 >UniRef50_B4VZ62 Type IV pilus assembly protein PilZ n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VZ62_9CYAN Length = 727 Score = 243 bits (621), Expect = 1e-62, Method: Composition-based stats. Identities = 72/418 (17%), Positives = 132/418 (31%), Gaps = 20/418 (4%) Query: 5 IVSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPW-- 62 ++ + + + H + ++ + I+ + + V+ Sbjct: 90 LLILMTRYFIWRTIATLNFDHLATATFSLLIYFIESLGILSFILHIPQSVWSNAKQRSAQ 149 Query: 63 --GENAPAPQLKDNPSISIIIPCFNE-EKNVEETIHAALAQRYENIEVIAVNDGSTDKTR 119 + K PS+ + +P +NE E V T+ A Y N +V ++D R Sbjct: 150 ADRYSQDILSGKYQPSVDVFVPTYNEPESVVSRTVIGCQAMDYANKKVYILDDTRRPNIR 209 Query: 120 AILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEP 179 A+ + + + KA L + + E + +D D + ++ V Sbjct: 210 ALAKELGCEY----ITRPDNQHAKAGNLNSALPKTQGELIAIMDADFVPFKNFLTRTVGF 265 Query: 180 MLYNPRVGAVTGNPRIRT-----RSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVI 234 L P++ V R+ V I + ++ G + + +V G Sbjct: 266 FLQ-PKIALVQTPQAFYNPDHHARNLGVDHILYDDLANFFGFSLSCRDVTNSVLC-CGTS 323 Query: 235 AAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRL 294 RR AL EVG ++ + ED S + W + Y L T KQR Sbjct: 324 YVVRRMALEEVGGYNTICLAEDSPTSTTMLTRGWRLIYLNEVLSMGESTRTYVDFLKQRT 383 Query: 295 RWAQGGAEVFLKNMT-RLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLN 353 RW ++F +W N+ FF + L T F L + L + L Sbjct: 384 RWHHSNYQIFCCGDKIPIWSTMNWLQKSYFFTFFLGT---FDPLFRTVFMFTPLMSLILG 440 Query: 354 IELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLA 411 I L + L+ E I+ FP + ++ Sbjct: 441 ISPIVSTTPEMIYYFLPWILLMAGSTGWATEYCGSFFWNEVYQTILCFPTLKCLIFAI 498 >UniRef50_Q9LJP4 Xyloglucan glycosyltransferase 4 n=75 Tax=Embryophyta RepID=CSLC4_ARATH Length = 673 Score = 243 bits (620), Expect = 1e-62, Method: Composition-based stats. Identities = 72/427 (16%), Positives = 144/427 (33%), Gaps = 21/427 (4%) Query: 20 CVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISI 79 S +F ++ +G + + E+ P + I Sbjct: 144 IAPLVISLSRFCTVLFLIQSLDRLVLCLGCFWIKFKKIEPKLTEESIDLEDPSSFPMVLI 203 Query: 80 IIPCFNEEKNVEETIHAALAQRYENIEVIA-VNDGSTDKTRAILDR-----MAAQIPHLR 133 IP NE + E++I AA + ++ V D S D +L + A + ++ Sbjct: 204 QIPMCNEREVYEQSIGAASQLDWPKDRILIQVLDDSDDPNLQLLIKEEVSVWAEKGVNII 263 Query: 134 VIHLAQNQG-KAIALKTGAA---AAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAV 189 H G KA LK+ E++ D D + D V NP +G V Sbjct: 264 YRHRLIRTGYKAGNLKSAMTCDYVKDYEFVTIFDADFTPNPDFLKKTVPHFKGNPELGLV 323 Query: 190 TGNPRIRTRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYW 248 + L+ ++Q + ++ ++ N F +G +R AL E G W Sbjct: 324 QARWSFVNKDENLLTRLQNINLCFHFEVEQQVNGVFLNFFGFNGTAGVWRIKALEESGGW 383 Query: 249 SDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNM 308 + ED+DI+ + LN W Y +PE+ + KQ+ RW G ++F + Sbjct: 384 LERTTVEDMDIAVRAHLNGWKFIYLNDVEVTCELPESYEAYKKQQHRWHSGPMQLFRLCL 443 Query: 309 TRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGIL 368 + + + ++ F + + +PL + + Sbjct: 444 PSIIKSK--ISVWKKANLIFLFFLLRKLILPFYSFTLFCIILPLTMFIPEAELP------ 495 Query: 369 LCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFW--MLSLATTLVSFTRVMLMPKK 426 L +C + +SL+ + + ++++ + ++ + L F Sbjct: 496 LWIICYVPIFISLLNILPSPKSFPFLVPYLLFENTMSITKFNAMISGLFQFGSAYEWVVT 555 Query: 427 QRARWVS 433 ++ S Sbjct: 556 KKTGRSS 562 >UniRef50_A5FY21 Glycosyl transferase, family 2 n=2 Tax=Alphaproteobacteria RepID=A5FY21_ACICJ Length = 642 Score = 243 bits (620), Expect = 1e-62, Method: Composition-based stats. Identities = 68/419 (16%), Positives = 136/419 (32%), Gaps = 17/419 (4%) Query: 30 MMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAP---APQLKDNPSISIIIPCFNE 86 ++ + + ++ GV V R NA A + D P ++++P + E Sbjct: 207 LLGIMATVSALYASLFAFRGVLTIVGSGRRTDISVNASELAALKDGDLPVFTVLVPMYRE 266 Query: 87 EKNVEETIHAALAQRYE--NIEVIAVNDGSTDKTRAILDRMAAQIP-HLRVIHLAQNQGK 143 + + + + Y ++V V + +T + A+ + + +Q + K Sbjct: 267 AEVLPILVDSIRRLDYPRAKLDVKLVLEAGDTETIEAAKALGAEDLFEIIRVPDSQPKTK 326 Query: 144 AIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYN-PRVGAVTGNPRIRTRS-TL 201 A A+ EY V D + + D +V + P V V R Sbjct: 327 PKACNYALRFARGEYTVIFDAEDSPEPDQLRKVVALFNASGPEVACVQARLNYFNRDDNF 386 Query: 202 VGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISW 261 + ++ EYS + + G FR L E+G W +TED D+ Sbjct: 387 LTRMFTLEYSQWFDYLLPGLYRLNIPIPLGGTSNHFRTEVLHELGAWDPYNVTEDADLGI 446 Query: 262 KLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWP 321 +L + + + L QR RW +G + +L +M R Sbjct: 447 RLTQAGYRVAVVNST-TFEEANGVLHSWINQRSRWIKGYMQTWLVHMRRPVELYRRLGPV 505 Query: 322 LFFEYCLTTIWAFTC-LVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLL--QFI 378 F + + + L+ +++ + L V + +L ++ Sbjct: 506 GFLGFHMFIGFPPMTALINPLLWIMFLVSVIVGRSAVAGFFPGPVLVLALFDLMVGNAMY 565 Query: 379 VSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRG 437 V I + + W + P +W+L + +++ P W G Sbjct: 566 VYFNIVAVAKRRWYGLVPWGLLAPA-YWVLHSVAAYKALLQLITNPHY----WEKTTHG 619 >UniRef50_Q182N2 Cellulose synthase catalytic subunit [UDP-forming] n=8 Tax=Clostridium RepID=Q182N2_CLOD6 Length = 768 Score = 243 bits (620), Expect = 1e-62, Method: Composition-based stats. Identities = 72/446 (16%), Positives = 140/446 (31%), Gaps = 31/446 (6%) Query: 4 RIVSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGV-YFWVYRERHWPW 62 +I+ + + L ++ + +F + + I+G + F Y Sbjct: 7 KILITLTIITNVIYILWRIFYTVPKEEGKFALICAIILLFVEIMGMMEMFVHYYGMSNIE 66 Query: 63 GENAPAPQLKDNPSISIIIPCFNEE-KNVEETIHAALAQRYE---NIEVIAVNDGSTDKT 118 P + P + + I +NE V +T++ + +Y + + +DG+ ++ Sbjct: 67 YPEKPIISEELYPHVDVFIATYNESVDLVRKTVNGCIHMQYPDKKKVHIYICDDGNREEM 126 Query: 119 RAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVE 178 + ++M + + KA L S + D D + D V Sbjct: 127 CILAEKMGVNY----ITRTEREGAKAGNLNNAMQHTNSPLIATFDADMIPMHDFLIATVP 182 Query: 179 PMLYNP------------RVGAVTGNPRIRTRSTLV-----GKIQVGEYSSIIGLIKRTQ 221 L N +VG V + E I+ + Sbjct: 183 YFLKNEQAKKDGEKEEYEKVGFVQTPQSFYNPDLFQFNLHSERRIPNEQDYFYRDIQLAR 242 Query: 222 RIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWIL 281 +V G R AL EV + ITED +Q + + P L Sbjct: 243 NRTNSV-IYGGSNTVISREALEEVDGFYTYSITEDFATGILIQSKGYRCYAIPEVHASGL 301 Query: 282 MPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFI 341 P LK L KQR RWA+G + + R F Y + + + + F+ Sbjct: 302 SPTDLKSLIKQRERWARGCIQTGRRLNILFRRGLGF---WQKISYISSITYWYASIKRFV 358 Query: 342 IYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWF 401 + N+ + L L + + +N + ++++ I F Sbjct: 359 YIMAPILFSVFNVIVVKCTLLQVLVFWLPMYILSSLSLKIFSQN-IRNTRWTNIYETIMF 417 Query: 402 PVIFWMLSLATTLVSFTRVMLMPKKQ 427 + + L T +S + + K + Sbjct: 418 QSLMPAVILETFAISKNKFSVTNKSK 443 >UniRef50_D1J3P1 Whole genome shotgun sequence of line PN40024, scaffold_74.assembly12x (Fragment) n=16 Tax=Spermatophyta RepID=D1J3P1_VITVI Length = 449 Score = 242 bits (619), Expect = 1e-62, Method: Composition-based stats. Identities = 77/374 (20%), Positives = 141/374 (37%), Gaps = 24/374 (6%) Query: 74 NPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIA-VNDGSTDKTRA-----ILDRMAA 127 P + + IP +NE++ + +I AA + + +I V D STD T R A+ Sbjct: 12 YPMVLVQIPMYNEKEVYQLSIGAACGLSWPSERIIIQVLDDSTDPTIKDLVEMECQRWAS 71 Query: 128 QIPHLRVIHLAQNQG-KAIALKTGAAAA---KSEYLVCIDGDALLDRDAAAYIVEPMLYN 183 + +++ G KA ALK G + + +Y+ D D + D + +++N Sbjct: 72 KGINIKYEIRGNRHGYKAGALKEGMKHSYVKQCDYVAIFDADFQPEPDFLWRTIPFLVHN 131 Query: 184 PRVGAVTGNPRIRTRST-LVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSAL 242 P +G V + L+ ++Q + + F +G +R SAL Sbjct: 132 PEIGLVQARWKFVNSDECLMTRMQEMSLDYHFTVEQEVGSSSHAFFGFNGTAGVWRISAL 191 Query: 243 AEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAE 302 E G W D ED+D++ + L W Y +P T K Q+ RW+ G A Sbjct: 192 NEAGGWKDRTTVEDMDLAVRASLKGWKFVYVGALKVKNELPSTFKAYRYQQHRWSCGPAN 251 Query: 303 VFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAAT 362 +F K + R + +W Y + + + +V I+ + V Sbjct: 252 LFKKMAIEIARNKKVNLWKKV--YVIYSFFFVRKIVAHIVTFLFYCVVFPATVFFPEVEV 309 Query: 363 HTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVML 422 G + ++L+ +L +FWI++ V+ + AT + Sbjct: 310 PMWG-----AVYIPSTITLLNAVGTPRSLHLLVFWILFENVMSLHRTKATFMGLL----- 359 Query: 423 MPKKQRARWVSPDR 436 + WV ++ Sbjct: 360 -EVGRVNEWVVTEK 372 >UniRef50_A4SY23 Cellulose synthase (UDP-forming) n=1 Tax=Polynucleobacter necessarius subsp. asymbioticus QLW-P1DMWA-1 RepID=A4SY23_POLSQ Length = 909 Score = 242 bits (619), Expect = 2e-62, Method: Composition-based stats. Identities = 76/448 (16%), Positives = 151/448 (33%), Gaps = 29/448 (6%) Query: 4 RIVSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWG 63 R + + V C + F + F F + + + G F + ++ Sbjct: 61 RSIFIVLALFVSCRYIAWRIFDTLIYTGFFDFIGMALLFLAELYGFTLFLLDMFVNFSPI 120 Query: 64 ENAPAPQLKD---NPSISIIIPCFNEEKNVEE-TIHAALAQRYEN--IEVIAVNDGSTDK 117 + P K+ P++ I IP ++E + + T+ AA Y + + ++DG T Sbjct: 121 SDDIIPLSKEESLLPTVDIFIPTYDESEAIVRMTVTAATQINYPKNKLNIYILDDGGTHA 180 Query: 118 TRAI-------------LDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDG 164 R L R+A + + + KA + + + ++ +D Sbjct: 181 KRRSKESGAKAWRRHYSLRRLAKTLGVHYLTRETNQKAKAGNINHALQHTRGDLILILDC 240 Query: 165 DALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTR---STLVGKI--QVGEYSSIIGLIKR 219 D + +D V L +P++ V + ++ I + E I+ Sbjct: 241 DQVPTKDILQNTVGQFLSDPKMFLVQTPHFFINETPVNNVITGISNRPDESEMFYRKIQP 300 Query: 220 TQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCW 279 + F G A RRS L EVG + ITED + S L + + Y + + Sbjct: 301 AMNFWNAAF-FCGSAAVLRRSCLMEVGGIAIKTITEDCETSLILHAHGYNSSYINKPMIC 359 Query: 280 ILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVG 339 L PET Q RWA+G +V + R + +F + ++ F Sbjct: 360 GLSPETPSDYLTQHSRWAKGMLQVLMHYNPLFMRGLSLPQRLAYFASSYSWLFGFA---R 416 Query: 340 FIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWII 399 +I + A + L + + + + ++S S ++ + Sbjct: 417 YIFFLAPSAFLILGLNVYAANWHEMVDFTVPYALSILVVISYFFAGS-RQIFFSEIYETV 475 Query: 400 WFPVIFWMLSLATTLVSFTRVMLMPKKQ 427 + L + ++ PK + Sbjct: 476 KGFRMIRELMPVLLNPWKQKFLVTPKGK 503 >UniRef50_B4WM17 Glycosyl transferase, group 2 family protein n=1 Tax=Synechococcus sp. PCC 7335 RepID=B4WM17_9SYNE Length = 475 Score = 242 bits (618), Expect = 2e-62, Method: Composition-based stats. Identities = 89/464 (19%), Positives = 163/464 (35%), Gaps = 60/464 (12%) Query: 10 ILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAP 69 + + L A+ H FV +++ + + P P P Sbjct: 35 AAIALSALWLLTAFLHKVSWGHEFVLIATVVLAV--------YALRVIFARPQSPPIPLP 86 Query: 70 --------------------------QLKDNPSISIIIPCFNEEKNVEETIHAALAQRYE 103 K P +S+++ NEE+ + + + L Y Sbjct: 87 SVESLTQTSIDRASLEAVDEAGEIPLDSKSWPYVSLLVAAKNEEQVIGSLVESLLHIDYP 146 Query: 104 --NIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQN--QGKAIALKTGAAAAKSEYL 159 ++ ++D STD T ILD + + L VIH GK+ AL K + L Sbjct: 147 TDRYDLWIIDDYSTDATPEILDNLVKRHRQLNVIHRGPGAIGGKSGALNLVWPQTKGDLL 206 Query: 160 VCIDGDALLDRDAAAYIVEPMLYNP-----RVGAVTGNPRIRTRS-TLVGKIQVGEYSSI 213 D DA + D Y+V +P + GAV I + + Q E + Sbjct: 207 AVFDADAQVSSDLLRYVVPMF--DPEKGGKKTGAVQVRKAIANATKNFWTRGQKAEMALD 264 Query: 214 IGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYE 273 + +R + G + + G RR A+A+ G W+++ IT+D+D++ +L L QW I Sbjct: 265 CYMQQR-RIAVGGIGELRGNGQFVRREAIAQCGGWNEETITDDLDLTIQLHLQQWDIGLL 323 Query: 274 PRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWA 333 LW QR RWA+GG + +L +L + + + W Sbjct: 324 FAPAVGEEGVTNPLALWHQRNRWAEGGFQRYLDYWRQLGKNQLGWGK----SLDMLGFWI 379 Query: 334 FTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTS 393 ++ + L I L A I + +C+ + + ++ S Sbjct: 380 IQYMLPLVALP------DLVIALIRRQTPVYAPITILAVCMSGIGMFTTLRQSERTSVWS 433 Query: 394 SLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRG 437 ++ I + ++ + + +++R +WV G Sbjct: 434 AIVQTIRGSIYM---LHWLVVIGSMAIRISVRQKRLKWVKTAHG 474 >UniRef50_B6JD78 Glycosyl transferase, family 2 n=1 Tax=Oligotropha carboxidovorans OM5 RepID=B6JD78_OLICO Length = 669 Score = 241 bits (616), Expect = 3e-62, Method: Composition-based stats. Identities = 69/440 (15%), Positives = 145/440 (32%), Gaps = 22/440 (5%) Query: 15 LCIPLCVAYFHSGELMMRFVFFWPF--FMSIMWIVGGVYFWVYRERHWPWGENAPAPQLK 72 L + F S ++ + ++I +++ +P Sbjct: 218 WRYGLKLLAFTSAFMLAPTLLTQLSGAVLAIWFLLFNSLRLAGAFAGGERTPRSPRIPDA 277 Query: 73 DNPSISIIIPCFNEEKNVEETIHAALAQRYE--NIEVIAVNDGSTDKTRAILDRMAAQIP 130 P ++++ ++E +V + + A Y ++++ + + +T+A L R+ Sbjct: 278 QLPLYTVMVALYHEGPSVAHLVQSLAALDYPREKLDILLLLEADDIETQAALSRLHLPGN 337 Query: 131 HLRVIHLA-QNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYN-PRVGA 188 +I + K AL G +A+ E+ D + D ++ ++ V Sbjct: 338 VQTLIVPPFGPRTKPKALNAGLMSARGEFTAVFDAEDRPDPSQLRDAIDAFRHHHTDVAC 397 Query: 189 VTGNPRIRT-RSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGY 247 V + I + + + EY+ + R +G + G FR L EVG Sbjct: 398 VQASLCIDNSADSWLACMFTAEYAGQFDVFLRGFSQFGLPLPLGGSSNHFRTDVLREVGG 457 Query: 248 WSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKN 307 W +TED D+ ++L + + + P +QR RW +G + ++ + Sbjct: 458 WDAYNVTEDADLGFRLARRGYRAVMF-DSTTYEEAPAHTGAWLRQRSRWMKGWMQTWIVH 516 Query: 308 MTRLWRKENFRMWPLFFEY-------CLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIA 360 M R FF LT + + ++ A A + + Sbjct: 517 MRSPRRLIKQSGLAGFFTLNLLVGGNVLTALAYPILIAACLLEAGLAATGSTAVAMFSGP 576 Query: 361 ATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRV 420 + L +VSLM L + ++ P+ A ++ R Sbjct: 577 FIELHFTTIAAGYLSTVVVSLM--GLARRGLLRHAWVLVLTPLY-----WAWLSIAAWRA 629 Query: 421 MLMPKKQRARWVSPDRGILR 440 + W + G+ R Sbjct: 630 LRQLLHDPYHWEKTEHGLAR 649 >UniRef50_Q89FJ5 Bll6705 protein n=1 Tax=Bradyrhizobium japonicum RepID=Q89FJ5_BRAJA Length = 730 Score = 241 bits (616), Expect = 3e-62, Method: Composition-based stats. Identities = 71/380 (18%), Positives = 119/380 (31%), Gaps = 18/380 (4%) Query: 11 LCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQ 70 +C + Y FV F + + + R + Sbjct: 56 ICTTVLPASGSWYQVGWIWFCLFVELLTFTDQFILYLTFLRRTDRRSEADQHERRLRSLP 115 Query: 71 LKDNPSISIIIPCFNEE-KNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQI 129 + PS+ + IP ++E + +E+TI AL Y N +V ++DG R L + Q Sbjct: 116 DDELPSVDVFIPTYDEPFEVLEKTITGALCLGYPNFQVWVLDDGR----RPWLRELCKQK 171 Query: 130 PHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAV 189 + KA + E+ D D + R+ + +P++G V Sbjct: 172 GAGYLTRADNLNAKAGNINHALTKTSGEFFAIFDADFVPQRNFLMRTMG-FFADPKIGIV 230 Query: 190 TGNPRIRTRSTL-----VGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAE 244 + + + E I + + F G + RR+AL Sbjct: 231 QIPHTFYNEDPMQANLGLHRGLPDEQRFFFDSIMPARDGWDGAFC-CGSNSVTRRAALRA 289 Query: 245 VG-YWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEV 303 VG ITED+ ++ L + Y L L PE+L + QR RWA+G + Sbjct: 290 VGDALPTQSITEDMLLTLVLLRKGYVTRYLREPLAIGLAPESLDAFFVQRQRWARGAIQT 349 Query: 304 FLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAG-VPLNIELTHIAAT 362 L LFF T W L + V + + + Sbjct: 350 LYLADGPLGPGLGLMHRLLFFP----THWLSMGLRALAVVLVPIVFMWTGLAPVVDVTPE 405 Query: 363 HTAGILLCTLCLLQFIVSLM 382 L + L +S Sbjct: 406 SVFYYLFPAILALNGGISWF 425 >UniRef50_Q2RGJ6 Glycosyl transferase, family 2 n=7 Tax=Firmicutes RepID=Q2RGJ6_MOOTA Length = 466 Score = 241 bits (615), Expect = 5e-62, Method: Composition-based stats. Identities = 100/429 (23%), Positives = 183/429 (42%), Gaps = 20/429 (4%) Query: 1 MINRIVSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHW 60 ++ +S + L V F S L++ F+ + P ++ V + Sbjct: 52 LVWVSLSVILSLPWLRDLSEVIGFVSAILVITFIAYVPGYLVAFLAVSLLV--------- 102 Query: 61 PWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYE-NIEVIAVNDGSTDKTR 119 P + I+I+I NE N+ T+ Q Y I V+ V++GSTD T Sbjct: 103 --DRQPPFKVSDPDLPITILIAARNEASNIGMTLQYIANQNYRGPISVVLVDNGSTDGTA 160 Query: 120 AILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEP 179 + A ++ + ++ GK AL G A + Y + +D D LL R A +IV Sbjct: 161 DVALAKARELKLNLLCLREESPGKNFALNAGLAKIVTPYFITLDADTLLHRFAVKHIVAR 220 Query: 180 MLYN-PRVGAVTGNPRIRT-RSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAF 237 +L + P VGAV G+ +R R + +IQ +Y I IKRTQ +Y G + + Sbjct: 221 LLSSPPDVGAVAGHVLVRNSRDNFLTRIQEWDYFLGIASIKRTQGLYQGTLVAQGAFSLY 280 Query: 238 RRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWA 297 + A+ +V W D I EDI ++WKLQ + +++EP A+ + +P+ ++ +QR RWA Sbjct: 281 KTEAVRQVSGW-PDTIGEDIVLTWKLQQAGYRVYFEPTAVAFTAVPKVVRHFVRQRSRWA 339 Query: 298 QGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELT 357 +G E KN+ ++ + + + + I F LV + L Sbjct: 340 RGMVEGM-KNVPPWYQPSYLKKFLTGIDLVIPVIDLFYTLV----WIPGLILACFGKYYI 394 Query: 358 HIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSF 417 T L + L+ + L + + + + + + +I+ ++S +L + Sbjct: 395 VGPYTLFVLPLNIGVSLIMYTYQLRVFGQLDLRVRKNRLGFFAYVLIYQLISSPVSLWGY 454 Query: 418 TRVMLMPKK 426 + + ++ Sbjct: 455 VQELFQTRR 463 >UniRef50_B0C4P4 Glycosyl transferase, group 2 family protein n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0C4P4_ACAM1 Length = 730 Score = 240 bits (614), Expect = 7e-62, Method: Composition-based stats. Identities = 71/433 (16%), Positives = 134/433 (30%), Gaps = 23/433 (5%) Query: 7 SFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENA 66 S +L + + + + + I+ IV V + R A Sbjct: 82 SLLLLIVQYVVWRLFWTLNLSDTFNSIFSIGLLTLEIITIVTNVIQLILVSRSTNRLGEA 141 Query: 67 PAPQL-----KDNPSISIIIPCFNEEKNV-EETIHAALAQRYENIEVIAVNDGSTDKTRA 120 ++ P++ ++IP ++E + + TI A Y V ++D + +A Sbjct: 142 DINEMAVLDGDYLPTVDVMIPTYDEPSFILKRTIIGCQAMDYPRKTVYLLDDTRRPEIKA 201 Query: 121 ILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPM 180 + + + +I KA L E L D D + ++ V Sbjct: 202 LATELGCEY----IIRPDNLHAKAGNLNHALNFTSGELLTVFDADFVPTKNFLRRTVG-F 256 Query: 181 LYNPRVGAVTGNPRIRTRSTL-----VGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIA 235 N + + + + + I E + + G Sbjct: 257 FQNDSIALLQTHQSFFNPDPVARNIGLENILPQEVEIFSRYYQPLRDGVETALCY-GSSF 315 Query: 236 AFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLR 295 RRSALAE G ++ ++ED +L + + +L L E + QRLR Sbjct: 316 VARRSALAEAGGFATGTLSEDYFTGVRLSAKGKRVIFLGESLSAGLSAENMSAHITQRLR 375 Query: 296 WAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIE 355 WA+G + F L + + + FT + + LA + Sbjct: 376 WARGTLQAFFIEANPL--RIRGLKLIQRLAHLDGLLQWFTSVSRVGFLLMPLAYAFFGVI 433 Query: 356 LTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWII-WFPVIFWMLSLATTL 414 + L+Q + R L S ++ + FPV +L T Sbjct: 434 PWRATTQEMLYFFMP-YYLVQLFTFSWLNQRSRSALISDIYAVSQCFPVALTVLQ--TLF 490 Query: 415 VSFTRVMLMPKKQ 427 F++ + K Sbjct: 491 NPFSKGFKVTPKG 503 >UniRef50_Q2JKN6 Glycosyl transferase, group 2 family protein n=2 Tax=Synechococcus RepID=Q2JKN6_SYNJB Length = 493 Score = 240 bits (613), Expect = 8e-62, Method: Composition-based stats. Identities = 85/377 (22%), Positives = 148/377 (39%), Gaps = 28/377 (7%) Query: 74 NPSISIIIPCFNEEKNVEETIHAALAQRYE--NIEVIAVNDGSTDKTRAILDRMAAQIPH 131 P ++++IP NE + +H+ RY ++E+ A++D S+D T +L IPH Sbjct: 128 LPRVAVLIPAKNESAVLPRLLHSLTQLRYPTSHLELWAIDDNSSDATPEVLREAQKWIPH 187 Query: 132 LRVIHLAQNQG--KAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPM--------- 180 LRV +G K+ AL + E ++ D DA++ D A + Sbjct: 188 LRVYRRQPGRGGGKSGALNEVLPLTQGEIILVCDADAVVPSDFLARTLPLFVQVGSLRSR 247 Query: 181 LYNPRVGAVTGNPRIRTRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRR 239 VGAV + S QV E +S ++ + + + G RR Sbjct: 248 FSRRTVGAVQVRKALSNPSVNFWTLGQVAEMAS-DAYFQQQRVAVRGIGELRGNGQLVRR 306 Query: 240 SALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQG 299 L + G W++ +T+D+D+++KL L I + P + K LW QR RWA+G Sbjct: 307 DVLEKCGGWNEATLTDDLDLTFKLHLAGVDIAFLPEPAIVEEGVTSWKSLWHQRCRWAEG 366 Query: 300 GAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHI 359 G + +L + + + +WAF Y + +A VP + + Sbjct: 367 GYQRYLDYWPGILGRRMGMAKTV-------DLWAFFIS----QYLLPMALVPDTLWVLLT 415 Query: 360 AATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTR 419 + + L L + V+ R L W + +ML +++ T Sbjct: 416 GHSSVL-LPLNALVMGCLTVAFYRGLRQVEGLRGRRLWWHTALGLVYMLHWLPVMIA-TT 473 Query: 420 VMLMPKKQRARWVSPDR 436 + + +R RWV Sbjct: 474 ARMCVQPKRLRWVKTVH 490 >UniRef50_C7NQ00 Glycosyl transferase family 2 n=1 Tax=Halorhabdus utahensis DSM 12940 RepID=C7NQ00_HALUD Length = 475 Score = 240 bits (613), Expect = 9e-62, Method: Composition-based stats. Identities = 92/465 (19%), Positives = 161/465 (34%), Gaps = 60/465 (12%) Query: 30 MMRFVFFWPFFMSIMWIVGGVYFWVYRE---RHWPWGENAPAPQLKDNPSISIIIPCFNE 86 FV + +S+ ++ V + W P I I++P +NE Sbjct: 15 FGVFVVGYYALISVGYLFLHVLALFELREDVKESRWDPPFRKFSSPFYPGIGIVVPAYNE 74 Query: 87 EKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDR---------------------- 124 E + E++ + L+ Y EV+ VNDGSTD T L Sbjct: 75 EATIVESVRSMLSLNYPETEVVVVNDGSTDATLDRLREQFDLEAIDAEIPFDVPAEEIHG 134 Query: 125 -MAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYN 183 + ++ N GK+ AL G +D D ++DRDA IV P L Sbjct: 135 VYRSTTDEKLLVVDKDNGGKSDALNAGIWLTDMPLFCAVDADTIIDRDALLQIVTPFLKE 194 Query: 184 PRVGAVTGNP-RIRT--------------RSTLVGKIQVGEYSSIIGLIKRTQRIYGNVF 228 P +G R+ T + +QV EY + + Sbjct: 195 PTTAVASGGVIRVANDCTVEDGIVTDVSLPKTGLPGLQVMEYLRAFYSGRLGLNRINGLI 254 Query: 229 TVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQL------NQWTIFYEPRALCWILM 282 +SG F+ + E+G + D ITED DI +L ++T+ + P + W + Sbjct: 255 LISGAFGLFQTETVREIGGYRHDTITEDFDIVVRLHKHLTEQDREYTVDFVPEPVAWTEV 314 Query: 283 PETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFII 342 P + + L +QR RW +G E + N L+ + R+ + + + +G +I Sbjct: 315 PSSRRVLSRQRRRWYRGMMETVVTNRKMLFNRAYGRVGTVVMPF-----FVAAETLGPLI 369 Query: 343 YAVQLAGVPLNIELTHIAATHTAGILL------CTLCLLQFIVSLMIENRYEHNLTSSLF 396 + +P+ + T L L + NRY+ + Sbjct: 370 EGLGYVVLPVAWYFGALDVTFVLMFFLLTTGYGVFLSWFGVFSEVWSFNRYDSPW--QVL 427 Query: 397 WIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRGILRG 441 ++W+ V+ + R ++ + W +R G Sbjct: 428 RLLWYGVLENFGYRQWKTIVAWRGLIEYLRGVETWGVMERSGFAG 472 >UniRef50_A1SEL1 General secretory system II, protein E domain protein n=1 Tax=Nocardioides sp. JS614 RepID=A1SEL1_NOCSJ Length = 606 Score = 240 bits (613), Expect = 9e-62, Method: Composition-based stats. Identities = 76/441 (17%), Positives = 136/441 (30%), Gaps = 20/441 (4%) Query: 15 LCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPW-------GENAP 67 L V + M S++++V Y + R E Sbjct: 170 LVTTAIVVVMCAVIWPMETAIAVVAACSLLYLVVSFYKFRLTLRALGTHLETDVTDEEIA 229 Query: 68 APQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYE--NIEVIAVNDGSTDKTRAILDRM 125 A + P+ +I++P + E V + A Y ++V + + ++T + + Sbjct: 230 AIDERHLPTYTILVPLYKEAGIVPRLVRDINALDYPRTRLDVKLLCEEDDEETVQRIRDL 289 Query: 126 AA-QIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNP 184 HL V+ +Q + K A G A +Y V D + D D + P Sbjct: 290 QLPPHFHLVVVPDSQPKTKPKACNYGLQLATGDYCVIFDAEDRPDPDQLKKAIIAFSRVP 349 Query: 185 R-VGAVTGNPRIRTRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSAL 242 V V + ++ EYS L+ + + G F + L Sbjct: 350 ENVVCVQAKLNHFNQDQNMLTAWFANEYSMHFELVLPAMGAAESPIPLGGTSNHFVTAKL 409 Query: 243 AEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAE 302 E+G W +TED D+ +L + + + +QR RW +G + Sbjct: 410 RELGAWDPFNVTEDADLGIRLHREGYRTAMI-DSTTLEEANSQVPNWIRQRSRWNKGYIQ 468 Query: 303 VFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAAT 362 +L +M + + F + LT AF L+ I +A+ V Sbjct: 469 TWLVHMRAPFALLSQTGLKGFLSFNLTMGSAFVLLLNPIFWALTTLYVFTQAGFIEQLFP 528 Query: 363 HTAGILLCTLCLLQ--FIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRV 420 L + V L + + P+ +W L F ++ Sbjct: 529 GIIFYAASALLFVGNFVFVYLNVAGSLHRGEFGLTRTALLSPL-YWGLMSWAAWKGFIQL 587 Query: 421 MLMPKKQRARWVSPDRGILRG 441 P W G+ G Sbjct: 588 FTNPFY----WEKTVHGLDEG 604 >UniRef50_A5AGR2 Putative uncharacterized protein n=1 Tax=Vitis vinifera RepID=A5AGR2_VITVI Length = 695 Score = 240 bits (612), Expect = 1e-61, Method: Composition-based stats. Identities = 68/417 (16%), Positives = 141/417 (33%), Gaps = 18/417 (4%) Query: 17 IPLCVAYFHSGELMMRFVFFWPFFMSIM--WIVGGVYFWVYRERHWPW--GENAPAPQLK 72 + V Y L++ F + + + FW+ ++ P + Sbjct: 138 LSFRVDYIAPLVLILSKFCIVLFLIQSLDRLFLCFGCFWIKHKKLKPEMDADAYDIEDGS 197 Query: 73 DNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIA-VNDGSTDKTRAI-----LDRMA 126 P + + IP NE++ ++I A + ++ V D S D+ + + Sbjct: 198 SFPMVLVQIPMCNEKEVYAQSISAVCQLDWPRERLLIQVLDDSDDENVQLLIKNEVSSWN 257 Query: 127 AQIPHLRVIHLAQNQG-KAIALKTGAA---AAKSEYLVCIDGDALLDRDAAAYIVEPMLY 182 + ++ H G KA LK+ A E++ D D + D + Sbjct: 258 QKGVNIIYRHRFVRTGYKAGNLKSAMACDYVKDYEFVAIFDADFQPNPDFLKQTIPHFKG 317 Query: 183 NPRVGAVTGNPRIRTRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSA 241 NP VG V + L+ ++Q + ++ ++ F +G +R A Sbjct: 318 NPEVGLVQARWSFVNKDENLLTRLQNINLCFHFEVEQQVNGLFLEFFGFNGTAGVWRIKA 377 Query: 242 LAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGA 301 L E G W + ED+DI+ + L+ W + +PE+ + KQ+ RW G Sbjct: 378 LEESGGWLERTTVEDMDIAVRAHLHGWKFIFLNDVKVPCELPESYQAYKKQQHRWHSGPM 437 Query: 302 EVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAA 361 ++F + + + + P + + +Q+ + Sbjct: 438 QLFRLCLPAIVTSKVSSLPPPLITAKFSIQIYMQPFTYSMFLFLQMGIWKKANLIFLFFL 497 Query: 362 THTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFT 418 + + L I+ L + ++ I + P+ +L++ SF Sbjct: 498 LRKLILPFYSFTLFCIILPLTMFIPEAE---LPVWVICYIPIFMSLLNILPAPKSFP 551 >UniRef50_C9B2I5 Cellulose synthase catalytic subunit n=2 Tax=Enterococcus casseliflavus RepID=C9B2I5_ENTCA Length = 627 Score = 240 bits (612), Expect = 1e-61, Method: Composition-based stats. Identities = 76/402 (18%), Positives = 148/402 (36%), Gaps = 15/402 (3%) Query: 5 IVSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGE 64 ++F + LV + + S L+ + + + + V G + +++ + Sbjct: 34 TIAFLFIYLVWRLFFTIPIDSSASLVFGIILYIAEVIGLF--VYGFFVYLFSNKLSKEYR 91 Query: 65 NAPAPQLKDNPSISIIIPCFNEE-KNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILD 123 + PS++ I +NE+ K V T A A +Y N ++ +DG R L Sbjct: 92 YEGDEAEEFLPSVAAFICTYNEDIKLVIATALAVKALKYPNKKIYICDDGH----REELK 147 Query: 124 RMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYN 183 MA ++ + KA + + S+ ++ +D D ++ ++ V+ N Sbjct: 148 EMAERMNLGYLTREGNANAKAGNINYALSQTSSDLILLLDADFIVKKNIIFEAVD-YFRN 206 Query: 184 PRVGAVTGNPRIRTRST--LVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSA 241 P+V V + L+ K E + ++ + + G A RRSA Sbjct: 207 PKVALVQYPQTFYNKDPFQLLRKSMYNEQELFMRFLEPALSRENALIHI-GTNAIIRRSA 265 Query: 242 LAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGA 301 L E+G ITED+ LQ + + +A + P T K L QR RWAQG Sbjct: 266 LEEIGGVPTSSITEDMATGMLLQDAGYETIFINKAYALGITPYTAKELTSQRTRWAQGTK 325 Query: 302 EVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAA 361 ++F + R+ + Y + ++ FT I + +I + Sbjct: 326 QIF-DHFKP--RRLKGLSFMQKLCYYNSYLYWFTSFQKIIFLLAPTLFMVFDIFIVRSNN 382 Query: 362 THTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPV 403 L ++ L + + + +S ++ P+ Sbjct: 383 HQLLLFFLPPFIMISLSFRLYV-PKIRNLTSSHIYDCFVAPI 423 >UniRef50_A5GQ06 Glycosyltransferase of family GT2; possible processive Glycosyltransferase n=1 Tax=Synechococcus sp. RCC307 RepID=A5GQ06_SYNR3 Length = 453 Score = 239 bits (611), Expect = 1e-61, Method: Composition-based stats. Identities = 81/430 (18%), Positives = 166/430 (38%), Gaps = 20/430 (4%) Query: 13 LVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLK 72 ++ + C+ + + P F + + G + + R R ++ P Sbjct: 31 ALIVVLACLWAGAAPHWLGPVRGLLPAFGLTLMLGGYGFSTLRRGRSSAAAVSSAPPAGA 90 Query: 73 DNPSISIIIPCFNEEKNVEETIHAALAQRYE--NIEVIAVNDGSTDKTRAILDRMAAQIP 130 P+I +++ +EE+ + + A Y +++ ++DGS D+T IL++++A Sbjct: 91 VVPAIDVLVAARDEEQVIARLVQAIAGLEYPADRLKLWVMDDGSNDRTPEILNQLSATHS 150 Query: 131 HLRVIHLAQN--QGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGA 188 L+V +N GK+ AL + ++L+ +D DA L D + +P L++ +GA Sbjct: 151 FLQVRRRPRNAGGGKSGALNALLPQLQGDWLLVLDADAALTPDVLLRL-QPQLHDSALGA 209 Query: 189 VTGN-PRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGY 247 V + S + + Q E + ++ + G + + G RR L + G Sbjct: 210 VQLRKAVVGAESNWLTRSQALEMA-FDAEMQLGRIAQGGIGELRGNGQLIRRQVLLDCGG 268 Query: 248 WSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKN 307 +++D +T+D+D+S++L L + + LW+QR RWA+GG + FL Sbjct: 269 FNEDTVTDDLDLSFRLLLAGQQVSVLWNPVVDEEPVMRWGALWRQRQRWAEGGLQRFLDY 328 Query: 308 MTRLWR-KENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAG 366 L+ + + R + L VG + + PL L+ Sbjct: 329 WPALFSDRLSGRQQLDLIVFFLLQYGLPLATVGDVFGMIWWRQWPLLWPLS--------- 379 Query: 367 ILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKK 426 + TL L + E +++ ++ + V + + Sbjct: 380 --VSTLSLSALALWRSGRRHSEGPELPEATGWNLLVANTYLIHWF-LVIPWVAVRMALRP 436 Query: 427 QRARWVSPDR 436 +R W Sbjct: 437 KRLVWAKTVH 446 >UniRef50_B4UAH9 Glycosyl transferase family 2 n=4 Tax=Anaeromyxobacter RepID=B4UAH9_ANASK Length = 1140 Score = 239 bits (611), Expect = 1e-61, Method: Composition-based stats. Identities = 82/427 (19%), Positives = 145/427 (33%), Gaps = 23/427 (5%) Query: 10 ILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAP 69 + L + + ++ ++ + + RH P Sbjct: 430 VRYLAWRAGATLNFVEPLGPLLSLSLLAAEAYALSTV--ALLAVQVGLRHRARPPEPPLG 487 Query: 70 QLKDNPSISIIIPCFNEEKNV-EETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQ 128 + +PS+ +++P ++E ++ ++T+ A A RY V+ +D D+ + A+ Sbjct: 488 PEEPHPSVDVMVPIYSEPLDILDKTLAACAAMRYPRKTVVVCDDSHRDEVARLAAEHGAR 547 Query: 129 IPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGA 188 + + KA L A E +V D D + V P +PR+G Sbjct: 548 Y-----LAGPRRHAKAGNLNAALARTDGELVVVFDTDHVPTAAFLERTV-PCFRDPRLGV 601 Query: 189 VTGNPRIRTRSTLVGKI-----QVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALA 243 V + E I+ + +G F V G A FRR ALA Sbjct: 602 VQTPHHFYNPDVFQRALGAPAAVPNEADLFNHAIQGGRDRWGGAFFV-GSGAVFRREALA 660 Query: 244 EVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEV 303 VG + ITEDI S KL W + L L E L QR RW G ++ Sbjct: 661 SVGGFHLLSITEDIHTSQKLHARGWRTRFVDEDLAAGLSAEDLASYVVQRRRWMLGCLQI 720 Query: 304 FLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATH 363 F ++ L R R Y + + F L + +A LA + ++ Sbjct: 721 FFRDNPLLQRGLPLRHR---VGYLASLWYFFFPLARLVFFATPLAYLLFHLHPLFAELPV 777 Query: 364 TAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLM 423 L+ L S ++ +L+ ++ + + L+ AT + R + Sbjct: 778 LLAYLVPHLVAAPLAASALV-----PGWPRALWGSLYEGAVAFPLARATLDLLLPRRLGF 832 Query: 424 PKKQRAR 430 + Sbjct: 833 KVTPKGV 839 >UniRef50_Q7NDK6 Glr4229 protein n=1 Tax=Gloeobacter violaceus RepID=Q7NDK6_GLOVI Length = 492 Score = 239 bits (610), Expect = 2e-61, Method: Composition-based stats. Identities = 73/362 (20%), Positives = 133/362 (36%), Gaps = 16/362 (4%) Query: 41 MSIMWIV-GGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALA 99 ++ +W+ G+ + R P P +++ +P FNE + A A Sbjct: 5 LAALWLFTYGINAYWLTARRGAPPPV--PPSPGRWPLVTVQLPVFNELYVCRRLLAATCA 62 Query: 100 QRYEN--IEVIAVNDGSTDKTRAILDRMAAQIPH----LRVIHLAQNQG-KAIALKTGAA 152 Y + + ++D STD+T +L + + +H + G KA AL Sbjct: 63 LDYPREALHIQVLDD-STDETVQLLAAAIEEQRRLGFSIEHLHRKERHGFKAGALAAATP 121 Query: 153 AAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRS-TLVGKIQVGEYS 211 A EY+ D D L D + + RVG V +L+ ++Q Sbjct: 122 LANGEYIAIFDADFLPPPDWLKRAL-VHFADGRVGLVQTRWGHTNPGYSLLTRLQALGID 180 Query: 212 SIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIF 271 + ++ + G F +G +R+ A+ G W D + ED+D+S++ QL W Sbjct: 181 GHFAVEQQARCANGYYFNFNGTAGVWRKRAIEAGGGWQADTLAEDLDLSYRSQLAGWKAV 240 Query: 272 YEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTI 331 Y+ R + +P ++ Q+ RWA+G + K + R+ M L LT Sbjct: 241 YDGRIVAPAELPVSMAAYKMQQYRWAKGSIQCARKLLGRVMDCGGSPMQKLQAALHLTGY 300 Query: 332 WAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNL 391 +V ++ +V L V + + G L+ + L + Sbjct: 301 AVHPLMVWIVLCSVPLLTVSW---VGQHPLSLVWGTLMVPATFGPPTLYLAARRDLDPRG 357 Query: 392 TS 393 Sbjct: 358 WR 359 >UniRef50_Q3ATR2 Glycosyltransferases probably involved in cell wall biogenesis-like n=3 Tax=Chlorobium RepID=Q3ATR2_CHLCH Length = 503 Score = 239 bits (610), Expect = 2e-61, Method: Composition-based stats. Identities = 77/427 (18%), Positives = 155/427 (36%), Gaps = 9/427 (2%) Query: 9 FILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMS---IMWIVGGVYFWVYRERHWPWGEN 65 ++ L+ Y+ F FF + + +F++ R + Sbjct: 16 LLIVLLFTYLAIRIYYTINAPFSTIEFILGFFFLCSEVFILYHSFFFFIDILREALYDAE 75 Query: 66 APAPQLKDNPSISIIIPCFNEE-KNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDR 124 + + S++I++P +E + VE T+ + YEN V ++D S +K + Sbjct: 76 PKRKEPGKDASVAILVPARHEPREVVENTLLTCINIAYENKTVYLLDDSSIEKFKQEARE 135 Query: 125 MAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNP 184 ++ ++ + KA + K +Y+V D D + +V M + Sbjct: 136 VSKKLGAKIFTRVGNRGAKAGIVNDILKTLKEKYVVIFDADQNPMPNFLRTLVPLMEGDD 195 Query: 185 RVGAVTGNPRIRTRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALA 243 ++ + + V + + I + + +F G FRR AL Sbjct: 196 KLAFIQTPQFYSNGDASPVAMTSHNQQAIFYEYICLGKSLNHAMFC-CGTNVIFRREALQ 254 Query: 244 EVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEV 303 +VGY+ +D +TEDI S +L W + +A + + PE L +KQ+ RWA G +++ Sbjct: 255 DVGYFDEDTVTEDIATSLRLHAKGWKSMFLYKAYTFGMAPEDLASYFKQQNRWALGTSQL 314 Query: 304 FLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATH 363 K + + + EY ++T + F F + A + + NI+ +I Sbjct: 315 LRKFIHLFFTNIKALKPVQWIEYSISTTYYFIGWAYFFLMAGPVMYIFFNIKSYNIDPLS 374 Query: 364 TAGILLCTLCLLQFIVSLMIENRYEHN---LTSSLFWIIWFPVIFWMLSLATTLVSFTRV 420 + L L + + + L + + PV + + Sbjct: 375 YVFTFIPFLVLSNIVFFQGMARKSYTRLNMLKVQMLTALSMPVYLSATVIGVLNLDKKGF 434 Query: 421 MLMPKKQ 427 + PK Sbjct: 435 QVTPKGG 441 >UniRef50_P74165 Sll1377 protein n=9 Tax=Cyanobacteria RepID=P74165_SYNY3 Length = 479 Score = 239 bits (610), Expect = 2e-61, Method: Composition-based stats. Identities = 88/439 (20%), Positives = 162/439 (36%), Gaps = 26/439 (5%) Query: 10 ILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAP 69 +++ I V H V+ +S+ + + P Sbjct: 48 AAVMLMAIWTVVITLHYWVWGSWLVWALTGALSL--------QALRLMKATPEEAPPLLT 99 Query: 70 QLK---DNPSISIIIPCFNEEKNVEETIHAALAQRYE--NIEVIAVNDGSTDKTRAILDR 124 P + +++ NEE + + + + Y EV V+D STD+T AILD+ Sbjct: 100 GDASTVPYPQVCLMVAAKNEEAVIGKIVQQLCSLDYPGDRHEVWIVDDNSTDRTPAILDQ 159 Query: 125 MAAQIPHLRVIHLAQNQGKAIA--LKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLY 182 + Q P L+V+ + L A + + + D DA + +D +V P Sbjct: 160 LRQQYPQLKVVRRGAGASGGKSGALNEVLAQTQGDIVGVFDADANVPKDLLRRVV-PYFA 218 Query: 183 NPRVGAVTGNPRIRTRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSA 241 +P GA+ I + + Q E + ++ + + G + + G R A Sbjct: 219 SPTFGALQVRKAIANEAVNFWTRGQGAEMAL-DAYFQQQRIVTGGIGELRGNGQFVARQA 277 Query: 242 LAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGA 301 L VG W++ IT+D+D++ +L L+QW + T LW QR RWA+GG Sbjct: 278 LDAVGGWNEQTITDDLDLTIRLHLHQWKVGILVNPPVEEEGVTTAIALWHQRNRWAEGGY 337 Query: 302 EVFLKNMTRLWRKENFRMWPL-FFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIA 360 + +L + + L F + L T V ++ A+ PL L+++A Sbjct: 338 QRYLDYWRWICTQPMGWKKKLDLFSFLLMQYLLPTAAVPDLLMALWQRRFPLLTPLSYLA 397 Query: 361 ATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRV 420 + + L L + L ++ I+ ++ T ++F Sbjct: 398 IGFSCWGMYYGLKRLTPSEGESPWQQMPALLARTIGGTIYMFHWLIIMPAVTARMAF--- 454 Query: 421 MLMPKKQRARWVSPDRGIL 439 + +R +WV G Sbjct: 455 ----RPKRLKWVKTVHGAA 469 >UniRef50_B2IZC1 Glycosyl transferase, family 2 n=1 Tax=Nostoc punctiforme PCC 73102 RepID=B2IZC1_NOSP7 Length = 726 Score = 239 bits (610), Expect = 2e-61, Method: Composition-based stats. Identities = 60/407 (14%), Positives = 121/407 (29%), Gaps = 21/407 (5%) Query: 5 IVSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGE 64 ++ + + + H ++ + +S ++ E Sbjct: 99 LLFLATRYFTWRTVVTLNFSHPASIVFSLTIYILEVISFFSCCLYTLQTIWSTSKLRSAE 158 Query: 65 ----NAPAPQLKDNPSISIIIPCFNEEKNV-EETIHAALAQRYENIEVIAVNDGSTDKTR 119 + + PS+ +++P +NE + T+ A Y N + ++D + Sbjct: 159 ANQYSQAVLSTQYLPSVDVLVPTYNEPDYIVRRTVIGCQAMEYLNKTIYILDDTRRPHIK 218 Query: 120 AILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEP 179 + + + + + KA L E +V ID D + ++ V Sbjct: 219 ELAEELGCEY----ITRPDNTHAKAGNLNNALKYIGGELIVPIDADFVPFKNFLTRTVG- 273 Query: 180 MLYNPRVGAVTGNPRIRT-----RSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVI 234 NP V R+ + + GL + + + +V G Sbjct: 274 FFQNPETSLVQTPQYFYNPDYHARNLGLENFLPNDLEHFYGLQQSNRDVGNSVIC-CGSS 332 Query: 235 AAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRL 294 RRS L +G + ED S ++ + + + + L T QRL Sbjct: 333 YVVRRSCLDAIGGYYTRCCVEDFQTSLRMLTHGFRLIFLNEVLSTGESTRTYTDYIDQRL 392 Query: 295 RWAQGGAEVFLKNMT-RLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLN 353 RW QG +V+ +W K N+ I+ F + + L L Sbjct: 393 RWLQGNFQVYFCGDDIPIWSKLNWIQKSYSISQL---IYCFQSVFRAVFILAPLCSAYLG 449 Query: 354 IELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIW 400 I + LL + N + ++ I+ Sbjct: 450 ISPFIATLPEILYYFMP-FWLLHIAIYGWASNYRVSYFWNEVYETIF 495 >UniRef50_Q8S7W0 Probable mannan synthase 4 n=4 Tax=Magnoliophyta RepID=CSLA4_ORYSJ Length = 549 Score = 238 bits (609), Expect = 2e-61, Method: Composition-based stats. Identities = 76/416 (18%), Positives = 150/416 (36%), Gaps = 36/416 (8%) Query: 18 PLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPA--------- 68 P+ + M + +MS++ +V ER + W Sbjct: 37 PVLQFAVWACMAMSVMLVLEVAYMSLVSLVAVKLLRRVPERRYKWEPITTGSGGVGGGDG 96 Query: 69 ---------PQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIA-VNDGSTDKT 118 + P + + IP +NE++ + +I AA A + +I V D STD Sbjct: 97 EDEEAATGGREAAAFPMVLVQIPMYNEKEVYKLSIGAACALTWPPDRIIIQVLDDSTDPA 156 Query: 119 RA-----ILDRMAAQIPHLRVIHLAQNQG-KAIALKTGAAAA---KSEYLVCIDGDALLD 169 A + +++ +G KA ALK G + +++ D D + Sbjct: 157 IKDLVELECKDWARKEINIKYEIRDNRKGYKAGALKKGMEHIYTQQCDFVAIFDADFQPE 216 Query: 170 RDAAAYIVEPMLYNPRVGAVTGNPRIRTRST-LVGKIQVGEYSSIIGLIKRTQRIYGNVF 228 D + +++NP++G V L+ +IQ + + + + F Sbjct: 217 SDFLLKTIPFLVHNPKIGLVQTRWEFVNYDVCLMTRIQKMSLDYHFKVEQESGSSMHSFF 276 Query: 229 TVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKG 288 +G +R SA+ E G W D ED+D++ + L W Y +P T K Sbjct: 277 GFNGTAGVWRVSAINEAGGWKDRTTVEDMDLAVRASLKGWQFLYVGDIRVKSELPSTFKA 336 Query: 289 LWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLA 348 Q+ RW G A +F K T + + + +W + + + + + Sbjct: 337 YRHQQHRWTCGAANLFRKMATEIAKNKGVSVWKK-LHLLYSFFFVRRVVAPILTFLFYCV 395 Query: 349 GVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVI 404 +PL++ + ++ + + T + + R ++ FWI++ V+ Sbjct: 396 VIPLSVMVPEVSIPVWGMVYIPTAITIMNAI------RNPGSIHLMPFWILFENVM 445 >UniRef50_A9CLP5 Curdlan Synthase n=5 Tax=Alphaproteobacteria RepID=A9CLP5_AGRT5 Length = 657 Score = 238 bits (608), Expect = 3e-61, Method: Composition-based stats. Identities = 69/394 (17%), Positives = 141/394 (35%), Gaps = 22/394 (5%) Query: 6 VSFFILCLVLCIPLCVAY-FHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPW-G 63 ++ + + +A + + + + FF +S++ + P Sbjct: 47 MALAVRYIWWRGTQTLAPPGFTLDCLFSWSFFALEALSVLSSLSAFTILTRTTDRSPEVE 106 Query: 64 ENAPAPQLKDNPSISIIIPCFNEE-KNVEETIHAALAQRYENIEVIAVNDGSTDKTRAIL 122 EN + P ++I+I +NEE +E TI A A + +++V+ ++DG D L Sbjct: 107 ENKSWWKPDPPPKVAILIATYNEEQAVLERTIVGAKASDHPDVQVLVLDDGRRD----WL 162 Query: 123 DRMAAQIPHLRVIHLAQNQG-KAIALKTGAAAAK-----SEYLVCIDGDALLDRDAAAYI 176 + +R + N+G KA + +++ +D D + D + Sbjct: 163 RDFCEEH-DVRYLRRPDNKGSKAGNINHALGVLAQDDQPPDFIAVLDADFVPHTDFVSRA 221 Query: 177 VEPMLYNPRVGAVTGNPRIRTRSTL-----VGKIQVGEYSSIIGLIKRTQRIYGNVFTVS 231 + + ++ VG V + + + E ++ ++ +G F Sbjct: 222 LA-LFHDQEVGLVQTPQHFFNPDPIQHNLGLSRSYPDEQRFFFDHLQPSRDGWGIAFC-C 279 Query: 232 GVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWK 291 G + R A+ G + D ITED ++ +LQ + Y AL L PE LK Sbjct: 280 GTSSVIRWRAVQASGGFPTDSITEDFMLTLQLQEYGYRTVYLNEALTEGLAPEGLKEYVT 339 Query: 292 QRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVP 351 QR RW G ++ + + + N R+ + W T L Sbjct: 340 QRARWCLGLMQI-ARGRLGPFSRNNLRLRDRWSVVDSVFFWLTTFTFRIASMTFPLFYWF 398 Query: 352 LNIELTHIAATHTAGILLCTLCLLQFIVSLMIEN 385 NI + + + + + F+++ + Sbjct: 399 FNITVVNASVPDVISYFGAYMVWILFVLNFISSG 432 >UniRef50_C1TQW8 Glycosyl transferase n=1 Tax=Dethiosulfovibrio peptidovorans DSM 11002 RepID=C1TQW8_9BACT Length = 444 Score = 238 bits (607), Expect = 4e-61, Method: Composition-based stats. Identities = 102/449 (22%), Positives = 183/449 (40%), Gaps = 31/449 (6%) Query: 7 SFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENA 66 F + L L PL + + FV F + IM I+G + + R PW Sbjct: 7 LFRAVSLFLDAPLSPETWQVLLKFIPFVLFLEMPVYIMIILGVLKHCLERMYQLPWR--- 63 Query: 67 PAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYE-NIEVIAVNDGS-----TDKTRA 120 PS+S ++ C++E V+ TI + Q Y I++I + DG+ T + Sbjct: 64 ----SDYYPSVSCLVTCYSEGDAVKHTIRSLARQIYPGRIQIIPIVDGAALNGETFRAAK 119 Query: 121 ILDRMAAQIPH--LRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVE 178 ++ A++ + L+V+ Q G+ AL TG A E ++ +DGD D D Sbjct: 120 SMESAVARLSNRSLQVVPKWQRGGRVSALNTGMNFADGEIVMALDGDTSFDNDMVERATR 179 Query: 179 PMLYNPRVGAVTGNPRIRT-RSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAF 237 + VG V+G R+R RS LV ++Q EY I K G V +SG F Sbjct: 180 -HFEDRSVGCVSGCLRVRNARSGLVTRLQAIEYFLSIQAAKTGLSALGVVNNISGAFGVF 238 Query: 238 RRSALAEVGYWSDDMITEDIDISWKLQLN-----QWTIFYEPRALCWILMPETLKGLWKQ 292 RRS L +G W ED+D++ +++ + I ++P A+ + +P+T + +KQ Sbjct: 239 RRSVLDLIGGWDAGT-AEDLDLTLRVKNYFGRYDGFRIIFDPEAMGFTDVPDTFRDFFKQ 297 Query: 293 RLRWAQGGAEVFLKNMTRLWRKENFRM--WPLFFEYCLTTIWAFTCLVGFIIYAVQLAGV 350 RLRW ++ + WR + ++ W F T + + I+ V Sbjct: 298 RLRW---DGDLSFLYFRKHWRSFSPKLVGWKNFLALLWTGLLFQVVMPLLIVCYTAYLFV 354 Query: 351 PLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSL 410 + L ++ + L++ R + +L +++ WI+ P+ + + Sbjct: 355 AYPLSFAVWITVLVYLFYLAVTAVMFLLFVLLLSERPKEDL-ATMPWIVLSPLFAFACRV 413 Query: 411 ATTLVSFTRVMLMPKKQR--ARWVSPDRG 437 + ++ + A W + Sbjct: 414 YGAFATLWELLGRGHRDTNMAPWWVTKKS 442 >UniRef50_Q2W2R8 Cellulose synthase catalytic subunit n=5 Tax=Rhodospirillaceae RepID=Q2W2R8_MAGSA Length = 885 Score = 237 bits (606), Expect = 5e-61, Method: Composition-based stats. Identities = 69/375 (18%), Positives = 130/375 (34%), Gaps = 20/375 (5%) Query: 50 VYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEK-NVEETIHAALAQRYENIEVI 108 V + + P Q K P +SI +PC+NE V+ T+ + +A Y N E+I Sbjct: 408 VTELTWASKFKRRFTALPPDQCKRFPKVSIHLPCYNEPPAMVKLTLDSLMALDYPNFEII 467 Query: 109 AVNDGST-DKTRAILDRMAAQ-IPHLRVIHLAQNQG-KAIALKTGAAAAK--SEYLVCID 163 +++ + ++ ++ ++ HLA G KA AL G +E + +D Sbjct: 468 VLDNNTKKEEVWRPVEEYCKTLGDKVKFFHLAPWPGAKAGALNFGLTVTDPEAEIIGVVD 527 Query: 164 GDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRT-RSTLVGKIQVGEYSSIIGLIKRTQR 222 D ++D++ +V P NP+VG V R L ++ EY+ + + Sbjct: 528 SDYMVDKNWLKGLV-PYFENPKVGHVQAPQDHREWEHDLFKEMINWEYAGFFDIGMVFRN 586 Query: 223 IYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILM 282 + G + R+ L + G W + I ED ++ ++ + Y L L+ Sbjct: 587 EADAIIQ-HGTMTLVRKKTLEDAGRWGEWCIVEDAELGLRMMKAGYQSVYVQDRLGHGLV 645 Query: 283 PETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRM-----------WPLFFEYCLTTI 331 P++ KQR RWA G ++ + L + + W +F + Sbjct: 646 PDSFMAYKKQRFRWAYGAVQILKAHWRSLIPFKQTGLTTGQKYHFVAGWLPWFADAFYLL 705 Query: 332 WAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNL 391 + L + V + + + L R Sbjct: 706 FCVAALAWSLGMIVAPRYFSTPLPFFTLPTVGVFVAKIFHHFFLYTTRVNCGLKRRSLAA 765 Query: 392 TSSLFWIIWFPVIFW 406 + + W Sbjct: 766 IAGMGLTYSIAWAMW 780 >UniRef50_UPI0001C31B0A glycosyl transferase family 2 n=1 Tax=Conexibacter woesei DSM 14684 RepID=UPI0001C31B0A Length = 495 Score = 237 bits (604), Expect = 8e-61, Method: Composition-based stats. Identities = 81/392 (20%), Positives = 150/392 (38%), Gaps = 19/392 (4%) Query: 34 VFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEET 93 + + +V G Y ++ H P+ P ++++IP +NE + T Sbjct: 8 ILIALVLAGAIPLVAGCYQFLLAGLHTFRVR---TPRAACYPRVALVIPAWNEAAVIGRT 64 Query: 94 IHAALAQRYE--NIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQG--KAIALKT 149 + LA Y + + V+D STD T ++ A + P + G KA + Sbjct: 65 LDRLLALEYPGDRLRIYVVDDASTDATPELVRAKAEEHPGRIFHLRRERGGEGKAHTINH 124 Query: 150 GAAAAK----SEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTR-STLVGK 204 G + E ++ ID D + + ++ L +P VGAVTG + +R + + + Sbjct: 125 GLRTIRAEGWYEAVLVIDADVIFTPPSLRWMTR-HLADPGVGAVTGYIKEGSRPGSWMTR 183 Query: 205 IQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGY-WSDDMITEDIDISWKL 263 EY + +R Q + G ++G RR +L VG + ED + + Sbjct: 184 FVAFEYVTAQAAARRAQNVLGAQACLAGGAQLLRRESLEAVGGEIDTATLAEDTVTTLGV 243 Query: 264 QLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLF 323 QL + +EP A+ W P T+ GLWKQRLRW +G +V + + + + + Sbjct: 244 QLAGGRVVFEPYAIVWAEEPRTVTGLWKQRLRWGRGNVQVTRRFRGVWLHRGSGSLGGVG 303 Query: 324 FEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMI 383 F + F+ + + A + + + L + V+L Sbjct: 304 FA-----LIWFSVFLMPVFMVTSSAALVTLWAIDRGQSIEVFRTLWFLNVVTYLFVTLSS 358 Query: 384 ENRYEHNLTSSLFWIIWFPVIFWMLSLATTLV 415 + + + FP + +L + L Sbjct: 359 FSIDAATARRTWREGLLFPGVVSLLIILYALY 390 >UniRef50_UPI0001C30EDA glycosyl transferase family 2 n=1 Tax=Conexibacter woesei DSM 14684 RepID=UPI0001C30EDA Length = 433 Score = 237 bits (604), Expect = 8e-61, Method: Composition-based stats. Identities = 90/395 (22%), Positives = 164/395 (41%), Gaps = 23/395 (5%) Query: 42 SIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQR 101 ++ I G V R P + P ++I++P FNEE V ++ + LA Sbjct: 39 ALFAIYGLVVVSYIVSRFVLSIPYRPPRDVGHEPHVAIVMPAFNEEAVVARSLRSLLALD 98 Query: 102 YE--NIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYL 159 Y +E++A++DGSTD+T A + +AA+ P +RVI L +NQGK A+ G A +E L Sbjct: 99 YPVDKLELVAIDDGSTDETLARMREVAAESPRVRVIELGRNQGKRAAMAAGMRATDAEIL 158 Query: 160 VCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRT-RSTLVGKIQVGEYSSIIGLIK 218 +D D+ LD DA +V+ P VGAV G+ + + + ++QV Y ++K Sbjct: 159 AFVDSDSSLDPDALHKLVQG-FAEPGVGAVCGHADVANVDDSWLTRMQVVRYFVAFQVVK 217 Query: 219 RTQRIYGNVFTVSGVIAAFRRSALAE-VGYW-------SDDMITEDIDISWKLQLNQWTI 270 ++ I+ V SG AA+RR A+ + +W S +D ++ + L W + Sbjct: 218 ASESIFACVTCCSGCFAAYRRDAILPHLDWWEHQTFLRSPATFGDDRALT-NVVLRDWKV 276 Query: 271 FYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTT 330 Y A+ ++P T++ +Q+LRW + L +WRK + Sbjct: 277 RYAANAVSHTVVPHTMRQFLRQQLRWKRSWTRESLILARFVWRKYPVAALSAYVG----- 331 Query: 331 IWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHN 390 + + V + + + A + L + + + RY+ Sbjct: 332 -----IAIALVAPIVAVRAIVWLPLVEGRGAPLVYLAGIYMLAVAYGLYYVACRRRYDPR 386 Query: 391 LTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPK 425 + + ++ + L + + Sbjct: 387 WVYGIAFCFFYLCFMLWQTYWAILTARSTSWGTRP 421 >UniRef50_A6CBT2 Glycosyl transferase, group 2 family protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CBT2_9PLAN Length = 424 Score = 237 bits (604), Expect = 9e-61, Method: Composition-based stats. Identities = 81/414 (19%), Positives = 147/414 (35%), Gaps = 33/414 (7%) Query: 29 LMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWG---ENAPAPQLKDNPSISIIIPCFN 85 +++ +F+ F+ G R+ E P +SIII + Sbjct: 4 VLIASLFWTSLFLIGFSYFGYPLVIWLLARNKSRQQDTEEREPLDTNSLPKVSIIIAAYR 63 Query: 86 EEKNVEETIHAALAQRYE--NIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGK 143 EE + E ++ Y +E++ DG+ D T ++D +R+I Q +GK Sbjct: 64 EEAVILERLNNLARLDYPTSKLEILIGCDGNEDLTGELVDSYG--DDQIRLIQFEQRRGK 121 Query: 144 AIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVG 203 A L A E LV D + +D +V + G V G + T G Sbjct: 122 ASVLNDCVPLASGEILVFSDANTNMDPQCIRQLVR-HFQDESTGCVCGQLILEDVET--G 178 Query: 204 KIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMITEDIDISWKL 263 K G Y +K + G V V+G I A ++S V D I +D I ++ Sbjct: 179 KNVDGLYWKYENFLKHCETRLGAVLGVNGAIYALKKSLYQPV---PADTIIDDFLIGMRV 235 Query: 264 QLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLF 323 L + Y+ A +++ +K+R+R G + K++ L + F Sbjct: 236 HLAGQRLIYDESAFAVEESATSVQAEFKRRIRIGTGAFQSL-KHLKGLLHPRYGSIAFAF 294 Query: 324 FEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMI 383 + + L + + + + L PL + A + L L + + + + Sbjct: 295 WSHKLLRWFCPVFMALAFLSNLFLLNQPL----------YQATLSLQILFYISAFIGMKL 344 Query: 384 ENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRG 437 L + P +F ++LA + R + +Q W +R Sbjct: 345 NGSSR------LLKLCRVPGMFVQMNLALG-IGLYRWLFT--RQSGIWERTERS 389 >UniRef50_B0C9B5 Glycosyl transferase, group 2 family protein, putative n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0C9B5_ACAM1 Length = 709 Score = 236 bits (603), Expect = 1e-60, Method: Composition-based stats. Identities = 63/433 (14%), Positives = 132/433 (30%), Gaps = 29/433 (6%) Query: 1 MINRIVSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHW 60 ++ + L+ + ++ +F + + R Sbjct: 82 LVFVLFFVQARYLLWRATHTLVLKGPEAVLCIGLFAYELLSILNSWTNNTLILRSPNRSS 141 Query: 61 PWGENAPAPQLKDNPSISIIIPCFNEE-KNVEETIHAALAQRYEN--IEVIAVNDGSTDK 117 + + Q PS+ I+IP +NE K +E T+ + +Y + ++DG Sbjct: 142 EADQYSQWVQKTFKPSVDILIPTYNESSKMLERTLIGCKSLKYPKQQKHIWLLDDGQRPD 201 Query: 118 TRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIV 177 +A+ + + + KA L S+ +V D D + + Sbjct: 202 IQALAQKWDCNY----LTRPINDHAKAGNLNHALQHVNSDIVVVFDADFVPLNHFLERTI 257 Query: 178 EPMLYNPRVGAVTGNPRIRTRS----TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGV 233 + + V + L G+ E + +I+ + V SG Sbjct: 258 GFFKADDNLSMVVTPQKFYNPDPPQKNLGGQFITDEQTDFYQVIQPARDSVNAVVC-SGS 316 Query: 234 IAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQR 293 +RR L +G D I ED +Q + Y L + KQR Sbjct: 317 SVLYRRQHLDAIGGIPLDTIVEDYATGILMQAQGYKTVYLNELLSVGCAANNINEYLKQR 376 Query: 294 LRWAQGGAE-VFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPL 352 +RWA+G V ++ + + F Y T++ ++ + + L + Sbjct: 377 MRWAEGTLRMVISQHNPL---RLPGLTFMQRFTYASGTLYWLEEMLKTLSFLAPLFYLFF 433 Query: 353 NIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLAT 412 + I + L ++S M + + +I+ +L Sbjct: 434 GFKSIDITLGEASTYTFYNYSLSIIVMSWMRGS-------------VLLLIIYNLLQGFH 480 Query: 413 TLVSFTRVMLMPK 425 ++ L + Sbjct: 481 IFKVILKIFLYSQ 493 >UniRef50_Q7XIF5 Probable mannan synthase 7 n=9 Tax=Magnoliophyta RepID=CSLA7_ORYSJ Length = 585 Score = 236 bits (603), Expect = 1e-60, Method: Composition-based stats. Identities = 84/431 (19%), Positives = 164/431 (38%), Gaps = 22/431 (5%) Query: 15 LCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWV--YRERHWPWGENAPAPQLK 72 + P +M + F S + + W +R + P Sbjct: 87 VIAPTLQVAVWVCMVMSVMLVVEATFNSAVSLGVKAIGWRPEWRFKWEPLAGADEEKGRG 146 Query: 73 DNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIA-VNDGSTDKTRA-----ILDRMA 126 + P + + IP +NE + + +I AA ++ ++I V D STD + A Sbjct: 147 EYPMVMVQIPMYNELEVYKLSIGAACELKWPKDKLIVQVLDDSTDPFIKNLVELECESWA 206 Query: 127 AQIPHLRVIHLAQNQG-KAIALKTGAA---AAKSEYLVCIDGDALLDRDAAAYIVEPMLY 182 ++ +++ + + +G KA ALK G + EY+ D D + + V +++ Sbjct: 207 SKGVNIKYVTRSSRKGFKAGALKKGMECDYTKQCEYIAIFDADFQPEPNFLLRTVPFLMH 266 Query: 183 NPRVGAVTGNPRIRTRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSA 241 NP V V + +L+ ++Q + + + F+ +G +R +A Sbjct: 267 NPNVALVQARWAFVNDTTSLLTRVQKMFFDYHFKVEQEAGSATFAFFSFNGTAGVWRTTA 326 Query: 242 LAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGA 301 + E G W D ED+D++ + LN W Y +P T +Q+ RWA GGA Sbjct: 327 INEAGGWKDRTTVEDMDLAVRASLNGWKFIYVGDIRVKSELPSTYGAYCRQQFRWACGGA 386 Query: 302 EVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAA 361 +F K + ++ + F+ +++F + + V A V NI + Sbjct: 387 NLFRKIAMDVLVAKDISLLKKFY-----MLYSFFLVRRVVAPMV--ACVLYNIIVPLSVM 439 Query: 362 THTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVI--FWMLSLATTLVSFTR 419 I + + + + ++ R NL FWI++ V+ M + T L+ + Sbjct: 440 IPELFIPIWGVAYIPMALLIITTIRNPRNLHIMPFWILFESVMTVLRMRAALTGLMELSG 499 Query: 420 VMLMPKKQRAR 430 ++ Sbjct: 500 FNKWTVTKKIG 510 >UniRef50_C9KN46 Chitin synthase n=1 Tax=Mitsuokella multacida DSM 20544 RepID=C9KN46_9FIRM Length = 628 Score = 236 bits (603), Expect = 1e-60, Method: Composition-based stats. Identities = 92/444 (20%), Positives = 160/444 (36%), Gaps = 59/444 (13%) Query: 2 INRIVSFFILCLVLCIPL----CVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRE 57 + + SFF+ +V I L V YF + + F+ ++ G Y Sbjct: 176 MLAVASFFLAVIVFVIVLMRAESVIYFQFNRWLYLYSIIAATFLLTRYLFGSFY------ 229 Query: 58 RHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYE--NIEVIAVNDGST 115 P + II+PCFNEEK ++ TI + Q Y +EVI V+D S Sbjct: 230 -------RPTKIDPDYTPGVMIIVPCFNEEKWIQHTILGCINQDYPIDKLEVIVVDDCSN 282 Query: 116 DKTR----AILDRMAAQIP---------HLRVIHLAQNQGKAIALKTGAAAAKSEYLVCI 162 D + I++R+ L N+GK A+ GA AK E LV + Sbjct: 283 DHSVDKIKEIIERLKQSDGDQKMYRVEDRLHYYVQPVNKGKREAMAVGAHMAKHELLVFV 342 Query: 163 DGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRT-RSTLVGKIQVGEYSSIIGLIKRTQ 221 D D+ LD A IV+P + ++G V+G + + + K+Q Y ++K + Sbjct: 343 DSDSFLDPYAVRNIVQP-FKDKKMGGVSGRTDVANTYTNSLTKMQAVRYYIAFRIMKAAE 401 Query: 222 RIYGNVFTVSGVIAAFRRSALAEVGYWSDD----------MITEDIDISWKLQLNQWTIF 271 + V +SG ++ +R+ + + Y D +D ++ + + Sbjct: 402 GYFDAVTCLSGPLSCYRKDLVLK--YCDDWLNQKFLGQRATFGDDRSMTNFILRH-HRTT 458 Query: 272 YEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTI 331 Y+ A+C ++P++ K +Q++RW + + +W+KE F + + Sbjct: 459 YQDTAVCMTIVPKSHKMFLRQQMRWKRSWLRESIIAARYMWKKEPFMALSFYMGLLVPIA 518 Query: 332 WAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTA------------GILLCTLCLLQFIV 379 L I + PL + + I CL V Sbjct: 519 APIIVLYNLIYIPIMHRVFPLTFLVGMLMMALLMSMAQLFLRRSTTWIFGVWFCLYYEAV 578 Query: 380 SLMIENRYEHNLTSSLFWIIWFPV 403 L S + P Sbjct: 579 LLWQMPVAWFTFWKSTWGTRLTPA 602 >UniRef50_UPI00017891C0 glycosyl transferase family 2 n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI00017891C0 Length = 450 Score = 236 bits (603), Expect = 1e-60, Method: Composition-based stats. Identities = 98/453 (21%), Positives = 180/453 (39%), Gaps = 61/453 (13%) Query: 35 FFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISIIIPCFNEEKNVEETI 94 +F+ + + +Y ++ N+ + SI++++P +NEE + I Sbjct: 3 YFFYGLAGVFLVFQALYTFIPLLCSKVKKLNSDLAEK----SITVLVPAYNEELTIRNCI 58 Query: 95 HAALAQRYENIEVIAVNDGSTDKTRAILDRM------------------------AAQIP 130 A RY N E++ +NDGS D T + LD + + + Sbjct: 59 DAMAGLRYSNYEIMIINDGSKDGTLSALDELLDLEPSARTADSKLNYKSIKGFYRSNRYR 118 Query: 131 HLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVT 190 H+ VI N GKA +L G A S+ ++ +D D++L+ D+ Y V ++ + A+ Sbjct: 119 HIYVID-KMNGGKADSLNAGIDYAVSDIVITLDADSMLEADSLKY-VNQYFHDDDIIALG 176 Query: 191 GNPRIRT--------------RSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAA 236 G +I R + K Q+ Y+ + K TQ + ++ +SG A Sbjct: 177 GTVKIVQGAVRDKNGAVVEKFRGKGLIKSQIINYTHGFYVRKLTQSYFNSIVVISGAFGA 236 Query: 237 FRRSALAEVGYWSDDMITEDIDISWKL------QLNQWTIFYEPRALCWILMPETLKGLW 290 F + L V + + EDIDI+ K+ + + Y P A+C+ PE L+ + Sbjct: 237 FYKEILVHVDGFR-STVGEDIDITLKIHEYIKAHRLKKKLVYAPEAVCYTECPENLQNFY 295 Query: 291 KQRLRWAQGGAEVFLKNMTRLWRKENFRMWPLFF--EYCLTTIWAFTCLVGFIIYAVQLA 348 KQR+RW + + L +RL RK + + F + L T+ AFT + V Sbjct: 296 KQRIRWQKAFIDCILIYWSRLSRKFSVSVSLFFAIDGFMLGTLSAFTTIFYLGQTLVVGG 355 Query: 349 GVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWML 408 + I ++ A ++ +L L + S + Y S ++ + + + Sbjct: 356 NLWHAIIFLLMSVVLNAAQIVISLYLCRKYGSTYAFSDYVRMFLFSQAELLTYRNLLLYI 415 Query: 409 SLATTLVSFTRVMLMPKKQRARWVSPDRGILRG 441 ++A T F W +R + Sbjct: 416 NIAGTFKYFDN--------DEGWGFVERKGVAA 440 >UniRef50_C6QAR9 Glycosyl transferase family 2 n=1 Tax=Hyphomicrobium denitrificans ATCC 51888 RepID=C6QAR9_9RHIZ Length = 506 Score = 236 bits (603), Expect = 1e-60, Method: Composition-based stats. Identities = 73/435 (16%), Positives = 140/435 (32%), Gaps = 12/435 (2%) Query: 8 FFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAP 67 + + + ++ V PF M + V++ V R+ G Sbjct: 67 VMLAIPAIFTAALILVDEQALILWWVVLALPFLMIATVRLVAVWYVVRRQPKHWRGPLDD 126 Query: 68 APQLKDNPSISIIIPCFNEEKNVEETIHAALAQRYE--NIEVIAVNDGSTDKTR-AILDR 124 P+ S+++P + EE V + A Y +E++ + + TR A+L Sbjct: 127 RRFDARLPTFSVLVPVYKEEAVVPGLVAAMRRIDYPPDRVEILFITEEHDQPTRQALLQS 186 Query: 125 MAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLY-N 183 AQ + + Q K AL A + D + + DRD + Sbjct: 187 NLAQNMRVLTVPAGHPQTKPRALNFALQEAGGILVAVYDAEDIPDRDQLRRAAAAFVAGG 246 Query: 184 PRVGAVTGNPRIRTRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSAL 242 PR+ V I + + EY ++ + + G FRR L Sbjct: 247 PRLACVQAQLTIYNAKQSFFSRQFALEYKALFSGLLPALAFLKLPIPLGGTSNHFRRDLL 306 Query: 243 AEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAE 302 + G W +TED D+ ++ + + R+ P + QR RW +G + Sbjct: 307 RKCGGWDPFNVTEDADLGIRIARLGYDVAVI-RSETSEEAPTEWRTWCGQRTRWIKGWIQ 365 Query: 303 VFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAAT 362 +L +M R F + + ++ + V + ++ A Sbjct: 366 TYLVHMRHPLRLWRDLGTWQFIGFQIMIGGMILSILVHPWFYVLIVNKAMSGAALMPAGA 425 Query: 363 HTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRVML 422 I ++ + + ++ S L+ IW P I+W+ + ++ Sbjct: 426 ALQWI-FSAHLMIGYGAAWLLTIVTARGSISGLWAAIWLP-IYWLAISWAAYRAVIDLIF 483 Query: 423 MPKKQRARWVSPDRG 437 P W G Sbjct: 484 RPFH----WEKTAHG 494 >UniRef50_Q886Q3 Glycosyl transferase, group 2 family protein n=2 Tax=Pseudomonas syringae pv. tomato RepID=Q886Q3_PSESM Length = 842 Score = 236 bits (602), Expect = 1e-60, Method: Composition-based stats. Identities = 76/444 (17%), Positives = 161/444 (36%), Gaps = 19/444 (4%) Query: 5 IVSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGE 64 V F L + + ++Y M + +++ + I G+ + + E W + Sbjct: 331 TVLIFAALYTLLVGVGISYAQPLS-MWVALPIALVWVTSLLIGTGIQGYEFLESCWGPEK 389 Query: 65 NAPAPQ----LKDNPSISIIIPCFNEEKN-VEETIHAALAQRYENIEVIAVNDGSTD-KT 118 P P +SI +PC+NE + V+ T+ A Y N EV+ +++ + D + Sbjct: 390 PRSFPPLRAYPGPLPKVSIHVPCYNEPPDMVKLTLDALQRLDYPNFEVLIIDNNTQDPEV 449 Query: 119 RAILDRMAAQ-IPHLRVIHLAQNQG-KAIALKTGAAAA--KSEYLVCIDGDALLDRDAAA 174 +++ Q P R+ H+ G K+ AL +E + ID D + R Sbjct: 450 WEPIEQYCRQLGPRFRLFHVNPLSGFKSGALNYLLDYTAKDAEIVAAIDADYCVHRHWLK 509 Query: 175 YIVEPMLYNPRVGAVTGNPRIRT-RSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGV 233 ++ P P + + R +L + EY + + + + G Sbjct: 510 HM-APYFACPDIAVIQVPQDYRDGDDSLFKRCCQAEYRVFFNIGMVIRNDHDAIIQ-HGT 567 Query: 234 IAAFRRSALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQR 293 + R S L + W++ I ED ++ ++ N ++ Y + L+P+T KQR Sbjct: 568 MTLIRNSVLQRL-RWAEWSICEDAELGLRILENGFSTGYVAISYGKGLIPDTFMDFKKQR 626 Query: 294 LRWAQGGAEVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLN 353 RWA G ++ ++ L + P+ + + + A+ + + Sbjct: 627 YRWAYGVIQILKRHTGSLIAGTCEALTPIQRYHFIAGWMPWIAGGINYFLAIAVLLWSMA 686 Query: 354 IEL----THIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLS 409 + + + LL L + + ++ + II ++ ++ Sbjct: 687 MIIQPDTLEPVPWIFSSSLLLMFVLGVCKAISLYQRLASTDIKDAFAAIIASMALYSVVG 746 Query: 410 LATTLVSFTRVMLMPKKQRARWVS 433 A +FT + + + S Sbjct: 747 KAVLSSAFTSGLPFFRTPKQTSGS 770 >UniRef50_Q7PC76 Glucomannan 4-beta-mannosyltransferase 1 n=53 Tax=Embryophyta RepID=CSLA1_ORYSJ Length = 521 Score = 235 bits (601), Expect = 2e-60, Method: Composition-based stats. Identities = 83/436 (19%), Positives = 157/436 (36%), Gaps = 28/436 (6%) Query: 15 LCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHW--PWGENAPAPQLK 72 + +PL M +F +M+++ + R+ P ++ P Sbjct: 22 VIVPLLRLAVAVCLTMSVLLFLERMYMAVVISGVKILRRRPDRRYRCDPIPDDDPELGTS 81 Query: 73 DNPSISIIIPCFNEEKNVEETIHAALAQRYENIEVIA-VNDGSTDK-----TRAILDRMA 126 P + I IP FNE + + +I A + + ++ V D STD R +R A Sbjct: 82 AFPVVLIQIPMFNEREVYQLSIGAVCGLSWPSDRLVVQVLDDSTDPVIKEMVRIECERWA 141 Query: 127 AQIPHLRVIHLAQNQG-KAIALKTGAAAA---KSEYLVCIDGDALLDRDAAAYIVEPMLY 182 + ++ +G KA ALK G + EY+ D D D D + +++ Sbjct: 142 HKGVNITYQIRENRKGYKAGALKEGMKHGYVRECEYVAIFDADFQPDPDFLRRTIPFLVH 201 Query: 183 NPRVGAVTGNPRIRTRST-LVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSA 241 N + V R L+ ++Q + + F +G +R SA Sbjct: 202 NSDIALVQARWRFVNADECLMTRMQEMSLDYHFTVEQEVSSSVCAFFGFNGTAGVWRVSA 261 Query: 242 LAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGA 301 + E G W D ED+D++ + L W Y +P T K Q+ RW+ G A Sbjct: 262 VNEAGGWKDRTTVEDMDLAIRASLKGWKFVYLGDVQVKSELPSTFKAFRFQQHRWSCGPA 321 Query: 302 EVFLKNMTRLWRKENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQL-AGVPLNIELTHIA 360 +F K + + R + +W + + + ++ I+ +P I + + Sbjct: 322 NLFRKMLMEIVRNKKVTIWKKI--HVIYNFFLIRKIIAHIVTFAFYCLIIPATIFVPEVR 379 Query: 361 ATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIWFPVIFWMLSLATTLVSFTRV 420 + + T+ ++L+ + FWI++ V + TL+ Sbjct: 380 IPKWGCVYIPTI------ITLLNSVGTPRSFHLLFFWILFENV-MSLHRTKATLIGLLEA 432 Query: 421 MLMPKKQRARWVSPDR 436 + WV ++ Sbjct: 433 -----GRANEWVVTEK 443 >UniRef50_C1ZM23 Glycosyl transferase n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZM23_PLALI Length = 523 Score = 235 bits (601), Expect = 2e-60, Method: Composition-based stats. Identities = 71/426 (16%), Positives = 152/426 (35%), Gaps = 24/426 (5%) Query: 30 MMRFVFFWPFFMSIMWIVGGVYFWVYRERHW-----PWGENAPAPQLKDNPSISIIIPCF 84 M+ + ++++ V + R+W + P + P ++I +P + Sbjct: 1 MVWLIVAGLIAVTLLTFVMTIAQLDLYRRYWLSVLRKDRQREVPPLPESLPRVTIQLPIY 60 Query: 85 NEEKNVEETIHAALAQRYEN--IEVIAVNDGSTDKTRAI----LDRMAAQIP--HLRVIH 136 NE V + AA Y + +++ ++D STD I + + + P +++ H Sbjct: 61 NESPVVHRLLEAASRIDYPHNLLQIQVLDD-STDDCSKILVDKVAEIQQRDPSLNIQYRH 119 Query: 137 LAQNQG-KAIALKTGAAAAKSEYLVCIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRI 195 G KA L G A E++ D D + D + N + V Sbjct: 120 RIDRTGYKAGNLDEGTTWATGEFMAIFDADFVPKPDYLQQTIR-YFQNEEIAIVQSRWGH 178 Query: 196 RTRS-TLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRSALAEVGYWSDDMIT 254 ++V ++Q + + +R + +G +R+ + + G W Sbjct: 179 LNPDSSIVTRVQQFFLDGHLSVEQRGRGDSDLFLIYNGSAGIWRKQVIVDCGGWMTTAAI 238 Query: 255 EDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRK 314 ED+D+S++ QL I Y +P+++ L Q RW +G ++ +K + ++W+ Sbjct: 239 EDVDMSYRAQLRGKKIVYLEDYTTPGELPDSMIALRLQLFRWWKGNLQIAIKYIRQVWQS 298 Query: 315 ENFRMWPLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCL 374 + + L + T ++ + + LAG I + + L + L Sbjct: 299 DYPLIKKL---HATTHLFGP-LMSAVTFANIILAGAVPLIVTWYPETRYWLASTLLGVAL 354 Query: 375 LQFIVSLMIENRYEHN---LTSSLFWIIWFPVIFWMLSLATTLVSFTRVMLMPKKQRARW 431 + + + R + II + +L + ++ W Sbjct: 355 IPVLFLVYGTGRIRFGEGSRWQKILGIIPLGSMLMVLHSGLSCQHTVSAFEAFFVKKNVW 414 Query: 432 VSPDRG 437 V +G Sbjct: 415 VVTPKG 420 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.315 0.160 0.457 Lambda K H 0.267 0.0490 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 2,782,942,791 Number of Sequences: 3077464 Number of extensions: 150298798 Number of successful extensions: 669352 Number of sequences better than 1.0e-01: 250 Number of HSP's better than 0.1 without gapping: 15008 Number of HSP's successfully gapped in prelim test: 8716 Number of HSP's that attempted gapping in prelim test: 605161 Number of HSP's gapped (non-prelim): 29503 length of query: 441 length of database: 1,040,396,356 effective HSP length: 132 effective length of query: 309 effective length of database: 634,171,108 effective search space: 195958872372 effective search space used: 195958872372 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (20.8 bits) S2: 95 (40.9 bits)